LOCUS AB000115 2058 bp mRNA PRI 23-OCT-1997 DEFINITION Homo sapiens mRNA expressed in osteoblast, complete cds. ACCESSION AB000115 NID g2564034 KEYWORDS GS3686. SOURCE Homo sapiens cancellous bone osteoblast cDNA to mRNA, clone:GS3686. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2058) AUTHORS Ohno,I., Hashimoto,J., Takaoka,K., Ochi,T., Okubo,K. and Matsubara,K. TITLE The cloning of a cDNA for novel genes expressed in human osteoblast JOURNAL Unpublished (1996) REFERENCE 2 (bases 1 to 2058) AUTHORS Ohno,I. TITLE Direct Submission JOURNAL Submitted (26-DEC-1996) to the DDBJ/EMBL/GenBank databases. Ikko Ohno, Institute for Molecular and Cellular Biology, Osaka University, Molecular Genetics; 1-3 Yamada-oka, Suita, Osaka 565, Japan (E-mail:ikko@imcb.osaka-u.ac.jp, Tel:81-6-879-7992, Fax:81-6-877-1922) FEATURES Location/Qualifiers source 1..2058 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="osteoblast" /clone="GS3686" /tissue_type="cancellous bone" CDS 242..1483 /note="The submitters designated this product as GS3686" /codon_start=1 /db_xref="PID:d1019799" /db_xref="PID:g1769802" /translation="MVERCSRQGCTITMAYIDYNMIVAFMLGNYINLRESSTEPNDSL WFSLQKKNDTTEIETLLLNTAPKIIDEQLVCRLSKTDIFIICRDNKIYLDKMITRNLK LRFYGHRQYLECEVFRVEGIKDNLDDIKRIIKAREHRNRLLADIRDYRPYADLVSEIR ILLVGPVGSGKSSFFNSVKSIFHGHVTGQAVVGSDTTSITERYRIYSVKDGKNGKSLP FMLCDTMGLDGAEGAGLCMDDIPHILKGCMPDRYQFNSRKPITPEHSTFITSPSLKDR IHCVAYVLDINSIDNLYSKMLAKVKQVHKEVLNCGIAYVALLTKVDDCSEVLQDNFLN MSRSMTSQSRVMNVHKMLGIPISNILMVGNYASDLELDPMKDILILSALRQMLRAADD FLEDLPLEETGAIERALQPCI" polyA_site 2058 /note="18 A nucleotides" BASE COUNT 628 a 392 c 413 g 625 t ORIGIN 1 gcacgaggaa gccacagatc tcttaagaac tttctgtctc caaaccgtgg ctgctcgata 61 aatcagacag aacagttaat cctcaattta agcctgatct aacccctaga aacagatata 121 gaacaatgga agtgacaaca agattgacat ggaatgatga aaatcatctg cgcaactgct 181 tggaaatgtt tctttgagtc ttctctataa gtctagtgtt catggaggta gcattgaaga 241 tatggttgaa agatgcagcc gtcagggatg tactataaca atggcttaca ttgattacaa 301 tatgattgta gcctttatgc ttggaaatta tattaattta cgtgaaagtt ctacagagcc 361 aaatgattcc ctatggtttt cacttcaaaa gaaaaatgac accactgaaa tagaaacttt 421 actcttaaat acagcaccaa aaattattga tgagcaactg gtgtgtcgtt tatcgaaaac 481 ggatattttc attatatgtc gagataataa aatttatcta gataaaatga taacaagaaa 541 cttgaaacta aggttttatg gccaccgtca gtatttggaa tgtgaagttt ttcgagttga 601 aggaattaag gataacctag acgacataaa gaggataatt aaagccagag agcacagaaa 661 taggcttcta gcagacatca gagactatag gccctatgca gacttggttt cagaaattcg 721 tattcttttg gtgggtccag ttgggtctgg aaagtccagt tttttcaatt cagtcaagtc 781 tatttttcat ggccatgtga ctggccaagc cgtagtgggg tctgatacca ccagcataac 841 cgagcggtat aggatatatt ctgttaaaga tggaaaaaat ggaaaatctc tgccatttat 901 gttgtgtgac actatggggc tagatggggc agaaggagca ggactgtgca tggatgacat 961 tccccacatc ttaaaaggtt gtatgccaga cagatatcag tttaattccc gtaaaccaat 1021 tacacctgag cattctactt ttatcacctc tccatctctg aaggacagga ttcactgtgt 1081 ggcttatgtc ttagacatca actctattga caatctctac tctaaaatgt tggcaaaagt 1141 gaagcaagtt cacaaagaag tattaaactg tggtatagca tatgtggcct tgcttactaa 1201 agtggatgat tgcagtgagg ttcttcaaga caacttttta aacatgagta gatctatgac 1261 ttctcaaagc cgggtcatga atgtccataa aatgctaggc attcctattt ccaatatttt 1321 gatggttgga aattatgctt cagatttgga actggacccc atgaaggata ttctcatcct 1381 ctctgcactg aggcagatgc tgcgggctgc agatgatttt ttagaagatt tgcctcttga 1441 ggaaactggt gcaattgaga gagcgttaca gccctgcatt tgagataagt tgccttgatt 1501 ctgacatttg gcccagcctg tactggtgtg ccgcaatgag agtcaatctc tattgacagc 1561 ctgcttcaga ttttgctttt gttcgttttg ccttctgtcc ttggaacagt catatctcaa 1621 gttcaaaggc caaaacctga gaagcggtgg gctaagatag gtcctactgc aaaccacccc 1681 tccatatttc cgtaccattt acaattcagt ttctgtgaca tctttttaaa ccactggagg 1741 aaaaatgaga tattctctaa tttattcttc tataacactc tatatagagc tatgtgagta 1801 ctaatcacat tgaataatag ttataaaatt attgtataga catctgcttc ttaaacagat 1861 tgtgagttct ttgagaaaca gcgtggattt tacttatctg tgtattcaca gagcttagca 1921 cagtgcctgg taatgagcaa gcatacttgc cattactttt ccttcccact ctctccaaca 1981 tcacattcac tttaaatttt tctgtatata gaaaggaaaa ctagcctggg caacatgatg 2041 aaaccccatc tccactgc // LOCUS AB000220 5176 bp mRNA PRI 10-JAN-1997 DEFINITION Human mRNA for semaphorin E, complete cds. ACCESSION AB000220 NID g1777306 KEYWORDS semaphorin E. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5176) AUTHORS Yamada,T. TITLE Human semaphorin E homologue JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 5176) AUTHORS Yamada,T. TITLE Direct Submission JOURNAL Submitted (04-JAN-1997) to the DDBJ/EMBL/GenBank databases. Tesshi Yamada, National Cancer Center Research Institute, Pathology Division; 1-1 Tsukiji 5-chome, Chuo-ku, Tokyo 104, Japan (Tel:+81-3-3542-2511(ex.4206), Fax:+81-3-3248-2737) FEATURES Location/Qualifiers source 1..5176 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 563..2413 /codon_start=1 /product="semaphorin E" /db_xref="PID:d1019814" /db_xref="PID:g1777307" /translation="MAFRTICVLVGVFICSICVKGSSQPQARVYLTFDELRETKTSEY FSLSHHPLDYRILLMDEDQDRIYVGSKDHILSLNINNISQEALSVFWPASTIKVEECK MAGKDPTHGCGNFVRVIQTFNRTHLYVCGSGAFSPVCTYLNRGRRSEDQVFMIDSKCE SGKGRCSFNPNVNTVSVMINEELFSGMYIDFMGTDAAIFRSLTKRNAVRTDQHNSKWL SEPMFVDAHVIPDGTDPNDAKVYFFFKEKLTDNNRSTKQIHSMIARICPNDTGGLRSL VNKWTTFLKARLVCSVTDEDGPETHFDELEDVFLLETDNPRTTLVYGIFTTSSSVFKG SAVCVYHLSDIQTVFNGPFAHKEGPNHQLISYQGRIPYPRPGTCPGGAFTPNMRTTKE FPDDVVTFIRNHPLMYNSIYPIHKRPLIVRIGTDYKYTKIAVDRVNAADGRYHVLFLG TDRGTVQKVVVLPTNNSVSGELILEELEVFKNHAPITTMKISSKKQQLYVSSNEGVSQ VSLHRCHIYGTACADCCLARDPYCAWDGHSCSRFYPTGKRRSRRQDVRHGNPLTQCRG FNLKAYRNAAEIVQYGVKITPLFWSVPPSLRRHLSSGCYRKTKTGGKRLS" polyA_site 5176 /note="10 A nucleotides" BASE COUNT 1588 a 989 c 1052 g 1547 t ORIGIN 1 ggactgcgaa aggagcaggg ttgcggagct agggctccag cctgcggccg cgcattcttg 61 cgtctggcca gccgcgagct ctaagggtcg gccccgcccg gtccgccccc gcggctccct 121 gccaggctct cgcgggcgcg ctcggggtgg ggcctcgcgg ctggcggaga tgcggccggg 181 gctgcgcggt ggtgatgcga gcctgctggg cggcgcgccg gggcagccgg agccgcgcgc 241 cgcggcgctg taatcggaca ccaagagcgc tcgcccccgg cctccggcca ctttccattc 301 actccgaggt gcttgattga gcgacgcgga gaagagctcc gggtgccgcg gcactgcagc 361 gctgagattc ctttacaaag aaactcagag gaccgggaag aaagaatttc acctttgcga 421 cgtgctagaa aataaggtcg tctgggaaaa ggactggaga cacaagcgca tccaaccccg 481 gtagcaaact gatgactttt ccgtgctgat ttctttcaac ctcggtattt tcccttggat 541 attaacttgc atatctgaag aaatggcatt ccggacaatt tgcgtgttgg ttggagtatt 601 tatttgttct atctgtgtga aaggatcttc ccagccccaa gcaagagttt atttaacatt 661 tgatgaactt cgagaaacca agacctctga atacttcagc ctttcccacc atcctttaga 721 ctacaggatt ttattaatgg atgaagatca ggaccggata tatgtgggaa gcaaagatca 781 cattctttcc ctgaatatta acaatataag tcaagaagct ttgagtgttt tctggccagc 841 atctacaatc aaagttgaag aatgcaaaat ggctggcaaa gatcccacac acggctgtgg 901 gaactttgtc cgtgtaattc agactttcaa tcgcacacat ttgtatgtct gtgggagtgg 961 cgctttcagt cctgtctgta cttacttgaa cagagggagg agatcagagg accaagtttt 1021 catgattgac tccaagtgtg aatctggaaa aggacgctgc tctttcaacc ccaacgtgaa 1081 cacggtgtct gttatgatca atgaggagct tttctctgga atgtatatag atttcatggg 1141 gacagatgct gctatttttc gaagtttaac caagaggaat gcggtcagaa ctgatcaaca 1201 taattccaaa tggctaagtg aacctatgtt tgtagatgca catgtcatcc cagatggtac 1261 tgatccaaat gatgctaagg tgtacttctt cttcaaagaa aaactgactg acaataacag 1321 gagcacgaaa cagattcatt ccatgattgc tcgaatatgt cctaatgaca ctggtggact 1381 gcgtagcctt gtcaacaagt ggaccacttt cttaaaggcg aggctggtgt gctcggtaac 1441 agatgaagac ggcccagaaa cacactttga tgaattagag gatgtgtttc tgctggaaac 1501 tgataacccg aggacaacac tagtgtatgg catttttaca acatcaagct cagttttcaa 1561 aggatcagcc gtgtgtgtgt atcatttatc tgatatacag actgtgttta atgggccttt 1621 tgcccacaaa gaagggccca atcatcagct gatttcctat cagggcagaa ttccatatcc 1681 tcgccctgga acttgtccag gaggagcatt tacacccaat atgcgaacca ccaaggagtt 1741 cccagatgat gttgtcactt ttattcggaa ccatcctctc atgtacaatt ccatctaccc 1801 aatccacaaa aggcctttga ttgttcgtat tggcactgac tacaagtaca caaagatagc 1861 tgtggatcga gtgaacgctg ctgatgggag ataccatgtc ctgtttctcg gaacagatcg 1921 gggtactgtg caaaaagtgg ttgttcttcc tactaacaac tctgtcagtg gcgagctcat 1981 tctggaggag ctggaagtct ttaagaatca tgctcctata acaacaatga aaatttcatc 2041 taaaaagcaa cagttgtatg tgagttccaa tgaaggggtt tcccaagtat ctctgcaccg 2101 ctgccacatc tatggtacag cctgtgctga ctgctgcctg gcgcgggacc cttattgcgc 2161 ctgggatggc cattcctgtt ccagattcta cccaactggg aaacggagga gccgaagaca 2221 agatgtgaga catggaaacc cactgactca atgcagagga tttaatctaa aagcatacag 2281 aaatgcagct gaaattgtgc agtatggagt aaaaataaca ccacttttct ggagtgtgcc 2341 cccaagtctc cgcaggcatc tatcaagtgg ctgttacaga aagacaaaga caggaggaaa 2401 gaggttaagc tgaatgaacg aataatagcc acttcacagg gactcctgat ccgctctgtt 2461 cagggttctg accaaggact ttatcactgc attgctacag aaaatagttt caagcagacc 2521 atagccaaga tcaacttcaa agttttagat tcagaaatgg tggctgttgt gacggacaaa 2581 tggtccccgt ggacctgggc cagctctgtg agggctttac ccttccaccc gaaggacatc 2641 atgggggcat tcagccactc agaaatgcag atgattaacc aatactgcaa agacactcgg 2701 cagcaacatc agcagggaga tgaatcacag aaaatgagag gggactatgg caagttaaag 2761 gccctcatca atagtcggaa aagtagaaac aggaggaatc agttgccaga gtcataatat 2821 tttcttatgt gggtcttatg cttccattaa caaatgctct gtcttcaatg atcaaatttt 2881 gagcaaagaa acttgtgctt taccaagggg aattactgaa aaaggtgatt actcctgaag 2941 tgagttttac acgaactgaa atgagcatgc attttcttgt atgatagtga ctagcactag 3001 acatgtcatg gtcctcatgg tgcatataaa tatatttaac ttaacccaga ttttatttat 3061 atctttattc accttttctt caaaatcgat atggtggctg caaaactaga attgttgcat 3121 ccctcaattg aatgagggcc atatccctgt ggtattcctt tcctgctttg gggctttaga 3181 attctaattg tcagtgattt tgtatatgaa aacaagttcc aaatccacag cttttacgta 3241 gtaaaagtca taaatgcata tgacagaatg gctatcaaaa gaaatagaaa aggaagacgg 3301 catttaaagt tgtataaaaa cacgagttat tcataaagag aaaatgatga gtttttatgg 3361 ttccaatgaa atatcttccc ctttttttaa gattgtaaaa ataatcagtt actggtatct 3421 gtcactgacc tttgtttcct tattcaggaa gataaaaatc agtaacctac cccatgaaga 3481 tatttggtgg gagttatatc agtgaagcag tttggtttat attcttatgt tatcaccttc 3541 caaacaaaag cacttacttt ttttggaagt tatttaattt attttagact caaagaatat 3601 aatcttgcac tactcagtta ttactgtttg ttctcttatt ccctagtctg tgtggcaaat 3661 taaacaatat aagaaggaaa aatttgaagt attagacttc taaataaggg gtgaaatcat 3721 cagaaagaaa aatcaaagta gaaactacta attttttaag aggaatttat aacaaatatg 3781 gctagttttc aacttcagta ctcaaattca atgattcttc cttttattaa aaccagtctc 3841 agatatcata ctgattttta agtcaacact atatatttta tgatcttttc agtgtgatgg 3901 caaggtgctt gttatgtcta gaaagtaaga aaacaatatg aggagacatt ctgtctttca 3961 aaaggtaatg gtacatacgt tcactggtct ctaagtgtaa aagtagtaaa ttttgtgatg 4021 aataaaataa ttatctccta attgtatgtt agaataattt tattagaata atttcatact 4081 gaaattattt tctccaaata aaaattagat ggaaaaatgt gaaaaaaatt attcatgctc 4141 tcatatatat tttaaaaaca ctacttttgc ttttttattt accttttaag acattttcat 4201 gcttccaggt aaaaacagat attgtaccat gtacctaatc caaatatcat ataaacattt 4261 tatttatagt taataatcta tgatgaaggt aattaaagta gattatggcc tttttaagta 4321 ttgcagtcta aaacttcaaa aactaaaatc attgtcaaaa ttaatatgat tattaatcag 4381 aatatcagat atgattcact atttaaacta tgataaatta tgataatata tgaggaggcc 4441 tcgctatagc aaaaatagtt aaaatgctga cataacacca aacttcattt tttaaaaaat 4501 ctgttgttcc aaatgtgtat aattttaaag taatttctaa agcagtttat tataatggtt 4561 tgcctgctta aaaggtataa ttaaacttct tttctcttct acattgacac acagaaatgt 4621 gtcaatgtaa agccaaaacc atcttctgtg tttatggcca atctattctc aaagttaaaa 4681 gtaaaattgt ttcagagtca cagttccctt tatttcacat aagcccaaac tgatagacag 4741 taacggtgtt tagttttata ctatatttgt gctatttaat tctttctatt ttcacaatta 4801 ttaaattgtg tacactttca ttacttttaa aaatgtagaa attcttcatg aacataactc 4861 tgctgaatgt aaaagaaaat tttttttcaa aaatgctgtt aatgtatact actggtggtt 4921 gattggtttt attttatgta gcttgacaat tcagtgactt aatatctatt ccatttgtat 4981 tgtacataaa attttctaga aatacacttt tttccaaagt gtaagtttgt gaatagattt 5041 tagcatgatg aaactgtcat aatggtgaat gttcaatctg tgtaagaaaa caaactaaat 5101 gtagttgtca cactaaaatt taattggata ttgatgaaat cattggcctg gcaaaataaa 5161 acatgttgaa ttcccc // LOCUS AB000263 368 bp mRNA PRI 23-APR-1997 DEFINITION Human mRNA for prepro cortistatin like peptide, complete cds. ACCESSION AB000263 NID g2055231 KEYWORDS prepro cortistatin like peptide. SOURCE Homo sapiens Brain cDNA to mRNA, clone:phCSP6. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 368) AUTHORS Fukusumi,S. TITLE Direct Submission JOURNAL Submitted (06-JAN-1997) to the DDBJ/EMBL/GenBank databases. Shoji Fukusumi, Takeda Chemical Ind. Ltd., Discovery Reserch Laboratories I; Wadai 10, Tsukuba, Ibaraki 300-42, Japan (Tel:0298-64-5039, Fax:0298-64-5000) REFERENCE 2 (bases 1 to 368) AUTHORS Fukusumi,S. TITLE Identification and characterization of a human novel cortistatin like peptide JOURNAL Unpublished (1997) REFERENCE 3 (sites) AUTHORS Fukusumi,S., Kitada,C., Takekawa,S., Kizawa,H., Sakamoto,J., Miyamoto,M., Hinuma,S., Kitano,K. and Fujino,M. TITLE Identification and characterization of a novel human cortistatin-like peptide JOURNAL Biochem. Biophys. Res. Commun. 232 (1), 157-163 (1997) MEDLINE 97236300 FEATURES Location/Qualifiers source 1..368 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="phCSP6" /tissue_type="Brain" CDS 6..323 /codon_start=1 /product="prepro cortistatin like peptide" /db_xref="PID:d1020551" /db_xref="PID:g2055232" /translation="MPLSPGLLLLLLSGATATAALPLEGGPTGRDSEHMQEAAGIRKS SLLTFLAWWFEWTSQASAGPLIGEEAREVARRQEGAPPQQSARRDRMPCRNFFWKTFS SCK" BASE COUNT 79 a 123 c 105 g 61 t ORIGIN 1 acaagatgcc attgtccccc ggcctcctgc tgctgctgct ctccggggcc acggccaccg 61 ctgccctgcc cctggagggt ggccccaccg gccgagacag cgagcatatg caggaagcgg 121 caggaataag gaaaagcagc ctcctgactt tcctcgcttg gtggtttgag tggacctccc 181 aggccagtgc cgggcccctc ataggagagg aagctcggga ggtggccagg cggcaggaag 241 gcgcaccccc ccagcaatcc gcgcgccggg acagaatgcc ctgcaggaac ttcttctgga 301 agaccttctc ctcctgcaaa taaaacctca cccatgaatg ctcacgcaag tttaattaca 361 gacctgaa // LOCUS AB000276 2393 bp mRNA PRI 06-NOV-1997 DEFINITION Homo sapiens mRNA for DAP-1 beta, complete cds. ACCESSION AB000276 NID g2588975 KEYWORDS DAP-1 beta. SOURCE Homo sapiens tissue_lib:brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Satoh,K., Yanai,H., Senda,T., Kohu,K., Nakamura,T., Okumura,N., Matsumine,A., Kobayashi,S., Toyoshima,K. and Akiyama,T. TITLE DAP-1, a novel protein that interacts with the guanylate kinase-like domains of hDLG and PSD-95 JOURNAL Genes Cells 2 (6), 415-424 (1997) MEDLINE 97431353 REFERENCE 2 (bases 1 to 2393) AUTHORS Satoh,K. TITLE Direct Submission JOURNAL Submitted (07-JAN-1997) to the DDBJ/EMBL/GenBank databases. Kiyotoshi Satoh, Institute for Microbial Diseases, Osaka University, Department of Oncogene Research; 3-1 Yamadaoka, Suita 565, Japan (E-mail:satokiyo@biken.osaka-u.ac.jp, Tel:+81-6-879-8304, Fax:+81-6-879-8305) FEATURES Location/Qualifiers source 1..2393 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_lib="brain" CDS 20..2047 /codon_start=1 /product="DAP-1 beta" /db_xref="PID:g2588976" /translation="MNLIFHKDILFGIPANKVPQDEWTGYTPRGKDDEIPCRRMRSGS YIKAMGDEDSGDSDTSPKPSPKVAARRESYLKATQPSLTELTTLKISNEHSPKLQIRS HSYLRAVSEVSINRSLDSLDPAGLLTSPKFRSRNESYMRAMSTISQVSEMEVNGQFES VCESVFSELESQAVEALDLPMPGCFRMRSHSYVRAIEKGCSQDDECVSLRSSSPPRTT TTVRTIQSSTVSSCITTYKKTPPPVPPRTTTKPFISITAQSSTESAQDAYMDGQGQRG DIISQSGLSNSTESLDSMKALTAAIEAANAQIHGPASQHMGNNTATVTTTTTIATVTT EDRKKDHFKKNRCLSIGIQVDDAEEPDKTGENKAPSKFQSVGVQVEEEKCFRRFTRSN SVTTAVQADLDFHDNLENSLESIEDNSCPGPMARQFSRDASTSTVSIQGSGNHYHACA ADDDFDTDFDPSILPPPDPWIDSITEDPLEAVQRSVCHRDGHWFLKLLQAERDRMEGW CQQMEREERENNLPEDILGKIRTAVGSAQLLMAQKFYQFRELCEENLNPNAHPRPTSQ DLAGFWDMLQLSIENISMKFDELHQLKANNWKQMDPLDKKERRAPPPVPKKPAKGPAP LIRERSLESSQRQEARKRLMAAKRAASVRQNSATESAESIEIYIPEAQTRL" BASE COUNT 597 a 739 c 611 g 446 t ORIGIN 1 agtgcagctg agatgaataa tgaacttaat tttccataaa gacattctgt ttggcattcc 61 agctaataag gttccacaag atgaatggac agggtacacc ccacgaggta aagatgatga 121 aattccatgc cgaagaatgc ggagtggcag ttatatcaag gccatggggg atgaagacag 181 tggagactca gacacgagtc ctaagccttc tccaaaagtt gctgcgcgga gagaaagcta 241 tctcaaggct actcagccat cccttacaga actcaccaca ctcaaaatct ccaatgaaca 301 ctcacccaaa ctccagatcc ggagtcatag ttacctgagg gcagtgagtg aagtctccat 361 caaccggagc ctggacagcc tggaccctgc aggcttgctc acatcaccaa agttccgctc 421 caggaatgag agctacatgc gagccatgag caccatcagc caggtgagcg agatggaagt 481 gaacgggcag ttcgagtccg tgtgcgagtc cgtgttcagc gagctggagt cgcaggccgt 541 ggaagcgctg gacctgccca tgcccggctg cttccgcatg cggagccaca gctatgtgcg 601 ggccattgag aaaggctgct cccaggacga cgagtgcgtg tccctgaggt cgtcctcgcc 661 gccgcgcacc accaccaccg ttaggaccat ccagagcagc acggtgtcat cttgcattac 721 aacatataag aagacaccac ctccagtccc acccagaact accacgaaac ctttcatttc 781 tatcacagcc cagagtagca cagagtcagc ccaggatgcc tacatggacg gacagggcca 841 gcgaggagat attatcagcc agtctggact cagcaactcc accgagagcc tggacagtat 901 gaaggctctg acagccgcca tcgaagctgc aaacgcccag atccatggcc ctgccagtca 961 acacatgggc aataacactg ccactgtcac caccacgact accatagcca ccgtcaccac 1021 ggaggacagg aagaaggacc actttaagaa aaatcgatgc ctgtctatcg ggatacaggt 1081 ggatgatgct gaagaacctg acaaaacagg ggagaataaa gcacccagta agttccagtc 1141 cgtgggagtg caagtagaag aagagaagtg cttccgcagg ttcactcgat ccaacagtgt 1201 gacgacagca gtacaggccg acctggactt ccatgataat ctggaaaatt ctctggaatc 1261 tatagaggac aattcgtgtc ctggccccat ggccagacag ttctcccgcg atgccagcac 1321 ctccacagtc agcattcagg gctcaggaaa ccattaccat gcctgtgccg ccgatgatga 1381 ctttgacacg gattttgacc cctctattct gcctcctccg gacccctgga ttgactctat 1441 cactgaagac cctctggagg ccgtgcaaag gtcagtgtgc caccgggatg gccactggtt 1501 cctgaagctt ctccaggcag agcgagaccg catggagggg tggtgtcaac agatggagcg 1561 ggaagaacgg gaaaacaacc tgcccgaaga cattctagga aaaatccgaa ccgcagtggg 1621 cagtgcccaa cttctcatgg cccagaaatt ctaccagttc agagaactgt gtgaagaaaa 1681 cctgaatcct aatgctcatc caagacccac ctcccaggat ttggcggggt tttgggacat 1741 gctgcagttg tccatagaaa atattagtat gaaatttgat gaacttcatc agttaaaggc 1801 caataattgg aaacagatgg atcctcttga caagaaggag agaagggccc ctcctccagt 1861 gccaaagaag ccggcgaagg gccccgcgcc gctgatccgg gagcgctcgc tggagagctc 1921 gcagcgccag gaggcccgca agcgcctgat ggccgccaag cgcgccgcgt ccgtccgcca 1981 gaactcggcc accgagagcg ccgagagcat cgagatctac atccccgagg cgcagacccg 2041 gctctgagcg ccccgcagcc cggccgccgc cgccaagcat ctgtcccctc ctcccccggc 2101 cgctcctctg ccggctgcct ctccccctcc gagcccgtcc gctcccgagc tcggtgactt 2161 ccactgtcgc ggtgtagttg tccacctcgc aggagccgcc ccccgggccc ccctcagccc 2221 cccacttccc gtacccgttt gcccatctcc ttcttcaccg agcttcgccc cctgtcctga 2281 tgccgtcgcc ctgcctcata ctgagatcca accctttatt ttctgggcaa agccaaaccc 2341 acctgtgtag aagtgatgcc tttaggtcac ccgccgtcct cagttctctc gag // LOCUS AB000360 2582 bp DNA PRI 17-OCT-1997 DEFINITION Homo sapiens PIGC gene, complete cds. ACCESSION AB000360 NID g2547041 KEYWORDS PIGC; glycosylphosphatidylinositol-synthesis gene. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Hong,Y., Ohishi,K., Inoue,N., Endo,Y., Fujita,T., Takeda,J. and Kinoshita,T. TITLE Structures and chromosomal localizations of the glycosylphosphatidylinositol synthesis gene PIGC and its pseudogene PIGCP1 JOURNAL Genomics 44 (3), 347-349 (1997) MEDLINE 97468149 REFERENCE 2 (bases 1 to 2582) AUTHORS Hong,Y. TITLE Direct Submission JOURNAL Submitted (08-JAN-1997) to the DDBJ/EMBL/GenBank databases. Yeongjin Hong, Research Institute for Microbial Diseases, Immunoregulation; 3-1 Yamada-oka, Suita, Osaka 565, Japan (E-mail:kohishi@biken.osaka-u.ac.jp, Tel:81-6-879-8329, Fax:81-6-875-5233) FEATURES Location/Qualifiers source 1..2582 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1q23-q25" exon 808..2266 gene 1101..1994 /gene="PIGC" CDS 1101..1994 /gene="PIGC" /standard_name="glycosylphosphatidylinositol-synthesis gene" /codon_start=1 /db_xref="PID:d1023736" /db_xref="PID:g2547042" /translation="MYAQPVTNTKEVKWQKVLYERQPFPDNYVDRRFLEELRKNIHAR KYQYWAVVFESSVVIQQLCSVCVFVVIWWYMDEGLLAPHWLLGTGLASSLIGYVLFDL IDGGEGRKKSGQTRWADLKSALVFITFTYGFSPVLKTLTESVSTDTIYAMSVFMLLGH LIFFDYGANAAIVSSTLSLNMAIFASVCLASRLPRSLHAFIMVTFAIQIFALWPMLQK KLKACTPRSYVGVTLLFAFSAVGGLLSISAVGAVLFALLLMSISCLCSFYLIRLQLFK ENIHGPWDEAEIKEDLSRFLS" mutation 1896 /gene="PIGC" /replace="c" polyA_signal 2246..2251 mutation 2259 /replace="t" repeat_region 2331..2356 /rpt_unit=gt BASE COUNT 694 a 494 c 581 g 813 t ORIGIN 1 ggatccctgc tgcagagggg gtaacggtgt ctggcttgcc aagcaatatt tgttgtggtc 61 tatcatggaa gaaataaagt cgggcaatat gaattttttt tttctcaaat ttgccggatg 121 gctgtggtgt ttctgactct tagttttctc attgtgaaaa aggaatgatt atcttcttcg 181 atcctctcaa gagtttcctt gttttgagta gattgatagc tctttaaagg atgctaagct 241 cagctaatgg aagaagagtc tagtttcttt gaggctttga ttttggttaa actatagagc 301 tcataccttt ctgtatggtg cagcttacta ttgtctttgg attggtaact taaaaaatac 361 aaataacatg cctttgagaa ccaataaaaa ctatggatat tatccctata aatttacaca 421 aatccagata taagcatgca atgtgatata cctaagggat atgtgaacca ctgagttaag 481 aactgcttta gagggagata caatgtgaga cacaggcttt gggataagac tttggtttga 541 atcctggctc tgctctgtta ccttagggca aagttactta agcatcttga atctcagctt 601 ttttaccaaa gcaggactaa tactaactta caaggtggtg aggattaagt gaaagaagat 661 acataaggca cttagcacat agtaggtact caataagcga tagctaacag atgtctatta 721 ttattcaagg aattataatt ttcaaatctg aaatgcagtt ttaatgtccc ataaggtgac 781 taccacatac atttttctca gacttttagt aaactgagtt gatttgactt tatctcagta 841 ctactcttga cctttcacaa ctttcgtagg ttcacagtct ctctttttct aggaacttgg 901 ctgtgttgtc ctgcctcaga gacaaattca tctattgtag gcctagcccc tgcctttgaa 961 aacaaggaaa ggttggtaga acatcaacac agcatggaat ttccagggag gtctcatttc 1021 aaaacttcat aaagaacaag aaccacctgg acttctgtga gggcgatgat taaactggcc 1081 tgagtttgaa tgaaaggata atgtatgctc aacctgtgac taacaccaag gaggtcaagt 1141 ggcagaaggt cttgtatgag cgacagccct ttcctgataa ctatgtggac cggcgattcc 1201 tggaagagct ccggaaaaac atccatgctc ggaaatacca atattgggct gtggtatttg 1261 agtccagtgt ggtgatccag cagctgtgca gtgtttgtgt ttttgtggtt atctggtggt 1321 atatggatga gggtcttctg gccccccatt ggcttttagg gactggcctg gcttcttcac 1381 tgattgggta tgttttgttt gatctcattg atggaggtga agggcggaag aagagtgggc 1441 agacccggtg ggctgacctg aagagtgccc tagtcttcat tactttcact tatgggtttt 1501 caccagtgct gaagaccctt acagagtctg tcagcactga caccatctat gccatgtcag 1561 tcttcatgct gttaggccat ctcatctttt ttgactatgg tgccaatgct gccattgtat 1621 ccagcacact atccttgaac atggccatct ttgcttctgt atgcttggca tcacgtcttc 1681 cccggtccct gcatgccttc atcatggtga catttgccat tcagattttt gccctgtggc 1741 ccatgttgca gaagaaacta aaggcatgta ctccccggag ctatgtgggg gtcacactgc 1801 tttttgcatt ttcagccgtg ggaggcctac tgtccattag tgctgtggga gccgtactct 1861 ttgcccttct gctgatgtct atctcatgtc tgtgttcatt ctacctcatt cgcttgcagc 1921 tttttaaaga aaacattcat gggccttggg atgaagctga aatcaaggaa gacttgtcca 1981 ggttcctcag ttaaattagg acatccatta cattattaaa gcaagctgat agattagcct 2041 cctaactagt atagaactta aagacagagt tccattctgg aagcagcatg tcattgtggt 2101 aagagaatag agatcaaaac caaaaaaaat gaaccaaagg cttgggtggt gagggtgctt 2161 atcctttctg ttattttgta gatgaaaaaa ctttctgggg acctcttgaa ttacatgctg 2221 taacatatga agtgatgtgg tttctattaa aaaaataaca catccatcaa gttgtctcat 2281 gatttttcca taaacaggag gcagacagag gggcatgaag agtgaagtaa gtgtgtgtgt 2341 gtgtgtgtgt gtgtgtaaag tcacttcttt ctaccctttt caatgtgcta atgctctttt 2401 atttatctag ggctcaaatc ttagaacaca gggtgctatg ctcagttttg ttgcccaaga 2461 tcacagaatt ggttacttaa ccttgactca gagtttctac cttgttctta gggaagcata 2521 tcacaactaa ttgcaaagca gagtgtgatg tgtcacaata agcagaatgc tagggggaat 2581 tc // LOCUS AB000409 2617 bp mRNA PRI 09-MAY-1997 DEFINITION Human mRNA for MNK1, complete cds. ACCESSION AB000409 NID g2077824 KEYWORDS MNK1. SOURCE Homo sapiens cell_line:HeLa cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2617) AUTHORS Fukunaga,R. and Hunter,T. TITLE Direct Submission JOURNAL Submitted (11-JAN-1997) to the DDBJ/EMBL/GenBank databases. Rikiro Fukunaga, Osaka University Medical School, Department of Genetics; 2-2 Yamadaoka, Suita, Osaka 565, Japan (E-mail:fukunaga@genetic.med.osaka-u.ac.jp, Tel:81-6-879-3318, Fax:81-6-879-3319) REFERENCE 2 (sites) AUTHORS Fukunaga,R. and Hunter,T. TITLE MNK1, a new MAP kinase-activated protein kinase, isolated by a novel expression screening method for identifying protein kinase substrates JOURNAL EMBO J. 16 (8), 1921-1933 (1997) MEDLINE 97299869 FEATURES Location/Qualifiers source 1..2617 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 188..1462 /note="MAP kinase-activated protein kinase" /codon_start=1 /product="MNK1" /db_xref="PID:d1020674" /db_xref="PID:g2077825" /translation="MVSSQKLEKPIEMGSSEPLPIADGDRRRKKKRRGRATDSLPGKF EDMYKLTSELLGEGAYAKVQGAVSLQNGKEYAVKIIEKQAGHSRSRVFREVETLYQCQ GNKNILELIEFFEDDTRFYLVFEKLQGGSILAHIQKQKHFNEREASRVVRDVAAALDF LHTKGIAHRDLKPENILCESPEKVSPVKICDFDLGSGMKLNNSCTPITTPELTTPCGS AEYMAPEVVEVFTDQATFYDKRCDLWSLGVVLYIMLSGYPPFVGHCGADCGWDRGEVC RVCQNKLFESIQEGKYEFPDKDWAHISSEAKDLISKLLVRDAKQRLSAAQVLQHPWVQ GQAPEKGLPTPQVLQRNSSTMDLTLFAAEAIALNRQLSQHEENELAEEPEALADGLCS MKLSPPCKSRLARRRALAQAGRGEDRSPPTAL" polyA_signal 2601..2606 polyA_site 2617 /note="9 a nucleotides" BASE COUNT 639 a 684 c 675 g 619 t ORIGIN 1 gcgatctgca ggtaggggtg cgcgcgaccg ctccccggcg ggagccagcg aaggtttcca 61 tgtcagaggc cgatggagaa ctgaagattg ccacctacgc acaaaggcca ttgagacact 121 tcgtgtagct ggaagacacc aacttcctga caggagcttt atttcatttg ggatttcaag 181 tttacagatg gtatcttctc aaaagttgga aaaacctata gagatgggca gtagcgaacc 241 ccttcccatc gcagatggtg acaggaggag gaagaagaag cggaggggcc gggccactga 301 ctccttgcca ggaaagtttg aagatatgta caagctgacc tctgaattgc ttggagaggg 361 agcctatgcc aaagttcaag gtgccgtgag cctacagaat ggcaaagagt atgccgtcaa 421 aatcatcgag aaacaagcag ggcacagtcg gagtagggtg tttcgagagg tggagacgct 481 gtatcagtgt cagggaaaca agaacatttt ggagctgatt gagttctttg aagatgacac 541 aaggttttac ttggtctttg agaaattgca aggaggttcc atcttagccc acatccagaa 601 gcaaaagcac ttcaatgagc gagaagccag ccgagtggtg cgggacgttg ctgctgccct 661 tgacttcctg cataccaaag gcattgctca tcgtgatctg aaaccagaaa atatattgtg 721 tgaatctcca gaaaaggtgt ctccagtgaa aatctgtgac tttgacttgg gcagtgggat 781 gaaactgaac aactcctgta cccccataac cacaccagag ctgaccaccc catgtggctc 841 tgcagaatac atggcccctg aggtagtgga ggtcttcacg gaccaggcca cattctacga 901 caagcgctgt gacctgtgga gcctgggcgt ggtcctctac atcatgctga gtggctaccc 961 acccttcgtg ggtcactgcg gggccgactg tggctgggac cggggcgagg tctgcagggt 1021 gtgccagaac aagctgtttg aaagcatcca ggaaggcaag tatgagtttc ctgacaagga 1081 ctgggcacac atctccagtg aagccaaaga cctcatctcc aagctcctgg tgcgagatgc 1141 aaagcagaga cttagcgccg cccaagttct gcagcaccca tgggtgcagg ggcaagctcc 1201 agaaaaggga ctccccacgc cgcaagtcct ccagaggaac agcagcacaa tggacctgac 1261 gctcttcgca gctgaggcca tcgcccttaa ccgccagcta tctcagcacg aagagaacga 1321 actagcagag gagccagagg cactagctga tggcctctgc tccatgaagc tttcccctcc 1381 ctgcaagtca cgcctggccc ggagacgggc cctggcccag gcaggccgtg gtgaagacag 1441 gagcccgccc acagcactct gaaatgctcc agtcacacct tataggccct aggcctggcc 1501 aggcattgtc ccctggaaac ctgtgtggct aaagtctgct gagcaggcag cagcctctgc 1561 tctgtggctc cattcaggct ttttcatcta cgaaggccct gaggttccca tcaaccccca 1621 tttccctagg gtcctggagg aaaaagcttt ttccaaaggg gttgtctttg aaaaggaaag 1681 caatcacttc tcactttgca taattgcctg cagcaggaac atctcttcac tgggctccac 1741 ctgctcaccc gcctgcagat ctgggatcca gcctgctctc accgctgtag ctgtggcggc 1801 tggggctgca gcctgcaggg agaagcaaga agcatcagtt gacagaggct gccgacacgt 1861 gcctcttccc tctcttctct gtcaccctcc tctggcggtc cttccacctt cctctgtcct 1921 ccggatgtcc tctttgcccg tcttctccct tggctgagca aagccatccc ctcaattcag 1981 ggaagggcaa ggagccttcc tcattcagga aatcaaatca gtcttccggt ctgcagcacg 2041 gaaaagcaca taatctttct ttgctgtgac tgaaatgtat ccctcgttta tcatcccctt 2101 tgtttgtgat tgctgctaaa gtcagtagta tcgttttttt aaaaaaaaag tttggtgttt 2161 ttaaccatct gttccagcaa agatgatacc ttaaactccc actgcaagcc catgaacttc 2221 ccagagagtg gaacggcttg ctcttctttc tagaatgtcc atgcacttgg gttttaatca 2281 gcagttccct attattctga ttttaagctg ttcctgtgat gaacttagag acagcatcgg 2341 tgtctgctgc tgtgtcccca ggtcttgtgt gggtggcaca gatctgggca gttagatagt 2401 gctctgtgcc taaggtgaag ccacactagg gtgaagcctc acttccctgt ttgagcaatg 2461 cagtgcctgc tgcccgtgtg catgaaggta cagccattca tataagtgga actattgagt 2521 tacataaaga aaatagattt gcatttgtca ggcagacgtt tatacaacac cacggtgctt 2581 ttatacattg tgcttatttt aataaaactg aaattct // LOCUS AB000410 1559 bp mRNA PRI 25-MAR-1997 DEFINITION Human hOGG1 mRNA, complete cds. ACCESSION AB000410 NID g1906756 KEYWORDS hOGG1. SOURCE Homo sapiens cell_line:Hela cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Arai,K., Morishita,K., Shinmura,K., Kohno,T., Taniwaki,M., Ohwada,S. and Yokota,J. TITLE Cloning of a human homolog of the yeast OGG1 gene that is involved in t he repair of oxidative DNA damage JOURNAL Oncogene (1997) In press REFERENCE 2 (bases 1 to 1559) AUTHORS Arai,K., Morishita,K., Shinmura,K., Kohno,T., Taniwaki,M., Ohwada,S. and Yokota,J. TITLE Cloning of a human homolog of the yeast OGG1 gene that is involved in t he repair of oxidative DNA damage JOURNAL Unpublished (1997) REFERENCE 3 (bases 1 to 1559) AUTHORS Yokota,J. TITLE Direct Submission JOURNAL Submitted (11-JAN-1997) to the DDBJ/EMBL/GenBank databases. Jun Yokota, National Cancer Center Research Institute, Biology Division; 5-1-1 Tsukiji, Chuo-ku, Tokyo 104, Japan (E-mail:jyokota@gan2.ncc.go.jp, Tel:03-3542-2511, Fax:03-3542-0807) FEATURES Location/Qualifiers source 1..1559 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Hela" /chromosome="3" /map="3p26.2" mRNA 1..1559 gene 269..1306 /gene="hOGG1" CDS 269..1306 /gene="hOGG1" /codon_start=1 /db_xref="PID:d1019847" /db_xref="PID:g1906757" /translation="MPARALLPRRMGHRTLASTPALWASIPCPRSELRLDLVLPSGQS FRWREQSPAHWSGVLADQVWTLTQTEEQLHCTVYRGDKSQASRPTPDELEAVRKYFQL DVTLAQLYHHWGSVDSHFQEVAQKFQGVRLLRQDPIECLFSFICSSNNNIARITGMVE RLCQAFGPRLIQLDDVTYHGFPSLQALAGPEVEAHLRKLGLGYRARYVSASARAILEE QGGLAWLQQLRESSYEEAHKALCILPGVGTKVADCICLMALDKPQAVPVDVHMWHIAQ RDYSWHPTTSQAKGPSPQTNKELGNFFRSLWGPYAGWAQAVLFSADLRQSRHAQEPPA KRRKGSKGPEG" BASE COUNT 333 a 454 c 469 g 303 t ORIGIN 1 gagaagataa gtcgcaagga gggggcggga cccacacctc aggaaagccg gagaattggg 61 gcacgcaagc gggggggctt tgatgacccc ccaaagggcg aggcatgcag gaggtggagg 121 aattaagtga aacagggaag gttgttaaac agcaccgtgt gggcgaggcc ttaagggtcg 181 tggtcctcgt ctgggcgggg tctttgggcg tcgacgaggc ctggttctgg gtaggcgggg 241 ctactacggg gcggtgcctg ctgtggaaat gcctgcccgc gcgcttctgc ccaggcgcat 301 ggggcatcgt actctagcct ccactcctgc cctgtgggcc tccatcccgt gccctcgctc 361 tgagctgcgc ctggacctgg ttctgccttc tggacaatct ttccggtgga gggagcaaag 421 tcctgcacac tggagtggtg tactagcgga tcaagtatgg acactgactc agactgagga 481 gcagctccac tgcactgtgt accgaggaga caagagccag gctagcaggc ccacaccaga 541 cgagctggag gccgtgcgca agtacttcca gctagatgtt accctggctc aactgtatca 601 ccactggggt tccgtggact cccacttcca agaggtggct cagaaattcc aaggtgtgcg 661 actgctgcga caagacccca tcgaatgcct tttctctttt atctgttcct ccaacaacaa 721 catcgcccgc atcactggca tggtggagcg gctgtgccag gcttttggac ctcggctcat 781 ccagcttgat gatgtcacct accatggctt ccccagcctg caggccctgg ctgggccaga 841 ggtggaggct catctcagga agctgggcct gggctatcgt gcccgttacg tgagtgccag 901 tgcccgagcc atcctggaag aacagggcgg gctagcctgg ctgcagcagc tacgagagtc 961 ctcatatgag gaggcccaca aggccctctg catcctgcct ggagtgggca ccaaggtggc 1021 tgactgcatc tgcctgatgg ccctagacaa gccccaggct gtgcccgtgg atgtccatat 1081 gtggcacatt gcccaacgtg actacagctg gcaccctacc acgtcccagg cgaagggacc 1141 gagcccccag accaacaagg aactgggaaa ctttttccgg agcctgtggg gaccttatgc 1201 tggctgggcc caagcggtgc tgttcagtgc cgacctgcgc caatcccgcc atgctcagga 1261 gccaccagca aagcgcagaa agggttccaa agggccggaa ggctagatgg ggcaccctgg 1321 acaaagaaat tccccaagca ccttcccctc cattccccac ttctctctcc ccatccccac 1381 ccagtctcat gttggggagg ggcctccctg tgactacctc aaaggccagg cacccccaaa 1441 tcaagcagtc agtttgcaca acaagatggg gtgggggata ttgagggaga cagcgctaag 1501 gatggtttta tcttcccttt attacaagaa ggaacaataa aatagaaaca tttgtatgg // LOCUS AB000449 1662 bp mRNA PRI 07-NOV-1997 DEFINITION Homo sapiens mRNA for VRK1, complete cds. ACCESSION AB000449 NID g1827449 KEYWORDS VRK1. SOURCE Homo sapiens fetal liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Nezu,Ji., Oku,A., Jones,M.H. and Shimane,M. TITLE Identification of two novel human putative Serine/Threonine kinases, VRK1 and VRK2, with structural similarity to vaccinia virus B1R kinase JOURNAL Genomics 45 (2), 327-331 (1997) MEDLINE 98008921 REFERENCE 2 (bases 1 to 1662) AUTHORS Nezu,J. TITLE Direct Submission JOURNAL Submitted (13-JAN-1997) to the DDBJ/EMBL/GenBank databases. Jun-ichi Nezu, Chugai Research Institute for Molecular Medicine, Inc., Gene Search Program; 153-2 Nagai, Niihari, Ibaraki 300-41, Japan (E-mail:nezuj@tk.chugai-pharm.co.jp, Tel:81-298-30-6211, Fax:81-298-30-6270) FEATURES Location/Qualifiers source 1..1662 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="14" /dev_stage="fetal" /map="between D14S265 and AFM063XE7" /tissue_type="liver" gene 76..1266 /gene="VRK1" CDS 76..1266 /gene="VRK1" /standard_name="Vaccinia virus BIR kinase related kinase 1" /codon_start=1 /product="VRK1" /db_xref="PID:g1827450" /translation="MPRVKAAQAGRQSSAKRHLAEQFAVGEIITDMAKKEWKVGLPIG QGGFGCIYLADMNSSESVGSDAPCVVKVEPSDNGPLFTELKFYQRAAKPEQIQKWIRT RKLKYLGVPKYWGSGLHDKNGKSYRFMIMDRFGSDLQKIYEANAKRFSRKTVLQLSLR ILDILEYIHEHEYVHGDIKASNLLLNYKNPDQVYLVDYGLAYRYCPEGVHKEYKEDPK RCHDGTIEFTSIDAHNGVAPSRRGDLEILGYCMIQWLTGHLPWEDNLKDPKYVRDSKI RYRENIASLMDKCFPEKNKPGEIAKYMETVKLLDYTEKPLYENLRDILLQGLKAIGSK DDGKLDLSVVENGGLKAKTITKKRKKEIEESKEPGVEDTEWSNTQTEEAIQTRSRTRK RVQK" BASE COUNT 552 a 274 c 368 g 468 t ORIGIN 1 ccgagttacg agtcggcgaa agcggcggga agttcgtact gggcagaacg cgacgggtct 61 gcggcttagg tgaaaatgcc tcgtgtaaaa gcagctcaag ctggaagaca gagctctgca 121 aagagacatc ttgcagaaca atttgcagtt ggagagataa taactgacat ggcaaaaaag 181 gaatggaaag taggattacc cattggccaa ggaggctttg gctgtatata tcttgctgat 241 atgaattctt cagagtcagt tggcagtgat gcaccttgtg ttgtaaaagt ggaacccagt 301 gacaatggac ctctttttac tgaattaaag ttctaccaac gagctgcaaa accagagcaa 361 attcagaaat ggattcgtac ccgtaagctg aagtacctgg gtgttcctaa gtattggggg 421 tctggtctac atgacaaaaa tggaaaaagt tacaggttta tgataatgga tcgctttggg 481 agtgaccttc agaaaatata tgaagcaaat gccaaaaggt tttctcggaa aactgtcttg 541 cagctaagct taagaattct ggatattctg gaatatattc acgagcatga gtatgtgcat 601 ggagatatca aggcctcaaa tcttcttctg aactacaaga atcctgacca ggtgtacttg 661 gtagattatg gccttgctta tcggtactgc ccagaaggag ttcataaaga atacaaagaa 721 gaccccaaaa gatgtcacga tggcactatt gaattcacga gcatcgatgc acacaatggt 781 gtggccccat caagacgtgg tgatttggaa atacttggtt attgcatgat ccaatggctt 841 actggccatc ttccttggga ggataatttg aaagatccta aatatgttag agattccaaa 901 attagataca gagaaaatat tgcaagtttg atggacaaat gttttcctga gaaaaacaaa 961 ccaggtgaaa ttgccaaata catggaaaca gtgaaattac tagactacac tgaaaaacct 1021 ctttatgaaa atttacgtga cattcttttg caaggactaa aagctatagg aagtaaggat 1081 gatggcaaat tggacctcag tgttgtggag aatggaggtt tgaaagcaaa aacaataaca 1141 aagaagcgaa agaaagaaat tgaagaaagc aaggaacctg gtgttgaaga tacggaatgg 1201 tcaaacacac agacagagga ggccatacag acccgttcaa gaaccagaaa gagagtccag 1261 aagtaattca gatgctgtga accagatttc cttttctttg ttttcttttg acttttttct 1321 ccttttctgt tagaactgtt ttattttcct gtgagtcttg cgaggtggaa ttaatgatta 1381 aatactcatg tgttcagaaa acataaactt tttttataaa aatattttgt acaattcatt 1441 aaaggctaat ttatgaaatt tgaaaatctt caggttatac tccttaagtt atcccaaagc 1501 cgtgtgtttg tgatgttttg gagtacatat atatgaaaat tattatgaca cgcacttttc 1561 taatcattgt acatttctca gagtggataa aaatgtttga caaagtcctc acttttaagg 1621 aaatgcaaag cttaaaataa aactctcttt tgtttgatgc ag // LOCUS AB000450 1833 bp mRNA PRI 07-NOV-1997 DEFINITION Homo sapiens mRNA for VRK2, complete cds. ACCESSION AB000450 NID g1827451 KEYWORDS VRK2. SOURCE Homo sapiens fetal liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Nezu,Ji., Oku,A., Jones,M.H. and Shimane,M. TITLE Identification of two novel human putative Serine/Threonine kinases, VRK1 and VRK2, with structural similarity to vaccinia virus B1R kinase JOURNAL Genomics 45 (2), 327-331 (1997) MEDLINE 98008921 REFERENCE 2 (bases 1 to 1833) AUTHORS Nezu,J. TITLE Direct Submission JOURNAL Submitted (13-JAN-1997) to the DDBJ/EMBL/GenBank databases. Jun-ichi Nezu, Chugai Research Institute for Molecular Medicine, Inc., Gene Search Program; 153-2 Nagai, Niihari, Ibaraki 300-41, Japan (E-mail:nezuj@tk.chugai-pharm.co.jp, Tel:81-298-30-6211, Fax:81-298-30-6270) FEATURES Location/Qualifiers source 1..1833 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="2" /dev_stage="fetal" /map="between CHLC.GATA23H01 - D2S357" /tissue_type="liver" gene 131..1657 /gene="VRK2" CDS 131..1657 /gene="VRK2" /standard_name="Vaccinia virus BIR kinase related kinase 2" /codon_start=1 /product="VRK2" /db_xref="PID:g1827452" /translation="MPPKRNEKYKLPIPFPEGKVLDDMEGNQWVLGKKIGSGGFGLIY LAFPTNKPEKDARHVVKVEYQENGPLFSELKFYQRVAKKDCIKKWIERKQLDYLGIPL FYGSGLTEFKGRSYRFMVMERLGIDLQKISGQNGTFKKSTVLQLGIRMLDVLEYIHEN EYVHGDVKAANLLLGYKNPDQVYLADYGLSYRYCPNGNHKQYQENPRKGHNGTIEFTS LDAHKGVALSRRSDVEILGYCMLRWLCGKLPWEQNLKDPVAVQTAKTNLLDELPQSVL KWAPSGSSCCEIAQFLVCAHSLAYDEKPNYQALKKILNPHGIPLGPLDFSTKGQSINV HTPNSQKVDSQKAATKQVNKAHNRLIEKKVHSERSAESCATWKVQKEEKLIGLMNNEA AQESTRRRQKYQESQEPLNEVNSFPQKISYTQFPNSFYEPHQDFTSPDIFKKSRSPSW YKYTSTVSTGITDLESSTGLWPTISQFTLSEETNADVYYYRIIIPVLLMLVFLALFFL " BASE COUNT 604 a 350 c 397 g 482 t ORIGIN 1 ctgcactgcg aggccgacgc agctggagag aagttaggca ggtcctaggg agggcaggct 61 cgagtgctgg gcccgcctcc ccgcgggact gtaggcccgg gggctccgcc tcgtcgcagc 121 ggcagaagtg atgccaccaa aaagaaatga aaaatacaaa cttcctattc catttccaga 181 aggcaaggtt ctggatgata tggaaggcaa tcagtgggta ctgggcaaga agattggctc 241 tggaggattt ggattgatat atttagcttt ccccacaaat aaaccagaga aagatgcaag 301 acatgtagta aaagtggaat atcaagaaaa tggcccgtta ttttcagaac ttaaatttta 361 tcagagagtt gcaaaaaaag actgtatcaa aaagtggata gaacgcaaac aacttgatta 421 tttaggaatt cctctgtttt atggatctgg tctgactgaa ttcaagggaa gaagttacag 481 atttatggta atggaaagac taggaataga tttacagaag atctcaggcc agaatggtac 541 ctttaaaaag tcaactgtcc tgcaattagg tatccgaatg ttggatgtac tggaatatat 601 acatgaaaat gaatatgttc atggtgatgt aaaagcagca aatctacttt tgggttacaa 661 aaatccagac caggtttatc ttgcagatta tggactttcc tacagatatt gtcccaatgg 721 gaaccacaaa cagtatcagg aaaatcctag aaaaggccat aatgggacaa tagagtttac 781 cagtttggat gcccacaagg gagtagcctt gtccagacga agtgacgttg agatcctcgg 841 ctactgcatg ctgcggtggt tgtgtgggaa acttccctgg gaacagaacc tgaaggaccc 901 tgtggctgtg cagactgcta aaacaaatct gttggacgag ctcccccagt cagtgcttaa 961 atgggctcct tctggaagca gttgctgtga aatagcccaa tttttggtat gtgctcatag 1021 tttagcatat gatgaaaagc caaactatca agccctcaag aaaattttga accctcatgg 1081 aataccttta ggaccactgg acttttccac aaaaggacag agtataaatg tccatactcc 1141 aaacagtcaa aaagttgatt cacaaaaggc tgcaacaaag caagtcaaca aggcacacaa 1201 taggttaatc gaaaaaaaag tccacagtga gagaagcgct gagtcctgtg caacatggaa 1261 agtgcagaaa gaggagaaac tgattggatt gatgaacaat gaagcagctc aggaaagcac 1321 aaggagaaga cagaaatatc aagagtctca agaacctttg aatgaagtaa acagtttccc 1381 acaaaaaatc agctatacac aattcccaaa ctcattttat gagcctcatc aagattttac 1441 cagtccagat atattcaaga agtcaagatc tccatcttgg tataaataca cttccacagt 1501 cagcacgggg atcacagact tagaaagttc aactggactt tggcctacaa tttcccagtt 1561 tactcttagt gaagagacaa acgcagatgt ttattattat cgcatcatca tacctgtcct 1621 tttgatgtta gtatttcttg ctttattttt tctctgaaga tgataccaaa attccttttg 1681 ataatttttt aagtttccag ctcttcaccg aaatgttgta ttcttatttc agtgtttcct 1741 tccagacatt tttaaggtaa ttggctttaa aaagagaaca tattttaaca aagtttgtgg 1801 acactctaaa aaataaaatt gctttgtact agt // LOCUS AB000459 4710 bp mRNA PRI 03-MAR-1997 DEFINITION Human mRNA, clone RES4-22A, complete cds. ACCESSION AB000459 NID g1843385 KEYWORDS Huntington's disease. SOURCE Homo sapiens fetus brain cDNA to mRNA, clone_lib:human fetal brain cDNA library clone:RES4-22A. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4710) AUTHORS Hadano,S., Ishida,Y. and Ikeda,J. TITLE The primary structure of five novel genes located close to the Huntington's disease gene on human chromosome 4p16.3 JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 4710) AUTHORS Hadano,S. TITLE Direct Submission JOURNAL Submitted (13-JAN-1997) to the DDBJ/EMBL/GenBank databases. Shinji Hadano, Japan Science and Technology Corporation (JST), NeuroGenes project/ICORP; Tokai University School of Medicine, Bohseidai, Isehara, Kanagawa 259-11, Japan (E-mail:shinji@igsp1.med.u-tokai.ac.jp, Tel:81-463-91-5095, Fax:81-463-91-4993) FEATURES Location/Qualifiers source 1..4710 /organism="Homo sapiens" /note="located close to the Huntington's disease gene" /db_xref="taxon:9606" /chromosome="4" /clone="RES4-22A" /clone_lib="human fetal brain cDNA library" /dev_stage="fetus" /map="4p16.3" /tissue_type="brain" exon 1..113 /number=1 exon 114..281 /number=2 exon 282..516 /number=3 CDS 352..4026 /note="unnamed protein product" /codon_start=1 /db_xref="PID:d1019861" /db_xref="PID:g1843386" /translation="MKVRLLRQLSAAAKVKAPSGLQGPPQAHQFISLLLEEYGALCQA ARSISTFLGTLENEHLKKFQVTWELHNKHLFENLVFSEPLLQSNLPALVSQIRLGTTT HDTCSEDTYSTLLQRYQRSEEELRRVAEEWLECQKRIDAYVDEQMTMKTKQRMLTEDW ELFKQRRFIEEQLTNKKAVTGENNFTDTMRHVLSSRLSMPDCPNCNYRRRCACDDCSL SHILTCGIMDPPVTDDIHIHQLPLQVDPAPDYLAERSPPSVSSASSGSGSSSPITIQQ HPRLILTDSGSAPTFCSDDEDVAPLSAKFADIYPLSNYDDTEVVANMNGIHSELNGGG ENMALKDESPQISSTSSSSSEADDEEADGESSGEPPGAPKEDGVLGSRSPRTEESKAD SPPPSYPTQQAEQAPNTCECHVCKQEASGLTPSAMTAGALPPGHQFLSPEKPTHPALH LYPHIHGHVPLHTVPHLPRPLIHPTLYATPPFTHSKALPPAPVQNHTNKHQVFNASLQ DHIYPSCFGNTPEWNSSKFISLWGSEVMNDKNWNPGTFLPDTISGSEILGPTLSETRP EALPPPSSNETPAVSDSKEKKNAAKKKCLYNFQDAFMEANKVVMATSSATSSVSCTAT TVQSSNSQFRVSSKRPPSVGDVFHGISKEDHRHSAPAAPRNSPTGLAPLPALSPAALS PAALSPASTPHLANLAAPSFPKTATTTPGFVDTRKSFCPAPLPPATDGSISAPPSVCS DPDCEGHRCENGVYDPQQDDGDESADEDSCSEHSSSTSTSTNQKEGKYCDCCYCEFFG HGGPPAAPTSRNYAEMREKLRLRLTKRKEEQPKKMDQISERESVVDHRRVEDLLQFIN SSETKPVSSTRAAKRARHKQRKLEEKARLEAEARAREHLHLQEEQRRREEEEDEEEEE DRFKEEFQRLQELQKLRAVKKKKKERPSKDCPKLDMLTRNFQAATESVPNSGNIHNGS LEQTEEPETSSHSPSRHMNHSEPRPGLGADGDAADPVDTRDSKFLLPKEVNGKQHEPL SFFFDIMQHHKEGNGKQKLRQTSKASSEPARRPTEPPKATEGQSKPRAQTESKAKVVD LMSITEQKREERKVNSNNNNKKQLNHIKDEKSNPTPMEPTSPGEHQQNSKLVLAESPQ PKGKNKKNKKKKGDRVNNSIDDVFLPKDIDLDSVDMDETEREVEYFKRFCLDSARQTR QRLSINWSNFSLKKATFAAH" exon 517..641 /number=4 exon 642..789 /number=5 exon 790..867 /number=6 exon 868..980 /number=7 exon 981..1223 /number=8 exon 1224..1377 /number=9 exon 1378..1557 /number=10 exon 1558..1809 /number=11 exon 1810..2008 /number=12 exon 2009..2281 /number=13 exon 2282..2570 /number=14 exon 2571..2754 /number=15 exon 2755..2985 /number=16 exon 2986..3850 /number=17 exon 3851..3932 /number=19 exon 3933..4710 /number=20 polyA_signal 4680..4685 BASE COUNT 1289 a 1294 c 1215 g 912 t ORIGIN 1 cacaatgaca tgcagacctg catattggag ctggacggag aaactgggct aatgtgacag 61 acagcaacaa gagtaaggca gttgcttcgc tattgagaga aagaaccata tgaagaaatt 121 tcggcagagg cggaccggga acctcagcag ctgcagaact actggtcaga agtgcgctac 181 acggtgcgct gcatctaccg ccaggcagga accccgctgg cagatgacca ggaccagtct 241 ctggtgcctg acaaggaggg agtgaaggag ctcgtggata ggctctgcga gagggacccc 301 taccagctgt accagcgtct ggaacagcaa gctcgagagt atgtgctgga gatgaaggtc 361 cgcctgctcc ggcagctgtc ggctgcggcc aaggtgaagg caccatctgg cctgcagggc 421 ccgccgcaag cgcaccagtt catctccctc ctgcttgagg agtacggcgc cctctgccag 481 gccgcacgct ccatcagcac cttccttggc actctggaaa atgaacactt gaaaaagttc 541 caagtgacgt gggaactgca taataaacac ctgtttgaaa atctggtctt ttcggagcca 601 cttcttcaga gcaacttgcc cgcactggtg tcacagatca ggctaggaac caccacacac 661 gacacctgca gtgaggacac atacagtacc ttgctgcaga ggtaccagcg ttccgaggag 721 gagctgcgca gagtcgccga ggagtggctg gagtgccaga agaggatcga cgcctatgtc 781 gacgagcaga tgacaatgaa aaccaagcag cgcatgttaa cagaagactg ggagcttttt 841 aaacaaagaa gattcattga agaacagtta accaataaga aagcagttac tggcgagaac 901 aacttcacag acaccatgag gcacgtgtta tcgtcccggc tgagcatgcc cgactgcccc 961 aactgcaact acaggagaag atgtgcttgc gatgactgca gtctctcaca catcctcacg 1021 tgtggtatca tggacccccc cgtcactgat gacatccaca ttcaccagct cccacttcaa 1081 gtggatcctg ctcctgacta tcttgctgag aggagcccgc ccagtgtgtc atctgcaagc 1141 tcggggtccg gctccagctc tcccatcaca attcagcagc accccaggct catcctcaca 1201 gacagtggct cggcaccaac tttttgtagt gatgatgaag atgttgcacc attgtcagcc 1261 aaatttgctg atatttatcc attgagtaat tatgatgata ccgaggtggt ggccaacatg 1321 aatggaatcc acagcgaatt gaatggtggc ggggaaaaca tggccctgaa ggatgagtct 1381 cctcagataa gcagtaccag cagtagttcc tcagaagctg atgatgaaga agcggacggc 1441 gagagtagtg gggagccccc aggggccccg aaggaagatg gagtgctggg aagcaggagc 1501 cccaggacag aggagagcaa agcagacagt ccacccccat cctacccaac acagcaggct 1561 gaacaagctc caaacacttg tgaatgtcat gtttgtaagc aagaagcttc tggactgaca 1621 ccatctgcaa tgacagccgg agcccttcct cctggccatc agttcttgag cccagagaag 1681 cccacacacc ctgcactgca cctttaccct cacatccatg gacatgtgcc tttgcacact 1741 gttccacacc tgccacgccc tctcatccac cccaccttgt atgcaacgcc ccccttcaca 1801 cacagtaagg ctttaccgcc agcacctgtt cagaatcaca caaataagca tcaggtattc 1861 aatgcatctc ttcaagacca tatttatccg agctgttttg ggaatactcc agagtggaat 1921 agttctaaat ttataagtct ttggggatca gaagtgatga atgataagaa ctggaatcct 1981 ggcactttct tgccagatac aatttctggg agtgaaatat tagggccaac actctcagaa 2041 acaagaccgg aagcccttcc acctccatct agcaatgaaa cacctgcagt ctcggatagt 2101 aaagagaaaa agaatgctgc aaaaaagaaa tgtttataca atttccaaga tgctttcatg 2161 gaagcaaata aagttgtcat ggccacgtca tcagccacgt cctctgtgtc ctgcacagct 2221 accacagtgc agtccagcaa cagccagttc agagtgtcat ccaagagacc tccttcagta 2281 ggtgacgtgt ttcatggcat cagcaaggag gaccacagac actcggcccc agccgccccg 2341 aggaatagcc ccacgggctt ggcccccctc ccagcgctct cgcctgctgc gctgtcacct 2401 gctgcgctct cacctgcctc cacacctcac cttgcaaatc ttgcagcccc atcattcccc 2461 aaaacagcaa ccacaactcc tgggtttgtg gacacacgca agagtttctg tcctgcaccc 2521 ctacccccgg ccacagatgg ctccattagc gcccctccaa gtgtctgcag tgaccctgac 2581 tgcgaagggc accgctgcga gaatggtgtc tacgacccac agcaggatga tggggacgag 2641 agtgcagatg aggacagctg ctctgagcac agctccagca cctcgacctc caccaaccag 2701 aaggagggca agtactgcga ctgctgctac tgcgaattct ttgggcacgg cgggcctcca 2761 gctgcaccaa caagtagaaa ttatgcagaa atgagggaaa agcttcgctt acggctgacc 2821 aagaggaaag aggagcaacc taaaaaaatg gaccagatct cagaaaggga aagcgtcgtt 2881 gaccatcgga gggtggagga tttgttgcag tttataaata gctccgaaac caaaccagtg 2941 agcagcacgc gtgcagcgaa gcgagcaagg cataagcaaa ggaagctgga ggagaaagct 3001 cgcctagaag cagaggccag ggcccgggag cacctgcacc tccaggagga gcagaggcgg 3061 cgggaggagg aggaggatga ggaagaagag gaggatcgtt tcaaggagga atttcagcgg 3121 cttcaggagc ttcagaagct aagagctgta aaaaagaaga agaaggagag gccaagtaaa 3181 gactgcccca agttggacat gctcactaga aatttccagg cagcaacaga gtctgttcct 3241 aactctggaa acatccacaa tggctcacta gagcaaactg aagaaccaga aacctcttct 3301 cactccccat ccaggcatat gaaccactca gagcccaggc cagggctagg ggctgatggg 3361 gatgctgcag accccgtcga caccagagac tccaaatttc tcctccccaa ggaggtgaat 3421 gggaagcagc atgagccact ctcttttttc ttcgacatca tgcagcacca taaagaagga 3481 aatggcaagc agaagctgag gcagaccagc aaggccagca gcgagccagc gaggaggccc 3541 acagagcccc ccaaggccac agaggggcag tccaagcccc gggcccagac tgagtcaaag 3601 gctaaggtgg tcgacctcat gtccatcaca gagcagaaaa gagaggagag aaaagtcaac 3661 agtaataaca ataacaaaaa gcagctgaac cacatcaagg acgaaaagtc aaacccaacc 3721 cctatggagc ccacctctcc cggtgagcat cagcagaaca gcaagctggt gctggcagag 3781 tcccctcagc caaagggcaa gaacaagaaa aataagaaga agaaaggaga cagagtcaac 3841 aattcaattg atgatgtctt tctacctaaa gatattgacc tagacagtgt ggatatggat 3901 gagacagaga gggaagtgga atatttcaaa aggttctgct tggattctgc tagacagacc 3961 cgacaaagac tgtctatcaa ctggtccaat tttagcttga aaaaagccac ctttgctgcc 4021 cactgaatga ggactccctg gagagggaca cgcgagaggc aggccaggct gcaccacccc 4081 aagagccacg cccctcgctg gcgccccaga gccgtggtgc ttgccaaggg ctgtgcggag 4141 ctggtgctgc ctgaaacccc agaccgagaa gttgatgctc ggcccacgcc gttagctcgt 4201 gtgcgtgtag tctgtgcgtg agactccttc gattgtagct ctgtgctgtc ggattggaac 4261 agtagttccc gccaagtcct cccaccaccg cggcctcgga ggcctgggcc gtggccagat 4321 aggagtttgc atcatccacg tggctccgtt gcctctgcat tgcgccctgt cctgtcatgt 4381 gtcctcaccg gggtatcggc cgtcactcag ctctcctgtg cccctgcgtc tcaccctagg 4441 cgggctgggc ggggcaggcc tcctttgttc tccacaatct actgtctccg agtgtacacg 4501 ttgcgctgtt tgtgtttgat ccccctgact tgtagccagc ttgtgtaaga tcccttgcag 4561 aacgagaaag ttaaaaacaa gcccacccag tactcacacc atcaagtctg ttatagagtg 4621 tacgactgta ttaacacgga ggcctgcctg gctacttttt taacatattg ttaagtaata 4681 ttaaaatcat gtctttcttt ttgaaagatg // LOCUS AB000462 7300 bp mRNA PRI 03-MAR-1997 DEFINITION Human mRNA for SH3 binding protein, clone RES4-23A, complete cds. ACCESSION AB000462 NID g1843391 KEYWORDS SH3 binding protein; Huntington's disease. SOURCE Homo sapiens adult brain cDNA to mRNA, clone:RES4-23A. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7300) AUTHORS Hadano,S., Ishida,Y. and Ikeda,J. TITLE The primary structure of five novel genes located close to the Huntington's disease gene on human chromosome 4p16.3 JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 7300) AUTHORS Hadano,S. TITLE Direct Submission JOURNAL Submitted (13-JAN-1997) to the DDBJ/EMBL/GenBank databases. Shinji Hadano, Japan Science and Technology Corporation (JST), NeuroGenes project/ICORP; Tokai University School of Medicine, Bohseidai, Isehara, Kanagawa 259-11, Japan (E-mail:shinji@igsp1.med.u-tokai.ac.jp, Tel:81-463-91-5095, Fax:81-463-91-4993) FEATURES Location/Qualifiers source 1..7300 /organism="Homo sapiens" /note="located close to the Huntington's disease gene" /db_xref="taxon:9606" /chromosome="4" /clone="RES4-23A" /dev_stage="adult" /map="4p16.3" /tissue_type="brain" exon 1..257 /number=1 exon 258..397 /number=2 CDS 262..1947 /codon_start=1 /product="SH3 binding protein" /db_xref="PID:d1019864" /db_xref="PID:g1843392" /translation="MAAEEMHWPVPMKAIGAQNLLTMPGGVAKAGYLHKKGGTQLQLL KWPLRFVIIHKRCVYYFKSSTSASPQGAFSLSGYNRVMRAAEETTSNNVFPFKIIHIS KKHRTWFFSASSEEERKSWMALLRREIGHFHEKKDLPLDTSDSSSDTDSFYGAVERPV DISLSPYPTDNEDYEHDDEDDSYLEPDSPEPGRLEDALMHPPAYPPPPVPTPRKPAFS DMPRAHSFTSKGPGPLLPPPPPKHGLPDVGLAAEDSKRDPLCPRRAEPCPRVPATPRR MSDPPLSTMPTAPGLRKPPCFRESASPSPEPWTPGHGACSTSSAAIMATATSRNCDKL KSFHLSPRGPPTSEPPPVPANKPKFLKIAEEDPPREAAMPGLFVPPVAPRPPALKLPV PEAMARPAVLPRPEKPQLPHLQRSPLDGQSFRSFSFEKPRQPSQADTGGDDSDEDYEK VPLPNSVFVNTTESCEVERLFKATSPRGEPQDGLYCIRNSSTKSGKVLVVWDETSNKV RNYRIFEKDSKFYLEGEVLFVSVGSMVEHYHTHVLPSHQSLLLRHPYGYTGPR" exon 398..500 /number=3 exon 501..618 /number=5 exon 619..689 /number=6 exon 690..778 /number=7 exon 779..847 /number=8 exon 848..1496 /number=9 exon 1497..1611 /number=10 exon 1612..1667 /number=11 exon 1668..1749 /number=12 exon 1750..1809 /number=13 exon 1810..7300 /number=14 polyA_signal 2380..2385 polyA_signal 5134..5139 polyA_signal 6242..6247 polyA_signal 6266..6271 BASE COUNT 1478 a 2074 c 1983 g 1765 t ORIGIN 1 cagccgggtg acccaggccg aggccggcag aagacagcct gatgccttga agacttcctc 61 ttgcactttt gttggagggt gctggtttgc taaaagcaga gagtattttt ctttttattt 121 attttgtttt taatttttta attttagctc cagctcagtt gcccagactg gagagcagtg 181 gccaatcata gcttactgcc tcctggaact cctggctcaa tcgatcctcc tggataagcc 241 tcctccgggt actatagctt catggcggct gaagagatgc attggcctgt ccctatgaag 301 gccattggtg cccagaacct gctaaccatg cctgggggcg tggccaaggc tggctacctg 361 cacaagaagg gcggtaccca gctgcagctg ctgaaatggc ccctgcgctt tgtcatcatc 421 cacaaacgct gcgtctacta cttcaagagt agcacctctg cctccccgca gggcgccttc 481 tccctgagtg gctataaccg ggtgatgcgg gcggctgagg agaccacgtc caacaacgtt 541 ttccccttca agatcatcca catcagcaag aagcaccgca cgtggttctt ctcggcctcc 601 tccgaggagg agcgcaagag ctggatggcc ttgctgcgca gggagattgg ccacttccac 661 gaaaagaaag acctgccctt ggacaccagc gactccagct cggacacaga cagcttctac 721 ggcgcagttg agcggcctgt ggatatcagc ctttccccgt accccacgga caatgaagac 781 tatgagcacg acgatgagga tgactcctac ctggagcctg actccccgga gcccggaagg 841 cttgaggatg ccctgatgca cccaccggct tacccaccac ccccagtgcc cacgcccagg 901 aagccagcct tctctgacat gccccgggcc cactccttta cctccaaggg ccccggtccc 961 ctactgccac ccccgccccc taagcacggc ctcccagatg ttggcctggc tgctgaggac 1021 tccaagaggg acccactgtg cccgaggcgg gctgagcctt gccccagggt acctgctacc 1081 ccccgaagga tgagcgatcc ccctctgagc accatgccca ccgcacccgg cctccggaaa 1141 cccccttgct tccgggagag tgccagcccc agcccggagc cctggacccc tggccacggg 1201 gcctgctcca cttccagtgc tgccatcatg gccactgcca cctccagaaa ctgtgacaaa 1261 ctcaagtcct tccacctgtc cccccgagga ccacccacat ctgagccccc acctgtgcca 1321 gccaacaagc ccaagttcct gaagatagct gaagaggacc ccccaaggga ggcagccatg 1381 cccggactct ttgtgccccc cgtggctccc cggcctcctg cgctgaagct gccagtgcct 1441 gaggccatgg cgcggcccgc agtcctgccc aggccagaga agccgcagct cccgcacctc 1501 cagcgatcac ccctcgatgg gcagagtttc aggagcttct cctttgaaaa gccccggcaa 1561 ccctcacagg ctgacactgg cggggacgac tcggacgagg actatgagaa ggtgccactg 1621 cccaactcgg tcttcgtcaa caccacggag tcctgcgaag tggaaaggtt gttcaaggct 1681 acaagccccc ggggagagcc ccaggatgga ctctactgca tccggaactc ctctaccaag 1741 tcggggaagg tcctggttgt gtgggacgaa acctctaaca aagtgaggaa ctatcgcatt 1801 tttgagaagg actctaagtt ctacctggag ggcgaggtcc tgtttgtgag tgtgggcagc 1861 atggtggagc actaccacac ccacgtgctg cccagccacc agagcctgct gctgcggcac 1921 ccctacggct acactgggcc taggtgatgg cagtccatgt ggctgccagg ccaaggcagt 1981 cacaggggcc ctgaccccag gccacacaga cggacatggg cccacatggg agggtgagca 2041 ggagcaaggc tgtgcttgcc tagggcctct gtgatggaca tctcgtagga cccagccagt 2101 ctcatccagc aggttgggtt ctagggctga accaggcgcc aggctccaga ggacgaaggg 2161 actctgttgc cccacactaa cttgccctgt cccaatccca gaaacccagg accaagctgt 2221 gcctgggctc caaggacagg aacactggtc cccccatcac actcacccct aagtgggctg 2281 ggagccaggc agggccaggg cagctgggtg ggggccgggg ctggccctgg gacccccagg 2341 aacgctaaga cacaggctcc agtaggggct gttgcctcca ataaagcagc agtgagcttt 2401 gccttggtgg ctggggcttg attgggaagg aggggattac cagcttactg ggtgcccatg 2461 ctgatgtcta agtggtgacc gcagcagtac ccgggaaccc caacagttgg ttgtcttgtc 2521 ttccagggtg caggtcactg agtgacttcc ccagggtgca cagcgagtaa cagatcagga 2581 cccaaacttg ggcagtctgg gctgggagcc cacaccccac tcaccagttc tgctgcctca 2641 ggtcaggcca gggcagtgct gctgcagagc tagaaggccc tgcagctaca gctgcttcat 2701 tccctgcatt agtgcctggt tactgggtac ctcctgagtg gctgtccccg ttccagaact 2761 tgcatacact gagcgggcta cagagctaga agccctgcag ctacagctgc ttcattccct 2821 gcattagcga gcagttattg ggtacctcct gcatgcctgg tcccattcca gacaggggcc 2881 tctggcctgg ctgagttcac agcccagtct ggggacagct gggtatgagg tgcttacggc 2941 acagtgtcca gggcagctgg gtgtgcaggg actgggggct cccggaagat tttttggagg 3001 aagtaacagc tacgatggga tgggaacagt ggaccctaag caggccaagg gtgcgtaggg 3061 acggtggtac ccagatgccc aagtcttcca ggcaatacct ggctcaggcc cagccccaat 3121 ccatcccctt actttctgcc atggagttcc agcaggtcac tctccctggc acaccttcca 3181 ggctggattt ttaatgaaac agactcaggg aggtaggggc tggcagggac cctagaatcc 3241 ttgtgatttt tcttagcacc ttatgtcagg gaaacctaaa ctgaggtcag cacttgggcc 3301 cactgacagt gactgactgg gggagaaggt cctgcagccc ccttcccctg ggtgtgttct 3361 ggggacctgt ggtttgctgg cggaaacaaa tgatgaggct ggttagcgga tgtgggaggc 3421 tgtgacccca gggggccata gggtgcggtg gaactgcagg ccctgcagat gacggcagcc 3481 agctgcttcc aggaaccagg tgtccaaggc cacctctgca ggggtttcct cttcagcctg 3541 cctggggtga gaggtcagtg caccacagcc gaggctggag cacagggagc ttctgttgtt 3601 ctgatctatc tctggaaaac cagccattcc tcctccctgc agtcagaatt ctttgccctg 3661 tctgacctga acttgcttag ggagtcatgc cactccccac tgtggccata gtttctcttc 3721 ctgtaaaatt ttattatttt agttttttgt ttttgagatg tagtctcacc ctgtcgccca 3781 ggctggagtg caatgccgtg atctccgctc actgccacct ccgcctctct agttcaagcg 3841 attttcctgc ctcagcctcc cgagtagctg ggattccagg cgcccgccac cacgcctggc 3901 taattttttg tatttttagt agagacggga ttttatcatg ttggccaggc tggtctcgaa 3961 ctcctgacct caggtgatct gcccaccttg gcctcccaaa gtgctgggat tacaggcatg 4021 agccactgtg cctggcccct tcctgtaaaa tttttaaatg gagaattggg tgcgagatgt 4081 ggtttccagc ctggtgcctg gggtgctgag ctagtgagtg gtgcagtcca ggacaccttt 4141 gctttatgtc acttacacgg tcacctggag ccggctcaag tggctaaagc atcctggggc 4201 ccagagccag gtgatagtcc ctctggccaa ctggacagtt gaggcttgtg gttaacccga 4261 agcccagctg gggccttggt ccagcttcgc ttcccagatt ctgcacctgc tagcacagct 4321 gtccacgtct gtgtgagctg ttctaggccg agggcctcag tttcaagagt gtgttggggt 4381 gggatggggc aggccgtggt cctccagcat gaagaaggag ccatgaggag ttcccatgac 4441 ctcccgagac ttgccataag tgttctagtc cacatataag ggtagggttg ggattaccat 4501 ttactgacca catctgtgag gtgccgagct gggtgcttga catcatttgc ttggagaagc 4561 agctgttagt agacccattt tacaggtgag agaaccaagt ctcacagagg cctgggttca 4621 agtcccacct ctgccactaa ctggcatgtg accctatcta tccttcactg ctctgagcct 4681 agaccctggc ccctgcctgg ctccctgcca ggctccctgc cacccctcac gacctctgat 4741 ggtcgttgtg ggggtctctt gcctggctcc cagggctagg gttagggctc tggaggtgct 4801 ttcactcaac caagggggcc acagcactgg ggagtgaaac tgccccgcct caccctgcgt 4861 tgccctctgg gtctgtgagg gtgggctggc aggaggccta ggccttgccc taggggcagt 4921 cctgcttcct cattttatag atagggaaac tgaggctttg ggaggactca ctgacatacc 4981 taccttcaag atgagttcag gtgggctcag ttctggggct tgggaaaagg gccccagtgg 5041 ctttgggaag cacccccagc ccagggtgaa acatgcttct tctcttcctg tggttccatc 5101 cgaaggattg tggtgagccc cgtgccttca gttaataaag atttgtattg tgaaaagatt 5161 ttttcttttt tttttgggac acagtctcac tctgtcgccc aggctagagt ggattggcgt 5221 gatctcggct caatgcaaat ctccagggtt caatcgattc tcctgcctca ccctcccatg 5281 tagctgggat tacagctgcc tgccaaattt ttgtattttt agtggaaccg gggtttcacc 5341 atgttggcca ggctggtctt gaactcctga cctcaactga tccgcccacc ttggcctccc 5401 aagtgctggg attacaggcg cgagccacgg cgcccagcct tgaaaagatg tttttagaac 5461 cagaagaaac ctcggttccc actgatcctt ctgggccacg ttgtgcggag ctcccctgct 5521 ggttggggct cagcgcagcc ccagggaggt gcttcctgca cctcaggatg ggcgagggtg 5581 ggcattgggg gagaggggga cctgggacct gcggcttagt tccctgaggc aggcagggct 5641 tattggggcc atttcataga aaggcagatt gaagctcagc agggaagaag cttttgaggg 5701 tgatccaggc gctggaggga tggcctagga caccagggtc acaccaggaa catgggaggg 5761 ccgtgcttgt ctctagacga ggggaatggg ggaagggcca caacctctgt ttctgtgacc 5821 cagcagcatc aagcccctcg ctgggcacct cgcacacacc ccctgcctta tctctgcctg 5881 cacgccctgt tccctccacc tagactgcct gctgaggggg cagtgccagg aggttgcctg 5941 tccttgggga agaggggcag tgaccctgtg aagatgcttg acagacaacc cccaccacct 6001 cagaagtgtg tgtgagtggt gaaccctttt aagccatctt ccagccattc tcactggagg 6061 gagatttgat gggtacagag cagaccccta cctgtctacc ctccttcgga cccctaggaa 6121 gcttcgcagg ccttccaggc tgccagacag ctgccctggc gttgccgtct gcttcttccc 6181 tggccccact ctgaggggct cagagctgag gcagaatccc tttttcattc atttcctgca 6241 gaataaaaca acatacagaa aagtgaataa aacataaatg cacaacctaa cacactgtta 6301 ggaagtaaac gatctgcaac caccatcagg aaatagtttt gccagcaccc aagtgccctc 6361 ccctcacagt gtcacttccg gcctctctgc cctggcttat gtgagtcttg tgttcttgtt 6421 tttctaaaaa gtcttcagca cccaattatg caagcattgc agtattttcc tgtttctgtg 6481 ctttatcccc ttgaatcata cagatgcaaa ttctggcagc tggcttcttt ggctcgttat 6541 tatgtctgtg agatttattc atgttgctgt gcgtagtata gtttgtgcat gttcattgct 6601 aaaaacttcc attgtttggc tgtatcgtag ttcacagatt catttcactg tcagtcaagc 6661 ttgtccaatg catgcagccc aggatgcctt tgaatgtggc ccaacacaaa tttgtaaact 6721 ttcttaaaac attataaaga tttttgtttg cgattttttt ttttagctca tcagctatag 6781 ttagtggtag tgtattttat gcgtgacccg agacagttct tccggtatgg tccatggaag 6841 ccaaaagatt ggacatgcct gctgtagatg gacagttggt ttgtttctag tttggggtaa 6901 ctacacacaa tgctgctagc aacagttttg tccatgtctc tgatgcacgt gtgttttttg 6961 caaatggtgc acaaattttt ctagggtttg tactcaggag tctgactcct gggttctagg 7021 gtatgaagat ctttctaaat attgttctag tttacgtgcc caccagcagt aaaacagaat 7081 tcccttgcct tcccatcctt ggcagacatt tcacttttgc cagtctggtg gggtgtatag 7141 ttatggcctt aatttgcatt tagctaatta ccaaggagat tgagcatatt tttatgtttt 7201 tattaaccat tttgattttg tctcctgtga agtgtctatc atcttttgcc cattttttaa 7261 cttgttgtct ttttcttttt cttttctttt tttttttttt // LOCUS AB000468 2903 bp mRNA PRI 03-MAR-1997 DEFINITION Human mRNA for zinc finger protein, clone RES4-26, complete cds. ACCESSION AB000468 NID g1843400 KEYWORDS zinc finger protein; Huntington's disease. SOURCE Homo sapiens fetus brain cDNA to mRNA, clone_lib:human fetal brain cDNA library clone:RES4-26. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2903) AUTHORS Hadano,S., Ishida,Y. and Ikeda,J. TITLE The primary structure of five novel genes located close to the Huntington's disease gene on human chromosome 4p16.3 JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 2903) AUTHORS Hadano,S. TITLE Direct Submission JOURNAL Submitted (13-JAN-1997) to the DDBJ/EMBL/GenBank databases. Shinji Hadano, Japan Science and Technology Corporation (JST), NeuroGenes project/ICORP; Tokai University School of Medicine, Bohseidai, Isehara, Kanagawa 259-11, Japan (E-mail:shinji@igsp1.med.u-tokai.ac.jp, Tel:81-463-91-5095, Fax:81-463-91-4993) FEATURES Location/Qualifiers source 1..2903 /organism="Homo sapiens" /note="located close to the Huntington's disease gene" /db_xref="taxon:9606" /chromosome="4" /clone="RES4-26" /clone_lib="human fetal brain cDNA library" /dev_stage="fetus" /map="4p16.3" /tissue_type="brain" exon 1..140 /number=1 exon 141..306 /number=2 CDS 298..870 /codon_start=1 /product="zinc finger protein" /db_xref="PID:d1019867" /db_xref="PID:g1843401" /translation="MSTRKRRGGAINSRQAQKRTREATSTPEISLEAEPIELVETAGD EIVDLTCESLEPVVVDLTHNDSVVIVDERRRPRRNARRLPQDHADSCVVSSDDEELSR DRDVYVTTHTPRNARDEGATGLRPSGTVSCPICMDGYSEIVQNGRLIVSTECGHVFCS QCLRDSLKNANTCPTCRKKINHKRYHPIYI" exon 307..421 /number=3 exon 422..501 /number=4 exon 502..511 /number=5 exon 512..671 /number=6 exon 672..720 /number=7 exon 721..2902 /number=8 polyA_signal 2881..2886 polyA_site 2903 BASE COUNT 711 a 738 c 692 g 762 t ORIGIN 1 tgctgttgag gcggcggcat ctttctcgag gagctctcct gggcggctga agaaggagct 61 tcttctccgg agtgcgccgg cggtggcgcc tgcggaccta actagctcca ggttaggccg 121 agctttgcgg gaaagcagcg gacttgaaaa tactggaaat ctgtccggat ccaaattatt 181 ttgcaagcca gatgagtaac cagagggcat gaaaggttga gaacatttga cttccctgca 241 aaccttggta tagatcactt ccttttctgt aggaaaggaa aggcaccaaa gagcacaatg 301 agtacaagaa agcgtcgtgg tggagcaata aattctagac aagctcagaa gcgaactcgg 361 gaagcaacct ccacccccga gatctccttg gaagcagaac ccatagaact cgtggaaact 421 gctggagatg aaattgtgga cctcacttgt gaatctttag agcctgtggt ggttgatctg 481 actcacaatg actctgttgt gattgttgac gaaagaagaa gaccaaggag gaatgctagg 541 aggctgcccc aggaccatgc tgacagctgt gtggtgagca gtgacgatga ggagttgtcc 601 agggacagag acgtatatgt gactacccat actcccagaa acgccaggga tgagggcgct 661 acaggcctca ggccctcagg tactgtcagt tgtcccatct gcatggacgg atactcagag 721 atcgtgcaga atggacgtct catcgtttcc acagaatgcg gccatgtctt ctgtagccag 781 tgcctccgtg attccctgaa gaatgctaat acttgcccaa cttgtaggaa aaagatcaac 841 cacaaacggt accaccccat ttatatatga agtattcaga gccccccagg agagacggat 901 ggacagacag acagccaggt tctccagtgg tatctgcctc cattttcctg agatcaaaaa 961 gactgtttcg aaaccaacat ctgatatgta aactgctctt ttgtttccaa ccccttcctt 1021 ttgttatctc cagtttgatg ctatggcgct ggacccaggg ccctcccagg ccatctctgt 1081 tcctctgggg tggtccagtt ctagagtggg agaaagggag tcaggcgcat tgggaatcgt 1141 ggttccagtc tggttgcaga atctgcacat ttgccaagaa attttccctg tttggaaagt 1201 ttgccccagc tttcccgggc acaccacctt ttgtcccaag tgtctgccgg tcgaccaatc 1261 tgcctgccac acattgacca agccagaccc ggttcaccca gctcgaggat cccaggttga 1321 agagtggccc cttgaggccc tggaaagacc aatcactgga cttcttccct tgagagtcag 1381 aggtcacccg tgattctgcc tgcaccttat cattgatctg cagtgatttc tgcaaatcaa 1441 gagaactctg cagggcactc ccctgtttcc taagaacgaa aaagtgcaat aaaggccatt 1501 cgttacctac ttttcagcag cccacaagat gtagcactat tagtgtcccc ctcagaggct 1561 taatgttgcc tgtggagcag tgcccatccc agcccgtttc tgcccaccag ttgttctcag 1621 gaaccttacc catgctccag cgtccttcac ctggcacagg acatgcaaga taaatagggc 1681 aggcacgtgt ttgggtgtcc tctcttttct gataaaatcc atcccgtgtt tgccacacgc 1741 cctccagtcc tcagttccca ctgcctaacg tctgcccccg tgtagatact gagaggtggt 1801 ggcagtaatt gtggccttat cagccgctca gttccaggct tttgcccagg tcactgttgc 1861 cccatgttcg gagaacctgg cccacctgtc ttggctttct catccttccc aacccagtgc 1921 cgtttatttc agaagcttcc tggccactgg gcttggatgc ttcgggcttc tgactgctcc 1981 ataggttttg actggtgaaa caggggccca gatgacaacc tctccttcgc tccacaggta 2041 cgcgggagcc tcaggttctc tcaggggcag caaagtggcc caagctgccc ctgacagcac 2101 agggcctggg ggtggctaac gagagaggcc ttacagtgcc ggcatgcctc ctcttccact 2161 gtcgtccttc ctcagagggc ctcacgccaa acaaacggcc ttttcgtgtg aaacatcttc 2221 agggcgggaa aggggccact tctggctttg ttagcaataa ctgaccttca gtttaccctt 2281 ctgaaggagc aggactcagc acagaattca ctttagacgg ggctgaagga gtgtccctcc 2341 tctatgtgaa aagaaaattg ttttattctt cattctgact ttttaactgt ttggctcact 2401 tccagttagt ttgaatgaaa ataataattt tctacttgga gttgaagagg gcagaatccg 2461 cagctctcat cattgtgatg tgtagcatgt ctgccctctg actggacatc attgccatta 2521 actttcttct gggcatcacg gcaatgtcac gatgcccaga cttggagcaa ggcaaccttg 2581 gagtcagtcc actcataaaa tatggtaaca cccattttaa aatttaagtt ttgtccttaa 2641 agacaacttc agtggttaat tataaaagtt gtgttacttc gtcctaaatt aaattgatag 2701 aaagatttaa aaatgtgttt tgtttctact attcagaaac tgcgaactag ggaaaggttg 2761 gtatgaaaaa atgtctttcc ttttttcaat gtacatagtt caactctttc tttgttacat 2821 ttaaactata tccatggata tcagtctgct ttggactcct ctgctagtgt tacagatgga 2881 aataaaacca ttaatttgaa cca // LOCUS AB000516 3712 bp mRNA PRI 31-JAN-1998 DEFINITION Homo sapiens mRNA for DSIF p160, complete cds. ACCESSION AB000516 NID g2723379 KEYWORDS DSIF p160. SOURCE Homo sapiens cell_line:HeLa cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Wada,T., Takagi,T., Yamaguchi,Y., Ferdous,A., Imai,T., Hirose,S., Sugimoto,S., Yano,K., Hartzog,G.A., Winston,F., Buratowski,S. and Handa,H. TITLE DSIF, a novel transcription elongation factor that regulates RNA polymerase II processivity, is composed of human Spt4 and Spt5 homologs JOURNAL Genes Dev. (1998) In press REFERENCE 2 (bases 1 to 3712) AUTHORS Handa,H. TITLE Direct Submission JOURNAL Submitted (17-JAN-1997) to the DDBJ/EMBL/GenBank databases. Hiroshi Handa, Tokyo Institute of Technology, Faculty of Bioscience and Biotechnology; 4259 Nagatsuta-cho, Midori-ku, Yokohama, Kanagawa 226, Japan (E-mail:hhanda@bio.titech.ac.jp, Tel:045-924-5797, Fax:045-924-5834) FEATURES Location/Qualifiers source 1..3712 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 154..3417 /codon_start=1 /product="DSIF p160" /db_xref="PID:d1024982" /db_xref="PID:g2723380" /translation="MSDSEDSNFSEEEDSERSSDGEEAEVDEERRSAAGSEKEEEPED EEEEEEEEEYDEEEEEEDDDRPPKKPRHGGFILDEADVDDEYEDEDQWEDGAEDILEK EEIEASNIDNVVLDEDRSGARRLQNLWRDQREEELGEYYMKKYAKSSVGETVYGGSDE LSDDITQQQLLPGVKDPNLWTVKCKIGEERATAISLMRKFIAYQFTDTPLQIKSVVAP EHVKGYIYVEAYKQTHVKQAIEGVGNLRLGYWNQQMVPIKEMTDVLKVVKEVANLKPK SWVRLKRGIYKDDIAQVDYVEPSQNTISLKMIPRIDYDRIKARMSLKDWFAKRKKFKR PPQRLFDAEKIRSLGGDVASDGDFLIFEGNRYSRKGFLFKSFAMSAVITEGVKPTLSE LEKFEDQPEGIDLEVVTESTGKEREHNFQPGDNVEVCEGELINLQGKILSVDGNKITI MPKHEDLKDMLEFPAQELRKYFKMGDHVKVIAGRFEGDTGLIVRVEENFVILFSDLTM HELKVLPRDLQLCSETASGVDVGGQHEWGELVQLDPQTVGVIVRLERETFQVLNMYGK VVTVRHQAVTRKKDNRFAVALDSEQNNIHVKDIVKVIDGPHSGREGEIRHLFRSFAFL HCKKLVENGGMFVCKTRHLVLAGGSKPRDVTNFTVGGFAPMSPRISSPMHPSAGGQRG GFGSPGGGSGGMSRGRGRRDNELIGQTVRISQGPYKGYIGVVKDATESTARVELHSTC QTISVDRQRLTTVGSRRPGGMTSTYGRTPMYGSQTPMYGSGSRTPMYGSQTPLQDGSR TPHYGSQTPLHDGSRTPAQSGAWDPNNPNTPSRAEEEYEYAFDDEPTRSPQAYGGTPN PQTPGYPDPSSPQVNPQYNPQTPGTPAMYNTDQFSPYAAPSPQGSYQPSPSPQSYHQV APSPAGYQNTHSPASYHPTPSPMAYQASPSPSPVGYSPMTPGAPSPGGYNPHTPGSGI EQNSSDWVTTDIQVKVRDTYLDTQVVGQTGVIRSVTGGMCSVYLKDSEKVVSISSEHL EPITPTKNNKVKVILGEDREATGVLLSIDGEDGIVRMDLDEQLKILNLRFLGKLLEA" BASE COUNT 885 a 1025 c 1121 g 681 t ORIGIN 1 cgtcagcccc agtcaggcgt cgtgcgaaca gcagctggta ccgaaggcgg aggtggagcc 61 cgagagggaa ccagcgggga aactgaggct cggggtggag cgcaggattg tgggacgcgc 121 caagactgct gtctttccca gcagcagcgg aagatgtcgg acagcgagga cagcaacttt 181 tccgaggagg aggacagcga gcgcagcagt gacggcgagg aggccgaggt agacgaagag 241 cggcggagtg cagcgggcag tgagaaagaa gaagagcctg aggacgaaga ggaggaggaa 301 gaggaggagg aatatgatga ggaagaggag gaagaagatg atgaccgacc ccccaagaaa 361 ccccgccatg gaggcttcat tctggacgag gctgatgttg acgatgagta tgaggacgag 421 gaccagtggg aggatggagc agaggacatt ctagagaaag aagagattga agcctccaat 481 atcgataatg ttgtcctgga tgaagatcgt tctggggctc gccgcctgca aaacctctgg 541 agggaccagc gagaagaaga actgggcgag tattacatga agaaatacgc caagtcatct 601 gtgggagaga cggtgtatgg aggatctgat gagctctcag acgacatcac ccagcagcag 661 ctgctcccag gagtcaagga tcccaatctg tggactgtca aatgtaagat tggggaggaa 721 cgggccacgg ccatttcctt gatgcgcaag ttcattgcct accagttcac agacacgccc 781 ctgcagatca agtcagtagt ggcaccagag catgtgaagg gctacatcta cgtggaggcc 841 tacaagcaga cccacgtgaa gcaggccatt gagggggtgg gcaacctgcg gcttggctac 901 tggaaccagc agatggtgcc catcaaggag atgacagacg tgctcaaagt ggtgaaggag 961 gtggccaacc tgaaaccaaa gtcctgggtc cgcctcaagc ggggcatcta caaggatgac 1021 attgctcagg tggactacgt ggagcccagc cagaacacca tctccctgaa gatgatccca 1081 cgcatcgact acgatcgcat caaggcccgc atgagcttga aagactggtt tgccaaaagg 1141 aagaagttta agcggcctcc acagaggctg tttgatgctg agaagatcag gtccctgggg 1201 ggtgatgttg cctctgatgg tgacttcctc atctttgagg ggaaccgtta cagccggaag 1261 ggctttctgt tcaagagctt cgccatgtct gctgtgatca cggagggtgt gaagccaaca 1321 ctctctgagc tggaaaagtt tgaggaccag ccagagggca ttgacctgga ggtggtgact 1381 gagagcacag ggaaggagcg ggagcacaac ttccaacctg gggacaacgt ggaggtctgt 1441 gagggtgagc tcatcaacct gcagggcaag atcctcagcg tggatggcaa caagatcacc 1501 atcatgccca agcatgagga cctcaaggac atgttggagt tcccagccca ggaacttaga 1561 aaatacttca agatggggga ccacgtgaag gtgattgctg gccgattcga gggcgacaca 1621 ggcctcattg tgcgggtgga ggagaatttc gttatcctgt tctctgacct caccatgcat 1681 gagctgaagg tgctcccccg ggacctgcag ctctgctcag agacagcatc aggtgtggat 1741 gttgggggcc agcatgaatg gggcgagctg gtgcagctgg atccccagac tgtgggtgtc 1801 atcgtgcgac tagaacggga gaccttccag gtgctgaaca tgtacgggaa ggtggtgact 1861 gtcagacatc aggctgtgac ccggaagaag gacaaccgct ttgctgtggc cttggactca 1921 gagcagaaca acatccatgt gaaagacatc gttaaggtca ttgatggccc ccactcaggc 1981 cgagaagggg agattcgcca tctcttccga agcttcgcct tcctacattg caagaaactg 2041 gtggagaacg ggggcatgtt tgtctgcaag acccgccacc tggtgctggc tgggggctca 2101 aagccccgtg atgtgaccaa cttcaccgtg ggtggctttg cgcctatgag tccccggatc 2161 agcagcccca tgcaccccag tgctggaggt cagcgtggcg gctttggtag cccaggtggc 2221 ggcagtggtg gcatgagcag gggccggggc cggagggaca acgaactcat cggccagacc 2281 gtgcgcatct cccaggggcc ctacaaaggc tacatcggtg tggtgaaaga tgccacagag 2341 tccacggccc gtgtggagct gcactccacc tgccagacca tctctgtgga ccgtcagcgg 2401 ctcaccacgg tgggctcacg gcgcccgggc ggcatgacct cgacctatgg gaggacgccc 2461 atgtatggct cccagacgcc catgtatggc tctggctccc gaacacccat gtacggctca 2521 cagacacccc tccaggatgg tagccgcacc ccacactacg gctcacagac gcccctgcat 2581 gatggcagcc gcactcctgc ccagagtggg gcctgggacc ccaacaaccc caacacgccg 2641 tcacgggctg aggaagaata tgagtatgct ttcgatgatg agcccacccg gtccccgcag 2701 gcctatgggg gaacccccaa tccccaaaca cctggctacc cagacccctc gtccccacag 2761 gtcaacccac aatacaaccc gcagacgcca gggacgccgg ccatgtacaa cacagaccag 2821 ttctctccct atgctgcccc ctccccacaa ggttcctacc agcccagccc cagcccccag 2881 agttaccacc aggtggcgcc aagcccagca ggctaccaga atacccactc cccagccagc 2941 taccacccta caccgtcgcc catggcctat caggctagcc ccagcccgag ccccgttggc 3001 tacagtccta tgacacctgg agctccctcc cctggtggct acaacccaca cacgccaggc 3061 tcaggcatcg agcagaactc cagcgactgg gtaaccactg acattcaggt gaaggtgcgg 3121 gacacctacc tggatacaca ggtggtggga cagacaggtg tcatccgcag tgtcacgggg 3181 ggcatgtgct ctgtgtacct gaaggacagt gagaaggttg tcagcatttc cagtgagcac 3241 ctggagccta tcacccccac caagaacaac aaggtgaaag tgatcctggg cgaggatcgg 3301 gaagccacgg gcgtcctact gagcattgat ggtgaggatg gcattgtccg tatggacctt 3361 gatgagcagc tcaagatcct caacctccgc ttcctgggga agctcctgga agcctgaagc 3421 aggcagggcc ggtggacttc gtcggatgaa gagtgatcct ccttccttcc ctggcccttg 3481 gctgtgacac aagatcctcc tgcagggcta ggcggattgt tctggatttc cttttgtttt 3541 tccttttagt tttccatctt ttccctccct ggtgctcatt ggaatctgag tagagtctgg 3601 gggagggtcc ccaccttcct gtacctcctc cccacagctt gcttttgttg taccgtcttt 3661 caataaaaag aagctgtttg gtctaaaaaa aaaaaaaact cgtgccgaat tc // LOCUS AB000520 2110 bp mRNA PRI 26-SEP-1997 DEFINITION Homo sapiens mRNA for APS, complete cds. ACCESSION AB000520 NID g2447035 KEYWORDS APS. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Yokouchi,M., Suzuki,R., Masuhara,M., Komiya,S., Inoue,A. and Yoshimura,A. TITLE Cloning and characterization of APS, an adaptor molecule containing PH and SH2 domains that is tyrosine phosphorylated upon B-cell receptor stimulation JOURNAL Oncogene 15 (1), 7-15 (1997) MEDLINE 97377002 REFERENCE 2 (bases 1 to 2110) AUTHORS Yoshimura,A. TITLE Direct Submission JOURNAL Submitted (18-JAN-1997) to the DDBJ/EMBL/GenBank databases. Akihiko Yoshimura, Kurume university, Institute of Life Science; 2432-3 Aikawa-machi, Kurume 839, Japan (Tel:0942-37-6313, Fax:0942-31-5212) FEATURES Location/Qualifiers source 1..2110 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 128..2026 /codon_start=1 /product="APS" /db_xref="PID:d1023381" /db_xref="PID:g2447036" /translation="MNGAGPGPAAAAPVPVPVPVPDWRQFCELHAQAAAVDFAHKFCR FLRDNPAYDTPDAGASFSRHFAANFLDVFGEEVRRVLVAGPTTRGAAVSAEAMEPELA DTSALKAASYGHSRSSEDVSTHAATKARVRKGFSLRNMSLCVVDGVRDMWHRRASPEP DAAAAPRTAEPRDKWTRRLRLSRTLAAKVELVDIQREGALRFMVADDAAAGSGGSAQW QKCRLLLRRAVAEERFRLEFFVPPKASRPKVSIPLSAIIEVRTTMPLEMPEKDNTFVL KVENGAEYILETIDSLQKHSWVADIQGCVDPGDSEEDTELSCTRGGCLASRVASCSCE LLTDAVDLPRPPETTAVGAVVTAPHSRGRDAVRESLIHVPLETFLQTLESPGGSGSDS NNTGEQGAETDPEAEPELELSDYPWFHGTLSRVKAAQLVLAGGPRNHGLFVIRQSETR PGEYVLTFNFQGKAKHLRLSLNGHGQCHVQHLWFQSVLDMLRHFHTHPIPLESGGSAD ITLRSYVRAQDPPPEPGPTPPAAPASPACWSDSPGQHYFSSLAAAACPPASPSDAAGA SSSSASSSSAASGPAPPRPVEGQLSARSRSNSAERLLEAVAATAAEEPPEAAPGRARA VENQYSFY" BASE COUNT 346 a 774 c 663 g 327 t ORIGIN 1 ggatccaagc tattgtcctg cccatggctt cccatctcag gacgctctct ggccgctatc 61 atcccagcag tggagttcag cccactactc tgaaccagcc gcaggtggct gctatgggac 121 tgaagccatg aatggtgccg gccctggccc cgccgcagcc gccccggtcc cagtcccggt 181 cccggtcccg gactggcggc agttctgcga gctgcatgcg caggcggccg ccgtggactt 241 tgcgcacaag ttctgccgtt tcctgcggga caacccagct tacgacacgc ccgacgccgg 301 cgcctccttc tcccgccact tcgccgccaa cttcctggac gtcttcggcg aggaggtgcg 361 ccgcgtgctg gtggctgggc cgacgactcg gggcgcggcc gtgagcgcag aggccatgga 421 gccggagctc gcggacacct ctgcactcaa ggcggcgtcc tacggccact cgcggagctc 481 ggaggacgtg tccacgcacg cggccaccaa ggcccgcgtt cgcaagggct tctcgctgcg 541 caacatgagc ctgtgcgtgg tggacggcgt gcgcgacatg tggcaccggc gcgcctcgcc 601 cgagcccgac gcggcagctg ccccgcgcac cgccgagccc cgcgacaagt ggacgcggcg 661 cctgaggctg tcgcggacgc tggctgccaa ggtggagctg gtggacattc aacgcgaggg 721 ggcgctgcgc ttcatggtgg ccgacgacgc ggccgcgggc tccgggggct cggctcagtg 781 gcagaagtgc cgcctgctcc tgcgcagggc tgtggccgag gaacgcttcc gcctggagtt 841 cttcgtgccg cccaaagcct ccaggcccaa ggtcagcatc ccactgtcag ccatcattga 901 ggtccgcacc accatgcccc tggaaatgcc agagaaggat aacacattcg tcctcaaggt 961 agagaatgga gccgaataca tcttggagac catcgactct ctgcagaagc actcgtgggt 1021 agctgacatc cagggctgcg tggaccccgg tgacagtgag gaagacaccg agctctcctg 1081 tacccgagga ggctgtctgg ccagccgcgt ggcctcctgc agctgtgagc tcctgactga 1141 tgcagtcgac ctgccccgcc ccccagagac gacagccgtg ggtgcagtgg tgacagcccc 1201 ccacagccga ggtcgagatg ccgtcagaga atccctgatc cacgtcccgc tagagacctt 1261 tctgcagacc ctggaatccc cgggcggcag cggcagtgac agcaataaca caggggaaca 1321 gggtgcagag acggatcccg aggctgaacc cgagctggag ctatccgact acccatggtt 1381 ccacgggaca ctgtcccggg tcaaggctgc tcaactggtt ctggcagggg ggccccggaa 1441 ccacggcctc ttcgtgatcc gccaaagtga gactcggcct ggggagtacg tgctgacctt 1501 caacttccag ggcaaggcca agcacctgcg cctgtccctg aacggccacg gccagtgtca 1561 cgtacagcat ctgtggttcc agtctgtgct tgacatgctc cgccacttcc acacacaccc 1621 catcccactg gagtcagggg gctcggccga catcaccctt cgcagctatg tgcgggccca 1681 ggacccccca ccagagccgg gccccacgcc ccctgccgcg cccgcgtccc cggcctgctg 1741 gagcgactcg cccggccagc actacttctc cagcctcgcc gcggccgcct gcccgcctgc 1801 ctcgccctcc gacgccgccg gcgcctcctc gtcttccgcc tcgtcgtcct ctgccgcgtc 1861 ggggcccgcc cccccgcgcc ccgtcgaggg ccagctcagc gcgcggagcc gcagcaacag 1921 cgccgagcgc ctgctggagg ccgtggccgc caccgccgcc gaggagcccc cggaggccgc 1981 gcccggccgc gcgcgcgccg tggagaacca gtactccttc tactagcccg cggcgccgcc 2041 cgggtgggac acgccaagct cttcagtgaa gacacgatgt tattaaaagc ctgttttagg 2101 gactgcaaaa // LOCUS AB000634 2217 bp mRNA PRI 01-SEP-1997 DEFINITION Homo sapiens mRNA for protein phosphatase 2A delta (B'') regulatory subunit, delta1 isoform, complete cds. ACCESSION AB000634 NID g2189946 KEYWORDS protein phosphatase 2A delta (B'') regulatory subunit, delta1 isoform. SOURCE Homo sapiens cerebral cortex cDNA to mRNA, clone_lib:lambda ZAP II clone:CC6.2.2. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2217) AUTHORS Tanabe,O. TITLE Direct Submission JOURNAL Submitted (23-JAN-1997) to the DDBJ/EMBL/GenBank databases. Osamu Tanabe, Hiroshima University School of Medicine, Department of Biochemistry; Kasumi 1-2-3, Minami-ku, Hiroshima, Hiroshima 734, Japan (E-mail:otanabe@mcai.med.hiroshima-u.ac.jp, Tel:082-257-5136, Fax:082-257-5088) REFERENCE 2 (bases 1 to 263; 360 to 2217) AUTHORS Tanabe,O., Nagase,T., Murakami,T., Nozaki,H., Usui,H., Nishito,Y., Hayashi,H., Kagamiyama,H. and Takeda,M. TITLE Molecular cloning of a 74-kDa regulatory subunit (B' or delta) of human protein phosphatase 2A JOURNAL FEBS Lett. 379 (1), 107-111 (1996) MEDLINE 96159032 REFERENCE 3 (sites) AUTHORS Tanabe,O., Gomez,G.A., Nishito,Y., Usui,H. and Takeda,M. TITLE Molecular heterogeneity of the cDNA encoding a 74-kDa regulatory subunit (B' or delta) of human protein phosphatase 2A JOURNAL FEBS Lett. 408 (1), 52-56 (1997) MEDLINE 97324098 FEATURES Location/Qualifiers source 1..2217 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="CC6.2.2" /clone_lib="lambda ZAP II" /tissue_type="cerebral cortex" 5'UTR <1..14 CDS 15..1823 /codon_start=1 /product="protein phosphatase 2A delta (B'') regulatory subunit, delta1 isoform" /db_xref="PID:d1021213" /db_xref="PID:g2189947" /translation="MPYKLKKEKEPPKVAKCTAKPSSSGKDGGGENTEEAQPQPQPQP QPQAQSQPPSSNKRPSNSTPPPTQLSKIKYSGGPQIVKKERRQSSSRFNLSKNRELQK LPALKDSPTQEREELFIQKLRQCCVLFDFVSDPLSDLKFKEVKRAGLNEMVEYITHSR DVVTEAIYPEAVTMFSVNLFRTLPPSSNPTGAEFDPEEDEPTLEAAWPHLQLVYEFFL RFLESPDFQPNIAKKYIDQKFVLALLDLFDSEDPRERDFLKTILHRIYGKFLGLRAYI RRQINHIFYRFIYETEHHNGIAELLEILGSIINGFALPLKEEHKMFLIRVLLPLHKVK SLSVYHPQLAYCVVQFLEKESSLTEPVIVGLLKFWPKTHSPKEVMFLNELEEILDVIE PSEFSKVMEPLFRQLAKCVSSPHFQVAERALYYWNNEYIMSLISDNAARVLPIMFPAL YRNSKSHWNKTIHGLIYNALKLFMEMNQKLFDDCTQQYKAEKQKGRFRMKEREEMWQK IEELARLNPQYPMFRAPPPLPPVYSMETETPTAEDIQLLKRTVETEAVQMLKDIKKEK VLLRRKSELPQDVYTIKALEAHKRAEEFLTASQEAL" misc_difference 42..359 /note="Possible splice variant of the delta subunit of protein phosphatase 2A. Variant contains a deletion of bp 42-359." /citation=[3] /replace="" misc_difference 264..359 /note="Possible splice variant of the delta subunit of protein phosphatase 2A. Variant contains a deletion of bp 264-359." /citation=[2] /replace="" 3'UTR 1824..>2217 BASE COUNT 545 a 656 c 557 g 459 t ORIGIN 1 ccggacgggc cgagatgccc tataaactga aaaaggagaa ggagcccccc aaggttgcca 61 aatgcacagc caagcctagc agctcgggca aggatggtgg aggcgagaac actgaggagg 121 cccagccgca gccccagccc cagccccagc cccaagccca gtctcagcca ccgtcatcca 181 acaagcgtcc cagcaatagc acgccgcccc ccacgcagct cagcaaaatc aagtactcag 241 gggggcccca gattgtcaag aaggagcgac ggcaaagctc ctcccgcttc aacctcagca 301 agaatcggga gctgcagaag cttcctgccc tgaaagattc gccaacccag gagcgggagg 361 agctgtttat ccagaagcta cgccagtgct gtgtcctctt tgacttcgtg tcagacccac 421 tcagtgacct caaattcaag gaggtgaagc gggcaggact caacgagatg gtggagtaca 481 tcacccatag ccgtgatgtt gtcactgagg ccatttaccc tgaggctgtc accatgtttt 541 cagtgaacct cttccggacg ctgccacctt catcgaatcc cacaggggct gagtttgacc 601 cagaggaaga tgagcccacc ctggaagctg cttggccaca tctccagctc gtgtatgagt 661 tcttcttacg tttccttgag tctcctgatt tccagccaaa catagccaag aagtacatcg 721 accagaagtt tgtacttgct ctcctagacc tatttgacag tgaggatcct cgagagcggg 781 acttcctcaa gaccattttg catcgcatct atggcaagtt tttggggctc cgggcttata 841 tccgtaggca gatcaaccac atcttctaca ggttcatcta cgagacggag catcacaacg 901 ggattgctga gctcctggag atcctgggca gcatcatcaa tggctttgcc ctgcccctta 961 aagaagagca caagatgttc ctcatccgtg tcctacttcc ccttcacaag gtcaagtccc 1021 tgagtgtcta ccaccctcag ctggcatact gtgtggtaca attcctggag aaggagagca 1081 gtctgactga gccggtaatt gtgggacttc tcaagttttg gcccaagacc cacagcccca 1141 aggaggtgat gttcttgaat gagctggagg agattctgga cgtcattgaa ccttctgagt 1201 tcagcaaagt gatggaaccc ctcttccgcc agctggccaa gtgtgtctct agcccccatt 1261 tccaggtggc agagcgtgct ctctattact ggaacaatga gtacatcatg agcctgataa 1321 gtgacaatgc tgcccgagtc ctccccatca tgttccctgc actctacagg aactccaaga 1381 gccactggaa caagacaatc catggactga tctataatgc cctgaagttg tttatggaaa 1441 tgaatcagaa gctgtttgat gactgcacac aacaatacaa ggcagagaag cagaagggcc 1501 ggttccgaat gaaggaaagg gaagagatgt ggcaaaaaat cgaggagctg gcccggctta 1561 atccccagta tcccatgttc cgagcccctc caccactgcc ccctgtgtac tcgatggaga 1621 cagagacccc cacagctgag gacatccagc ttctgaagag gactgtggag actgaggctg 1681 ttcagatgct aaaagacatc aagaaggaga aagtgctgct gcggaggaag tcggagctgc 1741 cccaggacgt gtacaccatc aaggcactgg aggcgcacaa gcgggcggaa gagttcctaa 1801 ctgccagcca ggaggctctc tgacccctca cgttcctacc acagggccac agcccacaca 1861 gccctgggac actgccctgg ccctccatac tctgctccct actggctgtc ttgggggaag 1921 gcagcgcctc tctagctact caagggaggg ggatgtgggc acttgaagca gggacaccca 1981 cagaatggtc cctcttctcc ccaaaaggtg ttcatgcctc cctgtggcta gtacaggctg 2041 agcactaaga tgcttagtgc tcagacaacc tggggatgcc tgtcccctac ctgctcctca 2101 cccacagcta cctgaggctg ctctgagaag tacacacagg aatacatacg ctcctctatt 2161 cttcccttca tcctcatttg aacgccaggt atctcccctc ctctctctcc cctgcag // LOCUS AB000712 1665 bp mRNA PRI 27-OCT-1997 DEFINITION Homo sapiens hCPE-R mRNA for CPE-receptor, complete cds. ACCESSION AB000712 NID g2570124 KEYWORDS CPE-receptor. SOURCE Homo sapiens tissue_lib:fetal brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Katahira,J., Sugiyama,H., Inoue,N., Horiguchi,Y., Matsuda,M. and Sugimoto,N. TITLE Clostridium perfringens enterotoxin utilizes two structurally related membrane proteins as functional receptors in vivo JOURNAL J. Biol. Chem. 272 (42), 26652-26658 (1997) MEDLINE 97476271 REFERENCE 2 (bases 1 to 1665) AUTHORS Katahira,J. TITLE Direct Submission JOURNAL Submitted (26-JAN-1997) to the DDBJ/EMBL/GenBank databases. Jun Katahira, Institute for Microbial Diseases, Osaka University, Department of Bacterial Toxinology; 3-1, Yamadaoka, Suita, Osaka 565, Japan (E-mail:katahira@biken.osaka-u.ac.jp, Tel:81-6-879-8285, Fax:81-6-879-8283) FEATURES Location/Qualifiers source 1..1665 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_lib="fetal brain" gene 183..812 /gene="hCPE-R" CDS 183..812 /gene="hCPE-R" /codon_start=1 /product="CPE-receptor" /db_xref="PID:d1023860" /db_xref="PID:g2570125" /translation="MASMGLQVMGIALAVLGWLAVMLCCALPMWRVTAFIGSNIVTSQ TIWEGLWMNCVVQSTGQMQCKVYDSLLALPQDLQAARALVIISIIVAALGVLLSVVGG KCTNCLEDESAKAKTMIVAGVVFLLAGLMVIVPVSWTAHNIIQDFYNPLVASGQKREM GASLYVGWAASGLLLLGGGLLCCNCPPRTDKPYSAKYSAARSAAASNYV" BASE COUNT 282 a 498 c 513 g 372 t ORIGIN 1 gaaggaactg gttctgctca cacttgctgg cttgcgcatc aggactggct ttatctcctg 61 actcacggtg caaaggtgca ctctgcgaac gttaagtccg tccccagcgc ttggaatcct 121 acggccccca cagccggatc ccctcagcct tccaggtcct caactcccgt ggacgctgaa 181 caatggcctc catggggcta caggtaatgg gcatcgcgct ggccgtcctg ggctggctgg 241 ccgtcatgct gtgctgcgcg ctgcccatgt ggcgcgtgac ggccttcatc ggcagcaaca 301 ttgtcacctc gcagaccatc tgggagggcc tatggatgaa ctgcgtggtg cagagcaccg 361 gccagatgca gtgcaaggtg tacgactcgc tgctggcact gccgcaggac ctgcaggcgg 421 cccgcgccct cgtcatcatc agcatcatcg tggctgctct gggcgtgctg ctgtccgtgg 481 tggggggcaa gtgtaccaac tgcctggagg atgaaagcgc caaggccaag accatgatcg 541 tggcgggcgt ggtgttcctg ttggccggcc ttatggtgat agtgccggtg tcctggacgg 601 cccacaacat catccaagac ttctacaatc cgctggtggc ctccgggcag aagcgggaga 661 tgggtgcctc gctctacgtc ggctgggccg cctccggcct gctgctcctt ggcggggggc 721 tgctttgctg caactgtcca ccccgcacag acaagcctta ctccgccaag tattctgctg 781 cccgctctgc tgctgccagc aactacgtgt aaggtgccac ggctccactc tgttcctctc 841 tgctttgttc ttccctggac tgagctcagc gcaggctgtg accccaggag ggccctgcca 901 cgggccactg gctgctgggg actggggact gggcagagac tgagccaggc aggaaggcag 961 cagccttcag cctctctggc ccactcggac aacttcccaa ggccgcctcc tgctagcaag 1021 aacagagtcc accctcctct ggatattggg gagggacgga agtgacaggg tgtggtggtg 1081 gagtggggag ctggcttctg ctggccagga tagcttaacc ctgactttgg gatctgcctg 1141 catcggcgtt ggccactgtc cccatttaca ttttccccac tctgtctgcc tgcatctcct 1201 ctgttccggg taggccttga tatcacctct gggactgtgc cttgctcacc gaaacccgcg 1261 cccaggagta tggctgaggc cttgcccacc cacctgcctg ggaagtgcag agtggatgga 1321 cgggtttaga ggggaggggc gaaggtgctg taaacaggtt tgggcagtgg tgggggaggg 1381 ggccagagag gcggctcagg ttgcccagct ctgtggcctc aggactctct gcctcacccg 1441 cttcagccca gggcccctgg agactgatcc cctctgagtc ctctgcccct tccaaggaca 1501 ctaatgagcc tgggagggtg gcagggagga ggggacagct tcacccttgg aagtcctggg 1561 gtttttcctc ttccttcttt gtggtttctg ttttgtaatt taagaagagc tattcatcac 1621 tgtaattatt attattttct acaataaatg ggacctgtgc acagg // LOCUS AB000714 1250 bp mRNA PRI 27-OCT-1997 DEFINITION Homo sapiens hRVP1 mRNA for RVP1, complete cds. ACCESSION AB000714 NID g2570128 KEYWORDS RVP1. SOURCE Homo sapiens tissue_lib:lung cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Katahira,J., Sugiyama,H., Inoue,N., Horiguchi,Y., Matsuda,M. and Sugimoto,N. TITLE Clostridium perfringens enterotoxin utilizes two structurally related membrane proteins as functional receptors in vivo JOURNAL J. Biol. Chem. 272 (42), 26652-26658 (1997) MEDLINE 97476271 REFERENCE 2 (bases 1 to 1250) AUTHORS Katahira,J. TITLE Direct Submission JOURNAL Submitted (26-JAN-1997) to the DDBJ/EMBL/GenBank databases. Jun Katahira, Institute for Microbial Diseases, Osaka University, Department of Bacterial Toxinology; 3-1, Yamadaoka, Suita, Osaka 565, Japan (E-mail:katahira@biken.osaka-u.ac.jp, Tel:81-6-879-8285, Fax:81-6-879-8283) FEATURES Location/Qualifiers source 1..1250 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_lib="lung" gene 199..861 /gene="hRVP1" CDS 199..861 /gene="hRVP1" /codon_start=1 /product="RVP1" /db_xref="PID:d1023862" /db_xref="PID:g2570129" /translation="MSMGLEITGTALAVLGWLGTIVCCALPMWRVSAFIGSNIITSQN IWEGLWMNCVVQSTGQMQCKVYDSLLALPQDLQAARALIVVAILLAAFGLLVALVGAQ CTNCVQDDTAKAKITIVAGVLFLLAALLTLVPVSWSANTIIRDFYNPVVPEAQKREMG AGLYVGWAAAALQLLGGALLCCSCPPREKKYTATKVVYSAPRSTGPGASLGTGYDRKD YV" polyA_site 1250 /note="13 A nucleotides" BASE COUNT 202 a 452 c 409 g 187 t ORIGIN 1 aattcggcac gagggcaggt gcaggcgcac gcggcgagag cgtatggagc cgagccgtta 61 gcgcgcgccg tcggtgagtc agtccgtccg tccgtccgtc cgtcggggcg ccgcagctcc 121 cgccaggccc agcggccccg gcccctcgtc tccccgcacc cggagccacc cggtggagcg 181 ggccttgccg cggcagccat gtccatgggc ctggagatca cgggcaccgc gctggccgtg 241 ctgggctggc tgggcaccat cgtgtgctgc gcgttgccca tgtggcgcgt gtcggccttc 301 atcggcagca acatcatcac gtcgcagaac atctgggagg gcctgtggat gaactgcgtg 361 gtgcagagca ccggccagat gcagtgcaag gtgtacgact cgctgctggc actgccacag 421 gaccttcagg cggcccgcgc cctcatcgtg gtggccatcc tgctggccgc cttcgggctg 481 ctagtggcgc tggtgggcgc ccagtgcacc aactgcgtgc aggacgacac ggccaaggcc 541 aagatcacca tcgtggcagg cgtgctgttc cttctcgccg ccctgctcac cctcgtgccg 601 gtgtcctggt cggccaacac cattatccgg gacttctaca accccgtggt gcccgaggcg 661 cagaagcgcg agatgggcgc gggcctgtac gtgggctggg cggccgcggc gctgcagctg 721 ctggggggcg cgctgctctg ctgctcgtgt cccccacgcg agaagaagta cacggccacc 781 aaggtcgtct actccgcgcc gcgctccacc ggcccgggag ccagcctggg cacaggctac 841 gaccgcaagg actacgtcta agggacagac gcagggagac cccaccacca ccaccaccac 901 caacaccacc accaccaccg cgagctggag cgcgcaccag gccatccagc gtgcagcctt 961 gcctcggagg ccagcccacc cccagaagcc aggaagcccc cgcgctggac tggggcagct 1021 tccccagcag ccacggcttt gcgggccggg cagtcgactt cggggcccag ggaccaacct 1081 gcatggactg tgaaacctca cccttctgga gcacggggcc tgggtgaccg ccaatacttg 1141 accaccccgt cgagccccat cgggccgctg cccccatgtc gcgctgggca gggaccggca 1201 gccctggaag gggcacttga tatttttcaa taaaagcctc tcgttttagc // LOCUS AB000732 4474 bp DNA PRI 28-JAN-1998 DEFINITION Homo sapiens gene for insulin receptor substrate-2, complete cds. ACCESSION AB000732 NID g2809058 KEYWORDS insulin receptor substrate-2; IRS-2. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ogihara,T., Isobe,T., Ichimura,T., Taoka,M., Funaki,M., Sakoda,H., Onishi,Y., Inukai,K., Anai,M., Fukushima,Y., Kikuchi,M., Yazaki,Y., Oka,Y. and Asano,T. TITLE 14-3-3 protein binds to insulin receptor substrate-1, one of the binding sites of which is in the phosphotyrosine binding domain JOURNAL J. Biol. Chem. 272 (40), 25267-25274 (1997) MEDLINE 97460123 REFERENCE 2 (bases 1 to 4474) AUTHORS Asano,T. TITLE Direct Submission JOURNAL Submitted (27-JAN-1997) to the DDBJ/EMBL/GenBank databases. Tomoichiro Asano, University of Tokyo, 3rd Department of Internal Medicine; 7-3-1 Hongo, Bunkyo-ku, Tokyo 113, Japan (E-mail:asano-tky@umin.u-tokyo.ac.jp, Tel:+81-3-3815-5411, Fax:+81-3-5803-1874) COMMENT Sequence updated (22-Jan-1998). FEATURES Location/Qualifiers source 1..4474 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..3975 /gene="IRS-2" CDS 1..3975 /gene="IRS-2" /codon_start=1 /product="insulin receptor substrate-2" /db_xref="PID:d1025417" /db_xref="PID:g2809059" /translation="MASPPRHGPPGPASGDGPNLNNNNNNNNHSVRKCGYLRKQKHGH KRFFVLRGPGAGGDKATAGGGSAPQPPRLEYYESEKNWRSKAGAPKRVIALDCCLNIN KRADPKHKYLIALYTKDEYFAVAAENEQEQEGWYRALTDLVSEGRAAAGDAPPAAAPA ASCSASLPGAVGGSAGAAGAEDSYGLVAPATAAYREVWQVNLKPKGLGQSKNLTGVYR LCLSARTIGFVKLNCEQPSVTLQLMNIRRCGHSDSFFFIEVGRSAVTGPGELWMQADD SVVAQNIHETILEAMKALKELFEFRPRSKSQSSGSSATHPISVPGARRHHHLVNLPPS QTGLVRRSRTDSLAATPPAAKCSSCRVRTASEGDGGAAAGAAAAGARPVSVAGSPLSP GPVRAPLSRSHTLIGGCRAAGTKWHCFPAGGGLQHSRSMSMPVEHLPPAATSPGSLSS SSDHGWGSYPPPPGPHPLLPHPLHHGPGQRPSSGSASASGSPSDPGFMSLDEYGSSPG DLRAFCSHRSNTPESIAETPPARDGGGGGEFYGYMTMDRPLSHCGRSYRRVSGDAAQD LDRGLRKRTYSLTTPARQRPVPQPSSASLDEYTLMRATFSGSAGRLCPSCPASSPKVA YHPYPEDYGDIEIGSHRSSSSNLGADDGYMPMTPGAALAGSGSGSCRSDDYMPMSPAS VSAPKQILQPRAAAAAAAAVPFAGPAGPAPTFAAGRTFPASGGGYKASSPAESSPEDS GYMRMWCGSKLSMEHADGKLLPNGDYLNVSPSDAVTTGTPPDFFSAALHPGGEPLRGV PGCCYSSLPRSYKAPYTCGGDSDQYVLMSSPVGRILEEERLEPQATPGPTQAASAFGA GPTQPPHPVVPSPVRPSGGRPEGFLGQRGRAVRPTRLSLEGLPSLPSMHEYPLPPEPK SPGEYINIDFGEPGARLSPPAPPLLASAASSSSLLSASSPALSLGSGTPGTSSDSRQR SPLSDYMNLDFSSPKSPKPGAPSGHPVGSLDGLLSPEASSPYPPLPPRPSASPSSSLQ PPPPPPAPGELYRLPPASAVATAQGPGAASSLSSDTGDNGDYTEMAFGVAATPPQPIA APPKPEAARVASPTSGVKRLSLMEQVSGVEAFLQASQPPDPHRGAKVIRADPQGGRRR HSSETFSSTTTVTPVSPSFAHNPKRHNSASVENVSLRKSSEGGVGVGPGGGDEPPTSP RQLQPAPPLAPQGRPWTPGQPGGLVGCPGSGGSPMRRETSAGFQNGLKYIAIDVREEP GLPPQPQPPPPPLPQPGDKSSWGRTRSLGGLISAVGVGSTRGGCGGPGPGAPAPCPTT YAQH" BASE COUNT 720 a 1659 c 1408 g 687 t ORIGIN 1 atggcgagcc cgccgcggca cgggccgccc gggccggcga gcggagacgg ccccaacctc 61 aacaacaaca acaacaacaa caaccacagc gtgcgcaagt gcggctacct gcgcaagcag 121 aagcatggcc acaagcgctt cttcgtgctg cgcggacccg gcgcgggcgg cgacaaggcc 181 acggcgggcg gggggtcggc gccgcaaccg ccgcggctcg agtactacga aagcgaaaaa 241 aattggcgga gcaaggcagg cgcgccgaaa cgggtgatcg ctctcgactg ctgcctgaac 301 atcaacaagc gcgccgaccc caagcacaag tacctgatcg ccctctacac caaggacgag 361 tacttcgccg tggccgccga gaacgagcag gagcaggagg gctggtaccg cgcgctcacc 421 gacctggtca gcgagggccg cgcggccgcc ggagacgcgc cccccgccgc cgcgcccgcc 481 gcgtcctgca gcgcctccct gcccggcgcc gtgggcggtt ctgccggcgc cgccggggcc 541 gaggacagct acgggctggt ggctcccgcc acggccgcct accgtgaggt gtggcaggtg 601 aacctgaagc ccaagggtct gggccagagc aagaacctga cgggggtgta ccgtctgtgc 661 ctgtctgcgc gcaccatcgg cttcgtgaag ctcaactgcg agcagccgtc ggtgacgctg 721 cagctcatga acatccgccg ctgcggccac tcggacagct tcttcttcat cgaggtgggc 781 cgctcggccg tcacaggccc cggcgagctg tggatgcagg cggacgactc ggtggtggcg 841 cagaacatcc acgagaccat cctggaggcc atgaaggcgc tcaaggagct cttcgagttc 901 cggccgcgca gtaagagcca atcgtcgggg tcgtcggcca cgcaccccat cagcgtcccc 961 ggcgcgcgcc gccaccacca cctggtcaac ctgcccccca gccagacggg cctggtgcgc 1021 cgctcgcgca ccgacagcct ggccgccacc ccgccggcgg ccaagtgcag ctcgtgccgg 1081 gtgcgcaccg ccagcgaggg cgacggcggc gcggcggcgg gagcggcggc cgcgggcgcc 1141 aggccggtgt cggtggctgg gagccccctg agccccgggc cggtgcgcgc gcccctgagc 1201 cgctcgcaca ccctgatcgg cggctgccgg gccgcgggaa caaagtggca ttgcttcccg 1261 gcagggggcg gattgcaaca cagccgttcg atgtccatgc ccgtggagca tttgccgcca 1321 gccgccacca gcccgggttc cttgtcttcc agcagcgacc acggttgggg ttcttacccg 1381 ccgccgcccg gcccgcaccc gcttttgccg catccgttgc accacggccc cggccagcgg 1441 ccttccagcg gcagcgcttc cgcttcgggc tcccccagcg accccggttt catgtccctg 1501 gacgagtacg gctccagccc aggcgacctg cgcgccttct gcagccaccg aagcaacacg 1561 cccgagtcca tcgcggagac gcccccggcc cgagacggcg gcggcggcgg tgagttttac 1621 gggtacatga ccatggacag gcccctgagc cactgtggcc gctcctaccg ccgggtctcg 1681 ggggacgcgg cccaggacct ggaccgaggg ctgcgcaaga ggacctactc cctgaccacg 1741 ccagcccggc agcggccggt gccccagccc tcctctgcct cgctggatga atacaccctg 1801 atgcgggcca ccttctcggg cagcgcgggc cgcctctgcc cgtcctgccc cgcgtcctct 1861 cccaaggtgg cctaccaccc ctacccagag gactacggag acatcgagat cggctcccac 1921 aggagctcca gcagcaacct gggggcagac gacggctaca tgcccatgac gcccggcgcg 1981 gcccttgcgg gcagtgggag cggcagctgc aggagcgacg actacatgcc catgagcccc 2041 gccagcgtgt ccgcccccaa gcagattttg cagcccaggg ccgccgccgc cgccgccgcc 2101 gccgtgcctt ttgcggggcc tgcggggcca gcacccacct ttgcggcggg caggacattc 2161 ccggcgagtg ggggcggcta caaggccagc tcgcccgccg agagctcccc cgaggacagt 2221 gggtacatgc gcatgtggtg cggttccaag ctgtccatgg agcatgcaga tggcaagctg 2281 ctgcccaacg gggactacct caacgtgtcc cccagcgacg cggtcaccac gggcaccccg 2341 cccgacttct tctccgcagc cctgcacccc ggcggggagc cgctcagggg cgttcccggc 2401 tgctgctaca gctccttgcc ccgctcctac aaggccccct acacctgtgg cggggacagc 2461 gaccagtacg tgctcatgag ctcccccgtg gggcgcatcc tggaggagga gcgtctggag 2521 cctcaggcca ccccagggcc cacccaggcg gccagcgcct tcggggccgg ccccacgcag 2581 ccccctcacc ctgtagtgcc ttcgcccgtg cggcctagcg gcggccgccc ggagggcttc 2641 ttgggccagc gcggccgggc ggtgaggccc acgcgcctgt ccctggaggg gctgcccagc 2701 ctgcccagca tgcacgagta cccactgcca ccggagccca agagccccgg cgagtacatc 2761 aacatcgact ttggcgagcc cggggcccgc ctgtcgccgc ccgcgcctcc cctgctggcg 2821 tcggcggcct cgtcctcatc gctattgtcc gccagcagcc cggccttgtc gttgggctca 2881 ggcaccccgg gcaccagcag cgacagccgg cagcggtctc cgctctccga ctacatgaac 2941 ctcgacttca gctcccccaa gtctcctaag ccgggcgccc cgagcggcca ccccgtgggc 3001 tccttggacg gcctcctgtc ccccgaggcc tcctccccgt atccgccgtt gcccccgcgt 3061 ccgtccgcgt ccccgtcgtc gtctctgcag ccgccgccac cgccgccggc cccgggggag 3121 ctgtaccgcc tgccccccgc ctcggccgtt gccaccgccc agggcccggg cgccgcctca 3181 tcgttgtcct cggacaccgg ggacaatggt gactacaccg agatggcttt tggtgtggcc 3241 gccaccccgc cgcaacctat cgcggccccc ccgaagccag aagctgcccg cgtggccagc 3301 ccgacgtcgg gcgtgaagag gctgagcctc atggagcagg tgtcgggagt cgaggccttc 3361 ctgcaggcca gccagccccc ggacccccac cgcggcgcca aggtcatccg cgcagacccg 3421 caggggggcc gccgccgcca cagttccgag accttctcct ccaccacgac ggtcaccccc 3481 gtgtccccgt ccttcgccca caaccccaag cgccacaact cggcctccgt ggaaaatgtc 3541 tctctcagga aaagcagcga gggcggcgtg ggtgtcggcc ctggaggggg cgacgagccg 3601 cccacctccc cacgacagtt gcagccggcg ccccctttgg caccgcaggg ccggccgtgg 3661 accccgggtc agcccggggg cttggtcggt tgtcctggga gcggtggatc gcccatgcgc 3721 agagagacct ctgccggttt ccagaatggt ctcaagtaca tcgccatcga cgtgagggag 3781 gagcccgggc tgccacccca gccgcagccg ccgccgccgc cgcttcctca gccgggagac 3841 aagagctcct ggggccggac ccgaagcctc gggggtctca tcagcgctgt gggcgtcggc 3901 agcacccgcg gcgggtgcgg ggggccgggt cccggtgccc ctgccccctg cccaacaacc 3961 tacgcccagc attgacttct tgtcccacca cttgaaggag gccaccattg tgaaagagtg 4021 aagatctgtc tggctttatc accaggatgt cacatgtcag agaatatcat taaaagaaga 4081 cgctcagcac tgtttcagcc cgaagctgct tgcagttttc ttttggatct gagcaatgac 4141 tgtgtttgga aacatctgtg gactctgtta gatgaggcac caacaaggca aggtcacctg 4201 cctctttccc ttgttcccgg atggggcatt catcattgtg ctgtttgcgt tttgttttgt 4261 tttgttttaa caaaattagc tgaagaagtt attctcaaga aaattggatg ttttcattgg 4321 ccttcttaaa ttgtggccag tgtcttttaa tttcttcttc ttttcctttt ggcaaagcag 4381 atataaccct cagcatgcta ggagagtgca cccgtacact atggaagtgg taaaatctgg 4441 tatttactgg cttacactca aaacgaccac agtc // LOCUS AB000734 1216 bp mRNA PRI 18-NOV-1997 DEFINITION Homo sapiens mRNA for TIP3, complete cds. ACCESSION AB000734 NID g2627028 KEYWORDS tip3; TIP3. SOURCE Homo sapiens blood B-lymphocytes cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ohya,Ki., Kajigaya,S., Yamashita,Y., Miyazato,A., Hatake,K., Miura,Y., Ikeda,U., Shimada,K., Ozawa,K. and Mano,H. TITLE SOCS-1/JAB/SSI-1 can bind to and suppress tec protein-tyrosine kinase JOURNAL J. Biol. Chem. 272 (43), 27178-27182 (1997) MEDLINE 98001695 REFERENCE 2 (bases 1 to 1216) AUTHORS Mano,H. TITLE Direct Submission JOURNAL Submitted (27-JAN-1997) to the DDBJ/EMBL/GenBank databases. Hiroyuki Mano, Jichi Medical School, Department of Molecular Biology; 3311-1 Yakushiji, Minamikawachi-Machi, Kawachi-gun, Tochigi 329-04, Japan (E-mail:hmano@jichi.ac.jp, Tel:0285-44-2111(ex.3482), Fax:0285-44-8675) FEATURES Location/Qualifiers source 1..1216 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="B-lymphocytes" /tissue_type="blood" gene 155..790 /gene="tip3" CDS 155..790 /gene="tip3" /note="similar to U88325:SOCS-1, AB000676:JAB, AB000710:SSI-1" /codon_start=1 /product="TIP3" /db_xref="PID:g2627029" /translation="MVAHNQVAADNAVSTAAEPRRRPEPSSSSSSSPAAPARPRPCPA VPAPAPGDTHFRTFRSHADYRRITRASALLDACGFYWGPLSVHGAHERLRAEPVGTFL VRDSRQRNCFFALSVKMASGPTSIRVHFQAGRFHLDGSRESFDCLFELLEHYVAAPRR MLGAPLRQRRVRPLQELCRQRIVATVGRENLARIPLNPVLRDYLSSFPFQI" polyA_site 1216 /note="12 A nucleotides" BASE COUNT 172 a 449 c 355 g 240 t ORIGIN 1 ggcagctgca cggctcctgg ccccggagca tgcgcgagag ccgccccgga gcgccccgga 61 gccccccgcc gtcccgcccg cggcgtcccg cgccccgccg ccagcgcacc cccggacgct 121 atggcccacc cctccggctg gccccttctg taggatggta gcacacaacc aggtggcagc 181 cgacaatgca gtctccacag cagcagagcc ccgacggcgg ccagaacctt cctcctcttc 241 ctcctcctcg cccgcggccc ccgcgcgccc gcggccgtgc cccgcggtcc cggccccggc 301 ccccggcgac acgcacttcc gcacattccg ttcgcacgcc gattaccggc gcatcacgcg 361 cgccagcgcg ctcctggacg cctgcggatt ctactggggg cccctgagcg tgcacggggc 421 gcacgagcgg ctgcgcgccg agcccgtggg caccttcctg gtgcgcgaca gccgccagcg 481 gaactgcttt ttcgccctta gcgtgaagat ggcctcggga cccacgagca tccgcgtgca 541 ctttcaggcc ggccgctttc acctggatgg cagccgcgag agcttcgact gcctcttcga 601 gctgctggag cactacgtgg cggcgccgcg ccgcatgctg ggggccccgc tgcgccagcg 661 ccgcgtgcgg ccgctgcagg agctgtgccg ccagcgcatc gtggccaccg tgggccgcga 721 gaacctggct cgcatccccc tcaaccccgt cctccgcgac tacctgagct ccttcccctt 781 ccagatttga ccggcagcgc ccgccgtgca cgcagcatta actgggatgc cgtgttattt 841 tgttattact tgcctggaac catgtgggta ccctccccgg cctgggttgg agggagcgga 901 tgggtgtagg ggcgaggcgc ctcccgccct cggctggaga cgaggccgca gaccccttct 961 cacctcttga gggggtcctc cccctcctgg tgctccctct gggtccccct ggttgttgta 1021 gcagcttaac tgtatctgga gccaggacct gaactcgcac ctcctacctc ttcatgttta 1081 catataccca gtatctttgc acaaaccagg ggttggggga gggtctctgg ctttattttt 1141 ctgctgtgca gaatcctatt ttatattttt taaagtcagt ttaggtaata aactttatta 1201 tgaaagtttt tttttt // LOCUS AB000812 2396 bp mRNA PRI 13-MAY-1997 DEFINITION Human mRNA for BMAL1b, complete cds. ACCESSION AB000812 NID g2094734 KEYWORDS BMAL1b. SOURCE Homo sapiens male brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2396) AUTHORS Ikeda,M. TITLE Direct Submission JOURNAL Submitted (30-JAN-1997) to the DDBJ/EMBL/GenBank databases. Masaaki Ikeda, Saitama Medical School, Department of Physiology; 38 Morohongo, Iruma-gun, Saitama 350-04, Japan (E-mail:mikeda@saitama-med.ac.jp, Tel:+81-492-76-1150, Fax:+81-492-95-5573) REFERENCE 2 (bases 1 to 2396) AUTHORS Ikeda,M. and Nomura,M. TITLE cDNA Cloning and Tissue-specific Expression of a Novel Basic Helix-Loop -Helix/PAS protein (BMAL1) and Identification of Its Alternatively Spliced Varia nts with Alternative Translation Initiation Site Usage JOURNAL Unpublished (1997) REFERENCE 3 (sites) AUTHORS Ikeda,M. and Nomura,M. TITLE cDNA cloning and tissue-specific expression of a novel basic helix-loop-helix/PAS protein (BMAL1) and identification of alternatively spliced variants with alternative translation initiation site usage JOURNAL Biochem. Biophys. Res. Commun. 233 (1), 258-264 (1997) MEDLINE 97289529 FEATURES Location/Qualifiers source 1..2396 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /tissue_type="brain" CDS 41..1921 /codon_start=1 /product="BMAL1b" /db_xref="PID:d1020725" /db_xref="PID:g2094735" /translation="MADQRMDISSTISDFMSPGPTDLLSSSLGTSGVDCNRKRKGSST DYQESMDTDKDDPHGRLEYTEHQGRIKNAREAHSQIEKRRRDKMNSFIDELASLVPTC NAMSRKLDKLTVLRMAVQHMRTLRGATNPYTEANYKPTFLSDDELKHLILRAADGFLF VVGCDRGKILFVSESVFKILNYSQNDLIGQSLFDYLHPKDIAKVKEQLSSSDTAPRER LIDAKTGLPVKTDITPGPSRLCSGARRSFFCRMKCNRPSVKVEDKDFPSTCSKKKADR KSFCTIHSTGYLKSWPPTKMGLDEDNEPDNEGCNLSCLVAIGRLHSHVVPQPVNGEIR VKSMEYVSRHAIDGKFVFVDQRATAILAYLPQELLGTSCYEYFHQDDIGHLAECHRQV LQTREKITTNCYKFKIKDGSFITLRSRWFSFMNPWTKEVEYIVSTNTVVLANVLEGGD PTFPQLTASPHSMDSMLPSGEGGPKRTHPTVPGIPGGTRAGAGKIGRMIAEEIMEIHR IRGSSPSSCGSSPLNITSTPPPDASSPGGKKILNGGTPDIPSSGLLSGQAQENPGYPY SDSSSILGENPHIGIDMIDNDQGSSSPSNDEAAMAVIMSLLEADAGLGGPVDFSDLPW PL" BASE COUNT 710 a 527 c 538 g 621 t ORIGIN 1 ctggatctgg ggtgtaagaa ctgtgacttc agatcatcca atggcagacc agagaatgga 61 catttcttca accatcagtg atttcatgtc cccgggcccc accgacctgc tttccagctc 121 tcttggtacc agtggtgtgg attgcaaccg caaacggaaa ggcagctcca ctgactacca 181 agaaagcatg gacacagaca aagatgaccc tcatggaagg ttagaatata cagaacacca 241 aggaaggata aaaaatgcaa gggaagctca cagtcagatt gaaaagcggc gtcgggataa 301 aatgaacagt tttatagatg aattggcttc tttggtacca acatgcaacg caatgtccag 361 gaaattagat aaacttactg tgctaaggat ggctgttcag cacatgagaa cattaagagg 421 tgccaccaat ccatacacag aagcaaacta caaaccaact tttctatcag acgatgaatt 481 gaaacacctc attctcaggg cagcagatgg atttttgttt gtcgtaggat gtgaccgagg 541 gaagatactc tttgtctcag agtctgtctt caagatcctc aactacagcc agaatgatct 601 gattggtcag agtttgtttg actacctgca tcctaaagat attgccaaag tcaaggagca 661 gctctcctcc tctgacaccg caccccggga gcggctcata gatgcaaaaa ctggacttcc 721 agttaaaaca gatataaccc ctgggccatc tcgattatgt tctggagcac gacgttcttt 781 cttctgtagg atgaagtgta acaggccttc agtaaaggtt gaagacaagg acttcccctc 841 tacctgctca aagaaaaaag cagatcgaaa aagcttctgc acaatccaca gcacaggcta 901 tttgaaaagc tggccaccca caaagatggg gctggatgaa gacaacgaac cagacaatga 961 ggggtgtaac ctcagctgcc tcgtcgcaat tggacgactg cattctcatg tagttccaca 1021 accagtgaac ggggaaatca gggtgaaatc tatggaatat gtttctcggc acgcgataga 1081 tggcaagttt gtttttgtag accagagggc aacagctatt ttggcatatt taccacaaga 1141 acttctaggc acatcgtgtt atgaatattt tcaccaagat gacataggac atcttgcaga 1201 atgtcatagg caagttttac agacgagaga aaaaattaca actaattgct ataaatttaa 1261 aatcaaagat ggttctttta tcacactacg gagtcgatgg ttcagtttca tgaacccttg 1321 gaccaaggaa gtagaatata ttgtctcaac taacactgtt gttttagcca acgtcctgga 1381 aggcggggac ccaaccttcc cacagctcac agcatccccc cacagcatgg acagcatgct 1441 gccctctgga gaaggtggcc caaagaggac ccaccccact gttccaggga ttccaggggg 1501 aacccgggct ggggcaggaa aaataggccg aatgattgct gaggaaatca tggaaatcca 1561 caggataaga gggtcatcgc cttctagctg tggctccagc ccattgaaca tcacgagtac 1621 gcctccccct gatgcctctt ctccaggagg caagaagatt ttaaatggag ggactccaga 1681 cattccttcc agtggcctac tatcaggcca ggctcaggag aacccaggtt atccatattc 1741 tgatagttct tctattcttg gtgagaaccc ccacataggt atagacatga ttgacaacga 1801 ccaaggatca agtagtccca gtaatgatga ggcagcaatg gctgtcatca tgagcctctt 1861 ggaagcagat gctggactgg gtggccctgt tgactttagt gacttgccat ggccgctgta 1921 aacactacat gttgctttgg caacagctat agtatcaaag tgcattactg gtggagtttt 1981 acagtctgtg aagcttactg gataaggaga gaatagcttt tatgtactga cttcataaaa 2041 gccatctcag agccattgat acaagtcaat cttactatat gtaacttcag acaaagtgga 2101 actaagcctg ctccagtgtt tcctcatcat tgattattgg gccagctgtg gatagcttgc 2161 attaattgta tattttggat tctgtttgtg ttgaattttt taatcattgt gcacagaagc 2221 atcattggta gcttttatat gcaaatggtc atttcagatg tatggtgttt ttacactaca 2281 aagaagtccc ccatgtggat atttcttata ctaattgtat cataaagccg tttattcttc 2341 cttgtaagaa tcctttacta taaatatggg ttaaagtata atgtactaga cagtta // LOCUS AB000824 1854 bp mRNA PRI 03-FEB-1998 DEFINITION Homo sapiens mRNA for trehalase, complete cds. ACCESSION AB000824 NID g2789460 KEYWORDS trehalase. SOURCE Homo sapiens cell_line:kidney cDNA to mRNA, clone_lib:lambda gt11. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ishihara,R., Taketani,S., Sasai-Takedatsu,M., Kino,M., Tokunaga,R. and Kobayashi,Y. TITLE Molecular cloning, sequencing and expression of cDNA encoding human trehalase JOURNAL Gene 202 (1-2), 69-74 (1997) MEDLINE 98087419 REFERENCE 2 (bases 1 to 1854) AUTHORS Ishihara,R. TITLE Direct Submission JOURNAL Submitted (01-FEB-1997) to the DDBJ/EMBL/GenBank databases. Reiko Ishihara, Kansai Medical University, Department of Pediatrics; 10-15 Fumizonocho, Moriguchi, Osaka 570, Japan (E-mail:taketani@takii.kmu.ac.jp, Tel:06-992-1001(ex.2504), Fax:06-993-5101) FEATURES Location/Qualifiers source 1..1854 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="kidney" /clone_lib="lambda gt11" CDS 56..1807 /codon_start=1 /product="trehalase" /db_xref="PID:d1025293" /db_xref="PID:g2789461" /translation="MPGRTWELCLLLLLGLGLGSQEALPPPCESEIYCHGELLNQVQM AKLYQDDKQFVDMPLSIAPEQVLQTFTELSRDHNHSIPREQLQAFVHEHFQAKGQELQ PWTPADWKDSPQFLQKISDAKLRAWAGQLHQLWKKLGKKMKPEVLSHPERFSLIYSEH PFIVPGGRFVEFYYWDSYWVMEGLLLSEMAETVKGMLQNFLDLVKTYGHVPNGGRVYY LQRSQPPLLTLMMDCYLTHTNDTAFLQENIETLALELDFWTKNRTVSVSLEGKNYLLN RYYVPYGGPRPESYSKDVELADTLPEGDREALWAELKAGAESGWDFSSRWLIGGPNPN SLSGIRTSKLVPVDLNAFLCQAEELMSNFYSRLGNDSQATKYRILRSQRLAALNTVLW DEQTGAWFDYDLEKKKKNREFYPSNLTPLWAGCFSDPGVADKALKYLEDNRILTYQYG IPTSLQKTGQQWDFPNAWAPLQDLVIRGLAKAPLRRAQEVAFQLAQNWIRTNFDVYSQ KSAMYEKYDVSNGGQPGGGGEYEVQEGFGWDEGVVLMLLDRYGDRLTSGAKLAFLEPH CLAATLLPSLLLSLLPW" BASE COUNT 407 a 535 c 527 g 385 t ORIGIN 1 gaattccggg ccgaaggtgc ctgggcttgc tcattcagtc acagtcacag ccaccatgcc 61 agggaggacc tgggagctgt gcctgctact gctgctgggg ctgggactgg ggtcccagga 121 ggccctaccc ccaccctgtg agagtgagat ttactgccac ggggagctcc taaaccaagt 181 tcaaatggcc aagctctacc aggatgacaa gcagtttgtg gacatgccac tgtctatagc 241 tccagaacaa gtcctgcaga ccttcactga gctgtccagg gaccacaatc acagcatccc 301 cagggagcag ctgcaggcgt ttgtccacga acacttccag gccaaggggc aggagctgca 361 gccctggacc cctgcagact ggaaagacag cccccagttc ctgcagaaga tttcagatgc 421 caaactgcgt gcctgggcag ggcagctgca tcagctctgg aagaagctgg ggaagaagat 481 gaagccagag gttctcagcc accctgagcg gttctctctc atatactcag aacatccttt 541 cattgtgcct ggcggtcgct ttgttgagtt ctactactgg gactcctact gggtcatgga 601 gggtctgctc ctctcagaga tggctgagac ggtgaagggc atgctgcaga acttcttgga 661 cctggtgaaa acctatgggc atgtccccaa tggtgggcgc gtgtactact tgcagcggag 721 ccagccccca ctcttgaccc tcatgatgga ttgctacttg actcacacca atgacaccgc 781 ctttctacag gaaaacattg aaacactagc cttggaattg gacttttgga ccaagaacag 841 gactgtctct gtgagcttgg agggaaagaa ctacctcctg aatcgctatt atgtccctta 901 tgggggaccc aggcctgagt cctacagcaa agatgtggag ttggctgaca ccttgccaga 961 aggagaccgg gaggctctgt gggctgagct caaggctggg gctgagtctg gctgggactt 1021 ctcttcacgc tggctcattg gaggcccaaa ccccaactcg cttagcggca tccgaacaag 1081 caaactggtg cctgttgacc tgaatgcctt cctatgccaa gcagaggagc tgatgagcaa 1141 cttctattcc aggctgggga acgactccca ggccacgaag tacagaatcc tgcggtcgca 1201 gcgcttggcc gccctgaaca cagtcctgtg ggatgagcag accggagcct ggttcgatta 1261 cgaccttgag aagaagaaga aaaaccggga gttttaccca tccaacctca ctccactctg 1321 ggccgggtgt ttctctgacc ctggcgtggc ggacaaggct ctgaaatacc tggaggacaa 1381 ccggatcctg acttaccagt atgggatccc gacctctctc cagaagacag gccagcagtg 1441 ggatttcccc aatgcctggg cccccctgca ggacttggtc atcagaggcc tggccaaggc 1501 acctttacgt cgggcccagg aagtggcttt ccagctggct cagaattgga tccgaaccaa 1561 ttttgatgtc tactcgcaga agtcagccat gtatgagaag tatgacgtca gcaacggtgg 1621 acagcccggt gggggaggag aatatgaagt tcaggaggga tttggctggg acgaaggtgt 1681 ggtcctgatg ctgctggacc gctatggtga ccggctgacc tcaggggcca agctggcttt 1741 cctggagccc cactgcctgg cggccaccct tctgcccagc ctcctgctca gcctcctgcc 1801 atggtgacag ccctcctctc ctcacctggc cccagctcct gccccccgga attc // LOCUS AB000887 687 bp mRNA PRI 05-JUN-1997 DEFINITION Human mRNA for EBI1-ligand chemokine, complete cds. ACCESSION AB000887 NID g2189952 KEYWORDS EBI1-ligand chemokine; ELC. SOURCE Homo sapiens fetal tissue_lib:lung cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 687) AUTHORS Yoshida,R., Imai,T., Hieshima,K., Kusuda,J., Baba,M., Kitaura,M., Nishimura,M., Kakizaki,M., Nomiyama,H. and Yoshie,O. TITLE Direct Submission JOURNAL Submitted (05-FEB-1997) to the DDBJ/EMBL/GenBank databases. Hisayuki Nomiyama, Kumamoto University Medical School, Department of Biochemistry; Honjo 2-2-1, Kumamoto, Kumamoto 860, Japan (E-mail:nomiyama@gpo.kumamoto-u.ac.jp, Tel:+81-96-373-5063) REFERENCE 2 (sites) AUTHORS Yoshida,R., Imai,T., Hieshima,K., Kusuda,J., Baba,M., Kitaura,M., Nishimura,M., Kakizaki,M., Nomiyama,H. and Yoshie,O. TITLE Molecular cloning of a novel human CC chemokine EBI1-ligand chemokine that is a specific functional ligand for EBI1, CCR7 JOURNAL J. Biol. Chem. 272 (21), 13803-13809 (1997) MEDLINE 97298088 FEATURES Location/Qualifiers source 1..687 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_lib="lung" gene 139..435 /gene="ELC" CDS 139..435 /gene="ELC" /note="CC chemokine" /codon_start=1 /product="EBI1-ligand chemokine" /db_xref="PID:d1021215" /db_xref="PID:g2189953" /translation="MALLLALSLLVLWTSPAPTLSGTNDAEDCCLSVTQKPIPGYIVR NFHYLLIKDGCRVPAVVFTTLRGRQLCAPPDQPWVERIIQRLQRTSAKMKRRSS" mat_peptide 202..432 /gene="ELC" /product="EBI1-ligand chemokine" polyA_signal 657..662 BASE COUNT 154 a 223 c 173 g 137 t ORIGIN 1 cattcccagc ctcacatcac tcacaccttg catttcaccc ctgcatccca gtcgccctgc 61 agcctcacac agatcctgca cacacccaga cagctggcgc tcacacattc accgttggcc 121 tgcctctgtt caccctccat ggccctgcta ctggccctca gcctgctggt tctctggact 181 tccccagccc caactctgag tggcaccaat gatgctgaag actgctgcct gtctgtgacc 241 cagaaaccca tccctgggta catcgtgagg aacttccact accttctcat caaggatggc 301 tgcagggtgc ctgctgtagt gttcaccaca ctgaggggcc gccagctctg tgcaccccca 361 gaccagccct gggtagaacg catcatccag agactgcaga ggacctcagc caagatgaag 421 cgccgcagca gttaacctat gaccgtgcag agggagcccg gagtccgagt caagcattgt 481 gaattattac ctaacctggg gaaccgagga ccagaaggaa ggaccaggct tccagctcct 541 ctgcaccaga cctgaccagc caggacaggg cctggggtgt gtgtgagtgt gagtgtgagc 601 gagagggtga gtgtggtcag agtaaagctg ctccaccccc agattgcaat gctaccaata 661 aagccgcctg gtgtttacaa ctaattg // LOCUS AB000888 937 bp mRNA PRI 06-OCT-1997 DEFINITION Homo sapiens mRNA for phosphatidic acid phosphatase 2a, complete cds. ACCESSION AB000888 NID g2467297 KEYWORDS phosphatidic acid phosphatase 2a. SOURCE Homo sapiens cell_line:HepG2 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Kai,M., Wada,I., Imai,Si., Sakane,F. and Kanoh,H. TITLE Cloning and characterization of two human isozymes of Mg2+-independent phosphatidic acid phosphatase JOURNAL J. Biol. Chem. 272 (39), 24572-24578 (1997) MEDLINE 97450990 REFERENCE 2 (bases 1 to 937) AUTHORS Kai,M. TITLE Direct Submission JOURNAL Submitted (05-FEB-1997) to the DDBJ/EMBL/GenBank databases. Masahiro Kai, Sapporo Medical University, Department of Biochemistry; South-1, West-17, Chuo-ku, Sapporo 060, Japan (E-mail:kai@sapmed.ac.jp, Tel:011-611-2111) FEATURES Location/Qualifiers source 1..937 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HepG2" CDS 48..902 /note="similar to DDBJ Accession Number D84376 : mouse PAP-2" /codon_start=1 /product="phosphatidic acid phosphatase 2a" /db_xref="PID:d1023461" /db_xref="PID:g2467298" /translation="MFDKTRLPYVALDVLCVLLAGLPFAILTSRHTPFQRGVFCNDES IKYPYKEDTIPYALLGGIIIPFSIIVIILGETLSVYCNLLHSNSFIRNNYIATIYKAI GTFLFGAAASQSLTDIAKYSIGRLRPHFLDVCDPDWSKINCSDGYIEYYICRGNAERV KEGRLSFYSGHSSFSMYCMLFVALYLQARMKGDWARLLRPTLQFGLVAVSIYVGLSRV SDYKHHWSDVLTGLIQGALVAILVAVYVSDFFKERTSFKERKEEDSHTTLHETPTTGN HYPSNHQP" BASE COUNT 240 a 218 c 209 g 270 t ORIGIN 1 accgcagctc agtccatcgc ccttgccggg cagcccgggc agagaccatg ttcgacaaga 61 cgcggctgcc gtacgtggcc ctcgatgtgc tctgcgtgtt gctggctgga ttgccttttg 121 caattcttac ttcaaggcat acccccttcc aacgaggagt attctgtaat gatgagtcca 181 tcaagtaccc ttacaaagaa gacaccatac cttatgcgtt attaggtgga ataatcattc 241 cattcagtat tatcgttatt attcttggag aaaccctgtc tgtttactgt aaccttttgc 301 actcaaattc ctttatcagg aataactaca tagccactat ttacaaagcc attggaacct 361 ttttatttgg tgcagctgct agtcagtccc tgactgacat tgccaagtat tcaataggca 421 gactgcggcc tcacttcttg gatgtttgtg atccagattg gtcaaaaatc aactgcagcg 481 atggttacat tgaatactac atatgtcgag ggaatgcaga aagagttaag gaaggcaggt 541 tgtccttcta ttcaggccac tcttcgtttt ccatgtactg catgctgttt gtggcacttt 601 atcttcaagc caggatgaag ggagactggg caagactctt acgccccaca ctgcaatttg 661 gtcttgttgc cgtatccatt tatgtgggcc tttctcgagt ttctgattat aaacaccact 721 ggagcgatgt gttgactgga ctcattcagg gagctctggt tgcaatatta gttgctgtat 781 atgtatcgga tttcttcaaa gaaagaactt cttttaaaga aagaaaagag gaggactctc 841 atacaactct gcatgaaaca ccaacaactg ggaatcacta tccgagcaat caccagcctt 901 gaaaggcagc agggtgccca ggtgaagctg gcctgtt // LOCUS AB000889 1024 bp mRNA PRI 06-OCT-1997 DEFINITION Homo sapiens mRNA for phosphatidic acid phosphatase 2b, complete cds. ACCESSION AB000889 NID g2467299 KEYWORDS phosphatidic acid phosphatase 2b. SOURCE Homo sapiens cell_line:HepG2 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Kai,M., Wada,I., Imai,Si., Sakane,F. and Kanoh,H. TITLE Cloning and characterization of two human isozymes of Mg2+-independent phosphatidic acid phosphatase JOURNAL J. Biol. Chem. 272 (39), 24572-24578 (1997) MEDLINE 97450990 REFERENCE 2 (bases 1 to 1024) AUTHORS Kai,M. TITLE Direct Submission JOURNAL Submitted (05-FEB-1997) to the DDBJ/EMBL/GenBank databases. Masahiro Kai, Sapporo Medical University, Department of Biochemistry; South-1, West-17, Chuo-ku, Sapporo 060, Japan (E-mail:kai@sapmed.ac.jp, Tel:011-611-2111) FEATURES Location/Qualifiers source 1..1024 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HepG2" CDS 7..942 /codon_start=1 /product="phosphatidic acid phosphatase 2b" /db_xref="PID:d1023462" /db_xref="PID:g2467300" /translation="MQNYKYDKAIVPESKNGGSPALNNNPRRSGSKRVLLICLDLFCL FMAGLPFLIIETSTIKPYHRGFYCNDESIKYPLKTGETINDAVLCAVGIVIAILAIIT GEFYRIYYLKKSRSTIQNPYVAALYKQVGCFLFGCAISQSFTDIAKVSIGRLRPHFLS VCNPDFSQINCSEGYIQNYRCRGDDSKVQEARKSFFSGHASFSMYTMLYLVLYLQARF TWRGARLLRPLLQFTLIMMAFYTGLSRVSDHKHHPSDVLAGFAQGALVACCIVFFVSD LFKTKTTLSLPAPAIRKEILSPVDIIDRNNHHNMM" BASE COUNT 235 a 308 c 240 g 241 t ORIGIN 1 agcgccatgc aaaactacaa gtacgacaaa gcgatcgtcc cggagagcaa gaacggcggc 61 agcccggcgc tcaacaacaa cccgaggagg agcggcagca agcgggtgct gctcatctgc 121 ctcgacctct tctgcctctt catggcgggc ctccccttcc tcatcatcga gacaagcacc 181 atcaagcctt accaccgagg gttttactgc aatgatgaga gcatcaagta cccactgaaa 241 actggtgaga caataaatga cgctgtgctc tgtgccgtgg ggatcgtcat tgccatcctc 301 gcgatcatca cgggggaatt ctaccggatc tattacctga agaagtcgcg gtcgacgatt 361 cagaacccct acgtggcagc actctataag caagtgggct gcttcctctt tggctgtgcc 421 atcagccagt ctttcacaga cattgccaaa gtgtccatag ggcgcctgcg tcctcacttc 481 ttgagtgtct gcaaccctga tttcagccag atcaactgct ctgaaggcta cattcagaac 541 tacagatgca gaggtgatga cagcaaagtc caggaagcca ggaagtcctt cttctctggc 601 catgcctcct tctccatgta cactatgctg tatttggtgc tatacctgca ggcccgcttc 661 acttggcgag gagcccgcct gctccggccc ctcctgcagt tcaccttgat catgatggcc 721 ttctacacgg gactgtctcg cgtatcagac cacaagcacc atcccagtga tgttctggca 781 ggatttgctc aaggagccct ggtggcctgc tgcatagttt tcttcgtgtc tgacctcttc 841 aagactaaga cgacgctctc cctgcctgcc cctgctatcc ggaaggaaat cctttcacct 901 gtggacatta ttgacaggaa caatcaccac aacatgatgt aggtgccacc cacctcctga 961 gctgtttttg taaaatgact gctgacagca agttcttgct gctctccaat ctcatcagac 1021 agta // LOCUS AB001325 1442 bp mRNA PRI 27-FEB-1997 DEFINITION Human AQP3 gene for aquaporine 3 (water channel), partail cds. ACCESSION AB001325 D25280 NID g1854373 KEYWORDS aquaporin 3; AQP3; water channel. SOURCE Homo sapiens kidney cDNA to mRNA, clone:HUM-AQP3. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ishibashi,K., Sasaki,S., Saito,F., Ikeuchi,T. and Marumo,F. TITLE Structure and chromosomal localization of a human water channel (AQP3) gene JOURNAL Genomics 27 (2), 352-354 (1995) MEDLINE 96044445 REMARK Erratum:[Genomics 1995 Dec 10;30(3):633] REFERENCE 2 (bases 1 to 1442) AUTHORS Ishibashi,K. TITLE Direct Submission JOURNAL Submitted (28-OCT-1996) to the DDBJ/EMBL/GenBank databases. Kenichi Ishibashi, Tokyo Medical and Dental University, 2nd Internal Medicine; Yushima 1-5-45, Bunkyo-ku, Tokyo 113, Japan (Tel:03-5803-5223, Fax:03-5803-0132) COMMENT D25280:Submitted (13-Nov-1993) to DDBJ by:Kenichi Ishibashi. FEATURES Location/Qualifiers source 1..1442 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HUM-AQP3" /tissue_type="kidney" gene 61..939 /gene="AQP3" CDS 61..939 /gene="AQP3" /codon_start=1 /product="aquaporin 3" /db_xref="PID:d1019987" /db_xref="PID:g1854374" /translation="MGRQKELVSRCGEMLHIRYRLLRQALAECLGTLILVMFGCGSVA QVVLSRGTHGGFLTINLAFGFAVTLGILIAGQVSGAHLNPAVTFAMCFLAREPWIKLP IYTLAQTLGAFLGAGIVFGLYYDAIWHFADNQLFVSGPNGTAGIFATYPSGHLDMING FFDQFIGTASLIVCVLAIVDPYNNPVPRGLEAFTVGLVVLVIGTSMGFNSGYAVNPAR DFGPRLFTALAGWGSAVFTTGQHWWWVPIVSPLLGSIAGVFVYQLMIGCHLEQPPPSN EEENVKLAHVKHKEQI" BASE COUNT 248 a 448 c 413 g 333 t ORIGIN 1 ccggggatcc acgcgcgccg ccacccctgc ccgcccgaca gcgccgccgc ctgccccgcc 61 atgggtcgac agaaggagct ggtgtcccgc tgcggggaga tgctccacat ccgctaccgg 121 ttgctccgac aggcgctggc cgagtgcctg gggaccctca tcctcgtgat gtttggctgt 181 ggctccgtgg cccaggttgt gctcagccgg ggcacccacg gtggtttcct caccatcaac 241 ctggcctttg gctttgctgt cactctgggc atcctcatcg ctggccaggt ctctggggcc 301 cacctgaacc ctgccgtgac ctttgccatg tgcttcctgg ctcgtgagcc ctggatcaag 361 ctgcccatct acaccctggc acagacgctg ggagccttct tgggtgctgg aatagttttt 421 gggctgtatt atgatgcaat ctggcacttt gccgacaacc agctttttgt ttcgggcccc 481 aatggcacag ccggcatctt tgctacctac ccctctggac acttggatat gatcaatggc 541 ttctttgacc agttcatagg cacagcctcc cttatcgtgt gtgtgctggc cattgttgac 601 ccttacaaca accccgtccc ccgaggcctg gaggccttca ccgtgggcct ggtggtcctg 661 gtcattggca cctccatggg cttcaactcc ggctatgccg tcaaccctgc ccgggacttt 721 ggcccccgcc tttttacagc ccttgcgggc tggggctctg cagtcttcac gaccggccag 781 cattggtggt gggtgcccat cgtgtcccca ctcctgggct ccattgcggg tgtcttcgtg 841 taccagctga tgatcggctg ccacctggag cagcccccac cctccaacga ggaagagaat 901 gtgaagctgg cccatgtgaa gcacaaggag cagatctgag tggcaagggc catctcccac 961 tccgctgccc tggccttgag catccactga ctgtccaagg ccactcccaa gaagcccccc 1021 ttcacgatcc accctttcag gctaaggagc tccctatcta ccctcacccc acgaagacag 1081 ccccttcagg atttccactg gaccttgccc aaatagcacc ttaggccact gcccctaagc 1141 tggggtggaa ccggaatttg ggtcaataca tccttttgtc tcccaaggga agagaatggg 1201 cagcaggtat gtgtgtgtgt gtgcatgtgt gcatgtgtgt gcatgtgtgt gcaggggtgt 1261 gtgtgtgggg ggggttccca gatattcagg gcaagaccag tcggaaggat ctgctattgg 1321 ggacccagag acagggaggc agcctgtcca tctgtgcata aggagaggaa agttccaggg 1381 tgtgtatgtt ttcaggggcc ttcacatgga ggagctgcag atagatatgt gtttctccgg 1441 aa // LOCUS AB001466 3114 bp mRNA PRI 04-FEB-1998 DEFINITION Homo sapiens mRNA for Efs1, complete cds. ACCESSION AB001466 NID g2829301 KEYWORDS Efs1. SOURCE Homo sapiens 2 years old female hippocampus cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ishino,M., Ohba,T., Inazawa,J., Sasaki,H., Ariyama,Y. and Sasaki,T. TITLE Identification of an Efs isoform that lacks the SH3 domain and chromosomal mapping of human Efs JOURNAL Oncogene 15 (14), 1741-1745 (1997) MEDLINE 98007665 REFERENCE 2 (bases 1 to 3114) AUTHORS Ishino,M. TITLE Direct Submission JOURNAL Submitted (26-FEB-1997) to the DDBJ/EMBL/GenBank databases. Masaho Ishino, Sapporo Medical University, Department of Biochemistry, Cancer Research Institute; S1, W17, Chuo-ku, Sapporo, Hokkaido 060, Japan (E-mail:ishino@cc.sapmed.ac.jp, Tel:011-611-2111, Fax:011-612-5861) FEATURES Location/Qualifiers source 1..3114 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="2 years old" /sex="female" /tissue_type="hippocampus" CDS 609..2294 /codon_start=1 /product="Efs1" /db_xref="PID:d1025508" /db_xref="PID:g2829302" /translation="MAIATSTQLARALYDNTAESPQELSFRRGDVLRVLQREGAGGLD GWCLCSLHGQQGIVPANRVKLLPAGPAPKPSLSPASPAQPGSPYPAPDHSNEDQEVYV VPPPARPCPTSGPPAGPCPPSPDLIYKIPRASGTQLAAPRDALEVYDVPPTALRVPSS GPYDCPASFSHPLTRVAPQPPGEDDAPYDVPLTPKPPAELEPDLEWEGGREPGPPIYA APSNLKRASALLNLYEAPEELLADGEGGGTDEGIYDVPLLGPEAPPSPEPPGALASHD QDTLAQLLARSPPPPHRPRLPSAESLSRRPLPALPVPEAPSPSPVPSPAPGRKGSIQD RPLPPPPPRLPGYGGPKVEGDPEGREMEDDPAGHHNEYEGIPMAEEYDYVHLKGMDKA QGSRPPDQACTGDPELPERGMPAPQEALSPGEPLVVSTGDLQLLYFYAGQCQSHYSAL QAAVAALMSSTQANQPPRLFVPHSKRVVVAAHRLVFVGDTLGRLAASAPLRAQVRAAG TALGQALRATVLAVKGAALGYPSSPAIQEMVQCVTELAGQALQFTTLLTSLAP" polyA_site 3114 /note="10 A nucleotides" BASE COUNT 570 a 1055 c 893 g 596 t ORIGIN 1 ctccaggcaa cttggggcaa gcgtctcagt tctcgctctc ccttcctccc agcggggtcg 61 ccgcagaccc cagccctggg agcaccgctc tgcagcgcgg ccggcgggtg gagacggttg 121 gcccctaaac tcgctcgtcc agcccaaccg ccccggcggc ttctcccagc cctcgaggct 181 ctcctgagcg gcctggagag gcgtcgagcg cagcccagcg cccgcctgct cacccgcccc 241 ggcccgggaa gggaattttc ggatcctgcg agcccggggc gcccccgcgg cctagggcgg 301 gcagctcccg gggcctggcc gagccggtgg cgcccgggag gccgcgggga cagcacgcag 361 cgcgcgccct tggatgccgt cccgcagcga cgccccggcc cgccccgctc ctcctcctgc 421 ctggctagcc tgcctctcat ttgggaagtt ttgtgggttt tctttctcct cctccaacct 481 tggcggaggc cacgactcag gcgccacagc tgggggctag aggccgcgga ccatggtgcg 541 gggcagccac cgctgaagtc agcaaaaccg agcctggcct gaggcaggct gcgcgggagg 601 ccaaagccat ggccattgcc acgtcgaccc agctggcccg ggcactgtat gacaacaccg 661 ctgagtcccc ccaggagctg tccttccgcc gaggggatgt cctacgggtc ctgcagagag 721 agggcgctgg tggactggac ggctggtgcc tctgctccct acacggccag cagggcattg 781 tgcccgccaa cagggtgaag ctcttgcctg ctggcccagc acccaagccc agcctctctc 841 ctgcgtcccc agcccagcct ggctcaccat atccagcccc agatcacagc aatgaggacc 901 aggaggtgta tgtggtgccg cccccagctc ggccctgtcc aacctcagga cctccagctg 961 gaccttgccc accctctcct gacctcatct acaaaatccc cagagctagt gggacccagc 1021 tggctgctcc cagagatgcc ttggaggtct acgatgtgcc ccccaccgcc ctccgggtgc 1081 cctccagtgg cccctatgac tgccctgcct ccttttccca ccctctgacc cgggttgccc 1141 cgcagccccc tggagaggat gatgctccct atgatgtgcc tctgacccca aagccacctg 1201 cagagctgga accagatctg gagtgggaag gaggccggga gccggggccc cccatctatg 1261 ctgccccctc caacctgaaa cgagcgtcag ccttactcaa tttgtatgaa gcacccgagg 1321 aactgctggc agacggggag ggcgggggca ctgatgaggg gatctacgat gtgcctctgc 1381 tggggccaga ggctccccct tctccagagc cccctggagc cttggcctcc catgaccagg 1441 acaccctggc ccagcttctg gccagaagcc ccccaccccc acacaggccc cggctcccct 1501 cagctgagag cctgtcccgc cgccctctgc ctgccctgcc tgtccctgag gcccccagcc 1561 cctccccagt gccctctcct gccccaggcc ggaagggcag catccaggac cggcctctgc 1621 ccccaccccc accccgcctg cctggttatg gaggccccaa ggtcgagggg gatccagagg 1681 gcagggagat ggaggatgac ccagcaggac accacaatga gtacgagggc attccgatgg 1741 ccgaggagta tgactatgtc cacctgaagg gcatggacaa agctcaggga tctaggcccc 1801 cggatcaggc ctgcacaggg gatcctgaac tgcccgagag ggggatgccg gcgccgcagg 1861 aggccctgtc cccaggggag ccactggttg tgtccaccgg agatctgcag ctcctgtact 1921 tctatgctgg gcaatgccag agccactact cagccctgca ggcagccgtg gcagccctga 1981 tgtccagtac ccaggctaat cagcccccgc gccttttcgt gccccacagc aagagggtgg 2041 tggtggctgc tcatcgcctg gtgtttgttg gggacaccct gggccggctg gcagcctctg 2101 cccctctgag agcacaggtc agggctgcag gtacagcact gggccaggca ttgcgggcca 2161 ctgtgctggc tgtcaaggga gctgccctgg gctacccatc cagccctgcc atccaagaga 2221 tggtgcagtg tgtaacagaa ctggcagggc aggccctgca attcactacc ctgctcacta 2281 gcctggctcc atgaaggtcc tttggcacag ctctgctcct cccctgcctg ccaaagcccc 2341 cctttaggcc ttgggtggct ggaaggcttt gttaagggac taggagaaat gggggtatct 2401 ttcccctttc ctgccctttc tgctcatctc aacctctcac agaggtgtct tctcccccta 2461 acctacagct ttttgtacaa gccattttgt gtaaattatt tatatttaat attattccct 2521 gctttgtcag gagcaggtac taggctctgg ggcagtgagg aactagatcc ttctctcctc 2581 agcctagggt ggaggtcact gcactaccac ccacctctgg aagactggct gtgaaaagtc 2641 aggtggcaga aacctggggc cacatagagc ctctctcttt tcctgtttct tggctctaga 2701 agatcagcac tgcactgtta gctgagagtg cgggcaagac ataaactgtc cagagtttga 2761 aggttctcgg aaagaccgga gggcttctcc ccacagaagg cggagagagc tggggctcag 2821 acatgggtgt gcaccttaat aaaccctgct gtctgcctcc ctgactctgc ttcttgggag 2881 catggtgagc agccctggtg ctcagcagcc atacctatgg gacacacact acgaaaagga 2941 tgcctttagg gtttggggga gattttactc ctttcttcaa caactattca ctggacaagt 3001 tctctgctcc catgacgcgc caggcacagt tctgcaagta tattgtgaat gtattgttct 3061 agtgggatac acaaataagt cagttaaaat acataaataa aaacataaac ctgc // LOCUS AB001517 43051 bp DNA PRI 08-JUL-1997 DEFINITION Homo sapiens DNA for TMEM1 protein, PWP2 protein, KNP-I alpha protein and KNP-I beta protein, partial and complete cds. ACCESSION AB001517 NID g2250697 KEYWORDS PWP2; TMEM1; KNP-I; PWP2 protein; TMEM1 protein; KNP-I alpha protein; KNP-I beta protein; alternative splicing. SOURCE Homo sapiens B-lymphoblastoid cell_line:GM130B DNA, clone_lib:cosmid library clone:D6B5. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 43051) AUTHORS Nagamine,K., Kudoh,J., Minoshima,S., Kawasaki,K., Asakawa,S., Ito,F. and Shimizu,N. TITLE Direct Submission JOURNAL Submitted (28-FEB-1997) to the DDBJ/EMBL/GenBank databases. Nobuyoshi Shimizu, Keio University School of Medicine, Department of Molecular Biology; 35 Shinanomachi, Shinjuku-ku, Tokyo 160, Japan (E-mail:shimizu@dmb.med.keio.ac.jp, Tel:03-3351-2370, Fax:03-3351-2370) REFERENCE 2 (sites) AUTHORS Nagamine,K., Kudoh,J., Minoshima,S., Kawasaki,K., Asakawa,S., Ito,F. and Shimizu,N. TITLE Isolation of cDNA for a novel human protein KNP-I that is homologous to the E. coli SCRP-27A protein from the autoimmune polyglandular disease type I (APECED) region of chromosome 21q22.3 JOURNAL Biochem. Biophys. Res. Commun. 225 (2), 608-616 (1996) MEDLINE 96354831 REFERENCE 3 (sites) AUTHORS Nagamine,K., Kudoh,J., Minoshima,S., Kawasaki,K., Asakawa,S., Ito,F. and Shimizu,N. TITLE Genomic organization and complete nucleotide sequence of the human PWP2 gene on chromosome 21 JOURNAL Genomics 42 (3), 528-531 (1997) MEDLINE 97349125 FEATURES Location/Qualifiers source 1..43051 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="GM130B" /cell_type="B-lymphoblastoid" /chromosome="21" /clone="D6B5" /clone_lib="cosmid library" /map="21q22.3" gene 1823..5594 /gene="TMEM1" exon 1823..2017 /gene="TMEM1" /number=22 CDS join(<1823..2017,2338..2576) /gene="TMEM1" /codon_start=3 /product="TMEM1 protein" /db_xref="PID:d1021982" /db_xref="PID:g2250698" /translation="VDNSSNWAVCGKSCGVISMPVAARATHRVHMEVMPLFAGYLPLP DVRLFKYLPHHSAHSSQLDADSWIENDSLSVDKHGDDQPDSSSLKSRGSVHSACSSEH KGLPMPRLQALPAGQVFNSSSGTQVLVIPSQDDHVLEVSVT" exon 2338..5594 /gene="TMEM1" /number=23 3'UTR 2577..5594 /gene="TMEM1" exon 6438..6479 /gene="PWP2" /number=1 5'UTR 6438..6461 /gene="PWP2" gene 6438..30226 /gene="PWP2" CDS join(6462..6479,8029..8141,12798..12892,13224..13323, 13670..13813,14309..14444,14736..14965,16865..17008, 17808..17878,18432..18568,19393..19541,19676..19823, 19997..20147,21222..21402,23625..23772,25056..25164, 25945..26010,26977..27195,27292..27442,27597..27671, 29643..29817) /gene="PWP2" /codon_start=1 /product="PWP2 protein" /db_xref="PID:d1021983" /db_xref="PID:g2250699" /translation="MKFAYRFSNLLGTVYRRGNLNFTCDGNSVISPVGNRVTVFDLKN NKSDTLPLATRYNVKCVGLSPDGRLAIIVDEGGDALLVSLVCRSVLHHFHFKGSVHSV SFSPDGRKFVVTKGNIAQMYHAPGKKREFNAFVLDKTYFGPYDETTCIDWTDDSRCFV VGSKDMSTWVFGAERWDNLIYYALGGHKDAIVACFFESNSLDLYSLSQDGVLCMWQCD TPPEGLRLKPPAGWKADLLQREEEEEEEEDQEGDRETTIRGKATPAEEEKTGKVKYSR LAKYFFNKEGDFNNLTAAAFHKKSHLLVTGFASGIFHLHELPEFNLIHSLSISDQSIA SVAINSSGDWIAFGCSGLGQLLVWEWQSESYVLKQQGHFNSMVALAYSPDGQYIVTGG DDGKVKVWNTLSGFCFVTFTEHSSGVTGVTFTATGYVVVTSSMDGTVRAFDLHRYRNF RTFTSPRPTQFSCVAVDASGEIVSAGAQDSFEIFVWSMQTGRLLDVLSGHEGPISGLC FNPMKSVLASASWDKTVRLWDMFDSWRTKETLALTSDALAVTFRPDGAELAVATLNSQ ITFWDPENAVQTGSIEGRHDLKTGRKELDKITAKHAAKGKAFTALCYSADGHSILAGG MSKFVCIYHVREQILMKRFEISCNLSLDAMEEFLNRRKMTEFGNLALIDQDAGQEDGV AIPLPGVRKGDMSSRHFKPEIRVTSLRFSPTGRCWAATTTEGLLIYSLDTRVLFDPFE LDTSVTPGRVREALRQQDFTRAILMALRLNESKLVQEALEAVPRGEIEVVTSSLPELY VEKVLEFLASSFEVSRHLEFYLLWTHKLLMLHGQKLKSRAGTLLPVIQFLQKSIQRHL DDLSKLCSWNHYNMQYALAVSKQRGTKRSLDPLGSEEEAEASEDDSLHLLGGGGRDSE EEMLA" exon 8029..8141 /gene="PWP2" /number=2 exon 12798..12892 /gene="PWP2" /number=3 exon 13224..13323 /gene="PWP2" /number=4 exon 13670..13813 /gene="PWP2" /number=5 exon 14309..14444 /gene="PWP2" /number=6 exon 14736..14965 /gene="PWP2" /number=7 exon 16865..17008 /gene="PWP2" /number=8 exon 17808..17878 /gene="PWP2" /number=9 exon 18432..18568 /gene="PWP2" /number=10 exon 19393..19541 /gene="PWP2" /number=11 exon 19676..19823 /gene="PWP2" /number=12 exon 19997..20147 /gene="PWP2" /number=13 exon 21222..21402 /gene="PWP2" /number=14 exon 23625..23772 /gene="PWP2" /number=15 exon 25056..25164 /gene="PWP2" /number=16 exon 25945..26010 /gene="PWP2" /number=17 exon 26977..27195 /gene="PWP2" /number=18 exon 27292..27442 /gene="PWP2" /number=19 exon 27597..27671 /gene="PWP2" /number=20 exon 29643..30226 /gene="PWP2" /number=21 3'UTR 29818..30226 /gene="PWP2" exon 32676..32884 /gene="KNP-I" /number=1 gene 32676..42407 /gene="KNP-I" 5'UTR 32676..32743 /gene="KNP-I" CDS join(32744..32884,33148..33200,35106..35219,36223..36342, 39296..39388,42251..>42407) /gene="KNP-I" /note="alternative splicing" /codon_start=1 /product="KNP-I alpha protein" /db_xref="PID:d1021984" /db_xref="PID:g2250700" /translation="MAAVRVLVASRLAAASAFTSLSPGGRTPSQRAALHLSVPRPAAR VALVLSGCGVYDGTEIHEASAILVHLSRGGAEVQIFAPDVPQMHVIDHTKGQPSEGES RNVLTESARIARGKITDLANLSAANHDAAIFPGGFGAAKNLSTFAVDGKDCKVNKEVE RVLKEFHQAGKPIGLCCIAPVLAAKVLRGVEVTVGHEQEEGGKWPYAGTAEAIKALGA KHCVKEVV" CDS join(32744..32884,33148..33200,35106..35219,36223..36342, 42251..>42407) /gene="KNP-I" /note="alternative splicing" /codon_start=1 /product="KNP-I beta protein" /db_xref="PID:d1021985" /db_xref="PID:g2250701" /translation="MAAVRVLVASRLAAASAFTSLSPGGRTPSQRAALHLSVPRPAAR VALVLSGCGVYDGTEIHEASAILVHLSRGGAEVQIFAPDVPQMHVIDHTKGQPSEGES RNVLTESARIARGKITDLANLSAANHDAAIFPGGFGAAKNLLCCIAPVLAAKVLRGVE VTVGHEQEEGGKWPYAGTAEAIKALGAKHCVKEVV" exon 33148..33200 /gene="KNP-I" /number=2 exon 35106..35219 /gene="KNP-I" /number=3 exon 36223..36342 /gene="KNP-I" /number=4 exon 39296..39388 /gene="KNP-I" /number=5 exon 42251..42407 /gene="KNP-I" /number=6 BASE COUNT 8563 a 11363 c 12711 g 10414 t ORIGIN 1 gatcagcctg gctaatatgg cgaaacccca tctctactaa gaatacaaaa attagctggg 61 cgtggtggca ggcgcctata agcccagcta cttgggaggc tgaggcagga gaattgcctg 121 aacccaggag gcggaggttg cagtgagcca aaattgcgcc actgcactcc agcctgggca 181 acgagcaaaa ctccgcctca acaacaacaa caacaacaaa acaaccaaca tcaaatttat 241 ggtgaagctt gggtggaaga atggtgaaat cattgatgct ttatgaaaag tttatgaaat 301 tctttggaca gtgccccaaa gaattagtag tttgcaaatg gatatcttgt tttaagaagg 361 gacgagatga tgctgaagat taaggccata gcaagcagac catccacatc cattttcaag 421 gaaaaaattc accttgttca tgccctaatt ggagggaact gacaacagca gaaatgacag 481 ccaacaccgc agacatctga actggttcag cttatgcaat tacgcctgaa gggtgagcaa 541 gcgttccact cagtgggtgc caaaacccta gtacccaggt ccgtggccga ccagagcaga 601 gcttcccatg gaaacttcaa acacacggga tcaagatcct gaagcattta ctcaaagaac 661 cataacagga gatgcagtgt ggccttccca gtaccatcct gaagacaaag cagacacggc 721 aatggctacc aagaggtggg gtggccccac caaagcagaa gcagactggt caagagcaga 781 ggccacagca acaggttttt gggatactca aggcattttg cttgttgact ttctggagag 841 tcaaaggaca gcagcgtctg tgtattatga gattgttttg aggaagatag caagagcttt 901 ggtagaaaaa cccctgggga agcttcacca gagagtcctc ctccaccacg acaaagctcc 961 tgctcattcc tctaacaagc ccaacttcgt gagggtttct atgggaaatc atgaggcatc 1021 caccttgtgg acctgattcg gctccttctg actgattttg gtttcctgat cttaagaagt 1081 ctttaaaagg cacccatttt tattcagcta ataatgtaga gaaagactgc gttgacatgg 1141 ctaaattctc aggcctccca gttcttcagg gatggactaa atggcatcgt aagcaatggc 1201 atcattgctt acaaacatgt cttgaccttg gtggggctta tgttgagaaa taaagcttat 1261 attttctgtt ttcatctttg gattccattt ttgcatgaac tttttgtagt tccctcatac 1321 gtattctctt tccatcctta aaaagcagct ctgtgaggct ggtggtggtc tcagcactat 1381 acagcggtca ttgagcctgg aggccttggg gtgctcttct gagtggcaga gttggcttca 1441 gacccagtgg tcttggatac tgccacttac cagacagaac ccattaactg ccaaagtcac 1501 catactgtgg aagatgacac aaaaatttga aaacaaaatt aaatcattta ttaacctcac 1561 aggaaaaatg cagttttctg tacacgtcca gcataattca ttctaatcca gaactaattg 1621 aaattggact caatgaaata ggggttgcca ccgtgaaagt gcatggcttt atcttcaggc 1681 gcagctacaa ggcactgggc tctgtgtccg cagtgttttg tcagacagcg ttgggtgaga 1741 ggattgggta acgacggccg tgactctcgc ctgagtactg gaaggcccgt cctggatgac 1801 acatcgcttc ctctcgtttc agttgtcgac aacagtagca actgggcagt gtgtgggaaa 1861 agctgcggtg tcatctccat gccagtggct gctcgggcca ctcacagggt ccacatggaa 1921 gtgatgccgc tcttcgccgg gtatctcccc ctgcccgacg tcaggctgtt caagtacctc 1981 ccccatcatt ctgcacactc ctcccaactg gacgctggta aggactttgg aagaaaaagt 2041 ttaccttcag tatttgagca ggtgagcact ttgtgcgaga ggtgataaac ttatgactga 2101 gcaggttgga gaactagcgg cgtgttttca gtgccggtag aaactggtac tagaagcctc 2161 atccgctgtt agaatcttag gcacctgctc tgaaagccaa gcctctgtgc acagagtgct 2221 ctgtaaaggg gcagcctcgg gtggcattct gcacgttcag cctggctccg tgtggctttg 2281 cccgtcccca ggtgagtgtc ggcgcagctt gggtctaact tcctgtgtgt tttgcagaca 2341 gctggataga aaacgacagc ctgtcagtag acaagcacgg ggacgaccag ccggacagca 2401 gcagcctcaa gagcaggggc agcgtgcatt cggcctgcag cagcgagcac aaaggcctac 2461 ccatgccccg gctgcaggca ctgccggccg gccaggtctt caactccagc tcgggcacac 2521 aagtcctggt catccccagc caagatgacc acgtcctgga agtcagtgta acatgacaac 2581 gccagggtga acacacgcca cttcccagct aggagtgcac tttatgggac tgtgactgga 2641 ctcttccgtt ctggctccag ccagaccttc agtggtcctg cctggccgtg gggacatcag 2701 agagtgtcat cacgcagctg gccagctgag ttctgttgtt gttttcatgc cgcctgtgat 2761 ctcagattcc tgcttttctc accccgtccc catgctggtg tccgacgccg cttactcaga 2821 gccctggcct ccctccccct acctcacacg ctgctcatga aagtttccac ccacgctgtc 2881 tccacggaac agcctccgtc tgctggctct tcgtggaagg ccatttgtct ttcaggtaga 2941 cactcagcag ccctcacggt cttagtgacg tgtgtgcctt tctggtcaca cagctgccca 3001 gtttcctgat cggggtggat ttgtgtcccc taaggggtaa aacagccgtt taccgcagat 3061 cctctcattg tgcttttcta gaataacacc cttctagggg aggcgggtgg gggagggagg 3121 gatcataacc ccttctgtgc cttgggatgc cggagctggg ggacctggag gcccatcagc 3181 cggagccacg tgaaaggtac tgaagaaagc tgagacccgg ctgtgaggag cgcctcagcg 3241 gtgaggtggt ttagggataa atgtttctgg aaccctgtgg tcccccataa tgttgataga 3301 atatcatatg cactgggagt taaatatatt taatttaatg atcattatat atgtgggggt 3361 taatatgttg tttttctgtc cctttaaagt ctttacatgt aattgtagct gtataatcgt 3421 tatttttctt ttgcatctta agtcttagaa attaagatat tccatcgtga ggatgagaga 3481 ggtcctcagt gtgtttttgg tctggttgta gggaaggact caagtcctgg aatgtcctcc 3541 actggtctac tgagttgcag tcacactgtt ccaatggatt atttgctttc ggttgtaaat 3601 ttaattgtac atatggttga tttattattt ttaaaaatac agactaactg atgtaatgtt 3661 tatgtataag ttgcaccaaa aatcaaggac aaaaataagt gtgtttgttt ttacaggtgt 3721 gaaagtcaca gcttgtaaat aagtgttgta tgtattaaac cttttccagt tctccaaagc 3781 gatgtatttt tgtacacttg aaatagagta ctcttaattt actgggcaaa tgtgcttgga 3841 attgaacttg acaagattag ctcaagcaga tagagtcggg tccagcagtg ggtggccctc 3901 gtgtgaatcc ccgtggatgt gcaagttgtg gagagaagga gcaccgggtt cctgcccagc 3961 actgtgcttg cgggaggcgg tggggcatgg gaggaaggag gcacagaccg gggaaatatg 4021 acagccgtca tttccagtat tctctgtgtt gtcttttagc tcattcaata aataaaggtg 4081 gtgtgatttt tttttcctcc tgtctttttc atttgtagaa actggagacg tgtaaagaag 4141 ataaataatt gtgtaattaa actttccaga aatttatctt cctcatgtgc agtttaacaa 4201 acttggtcaa actagttagc aaattagaac ttcagaatct aatgatagtt tagggtttct 4261 aaaataaggt tttttattgt aaaaattgac gattgccctg catttctacc aagtcctgtg 4321 aataaagaga tgggagattt gattccgtca gaagagactg taatccgtgt cgtcagcctg 4381 ggagccttcc ccagtgtaat gtagctttct ctcttacctt ctggaagagg gaatgtttca 4441 tttattactg tttgattttc ttgtatctgg ttctactccc aggatgaaat tatccaacta 4501 catatatatt tagaggaaga aagtgaaggg gaaatttaaa atgtttacgg cgcttaattg 4561 cctggaaatg aaatgaaatc aaatttatca gtttttttcc ccctaattac ccaaaagatc 4621 ttttgcaaac tatgttacat gaatgcttct gcctctttaa gacaaagaag aatgtcaccc 4681 aaaattgtca tttttttctt aatgttcatc ataaaagtcc taaaagagta actgtaattg 4741 gatgtttatt gtttttatct aaagtaaggt gtatgtgttt gagacaagct ggttttgttg 4801 ataaagagat gttaaataat tgtgaagcca gatatgcaat gtgtatctga aaagcaagga 4861 atttgcagcc gttttacaaa tatctgtgga acatgtaaat actgtcaaat ggaaaataaa 4921 ataagttata atttttgtga atttcatggg atgtcctatg attggaaaaa ttataactct 4981 tctgattcta atgtggaaat tgttgtattt aatctgaaaa tgactttacc tacaacagtt 5041 ccattgtcag cacagcctag gaaggtcaga tcctgtatta attactctta gtggagatgc 5101 cagatatccc atacagaatt agcagagaaa atacacacag gcttctattc aaattttctt 5161 tagtgcttaa aattaagttt taaaatgaaa tcagacactg caggtttgta tataaaatga 5221 aaagctatac tactttttac aaaagggcaa actgggctga tgtaaatgtt ttactttcaa 5281 ctgtgttctt taaaataaat cctacctggt ttttaaattt tatttttcat gaaaatgctc 5341 ctttctctac atttattcat cctatataca tcaggctgta agaccccccc cagtcatcat 5401 taatacaatg tgttgggatt ctgtgactgg aaaaggtgac aagttggtga ctttgacact 5461 gcaggtattc cattttcatg gtttactatg aaaagtcatt tttcatatta tgtaatatat 5521 tgttagatta aaaccattgt attaagactt taaaatgtaa gcattgtaat tctgaaaata 5581 cacattttaa gaagaaacta ttttgcactg gatcttttgc tgcgacatgg tgtgacaaca 5641 gtatgtaatg tgaagatttt attgttttat ctccctagtc cttgttcctg actcttgggc 5701 atagattatt tgccttttgg gaaacattcg gattattttt tttttttttt tttttttttt 5761 ctctgagaca gtcatgctct gttgcctagg ctggagtgca atggcgcgat ctgggctcac 5821 tgcagcctcc gcctcccagg ttcaagccat tctcctgcct cagcctcccg agtagctggg 5881 actacaggca tgcgcctcat ttttatactt tagtagagat gatgtttcac catgttggcc 5941 aggctagtct cgaactgctg acctcaggtc atccacccgc ctcagtctcc caggcgtgag 6001 ccaccgcacc ggccagcatt cgggacttta ttcataattc cttacccaac agttcacaag 6061 aacgaagaca tttgggtgtg cagcgctctc atgttgtgga cgctgctgtg ccaggctggg 6121 cgtccgtgcg gagcgcgccc agcgctggag cgcgcctaag aggcgccctg cagtgtggct 6181 ccggggtgtg ggcggggccc ggagaaggcc cccgccttgg gaggggttgg gcctcgcgtt 6241 caaactccgc ctcggggcgg ggcgagggcg acgcagctcc tcctgcctcc tgggccgctg 6301 cggacggtgg tgcgcgaccc cgtcccgggc gcgcgcgctg tgggcggtgc acgccgtcgc 6361 ttcccggaag tgcgtgctgt gggcggtgcc gcgcggaccc ccgggaagtg tctctgtggg 6421 cggccgccgg gttgagctgc ggcacacgtg cgacggccgt gatgaagttc gcttaccggg 6481 tgagcgcggg cggcgggcgg tctgttggat cggcgcggct cttccggtgg acggcgcctg 6541 agctggctcc gggcgggctg ggggcgcgtg gtctgctttt gcggcgtctg gccgcgtggg 6601 ggtcgccggc tgtctgcgcg cccgtggcct cggggacgcg gggctgtggt ggggctcctg 6661 ggagagctgg gacggaggcg ccgacggctt ctcggaggga acagggacat ttgtcggggt 6721 cccgggatat ttttaacctg agtatggccc aacctaaagg ctggagttgg tgcagtcggg 6781 gagggcacag caccctcggg agggatgagg aggtgggacg gggcggggac gcgggcgcgc 6841 gcaggcgggg tgagaacagc agggacccgg ggcccgggac tggtccctaa cgagcgagcg 6901 tccaggtgag cggaagggcg gggaacccag gtgtgagctc tgtttcccga ggccggaggg 6961 aattcctgag gcggacgccc tcagctttga gagccaggga agacttgtcc cagatgtgaa 7021 gtgcaggagg cgggtgagac aggtaataag agggggccgt ggagccggcc tgggggtctc 7081 tgaagcgttt gctacagctg gccggttatg gggacgcctg tgtttcccac agtggccctg 7141 accgttgcgc ttccagggtt gcacgggccg cctgctgctg agcccaccgt gtactacact 7201 tgtcctgtcc gaaactgggc atggtactgg cccctcagag tgctgctcct gtgtcccctg 7261 tgctggtgga gccaccgctg tgtgcccagc accaaaccag aaaccatatc cacgtcctca 7321 caggccagga cttcatccgc tccacattcc tactgcaagg tggcaaaaaa tgtgccttgg 7381 agcctgcctg gccgcttaaa gtccttctgc tgaccaagct cctccctgag gtcctgaggg 7441 cgctgcccac tcacagcttc agggcctggc tggtgccttc ccatttagat tccagctcag 7501 gtgacccttc caggtgcctc tccgtgacca ccgtctaaag tagcccctgc tcccacactt 7561 ctcatccccc tcctgctaga gagagaatat aaggtgtgcg tgtctgtgcg catggcagtc 7621 tccagtgaac gctcgtgctc gtgagttagt aaataggatg ctggcttacc acacttctgg 7681 aagccctgag ttctaagcac ttttggggtg ctgttccatt gttgcagctt ttttcttggt 7741 gaagtgtggg ctccatgccc ctcttcacct ggcagacgcc tcctgtcttg ggaggcttcc 7801 tcagtgctca caggggcatc ttacccatct ctgtgggagc ttggggccgc ccgggctcct 7861 ctggtctcct gaggatcagg ctcacttctt cctctcttac ccaggaaagg tgaacgggcc 7921 cccaaggtgg ggttctctat gccatctctc tgcccagatg cctcacttag tgcttttctg 7981 ggagtcgact gttttctttg tgttaagaca attttgtttt cttcacagtt ttcaaatttg 8041 ctgggtacgg tgtaccggcg tgggaaccta aattttacct gcgatggaaa ttcagttatc 8101 agtcccgtgg gcaatagagt cactgtattt gaccttaaaa agtaagtatg tgaaagtggt 8161 attcataata gtgatactga ttactattat caccattttt taagtaacat atgaaaactg 8221 atagtgtctt ctgtgccttc cttttaatgg tgcacccgtt aattaaggga gcccaggaca 8281 ggactacagg acagactagg gtcagcagat cagttgtgga aattggcagc cagacccagt 8341 tcttcgttgt agtgccttag ccaggcccca taaagtggtg tgtcacttgg ctccgttctg 8401 tgggaaggga ccctgtccct gtgctgtgaa gggtgggtgg gggaaaaggt gtagctcagg 8461 gttagccctt cccccatcag ggaaggcccc taagggctga aaggccaggg ctgctggggg 8521 agttgaaggt accttggacc cctgctcggg agccctgctg ccactgggcc tcaccagtcg 8581 gacgggcagg cccaaagctt ccagctctgc agccggaggt cagttttatt cgctggggtt 8641 tagcggtttc ttcctgtgtg ctgttgcact ttgcttcact ttgttgcact ctggctctgt 8701 gcggatgctg ttttaccacg aagcgaaggt gtgtggaaac actgcttgga gaaagttggt 8761 cggtgccatt tttccagcag tgggggctca cttcatgtct ctgtgtcaca ttttggtaat 8821 tttgtgatat ttcaaagttt attatacctg tgagggtgat ttgtgatcat ctttgctgtt 8881 gctatcgtga ttgttctggg gtgccgtgat ccgtgcccat gtaagacagc gagcttaatc 8941 ggtaagcact gtgtgtgctg tgcctgctcc accgaccggc tgttccccac ccccccacgt 9001 ctctcttcct ctacttgggt ctccctattc cctgacacaa acaatgctga aattaggcca 9061 gttaataatg gcctacagtg gccactacgt gttcatgtga aaggaagagt cacacaactc 9121 tcacctgaaa tcagaagcta gaagggactc agcttagtga ggaaggcatg tcaaaagctg 9181 aaataggcca aaagccaggt ctctgccacc aaacaggcaa gttgtgaatg caaagaaaaa 9241 gcttttgaag gaaattaaaa gtgctactcc agtgaacaca caaattacaa gaaagccaaa 9301 atagccttat tgcagatacg gataaaagtt tgagtgatcc agatagaaga tcaaaccagc 9361 cacaacattc ctttaagcaa gtcaaagcct aatgcacagc aaggccctac ctgtcctctg 9421 ttctgtgagg gctgaggtga ggaagctgca ggaaagaagt tggaagctag cagaggttgg 9481 ttcatgaagt ttaaggaaag aagccatctc tacaacataa aagtgcaaag cgaagcagca 9541 agtgctatgg agaagctgca ggaagttatc cagaagatct agctgagatc attgatgaag 9601 ctgactacac taaacacaaa atttcaatgg agacaaaaca gccttctatt ggaaaaagat 9661 ggccaggcgc agtggctcat gcctgtaatc ccaacacttt gggaagccaa ggtaggcaga 9721 tcatttgagg ccaggagctt gagaccagcc tggccaatat ggtgaaaccc cgtctctact 9781 aaaagtacaa aaattagcca ggcatggtgg cacattcctg taatgccagc tactcaggag 9841 gctgaggtgg aagaattgct ggaacctggg aagcagaggt tgcagtgagc tgagatcgtg 9901 ccagtacact ccagcctggg caacagagca agactctatc taaaaataaa taaattaaaa 9961 aaaaagaaaa agatgccatc gaggacttac atagctagag aaaagtcagt gtctggcttc 10021 aaacaacagg ctgactctct tattaggggc taatgcagct ggggccatta agttgaagcc 10081 agtgctaatt cactgttctg aaaatcctaa tgcccttcac aatgatgccc aagctactct 10141 gcctgtgctc tataagtgga acagcaaagt ctggacgaca gcacatctgt ttacagcatg 10201 gtttcctgaa tctttccagc ccattgttga acccactaag gttcctttca aaatatttct 10261 gctgttggac aacgtacctg gtcacccatg agctccaatg gagacgcata aggagatgct 10321 gtcttcatgc ctgctaacac cacatctctt cttcagccta cgggagtaac tttaactttc 10381 aagtcttacc atttaagaaa tatattttat aaggctctag ctgccaaaga tagtgattcc 10441 tgtaatggat ctgggaaaag ccaataaaaa gctttctggg aaggagtcgc cattctagtt 10501 gtcatttcat gattcattca atgagggcaa agtgtcgaca ttaacaggag tctgacagaa 10561 gtttagtatc aagcctcatg gatgactttg aagggttcga gactttggtg gaagaagtta 10621 ctgcagatgc agtaaagata gcaagaggat tcgaaatgaa agcagagcct gaaatgggac 10681 tgaattgccg caatgttgcc atcaatcttg aatgaatgag gagctgctgc ttgtggatga 10741 gcaaagcaag tggtttcttt tctttttttt gagacaatgt ctcactcttg tcccccaggc 10801 tggagtgcta tggcatgatc ttggctcact gcaacctcca cctcctgggt tcaagcaatt 10861 ctcctgcctc gggcccccct accaagtagc tgggattaca ggcgcctacc accacgccca 10921 gctaattttt gtatttttag tagagaagga gtttcatcat tgttggccag gctggtctag 10981 aactcctgac ctcaggtgat ccacccgcct cagcctccca aagtgctggg attacaggtg 11041 tgagccacca tgcccggcca gcaagtggtt tcttgagatg gaatctactt ctggtgagga 11101 tgctgtaaag actgttgaaa tgacaacaaa ggatttagaa tatgacatca acttagttga 11161 taaaacgctg ggagcatttg tgaggattga ctccggtttt gaaagaaggt ctgttgtggg 11221 taaaatgcta tcaaacagta tcacatgcta cagagacatc ttgtgaaagg aagagtccat 11281 cgatacggca agcttcactg ttgtgttagg aattggcaac ggccaccccg ttttcagcag 11341 ctgccactgt gcctggtcag cagccatcca catcgaggca gactgtctgc cagcaaaaag 11401 atgaagactc actgaaggct cagatgattg ttagcatttt ttagcagtaa agtgttttta 11461 aattaaggga tgcacgttgt tgctttagac agaatgcttg tgcacactga tagactacag 11521 cacacatgag tagactgcag cgcacaccaa taaagtacac acttcgacga cagtacacac 11581 gaacagacta caactcacac gaatagagca cagtgcacgc ttacagacta cagcacacgc 11641 tagtagacta cgccgtactg tttacagtat ggtgtaaaca cagctcggct gcttatgggg 11701 aagctggaat gtgtgtgcct cactcagttg cagtgtaagc gttatctgca gcatctccgc 11761 gggattcctg tagcccctta ttttccattt ttctagcaac agtgttacat gggtgcactg 11821 tcatggtggt taattcctgt tgggcactta gcttgcttct ggtattttga tgtcacactg 11881 aggcggaata atcctctcat catatcatcc tggaagcaga attgatgtat ccaaagccca 11941 tttttaaggc ttttaatgca tttcactaga gttcctccag ggacaacgtc atttcttttt 12001 cttttttttt ttgagacaga gtcttgctct gtcacctagg ctggagtgca gtggtgcggt 12061 cccagctcac tgcaacctct gcctcccagg ttcaagcgat tctcctgcgt cagcctcttg 12121 agtagctggt attacaggcg cccaccacca cgcctggcta attttttggg tttttttgtt 12181 tgtttgtttg ttttgaggca gagtcttgcc ctgtcgccca ggctggagtg cagtggtgca 12241 accttggctc actgcaagct acacctcccg ggtttacgcc attctcctgc ctcagcctcc 12301 tgaatagccg ggactacagg cgcccgccac cacacccggc taattttttg tatttttagt 12361 aaagacgggg tttcaccgtg ttagtgagga tggtctcaat ctcctgacct cgtgatccgc 12421 ccgccttggc ctctcaaagt gccgggatta caggtgtgag ccaccgcgcc tgtccaattt 12481 tttgtttttt gtttttatag tagagacggg gtttcaccat gttgaccagg caggtctcgt 12541 actcctgacc ttatgatatg cccgccttgg cctcccaaag cactaggatt ataggcgtga 12601 gccaccgtgc ccggccaaca atgtcatttc tttttcagga gtggggggta tccacagatg 12661 agctctggct tgtggggtcc agtaagattg gaggcgctac ttggggaggg tgagccgctg 12721 gtcccaggcc aggcctggcc tgtctccctc atgtgacttg aggaccaaca gcacactgtg 12781 gtttttgtct ttatcagcaa caaatctgac acgctgcccc tggccactcg gtacaacgtc 12841 aagtgcgtgg ggctgtcccc ggatggccgc ctcgctatca tcgtcgatga aggtacttgc 12901 ccttgatgtg ggcgggtact gaggggacca gtgaggagga ctcagggctg tgtgggtctg 12961 aaatgatccg ctccccagtc gcctccgtgt tggggcccgg gtggggattg gggatgaggg 13021 tgtcatgggc gaggctgcgt ccccgccgca ggctgtgtct gtggcatcac ctgtggcagg 13081 gcagggcctc atggcccttg ggcattctca gtcctgactc ctcacagtgg acgcgagaaa 13141 gacctttctc cgtttttccc gtgctatgta tctggggtgg cccctggggt agctgcgcgg 13201 tggtgacctc tggcgtcctg cagggggcga tgcgctgctg gtcagcctgg tctgcaggtc 13261 tgtgctgcac cacttccact tcaagggctc tgtgcacagt gtgtccttct cccctgatgg 13321 caggtaaggg ggacagcttg ggggcaggag ggttcggccg tgcatgccga cggtgaggcc 13381 cactgcccag ctcgtttccc tgggccccgg gttgtgacgt tgggttgggc acacggccca 13441 agctctggca ttggtgaccc ccttttctgg aggagaaggc ttggtgcggc caggctgccc 13501 tccctccagg gctgtttctt ccctgacttt tgcttcctct gctccttggc ctccatcctg 13561 caggaggcgg gactccagaa ggcccaggtg ggtgcaggct gagactccac agtggcctta 13621 gggccattgc tggtcttgaa tatcaactat gtccgcgttc cttccatagg aagtttgttg 13681 tcacaaaggg taacattgcc cagatgtatc atgcccctgg gaagaagcgg gagttcaacg 13741 ccttcgttct ggacaagacc tattttgggc cctacgatga gaccacctgc atcgactgga 13801 cggatgactc caggtgcggc ctcagaggct tcgggggagc ggggcttgag agaggcccct 13861 tggacaggtc acccccttgg gccctgcagc ttcgttccgc aatggcatct cggcctcctt 13921 tggggctggg cgctgcccac accatcagga ctggctgggt ggctgtggtc cagtctttgc 13981 ggctctgaag tgtgtgctgt ggccgagcct cctgcccctg cactgccctt tcacctgttt 14041 gtcccagaaa ctgggaatgc tgtggttggc tcgggcagcg tctgggagga gttaggaagg 14101 cagacagggc agggacagcg tctgggaggt gtgaggaagg cagacagggc aggggctgtg 14161 ccggggtgtc ttcctgtgcc tggaggcgtc tcagctcact ctgagaccac gtggggctgt 14221 agcggtctcc gtgcagccct ctggagcttg cacctggctg ctttgttatg gccaagccct 14281 gaggaggtgc catctccctc ttttccaggt gctttgtggt tgggagcaaa gacatgtcca 14341 cctgggtgtt cggagccgag cgctgggaca acctcatcta ctatgcactg gggggacata 14401 aggatgccat cgtggcctgc ttctttgaat ccaacagcct ggacgtatgt ccctttgcca 14461 agttccctag cctgatggtg gccaaaagcg gctagttcag ggaggcccca ggctgcaggc 14521 ggcctcttcc tgcggttccc ccaggctgtt cccgtgtggc ctgggggcat ctagggaccg 14581 atggcagctg tggcggccca tcgagaattg ttctcagagt tcagaggggc cgtcggtgcc 14641 actggtccct ggatctgagc tggagggagg tgccgggttg tggcagctgg gccagtgtgg 14701 ctggggctgg ggtgaccctg tgcttcccct tgcagctgta ctcactcagc caggacggag 14761 tgctgtgcat gtggcagtgt gacacgcccc ccgagggctt gcggctgaag ccccctgcgg 14821 gctggaaagc agacctgttg cagcgggagg aggaagagga ggaggaggag gaccaggagg 14881 gcgacagaga gaccaccatc cggggaaaag ccactccggc cgaggaggag aagacaggaa 14941 aagtgaagta ctcacggctg gccaagtagg tctctgaggt gtggtgggct gtggcggggc 15001 actcctgtgg gaatctcttg cccccagagg gatctggatg tggggtcccc agggccgagg 15061 agtgggaggt acagagagca ggcctgcgtc tcagggcccc ggtttgagca ccctcgagtg 15121 cttgggtgtt gaccctgccc cttaagcagg gtccctgcct cggctgtctg agctgcctgt 15181 gtgcccaggg cagggcagag ggcggtgggt aggggtgcat agggggcagg gaggtgtgcg 15241 ggcactccgt gacgggaata gcagcacctt gggccctaag ggggttggga gttgtctgtg 15301 gccccaccag cttctgtgga gtttggtgtg ggcgatagtc ccttggcctc gtgcagggag 15361 atgccgggag agcatttggc aagggtggag ggagggaccg ggctgtgagt gggcgcagcg 15421 agtggtctga gtcctgcttc ggggcccgct ggtgggtggt acttggtgct ttttcaggca 15481 ctgagctggg ccgtgcagcc ccgcatccgg actcagtccc ccagacgaga cgagatgaga 15541 tgagactgag cccggcccca cggactgcct gtgggatggc cttagctgtg cctggagccc 15601 tggcagagat gctgccctga ttcggggtgt cccgcgtctg ggctgtgggg ggaggctggt 15661 caggatcaca tgaaggcaga gttgcccgca gctatggtgt ctccgtcacc cgggtgcccc 15721 agcctgggcc cctccttgct gcagttgctt agcgcctgct gtgtgcaggg ctctgaactt 15781 ggcactgaga cacactgccc acccttgtga ggcctcctct ccgccgggga ggagaggtgc 15841 agtgatgagg tggctgcgtg cggaggtggg cggcagttgg ggtccctggg tggggaaggc 15901 gcccacacaa ggccctgcct gtggagggag ccaggcggcc agaacgggct gcctttggag 15961 gggttggggt ctgctgggta ggtgggagtt gtggccatga gagccctggg tctgaggagc 16021 agggagggcc ggctggactc cagtgggagg agggtgcttt gccaagggca gcggagttgg 16081 agggtggcca gaacaggtcc aagtgggtca gtgcccacaa gtgagtcagt gcccgcgggt 16141 gggtcagtgc ccgtgagtcc gtgcccgtca gtgcccgcgg ggacttgggg gtgttgtgct 16201 ggagtcaaga cttgaatggg gactggagga agaggaaggg atgatgggcc ccccaacccc 16261 gccctgggga agggaggttc caggcagctc caggaagaca gtgtgcctgc taggcgggga 16321 ctgcgggctg tcagcgcgaa acatgtgaag ggagggcctg ctgcattggc agccagtgct 16381 ccctgtcact tgttcaggag ggacggggtc cagaaaggag agggaaggac cagggctgag 16441 ctacaacgtg acttggggag gttttgaggg cgggggtgcc atcagggtca ccttcgtctg 16501 ctgatgggga ccgagggttc cttagaggag gaagcagcag gggatgatgg gattcgggtg 16561 tgcctggcgg ccatgccgcg cctcccgcac ctcctctgcc tcccatgggc actgctgtct 16621 ctgggtgcac cactagctct gctgtggggc cagctggcga cagtggagat gctgtgtgag 16681 gttcctttgg ggtcccgagc ttgggatgga ggcgggcagc ttcagggcct ggggtggaga 16741 gggagagggc aggtctggga cctcctgcct ggatctgtgg cctgtgtcct ggagcccaga 16801 gcaagctggg gtggcagtct ggctctgcgc tcctcctcac aactcttctc tctcgtcaat 16861 tcaggtactt cttcaataaa gaaggggatt ttaacaacct gacagctgca gcatttcata 16921 agaagtctca cctcttggtc actggctttg cttctggaat cttccatctt catgagctgc 16981 cagagtttaa cctcatccac tccctgaggt aagcctttgc tcgcagtggg gtgtggtttt 17041 atgcactcac tggccctgaa tctgagggcc cagccaagcc tgcccttagc agtgggggca 17101 gcagagccac ttggagcgcc ggctcaggcc acaggacggc agcccgaaac ctgcccggcc 17161 gctggtgctg gtcagtgcga cagtgttgcc gcccccactg gcagtcagca gcattggtct 17221 cgggtcagtc acagcctttc aagtttttgt ctccgctttt cttcatctgt gagacacaga 17281 gaacgaaaat actatgctca taaaattgtg aagttgcaag ttgttgaacc ctggcctccc 17341 ctctgggcct tctctgattc ttttctttga gacagagtcc tgctctgtcg cccaggctgg 17401 agtgcggtgg cgtaattcgg ttcactgcaa cctccgcctc ccgggctcaa atgatcctcc 17461 cacttcagcc tcctgaggag ctggggccac aggtgtgcgc cactacaccc ggctagtttt 17521 taaatttttt gtcgagacgg gtcttgctgg gttttcccag cctggtcttg aactcctggg 17581 ttcaagcaat ccttgcctcc gcctcccaat gtgctgggat gacaggcgtg agccactgca 17641 cccagccctc tgatacttga actgaagtga attgttggag tctggtgagt gggtggggta 17701 agctctgcac acagggatag aatctggtgt ctggggctgc cctgagtggg cactggggct 17761 tcagcccggg caggctggcc tcccacttac aggcccgttc tctgcagcat ttcagatcag 17821 agcatcgcct cagtggccat caatagctcg ggggactgga ttgcttttgg ctgttcaggt 17881 ttgtcccccg cctgggtggt agagatggac tccccattag ggaccagtgc tgcccggcta 17941 caggcatact tgacagccac ccactggggg tgccctcccc tcccccagtt gtcttccatg 18001 gggtgccctc tcccccagcc gcctttcaga aggggccctc ccctccccca gctgtctgcc 18061 atggggtgcc ctcccctcct ccagccacct ttcagggggt gccttcttct cttccagctg 18121 cctcccaggg gtgcccttcc tgctctggcc ctcctcaagc acctctctgt cttaagcccc 18181 ttgctcaggg gtcgggggtc atagcctgcc tcagttgtga cctgcaacca ctgggttgca 18241 cgggcggggc ccatcactgg ctggtgaagc tgcagcatcg ggcagtggtc ccagcctatg 18301 cttgggggtt tgtgcgtttc accccctgcc cggcagcttt ctcagtcctg tgccaagtgg 18361 gaaggtgggc cgggccagtc tgacctgggg tgggcctgag ccgggtaccc cagcttcccc 18421 cgtgtgcaca ggcctgggcc agctgctggt gtgggagtgg cagagtgagt cctacgtgct 18481 caagcagcag ggccacttca acagcatggt ggccctggcc tactcgcccg acggacagta 18541 catcgtgact ggcggggacg acggcaaggt aggctcctgt ccccgtcccg ttggcctctg 18601 tgcctggggg gctgtgaaga tgcagtggtc tgtggggccg tgtcttgacg gtagggctgg 18661 tgttcacagc ttcacccggg ctcatgagct cagtaggcgt ttaaaaacat acaaaaatgt 18721 gaatggagag ggttgggtgt ccctctttcc cctttctgac ttggtctcct gcacggtggt 18781 ggctgcgtgg cggggagagg gtctgtggtg ccggcagccg gcatccagcc agtgtccaag 18841 gggctccagt acgtggccga gggtgggtcc atggtactgg cagctggcat ccggccagca 18901 tccacagggt tctggtgtgt gcagtgcccc tgcgggccat ctaggcctta aatgggctcc 18961 tcctggaggt gacaggaaga gactggggca gacggcccac tcaccaccca ctcagcacag 19021 cctcaggcct gttggggagg ctgagtgagc cagggcagcc gtgggcacag gggcaaggct 19081 ctcaggatgt ggcgtcacat gcgggggtct cagggatggg gtgtcagttg caggtggggg 19141 tctcagggat ggggtatcag tcacagttgg gggtctcagg gatggggttt cagtcatgtg 19201 cggggatctc ggatggagtg tcagtcacag ttgggggtct cagggatggg gtattactca 19261 cgggtagggt ctcagggatg gggcagtaca ggggcaagca tcttaggtgg ggcggcatgg 19321 ggacagggat cctggccgaa gccgtggcac acagggttgg ggtctgcccc cgcccctcct 19381 gtcctgcctc aggtcaaggt gtggaacacc ctcagcggct tctgcttcgt cacttttacg 19441 gagcactcca gcggggtgac cggtgtgacc tttactgcca ccggctacgt tgtggtgacc 19501 tcatccatgg acgggaccgt gcgagccttt gaccttcaca ggtgatgttt ttgctccgga 19561 ttggcttggg gcaggcttcc cccacgcaga ggtgaaggtg gagggtgagg ctgcagtgca 19621 gctgctgggg caggggaccc tggcacgtga ctctgacctt gcctcttctc tgcaggtacc 19681 gaaacttccg caccttcacc tctccacgcc ccacccagtt ctcctgtgtg gcggtggatg 19741 cgagcggtga gatcgtctct gcaggggcgc aggactcctt tgagattttc gtgtggtcca 19801 tgcagacagg caggctcctt gatgtaagca ccctgagggg ctgggctggg gctcaggagg 19861 ggccctcctg tctcctggga gaggagcgca gggagctgag gttacactcc gagggcccat 19921 gcccctggcc cagagctgcc cagagagccg cctcggccgc cacctccact cttcacgcct 19981 ccttgctcta cggtaggttt tgtctggaca cgaagggccc atcagtggtc tgtgttttaa 20041 cccaatgaag tccgtcctgg ccagtgcctc ctgggacaag acagtgcgcc tatgggacat 20101 gtttgacagc tggaggacca aggagacgct ggccctgacc tctgatggtg agcacgaggc 20161 agcaggcagg agcagcggcc tggggaccag cagcatgctg tcacacccac tgccctgttc 20221 ctcgttgtgg gtttggtgtg aaccttctgg tgggaagcag ggggtggctt ttatccagac 20281 ctgctggtgt ggcagccctt gaggttggca gtacctagga gacagcagat tggcgagtca 20341 ggccagcgtg gcttccacaa gtccttgtgg ggctcccgct ctgtgggcaa gtacgccagg 20401 caggagagaa cagctccccg agggtgaagg gctgctgccg cagctgcttc ctgattttcg 20461 gagttgcttt ggagctctgc cagggctgtg tcactgggag aaggacacag ggaccatggg 20521 gggcccagac gctggatgcg gccagtcccg gaatctgcag actgggagaa ggatgagggg 20581 gccataggtg ggcccggaag ccagatgcgg ctgcccaggg atctgcagac tggcctcagg 20641 ccactgcctg tctttctgac ggggctgtac tagaggcagc cgtggcctcc tgtgttcata 20701 tggtctgtgg ctgtcaatgc ctcagcggcg gtgttgaaag accacgtggc tcacaaagac 20761 ccagcccatc catgtccggg tgtggacaga ctctgccatc ccaggaggct ggttctctct 20821 gcttcccgct gcctcacccc acccccaggc ctggcattgg ccaggctttt ggcatcacat 20881 gtggcttctt tgcagaggct ctgtattcta gtctgtccag ccgccctcag taaccaccat 20941 cactcaccca tcagcctcac cttcccagcg ccccctccat cggcctttgt gggtcctgtg 21001 gccccagctg gaccgcctgg tgctcttcac tggcgctgag ttgttcctca cctgtctggg 21061 ttggagcttc tcctgggagg agattcttgt gtaccagacc aaacaggcgc tggtctctta 21121 ggggtgccag ggcacggcct ggtctccagc caagggtggt tggtggcctg cctgcgggag 21181 actggggtct tcgttgggcc ttactttgca tctctgttta gctctggctg tgacttttcg 21241 ccctgatggt gcggagctgg ctgtggccac actgaactca cagatcacct tctgggaccc 21301 tgagaacgcg gtgcagacgg gctccattga gggcaggcat gacctcaaga ctggcaggaa 21361 ggagctggac aagattacag ccaagcacgc ggccaagggg aagtgagtgt cagcatcggg 21421 cctcctgatt tgagacccca gccagccacc aggctcctgc agcttctcca tttttggcct 21481 tgttctttgt tcctgagacc gctgagtccc tgtgggtgga ggcaggtggt cctgagccct 21541 ctggggagag tccatgtcct ccttaactcc tgggatcgca cagcccccag cacaggagga 21601 cgtctggatt tactgaacat aaacttgttc agctgagagc ttgcatccac ttccaaacct 21661 tgatcctgaa ttatgacatc gtcctggtgt gcagagggga gagcagccgg gtgcccgagc 21721 tcacactgga gacctggcac ttggggaggc tgaggcgggt ggtttgcttg agcccagaag 21781 tttgagacca gcctgggcta cgtggcaaaa tgccatctct acaaaaagta gaaaaattag 21841 ccaggtgtgg tggcacatgt ctgtggtcct agctagtcgg gaagctgagg caggaggatc 21901 gcttgaaccc gggaggtcaa ggctgtagtg aggatggcgt catgccattg cactccagcc 21961 tgggtgctag agcaagaccc tgtctcaaaa acaaaacaaa caaaagcaga aaacagccct 22021 ggcagcctcc taagagtaaa aatgcaggtc ctcagtcacc tcacttgact agagagtgct 22081 acgtggatgg tgaagacata gtgtcccctt gtctcaagga gtcgggccag cacagcactg 22141 accaaattgc ctgcgcgggg tcctggctgg gctgtgaggc tcaactgggt gggggtcagg 22201 cttaggagaa gggggaggat ctgtgggatg tgcccggggt gggtggcctt gggggctgga 22261 cctgtgtccc aggagctcag aacctgttca gcagaaggtc ttcctccaga aagaagtagc 22321 gaccctgtct tggaagtggt atgaatgcct gtgtgcagtc ctcatgggtt cgccctgagt 22381 gtggaactgc tgggtgctgg tggttctgtt tccagcttgg aggagctgca cggttgccat 22441 tcccacgagg aacgctgtcg gcttctggct tctccacctc ctcgccagca cttgttgctt 22501 tattattatt atttttttaa tgagagctgt tggggtggat gtgaattcgt atctcattgt 22561 agttgtgatt gtggtttctc tagtgacatt cagcattttc tgtctgcggc ttggccgttt 22621 gtaagtcttc tttggagaag tgtctgttca ggtctcttgc ctgtttttga attgggttgt 22681 ttggattttg gttgtggatt ttaggggttt ctctgatacc tgatgtgcag atatttgttc 22741 cgctttgtga actgtctttc caccgttttg gtggcactct tagttctgat gatgtccact 22801 ttgctcattt tttctgttgt tctttgttct ctactgtcat aactaaggga ccttgcccac 22861 cccgaggcgt ctgttgagcg gggtggtaag ggctggtttc tggtttctgc aggactaagc 22921 agacgtgact gagagtaggc tgcttagggt ggaggccgac tggtcgccgt tttgtgctga 22981 cctcagcagg gaacactgtt ccttgagcca cttgttcctt ttgtgaaaaa gacattgatt 23041 ctcaggaagc ggaagcatct aagggtggtg ctggctcaga gcggaaccag gtgatggcac 23101 cattggacat tgtgggtttt aaaagtctcg cgtgagttga ggcagcgagc cttggcgaaa 23161 gttgctgagg ctggtgtttg cttcctggag ggctggtata ggatctcctc tatttcctaa 23221 ggttagtttt tattttattt atttttattt ttatttttat ttttgatttt tgacacggtc 23281 tcgctcttgt cacccaggct agagtacagg agtgcaatca tggctcactg cagcctccaa 23341 cccctgggct caagtgatcc tcccacatca gcctcctgag tagctgggac tacaggcatg 23401 tgccaccgca ccctgttaat tttatttttt gtagaaacag ggtcttggta tattgtccaa 23461 gcttgtcttg aactcctgga ctcaagcaat cttcccaact cagcctccca aagtgctgag 23521 ataacaggcg ggagccacca tgcaggccta aagttggttt ttaggagcca tacgctgagg 23581 tttccgtggt ggccaccgta agtccatgtc tcctctccac ccagggcctt caccgccctg 23641 tgctactctg cagacggcca cagcatcctg gcgggaggca tgtccaagtt cgtgtgcatc 23701 taccacgtcc gtgagcagat tctcatgaag aggttcgaga tctcttgcaa cctgtctttg 23761 gacgccatgg aggtgagccg ccagcgcggg gccggatgga tgttgcttcc aatgcaggtg 23821 gaatgcgtcg ggctcctgcg ttctgcactc ctggctccat ggggccttgg gcacagttgt 23881 ggtgcttgtc ctgtatctaa gtgcacgggc ccctcacctg ccccagcaga gtccggccgt 23941 gaagccagca cctgcctcag ctgtgtgtgg agccacggtc cccaaccctt ttggcatttg 24001 ggaccggttt cgtggaaggc aatttttcca cagacagggt tggggggatt gattttggga 24061 tgaaactgtt tcacctcaga tcaagatcac caggcattag attctcagaa ggagggtgca 24121 acctagatcc ctcacgtgtc gttcacaata gggttcgcac tcctgtgaga atctaatgct 24181 gccactgatc tgtcaggagg cggagctcag gtggtgatgc gagcaatggg gagcggctgt 24241 aaattcagat gaagcttcac tcattcacct gcccaccact cacctcctgc cgtgcagcct 24301 gtttcctaac agaccatggg ccagtaccag gggttgggaa ccactgctgt ggatgaccct 24361 gccgaggctg gttgctagga ggctggagct gtgaggggca gtgcccagca cacgctggct 24421 caggagacac ggacatctgt cagggaagga gtgaacgaag agggtgctgg taggccctgc 24481 ccgatggctt cctgcccggg gtcctgtgta caaggctctg ccttcactgt ccccactggt 24541 gcctgccctt ccccctcagc acagccctcg gctcaggtcc cactcacatg ctggctcctg 24601 caggacctca gaatgagagt gaggaagccc tggtagcctc taagcagggg tgtggctgct 24661 gggttcacct gtccacgccc catcggggca gaagcccact gtcccctaaa tgtttctggg 24721 ggaaagtgtc cttggcattt tcacaaggga gtagacggcc atcctcgctg tgagccccag 24781 cctttgtgat ggtggcagca atgccagcat gtagacctca gtccatcccc ctccgcagcc 24841 tccgcacctg ccctgctcca cagcttccac acccaccctg tctgcagcta gctctgcagg 24901 aagagtccct ggggcactgc agggctctct ccccgtcctt gttctagaca gggctgaggc 24961 ttgtctggtg ttggggtcaa agagcaaggg taggagacga gggccaggcg gccaggccat 25021 cagaagcaag gcgtccactg cttttgctgt tctaggaatt tttgaaccga agaaaaatga 25081 cagagtttgg caacctggca ctaattgatc aggatgctgg gcaggaggat ggagtcgcga 25141 taccactgcc aggcgtcagg aaaggtgagc agaggttcct cccgcatctg cccaccactc 25201 accggtcctg ggtgaccacg cattcatgcc cctggttggg gctgcagtgt gtggaaataa 25261 cagttactac atggtgacat ttgctggaca ctgggggtgt caaaggccca gtgggttggt 25321 gctggtgccg tcagcccctg ggcaggagaa gctgcccatg tgttgctaca cgtggcctcc 25381 ctgccctggc gggacccgtc catttgctct tgggactggg tgccccgtct gctgcacagc 25441 acacggagca ttctggcgtg aggccatgtg cgccgcacag agagccaggt gctctagaag 25501 atctgggtcc caggtagacc atcctgatgg gacactcgcc agccagaaac cctgagtgct 25561 tccatatgca ggcccagggt accgtgttga gcacagcaga cccactcctg ggctgggctg 25621 gggccctttt ctgggttgga cacacagtct gctggccggg ctggcttttt tgagcatctg 25681 ccccatgggc tgggccatct gtttggcagc tcacccggaa cagcctccgc ggccccagca 25741 cgggcccacc tgccgcagct gcagcgaggg aagggctgct tctgcctccg ctttgtgcag 25801 gacccaatga gtagaggctg ctgctcttga tggggctccc acagcacccc ccagccagcc 25861 gggagaccgc aggatgactt gcttcgtggg gagagcagcc tggatatgct gtccagtgat 25921 cattaacggt tcttttttcc ccaggtgaca tgagttctcg gcacttcaaa cctgagatca 25981 gggtgacctc actccgcttc tctcccactg gtgagcactg agccatgggc ttttggtgcc 26041 ggtgggcact ggggcatctg tgccatagca aggctggcca ccatggagct gtctgcaggc 26101 tctgggctgt gccctgtgcc cctgggtaag aacatggccc tgcaaaccct gcaggccaca 26161 agatggctgt tgtccctgag gatggatggc tgtcacggtg ccaacctcag atgcactgag 26221 ctttgaaaca tggcaggcct ggtatgccct tgctctagca gctccccgag tgccctggac 26281 gtgctgcctg ggggcggctc tgccatcacg gagagcccgt ctcctgcgca ctgcccccat 26341 cctctcgcag tggggaggag ggtctgtgcc acaggcctgt ggagcctggc ggtgcacatg 26401 gtccccacac agaggagcca ggagcagggc tattggcaag cgtgctccag cctagactga 26461 gtgtcagagc caggggcaga gaggcccatg tgagccaggg gtctgtgccc tcgggcccaa 26521 gggccaaatc tggctggcca cctgctttgg tcactgatgt tttactggga cattttactg 26581 ggcgtttttt tgttggcgga tccttctggc tgtctctagt gatccgtgca gcactgagta 26641 ggtgcagcag agaccacagg gcctgcggag ctgaagacgc ctgagctaga cagaagactt 26701 ggctggctcc cggcgtcaag agaggaggac gcatgcgtgt acatatttgc ttatatgtgc 26761 atccctggga gagtaacttc tgggatgatc cattcattcc ttcccagggg agtagtgggg 26821 tgggggctag aggcagacgt cccggttcct cctgattttt gcactttgtc ccaggggaat 26881 gtacttaata ctgtttcttt aaaaaaaaaa aaaaagaaaa aaaatgttgg gaggaatgag 26941 cttaaatttt taggaaagca ctgcttaatt tcccagggcg ctgctgggcg gccaccacca 27001 cggagggact cctcatctac tccctggaca cccgcgtgct ctttgacccg tttgagctgg 27061 acaccagcgt cacccccggg agggtgcgcg aggcactgcg ccagcaggac ttcaccaggg 27121 ccatcctcat ggccctccgg ctcaacgaga gcaaactggt gcaggaggcc ctggaggcgg 27181 tgcccagggg cgagagtgag ttggggcttc ggtgtcgggc gccggggtca tcctctcttc 27241 ccctggtttg agttggtggc agaataagca tgattttttc tccttttgca gttgaagtgg 27301 tcacctcctc ccttcctgaa ctgtatgtgg agaaagtgct ggagttttta gcttcctcct 27361 ttgaagtgtc tcgccacctg gaattctacc tcctctggac tcacaaactg ctcatgttgc 27421 acggacagaa gctgaagtcc aggtagaggg tctcccccgc agttcatcgg tggctcaggt 27481 cactggacca gcttgtcctg ctggataaaa gggaactttg acatgccaat gcagcatgaa 27541 gtgagcccac tgtaccacgg ggtgagcccc gcacgcttag ggagtctccc tttcagagcc 27601 gggacgctgc tgcctgtcat tcagttcctc cagaagagca tccagcggca cctggacgac 27661 ctgtcgaaac tgtacgtgtg ggtgcagggc ctgggggtgg gtggggtgca cggtcccctg 27721 ctggtccaca ctgtggagtg ctccttgtgt catctgattt gaggacaaaa ggaaccagtg 27781 ctggaggctt gtcttctcac ctgcgggtga actgcacctg cccggccacc ccgtggttga 27841 gcagcttggg aaatggcctg gcgctgaatg gaaagcagtg agcactgtgg caggcagaca 27901 ggctgcaggt ggcgcgctcc aggtgcggcc ggaatcctcc aggctcttgg agggattgcc 27961 gcattcgtag gaccaggcag cctttcctga atttgcaggt cacttgcaca ggcagtgggg 28021 gtggtcacag gactgtgagg ccaggcagcc tggagtcctg gaggtgtggc tgccctgagc 28081 tgtggctccc ctagaaaagg gtgctgttgc tgctgctgcg aggattcatc cgcgggccta 28141 ggacagggcc tggccctccg cagtgctcac tctgcagggg tgcccgcctc ttctcctgtc 28201 atttctttgt ggtttgtgcc ttttttgtcc ctaccaattc accttcccta aaggtcctgc 28261 agagccggga ctaggcaggc ctcagagtag ccatgtgagg gatatcaccc tctgttgcca 28321 atcagttgcc gggtttccca gctgcgtttc tgaacggtct gttctctgtc ctgtttgcga 28381 gttgctctgt ccagggttca ggggcagagc ctgagtttgg gccttgtggg ttcatgtggt 28441 cctcctgtct gtccttgaag tgtcactcag tctttggaaa gagtcctgga gtccaggtcg 28501 agcctgtcct cccctcgtgt tctgggtgcc ctacagaagg ctgggatgtg ccctgatcca 28561 cagttttggt gccaggcctt ccctcggctg agatgtcctg gacttttcac agagctctgt 28621 aggggaggag ggttgtctag ctcccttatt tatgtattta tttatttttt aattcattat 28681 tattattatt attatttgag atgaagtctc gctctgttgc ttaggctgga gtgcagtggc 28741 gtgatctcgg cacattgcaa cctctgcctc ccaggttcaa gcaattctcc tgcctcagcc 28801 tcccaagtag ctgtggttac aggcacacgc caccaccccc agctaatttt tgtattttta 28861 gtagagacag ggtttcacct tgttggccag gctggtctga aactcctgac ctcaggtgat 28921 ctgcctgcct ctgcctctca aacaaagtgc tgggattata ggtatgagcc accatacccg 28981 gcccagctcc tttagattta ttccctctta tgggatgttt ttagagttca cgtaagtgca 29041 ttgttccttg gacctcctgt ctaggcgttc actgccaatc tgtagggttg ccattggctg 29101 tcaggaagag tcctgcacag agggtcctgt gccctgcaga tgcgatgggg cgcttccttc 29161 ctttctcatc catttatctt tcagttctcc tttcctatgt aaagtgatgg tggcatcact 29221 tttctccttt attccattaa ctcagaaaat taactgattc agagcgtgcc ttcctcccct 29281 ctcgctccac aacaccacat agctctgcag cggtctcttc ctaagtgttt ggtggaactt 29341 gctggtgaag ccatctagat tgggcccagc ggaagggctg agctgccagg atgaccccaa 29401 agggatggga tgttacagtt gggtccttgg gtccttgaag gcctttgctg ccccagggct 29461 ggatggggca gaaccagatt gttggggcag ggaaaagagg aaaaaaatgg ctgctggggt 29521 ggaggaaccg gcatgtccag cccctgctgc ttttctcagg ggtcacagag gaggtggttg 29581 ggtgcaggac aggccagggc ctgggcgttg ctggcattga ccatcccctg tgtgattgac 29641 agctgtagct ggaaccacta taacatgcag tacgcactag cagtttccaa gcagcggggc 29701 acaaaacgct ccctagaccc gctgggaagt gaggaggagg cagaagcatc tgaagatgac 29761 agcctgcatc tgcttggagg aggaggcaga gactcagaag aagagatgct ggcctagagc 29821 cagccggttg cagcgttgga ttgtgccggc taagacctgc cagggagatg ggacccttgt 29881 gccacctggg ccagcaaaga ggaggggtcc agagaacagc tgaaatactg tcactagtgg 29941 tagtgacttg cttttcctgt gcacacatgt agcccatcag gacagcgagc cgacgggtca 30001 cgccaggggc cggcacgcac tggcacctgg ccccaggagc ggggccgtgt gaacggtgat 30061 gaatgttgaa aatgcgtctc agagaggtat tcacatgaac tttgtatgag acttatttat 30121 atctttaaca taaaggtttg ataaagaact tagggattaa aaaaaagatg gagttttcta 30181 acgtgaggat gaagtctaca cttcagatta aaaagggtta tgttgtatcc tgctgtcttg 30241 tgggggctca agacctgcct gccttaccac cagggcctcc gtcctgggga agcaggctga 30301 cagagcaggg tgctcctgct gtctagcggg ggctctgcct tgtgacctgt gcatatcctt 30361 ggggtcaggg gcacagagcc tctccctgac ctcttgggga tgctttggag tctggggctg 30421 ggattggcct gtcttggtga gctcactcag acctgcacca gctctgcgtg ggccacacgg 30481 aggaggagac agctcctccc ctctgcagtg cccttgggaa tagtcatcat gagagaggct 30541 tgggtctcca tcatggtagg gcatcctggt ccccacagct cagtaacagc agcctccagg 30601 gtcaggacct gaggctgaca ctgaggtggg cacttcaccg actgccagtg ctgtctgcca 30661 ggcccagcct gtccctcgcc agcggtgtgg cccccaagac cctgggtagt acctggctcc 30721 cttgcttatc tgggaagtgg agctgatatc tgcctgccgg cctcccgggg gtggcgtgag 30781 gccagatgtg ctagagagtg gagatgaata cgacaggcag tgccttgtgg cccacagcac 30841 tgtacctgca ggcctgcaga ccagaggtct ttgtagaagg gaggagctca ggccctgatg 30901 acaccaggtg ttctgtggct gcacaatctc cgtgacccag acgagagaga atccttagct 30961 atcgcctacc caaagcacaa gttccagtgt tcccctccct tgcctcgccc ttttccgtga 31021 aacaatttcc agcgttcccc ctcctcgcct cacccttttc tgtgtcacat gtgcatggcg 31081 atggacacct gagtagtcgc ccctgggaca actggagctg cggctctgca gtggggtctg 31141 ggagctcccg gccaggggca gctcccgcac acttggtaga gacagggtcc tggcccccac 31201 accgactgag tgagagtctg cattcggcat ttcctttctg agtggcgctt ggtgtttttt 31261 tgtgccatcc cacaacccca tcgctcccac cagctgatgc tccctggctg gccgctcccc 31321 aggagctgag aagcctgaag agggccaggc acggaggagc aggaacgagg acagtcaaat 31381 gtgccaagct tgtctttggt tgtaactggt aatgtcccat gtctgttatt ctagttcccc 31441 actatcttgc aaggattggc agtaaagtcc tagtgatgga caacgcggca gtggggaaag 31501 tgacagccgt ggctagatca tcagatgtgt ctgtagaatc cggtgtgtcc ctgtgaacat 31561 ccgccagcag taaccattca cattatttca aaaacctgca ttattaggtt gaaaatgggt 31621 aaatcgggga aaaagccagc acttatccag ctattcctct gtgaactaac ccatgcataa 31681 caaacagatt tttttaatgg aagaattcca gctaggaccc gagcagctgg gatcttcaca 31741 ctgaccgtcc atgctgtgaa cgtaaaaaac aaaaacaaaa aaacaaaccc aaacagcgcc 31801 cgggaatggt ggctcacgcc tgtaatccca gcactttggg agtccgaggc gggcagatca 31861 cctgagctcg ggagttcaag accagccaga ccaacatgga gaaaccccgt ctctactaaa 31921 aatacaaaat tagcctggcg tggtggcgca tgcctgtagt cccagctact tgggaggctg 31981 aggcagaaga gtcgcttgaa cctgggaggc ggaggttgcg ttgagccgac ttcgcgccat 32041 tgcactccag cccgggcaac aagagggaaa ctccgtctca aaaaaaaaaa aaaaaaaacc 32101 caaacagaat cggcggctca cttgccccca gactaaccca gatcaagccc ctgggtggaa 32161 ctgcagtttc caggatacaa catcccgggg tcagtaaaac ccggaggggt cgctccgtgg 32221 gagccggagc cgcgaggaga cagtcatgga agcggagggg tagtcctgac cccgcgtggg 32281 tcctgacgcc gagattaaga cgagtggcca tttaggagga tgtggaccct ggacgctcgt 32341 ggtgagttaa ggatgagacg gaggtaaggt aagaagcgcc ggactgagcc gccccgaggc 32401 agccttgctc tgcggatggc ggaaagggtg cgccgcctgt ggaggcgtcc gatgggggcg 32461 gggctgggga ccccgggagt caccagcatc cctcagcctc gagcacgagc cctcagccac 32521 caccggaagg aagacagggt tcccggaact ctattcggag gctctgcgca ggcgcggccc 32581 ccgcccaccg gcgaactcac tggatagggc tgagacgggg gcgggtcttg gctccgccca 32641 gaaggctgcg caggcgcagt cccgacgagc aacgcgtttg tagaggggtg ggtgcgcacg 32701 ctctgtccct gcgtgacctt ccgaccccgc tgtcctcacc gcaatggcgg ctgtgagggt 32761 cctggtggcc tcgaggctcg ctgcggcatc tgcattcacg tccctgtccc ccggcggtcg 32821 gacgccttcc cagcgcgcag cccttcacct ctccgtgccg cgccccgcgg ccagggtcgc 32881 gctggtgagt ggacggaggg ggtgaggtca gctcccgcct ccagagatca gccttcgctg 32941 ttctctgccc aaggtcggcc cctcttccta ggtccgctcc cgccgcccca cgtcggctcc 33001 ttcaccccaa gtcagctcct gcggcccagg tcggccccat tttcccgggt cagcgcccgc 33061 ggcccaggtc ggcccctttc cccccagtca actcccgccg cccaggtcgg cccctttccc 33121 ccgggtcagc tcccgctcct ctcgcaggtg ctgtctggat gcggagtcta cgatgggacc 33181 gagatccacg aggcctcggc gtaagtcctc aggggcagct ggtcctccac cccgggggcc 33241 tcgagaaggc ctctccacca gtgaagttta ctcttttttg aacagcttag agaaagcatg 33301 tattttattt tatttttact ttttgtcttt tgagacagcg cccagtctgg agtgcaatgg 33361 cgcgatcttg gctcactgcg gcctccgcct cctgggttca agcgattctc ctgcctcagc 33421 cttccgagta gctgggacta caagcatgca ccaccacacc caactaattt gtgtattttt 33481 ggtagatacg gagtttcacc atgttggcca ggctggtctc aaactcctga cctcaagtga 33541 tcctcccgct ttggcctccc aaagtgctgg gattacaggc atgagccact gcacccggcc 33601 gagagagctt gtattttaaa taccctattc ttggctaaaa aaaagcaaaa acgacttgca 33661 aaacccaggg agacgctaag aaagaacagt ttttaagtgt cgcttatgct gtcacctaga 33721 gcctagaaat gacttagcat ttcacttgag tgtttttcag gacacacaca ttttgaaaac 33781 actgagatca cactcacccg tgttttgcaa atgtttctga cgcatgggaa gtaattccca 33841 ttcattcagt tctcaacatc attttaacag ttgcgtggta tttctgggtc ctagacaggt 33901 tactgtttac gtaactggtc agctgttgct gagtgcttgg tggcctgatt tcctgtgcta 33961 ctggaaatga ctgcaggccg aaccagtagg gcaggaggcc accacgcgtg ccaggtacct 34021 gtagttccca tgtggttcag ggtgaactgg agaggggaga gctgggtggt ggcgaaaggg 34081 gtgtcaggcc atggagagtg gcctgaggct ggtggctttg tgggtggccc accttggctg 34141 gttggagttg gcactaggca gagtgagctg ggggaaaggg ccaggatgtg tttcatgatt 34201 tggccttggt gggatagaaa atagccccag aatagcttga gccctcccag gcagaacggc 34261 agggtcttgt gtctggggtg tctgggtgga tttcttatgg gtagcggaac ccaagtgcca 34321 gggccaggtg ctgacctctc cctctggcaa gtcactgcac ttcatgcctc ggtttcctcc 34381 tccacgtaag gggcagtgag ggcttagtgt gtgggtgggc ttagccacct atctcctgtt 34441 ggcactggga gttgcggatg ccagcgtgag cccccataat gtcagctgtt gtccccagga 34501 ggagcttctg ttgcccttgc ccttggggag atggagctgc ctggcctgtg gtgttgtcct 34561 ggggtctctt gtgtctctcc caccatctgt gatgggaaag ttgagggaga ctagaaattg 34621 aactaagtag tgaccgaaag ccccgtcaca ccgtctgtgc acgtggcact ctgagcacac 34681 tgctgcttct cactcttggt gcttgcagca agcaactgga ggagggcagc caggccctgg 34741 gtgtaggtag agcactcagc agccggggac ccgcacgcct ccaggctgtg ccctcagtgg 34801 agtcacccgg cccccatacc ccagtttgca aagggagctg ttgggaccag ctgatctcca 34861 tagcccactt ctgagggttt gtcatagtcg taacccacca accgacagtg aattggaatg 34921 ggggtgtggg ggacgtttgg cttggtatgg aaccaactgt aactttttgc agagcaagta 34981 agatattaat atacattagg aatttgctcc tggtacttag ctaaaatcct gtttttaaaa 35041 atgtgttaac acaaatctga gtggggcacg ttttgaagag ttgtctctgg ttttcccccg 35101 tgcaggatcc tggtgcacct gagccgtgga ggggctgaag tccagatctt tgctcctgac 35161 gtccctcaga tgcacgtgat tgaccacacc aaggggcagc cgtccgaagg cgagagcagg 35221 tgtgggggtg ggattggaac ttgcttcttg tctacctccc acggtgcagc ctttttttgc 35281 tggcttagct taaaccaagt cttgtgtgca aatgaacgtc cttctgagca gctgtgctcg 35341 ggctgttcca gaagtaatgc tgccaaggct caggagtgcc ttccttccag tactcacaga 35401 aggactgaaa atggcgaccg gatcgcttcc cactgagtca ggctgctcat accttggtct 35461 taagcaaagg tggtggttct cagcttggtc gcccaccttc ttcccctgtc tgggctccag 35521 atctgttttg ctggttccca gcagcatgtc tgttactgcg gtgtgtgtgg ttctggcctg 35581 tgtggcctga gatccccttc ccttgcctgt cctgcctgac agccctgtgc aggccattcc 35641 aaggtgcctt ccaactggcc tcggaccacg agaggtcgga gctggagatt cgaggaagaa 35701 gccatagctg cctcctgggg ctgtgcttct caccagtggg gacccctgtg atttcttttc 35761 ctgcctggga ccctggcttc tgcacctgga cttgcacctt cggcccccac acctgggggc 35821 tagcagcttc ctgcggtggc tggccttcgg gttgcttcac tggtcctgag tcctggcttc 35881 ttaccttccc ctctgagtcc caccgtcctg atgagtgcac agaattctgt ctgctttaga 35941 cactcggaga gacttccatt tcccatgggc cttgactgca gaagattcag agtgatttgg 36001 cgcaggttct gaagtgctga tcggaagagc ccagcataga tttagcgttg ctagaaccat 36061 gttgtctttc caggtagtga ccaaacatgc tgacctggaa gtaagtcccc gtgtggctga 36121 gagccacgtg tgtggagcgt gttgcctgtg ggctgtgccc ctgccgcgct ccacagtcct 36181 tcacctgcat gctgaggctt tttgccctgt gctctttccc aggaatgttt tgaccgagtc 36241 tgcgaggatc gcccgtggca aaatcacaga cctggccaac ctcagtgcag ccaaccatga 36301 tgctgccatc tttccaggag gctttggagc ggctaaaaac ctgtgcgtat ttgagctcca 36361 aggtcttccg ctttccatgt ggagcagatg ggaagggggt gcacctgtct gctgtccaat 36421 gtggtcagtg gagcggcgat gggagggggt ggtgatagga gcagtctctg ggtttagggt 36481 ctgggcaggg cacgtgcctg ccctctgact tgggagctgt tcttagggtt gggtcttagg 36541 tagcgttgta gcgagccatg cttttgtttc tgcttctctg ctgcctcttc tctcagcatt 36601 cctggtaccc agcagcttgt cagagtgggt ttggggctcc ctcttatctc atcactgtgc 36661 tgtaggtctc gtcacgttga aatcagtctc tgagtattta ctccttgcaa gtgccagttt 36721 tttttctctc tctctctctt tttttttttt tttttctgta gagatgaggt cttggtctgt 36781 tgcccaggca gggatgcagt ggcacgatcc tagttcactg cagccatgat ctcctgggct 36841 caagccatcc tccagcctca tcctcctgag tagccgagac tataggcgca tgtcaccatg 36901 cccatggttt tgtttttttt tttttttggc catgtatgta gcccaaattg gtctggaact 36961 tctgggctcg agtaaccctc ctgcctccca aagtgctggg attccaggcc tgagccaccg 37021 cgcctggccc caattctctt atggaacctt gcaggatcag aaatgggttt ccttcttggt 37081 agtttgggcg gcggtcctga caggaagaag ggaggccgcc ttggcgtggc tgttttggta 37141 gctggaccgt gaatcatgga ggcctggctg gctcccacac ggtggcgtcc ttgagttcca 37201 gatcctcttt ctgttctctg tgtctcctct ctctctttgc acattcatgg aggttctctg 37261 ggcaccgccc ccatggccct gcctgagagg tggaggtggt gccaggctga ccctccaggg 37321 acagctctgc cacctgcagg agcactggcc acagtgcatc ggagcgtgca ggtggcttct 37381 gtgccatggt gtgggtggca gccccagcac ggccggcccc ggtcggcagc tcctgtttca 37441 agggcaggtt ggaggctggc tggccgcttg ccccaccctg ggcagcctgc acccagagtc 37501 gaggggtagg cagtctccat ccgaagaatg cgcccctcct tatcccgaaa cctcagaggc 37561 ctccttcctc agatgccagc tgttttctca gtgcccttgg ccgtgccctc acccctctct 37621 caccggaagc cccaccccag cctgtcgtcc ccggtctcct gaaagccagc aagcctcgtc 37681 ggtgccgccc acacccctga cctcacgcag ccccacgtca ggctcgatct tttgtccagt 37741 tgtccctgcg tcgctgccgt ctccacgagg ctctgaccac tgacaccgtg tgtggcgggg 37801 cgcaggggca gctggagggt gtgctggggg ctccgtgccc tggcatggct cctccacatt 37861 ttgccaaccc agaggcacag gcgtgaggtg tgtggtggtt ctcagggtca gtgggtcttg 37921 gccttggctt tcaaggtgtt gatgggccct gggactgatg tgcccgtgtg tctgagccca 37981 cgtcccctgg gcgaagctcc cattttcact gacctgtcca gagcgttctg ggtgagcctg 38041 ccctgcgtgt ccatggtgcg gcgcgctcta atgagctgca gccgtggcca ctctgcaccc 38101 tctgagacag tcacccatgg ggacggcgaa cacccactgc ctccagaaga ttcccagcac 38161 tctgccagca aagccatgtg gaaaatgctg gggcagagga ctggggtgcg tgagggtggc 38221 gtagagccca ggggtgggtt cgtgggaagg tgtgtccact tgtagatttg gtggcattgc 38281 cagatcacac gccccatggg cagtgtggag agacctcccc tgccccgcgt gccctccaca 38341 gctcctgctc ctcggcctga ggtgtggatg ctggtgtcgc aggtaaactg gcgtttcttt 38401 cacggtcatc gaagcaagag tgcttggtat gcctggaggc cgactgcgtt tctttttttg 38461 ggaactcctg gagtttcctg ctgccccttt gttttaagga aaaactctca caccgtgttt 38521 ctcctcctcc caccacaaca gttgtccaca cagaaggtgg gtgtgaaggg tcttccccat 38581 gcaccacgca ggggtcgctg ctgccttcta attcagttcc aacagtatgg acccggggta 38641 gcgtccgatc gcacgggttg agggctccgt ccccaaaact gccctctgac acataccggt 38701 aaactgtctg agcctctgga actgaccaat tggcttcaag ttggggttcc cacaaccccg 38761 tttgctggcg tggctcacag tgctcaggga aacatgacgt cagcattctc ccccggggag 38821 tgggggaccc tctcaggaga gggtcctgag acccacagtc agaaaggcgg ggaacattag 38881 agtcctgctt tggggcaggt aaagggcagg cgggaaggtc agaggctgcc cgaggcccga 38941 cacagccaac gtcgtaacag aaagctaaca agcagcacgg gagttaggag ccggaaacca 39001 gcgtctgagt ctcagcaccg cagcccgggt tctgccagct gctccggaag gttccagggc 39061 gtgggccctc tcgccctggt ggcctcctgc tgcaccactt cctgatcctc ctgcttccta 39121 tgcttatttc tgtttaaaaa acaggaaaat atgttcacat caaacgcgca tatgtgcagc 39181 catgcccatg tcccttgctg cagctgactg agggtctgga gctgtgccta ggaagccccc 39241 caggaataga ccctggttgt gaggctgtct gaggaggcct tttgtctccc ggtaggagca 39301 cgtttgccgt ggacgggaaa gattgcaagg tgaataaaga agtggagcgt gtcctgaagg 39361 agttccacca ggccgggaag cccatcgggt gtgtacagcc gtgcaggcct cggggaggga 39421 gggaggaggg tgcgtgggcc cgcggcctct gttctcagcc aggagctgcc gctgatcctg 39481 aaaactcaca gctgcttttc ttgttttaac tgaggggatt tcacacgatg tcactggtga 39541 cttaacagga aggagaggca cagtgcactt ccctagagga ggcctggaat tctcttggct 39601 cccagaggtg tccctaggca gtccctcagg acccctatcc tttaccagac cccttctccc 39661 aagcccctta gccacagtac ccccaacccc gctgacttct ggccagtcac tgcagcacgt 39721 gctccatccc tgacgccatc cactcccagc ggcctccctc ctggggagcc acagccaacc 39781 acgccccttt gtccctgctc agacaaagcc tgggtcctga ggttggcggc acgagtaggg 39841 cgcccactcc ttcatccacg taagcgcctt ggcttacgtg gcgggtcgca gccccgcagg 39901 gagtgtgtgc tcttcagcgt ctgctgccgg gaagtcactc tcagatgtgt gagggtgacc 39961 cagcaaggcc gggttccgcc tcggcaggtg tgatgcttcc acgagcctcc cggcacctgc 40021 tccttggtgg tcctggaacg cggcgctgtg ctgtccaggt ctcagctgcg tctggtgctg 40081 gcagggcgag agccgcactt gcgtgggtga acatcagtga ggcgcactcg cgatggccgc 40141 tcactaccga gttctgtcag ccgaacaagg atgaggatct atcggaatcc tctgtctccc 40201 catccacagt cactcactta aggacctgcc tggtcactgt ccagggaggc ccctaaagtg 40261 gagcctcagg aagaggtgca ggctcagagc tcctgaagga ggtccctgct gcagccgtgt 40321 cacagccggg gtgaaggttt cgcccatggt ctccacctcg ggatcccgtc cttgtcagag 40381 aagtgttcct tggatgagcg cccagaggga ggagcccatg gccctttctt ctgtgactga 40441 ttaaaaatct cgatgaaaca agacagcccc tggcaggtga accactgcct ctgagcatgc 40501 ctcccctgaa aacagtgtgg ggcgcaagcc catctcctcc ccaatccagc catgaggctc 40561 ccctggaaac agcacagggc gcaagcctgt ctcctccccg gtccagccat gaggctcccc 40621 tgaaaactgc agggcacaag cccgtctgct ccccggtcca gccatggggc ttccgtgcac 40681 ggtgacccac aggctttttt cctgaggaaa cctaaatcct ttcccactgg ccccacccac 40741 ccctcagcct ggtgtcagag gcctgcgtgg ccagccaggg gctgttaggc ggcccagccg 40801 ctcactgctg ggaccccttc cttgaatgca gtagtcgagg tgggcgctgg gctgtgccag 40861 gagccactgt ggcatctgta gctggcagtt gatggctggg cggctgcacg cagcgtcggg 40921 agctgctgag aacctgcttc ctggactttc tgcctgcgca gagggcggca ggggggtttc 40981 atctcatagg cagccagaag ttgcttcagg acatgcagtt ttgggggcgg gtttgttttg 41041 agagtatcgc tctgttgccc aggctggagt gatcttggct cactgcaacc tccgcctccc 41101 ggggtcaagc aattctcctg cctcagcctc ccaagtagct gggattacag gtgtctgcca 41161 ccaggccgaa cttttgtaat tttaatagag acaagttttc gccatgttag tcaggctggt 41221 ctcgaactcc cgacctcagg tgttctgcct gcctcagctg cccaaagtgc tgggattaca 41281 ggcatgagcc actgcacctg gctgtgaaat gtgtttttaa cagtgtttcg ttacagctca 41341 cggtagcagt ttccacctgg gcctgtcggg agtgggtagg actgtagaca gcgggaggag 41401 ggatgaaggg tgcggggtgt tgagggcagg gtggggcctg acagctgtgg gctacaccac 41461 aggttgccag gggatggggc tgctgttaat gaggccgtac tccttccact gttgcccctg 41521 ggtggccatg aggcgctgga cgacacaggt cactgaaggc cctgctgtgg gcacagtgcc 41581 tggcacacag gggcctctgt tgacttggca gggaggggtt cctgatgaag cagaagcagg 41641 tcccaggggt gggtgtggca ggaccctcag tttcagggct gctgcggcct gtgcgtcagg 41701 gtcaggggcg tgcgtccacc agcctgcact tgttgagagt tgtgctgcag gtcggaggct 41761 ggcgaggctg ctcctggccc gttgtgcctc ctgtagaagg ggggctggtg tgaatgagaa 41821 gtgcagctgt gttatctatt gctgggtcac agaccacccc acaactcagg gctggggcag 41881 ccgcacatca ttaccctctc tgtgagtgga ctttggcagc ccgggactct cgccagtctg 41941 gggtgtcctg gggcttgggg cttagcccat ggctgttggg ggttcccctc aggggtcctt 42001 ggcctggtgc ttgagtgcct gcagggcgtg cttggcgggg agtggggctg aggacacctc 42061 actctcagcc ttgagtcacc tggtgccacc tccatcacat cccatgggcc acagccttct 42121 ggcagcgcag gctcacaggg ggcagggtcg gcagggctcc cgtggcaggg cagcctgggg 42181 tcatggcgag ggtaatttct gttctgcagg ccatgcagcc tgcctgaccc gtgttctttt 42241 cgcctttcag cttgtgctgc attgcacctg tcctcgcggc caaggtgctc agaggcgtcg 42301 aggtgactgt gggccacgag caggaggaag gtggcaagtg gccttatgcc gggaccgcag 42361 aggccatcaa ggccctgggt gccaagcact gcgtgaagga agtggtcata tccttcctgg 42421 tagccaggcc cggcccgctg tcgtgcttgt ccctgagacg tgcataggga cgcccctccc 42481 tccgggctgt cttggtgggt ggcctcttca ccgggagctg cacctgctgc tctccctgtg 42541 gggcctcccc cagtgggcac cgcagccatg tgtctaaggg cagagagcag atggctgtga 42601 cagcccagct ggtgtgaggt gggtcacaga cacgtcagaa gcgcagctct ggtgcgtggt 42661 tggcagatct ggcagatcca ggtcccttgc ctgactgttt agtctgctca gcaacaccca 42721 ggcagccaca ctccgggccc agcgggggga gggtgaggaa agagtgtgtg cgaggtgtgg 42781 agggagagca gccgggcgtg caggaggcgg gggcttcctg agatgctggg ggaaagcggc 42841 cgggtgtgca ggagaagggg cttcctggct tcatttgggc ggaagttctc aggccagttt 42901 gggtggccag gtgattcagc cagccttgcg gcccccagcc cctcctgttg aggtgtgtac 42961 cctgagttct gagatgatgg accaagaaca ccagcggggc tgttgcccag ggagatttgg 43021 agggaggctc ctggaaggcc ctgcctggat c // LOCUS AB001575 1028 bp mRNA PRI 11-MAR-1997 DEFINITION Human mRNA for endonuclease III homolog, complete cds. ACCESSION AB001575 NID g1881375 KEYWORDS endonuclease III homolog. SOURCE Homo sapiens bone marrow cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ikeda,S., Sarker,A.H., Kaminaka,S., Yamamoto,K. and Seki,S. TITLE cDNA cloning and expression of a human homolog (hNTH) of Escherichia co li endonuclease III JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 1028) AUTHORS Ikeda,S. TITLE Direct Submission JOURNAL Submitted (05-MAR-1997) to the DDBJ/EMBL/GenBank databases. Shogo Ikeda, Okayama University of Science, Department of Biochemistry; 1-1 Ridaicho, Okayama, Okayama 700, Japan FEATURES Location/Qualifiers source 1..1028 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="bone marrow" gene 11..925 /gene="hNTH" CDS 11..925 /gene="hNTH" /codon_start=1 /product="endonuclease III homolog" /db_xref="PID:d1020166" /db_xref="PID:g1881376" /translation="MTALSARMLTRSRSLGPGAGPRGCREEPGPLRRREAAAEARKSH SPVKRPRKAQRLRVAYEGSDSEKGEGAEPLKVPVWEPQDWQQQLVNIRAMRNKKDAPV DHLGTEHCYDSSAPPKVRRYQVLLSLMLSSQTKDQVTAGAMQRLRARGLTVDSILQTD DATLGKLIYPVGFWRSKVKYIKQTSAILQQHYGGDIPASVAELVALPGVGPKMAHLAM AVAWGTVSGIAVDTHVHRIANRLRWTKKATKSPEETRAALEEWLPRELWHEINGLLVG FGQQTCLPVHPRCHACLNQALCPAAQGL" polyA_site 1028 /note="22 A nucleotides" BASE COUNT 209 a 302 c 352 g 165 t ORIGIN 1 ggagtccggc atgaccgcct tgagcgcgag gatgctgacc cggagccgga gcctgggacc 61 cggggctggg ccgcgggggt gtagggagga gcccgggcct ctccggagaa gagaggctgc 121 agcagaagcg aggaaaagcc acagccccgt gaagcgtccg cggaaagcac agagactgcg 181 tgtggcctat gagggctcgg acagtgagaa aggtgagggg gctgagcccc tcaaggtgcc 241 agtctgggag ccccaggact ggcagcaaca gctggtcaac atccgtgcca tgaggaacaa 301 aaaggatgca cctgtggacc atctggggac tgagcactgc tatgactcca gtgccccccc 361 aaaggtacgc aggtaccagg tgctgctgtc actgatgctc tccagccaaa ccaaagacca 421 ggtgacggcg ggcgccatgc agcgactgcg ggcgcggggc ctgacggtgg acagcatcct 481 gcagacagat gatgccacgc tgggcaagct catctacccc gtcggtttct ggaggagcaa 541 ggtgaaatac atcaagcaga ccagcgccat cctgcagcag cactacggtg gggacatccc 601 agcctctgtg gccgagctgg tggcgctgcc gggtgttggg cccaagatgg cacacctggc 661 tatggctgtg gcctggggca ctgtgtcagg cattgcagtg gacacgcatg tgcacagaat 721 cgccaacagg ctgaggtgga ccaagaaggc aaccaagtcc ccagaggaga cccgcgccgc 781 cctggaggag tggctgccta gggagctgtg gcacgagatc aatggactct tggtgggctt 841 cggccagcag acctgtctgc ctgtgcaccc tcgctgccac gcctgcctca accaagccct 901 ctgcccggcc gcccagggtc tctgatggcc gcatggctct ggccgaggtg ccgctgtggc 961 caccgtctgt gaagtggctt tacgcttcag gaagccacgc ctgttgaata aagctttggt 1021 gtgtttgc // LOCUS AB001636 3028 bp mRNA PRI 13-DEC-1997 DEFINITION Homo sapiens mRNA for ATP-dependent RNA helicase #46, complete cds. ACCESSION AB001636 NID g2696612 KEYWORDS ATP-dependent RNA helicase #46. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Imamura,O., Sugawara,M. and Furuichi,Y. TITLE Cloning and characterization of a putative human RNA helicase gene of the DEAH-box protein family JOURNAL Biochem. Biophys. Res. Commun. 240 (2), 335-340 (1997) MEDLINE 98049832 REFERENCE 2 (bases 1 to 3028) AUTHORS Imamura,O. TITLE Direct Submission JOURNAL Submitted (07-MAR-1997) to the DDBJ/EMBL/GenBank databases. Osamu Imamura, AGENE Research Institute, Department of Molecular Biology; 200 Kajiwara, Kamakura, Kanagawa 247, Japan (E-mail:osamui@po.iijnet.or.jp, Tel:81-467-46-4815, Fax:81-467-48-6595) FEATURES Location/Qualifiers source 1..3028 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 162..2603 /note="putative" /codon_start=1 /product="ATP-dependent RNA helicase #46" /db_xref="PID:d1024893" /db_xref="PID:g2696613" /translation="MSKRHRLDLGEDYPSGKKRAGTDGKDRDRDRDREDRSKDRDRER DRGDREREREKEKEKELRASTNAMLISAGLPPLKASHSAHSTHSAHSTHSTHSAHSTH AGHAGHTSLPQCINPFTNLPHTPRYYDILKKRLQLPVWEYKDRFTDILGRHQSFVLVG ETGSGKTTQIPHRCVEYMRSLPGPKRGVACTQPRRVAAMSVAQRVADEMDVMLGQEVG YSIRFEDCSSAKTFFMYMTDGMLLREAMNDPLLERYGVIILDEAHERTLATDILMGVL KEVVRQRSDLKVIVMSATLDAGKFQIYFDNCPLLTIPGRTHPVEIFYTPEPERDYLEA AIRTVIQIHMCEEEEGDLLLFLTGQEEIDEACKRIKREVDDLGPEVGDIKIIPLYSTL PPQQQQRIFEPPPPKKQNGAIGRKVVVSTNIAETSLTIDGVVFVIDPGFAKQKVYNPR IRVESLLVTAISKASAQQRAGRAGRTRPGKCFRLYTEKAYKTEMQDNTYPEILRSNLG SVVLQLKKLGIDDLVHFDFMDPPAPETLMRALELLNYLAALNDDGDLTELGSMMAEFP LDPQLAKMVIASCDYNCSNEVLSITAMLSVPQCFVRPTEAKKAADEAKMRFAHIDGDH LTLLNVYHAFKQNHESVQWCYDNFINYRSLMSADNVRQQLSRIMDRFNLPRRSTDFTS RDYYINIRKALVTGYFMQVAHLERTGHYLTVKDNQVVQLHPSTVLDHKPEWVLYNEFV LTTKNYIRTCTDIKPEWLVKIAPQYYDMSNFPQCEAKRQLDRIIAQTSIQGIFTVLNS VLRTEVIERTALKDE" BASE COUNT 890 a 590 c 693 g 855 t ORIGIN 1 ctcgtcgccg ccgccatttt agctgttggt tccggccgca ccgtgtgggc tgtagtagcg 61 ggaggggtgg gggtcctcca gagttaagtg gctgtcctcg actgtgccca tacagcagcc 121 agctttcttc cttaataact gcccgttcga agagtgcgag gatgtccaag cggcaccggt 181 tggacctagg ggaggattac ccctctggca agaagcgtgc ggggaccgat gggaaggatc 241 gagatcgaga ccgggatcgt gaagatcggt ctaaagatcg agaccgagaa cgtgatagag 301 gagatagaga gcgagagagg gagaaagaaa aggagaagga gttgcgagct tcaacaaatg 361 ctatgcttat cagtgctgga ttaccacccc tgaaagcttc ccattcagct cactcaaccc 421 actcagcaca ttcaacgcat tctacacatt ctgctcattc aacgcatgcc ggacatgcag 481 gtcacacgtc acttccacag tgcattaatc cgttcaccaa cttaccccat actcctcgat 541 actatgatat tctaaagaaa cgtcttcagc tccctgtttg ggaatacaag gataggttta 601 cagatattct gggtagacat cagtcctttg tactggttgg tgagactggg tctggtaaaa 661 caacacaaat tccacaccgg tgtgtggagt acatgcgatc attaccagga cccaagagag 721 gagttgcctg tacccaaccc aggagagtgg ctgcaatgag tgtggctcag agagttgctg 781 atgagatgga tgtgatgttg ggccaggaag ttggttactc cattcgattt gaagactgca 841 gtagtgcaaa aacatttttt atgtatatga ctgatgggat gttacttcgt gaagctatga 901 atgatcccct cctggagcgt tatggtgtaa taattcttga tgaggctcat gagaggacac 961 tggctacaga tattctaatg ggtgttctga aggaagttgt aagacagaga tcagatttaa 1021 aggttatagt tatgagcgct actctagatg caggaaaatt ccagatttac tttgataact 1081 gtcctctcct aactattcct gggcgtacac atcctgttga gatcttctat actccagaac 1141 cagagagaga ttatcttgaa gcagcaattc gaacagttat ccagattcat atgtgtgaag 1201 aggaagaggg agatcttctt cttttcttaa ctggtcaaga ggaaattgat gaagcctgta 1261 agagaataaa gcgtgaagtt gatgatttgg gccctgaagt tggtgacatt aaaatcattc 1321 cattgtattc tacacttcca cctcagcagc agcaacgcat ttttgagcct ccacctccca 1381 aaaaacagaa tggagcaatt ggaagaaagg tagttgtgtc aactaacata gcagagacgt 1441 ctttgacaat agatggtgtg gtgtttgtga ttgatcctgg atttgcgaaa cagaaggtct 1501 acaatcctcg aatcagagtt gagtcccttt tggtgacagc tattagtaaa gcttcagctc 1561 agcaaagggc tggtcgagct ggacgtacca gacctggaaa atgcttcaga ctttacacag 1621 agaaagctta taaaacagaa atgcaggata acacctatcc tgagattttg cgttctaatt 1681 taggatcagt tgtgttacaa ttgaagaaac ttggtattga tgacttggta cattttgatt 1741 ttatggatcc accagctcct gaaactctga tgagagccct ggaacttttg aattacctgg 1801 ctgctttaaa tgatgatgga gatctgactg aattgggatc catgatggca gagtttcctc 1861 tagatccaca gctcgcaaaa atggttattg caagttgtga ctacaactgt tctaatgagg 1921 tcctatctat tactgctatg ttgtcagtcc cacagtgttt tgttcgcccc acggaggcca 1981 agaaagccgc agatgaggcc aagatgagat ttgcccacat agatggagat catctgacac 2041 tgctgaacgt ctaccatgct tttaaacaaa atcatgaatc ggttcagtgg tgttatgaca 2101 acttcattaa ctacaggtcc ctgatgtccg cggacaatgt acgccagcag ctatctcgaa 2161 ttatggacag atttaatttg cctcgtcgaa gtactgactt tacaagcagg gactattata 2221 ttaatataag aaaagctttg gttactgggt attttatgca ggtggcacat ttagaacgaa 2281 cagggcatta cttaactgtg aaagataacc aggtggttca gttgcatccc tctactgttc 2341 ttgaccacaa acctgaatgg gtgctttata atgagtttgt tctaacaaca aagaattaca 2401 tccggacatg tacagacatc aagccagaat ggttggtgaa aattgcccct caatattatg 2461 acatgagcaa tttcccacag tgtgaagcaa agagacagtt ggaccgcatc attgcccaaa 2521 cttcaatcca aggaatattc acagtactga attcagtgct tagaactgaa gttattgaga 2581 ggacagcttt aaaagatgaa tgaactcaaa agttcgagtt gtgctcttca cgttggttcg 2641 ataatggcct ttatttgaaa gctttttaat ttttctttac agtaaatatt ccattctgat 2701 ttcataaatt aaacatttat gcctcccttt tgtgttgaca ctgtagctca tactggaaaa 2761 gtcgatcaat gttttgcagt ttattgaaag tagttctata tataacaatg ttataagcat 2821 ttctttagaa atggttgaaa atgcttctaa aatgtgatta tcgaccatgg tatgcatgat 2881 cgttgtaatt gttgacattc cttttagaag ttgtgaaatg ttacaacttg tgcttatgta 2941 gacacaattt tttgtttcag taccagaggc actgacttca ataaagttta tttatacgga 3001 aaaaaaaaaa aaaaaaaaaa aaaaaaaa // LOCUS AB001838 603 bp mRNA PRI 08-APR-1997 DEFINITION Human mRNA for recoverin, complete cds. ACCESSION AB001838 NID g1902889 KEYWORDS recoverin. SOURCE Homo sapiens small cell lung cancer cell_line:MN-1112 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 603) AUTHORS Matsubara,S. TITLE Direct Submission JOURNAL Submitted (12-MAR-1997) to the DDBJ/EMBL/GenBank databases. Shuji Matsubara, Kagawa Medical University, First Department of Internal Medicine; 1750-1 Ikenobe, Miki-Cho, Kita-Gun, Kagawa 761-07, Japan (E-mail:mshuzi@kms.ac.jp, Tel:81-878-98-5111, Fax:81-878-91-0573) REFERENCE 2 (sites) AUTHORS Matsubara,S., Yamaji,Y., Sato,M., Fujita,J. and Takahara,J. TITLE Expression of a photoreceptor protein, recoverin, as a cancer-associated retinopathy autoantigen in human lung cancer cell lines JOURNAL Br. J. Cancer 74 (9), 1419-1422 (1996) MEDLINE 97069555 REFERENCE 3 (sites) AUTHORS Yamaji,Y., Matsubara,S., Yamadori,I., Sato,M., Fujita,T., Fujita,J. and Takahara,J. TITLE Characterization of a small-cell-lung-carcinoma cell line from a patient with cancer-associated retinopathy JOURNAL Int. J. Cancer 65 (5), 671-676 (1996) MEDLINE 96178432 FEATURES Location/Qualifiers source 1..603 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="MN-1112" /cell_type="small cell lung cancer" CDS 1..603 /codon_start=1 /product="recoverin" /db_xref="PID:d1020217" /db_xref="PID:g1902890" /translation="MGNSKSGALSKEILEELQLNTKFSEEELCSWYQSFLKDCPTGRI TQQQFQSIYAKFFPDTDPKAYAQHVFRSFDSNLDGTLDFKEYVIALHMTTAGKTNQKL EWAFSLYDVDGNGTISKNEVLEIVMAIFKMITPEDVKLLPDDENTPEKRAEKIWKYFG KNDDDKLTEKEFIEGTLANKEILRLIQFEPQKVKEKMKNA" BASE COUNT 172 a 163 c 160 g 108 t ORIGIN 1 atggggaaca gcaaaagtgg ggccctgtcc aaggagatcc tggaggagct gcagctgaac 61 accaagttct cggaggagga gctgtgctcc tggtaccagt ccttcctgaa ggactgtccc 121 accggccgca tcacccagca gcagttccag agcatctacg ccaagttctt ccccgacacc 181 gaccccaagg cctacgccca gcatgtgttc cgcagcttcg attccaacct cgacggcacc 241 ctggacttca aggagtacgt catcgccctg cacatgacca ccgcgggcaa gaccaaccag 301 aagctggagt gggccttctc cctctacgac gtggacggta acgggaccat cagcaagaat 361 gaagtgctgg agatcgtcat ggctattttc aaaatgatca ctcccgagga cgtgaagctc 421 cttccagacg atgaaaacac gccggaaaag cgagccgaga agatctggaa gtactttgga 481 aagaatgatg atgataaact tacagagaaa gaattcattg aggggacact ggccaataag 541 gaaattctgc gactgatcca gtttgagcct caaaaagtga aggaaaagat gaagaacgcc 601 tga // LOCUS AB001895 3811 bp mRNA PRI 21-JAN-1998 DEFINITION Homo sapiens mRNA for B120, complete cds. ACCESSION AB001895 NID g2588990 KEYWORDS B120. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Takeuchi,T., Chen,B.K., Qiu,Y., Sonobe,H. and Ohtsuki,Y. TITLE Molecular cloning and expression of a novel human cDNA containing CAG repeats JOURNAL Gene 204 (1-2), 71-77 (1997) MEDLINE 98094256 REFERENCE 2 (bases 1 to 3811) AUTHORS Takeuchi,T. TITLE Direct Submission JOURNAL Submitted (14-MAR-1997) to the DDBJ/EMBL/GenBank databases. Tamotsu Takeuchi, Kochi Medical School, Dept. of Pathology; Okocho, Kohasu, Nankoku, Kochi 783, Japan (Tel:0888-88-2335, Fax:0888-80-2336) FEATURES Location/Qualifiers source 1..3811 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1p35-36.1" CDS 288..3716 /codon_start=1 /product="B120" /db_xref="PID:d1024146" /db_xref="PID:g2588991" /translation="MDQMGKMRPQPYGGTNPYSQQQGPPSDPQQGHGYPGQPYGSQTP QRYPMTVQGRAQSAMGGLSYTQQIPPYGQQGPSGYGQQGQTPYYNQQSPHPQQQQPPY SQQPPSQTPHAQPSYQQQPQSQPPQLQSSQPPYSQQPSQPPHQQSPAPYPSQQSTTQQ HPQSQPPYSQPQAQSPYQQQQPQQPAPSTLSQQAAYPQPQSQQSQQTAYSQQRFPPPQ ELSQDSFGSQASSAPSMTSSKGGQEDMNLSLQSRPSSLPDLSGSIDDLPMGTEGALSP GVSTSGISSSQGEQSNPAQSPFSPHTSPHLPGIRGPSPSPVGSPASVAQSRSGPLSPA AVPGNQMPPRPPSGSSDSIMHPSMNQSSIAQDRGYMQRNSQMPQYSSPQPGSALSPRQ LSGGQIHTGMGSYQQNSMGSYGPQGGQYGPQGGYPRQPNYNALPNANYPSAGMAGGIN PMGAGGQMHGQPGIPPYGTLPPGRMSHASMGNRPYGPNNGQYATSGWVRDVSPPGGMN RKTQETAVAMHVAANSIQNRPPGYPNMNQGGMMGTGPPYGQGINSMAGMINPQGPPYS MGGTMANNSAGMAASPEMMGLGDVKLTPATKMNNKADGTPKTESKSKKSSSSTTTNEK ITKLYELGGGPERKMWVDRYLAFTEEKAMGMTNLPAVGRKPLDLYRLYVSVKEIGGLT QVNKNKKWRELATNLNVGTSSSAASSLKKQYIQCLYAFECKIERGEDPPPDIFAAADS KKSQPKIQPPSPAGSGSMQGPQTPQSTSSSMAEGGDLKPPTPASTPHSQIPPLPGMSR SNSVGIQDAFNDGSDSTFQKRNSMTPNPGYQPSMNTSDMMGRMSYEPNKDPYGSMRKA PGSDPFMSSGQGPNGGMGDPYSRAAGPGLGNVAMGPRQHYPYGGPYDRVRTEPGIGPE GNMSTGAPQSNLMPSNPDSGMYSPSRYPPQQQQQQQQRHDSYGNQFSTQGTPSGSPFP SQQTTMYQQQQQNYKRPMDGTYGPPAKRHEGEMYSVPYSTGQGLPQQQQLPPAQPQPA SQPQAAQPSPQQDVYNQYGNAYPATATAATERRPAGGPQNQFPFQFGRDRVSAPPGTN AQQNMPPQMMGGPIQASAEVAQQGTMWQGRNDMTYNYANRQSTALPPRAPPIMA" repeat_region 339..926 /note="EWS-like repeat" repeat_region 3117..3137 /note="CAG repeat" /rpt_unit=3117..3119 BASE COUNT 968 a 1198 c 925 g 720 t ORIGIN 1 tggatctcaa gcagtaccac agttacaaag tagtttttag cttagcttac aaggtgtttt 61 cacaaatagg tggtattttc atttttcaaa tgacaaaatt agggttctga ggcgggtcag 121 ttgacttaaa ggttactagg ttggtctcat tgctctttca aagtaactgt atttctttat 181 agcatacaga ctaaaaaaac ctgtgtacct gggttatata ttcggtggcc agaggcatca 241 aagctcaggt taatgaaatg ctctttattt tgtagccatc cagtccaatg gatcagatgg 301 gcaagatgag acctcagcca tatggcggga ctaacccata ctcgcagcaa cagggacctc 361 cgtcagaccc gcagcaagga catgggtacc cagggcagcc atacgggtcc cagaccccgc 421 agcggtaccc gatgaccgtg cagggccggg cgcagagtgc catgggcggc ctctcttata 481 cacagcagat tcctccttat ggacaacaag gccccagcgg gtatggtcaa cagggccaga 541 ctccatatta caaccagcaa agtcctcacc ctcagcagca gcagccaccc tactcccagc 601 aaccaccgtc ccagacccct catgcccaac cttcgtatca gcagcagcca cagtctcaac 661 caccacagct ccagtcctct cagcctccat actcccagca gccatcccag cctccacatc 721 agcagtcccc ggctccatac ccctcccagc agtcgacgac acagcagcac ccccagagcc 781 agccccccta ctcacagcca caggctcagt ctccttacca gcagcagcaa cctcagcagc 841 cagcaccctc gacgctctcc cagcaggctg cgtatcctca gccccagtct cagcagtccc 901 agcaaactgc ctattcccag cagcgcttcc ctccaccgca ggagctatct caagattcat 961 ttgggtctca ggcatcctca gccccctcaa tgacctccag taagggaggg caagaagata 1021 tgaacctgag ccttcagtca agaccctcca gcttgcctga tctatctggt tcaatagatg 1081 acctccccat ggggacagaa ggagctctga gtcctggagt gagcacatca gggatttcca 1141 gcagccaagg agagcagagt aatccagctc agtctccttt ctctcctcat acctcccctc 1201 acctgcctgg catccgaggc ccttccccgt cccctgttgg ctctcccgcc agtgttgctc 1261 agtctcgctc aggaccactc tcgcctgctg cagtgccagg caaccagatg ccacctcggc 1321 cacccagtgg cagttcggac agcatcatgc atccttccat gaaccaatca agcattgccc 1381 aagatcgagg ttatatgcag aggaactccc agatgcccca gtacagttcc ccccagcccg 1441 gctcagcctt atctccgcgt cagctttccg gaggacagat acacacaggc atgggctcct 1501 accagcagaa ctccatgggg agctatggtc cccagggggg tcagtatggc ccacaaggtg 1561 gctaccccag gcagccaaac tataatgcct tgcccaatgc caactacccc agtgcaggca 1621 tggctggagg cataaacccc atgggtgccg gaggtcaaat gcatggacag cctggcatcc 1681 caccttatgg cacactccct ccagggagga tgagtcacgc ctccatgggc aaccggcctt 1741 atggccctaa caatggccaa tatgccacct caggttgggt cagggatgtg tccccaccag 1801 ggggcatgaa ccggaaaacc caagaaactg ctgtcgccat gcatgttgct gccaactcta 1861 tccaaaacag gccgccaggc taccccaata tgaatcaagg gggcatgatg ggaactggac 1921 ctccttatgg acaagggatt aatagtatgg ctggcatgat caaccctcag ggacccccat 1981 attccatggg tggaaccatg gccaacaatt ctgcagggat ggcagccagc ccagagatga 2041 tgggccttgg ggatgtaaag ttaactccag ccaccaaaat gaacaacaag gcagatggga 2101 cacccaagac agaatccaaa tccaagaaat ccagttcttc tactacaacc aatgagaaga 2161 tcaccaagtt gtatgagctg ggtggtgggc ctgagaggaa gatgtgggtg gaccgttatc 2221 tggccttcac tgaggagaag gccatgggca tgacaaatct gcctgctgtg ggtaggaaac 2281 ctctggacct ctatcgcctc tatgtgtctg tgaaggagat tggtggattg actcaggtca 2341 acaagaacaa aaaatggcgg gaacttgcaa ccaacctcaa tgtgggcaca tcaagcagtg 2401 ctgccagctc cttgaaaaag cagtatatcc agtgtctcta tgcctttgaa tgcaagattg 2461 aacggggaga agaccctccc ccagacatct ttgcagctgc tgattccaag aagtcccagc 2521 ccaagatcca gcctccctct cctgcgggat caggatctat gcaggggccc cagactcccc 2581 agtcaaccag cagttccatg gcagaaggag gagacttaaa gccaccaact ccagcatcca 2641 caccacacag tcagatcccc ccattgccag gcatgagcag gagcaattca gttgggatcc 2701 aggatgcctt taatgatgga agtgactcca cattccagaa gcggaattcc atgactccaa 2761 accctgggta tcagcccagt atgaatacct ctgacatgat ggggcgcatg tcctatgagc 2821 caaataagga tccttatggc agcatgagga aagctccagg gagtgatccc ttcatgtcct 2881 cagggcaggg ccccaacggc gggatgggtg acccctacag tcgtgctgcc ggccctgggc 2941 taggaaatgt ggcgatggga ccacgacagc actatcccta tggaggtcct tatgacagag 3001 tgaggacgga gcctggaata gggcctgagg gaaacatgag cactggggcc ccacagtcga 3061 atctcatgcc ttccaaccca gactcgggga tgtattctcc tagccgctac cccccgcagc 3121 agcagcagca gcagcagcaa cgacatgatt cctatggcaa tcagttctcc acccaaggca 3181 ccccttctgg cagccccttc cccagccagc agactacaat gtatcaacag caacagcaga 3241 attacaagcg gccaatggat ggcacatatg gccctcctgc caagcggcac gaaggggaga 3301 tgtacagcgt gccatacagc actgggcagg ggctgcctca gcagcagcag ttgcccccag 3361 cccagcccca gcctgccagc cagccacaag ctgcccagcc ttcccctcag caagatgtat 3421 acaaccagta tggcaatgcc tatcctgcca ctgccacagc tgctactgag cgccgaccag 3481 caggcggccc ccagaaccaa tttccattcc agtttggccg agaccgtgtc tctgcacccc 3541 ctggcaccaa tgcccagcaa aacatgccac cacaaatgat gggcggcccc atacaggcat 3601 cagctgaggt tgctcagcaa ggcaccatgt ggcaggggcg taatgacatg acctataatt 3661 atgccaacag gcagagcacg gctctgcccc ccagggcccc gcctatcatg gcgtgaaccg 3721 aacagatgaa atgctgcaca cagatcagag ggccaaccac gaaggctcgt ggccttccca 3781 tggcacacgc cagcccccat atggtccctc t // LOCUS AB002097 627 bp mRNA PRI 18-SEP-1997 DEFINITION Homo sapiens mRNA for fibroblast growth factor-10, complete cds. ACCESSION AB002097 NID g2440220 KEYWORDS fibroblast growth factor-10; FGF-10. SOURCE Homo sapiens adult male lung cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Emoto,H., Tagashira,S., Mattei,M.G., Yamasaki,M., Hashimoto,G., Katsumata,T., Negoro,T., Nakatsuka,M., Birnbaum,D., Coulier,F. and Itoh,N. TITLE Structure and expression of human fibroblast growth factor-10 JOURNAL J. Biol. Chem. 272 (37), 23191-23194 (1997) MEDLINE 97435285 REFERENCE 2 (bases 1 to 627) AUTHORS Itoh,N. TITLE Direct Submission JOURNAL Submitted (24-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuyuki Itoh, Kyoto University Graduate School of Pharm. Sci., Dept. of Genetic Biochem.; Yoshida-Shimoadachi, Sakyo, Kyoto 606-01, Japan (E-mail:itohnobu@pharm.kyoto-u.ac.jp, Tel:81-75-753-4540, Fax:81-75-753-4600) FEATURES Location/Qualifiers source 1..627 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /sex="male" /tissue_type="lung" gene 1..627 /gene="FGF-10" CDS 1..627 /gene="FGF-10" /codon_start=1 /product="fibroblast growth factor-10" /db_xref="PID:d1023194" /db_xref="PID:g2440221" /translation="MWKWILTHCASAFPHLPGCCCCCFLLLFLVSSVPVTCQALGQDM VSPEATNSSSSSFSSPSSAGRHVRSYNHLQGDVRWRKLFSFTKYFLKIEKNGKVSGTK KENCPYSILEITSVEIGVVAVKAINSNYYLAMNKKGKLYGSKEFNNDCKLKERIEENG YNTYASFNWQHNGRQMYVALNGKGAPRRGQKTRRKNTSAHFLPMVVHS" BASE COUNT 178 a 151 c 150 g 148 t ORIGIN 1 atgtggaaat ggatactgac acattgtgcc tcagcctttc cccacctgcc cggctgctgc 61 tgctgctgct ttttgttgct gttcttggtg tcttccgtcc ctgtcacctg ccaagccctt 121 ggtcaggaca tggtgtcacc agaggccacc aactcttctt cctcctcctt ctcctctcct 181 tccagcgcgg gaaggcatgt gcggagctac aatcaccttc aaggagatgt ccgctggaga 241 aagctattct ctttcaccaa gtactttctc aagattgaga agaacgggaa ggtcagcggg 301 accaagaagg agaactgccc gtacagcatc ctggagataa catcagtaga aatcggagtt 361 gttgccgtca aagccattaa cagcaactat tacttagcca tgaacaagaa ggggaaactc 421 tatggctcaa aagaatttaa caatgactgt aagctgaagg agaggataga ggaaaatgga 481 tacaatacct atgcatcatt taactggcag cataatggga ggcaaatgta tgtggcattg 541 aatggaaaag gagctccaag gagaggacag aaaacacgaa ggaaaaacac ctctgctcac 601 tttcttccaa tggtggtaca ctcatag // LOCUS AB002107 3873 bp mRNA PRI 08-OCT-1997 DEFINITION Homo sapiens hPer mRNA, complete cds. ACCESSION AB002107 NID g2506044 KEYWORDS hPer. SOURCE Homo sapiens adult brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Tei,H., Okamura,H., Shigeyoshi,Y., Fukuhara,C., Ozawa,R., Hirose,M. and Sakaki,Y. TITLE Circadian oscillation of a mammalian homologue of the Drosophila period gene JOURNAL Nature 389 (6650), 512-516 (1997) MEDLINE 97472418 REFERENCE 2 (bases 1 to 3873) AUTHORS Tei,H. TITLE Direct Submission JOURNAL Submitted (24-MAR-1997) to the DDBJ/EMBL/GenBank databases. Hajime Tei, University of Tokyo, Institute of Medical Science, Human Genome Center; 4-6-1 Shiroganedai, Minato-ku, Tokyo 108, Japan (E-mail:tei@ims.u-tokyo.ac.jp, Tel:03-5449-5625, Fax:03-5449-5445) FEATURES Location/Qualifiers source 1..3873 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17" /dev_stage="adult" /map="17q12-13.1" /tissue_type="brain" gene 1..3873 /gene="hPer" CDS 1..3873 /gene="hPer" /codon_start=1 /db_xref="PID:d1023501" /db_xref="PID:g2506045" /translation="MSGPLEGADGGGDPRPGESFCPGGVPSPGPPQHRPCPGPSLADD TDANSNGSSGNESNGHESRGASQRSSHSSSSGNGKDSALLETTESSKSTNSQSPSPPS SSIAYSLLSASSEQDNPSTSGCSSEQSARARTQKELMTALRELKLRLPPERRGKGRSG TLATLQYALACVKQVQANQEYYQQWSLEEGEPCSMDMSTYTLEELEHITSEYTLQNQD TFSVAVSFLTGRIVYISEQAAVLLRCKRDVFRGTRFSELLAPQDVGVFYGSTAPSRLP TWGTGASAGSGLRDFTQEKSVFCRIRGGPDRDPGPRYQPFRLTPYVTKIRVSDGAPAQ PCCLLIAERIHSGYEAPRIPPDKRIFTTRHTPSCLFQDVDERAAPLLGYLPQDLLGAP VLLFLHPEDRPLMLAIHKKILQLAGQPFDHSPIRFCARNGEYVTMDTSWAGFVHPWSR KVAFVLGRHKVRTAPLNEDVFTPPAPSPAPSLDTDIQELSEQIHRLLLQPVHSPSPTG LCGVGAVTSPGPLHSPGSSSDSNGGDAEGPGPPAPVTFQQICKDVHLVKHQGQQLFIE SRARPQSRPRLPATGTFKAKALPCQSPDPELEAGSAPVQAPLALVPEEAERKEASSCS YQQINCLDSILRYLESCNLPSTTKRKCASSSSYTTSSASDDDRQRTGPVSVGTKKDPP SAALSGEGATPRKEPVVGGTLSPLALANKAESVVSVTSQCSFSSTIVHVGDKKPPESD IIMMEDLPGLAPGPAPSPAPSPTVAPDPAPDAYRPVGLTKAVLSLHTQKEEQAFLSRF RDLGRLRGLDSSSTAPSALGERGCHHGPAPPSRRHHCRSKAKRSRHHQNPRAEAPCYV SHPSPVPPSTPWPTPPATTPFPAVVQPYPLPVFSPRGGPQPLPPAPTSVPPAAFPAPL VTPMVALVLPNYLFPTPSSYPYGALQTPAEGPPTPASHSPSPSLPALPPSPPHRPDSP LFNSRCSSPLQLNLLQLEELPRAEGAAVAGGPGSSAGPPPPSAEAAEPEARLAEVTES SNQDALSGSSDLLELLLQEDSRSGTGSAASGSLGSGLGSGSGSGSHEGGSTSASITRS SQSSHTSKYFGSIDSSEAEAGAARGGAEPGDQVIKYVLQDPIWLLMANADQRVMMTYQ VPSRDMTSVLKQDRERLRAMQKQQPRFSEDQRRELGAVHSWVRKGQLPRALDVMACVD CGSSTQDPGHPDDPLFSELDGLGLEPMEEGGGEQGSSGGGSGEGEGCEEAQGGAKASS SQDLAMEEEEEGRSSSSPALPTAGNCTS" BASE COUNT 732 a 1361 c 1102 g 678 t ORIGIN 1 atgagtggcc ccctagaagg ggctgatggg ggaggggacc ccaggcctgg ggaatcattt 61 tgtcctgggg gcgtcccatc ccctgggccc ccacagcacc ggccttgccc aggccccagc 121 ctggccgatg acaccgatgc caacagcaat ggttcaagtg gcaatgagtc caacgggcat 181 gagtctagag gcgcatctca gcggagctca cacagctcct cctcaggcaa cggcaaggac 241 tcagccctgc tggagaccac tgagagcagc aagagcacaa actctcagag cccatcccca 301 cccagcagtt ccattgccta cagcctcctg agtgccagct cagagcagga caacccgtcc 361 accagtggct gcagcagtga acagtcagcc cgggcaagga ctcagaagga actcatgaca 421 gcacttcgag agctcaagct tcgactgccg ccagagcgcc ggggcaaggg ccgctctggg 481 accctggcca cgctgcagta cgcactggcc tgtgtcaagc aggtgcaggc caaccaggaa 541 tactaccagc agtggagcct ggaggagggc gagccttgct ccatggacat gtccacctat 601 accctggagg agctggagca catcacgtct gagtacacac ttcagaacca ggataccttc 661 tcagtggctg tctccttcct gacgggccga atcgtctaca tttcggagca ggcagccgtc 721 ctgctgcgtt gcaagcggga cgtgttccgg ggtacccgct tctctgagct cctggctccc 781 caggatgtgg gagtcttcta tggttccact gctccatctc gcctgcccac ctggggcaca 841 ggggcctcag caggttcagg cctcagggac tttacccagg agaagtccgt cttctgccgt 901 atcagaggag gtcctgaccg ggatccaggg cctcggtacc agccattccg cctaaccccg 961 tatgtgacca agatccgggt ctcagatggg gcccctgcac agccgtgctg cctgctgatt 1021 gcagagcgca tccattcggg ttacgaagct ccccggatac cccctgacaa gaggattttc 1081 actacgcggc acacacccag ctgcctcttc caggatgtgg atgaaagggc tgcccccctg 1141 ctgggctacc tgccccagga cctcctgggg gccccagtgc tcctgttcct gcatcctgag 1201 gaccgacccc tcatgctggc tatccacaag aagattctgc agttggcggg ccagcccttt 1261 gaccactccc ctatccgctt ctgtgcccgc aacggggagt atgtcaccat ggacaccagc 1321 tgggctggct ttgtgcaccc ctggagccgc aaggtagcct tcgtgttggg ccgccacaaa 1381 gtacgcacgg cccccctgaa tgaggacgtg ttcactcccc cggcccccag cccagctccc 1441 tccctggaca ctgatatcca ggagctgtca gagcagatcc accggctgct gctgcagccc 1501 gtccacagcc ccagccccac gggactctgt ggagtcggcg ccgtgacatc cccaggccct 1561 ctccacagcc ctgggtcctc cagtgatagc aacgggggtg atgcagaggg gcctgggcct 1621 cctgcgccag tgactttcca gcagatctgt aaggatgtgc atctggtgaa gcaccagggc 1681 cagcagcttt ttattgagtc tcgggcccgg cctcagtccc ggccccgcct ccctgctaca 1741 ggcacgttca aggccaaggc ccttccctgc caatccccag acccagagct ggaggcgggt 1801 tctgctcccg tccaggcccc actagccttg gtccctgagg aggccgagag gaaagaagcc 1861 tccagctgct cctaccagca gatcaactgc ctggacagca tcctcaggta cctggagagc 1921 tgcaacctcc ccagcaccac taagcgtaaa tgtgcctcct cctcctccta taccacctcc 1981 tcagcctctg acgacgacag gcagaggaca ggtccagtct ctgtggggac caagaaagat 2041 ccgccgtcag cagcgctgtc tggggagggg gccaccccac ggaaggagcc agtggtggga 2101 ggcaccctga gcccgctcgc cctggccaat aaggcggaga gtgtggtgtc cgtcaccagt 2161 cagtgtagct tcagctccac catcgtccat gtgggagaca agaagccccc ggagtcggac 2221 atcatcatga tggaggacct gcctggccta gccccaggcc cagcccccag cccagccccc 2281 agccccacag tagcccctga cccagcccca gacgcctacc gtccagtggg gctgaccaag 2341 gccgtgctgt ccctgcacac acagaaggaa gagcaagcct tcctcagccg cttccgagac 2401 ctgggcaggc tgcgtggact cgacagctct tccacagctc cctcagccct tggcgagcga 2461 ggctgccacc acggccccgc acccccaagc cgccgacacc actgccgatc caaagccaag 2521 cgctcacgcc accaccagaa ccctcgggct gaagcgccct gctatgtctc acacccctca 2581 cccgtgccac cctccacccc ctggcccacc ccaccagcca ctaccccctt cccagcggtt 2641 gtccagccct accctctccc agtgttctct cctcgaggag gcccccagcc tcttccccct 2701 gctcccacat ctgtgccccc agctgctttc cccgcccctt tggtgacccc aatggtggcc 2761 ttggtgctcc ctaactatct gttcccaacc ccatccagct atccttatgg ggcactccag 2821 acccctgctg aagggcctcc cactcctgcc tcgcactccc cttctccatc cttgcccgcc 2881 ctccccccga gtcctcctca ccgcccggac tctccactgt tcaactcgag atgcagctct 2941 ccactccagc tcaatctgct gcagctggag gagctccccc gtgctgaggg ggctgctgtt 3001 gcaggaggcc ctgggagcag tgccgggccc ccacctccca gtgcggaggc tgctgagcca 3061 gaggccagac tggcggaggt cactgagtcc tccaatcagg acgcactttc cggctccagt 3121 gacctgctcg aacttctgct gcaagaggac tcgcgctccg gcacaggctc cgcagcctcg 3181 ggctccttgg gctctggctt gggctctggg tctggttcag gctcccatga agggggcagc 3241 acctcagcca gcatcactcg cagcagccag agcagccaca caagcaaata ctttggcagc 3301 atcgactctt ccgaggctga ggctggggct gctcggggcg gggctgagcc tggggaccag 3361 gtgattaagt acgtgctcca ggatcccatt tggctgctca tggccaatgc tgaccagcgc 3421 gtcatgatga cctaccaggt gccctccagg gacatgacct ctgtgctgaa gcaggatcgg 3481 gagcggctcc gagccatgca gaagcagcag cctcggtttt ctgaggacca gcggcgggaa 3541 ctgggtgctg tgcactcctg ggtccggaag ggccaactgc ctcgggctct tgatgtgatg 3601 gcctgtgtgg actgtgggag cagcacccaa gatcctggtc accctgatga cccactcttc 3661 tcagagctgg atggactggg gctggagccc atggaagagg gtggaggcga gcagggcagc 3721 agcggtggcg gcagtggtga gggagagggc tgcgaggagg cccaaggcgg ggccaaggct 3781 tcaagctctc aggacttggc tatggaggag gaggaagaag gcaggagctc atccagtcca 3841 gccttaccta cagcaggaaa ctgcaccagc tag // LOCUS AB002292 8467 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0294 gene, complete cds. ACCESSION AB002292 NID g2224528 KEYWORDS KIAA0294. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HF0223. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8467) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..8467 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HF0223" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 3732..7097 /gene="KIAA0294" CDS 3732..7097 /gene="KIAA0294" /codon_start=1 /db_xref="PID:d1021590" /db_xref="PID:g2224529" /translation="MHSDEMIYDDVENGDEGGNSSLEYGWSSSEFESYEEQSDSECKN GIPRSFLRSNHKKQLSHDLTRLKEHYEKKMRDLMASTVGVVEIQQLRQKHELKMQKLV KAAKDGTKDGLERTRAAVKRGRSFIRTKSLIAQDHRSSLEEEQNLFIDVDCKHPEAIL TPMPEGLSQQQVVRRYILGSVVDSEKNYVDALKRILEQYEKPLSEMEPKVLSERKLKT VFYRVKEILQCHSLFQIALASRVSEWDSVEMIGDVFVASFSKSMVLDAYSEYVNNFST AVAVLKKTCATKPAFLEFLKQEQEASPDRTTLYSLMMKPIQRFPQFILLLQDMLKNTS KGHPDRLPLQMALTELETLAEKLNERKRDADQRCEVKQIAKAINERYLNKLLSSGSRY LIRSDDMIETVYNDRGEIVKTKERRVFMLNDVLMCATVSSRPSHDSRVMSSQRYLLKW SVPLGHVDAIEYGSSAGTGEHSRHLAVHPPESLAVVANAKPNKVYMGPGQLYQDLQNL LHDLNVIGQITQLIGNLKGNYQNLNQSVAHDWTSGLQRLILKKEDEIRAADCCRIQLQ LPGKQDKSGRPTFFTAVFNTFTPAIKESWVNSLQMAKLALEEENHMGWFCVEDDGNHI KKEKHPLLVGHMPVMVAKQQEFKIECAAYNPEPYLNNESQPDSFSTAHGFLWIGSCTH QMGQIAIVSFQNSTPKVIECFNVESRILCMLYVPVEEKRREPGAPPDPETPAVRASDV PTICVGTEEGSISIYKSSQGSKKVRLQHFFTPEKSTVMSLACTSQSLYAGLVNGAVAS YARAPDGSWDSEPQKVIKLGVLPVRSLLMMEDTLWAASGGQVFIISVETHAVEGQLEA HQEEGMVISHMAVSGVGIWIAFTSGSTLRLFHTETLKHLQDINIATPVHNMLPGHQRL SVTSLLVCHGLLMVGTSLGVLVALPVPRLQGIPKVTGRGMVSYHAHNSPVKFIVLATA LHEKDKDKSRDSLAPGPEPQDEDQKDALPSGGAGSSLSQGDPDAAIWLGDSLGSMTQK SDLSSSSGSLSLSHGSSSLEHRSEDSTIYDLLKDPVSLRSKARRAKKAKASSALVVCG GQGHRRVHRKARQPHQEELAPTVMVWQIPLLNI" BASE COUNT 2240 a 1923 c 2236 g 2068 t ORIGIN 1 ttccccaaat tgatggacat aaacccatat gcttatctca gcatgtgttt aaaaagcact 61 tgctgagatt cagtgaccat ccaacattaa aaactgctga tagaaggaaa ctcacttagc 121 tgaattaagg acgtgttctt aaaatctacc gccaacgtaa tggggaggag ctcacggcgt 181 tcgttaattt attcattccg caaatatgtt ttgggcagtt acataccgaa tagtagatgg 241 aggtgtgcct gctgtcatgg agatagggtg atttcatcct gttgatcagg aaaactccta 301 ggtgcttgca ggtaaatgtg ccacaaagaa agtgaggacc aaaggttagt tgatgtaaaa 361 acaagtttga aatgcatttt ggggtaattt atccggtcgc ttcgggcatt cctcgcggaa 421 ggcgtggtct ggtgactcag aagccaacac actgcgggag tccagccgtc ggccccctgc 481 cgtgtggcga ggcccagtgt gtcccctttg taaggacagc acaagcagga gttaatggac 541 cggccatcca tagcggtggt ggggcaggga gccagtttcc gaaagaaact cacgccgccg 601 caggagggcc ctgtgggatg ctctgtgcag agctgttgtg cggaccggga gacgggaaag 661 cctggtggct gcaggagggc accgtgcaga agtatccagt aaaccaccca cagcacggca 721 gcagaaaaac gaggaaatta tatgtgtgta tgtttataag aactcagaag caatggtgag 781 caaaaagcaa aagcaagagg agaaagtcac agtgtgctgg catcaagttg cattgagagg 841 agcccgcggg gtagtgcacc cgcatttcct cgttgcgttg agaggcgccc gcggggtagt 901 gcacccgcat ttcctcgttt gagaggcgcc cgcggggtag tgcacccgca tttcctagtt 961 gccttgagag gtgccgcggg gtagtgcacc cgcatttcct agttgccttg agaggtgccc 1021 gcggggtagt gcacccgcat ttcctagttg ccttgagagg tgccgcgggg tagtgcaccc 1081 gcatttcctc gtttcattga gcggtgcccg cggggtagtg cactcgcatt tcctagttgc 1141 cttgagaggt gccgcggggt agtgcaccca catttcttca ctcgtttaga gttcggggct 1201 ctcagaacac agggagaata tgggagaatt ccttactaga tgttacagag gccacacagg 1261 gccacttttt tctttttttt ttattgtgtc aggtatacgt aaatattcct ttcggtcagt 1321 tcagcacgta ggactgagtg gcattaggta cgctcactgt acagccaatg cctccatagg 1381 ccacttttta aatacgcggg gctaagggcc aacgacaaga ttgtcatcga ggacaataag 1441 tcgatggcgc tgcctggtca ctggcttggt cagaaaacac atgccagggt gactggattt 1501 aacgttcagt tttagaacca caaatctgcc agcccaggca ttcaagagga agtgagtaat 1561 tcactgaatt gatggttagc aagacccttc aaagtccttg gaagttccgt gtttgctggg 1621 ggtcacaaca gcaattgcgt ttctaaaaca ttgaaaacca cccgtttttc acacatctga 1681 atagcctgag ttgtaacaga cctaagtaaa ggcgtccaaa cgtgcctgat cctgtggctg 1741 ggtcccagga gccttaacaa ggcattgaga gagctggatt gattgattag actcttgcca 1801 actgctgtgc ggaatacaga aaatgcagct ccagctctca aagggctgaa aatctaattg 1861 aggtgaaaaa ggtaacatgt gtgaaaacca tgtctgtgta gacttgtttt gtatggacca 1921 gaaaagttgg aagtccagag atggatgcga ggaagtaggg gatggagttt ccttataacc 1981 tctggaggga cagtccagat acataccggg gtgtgtagcg ggtggaggtt ctaagacagg 2041 ctggaggaac agtgcagaca cacaccaggg tgtgtagggg gtagaggttc taagacagtc 2101 tggagggaga gtgcagacac acaccggggt gtgtaagcag tggaggttct aagacaggct 2161 ggaggaacag tggagacaca caccagggcg tgtagggggt agaggttcta agacaggctg 2221 gaggaacagt ggagacacac accacggcgt gtagggggta gaggttctaa gacagtctgg 2281 tgggagagtg cagacacacc cggggccgtg tagggggtac aggttctaag acagtctggt 2341 gggagagtgc agacacacac ggggctgtgt cggggataga ggttctaaga cagtctggtg 2401 ggagagtgca gacacacacc agggcgtgta gggggtggag gttctaagac aggctggagg 2461 aacagtggag acacacactg gggagcatag gggtgctttt ctctgagtcc cctagtacat 2521 ggtagaggct gtagaccctc cgctcttggg cacgtgggta ggctctcagg atgactcttg 2581 gctcttgggc atgtgggtga gctctcagga tgatgcccag ccccaatttt caggcaattg 2641 tgcaaggact tgacccattc atccgctgag ctgagttgct gagactgctg ggtgcccggg 2701 tggtgatcat tgtccgtggc atacagaaca cactgcagct tctgcaaagt gagctcattt 2761 cacgcatttt atggctttgc caggctgctg ttgacctgcc agaactttta atcagacatt 2821 tggaggacct gttttgtagt cagtggagaa atattacaag gatagggtaa tttgaaatat 2881 ctaaggattg taagtgacaa gttcatgtct aattttgcat ttccagtgaa agcaagtgtt 2941 ggctttgaat gttacttatg tgctgagatg tgtatattcc tcagtgctta attactaagg 3001 atttttaggg ccaagttttg ttacagtgaa tgattgtgga tgcataaaga ataaatttaa 3061 tatttttaag gcatggagat tatttgtatc taagaaacca ggtaaaataa agaaacattt 3121 atgcttgtgt gactgataaa agagttagag agacactcat attctgggag tttgaagaat 3181 gtcattttca ttctctaaaa gtcttgttag tgtcacagca ttgaaaattt aaaaatccgt 3241 gtgtattttc ttgctagtgc tggtacttga atatctgtat catccaccta tccatccacc 3301 tacccatatt tctataatcc accgtccctc gacatgccta tcatctgtcc accatttctc 3361 tctgtctaat tttcaaaaca tcctgtaagt ttatataaag gaagattttt cttcttgtga 3421 agttctctaa ggctgacaag ttacctggca tgactgtggc ggatgcccat agccaggtgg 3481 tcctcggggt acagatgggg caggggcact tgtgagaaac acctgaagtg cttttcccca 3541 gcctccccgg ccctgccggg tggtggaggc gctgcacggt gccttccatg gagcaagccc 3601 ggggctccgc agggtcctca gcatgattca gatttccttc cacccccagc tctagatgat 3661 ttggtaaaac cacaaacagg cacaaaacag cccacatgga attctaaagt tttaatttca 3721 ttttggaatt tatgcactca gatgaaatga tttatgatga tgttgagaat ggggatgaag 3781 gtggaaacag ctccttggaa tacggatgga gttcgagtga atttgaaagt tacgaagagc 3841 agagtgactc ggagtgcaag aatgggattc ccaggtcctt cctgcgcagc aaccacaaaa 3901 agcaactttc tcatgaccta acccgtttaa aggagcacta tgagaaaaag atgagagatt 3961 tgatggcaag cacggtgggc gtggtggaga ttcagcagct caggcagaag catgaactga 4021 agatgcagaa gctcgtgaag gccgcgaagg acggcaccaa ggacgggctg gagaggacca 4081 gggcagccgt gaagaggggc cgctccttca tcaggaccaa gtctctcatc gcacaggatc 4141 acagatcttc tcttgaggaa gaacagaatt tgttcattga tgttgactgc aagcacccgg 4201 aagccatctt gaccccgatg cccgagggtt tatctcagca gcaggttgta agaagatata 4261 tactgggttc agttgtcgac agtgaaaaga actacgtaga tgctcttaag aggattttgg 4321 agcaatatga gaagccgctg tctgagatgg agccaaaggt tctgagtgag aggaagctga 4381 agacggtgtt ctaccgagtc aaagagatcc tgcagtgcca ctcgctattt cagatcgcgc 4441 tggccagccg cgtttccgag tgggactccg tggaaatgat aggcgatgtc ttcgtggctt 4501 cgttttctaa gtccatggtg ctggatgcat acagtgaata tgtgaacaat ttcagcacag 4561 ccgtggcagt cctcaagaaa acatgtgcca caaagcccgc ttttcttgaa tttttaaagc 4621 aggaacagga ggccagcccc gatcgaacca cgctctacag cctgatgatg aagcccatcc 4681 agaggttccc acagttcatc ctcctgctcc aggacatgct gaagaacacc tccaaaggcc 4741 accccgacag gctgcctctt cagatggccc tgacagagct cgaaacacta gcagagaagt 4801 taaatgaaag aaagagagat gctgatcaac gctgtgaagt gaagcaaata gccaaagcca 4861 taaacgaaag atacctgaac aagcttctca gcagtggaag ccgatacctc attcgatcag 4921 atgatatgat agaaacagtt tacaacgaca gaggagagat tgttaaaacc aaagaacgcc 4981 gagtcttcat gttaaatgat gtgttaatgt gtgccaccgt cagctcacgc ccctctcatg 5041 acagccgtgt gatgagcagc cagaggtact tgctgaagtg gagcgttcca ctgggacatg 5101 tggacgccat cgagtatggc agcagcgcag gcacgggcga gcacagcagg caccttgccg 5161 ttcacccgcc ggagagcctg gccgtggttg ctaacgcgaa accaaacaaa gtttacatgg 5221 ggccaggaca actgtatcaa gatttacaaa acttgttgca tgacttaaat gtaattggcc 5281 aaatcactca gctgatagga aaccttaaag gaaactatca gaacttaaac cagtcagtag 5341 cccatgactg gacatcaggt ttacaaaggc ttattttgaa gaaagaagat gaaatcagag 5401 ctgcggactg ctgcagaatt cagttacagc ttcccgggaa gcaggacaaa tctgggcgac 5461 cgacgttctt tacagctgtg ttcaatacgt tcacccctgc catcaaggag tcctgggtca 5521 acagcttaca gatggccaag ctcgccctag aagaggagaa ccacatgggc tggttctgtg 5581 tggaagacga tgggaatcac attaaaaagg agaagcatcc tctcctcgtc ggacacatgc 5641 ccgtgatggt ggccaagcag caggagttca agattgaatg tgctgcttat aaccctgaac 5701 cttacctaaa taatgaaagc cagccagatt cattttccac ggcacatggt ttcctgtgga 5761 tcggaagttg cacccatcaa atgggtcaga ttgccatcgt ctcgtttcaa aattccactc 5821 ccaaagtcat tgagtgcttc aacgtggaat ctcgcatcct gtgcatgctg tacgttcccg 5881 tcgaggagaa gcgcagagag cctggggcac ccccggaccc cgagaccccg gccgtgagag 5941 cttctgatgt ccccacgatc tgtgtaggga cggaggaggg aagcatttcc atttataaaa 6001 gcagtcaagg ctccaagaaa gtgagacttc agcacttttt cactcctgag aagtccacag 6061 tcatgagcct ggcttgcacg tctcagagcc tgtacgctgg cctggtcaac ggggcagtcg 6121 ccagctacgc cagagcccca gatggatcct gggattcaga acctcaaaaa gtgatcaagt 6181 taggcgtcct accagttaga agtctactca tgatggaaga cacgttgtgg gcggcttccg 6241 gaggtcaagt cttcatcatc agtgtggaga ctcatgctgt agagggtcag ctggaggccc 6301 accaggagga aggcatggtg atctcccaca tggccgtgtc cggcgtcggg atctggattg 6361 ccttcacctc agggtccacg ctccgccttt ttcacacgga aactctcaag cacctgcagg 6421 acatcaacat cgccacccct gttcacaaca tgctgccagg gcaccagcgg ctgtcggtga 6481 cgagcctgct cgtctgccac ggattgctga tggtcggcac cagcctggga gtcctcgtgg 6541 ccctgccggt cccacgtctg caagggattc ccaaagtgac cggaagaggc atggtctcct 6601 accatgcaca caacagtcct gtcaaattca tcgtcctggc cacggctctg cacgagaaag 6661 acaaggacaa atccagggac agcctggctc ctggccccga gcctcaggac gaagaccaga 6721 aggacgcact tccgagtgga ggagctggtt catctctgag ccagggtgac cctgacgcag 6781 ccatctggtt gggagattcg ctgggatcga tgactcagaa aagcgacctg tcctcctcat 6841 ctgggtccct gagcttgtct cacggctcca gctctctaga gcacagatca gaggacagca 6901 ccatctatga tctcctgaag gatcctgtct cgctgagaag caaagcacgc cgggccaaga 6961 aagccaaggc cagctcggcg ctggtggtct gtggagggca gggccaccgc cgggtgcaca 7021 ggaaggcccg gcagccccac caggaagagc tggcgccgac cgtcatggtc tggcagatcc 7081 ctctgctgaa tatataagca ggacggccgc cttctgctgt cagaatttgc aatcaagggt 7141 gacttctcag ctaatcctac agcctgagtg gttaagctgt gtctacactg gttgggaata 7201 aattaaaaac agtatttggg ggagaaacgt gcaatagcgt aatggtggtg tccctgccaa 7261 ttccttcctt ctcttctgta cagcagaagt aattacaagc acttctcacg aaggcagaag 7321 actgatgcaa ttttcgagta attgagtgca gttctgggaa aataccacat tctttttgac 7381 tgctgtagtc catatatgaa tactaaatgt taaacttcat cagcgtcaga cctattgtat 7441 catattagag aatttgcaga ctaagaattt atgagaaaat atatgtattc agtagtgcag 7501 gcatttatta acaattctta aaagttttac ctgattcaga ttcacgactt ttatttatat 7561 tctatatttt tgaatttcag agtaaaattt gttaacaatt ttaaaagcca ggtaacacct 7621 accagtccag ttagcatgat ttgctttcag aagtgagctg ggttttccaa agtggtataa 7681 tgtgtgtact gtatatttta acaaagtaat atttttgtat tgcatttttc tattaaaaaa 7741 ttaacagtta atgtttcagt caatgtatta tctgtagcat ttcacaaata atgtttgctt 7801 tgaaccaaaa tgctcagtgc ctatcaacat ttggactcaa gcatcaacac caaattattc 7861 ctcccttctc gtataaatag agtgactatc cacaggagaa aagtgtgtgc tttagtatta 7921 gaggagatag gcagagaagt cttgcttagt tccttcgtgc agcttcttgc ccctgttgac 7981 gtggaatgct gtgtctgctt tagcacgcac gctccgaatg actcctggtg ctaggccatg 8041 ctggctgctg tcactgagcg ggactcaggc caagaggcgt gacctcgggc cagcctgtct 8101 gttgtgcaga cgcctcctct gcagaacgca tcagtttcta ttctgcagtt gcagagccag 8161 ccccgcgtga gaacgtgcat aatgagtgca caccatcatg tcaaggtgca tacttagtga 8221 gcgccatcct gctgaacgtg tatttcagtg tttcacttac tggacggata acaagaaaaa 8281 aatcctaaca caggcagtca ccagaaataa atgtctcagc actttacaga tgactaaaaa 8341 tgttaatttt atgacttagc caaatatgtt ctaggttgca tatatccccc atgtgaaagt 8401 gatttcttcc caagcttctc aaactgttag ctgctgtctg acttcatcaa taaagtattt 8461 ttatttt // LOCUS AB002294 7604 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0296 gene, complete cds. ACCESSION AB002294 NID g2224532 KEYWORDS KIAA0296. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HF0260. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7604) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..7604 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HF0260" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 424..5913 /gene="KIAA0296" CDS 424..5913 /gene="KIAA0296" /codon_start=1 /db_xref="PID:d1021592" /db_xref="PID:g2224533" /translation="MEDTPPSLSCSDCQRHFPSLPELSRHRELLHPSPNQDSEEADSI PRPYRCQQCGRGYRHPGSLVNHRRTHETGLFPCTTCGKDFSNPMALKSHMRTHAPEGR RRHRPPRPKEATPHLQGETVSTDSWGQRLGSSEGWENQTKHTEETPDCESVPDPRAAS GTWEDLPTRQREGLASHPGPEDGADGWGPSTNSARAPPLPIPASSLLSNLEQYLAESV VNFTGGQEPTQSPPAEEERRYKCSQCGKTYKHAGSLTNHRQSHTLGIYPCAICFKEFS NLMALKNHSRLHAQYRPYHCPHCPRVFRLPRELLEHQQSHEGERQEPRWEEKGMPTTN GHTDESSQDQLPSAQMLNGSAELSTSGELEDSGLEEYRPFRCGDCGRTYRHAGSLINH RKSHQTGVYPCSLCSKQLFNAAALKNHVRAHHRPRQGVGENGQPSVPPAPLLLAETTH KEEEDPTTTLDHRPYKCSECGRAYRHRGSLVNHRHSHRTGEYQCSLCPRKYPNLMALR NHVRVHCKAARRSADIGAEGAPSHLKVELPPDPVEAEAAPHTDQDHVCKHEEEATDIT PAADKTAAHICSICGLLFEDAESLERHGLTHGAGEKENSRTETTMSPPRAFACRDCGK SYRHSGSLINHRQTHQTGDFSCGACAKHFHTMAAMKNHLRRHSRRRSRRHRKRAGGAS GGREAKLLAAESWTRELEDNEGLESPQDPSGESPHGAEGNLESDGDCLQAESEGDKCG LERDETHFQGDKESGGTGEGLERKDASLLDNLDIPGEEGGGTHFCDSLTGVDEDQKPA TGQPNSSSHSANAVTGWQAGAAHTCSDCGHSFPHATGLLSHRPCHPPGIYQCSLCPKE FDSLPALRSHFQNHRPGEATSAQPFLCCLCGMIFPGRAGYRLHRRQAHSSSGMTEGSE EEGEEEGVAEAAPARSPPLQLSEAELLNQLQREVEALDSAGYGHICGCCGQTYDDLGS LERHHQSQSSGTTADKAPSPLGVAGDAMEMVVDSVLEDIVNSVSGEGGDAKSQEGAGT PLGDSLCIQGGESLLEAQPRPFRCNQCGKTYRHGGSLVNHRKIHQTGDFLCPVCSRCY PNLAAYRNHLRNHPRCKGSEPQVGPIPEAAGSSELQVGPIPEGGSNKPQHMAEEGPGQ AEVEKLQEELKVEPLEEVARVKEEVWEETTVKGEEIEPRLETAEKGCQTEASSERPFS CEVCGRSYKHAGSLINHRQSHQTGHFGCQACSKGFSNLMSLKNHRRIHADPRRFRCSE CGKAFRLRKQLASHQRVHMERRGGGGTRKATREDRPFRCGQCGRTYRHAGSLLNHRRS HETGQYSCPTCPKTYSNRMALKDHQRLHSENRRRRAGRSRRTAVRCALCGRSFPGRGS LERHLREHEETEREPANGQGGLDGTAASEANLTGSQGLETQLGGAEPVPHLEDGVPRP GERSQSPIRAASSEAPEPLSWGAGKAGGWPVGGGLGNHSGGWVPQFLTRSEEPEDSVH RSPCHAGDCQLNGPTLSHMDSWDNRDNSSQLQPGSHSSCSQCGKTYCQSGSLLNHNTN KTDRHYCLLCSKEFLNPVATKSHSHNHIDAQTFACPDCGKAFESHQELASHLQAHARG HSQVPAQMEEARDPKAGTGEDQVVLPGQGKAQEAPSETPRGPGESVERARGGQAVTSM AAEDKERPFRCTQCGRSYRHAGSLLNHQKAHTTGLYPCSLCPKLLPNLLSLKNHSRTH TDPKRHCCSICGKAFRTAARLEGHGRVHAPREGPFTCPHCPRHFRRRISFVQHQQQHQ EEWTVAGSGRGHEGSQEEVGTQWRGKSSPKVGGGARSERREPRGF" BASE COUNT 1670 a 2295 c 2261 g 1378 t ORIGIN 1 gagcagattc ctactgttct taaaggacag taatgccttt tgagtctggt ctgaagaaca 61 taacaggtct gtgatcagaa gtaggttgca tctctctcaa ctttaatttc cttagctata 121 cctgtaggga tgacttaagc ctaggggagc tcctatattt gggaagcttg tgcacaggga 181 agccttaaat gatggtgcct gcagattgga tctagtagaa attaggtcct tgggcatgga 241 tgcttgggga acctctcagt gacctcaggt gaacttgttg ctcgtagagc caagaggcga 301 agttaattca ggccttcctt ttgaccactg ccccctcttc ctaggccttg gcccctccac 361 cagaggaagg tgctgccacg tgtctgctcc ttctgaacct ccaggtttct gctacgttgc 421 cccatggagg acacaccccc ctcactcagc tgctccgact gtcagcgcca ctttcccagc 481 ctcccagagc tctctcggca ccgagaactg ctccatccat ctcccaacca ggacagtgag 541 gaggctgaca gcatccctcg gccctaccgt tgtcagcagt gtgggcgggg ctaccgtcac 601 cccgggagcc tggttaacca tcgtcggacc cacgagactg gccttttccc ctgtaccacc 661 tgtggcaagg acttctccaa tcccatggct ctcaagagcc atatgaggac acatgctcct 721 gagggccgcc gcaggcacag gcccccacgc cccaaggaag ccactccaca cctccagggt 781 gagacggtgt ccactgactc ctggggccaa aggcttggct ctagtgaagg ctgggaaaac 841 cagacaaaac atacagaaga gacacctgac tgtgaatctg tacctgaccc cagggcagct 901 tcgggtacgt gggaagatct gcccaccaga caaagagaag gcttggcaag ccacccaggt 961 cctgaggatg gtgcagacgg ctggggaccc tccactaact ctgccagagc ccctcctctc 1021 cccatcccag ccagcagcct tcttagcaac ttggaacagt atctggctga atcagtagtg 1081 aacttcacag ggggccagga gcccacccag tcccctcctg ctgaggagga gcggcggtac 1141 aaatgtagtc agtgtggcaa gacctacaag cacgccggga gcctcaccaa ccaccgccag 1201 agccacacgc tgggcatcta cccctgtgcc atctgtttca aggagttctc taacctcatg 1261 gctctgaaga accactctcg actgcatgcc cagtatcggc cttaccactg tccccactgc 1321 ccccgtgtct tccggctccc ccgggagctg ctggaacacc agcagtccca tgagggtgaa 1381 aggcaggagc cacgctggga ggagaaaggg atgcccacca ccaatgggca cacagatgag 1441 agcagccagg accagctccc cagtgcacag atgctgaatg gctctgcgga gctcagcacc 1501 tctggggagc tggaggacag tggcctggag gaataccggc ctttccgctg tggggactgt 1561 ggccgtactt accgccatgc tgggagcctc atcaaccatc gaaagagcca ccagacaggt 1621 gtctacccct gctcactctg ttctaagcag ctgttcaatg cggctgccct caaaaaccat 1681 gtgcgggctc atcacaggcc caggcaagga gttggggaaa atgggcagcc atcagtccca 1741 ccagctcccc tgctgctggc tgagaccacc cacaaagagg aagaggaccc caccaccacc 1801 ctggaccatc ggccctataa gtgcagtgag tgtggtcgtg cttaccgcca ccgggggagc 1861 ctggtgaacc atcgccacag ccatcggact ggagagtacc agtgctcact ctgtccccgc 1921 aagtacccca atctcatggc cctgcgcaac cacgtgcggg tacattgcaa ggctgctcgc 1981 cgaagtgcag acatcggggc tgagggtgcc cccagccacc tcaaggtaga actcccgcct 2041 gacccagtgg aggcagaggc agccccgcac acagatcagg accatgtgtg caaacatgaa 2101 gaagaggcca cggacatcac cccagcagca gacaagacag cagcacatat ctgtagcatc 2161 tgtgggctgc tctttgaaga cgctgagagc cttgaacgtc atggcctgac tcatggggca 2221 ggggaaaagg aaaatagcag aacagagacc acaatgtcac ctcctagggc ctttgcctgc 2281 cgagactgtg gaaagagcta tcgccactca ggcagcctta tcaaccacag gcagacccac 2341 cagacaggag acttcagttg tggggcctgt gccaagcact tccacaccat ggctgccatg 2401 aagaaccact tgcgccggca cagtcggcgg cggagcaggc ggcatcggaa gcgggctggc 2461 ggtgccagcg gtgggagaga agccaaactc ctggcagcgg agagctggac ccgggagcta 2521 gaagacaatg aaggcctgga gtctccccaa gacccttcag gggaaagtcc tcatggggct 2581 gaaggcaacc tggaaagtga tggggactgt ttgcaggctg aatctgaagg ggacaaatgt 2641 gggcttgaga gggatgagac ccatttccag ggtgataaag agagcggagg cactggggaa 2701 ggactggaaa ggaaggatgc cagtttactt gacaacttgg acatcccagg tgaggaaggt 2761 ggtggcactc acttctgcga tagcctcact ggggtggatg aagaccagaa gccagccact 2821 ggccaaccca actcctcttc ccactctgcc aatgctgtca ctggctggca ggctggggcc 2881 gctcacacat gctctgactg tgggcattct ttcccccatg ccactggcct gctgagccac 2941 aggccctgcc acccaccagg catctatcag tgctccctct gcccgaagga gtttgactct 3001 ctgcctgccc tccgcagcca cttccagaac cataggcctg gggaggcgac ctcagcacag 3061 cctttcctct gctgcctctg tggcatgatc ttccctgggc gggctggcta caggcttcac 3121 cggcgccagg cccacagctc ctctggcatg actgagggct cagaggagga gggggaagag 3181 gaaggagtgg cagaggcagc ccctgcacgc agtccaccac tgcagctctc ggaagcagag 3241 ctgctgaatc agctgcagcg ggaggtggaa gcgctggaca gtgcagggta tgggcacatc 3301 tgtggctgct gtggtcagac ctacgatgac ctggggagcc tggagcgtca ccaccaaagt 3361 cagagttctg ggactactgc agacaaggct cccagcccct tgggagtggc aggtgatgcc 3421 atggagatgg tcgtggacag tgtcttggag gacatagtga attctgtctc tggagagggt 3481 ggagatgcca agtctcaaga gggagcaggc acccccttgg gagacagcct ctgcatccag 3541 ggtggggaaa gtttgttgga ggctcagccc cgccccttcc gctgcaacca gtgtggcaag 3601 acctatcgcc atgggggcag cctggtgaac caccgcaaga tccaccagac tggagacttt 3661 ctctgccctg tctgctcccg ctgctacccc aacctggctg cctaccgtaa tcatctgcgg 3721 aaccaccctc gctgcaaagg ctctgagccc caggttgggc ccatcccaga ggcagcaggt 3781 agcagtgagc tgcaggttgg gcccatccca gaaggaggca gcaacaagcc ccagcacatg 3841 gcagaggagg ggccggggca agcagaagtc gagaagctcc aggaagaact taaagtggag 3901 cccctggagg aagtggccag ggtgaaagaa gaggtgtggg aggagaccac tgtgaagggg 3961 gaggagatag agcccaggct ggagactgcc gagaagggct gccagactga agccagctct 4021 gagcggccct tcagctgcga ggtgtgtggc cgatcctaca agcacgccgg cagcctcatc 4081 aaccaccggc agagccacca gaccggccac tttggctgtc aggcctgctc caagggcttc 4141 tcaaacctca tgtccctcaa gaaccaccgg cgcatccatg cagatccccg acgtttccgc 4201 tgcagcgagt gtgggaaggc cttccgcctg cggaaacagc tggccagcca ccagcgggtc 4261 cacatggaac ggcgtggggg tgggggcacc cgaaaggcga ctcgggaaga tcggcccttc 4321 cgctgtgggc agtgcgggcg gacctatcgc cacgccggca gcctcctgaa ccaccggcgc 4381 agccacgaga cgggccagta cagctgcccc acctgcccca agacctactc caaccgcatg 4441 gccctgaagg accaccagag gctgcactca gagaatcggc ggcgacgggc tggacggtcc 4501 aggcgcacag ctgtgcgttg cgccctctgt ggccgcagct tccctggccg gggatctttg 4561 gagcggcacc tgcgggagca tgaggagaca gaaagggagc cagccaatgg ccagggaggc 4621 ctggatggca cagcggccag tgaggcgaac ctgactggca gccagggact agagacccaa 4681 ttgggtggtg ctgagccagt accccacttg gaggatggag tcccaaggcc aggggagcgc 4741 agtcagagcc ccatcagggc agcaagctca gaagccccag agccactgtc ctggggtgca 4801 gggaaggcag gtgggtggcc ggtaggtggg ggactgggga atcatagtgg aggctgggtt 4861 cctcagttcc taactaggtc agaggagcca gaggacagtg tccacaggag tccttgccac 4921 gctggtgact gccagctcaa tggacctact ctgagtcaca tggatagctg ggacaacaga 4981 gacaacagct ctcagctgca gccagggagc cactcctctt gcagccagtg tggcaagact 5041 tactgccagt caggcagcct cttgaaccac aacaccaaca agacagaccg acactattgc 5101 ctgctctgct ccaaggagtt cttaaatcct gtggccacaa agagccacag ccacaaccac 5161 atagacgccc agacctttgc ctgtcctgac tgtggcaaag cctttgagtc ccaccaggaa 5221 ctggccagcc acctgcaggc tcatgcccgg ggccacagcc aggtgccagc ccagatggag 5281 gaggccagag atcccaaagc cgggactggg gaggaccagg tggttctccc tggtcaaggg 5341 aaagcccagg aggccccatc agaaaccccc agaggcccag gagagagtgt ggagagagcc 5401 aggggaggac aagcggtgac gtccatggcg gctgaggaca aggagcggcc cttccgctgc 5461 acccagtgcg ggcgctccta ccgccatgct ggcagcctgc tgaaccacca gaaggcccac 5521 accacagggt tgtacccgtg ctccctctgt cccaaacttc tccctaacct gctgtctctt 5581 aagaaccaca gcaggaccca cacggacccc aagcgccact gctgcagcat ctgtggcaag 5641 gcctttcgga cagctgcccg gctggagggc cacgggcggg tccatgcacc ccgggagggg 5701 cctttcacct gcccccattg tccccgccac ttccgccgcc gaatcagctt cgtgcagcac 5761 cagcagcagc accaggagga gtggacggtg gccggctccg gtagggggca tgaagggtcc 5821 caggaggagg tgggcacaca gtggaggggg aagtccagcc ccaaagtcgg tgggggagca 5881 aggagtgaga ggagagagcc ccggggattc taagaggtgg gtgggggctt ggctatgggg 5941 tgagagaagt agcttgagga tgtgctgagc tgagcacccg caagtcaggt ataacaaata 6001 gcagggtggg ttgggcagca cgtgggggcg tggtcaggcc gaggctgcta cctgggctcc 6061 tccattacac tgtagccaga atggaatggt ctttctgttc aggggaaggt cactgggtac 6121 cccctggctg ctgtgtctgg aaaccctcct gagtcagcca gtaaagtaat gacttccaga 6181 gaaaaagagg aagccattgg tttggtctag gttccattct ttcctggagc aggccgggtg 6241 ccagggaaca agggatgggg catgggctcc acggcttccc tgctgacttg gccacggaaa 6301 ctggttcact ggttggcacc ctactccctg tccctctttc cctgcgcctt gtctctgctg 6361 ctcctctcct tggaaactag acctctggtc cttccctgtc agtgttgctc ccatctcttc 6421 tctaaccttt attcagcccc ttttccctct gctgccaacg gcctttttag gatccaacca 6481 aaccaccctt tctacctgcg caccctgcca ccctctgcac acctttaact ggaggactga 6541 gtcacagata attgtttcct tgaagtccag gcccagctgc agcaacaaca gtcattagcc 6601 cgtgtcacat ccctgatcag agggcatctc cgtggggaat cgcctccacc cagcactgct 6661 ggaagccgcg gctgccaggg agtggggcgg ccggttccct cagcaggacc tgggctggcc 6721 tctccacctc ccctagtaga ggcggaccca ttccatctag tggccaccga gggtgggtgg 6781 ccctgagatg gtgggccctt gacaggcctt gtcagagcag agggcaggtg ggagtcacct 6841 gaaagctgaa ggaatggctt taaggataga agatttctca tgacctcaag ggatatgagg 6901 gaggagccag tttgccaggg ctgggaaaat aattaggagg cctagaatcc ctgttctcat 6961 ctgggcctcc ggggccaggg gcaggggaat ggcctgcagg gctgggaggg ggtacacgct 7021 gtgcggggtc tgcccctcag ttggtgacct cctctctctc tccccccagg agccccagtg 7081 gcaccagtga cgggcagagg ggacttgcca ttgccccctc cacccacccc cacgacccca 7141 ctcctggatc cttcacccca gtggcctgca gacctcagct tctccctctg aacttcaagt 7201 ctccaaagat cagaatctgg gggagggagc gcgtgcaggg aggggcttga tctccacatt 7261 ttctcaggag tagttcgggc atccccatat cttctcctct ccccttgtga agaggaccca 7321 gatctggctt ctttcccaag gagggggtgg ggtgttcctc gcgtccctgt ccttgaagga 7381 cctccttccc ccagcctcat caccgtgctc ttctcagcgc caccctcagc agccagattg 7441 caacaccagg gagaggtgga tgcagagccc caccggtggg aaagttgcct gtggaaggga 7501 gccttttgct acaatttgta acttattttc taaagtctat tttgtaacaa tttatttaag 7561 tttaaaaaaa ggaaaactgc tgccccccaa aaaaagaaat tttc // LOCUS AB002296 8001 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0298 gene, complete cds. ACCESSION AB002296 NID g2224536 KEYWORDS KIAA0298. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HF0341. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8001) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..8001 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HF0341" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 2866..4659 /gene="KIAA0298" CDS 2866..4659 /gene="KIAA0298" /codon_start=1 /db_xref="PID:d1021594" /db_xref="PID:g2224537" /translation="MSRADAPAYGGLQGSSPFYQSHQSPVAQQEALSHPSHKFQSPAV CSSSVCCSHCSPVSPSLKGQVPPPSIHPAHSFRQPPEMVPQQLGSLQCSALLPREKEL ACSPHPPKLLQPWLETQPPVEQESTSQRLGQQLTSQPVCIVPPQDVQQGAHAQPTLQT PSIQVQFGHHQKLKLSHFQQQPQQQLPPPPPPLPHPPPPLPPPPQQPHPPLPPSQHLA SSQHESPPGPACSQNMDIMHHKFELEEMQKDLELLLQAQQPSLQLSQTKSPQHLQQTI VGQINYIVRQPAPVQSQSQEETLQATDEPPASQGSKPALPLDKNTAAALPQASGEETP LSVPPVDSTIQHSSPNVVRKHSTSLSIMGFSNTLEMELSSTRLERPLEPQIQSVSNLT AGAPQAVPSLLSAPPKMVSSLTSVQNQAMPSLTTSHLQTVPSLVHSTFQSMPNLISDS PQAMASLASDHPQAGPSLMSGHTQAVPSLATCPLQSIPPVSDMQPETGSSSSSGRTSG SLCPRDGADPSLENALCKVSPGEMLSKLPLFIIGQKIGHWDPYSDLSLTVLRPLMTTM SEFFDSCRHFTFERWKVRIPLASLTYWDKVP" BASE COUNT 1952 a 2059 c 1960 g 2030 t ORIGIN 1 gacagatttc aatgtctgtt gaaggccccc ccacaccctc cttcagtgcc gggggcatgg 61 agggatatct tgggctcctg ctgtttttta aaatgatgcg agctcatctc ctgtcctggg 121 tgtgaacgag tatatcttac cagggatgta actgaacatt tttttctgca ttgtgttcct 181 actgagcaac ccaagatggc caggaactgc tctgagtgca aggagaagag ggcagcacat 241 atcctctgca cctactgcaa tcgctggctg tgcagctctt gcacagagga acaccgacac 301 agccctgtcc ccgggggccc attctttcct cgggcccaga agggatctcc aggagtgaat 361 ggtggtcccg gagacttcac cttgtattgt cctctacaca cacaggaagt actcaagcta 421 ttctgtgaga catgtgatat gctcacttgc catagctgcc tagtggtgga acacaaagaa 481 cacaggtgca gacatgttga agaagttttg caaaaccaga ggatgcttct ggaaggtgtg 541 actacacagg tggcacataa gaaatccagt ctacagacat ctgcaaagca aattgaggac 601 aggatttttg aagtgaagca tcagcatagg aaggtggaaa accagatcaa aatggccaag 661 atggttctga tgaatgagct gaacaaacag gccaatgggc taatagagga attagagggg 721 attactaatg agagaaagcg gaagctggaa cagcagttac agagcatcat ggttctcaac 781 cgtcagtttg agcatgtgca gaatttcatc aactgggctg tctgcagcaa aaccagtgtc 841 ccttttcttt tcagcaaaga gctgattgtg tttcagatgc agcgattgct ggagacaagt 901 tgtaacacag atcctggctc cccttggagt atcagattca cctgggagcc taacttctgg 961 accaagcagc tagcttctct tggtgagctt ggacccaccc acccaggcct acccaccctg 1021 tgggctttga cttaattgct taccttactg ggtaggtcag catatgccca agagaaagca 1081 ctctctggga ctgccagtgg tcatggaatc attcagcaaa cctttttttt tttttttttt 1141 ttttttttga gggggaagag gagaggatga ggaattaaat aacagtggca atataggact 1201 ttgtatcagg atgagttgaa atctgagttt accattctgt gtaaatgtcc tgttacattg 1261 gttttattgc tgaggaggag acaagacagt caccctgctc tccggaatgc tgtatggcag 1321 tacagatgaa tgtgtgtctc tgtggtcaca caaaggtata cacgtgtggg cacaagcagg 1381 ggtatctcag caacagtgag actgcgtgtg catgtttgtg tcacacagac atatcctttc 1441 gagttgagat gtaagcagat aattcatggc cttgtgcatt tttcttccct gctgcaatca 1501 tttgctggca ttcagtcttt aggcagaatg tagacagctt gaagttgtgc ctgtgttgac 1561 agcctaacga gtgttaaact gatggccagc tgctgccttc atttgaggct gttaatctta 1621 ttgaaaattt gttgaacttg gcaaagtgac aaatgatatg taggtttatt tcttttcttc 1681 cctctttcct ttgatttaac tcagcagacc tttgatggcc acctactctt gtttaggccc 1741 catgctgagc accaaagaac caatggtgaa taaaacacat tttccttaag gagctcagac 1801 ttcttctaag taagtctcac caaaataatg agaggaagga gtcattagac tgagttagcc 1861 atagatctac ttgtgcacaa gggacctaga ggcctcaggg ggcagtgccg gtgcctggca 1921 ctgacctggt gtattagtcc attctcgcat tattgtgaag aaatacctga gactgggtaa 1981 tttataaaga aaagaggttt aattggctca tgattctgca ggatgtacag gaagcatgat 2041 cctggaacct gctcggcttc tggggaggcc tcaggaaact tacaatcatg acagaaggtg 2101 aagacgcaca tcacatggac agagcagcaa gaaacagagc aagggggagg tgttacacac 2161 tttaaagaaa tcaggtctca cgagacctct atcacaagaa cagcaccaaa gggacggtgc 2221 taaaccactc atgagggatc cgcccccacg atccagtcac ctcccaccag gccccacctc 2281 caacactggg gattacaatt caacatgaga tttgggcggg gacacagctc caaactgtat 2341 cacctggtat tagagcttcc ccactctctt ccgtcctact tgctcaacac acagttcccc 2401 acacctgtat ctgggaactt gttcattggt cccaggcatg gctctctacc ctttgctact 2461 ttctagcata ctccctccca accccaaagt tgattctttt tgtgagagtc tgattgggct 2521 gttctgttgg aaaaaaatga gtgggtgcta tttgagagag cttgtggttt ccctagtggg 2581 acagaaacag agtcagggct gagcatctgg taaaagctga actaatttct caaatatcct 2641 aggcccccaa agcaggtgca tacaggtgct attactggca gcattctaga ccgagtctag 2701 gagatcaagg gttgagtttc ttaatctctg ttcctggtac aggattgttg aatgaggttc 2761 ttttgtccca ggagcaggaa gtagcaaggt gggagcaact tgaaaaactt ttgaaaaact 2821 ctctattctg cattttcagg ctgcataact actgaaggtg gacaaatgtc cagggcagat 2881 gctcctgctt atggaggctt acaggggtca tcaccctttt atcaaagcca ccagtctcca 2941 gtggctcagc aagaggctct tagccacccc tcacacaagt tccagtctcc agcagtgtgc 3001 tcctcatctg tgtgctgctc ccactgctcc ccagtctcgc cttccctcaa aggccaggtc 3061 cccccaccca gcatacaccc agcccacagc ttcaggcagc cccctgagat ggtgccccag 3121 cagctggggt ctctgcagtg ctctgccctg ctgcccaggg agaaagagct ggcctgcagc 3181 cctcatccac caaagctgct gcagccctgg ctggaaaccc agccccccgt ggagcaggag 3241 agcacatccc agcggctggg gcagcagctg acttcccagc ccgtgtgcat tgtcccccca 3301 caggatgttc agcaaggagc ccatgcccag cccaccttac agacaccctc tatccaagtc 3361 cagtttggcc accaccagaa gctgaagctc agtcactttc agcagcagcc acagcagcag 3421 ctaccacctc caccaccacc cctcccccat cccccacctc cccttccccc tcccccacag 3481 cagccacacc cacctcttcc tccatcccag catctggctt ctagtcagca cgagagccct 3541 cctggccctg cctgttctca gaacatggac ataatgcatc acaagtttga gctggaggaa 3601 atgcagaagg acttggagct tcttctccag gctcaacagc ccagcctgca actgagtcag 3661 accaaatctc ctcagcatct tcagcaaacc attgtggggc agatcaacta catcgtgagg 3721 cagccagcac ctgtccagtc ccagagccag gaggagaccc tgcaggctac agatgagccc 3781 ccagcatctc agggctcaaa gccggctctc cctcttgaca agaatactgc tgctgccttg 3841 ccccaggcgt ctggggaaga aacccctctc agtgtccccc cagtggacag caccatccag 3901 cactcctctc caaatgtggt gagaaagcac tccacctcgc tgagcatcat gggcttttcc 3961 aacactctgg agatggagtt gtcatctacc aggttggaga ggcccctaga gccacagatc 4021 cagagtgtga gcaacctgac agctggtgcc ccccaggcag taccaagcct gctgagtgct 4081 ccccccaaaa tggtgtccag cctgacaagt gttcaaaacc aggccatgcc cagcctgaca 4141 accagtcacc tacagactgt gcccagcctt gtgcatagca cattccagtc catgcccaac 4201 ctgataagtg actcccctca ggctatggca agcctggcaa gtgatcaccc tcaggctggg 4261 cccagcctaa tgtctggtca cacccaggct gtgccgagtc tggcaacttg tcctctgcag 4321 agcatccctc cagtttctga catgcagcca gaaactgggt ccagctccag ttctggccga 4381 acttcaggga gcctgtgtcc cagagatggg gctgatccct ccctggagaa tgctctgtgt 4441 aaggtaagtc ctggggaaat gctatccaaa cttcctttgt tcatcattgg gcaaaagatt 4501 ggccactggg acccttattc tgaccttagc ttaacagttc tgagaccact aatgacaacc 4561 atgtctgagt tctttgattc ttgtcgacat ttcacttttg aaagatggaa agtgaggatt 4621 ccactcgctt cactgactta ctgggacaag gtcccatagt ccccggtctg gatgctccca 4681 aggacttggc catcccctca gaactggagg agccaattaa cctctctgtg aagaaacctc 4741 cactggcgcc agtggtcagc acgtctacag ctctgcagca gtaccagaac ccaaaagagt 4801 gtgagaattt tgaacaagga gccctagagc tggatgcaaa agagaaccag agcatcagga 4861 gagatgcctg tgttcaaact gaagccacag aagaatgatc aggatgggag cttcctgctg 4921 atcatcgagt gtggcactga gtcctccagc atgtccatta aggggagagt gggtgtgtac 4981 cttgtgccgc agcctgaccc agcccgagat ggagtacgac tgtgagaatg cctgctataa 5041 ccagcctgga atgcgggcat ctcctggcct aagcatgtat gaccagaaga agtgtgagaa 5101 gctggtattg tccttgtgct gcaataacct cagcctgccc ttccatgaac ctgtcagccc 5161 cctggcccgg cattattacc agattatcaa gaggcccatg gacctgtcaa tcatccggag 5221 gaagctgcaa aagaaggacc cagctcacta taccacccca gaggaggtgg tatcagatgt 5281 gcgcctcatg ttctggaact gtgctaagtt caattatcct gactccgagg ttgcagaggc 5341 tggccgctgc ctggaagtgt tctttgaggg ctggttgaag gagatctacc cggagaaacg 5401 gtttgcccag ccaaggcagg aggactcaga ctccgaggag gtgtctagtg agagtggatg 5461 ttccactccc cagggcttcc cgtggcctcc ctacatgcag gagggcatcc aacccaagag 5521 gcggcgacga catatggaga atgaaagagc aaaaagaatg tcatttcgcc tggccaacag 5581 catctctcag gtgtgagagc caaaaggaga ctgggcactc tggcagctgt tgtcccatac 5641 tgtcgaccat tcctccccat cttgcagctt atcctcttca gagtgtggat ggtagatgag 5701 tcttgttgaa ctgtgtagtg ctcttctttg ccatttgcat ctacagcatt tatttaggag 5761 ccgtagtaca tccactacag agaagatttc agtaataaac aggaaagagg aaggtattta 5821 ttattaccag agtaatcaga tgtataggct ccacttgaca agaccacacc tggttaggct 5881 cgggaaaacc tactttcctt ggtgaatttt tttctgcctg ggaaagcaga cccgtctaac 5941 cttttagtgt ccagggctca ggagccatgt atacatctga actcctcctg ggctcagaag 6001 tgggtaaaag aagggagaga ggtttagaca tatggcagga ctgtgatccc tggcccagca 6061 cagccaaaga ctgtcactgt ttgcctttgc ttgtcctgtc tggatagagg cacctgccat 6121 gtgcagacta gtggaggctt ctagaggcca taggtctcaa tttgtgccta tttgggttct 6181 aacggttttc agggtatttg ttttgcatag tcactttcct gatgttgaca taggtggctg 6241 ctgtgaaaat gctgtggatg gtgtgacatt cttgactgag ctgggtggct gtggagagac 6301 cacttaaatc ttctggttca gatttcactt acagacttag taagatgtcc acttcagagc 6361 ccgccctctg tgctacttac tctgggctaa tcacacttct ttagggtgag aacctgcctt 6421 cctagaagtg actgttggtg cttgtcagtg gccataccct tcatcatcct cattcagagg 6481 tatggatcgt gagttctgct tgtgaatact gtgagacagc tgctgtctgc ttcaggtcca 6541 gagttccatg ggtggagaga gtaacccaca accagctcac tcctctgggc ctctcctcct 6601 gcttaaagat tctagttctc aacctctcca agttcatgag ccaaagatca gaagtgagtc 6661 ttctgaccag cccgcctcca ggctgtctac agccccaaaa aagccaacac ttttctggca 6721 gccctttttg ttggaggtta gagaaggcct cagggaccaa caggggccag atctagacat 6781 tcctggggga agtttgtcta aggtggacag ggggaaaaat caagaggcaa aaagccagtg 6841 tcctttggtg accatgaagc aacctcagag ccttagttct cctagaatgc agagagaagc 6901 attctggaag ccatgggaac aggagctcga ggctgcatcc tgaatgcatc tgcatggccc 6961 tcgcagggtg tttgtgagga tactggaggt caggctacct cccaagcaag ggcttggttg 7021 agaaggaacc cattgctcat attgttgggg gagctatgaa cctgacatgt cagtggccag 7081 gactgagaca gctctccctg gtcctgtcat tgcatggcta atttcagggg agtctgaggt 7141 catggcttta tttttgctaa cagatgaagt tcaaggatgt tcttgtttgt agggcctttg 7201 ctaatgagca gttttttaat tagaaaacat ttcttctctc tttgtgagtt gatggtattg 7261 ctcatttcct tctctaccca acctgagaga tcatcttcac tttaaagcaa accatgcccc 7321 ttggcttgcc attctttcag cagcacatgc cacgttctaa gcagatgggc ttccgtgatc 7381 ccgttttcta gtttggggta actgagtctt gaatgcttta ctagtccggc aatctttgga 7441 cttaggcttc tgccctttga gactcacatg actttctggt ttggggtctg gtcatttccc 7501 tttcaatttt tgaatctcct tctctgttca gttggcttgg caaagtaccc tctgttctca 7561 tgtcactaat tcctgctcta ggtcatcttt cctttgtgca gcccacattc ccaccatcct 7621 gtgaactttg ttgtacttgg tgttgggcaa ctggtgagtg ttgtgacagg ctcagacctg 7681 atccttcctc aggagcaatt ggagaccaca aatttaaagc agaaatacct gttaaaagct 7741 tggctttcct tatatgagtg aggctcccag cttgggtgtg gcctcctggg ctcacggatc 7801 agcttgctgg gccagagggg gtcttagtaa tgtgctagag agacacatgc tcccttctca 7861 tcgtagagag cctgcatgct gagaggctcc ctttacacac ctgcctctcg ctggcttttc 7921 ctttagacct agccaccagt gttttgaatc atggttttgt tttgtttttg catgcaaaaa 7981 ccagaacatg acttagaaat t // LOCUS AB002302 6252 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0304 gene, complete cds. ACCESSION AB002302 NID g2224548 KEYWORDS KIAA0304. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG0016. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6252) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..6252 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG0016" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 1343..5932 /gene="KIAA0304" CDS 1343..5932 /gene="KIAA0304" /codon_start=1 /db_xref="PID:d1021600" /db_xref="PID:g2224549" /translation="MGGLSVLTSVPGGPPMVCLLCASKGLHELVFCQVCCDPFHPFCL EEAERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRHAYHPACLGPSYPTRA TRKRRHWICSACVRCKSCGATPGKNWDVEWSGDYSLCPRCTQLYEKGNYCPICTRCYE DNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGPCAGAAQPRWREA LSGALQGGLRQVLQGLLSSKVVGPLLLCTQCGPDGKQLHPGPCGLQAVSQRFEDGHYK SVHSFMEDMVGILMRHSEEGETPDRRAGGQMKGLLLKLLESAFGWFDAHDPKYWRRST RLPNGVLPNAVLPPSLDHVYAQWRQQEPETPESGQPPGDPSAAFQGKDPAAFSHLEDP RQCALCLKYGDADSKEAGRLLYIGQNEWTHVNCAIWSAEVFEENDGSLKNVHAAVARG RQMRCELCLKPGATVGCCLSSCLSNFHFMCARASYCIFQDDKKVFCQKHTDLLDGKEI VNPDGFDVLRRVYVDFEGINFKRKFLTGLEPDAINVLIGSIRIDSLGTLSDLSDCEGR LFPIGYQCSRLYWSTVDARRRCWYRCRILEYRPWGPREEPAHLEAAEENQTIVHSPAP SSEPPGGEDPPLDTDVLVPGAPERHSPIQNLDPPLRPDSGSAPPPAPRSFSGARIKVP NYSPSRRPLGGVSFGPLPSPGSPSSLTHHIPTVGDPDFPAPPRRSRRPSPLAPRPPPS RWASPPLKTSPQLRVPPPTSVVTALTPTSGELAPPGPAPSPPPPEDLGPDFEDMEVVS GLSAADLDFAASLLGTEPFQEEIVAAGAMGSSHGGPGDSSEEESSPTSRYIHFPVTVV SAPGLAPSATPGAPRIEQLDGVDDGTDSEAEAVQQPRGQGTPPSGPGVVRAGVLGAAG DRARPPEDLPSEIVDFVLKNLGGPGDGGAGPREESLPPAPPLANGSQPSQGLTASPAD PTRTFAWLPGAPGVRVLSLGPAPEPPKPATSKIILVNKLGQVFVKMAGEGEPVPPPVK QPPLPPTISPTAPTSWTLPPGPLLGVLPVVGVVRPAPPPPPPPLTLVLSSGPASPPRQ AIRVKRVSTFSGRSPPAPPPYKAPRLDEDGEASEDTPQVPGLGSGGFSRVRMKTPTVR GVLDLDRPGEPAGEESPGPLQERSPLLPLPEDGPPQVPDGPPDLLLESQWHHYSGEAS SSEEEPPSPDDKENQAPKRTGPHLRFEISSEDGFSVEAESLEGAWRTLIEKVQEARGH ARLRHLSFSGMSGARLLGIHHDAVIFLAEQLPGAQRCQHYKFRYHQQGEGQEEPPLNP HGAARAEVYLRKCTFDMFNFLASQHRVLPEGATCDEEEDEVQLRSTRRATSLELPMAM RFRHLKKTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYD GKGIGCYMFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRI LRGEELTYDYKFPIEDASNKLPCNCGAKRCRRFLN" BASE COUNT 1160 a 2028 c 1860 g 1204 t ORIGIN 1 gcttccatgc cgctgagccc tggagggcag atggaggagg tggccggggc tgtcaagcag 61 atctccgaca gaggccctgt ccggtctgaa gatgagtcgg tggaagctaa gagagagcgg 121 ccctcaggtc ccgagtcccc tgtgcaaggt ccccgcatca aacatgtctg ccgtcatgct 181 gctgtggccc tgggtcaggc ccgggccatg gtgcctgaag atgtccctcg cctcagtgcc 241 ctccctctcc gggatcggca ggacctcgcc acagaggata catcatcggc gtccgagact 301 gagagtgtcc cgtcacggtc ccggcgggga aaggtggagg cagcaggccc tgggggagaa 361 tcagagccca caggttctgg agggaccctg gcccacacac cccggcgctc actgccctcc 421 catcacggca agaagatgcg catggctcga tgtggacact gtcggggctg cctacgtgtg 481 caggactgtg ggtcctgtgt caactgccta gacaagccca agtttggggg ccctaacacc 541 aagaagcagt gctgtgtata ccggaagtgt gacaaaatag aggctcggaa gatggaacga 601 ctggctaaaa aaggtgacga gctttaagga gcatttcttc tcaaaaccgt gttagagttt 661 gtgctgtgga gggagctgtt tttctttgct ctcctccctt gcagctcacc ctctccatct 721 tctccgttgt gtgctttcat agctcctgcg ttcattccct gccccgcttc tttctggctt 781 ctctcccagt gtcccatgtc cctggctgag ctcaaatcct actaagtccc ctgttcccgc 841 aggccggacg atagtgaaga cgctgttgcc ctgggattcc gatgaatctc ctgaggcctc 901 ccctggtcct ccaggcccac gccggggggc gggagctggg gggccccggg aggaggtggt 961 ggcccaccca gggcccgagg agcaggactc cctcctgcag cgcaagtcag ctcggcgctg 1021 cgtcaaacag cgaccctcct atgatatctt cgaggattcg gatgactcgg agcccggggg 1081 cccccctgct cctcggcgtc ggaccccccg agaaaatgag ctgccactgc cagaacctga 1141 ggagcagagc cggccccgca aacctaccct gcagcctgtg ttgcagctca aggcccgaag 1201 gcgcctggac aaggatgctt tggcccctgg cccctttgct tcttttccca atggctggac 1261 tggaaagcag aagtctcccg atggtgtgca ccgcgtccgt gtggatttta aggaggattg 1321 tgatttagag aacgtgtggc tgatgggggg cctgagtgtg ctcacctctg tgccaggggg 1381 ccccccgatg gtgtgcttgc tgtgtgccag caaaggactc cacgagctgg tgttctgtca 1441 agtctgctgt gacccattcc acccattctg cctggaggag gccgagcggc ccctgcccca 1501 gcatcacgac acctggtgct gccgtcgctg caaattctgc cacgtctgtg gacgcaaagg 1561 tcgtggatcc aagcacctcc tggagtgcga gcgctgccgc catgcatacc acccggcctg 1621 tctggggccc agctatccaa cccgggccac gcgcaaacgg cgccactgga tctgttcagc 1681 ctgtgtgcgc tgtaagagct gtggggcaac tccaggcaag aactgggacg tcgagtggtc 1741 tggagattac agcctctgcc ccaggtgcac ccagctatat gagaaaggaa actactgccc 1801 gatctgtaca cgctgctatg aagacaacga ctatgagagc aagatgatgc agtgcgcaca 1861 gtgcgatcac tgggtgcatg ccaagtgcga ggggctctca gatgaagact acgagatcct 1921 ttcaggactg ccagactcgg tgctgtacac ctgcggaccg tgtgctgggg cagcgcagcc 1981 ccgctggcga gaggccctga gcggggccct ccaggggggc ctgcgccagg tgctccaggg 2041 cctgctgagc tccaaggtgg tgggcccact gctgctctgc acccagtgtg ggccagatgg 2101 gaagcaactg cacccaggac cctgcggcct gcaagctgtg agtcagcgct tcgaggatgg 2161 ccactacaag tctgtgcaca gcttcatgga ggacatggtg ggcatcctca tgcggcactc 2221 ggaggaggga gagaccccgg accgccgggc tggaggccag atgaaggggc tcctgctgaa 2281 gctgctagaa tctgcgttcg gctggttcga cgcccacgac cccaagtact ggcgacggag 2341 tacccggctg ccaaacggag tccttcccaa tgcggtgttg cccccatccc tggatcatgt 2401 ctatgcgcag tggagacagc aggaaccaga gaccccagaa tcagggcagc ctccagggga 2461 tccctcagca gcattccagg gcaaggatcc ggctgccttc tcacacctgg aggacccccg 2521 tcagtgtgca ctctgcctca aatacgggga tgcagactcc aaggaggcgg ggcggctctt 2581 gtacatcggg cagaacgagt ggacacacgt caactgtgcc atctggtcgg cggaagtctt 2641 cgaggagaac gacggctccc tcaagaatgt gcatgctgct gtggcccgag ggaggcagat 2701 gcgctgcgag ctctgcctga agcctggcgc cacggtgggc tgctgcctgt cctcctgcct 2761 cagcaacttc cacttcatgt gtgcccgggc cagctactgc atcttccagg atgacaagaa 2821 agtcttctgc cagaaacaca ctgatctcct ggatggcaag gaaattgtga accccgatgg 2881 ttttgatgtt ctccgccgag tctatgtgga cttcgagggc atcaacttca agcggaagtt 2941 cttgacgggg cttgaacccg atgccatcaa cgtgctcatt ggttccatcc gcattgactc 3001 cctgggtact ctgtctgatc tctcggactg cgagggacgg ctcttcccca ttggctacca 3061 gtgctcccgt ctgtactgga gcacagtgga tgctcggagg cgctgctggt atcggtgccg 3121 aattctggag tatcggccat gggggccgag ggaagagcca gctcacctgg aggctgcaga 3181 ggagaaccag accattgtgc acagccccgc cccttcctca gagcccccag gtggtgagga 3241 ccccccactg gacacagatg ttcttgtccc tggagctcct gagcgccact cgcccattca 3301 gaacctggac cctccactgc ggccagattc aggcagcgcc cctcctccag ccccccgttc 3361 tttttcgggg gctcgaatca aagtgcccaa ctactcgcca tcccggaggc ccttgggggg 3421 tgtctccttt ggccccctgc cctcccctgg aagtccatct tcactgaccc accacatccc 3481 cacagtggga gacccggact tcccagctcc ccccagacgt tcccgtcgtc ccagcccttt 3541 ggctcccagg ccgcctccat cacggtgggc ctcccctcct ctaaaaacct cccctcagct 3601 cagggtgccc cctcctacct cagtcgtcac agccctcaca cctacctcag gggagctggc 3661 tccccctggc ccggccccat ctccaccacc ccctgaagac ctgggcccag acttcgagga 3721 catggaggtg gtgtcaggac tgagtgctgc tgacctggac ttcgcggcca gcctgctggg 3781 gactgagccc ttccaggaag agattgtagc cgctggggcc atggggagca gccacggggg 3841 cccgggggac agctccgagg aggagtccag ccccacctcc cgctacatcc acttccctgt 3901 gactgtggtg tccgcccctg gtctggcccc cagcgctacc cctggagccc cccgcattga 3961 acagctggac ggcgtggacg acggcactga cagtgaggct gaggcggtgc agcagcctcg 4021 gggccagggc acgcctcctt cggggccagg agtagtccgg gcaggggtcc ttggggctgc 4081 aggggacagg gcccggcctc ctgaggacct gccatcggaa attgtggatt ttgtgttgaa 4141 gaacctaggg ggtcctgggg atggaggtgc tggccctaga gaggagtcac tccccccggc 4201 gcctcccctg gctaatggca gccagccctc ccaaggcctg accgccagcc cagctgaccc 4261 cacccgcaca tttgcctggc tcccaggggc cccaggggtc cgggtgttaa gccttggccc 4321 tgcccctgag ccccccaaac ccgccacatc caaaatcata cttgtcaaca agctggggca 4381 agtatttgtg aagatggctg gggagggtga acctgtccca cccccagtga agcagccacc 4441 tttgcccccc accatttccc ccacggctcc cacctcctgg actctgcccc caggccccct 4501 cctcggcgtg ctgcccgtgg tcggagtggt ccgccctgcc ccgcccccgc caccccctcc 4561 cctgacgctg gtgctgagca gtgggccagc cagcccgccc cgccaggcca tccgcgtcaa 4621 gagggtgtcc actttctccg gccggtcccc gccagcacct cccccataca aagccccccg 4681 gctggatgaa gatggagagg cctcagagga tacccctcag gttccagggc ttggcagtgg 4741 cgggtttagc cgtgtgagga tgaaaacccc cacagtgcgt ggggtccttg acctggatcg 4801 gcctggggag cccgctgggg aagaaagtcc tgggcccctc caggaacggt cccctttgct 4861 gccacttccg gaagatggtc ctccccaggt ccccgatggt cccccagacc tgctgcttga 4921 gtcccagtgg caccactatt caggtgaggc ttcgagctct gaggaagagc ctccatcccc 4981 agatgataaa gagaaccagg ccccaaaacg gactggccca catctgcgct tcgagatcag 5041 cagtgaggat gggttcagcg ttgaggcaga gagcttggag ggggcgtgga gaactctgat 5101 cgagaaagtg caagaggccc gagggcatgc ccgactcaga catctctctt ttagtggaat 5161 gagtggggcg agactcctgg gcatccacca tgatgctgtc atcttcctgg ccgagcagct 5221 ccccggagcc cagcgttgcc agcactataa gttccgttac caccagcagg gagagggcca 5281 ggaggagccg cccctgaatc cccatggggc tgctcgggca gaggtctatc tccggaagtg 5341 cacctttgac atgttcaact tcctggcctc ccagcaccgg gtgctccctg agggggccac 5401 ctgtgatgag gaagaggatg aggtgcagct caggtcaacc agacgtgcca ccagcctgga 5461 gctgcccatg gccatgcgtt ttcgtcacct taagaagacg tccaaagaag ctgtgggtgt 5521 ctacagatca gccatccacg ggcgaggcct gttctgtaag cgcaacatcg acgcggggga 5581 gatggtcatc gagtactctg gcattgtcat ccgctcggtg ttgactgaca agcgggagaa 5641 gttctacgat gggaagggca tcgggtgcta tatgttccgc atggatgact ttgatgtagt 5701 ggacgccacg atgcatggca atgccgcccg cttcatcaac cactcctgtg agcccaactg 5761 cttctctcgg gtcatccacg tggagggcca gaaacacatt gttatcttcg ccctgcgccg 5821 catcctgcgt ggtgaggagc tcacctacga ctacaagttc cccatcgagg atgccagcaa 5881 caagctgccc tgcaactgtg gcgccaagcg ctgccgtcgg ttccttaact gaggccgtgg 5941 ctgcccacca cgacccctca cacctcctgc tgccgtcgct gccatcttgc ccctagcctg 6001 ggggctccct agcccctccc agagcatctc acccccaccc tcatgttcag ggtggatgtg 6061 ggcatgcagg tgacaagggc cctgcctcca cccctccagc ccatccagca atcgccccct 6121 ttctgccctg ggggcccagg atgtagatat tgtacaaagg tttctaaatc ccttcttttc 6181 tatgcacttt tttatttaag aggtggggtc ccaggtggga acccccccac aataaagtct 6241 gtcaatgttt gg // LOCUS AB002303 6632 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0305 gene, complete cds. ACCESSION AB002303 NID g2224550 KEYWORDS KIAA0305. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG0042. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6632) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..6632 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG0042" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 249..4868 /gene="KIAA0305" CDS 249..4868 /gene="KIAA0305" /codon_start=1 /db_xref="PID:d1021601" /db_xref="PID:g2224551" /translation="MDSYFKAAVSDLDKLLDDFEQNPDEQDYLQDVQNAYDSNHCSVS SELASSQRTSLLPKDQECVNSCASSETSYGTNESSLNEKTLKGLTSIQNEKNVTGLDL LSSVDGGTSDEIQPLYMGRCSKPICDLISDMGNLVHATNSEEDIKKLLPDDFKSNADS LIGLDLSSVSDTPCVSSTDHDSDTVREQQNDTSSELQNREIGGIKELGIKVDTTLSDS YNYSGTENLKDKKIFNQLESIVDFNMSSALTRQSSKMFHAKDKLQHKSQPCGLLKDVG LVKEEVDVAVITAAECLKEEGKTSALTCSLPKNEDLCLNDSNSRDENFKLPDFSFQED KTVIKQSAQEDSKSLDLKDNDVIQDSSSALHVSSKDVPSSLSCLPASGSMCGSLIESK ARGDFLPQHEHKDNIQDAVTIHEEIQNSVVLGGEPFKENDLLKQEKCKSILLQSLIEG MEDRKIDPDQTVIRAESLDGGDTSSTVVESQEGLSGTHVPESSDCCEGFINTFSSNDM DGQDLDYFNIDEGAKSGPLISDAELDAFLTEQYLQTTNIKSFEENVNDSKSQMNQIDM KGLDDGNINNIYFNAEAGAIGESHGINIICETVDKQNTIENGLSLGEKSTIPVQQGLP TSKSEITNQLSVSDINSQSVGGARPKQLFSLPSRTRSSKDLNKPDVPDTIESEPSTAD TVVPITCAIDSTADPQVSFNSNYIDIESNSEGGSSFVTANEDSVPENTCKEGLVLGQK QPTWVPDSEAPNCMNCQVKFTFTKRRHHCRACGKVFCGVCCNRKCKLQYLEKEARVCV VCYETISKAQAFERMMSPTGSNLKSNHSDECTTVQPPQENQTSSIPSPATLPVSALKQ PGVEGLCSKEQKRVWFADGILPNGEVADTTKLSSGSKRCSEDFSPLSPDVPMTVNTVD HSHSTTVEKPNNETGDITRNEIIQSPISQVPSVEKLSMNTGNEGLPTSGSFTLDDDVF AETEEPSSPTGVLVNSNLPIASISDYRLLCDINKYVCNKISLLPNDEDSLPPLLVASG EKGSVPVVEEHPSHEQIILLLEGEGFHPVTFVLNANLLVNVKFIFYSSDKYWYFSTNG LHGLGQAEIIILLLCLPNEDTIPKDIFRLFITIYKDALKGKYIENLDNITFTESFLSS KDHGGFLFITPTFQKLDDLSLPSNPFLCGILIQKLEIPWAKVFPMRLMLRLGAEYKAY PAPLTSIRGRKPLFGEIGHTIMNLLVDLRNYQYTLHNIDQLLIHMEMGKSCIKIPRKK YSDVMKVLNSSNEHVISIGASFSTEADSHLVCIQNDGIYETQANSATGHPRKVTGASF VVFNGALKTSSGFLAKSSIVEDGLMVQITPETMNGLRLALREQKDFKITCGKVDAVDL REYVDICWVDAEEKGNKGVISSVDGISLQGFPSEKIKLEADFETDEKIVKCTEVFYFL KDQDLSILSTSYQFAKEIAMACSAALCPHLKTLKSNGMNKIGLRVSIDTDMVEFQAGS EGQLLPQHYLNDLDSALIPVIHGGTSNSSLPLEIELVFFIIEHLF" BASE COUNT 2198 a 1124 c 1252 g 2058 t ORIGIN 1 actcccggcc ggggtagctc ttcactcctc agcgcgacgt cgtgtcgagt tcccaaaaag 61 ctccgcaggg gctgtaggga ggtgatctca tccattaaca gctgtgtgtt gccagttccc 121 aaatctttat ctatctcaga cttctctcct gcattccaga ttcttatatt cagctgcctt 181 ttggatatct ctcccaggat gttctcaagg catacaagaa ttaaattctg aataagtctg 241 caggtaggat ggacagttat tttaaagcag ctgtcagtga cttggacaaa ctccttgatg 301 attttgaaca gaacccagat gaacaagatt atctccaaga tgtacaaaat gcatatgatt 361 ctaaccactg ctcagtttct tcagagttgg cttcctcaca gcgaacttca ttgctcccaa 421 aagaccaaga gtgcgttaat agttgtgcct catcagaaac aagctatgga acaaatgaga 481 gttccctgaa tgaaaaaaca ctcaagggac ttacttctat acaaaatgaa aaaaatgtaa 541 caggacttga tcttctttct tctgtggatg gtggtacttc agatgaaatc cagccgttat 601 atatgggacg atgtagtaaa cctatctgtg atctgataag tgacatgggt aacttagttc 661 atgcaaccaa tagtgaagaa gatattaaaa aattattgcc agatgatttt aagtctaatg 721 cagattcctt gattggattg gatttatctt cagtgtcaga tactccctgt gtttcttcaa 781 cagaccatga tagtgatact gtcagagaac aacagaatga taccagttct gaattacaaa 841 atagagaaat cggaggaatc aaagaattgg gtataaaagt agatacaaca ctttcagatt 901 cctataatta cagtggaaca gaaaatttaa aagataaaaa gatctttaat cagttagaat 961 caattgttga ttttaacatg tcatctgctt tgactcgaca aagttccaaa atgtttcatg 1021 ccaaagacaa gctacaacac aagagccagc catgtggatt actaaaagat gttggcttag 1081 taaaagagga agtagatgtg gcagtcataa ctgccgcaga atgtttaaaa gaagagggca 1141 agacaagtgc tttgacctgc agccttccga aaaatgaaga tttatgctta aatgattcaa 1201 attcaagaga tgaaaatttc aaattacctg acttttcctt tcaggaagat aagactgtta 1261 taaaacaatc tgcacaagaa gactcaaaaa gtttagacct taaggataat gatgtaatcc 1321 aagattcctc ttcagcttta catgtttcca gtaaagatgt gccgtcctca ttgtcctgtc 1381 ttcctgcgtc tgggtctatg tgtggatcat taattgaaag taaagcacgg ggtgattttt 1441 tacctcagca tgaacataaa gataatatac aagatgcagt gactatacat gaagaaatac 1501 agaacagtgt tgttctaggt ggggaaccat tcaaagagaa tgatcttttg aaacaggaaa 1561 aatgtaaaag catactcctt cagtcattaa ttgaagggat ggaagacaga aagatagatc 1621 ctgaccagac agtaatcaga gctgagtctt tggatggtgg tgacaccagt tctacagttg 1681 tagaatctca agaggggctt tctggcactc atgtcccaga gtcttctgat tgttgtgaag 1741 gttttattaa tactttttca agcaatgata tggatgggca agacttagat tactttaata 1801 ttgatgaagg cgcaaaaagt ggcccactaa ttagtgatgc tgaacttgat gcctttctga 1861 cagaacagta tcttcagacc actaacataa agtcttttga agaaaatgta aatgactcta 1921 aatcgcaaat gaatcagata gatatgaaag gcttagatga tggaaacatc aataatatat 1981 atttcaatgc agaagcagga gctattgggg aaagtcatgg tattaatata atttgtgaaa 2041 cagttgataa acaaaataca atagaaaatg gcctttcttt aggagaaaaa agcactattc 2101 cagttcaaca agggttacct accagtaagt ctgagattac aaatcaatta tcagtctctg 2161 atattaacag tcaatctgtt ggaggggcca gacctaagca attgtttagc cttccatcaa 2221 gaacaaggag ttcaaaggac ctgaataagc cagatgttcc agatacaata gaaagtgaac 2281 ccagcacagc agataccgtt gttccaatca cttgtgctat agattctaca gctgatccac 2341 aggttagctt caactctaat tacattgata tagaaagtaa ttctgaaggt ggatctagtt 2401 tcgtaactgc aaatgaagat tctgtacctg aaaacacttg caaagaaggc ttggttttgg 2461 gccagaaaca gcctacttgg gttcctgatt cagaagctcc aaactgtatg aactgccaag 2521 tcaaatttac ttttaccaaa cggcgacacc attgccgagc atgtgggaaa gtattttgtg 2581 gtgtctgttg taataggaag tgtaaactgc aatatctaga aaaggaagca agagtatgtg 2641 tagtctgcta tgaaactatt agtaaagctc aggcatttga aaggatgatg agtccaactg 2701 gttctaatct taagtctaat cattctgatg aatgtactac tgtccagcct cctcaggaga 2761 accaaacatc cagtatacct tcaccagcaa ctttgccagt ctcagcactt aaacaaccag 2821 gtgttgaagg actatgttcc aaagaacaga agagagtatg gtttgcagat ggtatattgc 2881 ccaatggtga agttgcagat acaacaaaat tatcatctgg aagtaaaaga tgttctgaag 2941 actttagtcc tctctcacct gatgtgccta tgacagtaaa cacagtggat cattcccatt 3001 ctactacagt ggaaaagcca aacaatgaga caggagatat tacaagaaat gagataattc 3061 agagtcctat ttctcaggtt ccatcagtgg aaaaattgtc tatgaacaca ggaaatgagg 3121 ggttacctac ttctggttca tttacactag atgatgatgt ttttgcagaa actgaagaac 3181 catctagtcc tactggtgtc ttagttaaca gcaatttacc tattgctagt atttcagatt 3241 ataggttact gtgtgatatt aacaagtatg tctgcaataa gattagtctt ctacctaatg 3301 atgaggacag tttgccccca cttctggttg catctggaga aaagggatca gtgcctgtag 3361 tagaagaaca tccatctcat gagcagatca ttttgcttct tgaaggtgaa ggctttcatc 3421 ctgttacatt tgtcctaaat gctaatctac tcgtgaatgt caaattcata ttttattcct 3481 cagacaaata ttggtacttt tcaaccaatg gattgcatgg cttgggacag gcagaaatta 3541 ttattctatt gttatgtttg ccaaatgaag atactattcc taaggacatc ttcagactat 3601 ttatcaccat atataaggat gctctaaaag gaaaatacat agaaaacttg gacaatatta 3661 cctttactga gagttttctc agtagcaagg atcacggagg attcctgttt attacaccta 3721 cttttcagaa acttgatgat ctctcattac caagtaatcc ttttctttgt ggaattctta 3781 tccagaagct tgagattccc tgggcaaagg tttttcctat gcgtttaatg ttgagattgg 3841 gtgcagaata taaagcatat cctgctcctc taacaagcat cagaggccga aaacctcttt 3901 ttggagaaat aggacacact attatgaact tacttgttga ccttcgaaat taccagtata 3961 ccttgcataa tatagatcaa ctgttgattc atatggaaat gggaaaaagc tgcataaaaa 4021 taccacggaa aaagtacagt gatgtaatga aagtactaaa ttcttccaat gagcatgtca 4081 ttagcattgg agcaagtttc agtacagaag cagattctca tctagtctgt atacagaatg 4141 atggaattta tgaaacacag gccaacagtg ccactggcca tcctagaaaa gtgacaggtg 4201 caagttttgt ggtattcaat ggagctctaa aaacatcttc aggatttctt gctaagtcca 4261 gcatagttga agatggctta atggtacaaa taactccaga gaccatgaat ggcttgcggc 4321 tagctttacg agaacagaaa gactttaaaa ttacatgtgg gaaagttgat gcagtagacc 4381 tgagagaata cgtggatatc tgctgggtag atgctgaaga aaaaggaaac aaaggagtta 4441 tcagttcagt ggatggaata tcattacaag gatttccaag tgaaaaaata aaactggaag 4501 cagattttga aaccgatgag aagattgtaa aatgtaccga ggtgttctac tttctaaagg 4561 accaggattt atctatttta tcaacttctt atcagtttgc aaaagaaata gccatggctt 4621 gtagtgctgc gctgtgccct cacctgaaaa ctctaaaaag taatgggatg aataaaattg 4681 gactcagagt ttccattgac actgatatgg ttgaatttca ggcaggatct gaaggccaac 4741 ttctgcctca gcattatcta aatgatcttg atagtgctct gatacctgtg atccatggtg 4801 ggacctccaa ctctagttta ccattagaaa tagaattagt gtttttcatt atagaacatc 4861 ttttttagtg aaagaatgtg ccatattaca tattgcaacc taatttgtta aaactaactc 4921 cagcactaaa gctgaaatgc cacaaacact aaaagtataa atatgtctga tttttgaaac 4981 acataagctt tgctctttag gcaggaatga tcttttcaaa tcattagcac aatatttaaa 5041 tatctaaaaa tttaagagat ccatactttc tgtagcttta caattaattt aagtactaaa 5101 aagacaagga tttcttttaa gaaatttata gcatttactg tgttatttaa atgctaagcc 5161 aaagtatctg cacttaggta tacctcttta tgccaataat gattttaatg aaggctcttt 5221 tcagatgtaa ccttatgaag gaaatatctg ctttgtgtat atgccagtta gaatactggt 5281 ttctaaagtc tgtcaaattg tatttcagtg gcacaaaaac cagttttgag gtcttagact 5341 tataattctt tgaataaaac tgataactta tttgtataat tggagtggag acctacctcc 5401 ataattagat aaactctttt tggattataa tcagaatttt gccttttttc ttctcaaatt 5461 attacatatg tatgtattat atatccacat atatagtttt ccctgattaa atggatatta 5521 aaataattgc gggtgcttca ggactttttg cttctatatt taagtatatt gtttttatag 5581 caagaacata ttctgaatgt tttataaatc tttaataatt tatatgtagg taatattttt 5641 gtatcacaat gcattatttt ttttcctcct ttccttccaa actataccac tgtatttacc 5701 acttctaaga gtgactgacg acgggccaga tgacccttga agtagtcatt atgtagcaat 5761 aaatgaagcc tgaaacaggt ttttttactt ccactttaat ccttagaaat ttcttggcaa 5821 cttcgcatat tttcattgac actggtgtat aagtataaat ttaaatgaac taattacttt 5881 tgcatatttt aaattcttta tatggtagtt attttttata acaggatatt aacataagtt 5941 aaatcctatg tatttgaaat tgttacagag ctttcctctt tacttcaaac agcaaaaaag 6001 tggggggcat attgtagtcc tgtcatttaa gttatgtaaa aaatttaatc attattttga 6061 tgctttaaac attctcatgt gtaatatatg tttttgtatc aaaaacactc atatatttca 6121 agaaaaagaa attatgttaa atagccctgt tttaagaaaa atatttatga agcatctcaa 6181 cttgaagatc aagtcaaagt tataactcag gatctgaggt ctcaagctag gagagactga 6241 gaattttaat cagtttgggc atatagtttg gactgaatca catctgtagt acttagccaa 6301 agacaatttg gaggagaata tcagccttct ggaagtagct acttcctgaa caatgtaaag 6361 tgtcgcagat attcaataaa atggcaacct gttataattt gtgaaattta ttgaaatggt 6421 gtaagatgaa aacaattgca tatcaaaccc aatttatgtt ttctaaatat agtgtatgta 6481 ttctgccatg taagtaattg aacagtctta aaataaccaa atggtagagg gctgttccat 6541 gatgggacag ctttggattt gttttcataa aatctctaca ttcaataaaa attggaatta 6601 tgtgcctgaa gtttggaggc acattttgaa gt // LOCUS AB002308 5955 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0310 gene, complete cds. ACCESSION AB002308 NID g2224560 KEYWORDS KIAA0310. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG0111. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5955) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..5955 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG0111" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 1656..4301 /gene="KIAA0310" CDS 1656..4301 /gene="KIAA0310" /codon_start=1 /db_xref="PID:d1021606" /db_xref="PID:g2224561" /translation="MEQVSSRPTSPEKFSVPHVCARFGPGGQLIKVIPNLPSEGQPAL VEVHSMEALLQHTSEQEEMRAFPGPLAKDDTHKVDVINFAQNKAMKCLQNENLIDKES ASLLWNFIVLLCRQNGTVVGTDIAELLLRDHRTVWLPGKSPNEANLIDFTNEAVEQVE EEESGEAQLSFLTGGPAAAASSLERETERFRELLLYGRKKDALESAMKNGLWGHALLL ASKMDSRTHARVMTRFANSLPINDPLQTVYQLMSGRMPAASTCCGDEKWGDWRPHLAM VLSNLNNNMDVESRTMATMGDTLASRGLLDAAHFCYLMAQAGFGVYTKKTTKLVLIGS NHSLPFLKFATNEAIQRTEAYEYAQSLGAETCPLPSFQVFKFIYSCRLAEMGLATQAF HYCEAIAKSILTQPHLYSPVLISQLVQMASQLRLFDPQLKEKPEEESLAAPTWLVHLQ QVERQIKEGAGVWHQDGALPQQCPGTPSSEMEQLDRPGLSQPGALGIANPLLAVPAPS PEHSSPSVRLLPSAPQTLPDGPLASPARVPMFPVPLPPGPLEPGPGCVTPGPALGFLE PSGPGLPPGVPPLQERRHLLQEARSPDPGIVPQEAPVGNSLSELSEENFDGKFANLTP SRTVPDSEAPPGWDRADSGPTQPPLSLSPAPETKRPGQAAKKETKEPKKGESWFFRWL PGKKKTEAYLPDDKNKSIVWDEKKNQWVNLNEPEEEKKAPPPPPTSMPKTVQAAPPAL PGPPGAPVNMYSRRAAGTRARYVDVLNPSGTQRSEPALAPADFVAPLAPLPIPSNLFV PTPDAEEPQLPDGTGREGPAAARGLANPEPAPEPKAPGDLPAAGGPPSGAMPFYNPAQ LAQACATSGSSRLGRIGQRKHLVLN" BASE COUNT 1331 a 1726 c 1589 g 1309 t ORIGIN 1 ggggagaaca cttctttgtc tgggattcca accagctctg tccttagctt gtctctgcct 61 agcagtgttg cccaaagtaa ttttccacaa ggttctggtg cttccgaaat ggtttctaat 121 cagcctgcta atttgctggt tcaaccacca tcccagccag ttccagagaa cttggttcca 181 gaaagtcaaa aggatcgtaa ggcaggaagt gctcttcccg gatttgctaa tagccctgct 241 ggaagcacaa gtgtggtgtt agttccacct gcacacggca ccctggtgcc tgatggtaat 301 aaggcaaacc attccagtca tcaggaagac acttacggag ccctagactt tgccttaagc 361 aggactttgg aaaatcctgt aaacgtgtac aacccgtccc attctgacag cctcgcttct 421 cagcaaagtg ttgccagtca tcccagacaa tctgggcctg gggcgcctaa ccttgaccgt 481 ttttatcagc aggtcacgaa agatgcccag ggccagcctg gcctcgaaag agcccagcag 541 gagctggcgc caccccagca acaggcttct cccccacaac tacccaaagc catgttttcg 601 gagctgtcaa atccagaaag tctgcccgca cagggacagg cccagaactc agcacagtca 661 ccagcaagtc tggttctggt cgacgcgggt cagcagctgc cccctcggcc tcctcagtcc 721 tctagcgtgt ctctggtgtc cagtggctcc ggccaggcag ctgtgccgtc agagcagccg 781 tggccacagc cagtgcctgc acttgccccc ggcccaccgc ctcaggacct ggccgcctac 841 tactactacc ggcctttgta cgatgcctac cagcctcagt actctttgcc gtacccaccg 901 gagcctggcg cagcctccct ctattaccag gatgtctaca gcctctatga gcctcgatac 961 aggccctatg atggtgctgc gtctgcttac gcccagaact accgctatcc cgagcccgag 1021 cggcccagct cccgagccag ccactcctcg gaacggccac ctcccaggca aggatatcct 1081 gaaggatact atagttccaa aagtggatgg agcagtcaga gcgattacta tgcaagctat 1141 tactccagcc agtacgatta tggagatcca ggtcactggg atcgttacca ctacagtgct 1201 agagtcaggg acccccgcac ctatgaccgg aggtattggt gtgatgcaga gtatgacgca 1261 tacaggagag agcactctgc cttcggggac aggcccgaga aacgtgacaa caactggagg 1321 tacgatcctc gcttcacggg gagttttgac gatgacccca tccgcacaga gacccttatg 1381 gggaagaggt ggaccggcgc agcgtccaca gcgagcactc ggcacggagc ctgcacagcg 1441 cacacagcct ggccagccgc cgcagcagcc tcagctccca ctcgcaccag agtcagattt 1501 acagaagcca caatgtggct gccggttcct acgaggcccc gcttcctcca ggctcctttc 1561 acggcgattt tgcctacggc acctaccgca gcaatttcag cagtggcccc ggcttcccag 1621 agtatggcta ccctgccgac accgtctggc ctgccatgga gcaagtttca tcaagaccaa 1681 cttctcctga aaaattttca gtgcctcatg tctgtgccag gtttggccct ggcggtcagc 1741 ttatcaaagt gattcccaat ctgccttcag aaggacagcc ggccttggtg gaggtccaca 1801 gcatggaggc cttgctgcag cacacgtctg agcaggagga gatgcgggcg ttcccgggac 1861 ccctggccaa agacgacacc cataaggtgg atgtcattaa ttttgcacag aacaaagcta 1921 tgaaatgttt gcagaatgaa aacttaattg acaaagagtc tgcaagtctt ctttggaatt 1981 ttattgttct cttatgcaga caaaatggga ccgtggtagg gaccgacatt gcggagcttc 2041 tgttacgaga ccacagaaca gtgtggcttc ctgggaagtc gcccaatgaa gcaaacctga 2101 ttgatttcac gaatgaggca gtggagcagg tggaagagga ggagtctggt gaggcccagc 2161 tctctttcct cactggtggt ccggcggctg ccgccagctc gctcgagaga gagaccgaga 2221 ggttcaggga gctgttgctg tatggccgta agaaggatgc tttggagtct gcaatgaaga 2281 atggcctgtg gggtcacgct ctgctacttg caagtaagat ggacagccgg acacacgccc 2341 gagtcatgac caggtttgct aacagcctcc caatcaacga ccctctgcag acagtctacc 2401 agctcatgtc cggacggatg cctgccgcgt ccacgtgctg tggagacgag aaatggggag 2461 attggaggcc gcacctcgcc atggtcttgt ccaacttgaa caacaacatg gacgtcgagt 2521 ccaggacgat ggctaccatg ggcgacactc tggcttcaag gggcctcttg gatgcggccc 2581 acttctgcta cctcatggcc caggcgggat ttggtgttta cacgaagaaa actacaaagc 2641 ttgtcttaat cggatccaat cacagtttgc cattcttaaa gttcgcaacc aacgaagcaa 2701 tccagaggac ggaagcctat gagtacgccc agtccctggg tgccgagacc tgccccctgc 2761 ctagtttcca ggtgtttaag ttcatctact cctgccgcct ggcggaaatg gggctggcca 2821 cgcaagcctt ccactactgt gaggccatcg cgaagagcat cctgacgcag ccgcacctgt 2881 attccccggt gttgatcagc cagcttgtgc agatggcttc ccagttacga ctcttcgatc 2941 cccagctgaa agagaagcca gaagaggagt ccttggccgc acccacgtgg ctggttcacc 3001 tgcagcaggt ggagcggcag attaaggagg gggctggagt atggcatcag gatggagccc 3061 tcccgcagca gtgtcctggc actccgagtt ccgagatgga gcagttggac aggccaggac 3121 tcagtcagcc aggagccctg gggatcgcca accctctgct ggcggtgcct gcaccgagcc 3181 ctgagcactc gagcccgagc gtgcggctgc tgccctcagc tccgcagacg ctccctgacg 3241 gcccattggc cagtcctgcc agagtgccga tgttcccagt gccactgccc ccggggcccc 3301 tggagccggg tcctggctgt gtgaccccag ggcctgcact tggcttcctg gagccctccg 3361 ggcctggcct cccacctggt gtgccacctc tgcaggaaag gagacacttg ctccaggaag 3421 ccaggagccc agacccaggg atagtgccgc aggaggcgcc tgttggaaac tcactttccg 3481 agctaagcga agaaaatttt gatggaaaat ttgctaatct gaccccctcg aggacggtgc 3541 cagactcgga ggccccccca gggtgggatc gtgccgactc gggtcccacg cagccacctc 3601 tgtctctctc acccgctccc gaaacaaaga gacccggaca ggcagccaag aaagaaacga 3661 aggaacctaa gaagggtgaa tcctggttct ttcgttggct acctggaaag aaaaagacag 3721 aagcttattt gccagatgac aagaacaaat cgattgtttg ggatgaaaag aaaaaccagt 3781 gggtgaattt aaatgagcca gaagaggaga agaaagcccc gcccccacct ccaacctcga 3841 tgcccaagac tgtgcaagct gccccgcctg ccctcccagg gcctcctgga gcccccgtga 3901 acatgtactc tagaagagca gcaggaacca gagctcgcta cgttgacgtc ctgaacccaa 3961 gcgggaccca gcggagcgag ccggctctcg ctcctgcgga ctttgtcgct ccactcgcgc 4021 cactcccaat tccttctaac ttgttcgtgc caaccccaga tgcagaagaa ccacagcttc 4081 cagacgggac tggcagggaa gggcctgcag cagctagggg cctggccaat ccagagcctg 4141 ccccagagcc caaggctcct ggcgacctcc ctgctgcagg gggccctccc agcggggcca 4201 tgcccttcta caaccctgct cagctggcac aggcctgcgc cacctccggg agctcaaggc 4261 tagggaggat tggccagagg aagcacctgg tgctgaacta ggcttgccct gctgtgaact 4321 tgcacttgga gccctgacgc tgctgttctc cccgaagaac ccgaccgacc tccgcgatct 4381 ccgtcccgcc cccagggaga cacagcagtg actcagagct ggtcgcacac tgtgcctccc 4441 tcctcaccgc ccatcgtaat gaattatttt gaaaattaat tccaccatcc tttcagattc 4501 tggatggaaa gactgaatct ttgactcaga attgtttgcc gaaaagaatg atgtgacttt 4561 cttagtcatt taggatgatt taaggatata gtattcctgg tcatttaaga atgttcattc 4621 attgaagccg gagctgtctc tgccacggga gagccacatg gtcggtagta accagggcct 4681 ctccaagccc agctgtgagt cactgcccag tgagtcccgc gcttccttta aggtgctggg 4741 agcaaagaga gggtgactga ggcagacccc aacccctgct ctgcaccatc tgggccctcg 4801 ccgtgtttga acctggctga atgagtggag ggcgctgtgt tctcaatcag cgcctccgag 4861 gagccgtggg gttccttcgg cattagttca cggtttttga gagaggccct agttactgca 4921 gtgaatttct ttcctgttgc agagacgctt ccagcctcac tttactttct gtggcctgat 4981 gaggaccatg ggtgattttg tgtacccaaa gcgctgggga ctgcccaccg tgtggcccag 5041 tcactgggaa ggagccccag agagccggct gtctgacatg atggctcagg gtggtcatcc 5101 aggttgaaaa ctgaccgtgt gatgtttgat ttgggcttca tttcgtgtgt aggagcacgg 5161 ttagactcac tgttaaggaa gctggatgca cttctctaaa aggctgcact ttccgtgagc 5221 acttttcgtg gtacaatcca catgacccac tttctcccct gggggacgtt ggttcagagg 5281 ttggtagcac ttggggagag tatcttaaca cagtttcttg acagcagctc tggaacttag 5341 tatttctgcc ccgagttttg ccacactgag actttgagta gctcctggtg gactcaaccc 5401 tgttcaactc agagacgggc ctcctctcac tgatgcaaag ctttaaggct tctctgactg 5461 ttctgaaact cttcgtattc ttgtcaagtc taaagagact gaagaaaaga tttaaatact 5521 aataaaaatc agtagataat ttctgtaggt tctgctggag gaatacaaac tgtttggtgt 5581 tttaaattta agtgtagaaa ttgtagaatg tggaattagc acagatcctt cctggctttc 5641 tgtttcactt gatcatttag cccagaccac ccaggatgtt ttccaaaatg ttccacaggc 5701 gtgtcccgct ggatccattt gtccttgtca cttggagaaa ggccagtccc tgtgacgggg 5761 cagccctctc tgtccctcgg tcagctcgtg tgaatcctgg gacctcttcc ggtcggctct 5821 gcccgctgtt ctggggtcga ctgccacgac ttttgattca agaagcttcc tccaggcggg 5881 agcggctatt tttcctaaat gagaattgtt acattgcaaa ttgttgaata aaatattttg 5941 cgctccttca agcac // LOCUS AB002311 6568 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0313 gene, complete cds. ACCESSION AB002311 NID g2224566 KEYWORDS KIAA0313. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG0186. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6568) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..6568 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG0186" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 63..4562 /gene="KIAA0313" CDS 63..4562 /gene="KIAA0313" /codon_start=1 /db_xref="PID:d1021609" /db_xref="PID:g2224567" /translation="MKPLAIPANHGVMGQQEKHSLPADFTKLHLTDSLHPQVTHVSSS HSGCSITSDSGSSSLSDIYQATESEAGDMDLSGLPETAVDSEDDDDEEDIERASDPLM SRDIVRDCLEKDPIDRTDDDIEQLLEFMHQLPAFANMTMSVRRELCAVMVFAVVERAG TIVLNDGEELDSWSVILNGSVEVTYPDGKAEILCMGNSFGVSPTMDKEYMKGVMRTKV DDCQFVCIAQQDYCRILNQVEKNMQKVEEEGEIVMVKEHRELDRTGTRKGHIVIKGTS ERLTMHLVEEHSVVDPTFIEDFLLTYRTFLSSPMEVGKKLLEWFNDPSLRDKVTRVVL LWVNNHFNDFEGDPAMTRFLEEFENNLEREKMGGHLRLLNIACAAKAKRRLMTLTKPS REAPLPFILLGGSEKGFGIFVDSVDSGSKATEAGLKRGDQILEVNGQNFENIQLSKAM EILRNNTHLSITVKTNLFVFKELLTRLSEEKRNGAPHLPKIGDIKKASRYSIPDLAVD VEQVIGLEKVNKKSKANTVGGRNKLKKILDKTRISILPQKPYNDIGIGQSQDDSIVGL RQTKHIPTALPVSGTLSSSNPDLLQSHHRILDFSATPDLPDQVLRVFKADQQSRYIMI SKDTTAKEVVIQAIREFAVTATPDQYSLCEVSVTPEGVIKQRRLPDQLSKLADRIQLS GRYYLKNNMETETLCSDEDAQELLRESQISLLQLSTVEVATQLSMRNFELFRNIEPTE YIDDLFKLRSKTSCANLKRFEEVINQETFWVASEILRETNQLKRMKIIKHFIKIALHC RECKNFNSMFAIISGLNLAPVARLRTTWEKLPNKYEKLFQDLQDLFDPSRNMAKYRNV LNSQNLQPPIIPLFPVIKKDLTFLHEGNDSKVDGLVNFEKLRMIAKEIRHVGRMASVN MDPALMFRTRKKKWRSLGSLSQGSTNATVLDVAQTGGHKKRVRRSSFLNAKKLYEDAQ MARKVKQYLSNLELEMDEESLQTLSLQCEPATNTLPKNPGDKKPVKSETSPVAPRAGS QQKAQSLPQPQQQPPPAHKINQGLQVPAVSLYPSRKKVPVKDLPPFGINSPQALKKIL SLSEEGSLERHKKQAEDTISNASSQLSSPPTSPQSSPRKGYTLAPSGTVDNFSDSGHS EISSRSSIVSNSSFDSVPVSLHDERRQRHSVSIVETNLGMGRMERRTMIEPDQYSLGS YAPMSEGRGLYATATVISSPSTEELSQDQGDRASLDAADSGRGSWTSCSSGSHDNIQT IQHQRSWETLPFGHTHFDYSGDPAGLWASSSHMDQIMFSDHSTKYNRQNQSRESLEQA QSRASWASSTGYWGEDSEGDTGTIKRRGGKDVSIEAESSSLTSVTTEETKPVPMPAHI AVASSTTKGLIARKEGRYREPPPTPPGYIGIPITDFPEGHSHPARKPPDYNVALQRSR MVARSSDTAGPSSVQQPHGHPTSSRPVNKPQWHKPNESDPRLAPYQSQGFSTEEDEDE QVSAV" BASE COUNT 1974 a 1400 c 1463 g 1731 t ORIGIN 1 cttgccatcg tgagagattg gtacatgatg tgtaaattca gttcagcata tgtttcttca 61 ttatgaaacc actagcaatc ccagctaacc atggagttat gggccagcag gagaaacact 121 cacttcctgc agatttcaca aaactgcatc ttactgacag tctccaccca caggtgaccc 181 acgtttcttc tagccattca ggatgtagta tcactagtga ttctgggagc agcagtcttt 241 ctgatatcta ccaggccaca gaaagcgagg ctggtgatat ggacctgagt gggttgccag 301 aaacagcagt ggattccgaa gacgacgacg atgaagaaga cattgagaga gcatcagatc 361 ctctgatgag cagggacatt gtgagagact gcctagagaa ggacccaatt gaccggacag 421 atgatgacat tgaacaactc ttggaattta tgcaccagtt gcctgctttt gccaatatga 481 caatgtcagt gaggcgagaa ctctgtgctg tgatggtgtt cgcagtggtg gaaagagcag 541 ggaccatagt gttaaatgat ggtgaagagc tggactcctg gtcagtgatt ctcaatggat 601 ctgtggaagt gacttatcca gatggaaaag cagaaatact gtgcatggga aatagttttg 661 gtgtctctcc taccatggac aaagaataca tgaaaggagt gatgagaaca aaggtggatg 721 actgccagtt tgtctgcata gcccagcaag attactgccg tattctcaat caagtagaaa 781 agaacatgca aaaagttgaa gaggaaggag agattgttat ggtgaaagaa caccgagaac 841 ttgatcgaac tggaacaaga aagggacaca ttgtcatcaa gggtacctca gaaaggttaa 901 caatgcattt ggtggaagag cattcagtag tagatccaac attcatagaa gactttctgt 961 tgacctatag gacttttctt tctagcccaa tggaagtggg caaaaagtta ttggagtggt 1021 ttaatgaccc gagcctcagg gataaggtta cacgggtagt attattgtgg gtaaataatc 1081 acttcaatga ctttgaagga gatcctgcaa tgactcgatt tttagaagaa tttgaaaaca 1141 atctggaaag agagaaaatg ggtggacacc taaggctgtt gaatatcgcg tgtgctgcta 1201 aagcaaaaag aagattgatg acgttaacaa aaccatcccg agaagctcct ttgcctttta 1261 tcttacttgg aggctctgag aagggatttg gaatctttgt tgacagtgta gattcaggta 1321 gcaaagcaac tgaagcaggc ttgaaacggg gggatcagat attagaagta aatggccaaa 1381 actttgaaaa cattcagctg tcaaaagcta tggaaattct tagaaataac acacatttat 1441 ctatcactgt gaaaaccaat ttatttgtat ttaaagaact tctaacaaga ttgtcagaag 1501 agaaaagaaa tggtgccccc caccttccta aaattggtga cattaaaaag gccagtcgct 1561 actccattcc agatcttgct gtagatgtag aacaggtgat aggacttgaa aaagtgaaca 1621 aaaaaagtaa agccaacact gtgggaggaa ggaacaagct gaaaaagata ctcgacaaga 1681 ctcggatcag tatcttgcca cagaaaccat acaatgatat tgggattggt cagtctcaag 1741 atgacagcat agtaggatta aggcagacaa agcacatccc aactgcattg cctgtcagtg 1801 gaaccttatc atccagtaat cctgatttat tgcagtcaca tcatcgcatt ttagacttca 1861 gtgctactcc tgacttgcca gatcaagtgc taagggtttt taaggctgat cagcaaagcc 1921 gctacatcat gatcagtaag gacactacag caaaggaagt ggtcattcag gctatcaggg 1981 agtttgctgt tactgccacc ccggatcaat attcactatg tgaggtctct gtcacacctg 2041 agggagtaat caaacaaaga agacttccag atcagctttc caaacttgca gacagaatac 2101 aactgagtgg aaggtattat ctgaaaaaca acatggaaac agaaactctt tgttcagatg 2161 aagatgctca ggagttgttg agagagagtc aaatttccct ccttcagctc agcactgtgg 2221 aagttgcaac acagctctct atgcgaaatt ttgaactctt tcgcaacatt gaacctactg 2281 aatatataga tgatttattt aaactcagat caaaaaccag ctgtgccaac ctgaagagat 2341 ttgaagaagt cattaaccag gaaacatttt gggtagcatc tgaaattctc agagaaacaa 2401 accagctgaa gaggatgaag atcattaagc atttcatcaa gatagcactg cactgtaggg 2461 aatgcaagaa ttttaactca atgtttgcaa tcatcagtgg cctaaacctg gcaccagtgg 2521 caagactgcg aacgacctgg gagaaacttc ccaataaata cgaaaaacta tttcaagatc 2581 tccaagacct gtttgatcct tccagaaaca tggcaaaata tcgtaatgtt ctcaatagtc 2641 aaaatctaca acctcccata atccctctat tcccagttat caaaaaggat ctcaccttcc 2701 ttcacgaagg aaatgactca aaagtagacg ggctggtcaa ttttgagaag ctaaggatga 2761 ttgcaaaaga aattcgtcac gttggccgaa tggcttcagt gaacatggac cctgccctca 2821 tgttcaggac tcggaagaag aaatggcgga gtttggggtc tctcagccag ggtagtacaa 2881 atgcaacagt gctagatgtt gctcagacag gtggtcataa aaagcgggta cgtcgtagtt 2941 cctttctcaa tgccaaaaag ctttatgaag atgcccaaat ggctcgaaaa gtgaagcagt 3001 acctttccaa tttggagcta gaaatggacg aggagagtct tcagacatta tctctgcagt 3061 gtgagccagc aaccaacaca ttgcctaaga atcctggtga caaaaagcct gtcaaatccg 3121 agacctctcc agtagctcca agggcagggt cacaacagaa agctcagtcc ctgccacagc 3181 cccagcagca gccaccacca gcacataaaa tcaaccaggg actacaggtt cccgccgtgt 3241 ccctttatcc ttcacggaag aaagtgcccg taaaggatct cccacctttt ggcataaact 3301 ctccacaagc tttaaaaaaa attctttctt tgtctgaaga aggaagtttg gaacgtcaca 3361 agaaacaggc tgaagataca atatcaaatg catcttcgca gctttcttct cctcctactt 3421 ctccacagag ttctccaagg aaaggctata ctttggctcc cagtggtact gtggataatt 3481 tttcagattc tggtcacagt gaaatttctt cacgatccag tattgttagc aattcgtctt 3541 ttgactcagt gccagtctca ctgcacgatg agaggcgcca gaggcattct gtcagcatcg 3601 tggaaacaaa cctagggatg ggcaggatgg agaggcggac catgattgaa cctgatcagt 3661 atagcttggg gtcctatgca ccaatgtccg agggccgagg cttatatgct acagctacag 3721 taatttcttc tccaagcaca gaggaacttt cccaggatca gggggatcgc gcgtcacttg 3781 atgctgctga cagtggccgt gggagctgga cgtcatgctc aagtggctcc catgataata 3841 tacagacgat ccagcaccag agaagctggg agactcttcc attcgggcat actcactttg 3901 attattcagg ggatcctgca ggtttatggg catcaagcag ccatatggac caaattatgt 3961 tttctgatca tagcacaaag tataacaggc aaaatcaaag tagagagagc cttgaacaag 4021 cccagtcccg agcaagctgg gcgtcttcca caggttactg gggagaagac tcagaaggtg 4081 acacaggcac aataaagcgg aggggtggaa aggatgtttc cattgaagcc gaaagcagta 4141 gcctaacgtc tgtgactacg gaagaaacca agcctgtccc catgcctgcc cacatagctg 4201 tggcatcaag tactacaaag gggctcattg cacgaaagga gggcaggtat cgagagcccc 4261 cgcccacccc tcccggctac attggaattc ccattactga ctttccagaa gggcactccc 4321 atccagccag gaaaccgccg gactacaacg tggcccttca gagatcgcgg atggtcgcac 4381 gatcctccga cacagctggg ccttcatccg tacagcagcc acatgggcat cccaccagca 4441 gcaggcctgt gaacaaacct cagtggcata aaccgaacga gtctgacccg cgcctcgccc 4501 cttatcagtc ccaagggttt tccaccgagg aggatgaaga tgaacaagtt tctgctgttt 4561 gaggcacaga cttttctgga agcagagcga gccacctgaa aggagagcac aagaagacgt 4621 cctgagcatt ggagccttgg aactcacatt ctgaggacgg tggaccagtt tgcctccttc 4681 cctgccttaa aagcagcatg gggcttcttc tccccttctt cctttcccct ttgcatgtga 4741 aatactgtga agaaattgcc ctggcacttt tcagactttg ttgcttgaaa tgcacagtgc 4801 agcaatcttc gagctcccac tgttgctgcc tgccacatca cacagtatca ttccaaattc 4861 caagatcatc acaacaagat gattcactct ggctgcactt ctcaatgcct ggaaggattt 4921 tttttaatct tccttttaga tttcaatcca gtcctagcac ttgatctcat tgggataatg 4981 agaaaagcta gccattgaac tacttggggc ctttaaccca ccaaggaaga caaagaaaaa 5041 caatgaaatc ctttgagtac agtgcttgtc cacttgttta caatgtcctc cttttaaaaa 5101 aaaaaatgag tttaaagatt ttgttcagag agtaaatata tatccattta atgattacag 5161 tattatttta aaccttaagt agggttgcca gcctggtttc tgaaaaacca aatatgccgg 5221 acagggtgtg gccacaccaa gaagacggga agacctggct tgtgaccctg gcttcccatg 5281 tccttctggt ctcacccgcg aagtgcccta tcctggaagt atgaaatgtt agccaattaa 5341 taccaagaca cctcatctgc tccttcccca gtggatgggg ttcttctgta aaactgtttg 5401 cacatggcca ggggagggaa ctaggaccct tgtgtcctgt ctgagcctta tggaggcagg 5461 acggtgtcat tggcggatgt gtcctgctcc attgagatgg atggcaaacc ccatttttaa 5521 gttatatttc tttgattttt gttaatttag aggtgtaggt tttgtttttt gttttttgtt 5581 tttttttaag agaaacattt ataactggat agcattgcag tgaaagcagc ttgggatgtt 5641 ggagctaatg ccagctgttt atactgctct ttcaagacag cctcccttta ttgaattggc 5701 attagggaat aaacaagcct ttaaacgtga taaaagatca aaaacctggt tagacatgcc 5761 agcctttgca aggcaggtta gtcaccaaag actaacctcc aagtggcttt atggacgctg 5821 catatagaga aggcctaagt gtagcaacca tctgctcaca gctgctatta accctataat 5881 gactgaaatg acccctccac tctatttttg tgttgttttg cacagactcc ggaaaagtga 5941 aggctgccaa tctgagtagt actcaaatgt gaggaactgc tggtcttgga ttttttttcc 6001 attaaattca gctgatcata ttgatcagta gataaacgta aatagcttca aattttaaaa 6061 gtggaattgc agtgtttttt cactgtatca aacaatgtca gtgctttatt taataattct 6121 cttctgtatc atggcatttg tctacttgct tattacattg tcaattatgc atttgtaatt 6181 ttacatgtaa tatgcattat ttgccagttt tattatatag gctatggacc tcatgtgcat 6241 atagaaagac agaaatctag ctctaccaca agttgcacaa atgttatcta agcattaagt 6301 aattgtagaa cataggactg ctaatctcag ttcgctctgt gatgtcaagt gcagaatgta 6361 caattaactg gtgatttcct catacttttg atactacttg tacctgtatg tcttttagaa 6421 agacattggt ggagtctgta tcccttttgt atttttaata caataattgt acatattggt 6481 tatatttttg ttgaagatgg tagaaatgta ctatgtttat gcttctacat ccagtttgta 6541 caagctggaa aataaataaa tataacat // LOCUS AB002314 6935 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0316 gene, complete cds. ACCESSION AB002314 NID g2224572 KEYWORDS KIAA0316. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG0253. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6935) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..6935 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG0253" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 498..3782 /gene="KIAA0316" CDS 498..3782 /gene="KIAA0316" /codon_start=1 /db_xref="PID:d1021612" /db_xref="PID:g2224573" /translation="MTANRDGRDYFINHMTQAIPFDDPRLESCQIIPPAPRKVEMRRD PVLGFGFVAGSEKPVVVRSVTPGGPSEGKLIPGDQIVMINDEPVSAAPRERVIDLVRS CKESILLTVIQPYPSPKSAFISAAKKARLKSNPVKVRFSEEVIINGQVSETVKDNSLL FMPNVLKVYLENGQTKSFRFDCSTSIKDVILTLQEKLSIKGIEHFSLMLEQRTEGAGT KLLLLHEQETLTQVTQRPSSHKMRCLFRISFVPKDPIDLLRRDPVAFEYLYVQSCNDV VQERFGPELKYDIALRLAALQMYIATVTTKQTQKISLKYIEKEWGLETFLPSAVLQSM KEKNIKKALSHLVKANQNLVPPGKKLSALQAKVHYLKFLSDLRLYGGRVFKATLVQAE KRSEVTLLVGPRYGISHVINTKTNLVALLADFSHVNRIEMFSEEESLVRVELHVLDVK PITLLMESSDAMNLACLTAGYYRLLVDSRRSIFNMANKKNTATQETGPENKGKHNLLG PDWNCIPQMTTFIGEGEQEAQITYIDSKQKTVEITDSTMCPKEHRHLYIDNAYSSDGL NQQLSQPGEAPCEADYRSLAQRSLLTLSGPETLKKAQESPRGAKVSFIFGDFALDDGI SPPTLGYETLLDEGPEMLEKQRNLYIGSANDMKGLDLTPEAEGIQFVENSVYANIGDV KSFQAAEGIEEPLLHDICYAENTDDAEDEDEVSCEEDLVVGEMNQPAILNLSGSSDDI IDLTSLPPPEGDDNEDDFLLRSLNMAIAAPPPGFRDSSDEEDSQSQAASFPEDKEKGS SLQNDEIPVSLIDAVPTSAEGKCEKGLDNAVVSTLGALEALSVSEEQQTSDNSGVAIL RAYSPESSSDSGNETNSSEMTESSELATAQKQSENLSRMFLATHEGYHPLAEEQTEFP ASKTPAGGLPPKSSHALAARPATDLPPKVVPSKQLLHSDHMEMEPETMETKSVTDYFS KLHMGSVAYSCTSKRKSKLADGEGKAPPNGNTTGKKQQGTKTAEMEEEASGKFGTVSS RDSQHLSTFNLERTAFRKDSQRWYVATEGGMAEKKWIRSSNRENLSKSFWSWGKGGRR EGRRSS" BASE COUNT 2013 a 1637 c 1694 g 1591 t ORIGIN 1 tttccagcca acgccaaaca gtgactgttg acaatttcat attgtcatca ggggaaccaa 61 ggcttattca gatgcctatt tcagaaccta ggacagttcc attgaaaagg cgcaggcgtt 121 cgggctggct gactagatgg atcaggcctg gctgcctgat ggctatattc ctccttcctc 181 cctctccact tccatctcaa cccttgaggc tgcatattga atagttggag aattcagtga 241 actaagagat gcaaatgcac agtacaaaat tcaaatgtcc aattcggggc agggctgcat 301 ctaactttaa tggcaaccac tgcatgtgat gtctggggac tctatagata catggcctca 361 gaccctgaag acatctggat tctgtcactg gattgttcac aaagtgaggc tgaactttcc 421 acaggacgaa gtcttcaggc tggccgcctc cctcgggaac ctggggcttg agccaggtgc 481 cgccctatgg atgggagatg acggcaaacc gagatgggcg agactacttc atcaatcaca 541 tgacacaggc aatccctttt gacgaccctc ggttagagag ctgccaaatc atccctccgg 601 ctcctcggaa ggtggagatg agaagggacc ccgtgctggg atttggtttt gtggcaggca 661 gtgaaaagcc agtggtcgtt cgctcagtaa caccaggtgg cccctctgaa ggcaagctga 721 tcccgggaga tcagattgta atgattaatg atgaaccggt cagcgctgca cccagagagc 781 gggtcatcga tctggtcaga agctgcaaag aatcgatact cctcactgtc attcagcctt 841 acccttctcc caaatcagca tttattagtg ctgcaaaaaa ggcaagatta aagtccaatc 901 ctgtcaaagt acgcttctct gaggaggtca tcatcaacgg ccaagtgtcg gaaactgtta 961 aggacaactc acttcttttt atgccaaatg ttttgaaagt ctatctggaa aatgggcaga 1021 ccaaatcatt tcgttttgac tgcagcactt ccattaagga tgtcatctta acccttcaag 1081 agaagctctc catcaaaggc attgaacact tctctctcat gctggagcag aggacagaag 1141 gggctggaac gaagctgctc ttgcttcatg aacaggagac tctaactcag gtgacacaga 1201 ggcccagctc ccataagatg agatgtcttt tccgaattag cttcgtccca aaagatccaa 1261 ttgacctttt aaggagagat ccagttgctt tcgagtatct ctatgttcag agttgtaacg 1321 atgtggttca ggagcgattt gggccggagc tgaaatatga catagccctg cggctggccg 1381 cattacaaat gtacattgca accgttacca ccaagcaaac gcagaaaatc tccctcaaat 1441 acatcgaaaa agaatgggga ttagagactt ttcttccctc tgctgtgctg caaagcatga 1501 aagagaagaa cataaagaaa gcactttcac accttgtcaa agcaaatcaa aacttggtac 1561 caccgggtaa aaagctctct gcactacaag ccaaggtcca ttatctcaag ttcctcagtg 1621 acctacgatt gtatgggggc cgtgtgttca aggcaacatt agtgcaggca gaaaagcgct 1681 cggaagtgac tctcctggtt gggccccggt atggcataag ccatgtcatc aacaccaaaa 1741 ccaatctggt ggctctttta gccgacttta gccacgtcaa caggatcgaa atgttttccg 1801 aggaggagag cttggtgcgg gtagaactcc acgtgctaga tgtgaagcct atcacgcttc 1861 tgatggaatc ctcagatgcc atgaacctgg cctgcttgac ggctggatac taccggctgc 1921 ttgttgattc caggaggtcg atatttaaca tggccaacaa gaaaaacaca gcgacccagg 1981 aaacaggacc tgaaaacaag gggaagcata acctccttgg cccagattgg aactgtatac 2041 cccaaatgac cacctttatt ggcgaagggg aacaagaagc ccagataaca tacatagatt 2101 caaagcagaa gacggtggag atcacagaca gcaccatgtg tccaaaagag caccggcact 2161 tgtacataga caatgcctat agttcagatg gacttaacca gcagctgagc cagcccgggg 2221 aggccccctg tgaggcagac tacagaagtc tagctcagcg gtccctattg accctctcag 2281 gaccagaaac tctgaagaaa gcacaggaat ctccgagagg agctaaagtg tcctttattt 2341 ttggagactt cgccttggat gatggtatta gtcccccaac ccttggctat gaaacgctac 2401 tagatgaggg tcctgaaatg ctggagaagc agagaaatct ctacattggc agtgccaatg 2461 acatgaaggg cctggatctc actccagagg cagagggcat ccagtttgtg gaaaattctg 2521 tttatgcaaa cataggcgat gtgaagagct tccaggccgc ggaggggatc gaggaacccc 2581 tcttgcatga catctgttat gcagaaaaca ctgatgacgc ggaggacgag gacgaggtga 2641 gctgcgagga ggacctcgtg gtgggggaga tgaaccagcc ggccatcctc aacctgtctg 2701 ggtcaagcga tgacatcatt gacctcacat ccctgccccc tccagaaggt gatgacaatg 2761 aggatgactt cctgttgcgt tccttgaaca tggccattgc cgcaccccca cctggcttta 2821 gagacagttc agatgaagag gactctcaga gccaggcagc ttccttcccc gaggacaagg 2881 agaaaggcag cagcctgcaa aatgatgaga tccccgtgtc cctcattgac gctgtgccca 2941 ccagcgccga aggcaagtgt gagaagggac tggataatgc cgtcgtctcc acgctgggag 3001 ctctagaggc tctatccgtg tcagaagaac agcagaccag tgacaattca ggtgtagcca 3061 tcttgcgggc ttatagtcct gagtcttcgt cagactcggg caatgaaact aactcttctg 3121 aaatgactga gagttctgaa ctggccacag cacaaaaaca gtcagaaaac ctctcccgca 3181 tgttcttggc cactcacgaa ggctaccacc cccttgcaga agagcagacc gagttcccgg 3241 cctccaagac ccccgctggg ggcttgcctc caaagtcctc gcacgccctg gctgctaggc 3301 cagcaaccga cctcccgccc aaagttgtgc cttccaagca gttacttcac tcagaccaca 3361 tggagatgga gcctgaaact atggagacta agtcggtcac tgactatttt agcaaactgc 3421 acatggggtc ggtggcatac tcctgcacta gcaaaaggaa aagcaagctg gccgatggtg 3481 aggggaaggc accccctaat gggaacacaa caggaaaaaa acagcagggg accaaaacgg 3541 cagagatgga ggaggaggcc agtggtaaat ttggtactgt gtcttcacga gacagtcaac 3601 acctgagcac ttttaatctg gagagaactg cctttcgcaa ggacagtcaa agatggtatg 3661 tggccactga aggtgggatg gctgaaaaaa agtggattag aagcagcaac agggaaaacc 3721 tttccaagag cttctggtct tggggcaagg gaggccgaag ggaaggaaga aggagctcct 3781 gatggagaaa ccagtgatgg ctcaggactt ggtcaagggg accgcttctt aactgacgtg 3841 acctgtgcat cttcagccaa agacttagat aacccagagg acgctgactc gtccacctgc 3901 gaccatcctt ccaagcttcc tgaggctgat gagagtgtgg cccgcctttg tgactaccac 3961 ttggccaagc ggatgtcatc actgcaaagc gagggccatt tttctctgca gagctcccaa 4021 ggctcttcag tggatgcagg ctgtggcaca ggcagcagtg gcagtgcctg tgccacaccc 4081 gtggagtcgc cgctctgccc ctccctgggg aagcacttga ttcctgacgc ttctgggaaa 4141 ggcgtgaatt acattccttc agaggagaga gcccctgggc ttcccaacca cggagccacc 4201 tttaaggaac tgcacccaca gacagaaggg atgtgtccac ggatgacagt gcctgctctg 4261 cacacagcca ttaacaccga acccctgttt ggcacattga gagatggatg ccatcggctc 4321 cccaagatta aggaaaccac agtgtagctt tgacagagcc tgggaaggag agacgaggag 4381 gcatgccttc agcttggtct caacatcctg aagctgatcc catcctgcta ccatcaaaca 4441 ttcactcgga atcaaaggtg ccaattccaa atcaagaccc taatgatttc tcccaagcaa 4501 atcaggcata cggagaggct gtgagctggc ggccaccgga tctgagaggg gggagcctca 4561 ggacacctcc cagccagaag gctctgagac atagcagcag tatcctctcc ggatctgtcg 4621 atttggagac cttccgagag agaaccaagg gtgcagtcag cttaaagtgt ccaggcatca 4681 cagaagcaca ggaggccagt tctgaaaggc gagcagaact ccccctgggg aggaagctca 4741 ccaaaagttt ttcccaaagc tcaatgcact tgagctctga ggggaggttt cacaaaaggt 4801 ccccagtggc tcataaagac tcaaagctgt ataggacatt acccttgcgg aagctggagg 4861 gcagcaattg gagatgccgg ggacccttca gctattgctt cctgaaccga gggcaggatg 4921 aagatggtga ggaagaagag gagaggggag aggccaccgt ccaggtctct tgcctctata 4981 gaccacaggt gactcaagcc atgccagaac caagcagccc atgcctggct gtggcgattc 5041 agaagcaacg aggggagcta tccagagggt cagtgctgaa ggtctgggca gaagacctgc 5101 gagacccaga tgacttggac ttcagcaacc tggcttttga tgcccggatt gcaagaataa 5161 atgccctaaa ggagagcaca tatgcaatgc ctgatgggtt ccttgcagcc caaaatgatg 5221 ccaatgagct gctctgtctc gtcagggcaa ccaaggagaa gagggaggag tcacgccctg 5281 aagcgtacga ccttacactt tctcagtaca agcaactgtt atccattgag tccagacagt 5341 tgggaagtgc ctgtaggaaa atggcgatgg ctgagaaaag cccggaggag atgctcctag 5401 ctatgacttc cagctttcaa gtgctctgtt gcctaacaga agcttgcatg cgattagtta 5461 aagtcgtgaa ctcagaaaca cagcggcagg aaattgtagg gaagatcgat gaagtggtca 5521 taaattacat ttgtctactg aaagctgccg aagcagccac tggaaagaac cctggggacc 5581 ctaatgttgg actctcggcg cgacactcaa ccaccatggc cgctctcgta agcacactga 5641 cacgttctct caagaggctt ttaaacaaat aaatatggaa gtcacgtcat aatctacctt 5701 tgcaaagcca tacatgaact tttatttact ttgtgtgtat gatgaacaga tgtctccttt 5761 cttctctctg tatattttgt tattttatat aaaataggag ataaaagtca cactgatgaa 5821 atgttgaaat gtactaatca gatgtattct gtttatatta tacatatata tacacgtaaa 5881 agaaatatcc aagaaagtga tgacatttgg ctatttttca tatagttaaa actccaggta 5941 tatgatgtga aattttaaat tctaccatgt tagagcaaaa caatgaatcc tatccccttt 6001 ctttccaagt agctacttgg aaaccatatc attcatattt agaagtaaaa cacaaaacaa 6061 aaaagagaga gaaaagaaaa gaaatcacaa tgtatataaa acagtactta tgttttaaaa 6121 ttatgatttt taagcattgg aaatagcaaa aagacattta aaattcaaga agctattatg 6181 aattactaga gaatatatct gtaataaatt aattttttgc tcatagtatt tggttactgg 6241 atgctttctt ccaagaatcc cacatattta atttgggttt ttgctactgg ggctacaaat 6301 tggtggggat ggattctact gtgtcagcac aaatgctctt cacagtggtt ctagcattta 6361 aaaaacttcc cggggagaag aacagagggg atgatgggca gtttcctagg taacacctag 6421 agttatagaa tatctcatta cataaaatgt atggaattaa taataccaaa attaattatt 6481 tgatggaaag atctgctttg actaaatgtc aaaaatctgc aaaccaaaga cattatcttc 6541 ccctcatccc aactcaacta cgaaacttaa aattcccttt agagtgatag gacatttagt 6601 aaagtatttg caaacttaaa aaaaggaaca tttaatgatc atcaaaatta agtacagatt 6661 cagtaatgta gaccagacca cacaccagca cctgtgagtc tcatctcaga tcacagctct 6721 cagcataggg cttcatgcat caccgcctct acagaggcta aggctgccag tcaaatttgg 6781 aattatagcg tagtactggg acaaaatctc aaatcttgga tgttccagaa aatcagggag 6841 tgatggctac tgtaatcatg ggagccatga gtaaatagtt aagtatttat taaataaata 6901 cttaatctgg attggctgat aaaaatatga aatct // LOCUS AB002315 5402 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0317 gene, complete cds. ACCESSION AB002315 NID g2224574 KEYWORDS KIAA0317. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG0276. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5402) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..5402 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG0276" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 472..2943 /gene="KIAA0317" CDS 472..2943 /gene="KIAA0317" /codon_start=1 /db_xref="PID:d1021613" /db_xref="PID:g2224575" /translation="MFYVIGGITVSVVAFFFTIKFLFELAARVVSFLQNEDRERRGDR TIYDYVRGNYLDPRSCKVSWDWKDPYEVGHSMAFRVHLFYKNGQPFPAHRPVGLRVHI SHVELAVEIPVTQEVLQEPNSNVVKVAFTVRKAGRYEITVKLGGLNVAYSPYYKIFQP GMVVPSKTKIVCHFSTLVLTCGQPHTLQIVPRDEYDNPTNNSMSLRDEHNYTLSIHEL GPQEEESTGVSFEKSVTSNRQTFQVFLRLTLHSRGCFHACISYQNQPINNGEFDIIVL SEDEKNIVERNVSTSGVSIYFEAYLYNATNCSSTPWHLPPMHMTSSQRRPSTAVDEED EDSPSECHTPEKVKKPKKVYCYVSPKQFSVKEFYLKIIPWRLYTFRVCPGTKFSYLGP DPVHKLLTLVVDDGIQPPVELSCKERNILAATFIRSLHKNIGGSETFQDKVNFFQREL RQVHMKRPHSKVTLKVSRHALLESSLKATRNFSISDWSKNFEVVFQDEEALDWGGPRR EWFELICKALFDTTNQLFTRFSDNNQALVHPNPNRPAHLRLKMYEFAGRLVGKCLYES SLGGAYKQLVRARFTRSFLAQIIGLRMHYKYFETDDPEFYKSKVCFILNNDMSEMELV FAEEKYNKSGQLDKVVELMTGGAQTPVTNANKIFYLNLLAQYRLASQVKEEVEHFLKG LNELVPENLLAIFDENELELLMCGTGDISVSDFKAHAVVVGGSWHFREKVMRWFWTVV SSLTQEELARLLQFTTGSSQLPPGGFAALCPSFQIIAAPTHSTLPTAHTCFNQLCLPT YDSYEEVHRMLQLAISEGCEGFGML" BASE COUNT 1311 a 1302 c 1310 g 1479 t ORIGIN 1 gccggttccc aggccgggtg gagaccaact ctggggtctc ttggtgggag agtggggagc 61 ccgtcttctg ctgagtgctg ccgccccttc ccaggacagc ggaggtggaa cttcgtcgcg 121 ctgcaacccc cggctcggga tcctggggcg tcctttggca gttgattgca ccctgcacta 181 gagagaagtc caggaatgat atattcttgg gctgtatttc tctaccctgg agttcatcca 241 gttcatccca gaaacactga cctgattact gtgacaaagt ctgatggata gaccctttct 301 gttgaccttg tggcgttttt ctttcatgtg gaagttggaa gacaaggtga aaggggccaa 361 aagttacctg ctttggtgaa taggagtgct ctacttggtt ctcagaacat gggccctcgt 421 ttaaagctgc agctgtgatc ctcggctgtc tgttggcatt gacgggacct gatgttttac 481 gttattggtg gaatcacagt gtctgtggtt gcattcttct tcacaattaa gttcctcttt 541 gagcttgccg cacgtgtagt cagcttcctc cagaatgagg accgcgagcg ccgaggggac 601 cggactattt atgactacgt gcggggaaat tacctggatc cccggtcttg caaagtctcc 661 tgggattgga aggaccccta tgaggtgggc cacagcatgg ccttccgagt gcatttattc 721 tataagaacg ggcagccttt ccctgcacat cggcctgtgg gactaagagt tcacatctct 781 catgtcgagc tagcagtgga aattccagtg acccaggaag tccttcagga gcccaattcc 841 aacgtagtaa aagtggcctt cactgtgcgc aaggctgggc gttatgaaat cacagtgaag 901 cttggtggat taaatgtggc atatagtccc tactacaaaa tttttcaacc tggaatggtg 961 gttccttcta agaccaaaat tgtgtgccac ttttctactc ttgtattgac ctgtgggcag 1021 ccgcacaccc ttcaaatagt accccgagat gagtatgata atcccaccaa caattccatg 1081 tccttgagag atgagcacaa ttacaccttg tccattcatg agctcggccc tcaagaagaa 1141 gagagtactg gtgtctcatt tgagaaatca gtaacatcca acaggcagac tttccaggtg 1201 ttcttgcgac tcaccctgca ttctcgaggc tgcttccatg cttgcatttc ataccaaaat 1261 cagccaatca ataatggtga atttgacatt attgtcctaa gtgaggatga gaagaatatc 1321 gtcgaacgca atgtgtccac ttcaggcgtg agcatttact ttgaggctta tctttataat 1381 gctaccaact gtagcagcac tccatggcac ctgccaccca tgcacatgac ctcttcccag 1441 cgccggccat ccactgctgt tgacgaggaa gatgaagact cgccctctga gtgccacacc 1501 cctgagaagg tgaagaaacc gaagaaggtg tactgctatg tgtcaccaaa gcaattctca 1561 gtgaaggagt tctacctgaa gatcatcccc tggcgccttt acaccttccg agtgtgtcca 1621 ggaacaaaat tttcatacct tggtcctgac cctgtccata agctgctcac actggtggtg 1681 gatgatggca ttcaacctcc tgtggagctc agctgtaagg agaggaacat tctagcagcc 1741 acttttatcc gctccctgca taagaacata ggaggctctg agacctttca ggacaaggtg 1801 aactttttcc agcgagagct tcggcaggta catatgaaaa gaccacattc caaagtcacc 1861 ctgaaggtca gcagacatgc cttgttggaa tcgtctctga aagccactcg gaatttctcc 1921 atctcagatt ggagcaagaa ctttgaggtt gttttccagg atgaagaagc tctggactgg 1981 ggagggcctc gccgggaatg gtttgagcta atctgcaaag cactatttga taccaccaat 2041 cagctcttca cccggttcag tgacaacaac caagcattag tgcatcccaa ccctaatcgc 2101 cccgctcatc tgcgcctgaa aatgtatgag tttgcgggac ggctcgtggg caagtgtctc 2161 tatgagtcct ctctaggagg agcctacaag cagttggtcc gagctcgctt cacccgctct 2221 ttcctggccc aaatcatagg actgcgtatg cattacaagt actttgaaac agatgaccca 2281 gaattctaca aatctaaagt ttgttttatc ctcaacaatg acatgagtga gatggagctg 2341 gtctttgcag aagagaaata taataaatca ggtcaattgg ataaggttgt agaactcatg 2401 acaggtggag ctcaaactcc agtcaccaat gcgaataaaa tcttctattt aaatttgctg 2461 gcccaatatc ggctggccag tcaagtgaaa gaggaggtgg aacatttcct aaaaggcctg 2521 aatgaattgg tccctgagaa ccttttggct atttttgatg agaatgagct tgagctgctg 2581 atgtgtggga ctggagacat cagtgtgtct gacttcaaag cccatgcagt agttgttggt 2641 ggctcatggc atttcagaga aaaggtcatg aggtggtttt ggactgtggt ttccagtctg 2701 acccaggagg agttggctcg gctacttcag ttcacaacag gctcctctca gctaccacct 2761 ggaggctttg ccgccctctg tccctcattt cagattattg ccgctccgac ccatagcacg 2821 ctgcctactg cacacacatg ttttaaccag ctgtgcctcc ctacatatga ctcctatgaa 2881 gaggtgcaca ggatgctgca gctggccatc agcgagggtt gcgagggctt tggcatgctc 2941 tgaccactct cctgtcatcc agttggctcc catgctctct ggagcttctg ggcgcaagtt 3001 acagacatca taaccactga tcctaacaca cataaccatc agccagaaga tgccgcatgc 3061 tcccctgtgt ctggaggatt ttgtcaccta caagccttgt ctttacctca cctgctccct 3121 gcccatatct accacaggcc actttggcat ggtatgtaag ctgagctctt cattctgtca 3181 tgagaagagg accatgctgc tatcatttat ttggtccttt gaagatctca gtagctgagg 3241 gagatggcac acggggctca gccctgttgg gaaactggtg tggagaccct taaatccaca 3301 ctgtgctcca aaactccctc tgctgatact ccttggagac accctctttg gccctcacta 3361 cttgaccaga ctggtacttg agtccttctc atgggtgggg tgattgcctc ttctcatcag 3421 gagccaggag agagggggac agataggagg tggcccatag gagcagtccc gctgcacaat 3481 ggtaggcata ggccatggca ctggactgcc tctaaggact gctaaaaaga atattttttt 3541 gtggtgtcag aactggaaaa agcactttcc cttcgggcat ttctggaaat gattattaat 3601 cacaaagaag aactctgtaa gctttttctt gaattgtagc cagtgagaaa agcagataga 3661 ctgaagaata tgaaggatag ctgagctgta gcctccagag tggggcatgc ctaggcatat 3721 ggctggcttg gagactactg atgcttttcc ctgagtttgt tattggcact gaagtatggc 3781 cggcttgggc cactgacttc cccattatgt agtctgctaa aagctgggga tcctttagca 3841 ttctactgaa gaaaattttg tagcaaaaga ttagaacagt aagaatagta tgccagcaat 3901 ccctgattct ctctctgctt ggtgttctct ggcagcacta agacaaagga acagggacta 3961 ggagtttacg tgcttatcac aggtccttgg cgtcaggaca ctagggatga acatggaggt 4021 gatatgttac acagtaaaca cctgccaacc cctgtactcc cctttgctcc atcagttatc 4081 aggaaggaga actaaggagg gaggaagctt attaggttca ctgttgaatg atgattgaag 4141 agtacctgct gctgtatctt ggaagatgac aacatccttt ttcattactg tttgattgaa 4201 aataatttca agattcaaaa tctcattgac ttcccaaatt tgtgttttta agaaggcttt 4261 gctgcaagct accattccca atggggcagt caactctgag aatattacaa gccatcttta 4321 ttgccaaaac cagcaagtac acagagtcct gaaacaaggc aggactttgg caatgccttg 4381 gcctcctgga agcatgtctg agccctcctg ttcgcaggga tcatgaggaa caagccctcc 4441 tgagctgcta cagttcctgc ctgacctcag tctttcgcag tcatcatttg tcctttcttg 4501 gcacatgggt gctgacctca ctcatccttg gggtccggga tccagcatca gggtctctat 4561 accccaggat cctccatgcc acatggtcac tgctctcctc agggaccagg acaaggtgct 4621 gctgcttgcc caaatgtttt cccatgggat atgtaccgga aggtttatca ccaccctggg 4681 aaacataatg ctgccccttg ggcccagaaa ggggtttcca gctgggggca cgtggactgg 4741 ttctgctgtt tttggcagtc gcttttctga attctcccct ccggcagcct tccagagact 4801 gatcctggag attgagttga ttgtcttggc tagacctgtc attttaagct ctagtcatag 4861 cactttttca gagcatctta ggcaccattg caacccaacc agggaagctg catccctgtg 4921 gtggtcctta ggcaccagtc tttgttaaac aaaacccttt ggcactattg tggttttcta 4981 ttctctgtct gaactctatt caaaagtatc tttgctctct tgggcctttt cttttactgt 5041 tttgtttttt ttttctaatc ctgctttcat actagccagt gtggggaaaa ggtacaatat 5101 gtcaaagaga tgagagagtg ttatttcttg ggcaattttc tattagtgtt tcttattttg 5161 gccagttctt ttatttatgt ccttgtgacc caggtacttg gggggccagc tacccttctg 5221 gccttttagc gtctttgaag gagaccagac atgagtgaat acctaggaga gtgtcagcat 5281 gtttctggaa aattggcaga gaccaagccc tgctgcagat tcgtcaggcc aggtgaaagg 5341 gccaggcagt tgcagctgat gatgtaaata ttttgtacag tagataaata aatgtttaaa 5401 ag // LOCUS AB002317 6791 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0319 gene, complete cds. ACCESSION AB002317 NID g2224578 KEYWORDS KIAA0319. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG0378. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6791) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..6791 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG0378" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 512..3730 /gene="KIAA0319" CDS 512..3730 /gene="KIAA0319" /codon_start=1 /db_xref="PID:d1021615" /db_xref="PID:g2224579" /translation="MAPPTGVLSSLLLLVTIAGCARKQCSEGRTYSNAVISPNLETTR IMRVSHTFPVVDCTAACCDLSSCDLAWWFEGRCYLVSCPHKENCEPKKMGPIRSYLTF VLRPVQRPAQLLDYGDMMLNRGSPSGIWGDSPEDIRKDLPFLGKDWGLEEMSEYADDY RELEKDLLQPSGKQEPRGSAEYTDWGLLPGSEGAFNSSVGDSPAVPAETQQDPELHYL NESASTPAPKLPERSVLLPLPTTPSSGEVLEKEKASQLQEQSSNSSGKEVLMPSHSLP PASLELSSVTVEKSPVLTVTPGSTEHSIPTPPTSAAPSESTPSELPISPTTAPRTVKE LTVSAGDNLIITLPDNEVELKAFVAPAPPVETTYNYEWNLISHPTDYQGEIKQGHKQT LNLSQLSVGLYVFKVTVSSENAFGEGFVNVTVKPARRVNLPPVAVVSPQLQELTLPLT SALIDGSQSTDDTEIVSYHWEEINGPFIEEKTSVDSPVLRLSNLDPGNYSFRLTVTDS DGATNSTTAALIVNNAVDYPPVANAGPNHTITLPQNSITLNGNQSSDDHQIVLYEWSL GPGSEGKHVVMQGVQTPYLHLSAMQEGDYTFQLKVTDSSRQQSTAVVTVIVQPENNRP PVAVAGPDKELIFPVESATLDGSSSSDDHGIVFYHWEHVRGPSAVEMENIDKAIATVT GLQVGTYHFRLTVKDQQGLSSTSTLTVAVKKENNSPPRARAGGRHVLVLPNNSITLDG SRSTDDQRIVSYLWIRDGQSPAAGDVIDGSDHSVALQLTNLVEGVYTFHLRVTDSQGA SDTDTATVEVQPDPRKSGLVELTLQVGVGQLTEQRKDTLVRQLAVLLNVLDSDIKVQK IRAHSDLSTVIVFYVQSRPPFKVLKAAEVARNLHMRLSKEKADFLLFKVLRVDTAGCL LKCSGHGHCDPLTKRCICSHLWMENLIQRYIWDGESNCEWSIFYVTVLAFTLIVLTGG FTWLCICCCKRQKRTKIRKKTKYTILDNMDEQERMELRPKYGIKHRSTEHNSSLMVSE SEFDSDQDTIFSREKMERGNPKVSMNGSIRNGASFSYCSKDR" BASE COUNT 1812 a 1542 c 1684 g 1753 t ORIGIN 1 gctgccgcgg gcggtgggcg gggatccccc gggggtgcaa ccttgctcca cctgtgctgc 61 cctcggcggg cctggctggc cccgcgcaga gcggcggcgg cgctcgctgt cactgccgga 121 ggtgagagcg cagcagtagc ttcagcctgt cttgggcttg gtccagattc gctcctctgg 181 ggctacgtcc cggggaagag gaagcgagga ttttgctggg gtggggctgt acctcttaac 241 agcaggtgcg cgcgcgaggg tgtgaacgtg tgtgtgtgtg tgtgtctgtg tgtgtgtgtg 301 taagacctgc gatgacgacg aggaggaaca agtgggacgg cgagtgatgc tcagggccag 361 cagcaacgca tggggcgagc ttcagtgtcg ccagcagtga ccacagttct tgaggccaaa 421 tctggctcct aaaaaacatc aaaggaagct tgcaccaaac tctcttcagg gccgcctcag 481 aagcctgcca tcacccactg tgtggtgcac aatggcgccc cccacaggtg tgctctcttc 541 attgctgctg ctggtgacaa ttgcaggttg tgcccgtaag cagtgcagcg aggggaggac 601 atattccaat gcagtcattt cacctaactt ggaaaccacc agaatcatgc gggtgtctca 661 caccttccct gtcgtagact gcacggccgc ttgctgtgac ctgtccagct gtgacctggc 721 ctggtggttc gagggccgct gctacctggt gagctgcccc cacaaagaga actgtgagcc 781 caagaagatg ggccccatca ggtcttatct cacttttgtg ctccggcctg ttcagaggcc 841 tgcacagctg ctggactatg gggacatgat gctgaacagg ggctccccct cggggatctg 901 gggggactca cctgaggata tcagaaagga cttgcccttt ctaggcaaag attggggcct 961 agaggagatg tctgagtacg cagatgacta ccgggagctg gagaaggacc tcttgcaacc 1021 cagtggcaag caggagccca gagggagtgc cgagtacacg gactggggcc tactgccggg 1081 cagcgagggg gccttcaact cctctgttgg agacagtcct gcggtgccag cggagacgca 1141 gcaggaccct gagctccatt acctgaatga gtcggcttca acccctgccc caaaactccc 1201 tgagagaagt gtgttgcttc ccttgccgac tactccatct tcaggagagg tgttggagaa 1261 agaaaaggct tctcagctcc aggaacaatc cagcaacagc tctggaaaag aggttctaat 1321 gccttcccat agtcttcctc cggcaagcct ggagctcagc tcagtcaccg tggagaaaag 1381 cccagtgctc acagtcaccc cggggagtac agagcacagc atcccaacac ctcccactag 1441 cgcagccccc tctgagtcca ccccatctga gctacccata tctcctacca ctgctcccag 1501 gacagtgaaa gaacttacgg tatcggctgg agataaccta attataactt tacccgacaa 1561 tgaagttgaa ctgaaggcct ttgttgcgcc agcgccacct gtagaaacaa cctacaacta 1621 tgaatggaat ttaataagcc accccacaga ctaccaaggt gaaataaaac aaggacacaa 1681 gcaaactctt aacctctctc aattgtccgt cggactttat gtcttcaaag tcactgtttc 1741 tagtgaaaac gcctttggag aaggatttgt caatgtcact gttaagcctg ccagaagagt 1801 caacctgcca cctgtagcag ttgtttctcc ccaactgcaa gagctcactt tgcctttgac 1861 gtcagccctc attgatggca gccaaagtac agatgatact gaaatagtga gttatcattg 1921 ggaagaaata aacgggccct tcatagaaga gaagacttca gttgactctc ccgtcttacg 1981 cttgtctaac cttgatcctg gtaactatag tttcaggttg actgttacag actcggacgg 2041 agccactaac tctacaactg cagccctaat agtgaacaat gctgtggact acccaccagt 2101 tgctaatgca ggaccaaatc acaccataac tttgccccaa aactccatca ctttgaatgg 2161 aaaccagagc agtgacgatc accagattgt cctctatgag tggtccctgg gtcctgggag 2221 tgagggcaaa catgtggtca tgcagggagt acagacgcca taccttcatt tatctgcaat 2281 gcaggaagga gattatacat ttcagctgaa ggtgacagat tcttcaaggc aacagtctac 2341 tgctgtagtg actgtgattg tccagcctga aaacaataga cctccagtgg ctgtggccgg 2401 ccctgataaa gagctgatct tcccagtgga aagtgctacc ctggatggga gcagcagcag 2461 cgatgaccac ggcattgtct tctaccactg ggagcacgtc agaggcccca gtgcagtgga 2521 gatggaaaat attgacaaag caatagccac tgtgactggt ctccaggtgg ggacctacca 2581 cttccgtttg acagtgaaag accagcaggg actgagcagc acgtccaccc tcactgtggc 2641 tgtgaagaag gaaaataata gtcctcccag agcccgggct ggtggcagac atgttcttgt 2701 gcttcccaat aattccatta ctttggatgg ttcaaggtct actgatgacc aaagaattgt 2761 gtcctatctg tggatccggg atggccagag tccagcagct ggagatgtca tcgatggctc 2821 tgaccacagt gtggctctgc agcttacgaa tctggtggag ggggtgtaca ctttccactt 2881 gcgagtcacc gacagtcagg gggcctcgga cacagacact gccactgtgg aagtgcagcc 2941 agaccctagg aagagtggcc tggtggagct gaccctgcag gttggtgttg ggcagctgac 3001 agagcagcgg aaggacaccc ttgtgaggca gctggctgtg ctgctgaacg tgctggactc 3061 ggacattaag gtccagaaga ttcgggccca ctcggatctc agcaccgtga ttgtgtttta 3121 tgtacagagc aggccgcctt tcaaggttct caaagctgct gaagtggccc gaaatctgca 3181 catgcggctc tcaaaggaga aggctgactt cttgcttttc aaggtcttga gggttgatac 3241 agcaggttgc cttctgaagt gttctggcca tggtcactgc gaccccctca caaagcgctg 3301 catttgctct cacttatgga tggagaacct tatacagcgt tatatctggg atggagagag 3361 caactgtgag tggagtatat tctatgtgac agtgttggct tttactctta ttgtgctaac 3421 aggaggtttc acttggcttt gcatctgctg ctgcaaaaga caaaaaagga ctaaaatcag 3481 gaaaaaaaca aagtacacca tcctggataa catggatgaa caggaaagaa tggaactgag 3541 gcccaaatat ggtatcaagc accgaagcac agagcacaac tccagcctga tggtatccga 3601 gtctgagttt gacagtgacc aggacacaat cttcagccga gaaaagatgg agagagggaa 3661 tccaaaggtt tccatgaatg gttccatcag aaatggagct tccttcagtt attgctcaaa 3721 ggacagataa tggcgcagtt cattgtaaag tggaaggacc ccttgaatcc aagaccagtc 3781 agtgggagtt acagcacaaa acccactctt ttagaatagt tcattgacct tcttccccag 3841 tgggttagat gtgtatcccc acgtactaaa agaccggttt ttgaaggcac aaaacaaaaa 3901 ctttgctctt ttaactgaga tgcttgttaa tagaaataaa ggctgggtaa aactctaagg 3961 tatatactta aaagagtttt gagtttttgt agctggcaca atctcatatt aaagatgaac 4021 aacgatttct atctgtagaa ccttagagaa ggtgaatgaa acaaggtttt aaaaagggat 4081 gatttctgtc ttagccgctg tgattgcctc taaggaacag cattctaaac acggtttctc 4141 ttgtaggacc tgcagtcaga tggctgtgta tgttaaaata gcttgtctaa gaggcacggg 4201 ccatctgtgg aggtacggag tcttgcatgt agcaagcttt ctgtgctgac ggcaacactc 4261 gcacagtgcc aagccctcct ggtttttaat tctgtgctat gtcaatggca gttttcatct 4321 ctctcaagaa agcagctgtt ggccattcaa gagctaagga agaatcgtat tctaaggact 4381 gaggcaatag aaaggggagg aggagcttaa tgccgtgcag gttgaaggta gcattgtaac 4441 attatctttt ctttctctaa gaaaaactac actgactcct ctcggtgttg tttagcagta 4501 tagttctcta atgtaaacgg atccccagtt tacattaaat gcaatagaag tgattaattc 4561 attaagcatt tattatgttc tgtaggctgt gcgtttggac tgccatagat agggataacg 4621 actcagcaat tgtgtatata ttccaaaact ctgaaataca gtcagtctta acttggatgg 4681 cgtggttatg atactctggt ccccgacagg tactttccaa aataacttga catagatgta 4741 ttcacttcat atgtttaaaa atacatttaa gtttttctac cgaataaatc ttatttcaaa 4801 catgaaagac aattaaaaca ttcccaccca caaagcagta ctcccgagca attaactgga 4861 gttaattgta gcctgctacg ttgactggtt cagggtagtt ccccatccac ccttggtcct 4921 gaggctggtg gccttggtgg tgcccttggc attttttgtg ggaagattag aatgagagat 4981 agaaccagtg ttgtggtacc aagtgtgagc acacctaaac aatatcctgt tgcacaatgc 5041 ttttttaaca catgggaaaa ctaggaatgc attgctgatg aagaagcaag gtatttaaac 5101 accagggcag gagtgccaga gaaaatgttt ccccatgggt tcttaaaaaa aattcagctt 5161 ttaggtgctt ttgtcatctc ccggagtatt catcctcatg ggaccatctt atttttactt 5221 attgtaattt actggggaaa ggcagaacta aaaagtgtgt cattttattt ttaaaataat 5281 tgctttgctt atgcctacac tttctgtata actagccaat tcaatactgt ctatagtgtt 5341 agaaggaaaa tgtgattttt tttttttaac cagtattgag cttcataagc ctagaatctg 5401 ccttatcagg tgaccagggt tatggttgtt tgcatgcaaa tgtgaatttc tggcataggg 5461 gacagcagcc caaatgtaaa gtcatcgggc gtaatgagga agaagggagt gaacatttac 5521 cgctttatgt acataacata tgcagtttac atactcattt gatccttata atcaaccttg 5581 aagaggagat actatcattc ttatgttgca gatagccctc tgaaggccca gagaggttaa 5641 gtaacttccc agaggtcatg gccaagaagt agtggctcca agaactgaat gcaaattttt 5701 taaactgtag agttctgctt tccactaaac aaagaactcc tgccttgatg gatggagggc 5761 aaattctggt ggaacttttg ggccacctga aagttctatt cccaggacta agaggaattt 5821 cttttaatgg atccagagag ccaaggtcag agggagagat ggcctgcata gtctcctgtg 5881 gatcacaccc gggccacccc tccctctagg tttacagtgg acttcttctg cccctcctcc 5941 ttttctgtcc ttggccatct cagcctggcc tctctgatcc ttccatcaca gaaggatctt 6001 gaatctctgg gaaatcaaac atcacagtag tgatcagaaa gtgagtcctg tcttgtcacc 6061 ccatttctca tcagaacaaa gcacgagatg gaatgaccaa ccagcattct tcatggtgga 6121 ctgcttatca ttgaggatct ttgggagata aagcacgcta agagctctgg acagagaaaa 6181 acaggcccta gaatatggga gtgggtgttt gtagggctca taggctaaca agcactttag 6241 ttgctggttt acattcaatg aaggaggatt catacccatg gcattacaag gctaagcatg 6301 tgtatgacta aggaactatc tgaaaaacat gcagcaaggt aagaaaatgt accactcaac 6361 aagccagtga tgccaccttt tgtgcgcggg gaggagagtg actaccattg ttttttgtgt 6421 gacaaagcta tcatggacta ttttaatctt ggttttattg cttaaaatat attatttttc 6481 cctatgtgtt gacaaggtat ttctaatatc acactattaa atatatgcac taatctaaat 6541 aaaggtgtct gtattttctg taatgcttat ttttaggggg aaatttgttt tctttatgct 6601 tcagggtaga gggattccct tgagtatagg tcagcaaact ctggcctgca gcctgtgtgt 6661 gcacgcccca tgagccgaaa agtgggtctt atgttttcaa atggttaaaa ataaataaaa 6721 aaatttgaaa catgtgaact atatgacatt cagatttgtg ttcataaata aagttttatt 6781 ggaacatatc c // LOCUS AB002327 6379 bp mRNA PRI 24-JUL-1997 DEFINITION Human mRNA for KIAA0329 gene, complete cds. ACCESSION AB002327 NID g2280477 KEYWORDS KIAA0329. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG0872. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6379) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 COMMENT Sequence updated (22-Jul-1997). FEATURES Location/Qualifiers source 1..6379 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG0872" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 227..4462 /gene="KIAA0329" CDS 227..4462 /gene="KIAA0329" /codon_start=1 /db_xref="PID:d1021625" /db_xref="PID:g2224599" /translation="MASISEPVTFREFCPLYYLLNAIPTKIQKGFRSIVVYLTALDTN GDYIAVGSSIGMLYLYCRHLNQMRKYNFEGKTESITVVKLLSCFDDLVAAGTASGRVA VFQLVSSLPGRNKQLRRFDVTGIHKNSITALAWSPNGMKLFSGDDKGKIVYSSLDLDQ GLCNSQLVLEEPSSIVQLDYSQKVLLVSTLQRSLLFYTEEKSVRQIGTQPRKSTGKFG ACFIPGLCKQSDLTLYASRPGLRLWKADVHGTVQATFILKDAFAGGVKPFELHPRLES PNSGSCSLPERHLGLVSCFFQEGWVLSWNEYSIYLLDTVNQATIAGLEGSGDIVSVSC TENEIFFLKGDRNIIRISSRPEGLTSTVRDGLEMSGCSERVHVQQAEKLPGATVSETR LRGSSMASSVASEPRSRSSSLNSTDSGSGLLPPGLQATPELGKGSQPLSQRFNAISSE DFDQELVVKPIKVKRKKKKKKTEGGSRSTCHSSLESTPCSEFPGDSPQSLNTDLLSMT SSVLGSSVDQLSAESPDQESSFNGEVNGVPQENTDPETFNVLEVSGSMPDSLAEEDDI RTEMPHCHHAHGRELLNGAREDVGGSDVTGLGDEPCPADDGPNSTQLPFQEQDSSPGA HDGEDIQPIGPQSTFCEVPLLNSLTVPSSLSWAPSAEQWLPGTRADEGSPVEPSQEQD ILTSMEASGHLSTNLWHAVTDDDTGQKEIPISERVLGSVGGQLTPVSALAASTHKPWL EQPPRDQTLTSSDEEDIYAHGLPSSSSETSVTELGPSCSQQDLSRLGAEDAGLLKPDQ FAESWMGYSGPGYGILSLVVSEKYIWCLDYKGGLFCSALPGAGLRWQKFEDAVQQVAV SPSGALLWKIEQKSNRAFACGKVTIKGKRHWYEALPQAVFVALSDDTAWIIRTSGDLY LQTGLSVDRPCARAVKVDCPYPLSQITARNNVVWALTEQRALLYREGVSSFCPEGEQW KCDIVSERQALEPVCITLGDQQTLWALDIHGNLWFRTGIISKKPQGDDDHWWQVSITD YVVFDQCSLFQTIIHATHSVATAAQAPVEKVADKLRMAFWSQQLQCQPSLLGVNNSGV WISSGKNEFHVAKGSLIGTYWNHVVPRGTASATKWAFVLASAAPTKEGSFLWLCQSSK DLCSVSAQSAQSRPSTVQLPPEAEMRAYAACQDALWALDSLGQVFIRTLSKSCPTGMH WTRLDLSQLGAVKLTSLACGNQHIWACDSRGGVYFRVGTQPLNPSLMLPAWIMIEPPV QPAGVSLVSVHSSPNDQMLWVLDSRWNVHVRTGITEEMPVGTAWEHVPGLQACQLALS TRTVWARCPNGDLARRYGVTDKNPAGDYWKKIPGSVSCFTVTASDELWAVGPPGYLLQ RLTKTFSHSHGTQKSSQAAMPHPEDLEDEWEVI" BASE COUNT 1421 a 1818 c 1809 g 1331 t ORIGIN 1 cccccggcgg agccagctgc tgctcttcgg tgctggcccc ggtgccggcc ccgttgccca 61 gggaacaggc tcccggcagc ccccgcggcc cggagtccat cccgcctcct ccggcccggc 121 ggggccgacg agtccggagg ggctgccgcg ggagccccca ggtttcccta gatgacaaat 181 aaacattcct tttcctgcgt gaagatagtc tgtggaaacc ttggccatgg catcgatatc 241 agagcctgtt acattcagag agttctgccc gttgtactat ctcctcaatg ccattccgac 301 aaagatccag aagggtttcc gctctatcgt ggtctatctc acggccctcg acaccaacgg 361 ggactacatc gcggtgggca gcagcatcgg catgctctat ctgtactgcc ggcacctcaa 421 ccagatgagg aagtacaact ttgaggggaa gacggaatct atcactgtgg tgaagctgct 481 gagctgcttt gatgacctgg tggcagcagg cacagcctct ggcagggttg cagtttttca 541 acttgtatct tcattgccag ggagaaataa acagcttcgg agatttgatg tcactggtat 601 tcacaaaaat agcattacag ctctggcttg gagccccaat ggaatgaaat tgttctctgg 661 agatgacaaa ggcaaaattg tttattcttc tctggatcta gaccaggggc tctgtaactc 721 ccagctggtg ttggaggagc catcttccat tgtgcagctg gattatagcc agaaagtgct 781 gctggtctct actctgcaaa gaagtctgct cttttacact gaagaaaagt ctgtaaggca 841 aattggaaca caaccaagga aaagtactgg gaaatttggt gcttgtttta taccaggact 901 ctgtaagcaa agtgatctaa ccttgtatgc gtcacggccc gggctccggc tatggaaggc 961 tgatgtccac gggactgttc aagccacgtt tatcttaaaa gatgcttttg ccgggggagt 1021 caagcctttt gaactgcacc cgcgtctgga atcccccaac agtggaagtt gcagcttacc 1081 tgagaggcac ctggggcttg tttcatgttt ctttcaagaa ggctgggtgc tgagttggaa 1141 tgaatatagt atctatctcc tagacacagt caaccaggcc acaattgctg gtttggaagg 1201 atccggtgat attgtgtctg tttcgtgcac agaaaatgaa atatttttct tgaaaggaga 1261 taggaacatt ataagaattt caagcaggcc tgaaggatta acatcaacag tgagagatgg 1321 tctggagatg tctggatgct cagagcgtgt ccacgtgcag caagcggaga agctgccagg 1381 ggccacagtt tctgagacga ggctcagagg ctcttccatg gccagctccg tggccagcga 1441 gccaaggagc aggagcagct cgctcaactc caccgacagc ggctccgggc tcctgccccc 1501 tgggctccag gccacccctg agctgggcaa gggcagccag cccctgtcac agagattcaa 1561 cgccatcagc tcagaggact ttgaccagga gcttgtcgtg aagcctatca aagtgaaaag 1621 gaagaagaag aagaagaaga cagaaggtgg aagcaggagc acctgtcaca gctccctgga 1681 atcgacaccc tgctccgaat ttcctgggga cagtccccag tccttgaaca cagacttgct 1741 gtcgatgacc tcaagtgtcc tgggcagtag cgtggatcag ttaagtgcag agtctccaga 1801 ccaggaaagc agcttcaatg gtgaagtgaa cggtgtccca caggaaaata ctgaccccga 1861 aacgtttaat gtcctggagg tgtcaggatc aatgcctgat tctctggctg aggaagatga 1921 cattagaact gaaatgccac actgtcacca tgcacatggg cgggagctgc tcaatggagc 1981 gagggaagat gtgggaggca gtgatgtcac gggactcgga gatgagccgt gtcctgcaga 2041 tgatggacca aatagcacac agttaccctt ccaagaacag gacagctctc ctggggcgca 2101 tgatggggaa gacatccaac ccattggccc ccaaagcact ttttgtgaag tccccctcct 2161 gaactcactc actgtgcctt ccagcctcag ctgggcccca agtgctgaac agtggctgcc 2221 tgggaccaga gctgatgaag gcagccccgt ggagcccagc caagagcagg acatcctaac 2281 cagcatggag gcctctggcc acctcagcac aaatctctgg catgctgtca ctgatgatga 2341 cacaggtcag aaagaaatac ccatttctga acgtgtcttg gggagtgtgg gaggacagct 2401 gactccggtc tctgccttgg cagccagcac tcacaagccc tggcttgagc agcctccacg 2461 ggatcagaca ttgacgtcca gcgatgagga ggacatctat gcccacgggc ttccttcttc 2521 atcctcagag acgagtgtga cagagctcgg acctagttgc tcccagcagg acctgagccg 2581 gctgggtgca gaggacgccg ggctgctcaa gccagatcag tttgcagaaa gctggatggg 2641 ctactcgggt cccggctatg gcatcctcag cttggtggtc tccgagaagt atatctggtg 2701 cctggactac aaaggcggcc tgttctgcag cgcgttgccg ggcgccgggc tgcgctggca 2761 gaagtttgaa gatgctgtcc agcaggtggc agtctcgccc tcaggagccc ttctctggaa 2821 gattgaacag aaatctaacc gggcttttgc ttgtgggaaa gtcaccatca aggggaagcg 2881 gcactggtac gaagccctgc cccaggcagt gtttgtggcc ctgagcgatg acacggcctg 2941 gatcatcagg accagtgggg acctatactt gcagacaggt ctgagcgtgg atcgcccttg 3001 tgccagagcc gtaaaggtgg actgtcccta cccgctgtcc cagatcacag cccggaacaa 3061 tgtggtgtgg gcgctgacag agcagagggc cctcctgtac cgggagggcg tgagcagctt 3121 ctgtccggaa ggcgagcagt ggaagtgtga cattgtcagc gaaaggcaag ctttagaacc 3181 cgtctgcata acgctcgggg atcagcagac tctctgggcc ctggacatcc atgggaacct 3241 gtggttcaga actggcatta tttccaagaa gccccaagga gatgacgacc attggtggca 3301 agtgagcatc acggactatg tggtgtttga ccagtgcagc ttatttcaga cgataatcca 3361 tgccactcac tcggtggcca cagcagccca agcccccgta gaaaaggtgg cagataagct 3421 gcgcatggcg ttttggtccc agcagcttca gtgccagcca agccttctcg gggtcaataa 3481 cagcggtgtc tggatctcct cgggcaagaa tgaattccac gtcgctaagg gaagtctcat 3541 aggcacctac tggaatcatg tggttccccg tgggacagct tctgctacaa aatgggcctt 3601 tgtgttggct tctgcagctc ccacgaagga aggaagcttc ctgtggctgt gccagagcag 3661 caaggacctg tgcagcgtca gcgcccagag cgcacagtcg cggccctcca cggtgcagct 3721 gcctcccgaa gccgagatgc gcgcctatgc cgcctgccag gatgcgctgt gggcgctgga 3781 cagcctcggc caggtgttca tcaggacgct ctccaagagc tgccccacgg gcatgcactg 3841 gaccaggctg gacctctccc agctaggagc tgtaaaattg acaagcttgg catgtggaaa 3901 tcagcacatc tgggcctgtg attccagggg tggagtttac ttccgtgtag ggactcagcc 3961 tctcaatccc agtctcatgc ttccagcctg gataatgatt gagccacctg tccagcccgc 4021 cggggtcagc ttggtcagcg tccattccag ccccaacgac cagatgctgt gggtgcttga 4081 cagcaggtgg aacgtgcacg tgcggaccgg gatcaccgag gagatgcctg tggggaccgc 4141 ctgggagcat gtgccagggt tgcaggcctg ccagctggcg ctgagcacca ggaccgtgtg 4201 ggcccgctgt ccaaacggag acctcgcccg gcggtacggc gtcacagaca agaaccccgc 4261 cggggactac tggaagaaaa ttcccggcag cgtgtcgtgt ttcacagtga ctgcgtcaga 4321 tgagctgtgg gctgtgggcc cgcccggcta cctcctccaa cggctgacaa agacgttcag 4381 ccactcgcac ggcacccaga agagcagcca ggccgccatg ccccaccctg aggacctgga 4441 ggacgagtgg gaggtcatct gaaggagccc tggccgagtc acgcggaggg gcccggcgtc 4501 tgtggcgggc acaggggctt cggagtgact ccctggtgga cgcgctgcct caacacttgt 4561 ccagacacct ctggccaggt tggacccgca cacttacttt catctatgtt ggtttctgtc 4621 tcgttccaga acccacagcc tccacccgtg gctggcgtga ttgctgcagc agtggcgcct 4681 cctagctcag gacagtggcg actgcccggc tgcatgcact ccgattaccc acgtgctgcc 4741 gtcctggtct catccacaga tagctccagc ttttgttggt gggagtggtc tccggaggcc 4801 tcccagaacc aagggtagcc gggcagctgg tttggcccag ggcctccttc cacattagta 4861 gccccagggc cagatggagc caaaggtcag ctctctgcag cgcgggatgt gctcagtgat 4921 ggctttgtcc catcataggg gggtgtcccc ccagagacaa agctgcagag cacattccat 4981 gccagacgct ctggccagga agctgaggcc gggcttgaga ggagagcgct ggccatgcca 5041 ggagagaacc cacgcacatg cacaccacaa cacacaacac acctcacctc acaccacagc 5101 acacctcacc acaccacacc gcactgcacc atacctcacc acatctcacc acaccacagc 5161 acacctcacc acacaacaca ccacacccca caccgcactg caccgcaccg caccgcaccg 5221 tacctcgcca catctcacca caccacacca caccacacct cactgcccac acacggcgca 5281 ggctgcccgc ctcctggaga gcgctcttca gctgaaacag taaagcctga tgggtgcaaa 5341 tggaacctgg atgtgtgcac gtgtgtccca ggtagggacg gcacaggagg gtgcatgggg 5401 cgtgggggag ctgagcaagg gtcgctcact tagaaatgtc tttggaatgg tgtttaacta 5461 atgctgctgg cggacatcct aaaaccagat gcatcctcag aggacgagtc tactaattat 5521 tgcctttgtt gttgtattac aaatctgcat aaaatacctc atttcaaatc aaatcttaca 5581 aatttagaag agagatatgt tttccgaaaa cagtggaagc cctttgttcc ttcccgggtt 5641 tgtcctgagc ctgcactgtc ctcgcctgca gcctcagagg ggcaggcatc cccgcacaga 5701 cttgactggc agggcggtca cgggacctgc gggctggctc cgagtggcag cccatgcctt 5761 ctgcagggta tgggttgaca cttgacaggt tgaaaccagt gcctctatgg acggctgctg 5821 tggccccttc agacaatggg cagtgcccac cccgcccact ggcgtctgcg tgtgagggct 5881 aggccgccct gccacacatc ccgccccctc ccggaggcag cttcaggaca ggacaccagg 5941 ctggctgctt tttttagcct gcccctggcc caggcccagt ccttggtgtc agggagcccc 6001 caggccgcag gtggagggtg ataaaatatg ttctctgaca ggacccagcc agccacatag 6061 gtggaggttt tccatgtcca aatgaggtca agatgccgaa atcccagatc tgacttcaca 6121 cttccctttt ctagaacctt ttgtaaaagt tggtggcagc agaggcagcc ccaggccggg 6181 ctgcatctct ctgtgtctgt tgtgccttgc ccggcgcctc acggatggca aagctctcct 6241 cacccatggg actgtagtgc aattaaaccc gcgtctaggt gatgctttta aagttgtagc 6301 ttcgtgcttt gtacagtttt ctttctggtt ttaattttta gttgtgcttt gagtcagtgc 6361 aataaactag actttttcc // LOCUS AB002329 6474 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0331 gene, complete cds. ACCESSION AB002329 NID g2224602 KEYWORDS KIAA0331. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG0928. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6474) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..6474 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG0928" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 467..2794 /gene="KIAA0331" CDS 467..2794 /gene="KIAA0331" /codon_start=1 /db_xref="PID:d1021627" /db_xref="PID:g2224603" /translation="MASAGHIITLLLWGYLLELWTGGHTADTTHPRLRLSHKELLNLN RTSIFHSPFGFLDLHTMLLDEYQERLFVGGRDLVYSLSLERISDGYKEIHWPSTALKM EECIMKGKDAGECANYVRVLHHYNRTHLLTCGTGAFDPVCAFIRVGYHLEDPLFHLES PRSERGRGRCPFDPSSSFISTLIGSELFAGLYSDYWSRDAAIFRSMGRLAHIRTEHDD ERLLKEPKFVGSYMIPDNEDRDDNKVYFFFTEKALEAENNAHAIYTRVGRLCVNDVGG QRILVNKWSTFLKARLVCSVPGMNGIDTYFDELEDVFLLPTRDHKNPVIFGLFNTTSN IFRGHAICVYHMSSIRAAFNGPYAHKEGPEYHWSVYEGKVPYPRPGSCASKVNGGRYG TTKDYPDDAIRFARSHPLMYQAIKPAHKKPILVKTDGKYNLKQIAVDRVEAEDGQYDV LFIGTDNGIVLKVITIYNQEMESMEEVILEELQIFKDPVPIISMEISSKRQQLYIGSA SAVAQVRFHHCDMYGSACADCCLARDPYCAWDGISCSRYYPTGTHAKRRFRRQDVRHG NAAQQCFGQQFVGDALDKTEEHLAYGIENNSTLLECTPRSLQAKVIWFVQKGRETRKE EVKTDDRVVKMDLGLLFLRLHKSDAGTYFCQTVEHSFVHTVRKITLEVVEEEKVEDMF NKDDEEDRHHRMPCPAQSSISQGAKPWYKEFLQLIGYSNFQRVEEYCEKVWCTDRKRK KLKMSPSKWKYANPQEKKLRSKPEHYRLPRHTLDS" BASE COUNT 2107 a 1193 c 1294 g 1880 t ORIGIN 1 gtttggcaag tcagtgcaag aggctgactt ctgagaggct tccaggagcc cgaagagagg 61 acctccacgg gagaagggag tgcgtgtgct cggttttttt tttttctctc tttttttttt 121 ttttttctga atgaacagct ttgcccaagt gactgaaaaa tacagcttct tcctgaatct 181 accggcgtag ttgctgaaga gcgctctaga caggacatgg ctctgaagac tcactctttg 241 gaatgtcctc ttgctcccgg cttataaaca actgtcccga ggaaagaaag gttttacata 301 gccaaataca gcctgacaaa tggcacttcg gaactgtgct ttctgatgac aacgcgttcg 361 atttctgaca aagcctctcg cacgctgccc ctggagggaa gtcctaagta aaactcagac 421 cctccttaaa gtgaggagcg agggcttgga cggtgaacac ggcagcatgg catccgcggg 481 gcacattatc accttgctcc tgtggggtta cttactggag ctttggacag gaggtcatac 541 agctgatact acccaccccc ggttacgcct gtcacataaa gagctcttga atctgaacag 601 aacatcaata tttcatagcc cttttggatt tcttgatctc catacaatgc tgctggatga 661 atatcaagag aggctcttcg tgggaggcag ggaccttgta tattccctca gcttggagag 721 aatcagtgac ggctataaag agatacactg gccgagtaca gctctaaaaa tggaagaatg 781 cataatgaag ggaaaagatg cgggtgaatg tgcaaattat gttcgggttt tgcatcacta 841 taacaggaca caccttctga cctgtggtac tggagctttt gatccagttt gtgccttcat 901 cagagttgga tatcatttgg aggatcctct gtttcacctg gaatcaccca gatctgagag 961 aggaaggggc agatgtcctt ttgaccccag ctcctccttc atctccactt taattggtag 1021 tgaattgttt gctggactct acagtgacta ctggagcaga gacgctgcga tcttccgcag 1081 catggggcga ctggcccata tccgcactga gcatgacgat gagcgtctgt tgaaagaacc 1141 aaaatttgta ggttcataca tgattcctga caatgaagac agagatgaca acaaagtata 1201 tttctttttt actgagaagg cactggaggc agaaaacaat gctcacgcaa tttacaccag 1261 ggtcgggcga ctctgtgtga atgatgtagg agggcagaga atactggtga ataagtggag 1321 cactttccta aaagcgagac tcgtttgctc agtaccagga atgaatggaa ttgacacata 1381 ttttgatgaa ttagaggacg tttttttgct acctaccaga gatcataaga atccagtgat 1441 atttggactc tttaacacta ccagtaatat ttttcgaggg catgctatat gtgtctatca 1501 catgtctagc attcgggcag ccttcaacgg accatatgca cataaggaag gacctgaata 1561 ccactggtca gtctatgaag gaaaagtccc ttatccaagg cctggttctt gtgccagcaa 1621 agtaaatgga gggagatacg gaaccaccaa ggactatcct gatgatgcca tccgatttgc 1681 aagaagtcat ccactaatgt accaggccat aaaacctgcc cataaaaaac caatattggt 1741 aaaaacagat ggaaaatata acctgaaaca aatagcagta gatcgagtgg aagctgagga 1801 tggccaatat gacgtcttgt ttattgggac agataatgga attgtgctga aagtaatcac 1861 aatttacaac caagaaatgg aatcaatgga agaagtaatt ctagaagaac ttcagatatt 1921 caaggatcca gttcctatta tttctatgga gatttcttca aaacggcaac agctgtatat 1981 tggatctgct tctgctgtgg ctcaagtcag attccatcac tgtgacatgt atggaagtgc 2041 ttgtgctgac tgctgcctgg ctcgagaccc ttactgtgcc tgggatggca tatcctgctc 2101 ccggtattac ccaacaggca cacatgcaaa aaggcgtttc cggagacaag atgttcgaca 2161 tggaaatgca gctcagcagt gctttggaca acagtttgtt ggggatgctt tggataagac 2221 tgaagaacat ctggcttatg gcatagagaa caacagtact ttgctggaat gtaccccacg 2281 atctttacaa gcgaaagtta tctggtttgt acagaaagga cgtgagacaa gaaaagagga 2341 ggtgaagaca gatgacagag tggttaagat ggaccttggt ttactcttcc taaggttaca 2401 caaatcagat gctgggacct atttttgcca gacagtagag catagctttg tccatacggt 2461 ccgtaaaatc accttggagg tagtggaaga ggagaaagtc gaggatatgt ttaacaagga 2521 cgatgaggag gacaggcatc acaggatgcc ttgtcctgct cagagtagca tctcgcaggg 2581 agcaaaacca tggtacaagg aattcttgca gctgatcggt tatagcaact tccagagagt 2641 ggaagaatac tgcgagaaag tatggtgcac agatagaaag aggaaaaagc ttaaaatgtc 2701 accctccaag tggaagtatg ccaaccctca ggaaaagaag ctccgttcca aacctgagca 2761 ttaccgcctg cccaggcaca cgctggactc ctgatggggt gagactatct actgtctttt 2821 gaagaattta tatttggaaa gtaaaaaagt aaaaaaataa atcatccaac ttctttgcat 2881 tacttaaaag agatttctgt aatacaggaa tgactatgaa ggtgttataa taaattattc 2941 tacatactca tttgactgga taaactttac ataaaattaa ctaatttttt aaataaatgc 3001 attgcttaat ggtttctcat tatgtttatc aaaaaacaac tgtagctgtt attttcagta 3061 cttggctgct tttctgtgaa aattattatt ttacttttgg aagacaagat tattagaata 3121 ttgaagaaaa attggagact tataatcatg gtaaatataa aactaaatat gttttaatat 3181 ttctgaattt ttcttttcca tcacaatgta agatatgcag aatacaagat actttggcat 3241 tctcatgtga actttctgta ctctttaagg attattttat tagtgttgtt taagccatga 3301 gtgttaagta gcaggtgtgt tgtgagtgct gtaacccatg aaaggaaaaa tgtcattctg 3361 aggcttgtgc ccttcgtaaa atattcatta aagtacattc acactatttt tgctttataa 3421 cacagtcttt aattttcact cactgtggaa ataaaaacta aggtaacttc tcagaaagat 3481 atcaaatctc agaaagaatg tcaaatcaga tgaagttata gttaggattc taactactgt 3541 aaaagatttt tgcttccctc ttgtggtaaa aaaaattata ttctcacaca tttctttttt 3601 ctctacagac ggatatctgt ttaggaaaga tttgaaagca gattatcagt aggtacatgg 3661 atacatcaag ttcatttgca gaaacaaata actgaaataa aaaacatgtt aatccttgta 3721 tcatacttta atatgaaagt attgtttata gataatttat ctcacaagtc aaaaatgaag 3781 attttgcagc actgaaaatc tattaaagct ccaaatttta agtttctaaa taatcttcgc 3841 tgaaatctaa aatatactat aacaaccgtg ttttatttgt gaaaaaaata ttaaagtgat 3901 ttgctctcaa atatcaaatt ttcttctctc ttttatatta agagacagaa aattgtttca 3961 tgagttcact taactactga gatattcaga gcatttttac ctctctctta aatgttataa 4021 aaaacaattg tatttttaag aatgtttatt tatcaaagtc tttccttctt ctattaaata 4081 tttagcaatt acctttctaa aatatgaaat tttgtaagat gttttcacct aaataaaaat 4141 tgaaagcaag tggattacac aggagaacca ttatgaacat ttatttagat attaatctta 4201 aacagtgttt atttcagttt tcaaagttag cttataggtt atacatttaa gttaaagtgc 4261 tcataatcac ttgcaatttc attgtaaaat gaacaaatac ataaatattt taagaaaaat 4321 ttaagtttat tcagataagt caccatgctt caaaagatct aagaaatgca aatatactga 4381 aaattgacat cctctgaaaa ttccacttgc tatttaccca agaatccact ggaggtcatt 4441 actgccatta aataataact gaaaagacta tgtagtgaaa tgtattttta aaaactatat 4501 tcagtaaaag cctgctcaat ttggagaaat agaaccacaa acacagatca caggggcctt 4561 acaaagttta tgtctgaaca aataagtcaa ttaagtacac tttattgaaa attgccttcc 4621 attaacacac aagaaagaaa gcaggatttt ctcctgtatc tgaattttaa aattaaaaag 4681 gcagataaga cataaatagt tatcatttta attgcaataa cacagacaag tagttaatga 4741 tgataacaat ggtgtaactt gtaaactaaa tatttggtaa ctgaagcaat aggcagagga 4801 aaatagcttt tctatgacac aagtcataag aagtccatat actgaagagc gtttgattaa 4861 aataaagtga ctattaacca gaaaagaaac attttacata aaatgctaaa atttattata 4921 ggaaaataaa tcaaacccaa agaaagttta ttcaatgcta atttgaaaga aaattgataa 4981 gaaaactttg agggcccaag tccacaattt ggtgagacca ctaaatttta catataatta 5041 tacacacaca tatgtacata tatatgtata taatcttgct tcccgcctgt ttatggcagt 5101 actgaagaga aatgggaaag aagagggagg gagagagaaa gacgaaggga gagagaaagc 5161 agtttccaag gatatgtttc atgtcccacc attttctcag tttctccctc tctctcccaa 5221 cacacacaca cacacacccc tcacatacta taaaataaat cttcactgcc ctatcaaaat 5281 acaaataaat caatctatgc tgttctgtcc ttcttgagaa tctaaaacat accacaaaaa 5341 tacatcccca gtcttttgtt ctgtctgagg ttagaattaa ttcaaattca gaatctgttg 5401 tgagaaatgc ccaggcttta aaaattaaaa atggatggat cttctctgaa ctcagggagg 5461 gcacatactt agatacctac aagacttgga ggaattaaga gttcaccctt catctcacca 5521 aattttcccc atttttctct ttcttgtaga aggagagaaa ccatgctctc tagcaacatt 5581 gagcaaaaat cataaccact catctaattt ctaagaggca cctccatcga gggccggtct 5641 cctgcttctt tagacctctt ctatctttgt tacaggagag gacctgtgga tagacttagt 5701 tttgacataa aacaatgccc attcacctcc tccttcagca caacgtcacc cattgggcaa 5761 gagatccaga tttgttaaca aaaaagattt tacttcgtga ttccacgtct ataattctat 5821 attgctaatt ttttcttttg tgtgaattac tgaatatttc agagcaaagc tatcaacttg 5881 gagaaacagg gattaaaaat aaggataaac actaataaga gctctagaaa aaagggaaca 5941 gaaagtctgc ctgtttagta agtggcaatt ccatacatat tttagagttt tttctatcta 6001 aaattagtta aatacttaga atgtttgtaa tgagtgttcg atatttgcta taggttttag 6061 ggttttgtaa atcttcatag taattataaa catttgtaaa atttgtaaaa tactataagt 6121 cattttgagt gttggtgtta agcatgaaac aaacagcagc tgttgtcctt aaaaatgaat 6181 tgacctggcc gggcgcggtg gctcacgcct gtaatcccag cactttggga ggccgaggcg 6241 ggtggatcat gaggtcagga gatggagacc atcctggcta acaaggtgaa accccgtctc 6301 tactaaaaat acaaaaaatt agccgggcgc ggtggcgggc gcctgtagtc ccagctactt 6361 gggaggctga ggcaggagaa tggcgtgaac ccgggaagcg gagcttgcag tgagccgaga 6421 ttgcgccact gcagtccgca gtccggcctg ggcgacagag cgagactccg tctc // LOCUS AB002332 5715 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0334 gene, complete cds. ACCESSION AB002332 NID g2224608 KEYWORDS KIAA0334. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG1015. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5715) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..5715 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG1015" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 252..2792 /gene="KIAA0334" CDS 252..2792 /gene="KIAA0334" /codon_start=1 /db_xref="PID:d1021630" /db_xref="PID:g2224609" /translation="MLFTVSCSKMSSIVDRDDSSIFDGLVEEDDKDKAKRVSRNKSEK KRRDQFNVLIKELGSMLPGNARKMDKSTVLQKSIDFLRKHKEITAQSDASEIRQDWKP TFLSNEEFTQLMLEALDGFFLAIMTDGSIIYVSESVTSLLEHLPSDLVDQSIFNFIPE GEHSEVYKILSTHLLESDSLTPEYLKSKNQLEFCCHMLRGTIDPKEPSTYEYVKFIGN FKSLNSVSSSAHNGFEGTIQRTHRPSYEDRVCFVATVRLATPQFIKEMCTVEEPNEEF TSRHSLEWKFLFLDHRAPPIIGYLPFEVLGTSGYDYYHVDDLENLAKCHEHLMQYGKG KSCYYRFLTKGQQWIWLQTHYYITYHQWNSRPEFIVCTHTVVSYAEVRAERRRELGIE ESLPETAADKSQDSGSDNRINTVSLKEALERFDHSPTPSASSRSSRKSSHTAVSDPSS TPTKIPTDTSTPPRQHLPAHEKMVQRRSSFSSQSINSQSVGSSLTQPVMSQATNLPIP QGMSQFQFSAQLGAMQHLKDQLEQRTRMIEANIHRQQEELRKIQEQLQMVHGQGLQMF LQQSNPGLNFGSVQLSSGNSSNIQQLAPINMQGQVVPTNQIQSGMNTGHIGTTQHMIQ QQTLQSTSTQSQQNVLSGHSQQTSLPSQTQSTLTAPLYNTMVISQPAAGSMVQIPSSM PQNSTQSAAVTTFTQDRQIRFSQGQQLVTKLVTAPVACGAVMVPSTMLMGQVVTAYPT FATQQQQSQTLSVTQQQQQQSSQEQQLTSVQQPSQAQLTQPPQQFLQTSRLLHGNPST QLILSAAFPLQQSTFPQSHHQQHQSQQQQQLSRHRTDSLPDPSKVQPQ" BASE COUNT 1808 a 1129 c 1094 g 1684 t ORIGIN 1 agctgattct atcacattgt aagatgcctt tggataattc tacagtcctc ttaaatgaat 61 ctttagaact tggcaagtct cactagatac cttcaatcat cattttgagc tcaaagaatt 121 ctgagactta tggttggtca tatagaagag gaccttgaac ctatagtttc ctgaagaatc 181 agtttaaaag atccaaggag tacaaaagga gaagtacaaa tgtctactac aagacgaaaa 241 cgtagtatgt tatgttgttt accgtaagct gtagtaaaat gagctcgatt gttgacagag 301 atgacagtag tatttttgat gggttggtgg aagaagatga caaggacaaa gcgaaaagag 361 tatctagaaa caaatctgaa aagaaacgta gagatcaatt taatgttctc attaaagaac 421 tgggatccat gcttcctggt aatgctagaa agatggacaa atctactgtt ctgcagaaaa 481 gcattgattt tttacgaaaa cataaagaaa tcactgcaca gtcagatgct agtgaaattc 541 gacaggactg gaaacctaca ttccttagta atgaagagtt tacacaatta atgttagagg 601 ctcttgatgg ttttttttta gcaatcatga cagatggaag cataatatat gtgtctgaga 661 gtgtaacttc attacttgaa catttaccat ctgatcttgt ggatcaaagt atatttaatt 721 ttatcccaga aggggaacat tcagaggttt ataaaatact ctctactcat ctgctggaaa 781 gtgattcatt aaccccagaa tatttaaaat caaaaaatca gttagaattc tgttgtcaca 841 tgctgcgagg aacaatagac ccaaaggagc catctaccta tgaatatgta aaatttatag 901 gaaatttcaa atctttaaac agtgtatcct cttcagcaca caatggtttt gaaggaacta 961 tacaacgcac acataggcca tcttatgaag atagagtttg ttttgtagct actgtcaggt 1021 tagctacacc tcagttcatc aaggaaatgt gcactgttga agaacccaat gaagagttta 1081 catctagaca tagtttagaa tggaagtttc tgtttctaga tcacagggca ccacccataa 1141 tagggtattt gccatttgaa gttctgggaa catcaggcta tgattactat catgtggatg 1201 acctagaaaa tttggcaaaa tgtcatgagc acttaatgca atatgggaaa ggcaaatcat 1261 gttattatag gttcctgact aaggggcaac agtggatttg gcttcagact cattattata 1321 tcacttacca tcagtggaat tcaaggccag agtttattgt ttgtactcac actgtagtaa 1381 gttatgcaga agttagggct gaaagacgac gagaacttgg cattgaagag tctcttcctg 1441 agacagctgc tgacaaaagc caagattctg ggtcagataa tcgtataaac acagtcagtc 1501 tcaaggaagc attggaaagg tttgatcaca gcccaacccc ttctgcctct tctcggagtt 1561 caagaaaatc atctcacacg gccgtctcag acccttcctc aacaccaacc aagatcccga 1621 cggatacgag cactccaccc aggcagcatt taccagctca tgagaagatg gtgcaaagaa 1681 ggtcatcatt tagtagtcag tccataaatt cccagtctgt tggttcatca ttaacacagc 1741 cagtgatgtc tcaagctaca aatttaccaa ttccacaagg catgtcccag tttcagtttt 1801 cagctcaatt aggagccatg caacatctga aagaccaatt ggaacaacgg acacgcatga 1861 tagaagcaaa tattcatcgg caacaagaag aactaagaaa aattcaagaa caacttcaga 1921 tggtccatgg tcaggggctg cagatgtttt tgcaacaatc aaatcctggg ttgaattttg 1981 gttccgttca actttcttct ggaaattcat ctaacatcca gcaacttgca cctataaata 2041 tgcaaggcca agttgttcct actaaccaga ttcaaagtgg aatgaatact ggacacattg 2101 gcacaactca gcacatgata caacaacaga ctttacagag tacatcaact cagagtcaac 2161 aaaatgtact gagtgggcac agtcagcaaa catctctacc cagtcagaca cagagcactc 2221 ttacagcccc actgtataac actatggtga tttctcagcc tgcagccgga agcatggtcc 2281 agattccatc tagtatgcca caaaacagca cccagagtgc tgcagtaact acattcactc 2341 aggacaggca gataagattt tctcaaggtc aacaacttgt gaccaaatta gtgactgctc 2401 ctgtagcttg tggggcagtc atggtaccta gtactatgct tatgggccag gtggtgactg 2461 catatcctac ttttgctaca caacagcaac agtcacagac attgtcagta acgcagcagc 2521 agcagcagca gagctcccag gagcagcagc tcacttcagt tcagcaacca tctcaggctc 2581 agctgaccca gccaccgcaa caatttttac agacttctag gttgctccat gggaatccct 2641 caactcaact cattctctct gctgcatttc ctctacaaca gagcaccttc cctcagtcac 2701 atcaccagca acatcagtct cagcaacagc agcaactcag ccggcacagg actgacagct 2761 tgcccgaccc ttccaaggtt caaccacagt agcacacgtg cttcctctct tgacatcaag 2821 ggaggaaggg gatggcccat taagagttac tcagatgacc tgaggaaagg agggaaagtt 2881 ccagcagttt catgagatgc agtattgagt gttctagttc ctggaattag ttggcagaga 2941 aaatgctgcc tagtgctaca gatgtacatt aaataccagc cagcaggagg tgatcatagg 3001 ggcacagcca gttctgacag tgttttaggt gcctggatat tttttgatgg aaaaagaata 3061 tattgccaaa tattaagaag ctcagctatg aaatgacctc cagggaatca gaaaggcact 3121 aatgatgtta gtaactttta gtggttctgt gcctcttatc aagtgttaca gaggacatac 3181 cactgccatg tcaggggttt gcttacagtg atgccatgaa gacagtccag tagacttggt 3241 agcgaccccc tcccccaacc cctctccctt ttcagataat gatggaacag taattacttt 3301 cagaatgttg tgtgggttca aattctctat gtacagatga tgtaaaaata tgtatatgtc 3361 tagataaaag gagagaaagc aaaacatttt gtatgctgca tgaaagcgtt atctcttcct 3421 tacaggtgtg agcacctttc ctgaaattct gacaccatgt gcaaactgat ccatcctgtt 3481 tttccttttg tttacaacac agtagtgttc tgttcacttt tccggggcac aagttttttt 3541 gttcatactt tggctgtgat gtcacagttt gttcagtgag gtatgatgtg ctgctgggaa 3601 tggatttttt tttttcaggt taaattattg atacaacagg attttcaagt tattcagaaa 3661 tatccctcat ttcattattt ttcaattatg tttgaaaata ggatttgcac tgctttattt 3721 taggtggctg ggagttttga ttgcatattt tgttatagtt catagttgga aatatttgcg 3781 taaatggttt tcaacaagcc tgaaagtaat ttcaagaatg tttcagttat agaggtaaaa 3841 tttgcacaca aaacatctta ggcacttttt aacattctca atcatgggaa ttttaacttt 3901 tgggatttgt tgaaatcttt tttattatcc ttcacaattt caatgcttct tttagtcaga 3961 aatgattcag ggttatttga ggggaaaaaa ccccatagtg ccttgatttt aattcaggtg 4021 ataactcacc atcttgaagt cattgtccgg tttccgtagc agttttgaaa ccttagtacc 4081 tttttaacag catgtgggtg tcagtgtcat tattagtctc ctaataagtt cctctgaaga 4141 ctgctatcag tctcttggac tggagttaca aataatttag aaataaaaga tgataaccta 4201 acactatcat agttattaat gtgatcctaa aattgtttcc taaatcagca tttttcttta 4261 gtcatttaag aatttaccag aaatatttgc tcaatatgat cttgatattc ctacaaagaa 4321 aaaagaaggg gtagggattt ggctatgcct tcactacaac attagaatat tgtaactcac 4381 atgccttcta aacgtgaact aagatttcct ttggcaatat catattctaa aagtaataaa 4441 ttccaataca agttacatac atttaaaaaa cattttacag attttatggt actaatgaaa 4501 tttacagtga tagaacaaaa gaggattagt agaaaataca ttattagaat ataaaaaatg 4561 ttattactga ggaaagggag gagaggacaa gtgtaataaa tcaaaattga cctcaaaaga 4621 aaatgtgtaa cagagttgag gttgttaaaa cagaaaaggt tctgaataat gaagattaac 4681 ctaatgcaga attgctaggt aaagaggtca ggggaatgct aagccagttc ttaagacttc 4741 tctgtcctct gctttgctgt tatccttaag gcatatactt tgtctttctg cagaaaattc 4801 tacctggcta caattacttt gaacattaat gttgaaaaag aaaacaacca aagaaaattg 4861 gtacttaccc ttctacaaaa gaagtgtgac tagatatcaa tcagtaatta acatatcaag 4921 gagctcttct agctaaatga ccatccagta gagatttccc acattcccat gaatatcaag 4981 aatagttgtc agaatatgta tgtacctgag catatgtaca cagacaaggg ggatgttgtg 5041 gaatatggca atagcattgt tcttctcccc tttcaaattg cctttcttga ccttatgcca 5101 ttccatatat atctgagttg tgcctcattt atttattggc aatacctagt gatacggatt 5161 tagctaacaa aagatatgaa gaactattat attgaggcct gtcctctaca taccacactt 5221 aaaagatggt gaactgtgag tactacttag gttgacagca acaaagcata agacaagccc 5281 caggtaaacg tctaaactgt ttactcacat tgtcctactc cagccccttc aattatttcc 5341 catctccaca aatagtcggg ggaaaaaatt aaaattttcc tttatgattc ttactgttct 5401 tcgcagctca tcttttcctg cttagaatta accattgcta atttaaagga gcagctagct 5461 gcttttctgt cagtctgaag cgtagtagtg gaagaggtag taagcaccag ctgcctcttt 5521 gctgctttgt tttcctcctg attctcttaa atttgggttg caaagctatc ccgcccccca 5581 ccctgcccca tgaaacttga gcattcaaat gaagattcag cagtgtctgt tcttcatttc 5641 tatagccaaa gctgttagtt aaaatcccaa atctatagca tttaaagata ccaaatagaa 5701 acaccttcca gcttt // LOCUS AB002333 6639 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0335 gene, complete cds. ACCESSION AB002333 NID g2224610 KEYWORDS KIAA0335. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG1070. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6639) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..6639 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG1070" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 227..4018 /gene="KIAA0335" CDS 227..4018 /gene="KIAA0335" /codon_start=1 /db_xref="PID:d1021631" /db_xref="PID:g2224611" /translation="MPSEQKQLFCDEKQTTLKKDYDVKNEIVDRSAPKPKISGSIHYA LKNVKIDLPKINIPNEVLLKHEVDKYRKLFQSKQQTARKSISIKTVSCVEECTLLHKS ERAEEEGVKMSAKILNFSCLKCRDNTRYSPNDLQKHFQMWHHGELPSYPCEMCNFSAN DFQVFKQHRRTHRSTLVKCDICNNESVYTLLNLTKHFTSTHCVNGNFQCEKCKFSTQD VGTFVQHIHRHNEIHYKCGKCHHVCFTKGELQKHLHIHSGTFPFTCQYCSYGATRREH LVRHVITLHKEHLYAKEKLEKDKYEKRMAKTSAGLKLILKRYKIGASRKTFWKRKKIN SGSDRSIEKNTQVLKKMNKTQTKSEDQSHVVQEHLSEEKDERLHCENNDKAPESESEK PTPLSTGQGNRAEEGPNASSGFMKTAVLGPTLKNVMMKNNKLAVSPNYNATFMGFKMM DGKQHIVLKLVPIKQNVCSPGSQSGAAKDGTANLQPQTLDTNGFLTGVTTELNDTVYM KAATPFSCSSSILSGKASSEKEMTLISQRNNMLQTMDYEKSVSSLSATSELVTASVNL TTKFETRDNVDFWGNHLTQSHPEVLGTTIKSPDKVNCVAKPNAYNSGDMHNYCINYGN CELPVESSNQGSLPFHNYSKVNNSNKRRRFSGTAVYENPQRESSSSKTVVQQPISESF LSLVRQESSKPDSLLASISLLNDKDGTLKAKSEIEEQYVLEKGQNIDGQNLYSNENQN LECATEKSKWEDFSNVDSPMMPRITSVFSLQSQQASEFLPPEVNQLLQDVLKIKPDVK QDSSNTPNKGLPLHCDQSFQKHEREGKIVESSKDFKVQGIFPVPPGSVGINVPTNDLN LKFGKEKQVSSIPQDVRDSEKMPRISGFGTLLKTQSDAIITQQLVKDKLRATTQNLGS FYMQSPLLNSEQKKTIIVQTSKGFLIPLNITNKPGLPVIPGNALPLVNSQGIPASLFV NKKPGMVLTLNNGKLEGVSAVKTEGAPARGTVTKEPCKTPILKVEPNNNCLTPGLCSS IGSCLSMKSSSENTLPLKGPYILKPTSSVKAVLIPNMLSEQQSTKLNISDSVKQQNEI FPKPPLYTFLPDGKQAVFLKCVMPNKTELLKPKLVQNSTYQNIQPKKPEGTPQRILLK IFNPVLNVTAANNLSVSNSASSLQKDNVPSNQIIGGEQKEPESRDALPFLLDDLMPAN EIVITSTATCPESSEEPICVSDCSESRVLRCKTNCRIERNFNRKKTSKKNFFKNKNSW K" BASE COUNT 2320 a 1125 c 1213 g 1981 t ORIGIN 1 gaaggaactg agacacagag gttgagactt gtctaaagta acatgaccaa tgaggaaact 61 gaggtccaga gaagttgaag ttttcccaag gtcacacagc agagctggaa ctagagcttg 121 catttcctga cttccagatt agtgttattt ttacttcaaa ctattacttc ttgataactt 181 caagagtttt cagaaaaaaa gaaattggga cttttttggt taaatcatgc catctgaaca 241 gaaacagtta ttttgtgatg aaaaacaaac tactttaaaa aaagattatg atgtgaaaaa 301 tgagatagtt gataggtcgg cacctaaacc aaaaatttca ggaagtattc attatgcact 361 aaaaaatgtg aaaattgatt tgccaaaaat aaatattcca aatgaagtcc tattgaaaca 421 tgaagttgac aaatacagaa aattatttca gagtaaacag cagactgcaa gaaaatctat 481 cagtataaag actgtaagct gtgtagagga gtgtacattg cttcataagt ctgagagagc 541 tgaagaagag ggtgtaaaaa tgtctgcaaa aatactcaat ttcagctgtt taaaatgccg 601 agacaacact cgatatagcc caaatgattt gcagaaacac tttcaaatgt ggcaccatgg 661 cgaattacct tcatatcctt gtgaaatgtg caacttttca gcaaatgact ttcaggtatt 721 taaacaacac agacgaaccc atagaagcac tttagtaaaa tgtgacattt gtaacaatga 781 gagtgtatat actttactga acttgacaaa gcatttcaca tccacacatt gtgttaatgg 841 taattttcaa tgtgaaaagt gtaagttctc cacccaggat gttggcacat ttgttcagca 901 cattcataga cataatgaaa ttcattataa gtgtggtaaa tgtcatcatg tatgttttac 961 caaaggagag cttcagaagc accttcatat tcattctggt acttttccct tcacttgtca 1021 atattgtagc tatggtgcca ccaggagaga acaccttgta agacatgtta taactttgca 1081 caaagaacat ttatatgcaa aagaaaaact ggaaaaagac aaatatgaaa aaagaatggc 1141 aaagacttct gcaggactta agctaatact gaaaagatat aaaataggtg catcaaggaa 1201 gacgttctgg aaacgtaaga aaattaacag tggaagtgac agaagtatag aaaagaacac 1261 tcaagtgctt aagaaaatga acaaaacaca gactaaatct gaagaccaga gccatgttgt 1321 tcaagagcat ttaagtgaag aaaaggatga aagactacac tgtgagaata atgataaagc 1381 ccctgaatca gagtcagaga agccaactcc tctgtccact gggcaaggta atagagctga 1441 agagggacca aacgctagtt caggtttcat gaagactgct gtactaggac ctacactgaa 1501 aaatgtaatg atgaaaaata ataaactagc agtttcccct aactataatg ctacgtttat 1561 gggcttcaag atgatggatg gaaaacagca tattgtatta aaattggtgc ctatcaaaca 1621 aaatgtatgt tcaccaggct cacagtcagg tgctgcaaag gacggtactg ctaatttgca 1681 gccccagact ttggacacta atggattttt aacaggagta acaactgagt taaatgacac 1741 agtttatatg aaagcagcta ctccattttc atgttcatct tctatacttt cagggaaagc 1801 aagttcagaa aaagaaatga ctttgatatc tcaaaggaat aatatgcttc aaacaatgga 1861 ttatgagaaa agtgtatctt ctttgtcagc aacatcagaa ttggttacag catcagtgaa 1921 tttgaccaca aaatttgaaa caagagataa tgttgacttc tggggaaatc atctcactca 1981 gagtcacccc gaggtattag gtaccaccat taaaagtcca gataaagtca actgtgttgc 2041 caaaccaaat gcatacaaca gtggagatat gcataattat tgcattaatt atggcaactg 2101 tgagttacct gttgaatcct ccaaccaagg atcattacct tttcataatt actcaaaagt 2161 gaataattct aataaacgtc gtaggttttc aggaacagca gtgtatgaaa accctcaaag 2221 agaatcttca tccagcaaaa cagttgtcca acaaccaatt agtgaatcat ttttatcact 2281 agtgaggcag gagagctcaa aaccagatag cctattagca tctattagcc ttttaaatga 2341 taaagatgga actttaaaag caaaatctga aattgaagaa cagtatgttt tagaaaaagg 2401 acaaaacatt gatggacaaa acctgtacag taatgaaaat caaaatttag agtgtgcgac 2461 tgaaaaatct aaatgggaag acttttctaa tgtcgattca cctatgatgc ctagaatcac 2521 atctgttttc tctctccaga gccaacaggc atcagaattt ctgccacctg aagtaaacca 2581 attgcttcag gatgtattga aaataaaacc tgatgtaaaa caagactcta gtaacactcc 2641 aaataaaggc ttgccacttc attgtgacca gtcatttcaa aaacacgaga gagaaggcaa 2701 aattgttgaa tcttcgaaag atttcaaagt gcaaggcatc ttcccagttc cacctggcag 2761 tgtgggtatt aatgtgccta caaatgattt gaatttgaaa tttggaaaag aaaaacaagt 2821 gtcatcaata ccacaagatg tgagagattc agagaagatg cctagaattt caggttttgg 2881 cacattactt aagactcagt cagatgcgat aataacacag cagcttgtaa aagacaaact 2941 acgagccacc acacaaaatt taggttcttt ttatatgcag agtccacttt taaattcaga 3001 acaaaaaaaa actataattg ttcagacttc aaaaggattc ttaataccat tgaacattac 3061 taacaagcct gggctaccag ttattcctgg aaatgcactt ccattggtta attcacaagg 3121 tatccctgct tctctttttg taaacaagaa acctgggatg gttttaacac ttaataatgg 3181 gaaacttgaa ggtgtttccg ctgtcaaaac cgagggtgcc ccagctcgtg gaactgtgac 3241 taaggagcct tgcaaaacac ctattttgaa ggtagaacca aacaataatt gtcttacacc 3301 tggactttgt tccagcattg gcagttgttt gagcatgaaa agtagctcag aaaatacttt 3361 gccattaaaa ggcccttaca ttttgaaacc aacgagttct gtgaaagctg ttcttattcc 3421 taacatgcta tctgagcaac agagcactaa gttgaatatc tccgattcag taaaacagca 3481 gaatgagatt tttccaaaac cacctcttta taccttcttg cctgatggca aacaagctgt 3541 ttttttaaag tgtgtgatgc caaataaaac tgagctgctt aagcccaaat tagtccaaaa 3601 tagtacttat caaaatatac agccaaagaa acctgaagga acaccacaaa gaatattgct 3661 gaaaattttt aaccctgttt taaatgtgac tgctgctaat aatctgtcag taagcaactc 3721 tgcatcctca ttgcaaaaag acaacgtacc atctaatcag attataggag gagagcagaa 3781 agagccagaa tctagagatg ccttaccctt cttactagat gacttaatgc cagcaaatga 3841 aattgtgata acttctactg caacatgccc agaatcttct gaggaaccaa tatgtgtcag 3901 tgactgttca gagtccaggg tattaaggtg taaaacaaat tgtagaattg agaggaactt 3961 caatagaaaa aagacttcca aaaaaaattt tttcaaaaac aaaaactcat ggaagtaaag 4021 actctgaaac tgcctttgta tctagaaaca gaaactgtaa acgaaagtgt agggatagtt 4081 accaagaacc tccaagaaga aaagcaacat tgcatagaaa gtgtaaagaa aaggcaaaac 4141 ctgaagatgt ccgtgaaaca tttggattta gcagacctag gctttcaaaa gattccatca 4201 gaactttgcg gcttttccct tttagttcta aacagcttgt gaaatgtcct aggagaaacc 4261 aaccagttgt agttttgaat catcctgacg cagatgcacc agaagtagta agtgtaatga 4321 aaactattgc taaatttaat ggacatgtac ttaaggtttc attgtcaaaa agaactataa 4381 atgctttact gaaaccagtt tgttataacc ctcctaaaac aacttacgat gatttttcca 4441 agaggcacaa aacatttaaa cctgttagtt ctgtgaaaga aagatttgtg ctaaaattaa 4501 cactcaaaaa gacaagcaaa aacaattacc agattgtgaa gactacctct gaaaatattt 4561 tgaaggctaa atttaactgt tggttttgtg gtagagtatt tgacaatcag gatacttggg 4621 ctggtcatgg gcagagacat ttaatggaag ctactcggga ttggaacatg ttagaatagt 4681 ttaccataat taccaaggaa aagaaaagta aaattacctt agaagaaaac aacgggttca 4741 gttaccataa tgcagacatt ttctacttca gtatagtacc tgaaatcgaa cattttaaaa 4801 gttgattgta tttctgtgga agagtaaaag ttgtatgtat gatatttgaa aatgctatcc 4861 ctgcactcaa aagttttgaa agaaactaat cacagcacag aagtaccttg atttaatttt 4921 ttaaacgtgt tctcgggaag ttagggttaa agaaaatttt gataaaacaa ttttcccact 4981 ttagttcttt agtgactctt agtagagggt aatggctgac acccccattc cctctcttct 5041 tccaccagcc ttatgcatct aagtgtagct cctctggaga tggagagctc tttcatcgta 5101 gaactgaagg agtttctcac atttgtgaag atactttgag agaacctggg taaagaagtc 5161 actctatatc tgctaaatat agttattttg aaggatatta ggctccttga aagtacattt 5221 aactcactac agttacacct catacaatgc ttgaagagtt tttggcaagt aaattttttt 5281 tccagttgaa ctctgtcact tgttagagat tgggaacact ctcccaaggt gattttcttt 5341 ttctaagtga ctgtgtcaaa tgttgcagta gatgggcagc ttcaagagaa gaatttaaac 5401 tctttgtctc attgaaagtt tggttaaaga acttgaacat atttctaaat ttgattatat 5461 tctctgatga tccctttgca cagaactatg cttatctcat gtttgtcctc cataaacaaa 5521 tggccatttt tttcttcttg gttcccatcc cttttaaaaa gagtggaaca taggagcact 5581 tccaagggag tggcttttct tgaaaattaa aattgtttac caaaacagta ttttgaagca 5641 agatcatatt tttgtctgta ttcactttat tatttgaaca tgtccaaatt aggacaaagc 5701 atttctgtta gctgcttctc agtgtgactg acaacccaaa acatatatac agattgttgg 5761 catttgcaaa aggaagcatc aatagtggac ccagaggcag tgcataaaac cttaggatag 5821 attcctaagg gacatgccca acagagtttt aaaatggatg ttttcatgat gacaacagaa 5881 tcatagattt agatttttca ccttgtaagt ttggatcaaa ttttgctggt tctttggata 5941 atggagttct tgtgttaaca aattacgtgc tatatgattt tttgttacat tgttacaagt 6001 gttaatgaca ttttcattaa tggatgaaaa cttcagggct tcttttctgt atataacaaa 6061 ttaagtctgt acatattttt gtacctttta tgtaaatttt gcacagaaat ttttggcata 6121 agtttatttt ctttcagttt agttcagtgc atgcactaat aaaactggtt gtttaattta 6181 aaaggaatat ataagacttt acccatgtta ttttcttggt ttttatttca gatgtagatt 6241 ttgtttttga ggtaaattcg ttcttcaggg attgaacact attgttagca agtgcctaaa 6301 aaagatgtga aacagtttac atatgtcaac tgtaacagta ggtcccaaat gggcccattc 6361 ccctaatagt tttattttta aagaaagcca tacatagaat gcttcaagct atcttgctat 6421 gcacattata cttgtactgt tttgtgcagt ttgtctactt tcttagtgaa gtattttttg 6481 tataaaatgt tacaattgtg tttcttaaat tgagcctaag aatggagtta attggaaata 6541 tacagtatat attaataatg tacatggtgt ttaaagaatg gtaagcattg ttaatttctg 6601 taatgaacat tttcaattaa atttatcttg tttgtgttt // LOCUS AB002334 6773 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0336 gene, complete cds. ACCESSION AB002334 NID g2224612 KEYWORDS KIAA0336. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG1120. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6773) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..6773 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG1120" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 254..5005 /gene="KIAA0336" CDS 254..5005 /gene="KIAA0336" /codon_start=1 /db_xref="PID:d1021632" /db_xref="PID:g2224613" /translation="MKQEVEDSVTKMGDAHKELEQSHINYVKEIENLKNELMAVRSKY SEDKANLQKQLEEAMNTQLELSEQLKFQNNSEDNVKKLQEEIEKIRPGFEEQILYLQK QLDATTDEKKETVTQLQNIIEANSQHYQKNINSLQEELLQLKAIHQEEVKELMCQIEA SAKEHEAEINKLNELKENLVKQCEASEKNIQKKYECELENLRKATSNANQDNQICSIL LQENTFVEQVVNEKVKHLEDTLKELESQHSILKDEVTYMNNLKLKLEMDAQHIKDEFF HEREDLEFKINELLLAKEEQGCVIEKLKSELAGLNKQFCYTVEQHNREVQSLKEQHQK EISELNETFLSDSEKEKLTLMFEIQGLKEQCENLQQEKQEAILNYESLREIMEILQTE LGESAGKISQEFESMKQQQASDVHELQQKLRTAFTEKDALLETVNRLQGENEKLLSQQ ELVPELENTIKNLQEKNGVYLLSLSQRDTMLKELEGKINSLTEEKDDFINKLKNSHEE MDNFHKKCEREERLILELGKKVEQTIQYNSELEQKVNELTGGLEETLKEKDQNDQKLE KLMVQMKVLSEDKEVLSAEVKSLYEENNKLSSEKKQLSRDLEVFLSQKEDVILKEHIT QLEKKLQLMVEEQDNLNKLLENEQVQKLFVKTQLYGFLKEMGSEVSEDSEEKDVVNVL QAVGESLAKINEEKCNLAFQRDEKVLELEKEIKCLQEESVVQCEELKSLLRDYEQEKV LLRKELEEIQSEKEALQSDLLEMKNANEKTRLENQNLLIQVEEVSQTCSKSEIHNEKE KCFIKEHENLKPLLEQKELRDRRAELILLKDSLAKSPSVKNDPLSSVKELEEKIENLE KECKEKEEKINKIKLVAVKAKKELDSSRKETQTVKEELESLRSEKDQLSASMRDLIQG AESYKNLLLEYEKQSEQLDVEKERANNFEHRIEDLTRQLRNSTLQCETINSDNEDLLA RIETLQSNAKLLEVQILEVQRAKAMVDKELEAEKLQKEQKIKEHATTVNELEELQVQL QKEKKQLQKTMQELELVKKDAQQTTLMNMEIADYERLMKELNQKLTNKNNKIEDLEQE IKIQKQKQETLQEEITSLQSSVQQYEEKNTKIKQLLVKTKKELADSKQAETDHLILQA SLKGELEASQQQVEVYKIQLAEITSEKHKIHEHLKTSAEQHQRTLSAYQQRVTALQEE CRAAKAEQATVTSEFESYKVRVHNVLKQQKNKSMSQAETEGAKQEREHLEMLIDQLKI KLQDSQNNLQINVSELQTLQSEHDTLLERHNKMLQETVSKEAELREKLCSIQSENMMM KSEHTQTVSQLTSQNEVLRNSFRDQVRHLQEEHRKTVETLQQQLSKMEAQLFQLKNEP TTRSPVSSQQSLKNLRERRNTDLPLLDMHTVTREEGEGMETTDTESVSSASTYTQSLE QLLNSPETKLEPPLWHAEFTKEELVQKLSSTTKSADHLNGLLRETEATNAILMEQIKL LKSEIRRLERNQEREKSAANLEYLKNVLLQFIFLKPGSERERLLPVINTMLQLSPEEK GKLAAVAQGEEENASRSSGWASYLHSWSGLR" BASE COUNT 2566 a 1092 c 1363 g 1752 t ORIGIN 1 gcggctggtt gcgggccggc ggcgggctgg cggagatgga ggatcttgtt caagatgggg 61 tggcttcacc agctacccct gggaccggga aatctaagaa ttggagaaag aaattgaaga 121 actcagatca aaacctgtta ctgaaggaac tggtgatatt attaaggcat taactgaacg 181 tctggatgct cttcttctgg aaaaagcaga gactgagcaa cagtgtcttt ctctgaaaaa 241 ggaaaatata aaaatgaagc aagaggttga ggattctgta acaaagatgg gagatgcaca 301 taaggagttg gaacaatcac atataaacta tgtgaaagaa attgaaaatt tgaaaaatga 361 gttgatggca gtacgttcca aatacagtga agacaaagct aacttacaaa agcagctgga 421 agaagcaatg aatacgcaat tagaactttc agaacaactt aaatttcaga acaactctga 481 agataatgtt aaaaaactac aagaagagat tgagaaaatt aggccaggct ttgaggagca 541 aattttatat ctgcaaaagc aattagacgc taccactgat gaaaagaagg aaacagttac 601 tcaactccaa aatatcattg aggctaattc tcagcattac caaaaaaata ttaatagttt 661 gcaggaagag cttttacagt tgaaagctat acaccaagaa gaggtgaaag agttgatgtg 721 ccagattgaa gcatcagcta aggaacatga agcagagata aataagttga acgagctaaa 781 agagaactta gtaaaacaat gtgaggcaag tgaaaagaac atccagaaga aatatgaatg 841 tgagttagaa aatttaagga aagccacctc aaatgcaaac caagacaatc agatatgttc 901 tattctcttg caagaaaata catttgtaga acaagtagta aatgaaaaag tcaaacactt 961 agaagatacc ttaaaagaac ttgaatctca acacagtatc ttaaaagatg aggtaactta 1021 tatgaataat cttaagttaa aacttgaaat ggatgctcaa catataaagg atgagttttt 1081 tcatgaacgg gaagacttag agtttaaaat taatgaatta ttactagcta aagaagaaca 1141 gggctgtgta attgaaaaat taaaatctga gctagcaggt ttaaataaac agttttgcta 1201 tactgtagaa cagcataaca gagaagtaca gagtcttaag gaacaacatc aaaaagaaat 1261 atcagaacta aatgagacat ttttgtcaga ttcagaaaaa gaaaaattaa cattaatgtt 1321 tgaaatacag ggtcttaagg aacagtgtga aaacctacag caagaaaagc aagaagcaat 1381 tttaaattat gagagtttac gagagattat ggaaatttta caaacagaac tgggggaatc 1441 tgctggaaaa ataagtcaag agttcgaatc aatgaagcaa cagcaagcat ctgatgttca 1501 tgaactgcag cagaagctca gaactgcttt tactgaaaaa gatgcccttc tcgaaactgt 1561 gaatcgcctc cagggagaaa atgaaaagtt actatctcaa caagaattgg taccagaact 1621 tgaaaatacc ataaagaacc ttcaagaaaa gaatggagta tacttactta gtctcagtca 1681 aagagatacc atgttaaaag aattagaagg aaagataaat tctcttactg aggaaaaaga 1741 tgattttata aataaactga aaaattccca tgaagaaatg gataatttcc ataagaaatg 1801 tgaaagggaa gaaagattga ttcttgaact tgggaagaaa gtagagcaaa caatccagta 1861 caacagtgaa ctagaacaaa aggtaaatga attaacagga ggactagagg agactttaaa 1921 agaaaaggat caaaatgacc aaaaactaga aaaacttatg gttcaaatga aagttctctc 1981 tgaagacaaa gaagtattgt cagctgaagt gaagtctctt tatgaggaaa acaataaact 2041 cagttcagaa aaaaaacagt tgagtaggga tttggaggtt tttttgtctc aaaaagaaga 2101 tgttatcctt aaagaacata ttactcaatt agaaaagaaa cttcagttaa tggttgaaga 2161 gcaagataat ttaaataaac tgcttgaaaa tgagcaagtt cagaagttat ttgttaaaac 2221 tcagttgtat ggttttctta aagaaatggg atcagaagtt tcagaagaca gtgaagagaa 2281 agatgttgtt aatgtcctac aggcagtcgg tgaatccttg gcaaaaataa atgaggaaaa 2341 atgcaacctg gcttttcagc gtgatgaaaa agtattagag ttagaaaaag agattaagtg 2401 ccttcaagaa gagagtgtag ttcagtgtga agaacttaag tctttattga gagactatga 2461 gcaagagaaa gttctcttaa ggaaagagtt agaagaaata cagtcagaaa aagaggccct 2521 gcagtctgat cttctagaaa tgaagaatgc taatgaaaaa acaaggcttg aaaatcagaa 2581 tcttttaatt caagttgaag aagtatctca aacatgtagc aaaagtgaaa tccataatga 2641 aaaagaaaaa tgttttataa aggaacatga aaacctaaag ccactactag aacaaaaaga 2701 attacgagat aggagagcag agttgatact attaaaggat tccttagcaa aatcaccttc 2761 tgtaaaaaat gatcctctgt cttcagtaaa agagttggaa gaaaaaatag aaaatctgga 2821 aaaagaatgc aaagaaaagg aggagaaaat aaataagata aaattagttg ccgtaaaggc 2881 aaagaaagaa ctagattcca gcagaaaaga gacccagact gtgaaggaag aacttgaatc 2941 tcttcgatca gaaaaggacc agttatctgc ttccatgaga gatctcattc aaggagcaga 3001 aagctataag aatcttttat tagaatatga aaagcagtca gagcaactgg atgtggaaaa 3061 agaacgtgct aataattttg agcatcgtat tgaagacctt acaagacaat taagaaattc 3121 gactttgcag tgtgaaacaa taaattctga taatgaagat ctcctggctc gtattgagac 3181 attacagtct aatgccaaat tattagaagt acagatttta gaagtccaga gagccaaagc 3241 aatggtagac aaagaattag aagctgaaaa acttcagaaa gaacagaaga taaaggaaca 3301 tgccactact gtaaatgaac ttgaagaact tcaggtacaa cttcaaaagg aaaagaaaca 3361 gcttcagaaa accatgcaag aattagagct ggttaaaaag gatgcccaac aaaccacatt 3421 gatgaatatg gaaatagctg attatgaacg tttgatgaaa gaactaaatc aaaagttaac 3481 taataaaaac aacaagatag aagatttgga gcaagaaata aaaattcaaa aacagaaaca 3541 agaaacccta caagaagaaa taacttcatt acagtcttca gtacaacaat atgaagaaaa 3601 aaacaccaaa atcaagcaat tgcttgtgaa aaccaaaaag gaactggcag attcaaagca 3661 agcagaaact gatcacttaa tacttcaagc atctttaaaa ggtgagctgg aggcaagcca 3721 gcagcaagta gaagtctata aaatacagct ggctgaaata acatcagaga agcacaaaat 3781 ccacgagcac ctgaaaacct ctgcggaaca gcaccagcgt acgctaagtg cataccagca 3841 gagagtgaca gcactacagg aagagtgccg tgctgccaag gcagaacaag ctactgtaac 3901 ctctgaattc gagagctaca aagtccgagt tcataatgtt ctaaaacaac agaaaaataa 3961 atctatgtct caggctgaaa ctgagggcgc taaacaagaa agggaacatc tggaaatgct 4021 gattgaccag ctaaaaatca aattacaaga tagccaaaat aacttacaga ttaatgtatc 4081 tgaacttcaa acattgcagt ctgaacatga tacactgcta gaaaggcaca acaagatgct 4141 gcaggaaact gtgtccaaag aggcggaact ccgggaaaaa ttgtgttcaa tacagtcaga 4201 gaacatgatg atgaaatctg aacatacaca gactgtgagt cagctaacat cccagaacga 4261 ggtccttcga aatagcttcc gagatcaagt gcgacatttg caggaagaac acagaaagac 4321 agtggagaca ttacagcagc agctctccaa gatggaagca cagctcttcc agcttaagaa 4381 tgaaccgacc acaagaagcc cagtttcctc tcaacaatct ttgaagaacc ttcgagaaag 4441 gagaaacaca gacctcccgc ttctagacat gcacactgta acccgggaag agggagaagg 4501 catggagaca actgatacgg agtctgtgtc ttccgccagc acatacacac agtctttaga 4561 gcagctgctt aactctcccg aaactaaact tgagcctcca ttatggcatg ctgaatttac 4621 caaagaagaa ttggttcaga agctcagttc caccacaaaa agtgcagatc acttaaacgg 4681 cctgcttcgg gaaacagaag caaccaatgc aattcttatg gagcaaatta agcttctcaa 4741 aagtgaaata agaagattgg aaaggaatca agagcgagag aagtctgcag ctaacctgga 4801 atacttgaag aacgtcttgc tgcagttcat tttcttgaaa ccaggtagtg aaagagagag 4861 acttcttcct gttataaata cgatgttgca gctcagccct gaagaaaagg gaaaacttgc 4921 tgcggttgct caaggtgagg aagaaaatgc ttcccgttct tctggatggg catcctatct 4981 tcatagttgg tctggacttc gataggttga tggaaggaat atttttatta accaaataga 5041 atctatttac aaaaatggtt cacgtatatt accacaattc ttttgtcaaa aagtgtgtat 5101 atatgtttgc atctacatat atttgtacat ctatatgaca gatgtatttt aaaagtttca 5161 tcttgaagta aaagtacaac agcttgaagt gttgatagca ggccacagcc ctctaactca 5221 tgtgatttcc catgcatgct gccagaataa aaccaccagg aatgaattca ctccccactt 5281 ctctggaacc tcaggacccg cccatttctc ggcagtactg tgaattttga agttaaacta 5341 aattttggta ccataccaac tggaatttag gctttaaaaa taatgtttca aggccaggtg 5401 tggtgattca tgcctgaaat cccactactt tgggaggctg aggctggaga attgcttgag 5461 gctagtgagc tgtgactccc actgcactcc agctcgggga acagagcgag accttgtctc 5521 taaaaataat agtaataaaa taaaaataac gttttatgac tatttattgc aaggtcagag 5581 ttacagattg ttataaattg ttgagaaatt tttgtgatta gaatatgaag gaaaaagctt 5641 tgttggtaaa agtgacatgt taaggggcta tgaagtaaat atgctgcagt taattgtgct 5701 aagttaaaat acagtttagt tatttgcttt aaaataaact cttctttttt tctttaaagt 5761 atactatctc aaaactcatt atgttgtcag agccctagag ctggctagtg taacactgac 5821 tatgagtagg tgggcccacc acttgagttg aggtgatttc atggtgtctt tccaggctct 5881 tgatagggtg tcactgcatg caagccatga atctgttttg agaatcctct ccattttccc 5941 aaataaaaac ctatcacaac agtgactata tcactcagca ttggatctaa atataaaagt 6001 ggtgctttca gtgtttttgg cagatagtgt tccataagct ttccatcaga agggatttta 6061 gacaccttag aggtccgtgc tacatcgtca cagttcctcc gaataacctt aggtggtagt 6121 gttacttgcc tttgacacct ctgcatatgt tttaatgact agatccaaac tgtgttgttc 6181 ttaaatcaaa aattggataa tttgtaatat ttatgtgtta atcacacagt atgctctctg 6241 aagttctctt aagccttcag tttatactct taatttaatt ttctttctga gctggagaac 6301 tggctttgca ctttggttac acagaacatt ggtttccaat tcagtttaac tgaaatttgc 6361 tgctgatatg ttgagtttgt tctttaaaaa atagctcata tatctcatct ttcctcctgt 6421 cttagaagaa cagacctaac tagtgaatgt attaatgaaa atgcatctat ttcagagctg 6481 acatgaagag tttagttttt ttactttata aactgtgaat atgagtatgc cagctgcata 6541 cgatgtaact aatcatattt aaatatattt cactttctct ttgactttag accttttgaa 6601 gtctgtataa acttgttttg aaatatagtc tctgcttacg aatgtcataa caaaataatt 6661 ttttgcatga taaaaaatta ctttgattac aaaaggcgta ttctttcatg gtttctgcaa 6721 tgagaggaag tgtaatgatt attttaatat ttctattaaa tatgtttaac tgt // LOCUS AB002335 6289 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0337 gene, complete cds. ACCESSION AB002335 NID g2224614 KEYWORDS KIAA0337. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG1226. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6289) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..6289 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG1226" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 455..4987 /gene="KIAA0337" CDS 455..4987 /gene="KIAA0337" /codon_start=1 /db_xref="PID:d1021633" /db_xref="PID:g2224615" /translation="MARRLPRTSALKSSSSELLLTGPGAEEDPLPLIVQDQYVQEARQ VFEKIQRMGAQQDDGSDAPPGSPDWAGDVTRGQRSQEELSGPESSLTDEGIGADPEPP VAAFCGLGTTGMWRPLSSSSAQTNHHGPGTEDSLGGWALVSPETPPTPGALRRRRKVP PSGSGGSEFSNGEAGEAYRSLSDPIPQRHRAATSEEPTGFSVDSNLLGSLSPKTGLPA TSAMDEGLTSGHSDWSVGSEESKGYQEVIQSIVQGPGTLGRVVDDRIAGKAPKKKSLS DPSRRGELAGPGFEGPGGEPIREVEPMLPPSSSEPILVEQRAEPEEPGATRSRAQSER ALPEALPPPATAHRNFHLDPKLADILSPRLIRRGSKKRPARSSHQELRRDEGSQDQTG SLSRARPSSRHVRHASVPATFMPIVVPEPPTSVGPPVAVPEPIGFPTRAHPTLQAPSL EDVTKQYMLNLHSGEVPAPVPVDMPCLPLAAPPSAEAKPPEAARPADEPTPASKCCSK PQVDMRKHVAMTLLDTEQSYVESLRTLMQGYMQPLKQPENSVLCDPSLVDEIFDQIPE LLEHHEQFLEQVRHCMQTWHAQQKVGALLVQSFSKDVLVNIYSAYIDNFLNAKDAVRV AKEARPAFLKFLEQSMRENKEKQALSDLMIKPVQRIPRYELLVKDLLKHTPEDHPDHP LLLEAQRNIKQVAERINKGVRSAEEAERHARVLQEIEAHIEGMEDLQAPLRRFLRQEM VIEVKAIGGKKDRSLFLFTDLIVCTTLKRKSGSLRRSSMSLYTAASVIDTASKYKMLW KLPLEDADIIKGASQATNRENIQKAISRLDEDLTTLGQMSKLSESLGFPHQSLDDALR DLSAAMHRDLSEKQALCYALSFPPTKLELCATRPEGTDSYIFEFPHPDARLGFEQAFD EAKRKLASSKSCLDPEFLKAIPIMKTRSGMQFSCAAPTLNSCPEPSPEVWVCNSDGYV GQVCLLSLRAEPDVEACIAVCSARILCIGAVPGLQPRCHREPPPSLRSPPETAPEPAG PELDVEAAADEEAATLAEPGPQPCLHISIAGSGLEMTPGLGEGDPRPELVPFDSDSDD ESSPSPSGTLQSQASRSTISSSFGNEETPSSKEATAETTSSEEEQEPGFLPLSGSFGP GGPCGTSPMDGRALRRSSHGSFTRGSLEDLLSVDPEAYQSSVWLGTEDGCVHVYQSSD SIRDRRNSMKLQHAASVTCILYLNNQVFVSLANGELVVYQREAGHFWDPQNFKSVTLG TQGSPITKMVSVGGRLWCGCQNRVLVLSPDTLQLEHMFYVGQDSSRCVACMVDSSLGV WVTLKGSAHVCLYHPDTFEQLAEVDVTPPVHRMLAGSDAIIRQHKAACLRITALLVCE ELLWVGTSAGVVLTMPTSPGTVSCPRAPLSPTGLGQGHTGHVRFLAAVQLPDGFNLLC PTPPPPPDTGPEKLPSLEHRDSPWHRGPAPARPKMLVISGGDGYEDFRLSSGGGSSSE TVGRDDSTNHLLLWRV" BASE COUNT 1248 a 1975 c 1846 g 1220 t ORIGIN 1 acgacctatg gtctagtagg ggttctgggg gctggggcgt gtaccgctcc cctagctttg 61 gagctgggga agggctcctg cggtcccagg ctcgaacccg tgccaaagga cctggaggca 121 cctctagggc attgagggat ggaggatttg agcctgaaag agtcgacagc ggaagtccct 181 gtcaaatcca gatatcgcct cagagaccct gacgcttctc agtttcctgc gctcagacct 241 ttcagagctg agggtccgaa aacctggtgg gagctccggg gaccgtggaa gcaaccccct 301 agatggcaga gactcaccat ccgcaggtgg ccctgtgggg caacttgaac ccatacccat 361 cccagcccca gcatcacctg gcacgcgccc cacactcaag gacttgacag ccactctgcg 421 gagagcaaag tcattcacct gctctgagaa gcccatggcc cgccgcctgc cccgcaccag 481 tgctctgaag tccagctcct ccgagctcct gctcacaggc cctggtgccg aggaggatcc 541 gctgcccctc atcgtccagg accaatatgt gcaggaggcc cgccaggttt ttgagaagat 601 ccagcgcatg ggtgcccaac aagatgatgg aagcgatgcc ccccctggaa gccctgactg 661 ggcaggggat gtgacccgag ggcagcggtc ccaggaggag ctctcaggcc ctgagtccag 721 tctgacagat gaaggcattg gggcagaccc tgagcctcct gttgcagcat tttgcggcct 781 gggtaccaca gggatgtggc gacctctttc ctcatcctcg gcccagacga accaccatgg 841 ccctgggact gaggacagtc tgggcgggtg ggccctggtg tcgcctgaga cccctcccac 901 accaggtgcc ctccgccgac gacgcaaagt cccaccttca ggttctggtg ggagcgaatt 961 tagcaatggg gaggcagggg aggcctacag gtccctgagt gacccaattc ctcagcgcca 1021 ccgggctgcc acctctgaag agcctactgg gttctctgtg gacagcaacc tcctgggctc 1081 actgagcccc aagacagggc tccctgccac ctcagccatg gatgagggct tgaccagtgg 1141 tcacagtgac tggtctgtgg gcagtgaaga gagcaaggga tatcaggagg ttattcagag 1201 catagttcag gggcctggca ccctggggcg tgtggtggac gacaggattg ctggcaaagc 1261 ccccaagaag aaatccctga gtgaccccag ccgccgtggg gagctggctg ggcctggatt 1321 cgagggccct ggaggggagc ccatccgaga agttgagccc atgctgcctc catccagcag 1381 cgagcccatc cttgtagagc agcgggcaga gccagaagaa cctggtgcca ccaggagccg 1441 ggcacagtct gaaagggccc tacctgaggc tctgcctccc cctgccactg cccaccgaaa 1501 ctttcacctt gaccccaagc tggctgacat tctgtccccg aggctaatcc gccgaggctc 1561 caagaagcgc ccagctcgga gtagtcacca ggagcttcgg agagacgagg gcagtcagga 1621 ccagactggc agcctgtctc gggcccggcc ctcctccaga cacgttcgcc atgccagtgt 1681 gcccgccaca tttatgccta ttgtggtgcc tgagccacca acttctgttg gtccccctgt 1741 ggctgtgcca gaacccatag gcttccctac ccgagcccat cccacgttgc aggcaccatc 1801 gctcgaggac gtcaccaagc agtacatgct gaacctgcac tccggtgagg tccctgcccc 1861 agtgccagtg gacatgccct gcttgcctct ggctgcaccg ccctctgctg aggccaagcc 1921 ccctgaggca gctcggcctg cagatgagcc tacccctgcc agcaagtgct gcagcaagcc 1981 acaggtggac atgcggaagc acgtggccat gaccctgctg gacacagagc agtcgtatgt 2041 ggagtcgctg cgcaccctga tgcagggcta catgcagccg ctgaagcagc cagagaactc 2101 cgtgctctgt gacccttcac tggtggacga gatcttcgac cagatccccg agctcctgga 2161 gcaccacgag caattcctgg agcaggttcg gcactgcatg cagacctggc atgcccagca 2221 gaaggtggga gccctgctcg tccagtcgtt ctccaaggat gtcctagtaa acatctattc 2281 tgcctatatc gataacttcc tcaatgcaaa ggatgctgtg cgtgtggcca aggaggcgag 2341 gcctgccttt ctcaagttcc tagagcaaag catgcgtgag aacaaggaga agcaggcgct 2401 gtctgacctc atgatcaagc ctgtgcagcg gatcccacgc tacgagcttc tggtgaagga 2461 cctcctgaag catacacctg aggaccaccc ggaccatcca ctcctgctgg aggcgcagcg 2521 gaacatcaag caggtggctg agcgcatcaa caagggtgtg cggagtgccg aggaggcgga 2581 gcgccatgcc cgtgtgctgc aggagataga ggctcacatc gagggcatgg aggatctcca 2641 ggcccctctg cggcggttcc tgagacagga gatggtcatt gaagtgaagg cgatcggtgg 2701 caagaaggac cggtctctct tcctgttcac ggacctcatc gtctgcacca ctctgaagcg 2761 aaagtcaggc tccctgcggc gcagctccat gagcctgtac acggcagcca gtgtcattga 2821 cacagccagc aagtacaaga tgctgtggaa gctgccgctg gaagacgcag acatcatcaa 2881 aggggcatcc caagccacca atcgggagaa catccagaag gccatcagcc gccttgatga 2941 ggacctcacc accctgggcc aaatgagcaa gctctctgag agccttggtt tcccccacca 3001 gagcctggac gatgcactgc gggacctctc agctgccatg caccgggacc tgtcggagaa 3061 gcaggcgctg tgctacgcgc tttccttccc gccaaccaag ctggagctgt gcgccactcg 3121 gcccgagggc accgactcct acatttttga gttccctcac cctgacgccc gccttggttt 3181 tgaacaggcc ttcgatgagg ccaagaggaa gctggcatcc agcaaaagct gtctagaccc 3241 tgagttcctg aaggccatcc ccatcatgaa aacccgcagt ggcatgcagt tctcctgtgc 3301 ggctcccacc ctgaacagct gcccggagcc ctcgcctgag gtatgggtct gcaacagcga 3361 cggctacgtg ggccaggtgt gcctgctgag cctgcgcgcc gagccggacg tggaggcctg 3421 catcgccgtc tgttccgccc gcatcctctg catcggggcg gtgcccgggc tgcagcctcg 3481 ctgccaccgg gagcctcctc cgtcgctgag gagtcctcca gagacggcac cggagcccgc 3541 cgggccggag ctggacgtcg aggccgctgc agacgaggaa gccgcgacgc tcgcggagcc 3601 ggggccgcag ccctgccttc acatctccat tgcaggctcg ggcttggaga tgacgccggg 3661 cctcggcgag ggtgaccccc gcccagagct ggtgcccttt gacagtgact ctgacgatga 3721 gtcttcgccc agcccctcgg ggacgctgca gagccaggcc agccggtcca ccatctcctc 3781 cagctttggc aatgaggaga ccccgagttc caaggaggcc acggcagaga ccaccagctc 3841 agaggaggag caggagccag gcttcctgcc actgtctggc tcctttgggc ctggtggtcc 3901 ctgcggcacc agcccaatgg atgggagagc ccttcgccgc tccagccacg gctccttcac 3961 ccggggcagc cttgaggacc tgctgagtgt cgaccctgag gcctaccaga gctccgtgtg 4021 gctgggcact gaggatggct gtgtccacgt gtaccagtcc tccgacagca tccgtgaccg 4081 caggaacagc atgaagctcc agcatgcggc ctctgtgacc tgcatcttgt atctgaataa 4141 ccaggtgttt gtgtctctgg ccaatggaga gcttgtggtc taccaaaggg aagcaggcca 4201 tttctgggac ccccagaact tcaaatcagt gaccttgggc acccagggga gccccatcac 4261 caagatggta tctgtgggtg ggcggctgtg gtgtggctgc cagaaccgag tccttgtcct 4321 gagccctgac acgctgcagc tggagcacat gttttacgtg ggtcaggatt caagccgctg 4381 cgtggcttgc atggtggact ccagcctggg tgtgtgggtg acattgaaag gtagtgccca 4441 cgtgtgtctc taccatccag acacctttga gcagctggca gaagtagacg tcactcctcc 4501 cgtgcacagg atgctggcag gctcggatgc catcatccgg cagcacaagg ctgcctgtct 4561 gcgaatcaca gcgctgctgg tgtgtgagga gctgctgtgg gtgggcacca gtgctggtgt 4621 cgtcctcacc atgcccactt cgcccggtac tgtcagctgc ccacgggcac cactcagtcc 4681 cacaggcctc ggccagggac acaccggcca cgtccgcttc ttggctgcag tccagctgcc 4741 agatggcttc aacctgctct gcccaacccc accacctccc ccagacacag gccccgagaa 4801 gctgccatca ctggagcacc gggactcccc ttggcaccga ggccccgccc ctgccaggcc 4861 taaaatgctg gttatcagtg gaggtgatgg ctatgaggac ttccgactca gcagtggggg 4921 cggcagcagc agtgagactg tgggtcgaga cgacagcaca aaccacctcc tcctgtggag 4981 ggtgtgaccc tgtctgccgt ggcccaggac tcgcccgccc acctgccttc agcctgcttg 5041 cctctcccta gcccacacgc agactttgac caggagtatc cagccagggg cacacatgtg 5101 cctgcgtggg ctctgccttg tcttcgcgga agcattcctg atggaacacc cactggccag 5161 ccaggccatg gcttctcccg accctctggc tgccccggtg cttccagtca tgatcgggtg 5221 ggggacatgt gggctgacca ggacctctga ccctggagct tctaccaaag acacagctgg 5281 gtctggaccc cacggggctg gggagggcca tgtgcaatat ttggagggtt ttctggaggg 5341 cagcaggaag gctggggaat tccccatgta cagtatttat gtttcttttt agatgtgtac 5401 cttcccaagc acttatttat gcagtgacct ggtcacctgg ggtgggggtg atttgaggaa 5461 atgacatgag gaaaagaaac ctattcctgc cctggggacc accctgggac tctaaccaag 5521 ccttcctgga gggacccatg cgcccctgag ccccattcca ttcatacaga cacacacgta 5581 cgcacactgc atgtccaagg ccctaaacat tgcccgttga cataaacttt ccagggcccc 5641 agcctgatgg ggctgccctc agtcctctag atcaagatgc tgactattag ggggcagtga 5701 ttgccatctg gggacctgtc aggctttgtc atttcccagt ttgttggtgg tgcctttagt 5761 ggttccctaa tttgggaaca ctgatggggc cttggacagg gctttctctc aggtaggaga 5821 aatgggccca tgatctcctc acagtcgccc ccagtccttg gccctgcttc cctgtgtctc 5881 atgcactggc acatatggtc accttggagg gcagacctag gagcccctct gaccactgaa 5941 tccgtctcca caccccttct gccaagggaa gccccttcag gaaggacccc ccaaagctga 6001 ggggctgaat gtagcctttt caacagagaa ggctcccact tgagagcagc ctctacctga 6061 ccccctggac cacagagagc cactctgacc ctcagccccc tcgcttcttc agctaaaact 6121 ccaaaggttt ggtttcagat ggggtttgtt ttgttctgtt tggttttggt tttgtttggg 6181 gtgggtgggt cattgcggtc ttagattatg tttctcttgc taccaaacag tcatgtatta 6241 actctctttg gatgatgaag tttaaagagt caataaatag aaacaccag // LOCUS AB002337 6446 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0339 gene, complete cds. ACCESSION AB002337 NID g2224618 KEYWORDS KIAA0339. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG1304. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6446) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..6446 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG1304" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 687..4469 /gene="KIAA0339" CDS 687..4469 /gene="KIAA0339" /codon_start=1 /db_xref="PID:d1021635" /db_xref="PID:g2224619" /translation="MDQEGGGDGQKAPSFQWRNYKLIVDPALDPALRRPSQKVYRYDG VHFSVNDSKYIPVEDLQDPRCHVRSKNRDFSLPVPKFKLDEFYIGQIPLKEVTFARLN DNVRETFLKDMCRKYGEVEEVEILLHPRTRKHLGLARVLFTSTRGAKETVKNLHLTSV MGNIIHAQLDIKGQQRMKYYELIVNGSYTPQTVPTGGKALSEKFQGSGAATETAESRR RSSSDTAAYPAGTTAVGTPGNGTPCSQDTSFSSSRQDTPSSFGQFTPQSSQGTPYTSR GSTPYSQDSAYSSSTTSTSFKPRRSENSYQDAFSRRHFSASSASTTASTAIAATTAAT ASSSASSSSLSSSSSSSSSSSSSQFRSSDANYPAYYESWNRYQRHTSYPPRRATREEP PGAPFAENTAERFPPSYTSYLPPEPSRPTDQDYRPPASEAPPPEPPEPGGGGGGGGPS PEREEVRTSPRPASPARSGSPAPETTNESVPFAQHSSLDSRIEMLLKEQRSKFSFLAS DTEEEEENSSMVLGARDTGSEVPSGSGHGPCTPPPAPANFEDVAPTGSGEPGATRESP KANGQNQASPCSSGDDMEISDDDRGGSPPPAPTPPQQPPPPPPPPPPPPPYLASLPLG YPPHQPAYLLPPRPDGPPPPEYPPPPPPPPHIYDFVNSLELMDRLGAQWGGMPMSFQM QTQMLTRLHQLRQGKGLIAASAGPPGGAFGEAFLPFPPPQEAAYGLPYALYAQGQEGR GAYSREAYHLPMPMAAEPLPSSSVSGEEARLPPREEAELAEGKTLPTAGTVGRVLAML VQEMKSIMQRDLNRKMVENVAFGAFDQWWESKEEKAKPFQNAAKQQAKEEDKEKTKLK EPGLLSLVDWAKSGGTTGIEAFAFGSGLRGALRLPSFKVKRKEPSEISEASEEKRPRP STPAEEDEDDPEQEKEAGEPGRPGTKPPKRDEERGKTQGKHRKSLLWTAKGRRHPRSP PRRRMRRMTRKMRKMKIERKLWIPQRRRQRCRMARTRKAIRLPNVLCMLTQMAKMTAH QTPRAAALPAPHPPPPPRPHPPRPLHPLSPPLKMKRKRSGQQPFPQPPRPPEKSQCPR QHLWRCQCRKGLQAPQSHPCPNRRRLQQGLQAPRRSHPPVRLCVPQNHLLGPRPLPHA PMSVPLLPSPSCPHPRNAGKLSPSLPSRWCQPRSPLQPHRRRPSFPAQPPARLPGAWS GPSATCPWTTHLWSRVGPRRCPEEAGAGLEAEAASPRKRRLSQGQRWTWRSWPTWP" BASE COUNT 1320 a 2186 c 1838 g 1102 t ORIGIN 1 atcctcagaa ttcctcgggt ccctcgatac tcggctgaaa attctcatcg gactctgaga 61 ggagcgctgg gctggaggca ttttccccag ggacagaagc gggctattct ctcacttggg 121 ccagtaagaa aaatccaaaa aaagttgtcg actctgccag cagggattgg ctaacgggcc 181 gttattttct tgactccacc aaggcggatg aaggggaggc tacggctgag gccgggaaca 241 gtggcgaatc tgcagcctct cagaatttgg cagtgcaagg aagggacggg gaagagaagc 301 aaagcggcgc gcatcctgtc cagcgattcg ccccgcccgc ccggtgaatc tgcgtctgca 361 gaacgcgcca ctgaaggttc cccagcgctg gctggcctcc tcccctccgc cccgcccctt 421 ttcctcaggg actagtcgca gctttcgtcg ccgccgattc gtcaaggtcc cgggccgcag 481 catctagatc gtcgtggcga agccgactct ccgggggatg cggccaatct ccaagctccc 541 tgggccgcaa cttccgagcc tcccagggcg ccggccgagg cgaagccgct accctcggcc 601 ccgtgggtcc cccggcagcg cctgtggcga aagtgcgaat gcagaccctg tgcccgctgg 661 gcctcgcgca gtgtaaatga gcaaagatgg atcaggaagg tgggggagat gggcagaagg 721 ccccgagctt ccagtggcgg aactacaagc tcatcgtgga tcctgccttg gaccctgccc 781 tgcgcaggcc ttctcagaag gtgtaccgct atgatggagt ccacttcagt gtcaacgact 841 caaagtatat accagtcgaa gacctccaag acccccgttg ccatgtcagg tccaaaaaca 901 gagacttttc cctcccagtc cctaagttta agctggacga gttctatatt ggacagattc 961 cactgaagga agtgactttt gcaaggctga atgacaacgt gcgggagacc ttcctgaagg 1021 atatgtgccg taagtacggt gaggtggaag aggtagagat cctccttcac ccccgtacgc 1081 gcaagcacct gggcctggcc cgtgtgctct tcaccagcac tcggggcgcc aaggaaacgg 1141 tcaaaaacct ccaccttacc tccgtcatgg gcaacatcat ccatgcccag cttgacatca 1201 aaggacaaca acgaatgaaa tactatgaac taattgtcaa tggctcctac acccctcaga 1261 ctgtgcccac tgggggcaag gccctgagtg agaagttcca aggctcgggt gcagccactg 1321 agacggccga atcccgccgc cgctcttcct ctgacacagc tgcctaccca gcaggcacca 1381 ctgcggtggg cactcctggc aacggcaccc cctgctccca ggacacaagc ttctccagca 1441 gccgacaaga taccccatct tcctttggcc agttcacacc tcagtcctcc caaggaaccc 1501 cctacacgtc tcggggcagc accccctact ctcaggactc tgcctactcc agcagcacca 1561 cttcaacctc cttcaagccc cggcggtcag agaacagcta ccaagatgcc ttttcccgcc 1621 gccacttctc tgcatcttca gcctccacaa ccgcctccac ggccatcgcc gccaccactg 1681 cagccactgc ctcatcctcc gcctcttcct cctcattgtc ctcgtcctcc tcgtcatcct 1741 cttcctcctc gtcctctcag tttcgtagtt ctgatgcaaa ctacccagcg tattatgaaa 1801 gctggaatcg ctaccagcgc catacttcct acccaccacg ccgggccaca cgggaggaac 1861 cccctggagc cccttttgct gaaaatacag ctgagcgctt cccaccttct tacacctcct 1921 acctgccccc cgagcccagc cggcccaccg accaggacta ccggcctcct gcctcagagg 1981 ctccaccccc ggagcctcca gaacctggtg gaggcggggg tggaggaggg cccagccctg 2041 agagagaaga agttcggact tccccccgcc cagcctcccc tgcccgctct ggctccccag 2101 ccccggagac caccaatgag agtgtgccct tcgcccagca cagcagcctg gattcccgca 2161 tcgagatgct gctgaaggag cagcgctcca agttttcctt cttggcctct gacacagagg 2221 aggaggaaga gaacagcagc atggtccttg gggccagaga tacagggagt gaggtgcctt 2281 ctgggtcagg gcatgggccc tgcacacccc ctccggcccc agctaatttt gaggatgtgg 2341 cacctacagg gagcggggag ccaggggcta cccgggagtc tcccaaggca aatggacaga 2401 accaggcttc tccatgctct tctggagacg acatggagat ctccgacgac gaccggggtg 2461 gctcaccccc tccggccccg acgccccctc agcagcctcc gccacctccc cctcccccgc 2521 cgcctcctcc tccctacctg gcgtcccttc ctcttggtta tcctccccac caacctgcct 2581 acctcctccc acccagacct gatgggccgc cgccccctga gtacccccca cctcctccac 2641 cacccccgca catctatgac tttgtgaact ccttggagct catggaccga cttggggctc 2701 agtggggagg gatgcccatg tccttccaga tgcagaccca gatgttaact cggctccatc 2761 agctgcggca gggcaaggga ttgattgccg cctcagctgg cccccccggt ggggcctttg 2821 gggaggcctt cctcccgttt ccacccccgc aggaggcagc ctacggcttg ccgtatgctc 2881 tatatgcaca ggggcaggag ggcagagggg catactcacg ggaggcctac cacctgccca 2941 tgccaatggc agccgagccc ctgccctcct cctcagtctc gggagaggag gcccggctgc 3001 cacccaggga agaagcagag ctggcagagg gcaagaccct cccgacagca ggcaccgtgg 3061 gccgtgtgct cgccatgctg gtccaggaga tgaagagcat catgcagcga gacctcaacc 3121 gcaagatggt ggagaacgtg gccttcggag cctttgacca gtggtgggag agcaaggagg 3181 agaaggccaa gccattccag aacgcggcca agcagcaagc caaggaggag gataaagaga 3241 agacgaagct gaaggagcct ggcctgctgt ccctcgtgga ctgggccaag agcgggggca 3301 ctacgggcat cgaggctttc gcctttgggt cagggctgag aggggccctg cggctgcctt 3361 cattcaaggt aaagcggaaa gagccatcgg aaatttccga ggccagtgag gaaaagaggc 3421 ctcgtccctc cactcctgct gaggaagatg aagacgaccc tgaacaagag aaggaggctg 3481 gagagccagg acgtccgggg accaagcccc cgaagcggga cgaagagcga ggcaagaccc 3541 agggcaagca ccgcaagtcc ttgctctgga cagcgaaggg gaggaggcat cccaggagtc 3601 ctcctcggag aaggatgagg aggatgacga ggaagatgag gaagatgaag atcgagagga 3661 agctgtggat accacaaaga aggagacaga ggtgtcggat ggcgaggacg aggaaagcga 3721 ttcgtcttcc aaatgttctc tgtatgctga ctcagatggc gaaaatgaca gcacatcaga 3781 ctccgagagc agcagctctt ccagctcctc atcctcctcc tcctcctcgt cctcatcctc 3841 ctcgtcctct tcatcctctg agtcctcctc tgaagatgaa gaggaagagg agcggccagc 3901 agcccttccc tcagcctccc cgccccccag agaagtccca gtgcccacgc cagcacctgt 3961 ggaggtgcca gtgccggaaa gggttgcagg ctccccagtc acacccctgc ccgaacagga 4021 ggcgtctcca gcaaggcctg caggccccac ggaggagtca ccccccagtg cgcctctgcg 4081 tcccccagaa ccacctgctg ggcccccggc ccctgcccca cgccccgatg agcgtccctc 4141 ttctcccatc cccctcctgc ccccacccaa gaaacgccgg aaaactgtct ccttctctgc 4201 catcgaggtg gtgccagccc cggagccccc tccagccaca ccgccgcagg ccaagtttcc 4261 cggcccagcc tcccgcaagg ctccccgggg cgtggagcgg accatccgca acctgcccct 4321 ggaccacgca tctctggtca agagttggcc cgaggaggtg tcccgaggag gccggagccg 4381 ggctggaggc cgaggccgcc tcaccgagga agaggaggct gagccaggga cagaggtgga 4441 cctggcggtc ctggccgacc tggccctgac ccctgcccgg cgcgggctgc ctgccctgcc 4501 tgctgttgaa gactcagagg ccacagagac atcggacgag gccgagcgcc ctaggcccct 4561 gctcagccac atcctcctgg agcacaacta tgccctggcc gtcaagccca cgccccctgc 4621 gccagccctg cggcccccgg agccagtgcc cgcacccgcc gccctcttca gttccccagc 4681 tgatgaggtc ctggaggccc ccgaggtggt ggtggctgag gcggaggagc ccaagccgca 4741 gcaactgcag cagcagcggg aggagggcga agaggagggg gaggaagagg gggaggaaga 4801 ggaggaggag tcctctgaca gcagcagcag cagcgatggg gagggcgccc tccggaggcg 4861 cagcctccgc tcccacgccc ggcgccgccg ccctccgccc ccacccccgc cgccaccgcc 4921 ccgcgcctac gagccacgca gtgagtttga acagatgacc atcctgtatg acatttggaa 4981 ctcgggcctg gactcagagg acatgagtta cctgcggctt acgtacgagc ggctgctgca 5041 gcagacaagc ggggctgact ggctcaacga cactcactgg gtccatcaca caatcaccaa 5101 cctgaccacc ccaaaacgca agcggcggcc ccaggatggg ccccgggagc accagacagg 5161 ctcagcccgc agcgaaggct actaccccat cagcaagaag gagaaggaca agtacctgga 5221 cgtgtgccca gtctcggccc ggcagctgga gggcgtggac actcagggga cgaaccgcgt 5281 gctgtccgag cgccggtccg agcagcggcg gctgctgagc gccatcggta cctccgccat 5341 catggacagt gacctgctga aactcaacca gctcaagttc cggaagaaga agctccgatt 5401 tggccggagc cggatccacg agtggggtct gtttgccatg gaacccattg ctgctgacga 5461 gatggtcatc gaatacgtgg gtcagaacat ccgtcagatg gtggccgaca tgcgggagaa 5521 gcgctacgtg caggagggca ttggcagcag ctacctgttc cgggtggacc acgacaccat 5581 catcgatgcc accaagtgtg gcaacctggc cagattcatc aaccactgct gcacgcctaa 5641 ctgctacgcc aaggtcatca ccatcgagtc ccagaagaag atcgtgatct actccaagca 5701 gcccattggc gtggacgagg agatcaccta cgactacaag ttcccactgg aagacaacaa 5761 gatcccgtgt ctgtgtggca cagagagctg ccggggctcc ctaaactgag gtggggcagg 5821 atgggtgccc acacccctat ttattccccc tggtgccctg agctcccagc acccccccag 5881 ccttagtggg ctcagcaggg cccacatgcc cccatctcca agcgtggggt tgggggcccc 5941 aagcccagcg agggagcctc agtccctgga ggcagcttct gcctctcctg tcgcccctgc 6001 ccaccacccc ctgattgttt ttctttgcgg agaagaagct gtaaatgttt tgtagcagcc 6061 agcagctgtt tcctgtggaa acctggggtg ccggcctgta cagattctgt cctggggggc 6121 tacacagtcc tctcgctttg tgttaatggg gacttcccct tacgccctgc gtgtacccct 6181 ccccagttta ggggtctctg gggcagtggc catgttctcc ccctgggggg gctctgcacc 6241 cccagtcctg gggactccgt gcctggaacc ctgcctcatc tgttcctgcc agaccctgag 6301 ggtcaccctt ccaccctggt gtcactcccc ggctcagcca ggccaggatg gcggggtggg 6361 tcccttttgc tgggctggac tgtacatatg ttaatagcgc aaacccgacg ccacattttt 6421 ataattgtga ttaaacttta ttgtac // LOCUS AB002340 6691 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0342 gene, complete cds. ACCESSION AB002340 NID g2224624 KEYWORDS KIAA0342. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG3234. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6691) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..6691 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG3234" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 1936..5232 /gene="KIAA0342" CDS 1936..5232 /gene="KIAA0342" /codon_start=1 /db_xref="PID:d1021638" /db_xref="PID:g2224625" /translation="MYCQEELFEEAAIAVEKYEEMLKTKTLPISKLSYSASQFYLEAA AKYLSANKMKEMMAVLSKLDIEDQLVFLKSRKRLAEAADLLNREGRREEAALLMKQHG CLLEAARLTADKDFQASCLLGAARLNVARDSDIEHTKDILREALDICYQTGQLSGIAE AHFLQGVILRDFQKLRDAFFKFDTLNHSAGVVEALYEAASQCEAEPEKILGLAPGGLE ILLSLVRALKRVTNNAEKEMVKSCFEFFGISQVDAKYCQIAQNDPGPILRIIFDLDLN LREKKTKDHFLIMTDQVKLALNKHLLGRLCQITRSLLGKTYRGVCMRFIVGLKCEDEN CEHFHRPLRRCEAKCLVQSKMNLVAINGLLLEAKKVFPKILAEELKEIDYILSTDMYG LCKSILDVLFPKHFHQRVLSENPMACKEILKPNYKSFRFYRFALKEYIHFLFENESAR NRRESTDLWLSAMQAFLLSSNYPEEFEKLLHQEEDNYNRELKALESEKDERGRGRGSR IKGIEGKFGMLAPNRDDENMDKTHLCFIRLLENCIDQFYVYRNPEDYKRLFFRFMNVL IKRCKEPLIPSIGNTVALLEFQFIHCGVVLARLWKNVILCLPKSYIALLHYWEFLFSK KDKELGDVFSIIQEYKPKDVTRAIQDFRFHLSYLAKVLCGYENVNFNVLLDAFSEIDY VVSGEAERTLVLCLVMLVNAEEILQPYCKPLLYRHFREIESRLQLMSMDCPGQVPERL LKVVKRVLVAVNVKSVAEALQDLLFERDEEYLMDCDWRWDPVHTKGSIVRGLYYEEVR LNRLLCLDPVDYFAEPECEFGQDEMDELALEDRDHVLATILSQKQRKASIQRKLRRAC LVVSLCISWRRRVGTQMERVREEAREPRAGNFKKADVDRTQCDLCGVKFTRGPENYFS PSKAFEGAASEVAVLSRAELEREECQERNSESYEQHIHLEHHQRQQVAYQKYSEFFHE KVDPAIDEGKLVVQDIEQSVWIHSHVGSKEHSHMLQKVQEHIKRVSDMVEDLYRRKAW AGAEEAMTRLVNILILSVRDARDWLMKTETRLKKEGIVQEDDYENEVEDFGELRPRRR SRKCGKQRKY" BASE COUNT 1708 a 1554 c 1654 g 1775 t ORIGIN 1 cttgtgctaa caatttatga agcaaaaggc ttagaatttg atgatgtcct cctttacaac 61 ttttttactg attctgaggg atgcccaagt tggagcaagc tctaaggaga gccttacaga 121 tgaagccaag gttagcaatg tccaccctgt gatgacagaa ctgggtaagg tcagttcata 181 gatagagaca attccaggaa gacaaatgag gcttggttcc tgccctctgg atcaggtaat 241 gtgggggaca agaacagctt ccttgcatga gagacccctc tagcatggct tcactaccac 301 tgcttgcagc tcatgcagat gatggcagcc aaagagtccc cacattgacc cccagtgacc 361 tctgtattgg tccactgctt tcacttggct cctgagtcct tgtgggtttt tctccccaag 421 cttagttcat tatgggtaga gctaccaact cacagccgtt gtcctgggtt ttctagccac 481 tgttttctct ttcccactat cttttggtgg agactcaagc tggctgctga ctgtactgtg 541 gggtcttctc tggggactta gctggctgtg tcccagtcct tgttgcagac ttcctttgca 601 catgatgcat gcttgaaggg gctatgttat acccaagtca agagcactgt tggctttaac 661 actttaaaaa tctgccccag aactgaccct ttagatagag tccttccttt ttttcttatc 721 attttttaaa tgttattttg ccttaatagc acaactatta caatattgat ataggtgctt 781 tgcagtttac agagtacttt ttgaagtcat tgtctcactc caccaaccct ctgaggttat 841 atgtattttt attactctca ttctattttt taaaatttat ttttattttt atagagatat 901 ggtcttgctg tgttgcccag gctggtctca aactcctggc cttaagtaat cttcctgcct 961 tggcctcccg aagtgctgag attataagca tgagccacct ttatctgtag ataaagcaga 1021 aggtcactga gattagtaca tttctcatag tcgtaatgta gatggcagag cctggagatg 1081 aactcagatg ctttgagttc taaatctctt tccattttcc ttccttttaa agcctcgcat 1141 gcgaattgaa ttccagaacc ttccacaatt agtgtatcat tgctgttcct tatgtacatt 1201 cctttcctgt taaattttgc taataaaatc agatgcacac tcattaaccc tggaaccatt 1261 tttacctctg aggactaact tttttttcta atgatacatc catcggtcat ggttggactg 1321 aggctctcct aaaagaggaa gggcttataa ggaatggaag atcatttcct cattcacacc 1381 tacctcaact gactccagag aggaaaaccg gccattggtt gaagtacccc tggacaaacc 1441 aggctcttct cagggtcgat ctctcatggt gaatccagaa atgtacaagc tcctcaacgg 1501 agagctgaag cagctgtaca ccgccatcac acgggctcgg gtcaacctct ggatctttga 1561 tgaaaaccga gagaaacggg ctcccgcatt caaatatttc attagaagag attttgtcca 1621 agttgtaaag acagatgaaa ataaagactt tgatgatagc atgttcgtta agacctcaac 1681 tcctgcggag tggattgcac agggagatta ctacgccaag caccagtgct ggaaggttgc 1741 agccaagtgt taccagaaag gaggtgcatt tgagaaggag aagttggccc tggcccatga 1801 cactgccctg agcatgaaat ccaagaaagt cagccccaaa taagagatgc tgcctatttc 1861 tataagcgaa gccagtgcta caaagacgct ttcagatgct ttgagcagat tcaggaattt 1921 gatctagcac tcaaaatgta ctgccaagag gagctttttg aagaagctgc tattgcagtg 1981 gaaaagtatg aagaaatgct aaagactaag acccttccca tttccaagct ctcctattct 2041 gccagtcagt tttacttgga agctgcagca aagtatctga gtgcaaataa gatgaaggaa 2101 atgatggctg tcctctcaaa gctagacata gaagaccagc tggtgttctt gaagtctcgg 2161 aaacgcttag cagaagctgc agacctgctg aacagggaag gtaggagaga agaggctgcc 2221 ctgctgatga agcaacatgg ctgcctcctg gaggctgcca ggctcactgc cgacaaggac 2281 ttccaggcct catgtctgct gggggccgcc cgcctcaatg tggccaggga ttccgacata 2341 gaacacacca aggacattct gagagaagca cttgatatct gctatcaaac tggccagttg 2401 tctggcattg cggaggccca cttcctgcaa ggggtaatcc tgagagactt tcagaagctc 2461 agggatgcct tcttcaagtt tgacacgctc aaccactcag ctggagtggt ggaagcactc 2521 tacgaagcag ccagccagtg tgaggccgag cctgagaaga ttctgggcct ggctccaggg 2581 ggcttggaaa tcctcctcag tctggtcagg gctctcaaaa gagtgaccaa caatgctgag 2641 aaggaaatgg tcaaatcttg ctttgagttt tttgggattt cccaggtgga tgccaagtat 2701 tgccagatag ctcagaatga ccctgggccc atattaagaa taatttttga cctggatttg 2761 aacttgagag agaaaaaaac aaaagatcat tttttgataa tgactgacca agtgaaatta 2821 gccctaaaca aacacctttt gggcaggctg tgtcagatca cacggagcct gcttgggaag 2881 acctaccgag gagtctgcat gaggtttatt gtaggcttaa aatgtgagga tgaaaactgt 2941 gaacattttc acaggcctct gcggcgttgt gaagccaagt gtttagttca gtcgaaaatg 3001 aacttggtgg caatcaacgg gttgcttttg gaagccaaaa aagtattccc taaaatctta 3061 gcagaagaac ttaaagaaat tgattatatt ttgtctacag atatgtatgg cctttgcaag 3121 tccattctgg atgtcctttt ccctaagcat ttccatcaga gagtgttgtc agaaaacccc 3181 atggcatgca aagaaatcct caaaccaaat tacaaatcct tccggttcta cagatttgct 3241 ttgaaggagt acatccactt tctgtttgaa aatgaaagcg cacgcaaccg ccgggaatcc 3301 acagacctgt ggctgagtgc catgcaagct ttccttctct cttccaacta cccggaggag 3361 tttgaaaagc tgctccacca ggaggaggac aactacaaca gggaactcaa agctctagag 3421 tctgaaaagg atgaaagggg cagggggaga ggcagcagga taaaaggaat agaagggaaa 3481 tttggcatgc tggcacccaa cagggatgat gaaaatatgg acaagaccca cctgtgcttc 3541 atccggcttc tggagaattg cattgatcaa ttctacgtgt acaggaaccc agaagactac 3601 aagaggctct ttttccgttt catgaatgtc ctcatcaaga ggtgcaaaga accactcatc 3661 cccagcattg gaaacacagt agccctcctg gagttccagt tcatccactg tggggtggtg 3721 ctggcccgcc tctggaagaa tgtcattcta tgcctcccca agagctacat tgcactcttg 3781 cactactggg agttcctgtt tagcaagaag gacaaggagc ttggggatgt gttctccatc 3841 attcaggaat acaaacccaa ggacgtgaca agagccattc aggatttccg gttccatctc 3901 tcctacctcg ccaaggtgct atgtggctat gagaatgtga acttcaacgt cctgcttgat 3961 gccttcagtg aaatagacta tgtggtctcg ggtgaggctg agcggacact ggtgctgtgc 4021 ttggtgatgc tagtgaatgc tgaggagatc ctgcagccat actgcaagcc tctcctgtat 4081 cgccacttcc gggagattga gtcaaggctg cagctcatga gcatggactg ccctggccag 4141 gttcccgaga ggctcctgaa ggtggtgaag cgggtcttgg tggcagtcaa tgtgaagtct 4201 gtggctgagg cactgcagga cctgctcttt gagcgggatg aagagtacct aatggactgt 4261 gactggcggt gggaccctgt gcacaccaaa gggtccatag tccgtggcct ctattatgag 4321 gaggtcagac taaaccgcct gctctgtttg gaccctgtgg actactttgc tgaacctgag 4381 tgtgagtttg gccaggatga gatggatgaa ctggcattag aagaccgaga ccacgtcctg 4441 gccaccattc tttcccaaaa gcaacggaag gcctccatac agcggaagtt gaggagggca 4501 tgcctggtgg tgtctctgtg catcagttgg aggagaagag tgggcaccca gatggagcgt 4561 gtcagggagg aggccaggga gcccagggct gggaacttca aaaaggcaga cgtggacagg 4621 acccagtgtg acctatgtgg agtgaagttt acccgtggcc cagagaacta tttcagcccc 4681 agcaaagcgt ttgagggggc agcttccgag gtggcagtcc tttccagggc tgagctggaa 4741 agggaggagt gtcaggagag gaacagcgag tcttacgagc agcatatcca tctagaacac 4801 caccagaggc agcaagtggc ctaccagaaa tactcagaat ttttccacga gaaggtggac 4861 ccggccattg atgaaggcaa gctggtggtg caggacatcg agcaaagtgt atggatccac 4921 agtcacgtgg gctccaagga gcacagccac atgctgcaga aggtccagga gcacatcaag 4981 agggtttcgg atatggtgga ggacctctac aggcggaagg cctgggctgg cgcggaggag 5041 gcgatgactc ggctggtcaa cattctgatc ctgtcagtca gggatgcacg agactggttg 5101 atgaaaacag agacccgctt aaagaaggaa ggtattgttc aggaagatga ttatgaaaat 5161 gaagttgaag actttggtga gcttcggcct agaaggcgtt ctcggaaatg tggaaagcag 5221 agaaaatact aatgtccaca cagctgcagc ctcctcatcc ttcggaacat tccattctga 5281 cttagaattc tgagcgctgg ggcagaagac aaaaagaatg tgaattagca ttttaaaaat 5341 agatagggga gtctaacaac accattgctt tttattctca ttacttctct ttcagtggct 5401 taggatctga ttgctccctc acctctaaag tgtttccaag tagagtctgg cagtattagt 5461 ttcccccaca aacaaattca ggtagagaag ttgtagtcga agggaatgtt gggactcctc 5521 actgcatggt ccctggggtg tctggctcat ccccatcatg acatgagttg gtagcagagg 5581 actaggtgtt aggacagact ctcctcacct cagttgctgc tgctctgggc caggaaatct 5641 ctctatggat tccctcatct cctgccacac tgctgcagca tcaagtgtgc tatgagagct 5701 gtatgtgccc aggggtgggc cctgccactg gccaccccga gcccaggacg cattctggtc 5761 atctctgagc ccacttgtag tgtagcttct tctgttagag atgttagcct tttctctctg 5821 tggcgacttt tctgccttcc cttcttctgt tttttgtgat cattcttagc tacctgtgcc 5881 aagtcacagt tagactacct cagaatgtag tcaagaattc acccagtgtt ctttggatgt 5941 cccttcccta gggggaaagg ctggggaatg ttgctgggag cattcccagc tgcctctcca 6001 aagtggtcct aactctgcct tggctatggt gcagacattc caaaaagtgt tttgtataca 6061 gccctgtaca tctagttttc ttggttctcc ctggcaagtt cgctacccta gcttaggctg 6121 acgttctgat gagtctctgc ttgctgagct catcactgtg ggatggtgac ctgcagagct 6181 gcagattgga gcaagtggac agaggaggga gccagctaag ccaaatccgt cactgccact 6241 tggctggcag gccttataag agggcggatc tgggcccgtt tgcaggaggc cattgcctac 6301 cctctcccca cagtgagcca gcagccatca gctgatccca cctgggaaac cttcatgcct 6361 ctctgatggt tactgcccac ccttacccca cccctcagct cagcctggta tggaaagcaa 6421 ggtgcacgtt ggtctttgat tgttctgtcc tcacagcaga gccagctttc cccccatgtt 6481 gctgctgctt ttgctgctca tgctgcttgt tgttttctct tccttatctt tccaaattgt 6541 ttccttgaaa tgcttgtttc caatttttct tagattatgt ttctaatttt tattccaggt 6601 gttttttttt cttacttgac agaaaaaaag aaaaccccaa caatatcttg ctgtattatc 6661 acatttttta taaagttaaa gcatttctct t // LOCUS AB002342 5787 bp mRNA PRI 24-JUL-1997 DEFINITION Human mRNA for KIAA0344 gene, complete cds. ACCESSION AB002342 NID g2280478 KEYWORDS KIAA0344. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG1486. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5787) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 COMMENT Sequence updated (22-Jul-1997). FEATURES Location/Qualifiers source 1..5787 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG1486" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 619..4359 /gene="KIAA0344" CDS 619..4359 /gene="KIAA0344" /codon_start=1 /db_xref="PID:d1021640" /db_xref="PID:g2224629" /translation="MPTEVLATPGYFPTVVQPYVESNLLVPMGGVGGQVQVSQPGGSL AQAPTTSSQQAVLESTQGVSQVAPAEPVAVAQPQATQPTTLASSVDSAHSDVASGMSD GNENVPSSSGRHEGRTTKRHYRKSVRSRSRHEKTSRPKLRILNVSNKGDRVVECQLET HNRKMVTFKFDLDGDNPEEIATIMVNNDFILAIERESFVDQVREIIEKADEMLSEDVS VEPEGDQGLESLQGKDDYGFSGSQKLEGEFKQPIPASSMPQQIGIPTSSLTQVVHSAG RRFIVSPVPESRLRESKVFPSEITDTVAASTAQSPGMNLSHSASSLSLQQAFSELRRA QMTEGPNTAPPNFSHTGPTFPVVPPFLSSIAGVPTTAAATAPVPATSSPPNDISTSVI QSEVTVPTEEGIAGVATSTGVVTSGGLPIPPVSESPVLSSVVSSITIPAVVSISTTSP SLQVPTSTSEIVVSSTALYPSVTVSATSASAGGSTATPGPKPPAVVSQQAAGSTTVGA TLTSVSTTTSFPSTASQLSIQLSSSTSTPTLAETVVVSAHSLDKTSHSSTTGLAFSLS APSSSSSPGAGVSSYISQPGGLHPLVIPSVIASTPILPQAAGPTSTPLLPQVPSIPPL VQPVANVPAVQQTLIHSQPQPALLPNQPHTHCPEVDSDTQPKAPGIDDIKTLEEKLRS LFSEHSSSGAQHASVSLETSLVIESTVTPGIPTTAVAPSKLLTSTTSTCLPPTNLPLG TVALPVTPVVTPGQVSTPVSTTTSGVKPGTAPSKPPLTKAPVLPVGTELPAGTLPSEQ LPPFPGPSLTQSQQPLEDLDAQLRRTLSPEMITVTSAVGPVSMAAPTAITEAGTQPQK GVSQVKEGPVLATSSGAGVFKMGRFQVSVAADGAQKEGKNKSEDAKSVHFESSTSESS VLSSSSPESTLVKPEPNGITIPGISSDVPESAHKTTASEAKSDTGQPTKVGRFQVTTT ANKVGRFSVSKTEDKITDTKKEGPVASPPFMDLEQAVLPAVIPKKEKPELSEPSHLNG PSSDPEAAFLSRDVDDGSGSPHSPHQLSSKSLPSQNLSQSLSNSFNSSYMSSDNESDI EDEDLKLELRRLRDKHLKEIQDLQSRQKHEIESLYTKLGKVPPAVIIPPAAPLSGRRR RPTKSKGSKSSRSSSLGNKSPQLSGNLSGQSAASVLHPQQTLHPPGNIPESGQNQLLQ PLKPSPSSDNLYSAFTSDGAISVPSLSAPGQGNKATIIVQKQ" BASE COUNT 1584 a 1410 c 1217 g 1576 t ORIGIN 1 catttttctc aagacacttc ttcagagaaa aatggttctc tcagtattca ctgtaagctg 61 attgaatgct agatgtggtt aatagtacac ctggcaactt ttacatatat gggtagaata 121 aatagaatag ccaacatggt atcaacaaaa agcttcgggg tttagactgg gaaaattagc 181 atttcatgat atgactgtta tgtggaagaa tttttgcatt tttccattta agattggcta 241 ggaagaaatt gtgttcctcc tatttaaatc atctctaggt gaagataacc gacaataaag 301 tagtattgta gagtcctgtc tcttctgttg gaaccagtct actacttcat ttcaagaatg 361 tggtctttat atgcccttta gattattaga aagagattgg ctactattct tctttgctgt 421 gtgcttcttt tttttttttt tgcatgtctt gctgttttgc taatttattg acccccaacc 481 tgctctaact caccttcctc atttctgtca cagggcttcc cacctcgact gccaccacag 541 tacccaggag attcaaatat tgctccctct tccaacgtgg cttctgtttg catccattct 601 acagtcctat cccctcccat gccgacagaa gtactggcta cacctgggta ctttcccaca 661 gtggtgcagc cttatgtgga atcaaatctt ttagttccta tgggtggtgt aggaggacag 721 gttcaagtgt cccagccagg agggagttta gcacaagccc ccactacatc ctcccagcaa 781 gcagttttgg agagtactca gggagtctct caggttgctc ctgcagagcc agttgcagta 841 gcacagcccc aagctaccca gccgaccact ttggcttcct ctgtagacag tgcacattca 901 gatgttgctt caggtatgag tgatggcaat gagaacgtcc catcttccag tggaaggcat 961 gaaggaagaa ctacaaaacg gcattaccga aaatctgtaa ggagtcgctc tcgacatgaa 1021 aaaacttcac gcccaaaatt aagaattttg aatgtttcaa ataaaggaga ccgagtagta 1081 gaatgtcaat tagagactca taataggaaa atggttacat tcaaatttga cctagatggt 1141 gacaaccccg aggagatagc aacaattatg gtgaacaatg actttattct agcaatagag 1201 agagagtcgt ttgtggatca agtgcgagaa attattgaaa aagctgatga aatgctcagt 1261 gaggatgtca gtgtggaacc agagggtgat cagggattgg agagtctaca aggaaaggat 1321 gactatggct tttcaggttc tcagaaattg gaaggagagt tcaaacaacc aattcctgcg 1381 tcttccatgc cacagcaaat aggcattcct accagttctt taactcaagt tgttcattct 1441 gcgggaaggc ggtttatagt gagtcctgtg ccagaaagcc gattacgaga atcaaaagtt 1501 ttccccagtg aaataacaga tacagttgct gcctctacag ctcagagccc tggaatgaac 1561 ttgtctcact ctgcatcatc ccttagtcta caacaggcct tttctgaact tagacgtgcc 1621 caaatgacag aaggacccaa tacagcacct ccaaacttta gtcatacagg accaacattt 1681 ccagtagtac ctcctttctt aagtagcatt gctggagtcc caaccacagc agcagccaca 1741 gcaccagtcc ctgcaacaag cagccctcct aatgacattt ccacatcagt aattcagtct 1801 gaggttacag tgcccactga agaggggatt gctggagttg ccaccagcac aggtgtggta 1861 acttcaggtg gtctccccat accacctgtg tctgaatcac cagtactttc cagcgtagtt 1921 tcaagtatca caatacctgc agttgtctca atatctacta catccccgtc acttcaagtc 1981 cccacatcca catctgagat cgttgtttct agtacagcac tgtatccttc agtaacagtt 2041 tcagcaactt cagcctctgc agggggcagt actgctaccc caggtcctaa gcctccagct 2101 gtagtatctc agcaggcagc aggcagcact actgtgggag ccacattaac atcagtttct 2161 accaccactt cattcccaag cacagcttca cagctgtcca ttcagcttag cagcagtact 2221 tctactccta ctttagctga aaccgtggta gttagcgcac actcactaga taagacatct 2281 catagcagta caactggatt ggctttctcc ctctctgcac catcttcctc ttcctctcct 2341 ggagcaggag tgtctagtta tatttctcag cctggtgggc tgcatccttt ggtcattcca 2401 tcagtgatag cttctactcc tattcttccc caagcagcag gacctacttc tacaccttta 2461 ttaccccaag tacctagtat cccacccttg gtacagcctg ttgccaatgt gcctgctgta 2521 cagcagacac taattcatag tcagcctcaa ccagctttgc ttcccaacca gccccatact 2581 cattgtcctg aagtagattc tgatacacaa cccaaagctc ctggaattga tgacataaag 2641 actctagaag aaaagctgcg gtctctgttc agtgaacaca gctcatctgg agctcagcat 2701 gcctctgtct cactggagac ctcactagtc atagagagca ctgtcacacc aggcatccca 2761 actactgctg ttgcaccaag caaactcctg acttctacca caagtacttg cttaccacca 2821 accaatttac cactaggaac agttgctttg ccagttacac cagtggtcac acctgggcaa 2881 gtttctaccc cagtcagcac tactacatca ggagtgaaac ctggaactgc tccctccaag 2941 ccacctctaa ctaaggctcc ggtgctgcca gtgggtactg aacttccagc aggtactcta 3001 cccagcgagc agctgccacc ttttccagga ccttctctaa cccagtccca gcaacctcta 3061 gaggatcttg atgctcaatt gagaagaaca cttagtccag agatgatcac agtgacttct 3121 gcggttggtc ctgtgtccat ggcggctcca acagcaatca cagaagcagg aacacagcct 3181 cagaagggtg tttctcaagt caaagaaggc cctgtcctag caactagttc aggagctggt 3241 gtttttaaga tgggacgatt tcaggtttct gttgcagcag acggtgccca gaaagagggt 3301 aaaaataagt cagaagatgc aaagtctgtt cattttgaat ccagcacctc agagtcctca 3361 gtgctatcaa gtagtagtcc agagagtacc ttggtgaaac cagagccgaa tggcataacc 3421 atccctggta tctcttcaga tgtgccagag agtgcccaca aaactactgc ctcagaggca 3481 aagtcagaca ctgggcagcc taccaaggtt ggacgttttc aggtgacaac tacagcaaac 3541 aaagtgggtc gtttctctgt atcaaaaact gaggacaaga tcactgacac aaagaaagaa 3601 ggaccagtgg catctcctcc ttttatggat ttggaacaag ctgttcttcc tgctgtgata 3661 ccaaagaaag agaagcctga actgtcagag ccttcacatc taaatgggcc gtcttctgac 3721 ccggaggccg cttttttaag tagggatgtg gatgatggtt ccggtagtcc acactcgccc 3781 catcagctga gctcaaagag ccttcctagc cagaatctaa gtcaaagcct tagtaattca 3841 tttaactcct cttacatgag tagcgacaat gagtcagata tcgaagatga agacttaaag 3901 ttagagctgc gacgactacg agataaacat ctcaaagaga ttcaggacct gcagagtcgc 3961 cagaagcatg aaattgaatc tttgtatacc aaactgggca aggtgccccc tgctgttatt 4021 attcccccag ctgctcccct ttcagggaga agacgacgac ccactaaaag caaaggcagc 4081 aaatctagtc gaagcagttc cttggggaat aaaagccccc agctttcagg taacctgtct 4141 ggtcagagtg cagcttcagt cttgcacccc cagcagaccc tccaccctcc tggcaacatc 4201 ccagagtccg ggcagaatca gctgttacag ccccttaagc catctccctc cagtgacaac 4261 ctctattcag ccttcaccag tgatggtgcc atttcagtac caagcctttc tgctccaggt 4321 caaggtaata aagcaaccat catcgtccaa aaacaataaa atggagatgt tgccatacct 4381 gggacaaaag cctgttaagg cgggttggga gactagctga ccagaacaca gcctgtgtgt 4441 tgtacactga agaatctggg tgaaaaggga agtggagtga taatgagaat cggtgggctc 4501 actgctccca ttaggtgaaa ttactttttt tcaaggaatt acagtgaaaa gttacatctg 4561 tgtggcctat atgacttgct catttgggat ttggaactta ggctttaata ttaggctgag 4621 atttcctgga tgaaattcta aggtgtttta gcagtttctg aagctaatac attttcttag 4681 ccattgtaga attttgttac ttttaagtat gggagtggca tactaaaatg aataacctta 4741 caattcagtt ttttatccat aatctacttt ccaaatatag ctctgtttat tagtgattgc 4801 tgaaaaaatt cccacagagg aaagagcttt tagtcatatt agaacaagaa ttgaaaagac 4861 ttgggcatct gggtgagaag aatgaaaaaa atataggtac tggcttatgt gcctttgcca 4921 cagtttcaca gaaattagag atcagtctct tcacaggaag aatgcacttg attggtaagg 4981 agggcaaact agctagcatt attcgaacta agaaaagctt ccgcattttg cagatgggta 5041 gaattaagac ctaatatttc atctcttaca tatctgacct tccccccaga agcttgttct 5101 tctgtgtgcc atcttagtgc atttcaccac tccagcctca agtttctaac atcttgtagt 5161 tgtgttctgt ctcttctcct ctctctgttc taccctgttt ttcccctctc acaggctgtg 5221 cgaagtttaa ctgtgcatct gaacaggtga cattcaaacc tggtggcagg aggacccgat 5281 ttctgagtac gccctgcttg gctctttgtg tgtaacacct ttactccttc cttgtccttg 5341 tgtttctgct gcttggatct gatgtttcac gcagtccatt ttcatttgtc tctttttgta 5401 tatcatctac tcagtggctt ggctgaatta ctgttaccct cagaagtttg ggcccccaca 5461 ttaattatga taaaaaatgt caaaataaca agttatctac aaatttcaat gtaactttct 5521 ggtagaagtg cttcttcatg gatctgtgac agagagtgga tatggtatct aggcaataga 5581 ttgctgggtc atttagaata atgaagactg aactccacag tcgtagtcag tgctgtctgt 5641 ctgccctagc attagaaatg agagaaatca gccagacacg gtggcgtaca cctgtaatcc 5701 cagcactttg ggaggccgag gcgggaagat tgcttgaggc caggagctcg agaccaaccc 5761 tgggcaacat ggtgataccc catctct // LOCUS AB002343 6387 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0345 gene, complete cds. ACCESSION AB002343 NID g2224630 KEYWORDS KIAA0345. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG1491. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6387) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..6387 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG1491" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 725..3253 /gene="KIAA0345" CDS 725..3253 /gene="KIAA0345" /codon_start=1 /db_xref="PID:d1021641" /db_xref="PID:g2224631" /translation="MLYSSRGDPEGQPLLLSLLILAMWVVGSGQLHYSVPEEAEHGTF VGRIAQDLGLELAELVPRLFQLDSKGRGDLLEVNLQNGILFVNSRIDREELCGRSAEC SIHLEVIVDRPLQVFHVDVEVKDINDNPPVFPATQKNLFIAESRPLDSRFPLEGASDA DIGENALLTYRLSPNEYFFLDVPTSNQQVKPLGLVLRKLLDREETPELHLLLTATDGG KPELTGTVQLLITVLDNNDNAPVFDRTLYTVKLPENVSIGTLVIHPNASDLDEGLNGD IIYSFSSDVSPDIKSKFHMDPLSGAITVIGHMDFEESRAHKIPVEAVDKGFPPLAGHC TLLVEVVDVNDNAPQLTIKTLSVPVKEDAQLGTVIALISVIDLDADANGQVTCSLTPH VPFKLVSTYKNYYSLVLDRALDRESVSAYELVVTARDGGSPSLWATARVSVEVADVND NAPAFAQSEYTVFVKENNPPGCHIFTVSARDADAQENALVSYSLVERRLGERSLSSYV SVHAESGKVYALQPLDHEELELLQFQVSARDAGVPPLGSNVTLQVFVLDENDNAPALL TPRMRGTDGAVSEMVLRSVGAGVVVGKVRAVDADSGYNAWLSYELQPETASASIPFRV GLYTGEISTTRALDETDAPRQRLLVLVKDHGEPALTATATVLVSLVESGQAPKSSSRA SVGATGPEVTLVDVNVYLIIAICAVSSLLVLTLLLYTVLRCSAMPTEGECAPGKPTLV CSSAVGSWSYSQQRRQRVCSGEGKQKTDLMAFSPGLSPCAGSTERTGEPSASSDSTGK VGFSSILFIYIIFFLERYYRLLPGAVQIVLFIFLEIQQIFFLIK" BASE COUNT 1612 a 1435 c 1494 g 1846 t ORIGIN 1 ctcgcttttc ttgcaatatt ttataccttt tcaattcata gaattactca agaaaactac 61 ctcagttggt tgctactttt tgttgattcc ttttaccaga catgactaag tttctttttc 121 atcagtagat ttctgggctc ctatattcac tagagattgc aactcctgga tttctcttac 181 actagaatcc tatttcgagc catatgggag attctgaatt ccagaacaaa agaattttgt 241 aatttaaaat tcgtgattgc tcaatggaat cattttaatt gttacttcat ttctgtcgtt 301 atttaaaact taagtggaga gttttctcag ggataagaaa accacaatca aggtcataca 361 aaacttttag aggcagtcag tctgctaaga aggctccagc aagagaaacg ggatcttctg 421 tttcaacaat cattacttaa gaaaaaatta agaaaatgaa ataagttttg cagaataact 481 gtgaaatttt tattcatgaa atatgtactt acactttggg ccacgtgatg tcactctttg 541 ccgcgatgtt ctctctgaat ccagacaaat acagcccttt tcccatggga aagaggctca 601 attctttttc actctctctg tgctgaacga tggcgaacac agcagaatgg gactgacgaa 661 atcagatgat ttcttctaat ttggaggcaa ttttcactaa ttagaagaag actgagtatt 721 tgaaatgtta tactcaagtc gaggagatcc agagggtcag cctctactgc tctcgcttct 781 gatcctcgca atgtgggtgg tggggagcgg ccagctccac tactccgtcc cggaggaagc 841 cgaacacggc accttcgtgg gccgcatcgc gcaggacctg gggctggagc tggcggagct 901 ggtgccgcgc ctgttccagt tggattccaa aggccgcggg gaccttctgg aggtaaatct 961 gcagaatggc attttgtttg tgaattctcg gatcgaccgc gaggagctgt gcgggcggag 1021 cgcggagtgc agcatccacc tggaggtgat cgtagacagg ccgctgcagg ttttccatgt 1081 ggacgtggag gtgaaggaca ttaacgacaa ccctccagtg ttcccagcga cacaaaagaa 1141 tctgttcatc gcggaatcca ggccgcttga ctctcggttt ccactagagg gcgcgtccga 1201 tgcagatatc ggggagaacg ccctgctcac ttacagactg agccccaatg agtatttctt 1261 cctggacgtg ccaaccagca accagcaggt aaaacctctt ggacttgtat tacggaaact 1321 tttagacaga gaagaaactc cggagcttca tttattgctc acggccaccg atggaggcaa 1381 acccgagctg actggcaccg ttcaattact catcacggta ctggacaaca atgacaatgc 1441 cccagtgttc gacagaaccc tgtatacggt gaaattacca gaaaacgttt ctatcggaac 1501 gctggtgatt caccccaatg cctcagattt agacgaaggc ttgaatgggg atattattta 1561 ctccttctcc agtgatgttt ctccagatat aaaatccaag ttccacatgg accccttaag 1621 tggggcaatc acagtgatag gacatatgga ttttgaagaa agtagagcac acaagatccc 1681 agtcgaggct gtcgataaag gcttcccacc cctggctggt cattgtacac ttcttgtgga 1741 agttgtggat gtaaatgaca atgctccaca gttgactatc aaaacgctct cggttcctgt 1801 aaaagaggac gcacaactgg ggacagttat tgccctgatt agtgtgatcg acctagacgc 1861 agatgccaac gggcaggtga cctgctccct gacgccccac gtccccttca agctggtgtc 1921 cacctacaag aattactact cgttggtgct ggacagagct ctggaccgcg agagtgtgtc 1981 cgcctacgag ctggtggtta ccgcgcggga cgggggctcg ccttcactgt gggccacggc 2041 cagggtgtct gtggaggtgg ccgacgtgaa cgacaacgca ccagcgttcg cgcagtccga 2101 gtacacggtg ttcgtgaagg agaacaaccc gccgggctgc cacatcttca cggtgtctgc 2161 gcgggacgct gacgcgcagg agaacgccct ggtgtcctac tcgctggtgg agcggcggtt 2221 gggcgagcgc tcgctgtcga gctacgtgtc agtgcacgcg gagagcggca aggtgtacgc 2281 gctgcagccg ttggaccacg aggagctgga gctgctacag ttccaggtga gcgcgcgcga 2341 cgcgggcgtg ccgcctctgg gcagcaacgt gacgctgcag gtgttcgtgc tggacgagaa 2401 cgacaatgcg ccggcgctgc tgacacctcg gatgaggggc actgacggcg cagtgagcga 2461 gatggtgctg cggtcggtgg gcgccggcgt agtggtgggg aaggtgcgcg cagtggacgc 2521 cgactcgggc tacaacgcgt ggctttcata cgagctgcag ccagaaacgg ccagcgcgag 2581 catcccgttc cgcgtggggc tgtacacggg cgagatcagc acaacgcgtg ccctggacga 2641 aacggacgca ccgcgccagc gcctactggt gctggtgaaa gaccacgggg agccagcgct 2701 gacggccacg gccactgtgc tggtgtcgct ggtggagagc ggccaggcgc caaagtcatc 2761 gtcgcgggcg tcagtgggtg ccacgggccc cgaggtgacg ctggtggatg tcaacgtgta 2821 cctgatcatc gccatctgcg cggtgtctag cctgttggtt ctcacgctgc tgctgtacac 2881 tgtgctgcgg tgctcggcga tgcccaccga gggcgagtgc gcgcctggca agccgacgct 2941 ggtgtgttct agcgcggtgg ggagttggtc gtactcgcag cagaggaggc agagggtgtg 3001 ctctggcgag ggtaagcaga agaccgacct catggccttc agcccgggcc tttctccttg 3061 tgctggatct acagagcgaa cgggagaacc ctctgcttcc tcagattcaa ctgggaaggt 3121 gggtttttct agcattttat ttatttatat aatttttttt cttgaaagat attatcgatt 3181 actcccaggg gccgttcaaa tagttttatt catttttcta gaaatccagc agattttttt 3241 tctgataaag taaacccctt aacattggag ccgactttgt cttgacttct agtgagaatt 3301 ataaactgta tattaaatag atattttttg ggtgctgaat caattttatt taaatttgtg 3361 attaaagtga cattgaattt ctgatgctat gctgccataa cacttgaaaa ccaatttagt 3421 tgttagtcat tcattaaaca ttaacatcac tatcatttat ttattgctaa atgatgcata 3481 gtattttagt ctacttgtat tgtttataag aaacccaagc aaaaatatat agcaattgtt 3541 accttgttaa gtttgtagtt ctctacattt ctctggatgg agactgtgaa catctgattg 3601 ttcagcaacc ttcagtatct attattttaa taagaaagaa acttccccta aactttagaa 3661 aacagttgct ccactttagg aatcaaatta tgtcaataaa tgttataaac acagccttca 3721 tttcaactta tataaaatat gttttaaaat gcctgacaat gtagataatt caagaaatgt 3781 tgactgaaat tttgtctaca cttagaacat tttttgaaat tcagtttaca gaaattggag 3841 aaaatgcttt ttaaacaagt gtttcctttc ttcaagaaga cattctcctt ttaattgaaa 3901 ttttctccat tcagtgataa aatgatcagc catgtgaaga ttcgaaactt cgagttcttt 3961 tgaaattcag agtctgtaac ttaaaacatt acccttatga atttagatga gaattcactt 4021 gttctgtcag taatccataa gacagaaatc tgttttttta aaaatatctt tttctcctct 4081 cagctcatac ataacacaag gcagaaatct ggatatgaga tttgcctctt taatgtcact 4141 acatgttatg tttcctgaat tgtagtttgt gactttcaaa atggtggttt tccacactct 4201 acctttagtg caagctattt gtttgttttc taatttatag ttttaaaaac ttcgcttatt 4261 gagtttttgt tatgtggttt atatttttct ttctctttca gctattttat ttaatattgt 4321 gtcagatatt ttacaaggta tgacctaatt aaaaactcag tagagaaaga tcagaatggc 4381 cttgagaata gagccacaaa aataactatg aaaatgccag taacgtttat ttaaaacaaa 4441 atattttaat ttttaaattt tcccttaaaa cacacttttg gaatatgcta caatattaca 4501 tgttttttgt ctttttattt ttctgagacg gagtcgtttt ctgccaccca ggctggagta 4561 cagtggcatg atcttggctc actgcagcgt ctgcctcctg ggttcgagca attctcctgc 4621 ctcagcctcc tgagtagctg ggattatagg cacatgccac cgcgcccagc taatttttgt 4681 atttttagta gagatggggt ttcatcatgt tggccaggtt ggtctcgaac tcctgacctt 4741 gtgatgctcc cacctcggcc tcccaaagtg ctgggattaa agctgtgagc cactgtgcca 4801 aggctttttt attttttttt ttttgtcatt ttctttcaaa acttgagtgg tctctgagct 4861 cctgtcatta aacctatcta tatctgtcta tcagcacaac tcaccttgaa tatagtctta 4921 tactttcaag tatctttgtc tttgcacgtt tttcaagttt catgtgccat ttaaacttgg 4981 acccaggtat ctgattattt gatgtgaata gagggatgct acagatgtca tttgtctccc 5041 gccctaagtc ctccagtctc cttagagcta gtacttacta agcatttact atgtcatcaa 5101 taatcataaa acgtattttt ttttttgagt cagagtctcg ctctgttgcc cgggctggag 5161 tgcagtggtg ccatcttggc tcactccagg ctccccctcc cgtgttcacg ccattctcct 5221 gcctcagcct cccgagtggc tgggactgca ggcgcctgcc accgtgcccg cctagttttt 5281 ttgtattttt ggtagagatg gggtttcacc gtgttagtca ggatggtctc gatctcctga 5341 cctcatgatc ctcccgcctc ggcctcccaa aatgctggga ttgcaggcgt gagccaccgc 5401 gcctggccta aaatgtgttc tttattattg acggctgtat tgatgggatt ggtaatttag 5461 tccttcatat taatctctat tctctctcag agtacaagct ctcatcatat gcaaattctc 5521 agaagggctg tgaacacctt agtaataaat ttatcttttg aggtcattag caaacatgaa 5581 ctcacaggga tccagagatg gtaaaattca aaacagcctg tcaagttcaa aacagagagg 5641 tgaaagcaga agagacactt tcctattttg cctaataggt ctccttatat gcatctgtag 5701 ttaacattcc tcaattcaag ttagaatcat gaaacaataa tgaagctcct cctatgtctc 5761 ttttcaagtt gtaattacta tataggaaaa actaagttgt cacccaatat cttagacact 5821 ttgagagcaa agggggtgct gtaaataagt atacaagatc acagacctaa attgagcctg 5881 ttccagacaa attggggcct atggtcaacc tatccttaga cctgctaacg cattagcatt 5941 agcagcacct aagtcctcat tgaatgttct ggttcaaggc tccacctcag aaattctgaa 6001 atgggtagta agagcaaatt ttcattttaa agcacacctg agatgattct catacaaccg 6061 aaattttaga tccatagccc tatttgatac ttgacagtgc aagtttctgt aatttaaaaa 6121 gatgtggtgg cctgacgcct gcggtccccg ttttgggagg ccgaggtggg agggtccctt 6181 ccttgagccc agcagtttga gaccaatgta gtgagactca tctctgccag aaaaaaaaga 6241 ttggccgggc gtggtggcac acatctctgg tcccaattac tcgggaggct gaggcgagag 6301 aatcgcttga gcctgggaca ttgaggctgc agtgagctgt gatggcacag ctgcatttca 6361 gcccgggtga cagcgagatt ctgtctc // LOCUS AB002346 6562 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0348 gene, complete cds. ACCESSION AB002346 NID g2224636 KEYWORDS KIAA0348. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG1551. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6562) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..6562 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG1551" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 1049..4390 /gene="KIAA0348" CDS 1049..4390 /gene="KIAA0348" /codon_start=1 /db_xref="PID:d1021644" /db_xref="PID:g2224637" /translation="MNCLDCLDRTNTVQSFIALEVLHLQLKTLGLSSKPIVDRFVESF KAMWSLNGHSLSKVFTGSRALEGKAKVGKLKDGARSMSRTIQSNFFDGVKQEAIKLLL VGDVYGEEVADKGGMLLDSTALLVTPRILKAMTERQSEFTNFKRIRIAMGTWNVNGGK QFRSNVLRTAELTDWLLDSPQLSGATDSQDDSSPADIFAVGFEEMVELSAGNIVNAST TNKKMWGEQLQKAISRSHRYILLTSAQLVGVCLYIFVRPYHVPFIRDVAIDTVKTGMG GKAGNKGAVGIRFQFHSTSFCFICSHLTAGQSQVKERNEDYKEITQKLCFPMGRNVFS HDYVFWCGDFNYRIDLTYEEVFYFVKRQDWKKLLEFDQLQLQKSSGKIFKDFHEGAIN FGPTYKYDVGSAAYDTSDKCRTPAWTDRVLWWRKKHPFDKTAGELNLLDSDLDVDTKV RHTWSPGALQYYGRAELQASDHRPVLAIVEVEVQEVDVGARERVFQEVSSFQGPLDAT VVVNLQSPTLEEKNEFPEDLRTELMQTLGSYGTIVLVRINQGQMLVTFADSHSALSVL DVDGMKVKGRAVKIRPKTKDWLKGLREEIIRKRDSMAPVSPTANSCLLEENFDFTSLD YESEGDILEDDEDYLVDEFNQPGVSDSELGGDDLSDVPGPTALAPPSKSPALTKKKQH PTYKDDADLVELKRELEAVGEFRHRSPSRSLSVPNRPRPPQPPQRPPPPTGLMVKKSA SDASISSGTHGQYSILQTARLLPGAPQQPPKARTGISKPYNVKQIKTTNAQEAEAAIR CLLEARGGASEEALSAVAPRDLEASSEPEPTPGAAKPETPQAPPLLPRRPPPRVPAIK KPTLRRTGKPLSPEEQFEQQTVHFTIGPPETSVEAPPVVTAPRVPPVPKPRTFQPGKA AERPSHRKPASDEAPPGAGASVPPPLEAPPLVPKVPPRRKKSAPAAFHLQVLQSNSQL LQGLTYNSSDSPSGHPPAAGTVFPQGDFLSTSSATSPDSDGTKAMKPEAAPLLGDYQD PFWNLLHHPKLLNNTWLSKSSDPLDSGTRSPKRDPIDPVSAGASAAKAELPPDHEHKT LGHWVTISDQEKRTALQVFDPLAKT" BASE COUNT 1727 a 1629 c 1659 g 1547 t ORIGIN 1 gtatggcaag ctcacggacg cgtacggctg cctgggggag ctgaggctga aatctggtgg 61 cacgtctctg agcttcctgg tgttggtgac aggctgcaca tctgtgggca gaattccaga 121 tgctgaaatc tacaaaatca ctgccactga cttttaccct cttcaggaag aggccaagga 181 ggaggaacgc ctcatagctt tgaagaaaat cctcagctcg ggggtgttct atttctcatg 241 gccaaacgat gggtctcgct ttgacctgac tgtccgcacg cagaagcagg gggatgacag 301 ctctgaatgg gggaactcct tcttctggaa ccagctgttg cacgtgccct tgaggcagca 361 ccaggtgagc tgctgtgact ggctgctgaa gatcatctgc ggggtggtca ccatccgcac 421 cgtgtatgcc tcccacaagc aggccaaggc ctgcctcgtc tctcgcgtta gctgtgagcg 481 cacaggcact cgcttccaca cccgtggcgt gaacgacgac ggccatgtgt ccaacttcgt 541 ggagacagag cagatgattt acatggacga tggagtgtca tcttttgtcc agatcagagg 601 ctccgttccg ctgttctggg aacagccagg gcttcaggtt ggctcccatc atctgagact 661 ccacagaggc ctggaagcca atgcccctgc tttcgacagg cacatggtgc ttctgaagga 721 gcagtacggg cagcaggtgg tcgtgaacct tctgggaagc agaggcggag aggaggtgct 781 caacagagcc ttcaagaagc tgctctgggc ttcttgccac gcgggcgaca cgcctatgat 841 caattttgac ttccatcagt ttgccaaagg tgggaagcta gagaaattgg agaccctctt 901 gaggccacag ttaaagctgc actgggaaga cttcgatgtg ttcacaaagg gggagaacgt 961 cagtccacgg tgaggctcgc tgcgcactgt gccgcgtctt ctgctggggg gaagcgtcag 1021 tccacgtttt cagaaaggca ctttgcggat gaactgtctt gactgcctgg accgaaccaa 1081 cactgtgcag agcttcatcg cgctcgaggt cctgcatctg cagctcaaga ccctggggct 1141 gagttcaaaa cccatcgttg accgctttgt ggagtccttc aaagccatgt ggtctctgaa 1201 tggccacagc ctgagcaagg tgttcacagg cagcagagcc ctggaaggga aggccaaggt 1261 ggggaagctg aaggatggag cccggtccat gtctcgaacc atccagtcca acttcttcga 1321 cggggtgaag caggaggcca tcaagctgct gctggttggg gacgtctacg gcgaggaggt 1381 ggcagacaaa gggggcatgc tgctggacag cacggcgctc ctggtgactc ccaggatcct 1441 gaaagctatg actgagcgtc agtccgaatt cacaaatttc aagcggatcc ggattgctat 1501 ggggacctgg aacgtgaacg gaggaaagca gttccggagc aacgtgctca ggacggcgga 1561 gctgacagac tggctgctcg actcgcccca gctctcggga gctaccgact cccaggatga 1621 cagcagccca gctgacatat ttgctgtggg gtttgaagag atggtggaat tgagcgcagg 1681 gaatattgtc aatgccagta ctaccaacaa gaagatgtgg ggtgaacagc ttcagaaagc 1741 catctcacgc tctcatagat acattctgtt gacttcggca cagctggtgg gcgtctgtct 1801 ttatatcttt gtacgtccat accatgtccc gttcatcagg gacgtagcca tcgacacagt 1861 gaagacgggc atggggggca aggcggggaa caagggcgcc gtcggcatcc gcttccagtt 1921 ccacagcacc agcttctgct tcatatgtag tcacctgacg gccgggcagt cccaggtgaa 1981 ggagcggaat gaagactaca aggagatcac ccagaaactc tgcttcccaa tggggagaaa 2041 tgttttttct catgattatg tattttggtg tggcgatttc aactaccgca ttgatcttac 2101 ttatgaagaa gtcttctatt ttgttaaacg ccaagactgg aagaaacttc tggaatttga 2161 tcaactacag ctacagaaat caagtggaaa aatttttaag gactttcacg aaggagccat 2221 taactttgga cccacctaca agtatgacgt tggctcagcc gcctacgata caagcgacaa 2281 atgccgcacc cccgcctgga cagacagggt gctgtggtgg aggaagaaac atccctttga 2341 taaaacagct ggagaactca accttctaga cagtgatcta gatgttgaca ccaaagtcag 2401 acacacctgg tctcctggtg ccctgcagta ttatggtcgt gcggagctac aagcgtctga 2461 tcacagacct gtgctggcga tcgtggaggt ggaagttcag gaagtcgatg tgggtgctcg 2521 ggagagggtt ttccaggaag tgtcctcctt ccagggcccc ctggatgcca ctgttgtagt 2581 aaaccttcaa tcaccgacct tagaagagaa aaacgagttt ccagaggacc tgcgtactga 2641 gctcatgcag accttgggga gttatgggac aattgttctt gtcaggatca accaagggca 2701 gatgctggta acttttgcag acagtcactc ggctctcagt gtcctggacg tggacggtat 2761 gaaggtgaaa ggcagagcag tgaagattag accgaagacc aaggactggc tgaaaggttt 2821 gcgagaggag atcattcgga aacgagacag catggccccc gtgtctccca ctgccaactc 2881 ctgtttgctg gaggaaaact ttgacttcac aagtttggac tatgagtcag aaggggatat 2941 tcttgaagac gatgaagact acttggtgga tgaattcaat cagcctggag tctcggacag 3001 tgaactcggg ggagacgacc tctctgatgt ccccggcccc acagcactgg ctcctcccag 3061 caagtcacct gctctcacca aaaagaagca gcatccaacg tacaaagatg acgcggacct 3121 ggtggagctc aagcgggagc tggaagccgt cggggagttc cgccaccgtt ctccgagcag 3181 gtctctgtcg gtccccaacc ggcctcggcc acctcaaccc ccgcagagac ccccccctcc 3241 aaccggttta atggtgaaaa agtcggcttc agatgcgtcc atctcctccg gcacccatgg 3301 acagtattca attttgcaga cggcaagact tctaccagga gcacctcagc aacctcccaa 3361 ggctcggact ggaataagta aaccttataa tgtcaagcag atcaaaacca ccaatgccca 3421 ggaggcagaa gcagcaatcc ggtgtctcct ggaagccaga ggaggtgcct ccgaagaagc 3481 cctaagtgcc gtggccccaa gggaccttga agcatcctct gaaccagagc ccacaccggg 3541 ggcagccaaa ccagagaccc cacaggcgcc cccactcctt ccccgtcggc ccccacccag 3601 agttcctgcc atcaagaagc caaccttgag aaggacagga aagcccctgt caccggaaga 3661 acagtttgag caacagactg tccattttac aatcgggccc ccggagacaa gcgttgaggc 3721 ccctcctgtc gtgacagccc ctcgagtccc tcctgttccc aaaccaagaa catttcagcc 3781 tgggaaagct gcagagaggc caagccacag gaagccagca tcagacgaag cccctcctgg 3841 ggcaggagcc tctgtgccac cacctctgga ggcgccgcct cttgtgccca aggtaccccc 3901 gaggaggaag aagtcagccc ccgcagcctt ccacctgcag gtcctgcaga gcaacagcca 3961 gcttctccag ggcctcactt acaatagcag tgacagcccc tctgggcacc cacctgccgc 4021 gggcaccgtc ttcccacaag gggactttct cagcacttca tctgctacaa gccccgacag 4081 cgatggcacc aaagcgatga agccagaggc agccccactt cttggtgatt atcaggaccc 4141 cttctggaac cttcttcacc accctaaact gttgaataac acttggcttt ctaagagctc 4201 agaccctttg gactcaggaa ccaggagccc caaaagagat cccatagacc cagtgtcagc 4261 tggcgcttca gctgccaagg cagagctgcc accagatcat gaacacaaaa ccttaggtca 4321 ctgggtgaca atcagtgacc aagaaaagag gacagcactg caggtgtttg acccactggc 4381 aaaaacatga ctgagcagct ttgaaggctg cagtcctata gaatgcatac cttcctccct 4441 ctagacatcc ctccaccaga agagacatct atttaaaggc acactggcca aaacgtttgt 4501 gcatctgtca ctctcgtgta gtttacaaaa atcgtgtctc ttattcagta agatggttac 4561 tcagccacca aaatatattt cactcaaggc ttgtacatct gaagtttgct cttcaaggaa 4621 tgggaacctt cctgttaaat tcggtgtatg gattttaaga aaggaatcta gccaatgagg 4681 tccaagaagt tctcacccat tgaattttta aatggctgtt cagttcatgt tgtacgtgat 4741 ggagatttgt cttttgtttt atttgcattt tacagatttg gtataacatt ttggggagcc 4801 acctgaaggt tgatgtataa agtaaggatt agagaaagag gtcgttgtga ccattagtag 4861 ctgtcctggc ccacttaaac aaggttacaa aaaatcagag tcggaagcag ccaaataggt 4921 caacctaatg actagactgt acattcccat gagccttcat gtttaagtgt gtacatgtgc 4981 gttaaccttg atgatgcgtg aatcccgagg gagccggtgg catacaccgt tagcttaacc 5041 ttagcttaaa ctagctgaag gctcctgtgc catgtcttag acattgcatg ccctatcaat 5101 tactataatc ctgagccatg gtgtgctact gaaaccaatt tttatccacc atctagtcct 5161 tattaaatga aacctcacgg atcctttgtt ccgcttatat tccatgcata ccacataaaa 5221 gcacacagtg cgaaaactct tgctgatacg cgatattgat tctcattgtt agaatatgga 5281 gagtgtttca gcctcgtctg tccggctgga gcttcgggat ggaaagtgct atgtgtccct 5341 gcatataaga atcaccaggc cagtgtttct gggtttgctt gtctatatgt ttgtctatat 5401 tttttgccta tacatttttc ccacgtttcc aacagcactt ctcacctatt caataactga 5461 aaaagacatt accatagtgc tttacatttt taaagtaatg ttacaaggtc tggaatccat 5521 ttggagcaga taccgtgttt tcgctattta ataagaagtt cagtagtgaa atcttactgt 5581 accgcctgtt gtatctggga gcctcgtaca gaggctcgca cagcagtgat caagtgtcat 5641 cccttacgtg actgggggat gtctgtccta aaagctgact gctaggatag taaggatcat 5701 cttgcctggg ctatgccact gtcttgttac caattagaca tctggaattt cataattagt 5761 tttcattgtc actgtcaaga tatattgcag attacttaaa tatggccatc aaaacaaaag 5821 ttacaacacg tatctctttt catctgaaaa ctaatacctg gaaaaggata aaaaaaaaaa 5881 ggaatccgtg acccacagag ctagacagat aagatgcata gttgaccagt cataaaaggc 5941 ggtgtttagg tgatcaggat gccgttggtg gcatttacgt gctttatatg atttttacct 6001 ctgtaacaaa cacaagaaat aaacagaatg gtccttaaca gagtttgggg gagagagcaa 6061 gatgggttcc ttggagaagc tgatttgcca agatgcacat cgctattaac agccagagtc 6121 ataaatgaaa tgaaattgaa gaattcattc aaatgctctt ttccctataa cctcttttct 6181 caccaaaaag gagataaatt tgaaaacaga taaatgtaac aaccagtcaa agaagcaggg 6241 gaaaagtaag ctcctccaaa gttgcttgca gtgctggaaa tagatctcat ttttaggttt 6301 tctcttcgtt ccagatacca aataaatggg acagagaata aaatttttgt taaaatatgt 6361 gctcatctcc taagtagctc ttcagagtct gaccgtaagt aaaaacacac agaattgtgt 6421 tgactggggg aggtgaatca caaaaaagtt acgaggagtt taagagttaa atattatttg 6481 atcgtggctg tcaaatttag tgaacaacat agattggatt tggagttggt agtaggtatg 6541 gttctcatac cagaattctc tt // LOCUS AB002349 6336 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0351 gene, complete cds. ACCESSION AB002349 NID g2224642 KEYWORDS KIAA0351. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG1609. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6336) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..6336 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG1609" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 268..1941 /gene="KIAA0351" CDS 268..1941 /gene="KIAA0351" /codon_start=1 /db_xref="PID:d1021647" /db_xref="PID:g2224643" /translation="MYKRNGLMASVLVTSATPQGSSSSDSLEGQSCDYASKSYDAVVF DVLKVTPEEFASQITLMDIPVFKAIQPEELASCGWSKKEKHSLAPNVVAFTRRFNQVS FWVVREILTAQTLKIRAEILSHFVKIAKKLLELNNLHSLMSVVSALQSAPIFRLTKTW ALLNRKDKTTFEKLDYLMSKEDNYKRTREYIRSLKMVPSIPYLGIYLLDLIYIDSAYP ASGSIMENEQRSNQMNNILRIIADLQVSCSYDHLTTLPHVQKYLKSVRYIEELQKFVE DDNYKLSLRIEPGSSSPRLVSSKEDLAGPSAGSGSARFSRRPTCPDTSVAGSLPTPPV PRHRKSHSLGNNMMCQLSVVESKSATFPSEKARHLLDDSVLESRSPRRGLALTSSSAV TNGLSLGSSESSEFSEEMSSGLESPTGPCICSLGNSAAVPTMEGPLRRKTLLKEGRKP ALSSWTRYWVILSGSTLLYYGAKSLRGTDRKHYKSTPGKKVSIVGWMVQLPDDPEHPD IFQLNNPDKGNVYKFQTGSRFHAILWHKHLDDACKSNRPQVPANLMSFE" BASE COUNT 1616 a 1626 c 1417 g 1677 t ORIGIN 1 cgccgctcag gccctggagc ggacggttcc tactgcggct gggcaccggc tccgctcccg 61 cgtctgcccg cgctccagct gcgcctggcc cggccccggc ccggctcggc gtggccccgg 121 cctccaagcg aaggcgccgc tgccgctggg ccgctcccag ggccatgagg aagcggcggc 181 agccactgcg gcccgcgtca aggacttctc cagacaggtt atgttacctg cagaggctgc 241 cctgaagctc cctgtggcct ggagactatg tacaagagga atggtctgat ggctagcgtg 301 ttggtcacct ctgccactcc acagggcagc agcagctcgg actctctgga gggccagagc 361 tgcgactatg ccagcaagag ctatgatgcc gttgtcttcg atgtcttgaa agtgacccca 421 gaggagtttg ctagccagat tacattaatg gatatacctg tgtttaaagc tatccagccg 481 gaggaactag ccagctgtgg atggagtaag aaggagaaac acagtcttgc ccctaacgtt 541 gtggccttta cccggaggtt taaccaggtc agtttttggg ttgtacgaga aattctaaca 601 gcacagactt taaaaataag ggcagaaatc ctcagccatt ttgtgaaaat agccaagaaa 661 cttctagaac tcaacaacct tcattctctc atgtctgtgg tatcagcatt acaaagtgct 721 cccatcttca ggctgacaaa aacctgggct cttttaaatc gaaaagacaa gactaccttt 781 gagaaattgg actacctgat gtcgaaagaa gataattaca agcggacacg ggaatatatc 841 cgaagcctga agatggttcc aagtattccc tatctaggaa tctatcttct ggatttaatc 901 tacattgatt ctgcatatcc tgcctcaggc agtatcatgg aaaatgaaca aagatccaat 961 cagatgaaca atattcttcg aataattgct gatttacaag tttcctgcag ctatgatcac 1021 ctcaccaccc tgccccatgt gcagaagtac ctgaagtccg tacgctacat tgaagagctc 1081 cagaagtttg tggaagacga caactacaaa ctgtcgctca gaatcgaacc aggaagcagc 1141 tctccaagac tagtctcttc caaggaagat cttgcaggtc cctctgctgg ctccggttct 1201 gcgaggttca gccggaggcc cacctgtcct gacacatctg ttgctggcag cctccccaca 1261 cctccagtcc ccagacacag gaagagccac agcctaggca acaatatgat gtgtcagttg 1321 agtgtagttg agagtaaaag tgcgacattc ccatcggaga aagcaaggca cctactggac 1381 gacagtgtcc tagagtcccg cagcccccga aggggcctgg ctctgacctc ctcctctgct 1441 gtcaccaatg gactctccct aggcagtagt gagagctcag agtttagtga agagatgtct 1501 tcagggctgg aaagccccac cggcccgtgc atctgttctc tggggaactc cgcagctgtg 1561 cccaccatgg aggggcctct gagaagaaaa accctgctca aggaagggcg gaagcctgcg 1621 ctgtcctcgt ggaccaggta ctgggtcata ctctcaggat ccaccctcct gtactacgga 1681 gccaagtcct tgcggggcac agacagaaaa cactataaat ccacacctgg caaaaaggtt 1741 tccatcgtgg gctggatggt gcagctgccc gatgaccccg agcacccaga tatcttccag 1801 ctgaacaacc ctgacaaagg caatgtttac aagtttcaga ctggttcccg atttcatgca 1861 atactgtggc acaagcattt ggatgatgca tgtaaaagca acaggcctca ggtacctgca 1921 aaccttatgt catttgagta agtctctgca ggacgtggca tgacttcaga ggcttctggg 1981 aacccaggct gggcctggtg gtgaagagca gtcctgggca caggctgtga gccagggtgc 2041 tgggaaactc acagctggac tcaggggaca cggcctgtgg cctcaccatc ccagagggct 2101 tcaccagtgt gggatccacc tgtcagtccc cagcgactct catgacactc attctgcagc 2161 accgcctctt ggggcagtgg tcagacccca cacgccctct ctgggcccac cacctgcatc 2221 tgcgactaga gagcacccgg cccacgttgg gttctcagtg ctttctactg cacagagtgg 2281 acagcgctaa ctaacctgtg agaggggccc gagagaagga acagctgtgg aacaggcttt 2341 ttacacccca agtgcatggg gttgctcgcc cacagggctg cctcagattt tgtacaaccc 2401 cgaagcgtcc tctgcgtgtg cgtgctgtac gtgtgtgtgt gtgtgtgagc gagtgtgaac 2461 tcttcaagaa acatgcattt tggcacaaga ctcgtgacat cacacacttc attcgctttg 2521 aggccctgct ttaaccttaa gttatagccc tgtccaccga ggaaggtcag ggtgagagcc 2581 tagattcctc ctgtgtcaag ggtccctcgc attcttttac tgtaaacaaa caatgcctta 2641 aattgtgtct tgttttctgt tcctatgggt gctattcatc tggaaggcct gcttccaggc 2701 ctctttgctg tcagcccttc tgagacagga cctggcttca ggactgtgga ctgggctgct 2761 ggcctgcttg cttcctccct tccccattcc tagcagggcc tgaggccctc ctcttctcgc 2821 ccttcccacc atgccagaat gggaagttgt gacgttgcag ctccaaccga cgtgctcata 2881 gtgatcagct gtgcaggagc catgaggcac caacctctcc ccgcagggca aagcctgtgc 2941 ccccatcatc tcactccttt gcctgcactg ccagggtggg gcccaccaag attcctgatc 3001 atgacgggaa gctgagtgac cctgaggcct taagcttccc cagtcttggc cccaaatgca 3061 gtcaccagca agttttccat tttccaagtc caagggcaca attgttgatg accgtgtgac 3121 aatagagcga agccccgggg agtgaacggt ccaacctctg cattcagtta ggagctcttc 3181 acatgaatca catccttatc tgtcaccttg tgtcacattt taaagtgact tttattttgc 3241 acaaataatt tttattcaga ataataaatc actctttatc atagtatctt ctcttccctc 3301 ttccccttta gtttggatag cctaactctg agaagttaac ccttaaacag ttttctggaa 3361 gagactgaat ttctgggtcc ttgcagctgt gatggtttca gagctcagac tgatcaggca 3421 tcaagctacc ctcaagagtt tctgggctgg atgtttcaga acaacatcta caccagtaaa 3481 gtgtaatagg tcagtttcaa aacgaccaaa agacccacca ctgtattttg accaaataat 3541 gacaacttct ttagaaattt gaatggcttg gtgaggaaag tagttgtcac cagggcctca 3601 ttttgtagtt gagccttaca atgcttagta gttcatcttc tttttgagca aagactagaa 3661 tactttcctc ctaagagaaa ctcccaggtg ataaaagttg atgccatcaa accttgacac 3721 cgggtgctct gcacacccac gcggatgttg cacctcattc tcccgatgac tattcaaatc 3781 agcatctaga ggctgaatga caatgccaaa cactccacct ctgatcagaa ccatgcagtg 3841 ttaacacttt aacctacatt gaatctgatt ctacctgtta acttttaaaa agtcgtaagt 3901 ttggatgaaa gtgcaagatg tggaacatca actacctatt ttccttgggt ttttccactc 3961 tgcaaactgt cctggttttt cacaccaatg aagtattata gatgccaatc caaaacctca 4021 gaatttcagg caccacaaaa acaggtaatt ttctatccct tataagtttg tcttttcttt 4081 cagaaacatc tcttagccta atttgaaata gcacaatcac aattcaaaat gtttagtctt 4141 ctcactaatt gagtctgctt ccacgtcctc tcccaggaac attcttagct cggactcttg 4201 aagaatctct ttagattttg ttggcaaaag ccttatagaa gcagtaagag gcttgaccac 4261 gccggaagag tcctggagct aaagctggaa gacactcagc tctctaagca ggggctcggc 4321 caaacatggg agttaagtgc tgcttgtctt cccagtgttg gtttgaaccc tgtgagcctg 4381 agacagagag ggccaggcac caaccacaag gcgggaaagt ccatgggtag accctccccc 4441 tggagggaag catttctagt ttttgctcct tgactgtcca gagtgtacaa atgttcataa 4501 cgccattgaa gggattattt cttgcatgca tatgctgaat ttttttaagc aaatggatca 4561 tggcacccca aaatgaaagt tatagaaagc tgtctacaac tgtggagttg gtagctggta 4621 acattgttgt ctcaagaaca actcacctct ctccctagga ctaatttttg tctctctcag 4681 ttgaacatgt tttgtcattc aagatcagtc aggtgcattc tggcaactga catacttgat 4741 ggaggattga ttcggtagag agcagtagaa atcttgttct aactgtgcct ggtgagagac 4801 tttggccccc tccctcccta taaggctgtg gaacctgagg aagtagatac ttgaagagat 4861 tctgtttagg aagaaactca ctctcttttg ccagttgaat ttatagagca ttttttttct 4921 taccaagatg gccagtatca ttttaccccc acctcccaag ccccaagagg tgtacctttt 4981 cagatgccat tttacaggcg gaaatgctcc atgaaacagg aagccacttg caagcaacat 5041 ctgctctgtt cctcaggtgg ggcccagagc ccttccccga gactgctgat gtctgtaacc 5101 actggggagc actgccaaaa atacagcttt ctggtttgtg agcccataaa tgacttaaat 5161 cagctttaca tcatttttac atatcaagtg gtttcatgtt aaaaaacaaa ctcctagtcc 5221 tttagaaata acagattctc tgcacaaaac cacccattca ttcatttatt cattcacagc 5281 actagcaagt gctgcctatg ctgagaacaa gtcagatctg atccctgccc tcatggacct 5341 gaccactcaa caaacagtcc ccaccacacc tatctcctta ggcaagactt tgcctctctc 5401 ctagtcctga gtataaatcc tgtgcataga ttcctctaga aaggcatcaa aaggctcaac 5461 agactgaatg gcctcttggt ctgcgaaaat tcagttgcaa tgaggatgaa gtcactatcc 5521 tagaggctgc ttggcccaga agagccaggc acagagctgc agttgggcac gccaaggatt 5581 ccaaaggtgg aatgagagag tagggtcaaa ctgtcacagt atctgctcca taggtttctg 5641 tttttaattt caatgttaaa tacaactaca atatgagcga gaactgcatt ttcttgggtg 5701 ttgagaactt gtaccatgga cttcagaccg ccttgcagcc gtatgctgca caagcgtgta 5761 caccccctgg gcagcctcaa aaccccgctt acagcagcaa cacaggagat catctgtcca 5821 ttttagaacc attaatctct ttatccattg ctgaacgact gtgactattc agtaacgaag 5881 taatagtaat taattagtat ggtataatct ttaataaatt tcgtgccaaa atgcatggtt 5941 ttccacttag cattcaaaat gttgcataga gagtagtttt caatttctta tgtactcttc 6001 aaagtaagtt gaaaatcagt ttctacattt taattcgttt cctgttaaat ctgttgcact 6061 ctcctgggct gtctttttct ccagcagacc cctgcatgca gttgtgtaag gactttctct 6121 aattcttgtg aatcgtctca cccgcagtaa ccactgaacg tcaatcagcc ctccatgggg 6181 ttctttcgat ttttggtgaa gtattttgtt acctcagtct tgtatcaagt tgctgtattt 6241 ttcagcttgt tacattgata ataattattt cactaattaa atactttaat gtacaaacat 6301 ctttgtttac tttgaaatta aatgtgtttt ccaatg // LOCUS AB002350 6170 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0352 gene, complete cds. ACCESSION AB002350 NID g2224644 KEYWORDS KIAA0352. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG1642. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6170) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..6170 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG1642" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 87..2225 /gene="KIAA0352" CDS 87..2225 /gene="KIAA0352" /codon_start=1 /db_xref="PID:d1021648" /db_xref="PID:g2224645" /translation="MGMRIKLQSTNHPNNLLKELNKCRLSETMCDVTIVVGSRSFPAH KAVLACAAGYFQNLFLNTGLDAARTYVVDFITPANFEKVLSFVYTSELFTDLINVGVI YEVAERLGMEDLLQACHSTFPDLESTARAKPLTSTSESHSGTLSCPSAEPAHPLGELR GGGDYLGADRNYVLPSDAGGSYKEEEKNVASDANHSLHLPQPPPPPPKTEDHDTPAPF TSIPSMMTQPLLGTVSTGIQTSTSSCQPYKVQSNGDFSKNSFLTPDNAVDITTGTNSC LSNSEHSKDPGFGQMDELQLEDLGDDDLQFEDPAEDIGTTEEVIELSDDSEDELAFGE NDNRENKAMPCQVCKKVLEPNIQLIRQHARDHVDLLTGNCKVCETHFQDRNSRVTHVL SHIGIFLFSCDMCETKFFTQWQLTLHRRDGIFENNIIVHPNDPLPGKLGLFSGAASPE LKCAACGKVLAKDFHVVRGHILDHLNLKGQACSVCDQRHLNLCSLMWHTLSHLGISVF SCSVCANSFVDWHLLEKHMAVHQSLEDALFHCRLCSQSFKSEAAYRYHVSQHKCNSGL DARPGFGLQHPALQKRKLPAEEFLGEELALQGQPGNSKYSCKVCGKRFAHTSEFNYHR RIHTGEKPYQCKVCHKFFRGRSTIKCHLKTHSGALMYRCTVCGHYSSTLNLMSKHVGV HKGSLPPDFTIEQTFMYIIHSKEADKNPDS" BASE COUNT 1489 a 1540 c 1505 g 1636 t ORIGIN 1 gaagcggcgg cggcgatggt gctcggggcg ccgcagagcc ggattaactg tgctgataag 61 gaggtaattt cataggagct gctaagatgg gcatgaggat caaactgcaa agcaccaacc 121 accccaacaa cctgctgaag gaactcaaca agtgccggct ctcagagacc atgtgcgacg 181 tcaccattgt ggtggggagc cgctccttcc cggcccacaa ggctgtgctg gcctgtgcag 241 ctggctactt ccagaacctc ttcctgaata ctgggcttga tgctgccagg acctatgtgg 301 tggacttcat cacccctgcc aactttgaga aggttctgag ctttgtctac acttcagaac 361 tcttcacaga cctgatcaat gttggggtca tctacgaggt agctgagcgt ttgggtatgg 421 aggacctcct ccaggcctgt cactctacct ttcctgatct ggagagcact gccagggcca 481 agcccctgac cagcaccagt gagagccact ctggtaccct gagttgtcct tcggcagaac 541 ctgcccatcc ccttggagaa ctccgaggtg gtggggacta ccttggtgct gatagaaact 601 atgtgttgcc cagtgatgct ggagggagct ataaagagga agagaagaat gttgccagtg 661 acgctaacca tagcctgcat ctgccgcaac cgcccccacc accgccaaag acagaagacc 721 atgacacccc tgctcccttc acgtccattc ctagcatgat gacccagcca ctcctaggca 781 ctgtcagcac gggcatccag accagcacga gctcctgcca gccatacaaa gttcaaagca 841 atggagactt cagtaaaaac agcttcctca cccctgacaa tgcagtagac attaccactg 901 ggaccaactc ctgtctgagc aatagtgagc actccaaaga tcctggcttt gggcagatgg 961 atgagctcca gctcgaggac ctgggggatg atgacttgca gtttgaagac cctgctgagg 1021 atataggcac aactgaggag gtgattgagc tgagtgatga cagtgaggat gagttggctt 1081 ttggagagaa tgacaatcgg gagaataagg ccatgccctg ccaggtgtgc aagaaagttc 1141 tagagcccaa cattcaactg atccggcagc atgctcggga ccatgtggac ctgctgacgg 1201 gcaactgcaa ggtctgcgag acccacttcc aggaccgaaa ctcccgggta actcatgtcc 1261 tgtcccacat tggtattttc cttttctcct gcgacatgtg tgaaactaag ttctttaccc 1321 agtggcagct gacccttcac cgacgggatg gaatatttga gaacaacatc attgtccacc 1381 ccaacgatcc cctgccaggg aagctgggtc tcttttcagg ggcagcctcc ccagagctga 1441 aatgcgctgc ctgtgggaaa gtattggcca aagatttcca tgtggtccgg ggccacatcc 1501 ttgaccatct aaacttgaag ggccaggcct gcagtgtctg cgaccagcgt caccttaacc 1561 tctgcagcct catgtggcac acgctgtccc atctcggcat ctcagtcttc tcctgttctg 1621 tctgtgcgaa cagctttgtg gactggcatc ttctagagaa gcacatggct gtgcaccaaa 1681 gtctggaaga cgccctcttc cactgccgct tgtgcagcca gagcttcaag tcagaggctg 1741 cctatcgcta ccacgtcagc cagcacaaat gcaacagtgg ccttgatgca cggcctggtt 1801 ttgggctgca gcacccagct ctccagaagc ggaagctgcc agcagaggag tttctgggtg 1861 aagagctggc gctgcagggc caacctggga acagcaagta tagctgcaag gtctgtggca 1921 aaagatttgc ccacacaagc gaattcaact accaccggcg gatccacacg ggggagaagc 1981 cataccaatg taaggtgtgc cacaagttct ttcgaggccg ctcgaccatc aagtgccacc 2041 taaagacaca ctcgggggcc ctcatgtacc gctgcacagt ctgtgggcac tacagttcca 2101 cccttaacct catgagcaaa catgttggtg tgcacaaagg cagcctcccc cctgacttca 2161 ccatcgagca gaccttcatg tacatcatcc attccaaaga ggcggataag aacccggaca 2221 gttgactggg tcccggcaga gccacgggga gctcccaagc agcagccagg atgctgatat 2281 ctaagaggtg ttggtccctc cccagctgaa gttataattt tgccttggta ggaattctgt 2341 tctgtgttgt gtttaaagaa gaaaagaaga agaaatagca cataagctgt tactgttgtt 2401 gagaagcaac agccctatca catttacctc catacctgtt cttgcccatg cagggctatg 2461 tttttcattc ttttgaggct ggttttggga tctagtcaag cagttggtgt ccactagacc 2521 cccttcccca gcctctctag ttttagttta ctgataggtt ttatgctgct aagaatccaa 2581 ccaacagcct cacttaacag aggaggtaaa gggaggtttt cactgtgggt gttactgcag 2641 gcctccaact gggatgacca gcaatgagaa agattttggg aatgtgatca ttcagaaaag 2701 acaggtcagc agggcagtcc cctcaggttc cagccctcag cagggacaag acatcaggga 2761 tgttggtgtc tgcttatcta cagccctaat ctgctgattg aacagtgaaa atcttttggc 2821 agctagatcc atactaggca cagagctttc tatttaggtc agaaagcttt gggatgaacc 2881 cctgccagcc agaaggggtg tcctgcagtg ccaccagaag tggcagcctg gatggacagg 2941 agaggtttct cttttctcct catttccaaa gaaacaggat tttatggagt gatggcccgg 3001 gacgtccggc cttctgtggg gcagagagga tagggaacca ctttgatata gtcatctgtt 3061 ttggccactt ctgttggcca tgagtgtctt ggtggagagg tggggatgta tctgacagca 3121 gcagccttgc cttaatttat atctggtctc ccgtccagaa gtgtttggcc cgtggtgtag 3181 atgagctgac tccatgaagt ggtggagtgg cagaccgcga gcccttcagg attaaaggga 3241 cctgagtaac tggtggtgtg tagcagggtg tgcgttctga ctgtctgtcc tagtcgggta 3301 acctgtttac tttgtgctaa cagtgccgga gctttgtcag ctcacctttg acctgctgga 3361 atttatcctg atctgtcgtt gccatcacct ccagggggcg ctattggatg gcagctggtt 3421 caggccctcc gtgggtggct gcagagtgtg ccggcacagc ccacgcagct ggttagctcc 3481 actcttactt ggttttctaa ggggctttgc ccaaagaagt cttgagggat agggccctcg 3541 atcttgcata cttgtgaggt gccacctcag tagctcatac tacctcaccc tgctcaggtg 3601 agctctggga gtccctggcc tcagccctgg cactgcccct ggtgggattc agcaacaccc 3661 cgggctgttt cacaagcaag tggttttcat taactcacaa agcctttttg gacattaata 3721 tttatttatt tttgtttttg tatacacatt acccagcatc tcttttgtat aagagacttt 3781 aggaaaatga gtttctcccc agcaagtagc catattccag agaacagcct tgaccagaga 3841 tgtggagaac caggggtata actaagggaa gacatgtcaa gcccttaagc aaattcttct 3901 tctccagatt gtctctagca taataaaccc agggaacact ttaggccatg ggtgtatgtt 3961 ctataaagtt cgggacagtt aaattccagg cctttcatcc cccttccttc ctgtgatgga 4021 gtagattggg gacagggttg aggggaacaa gtgacatgaa tgacctattt gcacagtttg 4081 gaagcctcct gtctttattt atattgagat gtcagacaaa ccaaagctcc atccttgttg 4141 gacctgctgc ttgtccccag ccctgaactg ataaagcctc agagttggag tgcctggctc 4201 tctggtgggg tgaccattag atgaagggac ttgtacagtg gccagtttaa aggtccacct 4261 ttgaccatct aaacccacct tgttcagtgt cctctgagga catcctcatc aggaaagcag 4321 tgttgagact cttcattgct gactggcttc tccctttctt actcacactg accattagaa 4381 tttaagaagg aaatgtgtaa cagactacag tcaagtgtct gctacatttt caagcatgag 4441 caatccctcc cagactgttg gtgaggactg atttttgaaa tgctggtgtg agagaggtgg 4501 taattacagg aaccagccga gtggctgaga gcagataaat gtgctggaga aacctttttc 4561 cttaacagag ggcatcatgg atgctgggtg gtctgtgtac ttaacctgaa actgtgaagt 4621 tttccctttt tgccagtaaa ccaaaaagca gatccttgaa acttggccct tgaaacacga 4681 aacagaaaac tgcatcccct gatcccccgg gggctgatca gattgatcag ggtggctcag 4741 ttggtcccag tcagatacgt cataggatca gtagctcatg agatttgttg accaaaccct 4801 ctccctgttg gtgacccctg tattcacacc tgaacttccc gtttcccccc accccaccaa 4861 agtcatgtct gcttcctctg cctccagctc accctcttct gaatggtttt gcttgaaaca 4921 ctatattgtg gcagagggca ccctgggata cctggtagaa ggtattcatt ttattttgca 4981 tttttaattg ttttgacttt ctgttcattt gattttgttg ctccctctct cctggaacct 5041 agtttactac tctcttctgt gttactctga agttctgttt aagccttgaa catcctcttc 5101 tcccattttc ttggtatgta ctcaggacca gctacatcat ttgtggagcc ctcttattca 5161 taaattatta aaaatttcaa ggtggtggtc atagagcatt aaaccaaata tgaggccatt 5221 cccaacttgt tttccgaggg gaaaatggta atacttgtgt ggcacccggg gttaaacagc 5281 agaggctcca tgtggccaga ggcagagatt agtatcctgg cactccagtg acccactggg 5341 tgactcactg atgccacagc acccgctagg aagctctgct gaaccttagt atttggtcct 5401 aaattttatg actccatgga gttcccgtag tccatggcta gttaggaaga aaggaggtgg 5461 gataagggtc aggcccaggt gacccctaag aaccaggaga tgggtaaaag ttttttttta 5521 tattctgctt ttctgatctg tgagtacctg tttgtctcca ggccaaacct ttgggcttaa 5581 atatcttttt cctagacagg tttttgctag tgttgaattt tcttcttcct ctggcctcct 5641 tctgtgcccc tttccccaag cccaagactg cttaacttcc aaagcaaatt ctagatagac 5701 actgtattta ttggtatggg agtgggctct atggggtggt ctgcacccat ctgggactct 5761 tttccctaaa tcctgcacca aatgagtcag gaggcagggt gcacagcatt agtttcaatg 5821 tggttatgca tcataagctt aacatcagaa tgaaaatgaa actcgatttt gatgtttctt 5881 taaaaccctt cccctgtcca atccactcgc cgcccccacc ttgaatagct aaagtctctt 5941 atgaaacaga gaagagttgt tgacgtctaa ctccttccat taaattaata agtactgacc 6001 tcctaatatt taagtgttta ctatctattg ctgtaaagtt ttgtatattt tgtaaacttt 6061 tttccccaaa tagtagatgt ctaaaatcat tgtacatctg attcttttat attccattgt 6121 tcagcacaaa gtgtggtttt tatttagaat aaaaaaagaa atttgaaatg // LOCUS AB002352 4631 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0354 gene, complete cds. ACCESSION AB002352 NID g2224648 KEYWORDS KIAA0354. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG1842. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4631) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..4631 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG1842" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 190..2223 /gene="KIAA0354" CDS 190..2223 /gene="KIAA0354" /codon_start=1 /db_xref="PID:d1021650" /db_xref="PID:g2224649" /translation="MDFPGHFEQIFQQLNYQRLHGQLCDCVIVVGNRHFKAHRSVLAA CSTHFRALFSVAEGDQTMNMIQLDSEVVTAEAFAALIDMMYTSTLMLGESNVMDVLLA ASHLHLNSVVKACKHYLTTRTLPMSPPSERVQEQSARMQRSFMLQQLGLSIVSSALNS SQNGEEQPAPMSSSMRSNLDQRTPFPMRRLHKRKQSAEERARQRLRPSIDESAISDVT PENGPSGVHSREEFFSPDSLKIVDNPKADGMTDNQEDSAIMFDQSFGTQEDAQVPSQS DNSAGNMAQLSMASRATQVETSFDQEAAPEKSSFQCENPEVGLGEKEHMRVVVKSEPL SSPEPQDEVSDVTSQAEGSESVEVEGVVVSAEKIDLSPESSDRSFSDPQSSTDRVGDI HILEVTNNLEHKSTFSISNFLNKSRGNNFTANQNNDDNIPNTTSDCRLESEAPYLLSP EAGPAGGPSSAPGSHVENPFSEPADSHFVRPMQEVMGLPCVQTSGYQGGEQFGMDFSR SGLGLHSSFSRVMIGSPRGGASNFPYYRRIAPKMPVVTSVRSSQIPENSTSSQLMMNG ATSSFENGHPSQPGPPQLTRASADVLSKCKKALSEHNVLVVEGARKYACKICCKTFLT LTDCKKHIRVHTGEKPYACLKCGKRFSQSSHLYKHSKTTCLRWQSSNLPSTLL" BASE COUNT 1171 a 1044 c 1148 g 1268 t ORIGIN 1 gcgaactggc tggaggagct aggggactag aggcggggtg ggaggggggc gggtggaagg 61 gggaggaagt cccgtaacgg agacgctggt caggacgttc ccacctcctc tgacactgcc 121 gagtccgatc ggagaggggt caccgcctcc ttcagcgagg aggagggggg cggagcccga 181 ctcaggatca tggattttcc tggtcacttt gaacaaatct tccagcagct gaactaccag 241 agacttcatg gccagctctg tgattgtgtc attgtagtgg ggaatagaca ctttaaagcc 301 caccgctccg tgctggcagc atgcagcacg catttccgag ccctgttctc agtggcagaa 361 ggagatcaga ccatgaacat gatccagctg gatagcgagg tggtgacagc agaggccttt 421 gctgcactga ttgacatgat gtatacctcc accctcatgc tgggggagag caatgtaatg 481 gatgtcttat tggcagcctc tcacctgcat ttgaactctg ttgttaaggc atgtaaacat 541 tacttaacga caaggacgct gcccatgtct ccccccagtg agcgcgttca ggagcagagc 601 gcccgcatgc agcgctcctt tatgctacag cagctgggac taagcatcgt gagctcagcc 661 ctcaattcca gccagaatgg cgaggagcag ccagccccca tgagctcttc catgcgcagt 721 aacctggatc agcgcacgcc cttccccatg agacgccttc ataagcgcaa gcagtctgca 781 gaggagcggg ccaggcagcg cctccgaccc tccatagatg agtctgccat ttcagatgtt 841 acaccggaga atgggccttc aggggttcat tctcgggagg agttcttttc accagattct 901 ctgaaaattg tggataatcc taaagctgac ggaatgactg ataaccagga agatagtgcg 961 atcatgtttg atcagtcttt tggcactcaa gaagatgccc aggtgcccag ccagtctgat 1021 aacagtgctg gcaacatggc acagttgtcc atggcctctc gtgcaactca ggttgagact 1081 agttttgatc aggaagctgc acctgagaaa agtagttttc agtgtgaaaa ccctgaggtt 1141 ggccttggtg agaaggagca catgagagtg gtggttaaat ctgagcctct gagctcacct 1201 gagcctcagg atgaagtgag cgatgtgacc tcacaagcag aaggcagcga gtctgtggaa 1261 gtggaaggag ttgtggtcag tgccgagaag atagacctca gccctgaaag cagtgatcgg 1321 agtttttcag atccccagtc tagcacagac agggtaggtg atatccatat tttggaagtc 1381 acaaataacc tagagcataa gtccactttt agtatttcga attttcttaa caagagcaga 1441 ggaaataact ttactgcaaa tcagaacaat gatgataata ttccaaacac cactagtgac 1501 tgcaggctgg agagtgaggc cccctatttg ttgagtccag aggctgggcc tgcaggtggg 1561 ccctcctctg cccctggctc ccatgtagag aacccattta gtgaacctgc agactcccac 1621 ttcgtcaggc ctatgcagga ggtgatgggc ctgccgtgtg tgcagacttc aggctaccaa 1681 ggaggagaac agtttgggat ggacttttcc aggtctggtt tgggcctcca ctcctccttc 1741 tccagggtaa tgataggttc cccaagggga ggagccagta actttcctta ctaccgccgc 1801 atagctccca aaatgccagt tgtaacttcc gtcaggagct cacagatccc agaaaactct 1861 accagttctc agctaatgat gaatggagct acgtcctcat ttgaaaatgg ccatccttcc 1921 cagcctggcc ctccacagtt gaccagggca tctgcagatg ttctgtcaaa gtgcaagaag 1981 gccttatcag agcacaatgt tttggttgta gagggagctc gcaagtatgc ctgcaaaatc 2041 tgctgcaaaa cttttctgac tttgacagat tgcaagaagc acatccgtgt tcacacaggt 2101 gaaaagcctt acgcctgcct gaagtgtggc aagaggttta gtcagtccag ccacctgtat 2161 aagcactcaa agactacctg cctgcgctgg cagagcagca atcttcccag cactttgctc 2221 tagctgtttg tccttacaag acaacgctga ggccagttgt cagactgaat ttcttttggt 2281 aagcagttaa tgcctttggg ttcgaggctt ccagctgccc agtggctctt aaacagttta 2341 gcaactaata accggagaac taacatgtag tatttgtgct gctgcatttc tgagtgaagt 2401 gcacgtcttg ggaaagggat gcaatccctg aaaccaggtg cttccttggg gttgagtaat 2461 gcagtcagaa agtagtttgt aattgatatt aaaagtggca catttaaaaa tttaaaaatt 2521 gaagtgcaaa aaaaattttt tagcaatttt tgtaaaactg tgtagcattt aaatttccta 2581 taccttctga tgggagtatt atatccctgt atagtgatgc aaaatgcact tatgtgtaac 2641 cagtggtgat ttggtgcctg tcttaaagga aggcctttga ggacacacct gtctgccaca 2701 aatgctttaa agtgtatcat gagctagtcc taggcctcaa agtactgtat tttttatttt 2761 tacctgattt gcagtcataa acactgcact ttggtgctga cactgggtcc agagtgagca 2821 ttctcttgga ctattagatg tatatacttt tgaatacatc actgttggat agatgtttta 2881 acagtttttt ctggtttaaa aaccaaattg taaatggagt gtgtacttgt agagagtgac 2941 aaggtattgt ttgtttccct atgtgctgtt tgagcagtat tttaaccaac ttgtattaca 3001 gatgttacag ttccatgtta ggaagtcaga aaagacttgt gtttgtcttt gttctgctga 3061 tgtggagtca tgttttgtgg ggtcttccat ggcacattta cctgttgctc cgtccagatg 3121 ttgagggcca gtctaggctg acacatccta cccgaggaca agcctgttct ccatttcttc 3181 actctcccct ccccatatag caactctccc aggtttagat taccgttttc gacgacagat 3241 taaccaaaaa tgccccacac aggttttatt actgttatat actatacttt taacagtaca 3301 gaccctaaat tttattattt gttgctcccc caatctgata ccaaatgttt aaagttgttt 3361 gaaatccaaa catggtagtg ttcatgggta aatattttct aggctatgta agagttagca 3421 gcccatagca tagaagtaat caagtagcat ctgagactgt tggaggcact agggcctctc 3481 tgggccctac agcctcactt ccccagcctc accttgctgt cctctgacac tgccatcagg 3541 gctgttagtg gcacctgtat gaggccaagt gtgcgtccag gggaacagca caggttaatg 3601 cgtctcccta gaactcatga agtcagttta attcatgcat gaacatgagt tcattttatg 3661 ttttatatag ctttcttaga cataccaaac catcattcat aaatcagata aattattcag 3721 tttttgtgtt tagaaagcta agtatgtgta gctggaaaca aaaatgagcg tgttttctct 3781 cctgttaatc tagagtgtgc agttacacat gtgtggataa tttcatgttc caggggcgct 3841 tggcatctcc catggactga ttcccaggaa gaaaagccca aagggaaacc cacgattcct 3901 ttcgagtaga tgtgggaaag agcccattgg aggatatgag gtcctgtgaa attcagttgt 3961 gtgtgtggct ccttgttagc agtcatgttg acatggtgtt aggaggctcc ccatccaccc 4021 tttacatgat gtagggacca gtgtcttgtg agattaacct tgggacacag tgggttagcc 4081 tggagaaaat gagaggccct gcctggaccc agggagagga gccagtgaca caggcagagc 4141 ggtgcagccc tccttccctt ccatttggag gaggtggtgc caggagcctg cccgcttacc 4201 tctgctgaag cataagtgga ctttgctttt ggggcttatc tctgatacat gctggagccc 4261 tgcctctcca ctgctagatg gaacctggaa tctctcatct acctcttagt ctgtcagttt 4321 ctacgtgtga gaagcaagct tgtgggccag tgtccttgta catgctgtag cacttaaaaa 4381 ataattccag ggttccctgg aaaaccagtc ccagggttcc tatgatctgt agtttctacc 4441 tggattataa ctggttttgg gtacctgaat tttgattggt tagccttaat tatagtctgg 4501 cgtgatcatg tagaatcttt tctggtgaac agatcataaa gttctatcaa ggagttctat 4561 caaggcatcc atgtcagtgg tgctatgctg gttacaactt gagatttttg aaataaaaaa 4621 tttgtcatat t // LOCUS AB002353 6657 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0355 gene, complete cds. ACCESSION AB002353 NID g2224650 KEYWORDS KIAA0355. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG1881. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6657) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..6657 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG1881" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 839..4051 /gene="KIAA0355" CDS 839..4051 /gene="KIAA0355" /codon_start=1 /db_xref="PID:d1021651" /db_xref="PID:g2224651" /translation="MYCCSAQDSKMDYKRRFLLGGSKQKVQQHQQYPMPELGRALSAP LASTATTAPLGSLTAAGSCHHAMPHTTPIADIQQGISKYLDALNVFCRASTFLTDLFS TVFRNSHYSKAATQLKDVQEHVMEVASRLTSAIKPEIAKMLMELSAGAANFTDQKEFS LQDIEVLGRCFLTVVQVHFQFLTHALQKVQPVAHSCFAEVIVPEKKNSGSGGGLSGMG HTPEVEEAVRSWRGAAEATSRLRERGCDGCLAGIEVQQLFCSQSAAIPEHQLKELNIK IDSALQAYKIALESLGHCEYAMKAGFHLNPKAIEASLQGCCSEAEAQQTGRRQTPPQP MQCELPTVPVQIGSHFLKGVSFNESAADNLKLKTHTMLQLMKEAGCYNGITSRDDFPV TEVLNQVCPSTWRGACKTAVQLLFGQAGLVVVDTAQIENKEAYAPQISLEGSRIVVQV PSTWCLKEDPATMSLLQRSLDPEKTLGLVDVLYTAVLDLNRWRAGREQALPCIQIQLQ REICDFGNQADLPSGNGNKSSGGLQKTFSKLTSRFTKKASCTSSSSSTNYSIQNTPSK NIFIAGCSEEKAKMPGNIDTRLQSILNIGNFPRTTDPSQSAQNSSNTVANGFLMERRE NFLHGDDGKDEKGMNLPTDQEMQEVIDFLSGFNMGQSHQGSPLVTRHNSAATAMVTEQ KAGAMQPQQPSLPVPPPPRAPQAGAHTPLTPQPGLAPQQQSPKQQQPQVQYYQHLLQP IGPQQPPPQPRAPGKWVHGSSQQPAQAVGAGLSPLGQWPGISDLSSDLYSLGLVSSYM DNVMSEVLGQKPQGPRNNTWPNRDQSDGVFGMLGEILPFDPAVGSDPEFARYVAGVSQ AMQQKRQAQHGRRPGNPRGNWPPMDDAHRTWPFPEFFTEGDGLHGGWSGAQGDSASSS DETSSANGDSLFSMFSGPDLVAAVKQRRKHSSGEQDTSTLPSPPLLTTVEDVNQDNKT KTWPPKAPWQHPSPLPSTLPSPSAPLYAVTSPGSQWNDTMQMLQSPVWAATNDCSAAA FSYVQTPPQPPPPPAHKAAPKGFKAFPGKGERRPAYLPQY" BASE COUNT 1727 a 1653 c 1616 g 1661 t ORIGIN 1 agccggtcca aggcggtgcg ctgggggccg gggcgcgtcg caggaaagga cttagaagcc 61 ttacaaatac atctgtgcat tcttgcttca gactttacaa ctgagggcca gcccagtctg 121 gaagcatctc ttattaatgt tacaaggaaa ccgctacctc agcaaacaaa aggaatggag 181 gaggagactt acaacaaacg ccatttaaaa aaaaaagaaa aacttgtttt caggaaacat 241 tagaggaaat ttggagaatt tcagcattgc atagaagcag ccttacaggt aaacggattt 301 gacgggcagg ctctgtcaaa attctgcaga agtttgatct tccttcttaa ggactttgtg 361 tgaagtaatt ctttactgct tttaaattgc tgttgatcag cttctgggtt tttgggttct 421 aacttttttg ggtatattga agacacagtg tattacatta tattacattt ttaaacgtta 481 agttattttt ctgccattgt aaatacaagt atcaaaatat tcttgcaaag aagtaaacat 541 ttttctcagc aagcatatcc ttttagtaag agagtggaat gctaaacagt tcttatccag 601 cattttggac atcttttatt ttttgtcaga gatcctgtct acactgaaaa tattaattat 661 ataaacctgt tgtctctcac ctctacattg gatcacatgg tcacctgcct catggaaatg 721 ccttttttaa aacttcgatt tgcagaactc cactattttt atacctagct acagttttga 781 gaaagaagaa tcagaaccct gacccactta cggttgctgg gacaattccc cctcccgcat 841 gtattgctgc agtgcccagg acagtaaaat ggactacaag cggcgcttcc tgcttggcgg 901 gtccaagcag aaggtgcagc agcaccagca atacccgatg cctgagctgg gccgagcact 961 gagtgctccc ctggcatcca cggccaccac tgcccccctg ggcagtctga ccgctgcagg 1021 cagctgccac catgccatgc cccacactac tcctatcgcc gacatccagc agggcatctc 1081 caagtatctg gatgccctga acgtcttctg ccgtgccagt actttcctca cagatctctt 1141 cagcactgtg ttcaggaact ctcactactc aaaggcagcc acacagctca aagatgtgca 1201 ggagcatgtc atggaagtag ccagtcggct gacctcggcc ataaagcctg agatcgccaa 1261 gatgctaatg gaacttagtg ctggggctgc aaattttacg gatcagaagg aattcagtct 1321 ccaggacatt gaggtgttgg ggcgatgttt cctgactgtg gtgcaagtcc atttccagtt 1381 tttgactcat gcgttacaga aggtccagcc ggtggctcac tcttgctttg ctgaggtcat 1441 cgtgccagaa aaaaagaaca gcggcagtgg cggcggctta tctggcatgg gccacacacc 1501 tgaagtagag gaagctgtgc ggtcctggcg gggggctgct gaggcgacat ctagactaag 1561 agaaagaggc tgtgatggtt gcctggcagg aattgaagtt caacaactct tttgttctca 1621 aagtgcagca attcctgagc accagctaaa agaactgaac ataaaaatcg acagtgcttt 1681 gcaagcatat aagatagctc tggaaagctt aggacactgt gaatatgcaa tgaaagccgg 1741 cttccacctg aatccaaagg cgattgaagc aagtttgcag ggctgctgca gcgaggcgga 1801 agcccagcag acggggcgga ggcagacacc cccgcagccc atgcagtgtg agctccccac 1861 cgtccctgtg cagataggat cgcacttcct gaagggcgtc tcctttaatg agtcggccgc 1921 cgacaatctg aaacttaaga cgcatacaat gttacagctg atgaaggagg caggctgcta 1981 taatggaatc acatccaggg atgattttcc tgtgactgaa gtgctgaacc aggtttgccc 2041 ttccacatgg cgaggtgcct gcaagacggc ggtgcagctg ctgtttggcc aggctggact 2101 ggtggtggtt gacacagcac agattgagaa taaagaagcc tatgcccccc agatcagttt 2161 agaaggctct agaatcgtgg ttcaagtccc atccacatgg tgcctgaaag aagaccctgc 2221 taccatgtcc ctgctgcaga gaagccttga tcctgagaag accctgggtc tagtggacgt 2281 gctctacaca gctgtgctgg acctaaaccg ctggagggct ggaagggagc aagctttacc 2341 ctgcatacag atccagctgc aaagggagat ctgtgatttt ggcaaccagg ctgacctgcc 2401 ttctggaaat ggaaacaaat cttcaggtgg cctgcagaag acattctcca aactgacatc 2461 ccggttcacc aagaaagctt catgtaccag ctccagcagc agcacaaatt attccatcca 2521 aaatacccct tccaaaaaca tcttcatagc tggatgttcc gaagagaagg ccaaaatgcc 2581 tggcaatatt gatacaaggt tacaaagcat tttgaacatt ggtaatttcc ccaggactac 2641 agacccttca cagtcagctc agaattccag taatacagtg gccaatggct ttctcatgga 2701 gaggcgtgag aacttcctgc atggagatga cggcaaggat gagaagggta tgaacttacc 2761 aactgatcag gaaatgcaag aggtgataga ttttctctcg ggctttaaca tgggccagtc 2821 acatcagggc tctccgttgg tgacaaggca taattctgct gccacagcca tggtgactga 2881 gcagaaggca ggagccatgc aaccacagca gccgtcactg cctgtgcccc ctccaccacg 2941 ggcaccccag gctggggcac acacacctct gacaccccag ccgggactgg cacctcagca 3001 gcagtcccca aagcagcaac aacctcaagt ccaatactac caacacctac tccagcccat 3061 tggaccgcag cagcccccgc cccagcctcg ggcacctggg aaatgggtac atggctcatc 3121 ccagcagcca gcgcaggctg ttggagcagg tctgtctcct cttggtcagt ggcctggcat 3181 atctgatctc agttctgact tgtacagctt gggtctggtg agcagctata tggataatgt 3241 gatgtcagag gttctgggac agaagccgca gggacctaga aataacacct ggcccaaccg 3301 tgaccaaagt gatggagtct ttggaatgct gggagagatt ctgccttttg atcctgcagt 3361 gggctcagac ccagagtttg cacgctatgt ggcaggagtg agccaggcga tgcagcagaa 3421 gcggcaggcc cagcacggtc gccggccagg caacccccgg ggcaactggc cgcctatgga 3481 tgacgcgcat cggacctggc ccttccccga gttcttcaca gaaggggatg gcctgcacgg 3541 tggctggtcg ggtgctcagg gagactctgc cagctcgagt gatgagacat cctcagccaa 3601 cggggacagc ttgttctcca tgttttcagg gcctgacctc gttgctgctg tcaagcagag 3661 aaggaaacac agcagtggag agcaagacac cagcacgctg ccctcaccac ctctcctcac 3721 cacggtggag gatgtgaacc aggataacaa aaccaaaacg tggccaccca aagcaccctg 3781 gcagcaccct tccccgcttc ccagcacgct gcccagcccc agcgcaccac tctatgcagt 3841 caccagccct ggcagccagt ggaacgacac catgcagatg ctgcagtccc cagtgtgggc 3901 cgcaaccaac gactgcagtg ccgctgcctt ctcctatgtg cagaccccac cccagccccc 3961 acccccacca gcacacaagg cagcacccaa gggcttcaag gccttccctg ggaagggtga 4021 gcgcaggcca gcctatctgc cccagtactg accccaggcc agccagcctg cctgcctgcc 4081 tgcctgcccg cccagagctg tggggatgag tgtccccacc ccagggccac ttagctgaca 4141 ccagcccctc agaggaccag tgcgccccat cccagggagg gttccttggg gacaagggtg 4201 gttggcagct ccaagccttt aaacctggct tctgaaacga tggcatcaga gccctggaga 4261 gccagctgga gacacaggcg tctggccttc aggggcttgc tagggaacct gcatgcctag 4321 taagcgccac aggtgactct gatgcaggcg ccacagccac acttgaagaa acacagctct 4381 tgggttttta gtcctgctgt gtgttgggaa gacatcaggc ctgaagctga gggcataact 4441 gaccattttt tggaaaccct ctcctccctc ctccacacct tgagtgatga ccacaccaat 4501 cactgtattt tatagctttt tttttcatgt aggtttttag ttaaaacatc tcctgcctaa 4561 aatgcattga atattttaag ataacagata tactggctgg aggtttgttt aacactatct 4621 atatttaagc ttatacaaaa tgggcaaaat atagaatatt tgtgattgga agcagtcacc 4681 tggggtttct gggggtggac agttcctcgc cacccagcag caccctggga ctgcggcctt 4741 tcccagcttt attgaagcag aatggtggaa cttgtgccgg aggtacactg ctgaaggtgc 4801 gtggcggtgg accagccagc tgctgtccat gtgcagagca aggctgcacc tgctgccctt 4861 cgatccttcc acacatggcc aggacactgc cacaatcctc ggggtgtggt caaggggcac 4921 tcagagacac ctgcactaga aattgcattg acattgtgag ctggctcaga agacaaacca 4981 attaagatgt agatagaaat taatttaagg tcttttctta aaaaaaaaat ccacctcatt 5041 ttcagttaac atgtgccatt aaatagataa catcctgtgg atttagggat attttccagc 5101 cagaatggat ccagaagaat tgaatggtgc ttaaattgag aaataataat aaatatatct 5161 atatagaata gacatatccc actgtatatt aattgaggtt acagaaagtt ctttatataa 5221 aacttattta aatttttcat atttcatctt tgaaaaagtc tgagaaaaat ccataatatt 5281 ttctggtatg aaagtttgac agtattaata ttttttttat attttcttaa atcattaaac 5341 cattttaata tatgttaact actaataaat ggtttattct ttctaactcc atataagctt 5401 ttccagcaaa gattgtaata acacatttat gttctcgttt ttctacagat ataagtaaat 5461 ttatatataa aaataccaaa aagaggctgg gcgtggtggc tcacgcctgt aatcccagca 5521 ctttgggagg ccgaggcagg tggatcacct aaggtcagga gttcgagacc agcctggcca 5581 atatggtgaa accccatctc tactaaaaat acaaaaatta gccgggtgtg gtggcgtgcg 5641 cctgtagtcc cagctactcg ggaggctgaa gcagaagaat cgcttgaacc cgggaggcgg 5701 aggttacagt gagctgagat tgtgccactg cactccagcc tgggtgacag agtgagattc 5761 cgtctcaaca agaaaaaaaa aaaagaatat atatatatgt atgtatgtat gtatgtaaac 5821 acacacacta atttgagagg acccgtaggg tgtctgagcc cagccacctg agtttttagt 5881 actgtgttgt caggctcttc ccaggcctca ggtgttgtct tttgtgctgt gtggggatgc 5941 attgctgcct gtatttatga tcttttgccg tggttctgag cattcacctc accatgttta 6001 caaagaactg ttttgtatat agacattttc aggcacgtgc tttgcaccaa ccctgcgtgg 6061 ctcttgtctg tgttagctgt cacggtgtgc acactaatct ctgttaaagt tgtctatggc 6121 tgttctactt gtaagatagt tttctatttc cttcagtaat gtgtccacag taccctgtat 6181 ttcgagttcc attatactga agtactcatg ttttaatagt gcctctccaa aggcctcacc 6241 ttggacagag gtcaatcctt gatgctccag cacaggtgac gtcactaatt gtcactttcc 6301 agtttgtttt tctctattaa ggaagacatt ttctaattgc atctccatgg gctgtgagac 6361 tgtgtgaagc cgtttgtgtg gtctccatgt aggtgctgtg ttcccggcac cgccttgctc 6421 tgaacactgg taattccagg tgctgcgctt ggcagagggg tctcgccaaa gcgcatgtgt 6481 gtgcatgtgt gaacgtgtgt gtcctttgca tggttgggcg tgggtgcctt gctagggcat 6541 cagcaggaca ttgtgtgtat agttacaatg cttccaaact ggaactctac attttgtatc 6601 ttttaaagct cctataagta aaataactat tggctttatt aaaaatatac atttaat // LOCUS AB002354 5371 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0356 gene, complete cds. ACCESSION AB002354 NID g2224652 KEYWORDS KIAA0356. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HH0006. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5371) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..5371 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HH0006" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 637..3417 /gene="KIAA0356" CDS 637..3417 /gene="KIAA0356" /codon_start=1 /db_xref="PID:d1021652" /db_xref="PID:g2224653" /translation="MECYLKLLLQEQARLHEYYQPTALLRDAEEGEFLLSFLQGLTSL SFELSYKSAILNEWTLTPLALSGLCPLSELDPLSTSGAELQRKESLDSISHSSGSEDI EVHHSGHKIRRNQKLTASSLSLDTASSSQLSCSLNSDSCLLQENGSKSPDHCEEPMSC DSDLGTANAEDSDRSLQEVLLEFSKAQVNSVPTNGLSQETEIPTPQASLSLHGLNTST YLHCEAPAEPLPAQAASGTQDGVHVQEPRPQAPSPLDLQQPVESTSGQQPSSTVSETA REVGQGNGLQKAQAHDGAGLKLVVSSPTSPKNKSWISEDDFYRPSREQPLESASDHPI ASYRGTPGSRPGLHRHFSQEPRKNCSLGALDQACVPSPGRRQAQAAPSQGHKSFRVVH RRQMGLSNPFRGLMKLGTVERRGAMGIWKELFCELSPLEFRLYLSNEEHTCVENCSLL RCESVGPAHSDGRFELVFSGKKLALRASSQDEAEDWLDRVREALQKVRPQQEDEWVNV QYPDQPEEPPEAPQGCLSPSDLLSEPAALQGTQFDWSSAQVPEPDAIKESLLYLYMDR TWMPYIFSLSLEALKCFRIRNNEKMLSDSHGVETIRDILPDTSLGGPSFFKIITAKAV LKLQAGNAEEAALWRDLVRKVLASYLETAEEAVTLGGSLDENCQEVLKFATRENGFLL QYLVAIPMEKGLDSQGCFCAGCSRQIGFSFVRPKLCAFSGLYYCDICHQDDASVIPAR IIHNWDLTKRPICRQALKFLTQIRAQPLINLQMVNASLYEHVERMHLIGRRREQLKLL GDYLGLCRSGALKELSKRLNHRNYLLESPHRFSVADLQQIADGVYEGFLKALIEFASQ HVYHCDLCTQRGFICQICQHHDIIFPFEFDTTVRCAECKTVFHQSCQAVVKKGCPRCA RRRKYQEQNIFA" BASE COUNT 1090 a 1664 c 1529 g 1088 t ORIGIN 1 ccgagagacc gcatcgtcgg ctcggaggct gaggggctgc cgcggccggg agcgcccctc 61 gcctcgctcc tcgctccgct tggtgtcatg tgattctctg agggagcagc tgcgtgagtg 121 gagatgcttt cagtggtgga gaatggactg gacccccagg ctgccatccc ggtcatcaag 181 aagaagctgg tgggatccgt gaaggccttg cagaagcagt acgtgtccct ggacacggtg 241 gtcactagtg aagacggaga tgccaacacc atgtgcagcg ccctggaggc cgtatttatc 301 catggcctgc acgccaagca catccgagct gaggccggag gaaaaaggaa gaaaagtgcc 361 caccagaagc ctctgcccca gcctgtcttc tggcccctcc tgaaagctgt cacccacaag 421 tgagatttag ctggagaggt tttgctttgc ggaggagcag cagactatgg gacacttggc 481 ttttcctccc cagctgttca ggaagccaac agaggtgccc tgtgttctag catagagaga 541 ggacacatca tctcagagtt ggagcacctg acgtttgtca acacggatgt gggccgctgc 601 cgggcatggc tgcggctggc cctgaacgat ggcctgatgg agtgctacct gaagctgctg 661 ctgcaggagc aggcccgctt gcatgagtac taccagccca ccgccctgct ccgggatgct 721 gaggagggcg agttcctcct tagcttcctg cagggcctca cgtccttgtc cttcgaactc 781 tcctacaagt ctgccatctt aaatgagtgg acgctcaccc cattggccct gtctgggctt 841 tgcccgcttt ctgagctgga ccctctctct acctctggtg cagaactaca gcggaaggaa 901 tctctggatt ccatttccca ttcttcaggc tctgaagaca tcgaagtcca tcactcgggc 961 cataagatac ggaggaacca gaagctgact gcctcctccc tcagcctgga cacggccagt 1021 tcatcccagc tgtcctgcag cctaaactct gatagctgct tactccaaga gaatggctcc 1081 aagagtccag accattgcga ggagcccatg tcctgtgact cagacctggg cacagcaaat 1141 gctgaggact cagaccggtc tctgcaagag gtattgttgg aattcagcaa agcccaggta 1201 aactctgtgc caaccaacgg actgagccaa gaaacagaga tccccacacc acaggcctcg 1261 ctctccctcc atggcctcaa caccagcaca tacctgcact gtgaggcacc tgcagagccc 1321 cttcctgccc aggcagcctc tggaactcaa gatggtgtcc acgtgcagga gccgcgtccc 1381 caggcgccca gccccctgga cttacagcag cctgtagaga gcacctcagg ccagcagcct 1441 tctagtactg tcagcgagac agccagagaa gtgggccaag ggaatggcct gcagaaggcc 1501 caggctcatg acggagctgg tctgaagctg gtagtttcct cacccaccag tccgaaaaac 1561 aagagctgga tctcagagga tgacttctac cggccttccc gggagcaacc cctggagagt 1621 gcttcagacc acccaatagc ttcttacagg gggactccag ggtcaaggcc tggtctccac 1681 aggcattttt ctcaagaacc aagaaaaaac tgctccctgg gggcgttaga ccaagcgtgt 1741 gtaccttccc caggaagaag gcaagcccag gcagccccat cccaggggca taagagcttc 1801 cgggtggtac accggagaca gatgggactg tccaacccat tccggggtct catgaagctg 1861 ggcaccgtgg agcggcgggg ggcaatgggc atctggaagg agctcttctg cgagctctcc 1921 ccgctggagt tccgcctcta cctgagcaac gaggagcaca cctgtgtgga gaactgctcg 1981 ctgcttcgct gtgagtctgt ggggccagcc catagtgatg ggcgctttga gctggtcttc 2041 tctggcaaga agctggccct gcgcgcctcc tcccaggacg aagctgagga ctggctggac 2101 cgggtgcggg aggccctgca gaaggtccgg cctcagcagg aggatgagtg ggtgaacgtg 2161 cagtacccag accagcctga ggaacccccc gaggcgcccc agggctgcct ctctccctca 2221 gacctgctct cggagcccgc ggccctccag ggcacacagt ttgactggtc gtccgcccag 2281 gttccagagc cagatgccat caaggagtcc ctgctgtact tgtacatgga caggacctgg 2341 atgccctata tattttctct gtccttggag gctctgaaat gtttccgcat caggaacaat 2401 gagaagatgc tgagtgacag ccacggcgtg gagaccatcc gggacatcct gccagacacc 2461 agccttgggg gcccatcctt cttcaaaatc atcacggcca aggctgtcct gaagctgcag 2521 gccggaaacg ccgaggaagc cgccctgtgg agggatctgg tccgcaaagt cctggcatcc 2581 tacttggaga cagccgagga ggcggtgacc ctgggcggga gcctggatga aaactgtcag 2641 gaggtgctga aatttgccac ccgggagaat ggcttcctgc tgcagtacct ggtggctatc 2701 cccatggaga aaggccttga ctcccaaggc tgcttctgcg caggctgctc ccggcagatc 2761 ggcttctcct ttgtacgacc caagctctgt gccttctctg gcctctatta ctgtgacatc 2821 tgccaccaag acgatgcctc agtgattccg gccaggatca tccacaactg ggacctcacc 2881 aagcgcccga tctgcaggca ggccctgaag tttctgacac agatccgggc ccagcccctc 2941 atcaacctgc agatggtgaa cgcgtctctg tacgagcatg tggagcggat gcacctcatt 3001 gggaggagac gggagcagct gaagctcctg ggggattacc tgggcctgtg ccggagtggc 3061 gccctgaagg agctcagcaa gaggctcaac cacaggaatt atctcttgga atctccgcat 3121 aggttcagtg ttgctgacct ccaacagatc gcagacgggg tgtatgaagg attcctcaag 3181 gccctgattg aatttgcctc ccagcatgtc taccactgcg acctgtgcac ccagcgcggc 3241 ttcatctgcc agatctgcca gcaccacgac atcatcttcc cctttgagtt tgacaccaca 3301 gtcaggtgtg ccgagtgcaa gaccgtcttc caccagagct gccaggctgt ggtgaagaag 3361 ggctgccccc gctgtgcccg ccggcgcaag taccaggaac agaacatttt cgcctgatgc 3421 ccatctgctg accccgctct gaaagccggg ggtgagtgtg gctcagccat cccggctggg 3481 tttgccatca gcccaggata ctcaccgtgt cacagctgtg tccccttgtc aggaagaccc 3541 tcagatgtgg ccagagcacc ggcctcccag agaaagccca ggaagtcccc gtggtgtccc 3601 ctggccctcc ccccacccct ctgctgcagg aggcgtggcc accaccagat gctccctgtc 3661 tggggcaggc agggccgttt actcaccagg ccctggctca ccagcgtccc ggcgcctccg 3721 ttagggctgc ccaaacactg tggaaagggg acttggggag cattttcaat tccacattcc 3781 tgattaaaag gtctagtttt ttaaggtggg tttttcccat tcatatttca acagaaagta 3841 tttttttatt cttgacccta cgtgagtcac ccatgaaatc cagacattct cacccccagc 3901 ggatgcacag gagccactca ggcctggctt agaggtagcg ggtggctggt gggcaggcca 3961 ctgctgggac agggtctatg ggcccaccca gtgcctttcc ttggccctca ctgccagtgc 4021 caaaccgccc ctcagcaggg cagggccagt cctggcgacc ccggttcctc tgtgtcctct 4081 ttactcccca ccccagtgcg tctcgcctgc tgagcccacc gctttctcag cgtcagagcc 4141 cctagctctt ctctagcctg gggccttggc cccattagga gaaggagaaa tgactggaaa 4201 ccagcactgt cagcactttg agccctcctg ggtttatgga gggtcgccgt cttgcccacc 4261 cccgatcccc gtgcgctcct cacccctcta cagacagggg cgctgctgca gatgctccag 4321 gcccaccttg gtgggtgccc ggggtcacac tctctgcctg actgacccct gccagggcac 4381 agataggcag gaccccagag agggcaggca gccacgccct gcactgtgga gaagtgacct 4441 ggtcctaagc tgtgcagggc gtgcggggga actgccctgt ggatgcagcc accacctcac 4501 aaagccatga attcccaaag catcactgaa cttgcccagc cgggacactg gagactgaag 4561 aagttccccg gagcaggact gcctttgaag gggccttggg gctggggtgg gagtctggaa 4621 ttgttttcaa agtgatggtc agaccctcca ggcttcgggg aaatgtgggt ttgctggtct 4681 ctctgtcgcc ttcccccacc tctcctggac ctagcaggcc ctcagtccta gctgtcttac 4741 ctgcgctcag cttggagtca ggcctcaggt ggagagccgc atgtcactgt cagagatgag 4801 cccaccccca cccttccctg ggggcgatta gctgctgagt cacccaccca gccttggagc 4861 tggatgcctg tgcagcagaa cagaggcaac ggtggcaggc aggtggaggc tgcgcacagc 4921 ttctcccggt catggcacac acctgtgccc agacgccaat gcctcctaag ccaggagctg 4981 ggttatgcac cgccatggcc cacattaacg ctccaccaca agcccagcct agtcagagcc 5041 ctcattgccc catccctgag tgggcaggag cctggcctgg ggagggcttg ggcctgtgag 5101 gattctcccc gaaggtgccc tcggaggcag caggtttgag gccctggggg gcccgttccc 5161 cagaagctgc cagtgctttc agatgcattg actcttcccg cctcccctcc ctcagaaagg 5221 tagtgcccac actattttta aaatgttatt ttatgcaata cactgttcct agtgagcagg 5281 gctctggccc tgaggagtct ttgcaatgta ttgaaggaat tgctgccgtg tgagttttga 5341 atgattttgc agtaaagacc tgatctttct c // LOCUS AB002356 5942 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0358 gene, complete cds. ACCESSION AB002356 NID g2224656 KEYWORDS KIAA0358. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HH0017. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5942) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..5942 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HH0017" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 188..4933 /gene="KIAA0358" CDS 188..4933 /gene="KIAA0358" /codon_start=1 /db_xref="PID:d1021654" /db_xref="PID:g2224657" /translation="MVQKKKFCPRLLDYLVIVGARHPSSDSVAQTPELLRRYPLEDHT EFPLPPDVVFFCQPEGCLSVRQRRMSLRDDTSFVFTLTDKDTGVTRYGICVNFYRSFQ KRISKEKGEGGAGSRGKEGTHATCASEEGGTESSESGSSLQPLSADSTPDVNQSPRGK RRAKAGSRSRNSTLTSLCVLSHYPFFSTFRECLYTLKRLVDCCSERLLGKKLGIPRGV QRDTMWRIFTGSLLVEEKSSALLHDLREIEAWIYRLLRSPVPVSGQKRVDIEVLPQEL QPALTFALPDPSRFTLVDFPLHLPLELLGVDACLQVLTCILLEHKVVLQSRDYNALSM SVMAFVAMIYPLEYMFPVIPLLPTCMASAEQLLLAPTPYIIGVPASFFLYKLDFKMPD DVWLVDLDSNRVIAPTNAEVLPILPEPESLELKKHLKQALASMSLNTQPILNLEKFHE GQEIPLLLGRPSNDLQSTPSTEFNPLIYGNDVDSVDVATRVAMVRFFNSANVLQGFQM HTRTLRLFPRPVVAFQAGSFLASRPRQTPFAEKLARTQAVEYFGEWILNPTNYAFQRI HNNMFDPALIGDKPKWYAHQLQPIHYRVYDSNSQLAEALSVPPERDSDSEPTDDSGSD SMDYDDSSSSYSSLGDFVSEMMKCDINGDTPNVDPLTHAALGDASEVEIDELQNQKEA EEPGPDSENSQENPPLRSSSSTTASSSPSTVIHGANSEPADSTEMDDKAAVGVSKPLP SVPPSIGKSNVDRRQAEIGEGSVRRRIYDNPYFEPQYGFPPEEDEDEQGESYTPRFSQ HVSGNRAQKLLRPNSLRLASDSDAESDSRASSPNSTVSNTSTEGFGGIMSFASSLYRN HSTSFSLSNLTLPTKGAREKATPFPSLKVFGLNTLMEIVTEAGPGSGEGNRRALVDQK SSVIKHSPTVKREPPSPQGRSSNSSENQQFLKEVVHSVLDGQGVGWLNMKKVRRLLES EQLRVFVLSKLNRMVQSEDDARQDIIPDVEISRKVYKGMLDLLKCTVLSLEQSYAHAG LGGMASIFGLLEIAQTHYYSKEPDKRKRSPTESVNTPVGKDPGLAGRGDPKAMAQLRV PQLGPRAPSATGKGPKELDTRSLKEENFIASIELWNKHQEVKKQKALEKQRPEVIKPV FDLGETEEKKSQISADSGVSLTSSSQRTDQDSVIGVSPAVMIRSSSQDSEVSTVVSNS SGETLGADSDLSSNAGDGPGGEGSVHLASSRGTLSDSEIETNSATSTIFGKAHSLKPS IKEKLAGSPIRTSEDVSQRVYLYEGLLGRDKGSMWDQLEDAAMETFSISKERSTLWDQ MQFWEDAFLDAVMLEREGMGMDQGPQEMIDRYLSLGEHDRKRLEDDEDRLLATLLHNL ISYMLLMKVNKNDIRKKVRRLMGKSHIGLVYSQQINEVLDQLANLNGRDLSIWSSGSR HMKKQTFVVHAGTDTNGDIFFMEVCDDCVVLRSNIGTVYERWWYEKLINMTYCPKTKV LCLWRRNGSETQLNKFYTKKCRELYYCVKDSMERAAARQQSIKPGPELGGEFPVQDLK TGEGGLLQVTLEGINLKFMHNQFLKLKKW" BASE COUNT 1413 a 1576 c 1647 g 1306 t ORIGIN 1 ctggcgcccg ctcggagcgg cgccgcgctg gggagcgact gacgccccgc tgccggggga 61 cgtcgggctg ggcctggcca gtcccccaga gcttgggaga cttcgatttt cagaattcct 121 cctgggaatg ctgactcctt gcttggtgcc ctgatgcttc tctgagataa actgatgaat 181 tggaaccatg gtgcaaaaga agaagttctg tcctcggtta cttgactatc tagtgatcgt 241 aggggccagg cacccgagca gtgatagcgt ggcccagact cctgaattgc tacggcgata 301 ccccttggag gatcacactg agtttcccct gcccccagat gtagtgttct tctgccagcc 361 cgagggctgc ctgagcgtgc ggcagcggcg catgagcctt cgggatgata cctcttttgt 421 cttcaccctc actgacaagg acactggagt cacgcgatat ggcatctgtg ttaacttcta 481 ccgctccttc caaaagcgaa tctctaagga gaagggggaa ggtggggcag ggtcccgtgg 541 gaaggaagga acccatgcca cctgtgcctc agaagagggt ggcactgaga gctcagagag 601 tggctcatcc ctgcagcctc tcagtgctga ctctacccct gatgtgaacc agtctcctcg 661 gggcaaacgc cgggccaagg cggggagccg ctcccgcaac agtactctca cgtccctgtg 721 cgtgctcagc cactaccctt tcttctccac cttccgagag tgtttgtata ctctcaagcg 781 cctggtggac tgctgtagtg agcgccttct gggcaagaaa ctgggcatcc ctcgaggcgt 841 acaaagggac accatgtggc ggatctttac tggatcgctg ctggtagagg agaagtcaag 901 tgcccttctg catgaccttc gagagattga ggcctggatc tatcgattgc tgcgctcccc 961 agtacccgtc tctgggcaga agcgagtaga catcgaggtc ctaccccaag agctccagcc 1021 agctctgacc tttgctcttc cagacccatc tcgattcacc ctagtggatt tcccactgca 1081 ccttcccttg gaacttctag gtgtggacgc ctgtctccag gtgctaacct gcattctgtt 1141 agagcacaag gtggtgctac agtcccgaga ctacaatgca ctctccatgt ctgtgatggc 1201 attcgtggca atgatctacc cactggaata tatgtttcct gtcatcccgc tgctacccac 1261 ctgcatggca tcagcagagc agctgctgtt ggctccaacc ccgtacatca ttggggttcc 1321 tgccagcttc ttcctctaca aactggactt caaaatgcct gatgatgtat ggctagtgga 1381 tctggacagc aatagggtga ttgcccccac caatgcagaa gtgctgccta tcctgccaga 1441 accagaatca ctagagctga aaaagcattt aaagcaggcc ttggccagca tgagtctcaa 1501 cacccagccc atcctcaatc tggagaaatt tcatgagggc caggagatcc cccttctctt 1561 gggaaggcct tctaatgacc tgcagtccac accgtccact gaattcaacc cactcatcta 1621 tggcaacgat gtggattctg tggatgttgc aaccagggtt gccatggtac ggttcttcaa 1681 ttccgccaac gtgctgcagg gatttcagat gcacacgcgt accctgcgcc tctttcctcg 1741 gcctgtggta gcttttcaag ctggctcctt tctagcctca cgtccccggc agactccttt 1801 tgccgagaaa ttggccagga ctcaggctgt ggagtacttt ggggaatgga tccttaaccc 1861 caccaactat gcctttcagc gaattcacaa caatatgttt gatccagccc tgattggtga 1921 caagccaaag tggtatgctc atcagctgca gcctatccac tatcgcgtct atgacagcaa 1981 ttcccagctg gctgaggccc tgagtgtacc accagagcgg gactctgact ccgaacctac 2041 tgatgatagt ggcagtgata gtatggatta tgacgattca agctcttctt actcctccct 2101 tggtgacttt gtcagtgaaa tgatgaaatg tgacattaat ggtgatactc ccaatgtgga 2161 ccctctgaca catgcagcac tgggggatgc cagcgaggtg gagattgacg agctgcagaa 2221 tcagaaggaa gcagaagagc ctggcccaga cagtgagaac tctcaggaaa accccccact 2281 gcgctccagc tctagcacca cagccagcag cagccccagc actgtcatcc acggagccaa 2341 ctctgaacct gctgactcta cggagatgga tgataaggca gcagtaggcg tctccaagcc 2401 cctcccttcc gtgcctccca gcattggcaa atcgaacgtg gacagacgtc aggcagaaat 2461 tggagagggg tcagtgcgcc ggcgaatcta tgacaatcca tacttcgagc cccaatatgg 2521 ctttccccct gaggaagatg aggatgagca gggggaaagt tacactcccc gattcagcca 2581 acatgtcagt ggcaatcggg ctcaaaagct gctgcggccc aacagcttga gactggcaag 2641 tgactcagat gcagagtcag actctcgggc aagctctccc aactccaccg tctccaacac 2701 cagcaccgag ggcttcgggg gcatcatgtc ttttgccagc agcctctatc ggaaccacag 2761 taccagcttc agtctttcaa acctcacact gcccaccaaa ggtgcccgag agaaggccac 2821 gcccttcccc agtctgaaag tatttgggct aaatactcta atggagattg ttactgaagc 2881 cggccccggg agtggtgaag gaaacaggag ggcgttagtg gatcagaagt catctgtcat 2941 taaacacagc ccaacagtga aaagagaacc tccatcaccc cagggtcgat ccagcaattc 3001 tagtgagaac cagcagttcc tgaaggaggt ggtgcacagc gtgctggacg gccagggagt 3061 tggctggctc aacatgaaaa aggtgcgccg gctgctggag agcgagcagc tgcgagtctt 3121 tgtcctgagc aagctgaacc gcatggtgca gtcagaggac gatgcccggc aggacatcat 3181 cccggatgtg gagatcagtc ggaaggtgta caagggaatg ttagacctcc tcaagtgtac 3241 agtcctcagc ttggagcagt cctatgccca cgcgggtctg ggtggcatgg ccagcatctt 3301 tgggcttttg gagattgccc agacccacta ctatagtaaa gaaccagaca agcggaagag 3361 aagtccaaca gaaagtgtaa ataccccagt tggcaaggat cctggcctag ctgggcgggg 3421 ggacccaaag gctatggcac aactgagagt tccacaactg ggacctcggg caccaagtgc 3481 cacaggaaag ggtcctaagg aactggacac cagaagttta aaggaagaaa attttatagc 3541 atctattgaa ttgtggaaca agcaccagga agtgaaaaag caaaaagctt tggaaaaaca 3601 gaggcctgaa gtaatcaaac ctgtctttga ccttggtgag acagaggaga aaaagtccca 3661 gatcagcgca gacagtggtg tgagcctgac gtctagttcc cagaggactg atcaagactc 3721 tgtcatcggc gtgagtccag ctgttatgat ccgcagctca agtcaggatt ctgaagttag 3781 caccgtggtg agtaatagct ctggagagac ccttggagct gacagtgact tgagcagcaa 3841 tgcaggtgat ggaccaggtg gcgagggcag tgttcacctg gcaagctctc ggggcacttt 3901 gtctgatagt gaaattgaga ccaactctgc cacaagcacc atctttggta aagcccacag 3961 cttgaagcca agcataaagg agaagctggc aggcagcccc attcgtactt ctgaagatgt 4021 gagccagcga gtctatctct atgagggact cctaggaagg gacaaaggat ccatgtggga 4081 ccagttagag gatgcagcta tggagacctt ttctataagc aaagagcgtt ctactttatg 4141 ggaccaaatg caattctggg aagatgcctt cttagatgct gtgatgttgg agagagaagg 4201 gatgggtatg gaccagggtc cccaggaaat gatcgacagg tacctgtccc ttggagaaca 4261 tgaccggaag cgcctggaag atgatgaaga tcgcttgctg gccacacttc tgcacaacct 4321 catctcctac atgctgctga tgaaggtaaa taagaatgac atccgcaaga aggtgaggcg 4381 cctaatggga aagtcgcaca ttgggcttgt gtacagccag caaatcaatg aggtgcttga 4441 tcagctggcg aacctgaatg gacgcgatct ctctatctgg tccagtggca gccggcacat 4501 gaagaagcag acatttgtgg tacatgcagg gacagataca aacggagata tctttttcat 4561 ggaggtgtgc gatgactgtg tggtgttgcg tagtaacatc ggaacagtgt atgagcgctg 4621 gtggtacgag aagctcatca acatgaccta ctgtcccaag acgaaggtgt tgtgcttgtg 4681 gcgtagaaat ggctctgaga cccagctcaa caagttctat actaaaaagt gtcgggagct 4741 gtactactgt gtgaaggaca gcatggagcg cgctgccgcc cgacagcaaa gcatcaaacc 4801 cggacctgaa ttgggtggcg agttccctgt gcaggacctg aagactggtg agggtggcct 4861 gctgcaggtg accctggaag ggatcaacct caaattcatg cacaatcagt tcctgaaatt 4921 aaagaagtgg tgagccacaa gtacaagaca ccaatggccc acgaaatctg ctactccgta 4981 ttatgtctct tctcgtacgt ggctgcagtt catagcagtg aggaagatct cagaaccccg 5041 ccccggcctg tctctagctg atggagaggg gctacgcagc tgccccagcc cagggcacgc 5101 ccctggcccc ttgctgttcc caagtgcacg atgctgctgt gactgaggag tggatgatgc 5161 tcgtgtgtcc tctgcaagcc ccctgctgtg gcttggttgg ttaccggtta tgtgtccctc 5221 tgagtgtgtc ttgagcgtgt ccaccttctc cctctccact cccagaagac caaactgcct 5281 tcccctcagg gctcaagaat gtgtacagtc tgtggggccg gtgtgaaccc actattttgt 5341 gtccttgaga catttgtgtt gtggttcctt gtccttgtcc ctggcgttat aactgtccac 5401 tgcaagagtc tggctctccc ttctctgtga cccggcatga ctgggcgcct ggagcagttt 5461 cactctgtga ggagtgaggg aaccctgggg ctcaccctct cagaggaagg gcacagagag 5521 gaagggaaga attggggggc agccggagtg agtggcagcc tccctgcttc cttctgcatt 5581 cccaagccgg cagctactgc ccagggcccg cagtgttggc tgctgcctgc cacagcctct 5641 gtgactgcag tggagcggcg aattccctgt ggcctgccac gccttcggca tcagaggatg 5701 gagtggtcga ggctagtgga gtcccaggga ccgctggctg ctctgcctga gcatcaggga 5761 gggggcagga aagaccaagc tgggtttgca catctgtctg caggctgtct ctccaggcac 5821 ggggtgtcag gagggagaga cagcctgggt atgggcaaga aatgactgta aatatttcag 5881 ccccacatta tttatagaaa atgtacagtt gtgtgaatgt gaaataaatg tcctcaactc 5941 cc // LOCUS AB002357 4724 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0359 gene, complete cds. ACCESSION AB002357 NID g2224658 KEYWORDS KIAA0359. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HH0048. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4724) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..4724 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HH0048" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 168..2411 /gene="KIAA0359" CDS 168..2411 /gene="KIAA0359" /codon_start=1 /db_xref="PID:d1021655" /db_xref="PID:g2224659" /translation="MSKLKSSESVRVVVRCRPMNGKEKAASYDKVVDVDVKLGQVSVK NPKGTAHEMPKTFTFDAVYDWNAKQFELYDETFRPLVDSVLQGFNGTIFAYGQTGTGK TYTMEGIRGDPEKRGVIPNSFDHIFTHISRSQNQQYLVRASYLEIYQEEIRDLLSKDQ TKRLELKERPDTGVYVKDLSSFVTKSVKEIEHVMNVGNQNRSVGATNMNEHSSRSHAI FVITIECSEVGLDGENHIRVGKLNLVDLAGSERQAKTGAQGERLKEATKINLSLSALG NVISALVDGKSTHIPYRDSKLTRLLQDSLGGNAKTVMVANVGPASYNVEETLTTLRYA NRAKNIKNKPRVNEDPKDALLREFQEEIARLKAQLEKRSIGRRKRREKRREGGGSGGG GEEEEEEGEEGEEEGDDKDDYWREQQEKLEIEKRAIVEDHSLVAEEKMRLLKEKEKKM EDLRREKDAAEMLGAKIKAMESKLLVGGKNIVDHTNEQQKILEQKRQEIAEQKRRERE IQQQMESRDEETLELKETYSSLQQEVDIKTKKLKKLFSKLQAVKAEIHDLQEEHIKER QELEQTQNELTRELKLKHLIIENFIPLEEKSKIMNRAFFDEEEDHWKLHPITRLENQQ MMKRPVSAVGYKRPLSQHARMSMMIRPEARYRAENIVLLELDMPSRTTRDYEGPAIAP KVQAALDAALQDEDEIQVDASSFESTANKKSKARPKSGRKSGSSSSSSGTPASQLYPQ SRGLVPK" BASE COUNT 1289 a 1073 c 1250 g 1112 t ORIGIN 1 atccggggca gcggggaatg gctgagccag gggttcgccg cccccgccgc cgccgccgcc 61 gccgccgccg ccgccgccgc ccgctttcgg ctcgggcctc aggaccgtag catcctgaga 121 cattttgaat tgacacttct caagatttga ctggatcaga gttcatcatg tcaaagttga 181 aaagctcaga gtcagtcagg gtggtggttc gctgtcggcc catgaatggc aaggaaaagg 241 ctgcttcgta tgacaaagtg gtggatgtgg atgttaagct ggggcaggtg tctgtgaaga 301 accccaaagg gacggcccat gaaatgccca agaccttcac ctttgatgcc gtctatgact 361 ggaatgccaa gcagtttgaa ctgtacgatg agacgttccg accacttgtt gactctgtcc 421 tgcaaggttt caatggaacc atttttgcct atggacaaac tgggacagga aaaacctaca 481 ccatggaagg aatccgtggt gaccctgaaa aaagaggagt cattcctaac tcatttgacc 541 atatcttcac ccacatctct cgatcccaga atcaacaata cctggtcagg gcttcttact 601 tagagatcta ccaggaggag atccgagatt tgctctcaaa ggatcagacc aaaaggcttg 661 agctcaaaga gaggcctgac acaggagtgt atgtgaaaga cctgtcttcc tttgtcacca 721 agagtgtgaa ggagatagag catgtgatga atgtggggaa ccagaaccgt tctgtcggtg 781 ctaccaacat gaacgagcac agctcgcgtt ctcatgcaat tttcgttatc actattgagt 841 gcagcgaggt gggcctcgat ggtgaaaacc acatccgtgt aggaaaattg aaccttgtag 901 atcttgctgg cagcgaacgg caagccaaga ccggcgcaca aggggagaga ttaaaagaag 961 ctaccaagat caacctctcc ctttccgctt tgggtaatgt catctctgct ctagtggacg 1021 gcaaaagcac tcacattcca tatcgggact caaagcttac caggctcctc caagattccc 1081 ttggtggcaa tgccaagact gtgatggtgg ccaacgtggg gcctgcctct tacaacgtag 1141 aagagactct gaccactctg cgatatgcca accgtgccaa aaacattaag aacaaaccaa 1201 gggtcaatga ggaccccaag gatgccctcc ttcgagaatt ccaggaagag attgctcggc 1261 tcaaggccca gctggaaaaa cggtccattg gtaggaggaa gaggcgagag aagcggaggg 1321 aaggtggtgg cagtggtggg ggtggggaag aggaggagga ggagggagaa gagggtgagg 1381 aggaagggga tgataaggat gattactggc gggaacagca agaaaaactg gagattgaga 1441 agcgggccat tgtagaggat cacagcttgg ttgcagagga gaagatgagg ctgctgaagg 1501 agaaagagaa aaagatggag gacctgcggc gggagaagga tgctgccgag atgctgggcg 1561 ccaagatcaa ggccatggag agtaagttgc ttgttggagg aaaaaatata gtagatcata 1621 cgaatgaaca gcagaaaatc ctggagcaga aacgacagga aattgcagag cagaaacgtc 1681 gagaaagaga aatccagcaa cagatggaaa gtcgagatga ggagaccttg gaacttaaag 1741 agacatacag ctcattgcag caagaggtgg acatcaagac caaaaaactc aaaaagctct 1801 tctccaagct tcaggcagtg aaggctgaga tccatgacct ccaagaagaa cacatcaagg 1861 agcgccaaga gctagagcag actcagaatg agctcaccag ggagctgaaa ctcaagcatc 1921 ttattataga aaactttatc cctctggaag aaaaaagtaa aattatgaat agagccttct 1981 ttgatgaaga ggaagatcat tggaaactac atcctataac cagactggag aaccagcaga 2041 tgatgaagcg gccagtctca gccgtgggat ataagagacc attgagccag cacgcaagaa 2101 tgtccatgat gattcgtcca gaggcccgat acagggcaga aaacattgtg ctgttagagc 2161 tggacatgcc cagccggacc accagagact atgagggtcc agccattgcc cccaaggtcc 2221 aggctgcatt ggatgcggct ctgcaggatg aagatgagat acaggtggat gcatcatcat 2281 ttgaaagcac tgcaaataag aaatccaagg ccaggcctaa aagtggaagg aagtcgggat 2341 cctcctcctc ttcctcagga acccctgcat ctcagcttta tccacagtct cgggggctgg 2401 ttccaaagta aagccagctt ctcctctccc agggcggaaa cagcatttgc cttctgagag 2461 aagagactag cgaaaagctg cagagaggat tcggcccaaa ctcagaactg ttcccctgag 2521 gagaagcggt ggcctctttg cagatcaacc aacttaatct ggttgaacgt gctgttccta 2581 atctggcact cagcccctct gggaaacatc ttttaattag catctcagaa atgcatgggt 2641 aaggtaaagt gcgatagttc aagtggaaag caagagaatg accagtgacc ttgcttcctt 2701 cccccttgcc ttcttctccc ccttcccctg tgctcccttt ctctcctctc tccttttcta 2761 gcctgttctt tacatggggc tcccttcttg ttgaacaata gggcagaatc aggagtcacc 2821 ttagcaggac cacatctttg gagcctcggg ataaaatgac agtgaggttg aaaagtgaaa 2881 accctagaac ttgaataggt gcctgttctt gtagggagaa atgagaaatc gcatttggat 2941 ccaggcccca ggtgggcacc atcagcagtc ttgcttccat gcacctcagt aagaagtgga 3001 tctgcctttg ggacctgctc agtgaggaaa tctcttccaa tttctgcttc tgaatgattc 3061 aatgttggga gcaatagaaa taacattccc tttgccttct ctgagtgttt agggaaatag 3121 cttctttaaa acctcaaaac catgaccatc ctgtcaaaga cctaagtctg taagctggtg 3181 ccatgtccat acaccatgtc actttactct tcatttgtca ccatcttttc ccatgcacgc 3241 atactctgaa catccttgtg tgggcccatc ctctgcatcc agagcatgct ctgcagtggg 3301 cctgttttgt ggaagaaagg aggctgtctc tgccttctct gatgggactg gagttgaggg 3361 aaggagctgt attgtggcac ttctgaattc cccgttttgt tccatattgg tatagagagc 3421 agaagagtag ctaggcagat gcagagatgg agacatgaga ctcagtgcag tgggcaggga 3481 agacataaca gatggaagca aaggaatcct gcctgccttc agcagagaat tcaccgaatc 3541 ctagaactgt ggctccctcc aggcagagcc taagatgctg gtgaagaata gctgtgtgat 3601 tgaataggct caaaggagag ttcagaattc ccatttacat attactagtt tggtttgtaa 3661 gttttagttc cttgtattat tgagattcag agcttcattt tatgttggtc attaggtgaa 3721 tattactcat tttccctcaa gagaagctca taagtgtgtg tgggtgtgag agcacgatgg 3781 tgcctgtgtt ctgtgaatgt gtccatatgt gtctgtaaga gagacagaga ccaagaactt 3841 gcccaatttt agaaatacac taatgtgcag ttgttgcctt ttgtctgtat tgaaggccca 3901 ttgaatgact aatccaggct ggaagcattc ccatgtgggt gtctgagtcc atgagccaag 3961 cctgagggga cagtgagtct ccaggtctgc cacactggtg caccttgctg gcacggtgcc 4021 tcaggaaggt ggcgactcag gtgggccttg agttatattt taactcagct gctcagttcc 4081 cagggcacat ttctggatca gaacccatgg gaaacaggag gtactaagtg caatgtctta 4141 gcattctgca aaatggagat ctgttgtcca gcggcttatc tcctttttag taacccttct 4201 ttctgaaccc agggcccttt tcagccttcc ctcatatttt cttgagatca aactttactt 4261 ctttcttatt tactaagaat ttgcctgttt gaataagaac aaaacgctaa ggtgggtagc 4321 ctaagctgat tttctgctgg ttacacgtgt ctctcacacc acatttcctc aaagctaatc 4381 tgaattctgt aggctaaaaa tattcatgta gcaaatctga gaattgaaaa ctgcagataa 4441 ccggccgggt atggtgactc atgcctgtaa tcctggcact ttgggaggcc gaggtgggtg 4501 gaccacctga ggttaggact tcgagaccag cctggccaac atggtggaac cccgtctgta 4561 ctaaaagtgc aaaggtttgc ctggtgtggt ggtgcatgcc tctagtcctg gctactcggg 4621 aggctgaggc acgagaatcg cttgggcctg ggaggcggag gttgcagtga gtcgagataa 4681 cactactgca ttccagcctg ggtgacagag tgagactcca cctc // LOCUS AB002362 5413 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0364 gene, complete cds. ACCESSION AB002362 NID g2224668 KEYWORDS KIAA0364. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HH0116. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5413) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..5413 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HH0116" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 1145..5128 /gene="KIAA0364" CDS 1145..5128 /gene="KIAA0364" /codon_start=1 /db_xref="PID:d1021660" /db_xref="PID:g2224669" /translation="MTLDRPGEGATMLKTFTVLLFCILMDPQPELWIESNYPQAPWEN ITLWCRSPSRISSKFLLLKDKTQMTWIRPSHKTFQVSFLIGALTESNAGLYRCCYWKE TGWSKPSKVLELEAPGQLPKPIFWIQAETPALPGCNVNILCHGWLQDLVFMLFKEGYA EPVDYQVPTGTMAIFSIDNLTPEDEGVYICRTHIQMLPTLWSEPSNPLKLVVAGLYPK PTLTAHPGPIMAPGESLNLRCQGPIYGMTFALMRVEDLEKSFYHKKTIKNEANFFFQS LKIQDTGHYLCFYYDASYRGSLLSDVLKIWVTDTFPKTWLLARPSAVVQMGQNVSLRC RGPVDGVGLALYKKGEDKPLQFLDATSIDDNTSFFLNNVTYSDTGIYSCHYLLTWKTS IRMPSHNTVELMVVDKPPKPSLSAWPSTVFKLGKAITLQCRVSHPVLEFSLEWEERET FQKFSVNGDFIISNVDGKGTGTYSCSYRVETHPNIWSHRSEPLKLMGPAGYLTWNYVL NEAIRLSLIMQLVALLLVVLWIRWKCRRLRIREAWLLGTAQGVTMLFIVTALLCCGLC NGVLIEETEIVMPTPKPELWAETNFPLAPWKNLTLWCRSPSGSTKEFVLLKDGTGWIA TRPASEQVRAAFPLGALTQSHTGSYHCHSWEEMAVSEPSEALELVGTDILPKPVISAS PTIRGQELQLRCKGWLAGMGFALYKEGEQEPVQQLGAVGREAFFTIQRMEDKDEGNYS CRTHTEKRPFKWSEPSEPLELVIKEMYPKPFFKTWASPVVTPGARVTFNCSTPHQHMS FILYKDGSEIASSDRSWASPGASAAHFLIISVGIGDGGNYSCRYYDFSIWSEPSDPVE LVVTEFYPKPTLLAQPGPVVFPGKSVILRCQGTFQGMRFALLQEGAHVPLQFRSVSGN SADFLLHTVGAEDSGNYSCIYYETTMSNRGSYLSMPLMIWVTDTFPKPWLFAEPSSVV PMGQNVTLWCRGPVHGVGYILHKEGEATSMQLWGSTSNDGAFPITNISGTSMGRYSCC YHPDWTSSIKIQPSNTLELLVTGLLPKPSLLAQPGPMVAPGENMTLQCQGELPDSTFV LLKEGAQEPLEQQRPSGYRADFWMPAVRGEDSGIYSCVYYLDSTPFAASNHSDSLEIW VTDKPPKPSLSAWPSTMFKLGKDITLQCRGPLPGVEFVLEHDGEEAPQQFSEDGDFVI NNVEGKGIGNYSCSYRLQAYPDIWSEPSDPLELVGAAGPVAQECTVGNIVRSSLIVVV VVALGVVLAIEWKKWPRLRTRGSETDGRDQTIALEECNQEGEPGTPANSPSSTSQRIS VELPVPI" BASE COUNT 1331 a 1412 c 1358 g 1312 t ORIGIN 1 ctccctgacc ccttgcgctt cccgagtgag gcaatgcctc gccctgcttc ggctcgcgga 61 cggtgcgcgc acccactgac ctgcgcccac tgtctggcac tccctagtga gatgaacccg 121 gtacctcaga tggaaatgca gaaatcaccc gtcttctgcg tcgctcacgc tgggagctgt 181 agatcgcagc tgttcctatt cggccatctt ggctcctcca ttctgctttt aagatacaag 241 tgatgccttc aaatttgagc tcagtttctt atggtcgttt aggagcatga gggtttctgc 301 aataatctac cagtgaaatg atgatcaaga tcttggaaag cactctactg gtcaagcaga 361 atagttcctg tcatctccat taagaatcag cttattggcc gggcgcagtg gctcatgcct 421 gtaatcccag cactttggga ggctgaggcg ggtggatcac gaggtcagga gatcgagacc 481 atcctgtcta acaaggaatc ctttctggca ctcaagagtt ttcaattcaa tcctgatgaa 541 tgaatcatta tttgatacca gcctaagtga agactctgcc tggtgacttt tgcaaggtgg 601 tttctggctg caacatggga agatttggga tcagtgagat ggaagagatt tttctggcta 661 ctagcgttcc taagaaagag ctttagtgtt tatccagcta ttggtatgca gacaactata 721 cccttatgag aaacactagc agagaagaag agttcttgac tgggtctggg ccatgaaact 781 gaccggataa aagtgggaga agaagagatg ttgaaggagg agtttgctgt tttctcagtc 841 tcctgagaaa ttccagatat cttctttcta cacacatcct ggagagggga aagaaagcct 901 ctctctgtca cccaggctgg aatgcactgg tgccatctcc actcactgca gcttcagcct 961 cctgggttca agtgatcctc ctgcctctca gccttccgag tggctgggac tacagtttgc 1021 tgcatctgga ggagctcact ggagaatctc caacatcgga gcgggccttc aactaccatc 1081 ccaccacctg ctgaggagaa aaattcttca agactcagag cacacagcca gcaccagagg 1141 ccccatgacc ctggacagac caggggaggg ggccaccatg ctgaagacat tcactgtttt 1201 gctcttttgc attctgatgg accctcaacc ggagttgtgg atagagtcca actaccccca 1261 ggccccttgg gagaacatca cgctttggtg ccgaagcccc tctcggatat caagcaagtt 1321 cctgctgctg aaggataaga cacagatgac ctggatccgc ccttcccaca agaccttcca 1381 agtttcattc cttataggtg cccttactga gtccaatgca ggtctttacc ggtgctgcta 1441 ctggaaggag acaggctggt caaagcccag taaagttcta gagttggagg caccaggcca 1501 actgcccaag cccatcttct ggattcaggc tgagaccccc gctcttcctg ggtgtaatgt 1561 taacatcctc tgccatggct ggctgcagga tttggtattc atgctgttta aagagggata 1621 tgcagagcct gtggattacc aagtcccaac tgggacaatg gccatattct ccattgacaa 1681 cctgacacct gaggatgaag gggtttacat ctgccgcact catatccaga tgctccccac 1741 cctgtggtca gagcccagca accccctgaa gctggttgta gcaggactct accccaaacc 1801 aactttgaca gcccatcctg ggcccatcat ggcacctgga gaaagcctga atctcaggtg 1861 ccaagggcca atctatggaa tgacctttgc tctaatgagg gttgaagact tggagaagtc 1921 cttttaccac aagaagacaa taaaaaatga ggcaaatttc ttcttccagt ctttgaagat 1981 ccaagatact ggacattacc tctgttttta ctatgacgca tcatatagag gttcactcct 2041 tagtgatgtc ctgaaaatct gggtaactga cactttcccc aagacctggc tacttgctcg 2101 gcccagtgct gtggtccaaa tgggtcagaa tgtgagccta cggtgtcgag gaccagtgga 2161 tggagtgggt cttgcactct ataagaaagg agaagacaaa ccacttcaat ttttggatgc 2221 caccagcatc gatgacaaca catcattctt cctcaacaat gtaacctaca gtgatactgg 2281 catctatagc tgccactatc ttctcacctg gaagacctcc attaggatgc catcacacaa 2341 cactgtggag cttatggttg tagataagcc ccccaaaccc tccctgtcag cttggccaag 2401 cactgtgttc aagctaggaa aggccatcac ccttcagtgc cgagtatctc atccagtact 2461 ggagttttct ctggaatggg aagaaagaga aacattccaa aaattctcag taaacggaga 2521 cttcatcatc agtaatgttg acgggaaagg cacagggacc tacagttgca gctatcgcgt 2581 agagacacat cctaacatct ggtcacatcg cagtgagccc ctgaagctga tggggccagc 2641 aggctatctc acctggaatt acgttctgaa tgaagctatc aggttgtctc taatcatgca 2701 gcttgttgcc ttgctgttgg tagtgctgtg gataaggtgg aagtgtcgga gactcagaat 2761 cagagaagcc tggttgctgg gaacagctca aggggtcacc atgctcttca tagtcacggc 2821 ccttctctgc tgtggactgt gcaatggggt attgatagaa gagactgaaa tagtcatgcc 2881 aacccctaag cctgagctgt gggcagagac caactttcct ctggccccgt ggaagaactt 2941 aaccctctgg tgcagaagcc cttctggctc aactaaggag tttgtgttgc tgaaggatgg 3001 gaccgggtgg atcgccactc gcccggcctc agagcaggtc cgggctgcct tcccccttgg 3061 cgccctgacc cagagccaca ccgggagcta ccactgccat tcatgggagg agatggctgt 3121 atcggagccc agtgaggcac ttgagctggt ggggacagac atcctcccca aacctgtcat 3181 ttctgcttcc cccacaatcc ggggccagga actacaactc cggtgcaaag gatggctggc 3241 aggcatgggg tttgctctgt ataaggaggg agagcaagaa cctgtccagc aacttggtgc 3301 tgttggaaga gaagccttct ttacaatcca gagaatggag gataaagacg aaggcaatta 3361 cagctgccgc actcacactg aaaaacgccc cttcaagtgg tctgagccca gtgagccgct 3421 ggagcttgtc ataaaagaaa tgtaccctaa gcccttcttc aagacatggg ccagccctgt 3481 ggtcacccct ggtgcccgag tgactttcaa ttgctccacc ccccaccagc atatgagctt 3541 tattctttac aaagatggaa gtgaaatagc atccagtgac aggtcctggg caagtccggg 3601 ggccagtgca gctcactttc taatcatttc ggtgggcatt ggtgatggag ggaattacag 3661 ctgccgatat tatgactttt ctatctggtc tgagcccagc gaccctgtgg agctcgtggt 3721 gacagaattc taccccaaac ccactctcct ggcacagcca ggtcctgtgg tgtttcctgg 3781 gaagagtgtg atcctgcgct gccaagggac tttccagggc atgaggttcg ccctcttgca 3841 ggagggagcc catgttccct tacagtttcg gagtgtctca gggaactcag ctgacttcct 3901 tctccacact gttggagcag aggactctgg gaactatagc tgtatctact atgagacaac 3961 catgtcaaac agggggtcat atctcagtat gccccttatg atctgggtga ctgacacatt 4021 ccctaagcca tggttgtttg ctgagcccag ttctgtggtt cccatggggc agaatgttac 4081 tctctggtgc cgagggccgg tccatggagt aggatacatt ctgcacaaag aaggagaagc 4141 cacttcaatg cagctctggg gatccaccag taatgacggg gcattcccca tcaccaatat 4201 atctggtact agcatggggc gttacagctg ctgctaccac cctgactgga ccagttctat 4261 caagatacaa cctagcaaca ccctggaact cctagtcaca ggcttactcc ccaaacccag 4321 cctattagcc cagcctggtc ccatggtggc ccctggcgaa aatatgactc ttcagtgtca 4381 aggggaactg ccagactcaa catttgtcct gttgaaggag ggggctcagg agcctttaga 4441 gcaacagagg ccaagtgggt acagggctga cttctggatg ccagcagtga gaggtgaaga 4501 ctctgggatc tatagctgtg tttattattt ggactctact ccctttgcag cttcaaatca 4561 cagtgactcc ctggagatct gggtgactga taagccccct aaaccctctc tgtcagcctg 4621 gcccagcacc atgttcaagt tagggaagga catcaccctt cagtgccgag gacccctgcc 4681 aggtgttgaa tttgttctag aacatgatgg agaagaagca cctcagcagt tttcagagga 4741 tggagacttt gtcatcaaca acgtagaagg aaaaggcatt ggaaactaca gctgcagcta 4801 ccgcctccag gcctaccctg atatctggtc agagcctagt gatcccctgg agctggtggg 4861 ggcagcaggg cctgttgctc aggagtgcac tgtagggaac attgtccgaa gtagcctaat 4921 cgtggtggtt gttgtagcct tgggggtagt gctagccata gagtggaaga agtggcctcg 4981 actgcgaacc agaggctcag agacagacgg aagagaccag accattgccc ttgaagagtg 5041 taaccaagaa ggagaaccag gcacccctgc caattctcct tcatcaacct ctcagagaat 5101 ctctgtggaa ctgcccgttc caatataata atctcctcct ttacaagagc tttcctctcc 5161 tctctcttgc tctcagagac ctataaatcc aaccagttac cctgcaagtc agccccatct 5221 gctgttcctt ggtctctaat cacctgagct gggtaaaggg gattctggga gttgagagct 5281 ctgccagggt gagatgtttc ctgaagagag gttccccacc cctgtaactc ctcactgtac 5341 tgatttactg gcgcatgaaa ttctattaaa aatgcattct tctgaataaa aagagtattc 5401 actatttaac ttc // LOCUS AB002367 5703 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0369 gene, complete cds. ACCESSION AB002367 NID g2224678 KEYWORDS KIAA0369. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HH0177. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5703) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..5703 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HH0177" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 213..2402 /gene="KIAA0369" CDS 213..2402 /gene="KIAA0369" /codon_start=1 /db_xref="PID:d1021665" /db_xref="PID:g2224679" /translation="MSFGRDMELEHFDERDKAQRYSRGSRVNGLPSPTHSAHCSFYRT RTLQTLSSEKKAKKVRFYRNGDRYFKGIVYAISPDRFRSFEALLADLTRTLSDNVNLP QGVRTIYTIDGLKKISSLDQLVEGESYVCGSIEPFKKLEYTKNVNPNWSVNVKTTSAS RAVSSLATAKGSPSEVRENKDFIRPKLVTIIRSGVKPRKAVRILLNKKTAHSFEQVLT DITDAIKLDSGVVKRLYTLDGKQVMCLQDFFGDDDIFIACGPEKFRYQDDFLLDESEC RVVKSTSYTKIASSSRRSTTKSPGPSRRSKSPASTSSVNGTPGSQLSTPRSGKSPSPS PTSPGSLRKQRSSQHGGSSTSLASTKVCSSMDENDGPGEEVSEEGFQIPATITERYKV GRTIGDGNFAVVKECVERSTAREYALKIIKKSKCRGKEHMIQNEVSILRRVKHPNIVL LIEEMDVPTELYLVMELVKGGDLFDAITSTNKYTERDASGMLYNLASAIKYLHSLNIV HRDIKPENLLVYEHQDGSKSLKLGDFGLATIVDGPLYTVCGTPTYVAPEIIAETGYGL KVDIWAAGVITYILLCGFPPFRGSGDDQEVLFDQILMGQVDFPSPYWDNVSDSAKELI TMMLLVDVDQRFSAVQVLEHPWVNDDGLPENEHQLSVAGKIKKHFNTGPKPNSTAAGV SVIALDHGFTIKRSGSLDYYQQPGMYWIRPPLLIRRGRFSDEDATRM" BASE COUNT 1540 a 1288 c 1335 g 1540 t ORIGIN 1 gcacatccct gcactagtgg ccgcaaccga gacgccgcgc tccagcagct gctgccgccc 61 agcccggccc cgccgccgcc ccccagccct gcagccccgc agccccggcc gcgcccagcc 121 cggcgaggac agcaccagga ggcggccccc agcgcggcca caaagacccc cggcggcgtc 181 tctccgcgga ccggtcctac ttgaagtcca tcatgtcctt cggcagagac atggagctgg 241 agcacttcga cgagcgggat aaggcgcaga gatacagccg agggtcgcgg gtgaacggcc 301 tgccgagccc gacgcacagc gcccactgca gcttctaccg cacccgcacg ctgcagacgc 361 tcagctccga gaagaaggcc aagaaagttc gtttctatcg aaacggagat cgatacttca 421 aagggattgt gtatgccatc tccccagacc ggttccgatc ttttgaggcc ctgctggctg 481 atttgacccg aactctgtcg gataacgtga atttgcccca gggagtgaga acaatctaca 541 ccattgatgg gctcaagaag atttccagcc tggaccaact ggtggaagga gagagttatg 601 tatgtggctc catagagccc ttcaagaaac tggagtacac caagaatgtg aaccccaact 661 ggtcggtgaa cgtcaagacc acctcggctt ctcgggcagt gtcttcactg gccactgcca 721 aaggaagccc ttcagaggtg cgagagaata aggatttcat tcggcccaag ctggtcacca 781 tcatcagaag tggcgtgaag ccacggaaag ctgtcaggat tctgctgaac aagaaaacgg 841 ctcattcctt tgagcaggtc ctcaccgata tcaccgatgc catcaagctg gactcgggag 901 tggtgaaacg cctgtacacg ttggatggga aacaggtgat gtgccttcag gacttttttg 961 gtgatgatga catttttatt gcatgtggac cggagaagtt ccgttaccag gatgatttct 1021 tgctagatga aagtgaatgt cgagtggtaa agtccacttc ttacaccaaa atagcttcat 1081 catcccgcag gagcaccacc aagagcccag gaccgtccag gcgtagcaag tcccctgcct 1141 ccaccagctc agttaatgga acccctggta gtcagctctc tactccgcgc tcaggcaagt 1201 cgccaagccc atcacccacc agcccaggaa gcctgcggaa gcagaggagc tctcagcatg 1261 gcggctcctc tacgtcactt gcgtccacca aagtctgcag ctcgatggat gagaacgatg 1321 gccctggaga agaagtgtcg gaggaaggct tccagattcc agctacaata acagaacgat 1381 ataaagtcgg aagaacaata ggagatggaa attttgctgt tgtcaaggaa tgtgtagaaa 1441 gatcgactgc tagagagtac gctctgaaaa ttatcaagaa aagcaaatgt cgaggcaaag 1501 agcacatgat ccagaatgaa gtgtctattt taagaagagt gaagcatccc aatatcgttc 1561 ttctgattga ggagatggat gtgccaactg aactgtatct tgtcatggaa ttagtaaagg 1621 ggggagacct ttttgatgcc attacttcca ctaacaaata caccgagaga gacgccagtg 1681 ggatgctgta caacctagcc agcgccatca aatacctgca tagcctgaac atcgtccacc 1741 gtgatatcaa gccagagaac ctgctggtgt atgagcacca agatggcagc aaatcactga 1801 agctgggtga ctttggactg gccaccattg tagacggccc cctgtacaca gtctgtggca 1861 ccccaacata cgtggctcca gaaatcattg cagagactgg atacggcctc aaggtggaca 1921 tctgggcagc aggtgtaatc acttatatcc tgctgtgtgg tttccctcca ttccgtggaa 1981 gtggtgatga ccaggaggtg ctttttgatc agattttgat ggggcaggtg gactttcctt 2041 ctccatactg ggataatgtt tccgattctg caaaggagct cattaccatg atgctgttgg 2101 tcgatgtaga tcagcgattt tctgctgttc aagtacttga gcatccctgg gttaatgatg 2161 atggcctccc agaaaatgaa catcagctgt cagtagctgg aaagataaag aagcatttca 2221 acacaggccc caagccgaat agcacagcag ctggagtttc tgtcatagca ctggaccacg 2281 ggtttaccat caagagatca gggtctttgg actactacca gcaaccagga atgtattgga 2341 taagaccacc gctcttgata aggagaggca ggttttccga cgaagacgca accaggatgt 2401 gaggagccgg tacaaggcgc agccagctcc tcccgaactc aactcggaat cggaagacta 2461 ctccccaagc tcctccgaga ctgttcgctc ccctaactcg cccttttaat aagacccttt 2521 tactcaaagt cctagcttaa ccctttgaga ctctgagatt tttttccccc aaatttgtgt 2581 aaaacagttt catctgatct atctagcgct caatgcttga atggcagaac tgaaagtgtt 2641 ttcaggtatc tttgtagcgg tttcccttta ctgaataaga tgacacgtgg tgattgtgaa 2701 gatggtaatt tgctgctaat agagtcctca aagggttaag gccaatttgc aatttttttt 2761 taaacttaga agcaatgaat gttttcatca gtcaagctag gatctgcagt atgtaatata 2821 gcacttgtta accctctgag tgcatagaat tttattgaga attcttgttt gggaattttt 2881 caggcctttg gatgtataca cacatgtttc ttgattttac tgcagatcaa ggggtgttgt 2941 tagatgctga aatgtccaga aaagaaggac atttagaatg atatcttgtt tgtccttttc 3001 tgtgggttta gaacgtggca ggtttataac ttagacacac gcacggttct ttcttcttca 3061 caatcctatt cagaaacaga tttttttttt cattagagat atgactgtca gttgcagtga 3121 gttctgcatc ccaagtggag ggaattgggt ttgtggcaaa gagcttgacc caggaaatag 3181 atggtgcccc ccaaattgtc tccacatgaa gatgtactga tgacgcccca gaaatgctgc 3241 ttccatatca gctgctgcta gcgccagcgc agactctcag ggagtcacca cagcttgtct 3301 tgtgcttggt gagtgagggt ctctctactc agtgtcagac atctacagga aagaaacaac 3361 tggtggaaaa gagcaataaa ttgcccggtg ctctgcaggg ctggaatttc aaacagaaag 3421 agggaataag atcctgtgat ttttctcacc tgcttttcca cgcactgtgg tcatcactgt 3481 gcaatctaca tctagtatga aatccacaca taggagagct ggggcacaag gggactggag 3541 gcagttgctt tgcaagatgg ctgaggagaa agcacactgg gaacacaatc cagaatgttc 3601 taacaataag ttttcagtga ataaaccact ggcaagacat ttccatgtgc acctttaggt 3661 tacctatata gtctcctagg aagatcagga tgaaagacct agatgatacc cctgaggata 3721 aaacctccat cccctaaaat gatttttttt aaataccact gtctttagct gtccaggagg 3781 tcagagtgtt ttttctgtct ttgggccaag tcctgtctga gacctgtatt ttcactcttg 3841 ttaccaaatc tatctcccta gtgcagtgtc tccaggcctg agtttcttct ggaacagatt 3901 ccattttaga atggggattc acaggttctg tgcatcacca cagtgctcag agaggattct 3961 cctggggtgt cttagaggca ggtgcccaac tcaaatgtat tcccaaggtt tgctgggctc 4021 tgggatccac gagacaacca gagagggata tctcatgaaa tttgcatctg gtggctgaac 4081 agtacctatg ttctctgttt tgaatatact ttaatacctg agagtcttaa aatttgtgaa 4141 caacgtttct atagtccttt attttcaaat gcacattgat cttcacttgc tgcattttta 4201 ctcttcaacc ctgaaactat ggtctacatt aatatggatt tttaaatcac atgtcattac 4261 ttttgcaaca ccatcaccaa aattttttgc tcttttacat ttaggttcat ctctgtggtc 4321 tgtgttgtcc tgacatgtaa aaagcatatc gtttattgag gtttttttcc ccccctttta 4381 gagcatccgg aagtgataac acgcaaaatc acaaagtagc ataaatcagt aaattagttg 4441 agttgttttt gggggggagg tgggggtagg gggcacagaa caccagaaag agtgttggtg 4501 tgtaggtaga ttccatatta atgaggaaca ctgaactagt tggaaattac tgctttctct 4561 agaaatataa agcaaagcac tattccaagg ctatggagta gctctacagc ctggcctcaa 4621 ctctaaaagt gtgaagaatg caatgggcag agacctacct gcagtggact gtcattttcc 4681 tttctttctc tgaattactg ctttttctgt gggcattaac tatattgcta cagcatctag 4741 tgtactgagc ctgcggtgca tggctcaggc cttttcccat cgacgtctag ggggactctg 4801 gaccgtgtga agctaggggg tgtttctcag cacactgcag aagggcagct cagaagaatg 4861 cagggcccat tcagcatggg gatcccagca catcactgta gaatttgagt gatctatgct 4921 gaataaacag tggaatgtga ccagtcaagt agaaatcttg agtaatcaga tggaatgcaa 4981 tctttctaac attaagctac caagatcctg aatgtcagag atgtactcag agggttaaca 5041 gacaagcaca aggcatgctg actacattgg tgtatccaga ttgctttgct tttagccagt 5101 gctttctaat ttttttctcg acattcttgg gatagttcaa gtttgaaata attaagtggt 5161 ggtgttcttt aaggaatttc tataaccaaa ttgatcttat ttttgatttc acttatcata 5221 gaacaaatat gtatcattat ggcagtgtat ctatgtaatt atcaatttaa tcatcaccac 5281 cggtgtttcc atattttttc ccaagtattt aatatagctc tcttatggtg gtggcctggt 5341 gatggggacc gtctttcttt tactgacaca tgaccaatca tatggtattt tcaagggaat 5401 tttaagattc atcttttcag tttgatagta gactagttaa ggaagaactc tttcattact 5461 tgcatcgtgt aaatcatctc tgtagacatg tgttcatatt aatgaacaca ttttttctca 5521 acattgtagc agaaatcatt ttattcgtca tgatcaatga atatgtgatt tgctccagat 5581 cgttagaagg aaaagtaaga tttcagtcat caaaaatgtt tttaccgtag ccctcatcta 5641 acttacacgt ggtgcatatt aaaataagca gagaaaaaaa aatgtgaata aactactgaa 5701 aac // LOCUS AB002369 5886 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0371 gene, complete cds. ACCESSION AB002369 NID g2224682 KEYWORDS KIAA0371. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HH0252. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5886) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..5886 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HH0252" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 248..3844 /gene="KIAA0371" CDS 248..3844 /gene="KIAA0371" /codon_start=1 /db_xref="PID:d1021667" /db_xref="PID:g2224683" /translation="MDEETRHSLECIQANQIFPRKQLIREDENLQVPFLELHGESTEF VGRAEDAIIALSNYRLHIKFKESLVNVPLQLIESVECRDIFQLHLTCKDCKVIRCQFS TFEQCQEWLKRLNNAIRPPAKIEDLFSFAYHAWCMEVYASEKEQHGDLCRPGEHVTSR FKNEVERMGFDMNNAWRISNINEKYKLCGSYPQELIVPAWITDKELESVSSFRSWKRI PAVIYRHQSNGAVIARCGQPEVSWWGWRNADDEHLVQSVAKACASDSRSSGSKLSTRN TSRDFPNGGDLSDVEFDSSLSNASGAESLAIQPQKLLILDARSYAAAVANRAKGGGCE CPEYYPNCEVVFMGMANIHSIRRSFQSLRLLCTQMPDPGNWLSALESTKWLHHLSVLL KSALLVVHAVDQDQRPVLVHCSDGWDRTPQIVALAKLLLDPYYRTIEGFQVLVEMEWL DFGHKFADRCGHGENSDDLNERCPVFLQWLDCVHQLQRQFPCSFEFNEAFLVKLVQHT YSCLFGTFLCNNAKERGEKHTQERTCSVWSLLRAGNKAFKNLLYSSQSEAVLYPVCHV RNLMLWSAVYLPCPSPTTPVDDSCAPYPAPGTSPDDPPLSRLPKTRSYDNLTTACDNT VPLASRRCSDPSLNEKWQEHRRSLELSSLAGPGEDPLSADSLGKPTRVPGGAELSVAA GVAEGQMENILQEATKEESGVEEPAHRAGIEIQEGKEDPLLEKESRRKTPEASAIGLH QDPELGDAALRSHLDMSWPLFSQGISEQQSGLSVLLSSLQVPPRGEDSLEVPVEQFRI EEIAEGREEAVLPIPVDAKVGYGTSQSCSLLPSQVPFETRGPNVDSSTDMLVEDKVKS VSGPQGHHRSCLVNSGKDRLPQTMEPSPSETSLVERPQVGSVVHRTSLGSTLSLTRSP CALPLAECKEGLVCNGAPETENRASEQPPGLSTLQMYPTPNGHCANGEAGRSKDSLSR QLSAMSCSSAHLHSRNLHHKWLHSHSGRPSATSSPDQPSRSHLDDDGMSVYTDTIQQR LRQIESGHQQEVETLKKQVQELKSRLESQYLTSSLHFNGDFGDEVTSIPDSESNLDQN CLSRCSTEIFSEASWEQVDKQDTEMTRWLPDHLAAHCYACDSAFWLASRKHHCRNCGN VFCSSCCNQKVPVPSQQLFEPSRVCKSCYSSLHPTSSSIDLELDKPIAATSN" BASE COUNT 1398 a 1481 c 1594 g 1413 t ORIGIN 1 gacgtcccag cggctcaggc gctgcccagc gccggccctg gcagggagcc tggtggggtg 61 gcggaggggg agactgtgcc gtggagggcc tcgccatgtc ctgctgcccg acttccttgt 121 gaaacctcct agtagaaaaa gttcttggga caaacttcat ttgacttcac cataagggaa 181 agaagagagc cttgttggac actgaagaag ggacggggat ttctcctcct aatctgggcc 241 tcttgtcatg gatgaagaga ctcggcacag ccttgagtgc atccaggcca atcagatctt 301 tcccaggaag cagctgatcc gggaggatga gaatcttcag gttcctttcc ttgaacttca 361 tggagagagc acagagtttg tgggccgtgc cgaggatgcc atcattgccc tttccaatta 421 cagacttcac atcaagttca aggagtctct tgttaatgtt ccattacagc ttatagaaag 481 tgttgaatgc cgagatatat ttcagcttca tttgacttgc aaagactgca aagttatcag 541 gtgtcagttt tcaacctttg agcagtgtca agagtggctg aagagactga acaacgcaat 601 ccgaccacct gctaaaatag aagatctctt ctcatttgca taccatgctt ggtgcatgga 661 ggtctatgcc agtgaaaaag agcaacatgg agacctgtgc agaccagggg agcatgtaac 721 ttcaaggttt aaaaacgagg tggagaggat gggttttgat atgaacaacg cctggaggat 781 ttccaacatc aatgagaagt acaaattatg tggtagctat cctcaagagc tcatagtgcc 841 tgcctggatc actgacaaag aactggaaag tgtatcaagt ttcaggtcct ggaagcgcat 901 ccctgccgtc atctacaggc accagagcaa tggagctgtc attgcccgct gtggacagcc 961 agaggttagc tggtggggct ggcgaaatgc agatgatgag catctggtac agtcagtagc 1021 caaagcttgt gcctctgact cccgatcgag tggcagcaag ctgtcaacta ggaacacttc 1081 tcgagacttt cccaatgggg gagacctttc tgacgtggag ttcgattctt ctctgtcaaa 1141 tgcttcagga gcagagagtt tagccatcca accgcagaag cttttgatct tggatgcacg 1201 ctcctatgca gctgctgtgg caaaccgagc caaaggagga ggctgcgaat gcccagagta 1261 ttacccaaac tgtgaagttg tgtttatggg gatggcaaac attcattcta ttcggaggag 1321 ttttcagtct ctgcggttgc tgtgcactca gatgccagat ccgggaaatt ggctatcagc 1381 tcttgaaagc acaaaatggc tccatcactt gtctgtgctt ctgaaatcag cgcttctggt 1441 agtgcatgct gtggatcagg atcagcggcc ggtgctagta cactgctcag atggctggga 1501 ccgcaccccc cagattgtgg cattggctaa gctcttgctg gacccttatt accgaaccat 1561 agagggtttc caggtcctcg tggaaatgga gtggctggat tttggccata aatttgctga 1621 ccggtgtggt catggggaga actcggatga tctgaatgaa cgttgcccag tgtttctgca 1681 gtggcttgac tgtgttcatc agcttcagag gcaatttcct tgctcttttg agttcaatga 1741 agcattcctt gtgaaactgg tgcagcatac ctattcctgc ctgtttggaa cattcctgtg 1801 caacaacgcc aaggagagag gggaaaagca tactcaggaa cggacatgtt ccgtgtggtc 1861 acttcttcgg gcaggcaaca aggctttcaa aaacctactg tattcctctc agtcagaagc 1921 cgtgctgtac cctgtgtgcc atgtgcgtaa cctgatgctg tggagtgcag tgtacctgcc 1981 ctgcccatcc ccaaccaccc ctgtggacga cagctgtgca ccatacccag ccccaggcac 2041 cagccctgat gatccccccc tgagccggct accaaagact agatcatacg acaatctgac 2101 cacagcctgt gacaacacag tgcctctggc cagccggcgc tgcagcgacc ccagcctgaa 2161 cgagaagtgg caggagcacc ggcgctcact agagctgagc agcctggctg gccctggaga 2221 ggatcccctt tctgccgaca gcctagggaa gcccaccaga gtgccggggg gtgccgagct 2281 ttctgttgca gccggagtag ctgaggggca gatggagaac atcttgcagg aggccaccaa 2341 agaggagagt ggagtagagg aacctgccca cagggcaggc attgagatac aggagggtaa 2401 agaggaccct ctcttagaaa aggagagcag gaggaagaca cctgaggcct cagccattgg 2461 acttcaccaa gacccagaac tgggtgatgc tgctctgagg agccatctgg atatgagctg 2521 gcctctgttc tcacagggca tttctgaaca gcagagtggg ctcagtgttc tcctcagttc 2581 tctccaggtc ccccccaggg gagaggattc cctggaggtc cctgtggagc agtttcgaat 2641 agaagagatt gcagagggta gggaggaagc agttcttcca atcccagtag atgcaaaagt 2701 tggctatggt acctcacagt catgttctct gctaccttcc caagtccctt ttgagaccag 2761 aggaccaaac gtggacagtt ctacagacat gttagtggaa gataaggtga agtcagtaag 2821 tgggccccaa ggtcatcata gatcttgcct tgtaaatagt ggcaaggaca ggcttcctca 2881 gaccatggaa cccagccctt cagagacaag cctggtcgag aggccccaag tggggtctgt 2941 ggtgcatagg acttcccttg gcagcactct cagcctgaca cgttcccctt gtgccttgcc 3001 tttagccgaa tgtaaagagg ggcttgtgtg caatggtgcc ccagagactg aaaacagggc 3061 ctcagagcag cccccaggtc ttagcaccct ccagatgtac cccacaccca atgggcattg 3121 cgccaatggg gaggctggta ggagcaagga ctcactgagc cgtcagctgt ctgctatgag 3181 ctgcagctct gcccacttac actcaaggaa cttgcaccac aagtggctgc atagccactc 3241 aggaaggcca tctgcaacca gcagccccga ccagccttcc cgcagccacc tggacgatga 3301 tggcatgtca gtgtacacag acacgatcca acagcgcctg cgtcagattg agtcaggcca 3361 ccagcaggaa gtagaaactt tgaagaaaca agtccaggag ctgaagagtc gcctggagag 3421 ccagtacctg accagctccc tacactttaa tggagacttt ggggatgagg tgacttcaat 3481 ccccgactcg gaaagcaatc tggatcagaa ctgtttgtct cgctgcagca cagagatttt 3541 ctctgaagcc agctgggagc aggtggataa acaggacaca gagatgaccc gttggcttcc 3601 tgaccacctg gccgcccact gctatgcgtg cgacagtgcc ttctggcttg ccagcaggaa 3661 gcaccactgc aggaattgtg ggaacgtatt ctgctccagt tgttgtaacc agaaggttcc 3721 agttcccagc cagcagctct ttgaacccag tcgagtatgc aagtcttgct atagcagcct 3781 acatcccaca agctccagca ttgaccttga actggataag cccattgctg ccacttccaa 3841 ctgaagctca gtgacctggg tgggcagtgg ccaagctgct gttcctatga caggcccact 3901 caacctgggc agaccgagag gcccgtgcac tttggaatgg gagcgtggaa ccacctgtac 3961 agagtgacag atttgggatg caccactgga ttgtagattg atttttcttt cctgtccccc 4021 tactccctcc ctaccttttc catcctcctc ctctgccttc aaaaaaggaa actttccctt 4081 ggttgtctta attttttttt tttttgatgg aagaccaagg gttgccaggc ccactgtaac 4141 tgccgagctg cctgctgtca cgtgacactg agggatggct tgtttcttcc gggtgggagg 4201 atggtggtca gagccaggag tatggagatc tgagaccgtg agcagggaag aaagccagtg 4261 ctaacatgca accattcctc acgccaccgc cacatcagag ctggttggga accctttgct 4321 gctggggagg ctggaagcat attccccaag agcactgccc tgggcatcat ctccctcctg 4381 cgaggagctg agccagtccc ctcacagatg gataaatgag gctgatgttt ggagggagag 4441 gcacacggta aatggcatcc ccttaggcgc ctctttggga aaggaaagtg gatgctcctt 4501 tgaggcaggc gaggggctgg gagtgggtag ccgtcatgtt gtcccccgtg ggatcccatt 4561 tttaacttga cccagattgt cttgggctcc ttttactttc agggggctgc tttcctgggc 4621 caggctgtgt agcacttccc accctcaggc atgagtacag actggtcaaa atgtttatgc 4681 agtcaaggcc aaagcctgcc ctgagcccag acacacctgg gtccccattc tggggcctgg 4741 tcccctaaac aatctcttcc tcagccagca cagggaaatc cagactcagg gttccagaaa 4801 tcccccgttc ccaaaccaaa ccaatcatgg atcttccctt taaggggtag aatcagcctt 4861 tagagttaca agcccctgga aaagggagca ggtcccatac aatcagcctt tctcctcttc 4921 tcttcttggg acaggaataa ctgctgagca gtcccccttt gccactcctg gctgtctgag 4981 tgggcacttg tctggctgct ggccagcagt gaagggccac ttggtgactt cgtagggttc 5041 cttagagaag tgacatggcc ttgaggagat tcagggcacc tttcctcagt gagctttgat 5101 atggtccaaa aacagcctta aatcaagaat atttgtggag gtaggtgtgg ggaaggttgg 5161 agaagagtta ggtttgtgct ttgtaatctt ggcatccatg ttttgtgcct gccctccctc 5221 ttggggaggc catgcttgac tctgctgaaa gctgctaact ggtgacaggg tggatcatgg 5281 tgaggactag gggtaaggtc agggaatgct taagctctgg ccctgccctg tattcctctt 5341 ccccttggta gcagttccct gaccaagggc aaggtgtgtc tcaggaaggt ggttctgtgc 5401 atgcctgggt gtggttaagg cgtggcccag aaggcttccc tggtgctctc agtccaggca 5461 aagcccagag catctgagca cgtctctgac ttccagtggc aacacagttt gaacgtggtg 5521 aaacagggtg tagttctttt ccacattctg tgtcatctag tcgtgcaggt agggaaaaag 5581 ttggtctaat tgaagattgt gcatttccta gtgacaggtg ccaagaggtt atgatacggg 5641 tttcttgggt ctgatgtaca gtgtgaataa atgcttgcag ctcttagctc ttttttgatc 5701 gatgaagcac ttttttatta atattttcct ttgttaaagg aggaaccgta actctccata 5761 gctgtacata taaccctttt ctcctaaaga ggagtcagtc agtgctccta tatttttcat 5821 tttttgtcaa agcaagaagt aaatacttta gaattgttaa atatataaat aaagcaaata 5881 aagttg // LOCUS AB002370 5704 bp mRNA PRI 24-JUL-1997 DEFINITION Human mRNA for KIAA0372 gene, complete cds. ACCESSION AB002370 NID g2280483 KEYWORDS KIAA0372. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HH0270. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5704) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 COMMENT Sequence updated (22-Jul-1997). FEATURES Location/Qualifiers source 1..5704 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HH0270" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 298..4992 /gene="KIAA0372" CDS 298..4992 /gene="KIAA0372" /codon_start=1 /db_xref="PID:d1021668" /db_xref="PID:g2224685" /translation="MSSKEVKTALKSARDAIRNKEYKEALKHCKTVLKQEKNNYNAWV FIGVAAAELEQPDQAQSAYKKAAELEPDQLLAWQGLANLYEKYNHINAKDDLPGVYQK LLDLYESVDKQKWCDVCKKLVDLYYQEKKHLEVARTWHKLIKTRQEQGAENEELHQLW RKLTQFLAESTEDQNNETQQLLFTAFENALGLSDKIPSEDHQVLYRHFIQSLSKFPHE SARLKKACEGMINIYPTVQYPLEVLCLHLIESGNLTDEGQQYCCRLVEMDSKSGPGLI GLGIKALQDKKYEDAVRNLTEGLKESPVCTSGWYHLAEAQVKMHRPKEAVLSCSQALK IVDNLGASGNSLYQRNLCLHLKAEALIKLSDYDSSEEAIRTLDQISDADNIPGLLVLK SLAYRNKGSFDEAAKIMEDLLSSYPDLAEVHALEALIHFTKKDYLQAEKCFQRALEKD TEVAEYHYQLGLTYWFMGEETRKDKTKALTHFLKAARLDTYMGKVFCYLGHYYRDVVG DKNRARGCYRKAFELDDTDAESGAAAVDLSVELEDMEMALAILTTVTQKASAGTAKWA WLRRGLYYLKAGQHSQAVADLQAALRADPKDFNCWESLGEAYLSRGGYTTALKSFTKA SELNPESIYSVFKVAAIQQILGKYKEAVAQYQMIIKKKEDYVPALKGLGECHLMMAKA ALVDYLDGKAVDYIEKALEYFTCALQHRADVSCLWKLAGDACTCLYAVAPSKVNVHVL GVLLGQKEGKQVLKKNELLHLGGRCYGRALKLMSTSNTWCDLGINYYRQAQHLAETGS NMNDLKELLEKSLHCLKKAVRLDSNNHLYWNALGVVACYSGIGNYALAQHCFIKSIQS EQINAVAWTNLGVLYLTNENIEQAHEAFKMAQSLDPSYLMCWIGQALIAEAVGSYDTM DLFRHTTELNMHTEGALGYAYWVCTTLQDKSNRETELYQYNILQMNAIPAAQVILNKY VERIQNYAPAFTMLGYLNEHLQLKKEAANAYQRAILLLQTAEDQDTYNVAIRNYGRLL CSTGEYDKAIQAFKSTPLEVLEDIIGFALALFMKGLYKESSKAYERALSIVESEQDKA HILTALAITEYKQGKTDVAKTLLFKCSILKEPTTESLQALCALGLAMQDATLSKAALN ELLKHIKHKDSNYQRCLLTSAIYALQGRSVAVQKQISKAVHSNPGDPALWSLLSRVVA QYAQRNAKGGVVAGNVAHILDSNHGKKALLYTAVNQLAMGSSSAEDEKNTALKTIQKA ALLSPGDPAIWAGLMAACHADDKLALVNNTQPKRIDLYLALLSAVSASIKDEKFFENY NQSLEKWSLSQAVTGLIDTGRISEAETLCTKNLKSNPDQPAVILLLRQVQCKPLLESQ KPLPDAVLEELQKTVMSNSTSVPAWQWLAHVYQSQGMMRAAEMCYRKSLQLASQRGSW SGKLSSLLRLALLALKVCMANISNDHWPSLVQEATTEALKLCFCPLAVLLQALLQFKR KMGARETRRLLERVVYQPGYPKSIASTARWYLLRHLYAKDDYELIDVLVNNAKTHGDT RALELNQRLSSQ" BASE COUNT 1901 a 1042 c 1206 g 1555 t ORIGIN 1 gagtcctggg aaaggcgcat gcgctctcat tgcagcctcg gcgtttgtag aagaggagca 61 tctgctccag atggaaaatg ggtaaggaac aatcaaattc aattcgggaa gaatatagag 121 gtggtaataa ttggtaaaag tctgcagggt tttttttaac caaaggaaac tggtttgaga 181 aaattgtgaa atcagcattg aaatttgtta cctactccaa tcaagatttg caaactacaa 241 aaaaatcatt ctgaagcttt cacctataaa tatataccaa gataatttca gataagaatg 301 tccagcaagg aagtgaagac tgctctaaaa agtgctagag atgcaatcag aaacaaagaa 361 tacaaagaag ctttgaaaca ctgtaagaca gtgttaaagc aagagaaaaa taactataat 421 gcctgggttt ttattggcgt tgctgcagct gaactagaac aacctgatca ggcccagagt 481 gcctataaaa aagctgctga attagagcca gaccaattac tagcttggca ggggttagca 541 aacttgtatg agaaatataa tcacataaat gctaaggatg acttgcctgg tgtttaccaa 601 aagctcctgg atctttatga gagtgttgac aagcagaagt ggtgtgatgt ctgcaagaaa 661 cttgtggatc tatattacca agaaaagaaa cacctagagg tggctcgaac atggcacaag 721 ttgataaaaa cacggcagga acaaggtgca gaaaatgaag agcttcatca actatggaga 781 aaattgactc agttcctggc tgaaagtaca gaggaccaga ataatgaaac tcagcaattg 841 ctttttactg cttttgagaa tgcactggga ttatcagata agattcctag tgaagatcac 901 caagtacttt ataggcattt cattcagagt ttatccaaat ttcctcatga gtctgctaga 961 ttgaagaagg cctgtgaagg aatgataaac atctatccta ctgtacagta tccattagaa 1021 gtgctttgtt tgcatttaat tgaatcagga aatcttactg atgaggggca gcagtattgt 1081 tgtagattag tggaaatgga ttcaaaaagt ggtccaggcc tcattggctt aggcattaaa 1141 gcattacaag acaaaaagta tgaagatgct gttaggaacc taacagaagg gttaaaggaa 1201 agccctgtct gcacaagtgg atggtatcat ctggcagaag cccaagtcaa aatgcataga 1261 cctaaagaag ctgttctttc atgcagtcaa gctctgaaga tcgtagataa tcttggtgcg 1321 tctggtaaca gtctttatca gaggaatctt tgtcttcatt tgaaagcaga ggctttgatt 1381 aaactctcag attatgactc ttcagaggaa gcaattcgta cgcttgatca gatttctgat 1441 gcagataata tcccaggact tttggttctc aaaagcttgg cctatcggaa caaaggttca 1501 tttgatgaag ctgcaaagat tatggaagac cttctctctt cttaccctga cctagctgaa 1561 gttcatgccc ttgaggcttt gattcatttc accaaaaagg actatctaca agcagaaaaa 1621 tgttttcaga gagctcttga gaaagatacc gaagttgcag aatatcatta ccaacttgga 1681 ttaacatact ggttcatggg tgaagagaca agaaaagata aaacaaaggc tcttacccac 1741 tttctgaagg ctgcaagact ggatacatat atgggcaaag ttttctgcta tttaggtcat 1801 tattatagag acgtagtggg agataaaaac agagctcgtg gatgttatag gaaagccttt 1861 gaattagatg acactgatgc tgaatctgga gctgcagcag ttgacctaag tgtggagctt 1921 gaagatatgg aaatggcttt agctatccta acaacagtaa ctcaaaaggc aagtgctgga 1981 acggcaaaat gggcctggct taggcgagga ctatactatt tgaaagctgg tcagcattct 2041 caagcagtgg ctgatttaca ggcagcatta agagcagacc caaaggactt caattgttgg 2101 gaatcgttag gagaagcata cttaagcaga ggaggctaca caacagcctt gaagtccttc 2161 acaaaagcca gtgagctgaa cccagaatcc atatacagtg tgtttaaggt tgcagcaata 2221 cagcaaatcc taggcaaata taaggaggct gtagctcaat accagatgat cattaaaaag 2281 aaagaagatt atgtgcctgc tttaaaaggt ttgggtgaat gccatcttat gatggcaaaa 2341 gcagctctag ttgattatct tgatggaaaa gccgtagact acatagaaaa agcactggaa 2401 tattttactt gtgctctaca gcatcgagct gatgtgtcct gcctctggaa gctagctggg 2461 gatgcttgta cctgtctgta tgctgtcgca ccatctaaag tgaatgttca tgttttagga 2521 gtccttctag gtcagaaaga aggaaaacaa gtattaaaga aaaatgagct cctccacctt 2581 ggaggaaggt gttatggtcg tgcattaaaa ctgatgtcta catctaatac atggtgtgac 2641 cttggaatta attattatcg ccaagcacaa catctagcag aaacaggcag caacatgaat 2701 gatcttaagg agttgctgga gaaatcttta cattgtctga aaaaagcagt gagactcgac 2761 agtaataatc acttatactg gaatgctctt ggtgtggttg catgttacag tggtattgga 2821 aattatgccc ttgctcagca ctgtttcatc aaatcaatcc agtcagaaca aattaatgct 2881 gttgcatgga ccaacttggg agtgttatac ctcacaaatg aaaacattga gcaagctcat 2941 gaggctttca aaatggctca atcccttgat ccatcttatt taatgtgctg gattggacaa 3001 gctcttattg ctgaggcagt tggaagttat gacaccatgg atctcttcag gcacactaca 3061 gaactaaata tgcatactga aggagcatta ggttatgcgt attgggtctg cacaacattg 3121 caagataaaa gcaacagaga aacagagctg taccagtaca acatcctcca gatgaatgct 3181 attccagcag cacaagttat tttgaataaa tatgtagaaa gaattcagaa ttatgcccca 3241 gctttcacaa tgttgggtta cttaaacgaa catctacaac tgaaaaagga agcagcaaat 3301 gcataccaaa gggcaatttt gttgttacag actgcagaag accaagatac ttacaatgtt 3361 gcaataagaa attacggcag attgttatgt tccactggtg aatatgataa agctatccag 3421 gcttttaagt caacacccct tgaagtgtta gaagacatca taggttttgc attggcttta 3481 ttcatgaagg ggctttataa agagagcagc aaagcctatg agagagcctt gtctattgtt 3541 gaatcggagc aagacaaagc ccatatcttg acagctctgg caataactga atataaacaa 3601 ggaaaaacgg atgtagccaa gacattgcta tttaaatgct ctatcttaaa ggaaccaacc 3661 acagaaagcc ttcaagccct gtgtgctcta gggttggcaa tgcaggatgc tacactgtca 3721 aaagcagcac ttaatgagtt actgaagcac atcaaacaca aagacagtaa ttatcagagg 3781 tgccttctta catcagcgat ttatgcactc caaggccgca gtgtggctgt gcaaaaacaa 3841 atatctaaag ctgttcacag caaccctggt gaccctgctc tttggtctct gttgtctcga 3901 gttgttgcac agtatgctca acgaaatgca aagggaggtg ttgtagcagg aaatgtggct 3961 catattctgg actcaaatca tggaaagaag gcattactgt acactgcggt aaatcagttg 4021 gctatgggaa gcagttcagc agaagatgaa aaaaatactg cactaaagac cattcagaag 4081 gcagctctcc tttctccagg tgatcctgct atctgggctg ggctaatggc agcctgtcac 4141 gctgatgata aactggcctt agtgaacaac actcagccaa agaggataga tttatacttg 4201 gcactgttat ctgctgtttc tgcttcaatt aaagacgaaa aattctttga aaattacaac 4261 cagtcccttg aaaagtggtc tctctcacaa gctgtcactg gtctaataga cacaggaaga 4321 atatctgaag ctgaaactct ctgcacaaag aatttaaaaa gtaaccctga tcagccagcc 4381 gttatcttac ttttgagaca agttcagtgt aaaccactcc tggagtcaca aaagccactc 4441 ccagatgctg tacttgaaga actacaaaaa acagtcatgt ccaactcaac ctctgttcca 4501 gcttggcagt ggctggcaca tgtgtatcaa tcccaaggaa tgatgagagc tgcagagatg 4561 tgttacagaa agagtctaca attggcatcc caacggggca gttggagtgg gaagctctca 4621 agtctgttga gactagcact acttgcatta aaagtctgta tggctaacat ttccaatgat 4681 cactggccat ctttggttca agaggctaca actgaggcct tgaagctttg cttttgtcca 4741 ctggctgttc ttttacaagc tttgttacaa ttcaaacgca aaatgggggc aagagagaca 4801 cggcgtcttt tggaaagagt ggtatatcag cctgggtatc ccaaatctat tgcatcaact 4861 gcacgttggt acctactgag acacttatat gccaaagatg actatgagct tattgacgtg 4921 ctggtaaaca atgccaaaac tcatggagat acaagagcat tggaactgaa tcagagattg 4981 tcctcacaat aacattggat tattttatag taaggaagca aagaaaaagc tgtaagaatg 5041 aagcaatgaa tcaagacttc tacccaaagc aacatttttt taaactatat ttattccttt 5101 tctaaaggaa tctagagaag ttgaagtttt taagttagga atgcaatttt ctgttaactc 5161 caaacctgat tttaatctga aaaaaaaaaa atttaatttg ggaaaaccag aatgaaagga 5221 aaaccattat tgccttggtt gcttaacatt atttgtaaac cattattttc tgcatcttgc 5281 atggtgcaca atagaatatc ttttactgta atcccttaca ataaagcctt aatagccatt 5341 ttctatgtaa tatgcaaaag tagattagca caatgcacaa ttttcttttg ttaaaaatca 5401 aattcaaaga tttaattctt gctatgaatt ctaaagttcg gcaaaccaat tcatcataaa 5461 atccaaataa tcttgtaacc tatttatcta gtgattcatc tccaattctg ttgaaaaagc 5521 ataatataaa tgttgatgag actagactct aatggatatg tttatataat tccaaacact 5581 caggtgtgtg aatgcattta aaaacattaa tggaaaataa tgctgataat atttaattga 5641 tcatgcaatt ccttcaatta tgatggaaag acttgaactt tctgaaataa aacaaaaata 5701 cagc // LOCUS AB002371 5967 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0373 gene, complete cds. ACCESSION AB002371 NID g2224686 KEYWORDS KIAA0373. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HH0281. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5967) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..5967 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HH0281" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 1181..5800 /gene="KIAA0373" CDS 1181..5800 /gene="KIAA0373" /codon_start=1 /db_xref="PID:d1021669" /db_xref="PID:g2224687" /translation="MAIFKIAALQKVVDNSVSLSELELANKQYNELTAKYRDILQKDN MLVQRTSNLEHLECENISLKEQVESINKELEITKEKLHTIEQAWEQETKLGNESSMDK AKKSITNSDIVSISKKITMLEMKELNERQRAEHCQKMYEHLRTSLKQMEERNFELETK FAELTKINLDAQKVEQMLRDELADSVSKAVSDADRQRILELEKNEMELKVEVSKLREI SDIARRQVEILNAQQQSRDKEVESLRMQLLDYQAQSDEKSLIAKLHQHNVSLQLSEAT ALGKLESITSKLQKMEAYNLRLEQKLDEKEQALYYARLEGRNRAKHLRQTIQSLRRQF SGALPLAQQEKFSKTMIQLQNDKLKIMQEMKNSQQEHRNMENKTLEMELKLKGLEELI STLKDTKGAQKVINWHMKIEELRLQELKLNRELVKDKEEIKYLNNIISEYERTISSLE EEIVQQNKFHEERQMAWDQREVDLERQLDIFDRQQNEILNAAQKFEEATGSIPDPSLP LPNQLEIALRKIKENIRIILETRATCKSLEEKLKEKESALRLAEQNILSRDKVINELR LRLPATAEREKLIAELGRKEMEPKSHHTLKIAHQTIANMQARLNQKEEVLKKYQRLLE KAREEQREIVKKHEEDLHILHHRLELQADSSLNKFKQTAWDLMKQSPTPVPTNKHFIR LAEMEQTVAEQDDSLSSLLVKLKKVSQDLERQREITELKVKEFENIKLQLQENHEDEV KKVKAEVEDLKYLLDQSQKESQCLKSELQAQKEANSRAPTTTMRNLVERLKSQLALKE KQQKALSRALLELRAEMTAAAEERIISATSQKEAHLNVQQIVDRHTRELKTQVEDLNE NLLKLKEALKTSKNRENSLTDNLNDLNNELQKKQKAYNKILREKEEIDQENDELKRQI KRLTSGLQGKPLTDNKQSLIEELQRKVKKLENQLEGKVEEVDLKPMKEKNAKEELIRW EEGKKWQAKIEGIRNKLKEKEGEVFTLTKQLNTLKDLFAKADKEKLTLQRKLKTTGMT VDQVLGIRALESEKELEELKKRNLDLENDILYMRAHQALPRDSVVEDLHLQNRYLQEK LHALEKQFSKDTYSKPSISGIESDDHCQREQELQKENLKLSSENIELKFQLEQANKDL PRLKNQVRDLKEMCEFLKKEKAEVQRKLGHVRGSGRSGKTIPELEKTIGLMKKVVEKV QRENEQLKKASGILTSEKMANIEQENEKLKAELEKLKAHLGHQLSMHYESKTKGTEKI IAENERLRKELKKETDAAEKLRIAKNNLEILNEKMTVQLEETGKRLQFAESRGPQLEG ADSKSWKSIVVTRMYETKLKELETDIAKKNQSITDLKQLVKEATEREQKVNKYNEDLE QQIKILKHVPEGAETEQGLKRELQVLRLANHQLDKEKAELIHQIEANKDQSGAESTIP DADQLKEKIKDLETQLKMSDLEKQHLKEEIKKLKKELENFDPSFFEEIEDLKYNYKEE VKKNILLEEKVKKLSEQLGVELTSPVAASEEFEDEEESPVNFPIY" BASE COUNT 2370 a 903 c 1148 g 1546 t ORIGIN 1 aagcttaata ctgagcatca agaaattctt taataaatat aagtgatatt tattaagacg 61 tgtaataagg aaatgttcat gtcttatttt tgtgttagat ttttttagaa tctacttttg 121 ttagagtttt ataaatacag ttagtgtttg agatagaaag agaaaagaat tagttttctt 181 cctcttctac ctgctcatga acttgatttt tttctcccaa caattgaaga gccaagaaaa 241 agggagattc ttaagagatg ggaaatagaa tctcatctac ccctgtttcc ctcagaacag 301 tgaaactgaa tcttaagggt aagatagaat agtgtgtact taacttagat ggagaagaaa 361 ggctgccaaa atgagatctg aagcgctatt acaaatattt ccatcattac tgtacttcag 421 aatgaattac aaccgtaagt ttttttactt cctcattcat aaatttgatt attccttata 481 ccacttctca gctttcatca ttctttattg tacttttcta tgtaatgttt gcctattata 541 cagcaactta agagaactgt aagtttggac atttcatttt ggtgttgata atagaatatc 601 tttgaatagt tctatagttg atgagtagaa ccatgaacca agtaacttaa agtccttgat 661 gttatttatt acagagaact ataatagaag ctctcccgct aatgtttcca tcatgtgtac 721 aaaaagtttt cttgttatta aagccagtcc gtttaactta caataagcat aaatagctaa 781 gctgtgaaag ttacctgtga taatgctaat tttcccattt attaaaaggc aagttgtttt 841 ccgatcataa gaaatttaga aaagccatcc aaagataaat tccgagtgat atattcctgc 901 tgtttgttat gttttctcaa attaattgag ttttatttta caatgacagg agttattaaa 961 gtattttatt tttattatga ttaagatttt caaagtaaca tttcttatat gaaagaaatt 1021 atgttaatgc atgtttttct tacatgggaa atcatatatt ttaaaaatga ttttaaaatt 1081 cgttttactt taagttgtat tatctttctc aaaagtggct agtgcttcac cagaaaaaaa 1141 gacaccagca taactcagtg tatctttatt tacataggaa atggccattt tcaagattgc 1201 agctctccaa aaagttgtag ataatagtgt ttctttgtct gaactagaac tggctaataa 1261 acagtacaat gaactgactg ctaagtacag ggacatcttg caaaaagata atatgcttgt 1321 tcaaagaaca agtaacttgg aacacctgga gtgtgaaaac atctccttaa aagaacaagt 1381 ggagtctata aataaagaac tggagattac caaggaaaaa cttcacacta ttgaacaagc 1441 ctgggaacag gaaactaaat taggtaatga atctagcatg gataaggcaa agaaatcaat 1501 aaccaacagt gacattgttt ccatttcaaa aaaaataact atgctggaaa tgaaggaatt 1561 aaatgaaagg cagcgggctg aacattgtca aaaaatgtat gaacacttac ggacttcgtt 1621 aaagcaaatg gaggaacgta attttgaatt ggaaaccaaa tttgctgagc ttaccaaaat 1681 caatttggat gcacagaagg tggaacagat gttaagagat gaattagctg atagtgtgag 1741 caaggcagta agtgatgctg ataggcaacg gattctagaa ttagagaaga atgaaatgga 1801 actaaaagtt gaagtgtcaa aactgagaga gatttctgat attgccagaa gacaagttga 1861 aattttgaat gcacaacaac aatctaggga caaggaagta gagtccctca gaatgcaact 1921 gctagactat caggcacagt ctgatgaaaa gtcgctcatt gccaagttgc accaacataa 1981 tgtctctctt caactgagtg aggctactgc tcttggtaag ttggagtcaa ttacatctaa 2041 actgcagaag atggaggcct acaacttgcg cttagagcag aaacttgatg aaaaagaaca 2101 ggctctctat tatgctcgtt tggagggaag aaacagagca aaacatctgc gccaaacaat 2161 tcagtctcta cgacgacagt ttagtggagc tttacccttg gcacaacagg aaaagttctc 2221 caaaacaatg attcaactac aaaatgacaa acttaagata atgcaagaaa tgaaaaattc 2281 tcaacaagaa catagaaata tggagaacaa aacattggag atggaattaa aattaaaggg 2341 cctggaagag ttaataagca ctttaaagga taccaaagga gcccaaaagg taatcaactg 2401 gcatatgaaa atagaagaac ttcgtcttca agaacttaaa ctaaatcggg aattagtcaa 2461 ggataaagaa gaaataaaat atttgaataa cataatttct gaatatgaac gtacaatcag 2521 cagtcttgaa gaagaaattg tgcaacagaa caagtttcat gaagaaagac aaatggcctg 2581 ggatcaaaga gaagttgacc tggaacgcca actagacatt tttgaccgtc agcaaaatga 2641 aatactaaat gcggcacaaa agtttgaaga agctacagga tcaatccctg accctagttt 2701 gccccttcca aatcaacttg agatcgctct aaggaaaatt aaggagaaca ttcgaataat 2761 tctagaaaca cgggcaactt gcaaatcact agaagagaaa ctaaaagaga aagaatctgc 2821 tttaaggtta gcagaacaaa atatactgtc aagagacaaa gtaatcaatg aactgaggct 2881 tcgattgcct gccactgcag aaagagaaaa gctcatagct gagctaggca gaaaagagat 2941 ggaaccaaaa tctcaccaca cattgaaaat tgctcatcaa accattgcaa acatgcaagc 3001 aaggttaaat caaaaagaag aagtattaaa gaagtatcaa cgtcttctag aaaaagccag 3061 agaggagcaa agagaaattg tgaagaaaca tgaggaagac cttcatattc ttcatcacag 3121 attagaacta caggctgata gttcactaaa taaattcaaa caaacggctt gggatttaat 3181 gaaacagtct cccactccag ttcctaccaa caagcatttt attcgtctgg ctgagatgga 3241 acagacagta gcagaacaag atgactctct ttcctcactc ttggtcaaac taaagaaagt 3301 atcacaagat ttggagagac aaagagaaat cactgaatta aaagtaaaag aatttgaaaa 3361 tatcaaatta cagcttcaag aaaaccatga agatgaagtg aaaaaagtaa aagcggaagt 3421 agaggattta aagtatcttc tggaccagtc acaaaaggag tcacagtgtt taaaatctga 3481 acttcaggct caaaaagaag caaattcaag agctccaaca actacaatga gaaatctagt 3541 agaacggcta aagagccaat tagccttgaa ggagaaacaa cagaaagcac ttagtcgggc 3601 acttttagaa ctccgggcag aaatgacagc agctgctgaa gaacgtatta tttctgcaac 3661 ttctcaaaaa gaggcccatc tcaatgttca acaaatcgtt gatcgacata ctagagagct 3721 aaagacacaa gttgaagatt taaatgaaaa tcttttaaaa ttgaaagaag cacttaaaac 3781 aagtaaaaac agagaaaact cactaactga taatttgaat gacttaaata atgaactgca 3841 aaagaaacaa aaagcctata ataaaatact tagagagaaa gaggaaattg atcaagagaa 3901 tgatgaactg aaaaggcaaa ttaaaagact aaccagtgga ttacagggca aacccctgac 3961 agataataaa caaagtctaa ttgaagaact ccaaaggaaa gttaaaaaac tagagaacca 4021 attagaggga aaggtggagg aagtagacct aaaacctatg aaagaaaaga atgctaaaga 4081 agaattaatt aggtgggaag aaggtaaaaa gtggcaagcc aaaatagaag gaattcgaaa 4141 caagttaaaa gagaaagagg gggaagtctt tactttaaca aagcagttga atactttgaa 4201 ggatcttttt gccaaagccg ataaagagaa acttactttg cagaggaaac taaaaacaac 4261 tggcatgact gttgatcagg ttttgggaat acgagctttg gagtcagaaa aagaattgga 4321 agaattaaaa aagagaaatc ttgacttaga aaatgatata ttgtatatga gggcccacca 4381 agctcttcct cgagattctg ttgtagaaga tttacattta caaaatagat acctccaaga 4441 aaaacttcat gctttagaaa aacagttttc aaaggataca tattctaagc cttcaatttc 4501 aggaatagag tcagatgatc attgtcagag agaacaggag cttcagaagg aaaacttgaa 4561 gttgtcatct gaaaatattg aactgaaatt tcagcttgaa caagcaaata aagatttgcc 4621 aagattaaag aatcaagtca gagatttgaa ggaaatgtgt gaatttctta agaaagaaaa 4681 agcagaagtt cagcggaaac ttggccatgt tagagggtct ggtagaagtg gaaagacaat 4741 cccagaactg gaaaaaacca ttggtttaat gaaaaaagta gttgaaaaag tccagagaga 4801 aaatgaacag ttgaaaaaag catcaggaat attgactagt gaaaaaatgg ctaatattga 4861 gcaggaaaat gaaaaattga aggctgaatt agaaaaactt aaagctcatc ttgggcatca 4921 gttgagcatg cactatgaat ccaagaccaa aggcacagaa aaaattattg ctgaaaatga 4981 aaggcttcgt aaagaactta aaaaagaaac tgatgctgca gagaaattac ggatagcaaa 5041 gaataattta gagatattaa atgagaagat gacagttcaa ctagaagaga ctggtaagag 5101 attgcagttt gcagaaagca gaggtccaca gcttgaaggt gctgacagta agagctggaa 5161 atccattgtg gttacaagaa tgtatgaaac caagttaaaa gaattggaaa ctgatattgc 5221 caaaaaaaat caaagcatta ctgaccttaa acagcttgta aaagaagcaa cagagagaga 5281 acaaaaagtt aacaaataca atgaagacct tgaacaacag attaagattc ttaaacatgt 5341 tcctgaaggt gctgagacag agcaaggcct taaacgggag cttcaagttc ttagattagc 5401 taatcatcag ctggataaag agaaagcaga attaatccat cagatagaag ctaacaagga 5461 ccaaagtgga gctgaaagca ccatacctga tgctgatcaa ctaaaggaaa aaataaaaga 5521 tctagagaca cagctcaaaa tgtcagatct agaaaagcag catttgaagg aggaaataaa 5581 gaagctgaaa aaagaactgg aaaattttga tccttcattt tttgaagaaa ttgaagatct 5641 taagtataat tacaaggaag aagtgaagaa gaatattctc ttagaagaga aggtaaaaaa 5701 actttcagaa caattgggag ttgaattaac tagccctgtt gctgcttctg aagagtttga 5761 agatgaagaa gaaagtcctg ttaatttccc catttactaa aggtcaccta taaactttgt 5821 ttcatttaac tatttattaa ctttataagt taaatatact tggaaataag cagttctccg 5881 aactgtagta tttccttctc actaccttgt acctttatac ttagattgga attcttaata 5941 aataaaatta tatgaaattt tcaactt // LOCUS AB002372 5530 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0374 gene, complete cds. ACCESSION AB002372 NID g2224688 KEYWORDS KIAA0374. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HH0327. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5530) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..5530 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HH0327" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 643..2259 /gene="KIAA0374" CDS 643..2259 /gene="KIAA0374" /codon_start=1 /db_xref="PID:d1021670" /db_xref="PID:g2224689" /translation="MPGSGPSERMTWPGPALSAGPPTRPLSSAPGIPPIPPLTRTHSL MAMSLPGSRRTSAGSRRRTSPPVSVRDAYGTSSLSSSSNSGSYKGSDSSPTPRRSMKY TLCSDNHGIKPPTPEQYLTPLQQKEVCIRHLKARLKDTQDRLQDRDTEIDDLKTQLSR MQEDWIEEECHRVEAQLALKEARKEIKQLKQVIDTVKNNLIDKDKGLQKYFVDINIQN KKLETLLHSMEVAQNGMAKEDGTGESAGGSPARSLTRSSTYTKLSDPAVCGDRQPGDP SSGSAEDGADSGFAAADDTLSRTDALEASSLLSSGVDCGTEETSLHSSFGLGPRFPAS NTYEKLLCGMEAGVQASCMQERAIQTDFVQYQPDLDTILEKVTQAQVCGTDPESGDRC PELDAHPSGPRDPNSAVVVTVGDELEAPEPITRGPTPQRPGANPNPGQSVSVVCPMEE EEEAAVAEKEPKSYWSRHYIVDLLAVVVPAVPTVAWLCRSQRRQGQPIYNISSLLRGC CTVALHSIRRISCRSLSQPSPSPAGGGSQL" BASE COUNT 1069 a 1693 c 1699 g 1069 t ORIGIN 1 ggcggcccct gcagggcagc tgaagccatg gaagcctccg caggtcgctg atcagggcca 61 ggcggctgca gcagcgactg cagaggcgct gcgccaagcc gggccggagt ggtgcgagcc 121 ggcggggctg cggagggcca gtggactcag ggttgttgag aggagtcaat ggcataatac 181 gggaagcccc gaaccacggg ccaactggga agttgatggt cggcgagacc agcccatcct 241 aatttggggt tcctggtcct gctccaggag tcctacagcc tgcagcccct accagagagg 301 gaggactgaa cagcaagggg gtgtgtgggt ctgagtatca gggtcctggg gaagaagcag 361 gcctggctcg tagagtagaa gactcggtgc cggcagtcag gaggccctgc tcctgaagcc 421 tctgtgacct tgggcacggc cttcactctg tctggggcac attcactcat ctggagaaca 481 cagggttgga ctgattactt ttagccactc ctgacagttg tggttttaag ttgggtggtg 541 ggaacctgtg caccagccct attcaattca ctggtggagg cagccgtggt ctgccaggcc 601 ctcgctcctg gggtgtcctg ccgaagggtg agaagagaag ccatgccggg cagcggcccc 661 agcgagagga tgacgtggcc tggcccggcc ctttctgcgg gccccccaac ccgccctctc 721 tcctcagccc ccgggatacc gcccatccca ccccttactc ggacccacag cctcatggcc 781 atgtccctgc caggaagtag acggacctct gctggatcac gcaggcgcac ctctccacct 841 gtgagcgtgc gggatgccta cggcacctct tcgctcagca gcagcagcaa ttctggctcc 901 tacaagggca gtgacagcag tcccacgcca aggcgctcca tgaaatacac gctgtgcagt 961 gacaaccatg gcatcaagcc cccgaccccg gagcagtacc tgacccccct gcagcagaag 1021 gaggtgtgca tccggcacct gaaagcccgg ctgaaggaca cacaggaccg gctccaggac 1081 cgggacacag agattgatga cctgaagacg cagctgtcac gcatgcagga ggactggatt 1141 gaggaggagt gccaccgcgt ggaggcccag ctggccctga aggaggcccg aaaggagatc 1201 aagcagctca agcaggtcat cgacactgtc aagaacaacc tgattgacaa ggacaagggg 1261 ctgcagaagt acttcgtgga catcaacatc cagaacaaga agctggagac gctgctgcac 1321 agcatggagg tggcccagaa tggcatggcc aaggaggatg gcactgggga gtcagccggt 1381 gggtcccctg cccgctccct cacccgcagc tccacctaca ccaagctgag tgacccggct 1441 gtctgtggtg accgccagcc gggtgatccc tccagcggct ctgctgagga tggggcagac 1501 agtggctttg cagcagccga tgacacactg agccggacgg acgcgctgga agccagcagc 1561 ctgctgtcgt cgggggtgga ctgtggcacc gaggagacct cgctgcacag ctccttcggc 1621 ctgggccccc gcttccctgc cagcaacacc tatgagaagc tgctgtgtgg catggaggct 1681 ggtgtgcagg ccagctgcat gcaggagcgt gccatccaga cagacttcgt gcagtaccag 1741 cctgaccttg acaccatcct ggagaaagtg acccaggccc aggtctgtgg gacagaccct 1801 gagtcagggg acaggtgccc agagctggat gcccaccctt cagggcccag agaccccaac 1861 tcagcagtgg tggtgacagt gggtgatgag ctagaggccc cagagcccat cacccgtgga 1921 cccaccccac agcggcctgg tgccaacccc aaccctggcc agtcggtgag cgtggtgtgc 1981 cccatggaag aggaggagga ggctgccgtg gctgagaagg agcccaagag ctactggagc 2041 cgccactaca tcgtggatct gctggctgtg gtggtgccgg ccgtgcccac ggtggcctgg 2101 ctttgccgct cccagcggcg ccagggccag cccatctaca acatcagctc cctgctgcgg 2161 ggctgctgca ctgtggcctt gcactccatc cgcaggatca gctgccgctc gctgagccag 2221 ccgagtccca gcccagcggg cggcggctcc cagctctgag ggggcccatt ccggcagcgg 2281 cgcctgcggc ctgaccactg attgtaggga tgccgttccc ccctcccttc tcccatgggc 2341 atcatcttat ttatttagtt ttgggtgtgg aactgtttct ttttttcaag atgttaaaac 2401 agtcccgtgg aaggagcagg ggttggagaa aggcatccca aagcttcgat ggagagcagg 2461 gaagggggac ccaaggcagg aggtacacca gctggacaaa ttgcagggag gggagggagc 2521 gagggccaac ccggcccctc tgtccccttg gctcttcaga cagggccagc cctgctcagg 2581 aagtctctgg ctgtcttcat gtggggaagc cgggcttgag ttgcccatag gcccctgccc 2641 tgcaccatcc tgtccagtgc cctgcgcact ccatgccgtc tcttccaagc caccttgccc 2701 gcagcccagg ctcctgggcc agtgctctct cctcaaatgg aggcagccat ggcctgaagt 2761 gcagatcact gacccagggc tcagagcaga ggccagaacc actgggccgg ccggcattcc 2821 agcctcccca gactgctgcc caccttggga ctcaggagct cagtcaaggc cacaggctgg 2881 aggagagacg gggctgggcg caaggtggcg gagggcagtg tgggttctgt gtctgtctgt 2941 tcatcccagg ctttcccgtc atccctttcc tcttggcact tctgggtgtg tcagtcatta 3001 ttcctgtgag gtagctaagc ccggcaagct cagtgctggg gtaggagggc ctgcctgagt 3061 cccagctccc agctggagaa tccaccagca caagaagagg aggcaggggc agaaacccaa 3121 gggggctccc ccagccttcc aaggtgaggc catctcatct gcaggctggg aggcagggct 3181 ggactcagga acccacagct tactgaaaaa gccagaggcc atgactgccc ccagaaactt 3241 gccccgagtt tctctggggc cctcgggccc aacttctgct ttgcactatg ttcactttgg 3301 ggttggttct cagccatcca agggtctcca gtgaggtggc tgcttgctgt ctgagatgag 3361 ggttcctaaa ccttaaacct ctctgcctct ggaggagggt ggggtattct ggcaggatga 3421 atcgcaggat ggcgcgtact gaagccacga tgttcatcca ggccaaagca gggtgtcctg 3481 ggataggttt cctaggcagg ggcgtggcag agcaggtagg cgccctgcac cctccctgct 3541 cccagcccaa ggccagtcgg cctgggaagt tcactgcccc aactcttttc ccgggaccat 3601 actgagtccc ccagccacgc tgctgacatt accattatta ttttaccaag taaactcact 3661 ccttttcccc cacaggggtt acacccatct ggttgtccca cccactcttc agaggctagg 3721 ccccacctct gggggttgga gggaccctgt gtcttactcg cccctctggc ctaagggcca 3781 ctctggttat ctgccaaggt tgcttgccct caccccaatg ctccaacagc cattgcctaa 3841 ctcatgggct ctgcccttct gctcggtgcc ctccacgtga ggcggggcac ctgcatgcac 3901 tgggaggggg cggctggccc agccctcggg gcaggagccc cctctgccac acgctttgtg 3961 cctccaaagc tccccccgcc ttggtcaggg cctcagacca gccaaccttt gtggaataag 4021 ccccagccca gccaaaccaa acccagatgc ctgaggcctg gctggggctg cccccgcagg 4081 acactgtggc catgccacgg agggggcagt ggacaaaacc aatccaaagc caagccggga 4141 ctggctgcgg acccagcctc ctgtgccgcg cactcacgga gctgcgtagt ctcctcagac 4201 atagtcaaag ctttgccgag aaaagaaatg tatgaactat atttgacaac ataaaatctc 4261 tctatttttc accactggaa tttagtcaag cttcaggccc ctctgctcct gtctgtgtct 4321 tgcgtctgtg gccttcctat tgtgtcttgt gttttggtgg atgtgacagg gctggggcca 4381 cagtctactc tgtctctgct gctagagaag ccacctgtgg aggactgggc tgtggttggg 4441 cctgaggcct gtggaggagg tgagtgtagc cagcagcggc cgtctactcc tgttctggcc 4501 tgagaccacc ggtgtgggtc acgaggaccc tggcccacag tgttgagggt cctctctttc 4561 gggggctctc ctggggcctt cgatgggctt ttcttcttgt cagtggaggg agcagctccc 4621 cactcagccc tgggacaggc cctgggactg tggtggccgg tggccctggc ccagctctgg 4681 agtgcatgtg tgtgtgctca gatcccgcat ctatgcaaag gtgcaggctg cctgtgaggc 4741 tccaggcggt ggaggtgcgg cagctgctgc tcaggtgcca tgccctgaag aggcagggta 4801 cacatggggc tgggagggca aagatggggc ggggcctccc cctgagagag ctcaccctcc 4861 acagtgaccc ttttccttcc tgcctaatac ctcctcccgt tgggctgtga cttttcctcc 4921 tgccctcagc ctctgaaaca gaaatctttg gggcctcccc tttccccagc gtctggcaag 4981 agctgatgag gtcctgagga acagtgtccc caggaaccac cttctggagc ctagcggcca 5041 gacgtggctt ctcctgagtt ctagggctcg ggccagagtg gcactactgc ccggccagtc 5101 caggctcagg caggaatcgg acgctggggc ccgggctctg cacagagagc tgggcttgag 5161 tggagtttat tgtagatttc tcccaagggg atcacttagc ttctcactga cacacccttc 5221 ccaattgcca caagcgcagg ggtatcttac agcactttgg ggaggggtgg gacaaactgc 5281 aaggttctgg ggccaaggcc tacccagggg cctctgccca gagaactcac aaccccctct 5341 ctgacctagg ggacaatgca aacaggtcac agtgcattcc catattaggc catccccttc 5401 acgaagaatt caggggatgt gggaagtggg gaggcgggga gggatcttga ctcttgtctc 5461 ctttgtcctt ttgttcagac agagttgtac ctgcagcaga caactctgaa ttaaagcatg 5521 aaaacacagc // LOCUS AB002373 5324 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0375 gene, complete cds. ACCESSION AB002373 NID g2224690 KEYWORDS KIAA0375. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HH0360. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5324) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..5324 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HH0360" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 156..4370 /gene="KIAA0375" CDS 156..4370 /gene="KIAA0375" /codon_start=1 /db_xref="PID:d1021671" /db_xref="PID:g2224691" /translation="MDSPPKLTGETLIVHHIPLVHCQVPDRQCCGGAGGGGGSTRPNP FCPPELGITQPDQDLGQADSLLFSSLHSAPGGTARSIDSTKSRSRDGRGPGAPKRHNP FLLQEGVGEPGLGDLYDDSIGDSATQQSFHLHGTGQPNFHLSSFQLPPSGPRVGRPWG TTRSRAGVVEGQEQEPVMTLDTQQCGTSHCCRPELEAETMELDECGGPGGSGSGGGAS DTSGFSFDQEWKLSSDESPRNPGCSGSGDQHCRCSSTSSQSEAADQSMGYVSDSSCNS SDGVLVTFSTLYNKMHGTPRANLNSAPQSCSDSSFCSHSDPGAFYLDLQPSPFESKMS YESHHPESGGREGGYGCPHASSPELDANCNSYRPHCEPCPAVADLTACFQSQARLVVA TQNYYKLVTCDLSSQSSPSPAGSSITSCSEEHTKISPPPGPGPDPGPSQPSEYYLFQK PEVQPEEQEAVSSSTQAAAAVGPTVLEGQVYTNTSPPNLSTGRQRSRSYDRSLQRSPP VRLGSLERMLSCPVRLSEGPAAMAGPGSPPRRVTSFAELAKGRKKTGGSGSPPLRVSV GDSSQEFSPIQEAQQDRGAPLDEGTCCSHSLPPMPLGPGMDLLGPDPSPPWSTQVCQG PHSSEMPPAGLRATGQGPLAQLMDPGPALPGSPANSHTQRDARARADGGGTESRPVLR YSKEQRPTTLPIQPFVFQHHFPKQLAKARALHSLSQLYSLSGCSRTQQPAPLAAPAAQ VSVPAPSGEPQASTPRATGRGARKAGSEPETSRPSPLGSYSPIRSVGPFGPSTDSSAS TSCSPPPEQPTATESLPPWSHSCPSAVRPATSQQPQKEDQKILTLTEYRLHGTGSLPP LGSWRSGLSRAESLARGGGEGSMATRPSNANHLSPQALKWREYRRKNPLGPPGLSGSL DRRSQEARLARRNPIFEFPGSLSAASHLNCRLNGQAVKPLPLTCPDFQDPFSLTEKPP AEFCLSPDGSSEAISIDLLQKKGLVKAVNIAVDLIVAHFGTSRDPGVKAKLGNSSVSP NVGHLVLKYLCPAVRAVLEDGLKAFVLDVIIGQRKNMPWSVVEASTQLGPSTKVLHGL YNKVSQFPELTSHTMRFNAFILGLLNIRSLEFWFNHLYNHEDIIQTHYQPWGFLSAAH TVCPGLFEELLLLLQPLALLPFSLDLLFQHRLLQSGQQQRQHKELLRVSQDLLLSAHS TLQLARARGQEGPGDVDRAAQGERVKGVGASEGGEEEEEEEETEEVAEAAGGSGRARW ARGGQAGWWYQLMQSSQVYIDGSIEGSRFPRGSSNSSSEKKKGAGGGGPPQAPPPREG VVEGAEACPASEEALGRERGWPFWMGSPPDSVLAELRRSREREGPAASPAENEEGASE PSPGGIKWGHLFGSRKAQREARPTNR" BASE COUNT 1109 a 1702 c 1543 g 970 t ORIGIN 1 ctagttctag atcgcgagcc gccgccgccg gcgcggaagg acgaggctga ggcgagaaac 61 gaggtgccaa gctctcctga tgaaatgtgt tctgccccac tgggctccag ggacagtgtc 121 tggtgctgtt gaagccctta ttcgaacttt ccagaatgga tagtccccca aagctgactg 181 gagagaccct catcgttcat cacatccccc tggtgcactg ccaagtccca gacaggcagt 241 gctgtggagg ggcaggtgga ggtggtggga gcacaagacc taatcccttc tgcccacctg 301 agctgggcat cacccagccc gatcaagacc taggacaagc tgactccctg ctattcagca 361 gcctgcactc tgctccagga ggaactgcac ggtctataga cagcaccaag agtaggagtc 421 gggatggaag aggccctgga gcccccaaac gacacaaccc cttcttgctg caggagggtg 481 tgggtgagcc aggacttggt gacctgtatg atgacagcat tggtgacagt gccacccagc 541 agtccttcca cctgcatggc actggccagc ccaactttca tctatcctct ttccagctgc 601 caccatctgg ccccagagtg ggcaggccat gggggacaac acgcagtcgg gctggagtgg 661 tggaagggca ggaacaggag ccagtgatga ccttggatac tcagcagtgc ggcaccagcc 721 actgctgccg gccagagctg gaagcagaga ctatggagct ggatgagtgt gggggacctg 781 gtgggagtgg cagtgggggt ggagccagcg atacctctgg cttttccttt gaccaggaat 841 ggaagctcag ttcagatgaa tccccaagga accctggatg ctccggctca ggggaccagc 901 actgccgctg cagtagcaca tccagtcagt ccgaggcagc tgaccagtcc atgggctatg 961 tgagcgactc ctcctgcaac agttcagatg gtgtgctggt caccttcagc accctctaca 1021 acaagatgca tggcaccccc cgtgccaatc tcaactctgc cccacagtcc tgcagcgact 1081 cttccttctg cagccactca gaccctggcg ccttctatct ggatctgcag ccctccccat 1141 ttgagtctaa gatgtcttat gagtcccatc accctgaaag tggaggaagg gaagggggct 1201 atggttgccc tcatgcctct tctcctgagc ttgatgccaa ctgcaactcc taccgcccac 1261 actgtgagcc gtgcccagca gtggctgacc tcacagcctg cttccaaagc caggcccgtc 1321 ttgttgtggc cacacaaaat tactataaac ttgtcacctg tgacctatct tcccaatcat 1381 ccccaagccc tgctggctct tccatcacta gctgctctga ggaacacacc aagataagtc 1441 ccccaccagg ccctggccca gacccaggcc ccagccagcc ctctgagtat tacctattcc 1501 agaagccaga agtccagcca gaggaacaag aagcagtgag ttcctccacc caagcagcag 1561 ctgctgtggg ccccactgtg cttgagggac aagtatacac gaatacttca ccccccaacc 1621 tcagcactgg acgtcagcgc tcccgcagct atgatcgcag cctgcagcgc agccctcctg 1681 tccgcctggg ctcgctggaa cgtatgttga gttgcccagt gcgcttgagt gagggccctg 1741 cagccatggc cgggcctggc tccccaccca ggagggtcac ctcctttgcc gagctggcca 1801 agggccggaa gaaaactgga ggctctggct cgcccccact tcgtgtgagt gttggggact 1861 cctcccagga gttctcaccc atccaagaag cccagcaaga tcggggggcc ccactggatg 1921 agggcacttg ctgtagccat agcctgccac ccatgccttt ggggccaggc atggacctac 1981 ttggcccaga cccaagtcca ccctggtcca cccaggtctg tcagggaccc cactccagtg 2041 agatgcctcc tgctggcctc agagctactg ggcaaggccc cctggctcag ctgatggatc 2101 cagggcctgc tctcccaggg agcccagcca acagccatac ccagagggat gcaagagcta 2161 gagctgacgg gggtggcacc gagagccgac cagtccttcg ctacagcaag gaacagaggc 2221 caaccacact gcccatccag cccttcgtgt tccagcacca cttccccaag cagctggcca 2281 aggcccgggc cctccacagc ctttcccagc tctacagcct ctcaggctgc agccgtacac 2341 agcagcctgc cccactggct gcccctgctg ctcaagtctc agtcccagct ccctcagggg 2401 aaccgcaggc atccactccc cgagccactg gcagaggtgc caggaaagct gggtctgagc 2461 cagagacctc tcggccatcg cccctgggca gctactcccc catccggagt gttggcccct 2521 ttgggcccag cactgactct tctgcctcca cttcgtgctc ccctccccca gagcagccca 2581 cagccacaga aagcctgccc ccatggagcc actcctgtcc ttctgctgtc cggcctgcca 2641 cctcccagca gccgcagaag gaggatcaga agatactgac cttgactgag taccggctcc 2701 atggaacagg aagcttgccg cctctgggct cctggcgatc tggcctcagc cgagcagaga 2761 gcctggcccg gggaggtggt gagggcagca tggccaccag gcccagtaat gccaaccacc 2821 tatcccctca agccctcaag tggcgggaat acaggaggaa gaacccacta gggccacctg 2881 gtttgtcagg gagcctagac cgaagatcac aagaagctcg gctggcccga agaaacccta 2941 tctttgagtt ccctggctcc ctcagtgctg ccagccatct gaactgccgg ctgaatggcc 3001 aagcagtgaa gccgttacca ctgacctgcc ctgacttcca ggaccccttt tccttgacgg 3061 agaagcctcc agctgagttt tgtctgtccc cagatggcag ctcagaggcc atttccattg 3121 acctgcttca gaaaaaaggg ctggtaaaag ctgttaacat cgctgtggac ctcattgtgg 3181 ctcattttgg cacaagccgg gatcccgggg tgaaggcaaa gctgggaaac agttctgtga 3241 gccccaatgt gggccacctg gttctgaagt acttgtgccc tgccgtccgc gccgtgctgg 3301 aggatgggct caaggccttt gtactggacg tcatcatcgg gcagcgtaag aacatgccat 3361 ggagtgtggt tgaggcttcc acacagctag gcccatccac caaggtcctg catggcctct 3421 acaacaaagt cagccaattc ccagagctca ccagtcatac catgcgcttc aacgccttca 3481 tcctcggcct gctcaacatc cggtccctgg agttctggtt taatcacctc tataaccacg 3541 aagacatcat ccagacccac taccagccct ggggcttcct gagtgcagct cataccgtgt 3601 gtcccggcct ctttgaagag ctgctgctgc tgctacagcc cctggccctg ctgcccttca 3661 gcctcgactt gctgttccag caccggctgc tgcaaagtgg gcagcagcag cggcagcaca 3721 aggaactgct gcgggtgtcc caggacctgc tgctgtctgc ccactccacg ctgcagctgg 3781 cccgggcccg gggccaggag ggccctggag acgtggacag ggcagcccaa ggggagcggg 3841 tgaagggtgt gggtgcctca gaaggtggag aagaggaaga ggaagaagag gagacagaag 3901 aggtggcaga ggcagccggg ggctcagggc gtgccaggtg ggcccgaggt gggcaggccg 3961 gctggtggta ccagctcatg cagagctccc aggtctacat cgatggctcc attgagggtt 4021 ccaggttccc tcgtggtagc agcaacagca gcagcgagaa aaagaaaggg gcaggaggtg 4081 ggggacctcc ccaggctcca ccaccccgag agggagtagt ggagggggct gaggcctgcc 4141 ctgcctctga ggaggccctg ggccgggaaa ggggctggcc cttctggatg gggagccccc 4201 ctgactctgt gctggccgag ctgaggcgca gtcgggagag ggaagggccc gctgcctcgc 4261 cagcagaaaa tgaggaaggg gcctcagagc cttcacctgg aggcatcaag tggggacacc 4321 tctttggctc ccgaaaagcc cagcgggagg cccggcccac aaataggtga gagcctgccc 4381 atggtaggga tggagggagt agggagcctg ctgtaagcca gggcacgggc agagcccatc 4441 ctgggcagag cctgagtcca gctgctgtcc ctaggctccc ctcggactgg ctgagcctgg 4501 acaagtccat gttccaacta gtggcgcaga cagtgggttc ccgccgggag ccagagccca 4561 aggagagcct gcaggagcca cactccccag ccctgccctc cagtcctccg tgtgaggtgc 4621 aggcactgtg ccaccacctg gccaccggcc ctggacagct gagcttccac aaaggagaca 4681 tcctacgagt gctggggcga gctggaggag actggctgcg ctgcagccgt ggccccgact 4741 ctggcctggt gcccctggcc tacgtgacat tgaccccaac tccaagtcca acccctggaa 4801 gcagccaaaa ctgaggccct gtgcatgctg gtggcctcag ggaccctcat aacccccaga 4861 ctcagagccc gagagccctt cccaagccat tggcttggct gcagagtaga ctgagagctg 4921 gggccacgta tccctgtgct ggcacctgct ccctgtgctc agtattaatt acgccccctt 4981 aactgtccca gtgaccttgt ccagacctcc acccaggaga gggatgggac acagcactgg 5041 gctgccagga ttcccctggc ccgtctgggc caacccttcc atgggtgaag acaagcaagt 5101 ccccctggag gcgggtggcc cagaaagcca tctacagggt tccctaggcc aggtggagat 5161 gaggatgggt aacagtattg gggccagatc cctaagcccc ccagctgtaa ataggctgtg 5221 gccagtgcct ggtcatcaga agagggagga ggagcccagg cgtctgttta tgtatttatt 5281 tatttattta ttatacctat taataaaaaa ggtgctcagc ctcc // LOCUS AB002378 5790 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0380 gene, complete cds. ACCESSION AB002378 NID g2224700 KEYWORDS KIAA0380. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HH0518. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5790) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..5790 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HH0518" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 746..5314 /gene="KIAA0380" CDS 746..5314 /gene="KIAA0380" /codon_start=1 /db_xref="PID:d1021676" /db_xref="PID:g2224701" /translation="MSVRLPQSIDRLSSLSSLGDSAPERKSPSHHRQPSDASETTGLV QRCVIIQKDQHGFGFTVSGDRIVLVQSVRPGGAAMKAGVKEGDRIIKVNGTMVTNSSH LEVVKLIKSGAYVALTLLGSSPSSMGISGLQQDPSPAGAPRITSVIPSPPPPPPLPPP QRITGPKPLQDPEVQKHATQILRNMLRQEEKELQDILPLYGDTSQRPSEGRLSLDSQE GDSGLDSGTERFPSLSESLMNRNSVLSDPGLDSPRTSPVIMARVAQHHRRQGSDAAVP STGDQGVDQSPKPLIIGPEEDYDPGYFNNESDIIFQDLEKLKSRPAHLGVFLRYIFSQ ADPSPLLFYLCAEVYQQASPKDSRSLGKDIWNIFLEKNAPLRVKIPEMLQAEIDSRLR NSEDARGVLCEAQEAAMPEIQEQIHDYRTKRTLGLGSLYGENDLLDLDGDPLRERQVA EKQLAALGDILSKYEEDRSAPMDFALNTYMSHAGIRLREARPSNTAEKAQSAPDKDKW LPFFPKTKKSSNSKKEKDALEDKKRNPILKYIGKPKSSSQSTFHIPLSPVEVKPGNVR NIIQHFENNQQYDAPEPGTQRLSTGSFPEDLLESDSSRSEIRLGRSESLKGREEMKRS RKAENVPRSRSDVDMDAAAEATRLHQSASSSTSSLSTRSLENPTPPFTPKMGRRSIES PSLGFCTDTLLPHLLEDDLGQLSDLEPEPDAQNWQHTVGKDVVAGLTQREIDRQEVIN ELFVTEASHLRTLRVLDLIFYQRMKKENLMPREELARLFPNLPELIEIHNSWCEAMKK LREEGPIIKEISDLMLARFDGPAREELQQVAAQFCSYQSIALELIKTKQRKESRFQLF MQEAESHPQCRRLQLRDLIISEMQRLTKYPLLLESIIKHTEGGTSEHEKLCRARDQCR EILKYVNEAVKQTENRHRLEGYQKRLDATALERASNPLAAEFKSLDLTTRKMIHEGPL TWRISKDKTLDLHVLLLEDLLVLLQKQDEKLLLKCHSKTAVGSSDSKQTFSPVLKLNA VLIRSVATDKRAFFIICTSKLGPPQIYELVALTSSDKNTWMELLEEAVRNATRHPGAA PMPVHPPPPGPREPAQQGPTPSRVELDDSDVFHGEPEPEELPGGTGSQQRVQGKHQVL LEDPEQEGSAEEEELGVLPCPSTSLDGENRGIRTRNPIHLAFPGPLFMEGLADSALED VENLRHLILWSLLPGHTMETQAAQEPEDDLTPTPSVISVTSHPWDPGSPGQAPPGGEG DNTQLAGLEGERPEQEDMGLCSLEHLPPRTRNSGIWESPELDRNLAEDASSTEAAGGY KVVRKAEVAGSKVVPALPESGQSEPGPPEVEGGTKATGNCFYVSMPSGPPDSSTDHSE APMSPPQPDSLPAGQTEPQPQLQGGNDDPRRPSRSPPSLALRDVGMIFHTIEQLTLKL NRLKDMELAHRELLKSLGGESSGGTTPVGSFHTEAARWTDGSLSPPAKEPLASDSRNS HELGPCPEDGSDAPLEDSTADAAASPGP" BASE COUNT 1419 a 1636 c 1515 g 1220 t ORIGIN 1 aattggctca tttaagaatt tcaaaacatt taatgtaaaa gctttttttt ttttaaggaa 61 gtccataaat tttggttccc agggttgcac tggacttgga aggagtgctg ttgtgtacat 121 actattgtat ggttttattt attattttac tgtacaaatc agccgaaaga atttttccaa 181 gtgccatttc ggatttatta atcctttttt ttttcctttc ctcaaagata tttgctgttg 241 tcatattaag cattggagac tagaaaatta ctttccccct ttgagctaga gggtctcttg 301 ccaacagaag gacagctgag aaagctggat ttaaaggatg gttttatctg tactttgcag 361 ttaacagtga tattttgaag gcacattttt ctgtgattca tttttttttg gccatagtgc 421 taaccttgaa gagattcgtg gctgggtttt tggtttctga gaaggtcgta gtttttcctc 481 ttttcctttt tttttttctt ttttcttttc ttttcttttt ttttaaagcg ggggagggga 541 agaggggctg agaaaggaaa tcatgttcac tggtagaagt agagtggagc atcagttacc 601 agggtcctga gagctggagg agaaaggatt ctatcttcaa gttgggaggc cctcctctca 661 ccttgctcaa aaattgcaag cgattcaatc ctgatcaaga caccaaagct acaggattct 721 ggaaccgtgg agacaccgag aaaccatgag tgtaaggtta ccccagagta tagacaggtt 781 aagtagcctg tcttctctgg gagattctgc accagagcgc aagtcccctt cccaccatcg 841 ccagccttcg gatgcctctg agacaacagg tctcgttcaa cgctgtgtca ttatccaaaa 901 ggaccagcat ggcttcggct tcacagtcag tggggatcgc attgttctgg tgcagtctgt 961 gcggcctgga ggtgcagcca tgaaggccgg tgtgaaagag ggcgaccgga tcatcaaagt 1021 caacggcacc atggtgacca atagctcaca cctggaagtg gtaaagctga tcaaatctgg 1081 cgcctatgtc gcactcaccc tcctgggctc ttcaccttca tccatgggca tctctgggct 1141 ccagcaggac ccatccccag caggagctcc ccgaatcacg tcagtgatcc cctcaccacc 1201 acctcctcca cctctaccac ctccacaacg catcacagga cccaaacctc tgcaggatcc 1261 cgaagttcaa aaacatgcca cccagatcct caggaatatg ctgaggcagg aagaaaaaga 1321 attacaggac atacttccac tatatggtga caccagccag agaccatcag aaggccggct 1381 ctctctggat tcccaggagg gggacagtgg cttggactct gggacagaac gctttccttc 1441 cctcagtgag tcattgatga atcggaactc ggtactgtca gaccctgggc tagacagtcc 1501 tcgaacctcc cctgtgatca tggccagggt ggcccagcac cacaggcggc agggctcgga 1561 tgcagcagtc ccctcaaccg gtgaccaggg tgtagatcaa agcccaaagc ctttaattat 1621 tggcccagag gaagactatg acccgggtta tttcaacaac gagagcgaca tcatattcca 1681 ggatctggag aaactgaagt ctcggccagc tcacctgggg gtttttctac gttacatctt 1741 ctctcaggcg gaccccagtc cactgctttt ttacctgtgt gcagaagttt atcagcaggc 1801 aagccccaag gattcccgaa gcttggggaa agacatctgg aatattttcc tggagaaaaa 1861 tgcgcctctg agagtgaaga tccctgagat gctacaggct gaaattgact cgcgcctgcg 1921 gaacagcgaa gatgcccgtg gtgttctctg tgaagctcaa gaggcagcca tgcctgagat 1981 ccaagagcag atccacgact acagaacgaa gcgcacactg gggctgggca gcctgtatgg 2041 tgaaaatgac ctgctggacc tggatgggga ccctctccga gagcgccaag tggctgagaa 2101 gcagctggct gcccttggag atattttgtc caagtatgag gaagacagga gcgcccccat 2161 ggacttcgcc ctcaatacct acatgagcca tgctgggatc cgtcttcgag aggcacgacc 2221 ttccaacaca gctgaaaagg cccagtctgc tcctgacaag gacaagtggc taccgttctt 2281 ccctaagacc aagaagagca gcaattccaa gaaagaaaag gatgccttgg aggacaagaa 2341 gcgaaaccct atcctcaaat acattgggaa gcccaaaagc tcttctcaaa gcacatttca 2401 tattcccttg tcccctgtgg aagtcaaacc aggcaatgtg aggaacatca ttcagcactt 2461 tgagaacaac cagcagtatg atgccccaga acctgggaca caacgactct cgaccggaag 2521 ctttcctgag gacctgctgg agagtgacag ttcacgctca gagattcgcc tgggccgctc 2581 tgaaagcctc aagggccggg aagagatgaa acggtctcga aaggcagaga acgtgccccg 2641 ctctcgcagt gatgttgaca tggatgctgc tgcggaggct actcgcctgc accagtcagc 2701 ctcgtcctct acctccagcc tctccaccag gtctcttgag aacccaaccc ctccattcac 2761 tcccaaaatg ggccgcagga gcattgagtc ccccagtttg gggttctgca cagataccct 2821 ccttccccac ctcctagagg atgatctggg ccagctgtct gacctggagc cagagccaga 2881 tgcccaaaat tggcagcata cagtgggcaa ggatgtggtg gctgggctaa cccagcggga 2941 gattgaccgg caagaggtca tcaatgagct gtttgtgact gaagcttccc acctgcgcac 3001 actccgggtc ctggacctga tcttctacca gcgaatgaag aaggagaacc tgatgccccg 3061 ggaggagctg gcccggctct tcccgaacct gcctgaactc atagagattc acaattcctg 3121 gtgtgaagcc atgaagaagc tccgggagga aggccccatc atcaaagaga tcagtgacct 3181 catgctggcc cggtttgatg gccctgcccg agaggaactc cagcaagtgg ctgcacagtt 3241 ctgttcctat cagtcaatag ccctagagct aatcaagacc aagcaacgca aggagagtcg 3301 attccagctc ttcatgcagg aggctgagag ccaccctcag tgtcggcggc tgcagctgag 3361 agacctcatc atctctgaga tgcagcggct caccaagtac ccgctgctgc tggagagcat 3421 catcaagcac acagagggtg gcacctctga gcatgagaag ctgtgccggg cccgggacca 3481 gtgccgggag attctcaagt atgtgaatga agcggtaaaa caaacagaga accgccaccg 3541 tttagagggc taccagaaac gcctggatgc caccgccctg gagagggcca gcaaccccct 3601 ggcagcagag ttcaagagcc tggatcttac aaccagaaaa atgatccatg agggacccct 3661 gacctggagg atcagcaagg ataagacctt ggacctccac gtgctgctgc tggaggacct 3721 cctagtgctg ctacagaaac aggatgagaa gctattgctg aagtgccaca gcaagactgc 3781 tgtgggctcc tcagacagca agcagacctt cagccccgtg ctcaagctca atgctgtgct 3841 catccgctct gtggccacag ataaacgggc cttcttcatc atctgcacct ccaagctggg 3901 cccaccccag atctatgagc tggttgcatt gacgtcatca gacaagaaca catggatgga 3961 gctcttagaa gaggccgtgc ggaatgccac caggcacccc ggagctgccc caatgcccgt 4021 ccatcctcca cccccaggtc cccgggagcc agcccagcag ggccccacac ccagcagggt 4081 agaactggat gactcagacg tgttccatgg tgaacctgaa cctgaggagc tgcctggagg 4141 cactgggtcc cagcagaggg tccaagggaa gcaccaggtc ctgctagagg accctgagca 4201 ggagggcagt gcagaggaag aggaactggg tgtcctgcct tgcccttcca catccctgga 4261 tggagagaac aggggcatca ggacaaggaa ccccatccac ttggccttcc caggccctct 4321 gttcatggaa gggctcgctg actccgctct ggaagatgtg gagaacctgc gacatctgat 4381 cctgtggagc ctgctgccag gtcacaccat ggaaactcag gctgcccagg agcccgagga 4441 cgacctgaca cccacacctt ctgtcatcag cgtcacctct cacccctggg acccaggctc 4501 cccagggcaa gcaccccctg ggggtgaagg ggacaacacc cagcttgcag ggctggaggg 4561 ggaacggcca gagcaggaag acatgggtct ctgttctctg gaacacctac ccccaaggac 4621 cagaaattct gggatatggg agtctccaga actggacagg aatctggctg aagatgcttc 4681 aagcacagag gcagcaggag gttacaaagt tgtgagaaaa gctgaggtgg caggcagcaa 4741 ggttgtccct gcactaccag agagtggcca gtcagagcct gggccacctg aagtggaagg 4801 cggaacaaag gctacgggga actgctttta tgtcagcatg ccatcaggac ccccggactc 4861 aagcaccgac cactcagagg cacccatgag cccccctcag cctgacagcc tccctgcagg 4921 gcagacagag cctcagcctc agctgcaggg aggcaacgat gatccaagac gccccagccg 4981 ctctcctcca agcctggccc tcagggacgt gggcatgatc ttccatacca ttgagcagct 5041 cactctcaag ctcaacaggc tcaaggatat ggagctggcc cacagagagc tgctcaagtc 5101 ccttggggga gagtcatctg gtggcaccac gcctgtgggc agtttccaca cagaagcagc 5161 tagatggaca gatggctccc tctcacctcc cgctaaggag cccctagctt ctgactccag 5221 gaacagccat gaactggggc cctgccctga ggatggctct gacgcccccc tggaagacag 5281 cacagcagac gcagccgcgt caccaggacc ataaccgtac aaaccaccaa atcctctgcg 5341 tccccactcc tccttcaggg actggcctga gaccggggca cagggtaggg gggatcccaa 5401 cactcctccc tgtggaggag gcagttaggg aaactaggat ccagccaagg cccgggggga 5461 gacccgcatg ttgcttggtc tgctcaagtc ggagtcaggt ttcagtgtct tttccctccc 5521 ttagcccaac cctccaaggc ctcatgtctc ctaagcatgc tgactgcatc cgaaaggccc 5581 ccactcacca tggtctgccc tcaccccaca tatgtgtgta cacgcgcacg cctgtatgtg 5641 cgctgccctc agacatgcaa gtgaaaggag gaggcttctg tgtaaatgca ctttcttcct 5701 cccctctttc tccataagac cccaggcaga ggtgggtgcc tcccctcccc tctttgtcac 5761 tttggtttcc tataaatatg tatgtatcgt // LOCUS AB002382 5423 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0384 gene, complete cds. ACCESSION AB002382 NID g2224708 KEYWORDS KIAA0384. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HH0733. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5423) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..5423 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HH0733" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 466..3285 /gene="KIAA0384" CDS 466..3285 /gene="KIAA0384" /codon_start=1 /db_xref="PID:d1021680" /db_xref="PID:g2224709" /translation="MDDSEVESTASILASVKEQEAQFEKLTRALEEERRHVSAQLERV RVSPQDANPLMANGTLTRRHQNGRFVGDADLERQKFSDLKLNGPQDHSHLLYSTIPRM QEPGQIVETYTEEDPEGAMSVVSVETSDDGTTRRTETTVKKVVKTVTTRTVQPVAMGP DGLPVDASSVSNNYIQTLGRDFRKNGNGGPGPYVGQAGTATLPRNFHYPPDGYSRHYE DGYPGGSDNYGSLSRVTRIEERYRPSMEGYRAPSRQDVYGPQPQVRVGGSSVDLHRFH PEPYGLEDDQRSMGYDDLDYGMMSDYGTARRTGTPSDPRRRLMSYEDMIGEEVPSDQY YWAPLAQHERGSLASLDSLRKGGPPPPNWRQPELPEVIAMLGFRLDAVKSNAAAYLQH LCYRNDKVKTDVRKLKGIPVLVGLLDHPKKEVHLGACGALKNISFGRDQDNKIAIKNC DGVPALVRLLRKARDMDLTEVITGTLWNLSSHDSIKMEIVDHALHALTDEVIIPHSGW EREPNEDCKPRHIEWESVLTNTAGCLRNVSSERSEARRKLRECDGLVDALIFIVQAEI GQKDSDSKLVENCVCLLRNLSYQVHREIPQAERYQEAAPNVANNTGPHAASCFGAKKG KDEWFSRGKKPIEDPANDTVDFPKRTSPARGYELLFQPEVVRIYISLLKESKTPAILE ASAGAIQNLCAGRWTYGRYIRSALRQEKALSAIADLLTNEHERVVKAASGALRNLAVD ARNKELIGKHAIPNLVKNLPGGQQNSSWNFSEDTVISILNTINEVIAENLEAAKKLRE TQGIEKLVLINKSGNRSEKEVRAAALVLQTIWGYKELRKPLEKEGWKKSDFQVNLNNA SRSQSSHSYDDSTLPLIDRNQKSDKKPDREEIQMSNMGSNTKSLDNNYSTPNERGDHN RTLDRSGDLGDMEPLKGTTPLMQKI" BASE COUNT 1334 a 1342 c 1288 g 1459 t ORIGIN 1 agatggcggt cttggcacct ctaattgctc tcgtgtattc ggtgccgcga ctttcacgat 61 ggctcgccca accttactac cttctgtcgg ccctgctctc tgctgccttc ctactcgtga 121 ggaaactgcc gccgctctgc cacggtctgc ccacccaacg cgaagacggt aacccgtgtg 181 actttgactg gagagaagtg gagatcctga tgtttctcag tgccattgtg atgatgaaga 241 accgcagatc cactctctcc ttcctgcttc ctccttgctg tggtggctgg gatgcttctt 301 ccatgatttt ttgaatctag actgggctgt tctctgtgtt aaaccaatca gttgcgacct 361 tctcttaaca gtgtgaagtg agggggtctc tctccctcct tctccttcct ctgtgattca 421 ccttcctttt taccctgccc tgcggcggct ccgcccctta ccttcatgga cgactcagag 481 gtggagtcga ccgccagcat cttggcctct gtgaaggaac aagaggccca gtttgagaag 541 ctgacccggg cgctggagga ggaacggcgc cacgtctcgg cgcagctgga acgcgtccgg 601 gtctcaccac aagatgccaa cccactcatg gccaacggca cactcacccg ccggcatcag 661 aacggccggt ttgtgggcga tgctgacctt gaaagacaga aattttcaga tttgaaactc 721 aacggacccc aggatcacag tcaccttcta tatagcacca tccccaggat gcaggagccg 781 gggcagattg tggagaccta cacggaggag gatcctgagg gagccatgtc tgtagtctct 841 gtggagacct cagatgatgg gaccactcgg cgcacagaga ccacggtcaa gaaagtagtg 901 aagactgtga caacacggac agtacagcca gtcgctatgg gaccagacgg gttgcctgtg 961 gatgcttcat cagtttctaa caactatatc cagactttgg gtcgtgattt ccgcaagaat 1021 ggcaatgggg gacctggtcc ctatgtgggg caagctggca ctgctaccct tcctaggaac 1081 ttccactacc ctcctgatgg ttatagtcgc cactatgaag atggttatcc aggtggcagt 1141 gataactatg gcagtctgtc ccgggtgacc cgcattgagg agcggtatag gcccagcatg 1201 gaaggctacc gggcacctag tagacaggat gtgtatgggc cccaacccca ggttcgggta 1261 ggtgggagca gcgtggatct gcatcgcttt catccagagc cttatgggct agaggatgac 1321 cagcgtagta tgggctatga tgacctggat tatggtatga tgtctgatta tggcactgcc 1381 cgtcggactg ggacaccctc tgaccctcgt cggcgcctca tgagctatga agacatgatt 1441 ggtgaggagg tgccatcgga tcaatactac tgggctcctt tggcccagca tgagcgagga 1501 agtttagcaa gcttggatag cctgcgcaaa ggagggcctc cacctcctaa ttggagacag 1561 ccagagctgc cagaggtgat cgccatgctt ggattccgct tggatgctgt caagtccaat 1621 gcagctgcat acctgcaaca cttatgctac cgcaatgaca aggtgaagac tgacgtgcgg 1681 aagctcaagg gcatcccagt actggtggga ttgttagacc atcccaaaaa ggaagtgcac 1741 cttggagcct gtggagctct caagaatatc tcttttggac gtgaccagga taacaagatt 1801 gccataaaaa actgtgatgg tgtgcctgcc cttgtgcgat tgcttcgaaa ggctcgtgat 1861 atggacctta ctgaagttat taccggaacc ctgtggaatc tttcatccca tgactcaatc 1921 aaaatggaga ttgtggacca tgcactgcat gccttgacag atgaagtgat cattcctcat 1981 tctggttggg agcgggaacc taatgaagac tgtaagccac gccatattga gtgggaatcg 2041 gtgctcacca acacagctgg ctgccttagg aatgtaagct cagagaggag tgaagctcgc 2101 cggaaacttc gggaatgtga tggtttagtt gatgccctca ttttcattgt tcaggctgag 2161 attgggcaga aggattcaga cagcaagctt gtagagaact gtgtttgcct tcttcggaac 2221 ttatcatatc aagttcaccg ggagatccca caggcagagc gttaccaaga ggcagctccc 2281 aatgttgcca acaatactgg gccacatgct gccagttgct ttggggccaa gaagggcaaa 2341 gatgagtggt tctccagagg gaaaaaacct atagaggatc cagcaaacga tacagtggat 2401 ttccctaaaa gaacgagtcc agctcgaggc tatgagctct tatttcagcc agaggtggtt 2461 cggatataca tctcacttct taaggagagc aagactcctg ccatcctaga agcctcagct 2521 ggagctatcc agaacttgtg tgctgggcgc tggacgtatg gtcgatacat ccgctctgct 2581 ctgcgtcaag agaaggctct ttctgccata gctgacctcc tgactaatga acatgaacgg 2641 gtggtgaaag ctgcatctgg agcactgaga aacctggctg tggatgctcg caacaaagaa 2701 ttaattggta aacatgctat tcctaacttg gtaaagaatc tgccaggagg acagcagaac 2761 tcctcttgga atttctctga ggacactgtc atctctattt tgaacactat caacgaggtt 2821 atcgctgaga acttggaggc tgccaaaaag cttcgagaga cacagggtat tgagaagctg 2881 gtgttgatca acaaatcagg gaaccgctca gaaaaagaag ttcgagcagc agcacttgta 2941 ttacagacaa tctggggata taaggaactg cggaagccac tggaaaaaga aggatggaag 3001 aaatcagact ttcaggtgaa tctaaacaat gcttcccgaa gccagagcag tcattcatat 3061 gatgatagta ctctccctct cattgaccgg aaccaaaaat cagataagaa acctgatcgg 3121 gaagaaattc agatgagcaa tatgggatca aacacaaaat cactagataa caactattcc 3181 acaccaaatg agagaggaga ccacaataga acactggatc gatcggggga tctaggcgac 3241 atggagccat tgaagggaac aacacccttg atgcagaaga tttagcacca ctatctccgt 3301 tccatctggg cttatatgta cttttatttt ttggtggtga aattgactga tgattttcct 3361 ttttcttcgc tggactattg tgccaactgc caggctgcct cctgccctta cagccctaag 3421 tggctgcctt ctttccatca actcccaact tcttcctgtg aagtttaatt gtctcaacgc 3481 ctccccctcc cccattccct ccatttttct cccaagaaac ctgactcaat tatttgcata 3541 ttttgagaaa ctgctgcaga ttagttcttt ttgccagttt tccctggaac tcctggcctt 3601 ttgtggaggg gagggatgga gagaatagga atcttcacta gaagccgtgg gaagaattgg 3661 aagttacatg ctgtatatgc aatgtccagc agtctgataa actgacgatt cttaatcaag 3721 atttttttcc tgatggggaa gggactttta ttttctttta gagaggggaa agtgtgagct 3781 cttcccttat tcctaatggc tatttttgaa gcaaagaagg ccagcaacat tggcacatgc 3841 cacctggcaa aggacccttg agtaagtgaa ggtctcctaa aactgggatt aagaaacctt 3901 gctctcctca tctccaaggc agggaccatc aagaacctac agactccatc tcttctgcaa 3961 gcctcatgcc aaccctgggc tattgctgct gccccttaaa cacaggctgt ccttaaccca 4021 cctctcctgc cctgtgatat gtctgctgag ttggcctggc catttccaag aggctgtaga 4081 aaggggagaa tgtcaaggaa gacttttggt agagaaggag cagaaagatg tgtttttggg 4141 aagaagaaga cctctaggag gagctagtag gaatgtacat gaagcaatta gtctgaaact 4201 ggcttcccca ctcccccgtt tctccttttc ctatccttat aggcctgtcc cttgcctctg 4261 ccctggattg gttggcaaac taaaggactt gatgtacata actcctgtcc cttttccctt 4321 acaaggtggg gattgcccct ggctttgcct cttctttgtg cctttggcct ggggtgcatc 4381 tcctcccgcc cttccatgtg cctttctttg cctctgcagt ctcatttctc ataattttgc 4441 aaattatatt ttgttgcttt cttacctact attggcccta aatagcagaa agaagagaag 4501 tgaccgagag aacctcagat tcttcattga ggattggtat agccatgatt tcagtcatag 4561 caagcttttg ctcaacagca tatgggtggg attttgcaaa aatcctattc tgatgaatct 4621 caaagtaagg ctggtaagag aagtgagtgg tgtgactctt actccttagg tgcccagaat 4681 ttaccatcat ctctgaagga gttacaggga agtggtctcc ccaattctcc cctccctcca 4741 gtattgcccc ctctcacttt agcatatatt aattagcagg ttgggctaga gaaatcagct 4801 gctatgcggg ttgattatta ttattatttc taatcctttt ccttatttgc cttctactcc 4861 ccttaatcta atctaaaagc tctgttccat gcaactggag ttccttatcc ctctcttccc 4921 cttcccttat atattgaggc tatggggtag gagaaaagtg cacaacccac cacccccttt 4981 actcgtgcat taaaatttct tatttaccct tttccccctt cccatttctt cccactttca 5041 tctacctttt ctggcaaaaa ggagcctttt gctctctgtg accctaagag cacactgcac 5101 agggaaaatt gccccatcca gacctggctc cactcttgat ctctcttgtc ctcttctgct 5161 cttttcctgg tgctcttttt tctcggtggg gtgtgggtaa tagaacagcc gtgggctttt 5221 ggggaccttt aacttttttt tctctctttt gtttataaaa aacactaaac attcaattcc 5281 agagaaccaa aaatcccacc ttcccaccga acactactaa ggggcttgtg ttctgctcca 5341 taccttttct cttttctttc tgtcttgtta atgcttttaa aaacaaatga gttttttata 5401 taaataaagt ttttaaagtg tgt // LOCUS AB002383 5492 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0385 gene, complete cds. ACCESSION AB002383 NID g2224710 KEYWORDS KIAA0385. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HH0802. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5492) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..5492 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HH0802" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 88..4200 /gene="KIAA0385" CDS 88..4200 /gene="KIAA0385" /codon_start=1 /db_xref="PID:d1021681" /db_xref="PID:g2224711" /translation="MDPSDFPSPFDPLTLPEKPLAGDLPVDMEFGEDLLESQTAPTRG WAPPGPSPSSGALDLLDTPAGLEKDPGVLDGATELLGLGGLLYKAPSPPEVDHGPEGT LAWDAGDQTLEPGPGGQTPEVVPPDPGAGANSCSPEGLLEPLAPDSPITLQSPHIEEE ETTSIATARRGSPGQEEELPQGQPQSPNAPPSPSVGETLGDGINSSQTKPGGSSPPAH PSLPGDGLTAKASEKPPERKRSERVRRAEPPKPEVVDSTESIPVSDEDSDAMVDDPND EDFVPFRPRRSPRMSLRSSVSQRAGRSAVGTKMTCAHCRTPLQKGQTAYQRKGLPQLF CSSSCLTTFSKKPSGKKTCTFCKKEIWNTKDSVVAQTGSGGSFHEFCTSVCLSLYEAQ QQRPIPQSGDPADATRCSICQKTGEVLHEVSNGSVVHRLCSDSCFSKFRANKGLKTNC CDQCGAYIYTKTGSPGPELLFHEGQQKRFCNTTCLGAYKKKNTRVYPCVWCKTLCKNF EMLSHVDRNGKTSLFCSLCCTTSYKVKQAGLTGPPRPCSFCRRSLSDPCYYNKVDRTV YQFCSPSCWTKFQRTSPEGGIHLSCHYCHSLFSGKPEVLDWQDQVFQFCCRDCCEDFK RLRGVVSQCEHCRQEKLLHEKLRFSGVEKSFCSEGCVLLYKQDFTKKLGLCCITCTYC SQTCQRGVTEQLDGSTWDFCSEDCKSKYLLWYCKAARCHACKRQGKLLETIHWRGQIR HFCNQQCLLRFYSQQNQPNLDTQSGPESLLNSQSPESKPQTPSQTKVENSNTVRTPEE NGNLGKIPVKTRSAPTAPTPPPPPPPATPRKNKAAMCKPLMQNRGVSCKVEMKSKGSQ TEEWKPQVIVLPIPVPIFVPVPMHLYCQKVPVPFSMPIPVPVPMFLPTTLESTDKIVE TIEELKVKIPSNPLEADILAMAEMIAEAEELDKASSDLCDLVSNQSAEGLLEDCDLFG PARDDVLAMAVKMANVLDEPGQDLEADFPKNPLDINPSVDFLFDCGLVGPEDVSTEQD LPRTMRKGQKRLVLSESCSRDSMSSQPSCTGLNYSYGVNAWKCWVQSKYANGETSKGD ELRFGPKPMRIKEDILACSAAELNYGLAQFVREITRPNGERYEPDSIYYLCLGIQQYL LENNRMVNIFTDLYYLTFVQELNKSLSTWQPTLLPNNTVFSRVEEEHLWECKQLGVYS PFVLLNTLMFFNTKFFGLQTAEEHMQLSFTNVVRQSRKCTTPRGTTKVVSIRYYAPVR QRKGRDTGPGKRKREDEAPILEQRENRMNPLRCPVKFYEFYLSKCPESLRTRNDVFYL QPERSCIAESPLWYSVIPMDRSMLESMLNRILAVREIYEELGRPGEEDLD" BASE COUNT 1238 a 1578 c 1417 g 1259 t ORIGIN 1 agaaggagcg cggggccgtc gctgtctgca gttctaggct tgtagccttt gcactaaccc 61 tgccgcagta catatccagc catactcatg gaccccagtg atttccccag tccatttgac 121 ccattgaccc tgccagagaa gcccctggct ggagacctac cagtagacat ggaatttgga 181 gaggatctac tggaatccca gactgcccca actcgaggat gggccccccc tggcccttct 241 ccatcctcgg gagccctgga cctgcttgat acccctgctg gcctggaaaa agaccctgga 301 gtcctggatg gagccactga gttgctgggg ctgggggggc tgctctataa agccccctct 361 cccccggagg tggaccacgg tcctgaggga accctggcat gggatgcagg agatcagacc 421 ctagagcctg gaccaggggg ccagacccct gaggtggtac cacctgatcc aggggctggg 481 gcaaattcct gttcacctga ggggctacta gagcctttgg ctccagattc tccaataaca 541 ctgcagtccc cacatattga agaggaggag accacctcca tagctactgc aagaaggggc 601 tcccctgggc aggaggagga gcttccccaa gggcagccac agagcccaaa tgccccgcct 661 agcccttcag tgggagagac tctgggggat ggaatcaaca gttctcagac caaacctggg 721 ggctctagcc cccctgcaca tccttccttg ccaggagatg gcctgactgc gaaggcgagt 781 gagaagccgc ctgaacggaa gagaagcgag cgcgttagga gagcagaacc tccaaaacct 841 gaggttgtag attccactga gagcattcca gtgtcagatg aggattctga tgccatggta 901 gatgacccca atgatgagga ctttgtgcca ttccggcccc ggcgctctcc tcgcatgtcc 961 ctacgctcaa gtgtgtcaca aagggccggg cgctctgcag tgggcaccaa gatgacttgt 1021 gcacattgcc ggacaccact gcagaagggg cagactgcct atcagcgcaa ggggctgcct 1081 cagctcttct gctcgtcatc ctgcctcacc actttctcca agaagccctc gggcaaaaag 1141 acctgtacct tctgcaagaa ggagatctgg aacaccaagg actcggttgt ggcgcagact 1201 ggttctggag gctccttcca tgagttctgc acatccgtct gtctctccct gtatgaggcc 1261 cagcagcagc gcccgatccc ccagtctggg gatcccgccg acgctactcg ctgcagcata 1321 tgccagaaga ctggagaggt cctgcacgag gtcagcaatg gcagcgtggt acaccggctc 1381 tgcagcgatt cttgcttctc caaattccgg gccaacaagg gactgaaaac caactgttgt 1441 gaccagtgtg gggcttacat ctacaccaag accgggagtc ctggccctga gctcctcttc 1501 cacgagggcc aacaaaagcg gttctgcaac acaacctgct tgggggcgta caagaagaaa 1561 aacacacgtg tgtacccatg tgtctggtgc aagaccctgt gtaagaactt tgagatgcta 1621 tcacatgtgg atcgtaatgg caagaccagc ttgttctgtt ccctgtgctg taccacttct 1681 tacaaagtga agcaggcagg gctcactggc cctccccgac cctgcagctt ctgccgccgc 1741 agcctctctg acccctgtta ctacaacaag gttgaccgca cagtctacca gttctgcagc 1801 cccagctgct ggaccaagtt ccagcgcaca agccctgagg ggggcattca cctgagctgt 1861 cactactgtc acagcctctt cagtggcaag cctgaggtct tggactggca ggaccaagtg 1921 ttccagttct gctgccgtga ttgctgtgag gacttcaagc ggcttcgggg tgtggtgtcc 1981 cagtgtgagc actgtcggca ggagaaactc ttgcatgaga aactccgatt cagcggagtg 2041 gagaaaagct tctgcagcga aggctgtgtg ctgctgtaca aacaggactt cactaagaag 2101 ctgggcttgt gctgtatcac ttgtacttac tgctcccaga cctgccagcg cggagtcacc 2161 gagcaactgg atggcagcac ctgggacttc tgcagtgagg actgtaagag caagtacctg 2221 ctgtggtact gcaaggctgc ccggtgccat gcgtgtaagc gccaggggaa gctgctggag 2281 accatccact ggcgtgggca gatccgtcat ttctgcaacc agcagtgtct tctgcgtttc 2341 tatagccagc agaaccaacc caacctggat acccagagtg ggcccgagag cctcctgaac 2401 agtcagtctc ctgagtcaaa accccagaca ccctctcaaa ccaaagtgga gaacagcaac 2461 acagtgagga ccccagagga aaatgggaat ttgggcaaga tccctgtgaa gacccgatca 2521 gctcccactg ctcccacccc tccaccccca ccacccccag caacaccccg caaaaacaag 2581 gctgccatgt gtaagccact gatgcagaat cggggcgtct cctgcaaggt ggagatgaag 2641 tccaaaggaa gtcaaacaga agagtggaag ccacaggtga tcgtgctgcc catcccagtg 2701 cccatcttcg tgccagtgcc tatgcatctg tactgccaga aagtcccggt gcctttctcg 2761 atgcctatcc cggtgcctgt gcccatgttc ttgcccacta ccttggagag cacagacaag 2821 attgtagaga ccattgagga gctgaaggtg aagatccctt ccaacccctt ggaggccgac 2881 atcctggcta tggcagaaat gattgcagag gctgaggagt tagacaaggc ctcatctgac 2941 ctttgtgatc ttgtgagcaa ccagagtgca gagggactcc tggaagactg tgacctgttt 3001 gggcctgctc gagatgatgt cctggccatg gcagtcaaga tggccaatgt cttggatgag 3061 cctgggcaag acttggaggc agacttccct aagaatcctc tggacattaa tcccagtgta 3121 gacttcctct ttgattgtgg cctggtaggg cctgaggatg tgtctactga acaagacctt 3181 ccccgaacca tgaggaaggg tcaaaagcgg ctggtgcttt ccgaaagctg ctcccgggac 3241 tccatgagca gtcagcctag ttgtaccggg ctcaactatt catatggtgt caatgcttgg 3301 aagtgctggg tgcagtcaaa atatgccaat ggagaaacca gcaagggtga tgagctgcgc 3361 tttggcccca aacccatgcg tatcaaagag gatattctcg cctgctcagc tgctgagctc 3421 aactacggtc tggcccagtt tgtgagagaa atcactcggc ccaatggtga acgatatgaa 3481 cctgacagta tctactattt gtgtcttggc atccagcagt acttgctgga aaataaccgg 3541 atggtgaaca ttttcacgga cctttactac ctgacttttg ttcaagaact caacaagtct 3601 ctgagtacct ggcagcccac actcctcccc aacaatacgg tgttctctcg agtggaggag 3661 gagcacctct gggagtgtaa gcaactgggg gtctactcgc cctttgtcct cctcaacacc 3721 ctcatgttct tcaacactaa gttttttggg ctgcagacag ctgaggaaca catgcaactc 3781 tccttcacca atgtggtgcg gcagtcccgc aagtgtacca cccctcgggg caccaccaag 3841 gtggtgagca tccgctacta tgccccagtc cgccagagga aagggcgaga cacgggtcct 3901 ggaaaacgga agagagaaga tgaagcccct atcttagagc agcgtgagaa ccgcatgaat 3961 cccctccgct gccctgtcaa gttctatgaa ttctatctct caaaatgtcc tgaaagcctc 4021 cggactcgca acgatgtgtt ctacctgcaa cctgaacggt cctgcatcgc cgagtcacct 4081 ctctggtatt ctgtgatccc catggaccgc agcatgttgg agagcatgct caatcgcatc 4141 ctggctgtgc gcgagattta tgaggaactg ggtcgtcctg gggaggaaga cctggactga 4201 gctcgtgtgc catccatatc catctttcac atcaatgtct gtcctgtggc catgtccctc 4261 agggtgacag gcccaggaac caatgctact cattctgaag ggccctgact gctcctttcc 4321 gctcacccat tccctgcctt ctctaggaac cctggctttt atcttcttcc gtaccacttg 4381 acaaccatgg ggccctggtc ttctgtactc aggggctggt ctcccagtga tgggcaaaag 4441 ccagcttgcc cgttttcttt atgcttcaga gtaaacccct ccttctgggt ccagactctg 4501 ggtggagtgt taatagctct ggtgatcctg ttggctttgg gtttcctgac ccatcccgca 4561 taggtagagc ctcttgttcc taggcatgac ctagggaaaa acccagctgc cttctctgcc 4621 ctgtgcccac tcccttctct actcttcccc agcaccatgc caaaaggtct tatctgaaag 4681 gtaagaaata aacaatgaaa gcgatgaggg gaccatttac ataaaacaca gagcttagac 4741 actcttcccc tcctatgaaa taattggttg ttggcaccat ctcaccaccg catatccctc 4801 accccctcgg caagcaccaa tccttggtgc tgccgttttt aaaatcttcc aaatgccttt 4861 ttttcctcag aggcagagaa tgactaagta cggggagcag actcctgttg tgcagactcc 4921 tgtccccttg gtttctgtgt ttgtctctct gccatcttag gttgccatga gccatggtgt 4981 caacatgctt agccccctct gtaactgcct ccctttagtt caatggacag acctcccaag 5041 gcaaaaacta ccttctgact tgggttgagg ctgggttccc ctctattgtt cccctatcat 5101 aagagctagg ccaagcctat gggaccttga gtcatgcagg atgggatctg tggtcaaagg 5161 acaggcgagg agctgtgggc gcagggcctg ccgccactgc ctacatcttc tctcttcccc 5221 atcttgcatt ggaggtccca gaaaacaatt agcttctggc aaagggggta cccacttctt 5281 tccctgttga ctttgctgtt tcccaggctc ctttttgtgt ttttataact gtcaccagtt 5341 agccactgtt taaattgtat atattgttct gaggcgcctg gcctgtccct tcagtgagcc 5401 atgcccaccc ttgtgttgta gtgagaagct gttgtcacga ctaaccttct gtctctgaaa 5461 ttgtttgttt caaataaaga gttaaaattg tc // LOCUS AB002384 5471 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0386 gene, complete cds. ACCESSION AB002384 NID g2224712 KEYWORDS KIAA0386. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HJ0015. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5471) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..5471 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HJ0015" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 178..3384 /gene="KIAA0386" CDS 178..3384 /gene="KIAA0386" /codon_start=1 /db_xref="PID:d1021682" /db_xref="PID:g2224713" /translation="MLVGSQSFSPGGPNGIIRSQSFAGFSGLQERRSRCNSFIENSSA LKKPQAKLKKMHNLGHKNNNPPKEPQPKRVEEVYRALKNGLDEYLEVHQTELDKLTAQ LKDMKRNSRLGVLYDLDKQIKTIERYMRRLEFHISKVDELYEAYCIQRRLQDGASKMK QAFATSPASKAARESLTEINRSFKEYTENMCTIEVELENLLGEFSIKMKGLAGFARLC PGDQYEIFMKYGRQRWKLKGKIEVNGKQSWDGEETVFLPLIVGFISIKVTELKGLATH ILVGSVTCETKELFAARPQVVAVDINDLGTIKLNLEITWYPFDMEDMTASSGAGNKAA ALQRRMSMYSQGTPETPTFKDHSFFRWLHPSPDKPRRLSVLSALQDTFFAKLHRSRSF SDLPSLRPSPKAVLELYSNLPDDIFENGKAAEEKMPLSLSFSDLPNGDCALTSHSTGS PSNSTNPEITITPAEFNLSSLASQNEGMDDTSSASSRNSLGEGQEPKSHLKEEDPEEP RKPASAPSEACRRQSSGAGAEHLFLENDVAEALLQESEEASELKPVELDTSEGNITKQ LVKRLTSAEVPMATDRLLSEGSVGGESEGCRSFLDGSLEDAFNGLLLALEPHKEQYKE FQDLNQEVMNLDDILKCKPAVSRSRSSSLSLTVESALESFDFLNTSDFDEEEDGDEVC NVGGGADSVFSDTETEKHSYRSVHPEARGHLSEALTEDTGVGTSVAGSPLPLTTGNES LDITIVRHLQYCTQLVQQIVFSSKTPFVARSLLEKLSRQIQVMEKLAAVSDENIGNIS SVVEAIPEFHKKLSLLSFWTKCCSPVGVYHSPADRVMKQLEASFARTVNKEYPGLADP VFRTLVSQILDQAEPLLSSSLSSEVVTVFQYYSYFTSHGVSDLESYLSQLARQVSMVQ TLQSLRDEKLLQTMSDLAPSNLLAQQEVLRTLALLLTREDNEVSEAVTLYLAAASKNQ HFREKALLYYCEALTKTNLQLQKAACLALKILEATESIKMLVTLCQSDTEEIRNVASE TLLSLGEDGRLAYEQLDKFPRDCVKVGGRHGTEVATAF" BASE COUNT 1581 a 1219 c 1266 g 1405 t ORIGIN 1 ccggggtccc ctttcaccca agtgagccag ctcaggctgt ctgcaaagcc ggaggtgcgg 61 gcagctccgg gcattgcatt ggtgcggagg cttttatatg ccagagaacc ctgagtgtgc 121 ccctgacatg acaaggagcc gtcgtcgtta ggactaccga ccagactccc ggaaatcatg 181 ttggtaggat cccagtcttt ttcgcctgga gggcccaatg ggatcattag aagccagtcc 241 tttgcgggtt tcagcggcct ccaggaaagg cgatccaggt gtaactcctt cattgaaaat 301 tcctccgctc tcaagaagcc tcaggccaaa ctgaagaaaa tgcacaattt aggccacaaa 361 aacaacaatc cccccaaaga gcctcagcct aaaagggtgg aagaagtcta cagggccttg 421 aaaaatggac ttgatgaata tctggaggtt caccagacgg agctggacaa gttgacagct 481 cagttaaaag atatgaaaag aaactctcgc ctgggtgtac tgtatgacct agacaagcaa 541 attaaaacaa ttgaaagata catgagacgc ctggagtttc atataagtaa ggtagatgaa 601 ctctatgaag cttattgtat ccagcgacgc ctccaggatg gtgccagcaa aatgaagcaa 661 gccttcgcaa catcccctgc cagcaaagct gcccgggaga gtctgacaga gatcaatcgg 721 agcttcaagg agtacacaga gaatatgtgc accattgaag tggagctaga gaatctgctg 781 ggagaattct ccatcaagat gaaaggtctg gctggctttg cacgcctctg tcctggagat 841 caatatgaaa ttttcatgaa gtatggccgg cagcggtgga aactgaaagg caaaatagaa 901 gtaaatggca agcagagctg ggatggagaa gaaacagttt ttctgcccct gatagttggg 961 ttcatctcca tcaaggtcac ggagctcaaa gggctagcaa ctcacatcct ggtaggtagc 1021 gtgacctgtg agaccaaaga gctgtttgca gcccgacctc aggtagtggc tgtcgacatc 1081 aatgaccttg gtaccatcaa actgaacctg gaaatcacct ggtatccatt tgacatggag 1141 gacatgaccg catcctcagg cgctgggaac aaggcagcag cccttcagag gagaatgtcc 1201 atgtacagcc agggtacccc ggaaacgccc accttcaaag accactcctt ctttaggtgg 1261 ctgcatcctt ccccagacaa gcccaggcgg ctgtctgtct tgagtgcctt gcaagacact 1321 ttctttgcca agctgcaccg cagccgctcc ttcagtgacc tgccctccct caggccgagt 1381 cccaaggccg tgctagagct ctattcaaat ctacctgatg acatctttga aaatggaaag 1441 gcagccgagg agaaaatgcc actgtcgctc agcttcagtg acctgcccaa cggggactgc 1501 gccctcacct cccactcaac aggctcccct tccaactcaa caaatccaga aattaccatc 1561 acccctgcgg agtttaacct cagcagcttg gcctcccaga atgagggtat ggatgacacc 1621 agctcagcat cttccaggaa ctccctggga gaaggccaag agccaaagtc acacctgaag 1681 gaggaagacc cagaggagcc cagaaaacct gcctcggccc catctgaggc ttgccgccga 1741 cagtcctcag gtgctggggc tgagcacctg ttccttgaga atgatgttgc agaagcactt 1801 ctgcaagagt ctgaggaggc ctctgagctc aagcctgtgg aactggacac ttcggaagga 1861 aacatcacaa agcagctggt caagaggctc acatctgcag aggtgccaat ggccacagac 1921 aggctgctct ctgagggttc tgttggtgga gaatctgaag gctgcagatc ctttctagat 1981 ggaagcttag aggatgcttt taatgggctt ttacttgcat tagaaccaca taaagagcag 2041 tataaagagt ttcaggatct gaaccaagaa gtcatgaatt tggatgatat tctaaaatgc 2101 aagccagcag taagccgcag caggtcttcc agtttaagtc tcacagttga aagtgcttta 2161 gaaagctttg atttcctgaa cacctctgat tttgacgagg aggaggatgg tgatgaggtt 2221 tgtaatgttg gcggaggtgc tgactcagta ttttcagaca ctgagactga gaaacacagt 2281 tacaggtcgg ttcacccaga agccaggggg catctcagtg aagcgctcac tgaagacaca 2341 ggagttggga ccagtgtggc aggaagtcct ctcccactga ccacaggcaa cgagagcctg 2401 gacatcacca tcgtcaggca cctccagtac tgcacccaac tcgtgcagca aattgttttc 2461 tcaagcaaaa ccccatttgt ggcaagaagt ctcttagaga agctttctag gcagatccaa 2521 gtgatggaga aactcgcagc tgtcagtgat gagaacatag gaaatatcag ttctgttgtg 2581 gaagccatac cagaatttca caaaaagctg tctttgctgt cattctggac caagtgctgc 2641 agccctgttg gtgtctacca cagcccagcg gacagagtga tgaagcagct ggaggccagc 2701 tttgccagaa ctgtcaacaa agaatatcca ggacttgcag acccagtgtt tcgaaccctg 2761 gtgtcccaaa ttctggacca ggctgagcct ctgctttcct ccagcctgtc ctcggaagtc 2821 gtcactgttt tccagtatta cagttacttc accagccacg gcgtcagtga cctggagagt 2881 tacctgagcc agctggccag gcaagtttcc atggttcaga ctctgcaatc actaagagat 2941 gaaaaactgc tacaaaccat gagtgacctt gctcccagca acctcctggc ccagcaggaa 3001 gtactcagga ctctggctct gctattaacc agagaggaca acgaagttag cgaggctgtg 3061 acgctttact tggcagcagc ctccaaaaat cagcatttca gggaaaaggc cttgctctat 3121 tactgtgaag cactaacaaa gacaaacctc cagctccaga aagcagcttg cctggctctg 3181 aaaatccttg aggctactga aagcattaaa atgctggtga cattgtgtca atctgatact 3241 gaagaaatca gaaatgtggc ctcagaaacc ctcttgtctc tgggagaaga tgggcggctg 3301 gcatatgaac aattggacaa atttcctcga gactgtgtta aagtcggagg tcgtcatgga 3361 actgaagttg ccacagcctt ttaattacag attaactgcc taacagctgt cttaatatct 3421 ggcccttttc atcaggatgg tgctgtggtt tgggctggaa attgtttaga gcctgagaga 3481 cacgacaact gaaataaaaa tgtaggccag gtgcagtggc tcatgcctgt aatcccagta 3541 ctttgggagg ccaacgcagg agaattgctt aagcccagga ggttcaagac caacctgggc 3601 aacatagcaa ggccctgtct ctacaaaaaa aatagtttaa ttagtcgggc gtggtggcat 3661 gcacctgtag tcccagctac ttgggaggct aaggtgggag gattgcttga gcccagaaga 3721 ttcaggctgc agtgagccat aattgcccca ctgcactcca gactgggtga cagagcaaaa 3781 ccctgtctca aaaaacaaaa caacaacaac aactaaaaaa acatgttata gaaagaattc 3841 agagtctcgt ttcacagtac ctatctactt ttcctttggc caatatagaa tagggctatg 3901 gtataattca aatagattac atcgtcgatt gtgtctaata atactcagtg aaagagtcca 3961 tcttatctta atatgtatga aatttaaaaa tagccatctt tgtacttttt tgcaagtttc 4021 tctataagct tgaaatagtc atgtgatact tcgacaaaca tcatatgcct tgcagttttt 4081 ctcggtcgtg ttctcagggt tgagtagtcc ccttagctac ctaactttac tttcaataca 4141 aagcacaaaa aagaatactt caaataaaag tttgcctgca gaacctggca aaatgaccca 4201 ttatgagagt ttagatgttt taattttatg tgctcccagc tactctggag tctgaggtgg 4261 gaggatcact tgaggcagag gttgcagtaa gctgagatta cactactgca ctccagccca 4321 tgtgacagag gaatgagagc cagtctcaaa aaaaaaaaac aaaaaactcc cataatttat 4381 tttgtttctt ttccccatcc tgaaaattcc caaatcatgt ttccattatg gaaaaatgat 4441 aagtaaatga ctaagaacta cattaagatt atgcctatac acttaaacaa aaaactcaaa 4501 gcttatgctt ttttttttta attaatgaga gctatttttt atgatattct taccataggg 4561 gtgttttgct gctaagacaa atcaaaacca aaagctgtag attcacaaac ctgtgatgct 4621 ctttgaggtg gaggaaccta gaagtcagag aaatcctaat ggagtaggag tggaggaaca 4681 tttaaagtgg tccctccttc tgtaaaaatg accgtgggat tagaaagaag gacatcctga 4741 ggggtggtta ctgcccccag ggaaatcact cactggtagg attccctggc caaatagttc 4801 aaacatcagg tcccattatt gcttcagtat cagagatgca agttcattaa gcaaagtaca 4861 agaccattca gtagctctta ttaaaatatc ttttcttcct ctaaagagtg tacaaggtgg 4921 ggtatgccaa ggtatcaaaa caatatatgt gagtgtaatt taaactgtgg aatatcaact 4981 gtactatgga cgtgtttgta tcatttagat gtcattttaa atatttacat tttagcaaga 5041 cttttaaaaa ggactcattt catttcaaag tgcaaattgt ttgccaggct tctggcaaat 5101 ggttctttca actgtgaact tatagtgtac atatctgtat atttataaat attatatata 5161 ttcatacatc cttcagttta aaggtacatt gtacagtctg tagttaggag gtatagccta 5221 tagcttatgt taaatggttg aaatggttct ttttatagaa agtcaaacac agatgttaca 5281 ggattttgtg tttggtttgt cattttttta ttttttattt tgactattgc atgagtaatt 5341 aattccagat cttttgtatt cacttctgta ttttatgttt ggttgagggg tgcttttagt 5401 tgtgtggcat ttgtattcat tgatctttca gtcatgtaag ttaaaataaa aattattttt 5461 gaattactag c // LOCUS AB002386 4606 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0388 gene, complete cds. ACCESSION AB002386 NID g2224716 KEYWORDS KIAA0388. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HJ0039. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4606) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..4606 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HJ0039" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 101..2344 /gene="KIAA0388" CDS 101..2344 /gene="KIAA0388" /codon_start=1 /db_xref="PID:d1021684" /db_xref="PID:g2224717" /translation="MEIPNPPTSKCITYWKRKVKSEYMRLRQLKRLQANMGAKALYVA NFAKVQEKTQILNEEWKKLRVQPVQSMKPVSGHPFLKKCTIESIFPGFASQHMLMRSL NTVALVPIMYSWSPLQQNFMVEDETVLCNIPYMGDEVKEEDETFIEELINNYDGKVHG EEEMIPGSVLISDAVFLELVDALNQYSDEEEEGHNDTSDGKQDDSKEDLPVTRKRKRH AIEGNKKSSKKQFPNDMIFSAIASMFPENGVPDDMKERYRELTEMSDPNALPPQCTPN IDGPNAKSVQREQSLHSFHTLFCRRCFKYDCFLHPFHATPNVYKRKNKEIKIEPEPCG TDCFLLLEGAKEYAMLHNPRSKCSGRRRRRHHIVSASCSNASASAVAETKEGDSDRDT GNDWASSSSEANSRCQTPTKQKASPAPPQLCVVEAPSEPVEWTGAEESLFRVFHGTYF NNFCSIARLLGTKTCKQVFQFAVKESLILKLPTDELMNPSQKKKRKHRLWAAHCRKIQ LKKDNSSTQVYNYQPCDHPDRPCDSTCPCIMTQNFCEKFCQCNPDCQNRFPGCRCKTQ CNTKQCPCYLAVRECDPDLCLTCGASEHWDCKVVSCKNCSIQRGLKKHLLLAPSDVAG WGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYMSSFLFNLNNDFVVDATRKG NKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRYSQADALKYVGI ERETDVL" BASE COUNT 1201 a 1135 c 1183 g 1087 t ORIGIN 1 ggacacctgt tctgctgttg tgtcctgcca ttctcctgaa gaacagaggc acactgtaaa 61 acccaacact tccccttgca ttctataaga ttacagcaag atggaaatac caaatccccc 121 tacctccaaa tgtatcactt actggaaaag aaaagtgaaa tctgaataca tgcgacttcg 181 acaacttaaa cggcttcagg caaatatggg tgcaaaggct ttgtatgtgg caaattttgc 241 aaaggttcaa gaaaaaaccc agatcctcaa tgaagaatgg aagaagcttc gtgtccaacc 301 tgttcagtca atgaagcctg tgagtggaca cccttttctc aaaaagtgta ccatagagag 361 cattttcccg ggatttgcaa gccaacatat gttaatgagg tcactgaaca cagttgcatt 421 ggttcccatc atgtattcct ggtcccctct ccaacagaac tttatggtag aagatgagac 481 ggttttgtgc aatattccct acatgggaga tgaagtgaaa gaagaagatg agacttttat 541 tgaggagctg atcaataact atgatgggaa agtccatggt gaagaagaga tgatccctgg 601 atccgttctg attagtgatg ctgtttttct ggagttggtc gatgccctga atcagtactc 661 agatgaggag gaggaagggc acaatgacac ctcagatgga aagcaggatg acagcaaaga 721 agatctgcca gtaacaagaa agagaaagcg acatgctatt gaaggcaaca aaaagagttc 781 caagaaacag ttcccaaatg acatgatctt cagtgcaatt gcctcaatgt tccctgagaa 841 tggtgtccca gatgacatga aggagaggta tcgagaacta acagagatgt cagaccccaa 901 tgcacttccc cctcagtgca cacccaacat cgatggcccc aatgccaagt ctgtgcagcg 961 ggagcaatct ctgcactcct tccacacact tttttgccgg cgctgcttta aatacgactg 1021 cttccttcac ccttttcatg ccacccctaa tgtatataaa cgcaagaata aagaaatcaa 1081 gattgaacca gaaccatgtg gcacagactg cttccttttg ctggaaggag caaaggagta 1141 tgccatgctc cacaaccccc gctccaagtg ctctggtcgt cgccggagaa ggcaccacat 1201 agtcagtgct tcctgctcca atgcctcagc ctctgctgtg gctgagacta aagaaggaga 1261 cagtgacagg gacacaggca atgactgggc ctccagttct tcagaggcta actctcgctg 1321 tcagactccc acaaaacaga aggctagtcc agccccacct caactctgcg tagtggaagc 1381 accctcggag cctgtggaat ggactggggc tgaagaatct ctttttcgag tcttccatgg 1441 cacctacttc aacaacttct gttcaatagc caggcttctg gggaccaaga cgtgcaagca 1501 ggtctttcag tttgcagtca aagaatcact tatcctgaag ctgccaacag atgagctcat 1561 gaacccctca cagaagaaga aaagaaagca cagattgtgg gctgcacact gcaggaagat 1621 tcagctgaag aaagataact cttccacaca agtgtacaac taccaaccct gcgaccaccc 1681 agaccgcccc tgtgacagca cctgcccctg catcatgact cagaatttct gtgagaagtt 1741 ctgccagtgc aacccagact gtcagaatcg tttccctggc tgtcgctgta agacccagtg 1801 caataccaag caatgtcctt gctatctggc agtgcgagaa tgtgaccctg acctgtgtct 1861 cacctgtggg gcctcagagc actgggactg caaggtggtt tcctgtaaaa actgcagcat 1921 ccagcgtgga cttaagaagc acctgctgct ggccccctct gatgtggccg gatggggcac 1981 cttcataaag gagtctgtgc agaagaacga attcatttct gaatactgtg gtgagctcat 2041 ctctcaggat gaggctgatc gacgcggaaa ggtctatgac aaatacatgt ccagcttcct 2101 cttcaacctc aataatgatt ttgtagtgga tgctactcgg aaaggaaaca aaattcgatt 2161 tgcaaatcat tcagtgaatc ccaactgtta tgccaaagtg gtcatggtga atggagacca 2221 tcggattggg atctttgcca agagggcaat tcaagctggc gaagagctct tctttgatta 2281 caggtacagc caagctgatg ctctcaagta cgtggggatc gagagggaga ccgacgtcct 2341 ttagccctcc caggccccac ggcagcactt atggtagcgg cactgtcttg gctttcgtgc 2401 tcacaccact gctgctcgag tctcctgcac tgtgtctccc acactgagaa accccccaac 2461 ccactccctc tgtagtgagg cctctgccat gtccagaggg cacaaaactg tctcaatgag 2521 aggggagaca gaggcagcta gggcttggtc tcccaggaca gagagttaca gaaatgggag 2581 actgtttctc tggcctcaga agaagcgagc acaggctggg gtggatgact tatgcgtgat 2641 ttcgtgtcgg ctccccaggc tgtggcctca ggaatcaact taggcagttc ccaacaagcg 2701 ctagcctgta attgtagctt tccacatcaa gagtccttat gttattggga tgcaggcaaa 2761 cctctgtggt cctaagacct ggagaggaca ggctaagtga agtgtggtcc ctggagccta 2821 caagtggtct gggttagagg cgagcctggc aggcagcaca gactgaactc agaggtagac 2881 aggtcacctt actacctcct ccctcgtggc agggctcaaa ctgaaagagt gtgggttcta 2941 agtacaggca ttcaaggctg ggggaaggaa agctacgcca tccttcctta gccagagagg 3001 gagaaccagc cagatgatag tagttaaact gctaagcttg ggcccaggag gctttgagaa 3061 agccttctct gtgtactctg gagatagatg gagaagtgtt ttcagattcc tgggaacaga 3121 caccagtgct ccagctcctc caaagttctg gcttagcagc tgcaggcaag cattatgctg 3181 ctattgaaga agcattaggg gtatgcctgg caggtgtgag catcctggct cgctggattt 3241 gtgggtgttt tcaggccttc cattccccat agaggcaagg cccaatggcc agtgttgctt 3301 atcgcttcag ggtaggtggg cacaggcttg gactagagag gagaaagatt ggtgtaatct 3361 gctttcctgt ctgtagtgcc tgctgtttgg aaagggtgag ttagaatatg ttccaaggtt 3421 ggtgaggggc taaattgcac gcgtttaggc tggcaccccg tgtgcagggc acactggcag 3481 agggtatctg aagtgggaga agaagcaggt agaccacctg tcccaggctg tggtgccacc 3541 ctctctggca ttcatgcaga gcaaagcact ttaaccattt cttttaaaag gtctatagat 3601 tggggtagag tttggcctaa ggtctctagg gtccctgcct aaatcccact cctgagggag 3661 ggggaagaag agagggtggg agattctcct ccagtcctgt ctcatctcct gggagaggca 3721 gacgagtgag tttcacacag aagaatttca tgtgaatggg gccagcaaga gctgccctgt 3781 gtccatggtg ggtgtgccgg gctggctggg aacaaggagc agtatgttga gtagaaaggg 3841 tgtgggcggg tatagattgg cctgggagtg ttacagtagg gagcaggctt ctcccttctt 3901 tctgggactc agagccccgc ttcttcccac tccacttgtt gtcccatgaa ggaagaagtg 3961 gggttcctcc tgacccagct gcctcttacg gtttggtatg ggacatgcac acacactcac 4021 atgctctcac tcaccacact ggagggcaca cacgtacccc gcacccagca actcctgaca 4081 gaaagctcct cccacccaaa tgggccaggc cccagcatga tcctgaaatc tgcatccgcc 4141 gtggtttgta ttcattgtgc atatcaggga taccctcaag ctggactgtg ggttccaaat 4201 tactcataga ggagaaaacc agagaaagat gaagaggagg agttaggtct atttgaaatg 4261 ccaggggctc gctgtgagga ataggtaaaa aaaaactttt caccagcctt tgagagacta 4321 gactgacccc acccttcctt cagtgagcag aatcactgtg gtcagtctcc tgtcccagct 4381 tcagttcatg aatactcctg ttcctccagt ttcccatcct ttgtccctgc tgtcccccac 4441 ttttaaagat gggtctcaac ccctccccac cacgtcatga tggatggggc aaggtggtgg 4501 ggactagggg agcctggtat acatgcggct tcattgccaa taaatttcat gcactttaaa 4561 gtcctgtggc ttgtgacctc ttaataaagt gttagaatcc attttg // LOCUS AB002387 5212 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0389 gene, complete cds. ACCESSION AB002387 NID g2224718 KEYWORDS KIAA0389. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HJ0061. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5212) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..5212 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HJ0061" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 140..3997 /gene="KIAA0389" CDS 140..3997 /gene="KIAA0389" /codon_start=1 /db_xref="PID:d1021685" /db_xref="PID:g2224719" /translation="MEDGKPVWAPHPTDGFQMGNIVDIGPDSLTIEPLNQKGKTFLAL INQVFPAEEDSKKDVEDNCSLMYLNEATLLHNIKVRYSKDRIYTYVANILIAVNPYFD IPKIYSSEAIKSYQGKSLGTRPPHVFAIADKAFRDMKVLKMSQSIIVSGESGAGKTEN TKFVLRYLTESYGTGQDIDDRIVEANPLLEAFGNAKTVRNNNSSRFGKFVEIHFNEKS SVVGGFVSHYLLEKSRICVQGKEERNYHIFYRLCAGASEDIREKLHLSSPDNFRYLNR GCTRYFANKETDKQILQNRKSPEYLKAGSMKDPLLDDHGDFIRMCTAMKKIGLDDEEK LDLFRVVAGVLHLGNIDFEEAGSTSGGCNLKNKSAQSLEYCAELLGLDQDDLRVSLTT RVMLTTAGGTKGTVIKVPLKVEQANNARDALAKTVYSHLFDHVVNRVNQCFPFETSSY FIGVLDIAGFEYFEHNSFEQFCINYCNEKLQQFFNERILKEEQELYQKEGLGVNEVHY VDNQDCIDLIEAKLVGILDILDEENRLPQPSDQHFTSAVHQKHKDHFRLTIPRKSKLA VHRNIRDDEGFIIRHFAGAVCYETTQFVEKNNDALHMSLESLICESRDKFIRELFESS TNNNKDTKQKAGKLSFISVGNKFKTQLNLLLDKLRSTGASFIRCIKPNLKMTSHHFEG AQILSQLQCSGMVSVLDLMQGGYPSRASFHELYNMYKKYMPDKLARLDPRLFCKALFK ALGLNENDYKFGLTKVFFRPGKFAEFDQIMKSDPDHLAELVKRVNHWLTCSRWKKVQW CSLSVIKLKNKIKYRAEACIKMQKTIRMWLCKRRHKPRIDGLVKVGTLKKRLDKFNEV VSVLKDGKPEMNKQIKNLEISIDTLMAKIKSTMMTQEQIQKEYDALVKSSEELLSALQ KKKQQEEEAERLRRIQEEMEKERKRREEDEKRRRKEEEERRMKLEMEAKRKQEEEERK KREDDEKRIQAEVEAQLARQKEEESQQQAVLEQERRDRELALRIAQSEAELISDEAQA DLALRRNDGTRPKMTPEQMAKEMSEFLSRGPAVLATKAAAGTKKYDLSKWKYAELRDT INTSCDIELLAACREEFHRRLKVYHAWKSKNKKRNTETEQRAPKSVTDYDFAPFLNNS PQQNPAAQIPARQREIEMNRQQRFFRIPFIRPADQYKDPQSKKKGWWYAHFDGPWIAR QMELHPDKPPILLVAGKDDMEMCELNLEETGLTRKRGAEILPRQFEEIWERCGGIQYL QNAIESRQARPTYATAMLQSLLK" BASE COUNT 1698 a 936 c 1122 g 1456 t ORIGIN 1 cttcactggc cctcatcact tctcaccgcg ccctccagct tcacccgtac aggtagcccc 61 gccgccgcgc acctgccttc gctcccgcac cggtgacagt ggatagtgga aacaggagat 121 cgtggatcct ccttcaaaaa tggaggatgg aaagcccgtt tgggcgccac accctacaga 181 tggatttcag atgggcaata ttgtggatat tggccccgac agcttaacaa ttgaaccctt 241 gaatcagaaa ggcaagacat ttttggctct cataaaccaa gtgtttcctg cagaagagga 301 cagtaaaaaa gatgtggaag ataactgttc actaatgtat ttaaatgaag ccacactgct 361 ccataatatc aaagttcgat atagtaaaga cagaatttat acatatgtcg ccaacattct 421 gattgcagtg aatccatact ttgacatacc taaaatatat tcttcagaag caataaagtc 481 atatcaagga aaatctcttg ggacaagacc acctcatgtc tttgcaattg ctgataaagc 541 ttttcgagac atgaaggtgc tcaagatgag tcagtctatc attgtatctg gagaatcagg 601 agccggcaaa acagaaaata caaaatttgt tctaagatac ctgactgaat cctatggaac 661 aggtcaagat attgatgaca gaattgttga agctaaccca ctcctagaag cctttggaaa 721 tgcgaagact gttcgcaaca ataatagcag tcgatttggg aaatttgtag aaatacattt 781 taatgaaaag agctcagttg ttggaggatt tgtttcacat tatctcctag agaaatctag 841 gatctgtgtt caaggcaaag aggaaagaaa ttatcatatc ttttataggt tgtgtgctgg 901 tgcttctgaa gatattagag aaaaacttca tttgagttca ccagataatt ttcggtattt 961 aaaccgaggc tgcactagat actttgctaa caaagaaact gacaaacaga ttttacagaa 1021 ccgcaaaagt cctgagtacc ttaaggcagg ttctatgaaa gatcctctgc tagatgacca 1081 tggtgatttt attagaatgt gcacggctat gaaaaaaatt ggtttggatg atgaagaaaa 1141 gcttgatctc ttccgggtag tagctggcgt cctgcacctt ggaaatattg attttgagga 1201 agctggcagc acttcaggtg gttgtaatct gaagaataaa tctgctcagt ctttggaata 1261 ttgtgctgaa ttactgggtt tggaccaaga tgatcttcga gtaagtttga ccacaagagt 1321 catgctaaca acagcagggg gcaccaaagg aacagttata aaggtacctc tgaaagtgga 1381 gcaagcaaac aatgctcgtg atgccctggc aaagacagtg tatagccatc tttttgatca 1441 tgtggtaaac agagtaaatc agtgttttcc ttttgaaaca tcatcctatt ttattggagt 1501 cctagatatt gctggttttg agtactttga gcataacagt tttgaacaat tttgcatcaa 1561 ctattgcaat gaaaaacttc aacaattttt taatgaaagg attctgaagg aggaacaaga 1621 actctatcaa aaagaaggtt taggtgttaa tgaagtgcat tatgtggata atcaggactg 1681 tatagattta attgaagcca aattagtggg aatactggat attttggatg aagaaaatcg 1741 ccttccccag ccaagtgatc aacactttac atctgcagtt caccaaaagc acaaggatca 1801 ttttcgactc actattccca gaaaatctaa gctggcagtt cataggaata tcagagacga 1861 cgaaggcttc attatcaggc attttgcggg ggcagtgtgc tatgaaacaa cccagtttgt 1921 ggagaaaaat aatgatgctt tacatatgtc tcttgaatcc ttaatatgtg aatccagaga 1981 taagtttata cgggaattat ttgaatcatc cacaaataac aacaaagata ctaaacaaaa 2041 agcaggaaaa cttagcttca tcagcgtggg aaacaagttt aagacacagt taaatttgct 2101 tctggataaa cttcgaagta ctggagcaag ctttattcgt tgcatcaaac ctaacttaaa 2161 gatgacaagc caccactttg aaggtgctca aattctgtct cagcttcagt gttcagggat 2221 ggtgtctgtt ttggacttga tgcagggtgg ttacccatca cgagcttcat ttcatgaact 2281 ctacaacatg tacaaaaagt atatgccaga taaacttgca agattggatc caagactatt 2341 ttgtaaggct ttgtttaaag ctttgggctt aaatgaaaat gactacaagt ttgggttaac 2401 caaagtattt tttagacctg gcaagtttgc agaatttgat cagatcatga agtctgaccc 2461 tgaccactta gcagagttgg ttaaaagagt caatcactgg ctcacatgca gtcgctggaa 2521 gaaagttcag tggtgctcac tctcagtcat caaattgaaa aacaaaataa aatatcgagc 2581 tgaagcctgc attaaaatgc aaaaaactat tcgaatgtgg ctttgcaaga ggagacacaa 2641 acctcgcatt gatggtctgg ttaaggtggg cacactgaaa aaacgacttg ataaatttaa 2701 tgaggtagtc agtgtgttga aagatggaaa acccgagatg aataaacaga tcaagaatct 2761 ggaaatttct attgatactt tgatggccaa aattaagtcc actatgatga cgcaggaaca 2821 aatccagaaa gaatatgatg cactggttaa aagctcagag gaactcctca gtgcattaca 2881 gaaaaaaaaa cagcaggaag aggaagcaga aaggctgagg cgtattcaag aagaaatgga 2941 aaaggaaaga aaaagacgtg aagaagacga aaaacgtcga agaaaggaag aggaggaaag 3001 gcggatgaaa cttgagatgg aagcaaagag aaaacaagaa gaagaagaga gaaagaaaag 3061 ggaagatgat gaaaaacgca ttcaagctga agtggaggca cagctggccc gacagaagga 3121 ggaggaatcc caacagcaag cagttctgga gcaggagcgc agggaccggg agctggccct 3181 gaggattgcc cagagtgaag ccgagctcat cagtgatgag gcccaggccg acctggcgct 3241 gcggagaaat gatggaacaa gacccaaaat gacaccggaa caaatggcca aagaaatgtc 3301 agaatttttg agtagaggtc ctgctgtact agccaccaaa gcagctgctg gtactaagaa 3361 atatgatctt agtaaatgga aatatgcaga actacgtgat accatcaata cttcttgtga 3421 tattgagctc ctggcagctt gcagagaaga atttcatagg agactaaaag tgtatcatgc 3481 ttggaaatct aagaacaaga agagaaatac tgaaacagag caacgtgctc caaagtctgt 3541 tactgattat gattttgcac catttttgaa caattcacct cagcaaaacc cagcagctca 3601 gattcctgcc aggcagcggg agattgaaat gaaccgacag caacgcttct tccgcatccc 3661 attcatccgc cctgccgacc agtacaaaga ccctcagagt aagaaaaaag gctggtggta 3721 tgcccatttt gatggaccat ggattgcccg gcaaatggaa ctccatcctg acaagccacc 3781 catcctactt gtggctggta aggacgacat ggagatgtgt gagctgaatc ttgaggagac 3841 tggcctgact cggaagcgtg gtgctgagat cttgccaaga cagtttgaag aaatctggga 3901 acgctgtgga ggcatccagt accttcagaa tgcgattgag agcagacagg ctcggcccac 3961 ctatgcaaca gccatgctgc agagtctgtt aaagtagatg ttgcacacta gccttacagc 4021 tgggagcctt tgccatggta cttaggtagg gtgtgtgccc ccagatttaa ccattccata 4081 atcatgttag agttacttct ataaagtgaa cagattttat taatcacggc ttttggtgaa 4141 tttgtttaag gttaattatg gtagcaaatt ttggacctaa acattatttt tctgtatccc 4201 gctgtaattc ccaaaactct cattattctc taactattac acatgggcat attctgatgt 4261 ttctcatcct ttgccagaag actaccttac atccatcgta attgttctct aggaaaagag 4321 aacttttttc aaaattcaaa atacttttta aggatggcac agtaccatat aactggagta 4381 ataaaacatg agcttacatt cttacaataa ctaaaccact taaaatgatc aaggcactaa 4441 tgttttggtc tgaaaagctg tgtactttat agacattttc agacattttt ggaaatttcc 4501 attaaaggtg gaaaatctat ttttttcctc ctttgcagtg tcttagtttg aatgaaacac 4561 ttcgaagttc tagaattcta gaaagagcct taatgtattt gatgtattct gtgataagag 4621 gtactaatag tatccagcac agatttgctt ttctttgcta gcacaatgtg tgttgctgtc 4681 agaatattct ttttatattc tgtggaaaaa taaaggaaat tcagattgtt taaatgccta 4741 aaagttttga gataagtttt gtttcaatta gaaaaggaaa taggttttag gtggcatagt 4801 ggcttaactg gactgaattc aaatattctt tcaacttcat ctcaatagtg atttttgtat 4861 cagaatcttg tccaagttgt ttcattgatt tagtaagtgt tctgcttcca acatctttct 4921 ttttaagaaa ttcctagtgt cttttttggc ctttgaggtt ttggtaattg tagacctgtt 4981 tcataagctt tgtaattcag aaatccttgt atttagtaag tgcttgtttt acataactga 5041 taattttaaa atgttttctt tgtgtgctgt tagtattgat tcaaatgtca gcagctttaa 5101 gcctaatatt tatgactttc acatttggaa tttaaagaca aaaatacatc aaggagttat 5161 gctgacataa ttctaaggag ttttgttgta ttttagaata aaattataaa gt // LOCUS AB002388 4935 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0390 gene, complete cds. ACCESSION AB002388 NID g2224720 KEYWORDS KIAA0390. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HJ0075. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4935) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..4935 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HJ0075" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 139..4041 /gene="KIAA0390" CDS 139..4041 /gene="KIAA0390" /codon_start=1 /db_xref="PID:d1021686" /db_xref="PID:g2224721" /translation="MEEASLCLGVSSAEPEAEPHLSGPVLNGQYAMSQKLHQITSQLS HAFPELHPRPNPEEKPPASLEEKAHVPMSGQPMGSQMALLANQLGREVDTSLNGRVDL QQFLNGQNLGIMSQMSDIEDDARKNRKYPCPLCGKRFRFNSILSLHMRTHTGEKPFKC PYCDHRAAQKGNLKIHLRTHKLGNLGKGRGRVREENRLLHELEERAILRDKQLKGSLL QPRPDLKPPPHAQQAPLAACTLALQANHSVPDVAHPVPSPKPASVQEDAVAPAAGFRC TFCKGKFKKREELDRHIRILHKPYKCTLCDFAASQEEELISHVEKAHITAESAQGQGP NGGGEQSANEFRCEVCGQVFSQAWFLKGHMRKHKDSFEHCCQICGRRFKEPWFLKNHM KVHLNKLSVKNKSPSDPEVPVPMGGMSQEAHANLYSRYLSCLQSGFMTPDKAGLSEPS QLYGKGELPMKEKEALGKLLSPISSMAHGVPEGDKHSLLGCLNLVPPLKSSCIERLQA AAKAAEMDPVNSYQAWQLMARGMAMEHGFLSKEHPLQRNHEDTLANAGVLFDKEKREY VLVGADGSKQKMPADLVHSTKVGSQRDLPSKLDPLESSRDFLSHGLNQTLEYNLQGPG NMKEKPTECPDCGRVFRTYHQVVVHSRVHKRDRKGEEDGLHVGLDERRGSGSDQESQS VSRSTTPGSSNVTEESGVGGGLSQTGSAQEDSPHPSSPSSSDIGEEAGRSAGVQQPAL LRDRSLGSAMKDCPYCGKTFRTSHHLKVHLRIHTGEKPYKCPHCDYAGTQSASLKYHL ERHHRERQNGAGPLSGQPPNQDHKDEMSSKASLFIRPDILRGAFKGLPGIDFRGGPAS QQWTSGVLSSGDHSGQATGMSSEVPSDALKGTDLPSKSTHFSEIGRAYQSIVSNGVNF QGSLQAFMDSFVLSSLKKEKDMKDKALADPPSMKVHGVDGGEEKPSGKSSQRKSEKSQ YEPLDLSVRPDAASLPGSSVTVQDSIAWHGCLFCAFTTSSMELMALHLQANHLGKAKR KDNTIGVTVNCKDQAREASKMALLPSLQSNKDLGLSNMISSLDSASEKMAQGQLKETL GEQKSGAWTGHVDPAFCNFPSDFYKQFGVYPGMVGSGASSSCPNKEPDGKAHSEEDVP ILIPETTSKNTTDDLSDIASSEDMDSSKGENNDEEDVETEPEMMTKPLSALSKDSSSD GGDSLQPTGTSQPVQGLVSPLSQAPEKQWHSQGLLQAQDPLAGLPKPERGPQSLDKPM NMLSVLRAYSSDGLAAFNGLASSTANSGCIKRPDLCGK" BASE COUNT 1182 a 1391 c 1388 g 974 t ORIGIN 1 gtctggagaa gagagtttcc acgttgtcat cataggatct ggactcaagt gaccaaaacc 61 ttgcttggat ttttacctga tgaagtttgt cttgaaaagt gaaaatggtt ccatttccat 121 tttccctgtt ctcaagggat ggaagaagcg agcctgtgcc ttggagtgtc ttcggcggag 181 ccggaagctg agccccacct gagtggcccc gtcctcaacg gccagtatgc catgagtcag 241 aagctgcacc agatcacctc ccagctcagc catgccttcc ccgagctcca tccccggccc 301 aaccccgagg agaagccccc cgcatccctg gaggagaagg cccacgtgcc catgagcggc 361 cagcccatgg gcagtcagat ggcgctcctg gccaaccagc tgggccggga ggtggacacc 421 agcctcaacg ggagggtgga cttgcagcag ttcctcaacg ggcagaacct gggcatcatg 481 tcccagatga gcgacatcga ggacgacgcc cgcaagaacc gcaagtaccc gtgcccactc 541 tgcggcaagc gcttccgctt caacagcatc ctctccctgc acatgcgcac gcacacgggc 601 gagaagccct tcaagtgccc gtactgcgac cacagggcgg cgcagaaggg gaacctcaag 661 attcacctgc ggacccacaa gctgggcaac ctgggcaagg ggcgtgggcg tgtgcgcgag 721 gagaaccgcc tgctgcacga gctggaggag cgcgccatcc tgcgggacaa gcagctgaaa 781 ggcagcctgc tgcagccccg gccggacctg aagcccccgc cgcacgccca gcaggccccg 841 ctggccgcct gcaccctggc cctgcaggct aaccacagcg ttcccgacgt ggcccacccg 901 gtgccctcgc ccaagcctgc cagcgtgcag gaggacgcgg tggccccggc ggcgggcttc 961 cgctgtacct tctgcaaggg caagttcaag aagcgcgagg agctggaccg ccacatccgc 1021 atcttgcaca agccctacaa gtgcacgttg tgcgacttcg cggcttcgca ggaggaggag 1081 ctcatcagcc acgtggagaa ggcacacatc acggccgagt cggcccaggg ccagggcccc 1141 aacggcggtg gcgagcagtc ggccaacgag ttccgctgcg aggtgtgcgg tcaggtgttc 1201 agccaggcgt ggttcctcaa gggtcacatg cgcaagcaca aagactcctt tgagcactgc 1261 tgccagatct gcggccggcg cttcaaggag ccctggttcc tcaagaacca catgaaggtc 1321 cacctcaaca agctgtcggt gaagaacaag tcccccagcg accccgaggt gcctgtgccc 1381 atgggcggca tgtcccagga ggcccacgcc aacctgtact ccaggtacct ctcctgcctg 1441 cagagtggct tcatgacccc ggacaaagcc ggcctgagcg agcccagcca gctctatggc 1501 aagggcgagc tgcccatgaa ggagaaggaa gcgctgggga agctgctgtc tcccatctcc 1561 agcatggccc acggcgtccc ggagggggac aagcactccc tcctgggatg cctcaatctc 1621 gtgccgccgc tgaaatccag ctgcatcgag cggctgcagg cggctgccaa ggctgcggag 1681 atggaccccg tgaacagcta ccaggcttgg cagctcatgg ccaggggcat ggccatggaa 1741 catggcttct tgtctaaaga gcatccgctg cagcgcaacc acgaagacac tttggcaaac 1801 gccggggttc tgtttgataa ggagaagcgg gagtacgtgt tagtgggagc agatggctcc 1861 aagcagaaaa tgcctgctga tttggttcac agcactaaag tgggcagcca gagagacctg 1921 ccaagtaagc tcgacccttt agaaagcagt cgggattttt tgtcacacgg gctgaaccag 1981 actctcgagt ataacctgca gggtcctggg aacatgaagg agaagcccac cgagtgcccc 2041 gactgcggcc gggtgttccg cacttaccac caggtggtcg tgcactcccg tgtccacaag 2101 cgggaccgca agggcgagga ggatgggctg cacgtgggcc tggatgagcg gcgtggctcg 2161 ggcagtgacc aggagtccca gtcggtgagc cgctccacca cgccgggctc ctctaacgtc 2221 accgaggaga gcggggtcgg aggcggcctc tcccagaccg ggagtgccca ggaggacagc 2281 ccgcacccct cctcgccatc ctcctcagac attggcgagg aggctgggag atctgccggc 2341 gtccagcaac cagcgctgct tcgcgacaga agcctgggct cggccatgaa ggactgcccg 2401 tactgtggga aaactttccg gacatcccat caccttaagg tgcacctgag gatacacaca 2461 ggtgagaaac cctacaagtg tccgcactgt gactatgccg gcacgcagtc agcatcctta 2521 aaataccact tagagcgaca ccatcgggag cggcagaacg gggctgggcc gctgtctggg 2581 caacccccaa atcaagacca caaggatgag atgtcaagca aagcttctct gttcatcagg 2641 ccagacatcc tgaggggggc cttcaagggt ctccctggaa tcgacttcag aggaggccct 2701 gcatctcagc agtggacatc aggggttctc tcctctggag atcactcggg gcaggccacg 2761 ggcatgtctt cggaggtccc ctcagatgct ctgaaaggca ctgaccttcc ttccaaaagc 2821 acccacttct ctgagatcgg aagagcttat caaagcattg tgagcaacgg tgtgaatttc 2881 caagggtcct tgcaagcttt catggacagt tttgtcctca gttccttgaa gaaggagaag 2941 gacatgaagg acaaagccct ggctgacccc ccttccatga aagtccacgg agtggatggt 3001 ggtgaggaga aacccagtgg caagtcctcc cagaggaagt ccgagaaatc tcagtatgaa 3061 cccctggact tgtctgtgcg gccagatgcc gcctccctcc cgggctcctc ggtaactgtg 3121 caggacagca ttgcatggca cggctgcttg ttttgtgctt tcacaacgtc ctccatggag 3181 ctcatggccc ttcatctcca ggccaaccac ctgggcaaag cgaaacgcaa agataacacc 3241 atcggggtca cagtcaactg caaagaccaa gcccgggagg cgagtaagat ggccctgctg 3301 ccctcgttac aatcaaacaa agacctgggc ctctccaata tgatcagctc tctagactct 3361 gcttctgaga agatggccca aggtcagctc aaggagactc tgggagagca gaagagcggt 3421 gcatggaccg gccacgtgga ccctgcattt tgtaacttcc catcagactt ctacaagcag 3481 tttggtgttt acccaggcat ggttggctca ggggcctcca gttcctgccc caacaaggag 3541 cctgatggaa aggcccactc tgaagaggat gtccccatcc tgatccccga aaccacgagt 3601 aagaacacta ctgatgacct ctctgacatt gcctcctcag aggacatgga ctcctccaag 3661 ggggagaaca acgatgaaga ggatgttgaa accgaaccgg aaatgatgac caagccactg 3721 tctgccctca gcaaagacag cagcagcgat ggcggggaca gcctgcagcc cacaggcacc 3781 tcccagcccg tccagggact ggtctcacct ttatcccaag caccggagaa gcagtggcac 3841 agccagggtc ttctccaagc ccaggacccc ttggcgggcc tgccaaagcc ggagcggggg 3901 ccccagagcc tggacaagcc gatgaacatg ctgtcggtcc tcagggccta cagttctgat 3961 ggcttagcag cctttaacgg acttgcaagt agcacagcaa attctggatg tatcaagagg 4021 ccagacttgt gtggtaagtg acactccctg tcctagtcgg tctatctgga cttgcccttg 4081 tctgttcgtg gtcctcggtg gttatctgca gcttgttaat cgtgtaaagt caagagaaga 4141 atgtatacac atatgtgtgt tgaataatta ctattggcat aggtatgtgt atacacacgg 4201 tgcaccaatc tacagtatat atagcagaga atcagaggct aaaaatatta ccccatatgt 4261 tccagtatta gtcatggatt gcaaaggctt agtaacttga gcaggagaga aaactccctc 4321 aaagtcataa atcctgagtg acaactgctg ctggatgaca gatcccttca cctgtggaca 4381 acctggctgg gggtgggggg ctgttccacc agctcacctg agcatgtaga ggtgggtcct 4441 gcagtggtct cgtgggtatt actgcttgtg tctgattgtc ctgtattttg taacacttta 4501 gaagaataca gaaaagtgca gtaattctct ttctccatag tatttaagca gaaatattgc 4561 tagtttaata ttgtgtcagg tcgtcctatt aaccaggagc agatgacagt aaaatttcag 4621 tgaatagcac cttgacatct acaacttaaa aatggtgatt gaagcaaaat atgtaaactt 4681 gtacggggtg atcgtgtgct ttggaacaga gtattgttga agtaattaga agatatatta 4741 aggtgttcct ggtaatgaag gcatgtaagt tataataatt gtagctttct gaataagtgt 4801 caaactatat ctttaagtgt gctgtatgct gagttacaag ttaggtcatt tatgaatgga 4861 atgtaaaata atactaaaaa tgcttcaata acttatcttg gtattgctaa taaaaaaaaa 4921 aagctgtgaa acatt // LOCUS AB002389 5677 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0391 gene, complete cds. ACCESSION AB002389 NID g2224722 KEYWORDS KIAA0391. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HJ0118. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5677) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..5677 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HJ0118" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 360..2063 /gene="KIAA0391" CDS 360..2063 /gene="KIAA0391" /codon_start=1 /db_xref="PID:d1021687" /db_xref="PID:g2224723" /translation="MTFYLFGIRSFPKLWKSPYLGLGPGHSYVSLFLADRCGIRNQQR LFSLKTMSPQNTKATNLIAKARYLRKDEGSNKQVYSVPHFFLAGAAKERSQMNSQTED HALAPVRNTIQLPTQPLNSEEWDKLKEDLKENTGKTSFESWIISQMAGCHSSIDVAKS LLAWVAAKNNGIVSYDLLVKYLYLCVFHMQTSEVIDVFEIMKARYKTLEPRGYSLLIR GLIHSDRWREALLLLEDIKKVITPSKKNYNDCIQGALLHQDVNTAWNLYQELLGHDIV PMLETLKAFFDFGKDIKDDNYSNKLLDILSYLRNNQLYPGESFAHSIKTWFESGQCSG CGKTIESIQLSPEEYECLKGKIMRDVIDGGDQYRKTTPQELKRFENFIKSRPPFDVVI DGLNVAKMFPKVRESQLLLNVVSQLAKRNLRLLVLGRKHMLRRSSQWSRDEMEEVQKQ ASCFFADDISEDDPFLLYATLHSGNHCRFITRDLMRDHKACLPDAKTQRLFFKWQQGH QLAIVNRFPGSKLTFQRILSYDTVVQTTGDSWHIPYDEDLVERCSCEVPTKWLCLHQK T" BASE COUNT 1691 a 1137 c 1200 g 1649 t ORIGIN 1 gccgcgttaa gtctgagtgc cgctttgagt tgttgaatga agtgaacttc atttgtcagc 61 gttcggttca tgaactggaa tgtaagaggc accagaggat tcctgctctg tcccctggtt 121 tgcggcttgc gacgttggac atccccggat tgttgtttaa tagagaaaac tcacctgcct 181 tcttgctttt aagtagcccc aaaagcagaa ccttgatttg tctgtaagga agaaacacaa 241 accttttaaa aagttccact tcgactctgc accgccgacc cccaatctct ttaattttgc 301 catagaagag gggttttttc aacatctctc tcactatctg gtgctgatct cactgcataa 361 tgactttcta tttgtttggt attcgaagct ttccgaagct ttggaagagc ccataccttg 421 ggctaggccc agggcactct tatgtctcgc tgtttctggc agaccgctgt ggcatcagga 481 accagcagag gttgttttct cttaaaacaa tgtctccaca gaataccaaa gcaacgaatc 541 tgattgccaa ggccagatat ctcaggaaag atgagggcag taataagcaa gtttattctg 601 ttcctcattt ttttttagct ggagcagcta aggagagatc acagatgaat tctcaaactg 661 aagatcatgc cttggcacct gtgaggaaca ctattcaact cccaacacaa cctttgaatt 721 cagaggagtg ggataaactt aaggaagatt taaaagaaaa caccggaaag accagtttcg 781 aaagttggat catttcacag atggctggct gtcatagctc tatagatgtg gctaaatctc 841 tgctggcatg ggtagcagcc aaaaataatg gtattgtaag ttacgattta ctggtcaagt 901 atttgtatct ctgtgtcttt catatgcaga catctgaagt tattgatgtc tttgaaatta 961 tgaaagccag atataagact ttagaaccta gaggttacag tcttctcatc cggggattga 1021 tccattcaga cagatggaga gaagcattgt tgctgttaga ggacatcaaa aaagttataa 1081 ctccttcaaa aaagaactat aatgactgta tccagggagc tctccttcat caagatgtaa 1141 acacagcttg gaatttatat caggaattgc taggtcatga tattgttcct atgttggaaa 1201 ctttaaaagc tttctttgat tttggaaaag acataaagga tgataactat tcaaataaac 1261 tactagatat tctttcatat ctaagaaata atcagctgta tccaggggag tcatttgcac 1321 acagtataaa aacatggttt gagagtggcc agtgttcggg ctgtggaaaa accatagagt 1381 ctattcagct gagtccagaa gaatatgaat gtcttaaggg aaaaatcatg agggatgtga 1441 tagatggagg tgaccagtac agaaagacaa cacctcagga acttaagaga tttgagaact 1501 tcataaaatc tcgtcctcct tttgatgttg tcattgatgg tctcaatgtt gccaaaatgt 1561 ttcctaaagt tcgtgaatct caacttctct tgaatgtcgt ctctcaacta gccaaacgga 1621 atctgcgact gctggtccta ggccggaagc acatgctaag acggagttcc cagtggagtc 1681 gggatgagat ggaagaggtg caaaagcaag ccagctgttt ttttgctgat gacatctcgg 1741 aggatgatcc attccttctg tatgccacac tgcactccgg gaatcactgc aggtttatca 1801 caagagacct gatgcgggac cacaaggcct gtctgcctga tgccaagacc caacgcctgt 1861 tttttaagtg gcagcaggga catcagctgg caattgtaaa taggtttcca ggatcaaaac 1921 taacctttca gcgtattctc agctatgaca cagtggtgca aacaactgga gactcgtggc 1981 acataccata tgatgaagac ttggtagaaa gatgttcctg tgaagtacca accaaatggc 2041 tttgcctcca ccaaaagaca tagagattct tacctctatg ctaagtttgt gtttgggtac 2101 cctctaggtt ggcatcagag gctcttgagc tggtgtttgt ttagggcatt gcctctgtcc 2161 tgaagataaa aggattctat taacagcatt gacattgatt ttttaatgaa atgagatata 2221 tcttttcata accagctgcg tttttttccc ctaacatttg tttttggagg cttatcaaga 2281 gttggagaac ttagtgtaga gcaaaacctg catttctcct actgggccag ctattccact 2341 tagcttgggt gactaatagt gcttttggta tccatttttt gctacttctg accttgcctt 2401 ccaggcctac caatagcaga atcaatccat ctgtccctga gatactcatg ttgtttcaaa 2461 tgcctcctcc catttctggc atagtctcat tctctgtatg ttatgcccta tccacatgga 2521 atcatttatc gtcctctgta ataaactggc caagatacta aaggcttact attcatagca 2581 gtttttaatt acttatcatc caattatttg gattggagaa gagggggcat tcactcctct 2641 ttttcttatt ttttttggaa atagagtctc aactcactct agcctgggtg acagagcgag 2701 accttgtctc aaaacaaaac aaagtgctgg aattgcagac ttgagccaca gtgcccagcc 2761 tcacttctct agactatgat ggttttttct tcattctata atctcttttc caaattggtt 2821 caacattttg tgaacactat taatttcatc attcagtata tgtgggcttt ctaaaatatg 2881 ccaatttttt tccacttaat caagtttgac ttaatttaac aaagtgatta tattttaata 2941 gttacatttc tgttttttcc actcactagc cagcttacag tttattagcc cttgatttca 3001 gctgaaaata ttcatgtctg caccccttca tgatagttct ttctttacgt atacatactg 3061 tattcaatat gcaagaacag gcaaaaacta ctctattgtg ataaaaatca gaatagtaat 3121 tgcctgagga aagggatatg agagaacttg agagaacttt ctcggggtga tggaaagttt 3181 cttatattga tttgggtaat agtaacatag ctatatgtat atattagtta aaattcctca 3241 cactaaacat tttaaattga tatatttaca tttatgacaa tatacctcaa agtaagttag 3301 ggtaagaaaa gataattact tacatgaaat aaacaatgac ctcttttata tcaaaccata 3361 tacatgtgta taattagggt atgtattatg cacagagaaa caatttagga aaatccaagg 3421 aggggacgtt ttatcttctt acctatttac tgaatgcaac attactgcac accaagacaa 3481 aagagctctc caggaaaaca ttggatatat tgagagcatt aaaagatact gcaaaagctc 3541 taataaattc agtctgctta ttttccaaat ttcataaact acatacttag gaaactgtgc 3601 tttcagtgag ctaaacttct ttttttaagt aactatcata gttttaagaa aaacatttta 3661 agaagacaaa aagtatttat taagcccatc taaaaggcta atgcaaattc ccaaaaaagg 3721 agcacataga gatagaggag gaggccgaag tggtggctca tacctgttaa ttccagcact 3781 ttgggaggcc aagacaggag gatcacttta ggcccagagt tggagaccaa cctgggcaac 3841 atagcaagac cctgtctctt aaaaaaaaaa aaaaagacgg gagaagctac aagaagaaaa 3901 ctagaacttt agagcaggag taaaccttag agcatgtaaa gtccattttg gagatgagga 3961 acagacccag gaagatgacc tggcttccct gaatcccacg gctagttagt gcagacattt 4021 cagccataac ccagctcttc taattcccaa atactctttc ttctactggc acatagagat 4081 gggggaggag tcagggcatg gtggcccaca cctacagttc cagcactttg ggaggccaaa 4141 tgggagaatt gcttgaagcc aggagttgga gaccagccta ggcaacacag ggagacccgt 4201 gtcgacaaaa aatttaaaaa ttagctgggc atggtagcac atgcctgtgg tcccagctac 4261 tcagagggct gaggtgggag gatcacttga gccccagagg tcaaggctgc agtgagctgt 4321 gatcatgcta ctgcactcca gcctaggtga cagagtgaga ccctgtctca aagggaggga 4381 ggtaagaatg agaagaagga acaggggtgt acctctttta agggcccaag tatcctgaat 4441 ggctcagcag tatagaacat tgtggtagag aaattacatt ttaaaataac tctaataccg 4501 tttagaaaca aaaccctaac ttctgcttga gataaactga agtgcatctg tcccttgtcc 4561 aggagtgggg aaccattgta gggttgctca gcataagtca tactgccacg gtgaccttga 4621 ggagtgcagg gattccctga aggaagcagc tggtaccaga cacttaggct gcccatttgt 4681 gttctgatca tttgagtgaa aaaaaggtac ctgtcaagca agctcctgga caccacaaga 4741 aggaggaatt attttaaaag ctgtactctt aaattgttag tatctttaaa atcagttgtg 4801 aacaatgaag gatttgaaag agcattgact ttgccactta aaagtatttt taaaatactt 4861 tgtgcttccc ccttgcattc tgaatttata cacttttcct cctgctgttc tcagacccag 4921 tggaaagaaa atctcaagga agaaggctga gtttattctc tcagggctct gttgggtcta 4981 cctcatctga ggtggcttat tcttcatagg aaattaattt ttcttctcaa gtatgcactt 5041 aaatataatt actgcttcct tggtcctcta gcagatttct cacttttatt tatttttttt 5101 tttgagacag agtcttgatc ttttttcatc taggctggag tgcaatggtt tgatctcagt 5161 tcactgcaac ctctgcctcc tgggttcaag caattctcat gcctctgcct ctcgggcagc 5221 tggaattaca ggcatgcgcc atgacgcctg gctaattttt gcatttttag tagagacggg 5281 ttttcaccat gttgccccgg ttgctctcaa actcctgacc tcaggtgatc cacccgcctc 5341 agcctcccaa agtgctggga ttacaggtgt gagtcacccc gcacagcctg aaatgaggca 5401 tctctatcta tagtccagca gccctacagg aggcaggagg ggagcaagaa taagaaagga 5461 aatttgtaaa aggcacttag gagtgagcag aaaggaaata ggaccagctt ttacctgccc 5521 agtcctggcc agtgacaagc agtctgcttg agtctgtgct aaataaacaa aggaagttcc 5581 atttagagct ctacagaggg gaagccatag aaattaacag gatgaaaata caagagacag 5641 gaacacagat gaataaatgt aataaaattt gagaaat // LOCUS AB002391 6067 bp mRNA PRI 23-JUN-1997 DEFINITION Human mRNA for KIAA0393 gene, complete cds. ACCESSION AB002391 NID g2224726 KEYWORDS KIAA0393. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG1266. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6067) AUTHORS Nagase,T., Ishikawa,K., Seki,N., Nakajima,D., Ohira,M., Miyajima,N., Kotani,H., Nomura,N. and Ohara,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro JOURNAL DNA Res. 4 (2), 141-150 (1997) MEDLINE 97349984 FEATURES Location/Qualifiers source 1..6067 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG1266" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 501..2357 /gene="KIAA0393" CDS 501..2357 /gene="KIAA0393" /codon_start=1 /db_xref="PID:d1021689" /db_xref="PID:g2224727" /translation="MHAFCVGQYLEPDQEGVTIPDLGSLSSPLIDTERNLGLLLGLHA SYLAMSTPLSPVEIECAKWLQSSIFSGGLQTSQIHYSYNEEKDEDHCSSPGGTPASKS RLCSHRRALGDHSQAFLQAIADNNIQDHNVKDFLCQIERYCRQCHLTTPIMFPPEHPV EEVGRLLLCCLLKHEDLGHVALSLVHAGALDIEQVKHRTLPKSVVDVCRVVYQAKCSL IKTHQEQGRSYKEVCAPVIKRLRFLFNELRPAVCNDLSIMSKFKLLSSLPHWRRIAQK IIREPRKKRVPKKPESTDDEEKIGNEESDLEEACILPHSPINVDKRPIAIKSPKDKWQ PLLSTVTDVHKYKWLKQNVQGLYPQSPLLSTIAEFALKEEPVDVEKRKCLLKQLERAE VRLEGIDTILKLYLVSKNFLLPSVPYAMFCGWQRLIPEGIDIGEPLTDCLKDVDLIPP FNRMLLEVTFGKLYAWAVQNIRNVLVDASAKFKELGIQPVPLQTITNENPSGPSLGTI PQAHFLLVMLSMLTLQHSANNLDLLLNSGTLALAQTALRLIGPSCDSVEEDMNASAQG ASATVLEETRKETAPVQLPVSGPELAAMMKIGTRVMRGVDWKWGDQMGLLQA" BASE COUNT 1556 a 1459 c 1556 g 1496 t ORIGIN 1 tggaatgaaa tggttaaaga tggagaaatt gtatacactg gaacagaatc aacccagaac 61 ggatagctcc ctcctggaaa agcttttgtc tggtcctctg agccccagtg agagtttcct 121 gaggtacctc acccttccac aagacaacag gcttgccatt gatctgcaac aaacggcggt 181 tgttgtcatg gcccatttag accgtctggc tacaccctgt agatgcctcc tctgtgtagc 241 tctccgacgt ctcataagag tcatttttta caggtcagaa ctgtagaaat aatgagaaag 301 tgacatttgt acgcatagct gatttggaga accataataa cgatggaggc ttctggactg 361 tgattgacgg gaaagtgtat gatataaagg acttccagac acagtcgtta acagaaaata 421 gtattcttgc tcagtttgca ggggaagacc cagtggtagc tttggaagct gctttgcagt 481 ttgaagacac ccgggaatcc atgcacgcat tttgtgttgg ccagtatttg gagcctgacc 541 aagaaggcgt caccatacca gatctgggga gtctctcctc acctctgata gacacagaga 601 ggaatctggg cctgcttctc ggattacacg cttcctattt agcaatgagc acaccgctgt 661 ctcctgtcga gattgaatgt gccaaatggc ttcagtcatc catcttctct ggaggcctgc 721 agaccagcca gatccactac agctacaacg aggagaaaga cgaggaccac tgcagctccc 781 cagggggcac acctgccagc aaatctcgac tctgctccca cagacgggcc ctgggggacc 841 attcccaggc atttctgcaa gccattgcag acaacaacat tcaggatcac aacgtgaagg 901 actttttgtg tcaaatagaa aggtactgta ggcagtgcca tttgaccaca ccgatcatgt 961 ttccccccga gcatcccgtg gaagaggtcg gtcgcttgct gttatgttgc ctcttaaaac 1021 atgaagattt aggtcatgtg gcattatctt tagttcatgc aggtgcactt gatattgagc 1081 aagtaaagca cagaacgttg cctaagtcag tggtggatgt ttgtagagtt gtctaccaag 1141 caaaatgttc gctcattaag actcatcaag aacagggccg ttcttacaag gaggtctgcg 1201 ctcctgtcat caaacgtttg agattcctct ttaatgaatt gagacctgct gtttgtaatg 1261 acctctctat aatgtctaag tttaaattgt taagttcttt gccccattgg aggaggatag 1321 ctcagaagat aattcgagaa ccaaggaaaa agagagttcc taagaagcca gaatctacgg 1381 atgatgaaga aaaaattgga aacgaagaga gtgatttaga agaagcttgc attttgcctc 1441 atagtccaat aaatgtggac aagagaccca ttgcaattaa atcacccaag gacaaatggc 1501 agccgctgtt gagtactgtt acagatgttc acaaatacaa gtggttgaag cagaatgtgc 1561 agggtcttta tccgcagtct ccactcctca gtacaattgc tgaatttgcc cttaaagaag 1621 agccagtgga tgtggaaaag agaaagtgcc tactaaaaca gttggagaga gcagaggttc 1681 gcctggaagg gatagataca attttaaaat tgtatctggt gagcaagaat ttcttacttc 1741 catctgtgcc gtatgcgatg ttttgtggat ggcaaagact tattcctgag ggaatcgata 1801 taggggaacc tcttactgat tgtttaaagg atgttgattt gatcccgcct tttaatcgga 1861 tgctgctgga agtcaccttt ggcaagctgt acgcttgggc tgttcagaac attcgaaatg 1921 ttttggtgga tgccagtgcc aaatttaaag agcttggtat ccagccggtt cccctgcaaa 1981 ccatcaccaa tgagaaccca tcgggaccga gcctggggac catcccgcaa gcccacttcc 2041 tcctggtgat gctcagcatg ctcaccctgc agcacagcgc aaacaacctt gacctcctgc 2101 tcaattccgg cacgctggcc ctcgctcaga cggcactgcg cctgattggc cccagttgtg 2161 acagcgttga ggaagatatg aatgcttctg cccaaggtgc ttctgccaca gttttggaag 2221 aaacaaggaa ggaaacggct cctgtgcagc tccctgtttc agggccagaa ctggctgcca 2281 tgatgaagat tggaacaagg gtcatgagag gtgtggactg gaaatggggc gatcagatgg 2341 gcctcctcca ggcctaggcc gagtgattgg tgagctggga gaggacgggt ggataagagt 2401 ccagtgggac acaggcagca ccaactccta caggatgggg aaagaaggaa aatacgacct 2461 caagctggca gagctgccag cccctgcaca gccctcagca gaggattcgg acacagagga 2521 cgactctgaa gccgaacaaa ctgaaaggaa cattcacccc actgcaatga tgtttaccag 2581 cactattaac ttactgcaga ctctttgtct gtctgctgga gttcatgctg agatcatgca 2641 gagtgaagcc accaagactt tatgcggact gctgcgaatg ttagtggaaa gcggaacgac 2701 ggacaagaca tcttctccaa acaggctggt gtacagggag caacaccgga gctggtgcac 2761 gctggggttt gtgcagagca tcgctctcac gctgcaggtg tgcggcgccc tcagctcccc 2821 gcagtggatc acgctgctca tgaaggttgt ggaagggcac gcacccttca ctgccacctc 2881 gctgcagagg cagatcttag ctgtgcattt gttgcaagca gtccttccgt catgggacaa 2941 gaccgaaagg gtgagggaca tgaaatgcct catggagaag ctgtttgact tcttggggag 3001 cttgctcact atgtgctcct ctgacgtgcc gttactcaga ggtgggtggc cgtctccctt 3061 ccctgtaccc tggtgaagag cggtgcagtg ccgtcactca gaggtgggtg gccgtctccc 3121 ttccctgtgt cctggtgaag agcggtgcag tgccgtcact cagaggtggg tggccgtctc 3181 ccttccctgt accctggtga agagcggtgc agtgccgtca ctcagaggtg ggtggccgtc 3241 tcccttccct gtgtcctggt gaagagccgt gtagtgccgt cactcagagg tgagtggccg 3301 tctcccttcc ctgtgccctg gtgaagagtg gtgcagcagc ttctcccctg gtttcctcct 3361 cagagtccac gctgaggcgg cgcagggtgt gcccgcaggc ctcgctgact gccacccaca 3421 gcagcacact ggcggaggag gtggtggcac tgctgcacac gctgcactcc ctgactcggt 3481 ggaatgggtt catcaacaag tacatcaact cccagctccg ctccatcacc cacagctttg 3541 cgggaaggcc ttccaaaggg gcccagttag atgactactt ccctgattcc gagaaccctg 3601 aagtgggggg cctcatggcg gtcctggctg tggttggagg catcgatggt cgcctgtgcc 3661 tgggcggcca agttgtgcac gatgactttg gagaagtcac catgactcgc atcaccctga 3721 agggcaaaat caccgtgcag ttctctgaca tgcggacgtg tcgcgtttgc ccattgaatc 3781 agctgaaacc actccctgcc gtggccttta atgtgaacaa cctgcccttc acagagccca 3841 tgctgtctgt ctgggctcag ttggtgaacc tcgctggaag caagttagaa aagcacaaaa 3901 taaagaaatc gactaaacag gcctttgcag gacaagtgga cctggacctg ctgcggcgcc 3961 agcagttgaa gctatacatc ctgaaagcag gtcgggcgct gttctcccac caggataaac 4021 tgcggcagat cctgtctcag ccagctgttc aggagactgg aactgttcac acagatgatg 4081 gagcagtggt atcacctgac cttggggaca tgtctcctga agggccgcag ccccccatga 4141 tcctcttgca gcagctgctg gcctcggcca cccagccgtc tcctgtgaag gccatatttg 4201 ataaacagga acttgagact gctgcactgg ccgttgtgga gtccactcac ccttcgagcc 4261 caggatttga agactgcagc tccagtgagg ccaccacgcc tgtcaacgtg cagcacatcc 4321 gccctgccag agtgaagagg cgcaagcagt cacccgttcc cgctctgccg atcgtggtgc 4381 agctcatgga gatgggattt cccagaagga acatcgagtt tgccctgaag tctctcactg 4441 gtgcttccgg gaatgcgtcc ggcttgcctg gtgtggaagc cttggtcggg tggctgctgg 4501 accactccga catacaggtc acggagctct cagatgcaga cacggtgtcc gacgagtatt 4561 ctgacgagga ggtggtggag gacatggatg atgccgccta ctccatgtct actggtgctg 4621 ttgtgacgga gagccagacg tacaaaaacc gagctggttt cttgggtaat gatgattatg 4681 ctgtatatgt gagagagaat attcaggtgg gaatgatggt tagatgctgc cgaacatacg 4741 aagaagtgtg cgaaggtgat gtgatgttgg caaagtcatc aagctggaca gagatggatt 4801 gcatgatctc aatgtgcagt gtgactggca gcagaaaggg ggcatctact ggtttaggta 4861 cattcatgtg gaacttatag gctatcctcc accaagaagt tcttctcaca tcaagattgg 4921 tgataaagtg cgggtcaaag cctctgtcac cacaccaaaa tacaaatggg gatctgtgac 4981 tcatcagagt gtgggggttg tgaaaggtgt gatggatgtc agatgtttcc tatcaatgga 5041 tccagattca aatgcagaaa ctgtgatgac tttgattttt gtgaaacgtg tttcaagacc 5101 aaaaaacaca ataccaggca tacatttggc agaataaatg aaccaggctt cagcagaaga 5161 aacacttcct ataaatctcg cccaaacagg aaaggagaag cattagggtg tccgactacg 5221 tgggtttcat agctgtggaa aagccaaagg ggagactcct gaagaaaggc ggtgaagact 5281 gtgaagagcg ggtcaggaag atgagcacag cactgctact cctgtgggca cagggacagc 5341 atgtctccag ccagcgccac cttgtttaat acatgggaac tcactgaaat tcattctgta 5401 ttttgcccgc aaagttttaa agctttcatc cacagtcagg aattaaactt ataccaatga 5461 gagcctcaca cattcaagga tgtactaagc actacaggcc tcacagaaac agagatccca 5521 tcttggagtt ttcagtacca catgggagat aaagggtttt gaacatgaaa tgacaaaaac 5581 aacagcaaga agaaaattct tgtccttttt cattactatc agactcaaat aaatgtcttg 5641 gctcttacat tacattcatt cttcaaccat tgtggtctgg cttccacttc cttcacttca 5701 ccaacatggc tctgccaaag gaagcccgtg atctctaggc catcacttta attgatcttt 5761 ctacaacatt tatcctggtt gttaagccct ccttacaaca ttcttctctc tttgttttta 5821 tagctccatc tctcctgctt ctttaacttg ataatgcata cttgattttt ctatttgtta 5881 tttcataaac caattaatac acagataaaa tgactgtata tcaaaccatg tttgtataga 5941 aaaaatggat tttggatgcc tctcatatgt aattagttct attaaacata ttaattgtat 6001 tgtttaattt gtcaggtttt tgacagaatt ttgtttacaa gtaataaaaa ttttatctcc 6061 aattttc // LOCUS AB002405 1376 bp mRNA PRI 08-JAN-1998 DEFINITION Homo sapiens mRNA for LAK-4p, complete cds. ACCESSION AB002405 NID g2760120 KEYWORDS LAK-4p. SOURCE Homo sapiens male lymphoid mLT expressing LAK cell cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1376) AUTHORS Abe,Y. and Takaoka,Y. TITLE LAK-4 clone from the membrane lymphotoxin expressing subtraction library JOURNAL Published Only in DataBase (1998) In press REFERENCE 2 (bases 1 to 1376) AUTHORS Abe,Y. and Takaoka,Y. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Yasuhito Abe, Ehime University School of Medicine, The Second Department of Surgery; Shigenobu, Onsen-gun, Ehime 791-02, Japan (E-mail:yasuhito@m.ehime-u.ac.jp, Tel:+81-89-964-5111, Fax:+81-89-960-5334) COMMENT Sequence updated (05-Jan-1998). FEATURES Location/Qualifiers source 1..1376 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="mLT expressing LAK cell" /sex="male" /tissue_type="lymphoid" CDS 110..1150 /note="Its enhancement of expression is related with T/LAK-cell-activation, unpublished data." /codon_start=1 /product="LAK-4p" /db_xref="PID:d1025089" /db_xref="PID:g2760121" /translation="MIQSPEAAGQEAVLLVLPLVVGLLNLGAPYLCRVLAALEPHDSP VLEVYVAICRNLILKLAILGTLCYHWLGRRVGRPAGPVLGGFCGPGAVPVPGDGLRPH VAGHAFWGTGVEDYLEKKLKRRRSQFDIARNVLELIYGQTLTWLGVLFSPFSPPCRSS SCCSSSMSRRPAFLANCQAPRRPWLASHMSTVFLTLLCFPAFLGAAVFLCYAVWQVKP SSTCGPFRTLDTMYEAGRVWVPHLEAAGPRVSWLPWVHRYLMENTFFVFLVSALLLAV IYLNIQVVRGQRKVICLLKEQISNEGEDKIFLINKLHSIYERKEREERSRVGTTEEAA APPALLTDEQDA" polyA_site 1376 /note="25 A nucleotides" BASE COUNT 228 a 417 c 441 g 290 t ORIGIN 1 ccacgcgtcc gggaggctgc ggcaggcggc tgtgctgggg cttgtgtggc tgctgtgtct 61 ggggaccgcg ctgggctgcg ccgtggccgt ccacgtcttc tcggagttca tgatccagag 121 tccagaggct gctggccagg aggctgtgct gctggtcctg cccctggtgg ttggcctcct 181 caacctgggg gccccctacc tgtgccgtgt cctggccgcc ctggagccgc atgactcccc 241 ggtactggag gtgtacgtgg ccatctgcag gaacctcatc ctcaagctgg ccatcctggg 301 gacactgtgc taccactggc tgggccgcag ggttgggcgt cctgcagggc cagtgctggg 361 aggattttgt gggccaggag ctgtaccggt tcctggtgat ggacttcgtc ctcatgttgc 421 tggacacgct ttttggggaa ctggtgtgga ggattatctc gagaagaagc tgaagaggag 481 gcgaagccag tttgacattg cccggaatgt cctggagctg atttatgggc agactctgac 541 ctggctgggg gtgctcttct cgcccttctc cccgccgtgc agatcatcaa gctgctgctc 601 gtcttctatg tcaagaagac cagccttctt ggccaactgc caggcgccgc gccggccctg 661 gctggcctca cacatgagca ccgtcttcct cacgctgctc tgcttccccg ccttcctggg 721 cgccgctgtc ttcctctgct acgccgtctg gcaggtgaag ccctcgagca cctgcggccc 781 cttccggacc ctggacacca tgtacgaggc cggcagggtg tgggtgcccc acctggaggc 841 ggcaggcccc agggtctcct ggctgccctg ggtgcaccgg tacctgatgg aaaacacctt 901 ctttgtcttc ctggtgtcag ccctgctgct ggccgtgatc tacctcaaca tccaggtggt 961 gcgaggccag cgcaaggtca tctgcctgct caaggagcag atcagcaatg agggtgagga 1021 caaaatcttc ttaatcaaca agcttcactc catctacgag aggaaggaga gggaggagag 1081 gagcagggtt gggacaaccg aggaggctgc ggcaccccct gccctgctca cagatgaaca 1141 ggatgcctag ggggacggcg atgggcctca cgggcccgcc cagcaccctg agaccacact 1201 gttgcctccc agtgaccctg ctgggacacc aggacaagga agacagtttc gcctctcgaa 1261 agccgcagtg cgcctaggct ggagctggaa gggtgggtga atccggcttg ggcatcccca 1321 atgaactctg ccctgcctgg gactctattt attctgatta aaggggtttt gcaaat // LOCUS AB002409 852 bp mRNA PRI 15-AUG-1997 DEFINITION Homo sapiens mRNA for SLC, complete cds. ACCESSION AB002409 NID g2335034 KEYWORDS SLC; mature ELC. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 852) AUTHORS Nomiyama,H. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) to the DDBJ/EMBL/GenBank databases. Hisayuki Nomiyama, Kumamoto University Medical School, Department of Biochemistry; Honjo 2-2-1, Kumamoto, Kumamoto 860, Japan (E-mail:nomiyama@gpo.kumamoto-u.ac.jp, Tel:81-96-373-5063, Fax:81-96-372-6140) REFERENCE 2 (bases 1 to 852) AUTHORS Nagira,M., Imai,T., Hieshima,K., Kusuda,J., Ridanpaa,M., Takagi,S., Nishimura,M., Kakizaki,M., Nomiyama,H. and Yoshie,O. TITLE Molecular Cloning of a Novel Human CC Chemokine Secondary Lymphoid-Tissue Chemokine (SLC) That is an Efficient Chemoattractant for Lymphocytes and Mapped to Chromosome 9p13 JOURNAL Unpublished (1997) FEATURES Location/Qualifiers source 1..852 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 59..463 /codon_start=1 /product="SLC" /db_xref="PID:d1022673" /db_xref="PID:g2335035" /translation="MAQSLALSLLILVLAFGIPRTQGSDGGAQDCCLKYSQRKIPAKV VRSYRKQEPSLGCSIPAILFLPRKRSQAELCADPKELWVQQLMQHLDKTPSPQKPAQG CRKDRGASKTGKKGKGSKGCKRTERSQTPKGP" mat_peptide <107..460 /product="mature ELC" polyA_site 823..828 BASE COUNT 205 a 279 c 217 g 151 t ORIGIN 1 cttgcagctg cccacctcac cctcagctct ggcctcttac tcaccctcta ccacagacat 61 ggctcagtca ctggctctga gcctccttat cctggttctg gcctttggca tccccaggac 121 ccaaggcagt gatggagggg ctcaggactg ttgcctcaag tacagccaaa ggaagattcc 181 cgccaaggtt gtccgcagct accggaagca ggaaccaagc ttaggctgct ccatcccagc 241 tatcctgttc ttgccccgca agcgctctca ggcagagcta tgtgcagacc caaaggagct 301 ctgggtgcag cagctgatgc agcatctgga caagacacca tccccacaga aaccagccca 361 gggctgcagg aaggacaggg gggcctccaa gactggcaag aaaggaaagg gctccaaagg 421 ctgcaagagg actgagcggt cacagacccc taaagggcca tagcccagtg agcagcctgg 481 agccctggag accccaccag cctcaccaac gcttgaagcc tgaacccaag atgcaagaag 541 gaggctatgc tcaggggccc tggagcagcc accccatgct ggccttgcca cactctttct 601 cctgctttaa ccaccccatc tgcattccca gctctaccct gcatggctga gctgcccaca 661 gcaggccagg tccagagaga ccgaggaggg agagtctccc agggagcatg agaggaggca 721 gcaggactgt ccccttgaag gagaatcatc aggaccctgg acctgatacg gctccccagt 781 acaccccacc tcttccttgt aaatatgatt tatacctaac tgaataaaaa gctgttctgt 841 cttcccaccc gc // LOCUS AB002533 2171 bp mRNA PRI 08-APR-1997 DEFINITION Human mRNA for Qip1, complete cds. ACCESSION AB002533 NID g1944124 KEYWORDS Qip1. SOURCE Homo sapiens cell_line:HeLa S3 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2171) AUTHORS Seki,T. TITLE Direct Submission JOURNAL Submitted (31-MAR-1997) to the DDBJ/EMBL/GenBank databases. Takahiko Seki, Tohoku University, Faculty of Pharmaceutical Sciences; Aoba Aramaki, Aoba-ku, Sendai, Miyagi 980-77, Japan (E-mail:taka@phi2.pharm.tohoku.ac.jp, Tel:81-22-217-6876, Fax:81-22-217-6873) REFERENCE 2 (sites) AUTHORS Seki,T., Tada,S., Katada,T. and Enomoto,T. TITLE Cloning of a cDNA encoding a novel importin-alpha homologue, Qip1: disc rimination of Qip1 and Rch1 from hSrp1 by their ability to interact with DNA hel icase Q1/RecQL JOURNAL Biochem. Biophys. Res. Commun. (1997) In press FEATURES Location/Qualifiers source 1..2171 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa S3" gene 10..1575 /gene="QIP1" CDS 10..1575 /gene="QIP1" /codon_start=1 /product="Qip1" /db_xref="PID:d1020322" /db_xref="PID:g1944125" /translation="MADNEKLDNQRLKNFKNKGRDLETMRRQRNEVVVELRKNKRDEH LLKRRNVPHEDICEDSDIDGDYRVQNTSLEAIVQNASSDNQGIQLSAVQAARKLLSSD RNPPIDDLIKSGILPILVHCLERDDNPSLQFEAAWALTNIASGTSEQTQAVVQSNAVP LFLRLLHSPHQNVCEQAVWALGNIIGDGPQCRDYVISLGVVKPLLSFISPSIPITFLR NVTWVMVNLCRHKDPPPPMETIQEILPALCVLIHHTDVNILVDTVWALSYLTDAGNEQ IQMVIDSGIVPHLVPLLSHQEVKVQTAALRAVGNIVTGTDEQTQVVLNCDALSHFPAL LTHPKEKINKEAVWFLSNITAGNQQQVQAVIDANLVPMIIHLLDKGDFGTQKEAAWAI SNLTISGRKDQVAYLIQQNVIPPFCNLLTVKDAQVVQVVLDGLSNILKMAEDEAETIG NLIEECGGLEKIEQLQNHENEDIYKLAYEIIDQFFSSDDIDEDPSLVPEAIQGGTFGF NSSANVPTEGFQF" polyA_site 2171 /note="32 A nucleotides" BASE COUNT 685 a 450 c 477 g 559 t ORIGIN 1 gcacgagcca tggcggacaa cgagaaactg gacaaccaac ggctcaagaa tttcaagaac 61 aaaggccgcg acttggagac tatgagaaga caacgaaatg aagttgtagt tgaattaagg 121 aagaataaaa gagatgaaca tctcttaaag agaaggaatg taccacatga agatatctgt 181 gaagactctg atatagatgg tgattataga gtgcaaaata cctctctaga agctattgtt 241 caaaatgctt caagtgataa ccaaggaatt caattaagtg cagttcaagc tgctaggaag 301 cttttgtcca gtgatcgaaa tccaccaatt gatgacttaa taaaatctgg aatattgccc 361 attttagtcc attgtcttga aagagatgac aatccttctt tacagtttga agctgcatgg 421 gctttgacaa acattgcatc tggaacttct gaacaaactc aagcagtagt tcagtccaat 481 gctgtgccac ttttcctgag gcttctccat tcaccccatc agaatgtctg tgagcaagca 541 gtgtgggcat tgggaaatat cataggtgat gggccccagt gtagagatta tgtcataagt 601 cttggagttg tgaaaccttt actttccttc ataagtccat ctattcctat aacattctta 661 agaaatgtta cttgggttat ggtcaactta tgtcgccaca aagacccacc accaccaatg 721 gaaaccattc aggagattct tccagccctt tgtgttttaa ttcatcacac agatgtaaat 781 atactggtag acacagtctg ggccctctct taccttactg atgctggcaa tgaacaaata 841 cagatggtaa tagactctgg aatagttcct catttggttc ctctgctcag ccaccaggaa 901 gttaaagttc agactgctgc acttagagct gtgggcaaca ttgttactgg aactgatgag 961 caaacacaag tagttttgaa ctgtgatgct ctttcacact tcccagcact cctgacacat 1021 cccaaagaga aaattaataa agaagcagtg tggttcctct ccaacatcac tgcaggaaat 1081 cagcagcagg tacaggcagt aattgatgcc aatcttgtac caatgataat acaccttttg 1141 gataaggggg attttggcac tcaaaaagaa gctgcttggg ccataagtaa cttaacaatt 1201 agtggaagga aagatcaagt ggcttacctt atccaacaaa atgttatccc acctttttgc 1261 aacttgctga ctgtaaaaga tgcacaagtt gtgcaagtag tactcgatgg actaagtaat 1321 atattaaaaa tggctgaaga tgaggcagaa accataggca atcttataga agaatgtgga 1381 gggctggaga aaattgaaca acttcaaaat catgaaaatg aagacatcta caaattggcc 1441 tatgagatca ttgatcagtt cttctcttca gatgatattg atgaagaccc tagccttgtt 1501 ccagaggcaa ttcaaggcgg aacatttggt ttcaattcat ctgccaatgt accaacagaa 1561 gggttccagt tttagaaaga tgttgtggaa gttaggtaca atgcagcact gagatatata 1621 tatatatatg tgtgtgtgta tatatatata tatatacata tatataaaaa ggtttgatcc 1681 atcaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaactc gtgccgaatt cggcacgagc 1741 acgcgtgaga cttctccgcc gcctccgccg cagacgccgc cgcgatgcgc tacgtcgcct 1801 cctacctgct ggctgcccta gggggcaact cctcccccag cgccaaggac atcaagaaga 1861 tcttggacag cgtgggtatc gaggcggacg acgaccggct caacaaggtt atcagtgagc 1921 tgaatggaaa aaacattgaa gacgtcattg cccagggtat tggcaagctt gccagtgtac 1981 ctgctggtgg ggctgtagcc gtctctgctg ccccaggctc tgcagcccct gctgctggtt 2041 ctgcccctgc tgcagcagag gagaagaaag atgagaagaa ggaggagtct gaagagtcag 2101 atgatgacat gggatttggc ctttttgatt aaattcctgc tcccctgcaa ataaagcctt 2161 tttacacatc t // LOCUS AB002559 1867 bp mRNA PRI 08-APR-1997 DEFINITION Human mRNA for hunc18b2, complete cds. ACCESSION AB002559 NID g1944129 KEYWORDS hunc18b2. SOURCE Homo sapiens male lymphoid LAK cells cDNA to mRNA, clone_lib:LAK subtraction library. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1867) AUTHORS Abe,Y. TITLE Direct Submission JOURNAL Submitted (01-APR-1997) to the DDBJ/EMBL/GenBank databases. Yasuhito Abe, Ehime University School of Medicine, The Second Department of Surgery; Shitsukawa, Shigenobu, Ehime 791-02, Japan (E-mail:yasuhito@m.ehime-u.ac.jp, Tel:+81-89-964-5111, Fax:+81-89-960-5334) REFERENCE 2 (sites) AUTHORS Abe,Y. and Takaoka,Y. TITLE hunc18b2 from LAK subtraction library JOURNAL Unpublished (1997) FEATURES Location/Qualifiers source 1..1867 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="LAK cells" /clone_lib="LAK subtraction library" /sex="male" /tissue_type="lymphoid" CDS 25..1764 /function="placuble syntaxin binding protein" /note="putative alternatively spliced form of gbIU63533IHSU63533" /codon_start=1 /product="hunc18b2" /db_xref="PID:d1020323" /db_xref="PID:g1944130" /translation="MAPSGLKAVVGEKILSGVIRSVKKDGEWKVLIMDHPSMRILSSC CKMSDILAEGITIVEDINKRREPIPSLEAIYLLSPTEKSVQALIKDFQGTPTFTYKAA HIFFTDTCPEPLFSELGRSRLAKVVKTLKEIHLAFLPYEAQVFSLDAPHSTYNLYCPF RAEERTRQLEVLAQQIATLCATLQEYPAIRYRKGPEDTAQLAHAVLAKLNAFKADTPS LGEGPEKTRSQLLIMDRAADPVSPLLHELTFQAMAYDLLDIEQDTYRYETTGLSEARE KAVLLDEDDDLWVELRHMHIADVSKKVTELLRTFCESKRLTTDKANIKDLSQILKKMP QYQKELNKYSTHLHLADDCMKHFKGSVEKLCSVEQDLAMGSDAEGEKIKDSMKLIVPV LLDAAVPAYDKIRVLLLYILLRNGVSEENLAKLIQHANVQAHSSLIRNLEQLGGTVTN PGGSGTSSRLEPRERMEPTYQLSRWTPVIKDVMEDAVEDRLDRNLWPFVSDPAPTASS QAAVSARFGHWHKNKAGVEARAGPRLIVYVMGGVAMSEMRAAYEVTRATEGSGRCSLA PHTSSPRPASWMT" BASE COUNT 405 a 576 c 554 g 331 t 1 others ORIGIN 1 gcggccgctc gcccctcggg gaatatggcg ccctcggggc tgaaggcggt ggtgggggaa 61 aaaattctga gcggagttat tcggagtgtc aagaaggatg gggagtggaa ggtgcttatc 121 atggatcacc caagcatgcg catcttgtct tcctgctgca aaatgtcaga tatcctggct 181 gagggcatca ccattgttga agacatcaac aaacggcggg aacccattcc cagtctggag 241 gccatttatt tgctgagccc cacggagaag tcggttcagg ccctgatcaa agacttccag 301 gggaccccga ctttcaccta caaagcggcc catatcttct tcaccgacac ctgccccgag 361 cccctgttca gtgagctagg ccgctctcgt ctggcaaagg tggtgaagac gttgaaggag 421 attcaccttg ccttcctccc ctacgaggcc caggtgttct ccctcgatgc tccccacagc 481 acctacaacc tctactgccc cttccgggca gaggagcgca cgcggcagct cgaggtgctg 541 gcccagcaga ttgccacgct gtgcgccacc ctgcaggagt acccggccat ccgctaccgc 601 aagggcccag aggacacagc ccagttggcc cacgccgtcc tggccaagct gaacgccttc 661 aaggcagaca ctcccagtct gggcgagggc ccagagaaaa cccgctccca gctgctgata 721 atggaccggg cagctgaccc cgtgtcccca ctactgcatg agctcacgtt ccaggccatg 781 gcgtatgatc tgctggacat agagcaggac acatacaggt atgagaccac cgggctgagc 841 gaggcgcggg agaaggccgt cttgctggac gaggacgatg acttgtgggt ggagcttcgc 901 cacatgcata tcgcagatgt gtccaagaag gtcacggagc tcctgaggac cttctgtgag 961 agcaagaggc tgaccacgga caaggcgaac atcaaagacc tatcccagat cctgaaaaag 1021 atgccgcagt accagaagga gctgaataag tattctacgc acctgcatct agcagatgat 1081 tgtatgaagc acttcaaggg ctcggtggag aagctgtgta gtgtggagca ggacctggcc 1141 atgggctccg acgcagaggg ggagaagatc aaggactcca tgaagctgat cgttccggtg 1201 ctgctggacg cggcggtgcc cgcctacgac aagatccggg tcctgctgct ctacatcctc 1261 cttcggaatg gtgtgagtga ggagaacctg gccaagctga tccagcatgc caatgtacag 1321 gcgcacagca gcctcatccg taacctggag cagctgggag gcactgtcac caaccccggg 1381 ggctcgggga cctccagccg gctggagccg agagaacgca tggagcccac ctatcagctg 1441 tcccgctgga ccccggtcat caaggatgta atggaggacg ccgtggagga ccggctggac 1501 aggaacctgt ggcccttcgt atccgacccc gcccccacgg ccagctccca ggccgctgtc 1561 agtgcccgct tcggtcactg gcacaagaac aaggctggcg tagaagcccg ggcgggcccc 1621 cggctcatcg tgtatgtcat gggcggtgtg gccatgtcag agatgagggc cgcctacgag 1681 gtgaccaggg ccaccgaggg aagtgggagg tgctcattgg ctcctcacac atcctcaccc 1741 cgacccgctt cctggatgac ctgaaggcac tggacaagaa gctggaggac attgccctgc 1801 cctgacccct gccccgcccc ctacccctcc ctttccagag aaataaactc ttcccgtcnt 1861 ctgccac // LOCUS AB002804 5810 bp mRNA PRI 15-APR-1997 DEFINITION Human mRNA for hSLK, complete cds. ACCESSION AB002804 NID g1944184 KEYWORDS hSLK. SOURCE Homo sapiens cell_line:A549 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5810) AUTHORS Yamada,E. TITLE Direct Submission JOURNAL Submitted (08-APR-1997) to the DDBJ/EMBL/GenBank databases. Eitaro Yamada, Faculty of Pharmaceutical Sciences, Osaka University, Immunology; Yamadaoka 1-6, Suita, Osaka 565, Japan (E-mail:e-yamada@phs.osaka-u.ac.jp, Tel:+81-6-879-8192, Fax:+81-6-879-8194) REFERENCE 2 (sites) AUTHORS Yamada,E., Kameda,Y., Itoh,S., Kohama,Y., Yamamoto,H. and Tsujikawa,K. TITLE Human STE20-like kinase (hSLK) JOURNAL Unpublished (1997) FEATURES Location/Qualifiers source 1..5810 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="A549" CDS 233..3847 /function="serine/threonine kinase" /codon_start=1 /product="hSLK" /db_xref="PID:d1020432" /db_xref="PID:g1944185" /translation="MSFFNFRKIFKLGSEKKKKQYEHVKRDLNPEDFWEIIGELGDGA FGKVYKAQNKETSVLAAAKVIDTKSEEELEDYMVEIDILASCDHPNIVKLLDAFYYEN NLWILIEFCAGGAVDAVMLELERPLTESQIQVVCKQTLDALNYLHDNKIIHRDLKAGN ILFTLDGDIKLADFGVSAKNTRTIQRRDSFIGTPYWMAPEVVMCETSKDRPYDYKADV WSLGITLIEMAEIEPPHHELNPMRVLLKIAKSEPPTLAQPSRWSSNFKDFLKKCLEKN VDARWTTSQLLQHPFVTVDSNKPIRELIAEAKAEVTEEVEDGKEEDEEEETENSLPIP ASKRASSDLSIASSEEDKLSQNACILESVSEKTERSNSEDKLNSKILNEKPTTDEPEK AVEDINEHITDAQLEAMTELHDRTAVIKENEREKRPKLENLPDTEDQETVDINSVSEG KENNIMITLETNIEHNLKSEEEKDQEKQQMFENKLIKSEEIKDTILQTVDLVSQETGE KEANIQAVDSEVGLTKEDTQEKLGEDDKTQKDVISNTSDVIGTCEAADVAQKVDEDSA EDTQSNDGKEVVEVGQKLINKPMVGPEAGGTKEVPIKEIVEMNEIEEGKNKEQAINSS ENIMDINEEPGTTEGEEITESSSTEEMEVRSVVADTDQKALGSEVQDASKVTTQIDKE KKEIPVSIKKEPEVTVVSQPTEPQPVLIPSININSDSGENKEEIGSLSKTETILPPES ENPKENDNDSGTGSTADTSSIDLNLSISSFLSKTKDSGSISLQETRRQKKTLKKTRKF IVDGVEVSVTTSKIVTDSDSKTEELRFLRRQELRELRFLQKEEQRAQQQLNSKLQQQR EQIFRRFEQEMMSKKRQYDQEIENLEKQQKQTIERLEQEHTNRLRDEAKRIKGEQEKE LSKFQNMLKNRKKEEQEFVQKQQQELDGSLKKIIQQQKAELANIERECLNNKQQLMRA REAAIWELEERHLQEKHQLLKQQLKDQYFMQRHQLLKRHEKETEQMQRYNQRLIEELK NRQTQERARLPKIQRSEAKTRMAMFKKSLRINSTATPDQDRDKIKQFAAQEEKRQKNE RMAQHQKHENQMRDLQLQCEANVRELHQLQNEKCHLLVEHETQKLKELDEEHSQELKE WREKLRPRKKTLEEEFARKLQEQEVFFKMTGESECLNPSTQSRISKFYPIPSLHSTGS " polyA_site 5810 /note="25 A nucleotides" BASE COUNT 2035 a 958 c 1239 g 1578 t ORIGIN 1 tttaagcata ttagtcagcg gaggagaaga aactaaccag gattccctca gtaacggcga 61 gtgaacaggg aagagcccag cgccgaatcc ccgccccgcg cggcgagctg cgggggccga 121 gggacgccgc gcccgccgcc gccagccggc tcgcgctgga gcagggacag agaaactttg 181 ccttttattg tttttagtcc ttaagtgcaa ggactctgtg ttgggaggaa aaatgtcctt 241 cttcaatttc cgtaagatct tcaagttggg gagcgagaag aagaagaagc agtacgaaca 301 cgtgaagagg gacctgaacc ccgaagactt ttgggagatt ataggagaac tgggcgacgg 361 agcctttggg aaagtgtaca aggcccagaa taaagagacc agtgttttag ctgctgcaaa 421 agtgattgac actaaatctg aagaagaact tgaagattac atggtagaga ttgacatatt 481 agcatcttgt gatcacccaa atatagtcaa gcttctagat gccttctatt atgagaacaa 541 tctttggatc ctcattgaat tttgtgcagg tggagcagta gatgctgtga tgcttgaact 601 tgagagacca ttaactgagt cccaaataca agtagtttgc aagcagactt tagatgcatt 661 gaactactta catgataata agatcatcca cagagatctg aaggctggca acattctctt 721 taccttagat ggagatatca aattggcgga ttttggagta tcagctaaaa acacgaggac 781 aattcaaaga agagattcct ttattggtac accatattgg atggctcctg aagtagtcat 841 gtgtgaaaca tctaaggaca gaccctatga ctacaaagct gatgtttggt ccctgggtat 901 cactttaata gaaatggctg agatagaacc acctcatcat gaattaaatc caatgcgagt 961 gctgctaaaa atagcaaaat ctgagccacc tacattagca cagccatcca gatggtcttc 1021 aaattttaag gactttctaa agaaatgctt agaaaagaat gtggatgcca ggtggactac 1081 atctcagctg ctgcagcatc cctttgttac tgttgattcc aacaaaccca tccgagaatt 1141 gattgcagag gcgaaggctg aagtaacaga agaagttgaa gatggcaaag aggaagatga 1201 agaggaggaa acagaaaatt ctctgccaat acctgcaagt aagcgtgcat cttctgacct 1261 tagtatcgcc agctctgaag aagataaact ttcacaaaat gcttgtattt tggagtctgt 1321 ctcagaaaaa acagaacgta gtaactctga agataaactc aacagcaaaa ttcttaatga 1381 aaaacccacc actgatgaac ctgaaaaggc tgtggaggat attaatgaac atattaccga 1441 tgctcagtta gaagcaatga ctgaactcca tgacagaaca gcagtaatca aggagaatga 1501 aagagagaag aggcccaagc ttgaaaatct gcctgacaca gaagaccaag aaactgtgga 1561 cattaattca gtcagtgaag gaaaagagaa taatataatg ataaccttag aaacaaatat 1621 tgaacataat ctaaaatctg aggaagaaaa ggatcaggaa aagcaacaga tgtttgaaaa 1681 taagcttata aaatctgaag aaattaaaga tactattttg caaacagtag atttagtttc 1741 tcaagagact ggagaaaaag aggcaaatat tcaggcagtt gatagtgaag ttgggcttac 1801 aaaggaagac acccaagaga aattggggga agacgacaaa actcaaaaag atgtgattag 1861 caatacaagt gatgtgatag gaacatgtga ggcagcagat gtggctcaga aagtggatga 1921 agacagtgct gaggatacgc agagtaatga tgggaaagaa gtggtcgaag taggccagaa 1981 attaattaat aagcccatgg tgggtcctga ggctggtggt actaaggaag ttcctattaa 2041 agaaatagtt gaaatgaatg aaatagaaga aggtaaaaat aaggaacaag caataaacag 2101 ttcagagaac ataatggaca tcaatgagga accaggaaca actgaaggtg aagaaatcac 2161 tgagtcaagt agcactgaag aaatggaggt cagaagtgtg gtggctgata ctgaccaaaa 2221 ggctttagga agtgaagttc aggatgcttc taaagtcact actcagatag ataaagagaa 2281 aaaagaaatt ccagtgtcaa ttaaaaaaga gcctgaagtt actgtagttt cacagcccac 2341 tgaacctcag cctgttctaa tacccagtat taatatcaac tctgacagtg gagaaaataa 2401 agaagaaata ggttctttat caaaaactga aactattctg ccaccagaat ctgagaatcc 2461 aaaggaaaat gataatgatt caggcactgg ttccactgct gatactagca gtattgactt 2521 gaatttatcc atctctagct ttctaagtaa aactaaagac agtggatcga tatctttaca 2581 agaaacaaga agacaaaaga aaacattgaa gaaaacacgc aaatttattg ttgatggtgt 2641 agaagtgagt gtaacaacat caaagatagt tacagatagt gattccaaaa ctgaagaatt 2701 gcggtttctt agacgtcagg aacttcggga attaagattt cttcagaaag aagagcaaag 2761 agcccaacaa cagctcaata gcaaactaca gcaacaacga gaacaaattt tccggcgctt 2821 tgagcaggaa atgatgagta aaaagcgaca atatgaccag gaaattgaga atctagaaaa 2881 acagcagaaa cagactatcg aacgcctgga acaagagcac acaaatcgct tgcgagatga 2941 agccaaacgc atcaaaggag aacaagagaa agagttgtcc aaatttcaga atatgctgaa 3001 gaaccgaaag aaggaggaac aagagtttgt tcagaaacaa cagcaagaat tagatggctc 3061 tctgaaaaag atcatccagc agcagaaggc agagttagct aatattgaga gagagtgcct 3121 gaataacaag caacagctca tgagagctcg agaagctgca atttgggagc tcgaagaacg 3181 acacttacaa gaaaaacacc agctgctcaa acagcagctt aaagatcagt atttcatgca 3241 aagacatcag ctacttaagc gccacgagaa ggaaacagag caaatgcagc gttacaatca 3301 aagacttatt gaggaattga aaaacagaca gactcaagaa agagcaagac tgcccaagat 3361 tcagcgcagt gaagccaaga ctcgaatggc catgtttaag aagagtttga gaattaactc 3421 aacagccaca ccagatcagg accgtgataa aattaaacag tttgctgcac aagaagaaaa 3481 gaggcagaaa aatgagagaa tggctcagca tcagaaacat gagaatcaaa tgcgagatct 3541 tcagttgcag tgtgaagcca atgtccgcga actgcatcag ctgcagaatg aaaaatgcca 3601 cttgttggtt gagcatgaga ctcagaaact gaaggagtta gatgaggaac atagccaaga 3661 attaaaggag tggagagaga aattgagacc taggaaaaag acactggaag aagagtttgc 3721 caggaaacta caggaacagg aagtattctt taaaatgact ggggagtctg aatgccttaa 3781 cccatcaaca cagagccgga tttccaaatt ttatcctatt cccagcttgc attccaccgg 3841 atcataacaa agggaagcat tctgtgcgtg ggtttggctc tttcagtatg tcattctgtt 3901 ctcatcttct gccacagtct ctcagatagc tcatgaagac aatcacctgc ctcaccttct 3961 aggtgttttc cttttttgtt ttttttgttt tgttttgttt ttaagccaaa gatgaaggga 4021 aaacgaacta agacagacgt taggccatgt tggcaaagta gcatcttggt gactaaggtg 4081 actttgtata ttcatcttaa aaattatgtt ctttagacac tgctacctga aaactgttgg 4141 agaaataatg tttaaagtta tttaagaaaa actgttacat cactaagtat taataaattc 4201 ttcttacctg acgtaacttc tcaatgccta aattctgtag ttgaagctct gctgcagaga 4261 gttgggataa ttttcttttg gtggatcagc tctcataaaa aagctatgat ttgctcaaat 4321 atgctgattg actcagtaaa tgaatatatt tttttcttta aataggaaca acctctttta 4381 aaagagaaaa attatttcag tgatttgtca aaacgaatta cctcttttgc atgagctaat 4441 aattgagggt gctaattttc ttaagatagt gcctaaaaca ctaaatttca gtcaagtcgt 4501 aagtaggatt ttctttttga tcaacaggga caaaaactat ctttagaatt aaaaacatgg 4561 ttgttttgga atttttgctt ctcttacgtt tgatagcaat tttcatccta aaatacatgt 4621 acaaagtttg gaaagatgaa aaaaagaggt agcttttaga ttgcaaattg gaaatgttaa 4681 aactcatgaa atttaagcaa tataggttta gctatctgtg tttattttct aaaataatac 4741 ctgagctggt taaatgattt ctctccatct tagctaattc tgtttaaaac tctgtcagag 4801 gcctgcaggc tgtgagttat atttataaat atatcttcag aaattaatct taaaagaggc 4861 attagttcag aatacttttt taaaagttta aattaaatat ttaggcacgt cagaaattac 4921 ttttccttat tttgaaatga ggctacttat gtcttggttt tattttgttc catgtttaaa 4981 tcattcactt tgatttgagt gggaaaagcc tgaagccttt atcatgtggt tgctggtgtg 5041 tgtaattatt aatgaaatgt tcactcctag tcccttatga ggcttagaat ttcaaccacg 5101 tgtcaggtca gacagtatta taaagtgtac tttgtgtctg agacagcaca tttgtgaatg 5161 atgcttgctg cctgccattt tcaacctatt ctctcttaag agtgctaggt accaaattgt 5221 gaaagtttgt ttgttttcag ttatattact tttgaggctg gtggaaaaat ttaaatgtaa 5281 ctttgtggga acactgattc atatttagaa aatgtaaatg tctgtagcac tttcttgcag 5341 ttaatttgaa aactttggat gctgaacctt gtttgtcagt gatttagatg atttaaaaat 5401 gcatgtgtga tttgaatttt ataattgttt tgacaagcat aatttacttg gacaacttcg 5461 taggtagcct taacttctgg ccaagtttgt tttttatata aatatatata catatataca 5521 tattatgtat ggttgtaaat tcatacactt atcacatgaa tgtgttactg tatacaaaac 5581 tcttaatgct ttattctcaa atgctgggtt gaaaaatgtt ttgaaagcct tttaaaatat 5641 atatctttat aaagtaatat tcaggatgat gataaaaatt gtttatattg ttatgataaa 5701 aatgacagta taatgttgcc cagtgtattt aatgatttat ttgaagaggg attgggaagg 5761 aacgtcttca taccactttc tcctttgaaa ttttcagttt attgactttt // LOCUS AB002806 2690 bp mRNA PRI 21-JAN-1998 DEFINITION Homo sapiens OS-9 mRNA, complete cds. ACCESSION AB002806 NID g2780782 KEYWORDS OS-9. SOURCE Homo sapiens cell_line:HL-60 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Kimura,Y., Nakazawa,M., Tsuchiya,N., Asakawa,S., Shimizu,N. and Yamada,M. TITLE Genomic organization of the OS-9 gene amprified in human sarcomas JOURNAL J. Biochem. 112, 1190-1195 (1997) REFERENCE 2 (sites) AUTHORS Kimura,Y., Nakazawa,M. and Yamada,M. TITLE Cloning and characterization of three isoforms of OS-9 cDNA and its expression in various human tumor cell lines JOURNAL J. Biochem. (1998) In press REFERENCE 3 (bases 1 to 2690) AUTHORS Yamada,M. TITLE Direct Submission JOURNAL Submitted (08-APR-1997) to the DDBJ/EMBL/GenBank databases. Michiyuki Yamada, Yokohama City University, Graduate School of Integrated Science; 22-2 Seto Kanazawa-ku, Yokohama, Kanagawa 236, Japan (E-mail:myamada@yokohama-cu.ac.jp, Tel:+85-45-787-2214, Fax:+85-45-787-2370) FEATURES Location/Qualifiers source 1..2690 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HL-60" /chromosome="12" /map="12q13" exon 1..203 /number=1 gene 42..2045 /gene="OS-9" CDS 42..2045 /gene="OS-9" /note="OS-9 isoform 2 is missing nt 1642-1806; OS-9 isoform 3 is missing nt 1407-1451 and nt1642-1806." /codon_start=1 /db_xref="PID:d1025275" /db_xref="PID:g2780783" /translation="MAAETLLSSLLGLLLLGLLLPASLTGGVGSLNLEELSEMRYGIE ILPLPVMGGQSQSSDVVIVSSKYKQRYECRLPAGAIHFQREREEETPAYQGPGIPELL SPMRDAPCLLKTKDWWTYEFCYGRHIQQYHMEDSEIKGEVLYLGYYQSAFDWDDETAK ASKQHRLKRYHSQTYGNGSKCDLNGRPREAEVRFLCDEGAGISGDYIDRVDEPLSCSY VLTIRTPRLCPHPLLRPPPSAAPQAILCHPSLQPEEYMAYVQRQADSKQYGDKIIEEL QDLGPQVWSETKSGVAPQKMAGASPTKDDSKDSDFWKMLNEPEDQAPGGEEVPAEEQD PSPEAADSASGAPNDFQNNVQVKVIRSPADLIRFIEELKGGTKKGKPNIGQEQPVDDA AEVPQREPEKERGDPERQREMEEEEDEDEDEDEDEDERQLLGEFEKELEGILLPSDRD RLRSEVKAGMERELENIIQETEKELDPDGLKKESERDRAMLALTSTLNKLIKRLEEKQ SPELVKKHKKKRVVPKKPPPSPQPTEEDPEHRVRVRVTKLRLGGPNQDLTVLEMKREN PQLKQIEGLVKELLEREGLTAAGKIEIKIVRPWAEGTEEGARWLTDEDTRNLKEIFFN ILVPGAEEAQKERQRQKELESNYRRVWGSPGGEGTGDLDEFDF" exon 204..380 /gene="OS-9" /number=2 exon 381..444 /gene="OS-9" /number=3 exon 445..521 /gene="OS-9" /number=4 exon 522..620 /gene="OS-9" /number=5 exon 621..831 /gene="OS-9" /number=6 exon 832..933 /gene="OS-9" /number=7 exon 934..1034 /gene="OS-9" /number=8 exon 1035..1086 /gene="OS-9" /number=9 exon 1087..1175 /gene="OS-9" /number=10 exon 1176..1451 /gene="OS-9" /number=11 exon 1452..1641 /gene="OS-9" /number=12 exon 1642..1806 /gene="OS-9" /number=13 exon 1807..1919 /gene="OS-9" /number=14 exon 1920..2690 /number=15 polyA_site 2690 /note="10 A nucleotides" BASE COUNT 684 a 686 c 785 g 535 t ORIGIN 1 agggcggaaa cagattctct gcataagaag gggaacgaaa gatggcggcg gaaacgctgc 61 tgtccagttt gttaggactg ctgcttctgg gactcctgtt acccgcaagt ctgaccggcg 121 gtgtcgggag cctgaacctg gaggagctga gtgagatgcg ttatgggatc gagatcctgc 181 cgttgcctgt catgggaggg cagagccaat cttcggacgt ggtgattgtc tcctctaagt 241 acaaacagcg ctatgagtgt cgcctgccag ctggagctat tcacttccag cgtgaaaggg 301 aggaggaaac acctgcttac caagggcctg ggatccctga gttgttgagc ccaatgagag 361 atgctccctg cttgctgaag acaaaggact ggtggacata tgaattctgt tatggacgcc 421 acatccagca ataccacatg gaagattcag agatcaaagg tgaagtcctc tatctcggct 481 actaccaatc agccttcgac tgggatgatg aaacagccaa ggcctccaag cagcatcgtc 541 ttaaacgcta ccacagccag acctatggca atgggtccaa gtgcgacctt aatgggaggc 601 cccgggaggc cgaggttcgg ttcctctgtg acgagggtgc aggtatctct ggggactaca 661 tcgatcgcgt ggacgagccc ttgtcctgct cttatgtgct gaccattcgc actcctcggc 721 tctgccccca ccctctcctc cggcccccac ccagtgctgc accacaggcc atcctctgtc 781 acccttccct acagcctgag gagtacatgg cctacgttca gaggcaagcc gactcaaagc 841 agtatggaga taaaatcata gaggagctgc aagatctagg cccccaagtg tggagtgaga 901 ccaagtctgg ggtggcaccc caaaagatgg caggtgcgag cccgaccaag gatgacagta 961 aggactcaga tttctggaag atgcttaatg agccagagga ccaggcccca ggaggggagg 1021 aggtgccggc tgaggagcag gacccaagcc ctgaggcagc agattcagct tctggtgctc 1081 ccaatgattt tcagaacaac gtgcaggtca aagtcattcg aagccctgcg gatttgattc 1141 gattcataga ggagctgaaa ggtggaacaa aaaaggggaa gccaaatata ggccaagagc 1201 agcctgtgga tgatgctgca gaagtccctc agagggaacc agagaaggaa aggggtgatc 1261 cagaacggca gagagagatg gaagaagagg aggatgagga tgaggatgag gatgaagatg 1321 aggatgaacg gcagttactg ggagaatttg agaaggaact ggaagggatc ctgcttccgt 1381 cagaccgaga ccggctccgt tcggaggtga aggctggcat ggagcgggaa cttgagaaca 1441 tcatccagga gacagagaaa gagctggacc cagatgggct gaagaaggag tcagagcggg 1501 atcgggcaat gctggctctc acatccactc tcaacaaact catcaaaaga ctggaggaaa 1561 aacagagtcc agagctggtg aagaagcaca agaaaaagag ggttgtcccc aaaaagcctc 1621 ccccatcacc ccaacctaca gaggaggatc ctgagcacag agtccgggtc cgggtcacca 1681 agctccgtct cggaggccct aatcaggatc tgactgtcct cgagatgaaa cgggaaaacc 1741 cacagctgaa acaaatcgag gggctggtga aggagctgct ggagagggag ggactcacag 1801 ctgcagggaa aattgagatc aaaattgtcc gcccatgggc tgaagggact gaagagggtg 1861 cacgttggct gactgatgag gacacgagaa acctcaagga gatcttcttc aatatcttgg 1921 tgccgggagc tgaagaggcc cagaaggaac gccagcggca gaaagagctg gagagcaatt 1981 accgccgggt gtggggctct ccaggtgggg agggcacagg ggacctggac gaatttgact 2041 tctgagacca acactacact tgacccttca cggaatccag actcttcctg gactggcttg 2101 cctcctcccc acctccccac cctggaaccc ctgagggcca aacagcagag tggagctgag 2161 ctgtggacct ctcgggcaac tctgtgggtg tgggggccct gggtgaatgc tgctgcccct 2221 gctggcagcc accttgagac ctcaccgggc ctgtgatatt tgctctcctg aactctcact 2281 caatcctctt cctctcctct gtggctttcc tgttattgtc ccctaatgat aggatattcc 2341 ctgctgccta cctggagatt cagtaggatc ttttgagtgg aggtgggtag agagagcaag 2401 gagggcagga cacttagcag gcactgagca agcaggcccc cacctgccct tagtgatgtt 2461 tggagtcgtt ttaccctctt ctattgaatt gccttgggat ttccttctcc ctttccctgc 2521 ccaccctgtc ccctacaatt tgtgcttctg agttgaggag ccttcacctc tgttgctgag 2581 gaaatggtag aatgctgcct atcacctcca gcacaatccc agcgaaaaag gtgtgaagca 2641 cccaccatgt tcttgaacaa tcaggtttct aaataaacaa ctggaccatc // LOCUS AB003102 1551 bp mRNA PRI 14-JAN-1998 DEFINITION Homo sapiens mRNA for 26S proteasome subunit p44.5, complete cds. ACCESSION AB003102 NID g1945608 KEYWORDS 26S proteasome subunit p44.5. SOURCE Homo sapiens cell_line:HepG2 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Saito,A., Watanabe,T.K., Shimada,Y., Fujiwara,T., Slaughter,C.A., DeMartino,G.N., Tanahashi,N. and Tanaka,K. TITLE cDNA cloning and functional analysis of p44.5 and p55, two regulatory subunits of the 26S proteasome JOURNAL Gene 203 (2), 241-250 (1997) MEDLINE 98086225 REFERENCE 2 (bases 1 to 1551) AUTHORS Tanaka,K. TITLE Direct Submission JOURNAL Submitted (14-APR-1997) to the DDBJ/EMBL/GenBank databases. Keiji Tanaka, The Tokyo Metropolitan Institute of Medical Science, Cancer Therapeatics; 18-22 Honkomagome 3-chome Bunkyo-ku, Tokyo 113, Japan (E-mail:tanahash@rinshoken.or.jp, Tel:03-3823-2237, Fax:03-3823-2237) FEATURES Location/Qualifiers source 1..1551 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HepG2" CDS 12..1280 /codon_start=1 /product="26S proteasome subunit p44.5" /db_xref="PID:d1020529" /db_xref="PID:g1945609" /translation="MAAAAVVEFQRAQSLLSTDREASIDILHSIVKRDIQENDEEAVQ VKEQSILELGSLLAKTGQAAELGGLLKYVRPFLNSISKAKAARLVRSLLDLFLDMEAA TGQEVELCLECIEWAKSEKRTFLRQALEARLVSLYFDTKRYQEALHLGSQLLRELKKM DDKALLVEVQLLESKTYHALSNLPKARAALTSARTTANAIYCPPKLQATLDMQSGIIH AAEEKDWKTAYSYFYEAFEGYDSIDSPKAITSLKYMLLCKIMLNTPEDVQALVSGKLA LRYAGRQTEALKCVAQASKNRSLADFEKALTDYRAELRDDPIISTHLAKLYDNLLEQN LIRVIEPFSRVQIEHISSLIKLSKADVERKLSQMILDKKFHGILDQGEGVLIIFDEPP VDKTYEAALETIQNMSKVVDSLYNKAKKLT" BASE COUNT 452 a 338 c 390 g 371 t ORIGIN 1 agagcggtaa gatggcggcg gcggcggtgg tggagttcca gagagcccag tctctactca 61 gcaccgaccg ggaggcctcc atcgacatcc tccactccat cgtgaagcgt gacattcagg 121 aaaacgatga agaggcagtg caagtcaaag agcagagcat cctggaactg ggatctctcc 181 tggcaaagac tggacaagct gcagagcttg gaggactcct gaagtatgta cgacccttct 241 tgaattccat cagcaaggct aaagcagctc gcctggtccg atctcttctt gatctgtttc 301 ttgatatgga agcagctaca gggcaggagg tcgagctgtg tttagagtgc atcgaatggg 361 ccaagtcaga gaaaagaact ttcttacgcc aagctttgga ggcaagactg gtgtctttgt 421 actttgatac caagaggtac caggaagcat tgcatttggg ttctcagctg ctgcgggagt 481 tgaaaaagat ggacgacaaa gctcttttgg tggaagtaca gcttttagaa agcaaaacat 541 accatgccct gagcaacctg ccgaaagccc gagctgcctt aacttctgct cgaaccacag 601 caaatgccat ctactgcccc cctaaattgc aggccacctt ggacatgcag tcgggtatta 661 tccatgcagc agaagagaag gactggaaaa ctgcgtactc atacttctat gaggcatttg 721 agggttatga ctccatcgac agccccaagg ccatcacatc tctgaagtac atgttgctgt 781 gcaaaatcat gctcaacacc ccagaagatg tccaggcttt ggtgagcggg aagcttgcac 841 ttcggtatgc agggaggcag acagaagcat taaaatgcgt ggctcaggct agcaagaaca 901 gatcactggc agattttgaa aaggctctga cagattaccg ggcagagctc cgggatgacc 961 caatcatcag cacacacttg gccaagttgt atgataactt actagaacag aatctgatcc 1021 gagtcattga gcctttttcc agagtacaga ttgaacacat atctagtctc atcaaactct 1081 ccaaggccga cgtggaaagg aaattatcac agatgattct tgacaagaaa tttcatggga 1141 ttttggacca gggggagggt gtcctgatta ttttcgatga acccccagta gataaaactt 1201 acgaagctgc tctggaaaca attcagaaca tgagcaaagt agtggattcc ctctacaaca 1261 aagccaagaa actgacatag agttggatct gtagcggtcc tttggagagt gtgtgtggcg 1321 ggagagtgaa accttggggg aaaatgctag gagattcttt tttctttttg ttctactttt 1381 cgctcggaaa gtttttaaat cctcatttgg tgcatctgta ttccagccaa taggtgtgcc 1441 agttttcatg taatctttac tggcccaact tgggagtggg gaaattgctt aaaaaaaaag 1501 aaaaagaaaa aaaaaaagat tattctaaat aaaaggaaaa aggcttacac t // LOCUS AB003103 3548 bp mRNA PRI 14-JAN-1998 DEFINITION Homo sapiens mRNA for 26S proteasome subunit p55, complete cds. ACCESSION AB003103 NID g1945610 KEYWORDS 26S proteasome subunit p55. SOURCE Homo sapiens cell_line:HepG2 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Watanabe,T., Shimada,Y., Saito,A., Fujiwara,T., Slaughter,C., DeMartino,G., Tanahashi,N. and Tanaka,K. TITLE cDNA cloning and functional analysis of p44.5 and p55, two regulatory subunits of the 26S proteasome JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 3548) AUTHORS Tanaka,K. TITLE Direct Submission JOURNAL Submitted (14-APR-1997) to the DDBJ/EMBL/GenBank databases. Keiji Tanaka, The Tokyo Metropolitan Institute of Medical Science, Cancer Therapeatics; 18-22 Honkomagome 3-chome Bunkyo-ku, Tokyo 113, Japan (E-mail:tanahash@rinshoken.or.jp, Tel:03-3823-2237, Fax:03-3823-2237) FEATURES Location/Qualifiers source 1..3548 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HepG2" CDS 44..1414 /codon_start=1 /product="26S proteasome subunit p55" /db_xref="PID:d1020530" /db_xref="PID:g1945611" /translation="MADGGSERADGRIVKMEVDYSATVDQRLPECAKLAKEGRLQEVI ETLLSLEKQTRTASDMVSTSRILVAVVKMCYEAKEWDLLNENIMLLSKRRSQLKQAVA KMVQQCCTYVEEITDLPIKLRLIDTLRMVTEGKIYVEIERARLTKTLATIKEQNGDVK EAASILQELQVETYGSMEKKERVEFILEQMRLCLAVKDYIRTQIISKKINTKFFQEEN TEKLKLKYYNLMIQLDQHEGSYLSICKHYRAIYDTPCIQAESEKWQQALKSVVLYVIL APFDNEQSDLVHRISGDKKLEEIPKYKDLLKLFTTMELMRWSTLVEDYGMELRKGSLE SPATDVFGSTEEGEKRWKDLKNRVVEHNIRIMAKYYTRITMKRMAQLLDLSVDESEAF LSNLVVNKTIFAKVDRLAGIINFQRPKDPNNLLNDWSQKLNSLMSLVNKTTHLIAKEE MIHNLQ" BASE COUNT 1143 a 583 c 677 g 1145 t ORIGIN 1 tggccgaagc agggggacag caagggacgc tcaggcgggg accatggcgg acggcggctc 61 ggagcgggct gacgggcgca tcgtcaagat ggaggtggac tacagcgcca cggtggatca 121 gcgcctaccc gagtgtgcga agctagccaa ggaaggaaga cttcaagaag tcattgaaac 181 ccttctctct ctggaaaagc agactcgtac tgcttccgat atggtatcga catcccgtat 241 cttagttgca gtagtgaaga tgtgctatga ggctaaagaa tgggatttac ttaatgaaaa 301 tattatgctt ttgtccaaaa ggcggagtca gttaaaacaa gctgttgcca aaatggttca 361 acagtgctgt acttatgttg aggaaatcac agaccttcct atcaaacttc gattaattga 421 tactctacga atggttaccg aaggcaagat ttatgttgaa attgagcgtg cgcgactgac 481 taaaacatta gcaactataa aagaacaaaa tggtgatgtg aaagaggcag cctccatttt 541 acaggagtta caggtggaaa cctacgggtc aatggaaaag aaagagcgag tggaatttat 601 tttggagcaa atgaggctct gcctagctgt gaaggattac attcgaacac aaatcatcag 661 caagaaaatt aacaccaaat ttttccagga agaaaataca gagaaattaa agttgaagta 721 ctataattta atgattcagc tggatcaaca tgagggatcc tatttgtcta tttgtaagca 781 ctacagagca atatatgata ctccctgtat acaggcagaa agtgaaaaat ggcagcaggc 841 tctgaagagt gttgtactct atgttatcct ggctcctttt gacaatgaac agtcagattt 901 ggttcaccga ataagtggtg acaagaagtt agaagaaatt cccaaataca aggatctttt 961 aaagcttttt accacaatgg agttgatgcg ttggtccaca cttgttgagg actatggaat 1021 ggaattaaga aaaggttccc ttgagagtcc tgcaacggat gtttttggtt ctacagagga 1081 aggtgaaaaa aggtggaaag acttgaagaa cagagttgtt gaacataata ttagaataat 1141 ggccaagtat tatactcgga taacaatgaa aaggatggca cagcttctgg atctatctgt 1201 tgatgagtcc gaagcctttc tctcaaatct agtagttaac aagaccatct ttgctaaagt 1261 agacagatta gcaggaatta tcaacttcca gagacccaag gatccaaata atttattaaa 1321 tgactggtct cagaaactga actcattaat gtctctggtt aacaaaacta cgcatctcat 1381 agccaaagag gagatgatac ataatctaca ataagggtct tagtgcttta gaaaaaagtt 1441 aaaattggaa gtcattaaaa aaagactgtt ataatggtgt atatgttggg gttttttttc 1501 taagcttctt tgtcttaaat tttaaaatag tgaatatgtt tgagactccc tttgaccttt 1561 cagttcccca agttcattgt taactttgca tttgcaattg gtgcaaaaat acagatttct 1621 gtcgtctgaa tacacaaaaa gttgtgtcat aacttaccca gatatgtttt tctatcattt 1681 gaaacctttt tagctactgt ttgttttcat tcaactaaca aacatattcc aataataaaa 1741 gcagtatata catatttcct ttctacagtt acctctgatt ctcaacattt tgtggggtag 1801 tgatttggca agtgtttttt aaataaaaca aatctcattg taaagttatc agtcatttag 1861 tagaatagaa aagcaacata gagcatacaa gaacatttgg gatagagttg tgatttgtga 1921 agaatttgta ctttgatatt gtggcggaaa gtctagactg agtgtgtatg ctggtaaact 1981 gtagactttt tttttttttt ttgagtccgg ctggttccaa tcacagtagc ttgattgctt 2041 tcagccctca tcctctcact tgatcagttg ttcaacagaa tcagctgaca taattgacac 2101 agtttattgg gtgttaagtc cgctctatag ggatagtgac tacttttttt tttttttttt 2161 tttttgctct tcttcctctc ccctttcttt atatgggttt aaatttaaca taaagttgtt 2221 tttataaggc ttatttgtgg ctttaacttg taagtctgat tacatcatta ttgttccaaa 2281 ttcattatct ctgtaggaac ttttagttcc attatatgaa cactggataa cctaattttt 2341 tttaatgctt taaaaaaatg gcaaaaagac gtcaggccac cctcatagta agtggtgtag 2401 tattaaaata ttttcacgga attaaaagta gcttgctgtc aaagaaacac ctgagatgaa 2461 ttggtgtgaa cgaattttgc aagtttaatt tgatttattt cagagaaaat agaaaaaaca 2521 atgttagaag gttatttaaa atgatactta aataaagaaa gtgtgaggtc tactttaaaa 2581 aaattcaaat gaagagaaaa agaaaaacag cattctagaa atggcatttc tcctaattaa 2641 ttttccactt aatggaagat tatcaattgt cctattttat gatcccagga ctgaagacag 2701 ttgtgggata tctgtcatat ttatcctgtg agtcattgtg aataatgaca tacagtactg 2761 aagtaatctg attttattct ttggaaattc aatgcattgg tcacactaat aacatcaaca 2821 tctgctatca cttatctttt taaaactaac caaaaaaggc tgggattaca ggcatgagcc 2881 actgcaccca actcctcttt cgtctttctt taacacacac taggctcttt gtgtattatg 2941 attcagtgct atttgtaact gtgtcccagt gaccaaattg cactcgactc gatcagctgt 3001 tcatccattt cgtgtttttt cctgtcaaac attaatccag caaatatatg aggtatttac 3061 caatttattt tcttagtatt acaaaataat tcattagcat aaagtacaat agtgaaatat 3121 ttgagttgtt cggaacctca attaatcctg ttttacattt cagacctaaa gctggcaatc 3181 aggagaagaa gcactttgtt ttaaatgtgg agaagataac acttgattcc atttcattgt 3241 cattagtgta ttaaccagca ggagaggtga tgagccattt ttcaaatgaa atacctttta 3301 tttccatata atttttttat tttagagttc aatagctgtt tctatgatta tcctcaattt 3361 ccatatgtta ctgaatctga aaaacatctt taaaattcaa acagttccat tttctctctt 3421 gtaagtgtta aatgtgataa aagtacatat tttaaattgt tttcagctct tggatatagc 3481 agcaataaaa acactaattt gtgggtattt aagaaaacct ggagaataaa ctcatacttt 3541 aaaagatc // LOCUS AB003177 1193 bp mRNA PRI 24-APR-1997 DEFINITION Human mRNA for proteasome subunit p27, complete cds. ACCESSION AB003177 NID g2055255 KEYWORDS proteasome subunit p27. SOURCE Homo sapiens cell_line:HepG2 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1193) AUTHORS Tanaka,K. TITLE Direct Submission JOURNAL Submitted (17-APR-1997) to the DDBJ/EMBL/GenBank databases. Keiji Tanaka, The Tokyo Metropolitan Institute of Medical Science, Cancer Therapeatics; 18-22 Honkomagome 3-chome Bunkyo-ku, Tokyo 113, Japan (E-mail:tanahash@rinshoken.or.jp, Tel:03-3823-2237, Fax:03-3823-2237) REFERENCE 2 (sites) AUTHORS Watanabe,T., Suzuki,M., Saito,A., Fujiwara,T., takahashi,E., Slaughter,C., DeMartino,G., Tanahashi,N. and Tanaka,K. TITLE cDNA cloning and Chromosomal Mapping of a Human Proteasomal Modulater Subunit p27 JOURNAL Unpublished (1997) FEATURES Location/Qualifiers source 1..1193 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HepG2" CDS 125..754 /codon_start=1 /product="proteasome subunit p27" /db_xref="PID:d1020572" /db_xref="PID:g2055256" /translation="MSDEEARQSGGSSQAGVVTVSDVQELMRRKEEIEAQIKANYDVL ESQKGIGMNEPLVDCEGYPRSDVDLYQVRTARHNIICLQNDHKAVMKQVEEALHQLHA RDKEKQARDMAEAHKEAMSRKLGQSESQGPPRAFAKVNSISPGSPASIAGLQVDDEIV EFGSVNTQNFQSLHNIGSVVQHSEGALAPTILLSVSMNLTTPGTSSRSP" BASE COUNT 278 a 303 c 361 g 251 t ORIGIN 1 actgttctcg cgttcgcgga cggctgtggt gttttggcgc atgggcggag cgtagttacg 61 gtcgactggg gcgtcgtccc tagcccggga gccgggtctc tggagtcgcg gcccggggtt 121 cacgatgtcc gacgaggaag cgaggcagag cggaggctcc tcgcaggccg gcgtcgtgac 181 tgtcagcgac gtccaggagc tgatgcggcg caaggaggag atagaagcgc agatcaaggc 241 caactatgac gtgctggaaa gccaaaaagg cattgggatg aacgagccgc tggtggactg 301 tgagggctac ccccggtcag acgtggacct gtaccaagtc cgcaccgcca ggcacaacat 361 catatgcctg cagaatgatc acaaggcagt gatgaagcag gtggaggagg ccctgcacca 421 gctgcacgct cgcgacaagg agaagcaggc ccgggacatg gctgaggccc acaaagaggc 481 catgagccgc aaactgggtc agagtgagag ccagggccct ccacgggcct tcgccaaagt 541 gaacagcatc agccccggct ccccagccag catcgcgggt ctgcaagtgg atgatgagat 601 tgtggagttc ggctctgtga acacccagaa cttccagtca ctgcataaca ttggcagtgt 661 ggtgcagcac agtgaggggg ccctggcacc caccatccta ctttctgtct ctatgaattt 721 gactactcca gggacctcat ctagaagccc ctgaatgtga cagtgatccg caggggggaa 781 aaacaccagc ttagacttgt tccaacacgc tgggcaggaa aaggactgct gggctgcaac 841 attattcctc tgcaaagatg attgtccctg gggaacagta acaggaaagc atcttccctt 901 gccctggact tgggtctagg gatttccaac ttgtcttctc tccctgaagc ataaggatct 961 ggaagaggct tgtaacctga acttctgtgt ggtggcagta ctgtggccca ccagtgtaat 1021 ctccctggat taaggcattc ttaaaaactt aggcttggcc tctttcacaa attaggccac 1081 ggccctaaat aggaattccc tggattgtgg gcaagtgggc ggaagttatt ctggcaggta 1141 ctggtgtgat tattattatt atttttaata aagagtttta cagtgctgat atg // LOCUS AB003184 2110 bp mRNA PRI 22-OCT-1997 DEFINITION Homo sapiens mRNA for ISLR, complete cds. ACCESSION AB003184 NID g2554603 KEYWORDS ISLR. SOURCE Homo sapiens (isolate:Caucasian) retina cDNA to mRNA, clone_lib:human retina 5'-STRECH cDNA library (CLONTECH). ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Nagasawa,A., Kubota,R., Imamura,Y., Nagamine,K., Wang ,Y., Asakawa,S., Kudoh,J., Minoshima,S., Mashima,Y., Oguchi,Y. and Shimizu,N. TITLE Cloning of the cDNA for a new member of the immunoglobulin superfamily (ISLR) containing leucine-rich repeat JOURNAL Genomics 44 (3), 273-279 (1997) MEDLINE 97468140 REFERENCE 2 (bases 1 to 2110) AUTHORS Shimizu,N. TITLE Direct Submission JOURNAL Submitted (17-APR-1997) to the DDBJ/EMBL/GenBank databases. Nobuyoshi Shimizu, Keio University School of Medicine, Department of Molecular Biology; 35 Shinanomachi, Shinjuku-ku, Tokyo 160, Japan (E-mail:shimizu@dmb.med.keio.ac.jp, Tel:03-3351-2370, Fax:03-3351-2370) FEATURES Location/Qualifiers source 1..2110 /organism="Homo sapiens" /isolate="Caucasian" /db_xref="taxon:9606" /chromosome="15" /clone_lib="human retina 5'-STRECH cDNA library (CLONTECH)" /map="15q23-24" /tissue_type="retina" CDS 99..1385 /codon_start=1 /product="ISLR" /db_xref="PID:d1023718" /db_xref="PID:g2554604" /translation="MQELHLLWWALLLGLAQACPEPCDCGEKYGFQIADCAYRDLESV PPGFPANVTTLSLSANRLPGLPEGAFREVPLLQSLWLAHNEIRTVAAGALASLSHLKS LDLSHNLISDFAWSDLHNLSALQLLKMDSNELTFIPRDAFRSLRALRSLQLNHNRLHT LAEGTFTPLTALSHLQINENPFDCTCGIVWLKTWALTTAVSIPEQDNIACTSPHVLKG TPLSRLPPLPCSAPSVQLSYQPSQDGAELRPGFVLALHCDVDGQPAPQLHWHIQIPSG IVEITSPNVGTDGRALPGTPVASSQPRFQAFANGSLLIPDFGKLEEGTYSCLATNELG SAESSVDVALATPGEGGEDTLGRRFHGKAVEGKGCYTVDNEVQPSGPEDNVVIIYLSR AGNPEAAVAEGVPGQLPPGLLLLGQSLLLFFFLTSF" sig_peptide 99..152 misc_feature 153..260 /note="amino-flanking region" repeat_region 261..638 /note="leucine-rich repeat" misc_feature 639..788 /note="carboxy-flanking region" misc_feature 789..1148 /note="immunoglobulin like domain" misc_feature 1335..1382 /note="transmembrane domain" polyA_signal 2089..2094 polyA_site 2110 /note="13 A nucleotides" BASE COUNT 379 a 693 c 592 g 446 t ORIGIN 1 caggccgagg cagggagaac tctccactcg gaggaggagc tggggtcctc ttccatcccg 61 tcttcatcct gcctggctgc gtgacctcgg gaggcaccat gcaggagctg catctgctct 121 ggtgggcgct tctcctgggc ctggctcagg cctgccctga gccctgcgac tgtggggaaa 181 agtatggctt ccagatcgcc gactgtgcct accgcgacct agaatccgtg ccgcctggct 241 tcccggccaa tgtgactaca ctgagcctgt cagccaaccg gctgccaggc ttgccggagg 301 gtgccttcag ggaggtgccc ctgctgcagt cgctgtggct ggcacacaat gagatccgca 361 cggtggccgc cggagccctg gcctctctga gccatctcaa gagcctggac ctcagccaca 421 atctcatctc tgactttgcc tggagcgacc tgcacaacct cagtgccctc caattgctca 481 agatggacag caacgagctg accttcatcc cccgcgacgc cttccgcagc ctccgtgctc 541 tgcgctcgct gcaactcaac cacaaccgct tgcacacatt ggccgagggc accttcaccc 601 cgctcaccgc gctgtcccac ctgcagatca acgagaaccc cttcgactgc acctgcggca 661 tcgtgtggct caagacatgg gccctgacca cggccgtgtc catcccggag caggacaaca 721 tcgcctgcac ctcaccccat gtgctcaagg gtacgccgct gagccgcctg ccgccactgc 781 catgctcggc gccctcagtg cagctcagct accaacccag ccaggatggt gccgagctgc 841 ggcctggttt tgtgctggca ctgcactgtg atgtggacgg gcagccggcc cctcagcttc 901 actggcacat ccagataccc agtggcattg tggagatcac cagccccaac gtgggcactg 961 atgggcgtgc cctgcctggc acccctgtgg ccagctccca gccgcgcttc caggcctttg 1021 ccaatggcag cctgcttatc cccgactttg gcaagctgga ggaaggcacc tacagctgcc 1081 tggccaccaa tgagctgggc agtgctgaga gctcagtgga cgtggcactg gccacgcccg 1141 gtgagggtgg tgaggacaca ctggggcgca ggttccatgg caaagcggtt gagggaaagg 1201 gctgctatac ggttgacaac gaggtgcagc catcagggcc ggaggacaat gtggtcatca 1261 tctacctcag ccgtgctggg aaccctgagg ctgcagtcgc agaaggggtc cctgggcagc 1321 tgcccccagg cctgctcctg ctgggccaaa gcctcctcct cttcttcttc ctcacctcct 1381 tctagcccca cccagggctt ccctaactcc tccccttgcc cctaccaatg cccctttaag 1441 tgctgcaggg gtctggggtt ggcaactcct gaggcctgca tgggtgactt cacattttcc 1501 tacctctcct tctaatctct tctagagcac ctgctatccc caacttctag acctgctcca 1561 aactagtgac taggatagaa tttgatcccc taactcactg tctgcggtgc tcattgctgc 1621 taacagcatt gcctgtgctc tcctctcagg ggcagcatgc taacggggcg acgtcctaat 1681 ccaactggga gaagcctcag tggtggaatt ccaggcactg tgactgtcaa gctggcaagg 1741 gccaggattg ggggaatgga gctggggctt agctgggagg tggtctgaag cagacaggga 1801 atgggagagg aggatgggaa gtagacagtg gctggtatgg ctctgaggct ccctggggcc 1861 tgctcaagct cctcctgctc cttgctgttt tctgatgatt tgggggcttg ggagtccctt 1921 tgtcctcatc tgagactgaa atgtggggat ccaggatggc ttccttcctc ttacccttcc 1981 tccctcagcc tgcaacctct atcctggaac ctgtcctccc tttctcccca actatgcatc 2041 tgttgtctgc tcctctgcaa aggccagcca gcttgggagc agcagagaaa taaacagcat 2101 ttctgatgcc // LOCUS AB003286 11198 bp DNA PRI 11-JUL-1997 DEFINITION Homo sapiens DNA for choline kinase like protein and muscle type carnitine palmitoyltransferase I, partial and complete cds. ACCESSION AB003286 NID g2257471 KEYWORDS alternative splicing; choline kinase like protein; muscle type carnitine palmitoyltransferase I. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 11198) AUTHORS Yamazaki,N., Yamanaka,Y., Hashimoto,Y., Shinohara,Y., Shima,A. and Terada,H. TITLE Direct Submission JOURNAL Submitted (22-APR-1997) to the DDBJ/EMBL/GenBank databases. Naoshi Yamazaki, University of Tokushima, Faculty of Pharmaceutical Sciences; 1 Shomachi, Tokushima, Tokushima 770, Japan (E-mail:yamazaki@ph.tokushima-u.ac.jp, Tel:0886-33-7279, Fax:0886-33-5196) REFERENCE 2 (sites) AUTHORS Yamazaki,N., Yamanaka,Y., Hashimoto,Y., Shinohara,Y., Shima,A. and Terada,H. TITLE Structural features of the gene encoding human muscle type carnitine palmitoyltransferase I JOURNAL FEBS Lett. 409 (3), 401-406 (1997) MEDLINE 97367931 FEATURES Location/Qualifiers source 1..11198 /organism="Homo sapiens" /db_xref="taxon:9606" CDS join(<61..142,311..>385) /note="putative; similar to GenBank accession number T07548 and T30127" /codon_start=1 /evidence=not_experimental /product="choline kinase like protein" /db_xref="PID:d1022338" /translation="VCSGIPFLLGSVVHPPGIHVHHRIWLLGLCPVSVPVLLPAEGAA DQCPLLIL" 3'UTR 386..590 /evidence=not_experimental polyA_site 581..586 /evidence=not_experimental exon <905..1017 /note="alternative splicing; exon 1A" /number=1 5'UTR join(<905..1017,1631..1649) /note="alternative splicing" exon <1128..1180 /note="alternative splicing; exon 1B" /number=1 5'UTR join(<1128..1180,1631..1649) /note="alternative splicing" exon 1631..1790 /number=2 CDS join(1650..1790,2102..2241,2531..2708,2928..3029, 3230..3367,3453..3530,4964..5071,5145..5229,5848..6043, 6498..6683,7250..7355,7437..7553,8020..8184,8267..8401, 8516..8668,9152..9265,9890..9982,10137..10220) /codon_start=1 /evidence=not_experimental /product="muscle type carnitine palmitoyltransferase I" /db_xref="PID:d1022339" /db_xref="PID:g2257472" /translation="MAEAHQAVAFQFTVTPDGVDFRLSREALKHVYLSGINSWKKRLI RIKNGILRGVYPGSPTSWLVVIMATVGSSFCNVDISLGLVSCIQRCLPQGCGPYQTPQ TRALLSMAIFSTGVWVTGIFFFRQTLKLLLCYHGWMFEMHGKTSNLTRIWAMCIRLLS SRHPMLYSFQTSLPKLPVPRVSATIQRYLESVRPLLDDEEYYRMELLAKEFQDKTAPR LQKYLVLKSWWASNYVSDWWEEYIYLRGRSPLMVNSNYYVMDLVLIKNTDVQAARLGN IIHAMIMYRRKLDREEIKPVMALGIVPMCSYQMERMFNTTRIPGKDTDVLQHLSDSRH VAVYHKGRFFKLWLYEGARLLKPQDLEMQFQRILDDPSPPQPGEEKLAALTAGGRVEW AQARQAFFSSGKNKAALEAIERAAFFVALDEESYSYDPEDEASLSLYGKALLHGNCYN RWFDKSFTLISFKNGQLGLNAEHAWADAPIIGHLWEFVLGTDSFHLGYTETGHCLGKP NPALAPPTRLQWDIPKQCQAVIESSYQVAKALADDVELYCFQFLPFGKGLIKKCRTSP DAFVQIALQLAHFRDRGKFCLTYEASMTRMFREGRTETVRSCTSESTAFVQAMMEGSH TKADLRDLFQKAAKKHQNMYRLAMTGAGIDRHLFCLYLVSKYLGVSSPFLAEVLSEPW RLSTSQIPQSQIRMFDPEQHPNHLGAGGGFGPVADDGYGVSYMIAGENTIFFHISSKF SSSETNAQRFGNHIRKALLDIADLFQVPKAYS" exon 2102..2241 /number=3 exon 2531..2708 /number=4 exon 2928..3029 /number=5 exon 3230..3367 /number=6 exon 3453..3530 /number=7 exon 4964..5071 /number=8 exon 5145..5229 /number=9 exon 5848..6043 /number=10 exon 6498..6683 /number=11 exon 7250..7355 /number=12 exon 7437..7553 /number=13 exon 8020..8184 /number=14 exon 8267..8401 /number=15 exon 8516..8668 /number=16 exon 9152..9265 /number=17 exon 9890..9982 /number=18 exon 10137..10222 /number=19 3'UTR join(10221..10222,10476..10695) exon 10476..10695 /number=20 polyA_site 10671..10676 BASE COUNT 2334 a 3187 c 3331 g 2346 t ORIGIN 1 ggatccaagc agtgctagac atgctcttga atgcccctcc ttttcctgcc ctccccccag 61 gtatgctctg gcatcccatt tcttctgggg tctgtggtcc atcctccagg catccatgtc 121 caccatagaa tttggttact tggtaagtga ccctggggat gggaatgcta gctggggggc 181 tggggagcag cagcagccac actcttccag gaggcctggg gagtcccggg tggctgtggg 241 cagctgaggt ggatgtagaa tgctggtccc acgtcttctc accactgtgt gggtgggttt 301 ccttccctag gactatgccc agtctcggtt ccagttctac ttccagcaga aggggcagct 361 gaccagtgtc cactcctcat cctgactcca ccctcccact ccttggattt ctcctggagc 421 ctccagggca ggaccttgga gggaggaaca acgagcagaa ggccctggcg actgggctga 481 gcccccaagt gaaactgagg ttcaggagac cggcctgttc ctgagtttga gtaggtcccc 541 atggctggca ggccagagcc ccgtgctgtg tatgtaacac aataaacaag cttcttcttc 601 ccaccctgtc ctggccctgc tgagcagcag cagaaagtac caaaccgagc agtacacaca 661 aagggactct tcagtgctct gggattgaaa gtggttagcg ttcatgctgc cagttggggt 721 cccccatccc tccccagtcc cctggctgca gcttagaata ataaatacta ggacttgggg 781 aggaggagag tgatgggggt atgaagacga ccctgaggtg gggatgccgc ccggagcacc 841 agcgatccca gaacaggcag cagctgacac atcggtgacc ttttccctac atttggctat 901 ttttagctct aatgccacca tcctcacgag actctggggc cccccaggct cccagacctt 961 tgagcaacct tcaccgcaca gaaacccagc cgcgccctgc aattcccacc gcggaaggtg 1021 ggtgggttct ggttctcgcc ccacgttttt ccccgacccc gatttgggag taggtgtcag 1081 gttcctggtg agggcggggc gggggtggct aggcctgaag gacgtgggga cacgggccag 1141 agtggctggc cccacgcacg gacaggagtg aacccgagct gtgagtaggg gccggcacag 1201 ggcggcccgc gggggtctgg gccctcagcc ctgcacaggg gcgaggacgg cgctggggcc 1261 cgcgcgctcg ggtggggaag ggcgggctcc cgaaccctgc ctctgtccag gccggctcca 1321 cttccagggg cgcctctctc ccctgcccgc gccctcgctg acgcccccca actccaggct 1381 gcccttcgcc gtccttgggg ctcctggagc tttcaaggcc caaatcccct gcaccacagt 1441 ggctgtgccc acccggaagg ctggcgcgag gatttggcgg cggttggcct ggcggggggc 1501 gcgggccggg ggcagccgct agtcgcgggg tggggggcgc gaggggtcgc ggactggctg 1561 ggggcgtctc ggcgcggctg gcggcggggc cggcctaacg cgcccgcgca cccatctgcc 1621 cccgtcctag gtgccgacca acccccagga tggcggaagc tcaccaggcc gtggccttcc 1681 agttcacggt gaccccagac ggggtcgact tccggctcag tcgggaggcc ctgaaacacg 1741 tctacctgtc tgggatcaac tcctggaaga aacgcctgat ccgcatcaag gtgcgcacag 1801 gtgcttctcc cagagcgtag gcagaggccg gctgtcagct gttaagcgct ttgttagggt 1861 ccctcactgc ctccttggct ggcacttctg cccggtacag gttgtggaag tacagacacc 1921 agaggggtgc acaggatgtg gtcggacaca gggagctgtg ggtgtggcgg aggaaggagc 1981 acagcagggc atcaggagag aaagccttcc aggccaagac caggagccag ttcccaagac 2041 ttcacaggca ggctaacctc ccgccttccg gctccataag ggcgcctgtt tctgcccaca 2101 gaatggcatc ctcaggggcg tgtaccctgg cagccccacc agctggctgg tcgtcatcat 2161 ggcaacagtg ggttcctcct tctgcaacgt ggacatctcc ttggggctgg tcagttgcat 2221 ccagagatgc ctccctcagg ggtaaggagt gaaactggaa gggcacaggt gccaccaggg 2281 agggctgggc ccagctccca aggctgaggt tcctgagctg ggcagataca ggacagcagc 2341 cattggcagt cacggggcag ccctccccta tgacaaccat tgtcttagcc ctacatccgc 2401 tcatttgatg cagtcagaca tgagtgtgcc cagggaggtt cttccccttg gtgtctcccc 2461 tgagacagtt cacagccacc cgaggctggc ctcaagagga ccccctgcag cctttgcccc 2521 tctccaatag gtgtggcccc taccagaccc cgcagacccg ggcacttctc agcatggcca 2581 tcttctccac gggcgtctgg gtgacgggca tcttcttctt ccgccaaacc ctgaagctgc 2641 ttctctgcta ccatgggtgg atgtttgaga tgcatggcaa gaccagcaac ttgaccagga 2701 tctgggctgt gagcagcagc cagtggaggg gttcaggcac ctgggttgag actctttgga 2761 ctcctttggg gttctgagct agaggggaga ggcagacagg gcactggtgc ctggtgtgtg 2821 gtttgtcctg gaggggctgg gatggctctg agggtctcag ggagttgctg gttggtttcc 2881 attttttcca ctggctccca ccccagcact ctgctctgta cccccagatg tgtatccgcc 2941 ttctatccag ccggcaccct atgctctaca gcttccagac atctctgccc aagcttcctg 3001 tgcccagggt gtcagccaca attcagcggg tgagggcctc gcttgggcat cccagtgggc 3061 aggggaggtt ggattcagga gatgtttcca aatataaggt tctgtgcaaa gagtggcctt 3121 aagggcttga gaataatggg gctgggtgag gagggagagg tgggaagagg attaagatag 3181 aggcagccct tgccatctgg ccccacggtg atgataactg gctggacagt acctagagtc 3241 tgtgcgcccc ttgttggatg atgaggaata ttaccgcatg gagttgctgg ccaaagaatt 3301 ccaggacaag actgccccca ggctgcagaa atacctggtg ctcaagtcat ggtgggcaag 3361 taactatgta agttcctgcc cctgggctca ctgtcacctg ccatgtgtcc tggctgcacc 3421 cgccccagct ctaaccttcc acctccccac aggtgagtga ctggtgggaa gagtacatct 3481 accttcgagg caggagccct ctcatggtga acagcaacta ttatgtcatg gtatgaacta 3541 gagcccccag gtccccgcac gtgctcagct ctgtcccagc tccaaggcaa gggatctgga 3601 ggacagccca gagctctagt agcagcttcc gtgggcaagt gggggttatg gagtgaggcc 3661 tgagggaaag ggaagagaga gaggagatcc tagaagagtc cagaagcagc ttagggcaat 3721 ggggatccta aggatgagga gagtggagac cgccagcctg ccaccgcttc tcagagtccc 3781 ggggtcactg cccctgccca gctcgggctc tgtcacctct ttccttggtt tcctcactgg 3841 cctcctggca tgggttccca gctgtcctga caccataacc agggaattgt ctagaacgtg 3901 tcttgctttg tgtccctctg cagagccgga cagcagaatg gaggccaggc tgctgctttt 3961 agagctcagg aagtcagtct gcctctgccc catgtaactg gcccttctga gtctctgggt 4021 ctcccagcca cctgggctgt atggcatgcc tcttctctcc tttcccacgg tccaaagcac 4081 acttgacttg ccagagccta ctggaattct cctccataag ctgtccttcc aggaacttac 4141 acacaacttg cccatagcag aggttttaaa ctgcttttta gtggtagaac ccctttatca 4201 aagcaaaaga agtagaatat aagcacataa aatagcttat aaaaaggcag ctcgggccgg 4261 gtgcagtggc tcacgcctgt aatcccagca ctttgggagg ccgaggcagg tggatcacaa 4321 ggtcaggaga tcgagagcat cctaacacgg tgaaaccccg tctctaccaa aaatacaaaa 4381 aattagccgg gcgtggtggt gggcgtctgt agtcccagct actcgggagg ctgaggcagg 4441 caaatggcgt gaacccggga ggtggagctt gaaatgagcc gagatcgcac cactgcactc 4501 cagcctgggt gacaagagcg agactctatc tcagaaaaat aaataaataa aaatttaaaa 4561 ataaaaataa ataaacaaat aaaaaggaag ctcagagcca ggcgtggtgg tgcatgactc 4621 taatccctgc aactcaggag gctgcagcag gaggatcact tagggcgagt aatttgaggc 4681 tgcagtgagc catgcttgtt ccgctgcact ccagcctggg cgacagagca agaccctaac 4741 tctaaaaaat aaagaataaa gcagctcagg ctgaagctgg agcagagggt tactgcgatc 4801 agttgtgcat ttctggaggt tcaccccggc agcatgtgca ggatgtactg gaagagggga 4861 gacagaacac ctcaggcccc cgaatagatt ggtccttggg tcagcaagac tcaggtgaca 4921 cccagacaga ggcccccacc ccgcgggctc tgttcctctg taggaccttg tgctcatcaa 4981 gaatacagac gtgcaggcag cccgcctggg aaacatcatc cacgccatga tcatgtatcg 5041 ccgtaaactg gaccgtgaag aaatcaagcc tgtgagttgc gtcagggttg aaggtgggat 5101 gggaggggag acctgagtct gagccatgct gggccttccc tcaggtgatg gcactgggca 5161 tagtgcctat gtgctcctac cagatggaga ggatgttcaa caccactcgg atcccgggca 5221 aggacacagg taactgagcc ccctcgctgc tacctgtggg ccatctggct ggctcgctgc 5281 cctcctgcct gctcatcacc aagcgtcccc agtgtctcag ggtcctgaaa ctgtgaacag 5341 tagtcaattg tgacagatac tagacatcca ttgtttacag gcatgatgct ggcgcaggga 5401 ccccacaggg cccagagcag agcccctccc ctccgggctc actgcgcatg gtgaagggag 5461 tcctgctgca gcccaaagct taggacagac ccggcgctgc catggagccg gcagaggagg 5521 ggaggggcgc acccaggggt ctggcagagg cagcagcctt ccctgcttct gacactgtat 5581 ccttaggcgg tttgcacaac cctctagtgc gtcgtttcct cttctgtgaa atactttata 5641 ggattgttgg tgttgcgtga gagagagtgg aagcacccag cacagggcct ggtttaggac 5701 acacggatcc catccagggg gcaggaaggc ccaggcagag gccgagcaaa acagggtctg 5761 caggggtcac ctagtgcatg ggaggtgggc ccttcccagg atgtagctgg gggccccgcc 5821 tcagcttgcc cgtggcctgt atcacagatg tgctacagca cctctcagac agccggcacg 5881 tggctgtcta ccacaaggga cgcttcttca agctgtggct ctatgagggc gcccgtctgc 5941 tcaagcctca ggatctggag atgcagttcc agaggatcct ggacgacccc tccccacctc 6001 agcctgggga ggagaagctg gcagccctca ctgcaggagg aaggtattgg cctctgggaa 6061 gggactgtcc ccaccctgag ttcagggctc cgtgaggaga aggagcgtgg ccctgcctgc 6121 caccctggaa ctggaggctg gaggcacaac taggggaggg gcattggtgg tcatggcagc 6181 aggacagcca gcataaccta cctctgacgg gtggcagcca ggtgaagtgt gcagagggtg 6241 gggacacctc caaaatagct tggcaccccc cacctccagg cccagcctgg cacacacacc 6301 ccacctccag gcgcacgcaa cggcacccca acacctccag gcccagcctg cacccccaca 6361 tctccaggtc cagcgtggta cccccatctc caggtccagc ctggcacccc acccccatct 6421 ccaggtccag ccaggccctc agaggcaccc tcatcccaag tccacgtgcc cactgcttac 6481 cctgccccat gcttcagggt ggagtgggcg caggcacgcc aggccttctt tagctctgga 6541 aagaataagg ctgccttgga ggccatcgag cgtgccgctt tcttcgtggc cctggatgag 6601 gaatcctact cctatgaccc cgaagatgag gccagcctca gcctctatgg caaggccctg 6661 ctacatggca actgctacaa caggtacggc agccccagcc ccacaggtta cagcttaagg 6721 ttaaaagtta gggttatggt tagaggatta aagataaaag aaggtagggt tatgagctgg 6781 gtgcagtggc acacacctgt gatcctagca ctttgggggc caaggcaggt ggatcacttg 6841 agtgcaggag ctcaagacca gcctgggcaa cggagcgaga ccccttcata aaagtagtta 6901 ggattgtgat tagtggttgg gtagggctag tggctagggt taaaagctag ggtttgggtt 6961 atagaatagg gttaaaagcc gggcatggtg gcaggcacct gtaatcccag ctagtcggaa 7021 ggctgacgca ggagaagccc ttgaacccgg gaggttatgg gaagctaaga tcacaccact 7081 gcactccagc ctgggcaaca gagcaagatt ccatctcaat ttaaaaaaaa agtgagagaa 7141 aaagagagag agaatagggt tagtaattag ggttaaaggt tggggttgca ggtcaggctc 7201 ctctggacat tcccagcttt ggttcttcat gtgtctactc ttcctgcagg tggtttgaca 7261 aatccttcac tctcatttcc ttcaagaatg gccagttggg tctcaatgca gagcatgcgt 7321 gggcagatgc tcccatcatt gggcacctct gggaggtaat agccttgcag agggaacctg 7381 cagggcaggc tgtaggggga tgaggccagc ctctcagtct catcctctcc ctgcagtttg 7441 tcctgggcac agacagcttc cacctgggct acacggagac cgggcactgc ctgggcaaac 7501 cgaaccctgc gctcgcacct cctacacggc tgcagtggga cattccaaaa caggtgggtt 7561 ggaagctccc agagcaggtg tgagaccaca aagcagcagg tgggtacagc cccgacgagg 7621 cctgagcctc ctcctcccct gctggcctca ctgcctggcc cagccctcgg gaaggcacag 7681 ggcacgtctc aggatacctg tagagtccaa actggcttca gggaggacag agaccaccca 7741 ccgcccctgg ggccatctgt gtttagagac agccatgaga tggaggaggc actcacaggc 7801 ccctggagca ttttcagcac ttccctctta cccacaaagc tgagcccggc ctctgggggc 7861 tgattctccc tcagactgtc ttttgcgtac cctcctcctg aagatgtctt ggccggctgt 7921 gccctttctc caccaactaa acgtcatgcc tcctagacat gacccagagt cctgcttgga 7981 gagccctacc ctcagctgac ccttcccatg tcttggcagt gccaggcggt catcgagagt 8041 tcctaccagg tggccaaggc gttggcagac gacgtggagt tgtactgctt ccagttcctg 8101 ccctttggca aaggcctcat caagaagtgc cggaccagcc ctgatgcctt tgtgcagatc 8161 gcgctgcagc tggctcactt ccgggtagga gccccgcctc ccgctgctga gagggcaggg 8221 tggtaccagg gtccacctgc cagattcacc cctctgtata tcccaggaca ggggtaagtt 8281 ctgcctgacc tatgaggcct caatgaccag aatgttccgg gagggacgga ctgagactgt 8341 gcgttcctgt accagcgagt ccacagcctt tgtgcaggcc atgatggagg ggtcccacac 8401 agtaagtgtc ctctgcccat gtgggggtca cagtcgtcgg gtgaggtgcc ccctctgcct 8461 cctgtctgcc tggagggcca gggctactct tcaccccttt acttctgccc cgcagaaagc 8521 agacctgcga gatctcttcc agaaggctgc taagaagcac cagaatatgt accgcctggc 8581 catgaccggg gcagggatcg acaggcacct cttctgcctt tacttggtct ccaagtacct 8641 aggagtcagc tctcctttcc ttgctgaggt cagcaccgtt gttgggtgtg tcctttgtcc 8701 cactgccctc ctacacgcag ggcttgggcc atctcatgat ggagcacggc ctgttttcct 8761 ggcttgctcc cctgaagctc cctggagtgc tgggccagct ttcccgccca caccccacct 8821 gggcctgtgg tcctgggact gagcaggagc aacatcctct ttgtggtgtt ggtgtccttg 8881 tgacagggaa cagacaaata tatgatctgt caggtggtga cagcactact gagactagtg 8941 aggctatgtg ggggagaggc agagcagagg gagggctggg cagctcacac aagccttatg 9001 tggcatgatg cacaggggac cacggggtgg tggccaggga gctcctgcac cctcagcaga 9061 accgtgtctg gctcagagta ggtcgcagca ccacgtgttg gacaggtgct ttgggtatgt 9121 ggcctctgac cagctgtggc ctccattgca ggtgctctcg gaaccctggc gtctctccac 9181 cagccagatc ccccaatccc agatccgcat gttcgaccca gagcagcacc ccaatcacct 9241 gggcgctgga ggtggctttg gccctgtgag tgctcctgaa gggggtgggt gggcagcacc 9301 agggccctga gggtttcagt ggcgagtgca ggccctgagg tcagacgaga ggcaggagca 9361 ccttgcttag aggagagcat aaccccagac ctgcatggaa gcagaatgtt agctatggac 9421 tcaggcagcc aagcacctgg gacccaccca tcccaaatac ctccctgctg aagcatgacc 9481 gtggtttccg gggtactgaa gcatgactgt ggggctggtt tccagggtac tgggcagtgc 9541 actgggtgct tctgaagtct acttatccaa gagagtgcag ctgtctccca cagaaacctt 9601 cacatgggct agcttgtcag aaggctagag aaccctctgg agagagacgg agtgatctag 9661 agaataagcc ccctgaggga aactgggagg tcccagatcc cttggcgggg tctgggcagc 9721 attctttggt tttcacagtt tcctgggggt ggtcctcagg aaaaatgcag tgtctagggc 9781 tggggcccgt tcctgcaact cttgtgggaa gaacaagtac aatatcagct tggctttcag 9841 atcttcaagg tttgatttat gccttcttca cccttcttgt tgccctcagg tagcagatga 9901 tggctatgga gtttcctaca tgattgcagg cgagaacacg atcttcttcc acatctccag 9961 caagttctca agctcagaga cggtgagtct cctgccacag ctcaggcctg aggaaggggt 10021 gccacctggg gctgcccagg aacacaggtg tctttggctg gggaggcatc cttgcttgtg 10081 ggaacagagg ggtgggtaca tatctgaagg tgcatctgaa ctcttggctc ccacagaacg 10141 cccagcgctt tggaaaccac atccgcaaag ccctgctgga cattgctgat cttttccaag 10201 ttcccaaggc ctacagctga aggttggaga aatgccagct gccctttcgt cccacactgt 10261 ggaggaaggg acctgtggca gctcacaggc atgaggggtg gccgtgcaca ggtgcccagg 10321 ctccaaggac agctccggca gcaggtcctc gctgggcaga tgctgctccc tgagggccca 10381 ggtggtggag gtggggttgg agcaggaagg gaattttgat tttttttttt cttgatagat 10441 actaataaaa ataaggctgt gtaattttct ctcagccctt aggtacctgt gttttgtttg 10501 ggaactcgga ggccctcccc ctcccccagc tcagaccaca gaggtggcaa gagaagggct 10561 gaagctggaa gactgttcat gagggacttg tgtgacctgc tttgaaatgt gtgactctgc 10621 tgagtgacgt aggctctgag atagctgtcc acgcccacgt gtttgcttgg aataaatact 10681 tgcctcagaa ccttcacctg ttccctgggg ccatttctgt ttgtctgtct gctggagagt 10741 gagcccacct cctcccatgc aggggcatgt gtgaggcacc cctcttggct gaggcacaac 10801 cctgcacggg ccccgcagct tcttgtcagc atggaaaggg gctgcaggcc aggcctgaag 10861 tcttgaggcg ccaggtggtc atgtctgcaa gagggcacgc aggatgacct gtagcaacag 10921 acactccttc actgaggttg gctcggctgc taaaatcatg ttaagagtaa atacaaaata 10981 attattcttt gtttttaagg cggagtttcg ctcttgttgc ccaggctgga gtgcagtggc 11041 acgatctcga ctcactacaa cctctgcctc ctgggttcaa gcgattctcg tacctcagct 11101 cctgagtagc tgggacaaca ggcgcccgcc atcacgcccg gctaattttt tgtattttta 11161 gtagaggcag ggttttacca tgttggccag gctgtctc // LOCUS AB003476 6287 bp mRNA PRI 12-MAY-1997 DEFINITION Human mRNA for gravin, complete cds. ACCESSION AB003476 NID g2081606 KEYWORDS gravin. SOURCE Homo sapiens Umbilical Vein Endothelial Cell cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6287) AUTHORS Kokame,K. TITLE Direct Submission JOURNAL Submitted (30-APR-1997) to the DDBJ/EMBL/GenBank databases. Koichi Kokame, National Cardiovascular Center Research Institute, Department of Etiology and Pathogenesis; Fujishirodai 5-7-1, Suita, Osaka 565, Japan (E-mail:kame@ri.ncvc.go.jp, Tel:+81-6-833-5012, Fax:+81-6-872-8091) REFERENCE 2 (sites) AUTHORS Sato,N., Kokame,K., Shimokado,K., Kato,H. and Miyata,T. TITLE Changes of cell growth-related and other gene expression by atherogenic lipid, lysophosphatidylcholine, in cultured human umbilical vein endothelial ce lls JOURNAL Unpublished (1997) FEATURES Location/Qualifiers source 1..6287 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="Endothelial Cell" /tissue_type="Umbilical Vein" CDS 174..5228 /codon_start=1 /product="gravin" /db_xref="PID:d1020716" /db_xref="PID:g2081607" /translation="MLGTITITVGQRDSEDVSKRDSDKEMATKSAVVHDITDDGQEET PEIIEQIPSSESNLEELTQPTESQANDIGFKKVFKFVGFKFTVKKDKTEKPDTVQLLT VKKDEGEGAAGAGDHKDPSLGAGEAASKESEPKQSTEKPEETLKREQSHAEISPPAES GQAVEECKEEGEEKQEKEPSKSAESPTSPVTSETGSTFKKFFTQGWAGWRKKTSFRKP KEDEVEASEKKKEQEPEKVDTEEDGKAEVASEKLTASEQAHPQEPAESAHEPRLSAEY EKVELPSEEQVSGSQGPSEEKPAPLATEVFDEKIEVHQEEVVAEVHVSTVEERTEEQK TEVEETAGSVPAEELVEMDAEPQEAEPAKELVKLKETCVSGEDPTQGADLSPDEKVLS KPPEGVVSEVEMLSSQERMKVQGSPLKKLFTSTGLKKLSGKKQKGKRGGGDEESGEHT QVPADSPDSQEEQKGESSASSPEEPEEITCLEKGLAEVQQDGEAEEGATSDGEKKREG VTPWASFKKMVTPKKRVRRPSESDKEDELDKVKSATLSSTESTASEMQEEMKGSVEEP KPEEPKRKVDTSVSWEALICVGSSKKRARRGSSSDEEGGPKAMGGDHQKADEAGKDKE TGTDGILAGSQEHDPGQGSSSPEQAGSPTEGEGVSTWESFKRLVTPRKKSKSKLEEKS EDSIAGSGVEHSTPDTEPGKEESWVSIKKFIPGRRKKRPDGKQEQAPVEDAGPTGANE DDSDVPAVVPLSEYDAVEREKMEAQQAQKSAEQPEQKAATEVSKELSESQVHMMAAAV ADGTRAATIIEERSPSWISASVTEPLEQVEAEAALLTEEVLEREVIAEEEPPTVTEPL PENREARGDTVVSEAELTPEAVTAAETAGPLGAEEGTEASAAEETTEMVSAVSQLTDS PDTTEEATPVQEVEGGVPDIEEQERRTQEVLQAVAEKVKEESQLPGTGGPEDVLQPVQ RAEAERPEEQAEASGLKKETDVVLKVDAQEAKTEPFTQGKVVGQTTPESFEKAPQVTE SIESSELVTTCQAETLAGVKSQEMVMEQAIPPDSVETPTDSETDGSTPVADFDAPGTT QKDEIVEIHEENEVASGTQSGGTEAEAVPAQKERPPAPSSFVFQEETKEQSKMEDTLE HTDKEVSVETVSILSKTEGTQEADQYADEKTKDVPFFEGLEGSIDTGITVSREKVTEV ALKGEGTEEAECKKDDALELQSHAKSPPSPVEREMVVQVEREKTEAEPTHVNEEKLEH ETAVTVSEEVSKQLLQTVNVPIIDGAKEVSSLEGSPPPCLGQEEAVCTKIQVQSSEAS FTLTAAAEEEKVLGETANILETGETLEPAGAHLVLEEKSSEKNEDFAAHPGEDAVPTG PDCQAKSTPVIVSATTKKGLSSDLEGEKTTSLKWKSDEVDEQVACQEVKVSVAIEDLE PENGILELETKSSKLVQNIIQTAVDQFVRTEETATEMLTSELQTQAHVIKADSQDAGQ ETEKEGEEPLASAQDETPITSAKEESESTAVGQAHSDISKDMSEASEKTMTVEVEGST VNDQQLEEVVLPSEEEGGGAGTKSVPEDDGHALLAERIEKSLVEPKEDEKGDDVDDPE NQNSALADTDASGGLTKESPDTNGPKQKEKEDAQEVELQEGKVHSESDKAITPQAQEE LQKQERESAKSELTES" BASE COUNT 2021 a 1313 c 1710 g 1243 t ORIGIN 1 ggcagctccg agggcacctc cggttctccc ccatcctccg ggagtgtctg ggcgctcagt 61 ccgctctgat cccgccgaaa ccacctgcgg ttggcaggca ggagactagg cgtctgccgg 121 ggagggcagg gacccgctaa gctgatctcc tgtacagtag tgctacttaa aatatgctgg 181 ggaccatcac catcacagtt ggacagagag actctgaaga tgtgagcaaa agagactccg 241 ataaagagat ggctactaag tcagcggttg ttcacgacat cacagatgat gggcaggagg 301 agacacccga aataatcgaa cagattcctt cttcagaaag caatttagaa gagctaacac 361 aacccactga gtcccaggct aatgatattg gatttaagaa ggtgtttaag tttgttggct 421 ttaaattcac tgtgaaaaag gataagacag agaagcctga cactgtccag ctactcactg 481 tgaagaaaga tgaaggggag ggagcagcag gggctggcga ccacaaggac cccagccttg 541 gggctggaga agcagcatcc aaagaaagcg aacccaaaca atctacagag aaacccgaag 601 agaccctgaa gcgtgagcaa agccacgcag aaatttctcc cccagccgaa tctggccaag 661 cagtggagga atgcaaagag gaaggagaag agaaacaaga aaaagaacct agcaagtctg 721 cagaatctcc gactagtccc gtgaccagtg aaacaggatc aaccttcaaa aaattcttca 781 ctcaaggttg ggccggctgg cgcaaaaaga ccagtttcag gaagccgaag gaggatgaag 841 tggaagcttc agagaagaaa aaggaacaag agccagaaaa agtagacaca gaagaagacg 901 gaaaggcaga ggttgcctcc gagaaactga ccgcctccga gcaagcccac ccacaggagc 961 cggcagaaag tgcccacgag ccccggttat cagctgaata tgagaaagtt gagctgccct 1021 cagaggagca agtcagtggc tcgcagggac cttctgaaga gaaacctgct ccgttggcga 1081 cagaagtgtt tgatgagaaa atagaagtcc accaagaaga ggttgtggcc gaagtccacg 1141 tcagcaccgt ggaggagaga accgaagagc agaaaacgga ggtggaagaa acagcagggt 1201 ctgtgccagc tgaagaattg gttgaaatgg atgcagaacc tcaggaagct gaacctgcca 1261 aggagctggt gaagctcaaa gaaacgtgtg tttccggaga ggaccctaca cagggagctg 1321 acctcagtcc tgatgagaag gtgctgtcca aaccccccga aggcgttgtg agtgaggtgg 1381 aaatgctgtc atcacaggag agaatgaagg tgcagggaag tccactaaag aagcttttta 1441 ccagcactgg cttaaaaaag ctttctggaa agaaacagaa agggaaaaga ggaggaggag 1501 acgaggaatc aggggagcac actcaggttc cagccgattc tccggacagc caggaggagc 1561 aaaagggcga gagctctgcc tcatcccctg aggagcccga ggagatcacg tgtctggaaa 1621 agggcttagc cgaggtgcag caggatgggg aagctgaaga aggagctact tccgatggag 1681 agaaaaaaag agaaggtgtc actccctggg catcattcaa aaagatggtg acgcccaaga 1741 agcgtgttag acggccttcg gaaagtgata aagaagatga gctggacaag gtcaagagcg 1801 ctaccttgtc ttccaccgag agcacagcct ctgaaatgca agaagaaatg aaagggagcg 1861 tggaagagcc aaagccggaa gaaccaaagc gcaaggtgga tacctcagta tcttgggaag 1921 ctttaatttg tgtgggatca tccaagaaaa gagcaaggag agggtcctct tctgatgagg 1981 aagggggacc aaaagcaatg ggaggagacc accagaaagc tgatgaggcc ggaaaagaca 2041 aagagacggg gacagacggg atccttgctg gttcccaaga acatgatcca gggcagggaa 2101 gttcctcccc ggagcaagct ggaagcccta ccgaagggga gggcgtttcc acctgggagt 2161 catttaaaag gttagtcacg ccaagaaaaa aatcaaagtc caagctggaa gagaaaagcg 2221 aagactccat agctgggtct ggtgtagaac attccactcc agacactgaa cccggtaaag 2281 aagaatcctg ggtctcaatc aagaagttta ttcctggacg aaggaagaaa aggccagatg 2341 ggaaacaaga acaagcccct gttgaagacg cagggccaac aggggccaac gaagatgact 2401 ctgatgtccc ggccgtggtc cctctgtctg agtatgatgc tgtagaaagg gagaaaatgg 2461 aggcacagca agcccaaaaa agcgcagagc agcccgagca gaaggcagcc actgaggtgt 2521 ccaaggagct cagcgagagt caggttcata tgatggcagc agctgtcgct gacgggacga 2581 gggcagctac cattattgaa gaaaggtctc cttcttggat atctgcttca gtgacagaac 2641 ctcttgaaca agtagaagct gaagccgcac tgttaactga ggaggtattg gaaagagaag 2701 taattgcaga agaagaaccc cccacggtta ctgaacctct gccagagaac agagaggccc 2761 ggggcgacac ggtcgttagt gaggcggaat tgacccccga agctgtgaca gctgcagaaa 2821 ctgcagggcc attgggtgcc gaagaaggaa ccgaagcatc tgctgctgaa gagaccacag 2881 aaatggtgtc agcagtctcc cagttaaccg actccccaga caccacagag gaggccactc 2941 cggtgcagga ggtggaaggt ggcgtacctg acatagaaga gcaagagagg cggactcaag 3001 aggtcctcca ggcagtggca gaaaaagtga aagaggaatc ccagctgcct ggcaccggtg 3061 ggccagaaga tgtgcttcag cctgtgcaga gagcagaggc agaaagacca gaagagcagg 3121 ctgaagcgtc gggtctgaag aaagagacgg atgtagtgtt gaaagtagat gctcaggagg 3181 caaaaactga gccttttaca caagggaagg tggtggggca gaccacccca gaaagctttg 3241 aaaaagctcc tcaagtcaca gagagcatag agtccagtga gcttgtaacc acttgtcaag 3301 ccgaaacctt agctggggta aaatcacagg agatggtgat ggaacaggct atcccccctg 3361 actcggtgga aacccctaca gacagtgaga ctgatggaag cacccccgta gccgactttg 3421 acgcaccagg cacaacccag aaagacgaga ttgtggaaat ccatgaggag aatgaggtcg 3481 catctggtac ccagtcaggg ggcacagaag cagaggcagt tcctgcacag aaagagaggc 3541 ctccagcacc ttccagtttt gtgttccagg aagaaactaa agaacaatca aagatggaag 3601 acactctaga gcatacagat aaagaggtgt cagtggaaac tgtatccatt ctgtcaaaga 3661 ctgaggggac tcaagaggct gaccagtatg ctgatgagaa aaccaaagac gtaccatttt 3721 tcgaaggact tgaggggtct atagacacag gcataacagt cagtcgggaa aaggtcactg 3781 aagttgccct taaaggtgaa gggacagaag aagctgaatg taaaaaggat gatgctcttg 3841 aactgcagag tcacgctaag tctcctccat cccccgtgga gagagagatg gtagttcaag 3901 tcgaaaggga gaaaacagaa gcagagccaa cccatgtgaa tgaagagaag cttgagcacg 3961 aaacagctgt taccgtatct gaagaggtca gtaagcagct cctccagaca gtgaatgtgc 4021 ccatcataga tggagcaaag gaagtcagca gtttggaagg aagccctcct ccctgcctag 4081 gtcaagagga ggcagtatgc accaaaattc aagttcagag ctctgaggca tcattcactc 4141 taacagcggc tgcagaggag gaaaaggtct taggagaaac tgccaacatt ttagaaacag 4201 gtgaaacgtt ggagcctgca ggtgcacatt tagttctgga agagaaatcc tctgaaaaaa 4261 atgaagactt tgccgctcat ccaggggaag atgctgtgcc cacagggccc gactgtcagg 4321 caaaatcgac accagtgata gtatctgcta ctaccaagaa aggcttaagt tccgacctgg 4381 aaggagagaa aaccacatca ctgaagtgga agtcagatga agtcgatgag caggttgctt 4441 gccaggaggt caaagtgagt gtagcaattg aggatttaga gcctgaaaat gggattttgg 4501 aacttgagac caaaagcagt aaacttgtcc aaaacatcat ccagacagcc gttgaccagt 4561 ttgtacgtac agaagaaaca gccaccgaaa tgttgacgtc tgagttacag acacaagctc 4621 acgtgataaa agctgacagc caggacgctg gacaggaaac ggagaaagaa ggagaggaac 4681 ctctggcctc tgcacaggat gaaacaccaa ttacttcagc caaagaggag tcagagtcaa 4741 ccgcagtggg acaagcacat tctgatattt ccaaagacat gagtgaagcc tcagaaaaga 4801 ccatgactgt tgaggtagaa ggttccactg taaatgatca gcagctggaa gaggtcgtcc 4861 tcccatctga ggaagaggga ggtggagctg gaacaaagtc tgtgccagaa gatgatggtc 4921 atgccttgtt agcagaaaga atagagaagt cactagttga accgaaagaa gatgaaaaag 4981 gtgatgatgt tgatgaccct gaaaaccaga actcagccct ggctgatact gatgcctcag 5041 gaggcttaac caaagagtcc ccagatacaa atggaccaaa acaaaaagag aaggaggatg 5101 cccaggaagt agaattgcag gaaggaaaag tgcacagtga atcagataaa gcgatcacac 5161 cccaagcaca ggaggagtta cagaaacaag agagagaatc tgcaaagtca gaacttacag 5221 aatcttaaaa catcatgcag ttaaactcat tgtctgtttg gaagaccaga atgtgaagac 5281 aagtagtaga agaaaatgaa tgctgctgct gagactgaag accagtattt cagaactttg 5341 agaattggag agcaggcaca tcaactgatc tcatttctag agagcccctg acaatcctga 5401 ggcttcatca ggagctagag ccatttaaca tttcctcttt ccaagaccaa cctacaattt 5461 tcccttgata accatataaa ttctgattta aggtcctaaa ttcttaacct ggaactggag 5521 ttggcaatac ctagttctgc ttctgaaact ggagtatcat tctttacata tttatatgta 5581 tgttttaagt agtcctcctg tatctattgt atattttttt cttaatgttt aaggaaatgt 5641 gcaggatact acatgctttt tgtatcacac agtatatgat ggggcatgtg ccatagtgca 5701 ggcttgggga gctttaagcc tcagttatat aacccacgaa aaacagagcc tcctagatgt 5761 aacattcctg atcaaggtac aattctttaa aattcactaa tgattgaggt ccatatttag 5821 tggtactctg aaattggtca ctttcctatt acacggagtg tgctaaaact aaaaagcatt 5881 ttgaaacata cagaatgttc tattgtcatt gggaaatttt tctttctaac ccagtggagg 5941 ttagaaagaa gttatattct ggtagcaaat taactttaca tcctttttcc tacttgttat 6001 ggttgtttgg accgataagt gtgcttaatc ctgaggcaaa gtagtgaata tgttttatat 6061 gttatgaaga aaagaattgt tgtaagtttt tgattctact cttatatgct ggactgcatt 6121 cacacatggc atgaaataag tcaggttctt tacaaatggt attttgatag atactggatt 6181 gtgtttgtgc catatttgtg ccattctttt aagaacaatg ttgcaacaca ttcatttgga 6241 taagttgtga tttgacgact gatttaaata aaatatttgc ttcactt // LOCUS AB003698 3187 bp mRNA PRI 15-AUG-1997 DEFINITION Homo sapiens mRNA for Cdc7-related kinase, complete cds. ACCESSION AB003698 NID g2102636 KEYWORDS Cdc7-related kinase. SOURCE Homo sapiens adult, fetal testis, liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3187) AUTHORS Sato,N. TITLE Direct Submission JOURNAL Submitted (08-MAY-1997) to the DDBJ/EMBL/GenBank databases. Noriko Sato, Institute of Medical Science, University of Tokyo, Molecular and Developmental Biology; 4-6-1 Shirokanedai, Minato-ku, Tokyo 108, Japan (E-mail:nrksato@hgc.ims.u-tokyo.ac.jp, Tel:81-3-5449-5661, Fax:81-3-5449-5424) REFERENCE 2 (sites) AUTHORS Sato,N., Arai,K. and Masai,H. TITLE Human and Xenopus cDNAs encoding budding yeast Cdc7-related kinases: in vitro phosphorylation of MCM subunits by a putative human homologue of Cdc7 JOURNAL EMBO J. 16 (14), 4340-4351 (1997) MEDLINE 97392464 FEATURES Location/Qualifiers source 1..3187 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1p22" /dev_stage="adult, fetal" /tissue_type="testis, liver" CDS 133..1857 /codon_start=1 /product="Cdc7-related kinase" /db_xref="PID:d1020752" /db_xref="PID:g2102637" /translation="MEASLGIQMDEPMAFSPQRDRFQAEGSLKKNEQNFKLAGVKKDI EKLYEAVPQLSNVFKIEDKIGEGTFSSVYLATAQLQVGPEEKIALKHLIPTSHPIRIA AELQCLTVAGGQDNVMGVKYCFRKNDHVVIAMPYLEHESFLDILNSLSFQEVREYMLN LFKALKRIHQFGIVHRDVKPSNFLYNRRLKKYALVDFGLAQGTHDTKIELLKFVQSEA QQERCSQNKSHIITGNKIPLSGPVPKELDQQSTTKASVKRPYTNAQIQIKQGKDGKEG SVGLSVQRSVFGERNFNIHSSISHESPAVKLMKQSKTVDVLSRKLATKKKAISTKVMN SAVMRKTASSCPASLTCDCYATDKVCSICLSRRQQVAPRAGTPGFRAPEVLTKCPNQT TAIDMWSAGVIFLSLLSGRYPFYKASDDLTALAQIMTIRGSRETIQAAKTFGKSILCS KEVPAQDLRKLCERLRGMDSSTPKLTSDIQGHASHQPAISEKTDHKASCLVQTPPGQY SGNSFKKGDSNSCEHCFDEYNTNLEGWNEVPDEAYDLLDKLLDLNPASRITAEEALLH PFFKDMSL" polyA_site 3187 /note="20 A nucleotides" BASE COUNT 1003 a 549 c 664 g 971 t ORIGIN 1 gaattcggca cgagttggag acggcgaccc aggcatctgg ggagcacaga agtcgtactc 61 ccttaaaccc tgctttgctc cccctgtgga tgtaacccct tagctggcat tttgcatctc 121 aattggcttg tgatggaggc gtctttgggg attcagatgg atgagccaat ggctttttct 181 ccccagcgtg accggtttca ggctgaaggc tctttaaaaa aaaacgagca gaattttaaa 241 cttgcaggtg ttaaaaaaga tattgagaag ctttatgaag ctgtaccaca gcttagtaat 301 gtgtttaaga ttgaggacaa aattggagaa ggcactttca gctctgttta tttggccaca 361 gcacagttac aagtaggacc tgaagagaaa attgctctaa aacacttgat tccaacaagt 421 catcctataa gaattgcagc tgaacttcag tgcctaacag tggctggggg gcaagataat 481 gtcatgggag ttaaatactg ctttaggaag aatgatcatg tagttattgc tatgccatat 541 ctggagcatg agtcgttttt ggacattctg aattctcttt cctttcaaga agtacgggaa 601 tatatgctta atctgttcaa agctttgaaa cgcattcatc agtttggtat tgttcaccgt 661 gatgttaagc ccagcaattt tttatataat aggcgcctga aaaagtatgc cttggtagac 721 tttggtttgg cccaaggaac ccatgatacg aaaatagagc ttcttaaatt tgtccagtct 781 gaagctcagc aggaaaggtg ttcacaaaac aaatcccaca taatcacagg aaacaagatt 841 ccactgagtg gcccagtacc taaggagctg gatcagcagt ccaccacaaa agcttctgtt 901 aaaagaccct acacaaatgc acaaattcag attaaacaag gaaaagacgg aaaggaggga 961 tctgtaggcc tttctgtcca gcgctctgtt tttggagaaa gaaatttcaa tatacacagc 1021 tccatttcac atgagagccc tgcagtgaaa ctcatgaagc agtcaaagac tgtggatgta 1081 ctgtctagaa agttagcaac aaaaaagaag gctatttcta cgaaagttat gaatagtgct 1141 gtgatgagga aaactgccag ttcttgccca gctagcctga cctgtgactg ctatgcaaca 1201 gataaagttt gtagtatttg cctttcaagg cgtcagcagg ttgcccctag ggcaggtaca 1261 ccaggattca gagcaccaga ggtcttgaca aagtgcccca atcaaactac agcaattgac 1321 atgtggtctg caggtgtcat atttctttct ttgcttagtg gacgatatcc attttataaa 1381 gcaagtgatg atttaactgc tttggcccaa attatgacaa ttaggggatc cagagaaact 1441 atccaagctg ctaaaacttt tgggaaatca atattatgta gcaaagaagt tccagcacaa 1501 gacttgagaa aactctgtga gagactcagg ggtatggatt ctagcactcc caagttaaca 1561 agtgatatac aagggcatgc ttctcatcaa ccagctattt cagagaagac tgaccataaa 1621 gcttcttgcc tcgttcaaac acctccagga caatactcag ggaattcatt taaaaagggg 1681 gatagtaata gctgtgagca ttgttttgat gagtataata ccaatttaga aggctggaat 1741 gaggtacctg atgaagctta tgacctgctt gataaacttc tagatctaaa tccagcttca 1801 agaataacag cagaagaagc tttgttgcat ccatttttta aagatatgag cttgtgataa 1861 tggatcttca tttaatgttt actgttatga ggtagaataa aaaagaatac tttgtaatag 1921 ccacaagttc ttgtttagag accagagcag gattaataat ttattttaac attttagtgt 1981 ttggtggcac attctaaaat atagattaag aatacttaaa atgcctggga tagttcttgg 2041 gactaacaac atgatcttct ttgagttaaa cctacctaag tagattttag gtgggttcct 2101 attaggtcag atttttagct tccctaatta cctttcactg acatatacag aaaaaggagc 2161 agttttagtt ttaattaatt aaaattaaca gatgtgatga ggattaaatg aatcaaaaga 2221 cttaatttgt agattctttt agagttatga gctaggtata gtttggggaa actcaacctg 2281 gtgctggtgc tcttaacaat tttgtaaata aagaagataa tttccttttc tagaggtaca 2341 tattaggcct tttatgaaca ctaaaacaat gaggaaatgt tggtcatggg gcaaagtatc 2401 acttaaaatt gaattcatcc atttttaaaa aacacttcat gaaagcattc tggtgtgaat 2461 tgccattttt ttcttactgg cttctcaatt ttcttccttc tctgccccta cctaaaacat 2521 tctcctcgga aattacatgg tgctgaccac aaagtttctg gatgttttat taaatattgt 2581 acgtctttac agttgggaat ttaaaataat acatacactg gttgataaag ggaagctgca 2641 ggaccaaggt gaagattgat agtccaaatg cttttctttt ttgagttgta tatttttgga 2701 caccatctta gatataatta ggtagctgct gaaaggaaaa gtgaatacag aattgacggt 2761 attattggag atttttcctc tgcgtagagc catccagatc tctgtatcct gttttgacta 2821 agtcttaggt gggttgggaa gacagataat gaagtgtagg caaagagaaa aggacccaag 2881 atagaggttt atattcagaa atggtatata tcaatgacag catatcaaac ttcctatggg 2941 aaaaagtctg gtgggtggtc agctgacaga tttcccattt agtagtcata gaatacagaa 3001 atagtttagg gacatgtatt cattttgtta ttttgagcat tgataggtca gtatatctac 3061 ctaatctgtt tggtaagtat aggatatata aaccattacc attgatctgt cttatgccat 3121 aatcttaaaa aaaaattgaa tgctcttgaa tttgtatatt caataaagtt atccttttat 3181 atttttt // LOCUS AB004066 2922 bp mRNA PRI 05-AUG-1997 DEFINITION Homo sapiens mRNA for DEC1, complete cds. ACCESSION AB004066 NID g2308996 KEYWORDS DEC1. SOURCE Homo sapiens cartilage chondrocyte cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2922) AUTHORS Kawamoto,T. TITLE Direct Submission JOURNAL Submitted (20-MAY-1997) to the DDBJ/EMBL/GenBank databases. Takeshi Kawamoto, Hiroshima University School of Dentistry, Department of Biochemistry; 1-2-3 Kasumi Minami-ku, Hiroshima, Hiroshima 734, Japan (E-mail:tkawamo@ipc.hiroshima-u.ac.jp, Tel:082-257-5688, Fax:082-257-5629) REFERENCE 2 (sites) AUTHORS Shen,M., Kawamoto,T., Yan,W., Nakamasu,K., Tamagami,M., Koyano,Y., Noshiro,M. and Kato,Y. TITLE Molecular characterization of the novel basic helix-loop-helix protein DEC1 expressed in differentiated human embryo chondrocytes JOURNAL Biochem. Biophys. Res. Commun. 236 (2), 294-298 (1997) MEDLINE 97382424 FEATURES Location/Qualifiers source 1..2922 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="chondrocyte" /tissue_type="cartilage" CDS 197..1435 /function="transcription factor" /note="basic helix-loop-helix protein" /codon_start=1 /product="DEC1" /db_xref="PID:d1022575" /db_xref="PID:g2308997" /translation="MERIPSAQPPPACLPKAPGLEHGDLPGMYPAHMYQVYKSRRGIK RSEDSKETYKLPHRLIEKKRRDRINECIAQLKDLLPEHLKLTTLGHLEKAVVLELTLK HVKALTNLIDQQQQKIIALQSGLQAGELSGRNVETGQEMFCSGFQTCAREVLQYLAKH ENTRDLKSSQLVTHLHRVVSELLQGGTSRKPSDPAPKVMDFKEKPSSPAKGSEGPGKN CVPVIQRTFAHSSGEQSGSDTDTDSGYGGESEKGDLRSEQPCFKSDHGRRFTMGERIG AIKQESEEPPTKKNRMQLSDDEGHFTSSDLISSPFLGPHPHQPPFCLPFYLIPPSATA YLPMLEKCWYPTSVPVLYPGLNASAAALSSFMNPDKISAPLLMPQRLPSPLPAHPSVD SSVLLQALKPIPPLNLETKD" polyA_signal 2896..2901 polyA_site 2922 /note="16 A nucleotides" BASE COUNT 742 a 751 c 715 g 714 t ORIGIN 1 ggacaccggg ccatgcacgc ccccaactga agctgcatct caaagccgaa gattccagca 61 gcccagggga tttcaaagag ctcagactca gaggaacatc tgcggagaga cccccgaagc 121 cctctccagg gcagtcctca tccagacgct ccgctagtgc agacaggagc gcgcagtggc 181 cccggctcgc cgcgccatgg agcggatccc cagcgcgcaa ccaccccccg cctgcctgcc 241 caaagcaccg ggactggagc acggagacct accagggatg taccctgccc acatgtacca 301 agtgtacaag tcaagacggg gaataaagcg gagcgaggac agcaaggaga cctacaaatt 361 gccgcaccgg ctcatcgaga aaaagagacg tgaccggatt aacgagtgca tcgcccagct 421 gaaggatctc ctacccgaac atctcaaact tacaactttg ggtcacttgg aaaaagcagt 481 ggttcttgaa cttaccttga agcatgtgaa agcactaaca aacctaattg atcagcagca 541 gcagaaaatc attgccctgc agagtggttt acaagctggt gagctgtcag ggagaaatgt 601 cgaaacaggt caagagatgt tctgctcagg tttccagaca tgtgcccggg aggtgcttca 661 gtatctggcc aagcacgaga acactcggga cctgaagtct tcgcagcttg tcacccacct 721 ccaccgggtg gtctcggagc tgctgcaggg tggtacctcc aggaagccat cagacccagc 781 tcccaaagtg atggacttca aggaaaaacc cagctctccg gccaaaggtt cggaaggtcc 841 tgggaaaaac tgcgtgccag tcatccagcg gactttcgct cactcgagtg gggagcagag 901 cggcagcgac acggacacag acagtggcta tggaggagaa tcggagaagg gcgacttgcg 961 cagtgagcag ccgtgcttca aaagtgacca cggacgcagg ttcacgatgg gagaaaggat 1021 cggcgcaatt aagcaagagt ccgaagaacc ccccacaaaa aagaaccgga tgcagctttc 1081 ggatgatgaa ggccatttca ctagcagtga cctgatcagc tccccgttcc tgggcccaca 1141 cccacaccag cctcctttct gcctgccctt ctacctgatc ccaccttcag cgactgccta 1201 cctgcccatg ctggagaagt gctggtatcc cacctcagtg ccagtgctat acccaggcct 1261 caacgcctct gccgcagccc tctctagctt catgaaccca gacaagatct cggctccctt 1321 gctcatgccc cagagactcc cttctccctt gccagctcat ccgtccgtcg actcttctgt 1381 cttgctccaa gctctgaagc caatcccccc tttaaactta gaaaccaaag actaaactct 1441 ctaggggatc ctgctgcttt gctttccttc ctcgctactt cctaaaaagc aacaaaaaag 1501 tttttgtgaa tgctgcaaga ttgttgcatt gtgtatactg agataatctg aggcatggag 1561 agcagattca gggtgtgtgt gtgtgtgtgt gtgtgtgtgt gtatgtgcgt gtgcgtgcac 1621 atgtgtgcct gcgtgttggt ataggacttt aaagctcctt ttggcatagg gaagtcacga 1681 aggattgctt gacatcagga gacttggggg ggattgtagc agacgtctgg gcttttcccc 1741 acccagagaa tagccccctt cgatacacat cagctggatt ttcaaaagct tcaaagtctt 1801 ggtctgtgag tcactcttca gtttgggagc tgggtctgtg gctttgatca gaaggtactt 1861 tcaaaagagg gctttccagg gctcagctcc caaccagctg ttaggacccc acccttttgc 1921 ctttattgtc gacgtgactc accagacgtc ggggagagag agcagtcaga ccgagctttc 1981 tgctaacatg gggaggtagc aggcactggc atagcacggt agtggtttgg ggaggtttcc 2041 gcaggtctgc tccccacccc tgcctcggaa gaataaagag aatgtagttc cctactcagg 2101 ctttcgtagt gattagctta ctaaggaact gaaaatgggc cccttgtaca agctgagctg 2161 ccccggaggg agggaggagt tccctgggct tctggcacct gtttctaggc ctaaccatta 2221 gtacttactg tgcagggaac caaaccaagg tctgagaaat gcggacaccc cgagcgagca 2281 ccccaaagtg cacaaagctg agtaaaaagc tgcccccttc aaacagaact agactcagtt 2341 ttcaattcca tcctaaaact ccttttaacc aagcttagct tctcaaaggc ctaaccaagc 2401 cttggcaccg ccagatcctt tctgtaggct aattcctctt gcccaacggc atatggagtg 2461 tccttattgc taaaaaggat tccgtctcct tcaaagaagt tttatttttg gtccagagta 2521 cttgttttcc cgatgtgtcc agccagctcc gcagcagctt ttcaagatgc actatgcctg 2581 attgctgatc gtgttttaac tttttctttt cctgttttta ttttggtatt aagtcgttgc 2641 ctttatttgt aaagctgtta taaatatata ttatataaat atattaaaaa ggaaaatgtt 2701 tcagatgttt atttgtataa ttacttgatt cacacagtga gaaaaaatga atgtattcct 2761 gtttttgaag agaagaataa tttttttttc tctagggaga ggtacagtgt ttatattttg 2821 gagccttcct gaaggtgtaa aattgtaaat atttttatct atgagtaaat gttaagtagt 2881 tgttttaaaa tacttaataa aataattctt ttcctgtgga ag // LOCUS AB004885 4299 bp mRNA PRI 10-JAN-1998 DEFINITION Homo sapiens mRNA for PKU-beta, complete cds. ACCESSION AB004885 NID g2217932 KEYWORDS PKU-beta. SOURCE Homo sapiens tissue_lib:testis placenta cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Yamakawa,A., Kameoka,Y., Hashimoto,K., Yoshitake,Y., Nishikawa,K., Tanihara,K. and Date,T. TITLE cDNA cloning and chromosomal mapping of genes encoding novel protein kinases termed PKU-alpha and PKU-beta, which have nuclear localization signal JOURNAL Gene 202 (1-2), 193-201 (1997) MEDLINE 98087437 REFERENCE 2 (bases 1 to 4299) AUTHORS Date,T. TITLE Direct Submission JOURNAL Submitted (18-JUN-1997) to the DDBJ/EMBL/GenBank databases. Takayasu Date, Kanazawa Medical University, Biochemistry; Daigaku, Uchinada, Ishikawa 920-02, Japan (E-mail:date@kanazawa-med.ac.jp, Tel:0762-86-2211(ex.3701), Fax:0762-86-4693) FEATURES Location/Qualifiers source 1..4299 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="8" /map="8q12-22" /tissue_lib="testis placenta" mRNA 1..12 CDS 213..2576 /codon_start=1 /product="PKU-beta" /db_xref="PID:d1021398" /db_xref="PID:g2217933" /translation="MSVQSSSGSLEGPPSWSQLSTSPTPGSAAAARSLLNHTPPSGRP REGAMDELHSLDPRRQELLEARFTGVASGSTGSTGSCSVGAKASTNNESSNHSFGSLE SLSDKESETPEKKQSESSRGRKRKAENQNESSQGFPNLPVFQSLAYWEMGRTAGGKSI GGRGHKISDYFEYQGGNGSSPVRGIPPAIRSPQNSHSHSTPSSSVRPNSPSPTALAFG DHPIVQPKQLSFKIIQTDLTMLKLAALESNKILDLEKKEGRIDDLLRANCDLRRQIDE QQKLLDKYKERLNKCISMSKKLLIEKSTQEKLSSREKSMQDRLRLGHFTTVRHGASFT EQWTDGFAFQNLVKQQEWVNQQREDIERQRKLLAKRKPPTANNSQAPSTNSEPKQRKN KAVNGAENDPFVRPNLPQLLTLAEYHEQEEIFKLRLGHLKKEEAGIQAELERLERVRN LHIRELKRIHNEDNSQFKDHPTLNERYLLLHLLGRGGFSEVDKAFDLSEQRYAAVKIH QLNKSWRDEKKENYHKHACREYRIHKELDHPRIVKLYDYFSLDTDTFCTVLEYCEGND LDFYLKQHKLMSEKEARSIVMQIVNALRYLNEIKPPIIHYDLKPGNILLVDGTACGEI KITDFGLSKIMDDDSYGVDGMVLTSQGAGTYWYLPPECFVVGKEPPKISNKVDVWSVG VIFYQCLYGRKPFGHNQSQQDILQENTILKATEVQFPVKPVVSSEAKAFIRRCLAYRK EDRFDVHQLACDPYLLPHMRRSNSSGNLHMAGLTASPTPPSSSIITY" BASE COUNT 1340 a 824 c 895 g 1240 t ORIGIN 1 gcgggcctcc ctgtaccctc tccttccctc acctctttcc tccccacctc ccctcttctc 61 ggtttcctcc cccatcccct tgactctccc ctcccagccc tcgctctctc gctcgccctc 121 agcggggccc ccgccatgac ggaggcgggt gccggtgccg ttgccgccgc tgccgtcgca 181 gggggggagt cgggttccca gaaagtagct tgatgagtgt ccaaagtagc agtggaagtt 241 tggaggggcc gccatcttgg tcccagctct ccacgtctcc aaccccgggc tcggcggcgg 301 cggccaggtc cctgctgaat cacacgccgc catccgggag gcccagggaa ggtgcaatgg 361 atgagcttca tagtctggat ccaagaaggc aagagttatt ggaagctaga tttactggag 421 ttgcaagtgg gagcactgga agtacgggca gttgcagtgt tggagctaaa gcctcaacaa 481 ataacgaaag ctctaatcac agttttggaa gcttggaatc tttaagtgat aaagaatcag 541 agacaccgga gaagaaacaa tcggaatcat ccaggggaag aaagagaaaa gcagaaaacc 601 agaatgaaag tagtcagggg ttccccaacc tcccggtctt ccagtccttg gcctattggg 661 aaatgggtcg tacagcagga ggaaaaagta ttgggggacg tggccacaaa attagcgact 721 attttgaata ccagggtgga aatggctcaa gtccagtaag aggcatacct cctgcaatcc 781 gttctcctca aaattcacat tcacattcca ctccttcctc atctgttcga ccgaatagcc 841 cttctcctac tgcattagca tttggggacc accctattgt acaaccaaag caattatcct 901 ttaaaattat tcagactgat ctcacgatgc tgaaattagc agcattagaa agtaataaaa 961 tcctagacct ggaaaagaag gagggacgta tagatgattt gctcagggcc aactgtgatc 1021 tcagacggca aatagatgaa caacaaaaat tacttgacaa atacaaggaa cgattaaata 1081 agtgcatatc aatgagcaag aaacttctta tagaaaagag tacacaagaa aagctgtcaa 1141 gcagagagaa gagtatgcaa gatcgattac gcctcgggca ctttacaaca gttagacatg 1201 gcgcttcatt tactgaacaa tggacagatg gttttgcatt tcagaatctt gtgaagcaac 1261 aagaatgggt gaatcagcaa agggaagata ttgaaaggca aaggaaactt ctagccaaac 1321 gcaaacctcc cacagctaat aattctcagg caccctctac caattctgag ccaaaacaaa 1381 ggaaaaacaa agcagtcaat ggagctgaga atgatccctt tgttagacca aatttgccac 1441 aactgttgac gttggcagaa tatcatgaac aggaagaaat tttcaaactt agactaggac 1501 atctcaaaaa ggaagaggca ggaatccagg cagaacttga acgtttggaa agagtcagga 1561 atcttcacat acgtgagctg aaaagaatac acaatgaaga taattcacag ttcaaagatc 1621 atccaacatt aaatgaaaga tatttattac ttcatctgct tggtagaggt ggctttagtg 1681 aagtggataa ggcttttgac ctttctgaac aaagatatgc tgctgtgaag atacatcagc 1741 ttaataaaag ctggagagat gagaagaaag aaaactacca caaacatgcc tgcagagagt 1801 atagaataca caaagaactg gatcacccca gaatagttaa actctatgat tatttctcct 1861 tggatacaga tacgttttgt acagtgttag aatactgtga aggcaatgac ttggatttct 1921 atctgaagca acacaagtta atgtcagaga aagaagctcg gtctattgta atgcagattg 1981 taaatgcatt aagatatctc aatgagatca aaccccctat tatacattat gatcttaagc 2041 caggaaacat cctactggta gatggaacag catgtggtga aatcaaaatc actgattttg 2101 gtctgtccaa gattatggat gatgatagct atggtgtaga tggaatggtt ctaacttccc 2161 agggggcagg cacttactgg tatttacctc ctgagtgttt tgtggttgga aaagagccac 2221 caaagatttc caacaaggtt gatgtatggt cggttggagt catcttctat cagtgtcttt 2281 atggtagaaa gccatttggt cacaatcaat ctcaacaaga cattcttcaa gagaatacaa 2341 tattaaaagc cacagaagtc cagttccctg taaaaccggt tgtaagcagt gaagccaagg 2401 catttattag acgctgtttg gcatatcgaa aagaagatcg atttgatgtg caccagctgg 2461 catgtgaccc ataccttctc ccacacatgc gaagatcaaa ttcttcagga aacctacaca 2521 tggctgggct gacagcatcc cctacacccc cttcttcaag cataattact tactgacttt 2581 cctccaagat tggcatgata tctttgaatt tgcttccaga tgcacactta agtttgagag 2641 catttgagtg tttgttttct ttttcttttt tttttttttt ttttttttta cacaagacgt 2701 ggttaagaac tgtttgtgaa ctgaagttcc tcatagtgtc atttgtatga gagaggatca 2761 tggacaatga ataaatgcac acttctgact ataattttga gcagtgaagg aaggatgact 2821 gcctgttaca caggaacagg aataccatgt taagatagag gaaaaaaatt ttgtttattt 2881 gggggaacac tgtaaataac gtttataata cttaaataca tttggttgtt actagagagt 2941 tctaatatga agggtagttt ttcattattt aaaaacacat ggataaatat gagacatatt 3001 tctaacatac gaactatagt gcctggatac attcttcagc atttggcagt taatctgctg 3061 aaaccaaagt aagaactgaa tgacagtgac agttgtgttt tatagtaata gaatgtgaca 3121 gttagaatct ttggacaaag ttaggctttg tcatttgcct attcagactt ttaccaagtt 3181 ggtcctcgag atcacatgtc cgttttcttt cactggttat attaaaggga gagctattat 3241 ttctaaatac tttacacctt tgacaagaat gagctgcttg tgttgaaatc agttgaattc 3301 actaaattat aatattgcca gtgttttcaa cacgttagaa ttgataacag gtatttttct 3361 tttgttacca ggccttgttc atcaaaacag gtgacagtga gtaaccaaat gcagaccagt 3421 tctatgcagg ataatgataa attcccttct agctctattg taaattgttg tacagatgtc 3481 atatttcaat tactaagttt cagcagctta ttcttgtaca aatgtttaaa atatggcttt 3541 tctaattgga tattgtattt tttttaagtc ttcttgaagg cactctaatt gctgctaaat 3601 gatgttgctt ctttgctaaa aaattaaatg acctagaagg tgcaagttga ctgtacatca 3661 tagatattaa aaagcagaca gtcattaaca atcaagatgt aaaagtgatc catgttggac 3721 ataattgagt ttttaaatca gtttattggg ttctagggct gtagaacaca tagatactag 3781 atttttaatt tgttcatggt tattctacat ttctagaaag ttcttatcag caagatggtc 3841 ccataaggaa atttcttgtg gtttgtccag atttgttaaa atgtgaacgt tttctaactg 3901 cctcgtaggg tagaaaccaa tatttttcag gatgctgtat ttcactcctg taaggacttt 3961 ttctttattg tcacattcat aatgctgata cttacatgta agttgttcat gttgcagaaa 4021 aaggttatac gtaccacgga ttctctttaa tattgtacag ttaaactgta tttctgatgt 4081 tgcaagccat tttgtttttt cattggacat agacacattg tctctttaaa atgctgctga 4141 aattgtagag agtatagcca aaacagtaat aaacaatgac aggtaaaatg taaagactat 4201 gaaaattaca ttggaaggga gctttcaaga tggtaggata ttgactaact gagctccttc 4261 acttaaaaat cagctggata aattccttaa gtatacttc // LOCUS AB004903 704 bp mRNA PRI 22-SEP-1997 DEFINITION Homo sapiens mRNA for STAT induced STAT inhibitor-2, complete cds. ACCESSION AB004903 NID g2443360 KEYWORDS STAT induced STAT inhibitor-2. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Minamoto,S., Ikegame,K., Ueno,K., Narazaki,M., Naka,T., Yamamoto,H., Matsumoto,T., Saito,H., Hosoe,S. and Kishimoto,T. TITLE Cloning and functional analysis of new members of STAT induced STAT inhibitor (SSI) family : SSI-2 and SSI-3 JOURNAL Biochem. Biophys. Res. Comm. 237, 79-83 (1997) REFERENCE 2 (bases 1 to 704) AUTHORS Minamoto,S. TITLE Direct Submission JOURNAL Submitted (18-JUN-1997) to the DDBJ/EMBL/GenBank databases. Seijiro Minamoto, Osaka University, Internal Medicine III; Yamadaoka 2-2, Suita, Osaka 565, Japan (E-mail:minamoto@imed3.med.osaka-u.ac.jp, Tel:06-879-3833, Fax:06-879-3839) FEATURES Location/Qualifiers source 1..704 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 39..635 /function="inhibition of cytokine signal" /note="SSI-2" /codon_start=1 /evidence=experimental /product="STAT induced STAT inhibitor-2" /db_xref="PID:d1023294" /db_xref="PID:g2443361" /translation="MTLRCLEPSGNGGEGTRSQWGTAGSAEEPSPQAARLAKALRELG QTGWYWGSMTVNEAKEKLKEAPEGTFLIRDSSHSDYLLTISVKTSAGPTNLRIEYQDG KFRLDSIICVKSKLKQFDSVVHLIDYYVQMCKDKRTGPEAPRNGTVHLYLTKPLYTSA PSLQHLCRLTINKCTGAIWGLPLPTRLKDYLEEYKFQV" BASE COUNT 198 a 172 c 174 g 160 t ORIGIN 1 gggcggccac ctgtctttgc cgcggtgacc cttctctcat gaccctgcgg tgccttgagc 61 cctccgggaa tggcggggaa gggacgcgga gccagtgggg gaccgcgggg tcggcggagg 121 agccatcccc gcaggcggcg cgtctggcga aggccctgcg ggagctcggt cagacaggat 181 ggtactgggg aagtatgact gttaatgaag ccaaagagaa attaaaagag gcaccagaag 241 gaactttctt gattagagat agctcgcatt cagactacct actaacaata tctgttaaaa 301 catcagctgg accaactaat cttcgaatcg aataccaaga cggaaaattc agattggact 361 ctatcatatg tgtcaaatcc aagcttaaac aatttgacag tgtggttcat ctgatcgact 421 actatgttca gatgtgcaag gataagcgga caggtccaga agccccccgg aacggcactg 481 ttcaccttta tctgaccaaa ccgctctaca cgtcagcacc atctctgcag catctctgta 541 ggctcaccat taacaaatgt accggtgcca tctggggact gcctttacca acaagactaa 601 aagattactt ggaagaatat aaattccagg tataaatgtt tctctttttt taaacatgtc 661 tcacatagag tatctccgaa tgcagctatg taaaagagaa ccaa // LOCUS AB004904 850 bp mRNA PRI 22-SEP-1997 DEFINITION Homo sapiens mRNA for STAT induced STAT inhibitor-3, complete cds. ACCESSION AB004904 NID g2443362 KEYWORDS STAT induced STAT inhibitor-3. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Minamoto,S., Ikegame,K., Ueno,K., Narazaki,M., Naka,T., Yamamoto,H., Matsumoto,T., Saito,H., Hosoe,S. and Kishimoto,T. TITLE Cloning and functional analysis of new members of STAT induced STAT inhibitor (SSI) family : SSI-2 and SSI-3 JOURNAL Biochem. Biophys. Res. Comm. 237, 79-83 (1997) REFERENCE 2 (bases 1 to 850) AUTHORS Minamoto,S. TITLE Direct Submission JOURNAL Submitted (18-JUN-1997) to the DDBJ/EMBL/GenBank databases. Seijiro Minamoto, Osaka University, Internal Medicine III; Yamadaoka 2-2, Suita, Osaka 565, Japan (E-mail:minamoto@imed3.med.osaka-u.ac.jp, Tel:06-879-3833, Fax:06-879-3839) FEATURES Location/Qualifiers source 1..850 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 107..784 /function="inhibition of cytokine signal" /note="SSI-3" /codon_start=1 /evidence=experimental /product="STAT induced STAT inhibitor-3" /db_xref="PID:d1023295" /db_xref="PID:g2443363" /translation="MVTHSKFPAAGMSRPLDTSLRLKTFSSKSEYQLVVNAVRKLQES GFYWSAVTGGEANLLLSAEPAGTFLIRDSSDQRHFFALSVKTQSGTKNLRIQCEGGSF SLQSDPRSTQPVPRFDCVLKLVYHYMPPPGAPSFPSPPTEPSSEVPEQPSAQPLPGSP PRRAYYIYSGGEKIPLVLSRPLSSNVATLQHLCRKTVNGHLDSYEKVTQLPGPIREFL DQYDAPL" BASE COUNT 147 a 316 c 250 g 137 t ORIGIN 1 gcgccttcct ctccgcagcc ccccgggatg cggtagcggc cgctgtgcgg aggccgcgaa 61 gcagctgcag ccgccgccgc gcagatccac gctggctccg tgcgccatgg tcacccacag 121 caagtttccc gccgccggga tgagccgccc cctggacacc agcctgcgcc tcaagacctt 181 cagctccaag agcgagtacc agctggtggt gaacgcagtg cgcaagctgc aggagagcgg 241 cttctactgg agcgcagtga ccggcggcga ggcgaacctg ctgctcagtg ccgagcccgc 301 cggcaccttt ctgatccgcg acagctcgga ccagcgccac ttcttcgcgc tcagcgtcaa 361 gacccagtct gggaccaaga acctgcgcat ccagtgtgag gggggcagct tctctctgca 421 gagcgatccc cggagcacgc agcccgtgcc ccgcttcgac tgcgtgctca agctggtgta 481 ccactacatg ccgccccctg gagccccctc cttcccctcg ccacctactg aaccctcctc 541 cgaggtgccc gagcagccgt ctgcccagcc actccctggg agtcccccca gaagagccta 601 ttacatctac tccgggggcg agaagatccc cctggtgttg agccggcccc tctcctccaa 661 cgtggccact cttcagcatc tctgtcggaa gaccgtcaac ggccacctgg actcctatga 721 gaaagtcacc cagctgccgg ggcccattcg ggagttcctg gaccagtacg atgccccgct 781 ttaaggggta aagggcgcaa agggcatggg tcgggagagg ggacgcaggc ccctctcctc 841 cgtggcacat // LOCUS AB005297 5535 bp mRNA PRI 29-NOV-1997 DEFINITION Homo sapiens BAI 1 mRNA, complete cds. ACCESSION AB005297 NID g2653431 KEYWORDS BAI 1. SOURCE Homo sapiens brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Nishimori,H., Shiratsuchi,T., Urano,T., Kimura,Y., Kiyono,K., Tatsumi,K., Yoshida,S., Ono,M., Kuwano,M. and Nakamura,Y. TITLE A novel brain-specific p53-target gene, BAI1, containing thrombospondin type 1 repeats inhibits experimental angiogenesis JOURNAL Oncogene (1997) In press REFERENCE 2 (bases 1 to 5535) AUTHORS Nakamura,Y. TITLE Direct Submission JOURNAL Submitted (28-JUN-1997) to the DDBJ/EMBL/GenBank databases. Yusuke Nakamura, The Inst. of Medical Science, The University of Tokyo, Lab. of Molecular Medicine, Human Genome Center; 4-6-1, Shirokanedai Minato-ku, Tokyo, Minato-ku, Tokyo 108, Japan (E-mail:yusuke@ims.u-tokyo.ac.jp, Tel:81-3-5449-5372, Fax:81-3-5449-5433) FEATURES Location/Qualifiers source 1..5535 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="8" /map="8q24" /tissue_type="brain" gene 184..4938 /gene="BAI 1" CDS 184..4938 /gene="BAI 1" /codon_start=1 /db_xref="PID:d1024528" /db_xref="PID:g2653432" /translation="MRGQAAAPGPVWILAPLLLLLLLLGRRARAAAGADAGPGPEPCA TLVQGKFFGYFSAAAVFPANASRCSWTLRNPDPRRYTLYMKVAKAPVPCSGPGRVRTY QFDSFLESTRTYLGVESFDEVLRLCDPSAPLAFLQASKQFLQMRRQQPPQHDGLRPRA GPPGPTDDFSVEYLVVGNRNPSRAACQMLCRWLDACLAGSRSSHPCGIMQTPCACLGG EAGGPAAGPLAPRGDVCLRDAVAGGPENCLTSLTQDRGGHGATGGWKLWSLWGECTRD CGGGLQTRTRTCLPAPGVEGGGCEGVLEEGRQCNREACGPAGRTSSRSQSLRSTDARR REELGDELQQFGFPAPQTGDPAAEEWSPWSVCSSTCGEGWQTRTRFCVSSSYSTQCSG PLREQRLCNNSAVCPVHGAWDEWSPWSLCSSTCGRGFRDRTRTCRPPQFGGNPCEGPE KQTKFCNIALCPGRAVDGNWNEWSSWSACSASCSQGRQQRTRECNGPSYGGAECQGHW VETRDCFLQQCPVDGKWQAWASWGSCSVTCGAGSQRRERVCSGPFFGGAACQGPQDEY RQCGTQRCPEPHEICDEDNFGAVIWKETPAGEVAAVRCPRNATGLILRRCELDEEGIA YWEPPTYIRCVSIDYRNIQMMTREHLAKAQRGLPGEGVSEVIQTLVEISQDGTSYSGD LLSTIDVLRNMTEIFRRAYYSPTPGDVQNFVQILSNLLAEENRDKWEEAQLAGPNAKE LFRLVEDFVDVIGFRMKDLRDAYQVTDNLVLSIHKLPASGATDISFPMKGWRATGDWA KVPEDRVTVSKSVFSTGLTEADEASVFVVGTVLYRNLGSFLALQRNTTVLNSKVISVT VKPPPRSLRTPLEIEFAHMYNGTTNQTCILWDETDVPSSSAPPQLGPWSWRGCRTVPL DALRTRCLCDRLSTFAILAQLSADANMEKATLPSVTLIVGCGVSSLTLLMLVIIYVSV WRYIRSERSVILINFCLSIISSNALILIGQTQTRNKVMCTLVAAFLHFFFLSSFCWVL TEAWQSYMAVTGHLRNRLIRKRFLCLGWGLPALVVAISVGFTKAKGYSTMNYCWLSLE GGLLYAFVGPAAAVVLVNMVIGILVFNKLVSKDGITDKKLKERAGASLWSSCVVLPLL ALTWMSAVLAVTDRRSALFQILFAVFDSLEGFVIVMVHCILRREVQDAVKCRVVDRQE EGNGDSGGSFQNGHAQLMTDFEKDVDLACRSVLNKDIAACRTATITGTLKRPSLPEEE KLKLAHAKGPPTNFNSLPANVSKLHLHGSPRYPGGPLPDFPNHSLTLKRDKAPKSSFV GDGDIFKKLDSELSRAQEKALDTSYVILPTATATLRPKPKEEPKYSIHIDQMPQTRLI HLSTAPEASLPARSPPSRQPPSGGPPEAPPAQPPPPPPPPPPPPQQPLPPPPNLEPAP PSLGDPGEPAAHPGPSTGPSTKNENVATLSVSSLERRKSRYAELDFEKIMHTRKRHQD MFQDLNRKLQHAAEKDKEVLGPDSKPEKQQTPNKRPWESLRKAHGTPTWVKKELEPLQ PSPLELRSVEWERSGATIPLVGQDIIDLQTEV" BASE COUNT 944 a 1937 c 1780 g 874 t ORIGIN 1 ggactttaga agccgttgct gccctctctg tcacctgaag cggggccctc tcccatccca 61 cccttgcccc gcctccctgc ccccaccggg ccggccctgc ccgccgccgg accctggcat 121 gtcaagacct ggtccgcgcc tgcctgccca gcccgcggaa ccccggcggc cccgcgagct 181 aggatgaggg gccaggccgc cgccccgggc cccgtctgga tcctcgcccc gctgctactg 241 ctgctgctgc tgctgggacg ccgcgcgcgg gcggccgccg gagcagacgc ggggcccggg 301 cccgagccgt gcgccacgct ggtgcaggga aagttcttcg gctacttctc cgcggccgcc 361 gtgttcccgg ccaacgcctc gcgctgctcc tggacgctac gcaacccgga cccgcggcgc 421 tacactctct acatgaaggt ggccaaggcg cccgtgccct gcagcggccc cggccgcgtg 481 cgcacctacc agttcgactc cttcctcgag tccacgcgca cctacctggg cgtggagagc 541 ttcgacgagg tgctgcggct ctgcgacccc tccgcacccc tggccttcct gcaggccagc 601 aagcagttcc tgcagatgcg gcgccagcag ccgccccagc acgacgggct ccggccccgg 661 gccgggccgc cgggccccac cgacgacttc tccgtggagt acctggtggt ggggaaccgc 721 aaccccagcc gtgccgcctg ccagatgctg tgccgctggc tggacgcgtg tctggccggt 781 agtcgcagct cgcacccctg cgggatcatg cagaccccct gcgcctgcct gggcggcgag 841 gcgggcggcc ctgccgcggg acccctggcc ccccgcgggg atgtctgctt gagagatgcg 901 gtggctggtg gccctgaaaa ctgcctcacc agcctgaccc aggaccgggg cgggcacggc 961 gccacaggcg gctggaagct gtggtccctg tggggcgaat gcacgcggga ctgcggggga 1021 ggcctccaga cgcggacgcg cacctgcctg cccgcgccgg gcgtggaggg cggcggctgc 1081 gagggggtgc tggaggaggg tcgccagtgc aaccgcgagg cctgcggccc cgctgggcgc 1141 accagctccc ggagccagtc cctgcggtcc acagatgccc ggcggcgcga ggagctgggg 1201 gacgagctgc agcagtttgg gttcccagcc ccccagaccg gtgacccagc agccgaggag 1261 tggtccccgt ggagcgtgtg ctccagcacc tgcggcgagg gctggcagac ccgcacgcgc 1321 ttctgcgtgt cctcctccta cagcacgcag tgcagcggac ccctgcgcga gcagcggctg 1381 tgcaacaact ctgccgtgtg cccagtgcat ggtgcctggg atgagtggtc gccctggagc 1441 ctctgctcca gcacctgtgg ccgtggcttt cgggatcgca cgcgcacctg caggcccccc 1501 cagtttgggg gcaacccctg tgagggccct gagaagcaaa ccaagttctg caacattgcc 1561 ctgtgccctg gccgggcagt ggatggaaac tggaatgagt ggtcgagctg gagcgcctgc 1621 tccgccagct gctcccaggg ccgacagcag cgcacgcgtg aatgcaacgg gccttcctac 1681 gggggtgcgg agtgccaggg ccactgggtg gagacccgag actgcttcct gcagcagtgc 1741 ccagtggatg gcaagtggca ggcctgggcg tcatggggca gttgcagcgt cacgtgtggg 1801 gctggcagcc agcgacggga gcgtgtctgc tctgggccct tcttcggggg agcagcctgc 1861 cagggccccc aggatgagta ccggcagtgc ggcacccagc ggtgtcccga gccccatgag 1921 atctgtgatg aggacaactt tggtgctgtg atctggaagg agaccccagc gggagaggtg 1981 gctgctgtcc ggtgtccccg caacgccaca ggactcatcc tgcgacggtg tgagctggac 2041 gaggaaggca tcgcctactg ggagcccccc acctacatcc gctgtgtttc cattgactac 2101 agaaacatcc agatgatgac ccgggagcac ctggccaagg ctcagcgagg gctgcctggg 2161 gagggggtct cggaggtcat ccagacactg gtggagatct ctcaggacgg gaccagctac 2221 agtggggacc tgctgtccac catcgatgtc ctgaggaaca tgacagagat tttccggaga 2281 gcgtactaca gccccacccc tggggacgta cagaactttg tccagatcct tagcaacctg 2341 ttggcagagg agaatcggga caagtgggag gaggcccagc tggcgggccc caacgccaag 2401 gagctgttcc ggctggtgga ggactttgtg gacgtcatcg gcttccgcat gaaggacctg 2461 agggatgcat accaggtgac agacaacctg gttctcagca tccataagct cccagccagc 2521 ggagccactg acatcagctt ccccatgaag ggctggcggg ccacgggtga ctgggccaag 2581 gtgccagagg acagggtcac tgtgtccaag agtgtcttct ccacggggct gacagaggcc 2641 gatgaagcat ccgtgtttgt ggtgggcacc gtgctctaca ggaacctggg cagcttcctg 2701 gccctgcaga ggaacacgac cgtcctgaat tctaaggtga tctccgtgac tgtgaaaccc 2761 ccgcctcgct ccctgcgcac acccttggag atcgagtttg cccacatgta taatggcacc 2821 accaaccaga cctgtatcct gtgggatgag acggatgtac cctcctcctc cgcccccccg 2881 cagctcgggc cctggtcgtg gcgcggctgc cgcacggtgc ccctcgacgc cctccggacg 2941 cgctgcctct gtgaccggct ctccaccttc gccatcttag cccagctcag cgccgacgcg 3001 aacatggaga aggcgactct gccgtcggtg acgctcatcg tgggctgtgg cgtgtcctct 3061 ctcaccctgc tcatgctggt catcatctac gtgtccgtgt ggaggtacat tcgctcagag 3121 cgttctgtca tcctcatcaa cttctgcctg tccatcatct cctccaatgc cctcatcctc 3181 atcgggcaga cccagacccg caacaaggtg atgtgcacgc tggtggccgc cttcctgcac 3241 ttcttcttcc tgtcctcctt ctgctgggtg ctcaccgagg cctggcagtc ctacatggcc 3301 gtgacgggcc acctccggaa ccgcctcatc cgcaagcgct tcctctgcct gggctggggg 3361 ctccctgcac tggttgtggc catttctgtg ggattcacca aggccaaagg gtacagcacc 3421 atgaactact gctggctctc cctggagggg ggactgctct atgccttcgt gggacctgcc 3481 gctgccgttg tgctggtgaa catggtcatt gggatcctgg tgttcaacaa gctcgtgtcc 3541 aaagacggca tcacggacaa gaagctgaag gagcgggcag gggcctccct gtggagctcc 3601 tgcgtggtgc tgccgctgct ggcgctgacc tggatgtcgg ctgtgctcgc cgtcaccgac 3661 cgccgctccg ccctcttcca gatcctcttc gctgtcttcg actcgctgga gggcttcgtc 3721 atcgtcatgg tgcactgtat cctccgtaga gaggtccagg acgctgtgaa atgccgtgtg 3781 gttgaccggc aggaggaggg caacggggac tcagggggct ccttccagaa cggccacgcc 3841 cagctcatga ccgacttcga gaaggacgtg gatctggcct gtagatcagt gctgaacaag 3901 gacatcgcgg cctgccgcac tgccaccatc acgggcacac tgaagcggcc gtctctgccc 3961 gaggaggaga agctgaagct ggcccatgcc aaggggccgc ccaccaattt caacagcctg 4021 ccggccaacg tgtccaagct gcacctgcac ggctcacccc gctatcccgg cgggcccctg 4081 cccgacttcc ccaaccactc actgaccctc aagagggaca aggcgcccaa gtcctccttc 4141 gtcggtgacg gggacatctt caagaagctg gactcggagc tgagccgggc ccaggagaag 4201 gctctggaca cgagctacgt gatcctgccc acggccacgg ccacgctgcg gcccaagccc 4261 aaggaggagc ccaagtacag catccacatt gaccagatgc cgcagacccg cctcatccac 4321 ctcagcacgg cccccgaggc cagcctcccc gcccgcagcc cgccctcccg ccagcccccc 4381 agcggcgggc cccccgaggc accccctgcc cagcccccac cgcctccgcc cccaccgcca 4441 ccacctcccc agcagcccct gcccccaccg cccaatctgg agccggcacc ccccagcctg 4501 ggggatcccg gggagcctgc cgcccatccg ggacccagca cggggcccag caccaagaac 4561 gagaatgtcg ccaccttgtc tgtgagctcc ctggagcggc ggaagtcgcg gtatgcagaa 4621 ctggactttg agaagatcat gcacacccgg aagcggcacc aagacatgtt ccaggacctg 4681 aaccggaagc tgcagcacgc agcggagaag gacaaggagg tgctggggcc ggacagcaag 4741 ccggaaaagc agcagacgcc caacaagagg ccctgggaga gcctccggaa agcccacggg 4801 acgcccacgt gggtgaagaa ggagctggag ccgctgcagc cgtcgccgct ggagcttcgc 4861 agcgtggagt gggagaggtc gggcgccacg atcccgctgg tgggccagga catcatcgac 4921 ctccagaccg aggtctgagc gggtgggcgg cggccacgca ctgggccacg gaggagggat 4981 gctgctccgc ccgctcctgc cgcagacggg cacagacacg ctcgcgggca gcgggccagg 5041 cccgcacccc ggcctcaggg cgctcagacg gcggccaggc acagggcccg cagtgctggg 5101 accagagcca gatgcaggac aggaggcggc ccggccagcg ggcacagggc accagaggcc 5161 gaaggtgcct cagactccgc cctcctcggg ccgaggccca gcgggcagat gggcggacgg 5221 ctgtggaccg tggacaggcc cagcgcggcc agcgtcccag ggtacccgcc tgagctcctg 5281 ctgcggagga gctgcctgct tggcccggcc ggcctggcac cgttttttaa acacccccat 5341 ccctcgggaa gcagccagct ccccacacct tccagggccc taggcccctc ctagacccag 5401 gtggagggca cagccctccg accctcatgg cccccagggg caggactgag tcccctccag 5461 gaagaagcag gggggaatct attttttctc tccttttctt ttcttcaata aaaagaatta 5521 aaaacccaaa aaaaa // LOCUS AB005659 4939 bp mRNA PRI 20-OCT-1997 DEFINITION Homo sapiens SMRP mRNA, complete cds. ACCESSION AB005659 NID g2554609 KEYWORDS SMRP. SOURCE Homo sapiens bone marrow cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Suzuki,T., Nishio,K., Sasaki,H., Kurokawa,H., Saito-Ohara,F., Ikeuchi,T., Tanabe,S., Terada,M. and Saijo,N. TITLE cDNA cloning of a short type of multidrug resistance protein homologue, SMRP, from a human lung cancer cell line JOURNAL Biochem. Biophys. Res. Commun. 238 (3), 790-794 (1997) MEDLINE 97472289 REFERENCE 2 (bases 1 to 4939) AUTHORS Suzuki,T. TITLE Direct Submission JOURNAL Submitted (10-JUL-1997) to the DDBJ/EMBL/GenBank databases. Toshihiro Suzuki, National Cancer Center Research Institute, Pharmacology Division; Tsukiji 5-1-1, Chuo-ku, Tokyo 104, Japan (E-mail:tssuzuki@gan2.res.ncc.go.jp, Tel:03-3542-2511, Fax:03-3542-1886) FEATURES Location/Qualifiers source 1..4939 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="3" /map="3q27" /tissue_type="bone marrow" gene 737..3577 /gene="SMRP" CDS 737..3577 /gene="SMRP" /note="a short type of multidrug resistance protein homologue" /codon_start=1 /db_xref="PID:d1023757" /db_xref="PID:g2554610" /translation="MKNATLAWDSSHSSIQNSPKLTPKMKKDKRASRGKKEKVRQLQR TEHQAVLAEQKGHLLLDSDERPSPEEEEGKHIHLGHLRLQRTLHSIDLEIQEGKLVGI CGSVGSGKTSLISAILGQMTLLEGSIAISGTFAYVAQQAWILNATLRDNILFGKEYDE ERYNSVLNSCCLRPDLAILPSSDLTEIGERGANLSGGQRQRISLARALYSDRSIYILD DPLSALDAHVGNHIFNSAIRKHLKSKTVLFVTHQLQYLVDCDEVIFMKEGCITERGTH EELMNLNGDYATIFNNLLLGETPPVEINSKKETSGSQKKSQDKGPKTGSVKKEKAVKP EEGQLVQLEEKGQGSVPWSVYGVYIQAAGGPLAFLVIMALFMLNVGSTAFSTWWLSYW IKQGSGNTTVTRGNETSVSDSMKDNPHMQYYASIYALSMAVMLILKAIRGVVFVKGTL RASSRLHDELFRRILRSPMKFFDTTPTGRILNRFSKDMDEVDVRLPFQAEMFIQNVIL VFFCVGMIAGVFPWFLVAVGPLVILFSVLHIVSRVLIRELKRLDNITQSPFLSHITSS IQGLATIHAYNKGQEFLHRYQELLDDNQAPFFLFTCAMRWLAVRLDLISIALITTTGL MIVLMHGQIPPAYAGLAISYAVQLTGLFQFTVRLASETEARFTSVERINHYIKTLSLE APARIKNKAPSPDWPQEGEVTFENAEMRYRENLPLVLKKVSFTIKPKEKIGIVGRTGS GKSSLGMALFRLVELSGGCIKIDGVRISDIGLADLRSKLSIIPQEPVLFSGTVRSNLD PFNQYTEDQIWDALERTHMKECIAQLPLKLESEVMENGDNFSVGERQLLCIARALLRH CKILILDEATAAMDTETDLLIQETIREAFADCTMLNIAHRLHTVLGSDRIMVLAQGQV VEFDTPSVLLSNDSSRFYAMFAAAENKVAVKG" polyA_site 4939 /note="15 A nucleotides" BASE COUNT 1174 a 1224 c 1249 g 1292 t ORIGIN 1 cagagaatgt ttgaggcagc agccgttggc agcctgctgg ctggaggacc cgttgttgcc 61 atcttaggca tgatttataa tgtaattatt ctgggaccaa caggcttcct gggatcagct 121 gtttttatcc tcttttaccc agcaatgatg tttgcatcac ggctcacagc atatttcagg 181 agaaaatgcg tggccgccac ggatgaacgt gtccagaaga tgaatgaagt tcttacttac 241 attaaattta tcaaaatgta tgcctgggtc aaagcatttt ctcagagtgt tcaaaaaatc 301 cgcgaggagg agcgtcggat attggaaaaa gctgggtact tccagagcat cactgtgggt 361 gtggctccca ttgtggtggt gattgccagc gtggtgacct tctctgttca tatgaccctg 421 ggcttcgatc tgacagcagc acaggctttc acagtggtga cagtcttcaa ttccatgact 481 tttgctttga aagtaacacc gttttcagta aagtccctct cagaagcctc agtggctgtt 541 gacagattta agcttccttc cactgtatag cctgatgtgt tttgctacgt gagtgtacgc 601 cctaggcttg tctccctggt cttgatccag tgtctcatct ttgcacctct ctcacaactc 661 ctatcagagt ttgtttctaa tggaagaggt tcacatgata aagaacaaac cagccagtcc 721 tcacatcaag atagagatga aaaatgccac cttggcatgg gactcctccc actccagtat 781 ccagaactcg cccaagctga cccccaaaat gaaaaaagac aagagggctt ccaggggcaa 841 gaaagagaag gtgaggcagc tgcagcgcac tgagcatcag gcggtgctgg cagagcagaa 901 aggccacctc ctcctggaca gtgacgagcg gcccagtccc gaagaggaag aaggcaagca 961 catccacctg ggccacctgc gcttacagag gacactgcac agcatcgatc tggagatcca 1021 agagggtaaa ctggttggaa tctgcggcag tgtgggaagt ggaaaaacct ctctcatttc 1081 agccatttta ggccagatga cgcttctaga gggcagcatt gcaatcagtg gaaccttcgc 1141 ttatgtggcc cagcaggcct ggatcctcaa tgctactctg agagacaaca tcctgtttgg 1201 gaaggaatat gatgaagaaa gatacaactc tgtgctgaac agctgctgcc tgaggcctga 1261 cctggccatt cttcccagca gcgacctgac ggagattgga gagcgaggag ccaacctgag 1321 cggtgggcag cgccagagga tcagccttgc ccgggccttg tatagtgaca ggagcatcta 1381 catcctggac gaccccctca gtgccttaga tgcccatgtg ggcaaccaca tcttcaatag 1441 tgctatccgg aaacatctca agtccaagac agttctgttt gttacccacc agttacagta 1501 cctggttgac tgtgatgaag tgatcttcat gaaagagggc tgtattacgg aaagaggcac 1561 ccatgaggaa ctgatgaatt taaatggtga ctatgctacc atttttaata acctgttgct 1621 gggagagaca ccgccagttg agatcaattc aaaaaaggaa accagtggtt cacagaagaa 1681 gtcacaagac aagggtccta aaacaggatc agtaaagaag gaaaaagcag taaagccaga 1741 ggaagggcag cttgtgcagc tggaagagaa agggcagggt tcagtgccct ggtcagtata 1801 tggtgtctac atccaggctg ctgggggccc cttggcattc ctggttatta tggccctttt 1861 catgctgaat gtaggcagca ccgccttcag cacctggtgg ttgagttact ggatcaagca 1921 aggaagcggg aacaccactg tgactcgagg gaacgagacc tcggtgagtg acagcatgaa 1981 ggacaatcct catatgcagt actatgccag catctacgcc ctctccatgg cagtcatgct 2041 gatcctgaaa gccattcgag gagttgtctt tgtcaagggc acgctgcgag cttcctcccg 2101 gctgcatgac gagcttttcc gaaggatcct tcgaagccct atgaagtttt ttgacacgac 2161 ccccacaggg aggattctca acaggttttc caaagacatg gatgaagttg acgtgcggct 2221 gccgttccag gccgagatgt tcatccagaa cgttatcctg gtgttcttct gtgtgggaat 2281 gatcgcagga gtcttcccgt ggttccttgt ggcagtgggg ccccttgtca tcctcttttc 2341 agtcctgcac attgtctcca gggtcctgat tcgggagctg aagcgtctgg acaatatcac 2401 gcagtcacct ttcctctccc acatcacgtc cagcatacag ggccttgcca ccatccacgc 2461 ctacaataaa gggcaggagt ttctgcacag ataccaggag ctgctggatg acaaccaagc 2521 tccttttttt ttgtttacgt gtgcgatgcg gtggctggct gtgcggctgg acctcatcag 2581 catcgccctc atcaccacca cggggctgat gatcgttctt atgcacgggc agattccccc 2641 agcctatgcg ggtctcgcca tctcttatgc tgtccagtta acggggctgt tccagtttac 2701 ggtcagactg gcatctgaga cagaagctcg attcacctcg gtggagagga tcaatcacta 2761 cattaagact ctgtccttgg aagcacctgc cagaattaag aacaaggctc cctcccctga 2821 ctggccccag gagggagagg tgacctttga gaacgcagag atgaggtacc gagaaaacct 2881 ccctcttgtc ctaaagaaag tatccttcac gatcaaacct aaagagaaga ttggcattgt 2941 ggggcggaca ggatcaggga agtcctcgct ggggatggcc ctcttccgtc tggtggagtt 3001 atctggaggc tgcatcaaga ttgatggagt gagaatcagt gatattggcc ttgccgacct 3061 ccgaagcaaa ctctctatca ttcctcaaga gccggtgctg ttcagtggca ctgtcagatc 3121 aaatttggac cccttcaacc agtacactga agaccagatt tgggatgccc tggagaggac 3181 acacatgaaa gaatgtattg ctcagctacc tctgaaactt gaatctgaag tgatggagaa 3241 tggggataac ttctcagtgg gggaacggca gctcttgtgc atagctagag ccctgctccg 3301 ccactgtaag attctgattt tagatgaagc cacagctgcc atggacacag agacagactt 3361 attgattcaa gagaccatcc gagaagcatt tgcagactgt accatgctga acattgccca 3421 tcgcctgcac acggttctag gctccgatag gattatggtg ctggcccagg gacaggtggt 3481 ggagtttgac accccatcgg tccttctgtc caacgacagt tcccgattct atgccatgtt 3541 tgctgctgca gagaacaagg tcgctgtcaa gggctgactc ctccctgttg acgaagtctc 3601 ttttctttag agcattgcca ttccctgcct ggggcgggcc cctcatcgcg tcctcctacc 3661 gaaaccttgc ctttctcgat tttatctttc gcacagcagt tccggattgg cttgtgtgtt 3721 tcacttttag ggagagtcat attttgatta ttgtatttat tccatattca tgtaaacaaa 3781 atttagtttt tgttcttaat tgcactctaa aaggttcagg gaaccgttat tataattgta 3841 tcagaggcct ataatgaagc tttatacgtg tagctatatc tatatataat tctgtacata 3901 gcctatattt acagtgaaaa tgtaagctgt ttattttata ttaaaataag cactgtgcta 3961 ataacagtgc atattccttt ctatcatttt tgtacagttt gctgtactag agatctggtt 4021 ttgctattag actgtaggaa gagtagcatt tcattcttct ctagctggtg gtttcacggt 4081 gccaggtttt ctgggtgtcc aaaggaagac gtgtggcaat agtgggccct ccgacagccc 4141 cctctgccgc ctccccacag ccgctccagg ggtggctgga gacgggtggg cggctggaga 4201 ccatgcagag cgccgtgagt tctcagggct cctgccttct gtcctggtgt cacttactgt 4261 ttctgtcagg agagcagcgg ggcgaagccc aggccccttt tcactccctc catcaagaat 4321 ggggatcaca gagacattcc tccgagccgg ggagtttctt tcctgccttc ttctttttgc 4381 tgttgtttct aaacaagaat cagtctatcc acagagagtc ccactgcctc aggttcctat 4441 ggctggccac tgcacagagc tctccagctc caagacctgt tggttccaag ccctggagcc 4501 aactgctgct ttttgaggtg gcactttttc atttgcctat tcccacacct ccacagttca 4561 gtggcagggc tcaggatttc gtgggtctgt tttcctttct caccgcagtc gtcgcacagt 4621 ctctctctct ctctcccctc aaagtctgca actttaagca gctcttgcta atcagtgtct 4681 cacactggcg tagaagtttt tgtactgtaa agagacctac ctcaggttgc tggttgctgt 4741 gtggtttggt gtgttcccgc aaaccccctt tgtgctgtgg ggctggtagc tcaggtgggc 4801 gtggtcactg ctgtcatcag ttgaatggtc agcgttgcat gtcgtgacca actagacatt 4861 ctgtcgcctt agcatgtttg ctgaacacct tgtggaagca aaaatctgaa aatgtgaata 4921 aaattatttt ggattttgc // LOCUS AB005666 3885 bp mRNA PRI 10-SEP-1997 DEFINITION Homo sapiens mRNA for GTPase-activating protein, complete cds. ACCESSION AB005666 NID g2389008 KEYWORDS GTPase-activating protein. SOURCE Homo sapiens cDNA to mRNA, clone:SPA1. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Kurachi,H., Wada,Y., Tsukamoto,N., Kubota,H., Hattori,M., Iwai,K. and Minato,N. TITLE Human SPA-1 Product Selectively Expressed in Lymphoid Tissues is a Specific GTPase-activating Protein for Rap1 and Rap2 JOURNAL J. Biol. Chem. (1997) In press REFERENCE 2 (bases 1 to 3885) AUTHORS Minato,N. TITLE Direct Submission JOURNAL Submitted (09-JUL-1997) to the DDBJ/EMBL/GenBank databases. Nagahiro Minato, Faculty of Medicine, Kyoto University, Immunology and Cell Biology; Yoshida-Konoe-cho, Sakyo-ku, Kyoto, Kyoto 606, JAPAN (E-mail:minato@med.kyoto-u.ac.jp, Tel:075-753-4659, Fax:075-753-4403) FEATURES Location/Qualifiers source 1..3885 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /clone="SPA1" CDS 297..3425 /codon_start=1 /product="GTPase-activating protein" /db_xref="PID:d1023058" /db_xref="PID:g2389009" /translation="MPMWAGGVGSPRRGMAPASTDDLFARKLRQPARPPLTPHTFEPR PVRGPLLRSGSDAGEARPPTPASPRARAHSHEEASRPAATSTRLFTDPLALLGLPAEE PEPAFPPVLEPRWFAHYDVQSLLFDWAPRSQGMGSHSEASSGTLASAEDQAASSDLLH GAPGFVCELGGEGELGLGGPAFPPVPPALPNAAVSILEEPQNRTSAYSLEHADLGAGY YRKYFYGKEHQNFFGMDESLGPVAVSLRREEKEGSGGGTLHSYRVIVRTTQLRTLRGT ISEDALPPGPPRGLSPRKLLEHVAPQLSPSCLRLGSASPKVPRTLLTLDEQVLSFQRK VGILYCRAGQGSEEEMYNNQEAGPAFMQFLTLLGDVVRLKGFESYRAQLDTKTDSTGT HSLYTTYQDHEIMFHVSTMLPYTPNNQQQLLRKRHIGNDIVTIVFQEPGSKPFCPTTI RSHFQHVFLVVRAHTPCTPHTTYRVAVSRTQDTPAFGPALPAGGGPFAANADFRAFLL AKALNGEQAAGHARQFHAMATRTRQQYLQDLATNEVTTTSLDSASRFGLPSLGGRRRA APRGPGAELQAAGSLVWGVRAAPGARVAAGAQASGPEGIEVPCLLGISAEALVLVAPR DGRVVFNCACRDVLAWTFSEQQLDLYHGRGEAITLRFDGSPGQAVGEVVARLQLVSRG CETRELALPRDGQGRLGFEVDAEGFVTHVERFTFAETAGLRPGARLLRVCGQTLPSLR PEAAAQLLRSAPKVCVTVLPPDESGRPRRSFSELYTLSLQEPSRRGAPDPVQDEVHGV TLLPTTKQLLHLCLQDGGSPPGPGDLAEERTEFLHSQNSLSPRSSLSDEAPVLPNTTP DLLLATTAKPSVPSADSETPLTQDRPGSPSGCEDKGNPAPELRASFLPRTLSLRNSIS RIMSEAGSGTLEDEWQAISEIASTCNTILESLSREGQPIPESGDPKGTPKSDAEPEPG NLSEKVSHLESMLRKLQEDLQKEKADRAALEEEVRSLRHNNRRLQAESESAATRLLLA SKQLGSPTADLA" BASE COUNT 706 a 1321 c 1230 g 628 t ORIGIN 1 ggcagggtga ctggggtccc atgacagctg ctgtgacccc agcaccttcc tcaggatgtg 61 ggggcctggc agagggctgg gcccacagtt ggggctactt cctgtgctga aggaagtcct 121 cttgccattc ctgctgccct gccgctgcct ccctgggaac ccatgtgtcc ttgtggcccc 181 tctgagcagc cccctcctcc ttcagggcag gaactgctgc cacaacctca ggctgggcac 241 caaacacccg tgcccgccaa tgcggcccag cccccggaga gtcaggccca cagagcatgc 301 ccatgtgggc cggcggtgtg gggagccctc ggcggggcat ggcccctgcg tccacagatg 361 acctctttgc ccgcaagctg cgccagccag caaggccccc gctgacaccg cacaccttcg 421 agccgaggcc agtccggggc ccactcctgc gcagcggcag cgatgcaggc gaggccaggc 481 cccccacgcc agccagcccc cgtgcccgtg cccacagcca cgaagaggcc agccgacctg 541 cagccacttc cacccggctc ttcactgacc cgctggcact gctggggctg ccagcagagg 601 aaccagagcc tgccttccca ccagtgcttg agcctcgatg gtttgcccac tatgacgtgc 661 aaagcctgct ctttgattgg gctccgaggt ctcaggggat ggggagccac tcagaggcca 721 gctctgggac cctggcttca gccgaggacc aggctgccag ctcggacctg ctgcatgggg 781 cacctggctt tgtgtgtgag ctcgggggtg agggtgagct aggcctgggt ggaccagcat 841 tcccacctgt gccccctgca ctgcccaacg cggccgtgtc catcctggag gagccacaga 901 accgaacctc ggcctacagc ctggagcacg cagacctggg tgctggctac taccgcaaat 961 acttctatgg caaagaacat cagaacttct tcgggatgga cgagtcgctg ggcccggtgg 1021 cagtgagcct gcggcgggag gagaaggagg gcagcggagg gggcaccctg cacagctacc 1081 gcgtcatcgt gcggaccacg cagctccgga cactccgtgg caccatctcg gaggacgcgc 1141 tgccgccggg gcccccacgg ggtctgtccc caaggaaact tctggagcac gtggcgccgc 1201 agctgagccc cagctgcctg cgcctgggct cagcttcacc caaggtacca cggacgctgc 1261 tcacactgga tgagcaagtg ctgagcttcc aacgcaaggt gggcatcctg tactgccggg 1321 cgggccaggg ctcggaggag gagatgtaca acaaccagga ggcgggaccg gccttcatgc 1381 agtttctcac cttgctgggc gatgtggtgc ggctcaaagg ctttgagagt taccgggccc 1441 agctagacac caaaacggat tccacaggca cgcactccct ctacaccaca taccaggacc 1501 acgagatcat gttccacgtg tccacgatgc tgccttacac ccctaataac cagcagcagc 1561 tcctccggaa gcgccacatt ggcaacgaca ttgtgaccat cgtgttccag gagcctggca 1621 gcaagccctt ctgccccacc accatccgct cgcacttcca gcacgtgttc ctagtggtgc 1681 gggcacacac accctgcacg ccacacacca cctacagggt ggccgtgagc cgcacccagg 1741 acacccctgc cttcgggcca gctctgcctg ctggcggagg ccccttcgca gccaacgccg 1801 acttccgggc cttcctgctg gccaaagcgc tgaatggtga gcaggcggcc ggccacgcgc 1861 gccagttcca cgccatggcc acgcgcaccc gccagcagta cctgcaagac ctggccacca 1921 acgaggtgac cactacgtcg ctggactcgg cttcacgctt cggcctgccc tccctgggtg 1981 ggaggcgccg ggcggcccct cggggcccag gcgccgagct gcaggcagcg ggctcactgg 2041 tgtggggagt gcgcgcggcg cccggggcgc gggtcgccgc cggggctcag gcgagcggcc 2101 ccgaaggcat cgaggtgccc tgcctgctgg gcatctcggc cgaggctctg gtgctggtgg 2161 cgccgcgcga cggccgcgta gtgttcaatt gcgcctgtcg cgacgtgctg gcctggacct 2221 tctccgagca gcagctggac ctgtaccacg gccgcgggga ggcgatcacg ctgcgcttcg 2281 acgggtcccc cggccaagcc gtgggcgagg tggtggcgcg cctgcagctg gtgagccgtg 2341 gctgcgagac ccgcgagctg gcgctgcccc gcgacggtca aggccgcctg ggcttcgagg 2401 tggacgccga gggattcgtc acgcacgtgg agcgcttcac attcgccgag acggcggggc 2461 tgcggcccgg ggcgcgcctc ctgcgcgtgt gcggccagac gctgcccagc ctccggcccg 2521 aggccgctgc ccagctcctg cgctcggcgc ccaaggtctg cgtcaccgtc ctgccccccg 2581 acgagagcgg ccggccccgc aggagttttt cggagctgta cacgctgtcg ctgcaggagc 2641 ctagccggcg gggggcccca gatcctgtgc aggatgaggt ccacggggtg accctgctgc 2701 ccaccacaaa gcagctgctg cacctgtgcc tgcaagatgg tggcagtcct ccagggcctg 2761 gggatctggc cgaggagagg actgagttcc tgcacagcca gaactcgctg tcaccacgca 2821 gctctctgtc ggatgaggcc ccagtcctgc ccaacaccac cccggacctc ctcctggcca 2881 ccacagccaa gccatcagta cccagtgctg acagtgagac acccctgacc caggacaggc 2941 caggcagtcc cagtggctgt gaggacaagg gcaacccggc gccggagctg agggcgtctt 3001 ttctgccacg taccttgtct ctgcggaact ccatcagcag gatcatgtcg gaggcgggca 3061 gtgggaccct ggaggacgag tggcaggcca tctcggagat tgcctctact tgcaacacca 3121 ttctggagtc gctgtcccga gagggacagc ccatcccaga gagtggagac cctaagggaa 3181 ctccaaaatc tgatgctgag ccagagcctg ggaacctctc agagaaggtc tctcacttgg 3241 agtccatgct caggaagctg caggaggacc tgcagaagga gaaggcggac agggcggccc 3301 tggaggagga ggtgcggagc ctgagacaca acaaccggcg gctgcaggcg gagtctgaga 3361 gtgcagccac acgcctcctc ctggcctcca agcagctggg ctcacccacc gccgacctgg 3421 cctgagccgt ctggaaccac ttgggcccat gagggcactg tggtcacact gggccctcct 3481 caggaactct ccctgcggca gaggcgtgtc ttagcactgc ccccctccct agccccttat 3541 ttggtggcgg aagtggcctc caccccttcc ctgttagtac aatattctgt ggagaaaaga 3601 ggacttcagg gagtaaaaaa gccactgatg aaagaagaaa actttagagc acaatggatc 3661 tcgaggtcga gggatctcta gggaattgct gcagcagctt ttagagcaga agtgacactt 3721 ccgtacaggc ctagaagtaa aggcaacatc cactgaggag cagttctttg atttgcacca 3781 ccaccggatc ccccgggctg caggaattcg atatcaagct tatcgatacc gtcgacctcg 3841 agggggggcc cggtacccaa ttcgccctat agtgagtcgt attac // LOCUS AB005754 1557 bp mRNA PRI 21-JAN-1998 DEFINITION Homo sapiens mRNA for LAK-1, complete cds. ACCESSION AB005754 NID g2804243 KEYWORDS LAK-1. SOURCE Homo sapiens adult male LAK-cell cDNA to mRNA, clone_lib:membrane lymphotoxin positive subtraction library clone:No.1. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Abe,Y. and Takaoka,Y. TITLE mLT positive LAK-cell clone No. 1 JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 1557) AUTHORS Abe,Y. TITLE Direct Submission JOURNAL Submitted (15-JUL-1997) to the DDBJ/EMBL/GenBank databases. Yasuhito Abe, Ehime University School of Medicine, The Second Department of Surgery; Shigenobu, Onsen-gun, Ehime 791-02, Japan (E-mail:yasuhito@m.ehime-u.ac.jp, Tel:+81-89-964-5111, Fax:+81-89-960-5334) FEATURES Location/Qualifiers source 1..1557 /organism="Homo sapiens" /note="Japanese" /db_xref="taxon:9606" /cell_type="LAK-cell" /clone="No.1" /clone_lib="membrane lymphotoxin positive subtraction library" /dev_stage="adult" /sex="male" CDS 201..1151 /codon_start=1 /product="LAK-1" /db_xref="PID:d1025349" /db_xref="PID:g2804244" /translation="MLCIEDPLLPGNDVGRSSYGAMQVKQVFDYAYIVLSHAVSPLAR SYPNRDAESTLGRIIKVTQEVIDYRRWIKEKWGSKAHPSPGMDSRSKIKERIATCNGE QTQNREPESPYGQRLTLSLSSPQLLSSGSSASSVSSLSGSDVDSDTPPCTTPSVYQFS LQAPAPLMAGLPTALPMPSGKPQPTTSRTLIMTTNNQTRFTIPPPTLGVAPVPCGQVG VEGTASLKAVHHMSSPAIPSASPNPLSSPHLYHKQHNGMKLSMKGSHGHTQGGGYSSV GSGGVRPPVGNRGHHQYNRTGWRRKKHTHTRDSLPVSLSR" BASE COUNT 354 a 464 c 403 g 336 t ORIGIN 1 gctttctaca gttgcatcca agaattgatg cccggagagc tgatgaaaac cttggaatgc 61 ttcttgtaga attttttgaa ctctatggga gaaattttaa gggcctactt gaaaaccggt 121 attagaatca aagaaggagg tgcctatatc gccaaagagg agatcatgaa agccatgacc 181 agcgggtaca agaccgtcga atgctgtgca ttgaggaccc cctgctgcca gggaatgacg 241 ttggccggag ctcctatggc gccatgcagg tgaagcaggt cttcgattat gcctacatag 301 tgctcagcca tgccgtgtca ccgctggcca ggtcctatcc aaacagagac gccgaaagta 361 ctttaggaag aatcatcaaa gtaactcagg aggtgattga ctaccggagg tggatcaaag 421 agaagtgggg cagcaaagcc cacccgtcgc caggcatgga cagcagatcc aagatcaaag 481 agcgaatagc cacatgcaat ggggagcaga cgcagaaccg agagcccgag tctccctatg 541 gccagcgctt gactttgtcg ctgtccagcc cccagctcct gtcttcaggc tcctcggcct 601 cttctgtgtc ttcactttct gggagtgacg ttgattcaga cacaccgccc tgcacaacgc 661 ccagtgttta ccagttcagt ctgcaagcgc cagctcctct catggccggc ttacccaccg 721 ccttgccaat gcccagtggc aaacctcagc ccaccacttc cagaacactg atcatgacaa 781 ccaacaatca gaccaggttt actatacctc caccgaccct aggggttgct cctgttcctt 841 gcggacaagt tggtgtagaa ggaactgcgt ctttgaaagc cgtccaccac atgtcttccc 901 cggccattcc ctcagcgtcc cccaacccgc tctcgagccc tcatctgtat cataagcagc 961 acaacggcat gaaactgtcc atgaagggct ctcacggcca cacccaaggc ggcggctaca 1021 gctctgtggg tagcggaggt gtgcggcccc ctgtgggcaa caggggacac caccagtata 1081 accgcaccgg ctggaggagg aaaaaacaca cacacacacg ggacagtctg cccgtgagcc 1141 tcagcagata atggctcctg gctgcgtcag cctcccccac ccctctgcag actgccccgc 1201 ggcctcggcc accggcaggg gaaccgagac cagcaccccg cacgtcagcc gggcctcggg 1261 cacgcccgcc gttgattact ctgcatgttt cttcgtgtgg tggtcgcgtc catcttcaag 1321 aacagctcgt tgtgctcatc tgtgaagcct tattaaacgt ggacgttgtt ttctgccttc 1381 ccaggattct tccttcagtg ctgaggcagg ttgggctcag gaactgcagg gacgtgaaca 1441 tgcgcttgcg gtttgaggta gccgtgtctg ttccttcgcg gtttgctatt ttcatttcct 1501 gttcgtcaaa gcagcagagg agatcaaacc ccgttcgtgt gtctttcctc cacggat // LOCUS AB005910 3471 bp mRNA PRI 16-SEP-1997 DEFINITION Homo sapiens mRNA for phosphatidylinositol 4-kinase, complete cds. ACCESSION AB005910 NID g2285793 KEYWORDS phosphatidylinositol 4-kinase; PI4Kb. SOURCE Homo sapiens brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Saito,T., Seki,N., Ishii,H., Ohira,M., Hayashi,A., Kozuma,S. and Hori,T. TITLE Complementary DNA cloning and chromosomal mapping of a novel phosphatidylinositol kinase gene JOURNAL DNA Res. 4, 1-5 (1997) REFERENCE 2 (bases 1 to 3471) AUTHORS Saito,T. TITLE Direct Submission JOURNAL Submitted (22-JUL-1997) to the DDBJ/EMBL/GenBank databases. Toshiyuki Saito, National Institute of Radiological Sciences, Genome Research Group; Anagawa 4-9-1, Inage, Chiba 263, Japan (E-mail:t_saito@nirs.go.jp, Tel:043-206-3135, Fax:043-251-9818) FEATURES Location/Qualifiers source 1..3471 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1q21" /tissue_type="brain" gene 70..2556 /gene="PI4Kb" CDS 70..2556 /gene="PI4Kb" /codon_start=1 /product="phosphatidylinositol 4-kinase" /db_xref="PID:d1022511" /db_xref="PID:g2285794" /translation="MRFLEARSLAVAMGDTVVEPAPLKPTSEPTSGPPGNNGGSLLSV ITEGVGELSVIDPEVAQKACQEVLEKVKLLHGGVAVSSRGTPLELVNGDGVDSEIRCL DDPPAQIREEEDEMGAAVASGTAKGARRRRQNNSAKQSWLLRLFESKLFDISMAISYL YNSKEPGVQAYIGNRLFCFRNEDVDFYLPQLLNMYIHMDEDVGDAIKPYIVHRCRQSI NFSLQCALLLGAYSSDMHISTQRHSRGTKLRKLILSDELKPAHRKRELPSLSPAPDTG LSPSKRTHQRSKSDATASISLSSNLKRTASNPKVENEDEELSSSTESIDNSFSSPVRL APEREFIKSLMAIGKRLATLPTKEQKTQRLISELSLLNHKLPARVWLPTAGFDHHVVR VPHTQAVVLNSKDKAPYLIYVEVLECENFDTTSVPARIPENRIRSTRSVENLPECGIT HEQRAGSFSTVPNYDNDDEAWSVDDIGELQVELPEVHTNSCDNISQFSVDSITSQESK EPVFIAAGDIRRRLSEQLAHTPTAFKRDPEDPSAVALKEPWQEKVRRIREGSPYGHLP NWRLLSVIVKCGDDLRQELLAFQVLKQLQSIWEQERVPLWIKPYKILVISADSGMIEP VVNAVSIHQVKKQSQLSLLDYFLQEHGSYTTEAFLSAQRNFVQSCAGYCLVCYLLQVK DRHNGNILLDAEGHIIHIDFGFILSSSPRNLGFETSAFKLTTEFVDVMGGLDGDMFNY YKMLMLQGLIAARKHMDKVVQIVEIMQQGSQLPCFHGSSTIRNLKERFHMSMTEEQLQ LLVEQMVDGSMRSITTKLYDGFQYLTNGIM" BASE COUNT 809 a 966 c 929 g 767 t ORIGIN 1 cagattacac ttggttgact actccggagc agccactaag agggatgaac aggcctgcgt 61 ggaaattgaa tgagattctt ggaagctcga agtctggctg tggccatggg agatacagta 121 gtggagcctg cccccttgaa gccaacttct gagcccactt ctggcccacc agggaataat 181 ggggggtccc tgctaagtgt catcacggag ggggtcgggg aactatcagt gattgaccct 241 gaggtggccc agaaggcctg ccaggaggtg ttggagaaag tcaagctttt gcatggaggc 301 gtggcagtct ctagcagagg caccccactg gagttggtca atggggatgg tgtggacagt 361 gagatccgtt gcctagatga tccacctgcc cagatcaggg aggaggaaga tgagatgggg 421 gccgctgtgg cctcaggcac agccaaagga gcaagaagac ggcggcagaa caactcagct 481 aaacagtctt ggctgctgag gctgtttgag tcaaaactgt ttgacatctc catggccatt 541 tcatacctgt ataactccaa ggagcctgga gtacaagcct acattggcaa ccggctcttc 601 tgctttcgca acgaggacgt ggacttctat ctgccccagt tgcttaacat gtacatccac 661 atggatgagg acgtgggtga tgccattaag ccctacatag tccaccgttg ccgccagagc 721 attaactttt ccctccagtg tgccctgttg cttggggcct attcttcaga catgcacatt 781 tccactcaac gacactcccg tgggaccaag ctacggaagc tgatcctctc agatgagcta 841 aagccagctc acaggaagag ggagctgccc tccttgagcc cggcccctga cacagggctg 901 tctccctcca aaaggactca ccagcgctct aagtcagatg ccactgccag cataagtctc 961 agcagcaacc tgaaacgaac agccagcaac cctaaagtgg agaatgagga tgaggagctc 1021 tcctccagca ccgagagtat tgataattca ttcagttccc ctgttcgact ggctcctgag 1081 agagaattca tcaagtccct gatggcgatc ggcaagcggc tggccacgct ccccaccaaa 1141 gagcagaaaa cacagaggct gatctcagag ctctccctgc tcaaccataa gctccctgcc 1201 cgagtctggc tgcccactgc tggctttgac caccacgtgg tccgtgtacc ccacacacag 1261 gctgttgtcc tcaactccaa ggacaaggct ccctacctga tttatgtgga agtccttgaa 1321 tgtgaaaact ttgacaccac cagtgtccct gcccggatcc ccgagaaccg aattcggagt 1381 acgaggtccg tagaaaactt gcccgaatgt ggtattaccc atgagcagcg agctggcagc 1441 ttcagcactg tgcccaacta tgacaacgat gatgaggcct ggtcggtgga tgacataggc 1501 gagctgcaag tggagctccc cgaagtgcat accaacagct gtgacaacat ctcccagttc 1561 tctgtggaca gcatcaccag ccaggagagc aaggagcctg tgttcattgc agcaggggac 1621 atccgccggc gcctttcgga acagctggct cataccccga cagccttcaa acgagaccca 1681 gaagatcctt ctgcagttgc tctcaaagag ccctggcagg agaaagtacg gcggatcaga 1741 gagggctccc cctacggcca tctccccaat tggcggctcc tgtcagtcat tgtcaagtgt 1801 ggggatgacc ttcggcaaga gcttctggcc tttcaggtgt tgaagcaact gcagtccatt 1861 tgggaacagg agcgagtgcc cctttggatc aagccataca agattcttgt gatttcggct 1921 gatagtggca tgattgaacc agtggtcaat gctgtgtcca tccatcaggt gaagaaacag 1981 tcacagctct ccttgctcga ttacttccta caggagcacg gcagttacac cactgaggca 2041 ttcctcagtg cacagcgcaa ttttgtgcaa agttgtgctg ggtactgctt ggtctgctac 2101 ctgctgcaag tcaaggacag acacaatggg aatatccttt tggacgcaga aggccacatc 2161 atccacatcg actttggctt catcctctcc agctcacccc gaaatctggg ctttgagacg 2221 tcagccttta agctgaccac agagtttgtg gatgtgatgg gcggcctgga tggcgacatg 2281 ttcaactact ataagatgct gatgctgcaa gggctgattg ccgctcggaa acacatggac 2341 aaggtggtgc agatcgtgga gatcatgcag caaggttctc agcttccttg cttccatggc 2401 tccagcacca ttcgaaacct caaagagagg ttccacatga gcatgactga ggagcagctg 2461 cagctgctgg tggagcagat ggtggatggc agtatgcggt ctatcaccac caaactctat 2521 gacggcttcc agtacctcac caacggcatc atgtgacacg ctcctcagcc caggagtggt 2581 ggggggtccg gggcaccctc cctagagggc ccttgtctga gaaaccccaa accaggaaac 2641 cccacctacc caaccatcca cccaagggaa atggaaggca agaaacacga aggatcatgt 2701 ggtaactgcg agagcttgct gaggggtggg agagccagct gtggggtcca gacttgttgg 2761 ggcttccctg cccctcctgg tctgtgtcag tattaccacc agactgactc caggactcac 2821 tgccctccag aaaacagagg tgacaaatgt gagggacact ggggcctttc ttctccttgt 2881 aggggtctct cagaggttct ttccacaggc catcctctta ttccgttctg gggcccagga 2941 agtggggaag agtaggttct cggtacttag gacttgatcc tgtggttggc cactggccat 3001 gctgctgccc agctctaccc ctcccaggga cctacccctc ccagggaccg acccctggcc 3061 caagctcccc ttgctggcgg gcgctgcgtg ggccctgcac ttgctgaggt tccccatcat 3121 gggcaaggaa gggaattccc acagccctcc agtgtactga gggtactggc ctagccatgt 3181 ggaattccct accctgactc cttccccaaa cccagggaaa agagctctca attttttatt 3241 tttaattttt gtttgaaata aagtccttag ttagccactt gtgtcatttc caggttttct 3301 gggggagtgc agggggagat gggtgatgag gtatgaacgg atgcctcagt gtccaagata 3361 caaaaggcac cacatagaag tttgcttttt ccctgcctgt cttggtcact accacctctt 3421 ccctgagaag ggcgggcctt ccatgttctc tcacccgctt caactccaca t // LOCUS AB006077 1158 bp mRNA PRI 28-OCT-1997 DEFINITION Homo sapiens doc-1 mRNA, complete cds. ACCESSION AB006077 NID g2564010 KEYWORDS doc-1. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Daigo,Y., Suzuki,K., Maruyama,O., Miyoshi,Y., Yasuda,T., Kabuto,T., Imaoka S Fujiwara,T., Takahashi,E., Fujino,M.A. and Nakamura,Y. TITLE Isolation, mapping and mutation analysis of a human cDNA homologous to the doc-1 gene of the Chinese hamster, a candidate tumor suppressor for oral cancer JOURNAL Genes Chromosomes Cancer 20 (2), 204-207 (1997) MEDLINE 97472613 REFERENCE 2 (bases 1 to 1158) AUTHORS Nakamura,Y. TITLE Direct Submission JOURNAL Submitted (28-JUL-1997) to the DDBJ/EMBL/GenBank databases. Yusuke Nakamura, Institute of Medical Science, The University of Tokyo, Laboratory of Molecular Medicine, Human Genome Center; 4-6-1 Shirokanedai, Minato-ku, Tokyo 108, Japan (E-mail:y-daigo@ims.u-tokyo.ac.jp, Tel:03-5449-5372, Fax:03-5449-5433) FEATURES Location/Qualifiers source 1..1158 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="12" /map="12q24.31" gene 79..426 /gene="doc-1" CDS 79..426 /gene="doc-1" /codon_start=1 /db_xref="PID:g2564011" /translation="MSYKPNLAAHMPAAALNAAGSVHSPSTSMATSSQYRQLLSDYGP PSLGYTQGTGNSQVPQSKYAELLAIIEELGKEIRPTYAGSKSAMERLKRGIIHARGLV RECLAETERNARS" BASE COUNT 291 a 289 c 254 g 324 t ORIGIN 1 accgcccggc ctcgccgccg ccgccgccgc cctcgcggcc tggccccgcc gcgcccggcg 61 cgcccgccgc ccggggggat gtcttacaaa ccgaacttgg ccgcgcacat gcccgccgcc 121 gccctcaacg ccgctgggag tgtccactcg ccttccacca gcatggcaac gtcttcacag 181 taccgccagc tgctcagtga ctacgggcca ccgtccctag gctacaccca gggaactggg 241 aacagccagg tgccccaaag caaatacgcg gagctgctgg ccatcattga agagctgggg 301 aaggagatca gacccacgta cgcagggagc aagagtgcca tggagaggct gaagcgcggc 361 atcattcacg ctagaggact ggttcgggag tgcttggcag aaacggaacg gaatgccaga 421 tcctagctgc cttgttggtt ttgaaggatt tccatctttt tacaagatga gaagttacag 481 ttcatctccc ctgttcagat gaaacccttg ttttcaaaat ggttacagtt tcgtttttcc 541 tcccatggtt cacttggctc tgaacctaca gtctcaaaga ttgagaaaag attttgcagt 601 taattaggat ttgcatttta agtagttagg aactgcccag gttttttttg ttttttaagc 661 attgatttaa aagatgcacg gaaagttatc ttacagcaaa ctgtagtttg cctccaagac 721 accattgtct ccctttaatc ttctcttttg tatacatttg ttacccatgg tgttctttgt 781 tccttttcat aagctaatac cactgtaggg attttgtttt gaacgcatat tgacagcacg 841 ctttacttag tagccggttc ccatttgcca tacaatgtag gttctgctta atgtaacttc 901 ttttttgctt aagcatttgc atgactatta gtgcttcaaa gtcaattttt aaaaatgcac 961 aagttataaa tacagaagaa agagcaaccc accaaaccta acaaggaccc ccgaacactt 1021 tcatactaag actgtaagta gatctcagtt ctgcgtttat tgtaagttga taaaaacatc 1081 tgggaggaaa tgactaaaac tgtttgcatc tttgtatgta tttattactt gatgtaataa 1141 agcttatttt cattaacc // LOCUS AB006190 1258 bp mRNA PRI 06-JAN-1998 DEFINITION Homo sapiens mRNA for aquaporin adipose, complete cds. ACCESSION AB006190 NID g2317273 KEYWORDS AQP7L; aquaporin adipose. SOURCE Homo sapiens adipose tissue cDNA to mRNA, clone:GS3340. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Kuriyama,H., Kawamoto,S., Ishida,N., Ohno,I., Mita,S., Matsuzawa,Y., Matsubara,K. and Okubo,K. TITLE Molecular cloning and expression of a novel human aquaporin from adipose tissue with glycerol permeability JOURNAL Biochem. Biophys. Res. Commun. 241 (1), 53-58 (1997) MEDLINE 98070904 REFERENCE 2 (bases 1 to 1258) AUTHORS Kuriyama,H. TITLE Direct Submission JOURNAL Submitted (01-AUG-1997) to the DDBJ/EMBL/GenBank databases. Hiroshi Kuriyama, Osaka University, Institute for Molecular and Cellular Biology; Yamada-oka 1-3, Suita, Osaka 565, Japan (E-mail:kuriyama@imcb.osaka-u.ac.jp, Tel:81-6-879-7992, Fax:81-6-877-1922) FEATURES Location/Qualifiers source 1..1258 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="GS3340" /tissue_type="adipose tissue" gene 173..1201 /gene="AQP7L" CDS 173..1201 /gene="AQP7L" /standard_name="aquaporin 7-like" /note="Aquaporin 9 appeared in this citation was renamed as aquaporin adipose (AQPap). Gene symbol AQP7L is also assigned for this gene." /citation=[1] /codon_start=1 /product="aquaporin adipose" /db_xref="PID:d1022600" /db_xref="PID:g2317274" /translation="MVQASGHRRSTRGSKMVSWSVIAKIQEILQRKMVREFLAEFMST YVMMVFGLGSVAHMVLNKKYGSYLGVNLGFGFGVTMGVHVAGRISGAHMNAAVTFANC ALGRVPWRKFPVYVLGQFLGSFLAAATIYSLFYTAILHFSGGQLMVTGPVATAGIFAT YLPDHMTLWRGFLNEAWLTGMLQLCLFAITDQENNPALPGTEALVIGILVVIIGVSLG MNTGYAINPSRDLPPRIFTFIAGWGKQVFSNGENWWWVPVVAPLLGAYLGGIIYLVFI GSTIPREPLKLEDSVAYEDHGITVLPKMGSHEPTISPLTPVSVSPANRSSVHPAPPLH ESMALEHF" BASE COUNT 276 a 354 c 347 g 281 t ORIGIN 1 ggctctggac tggggacaca gggatagctg agccccagct gggggtggaa gctgagccag 61 ggacagtcac ggaggaacaa gatcaagatg cgctgtaact gagaagcccc caaggcggag 121 gctgagaatc agagacattt cagcagacat ctacaaatct gaaagacaaa acatggttca 181 agcatccggg cacaggcggt ccacccgtgg ctccaaaatg gtctcctggt ccgtgatagc 241 aaagatccag gaaatactgc agaggaagat ggtgcgagag ttcctggccg agttcatgag 301 cacatatgtc atgatggtat tcggccttgg ttccgtggcc catatggttc taaataaaaa 361 atatgggagc taccttggtg tcaacttggg ttttggcttc ggagtcacca tgggagtgca 421 cgtggcaggc cgcatctctg gagcccacat gaacgcagct gtgacctttg ctaactgtgc 481 gctgggccgc gtgccctgga ggaagtttcc ggtctatgtg ctggggcagt tcctgggctc 541 cttcctggcg gctgccacca tctacagtct cttctacacg gccattctcc acttttcggg 601 tggacagctg atggtgaccg gtcccgtcgc tacagctggc atttttgcca cctaccttcc 661 tgatcacatg acattgtggc ggggcttcct gaatgaggcg tggctgaccg ggatgctcca 721 gctgtgtctc ttcgccatca cggaccagga gaacaaccca gcactgccag gaacagaggc 781 gctggtgata ggcatcctcg tggtcatcat cggggtgtcc cttggcatga acacaggata 841 tgccatcaac ccgtcccggg acctgccccc ccgcatcttc accttcattg ctggttgggg 901 caaacaggtc ttcagcaatg gggagaactg gtggtgggtg ccagtggtgg caccacttct 961 gggtgcctat ctaggtggca tcatctacct ggtcttcatt ggctccacca tcccacggga 1021 gcccctgaaa ttggaggatt ctgtggcgta tgaagaccac gggataaccg tattgcccaa 1081 gatgggatct catgaaccca cgatctctcc cctcaccccc gtctctgtga gccctgccaa 1141 cagatcttca gtccaccctg ccccaccctt acatgaatcc atggccctag agcacttcta 1201 agcagagatt atttgtgatc ccatccattc cccaataaag caaggcttgt ccgacaaa // LOCUS AB006198 2506 bp mRNA PRI 06-JAN-1998 DEFINITION Homo sapiens mRNA for SART-1, complete cds. ACCESSION AB006198 NID g2723389 KEYWORDS SART-1. SOURCE Homo sapiens cell_line:KE-4 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Shichijo,S., Nakao,M., Imai,Y., Takasu,H., Kawamoto,M., Niiya,F., Yang,D., Toh,Y., Yamana,H. and Itoh,K. TITLE A gene encoding antigenic peptides of human squamous cell carcinoma recognized by cytotoxic T lymphocytes JOURNAL J. Exp. Med. (1998) In press REFERENCE 2 (bases 1 to 2506) AUTHORS Ito,K. TITLE Direct Submission JOURNAL Submitted (04-AUG-1997) to the DDBJ/EMBL/GenBank databases. Kyogo Ito, Kurume University School of Medicine, and the Cancer Vaccine Division of Kurume University Research Center for Innovative Cancer Therapy, Departments of Immunology and Surgery; Asahi-machi 68, Kurume, Fukuoka 830, Japan (E-mail:kyogo@med.kurume-u.ac.jp, Tel:0942-31-7551(ex.3248), Fax:0942-31-7699) COMMENT Sequence update (24-Dec-1997). FEATURES Location/Qualifiers source 1..2506 /organism="Homo sapiens" /note="KE-4 is the squamous cell carcinoma cell line." /db_xref="taxon:9606" /cell_line="KE-4" CDS 39..2441 /note="squamous cell carcinoma antigen recognized by T cells" /codon_start=1 /product="SART-1" /db_xref="PID:d1024962" /db_xref="PID:g2723284" /translation="MGSSKKHRGEKEAAGTTAAAGTGGATEQPPRHREHKKHKHRSGG SGGSGGERRKRSRERGGERGSGRRGAEAEARSSTHGRERSQAEPSERRVKREKRDDGY EAAASSKTSSGDASSLSIEETNKLRAKLGLKPLEVNAIKKEAGTKEEPVTADVINPMA LRQREELREKLAAAKEKRLLNQKLGKIKTLGEDDPWLDDTAAWIERSRQLQKEKDLAE KRAKLLEEMDQEFGVSTLVEEEFGQRRQDLYSARDLQGLTVEHAIDSFREGETMILTL KDKGVLQEEEDVLVNVNLVDKERAEKNVELRKKKPDYLPYAEDESVDDLAQQKPRSIL SKYDEELEGERPHSFRLEQGGTADGLRERELEEIRAKLRLQAQSLSTVGPRLASEYLT PEEMVTFKKTKRRVKKIRKKEKEVVVRADDLLPLGDQTQDGDFGSRLRGRGRRRVSEV EEEKEPVPQPLPSDDTRVENMDISDEEEGGAPPPGSPQVLEEDEAELELQKQLEKGRR LRQLQQLQQLRDSGEKVVEIVKKLESRQRGWEEDEDPERKGAIVFNATSEFCRTLGEI PTYGLAGNREEQEELMDFERDEERSANGGSESDGEENIGWSTVNLDEEKQQQDFSASS TTILDEEPIVNRGLAAALLLCQNKGLLETTVQKVARVKAPNKSLPSAVYCIEDKMAID DKYSRREEYRGFTQDFKEKDGYKPDVKIEYVDETGRKLTPKEAFRQLSHRFHGKGSGK MKTERRMKKLDEEALLKKMSSSDTPLGTVALLQEKQKAQKTPYIVLSGSGKSMNANTI TK" BASE COUNT 606 a 656 c 905 g 339 t ORIGIN 1 ggttcggcgg cagccgggct cggagtggac gtgccactat ggggtcgtcc aagaagcatc 61 gcggagagaa ggaggcggcc gggacgacgg cggcggccgg caccgggggt gccaccgagc 121 agccgccgcg gcaccgggaa cacaaaaaac acaagcaccg gagtggcggc agtggcggta 181 gcggtggcga acgacggaag cggagccggg aacgtggggg cgagcgcggg agcgggcggc 241 gcggggccga agctgaggcc cggagcagca cgcacgggcg ggagcgcagc caggcagagc 301 cctccgagcg gcgcgtgaag cgggagaagc gcgatgacgg ctacgaggcc gctgccagct 361 ccaaaactag ctcaggcgat gcctcctcac tcagcatcga ggagactaac aaactccggg 421 caaagttggg gctgaaaccc ttggaggtta atgccatcaa gaaggaggcg ggcaccaagg 481 aggagcccgt gacagctgat gtcatcaacc ctatggcctt gcgacagcga gaggagctgc 541 gggagaagct ggcggctgcc aaggagaagc gcctgctgaa ccaaaagctg gggaagataa 601 agaccctagg agaggatgac ccctggctgg acgacactgc agcctggatc gagaggagcc 661 ggcagctgca gaaggagaag gacctggcag agaagagggc caagttactg gaggagatgg 721 accaagagtt tggtgtcagc actctggtgg aggaggagtt cgggcagagg cggcaggacc 781 tgtacagtgc ccgggacctg cagggcctca ccgtggagca tgccattgat tccttccgag 841 aaggggagac aatgattctt accctcaagg acaaaggcgt gctgcaggag gaggaggacg 901 tgctggtgaa cgtgaacctg gtggataagg agcgggcaga gaaaaatgtg gagctgcgga 961 agaagaagcc tgactacctg ccctatgccg aggacgagag cgtggacgac ctggcgcagc 1021 aaaaacctcg ctctatcctg tccaagtatg acgaagagct tgaaggggag cggccacatt 1081 ccttccgctt ggagcagggc ggcacggctg atggcctgcg ggagcgggag ctggaggaga 1141 tccgggccaa gctgcggctg caggctcagt ccctgagcac agtggggccc cggctggcct 1201 ccgaatacct cacgcctgag gagatggtga cctttaaaaa gaccaagcgg agggtgaaga 1261 aaatccgcaa gaaggagaag gaggtagtag tgcgggcaga tgacttgctg cctctcgggg 1321 accagactca ggatggggac tttggttcca gactgcgggg acggggtcgc cgccgagtgt 1381 ccgaagtgga ggaggagaag gagcctgtgc ctcagcccct gccgtcggac gacacccgag 1441 tggagaacat ggacatcagt gatgaggagg aaggtggagc tccaccgccg gggtccccgc 1501 aggtgctgga ggaggacgag gcggagctgg agctgcagaa gcagctggag aagggacgcc 1561 ggctgcgaca gttacagcag ctacagcagc tgcgagacag tggcgagaag gtggtggaga 1621 ttgtgaagaa gctggagtct cgccagcggg gctgggagga ggatgaggat cccgagcgga 1681 agggggccat cgtgttcaac gccacgtccg agttctgccg caccttgggg gagatcccca 1741 cctacgggct ggctggcaat cgcgaggagc aggaggagct catggacttt gaacgggatg 1801 aggagcgctc agccaacggt ggctccgaat ctgacgggga ggagaacatc ggctggagca 1861 cggtgaacct ggacgaggag aagcagcagc aggatttctc tgcttcctcc accaccatcc 1921 tggacgagga accgatcgtg aatagggggc tggcagctgc cctgctcctg tgtcagaaca 1981 aagggctgct ggagaccaca gtgcagaagg tggcccgggt gaaggccccc aacaagtcgc 2041 tgccctcagc cgtgtactgc atcgaggata agatggccat cgatgacaag tacagccgga 2101 gggaggaata ccgaggcttc acacaggact tcaaggagaa ggacggctac aaacccgacg 2161 ttaagatcga atacgtggat gagacgggcc ggaaactcac acccaaggag gctttccggc 2221 agctgtcgca ccgcttccat ggcaagggct caggcaagat gaagacagag cggcggatga 2281 agaagctgga cgaggaggcg ctcctgaaga agatgagctc cagcgacacg cccctgggca 2341 ccgtggccct gctccaggag aagcagaagg ctcagaagac cccctacatc gtgctcagcg 2401 gcagcggcaa gagcatgaac gcgaacacca tcaccaagtg acagcgccct cccgccccgg 2461 ccctgcctca accttcatat taaataaagc tccctcctta ttttta // LOCUS AB006202 1313 bp mRNA PRI 01-SEP-1997 DEFINITION Homo sapiens mRNA for small subunit of cytochrome b in succinate dehydrogenase complex, complete cds. ACCESSION AB006202 NID g2351036 KEYWORDS small subunit of cytochrome b in succinate dehydrogenase complex; complex II. SOURCE Homo sapiens liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1313) AUTHORS Kita,K. TITLE Direct Submission JOURNAL Submitted (05-AUG-1997) to the DDBJ/EMBL/GenBank databases. Kiyoshi Kita, The Institute of Medical Science, The University of Tokyo, Department of Parasitology; 4-6-1, Shirokanedai, Minato-ku, Tokyo 108, Japan (E-mail:kitak@ims.u-tokyo.ac.jp, Tel:81-3-5449-5370, Fax:81-3-5449-5410) REFERENCE 2 (sites) AUTHORS Hirawake,H., Taniwaki,M., Kijima,S. and Kita,K. TITLE Cytohrome b in human complex II:cDNA cloning of the component in liver mitochondria and chromosomal assignment of the genes JOURNAL Unpublished (1997) FEATURES Location/Qualifiers source 1..1313 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" CDS 12..491 /note="complex II" /codon_start=1 /product="small subunit of cytochrome b in succinate dehydrogenase complex" /db_xref="PID:d1022913" /db_xref="PID:g2351037" /translation="MAVLWRLSAVCGALGGRALLLRTPVVRPAHISAFLQDRPIPEWC GVQHIHLSPSHHSGSKAASLHWTSERVVSVLLLGLLPAAYLNPCSAMDYSLAAALTLH GHWGLGQVVTDYVHGDALQKAAKAGLLALSALTFAGLCYFNYHDVGICKAVAMLWKL" BASE COUNT 369 a 261 c 259 g 424 t ORIGIN 1 ccaggaacga gatggcggtt ctctggaggc tgagtgccgt ttgcggtgcc ctaggaggcc 61 gagctctgtt gcttcgaact ccagtggtca gacctgctca tatctcagca tttcttcagg 121 accgacctat cccagaatgg tgtggagtgc agcacataca cttgtcaccg agccaccatt 181 ctggctccaa ggctgcatct ctccactgga ctagcgagag ggttgtcagt gttttgctcc 241 tgggtctgct tccggctgct tatttgaatc cttgctctgc gatggactat tccctggctg 301 cagccctcac tcttcatggt cactggggcc ttggacaagt tgttactgac tatgttcatg 361 gggatgcctt gcagaaagct gccaaggcag ggcttttggc actttcagct ttaacctttg 421 ctgggctttg ctatttcaac tatcacgatg tgggcatctg caaagctgtt gccatgctgt 481 ggaagctctg acctttttga cttcatactt tgaagaattg atgtatgcct ctttgcctct 541 gctttgtcat gccattaagc tcacaataag gaagaaataa cagataagtc cattggtgga 601 cagccttctt ctcttaatca caagattatt ttcagaattt aatctttgag gaaaaggttt 661 gagaggaatt atatctaagt tgtgagactg agttctatat tctggtgagt taatggggtt 721 gcctcccagc ttcttataag actcacagta taactaaaca tgatatatca gcttttgcct 781 ttcaatttat caatctctta aagagaatcc aactttatta cgattagtat atgatcaaac 841 ttccatattt gccttgggaa taatggacaa agggaaatac tcttaattca tgaataaaaa 901 ctttgcagaa aattagacag tgtttaattt tcgaaaactt ccctctctag acagtagata 961 ccacctactg atggttacat atactaggga aattttaaaa ttaggaaatg ctgatagctc 1021 atattataaa tttctaaatc ctaggaagaa acgcttggag tgcttctgaa tatacagaag 1081 ttccatttaa gggcaagttt ccccgtagat gtatcaaaat actaccaact gtaaattgag 1141 atttaattcc caaatgtatt ctacttgttc taaaacaatc tgtccacaaa tataaaacta 1201 taagtaataa attgttattt tcgcacaatg ggaatctcta atgtgaaaat gtattctatg 1261 aaaataattt tttaaataaa atgttatata ataaaagtgt cttctatgct ttt // LOCUS AB006623 3425 bp mRNA PRI 24-OCT-1997 DEFINITION Homo sapiens mRNA for KIAA0285 gene, complete cds. ACCESSION AB006623 NID g2564317 KEYWORDS KIAA0285. SOURCE Homo sapiens Brain cDNA to mRNA, clone_lib:pSPORT 1 clone:HA6864. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3425) AUTHORS Ohara,O., Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes JOURNAL Published Only in DataBase (1997) In press REFERENCE 2 (bases 1 to 3425) AUTHORS Ohara,O., Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N. and Nomura,N. TITLE Direct Submission JOURNAL Submitted (20-AUG-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure I; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) COMMENT Sequence updated (26-Aug-1997). FEATURES Location/Qualifiers source 1..3425 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HA6864" /clone_lib="pSPORT 1" /tissue_type="Brain" 5'UTR 1..389 /gene="KIAA0285" gene 1..3425 /gene="KIAA0285" CDS 390..911 /gene="KIAA0285" /note="No similarities to any reported proteins" /codon_start=1 /db_xref="PID:d1023830" /db_xref="PID:g2564318" /translation="MPDGTIVTTVTTVQSRPRIDGKLDSPSRSPSKVEVTEKTTTVLS ESSGPSNTSHSSSREWGMGCMSVGWPWCPSIKKYSGWDRRSSNPRKSQGEVRLFILCT PSRGQPPFQRLGPCSRDSDSPADRAQWAGGQEDTHQAQHSHHLWCFQGAHCSGRVGAI PGLCGIPGSLSAG" 3'UTR 1194..3425 /gene="KIAA0285" BASE COUNT 793 a 920 c 908 g 804 t ORIGIN 1 gggatctggg cccccagagc cgggagctga ccctcaaagt gctgaggagc agcagctgtg 61 gagacaccga actcctaggc caggccacac tgcctgtggg ctccccctcc agaccactgt 121 ctcgaagaca gttgtgccca ctcaccccag ggccagggaa agccctggga ccagcagcca 181 ccatggcagt ggaggtgaga agctgacccc tggggtaggt gggaggacac aggggatggg 241 cagtctccga ggctctgtct cttctctctc ccacccctgg gcagcttcac tatgaggagg 301 gctctccccg gaacctgggt actcccacct cctccactcc acgccccagc atcacaccta 361 ccaagaagat tgagcttgac cggaccatca tgcccgatgg caccattgtc accacagtca 421 ccactgtcca gtcccggccc cgtatagacg gcaaattaga ctccccctcc cgctccccgt 481 ccaaggtgga ggtgaccgag aagacgacaa ctgtgctgag tgagagcagt ggccccagca 541 atacctccca tagcagcagc cgtgagtggg gaatggggtg catgagtgtg ggttggccct 601 ggtgccccag catcaaaaag tactctgggt gggataggag gagttccaat ccaagaaaga 661 gccagggaga agtcaggtta ttcattctct gcacccctag caggggacag ccacctttcc 721 aacggcttgg accctgtagc agagacagcg attcgccagc tgacagagcc cagtgggcgg 781 gtggccaaga agacacccac caagcgcagc actctcatca tctctggtgt ttccaaggtg 841 cccattgctc aggacgagtt ggcgctatcc ctgggctatg cggcatccct ggaagcctca 901 gtgcaggatg atgcagggac cagcggaggc ccctcttcac ctccctcaga cccaccagcc 961 atgtctccag gaccgctaga tgccctctct agtcccacaa gtgtccagga agcagacgag 1021 acaacccgtt cggatatttc tgagaggcca tctgtggatg atattgagtc ggaaacgggg 1081 tccactggtg ccctggagac ccgcagcctc aaggatcaca aagtgagttt cctgcgcagc 1141 ggcactaagc tcatcttccg ccggaggcct aggcagaagg aagctggcct gagccaatca 1201 cacgatgacc tctccaacgc aacggccacg cccagtgtcc gaaagaaggc cggcagcttt 1261 tctcgccgcc ttatcaagcg cttttccttc aaatccaaac ccaaggccaa tggtaacccc 1321 agcccccagc tctgaggacc cagctctgaa agggcacgag ttctctcagc ccattcccca 1381 cctccccttc catacccctt cctggatctc cagtgcctgg gccaggaaag ccctctgggt 1441 tccgggaagc cccgtccacc ctgggccatg gggccggttg gaaggatact tggaacggga 1501 agcacatgag aggtgggcac ccggtgccga ggacatggac gagggactgg tggctgggag 1561 ggagaggagg gccctgtccg gcatgtgtgg gtattcccca gaagcatttg cctcctgctg 1621 agcctggtcc ctgagcggag tcccagggtg ctcagctctt cagctgaccc ttcttccctt 1681 atttattctc ttttctattt atatgtgtgg cttaggaccc tccgtgaaca gatgatagag 1741 ggcatctctc ccaggtgacc cttcttttct gtcccaggag ggtgggtaat tccctttggg 1801 atggggctcc cacacctccc tcaggtcccc actcagacca gcaccagtgt ctgcctctga 1861 gaatgttggc agctcacaga gagcagggcc ggcccgggat ggggggcagg tactccccac 1921 cttcctgcct cccctcctgc tcctcatccc tccctccccc tttattaccg ttttttgtac 1981 ttgatgcctt ctctgtgagc agtggctctg tgggaaggag ggagccggga gcctggtggg 2041 aagccttccc cagagagatg gctttagggg ctttatttaa agactgtgat gatggagcca 2101 cgcaaggctg cacctctgtg tgttgggaga cgatgatgat gtccattgct gtgtgatggc 2161 ttggaattta atttattaaa gtcaaattgg agtttataaa ctggacaact ggttatcctt 2221 tgaaaggcag taggcagcca ggctgtgaca tggatggtgt gggaggatga gacaggggcc 2281 cggataatga ggttgggtag atgacacaca ttgtggatct gctaagaagt tcctgcagga 2341 aagaaggggt gtgcagagaa aggatggaag agacaggagg ctctggagta tgaaattgta 2401 gcaaaaaact caatcaacta gtacaggtta gtcttcattc ccctttctgg aacaggctgc 2461 tccttctagg atgtaatgga gatcaaagct tggaccctgt aactagactg cctgcctggg 2521 tttgagtccc agctcttcta attacttgtg tgaccttggg caagttatgt aacctctaag 2581 tgcctcagtt ttctcatctg taaaatggga acttcaatca tcatggctat catttggttg 2641 tggatgagaa gtaaatgaag tgctgagtgt taactgttgc ctcagaagaa actgggaggg 2701 ggaagtctta gaagtgtttc aacataataa aagctctctc attctgaaat gtgggcaaat 2761 ttcagggatt gatagatcct agcaaactgt gtccctggtt cccgtgcttg agatcttata 2821 caggacttga accaggtaag ctgctacagg atattgggca gaatgcagga ccaggaagca 2881 agaaaactga gtctgccaca gacaaaccaa gtgatcttga gcaggcaact tctcctttgg 2941 ggacctcagt ttcctcatat ctaaaataag ggtgacctgg ataaacattc aggtcacttc 3001 cagccttgac attctatgac tacctatatc tcagggtgtt tagggcacat gcctttgggt 3061 agatagcatg tggaaattgt tactattccc catgctcctg tccacactac tcaccaaata 3121 tgagatatcc gggaagaatg gagggtatag attttttttt tcaattagat aatttagttt 3181 tatatatttg gggggacaag tgcaggtttc ttacatgcat atattgcatg catagaggtg 3241 aagtctgggc tttttttagt gtgcgcatcg cccaaatagt gaacatcata cccaataggt 3301 agtttttcaa ccctcaccct cctcccacct tctcaccttt catagtctcc gatgtctttt 3361 attccactct atatgtccat gtatacccat tatttagctc ccacttataa atgagaatat 3421 gccat // LOCUS AB006626 8459 bp mRNA PRI 24-OCT-1997 DEFINITION Homo sapiens mRNA for KIAA0288 gene, complete cds. ACCESSION AB006626 NID g2564323 KEYWORDS KIAA0288. SOURCE Homo sapiens Brain cDNA to mRNA, clone_lib:pSPORT 1 clone:HA6116. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8459) AUTHORS Ohara,O., Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes JOURNAL Published Only in DataBase (1997) In press REFERENCE 2 (bases 1 to 8459) AUTHORS Ohara,O., Nagase,T., Ishikawa,K., Nakajima,D., Ohira,M., Seki,N. and Nomura,N. TITLE Direct Submission JOURNAL Submitted (20-AUG-1997) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure I; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:+81-438-52-3930, Fax:+81-438-52-3931) FEATURES Location/Qualifiers source 1..8459 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HA6116" /clone_lib="pSPORT 1" /tissue_type="Brain" 5'UTR 1..1143 /gene="KIAA0288" gene 1..8459 /gene="KIAA0288" CDS 1144..4047 /gene="KIAA0288" /codon_start=1 /db_xref="PID:d1023833" /db_xref="PID:g2564324" /translation="MLAMKHQQELLEHQRKLERHRQEQELEKQHREQKLQQLKNKEKG KESAVASTEVKMKLQEFVLNKKKALAHRNLNHCISSDPRYWYGKTQHSSLDQSSPPQS GVSTSYNHPVLGMYDAKDDFPLRKTASEPNLKLRSRLKQKVAERRSSPLLRRKDGPVV TALKKRPLDVTDSACSSAPGSGPSSPNNSSGSVSAENGIAPAVPSIPAETSLAHRLVA REGSAAPLPLYTSPSLPNITLGLPATGPSAGTAGQQDTERLTLPALQQRLSLFPGTHL TPYLSTSPLERDGGAAHSPLLQHMVLLEQPPAQAPLVTGLGALPLHAQSLVGADRVSP SIHKLRQHRPLGRTQSAPLPQNAQALQHLVIQQQHQQFLEKHKQQFQQQQLQMNKIIP KPSEPARQPESHPEETEEELREHQALLDEPYLDRLPGQKEAHAQAGVQVKQEPIESDE EEAEPPREVEPGQRQPSEQELLFRQQALLLEQQRIHQLRNYQASMEAAGIPVSFGGHR PLSRAQSSPASATFPVSVQEPPTKPRFTTGLVYDTLMLKHQCTCGSSSSHPEHAGRIQ SIWSRLQETGLRGKCECIRGRKATLEELQTVHSEAHTLLYGTNPLNRQKLDSKKLLGS LASVFVRLPCGGVGVDSDTIWNEVHSAGAARLAVGCVVELVFKVATGELKNGFAVVRP PGHHAEESTPMGFCYFNSVAVAAKLLQQRLSVSKILIVDWDVHHGNGTQQAFYSDPSV LYMSLHRYDDGNFFPGSGAPDEVGTGPGVGFNVNMAFTGGLDPPMGDAEYLAAFRTVV MPIASEFAPDVVLVSSGFDAVEGHPTPLGGYNLSARCFGYLTKQLMGLAGGRIVLALE GGHDLTAICDASEACVSALLGNELDPLPEKVLQQRPNANAVRSMEKVMEIHSKYWRCL QRTTSTAGRSLIEAQTCENEEAETVTAMASLSVGVKPAEKRPDEEPMEEEPPL" 3'UTR 4048..8459 /gene="KIAA0288" BASE COUNT 1852 a 2374 c 2338 g 1895 t ORIGIN 1 ggaggttgtg gggccgccgc cgcggagcac cgtccccgcc gccgcccgag cccgagcccg 61 agcccgcgca cccgcccgcg ccgccgccgc cgccgcccga acagcctccc agcctgggcc 121 cccggcggcg ccgtggccgc gtcccggctg tcgccgcccg agcccgagcc cgcgcgccgg 181 cgggtggcgg cgcaggctga ggagatgcgg cgcggagcgc cggagcaggg ctagagccgg 241 ccgccgccgc ccgccgcggt aagcgcagcc ccggcccggc gcccgcgggc cattgtccgc 301 cgcccgcccc gcgccccgcg cagcctgcag gccttggagc ccgcggcagg tggacgccgc 361 cggtccacac ccgccccgcg cgcggccgtg ggaggcgggg gccagcgctg gccgcgcgcc 421 gtgggacccg ccggtcccca gggccgcccg gccccttctg gacctttcca cccgcgccgc 481 gaggcggctt cgcccgccgg ggcgggggcg cgggggtggg cacggcaggc agcggcgccg 541 tctcccggtg cggggcccgc gccccccgag caggttcatc tgcagaagcc agcggacgcc 601 tctgttcaac ttgtgggtta cctggctcat gagaccttgc cggcgaggct cggcgcttga 661 acgtctgtga cccagccctc accgtcccgg tacttgtatg tgttggtggg agtttggagc 721 tcgttggagc tatcgtttcc gtggaaattt tgagccattt cgaatcactt aaaggagtgg 781 acattgctag caatgagctc ccaaagccat ccagatggac tttctggccg agaccagcca 841 gtggagctgc tgaatcctgc ccgcgtgaac cacatgccca gcacggtgga tgtggccacg 901 gcgctgcctc tgcaagtggc cccctcggca gtgcccatgg acctgcgcct ggaccaccag 961 ttctcactgc ctgtggcaga gccggccctg cgggagcagc agctgcagca ggagctcctg 1021 gcgctcaagc agaagcagca gatccagagg cagatcctca tcgctgagtt ccagaggcag 1081 cacgagcagc tctcccggca gcacgaggcg cagctccacg agcacatcaa gcaataacag 1141 gagatgctgg ccatgaagca ccagcaggag ctgctggaac accagcggaa gctggagagg 1201 caccgccagg agcaggagct ggagaagcag caccgggagc agaagctgca gcagctcaag 1261 aacaaggaga agggcaaaga gagtgccgtg gccagcacag aagtgaagat gaagttacaa 1321 gaatttgtcc tcaataaaaa gaaggcgctg gcccaccgga atctgaacca ctgcatttcc 1381 agcgaccctc gctactggta cgggaaaacg cagcacagtt cccttgacca gagttctcca 1441 ccccagagcg gagtgtcgac ctcctataac cacccggtcc tgggaatgta cgacgccaaa 1501 gatgacttcc ctcttaggaa aacagcttct gaaccgaatc tgaaattacg gtccaggcta 1561 aagcagaaag tggccgaaag acggagcagc cccctgttac gcaggaaaga cgggccagtg 1621 gtcactgctc taaaaaagcg tccgttggat gtcacagact ccgcgtgcag cagcgcccca 1681 ggctccggac ccagctcacc caacaacagc tccgggagcg tcagcgcgga gaacggtatc 1741 gcgcccgccg tccccagcat cccggcggag acgagtttgg cgcacagact tgtggcacga 1801 gaaggctcgg ccgctccact tcccctctac acatcgccat ccttgcccaa catcacgctg 1861 ggcctgcctg ccaccggccc ctctgcgggc acggcgggcc agcaggacac cgagagactc 1921 acccttcccg ccctccagca gaggctctcc cttttccccg gcacccacct cactccctac 1981 ctgagcacct cgcccttgga gcgggacgga ggggcagcgc acagccctct tctgcagcac 2041 atggtcttac tggagcagcc accggcacaa gcacccctcg tcacaggcct gggagcactg 2101 cccctccacg cacagtcctt ggttggtgca gaccgggtgt ccccctccat ccacaagctg 2161 cggcagcacc gcccactggg gcggacccag tcggccccgc tgccccagaa cgcccaggct 2221 ctgcagcacc tggtcatcca gcagcagcat cagcagtttc tggagaaaca caagcagcag 2281 ttccagcagc agcaactgca gatgaacaag atcatcccca agccaagcga gccagcccgg 2341 cagccggaga gccacccgga ggagacggag gaggagctcc gtgagcacca ggctctgctg 2401 gacgagccct acctggaccg gctgccgggg cagaaggagg cgcacgcaca ggccggcgtg 2461 caggtgaagc aggagcccat tgagagcgat gaggaagagg cagagccccc acgggaggtg 2521 gagccgggcc agcgccagcc cagtgagcag gagctgctct tcagacagca agccctcctg 2581 ctggagcagc agcggatcca ccagctgagg aactaccagg cgtccatgga ggccgccggc 2641 atccccgtgt ccttcggcgg ccacaggcct ctgtcccggg cgcagtcctc acccgcgtct 2701 gccaccttcc ccgtgtctgt gcaggagccc cccaccaagc cgaggttcac gacaggcctc 2761 gtgtatgaca cgctgatgct gaagcaccag tgcacctgcg ggagtagcag cagccacccc 2821 gagcacgccg ggaggatcca gagcatctgg tcccgcctgc aggagacggg cctccggggc 2881 aaatgcgagt gcatccgcgg acgcaaggcc accctggagg agctacagac ggtgcactcg 2941 gaagcccaca ccctcctgta tggcacgaac cccctcaacc ggcagaaact ggacagtaag 3001 aaacttctag gctcgctcgc ctccgtgttc gtccggctcc cttgcggtgg tgttggggtg 3061 gacagtgaca ccatatggaa cgaggtgcac tcggcggggg cagcccgcct ggctgtgggc 3121 tgcgtggtag agctggtctt caaggtggcc acaggggagc tgaagaatgg ctttgctgtg 3181 gtccgccccc ctggacacca tgcggaggag agcacgccca tgggcttttg ctacttcaac 3241 tccgtggccg tggcagccaa gcttctgcag cagaggttga gcgtgagcaa gatcctcatc 3301 gtggactggg acgtgcacca tggaaacggg acccagcagg ctttctacag cgaccctagc 3361 gtcctgtaca tgtccctcca ccgctacgac gatgggaact tcttcccagg cagcggggct 3421 cctgatgagg tgggcacagg gcccggcgtg ggtttcaacg tcaacatggc tttcaccggc 3481 ggcctggacc cccccatggg agacgctgag tacttggcgg ccttcagaac ggtggtcatg 3541 ccgatcgcca gcgagtttgc cccggatgtg gtgctggtgt catcaggctt cgatgccgtg 3601 gagggccacc ccacccctct tgggggctac aacctctccg ccagatgctt cgggtacctg 3661 acgaagcagc tgatgggcct ggctggcggc cggattgtcc tggccctcga gggaggccac 3721 gacctgaccg ccatttgcga cgcctcggaa gcatgtgttt ctgccttgct gggaaacgag 3781 cttgatcctc tcccagaaaa ggttttacag caaagaccca atgcaaacgc tgtccgttcc 3841 atggagaaag tcatggagat ccacagcaag tactggcgct gcctgcagcg cacaacctcc 3901 acagcggggc gttctctgat cgaggctcag acttgcgaga acgaagaagc cgagacggtc 3961 accgccatgg cctcgctgtc cgtgggcgtg aagcccgccg aaaagagacc agatgaggag 4021 cccatggaag aggagccgcc cctgtagcac tccctcgaag ctgctgttct cttgtctgtc 4081 tgtctctgtc ttgaagctca gccaagaaac tttcccgtgt cacgcctgcg tcccaccgtg 4141 gggctctctt ggagcaccca gggacaccca gcgtgcaaca gccacgggaa gcctttctgc 4201 cgcccaggcc cacaggtctc gagacgcaca tgcacgcctg ggcgtggcag cctcacaggg 4261 aacacgggac agacgccggc gacgcgcaga cacacggaca cgcggaagcc aagcacactc 4321 tggcgggtcc cgcaagggac gccgtggaag aaaggagcct gtggcaacag gcggccgagc 4381 tgccgaattc agttgacacg aggcacagaa aacaaatatc aaagatctaa taatacaaaa 4441 caaacttgat taaaactggt gcttaaagtt tattacccac aactccacag tctctgtgta 4501 aaccactcga ctcatcttgt agcttatttt ttttttaaag aggacgtttt ctacggctgt 4561 ggcccgcctc tgtgaaccat agcggtgtgc ggcggggggt ctgcacccgg gtgggggaca 4621 gagggacctt taaagaaaac aaaactggac agaaacagga atgtgagctg ggggagctgg 4681 cttgagtttc tcaaaagcca tcggaagatg cgagtttgtg cctttttttt tattgctctg 4741 gtggattttt gtggctgggt tttctgaagt ctgaggaaca atgccttaag aaaaaacaaa 4801 cagcaggaat cggtgggaca gtttcctgtg gccagccgag cctggcagtg ctggcaccgc 4861 gagctggcct gacgcctcaa gcacgggcac cagccgtcat ctccggggcc aggggctgca 4921 gcccggcggt ccctgttttg ctttattgct gtttaagaaa aatggaggta gttccaaaaa 4981 agtggcaaat cccgttggag gttttgaagt ccaacaaatt ttaaacgaat ccaaagtgtt 5041 ctcacacgtc acatacgatt gagcatctcc atctggtcgt gaagcatgtg gtaggcacac 5101 ttgcagtgtt acgatcggaa tgctttttat taaaagcaag tagcatgaag tattgcttaa 5161 attttaggta taaataaata tatatatgta taatatatat tccaatgtat tccaagctaa 5221 gaaacttact tgattcttat gaaatcttga taaaatattt ataatgcatt tatagaaaaa 5281 gtatatatat atatataaaa tgaatgcaga ttgcgaaggt ccctgcaaat ggatggcttg 5341 tgaatttgct ctcaaggtgc ttatggaaag ggatcctgat tgattgaaat tcatgttttc 5401 tcaagctcca gattggctag atttcagatc gccaacacat tcgccactgg gcaactaccc 5461 tacaagtttg tactttcatt ttaattattt tctaacagaa ccgctcccgt ctccaagcct 5521 tcatgcacat atgtacctaa tgagttttta tagcaaagaa tataaatttg ctgttgattt 5581 ttgtatgaat tttttcacaa aaagatcctg aataagcatt gttttatgaa ttttacattt 5641 ttcctcacca tttagcaatt ttctgaatgg taataatgtc taaatctttt tcctttctga 5701 attcttgctt gtacattttt ttttaccttt caaaggtttt taattatttt tgtttttatt 5761 tttgtacgat gagttttctg cagcgtacag aattgttgct gtcagattct attttcagaa 5821 agtgagagga gggaccgtag gtcttttcgg agtgacacca acgattgtgt ctttcctggt 5881 ctgtcctagg agctgtataa agaagcccag gggctctttt taactttcaa cactagtagt 5941 attacgaggg gtggtgtgtt tttcccctcc gtggcaaggg cagggagggt tgcttaggat 6001 gcccggccac cctgggaggc ttgccagatg ccgggggcag tcagcattaa tgaaactcat 6061 gtttaaactt ctctgaccac atcgtcagga tagaattcta acttgagttt tccaaagacc 6121 ttttgagcat gtcagcaatg catggggcac acgtggggct ctttacccac ttgggttttt 6181 ccactgcagc cacgtggcca gccctggatt ttggagcctg tggctgcaag gaacccaggg 6241 acccttgttg cctggtgaac ctgcagggag ggtatgattg cctgaccagg acagccagtc 6301 tttactcttt ttctcttcaa cagtaactga cagtcacgtt ttactggtaa cttattttcc 6361 agcacatgaa gccaccagtt tcattccaaa gtgtatattg ggttcagact tgggggcaga 6421 agttcagaca caccgtgctc aggagggacc cagagccgag tttcggagtt tggtaaagtt 6481 tacagggtag cttctgaaat taactcaaac ttttgaccaa atgagtgcag attcttggat 6541 tcacttggtc actgggctgc tgatggtcag ctctgagaca gtggtttgag agcaggcaga 6601 acggtcttgg gacttgtttg actttcccct ccctggtggc cactctttgc tctgaagccc 6661 agattggcaa gaggagctgg tccattcccc attcatggca cagagcagtg gcagggccca 6721 gctagcaggc tcttctggcc tccttggcct cattctctgc atagccctct ggggatcctg 6781 ccacctgccc tcttaccccg ccgtggctta tggggaggaa tgcatcatct cacttttttt 6841 ttttaagcag atgatgggat aacatggact gctcagtggc caggttatca gtggggggac 6901 ttaattctaa tctcattcaa atggagacgc cctctgcaaa ggcctggcag ggggaggcac 6961 gtttcatctg tcagctcact ccagcttcac aaatgtgctg agagcattac tgtgtagcct 7021 tttctttgaa gacacactcg gctcttctcc acagcaagcg tccagggcag atggcagagg 7081 atctgcctcg gcgtctgcag gcgggaccac gtcagggagg gttccttcat gtgttctccc 7141 tgtgggtcct tggaccttta gcctttttct tcctttgcaa aggccttggg ggcactggct 7201 gggagtcagc aagcgagcac tttatatccc tttgagggaa accctgatga cgccactggg 7261 cctcttggcg tctgccctgc cctcgcggct tcccgccgtg ccgcagcgtg cccacgtgcc 7321 cacgccccac cagcaggcgg ctgtcccgga ggccgtggcc cgctgggact ggccgcccct 7381 ccccagcgtc ccagggctct ggttctggag ggccactttg tcaaggtgtt tcagtttttc 7441 tttacttctt ttgaaaatct gtttgcaagg ggaaggacca tttcgtaatg gtctgacaca 7501 aaagcaagtt tgatttttgc agcactagca atggactttg ttgtttttct ttttgatcag 7561 aacattcctt ctttactggt cacagccacg tgctcattcc attcttcttt ttgtagactt 7621 tgggcccacg tgttttatgg gcattgatac atatataaat atatagatat aaatatatat 7681 gaatatattt ttttaagttt cctacacctg gaggttgcat ggactgtacg accggcatga 7741 ctttatattg tatacagatt ttgcacgcca aactcggcag ctttggggaa gaagaaaaat 7801 gcctttctgt tcccctctca tgacatttgc agatacaaaa gatggaaatt tttctgtaaa 7861 acaaaacctt gaaggagagg agggcgggga agtttgcgtc ttattgaact tattcttaag 7921 aaattgtact ttttattgta agaaaaataa aaaggactac ttaaacattt gtcatattaa 7981 gaaaaaaagt ttatctagca cttgtgacat accaataata gagtttattg tatttatgtg 8041 gaaacagtgt tttagggaaa ctactcagaa ttcacagtga actgcctgtc tctctcgagt 8101 tgatttggag gaattttgtt ttgttttgtt ttgtttgttt ccttttatct ccttccacgg 8161 gccaggcgag cgccgcccgc cctcactggc cttgtgacgg tttattctga ttgagaactg 8221 ggcggactcg aaagagtccc cttttccgca cagctgtgtt gactttttaa ttacttttag 8281 gtgatgtatg gctaagattt cactttaagc agtcgtgaac tgtgcgagca ctgtggttta 8341 caattatact ttgcatcgaa aggaaaccat ttcttcattg taacgaagct gagcgtgttc 8401 ttagctcggc ctcactttgt ctctggcatt gattaaaagt ctgctattga aagaaaaag // LOCUS AB006679 1432 bp mRNA PRI 22-AUG-1997 DEFINITION Homo sapiens mRNA for ATP binding protein, complete cds. ACCESSION AB006679 D64158 NID g2342476 KEYWORDS ATP binding protein. SOURCE Homo sapiens leukemia cell_line:ME-1 cDNA to mRNA, clone:1-4. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1432) AUTHORS Shiosaka,T. TITLE Direct Submission JOURNAL Submitted (20-AUG-1997) to the DDBJ/EMBL/GenBank databases. Takahiko Shiosaka, Ehime College of Health Science; Takaoda 543, Tobe, Ehime 791-21, Japan (E-mail:ytakaoka@m.ehime-u.ac.jp, Tel:0899-58-2111(ex.452), Fax:0899-58-2177) REFERENCE 2 (sites) AUTHORS Shiosaka,T. TITLE Differential expression of 1-4 gene in functionally distinct ME-1 subclones JOURNAL Unpublished (1996) COMMENT D64158:Submitted (30-Jun-1995). FEATURES Location/Qualifiers source 1..1432 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="ME-1" /clone="1-4" /tissue_type="leukemia" CDS 130..810 /note="APACD:ATP binding protein associated with cell differentiation" /codon_start=1 /product="ATP binding protein" /db_xref="PID:d1022739" /db_xref="PID:g2342477" /translation="MEADASVDMFSKVLEHQLLQTTKLVEEHLDSEIQKLDQMDEDEL ERLKEKRLQALRKAQQQKQEWLSKGHGEYREIPSERDFFQEVKESENVVCHFYRDSTF RCKILDRHLAILSKKHLETNFLKLNVEKAPFLCERLHIKVIPTLALLKDGKTQDYVVG FTDLGNTDDFTTETLEWRLGSSDILNYSGNLMEPPFQNQKKFGTNFTKLEKKTMRGKK YDSDSDDD" BASE COUNT 469 a 256 c 277 g 429 t 1 others ORIGIN 1 ttgcagccgc cggcagctac tgcaaggcaa aagccggagt ggacgtgtct tttgaaactg 61 ctgctctttc acttctcagg cgtcaccgag agctcagcac ccaggctgaa ctctgtacca 121 tttggaagaa tggaagctga tgcatctgtt gacatgtttt ccaaagtcct ggagcatcag 181 ctgcttcaga ctaccaaact ggtggaagaa catttggatt ctgaaattca aaaactggat 241 cagatggatg aggatgaatt ggaacgcctt aaagaaaaga gactccaggc actaaggaaa 301 gctcaacagc agaaacaaga atggctttct aaaggacatg gggaatacag agaaatccct 361 agtgaaagag acttttttca agaagtcaag gagagtgaaa atgtggtttg ccatttctac 421 agagactcca cattcaggtg taaaatacta gacagacatc tggcaatatt gtccaagaaa 481 cacctcgaga ccaatttttt gaagctgaat gtggaaaaag cacctttcct ttgtgagaga 541 ctgcatatca aagtcattcc cacactagca ctgctaaaag atgggaaaac acaagattat 601 gttgttgggt ttactgacct aggaaataca gatgacttca ccacagaaac tttagaatgg 661 aggctcggtt cttctgacat tcttaattac agtggaaatt taatggagcc accatttcag 721 aaccaaaaga aatttggaac aaacttcaca aagctggaaa agaaaactat gcgaggaaag 781 aaatatgatt cagactctga tgatgattag agctcaataa ttctttgtaa attgtctttt 841 tttttctgct tcagatttaa atgtgttttt aaaattctat taatgtctat acattggtca 901 cctaaatact catattctcg agttttatac agttgtatca catcgaaaag tgtctttact 961 gttttctgtg tggccatcat gtttaagttg aggaaactca gttcttaaat tatctgggaa 1021 gggtctggat tctctatttt tgagattgac tttatcacaa tatgattctt acatctttat 1081 accatttaca attgtgtttt agatctacag agttagaaat tcgraaacta ttccaggact 1141 aattcttaat cggcattatt tatacaagag gtcaagtaac atttactagc gcaatactgc 1201 acttgtaaat gaattataaa cgctcttctg gaatatattt aaataaccat taaagaactg 1261 cttattcatt ctggacactg catgttgatg ttgaatcaac tgatgccagc agaaagctat 1321 tttgatttgt gaacatactg ccttatttaa agggtcctga ttgcttgtat tttaagacat 1381 tcattaaaaa gaaaccagga aacacttttg aaataacagc ataaggaact tc // LOCUS AB006682 2027 bp mRNA PRI 13-DEC-1997 DEFINITION Homo sapiens APECED mRNA for AIRE-1, complete cds. ACCESSION AB006682 NID g2696614 KEYWORDS APECED; AIRE-1. SOURCE Homo sapiens (isolate:Caucasian) 3-yr-old male thymus cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Nagamine,K., Peterson,P., Scott,H.S., Kudoh,J., Minoshima,S., Heino,M., Krohn,K.J.E., Lalioti,M.D., Mullis,P.E., Antonarakis,S.E., Kawasaki,K., Asakawa,S., Ito,F. and Shimizu,N. TITLE Positional cloning of the APECED gene JOURNAL Nature Genet. 17 (4), 393-398 (1997) MEDLINE 98061086 REFERENCE 2 (bases 1 to 2027) AUTHORS Shimizu,N. TITLE Direct Submission JOURNAL Submitted (16-AUG-1997) to the DDBJ/EMBL/GenBank databases. Nobuyoshi Shimizu, Keio University School of Medicine, Department of Molecular Biology; 35 Shinanomachi, Shinjuku-ku, Tokyo 160, Japan (E-mail:shimizu@dmb.med.keio.ac.jp, Tel:03-3351-2370, Fax:03-3351-2370) FEATURES Location/Qualifiers source 1..2027 /organism="Homo sapiens" /isolate="Caucasian" /db_xref="taxon:9606" /chromosome="21" /dev_stage="3-yr-old" /map="21q22.3" /sex="male" /tissue_type="thymus" gene 128..1765 /gene="APECED" CDS 128..1765 /gene="APECED" /note="autoimmune regulator-1" /codon_start=1 /product="AIRE-1" /db_xref="PID:d1024894" /db_xref="PID:g2696615" /translation="MATDAALRRLLRLHRTEIAVAVDSAFPLLHALADHDVVPEDKFQ ETLHLKEKEGCPQAFHALLSWLLTQDSTAILDFWRVLFKDYNLERYGRLQPILDSFPK DVDLSQPRKGRKPPAVPKALVPPPRLPTKRKASEEARAAAPAALTPRGTASPGSQLKA KPPKKPESSAEQQRLPLGNGIQTMSASVQRAVAMSSGDVPGARGAVEGILIQQVFESG GSKKCIQVGGEFYTPSKFEDSGSGKNKARSSSGPKPLVRAKGAQGAAPGGGEARLGQQ GSVPAPLALPSDPQLHQKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGT WRCSSCLQATVQEVQPRAEEPRPQEPPVETPLPPGLRSAGEEVRGPPGEPLAGMDTTL VYKHLPAPPSAAPLPGLDSSALHPLLCVGPEGQQNLAPGARCGVCGDGTDVLRCTHCA AAFHWRCHFPAGTSRPGTGLRCRSCSGDVTPAPVEGVLAPSPARLAPGPAKDDTASHE PALHRDDLESLLSEHTFDGILQWAIQSMARPAAPFPS" polyA_signal 2013..2018 BASE COUNT 357 a 716 c 644 g 310 t ORIGIN 1 agacgggcgg gcgcacagcc ggcgcggagg ccccacagcc ccgccgggac ccgaggccaa 61 gcgaggggct gccagtgtcc cgggacccac cgcgtccgcc ccagccccgg gtccccgcgc 121 ccaccccatg gcgacggacg cggcgctacg ccggcttctg aggctgcacc gcacggagat 181 cgcggtggcc gtggacagcg ccttcccact gctgcacgcg ctggctgacc acgacgtggt 241 ccccgaggac aagtttcagg agacgcttca tctgaaggaa aaggagggct gcccccaggc 301 cttccacgcc ctcctgtcct ggctgctgac ccaggactcc acagccatcc tggacttctg 361 gagggtgctg ttcaaggact acaacctgga gcgctatggc cggctgcagc ccatcctgga 421 cagcttcccc aaagatgtgg acctcagcca gccccggaag gggaggaagc ccccggccgt 481 ccccaaggct ttggtaccgc cacccagact ccccaccaag aggaaggcct cagaagaggc 541 tcgagctgcc gcgccagcag ccctgactcc aaggggcacc gccagcccag gctctcaact 601 gaaggccaag ccccccaaga agccggagag cagcgcagag cagcagcgcc ttccactcgg 661 gaacgggatt cagaccatgt cagcttcagt ccagagagct gtggccatgt cctccgggga 721 cgtcccggga gcccgagggg ccgtggaggg gatcctcatc cagcaggtgt ttgagtcagg 781 cggctccaag aagtgcatcc aggttggcgg ggagttctac actcccagca agttcgaaga 841 ctccggcagt gggaagaaca aggcccgcag cagcagtggc ccgaagcctc tggttcgagc 901 caagggagcc cagggcgctg cccccggtgg aggtgaggct aggctgggcc agcagggcag 961 cgttcccgcc cctctggccc tccccagtga cccccagctc caccagaaga atgaggacga 1021 gtgtgccgtg tgtcgggacg gcggggagct catctgctgt gacggctgcc ctcgggcctt 1081 ccacctggcc tgcctgtccc ctccgctccg ggagatcccc agtgggacct ggaggtgctc 1141 cagctgcctg caggcaacag tccaggaggt gcagccccgg gcagaggagc cccggcccca 1201 ggagccaccc gtggagaccc cgctcccccc ggggcttagg tcggcgggag aggaggtaag 1261 aggtccacct ggggaacccc tagccggcat ggacacgact cttgtctaca agcacctgcc 1321 ggctccgcct tctgcagccc cgctgccagg gctggactcc tcggccctgc accccctact 1381 gtgtgtgggt cctgagggtc agcagaacct ggctcctggt gcgcgttgcg gggtgtgcgg 1441 agatggtacg gacgtgctgc ggtgtactca ctgcgccgct gccttccact ggcgctgcca 1501 cttcccagcc ggcacctccc ggcccgggac gggcctgcgc tgcagatcct gctcaggaga 1561 cgtgacccca gcccctgtgg agggggtgct ggcccccagc cccgcccgcc tggcccctgg 1621 gcctgccaag gatgacactg ccagtcacga gcccgctctg cacagggatg acctggagtc 1681 ccttctgagc gagcacacct tcgatggcat cctgcagtgg gccatccaga gcatggcccg 1741 tccggcggcc cccttcccct cctgacccca gatggccggg acatgcagct ctgatgagag 1801 agtgctgaga aggacacctc cttcctcagt cctggaagcc ggccggctgg gatcaagaag 1861 gggacagcgc cacctcttgt cagtgctcgg ctgtaaacag ctctgtgttt ctggggacac 1921 cagccatcat gtgcctggaa attaaaccct gccccacttc tctactctgg aagtccccgg 1981 gagcctctcc ttgcctggtg acctactaaa aatataaaaa ttagctg // LOCUS AB006713 2699 bp mRNA PRI 22-AUG-1997 DEFINITION Homo sapiens mRNA for dihydropyrimidinase related protein 4, complete cds. ACCESSION AB006713 NID g2342485 KEYWORDS dihydropyrimidinase related protein 4. SOURCE Homo sapiens fetus brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2699) AUTHORS Hamajima,N. TITLE Direct Submission JOURNAL Submitted (19-AUG-1997) to the DDBJ/EMBL/GenBank databases. Naoki Hamajima, Nagoya City University Medical School, Department of Pediatrics; 1 Kawasumi, Mizuho-cho, Mizuho-ku, Nagoya, Aichi 467, Japan (E-mail:hamajima@med.nagoya-cu.ac.jp, Tel:+81-52-853-8246, Fax:+81-52-842-3449) REFERENCE 2 (sites) AUTHORS Hamajima,N., Kato,Y., Kouwaki,M., Wada,Y., Sasaski,M. and Nonaka,M. TITLE Novel members of dihydropyrimidinase related protein family JOURNAL Unpublished (1997) REFERENCE 3 (sites) AUTHORS Hamajima,N., Matsuda,K., Sakata,S., Tamaki,N., Sasaki,M. and Nonaka,M. TITLE A novel gene family defined by human dihydropyrimidinase and three related proteins with differential tissue distribution JOURNAL Gene 180 (1-2), 157-163 (1996) MEDLINE 97128821 FEATURES Location/Qualifiers source 1..2699 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="brain" CDS 149..1867 /codon_start=1 /product="dihydropyrimidinase related protein 4" /db_xref="PID:d1022744" /db_xref="PID:g2342486" /translation="MSFQGKKSIPRITSDRLLIRGGRIVNDDQSFYADVHVEDGLIKQ IGENLIVPGGIKTIDAHGLMVLPGGVDVHTRLQMPVLGMTPADDFCQGTKAALAGGTT MILDHVFPDTGVSLLAAYERWRERADSAACCDYSLHVDITRWHESIKEELEALVKEKG VNSFLVFMAYKDRCQCSDSQMYEIFSIIRDLGALAQVHAENGDIVEEEQKRLLELGIT GPEGHVLSHPEEVEAEAVYRAVTIAKQANCPLYVTKVMSKGAADAIAQAKRRGVVVFG EPITASLGTDGSHYWSKNWAKAAAFVTSPPVNPDPTTADHLTCLLSSGDLQVTGSAHC TFTTAQKAVGKDNFALIPEGTNGIEERMSMVWEKCVASGKMDENEFVAVTSTNAAKIF NFYPRKGRVAVGSDADLVIWNPKATKIISAKTHNLNVEYNIFEGVECRGAPAVVISQG RVALEDGKMFVTPGAGRFVPRKTFPDFVYKRIKARNRLAEIHGVPRGLYDGPVHEVMV PAKPGSGAPARASCPGKISVPPVRNLHQSGFSLSGSQADDHIARRTAQKIMAPPGGRS NITSLS" polyA_site 2699 /note="25 A nucleotides" BASE COUNT 535 a 864 c 830 g 470 t ORIGIN 1 gcccagcggg ggcgggactg gaacggagcc gtgcggcccc gcgcgctcgc agtctgtctc 61 ccgccgtccc cacgcacgcg tcccggctca cgcgtccgcc cgcccgcccc cgcttgtgcc 121 gcccctacca gagaccccca ggagcaggat gtccttccag ggcaagaaaa gcatcccccg 181 gatcacgagt gaccgccttc tgatcagagg tgggaggatc gtgaatgacg accagtcctt 241 ttacgctgat gtgcacgtgg aagatggctt gataaaacaa atcggagaaa acctcatcgt 301 ccctgggggc atcaagacca ttgacgccca cggcctgatg gtccttcctg gtggcgttga 361 cgtccacaca aggctgcaga tgcctgtcct gggcatgaca ccggctgacg acttctgtca 421 gggcaccaag gcagcgctag caggaggaac caccatgatc ttggaccacg tcttccccga 481 cacgggtgtg agcctgctgg cggcctacga gcggtggcgg gagcgggcgg acagcgcggc 541 ctgctgcgac tactccctgc acgtggacat cacccgatgg catgagagca tcaaggagga 601 gctggaggcc ctggtcaagg agaagggtgt gaactccttc ctggtcttca tggcatacaa 661 ggaccggtgc cagtgcagcg acagccagat gtacgagatc ttcagcatca tccgggacct 721 gggggccttg gcccaggtgc acgctgagaa cggggacatc gtggaggagg agcagaagcg 781 gttgctggag ctcggcatca ctggccccga gggccacgtg ctcagccacc ccgaggaggt 841 ggaggctgag gcggtgtacc gagctgtcac catcgccaag caggcaaact gcccgctgta 901 cgtcaccaag gtgatgagca agggggcggc cgacgccatc gctcaggcca agcgcagagg 961 ggtggtcgtg tttggggagc ccatcaccgc cagcctgggc accgacggtt cacactactg 1021 gagcaagaac tgggccaagg ccgcagcctt cgtcacatca ccccctgtca acccagaccc 1081 caccacggca gaccacctca cctgcttgct gtccagcggg gacctccagg tgacaggcag 1141 cgcccactgc accttcacca ctgcccagaa ggctgtgggc aaggacaact tcgcgctgat 1201 ccccgagggc accaacggca ttgaggagcg catgtcgatg gtctgggaga aatgtgtggc 1261 ctctgggaag atggacgaga atgagttcgt cgcggtgacc agtacaaatg ctgccaaaat 1321 cttcaatttt tacccaagga aggggcgagt ggctgtgggc tctgacgctg acctggtcat 1381 atggaacccc aaggccacca agatcatctc tgccaagacc cacaatctga acgtggagta 1441 caacatcttc gagggagtgg agtgccgggg agcgcctgcc gtggtcataa gtcagggccg 1501 agtggcgctg gaggacggga agatgtttgt caccccgggg gcgggccgct tcgtccctcg 1561 gaaaacattc ccggactttg tctacaagag gatcaaagct cgcaacaggc tggcggagat 1621 ccacggtgtg ccccgtgggc tgtatgacgg gcccgtccac gaggtgatgg tgcctgccaa 1681 gccagggagt ggcgctccgg cccgcgcgtc ctgcccaggc aagatctccg tgcctcctgt 1741 gcgcaaccta catcagtcgg ggttcagcct atctgggtct caggctgatg accacatcgc 1801 ccgacgcaca gcacagaaga tcatggcacc acctggcggc cgctccaaca tcacctctct 1861 ctcctagacg cccaggaccg gccctgtgag ccgtgctggc cccacccgag gccgcggggg 1921 ccccagggca ctcgcccccc tccttagcat tttcttttgt agaagtttct cgaaggtgct 1981 tggcggtctt gccttccccc tccccacagg ctctccttgt ggggtcccag gtcctgctgc 2041 caagagcccc tcaagagaag ggctgaacct ggggagatgt cactgccagg gtgaggtgga 2101 gccacatggc agggacaatg ccggcagcct gagcccaggc accccagtgc ccgctgggcc 2161 cagcctgggg acagggaacc tgccgggctc acagtgtggg agcagctgga caccaggctt 2221 cttggtgaac cggcgagggg ccgagtcccg cctggtgggc atttgctgcc gcctccccac 2281 caccagtcac tgcctcgcag agccctacac tcccgcagcc gctcctcaga ggcctgtgcc 2341 catcgcaggc ctgggaggaa agtgggcgca gagccctcct gctcacacag ctgctgagac 2401 ttcagggacc catcagaact tggtgcagca cagccccgcc cgtggagggt cccttttacg 2461 caccccaagg cccacaccta agcttccatg tagccctcat ccagggaagt tttgcgatcc 2521 tttaggaaga cactgtcctc ttattacaga ttgtgtattt ccgtaggctt cttagtagca 2581 gctttgtaca ctgaggacac tgtagccagg aacctgtgca tgccacccac cgcctggaca 2641 ggcagtcatc ctgcctctga tgtgaatcag gcccattaaa gacgtctggg tttgaagcc // LOCUS AB006781 1113 bp mRNA PRI 09-SEP-1997 DEFINITION Homo sapiens mRNA for galectin-4, complete cds. ACCESSION AB006781 NID g2385453 KEYWORDS galectin-4. SOURCE Homo sapiens gastric adenocarcinoma cDNA to mRNA, clone:HP01049. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1113) AUTHORS Kato,S. TITLE Human galectin-4 full-length cDNA cloned from gastric adenocarcinoma JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 1113) AUTHORS Kato,S. TITLE Direct Submission JOURNAL Submitted (27-AUG-1997) to the DDBJ/EMBL/GenBank databases. Seishi Kato, Sagami Chemical Research Center; 4-4-1 Nishi-ohnuma, Sagamihara, Kanagawa 229, Japan (E-mail:seishi@sagami.or.jp, Tel:+81-427-42-5091, Fax:+81-427-42-5091) FEATURES Location/Qualifiers source 1..1113 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HP01049" /tissue_type="gastric adenocarcinoma" CDS 57..1028 /codon_start=1 /product="galectin-4" /db_xref="PID:d1023025" /db_xref="PID:g2385454" /translation="MAYVPAPGYQPTYNPTLPYYQPIPGGLNVGMSVYIQGVASEHMK RFFVNFVVGQDPGSDVAFHFNPRFDGWDKVVFNTLQGGKWGSEERKRSMPFKKGAAFE LVFIVLAEHYKVVVNGNPFYEYGHRLPLQMVTHLQVDGDLQLQSINFIGGQPLRPQGP PMMPPYPGPGHCHQQLNSLPTMEGPPTFNPPVPYFGRLQGGLTARRTIIIKGYVPPTG KSFAINFKVGSSGDIALHINPRMGNGTVVRNSLLNGSWGSEEKKITHNPFGPGQFFDL SIRCGLDRFKVYANGQHLFDFAHRLSAFQRVDTLEIQGDVTLSYVQI" BASE COUNT 247 a 337 c 290 g 239 t ORIGIN 1 atctcccact cctgcagctc ttctcacagg accagccact agcgcagcct cgagcgatgg 61 cctatgtccc cgcaccgggc taccagccca cctacaaccc gacgctgcct tactaccagc 121 ccatcccggg cgggctcaac gtgggaatgt ctgtttacat ccaaggagtg gccagcgagc 181 acatgaagcg gttcttcgtg aactttgtgg ttgggcagga tccgggctca gacgtcgcct 241 tccacttcaa tccgcggttt gacggctggg acaaggtggt cttcaacacg ttgcagggcg 301 ggaagtgggg cagcgaggag aggaagagga gcatgccctt caaaaagggt gccgcctttg 361 agctggtctt catagtcctg gctgagcact acaaggtggt ggtaaatgga aatcccttct 421 atgagtacgg gcaccggctt cccctacaga tggtcaccca cctgcaagtg gatggggatc 481 tgcaacttca atcaatcaac ttcatcggag gccagcccct ccggccccag ggacccccga 541 tgatgccacc ttaccctggt cccggacatt gccatcaaca gctgaacagc ctgcccacca 601 tggaaggacc cccaaccttc aacccgcctg tgccatattt cgggaggctg caaggagggc 661 tcacagctcg aagaaccatc atcatcaagg gctatgtgcc tcccacaggc aagagctttg 721 ctatcaactt caaggtgggc tcctcagggg acatagctct gcacattaat ccccgcatgg 781 gcaacggtac cgtggtccgg aacagccttc tgaatggctc gtggggatcc gaggagaaga 841 agatcaccca caacccattt ggtcccggac agttctttga tctgtccatt cgctgtggct 901 tggatcgctt caaggtttac gccaatggcc agcacctctt tgactttgcc catcgcctct 961 cggccttcca gagggtggac acattggaaa tccagggtga tgtcaccttg tcctatgtcc 1021 agatctaatc tattcctggg gccataactc atgggaaaac agaattatcc cctaggactc 1081 ctttctaagc ccctaataaa atgtctgagg gtg // LOCUS AB006782 1725 bp mRNA PRI 09-SEP-1997 DEFINITION Homo sapiens mRNA for galectin-9 isoform, complete cds. ACCESSION AB006782 NID g2385455 KEYWORDS galectin-9 isoform. SOURCE Homo sapiens gastric adenocarcinoma cDNA to mRNA, clone:HP01461. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1725) AUTHORS Kato,S. TITLE Human galectin-9 isoform full-length cDNA from gastric adenocarcinoma JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 1725) AUTHORS Kato,S. TITLE Direct Submission JOURNAL Submitted (27-AUG-1997) to the DDBJ/EMBL/GenBank databases. Seishi Kato, Sagami Chemical Research Center; 4-4-1 Nishi-ohnuma, Sagamihara, Kanagawa 229, Japan (E-mail:seishi@sagami.or.jp, Tel:+81-427-42-5091, Fax:+81-427-42-5091) FEATURES Location/Qualifiers source 1..1725 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HP01461" /tissue_type="gastric adenocarcinoma" CDS 82..1149 /codon_start=1 /product="galectin-9 isoform" /db_xref="PID:d1023026" /db_xref="PID:g2385456" /translation="MAFSGSQAPYLSPAVPFSGTIQGGLQDGLQITVNGTVLSSSGTR FAVNFQTGFSGNDIAFHFNPRFEDGGYVVCNTRQNGSWGPEERKTHMPFQKGMPFDLC FLVQSSDFKVMVNGILFVQYFHRVPFHRVDTISVNGSVQLSYISFQNPRTVPVQPAFS TVPFSQPVCFPPRPRGRRQKPPGVWPANPAPITQTVIHTVQSAPGQMFSTPAIPPMMY PHPAYPMPFITTILGGLYPSKSILLSGTVLPSAQRFHINLCSGNHIAFHLNPRFDENA VVRNTQIDNSWGSEERSLPRKMPFVRGQSFSVWILCEAHCLKVAVDGQHLFEYYHRLR NLPTINRLEVGGDIQLTHVQT" BASE COUNT 337 a 566 c 445 g 377 t ORIGIN 1 tttctttgtt aagtcgttcc ctctacaaag gacttcctag tgggtgtgaa aggcagcggt 61 ggccacagag gcggcggaga gatggccttc agcggttccc aggctcccta cctgagtcca 121 gctgtcccct tttctgggac tattcaagga ggtctccagg acggacttca gatcactgtc 181 aatgggaccg ttctcagctc cagtggaacc aggtttgctg tgaactttca gactggcttc 241 agtggaaatg acattgcctt ccacttcaac cctcggtttg aagatggagg gtacgtggtg 301 tgcaacacga ggcagaacgg aagctggggg cccgaggaga ggaagacaca catgcctttc 361 cagaagggga tgccctttga cctctgcttc ctggtgcaga gctcagattt caaggtgatg 421 gtgaacggga tcctcttcgt gcagtacttc caccgcgtgc ccttccaccg tgtggacacc 481 atctccgtca atggctctgt gcagctgtcc tacatcagct tccagaaccc ccgcacagtc 541 cctgttcagc ctgccttctc cacggtgccg ttctcccagc ctgtctgttt cccacccagg 601 cccagggggc gcagacaaaa acctcccggc gtgtggcctg ccaacccggc tcccattacc 661 cagacagtca tccacacagt gcagagcgcc cctggacaga tgttctctac tcccgccatc 721 ccacctatga tgtaccccca ccccgcctat ccgatgcctt tcatcaccac cattctggga 781 gggctgtacc catccaagtc catcctcctg tcaggcactg tcctgcccag tgctcagagg 841 ttccacatca acctgtgctc tgggaaccac atcgccttcc acctgaaccc ccgttttgat 901 gagaatgctg tggtccgcaa cacccagatc gacaactcct gggggtctga ggagcgaagt 961 ctgccccgaa aaatgccctt cgtccgtggc cagagcttct cagtgtggat cttgtgtgaa 1021 gctcactgcc tcaaggtggc cgtggatggt cagcacctgt ttgaatacta ccatcgcctg 1081 aggaacctgc ccaccatcaa cagactggaa gtggggggcg acatccagct gacccatgtg 1141 cagacatagg cggcttcctg gccctggggc cgggggctgg ggtgtggggc agtctgggtc 1201 ctctcatcat ccccacttcc caggcccagc ctttccaacc ctgcctggga tctgggcttt 1261 aatgcagagg ccatgtcctt gtctggtcct gcttctggct acagccaccc tggaacggag 1321 aaggcagctg acggggattg ccttcctcag ccgcagcagc acctggggct ccagctgctg 1381 gaatcctacc atcccaggag gcaggcacag ccagggagag gggaggagtg ggcagtgaag 1441 atgaagcccc atgctcagtc ccctcccatc ccccacgcag ctccacccca gtcccaagcc 1501 accagctgtc tgctcctggt gggaggtggc ctcctcagcc cctcctctct gacctttaac 1561 ctcactctca ccttgcaccg tgcaccaacc cttcacccct cctggaaagc aggcctgatg 1621 gcttcccact ggcctccacc acctgaccag agtgttctct tcagaggact ggctcctttc 1681 ccagtgtcct taaaataaag aaatgaaaat gcttgttggc acatt // LOCUS AB006965 2449 bp mRNA PRI 09-SEP-1997 DEFINITION Homo sapiens mRNA for Dnm1p/Vps1p-like protein, complete cds. ACCESSION AB006965 NID g2385511 KEYWORDS Dnm1p/Vps1p-like protein; DVLP. SOURCE Homo sapiens cell_line:HepG2 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Shin,H.W., Shinotsuka,C., Torii,S., Murakami,K. and Nakayama,K. TITLE Identification and subcellular localization of a novel mammalian dynamin-related protein homologous to yeast Vps1p and Dnm1p JOURNAL J. Biochem. 122 (3), 525-530 (1997) MEDLINE 98006302 REFERENCE 2 (bases 1 to 2449) AUTHORS Nakayama,K. TITLE Direct Submission JOURNAL Submitted (01-SEP-1997) to the DDBJ/EMBL/GenBank databases. Kazuhisa Nakayama, University of Tsukuba, Institute of Biological Sciences; 1-1-1 Tennohdai, Tsukuba, Ibaraki 305, Japan (E-mail:kazunaka@sakura.cc.tsukuba.ac.jp, Tel:+81-298-53-6005, Fax:+81-298-53-6006) FEATURES Location/Qualifiers source 1..2449 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HepG2" CDS 36..2246 /note="DVLP" /codon_start=1 /product="Dnm1p/Vps1p-like protein" /db_xref="PID:d1023053" /db_xref="PID:g2385512" /translation="MEALIPVINKLQDVFNTVGADIIQLPQIVVVGTQSSGKSSVLES LVGRDLLPRGTGIVTRRPLILQLVHVSQEDKRKTTGEENGVEAEEWGKFLHTKNKLYT DFDEIRQEIENETERISGNNKGVSPEPIHLKIFSPNVVNLTLVDLPGMTKVPVGDQPK DIELQIRELILRFISNPNSIILAVTAANTDMATSEALKISREVDPDGRRTLAVITKLD LMDAGTDAMDVLMGRVIPVKLGIIGVVNRSQLDINNKKSVTDSIRDEYAFLQKKYPSL ANRNGTKYLARTLNRLLMHHIRDCLPELKTRINVLAAQYQSLLNSYGEPVDDKSATLL QLITKFATEYCNTIEGTAKYIETSELCGGARICYIFHETFGRTLESVDPLGGLNTIDI LTAIRNATGPRPALFVPEVSFELLVKRQIKRLEEPSLRCVELVHEEMQRIIQHCSNYS TQELLRFPKLHDAIVEVVTCLLRKRLPVTNEMVHNLVAIELAYINTKHPDFADACGLM NNNIEEQRRNRLARELPSAVSRDKSSKVPSALAPASQEPSPAASAEADGKLIQDSRRE TKNVASGGGGVGDGVQEPTTGNWRGMLKTSKAEELLAEEKSKPIPIMPASPQKGHAVN LLDVPVPVARKLSAREQRDCEVIERLIKSYFLIVRKNIQDSVPKAVMHFLVNHVKDTL QSELVGQLYKSSLLDDLLTESEDMAQRRKEAADMLKALQGASQIIAEIRETHLW" BASE COUNT 767 a 481 c 552 g 649 t ORIGIN 1 tccggcgggc actggggccc cgtgttttca gagtcatgga ggcgctaatt cctgtcataa 61 acaagctcca ggacgtcttc aacacggtgg gcgccgacat catccagctg cctcaaatcg 121 tcgtagtggg aacgcagagc agcggaaaga gctcagtgct agaaagcctg gtggggaggg 181 acctgcttcc cagaggtact ggaattgtca cccggagacc tctcattctg caactggtcc 241 atgtttcaca agaagataaa cggaaaacaa caggagaaga aaatggggtg gaagcagaag 301 aatggggtaa atttcttcac accaaaaata agctttacac ggattttgat gaaattcgac 361 aagaaattga aaatgaaaca gaaagaattt caggaaataa taagggagta agccctgaac 421 caattcatct taagattttt tcacccaacg ttgtcaattt gacacttgtg gatttgccag 481 gaatgaccaa ggtgcctgta ggtgatcaac ctaaggatat tgagcttcaa atcagagagc 541 tcattcttcg gttcatcagt aatcctaatt ccattatcct cgctgtcact gctgctaata 601 cagatatggc aacatcagag gcacttaaaa tttcaagaga ggtagatcca gatggtcgca 661 gaaccctagc tgtaatcact aaacttgatc tcatggatgc gggtactgat gccatggatg 721 tattgatggg aagggttatt ccagtcaaac ttggaataat tggagtagtt aacaggagcc 781 agctagatat taacaacaag aagagtgtaa ctgattcaat ccgtgatgag tatgcttttc 841 ttcaaaagaa atatccatct ctggccaata gaaatggaac aaagtatctt gctaggactc 901 taaacaggtt actgatgcat cacatcagag attgtttacc agagttgaaa acaagaataa 961 atgttctagc tgctcagtat cagtctcttc taaatagcta cggtgaaccc gtggatgata 1021 aaagtgctac tttactccaa cttattacca aatttgccac agaatattgt aacactattg 1081 aaggaactgc aaaatatatt gaaacttcgg agctatgcgg tggtgctaga atttgttata 1141 ttttccatga gacttttggg cgaaccttag aatctgttga tccacttggt ggccttaaca 1201 ctattgacat tttgactgcc attagaaatg ctactggtcc tcgtcctgct ttatttgtgc 1261 ctgaggtttc atttgagtta ctggtgaagc ggcaaatcaa acgtctagaa gagcccagcc 1321 tccgctgtgt ggaactggtt catgaggaaa tgcaaaggat cattcagcac tgtagcaatt 1381 acagtacaca ggaattgtta cgatttccta aacttcatga tgccatagtt gaagtggtga 1441 cttgtcttct tcgtaaaagg ttgcctgtta caaatgaaat ggtccataac ttagtggcaa 1501 ttgaactggc ttatatcaac acaaaacatc cagactttgc tgatgcttgt gggctaatga 1561 acaataatat agaggaacaa aggagaaaca ggctagccag agaattacct tcagctgtat 1621 cacgagacaa gtcttctaaa gttccaagtg ctttggcacc tgcctcccag gagccctccc 1681 ccgctgcttc tgctgaggct gatggcaagt taattcagga cagcagaaga gaaactaaaa 1741 atgttgcatc tggaggtggt ggggttggag atggtgttca agaaccaacc acaggcaact 1801 ggagaggaat gctgaaaact tcaaaagctg aagagttatt agcagaagaa aaatcaaaac 1861 ccattccaat tatgccagcc agtccacaaa aaggtcatgc cgtgaacctg ctagatgtgc 1921 cagttcctgt tgcacgaaaa ctatctgctc gggaacagcg agattgtgag gttattgaac 1981 gactcattaa atcatatttt ctcattgtca gaaagaatat tcaagacagt gtgccaaagg 2041 cagtaatgca ttttttggtt aatcatgtga aagacactct tcagagtgag ctagtaggcc 2101 agctgtataa atcatcctta ttggatgatc ttctgacaga atctgaggac atggcacagc 2161 gcaggaaaga agcagctgat atgctaaagg cattacaagg agccagtcaa attattgctg 2221 aaatccggga gactcatctt tggtgaagag aactatgtaa tactgagact ttgttgactc 2281 aaaacttgct agttactgcc tacctgagta gaatcttatt tatgaactcc tgtgtattgc 2341 aatggtatga atctgctcat gtggagactg gctataaact gaaaagtgta ttccaaattg 2401 cagaacacat cacacattta atccaaataa taaatggctg tttctaaag // LOCUS AB006968 1608 bp mRNA PRI 17-SEP-1997 DEFINITION Homo sapiens mRNA for CIS4, complete cds. ACCESSION AB006968 NID g2463524 KEYWORDS CIS3; CIS4. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Masuhara,M., Sakamoto,H., Matsumoto,A., Suzuki,R., Yasukawa,H., Mitsui,K., Wakioka,T., Tanimura,S., Sasaki,A., Misawa,H., Ohtsubo,M. and Yoshimura,A. TITLE Cloning of CIS family genes and characterization of a new inhibitor of cytokine signaling, CIS3 JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 1608) AUTHORS Matsumoto,A. TITLE Direct Submission JOURNAL Submitted (01-SEP-1997) to the DDBJ/EMBL/GenBank databases. Akira Matsumoto, Institute of Life Science, Kurume University, Molecular Genetics; Aikawa-machi 2432-3, Kurume, Fukuoka 839, Japan (E-mail:matumoto@lsi.kurume-u.ac.jp, Tel:+81-942-37-6313, Fax:+81-942-31-5212) FEATURES Location/Qualifiers source 1..1608 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1..1608 /codon_start=1 /product="CIS4" /db_xref="PID:d1023406" /db_xref="PID:g2463525" /translation="MKKISLKTLRKSFNLNKSKEETDFMVVQQPSLASDFGKDDSLFG SCYGKDMASCDINGEDEKGGKNRSKSESLIGTLKRRLSAKQKSKGKAGTPSGSSADED TFSSSSAPIVFKDVRAQRPIRSTSLRSHHYSPAPWPLRPTNSEETCIKMEVRVKALVH TSSPSPALNGVRKDFHDLQSETTCQEQANSLKSSASHNGDLHLHLDEHVPVVIGLMPQ DYIQYTVPLDEGMYPLEGSRSYCLDSSSPMEVSAVPPQVGGRAFPEDESQVDQDLVVA PEIFVDQSVNGLLIGTTGVMLQSPRAGHDDVPPLSPLLPPMQNNQIQRNFSGLTGTEA HVAESMRCHLNFDPNSAPGVARVYDSVQSSGPMVVTSLTEELKKLAKQGWYWGPITRW EAEGKLANVPDGSFLVRDSSDDRYLLSLSFRSHGKTLHTRIEHSNGRFSFYEQPDVEG HTSIVDLIEHSIRDSENGAFCYSRSRLPGSATYPVRLTNPVSRFMQVRSLQYLCRFVI RQYTRIDLIQKLPLPNKMKDYLQEKHY" BASE COUNT 436 a 384 c 401 g 387 t ORIGIN 1 atgaagaaaa ttagtcttaa aaccttacgg aaatctttta acttgaataa aagtaaagaa 61 gaaactgatt tcatggtagt acaacaacca tcgctagcca gtgactttgg aaaagatgat 121 tccttatttg gtagctgcta tggtaaagat atggccagct gcgatatcaa cggtgaagat 181 gaaaaaggcg gaaaaaacag atcgaaaagc gagagcctga taggtacgct aaaaaggcgg 241 ctttctgcaa aacagaagtc aaaaggcaag gcgggcacac cctctgggag ctctgccgac 301 gaggacacct tctcctcctc ctcagcaccc atagtcttta aagacgtgag agctcagagg 361 ccaataaggt ccacgtcgct ccgcagccat cactacagtc ccgcgccgtg gcctctgcgg 421 cccacaaact ccgaggagac ctgcatcaag atggaggtga gagtcaaggc cttggttcac 481 acttccagcc cgagtccagc cctgaatggc gtccggaagg atttccacga cctccagtct 541 gagaccacgt gccaggagca agccaattca ctgaagagct cggcttctca taatggagac 601 ctgcatcttc acctggatga acatgtgcct gtcgttattg gacttatgcc tcaggactac 661 attcagtata ctgtgccttt agatgagggg atgtatcctt tggaaggatc acggagctat 721 tgtctggaca gctcttctcc catggaagtc tctgcggttc ctcctcaagt gggagggcgt 781 gctttccccg aggatgagag tcaggtagac caggacctag ttgtcgcccc agagatcttc 841 gtggatcagt ccgtgaatgg cttgttgatt ggcaccacgg gagtcatgtt gcagagcccg 901 agagcgggtc acgatgatgt ccctccactc tcaccattgc tacctccaat gcagaataat 961 caaatccaaa ggaacttcag tggactcact ggcacagaag cccacgtggc tgaaagtatg 1021 cgctgtcatt tgaattttga tccgaactct gctcctgggg ttgcaagagt ttatgactca 1081 gtgcaaagta gtggtcccat ggttgtgaca agccttacag aggagctgaa aaaacttgca 1141 aagcaaggat ggtactgggg accaatcaca cgttgggagg cagaagggaa gctagcaaac 1201 gtgccagatg gttcttttct tgttcgggac agttctgacg accgttacct tttaagcttg 1261 agctttcgct cccatggtaa aacacttcac actagaattg agcactcaaa tggtaggttt 1321 agcttttatg aacagccaga tgtggaagga catacgtcca tagttgatct aattgagcat 1381 tcaatcaggg actctgaaaa tggagctttt tgttattcaa ggtctcggct gcctggatct 1441 gcaacttacc ccgtcagact gaccaaccca gtgtcccggt tcatgcaggt gcgctcgctg 1501 cagtacctgt gtcgttttgt tatacgtcag tataccagaa tagacttaat tcagaaactg 1561 cctttgccaa acaaaatgaa ggattattta caggagaagc actactga // LOCUS AB006969 1960 bp mRNA PRI 19-DEC-1997 DEFINITION Homo sapiens hGAA1 mRNA, complete cds. ACCESSION AB006969 NID g2706631 KEYWORDS hGAA1. SOURCE Homo sapiens fetal heart cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Hiroi,Y., Komuro,I., Chen,R., Hosoda,T., Mizuno,T., Kudoh,S., Georgescu,S.P., Medof,M.E. and Yazaki,Y. TITLE Molecular clonning of a human homolog of yeast GAA1 which is required for the attachment of glycosylphosphatidylinositols to proproteins JOURNAL FEBS Lett. (1997) In press REFERENCE 2 (bases 1 to 1960) AUTHORS Hiroi,Y. TITLE Direct Submission JOURNAL Submitted (02-SEP-1997) to the DDBJ/EMBL/GenBank databases. Yukio Hiroi, University of Tokyo School of Medicine, Department of Medicine III; 7-3-1 Hongo, Bunkyo, Tokyo 113, Japan (E-mail:hiroiy-tky@umin.ac.jp, Tel:81-3-3815-5411, Fax:81-3-3815-2087) FEATURES Location/Qualifiers source 1..1960 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="2" /dev_stage="fetal" /map="2q11-13" /tissue_type="heart" gene 9..1874 /gene="hGAA1" CDS 9..1874 /gene="hGAA1" /codon_start=1 /db_xref="PID:d1024941" /db_xref="PID:g2706632" /translation="MGLLSDPVRRRALARLVLRLNAPLCVLSYVAGIAWFLALVFPPL TQRTYMSENAMGSTMVEEQFAGGDRARAFARDFAAHRKKSGALPVAWLERTMRSVGLE VYTQSFSRKLPFPDETHERYMVSGTNVYGILRAPRAASTESLVLTVPCGSDSTNSQAV GLLLALAAHFRGQIYWAKDIVFLVTEHDLLGTEAWLEAYHDVNVTGMQSSPLQGRAGA IQAAVALELSSDVVTSLDVAVEGLNGQLPNLDLLNLFQTFCQKGGLLCTLQGKLQPED WTSLDGPLQGLQTLLLMVLRQASGRPHGSHGLFLRYRVEALTLRGINSFRQYKYDLVA VGKALEGMFRKLNHLLERLHQSFFLYLLPGLSRFVSIGLYMPAVGFLLLVLGLKALEL WMQLHEAGMGLEEPGGAPGPSVPLPPSQGVGLASLVAPLLISQAMGLALYVLPVLGQH VATQHFPVAEAEAVVLTLLAIYAAGLALPHNTHRVVSTQAPDRGWMALKLVALIYLAL QLGCIALTNFSLGFLLATTMVPTAALAKPHGPRTLYAALLVLTSPAATLLGSLFLWRE LQEAPLSLAEGWQLFLAALAQGVLEHHTYGALLFPLLSLGLYPCWLLFWNVLFWK" polyA_site 1960 /note="71 a nucleotides" BASE COUNT 296 a 657 c 592 g 415 t ORIGIN 1 gccccgccat gggcctcctg tcggacccgg ttcgccggcg cgcgctcgcc cgcctagtgc 61 tgcgcctcaa cgcgccgttg tgcgtgctga gctacgtggc gggcatcgcc tggttcttgg 121 cgctggtttt cccgccgctg acccagcgca cttacatgtc ggagaacgcc atgggctcca 181 ccatggtgga ggagcagttt gcgggcggag accgtgcccg ggcttttgcc cgggacttcg 241 ccgcccaccg caagaagtcg ggggctctgc cagtggcctg gcttgaacgg acgatgcggt 301 cagtagggct ggaggtctac acgcagagtt tctcccggaa actgcccttc ccagatgaga 361 cccacgagcg ctatatggtg tcgggcacca acgtgtacgg catcctgcgg gccccgcgtg 421 ctgccagcac cgagtcgctt gtgctcaccg tgccctgtgg ctctgactct accaacagcc 481 aggctgtggg gctgctgctg gcactggctg cccacttccg ggggcagatt tattgggcca 541 aagatatcgt cttcctggta acagaacatg accttctggg cactgaggct tggcttgaag 601 cctaccacga tgtcaatgtc actggcatgc agtcgtctcc cctgcagggc cgagctgggg 661 ccattcaggc agccgtggcc ctggagctga gcagtgatgt ggtcaccagc ctcgatgtgg 721 ccgtggaggg gcttaacggg cagctgccca accttgacct gctcaatctc ttccagacct 781 tctgccagaa agggggcctg ttgtgcacgc ttcagggcaa gctgcagccc gaggactgga 841 catcattgga tggaccgctg cagggcctgc agacactgct gctcatggtt ctgcggcagg 901 cctccggccg cccccacggc tcccatggcc tcttcctgcg ctaccgtgtg gaggccctaa 961 ccctgcgtgg catcaatagc ttccgccagt acaagtatga cctggtggca gtgggcaagg 1021 ctttggaggg catgttccgc aagctcaacc acctcctgga gcgcctgcac cagtccttct 1081 tcctctactt gctccccggc ctctcccgct tcgtctccat cggcctctac atgcccgctg 1141 tcggcttctt gctcctggtc cttggtctca aggctctgga actgtggatg cagctgcatg 1201 aggctggaat gggccttgag gagcccgggg gtgcccctgg ccccagtgta ccccttcccc 1261 catcacaggg tgtggggctg gcctcgctcg tggcacctct gctgatctca caggccatgg 1321 gactggccct ctatgtcctg ccagtgctgg gccaacacgt tgccacccag cacttcccag 1381 tggcagaggc tgaggctgtg gtgctgacac tgctggcgat ttatgcagct ggcctggccc 1441 tgccccacaa tacccaccgg gtggtaagca cacaggcccc agacaggggc tggatggcac 1501 tgaagctggt agccctgatc tacctagcac tgcagctggg ctgcatcgcc ctcaccaact 1561 tctcactggg cttcctgctg gccaccacca tggtgcccac tgctgcgctt gccaagcctc 1621 atgggccccg gaccctttat gctgccctgc tggtgctgac cagcccggca gccacgctcc 1681 ttggcagcct gttcctgtgg cgggagctgc aggaggcgcc actgtcactg gccgagggct 1741 ggcagctctt cctggcagcg ctagcccagg gtgtgctgga gcaccacacc tacggcgccc 1801 tgctcttccc actgctgtcc ctgggcctct acccctgctg gctgcttttc tggaatgtgc 1861 tcttctggaa gtgagatctg cctgtccggg ctgggacaga gactccccaa ggaccccatt 1921 ctgcctcctt ctggggaaat aaatgagtgt ctgtttcagc // LOCUS AB007042 6172 bp mRNA PRI 27-DEC-1997 DEFINITION Homo sapiens EXTR1 mRNA, complete cds. ACCESSION AB007042 NID g2723390 KEYWORDS EXTR1. SOURCE Homo sapiens testis and total fetus cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Saito,T., Yamauchi,M., Hayashi,A., Kozuma,S. and Hori,T. TITLE Identification, chromosome assignment and expression profile of a novel EXT1-related gene, EXTR JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 6172) AUTHORS Saito,T. TITLE Direct Submission JOURNAL Submitted (06-SEP-1997) to the DDBJ/EMBL/GenBank databases. Toshiyuki Saito, National Institute of Radiological Sciences, Genome Research Group; Anagawa 4-9-1, Inage, Chiba 263, Japan (E-mail:t_saito@nirs.go.jp, Tel:043-206-3135, Fax:043-251-9818) COMMENT Sequence updated (19-Sep-1997). FEATURES Location/Qualifiers source 1..6172 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="8" /map="8p" /tissue_type="testis and total fetus" gene 594..3353 /gene="EXTR1" CDS 594..3353 /gene="EXTR1" /codon_start=1 /product="EXTR1" /db_xref="PID:d1024987" /db_xref="PID:g2723391" /translation="MTGYTMLRNGGAGNGGQTCMLRWSNRIRLTWLSFTLFVILVFFP LIAHYYLTTLDEADEAGKRIFGPRVGNELCEVKHVLDLCRIRESVSEELLQLEAKRQE LNSEIAKLNLKIEACKKSIENAKQDLLQLKNVISQTEHSYKELMAQNQPKLSLPIRLL PEKDDAGLPPPKATRGCRLHNCFDYSRCPLTSGFPVYVYDSDQFVFGSYLDPLVKQAF QATARANVYVTENADIACLYVILVGEMQEPVVLRPAELEKQLYSLPHWRTDGHNHVII NLSRKSDTQNLLYNVSTGRAMVAQSTFYTVQYRPGFDLVVSPLVHAMSEPNFMEIPPQ VPVKRKYLFTFQGEKIESLRSSLQEARSFEEEMEGDPPADYDDRIIATLKAVQDSKLD QVLVEFTCKNQPKPSLPTEWALCGEREDRLELLKLSTFALIITPGDPRLVISSGCATR LFEALEVGAVPVVLGEQVQLPYQDMLQWNEAALVVPKPRVTEVHFLLRSLSDSDLLAM RRQGRFLWETYFSTADSIFNTVLAMIRTRIQIPAAPIREEAAAEIPHRSGKAAGTDPN MADNGDLDLGPVETEPPYASPRYLRNFTLTVTDFYRSWNCAPGPFHLFPHTPFDPVLP SEAKFLGSGTGFRPIGGGAGGSGKEFQAALGGNVPREQFTVVMLTYEREEVLMNSLER LNGLPYLNKVVVVWNSPKLPSEDLLWPDIGVPIMVVRTEKNSLNNRFLPWNEIETEAI LSIDDDAHLRHDEIMFGFRVWREARDRIVGFPGRYHAWDIPHQSWLYNSNYSCELSMV LTGAAFFHKYYAYLYSYVMPQAIRDMVDEYINCEDIAMNFLVSHITRKPPIKVTSRWT FRCPGCPQALSHDDSHFHERHKCINFFVKVYGYMPLLYTQFRVDSVLFKTRLPHDKTK CFKFI" BASE COUNT 1280 a 1681 c 1692 g 1519 t ORIGIN 1 ggcgggtccc tgagctggaa gccggagagc aagccctgga ggttcactct ttcaagaagt 61 cgtgtgctga ggtgtaatgc tacacaagtc agaggaagga agggtcctga aacacatggc 121 ctgattgttg gcaaaggcat cataagaagc tggcatttat ttctgttcta acctattact 181 gtataactgt gaatagacac tatgcatatt tgttggtcag caaaaccaag aaacaagagc 241 tatggcattt gaaaaagtct gtctgattcc agggtgtttt tcctgggttt catcatcagg 301 tacctcctcc ctttcatctc agcaagaatg tggcaccttt tatcgtttga taaagattaa 361 ggacatgttc tttggtcaac agccagaact taaaatctgc tggaataggg tcagagacca 421 tttcagctgc agctgaggaa aatgaaatgt tcattttatt tggtgccttg tctggggagc 481 acactaactc ttctggaaac gtgtcagtga aacagagatc gttttgtgga atagcaaccc 541 atggttatgg cgagtgaccc gacgtgatct ggggggcagg ctgcagagga ctcatgacag 601 gctataccat gctgcggaat gggggcgcgg ggaacggagg tcagacctgc atgctgcgct 661 ggtccaaccg catccgcctc acgtggctca gcttcacgct ctttgtcatc ctggtcttct 721 tcccgctcat cgcccactat tacctcacca ctctggatga ggctgatgag gcaggcaagc 781 ggatttttgg tccccgggtg gggaacgagc tgtgcgaggt gaagcacgtg ctggatctgt 841 gccgcatccg ggagtcggtg agtgaagagc tcctgcagct ggaggccaag cgccaagagc 901 tgaacagcga gatcgccaag ctgaatctga agatcgaagc ctgtaagaag agcattgaga 961 acgccaagca ggacctgctc cagctcaaga atgtcatcag ccagaccgag cattcctaca 1021 aggagctcat ggcccagaac cagcccaagc tgtccctgcc catccgactg ctcccagaga 1081 aggacgatgc cggcctccct cccccgaagg ccactcgggg ctgccggcta cacaactgct 1141 ttgattattc tcgttgccct ctcacctctg gcttcccggt ctacgtctat gacagtgacc 1201 agtttgtctt tggcagctac ctggatccct tggtcaagca ggcttttcag gcgacagcac 1261 gagctaacgt ttatgttaca gaaaatgcag acatcgcctg cctttacgtg atactagtgg 1321 gagagatgca ggagcccgtg gtgctgcggc ctgctgagct ggagaagcag ttgtattccc 1381 tgccacactg gcggacggat ggacacaacc atgtcatcat caatctgtca cgtaagtcag 1441 atacacagaa ccttctctat aacgtcagta ctggccgtgc catggtggcc cagtccacct 1501 tctacactgt ccagtacaga cctggctttg acttggtcgt atcaccgctg gtccatgcca 1561 tgtctgagcc caacttcatg gaaatcccac cacaggtgcc ggtgaagcgg aaatatctct 1621 tcaccttcca gggcgagaag attgagtctc tgaggtctag ccttcaggag gcccgctcct 1681 tcgaagagga aatggagggc gaccctcccg ccgactacga tgaccggatc attgccaccc 1741 tgaaggcggt gcaggacagc aagctggatc aggtcctggt ggaattcacc tgcaaaaacc 1801 agcccaaacc cagcctgccg actgagtggg cactgtgtgg agagcgggag gaccgcttgg 1861 aattgctgaa gctctccacc ttcgccctca tcattacccc cggggaccct cgcttggtta 1921 tttcctctgg gtgtgcaaca cggctcttcg aagccctgga agtcggtgcc gtcccggtgg 1981 tgctggggga gcaggtccag cttccctacc aggacatgct gcagtggaac gaggcggccc 2041 tggtggtgcc aaagcctcgt gttaccgagg ttcatttcct gctcagaagc ctctccgata 2101 gtgacctcct ggctatgagg cggcaaggcc gctttctctg ggagacttac ttctccactg 2161 ctgacagtat ttttaatacc gtgctggcta tgattaggac tcgcatccag atcccagccg 2221 ctcccatccg ggaagaggcg gcagctgaga tcccccaccg ttcaggcaag gcggctggaa 2281 ctgaccccaa catggctgac aacggggacc tggacctggg gccagtggag acggagccgc 2341 cctacgcctc acccagatac ctccgcaatt tcactctgac tgtcactgac ttttaccgca 2401 gctggaactg tgctccaggg cctttccatc ttttccccca cactcccttt gaccctgtgt 2461 tgccctcaga ggccaaattc ttgggctcag ggactggctt tcggcctatt ggtggtggag 2521 ctgggggttc tggcaaggaa tttcaggcag cgcttggagg caatgttccc cgagagcagt 2581 tcacggtggt gatgttgact tatgagcggg aggaagtgct tatgaactct ttagagaggc 2641 tgaatggcct cccttacctg aacaaggtcg tggtggtgtg gaattctccc aagctgccat 2701 cagaggacct tctgtggcct gacattggcg ttcccatcat ggtggtccgt actgagaaga 2761 acagtttgaa caaccgattc ttaccctgga atgaaattga gacagaggcc atcctgtcca 2821 ttgatgacga tgctcacctc cgccatgacg aaatcatgtt tgggttccgg gtgtggagag 2881 aagctcggga ccgcatcgtg ggcttccctg gccgttacca cgcatgggac atcccccatc 2941 agtcctggct ctacaactcc aactactcct gtgagctgtc catggtgctg acaggtgctg 3001 ccttctttca caagtattat gcctacctgt attcttatgt gatgccccag gccatccggg 3061 acatggtgga tgaatacatc aactgtgagg acattgccat gaacttcctt gtctcccaca 3121 tcactcggaa gccccccatc aaggtgacct cacggtggac attccgatgc ccaggatgcc 3181 ctcaggccct gtctcatgat gactcccact tccacgagcg gcacaagtgc atcaacttct 3241 tcgtgaaggt gtacggctac atgcccctcc tgtacacgca gttcagggtg gattctgtgc 3301 tcttcaagac acgcctgccc catgacaaga ccaagtgctt caagttcatc taggggcagc 3361 gcacggtctg gggaagagga tgagcagagg gaggaagatg gctcccaagg ttcctaggca 3421 ttgcaggacc ttgggcacat ctgctggtgg gtggcccaga gcctctgctg gaaggggcag 3481 caggaggagt ggaaggaaac cgctgccttt atcttgaagt cagccacact gggcctggag 3541 ccctgggcgg agtccccggg gttccccaca cagggcactg actgatagct tacactgagg 3601 actgtggcga ctctgcagag tcactcacac cgttcgtacg cccaggacag ctggttcgtg 3661 gtttttacat tcaataacaa ctattatgat tatttaaaaa gagaaagttt cagatttgcc 3721 attcaaggct tatttatata tatgtgtgtg tatataaata catgcacaca cttgcataca 3781 tatatatttt tggctggggg agtgtgagtt ttgcctttct aagggaggga ccgcgcaggc 3841 tcctttgttc tgtattctgg cggagatggg tcctggcctt gtgtcactgg cttatcctta 3901 aagatcatct cccatcctcc ccagcgccat ctgtgtgcag caaccagaaa gggatgaact 3961 tggccctctt gcgggcctgg acaaggtctc ttccttaccc tttctgttgc cagtcagcaa 4021 cctgtaactc acattctctt cccagtgaat ccctgggagc gcctgaccct ggtgggctgt 4081 tcagcttcct gctgctgggg ccagcgattt ttgaggattt atctttaggc caggcttgcc 4141 tccgtactta tccctgctct cccatttctc tcttgtttga gagagaatga ggaagcaaag 4201 agtgagaaag aataggggct gaagacgcca ctcccagatg gctctttcta tcctgctctt 4261 ctgttgaaac acacgtgctg tgggcctcag gcgtttctga agtgctcttt cttggattgg 4321 acaggagatc agcagcgtgc acatctgctg tggtctgaag tggtttgcag gtcagcctcc 4381 tctccctagt gtagagcaag ccagtgtcct tcgaggaacc cacccggctg gccgggaagt 4441 tttacagcaa ggcgcctgcc ttgggataat tccttggtga aattcacctt ccccccgcct 4501 ctgtctggag ccccatcctg tgttatctgt ggtttttgga cccctaatgt cagcttggct 4561 gtaggactcc ccgaggtttg gtatgtgcta gaacaatggg aggctgtgat ttgctgtgta 4621 agctcacatc cagccttgga atctaacggg cattcacaac ccgagttacc actttccact 4681 ccctgcttag gattctgttc cctgggctga aactgaaata agctaatttt ttgggtcacg 4741 gtggcagtag gggaacctag gagggtgtga gtggcatttg tcagggattt agcccatgac 4801 gtgtttcttg aaccctactt tctggaagtg gagttgactc tggaagtttt ctagcaactg 4861 aacaaaagct caggtttgtc ctggtcatgc acatgcctta agccagttcc gtcttcccta 4921 gaccttggca tcctgtgctt ctatttcttg gaatacgttc tcctctgacc tgcctgtacc 4981 acgtgggtcc tcttcaagta ctgttttgaa gctgggctct tttgtgtagc tcccacccac 5041 ctgtagggct agctcggctt aagggaactc tccccattgg caaaccggac ccggccgccg 5101 ccaggactgt gtttccaaag gttccccgcc cccaacccca gcatcagcct gtagctcccc 5161 tgctgaggca gtgtggttat gttcccagca gtgggggtca gacgcccttc ctcagaactt 5221 tctagttgcc ctctacctga ctcctgactt gtattccttt tagcagtagc cttcttccct 5281 cggggagcca aagagtgtgg tgtgtggcgc tatattgtgg ctgctatttc atctggtttc 5341 ttttaatgtg aggaactcac atactgactt cagtgggact cggtgagccg gggccgtctg 5401 tgtggtggga ccccctttag cgggactcag tgagctgggg ccgtctgtgt ggtggagcca 5461 gggcctctcc ctttagtgga gccaggttgt cgggccccga atgtcactgg tggatctaag 5521 aagggctgag tggtctgaca ccaaaacatg ccgcagggag ggctgtggtg ccggtgcttc 5581 caacaaggac agccctcctt gaccctgaaa ggaacactgg cttgaaggac tgcagacagg 5641 ctctgagggg cacgccctcc tcagcgagag gcagcaaggt ggccacagtg tcactggtca 5701 ggtgcttctc accacgggaa agccgccgac ctgtgactcg cttgagatgg gaaagcggcg 5761 ccacagaccc cgggtctcct tggctgtctg tgggccgccc ctggccacct tgtcctggct 5821 cgcagggtgc aggagcgcct cgttctctgg gtggccggct tgctgctccg gtttgggctg 5881 tcttaccata acaccgtccc agggctctgc aggccactgt gagcgctggc tccctgggca 5941 gtgctcctcc gtgtggactg tgcctcaggc cagggctcac cagctggggt cctgtccgga 6001 aggatgggat ctttctggga gctgcgccgg acagagtggg gagctcctag tttgtggggg 6061 gaagctttga tatccatgcc acgtccatcc accccacccc ttttcgtcac gagcacaatg 6121 gtcttacatt ggatttttgt aaaaaaataa aaataaatgg agactttaac tc // LOCUS AB007191 1492 bp mRNA PRI 19-SEP-1997 DEFINITION Homo sapiens mRNA for AMY-1, complete cds. ACCESSION AB007191 NID g2443309 KEYWORDS AMY-1. SOURCE Homo sapiens placenta tissue_lib:lamda gt11 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1492) AUTHORS Ariga,H. TITLE AMY-1L, an alternative splicing form of AMY-1S, a novel C-MYC binding protein JOURNAL Published Only in DataBase (1997) In press REFERENCE 2 (bases 1 to 1492) AUTHORS Ariga,H. TITLE Direct Submission JOURNAL Submitted (10-SEP-1997) to the DDBJ/EMBL/GenBank databases. Hiroyoshi Ariga, Faculty of Pharmaceutical Sciences, Hokkaido University, Molecular Biology; Kita 12, Nishi 6, Kita-ku, Sapporo, Hokkaido 060, Japan (E-mail:hiro@pharm.hokudai.ac.jp, Tel:81-11-706-3745, Fax:81-11-706-4988) FEATURES Location/Qualifiers source 1..1492 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1p32.2-p33" /tissue_lib="lamda gt11" /tissue_type="placenta" gene 39..350 /gene="amy-1L" CDS 39..350 /gene="amy-1L" /codon_start=1 /product="AMY-1" /db_xref="PID:d1023271" /db_xref="PID:g2443310" /translation="MAHYKAADSKREQFRRYLEKSGVLDTLTKVLVALYEEPEKPNSA LDFLKHHLGAATPENPEIELVRLELAEMKEKYEAIVEENKKLKAKLAQYEPPQEEKRA E" BASE COUNT 432 a 294 c 317 g 449 t ORIGIN 1 gaattcgggg cgccagctac gccgctgccg ctgtcactat ggcccattac aaagccgccg 61 actcgaagcg tgagcagttc cggaggtact tggagaagtc gggggtgctg gacacgctga 121 ccaaggtgtt ggtagcctta tatgaagaac cagagaaacc taacagtgct ttggattttt 181 taaagcatca cttaggagct gctactccag aaaatccaga aatagagctc gttcgcctag 241 aactggccga aatgaaagag aagtatgaag ctattgtaga agaaaataaa aaactgaaag 301 caaagcttgc tcagtatgaa ccacctcagg aggagaagcg tgctgaatag gattcttctc 361 agtttgaaag acaatgaaaa atggttttgt atgacttgaa tagtttgtat agtatataat 421 cttttctgaa cagatgctat agaactcttt taatatgttt aattcaccta tcacactctg 481 ttaaaaacac atagaatcat caataaaaac tcaatataac tttctttggg tcttaaagca 541 ggagaatcca aagtaaatcc tgaacaaaac ctaaacacag ccatctaact cattacctta 601 aaagacattc tgtttattag tctgattagg aatgatggca ctggttgtat tttagccaag 661 acagtttagc atggagctat tccttggtgc agttcaggat atgaacacag gtacagtcat 721 tctttgaacg gtgacactgt tctgtatatt ccctataggc agctggagag atctgtgtga 781 cacaagatgc ttttgtacgg gttcccatga atcttctgct cttgtttgtg tgacatgaaa 841 caaataactt ctttgccacc actttgcctt agataactgt gtgtgtgtgt gccagtttga 901 actctgacac cacattttcc ttctatgcaa tcatgcctgt ctgataatct tgcattgctt 961 tcctctgagc tttagtgggt cctagtgcac aatggccttt ctgtgctgtt tttcaatttg 1021 cctaataata gcagttaccc tgattgtaat ttatgtaact ttaaacagga tcacactgta 1081 ccccctgcct gccttatttg cttactgagc acaggacaga ggcaatatac aactctgggt 1141 tcacacacaa gctgagatga gaagaggaat gagccatata ttggggaaaa tcatagtttg 1201 taggtataat tatatagtgc ttttctccct caaagtattt ttctagcctt gaattcattt 1261 tatcttcatt atccctgtga agtaggtggg acaagtataa ggggaagagg ggtgctgaat 1321 ttttaggcca aagactgata ttaatacaaa tcactcacta actgtagagc cttgggcatt 1381 atcagtgaac tactctgaga tttactgtct tcatctgttt aatgagtaga atgtccgtga 1441 tgcctacctc acagggttgt tgtgagggtc acccgaattc gggcccgaat tc // LOCUS AB007447 2618 bp mRNA PRI 01-OCT-1997 DEFINITION Homo sapiens mRNA for Fln29, complete cds. ACCESSION AB007447 NID g2463530 KEYWORDS Fln29. SOURCE Homo sapiens fetal brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2618) AUTHORS Nezu,J. TITLE TRAF interacting Zn finger protein JOURNAL Published Only in DataBase (1997) In press REFERENCE 2 (bases 1 to 2618) AUTHORS Nezu,J. TITLE Direct Submission JOURNAL Submitted (18-SEP-1997) to the DDBJ/EMBL/GenBank databases. Jun-ichi Nezu, Chugai Research Institute for Molecular Medicine, Inc., Gene Search Program; 153-2 Nagai, Niihari, Ibaraki 300-41, Japan (E-mail:nezuj@tk.chugai-pharm.co.jp, Tel:81-298-30-6211, Fax:81-298-30-6270) FEATURES Location/Qualifiers source 1..2618 /organism="Homo sapiens" /note="inbetween D12S105 and D12S369" /db_xref="taxon:9606" /chromosome="12" /dev_stage="fetal" /tissue_type="brain" gene 55..1803 /gene="fln29" CDS 55..1803 /gene="fln29" /codon_start=1 /product="Fln29" /db_xref="PID:d1023409" /db_xref="PID:g2463531" /translation="MAEFLDDQETRLCDNCKKEIPVFNFTIHEIHCQRNIGMCPTCKE PFPKSDMETHMAAEHCQVTCKCNKKLEKRLLKKHEETECPLRLAVCQHCDLELSILKL KEHEDYCGARTELCGNCGRNVLVKDLKTHPEVCGREGEEKRNEVAIPPNAYDESWGQD GIWIASQLLRQIEALDPPMRLPRRPLRAFESDVFHNRTTNQRNITAQVSIQNNLFEEQ ERQERNRGQQPPKEGGEESANLDFMLALSLQNEGQASSVAEQDFWRAVCEADQSHGGP RSLSDIKGAADEIMLPCEFCEELYPEELLIDHQTSCNPSRALPSLNTGSSSPRGVEEP DVIFQNFLQQAASNQLDSLMGLSNSHPVEESIIIPCEFCGVQLEEEVLFHHQDQCDQR PATATNHVTEGIPRLDSQPQETSPELPRRRVRHQGDLSSGYLDDTKQETANGPTSCLP PSRPINNMTATYNQLSRSTSGPRPGCQPSSPCVPKLSNSDSQDIQGRNRDSQNGAIAP GHVSVIRPPQNLYPENIVPSFSPGPSGRYGASGRSEGGRNSRVTPAAANYRSRTAKAK PSKQQGAGDAEEEEEE" BASE COUNT 632 a 641 c 706 g 639 t ORIGIN 1 tgcagctagt gtgtcaactc agcgtttctc ctctcgtccc tggaagagct aaagatggct 61 gaatttctag atgaccagga aactcgactg tgtgacaact gcaaaaaaga aattcctgtg 121 tttaacttta ccatccatga gatccactgt caaaggaaca ttggtatgtg tcctacctgt 181 aaggaaccat ttcccaaatc tgacatggag actcacatgg ctgcagaaca ctgtcaggtg 241 acctgcaaat gtaacaagaa gttggagaag aggctgttaa agaagcatga ggagactgag 301 tgccctttgc ggcttgctgt ctgccagcac tgtgatttag aactttccat tctcaaactg 361 aaggaacatg aagattattg tggtgcccgg acggaactat gtggcaactg tggtcgcaat 421 gtccttgtga aagatctgaa gactcaccct gaagtttgtg ggagagaggg ggaggaaaag 481 agaaatgagg ttgccatacc tcctaatgca tatgatgaat cttggggtca ggatggaatc 541 tggattgcat cccaactcct cagacaaatt gaggctctgg acccacccat gaggctgccg 601 cgaaggcccc tgagagcctt tgaatcagat gttttccaca atagaactac caaccaaagg 661 aacattacag cccaggtttc aattcagaat aatctgtttg aagaacaaga gaggcaggaa 721 aggaatagag gccaacagcc ccccaaagag ggtggtgaag agagtgcaaa cttggacttc 781 atgttggccc taagtctgca aaatgaaggc caagcctcca gtgtggcaga gcaggacttc 841 tggagggccg tatgtgaggc cgaccagtct catggcggtc ccaggtctct cagtgacata 901 aagggtgcag ctgacgagat catgttgcct tgtgaatttt gtgaggagct ctacccagag 961 gaactgctga ttgaccatca gacaagctgt aacccttcac gtgccttacc ttcactcaat 1021 actggcagct cttcccccag aggggtggag gaacctgatg tcatcttcca gaacttcttg 1081 caacaggctg caagtaacca gttagactct ttgatgggcc tgagcaattc acaccctgtg 1141 gaggagagca tcattatccc atgtgaattc tgtggggtac agctggaaga ggaggtgctg 1201 ttccatcacc aggaccagtg tgaccaacgc ccagccactg caaccaacca tgtgacagag 1261 gggattccta gactggattc ccagcctcaa gagacctcac cagagctgcc caggaggcgt 1321 gtcagacacc agggagacct gtcttctggt tacctggatg atactaagca ggaaacagct 1381 aatgggccca cctcctgtct gcctcccagc cgacccatta acaatatgac agctacctat 1441 aaccagctat cgagatcaac atcaggcccc agacctgggt gccagcccag ctctccttgt 1501 gtgccgaagc tcagcaactc agacagccag gacatccagg ggcggaatcg agacagccag 1561 aatggggcca tagcccctgg gcacgtttca gtgattcgcc ctcctcaaaa tctctaccca 1621 gaaaacattg tgccctcttt ctcccctggg ccttcaggga gatacggagc tagtggtagg 1681 agtgaaggtg gcaggaattc ccgggtcacc cctgcagctg ccaactaccg cagcagaact 1741 gcaaaggcaa agccttccaa gcaacaggga gctggggatg cagaagagga agaggaggag 1801 taatggtgtc tccagagact ttacatcggt tcctgtcttc tgtgcacagc agcacttgcc 1861 gctgtgcagg cccacctctt tggctctttg ggtgggagag tttttccaga ttttagattt 1921 ttctaggtta tggccatttt gtgtcttttg aggttgtgct gtgggggttt gggtttgagg 1981 gaagggagca gggtggcggt tgaggaacgc ttcagcctta gctgctacct ttcggcagca 2041 gtgaaataca agctgcagcc tcggctgcca gggctccctt ttgacttatt gtcgccactg 2101 ccccttggtg ctgtgtggtc ccagtggaag gaggggaaga ttttggaaac ctggtagcca 2161 ccagtaaggt gattctctgc cctgttgggg cctaaatttg ggggcttttg ggcaacctct 2221 ccgtgtactg cgtctgtcca cactcgattg ggccccaggt gtgtatgagg cgctctggta 2281 aggtgctcag gccagttgca atgtctgtca gtaacgaggc ttttgatgtg ttgagctgga 2341 ggtgagtgga ccgggggctg tgttttaagc tgcttccttg gcatttgcat cactgccttc 2401 tgttcccggg ggagcatgga tcttttgtcc tcactgcttt ctaatgggga gggctgaggg 2461 ctccctgtcc ccacagcagg tatgtttgct ctgccccagc cccacacttg ctctgaaaac 2521 caagtgtcag agccccttcc ccttgttttt attttactgt tataataatt attaacttcc 2581 ttgtaataga aataaagttt gtacttggag ttcagctc // LOCUS AB007448 2135 bp mRNA PRI 03-OCT-1997 DEFINITION Homo sapiens mRNA for polyspecific oraganic cation transporter, complete cds. ACCESSION AB007448 NID g2605500 KEYWORDS polyspecific oraganic cation transporter; fls631; OCTN1. SOURCE Homo sapiens fetal liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2135) AUTHORS Nezu,J. TITLE Identification and functional expression of OCTN1 and OCTN2; two novel polyspecific oraganic cation transporters both containing a nucleotide binding domain JOURNAL Published Only in DataBase (1997) In press REFERENCE 2 (bases 1 to 2135) AUTHORS Nezu,J. TITLE Direct Submission JOURNAL Submitted (18-SEP-1997) to the DDBJ/EMBL/GenBank databases. Jun-ichi Nezu, Chugai Research Institute for Molecular Medicine, Inc., Gene Search Program; 153-2 Nagai, Niihari, Ibaraki 300-41, Japan (E-mail:nezuj@tk.chugai-pharm.co.jp, Tel:81-298-30-6211, Fax:81-298-30-6270) FEATURES Location/Qualifiers source 1..2135 /organism="Homo sapiens" /note="inbetween D5S642 and D5S649" /db_xref="taxon:9606" /chromosome="5" /dev_stage="fetal" /tissue_type="liver" gene 147..1802 /gene="OCTN1" CDS 147..1802 /gene="OCTN1" /note="fls631" /codon_start=1 /product="polyspecific oraganic cation transporter" /db_xref="PID:g2605501" /translation="MRDYDEVIAFLGEWGPFQRLIFFLLSASIIPNGFNGMSVVFLAG TPEHRCRVPDAANLSSAWRNNSVPLRLRDGREVPHSCSRYRLATIANFSALGLEPGRD VDLGQLEQESCLDGWEFSQDVYLSTVVTEWNLVCEDNWKVPLTTSLFFVGVLLGSFVS GQLSDRFGRKNVLFATMAVQTGFSFLQIFSISWEMFTVLFVIVGMGQISNYVVAFILG TEILGKSVRIIFSTLGVCTFFAVGYMLLPLFAYFIRDWRMLLLALTVPGVLCVPLWWF IPESPRWLISQRRFREAEDIIQKAAKMNNTAVPAVIFDSVEELNPLKQQKAFILDLFR TRNIAIMTIMSLLLWMLTSVGYFALSLDAPNLHGDAYLNCFLSALIEIPAYITAWLLL RTLPRRYIIAAVLFWGGGVLLFIQLVPVDYYFLSIGLVMLGKFGITSAFSMLYVFTAE LYPTLVRNMAVGVTSTASRVGSIIAPYFVYLGAYNRMLPYIVMGSLTVLIGIFTLFFP ESLGMTLPETLEQMQKVKWFRSGKKTRDSMETEENPKVLITAF" BASE COUNT 499 a 547 c 530 g 559 t ORIGIN 1 ccccggcttc gcgccccaat ttctaacagc ctgcctgtcc cccgggaacg ttctaacatc 61 cttggggagc gccccagcta caagacactg tcctgagaac gctgtcatca cccgtagttg 121 caagtttcgg agcggcagtg ggaagcatgc gggactacga cgaggtgatc gccttcctgg 181 gcgagtgggg gcccttccag cgcctcatct tcttcctgct cagcgccagc atcatcccca 241 atggcttcaa tggtatgtca gtcgtgttcc tggcggggac cccggagcac cgctgtcgag 301 tgccggacgc cgcgaacctg agcagcgcct ggcgcaacaa cagtgtcccg ctgcggctgc 361 gggacggccg cgaggtgccc cacagctgca gccgctaccg gctcgccacc atcgccaact 421 tctcggcgct cgggctggag ccggggcgcg acgtggacct ggggcagctg gagcaggaga 481 gctgcctgga tggctgggag ttcagccagg acgtctacct gtccaccgtc gtgaccgagt 541 ggaatctggt gtgtgaggac aactggaagg tgcccctcac cacctccctg ttcttcgtag 601 gcgtgctcct cggctccttc gtgtccgggc agctgtcaga caggtttggc aggaagaacg 661 ttctcttcgc aaccatggct gtacagactg gcttcagctt cctgcagatt ttctccatca 721 gctgggagat gttcactgtg ttatttgtca tcgtgggcat gggccagatc tccaactatg 781 tggtagcctt catactagga acagaaattc ttggcaagtc agttcgtatt atattctcta 841 cattaggagt gtgcacattt tttgcagttg gctatatgct gctgccactg tttgcttact 901 tcatcagaga ctggcggatg ctgctgctgg cgctgacggt gccgggagtg ctgtgtgtcc 961 cgctgtggtg gttcattcct gaatctcccc gatggctgat atcccagaga agatttagag 1021 aggctgaaga tatcatccaa aaagctgcaa aaatgaacaa cacagctgta ccagcagtga 1081 tatttgattc tgtggaggag ctaaatcccc tgaagcagca gaaagctttc attctggacc 1141 tgttcaggac tcggaatatt gccataatga ccattatgtc tttgctgcta tggatgctga 1201 cctcagtggg ttactttgct ctgtctctgg atgctcctaa tttacatgga gatgcctacc 1261 tgaactgttt cctctctgcc ttgattgaaa ttccagctta cattacagcc tggctgctat 1321 tgcgaacgct gcccaggcgt tatatcatag ctgcagtact gttctgggga ggaggtgtgc 1381 ttctcttcat tcaactggta cctgtggatt attacttctt atccattggt ctggtcatgc 1441 tgggaaaatt tgggatcacc tctgctttct ccatgctgta tgtcttcact gctgagctct 1501 acccaaccct ggtcaggaac atggcggtgg gggtcacatc cacggcctcc agagtgggca 1561 gcatcattgc cccctacttt gtttacctcg gtgcttacaa cagaatgctg ccctacatcg 1621 tcatgggtag tctgactgtc ctgattggaa tcttcaccct ttttttccct gaaagtttgg 1681 gaatgactct tccagaaacc ttagagcaga tgcagaaagt gaaatggttc agatctggga 1741 aaaaaacaag agactcaatg gagacagaag aaaatcccaa ggttctaata actgcattct 1801 gaaaaaatat ctaccccatt tggtgaagtg aaaaacagaa aaataagacc ctgtggagaa 1861 attcgttgtt cccactgaaa tggactgact gtaacgattg acaccaaaat gaaccttgct 1921 atcaagaaat gctcgtcata cagtaaactc tggatgattc ttccagataa tgtccttgct 1981 ttacaaacca accatttcta gagagtctcc ttactcatta attcaatgaa atggattggt 2041 aagatgtctt gaaaacatgt tagtcaagga ctggtaaaat acatataaag attaacactc 2101 atttccaatc atacaaatac tatccaaata aaaat // LOCUS AB007454 1503 bp mRNA PRI 26-DEC-1997 DEFINITION Homo sapiens mRNA for chemokine LEC precursor, complete cds. ACCESSION AB007454 NID g2723285 KEYWORDS chemokine LEC precursor. SOURCE Homo sapiens liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Shoudai,K., Hieshima,K., Fukuda,S., Iio,M., Miura,R., Imai,T., Yoshie,O. and Nomiyama,H. TITLE Isolation of cDNA encoding a novel human CC chemokine NCC-4/LEC JOURNAL Biochim. Biophys. Acta (1998) In press REFERENCE 2 (bases 1 to 1503) AUTHORS Nomiyama,H. TITLE Direct Submission JOURNAL Submitted (19-SEP-1997) to the DDBJ/EMBL/GenBank databases. Hisayuki Nomiyama, Kumamoto University Medical School, Department of Biochemistry; Honjo 2-2-1, Kumamoto, Kumamoto 860, Japan (E-mail:nomiyama@gpo.kumamoto-u.ac.jp, Tel:81-96-373-5063, Fax:81-96-372-6140) FEATURES Location/Qualifiers source 1..1503 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" sig_peptide 77..145 CDS 77..439 /codon_start=1 /product="chemokine LEC precursor" /db_xref="PID:d1024963" /db_xref="PID:g2723286" /translation="MKVSEAALSLLVLILIITSASRSQPKVPEWVNTPSTCCLKYYEK VLPRRLVVGYRKALNCHLPAIIFVTKRNREVCTNPNDDWVQEYIKDPNLPLLPTRNLS TVKIITAKNGQPQLLNSQ" mat_peptide 146..436 polyA_signal 560..565 polyA_signal 1485..1490 BASE COUNT 417 a 374 c 312 g 400 t ORIGIN 1 gttggcaagc ggaccaccag caacagacaa catcttcatt cggctctccc tgaagctgta 61 ctgcctcgct gagaggatga aggtctccga ggctgccctg tctctccttg tcctcatcct 121 tatcattact tcggcttctc gcagccagcc aaaagttcct gagtgggtga acaccccatc 181 cacctgctgc ctgaagtatt atgagaaagt gttgccaagg agactagtgg tgggatacag 241 aaaggccctc aactgtcacc tgccagcaat catcttcgtc accaagagga accgagaagt 301 ctgcaccaac cccaatgacg actgggtcca agagtacatc aaggatccca acctaccttt 361 gctgcctacc aggaacttgt ccacggttaa aattattaca gcaaagaatg gtcaacccca 421 gctcctcaac tcccagtgat gaccaggctt tagtggaagc ccttgtttac agaagagagg 481 ggtaaaccta tgaaaacagg ggaagcctta ttaggctgaa actagccagt cacattgaga 541 gaagcagaac aatgatcaaa ataaaggaga agtatttcga atattttctc aatcttagga 601 ggaaatacca aagttaaggg acgtgggcag aggtacgctc ttttattttt atatttatat 661 ttttattttt ttgagatagg gtcttactct gtcacccagg ctggagtgca gtggtgtgat 721 cttggctcac ttgatcttgg ctcactgtaa cctccacctc ccaggctcaa gtgatcctcc 781 caccccagcc tcccgagtag ctgggactac aggcttgcgc caccacacct ggctaatttt 841 tgtatttttg gtagagacgg gattctacca tgttgcccag gctggtctca aactcgtgtg 901 cccaagcaat ccacctgcct cagccttcca aaagtgctgg gattacaggc gtgagccacc 961 acatccggcc agtgcactct taatacacag aaaaaatata ttcacatcct tctcctgctc 1021 tctttcaatt cctcacttca caccagtaca caagccattc taaatactta gccagtttcc 1081 agccttccag atgatctttg ccctctgggt cttgacccat taagagcccc atagaactct 1141 tgatttttcc tgtccatctt tatggatttt tctggatcta tattttcttc aattattctt 1201 tcattttata atgcaacttt ttcataggaa gtccggatgg gaatattcac attaatcatt 1261 tttgcagaga ctttgctaga tcctctcata ttttgtcttc ctcagggtgg caggggtaca 1321 gagatgtcct gattggaaaa aaaaaaaaaa gagagagaga gagaagaaga agaagaagag 1381 acacaaatct ctacctccca tgttaagctt tgcaggacag ggaaagaaag ggtatgagac 1441 acggctaggg gtaaactctt agtccaaaac ccaagcatgc aataaataaa actcccttat 1501 ttg // LOCUS AB007510 7221 bp mRNA PRI 02-OCT-1997 DEFINITION Homo sapiens mRNA for PRP8 protein, complete cds. ACCESSION AB007510 NID g2463576 KEYWORDS PRP8 protein. SOURCE Homo sapiens tissue_lib:fetal brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7221) AUTHORS Shimada,Y., Fujiwara,T., Kawai,A., Shimizu,F., Okuno,S., Ozaki,K., Takeda,S., Watanabe,T., Nagata,M. and Takahashi,E. TITLE Human homologue of Saccharomyces serevisiae PRP8, Pre-mRNA splicing factor JOURNAL Published Only in DataBase (1997) In press REFERENCE 2 (bases 1 to 7221) AUTHORS Shimada,Y., Fujiwara,T., Kawai,A., Shimizu,F., Okuno,S., Ozaki,K., Takeda,S., Watanabe,T., Nagata,M. and Takahashi,E. TITLE Direct Submission JOURNAL Submitted (22-SEP-1997) to the DDBJ/EMBL/GenBank databases. Yoshikazu Shimada, Otsuka Pharmaceutical Co. Ltd., Otuka GEN Research Institute; Kagasuno, Kawauchi-cho, Tokushima, Tokushima 771-01, Japan (E-mail:shim@otsuka.genome.ad.jp, Tel:0886-65-2888, Fax:0886-37-1035) FEATURES Location/Qualifiers source 1..7221 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17" /map="17p13.3" /tissue_lib="fetal brain" gene 42..7049 /gene="hPRP8" CDS 42..7049 /gene="hPRP8" /codon_start=1 /product="PRP8 protein" /db_xref="PID:d1023431" /db_xref="PID:g2463577" /translation="MAGVFPYRGPGNPVPGPLAPLPDYMSEEKLQEKARKWQQLQAKR YAEKRKFGFVDAQKEDMPPEHVREIIRDHGDMTNRKFRHDKRVYLGALKYMPHAVLKL LENMPMPWEQIRDVPVLYHITGAISFVNEIPWVIEPVYISQWGSMWIMMRREKRDRRH FKRMRFPPFDDEEPPLDYADNILNVEPLEAIQLELDPEEDAPVLDWFYDHQPLRDSRK YVNGSTYQRWQFTLPMMSTLYRLANQLLTDLVDDNYFYLFDLKAFFTSKALNMAIPGG PKFEPLVRDINLQDEDWNEFNDINKIIIRQPIRTEYKIAFPYLYNNLPHHVHLTWYHT PNVVFIKTEDPDLPAFYFDPLINPISHRHSVKSQEPLPDDDEEFELPEFVEPFLKDTP LYTDNTANGIALLWAPRPFNLRSGRTRRALDIPLVKNWYREHCPAGQPVKVRVSYQKL LKYYVLNALKHRPPKAQKKRYLFRSFKATKFFQSTKLDWVEGWLQVCRQGYNMLNLLI HRKNLNYLHLDYNFNLKPVKTLTTKERKKSRFGNAFHLCREVLRLTKLVVDSHVQYRL GNVDAFQLADGLQYIFAHVGQLTGMYRYKYKLMRQIRVCKDLKHLIYYRFNTGPVGKG PGCGFWAAGWRVWLFFMRGITPLLERWLGNLLARQFEGRHSKGVAKTVTKQRVESHFD LELRAAVMHDILDMMPEGIKQNKARTILQHLSEAWRCWKANIPWKVPGLPTPIENMIL RYVKAKADWWTNTAHYNRERIRRGATVDKTVCKKNLGRLTRLYLKAEQERQHNYLKDG PYITAEETVAVYTTTVHWLESRRFSPIPFPPLSYKHDTKLLILALERLKEAYSVKSRL NQSQREELGLIEQAYDNLHEALSRIKRHLLTQRAFKEVGIEFMDLYSHLVPVYDVEPL EKITDAYLDQYLWYEADKRRLFPPWIKPADTEPPPLLVYKWCQGINNLQDVWETSEGE CNVMLESRFEKMYEKIDLTLLNRLVRLIVDHNIADYMTAKNNVVINYKDMNHTNSYGI IRGLQFASFIVQYYGLVMDLLVLGLHRASEMAGPPQMPNDFLSFQDIATEAAHPIRLF CRYIDRIHIFFRFTADEARDLIQRYLTEHPDPNNENIVGYNNKKCWPRDARMRLMKHD VNLGRAVFWDIKNRLPRSVTTVQWENSFVSVYSKDNPNLLFNMCGFECRILPKCRTSY EEFTHKDGVWNLQNEVTKERTAQCFLRVDDESMQRFHNRVRQILMASGSTTFTKIVNK WNTALIGLMTYFREAVVNTQELLDLLVKCEHKIQTRIKIGLNSKMPSRFPPVVFYTPK ELGGLGMLSMGHVLIPQSDLRWSKQTDVGITHFRSGMSHEEDQLIPNLYRYIQPWESE FIDSQRVWAEYSLKRQEAIAQNRRLTLEDLEDSWDRGIPRINTLFQKDRHTLAYDKGW RVRTDFKQYQVLKQNPFWWTHQRHDGKLWNLNNYRTDMIQALGGVEGILEHTLFKGTY FPTWEGLFWEKASGFEESMKWKKLTNAQRSGLNQIPNRRFTLWWSPTINRANVYVGFQ VQLDLTGIFMHGKIPTLKISLIQIFRAHLWQKIHESIVMDLCQVFDQELDALEIETVQ KETIHPRKSYKMNSSCADILLFASYKWNVSRPSLLADSKDVMDSTTTQKYWIDIQLRW GDYDSHDIERYARAKFLDYTTDNMSIYPSPTGVLIAIDLAYNLHSAYGNWFPGSKPLI QQAMAKIMKANPALYVLRERIRKGLQLYSSEPTEPYLSSQNYGELFSNQIIWFVDDTN VYRVTIHKTFEGNLTTKPINGAIFIFNPRTGQLFLKIIHTSVWAGQKRLGQLAKWKTA EEVAALIRSLPVEEQPKQIIVTRKDMLDPLEVHLLDFPNIVIKGSELQLPFQACLKVE KFGDLILKATEPQMVLFNLYDDWLKTISSYTAFSRLILILRALHVNNDRAKVILKPDK TTITEPHHIWPTLTDEEWIKVEVQLKDLILADYGKKNNVNVASLTQSEIRDIILGMEI SAPSQQRQQIAEIEKQTKEQSQLTATQTRTVNKHGDEIITSTTSNYETQTFSSKTEWR VRAISAANLHLRTNHIYVSSDDIKETGYTYILPKNVLKKFICISDLRAQIAGYLYGVS PPDNPQVKEIRCIVMVPQWGTHQTVHLPGQLPQHEYLKEMEPLGWIHTQPNESPQLSP QDVTTHAKIMADNPSWDGEKTIIITCSFTPGSCTLTAYKLTPSGYEWGRQNTDKGNNP KGYLPSHYERVQMLLSDRFLGFFMVPAQSSWNYNFMGVRHDPNMKYELQLANPKEFYH EVHRPSHFLNFALLQEGEVYSADREDLYA" BASE COUNT 1824 a 1961 c 1800 g 1636 t ORIGIN 1 cgggcggcct cttgtgtgag ggcctgtggg attctccgga tatggccgga gtgtttcctt 61 atcgagggcc gggtaacccg gtgcctggcc ctctagcccc gctaccggac tacatgtcgg 121 aggagaagct gcaggagaaa gctcgaaaat ggcagcaatt gcaggccaag cgctatgcag 181 aaaagcggaa gtttgggttt gtggatgccc agaaggaaga catgccccca gaacatgtca 241 gggagatcat tcgagaccat ggagacatga ccaacaggaa gttccgccat gacaaaaggg 301 tttacttggg tgccctaaag tacatgcccc acgcagtcct caaactcctg gagaacatgc 361 ctatgccttg ggagcagatt cgggatgtgc ccgtgctgta ccacatcact ggagccattt 421 ccttcgtcaa tgagattccc tgggtcattg aacctgtcta catctcccag tgggggtcaa 481 tgtggattat gatgcgccga gaaaaaagag ataggaggca tttcaagaga atgcgttttc 541 ccccttttga tgatgaggag ccgcccttgg actatgctga caacatccta aatgttgagc 601 cactggaggc cattcagcta gagctggacc ctgaggagga cgcccctgtg ttggactggt 661 tctatgacca ccagccgttg agggacagca ggaagtatgt aaatggctcc acttaccagc 721 gctggcagtt cacactacct atgatgtcaa ctctctaccg cctggctaat cagctcctga 781 cagacttggt ggatgacaac tacttctacc tgtttgattt gaaggccttc tttacgtcca 841 aggcactcaa tatggccatt cctggaggcc ccaaatttga acctcttgtt cgagacatca 901 acctacagga tgaagactgg aatgaattca atgatattaa caagattatc atccggcagc 961 ctatccggac tgagtacaag attgcttttc cttacttgta caacaatctt ccacaccatg 1021 tccacctcac ctggtaccat actcccaatg ttgtattcat caaaactgaa gatcctgact 1081 tgccagcttt ctactttgac cctttgatca acccaatctc ccataggcac tcagtcaaga 1141 gccaggaacc attgccggat gatgatgagg aatttgagct cccggagttt gtggagccct 1201 tcctgaagga cacacccctc tatacagaca atacagccaa tggcattgcc ctgctctggg 1261 ccccgcggcc cttcaaccta cgctctggtc gcacccgtcg ggccctggac ataccccttg 1321 tcaagaactg gtatcgggag cattgtcctg ccgggcagcc tgtgaaagtg agggtctcct 1381 accagaagct gcttaagtac tatgtgctga atgccctgaa gcatcggccc cctaaggctc 1441 aaaagaagag gtatttgttc cgctccttca aagccaccaa attctttcag tccacaaagc 1501 tggactgggt ggagggttgg ctccaggttt gccgccaggg ctacaacatg ctcaaccttc 1561 tcattcaccg caaaaacctc aactacctgc acctggacta caacttcaac ctcaagcctg 1621 tgaaaacgct caccaccaag gaaagaaaga aatctcgttt tgggaatgct ttccacctgt 1681 gtcgggaagt tctgcgtttg actaagctgg tggtggatag tcacgtgcag tatcggctgg 1741 gcaatgtgga tgccttccag ctggcagatg gattgcagta tatatttgcc catgttgggc 1801 agttgacggg catgtatcga tacaaataca agctgatgcg acagattcgc gtgtgcaagg 1861 acctgaagca tctcatctat tatcgtttca acacaggccc tgtagggaag ggtcctggct 1921 gtggcttctg ggctgccggt tggcgagtct ggctcttttt catgcgtggc attacccctt 1981 tattagagcg atggcttggc aacctcctgg cccggcagtt tgaaggtcga cactcaaagg 2041 gggtggcaaa gacagtaaca aagcagcgag tggagtcaca ttttgacctt gagctgcggg 2101 cagctgtgat gcatgatatt ctggacatga tgcctgaggg gatcaagcag aacaaggccc 2161 ggacaatcct gcagcacctc agtgaagcct ggcgctgctg gaaagccaac attccctgga 2221 aggtccctgg gctgccgacg cccatagaga atatgatcct tcgatacgtg aaggccaagg 2281 ctgactggtg gaccaacact gcccactaca accgagaacg gatccgccga ggggccactg 2341 tggacaagac tgtttgtaaa aagaatctgg gccgcctcac ccggctctat ctgaaggcag 2401 aacaggagcg gcagcacaac tacctgaagg acgggcctta catcacagcg gaggaaacag 2461 tggcagtata taccaccaca gtgcattggt tggaaagccg caggttttca cccatcccat 2521 tccccccact ctcctataag catgacacca agttgctcat cttggcattg gagcggctca 2581 aggaagctta tagtgtgaag tctcggttga accagtctca gagggaggag ctaggtctga 2641 tcgagcaggc ctacgataac ctccacgagg cgctgtcccg cataaagcgt cacctcctca 2701 cacagagagc cttcaaagag gtgggcattg agttcatgga tctgtatagc cacctcgttc 2761 cagtatatga tgttgagccc ctggagaaga taactgatgc ttacctggac cagtacctgt 2821 ggtatgaagc cgacaagcgc cgcctgttcc caccctggat taagcctgca gacacagaac 2881 cacctccact gcttgtttac aagtggtgtc aaggcatcaa taacctgcag gacgtgtggg 2941 agacgagtga aggcgagtgc aatgtcatgc tggaatcccg ctttgagaag atgtatgaga 3001 agatcgactt gactctgctc aacaggctcg tgcgcctcat cgtggaccac aacatagccg 3061 actacatgac agccaagaac aacgtcgtca tcaactataa ggacatgaac catacgaatt 3121 catatgggat catcagaggc ctgcagtttg cctcattcat agtgcagtat tatggcctgg 3181 tgatggattt gcttgtattg ggattgcacc gggccagtga gatggctggg ccccctcaga 3241 tgccaaatga ctttctcagt ttccaggaca tagccactga ggctgcccac cccatccgtc 3301 tcttctgcag atacattgat cgcatccata tttttttcag gttcacagca gatgaggctc 3361 gggacctgat tcaacgttac ctgacagagc accctgaccc caataatgaa aacatcgttg 3421 gctataataa caagaagtgc tggccccgag atgcccgcat gcgcctcatg aaacatgatg 3481 ttaacttagg ccgggcggta ttctgggaca tcaagaaccg cttgccacgg tcagtgacta 3541 cagttcagtg ggagaacagc ttcgtgtctg tgtacagtaa ggacaacccc aacctgctgt 3601 tcaacatgtg tggcttcgag tgccgcatcc tgcctaagtg ccgcaccagc tatgaggagt 3661 tcacccacaa ggacggggtc tggaacctgc agaatgaggt tactaaggag cgcacagctc 3721 agtgtttcct gcgtgtggac gatgagtcaa tgcagcgctt ccacaaccgc gtgcgtcaga 3781 ttctcatggc ctctgggtcc accaccttca ccaagattgt gaataagtgg aatacagctc 3841 tcattggcct tatgacatac tttcgggagg ctgtggtgaa cacccaagag ctcttggact 3901 tactggtgaa gtgtgagcac aaaatccaga cacgtatcaa gattggactc aactccaaga 3961 tgccaagtcg gttccccccg gttgtgttct acacccctaa ggagttgggt ggactcggca 4021 tgctctcaat gggccatgtg ctcatccccc aatccgacct caggtggtcc aaacagacag 4081 atgtaggtat cacacacttt cgttcaggaa tgagccatga agaagaccag ctcattccca 4141 acttgtaccg ctacatacag ccatgggaga gcgagttcat tgattctcag cgggtctggg 4201 ctgagtactc actcaagaga caagaggcca ttgctcagaa cagacgcctg actttagaag 4261 acctagaaga ttcatgggat cgtggcattc ctcgaatcaa taccctcttc cagaaggacc 4321 ggcacacact ggcttatgat aagggctggc gtgtcagaac tgactttaag cagtatcagg 4381 ttttgaagca gaatccgttc tggtggacac accagcggca tgatgggaag ctctggaacc 4441 tgaacaacta ccgtacagac atgatccagg ccctgggcgg tgtggaaggc attctggaac 4501 acacactctt taagggcact tacttcccta cctgggaggg gcttttctgg gagaaggcca 4561 gtggctttga ggaatctatg aagtggaaga agctaactaa tgctcagcga tcaggactga 4621 accagattcc caatcgtaga ttcaccctct ggtggtcccc gaccattaat cgagccaatg 4681 tatatgtagg ctttcaggtg cagctagacc tgacgggtat cttcatgcac ggcaagatcc 4741 ccacgctgaa gatctctctc atccagatct tccgagctca cttgtggcag aagatccatg 4801 agagcattgt tatggactta tgtcaggtgt ttgaccagga acttgatgca ctggaaattg 4861 agacagtaca aaaggagaca atccatcccc gaaagtcata taagatgaac tcttcctgtg 4921 cagatatcct gctctttgcc tcctataagt ggaatgtctc ccggccctca ttgctggctg 4981 actccaagga tgtgatggac agcaccacca cccagaaata ctggattgac atccagttgc 5041 gctgggggga ctatgattcc cacgacattg agcgctacgc ccgggccaag ttcctggact 5101 acaccaccga caacatgagt atctaccctt cgcccacagg tgtactcatc gccattgacc 5161 tggcctataa cttgcacagt gcctatggaa actggttccc aggcagcaag cctctcatac 5221 aacaggccat ggccaagatc atgaaggcaa accctgccct gtatgtgtta cgtgaacgga 5281 tccgcaaggg gctacagctc tattcatctg aacccactga gccttatttg tcttctcaga 5341 actatggtga gctcttctcc aaccagatta tctggtttgt ggatgacacc aacgtctaca 5401 gagtgactat tcacaagacc tttgaaggga acttgacaac caagcccatc aacggagcca 5461 tcttcatctt caacccacgc acagggcagc tgttcctcaa gataatccac acgtccgtgt 5521 gggcgggaca gaagcgtttg gggcagttgg ctaagtggaa gacagctgag gaggtggccg 5581 ccctgatccg atctctgcct gtggaggagc agcccaagca gatcattgtc accaggaagg 5641 acatgctgga cccactggag gtgcacttac tggacttccc caatattgtc atcaaaggat 5701 cggagctcca actccctttc caggcgtgtc tcaaggtgga aaaattcggg gatctcatcc 5761 ttaaagccac tgagccccag atggttctct tcaacctcta tgacgactgg ctcaagacta 5821 tttcatctta cacggccttc tcccgtctca tcctgattct gcgtgcccta catgtgaaca 5881 acgatcgggc aaaagtgatc ctgaagccag acaagactac tattacagaa ccacaccaca 5941 tctggcccac tctgactgac gaagaatgga tcaaggtcga ggtgcagctc aaggatctga 6001 tcttggctga ctacggcaag aaaaacaatg tgaacgtggc atcactgaca caatcagaaa 6061 ttcgagacat catcctgggt atggagatct cggcaccgtc acagcagcgg cagcagatcg 6121 ctgagatcga gaagcagacc aaggaacaat cgcagctgac ggcaacacag actcgcactg 6181 tcaacaagca tggcgatgag atcatcacct ccaccaccag caactatgag acccagactt 6241 tctcatccaa gactgagtgg agggtcaggg ccatctctgc tgccaacctg cacctaagga 6301 ccaatcacat ctatgtttca tctgacgaca tcaaggagac tggctacacc tacatccttc 6361 ccaagaatgt gcttaagaag ttcatctgca tatctgacct tcgggcccaa attgcaggat 6421 acctatatgg ggtgagccca ccagataacc cccaggtgaa ggagatccgc tgcattgtga 6481 tggtgccgca gtggggcact caccagaccg tgcacctgcc tggccagctg ccccagcatg 6541 agtacctcaa ggagatggaa cccttaggtt ggatccacac tcagcccaat gagtccccgc 6601 agttatcacc ccaggatgtc accacccatg ccaagatcat ggctgacaac ccatcttggg 6661 atggcgagaa gaccattatc atcacatgca gcttcacgcc aggctcctgt acactgacgg 6721 cctacaagct gacccctagt ggctacgaat ggggccgcca gaacacagac aagggcaaca 6781 accccaaggg ctacctgcct tcacactatg agagggtgca gatgctgctg tcggaccgtt 6841 tccttggctt cttcatggtc cctgcccagt cctcgtggaa ctacaacttc atgggtgttc 6901 ggcatgaccc caacatgaaa tatgagctac agctggcgaa ccccaaagag ttctaccacg 6961 aggtgcacag gccctctcac ttcctcaact ttgctctcct gcaggagggg gaggtttact 7021 ctgcggatcg ggaggacctg tatgcctgac cgtttccctg cctcctgctt cagcctcccg 7081 aggccgaagc ctcagcccct ccagacaggc cgctgacatt cagcagtttg gcctctttcc 7141 ctctgtctgt gcttgtgttg ttgacctcct gatggcttgt catcctgaat aaaatataat 7201 aataaatttt gtataaatag g // LOCUS AB007618 1111 bp mRNA PRI 09-OCT-1997 DEFINITION Homo sapiens mRNA for COX7RP, complete cds. ACCESSION AB007618 NID g2465177 KEYWORDS COX7RP. SOURCE Homo sapiens placenta cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Watanabe,T., Inoue,S., Hiroi,H., Kawashima,H. and Muramatsu,M. TITLE Isolation of estrogen responsive genes using CpG island library JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 1111) AUTHORS Watanabe,T., Inoue,S., Hiroi,H., Kawashima,H. and Muramatsu,M. TITLE Direct Submission JOURNAL Submitted (25-SEP-1997) to the DDBJ/EMBL/GenBank databases. Toru Watanabe, Saitama Medical School, Department of Biochemistry; 38 Moro-Hongo, Moroyama-machi, Iruma-gun, Saitama 350-04, Japan (E-mail:watanabe.toru@yamanouchi.co.jp, Tel:81-492-76-1143, Fax:81-492-94-9751) FEATURES Location/Qualifiers source 1..1111 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" CDS 41..385 /codon_start=1 /product="COX7RP" /db_xref="PID:d1023439" /db_xref="PID:g2465178" /translation="MYYKFSGFTQKLAGAWASEAYSPQGLKPVVSTEAPPIIFATPTK LTSDSTVYDYAGKNKVPELQKFFQKADGVPVYLKRGLPDQMLYRTTMALTVGGTIYCL IALYNASQPKNK" BASE COUNT 281 a 206 c 270 g 354 t ORIGIN 1 cgcgttggca gcggatgcgg gaagccggac tctgggcgtc atgtactaca agtttagtgg 61 cttcacgcag aagttggcag gagcatgggc ttcggaggcc tatagcccgc agggattaaa 121 gcctgtggtt tccacagaag caccacctat catatttgcc acaccaacta aactgacctc 181 cgattccaca gtgtatgatt atgctgggaa aaacaaagtt ccagagctac aaaagttttt 241 ccagaaagct gatggtgtgc ccgtctacct gaaacgaggc ctgcctgacc aaatgcttta 301 ccggaccacc atggcgctga ctgtgggagg gaccatctac tgcctgatcg ccctctacaa 361 tgcttcgcag cccaaaaaca aatgagttag gctgcagagg actggtttgt tttttggcat 421 aaaccctttt gaaggtcctt tttcattgtt aaattaaaat tttttttttt acttggatgg 481 cttaacattt ttgcaagaaa aaatagggag atatgaagat gatgttttgg tttgtttatg 541 aaatgcatat ggcttgtcag agctcattcg acagttaaag ccattgttta aagaaacggt 601 gctttgctct gtgtttgtgc tcctgatttc cctggaggtt ctggatgaag gctgaacaca 661 ggcttgttaa tgtcagtctg tgctgaggac ctcagggact tgaggttgca tttttgagca 721 tggggtgcag gagcctttct ggatttggga tgtggctatg gaaagaacac agaaggcaag 781 gtcatgtgca tgtaaatgag gagtttgagt tagtcacctc ggggattttt tccattttgc 841 agtaaaatgt taaattaatg tagcctgcct ctatttgttg ggcaggtaat ttcaaagggt 901 tatttgcctc atctcctatc tttagtgaaa tcttatgtgt aattgtgtgt atttattcca 961 ccgtgggaac agagaatacc tgtttagtgt tgcactttag actggtgtct gttttgttaa 1021 tgcagctgtg ccacaaattc tcctttatct tttaaaaatg ttataccttt aaattttgat 1081 ttattttgac tgtggaataa atacatgaat g // LOCUS AB007619 1104 bp mRNA PRI 09-OCT-1997 DEFINITION Homo sapiens mRNA for EBAG9, complete cds. ACCESSION AB007619 NID g2465179 KEYWORDS EBAG9. SOURCE Homo sapiens breast cancer cell_line:MCF-7 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Watanabe,T., Inoue,S., Hiroi,H., Kawashima,H. and Muramatsu,M. TITLE Isolation of estrogen responsive genes using CpG island library JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 1104) AUTHORS Watanabe,T., Inoue,S., Hiroi,H., Orimo,A., Kawashima,H. and Muramatsu,M. TITLE Direct Submission JOURNAL Submitted (25-SEP-1997) to the DDBJ/EMBL/GenBank databases. Toru Watanabe, Saitama Medical School, Department of Biochemistry; 38 Moro-Hongo, Moroyama-machi, Iruma-gun, Saitama 350-04, Japan (E-mail:watanabe.toru@yamanouchi.co.jp, Tel:81-492-76-1143, Fax:81-492-94-9751) FEATURES Location/Qualifiers source 1..1104 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="MCF-7" /cell_type="breast cancer" CDS 284..925 /codon_start=1 /product="EBAG9" /db_xref="PID:d1023440" /db_xref="PID:g2465180" /translation="MAITQFRLFKFCTCLATVFSFLKRLICRSGRGRKLSGDQITLPT TVDYSSVPKQTDVEEWTSWDEDAPTSVKIEGGNGNVATQQNSLEQLEPDYFKDMTPTI RKTQKIVIKKREPLNFGIPDGSTGFSSRLAATQDLPFIHQSSELGDLDTWQENTNAWE EEEDAAWQAEEVLRQQKLADREKRAAEQQRKKMEKEAQRLMKKEQNKIGVKLS" BASE COUNT 352 a 240 c 246 g 266 t ORIGIN 1 gaattcggca cgaggggaga ctgggctgtg gggtaccggc ccggaaagca cgcagcctcc 61 aaagccgcct tcctcaggga aatttgcgtg accttactgc cctccgtcta caggccttgt 121 acctctccag gccgattttt ccacaattta aatctcagtt cacctggtat ccagctccag 181 caacttagag cgtttcacgt cacgccgggc gccaggcgtc ggcttgtata acctgaaaac 241 gctcctgttt ttctcatctg tgcagtgggt tttgattccc accatggcca tcacccagtt 301 tcggttattt aaattttgta cctgcctagc aacagtattc tcattcctaa agagattaat 361 atgcagatct ggcagaggac ggaaattaag tggagaccaa ataactttgc caactacagt 421 tgattattca tcagttccta agcagacaga tgttgaagag tggacttcct gggatgaaga 481 tgcacccacc agtgtaaaga tcgaaggagg gaatgggaat gtggcaacac aacaaaattc 541 tttggaacaa ctggaacctg actattttaa ggacatgaca ccaactatta ggaaaactca 601 gaaaattgtt attaagaaga gagaaccatt gaattttggc atcccagatg ggagcacagg 661 tttctctagt agattagcag ctacacaaga tctgcctttt attcatcagt cttctgaatt 721 aggtgactta gatacctggc aggaaaatac caatgcatgg gaagaagaag aagatgcagc 781 ctggcaagca gaagaagttc tgagacagca gaaactagca gacagagaaa agagagcagc 841 cgaacaacaa aggaagaaaa tggaaaagga agcacaacgg ctaatgaaga aggaacaaaa 901 caaaattggt gtgaaacttt cataacacat gttcaaattt tatcatgcca gtaggagaaa 961 tctcagctcc acaacccaag caacatttgt atggatttaa gagtatttta agaagacata 1021 ctgcttgatt ttaatacatt gatcaggcca tccaggacac cacgattctc ccaaagtacc 1081 ttgaactctt agtgattgag actc // LOCUS AB007828 3300 bp DNA PRI 09-OCT-1997 DEFINITION Homo sapiens gene for necdin, complete cds. ACCESSION AB007828 NID g2516265 KEYWORDS NDN; necdin. SOURCE Homo sapiens female leukocytes DNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Nakada,Y., Taniura,H., Uetsuki,T., Inazawa,J. and Yoshikawa,K. TITLE Structure, expression, and chromosomal localization of the human necdin gene JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 3300) AUTHORS Yoshikawa,K. TITLE Direct Submission JOURNAL Submitted (02-OCT-1997) to the DDBJ/EMBL/GenBank databases. Kazuaki Yoshikawa, Institute for Protein Research, Osaka University, Div. Regulation of Macromolecular Functions; Yamadaoka 3-2, Suita, Osaka 565, Japan (E-mail:yoshikaw@protein.osaka-u.ac.jp, Tel:06-879-8621, Fax:06-879-8623) FEATURES Location/Qualifiers source 1..3300 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="leukocytes" /sex="female" promoter 1191..1335 /function="postmitotic neuron-restrictive core promoter" prim_transcript 1368..3263 gene 1454..2419 /gene="NDN" CDS 1454..2419 /gene="NDN" /function="postmitotic neuron-specific growth suppressor" /codon_start=1 /product="necdin" /db_xref="PID:d1023528" /db_xref="PID:g2516266" /translation="MSEQSKDLSDPNFAAEAPNSEVHSSPGVSEGVPPSATLAEPQSP PLGPTAAPQAAPPPQAPNDEGDPKALQQAAEEGRAHQAPSAAQPGPAPPAPAQLVQKA HELMWYVLVKDQKKMIIWFPDMVKDVIGSYKKWCRSILRRTSLILARVFGLHLRLTSL HTMEFALVKALEPEELDRVALSNRMPMTGLLLMILSLIYVKGRGARESAVWNVLRILG LRPWKKHSTFGDVRKLITEEFVQMNYLKYQRVPYVEPPEYEFFWGSRASREITKMQIM EFLARVFKKDPQAWPSRYREALEEARALREANPTAHYPRSSVSED" polyA_signal 3254..3259 BASE COUNT 856 a 842 c 807 g 795 t ORIGIN 1 aagcttaaga gtcctgttgg agggactggt gtggtaatgg ctctgcaaaa gtgttatgtg 61 cgtgcaaacc caaagagaga aagcacagaa aacctttcaa catcaacctg cttgaggaaa 121 aataaagtgg gaaaagatac atactcacag tgaggactct agacatgtca agacaatttt 181 taaatatgct tttggcttcg agtggcaata actagattca agacagcata tttaagaagc 241 tgctgatgag aagaaacccg ggaagagctg aaggaccaca tcagcccaga ccaaggatgc 301 tgaagcagca ttaaggtccc tggtttcaga tgctcaggca atgacccttt ttttcatgga 361 gagcctgtag gagtgacagt tttgtctttg cccactggga atctgttttc catacctgga 421 aaacagggtt acctatgttt cccctgctac cctttggtca tctcagagac actaccagat 481 attacccatg ggacctattt tttttttaaa tctcaggaaa gacttgggtg tggcttccaa 541 cgtggaggac tcagtagctt cagagagggt cctgagagaa ggtgaattga agaatgaggg 601 tgctgggcag agggaaaaga cattatcata caagtttgtg ctaaaagata tagcaatcct 661 tctgctatgg actaagtatg gaaaaaaata aaatggaatc aaagttaccc aaaggaagtg 721 taaaacccaa atttatgccc gttaaagcat taatgatgct ctaagtccac tgcctactta 781 aaaagttcat agttcacatg ggttgatagg aaattacgtt aacgacacac tgcatttccc 841 cttttcttat agcctatctg atttggtagg gagtcgatca ttttttattg gaatttctca 901 ggattccaac ctcagacatc cactttacag tttacacatt ttcttggaca agcccgactg 961 ttcctctcac tggttcgcat aaagctcatg tttacaaagc cgcccagacc tttctctggg 1021 actctcatat ttaaattaat tctggatata cccaggtaag cgtttcccaa gaaacttgac 1081 cccaacatcc caaaaactta aggtatcttt cccttaaact ggccccttct ccagtacgca 1141 tccatctcac ttctctcctg ccctacatct tctcagccca aacaggaaac cccgggatcg 1201 ctctcccagc aggtgaagcc tcgccatgga ccctccccgt cgggccccgc gctgccccgc 1261 ccgcccccag ccgctggcca aggccgcggt cgcgcaggcg cagtgccgcg tcccgccgcc 1321 gccccgccct gcccgtcgct gcggaaggcg ctgcgccagc aacgcgcact tcctctccag 1381 gaatccgcgg agggagcgca ggctcgaaga gctcctggac gcagaggccc tgcccttgcc 1441 agacggcgca gacatgtcag aacaaagtaa ggatctgagc gaccctaact ttgcagccga 1501 ggcccccaac tccgaggtgc acagcagccc tggggtttcg gagggggttc ctccgtccgc 1561 gaccctggca gagccgcaga gccctcctct aggcccgacg gccgctccgc aggccgcgcc 1621 gcctccccag gccccgaacg acgagggcga cccgaaggcc ctgcagcagg ctgcggagga 1681 gggccgcgcc caccaggccc cgagcgcggc ccagccgggc ccggcaccgc cagccccggc 1741 gcagctggtg cagaaggcgc acgagctcat gtggtacgtg ctggtcaagg accagaagaa 1801 gatgatcatc tggtttccag acatggtgaa agatgtcatc ggcagctaca agaagtggtg 1861 caggagcatc ctccggcgca ccagcctcat cctcgcccgg gtgttcgggc tgcacctgag 1921 gctaaccagc ctgcacacca tggagtttgc gctggtcaaa gcgctggagc ccgaggagct 1981 ggacagggtg gcgctgagca accgcatgcc catgacaggc ctcctgctca tgatcctgag 2041 cctcatctac gtgaagggcc gcggcgccag agagagcgcc gtctggaacg tgctgcgcat 2101 cctggggctg cggccctgga agaagcactc caccttcggg gacgtgcgga agctcatcac 2161 tgaggagttc gtccaaatga attacctgaa gtaccagcgc gtcccatacg tggagccgcc 2221 cgaatacgag ttcttttggg gctcccgggc cagccgcgaa atcaccaaga tgcaaatcat 2281 ggagttcctg gccagggtct ttaagaaaga cccccaggcc tggccctccc gatacagaga 2341 agctctggag gaggccagag ctctgcggga ggctaatccc actgcccact accctcgcag 2401 cagtgtctct gaggactagc aaagtctgga ggcagatgaa tggtttctga ccctcaccag 2461 ggctgtggaa gggtgggggt gggtcattat agtattcagg atttacagtg cagtattcac 2521 gtgtaacttt taagttttca gtacagtgct tttatacctt taatgcaatg ttgtattcat 2581 ttgggtacta ttgtgtagta tttaggatgt atgcatgttt gtttatatgt aagcttggtt 2641 ggtgctttcg cttttgtgct acctttcttg gatttttgta ccagagatgt gctaaactga 2701 tgaaatacat tgagaaagtt tccatcttat tcttttatat gggactgatg atgtgtgttg 2761 gggtagactg ctcctgcaga gtttggaaga agtcaccagc aaagccggcc taaccaagaa 2821 aagtcaaggc ccttcatgac cttgctgggc acagaaaaca ccctcgtgga gtacactaat 2881 ttgaactgga ctggtctcag tgtgagcact tggcacactt tactaaacac atatacaacc 2941 ccaccgtgag tcaactttaa agtaaacatt aaagattctt gtgatacaat catttttgga 3001 aaagtgtact ttatcatttt aacaaagcag tatggttggg aatgagacaa ttctctattt 3061 tacagtgtat acagatacaa ctatttcccc taatagggtg ggaaaaatcg ctactcatga 3121 ttactcctaa atttgtgaag tttatagttc tattgtcttt aaatgtaact catgtttatt 3181 tcaaaaacat tcacaaatat agaaaagtat acaaaacaaa acagtaagat tgtctgtaat 3241 cacatcatat gggaataaaa aacaaaaata atttccttcc cttaagtttc tacattttat // LOCUS AB007854 7979 bp mRNA PRI 05-DEC-1997 DEFINITION Homo sapiens KIAA0394 mRNA, complete cds. ACCESSION AB007854 NID g2662068 KEYWORDS KIAA0394. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HF0236. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ishikawa,K., Nagase,T., Nakajima,D., Seki,N., Ohira,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VIII. The complete sequences of 77 new cDNA clones from brain which can code for large proteins in vitro JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 7979) AUTHORS Ohara,O. TITLE Direct Submission JOURNAL Submitted (06-OCT-1997) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, DNA Technology; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1..7979 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HF0236" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 122..1360 /gene="KIAA0394" CDS 122..1360 /gene="KIAA0394" /codon_start=1 /db_xref="PID:d1024571" /db_xref="PID:g2662069" /translation="MVPPPPGEESQTVILPPGWQSYLSPQGRRYYVNTTTNETTWERP SSSPGIPASPGSHRSSLPPTVNGYHASGTPAHPPETAHMSVRKSTGDSQNLGSSSPSK KQSKENTITINCVTFPHPDTMPEQQLLKPTEWSYCDYFWADKKDPQGNGTVAGFELLL QKQLKGKQMQKEMSEFIRERIKIEEDYAKNLAKLSQNSLASQEEGSLGEAWAQVKKSL ADEAEVHLKFSAKLHSEVEKPLMNFRENFKKDMKKCDHHIADLRKQLASRYASVEKAR KALTERQRDLEMKTQQLEIKLSNKTEEDIKKARRKSTQAGDDLMRCVDLYNQAQSKWF EEMVTTTLELERLEVERVEMIRQHLCQYTQLRHETDMFNQSTVEPVDQLLRKVDPAKD RELWVREHKTGNIRPVDMEI" BASE COUNT 1890 a 2079 c 2006 g 2004 t ORIGIN 1 gaggatgcag agtgggagac cttaaaaccg tctgatctgg ctgctaaagc gagtgtctga 61 cgaccaccca ttaacaccag cgagtagcgt ttgctcacgg ctttaggata agaagcctgg 121 aatggtcccc cctccgccgg gagaagaaag ccagacggtc atccttccac ctggctggca 181 gagctacctg tcgcctcagg gccggcggta ctatgtcaac acgaccacca atgagaccac 241 ctgggaacgt cccagcagtt ctcctgggat tccagccagc cctggctctc acaggagctc 301 tctgcctcca acagtgaatg gataccacgc atcagggacc ccagcgcacc ctccagagac 361 tgcccacatg agtgtccgaa aatccaccgg tgattcccag aacctgggat cctcatcgcc 421 aagcaaaaag cagagcaagg aaaacaccat cacaataaac tgtgtgacgt tccctcaccc 481 agacacgatg ccggaacagc agctgctgaa accaaccgag tggagctact gcgactactt 541 ctgggctgat aagaaggacc cccaaggcaa cggcacggtg gctgggtttg aactactgct 601 ccagaaacag ctgaagggca aacaaatgca gaaggaaatg tcagaattca tccgggaaag 661 gataaagatt gaagaagact atgcgaagaa cttagctaag ctctctcaga actccttggc 721 ttcacaggag gaaggctcct tgggagaggc gtgggcccag gtgaagaaga gcctggcgga 781 cgaagcagaa gttcacctca agttctctgc caagcttcac agcgaggtgg agaagcccct 841 gatgaacttc cgtgagaact tcaagaaaga catgaagaag tgcgaccacc acattgccga 901 ccttcgcaag cagctcgcca gccgctatgc ctcggtggag aaggcccgga aagccctcac 961 agagcggcag agagacctgg agatgaagac ccagcagctg gagatcaagc tgagcaacaa 1021 gacagaggag gacatcaaga aggcgcggag aaagtccaca caggctggag acgacctcat 1081 gcgctgtgtg gatctctaca accaggccca gtccaaatgg tttgaagaga tggtgaccac 1141 cacattggag ctagagcggc tggaggtgga gagggtagag atgatccggc agcacctgtg 1201 ccagtacacg cagctgcggc atgaaacaga catgttcaac caaagcacag tcgagcccgt 1261 ggatcagctg cttcgaaaag tggacccggc caaagacagg gagctgtggg tcagagagca 1321 caagacgggc aacatccgcc ctgtggacat ggagatctag atgggcctgt gcagcttcgg 1381 ggggtcctgc tggggagggg ggctgggctc ccaccatggg gcccatgccg agtggatgcc 1441 ccccaccctc tctcctgggc cactgaggag aggggagaga gctggtgatt ccagaagggt 1501 gacccggaca gcctagctgg gggctccccc atattcccag gcccagaaga cagacccaca 1561 gccctgccct tgtctctgag gctgaagacc ccctgactcc catgctgtgc ttgccgctct 1621 gaaacaaaca gaggctggaa ctttgtggtc cttgccgagt tttgtagggg tcactgtcac 1681 atgcttgtgc cctggaagcc ccaaggctct tcctccagct ggctcccttg tctccagggc 1741 tttggaggat cagggtaggg agggctctgt ctctaagcca ggtgtcagga tcagaatcat 1801 gggtagaagg tgccattcag ctcacagccg cacccagaat cctttgcagc cctccttctt 1861 tatttttttc ccattgcatt ctgggagtcc acatctggct ttctcagcca ctgttcatca 1921 ccaggggttt taggaggaag gcttggctcc tgtcttccca gacccaccat gcctggagag 1981 gtcaggatgg aactacctca ttcggcaaat tagccccaaa ttgagcgctg aatcgtgtcc 2041 catgagatca ggcgccatct gtaaagtctc ctctggaaat gccaatccat ccttccccca 2101 gctgcttcct cggggaggcc cctgccccca ccctgccagc ccttcccagt ctcattagag 2161 gaagctttcc aaagttctca gttatcaaga cctcatccag ctcgtccaaa ggcttcaggg 2221 atggaaacaa accagctcgt ctcagaggcc agcaagctgg ggcctgtccg ccacggtgcc 2281 ctgtgcacct ttgggctgcc ctggccccca ctctcccggg ccgcccacca caggctctta 2341 atggggccgg gtcagtccta catgtgagat gggttagggc aagtctttgc catcccccag 2401 atggctctgt cttcttgtgt atggcagggc tgggactgct gtcccttgta cagttttctg 2461 tcactggtgg gcactggaca ggcatagcag actctcttgt ggccacacta ggttgtcgcc 2521 tttcaggcag tgtcccgcca cctttgcttc ccgctccttc tggacattcc agagccctgc 2581 accaaaccct tacttggtgt ctgcacctcc tttccctctc tccatcttcc aatccctgga 2641 aaagtctggt ctgagtgtga cttgggaagc tttcagtgct gctgtttggc ccagctcatt 2701 actttctccc tttctctacc acagcaaaca cattctccac catgtcagtc taaagagtct 2761 aaaggggccg ggagaagcat gagcgagggg cagattctag tcgggagccc atgccctgga 2821 aatccatctt tcttcatctt tcccattgac cactttgggt ttgacctgca catctgcagt 2881 gagggcagaa ttcaacaagc acaactcact ggtctttcag tcaacgtgct agaaaccgat 2941 gacttattaa tctctaattt tttggcgcct ctttcattga atgagaattg ctttcgtata 3001 gttcacatta gaaaaatgcc taatatacta aagtaaacca aacgttgtca cttttctctt 3061 gttcttgaaa cattgcaacc aaaaaggtca gcacaaaggc cttcacctac gtgaagacct 3121 cctggtagga tctgtccatg ggatggagaa ccactctgtc cagatctggg gttgggtcat 3181 gacaccagct accaatttta gaatattatt tcttggtttc tttatgaaaa atgggtgcta 3241 gtggtaattg ctttgtggct tagtaaacta ctctgtggat gatttccaaa cattcaaagc 3301 caatagcctt gttattaaca agatattttg agtacaatat ggctcattga cttttccatt 3361 acatctgagg attccagagt cctctgttca tccctgggat agagtgagcc ctcttgcgtc 3421 ttccagggta gctcaggttg gcccagtttg ggccagtcca tttttggagg tcactctttc 3481 ccccctcatc ccctcacctc cttctcttta cccccttcac gtacttcctt tctttcttcc 3541 ctctgttatt cattcatcaa gcaggttaaa gccattgtgc taagatctaa tctgaggaca 3601 ctataatggc cctgtcgtca gggaagcccg tgatcgctcg ttttcagggg tcttaccggt 3661 cagcaacctt ggcattgata cataggcact ctacagaatt taagttttca gaggtaagtc 3721 tgttgttctg attaatctgc attcattcag cgaatgctca agacaagcca ggcatgctga 3781 gaagcggcgc tctagttgta tgtaagatgg gcattcctct gctctcctct attctccatg 3841 tcgactgaga agagtataat aagaacattg taccctctcc acacttaacc ctctgggatt 3901 ctttattata atataaacca ccgtcgtgag gcatttacct gttggatgga aggtagacag 3961 caacaagatt ttaatcacct ggtcaccaga ccctcacggc ccatatccaa ttcgaatttc 4021 agaaatttct ctgtttgctt taaataactc tttacaaata agggtcgggc acggtggctc 4081 acgcctgtaa tcccagctct ttgggaggcc gaggtgggca gatcacgagg tcaggagatc 4141 gagaccatcc tggctaacat ggtgaaaccc cgtctctgct aaaaatacaa aaaattagcc 4201 agatgtggtg gtgcacacct gtagtcccaa ctattcggca ggctgaggca gtagaatcgc 4261 ttgaacctgg gaggcggaga ttgcagtgaa ccaagatcgg gccactgcac tccagcttgg 4321 gtgacagagc aagactccat ctcaaaaaaa ttataaataa ctctttgcaa ataggaaagt 4381 gaagctcaag ggtaaccagt gaggctatga aattgccagt gaattaagac cagggctaga 4441 gtcaacagtc cagaacccct atccgtatga acaggggccc gtttctggtt gaaatcaagg 4501 ttaggatgca actgagaaaa taaaagactg cactttggag aggcaagcac gctatccggg 4561 ttgttgtttt gttttatgtc ttggtgctta ggtgaggact tagctgggtt tattcaaagt 4621 gggcgggtct gggctattct tgagaatgtc ccttccattt ctgagaacaa gtcagcccct 4681 tctacctgcc tccacccagg agactgatgc acaagatctg tttgtgaaag cctgatttaa 4741 aaagaaaaaa aaaattcctt aagccaaatg gtgttccaag ttggttgctg tgacaacctc 4801 caggaagagc cacttgttac cctctattct ttctagtgct ttcagtcagt ggcaacacat 4861 aggccccttg gaccgtgggg atggcggccc tgagattctg tcagtgggac caccctgggc 4921 ctcctttcta cctccactca gcaccctgtt ggccaaggag aatttctgcg gtgggaggca 4981 gtgctctgct agtcaggatt gataaacagc tgggcacacc gaaacagtgt accaacgaat 5041 tcaccaacca ggggtttctt tccctaccct ttgtgaaaac caatcaatta ctagatgagt 5101 ggatggatgc agaaaaatct gggctgagcc aaagtccctt ttggaaatac aagccataac 5161 attcgaagga catcagcgac cttggcttgt ttaggtgatt ttacttccag ctgcaggtag 5221 tcttgacaag gagtgtttaa acagaaggct caagatgcat tccttgtgta ggtcggagag 5281 agcacttcta atgttaagtg gggtacagat cagctgcccc cccacgtagc ctggacatcg 5341 tcttatcccc ataatccttg ccatccctac aaggcccatc gccaccacct tttcccaggt 5401 ttattgaggt atgattgaca tccagtaaaa ttcacccttt ggaaatatac agctctgtga 5461 attttgacaa atttagtgtc ttgtgaccat caccaagatc aacctgtttt taaccctcca 5521 aaaaattccc ttctcctgcc ctctttccct ggcaaccctg attgattatc tgatcctata 5581 attttgcctt ttctagaatg tcatataaat ggagtcacac tgatgtcgcc ttttgagtct 5641 ggcatctttc cctcagctta atgcttttga gtcattcatg atgtgtgcgt gtggtttgtt 5701 ccttctcctt gccgggaagt atttcattgc atgggcgtac tactctgctt ctttattttt 5761 acactgagcc tttcgccctg gaagttcttg gcccagggtc ttcaacttgt ttgcacgacc 5821 acttccagct tgctgcttcc ttcagttccc caggctcctc ctccaaatgc cagctgtgcc 5881 cacaagctgt tcttagggct cacactcgcc ttttggcgcg tggtgggttt ttggtagtgc 5941 agaaagaatc cagggatggg aggggaaggg acggatgggt gctcaattgc tgctcacgtc 6001 tgctgcaacc tgaagcttgc atctcagcca gcagatctgc tcccttctgg gacccaggct 6061 tcagtgtcac acggtccttg gctatgtatt gggcgttgga ggctttgaaa ggcgagcaga 6121 agcagcatgg acaagaaggc tggggctggc cctggggctc atcatgatgc tttcctggat 6181 ctgttgcttc atctgcaagc tgagggtgtt tgttctagat gagtgtctca ggccaccggc 6241 aactgcatgt acctcctccc tttctcattt ccaatgatgt accctgtaca cgtgttcatg 6301 ctgtgtctgg ctctcatctg cacaatcgtg catagaattg cctcaagtcc tggtgagaga 6361 gatgccgtgg tacttttcca tttagattca aatggagcta aaattaagag ttttatgagc 6421 tgttaagaat gaggtagttt ctcctaggac cccccaaaga cagtgcaagt aatgaccgtt 6481 tggatctcat tcgtcgatct ttgatagtat gttctggagt ctactcccca ggagccagga 6541 caggcgtgaa gatggagtcc ttgtcgcagc tggagccttg cctagctggt gatcacacag 6601 cctggcctgt acctgcaccc cactggatgg tggtacatgg tggcagggac aggaccacac 6661 ccagttaagg ccagaccagg ctgagtgtga cccctgaggt aaacactcca ctaagctgtg 6721 tcttgttcat gccccctgct cagtgaaagg tgagtcccga gaccagttgg gtacctctct 6781 atgcgaacca gagacatttc tggatccagg ccaggtgaag attagggcca ggaagcctga 6841 gcccccgggg cctcaaggta gggagccgaa gaggctgcca ggactctgct gggttgaaat 6901 ttgccgggga ggactcttgt ctccccctca ggagtatttt tgttgaggct ttcctggagg 6961 tgaagaagca attcccattg cagcaggtta gagcgagaat cagacagagg gcaaaaacca 7021 attcgcttct ccccacgttc taaatgctgg ggcatggctg tcaggagggc ttcctgggag 7081 gtgtctctgg gggtggggtg aggttggggc gatggccttt ggagattgcg tgtggtgttc 7141 aggactgttc cttggtgttt gagggaaact ttagtgggat tgcagtggaa tgtaaggtca 7201 gggcacgtgg gtgctctctc ggggtggggt gactgggaga cctagaggga aagcctgcta 7261 tgcaggggga gagcacagga ctggccctgc tctgcggcct cctttgtccc ataacctgaa 7321 gttaagtcac atcccctgtc gggacctccg tgcactcatc tgtcaagtgg gggcgcttcc 7381 cttccagcat cacctgcagc agacgggctc tcgggagtcg tgggttccag gcagctgtgt 7441 ggacccaggg acagacattc aaagggacgc cagccatcct tagtgacagg ggccccaact 7501 tagcatccct tcccttccgt taggaaggag atgaccggaa gcaacccctt cacagacacg 7561 agcacatcgg caaaccctat gaaagtggaa ttttctaaca aaataaactt gcttgtttga 7621 tctgttttct gtaacttttg ctaaatactt tatacatttt tcatgttaaa gagccgtgtc 7681 tcccgccagc actcctcacc ccggtatgaa tgtgtttcct ccacattgta tatccttcca 7741 ccctctggct gcctagatca gtaaataaaa ttgatgtaat ataatttata agtaacactg 7801 ttgaaaccct gatcccagtg gaggctgtaa cccacctgcc cccgcaccac ccccctgacc 7861 cctgttaccg catttgtgtg tattaatgct gaagaattaa atgtttaaag agtttaaatt 7921 ttgaaggcgt ttgctatata cagttgtcct gcattattat aaagagtttt caggaagtt // LOCUS AB007857 6629 bp mRNA PRI 05-DEC-1997 DEFINITION Homo sapiens KIAA0397 mRNA, complete cds. ACCESSION AB007857 NID g2662074 KEYWORDS KIAA0397. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG0184. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ishikawa,K., Nagase,T., Nakajima,D., Seki,N., Ohira,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VIII. The complete sequences of 77 new cDNA clones from brain which can code for large proteins in vitro JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 6629) AUTHORS Ohara,O. TITLE Direct Submission JOURNAL Submitted (06-OCT-1997) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, DNA Technology; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1..6629 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG0184" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 55..1521 /gene="KIAA0397" CDS 55..1521 /gene="KIAA0397" /codon_start=1 /db_xref="PID:d1024574" /db_xref="PID:g2662075" /translation="MGSAEDAVKEKLLWNVKKEVKQIMEEAVTRKFVHEDSSHIIALC GAVEACLLHQLRRRAAGFLRSDKMAALFTKVGKTCPVAGEICHKVQELQQQAEGRKPS GVSQEALRRQGSASGKAPALSPQALKHVWVRTALIEKVLDKVVQYLAENCSKYYEKEA LLADPVFGPILASLLVGPCALEYTKLKTADHYWTDPSADELVQRHRIRGPPTRQDSPA KRPALGIRKRHSSGSASEDKLAACARECVESLHQNSRTRLLYGKNHVLVQPKEDMEAV PGYLSLHQSAESLTLKWTPNQLMNGTLGDSELEKSVYWDYALVVPFSQVVCIHCHQQK SGGTLVLVSQDGIQRPPLHFPQGGHLLSFLSCLENGLLPQGQLEPPLWTQQGKGKVFP KLRKRSSIRSVDMEEMGTGRATDYVFRIIYPGHRHEHNAGDMIEMQGFGPSLPAWHLE PLCSQGSSCLSCSSSSSPHATPSHCSCIPDRGAAAPPC" BASE COUNT 1404 a 1926 c 1925 g 1374 t ORIGIN 1 gcggcgaggg cgcgggggct ctgaggaccg ctcggcgccg cctcctgcca caccatgggc 61 agcgcagagg acgcagtcaa agagaaactg ctgtggaacg tgaagaagga ggtgaagcaa 121 atcatggagg aggctgtcac caggaagttt gtgcatgaag acagcagcca catcattgct 181 ttatgtggtg cagtggaggc ttgcctcttg catcagctga gacgccgtgc cgctggcttc 241 ctgcgcagtg acaagatggc agccctgttc accaaggtgg ggaagacgtg cccagtggcg 301 ggggagattt gccacaaggt acaggagctg cagcaacaag cagagggcag gaaaccctca 361 ggggtcagcc aggaggccct gcggagacag ggctcagcca gcgggaaggc cccggccctc 421 agccctcagg ccttgaaaca cgtatgggta cgcacggcgc tcatcgagaa agttctggac 481 aaggtcgtgc aatacctggc ggaaaactgc agcaagtact acgagaagga ggcactgctg 541 gcagaccctg tgttcggccc gatcctggcc tctcttctag tgggaccctg tgccttggaa 601 tacactaagc tcaagacagc cgatcactac tggactgacc cctctgctga tgagctggtc 661 cagcggcacc gcatccgggg tccacctact cgccaggact cccctgcaaa gcgcccagcc 721 ctggggatcc ggaaacggca ctcaagcggc agcgcgtcgg aggacaagct ggctgcctgc 781 gcccgcgagt gtgtggagtc cctgcaccag aactcacgga cgcggctgct ctatggcaag 841 aaccacgtgc tggtgcagcc gaaggaggat atggaggcgg tccctggcta cctctccctg 901 caccagtctg cagagagcct gactctgaag tggaccccca accagctcat gaatgggact 961 ctgggggact ccgagctgga aaagagcgtt tactgggact atgccctcgt ggtgcccttc 1021 agccaggtcg tgtgcatcca ctgccaccag caaaagagcg gtggcacgct tgtgctggtg 1081 agccaggatg gcatccagag gccgccgctg catttcccac agggaggaca cctgctgtcc 1141 tttctgtcct gtctggagaa tgggctgctg cctcagggac agctagagcc cccgctgtgg 1201 acccagcaag ggaaggggaa agtgttcccc aagctacgga aacgaagcag cattcgctcc 1261 gtggatatgg aggagatggg cacggggcgg gccaccgact atgtgttccg gatcatctac 1321 cccggccaca ggcacgagca caacgctggt gacatgatcg agatgcaggg ctttgggccc 1381 agcctgccag cctggcacct ggagcccctg tgcagtcagg gctcctcctg cctctcctgc 1441 tcctccagca gctccccaca tgcaaccccc agccactgta gctgcatccc cgaccgaggg 1501 gccgcagccc ctccctgctg agcccggttg ttgccacagg ttgccgctca ggctactgtg 1561 tgagagtatg aagaggcaga tcgtgtcccg ggccttctac ggctggctgg cacactgccg 1621 ccacctgtcc acggtgcgga cccacctgtc ggcgctggtg caccatagcg ttatcccacc 1681 tgaccggccc ccgggggcct ccgcgggcct caccaaggac gtgtggagca agtatcagaa 1741 ggacaaaaag aactacaaag agctggagct gctgcggcaa gtttactacg gaggcataga 1801 gcacgagatc cgcaaggacg tctggccctt tctgcttggc cactacaagt tcggcatgag 1861 caagaaggag atggagcagg tggacgcagt ggtggcagca aggtaccagc aggtgttggc 1921 agagtggaag gcctgcgagg tggtggtgag gcagcgggag cgggaggccc acccagccac 1981 acgcaccaag ttctcctcag gcagcagcat cgacagccac gtgcagcgcc tcatccaccg 2041 agactccacc atcagcaacg atgtgagcca gacgggacct ggagggttgg gggtctcggg 2101 ggccacccgc gttttatgca cagtggtcct gagcaccagc ctgacctctg ggaactggtg 2161 gggccctgcg agaaaggcct aaggtgcctg tgtctcattt tctccaactg gaaatggcta 2221 actgtgcctc tgctgcctac ttctctgggt attgtaggaa taaagtgaga gagtgcattg 2281 tgctcagttt tagccaacta tagggaaaga tggacttact gggatttagg gaagccctcc 2341 tccttgtaga aagacctcaa agctagcaac aggcagcgct gggttctagt cccagatcca 2401 ctactgacaa gctgaatgtc tctgggcaag cacttcccgt ctctgggtct cagtttcccc 2461 tctccaccca tatcctctga ctgcagaggc ttcctgagat ctgtgggcct gagaataggg 2521 gagcccgtag agcagcccca ttggtgtcga ctggcgagat ccttcctccc cgcgatgttg 2581 cctgtcactg tacagaactg actatggcag gcttgttcgg agcacgggag ggtagctctt 2641 tctggcatca ctcctgcctt ttgaacagca agttctaaac tgtgactgcc tggcccaacc 2701 aacactgata agtttcaatt ttaaggacgc tttattaatt tttctttaaa attgcctctt 2761 tagataatgt gtattcttgt tactttacta aatccttacc aacattaaca gaaaatgtaa 2821 gttgaagtaa gttaaatata actggctggg tgtgatggct catgcctgta attccaacac 2881 tttgggaggc agaggtggga ggattgcttc agttcaagag ttcgagacca gcctgggtaa 2941 catggcgaaa ccctgtcttt acaaaaaatg caaaactttg ccgcatgtgt tggggtgcgc 3001 ctgtagtccc agcttctcgg gaggctgagg tggggggacc acctgagcca tggaggttga 3061 ggctgcagtg agccgtgata ccaccactgt actctagcct gggccataga gtgagacacc 3121 ctgcctcaga aataaaataa aaaaaaagaa atagaacaaa caaggcacat tgtcatttcc 3181 cacgtgcttt gttcttaatc ccctatccag gaacacattc tccctctctc tggctttggg 3241 ttttgtaaaa atctgtctgt gccagtgccc agctgctgag ctcatggaca gcagtagctc 3301 ctcgctggcg tattagatgg ccacagaacc gtggccttac ccagcacttc atgcttcctg 3361 cccaaagtag aaactacttt tagcattgct ttccctacgt ggtgaaaaca gagagagaaa 3421 aatcccaaac agtctctgta tgtgtctctt cttttcctcg tgccttgtcc atggtggtgg 3481 actgtatcta ggaggcagaa tgtttgcttt tccttcctac atctccccac cttgcctgag 3541 ggctaatgga actgactagg tgcttgctac agaacgttcc tgtcttgtcc tgctgaacac 3601 tcagtactga aaataaggca aaacttgttt ttcctaaaac ctctagcctt ctctccaccc 3661 agccacacca aaacactaag gccattctcc cggtctcatg acttcaggcg tttctagaat 3721 gagactgagt tcaaactggc cattgagcgt tcagccccct ggctccccta cttcccttca 3781 gtagccacag gaagacctgg ggctgttcct tcaagcaggg ctgaagacgt gtcttcatgt 3841 ttggccagag gcttgtttct ctctgcctgg ctggctgggt cagaagctgg cttggtgctc 3901 cgtggccaca gtgttggagg ctgtggtggg agagggggtg tgaagagttt tgcattccag 3961 ctcccttctg gcccgaggag gcggtgggct gcgggaaggt gctggagtgc aggtggagcc 4021 gccctgtgtt caccccccag gtgtttatct cagtggatga tctggaaccc ccggagcccc 4081 aggaccctga agattccaga ccaaaacctg agcaggaagc aggacccggg actccgggca 4141 ccgccgtggt ggagcagcag cattccgtgg agttcgactc tccagactca ggactgccct 4201 cctctcgcaa ttactccgtg gcctcgggca tccagtcaag cctagatgag gggcagagcg 4261 tgggcttcga agaggaggac ggcggtgggg aggaaggctc cagtgggccc ggccctgcag 4321 ctcacacttt gagggagccc caggatccca gccaggagaa gcctcaggcc ggagaactgg 4381 aggccggaga ggagcttgcg gctgtgtgtg cggctgccta cactatagaa ttactggaca 4441 ctgtggcctt aaacctgcac cgcatagaca aggatgtgca gaggtgtgac cgcaactact 4501 ggtacttcac gccccccaac ctcgagaggc tcagagacgt catgtgcagc tacgtgtggg 4561 agcacctgga cgtgggctat gtgcagggca tgtgcgatct gctggcgcct ctcctggtca 4621 ccctcgacaa tgatcagctg gcctacagct gcttcagcca cctcatgaag aggatgagcc 4681 agaacttccc caacgggggt gccatggaca cccactttgc caacatgcgc tccctcatcc 4741 agatcctgga ctcagagctg tttgagctga tgcatcagaa tggagactac acccacttct 4801 acttctgtta tcgctggttc ctgctggatt ttaagagaga actgctgtat gaggatgtgt 4861 ttgctgtgtg ggaggtgatc tgggcagcca ggcacatctc atcggagcac tttgtcctgt 4921 tcatcgccct cgccctggtg gaggcctacc gagagatcat ccgtgacaac aacatggact 4981 tcactgacat catcaagttt ttcaatgaac gtgctgagca tcacgatgcc caggagatcc 5041 tgcggattgc ccgggacctc gtccacaagg tgcagatgct catagagaac aagtgagctg 5101 gggccaggag gcagcagccg tgcagagcct gggctccggc agggagaggt gcaggggagt 5161 caccgcccag acctccccag ccaccaaccg accccacctc tgttcctaac aaagcggttg 5221 tgagcctgga tccgactccc ggcagtgctg accctgcagg gcaagtcagg ggccaggatg 5281 ccctcggatc agggccggga tgggaggggt cagcctcagg gagcagctgc cttgggggac 5341 acacctactc tgctcccctc tcacacatct gggagtagcc ccactgccac ctgcagccgc 5401 agcctggact gctgcccacg agtgaacctg gggccccaca ggattaacag gggctatagc 5461 ggcctgggcc ctactcagct ggggtggcag agggcgagag gctctgtgct gtgtcccttc 5521 tgagggtccc tttgcagtcc cagtatattg tgcgtgcacc agccccagct ggagcaacca 5581 aaactgcttc tggtttaggc acacgtcacg ggtgcgggag acccgggcac gggagacccg 5641 ggccgccttc aggccgctcc cccgagattc tggggcagtc ggaagatgtg ggccctgggt 5701 gggagcagcc cacttcagca gcactgccac tgcctttggc cacctgaggt gacccccagg 5761 cctccccggc cttgtacagt gtacctctgt gtatctgtac agcctcgctc ctgccacccc 5821 acccttgcgt tctgcattag gtacttccct gaaaaccacg tgtaagaagt gatgcttttg 5881 ccagtggatg atctggaatg cgaccggagc acttgctctg aggaatccca gggtgactct 5941 gtcggggaag aatccggtca cagcctcccc tcagagacag gcctcagttc gagggcagcc 6001 cattatctgt cgcagacatc tgccatgtcc ctgagactgc gggaggcagg ggatgcatgg 6061 gtgtccccat ctgtccctgg tgagaagcaa ggctagctct gccttccaca tgcttctcag 6121 gatgccaaga ggcctaggaa cccagaaacc cccttggagg agctgtgtat gcgggggtgc 6181 caggaagggc atagctcctg cccccaggcc taggcatgct gcttgctcgg ccatccccac 6241 ttcctcctct accccaacac atgcaggcta ggccttgccc tggaacgtgg aggccctgcc 6301 cggggctggg gcctagtcca gcagccacca aagtctggca gactttctgc atttgagaaa 6361 catgacaaag ggccccgcag cccttctgca cccagcacag cctgcccagc ccagccctgc 6421 ccccgggcac tgcacagcct gtctgggggc cagaccaccc cattgtccat ccttgttgtc 6481 cgtgagtcct tggcctccac caagcacgtg tggccattgt gtgcctgcct tagtgactcc 6541 gtggttttgt gaggagcaga gtgtgtatga tttttgcctc agaaactata ctcttctgtg 6601 taacagacca ataaacaaca tttgtcaac // LOCUS AB007858 6203 bp mRNA PRI 05-DEC-1997 DEFINITION Homo sapiens KIAA0398 mRNA, complete cds. ACCESSION AB007858 NID g2662076 KEYWORDS KIAA0398. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG0376. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ishikawa,K., Nagase,T., Nakajima,D., Seki,N., Ohira,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VIII. The complete sequences of 77 new cDNA clones from brain which can code for large proteins in vitro JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 6203) AUTHORS Ohara,O. TITLE Direct Submission JOURNAL Submitted (06-OCT-1997) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, DNA Technology; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1..6203 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG0376" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 197..1627 /gene="KIAA0398" CDS 197..1627 /gene="KIAA0398" /codon_start=1 /db_xref="PID:d1024575" /db_xref="PID:g2662077" /translation="MANSAKAEEYEKMSLEQAKASVNSETESSFNINENTTASGTGLS EKTSVCRQVDIARKRKEFEDDLVKESSSCGKDTPSKKRKLDPEIVPEEKDCGDAEGNS KKRKRETEDVPKDKSSTGDGTQNKRKIALEDVPEKQKNLEEGHSSTVAAHYNELQEVG LEKRSQSRIFYLRNFNNWMKSVLIGEFLEKVRQKKKRDITVLDLGCGKGGDLLKWKKG RINKLVCTDIADVSVKQCQQRYEDMKNRRDSEYIFSAEFITADSSKELLIDKFRDPQM CFDICSCQFVCHYSFESYEQADMMLRNACERLSPGGYFIGTTPNSFELIRRLEASETE SFGNEIYTVKFQKKGDYPLFGCKYDFNLEGVVDVPEFLVYFPLLNEMAKKYNMKLVYK KTFLEFYEEKIKNNENKMLLKRMQALEPYPANESSKLVSEKVDDYEHAAKYMKNSQVR LPLGTLSKSEWEATSIYLVFAFEKQQ" BASE COUNT 1797 a 1198 c 1341 g 1867 t ORIGIN 1 aacagaatcg cgtttggctg tgctggatgt gtgaacctat tgggtactgt acaacttcaa 61 gcctcgaaat cagataggca ccaccaacct catttcctgt ttcaccttga ttttctgtga 121 taccaaaatc tcagcttcac aaagtcattg aaagtgttgg ttcatgaagt tttaccatca 181 attcaagtaa tcataaatgg caaattctgc aaaagcagaa gaatatgaaa agatgtctct 241 tgaacaggca aaagcgtcag tgaattctga aacagagtct tcattcaata ttaatgaaaa 301 cacaacagct tctgggactg ggctttctga aaagacttct gtctgtaggc aagtagacat 361 agcaagaaag agaaaagagt ttgaagatga tcttgtaaag gaaagttcta gttgtgggaa 421 agacactcca tccaagaaga gaaaacttga tcctgaaatt gtcccagagg aaaaagattg 481 tggtgatgct gaaggcaatt caaagaaaag aaaaagagaa actgaggatg ttccaaaaga 541 taaatcttct actggagatg gcactcaaaa taagagaaaa atagcacttg aggatgttcc 601 tgaaaagcag aaaaatctgg aagaaggaca cagctcaaca gtggctgccc attacaatga 661 acttcaggaa gttggtttgg agaagcgtag tcaaagtcgt attttttacc taagaaactt 721 taataattgg atgaaaagtg ttctcattgg agaatttttg gaaaaggtac gacagaagaa 781 aaaacgtgat atcactgttt tggacctggg atgtggtaaa ggtggagatt tgctgaaatg 841 gaaaaaagga agaattaaca agctagtttg tactgatatt gccgatgttt ctgtcaaaca 901 gtgtcagcag cggtatgagg acatgaaaaa tcgtcgtgat agtgaatata ttttcagtgc 961 agaatttata actgctgaca gctcaaagga acttctgatt gacaaatttc gtgacccaca 1021 aatgtgtttt gacatctgca gttgtcagtt tgtctgtcat tactcatttg agtcttatga 1081 gcaggctgac atgatgctga gaaatgcgtg tgagagactt agccctgggg gctattttat 1141 tggtactact cccaatagct ttgaattgat aagacgcctt gaagcttcag aaacagaatc 1201 atttggaaat gaaatatata ctgtgaaatt tcagaagaaa ggagattatc ctttatttgg 1261 ctgcaaatat gacttcaact tggaaggtgt tgtggatgtt cctgaattct tggtctattt 1321 tccattgcta aatgaaatgg caaagaagta caatatgaaa ctagtctaca aaaaaacatt 1381 tctggaattc tacgaagaaa agattaagaa caatgaaaat aaaatgctct taaaacgaat 1441 gcaggccttg gagccatatc ctgcaaatga gagttctaaa cttgtctctg agaaggtgga 1501 tgactatgaa catgcagcaa agtacatgaa gaacagtcaa gtaaggttac ctttgggaac 1561 cttaagtaaa tcagaatggg aagctacaag tatttacttg gtgtttgcct ttgagaaaca 1621 gcagtgagca cataggcagt agtcccagag gggccgtgtt ctgtcctgca caaatttgaa 1681 caactcatct cgatatattt gatatttctc tgtctgttga ttttaattct aaatgtgcag 1741 gatgctgcca gaaactccaa tgtagaaatt caacatttgc tgtctgtgac agatgaactt 1801 ttgcatgtgt atataagaat gagttgggac ctctgtcttt aaaaatctat ttttaggtaa 1861 tgttctaaga attccatttg cctctatgat cttagctcat aaaaatataa tatgacttga 1921 taaagcaact aaactcttcc cacagtgttc agatttgtcc tgtgtgtgtt tacagtattc 1981 aatttattgc agttatagaa ttggtcagag agcattttca tagtgtgctc atttctatgg 2041 ttttgttata tagcattttt caacatttaa tggtctgtac agttgaatgt aagtgttcaa 2101 tatgtattgc tgaagttata agtttaaaac tcaatttcag atgctcataa aagttactta 2161 gctaaaattt tagcaattta ttgcattttg aaataatcat taacatgctg caattcagga 2221 gctggttaga acattttaag tggcagcata gaattttgga attttggggc tttcttttca 2281 gaaattgcta ccatagtaat taatgtttcc agttatcaag attgtgatta gacacattta 2341 cctttcttca ttgaacaaat ggtgccatag ttatttttct caaaatttag tgaaaatccc 2401 tcccatgtag acatgttgca cattttttcc aaatttatac atggaactgc agtaggaata 2461 ttctcaccat ctgatgccat gtacccactt cagaaataag caatacttgt tcctctgtta 2521 caacctcagc actttgcacc gtaggagcca ttgttaaagt tgtcacttgt gtaactgact 2581 gcttttccaa aactggtact tacgtgaact gttgtccttg ctttacacca ccatttggaa 2641 aacttaccag tttttagatg tagatgtagt gaaaaacttc aagaatgaag cagagcaatt 2701 gagtattctt ttttaaatta ttaagccatg atttacaaaa acattacttt ctgtaattca 2761 caatacttgt tttaaaaaca tagtgtcttc attagtgtgc atctattaac tgttcatggt 2821 gttagagttg caaacttttt agcaagaaaa tatggatttc ctcatttcag ttccttctgc 2881 agcctgtgaa tctccacaaa gtgttaccag tttacaaaaa taagtctttt tgccttaagt 2941 cattttggaa ataagtaata ctgcatctga ctctggtggc tgtattagct aggaaaggtt 3001 tgtaaatggt gtcagtgagg tggggaaagg aagtcttcct gtcacatatg caggttcgtt 3061 ttcattctag ggcagtgcca ggaagtatat tgatagcttt gtaggtacag gaaaaacatc 3121 atcattattt cctctgttca catttactgg tcttaattaa caggtaataa taatacatgt 3181 acttttagcc tgaaacctct tccacgccat gggtaacttg ggggagagaa gaatcctcca 3241 aacgatggag tagccagtgg taatacaaag cagggagaac agaaaggtag agttactaag 3301 gccttcagtg aacagaaagg agcagagagc aagattagat ctgagaagat gctctgggga 3361 ctgagcccac tgttgttggt gtcagggagg cttactggag ccacacctgc aggcgctgtg 3421 ttcaggcacc accttcctcc ttgagctttg cctgtctctt gccttatcag ttcttcctcc 3481 accaccctac acccccctcc ccccggcccc aagcccctgt gtctccttgt tacaattaag 3541 tttctggata ttgacttaag aactgttagg aagaggacta gaaaaggctt cccctgccta 3601 tcctctccga tcaccaaggt ggaagggagc tagtaggact cttccttgac acaccttgta 3661 gtctaaatgt tcggtattct attcaggact tacggtaact attatgaggg aggcatggct 3721 ttccaccgtc gggccaggaa gagcacctgt tgctgcaagc tcagtgaagt ggggcactcc 3781 cagacctgcc atgcagttta tcctctgaga atggaattgg aaatgaagac ctaaccagct 3841 attggtggga atgacggaac tggggattgc gatgattgat ctgggaacat ggctggattg 3901 tgatttaacc aagaatgctg atgttgaatt ctttgggcct agaatatact tgagaaagca 3961 ctagtggctt gtgttcaggg agaggagctg gcagttttta accacttctg tgggagccgt 4021 gttctaacct gtggaaagta ttgcaattct gtgagagtga ctctgcagag tcactgcacc 4081 atcaggcttg gccctgctgt gccttcagta cccagccagg ttctctgggt ccagggtgac 4141 tctccaaaga aattggcctt cagctggaga aaacattggg tggagactct cacttatgtt 4201 aatgcaatct tgaaatgact gaaaggtaga ttgccagcac agtgagtttc ccagcctgct 4261 cccccatcca cacttggaaa ttgagggagc atgaccgctc tctgacttca catgttaata 4321 gaggatcaga gcagagttgg gagtatattg gtcaggatca tagaaaagaa gacaaagctt 4381 gctcagccat gagatggcca ggtatccagt tttgttaact ctctttggga atttcttttt 4441 cagcctgttt tttagcttag tgccattatg tcattttgat ttgtattcaa gtactcttca 4501 agtatctttg taatgaaggt ttggctactt gtataggtct gcctgcaggg tgaaaatgcc 4561 agtgtgaata ttctagctac caaacattgt tttttgttga aaaactgact ttctgttgtc 4621 tacctcaggc cttgtgcatt tgggttatct caagccagtc accacagagg gtctctaggg 4681 tctgcaaaat agaggccaaa tccagggacc aggccctaat aatagaagtt gtaccaaaat 4741 gcctgtggta cttgatggcc tgttggtcaa ataggaagta caagtgtgtg atgttagaac 4801 ctccctagtt gctgctatat caaacactgc acacttgaca agtgttttca ttccccgccc 4861 tctgacagaa caacattcct aattctttga aggcaaccag tgcaaaggct actacacttg 4921 tgtaatgata tttagcagat gcatacagga ctggatccca gggcactgtc agttcttccc 4981 tccctctcgt gctgctcagt ttgtcctctg ctcccatgta ggcctaaagt caccccactc 5041 cttagtgcct gcacctcacc acgatattga ggaagcacag gacatccaag ggtactctcc 5101 agtttggctg tggagacttg agcaagccct aaagtcccct gctccctgga cttctcctgg 5161 gttgtgcttt tttgggggca acataccttg gacaaagctg agctgacagc taatgtttat 5221 tagcctccta cctaagtcag tcactttgct aaattcttca cctgtggtaa ctcattttga 5281 ttgttataac atctcagcac cattattccc attttaatga tgaggaaact gagtcataga 5341 ggatacagac ttgcccaaag gagagccagg attttcacac cccctcctca agctgggcct 5401 gccctccaag tgcttgttat atacctccac gggtgtcagc ccagatgact cctcacccat 5461 ccaccacctg cagcttgaga tgttaactat tagagctcca ttcttttggt tcaaaacgtt 5521 gatacttact tagatgttcc ctgagaggag tgtttatttc tgagtaaggg gctttgttga 5581 aagagggggt tagagagagc aagacagcac ttgagtgcac tggcaggaag cagagataag 5641 acttgaattt cagtttggta gaccaccttc tttagcagcc caacctgtag caaatctagt 5701 ttagcctgca tggcagggag agggattctc ttcccaccct caccatttgc aagtggcagg 5761 agctgagaat gccagtacga gagtgtagcc aaagtgagag gctgagagca aaggagacat 5821 ttttttcagt tttgagtcga gtatccagac agaggcaaat cattttgttt aactttttat 5881 taaagtgtaa ctatagaaac acatcaatga tttttcacaa gtggagcacg tgcatacaat 5941 cggcacccca gaagcccccc gtcagattcc cttccagtta actacctctc caagggaaac 6001 cactatcctg agttctaagc gcatagatta gttctgtctg gtttggggag atatataaat 6061 ggaattatgc attcttcgta tctggtttct tttcaccaat attatgtttg tgagattttt 6121 gttgcatgta tttgtacatg gattttcatt ctcatggttg tataatattt cattgtgtga 6181 ataaaccaca tactgtttat ctg // LOCUS AB007860 5711 bp mRNA PRI 05-DEC-1997 DEFINITION Homo sapiens KIAA0400 mRNA, complete cds. ACCESSION AB007860 NID g2662080 KEYWORDS KIAA0400. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG1091. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ishikawa,K., Nagase,T., Nakajima,D., Seki,N., Ohira,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VIII. The complete sequences of 77 new cDNA clones from brain which can code for large proteins in vitro JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 5711) AUTHORS Ohara,O. TITLE Direct Submission JOURNAL Submitted (06-OCT-1997) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, DNA Technology; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1..5711 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG1091" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 341..3361 /gene="KIAA0400" CDS 341..3361 /gene="KIAA0400" /codon_start=1 /db_xref="PID:d1024577" /db_xref="PID:g2662081" /translation="MPDQISVSEFVAETHEDYKAPTASSFTTRTAQCRNTVAAIEEAL DVDRMVLYKMKKSVKAINSSGLAHVENEEQYTQALEKFGGNCVCRDDPDLGSAFLKFS VFTKELTALFKNLIQNMNNIISFPLDSLLKGDLKGVKGDLKKPFDKAWKDYETKITKI EKEKKEHAKLHGMIRTEISGAEIAEEMEKERRFFQLQMCEYLLKVNEIKIKKGVDLLQ NLIKYFHAQCNFFQDGLKAVESLKPSIETLSTDLHTIKQAQDEERRQLIQLRDILKSA LQVEQKEDSQIRQSTAYSLHQPQGNKEHGTERNGSLYKKSDGIRKVWQKRKCSVKNGF LTISHGTANRPPAKLNLLTCQVKTNPEEKKCFDLISHDRTYHFQAEDEQECQIWMSVL QNSKEEALNNAFKGDDNTGENNIVQELTKEIISEVQRMTGNDVCCDCGAPDPTWLSTN LGILTCIECSGIHRELGVHYSRMQSLTLDVLGTSELLLAKNIGNAGFNEIMECCLPAE DSVKPNPGSDMNARKDYITAKYIERRYARKKHADNAAKLHSLCEAVKTRDIFGLLQAY ADGVDLTEKIPLANGHEPDETALHLAVRSVDRTSLHIVDFLVQNSGNLDKQTGKGSTA LHYCCLTDNAECLKLLLRGKASIEIANESGETPLDIAKRLKHEHCEELLTQALSGRFN SHVHVEYEWRLLHEDLDESDDDMDEKLQPSPNRREDRPISFYQLGSNQLQSNAVSLAR DAANLAKEKQRAFMPSILQNETYGALLSGSPPPAQPAAPSTTSAPPLPPRNVGKVQTA SSANTLWKTNSVSVDGGSRQRSSSDPPAVHPPLPPLRVTSTNPLTPTPPPPVAKTPSV MEALSQPSKPAPPGISQIRPPPLPPQPPSRLPQKKPAPGADKSTPLTNKGQPRGPVDL SATEALGPLSNAMVLQPPAPMPRKSQATKLKPKRVKALYNCVADNPDELTFSEGDVII VDGEEDQEWWIGHIDGDPGRKGAFPVSFVHFIAD" BASE COUNT 1569 a 1337 c 1332 g 1473 t ORIGIN 1 cgcccctctc cgcgggaggc gtcggggccc gcggctgggt ggcgggtagt tcccgccggc 61 tgtgcgcgcc cgcctcgggg ctcgctctga gaggacgcgg cggcagcgga ctcggagccc 121 tcggcgcgca ggcgggcgga ccggccgagc tgcgcggggc tgcgcgccgc ccctgctccg 181 ccgccaggcc ccgcgcggct cccgcgcccg gcgctcccct ttgtccgcgg gccggagcgg 241 cggcggcagc ggcggtgtcc gagcggcggt cggagcctgc tgcggcagtt gaggcggcgg 301 cgcccctgcg gctgtgcgcc agcgccctcg cgccgaggcg atgccggacc agatctccgt 361 gtcggaattc gtggccgaga cccatgagga ctacaaggcg cccacggcct ccagcttcac 421 cacccgcacg gcgcagtgcc ggaacactgt ggcggccatc gaggaggctt tggacgtgga 481 ccggatggtt ctttacaaaa tgaagaaatc cgtgaaagca atcaacagct ctgggctggc 541 tcacgtggaa aatgaagagc agtacaccca ggctctggag aagtttggcg gcaactgtgt 601 atgcagagat gacccagatt taggaagtgc gttcctgaag ttctcagtgt ttacaaagga 661 gttgacagca cttttcaaaa acctgattca gaatatgaac aacataatct ccttcccttt 721 ggacagtttg ctgaaggggg acctgaaagg agtgaaaggg gatctgaaaa agccttttga 781 taaagcttgg aaggactatg aaacaaaaat aaccaagata gaaaaggaga aaaaggaaca 841 cgccaagctc catgggatga ttcggactga aataagcgga gcggaaattg ccgaagagat 901 ggaaaaggag aggcgcttct tccagctaca gatgtgcgag tatctgctga aggtcaacga 961 aatcaagatt aaaaagggag tagatttact tcagaatctg atcaaatact ttcatgccca 1021 atgcaatttt tttcaggatg gactcaaagc cgtggaaagc ctcaaacctt ccattgaaac 1081 gctgtctacg gatcttcaca cgatcaaaca ggcccaggat gaagaaagaa ggcagttgat 1141 acagcttcga gatattttga aatccgcatt gcaggttgaa cagaaagagg actcccaaat 1201 tcgtcagagc acagcttata gcttacatca gcctcaggga aacaaggaac atgggaccga 1261 gcggaacggc agcctctaca agaagagtga cgggatccga aaagtgtggc agaaaaggaa 1321 atgttcagtt aaaaatggtt ttctgaccat atcccatggt accgctaacc ggcctcctgc 1381 aaagctcaac ctgctaacct gccaggtgaa gaccaaccct gaggagaaga agtgctttga 1441 ccttatttca catgacagaa cttaccactt tcaagctgaa gatgaacagg aatgtcaaat 1501 atggatgtct gtgctgcaaa atagcaaaga agaagcttta aacaatgcat ttaaggggga 1561 tgacaatact ggagaaaata acatcgtcca agaactgaca aaggagatca tctcagaagt 1621 gcagaggatg acgggcaatg acgtctgctg tgactgtggg gcgccagatc ctacatggct 1681 ttccaccaac ctgggcatcc tgacctgcat cgagtgttcc ggaatccacc gagagctggg 1741 ggttcattat tccaggatgc agtccctgac cttagatgta ctgggaacat ctgagctgct 1801 gctcgccaag aatattggga atgcaggctt taatgagatc atggaatgtt gcctaccagc 1861 tgaggactca gtcaaaccca acccaggcag cgacatgaat gcaagaaagg actacatcac 1921 agccaagtac atcgagagga gatacgcaag gaagaagcac gcggataacg cggcgaagct 1981 tcacagtctt tgcgaggccg tcaaaacgag agatattttt ggattgctcc aagcttatgc 2041 tgatggtgtg gatcttacgg aaaaaatccc actggccaac ggacatgagc cggatgaaac 2101 ggccctccac cttgcagtca gatccgtgga tcgaacctct cttcacattg tagacttttt 2161 agttcagaac agtgggaacc tggataaaca gacagggaaa ggcagcacag ccctgcacta 2221 ctgctgcctg accgacaatg ccgagtgcct caagttgctc ctgcggggga aggcctccat 2281 cgagatagca aacgagtcag gagagactcc gctggacatt gccaagcgcc tcaagcacga 2341 gcactgtgag gagctgctga cccaagcctt atctggaaga tttaattctc acgttcacgt 2401 tgaatatgaa tggcgactac tccacgaaga cctggatgaa agtgatgacg acatggatga 2461 gaaattgcag cccagtccca accggcggga agaccggccc atcagcttct accagctggg 2521 ctccaaccag cttcagtcta acgctgtatc tttggccaga gatgctgcaa accttgccaa 2581 ggagaagcag agggctttca tgcccagcat cttgcagaat gagacttacg gagccctcct 2641 gagtggcagc ccacctcccg cccagcctgc agcccccagc accaccagcg cccccccgct 2701 tcctccacgg aatgttggca aagttcagac agcctcctct gctaacaccc tgtggaagac 2761 aaactctgta agtgtggacg gtggaagccg gcagcgatct tcgtcagatc cgccagctgt 2821 ccatccaccg ctgccccctc ttcgcgtgac atctaccaat cccctgaccc ccacgccgcc 2881 cccacccgtt gccaagacgc ccagcgtaat ggaagccttg agccagccga gcaagcctgc 2941 cccgcctggg atctcacaga tcaggccccc acctctgccc ccacagccgc ccagccgcct 3001 cccgcagaag aagcctgcgc cgggggctga caagtccacc ccactgacca acaaaggcca 3061 accgagagga cctgtggatc tctctgcaac ggaagctctg ggtcctctgt ccaatgctat 3121 ggtcctgcag ccccctgcac ccatgcctag gaagtcgcag gcaaccaagt tgaagcctaa 3181 gcgggtgaaa gcgctctata actgtgtggc tgacaacccc gatgagctca ccttctccga 3241 gggggatgtg atcatcgtgg acggggagga ggaccaggag tggtggattg gccacattga 3301 tggagatcct ggtcgcaaag gcgcattccc ggtgtcattt gtgcacttta tcgctgactg 3361 aattgctact gaacaaaagc attaacagtt atgttcctgt ttcgttattg gtaccaaaac 3421 tcttgccaga taaccagttt catgaactgt ttgtatggca gcccatgttc tctaatgcca 3481 ctgctctgtt ttaaaaactc agaggcaatt tttacatatc agtaattgtt tttataattt 3541 gtggttttca tgaaacattg ctatgcattt attaggaaaa actgaatttc ccaacaggtg 3601 aactgaaaag ttattttaac tattatacat aatcaagatc ctgcctctac ggaattagct 3661 aaacctaaaa atgtttgcat taatgaataa attcttcctg cattccttgg cccagttctg 3721 gagttggtga cctttatcac aattattatt ttaggcggcc agtgaactgc tgcttcagaa 3781 gtccatagcc cagctctgaa ctttctcgat aaaatgccat cagttcacct ttaaagacac 3841 acattccttt gaaatccacc cagtgtttaa aaagcaactt ggaaatttac acattagcat 3901 tgtactttct agccctaatt tgtgaggttg cagctatcat tatattctgc atgtatgtat 3961 aacctgttgt gaacaatcat acttaacaaa actactgatg gtttatgaca acgtagggta 4021 actacagttc attctgttcc aggttatata aaactgcatt tcctgaattt ggttaaaaac 4081 taaggatgat ggattgcaaa acagttcttt taaattagtt tatatgcttt aggtgttttg 4141 gaatttgcct tcttgaactt cctgagtcac acagaaagca actgtacaca gtagaattct 4201 gtggcgcaga ccatgctgta ttaacacatc acttgctgtt tcctactgag tgtaccactg 4261 ccttcccttc tagcccagga gaatgtttac tcagtttagt gtcttgtatt tctataatac 4321 tccaacagga atggtagtca cactgtcttg aaattgaatc tgtccatctg tttataatca 4381 agaacatatc agaaatatat aggtcccagg taatactccc aaacatccca ctttttactg 4441 tttcaggcca tcatatcatt cttaagctac ttggggtggt agtagaggat taggttgtct 4501 attataaaac caaaactcat tcgtttaatg aacttgactg tcatacctct atttagtaat 4561 tgcgagggta agattcatag taggaatatt ggaaattttg gcactctgag aataaatagg 4621 catatgatac ccacttggac ttttaacaaa agtaaaggaa taaatttgca tataggcttg 4681 gaaagtgagg cagcaatgct gttaactgca tttgttgtga tggtgcattt gattgaagca 4741 gcttgtcttt attatgcaag actgtgtaga gttttttttt tttttggcat tgtacttttt 4801 gtttttgtta taaaggaaga cagaacaaac tggaatgttt tatgatgttg tatagcaatc 4861 gctttttacc tttcaaagtt ccgggtaaaa atgtgttata tctgtagttt tttgtttttg 4921 ttttttttta aagcactaca tctgttttca ctaattgtta atttctgttt gaacccttca 4981 tttaattttc tcatagattt aagtaaacag atgtattttg cacagtgcac ttatgtctat 5041 tttaacaatc ctcctgcatc tgtattttat agtcagcctt ttgaccacct ggtgccagct 5101 atataaggaa taaagttgat tcatatcaac attagaactc cagtcccaaa ctaatctgtc 5161 aggttcactg gtacataaat acctaggaaa tatttttcca gtctacaatt tggtgctatt 5221 gtgcagtaac taatagtact cttaccagag gagaaattat attaacgacc ctgctaatat 5281 cctttcttag ttatttgctc cttgcaaatt aaaaaagcaa ctaagagaaa gaaaaacatt 5341 gtagatatct atttatattt aaagtttatg tttcatgaac tgcagctgca ggattctggc 5401 attttgcatg ccattctcca tcagatctgg gatgatggct cagaacatgt acacagacta 5461 agagtaactg tgtgatctgt taaggggtgg ataacataat atgcagctta ggatgctatt 5521 ttgagatgta tgatattcag ttcattcacc tgattacttt ggttgcagca caactgtata 5581 tattgtataa ccgaaattga ttattttcat tgtccttaat gcagtgattt ataattagag 5641 catgtttaat aagtttactc ttcttgttaa ctagtcattt gactggaaaa aaataaaata 5701 cttttaaatg g // LOCUS AB007865 7527 bp mRNA PRI 05-DEC-1997 DEFINITION Homo sapiens KIAA0405 mRNA, complete cds. ACCESSION AB007865 NID g2662090 KEYWORDS KIAA0405. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG1274. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ishikawa,K., Nagase,T., Nakajima,D., Seki,N., Ohira,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VIII. The complete sequences of 77 new cDNA clones from brain which can code for large proteins in vitro JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 7527) AUTHORS Ohara,O. TITLE Direct Submission JOURNAL Submitted (06-OCT-1997) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, DNA Technology; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1..7527 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG1274" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 1125..3107 /gene="KIAA0405" CDS 1125..3107 /gene="KIAA0405" /codon_start=1 /db_xref="PID:d1024582" /db_xref="PID:g2662091" /translation="MGLQTTKWPSHGAFFLKSWLIISLGLYSQVSKLLACPSVCRCDR NFVYCNERSLTSVPLGIPEGVTVLYLHNNQINNAGFPAELHNVQSVHTVYLYGNQLDE FPMNLPKNVRVLHLQENNIQTISRAALAQLLKLEELHLDDNSISTVGVEDGAFREAIS LKLLFLSKNHLSSVPVGLPVDLQELRVDENRIAVISDMAFQNLTSLERLIVDGNLLTN KGIAEGTFSHLTKLKEFSIVRNSLSHPPPDLPGTHLIRLYLQDNQINHIPLTAFSNLR KLERLDISNNQLRMLTQGVFDNLSNLKQLTARNNPWFCDCSIKWVTEWLKYIPSSLNV RGFMCQGPEQVRGMAVRELNMNLLSCPTTTPGLPLFTPAPSTASPTTQPPTLSIPNPS RSYTPPTPTTSKLPTIPDWDGRERVTPPISERIQLSIHFVNDTSIQVSWLSLFTVMAY KLTWVKMGHSLVGGIVQERIVSGEKQHLSLVNLEPRSTYRICLVPLDAFNYRAVEDTI CSEATTHASYLNNGSNTASSHEQTTSHSMGSPFLLAGLIGGAVIFVLVVLLSVFCWHM HKKGRYTSQKWKYNRGRRKDDYCEAGTKKDNSILEMTETSFQIVSLNNDQLLKGDFRL QPIYTPNGGINYTDCHIPNNMRYCNSSVPDLEHCHT" BASE COUNT 2236 a 1588 c 1522 g 2181 t ORIGIN 1 gagagagaga agattatatt cttatataaa tgaatcttgc ctcttcctgt tttctattcc 61 attcatttat ctaccataag ggattccact gtgatgaaaa aaaaagtatc ttgtttcctc 121 agaaagacat ttagaaatat gtaaacattt tgctttcatc tttatattta tggtaattct 181 aaggatatta taaacttgca gtggtaattc tgaagtataa tatatattca gggtcacaaa 241 acagggcggc aaggatgata actctattag catttaaaaa gcataaaaat actaataagc 301 acaaattttc ctactatagt aactgtagcc tgaagttaag aataatagca tagtgattgt 361 aattaataat aatatatatt tgaaaattga tgagagatta aaactcaaat gttcttatca 421 aaaaatgata agtatgtaac gtattggatg tgttaattag ctttaattat ttcacaatgt 481 atacacatat caaaatgtca cattgcacac cataaatata tacacttttt atttgtcaat 541 tataccctaa taaagcttga ggaccaggag aggaaaaata agaatataca aaaatggaac 601 cagcatgata gttcacgtct gtaatcccag cgctctgtga gaccgaggca gaagaatcat 661 ttagcccagg agttccagac cagcccgggc gacacagcaa gatcccatac ctactacacg 721 ttaaaaagtt agccaggcat ggcattaaaa aattagccag gcatggtggc acacttcctc 781 ctgtagtccc agcctgtcgg gaggctgaag caggaggacc actcgagccc aagagtccaa 841 ggctgcagtg aaccataact gtgccgctgc actccagcct aagcaacaga gcaagatcct 901 gtctcaaaaa aaaaaaaagg aagaatatta ttgaaaaatg aggtctgcaa gaaatggcat 961 atcacagtgg ctagaagaca cagtgatacg ccctcaggac gttccctcta gctggagttc 1021 tggacttcaa cagaacccca tccagtcatt ttgattttgc tgtttatttt tttttttctt 1081 tttctttttc ccaccacatt gtattttatt tccgtacttc agaaatgggc ctacagacca 1141 caaagtggcc cagccatggg gcttttttcc tgaagtcttg gcttatcatt tccctggggc 1201 tctactcaca ggtgtccaaa ctcctggcct gccctagtgt gtgccgctgc gacaggaact 1261 ttgtctactg taatgagcga agcttgacct cagtgcctct tgggatcccg gagggcgtaa 1321 ccgtactcta cctccacaac aaccaaatta ataatgctgg atttcctgca gaactgcaca 1381 atgtacagtc ggtgcacacg gtctacctgt atggcaacca actggacgaa ttccccatga 1441 accttcccaa gaatgtcaga gttctccatt tgcaggaaaa caatattcag accatttcac 1501 gggctgctct tgcccagctc ttgaagcttg aagagctgca cctggatgac aactccatat 1561 ccacagtggg ggtggaagac ggggccttcc gggaggctat tagcctcaaa ttgttgtttt 1621 tgtctaagaa tcacctgagc agtgtgcctg ttgggcttcc tgtggacttg caagagctga 1681 gagtggatga aaatcgaatt gctgtcatat ccgacatggc cttccagaat ctcacgagct 1741 tggagcgtct tattgtggac gggaacctcc tgaccaacaa gggtatcgcc gagggcacct 1801 tcagccatct caccaagctc aaggaatttt caattgtacg taattcgctg tcccaccctc 1861 ctcccgatct cccaggtacg catctgatca ggctctattt gcaggacaac cagataaacc 1921 acattccttt gacagccttc tcaaatctgc gtaagctgga acggctggat atatccaaca 1981 accaactgcg gatgctgact caaggggttt ttgataatct ctccaacctg aagcagctca 2041 ctgctcggaa taacccttgg ttttgtgact gcagtattaa atgggtcaca gaatggctca 2101 aatatatccc ttcatctctc aacgtgcggg gtttcatgtg ccaaggtcct gaacaagtcc 2161 gggggatggc cgtcagggaa ttaaatatga atcttttgtc ctgtcccacc acgacccccg 2221 gcctgcctct cttcacccca gccccaagta cagcttctcc gaccactcag cctcccaccc 2281 tctctattcc aaaccctagc agaagctaca cgcctccaac tcctaccaca tcgaaacttc 2341 ccacgattcc tgactgggat ggcagagaaa gagtgacccc acctatttct gaacggatcc 2401 agctctctat ccattttgtg aatgatactt ccattcaagt cagctggctc tctctcttca 2461 ccgtgatggc atacaaactc acatgggtga aaatgggcca cagtttagta gggggcatcg 2521 ttcaggagcg catagtcagc ggtgagaagc aacacctgag cctggttaac ttagagcccc 2581 gatccaccta tcggatttgt ttagtgccac tggatgcttt taactaccgc gcggtagaag 2641 acaccatttg ttcagaggcc accacccatg cctcctatct gaacaacggc agcaacacag 2701 cgtccagcca tgagcagacg acgtcccaca gcatgggctc cccctttctg ctggcgggct 2761 tgatcggggg cgcggtgata tttgtgctgg tggtcttgct cagcgtcttt tgctggcata 2821 tgcacaaaaa ggggcgctac acctcccaga agtggaaata caaccggggc cggcggaaag 2881 atgattattg cgaggcaggc accaagaagg acaactccat cctggagatg acagaaacca 2941 gttttcagat cgtctcctta aataacgatc aactccttaa aggagatttc agactgcagc 3001 ccatttacac cccaaatggg ggcattaatt acacagactg ccatatcccc aacaacatgc 3061 gatactgcaa cagcagcgtg ccagacctgg agcactgcca tacgtgacag ccagaggccc 3121 agcgttatca aggcggacaa ttagactctt gagaacacac tcgtgtgtgc acataaagac 3181 acgcagatta catttgataa atgttacaca gatgcatttg tgcatttgaa tactctgtaa 3241 tttatacggt gtactatata atgggattta aaaaaagtgc tatcttttct atttcaagtt 3301 aattacaaac agttttgtaa ctctttgctt tttaaatctt aaaaaaaaaa aagttgctga 3361 agtactgtac agggttgtac aatgagaacc caatgccaag gcaaaaagaa cgagtgattt 3421 ttccttagga tacacatcaa ccactttgct gttgaagctg tcagaataaa ttcctggtgg 3481 tcagatgaaa gggcagatta aatggactca tcagggtaag aggaataata tgggtaaaac 3541 aagaaatggc ccgatagttt cacactattc ctatacctcc aggtccggaa gacaggtaaa 3601 aaaattctat aatgtaagaa tggaggtagt taccctgatt tgaccctgtg tgggaaatgc 3661 tgaaagcacc aggaggaagc cggttcccgt gagataagtt aacccggcct gacagaatca 3721 agaaaattga gatgagattt gaaaggaccc gaaaatgcag gggttggctt tctgactggg 3781 aacttaaaaa tcactcttca tgcttccctg gtcctatgtg ataacagagt tagagacttg 3841 agtctgattt cagtcatctt cagggaccag tctgatgttg tagcaagaag actcccttta 3901 aaagtgttac tgttcaaatc atatatcagg ttgaatcaca ttcaacagag atatattcta 3961 gaatactttt ttagaagagg ctaataaagg gaagaattat attgaatgga attatttttg 4021 ataatgagaa ttatttgggt agattcactg aggctatgtc aacatgatat ttagaccaac 4081 aggtgatcaa tgtttggaaa atacaacaat gacttattta aaaattaccc ttcctgctat 4141 ttagacaaaa acaactgatc agtggttctg ttatgtcagc tgactttgtt agtatcatgt 4201 tgaaatagct tgaagtaata tcttttatcc ccttgcaaat tcttgtcttc caatcatctc 4261 ccatatattt tcataattag ttgtttatga cacctttgtt tttctccctc tgttcagtat 4321 ttcaaggaaa attatggatg ccagtcttgg ctgcacaaga tatccattac gtacttatac 4381 attttaaaat gagtactaat tttcactgct aataattctg taaggacaca tcaaagctgg 4441 ccaaaataat gaattttttt taaaaagcaa tacctggttt ccaccttgga ctgactttga 4501 tcctgttcca cttttgaaat tttatttgtt ccttttccat cgtggatgtt cctctacttt 4561 ggcaattgtg gagggctaat caatcttatg ttagcaggac aacccatgaa aaacaagtca 4621 gagagtgaag gctttttccc ctaatcctgg cagagcaggg cgtagaaaag agaggatgtc 4681 catgcttaat tctagatact tttgagacaa caccttcaga aaacacataa tttaatcttt 4741 gccatcctta gatagagaag ggctatagat cacatacgtt atcaaaaact actcccttgg 4801 aaaaaatatc tttcgaaaat caaatttaaa catttcactc tgtgctgcat atttctttta 4861 ccattgacca ttattatagg gacccatgaa gtaaatgtca caatcatctt actagctctc 4921 tctctctcag caaaataaaa ctgtgatgtg tcttctttgt aaaagtttag gtataaaatg 4981 ctatgcaact ttttctttat agtaagagct ttattttctt taataatata gcccaactta 5041 tatgttttaa tctcccttgt ccctcagaat aagcagaaaa tataactgca gctgtatgtc 5101 gtagacacaa gaattgaaac tgtgcagtca gtagagctga acttagctat gaattaattt 5161 aaattaattc taaagtgact ctggtttcca ttagtctatt agctagaggg ttttggctgt 5221 tgcttttttt aaagttaggt ccacacagtg aagggaaaag agtctgtgaa ggtgatcagt 5281 gtagcagtaa gacatctaaa atcaagacaa tgacatgggg gctttgtgta cttagctgga 5341 taattccctt ctactgtcct cttccccttg gctgtgtaga taaaattgtg cattcaaatg 5401 atggtacttt gacttttgag ggtttttatt tctgttattc acaaaatact cattctcatt 5461 tatgtatatt gtatgtttaa cccccagtgg gatttctggc tgctgaaacc actttgggcc 5521 aggaagaaca aggatgaaga ggttcctgtc atttcttatg gggttcacag aattatttgg 5581 ggcttaaatg gtacaatgga gacagtcatg tgcaatgctt aagatggtct gacagggtcc 5641 cttttgctgg aggtgttcct gaagagattg aacctagtaa caggctttat tttcaccttg 5701 tgtacaacat ggcaaagact gctaagatta aaatccgtct cccatttgtt acagcctcac 5761 ctgcaccagg atagaaagca cgtgatggaa tctgtgatgt ctaatgtgtc ttatgaaaat 5821 tgccaaacaa ctgtccctgg ggattctgct ttgattggca tttttagtca tgggaatgta 5881 tatttgctga tatatctgct ctgtgtttgg gcctctcttg ctgtcattat gatgtatttt 5941 gagatgatta gtcaagagtc aaggttgcga gtacaggcca agaccatggg aaaaaagcca 6001 tgctcactgg ccaataaaga gcttgatgct gcctggccaa atgaggtgac tcagatagaa 6061 tctgatccca ttcagagctt ggtaaatgtc actgtacaga agacattgaa aaggaagagg 6121 catagtcatg atcaaaagga tatattgaca gttatctata gccagggtag ttgttaatac 6181 ctgcttgttt ggggagatat gtttgattat agagagactc tgggccaccc tcaaacaccg 6241 tcagatgcat tagcagccca ctgcatggga caatcgcagg cagatacgag gaatgtatcc 6301 cctgcctatt ttccttgtgt aacaaatgga aaacattctc ccctgtgaga aataatgcaa 6361 tttctaatta tctggatgtt cgttgaaaat atattagaca ttctccctga ggttaaaaac 6421 aaaaagtacg tgaccagtct ggtaagaagt attaatgaag tagctaatat tacagcttca 6481 ttttctacta gcacctatca taatggtctt agtcatttca cacaaatcag aacttccttc 6541 cccaccaggg aggacaacat cttcatgctg tgattgaagc atccattcag aacacgaggc 6601 aatattgcag tccacaggga atggatgctt cacttgatct ccggaccttg gctgcagagg 6661 ccatcgcagc ttttgaaaag tgaaggggtt aattcccatt ggtgtctttg cttatagcat 6721 ttttctctaa cctataacaa ggagacatta cattttactt tagaacatga gaatagcagt 6781 tttgctcatg acttaccatt ccagctgcat gggaaagcaa agcagaaaac agtgccccaa 6841 atggaaaaaa gatactcaca cagaacaaaa cagttcttgg tcttgttctt ggtcttgtca 6901 aaccttgcct gatgctcttt ctaaagtcaa aatatgaatg ctaagaaggc ataacctaca 6961 tccttctctg atttcttcag cagggtcaaa agacagttac tagcaatggg gaatgcttgt 7021 cactgtggag aaagagtttt gtatatgtct gataccgttg ttataacaaa acaaattttt 7081 ttactatagt tttttgtttt ctacctgcac acccaccaga agagcacaaa gcaaggccat 7141 tgcaacaggc atttaaaaat tattatcaaa catgcacatg cttgtacaca cacacacaca 7201 cacacacaaa caggggcatt tgtaaaggtg tccctggaat gtaagattta taatgtttaa 7261 ggcaaggtga aggcattgcc aagtgtgtgt cgctcatagg actagtgtat attcactgaa 7321 agttaacctg atgatttgtt attgtttgaa ccatatgctg atttgcttct ggtttctgtt 7381 tagtgtgttc tctctgataa ggggctgaaa gattctgcat cacacatcct ctgagaccta 7441 ccatgtcgca cactttgtta atgacaaact tcactctaca ctatacagta ccttgttgat 7501 atattcagta aagtcttatt ttaaaag // LOCUS AB007866 7323 bp mRNA PRI 05-DEC-1997 DEFINITION Homo sapiens KIAA0406 mRNA, complete cds. ACCESSION AB007866 NID g2662092 KEYWORDS KIAA0406. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG1335. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ishikawa,K., Nagase,T., Nakajima,D., Seki,N., Ohira,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VIII. The complete sequences of 77 new cDNA clones from brain which can code for large proteins in vitro JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 7323) AUTHORS Ohara,O. TITLE Direct Submission JOURNAL Submitted (06-OCT-1997) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, DNA Technology; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1..7323 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG1335" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 196..2595 /gene="KIAA0406" CDS 196..2595 /gene="KIAA0406" /codon_start=1 /db_xref="PID:d1024583" /db_xref="PID:g2662093" /translation="MAVFDTPEEAFGVLRPVCVQLTKTQTVENVEHLQTRLQAVSDSA LQELQQYILFPLRFTLKTPGPKRERLIQSVVECLTFVLSSTCVKEQELLQELFSELSA CLYSPSSQKPAAVSEELKLAVIQGLSTLMHSAYGDIILTFYEPSILPRLGFAVSLLLG LAEQEKSKQIKIAALKCLQVLLLQCDCQDHPRSLDELEQKQLGDLFASFLPGISTALT RLITGDFKQGHSIVVSSLKIFYKTVSFIMADEQLKRISKVQAKPAVEHRVAELMVYRE ADWVKKTGDKLTILIKKIIECVSVHPHWKVRLELVELVEDLLLKCSQSLVECAGPLLK ALVGLVNDESPEIQAQCNKVLRHFADQKVVVGNKALADILSESLHSLATSLPRLMNSQ DDQGKFSTLSLLLGYLKLLGPKINFVLNSVAHLQRLSKALIQVLELDVADIKIVEERR WNSDDLNASPKTSATQPWNRIQRRYFRFFTDERIFMLLRQVCQLLGYYGNLYLLVDHF MELYHQSVVYRKQAAMILNELVTGAAGLEVEDLHEKHIKTNPEELREIVTSILEEYTS QENWYLVTCLETEEMGEELMMEHPGLQAITSGEHTCQVTSFLAFSKPSPTICSMNSNI WQICIQLEGIGQFAYALGKDFCLLLMSALYPVLEKAGDQTLLISQVATSTMMDVCRAC GYDSLQHLINQNSDYLVNGISLNLRHLALHPHTPKVLEVMLRNSDANLLPLVADVVQD VLATLDQFYDKRAASFVSVLHALMAALAQWFPDTGNLGAPPRAKFRRRGKSFEPKTSS S" BASE COUNT 1931 a 1700 c 1576 g 2116 t ORIGIN 1 ggctggaaga cgagacctgc tcactctgtc accgaggcta gagtacagtg gcacaatcac 61 agctcattgc agcctcaacc tcccaggctc aatcgatcct tccaactgag cctccctagc 121 agctgggact attagtgcac accaccacac ctagcctgca ggatgtttcc tcaatgaggg 181 gaaggctgct gcacaatggc agtttttgat actcctgagg aggcctttgg tgtcttacgt 241 ccagtctgtg ttcagctcac aaagacccag acagtggaga atgtggagca tctgcagaca 301 cgactacaag ctgtgagtga cagtgccctt caggaacttc agcagtacat cctcttccct 361 ctgcgattta ccctgaagac cccaggtccc aaaagagagc gtttgatcca aagtgtggtg 421 gaatgcctca catttgtcct ttcttcaaca tgtgtgaaag aacaggagct tctccaggaa 481 ctcttttcag aactctctgc ttgtctgtat tcacccagct cccaaaaacc tgcggctgtg 541 tccgaggagt tgaaattggc tgtgatccag ggacttagca cattaatgca ctcagcttat 601 ggggacatca ttctgacttt ttatgagccc tccattctgc cacgtttagg atttgctgta 661 tctttactgt taggccttgc agaacaggag aaatcaaagc aaattaaaat tgctgcctta 721 aaatgtttac aggttctact cttgcagtgt gattgtcagg accatccaag gtcattggat 781 gaacttgaac aaaagcagct gggggatttg tttgcctctt ttttacctgg aatctcaact 841 gcactgacca ggcttatcac aggagacttt aaacaaggtc acagcattgt cgtatcttcc 901 ctaaagatct tttacaagac agtgagcttc attatggctg atgaacagct caaaagaatc 961 tcaaaggtcc aagcaaaacc tgcagttgag cacagagtag cagagctgat ggtttacagg 1021 gaagcagatt gggtaaaaaa gactggcgac aagttgacta tccttattaa aaagataatt 1081 gagtgtgttt ctgttcaccc acactggaag gtgagactgg aactggtaga acttgtggag 1141 gaccttcttt tgaagtgcag tcaatcattg gtcgaatgtg ctggtcccct tctgaaggcc 1201 ttagtgggac tagtaaatga tgagagtcct gaaatccaag cccagtgcaa taaagttctg 1261 agacattttg cagatcaaaa agtagtggtg ggcaacaaag ccctcgctga catcttgtca 1321 gaaagcctgc attcccttgc cacatctctt cctcgcctaa tgaactccca agatgaccag 1381 ggcaaattct ctactctttc cttgttactt ggttatctga aactcttggg cccaaaaata 1441 aactttgtcc tcaactctgt ggcccatctc cagcggcttt ccaaagcact catccaagtt 1501 ctagagctag acgtggctga catcaagatt gttgaggaac ggcgttggaa ctctgatgat 1561 ctgaatgctt ctccaaagac ctcagccaca cagccttgga accgcatcca gaggagatat 1621 ttccgcttct tcactgatga gagaatcttc atgctcttga ggcaggtttg tcagctactt 1681 ggttattatg ggaatcttta tttgcttgtg gatcacttta tggaacttta ccatcaatct 1741 gtggtttacc ggaagcaagc tgccatgatc cttaatgaac tggttacagg ggctgctggg 1801 ctggaggttg aggatcttca cgaaaaacat attaaaacaa acccagaaga actgagagag 1861 attgtgacat ctatacttga agaatacaca agtcaagaaa attggtattt ggttacctgt 1921 cttgaaactg aggaaatggg agaggagctg atgatggagc acccaggcct ccaagccatc 1981 acgtctggtg aacacacctg ccaagttaca tcttttctag ccttctcaaa gccaagtccc 2041 actatttgct ccatgaacag taacatctgg caaatatgca ttcagttgga aggaattggc 2101 cagtttgcat atgcactagg aaaagacttc tgtttgctct tgatgtcagc cctttatcca 2161 gtactggaga aggctggaga ccaaacccta ctcattagtc aggtggctac cagcaccatg 2221 atggacgttt gccgtgcttg tggctacgac tccctgcagc acctgatcaa tcaaaattca 2281 gactatttag tgaatgggat ctctttaaat ctgcgtcatc tggctctgca tcctcatacc 2341 ccaaaggtcc tggaagtcat gctgcggaac tcagatgcta acctgcttcc tttggtggca 2401 gatgtggttc aagatgtctt ggccaccctg gaccaatttt acgataagag agctgcttcc 2461 tttgtcagcg ttctgcatgc tctgatggca gcattagccc agtggttccc agacacaggt 2521 aatcttgggg cacctccaag agcaaagttt aggagaagag ggaagtcatt tgaaccaaag 2581 accagcagct cttgagaaga gcaccaccac agctgaagac atcgaacagt ttttgctgaa 2641 ctacctcaaa gagaaggatg tggcagatgg aaatgtctcg gattttgata atgaagaagg 2701 taacttgttt attttggctt agctggtttt ttctttctga aacagattat tgtaaaaatg 2761 gtgaaacatc attacagaaa gggtagaaat aagaaaagtt actcataacc ttaccaggct 2821 aacatagcta ttctcctgtt ttgtgtaatc tttttccatt cttttctgtt acaaaactgt 2881 ttttccgtgg ttgcagctgt gtaccctttt tgtgtttagt tataagcaac tctgcttatt 2941 gccgttgaat tttgatcatt aacgttttaa tagctgttac taagtgtcca tcccctgtag 3001 gacatttagg ttgcttctgg cttaataata actctaataa acagtgctgc ggtggatgat 3061 tctgtgcatg tgaccttttt tttttttatt ttttttattt ttttattttt tatttttgca 3121 tatttcttta aattcccaga agtaggattt ctgggtcaag gatatgaaca taatttaatg 3181 cttgccaaat tgcctttcaa aaaggttgtg tcaatttata cttttccttc ggcagtgcag 3241 gatgaatact ggtttcacca cagccttacc aacattggct atttccagtt ttcttcctaa 3301 attaataggt gaaaaatggg tcttgttatc taccttgcat ttctttgatt accagtgagg 3361 ttgaatgtct ttataagctt ctttcctaac aggttttttt tccttattcc cattgtctat 3421 ttatatgctt tgtccatttg tttgttggtg ggaggagatt gcagtctttt tcttaccaat 3481 ttatatgata aagaagaagg gagttcaggc tagttgaagc tctggcctgt tgttttattc 3541 acaagctcaa tctggagctt caggtcacgg aaggattaat aaattattag gatctcctct 3601 gcaaatataa aatgccaagt cataatgagc ttggtggtct cagaaccatc ctaaatcgaa 3661 ccgagtccca aattagtttg tgaggttgga ataaaacgtt tctttttctt tttcttttct 3721 tttccttttt ttgttttttt cctccttttg gtgatctcca ctgtgagatt ctggtgaact 3781 gaagccagta cttccagcag tgtaacagga aatagtagct tgatgccact cactacaaca 3841 aattccttct aaatagcaga aaaggcatca cagggcccaa ataatgattt atgcagaatt 3901 gagtcattgc tctccccgag gacagagttt tctgatcaga aatctatcag gctttttctt 3961 ctcagatttg tttctcgagc caatccagtc ctttttgtga actgcaccct tcacccaaac 4021 ctggaaatgc tgaagcaggg gaggcattct gatccctcat aatccagatt tgccattctt 4081 gtttaaaatt tgagcactgt cagtgaatcc attccttcat gattaggatc ttctggtgtt 4141 agttgatgtt cacgtagagc agactgaaag agtcaaaccc ttcttccaat aacaggaaaa 4201 tccacatccc tcaaataaga ttctgcaata ggtgaatttc aaacaacaaa ttcccctctg 4261 gggaaaaaaa gcacaggcct atcatgctta tcattcatca agtacacacc acacacttgg 4321 cattctaagg aatgtttgcc tcagtgctgg cttgacatga gttttttgtt ttaagcacat 4381 aaaagcacct tgtatatctg aggtcccttt ctccaagaaa tccaagcatt ctacaaatgc 4441 attttaattt atcttttcag catctgttag agacaggtgc agcctctact gtaagagtct 4501 gtctttccag cagagggaac tgaaatggga aaaagttaag ggagcagtgc tctgctcagt 4561 ggaaatgagc acagctggta atcagggttg gcatgaggct gggggctgta agtagatcag 4621 gaaaattttt cagggtaaga attttgacct gggtttcatg caattctcaa atcttgtggt 4681 gatgtttccg tttacaaact tacatctatc tcatgcaacg cagcactcta gacagtgaga 4741 atgataatga ctggtaatat ctcatttaat gctctcagta gcatgtgagg tatacatgct 4801 ctatcccttt ttttccagat gaggtaatgt aggatgaaag aagttgctaa taggttttag 4861 aggcaagact tggtccaacc tcctctaacc aaactccatg ccctatgaac tacaccaagc 4921 tgcccttgtt acctcaccta caaagataag taaggtttga ttctggccat cagaagttct 4981 tacaggctgg tggagaagac agatgtgcac ccccctttta atcaaaggat gaatcaagag 5041 gagacaaaaa agtgaaggac agagcaaagg ccagccatgg aacttcccag cttcaaggct 5101 cctccacacg agaaccctgc tatctctccc tttttttttt ttttttgaga cagagttttg 5161 ctgttgttgc ccaggctgga gtgcagtggc acgatctcgg ctcattgcaa cctctgcctc 5221 ctgggttcaa gcgattctcc cgcctcaccc tcccaagtag ttaagattac aggcgcccgc 5281 cactatgccc ggctaatttt tgtaatttag tagagatggg gttttgccat gttggtcagg 5341 ctggtctcaa actgctgacc tcaggtgatc cacctgcctc aacctcccaa atatcttgtt 5401 tttatcttaa atattcatta atggaaattt agaaaagacc aaaggcatag ggaaaaaaac 5461 tgaagtcatc tgttatctca ttaccctgtc acttttaata ttttgctgta cttcctctca 5521 tgcaaaaacg tatgtagtag tgttcatgct gcatatgcaa ttttgtgttg tgctttttgt 5581 ctttttaaga caatattatt ttataagcat ttctcttgtc attaaaaccc atgcttagcc 5641 tggtgtagtg gctcatacct gtaatcccag cactatgaga ggccacggcg gggggattgc 5701 ttgagcccag gaattggaga ccagcctggg caacatagtg agacccccat ttctacaaaa 5761 aaatttaaaa attagttggg catggtgaca tgcacctgta gtcctggcca ctcaggaggc 5821 tgaggtggga agatcactcg accccataag tttgagattg cagtgagccg tgttcacacc 5881 actgtactcc agcttggaca gagcaaggcc ctgtgcctaa aaaaaatagg gacctcataa 5941 aatgccatta tatggctata tcatggtttc tgtacctatt cccccttgag tggaggtgtc 6001 tcagtccatt tgtgctgctg taaccaaata cctaagactg agtaatttat aaagaacaga 6061 aatttaggta tctctgtttt atcactcata caacactgcc cccagttgtg ctctttttcc 6121 agaggaacag tcagtccctc ccaaagtgga tgagaatgac acccgtccag atgtggagcc 6181 accactgcca ttgcagatcc aaatagccat ggacgtgatg gaacgctgca tccacttgtt 6241 gtcagataaa aatctgcaaa tccgcctgaa ggtcttggat gtgctggatc tgtgtgtggt 6301 tgttcttcag tcccacaaaa accagctgct tcccttggct catcaggcct ggccctcgct 6361 cgttcaccga ctcacacggg acgcccccct ggcagtgctt agagccttca aggttttacg 6421 taccctggga agcaagtgtg gtgactttct tcgcagccgg ttctgcaaag atgtcctgcc 6481 aaagctggct ggctccctag tcacccaggc ccccatcagt gccagggctg gaccagttta 6541 ctcgcacacg ctggccttca agttgcagct ggctgtctta cagggcctgg gccccctctg 6601 tgagagactg gacctaggtg agggtgacct gaataaagtg gctgatgcct gcttgattta 6661 cctcagtgtc aaacagcccg tgaaattaca agaggctgcc aggagcgtct tcctccactt 6721 gatgaaggtg gacccagact ccacctggtt cctcctgaac gagctttact gccccgtgca 6781 gttcacacct ccccacccca gcctccaccc tgtgcagctg cacggggcca gcgggcagca 6841 gaacccctac acgaccaacg tgctccagct gctcaaggag ctgcagtgac cctgctcccc 6901 caccacagag gccaccgatc cctcccctac tgccagccag aagctgggct gaccccaccc 6961 cggccatagg cggtggcagc ggcagcagag aaggtgaatt agttagccaa tcgatttata 7021 aattgatcga tcacacaact gcttagaaat ggattgaagg aaagtagctg actattattt 7081 atatttcata ccttgtgttt tcaagtgaca ttgtctggtg gctctaaggg tttaacccct 7141 tagcctacca tctctatagc cccagctccc tcacaggcca cacacacaca cacacaagag 7201 gtcagttccc ctccatctgc atacacctcc ctgtcttcaa ataatgagat ggaactaatt 7261 tgttttacct aacctgatct ttgggaaaca aacggaaata aagacacttc ttggatgaaa 7321 agt // LOCUS AB007867 7308 bp mRNA PRI 05-DEC-1997 DEFINITION Homo sapiens KIAA0407 mRNA, complete cds. ACCESSION AB007867 NID g2662094 KEYWORDS KIAA0407. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG1339. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ishikawa,K., Nagase,T., Nakajima,D., Seki,N., Ohira,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VIII. The complete sequences of 77 new cDNA clones from brain which can code for large proteins in vitro JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 7308) AUTHORS Ohara,O. TITLE Direct Submission JOURNAL Submitted (06-OCT-1997) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, DNA Technology; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1..7308 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG1339" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 271..6678 /gene="KIAA0407" CDS 271..6678 /gene="KIAA0407" /codon_start=1 /db_xref="PID:d1024584" /db_xref="PID:g2662095" /translation="MPALGPALLQALWAGWVLTLQPLPPTAFTPNGTYLQHLARDPTS GTLYLGATNFLFQLSPGLQLEATVSTGPVLDSRDCLPPVMPDECPQAQPTNNPNQLLL VSPGALVVCGSVHQGVCEQRRLGQLEQLLLRPERPGDTQYVAANDPAVSTVGLVAQGL AGEPLLFVGRGYTSRGVGGGIPPITTRALWPPDPQAAFSYEETAKLAVGRLSEYSHHF VSAFARGASAYFLFLRRDLQAQSRAFRAYVSRVCLRDQHYYSYVELPLACEGGRYGLI QAAAVATSREVAHGEVLFAAFSSAAPPTVGRPPSAAAGASGASALCAFPLDEVDRLAN RTRDACYTREGRAEDGTEVAYIEYDVNSDCAQLPVDTLDAYPCGSDHTPSPMASRVPL EATPILEWPGIQLTAVAVTMEDGHTIAFLGDSQGQLHRVYLGPGSDGHPYSTQSIQQG SAVSRDLTFDGTFEHLYVMTQSTLLKVPVASCAQHLDCASCLAHRDPYCGWCVLLGRC SRRSECSRGQGPEQWLWSFQPELGCLQVAAMSPANISREETREVFLSVPDLPPLWPGE SYSCHFGEHQSPALLTGSGVMCPSPDPSEAPVLPRGADYVSVSVELRFGAVVIAKTSL SFYDCVAVTELRPSAQCQACVSSRWGCNWCVWQHLCTHKASCDAGPMVASHQSPLVSP DPPARGGPSPSPPTAPKALATPAPDTLPVEPGAPSTATASDISPGASPSLLSPWGPWA GSGSISSPGSTGSPLHEEPSPPSPQNGPGTAVPAPTDFRPSATPEDLLASPLSPSEVA AVPPADPGPEALHPTVPLDLPPATVPATTFPGAMGSVKPALDWLTREGGELPEADEWT GGDAPAFSTSTLLSGDGDSAELEGPPAPLILPSSLDYQYDTPGLWELEEATLGASSCP CVESVQGSTLMPVHVEREIRLLGRNLHLFQDGPGDNECVMELEGLEVVVEARVECEPP PDTQCHVTCQQHQLSYEALQPELRVGLFLRRAGRLRVDSAEGLHVVLYDCSVGHGDCS RCQTAMPQYGCVWCEGERPRCVTREACGEAEAVATQCPAPLIHSVEPLTGPVDGGTRV TIRGSNLGQHVQDVLGMVTVAGVPCAVDAQEYEVSSSLVCITGASGEEVAGATAVEVP GRGRGVSEHDFAYQDPKVHSIFPARGPRAGGTRLTLNGSKLLTGRLEDIRVVVGDQPC HLLPEQQSEQLRCETSPRPTPATLPVAVWFGATERRLQRGQFKYTLDPNITSAGPTKS FLSGGREICVRGQNLDVVQTPRIRVTVVSRMLQPSQGLGRRRRVVPETACSLGPSCSS QQFEEPCHVNSSQLITCRTPALPGLPEDPWVRVEFILDNLVFDFATLNPTPFSYEADP TLQPLNPEDPTMPFRHKPGSVFSVEGENLDLAMSKEEVVAMIGDGPCVVKTLTRHHLY CEPPVEQPLPRHHALREAPDSLPEFTVQMGNLRFSLGHVQYDGESPGAFPVAAQVGLG VGTSLLALGVIIIVLMYRRKSKQALRDYKKVQIQLENLESSVRDRCKKEFTDLMTEMT DLTSDLLGSGIPFLDYKVYAERIFFPGHRESPLHRDLGVPESRRPTVEQGLGQLSNLL NSKLFLTKFIHTLESQRTFSARDRAYVASLLTVALHGKLEYFTDILRTLLSDLVAQYV AKNPKLMLRRTETVVEKLLTNWMSICLYTFVRDSVGEPLYMLFRGIKHQVDKGPVDSV TGKAKYTLNDNRLLREDVEYRPLTLNALLAVGPGAGEAQGVPVKVLDCDTISQAKEKM LDQLYKGVPLTQRPDPRTLDVEWRSGVAGHLILSDEDVTSEVQGLWRRLNTLQHYKVP DGATVALVPCLTKHVLRENQDYVPGERTPMLEDVDEGGIRPWHLVKPSDEPEPPRPRR GSLRGGERERAKAIPEIYLTRLLSMKGTLQKFVDDLFQVILSTSRPVPLAVKYFFDLL DEQAQQHGISDQDTIHIWKTNSLPLRFWINIIKNPQFVFDVQTSDNMDAVLLVIAQTF MDACTLADHKLGRDSPINKLLYARDIPRYKRMVERYYADIRQTVPASDQEMNSVLAEL SWNYSGDLGARVALHELYKYINKYYDQIITALEEDGTAQKMQLGYRLQQIAAAVENKV TDL" BASE COUNT 1332 a 2308 c 2243 g 1425 t ORIGIN 1 gcccggccga cgcggctttg tctcctttgt tcccggcggt ggcagcgccg cgcgggaggg 61 gcgggcagcg ggcgcagttt tccgcccctc ggtctccggg taacagctgc ggctccacca 121 gacccgggga gaggccgctg cgcgcggagc ccgagcccgg agcggccgac gcccgcctcg 181 gcgcgcacat cccgcggggc ccggccgggt ggtgactccc acacgggtca tgctgttgtc 241 tcctgatcca gccggccctg ccaggtgacc atgcctgctc tgggcccagc tcttctccag 301 gctctctggg ccgggtgggt cctcaccctc cagccccttc caccaactgc attcactccc 361 aatggcacgt atctgcagca cctggcaagg gaccccacct caggcaccct ctacctgggg 421 gctaccaact tcctgttcca gctgagccct gggctgcagc tggaggccac agtgtccacc 481 ggccctgtgc tagacagcag ggactgcctg ccacctgtga tgcctgatga gtgcccccag 541 gcccagccta ccaacaaccc gaatcagctg ctcctggtga gcccaggggc cctggtggta 601 tgcgggagcg tgcaccaggg ggtctgtgaa cagcggcgcc tggggcagct cgagcagctg 661 ctgctgcggc cagagcggcc tggggacaca caatatgtgg ctgccaatga tcctgcggtc 721 agcacggtgg ggctggtagc ccagggcttg gcaggggagc ccctcctgtt tgtggggcga 781 ggatacacca gcaggggtgt ggggggtggc attccaccca tcacaacccg ggccctgtgg 841 ccgcccgacc cccaagctgc cttctcctat gaggagacag ccaagctggc agtgggccgc 901 ctctccgagt acagccacca cttcgtgagt gcctttgcac gtggggccag cgcctacttc 961 ctgttcctgc ggcgggacct gcaggctcag tctagagctt ttcgtgccta tgtatctcga 1021 gtgtgtctcc gggaccagca ctactactcc tatgtggagt tgcctctggc ctgcgaaggt 1081 ggccgctacg ggctgatcca ggctgcagct gtggccacgt ccagggaggt ggcgcatggg 1141 gaggtgctct ttgcagcttt ctcctcggct gcacccccca ctgtgggccg gcccccatcg 1201 gcggctgctg gggcatctgg agcctctgcc ctctgtgcct tccccctgga tgaggtggac 1261 cggcttgcta atcgcacgcg agatgcctgc tacacccggg agggtcgtgc tgaggatggg 1321 accgaggtgg cctacatcga gtatgatgtc aattctgact gtgcacagct gccagtggac 1381 accctggatg cttatccctg tggctcagac cacacgccca gccccatggc cagccgggtc 1441 ccgctggaag ccacaccaat tctggagtgg ccagggattc agctaacagc tgtggcagtc 1501 accatggaag atggacacac catcgctttc ctgggtgata gtcaagggca gctgcacagg 1561 gtctacttgg gcccagggag cgatggccac ccatactcca cacagagcat ccagcagggg 1621 tctgcagtga gcagagacct cacctttgat gggacctttg agcacctgta tgtcatgacc 1681 cagagcacac ttctgaaggt tcctgtggct tcctgtgctc agcacctgga ctgtgcatct 1741 tgccttgctc acagggaccc atactgtggg tggtgcgtgc tccttggcag gtgcagtcgc 1801 cgttctgagt gctcgagggg ccagggccca gagcagtggc tatggagctt ccagcctgag 1861 ctgggctgtc tgcaagtggc agccatgagt cctgccaaca tcagccgaga ggagacgagg 1921 gaggttttcc tatcagtgcc agacctgcca cccctgtggc caggggagtc atattcctgc 1981 cactttgggg aacatcagag tcctgccctg ctgactggtt ctggtgtgat gtgcccctcc 2041 ccagacccta gtgaggcccc agtgctgccg agaggagccg actacgtatc cgtgagcgtg 2101 gagctcagat ttggcgctgt tgtgatcgcc aaaacttccc tctctttcta tgactgtgtg 2161 gcggtcactg aactccgccc atctgcgcag tgccaggcct gtgtgagcag ccgctggggg 2221 tgtaactggt gtgtctggca gcacctgtgc acccacaagg cctcgtgtga tgctgggccc 2281 atggttgcaa gccatcagag cccgcttgtc tccccagacc ctcctgcaag aggtggaccc 2341 agcccctccc cacccacagc ccccaaagcc ctggccaccc ctgctcctga cacccttccc 2401 gtggagcctg gggctccctc cacagccaca gcttcggaca tctcacctgg ggctagtcct 2461 tccctgctca gcccctgggg gccatgggca ggttctggct ccatatcttc ccctggctcc 2521 acagggtcgc ctctccatga ggagccctcc cctcccagcc cccaaaatgg acctggaacc 2581 gctgtccctg cccccactga cttcagaccc tcagccacac ctgaggacct cttggcctcc 2641 ccgctgtcac cgtcagaggt agcagcagtg ccccctgcag accctggccc cgaggctctt 2701 catcccacag tgcccctgga cctgccccct gccactgttc ctgccaccac tttcccaggg 2761 gccatgggct ccgtgaagcc cgccctggac tggctcacga gagaaggcgg cgagctgccc 2821 gaggcggacg agtggacggg gggtgacgca cccgccttct ccacttccac cctcctctca 2881 ggtgatggag actcagcaga gcttgagggc cctcccgccc ccctcatcct cccgtccagc 2941 ctcgactacc agtatgacac ccccgggctc tgggagctgg aagaggcgac cttgggggca 3001 agctcctgcc cctgtgtgga gagcgttcag ggctccacgt tgatgccggt ccatgtggag 3061 cgggaaatcc ggctgctagg caggaacctg caccttttcc aggatggccc aggagacaat 3121 gagtgtgtga tggagctgga gggcctcgag gtggtggttg aggcccgggt cgagtgtgag 3181 ccacctccag atacccagtg ccatgtcacc tgccagcagc accagctcag ctatgaggct 3241 ctgcagccgg agctccgtgt ggggctgttt ctgcgtcggg ccggccgtct gcgtgtggac 3301 agtgctgagg ggctgcatgt ggtactgtat gactgttccg tgggacatgg agactgcagc 3361 cgctgccaaa ctgccatgcc ccagtatggc tgtgtgtggt gtgaggggga gcgtccacgt 3421 tgtgtgaccc gggaggcctg tggtgaggct gaggctgtgg ccacccagtg cccagcgccc 3481 ctcatccact cggtggagcc actgactggg cctgtagacg gaggcacccg tgtcaccatc 3541 aggggctcca acctgggcca gcatgtgcag gatgtgctgg gcatggtcac ggtggctgga 3601 gtgccctgtg ctgtggatgc ccaggagtac gaggtctcca gcagcctcgt gtgcatcacc 3661 ggggccagtg gggaggaggt ggccggcgcc acagcggtgg aggtgccggg aagaggacgt 3721 ggtgtctcag aacacgactt tgcctaccag gatccgaagg tccattccat cttcccggcc 3781 cgcggcccca gagctggggg cacccgtctc accctgaatg gctccaagct cctgactggg 3841 cggctggagg acatccgagt ggtggttgga gaccagcctt gtcacttgct gccggagcag 3901 cagtcagaac aactgcggtg tgagaccagc ccacgcccca cgcctgccac gctccctgtg 3961 gctgtgtggt ttggggccac ggagcggagg cttcaacgcg gacagttcaa gtataccttg 4021 gaccccaaca tcacctctgc tggccccacc aagagcttcc tcagtggagg acgtgagata 4081 tgcgtccgtg gccagaatct ggacgtggta cagacgccaa gaatccgggt gaccgtggtc 4141 tcgagaatgc tgcagcccag ccaggggctt ggacggaggc gtcgcgtggt cccggagacg 4201 gcatgttccc ttggaccctc ctgcagtagc cagcaatttg aggagccgtg ccatgtcaac 4261 tcctcccagc tcatcacgtg ccgcacacct gccctcccag gcctgcctga ggacccctgg 4321 gtccgggtgg aatttatcct tgacaacctg gtctttgact ttgcaacact gaaccccaca 4381 cctttctcct atgaggccga ccccaccctg cagccactca accctgagga ccccaccatg 4441 ccattccggc acaagcctgg gagtgtgttc tccgtggagg gggagaacct ggaccttgca 4501 atgtccaagg aggaggtggt ggctatgata ggggatggcc cctgtgtggt gaagacgctg 4561 acgcggcacc acctgtactg cgagcccccc gtggagcagc ccctgccacg gcaccatgcc 4621 ctccgagagg cacctgactc tttgcctgag ttcacggtgc agatggggaa cttgcgcttc 4681 tccctgggtc acgtgcagta tgacggcgag agccctgggg cttttcctgt ggcagcccag 4741 gtgggcttgg gggtgggcac ctctcttctg gctctgggtg tcatcatcat tgtcctcatg 4801 tacaggagga agagcaagca ggccctgagg gactataaga aggttcagat ccagctggag 4861 aatctggaga gcagtgtgcg ggaccgctgc aagaaggaat tcacagacct catgactgag 4921 atgaccgatc tcaccagtga cctcctgggc agcggcatcc ccttcctcga ctacaaggtg 4981 tatgcggaga ggatcttctt ccctgggcac cgcgagtcgc ccttgcaccg ggacctgggt 5041 gtgcctgaga gcagacggcc cactgtggag caagggctgg ggcagctctc taacctgctc 5101 aacagcaagc tcttcctcac caagttcatc cacacgctgg agagccagcg caccttttca 5161 gctcgggacc gtgcctacgt ggcatctctg ctcaccgtgg cactgcatgg gaagcttgag 5221 tatttcactg acatcctccg cactctgctc agtgacctgg ttgcccagta tgtggccaag 5281 aaccccaagc tgatgctgcg caggacagag actgtggtgg agaagctgct caccaactgg 5341 atgtccatct gtctgtatac cttcgtgagg gactccgtag gggagcctct gtacatgctc 5401 tttcgaggga ttaagcacca agtggataag gggccagtgg acagtgtgac aggcaaggcc 5461 aaatacacct tgaacgacaa ccgcctgctc agagaggatg tggagtaccg tcccctgacc 5521 ttgaatgcac tattggctgt ggggcctggg gcaggagagg cccagggcgt gcccgtgaag 5581 gtcctagact gtgacaccat ctcccaggca aaggagaaga tgctggacca gctttataaa 5641 ggagtgcctc tcacccagcg gccagaccct cgcacccttg atgttgagtg gcggtctggg 5701 gtggccgggc acctcattct ttctgacgag gatgtcactt ctgaggtcca gggtctgtgg 5761 aggcgcctga acacactgca gcattacaag gtcccagatg gagcaactgt ggccctcgtc 5821 ccctgcctca ccaagcatgt gctccgggaa aaccaggatt atgtccctgg agagcggacc 5881 ccaatgctgg aggatgtaga tgaggggggc atccggccct ggcacctggt gaagccaagt 5941 gatgagccgg agccgcccag gcctcggagg ggcagccttc ggggcgggga gcgtgagcgc 6001 gccaaggcca tccctgagat ctacctgacc cgcctgctgt ccatgaaggg caccctgcag 6061 aagttcgtgg atgacctgtt ccaggtgatt ctcagcacca gccgccccgt gccgctcgct 6121 gtgaagtact tctttgacct gctggatgag caggcccagc agcatggcat ctccgaccag 6181 gacaccatcc acatctggaa gaccaacagc ttgcctctga ggttctggat caatataata 6241 aaaaacccgc agtttgtgtt cgacgtgcaa acatctgata acatggatgc ggtgctcctt 6301 gtcattgcac agaccttcat ggacgcctgc accctggccg accacaagct gggccgggac 6361 tccccgatca acaaacttct gtatgcacgg gacattcccc ggtacaagcg gatggtggaa 6421 aggtactatg cagacatcag acagactgtc ccagccagcg accaagagat gaactctgtc 6481 ctggctgaac tgtcctggaa ctactccgga gacctcgggg cgcgagtggc cctgcatgaa 6541 ctctacaagt acatcaacaa gtactatgac cagatcatca ctgccctgga ggaggatggc 6601 acggcccaga agatgcagct gggctatcgg ctccagcaga ttgcagctgc tgtggaaaac 6661 aaggtcacag atctatagga acccaggagc cacggcctgc tgttgcttca gcctggcctg 6721 ggcagccctg gaagctcgga ggagaggcca ccttcttagg tgcctgtagt gactgacaag 6781 cagagttagt ggaaggtgac tcccagtctc ctggtggctc tggcctcggc cctgctggat 6841 ccacctccta gacccggggc ctcaaggctc atggggtagt acccagcctg ctccccgagt 6901 ccagcgaccc tgtgacaccg gtctgcaggg agttggggac taagggcttc cagagagtgg 6961 ctggaagaga ctccaggccc ctggggagac tgtactgttc ctgaacactg gccttggcca 7021 cactgggatt cggagaggaa ggaggagagc cccatgcttc ctgtctgcct cctccaccat 7081 ccctgacctc agttgagctg cctctggcct tgttgctgct gccacatcct aggtctaaga 7141 gttgaacgcc tctcctaggc cactacaaac tgacccctca gcagggctgg ctgccacagg 7201 gctgccctgc ctcataggta gccatggtga gggctatctg ctgcaggggg gtcttgggga 7261 gagtggtgac tccattgacc cagcttttca ttaaaggata acacactg // LOCUS AB007868 6581 bp mRNA PRI 05-DEC-1997 DEFINITION Homo sapiens KIAA0408 mRNA, complete cds. ACCESSION AB007868 NID g2662096 KEYWORDS KIAA0408. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG1359. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ishikawa,K., Nagase,T., Nakajima,D., Seki,N., Ohira,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VIII. The complete sequences of 77 new cDNA clones from brain which can code for large proteins in vitro JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 6581) AUTHORS Ohara,O. TITLE Direct Submission JOURNAL Submitted (06-OCT-1997) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, DNA Technology; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1..6581 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG1359" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 1081..2814 /gene="KIAA0408" CDS 1081..2814 /gene="KIAA0408" /codon_start=1 /db_xref="PID:d1024585" /db_xref="PID:g2662097" /translation="MCKEQKATKKSKVGFLDPLATDNQKECEAWPDLRTSEEDSKSCS GALSTALEELAKVSEELCSFQEEIRKRSNHRRMKSDSFLQEMPNVTNIPHGDPMINND QCILPISLEKEKQKNRKNLSCTNVLQSNSTKKCGIDTIDLKRNETPPVPPPRSTSRNF PSSDSEQAYERWKERLDHNSWVPHEGRSKRNYNPHFPLRQQEMSMLYPNEGKTSKDGI IFSSLVPEVKIDSKPPSNEDVGLSMWSCDIGIGAKRSPSTSWFQKTCSTPSNPKYEMV IPDHPAKSHPDLHVSNDCSSSVAESSSPLRNFSCGFERTTRNEKLAAKTDEFNRTVFR TDRNCQAIQQNHSCSKSSEDLKPCDTSSTHTGSISQSNDVSGIWKTNAHMPVPMENVP DNPTKKSTTGLVRQMQGHLSPRSYRNMLHEHDWRPSNLSGRPRSADPRSNYGVVEKLL KTYETATESALQNSKCFQDNWTKCNSDVSGGATLSQHLEMLQMEQQFQQKTAVWGGQE VKQGIDPKKITEESMSVNASHGKGFSRPARPANRRLPSRWASRSPSAPPALRRTTHNY TISLRSEALMV" BASE COUNT 2135 a 1173 c 1251 g 2022 t ORIGIN 1 tgcggctgtt aaacatagag tcatttgctt tgatagtcat ctgacagtct cttttgaagt 61 ttgacaagct tttacatgac ccaagcctaa agagaaagcc atttctttcc tctattcaga 121 agtgaactga atgagtgtgt gcaggttcaa ctaagatctc ataaagaagt aatctggatt 181 ctaccttaat acagactaat agatgatcaa agccttgcag ttctattata tcacataggg 241 tattgtcata tttgttttaa tctggatgag aaatatttta aaaatgtgta tcactttgag 301 attttttaaa aatctcagta atatctagcc ctcacaatgt aattgaccaa ccaaaggaaa 361 catcttttga ctgacaagtg actgtttagg gaacaagaat aagttgactt actatctgag 421 attgtcaagc atttagacag ttaaatgtta gtgtacaagt aggaccagac atttgaaaac 481 tatgacttta atgttaaatt cttgcatctc tctgaaagca attatataat atggccagta 541 attgattcag gtcattctaa ttttgtgaca gtgtttctgt atttatatac agatacagat 601 ttattttatt tttgaatgat tgattgctag tcaaaagaat tagcatccat gtgttgccct 661 actaaaagta gcctttatgt ttcatcaaac ttgttccagc taatccatat tttagaggca 721 ctatttggag gccgaagcat taatacccag gaagtatatt ttaatatggc aaagagaact 781 caaagcggtt tgcttagtat caagaatatt agcaaagtga attaggggaa aagaagtttt 841 aattttgttt tgtttttctt acagctttgc cgggaagtaa agctttggag gaaaatcaat 901 atcaatgaac gtgctaagat cattgatctt taccatgaga agaccattcc agagaaagtg 961 atagaatctt ccccaaatta ccccgattta ggacaaagtg aatttataag gacgaatcac 1021 aaagatggtc tgagaaaaga aaataaaaga gagcagagct tagtcagtgg aggaaatcaa 1081 atgtgtaagg aacaaaaagc aacaaaaaaa tcaaaagtag ggtttttgga tcctttggct 1141 acagacaacc aaaaggaatg tgaggcctgg cctgacctga ggacttctga ggaagacagc 1201 aagagctgtt ctggcgccct cagtacagct cttgaagaac ttgcgaaggt gagtgaagaa 1261 ttatgcagct ttcaagagga aattcgaaag cggtctaacc atagaaggat gaagtcagat 1321 tcttttctcc aggaaatgcc aaatgtaact aatatacctc atggggaccc catgatcaac 1381 aatgaccagt gcattcttcc aatcagttta gaaaaagaaa aacagaaaaa taggaagaat 1441 ctgagctgta ccaatgtgct ccagagcaat tctacgaaaa aatgtggaat tgatacaatc 1501 gatttaaaaa gaaatgaaac tccaccagtt cctcctccaa gaagcacctc tcgaaatttt 1561 cccagctcgg attctgaaca agcctatgaa agatggaagg aaaggttaga ccacaacagc 1621 tgggtgcccc atgagggtcg aagtaaaagg aattacaacc ctcacttccc tttgagacaa 1681 caagagatgt ctatgttgta tccaaatgaa gggaaaactt cgaaagatgg tatcatcttt 1741 tcctctttgg taccagaagt caaaatagat agcaagcctc caagtaatga agatgttgga 1801 cttagcatgt ggtcatgtga cattgggata ggtgcaaaaa ggagcccctc tacttcgtgg 1861 tttcagaaaa cctgctctac ccccagtaat ccaaaatatg aaatggtgat cccagatcac 1921 cctgctaaat ctcatcctga tcttcatgta agtaatgact gtagctcctc agtagcagag 1981 agcagtagcc cacttagaaa tttcagttgt ggctttgaaa ggactacaag gaatgagaag 2041 ctggcagcaa agactgatga atttaacaga actgtattta gaacagatag aaattgtcag 2101 gcaatacagc aaaatcacag ctgctcaaaa tcatcggagg atctcaagcc ctgtgatacc 2161 tcatctactc acacaggtag catatcacaa agtaacgatg tgtccggtat ttggaaaacc 2221 aatgcccaca tgcctgtgcc catggaaaat gtgcctgata atcccaccaa gaaatccaca 2281 acaggcctag taagacaaat gcagggacac ctaagtcctc gcagttatcg aaatatgctc 2341 cacgagcatg actggagacc gagtaatttg tctggccgtc cgaggtcagc tgatcccagg 2401 tcaaattatg gtgttgtgga aaagctgctg aaaacctatg agacagcaac agagtctgca 2461 ttgcaaaatt ctaagtgctt ccaggataat tggaccaaat gtaattctga tgtcagtggt 2521 ggtgccacat taagtcagca tttagaaatg ctccaaatgg aacaacagtt tcagcaaaag 2581 acagctgtgt gggggggaca ggaagtgaag caaggaatag atccgaaaaa gataacagag 2641 gaatccatgt cagtgaacgc ctcacatgga aaaggatttt cccgacctgc tagaccagca 2701 aatcgtcgtc tcccctccag atgggcatcc agatctccat ctgcaccccc tgccttgcgg 2761 agaactaccc acaactatac catttctctg cgatccgaag cattgatggt ttaagtcttt 2821 ggcctggatt gctatattac agaagttcta gtcccacttg tcaaacagag cattctgagt 2881 gtttacaggc ggacctgttc tcttccgaca agcaatttga atcttaactt tccatcagtc 2941 ttcagtgttt ttaatctaag gaataattat cttcctgcat tttgatttct tagatcagaa 3001 ttttttaatg ccatcctcat tagttttcca atacaatata agttgaaaca atgtacaata 3061 ttgtatattc tttgttgcaa gtggcagaaa gtaaggttat gtgcatggcc atgtgtttgt 3121 aggtgcatgt gttgaaatag gaagtatcct agtatgtatt tagatatgaa aaacttatgt 3181 gctagtgttg actttgaaat tataacatgt atacctatat attctttgtg tttatttaaa 3241 agttttgaaa atatgcagta ctgtttatat tttgtatacc ttttaatagg tatccactta 3301 aggcatttgc agtacaaata ttaattgacc acttccttgg cccccagcta caatgaaaaa 3361 gaagaattat gcacacatct aacattggta gatttctaaa gacaactcat aaataatttc 3421 ttaacccata taaggataaa ataataccat tgatattctt gtttattctt tttaaataaa 3481 aatgtcacaa gtttaataga tcagccttgt aaagcaagca taaaatgttc ataaatgttt 3541 tatatattat ttggtagtct agtatggctt taactgaata tttgatcctc agagacagta 3601 tcatgagact gcattataaa tgacatgatt aaatgatgaa tttattttgt cagcctcagc 3661 aacaaacatt tcctaaatgg gatgcaaaaa gaaactgaat gatgttgaat tttagttttt 3721 ctttaagttg tggtaagctt ttagtttagc tgctttaaca tgaagtcata catacttaga 3781 ggtaagtcat ttattatttt tttccttttt ctaatctttt ccttaaaatt aatagataac 3841 aaactaagtt acttttttac taacaaggaa gatcaggaat aatatttctt tgaaagctga 3901 aattatactt ctcacagttt gctaccaaaa aaaggcttaa actaaacctt gacatccaca 3961 gtagcatcag gaatagaaat gtatattttg gctacagttt gcatacagtg aacacataaa 4021 atctatattg aaaactgaca gaaatatgaa tgtcgagggg cttggaattc taaatggaag 4081 attttagaag gaactttgct ctctactgtc actatataaa accaattatg ttatgttatt 4141 agatattaac tctatttaag cttttctttt actaagaaat tcttatgcca aatgtattcc 4201 aagagccatc tccttatgat gtatttgaac catttagaga aattcacttc attgtacccc 4261 atgacctgag gtaaagaaaa taattttcaa attgcagact tccttgaaag gctcaagact 4321 gtagcatggt gatgtctaga ttcatacttt actgtaagga cagtttcggt agttaagtgg 4381 aattaaaggt ctagagtgat ttgtattgga tggtaactaa ggtctataca tctactcttt 4441 taattcttat ttccttcagg aagatccttt tccacaggaa gctatcaaag aaaagtaaac 4501 tagacaccaa gtaatatcac tctcataata tgctttccag ctagagggga gaaaacacca 4561 ggaaagagga accctgggtc ataaaatgac ggacctgtta ggaaaatgct gttgtgtgac 4621 aatggagagc ctgcccaagg actgtgcaac ctatctgttc ctctccacca gtgtatccca 4681 agtagtctgg tatgatctgt ggttccttag aatattgagg atctctttaa aaaatgaaat 4741 ttctgatctc accccatatt actaaagcaa aatttccagg ggtagggact gggactccag 4801 aaccattaat aaaacaaaac aaaacaaagc aaaatattga tgatcaggca gatttgaaaa 4861 cagtgacatt gcttgatctc tgaatggatg gatgcccttg ggtggtcact tttcttccta 4921 ggacttgcct ttgttatctg taaattagaa tggctatact ttcagacttc acaattttaa 4981 gccacttttt tctattaaca gttgaatgaa ccagtgggac acattttaac gtcttcttat 5041 gttgactaaa attatagtta gagctgagaa aaaaaaaata gcatctagtc caactccttg 5101 attgtaaaac tgaaaaacta aggttatttg aatgttctgt tttctctttt taaatatttt 5161 ataactttta aatggaaaaa cctattaggt aaatggaaat aatccttgaa aatagttttt 5221 gatgttttgt gatttttaac tggtaaattt ctactttagg tgggcatttt cttccctaga 5281 atatgccctt tattatgcta tcattgctag atttaaaggg acttgtaagc atttctggaa 5341 atatttgatt taaaaacata tattaataga tgttttccct agcagacttt tggagcaaaa 5401 atattatttt agttcagcag aggattactg atattatggg ctgatctcag aaaggaataa 5461 ggatgaggca gagaggagta cttgagtttg tgtgtgtgtg tgtgtgtgtg tgtttgtgtc 5521 agtgtcacat aagttctcca tattgcctta tgtcccaaag ccaaatataa aaatataaat 5581 gatgctttgt ataattcatt ttatcaaaaa ttacccataa ctttcatttg tttttatatc 5641 gacaatgaag atgatcatat ttcattttgc agatgtggcc ataaaatatt atttacttct 5701 acataggtga ttttaccagg tttagttatc ctgagaaaac atctggactt aagagtttct 5761 gcttccctac aaagatcttt aaagttattt ttaggcacat ttgtgacaaa caactacttg 5821 atattgaaat ctccttcagc cattagaagc tatcaaataa agtaggtgta aaaacaactg 5881 tctgtggcat ttgtatacat cgagaacatt tttctttccc tcattttctg cagtgaactc 5941 cagtaaagct aagtgtctta tgaaatctaa actcatatat gtacacagtt cactctagct 6001 tcttccaaaa tatctctagg tagtacaact gaagccaaac ctgcctgact tcctgctcct 6061 ggccacccaa aactccatat ggcttctcgt acactgacat cctctctcta cccatttaac 6121 tgctaattga gtctgataaa agtcttcttt gaaaaaagtt tttacttcta agatttgcat 6181 ttacatcata aaattaaacg attttcagga aaatcagatt ttttttatta cagtactatt 6241 tgctttaaat tcggcatgtt tttcttaagt agcaagtaca tgtatcggaa cttagaactg 6301 gtgggcgcgg tggctcttgc ctgtaatccc agcactttgg gaggccaagg tggggggatc 6361 acgaggtcag gagatggaga caatcctggt ttcatcacgg tgaaaccccc tctctgctaa 6421 aaatacaaaa aattagccag gcattgtggt ggtcacctgt agtcccagct gctcgggagg 6481 gtgaggcagg agaatggcat gaacccggga ggcagagctt gcagtgagca gagatcgcgc 6541 cactgcactc cagcctgggt gacagagtga gactctgtct c // LOCUS AB007870 6356 bp mRNA PRI 05-DEC-1997 DEFINITION Homo sapiens KIAA0410 mRNA, complete cds. ACCESSION AB007870 NID g2662100 KEYWORDS KIAA0410. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG1877. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ishikawa,K., Nagase,T., Nakajima,D., Seki,N., Ohira,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VIII. The complete sequences of 77 new cDNA clones from brain which can code for large proteins in vitro JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 6356) AUTHORS Ohara,O. TITLE Direct Submission JOURNAL Submitted (06-OCT-1997) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, DNA Technology; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1..6356 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG1877" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 144..1601 /gene="KIAA0410" CDS 144..1601 /gene="KIAA0410" /codon_start=1 /db_xref="PID:d1024587" /db_xref="PID:g2662101" /translation="MSTGFSFGSGTLGSTTVAAGGTSTGGVFSFGTGASSNPSVGLNF GNLGSTSTPATTSAPSSGFGTGLFGSKPATGFTLGGTNTGIATTITTGLTLGTPATTS AATTGFSLGFNKPAASATPFALPITSTSASGLTLSSALTSTPAASTGFTLNNLGGTTA TTTTASTGLSLGGALAGLGGSLFQSTNTGTSGLGQNALGLTLGTTAATSTAGNEGLGG IDFSSSSDKKSDKTGTRPEDSKALKDENLPPVICQDVENLQKFVKEQKQVQEEISRMS SKAMLKVQEDIKALKQLLSLAANGIQRNTLNIDKLKIETAQELKNAEIALRTQKTPPG LQHEYAAPADYFRILVQQFEVQLQQYRQQIEELENHLATQANNSHITPQDLSMAMQKI YQTFVALAAQLQSIHENVKVLKEQYLGYRKMFLGDAVDVFETRRAEAKKWQNTPRVTT GPTPFSTMPNAAAVAMAATLTQQQQPATGLNAFKL" BASE COUNT 1910 a 1084 c 1234 g 2128 t ORIGIN 1 gccttcgccg ccgttggggc tggaagttcc cgccaggtcc gtgccgggcg agagagatgc 61 tgcccggccc gcctcggctt tgaggcgaga gaagtgtccc agacccattt cgccttgctg 121 acggcgtcga gccctggcca gacatgtcca cagggttctc cttcgggtcc gggactctgg 181 gctccaccac cgtggccgcc ggcgggacca gcacaggcgg cgttttctcc ttcggaacgg 241 gagcgtctag caacccttct gtggggctca attttggaaa tcttggaagt acttcaactc 301 cagcaactac atctgctcct tcaagtggtt ttggaaccgg gctctttgga tctaaacctg 361 ccactgggtt cactctagga ggaacaaata caggaatagc aacaactata actacaggat 421 taactctggg aacgccagcc actacatctg cagctacaac aggcttcagt ttaggattca 481 ataaacctgc agcatctgcc acaccatttg ctctacctat tacctctacc tcagctagcg 541 gtctgactct ttcgtctgct ctgacatcaa ctccagcagc atccacagga tttactctaa 601 ataatttggg tgggacaaca gccacaacta caactgcatc aacaggcctc tctttagggg 661 gagccttagc tggtttggga ggttcacttt tccagagtac aaacacagga acatcaggac 721 ttggacagaa tgctttaggg ttgactttgg gaactacagc agctacttca actgcaggca 781 atgaaggcct tggtggtata gatttcagta gctcctcaga taaaaagagt gataaaacgg 841 gaacaagacc agaggatagt aaagctctga aggatgaaaa tctacctcct gtcatctgcc 901 aggatgttga aaatctccag aaatttgtga aggagcagaa acaagttcaa gaagaaatta 961 gtagaatgtc ttcaaaagca atgcttaagg tacaagaaga tattaaagct ctgaagcagc 1021 tcctgtcgtt ggctgccaat ggaatacaga gaaacactct caacattgac aaattgaaaa 1081 tagaaactgc tcaggagttg aagaatgctg aaatagcttt aagaacccag aagacaccac 1141 ctggacttca acatgaatat gcagctcctg ctgactactt cagaatcttg gttcagcaat 1201 ttgaggtaca gcttcagcag tacaggcagc agattgaaga actagaaaac catcttgcca 1261 ctcaagcaaa taattcacat ataacccctc aagatttgtc aatggctatg cagaaaattt 1321 atcaaacatt tgtagcttta gcggcacaac ttcagtctat tcatgaaaat gtaaaggttc 1381 tgaaagaaca gtaccttggc tacaggaaaa tgttcttggg agatgctgtt gatgtgtttg 1441 aaacaaggcg agcagaagcc aagaagtggc agaacacacc cagagttact actggaccca 1501 ctcctttcag caccatgcca aacgcagcag ccgttgccat ggctgcaaca cttacacagc 1561 agcaacagcc tgctacaggt ctgaacgcat tcaagttata gcttcttact gttccaatgc 1621 agaacattta tgtgcgtttt tataagtctt ccatttgtgt gagcactgaa gaaatttcta 1681 tgagtagctg ttccaataca atctaaaatt gtgatggaca cgattataaa catacctgat 1741 aaaaaaggca gggtagtctt agcaataatt atacatcaat tagatattga atattgtgaa 1801 tctacttaat gttatttgct gtaatgctat aaatgccctt tagaacatgg tgtttcacta 1861 gtgaattaca tttaagaata acattagatt ctgaaatcat gactgtacca tttattgttt 1921 taaagtcaga gatcatccat ccattctttg aatttttcat atatgatact gataggtggt 1981 gcattctcat tttcctcata catagtgatt acaaagtacc cacattcagg atgaagtatg 2041 cttttattga ctgaaggtat gttgaagatc aataggcttg tttgtatgta tgttgactgt 2101 aaacagaata caagccctct aaaccaaatt ttgaagatga tacatgctag aattgtacct 2161 ttccaaggtg aacagaacag caagagcagg aaaaagatgt tttgagatgt ggattgcaca 2221 ttaaggagca actgggaagt tgaacagatg aaaataattg gtgacatgga aagcagattg 2281 ttttatttct tggtttagtt acacttctaa ttagagaaac attttttgtg gtcattctac 2341 agtgttttat ttggtcctgt gtctttcgga ggttaatagt tatcttaatc ctgccatccc 2401 ttttgtcagg tttgaagctt gacatggaaa aatagtctta atcgatattg atttttttat 2461 tttaatcatt cataaatttc ttagtaatgc ttactattca atagcatgtg tctttctgag 2521 agttccatat agtccagtag tacattataa tacacacaaa ttaatacata tttagattgc 2581 attcaagcta cctatactag ggaagcattc tctggttttc actattggca gtttaaaaaa 2641 ttgttcttca catttggaga tgacagaatt gacttgccat aagtaaaggg ttgatgatag 2701 atttatgcca aatcctccat atcccgtggc tggacccatg tgaatagtgg ctcgcacgag 2761 cactgtctgg tagctctcta gctggctgga gtgtaggatt gcgttgttaa gttatggagg 2821 gaaaaagtca ttatgggatt gattcctttg aggattgata tttcttctta aaaggattca 2881 atcccataat tatgaccatc agctctcatg attacctaac cagttaatca aaccaaccat 2941 agtactagtt tactttttag agccagtatt aatcctaaag tagataagtg tagatttggt 3001 taagtgttct ttaaacatag tgaatttcca ttctattaaa tagaacttga gaaatgtcag 3061 ttatataaaa agttatgttg taagtgaata tattctatga taaataccta aacttgaagc 3121 aaaatctcct gtcaagggtt tgtgtgtgtg tgtgtgttta actttcttat ctatagatta 3181 ttattttgag aaattcatag gttttcactt atacattttg agcttgtgga catatggaga 3241 tgtgtgttca ttctctaaat gagtgcctac tatgtgctgg gctcgaggaa tacaaaggta 3301 gacaatatag attttgtctt tgccctctgt ggtggttgac aatgttagct taatctggag 3361 tattcttgag gagccttttg ctcataggct ttaattttgt ctgtgacttt tgccaggctt 3421 tggggaaatt gactgccttc actcttccct gaaactttga agtccacata agggctttct 3481 catcagatgg cagaaattat aatttattaa cctaaacaac cttacagtgt tttcgctttg 3541 aattgtatga attcttagaa ctgagaagct aaacaacagt gataacaatt ttacattatc 3601 ttttctttct ccttttgttt ctctaaaggt ctcccaggcc ttcccagtca aaattttcgt 3661 tatttttagt catgtcacca tgatgtgaga tttctgagct ctgttagata catttagaaa 3721 tgtccctatt tatcaaatgg tgttttgtac cccataagtt atcccttatg ccctgtgact 3781 tctgtgaaat ctttgttttt aagtgccagt aacaaatttt agaccatcta gatgagggta 3841 gagagagctt atgcattgtt tttatacatc actgttgatc tatacaagat ttgagttatc 3901 agaagggttt tgcttttatt tgcatatgaa atttttatta acatgaatat gggcacagag 3961 caaatagctt cttagactgc tgccttctga aggtggcttc tgtatctttc tcagaggagg 4021 aggaatagac ttaaacgact tctttgtcct ccttcaattt tttttttaat gaaaaagtat 4081 actagggctt aggcatgtgt gatttacagt tatttgttct gaaggctgca gagaattgaa 4141 aatccagcag gctatttgtg tatgaaatca tgagcatcga taacgtggat aacgtagttc 4201 catgttggca gaacttgaca cttgtctata agtgcttccc aagcctcctg aaaaactgaa 4261 gacatttaca ttaggctttt cagtataatt ttaatttagg ataggttttt tagcaatgta 4321 tctgggattg agttttgcta ttgtgttttg acactttaag agttcttgat gtttacataa 4381 taagactttc ttatgtgact tagctgctgt tttaaggaga gccaagacgt tttgttagga 4441 attcctaatt tttagcttag tgtaggtttg gacatttaag gtgtagtgtt aaataggaaa 4501 cattaagtat ttcatcttag tttcttgtgg ttttgtgatt ttgttgtatt taataaccaa 4561 aggaacattc cccattaaat cagtaagagt aaaatttttc aaactgggga aaacgcattt 4621 taggtttttt aaagaaaaat tattttaaaa tgtgacttag aaataaccat actaattaca 4681 ttaggcattt ggggtgatca tggaaaatta taaaatattt ttagcacttt agtccttttt 4741 ctgttaatgc agttttactt ttaagaaagt aaaatttatt ctggaaaata tgaattacag 4801 cgctaaacct ggttttgttg tgtgtgaata tattatatgg taacttttgt aaaatgttac 4861 tctacatgaa gatttcactt tggccagtca ttggtatata tcactacaaa cttaatttta 4921 tttatggaaa taagaacctc tcattggatg actattcaaa tttcaaagca gacttcttga 4981 attgtttacg taacttttgc ctgaaggatt tagactgctt tctaatagtg aaactaatat 5041 atatggtggc cagagtgtaa tatatgctaa tactttggca tgggagatat ttatcatgag 5101 tttttactat taaaaaatgt tatacatttg cctacgagtt ttataaatga tgttgccttc 5161 agaatttgtg tgaaggtaca aaactaaaaa tatcatgtat ctgatgtgca atggaaagtc 5221 ttgtcatcat tagatatgag ttcttgatta tattctgaat attgatgatt agaaaaatct 5281 tatgttctgt cgctcttaac aacagaccca cacacaacaa aaagcatgct atttgagtaa 5341 ctcttacgac ttacaaggct gagggctgtg gtgtgcatta ttctttaggg tcttccacca 5401 gcagtgcctc ctattaacct gtgacaaaca agtctctcaa tgttaaagaa ctggacatgt 5461 gttttttatg ttttttgtta ggaatcagaa agtataataa tatgcttggt ttctttttcc 5521 tatagtaaca ttttacaaaa gagctgctag gcaggtattt tattctcaaa gtgatctcaa 5581 actggatgta aaattttaaa ttattttttt aatagtagta taagaattct ggctgctatt 5641 agttattgtt tgttttctgg gttagaagct attcgaaaag tccagtttct gtcccagtgt 5701 agcaaaatgt agttcctcgg ttgtttttct ttaaatgctt tataatttta cactaccttt 5761 ttaatataca aacctcattc ttcattggat aacttgaagg ctttgatttc tttaaaaatt 5821 taaattttag tgtgtatatt actttgacag ttccctcatc tttgagatgc actgatcact 5881 gtgcttgaaa aagacaatac tgaagattgt actatgaagt ttattgaata attttcataa 5941 attatttatc caaatgagag atttttagat ttttgtattc tgcttagttt taaaaaaaaa 6001 aaatagtagt ttaaaagaga ggctagtaag tttgatgcta ttcttgccaa acaaactcag 6061 ccaaaatctt taaagtaaca agagggaaaa ggatgactaa tcgttctgct tctgagtaca 6121 ttttccaaaa cgttggaaag aaacttctga attgaaatct tgaatgtatt gaatctgtca 6181 aggtacacag cggtgccttt gtaaatgttc attactttat ttaatcaggt gataagtggt 6241 gtaatgtagc agagcttaag aatagaactc aattatcact ttttgtgaac aagttggaat 6301 tgtcatgtta ctgtgtaatt gatttgcttt acaatgaaca ataaatttaa taaaat // LOCUS AB007871 6631 bp mRNA PRI 05-DEC-1997 DEFINITION Homo sapiens KIAA0411 mRNA, complete cds. ACCESSION AB007871 NID g2662102 KEYWORDS KIAA0411. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG2207. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ishikawa,K., Nagase,T., Nakajima,D., Seki,N., Ohira,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VIII. The complete sequences of 77 new cDNA clones from brain which can code for large proteins in vitro JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 6631) AUTHORS Ohara,O. TITLE Direct Submission JOURNAL Submitted (06-OCT-1997) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, DNA Technology; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1..6631 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG2207" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 824..3583 /gene="KIAA0411" CDS 824..3583 /gene="KIAA0411" /codon_start=1 /db_xref="PID:d1024588" /db_xref="PID:g2662103" /translation="MDFFYFNGSEVQRCQSGTVRTNCLDCLDRTNSVQAFLGLEMLAK QLEALGLAEKPQLVTRFQEVFRSMWSVNGDSISKIYAGTGALEGKAKLKDGARSVTRT IQNNFFDSSKQEAIDVLLLGNTLNSDLADKARALLTTGSLRVSEQTLQSASSKVLKSM CENFYKYSKPKKIRVCVGTWNVNGGKQFRSIAFKNQTLTDWLLDAPKLAGIQEFQDKR SKPTDIFAIGFEEMVELNAGNIVSASTTNQKLWAVELQKTISRDNKYVLLASEQLVGV CLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCSHFAAGQ SQVKERNEDFIEIARKLSFPMGRMLFSHDYVFWCGDFNYRIDLPNEEVKELIRQQNWD SLIAGDQLINQKNAGQVFRGFLEGKVTFAPTYKYDLFSDDYDTSEKCRTPAWTDRVLW RRRKWPFDRSAEDLDLLNASFQDESKILYTWTPGTLLHYGRAELKTSDHRPVVALIDI DIFEVEAEERQNIYKEVIAVQGPPDGTVLVSIKSSLPENNFFDDALIDELLQQFASFG EVILIRFVEDKMWVTFLEGSSALNVLSLNGKELLNRTITIALKSPDWIKNLEEEMSLE KISIALPSSTSSTLLGEDAEVAADFDMEGDVDDYSAEVEELLPQHLQPSSSSGLGTSP SSSPRTSPCQSPTISEGPVPSLPIRPSRAPSRTPGPPSAQSSPIDAQPATPPPRPVAP PTRPAPPQRPPPPSGARSPAPTRKEFGAPKSPGTTRKDNIGRSQPSPQAGLAGPGPAG YSTARPTIPPRAGVISAPQSHARASAGRLTPESQSKTSETSKGSTFLPEPLKPQAAFP PQSSLPPPAQRLQEPLVPVAAPMPQSGPQPNLETPPQPPPRSRSSHSLPSEASSQPQQ EQPSG" BASE COUNT 1980 a 1368 c 1348 g 1935 t ORIGIN 1 gttggaaaaa ttcaagaatc tgaagttttc cgagttactt ccactgagtt tatatcactg 61 cgaatcgatt cttcagatga ggatcgcatt tcagaagtgc ggaaagtttt gaattcagga 121 aacttttatt ttgcatggtc tgcatctggc atcagtttag atttgagtct taatgcgcat 181 cgtagcatgc aagaacagac aactgataat agatttttct ggaatcagtc tttgcatttg 241 catctcaaac actatggcgt gaattgtgat gactggttat tacgtcttat gtgtggagga 301 gtagaaatca gaacaattta tgctgctcat aaacaggcga aggcttgcct catttcaaga 361 ttaagctgtg aacgagctgg gaccaggttt aatgtccggg gaacaaatga tgatggtcat 421 gttgccaatt ttgtagaaac agaacaggtt gtgtacttag atgactcagt ttcttccttc 481 atacaaatcc gaggatctgt tccattgttc tgggagcaac cagggttgca agtgggatct 541 catcgtgtcc gtatgtcaag gggatttgaa gccaatgcac ctgcttttga caggcatttt 601 agaacactta agaacttata tggtaaacaa ataatagtaa atttgcttgg atctaaggaa 661 ggtgaacata tgctaagtaa agctttccag agtcatttga aagcttctga acatgctgct 721 gatatccaga tggtgaattt tgactatcat caaatggtta agggaggaaa ggcagaaaaa 781 ttacatagtg ttcttaaacc tcaagtccag aagtttctag attatggatt ttttttattt 841 caatggaagt gaagttcaaa gatgccagag tggtacagtt cgaacaaact gcttggattg 901 tcttgataga acaaatagtg tgcaggcatt tcttggctta gagatgctag ctaaacagtt 961 ggaagctctt ggtttagctg aaaagcctca gttggtgact cgctttcaag aagtttttcg 1021 gtcaatgtgg tccgtgaatg gtgattcaat cagtaagata tatgcaggaa ctggagctct 1081 tgaagggaaa gcgaagttaa aagatggtgc tcgctctgtt acccgaacaa ttcagaataa 1141 cttctttgac agctccaagc aagaggccat tgatgttttg ctactgggaa atactctgaa 1201 tagtgattta gctgacaaag ctcgagcact tttaactact ggaagtttgc gtgtttctga 1261 gcagacatta cagtcagcat cttctaaagt actaaagagc atgtgtgaga atttctacaa 1321 atattcaaag cctaagaaaa ttcgagtatg tgtcggaacc tggaatgtga atggtgggaa 1381 gcaatttcgc agcatagctt ttaagaatca gacactcact gactggcttc ttgatgcacc 1441 caagttagct ggcatccagg agtttcaaga taaaagaagt aagccaactg atatatttgc 1501 aattggtttt gaagaaatgg tagaattgaa tgctggaaac attgtgagtg caagcacaac 1561 aaatcagaag ctctgggctg tagaacttca gaagacaatc tccagagaca acaagtatgt 1621 gctgctggct tctgaacagt tggtgggcgt ctgtttgttt gtttttatca gaccacagca 1681 tgctcctttt atcagggatg ttgcagttga tactgtgaag actggaatgg gaggtgcaac 1741 tggaaataag ggagcagttg caatccgaat gctcttccat acaaccagcc tttgcttcgt 1801 ctgtagccac tttgctgcag ggcagtcaca agtcaaagaa agaaatgaag attttataga 1861 aatagcacga aaattgagtt ttcctatggg aaggatgcta ttttcccatg actatgtatt 1921 ttggtgtggt gatttcaact atcgaatcga tctccctaac gaagaagtta aagagctcat 1981 aagacagcaa aattgggatt ctcttatagc aggagatcaa cttatcaatc agaaaaatgc 2041 tggacaggtt tttagaggat ttttagaagg aaaggtaacc tttgctccga catataagta 2101 tgacttgttt tctgacgact atgacaccag tgaaaagtgc cgcacccctg cctggacaga 2161 ccgtgtcctt tggagaagga ggaaatggcc ttttgataga tcagctgaag atctagatct 2221 tctaaatgct agttttcaag atgaaagcaa aattctgtac acgtggactc caggcacttt 2281 gctgcactat ggaagagctg agctgaagac ttctgaccac aggcctgtcg ttgccctgat 2341 tgatatagat atatttgaag ttgaagctga agagaggcaa aacatttata aagaagtaat 2401 tgcagttcag ggtccaccag atggtacagt attggtctca atcaaaagtt ctttaccaga 2461 aaataatttt tttgatgatg ccttgattga tgagcttctg cagcagtttg caagttttgg 2521 tgaagttata cttataagat ttgtagaaga taaaatgtgg gttacatttt tggagggaag 2581 ctctgccttg aatgttctga gcctaaatgg taaagagtta ttgaatcgga ctataactat 2641 tgctttaaaa agtccagact ggatcaaaaa tttggaagaa gaaatgagtt tagagaaaat 2701 tagcattgca ttgccatcat caacaagctc taccctgctt ggtgaagatg cagaggttgc 2761 agcagatttt gatatggaag gtgatgttga tgactatagt gctgaagtgg aggaacttct 2821 tcctcagcat ctccagccat cttcaagttc cggccttggt acttccccca gctcttcacc 2881 ccgaactagt ccctgccagt cacctacaat atcagagggt cctgtacctt cccttcccat 2941 cagaccaagc cgagcaccgt caagaactcc tgggcctccc agtgcacaga gttctcctat 3001 tgacgcgcag ccagcaacgc cgccgccccg cccggtcgcc cctcccacac gcccggctcc 3061 cccacagaga cctcctccgc cttcaggggc taggagtcct gcacccacta gaaaggaatt 3121 tggagcaccc aaaagccctg gaacaacaag gaaagataat ataggacgca gtcagccttc 3181 acctcaagca ggacttgcag gcccaggacc tgctggatac agtacagcca gaccgacgat 3241 tcctcctcgt gctggagtta tcagtgcccc acagagccac gcgcgggcat ctgctggaag 3301 actgactcct gaaagccaaa gcaaaacatc agaaacgtcg aaaggttcaa ctttccttcc 3361 tgaaccactg aagcctcagg ctgcttttcc tccgcagtct tctttgcccc cgcctgctca 3421 aaggttgcaa gagcctcttg tccctgtggc agcacctatg cctcagtctg gcccccagcc 3481 aaatttggaa accccaccac aaccaccacc tcgaagcagg tcatcccata gcttgccttc 3541 agaagcttcc tcacaaccgc aacaggagca accatcaggg taacaggtaa aaacaaatgg 3601 aatctctgat ggcaaaagag aatcaccatt aaagattgac ccatttgaag atctgtcatt 3661 taatctgctt gctgtatcaa aggctcagct atctgttcaa acgtcacctg ttcccacccc 3721 agacccaaag aggttgattc agttgccttc tgcaacgcaa agtaatgtta atactttgag 3781 ttctgtaagt tgcatgccaa caatgcctcc aattccagct cggagtcaat cccaggaaaa 3841 tatgcgaagt tctccaaacc catttattac tggcttgacc aggacaaatc ctttcagtga 3901 caggactgct gctcctggaa acccatttag agccaagtct gaagaatcag aggcaacttc 3961 atggttctcc aaagaagagc ccgttactat cagtcctttc ccttctctgc agcctcttgg 4021 tcataacaaa agcagggctt catcttcact tgatggcttt aaggacagtt ttgatctaca 4081 gggccagtct acattaaaaa ttagcaaccc gaaaggatgg gtaaccttcg aggaagaaga 4141 ggattttggt gtgaaaggga agtcaaagtc agcttgttca gacttactgg gtaatcagcc 4201 aagttcattt tctggctcca acctgacatt gaatgatgac tggaataaag gtacaaatgt 4261 ctccttctgt gtgttgccgt caagaagacc tcctccacct cctgtccctc tgctcccgcc 4321 cggcaccagc cctccagtag atcctttcac gaccttggcc tctaaggctt cacccacact 4381 ggactttaca gaaagataac gccatgcaat agaaaacagt gggtacttgc ttttggcagg 4441 atagagctaa gagaattggg cattagtatt tcattatgtg caataagtca ttgtaagtgc 4501 actgatatct tcacaaaaca ccactatttg atgtgtacag agttggacta tgtgtatatt 4561 ggaaataagg aaaaaccctt ctcattgtta actggagttt tgatgtattt ctctttggat 4621 gaataggaga cagtagtagc cataaaaagt acttatactt tagaaaacag tccttattca 4681 gaaacttttc ggtcagtctt ctgaagaatc tcaaaaagcc cacccaactt tcagctgaca 4741 tttccaccag ccctctcata cttgttaaca attggtatct ttgagtattt accaaagagc 4801 tgccaaggtt acagtgaaca gagttttgaa aggcattgct ttaaaggaaa aaagtatagg 4861 tatgtgtaca tatataatac atacaaacac atgtacttct gtatacattt acatattttt 4921 acaattcata ctttaatttc taggctataa ctcagaccaa attataccta aaagttccaa 4981 caaagtccct ttttcaatat cacattacca aaaagatggc tgcaaatgta atttggacct 5041 ttcattaatt ttgttttcaa aactagaata atctcaccac agaatcagaa ttttctaccg 5101 ttccacaccc aaccccttca aatacacaca accttgttac ttttcactcc agcaccttca 5161 tacgcttttc tccaggagga ggttcttgca gctggaaaca gcctattttg tggtcactgt 5221 caagtggatg gatattctag cgctcccaaa aaagcactat ggccttatat gcagggaagg 5281 cacataccac caagttcaat gagaaatatt agagctaacc gtactctctt ctctgcgtac 5341 gttcgagtat acgttgccca tatccctccc atattttctt tttgctgctt ttgctctgga 5401 actttgcttt tagcagggaa agcagctgtc ccctgagtgc tttgaattgg gaatataccc 5461 agtgtgtgtt ctcccccctc ttacgaggct acataacaca tctatgatgc tgctttaagt 5521 ttttagaggc tatacctcaa agtagctgcg gattttgtct cctgcactgc caatatgcaa 5581 ctgatcccgc ttttattaat tttttgaaga agtacacaga atttttacag aatgtagtat 5641 tttgatatca ttaagtaaac caatcagaaa actccttgag caatagttgt ttctttgtca 5701 gtttcagtta caatcatctt tacccattaa gacttacatt aacattcctt ttatataaag 5761 agttgtatat gtccacctaa attcctatgt ccacacttaa cctttaaaga tgtacattga 5821 gggaatatca aaaaatagcg ttcatggcta cgaatatgta gaatgttaaa agcacagcaa 5881 actgcactgc acttataaag caaatctatt agaaaaaaag catttttcta aatgctcaaa 5941 tgttatcaaa atactatatt tatagaagta atcttctctg ttaacagcca agatttgctg 6001 ttagaaataa ctcttgtgag ttttatattg tgctttttgg aggttctaat catttcagca 6061 gtagcgtctt aaacagcagt attactataa gcagttgctt caaaatgtga attaacttgt 6121 tgaaactgtg gctttaacat ccatgtgact agtgtatatg gtatttgctc tccattagca 6181 aaataattca ttgttaggta aacttcatta gtgcaaattg cagatctgtg agcaatgttt 6241 cctattgaat ataaactact gtctaaaata tacatatcag cagcccagcc tttatcagga 6301 aaattatact tggcaagttg ctgaaaatgc acaaagttat gaaagttaaa ggtatgctgc 6361 aaataactag ccattattct atgtattatt aaatatttac tagttctgtt aaaagcagag 6421 cagaagttag acactaagga tctctttgtg aactctgtgt tctctatatt agattgctgt 6481 ttatatgtaa gaattttatt gcttatgtgg catacaatat ttataactat aaactttata 6541 gaagtacagt attaaagtca gtggtacaca gacattctgt acatatcctg tgaaacgtgc 6601 tgtcatatga aataaatata tctgtcttta c // LOCUS AB007873 6530 bp mRNA PRI 05-DEC-1997 DEFINITION Homo sapiens KIAA0413 mRNA, complete cds. ACCESSION AB007873 NID g2662106 KEYWORDS KIAA0413. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HG3242. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ishikawa,K., Nagase,T., Nakajima,D., Seki,N., Ohira,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VIII. The complete sequences of 77 new cDNA clones from brain which can code for large proteins in vitro JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 6530) AUTHORS Ohara,O. TITLE Direct Submission JOURNAL Submitted (06-OCT-1997) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, DNA Technology; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1..6530 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HG3242" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 364..2247 /gene="KIAA0413" CDS 364..2247 /gene="KIAA0413" /codon_start=1 /db_xref="PID:d1024590" /db_xref="PID:g2662107" /translation="MAAAVLTDRAQVSVTFDDVAVTFTKEEWGQLDLAQRTLYQEVML ENCGLLVSLGCPVPKAELICHLEHGQEPWTRKEDLSQDTCPGDKGKPKTTEPTTCEPA LSEGISLQGQVTQGNSVDSQLGQAEDQDGLSEMQEGHFRPGIDPQEKSPGKMSPECDG LGTADGVCSRIGQEQVSPGDRVRSHNSCESGKDPMIQEEENNFKCSECGKVFNKKHLL AGHEKIHSGVKPYECTECGKTFIKSTHLLQHHMIHTGERPYECMECGKAFNRKSYLTQ HQRIHSGEKPYKCNECGKAFTHRSNFVLHNRRHTGEKSFVCTECGQVFRHRPGFLRHY VVHSGENPYECLECGKVFKHRSYLMWHQQTHTGEKPYECSECGKVFLESAALIHHYVI HTGEKPFECLECGKAFNHRSYLKRHQRIHTGEKPFVCSECGKAFTHCSTFILHKRAHT GEKPFECKECGKAFSNRKDLIRHFSIHTGEKPYECVECGKAFTRMSGLTRHKRIHSGE KPYECVECGKSFCWSTNLIRHAIIHTGEKPYKCSECGKAFSRSSSLTQHQRMHTGKNP ISVTDVGRPFTSGQTSVTLRELLLGKDFLNVTTEANILPEETSSSASDQPYQRETPQV SSL" BASE COUNT 1644 a 1400 c 1429 g 2057 t ORIGIN 1 gtcggggccg ctttgcaggt ccctagtcag gaccgagcag gggagtagga taggaatccc 61 cgccgcacct ttgtacgagc ctgacccctt ccgtgggttt gttcctgggt cgccgtcaag 121 ctgcggtctc tcctcccccg cccttcagcc ccgcggtctc caggggcggc gccctgggtc 181 tggaacgcgg ttgccaccga ggaggcggcg gccctgcgtc tggaacgccg ttgccaccga 241 ggaggcggcg gccccgagcg cgcctggaag ccccgggcaa ccggccaggg tcgggcacag 301 gtggggtccg tcaggccgcc cggggctcct ctgtcccagc tctgcggccc agggggtgac 361 gtgatggcgg cagcggtgct gacggaccgg gcccaggtgt ctgtgacctt tgatgatgtg 421 gctgtgactt tcaccaagga ggagtggggg cagctggacc tagctcagcg gaccctgtac 481 caggaggtga tgctggaaaa ctgtgggctc ctggtgtctc tggggtgtcc tgttcccaaa 541 gctgagctga tctgccacct agagcatggg caggagccat ggaccaggaa ggaagacctc 601 tcccaagaca cctgtccagg cgacaaagga aaacctaaga ccacagaacc taccacttgt 661 gagccagcct tgtcagaggg aatctcactt cagggacaag tgacacaagg aaactcagtg 721 gactcacagt tggggcaagc cgaggatcag gatgggctat cagaaatgca ggaaggacac 781 ttcagaccag gaatagatcc ccaggagaag tctcctggga agatgagccc tgaatgtgat 841 ggtttaggga cagctgatgg tgtgtgttca aggattggac aggagcaagt ctctccagga 901 gatagagtcc gtagccataa ctcatgtgag tcaggtaaag atcccatgat tcaggaagag 961 gaaaataact ttaaatgcag tgaatgtgga aaagtattta acaagaaaca cctccttgct 1021 ggacatgaga aaattcactc tggagttaag ccctatgaat gcacagaatg tgggaaaacc 1081 tttattaaga gcacacatct cctgcaacat cacatgatcc acactgggga gaggccctat 1141 gagtgcatgg agtgtggaaa ggccttcaac cgcaagtcat accttaccca gcaccagcgg 1201 attcacagtg gagagaagcc ttacaagtgc aatgaatgcg gaaaggcctt cacccaccgc 1261 tccaattttg tcttgcataa caggagacac actggagaaa aatcctttgt gtgcacagaa 1321 tgtggccaag tctttcgaca taggccaggc tttctccggc actatgttgt ccacagtggt 1381 gagaatccct atgagtgctt ggagtgtggc aaggtcttca aacacaggtc atatctcatg 1441 tggcaccagc agactcatac cggggagaag ccctatgagt gcagtgaatg tgggaaggtc 1501 ttcttggaga gtgcagccct gattcaccac tatgtcatcc acactggaga gaagcccttt 1561 gagtgcctcg agtgtgggaa ggctttcaac caccgatcct acctcaagag gcaccagcgg 1621 attcacactg gggagaagcc cttcgtgtgc agtgaatgtg gaaaggcctt cacccactgc 1681 tctactttta tcttgcataa aagggcccac actggagaaa agcctttcga gtgcaaagag 1741 tgtgggaaag cctttagcaa tcggaaggac ctcattcgcc acttcagcat ccacactgga 1801 gagaagccct atgagtgcgt ggagtgtgga aaggccttca cccgcatgtc gggcctcacg 1861 aggcacaagc ggattcatag tggagagaag ccctatgaat gtgttgagtg tgggaaatcg 1921 ttttgctgga gcacaaacct cattcgacat gccattatcc acactggaga gaagccctat 1981 aaatgtagtg aatgtggaaa ggccttcagt cgcagctcgt ccctcactca gcatcaaagg 2041 atgcatactg ggaaaaatcc catcagtgta acagatgtgg gaagaccttt tacaagtgga 2101 caaacctcag ttacccttcg agaacttctt ttagggaagg actttttgaa tgtaaccact 2161 gaggcaaata ttttgccaga ggaaacatct tcctctgcat ctgatcaacc ataccaaaga 2221 gaaaccccac aagtgtcttc actgtgagaa aaccttctgt tgctgaatat tacttgtcat 2281 ctgaagagtc atattagaaa ttcgttcagt ctagagcctt attctccatc tgataattta 2341 tcctggagag agacccagtg gttattgtgc acataggaga accttcagct gcatctttct 2401 ccttagttta cagtgcaatt ttatctcagg aattattttt aaaaggagga ggggacatag 2461 aaaaaatgaa atgcaagcac acatcttttc aggcttctct gccaagccta tggcgctttg 2521 tcatggattt cttagtgtat ttgggggaag ggaaatgttt caaggtaaag aaccttgacc 2581 ctttatgtgc ttgtatgtac atttattgct atccagtgtc agaacagtta gtttaggaaa 2641 agtatgcaaa ctttaatcgc acatcttctg tattccacaa tagtacttac tcttgagaag 2701 cataacttta tgacaactta gggggcttga gccatgaaat actcacgttt aagtcagtaa 2761 ggatacacat gttaacattc aggcttttgt cttgatgccc gtcttttggt ttacgtcatc 2821 attgtagcca tatggtaaat ttttatttgt taaattttta aaaattactt agctcaaaat 2881 gtcagtggta aatttttaaa ttgagtaaag tgattcttct ttgctcttat ttaaaatcga 2941 cagcatttct agttcctttg acattccata taatttttag gattagtttg tacttattta 3001 caaaacagtc ttgttcagat ttttacagaa attgagttaa gtctgtggat taatctgtta 3061 agaattggta tctttactat gttgggtctt gtagttcatg agtgcaggag taggcctctt 3121 aatttataat agttaagcat tccttaaagt atttcattaa tgttaaaaat tttcagcgta 3181 tagatcccta tgcatttttg ttagaatttg catcctaggg tgcttttttg tttgtgtgtg 3241 tgtgtaattt gagtgattgt aaatggcatt gatgtttgaa ctttgaattg cacacaattt 3301 ttggtagtat acagaaatat aatatttttg gacattgttt tgctaccctg caaccttatt 3361 gagttcactt attattttta gctatttctt tcgacagaat ccttggaatt ttctatatag 3421 aaatgtcatc tgcaaataca gaccattttt tatctttcat cttacctgta atcctagcac 3481 tttgagacca ctaaggcggg tggatcacct gaggccagga gttcgagacc agcctggcca 3541 acatggcaaa accctgtcta ctaaaaatac aaaaattagc cgggcatggt ggcgtgtgcc 3601 tgtggtccca gctgctgggg ggctgaggca ggaggatcag ttgaacgtgg gaggcggagg 3661 ttgcagtgag ctgagatctc accactgcac tccagcctgg gcagcagagc gagactctgt 3721 ctcaacaaca acaacaaaaa gtcctgaaca tgattgtgga agtgtgttgc tctttcaagt 3781 tctatcactt tttgtttgca aagttcaaag ctgtattgtt tggtacatat acatgtaggt 3841 ttgccaagtc tttgtggtga attgactctt ctgtcattat gtgatgtcat ttttttgcct 3901 tttaatagtc ttgtcaatac tttacctgat gttctcatag tgactcctgc atattttgat 3961 taatgtttgc atggttaata tttcttcatt ttattttaaa gcttacctgt atcattactt 4021 atgaagtcag tttctttgaa cagcatatac tcaggccatg ctttttttat tcattctgca 4081 tatgtctctc ttaattggta tgttgaaatg atttacatta aaataattat tgatatttta 4141 gggcttaagt gtgcccttaa atgatttttg tgttcttttt tattgttcct ctgttatttg 4201 gggttgttta ctagtcttcc tataggttac ttcacctttt ttttttttaa taatttgatt 4261 tgatacatat gtagtgtttt ttagtataga tcctccttga cttatgatgg ggttatgtca 4321 caataaaccc attgcaagtt gaaaatacta tgtcaaatat gcatttaata cacctaccct 4381 gctgaacatc atagccgatc ttgccttcag aatgctcaga aaatttacat tagcctgcag 4441 ttgggcaaaa tcatctaaca caaggcctac tttataataa agttactgca aagaatttta 4501 aataaaaatt caagtgtggt ttctactgaa tgcatgtcgc attcgcacca ttgtaaagtc 4561 aagtagtaag tcgaaccatc ctaagtcagg gactgtctgt atatatctgc attttttagt 4621 gattgttcta ctattacagt gtaaatatat aacttatgac agtttaataa gttatcagca 4681 atttagcact ttacttccat taggttcctt tagcttacct actaatgtaa ctatcttaag 4741 taatagtaat tcctcttcaa aaattgagcg ctatgttgca agatgtaatt tttcttttcc 4801 ttttttttga gaccaagtct cactctgtca cccacgctgg agtgctgtga tgcgatctcg 4861 gctctctgca acctctgcct cccgggttca agcaattaac tgcctcagct tccctagtaa 4921 tggattacag gcgcccgcca ccacgcctgg ctaatttttg tatttttagt agagacgggg 4981 tttcaccatc ttggccaggt tggtcttgaa ctcttgacct catgatccac ccgcctcggc 5041 cccccaaagt gctggggtta caggtgtgag ccactgcacc cggccacaag atgtaatttt 5101 tactttatct ctcacacgta ttttacaaaa cgtcatgaga aacattgtct cttgaccttt 5161 tttttttttt ttttttttta aagagacaga gtctcactct gtcacccagg ctggagtgca 5221 gtggcacgat cttggctcac tgcaacctcc gcctcctggg ttcaagcgat tctcctgcct 5281 cagcctcctg agtagctggg attacaggtg tgcgccgcca cacccagcta attttgtatt 5341 tttagtagag acggggtttc accatgttgc tcaggctggt ctcaaactcc tgaccttgtg 5401 atccgccaac cttggcctgt cgacctcttt acccattcca ttgttcattc ttctttcttg 5461 aaatcccaaa ccttctatta acatttcttt tcagttatac actgaacttt actgtagcct 5521 ttcttctaga gtaacaaatg ttctttgttt tccttcctct aaagatgtct ttatgtttcc 5581 ttcattccca aagaatattt ttgtggaata taagattcag agttggcagt tgttttcttt 5641 tagacttcag agatgtatct ctctgttctg tacattattg tttaatataa gaaacctact 5701 accattcaaa taatcattcc ctattttaca atgcatcagt tctgtcaagc ttcattcaaa 5761 gtgtttttca ttgtctctag tgttcagaag tttggctgtg atgtgcgtgg catggaagtt 5821 tttgggtgta ttctatttgg cgctccctgg tgcttgccca gctttttgat ctgtaggatt 5881 atgccttttg caaaatttgg ggaactttca actattattt cttcaaatat tttttcaccc 5941 cccagtcttg tcttttttag ggacttcaat aacatgagtg gcagatcttg ttttacactc 6001 ccatgggtcc ttcaggctct catctttttt ctttttccag tctattttct gtcttgttaa 6061 tattgattaa tttttattga ccttccatgg tcctcactga ttgttttctt tgtcatacct 6121 aatctgttga gtttgtgcag tgagttttca ttttggtttt gtattttcca gttgtttaat 6181 ttccattggg tgggttcttt tgtacacctt ctgtttcttt gcttattttt taacgccaaa 6241 gaaagactct cagagaatag acaactatat tccaaagtca tggttctctg gtggtttgtc 6301 ttgacatttg aatagaaatg ttaaactatc tgggggaata gaaagcccac agtcttctga 6361 gttgtgctac accaatattt ctatgaacag atcttacaac tgagagtgat ctgcagattt 6421 ttcagagtca tgttctccat ggaatgtttg taaaattccc tagctctctg cactgagctg 6481 agatcgtgcc actgcactcc agcctgggca acagagcgag actccatctc // LOCUS AB007875 5725 bp mRNA PRI 05-DEC-1997 DEFINITION Homo sapiens KIAA0415 mRNA, complete cds. ACCESSION AB007875 NID g2662110 KEYWORDS KIAA0415. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HH0161. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ishikawa,K., Nagase,T., Nakajima,D., Seki,N., Ohira,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VIII. The complete sequences of 77 new cDNA clones from brain which can code for large proteins in vitro JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 5725) AUTHORS Ohara,O. TITLE Direct Submission JOURNAL Submitted (06-OCT-1997) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, DNA Technology; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1..5725 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HH0161" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 1133..2536 /gene="KIAA0415" CDS 1133..2536 /gene="KIAA0415" /codon_start=1 /db_xref="PID:d1024592" /db_xref="PID:g2662111" /translation="MEPGTNSFRVEFPDFSSTILQKLNQQRQQGQLCDVSIVVQGHIF RAHKAVLAASSPYFCDQVLLKNSRRIVLPDVMNPRVFENILLSSYTGRLVMPAPEIVS YLTAASFLQMWHVVDKCTEVLEGNPTVLCQKLNHGSDHQSPSSSSYNGLVESFELGSG GHTDFPKAQELRDGENEEESTKDELSSQLTEHEYLPSNSSTEHDRLSTEMASQDGEEG ASDSAEFHYTRPMYSKPSIMAHKRWIHVKPERLEQACEGMDVHATYDEHQVTESINTV QTEHTVQPSGVEEDFHIGEKKVEAEFDEQADESNYDEQVDFYGSSMEEFSGERSDGNL IGHRQEAALAAGYSENIEMVTGIKEEASHLGFSATDKLYPCQCGKSFTHKSQRDRHMS MHLGLRPYGCGVCGKKFKMKHHLVGHMKIHTGIKPYECNICAKRFMWRDSFHRHVTSC TKSYEAAKAEQNTTEAN" BASE COUNT 1648 a 1155 c 1361 g 1561 t ORIGIN 1 gagaaggtaa acgagcaaga ctaatactaa tgagaggatc acctgagccc aggagtttga 61 gactacagta gctatgatgg tgccactaca ctccagcctg ggtaaaagag tgagacctca 121 tctctttaaa aaaaaattct gccaggagcg gtggttcatg cctataatcc caacacttta 181 ggaggctgag gctggtggat cacttgaagc caggaattca agacgaacct gggcaaaaag 241 tgagacccct gtctctacaa aaaaaattaa ttttttaaaa aaactagctg ggcagctggc 301 gtggtggctc acacctgtaa tcccagcact ttgggaggct gaggtgggtg gatcatgcgg 361 tcaggagttc gagaccagcc tggccaacat agtgaaaccc catctctact aaaaatacaa 421 aaaaaaaaaa ttaaccaggt gtagtggcgg gcgcctataa tcccagctac ttgggaggcc 481 aaggcaggag aatcgctcga aaccagaagg tggaggtggc agtgagccga gatcacacca 541 ttgcactcca gcctgggcaa caagagcaaa actctatctc aaaaaaaaaa aaaaaaaact 601 agctggacat ggtggcatgt acatacagtc ccagctactc aggaggctga agcaggatga 661 tagcttgatt acccaagagt tcaaggctgc agtgaactat gattgtgcca ctgcactcca 721 gcctgggcaa cagagcaaga ccccatctcg tttaaaaaaa aattatatta tttgaatggg 781 tatcttgttt gttttaactg taaatattta cagtacttgg ttttacttgg gagatttgta 841 tagcttacaa atcttgtccc tatggattca gttctcttac ccacagacac agaaccaaaa 901 tcaggagttc cacaaggagt ggtggtgagt cctgcctttg gcaaacacgc acctgcaagc 961 tttgctctag ggattgaatc ccttacccag tggggcccat aggctagagg actttgtgga 1021 actttggaac ttgtcttaac aatggaggag gggataaaat aagtttatct ttgggctgga 1081 atttgtacta atttttcttt ctcttgtagc accgaacaag atttgtgatg aaatggagcc 1141 tggaacaaac tcttttcggg tagaatttcc tgatttttcc agcaccattc tacagaaact 1201 gaaccagcag cgccagcaag gacaattatg tgacgtctcc attgttgtcc aaggccacat 1261 tttccgggca cacaaagccg ttcttgctgc cagttcaccc tacttttgtg accaggtact 1321 cctgaaaaac agcaggagaa ttgttttgcc tgatgtgatg aacccaagag tgtttgagaa 1381 cattctccta tctagttata caggacgtct agtaatgccc gctccagaaa ttgttagtta 1441 cttgacagcg gcaagcttcc tccagatgtg gcatgtggta gacaaatgca ctgaagtttt 1501 agagggaaac cctacagtcc tttgtcagaa gctaaatcat ggcagtgacc accagtcacc 1561 aagcagcagt agttataatg gcctggtaga gagctttgag ctgggctctg ggggtcatac 1621 tgattttccc aaagcccaag aactgagaga tggtgaaaat gaagaggaga gcaccaaaga 1681 cgagctgtca tcccagctca ccgagcacga atacctgccc agcaactcgt ccacagagca 1741 tgaccgcctg agcacggaaa tggcaagcca ggatggggag gagggcgcca gcgacagcgc 1801 cgagttccac tacacccggc ccatgtacag caagcccagc atcatggctc acaaacgctg 1861 gatccacgtg aagcccgagc gcttagaaca ggcttgcgag ggcatggatg tgcacgcgac 1921 ctacgacgag caccaggtca cagagtccat caacaccgtg cagacagagc acacggtgca 1981 gccttcggga gtggaggagg acttccacat cggggagaag aaagtggaag ctgagtttga 2041 tgaacaggct gatgaaagca attatgatga gcaggtggat ttctatggct cttccatgga 2101 agagttttcc ggagagaggt cagatgggaa tctaattggg cacagacagg aggctgccct 2161 cgcagcaggt tacagtgaga atattgaaat ggtaacaggg attaaagaag aagcttccca 2221 cttaggattc tcagccactg acaagctgta tccttgtcag tgtgggaaaa gtttcactca 2281 caagagtcag agagatcggc acatgagcat gcacctcggt cttcggcctt acggctgtgg 2341 ggtctgcggt aagaaattca aaatgaagca ccatctcgtg ggccacatga aaattcacac 2401 aggcataaag ccgtatgagt gtaatatctg tgcaaagagg tttatgtgga gggacagttt 2461 ccaccggcat gtgacttctt gtactaagtc ctacgaagct gcaaaggctg agcagaatac 2521 aactgaggct aactaaaaat aggatctggc ccttgagtgg catgcacaaa aataaactat 2581 ggtaattaat gcaaatctgg gcacagatga tgcgtgctac ttgctattat gagagaagct 2641 taaaaaaaaa aaggaagata tttctgaaag accagctcta agtaggccaa ttaaaaaaat 2701 ctaattcctc aaatttgtgt gttccagtcc tggcctggaa tgggtaatgg ggtgagttaa 2761 cccaccgccc agctggcaag ggaaaccttc tgactggttg tgatcgaaac aggtggacag 2821 agacacctgc acttggaact ggactccacc caccagttcc attttgggtg gcagcagctt 2881 tggatcactc attattacaa ggtcatgctg aaattttatt ttgctcttgc tatagatact 2941 taggtaatgt ggatttgttt tggtagctat ttcactgaag gaagtgctac ttatataaaa 3001 gctagaaata atgtgattcc taggatgaga aattggttaa cagagctctg ttgtctggtt 3061 ttagtctttc taaaggatat tttaactaaa actatggaga tgctaagaga gtgactttct 3121 aaatatgaaa cagataattt acggtacaag gctgacatag tgcccttgtc agtttctgta 3181 agatgccact actgtcacaa ggtgtttcag actcttgata aggcagtgtt ttgtatttta 3241 gttctaacat tgagtttgga caattttatc taattgtaat tctctagggt gccagagata 3301 ggtatttctc attggtttgc tttcccaaat cctgtttggt taattgtagc ctccatacag 3361 tggggtcttc tctgtggctg gtagacacca agctgctgtg actgaccacg gtaccacggg 3421 ctgccacagc ccctgctctg tcttagatta tggtgcttta caaagagagt ggtccatgac 3481 cacacttagt gagaaggagc cacaagttgt ggctgagagt tccctgtgaa cttgagtagt 3541 tcatagagtg ctaaggtgac actccaccaa ccagagtgag agggcagata ggcaggattc 3601 tatgagtggt tatacttaag ggggacaaaa ctgcccaaga agaatcttga gaatacactc 3661 tttcaaggtg ggggagatac tctttagagg gtacactgag ctaatactac cagttcttta 3721 tgagcactgg aatgtgtttg taaaaggagt cctaagttta gcaaggtagt ctacagaacc 3781 atgctcccac attatagata aagctgctta aacttaaaag tccacaaagc tacgccgacc 3841 agaaaaaaaa aattaaaaaa acaaacaaga aaagcaacta ttctgagacc ttttctgccc 3901 atcagttaga tgatttaggt taaaaagaaa ggtaatattg cacatgcttt taagctgtgt 3961 aacatacctg aggttatcac cagggtagga cagggtgcta ctaccatgtc atcttttcca 4021 caatcgtact gggttattta cttctaaata gaaacttttt ttcttaaaat aaaaataatt 4081 tttcttggat ttggggtgaa attttatttg aaaagtttgg ctttgctgta atgtaataga 4141 cattgctggc aatggcctct gattctcaag ctcctaacac cagggtgttt acttgttgaa 4201 cattgtctgg aaagaggaaa gaaaatactt atttaccagt taactcttgt aagcaagatt 4261 acaaacaggg atttattcac aacactgtat cattctcgat atataaaaag cactttgtat 4321 ttaaaacttt attataaata tatatatata ttgttttttt ttaaacctag aaactagata 4381 ttacctcttg gttgtttgcc acattaatag cttctcttag tattgaaacg ttactggtta 4441 gcagtctttt ctctgtgtac ctgacacacg tatactgagg ggattgtaca atcaagccta 4501 ttgtctcctt ttctttcact catggtagag gccagtgggt tttaggtatg atcctagcca 4561 ttatatttga ggagaaattg ttctattact cctactaatt tcagtactaa ggtggtgatg 4621 ccattttgtt ctgccaaaaa ctgatcaccc tctcccatgg tattagcaga gcattttctg 4681 cctgtttgga aggtttgatg tcctgtttct cattgaagac tatttacatg atcattagga 4741 cattgcagga gaagtctgag aggtaaaaat acagatattc tgggagagtc gtggtccttc 4801 agttctgctg aaatcagcat agtgcccttg tcatgaggaa gagtttctgt tcagccaaga 4861 gtggtggcac gcttgggtgg tagttttgga agcagtcagt tgtgctagga cttatttaat 4921 atgttgtgaa gagaagagtg tcttcttgaa agccttatgt gtccatcagc tactaaatgt 4981 agaacttaaa taagttgctc acatctgttc ttttagtgtt ttgtggtatt tgaggttttg 5041 gcaaaaattg ggattttttt atcaggcagc cagagcctgg gaggtggtag ggtgtctgaa 5101 atgctggcca tgttcagaga ggcaggagaa gggggttgct ttcatttgaa tattaaaagt 5161 gaatttttgt aaactctggt ttttaccttt ttttcatccc cacgttgagt tggaggaata 5221 gtctcttctc ttcacctcaa tatagcttta gaaaatcttt atccttccta ataagtttgg 5281 atggttgtgg gtaacattgt tcaaacaatc tttcagggat cacgtcaatg gcctacaacc 5341 aagctatttg tcccctactt tgagtcttaa ctgtggtttt tcttcaatcc ccatgggaaa 5401 gggcttcaag gcaccaccag tggtatttaa tattcatact tggggccagg catggtggct 5461 cacgcctgta atcccagcac tttgggaggc cgagatgggt ggatcacctg aggtcaggag 5521 tttgtgacca gcctgaccaa catagtgaaa ccccatctct actaaaaata caaaaattag 5581 ccaggcgtgg tggcgcacac ctgtaatccc agctactcgg gaggcggagg caggagaatc 5641 acttgaacct gggaggcgga ggttgcagtg agccgcgatt gcaccaccac actccagcct 5701 gcgcgacaga tcgagactct gtctc // LOCUS AB007877 5572 bp mRNA PRI 05-DEC-1997 DEFINITION Homo sapiens KIAA0417 mRNA, complete cds. ACCESSION AB007877 NID g2662114 KEYWORDS KIAA0417. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HH0236. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ishikawa,K., Nagase,T., Nakajima,D., Seki,N., Ohira,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VIII. The complete sequences of 77 new cDNA clones from brain which can code for large proteins in vitro JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 5572) AUTHORS Ohara,O. TITLE Direct Submission JOURNAL Submitted (06-OCT-1997) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, DNA Technology; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1..5572 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HH0236" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 410..1960 /gene="KIAA0417" CDS 410..1960 /gene="KIAA0417" /codon_start=1 /db_xref="PID:d1024594" /db_xref="PID:g2662115" /translation="MGLHFKWPLGAPMLAAIYAMSMVLKMLPALGMACPPKCRCEKLL FYCDSQGFHSVPNATDKGSLGLSLRHNHITELERDQFASFSQLTWLHLDHNQISTVKE DAFQGLYKLKELILSSNKIFYLPNTTFTQLINLQNLDLSFNQLSSLHPELFYGLRKLQ TLHLRSNSLRTIPVRLFWDCRSLEFLDLSTNRLRSLARNGFAGLIKLRELHLEHNQLT KINFAHFLRLSSLHTLFLQWNKISNLTCGMEWTWGTLEKLDLTGNEIKAIDLTVFETM PNLKILLMDNNKLNSLDSKILNSLRSLTTVGLSGNLWECSARICALASWLGSFQGRWE HSILCHSPDHTQGEDILDAVHGFQLCWNLSTTVTVMATTYRDPTTEYTKRISSSSYHV GDKEIPTTAGIAVTTEEHFPEPDNAIFTQRVITGTMALLFSFFFIIFIVFISRKCCPP TLRRIRQCSMVQNHRQLRSQTRLHMSNMSDQGPYNEYEPTHEGPFIIINGYGQCKCQQ LPYKECEV" BASE COUNT 1818 a 1183 c 979 g 1592 t ORIGIN 1 ctcggaaaag cttttcagta aatgcactca tgctgctcca ggagctcttc tctgcaacaa 61 gccacccatc tgccatcaac aagttaagac caagagaact aacggctgcg gaaatgagac 121 tgaagctttg attaaccagc tgagcagtta gaggtggaga aatttaataa cttatattca 181 aaactgattt agggatcaga gcaatcacag aaaatgaaac caatgctttc cagagggata 241 tcctaaggaa aaaacaactc gtcctggtgg attcaatttt acttggaaag cctgggacac 301 agaaaacagg aaactggtca aaggctccta catgttagag ccttttacag actcactgcg 361 ttgagtctaa caaccgcgac tgaatgcagc ctccaatgtg ctcagaagaa tgggcttaca 421 tttcaagtgg ccattagggg cccctatgct ggcagcaata tatgcaatga gtatggtttt 481 aaaaatgctg cctgccctgg gtatggcgtg tccacccaaa tgccgctgcg agaagctgct 541 cttctactgc gactctcagg gcttccactc agtgccaaac gccacagaca agggctctct 601 gggcctgtcc ctgaggcaca atcacatcac agagctcgaa agagatcaat ttgccagctt 661 cagtcaactt acttggctcc acttagatca caatcaaatt tcaacagtaa aagaagatgc 721 ttttcaagga ctatataaac ttaaggaatt aatcttaagt tccaacaaaa tattttactt 781 gccaaacaca acttttaccc aactgattaa cctgcaaaat ttggacctgt cttttaatca 841 gctgtcatct ctgcacccag agctcttcta tggccttcgg aagctgcaga ccttgcattt 901 acgttccaac tccctgcgga ctatcccagt acgcctgttc tgggactgtc gtagtctgga 961 gtttctggat ttgagcacaa atcgtttgcg aagtttggct cgcaatggat ttgcaggatt 1021 aattaaactg agagagcttc acctagagca caaccagctg acgaagatta attttgctca 1081 tttcctacgg ctaagcagtc tgcacacgct cttcttacaa tggaacaaaa tcagcaactt 1141 gacatgtggg atggagtgga cctggggcac tttagaaaag ctagacctga ctggaaatga 1201 aatcaaagcc atcgacttga cagtgtttga aacgatgccc aatcttaaaa tactactcat 1261 ggataacaac aagttaaaca gccttgattc caagatctta aactccctga gatccctcac 1321 aaccgttggt ctctctggca atctgtggga atgcagcgcc cgaatatgtg ctctggcctc 1381 ctggctgggc agtttccaag gtcggtggga acactccatc ctatgccaca gtcctgacca 1441 cacccaagga gaggatattc tagatgcagt ccatggattt cagctctgct ggaatttgtc 1501 aaccactgtc actgtcatgg ctacaactta tagagatcca accactgaat atacaaaaag 1561 aataagctca tcaagttacc atgtgggaga caaagaaatc ccaactactg caggcatagc 1621 agttactacc gaggaacact ttcctgaacc agacaatgcc atcttcactc agcgggtaat 1681 tacgggaaca atggctttat tgttttcttt cttttttatt atttttatag tgttcatctc 1741 caggaagtgc tgccctccca ctttaagaag aattaggcag tgctcaatgg ttcagaacca 1801 caggcagctc cgatcccaaa cacgactcca tatgtcaaac atgtcagacc aaggaccgta 1861 taatgaatat gaacccaccc atgaaggacc cttcatcatc attaatggtt atggacagtg 1921 caagtgtcag cagctgccat acaaagaatg tgaagtataa tatctaccca tcatcaaaaa 1981 tcacatcaga taagtaacct attttacata gtagaggcta aatacatatc taatttttac 2041 caatggtgac attaagccta attttccaaa ctaagtggag acttagtttt tgaagtgttg 2101 aagtattttt aattttttta aatgaaacca tattttaagt gttaaatgaa tcaatgctca 2161 cattaatttg cactcctgtt ggaaagtcta aaatgcttac ttcaaaataa gaaatgtacg 2221 taattatata caatcgtgtg taaaccttta cactaaggtc tccatatact atttttttct 2281 actgaaaaca atttagaaag aagctattgg gcagaaacag atatagatca atacctgttt 2341 gatcactgct ctccatccca tgtaccacaa ctatcttgct gcttaaaagg agacttagta 2401 aagttctctt gtatgataat ttggtattta ctcaaatctt caatttcttg ccagggtggg 2461 aagtagaatt tcatgtatgc tgaagactgg taaatattaa acattctcct tccagagttt 2521 ctgcctgggt tagtagatat tatcaaagtc catatcatga attcagaacc ctttaatgta 2581 attctaataa gctgagtgac tctttaatat atttacacaa tgaatccaag tgactgtgaa 2641 aaggtctcat tacaatgaaa ccaatcctaa ataattacac caacttctta tacatttctc 2701 gtttgactta taatggacat gatttttgtg gcatttagac aactgtttta aactagcatt 2761 aaactgtcat tgtactaatt aatgtactaa atcctatgtt tacattaata tgtgaaaaaa 2821 gatttagaag atattttgga gataaacaag acaaactgag tcaatttaac aatcctgaca 2881 tgctccctcc aatttaaaag cacacacaca cacacacaca cacacacaca cactctcgcg 2941 ctctctctct caactatcat gatcaaagat gccttgacaa aggggtttaa acctctttct 3001 tctgcttttt cctgatcttc aaacctcaaa gagccaagtt aaaaataatt ggctagcaac 3061 aggcttaaca ctgattctta ttgtattaca aaggaaccat gaaaaaaaac actttctaac 3121 tattatataa tacatttcga tctttcaata aagatacgca ttcacgctgt aagtgtagtt 3181 cagtgtaggt gaagaaaaca gcaccactga gatgaatctc atggcagaac aactcgacat 3241 gcctatgcag ccacacagtg gtaacttgaa ggcagcacag aggtgaggga ttaaaaggaa 3301 aggcagattt tgaaatctga ctctcaggtt agctggagtt acaaggcagc caacaatgat 3361 tatacacaat ctgcaccaaa cagggaaagg acctgctgct gaaaagggaa gaggggtaga 3421 tcctccaatg taaatataaa catgccatag ttaacaaatg ataaggatgc cacatttttc 3481 atgcagaaaa gaaccacagg tattcagttt cttgtattat gatttctaaa gaaaacctat 3541 taaatatttg gaatttagat gcaagctact cattcacaca gaacttttcc ataaacattg 3601 ttccacatat tgaaaaaact gtaaaattat agcactaact tgattttaga aaatgagcat 3661 tttttacaac tgccatttac tgtatgtaga aaaaagacat aaaaccttag agaaagccac 3721 taccattttt caagttatct tttccccaac atcaacatca acatgatatc cttcattcag 3781 ggcatatttt ttcttcccag gatcaaactg aaaacatgtt aaggctatag tttagccagt 3841 aagatttaat cacaattttt cttctttaat actaagagat gggtacctta gtacccttca 3901 cctaacatct ctcggtaaaa agcagacaag atcacaaatt acagtaaaaa tgtgctcttt 3961 tttaaggtgg gcagattagt ttgtcaaagt ctcagcccag ttcaattcag ttgatctcga 4021 gtggtgagta ttttgtctct tgttaaacca gaccctgggc ttgtctgggc agagacagaa 4081 gctgatttaa cagcatagag cacagcgccc tacttcccca cagacaggct gaattagaag 4141 cccctattac aaataagagc accacgagct aaaatttacc aacccattta tcaggatgac 4201 cattgccatt tgtcaggtga cagacagctt ggtcacaaca caactgtgtc ctgagcacac 4261 aagattccat cagccctgaa ggacagcaga ggagtatttt ccatcatttc aaaccacaaa 4321 caaaattaag atttaaactg tgggatgggt aagaaatccc cttctttggc tgggaatctc 4381 caccttaatc cttaaaggac cataactggc aaaatcctta gtgtacggac aaaagcaaat 4441 tggaaaagca agccatttac taattcagag gaacagcata gccaaggcta gcgaggcata 4501 ctttccagtt gaaaaaaaat caaaattttt tgccttccat ttttctgggt aatgtccttt 4561 aagaaaagtg aatgacagta gtaccaaaaa ggtattttcc tttctcattt aaaagcaaaa 4621 acctcccttt ttaattgcct gtagaaggca agattttctt tcatttttgt ccatttgtgt 4681 ttcagaatca gcacaggttt gaagattaga ctgttagtca tctgtgcttt ctttaccaac 4741 tggctagaac ggaacagcat cacagcaacg ctgcggatct tggttacaca gtgtcagagc 4801 ttttgttttt tgatggggtt ttatcctctt caccaccact acccactcca ttttttttct 4861 atttttaaga ttagaggaaa atgaccatta agaaccagaa aaataattaa tctctctgga 4921 aaaaggaaag ctaaagaaga ctatgatacc atacccatct agaaaccaag aactagtttg 4981 aaatcgttta tattcatttc tgtagcatga cagggatttc aatcaccttt ctgaaaagag 5041 tgggctgata aacttgtaaa tccacacaca actctgagaa tacccactgc cagcatcaaa 5101 aagcaaagat actaattttt taagccaaaa atctgtacta acacatgtaa tttcttaatg 5161 tgcctaagtt aatttctgta tcaattcaat aaatggaatt gactgaactt cccaacgcca 5221 ctaattatta aaatccatct gcttattcta atgctagtca ccaagagcta gactccatct 5281 ttccaataaa aatgagccct atgtagcact aggtttgaat tctaaaattc aaaacaggca 5341 tttcattttc ttaaggcaca cttcctacac acagctatca ggggaaaaag ctataaatgt 5401 cctgttcttt ttctgaagtg tggatatcaa tataaaattt gttaaagaat atttttaaca 5461 gctgacttac atgatcgttt tcctaataca ggaatacagc gacagaccta tctgaaaagt 5521 ctgtttgggg gcatactttt catatcttat tgactaaagc ccttgagcca gg // LOCUS AB007879 5504 bp mRNA PRI 05-DEC-1997 DEFINITION Homo sapiens KIAA0419 mRNA, complete cds. ACCESSION AB007879 NID g2662118 KEYWORDS KIAA0419. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HH0988. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ishikawa,K., Nagase,T., Nakajima,D., Seki,N., Ohira,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VIII. The complete sequences of 77 new cDNA clones from brain which can code for large proteins in vitro JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 5504) AUTHORS Ohara,O. TITLE Direct Submission JOURNAL Submitted (06-OCT-1997) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, DNA Technology; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1..5504 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HH0988" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 386..3208 /gene="KIAA0419" CDS 386..3208 /gene="KIAA0419" /codon_start=1 /db_xref="PID:d1024596" /db_xref="PID:g2662119" /translation="MILEQYVVVSNYKKQENSELSLQAGEVVDVIEKNESGWWFVSTS EEQGWVPATYLEAQNGTRDDSDINTSKTGEEEKYVTVQPYTSQSKDEIGFEKGVTVEV IRKNLEGWWYIRYLGKEGWAPASYLKKAKDDLPTRKKNLAGPVEIIGNIMEISNLLNK KASGDKETPPAEGEGHEAPIAKKEISLPILCNASNGSAVGVPDRTVSRLAQGSPAVAR IAPQRAQISSPNLRTRPPPRRESSLGFQLPKPPEPPSVEVEYYTIAEFQSCISDGISF RGGQKAEVIDKNSGGWWYVQIGEKEGWAPASYIDKRKKPNLSRRTSTLTRPKVPPPAP PSKPKEAEEGPTGASESQDSPRKLKYEEPEYDIPAFGFDSEPELSEEPVEDRASGERR PAQPHRPSPASSLQRARFKVGESSEDVALEEETIYENEGFRPYAEDTLSARGSSGDSD SPGSSSLSLTRKNSPKSGSPKSSSLLKLKAEKNAQAEMGKNHSSASFSSSITINTTCC SSSSSSSSSLSKTSGDLKPRSASDAGIRGTPKVRAKKDADANAGLTSCPRAKPSVRPK PFLNRAESQSQEKMDISTLRRQLRPTGQLRGGLKGSKSEDSELPPQTASEAPSEGSRR SSSDLITLPATTPPCPTKKEWEGPATSYMTCSAYQKVQDSEISFPAGVEVQVLEKQES GWWYVRFGELEGWAPSHYLVLDENEQPDPSGKELDTVPAKGRQNEGKSDSLEKIERRV QALNTVNQSKKATPPIPSKPPGGFGKTSGTPAVKMRNGVRQVAVRPQSVFVSPPPKDN NLSCALRRNESLTATDGLRGVRRNSSFSTARSAAAEAKGRLAERAASQGSDSPLLPAQ RNSIPVSPVRPKPIEKSQFIHNNLKDVYVSIADYEGDEETAGFQEGVSMEVLERNPNG WWYCQILDGVKPFKGWVPSNYLEKKN" BASE COUNT 1246 a 1543 c 1606 g 1109 t ORIGIN 1 cacaaatctg tgattacctg gcgggtgaac cgctccggtg gcggggagag cgggctccca 61 gcgctgggta ggggccgggt tccggcgagc gccatcccgg agcgtcagtt tcccagtttg 121 ggaagtgagg agaacctgcc tcgcccttcc ccgccaaggc ttagggaagg gactatggca 181 gttccaagag gaaatcaggg tctcgctctg ttgctcaggc tggattgcag tggcgtgatc 241 atgcctcact gcagcctcga cctccctggg ctcaagcaat cctcccactt cagcctccag 301 agtggctggg accacagtgt ggctgtccag ctgggctgag tcgcccaaga aggacgtgac 361 aggtgccgac gccaccgccg agcccatgat cctggaacag tacgtggtgg tgtccaacta 421 taagaagcag gagaactcgg agctgagcct ccaggccggg gaggtggtgg atgtcatcga 481 gaagaacgag agcggctggt ggttcgtgag cacttctgag gagcagggct gggtccctgc 541 cacctacctg gaggcccaga atggtactcg ggatgactcc gacatcaaca cctctaagac 601 tggagaagag gagaagtatg tcaccgtgca gccttacacc agccaaagca aggacgagat 661 tggctttgag aagggcgtca cagtggaggt gatccggaag aatctggaag gctggtggta 721 tatcagatac ctgggcaaag agggctgggc gccagcatcc tacctgaaga aggccaagga 781 tgacctgcca acccggaaga agaacctggc cggcccagtg gagatcattg ggaacatcat 841 ggagatcagc aacctgctga acaagaaggc gtctggggac aaggaaactc caccagccga 901 aggcgagggc catgaggccc ccattgccaa gaaggagatc agcctgccca tcctctgcaa 961 tgcctccaat ggcagtgccg tgggcgttcc tgacaggact gtctccaggc tggcccaggg 1021 ctctccagct gtggccagga ttgcccctca gcgggcccag atcagctccc cgaacctacg 1081 gacaagacct ccaccacgca gagaatccag cctggggttc caactgccaa agccaccaga 1141 gcccccttct gttgaggtgg agtactacac cattgccgaa ttccagtcgt gcatttccga 1201 tggcatcagc tttcggggtg gacagaaggc agaggtcatt gataagaact caggtggctg 1261 gtggtacgtg cagatcggtg agaaggaggg ctgggccccc gcatcataca tcgataagcg 1321 caagaagccc aacctgagcc gccgcacaag cacgctgacc cggcccaagg tgcccccgcc 1381 agcacccccc agcaagccca aggaggccga ggagggccct acgggggcca gtgagagcca 1441 ggactccccg cggaagctca agtatgagga gcctgagtat gacatccctg cattcggctt 1501 tgactcagag cctgagctga gcgaggagcc cgtggaggac agagcctcag gggagaggcg 1561 gcctgcccag ccccaccggc cctcgccggc ctcttctctg cagcgggccc gcttcaaggt 1621 gggtgagtct tcagaggatg tggccctgga agaggagacc atctatgaga atgagggctt 1681 ccggccatat gcagaggaca ccctgtcagc cagaggctcc tccggggaca gcgactcccc 1741 aggcagctcc tcgctgtccc tgaccaggaa aaactccccc aaatcaggct cccccaagtc 1801 atcatcactc ctaaagctca aggcagagaa gaatgcccag gcagaaatgg ggaagaacca 1861 ctcctcagcc tccttttcct catccatcac catcaacacc acttgctgct cctcctcttc 1921 ctcctcctcc tcttccttgt ccaaaaccag tggcgacctg aagccccgct ctgcttcgga 1981 cgcaggcatc cgcggcactc ccaaggtcag ggcaaagaag gatgctgatg cgaacgctgg 2041 gctgacctcc tgtccccggg ccaagccatc ggtccggccc aagccattcc taaaccgagc 2101 agagtcgcag agccaagaga agatggacat cagcacttta cggcgccagc tgagacccac 2161 aggccagctc cgtggagggc tcaagggctc caagagtgag gattcggagc tgcccccgca 2221 gacggcctcc gaggctccca gtgaggggtc taggagaagc tcatccgacc tcatcaccct 2281 cccagccacc actcccccat gtcccaccaa gaaggaatgg gaagggccag ccacctcgta 2341 catgacatgc agcgcctacc agaaggtcca ggactcggag atcagcttcc ccgcgggcgt 2401 ggaggtgcag gtgctggaga agcaggagag cgggtggtgg tatgtgaggt ttggggagct 2461 ggagggctgg gccccttccc actatttggt gctggatgag aacgagcaac ctgacccctc 2521 tggcaaagag ctggacacag tgcccgccaa gggcaggcag aacgaaggca agtcagacag 2581 cctggagaag atcgagaggc gcgtccaagc actgaacacc gtcaaccaga gcaagaaggc 2641 cacgcccccc atcccctcca aacctcccgg gggctttggc aagacctcag gcactccagc 2701 ggtgaagatg aggaacggag tgcggcaggt ggcggtcagg ccccagtcgg tgtttgtgtc 2761 cccgccaccc aaggacaaca acctgtcctg cgccctgcgg aggaatgagt cactcacggc 2821 cactgatggc ctccgaggcg tccgacggaa ctcctccttt agcactgctc gctccgctgc 2881 cgccgaggcc aagggccgcc tggccgaacg ggctgccagc cagggttcag actcacccct 2941 actgcccgcc cagcgcaaca gcatccccgt gtcccctgtg cgccccaagc ccatcgagaa 3001 gtctcagttc atccacaata acctcaaaga tgtgtacgtc tctatcgcag actacgaggg 3061 ggatgaggag acagcaggct tccaggaggg ggtgtccatg gaggttctgg agaggaaccc 3121 taatggctgg tggtactgcc agatcctgga tggtgtgaag cccttcaaag gctgggtgcc 3181 ttccaactac cttgagaaaa agaactagca gagggcctgg gctcttccag cctcagtgtg 3241 cctctctggc cgcccactgg atgagcggtg agacgaacaa aagggaaagg aaaaaatggg 3301 ggtggggggt ggggggtgga caacattcaa cactgcagaa tgggtgacct caaagatgcc 3361 ccctgtccaa gccatcccac agctggaagg taggggatgg gggtgcccac actgagtgag 3421 gaagggaatg gaccagggag tcccaggcct gggacccaga gccaagaaag ctgagatatc 3481 ctgtgcacca tagggacttc accaatggat tacatgccat ctgggacagg ccatgtggga 3541 gaccccagtt gtgcctttgc tacagatctg gaaaagacaa ggtcatgggg gcctccagtg 3601 tcctgcccct gcttggccca gttttgattg ctggcatctt gccaccccag gtatccctgg 3661 tattgtccta agctgtattt gtgaattgtg ctggtttcct gggcattgcc acgcctacca 3721 caggtgggta cattagaagc caccactggc tttcaggctt gggggtgtct tctgagctca 3781 agcctgcttc tgggccaggc cattgtcact gttagttgaa gaaaaagcag ttcccaggtg 3841 ccagcaaaga ccatctttca taactgtcac tgtcttggcc ttgagaagag agcccgctct 3901 ccgtggggca ccccatggag gacacagtac cagagtttac agagagggtg ggcgaagcca 3961 ccggtctctt cctaatctgc acagactatt ttgggtattt ctgggcgggc agttcctttg 4021 catgtttcgg gagaggtttg ttgatttggg gcttatatgt caggcctttg gtttgcgtct 4081 tattttaggg gttgtttggg ggcctgggtg gtcggcctca catgggaagg agatgggtag 4141 tggatggggt ttctgttgta tcttgtgggc gggtgatttt gcttttgttt ttgtttcaca 4201 ttcttccccc tccacaagcc aaagtcgttt catttggttt ccactgtgtg gactgtgctg 4261 gagcttggcg cctgccagaa aaatttgggg ctaggcaagc cccaggttgc agacatggtg 4321 aagcagagaa actgttcttc tggttcctgc acaacctcag aggggcaaaa accctcccca 4381 ggaaggagga gggtgttcag gagccagact tttggagaga aggcagctcc cagcctgctg 4441 ggtgaccgcc attctgcgtg tgttccccag ctgggcaggg ctggaagcct tacgtatgaa 4501 gcatggagaa gcagccattg tccccactat gggcagaggg gggacccggc tggccccttg 4561 ggtcagactg gagccaacac cgccagccac cccctctggc ctgctggcaa tgccacaggt 4621 gcccaagaag atggaggatc cctgtgccag gagccaacct ggtcttcccg agggtcagtg 4681 ccccagtgaa gacagaagcg agagaataaa gttccctgta ggtcctctgt cacctttggg 4741 ttgtgttttt caattgttga catttcagag gggaccctcc agaagcccag ccggcttccc 4801 ccaaggactc ccccttcgct gggagtggat ttccacacgt gcctttgatt tcggacagat 4861 tgggcctcac agccaccgat tcagctgcca gggtccctgg actgggggtt ggtgttttct 4921 atagaggagg aaaggccctc cctcaccctg ctccccaccc aggcagggca gcatgggacc 4981 cagtgtctca gtgccttcaa aacccacccc cacccctacc ctaccccacc acaccccatc 5041 ccagaggcct tgcctgggca accctaagcc cctgtccctc gccatacact gatgcctggc 5101 agctagagca aatggctcgt gttctttgtc gaaggcctgt ggtgagattg ttttgtttcc 5161 ttttgttttg tgagtttgtt taaaattgaa attagttatt ttcttctgct ggacagtatt 5221 aaatagagca ggatgttgag ttaatctgct agattgcagt actaatggta gtggtttagt 5281 gtcttcatgt taatattatt tgtacttatt tgaacaataa tgataaagaa gtggttcatt 5341 attttttaat taatgcactt taaataaggt agaatggaaa aaacccagag agcaaagtgc 5401 attacttaaa gatgcagtat atacttttct catttttaaa cagcacatat ttattaagag 5461 aaaaaaagta atttatgact atttaaaata aaatttaaaa gtag // LOCUS AB007880 5399 bp mRNA PRI 05-DEC-1997 DEFINITION Homo sapiens KIAA0420 mRNA, complete cds. ACCESSION AB007880 NID g2662120 KEYWORDS KIAA0420. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HH1019. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ishikawa,K., Nagase,T., Nakajima,D., Seki,N., Ohira,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VIII. The complete sequences of 77 new cDNA clones from brain which can code for large proteins in vitro JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 5399) AUTHORS Ohara,O. TITLE Direct Submission JOURNAL Submitted (06-OCT-1997) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, DNA Technology; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1..5399 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HH1019" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 292..3267 /gene="KIAA0420" CDS 292..3267 /gene="KIAA0420" /codon_start=1 /db_xref="PID:d1024597" /db_xref="PID:g2662121" /translation="MEEYEKFCEKSLARIQEASLSTESFLPAQSESISLIRFHGVAIL SPLLNIEKRKEMQQEKQKALDVEARKQVNRKKALLTRVQEILDNVQVRKAPNASDFDQ WEMETVYSNSEVRNLNVPATFPNSFPSHTEHSTAAKLDKIAGILPLDNEDQCKTDGID LARDSEGFNSPKQCDSSNISHVENEAFPKTSSATPQETLISDGPFSVNEQQDLPLLAE VIPDPYVMSLQNLMKKSKEYIEREQSRRSLRGSMNRIVNESHLDKEHDAVEVADCVKE KGQLTGKHCVSVIPDKPSLNKSNVLLQGASTQASSMSMPVLASFSKVDIPIRTGHPTV LESNSDFKVIPTIVTENNVIKSLTGSYAKLPSPEPSMSPKMHRRRSRTSSACHILINN PINACELSPKGKEQAMDLIIQDTDENTNVPEIMPKLPTDLAGVCSSKVYVGKNTSEVK EDVVLGKSNQVCQSSGNHLENKVTHGLVTVEGQLTSDERGAHIMNSTCAAMPKLHEPY ASSQCIASPNFGTVSGLKPASMLEKNCSLQTELNKSYDVKNPSPLLMQNQNTRQQMDT PMVSCGNEQFLDNSFEKVKRRLDLDIDGLQKENCPYVITSGITEQERQHLPEKRYPKG SGFVNKNKMLGTSSKESEELLKSKMLAFEEMRKRLEEQHAQQLSLLIAEQEREQERLQ KEIEEQEKMLKEKKAMTAEASELDINNAVELEWRKISDSSLLETMLSQADSLHTSNSN SSGFTNSAMQYSFVSANEAPFYLWGSSTSGLTKLSVTRPFGRAKTRWSQVFSLEIQAK FNKITAVAKGFLTRRLMQTDKLKQLRQTVKDTMEFIRSFQSEAPLKRGIVSAQDASLQ ERVLAQLRAALYGIHDIFFVMDAAERMSILHHDREVRKEKMLRQMDKMKSPRVALSAA TQKSLDRKKYMKAAEMGMPNKKFLVKQNPSETRVLQPNQGQNAPVHRLLSRQGSICRK NPKKAAKCCDNLRRQHSLG" BASE COUNT 1816 a 938 c 1105 g 1540 t ORIGIN 1 gtaatatggc ggggaggagg aggagaaggc ggcggcggac cgagctgcgc tctgtcagta 61 ccatttgagc cattcgcttc ctgacaaggc ccgtggcgag gggagaggag ctgaaggggc 121 cgtgggggat cagtgcctgc tgtgtgctga tactgttctg tgtaatgggg attcagtgaa 181 caagactgaa aaggtacctg tacttatggc gcttacattt tggtggagga agacagacaa 241 aaatcaagga aataaacacg ataatttcag atagtgtgtg actgtgggaa gatggaggag 301 tatgagaagt tctgtgaaaa aagtcttgcc agaatacaag aagcatcact atccacagag 361 agctttctcc ctgctcagtc tgaaagtatc tcacttattc gctttcatgg agtggctatc 421 ctttctccac tgcttaacat tgagaaaaga aaggaaatgc aacaagaaaa gcagaaagca 481 cttgatgtag aagcaagaaa gcaggttaac aggaagaaag ctttactgac tcgtgtccag 541 gagattcttg acaatgttca ggttagaaaa gcacctaatg ccagtgattt tgatcagtgg 601 gagatggaaa cagtttactc taattcagaa gtcagaaact tgaatgttcc tgctacattt 661 ccaaatagct ttccaagcca tacggaacac tctactgcag caaagcttga taagatagct 721 gggattttgc cattggataa tgaggaccaa tgtaaaactg atggaataga cttagctaga 781 gattcagaag gatttaattc tccgaagcaa tgtgatagtt ccaatattag tcatgtagaa 841 aatgaagctt ttccaaagac ctcttcagca accccacaag aaactcttat ttctgatggt 901 cccttctcag taaatgaaca acaggatcta ccacttttgg cagaagtcat cccagatccc 961 tatgtaatga gtcttcagaa tctgatgaaa aagtcaaagg aatatataga aagagaacaa 1021 tctagacgca gtctgagagg tagtatgaac agaattgtta atgagagtca tttagacaaa 1081 gaacatgatg ctgttgaagt ggctgactgt gtaaaagaga aaggccagtt gacaggcaaa 1141 cactgtgtct cagttattcc tgacaaacca agccttaata aatcaaatgt tcttctccaa 1201 ggtgcttcca ctcaagcaag cagcatgagt atgccagttt tagctagctt ttcgaaagtg 1261 gacataccta tacgaactgg ccatcccact gttctagagt ctaattctga ttttaaagtt 1321 attcccacta ttgttaccga aaataatgtt atcaaaagtc ttacaggttc atatgccaaa 1381 ttacctagtc cagagccaag tatgagtcct aaaatgcacc gaagacgttc caggacatca 1441 tcagcgtgtc atatacttat aaataaccca ataaatgcct gtgaattaag ccctaaagga 1501 aaagaacagg caatggactt aattattcaa gatactgatg aaaacacaaa tgtgcccgaa 1561 attatgccaa agttaccaac tgatttagcg ggagtttgtt caagcaaggt ttatgtgggc 1621 aaaaatacat ctgaagtcaa agaagatgtg gttttaggta aatcaaatca ggtatgtcaa 1681 tcttcaggaa atcatttaga aaataaagtt actcatggac ttgttactgt ggaaggtcag 1741 ttaacatccg atgagagagg cgcacacata atgaacagta cctgtgctgc gatgccaaag 1801 ctgcatgaac catatgccag cagtcagtgt atagcaagtc caaactttgg aactgtgagt 1861 ggactcaagc cagccagtat gttagagaaa aactgcagtt tgcaaacaga actgaataag 1921 tcttatgatg taaaaaaccc ttctccttta ttgatgcaaa accagaatac gagacagcag 1981 atggacacac ctatggtgtc ctgtggaaat gaacaatttt tggataacag ttttgagaaa 2041 gttaaacgga gacttgattt agatattgat ggtttgcaaa aagaaaactg cccttatgtc 2101 ataacaagtg gaataactga acaagaaagg caacatttgc cagaaaaaag ataccctaag 2161 ggatctggct tcgttaacaa gaataaaatg ttaggaacta gttccaaaga aagcgaggag 2221 ttactaaaaa gcaagatgtt agcttttgaa gaaatgcgga agagactaga agaacagcac 2281 gcccagcaat tatcactact catagctgag caggaaaggg aacaagaaag actgcaaaag 2341 gaaatagaag agcaggagaa aatgttaaaa gagaagaagg caatgacagc ggaagcctct 2401 gagttggaca ttaacaatgc agtggaatta gaatggagaa aaataagtga ctctagtttg 2461 ctggaaacaa tgctgtctca agcggactca ctccatactt caaattcaaa tagttctggt 2521 ttcaccaatt ctgccatgca atatagcttt gtttctgcaa acgaagcacc attctacctc 2581 tggggatcat caactagtgg cttgaccaaa ctctcagtaa caaggccttt tggaagagcc 2641 aaaactagat ggtctcaagt ttttagtctg gaaatacaag caaaatttaa caaaataact 2701 gcagtggcaa aaggatttct tactcgtaga cttatgcaga cagataagct gaagcaactt 2761 cgacaaactg taaaagatac tatggaattc ataagaagtt ttcagtcaga agcaccatta 2821 aagagaggca ttgtttcagc tcaagatgct tcacttcagg aaagagtgtt agctcagttg 2881 cgagctgcct tgtacggtat tcatgacata ttctttgtaa tggatgcagc tgaaagaatg 2941 tctattctac atcatgatcg agaagttcgc aaagagaaaa tgctcaggca aatggataaa 3001 atgaaaagtc cacgagtggc tctttcagct gcaacacaga agtctcttga taggaagaaa 3061 tacatgaaag ctgctgaaat gggaatgcca aataagaaat ttctggttaa acaaaatcct 3121 tctgaaacaa gagtccttca gccaaaccaa ggacagaatg cacctgttca taggctactt 3181 agtagacaag ggagtatatg caggaaaaat ccaaagaaag cggccaaatg ttgcgacaat 3241 ttaagaagac aacattcatt aggataaaat ggggggaagg attattattc atgttatttt 3301 ccctgcccaa gactttattt aaccctggac tccgtttaca cagacaaagt gacatcagaa 3361 ggctgagcac ttatctggat catttggtca gtttggtaat tcctgctcca cacccctatt 3421 ttcctcttaa taatacgttt gggtgaagac aaattagtgt ttagtaattg catcatctct 3481 gtgcttacct atacaaacat aagtttattt tatatgccca gatgtctaca gagacccttt 3541 ttgtaaatgt caaggacatt tggatttact tttacagaat attgaaaaga taagacaaat 3601 tataaataag ttctaaaaca taaatttaat tcatctctgc atagtgattt ttgaatttga 3661 ttcaaaggga aattattgca gaagaatgcc tttccctcat tttataactt taaaaacttg 3721 gattaaccac tcaatgtcca ctttctttga cttacagtat aaccatgtag ccaattgtgg 3781 cccactaaaa tctacagaag ttaatgtggg tcaccatttt ggtcagaagc atacattcct 3841 gtcagacaac tagttgtctg acagaaatgt taggcttcat gtatgttacc ccagtactgt 3901 tagaaacatt tgtactaggt tataagatct ttctgtgaca ggtaacaaat ttggggaaga 3961 cagcacaatc ttcttgaatg tagctcttgg gaatgcatta ttacatccat ttctgtaaca 4021 taataatatg ttgcatgcag ttatattttc tatttagtct gtatattttg ttcttcatag 4081 tctgtttttt ctagcatgct tgatttagga gagaataaag ggctatataa taataaatcc 4141 agatttccgg ataagaatat tgcctggtta aaattctgca ttgcttaaag acacccatgt 4201 ttaagatttt tcatcactaa catatccatt aaaagtatca actggccagg cagggtggct 4261 cacacctgta ctcccagcat tttgtgaggc caaggtgggt ggatcacctg aggtcaggag 4321 tttgagacca gcctgaccaa acatggcgaa accccatctg tactgaaaat acaaaaatta 4381 ggcatggtgg tgcatgcctg tagttccagc tacttgggag gctgagacag gcgaattgct 4441 tgaacctggg aggcagaggt tgcagtgagc tgagactgtg ccattgcact ccagtcgggt 4501 taacagagca agacactgtc tcaaaaaaaa aaaaaaaaaa aagtatcaac caacaaatgt 4561 taccaagata acgtgacttc atgagggaga atgtcactat taatttatca taccatttcc 4621 aaaaagggct ttgtgctttt cacataaaat tgagacagtg tatatttaat ctaatttaaa 4681 ttttaaagag atactggtat tttgaaaatg caacctatat atattcttaa tatcctttta 4741 agaatatgga gatgaagatt gttttctcca attttctgtg ccattttaaa tttaactttg 4801 acatccagct atagacagaa ataataagcc accctgggtg taaacttgat tttctttatt 4861 gagatgtatc atgtattgaa tgagtgaacc agaaaattag aagatggtca aaaaaagtcc 4921 aagttaccaa ttttttaaaa tttataggca aagtatcaaa ttgtcttctt aatatgataa 4981 actgtgcttt atcattctga aactcaggat acagcttatt catagcattg tgggtctctc 5041 cagtaagaaa gatgctaaaa gttttgtgca ctttttgtgt gtgtaatgca aattagttaa 5101 aacaaatagt tttggagaaa gttaaaacta gctttagagt aaggatgaga aacttgagtg 5161 tttttaattt aaagataaaa gcctgtgttt tacacattct tttttggtgt tcatagcttc 5221 ttctcataca ggtgccagac actgtttgtg cttttgatgg atttttattt atatactttt 5281 tttgcttatt tttactttga gtggaatgtt cattaatgta aattgtattt atttttatac 5341 ttttattttc actagttttg cttctaggca aaaagcaaaa taaacttttc atcttaaag // LOCUS AB007885 5413 bp mRNA PRI 05-DEC-1997 DEFINITION Homo sapiens KIAA0425 mRNA, complete cds. ACCESSION AB007885 NID g2662130 KEYWORDS KIAA0425. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HH1267. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ishikawa,K., Nagase,T., Nakajima,D., Seki,N., Ohira,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VIII. The complete sequences of 77 new cDNA clones from brain which can code for large proteins in vitro JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 5413) AUTHORS Ohara,O. TITLE Direct Submission JOURNAL Submitted (06-OCT-1997) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, DNA Technology; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1..5413 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HH1267" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 802..2352 /gene="KIAA0425" CDS 802..2352 /gene="KIAA0425" /codon_start=1 /db_xref="PID:d1024602" /db_xref="PID:g2662131" /translation="MTLLITGDSIVSAEAVWDHVTMANRELAFKAGDVIKVLDASNKD WWWGQIDDEEGWFPASFVRLWVNQEDEVEEGPSDVQNGHLDPNSDCLCLGRPLQNRDQ MRANVINEIMSTERHYIKHLKDICEGYLKQCRKRRDMFSDEQLKVIFGNIEDIYRFQM GFVRDLEKQYNNDDPHLSEIGPCFLEHQDGFWIYSEYCNNHLDACMELSKLMKDSRYQ HFFEACRLLQQMIDIAIDGFLLTPVQKICKYPLQLAELLKYTAQDHSDYRYVAAALAV MRNVTQQINERKRRLENIDKIAQWQASVLDWEGEDILDRSSELIYTGEMAWIYQPYGR NQQRVFFLFDHQMVLCKKDLIRRDILYYKGRIDMDKYEVVDIEDGRDDDFNVSMKNAF KLHNKETEEIHLFFAKKLEEKIRWLRAFREERKMVQEDEKIGFEISENQKRQAAMTVR KVPKQKGVNSARSVPPSYPPPQDPLNHGQYLVPDGIAQSQVFEFTEPKRSQSPFWQNF SRLTPFKK" BASE COUNT 1407 a 1297 c 1285 g 1424 t ORIGIN 1 gaagagggat agggccagca aggcagggat cgaacgagtg tctggcagcc gggagcccag 61 cgaagagagc gagcaagctt aggaaaacga gcgaagtaaa gggagtaggg gagactgaga 121 ctgaccggta gccaggcagg cggacggacg cacgcccgga cagactgagc aggcgccgga 181 gaaccactca caggttcccc ccgcctttcc ctttgaaagc taggattttg cctttcccgt 241 ggcgcccgag agagaatgct ggactctgcc gacttcagcg caagctaaga tttctcagct 301 agggacaaac gatcagccca atcctgagaa ggggggaacc aagcaccccg tccccatccc 361 cctcccctcc cccgactaaa ctcgggcgcc aaacccagcc cttctctaac caccctactt 421 cctcctctcc tttctagcat ggtggctgta tggacagtct gacagaacag agactgacat 481 ctcccaatct gccggccccc cacctggaac actacagtgt tctgcattgc accatgaccc 541 tggatgtgca aactgtagtc gtttttgccg tgattgtagt cctcctgctt gtcaatgtca 601 tactcatgtt tttcctggga acgcgctgaa tggagtccag ccacctgagc tgtcgcgaac 661 tctcgctttg atttcatccc gagagccacc gagaaaaaaa aaaaatcaca gacagagaca 721 gggaaagaga gagaaagaac aagctttctt actcaggggg gaaaacgttt tgagcttcaa 781 catggcctcg ctgtgatatg tatgacgttg ctgatcactg gagattccat cgttagtgct 841 gaggcagtat gggatcacgt caccatggcc aaccgggagt tggcatttaa agctggcgac 901 gtcatcaaag tcttggatgc ttccaacaag gattggtggt ggggccagat cgacgatgag 961 gagggatggt ttcctgccag ctttgtgagg ctctgggtga accaggagga tgaggtggag 1021 gaggggccca gcgatgtgca gaacggacac ctggacccca attcagactg cctctgtctg 1081 gggcggccac tacagaaccg ggaccagatg cgggccaatg tcatcaatga gataatgagc 1141 actgagcgtc actacatcaa gcacctcaag gatatttgtg agggctatct gaagcagtgc 1201 cggaagagaa gggacatgtt cagtgacgag caactgaagg taatctttgg gaacattgaa 1261 gatatctaca gatttcagat gggctttgtg agagacctgg agaaacagta taacaatgat 1321 gacccccacc tcagcgagat aggaccctgc ttcctagagc accaagatgg attctggata 1381 tactctgagt attgtaacaa ccacctggat gcttgcatgg agctctccaa actgatgaag 1441 gacagccgct accagcactt ctttgaggcc tgtcgcctct tgcagcagat gattgacatt 1501 gctatcgatg gtttcctttt gactccagtg cagaagatct gcaagtatcc cttacagttg 1561 gctgagctcc taaagtatac tgcccaagac cacagtgact acaggtatgt ggcagctgct 1621 ttggctgtca tgagaaatgt gactcagcag atcaacgaac gcaagcgacg tttagagaat 1681 attgacaaga ttgctcagtg gcaggcttct gtcctagact gggagggcga ggacatccta 1741 gacaggagct cggagctgat ctacactggg gagatggcct ggatctacca gccctacggc 1801 cgcaaccagc agcgggtctt cttcctgttt gaccaccaga tggtcctctg caagaaggac 1861 ctaatccgga gagacatcct gtactacaaa ggccgcattg acatggataa atatgaggta 1921 gttgacattg aggatggcag agatgatgac ttcaatgtca gcatgaagaa tgcctttaag 1981 cttcacaaca aggagactga ggagatacat ctgttctttg ccaagaagct ggaggaaaaa 2041 atacgctggc tcagggcttt cagagaagag aggaaaatgg tacaggaaga tgaaaaaatt 2101 ggctttgaaa tttctgaaaa ccagaagagg caggctgcaa tgactgtgag aaaagtccct 2161 aagcaaaaag gtgtcaactc tgcccgctca gttcctcctt cctacccacc accgcaggac 2221 ccgttaaacc acggccagta cctggtcccc gacggcatcg ctcagtcgca ggtctttgag 2281 ttcaccgaac ccaagcgcag ccagtcacca ttctggcaaa acttcagcag gttaaccccc 2341 ttcaaaaaat gatacctaca gggaggcaga taattttaaa ataaagtaaa taaaattata 2401 tttatagatg gacctttttt cggagaagca ctgttgaaat ttatacacac acacacacac 2461 agagaccctt gagtacacat acacacacac acacacagac acacacacac acacacacac 2521 acacacacac acagagagat aaggaacaaa agtgttttct gttgttttgg ggaagtgaaa 2581 tatgtggttg gtaggaagag gtaccaatga cttccaaaca tgtgattccg tcttaaaagt 2641 tttccatttt taccctgtcc cccttccctt tgctttcaga agttgacatt tctattcatt 2701 gcttttcttg ttaagataat ctctttactc ccctgtgagt gattcactgc cttgtcatta 2761 ttacgataga tgtgtttgta ttgttttttt tctgatgata ctgatgttga tgaattttta 2821 attttatttg atgtggtaga gttgggaggt ttcagggttt tttcccctct tttactttcc 2881 attgaggaag ggaatgagct cctttctcct ctccttcagc caatcattat caaatgttcc 2941 ttcagccctg cagttgcccc aaataacctt ttttcagcat cctctgtcct cagtcatgcc 3001 agtctggaca tgctctgttg tgccctgtga caaaactgct cagtattcct attgctttta 3061 ctgtgtttta ggtactgtga agggatcaaa aaaccaaaca gaagcaaggg agtatcagac 3121 tatgatgatg ctggagtgga cttctgttca gggaacattt tgcattcagg ctgtttcttc 3181 tatcactggg gtttcccatg ttgcagcact tctgggtcgt tgcaattttg catctaggag 3241 ttagtttgat cgagttattc tcttttttca agtcactttt gttataggtc tccccctagg 3301 cctgtctctc ccttagccca aaagatctga actggaagca gaggttgaga ttctgcctcc 3361 caggagaggg atttacctgc cccctagtac cagataggtt tagggcagtg atctctacag 3421 caatcagttc agtgtcctgg ttgtccctgc tcccatttac agatgtttgg gcagcattga 3481 tagaagtatg gaggggttca agacagagcc cacctgatca agatcatcag ctaccttcaa 3541 attattgacc tggacagggt ccaagtctga tagtaacctt ttacaagaaa gaacagggat 3601 gggaatggaa agagatagcc ttgatccaca gtattgtacc tgcattttct accaccctaa 3661 aattgtgtga gacttctccc attgttaaca gattgcatgg acaatcttcc ctggcttctt 3721 tctttccctc tctctttctt ctttctcctg ccatcctagc acaggaggat ttttggtatt 3781 gatatagtta aagctgttct ggcactcaaa gaaggccgtg tttccaacat cctctcatcc 3841 caggacattt ggggcaagtg agttaggggc ccaggggcaa ttttccctct gaataacgtg 3901 tctgaggcag ggatgctacc ctcaggctcg cttttggcca gctttttgct tgggaaaatc 3961 taacttcttt cacaaggagg caggcttcct atggatgttg gagtacctgt ttttcctcca 4021 cacatagccc ttttcatgga tagaccttga acaacaaaaa gggtataagg gaataaggat 4081 gaactctgct gtgaagagca agccactgta gtgaggaatg tggagactgg gagtctgtcc 4141 taaaccccat gggagaagac ttcatcatga caggacttca gcttaccaag cagcagccat 4201 agctgtgtgg aggcttcagc atagctagca tgtttactgc tctatgcctc ctgatccaga 4261 ccaggcattg cccagcctgg gaatcttttc tttgtgggaa tcaaattaca agctatttaa 4321 gtttatattc catcacaacc aagtcagact tgtattataa gtcaaggatg agcctgatct 4381 ggggagaggg ccggggctcg ggactggcca ccactgttca gcacatgacc taactacgta 4441 agcctctttg gcaagggtcc tggtgcccag cacccaggct aaaatatcct gtctggcaga 4501 gtgttttggt agctatgcag gcctcccttc agtgtacctc tttttccaac ttctcactcc 4561 tccttactag gcttggcctt gacatgcttc ttcgagggtt ggcagcacac cgggagggga 4621 tgcttggaca agtttctggg cctacatttc ttgactaggc cctctcattt cctccctcct 4681 tggggcttct gcccagggct ccaggatcag ggatattact tctcaacccg cacttctcct 4741 ctactgaacc cactggcatc acctgatgcc actaatttgt gaacaacaag aaatcatttc 4801 cccattggtt ggagtattcc ctcagcctat agcatcaaag cagaccagtg gccaacagcc 4861 ccaaggggag cccaattaaa tacctgggtt cagtatccta acctgttatg tcctgacagc 4921 aatggtaacc ccagtaattc tgtaatgttg taatttccgc atggccctga gctccctttt 4981 cctcaactca gtgaggccag gatttgctct ccaaaaggct ttgctagtgt gttcaatggg 5041 acctgctgtg gggagtccta agacagacat ctaattattc tctctttttc cccccctctc 5101 tatgtgtata tttctaatgg atctataaga acagcaacaa gagagttcta acaattctag 5161 tgtgaagcca aatagtgatc ttttagtgct ttggggatgg ggtgggctgg ggtggatgga 5221 tgggcaacag tgactttgat tacccttgct gctctgcatt tgccagttta ttcttttgtt 5281 tcttttatct gactgactct gtcaaacaag tgtcaaagtt gtgtgttaaa aaatgtttaa 5341 caaaaaaaaa tgttgtaatg acacaaagcc ttatgaaaat atttatggag ttcaataaaa 5401 gaagtaaaaa gac // LOCUS AB007886 6108 bp mRNA PRI 05-DEC-1997 DEFINITION Homo sapiens KIAA0426 mRNA, complete cds. ACCESSION AB007886 NID g2662132 KEYWORDS KIAA0426. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HH1272. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ishikawa,K., Nagase,T., Nakajima,D., Seki,N., Ohira,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VIII. The complete sequences of 77 new cDNA clones from brain which can code for large proteins in vitro JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 6108) AUTHORS Ohara,O. TITLE Direct Submission JOURNAL Submitted (06-OCT-1997) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, DNA Technology; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1..6108 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HH1272" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 173..3847 /gene="KIAA0426" CDS 173..3847 /gene="KIAA0426" /codon_start=1 /db_xref="PID:d1024603" /db_xref="PID:g2662133" /translation="MNKMLPSVPATAVRVSCSGCKKILQKGQTAYQRKGSTQLFCSTL CLTGYTVPPARPPPPLTKKTCSSCSKDILNPKDVISAQFENTTTSKDFCSQSCLSTYE LKKKPIVTINTNSISTKCSMCQKNAVIRHEVNYQNVVHKLCSDACFSKFRSANNLTMN CCENCGGYCYSGSGQCHMLQIEGQSKKFCSSSCITAYKQKSAKITPCALCKSLRSSAE MIENTNSLGKTELFCSVNCLSAYRVKMVTSAGVQVQCNSCKTSAIPQYHLAMSDGSIR NFCSYSCVVAFQNLFNKPTGMNSSVVPLSQGQVIVSIPTGSTVSAGGGSTSAVSPTSI SSSAAAGLQRLAAQSQHVGFARSVVKLKCQHCNRLFATKPELLDYKGKMFQFCGKNCS DEYKKINNVMAMCEYCKIEKIVKETVRFSGADKSFCSEGCKLLYKHDLAKRWGNHCKM CSYCLQTSPKLVQNNLGGKVEEFCCEECMSKYTVLFYQMAKCDACKRQGKLSESLKWR GEMKHFCNLLCILMFCNQQSVCDPPSQNNAANISMVQAASAGPPSLRKDSTPVIANVV SLASAPAAQPTVNSNSVLQGAVPTVTAKIIGDASTQTDALKLPPSQPPRLLKNKALLC KPITQTKATSCKPHTQNKECQTEDTPSQPQIIVVPVPVPVFVPIPLHLYTQYAPVPFG IPVPMPVPMLIPSSMDSEDKVTESIEDIKEKLPTHPFEADLLEMAEMIAEDEEKKTLS QGESQTSEHELFLDTKIFEKDQGSTYSGDLESEAVSTPHSWEEELNHYALKSNAVQEA DSELKQFSKGETEQDLEADFPSDSFDPLNKGQGIQARSRTRRRHRDGFPQPRRRGRKK SIVAVEPRSLIQGAFQGCSVSGMTLKYMYGVNAWKNWVQWKNAKEEQGDLKCGGVEQA SSSPRSDPLGSTQDHALSQESSEPGCRVRSIKLKEDILSCTFAELSLGLCQFIQEVRR PNGEKYDPDSILYLCLGIQQYLFENGRIDNIFTEPYSRFMIELTKLLKIWEPTILPNG YMFSRIEEEHLWECKQLGAYSPIVLLNTLLFFNTKYFQLKNVTEHLKLSFAHVMRRTR TLKYSTKMTYLRFFPPLQKQESEPDKLTVGKRKRNEDDEVPVGVEMAENTDNPLRCPV RLYEFYLSKCSESVKQRNDVFYLQPERSCVPNSPMWYSTFPIDPGTLDTMLTRILMVR EVHEELAKAKSEDSDVELSD" BASE COUNT 1852 a 1245 c 1261 g 1750 t ORIGIN 1 caaaaaagtg gtgcagtttt tgatgaaatt gtagagaact gagtatagtc atggccaaca 61 gcaaaaaact caagaggggg aactgaaaat tagtgctgtg ttttcagtca gtggcagccc 121 tcttgctcca cagttgacta ctggctttca gccctcactg gcgtcatctg gcatgaataa 181 aatgcttcct tcagttccag ccacagctgt tcgagtttcc tgttctggtt gtaaaaaaat 241 cctccagaag gggcaaactg cttatcagag gaaagggtct actcagctat tctgctccac 301 actgtgcctc actggatata cagttccacc tgcccgccca ccgcctcctc tcaccaagaa 361 aacttgttca agttgctcaa aagacatttt aaatccaaag gatgtgatca gtgcccagtt 421 tgaaaacacc accactagta aagatttttg cagtcagtca tgtttgtcaa catatgaact 481 gaaaaaaaaa cctattgtta ccataaatac aaatagtatt tcaaccaaat gcagcatgtg 541 tcagaagaat gctgttattc gacatgaagt taattaccag aatgtggtcc ataaactttg 601 cagtgatgcc tgcttctcta agtttcgttc tgctaacaac ctcaccatga actgttgtga 661 gaactgtggg ggttactgtt acagtgggtc gggacaatgc cacatgcttc agatagaggg 721 acagtctaag aagttttgta gttcatcgtg tatcacggca tacaagcaga aatcagccaa 781 aattacaccg tgtgcgcttt gcaaatcatt gagatcctca gcagaaatga ttgaaaatac 841 caatagcttg gggaagacag agcttttctg ttctgttaat tgcttatctg cttacagagt 901 taaaatggtt acttctgcag gtgtacaagt tcagtgtaac agttgtaaaa cctcagcaat 961 tcctcagtat cacctagcca tgtcagatgg aagtatacgc aacttctgca gctacagctg 1021 tgtggtagct ttccagaatt tattcaacaa accaactgga atgaattctt cagtagtgcc 1081 cttgtctcag ggccaagtaa ttgtaagcat ccccacaggt tccacagtgt cagccggagg 1141 aggtagcaca tctgctgttt ctcccacctc catcagtagc tctgctgcag ctggtctcca 1201 gcgtctcgct gcccagtccc agcatgttgg gtttgcacga agtgttgtga aactcaaatg 1261 tcaacactgt aaccgtcttt ttgccacaaa accagaactt cttgactata agggcaaaat 1321 gtttcagttc tgtggcaaga attgttctga tgaatataag aaaataaata atgtaatggc 1381 aatgtgtgaa tattgtaaaa ttgagaaaat tgtaaaggag actgttcggt tctcaggtgc 1441 tgacaagtca ttctgtagtg aaggttgcaa attgctttat aaacatgact tggcaaaacg 1501 ctggggaaat cactgtaaaa tgtgcagtta ttgtttacag acatctccca aattggtaca 1561 gaataattta ggagggaaag tggaagagtt ctgttgtgaa gaatgcatgt ccaaatatac 1621 agttttgttc tatcagatgg ccaaatgtga tgcttgtaag cgacagggta aactcagtga 1681 gtccttgaaa tggcgagggg aaatgaaaca tttctgtaac ctgctttgta tcttgatgtt 1741 ctgtaatcag caaagtgtat gtgacccgcc ttcacaaaat aatgcagcaa atatttccat 1801 ggttcaagct gcttcagcag gacccccatc tctgagaaaa gattcgactc cagttatagc 1861 caatgtagta tcattggcaa gtgcccctgc tgctcagcct acagtgaatt ctaacagtgt 1921 cttacaaggt gcagttccaa cagtaacagc gaaaatcatc ggtgatgcaa gtactcaaac 1981 agatgccctg aaactgccac cttcccaacc tccaaggctt ttgaagaaca aagctttatt 2041 atgcaaaccc atcacacaga ctaaagccac ctcttgcaaa ccacataccc aaaacaaaga 2101 atgccagaca gaagacactc caagtcagcc ccagattatt gtggtgccag ttcccgtacc 2161 agtgtttgtt cccatacctc ttcaccttta tactcaatat gctccagtcc catttggaat 2221 tccagttcca atgcctgtcc ctatgcttat tccatcttca atggatagtg aagataaagt 2281 cacagagagt attgaagaca ttaaagaaaa gcttcccaca catccatttg aagctgatct 2341 ccttgagatg gcagaaatga ttgcagaaga tgaagagaag aagactctat ctcagggaga 2401 gtcccaaact tctgaacacg aactctttct agacaccaag atatttgaaa aagaccaagg 2461 aagtacatac agtggtgatc ttgaatcaga ggcagtatct actccacata gctgggagga 2521 agagctgaat cactatgcct taaagtcaaa tgctgtgcaa gaggctgatt cagaattgaa 2581 gcagttctca aaaggggaaa ctgaacagga cctggaagca gattttccat cagactcctt 2641 tgacccactt aataaaggac agggaatcca ggcacgttcc cgaacaagac gacgacacag 2701 agatggcttc ccccaaccca gacgaagagg acggaagaag tctatagtgg ctgtggagcc 2761 caggagtctt attcaaggag cctttcaagg ctgctcagtg tccgggatga cactgaaata 2821 catgtatggg gtaaatgctt ggaagaactg ggttcagtgg aaaaatgcca aggaagagca 2881 gggggatcta aaatgtggag gggttgaaca ggcctcatct agcccacgtt ctgacccctt 2941 aggaagtact caagaccatg cactctctca agaatcctca gagccaggct gtagagtccg 3001 ctctatcaag ctgaaggaag acattctgtc ctgcactttt gctgagttga gtttgggctt 3061 atgccagttt atccaagagg tgcggagacc aaatggtgaa aaatatgatc cagacagtat 3121 cttatacttg tgccttggaa ttcaacagta cctgtttgaa aatggtagaa tagataacat 3181 ttttactgag ccctattcca gatttatgat tgaacttacc aaactcttga aaatatggga 3241 acctacaata cttcctaatg gttacatgtt ctctcgcatt gaggaagagc atttgtggga 3301 gtgcaaacag ctgggcgctt actcaccaat cgtcctttta aacaccctcc ttttcttcaa 3361 taccaaatac ttccaactaa agaatgttac tgagcacttg aagctttcct ttgcccatgt 3421 gatgagacgg accaggactc tgaagtacag taccaagatg acatatctga ggttcttccc 3481 acctttacag aagcaggagt cagaaccaga taaactgact gttggcaaga ggaaacgaaa 3541 tgaagatgat gaggttccag tgggggtgga gatggcagag aatactgaca atccactaag 3601 atgcccagtc cgactttatg agttttacct gtcaaaatgt tctgaaagtg tgaagcaaag 3661 gaatgatgtg ttttaccttc aacctgagcg ctcctgtgtc ccgaatagcc ccatgtggta 3721 ctccacattc ccgatagacc ctggaaccct ggacaccatg ttaacacgta ttctcatggt 3781 gagggaggta catgaagaac ttgccaaagc caaatctgaa gactctgatg ttgaattatc 3841 agattaaaac ggaagtgagg ttcttatttt catacatatt ggtatgcacc aaactgtgaa 3901 tgcatccagc tgttggaaaa tgatgtataa gtctaagtcc tcttgacttg accataagat 3961 catggaaaac agatgacttg tgaaccccac agtgtggatg tgcaaatgaa aattgaagga 4021 aagaatatga actgagaaat gttctttggc agtgatatag ttcttagaca tcttcagaat 4081 gactaatttc tccgagtggt gcataatctt attttgtttg ggagtaacaa atcgtggaat 4141 atttttaagg aaaactgttg tataaaactt taccatagta accttagacc ttagagaggt 4201 agctttggag tgaaactttg gctgcaatag gctactttgg caagccctcc gtaaaagtca 4261 gaggagagat cagtacagag ctaagagtga catcaaatga ggactgtggg acccagattt 4321 gaagacccaa taaaaatact caacttttta aaaaagatag tgaagtggtc ttgattgatt 4381 ttgattttca ctgccaagcc aatcatgtga aggacagaag cttttgccat gggcccctca 4441 catcagggaa aatgaccttc actgctgtta acagtaatgt gtccctttca ttttctggat 4501 caagccttct cagcggtggg tctggatgtg ggtaaactaa ggtaaagggg atgatattcc 4561 acaaactaat tatgcacaca gaaaatctgt ggagcctatc agaccccaag tgtcttgaaa 4621 tgtttgtaga aacccactaa aatgcccctt ctctgggtgt gggcccttat tgcagctgtc 4681 tcacagcctg agctgtggta cagagaaatg ggggttctcc ttttattttc attttttttc 4741 cccaatggca gcttttctcc cgttgtttta ccttcctatt tcccaaacag ttcctcttat 4801 tttgtctttt gcaccagttt ctggaggccc ttgtcatttc aaaaaggata gtctcttttc 4861 ttactctggc aaacctgtga gtgattccac aaagatacag tattacttag ctatctgaat 4921 tatgatagaa aaggtcctag ttaggttcct atataaagca tttggaagat gaccttgttg 4981 cccttgaaac ttgaaaatag ggattctggg gtgaggatac aaagacattg tcttgcatat 5041 ccataagcag gtcttagagc attattccaa actctagctg tttcagtagt tctatgagga 5101 ttgcaagtca taggtgtgtg tggcatatca gtccatctcc ctcatctcca ttctcagttt 5161 cttccccaca aaatttggaa tcaaagcttt tatgacgttt gccaattgca gaacttcttc 5221 agctaaggtt aatttgacgc tatgataaaa ctgagagatg tcaaaaagcc tcttagaaat 5281 tttaatcttg aaagactttt cagggtatct cattttttag gtgggggtgg caggtgtatt 5341 tcttttttaa caaataaaag gcatttaagt aaaactaaaa tgaaaaaagt aggccttctg 5401 acattgtgta cttggtggtt ctgtccctct gcctgtaaca aatctcattt ttgttaccaa 5461 gaactgtatg aaagaagtaa atccaccccg attctgtatg attaattcca tctgtgtttg 5521 tcatttctga ctggaaaact tcttactcca taccttgttc gatatggagg acaaataatt 5581 ggattgtctg ataagtctgc caataaacta tccagaaata gcaagtgtaa tagtccccac 5641 tatacgaatt ttatggtttg tataaacact aacattttcc ccttctgtag ttgtatgaaa 5701 aaacaaatat tgttagcata gtagataaat tgttatgaaa taccagaaaa aaaaatctgt 5761 atcttttact gagaacaccc aatacccaga taaatgactg tatcaggatt tcatttgcat 5821 gttagtccac agagttgccc agaaccctaa atttattcat aagagaaaat attgattaat 5881 tattggtcat tcctcataag tgtagctgtt gatgtgtgcg tctgattatt gcttttttaa 5941 ttttatgaaa attgtgtaaa attacatttt ttttccaggg gagaaaaaaa catcaaacaa 6001 aaacatctaa atcatccttt ttgttctttt tcagttttta accactttta ggttttcccc 6061 ttacagaaac cacagaaata ttcccttaga ataaaatagt atatttgt // LOCUS AB007888 5737 bp mRNA PRI 05-DEC-1997 DEFINITION Homo sapiens KIAA0428 mRNA, complete cds. ACCESSION AB007888 NID g2662136 KEYWORDS KIAA0428. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HH1382. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ishikawa,K., Nagase,T., Nakajima,D., Seki,N., Ohira,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VIII. The complete sequences of 77 new cDNA clones from brain which can code for large proteins in vitro JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 5737) AUTHORS Ohara,O. TITLE Direct Submission JOURNAL Submitted (06-OCT-1997) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, DNA Technology; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1..5737 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HH1382" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 286..2082 /gene="KIAA0428" CDS 286..2082 /gene="KIAA0428" /codon_start=1 /db_xref="PID:d1024605" /db_xref="PID:g2662137" /translation="MENSSAASASSEAGSSRSQEIEELERFIDSYVLEYQVQGLLADK TEGDGESERTQSHISQWTADCSEPLDSSCSFSRGRAPPQQNGSKDNSLDMLGTDIWAA NTFDSFSGATWDLQPEKLDFTQFHRKVRHTPKQPLPHIDREGCGKGKLEDGDGINLND IEKVLPAWQGYHPMPHEVEIAHTKKLFRRRRNDRRRQQRPPGGNKPQQHGDHQPGSAK HNRDHQKSYQGGSAPHPSGRPTHHGYSQNRRWHHGNMKHPPGDKGEAGAHRNAKETMT IENPKLEDTAGDTGHSSLEAPRSPDTLAPVASERLPPQQSGGPEVETKRKDSILPERI GERPKITLLQSSKDRLRRRLKEKDEVAVETTTPQQNKMDKLIEILNSMRNNSSDVDTK LTTFMEEAQNSTNSEEMLGEIVRTIYQKAVSDRSFAFTAAKLCDKMALFMVEGTKFRS LLLNMLQKDFTVREELQQQDVERWLGFITFLCEVFGTMRSSTGEPFRVLVCPIYTCLR ELLQSQDVKEDAVLCCSMELQSTGRLLEEQLPEMMTELLASARDKMLCPSESMLTRSL LLEVIELHANSWNPLTPPITQYYNRTIQKLTA" BASE COUNT 1224 a 1753 c 1668 g 1092 t ORIGIN 1 gtcagatcag ggatcatttt ttttccttcc tctactccct cccccctacc cgcccctccc 61 tccctgtttc ccttccctcc ctccctcccc tctctgctgg gtctgtgcgc tggggcgccc 121 gatcccctcc gcagctggga cgctccgaac tcgaggcagg agtcggctct ccggagcctc 181 gtccctccct tccccttccc tgcccccttc ccccaccccc gactcgggct tggcgcggcg 241 gccagaggaa ccccgagtcc cggcccaggc ccctgagctg gagggatgga aaactcctct 301 gcagcatcag cctcctcgga ggcagggagc agccgctccc aggagatcga ggagctggag 361 cgcttcatcg acagctacgt gctggagtac caggtgcagg ggctgctggc tgacaagacg 421 gagggtgatg gcgagagcga gaggacccag tcccacatct cccagtggac agcggactgc 481 agcgaaccgc tggacagcag ctgttccttc tcccgagggc gagccccccc acagcagaat 541 ggcagcaaag acaactctct ggacatgctg ggcacggaca tctgggcggc caacaccttc 601 gattccttca gtggtgccac ctgggacctg cagccggaaa agctggactt cacccagttc 661 caccgcaaag tccgacacac gcccaagcag cccctgccac acatcgaccg cgaagggtgt 721 ggcaaaggga agctggaaga tggggatggc atcaacctga atgacatcga gaaggtcctt 781 ccagcctggc agggctacca cccgatgccc catgaagtgg agatcgcaca caccaagaag 841 ctgttccgca ggaggagaaa tgatcgaagg cggcagcaga gacctccggg gggcaacaag 901 ccccaacagc atggtgacca ccagccaggc agtgccaaac acaacaggga ccaccagaaa 961 tcctaccagg ggggctcagc accccacccc tcagggaggc ccactcacca tggctacagc 1021 cagaaccggc gctggcacca tggcaacatg aagcacccac caggcgacaa gggggaggca 1081 ggcgcacacc gcaatgccaa agagaccatg accatcgaga acccaaaact ggaggacact 1141 gcaggggaca ccgggcacag cagcctcgag gccccccgca gccctgacac cctggccccg 1201 gtggcttctg agcggctgcc cccacagcag tcaggggggc cagaggttga gacaaaacgt 1261 aaagacagta ttcttcccga gcgcatcggg gagcggccca aaattaccct gctccagtct 1321 tccaaagaca gactgcggcg aaggctaaag gaaaaggatg aagtggccgt ggagacgacc 1381 actccccagc agaacaagat ggacaagctg atcgagatcc tgaacagcat gcggaacaac 1441 agcagcgacg tggacaccaa gctcaccacc ttcatggagg aggcccagaa ctccaccaac 1501 tccgaggaga tgctgggcga gatcgtgcgc acaatctacc agaaggctgt gtccgaccgc 1561 agcttcgcct tcaccgctgc caagctctgc gacaagatgg cgctctttat ggtggagggg 1621 accaagttcc ggagcctgct cctcaacatg ctgcagaagg acttcacggt gcgcgaggag 1681 ctgcagcagc aggacgtgga gcgctggctg ggcttcatca ccttcctgtg tgaggtcttc 1741 ggcaccatgc gcagcagcac aggcgagccc ttccgtgtgc tcgtgtgccc catctacacc 1801 tgcctcaggg agctcttgca atctcaggat gtgaaggaag atgctgtcct ttgctgctct 1861 atggagctgc agagtacagg ccggctgctg gaggaacagc tgcctgagat gatgacagag 1921 ctcctggcca gcgcacggga caagatgctg tgcccctcgg agtccatgct gacccggtcg 1981 ctgctcctag aggtcatcga gctccacgct aacagctgga accctctgac gccccccatc 2041 acgcagtact acaacagaac catccagaaa ctgacagcct gacagccagg gggcctggca 2101 ggcggcccac gggcagctgg ggccctggtg cacagggcca gatggacagg cgggaggaca 2161 ggggtggccc tggcgggaga aagaaatggg gaggagggca ggcagagtcg gtggccagtc 2221 tggagccaga cggggaaggg agcaaatccc tgagaggagt gcccccgcac aagcccccca 2281 gcccgagcat gcaagctcac accaataagg gaagcatgtt tctttttcct ggtggccctg 2341 gccctcccct tcctcactcc cgcctctccc ctccccatca gacccatccc ccacggagct 2401 ttgtgtgagg gatctcatcg ctgtgactcc tcggagacct tggcagcctc gcacgccggg 2461 gcaccgcttg ggtcagaaag gacctcggaa ggctgaaaaa gtgggtcgga gacgggctcg 2521 cattgttccc gcatgctgtc agccgcagtc gccaactggc agcaggcgac gtgtagcaga 2581 tgtccgggag gacaaaggca ggcacggtcc ccaccagccg cccgtaattg acggcctttg 2641 tcagccatgg cagagctgac gctccacctc ccacctccaa gtcctcctca ctgcagcccc 2701 cacagcctca ggcctagggg gtcaggcgca gcgggggaga tggagtttgc agttccactt 2761 gcactctttt gtttattgtg ttttattttt caaaagtcgg ttgctttgaa gtctctttgg 2821 ccaatgaaaa tgcccgtgag gtgatcacac agtcagcact gttgaggacc cccggattag 2881 tgggagatca aacccagctc ccctctagaa gaaggattcg agccacagac agcttgccag 2941 tagccaatta gggtaattgg aaacttctgc cccggcgggg ggtccccgct ggaatcctgt 3001 gttcctcgcc actggcttcc agcgcctctg ttttctcaaa gggctgatac tgtcaccact 3061 gggaccaagt taaacctggt cctggcccca ggggccttgt ggcaaacagg gcacagaacg 3121 agactggcaa attaaaacca aaattctaga tggtgtcttg cgctccacac gcaggtctta 3181 ctggggaaaa ggatgggagt gggggctccc caggactcga ttttagctaa tgcgctgtgt 3241 cactgcccca gctcggacgt agaagcccag ccctccgtga gctcttggga aaggggtgaa 3301 ttcactgggt catggaaggg acagtcaggt gaccagcggg gtcgccagat gaagcttccc 3361 agccgggaaa caagacgggg tttcttggca ggccctggtc ctggggagca ggccctgttg 3421 ttggctggag aggaaggtgt ggggtggaac aggtgtccac atagctccat ctctgggggc 3481 tggagcacac actttgatga gcccccccgg aaatgatgtc agagcctagc cgcttcctta 3541 tttgctcttt tattgaggcc gggcaggccc tgggtcactt tggaggcccc tcttggtcca 3601 cactggactg gccgggaggt gatgggcggg gaaggttctc gtgattgatt gattctgagt 3661 ctgagagtgg cgagtgggga gaggcttccc cagttctctc cagctttccc tgcagctgca 3721 acctgccctc tggtcccagg tgtggagcct ttgcctgtct ctaaaaagag cctgttggcg 3781 acaaggtgta gggggcacaa gtttacctga aacaggtcag tggtctctcc caagaagcgc 3841 acgccacctc tggtccctgg ccctgaaccc tgccttcttc ctccctccac ggtttcttcc 3901 cagactttct caagctcctc ctcactgccc ttcctcccca gcccagcctg ggaacacaga 3961 tgccccgcgg gtaggaggcc tcgagggagg agccgggctg atgcggggct gctcagggca 4021 ggccccaggg cgagcttgcc atcgtggcca ggcagcctcc acctgtgctt cagtggcccc 4081 tgcccccctg aagcatgtgg ggtttgtccg ctaggaggag gcaaggcccc cgaagagagg 4141 agagacctgg gagtgggagc tcaggtcagg gaggaggcag gggagtgggg tctcccagac 4201 ccaacggtga gctcagagca agcttcacgc aggacgctcc gaaacactgt gtggaggggg 4261 ctgtgttgtg ggcaccttgg ggcctgattc tccttcctcc gaacgggctc cttgatggcc 4321 tggccacagg ggcagctccc cattggctgt taggaccaga gtgtgaagaa gaagtgaaat 4381 ataaatatgt atacatatat aaatatattt ttaattacat gtcgtgtcac ggtggctcca 4441 gacatactgt ttgcctagtt tattccactg cttgaaagcg cttcctagcc aatctgaaca 4501 acaacacttt aagctgtttt tctaaatgca ggttgctgct cctttttcag atatggaagg 4561 aaaacgttaa gactattttt tttttaaaga aacaacagtc aagcctaaaa tttgagaccc 4621 cgaggcagct tcccgaggga gactgctcag acaggaactg caggacagaa gtggatgccc 4681 cacagaccct ggccccctcc ccaagtccat cccctctctg tggcatgagg aaggccgcgt 4741 ccgagttgac ctctgaatgt atgtgatgag aggcagagct ggatattgca tttctaaggc 4801 ttgcattgct ttcccctcgc ccgcggttct tggcgcatgg aagaggcggt ccagccatct 4861 gatgttgatc ctgtctcagt ctccccactg cctgtcagga tgagttagtc attgtttttc 4921 tccgaggcgg cctgcttgcc acagccctgc tccccaaggc ctggtggctt tgccgaagct 4981 ctgggaccgc agccccagcg aggcccccaa cctcacccag acgaggccag gagccccgcc 5041 accctccacg ggatgtgcac cctcagaccc cattctctct gttcgtcctt ccttgaccag 5101 tctgtaaacc ttcactgttt ggggatcgtc ctgtccatcc atgtaaatgt aaatgttggc 5161 cgagtcggta tttattctga ttgattttta ttttattcta ttattttctc cgagggatga 5221 gggtgggggg tgtgggaagg gtaccacaga tcaggccggg gcagctgtag gggcgggggc 5281 ccagacagcc aggccgccac cagagcagcc ccatggggtg ccccagacgc gggcctccaa 5341 gaagccaagt cccagtctgt tttctggcat cagacaccgg cccgtgttcc ttgtcagaca 5401 gacagactct caggcctgcc tggggagtcg tgtccctcag ctgcagggca ctgtgttggg 5461 aaaccattgg ctgggccttt gaggacacag atcagaagaa agaaagacaa ctttcctctg 5521 cgcggaacac tcacacggaa gggctggccg cctccctgag ccggctggga gtggacgaca 5581 ggacctacct ccccagagca agggcctggg gcttcccgcc aaagctgccg cggaaccccg 5641 ctagtgcgac caccctccct ccgtcggtat gtcctgcttt ccagctgaac ccaaactaca 5701 agtgggttta aaaaaaataa acaccaccac caaaaac // LOCUS AB007889 5940 bp mRNA PRI 05-DEC-1997 DEFINITION Homo sapiens KIAA0429 mRNA, complete cds. ACCESSION AB007889 NID g2662138 KEYWORDS KIAA0429. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HH1409. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ishikawa,K., Nagase,T., Nakajima,D., Seki,N., Ohira,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VIII. The complete sequences of 77 new cDNA clones from brain which can code for large proteins in vitro JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 5940) AUTHORS Ohara,O. TITLE Direct Submission JOURNAL Submitted (06-OCT-1997) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, DNA Technology; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1..5940 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HH1409" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 1415..2527 /gene="KIAA0429" CDS 1415..2527 /gene="KIAA0429" /codon_start=1 /db_xref="PID:d1024606" /db_xref="PID:g2662139" /translation="MAVSVTPIRDTKWLTLEVCREFQRGTCSRPDTECKFAHPSKSCQ VENGRVIACFDSLKGRCSRENCKYLHPPPHLKTQLEINGRNNLIQQKNMAMLAQQMQL ANAMMPGAPLQPVPMFSVAPSLATNASAAAFNPYLGPVSPSLVPAEILPTAPMLVTGN PGVPVPAAAAAAAQKLMRTDRLEVCREYQRGNCNRGENDCRFAHPADSTMIDTNDNTV TVCMDYIKGRCSREKCKYFHPPAHLQAKIKAAQYQVNQAAAAQAAATAAAMGIPQAVL PPLPKRPALEKTNGATAVFNTGIFQYQQALANMQLQQHTAFLPPVPMVHGATPATVSA ATTSATSVPFAATATANQIPIISAEHLTSHKYVTQM" BASE COUNT 1785 a 1178 c 1179 g 1798 t ORIGIN 1 cttgccggct tccttgcaaa gcccggtgca agggcctctt tcaaaatgaa cccactggtg 61 tgcctagcag tcggtagaag aagcgggagg gcgtccggtc tgcacgcccg ccgcgaggtt 121 acaatgctga acgcatgaga tggaagatac caacgggagg ccgaggggat ccacggcgcc 181 cgcgcgggct ccggcttcct cctgctctcg gcgccgctgg gcgaccgccc atgacccgct 241 cttgcgggct ctgtccggtt gacaggcgac cctgtggccc ggggaagcgc gggagggcgc 301 cggcggaaag ttgaagagcg tttttctcgc cgccgcgtgc attaggagct cgacgagtcc 361 gccctgggct tcctggtggg gctgggcggg cgggggaggg gccgcgcagc agcagcggaa 421 gccagacctc ggcgataaga ggctgcacag cgacatgcaa cagtcttttc actgcagctg 481 aatgagttgt ggcgcccaca atgctcccat gacaaggagc tgacaagttc cattttccgt 541 cgcgggcatc ttggaatcat gactcccaca atgccttggg cacttggtcg acagtggggc 601 cgcctctgaa aaaaaaaatg tgagaggttg gtactaagaa gtgcctttcc tgacgtctct 661 gctgcttgga accgcttcta gagcagtctc tgcttttgcc ttgcttgctg ccagctagac 721 tgtgacgaca gcacatccac cctccacctc tagcccagac acccccattt ctacttataa 781 tcaagagaaa agctctaagt atctggcatt gccctaggct gctttagtgt taaaagaaaa 841 gtttgctgaa aaagtaagat atcttctgcc aggaaatcaa ggaggaaaaa aaaaatcatt 901 ttctcgattt tgctctaaac tgctgcatct gtctatgcca aactaatcaa taccgattgc 961 accaccaaac tccattgcaa attcagctgt gaggagattc cctttcagac aactttgctg 1021 aaagcagctt ggaaattcgg tgtcgaaggg tctgccacgt tttcatgctt gcattttggg 1081 ctccaaattg gcactgggaa ggggttactg agagcacaag gctgatacca ggccctactt 1141 ttaaacgttc atctacttac aatcctagta tttctctaaa aaccaaaacc tctttgaatt 1201 aacagtttca tgctgtgaat ttctagtggg agatcttttc cttgatattg acgacacaat 1261 tttccatgta cttttaaagc agggagtggg gaaaagtatt ttgaggggac attttcatca 1321 tcagttcagc tttttttttt ggttgttgct cttttttggg ggggttgggt ttgttggttt 1381 cactgaaaca tttaactacc tgtaaaatct aaacatggct gttagtgtca caccaattcg 1441 ggacacaaaa tggctaacac tggaagtatg tagagagttc cagaggggga cttgctcacg 1501 gccagacacg gaatgtaaat ttgcacatcc ttcgaaaagc tgccaagttg aaaatggacg 1561 agtaatcgcc tgctttgatt cattgaaagg ccgttgctcc agggagaact gcaaatatct 1621 tcatccaccc ccacatttaa aaacgcagtt ggagataaat ggacgcaata acttgattca 1681 gcagaagaac atggccatgt tggcccagca aatgcaacta gccaatgcca tgatgcctgg 1741 tgccccatta caacccgtgc caatgttttc agttgcacca agcttagcca ccaatgcatc 1801 agcagccgcc tttaatccct atctgggacc tgtttctcca agcctggtcc cggcagagat 1861 cttgccgact gcaccaatgt tggttacagg gaatccgggt gtccctgtac ctgcagctgc 1921 tgcagctgct gcacagaaat taatgcgaac agacagactt gaggtatgtc gagagtacca 1981 acgtggcaat tgcaaccgag gagaaaatga ttgtcggttt gctcatcctg ctgacagcac 2041 aatgattgac accaatgaca acacagtcac tgtgtgtatg gattacatca aagggagatg 2101 ctctcgggaa aagtgcaaat actttcatcc ccctgcacat ttgcaagcca agatcaaggc 2161 tgcccaatac caggtcaacc aggctgcagc tgcacaggct gcagccaccg cagctgccat 2221 gggaattcct caagctgtac ttcccccatt accaaagagg cctgctcttg aaaaaaccaa 2281 cggtgccacc gcagtcttta acactggtat tttccaatac caacaggctc tagccaacat 2341 gcagttacaa cagcatacag catttctccc accagttccc atggtgcacg gtgctacgcc 2401 agccactgtg tccgcagcaa caacatctgc cacaagtgtt cccttcgctg caacagccac 2461 agccaaccag atacccataa tatctgccga acatctgact agccacaagt atgttaccca 2521 gatgtagaat tttcatcact aaacaatcat gctaaagagg aaaggacagt gtgcttggtt 2581 agagtaaagg acgaggtcat tagccatatt gtatatatcg tcaagcaaca cacacaaaag 2641 ttcctcagcc acaagacatc cacatattgc atgttaacca gaagaaaaga caacattttc 2701 cggaaatcca ctgcacactg ttgcctatac actttgtaca tttaattgat atttgtgctg 2761 aggtgatatt cctgtctaaa agaacaacat tgtctttctt ttctagcaca gagttatgca 2821 ttcaaagatg catacctagt tagtttccta tatattcatg ccatcttgaa aagacagact 2881 atggtgtaac catgattcta ttatgtattg gtacgtctgt agaccaagat ataatttttt 2941 aaaaataagt ttatttcttt caaggtttac aaataacaaa ggtgcacctt gtatttaaaa 3001 ttgccattat agatgagagc gtgcatgcac agtcattttt gtttaagagt aatattttta 3061 atgtaataga ttgtaagacg tggtgaggga gggatctgac agagatgaat gtgccaagca 3121 aaaccacaac tgtgtatatt ttaaagcaca tcatggcttt aagtaccatg ttgttaagga 3181 ttctcatgaa gtgccataga ctgtacatca aattagagta ttatttcttc agtgttattg 3241 ttttcagagc cacattttgt tgcatatttg ctagtactaa tcagtcaaag ggcaccattc 3301 tttttttttt tttttgaaac caaagctgtc tcagaaatgg ccaatttaac tttacagtaa 3361 caatagacag cacaacacaa actctctcaa tacagataaa ctcacacata ctggagatat 3421 atatataata gatatatata aaattatttt aatgcattgt agtgtaatat ttatgcatac 3481 tatactgtat aacatgttat tcaaaaggga ttgccatttc tgagacacag taacaaaaaa 3541 atgaggaaat tattttgctt ctatttatag cctctgtcaa aagtcaaaag actataaatg 3601 ctttgcaaaa atggtttcac gtttgcttaa atgcttcatc acagtcacat tcaaaatagt 3661 gactctaaac aaagaagaaa gcagcactgt catcagatgc atgataaacc aaaatatgaa 3721 aatgggaaat gtttaattaa cctagtaatt gggtgggtta agtacatggg tgaattttat 3781 atgtgatttt tgttttgttt tgttttgttc agattaactg cttatagcct tagaaagcct 3841 tttacaaaat taaaaaaaaa atagatgtgc attcagtttt taagaatgga atcatccaaa 3901 ggaattcctt tttttgaggt ttggatgttg cagctagtaa aggatatttt tgctctgttc 3961 agcagttcta aaaattgctg aagtaggggc caggtcactg gtagttatag tatggaatgg 4021 gagaagtgaa agttcagtta tagaactttc catacttcca agtttactgc aagtttttat 4081 gcttgagaga gatgctttct aatataagac tgatgtgttg attttactga ttgtactgta 4141 catctattaa agccttagat tattacatta cgggttggaa cccataccaa tgtaatttca 4201 atcgtgttaa gaaagtaatg gtgacttcac atgttattgt agttagttac attatagaat 4261 attacttatt tttcttgtta aaatgtagtt tttcatttcc tacatttatt agattttcat 4321 tttctattaa caattgaata ccatttcagt ttatagactt gttttattag attttaccaa 4381 tgaatttttc aaaatacaaa aaaaagtagt ttttccttca taacatactc agttttgaat 4441 tacatgtagt gtcacatgaa tattcgtatt gttaactaaa tgatttatat tttactgatt 4501 taatattaca gtgtaagaat gtcagtcatt gttagttctt gtctagtttt cattaaaaga 4561 acaaagatct tttatatgga tatcttataa atatataatc attgctaagt aagaagttaa 4621 gttgttgcta tcgcaacaat cctggcagac aattgagtaa tattttgatg atttattttg 4681 tttgtaatta gttattataa gaagatctag atcctagata ttagaataaa atttattttc 4741 tactgtatcc atttcaaatg ttaaaatatt gtttaatatt tttgaaatcc ctgagtatca 4801 ggccttgtta taaataagct gcataatcaa taaatagaac aagggacttt ttgttgataa 4861 tccaaatact caaagtttac gtaatgaaaa ttatagcgtg tgtgcaaact cttgagggtt 4921 gattatgctg caatttagca tgttggaacg tctagggaga aggttgactt tttgcacttc 4981 tgtatatagt caaaagagag aaacctgtat aatagtaaga tcttattttg aataaaaacg 5041 tctataatta caaggagttt tgttaaggct aatacaatga cagactgagc aaaattgctt 5101 gcaaaagtgg cacagagtta gcactccata ccccttcaaa catgttgctt tgctttcttg 5161 tggacagctt gtagtttgcc aggatttttt cagctggaaa gatacgccat cctttcaaac 5221 cctcatgact gacaaaaact ccatggggcc aaatctgcct gaagatcatt accaaaaata 5281 gcaggtactt ctaccattaa ggtgaaatca tggatcagat attccttaca tttttcaaaa 5341 ctactgcatg tttaaaactt caacaaaaaa agagagaaag aactatacta agaacatata 5401 ttattcagat cagtttctgc caatttcagt ggtttattgt tcacaaaaaa atcttcaaaa 5461 caagtattga ctttcacaaa atttaaatca taaacaggca aaccaaacag cacactgtag 5521 ctatagttgt tatgtgattg ttttttaatt gctgtaggat cctgttcttt cagcaggtga 5581 aaaataaaac gcagttcaaa tttcatggtt ttaattttca actcagaagc actcaaaaat 5641 gcaaaatgtg ataatgggca cttgtttaaa agaattagtg tatccagcct tcactccagc 5701 tggttaaaaa tgttgcactt atcagcaacc ctaccacttt catctgctga aaggacaaat 5761 gtgcttggtt ttactattat gtaatcacaa cttactttct gcttgtagtt gcttaaaatt 5821 atgtattttg tcttgggctg caatttgttt tatgcttatt ttattattac tgcagtagtt 5881 gactttgctg tatggaaaaa taaagtgaaa ttgccctaat aaaacttctc tttcttaagt // LOCUS AB007890 5645 bp mRNA PRI 05-DEC-1997 DEFINITION Homo sapiens KIAA0430 mRNA, complete cds. ACCESSION AB007890 NID g2662140 KEYWORDS KIAA0430. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HH1494. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ishikawa,K., Nagase,T., Nakajima,D., Seki,N., Ohira,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VIII. The complete sequences of 77 new cDNA clones from brain which can code for large proteins in vitro JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 5645) AUTHORS Ohara,O. TITLE Direct Submission JOURNAL Submitted (06-OCT-1997) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, DNA Technology; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1..5645 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HH1494" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 2374..3444 /gene="KIAA0430" CDS 2374..3444 /gene="KIAA0430" /codon_start=1 /db_xref="PID:d1024607" /db_xref="PID:g2662141" /translation="MFPSSQIPSWKDWAKPGPYDQPLVNTLQRRKEKREPDPNGGGPT TASGPPAAAEEAQRPRSMTVSAATRPGEEMEACEELALALSRGLQLDTQRSSRDSLQC SSGYSTQTTTPCCSEDTIPSQVSDYDYFSVSGDQEADQQEFDKSSTIPRNSDISQSYR RMFQAKRPASTAGLPTTLGPAMVTPGVATIRRTPSTKPSVRRGTIGAGPIPIKTPVIP VKTPTVPDLPGVLPAPPDGPEERGEHSPESPSVGEGPQGVTSMPSSMWSGQASVNPPL PGPKPSIPEEHRQAIPESEAEDQEREPPSATVSPGQIPESDPADLSPRDTPQGEDMLN AIRRGVKLKKTTTNDRSAPRFS" BASE COUNT 1472 a 1390 c 1272 g 1511 t ORIGIN 1 cgtccaccgt ggagtctctc cttgctttgg gtccactgag ccccacacat tgttgagtgt 61 ctgattgtct ccttgccctt acaggacagg aaagcgaaat ctctgttcct ttgtagggac 121 ctttcttttg taggtgaaag ggacagaaat atgattctgg cctccttgat ggtgaataca 181 ggaacacttt cagataaatt acaatagaga ggagtcagct tatttaaaca aactgaggtt 241 aagccagaga ttgcagttaa cagtgaaaaa atttaaagca agtacctgga tggatgcaaa 301 gcagtgccca ggagacccaa cttacgtgac agagatgatc attgtcactc attcattatg 361 ttttgagcac cagccatgtg ccaggtccag aactagaaga aacttgggtg taggtgagat 421 cataggaaca gtccctgccc cagctgaaca tcaggaagcc agactgggag cacatgccgt 481 ggctgactag gactccagca gggacctggg gttccatcag atcagtgtaa agcgatggtt 541 tcccaaatga gacctctcca gagatggttc attccacccc gcaatggcca ccagaaacat 601 tgtgctgacc agagggcttc actgccccaa accccaaaat cacactccag gatgagaaac 661 ctaaaagtct atcttgatga ggcttcatag ctggaaacca cccctcatgg atccacaggc 721 tgcatttgat gaaaggagac tccaggaagt cgagccattc caggcagaca tgtagcaaaa 781 gcccgagctg gggtccgccc tcatgcaata aataacacct gtgcacataa gagaggtgac 841 aggcagggca tccctgggga cctgggcctg tggcacgtga agaaacaccc aagagagcac 901 gaagtcccaa tgccatgacc agatatttcg gtgccaggca tgcttctggg tcccagcaaa 961 agacacacaa atccgtgtcc ttggggaccc acagtctagt gacctagggc tggccgggtc 1021 gcacgcttct tttcaagggc tcgaaagcct ctgcatggat gcaactttgg gggaaaagta 1081 accctaaccc tctgcttctc tcatcctccc ctcttctgtg tatatgtctg tctctttctc 1141 tctgtgtgtg cccttcccca ctcccatccc cattcctttt tttatttatt atattttggt 1201 ggtgtgtggc tcggtttctg ttggtcccct cgtgtggccc ctcccctgct gcagaactcg 1261 tccagctcgg cctcctccga agcctcggaa acctgccagt cagtgagcga gtgcagctcc 1321 cccacctctg tcagctcggg ctccaccatg ggtgcctggg tgtccacaga gaaggtgacc 1381 gcccgggtgg cagccacagc ggccccactc tccctggtcc ccctgcctgc tgctcagaca 1441 gccctcacag ccgcggcaac ctggaagctc agaatcgaca ccaagagtgg ggctcctctt 1501 ggggacagag ggtgggacaa ggctgagagc attttctaga atgagtgcct ggctgctgga 1561 caatgaccac atggttgtct caagggagag gaaggggtat ggggaatttg gaggtgtgaa 1621 ggctttctcc cagaccaggt gttgcgtgcc tgtccatcct cctccctccc ttcccgagcc 1681 acccaccaga ccgttggatg agccatgtca ctgtgtcaga ggctgtgtgc ttttccgctt 1741 tttagtttgc acccccaccc ccctgccctt gattgagaat gtgctcctga gcaatgcgac 1801 gattttggaa acctagaatg aacagattca tttgaagatg ctttggaatt tttaaatttg 1861 cataatgaaa tgagcaaatc actatcacca ttaaatgagg cttcattagt tacattatca 1921 aaaaccattg acaaggccac aaattgaggg cactgatcaa tttccaaact tctaaaacgt 1981 aggtcaaaca tagatgctta tttgctgtca aactcatgtg cattctttgc accccggctc 2041 ttttctcttg catcacagat catccgtgag cagcagagcc ccaatgtttg ttttatttat 2101 aaatattcag gtttcccctc ccttgaatgt caatgccatt ttgtatctcc gcactcatcc 2161 tgttatatta attttttttc ctttcccccc cctttttttg tttgtttcca gttgtctaac 2221 gggttttctc actatagttt atcaagtgag tcccacgtgg ggcccacggg tgcaggcctt 2281 ttccctcatt gcctgcctgc ctcccgcctg ctccctcggg tcacctctgt ccaccttcca 2341 gactacgctc attattacac cattgggccc ggcatgttcc cgtcatctca gatccctagc 2401 tggaaggact gggctaagcc tgggccctat gaccagcctc tggtgaacac cctgcagcgc 2461 cgcaaagaga agcgagaacc ggaccccaac gggggaggac ccactaccgc cagcggccca 2521 cctgcagcag ctgaggaggc tcagagacca cggagcatga ctgtatcggc tgccaccagg 2581 cctggtgagg agatggaggc ttgtgaggag ctggccctgg ccctgtctcg gggcctgcag 2641 ctggacaccc agaggagcag ccgggactcg cttcagtgct ccagcggcta cagcacccag 2701 acaaccaccc cctgctgctc tgaggacacc atcccttccc aagtttcaga ttatgattat 2761 ttctctgtaa gtggtgacca ggaggcagat cagcaggagt tcgacaagtc ctccaccatt 2821 ccaagaaaca gcgacatcag ccagtcctac cgacggatgt tccaagccaa gcgtccagcc 2881 tcaactgctg gcctccccac caccctggga cctgctatgg tcactccagg ggttgcaact 2941 atccgacgga ccccttccac caagccttct gtccgccggg gaaccattgg agctggtccc 3001 atccccatca agacacccgt gatccctgtc aagaccccaa ccgtcccaga cctcccaggg 3061 gtgttgccag cccctccaga tgggccagaa gagcgggggg agcacagccc tgagtcgcca 3121 tctgtgggtg agggccccca aggtgtcacc agcatgccct cctcaatgtg gagcggccaa 3181 gcttccgtta accctccact tccaggcccg aagcccagta tccctgagga gcacagacag 3241 gcaattccag aaagtgaagc tgaagaccag gaacgggaac ccccaagtgc cactgtctcc 3301 ccaggccaga ttccagagag tgaccctgca gacctgagcc caagggatac tccacaagga 3361 gaagacatgc tgaacgccat ccgaaggggc gtgaaactga agaagaccac gacaaacgat 3421 cgctcagccc ctcgcttttc ttaggttcac aagaaatgcg ccggtgggga atgaactgtt 3481 tcattaataa aacctaattt gtcttgatcc attccactct ataataaaac aaaagatttt 3541 gtaggcaact cggaatatag ctcttttgaa agtactcgac acctttagat aagaattaaa 3601 accaacctat gtaactgaca taatcttgat cttttaattt gtaaatattg acaattttct 3661 ttctgcacat tttaatctta gtttcccttt tgatttttct gaaggtgcca aattccattt 3721 aactttttta caagtctttg taaaatttta aatgcataaa gggggttggg gcaggggaac 3781 cacgaagtag ttaattttag aaaaggattt actatacttc actcttcttt ttttttcccc 3841 acaagctttt gtagatgcat tgtagtagtc tagcttagaa gcaaatgcaa gttattttaa 3901 tgtacaaact aaatgggtaa gaggtaaaat cttcatttaa atatactatg ttctggatga 3961 aaagagcagg agtaacaatt gatgagcaat attcagagtg aagtaaatct ggaaatggta 4021 gactgtgttg ggattggggg gagggccatg ggaggggtac atcgtcaaca tagccgatcc 4081 tgttacattt aagagtagcc tcgtaggttg aatttcttct ggtagcttca tggtaaatgc 4141 atccgaataa gccatactgg attgcagtgt ttgtttctgt agggtgttta aggacttgac 4201 ttcctttctc ccatgattcc tctggactgc acacagcacc cacaaccagc cccatgcatg 4261 ctgctgcctc tgggcagtcg tagaatctcc cacttcagtt tctcgttgat tgtactcacc 4321 tttatggaat ccaaatacat ccaaaagggt aaggcagttt taaaaatgtg aaaacattta 4381 aaaatgataa tagcagggaa ttcttagatt atagtaaatg ccttttactt aactgtgccc 4441 agcaggctgg gtgcgttaaa aagcccaagt attttgaaaa aactcgaaca gatttgacaa 4501 gggtagccag cttggagtct agcaacttgc caatgtgttt accaatctgg gggcttgttt 4561 ttcttttctt ctttcaaata aatggcagtt aactggcttt acagtaaaca ttgaagagag 4621 gaggatttgt ttattgtcac tgggaatctg accactatac tgtccttttt ttgtattctg 4681 ggtaatgttt tttggaaaag atttgtcttt tctaagtgga agttaaattt gttatactgc 4741 ccatccccta aagccaacag agatttgtag atttaaaggg atcacatttg aagacaatag 4801 tgtttaagaa agcaagcaag tcccttagca gtcaggtcat aacagggcac atttctgacc 4861 gaaccctctc aaggcagagg aggagtttgg tgggtttcat acaccctgca gattcctgtt 4921 ggctctaacc ctcaattacc taatcttatg ctttaacaca taactgcatt ggatgtgaga 4981 gtaacgtacc gtatggtcat tgttctatat attaacattg aacactgctg cgattgctca 5041 aggacatttt atgttacggc tttaaagcaa aggcatgatt attagaaact atttaagctt 5101 ttttctttga aaaacaagct tcttttacag aatataaaca acagtagtgc ctgtggttta 5161 gcccaccaat cttgatgact aaaagtagct gatgcattgt gcatatgatg cttgagatgg 5221 tttttgcaaa agcagaaatc gctgcaaggt aatcacaata gataaaagtg gtattttaaa 5281 cctttgaaat aaatggatgt aactgtacct tggtacagct tttcacttgt ttagttttta 5341 aacgttagta taatctgaat aaataaaatg ttgccaaatt caatgtagaa agaatgtgac 5401 aacacacctt gggtagttct gcttgtgttt ttgcatattg taaaagcagt gtcacagcta 5461 aaaagaaaga aatcgtttct aacagtaaat tattgtgctt tagttgctag tttgtactga 5521 gagttgacct ctccctgtgc agttttttgt tctaaacttg tataaataac aattgtgtaa 5581 tgtgtctccc tcctacattg taacaattgc ttcagcctac gttataaata aagaaccact 5641 agatt // LOCUS AB007892 5350 bp mRNA PRI 05-DEC-1997 DEFINITION Homo sapiens KIAA0432 mRNA, complete cds. ACCESSION AB007892 NID g2662144 KEYWORDS KIAA0432. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HH1739. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ishikawa,K., Nagase,T., Nakajima,D., Seki,N., Ohira,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VIII. The complete sequences of 77 new cDNA clones from brain which can code for large proteins in vitro JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 5350) AUTHORS Ohara,O. TITLE Direct Submission JOURNAL Submitted (06-OCT-1997) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, DNA Technology; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1..5350 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HH1739" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 964..2967 /gene="KIAA0432" CDS 964..2967 /gene="KIAA0432" /codon_start=1 /db_xref="PID:d1024609" /db_xref="PID:g2662145" /translation="MKMHAEKKHKCSKCSNSYGTEWDLKRHAEDCGKTFRCTCGCPYA SRTALQSHIYRTGHEIPAEHRDPPSKKRKMENCAQNQKLSNKTIESLNNQPIPRPDTQ ELEASEIKLEPSFEDSCGSNTDKQTLTTPPRYPQKLLLPKPKVALVKLPVMQFSVMPV FVPTADSSAQPVVLGVDQGSATGAVHLMPLSVGTLILGLDSEACSLKESLPLFKIANP IAGEPISTGVQVNFGKSPSNPLQELGNTCQKNSISSINVQTDLSYASQNFIPSAQWAT ADSSVSSCSQTDLSFDSQVSLPISVHTQTFLPSSKVTSSIAAQTDAFMDTCFQSGGVS RETQTSGIESPTDDHVQMDQAGMCGDIFESVHSSYNVATGNIISNSLVAETVTHSLLP QNEPKTLNQDIEKSAPIINFSAQNSMLPSQNMTDNQTQTIDLLSDLENILSSNLPAQT LDHRSLLSDTNPGPDTQLPSGPAQNPGIDFDIEEFFSASNIQTQTEESELSTMTTEPV LESLDIETQTDFLLADTSAQSYGCRGNSNFLGLEMFDTQTQTDLNFFLDSSPHLPLGS ILKHSSFSVSTDSSDTETQTEGVSTAKNIPALESKVQLNSTETQTMSSGFETLGSLFF TSNETQTAMDDFLLADLAWNTMESQFSSVETQTSAEPHTVSNF" BASE COUNT 1632 a 1092 c 1101 g 1525 t ORIGIN 1 tttattgaac atttattctg ttcaaaacat tcccaaaggc aacagaagat acaaataaat 61 ctctgcccat gaaaaggtgt ggggggcatt agaaggcgtt ctcttcggtg taatgaagta 121 atgagagaag aaaaagtagt ttgaagctat ggagtaaggg actttgagta tcccaggctc 181 aaaaagttgg gacttgaaca gtacgggggt gctgctgaaa acgtttgagg gaggtaatga 241 catgatcgaa gctatacttg agaaaggtga atctgataaa gtatgagtga aaaagagact 301 gaaggtctag aaattagatt gaggctaatg acaaaatcca cataaatagg aggacttgaa 361 cgaaggggca cttagaagag gacaggagat agtaaaaggc attcaatgat gagagcacac 421 actacagggg agcatgaggg aggttggaaa agataatgaa aggattaccg agcttcactg 481 acgatgtgtt tgaaatgagc aggaatcttg tagtgatcct aatccgtggt tttctggagc 541 atttcacagc ctaggaacat acaagggggg catctccctg gaatgtaaat tgactaagag 601 gaattcaata atggtcaaat gaatgcagaa ttttagagtc ttgcttagta ttctcaccac 661 atttcgttta gtctactcat actctttttc tcttactgct gacactagat ggaaaaactc 721 ttaattaaaa gtatttcaca aaatgtgctc gttttcagtc attccgtttc cactccagcc 781 tgttgtgttg tttttttgaa ataataattt aaagtaattt tccttttgca ggatggcata 841 gtcaatccaa caataagaaa agatttgaaa actggaccga aattctactg ctgtccaatt 901 gaaggctgcc ccagaggccc tgagagaccg ttttctcagt tttctctcgt aaaacagcac 961 tttatgaaaa tgcatgctga gaagaagcac aaatgtagta agtgcagcaa ttcgtacggt 1021 acagaatggg acctgaaaag acatgcagag gactgtggca agaccttccg gtgcacatgc 1081 ggctgtccct acgccagtag aacagcactg cagtctcaca tctaccgaac tgggcacgag 1141 atacctgcag aacacaggga cccacctagt aagaaaagga aaatggaaaa ctgtgcacaa 1201 aaccagaagt tatccaacaa gaccattgaa tcattgaaca accaaccaat ccctagacca 1261 gacactcaag aactagaagc ttcagaaata aagctagaac catcttttga agactcttgt 1321 ggctctaaca ctgacaagca gactcttaca acaccaccga gatatcctca gaagttgctt 1381 ttaccaaagc ccaaagtggc tttggttaaa ctacccgtga tgcagttttc tgtcatgcct 1441 gtctttgtgc ctacagccga ctcctcagcc cagcctgtgg tgttaggtgt tgatcagggc 1501 tctgccacag gggctgtgca cttaatgccc ttgtcagtag gaaccctgat cctcggccta 1561 gattcagagg cttgctctct taaggagagc ctacctcttt tcaaaattgc taatcctatt 1621 gctggtgagc caataagtac tggtgttcaa gtgaactttg gtaaaagtcc atctaatcct 1681 ttacaagaac tagggaacac gtgtcaaaag aatagcattt cttcaatcaa cgtgcagaca 1741 gatctgtctt atgcctcaca aaactttata ccttctgcac agtgggccac tgctgattcc 1801 tctgtgtcgt cttgttctca aactgatttg tcgtttgatt ctcaagtgtc tcttcccatt 1861 agtgttcaca ctcagacatt tttgcccagc tctaaggtaa cttcatctat agctgctcag 1921 actgatgcat ttatggacac ctgtttccag tcaggtgggg tctccagaga aactcaaacc 1981 agtgggatag aaagtccaac ggatgaccat gtacagatgg accaagctgg aatgtgcgga 2041 gacatttttg agagtgttca ttcatcatat aatgttgcta caggtaacat tataagcaac 2101 agtttagtag cagagacagt aactcatagt ttgttacctc agaatgagcc taagacttta 2161 aatcaagata ttgagaaatc tgcaccaatt ataaatttca gtgcacagaa tagtatgctt 2221 ccttcacaga acatgacaga taatcagacc caaaccatag atttattaag tgatttggaa 2281 aacatcttgt caagtaatct gcctgcccag acattggatc atcgtagtct tttgtctgac 2341 acaaatcctg gacctgacac ccagctccca tctggcccag cccagaaccc cggaatcgat 2401 tttgatatcg aagagttctt ttcggcctca aatatccaga ctcaaactga agagagtgaa 2461 cttagcacca tgaccaccga gccagtcttg gagtcactgg acatagagac tcaaacggac 2521 ttcttactcg cagatacctc tgctcagtcc tatgggtgta ggggaaattc taacttctta 2581 ggccttgaga tgtttgacac acagacacag acagacttaa actttttctt agacagtagc 2641 cctcatctgc ctctgggaag tattctgaaa cactccagct tttccgtgag tactgattca 2701 tctgacacag agacccaaac tgaaggagtc tccactgcta aaaatatacc tgctctagaa 2761 agcaaagttc agttgaacag tacagaaaca cagaccatga gttctgggtt tgaaaccctg 2821 gggagcttgt tcttcaccag caacgaaact cagacagcaa tggatgactt tcttctggct 2881 gatctggcct ggaacacgat ggagtctcag ttcagctctg tagaaaccca gacttctgcg 2941 gaaccacaca cagtctccaa cttctaaaac taacggtgga gtccatgtgt gaaatggcat 3001 ctaccatttc ctctggatta aaactacgga ctggggacaa cagtattaat tcgattgaat 3061 gtggctgatg atgcagttgc ttagcttctt tgtgtttctt tgccttttgt acttgtaaac 3121 agaaatttgc gtataaatgt gagtgtatta taaagtttga gatgttgatc taaattgttt 3181 ttgtgttgcc tacatttgcc ttttcacagc tagtcttttc atgttaaaaa aaaaatgtat 3241 ttcatatcta taaaacctat atagccattt agctgaagcc cagcttacca ggttcaaggg 3301 tacaaacttc tcaaatcttc aaaacatttt agtcaaagtg taatatactt aaactgcacc 3361 taaaatatct ttggcactgc ttgttagaaa ttcctgattc ctgttactaa tcactaaaga 3421 aaccggatgc tgccaccgta ggatttaagc agtagtgctt ccatgctctt aagactcctg 3481 ctgcctggac cttcgtcagc tttgacacct cttttctgat ttaaagacac caaggaaaac 3541 tacaactgtc tttagctttg aagcagtttt catgtaatca ttgccacctc ttcgctacat 3601 gaactactat tgataccagc atacaagtgt atagcacttt acacacaaga ggtttattga 3661 tgtaaaatta tcggctaggg aagcagcagc gggccaggtg tggtggctta cccctgtaat 3721 cccagcactt tgggaggcca aagcaggacg atcacttgag cccaggagtt caacaccagc 3781 ttgggcaaca taagaagacc gtgtctctgg aatttttttt ttttttaatt agccaggcac 3841 agtggcatgc gcctgtgatc ccagctactt ggaaggctga ggtgagagga tcactcgagg 3901 agattggggc tgccatgagc catggtcttg gcactgtact ccaacctggg taacagggca 3961 agaccctatc tcaaaaaaaa aaaaaaaagt cgccagcaac aagcacgtag tgtagtgttc 4021 ctgctaaatg agcataggtt atccaaacct tgggaacagg gagttatgga aacgtgccta 4081 tgacttcatc ttggggtgtg tcctatgaag atcctttctg gtctccacag taggccagag 4141 ttgggggctc tggagctgtt tccccaagtg catccacaag ctggatctga gttttgtcac 4201 tctaaaatta aacaagaaaa aaagtgggaa aagggcatcc cccattaggt ttcaatactt 4261 tgcacttcta ctaagcttga tagggcagga gtgcaatcta caattatttt aaagtgaatt 4321 tccttccatt caccattctt tatcttttct ttgaataaga aaaagtatct agcaaggata 4381 ttacttgtgc cttgaggcta gcaattatag gatagattca tctaaaatat ggtattctgc 4441 attttggttt tttttcttaa gtgaataata ccagtcttca aagaaaacaa ggtgaagacc 4501 tattgcttca ataatcaaga atgctttgtg tgttttgagg taggagcatg atcaagtatg 4561 ctttggggat tttctgtatt taggagatcc tggattctta attgttggct aagttccagt 4621 caagtaggaa tcagtgcagc ctgtaagttc tccacattga cacacacaca cacacacaca 4681 cacacacaca cacacgacat gctcctttct gtggcacatg cctgtattac tgaaagctaa 4741 atcctcaaaa cctagtaagg ggaccaatga ttcattaaag taaattgatg gttttgctac 4801 taattcctat cccatacatt tgacacaaaa gaagtgttgg taatggataa ataacatatc 4861 ccgggcagat gagctcaacc tagtaggtaa gagtttggtt tggtcacagt tgcctatgag 4921 tgtgggtttc aaaagaaaca taaagcctta acttagaatt tcattatgtt ttagaatcat 4981 cactgcctta atattcaagc atctatttaa gtcctaataa aggagaaatg catgtttatg 5041 gcttttttgt aaatataaat gcagtgatct atggcttaaa aaatttgttt ctgtgacaat 5101 gtttgtaaat ctagccaata gagtcattta cagaagaaaa atgagcatgt aataatacaa 5161 gaactgtttc cccctcaaaa cctgaacctg aattatttgt aaaaactgaa atttaatgat 5221 taaagagaag ccagaattgt accctttttt gtgaattctt gaacgtactc ataaatatga 5281 cttattgtat tgccttaagt tttcactcat tgtcttttga aagccatatg ataaaatgat 5341 tttatttaat // LOCUS AB007895 5347 bp mRNA PRI 05-DEC-1997 DEFINITION Homo sapiens KIAA0435 mRNA, complete cds. ACCESSION AB007895 NID g2662150 KEYWORDS KIAA0435. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HH2241. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ishikawa,K., Nagase,T., Nakajima,D., Seki,N., Ohira,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VIII. The complete sequences of 77 new cDNA clones from brain which can code for large proteins in vitro JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 5347) AUTHORS Ohara,O. TITLE Direct Submission JOURNAL Submitted (06-OCT-1997) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, DNA Technology; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1..5347 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HH2241" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 1196..3529 /gene="KIAA0435" CDS 1196..3529 /gene="KIAA0435" /codon_start=1 /db_xref="PID:d1024612" /db_xref="PID:g2662151" /translation="MTFYPFVASSSTRRVDNSNTRLAVQIERDPGNDDNNLNSIFYEH LTRTLQESLCGDLVLGRWGNYSSGDCFILASDDLNAFVHLIEIGNGLVTFQLRGLEFR GTYCQQREVEAIMEGDEEDRGCCCCKPGHLPHLLSCNAAFHLRWLTWEITQTQYILEG YSILDNNAATMLQVFDLRRILIRYYIKSIIYYMVTSPKLLSWIKNESLLKSLQPFAKW HYIERDLAMFNINIDDDYVPCLQGITRASFCNVYLEWIQHCARKRQEPSTTLDSDEDS PLVTLSFALCTLGRRALGTAAHNMAISLDSFLYGLHVLFKGDFRITARDEWVFADMDL LHKVVAPAIRMSLKLHQDQFTCPDEYEDPAVLYEAIQSFEKKVVICHEGDPAWRGAVL SNKEELLTLRHVVDEGADEYKVIMLHRSFLSFKVIKVNKECVRGLWAGQQQELIFLRN RNPERGSIQNNKQVLRNLINSSCDQPLGYPMYVSPLTTSYLGTHRQLKNIWGGPITLD RIRTWFWTKWVRMRKDCNARQHSGGNIEDVDGGGAPTTGGNNAPNGGSQESSAEQPRK GGAQHGVSSCEGTQRTGRRKGRSQSVQAHSALSQRPPMLSSSGPILESRQTFLQTSTS VHELAQRLSGSRLSLHASATSLHSQPPPVTTTGHLSVRERAEALIRSSLGSSTSSTLS FLFGKRSFSSALVISGLSAAEGGNTSDTQSSSSVNIVMGPSARAASQATRVRGWAGLT RTGWDGGTGSWPERGTCLAFPPFCLQNPIPFSMGLPE" BASE COUNT 1268 a 1439 c 1368 g 1272 t ORIGIN 1 cgaatatatc agatttaacc atatgaaatt gccattttgt aggtgataat aaacagttga 61 ctatcatcaa tttcataggc ctgatgtcaa tttcatattt cattctttaa agaccgccac 121 tcattgttgg aaataaaagg acaaaggtag gctggcatag cctctccaag agagcagatc 181 gctgcactta caaagggttt tggggatgtg tggagttcaa ggactggtga ggttttggag 241 gcatacatgg gttgatggca tctgtatagg tggttacttg aaaagcctgt tattcatcag 301 ttgcccctca agtctgcagt gagtcggcta actctcagaa gccaggggtt gtcatggact 361 ggctgcctta caaaagctgt gcactgaggg gtgtttacat gatggaatag gttcaaagct 421 ggttctagtg gccacttgtt accatggctg cagatgatcc gagaactttc attctcatgg 481 ttcaacctaa gaaaggtcta taagagtata ccccactctt gaaagcaggt ggtggagacg 541 ttgactactc accaccattt cctgaaatca atttgaagag aaagatacct ttgtggacac 601 tgttctcaag ttctgaaagt gggtagcatt acattggcag tttaaacact tgcttttttt 661 tttttgagtc ttgctctgtc acccagtttg gagtgcagtg gtgcagtctc ggctcactgc 721 gacctcggcc tcctggattc aagttcagcc tcccgggtgg ctgggattgt aggagtgtgc 781 caccacgcat ggctaatctt tttgtgtttt tggtggagat ggagtttcac actgtttgcg 841 gactggtacc aagttcctga cttcaaatgc tctgcctgcc tcggcctccc taggtgttgg 901 gattacaggc gtgagccacc gtgctcagcc taaatacttg ctttgatttt gcagatctgg 961 tgctccaaat tgattaagac acaagaccgt tctctccaaa tcagagagct actcatctct 1021 cccttccatg tgtttcttgt actgctccct ctatctctta tcatgagact cctctcttcc 1081 tgcagtgtgg taaaactaca gcaatcgtct taacctgtga gatctgtcac ctttgcattt 1141 tccactcatg cagctggttc tataaaccaa ctcttctgct tggggggatc taatcatgac 1201 cttttaccct tttgtggcct cttctagtac aaggcgagtg gataattcca acacaagact 1261 ggcagtccaa attgaaagag atccagggaa tgatgacaac aatctcaatt ccatttttta 1321 tgaacacttg acaaggaccc tccaggagtc cctctgtgga gacttagttc ttggacgttg 1381 gggcaactac agctctggcg attgctttat tttggcttca gatgacctca atgcctttgt 1441 tcacctgatt gaaattggaa atggtcttgt cacctttcaa cttcgaggac tggaattccg 1501 aggaacctac tgccagcaga gggaggtaga agccatcatg gagggcgacg aggaggacag 1561 aggctgctgc tgctgcaaac caggccactt gcctcacctg ctgtcctgca acgctgcctt 1621 tcacctccgc tggctcacct gggaaatcac gcagacccag tacatcctgg agggctacag 1681 catcctggac aacaacgcgg ccaccatgct gcaggtgttt gacctccgaa ggatcctcat 1741 ccgctactac atcaagagta taatatacta tatggtaacg tctcccaaac tcctctcctg 1801 gatcaaaaat gaatcacttc tgaagtccct gcagcccttt gccaagtggc attacattga 1861 gcgtgacctt gcaatgttca acattaacat tgatgatgac tacgtcccgt gtctccaggg 1921 gatcacacga gctagcttct gcaatgttta tctagaatgg attcaacact gtgcacggaa 1981 aagacaagag ccttcaacga ccctggacag tgacgaggac tctcccttgg tgactctgtc 2041 cttcgccctg tgcaccctgg ggaggagagc tctgggaaca gccgctcaca atatggccat 2101 cagcctggat tctttcctgt atggcctcca tgtcctcttc aaaggtgact tcagaataac 2161 agcacgtgac gagtgggtat ttgctgacat ggacctactg cataaagttg tagctccagc 2221 tatcaggatg tccctgaaac ttcaccagga ccagttcact tgccctgacg agtatgaaga 2281 cccagcagtc ctctacgagg ccatccagtc cttcgagaag aaggtggtca tctgccacga 2341 gggcgacccg gcctggcggg gcgcagtgct gtccaacaag gaagagctgc tcaccctgcg 2401 gcacgtggtg gacgagggtg ccgacgagta caaggtcatc atgctccaca gaagcttcct 2461 gagcttcaag gtgatcaagg ttaacaaaga atgcgtccga ggactttggg ccgggcagca 2521 gcaggagctt atatttcttc gcaaccgcaa tccggagcgc ggcagtatcc agaacaataa 2581 gcaggtcctg cggaacttga ttaactcctc ctgcgatcag cccctggggt accccatgta 2641 tgtctcccca ctaaccacat cctacctagg gacacacagg cagctgaaga acatctgggg 2701 tggacccatc actttggaca gaattaggac ctggttctgg accaagtggg taaggatgcg 2761 gaaggattgc aatgcccgcc agcacagtgg cggcaacatt gaagacgtgg acggaggagg 2821 ggccccgacg acaggtggca acaatgcccc gaatggtggc agccaggaga gcagcgcaga 2881 acagcccaga aaaggcggtg ctcagcacgg ggtgtcatcc tgtgaaggga cacagagaac 2941 aggcaggagg aaaggcagga gccagtccgt gcaggcacac tcagcgctaa gccaaaggcc 3001 gcccatgctg agctcatctg gccccatctt agagagccgc caaacattcc tccagacgtc 3061 cacctcagtg cacgagctgg cccagaggct ctcgggcagc cggctctcct tgcacgcctc 3121 ggccacgtcc ctgcactctc agcccccgcc cgtcaccacc accggccacc tgagtgtccg 3181 tgagcgggcc gaggcgctca tcaggtccag cctgggctcc tccaccagct ccaccctgag 3241 cttcctcttc ggcaagagga gcttttccag cgcgctcgtc atttccggac tctctgctgc 3301 ggaggggggc aataccagtg acacccagtc atccagcagc gtcaacatcg tgatgggccc 3361 ctcagccagg gctgccagcc aggccactcg ggtaaggggc tgggcagggc tcaccaggac 3421 aggctgggat ggtggcacgg gctcctggcc tgagcgtggc acctgccttg cgttcccacc 3481 cttctgcctg cagaacccca tccccttctc tatggggctc ccagagtgac aaaggacagt 3541 gattagacac gaagtggctt agctgctctt gaaagcagac aagatacaga gcagatatcc 3601 tgtaaacgat aatgcccagg caggcactga aaggagtcac cggatacaga ggttctgcag 3661 aactgtggcc atctgcccta caccggggca tgacggagaa tgccctccac ccattcacac 3721 agcaggactc ttctcacatg attctggcct gagggagagg aaaggacacc tgtcaatgct 3781 ggagttagag cttcactgct tctcagccaa tcgatttgac tttaaagctg ctgagatggc 3841 ccactgcttt taggtattta aatactagac aaggagagtt ctaaggactt cacccaaata 3901 agctgttact tgtccagaat cccaaaccag ctgagatgaa atgaatactt gagcttcttc 3961 agtgagaaaa aagtaaataa atacccagca gtgctcctat gtgacctggt agacagggaa 4021 aatcgatggt gtcaaggcaa aaatgggtca ggtttggaga gttcccccac tccttttgag 4081 tgttcaggtt ttccttacca tggctcatgc tttccatcaa gcaccagagt tgcagtggct 4141 tggcctctgg ttctgggtga ggttatttgc aggtggagac ggggggctgc acctgaacat 4201 ttctagtgtc accctccctc tccttcatgg gaaacagctc tccagggaag taccttcctg 4261 ccaggggaag ccaaggctgg gccggccgcc ctacaaggag ccacaggatt gcagccatgg 4321 gtgccacctt tcatggaagg ggagatttat gggctttcct ggaaccccca ggctgtcctg 4381 gccaagagga aagaggtggt tacttcagga gtttgacctt agttagataa ctaaaagaat 4441 acatttcccc tcccttttct ttatttcctc aataaaaatg tacaaagtat cacccttctc 4501 catgccccaa tctgtgttaa agtcacaatc tatgggtgta gttctgggat tctgtcaaat 4561 tctccttcct gctctccaaa atggacaatt gtcgtaggga ccacatgccc ccagaataca 4621 atggcctctg tgttctactg gggtcaagcc tgctagaact cagcattcat gacaggggct 4681 aagtgtgcat gaagtgacac tgactacagc tagaaagcca ggcgcacaaa tgccccttcc 4741 ccccagggcc gctctttcca gcgcagtcat ccagaaaggc ccacgtgcag agcccctgtg 4801 tctcagatgc tgcttcagtt gcccgtcctg tcctcagagg ccactgtgct ggccctctat 4861 catttgacct gactttagaa cctgacctca aggatatggc agcgctagcc tttagctccc 4921 acagcacgga tgggggtgat gccagttaga agtgggtagt gaacgtttgc tgagctgttc 4981 actgtttctc tcttctcttt ggaagcacct ctccgagcca tgtgagcccc ctgatgccac 5041 cgagcagggg cagcttcatg accgatgtct ggctgaggct gtggcggaca ctctcggggt 5101 tgtctgcagg agagcaagcc aggaggacat gggcctggac gacacggcct cgcagcaaag 5161 tgtgtcagac gagcagtgac gggcgtgcgg ccgggcgggg aggctggctc ccccacacct 5221 cccacctgca ttgctctccc tcgtgctccc caaatcacca caaccaacca ataccgcgat 5281 ccatgaggga ctcctcctgt ggaaaaggag agctgttcca gaacacagaa ctgatctcag 5341 gtttttg // LOCUS AB007898 4765 bp mRNA PRI 05-DEC-1997 DEFINITION Homo sapiens KIAA0438 mRNA, complete cds. ACCESSION AB007898 NID g2662156 KEYWORDS KIAA0438. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HJ0450. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ishikawa,K., Nagase,T., Nakajima,D., Seki,N., Ohira,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VIII. The complete sequences of 77 new cDNA clones from brain which can code for large proteins in vitro JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 4765) AUTHORS Ohara,O. TITLE Direct Submission JOURNAL Submitted (06-OCT-1997) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, DNA Technology; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1..4765 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HJ0450" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 118..2244 /gene="KIAA0438" CDS 118..2244 /gene="KIAA0438" /codon_start=1 /db_xref="PID:d1024615" /db_xref="PID:g2662157" /translation="MSQYTEKEPAAMDQESGKAVWPKPAGGYQTITGRRYGRRHAYVS FKPCMTRHERSLGRAGDDYEVLELDDVPKENSSGSSPLDQVDSSLPSEPIFEKSETEI PTCGSALNQTTESSQSFVAVHHSEEGRDTLGSSTNLHNHSEGEYIPGACSASSVQNGI ALVHTDSYDPDGKHGEDNDHLQLSAEVVEGSRYQESLGNTVFELENREAEAYTGLSPP VPSFNCEVRDEFEELDSVPLVKSSAGDTEFVHQNSQEIQRSSQDEMVSTKQQNNTSQE RQTEHSPEDAACGPGHICSERNTNDREKNHGSSPEQVVRPKVRKLISSSQVDQETGFN RHEAKQRSVQRWREALEVEESGSDDLLIKCEEYDGEHDCMFLDPPYSRVITQRETENN QMTSESGATAGRQEVDNTFWNGCGDYYQLYDKDEDSSECSDGEWSASLPHRFSGTEKD QSSSDESWETLPGKDENEPELQSDSSGPEEENQELSLQEGEQTSLEEGEIPWLQYNEV NESSSDEGNEPANEFAQPAFMLDGNNNLEDDSSVSEDLDVDWSLFDGFADGLGVAEAI SYVDPQFLTYMALEERLAQAMETALAHLESLAVDVEVANPPASKESIDGLPETLVLED HTAIGQEQCCPICCSEYIKDDIATELPCHHFFHKPCVSIWLQKSGTCPVCRRHFPPAV IEASAAPSSEPDPDAPPSNDSIAEAP" BASE COUNT 1505 a 797 c 974 g 1489 t ORIGIN 1 gttctcgctc cggagccgct gcacatttcg gaatcttctg cggcttgtcc atagtgtgaa 61 taaaaactaa atcacatcta taattctact gaactggtca tacagacgct gccatatatg 121 tcacagtaca ctgaaaagga gccagcagca atggaccaag aatctggtaa ggctgtctgg 181 cccaaaccag caggagggta tcagacaatt acaggcagga gatatggaag aagacatgct 241 tatgtcagtt ttaaaccatg tatgaccaga catgaaagaa gcttaggtcg ggctggtgat 301 gactatgaag tgttggaact agatgatgtt ccaaaggaaa attcctcagg ttccagtcct 361 ttggatcaag ttgattcttc tttacccagt gaacctatat ttgaaaaaag tgaaacagaa 421 attcccactt gtggttcagc attgaatcaa accactgaga gcagtcaatc ctttgttgca 481 gtacatcaca gtgaggaagg cagggatacc ttaggaagca gtacaaatct tcataatcac 541 tctgagggag agtatattcc aggagcttgt agtgcttcaa gtgtccaaaa tggaattgca 601 ttggttcata cagactctta tgatccagat ggcaaacatg gagaagataa tgaccatctt 661 caactttctg cagaagtcgt ggaaggtagt agataccagg aatcattagg caatacagta 721 tttgagttgg aaaacagaga ggcagaggca tacactggtc tttcaccacc agttccctca 781 tttaactgtg aagtaagaga tgagtttgaa gagttagatt ctgtaccatt agtgaaaagt 841 tctgctggtg atactgagtt tgtccatcag aatagccagg aaattcagag gtcttctcaa 901 gatgaaatgg ttagtacgaa acaacaaaat aatactagcc aggaaagaca gacagaacat 961 tcacctgaag atgcagcctg tggtccaggg catatttgta gtgaacgaaa taccaatgat 1021 agggaaaaga accatggaag ttctcctgaa caggtagtga ggccaaaagt tagaaaactg 1081 ataagttcaa gccaggtgga ccaagaaaca ggttttaata ggcatgaggc gaaacaaaga 1141 agtgttcaaa gatggagaga ggctttggaa gttgaggaaa gtggctcaga tgacctctta 1201 ataaaatgtg aagaatatga tggagagcat gactgtatgt tcttggatcc accatactca 1261 agagttatta cacaaaggga aacagaaaat aaccaaatga catcagaaag tggagccaca 1321 gcgggaaggc aagaagtgga taacaccttt tggaatggct gtggagatta ttaccaactc 1381 tatgacaaag atgaagatag ttctgaatgc agtgatgggg aatggtctgc ttctttgcct 1441 catcgatttt ctggtacaga aaaagatcaa tcctcaagtg atgaaagctg ggagactctg 1501 ccaggaaaag atgagaatga acctgagcta caaagtgata gcagtggccc tgaagaagaa 1561 aaccaagaat tatctcttca ggaaggggaa cagacatcct tggaagaggg agaaattcct 1621 tggttacagt acaatgaagt caatgaaagc agcagtgatg agggaaatga acctgccaat 1681 gaatttgcac agccagcttt catgttggat ggtaacaata acctggagga tgactccagt 1741 gtgagtgaag acttagatgt ggattggagc ctatttgatg gctttgcaga tggactagga 1801 gttgctgaag ctatttcata tgtggatcct cagttcctta cctacatggc actagaagaa 1861 cgcttagccc aggctatgga gactgctctg gcccatttag agtctcttgc agtggatgtt 1921 gaggtggcca atccaccagc tagtaaggaa agcattgatg gtcttccaga gacccttgtt 1981 cttgaagatc acactgctat tggtcaggaa caatgctgtc caatctgttg cagtgagtat 2041 attaaggatg atatagcaac agagttgccc tgtcaccatt tctttcacaa accttgtgtc 2101 tcaatttggc tacaaaagtc gggaacatgc cctgtgtgcc gccgtcattt cccacctgcg 2161 gttattgaag catctgcagc tccttcctct gagcctgatc ctgatgcccc accttcaaat 2221 gacagtattg cagaagcacc ctaaaccttg acagttgaaa tgagatcagt gtatcaaagt 2281 aaatctgcaa attccttcta aatttcatgt gcaaataatt atatataaat atatttaaaa 2341 atgctatata tagtatatgc catagtttag aaagaatatt aacctttcta aactaaattt 2401 aggtttgcag aaagtattaa acatttttaa gctgaatgtt gagacagtgc atccattttc 2461 tttagttgaa tatgtttgta ttaattgtaa agccaagctt atcagttgac tctctccaga 2521 ataaataatc atctgtgtgg catacgttat tggctttgtc tgtaatactg ccactaagtg 2581 attataatta agctgtcctg tttgcatcaa atgtaaagac tgtgttcaca accgtagtaa 2641 aatttggttt cattggaaat gaaacaaatt ctaaagtatg ctttttcact agtccctttg 2701 attttgctat atcataacct cttggttcat atgctgagaa atttttcaga aagtccattt 2761 ttgtttaaaa ttagaacttt tcagaatgcc aaatgaaggc taaaattttg gccagaacat 2821 tacaaaagtt ttaaatcgta gacgtaactc cccctgaaat aaagttaggt agtaaaatcc 2881 ttaatgaaac cagtggatgt gcttaacgta aggttagtaa agcatacaaa gaatctagtg 2941 tgctcagggc ttggtacaat gagctgaatt agatggcctt atgaaactct ttctaacctc 3001 ttacccaacc tgtttctcct tggttaaaat tatacttgaa ggcccagaac actcatggca 3061 catttgttta atattgctta tagttagttt aaggtaattt tgcttctaca gtattttgga 3121 aggtctgaaa acttgcacag ggtcatcttt gtaattatat aaccccaaac taagatgcac 3181 aatgtctcct tcaggtgatc acacacagtg gacgagtatg tgcaaacatg gacataatag 3241 ttcacttaca aatgtgattt gatgttaaca ctagagaatg atgactgtag aacatttgag 3301 caagtaaaat agtaaagcac atagtgagtg tatgtccgtc taactggtac attgataatt 3361 tagtttgggc acataaaagg aatatttata tggcttccca aatgcagagt tacatcttat 3421 tcgtgtattt ctctgagtat ttatatcccg tctccttttt tcattcttaa aaataaatga 3481 attttcactg ttggcacata tgaggcttaa atataaggaa cataacactt gcattctaat 3541 ttttgcatat attgtaaatg tgtctggtat ttacagcaaa atactgtgta tccttttatg 3601 ggtaaaacaa aagtgaacat tgcatgcatg taatgtgatg aatttgtaat ttaggagttc 3661 tttggggctt ctgtgacttg ggaaatgctt acattcaggc cttaatgttg cattagctag 3721 catgttttcc ctctgatgta tatagtcact gttgtataaa ctaatctttg cttgttttct 3781 actctgtgat ctttccatat catatttcat taatgatcag ttagtgtcaa ggagtcaaaa 3841 cagattaaaa ttaatttcat gtgtatatgg tggaaatttg tggctagtgt gatttttgtt 3901 tgtttccttt taagtactgt tgatcagttg tgacacttac tggttaaact tacgttgcta 3961 aagatttctc tataataagc cacacattat atttagacta tattaaggga ccttggtttt 4021 cttctagata gcagctgtcc caaagaaaat atttcttctt tgtctgttaa gatttagcta 4081 ttatctgcca gttgttaaga ggttttggtt ccaaactcaa ccagcaatgt tgagagctga 4141 acttaagata gctgttgtac tttttgcttt ccatctgtta ctgtccttca ttcttggctc 4201 cctactatct ataaacagct gctgtgaaga agaaaagttg aataagagtt ggcttaaatt 4261 ttaaaaaaga aaaagaaaat tgaggtttta ggattttcat ggtaacaagc tctggtataa 4321 gctaaggctg gcaagttcag atactaaaat attatttgat catatcttgg atccttttga 4381 aaaagttaag actatatgaa ggtaaattag aaataagtat gaatattaat aaaatagcat 4441 ttatcttatt tctctatttt atgttgtgac ttaacctaat tttatttttt taacattttc 4501 ttatttctta taatatgaat gctgatattt aaaggtagat ctatgtggta ttctttgtgt 4561 ttcttaattg tttaactctt aagattattt gtgatctgga tttatgtatt tgttagatac 4621 atacgaattg ttaaaatgga atgcaagttt ttcaaaagcc caggtctaaa tgtaatggtt 4681 ggtttattgt tctataaccc cagcccatca ttttctgtgt aaatcataaa caataaacag 4741 aatatactcg gtggtcattt ctaat // LOCUS AB007903 5190 bp mRNA PRI 05-DEC-1997 DEFINITION Homo sapiens KIAA0443 mRNA, complete cds. ACCESSION AB007903 NID g2662166 KEYWORDS KIAA0443. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pBluescriptII SK plus clone:HJ0137. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ishikawa,K., Nagase,T., Nakajima,D., Seki,N., Ohira,M., Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. TITLE Prediction of the coding sequences of unidentified human genes. VIII. The complete sequences of 77 new cDNA clones from brain which can code for large proteins in vitro JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 5190) AUTHORS Ohara,O. TITLE Direct Submission JOURNAL Submitted (06-OCT-1997) to the DDBJ/EMBL/GenBank databases. Osamu Ohara, Kazusa DNA Research Institute, DNA Technology; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, Fax:+81-438-52-3914) FEATURES Location/Qualifiers source 1..5190 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HJ0137" /clone_lib="pBluescriptII SK plus" /sex="male" /tissue_type="brain" gene 654..4841 /gene="KIAA0443" CDS 654..4841 /gene="KIAA0443" /codon_start=1 /db_xref="PID:d1024620" /db_xref="PID:g2662167" /translation="MTGAEIESGAQVKPEKKPGEEVVGGAEIENDVPLVVRPKVRTQA QIMPGARPKNKSKVMPGASTKVETSAVGGARPKSKAKAIPVSRFKEEAQMWAQPRFGA ERLSKTERNSQTNIIASPLVSTDSVLVAKTKYLSEDRELVNTDTESFPRRKAHYQAGF QPSFRSKEETNMGSWCCPRPTSKQEASPNSDFKWVDKSVSSLFWSGDEVTAKFHPGNR VKDSNRSMHMANQEANTMSRSQTNQELYIASSSGSEDESVKTPWFWARDKTNTWSGPR EDPNSRSRFRSKKEVYVESSSGSEHEDHLESWFGAGKEGKFRSKMRAGKEANNRARHR AKREACIDFMPGSIDVIKKESCFWPEENANTFSRPMIKKEARARAMTKEEAKTKARAR AKQEARSEEEALIGTWFWATDESSMADEASIESSLQVEDESIIGSWFWTEEEASMGTG ASSKSRPRTDGERIGDSLFGAREKTSMKTGAEATSESILAADDEQVIIGSWFWAGEEV NQEAEEETIFGSWFWVIDAASVESGVGVSCESRTRSEEEEVIGPWFWSGEQVDIEAGI GEEARPGAEEETIFGSWFWAENQTYMDCRAETSCDTMQGAEEEEPIIGSWFWTRVEAC VEGDVNSKSSLEDKEEAMIPCFGAKEEVSMKHGTGVRCRFMAGAEETNNKSCFWAEKE PCMYPAGGGSWKSRPEEEEDIVNSWFWSRKYTKPEAIIGSWLWATEESNIDGTGEKAK LLTEEETIINSWFWKEDEAISEATDREESRPEAEEGDIVGSWFWAGEEDRLEPAAETR EEDRLAAEKEGIVGSWFGAREETIRREAGSCSKSSPKAEEEEVIIGSWFWEEEASPEA VAGVGFESKPGTEEEEITVGSWFWPEEEASIQAGSQAVEEMESETEEETIFGSWFWDG KEVSEEAGPCCVSKPEDDEEMIVESWFWSRDKAIKETGTVATCESKPENEEGAIVGSW FEAEDEVDNRTDNGSNCGSRTLADEDEAIVGSWFWAGDEAHFESNPSPVFRAICRSTC SVEQEPDPSRRPQSWEEVTVQFKPGPWGRVGFPSISPFRFPKEAASLFCEMFGGKPRN MVLSPEGEDQESLLQPDQPSPEFPFQYDPSYRSVQEIREHLRAKESTEPESSSCNCIQ CELKIGSEEFEELLLLMEKIRDPFIHEISKIAMGMRSASQFTRDFIRDSGVVSLIETL LNYPSSRVRTSFLENMIRMAPPYPNLNIIQTYICKVCEETLAYSVDSPEQLSGIRMIR HLTTTTDYHTLVANYMSGFLSLLATGNAKTRFHVLKMLLNLSENLFMTKELLSAEAVS EFIGLFNREETNDNIQIVLAIFENIGNNIKKETVFSDDDFNIEPLISAFHKVEKFAKE LQGKTDNQNDPEGDQEN" mutation 3048 /gene="KIAA0443" /note="nonsense mutation" /replace="a" mutation 3095 /gene="KIAA0443" /note="nonsense mutation" /replace="a" BASE COUNT 1472 a 979 c 1457 g 1282 t ORIGIN 1 tgggaatcca actgaagagc agccagagga gagctgaaga gaggaggggg aggccgatga 61 cctgggctct gggcctctga aggtctggcg tattctgaca ggacacagtg agcatctgta 121 gaggagaggc ttgaaataaa ggaggagcac gaatattccc tggatttctg gaggcctgct 181 ttaaggctgg ccagttctgc aagaaaggca aggaggagga gactggctca cacctctgga 241 ggaccccctt ctgtcagctg tggggcttga cactacttga acaagaaaag gagggggaaa 301 ctgcaccaca taagtgaaga tccacctcca gtggctgctc tgctggtggt ggggttgctg 361 ctgacaacca ccctcaacgg gtctgcaccc atccaggaaa tctctgtctt cctcaagctt 421 ggttgtgcct gttctacact ctatctgtat tattgaatta ctgactgaga ctgtgtttgg 481 gaaggaggct gagtgactac tggactggat attgactcta actcttgttt ccaagcttat 541 atcctcaatc acctaaagat cagagtgtga agaaacaaac ctgtgacaga tctgtggttg 601 aggtttagac tgggggagga gtatagtact ggactttctt tgtaacttgt accatgactg 661 gggcagagat tgagtctggt gcccaggtca agcctgaaaa gaagcctggg gaagaggttg 721 taggtggggc tgagatagag aatgatgtcc ctctggtggt cagacccaag gttaggaccc 781 aggcccagat aatgcctggg gcaaggccca agaataagtc caaggttatg cctggagcaa 841 gcaccaaagt tgagacaagt gcagtgggtg gggcacgccc taagagtaag gccaaggcaa 901 tacctgtttc acgatttaag gaagaagccc agatgtgggc tcagcccagg tttggtgctg 961 aaagattgtc taagacagag agaaactccc agaccaatat catagcctct ccacttgtca 1021 gtactgattc tgtcttggtt gctaaaacaa agtacctgtc tgaggataga gaactggtta 1081 atacagacac tgagagcttt cctagaagga aggcccatta ccaagcagga ttccagcctt 1141 cttttaggtc aaaggaggag accaatatgg ggtcctggtg ctgtcctagg cctacatcca 1201 aacaagaagc ctctcctaat tctgatttca aatgggtaga caaatctgtg agttccttgt 1261 tctggagtgg agatgaggtc actgcaaaat ttcatcctgg gaatagggta aaagacagta 1321 acagatccat gcacatggcc aatcaagagg ctaataccat gtctaggtcc caaactaacc 1381 aggagctcta tattgcatct agttctggtt ctgaggatga gtctgttaag acaccctggt 1441 tctgggccag agataaaacc aatacctggt ctgggcccag ggaagatccc aatagcaggt 1501 ccaggtttag gtctaagaaa gaagtctatg ttgaatcaag ttctggatct gagcatgaag 1561 accatttgga gtcctggttt ggggctggaa aggagggcaa attcaggtcc aaaatgagag 1621 ctgggaagga ggccaataac agggccaggc acagggccaa gcgagaagct tgcattgatt 1681 tcatgcctgg gtctatagat gtaattaaaa aagagtcctg tttctggcct gaagaaaatg 1741 ctaatacctt ttcaaggccc atgatcaaga aagaggccag ggccagagca atgacaaagg 1801 aagaggccaa aaccaaggcc cgagccaggg ccaagcaaga agccaggtca gaggaggaag 1861 ccctcattgg gacctggttc tgggctacag acgagtccag catggcagat gaagccagca 1921 tagagtccag tctacaagtg gaggatgagt ccataattgg gagttggttc tggactgaag 1981 aagaggccag tatggggact ggggctagca gtaaatccag accaaggact gatggggagc 2041 gtattggtga ttccttattt ggggctaggg aaaagaccag tatgaaaact ggggctgagg 2101 ccacctctga atctatacta gcagctgatg atgaacaggt cattattggt tcctggttct 2161 gggctggtga agaggtcaac caagaggctg aggaagagac catttttggg tcgtggttct 2221 gggtcattga tgcggccagt gtggaatctg gtgttggggt cagctgtgag tccaggacaa 2281 ggtctgagga agaagaggtc attggtccct ggttttggtc tggagaacaa gttgatatag 2341 aggctggaat cggagaagag gccaggccag gagctgaaga agagacaata ttcgggtcct 2401 ggttttgggc tgaaaaccag acctatatgg attgtagggc tgaaactagc tgtgacacca 2461 tgcaaggggc tgaggaggag gagcccatta ttgggtcctg gttttggacc agagtagaag 2521 cttgtgtgga gggtgatgtc aacagcaagt ctagcctgga ggacaaggaa gaggccatga 2581 taccatgttt tggagccaaa gaagaggtca gtatgaagca tgggactggt gtcagatgca 2641 gatttatggc aggggctgag gagaccaata ataagtcttg cttctgggca gaaaaagaac 2701 cctgtatgta tcctgccggt ggaggaagtt ggaagtctag gccagaggag gaagaggaca 2761 ttgtcaattc gtggttctgg tccagaaaat acacaaagcc agaggccatt atagggtcct 2821 ggttatgggc tacagaagag agtaatatag atgggactgg agaaaaggcc aagttactga 2881 ctgaagagga gaccataatc aattcctggt tctggaaaga agatgaagcc atttcagagg 2941 ctactgacag agaagagtcc aggccagaag ctgaggaggg ggacattgtt ggttcttggt 3001 tctgggctgg agaagaggac agactagagc cagctgctga gactagagaa gaagacaggc 3061 tagcagctga gaaagaaggt attgttgggt cctggtttgg ggccagagaa gagaccatta 3121 gaagagaggc tgggtcttgc agcaaatcca gtcctaaagc tgaagaggaa gaagtcatta 3181 ttgggtcctg gttctgggaa gaagaggcca gtccggaggc agtggcagga gtcggctttg 3241 agtcaaagcc tgggactgag gaggaagaaa tcactgttgg gtcctggttc tggcctgaag 3301 aagaagccag tatacaggct ggatctcagg cagtagagga aatggagtca gagactgaag 3361 aggaaaccat ttttgggtcc tggttctggg atggaaaaga agtcagtgaa gaagcaggac 3421 catgctgtgt atccaagcca gaggatgatg aagagatgat tgttgagtcc tggttctggt 3481 ctagagacaa agccattaag gaaactggaa ctgtggccac ctgtgagtcc aagccagaaa 3541 atgaggaagg ggccattgtt gggtcttggt ttgaggctga agatgaggta gataacagga 3601 ctgacaatgg aagcaactgt gggtccagga cattagctga tgaagatgag gccatagtgg 3661 ggtcctggtt ctgggcagga gatgaggccc attttgaatc aaatcctagc cccgtgttca 3721 gggccatttg caggtccacg tgttcagttg aacaggagcc tgatccttca cgcaggcctc 3781 agagttggga ggaggtcact gttcagttca agcctggtcc atggggtagg gtcggcttcc 3841 catctataag cccctttaga tttccgaaag aggcagcatc tttattctgt gaaatgtttg 3901 ggggcaaacc caggaacatg gtacttagcc cagaagggga agatcaggaa tctttgcttc 3961 agcctgatca gcctagtcct gagttcccat ttcagtatga tccttcctac aggtcagtcc 4021 aggaaattcg agagcatctt agggccaagg agagtacaga gcctgagagt tcatcctgta 4081 actgcataca atgtgagctg aaaattggtt ctgaagagtt tgaagaactc cttttattaa 4141 tggaaaaaat tcgggatcct tttattcatg aaatatctaa aatcgcaatg ggtatgagaa 4201 gtgcttctca atttacccga gatttcattc gagattcagg tgttgtctca cttattgaaa 4261 ccttgcttaa ttatccgtcc tcccgagtta gaacaagttt tttggaaaat atgattcgca 4321 tggccccacc ttatccgaat ctaaacataa ttcagacata catatgtaaa gtgtgtgagg 4381 aaacccttgc ttatagcgtg gattccccgg aacagctgtc tggaataagg atgattagac 4441 atctcactac tactactgac tatcacacac tggttgccaa ttatatgtct gggtttctct 4501 ccttattagc tacaggcaat gccaaaacaa ggtttcatgt tttgaaaatg ctactgaatt 4561 tgtctgaaaa tcttttcatg acaaaagaac tactcagtgc tgaagcagtg tcagaattta 4621 taggcctctt taacagggaa gagacaaatg acaatattca aattgttctt gcaatatttg 4681 agaatattgg caacaatatc aaaaaagaaa cagtgttctc tgatgatgat ttcaatattg 4741 agccgcttat ttctgcattc cacaaagttg agaaatttgc taaggaactg caaggcaaaa 4801 cagacaatca aaatgaccct gaaggggacc aagaaaatta gtaatggtta attgctggcc 4861 tcagattgtc cttatgttcc tgagttatga tccttgagta atgctttgat tttaatagtt 4921 ggttctgtgt tgcaacatat atctttagtg ctgacactaa ctttgtccaa ctctgtctgt 4981 aagctggagc atttttctga tgccagctga atattagagc tgaaaacaca tttgttgata 5041 tttgtcttgt ccacattgtg atgttcagta tttgagctta tagtgaactg agcaatcata 5101 aataagccac ccttctgatt gtcgttctac tgtatatata tatatatttg agtgttgttt 5161 gtgtttcaat aaagtcctat gttaaagttg // LOCUS AB008109 2076 bp mRNA PRI 20-OCT-1997 DEFINITION Homo sapiens mRNA for RGS5, complete cds. ACCESSION AB008109 NID g2554613 KEYWORDS RGS5. SOURCE Homo sapiens neuroblastoma cDNA to mRNA, clone:nb-20. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Seki,N., Sugano,S., Muramatsu,M. and Nakagawara,A. TITLE Human G protein signaling regulator RGS5 JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 2076) AUTHORS Seki,N. TITLE Direct Submission JOURNAL Submitted (14-OCT-1997) to the DDBJ/EMBL/GenBank databases. Naohiko Seki, Kazusa DNA Research Institute, Gene Structure I; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:nseki@kazusa.or.jp, Tel:+81-438-52-3932, Fax:+81-438-52-3931) FEATURES Location/Qualifiers source 1..2076 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /clone="nb-20" /map="1q23" /tissue_type="neuroblastoma" CDS 82..627 /function="G protein signaling regulator" /codon_start=1 /product="RGS5" /db_xref="PID:d1023759" /db_xref="PID:g2554614" /translation="MCKGLAALPHSCLERAKEIKIKLGILLQKPDSVGDLVIPYNEKP EKPAKTQKTSLDEALQWRDSLDKLLQNNYGLASFKSFLKSEFSEENLEFWIACEDYKK IKSPAKMAEKAKQIYEEFIQTEAPKEVNIDHFTKDITMKNLVEPSLSSFDMAQKRIHA LMEKDSLPRFVRSEFYQELIK" BASE COUNT 674 a 399 c 371 g 632 t ORIGIN 1 agacagtttt gaagttttca aagactggct ctgctgttaa gaagttgtac ttaaagcgga 61 ggagctaagc cacctgccaa aatgtgcaaa ggacttgcag ctttgcccca ctcatgcctg 121 gaaagggcca aggagattaa gatcaagttg ggaattctcc tccagaagcc agactcagtt 181 ggtgaccttg tcattccgta caatgagaag ccagagaaac cagccaagac ccagaaaacc 241 tcgctggacg aggccctgca gtggcgtgat tccctggaca aactcctgca gaacaactat 301 ggacttgcca gtttcaaaag tttcctgaag tctgaattca gtgaggaaaa ccttgagttc 361 tggattgcct gtgaggatta caagaagatc aagtcccctg ccaagatggc tgagaaggca 421 aagcaaattt atgaagaatt cattcaaacg gaggctccta aagaggtgaa tattgaccac 481 ttcactaagg acatcacaat gaagaacctg gtggaacctt ccctgagcag ctttgacatg 541 gcccagaaaa gaatccatgc cctgatggaa aaggattctc tgcctcgctt tgtgcgctct 601 gagttttatc aggagttaat caagtagtaa tttagccagg ctatgaaatc atcctgtgag 661 ttatttcctc cataataacc ctgcatttcc cattaatcta catatcttcc cacagcagct 721 ttgctcagtg atacccacat gggaaaaatc ccaggggatg ttgcttactc tttttgccca 781 cactgctttg gatacttatc tactgtccga aggccttctt tccccactca attcttcctg 841 ccctgttatt aattaagata tcttcagctt gtagtcagac ccaatcagaa tcacagaaaa 901 atcctgccta aggcaaagaa atataagaca agactatgat atcaatgaat gtgggttaag 961 taatagattt ccagctaaat tggtctaaaa aagaatatta agtgtggaca gacctatttc 1021 aaaggagctt aattgatctc acttgtttta gttctgatcc agggagatca cccctctaat 1081 tatttctgaa cttggttaat aaaagtttat aagattttta tgaagcagcc actgtatgat 1141 attttaagca aatatgttat ttaaaatatt gatccttccc ttggaccacc ttcatgttag 1201 ttgggtatta taaataagag atacaaccat gaatatatta tgtttataca aaatcaatct 1261 gaacacaatt cataaagatt tctcttttat accttcctca ctggccccct ccacctgccc 1321 atagtcacca aattctgttt taaatcaatg acctaagatc aacaatgaag tattttataa 1381 atgtatttat gctgctagac tgtgggtcaa atgtttccat tttcaaatta tttagaattc 1441 ttatgagttt aaaatttgta aatttctaaa tccaatcatg taaaatgaaa ctgttgctcc 1501 attggagtag tctcccacct aaatatcaag atggctatat gctaaaaaga gaaaatatgg 1561 tcaagtctaa aatggctaat tgtcctatga tgctattatc atagactaat gacatttatc 1621 ttcaaaacac caaattgtct ttagaaaaat taatgtgatt acaggtagag gccttctagg 1681 tgagacactt ttaaggtaca ctgcattttg cagaaaaaaa aaaaaaaaag taatctttta 1741 gcaaccccag tattccttca ctatttcgct tcctgcatta gcaaatttta cttacagtca 1801 aaagtgcaga tttatactcc tgacgtgtct cattcacagc taaataatag gccataggac 1861 ttttggtagg tttaaacttt taattctgta tttcatgatt ataagtcttg ctagaatttt 1921 ttctaatctt tagtagattt gattaaataa tgattcacag aatttagtaa cagaatcaaa 1981 ctaagccatg tatgagggta atcgagatga ggatattaac tcaaaagaaa tagggtgatt 2041 tttaaaggat taataaaatt ctgaaatgtt aagtag // LOCUS AB008375 2375 bp mRNA PRI 27-OCT-1997 DEFINITION Homo sapiens mRNA for osteoblast specific cysteine-rich protein, complete cds. ACCESSION AB008375 NID g2570151 KEYWORDS osteoblast specific cysteine-rich protein. SOURCE Homo sapiens trabecular bone osteoblast cell_line:primary culture cDNA to mRNA, clone:GS3841. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2375) AUTHORS Ohno,I., Matsubara,K. and Okubo,K. TITLE The cloning and characterization of a cDNA for OSCP (osteoblast specific cysteine-rich protein) JOURNAL Published Only in DataBase (1997) In press REFERENCE 2 (bases 1 to 2375) AUTHORS Ohno,I., Matsubara,K. and Okubo,K. TITLE Direct Submission JOURNAL Submitted (21-OCT-1997) to the DDBJ/EMBL/GenBank databases. Ikko Ohno, Institute for Molecular and Cellular Biology, Osaka University, Molecular Genetics; 1-3 Yamada-oka, Suita, Osaka 565, Japan (E-mail:ikko@imcb.osaka-u.ac.jp, Tel:81-6-879-7992, Fax:81-6-877-1922) FEATURES Location/Qualifiers source 1..2375 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="primary culture" /cell_type="osteoblast" /clone="GS3841" /tissue_type="trabecular bone" CDS 379..1590 /note="OSCP" /codon_start=1 /product="osteoblast specific cysteine-rich protein" /db_xref="PID:d1023870" /db_xref="PID:g2570152" /translation="MSGCARPTTGAPVQAMVSVTVGKCKCDQGWYGDACQYPTNCDLT KKKSNQMCKNSQDIICSNSGTCHCGRCKCDNSDGSGLVYGKFCECDDRECIDDETEEI CGGHGKCYCGNCYCKAGWHGDKCEFQCDITPWESKRRCTSPDGKICSNRGTCVCGECT CHDVDPTGDWGDIHGDTCECDERDCRAVYDRYSDDFCSGHGQCNCGRCDCKAGWYGKK CEHPQSCTLSAEESIRKCQGSSDLPCSGRGKCECGKCTCYPPGDRRVYGKTCECDDRR CEDLDGVVCGGHGTCSCGRCVCERGWFGKLCQHPRKCNMTEEQSKNLCESADGILCSG KGSCHCGKCICSAEEWYISGEFCDCDDRDCDKHDGLICTGNGICSCGNCECWDGWNGN ACEIWLGSEYP" polyA_site 2375 /note="21 A nucleotides" BASE COUNT 655 a 480 c 641 g 599 t ORIGIN 1 agctcatcaa cgcaattgca actccggctg gagccccgga cctgcaagcc tgggtgtccg 61 tgggtccgtc tgcccagcca tctgctggtg gcacctctcc ctcctgccgc ctccctcggt 121 gaaccccacc ttgcaaaagt gcagctcgcc cggagcagcc cagaagctca gcatgcgtcc 181 cccaggcttc aggaacttct tgctgctggc gtcctccctt ctctttgctg ggttgtcagc 241 tgttcctcaa agcttctcgc catctctgag gagctggccg ggcgccgcct gcaggctgtc 301 ccgggccgag tcgcagcggc gtctgcatct gccacgtgac tgagccgggc atgttcttcg 361 ggcccctgtg tgagtgccat gagtgggtgt gcgagaccta cgacgggagc acctgtgcag 421 gccatggtaa gtgtgactgt gggcaagtgc aagtgtgacc agggatggta tggggatgct 481 tgccagtacc caactaactg tgacttgaca aagaagaaaa gtaaccaaat gtgcaagaat 541 tcacaagaca tcatctgctc taattcaggt acatgtcact gtggcaggtg taagtgtgat 601 aattcagatg gaagtggact tgtgtatggt aaattttgtg agtgtgacga tagagaatgc 661 atagacgatg aaacagaaga aatatgtgga ggccatggga agtgttactg tggaaactgc 721 tactgcaagg ctggttggca tggagataaa tgtgaattcc agtgcgatat caccccctgg 781 gaaagcaagc gaagatgcac gtctccagat ggcaaaatct gcagtaacag agggacttgt 841 gtatgtggtg aatgtacctg tcacgatgtt gatccgactg gggactgggg agatattcat 901 ggggacacct gtgaatgtga tgagagggac tgtagagctg tctatgaccg atattctgat 961 gacttctgtt caggtcatgg acagtgtaat tgcggaagat gtgactgcaa agcaggctgg 1021 tatgggaaga agtgtgagca cccacagtcc tgcacgctgt cagctgagga gagcatcagg 1081 aagtgccagg gaagctcgga tctgccttgc tctgggaggg gtaaatgtga atgtggcaaa 1141 tgcacctgct atcctccagg agatcgccgg gtgtatggca agacttgtga gtgtgatgat 1201 cgccgctgtg aagacctcga tggtgtggtc tgtggaggcc acggcacatg ttcctgtggt 1261 cgctgtgttt gtgagagagg atggtttgga aagctctgcc aacatccgcg gaagtgtaac 1321 atgacggaag aacaaagcaa gaatctgtgt gaatcagcag atggcatatt gtgctcgggg 1381 aagggttctt gtcattgtgg gaagtgcatt tgttctgctg aggagtggta tatttctggg 1441 gagttctgtg actgtgatga cagagactgc gacaaacatg atggtctcat ttgtacaggg 1501 aatggaatat gtagctgtgg aaactgtgaa tgctgggatg gatggaatgg aaatgcatgt 1561 gaaatctggc ttggctcaga atatccttaa caattacatg agagaggtct ggattcttat 1621 tttttctggg ccattagaac atataaatgc gaaggaaacc atgtatattc accactagga 1681 caggttaaaa agaccattgt atgtttttct atttctgaat tacgaatgaa atccgagtac 1741 ctattagaaa tgagttatgc aaatttagat gcaaataaca ttagaaaaaa aagattcttc 1801 cataattaac ataagtggtt cctaacgaga gcaatttttc cacccaaaag tcatttggca 1861 acatctacag acaattttga ttgtcacact gggtcgggta ggaaggtatg ctgcagacat 1921 ttggtgggta gaggccaggg atgctgctga gcatcccgca gtgtacagga cagcccccaa 1981 acaaggaatt atccagcccc aaatgccaat agggctcaaa ctgagaaaca ttgagttata 2041 tggctattag aaatccacat tcttacacaa gaaagaccat attagaatct aaggaaaaca 2101 tgcatattca cattaattaa tcgatcagat ttttccagaa ttccgtatca gtcaccattt 2161 taatatgggg acaatgaaga caagcacaca ggaggtagaa tatcagagtg gggctggatc 2221 aagggcaaaa actggtcatt aagtcatctg acattaaatc atttagccac taagttattt 2281 gtctactctc actttaaact caccaaagaa gattctctta aagaaattat gaaaaatgta 2341 caatttaaca ttttaaataa atagtgacag aagtt // LOCUS AB008430 3442 bp mRNA PRI 13-JAN-1998 DEFINITION Homo sapiens mRNA for CDEP, complete cds. ACCESSION AB008430 NID g2766164 KEYWORDS CDEP. SOURCE Homo sapiens embryo cartilage chondrocyte cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Koyano,Y., Kawamoto,T., Shen,M., Yan,W., Noshiro,M., Fujii,K. and Kato,Y. TITLE Molecular cloning and characterization of CDEP, a novel human protein containing the ezrin-like domain of the band 4.1 superfamily and the Dbl homology domain of Rho guanine nucleotide exchange factors JOURNAL Biochem. Biophys. Res. Commun. 241 (2), 369-375 (1997) MEDLINE 98086358 REFERENCE 2 (bases 1 to 3442) AUTHORS Koyano,Y., Kawamoto,T. and Kato,Y. TITLE Direct Submission JOURNAL Submitted (22-OCT-1997) to the DDBJ/EMBL/GenBank databases. Takeshi Kawamoto, Hiroshima University School of Dentistry, Department of Biochemistry; 1-2-3 Kasumi Minami-ku, Hiroshima, Hiroshima 734, Japan (E-mail:tkawamo@ipc.hiroshima-u.ac.jp, Tel:082-257-5688, Fax:082-257-5629) FEATURES Location/Qualifiers source 1..3442 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="chondrocyte" /dev_stage="embryo" /tissue_type="cartilage" CDS 49..3186 /function="Rho Guanine Nucleotide Exchange Factor" /note="Band 4.1 superfamily" /codon_start=1 /product="CDEP" /db_xref="PID:d1025178" /db_xref="PID:g2766165" /translation="MGEIEQRPTPGSRLGAPENSGISTLERGQKPPPTPSGKLVSIKI QMLDDTQEAFEVPQRAPGKVLLDAVCNHLNLVEGDYFGLEFPDHKKITVWLDLLKPIV KQIRRPKHVVVKFVVKFFPPDHTQLQEELTRYLFALQVKQDLAQGRLTCNDTSAALLI SHIVQSEIGDFDEALDREHLAKNKYIPQQDALEDKIVEFHHNHIGQTPAESDFQLLEI ARRLEMYGIRLHPAKDREGTKINLAVANTGILVFQGFTKINAFNWAKVRKLSFKRKRF LIKLRPDANSAYQDTLEFLMASRDFCKSFWKICVEHHAFFRLFEEPKPKPKPVLFSRG SSFRFSGRTQKQVLDYVKEGGHKKVQFERKHSKIHSIRSLASQPTELNSEVLEQSQQS TSLTFGEGAESPGGQSCRRGKEPKVSAGEPGSHPSPAPRRSPAGNKQADGAASAPTEE EEEVVKDRTQQSKPQPPQPSTGSLTGSPHLSELSVNSQGGVAPANVTLSPNLSPDTKQ ASPLISPLLNDQACPRTDDEDEGRRKRFPTDKAYFIAKEVSTTERTYLKDLEVITSWF QSTVSKEDAMPEALKSLIFPNFEPLHKFHTNFLKEIEQRLALWEGRSNAQIRDYQRIG DVMLKNIQGMKHLAAHLWKHSEALEALENGIKSSRRLENFCRDFELQKVCYLPLNTFL LRPLHRLMHYKQVLERLCKHHPPSHADFRDCRAALAEITEMVAQLHGTMIKMENFQKL HELKKDLIGIDNLVVPGREFIRLGSLSKLSGKGLQQRMFFLFNDVLLYTSRGLTASNQ FKVHGQLPLYGMTIEESEDEWGVPHCLTLRGQRQSIIVAASSRSEMEKWVEDIQMAID LAEKSSSPAPEFLASSPPDNKSPDEATAADQESEDDLSASRTSLERQAPHRGNTMVHV CWHRNTSVSMVDFSIAVENQLSGNLLRKFKNSNGWQKLWVVFTNFCLFFYKSHQDNHP LASLPLLGYSLTIPSESENIQKDYVFKLHFKSHVYYFRAESEYTFERWMEVIRSATSS ASRPHVLSHKESLVY" polyA_site 3442 /note="50 a nucleotides" BASE COUNT 864 a 952 c 927 g 699 t ORIGIN 1 cgccgcagcc gccggcgctg tggagatatt ctctaagccg ctttcatcat gggagaaata 61 gagcagaggc cgaccccagg atcacgactg ggggccccgg aaaattcggg gatcagtacc 121 ttggaacgtg gacagaagcc gcccccaaca ccttcaggaa aactcgtgtc catcaaaatc 181 cagatgctgg atgacaccca ggaggcattt gaagttccac aaagagctcc tgggaaggtg 241 ctgctggatg cagtttgcaa ccacctcaac ctcgtggaag gtgactattt tggcctcgag 301 tttcctgatc acaaaaagat cacggtgtgg ctggatctcc taaaacccat tgtgaaacag 361 attagaaggc caaagcacgt tgttgttaag tttgtggtga aattctttcc gcctgaccac 421 acacaactcc aagaagaact cacaaggtac ctgttcgcgc tgcaggtgaa gcaggacttg 481 gctcaaggca ggttgacgtg taatgacacc agcgcagctc tcttgatttc acacattgtg 541 caatctgaga ttggggattt tgatgaagcc ttggacagag agcacttagc aaaaaataaa 601 tacatacctc agcaagacgc actagaggac aaaatcgtgg aatttcacca taaccacatt 661 ggacaaacac cagcagaatc agatttccag ctcctagaga ttgcccgtcg gctagagatg 721 tatggaatcc ggttgcaccc ggccaaggac agggaaggca cgaagatcaa tctggccgtt 781 gccaacacgg gaattctagt gtttcagggt ttcactaaga tcaatgcctt caactgggcc 841 aaggtgcgga agctgagctt caagaggaag cgctttctca tcaagctccg gccagatgcc 901 aatagtgcgt accaggatac cttggaattc ctgatggcca gtcgggattt ctgcaagtcc 961 ttctggaaaa tctgtgttga acatcatgcc ttctttagac tttttgaaga gcccaaacca 1021 aagcccaagc ccgtcctctt tagccggggg tcatcatttc ggttcagtgg tcggactcag 1081 aagcaggttc tcgactatgt taaagaagga ggacataaga aggtgcagtt tgaaaggaag 1141 cacagcaaga ttcattctat ccggagcctt gcttcacagc ctacagaact gaattcggaa 1201 gtgctggagc agtctcagca gagcaccagc cttacatttg gagaaggtgc cgaatctcca 1261 gggggccaga gctgccggcg aggaaaggaa ccgaaggttt ccgccgggga gccggggtcg 1321 cacccgagcc ctgcgccgag gagaagcccc gcgggtaaca agcaggcgga cggagccgcc 1381 tcggcgccca cggaggaaga ggaggaggtc gttaaggata ggacccagca gagtaaacct 1441 cagcccccgc agccaagcac aggctccctg actggcagtc ctcacctttc cgagctgtct 1501 gtgaactcgc aggggggagt ggcccctgcc aacgtgacct tgtctcccaa cctgagcccc 1561 gacaccaagc aggcctctcc cttgatcagc ccgctgctga atgaccaggc ctgcccccgg 1621 acggacgatg aggatgaggg ccggaggaag agattcccaa ctgataaagc gtacttcata 1681 gctaaggaag tgtctaccac cgagcgaaca tatctgaagg atctcgaagt tatcacttcg 1741 tggtttcaga gcacagtgag caaagaggac gccatgccgg aagcactgaa aagtctcata 1801 ttcccgaatt ttgaaccttt gcacaaattt catactaatt ttctcaagga aattgagcaa 1861 cgacttgccc tgtgggaagg ccgctcaaat gcccaaatca gagattacca aagaatcggc 1921 gatgtcatgc tgaagaacat tcagggcatg aagcacctgg cggctcacct gtggaagcac 1981 agcgaggcct tggaggccct ggagaatgga atcaagagct cccggcggct ggagaacttc 2041 tgcagagact ttgagctgca gaaggtgtgt tacctaccgc tcaacacctt cctcctgcgg 2101 ccactgcacc ggctcatgca ctacaagcag gtcctggagc ggctgtgcaa acaccacccg 2161 ccgagccacg ccgacttcag ggactgccga gccgctttgg cagagatcac ggagatggtg 2221 gcacagctcc acggtacgat gatcaagatg gagaatttcc agaagctgca cgaactcaag 2281 aaagatttga ttggcattga caatcttgtg gttccgggaa gggagttcat ccgtctgggc 2341 agcctcagca agctctcggg gaaggggctc cagcagcgca tgttcttcct gttcaacgac 2401 gtcctgctat acacgagccg ggggctgacg gcctccaatc agtttaaagt ccacgggcag 2461 ctcccgctct atggcatgac gattgaggag agcgaagacg agtggggggt gccccactgc 2521 ctgaccctcc ggggccagcg gcagtccatc atcgtggccg ccagttctcg gtccgagatg 2581 gagaagtggg ttgaggacat ccagatggcc attgacctgg cggagaagag cagcagcccc 2641 gcccctgagt tcctggccag cagcccccct gacaacaagt cccctgatga agccaccgcg 2701 gctgaccagg agtcagagga tgacctgagc gcctcgcgca catcgctgga gcgccaggcc 2761 ccgcaccgcg gcaacacaat ggtgcacgtg tgctggcacc gcaacaccag cgtctccatg 2821 gtggacttca gcatcgcagt ggagaatcag ttgtctggaa acctgctgag gaaattcaaa 2881 aacagcaacg ggtggcagaa gctgtgggtg gtgttcacaa acttctgcct gttcttctac 2941 aaatcacacc aggacaatca tccccttgcc agcctgcctc tgctcggcta ctcgctcacc 3001 atcccctctg agtccgagaa catccagaaa gactacgtgt tcaagctgca cttcaagtcc 3061 cacgtctact acttcagggc ggaaagcgag tacacgttcg aaaggtggat ggaagtgatc 3121 cgcagtgcca ccagctctgc ctcgcgaccc cacgtgttga gccacaaaga gtctcttgtg 3181 tattgatggc cggacacact cgtttccgca gtggctgctt tcctggaaga cgtttccttt 3241 cttctgtatt aatgaagcct ggtaaaatta acacctgtct gaaaatcaaa aacatggctt 3301 cccagcagct ctcctgtctc cacagccgcg ttttttaacc ccgacctctc agcgtttgaa 3361 tgaacagcgc tcccacctcc agtcctggca tccgctgggg gcgctgttct ttagctagtg 3421 ccagtattaa aacattgtca tt // LOCUS AB008913 1088 bp mRNA PRI 24-JAN-1998 DEFINITION Homo sapiens mRNA for Pax-4, complete cds. ACCESSION AB008913 NID g2809074 KEYWORDS Pax-4. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Matsushita,T., Yamaoka,T., Otsuka,S., Moritani,M., Matsumoto,T. and Itakura,M. TITLE Molecular cloning of mouse paired-box-containing gene (Pax)-4 from an islet beta cell line and deduced sequence of human Pax-4 JOURNAL Biochem. Biophys. Res. Commun. 242 (1), 176-180 (1998) MEDLINE 98102804 REFERENCE 2 (bases 1 to 1088) AUTHORS Matsushita,T. and Itakura,M. TITLE Direct Submission JOURNAL Submitted (18-NOV-1997) to the DDBJ/EMBL/GenBank databases. Takaya Matsushita, The University of Tokushima, Otsuka department of Clinical and Molecular Nutrition; 3-18-15 Kuramoto-cho, Tokushima, Tokushima 770, Japan (E-mail:matsushita@nutr.med.tokushima-u.ac.jp, Tel:81-886-31-9804, Fax:81-886-31-9476) FEATURES Location/Qualifiers source 1..1088 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1..1053 /codon_start=1 /product="Pax-4" /db_xref="PID:d1025423" /db_xref="PID:g2809075" /translation="MHQDGISSMNQLGGLFVNGRPLPLDTRQQIVRLAVSGMRPCDIS RILKVSNGCVSKILGRYYRTGVLEPKGIGGSKPRLATPPVVARIAQLKGECPALFAWE IQRQLCAEGLCTQDKTPSVSSINRVLRALQEDQGLPCTRLRSPAVLAPAVLTPHSGSE TPRGTHPGTGHRNRTIFSPSQAEALEKEFQRGQYPDSVARGKLATATSLPEDTVRVWF SNRRAKWRRQEKLKWEMQLPGASQGLTVPRVAPGIISAQQSPGSVPTAALPALEPLGP SCYQLCWATAPERCLSDTPPKACLKPCWDCGSFLLPVIAPSCVDVAWPCLDASLAHHL IGGAGKATPTHFSHWP" BASE COUNT 222 a 333 c 306 g 227 t ORIGIN 1 atgcatcagg acgggatcag cagcatgaac cagcttgggg ggctctttgt gaatggccgg 61 cccctgcctc tggatacccg gcagcagatt gtgcggctag cagtcagtgg aatgcggccc 121 tgtgacatct cacggatcct taaggtatct aatggctgtg tgagcaagat cctagggcgt 181 tactaccgca caggtgtctt ggagccaaag ggcattgggg gaagcaagcc acggctggct 241 acaccccctg tggtggctcg aattgcccag ctgaagggtg agtgtccagc cctctttgcc 301 tgggaaatcc aacgccagct ttgtgctgaa gggctttgca cccaggacaa gactcccagt 361 gtctcctcca tcaaccgagt cctgcgggca ttacaggagg accagggact accgtgcaca 421 cggctcaggt caccagctgt tttggctcca gctgtcctca ctccccatag tggctctgag 481 actccccggg gtacccaccc agggaccggc caccggaatc ggactatctt ctccccaagc 541 caagcagagg cactggagaa agagttccag cgtgggcagt atcctgattc agtggcccgt 601 ggaaagctgg ctactgccac ctctctgcct gaggacacgg tgagggtctg gttttccaac 661 agaagagcca aatggcgtcg gcaagagaag ctcaagtggg aaatgcagct gccaggtgct 721 tcccaggggc tgactgtacc aagggttgcc ccaggaatca tctctgcaca gcagtcccct 781 ggcagtgtgc ccacagcagc cctgcctgcc ctggaaccac tgggtccctc ctgctatcag 841 ctgtgctggg caacagcacc agaaaggtgt ctgagtgaca ccccacctaa agcctgtctc 901 aagccctgct gggactgtgg ctccttcctc cttcctgtga ttgctccctc ctgtgtggac 961 gttgcctggc cctgcctcga tgcctctctg gcgcatcacc tgattggagg ggctggtaaa 1021 gcaacaccca cccacttctc acactggcct taagaggcct ccactcagca gtaataaaag 1081 ctgttttt // LOCUS AB009284 1408 bp mRNA PRI 26-DEC-1997 DEFINITION Homo sapiens EXTR2 mRNA, complete cds. ACCESSION AB009284 NID g2723392 KEYWORDS EXTR2. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Saito,T., Seki,N., Hayashi,A., Kozuma,S. and Hori,T. TITLE Structure, chromosome location and expression of two novel EXT-related genes JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 1408) AUTHORS Saito,T. TITLE Direct Submission JOURNAL Submitted (26-NOV-1997) to the DDBJ/EMBL/GenBank databases. Toshiyuki Saito, National Institute of Radiological Sciences, Genome Research Group; Anagawa 4-9-1, Inage, Chiba 263, Japan (E-mail:t_saito@nirs.go.jp, Tel:043-206-3135, Fax:043-251-9818) COMMENT Sequence update (08-Dec-1997) Sequence update (24-Dec-1997). FEATURES Location/Qualifiers source 1..1408 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1p21" gene 288..1280 /gene="EXTR2" CDS 288..1280 /gene="EXTR2" /codon_start=1 /db_xref="PID:d1024988" /db_xref="PID:g2723393" /translation="MRCCHICKLPGRVMGIRVLRLSLVVILVLLLVAGALTALLPSVK EDKMLMLRREIKSQGKSTMDSFTLIMQTYNRTDLLLKLLNHYQAVPNLHKVIVVWNNI GEKAPDELWNSLGPHPIPVIFKQQTANRMRNRLQVFPELETNAVLMVDDDTLISTPDL VFAFSVWQQFPDQIVGFVPRKHVSTSSGIYSYGSFEMQAPGSGNGDQYSMVLIGASFF NSKYLELFQRQPAAVHALIDDTQNCDDIAMNFIIAKHIGKTSGIFVKPVNMDNLEKET NSGYSGMWHRAEHALQRSYCINKLVNIYDSMPLRYSNIMISQFGFPYANYKRKI" BASE COUNT 444 a 280 c 279 g 405 t ORIGIN 1 cactttgcgg gcggcacttt ttccaggttg ttaatccagc taatggagaa ggatagatgc 61 acgctacttg gtttagaaaa aaaaacaaaa atgagcaaac gagacgcccc ttccgtttta 121 tgataactaa gctgcaggga aataaatcgg ctggccctac tgcaatctac tgcactcgag 181 aaacatcaca gaaaattctt tgatttatct taatagtgac aagtgagcct gcttctgtca 241 attactgaag ctataaggag attttttaaa aattaaactt caacacaatg aggtgttgcc 301 acatctgcaa acttcctggg agagtaatgg ggattcgagt gcttcgatta tctttggtgg 361 tcatcctcgt attattactg gtagctggtg ctttgactgc cttacttccc agtgttaaag 421 aagacaagat gctcatgttg cgtagggaaa taaaatccca gggcaagtcc accatggact 481 cctttactct cataatgcag acgtacaaca gaacagatct cttattgaaa cttttaaatc 541 attatcaggc tgtaccaaat ctgcacaaag tgattgtggt atggaacaat attggagaga 601 aggcaccaga tgaattatgg aattctctag ggccccaccc tatccctgtg atcttcaaac 661 aacagacagc aaacaggatg agaaatcgac tccaggtctt tcctgaactg gaaaccaatg 721 cagtgttgat ggtagatgat gacacactca tcagcacccc agaccttgtt tttgctttct 781 cagtttggca gcaatttcct gatcaaattg taggatttgt tcctagaaag cacgtctcta 841 cttcatcagg tatctacagt tatggaagtt ttgaaatgca agcaccaggg tctggaaatg 901 gtgaccagta ctctatggtg ctgattggag cctcattctt caatagcaaa tatcttgaat 961 tatttcagag gcaacctgca gctgtccatg ctttgataga tgatactcaa aactgtgatg 1021 atattgccat gaattttatc attgccaagc atattggcaa gacttcaggg atatttgtga 1081 agcctgtaaa catggacaat ttggaaaaag aaaccaacag tggctattct ggaatgtggc 1141 atcgagctga gcacgctctg cagaggtctt attgtataaa taagcttgtt aatatctatg 1201 atagcatgcc cttaagatac tccaacatta tgatttccca gtttggtttt ccatatgcca 1261 actacaaaag aaaaatataa aagtaaaaca aacaaaaaca aacctgaaaa ctgcttggca 1321 tttgagtagc ttctccatgc tatgtatttt tttaagcaac atcatgaatt ttatctactc 1381 cagaagtctc tacaatagaa aaaaaagt // LOCUS AB009303 2116 bp mRNA PRI 05-DEC-1997 DEFINITION Homo sapiens mRNA for membrane-type matrix metalloproteinase 3, complete cds. ACCESSION AB009303 D50477 NID g2662305 KEYWORDS membrane-type matrix metalloproteinase 3. SOURCE Homo sapiens (isolate:MMP-X2) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Takino,T., Sato,H., Shinagawa,A. and Seiki,M. TITLE Identification of the second membrane-type matrix metalloproteinase (MT-MMP-2) gene from a human placenta cDNA library. MT-MMPs form a unique membrane-type subclass in the MMP family JOURNAL J. Biol. Chem. 270 (39), 23013-23020 (1995) MEDLINE 96032735 REFERENCE 2 (bases 1 to 2116) AUTHORS Seiki,M. TITLE Direct Submission JOURNAL Submitted (02-DEC-1997) to the DDBJ/EMBL/GenBank databases. Motoharu Seiki, Institute of Medical Science, University of Tokyo, Department of Cancer Cell Research; 4-6-1 Shirokanedai, Minato-ku, Tokyo 108, Japan (E-mail:mseiki@ims.u-tokyo.ac.jp, Tel:81-3-5449-5255, Fax:81-3-5449-5414) COMMENT D50477: Submitted (06-May-1995). FEATURES Location/Qualifiers source 1..2116 /organism="Homo sapiens" /isolate="MMP-X2" /db_xref="taxon:9606" CDS 113..1936 /note="MT-MMP-3" /codon_start=1 /product="membrane-type matrix metalloproteinase 3" /db_xref="PID:d1024647" /db_xref="PID:g2662306" /translation="MILLTFSTGRRLDFVHHSGVFFLQTLLWILCATVCGTEQYFNVE VWLQKYGYLPPTDPRMSVLRSAETMQSALAAMQQFYGINMTGKVDRNTIDWMKKPRCG VPDQTRGSSKFHIRRKRYALTGQKWQHKHITYSIKNVTPKVGDPETRKAIRRAFDVWQ NVTPLTFEEVPYSELENGKRDVDITIIFASGFHGDSSPFDGEGGFLAHAYFPGPGIGG DTHFDSDEPWTLGNPNHDGNDLFLVAVHELGHALGLEHSNDPTAIMAPFYQYMETDNF KLPNDDLQGIQKIYGPPDKIPPPTRPLPTVPPHRSIPPADPRKNDRPKPPRPPTGRPS YPGAKPNICDGNFNTLAILRREMFVFKDQWFWRVRNNRVMDGYPMQITYFWRGLPPSI DAVYENSDGNFVFFKGNKYWVFKDTTLQPGYPHDLITLGSGIPPHGIDSAIWWEDVGK TYFFKGDRYWRYSEEMKTMDPGYPKPITVWKGIPESPQGAFVHKENGFTYFYKGKEYW KFNNQILKVEPGHPRSILKDFMGCDGPTDRVKEGHSPPDDVDIVIKLDNTASTVKAIA IVIPCILALCLLVLVYTVFQFKRKGTPRHILYCKRSMQEWV" BASE COUNT 600 a 475 c 503 g 538 t ORIGIN 1 ggctccttac ccacccggag actttttttt gaaaggaaac tagggaggga gggagaggga 61 gagagggaga aaacgaaggg gagctcgtcc atccattgaa gcacagttca ctatgatctt 121 actcacattc agcactggaa gacggttgga tttcgtgcat cattcggggg tgtttttctt 181 gcaaaccttg ctttggattt tatgtgctac agtctgcgga acggagcagt atttcaatgt 241 ggaggtttgg ttacaaaagt acggctacct tccaccgact gaccccagaa tgtcagtgct 301 gcgctctgca gagaccatgc agtctgccct agctgccatg cagcagttct atggcattaa 361 catgacagga aaagtggaca gaaacacaat tgactggatg aagaagcccc gatgcggtgt 421 acctgaccag acaagaggta gctccaaatt tcatattcgt cgaaagcgat atgcattgac 481 aggacagaaa tggcagcaca agcacatcac ttacagtata aagaacgtaa ctccaaaagt 541 aggagaccct gagactcgta aagctattcg ccgtgccttt gatgtgtggc agaatgtaac 601 tcctctgaca tttgaagaag ttccctacag tgaattagaa aatggcaaac gtgatgtgga 661 tataaccatt atttttgcat ctggtttcca tggggacagc tctccctttg atggagaggg 721 aggatttttg gcacatgcct acttccctgg accaggaatt ggaggagata cccattttga 781 ctcagatgag ccatggacac taggaaatcc taatcatgat ggaaatgact tatttcttgt 841 agcagtccat gaactgggac atgctctggg attggagcat tccaatgacc ccactgccat 901 catggctcca ttttaccagt acatggaaac agacaacttc aaactaccta atgatgattt 961 acagggcatc cagaaaatat atggtccacc tgacaagatt cctccaccta caagacctct 1021 accgacagtg cccccacacc gctctattcc tccggctgac ccaaggaaaa atgacaggcc 1081 aaaacctcct cggcctccaa ccggcagacc ctcctatccc ggagccaaac ccaacatctg 1141 tgatgggaac tttaacactc tagctattct tcgtcgtgag atgtttgttt tcaaggacca 1201 gtggttttgg cgagtgagaa acaacagggt gatggatgga tacccaatgc aaattactta 1261 cttctggcgg ggcttgcctc ctagtatcga tgcagtttat gaaaatagcg acgggaattt 1321 tgtgttcttt aaaggtaaca aatattgggt gttcaaggat acaactcttc aacctggtta 1381 ccctcatgac ttgataaccc ttggaagtgg aattccccct catggtattg attcagccat 1441 ttggtgggag gacgtcggga aaacctattt cttcaaggga gacagatatt ggagatatag 1501 tgaagaaatg aaaacaatgg accctggcta tcccaagcca atcacagtct ggaaagggat 1561 ccctgaatct cctcagggag catttgtaca caaagaaaat ggctttacgt atttctacaa 1621 aggaaaggag tattggaaat tcaacaacca gatactcaag gtagaacctg gacatccaag 1681 atccatcctc aaggatttta tgggctgtga tggaccaaca gacagagtta aagaaggaca 1741 cagcccacca gatgatgtag acattgtcat caaactggac aacacagcca gcactgtgaa 1801 agccatagct attgtcattc cctgcatctt ggccttatgc ctccttgtat tggtttacac 1861 tgtgttccag ttcaagagga aaggaacacc ccgccacata ctgtactgta aacgctctat 1921 gcaagagtgg gtgtgatgta gggttttttc ttctttcttt cttttgcagg agtttgtggt 1981 aacttgagat tcaagacaag agctgttatg ctgtttccta gctaggagca ggcttgtggc 2041 agcctgattc ggggctgacc tttcaaacca gagggttgct ggtcctgcac atgagtggaa 2101 atacactcat ggggaa // LOCUS AB010710 2463 bp mRNA PRI 03-FEB-1998 DEFINITION Homo sapiens mRNA for lectin-like oxidized LDL receptor, complete cds. ACCESSION AB010710 D89050 NID g2828355 KEYWORDS lectin-like oxidized LDL receptor; LOX-1. SOURCE Homo sapiens lung cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Sawamura,T., Kume,N., Aoyama,T., Moriwaki,H., Hoshikawa,H., Aiba,Y., Tanaka,T., Miwa,S., Katsura,Y., Kita,T. and Masaki,T. TITLE An endothelial receptor for oxidized low-density lipoprotein JOURNAL Nature 386 (6620), 73-77 (1997) MEDLINE 97205278 REFERENCE 2 (bases 1 to 2463) AUTHORS Sawamura,T. TITLE Direct Submission JOURNAL Submitted (22-JAN-1998) to the DDBJ/EMBL/GenBank databases. Tatsuya Sawamura, Kyoto University, Department of Pharmacology, Faculty of Medicine; Yoshidakonoe-cho, Sakyo-ku, Kyoto, Kyoto 606, Japan (E-mail:sawamura@mfour.med.kyoto-u.ac.jp, Tel:81-75-753-4393, Fax:81-75-753-4402) COMMENT D89050: Submitted (12-Nov-1996). FEATURES Location/Qualifiers source 1..2463 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="lung" CDS 62..883 /note="LOX-1" /codon_start=1 /product="lectin-like oxidized LDL receptor" /db_xref="PID:d1025500" /db_xref="PID:g1902984" /translation="MTFDDLKIQTVKDQPDEKSNGKKAKGLQFLYSPWWCLAAATLGV LCLGLVVTIMVLGMQLSQVSDLLTQEQANLTHQKKKLEGQISARQQAEEASQESENEL KEMIETLARKLNEKSKEQMELHHQNLNLQETLKRVANCSAPCPQDWIWHGENCYLFSS GSFNWEKSQEKCLSLDAKLLKINSTADLDFIQQAISYSSFPFWMGLSRRNPSYPWLWE DGSPLMPHLFRVRGAVSQTYPSGTCAYIQRGAVYAENCILAAFSICQKKANLRAQ" polyA_site 2463 /note="26 A nucleotides" BASE COUNT 734 a 518 c 467 g 744 t ORIGIN 1 atttttagtt tgttgaagtt cgtgactgct tcactctctc attcttagct tgaatttgga 61 aatgactttt gatgacctaa agatccagac tgtgaaggac cagcctgatg agaagtcaaa 121 tggaaaaaaa gctaaaggtc ttcagtttct ttactctcca tggtggtgcc tggctgctgc 181 gactctaggg gtcctttgcc tgggattagt agtgaccatt atggtgctgg gcatgcaatt 241 atcccaggtg tctgacctcc taacacaaga gcaagcaaac ctaactcacc agaaaaagaa 301 actggaggga cagatctcag cccggcaaca agcagaagaa gcttcacagg agtcagaaaa 361 cgaactcaag gaaatgatag aaacccttgc tcggaagctg aatgagaaat ccaaagagca 421 aatggaactt caccaccaga atctgaatct ccaagaaaca ctgaagagag tagcaaattg 481 ttcagctcct tgtccgcaag actggatctg gcatggagaa aactgttacc tattttcctc 541 gggctcattt aactgggaaa agagccaaga gaagtgcttg tctttggatg ccaagttgct 601 gaaaattaat agcacagctg atctggactt catccagcaa gcaatttcct attccagttt 661 tccattctgg atggggctgt ctcggaggaa ccccagctac ccatggctct gggaggacgg 721 ttctcctttg atgccccact tatttagagt ccgaggcgct gtctcccaga catacccttc 781 aggtacctgt gcatatatac aacgaggagc tgtttatgcg gaaaactgca ttttagctgc 841 cttcagtata tgtcagaaga aggcaaacct aagagcacag tgaatttgaa ggctctggaa 901 gaaaagaaaa aagtctttga gttttattct ggaatttaag ctattctttg tcacttgggt 961 gccaaacatg agagcccaga aaactgtcat ttagctggct gcagaactcc tttgcagaaa 1021 ctggggttcc aggtgcctgg cacctttatg tcaacatttt tgattctagc tatctgtatt 1081 atttcaccta gcttgtccca agcttccctg ccagcctgaa gtccattttc ccctttttat 1141 tttaaaattt gactcctctt caagcttgaa aaccctctga actcagtctt ctttacctca 1201 ttatcacctt cccctcacac tcctaaaatt gcatgaaaga cagaacatgg agaacttgct 1261 caagtgcagg cagagagcaa aaaggggaaa tatgtctggg aaaaagtgca cgtgaagaaa 1321 caaagaagga cagaggccat tccgaaatca agaaactcat gttcttaact ttaaaaaagg 1381 tatcaatcct tggtttttaa actgtggtcc atctccagac tctaccactt acggacagac 1441 agacagacag acacacacac acacacacac acacacattt tgggacaagt ggggagccca 1501 agaaagtaat tagtaagtga gtggtctttt ctgtaagcta atccacaacc tgttaccact 1561 tcctgaatca gttattattt cttcattttt ttttctacca gaggacagat taatagattt 1621 aacccttcac aacagttctt gttagaatca tgggatgtgt ggcccagagg taagaataga 1681 atttctttcc ctaaagaaca taccttttgt agatgaactc ttctcaactc tgttttgcta 1741 tgctataatt ccgaaacata caagacaaaa aaaatgaaga cactcaatct agaacaaact 1801 aagccaggta tgcaaatatc gctgaataga aacagatgga attagaaata tatcttctat 1861 ttttaggctt ctatttcctt tccacccact cttcacaggc tattctactc taaaggaagc 1921 ctttttattt tgctgcacac aatctagcag gaatcttttt ttttttttta agagctgtgt 1981 catccttatg taggcaagag atgtttgctt ttgttaaaag ctttattgag atataattaa 2041 cataaaataa actgaacata tttaaagtgt actatttgat aagttttcac accttgtgga 2101 gaacatgcat actacaatta agagagtgaa catatccatc atccctcaaa gtgtcacaat 2161 gctcctcctg atgactcctc cccagaaaac caccaatcgg ctttcatttt gcattttgta 2221 gttttatgtg aatggaatca tatagtatgt cttttttttt tgtctggctt ctttcacttt 2281 gcataattat tttgagattc atatgtctcc atcttgatgc tcgtatgaat tcattctttt 2341 aaatgttgaa tattcccttg tatggatata ccacaattca tttacccatt tacttgttga 2401 tgacatttgg gttgttttag ttttgggata ttacaaataa agctgctgtg aacatttgtg 2461 tac // LOCUS AC002125 39471 bp DNA PRI 15-AUG-1997 DEFINITION Homo sapiens DNA from chromosome 19-cosmid F25965, genomic sequence, complete sequence. ACCESSION AC002125 NID g2642412 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 39471) AUTHORS Lamerdin,J.E., McCready,P.M., Adamson,A.W., Burkhart-Schultz,K., Kyle,A., Ramirez,M., Stilwagen,S., Garnes,J., Danganan,L., Bruce,R., Quan,G., Montgomery,M., Ow,D., Kobayashi,A., Olsen,A.O. and Carrano,A.V. TITLE Sequence analysis of a 1 Mb region in human 19q13.1 JOURNAL Unpublished REFERENCE 2 (bases 1 to 39471) AUTHORS Lamerdin,J.E., McCready,P.M., Adamson,A.W., Burkhart-Schultz,K., Kyle,A., Ramirez,M., Stilwagen,S., Garnes,J., Danganan,L., Bruce,R., Quan,G., Montgomery,M., Ow,D., Kobayashi,A., Olsen,A.O. and Carrano,A.V. TITLE Direct Submission JOURNAL Submitted (15-AUG-1997) Human Genome Center, Biology and Biotechnology Research Program, Lawrence Livermore National Laboratory, 7000 East Ave, Livermore, CA 94550, USA FEATURES Location/Qualifiers source 1..39471 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="F25965" /cell_line="UV5HL9-5B" /cell_type="fibroblast" /dev_stage="adult" /clone_lib="LL19NC02 F2 chromosome 19-specific cosmid library" /map="19q13.1 between D19S208 and CAPNS ; Is adjacent to cosmid F24109 (AD000671)to the left (gap <3.7 kb) and cosmid F19541 (U95090) to the right (gap est. 4-6kb); oriented centromere to telomere ;" /note="cosmid library constructed at LLNL from flow-sorted chromosomes from hybrid UV5HL9-5B, which carries chromosome 19 as its only human chromosome." CDS join(<642..782,871..944,1018..1099) /note="hypothetical proline-rich protein (NH2-truncated)" /codon_start=1 /product="F25965_1" /db_xref="PID:g2642413" /translation="GSEVTNSKSRDVYKLPPPTPPGPPGDACRSRIPSPLQPEMQGTP DDEPSEPEPSPSTLIYRNMQRWKRIRQRWKEASHRNQLRYSESMKILREMYERQ" misc_feature 1195..1437 /note="BLASTN similarity to AA010888 (1..243); match: 0.92, score: 7.6e-80; database searched: month.na; ze23a02.s1 Soares fetal heart NbHH19W Homo sapiens cDNA clone 359786 3' similar to contains element MSR1 repetitive element" misc_feature complement(1195..1608) /note="BLASTN similarity to F19408 (1..414); match: 0.98, score: 4.0e-159; database searched: est; H.sapiens EST sequence (011-X3-34) from skeletal muscle." misc_feature 1196..1547 /note="BLASTN similarity to H28558 (2..353); match: 0.98, score: 5.3e-134; database searched: est; yl59f12.s1 Homo sapiens cDNA clone 162575 3'." misc_feature complement(1202..1557) /note="BLASTN similarity to F18964 (18..373); match: 0.98, score: 3.6e-140; database searched: est; H.sapiens EST sequence (013-X1-25) from skeletal muscle." misc_feature 1203..1618 /note="BLASTN similarity to H27651 (9..424); match: 0.96, score: 5.6e-154; database searched: est; yl57a09.s1 Homo sapiens cDNA clone 162328 3'." misc_feature 1217..1592 /note="BLASTN similarity to R75610 (1..376); match: 0.99, score: 2.3e-149; database searched: est; yl21g01.s1 Homo sapiens cDNA clone 158928 3'." misc_feature 1217..1497 /note="BLASTN similarity to H51409 (1..281); match: 0.97, score: 2.2e-141; database searched: est; yo31b04.s1 Homo sapiens cDNA clone 179503 3'." misc_feature complement(1308..1350) /note="BLASTN similarity to W75962 (359..401); match: 0.95, score: 1.2e-141; database searched: est; zd59d07.r1 Soares fetal heart NbHH19W Homo sapiens cDNA clone 344941 5'" misc_feature complement(1357..1417) /note="BLASTN similarity to W75962 (289..349); match: 0.73, score: 1.2e-141; database searched: est; zd59d07.r1 Soares fetal heart NbHH19W Homo sapiens cDNA clone 344941 5'" misc_feature complement(1371..1704) /note="BLASTN similarity to W75962 (1..334); match: 0.96, score: 1.2e-141; database searched: est; zd59d07.r1 Soares fetal heart NbHH19W Homo sapiens cDNA clone 344941 5'" misc_feature complement(1456..1586) /note="BLASTN similarity to C03913 (94..224); match: 0.96, score: 2.5e-55; database searched: month.na; Human Heart cDNA, clone 3NHC2410." misc_feature 1496..1598 /note="BLASTN similarity to H51409 (281..383); match: 0.94, score: 2.2e-141; database searched: est; yo31b04.s1 Homo sapiens cDNA clone 179503 3'." misc_feature complement(1508..1531) /note="BLASTN similarity to H27650 (347..370); match: 0.87, score: 2.2e-115; database searched: est; yl57a09.r1 Homo sapiens cDNA clone 162328 5'." misc_feature complement(1520..1584) /note="BLASTN similarity to H27650 (293..357); match: 0.92, score: 6.7e-131; database searched: est; yl57a09.r1 Homo sapiens cDNA clone 162328 5'." misc_feature complement(1553..1575) /note="BLASTN similarity to F18964 (1..23); match: 0.95, score: 3.6e-140; database searched: est; H.sapiens EST sequence (013-X1-25) from skeletal muscle." misc_feature complement(1580..1874) /note="BLASTN similarity to H27650 (1..295); match: 0.99, score: 6.7e-131; database searched: est; yl57a09.r1 Homo sapiens cDNA clone 162328 5'." misc_feature 1601..1625 /note="BLASTN similarity to R75610 (387..411); match: 0.88, score: 2.3e-149; database searched: est; yl21g01.s1 Homo sapiens cDNA clone 158928 3'." misc_feature complement(1607..1667) /note="BLASTN similarity to C03913 (11..71); match: 0.81, score: 2.5e-55; database searched: month.na; Human Heart cDNA, clone 3NHC2410." misc_feature complement(1610..1726) /note="BLASTN similarity to AA010737 (294..410); match: 0.88, score: 1.4e-153; database searched: est; ze23a02.r1 Soares fetal heart NbHH19W Homo sapiens cDNA clone 359786 5'" misc_feature 1618..1647 /note="BLASTN similarity to H51409 (405..434); match: 0.83, score: 2.2e-141; database searched: est; yo31b04.s1 Homo sapiens cDNA clone 179503 3'." misc_feature complement(1705..2018) /note="BLASTN similarity to AA010737 (1..314); match: 0.99, score: 1.4e-153; database searched: est; ze23a02.r1 Soares fetal heart NbHH19W Homo sapiens cDNA clone 359786 5'" misc_feature complement(1706..2053) /note="BLASTN similarity to H29935 (1..348); match: 0.98, score: 1.4e-131; database searched: est; yn81f06.r1 Homo sapiens cDNA clone 174851 5'." CDS complement(join(2147..2308,2397..2519,3436..3633)) /note="hypothetical 17.2 kDa protein similar to rat alpha crystallin B chain; identical to Homo sapiens p20 protein (PIR B53814)" /codon_start=1 /product="F25965_2" /db_xref="PID:g2642416" /translation="MEIPVPVQPSWLRRASAPLPGLSAPGRLFDQRFGEGLLEAELAA LCPTTLAPYYLRAPSVALPVAQVPTDPGHFSVLLDVKHFSPEEIAVKVVGEHVEVHAR HEERPDEHGFVAREFHRRYRLPPGVDPAAVTSALSPEGVLSIQAAPASAQAPPPAAAK " misc_feature complement(2205..2250) /note="BLASTN similarity to W47487 (383..428); match: 1, score: 1.1e-143; database searched: est; zc35f01.r1 Soares senescent fibroblasts NbHSF Homo sapiens cDNA clone 324313 5' similar to PIR:B53814 B53814 p20 protein -" misc_feature complement(2248..2276) /note="BLASTN similarity to W47487 (356..384); match: 0.96, score: 1.1e-143; database searched: est; zc35f01.r1 Soares senescent fibroblasts NbHSF Homo sapiens cDNA clone 324313 5' similar to PIR:B53814 B53814 p20 protein -" misc_feature complement(2275..2314) /note="BLASTN similarity to W47487 (317..356); match: 0.92, score: 1.1e-143; database searched: est; zc35f01.r1 Soares senescent fibroblasts NbHSF Homo sapiens cDNA clone 324313 5' similar to PIR:B53814 B53814 p20 protein -" misc_feature complement(2396..2459) /note="BLASTN similarity to W47487 (259..322); match: 0.95, score: 1.1e-143; database searched: est; zc35f01.r1 Soares senescent fibroblasts NbHSF Homo sapiens cDNA clone 324313 5' similar to PIR:B53814 B53814 p20 protein -" misc_feature complement(2455..2526) /note="BLASTN similarity to W47487 (193..264); match: 0.95, score: 1.1e-143; database searched: est; zc35f01.r1 Soares senescent fibroblasts NbHSF Homo sapiens cDNA clone 324313 5' similar to PIR:B53814 B53814 p20 protein -" misc_feature complement(3431..3516) /note="BLASTN similarity to W47487 (119..204); match: 0.98, score: 1.1e-143; database searched: est; zc35f01.r1 Soares senescent fibroblasts NbHSF Homo sapiens cDNA clone 324313 5' similar to PIR:B53814 B53814 p20 protein -" misc_feature complement(3515..3552) /note="BLASTN similarity to W47487 (84..121); match: 1, score: 1.1e-143; database searched: est; zc35f01.r1 Soares senescent fibroblasts NbHSF Homo sapiens cDNA clone 324313 5' similar to PIR:B53814 B53814 p20 protein -" misc_feature complement(3553..3597) /note="BLASTN similarity to W47487 (40..84); match: 0.97, score: 1.1e-143; database searched: est; zc35f01.r1 Soares senescent fibroblasts NbHSF Homo sapiens cDNA clone 324313 5' similar to PIR:B53814 B53814 p20 protein -" misc_feature complement(3592..3638) /note="BLASTN similarity to W47487 (1..47); match: 0.91, score: 1.1e-143; database searched: est; zc35f01.r1 Soares senescent fibroblasts NbHSF Homo sapiens cDNA clone 324313 5' similar to PIR:B53814 B53814 p20 protein -" repeat_region complement(5301..5595) /rpt_family="ALU" misc_feature 6561..6616 /standard_name="putative exon" /note="Xgrail 1.3c, quality= good" repeat_region 6744..6826 /rpt_family="MIR" repeat_region complement(6869..7048) /rpt_family="ALU" repeat_region complement(7117..7400) /rpt_family="ALU" repeat_region 7424..7507 /rpt_family="MIR" repeat_region 7900..8490 /rpt_family="ALU" repeat_region 9063..9669 /rpt_family="ALU" repeat_region complement(9835..10430) /rpt_family="ALU" repeat_region complement(10434..10709) /rpt_family="ALU" repeat_region 11002..11298 /rpt_family="ALU" misc_feature 11479..11561 /standard_name="putative exon" /note="Xgrail 1.3c, quality= good" misc_feature 11659..11801 /standard_name="putative exon" /note="Xgrail 1.3c, quality= good" repeat_region 11832..11922 /rpt_family="MIR" repeat_region 11947..12209 /rpt_family="ALU" repeat_region complement(12266..12426) /rpt_family="ALU" repeat_region 12902..13141 /rpt_family="ALU" misc_feature 13471..13580 /standard_name="putative exon" /note="Xgrail 1.3c, quality= good" misc_feature 14088..14587 /standard_name="putative exon" /note="Xgrail 1.3c, quality= good" misc_feature 14781..14862 /standard_name="putative exon" /note="Xgrail 1.3c, quality= excellent" misc_feature complement(15648..15691) /note="BLASTN similarity to T19597 (183..226); match: 0.93, score: 1.2e-79; database searched: est; 719F Homo sapiens cDNA clone 719." misc_feature complement(15683..15872) /note="BLASTN similarity to T19597 (1..190); match: 0.98, score: 1.2e-79; database searched: est; 719F Homo sapiens cDNA clone 719." repeat_region complement(15933..16222) /rpt_family="ALU" repeat_region complement(16363..16955) /rpt_family="ALU" repeat_region 17232..17525 /rpt_family="ALU" repeat_region 17654..17961 /rpt_family="ALU" repeat_region 18180..18466 /rpt_family="ALU" repeat_region complement(18619..18926) /rpt_family="ALU" repeat_region 18991..19269 /rpt_family="ALU" repeat_unit 19984..20065 /rpt_family="L1" repeat_region complement(20293..20581) /rpt_family="ALU" repeat_region complement(20603..20894) /rpt_family="ALU" misc_feature 21312..21412 /standard_name="putative exon" /note="Xgrail 1.3c, quality= good" repeat_region 21545..21821 /rpt_family="ALU" CDS join(24211..24368,24450..24534,24906..24986,25090..25227, 25658..25796,26478..26577,27047..27068) /note="hypothetical 26.3 kDa protein most similar to orf 4 from C. elegans cosmid F47A4 (Z49888); Gene prediction accomplished using Xgrail 1.3c coupled with local Blast comparisons to Genbank, non-redundant protein libraries, and dbEST" /codon_start=1 /product="F25965_4" /db_xref="PID:g2642414" /translation="MLSLSLCSHLWGPLILSALQARSTDSLDGPGEGSVQPLPTAGGP SVKGKPGKRLSAPRGPFPRLADCAHFHYENVDFGHIQLLLSPDREGPSLSGENELVFG VQVTCQGRSWPVLRSYDDFRSLDAHLHRCIFDRRFSCLPELPPPPEGARAAQMLVPLL LQYLETLSGLVDSNLNCGPVLTWMEVGLGRGLGDSEWVRGCVCHHAQHREILDGNRVA SAVEDEGAEVDGEAFRWETLSR" repeat_region complement(26058..26495) /rpt_family="ALU" misc_feature 27045..27108 /note="BLASTN similarity to N62845 (1..64); match: 0.98, score: 1.4e-125; database searched: est; yz83b02.s1 Homo sapiens cDNA clone 289611 3'." misc_feature 27105..27126 /note="BLASTN similarity to N62845 (62..83); match: 0.95, score: 1.4e-125; database searched: est; yz83b02.s1 Homo sapiens cDNA clone 289611 3'." misc_feature 27205..27280 /note="BLASTN similarity to N62845 (79..154); match: 0.9, score: 1.4e-125; database searched: est; yz83b02.s1 Homo sapiens cDNA clone 289611 3'." misc_feature 27390..27444 /note="BLASTN similarity to N62845 (154..208); match: 0.94, score: 1.4e-125; database searched: est; yz83b02.s1 Homo sapiens cDNA clone 289611 3'." misc_feature 27540..27698 /note="BLASTN similarity to N62845 (201..359); match: 0.96, score: 1.4e-125; database searched: est; yz83b02.s1 Homo sapiens cDNA clone 289611 3'." CDS join(27696..27698,27784..27878,28999..29143,29223..29305, 29388..29495,30798..30943,31579..31736) /note="hypothetical 35.3 kDa protein similar to GTPase-activating proteins and orf3 from C. elegans cosmid F47A4 (Z49888); Gene prediction accomplished using Xgrail 1.3c coupled with local Blast comparisons to Genbank, non-redundant protein libraries, and dbEST" /codon_start=1 /product="F25965_5" /db_xref="PID:g2642415" /translation="MVLRCCSEFIEAHGVVDGIYRLSGVSSNIQRLRHEFDSERIPEL SGPAFLQDIHSVSSLCKLYFRELPNPLLTYQLYGKFSEAMSVPGEEERLVRVHDVIQQ LPPPHYRTLEYLLRHLARMARHSANTSMHARNLAIVWAPNLLRSMELESVGMGGAAAF REVRVQSVVVEFLLTHVDVLFSDTFTSAGLDPAGRCLLPRPKSLAGSCPSTRLLTLEE AQARTQGRLGTPTEPTTPKAPASPAER" misc_feature 27775..27792 /note="BLASTN similarity to N62845 (359..376); match: 1, score: 1.4e-125; database searched: est; yz83b02.s1 Homo sapiens cDNA clone 289611 3'." repeat_unit complement(28059..28140) /rpt_family="MIR" misc_feature 29045..29156 /note="BLASTN similarity to W11173 (2..113); match: 0.85, score: 7.8e-86; database searched: est; ma74f04.r1 Soares mouse p3NMF19.5 Mus musculus cDNA clone 316447 5' similar to PIR:B53764 B53764 beta2-chimaerin, cerebellar" misc_feature 29051..29140 /note="BLASTX similarity to (174..203); match: 0.53, score: 2.6e-13; database searched: nr; protein kinase C homolog (song control circuit) HAT-2 - Canaries >gi|249118 (S98891) HAT-2=protein kinase C homolog {song control" misc_feature 29051..29140 /note="BLASTX similarity to P30337 (209..238); match: 0.53, score: 9.7e-14; database searched: nr; N-CHIMAERIN (NC) (ALPHA-CHIMERIN). pir||S29128 N-chimaerin - rat >gi|55940 (X67250) n-chimaerin [Rattus norvegicus]" misc_feature complement(29099..29136) /note="BLASTN similarity to N77752 (394..431); match: 0.92, score: 1.1e-134; database searched: est; yz83b02.r1 Homo sapiens cDNA clone 289611 5' similar to SW:CHIO_RAT Q03070 BETA-CHIMAERIN. [2] PIR:S29956" misc_feature 29211..29300 /note="BLASTX similarity to (201..230); match: 0.3, score: 2.6e-13; database searched: nr; protein kinase C homolog (song control circuit) HAT-2 - Canaries >gi|249118 (S98891) HAT-2=protein kinase C homolog {song control" misc_feature 29211..29300 /note="BLASTX similarity to P30337 (236..265); match: 0.3, score: 9.7e-14; database searched: nr; N-CHIMAERIN (NC) (ALPHA-CHIMERIN). pir||S29128 N-chimaerin - rat >gi|55940 (X67250) n-chimaerin [Rattus norvegicus]" misc_feature complement(29223..29259) /note="BLASTN similarity to N77752 (349..385); match: 0.97, score: 1.1e-134; database searched: est; yz83b02.r1 Homo sapiens cDNA clone 289611 5' similar to SW:CHIO_RAT Q03070 BETA-CHIMAERIN. [2] PIR:S29956" misc_feature 29223..29306 /note="BLASTN similarity to W11173 (101..184); match: 0.86, score: 7.8e-86; database searched: est; ma74f04.r1 Soares mouse p3NMF19.5 Mus musculus cDNA clone 316447 5' similar to PIR:B53764 B53764 beta2-chimaerin, cerebellar" misc_feature complement(29257..29306) /note="BLASTN similarity to N77752 (301..350); match: 1, score: 1.1e-134; database searched: est; yz83b02.r1 Homo sapiens cDNA clone 289611 5' similar to SW:CHIO_RAT Q03070 BETA-CHIMAERIN. [2] PIR:S29956" misc_feature 29274..29306 /note="BLASTN similarity to AA000922 (1..33); match: 0.93, score: 6.0e-128; database searched: est; mg26d07.r1 Soares mouse embryo NbME13.5 14.5 Mus musculus cDNA clone 424909 5' similar to SW:CHIN_HUMAN P15882 N-CHIMAERIN" misc_feature 29372..29491 /note="BLASTN similarity to W11173 (168..287); match: 0.91, score: 7.8e-86; database searched: est; ma74f04.r1 Soares mouse p3NMF19.5 Mus musculus cDNA clone 316447 5' similar to PIR:B53764 B53764 beta2-chimaerin, cerebellar" misc_feature 29372..29497 /note="BLASTN similarity to AA000922 (17..142); match: 0.92, score: 6.0e-128; database searched: est; mg26d07.r1 Soares mouse embryo NbME13.5 14.5 Mus musculus cDNA clone 424909 5' similar to SW:CHIN_HUMAN P15882 N-CHIMAERIN" misc_feature complement(29385..29497) /note="BLASTN similarity to N77752 (192..304); match: 0.96, score: 1.1e-134; database searched: est; yz83b02.r1 Homo sapiens cDNA clone 289611 5' similar to SW:CHIO_RAT Q03070 BETA-CHIMAERIN. [2] PIR:S29956" misc_feature 29389..29496 /note="BLASTX similarity to (233..268); match: 0.47, score: 2.6e-13; database searched: nr; protein kinase C homolog (song control circuit) HAT-2 - Canaries >gi|249118 (S98891) HAT-2=protein kinase C homolog {song control" misc_feature 29389..29496 /note="BLASTX similarity to P30337 (268..303); match: 0.47, score: 9.7e-14; database searched: nr; N-CHIMAERIN (NC) (ALPHA-CHIMERIN). pir||S29128 N-chimaerin - rat >gi|55940 (X67250) n-chimaerin [Rattus norvegicus]" misc_feature complement(30797..30830) /note="BLASTN similarity to N77752 (161..194); match: 1, score: 1.1e-134; database searched: est; yz83b02.r1 Homo sapiens cDNA clone 289611 5' similar to SW:CHIO_RAT Q03070 BETA-CHIMAERIN. [2] PIR:S29956" misc_feature 30797..30945 /note="BLASTN similarity to AA000922 (140..288); match: 0.85, score: 6.0e-128; database searched: est; mg26d07.r1 Soares mouse embryo NbME13.5 14.5 Mus musculus cDNA clone 424909 5' similar to SW:CHIN_HUMAN P15882 N-CHIMAERIN" misc_feature complement(30820..30945) /note="BLASTN similarity to N77752 (47..172); match: 0.94, score: 1.1e-134; database searched: est; yz83b02.r1 Homo sapiens cDNA clone 289611 5' similar to SW:CHIO_RAT Q03070 BETA-CHIMAERIN. [2] PIR:S29956" misc_feature 30835..30906 /note="BLASTX similarity to (276..299); match: 0.45, score: 2.6e-13; database searched: nr; protein kinase C homolog (song control circuit) HAT-2 - Canaries >gi|249118 (S98891) HAT-2=protein kinase C homolog {song control" misc_feature 30835..30906 /note="BLASTX similarity to P30337 (311..334); match: 0.45, score: 9.7e-14; database searched: nr; N-CHIMAERIN (NC) (ALPHA-CHIMERIN). pir||S29128 N-chimaerin - rat >gi|55940 (X67250) n-chimaerin [Rattus norvegicus]" misc_feature complement(31133..31163) /note="BLASTN similarity to N77752 (5..35); match: 1, score: 1.1e-134; database searched: est; yz83b02.r1 Homo sapiens cDNA clone 289611 5' similar to SW:CHIO_RAT Q03070 BETA-CHIMAERIN. [2] PIR:S29956" misc_feature 31570..31735 /note="BLASTN similarity to AA000922 (278..443); match: 0.81, score: 6.0e-128; database searched: est; mg26d07.r1 Soares mouse embryo NbME13.5 14.5 Mus musculus cDNA clone 424909 5' similar to SW:CHIN_HUMAN P15882 N-CHIMAERIN" misc_feature 31815..31960 /standard_name="putative exon" /note="Xgrail 1.3c, quality= good" misc_feature 31994..32109 /standard_name="putative exon" /note="Xgrail 1.3c, quality= good" misc_feature complement(32033..32098) /note="BLASTN similarity to T08446 (193..258); match: 0.96, score: 8.9e-54; database searched: est; EST06337 Homo sapiens cDNA clone HIBBE34 5' end." misc_feature complement(33036..33140) /note="BLASTN similarity to T08446 (81..185); match: 0.97, score: 8.9e-54; database searched: est; EST06337 Homo sapiens cDNA clone HIBBE34 5' end." misc_feature 33047..33698 /standard_name="putative exon" /note="Xgrail 1.3c, quality= good" misc_feature 33881..34005 /standard_name="putative exon" /note="Xgrail 1.3c, quality= good" misc_feature complement(34015..34042) /note="BLASTN similarity to Z42222 (123..150); match: 0.82, score: 2.6e-33; database searched: est; H. sapiens partial cDNA sequence" misc_feature 34015..34043 /note="BLASTN similarity to AA089134 (4..32); match: 0.75, score: 3.6e-92; database searched: est; mo21b02.r1 Life Tech mouse embryo 13 5dpc 10666014 Mus musculus cDNA clone 554187 5'" misc_feature complement(34018..34151) /note="BLASTN similarity to Z42222 (1..134); match: 0.96, score: 1.8e-91; database searched: est; H. sapiens partial cDNA sequence" misc_feature complement(34143..34187) /note="BLASTN similarity to Z42222 (125..169); match: 1, score: 1.8e-91; database searched: est; H. sapiens partial cDNA sequence" misc_feature 34143..34182 /note="BLASTN similarity to AA089134 (6..45); match: 0.77, score: 3.3e-95; database searched: est; mo21b02.r1 Life Tech mouse embryo 13 5dpc 10666014 Mus musculus cDNA clone 554187 5'" misc_feature complement(34144..34168) /note="BLASTN similarity to Z42222 (1..25); match: 0.84, score: 8.1e-33; database searched: est; H. sapiens partial cDNA sequence" misc_feature 34175..34300 /note="BLASTN similarity to AA089134 (37..162); match: 0.81, score: 3.3e-95; database searched: est; mo21b02.r1 Life Tech mouse embryo 13 5dpc 10666014 Mus musculus cDNA clone 554187 5'" misc_feature complement(34258..34363) /note="BLASTN similarity to Z42222 (156..261); match: 0.93, score: 1.8e-91; database searched: est; H. sapiens partial cDNA sequence" misc_feature 34287..34478 /note="BLASTN similarity to AA089134 (146..337); match: 0.81, score: 3.3e-95; database searched: est; mo21b02.r1 Life Tech mouse embryo 13 5dpc 10666014 Mus musculus cDNA clone 554187 5'" misc_feature 34377..34587 /note="BLASTN similarity to D61580 (4..214); match: 0.98, score: 7.2e-114; database searched: est; Human fetal brain cDNA 5'-end GEN-420D11." misc_feature 34389..34532 /note="BLASTN similarity to R12216 (1..144); match: 1, score: 6.3e-88; database searched: est; yf52b11.r1 Homo sapiens cDNA clone 25797 5'." misc_feature 34460..34530 /note="BLASTN similarity to AA089134 (318..388); match: 0.8, score: 3.3e-95; database searched: est; mo21b02.r1 Life Tech mouse embryo 13 5dpc 10666014 Mus musculus cDNA clone 554187 5'" misc_feature 34523..34631 /note="BLASTN similarity to R12216 (134..242); match: 0.82, score: 6.3e-88; database searched: est; yf52b11.r1 Homo sapiens cDNA clone 25797 5'." misc_feature 34565..34643 /note="BLASTN similarity to D61580 (194..272); match: 0.84, score: 7.2e-114; database searched: est; Human fetal brain cDNA 5'-end GEN-420D11." misc_feature 34604..34657 /note="BLASTN similarity to R12216 (216..269); match: 0.9, score: 4.9e-65; database searched: est; yf52b11.r1 Homo sapiens cDNA clone 25797 5'." misc_feature 34605..34682 /note="BLASTN similarity to D61580 (235..312); match: 0.73, score: 9.4e-107; database searched: est; Human fetal brain cDNA 5'-end GEN-420D11." misc_feature 34613..34700 /note="BLASTN similarity to R12216 (226..313); match: 0.67, score: 6.3e-88; database searched: est; yf52b11.r1 Homo sapiens cDNA clone 25797 5'." misc_feature 34680..34735 /note="BLASTN similarity to D61580 (311..366); match: 1, score: 7.2e-114; database searched: est; Human fetal brain cDNA 5'-end GEN-420D11." misc_feature 34925..35028 /note="BLASTN similarity to D81268 (1..104); match: 0.96, score: 8.3e-41; database searched: est; Human fetal brain cDNA 5'-end GEN-142F07." misc_feature 35065..35094 /note="BLASTN similarity to D81268 (144..173); match: 0.8, score: 8.3e-41; database searched: est; Human fetal brain cDNA 5'-end GEN-142F07." misc_feature 35100..35136 /note="BLASTN similarity to D81268 (181..217); match: 0.81, score: 8.3e-41; database searched: est; Human fetal brain cDNA 5'-end GEN-142F07." misc_feature complement(35154..35203) /note="BLASTN similarity to R39938 (263..312); match: 0.98, score: 3.9e-73; database searched: est; yf52b11.s1 Homo sapiens cDNA clone 25797 3'." misc_feature complement(35191..35274) /note="BLASTN similarity to R39938 (191..274); match: 0.8, score: 3.8e-69; database searched: est; yf52b11.s1 Homo sapiens cDNA clone 25797 3'." misc_feature complement(35203..35448) /note="BLASTN similarity to Z38468 (1..246); match: 0.99, score: 1.0e-92; database searched: est; H. sapiens partial cDNA sequence" misc_feature complement(35246..35319) /note="BLASTN similarity to R39938 (145..218); match: 0.93, score: 3.9e-73; database searched: est; yf52b11.s1 Homo sapiens cDNA clone 25797 3'." misc_feature complement(35318..35347) /note="BLASTN similarity to R39938 (116..145); match: 0.96, score: 3.9e-73; database searched: est; yf52b11.s1 Homo sapiens cDNA clone 25797 3'." misc_feature complement(35351..35426) /note="BLASTN similarity to R39938 (35..110); match: 0.94, score: 3.9e-73; database searched: est; yf52b11.s1 Homo sapiens cDNA clone 25797 3'." misc_feature complement(35423..35448) /note="BLASTN similarity to R39938 (12..37); match: 1, score: 3.9e-73; database searched: est; yf52b11.s1 Homo sapiens cDNA clone 25797 3'." misc_feature complement(36936..37140) /standard_name="putative exon" /note="Xgrail 1.3c, quality= good" misc_feature complement(37213..37370) /standard_name="putative exon" /note="Xgrail 1.3c, quality= good" BASE COUNT 8228 a 11501 c 11272 g 8470 t ORIGIN 1 gatcaggcta ggcacacaga tgtgcactct taggttgggg tgtggatcag agtccctagg 61 gaagacatac gtaaccacct ggcctgggag acagggaaca gggttaccct ggacagttag 121 atgtggacct ccaagatcag gggttcccaa gagccgaggg gctagacaag ggtccttagt 181 acagacagaa gtggccacca ggccttgagg agcaggggct gggcaggaat ggatgggagt 241 ctataagcca tctgacccag cggggtagac aagggtcccc aggacagaca tataggaccc 301 ttcagacata taggactgtc agaaatggac agagcttcct aggtggcaca gacgtgaccc 361 cctagacagg attccctgag gcaggcagac gcgactccct gggctgacag gtgtagatgg 421 agtctctggg ctggatagga gaagacaggg gtccctgggg caggcacact tgacccgctg 481 gctggttgag gtgtctgccc ttcatacatg cctcctgtct gagtccctta gcgagcccct 541 catgccattt gttgctggga aaggcaggag ggatgaatgt ggattcccag cccctctgct 601 ttgactctct tgggcctggc atggggccct tgtgtcccca gggctcagag gtaaccaaca 661 gcaagagtcg tgatgtgtac aagctgccgc cacccacacc cccggggcca cccggagatg 721 cctgcagatc ccgcatccca tctccactgc agcctgagat gcagggcacc cctgacgatg 781 aggtgagtat gccaggctgg cccagggtta tggggcacat gcggtagggc tcaggaatgg 841 ccactcaacc tggcctgtct gtccatgcag ccctctgagc ccgagccctc accctccaca 901 ctcatctatc gcaacatgca gcgctggaaa cgcatccgcc agaggtgagc gtcccccagc 961 ctgcttgccc ctcgtagggc cctttgcaac tgctgagtct cccctctgcc tccccaggtg 1021 gaaggaggcc tctcatcgga accagcttcg ttactcagaa agcatgaaga tcctacgaga 1081 gatgtacgaa cgacagtgat gttcccaggt ccccccacac cagtaaacat cccccagctc 1141 cacactggtg tctgctccgg tccctccttc ctggcccagg cacaaagcag ttggcagagc 1201 tctagcacat ttattgggag agtaagcctg ggaaagacta agggagtggt ggcagggaga 1261 aaggctgtgg ggaatcagag cgggtgctca gttgggtctt gaaggagaag aggaggaggg 1321 tgggaggtgg gttgccgagg atatctggtt gaagacttgg gggtcaagac aaagggactt 1381 agggggatgg ggtctggtta gagttgggga gggggcctag gacatccgtg cagagtctgg 1441 ggaggttggg gtgggagagt ctgtacagtt tggtgttggg tgttctagtt ggcctggtgt 1501 ccaagagttg gggcagtcga aaaagggttc cagagtctgg tgtggctggc tggggtttca 1561 cggcagaaaa tgggctggag ggggcagttg tagactgtct ggttgcaggg gaaggatcgg 1621 gtcttgggaa ccaggcctgg atggtctgga gtgggaggtc tgtgcagtcc aggacgtttg 1681 gggggtgggg ggattgtgcc tgactggcct ggggttgggg gagggctgtc tacaccataa 1741 tttggtgtca aaatagggaa gggatggaaa tgtagtaccc ggatgtcttt gtagaggact 1801 cagaaggaag tagaggaggg gtcttaagtg gggctatatg ccaagaaagt gggtcggtgg 1861 ggctgagact gtcggctgag ggttagggta gtgctggtag ggtctggaag ccggggagtt 1921 gtcggtgtgg ggtcggaaag ctggaggggg tgtgagagcg agggtgtcag tggaagggtc 1981 tatgttatca aggcagtgtc caggatggag gtcagggcag aatccaggag tgggtgagag 2041 gacagtcctt ggcgcactcg ggacatctgg ctgggcggag tcagatcggc tttaatagag 2101 ggagcctgag gaggctcccg gggtgcgggc gcggcccagc cccctcctac ttggctgcgg 2161 ctggcggtgg ggcctgggcc gacgctggtg cggcctggat ggacaggacg ccctcggggg 2221 acagcgcgga cgtcacggca gccggatcca cgccaggcgg caggcggtag cgacggtgga 2281 actcgcgcgc gacgaatccg tgctcatcct ggaggggagg gaggcttgag cggccccgcc 2341 cctccggccc ggcggcgagc gtacccgctc cagccccgcc ccacgccgcg gctcaccggg 2401 cgctcctcgt ggcgcgcgtg cacctccacg tgttcgccca ccaccttgac agcaatttcc 2461 tccggcgaga agtgcttcac gtctagcagc accgaaaagt ggccggggtc cgtcggcacc 2521 tgagcgcagc gggcgcgggc gggactgtca ttgggctggg ccaggctcca ggacccaccc 2581 agggagaccc cacccccgtc ggttccgtgc cgcggctcac tctggcctcc ggagttgagc 2641 agtcagctct cctactggga gtcacccagg acctcttcac cctgaacaac agggatacgc 2701 cctccccctg taatatctgg gattgccctc tcccccgcag tgatgaggct cccttcccct 2761 gcaaggtcag gaacctctgc cctcccccgg tgtcagcgaa tccgaagtca tcgtgccccc 2821 cagggcactg attccccagc tccctcctca gagccccggg tcccttctcc gccagccccg 2881 ccaaactcgc caagatgccc ctttccccgt ctccagggtc ctctgcctac tctcccactc 2941 gagtcacggg acctcggcag ctactggtag ccttccccca cttcagagtg gccgggaacc 3001 cttcctcaag tcgcagggcc ctctcccctc ctcttctgga gatatcgggc caccgcacac 3061 agtctctcaa ctcttccccg agactaagaa cattccttcc cctcaggagt cactgcccgc 3121 ctccttttcc ccagatttcc cggatcacca gcctccttca gagtaagcaa gactcccctt 3181 agagtgaccc aggccttcca ttctctggag agctgagagc cccccacccc gccccccacc 3241 taagagcgac ggggacctcc cacccgctgc agtgtgcggg cctcagattc ccattgactc 3301 cggaagccct tcggagctgc ggactctcca ccccgcccaa ggccaacgaa cattctctca 3361 catgttccca ccccaggctg ccccagaccc ctggcaatgg aagtggtcga gttcacccct 3421 ccccaggcct ggcacctggg cgacgggcag cgccacgctg ggtgcgcgca ggtagtaggg 3481 ggcgagcgtg gtggggcaga gcgcagccag ctcggcctcc agcagcccct cgccgaagcg 3541 ctggtcaaag aggcgtccgg gcgccgaaag tccgggcaac ggggccgagg cgcggcgcag 3601 ccaagacggc tgcacaggca cagggatctc catcctgctc ctccgcgttg cagtgcccct 3661 gcgtgcgcag ccgctcattt atagtgcgcc ctgtgccggc ccctcggagg gctgcccagg 3721 gacactggga ccgtggccag acccggccat taggagagcc ttatttagaa acccaaacaa 3781 tgacttggcc atttattaag cgcctgctgg atgcctggca aatgctgtca agaaggtgac 3841 atgaagattc cccaatttcc agaagaggaa actaaggctg aggaggggga gccttgcggg 3901 tgtggggagc acagcgtggt gattggggcc caggacccaa gaggcccacg gaagagtttc 3961 cagtcccttt aatccactgc ctcccccctg gcagctctga ggatgacgca gatggaggtc 4021 tttaggagca gattcctgga gaatgcggta gccccgccag gatgtcagga ggacatagaa 4081 caggccctcc ctacccttat cctcagccag accttgggcc cagtcccccg gcaggccggg 4141 cctaattctg tgcctctgac tcagcctctg gtgcgcagag gccagggggc gcgagcatga 4201 acagttcagg ctggcccccc gggggccgcg cgctggggct ggaaatagct cattttgatc 4261 ccgctacctc ggccaggagc ccagacctgg caactgatgc aaccagctgc tcctgcagca 4321 agaaagaagt tagatgaccg ttgggacgga ctgattggcc caactgatgg aggaatagcg 4381 gttagagaga gggcagagcg cgcccccccc ccaacccgcc ccactgcccc tgcaacagaa 4441 agaatggtag ttagaccggg gtcgggacag gacttctgca gcaagaaggc gcttaaaaag 4501 cgggactttt aagcgagggt actttagatc acctcctaac cgcaagccca gcctcagatc 4561 cttcccattg gtgcacggtc ttgccaatta ctgctacact cggcaggttc tccaatcaaa 4621 cacgccctgt cgggcaggca gtcgtcccag ccaatcctga aacacctccc tccgggtact 4681 tctgtttcct ctctttcatt ggtcgccttg gaggccgctc gcctccggcg cgggcacaga 4741 gaggggcggg gctgataggg cgttgctaag cgacggagat gcgcgcgggg cctgttgggt 4801 gaaggagcag agcggccgga agcgcggagg gagccgcggg atggaccgca ggtgaggccg 4861 atcgctcttc cagggactac aggaggctgg ggaggaccaa cggcgagagc agcacagcct 4921 aggacgggct ggatacggtc tggagtcgct agggctccac cgcactggaa ctacaattcc 4981 caacatgctc cacagccgtt ggcctctcca gccgtagccg ttagcatccc gggggtcccc 5041 taagagtctt atgttcctct ctgagtgggc cccaaggaat tattgcctct aaaggtgtcc 5101 aagaaaggct tgagatctga atttcttcat tttgaaatgg cccccagaca cgcctgggcg 5161 ttgtctttga actttctcgc ggaggcggag cccagtggat cctggggctt gtagtccatc 5221 taccctttgc cttcgtgtcc cccaggaatg tatgggaaat gctcggtgat ataatccagc 5281 cgcggttctt tctttctttc ttttttttta agacagagtc tctcgctctg ttggcccaga 5341 ctggagtgca gtggcacaat cttggctact gcaacctctg cccccgggtt aaagcaattc 5401 tcatgcctca gcctcccagg tagctgggac tacaggcacc tgccaccgcg cctggctaat 5461 tttttatatt tttagtagag acggggtttc gccatgttag taaggctggt ctcgaacacc 5521 tgacctcaag tgatccaccc gcctcggtgt aatcccaaag tgctgggatt acaggcgtga 5581 gccaccacgc ccggcgagcc gcgattctta acctgaactc cacttcgcaa tcacctggga 5641 cgctgcggaa aagacacgga ggcccagccc cactaataga tattctgatt ctgttggtct 5701 ggaatgggaa ccgcgcgcct gtaacgttga aaagcccctc ctagactgga tccagggttg 5761 agaaccaccg gctgtcagtt cctgagttgc tccctgttaa gactgctcca ggggcgggct 5821 cccaggactc acccttccac tgtcgatatc ctgaatgtgc aacggtgctt catggaaatg 5881 acagtccgtc tcctccagga atctatggga attgtctggt tctgccctcc tctaatgtcc 5941 ccctccccag ggctgcggcg aaaccacgtg ctgcctgaac cccactttcc tcttgcagcc 6001 tgccagtttt ctccattcaa gatagtccct ttggagatgc gcccctgggt cgaagccact 6061 actggccatc ccagagccag acctggtgtc ccaaggtgag gacacccctc aaagagtgct 6121 gagtgccagc ccagtagcaa gagaatgacc tttagagggt aggaagacat gtgatgagag 6181 atagggatga gagatttaag agacagcccc ttgtcccctc cccacggccc tgcccttgtc 6241 cccctctcta ccacctggat tccccatctg agcccccatc acactaggtt gttatcatta 6301 caggatgtgt ttcctcccct ctggactgag actttgtgtg tgtcctggtt cccctgcagg 6361 gatgacccat gagacctcac actttttctt cttgtgctct tccctgatct tagaccctga 6421 gcccatccag gtctcagaga tccaggctcc cacaagctcc caaggctcta gccacaggtc 6481 ccaactcccc tgagctgttt gaggagtcct ggccatccag ttcagggacc ccctccctgc 6541 ccagcaccac tgagggacag atgtgggcct ccccagcacc caccctgatt gacagcgggg 6601 actccgtggt ggccaagtaa gtaccagcag ccctggggga aaaagaggct ttgggttaga 6661 gggagggaga aggcatgcag taataatcat catggccaga gctgtttctg cagcactctt 6721 ttaggaccag gcagtttatg tgtgttaaga actctggtgc tagactgcct gagttcaaat 6781 cccagctctg tcatttaact ctctgtatga tcttggcctc agtttcttta tctcttttta 6841 tttatttatt tatttgttta gagacagggt ctcgctctgt tactcaggct ggaggtcagt 6901 ggtacaatca cagctcacta taacctcaaa ctcctgggct caagccgtcc tcccacctca 6961 gcctcctgag ttgctgggat tacgatgcat gccaccacac ccagcaaacc tcccaccttg 7021 gcctccaaaa agtgctgtga ttacaggcca gacgtaccat gcccagcctc agtttattta 7081 tctctaaatt gggtgttttg ttttgttttg tttttgtttt tgtttttttg atggagtttc 7141 actcttgttg cccaagttgg aatgcagtgg catgatctcg gcttactgca acctctccct 7201 tctgggttca agtgattctc ctgcctcagc ctcccaagta gctgggatta caggtgccca 7261 ccaccacacc tggttaattt tgtgtatttt tagtagagat ggggtttcac catgttggcc 7321 aggctggtct caaactcctg acctcaggtg atccacctgc ctcagcctcc caaagttctg 7381 ggattacagg tgtgagccac tgccaggcct aaattgggca tattaatagt acacatcccc 7441 tagggctgtt ttgagctttc agtgagttac tatatataat gatctctagc agtgcttctc 7501 acatagtcac cacttagatt tgtagaatac aggttcagct gctgtaacaa agagtcccaa 7561 attaattatg gctgaaacaa gatagaattt attcctccct cacctgcagt ccaggcagaa 7621 gccaccaggg atgatgtggc aacccgtagt tgaggtgtca gggacacagg ttacttctgt 7681 cttgttgctg tccgaactct aggccgttgc cctttccttc atggtgcaaa atggctttca 7741 accacatcag ctttccaaaa tcacatttca taccctgttt gtcaagaact tagtcacgtg 7801 gtcacaaggc tacaagggta cctggtaaat atagtccata ggagactggc tgtgtgccca 7861 gctaaaagtt acattcctgg ctgggcacag tgactcattc ctgtaatccc agcactttgg 7921 gaagccaaag tggcaggatc ccttaagccc aggagttcct gaccagcctg ggcaacatgg 7981 caagaccccg tttctactaa aaatacaaaa attggcaggg ctgcagtggc tcacgcctgt 8041 aattccagca ctttgggagg ccaaagtggg tggatcacct gagatcagga gtttgagacc 8101 agcctgacca acatggtgaa actccgtctc taccaaatta gccgggcgtg gtggtgcatg 8161 cctgtaatcc cagtgacttg ggaggctgag gcagaagaat cacttgaacc aggaggcaga 8221 ggttgcagtg agccaagatc acgccattgc actctaacct gggcaacaag agcaaaactc 8281 cgtctcaaaa aataaactaa ttaattacaa ataaataaat aaataaaaat acaaaaagta 8341 gccgagtgtg gtggcatgca cctgtgtccc agctacccag gaggctgagg agagaggatt 8401 tcttgagccc aggagttggg ggctgcagtg agctatgatt gcaccactgc cctccagcct 8461 gggtgacaga gcgagaccct gtctcaaaaa cagttatgtg actatataag aaagaaaagc 8521 agatattgag ggatagctac caggctctgc catggtgctt ggtaaacgtt ggctgctata 8581 attattctta ttattatttg gctctcctca ctccactcac ctcctatccc ctgatgttca 8641 caggtatata aacaggttcc gccaggctca gcccaccagt cgagaggagc gccagcctgc 8701 aggcccaacc ccagctgact tttggtggct gcagtctgac tctccagacc ccagcagtca 8761 aagtgcagca ggtacctctt tcagtgccat ccactactcc cacccctaaa cctttgctga 8821 tcaatgcccc cagcagccac tcctcctggc cctcatacct caactggggc ttctcagcag 8881 gagccaacaa accagaagga agaccccata cagctgtccc tactgcggtc aacgtgacca 8941 gtgcatccca tgctgtggct ccccttcagg aaataaagca ggtgacatcc ccattcactc 9001 cctcccttgg gtgcctgaac tgacaacacc agccctagga cagaattaga agatcaggag 9061 cagtggctca cacctgtaat cccagcactt tgggaggcca aggtgagagg actgcttgag 9121 gccaggagtt caagaccagc ttgggtgaca tggtgagatt ctgcctctac taaaaaaaaa 9181 aaaaaaaaga gagagagaga gagaaccagg tgtggtggta tgtacctgta atcccagcta 9241 cttgagagcc tgaggctgga ggatggcttg agcctaggag ttcaaggctg ctgtgagcta 9301 tgatcatgcc actgcactcc agcctgggca gtagagcaag accctgtctc tatttaaaaa 9361 aaaaaaaaaa aaaaaagcct gggcaccgtg gctcatgcct ataatcccag cactttggta 9421 ggctgaggca ggcagatcac gaggtcagga gttcaggacc agcctgacca acatggtgaa 9481 accccgtctc tactaaaaat acaaaaatta gccgggcgtg gtggtacaca cctgtaatcc 9541 cagctactca ggagcctgag gcaggagaat tgcttgaacc cgggagacgg aggttgcagt 9601 gagccaagat agcgccagcg cactccagcc tggcgacagc aagactccat ctcaaaaaaa 9661 aaaaaaaaag aattagagct gatccccatt tcaaggaagc taaaacagaa aaggaggatt 9721 tgctggctaa tgtaattgag aaatccaaaa gtagatgttt ccagatatgc ctggatccag 9781 gtgtttgaat aatgttggga gagacccaac tctctctggg ctccatgccc cccctttttt 9841 ttttgataca gagtcttgct ctgtcaccca ggctggggta cagtggcgcc atctcagctc 9901 actgcaacct ccgcctctcg gggtcaagca attcttctgc ctcagcctcc tgagtagctg 9961 ggactacagg tgcatgccac cacacccagc taattttttt gtattttttt agtagagatg 10021 ggatttcacc gtgttgccca ggcttgtctc gaactcctga cctccggcaa tccgcccacc 10081 tcagcctccc aaagtgctag gattataggc gtgaaccacc aagcccggcc tttttttttt 10141 ttttttgaga cggagtcttg tgctgtcatc caggctggag tgcagtggcg cgatctcggc 10201 tcactgcaaa ccccgcctcc caggttcacg ccattctcct gcctcagcct cccgagtagc 10261 tgggactaca ggtgcctgcc accacgcccg gctaattttt ttttgtattt ttagtagaga 10321 tggggtttca ccatgttagc caggatggtc tccatctcct gaccttgtga tccacccacc 10381 tcggcctccc aaagtgctag gattacaggc atgagccacc gcgcccggcc ttcttttttt 10441 ttttttttaa ttagataggg tcttactctt aggctggagt gcagtggcac aatcatggct 10501 cactgcagcc ttgaactctt tgcaaccttc gcctcctgag tacctgggac tacagcatgc 10561 gccaccatgc ccagctaatt ttttggtttt tttgtagaga tgggatctta ctttgttgtc 10621 gaggctggtc tcgaagtcct gggctcaagc aatcttcctg cctcggcctc ctaaagtgct 10681 gggattgcag gcgtgagcca ccacgcccgc ctctgtgctt ctcttttttt gggctcagtc 10741 ccaggcgggc tttcccctca caatagccag gatggtcctt ggggtctcta ggctcccatg 10801 gtcctggctc agtgactcca gtgggaagtg ggggcttttc gaattgctcc atcagaagcc 10861 ccagggctca ctgtgattca ctcatccttg agccaaccac tgtgaaccag aggatagaat 10921 gctctgatga agcagctctg atctgggagg tgggatccat cctcctccaa ccatgatttg 10981 tccatgaaag gtgctaatcc accgggcgcg gtggctcacg cctgtaatcc cagcacttta 11041 ggagactgag gtgggcggat cacctgaggt cgcgagtttg agaccagcct gaccaacatg 11101 gagaagccct gtctctacta aaaatacaaa attagccgag cgtggtggcg catgcctgta 11161 atcccaggta cttgggaggc tgaggcagga gaattgcttg aacccaggag atggaggttg 11221 ctgtgagccg agatcgtgcc attgcactcc agcctgggca acaagtgaaa ttcaatctca 11281 aaaaaaaaaa gaaaaaaaaa aaggtgctaa tccaaagggg aaaactgggt ctctcctccc 11341 tgaggaggag gagcagatcc taagctggca tgtgaggtta ccaagggcag tacaacctgc 11401 attccaggcc ctctgtttag aacccccttc ccagcagcct ttgggttggg gctggcgtct 11461 gaccctgtca ccctgcagaa cctccacaca tggaactcat ccctgctgga cctggagacg 11521 ctgagcctac agagcagagc tgccaggctg ctcaaacgca ggtgcccgca cccctgcccc 11581 catcaccctt cctactgcgg ctccacgtgg cccaggtctc gagacctcca ttgcctcact 11641 cctgccttct ccctgcagca aagcctccat ctcctcctcc tcctccctca gccccagcga 11701 tgccagcact tcctcattcc ccaccagctc tgatggcctc tctcccttct cggagacctt 11761 catccctgac tccagcaagg gccttggccc cagggcaccc ggtaagggct gagaggcaaa 11821 gactgagggc agcctgggtg caaatcccaa ctctgccact gaccagctat gggactttgg 11881 gcctctctgg gccttgtttc tccagctgta aaacagggat aaggccagat gaggtgggtc 11941 aggcatggag gccgaggcgg gcagattgcc tgagctcagg agtttgagac cagcctggcc 12001 aatgtggcga aacccggtat ctactaaaaa tacaaaaatt agccacgcgt ggtacataat 12061 ggtgcatagt ggtgcatgcc tgtagtctta gctactcagt aggctgaggc aggagaatca 12121 cttgaacctg ggaggcggag gttgcagtga gccgagattg cgccattgca ctccagcttg 12181 ggcgacagag agagactcca tctcaaaaat aaataaataa atggtgccat catagtccaa 12241 ttgctgggct caagcaatcc tataacctca gcctcctgag tagctgggac tacaggcatg 12301 caccactatg cccagctaat tgtttttatt tttttatttg ttgagatgag ggtctcacta 12361 tgttgctcag gctgatctca aactcctggg ctcaagcgat ccccccaccg ccttcgcttc 12421 ccaaagcact agggttataa gaatgggcca ttcgttatgc ccagctggtt gttgcagttt 12481 tgaataggga agccagggaa ggtgaccaag atctgatgga tgtgagagaa tgcaccgtgc 12541 gatacatgaa ggaagagcat tccaggcaga gggaacgaca ggtgtaaaag gcaaagcctt 12601 cgccttttac cgggagagag gtgcagccaa gggagggttc ttaagtacga gaggcctggt 12661 ctgactcagg tattcacagg ctccctctgg ctgagagtag ggaacagatt tggaggagag 12721 tagagagacc agggaggcta tttaagagtc caggttgggt gagtggggct ggagcagggt 12781 gagggctgta gcgggtgggg gagtgggtag attctgggga atatgtcttc tggtcactgc 12841 tgtaactcca gtgtccagaa cggctcaata aatatttgtt gaatgaaacc aggcactgtg 12901 actcacgcct ataatcccag cactttggga ggcccaggtg ggatgatgat ggcttgagcc 12961 caggagtttg agaccagctt gggcaacaaa gcaagaccct gtctctacaa aaaaataaaa 13021 attacccccg tgtggtggca cacgcctttt gtcccagcta ctcaggaggt taaggcagga 13081 ggatcgcctg agcccaggag gtcgagactg cagtgatcgc accactgccc tccagcctgg 13141 gtgacagaac aagactatat atatataatg aacaaagact tgaatgatga cagatggaga 13201 agaccaggag tgctcgctcc cctgactttg accccttccc ctcttcccac cccagagagg 13261 agctgggaca gtatctggag agcctgtctg agtaaccccc tgcacagttg gagggcgtgt 13321 gtgcaggctg tcctggggtt gctgatctcc ctggcccctg cctcaccctc tcctctctac 13381 ctccccctac agcatccccg gcaccagccc aggcccagac ccccacccct gctccagccc 13441 cagcctcctc ccaagcaccc cttcggccag aggatgacat tctgtaccag tggcggcagc 13501 ggcggaagct tgaacaggct cagggaagca agggtgacag agcttgggtg ccgcctctga 13561 cccctgccct ccgcacgttg gtgagccgag ggagggagga gcctgggggg agctggagga 13621 ggggctgggt ctgaggggga ctgaggaccc tccaccctct ctctctctgt ctcagatctc 13681 tcttatctgt ctacagacct ctcctgctcc agtggagacc ctcagttctc tggggaccca 13741 gcctaaccat gtcccactgt ggagcagtgt ggcccagcct ggtccaccag aggccttcta 13801 tgtggagagg cctccttttc cctcagtgtc ctctccacac atcttttggg cccccagctc 13861 ccacgggttc ttctgggccc cacagtctgg gccttgggta tcccttgggg ctgttcctcc 13921 cacgcagcag gcctccaccc tagcacatct gggctctacc ctcgcgcccc cggcttccct 13981 ggcctccacc ctcgaacccc cggcctccac cccagctccc ctggcctcca cccttgcacc 14041 cccagcctcc accccggctc ccctggcctg tacccctgca cctccagcct ccaccctcgc 14101 atccccagct gtcccccagg gcctgcccat ccctgaccca agcagctgtg cccagcctaa 14161 gagcctgggg cccaagtctc ggaggagcag agcccctcgc ccagaggctg ccgagcaagt 14221 ccctgcagct ggccagggac ctggccctca gctcaggggt gtcctgggcc aggtagtggc 14281 agctcggctg ttccctgaca gcctggagga cacgcctcct cacttcgagg gcccccctcc 14341 acccaaggct ggatctccga aagtccaggc cacacaaccc caaaccaagg tcactccccc 14401 tccatctgaa tcccagtgtc gggccaaggc cgagtctctg aaagccaagg ccttgccgcc 14461 cgcagcgggg tcagtgatac ggaagagcga agccactcct tcccctggag cctgcctgca 14521 gcccgaggtc ccactctctc cagctgagca ggcaaccaca gtcaaggcct cgccgccagc 14581 cttccaggtg gggtctccgg aggccctggc cccgcccccg cccgctgctg accacgcccc 14641 ctcggaggcc ctgcttgccc agggccgccc tgctgctgca ggctgcagaa ggtgatgccc 14701 gcgccctcct ggatgttgga ggcaggccag ccccctaaga ggccggcccc ttggagctca 14761 ttcttttctc cccggcccag actccgacgg cagcgagttc caggacgatc ccgtgctgca 14821 ggtgctaaga gcccataggg cagagctgag tcggcagaaa aggtgaccga ccctccatcc 14881 ccagagtcta tgacactggg ccccggagac ctctgagacc cggttaggca tccagccctc 14941 ctcatcccct gtcccagcag tccgtccaca gccccagatt ctaagcctga gcacataccc 15001 cagcctctgc tctcccggcc tttctccagg gaagcggatg cccgattatc gttcctgttg 15061 gaccaggctg aagacctggg atcttggtcc cctccagccg ggtcgccccc taggtcccca 15121 aggaggctgc taagaaggga aggagattcc ctggaggcca gaagactttg aattgtacag 15181 attctatttt acccagtgag gctctttttt tttttttcat acagttactg gttatctgtg 15241 agaaaggggt tgtttggaaa ggccaagggg tcatgccaat ttaaggaaat gccccccatc 15301 ctccgctgcc ccatggctct gccctgcccc ccaccccttc ctgaccacgg aaacaagatc 15361 tcctttctgg ttatatcttc cttggggcag agggagtttc agacaagact tctggcaaga 15421 ctgggaccca ccttatctgg cttcactcag gagccctggc tcaaatggac caaaggactg 15481 ccctccacgg aggaagctac tagggtcatc agagtgttgg atgatggtca ggctatagcc 15541 cacacatgct tgagcagcag cagcatctgg tgcagctgac ccctccctcc ttgaagcact 15601 ttctctccct gcttccagcc cttctctctg cacctctctg gtcaggccta gtcactagcc 15661 cagcatgtca actttggcat gacccagggc catggcgttg acatctctat ctgcacccac 15721 ccataggtgc tcttgtctag cttaatgact ctaaaccccg tctctattct gataatcctc 15781 aaatttctgt ctgtaaccta gacctggcca cgaaaccttg ggctcctacc tcattaccat 15841 ttctacccaa gggtttaaaa gacatctcaa attccatagg tcctaaatgg agtttccaat 15901 ctttccccct aggcgtgcct ctccccagct tctttttttt tttttttttg gatggagtct 15961 agctctgtca cccaggctgg agtgccatag tgtgatctcg gctcactgca agcttcgcct 16021 cccgggttca cgccattctc ctgcctcagc ctcccgagta gctgggacta caggtgcccg 16081 ccaccacggt cggctaattt tttttgtatt tttagtagag acggggtttc actgtgttag 16141 ccaggatggt cttggtctcc tgacctcgtg atccacccgc ctcggcctcc caaagtgctg 16201 ggattgcagg cgtgagccac cgtgctgggc cccccctccc caactttctt cggctctgtc 16261 ctttgccgct cagaccaaaa accttagagt tgtctttgac ttctgtcttt cccttccacc 16321 cacagttaac caggaaatcc tgccatctcc gcctttattt tattttattt tttgagatgg 16381 agtttcaccc ttgttgccca ggctgtagta caatggcatg atctcggctc acggcaacct 16441 ccacctcccg ggttcaagcg attctcctgc ctcagccttc tgagtagctg ggattacagg 16501 cacctgccac cacgcccagc taattttctt tgtttgtttg tttgttttga gacagagtct 16561 tgctctgtcc cccaggctgg agggcagtgg cacgatctcg gctcactgca acctctgcct 16621 tgcaggttca agctattctt ctgcctcagc ctccctagta gctgggacta caggcgtgtg 16681 ccaccacgcc tggctaattt ttgtattttt agtagagacg gggtttcacc atattggcca 16741 ggctggtctc aaactcatga cctcgtgatc ctcctgcctc agcctcccaa agcgctggga 16801 ttacaggcat gagccatcat gcctggcaat tttttgtatt tttagtagag acgaggtttc 16861 accatgttgg ccaggctggt ctcgaactcc taacctcagg tgtcccaccc acctcagcct 16921 cccaaagtgc taggattaca gatgtgagcc accgcacctg gcctccacct ttaaaatcta 16981 tccagaattc aactgcttct ccccttctac tgctcacacc tcaatctagg ctgtcatcac 17041 ctcctctctg gactgttgga aaacttaacg gtttttataa acctgatatt ctttgacaca 17101 cttgccatca agaggcagag tctatgtccc ttcatcttga aactgggccc ctcaacaaat 17161 aaaatgcaga ggaagtgatg cagcttaact tctaaagtgc agttagaaaa gatgatttct 17221 cagctgggca tggtggctca cgcctgtaat cccagcactt tgggaggctc aggtgggcag 17281 attacttgag gtcaggggtt caagaccagc ctggccaaca tggtgaaatg tctctactaa 17341 aaatacaaaa attggccagg cattggtggt gggtgcctgt aatcccagct actcaggagg 17401 ctgaggcaag agaattgctt gaactcagaa ggcagaggtt gcagtggaaa aaaaaaaaaa 17461 gataatttct ctctctttct ctctctctct gacacttgct ttagaggcta cattaaaaaa 17521 aaaaactagt tgccatgctg gggaggggtt tgggttacac aggtgtatgc acttatgaaa 17581 cttagtgaag tgtgtgaatt tcattgtatg taaagtttac ctcaaaagaa aatattgaac 17641 gccagttaaa gatacgcctg taatctgact tctttgggag gccaaggcgg gaggatcact 17701 tgaggccagg agtttgagac taccctgggc aacacagcaa gacctgtctc tacaaaaata 17761 aataagtaaa taaatagcca gatgtggtgg cacacgattt gcagtcccat tactaggggc 17821 ggggaggggc gctgaggcag gaggtttact tgagcccagg agttcaaggc tgtattgtag 17881 tgaactaaga tcatgccact gcactccagc ctgggctaca gagctagacc ctgtctttaa 17941 aaaaaagaaa agaaaaaaaa aagtatgtac actgaaatat ttagggaaca ataaaccaat 18001 gtcgacaatt tattttgaaa tgcatccaga gatgagttgg attgatggat aaatagtgta 18061 ataggtagat gtgagataaa gcaaatacag caaagtgttc atggtaggat ctaagtggcg 18121 gatatgcggg tgtccactgt aaaatttttc aactttgctg taatttagaa acttttcatg 18181 gccgggcgcg gtgactcatg cctgtaatcc cagcactttg ggaggccaag gcgggcggat 18241 cacgaggtca ggagatcgag accatcctgg ctaacacggt gaaaccccgt ctctactaaa 18301 aatacaaaaa attagccggg cgtggtggcg ggcgcctgta gccccagcta ctcggaaggc 18361 tgaggcagga gaatcacttg aacctgggag gcagagcttg cagtgagccg agattgcgcc 18421 actgcactcc agcctgggtg acagagcgag actccgtctc agaaaagaaa cttttcataa 18481 taaaatgatg gagaatatat gattggggtt gttttgatat ctgcatttac aaaatggagg 18541 gataagatgg atttctagta acttctgtat gtcacatggt aaatataata tttgttttca 18601 acctttcttt ctttcttttt tttttttttt ttttttgaga tggagtcttg ctctgtcacc 18661 caggctggag tgtagtggtg caacctcggc tcactgtaac ctctgccttc caggttcaag 18721 tgattctcct tcctcagctt tccgagtagt tgagattata ggtgcatgcc acctcacctg 18781 gctaatttgt gtgtgtgtgt gtgtttttag tagagacagg gtttcaccat gttggtctgg 18841 ctggtctcga actcctgacc tcatgatcca cctgccccgg cctcccaaag tgctgggatt 18901 acaggtgtga ggcaccgcac ccggcctgtt tctaaccttt ctaagcaaac catacattta 18961 aatgaaatca gagtgggcac agtggcacag gcctgtaatc tcagcacttt gggaggccaa 19021 ggcgagcaga tcacttgagg tcacgagtta gagaccagcc tgggcaatat agtgaaactc 19081 tgtctccact aaaaattagc tgggcctggt ggcgggtgcc tgtagtccca ggtgctcagg 19141 aggctgaggc aggagaattg cttgaacatg ggaggcggag gctgcagtga gttgagatgg 19201 tgccactgca cttcagcctg ggcaacagag tgagactcca tctcaaaaac aaacaaacta 19261 acaaaaaaat gtaattgaca agaaaaatag ctgtttagat ataaggagaa ataaaagata 19321 ctatagtagc agaacctgat ctagctgggt cacgggtggg gtctagaagc attcctgagg 19381 aagttacact taagctgaga caggtagaaa ttatctagtt aacaaagggc tgtcctaatt 19441 actctagttg gataaccgct cccaaaactt agtggcataa aacaattatt ttattatgct 19501 catggattct gagagtcaga ggtttggaca gggctcatat ggggacaatt tttgtctcct 19561 ccatgatgtc tggggattca cctggaaaga ctcaaaggtg acttgataga cttgatggct 19621 gtggagtaga atcctccaga acttcttccg tggtcttctc ccagtctgac tgggactatt 19681 gactaatgcc tatacatagc tccattggcc tgggcttcct caaagcatgt ctgcttcagc 19741 atagtcacac ttcgcatatg atgcaccatg gttctacagc tcattccagt ggacgagaac 19801 attggtcaag aagctgcatt gcctttcatc acatagcctt agaaatcacc gtgtcacttc 19861 cactgttttc tattggttga agctatccta gtcccaccca gattcataga aagcaatcat 19921 agagtccgcg tcttgataag aagagtgtca gagaatttgc agctgtttta aaatcactgc 19981 aggaccccat tctccatgat gtgattatta cacattgcat gcctgtatca aaacatctca 20041 tgtgccccct caatatatat acctaaatat acctgttatg tacccagaaa aattaaaaaa 20101 taaaaaaaaa aatcactgca ggggttgatg gggaaacact agacagaagg aatatcgtgt 20161 gggaaatccc tgtgatgaga gtgaggtcag caaatttagt acataagtaa ctgcacattc 20221 acctctccct cagcactccc tttcaccagc ccttgcttaa ttattctcca tagaatgaat 20281 caccttctaa tatttttttt ttttttcaga cagagtcttg ctctgtctcc caggctggag 20341 tgcagtggca aaatctcggc tgactgcaac ctccgcctcc tgggttcaag caattctcct 20401 gtctcaggct cctgagtagc tgggattaca ggcacacgcc accacacctg gctaatttta 20461 atatttgtat ttttagtaga gacggggttt cagcatattg gccaggctgg tctcgaactc 20521 ctgaccttgt gatcctcccg cctcggcctc ccaaagtgct ggggttacag gcgtgagcca 20581 ctgtccccag cttttttttt tttttttttt tttttttttt gagacagagt ctcacttcaa 20641 cacctaagct ggagtgcagt ggcgcgatct cggctcacag caaactctgc cacctgggtt 20701 caacccattg tcttgcctca gcctcccagg tagctgggat tacaggcacg cccgccgcca 20761 cacctggcta atttttgtat ttttagtaga gacggggttt ctccatgctg gccatgctgg 20821 tgtcgaactc ctgacctcaa gtgatctgcc cgcctcggcc tcccaaagtg ttgggattac 20881 aggcgtgagc caccacgcct ggcctaatat tctatatttt gtttaatgtt aattttgttc 20941 attgtcttct cccagtagct tttgagctcc aatgtgggca gggtatattt tctcctggtt 21001 tggttgttga tgtatgtaca aattgattag agtttgagtg aatgaatgaa tgaatgaatg 21061 aatgaacaga ccaagaccct ctgagatgag aatttgttga gggcatgact aaggagagac 21121 cctcctgtga agggcgttat tacagtgtta tctgggcatg ctcagtatta gcaggctcca 21181 ttgggaatgg ctttatgggg ggcataagca tgatctggca tttcccccta agcattttcc 21241 tagaaaaaaa aaatcaaggc tggagattgg cccgtaataa gcagtagaag gggaaacaag 21301 aaaatgtcca gtgggcaggg gaggccaaat cgcagaaagc ctcgggttac gtcctaggca 21361 aggtatatga ggcaacgaga aacgtccatg ggcggagcgt cctcggcatt atgtgagcgg 21421 ggtcgggatc aggactgaaa aggtgagact tgggactggg actttgaaaa gctgcttaag 21481 gaactgggca ctgctctgag gccgtgggag aatccaatta agactatcag gggctggctg 21541 gcgtggtggc tcacgcctgt catcccagcg ctttgaaagg aggctgagga gggagtatct 21601 cttgagcccc agagatcaag accatcctgg gtaacatagt gagaccccca tgtctaaaaa 21661 aaattagcca ggcgtggtag cgcacgcctg tagtcccagc tacatgggag gtatcgcttg 21721 agcccaggag gttgaggctg cagtaagcca cgattgcagg actgcgctcc agcctgggcg 21781 acagagtgag accctgtctc ttaaagaaat aaaaataaaa aagtcagaag aatggacact 21841 gggagtggac gaaaggtagg gtccattgct tggtgttgga agtgtcggaa gcaaagcata 21901 aagaacctgg gggcttgtct caggagggcg atgacagcca atgaaaagca gtcatgggcg 21961 tggcgctgtc aaacttcgag gggcgggacc gaggacacag agccggggcg gagcccaagg 22021 tgaaaccaat gagaagcctc cgggtgggcg gggcatcggc ctaaggccaa gggcggagcc 22081 aatgagaagc agcgccgcgt tcccgctgcc ccccgccccc gtggggcgcg cgccggagcc 22141 acgggcagcc gttaggggcg gggtctgcag ccgcccgcgc gcggctcgcg ccctcccctt 22201 tgtgtcgcca tggcggcggc agcggcgacg agaacggcga gcgaggggtc gagcgcggcc 22261 ggggcctgag gaggctacgc gaccatggtg gtaagggtcc cacgcggccg tcagcctgtc 22321 cgtccggatg tcagtctgtc cgtgcgcagc cccgccccgc gcgccccgcc cccggccccg 22381 cccgatcccg cggcctgtgc ttcagccgtg gtccctcccg tcctgcggcc ccatcccggg 22441 tcccagcccg tacctcgacc ccgcccccta agcgcgcatc cccgtcttcc acgccctgga 22501 tggggtgaca gggacctagg gcctgggctg ggaggaggcg gggctagtcc aggaagggac 22561 ccgcgccacc caagtggccc ctgcaggggc ctcctgaggc tcctgggtcc ttccccagct 22621 cccatcccag caccttcctc ggcatccttc tgccagccct cagccctccc cggcggagcc 22681 ccctcctcct ccccacagcc cctttctcat tcccgagccc caccccccac ccgctccatc 22741 cccagcctga cccttttctt ctccttctcc ctcattttca ccctatccca gttcctctca 22801 atcccccccg ccccgcatca gtctgagcct ttgcctcgtc tctccagcca ctccttcctt 22861 actccacccc aacaggccag cactgtacct cccttgatcc tcgggtctgc tccatctcct 22921 tgtcctcatc cccctccccc accatttccc tctcccacca tgctgctccc gctcattcat 22981 ccattcattc cctcacttag cagacattca ctgagaccgc ctctgtgcag gccccacgct 23041 ccaggcacag agagagtcag ctcccatcct gccttgggga accttatggg ctggaggggg 23101 acagaccctg agggtgagac cctgagggtg actgggggtt gcagggacat ggagcaggga 23161 gacgctacag cccagtgaat ctgtctggca gctggttaat ttattcatca gattttaccc 23221 agtccctgct gcgcgcgact gggctggcac tggggaccca gaaaagaatc aggaacaatc 23281 atgtctgaac accaactcaa tgcccagctc tatgctgggg agctcagtct ggggccctcc 23341 ttctgggagg ccggtcatag tccaatgaga gagatgaact tgtcaccacc agtgccatct 23401 cagagtgatc agagctgtga agggagaagc tctggtcaag gcatggaacc tgggatgggg 23461 gaacctcagg ggatggagag agtccagagg agatgccagg tgcagcctga agggtcaggg 23521 agaacttcct ggaggagggc acatgtgagt tgggacctga ggggtgggag tgaggatgaa 23581 tgtacccagg gaaaggtggg gcgagtgttc caggctgagg gaacagcctt tgcaaaggcc 23641 tagaaggaaa tgagaagagg gtgctttttt tggactctta gaagttcaag agctctgaga 23701 gtgggggcaa tgggaagaga tgggactgga gaggtaaaca gaaatcagct cagggaagtc 23761 tcgattatca gaccagagaa ttcagacttc gtcttgaaga caacagggag tcattgaagg 23821 cccaggatct atggagaaat agagccagat ttgtgttttc tgaaaacgat ccaggctatt 23881 gcggaaaagc tggcaggagg ggagaggctg gaagcaggaa ggctgatgag atgggagttg 23941 tgattgtcta gacagtggta ggagatcact gtactctgca ggggctacgg aagggttttc 24001 agcagggaga gatgggacca gcatgctctc catcctgttg ccccgcctca cgtctggccc 24061 ctgcttctcc agtcccccac cccagaccac acacgaagaa gcagtcctgt cctcagccca 24121 gccctcacct cccccgacct gccatcctgc ttcatgctca gggcggtgtg tggagcgccc 24181 ggggctctgg acccgcgctg ccagataaca atgctctcgt tgtctctttg ctcccatctc 24241 tgggggcctc tgattctttc tgctctacag gcacgcagca ctgacagcct ggatggccca 24301 ggggagggct cggtgcagcc tctacccact gctggggggc ccagtgtgaa ggggaagcct 24361 gggaagaggt gagggtgagg gaggaaaggg ctcagctagg agctggggag actcaaggag 24421 ctgctggtga cgcatcccct gtctcccagg ctctcagctc ctcgaggccc cttcccgcgg 24481 ctggctgact gcgcccattt ccactacgag aacgttgact ttggccacat tcaggtatgg 24541 gggctttgca tttgcaccca ggagggaaga caatatccta cccaaccttg caacgatatc 24601 aggtgctttg cttctgaaac ttatccctgt gactggctgc tgcacgcgtc tgagcagtca 24661 gtagcaaaag ttgcaggagc cacagcagat ggggagagct gctcattcat tcatccattt 24721 gttcttttat tcattcaaca acaacttttg ttaggtatct gctctatgtc agggaccagg 24781 acatcaccag gccctgtcct cggggaggtc cagaagggag tcacaaaccc gtccctagat 24841 agtgccatcc ccaggaggtc aggactggga caaaagtcca cctgggcctt gctttccctc 24901 tgcagctcct gctgtctcca gaccgtgaag ggcccagcct ctctggagag aatgagctgg 24961 tgttcggggt gcaggtgacc tgtcaggtga ggccatcccg cctctcatct agcctgagag 25021 aatggccact gtagtcctca gctcggtgtc atggggcccc tgctcccttc tgtccctctg 25081 ctcccacagg gccgttcctg gccggttctc cggagttacg atgactttcg ttccctggat 25141 gcccacctcc accggtgcat atttgaccgg aggttctcct gccttccgga gcttcccccg 25201 ccccccgagg gtgccagggc tgcccaggta acctgcttgt tgtctcagcc cctgcctcat 25261 gagtgtgtcc tcatccacag tgtgaaatca tcaaggcagc gggatagaga aatatttaaa 25321 tactggtatg gcccaagtcc caaccaatct gaatgaatgg agaagcctta gaaacgtagt 25381 aggcttctgc tacatttagc tttgctaagt tttaggatca tggggagttc cagggggtcc 25441 catgggaggg tcaccatggt catgccaacc agggtactca agagcttggc ccagcctgct 25501 tattcattca tatgcccata tctcagtacc taacagctga gcagcaccca tgtgttcagg 25561 atgaacttca gcccagagac gagctagggc agtgtgggag ggcagtgggc tctctcacgt 25621 ccactcacag aggcccactc tgacaacctg cccccagatg ctggtgccac tgctgctgca 25681 gtacctggag acactgtcag gactggtgga cagtaacctc aactgcgggc ctgtgctcac 25741 ctggatggag gtgggcctgg gcagggggct tggagattcc gagtgggtga gggggtgtct 25801 gaggggcgag aagcagcctc ggtgtgtgtg tgggcataga aaagaaagga gccagagtgg 25861 aggaggcgtt gatattttag tgtctgtgtg ggcgcatgcc tgtgagagaa tgtgtgtgag 25921 catgtgtgtg tgtgcacgtg tgtgtgttag caggtgttcg gctttaagga tacaagggtt 25981 taaatgtata tcagttagaa atgggttggg ctgtaactca cagaacacct ggctatgagt 26041 ggcttaaacc agaggggttt tttttctttt cttgagatgg agttttactc ttgttgccca 26101 ggctggagtg cagtggcatg atctcgccca ccacaacccc tgcctcccag gttcaagtga 26161 ttcttttgcc tcagcctcct gagtagctgg gattacaggc atgtgccacc acgcccggct 26221 aattttatat tttcagtaga gactgggttt ctccatgttg gtcagactgg tctcaaactc 26281 ctgacctcag gtgttccgcc cacctcagcc tcctaaagtg ttgggattat aggcatgagc 26341 cactgtgcct ggcctctggc taattaaaaa aaaatgtttt gtagagacag agttttgcta 26401 tgttgaccag gctggtcttg aactcctggg gtcaagcaat tctccaggct caccctccca 26461 aagtgctggg attacaggcg tgtgccacca cgcccagcac agagagattt tagatggcaa 26521 ccgtgtggca tctgctgtgg aggatgaagg tgcagaggtg gatggggaag ccttcaggtg 26581 gggaagcctt tgggtgggag agtcctggga catgtgaggg gaaataaagg ggtttttctt 26641 agaggtttcc ccacccgcag agggtgccag ggctgcatcc ctacaaacag gaatctaggt 26701 gtttgaccaa tagctctgag tgacaggggc tcttgacggg ggcgggcagt ggcctcaccc 26761 agcgcggagg agcttggtat gcctgcacta acaccgtctt ctgacctgtc cttgccacat 26821 tccacctcta tttcagctgg acaatcacgg ccggcgactg ctcctcagtg aggaggcgtc 26881 actcaatatc cctgcagtgg cggccgccca tgtgatcaaa cggtatacag cccaggcgcc 26941 agatgagctg tcctttgagg tgaggctgtg gggaagcaga ttccagctgg gctccccaca 27001 ccccctgctc cttctgaccc ttctcttccc acccgccctc tcccaggtgg gagacattgt 27061 ctcggtgatc gacatgccac ccacagagga tcggagctgg tggcggggca agcgaggctt 27121 ccaggtgagt ccagctgggc gcggacaggt ggggctgggg tacctgccaa ctggggtggc 27181 ccagctactg accctgacct tcctcaggtc gggttcttcc ccagtgagtg tgtggaactc 27241 ttcacagagc ggccaggtcc gggcctgaag gcgggtaagt gccatggatg gatgggaggt 27301 gtggggaggg gtgggaaggg gtggggcctc ctgcgtcttt tgcctcccac tcatcccttc 27361 caccccattt ttcgcctagc agatgccgat ggccccccat gtggcatccc ggctccccag 27421 ggtatctcgt ctctgacctc aggtaataga aataggcggt caggtcccag cccctacccc 27481 accaggcccc tggccatgct gaccccacaa gacctgcctt tgccctttgc cccttgcccc 27541 cacagctgtg ccacggcctc gtgggaagct ggccggcctg ctccgcacct tcatgcgctc 27601 ccgcccttct cggcagcggc tgcggcagcg gggaatcctg cgacagaggg tgtttggctg 27661 cgatcttggc gagcacctca gcaactcagg ccaggatggt gaggccgggg cccacccacc 27721 ccacccgtca caccagggct gcggcccacc cagccctgac cttgctttct cccagtgccc 27781 caggtgctgc gctgctgctc cgagttcatt gaggcccacg gggtggtgga tgggatctac 27841 cggctctcag gcgtgtcttc caacatccag aggcttcggt gagggccctt agccaaccct 27901 gtccttccac aggcactcac ccagcacctc cactccagcc ccgtgctgca tgctggggac 27961 acagttacca ggagtcagga gtggcaggat caaggctggg gtcagggaca gctctcttga 28021 gagtagagtt agcagtctaa gcgaggacta tcttcaccga gcacctgcca cgtgccaggc 28081 actgttctag gcactgggga cgcagcagtg agtgagacag ccagaaaccc ctgccctcat 28141 agggcccatg ggctagagga agagacaaac aggggaggga tagagttcac cagaagatga 28201 caggtactat ggaggaaaac agattggggt aaggaggatg gcgacgaaga gccaagaggg 28261 aggcctgcag ttttggatgg caagggcaga gccacctcat gcgagggtct ggaggagggg 28321 aggggcaggc catgcagagg gagcagcagg gcaaagggaa cagcaacagg aggccatggc 28381 caaggagtcc ccgtgccagc gcaggcaaag ccttcatgtt agggttttgg cttttactat 28441 aagtgaaggg ggaaccgcag gagggctctg agccggggcg gggtctagag gcggggagga 28501 gccaggcgag atctcggcct gcaggaggct gagaggggag tgcttgaagc ctgcaggggg 28561 gtggccaaga gggaggcagg aggtggcaat gggtgggaga ttttaccaca ggcagtgggg 28621 ccccaccaga gaattgggag tagcagggtg ctgggatttt gtccacattt tccagtgccc 28681 tcttacagcc aggagaatgg cagctgcatg gtaggagctg ctcatgctct ggagtcagac 28741 agtcttgggt agctgcaggc agaggcacct ctgtctgagc ctcagcatcc tcctctgtaa 28801 atgcggcacc cttctctttg ccacacaggc ttgctgggga gttcagtaag gttctgcctc 28861 tgaagagccc agtgtaggac ctggggaaga gggttccatc cacctgcggg cacttggggg 28921 taggggcaag gggagcatgg ccacgtgagc gagcccctct gacctggatc ttcctcctcc 28981 ttgacacggt ggtctcaggc acgagtttga cagtgagagg atcccggagc tgtctggccc 29041 tgcattcctg caggacatcc acagcgtgtc ctccctctgc aagctctact tccgagagct 29101 tccgaaccct ctgctcacct accagctcta tgggaagttc agtgtgagta agggagctgg 29161 cgggacggag ggggccggga cgcctctggc ccagacctca tcacacctgc ccaccatctc 29221 aggaggccat gtcagtgcct ggggaggagg agcgtctggt gcgggtgcac gatgtcatcc 29281 agcagctgcc cccaccacat tacaggtaaa ccaggagggg cagggcggga cttggtggga 29341 ttccaagggg gttgaggctc aggtgccccc tctgctccca cccccaggac cctggagtac 29401 ctgctgaggc acctggcccg catggcgaga cacagtgcca acaccagcat gcatgcccgc 29461 aacctggcca ttgtctgggc acccaacctg ctacggtgag ctgcttgctc gcctgcctgc 29521 ccctcaggtc tttccccaaa accaccccag gaacccgccc agcttttctt ttgtttattc 29581 atcgaatacg tgtctatcaa atacctccca tggacctggc cccgtcccta gcactgggga 29641 cccagcactg agcatggccc tgtcctcctg ggactcatgt tctcatggag gagatggacc 29701 atgaacatca acaggaaaaa tacagagtaa agtctgatgg tgatgagtgc tgaggaggag 29761 agaggaggag ggaagggcag tgtgcagggt cagggcagtg tgaagttttt agcgagagtg 29821 gcaagggagg ctttggcagg tcttctgtgg ccagcagcag aataagcctg gaggatttga 29881 tggtgaacga gacaggcatg atctctcccc tgccggagtt cacagtcttc tgggggcaca 29941 gatacataaa caagtaaaac aggtggtatg tcagaggacg ggcgtaagat gtacggagaa 30001 aaataaacga gaaaggccag gggtacagag aagggagtgt acagttttga ataaagtggg 30061 tgaggaagac tgcactaaga aggcaatatt tgagtcaaga cctgcagagg atgagggagg 30121 gccccaggtg cgcacgtggg gagtgacagg ctgagggaac ggcaggtgca gacatctggg 30181 cctgggccgg tgcctggtgt gttggaggag ctgcagggag gccggtgtgg ctggagctga 30241 gtgagtgagg ggctgaggga gaggagatga tgaggtcaga gaggtgacgg ggaccagaca 30301 gaggggtgac tcacaggtca tgggtcacag tgaggacttt gctcttgcct ggagcgaggt 30361 gcagccaggg cagggctctg agccaggcgg gccctgatgg aactcaggtg gtctcaggct 30421 ccctctggct gcaggtggga gcatacttta gggtggggac cggaaggagg ctctgggatg 30481 tccaggctgg gggagatgga ggctgggcca gatggggggc agtggagggg gtgacagatt 30541 gattgtggga gttaaagagg ggggctcagg gcaaccccaa ggtgtttggc cagagccaca 30601 ggaagaacca cactgagatg gggaaggcag gaggggcggg tctgggggaa ggttgagagc 30661 ccagcgaggc gtgatcagtt tgaggtcaga ggtgttgaat agacagtcac tgcctcccct 30721 ggctcccctc attgccctgc cagaacctgc tggctgggct caaggcaccc agcctccctc 30781 ccgctcctcc cacccaggtc catggagctg gagtcagtgg gaatgggtgg cgcggcggcg 30841 ttccgggaag ttcgggtgca gtcggtggtg gtggagtttc tgctcaccca tgtggacgtc 30901 ctgttcagcg acaccttcac ctccgccggc ctcgaccctg caggtatgcc ctcccacccc 30961 ctgaggtcct ggctactgcc caccacgatc agggctgcag ggggagggca ggtgggctcc 31021 cagtcccgtc cccaccccac tgaagctggg cctccctccg gctccttgag gatcccgccc 31081 cggcctctcc ctccccgcgc ccccctcctt ctcatttcag ctcctccgct caggattccc 31141 acctcttggc ccggacgccg ctcttccttc accccttgta gctcctgggg gcgcttgggg 31201 ccatgggtcc acctgggagg aggtgggagg tccccagact tgaccccgcc ccggccccac 31261 ccagactccc cgccctgccc cggaccccag cccagtcagg actcagcacg tcggagggcc 31321 ctctggcccg aggtaactga agccagagcc gctgccctcg ctggctgccg ggagctgcct 31381 cctcatcagc tcgtcccgcc ccgccctcct cccacctgcc tgctgcccgc ctgctcccgc 31441 ctgatccgcc ccggccccct gccttgccag cccgggtggg catgctgcgg ggccggggct 31501 tgggctgtgg cgcttggctt tgcctgtggc cttgggcggc cccagagctg acagctgccc 31561 cctttccaca ctccccaggc cgctgcctgc tccccaggcc caagtccctt gcgggcagct 31621 gcccctccac ccgcctgctg acgctggagg aagcccaggc acgcacccag ggccggctgg 31681 ggacgcccac ggagcccaca actcccaagg ccccggcctc acctgcggaa aggtgagtgg 31741 gatgctgggg gtggcgaggg gcaggtggag gcctggttcc tcagacggcc tcctgtttct 31801 cccccaaacc gcaggaggaa aggggagaga ggggagaagc agcggaagcc agggggcagc 31861 agctggaaga cgttctttgc actgggccgg ggccccagtg tccctcgaaa gaagcccctg 31921 ccctggctgg ggggcacccg tgccccaccg cagccttcag gtgagaggct gagccatggg 31981 ctggtgggca gcgatggtcg ctggagtgcc ctcctacctc tccctctcct gcaggcagca 32041 gacccgacac cgtcacactg agatctgcca agagcgagga gtctctgtca tcgcaggcca 32101 gcggggctgg tgagcaaggc gggcaattgg ggggcgctac ctgtgcccat gtggagccgg 32161 gaggaattgg ggccctggtt tggcctccaa attttatact tcacatttgg gggccctggg 32221 gcttttgaaa aatttaacct ggcctttatg tacaatgagt tggatacagt atcacctttg 32281 gttcatcaca cactgaactt aacttactca ttggtccatg acgggccctt taaaagtctt 32341 tattattcct gttgaacatt ttaaaataat atcatggaca cccatgaacc cagcagccag 32401 aactagagca gccccattcc ttctcccctt cgagttcctc cccactggag gcatcctctc 32461 aatttgcaga ctaccctgcc cttgctttcc aggtctgatc atacacaggt gtgtgtgtaa 32521 ggatcatact gctcagttct agccattttg gaaacttgat ttaaagggcg ttgtaagaaa 32581 gggatgccgc actgtatggg gagcttggct ttttctccat cgctacctca cagcccgccg 32641 cgctgctggg tgctgctctc cgacctctgc ggggggctgc ttatccaccc cacccagccg 32701 tgcctctggg ggctcttctg agccaccgcg ccatcctcac actcctgggg tactgccctc 32761 ccacataccc aggagcccag gtgctgtcct gctggctcag cagctgtatg tgaatctgtt 32821 ggtctcgtgc tcacagcctg tggggcagcc acagggcctt cgtgctttgg gccggaagtg 32881 tcctcttcat ggtctccact gtcaatctga acagctcttc ctggcttcac actactgtgg 32941 ccctctccag gaggcgcctc ttcatgtcct ttccccagct ttctgtgggg ctgtgcgccc 33001 tgttctcagg gatggtctca ctgaccccac ccctccaggc ctccagaggc tgcacaggct 33061 gcggcgaccc cactccagca gcgacgcttt ccctgtgggc ccagcacctg ctggctcctg 33121 cgagagcctg tcctcgtcct cctcctccga gtcctcctcc tctgagtcct cctcttcctc 33181 ctctgagtcc tcagcagctg ggctgggggc actctctggg tctccctcac accgtacctc 33241 agcctggcta gatgatggtg atgagctgga cttcagccca ccccgctgcc tggagggact 33301 ccgggggctg gactttgatc ccttaacctt ccgctgcagc agccccaccc caggggatcc 33361 cgcacctccc gccagcccag caccccccgc ccctgcctct gccttcccac ccagggtgac 33421 cccccaggcc atctcgcccc gggggcccac cagccccgcc tcgcctgctg ccctagacat 33481 ctcagagccc ctggctgtat cagtgccacc cgctgtccta gaactgctgg gggctggggg 33541 agcacctgcc tcagccaccc caacaccagc tctcagcccc ggccggagcc tgcgccccca 33601 tctcataccc ctgctgctgc gaggagccga ggccccgctg actgacgcct gccagcagga 33661 gatgtgcagc aagctccggg gagcccaggg cccactcggt gagtcctcag cctaccccac 33721 ccctgtcccc gccagctgtc actgactctg agggcctggc cccagctgaa cccctctcca 33781 ttcatttata taggtcctga tatggagtca ccactgccac cccctcccct gtctctcctg 33841 cgccctgggg gtgccccacc cccgccccct aagaacccag cacgcctcat ggccctggcc 33901 ctggctgagc gggctcagca ggtggccgag caacagagcc agcaggagtg tgggggcacc 33961 ccacctgctt cccaatcccc cttccaccgc tcgctgtctc tggaggtggg cggggagccc 34021 ctggggacct cagggagtgg gccacctccc aactccctag cacacccggg tgcctgggtc 34081 ccgggacccc caccctactt accaaggcaa caaagtgatg ggagcctgct gaggagccag 34141 cggcccatgg ggacctcaag gaggggactc cgaggccctg cccaggtcag tgcccagctc 34201 agggcaggtg gcgggggcag ggatgcgcca gaggcagcag cccagtcccc atgttctgtc 34261 ccctcacagg ttcctacccc cggcttcttc tccccagccc ccagggagtg cctgccaccc 34321 ttcctcgggg tccccaagcc aggcttgtac cccctgggcc ccccatcctt ccagcccagt 34381 tccccagccc cagtctggag gagctctctg ggcccccctg caccactcga caggggagag 34441 aacctgtact atgagatcgg ggcaagtgag gggtccccct attctggccc cacccgctcc 34501 tggagtccct ttcgctccat gccccccgac aggctcaatg cctcctacgg catgcttggc 34561 caatcacccc cactccacag gtcccccgac ttcctgctca gctacccgcc agccccctcc 34621 tgctttcccc ctgaccacct tggctactca gccccccagc accctgctcg gcgccctaca 34681 ccgcctgagc ccctctacgt caacctagct ctagggccca ggggtccctc acctgcctct 34741 tcctcctcct cttcccctcc tgcccacccc cgaagccgtt cagatcccgg tcccccagtc 34801 ccccgccttc cccagaaaca acgggcaccc tggggacccc gtacccctca tagggtgccg 34861 ggtccctggg gccctcctga gcctctcctg ctctacaggg cagccccgcc agcctacgga 34921 agggggggcg agctccaccg agggtccttg tacagaaatg gagggcaaag aggggagggg 34981 gctggtcccc caccccctta ccccactccc agctggtccc tccactctga gggccagacc 35041 cgaagctact gctgagcacc agctgggagg ggccgtcctt ccttcccttc accctcactg 35101 gatcttggcc caaccaaatc ccttgttttg tattttcttg aaccccgacc actaccccag 35161 gtttctaact ttgtaacttg cttctgatgt gggtccctaa cctataatct cagcttccct 35221 accctggact gaagggtctg cccatccccc caccaccctc catcctgggg gccctcgcac 35281 aaatctgggg tgggaggggc taggctgacc ccatcctcct ctccctccag gagcccccag 35341 catgtcctga cctgtgcacg gggatggggg gacaactcct acccttcttt ccccacatgc 35401 cccactaaac catctgacaa cattaatgaa taaaatggtg aaaatgtggc tgctgaagtc 35461 gttctggggc caaactcagt gagtgaagac agacagaggg ggcactttat cattctattt 35521 ccttttgggg aggcaccaag gcaggagtta ctcctgggag cctctgagag gtggtatctg 35581 cagagagggg aaccatgggc accctgggag tctaggtcag gagtggggct caggctgggc 35641 ttttctccaa gctgacatcc tttttgtagc tctgctatga tctcctccag caggtccatc 35701 ctggggagca gtgaggcaca ggtagaccta gggttacctc agggataaag acctagggtt 35761 acctggggtc ctcccatcca aggcctcggg tctcacttct gcaggctgga gaagagacct 35821 cctctagtca gctcagtacc gggtgcctgc aggagagaag gctggagctg ccacctggta 35881 gccatggtgc tccagcgggc ctgggagggc agatggggca caggctgagc tcggggtcct 35941 ctgcttagcc tgccctcgtg gccttccctg ccagcagcct cccagtgtgt gggaagcctc 36001 cctgtgggtc agcaggctgg ggatgggctc acagggctgg ggctccagca gaataaccct 36061 gtgtgtgcgc cgcagcagcc cctgcagggc tggaatcttg gctgagagct tcctcaaact 36121 gttccgcaga gctctgggtc cccagcaggg aggctgagtc ctagaggctg gagttggagt 36181 gcagggaagg gccctggggg aagcagggtt ggttaagggt cataggtgag tgaggggaca 36241 gttgggcgac cctaaagttc cctggtcaaa ggagaggcag ccagggaact tcgggttggt 36301 gactcctgga gggcgggcgg agggggtcac aggttagcag tgggacagtt gtgcagggct 36361 ctgggggtat ctgtactgtt gagctgggct ccaggcagca gtgcaggctg accctggctg 36421 tggggctgtg gccagaccca gggctgggtc ctgggctggg cccagagctg ctgcctagag 36481 tcaagctgct gcccaagttc caggctgcac cccttgggat cctgctgtgg tctctgacct 36541 caggagtctc aggcctgcag agaaggatgg cagagagctg gcatgggttg cccaggtgtc 36601 ccacagctcc cttggtcccc cacagccccg tcctggctcc tacctggcat gggatggggc 36661 ctggggacat cacaggggcc acaggcctgg gtgtggtctt ggctctgtga gccacttgat 36721 gtacagtttc tctgtgggga cagaccagtt ggtcagcacc tgccagcatc ccttccttgc 36781 cctctaactt ggtggggaac cctggggtgc aggggatagc gtggtccact ccattgggga 36841 gctcacctca ggagccacgt cctgtggacc ggaggctggg tgtggggaca caggcttgcc 36901 tggggtccca tggccaccac ccaccccctg ctgacctgcc ctgtacctgg gccggggcct 36961 cccgcagtag gtatcgggta ctctgcaggc tgccccagcg gtcagcaggt gctaggccta 37021 gcccagctcg aatcagggcc tggtagggtg ctggcacccg gggctccagg gccggactct 37081 cccctgcctc cagcttggcc ttcacctcag gtccccctct cccagcccag ggcagctctc 37141 ctgccaggaa gcacagttag ggacccaagc gggagggtgc cttcatgggg tggggcgcag 37201 ccactcactc accagtgaag acctcctgga tcaggatgca gaagctgtag aggtctgagg 37261 tggtggtggg catgtcaccg cagatcagct gaagtggcag ccatgggtgt agttcagggg 37321 gcgggggaag ccctgggcct gggcctcccc aggggtagcc cttctgctgc ctgtggagcc 37381 caagggccct ggtgagccga cggctgctgg acagggacct gcacccgcca ccagacagga 37441 aaactgagac cccaggtcac acagcccagc agtggtaggg ccagcaccta gctcctatgc 37501 cactcacagt gcccctctcc acccccacct cctccctggg gtctacaaga caccactgcc 37561 catggccacg gaggctggag aacagggagg aggggagcgt gcaggcccga gtcaaggggc 37621 agcaggcagg gcaggcactc acctgggccg cagccagcgc tggcgtagga ggcgcctgtg 37681 ctccaggtgg cccactttag ccaggcctgg ctgcaccagc tacacggtgt gagagctgaa 37741 gccactgtga gctcgccagt gggcctgcag gaacagcagg gcctttagca cctgctgcag 37801 cagggggccg gggcggcagg cccagcacag tcccaggtgc ctcctcactt ggtctcggtg 37861 ggtgcagcac cccctgcagg gagcccagcc acacaggctc aaagagaagg cacagccccg 37921 acagatctgc agagggactc agtgccatca gcagcaacag gccagggtgg tgcagcttgc 37981 tgggcaggag gacagcaggc cactcacgag caggccccag ccacacaggg tgggagatgc 38041 tcctgggccc agactgccca ccacgtgcca gggcaggcaa tctgcacctg ccatcagaca 38101 gaggaaagac tgatcccagg tcacgcagcc cagcagtggg agggccagca cccaccgcct 38161 acctcatcct ggtgccttcc acacccccac ctcctctcag gcctacaggg cattgcacac 38221 acaactggta cccacccact ttccagtggg gggatcgccc cagcagaagg gctccatgcc 38281 gggtgctggc ccagaagtga agattctggc catgactacg gccacacacc cttgtcttcc 38341 gtgggagggc caacaggagg cggggctggg ggctggcaca ggtacctgca gtgctgaagg 38401 tcagccagta gcacatctgt ctgggttcca ggggccttcg gctgctgcac agtcattttg 38461 tgacccatcc acaggaggct ggagaatagg ggcggggcag tacaggcctg agacgagggg 38521 gctttccctg ggaggggagc aggagagctg gcaagggcac agcggggctc tcacttggcc 38581 atgagggtgg ggagctgctc tcataggtgc agtcaggctc tccctgggct gccagcagct 38641 ccttggggtc cacaaggggg atgcccgtta ccagccccgg tggccacagg ctactcagct 38701 agggacaggc aggatcatga ggcacgaaga agacaggggt ggttcccact cctacagcca 38761 ctcctgctta cctgaccaaa ccccaaggcc gggacttggg aggctctcct gatctgtgct 38821 agcctcagca tcctggggtg ggggcaggcc gagggtaagt ctggggtggc tgggaatggt 38881 tgggcgggtg ggcttcagga tgtggacggt tacctgtctg cctgcatcag cctcagggag 38941 gacagggagc caggcgggct gttcccaaat ctggcctgca actggtccag agatgccatg 39001 ggtggcaact caccgtcatg cacgaacgct gatatgtggg cccggcagag ctgcaacaac 39061 tccagcaact ggaggagggc acgtagggtg aggcctgatg ggtgccggcc ctgatccctc 39121 actcccacca gggccactgc tcacctccca gttctgcttg gccccaccct gctcagccca 39181 gtcttgtggt gtgcgacccc gctggtcatg cagttgcaag tcaccccctg cctgcagcag 39241 gggcaccatc actaaagaag gcgcctgcag gcacgggggt gctgccatcc aggcagcggt 39301 tagagcagaa caacgcggtg ggagtcgatt agggaaggga ctgaacaggg acaaaaaggg 39361 gcaggatggg ggccctgggg gatgggagga gtcaccatgt catgggaggt gaatatgagg 39421 atgggagggg tcacatgagg gcaggatggg gtctttaggg aatggaggat c // LOCUS AC002381 172533 bp DNA PRI 23-JUL-1997 DEFINITION Human BAC clone RG020D02 from 7q22, complete sequence. ACCESSION AC002381 NID g2275186 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 172533) AUTHORS Gattung,S. TITLE The sequence of H. sapiens BAC clone RG020D02 JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 172533) AUTHORS Waterston,R. TITLE Direct Submission JOURNAL Submitted (23-JUL-1997) COMMENT SUBMITTED BY: Genome Sequencing Center Department of Genetics Washington University St. Louis MO 63108, USA http://genome.wustl.edu/gsc mailto:sapiens@watson.wustl.edu NOTICE: This sequence may not represent the entire insert of this clone. It may be shorter because we only sequence overlapping clone sections once, or longer because we provide a small overlap between neighboring data submissions. This sequence was finished as follows unless otherwise noted: all regions were double stranded or sequenced with an alternate chemistry; an attempt was made to resolve all sequencing problems, such as compressions and repeats; all regions were covered by sequence from more than one subclone; and the assembly was confirmed by restriction digest. MAPPING INFORMATION: The sequence of this clone was established as part of a mapping and sequencing collaboration between the NHGRI Chromosome 7 Mapping Project (Eric D. Green, Director), John D. McPherson in the Department of Genetics (Washington University), and the Washington University Genome Sequencing Center. For additional information about the map position of this sequence, see http://www.nhgri.nih.gov/DIR/GTB/CHR7 or send an E-mail to egreen@nhgri.nih.gov SOURCE INFORMATION: This clone is from the first release of the human BAC library. The library contains cloned DNA from a human male fibroblast cell line 978SK. See: Shizuya et al., Proc. Natl. Acad. Sci. USA 89:8794-7 (1992); Kim et al., Genomics 34:213-8 (1996). The clone is available from Research Genetics, Inc. (http://www.resgen.com). VECTOR: pBeloBAC11 Selection: chloramphenicol NEIGHBORING SEQUENCE INFORMATION: The actual start of this clone is at base position 1 of RG020D02; actual end is at 172533 of RG020D02. The orientation of this clone is unknown. This clone contains STS sWSS1826 (NID:g1113234) and sWSS869 (NID:g23340). FEATURES Location/Qualifiers source 1..172533 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" /clone="RG020D02" /clone_lib="CITB-978SK-B" /map="7q22" repeat_region complement(1060..1352) /rpt_family="ALU" misc_feature complement(1353..1438) /note="match to human EST AA225630 (NID:g1846956) nc08a05.r1" misc_feature complement(1353..1438) /note="match to human EST AA225239 (NID:g1846547) nc22d09.r1" misc_feature 1353..1431 /note="match to human EST AA225171 (NID:g1846460) nc22d09.s1" misc_feature 1353..1420 /note="match to human EST AA225498 (NID:g1846862) nc08a05.s1" repeat_region 3499..3596 /rpt_family="L1" repeat_region complement(3803..4141) /rpt_family="MER" repeat_region 4164..4203 /rpt_family="L1" repeat_region 4418..4709 /rpt_family="ALU" repeat_region 4771..4797 /rpt_family="L1" repeat_region complement(4806..5036) /rpt_family="ALU" repeat_region complement(5050..5182) /rpt_family="ALU" repeat_region 5249..5281 /rpt_family="L1" repeat_region 5317..5493 /rpt_family="L1" repeat_region 7526..7555 /rpt_family="L1" repeat_region complement(8884..9083) /rpt_family="L1" misc_feature 9114..9199 /note="match to human EST R05512 (NID:g756132) ye92e05.s1" misc_feature complement(9688..9980) /note="match to human EST R05619 (NID:g756239) ye92e05.r1" repeat_region 12598..12886 /rpt_family="ALU" repeat_region 14708..14768 /rpt_family="L1" repeat_region complement(15481..15539) /rpt_family="L1" misc_feature complement(15834..15947) /note="match to human EST AA094108 (NID:g1639701)" repeat_region complement(15948..16239) /rpt_family="ALU" misc_feature complement(18000..18131) /note="match to human EST AA206526 (NID:g1801907) zq58c01.r1" misc_feature complement(18000..18131) /note="match to human EST AA188905 (NID:g1775949) zp79f02.r1" repeat_region complement(18508..18770) /rpt_family="ALU" repeat_region 19013..19068 /rpt_family="L1" repeat_region complement(20234..20523) /rpt_family="ALU" repeat_region 21100..21124 /rpt_family="L1" repeat_region 21172..21542 /rpt_family="L1" repeat_region complement(21174..21536) /rpt_family="L1" repeat_region 21731..21764 /rpt_family="L1" repeat_region 23854..23898 /rpt_family="L1" repeat_region complement(24280..24523) /rpt_family="ALU" repeat_region 24969..24996 /rpt_family="L1" repeat_region complement(25073..25365) /rpt_family="ALU" repeat_region complement(25967..26258) /rpt_family="ALU" repeat_region 26790..27002 /rpt_family="L1" repeat_region 27007..27161 /rpt_family="ALU" repeat_region 27171..27273 /rpt_family="ALU" repeat_region 27593..27650 /rpt_family="L1" repeat_region 27831..28035 /rpt_family="ALU" repeat_region 28177..28205 /rpt_family="L1" repeat_region 28909..28986 /rpt_family="L1" repeat_region 29056..29327 /rpt_family="L1" repeat_region 29372..29429 /rpt_family="L1" repeat_region 29489..29780 /rpt_family="ALU" repeat_region 31289..31368 /rpt_family="THE" repeat_region 31384..31694 /rpt_family="THE" repeat_region complement(34785..34956) /rpt_family="L1" repeat_region complement(34958..35069) /rpt_family="ALU" repeat_region complement(35451..36208) /rpt_family="L1" repeat_region 36175..36203 /rpt_family="L1" repeat_region 36479..36545 /rpt_family="L1" repeat_region complement(37143..37236) /rpt_family="ALU" repeat_region complement(37259..37450) /rpt_family="ALU" repeat_region 37995..38049 /rpt_family="L1" repeat_region 38292..39976 /rpt_family="L1" repeat_region complement(38934..39336) /rpt_family="L1" repeat_region 40082..40185 /rpt_family="L1" repeat_region 40187..40477 /rpt_family="ALU" repeat_region 40478..40789 /rpt_family="L1" repeat_region 40869..41156 /rpt_family="ALU" repeat_region 41157..42135 /rpt_family="L1" repeat_region complement(42165..42184) /rpt_family="L1" repeat_region complement(45154..45252) /rpt_family="L1" repeat_region 46470..46763 /rpt_family="ALU" repeat_region 46973..47017 /rpt_family="L1" repeat_region 48559..49039 /rpt_family="L1" misc_feature complement(49105..49541) /note="match to human EST C18909 (NID:g1580511)" misc_feature complement(49221..49541) /note="match to human EST C17953 (NID:g1579555)" repeat_region 50404..50587 /rpt_family="ALU" repeat_region 50615..50708 /rpt_family="ALU" repeat_region 53373..53663 /rpt_family="ALU" repeat_region 54526..54870 /rpt_family="L1" repeat_region complement(56243..56271) /rpt_family="L1" repeat_region complement(57058..57478) /rpt_family="MER" repeat_region 58118..58261 /rpt_family="L1" repeat_region 58855..58891 /rpt_family="L1" repeat_region complement(58882..58899) /rpt_family="L1" repeat_region complement(61299..61596) /rpt_family="ALU" repeat_region complement(62935..63227) /rpt_family="ALU" repeat_region 63470..63566 /rpt_family="L1" repeat_region 63788..63807 /rpt_family="L1" repeat_region 63899..63921 /rpt_family="L1" repeat_region 64072..64353 /rpt_family="ALU" repeat_region complement(64369..64462) /rpt_family="L1" repeat_region 64369..64475 /rpt_family="L1" repeat_region 64876..64899 /rpt_family="L1" repeat_region 65013..65303 /rpt_family="ALU" repeat_region 65377..65618 /rpt_family="L1" repeat_region 65619..65900 /rpt_family="ALU" repeat_region 66077..66343 /rpt_family="L1" repeat_region 66384..66417 /rpt_family="L1" repeat_region 66426..66912 /rpt_family="L1" repeat_region 70590..70777 /rpt_family="L1" repeat_region 70867..71084 /rpt_family="L1" repeat_region 71617..71654 /rpt_family="L1" repeat_region complement(72862..73149) /rpt_family="ALU" repeat_region complement(74469..74522) /rpt_family="L1" misc_feature 77074..77208 /note="match to human EST Z36242 (NID:g530303)" repeat_region 77209..77250 /rpt_family="L1" misc_feature 77251..77428 /note="match to human EST Z36242 (NID:g530303)" gene 79559..80860 /gene="GPR22" CDS 79559..80860 /gene="GPR22" /note="RG020D02.1; match to mRNA U66581 (NID:g1753106) and protein U66581 (PID:g1753107)" /codon_start=1 /product="putative G protein-coupled receptor" /db_xref="PID:g2275187" /translation="MCFSPILEINMQSESNITVRDDIDDINTNMYQPLSYPLSFQVSL TGFLMLEIVLGLGSNLTVLVLYCMKSNLINSVSNIITMNLHVLDVIICVGCIPLTIVI LLLSLESNTALICCFHEACVSFASVSTAINVFAITLDRYDISVKPANRILTMGRAVML MISIWIFSFFSFLIPFIEVNFFSLQSGNTWENKTLLCVSTNEYYTELGMYYHLLVQIP IFFFTVVVMLITYTKILQALNIRIGTRFSTGQKKKARKKKTISLTTQHEATDMSQSSG GRNVVFGVRTSVSVIIALRRAVKRHRERRERQKRVFRMSLLIISTFLLCWTPISVLNT TILCLGPSDLLVKLRLCFLVMAYGTTIFHPLLYAFTRQKFQKVLKSKMKKRVVSIVEA DPLPNNAVIHNSWIDPKRNKKITFEDSEIREKCLVPQVVTD" misc_feature 80073..80418 /gene="GPR22" /note="match to human EST R59799 (NID:g830494) yh07h07.r1" misc_feature 80158..80504 /gene="GPR22" /note="match to human EST Z18870 (NID:g30677)" misc_feature 80316..80575 /gene="GPR22" /note="match to human EST R58357 (NID:g828415)" misc_feature complement(80825..81173) /note="match to human EST R61341 (NID:g832036) yh07h07.s1" repeat_region 81686..81740 /rpt_family="L1" repeat_region complement(87641..87943) /rpt_family="ALU" repeat_region 88922..89215 /rpt_family="ALU" repeat_region complement(89581..89712) /rpt_family="ALU" repeat_region complement(90833..90853) /rpt_family="L1" repeat_region 92588..92612 /rpt_family="L1" repeat_region complement(92864..92997) /rpt_family="ALU" repeat_region 94061..94350 /rpt_family="ALU" repeat_region 95120..95158 /rpt_family="L1" repeat_region complement(95713..95827) /rpt_family="ALU" repeat_region 97454..97490 /rpt_family="L1" repeat_region 97638..97660 /rpt_family="L1" repeat_region 100230..100280 /rpt_family="L1" repeat_region complement(101429..101577) /rpt_family="ALU" repeat_region 101578..101873 /rpt_family="L1" repeat_region 101875..102165 /rpt_family="ALU" repeat_region 102166..102425 /rpt_family="L1" repeat_region 102426..102722 /rpt_family="ALU" repeat_region 102769..104828 /rpt_family="L1" repeat_region complement(103394..103802) /rpt_family="L1" repeat_region complement(104912..105200) /rpt_family="ALU" repeat_region 106289..106322 /rpt_family="L1" repeat_region 106812..106865 /rpt_family="L1" repeat_region complement(107515..110460) /rpt_family="L1" repeat_region complement(112782..112801) /rpt_family="L1" repeat_region complement(112802..113103) /rpt_family="ALU" repeat_region 113406..119563 /rpt_family="L1" repeat_region complement(117964..117991) /rpt_family="L1" repeat_region complement(118151..118569) /rpt_family="L1" repeat_region complement(121699..122051) /rpt_family="L1" repeat_region 122061..123200 /rpt_family="L1" repeat_region complement(123204..123493) /rpt_family="ALU" repeat_region 123494..127684 /rpt_family="L1" repeat_region 127685..127976 /rpt_family="ALU" repeat_region 127978..128145 /rpt_family="L1" repeat_region complement(128165..128903) /rpt_family="L1" repeat_region 128904..129029 /rpt_family="L1" repeat_region 129031..129323 /rpt_family="ALU" repeat_region 129333..129645 /rpt_family="L1" repeat_region complement(130436..130722) /rpt_family="ALU" repeat_region complement(131055..131232) /rpt_family="ALU" repeat_region complement(131289..131382) /rpt_family="ALU" repeat_region 132268..132372 /rpt_family="L1" repeat_region 132711..132737 /rpt_family="L1" repeat_region 132911..133029 /rpt_family="L1" repeat_region complement(133565..133595) /rpt_family="L1" repeat_region complement(133597..133880) /rpt_family="ALU" repeat_region 134519..134546 /rpt_family="L1" repeat_region 135818..136109 /rpt_family="ALU" repeat_region complement(137163..137191) /rpt_family="L1" repeat_region complement(137565..137854) /rpt_family="ALU" repeat_region 138009..138074 /rpt_family="L1" repeat_region 138485..143399 /rpt_family="L1" repeat_region complement(142181..142207) /rpt_family="L1" repeat_region 144103..144442 /rpt_family="L1" repeat_region 144499..144733 /rpt_family="ALU" repeat_region 144882..145118 /rpt_family="L1" repeat_region 145156..145235 /rpt_family="L1" repeat_region 145797..146085 /rpt_family="ALU" repeat_region 146094..146124 /rpt_family="L1" repeat_region 146367..146658 /rpt_family="ALU" repeat_region 146714..147004 /rpt_family="ALU" repeat_region 147005..147073 /rpt_family="L1" repeat_region 147149..147437 /rpt_family="ALU" repeat_region 147803..148038 /rpt_family="L1" repeat_region 148186..148479 /rpt_family="ALU" repeat_region 150569..150697 /rpt_family="L1" repeat_region 150924..150995 /rpt_family="L1" repeat_region 150996..151287 /rpt_family="ALU" repeat_region 151288..151317 /rpt_family="L1" repeat_region 151336..151933 /rpt_family="L1" repeat_region complement(151942..152097) /rpt_family="ALU" repeat_region 152099..153428 /rpt_family="L1" misc_feature complement(153429..153730) /note="match to human EST T78744 (NID:g697253) yd01e08.r1" misc_feature complement(153607..153681) /note="match to human EST AA206526 (NID:g1801907) zq58c01.r1" gene complement(<153609..169487) /gene="WUGSC:RG020D02.2" CDS complement(join(<153609..153678,153776..153830, 159783..159840,163474..163613,169301..169487)) /gene="WUGSC:RG020D02.2" /note="match to human EST AA206526 (NID:g1801907)" /codon_start=1 /evidence=not_experimental /db_xref="PID:g2275188" /translation="MGWVGGRRRDSASPPGRSRSAADDINPAPANMEGGGGSVAVAGL GARGSGAAAATVRELLQDGCYSDFLNEDFDVKTYTSQSIHQAVIAEQLAKLAQGISQL DRELHLQVVARHEDLLAQATGIESLEGVLQMMQTRIGALQGAVDRIKAKIVEPYNKIV ARTAQLARLQ" misc_feature complement(153775..153832) /gene="WUGSC:RG020D02.2" /note="match to human EST AA206526 (NID:g1801907) zq58c01.r1" repeat_region 154038..154220 /rpt_family="L1" repeat_region complement(154899..155199) /rpt_family="ALU" repeat_region 155409..155443 /rpt_family="L1" repeat_region complement(156212..156396) /rpt_family="ALU" repeat_region complement(156549..156757) /rpt_family="ALU" repeat_region complement(157117..157413) /rpt_family="ALU" repeat_region 157749..158750 /rpt_family="L1" misc_feature complement(159781..159843) /gene="WUGSC:RG020D02.2" /note="match to human EST AA206526 (NID:g1801907) zq58c01.r1" repeat_region 161228..161521 /rpt_family="ALU" repeat_region complement(162276..162566) /rpt_family="ALU" misc_feature complement(163472..163614) /gene="WUGSC:RG020D02.2" /note="match to human EST AA206526 (NID:g1801907) zq58c01.r1" repeat_region 163814..164395 /rpt_family="ALU" repeat_region complement(164507..164533) /rpt_family="L1" repeat_region 164511..164542 /rpt_family="L1" repeat_region 166852..166878 /rpt_family="L1" repeat_region 167193..167483 /rpt_family="ALU" misc_feature complement(169300..169391) /gene="WUGSC:RG020D02.2" /note="match to human EST AA206526 (NID:g1801907) zq58c01.r1" repeat_region complement(171031..171309) /rpt_family="ALU" BASE COUNT 60693 a 32920 c 30394 g 48526 t ORIGIN 1 aagcttgcca aaacaggaac actgtaacca aagcaactgc tggcatagct aatggaacag 61 agatgtagca agactgctat aaagattgtt tcatgaaata gcaagcaata ttttagttta 121 caaagccata ggctaatatg agataacctt tgtgagcata aatacagcat taaatcaagt 181 gtgacaggat ttttaaaact ttatataatt gaaattagaa ataaaaagtg acagaatttg 241 atgagaaata aaatgcaaaa aatgacataa atatgcaagt ccaggggccc acagtggata 301 ataccattaa cattgaaact ctgaaaggaa atccttgtat catgaggact aacaatcaca 361 ttttactttt tcctctaata aaatattctt tagaagttaa attctttttt aagtgtttcg 421 tctataaaag aagggcaaca aattgaggga aggaaaagga caaatacaga atacaaattt 481 agctcatgtt gctctgctgt tatagtcatg aaattctgca aagtgactca gttcacaaaa 541 ctctgactta tgataaagac ttagttgagt ccaactgatt ttaactggaa aaccacacag 601 agatttaaaa attttacttc caaaataatt tattagcatg gatgtttcaa aactaaaata 661 tttcttcaaa gtttaataag tgcaccattt cacactcaaa atgaaaaata agaaacccca 721 gaatcacatt aaaaagacgg ggaaagggtg gggtggaaag aaaagtgggg agaaagagag 781 ggagaatgaa gatagggagg gagaagtaaa gggtggtggg aagtagaaag aaagaaagaa 841 atgatcaaca cattcccaac tcactgtgtt ttcccaaaga cagaggttat agtcagaaag 901 atgcaaagtc aactcaagca aaaattcaaa gtagctatct ttcactgtat agacaatcct 961 aactactaaa aagcgataca gcttttttaa atagtgtgta aagggtacaa atcaaatcgt 1021 ctttaccatg aagcagatct tttttttttt tttttttttt tttttttttg agaaggaatt 1081 ttactcttgt tgcccaggct ggagtacagt ggcaagacct cggcttactg caacctccgc 1141 ctcctaggtt caagcaattc tcctgcctca gcctcccaag tagctgggat tacaggcatg 1201 tgccaccacg cctggctaat ctttgtattt ttagtagaga tagggttttt ctatgttgat 1261 caggctggtc ttgaactccc aacctcaggt gatccaccca ccttggcctc ccaaagtgct 1321 aggattacag gcgtgagcca ctgcgcccag acaaagcaga taattcttta aaccactccc 1381 cctgggtaga atatgtattt aataaaatta ggccgatttt caggtaaact taagacaata 1441 aaaagaacag taccttaatc atgtacaatc attgagaaat tataaaatcc taatatttat 1501 gaagaacact taataaatcc ataacaccat acatgtgaca actatataac aggacaaaat 1561 atttaaggca gcttataata tctgtgtgca tatgctccat tatgttacac acacgtaaca 1621 ctttttctgg agataacaca tgtctagaag actgaaaact taatggagct ccatgggaga 1681 ttacaagaat taaaggaaaa gagaatcaca ttaaatacca taatttaaaa gcaagaatta 1741 tattagctac cttagttgtg attcagagct tagcagtagc tttgctgatt cagtaacatg 1801 caacccatta atgcttgatg tgtacttctc agctaaccta atataattat ttcagtacct 1861 gttaacaatc aacagtcaac agtacataat acaaataaaa ttaagatact aaaaaggaaa 1921 tttaatagta caatatgatc tcatacacca gaaatcaact ttattctagc ctttattttg 1981 aagagaggca gaaaaagaaa aagattatat agaaaggcca ggggggtgag gagtgggggg 2041 tgggaaatga gagaaaaatg acaagaggta gaatgaagat gaacatggat tagaagagga 2101 aagttctaca gaaagtgata caccgagtgc ctaaaaaaat gaggaagaga gtataaaaat 2161 gatcaatatg gagaagaaga gctattttaa tttctcaatt tttaacaaaa tatctgaaga 2221 agacattcta acaagtcata gtttggcatt aaagttttta aataatgtca atacacagaa 2281 taacatattc caagtcatat tagataatag taattaatat taaaaggaaa ctattgaaaa 2341 tgtttattca acaataaaat gaattttata ggaaaatgga ataatgacaa attaatgtgg 2401 caagccataa aaacatagct tatttttaaa tataataaaa tgtaaagtca ttctgtgctt 2461 taccaaagat atttatggca agcattatgt attagctatt gcccaaaact actttcttct 2521 tttttcttga taaaaaccta attttctcag gcaatggctg ataagtagcc tgagccgatg 2581 ttgacaattc tgttcctctt tgccagacac ttgttttcac agtcttcttg atagctaggc 2641 tgggcatgtg actcaaatct ggccagtgag acataaggac atgttggctt aggcaatttg 2701 gggaaggata ttataacaac accttttgac tcacaactgg ctctgcccca cctctgtacc 2761 ttgaacacag acataatggc taccggcatg acagctatct tgtgaccatg atgggacaag 2821 caaaagcaca ctaaggatga tggaatggaa aaaagaaagg cctgaatcct tgttgaatag 2881 ttgaaacaat gttagcacca cctacaccca gtcaaatctg ttggtgtaag ccactattaa 2941 tctagcttgc tgttaacagt tcatacagcc tcatccagca ccatcattga ttgaaagaca 3001 atatataaaa aaatacatgg gcccataaca ataattttta acagatgaaa aacaaaagaa 3061 taaaatcaca tctcatctgc tacctttggt ggtgactact gtatcaactc attactctat 3121 aaagtggtaa tgacagtgaa atttctaagt atttatatat cattttcaaa agaaactgca 3181 gttcagggta accaaatatg cacagttgat gagttaaagt tctttacaaa ataatcgcag 3241 ctagtaaaca caaggaataa caaaaataga aaatcacact ttcataatcc ctaagtaatt 3301 taagaaaaag atttaggcag aaatcgttga tattaaaacc attatgggaa agactgacag 3361 gaaattaaat aacctcaaaa cgcaaaggta tccacaaatt acaatgaagg attaaatgaa 3421 agaaaaaaaa aggtcttttt caacaaaagg tgctcaaaaa actgactttc tctatggaaa 3481 aaaaataatt ttgcactgta cataaaaatt aaaatgtaaa attctaaaac ctctaaaaga 3541 aaacatgaga caaaaacttt gtaaccttag gtaggcaaaa atttctttca taatatacaa 3601 aagaaaacat ggataaacta gactttatca aaacaaaaaa ttttgctttt caaaaagaca 3661 atatcaaggg aggaaaagac agcattattc ataataaaca aaagcagtct accaacttat 3721 caagacacaa gcaaaatgta tatatccata gaactgaata ttattcatca atgaaaaaca 3781 atggactatt tatacataca atcaggagtt cccaacgccc ctgggctgct gaccagtaag 3841 agtcggtggc tgttaggaac tgggccacac agcaggaggt gagcggcagg gcagtgagca 3901 ttataacctg agctctgcct cctgtcagat cagcagcagc gttcgattct cataggagtg 3961 cgaaccctat tgctaactgc acacgcgagg gacataggtt gtgtgctcct tatgagaatc 4021 taatgctgat gatctgaggc gaacagttca tcccaaaacc accacctctg cccccacatg 4081 tgtggaaaaa ctgtcttcca tgaaaccagt ccctggtgtc aaaaaggctg gggatcactg 4141 catacaacaa aatggatcat tctaaataaa agattgtgct aagtgaagaa gctagataca 4201 caagattacc tgtcatatga ctgtatttat ataaaatttt cagaaaaaaa ctaagaagac 4261 agaaattata atctatcaaa attagtaatt gcctgggttt gggggtagga ggaggaactg 4321 actacaaaca agcatgaagg aacttttgta tgatgatggc aatattccaa aactggttat 4381 gagatggttg cacatctgta taaatttact aaaacttggc tgggcaaggt ggctcacacc 4441 tgtaatccca gcactttggg aggccgaagc agggagatca cctgaggtca ggagttcaag 4501 accaaccctg ccaacatagt aaaaccacgt ctctactaaa aatataaaaa ttagctaggt 4561 gtggtggcat gtgcctgtag ccccagctac tcggaagtct gaggcaggaa aattgcttga 4621 ttccaggaga tggaggttgc agtgagtcga gatcgtgcca ctgcattcca acctgggtga 4681 cagagcgaga ctctgtctga aaaaaaaagt taaaataaat aaataagttt actaaaactc 4741 atcaaactat acattaaaag tgcatgataa cagaaaaaaa ggaaagacac agaagaaaat 4801 atttatgata tggtttggca gtgaccccac ccaaatctca tcttgaattg taatcctcat 4861 actccccatg tgtagaggga ggaacctggt gggaggtgat tggatcatgg gggtggtttt 4921 cccacttgct gttctcatga tagtgagtga gttttcacca gacttgatgg ttttataatg 4981 ggctctttcc ccttagctcc acacacactc ctctctcttg cctgccacca tgtaagacat 5041 gcctcatccc cttctgccat gattatacgt ttcctgaggc ctccccagtt ctgcggaaat 5101 atgagtcaat taaacctctt ttctttataa attacccagt cgtgggtacg tctttatagc 5161 agtgtgaaaa ccgactaatt caatttacaa tgcaaaattt gatagaagac ttacatccag 5221 aatgtaataa aaattcttac aactcagtaa tgagaagaca acaaaccctt caaaaattga 5281 acaaaaacct tcactattaa gcagttaaaa agtttgtcac atcagtagtt attaagaaaa 5341 tgcacataaa aagtacgaga acctcctaca cacccactaa aatggctaaa attaaaaaga 5401 ctgctaacat caaatgttgg caacaatctg gaggaaccag aaccctcaac tacggatgtg 5461 gaaatgtaaa ctaatagaac cacttcagaa aactggtatt ctctttaaaa agtaaacata 5521 atagtctttg caaaaatcta accttttttt tttgccttag gtattaacct aacaaatgaa 5581 aatataagct gaaactaaga cttgtacttg agtgttaatg cagctttatt tacaatagtc 5641 ttaaagtaga aataacctaa atgctcatca ataggtgaac gtataaagat acttccataa 5701 gtaatgacat aataaaagag gaacaaatta tcaatgcaca taacatgcat gaatctcaaa 5761 ataattatgt tgaatgtata aagccagata agaggaagag cacactgtat aactctaatt 5821 atgtaaaacc atggaaatgc aaattaatct ataaagacaa aaagttgctg cctgtaattt 5881 ttctgaatta caaaataaaa ttctagtatt tactactata ataacactcc agcacaagag 5941 tgtttaaatt aagtaaaatc atatagattt atacaagtac aaacacaatt ttgagaaatt 6001 tttgagaagc agaaagcaat tacacataga ctagagtgac taccaaattt taatcaaaga 6061 gcagtaacag aattctatat gaaatacaat aaatttagtc cgttattgaa aaattattca 6121 catgaataac tgaaatgaaa tgcattcagt gaactggaaa actatgggaa gtaataacac 6181 caaaataccc taagtctaaa agcagacagt taattagatg taagtccaca ctgcaagatg 6241 gcaataaaat aaaataataa agtagccaat gttattgtgg attgttcaca acaagagaat 6301 ttgtcaaaga atatgaaaat aacatgactg catatagaaa ctagaataaa agtcatactt 6361 tacatttggt gtaaaacaaa agcatacaga aatgccgtat ggtaattgac acatcgtaaa 6421 agaggaaagg gtagacagag agaaaaagag taggagaaaa gaaaagatga aagtatctgc 6481 caagaaaatc aattatttaa agcagggatc agcatatttt ttctttaaag tgcaatacag 6541 caaatagttt aggctttgca cactacgtat gctgtttgtt tttgcttttt tccaaccatt 6601 taaaaatata aaaataattc ttagcactgc ggcaacataa aaaaacaggc cacgggctgt 6661 agtttaccaa cccatgattt tgggtgttat gtaagagttc aaattaattt ccattcaaat 6721 ttaaggataa aataaaaagg acatgtcaca caactaacat taagctttag agaaaaaaaa 6781 tttaaatgat tagttttaaa atatcctttt aaattataaa agctatgacg gtacatatta 6841 tgtaactatt attaataatg atatacaatg ttatctctaa aaagaaattg ccgaagagta 6901 acttttaaaa ttattattta tacatgtcat aaagtttatt taattgaata ccaatgtact 6961 ccacatcagg tctccattaa atctaatgaa agctatagtg atgggaacaa acagacaaat 7021 atatggtgaa acaacagcaa attatcattt gaaggtcata aaggaaatac gcgagctctt 7081 actacattca ttctttcaag ttgtttactg tgtgtcaagg aacaagtttt gaacaagagg 7141 ggaaaaatta ctgaaactct agttagtggt aaaaacaata ctgggaccaa atatacaaag 7201 gagtttacac aggcccaaat gatgggctct gttgtataat ccatgagaca agtcagaaaa 7261 acagctaaat atataaaagc tgtacctttt aaagctaggc aaatggtgtt tgttgggtca 7321 taagggaagg aaaacatccc aaaacaattg cttgctgcct gcagaactac tatcgcaaac 7381 tacaaaagga atcactgtgt tcaactttaa gaggaccttc tttttgaagc ttaagataat 7441 gcttaacttc catagagctc actgaacaca atgcatttga actagtatcc ttgctgtacc 7501 agaaataatt cagaattttt aactgcaaaa gtgaaaagag ctcataatat caaaatttct 7561 tactgtatac tatatagata tttatctact gaacttcaat tctacattat tttcttagag 7621 tagctaggtc taggtctagg ttctctagag ctggcttctc taaggttgtc atggtacttt 7681 tggctgacag ttgataagca ccaggaccct aggcttgacc aagaaggtct tctgctttag 7741 ggaaaaggca agttcatgaa tacttagtta ctatcactag caacaattac cacagggtcg 7801 agattatagc caggtaccaa gctctgaaag aagtgaacac agaaatcctc tttcctgatg 7861 acctctaaag acaagtaaac acattatctt cctccacctt gaacctccag aaagtctcat 7921 acttgtttac tgggaaagct tctaattcac aaggtaaagt catcttccta attaaaatgt 7981 catcatttta ctgtggggga ctggtatcta tggtgacgca ggaactgatc cagatcacaa 8041 tttctaccac ctatttttga aagctagtta aagcctgagg taatggagtc tgttgtaatc 8101 ctgaaaaaat aaataaacaa ataatgtcct ctccctttct ttgtgtgact tctgtcagtg 8161 aaagattgta tggtgagtag aaaagagaga aatgggaacc aggaagagct gcaggtggat 8221 ggcatatcta gtcgcatgct gtgacagact gcctatagag ttgaaaaatg ttttttctaa 8281 tatgatcata ttctgtgcat ccccagaaga tgtgctctac atcaccagtc cttccaaatt 8341 tttccaaaga aaatggtagt gatatcacta taaatctttc cgatacagaa gtaataagac 8401 ttttgtaagg tcagaagttt atttatgtat ttttcgtgta gaaagaaaca atagctcctc 8461 catatctaaa cagtcattta gtttataaat ctgaaaatag acattttgca atatgatagt 8521 attttataat actacaaagc tatttaaaac atacaaataa atacaaaaca cttaggcaat 8581 taactaggaa aaaatagtga tgaaagaggt tatttgcatt tactaaaaca attttggttt 8641 ggaaaggaat gctaaaaaaa gaaattacac taactttctt ctaataaata gttgcttcag 8701 ataacaaagg aatagctatt atattagtct gttctggctg ctgtaacaaa ataccataga 8761 ctgggcttat aaacaagaga aatttatttt ttgcagttct ggagactggg aagtccaagt 8821 tcaaggtgcc agcagtttgg gtatctgata agggctcatg ttcctagttc acagaaagac 8881 accctttttt tttttttaaa ttatacttta agttctggga tacatgtgta gaatgtgctg 8941 gtttgttata taggtataca cgtgccatgg tggttttgcc gcacccatca acccgtcacc 9001 tacattacct atttctccta atggtatccc tcccctagcc ccctaacccc aacaggcccc 9061 ggtgtgtgat gttcccttcc ctgacaacgg acaccttttt aatatgatcc cacatggcaa 9121 aaggataagc agctctctca ggctctgagt aaaatgaggg gtagaaggag aggggaaggt 9181 agatagtaag tgcaaaaagt cttttataag ggcactaatc ccattcaaga gagtcccacc 9241 ttcatgacct gtttacctcc caaaggccga cctccaaata acataatttt ggcggttagg 9301 atttcaacat aaggagaatt taaggaggat gcattgggaa catagcagcc ataaaaatga 9361 agtgacaatt tatctgcaaa ttacagttag gtgactctat tattattatt attattatta 9421 ctatttttat gattttagta agggcagaat tcaatgaata gaaacatcac ccatttcttt 9481 aacatgattc aacagaaagt aacaatttag tttttaacaa aagatctcat tgaattatca 9541 gtcaataacc aaaaatctaa aaatcactga agacaatgaa gatggagaat ttttagtgtt 9601 attaaatact actaatttca caactggtaa agatagcact aaagactcat gctattttaa 9661 cttttacaat gaactgatat tgaaaatgat gggcatcaac tacatactca ataagagagg 9721 atcatcaaca cttttgagga atcaaattaa tgtcagaaat taattattaa atgtaactga 9781 taatatgatg taatttttca aaagttcctt gtttactgaa catctattca ggctaagagc 9841 ctataaacta agtgcgtaat cattcatgaa tgcaatagta cttcaaagtt aaaatactgt 9901 aattctttta gtctatattt ataataaaca ttatacattt attttaatct ctgatacatg 9961 cgagaatata atttgaggat aaaatacata taagctgtag catctgttct aacagggagc 10021 gtgaagcata ttccaggcat aaaaagagta ccaaggccct aaagaagaaa ggaaaatggg 10081 atgtgaacga ctaatagaaa gcttgtgaga ctgaaaagca gagaacatat gaggagagta 10141 gcataaaatg tgcacgttta aatgttgatg agaaggagac agtataaggg aaatgaggga 10201 tggaaatgtc aacaagataa ggttctaaga aaataaaaaa taggatctag agcagaggtg 10261 gtgtgagagg actctcagtg aagcaggaat taaggacatc tgctcagagt aaaatggggg 10321 tatggcaaga gaagatggtt gatggaaagt aaagagagcg taaaacagta atttcagaag 10381 atgaaaaaaa aaaaaaaaaa aaaaaacagt tggccagaaa acaaaaattc tgaggaatgc 10441 tgaaggacca gtttagatta atcatcaaat ggagacttcc taaaaaaaaa tttatgctat 10501 aaaaattccc ttttcaatac aaattgcttc acaaattgtc aaataatttt caatataatt 10561 atttaataaa gttagaataa actgcttggt cttactaaag aagggggaag caggagctcc 10621 agatcatgat tctattactt gccagctatg taatcttcag ttaatttcct aacctttcca 10681 tctctttgtt tccttaacca tacagaaaat ggataattat atcttcttca cagatctgtt 10741 actgcaattg agataacaaa tgtaaagtga ctaacacaga gcctgacaaa gaacagagac 10801 tgaacaatgg ttcgtttcac tcttaattcc taaattgcct gctgacacag ctggatgaca 10861 cctctgtctt gttgctctct gagaagaaaa ggcactagct cccaaacaca ggctctttct 10921 actacgacag cctgcttccc aagaagcttt atcacagtga cgtgattatt aagagtttaa 10981 tcctctcagc tccacaaaac ctgaaacttg actttgtgag aaagccaggt ctgaggaaga 11041 cagacacaca caattaaact cagaaagatg ttacaaacac aaatatcaat ttatacataa 11101 gaaatacagt aaatttcatg aactgaacca gaatgttaat actggcagat aatacatttt 11161 ctccctttag gaaaattaca cagcaagctg caatagtctg aatgtttgtg tctttcctcc 11221 tcaattcata tgatgaaatc ctaactgcca aggtgatagt attagaaggt gtggcctttg 11281 ggaggtgact atatcatgag ggcagggcac ccatgaatgg gattcatgcc cttataaaag 11341 aagcttgagg gtgcctttgc tccttctacc acatgaggac acatcaagag ggcatcattt 11401 gtgaatcaga atgttggccc tcagtagata cccaatctgc tggcacctcc agccttcaga 11461 cttccccagc ctccagaact gtgaaaaata aatttctgtt tataagccac ccagtttatg 11521 gtgttttgtt acagcaaccc aaatggacta acagaggagc caaattgtac tcagaaggtt 11581 actttataca tctaaataag acaaatgaat tggataatat taagggataa tgtcaataaa 11641 ctatgataat atgatacata tggccttaca gggagaaaat actccccctc cactgagcag 11701 aacctatcct cctatgaacc atgctgagac atacaacaaa attatagttc cctgactcct 11761 atttttctaa gatctatgca gatcattcta cacctacccg tgcagacaca cattactttt 11821 atacttttgt ctttttaatc tgatttatta gaaataggcc actggttaaa taacaacagt 11881 catctacttt gataaatctt ttgaataaat ttatatagtc taaacagata acaaaactat 11941 caatatttaa ggaatactaa cttcctaaaa gtttgtcagt ctcttaattt ttaatatgaa 12001 tttaaaaacc aaagaaaagt caaacttgac aactggtaaa aatgtgcaca agacaataaa 12061 atatctaata tatatttatt tagaaacaac taggctagaa aatactttct tactttaaat 12121 ataagagaat taaattaaaa aacaaaaagc ctttaaaagt aaccattcct ttttgacatg 12181 ttaaatttat ccaaccctat tgaagataat attttggttt caaaaacgtt ggtactgttt 12241 cccttctaga attacttggt ttgagtccca tgactctaaa atatgataca attttaagaa 12301 ctgcataaag atatatacaa atgtatacat caaaaacatg tcatagatgc aaaattttag 12361 gtagtatata tacataaggt ttcccattag attaatacaa aaggcttatg acattgcact 12421 ccctgaaata caaattgcca tctttctcac aactaaacaa actatgctaa tatcaatttt 12481 ctgtagaaat attcatcttt attcaaggtt gacttggaat tccttaagta tttctttgag 12541 aagtaattat caaagaattt gataatttga gcaatcatcc ttcaaaagaa acattagggc 12601 caggcacggt ggctcacgcc tgtaatccca gcactttggg agacagaggc gggtcgatca 12661 tctgaggtca ggagtttgag accagtctgg ccaacatggc gaaaccccgt ctctactaaa 12721 aatacaaaaa tgagctgggt gtggtggcac gtgcctgtaa tcccagctac tcggggggct 12781 gaggtgggag aatcacttga acccaggagt cggaggtttc agtgagccga gatcgtgccg 12841 ctgcactcca acctgggcaa caagcgcaaa actccatctc aaaaaaggaa aaagaaaaat 12901 aaacattaag accatcatct agtgttttat ccaattattt aatatttaac aaatacttat 12961 tgagcaattc aaagaatggg aaaacagaga taaaagatat agtactaggt cttaatgaac 13021 gtcaatccag tgagtacatc aaaacatata tatattataa caagtgtgat aaataaaatg 13081 ctcaaatata ataaataaca ttattaagca tcttcttttt tttttttttt tcaaaggtgc 13141 tattataacc ctaaatagag gttctaaaac cacccacaag aatcaggaaa ggctatttgg 13201 ggaagttaag gaggacaaag ttgggtaaag aatgaataca aaacaaacag ccaggtaaat 13261 tcgaccagta actgtgttca aagaatattt aacattaaat gcaaaccaca gaggctgaac 13321 aaacctgtta tgttcaggga aacaagggac agccataagt gtcatttggg gatgtagcaa 13381 gatggcaaaa agcaaatggg gttacagttt gagatttata aagagcaaag ttgttgaaga 13441 attttaagta gcaaacggac atgactagat atgccttata ttaagatcac tgtggaagca 13501 gtatgagtat taaatggaaa tcagacacta atagaagact aatacagtaa tcaggatgag 13561 aaagctaagg tatgaaacaa gaatacagaa acaatcagag tagaacagga gtcaacaaac 13621 tatgctcgtg ggacaaatct ggtttattgc ttgtttttgc taataaagtg ttaatggaac 13681 acaaccacat tcattcattt acatattgtc tatgactgtt ttcctgcctc aacagcagag 13741 ttcagtagtt ataactgaaa ctaaatggcc tataaaatct aaactattta ctatctgtct 13801 cttcgcaaaa agtgtgctgg tccttagatg agctgatttc caagttttcc agttgaatag 13861 tggtgctcca cagcagctaa caaacacagg agaagatata tttgacaact ctgtatttta 13921 atattctgag ttacagagtc tttatgatgg tctcatggaa taatgtggat gatactaaga 13981 atgagcttgc tcaatataca ttctggcttc cttttagtat gtgtttgctg taaaagggtc 14041 cagatatgac taatgttcca ttaactataa gaacttgttt gacatttaga tggcagaggc 14101 tgtacttttc ctacttctgt tgttttccct gccaagcttg gtcatagagt gtttggattt 14161 cttctatcag ctctaatggg gcctggtgta tagtccctag cattatgagt gccaaaaggc 14221 agaacagggg acatctattt tgctgatcta gatcttgtca gaggaaatat ggctctgaag 14281 ctatcaaatg caatgaaggt tatataaaac accagtgggc agtttcctga catggaggta 14341 gcaatatgtg cccaacttgc ccaattctcc acagttcagt agttctgaga ggctcagatt 14401 atggtctttt tcttgagccc tcccaaccat atcctataat aaacccctat ctactaaaac 14461 ttgctaaagt aagttctgtt ctctataagc aaactttaac caatacaagc atcctgcagg 14521 cagggcagga agctataaag atttggtgtt tacaaaaaga gatcttggaa gcagattgag 14581 attttaaagt tatcagcaaa tgggaatata cgagatcatc aaggaaaagc atttagaata 14641 aaaagtggtc acacagagag agcacaaaga ttaaggaagt aagcagaaat tgttacaatt 14701 atgaagaaac caaaaagcaa cgtcaaggaa aaaaaaaaaa aaaaagaaaa caggagagat 14761 tgagataaag aaacgaagag aagaaaggtt gtcaagaagt tgaaatggtt accaatatca 14821 aaaatttgag aaaagaaaaa aaaagtcctt tgggaaactg ggaggtcacc cagcagtcac 14881 ccagtagatt actgaaagta atctactttc agtagattca ttgaagaagt catgttagaa 14941 gccacaatag tgtagactgg ggactatatg agaagtcaag aaaaggagac agtaagagca 15001 gcttattctt ccaaatagtt tgaagtaaac agactaggaa atagttacat aataagatat 15061 aactaagtaa acagttaaca ttgaagaatc ctttttcctt tcaagtgaga gaaattttag 15121 tatctttgtt tgctgctaat aaaaaggaaa aggttgaaga ttcagataag ccaaaaagca 15181 aagtaaagaa gcagggcaga aaggatggtg tggaattagc cttaaatagg aaggttagag 15241 atctgatgga gttgtgtcct taagaatgaa ctttgcatga agtcaaaaac atgcatgcct 15301 tgtgaccttg attttccttg taaggagtgt gacaagatca tctcatgaca atgagagaga 15361 agagactgga ggtttacagt gaataaagaa aatctgaaac aaccatctgt gaaatggaag 15421 agggaactgg acgtaataaa aaagtaattg aagcccagct gcatctaaga atcataaatt 15481 ttagttacaa cagtcttaca gttgtctgtt ttgttttgtt tgttttgttt ttgtttttta 15541 agagcaatga aaagactagg catagaagta aaatagggat aaatgttgca tggtccacag 15601 atggcatttt gctgtgcagg agtagtagaa aggcaagggt gaaataattt gagggtatca 15661 tcaaggggaa tacttcttgg taccagatct actgaatcag aaattctggg tgttgggctc 15721 agctatctgt gttttaacca gtcctcccat caattctgat gcatggtgaa gtttgagaat 15781 catcagttag gtgaatagtt gaagagacac atctttgatc aatttaatat aaaaaaataa 15841 gtagacccag agaaggggac agacatgagg ttgacagagt agaaaaccat cgagtagtaa 15901 agacatttat gctgtagatg agttaacaga ctgatgggtt cattttgttt tgtttttgag 15961 atggagtttt gctcttgttg cccaggctgg agtgtgatgg caggatctcg gctcaccgca 16021 acctccggct cctgggttta agcaattctc ctgcctcagc ctcccgagtc gctgggatta 16081 caggagtgcg ccacaatgcc tggctaattt tgtattttta gtagagaagg ggtttctcca 16141 tgttggttag gctggtctcg aaatcccaac cttaggtgat ctgcccgcct tggcctccca 16201 aagtgctggg attacaggtg tgagccaccg tgcccggcca acagattgat gttttaggtt 16261 aataaaagaa ttatgagtta taacaggaac aagggagtgg gagggcagaa aaaacaggaa 16321 gctacagtca aaggaagaga cactggagac ttagaccaaa tgatactaat gacaatattc 16381 aacgttgata aggagggaaa gattctgaaa taggatggaa gggaagggca ctagactcaa 16441 gtttgggcta gtagatgtca aacgactgtg agcccaacgt atgagaataa ccatctccat 16501 aggtattggt aacatccagg atgtgggatt tagaatggag agaaaacagt actgctgaag 16561 tcctagagat gggagacaca aaaggattag gagatgaaaa caaccatatg aggaagagga 16621 caatatgtta tagaaaataa attccctgca taaaatccca aagaaggcat aattcataca 16681 ggatggaggg ctacagagtc aaatccagcg gccacttctt aggaaggtaa ctgccctttc 16741 ttgttttgaa attcagggca aaatgtttga ttaatagagt tatctttcct tctcttttga 16801 agatctaaaa aaaggtatct tctcttactt aatgtagaga aatactcttt ttttttttaa 16861 acggtgtaac agtttcatcc ttacagggag acctagagtt aaaacataat tgaccaaaca 16921 actaaaagta atctctaatg ttagaagtta gaataataat tatctttggg gaagccagaa 16981 gagagtaaaa attgtgagag agggtaaaga aactctagga gtactgataa tgtcctattt 17041 ctatcctggg ttttgatttc acaggtatgt tcacactgtg gtattcattc tatatgaaag 17101 tttatgattt gtccactttt cattttgtat atgatattca attttaaaag aaaaagtacc 17161 tgattcagtt atattttaaa aattacaact atttaggaat cagaagtcaa ataagtacct 17221 tcttgattta gaggcttaag aacagcacat taataacatt gtttctgtcc aacataaatg 17281 cacaaaatga aattcctgat ttagaggata tgaaaaaact ctattaataa ttatattgga 17341 gcccaataca aacggagaaa atatctgggt ataatacaag ggtagatcta agatctaatt 17401 ctaagttgtt gaatctaata catcaatacc tttgaatttc ttccaattca gaaaattaag 17461 catcaggaaa atgatacctt aggagaaatg tcaagtcaaa aattttttaa attatggcat 17521 aatgattaag agtgcaaagt tctgggacta gagtgattaa aatttgaatc caaactaaat 17581 aatttactag ctatgtgatt ttcaacaact tatttaagct ctgtttcttc aactgcaaaa 17641 tataattaac aataatacca atcttactgg gttgttgaag ggcttaaatg aggaatcgtg 17701 ctttagacaa agcaggcatg gtcagctagt tatctatcta tatatgcaca tcataaaaaa 17761 tacatttcat gttatataca ttatacatta gaaacattag gtctattttt aaattaaata 17821 tattaaacat atattaaaat ttcccttaac gatgacccat tctttgattt aaattgtaga 17881 catactttct cagtgatata ttactataag acatatatta cttttttgaa ctcaaagtca 17941 aatgttcacc tgtattatga cacatttttt acattaaaaa acagtcttat ttttattacc 18001 tgagtctcca aaccctgctc tagtaggcgc ttagcttgat tttccacttc aagtcgggct 18061 cttgcaataa aaagtagatc attttctatc acttctattc cagaaagatc tattccttga 18121 gaaagataat ctgtttaaaa caaaaacata cacattcaaa tatttcaata ctgaataaat 18181 atcaaggtat caaaatatgt ataacttctc aaaaaaaaaa aacaaaaaac gctattaaac 18241 agggttgcca ttcaaaatcc gtattttaaa aaatacagtt tcccaaatat tttctattta 18301 ataagtaatt tatgctggag ttaaatgctc caaataagac agatttggta aggcaagaat 18361 agaaatgatg attattacaa gtttaaggaa atgtcaagag atcacattta aaagtgtcaa 18421 atttagtatt ttaaagtaaa taaatattct cacattttca gaaaattctc aagatgcaag 18481 tacagttttt ttgctgctgc tgttgttttt tttttaagag atgggggaag tctcaataca 18541 ctgctcaggc tggagtgtag tggcttgtca cacatgtgat tatagttcac tatagtctcg 18601 aactactggg ctcaagcagt cctcccacct cagcctccca agtagctggt actaccagta 18661 cctggcatca catccaacta ttctcttttt ttttaagaga cagggtctca ctatgttgtt 18721 caagctggtc tcaaactcct ggcctcaagt gatcctccca tcttgacctc ttgagtagct 18781 gggccccagt atttttaaaa gcaaaaaagt ggccttcatc atattatata accaccacca 18841 gcttccacct ccccagttat ccaaaccata tttacaatca ctcaccttaa gaagtaacca 18901 aacaacatgc tagatttgca gtacaaaggg ctctaagagc tgggagggag aaaggtctat 18961 attattttag ttactaatta ttccacaaac ctgcttttca gtaagagaca tgtagcaaca 19021 gagggaaaaa aacacaacaa aaaaaaaaaa ccaggaaaaa aatgtagtaa caaaccccag 19081 cgactgccct tcaccacaga agaatagagg gattatgata ctaaatttcc cctcccattt 19141 atctgatgca tacaatcact atcagcgtta caagtcttta tatatctgct ctttgtcaag 19201 cacatctaga gactggagcc aagaacaaaa tgatgatgat gatgattcca ataacaatga 19261 ctaacatttt tgaagatata gcatgtataa tacattaagc tagacgcaaa gtgtgtatta 19321 ctttaatgca tctgttattt gtaattttct aaacaatctt gctagaaatc tcatcatacc 19381 tagatggaga tactaaaatc cgaagaatct aagtaatctg aacaaattca cttggctagt 19441 aaccggtaaa acagaattaa aatcctggtc tttctgacca cagtcttttc agtagataat 19501 cccaggcctc aaaagaatag agcttatatt caaagagata gtattcctcc gatattcttc 19561 caatttgcac attgaaaggg taaagatgta aaaatcataa cagatataac tgttcaaaaa 19621 atttgcccag gttagaacaa tgaaagtatt aaataactag tttaactttt aaaaatatgt 19681 aaatgtttaa aaatatatat aatgccaaat tttgaaatat ggcagcatta ctcacatctt 19741 gacaattaaa ccttaataaa tgtggagtgc acagccagaa ttaacttcct agccacttaa 19801 tacaaatatc tcttggatcc cttcatatac cacagtgatg tagcaggaaa tgaaactaat 19861 ctatttgaat ttaatgtatt tatcatctcc ttaggagaaa actatgagac tagtgctttt 19921 agtggaaaag aaatgatatt aatcatctct tggtttactt aacatgagaa aaatttaatc 19981 tactagatta gataaaaggg aattcaagca catcagtcat tttactaggc tttcaatatt 20041 ctagtcaata gtacacatgg ctaccggaat aaatgttctg gtttattcta gtttattata 20101 gttgcccttc cattcatgaa cctctggctt cacactgact aacatgttct ccattttgca 20161 atagaaaatg tctttattga ttccaaaatc ctcactgtcc cttttttttt tttttttttt 20221 tttttttttt tttttttttt tccgagactc agtcttgctc tgttgccagg ctggagtaca 20281 gtggtgcgat ctcggctcac tgcaacctcc gcccgctggg ttcaaagtga ttcccctgcc 20341 tcagcctccc gggtagctgg gactacaagt gcacgtcacg acgcctggct aattttttgt 20401 attttagtag agacggggtt tcaccatttt ggccaggatg gtcttgatct cctgaccttg 20461 tgatccaccc gcctcggcct cccaaaatgc tgggattaca ggcgtgagcc accatgcccg 20521 gcccactgtc ccttctttta ctagcctcct cctttgtgta aatgaaaaca aaaaaaggaa 20581 agtacaacta cagtttgcta aagaattcag ctatgcactt atgtttcacc acaatgttag 20641 cactaaatta tttattcaga agaaaactag atgaatgagg aatgcacgta tgaaattggg 20701 ttggtgctgg aagttgaaag gaatgaggag ggtaaatctt catgcaccat gatgaggagt 20761 caaagaagtt aaaaattaaa aagtggaaat ataagcagat tatttacaga tatggaggca 20821 aatactaaaa taattagctg aaaaagactg aaaagtttgc ctctgagaaa gagtaaatga 20881 ggccaggaag aaggcagagc attgccattt tttaaaaaca aaccttgtag agctatttga 20941 gtttattaca catgtataac ttccataata aataaaacta aattctaaaa agtaaataga 21001 agtcaagaaa aataatacca aaaattaaaa taaggtggct agtctactag aaaaaaaaat 21061 cacatcctct aaatcaatga ttctgcttct gggaataaat aataatttta aaaaagaaga 21121 aaaaggtgaa tatatttgag cacaaacgca tttattacag tgttaattgc aacagtaaca 21181 tacatacata gggatatatg tacgtgtgca tgaatgtata tatgtatatg tatataaata 21241 catgtatgta tatatgtatg tgtgtatata tacacacaca tacacgtatg tatgtatgtg 21301 tgtatatata cacacataca cgtatgtatg tatgtgtgtg tatatataca cacacataca 21361 cgtatgtatg tatgtgtgtg tatatacaca cacatacacg tatgtatgta tgtgtgtgta 21421 tatacacaca catacacgta tgtatgtatg tgtgtgtata tacacacaca tacacgtatg 21481 tatatatgtg tgtatatata cacacacata cacgtatgta tatatgtgtg tatatataaa 21541 tagcaacatt gtaagttagt aaaaaccagt ttggaaaaca atatgggact atctgataag 21601 gtttcatatg tgtatgcccc atgactaaaa tattccattc ttaggcacat accccagaaa 21661 tgttgtcata tatgtgtatc agaagacatg tacaagaatg ttcatagtag cactgttaaa 21721 aacaaagctg aaaatacaaa taaccatcaa taaaagaaaa aaattttgta atatattcat 21781 tccaagtact actacagaag agtgaaaata gacaaatgac aggtatgtac atatggtata 21841 aatatcaata acaatgaatc tcaataacat aatgttaagg aacaactttt agaaaacaca 21901 cacagcaaca ggtaattcat actgagaaac ataagactaa atgatatatt atttgggata 21961 catagaaaca ttaacaacat ttggaaaaat tagcagagat agcagatacg ttattctaat 22021 acactaattc ttaaatttag tgggcagtat tataacttat atatgttaaa aatgtgtttt 22081 tgcattttta tattttccca ttaaaaacta cgtatgaacc aatggcacag agtatagcat 22141 ttttagacaa ggacttcaca acagttacta taactgtatt tcatatgttc ccccatcaaa 22201 aaagacggac catgtcaaat agacatggat aataagacaa atcagttaaa atgtataatg 22261 tctgtatgag attaacagca gatgggacgt tataggaaaa gcagaatata ttacctcagg 22321 gagaagagtt taacgcaaag attattaact ggtaactgga taaaagcttt taagataata 22381 ctgttttcat taaagatatt ttagaagtca gaggaatatt aaaaagaggg gaaattatat 22441 aaccccataa cacagaggca accactggcc agtggagcat tggaatttgt acagcaatat 22501 ctcagaaatt tatgaatctc caaatcctca agtaacagaa aaacagcata tatcctcctc 22561 ctggatattt atgtgagaaa gtgcagagaa aggaaaaact gaggtacagg gtctgggcag 22621 ctggcgacag tcaccaacaa aagtggctcc ctcctacaca ccaactttat aaagagcacc 22681 atacaaatat atgtctcaga agcaaccttc agatgtttct aaaatactta aaaccttgat 22741 tgagcttaaa agtaaatgtc tactatattc tttttaaatc tatcctttta aaattttacc 22801 catgagttgt tttgctctta ttatttgtaa ttatacttat ttacaatgtc ttctttatta 22861 acttaatgtt acatacattt acctacccta agacggtcct tataaaaata tcaatttttg 22921 acagctccca atatttcatc aaattaatga ccaaaattta cttaaccttc agtttaagat 22981 ggaatatttg ttatcataaa tgactacttt aaacatcttt ataaagattt tcaatatcca 23041 gaattatttt cttgggagag aatcaaagac agaattattg gatcaaagta acacgttagc 23101 ttcttgatac aaattatcaa actacttttc aaaatggcta tactacttta tatgcttttt 23161 ccaataatct atgggagtaa tttaaccaca gatagcactg gatattacta ctgtttttga 23221 atattatata tgcttctata agaaatatgt tctcattgat agtctaatct aagttatgaa 23281 gctgaatact gtttattaac tatatttcct cttgagaatt tctaaataat ttttgctccc 23341 tttttgtctt aaatcccttt gtattaatta tataataata ttcactgttt gcctatcata 23401 tttgctattt tcctattatt ttgtattttc ccagtctatt gatttttaag ttatttgttt 23461 acatattatt tcttgaggcc attctcagaa cacaatcata aaattatcta aaaagctcca 23521 cacacttgat tcttcatcac atccctgcct gattaagaaa ttcagactaa aatcttttac 23581 aaaagcaaat atcaaaatat aaaattgtgg tttatagagg gttatctaga agactgatga 23641 acagtaatct ggagaggaaa gatgcctttt tttgatccgt tcaacaaact ttgtatcttt 23701 gaaattaaaa taaattgtat tgcaaagatt aaggcttatc acaacaacca catttgaact 23761 ctttctgaag catcaatctt cttggctgca atcatatcag atactctaca aagaacacag 23821 ctactagaga atactgagat aaataacaaa tttaaattaa gacaaaacaa aacaaacaac 23881 cattaaataa caccaaacaa catgaatgtt taagggttat aatattgaca gaactaattt 23941 atccaatcac aaatgagatt tgggctttcc acatactgaa tagtgttcca gaagccaagg 24001 agaataatgt tccagaagtc acttaaatct aaggtagtcc agttttttaa aaattgcttt 24061 aaagatacat ccacacacag acacataaac acacacagag tcatggcttg cttttttttt 24121 ttttttaagt ttcattaaag aaaccttaga gaacacagaa gttagtggaa aatgtcatcc 24181 aaaactccaa ggacaactac atatacattt gattttcagt tcattattct ggatgcaaat 24241 agtgttttat ttttatttat ttgtatttta gtttagtttt gttttgtctg aaatagagtc 24301 ttgctctgtc accccaggct ggagtacagt ggcacaatca cagctcactg caacctccgc 24361 ctccggggtt caagcgattc tgctgcctca gcctcccaag tagctgggat tacaggagca 24421 tgccaccatg tccagctaat ttttgtattt ttagtagaga cagggtttca ccatgttggc 24481 caggttggtc ttgaactcct gaactcaggt gatctgcctg cctgtatttg gaatacttta 24541 atcatattta agtagagatc tgatgatgaa acatgtcttt gaagaaaaat attttaaaga 24601 caatttttca catacagttg cagagtgttg ctaataaggt gtaatataat tgaaaacaca 24661 gtaaaataaa attataaggc aaaaatttta aagtcgtggt atttgcacag aaacgcaaat 24721 ggcttataga gaacagagaa cataataatg tgggcttaaa attttaagat tttaatatgt 24781 gatagcctaa atataattag acaaatagaa aaattataaa taaaagctca caatatataa 24841 aaccaataat ttaaacaaaa ttaatgaata caatgcttat tccaactaaa aataatttct 24901 taacctttaa agaatgaatg aattacaaaa aatattttac caccaattat ctattttttt 24961 ctaatagtaa aaataaaaga aaaagaaaaa gagaatggag gaggtatatt tatcaaatga 25021 aatagagaaa ggcatcatca tcaaaaacac ataaagaatt tatagatttt tttttttttt 25081 ttgagacgga gtttcactct tgtcgcccag gctggagtgc aatggcacaa tcctggctca 25141 ctgcaacctc cgcctcccag gttcaagcga ttctcctgtc tcagcctcct gagtagctgg 25201 gattacaggc atctaccacc atgcccagct aatttttgta ttttttgtag agacagggtt 25261 tcaccatttt ggccaggcta gtctcgaact cctgacctaa ggtgatccac ccgcctcggc 25321 ctcccaaagt gctgggatta caggcgtgag ccactttgcc tggccaagag tttatagttt 25381 taaatgggaa gataaccaaa gcctattaaa taacacgttc atgtaacaga ataaccataa 25441 gtctatattt gaaaacctat taaacctttt taataatctt tgaaattaat ctttacataa 25501 aatgaaatag taaatattca ctgaaacatc tttaaaatat gtaaaaatat atttcaatta 25561 aaagttattt cttacaaaaa ttttagagaa aacaggtaca tcttttcaga ggccaaatat 25621 tacatgatag aaagaataca gaactagaaa tgagaacacc tgtgctttaa ttatgctcaa 25681 ttctatgacc caagaaaaat cactaaactt ttctgagact ccaacagtga agattaaaaa 25741 acctgtgttg ctaaccaagt agggttgtat ggagcaagta aggcaatatg catgaagcac 25801 gttttgtaga ccataaacaa ctacatgtaa gttattagtt attattttca taatagggta 25861 ttattctttt gatcgggata gggagaagaa gcacccggaa gaacatgtag agataatctt 25921 aaggagttca ggtattacag aaaacaaaca caggttaaaa tttgtttgtt tgttttgaga 25981 cggagtcttg ctctgtcgct caggctggag tgcagtggtg caatctcggc tcactgcaac 26041 ctccgccttc caggttcaag tgattctcct gcctcagcct cctgagtagc tgggactaca 26101 ggcgtgtgca accacgcctg gctaattttt ctgtattttt aatagatatg gggtttcacc 26161 atgttagcca ggatggtctc gatctcctga cgtcgtgatc cacccgcctc ggcatcccaa 26221 agtgctggga ttacaggcgt gagccaccgt gcccatccta aaatttgtat ttttctgact 26281 agttcttaat tgataccatc acataaacta tgcatagaat agcctcaggc ttaagtaaac 26341 tcctttagct gaatctttat atttcaagtc taaggcactt ctcaggacat gttcatcttt 26401 cttgttcttc aaagcttcat ggaaatttag aatcaaaagt cttctttttt gtgtctctca 26461 caaaaggggg aaggaaaata ccttactatt aaaaatgaga attagttttc ctttcctaag 26521 tctcactctg tgtctgcttt ggctaccaca aaaaattgct cagagccaat ggaaagaaaa 26581 tacattgtcc atctgttctt ctgggcacca tttttccttt tttcctgact ttctaactgt 26641 gatactgttg atatgacatt atactttaaa aacatgaatt taaacaaaac ctcatccctc 26701 catcgtcaac agaaagatat ttcaggtact ttattgaatg cctaccttcc aatcagtacc 26761 tagctagaaa taactgtcaa tatctctcct tagatgccct aggtaaaatc agcagggata 26821 tataaacagt gaccactacc ataaactaag gatctaattg acatctacag agcactccac 26881 caagcaaaaa caatacaaat tctgctcaac tacacatgga acattatcca atatagacca 26941 tgttctgggt cacaaaacaa accttataaa tgtaaaagaa ttgaaataat acaaagtatg 27001 ttctttggct gggcacagtg gctcatgcct gtaatcccag cactttggga ggccgaggca 27061 ggctgatcac ttgaggtcag gagttcaaga ccagactggc caacatggtg aaaccccatc 27121 tctataaaaa tacaaaaatt agccacacgt ggtggtaggt gaagcaggaa gcttgaacct 27181 gggggcgggg gatggggagg ttgcagtgag ctgagattgc gccactgcac tccagcctgg 27241 gagacagaag gagactccgt ctcaaaaaaa aaaaaaaatt gttctttgat cataatggaa 27301 tcaaattagg aaattttcta aacctttgaa aattagtaca tttttaaata atcaagggat 27361 cacaaaggat ctctgagaaa tttctctttg attttgaatt gagtgaaaat gaaaatacaa 27421 aatcaaaatc tgtgagatgc aaccaacgca gcagttaaga aggaatttaa taacattgtg 27481 tttttagaag aaaaactttg ttattagaaa aaggtctcaa atcaataata caaactcctg 27541 cctcaagaaa ctagaaaaaa ggtaaaataa atccaaaatt agaactaata aagaagatga 27601 aatcaatgaa attgaaaaca ggaaaataga gaaaaatcaa tcaaaccaaa agttgactct 27661 ttgaaaacat caataaaatt tgtaatctcc tagtaatacc aacgaagagg aaaaggagga 27721 aaacaaatta ctaattatat aataatattc actgtttgcc tatcatattt gctatgagta 27781 ttttcccagg ctgttgattt ttaagttttt gtttacattt tatttcttga ggccaacatg 27841 gtgaaaccct gtctctacta ctaaaaatac aaaaattagc cggacatggt ggcatgcccc 27901 tgtaatccca gctacttggg aggctgaggc agcagaatcg cttgaacccc ggaggcggag 27961 gttgcagtga gctgtgattg tgccactgta ctccagcctg gggtgacaga gcaagactct 28021 atttcagaca aaacaaaact aaactaaaat acaaataaat aaaaataaaa cactatttgc 28081 atccagaata atgaactgaa aatcaaatgt atatgtagtt gtccttggag ttttggatga 28141 cattttccac taacatctgt gttctctaag gtttctttaa tgaaacttaa aaaaaaaaaa 28201 aagcaagcca tgactctgtg tgtgtttatg tgtctgtgtg tggatgtatc tttaaagcaa 28261 tttttaaaaa actggactac cttagattta agtgacttct ggaacattat tctccttggc 28321 ttctggaaca ctattcagta tgtggaaagc ccaaatctca tttgtgattg gataaattag 28381 ttctgtcaat attataaccc ttaaacattc atgttgtttg gtgttattta atggttgttt 28441 gttttgtttt gtcttaattt aaatttgtta tttatctcag tattctctag tagctgtgtt 28501 ctttgtagag tatctgatat gattgcagcc aagaagattg atgcttcaga aagagttcaa 28561 atgtggttgt tgtgataagc cttaatcttt gcaatacaat ttattttaat ttcaaagata 28621 caaagtttgt tgaacggatc aaaaaaaggc atctttcctc tccagattac tgttcatcag 28681 tcttctagat aaccctctat aaaccacact tttatatttt gatatttgct tttgtaaaag 28741 attttagtct gaatttctta atcaggcagg gatgtgatga agaattaagt gtatcagtaa 28801 caggaattaa aaagaagata taactatgga cctcacagac atatcatcgt ggaaaactat 28861 gaataactga acaacttagc aaaacgacca gagctcatac aagattatct gagttgatct 28921 agaacattaa gggaactgaa tttgtagttt aaaacattct gaataagaaa tcaacaggcc 28981 cagatgtttt cattatcact gaaaaaagaa atcaattcta tacaatgaga gagggaacat 29041 ttcccaaaag tagggaaaca aagaatgcta cagaacatta tccctcaaga gatatacaca 29101 aaagtcttca aaacactagc aaatgtgatt accttaatac aaaaaccaaa gacattacaa 29161 aaaaaaaaaa cctacaaaat aatctccctc aagagcagag atgcaaaaat atcaagaaaa 29221 tattagcaaa cagaatcttg caacatatga aaagaataat atgccatgac catgaggaat 29281 ttcttctggg aatgcaagga agatacacac tgataattta gaattgttta ctatgaaccc 29341 tttaacaaca aactaagatt ctatagctaa taagtcaaca aatgaggtaa aatttaaaaa 29401 taatgaaaag aatgcaggaa aacaatgaac agatgacaca gattgcaaaa tatttctaga 29461 aaatgcaaag tagaatgtag gttagtgaga ctgagcgcag tggctcacat ccataatcct 29521 gacacttctg gaggctgagg ctggcggact gcccgagctc acgagtttga gaccggcctg 29581 ggcaacatca tgaaaccccg tctctacttg aatacaaaaa attagcaggg tgtggtggtg 29641 cgtgcctgta gtaccagcta ctcgggaggc tgaggcacaa gaatcacttg aacccaggag 29701 acagaggttg cagtgagctg aggtcatgcc actgcactcc agcctgggca acagagcgag 29761 agtctgtctc caaataaaaa ataaataaat aaaaaatgta ggttggtggt tccctggggc 29821 ctggaggggt tttcagagaa gggtggaagg gaggtttcac atggaaactt ttgggagtga 29881 taaatatgtt tattatcttg attatgttga tgctttctca ggcctataaa tatataaaaa 29941 tgtataaagt tatatacttt aaacaagtgc aggttattga caccaattat acctcaacaa 30001 aacttttttt ttttttttga aaaaaagaaa aactgcaaaa gaaatatatc aaacagagct 30061 atggggttta ttcaacccct gaaaaaagca ttttccctct tcactatgct tacaaaagac 30121 atcaagaagg attttctatt tcaaacagcc attaaatgcc tatatgctta gcactgtata 30181 aagacggaaa cagatattaa ttctacattt acattatacc ttttttgaca gactagaaaa 30241 cagaattata aaccaggctg tcccctatcc aaaatcctgc atgataccct tatatacatt 30301 aaagtgtttt gttcccaaag agctaaactt gatcacataa ttcttaatta tggatgctag 30361 agggtctgct tcactcaggt ccaagcactc caaaaaatta aaagcttagt ggacactgac 30421 tttgcaatat attaataggt gttataactt aagggaatcc tagggtaaac acaaaattaa 30481 gtaggtactt taccacaata ctcctcaaaa ccattactgt gcacaatgaa catccaaaag 30541 aagaatataa tatataacat ttcccacatc tatttatcca caggatacca tccatccccc 30601 acttttttcc cttcagattt ttcaaaagat ccagtaagac taaaacccat ggaacacact 30661 tagaaaaaca cagtgactac ttcatactgg tttagacgca atactaaaag aaacaaaatc 30721 catataaatc cactgataga agccctttgt ttactccttt aaaaatcagg tcaataaagg 30781 gtcttattaa tatagtataa agcacttaca agcaataata caggggtgag actctgtctc 30841 caaaaaaaaa aaaaaagatg aggaagatac acatctattt atatcctact ctcctatgaa 30901 gagtctccat catataaaca ttaatgacca cagaatcttc atatacatga taccgtaaca 30961 acttctagtc agattaattt ttattctatt tctgaatttt actaaaaaaa aaaaaacaca 31021 agaaatatta tataggtatt atgagagtca catatccaca gtcagaagag tccttactat 31081 gacgaaataa cataaacatt attaatggta attactaagg tctgcaaatt ttaagggctt 31141 taagttaact cagaatatct actctctaat ctacacatta cagatgcaat acgctgatgt 31201 aaaaggttag aagaactacc tggggtgata atacaataag tcatcagaaa taacaataat 31261 aataaacatt catataatgc ttactatgtg ttatgggttg aactgtgtct ccctaaaatt 31321 catatgttga agtcctaatt cccagtacct caaaatgtga gcttatttaa acataggact 31381 gttgccacta attcaatatg actagtgtcc ttatattata aaaagaggaa acttggacaa 31441 tacccacaca tggagaatgc tatgtgaaga caagggcaga gacgggggta aagtttccat 31501 aagccaaaga actcccaaga ctgccagcaa accaaagcta ggcaagaggc atggaacagc 31561 ttctccctta tggaccacac aaggaagcaa acttactgac atcttgattt tggacttcta 31621 gcctccagaa ttgtaagaca acacatttct gttatttaag ccacccagtt tgtggtactt 31681 tgttatagca gctcgagcac tacagaccct atgtctccag acactgttat aagaacctgt 31741 atattactca tttaatacaa caactcattt aatacattta tctgctccac tttacagatg 31801 atcaaactga ggcacacagc agttaattaa cttgcccaag ttcataatta ctaaggcagg 31861 atttggaaag tgacgcaaat ggtgtttgtc ttagttcagg ctgctataac aaggtaccat 31921 aaactgggtg gcttataaac aactgaaact tacttctcac aattctggaa gctggaagtt 31981 caagatcagg gtgccaacat ggtcagcttc tggtaaggtc tctcttctgg gctgtaggct 32041 tgtatcctca tatggcagaa gagggcaaga gggctctctg ggatcccttt ttcaaagaca 32101 ctaatagcat tcataagggc tccaccatga tgacctaata acttctcaaa agcccttcct 32161 cctaatatca ccacattagg aggtaagaat ttcaaaatat gaatttgggg gagacataaa 32221 cattctgtcc attgcagtgt tgttgctaga atggccccat agtgtagctg tgtgtttcaa 32281 ctgtctgggg cattcagatc aaagtttgat tctctagacc tgtaaaactt tgttctcttt 32341 atacattcta aaatcttata ataaatcctt tctacttaaa ttgggtatag tagattcagt 32401 tgatctcaac taggaaccct aacttcagta aggaactgga gttctggaga aagatgaagt 32461 acttaaagaa actgacagtg gttgcagccc cagaaaaata tccgagtaat ctctgactac 32521 tattcgataa tatgggtttc gaggtttgca cgacagcagt gaagggccaa tctgacaaac 32581 ctggccacaa aaatcaattc ctctggtctt gttcatctat atgatcataa acactttatt 32641 gacctctctc caaaagtaag gctatataga tagaccacac tggagtccag ctgaagagtg 32701 gaggtcatga ggtaaggaaa ataatggagg gttccaaact accacggact tcaagagagg 32761 tgaccagaga tcagcaactg agatttggct tttcctatac atcaaaattg acatggccca 32821 aacttgataa tttcttctct tattcacttt tcttcactaa actcccttca tgttaactcc 32881 ctgtgaaaac actgcacaca caaattattt cacaacttga gaattaaata cattttagga 32941 caaaaatata tactcttctg ttatggactg aattgtatta tgcaaaaaaa gatgtactga 33001 agtcttaacc ctcagtacct gtaaatgtga ctttacttgg aagcagggtt ttcccaggtg 33061 taatacagtt gacactggat ttttctcggt gacttttgcc agctggacct cctccagctg 33121 gtgacgcctc caccgaagcc tcgctcagcc ccaggcctgc cactggagac atcctaccca 33181 ctcagcccac tgggctgtgc cttgctcgtg caccagctca gcctgtggct gggctgggat 33241 tgccccagcc cacccatgtt acagctcata cacacattca gcagttccca agttcttgtc 33301 ctgcatccaa gaagaatgac attacactga ccatcaaggg ttgagaaggg cagagaagag 33361 tgttattaag caacaggaca gctctcagca gagaggggat atgagggtgg tcccccactc 33421 aaaggcgggt atctctcctt cagtgtggct gggtccgaag cttttatggg ctcagaatgg 33481 gaagtgcgtg ctgactggtt tgtgagtatg caaaaaagac tacaaccaag gcaccactca 33541 aaggtgggct cgacagtata aaaaaccaat taggaaaggg taggtatatg taaaacaggt 33601 gaaaggtggg gatcaataag aggaaagtac gccaaacagg aagagaggtt ctcaatatgg 33661 tatgcggatt tatataagac ttgtaggttg gccttcacac tttaaattgt ctttcatttg 33721 aaggtggggt ttcatcaggg acctgcccct gtctgcttag gatttgtctg cctcctgcta 33781 ctatcctagt taaaataaag ttaaactgga ttagggtggc ccctaatcca atgactggtg 33841 cccctaaaag acacctttta aggaacattt ggacccagct acagagagga gaatgccatg 33901 taaagccaca gagacctagg agaaagccat gtgaagatag aggcagagac tggaatgaca 33961 catttataag tcaagcaaca acaagaatga ccacaatcac caaaagctac aggaggcaaa 34021 gaaggattcc tcccctagag ccttcagaga gagtatggct ctgttaacac ctttacttca 34081 aacttctagc ctccagaact atgtaagaat aattctgttg ttttaaggca ctcagtttga 34141 gtaatgtgct gcagtagccc taggaaacta aaacatcttc tctctgttat attcattttt 34201 tttctatctt caatcaaccc ccaatttagg cctgaagcta tagaccaaaa gagttaactg 34261 agaattgaag tcagtgaagg ttccacatgc agagcagctg caggatgagc tgtttaatct 34321 tctacacaac actcactggc gtattactag caagttctca tattttctta gctcaagagc 34381 tcttgagcta ctaaacacaa tttgcataat gaatgaatat gcttataaag agaagttagg 34441 aaggtagaac aaaaactata ttggcttgaa gcacaggcat cagatatttt gctaaataaa 34501 atggaggtca ccatgtcaat tccaccatga acatttcttg tctgctgttt aaaaaaagcc 34561 aagaatgagt gtgtgacaaa cctattgtac aatgaaagat actaagtaac agaatattac 34621 taagagtacc acttaaaatt ttatctaaaa tattttagct gtcccttact atagggaatc 34681 tttagggaac aaccatattt tacttaatcc ttcaaaggaa ctaaataaaa tatcatcttg 34741 gaagagtagc atattatgga tctgacagct cagagggagt agaatgttac catatgtctt 34801 tttaatttta gtcattcttg ggggtgtaaa gtggtatatc attgtggttt gatttgcatt 34861 ttcctaatga ctaatgatac tgaccacctt ttcatgtgct cattggccat ttgcacatct 34921 tctctggaga aacgcctata ttcaagtctt tgcccagttt tttttttttt taagagatgg 34981 ggtcttgcta tgttgccaag gctgatctcc aactcctgga gctcaagcag tcctactgtt 35041 tcaacctcca gagtagcagg gactacagga ctgtgtctgg cttttgctca ttttttagtt 35101 ggtgatatgg tttggctgtg tccctaccca aatcttatct tgaactgtag ctcccaaaat 35161 tcccacatgt tgtgagaggg acctggtggg aggtaatcca attatgggga cgggtctttc 35221 ctgtgacaat gaataagtct catgagatct gatggtttta tgaaggagag tttccctgca 35281 caagttctct tctctcatct gctgccatgt gagacgtgcc tttcaccttc caccatgata 35341 atgaggcctc cctagcccca tggaactgtc agtccattaa acctctttct tttgtaaatt 35401 gctcactctt gggtatgtct ttatcagtag catgaaaacg gactaataca gttgggttgt 35461 cttttttgag ttgtaagggt cctttatata ttttgtatgc aagcaccctg tgtcagatac 35521 atgatttaaa aacatctttt ctgattctgt ggattgtctt ttcactttct tgataatgtc 35581 ctttaatgtg caaaagtttt aaattttgat gaagtgtaat ttatctcatt tccttttgtc 35641 acttgtgctt ttggtgtttt atctcagaaa ccaaggtcac aataatttac tcatgttttc 35701 ttctaacaga ttgtatggtt ttagatcttt aggtatatga tacattttta gttccttttt 35761 tgcatggtat gaggaagtga tatatattca ttcttttgca tgtagacact ggttgtccca 35821 ggacaatttt ttgaaaagac tattcattac caattacatt gtcttcacat tcttgttaaa 35881 aattaattaa ctataaatgt cagagtttat ttctggactc tcaattctat ttcactgatc 35941 tacatatcta tctttatgcc attaccagac tgtatgaatt attgtagctc tgtactaagt 36001 tttaaaatca gtatgtgtaa gtcctccagc tgtgttcttt ctcaagttga ttttggctat 36061 tctttttaga atttgtttta gctattctgg gttgcttgca ttccagtatg aattttagga 36121 tcattttaca atatctgcag caaacaccaa ttcggattct gataaggact gcactgaatc 36181 tataaaacaa tttgggaagt gctgccatct gtgctaggca tattaatgta ctctaaggat 36241 atccaacaac ctaatcctca gaatttgtga aatgctacct tacatgctaa aaggaacaat 36301 gcagatatgc catgggtctt gagatgggga ggttatcctg gataggccct aataaaatca 36361 caagtgattt tataaaatag aggcagtggg agatttgatt acagaagaag gcactgtgag 36421 aatggaagca ggaggagaaa aggtgatacg atgcagggcc attaaccaag ggaagtggac 36481 aacctccaaa aaccaaaaaa acaaacaaac aaaaaaaaca aaaccaaaaa caaacaaaca 36541 gaaaactaac ttacagtgtt ccgaaggaac cagctctgct gacattttga ttttagtccc 36601 ttaaaactca tttcacactt ctaatctcca gatttgtaaa aaaataaatc tgtattgttt 36661 taagccatca aatttgtggt aatttgttac agcatcccta atgaactaat acacccaata 36721 catcctaagt gtgtttgcac ataacaacag agtttgatat aaatgtagca aaaataaaaa 36781 agaatgaaat ttgaaaatct acaattatgt caaatagttt tatatccttt tataaataat 36841 tgataaaaca ggcagacaaa aaatcagtaa agatatctat agagggagat aacacaaaca 36901 tggcaaaggt tttacaactg gcaaatctag gttatgggta aggggtgttt attttactat 36961 tctttcaact tttctctaat ttaaaacttt ttaaaataaa aagatgggga aaatcactga 37021 agtatagaga atatatgaaa gataatctat tttttatttg gggcatcatc aaactctatc 37081 atcttactat tagattttat tatttactag aaaaatccta tttgccacat tgattttctt 37141 tttaattttt ttagacaggg tctcgttctg tcacccaggc tggattgcaa tggcacaaac 37201 atagctcact gtggtctcaa cttcccaagc tcaagcctca ggatactgag gctcaacttc 37261 ctctcacctc agcctccaac gtagctggga ccacaggcat gcaccaccac acctagctaa 37321 tttttttatt ttttgtagag acaatgtctc actttgttgc ctaggctggt ctagaactcc 37381 tgagttcaaa caatctgcct gccctcagcc tcccaaagtg ctaagattaa aggtacgaac 37441 ctgcccaatc aacaatgact tttaaataaa ggtcagactg caaatacaga gcctaaagaa 37501 ctgctatttt aaacaggaag tgtgtgtgtg tgtttgtgtg tgtgtgcatg aacattttat 37561 tgtaaaaata tcatgaaaga agagagaatg atcctttgta cccattactg agtttcaaca 37621 attaccatca ttttgccatt cttattttga ccatatccta tctcatgtgt ttgaatgctt 37681 tggcataagc attaaatctt aagtacaaaa agaataatca aaaagagtat gacaataatg 37741 accaagtgaa aagaccataa tgttctataa aaataactgc tctgtagtaa actaaagttg 37801 aaaaagtaaa cacagagagg tcagacaaaa cattgctggg ttaaaataaa attcaaactg 37861 ataaaatatg ccaggctgca aaatattttg gcaattcaag agtaattgat atacaaaaga 37921 gagggagtgg aagttaacag gatagtacca cctaatcctc tgaggacatg aacttaggat 37981 actgaaataa gaacatgaca taagtacaga ccacaggaaa ataccataat gcttaaaaac 38041 tcaaatgcaa tcgtttcagt gtaagaatat cacttaacca aaaattacca ttaatatcaa 38101 actggcctta taagaaaagc tgcctctaca tgaaaataaa gtcttttcta cttaaaaaat 38161 tctaactagg aagaaaaatg gggatttctg aagtcatatt ctatttatag tacatatatt 38221 ctaccatgta ctataaaaag tccagggatg catccatttt cgcaaaaaaa attctacaaa 38281 aaattttcta tttttttcta ttcatcatag tactggaagt cctagccaga acaattaggc 38341 aagaaaagaa ataaaaggca tccaaattgg aaaggaagaa gtaaaattat ctctgttcac 38401 agatgacata atcttagata tagcaaacct tatagattcc acaccaaaaa aaaactgtaa 38461 aaataaacga atttagcgaa gttgtaggat acaaaatcaa cacacaacaa tcagttgcat 38521 ttcagtacac taacagtgaa caatccaaag aggaaattaa gaaaataatc ccatttacaa 38581 ttttagcatc aaaaagagta agacacttaa gaataaactt agagaaaata aagaggcaaa 38641 agacatgtat tctgaacact acaaagtatg gctgaaagaa attaaagaca acacaaataa 38701 atggaagata tcccatgatt ttgaattgaa aggcttaata ttgttaagat gtcaatacta 38761 cccaaattga tctacagatt taatgcaatc cctattgaaa tcccaacagc attttgtgtg 38821 tgcatgcaga aacagaaaaa cgcatcctaa aattcatatg gaatctcaag ggaccccaaa 38881 cagccaaaac aaaagatgca agtttcacac ctgctggctt caaaacatat tacaaagcta 38941 caataataaa aattgtctgg tactggcata aagacagaca tacagatcag tggaatagag 39001 agcccagata taaaccttta cgtgtaaggt caaatgattt ttgacaatgg tgccaacaca 39061 attcaacagg aaaaggacag tgttttcaac aaatggtggt gggaaaactg gatatccata 39121 tgccaaacaa taaaatggta ccattacctt agctatacgc aaaatttaac tcaaaatgga 39181 tcaaagacct aaacataaga gctaaagcta taaaactctt agaagaagca taaaggaaaa 39241 atttcatggc attgaatctg gcaatttctc acatattgac accaaatcac aggcaaaaaa 39301 agttaaaaat aaactgaact acatcaaaac gtaaaatttc catgtaaata ccacaattca 39361 cagagtgaaa aggcaaccta cagaatggga gaaaatattt gcaaataata tttctaataa 39421 agggttagaa tatataaaga aatcctacaa ttaagaaaca aaaaagcaaa taacctgatt 39481 tttaaatggg caaagaatat aaatagacat ttctctaaag atgagagaca ggtggccaac 39541 aagcacatga aaagatgttc aacatcagta atcattggac aaatgcaaat caaaatcaca 39601 atgagatatc acctcacacc agttagaatg gccactatta aaaaaataca caaaacagaa 39661 aataacaact gttggcaagg atgtagaaaa attggaacct ttgtgcattg tgtctagaaa 39721 tgcaaaatgg tgcagctgtt acggaaaacc atatagacct tcctggaaaa aacttaaaat 39781 tagaatcacc atatgataca ataatcccat tttggggcat acacccaaaa aaatggaaaa 39841 caagatctta aaggaatatc tgcatatcca tgttcactgc agctattatt caaaatagcc 39901 aagaggtaaa aagcaaccca aaagtccact ggcagagata aatgcataaa gaaaatgtgg 39961 tatatacact gggtacagtg tgtacactgc ttggacgatg agtgcaccaa aatctcagaa 40021 atcaccactg aagaacttat ttgtgtaagc aaacatcacc tgctcccaaa aacctattga 40081 aataaaaaat aaattaagag ataaaatttt aaaaaaagat aaaaagaaaa agaaaatgtg 40141 gcattataaa cccaatagga tattagcctt aaaaatgaac aaaattggct gggcgcggtg 40201 gctcacgcct gtaatcccag cactttggga ggccaaggtg ggcggatcac taggtcagga 40261 gatcaagacc atcctggcta acacagtgaa accctgtctc tactaaaaat acaaaaaatt 40321 agttggatgt ggtggcaaat gcctgtagtc ccagctactt ggtaggctga gggaggagaa 40381 tcgcttgaac ctgggaggcg gagattgcag tgagctgaga tcatgccact gcactccaac 40441 ctggacgaca gagcgagact ccgtcttaaa aaaaaaaaaa aagaacaaaa tcctgtcaca 40501 tattgcaaca cagattatga ggacattatg caaaatgaaa taagtcagtc acaaatagac 40561 aaacactgtc tgattctact tttatgggtt atctaaagta gtcaaactca tagaaacaga 40621 aagtagaatg gtggttgcca gcaggtgggg gagaggaaaa ttaagagctg ttaaataggt 40681 atagaatttc cattttacaa ggggaaaaaa ttctgggaat ctgttctaca acaatataaa 40741 catagtacta aactgtgcag ttaaaaatga ctaagatggg aaattacatg gtatttgttt 40801 tttccacaat taaaaaagaa aaacagaaga taacaattgt tgaggagtat atagaaaaac 40861 tgaaatctgg ccaggcgcgg tggctcacac ctgtaatccc agcactttgg gaggccaagg 40921 taggtggatc acttgaggtc aggaatttga gaccagcctg accaacatgg tgaaacctcg 40981 tttctactaa aaatacaaaa attagctggg tgtggtggca ggcacctgca atcctagcta 41041 tttgggaggc tgaggcacga gaattgcttg aacccaggag gcaagggttg cagtgagcca 41101 agatcgtgcc actgtactcc agcctgggcg acagacactt tgtcccagaa aaaaaaagaa 41161 aaaagaaaaa ctgaaatcct catatatcac tggcagacat gtaaaatgat gcagtttctg 41221 tggaaaatgg tatggttgtt tctcaaaaga ctaaacatag aattaccata tggtccacca 41281 attctacttc tatatatacc aaaagatatt taacagggat ttgaataggt atttgtaagc 41341 ccatattcat agcagcaata ttcacaatag ccatgaggtg gaagcaattc aaatgtacat 41401 tgatgaatga aggataaaga aaatgtggta tacatacaca atggaatact attctgccat 41461 aaaaaagaat gaaatcacat cattcgtggc aacatggatg aacttggagg atatgatatt 41521 aagtgaaata tgcacagaaa gagaaatacc acatgttctc actcatatgt gggagctaat 41581 taagtggatc tcatagaggt agagaataga ctggtgataa ccagacagaa gttgggaagg 41641 atggggagag ggaaggatga agacaggttg gttattcaca gtagccaaga cactgaagca 41701 acttaaatgt ccatggacag atgaataagc aaaatgtggt atatacatac agcgaattat 41761 tcagccttta aaaggaagga aaatttgacg catgctatac aacatagatg aaacttgaag 41821 acattatgct aaatgaaata agctagtcaa caaacaaaca aatatataat ccctctgatg 41881 tgaggtacct ggagtagtta aaatcataga cacaataaga atggtggttg ccaggggctg 41941 agggtggggg agaatgggca gttagtattt aatgactaca gagtttcagg ttggtacaga 42001 taaaaaagtt ctggagacag gtggtggtga tggttgcaca actatgtgaa tgtactgaat 42061 gccataaaac tacatactta aaattggtta aaatagcaaa ttttgttatg catattttat 42121 gacaaaaaag ttctatttta tgtgtagcat tcattttaaa ctttcattgt aaataaaaat 42181 tcatgaaatg ttaatttttg ctttcaactg atggaaagaa aactaaaatt gaattaaaag 42241 gggaataaag gaatataata aaaattctaa agcaatagag aattaaagca gagttggatc 42301 atgagtgctc agcaaatgtc atcaagacgc agtgtgcctt gtttaaataa aaagtgtagt 42361 atttaataat aattgcaaat agtatgatgt atttataaag caaatataac cctgattcgt 42421 cttttcaaag aagtcaatgt ccatttacat taatacttga attaagtaag acatttcaat 42481 gacactattt gtggtcaaac agcatgtatt gaccaagtca aaaatcagag atctaatttc 42541 aacttcacca tttaccggcc atgcaaacct agacaaggaa tttaatgagc ctcagtttat 42601 tcattcatta aattgggatc tcctctctga attagtatga gtttcaaatg agatatttac 42661 caaatcattt aaaaaccatg aatttttaca acagtgtaag gcattataac acattgtcac 42721 atgatttctt tctgactacg tcaccaagtg ccctttcctc tgcctactct actccaaaaa 42781 tggtgaatag ctagccccag tgcattctgc ttggtattaa tctattatgt tagtagatag 42841 catagccctt tggcatcatt ttaaaacaac aagccaaaac ctgaggttta ggcatcttag 42901 gttaagaacc atagtaactt ctttcttgtt tctgctaaaa atatagaata ctggtgcctg 42961 gaagaaagtg aatatgtctt gccttttact ggatgactat tatcgaatag ccactgtgtg 43021 acaggtgctc tttcagatac ttaaagtcct tccactggac aactattatt gaatagctac 43081 tctgtgacag ttgctctctt agatacttca gaaaggaagt atatgatttt aatatacatt 43141 tcacatgtct atttttaatt tatatatatt tagatgcaca tattaatgta catcttaata 43201 tgcattttta aaccataatc aaatcagaca ttcctcatag aaactgaaat tcttgctttc 43261 ctacaacagt atatactaag cattttttca ttacagtacc tgcaaatttc taaaccaata 43321 tccactttcc ccttgatcat aactaaccaa actataattt tgttcatggt aataaggtgc 43381 ccagctaaat aaatctcctt ttctagcaaa atatgtccat gagaaacagt taacatgcaa 43441 gaaatggcca attagatata tgttccaatc tgtcaaagac ttatgggaaa gttttacttt 43501 ctcaggtagt caatacctct tccttctttt cctacctgga aactagattc aacgcctgaa 43561 gataaagcaa tcttactgag gccctatgga tgaaaataag catagtggac cagaaaaaga 43621 gaagaaactt agtttctatc atcccaaaat atgactcttt gacataaata tttttgagct 43681 aaaggcaatt aagaagcagc aaatggacta agggctcttt ctatcctcca ctcttttcta 43741 ccaagacagg atataaattc tcctttactg gagacaagtc ttaccagctc agaggcagca 43801 ccagagaaat ctgcaaacaa atcttattcc attagtttat tcccataaat ttaccttccc 43861 acagtttccc acctctggaa gcctaaaact gcttttcttt gtcttgtcac ttctctaaaa 43921 tgtattgttc tttgtcaaaa cgttatataa gccagagttt taagccactg ctttatcttt 43981 cgttgaggtt tctcctgagt gatgtgcact gcacacgtta gcaaacttgc ttgtttttct 44041 cttcttactc tgtcttttgt tataggactc tctcccaact acaaacttag gaggactgag 44101 aagttatatt tcctgcccta caatacccct ggactatcac atgagaaaga tgcatctgtt 44161 actgttaagc tactgtatgg gagactttat actgtttaca accaaacaat ggtacttaca 44221 aagaagtaat tataaactag aggcgattta agtgcttgaa aagaccatga gaaggaatat 44281 aatttagatc agagagaaac aggctttatg aagaaaataa attcatacct tcttgttcta 44341 acagaaactt ggctctcctt ctacagtctt tcaagtaggc attgttttct cctttatccc 44401 ttctaccaca tggtctggag gttaggttgt tgtccacttt gcttcctact gacactttca 44461 ggctcttctc cctccaccct cttcaaaact ccagctttga atctcatatc atcaaactat 44521 gccattcatc accacttcct gacctcttct cttctaatga tcctgtgttc aacccaattt 44581 ctgctaccca atcccacgat cattaccgta aggccaacct ctccactatg tacactaggt 44641 tctatgccct cctgcctcat caaggacatt cttccaggaa tgctcccttc actctcttac 44701 atcatcaatt tttcccttct ctacgcaatc attcccatca gatataacta tgatgcaatt 44761 cctacagctt ataaaagaaa aaaaaaaatc tttgaaattt acatcctttt ccagttacta 44821 ctagtttctc tgctgcactt tacaggaaaa ttcctcaggt aagttgtcag tattctattt 44881 ggatttgctc tccccgtttt ctcgagccca ttataaatag gcttttgttc ctaccaccac 44941 caaacctgct cttttctaaa tacattactt ttctgctgct actttaatgt atcttttact 45001 tagaaaatta catttataac tttatatgta gtatatagga aaataattta tataaattga 45061 tcttatttcc ataaatttgg ctaaatttta ttaattatat ttcttaaata tatttaacat 45121 aaccatatta actacatatg taacaattat gtaaaatata attaatagat tataatctaa 45181 tgtaatagat ccttttgagt tttctaagta gactatcatg tcatctgcaa ataaagacaa 45241 ttttgtactt ttgcagggtc tgagagactg ggcaaaagga agtagaagtt ttgtgccaaa 45301 atgacatttc aaaaatggga tacattagaa aacaggaggt gctgaaggag aaggagagaa 45361 ggactctaag tttggtaagg tgaacttgca aaggggagag gcaggcctat gattctgctg 45421 agaaggagct gagcctcttg gatcattgta atcgacacac tgtctaccct gaatttatat 45481 cctcagcccc aacatcctcc atgaatttct tgactaatat atcactgggt agtggaaagg 45541 taggccgaat ttaatacttc caaaatgaac tgttatttct cccataaaat ggctcctcac 45601 tcagtattct ccacatcagt aaagggtact gtgtaagaat ctacttgtct aactgtccct 45661 aaaacattct tttaatcttc ttgagagcag aaatttggtt gttgtatcgc tagtgcttag 45721 agtaatatct aggacatagg agaacttcag tatgtatttg tttgaatgaa ttaattacca 45781 taagtttatt ctggtcttaa ttctcaacac ctcagcagaa atcagtaaaa taactacaac 45841 tttcagcagc tataggatag tttagttttg gttgtttttc ttggttaaaa aaaaaaagct 45901 gagagggatg ttgtaatata atgaaaaata aatatttcat ctttgtccca aatattgtat 45961 caggttcctg gcacagcccc taaaatcctt ggaatctctg gactaataag acggtctttt 46021 gcatgctaat agaatgatgg tggccactag acagcttcag gatgaaggct ggtcaccaga 46081 aagatcaatg catgattaga ggattggaat tttcaatccc accccacacc aacttctggg 46141 gagggaaaag aggctgaaaa ctgagtcaat caccaatggc taatgattta atcaatcatc 46201 gttatgtaat ggaacctcca taaaaacccc taaccacagg gttcagagaa attctgagtt 46261 gatgaataca tcaaggtgct gagaggtcaa cccatgggag agggaagtga agttttgtgc 46321 cccttccccc atactatgct ctatatatct tttctatttg gctgttcctg agttgtatcc 46381 tttattttaa aacaacaaca acaaacagta ataataagtt aagcactttt ctgagctctg 46441 tgagttgttc taaagaatta tcaatcccag gccgggcgtg gtggctcatg cctgtaatcc 46501 cagcactttg ggaggctgag gtgggtggat catgaggtca ggagatcgag accatcctgt 46561 ctaacacagt gaaaccccgt ctctactaaa aatacaaaaa aaattagccg ggcatggtgg 46621 cgggagcctg taatcccagc tactcaggag gctgaggcag gagaatggct tgaacccagg 46681 ggggcaaagg ttgcagtgag ccgagatcgc gccactgcac tccaacctgg gcaacagagt 46741 gagactccgt ctcaaaaaaa aaaaaaaaaa aaaaagaatt accaatcctg aggagggagt 46801 tgtgggaaac cccttaatct acagcaggta gatgagaaat actactgata acctgggact 46861 tataactgcc atctgcagtg ggaacaatct taggggattg agtccttaac ttgtctccat 46921 tctttgtttt tgtcgtttat gcctttttag cccttcactg tcattttagt gaacataaag 46981 gagagagcag aaataaaggt gtgcattcaa tccaatgttt gacctgtaat ccagagctta 47041 cattttagtc aaactagaag attattcccc tgactccaag tgtatgcacc ttcccaatct 47101 ccaaatagcc caataaaatc ttatctttca aaactcagtt caagtgccag ctccacataa 47161 catattttct aaccttccaa attgtaaata acttatccat cttctgaatt ttcacagcac 47221 attacttgta tctcttctgg catttactta gaccatatgt gctaaatttt aatgactaat 47281 gctcaaaaat ctcctccact gctcaaagtt attcacttca ctaaaagagt ttaaattcta 47341 tcaatcaggg atttatttaa taagtcacat tttaaaatag aaatattctt caaaataata 47401 aacctattgg tatgcattag tttagcaatg ctcttttcct tctagcccaa actatgatcc 47461 atgaactgag caaatatgtt atctaaaagg taaggaaaga aatcttacac aatttcaaga 47521 accaagcatc tggtgatgtg gtttggctct gtggccccac ccaaatctca tattgaattg 47581 tcatcttcaa tgttggagga ggggcctggt gggaggtggc tggatcatgt tggcagacat 47641 ccccccctgt tcttgtgata gtcagtgagt tctcaagaga tttggctgtt taaaagtgtg 47701 cagcactttc cccttgggct tgctctcttc cttttgccag ccatgtaaga catgcttgct 47761 tcccctttgc cttccaccat gattataagt ctcctgaggc ttccctagaa gcagaagaat 47821 gtacagcctg cagaaccagg agctaattaa atctccttat caattaccca ttctcaggga 47881 gttctttata ttaatagtaa tgtgagaaca gactaataca tctggccttg acaaaagaga 47941 aagcttgata tggctgtatg tgtcaggagt aacccacagg ttttttgttg tttttttttt 48001 ttaaatactt atgcactaaa ggtaaacatc agcccaacag tatgtcaacc atgttccctc 48061 cctactttca catatatgcc tctggctaca tgttaccttc tggaaataca gtctcatctg 48121 aggtttaaat caatttcacc taaaaaaaaa aaaaaaaagt ctatacgaga aacttgtagc 48181 atgtaatctg tcaccaagag agataattta gggttgcaga ctctttgggt agatggcaga 48241 cccaataata cgctataata gatgtgtttc agagaataca aatacaagat tttaaaagct 48301 ggaaaaagaa acaccagaaa gaaagaacaa aatttagaga tacagtctgg aatctactgt 48361 aataaaatag ctgacctagc agtataactc tccagttcct taatttatac atcctaatct 48421 ctttccaagc ttgagaggag tctataataa tgtttaaaag atgcaacatc aagggttttc 48481 ccttgctgca ggtccatgaa aataggatgt tatatctctc aaattgtatt acctttaaac 48541 taaaataaaa ttacttcaga aaagacaatt cacagaaaaa gagaaaatat ttgcagaacc 48601 caaatgtctg tcagctgatg aaaagaaaaa caaaaggtgg tatatccata catggtatat 48661 tatctggcaa ttacaaagaa tgaagaactg acacatgcta caatgtgaaa gaaccctgaa 48721 aatatatact aagtgaaaga gcccggcata aaagatcata tattatatta tcccatttat 48781 atgaaatatc cacaatagtc aaacctgtga aacagaatat ttgttctaga atattagttg 48841 cctaggactg catggtttgg ggaggaaagg agagtgactg ttaataggta tgaagtggtt 48901 tcggggaata ataaaaatct actaaaatta gtttatggct acaattgcac aactctctgg 48961 atatactaaa agccattgaa ttgcacaagt tataatgggt gaattacatg atatgttaat 49021 tatatcttaa taaagctgtc aaaaaatacc tcaggccaaa ataatatctc taatgtaatg 49081 aagtattatg atagaggaga gactttaaat tatgctgcta ataaaaacta aaaatgtaaa 49141 tgactttgct gacagcccag aaatattttt tttgttaaaa tcctagtctt aatttatggt 49201 ggactcaatc tcttcctcta tcaacatagt ttagaatttt tcctcatgtt tctgtgccat 49261 gaaggactta tgccaactgg catacttatg ttcctatcca ttccactcac agtctctcct 49321 acagccaaat tgatattata tatttcatat attctctacc gtcttctagg tgagctataa 49381 atgagatctc tagtaagaga cacagaaagc agggaaagca ctggactgat ttgtttgtag 49441 gaatctacag attctgcaca aattaaatat ttcagattca gttaccttaa caaaaatgaa 49501 ttagacgcct acctcaaata atcctaatgg actgacagtt gaaagacata gacttacaca 49561 ggaacagttc catcagagta aatatgtgta gaaaactggg cacttatctc tctgttgttt 49621 cttaatcttc aaaacgggga tgatgaatta actggtctct ctatgtccct ctcagttcta 49681 taagtctcta tattctatat tatcctctag tctagctaat actaccacaa tctcctatcc 49741 tcagtaaact ggctggttgc aggatagaac agcaaattaa tcatataatt tatttttaca 49801 atagagtcta aattattcag tattattttt acagagttta aattattcaa gtccatctta 49861 tcatcgctca gagcaataat taaaagtaaa aagttagcca tgaaaagttg cacaggccaa 49921 atggttcagg attttataag ctgaaagata cctgcagttt tcttacactg ttttcgccac 49981 tccaaccctg cttttcagaa ttatacctgt tctttcaccc cctactttat ttaatgaaac 50041 ttgttctaat tcttcaaaca atatggttta ttttgaaact tcatagcaca ttgttaatat 50101 ctttttgaat actgatcata ttaaagttat gtttatactt ctcttagttt tacatatccc 50161 aaatcatcaa atactatatc ataaactcta aagagcaagg actatgcctc tcctgtaata 50221 aaagaatagc gattttcaca taagaaacac atcaaaattg aattgaagtt gagtaacaca 50281 actcttttgg atgaccaaaa atacaattta aaaaagcatc agtcttaaag agtacttctt 50341 tttggattgt gacacacata ccctcataat ttcttataac tgaaactaat aagcagaaat 50401 gtagggcaag cgtagtggct catgcctgta atcctagcac tttgggaggc caaggcgggg 50461 gaactgtttg aacccaggag tttgagacca ccctgggcaa catagcaaaa ccctgtctct 50521 acaaaaaata ccagtggtga caggcactgg tagtcccagc aacttggaaa gctgagatag 50581 gagtatctca ggagcttcca ggagcccagg agttcctgag cccaggagct cgaagctgca 50641 gtgagccgtg attaggccac tgcactccag cctgggcaac acagcaagac cctgtctcaa 50701 aaaaaaatgt tgtagaaatg tagctaataa atattagtgg agttacaaaa acacactctt 50761 cctaggtctt tgttaaattt ttacttaaaa ccatacacat ttctaaaggt ttaagtaaga 50821 gagaatagca taattcctca agatacaata tcctcaacta aaatattcct gaaaagattc 50881 ttcaactaaa ctatcttgca taccacccat aaaatactta agaatattaa acttttctat 50941 acttatgatg agcatacaca aactgaaagc aattcttcaa cacataagag actactattt 51001 acaaacaggt ttactgtgaa tccattttgg acttatgtaa tgaaagatga attggaaaat 51061 tgccttaatc tatcagaaac taactagcaa gtacttatgc cttcatttat tccactatct 51121 tacaaattgt tctatttcat tctttagtgt aaatagaatc ttcttggtat cacaatataa 51181 aattaaataa aaaatattgt gatgactaaa agaatggtat acgatgaaaa gcatttaagt 51241 agttgaatag tgatactgcc ccatcaacca ttcactcatg cactcatcca ttcttttaac 51301 aaatatttat tgagtaatca tgcgctaggc actatgctac aagacacacc aacaaacaaa 51361 agagacactt atacctacgg acttaagtcc taaaattgtc aaaaatacat aaatacaggg 51421 ccactgtgta gggatacaca gactgcccac tgcacaactt gacaagtgcc actcataaag 51481 acctcattga gaatagtgcc ccttgcaaaa aaggcaccac tcagcagctc taggaacaat 51541 ctatgtcatt acagagggag cagccatatt cacatgatat gacaaacttg accagttcaa 51601 gaataatctg agacaaggta aatcagtgat agtcttctat attcaaccat tcctttaact 51661 tatcaaaaat atctcaagat ccttaatctt caaatcgcat attcaaaaat agtaagtcct 51721 gagaccttta agaaacacct tatttgctta agctagctaa gttgacacat aaaactaatt 51781 tcaactattt caattttgaa tgaataactg gagttttaga gaggctaaat aacttgctta 51841 aaattacact gcagacctgg gattatatat ggctaattag gtaatacgga actgacagtt 51901 tgtcatctta ggcacacata cttacggtaa ccagccttca acatgatccc ccaatgatcc 51961 ctgcgtcctg gtaatcacat tcctctgtaa tgtaattcat tattatccca gggttggtcc 52021 atgtgaacaa cagaatacag caaacataac gatacatcat attagagatt tgactttaaa 52081 aaacattatg gcttctatac tgtgtaccca catgctctca ctgtctctct ctctctcccc 52141 actcacactg ggaagtcagc tgccacatca tgcacagccc tggaaagagg ccctcatggt 52201 gaggaatgga ggcctaccaa caaccatgtg agtcagcttg aaagtacatc cttccaccac 52261 agtcacaccc tggctgacaa cttcactaca acctcatgag aggtcctaag ccagaatcac 52321 ccagctaagc cacttctgga ttcccacccc tttgcattag ttaactattg ctgccatgac 52381 aaattacaac aaaattagtg gctctacaca atacaattta ctatctcagt tctgtaggtt 52441 agaagtccag tgggctcaat tggctttttc tgctctgggt gttgcaaacc tgaaatcaag 52501 gtattggctg tctgtgttac ttaatggagg ctctgggaaa gaactgcttc cagattcagc 52561 caggtttttg acaaggttca gttccttgta gttgcagcac tgactttctg gtttccttgc 52621 tgcctgtcag ctgggggata ttctcagctt ctagaggcca cccacattcc ttggctcata 52681 gctccctact tccatctcac gctttgaatc tctctgaaca cccttctgcc tcacctctct 52741 gactcatctc ttccaacatc agctgggaaa agttctccac ttttaaggtc tcatgtatta 52801 cttagcccat atctaggtcc ataattgtaa ccatattttc aaagtctctt tcaccacata 52861 atgtaacata tttgggggtt ttaggaaata ggacatggac atctttaggg ggctattatt 52921 cttcagaaat gatgtaattt cataaatgtt tgttgtttta aactttcact ttgtcacacg 52981 ataatataca tacgaagagg aaaagtgtcc tgaagttatt aaccaggata cagttatata 53041 taaggtgttc agtatatcac atatttacat acaaacacac atacttgaca aggtattata 53101 ttagaggcta taggcattat agcatttttc atacagtatc tcataatact ctgtcaataa 53161 ccatatatta tctataagat gaaactccag gtcattggta gtttagatta aatcccagtt 53221 atttcaaatt caagctatat tatgacatct gtttctcttt ttcttttaaa atctaaagtt 53281 ccaggttaca gcatttcata aatcaggctt ttattcggat tataaagtct tcaaaaataa 53341 ctttccacaa attaatttta ttcttactga cagtcgggca cagtggctca cacccgtaat 53401 cccagcactg tgggaggctg aggtgggtgg atcacctgag ttcaggagtt cgagaccagc 53461 ctggccaata tggtgaaacc ccatctctac taaaaataca aaaattagtt gggcgtggtg 53521 gcacacgctt gtaatcccag ctacatggga ggctgaggca ggagaatcac ttgaactcgg 53581 gaggtggagg ttgcagtgag ccaagatcat gccactgtac tccagcctgg acgacagagt 53641 aagactctgt ctcaaaaata ataataatat tcttactgac accctcctcc cccaaaacat 53701 ggaatgttat attgctgaac agctgaaatt acttcttcat tgaaagaggc atatgaaaaa 53761 tgagttcaca ctaatgttca ccttgatctc aatacttgtg tgattaccat attcctgaag 53821 ctgtcttgcg gaggtcacca accatgtttt ttctttcttt cttttttttg gctaaatcct 53881 aaggctactc ttcaactctc atctacaatg ttgatatttc atccttaaaa caccatgctt 53941 tcttagtttc catcatttca taccatccca cctctctagc taatgcctgt tagtcttttg 54001 gtcttctctt cctatgctca ccccttacat gttagaattc ctcaggatat ctgtcctatt 54061 tccccccttt tgtggcaact ctcagatctg tgtcttctgc ccttttcttt ctccaggact 54121 tcaaatccac atatacagtt tcctattagc tctctacact tggatgtcta ataagcacct 54181 caaattcaac atgtcccaga gtacagttga ttcttgaata ggtttgaaat gtgcaggtcc 54241 acatatacac agattttttt cccaataaat acattggaaa tttttttgga gatctgatgc 54301 aacaatttga aaaaacttgc agatgaacca cacagcctag aaatatcaaa aaaattaaga 54361 aaaatatatg tcatgaatat ataaaatata tgtggatacc agtctatttt attactacca 54421 taaaatatac acaaatatat tataaaaatt aaaatttatc aaaacttatg aaaacacctg 54481 cagaccatac atggcaccat tcacagttga gaaaaatgta aacaaacata aagatgtggt 54541 attaaatcat cactgcacaa aattaactgt agtacgaaaa tcaaacacca catgttctca 54601 ctgataagtg ggagttgaac aataagaaca catggacaca gggaggggaa catcacacac 54661 aggggcctgt cagagggtgg ggggcaaggc gagggagaga attaggacaa atacctaatg 54721 catacgggac ttaaaaccta gatgatgggt tgatagatgc agcaaaccac catggcacat 54781 gtatacgtgc acgttctcca catgtatcca aacctgcaca ttctgcacat gtatccaaga 54841 acttaaagta aaagaaaaaa gaaaaattaa ctgtagttca tactgtacta ctgtaacaat 54901 ttggtagcca catccttttg ctattgctgt gatctcaagt gttacaatta tccccttaaa 54961 tgttaagtga tgctatcatc ttcacttgag cagtttgtct ctctctagta ttgtgttatc 55021 acaatagaaa gtgatctctt gcagttctca cttatttttc atagttttta atgcaatacc 55081 ataaaccttg aaaaacttca taggacccat atgaagtgcc cttagtgatg ttgttggaag 55141 tgtttccaag aagcagagaa aagacaggac attacaagaa aaagctgaat tgcttgatat 55201 gtgctgaatt ttgagacctg cagctatggt tgcccaccat ctcaagataa atgaatccgg 55261 cataaggacc actgtaaaaa aagaaaagaa aaattgtgaa gccattgctg cagctacagc 55321 agcacttttt gcaaaatatc tttttatctc atattgaaaa tgtagctttt atgtgggtgc 55381 atgattgcta caaaaaaggc acacctatat gattcaagaa atagtgaggt catatgacaa 55441 cttaaagcaa aaagaaggta aaggatctaa agctggagaa tttaatccca gcaaaggatg 55501 gtttaataag tttagaaagg ggtgtggctt taaaaatatc aagatagtcc tctctgggct 55561 tgtgcctagt tcacagctac atagctaaaa ctcaccaagt aggttgaaat cactggtaaa 55621 tatgggattg ctatggtgcc tccctctgga aaacactgaa gaaaactgaa atcagccatc 55681 ctgccaagta cacttgctcc ttctgtggct aaaccaagat gaagaaacaa gcagttaggg 55741 atcttacact gtggttccta catgaaaaca gtagctgatg gtgcctggat ctacaatgcc 55801 acttccacca acatggcaaa gatggccatc agaagactga agaaattgaa aggccagtag 55861 aagctctatc atttgagaca acactagcct agaataaagg tttaatttat gtaacaagaa 55921 caaaaaaatc aaggtaacag gagaagcagc ttctgctgct caagaggaag cagatgagtt 55981 cccaggtgct attaagaaaa tcattgaaga gaaaggatat atgcctgaac aggtttttaa 56041 tgtggacaaa agtaccctat tctggaaaaa catgtcataa aggtcattta ttagtaagga 56101 agaaaagcaa gcaccaggat ttaaggcagt aagagatggg ctagctccat tgtattgtgc 56161 aaatgcagtt gggcttatga tcaggaggat tgtccttatc tattaaagct actaacccca 56221 agatttgaag gaaaaagata aacactactg ccagtctttt ggttgttcaa caagaaggcc 56281 tggacaacga gaacaccttt tcttaattgc gtccactgtt ggtttgtccc tgaagtcagt 56341 accttgcctt aaagttattt tgatatcaga caacgccctg gccacccaga accccatgag 56401 gttcaacatc cagggcatca aagtagacta cttgcgccaa ccacaacaac tctaattcaa 56461 catctagatc tggtgtgata aggaccttta aggctcatta catatggtac tctatggaaa 56521 gggctgtcaa tgctatagaa gagaactcca atagagagaa cataatttaa gtctggaagg 56581 attacaccac tgaacatgcc atcgctgtta cagaaaaagc catgaaaacc atcaggcctg 56641 aatcaataaa ttactgctgg agaaaactgg atccagatgt tgtgcatgac ttcacaggat 56701 ttacagcaga gccaatcaaa gaaaccatga aagagatcat agatatgaga aaaaggtggc 56761 agtggtgaag ggtttcagga gtgtttccaa accagtgcca gaagataagg aagaagatgt 56821 agaagaagca gtgccagaaa acaaattgac attagacaat ctagcagaat ggttctaatt 56881 attcaagact acttttgact tcttttataa catggaccct tccatgttac aggcgctgaa 56941 actaaagtaa atggtgaaag gattggtacc atacagaaac atttttagag aaatgaaaaa 57001 gcaaaaaagt cagacagaaa ttgccatgta tttctgtcaa gttacactga gtgtgcctgc 57061 ctttcctgcc tactcttcta cctccctcca tctcttctgc ctctgcgacc ctaagacaaa 57121 accgacactt cttcttcctc ctactcctca gccgactcaa catgaagacg atgtgaatga 57181 aaatctttga tgatccactt ccaattaatg aacattaaat atactttctc tttcttatca 57241 ttttcttaat aacattttct tttctctagc ttattttact gtaagaatgc agtatataat 57301 acaaataaca tacaaaatat gtgttaatca actatttatg ttattggtaa ggcttccagt 57361 caacaattgg ctattagcag ttaagtttct ggggagtcaa aagttataca tggatttttg 57421 actgcatggg gagtcagcaa ccctagcccc cacattgttc aaggattaat tatgtacata 57481 tgattatccc tcacagaacc attcctccta cgaatggttc agtgagtaga cctactatcc 57541 atttaatttt gcaagtcagc cctacttctt taaccacatc tcactgctcc tatatctcct 57601 taaaacagat cccattactt tgttccatcc tcaatgctac tcccttaatt cttatcacca 57661 tgaactcttg cctggcatac tggacttccc tgtctctaat tatgctccca atattccaat 57721 ccatttgaca ttattgtcag agtgatcttt ctaaaataca agtcggatta cattactccc 57781 ttggagtcat caatgctctc ctgtaaaagt tgaaattccc taatatggct tggatggcca 57841 tataatattt ggccctatct ctcaacgctt ccttgtctga tatggtttgg ctgcatcccc 57901 acccaaatct caccttgaat tgtaactccc acagttccca cctgtcatga aaggaacccg 57961 gtgggaggta attgaatcgt ggcagcgggt ctttcccatg ctgttctctt gatagtgaat 58021 aagtctcaca agatctgatg gttttgaaaa tgggagtttc tgatggtttt aaaaacggga 58081 gtttccttgc acaagcactc tctttggccc ctatcatcca tgagagacat gactttctct 58141 tccttgcctt tccccatgat tgtgaggcct ccccagccat gtgaaactgt aagtccatta 58201 aaccgttttt tcttcccagt ctcgggtatg tctttatcaa cagtgtgaaa acggactaat 58261 acactgtcct tcttttcttg tattccccat ctttaatttc ttctaggaaa ggctgtctct 58321 ttgtatgata cactttcacc aatctgaaca agcacccatc cccttgatct ggcaaatact 58381 tgagtattct tcatatctag tttatccact tcctctggag agccttccat gatcatttcc 58441 ctgcagacta gtgtctgtct tatgtaattg catagcacct tatacttaat ctttagtact 58501 gcacttactg tatactatca tagtgtattc atctgcatct tttactaaac tacaaatgca 58561 ttgtcattgt tcccttagca ccttataccg ttttggacac acagaatgta atgaatattt 58621 gttaaatgtt taaccaagca atcagacctt cctatggtga ttttcaaaat aatctgaaca 58681 atagtcacag gtctacaaaa taaaagtgag taagcattac aactgaaagt acaggagtta 58741 aatctctcat attaatatag tacattcaat agatataaac aataggaatt tcaagatttt 58801 cccagttttt aaatgtgagt tttgagaatt gctgcatatg gtattaataa tgagaatctg 58861 aggaatatat agataactaa caaggatgaa attgggtaag taagcaaagc taagctcata 58921 gattagctaa atgactctct tcctgaacat cagcaaaacc aaatgtattt tatactcttg 58981 tctcagcaaa taaacaagtt actataaatt aagaatcttc tataacgagg aagaacagtc 59041 aaaatctaaa tgcaaaatca cctaatgttt tcacagagca atttccctca gattacatca 59101 gtcaaataac acttttactg gtctcctttc taaagctagt aaaacaatgg tatttttaac 59161 ttatctgaaa tatcttcatt acatcacatg ttcaaaaaga gagcacaata tggcacagat 59221 tttgattcta ctagtataag agttcatttg tgattcaaaa gattcaaata cttaacacag 59281 ctcttataat accaataaac tgtctttatt ctaggctaaa tgtttccaaa tccaagtgca 59341 atgtcaaaca tgaatttaaa aatattaaat tattgtatta ctgatccttc agatgattca 59401 taccacatct cttcaatttt gaataattct gagttatgct tgtatatgtg aaatatccaa 59461 atcaaaacaa ataagaaatt tactcagata aaaaattcat taatgatact attttaagtt 59521 catcaaaagg gaaatattca acagtaaaag actatttcaa aactaattca atatgacaaa 59581 agcaagcaat taaaatttaa gacacaagca gttgcctagt gtatataatt gaaaaataat 59641 aaaattatgc ttaataaaat atctaaacac agaatccaat atgatgaaac cacttaaaag 59701 aacaacatga atacaatcct tgaaatctta aatgatttta aatttgaatt taactctgag 59761 aaaatgatat gaaatctgaa aagtaaaatc tacagatgga agtctagtac tgtgtgtttt 59821 acctctggga gttgcaggct gcaccatgat tctattctta cattacaaat ctgttgtgtg 59881 gggaacgggg gaggaataac tgtattacag ggaaaaaata atagaacaaa ttgttgtaag 59941 aacatgtagt tttttaacgt ctttattctt tacaagtaat cttattcttt aaagtatttt 60001 ttatataaat ggactcatct atatttaaaa cacctaatgc cctttgtgtt tcttagtata 60061 aggatgatcc tgaagaagcc actgctactt aactcacatt atagtttttt aaacccatag 60121 tatcataaga aaaatgaaga agagattctt ttgtattttt tcttccccaa cataaaccat 60181 aacaccaatg caaaccacta tcaaggagag gaaaggagag ctgctggtta ggacagcagc 60241 tgtgttgaag ctcagaaaag acatgggttg gcagatttgt gaggtgatgc ctgcaacaca 60301 atgtgcctag atctagtctg attcctaaaa ttaaaaagaa aaaaaaaatc tgtacttaat 60361 attttgcctt ttagagagta tttccttccc catcgcagaa gatttttgct tgagatgttc 60421 taatatgtct taatatgttg tcattggagg aaaaaaagct gttaatttat gaaataaaga 60481 aaatgttgcc aagattgact cagaaatgag tatgtaaaac accttatacc taagtacttt 60541 attacagttt aaagaccatt attttttcca tttgcaattt aaactcaact aaaagtacat 60601 aataagataa gctgtcgtgg tggatggcat gaatacaaaa tttaaagcag caggaataaa 60661 gtgagaagag agcatgttaa cagagtatag ctggaagagt cagggaagat aatacatagg 60721 agtgcaaact agaacaacgt gtgaatgaat gtggaagggg aatccacaaa gtgggtttag 60781 gcttaaacga aaggacatgc atctatcttc cctcaacaca agaatgaaag atagacaaat 60841 aattgaaggt ctagatacac ttttagatga gatgcacagg aagcaaaaag ttgataaaat 60901 ttatttgttt ttgcaataaa gtaaaaagta aggtctcctg gtaaaaatga aaggggaagg 60961 ttatgtgaac aaagtggtgg tggtctcagc cctcaagctg aaaatgtttt tacattttta 61021 aaaggtcaaa aaagaaaaca aagaatatgt gacacagact atatacagcc cacaaagcct 61081 aaaatattta ttatctggcc ttttgtagaa aaagtttgct aactgatggt atatgtaata 61141 acacctgtga gagagagaga cagagaaatg actagagatc atagggcata aatttgtata 61201 gtttatgatt tttttctcca gcacatcttt tcctccaaca acaaaaaaat tatggaagca 61261 tttatgaaag aaaactgtcc agagcagatc tgttgtttgt ttgtttttga gacagagtct 61321 cactccatcc acctaggctg gagtgcactg gtgctatctc tgcacactgc aacctctgcc 61381 tcccgcgttc aagcgattct tgtgcctcag cctcccaagc agctgggact acagacacgc 61441 accaccacac ctggctaatt ttttttttta ttttttagta gagacagggt ttccccatgt 61501 tggccaggct ggtctcgaac tcctgacctc aagtgatctg cccgcctcgg cctcccaaag 61561 tgctgagatt acaggcatga gccactgtgc ctggcccaga gcagatattt taaacatcaa 61621 gaatcagatc tgcatgattc agaatattgc taacctatcg gttctaacat aagaaacaag 61681 gagtaaaaag taaaacaacc accaaagatt ttcaataact ttaagacata actttaagat 61741 accagtaaaa gaaatgcctc aaggtagttc ataattgcta aaagaagtgc catgacaatt 61801 aagttgtaaa aacagattta aaacttctac ttccaaccta acagaataat tggtatagga 61861 tttaccctag caactagaaa atggggcaga atttatgaaa ctactctttt cagatattag 61921 acagcaggca atgcaggact gtgatcccta aaatatagct ttcagctaaa tatactttag 61981 ctttcggcta aatatacttt cccctataat gtaaggggaa aacctcagta aacacaacga 62041 actcaattag ttgaagggac aaatttgagt taaggaaggg ctgagaaagc tggaattctc 62101 aggtcagaat aacagaaaaa aggtaagcta agcagagaaa cagtaccaga aaattctata 62161 tagggactct tgaatcttca aagaaatatt aagttataaa tgtacacagt gagactcaaa 62221 aaggccaggc aaagaaccac taccagaaaa tctgcaaaac tgatcaactc tcaaactcac 62281 aggatactgc aaaatattct agcttggaat gctaaaaagg gagataccat taaataacag 62341 gagcattcaa tatggctcta tgaaggagta tgccttaaca gtagggctaa actagcctta 62401 gaataaatgt tcctctgaag ctgccctaat atagcttaaa aacaagtgtg aaaggatcaa 62461 gttgatccac aagtaactta actgcctgat agaacaaagt acaacattca ctaatggata 62521 acaaaatcca ttcatagtat tccaacaaca tgaccagcac ccaataacaa attattagaa 62581 atactagaat gcaaaaaaga tgacaaaatt acatgacaaa gatattaaaa tatctattaa 62641 aagtatattc acagatttaa agaaacacat gaatagagag aaattgtaac tatagaaatt 62701 aatcaaataa aactttccag gctgaaaaac acaatatttg aactgaaaag ttccttagag 62761 gggcttaaca acaaatcaga cactataaaa gattaaaaag agtgatgaaa caggaaccag 62821 agcaaccaaa actacccaaa caaattaaag ctatgaggag aaaaaaaaaa aaaaaagcat 62881 caatgacatg gggaaaaata ccaagtctaa aatatgagtt tttgtttgtt tctttgtttt 62941 tttgagacag agtctcactc tgtcgcccag gctggagtgc aagtggcgca atctcagctc 63001 actgcaagcg ccgcctccca ggttcatgcc attctcctgc ctcagcctcc caagtagctg 63061 ggactacagg tgcccgccac catgcccggc taattttttg tatttttcag tagagacagg 63121 gtttcaccat gttagccagg atggtctcga tctcctgaag tcgtgatccc ccccgcctca 63181 gcctcccaaa gtgctgggat tacaggcgtg agccactgcg cccggcctaa aataggagtt 63241 tttgaagtct aagaaagaga ataatgttct aaggaaaaaa agtatttgaa gaaacaatga 63301 aataattttt tcaaatttga tgaaaaaaca aaaacaacaa ataacaatag cagcaacaac 63361 cacaacaaaa acactgatcc atgaatatca gtgaacccca agcaaggaag gagagtacac 63421 acaaaaaatt acaccacggt acatcacaat gaaaattata aaaacaatga aataatgagg 63481 aactcttaaa agaagctaga ggaaagaaga tacatcacag aactgaggaa caaagataag 63541 aatgatcaca gacttctcat cagaaaccaa gcatgccata aaacaacaaa acattttttg 63601 aatgatggaa gaataaagtg tgtgtataga gctattattg agtgaaagtc aaaaatgaag 63661 acaaaaagaa catattttga caagaaaaac agaaagaatt tgttgcaaac aaccctgcac 63721 tacaaaagct gtcacagtaa gttctttagg ctgatggaaa atataccaga aggtatccct 63781 gatgtacaaa aagaaatgaa caatgccatt gtaaatatgc ggataaacat aaacttctta 63841 aaaaactgaa agataactgg ctatttaaaa caaaaatagt gacactgtac tatgaagtaa 63901 aaaacagcag aaaagatgga aggggccaat agaaattata ttgtattaat agtaaggttc 63961 taacattata catgatgtgc tattacttca aactagactg tgtgagttaa gcatgtatat 64021 caaaaaacct aaaacaacca ttaaaacaag aaaactaaaa aatatagcta agccaggtgc 64081 agtggctcac atgtgtaatc ccaacacatt gggaggccaa cacagaagga tcacatgagg 64141 ccaggagttg aagaccccag ccttaggcaa catagcaaga ttccatctct acaaaaatac 64201 acaaaaatta gccaggtgtg gtggtaagtg cttgtagttt cagctactca ggaggctgag 64261 gcaggaagat cacttgagcc cgagaagtca aagttgcagt gagccaagat cattccactg 64321 cattccagcc taggcaacag agtgagaatt catgtatttc atgtacatta tatgtatata 64381 taacaaatat atgtacatat gttatatata catatattat atgtacatat atacacacac 64441 atttatatta ttatatatat acagccaata agtcagcaga gaaggtaaaa tggaaatgta 64501 aaaattactc aatccaaaaa agggtaggaa acagaaaaaa tgaacaagga atacatggaa 64561 caaacagaac aaatgagagg cagaagatgt aaccctaacc atatgaaaat tagattaaat 64621 gcaaatactc tgcactctaa ttaaaagaca aaaatcaaat tggatttaaa aagcaaaaac 64681 aaaccattat tatcaacaga aaatacacct caaaaataaa caagttaagt taaaagtgaa 64741 ggttaaaaaa tagtctatat atatgatgca aactatatag aatgaacctg gaatagcttt 64801 actgacagga gaaaaggatt atcattgagc acaaagacat gtatttcata gcaatcctaa 64861 atgtgcatgc acccgataaa aacacatgaa acaaaaacca gatgaagtca aaaaactaaa 64921 actgacaaat ctgcaattat aggcaaggat ttcacaactc tttttttcag taaaaaaaaa 64981 aattatcttt attttaaatg ttaatagaac agggccaggt gtggtggctc acgactgtaa 65041 ttccagcact ttgagaggcc agggcaggag gatccttgag accagcagtt taagaccagc 65101 ctgggcaacc taaggagacc ccatatctac tgggaaaaat aaaaaaagcc aggtactgtg 65161 gtacacacct gtggtcccag ctacctggga ggctgaggaa ggaggattgc ctgagtccag 65221 gaggttgagg ctgcagtgag ctgtgatagc accactgtac tcctgcctgg gtgatgcagc 65281 aagattctgg ctcagggaaa aaaaaaaaaa gttaatagaa cagtaaaaac attatcggta 65341 aggctagaga agattgtaca atactatcat taacttgacc tgtttgacat ttatagaaca 65401 ctactcctaa caatgagaga aagcacatta ttttcaagag tacacagcac aatcaccaag 65461 atagattata tactgagcca ttaaactaac ctaaacaaat ttaaaagaac agaaatcata 65521 caaatcatgt tctctgacca taatgaaatt aaattagaaa ccgataacaa aaaattttga 65581 aaaatcacca agtatttaaa aactagaaaa atacacttgg ccgagcacgg tggctcatgc 65641 ctgtaatccc aacacttttg gaggctgagg ttggaggatt gcttgagtcc aggagttcaa 65701 gaccagccca ggcaacacgg tgagaccctg tctctacaaa aaatacaaaa attagctggg 65761 cctggtggca catgtctgta gtgccaccta cttggaaggg tgaggtggga gaatctcttg 65821 agcccaggag gtcgaggcta cagtgaccag taatcgtacc accctgggtg agagagcaaa 65881 actctgcctc aaaaagaaaa aaaaaacaac atactttgaa ataacccaag taacagcaaa 65941 gagatacttc aggaaaatta gaaagcatac tgaactaagc gatataaaat aaaacatcca 66001 aagttgtgga atgttggtaa aacaatactg aaccacgtat aaaaaaaaat ttaaattgat 66061 gaaagaaact tttaccaaag tagagaagaa aggaaatact aaagagtaga aatcaataaa 66121 acagaaaaag aacataacaa cattgttcat taagaacaaa gaaatcaaaa ggtaattatt 66181 tctgaagatt gagaaaattg gtaaacttgt aggcaaactc aaaaagaaaa aaagatgata 66241 gaaattgtca aatatcagta ataaaaggga agatatctgc acagatccta cagataatta 66301 aaatacaatg aggaaatatt aaaaacaagt tgatgttaat aaatttagca acttttaaat 66361 gtaaactgat tattcaaaag atatatatta aaacacagaa gaaatagaaa atatgaatct 66421 attatctatt aaattatttg aatttataat ttaaaacttt gaacaacata aaacttcttg 66481 gctcaaatgt cttctctagt gaattttatt aaacatttaa gaaaggacta actctgatct 66541 tacaaaaaca ttttcaaaat tggagaaaaa gaacatcttt caactcactg gagtccagca 66601 ttgtcctgat tccaaaacca gatcaaaata ttaaaataaa agaaaatgac agaccaatag 66661 gccttatgac tacataggca aaagattttt taaaaataat tgttgataca accatgcaat 66721 atctaaaaag gtagtactat caaaataggt ttatgtcaaa aatgcaaggt tggcttaaca 66781 ttaaaaaaac aaatccatgt aatttatcat gttactagat taaatgagaa aaaccatatg 66841 aaccctcatt aaatggaaaa aatcttaaat aatgtaatat gcattcctga taaaaattct 66901 caacaaagaa ggaaacttta tcaacctgaa aaatgatatc cataaaaacc gtatacctag 66961 catcaaacct aataatgaaa gattgaatac tttccccctg aagacatgga aaaaagcagt 67021 tatgtctgct ctcaccattt ctactcaata ttacttgtgg tcttggctac ggcaataagg 67081 caagaccaaa acagatttaa gattgaaaag gatgacataa acctttgttt attcaaatat 67141 aattctgtaa gaaggaaatc gtaaggaatc aagaaaagaa taaataaatg agtttggcaa 67201 aatcatatgg tttaagttca atattcaaaa ttcaacagta tttctttata taaacaaatc 67261 aaaattttaa aatattgttt gtaatagtat caaaaatgtt atgagtctag cactacccaa 67321 cttgacaaaa tatgtgtgag acctctacac tgaaaattac aaaatattac caagagaaat 67381 taaagcaacc taaataagtg gagaaatata ccatgtttaa agatcggagg actcaataca 67441 aaggtgtgaa tgttccctaa cagatagaca ggttcattgt gatatcaatt aaaatcccag 67501 cttccatatg agatgcacaa agctggacag aacatagctc acactgtaac tacaaaaaag 67561 aacaagggaa aaacaaatga tctacaaata gaaacttctt gaactcatca gagtgctgca 67621 gttactggga aactaaccaa tccaaattct aatgaaaggc aggacacatg ggacgtgagc 67681 acttgagtac ctggggaaga caatactaga caccagtaag aataattcag tcaaaaccgt 67741 taatgaattg ctaaaggcta agtgtgtgtt cacaagaaaa tatagaagcc agaggatcta 67801 cagacacaaa gggaattcat acccattcat ggacttttgc aaagatctca ccagatgttc 67861 acaggcatac acaaaaagac tgagtgtatg gtgaaaggcc tgacaaagca tttctcatgg 67921 tgcaggcttg agggaagcat gaagccactg agggaaagca tgaagtccag gctggatcct 67981 ttctaccatc tttaatatgg ggggtaaggg gtcagagggg agccttaaac cccagaggga 68041 aaggcaacaa ccccaatatg ctttgagcac tgttgaaaac acattagaac taggagaaga 68101 aaaatgccta aaaaaaaaaa aaaaaaaact ttagccatga aggaggaaaa ggaattcatc 68161 ctacaagtgg aagcaggggt gggacactga ttgaagtccc tattcccaaa aaccaggaca 68221 caatgattgc ccagactatt atagtggatt tattaccata tctgatacaa taacgaatcc 68281 aggtagaaat gatggcaatc tgggaggtac aggtttgagt aagccagact gtctgactgg 68341 caactataat ttgttttcac aatttgcagt cccaaagaaa agttgtttta aatctgttag 68401 tcttctaaca gcagacctga aggttaggct attggcaaca ggccaggagc cagtaaaaat 68461 ataaaaaggt taatcaaggg gcatattggt caaagctata agaatggagt gatggaacac 68521 actctcactt ttctgtagct ggcactgacc ctgaacaact gcatcagctt ttaattgggc 68581 tgaactgtaa gtgaaacagg cccaggtatt tataagaaat tccttaaatc cagggccgta 68641 atgagtcaat tgctttgctt caggtggcaa agtgagaaat aagcttttcc ccaaaagggg 68701 taggttctac ttcatggaac gccaaaatga tactaaggcc aggtcagctg catccctttt 68761 caaagctgta tctgcctttt caaaagagat gcacttggta gctgggtcag gcaggtggca 68821 ctgctgtact gggaaatggt tgagttctcc agcagtcatg gagcctttaa ccaaagttgt 68881 gattttcttt tcttccctcc tgggatgata cattattttt gctgaatcac ccagggaggg 68941 gaaaggggac aggggaacac ctctttacat gcagcagccc atgaatagga gaatctcctc 69001 taaatctgta agtattcata gtgtaagaac ataaaattat caatactaat tatacattta 69061 gcagaggaac ccaaaacaat agtacactga ataggcccag aggctccatc tacattttag 69121 ctttttaaaa tttgagtgtt ttaccccctg taggaacaca gaccaaggaa tttaacatat 69181 gcacaaaagg gactacagcc atagctgcca ggagactaca ttatttccta accaggagca 69241 tggggaagac aaggtagcgg ggaaaatcta ttctgcagag gactgtggtg gtgggaaggc 69301 ttttcttctg ctttggtttc tcactccagg gtgacgctgt gctcctggaa gtgcttgggc 69361 ctaggggtct ctaatttgcc cctgtgcatg ctgctagaac tccaccctgc agtcctcagt 69421 gccttcagcc taccttgaca cctactgtgc ttttttccca caggaccacg taggatggcg 69481 aattgagcat ctctattcaa cagagctaaa aaaaagtctg aatttctctc cctccctaag 69541 gttccctacc atatttggct tttatcttaa gcagctgagg ttagaataga gggggtagag 69601 aaaaatggga gtccacaaag taaaagagca gctatgtacc agtcagcaag actgcattta 69661 cattcaattt caagtggctg cccactcaga ctccacaaat gaacttataa tatacttagt 69721 ctacccaaag ctccacatcc tcattgctga catcttttat gttagctctc agaagccctt 69781 cattcagcag ctacagccat actgcctttt gccttagggc cctgaggctt gctgttttgt 69841 tgatataata ataatttttc ctcctccagc ctagagagtg agcaataacc aaggcactat 69901 ttgagaagct tcattcattt ccacccctca ctgcttagcc tcaaaaaatt cattaacagg 69961 caggtcgtta ggagactctt actctgtact actaccccac cacttaaaca agagcattcg 70021 ctactggtct catatcctct aaaccagccc ataagagccc tgtcatttct ctttcaggaa 70081 accttatctc tgtcaacatc atatgcttgt agccaaaaat gtcaaaaatt gtttagggtc 70141 tgggattcta ccctatttat aagctgtgaa gctagcctgt tactgtttca tggatgctaa 70201 cataagatgt aagatgcctg ggtccgaacc aaagaactgt attacccatg atgcagcaag 70261 cagtatgagc atgatatcag tgaaaattca ccttgccctc caagtccctg gcaaagggac 70321 tgtggatgct ctgtatacta cgggtttttt cacagctgaa gagcactgag cttgggaaag 70381 ttgccatttt tataacaaag aaaagcaagt ctgcttgcat tctatttgtc tagggtgtca 70441 ttacttcatt cctcaaggtt gctagctaca accccttcct gagaaatgga tggagtaaag 70501 ggtagttctg ttagggcttt gcattctcag catatccggc aagaatgttc agggatattc 70561 agggcttatg gggattgttt ctcccaacac actcccattc aacctaatac cagaggtcct 70621 aggcagtaca agaaggcaag aatataagta agtctcacat gtttcaggaa aaagtaaact 70681 atctctattt gcagacaatt gtctatatag aaaataccat ggaattcaca caaaaaactc 70741 ccaagactaa tatatgaatt tagtaaagtt gcaggatatg agttcaatat gatcctaatt 70801 acatatacta acaataaaca attggatttt tttaaaaaat taggtgctac ttcaacagta 70861 ctgaagaaat aattaaatac aaatctaaca aaatatgtgc aggatctgta ggccaagaac 70921 tacaaaatat tgacgaaata aatgtaaaat gccctaaaaa tagtagagat actatgttca 70981 taagttagaa gactcaatac tgttaagatg tcatttcttt ccaaactgat ctacagattc 71041 tctgcaatac caatcaaaat aacagcagga ctttttatag atattgcacc cctaaacaga 71101 gatcagaagg taggagtaac tgaagaaaca ctgctgaata tacctggcag aaagacacca 71161 gtgattacag tctttaagaa ttattaatta aactcttaaa atgcattcat tttcccaaat 71221 ggatgactca agtagtatag tggatgttac tgagtttata gtagaactac agcctatagt 71281 aaaactctgc cattctcagg ctctttatat aaagaatggc agccttctta caaaaatgtt 71341 atgtatctaa ctagtatcaa taacagagca ctagattgta gacagtcagc tatgtccccc 71401 agcacagcat aagagtcata acagcacaaa acaggggact tgacacagct ttgttaaata 71461 tgtaatctag ttgatcaatg gacttggcct caggacacaa aattaatttg attctgcata 71521 acttaagagt aagagaatta tgcaaagaaa aaaaggaagc aagagaaaag agaaatcaat 71581 ggggtgatgg agcacaaaca tggctatttg tgtagttcct tgatgaaaat ataagagaaa 71641 aacatcaata aaatccataa tatttaggaa ggcaaagagg gatagtaagc agtcaaaaca 71701 gtaaaccaga aaactggaca tgaaggtaat aatttggaca aagccccaaa gtagacatgc 71761 caacagttgt gcaacagagg gaggataaat tgtaataata gtcacaggat ccaaagtgta 71821 ggtatggatg gaaaaatggc aagaaatgcc atgtatttac ttttataatg tatgtgatac 71881 tggcaaatga cttacaaaca cagtgtgaaa aataaaaatg aaaaaaacgt aaatgcccaa 71941 tttaggtgaa gtagggaagg atgggaaaga atggctttgg attttctatc tgttaagtgt 72001 atgcaattct gatgcatgtg ctttatcaaa tataatcagt attcagttat tctcatatag 72061 aaaatgaatc accaatgttg ttgattctct acgacttctg ctactcccca cttgatggaa 72121 gttcagcaga tggtccattt ctatttttct gataaagtat ttgttgtgaa agaaatgtaa 72181 agttgagaag ccaaataaaa tctctgaatt ggcattaaag caaaataaac cttttttcta 72241 tgggctccag tatcagagag agttcaaggt ttactacgct aagtatccac tgaaaaagga 72301 cccttgtcgc aatacagctt aaaactgatt ttcagcaatt aaaaattgtt tgtacataga 72361 taccttaaaa ttgtttcaaa gtactttcag ttatggtgaa ttccaatatt tcacaaagct 72421 gagtttttat ttacaaactc agagaatctt caaaactgtg gcatcagaac tagccacaag 72481 ctgggtaaga atgatttgat atgaatagta caatgtaaat atacaataaa ttactctgtc 72541 atagtcacgg ttgtcaaaat ttaagaaaac aaatacataa aattagttag agacttccca 72601 taggaggaat taaaacagat atgcaattaa aatatttata actaaaaaga aaagcacgac 72661 ttttaaaaaa ggattttaat tgagctgtat tttggggtta gaatcatatc tagtgcttag 72721 ttattattag gaagtcaaac tgctctccta ggttagtata taaaggttat attaattgca 72781 caaaatatat taaatgcaca agtgtgagtg ttactacgga aacaaggatc ttcataaatt 72841 gataatctga aattattatt attttttttt tgagacctag tctcaccttg ttgcccaggc 72901 tggaggggta gcggtgcaat cgcagctcac tgcaatctct gccacccagg ttcaagcgat 72961 tctcattcct caacctcgcg aggagctgga ctagaggctt gtgccaccat gcctaatttt 73021 tgtatttttt gtagagacag ggtttcacca tgttggccag actggtctcg aactcctgac 73081 ctcaagtgat ccacccgcct cagcctccca aagtgctggg attacaggtg agagccactg 73141 cacccggtct gtaattaatt tttttcatga aaataatata atctgattct caatcagtag 73201 ttgagatatt tgcatatttg ttaaaaacag gggaagttgg aattaccttg ttaagatttc 73261 ggcatacctt gttaatcacc aatttgtggt ttttaaaaag gaaataaaat tatactaagc 73321 actgtacaca ataaaaaaag tttgtgatat cttatttatc cttggagaca agttactcat 73381 acagaaaata aagagcactg tctgtgctcg agctaccatt atggctcaga aaccccagat 73441 tttgtttttt tgaaataatc tgaatataac ttgaaaacac tgtctcacaa agagttaaac 73501 aaagcacaaa aatatacttc aacaaaagga atcaaagttg acccaaagga tgtttgactt 73561 gccaaaagtt atataggtaa taaatagcaa ctaaatctag attatccaat tctaagtccc 73621 aagaccttca taattttcca aagagagaag gccaaccgcc tagtacagcg tggtttgcag 73681 ttttaacatt gactctgctg ctaacttacc ttgacccttt caagtcagta aatttctctg 73741 tacctttgtt tcttctactt catggggata ttctgatgat aaacataagc aaatacttga 73801 actctggcag aaaagcactc tgctgtgtta ataatctcct catccccatc acatatttcc 73861 actgatgcta tgctttctct ggtcttagtt ttaagcttaa atgaatgtgt gtgctgtggt 73921 gtgtgtgtgt gtgtgtgtgt ttcggtgtgt gtaagttttc tttataaatg gctggcattc 73981 tagcaagatc cttttcccca actttcaaaa cgcatgaata gcagcacaac attttcattt 74041 ttagaattac caaagccttg ccacccaaag ctgtgcagaa gattttagaa attcacaaca 74101 aaattttcca ccatattact ctcaacttct atagaaatgc aactcacaac tcatattggt 74161 cagctattcc cctataaaga ttcaacacta ccagtggaag attataaaaa tgcatacggc 74221 ttttcaaata ataagagaat agttccaatt gcaaataaaa accagtatac attttgaaaa 74281 atgaaggtca ctaaatattc ctaaaataat ggaaacaggt gataattaac attctaacag 74341 tttagccaca tgtaatgtcc tctattttgt caacagagtc cagctgtttt atctacttaa 74401 gtctaaaaga aatttatcct catcattaaa aacaggaact tatttatagg taactaaaat 74461 taacagaaat ttaattttaa atattttaat tttattttta atattaaatt taatttaaat 74521 ttaatttaat ttaaaatttt taatttaaat atttaaatat ttaattttaa atattcagta 74581 cttgtaccat tattttcatt ttgaaaatgt atctatctta taatgatttg gctttataac 74641 ttataaagac atgtaactta ctttatcctt taaaataata ttttactaac aatattagcg 74701 tgtagctaca tatcatattt aagaaacagc aatctactaa gaaatgtgcc caggtcaaaa 74761 atagaataag ccaattgaaa taaaatatta tagcaaatga tcagtataca tctaccaaat 74821 aaaagggtta tgaaaggtga agaccaagta gtaatctcca aaagtaatac aatgattact 74881 gattgaagtt ttatacccga gactaagcta ctataatgaa atgtttaaaa aatattttag 74941 ggtattatca ccttaggtca atctgattat tattttgaat gaatgaatga atgaaatgaa 75001 tgaataccca aatgaatgaa tactgaagtc atgacaatgt ttaacacctt gagatgctca 75061 caacaatttg gaatacccag agaagatgac ttgaatacaa acaaactatt taaaattcac 75121 tgtggctcac agtaaagatg cagtgggttt ctatttgtag aacaaagaat ctcaatcaaa 75181 aacctatata tgactaaaat attctcaaat gcaattatca aggctttaga aagcatatat 75241 actttttact atgtaattct ttcctgttcc cccacctccc tgcatttttt tttaatgttt 75301 ggcataagtg accaacaaac cagaatagaa tacacatttt taaaatacta atttacacat 75361 attctattta agaacaacat acaaccactc ataaatttag cctttttaaa agtgtccttt 75421 atataaaaca gtggggagcg ggggaaatcc ctgctataca tatgaagcac agtttaagcc 75481 caatacaact gaccaatcaa aattagtgaa tacactgcat ctctctgcag tggagagaag 75541 cgacaaggta ggctgtcatt ctgcaaaagc tgcccgaaca ggctttcctc tgaaaagcag 75601 tgccgttcct ttttagaaag actgaatcta aatgttgtct ctggagacaa gaagccttca 75661 gtatgttaaa ttactttcat tatgtatttt cagatgctta ttgattccac agtaggaaga 75721 gtgagagact gcagcagcct ctaagcagca ctacatgttc catacattag cagtactgct 75781 gaaacaatgg cacactacag acacatattt tcattaaggt cattttcgaa gagatgttta 75841 tgaccctctt cccccagtcc tctactaaca agtgaacaaa atgaatcaat ccaaactgga 75901 aatgcttcaa ctacatcaag aatttatcaa atctttagca gaggtaagat ttgttcctat 75961 tatagtatac agctgctttt gaaccggcat agtggggtaa aaattacttt gaaaaatttc 76021 agcctaaatt ttaagagttt gtttaattct acttattgct agacatgtat tttaagatct 76081 atttctttta agacatgcta cattttaaat gtaatttttt aggatgcatt gttaaataca 76141 aatattcttt gtaaattcat tatgaagaca gcttttgagt cacttcagat atactaaata 76201 ttcttaatgt aatcagtaca gtttttctcc agtgtttcaa aaatgcttct gtttcttaaa 76261 agaactaggt ttactatttt gctgttactt attacttaat tttatgttaa gtttattatt 76321 tggcatgttc actgaaaatt aatttgcagt tacaattttt attacctttt gtatttcaac 76381 cacatgaaca attcatcaat ggaaatttat agtatcttat gaaatctgtt ttaatgagtt 76441 aaaattcaaa ccatgtgcta ataattaggc aagttacaat atttttagag caaaaaatgt 76501 tgttaaattt attacaaaga agaaataaca gtatgtttag tgtatgctaa ttgttcatca 76561 tctctgaaaa tacaggttag cttctaaaat gagagcagaa attagtttta aaaaatatag 76621 cttttctctc acaattgtat ttgaaatgtg aactctattg ttaaataaca taatatatac 76681 aatctttatt aggaaaaaag acttatttaa agaaacaatt ttatgcatat agctgtggct 76741 ctaataataa cagtttggtt attttgataa caaatactgg atagttttta aacaaaacta 76801 aagtaacttg caattaatta caaattatta aaaaccctat ttcttaacca attttcttct 76861 tttcacaagg gccaatagca gctaattcag tatttacaac tgacaatatg aagaatgcaa 76921 ttgactgagc atctccctag ctgtctgaac tacgaactgc aagatgttct tgtaacacga 76981 ctttaagaca ttaaggagtt aaaaccaggg aataggtcta cattactgat ggaatataaa 77041 aaatcaactg tatcctaaga agatgctaca taaaataacc acaaaaagaa aaacaatata 77101 actgtaaaag cctgaaaaga atttttaaaa ggggaagttt atactttcat ataccagaat 77161 tgtggaagtt actgattctg gaagacataa tgaaacatga atttccaaaa agaaaagaaa 77221 atactttatc agcacacaaa aggaagattt aggaagtgtt ttctgcacta aatattcaga 77281 tattcatatc aattggtatg actaatggat ttttctatct gacttttatg accaattatg 77341 tatcctcttc taatgaaaac aaacaaaatt aaacagcaga tggtttttat caaaaggaca 77401 tggcctggat ttataatata aagcaagtta tgtgatcaag aaatcatttc aaaatagtga 77461 gcactgctat taaaaacaga tttacaatgg taacaaaagg atgtctaaat atattttaga 77521 agctacaaac gttatgtttc cttttttgtt ttcacatatt ctggaaaata aagaaatatt 77581 atcatgtact ccatcaaagg gaaacataat tcctatcatc tgaggaaatt cctcttggcc 77641 gtgacttttt aaagcaaaac aaatacaaat attatgtact gttctttaga aatccatcag 77701 ccaactaaat ctcataatgc atgcagttga agtattggag agaaaacgaa agaattccta 77761 caagacatga aataaaacac agctacttca ctgttgtcag gtaaaaattc atgtcaaaat 77821 ctgtcaatga tatcatgtat caatttgcca aaaactgtca tagtgaacca aaaggcccat 77881 aaggcaacag caaacaggta ggtcagaaga tgcatgcacc ccaccacact gtttaacatt 77941 tacgaaagaa cagaatcttg ctctagagaa aatgtatttt tcttaatcat catttaactt 78001 gacatttctg ctttatttaa ttataaatct aactgatgtg acaaagacct gatgtttaat 78061 ctgtcagttc agaaaatttg gcacacttaa aattttccat ttttatagga tttcaatgtt 78121 agctaagacc ttaacttact cgaataaata tacctctagt aatacttcat actaattcaa 78181 aagaaataat gttaccattt gtgtttctgc aatattattc ccaaaaaagt tcataaataa 78241 aactgtattc taaaacttgt caaatataac attaagtgaa agttaacatg aattttgaaa 78301 attattagct tttataattt atgttgaata tgattttgca gttgaaatgc tgtatttgaa 78361 atgtgaaata tagttggttg tatgagttca tattttaaaa tgaatatact gattaactga 78421 cattttagcc cagaacatta gacattattt tttacaaatt attctaaacc cctatatatt 78481 aaaatataga tttgcatgaa ttagcaaaag cgtttgtatt tttttttaat ttttcttcaa 78541 aattagacaa tgggtgatat atacaaaagg cttcacaggt aatgtgtaac tttaaaagct 78601 tcctcaaaaa acgggcttca catactcctt ttccctaaaa tttcatcttt tactttaaaa 78661 actcagttta aacaatgtca actgatatat tcctttaaac tacttcaaaa atggctattt 78721 cttaatgtga tattaaattt caattttctg tttccacaag atacaagagg gtgtggtcac 78781 aaaataaatc atttcatagg agtgattaat gctcttttac caagctttgt agctttttat 78841 tttagcataa tgtcccactt cttctaattt ttaagtttgc caactgcaaa gttcagtgag 78901 cattgtaact gtgatatgta aaatattaat cactggccta tttctgtggg aaataacccc 78961 aaatatcacc aagcacattg ttgacttctg aatatgaatt caatcaacat ggataaagca 79021 tcttaaacta gtgctctgta ccatttgtca ttttaaatga acacttgtcc tgttcatttg 79081 aaatctcaca ggaataatta cacctacact cattatgcta gagcatatta aaatagcatt 79141 catttggtac ctactcacaa catttaaatg aaattttaag acactgggct gaaattaatt 79201 ttgtatgcta ggaagtttta tcatacaaaa atacacttta tctcaaataa taagcttgaa 79261 atactcaaat gagaaaagcc ctttagcata ttaactttgc actacagagg aacaatttcc 79321 atagttattt cttcaaaagg aaaacacaat tttcttttat atcaaaacaa tgcaaacttg 79381 atggttctta attctacatt ttctattaat agtttacaaa cttaaaaatt aaactaagta 79441 cacaattgaa agattttttt tcttacaaag aacacgttat acgtcattta aattgccaaa 79501 tatcaaatag tttattctat ttcactttct agggaaaaaa accaactgct ccaaaagaat 79561 gtgtttttct cccattctgg aaatcaacat gcagtctgaa tctaacatta cagtgcgaga 79621 tgacattgat gacatcaaca ccaatatgta ccaaccacta tcatatccgt taagctttca 79681 agtgtctctc accggatttc ttatgttaga aattgtgttg ggacttggca gcaacctcac 79741 tgtattggta ctttactgca tgaaatccaa cttaatcaac tctgtcagta acattattac 79801 aatgaatctt catgtacttg atgtaataat ttgtgtggga tgtattcctc taactatagt 79861 tatccttctg ctttcactgg agagtaacac tgctctcatt tgctgtttcc atgaggcttg 79921 tgtatctttt gcaagtgtct caacagcaat caacgttttt gctatcactt tggacagata 79981 tgacatctct gtaaaacctg caaaccgaat tctgacaatg ggcagagctg taatgttaat 80041 gatatccatt tggatttttt cttttttctc tttcctgatt ccttttattg aggtaaattt 80101 tttcagtctt caaagtggaa atacctggga aaacaagaca cttttatgtg tcagtacaaa 80161 tgaatactac actgaactgg gaatgtatta tcacctgtta gtacagatcc caatattctt 80221 tttcactgtt gtagtaatgt taatcacata caccaaaata cttcaggctc ttaatattcg 80281 aataggcaca agattttcaa cagggcagaa gaagaaagca agaaagaaaa agacaatttc 80341 tctaaccaca caacatgagg ctacagacat gtcacaaagc agtggtggga gaaatgtagt 80401 ctttggtgta agaacttcag tttctgtaat aattgccctc cggcgagctg tgaaacgaca 80461 ccgtgaacga cgagaaagac aaaagagagt cttcaggatg tctttattga ttatttctac 80521 atttcttctc tgctggacac caatttctgt tttaaatacc accattttat gtttaggccc 80581 aagtgacctt ttagtaaaat taagattgtg ttttttagtc atggcttatg gaacaactat 80641 atttcaccct ctattatatg cattcactag acaaaaattt caaaaggtct tgaaaagtaa 80701 aatgaaaaag cgagttgttt ctatagtaga agctgatccc ctgcctaata atgctgtaat 80761 acacaactct tggatagatc ctaaaagaaa caaaaaaatt acctttgaag atagtgaaat 80821 aagagaaaaa tgtttagtgc ctcaggttgt cacagactag agaaaagtct cagtttcacc 80881 aaatccacat tcaaatgagt tttaaattta aattgtaaaa actgatatta ctgccaaata 80941 taagaaaaat attttaagta ttggttatgt tgtaaatttt caatgtgaat gtcaattaga 81001 taggtcatat atattcaatt tcttcattac ttaatgtatt tgttgcatgg cagtttgtta 81061 aagtactatc atgtgtatat tttgtcaata ttatgtccaa cagaaaatat tcatgtaagt 81121 catatttttt aaggaataaa tacatagcct taaaacagtg tataacttta aaatgtaact 81181 gacataggta tccttgcttt attttttaag ttaaaatgca ttgtttctaa gccacaaact 81241 acagatatat ttagattaca actggagtag cattttaatc taaaaaccaa aattatgggc 81301 tcaaaacaat ccagtatttt ccataccact atgctatgtt tcctggtata gtgtatttgc 81361 tatatttgat gcatcacaaa taattaagta cgtatgaagc tttattcttt taaatgtaaa 81421 aaatcataga atttatcaaa attttaaaat taatgaacca aaaaaacctc tgtatacaca 81481 ccaaaataga gaaactttaa aattcatgct tactaggaaa aaaaagattg attttctaag 81541 ttcaaggaca gtatgcctat aatatacaaa tgaaatgaaa ctaaagggaa ggaagaatac 81601 taaaacacca gcctctttct tccttcctca ctttgcttaa gtctaagcca aatgctctgg 81661 attaatatag tgcaatgatt ttaaaaaaaa aaaaaaaaaa aaaaagaaca gtacatcaaa 81721 ataagctgac ttcctaactt tattcaactc agctcttttg tataatagaa agttaactac 81781 ccaaatctga ataccaggaa tgaaaaaaaa aagactttaa atcattatag aaaagtctaa 81841 caaagcctct aatattagat acatacaact ttagggtaat acactttata caatgttccc 81901 accaaatcat agctcccaag taattcaaaa tgagttgttt tatttccaca ctgaaatcac 81961 taaggtaaca ataagtcttg tcacttttct accgtaccat cacttaattt aactttaggc 82021 atttttctct cccaccctca attcatgtgg gaaaatcatg aaattactaa ccctaactac 82081 agattacaag aattgggaac ctcatatcaa acaaatattt tccaattgtt ataatgtaat 82141 aatctatcga taagtgtcaa tatcactatc agaaatttag caataagtta tttaaaatag 82201 gaaaatactg cattgtaatt ggtcagctga acaactttta ttctattttc tgaaactgta 82261 tccactaaac caagtagcaa ttcattttta aagaaaatat taaatgtcta aaaattcaat 82321 aatatatcac ttttatataa ataaatgaga tttatccagt ctagtttact gaatacccta 82381 atccttggag aaatgaatac catgcagtag cttgaactca ctaaggttac atgacaaatt 82441 aaatagtaat ttacttctga taataatgtt aacataatgt caaatcttgg caaaagccca 82501 atttccttct gtcttatttt ctactgtttt ttggaattta aaaaaatgct tcaatttgac 82561 aaaatgtagc ataagcaagt aagatatttt ttgaaacccc aaagagaaat taacacttca 82621 ggcacaaaag gattaagaaa attttcgttt agtaaataca tttggcacag accagctcct 82681 aatcttcttt tattttttac cccaggaaat aaaacaagct cattaatatc ttcatttaag 82741 aatttaaaaa atgaatttta taattggact cccaagcata aattaaaatt ttaatatcta 82801 gttttataat ctcaataaaa attagactat ttttagaacc ttcatattaa gaaaaactcc 82861 cttgtgtttt aaatattttt actcttcagg tttataacta tgccaaagaa ctcagtataa 82921 aaaagagatt tctaatatat ttgtctaaac tgctttcatg ccttcttgtc ctctgtaata 82981 agaaattaaa taagaaataa atggtaaata ttatattgcc ctttaaactg ctatcatttc 83041 tatacttgac actgaagcag acatttaaca agcattgttg accatccgta atgatctttc 83101 aggtatacaa aagtgcatca actgatttac gacaaagaaa actcatacct atcagaaaac 83161 ttctaacagc tttttaatgt gctaagtagt atataatatg taaaccaaaa taatatgtgt 83221 taaaaattta gcttcaagaa caagtcacaa ctatgtttcc aactactgac ctcagcttaa 83281 atgtgagaat gagaagctta gacaactttt tgtattaaac atctgttcta tgaaaagacg 83341 aaatatttct attttcttta cgttggtctt cattacaatg tgtgaatttt tcactttatt 83401 tcaaataatt ttgctcaaaa aagcattaag gctaattctt cctgttagaa tgaaaaaatt 83461 aaaaatgcaa aagcaataga ggcaacaata aactcaatat gggacctttc ttcagactaa 83521 atttttctat ccatatatca taaatcatag gcatagatat caatttctac atagttaatt 83581 gggtcagttc ttccaaagga taagaagccc ttagtagccc ttagtagggc ttattccgca 83641 tagtatttta aataccaacc aaagatgagc tcaacaagat gtggtaaatt ttatttgctt 83701 tgttttgtgt gttaatttaa gggagctgtt actctaccaa ctactactga acaaatgttc 83761 aaacttattg tgaaggtaga attatttata attatatgtc ttttaaaagg gccattcatg 83821 ataattctgt acttctgacc caagtcaatt tttagttgta tgttctattc tttcatcctt 83881 taactaaaat aacagtagtc tccccttatc catggtttta cctgctgcag tttcagttac 83941 ccatggtcag ttagcatggt caactctgtg gtccaaaaat attaaatgga aaattccaga 84001 aataaacaat ttataagttt taaattgtta tatttaaact tatttataaa tttatgtata 84061 attgtactac tttattatta gttgttgtta tagcctactg tgcctaatat atgaattaaa 84121 ctttatcata gttatgtatg tataggaaaa aaacatagta catgatatac tatatatagg 84181 gttcaatact atccatagtt tcagacatcc tctggtggtc ttgggatgta tacctgaaga 84241 taagggggga acaaatgtgt agcaaaatct cagttccttg ggcattaaaa accagattcc 84301 tcattgaact gaaattagac ttaagggtca aaaaatctaa aatcattaga aaacaggacc 84361 ttagtagggc ttattccaca tagtatttca aggctacctc ccaaagtcaa taaaactttt 84421 aattttattg catttcatga ttacacaatt aagtaagctt tgtgattcaa ttagcattta 84481 aaatcttaca gaaaagaggc attttggcag atactttgaa ttatatgaaa gagtcattac 84541 cctcaagagc ttaaaatcta gagtcaaggt attaaaaagt atggcattac ctaaaatcag 84601 aggtagatgt ggtaaatact ataaaagaag aataaagttc ttaatagact ttagaagaaa 84661 gggatcatgt tgaatttaag aacagtaaaa agaaaaacat tgctgaataa agctgctgca 84721 gcatattctg gaagagcaaa ttgttctttg gaagaaaggt aatacagtag cgagaagtta 84781 tctagaaagt taaatttggg tcataagaca gaaagccttc tatcttatgc tagaagaggt 84841 atctgagcag gggaattaca aggtgaaaaa atgagcttca aaaaaacaaa taaagcagca 84901 atacacagaa ttgatgagac acagaagata ggaagctgct gccatcacct tcatagctag 84961 tgaaaggcct ggcaaaaagt gatggcacta gcttgacaag ggagtgtaca gatgtgacag 85021 gcaacgtgaa ggcagagtca atatttacta actagctagg aaagaaaact ggcataagag 85081 actgaagaat taaaaatgat gaccataagg aagaccataa gaaacagatt acccgcaata 85141 ctcaaataac tatttaatta tgctttgcat aaattacatt gataacacag gcaaaaacaa 85201 aaagaatatc tggcaaggcg gaaaactcaa tgaagtaaaa taacattaaa taatgttcta 85261 gttattagaa gctattcaga atgaaaattt cgaattgacg ttttaaatta tctattttat 85321 atatttttca attggttcat tccatttttc cattaaacac atattttcta acaactattt 85381 taactataaa aatacacagt gagaaaatac caatgtatat gaagtgaaac acacaaatgt 85441 catcctttgc ttttttccct gaataactta ttatgggcat ccatccaact ctttataaac 85501 atatcgtcct cattctttgt taatggctac ataaaaacag tttttaatta aaatgtttta 85561 aaatagcaac atttctaaag gataatagta atgaaagggc aaataaagct ttttctgtcc 85621 atttaagtgg cttcaaactg tatttcaaaa tcatgttaga aaattcacat tttcagtttc 85681 aattaaaaat taaaattttt aatagccaag ttacctttgg caactgagaa ggttatattt 85741 cagaatgacg gcaagaagca atgggaaaac cataaaataa tgtatttctt ttatcctttt 85801 cactaaataa gatgagaaat aagaaaacta aagtaaactt tgaccaacta ttttctttgt 85861 atctcagatg agcacttgta tgtactcatg aatgtaagcc cccactcagg gactgcagtg 85921 aactctaaga aagttaaaaa tgagttgacc aataataaat gttagaaatt ttaaaagttc 85981 agatcatatc gtaaaagaga aagtaaggag ataaaagaac ccagatgagg ctggaaaaag 86041 acctaaaata ttcaaagatg ctaagctctt agagaaatac gaatagaata cagaattgta 86101 ggttagagct cacaaaagaa aactaatttg aagattgact tttttctaaa aagcattcga 86161 aaaagcaaaa gacaccccca ttgacaggct gcacatacta gaaatataac aacaaacaat 86221 gcatcataac aagcagaata aatattaaat attacaattt acagatattt agaacaaaat 86281 ggcattgaaa tgtggaaaca aatggaaaag tgggcttacc actaatacaa aagccataac 86341 ttttaagaaa tactcatttt tttcaggaca aaaagagcca agtgaacgta atcacaataa 86401 tatttcttcc cttgctcata ttaaaacaac atgtgaaaat aatcctacgt gtctctccca 86461 ttctattcat acattttcta caggcctcca taagctgtat ccatggcttt caaatatgac 86521 agctttgtac ttgcaccagc aagtggtaaa atgtgctgtc attcctgtca ctgacaccaa 86581 tatcaactcc ctgccactat caaaccccga agcatcaagt aagataaggg aggtgagcta 86641 agtctgtctg ggattgaggt cattttcatc accctgcttc catgagagca atatgtcata 86701 tttttctcag aggtattaag caaggcagac taccaaagca catcaaccaa tttctcagga 86761 tgaagagaga aagtggctag cgaggatgac aaggttatgg ggaaactgct tttccctcag 86821 acagctgtgg acacataata ctcataagat ttgaaaaatt aaaatttaaa tttaatataa 86881 aattttaaaa tattaatgct aacaaatgag agcactggcc atattaaacg tagactgtag 86941 gagtgagaaa tacatatggg actgtagtga ggtaataatt atatattaac atatttatac 87001 ttcaatcagc cagtaatatt acacacaatc ccatacgtca tatactataa actgagtaca 87061 ctggttgggt atactgcata ttgattgggt aaagaagaat caaagatata tatttcatcc 87121 aaaatattgc taatgcctta ggacaggttg aaatataaaa tatcaaattt acatactctg 87181 tactttaaaa gtctcactct caattcttaa aaagctttgt cttttctttt gctgttaaat 87241 tttttccaac tatcacttct tataaacaac aaaatgctag caaactgtag ctacctactt 87301 tataaaaact gtcttaaatg cacacaatac aataacagtt ggtaatacta atgtactttc 87361 gcactacttt tataatacct ttatactgct tttataatgc ctcccaaaaa agtacctgat 87421 acatactgac tggtaagtaa atattagtgg aattgaaaac aaatttttaa cctcagaccc 87481 tattgttgta tcttgtttag aaggttgttt ctcaaaatgt agtccacctc tatcagagat 87541 tctgtggtac tttttaaaat gccaattcct gggccccatc tctgactctc tgacttactg 87601 agtcagaatc tctgaggttg cagtcctgaa aaaaaaatct tttttttttt ggtgggggag 87661 acagggactc gctctctcac ccaggttggg gtgcagtggt acaatcatgg ctcactgcag 87721 cctcgaactc ttgaggctca agcgattctc acacctcagc ctcctgagta gcccggacta 87781 caggcaggca ccaccatacc cagctaatta aaaaaaaaaa ttttgtagag acagggtctc 87841 attatgttgc ccaggctggt ctcaaactcc tggtctcaag caatcctccc gccttggcct 87901 cccaaagttc tgggattaca ggtgtgagcc accaagccca accaatatat attctaataa 87961 gtactctcag gtgatttgtg tgctgacgtt taacaattac tactttagat atcagaagac 88021 agactatgta tcaaggtgat attcaggtaa taaatgaaaa ggcacagaag cagctcttaa 88081 ttcagactgt actgccttct tgaaaaaaaa ctcctccatt tacttcattt taactgatta 88141 aattttagtt ggattatata tactataatc cagtactaca ttagatttct aagaaatata 88201 atgttgtttc tttaaacaag gttatctaga caaaagtttc ctttaaaaac catatactct 88261 cagaagataa aatgctatca aacacaaaac attcaccata aactgttatg tacttcactc 88321 aaccatacta ctcaattaat tataaaattg catggctact atttttcata aaattcttac 88381 tcaaaccaga agtactattc ttgcacattt ttgttcattt cactataaaa cttccgctaa 88441 aacataactg acataaaaga attacttcta aaataaactt gcagtgaaaa tacttacaaa 88501 ttgtattttg tatgttttga agaaatgcca ttcaggttag gtagaggttt gaatcaaaca 88561 tttttctttt gctgttatta tttactttgt accattaatg caatttagta aacagcaacc 88621 taatctttga taattttatg acatccttta aattcctatt ctgtattata atagtttagg 88681 taacactgat acttgataaa tgccgcaaat ggtattacgg aaacactact ctaggaacat 88741 ttcttattaa ccctttcact aagaactttt tccaaccaca tcgatattat cttttgttat 88801 tacttttggt ttttagtatt ttcattgatt taatcatgaa actaacctat cacaacttat 88861 tgaaaatgag cattattgaa aatcagctgc tagtgaatac aacttaaaga cacaaagaca 88921 aggctgggca tggtggcaca tgcctgtaat ctcagcactt tgggaggctg aggtgggtgg 88981 atcacctgag gttgggagtt cgagaccagc ctgaccaaca tggagaaacc tgtctctact 89041 aaaaatacaa aattagccgg gtgtggtggt gcatgcctgt aatcccagct acttgggaag 89101 ctgaggcagg agaatcgctt gaacctggga ggcaaaggtt gtggtgagcc aagatcgcac 89161 cattgcactc cagcctggga ggcaacaaga gcaaaactgc atctaaaaaa aaaaaaaaag 89221 atacaaagac cataactcaa taattcaggt gactgaatat ggccacgtta tagaaaaata 89281 tttgccaagt aatatattcc agataaacag tggctttagt atagggtttc ctagagtctc 89341 ataaagtacc ctaaatcaga tgcaaaatgt cacgggtatg gttatacatt tttttttttc 89401 ctagaacaaa gattcacaaa ttcagtaaga tcctccaaag agtatgtaat ccaaaactaa 89461 aataaaaatc atcatcagtg ttaggaagaa agcagggaag caaaactggt aaatgttaga 89521 tataatttta tttatatata tcctccctta tcctaaagat aatataaaat aactttagca 89581 ttttattttt cttttttgag acaaggtctc actaggttgc ccaggctggc cttgtactcc 89641 tgggttcaag caatcctccc tgctcagcct ctcgagtggc tggggtgaca ggcgtgcacc 89701 aacgctacca gcaagatgac tttaagaatt ttagacacaa ctgaactaaa tgaaaaaata 89761 ataaaatcag agcaaacagc atgaaaagaa tgaaacaata cttgtatcat taagtcctgt 89821 attcttacaa gaggtgagcc acaaattgct ccttatgatt tctttttaat ttttatttta 89881 ggctctttct ttgaattcta catactttct actaggtcat ttaaacaggg aaaaggagct 89941 gattacatgt ctcagcattg gtaagattaa aaactcccta gatgctcagt agaagtacaa 90001 actattggta ctaagaccag agagcagtga ctctcatggg ccttaccatg tctctatggg 90061 tcttctcaat gttttccaca actatatatt ccacaactat atacacttaa tgatacctta 90121 agtctcacat ggttgtttct tatatccctc aatgtataac actggccaca acaaccttat 90181 agaaaaggca acaagaagtc tcacatagct aatgattata gttttcctca atgtacaata 90241 ataacatcaa actaaagtga aattaagtaa aaagaattag atgggaatgg ttgtctaggt 90301 ataaaaccct ctgcaactct ggtttaatcc agggatagat tcttgaatgt taaaacagtg 90361 gatttgaatc atttcgtaac acacctaaga ggtcaatcag actgtgtaga aagaggaggg 90421 tagtacctcc aactgaaaag aagacaagtg tcttaaaaac aaaaatggcc tctaggtttt 90481 ctagtacagt taccttctcc ttcccttacc ccacgacctt cagagattca ctaattaaga 90541 gaaatggagc atggactgat aaccttcaga aataccttcc ataaatattc tctctcaaca 90601 aaattaggga aaataattga cagagaaatg gttaagtgca gaacaaaaac ttgaatagcc 90661 accagagctg aagcgaagac taaaatctcg agaatgccac tctgttttgt gcagaggttg 90721 ctgttaccaa attctgaaat aagctaaaga actgaacttt cattaactaa ttttaagaaa 90781 agtctacttt gcctgaaatt tctgatcctc aaaactgatg atttgattca aagatacagt 90841 tagagcatac aataagagct aactgtgttc tgaactacat gataaaagac actacacaaa 90901 tgttttatca ccattgtcaa atacaagaaa taagacaatt ttaaatacaa ttattcttct 90961 atgggtacta tttatattaa cacatatgct taccaatttt acaatgtaac tagcttctca 91021 aactacataa atgtttctaa aaacttagtg aataacatgt cacagaatct agtgacacaa 91081 atacaccaga ctaaattgtc tgacaacagt aatcataaat cccagttact atagtataca 91141 agtcttttat cttgtaagca ctgaaaaaaa ttataaaagc acttttaaag ggtccattat 91201 gatagcactt agccctcact taatgccaaa tttgaacatt tcataaaaca ttggggttgt 91261 acttgcttac agaagcagag gagataagat aaatatgcca acatcaatgt cacaaagcaa 91321 aatatcataa gaatataagt aaagagcaat tgggctcaaa ggaaaaggag gcacatgtac 91381 ctggaaaagc tgtagaaaac aaatggcact ggaaatgaac tccgaatgag ggatgagact 91441 ggctgatggc atcagggcaa ggacagcata ttaggcagag gcacaaactt gggcaattat 91501 acagaagcaa gaaaagcaag tctccaggga acaattcact ttgacactag tatactccac 91561 actctatgaa tgtagaaaca gacagagata aagctgaaaa ataggttata ttcgtgtttg 91621 ccaggccaaa aaaaaattat aatttaatcc caaagcctag agaaagtctt tctagactta 91681 ccagtatgga agtgtatggt gaaaaacata cattttcact ttctatttcc atcatcatag 91741 tttacgccct caatatccta gaccactgtg aggtgaagaa atcaagtcag aggctagcaa 91801 tctctcacaa tttccatttc catttaccta aaatacaatt ctgctcattt tcaacccctg 91861 atcacaaatc ttcaaccaga tacatcaccg agacttaaaa tacaacataa tatataacag 91921 aaaacaattc aaatccagat attgcagcaa tgtcttcata tttttcaaat gtagaattaa 91981 gatgccttac aaattactca aagacacagt cttggaaaaa aaatgccagt ggaaaatgcg 92041 tacgtccaat tatattttcc tattaaaaac tataaaaaag aaaacactaa aaaaatctag 92101 ctgtaaatag taataagtga ataataacta aaatgcaaca aaaaccataa tcccaggatt 92161 taaaaaagga aaatagacaa acaactgaac tgcaatagaa agtccataca agactcaagc 92221 aaatgtaaga atttattatg aaaaataaga tacttcaaat ctgtggtgaa gtgatggatt 92281 attcattaag tagtgttgga aaagtaacta aacaatggaa aaacagggtt taatgcttaa 92341 ccattcccaa agacagtgag ttgtacaact ccaggaggca cagttcacat tatagtctat 92401 gtgacagtaa atggtactgc ctagagttgt gagatgtggt tgctatgtgc ctaagttata 92461 ccaaaaaaat agcagataga taaaagtaaa atcataaaaa tatccaagaa tatggaggtg 92521 aatatttttg taatattgtg gtgaggaacc aaataaatcc attattaagt aaaatattaa 92581 caagtataat tataaaactt ctaaaattaa acctaaaaca tcaaagaata aactgggaaa 92641 cacatttaca acctatgtag taaagcaaat ttacctctac tgtataaaaa atagttacaa 92701 accaacatgg caaaaggaaa ataggcaaag aatacaaaca ggtaactcat caatgaaaaa 92761 gaatgacaaa cacaaaactg ttcaaccaca tacagcaaaa acaaatccaa attagaataa 92821 attttaaaac taatatacca atgttcatat gtgcttggca aacaattttt aatttttttt 92881 aagagatggg ggtctcacta tgttgcccag gctgatctcc aactcctggc ctcaagcaat 92941 cctctcacct gggactcccg acgtatcagg attaaggtgt gaggcaccgt gcctggctag 93001 caaataattt ttaaatgctt atgactaata attttgataa acctatgagt atttttcaca 93061 ttactacaca aaatatattt tattttgaaa tattaagcaa aaacctcaaa catgaccttt 93121 gaccaagaaa ttcaacttct aaaatgtatc ataaggaaaa aaattgaaca actgtgcaaa 93181 gatgtatgtt taaggatgat caaaacagtg tctataccag taaaaaaaaa aagtatgact 93241 aaaataatta acaatagatt attacataaa gtattggata gtttttaaat attatccagc 93301 cactaaaatt atgacataga tttatgttta ttaacatggc attatgtcca taatgtaatc 93361 ttaggtacaa aatagtatgt ataataaaca cagtaatgca taagcaaaca gttgagaggt 93421 tagtttctga aaatgctgat aatgaatctc aaggttatat ctgtagtttt taattttcag 93481 caatcaacat atgcctcaat tttctagttt tcacaatgaa catatattgt taatgtaact 93541 ttacaaatta aaaatattta taagaaacat tgttttatta aaccattgaa agcattatga 93601 cagtagtatt tcattttaaa aatatacaaa attgagctga ttattccaag cccatttatt 93661 taaaaaaatt ttaaaatata gcacatggca ggccttcata aaaattcctt cctctgttac 93721 attttgggtc catactgata ataaccaaac ataagtgtaa aattatatta ttaaataaaa 93781 aatatatatg aaaaatagag cctgtgaaaa taaaataaat tggtcttctg gatacaatat 93841 ttagaaaaag acattttctc acaaagtact gtccagattg agactaagtt gttgactaag 93901 acatccaggt cttgtttctc ttgtgttttg ctttctgtta tcttttattt atgagcatac 93961 tatacaacat gctcattccc ttgtaatatt ttgaagaaag tgaaaaagat gaaaacatat 94021 aataattatc aattatttgg tttaagccaa aagaataata ggctgggtgt ggtagctcac 94081 gcctgtaatc ccagcgcttt gggaggccaa ggaaggtgga tcacttgagg ccaggaattc 94141 aagaccagcc tgggcaacat agtaaaaccc catccctcca caaaaaatac aaaaattagt 94201 tgggcatggt agtgcacgcc tgtagtacca actactcaag agcctaaggt gagaggacca 94261 cctaagccca ggaggcagag gttgcagtga gccaagattg caccactgca ctccaacctg 94321 ggcaacagag taagaccctg cctcaaaata ttaataaaaa attaaaaaaa aaataacagc 94381 atttttctca tttcttcctc ctgcattcaa aactatgtat ttccccaaag ttataatgtc 94441 acatatatga aacctgtcat tttccatttt tagagacctg agcaaccaaa cattgtgatt 94501 tttccaacct ttggaatgtt ccttgaaaaa gtttttctct attttcttgt atctgtaatt 94561 aggccaggct atgaaaacta tagaggtgat aattcattaa gttgtacatt tacagtgtat 94621 gtactttttt catacttcac taaaatgtta aaactgcagg gaccatttaa atgatgcctg 94681 gttaatgtcc ttaaatgttc tctaaaatac atttaagtaa acagaagcaa aatataaact 94741 cttcgctaaa atgtattctt attttcttgg aatattaaaa atgaacacat ttaaatctca 94801 aaccaatctc atatctcagc cgagataaat gatggttgaa aagaaaatgc aggccgcatc 94861 aatggaagag gtgccatcta tcacattaaa tttaaaaaag aaagctgtat agttcactca 94921 gcttctggtg acaaacaata ctgttataac cctacccaaa ttttactgaa gttctaagtt 94981 cacatatact tttgtcttac agaagtctga tccccattac cacctctacc agagtacaag 95041 ttgaaaaaat aaaagtgaaa gtccaagaaa acaagttcac ccattgttaa aaaagaaaaa 95101 aatctatgtt tagtgaggaa gaaaactcac attgtaacct caaatactta cttcagaatg 95161 cacgttaagt atgtaacata ataactactt ataaaggaat gaccctcaaa gaaagcctcc 95221 cagtcatatc aaaaacttta agggcatatg tattcaacaa ctaacttgca acacagcaaa 95281 taattgccca agagattttt tttaaaaaaa gaaaaagaaa tccaagtctt cttcctacta 95341 gctgtgtgat ccgggaacgc tatgtaatct tattctgatt ctctttgccc atctgcaaaa 95401 tgatagcatc tacctcacag agttgggaag cttcaatgag ataagcacaa acaatgttca 95461 gcataatacc tgccatacag caatcaatgg aaacttttat cattattatt ctgaaatttt 95521 atttaaaaaa caactaacaa acgcattttc aattcaatta tatgatgtag ttatttttta 95581 aaggaataaa ttatttttaa aaggaatgat tatgatttct cacctcaaca gaaatttcag 95641 agacttgatt ttcacttttc ttaaatgttt ctactctctt ccagtatcgt cccccatccc 95701 gctcctcctc caagataggg tcttgttata ttgcccagga taacctcaaa cccttgagct 95761 caagagagcc tcccacctta gcctcccaag tagctgggat tacaggcaca tgccacggtg 95821 tcttgccagt gtccatcttt aatgcagaaa tctttacagt aatacttcaa tatcttgttc 95881 ccagaaaaat atgcatgaga aatgtgtgta caaaaatacc atcttttttt ttccaagaaa 95941 agcattacat gatgtaatgg gggatgggga atagtatttt tgagcacaca gaagtagaac 96001 acatgatcac agattatgat gtatgctcca agtgatccaa gtgtgtgaca tcaaaatatc 96061 acacaaattt gttgtttagt tatatttcta aagaaagaaa acaggaggaa agcaatattt 96121 tgaggaccta ctatatactg ggcagggtca aaggaaatct actctggctt gttatatctt 96181 cctaacagtc ctctgaaacc tgagtgagaa gttaacttct ggtaattcac agttagaaac 96241 agggatggaa atgggattta aaccaagatc taaatggttt caaagcctat tctttcttgt 96301 ctctcaatca caaattacac agcagaaatg attacaatag tttatggcat cactttatga 96361 tattatgaag cctctaaagt attagtaaag tattactttg atggaaaaaa tttatgaata 96421 cggtgttgtt ctcattatta tcttaaaata cctagcctcc atgtgcccat tcaacaagat 96481 ttattcaaag tcatttaccc tcagtagaag tcttcctaac taatccttgc agtttttatt 96541 gttgttgcta ccatttaact cactttaaaa tatcaatttc atacttagtt ttaggtagaa 96601 agtctctaaa tcaatagatt aggtggctat tcttcagtat cttcaaactt catcaaaaac 96661 aacagcatac aaacaaaata ccttctctct atcctctacc ttctgtcctg caattatcac 96721 aacttggcat taatggaaaa gaccaagcag agattacttt gcttctttat cttccaaaga 96781 agagatcaat atcctgggta ctcttccatc cctgaagagt aactagatca agctggagga 96841 tgtgggttat catacagcac atcccaataa aacaccattt tatcctactc actctcttac 96901 tccacataga ctgctgatac atatgttaac cattcatgct ctccactttg tcccaagatg 96961 caattcttcc tcaagtcttt tcccaggtga aggctcgtac cctacagctg ccaatgaatg 97021 ttctattact tctcccagga aaaaatatgg ctttcatttt caaagccctt ttagcaaagg 97081 aactttttct tttccaaata aaatagttta caaaactcaa atttataaac aagataaaag 97141 caaactaccc tcaacaaagc aaaattaggg gaccagagta aaacagctca acattccttt 97201 catccaaatc acaaggattc ctcacaatac aatgtgaaac aaacatgatt ttgaagaagc 97261 agcctcaata gaaaattcca tgagaaattt cctgctcact gtactaaaat cttgaataaa 97321 tatattcttc tgtaagaaat taaataacca tgatgaagaa acagactcac cctgcccctg 97381 tagccagatc tgtgataata tcccataagt atcccaaaga ctaagaatga acttcattaa 97441 tctgcataat tccaaaacag acacataaac agatggacac atataaaaga tattttataa 97501 gcaatatccc attacagctg tttattttta aatcatcagt taatattttg tatctgtgta 97561 taaaaaaata tctagtttct ggcaataaaa cgtcttacaa ttctggcact tttctgtcag 97621 aagagccaaa acatagtaaa caaaaaacca aaaacaacaa tgggtagtgt gtgtgtgtgt 97681 gtgtgtgtgt gtgtgtgtgt gtagtttatt ttgctttgtt ttaactctaa tgtctttcta 97741 gagaagagct tcaaaaatca ttcttcttcc tccaaaacaa acagtactct ttccatgtaa 97801 ctgacgtggt acaaatgctg catgtattag tttcttattg ctaccgtaac aaattaccac 97861 aaacctagag gcttaaaaag cacaaatttg ttaccttata gctctgaagg taaagtccaa 97921 aacaggtcac aataagctaa aaccaagatg tttccatggc tgtattccct tctggaggcc 97981 cttaggggag aatctgtttc cctggtcatt taggttgctg gcagaatgga agtccttatg 98041 gttgtggaac tgaggacctc acttccttgc aggctatcag ctgaggacag ttcccagctt 98101 ctagaagaca tctacatacc ttggcttatg atctccttct ttcatcttca aagccaacaa 98161 tggcaaatca agtacttctc atgcttcaaa tctttcctgt ctcttcatcc atcacctatc 98221 cactcttctg ccttcatctt ccaatccaga ataatctccc catctcaaag cctataattt 98281 aaacgtacct gccaagaccc ttttgcaatg taagttaaca tattcatagg ttccaaggat 98341 taaaacagga ctctctgggg gaccattatt ctgcctattg tgccatactt gttgagctgc 98401 actgatttga agtcagaaaa aacaccagag aacatgtcat gatctgctgt agtctgcatg 98461 tgtcccccaa aattcatatg ctggaagttt aatccccaat gtaatagtgt tgggaggtgg 98521 ggccttttgg gagatgttta gctcatatag gctccacctt atgtacacat acagcaatgc 98581 tgctataaaa agggtttgca ggagtgggtt ctctcacttc tgctcttatc ccacaaagga 98641 acacagagtt catctctctt acccttctac cttccaccac gtaaggatgc agcacgaagg 98701 cccacattga atggcaacaa cttgttcttg gactttcttg cctccagaac tgtgagatac 98761 tacatttctg ttctttataa actatataat ctgtagtatt ctttacagca gtacaaaaca 98821 gagtaagata ctatccttta gcctcaaaag ttgttaggat caaaagcatt agagttcaaa 98881 atactgagca attgtatcat taaatattac cactatgagt aaatccatca attagaatag 98941 ctattatcac taagccctta cttctcttga aatcaaagaa tatctaaata tgtgaacata 99001 acttaaggct gaataaatta aatttctcat tgtaaaacag aacttattta taattttaga 99061 ctcctctctc tttgcttaac tgcctgtgtc ccacaaatct caatccctaa ataatgtcac 99121 cacaaggcta aagtcacatt ggaagacaca ggcttagatg atatcttcta gaaagcacaa 99181 cactgttctt gtcatagaaa caagaccgaa cttaaaccag tgacaagacc catgagttca 99241 acacatgcaa catctagaat ctcttagaat ccccatatta aatatattca aagactatga 99301 aaaaggtgtt gctactttga attagttttg gcaaaaagaa agaacaactt ctattgggac 99361 acagatcttc tgctgttctt ctaaataagg gaaagatgtg ccttagaaaa taaattgtta 99421 aatagctaga aaaataataa catgaaacta atttgagtat taaaaatttc cacaagacat 99481 gtcatctttt caaaacaaag taatttaata gcaatagacc ccaaaaacca aagtaaaggt 99541 acttgacaac attcttatat ccgtttgttc tttgggagta cagaaatttt atatacataa 99601 aatttaagtt tatatttaat attacctcca atttcaaact aaaaataatg cataagctct 99661 caacctacac agattgttat caaagtgatt taaatttaca atgcatatcc aattttacag 99721 tctttgagtg cctatgagaa acaggcaaat acaaattagt ttcatctagc ttccactaat 99781 ggcagagtga actggtcagg catagtttcc agcagataag ctctaaacct atataaagca 99841 cttctaaaaa cacaccatag tcatgtgcca cataatgatg ttttaattag taacagtggt 99901 cccataagat tataatggag ctgagaattt cctattgcct acgatgtcac agccacagtg 99961 accttagcat aatatgtcac ctgctctatg tttagatata tttaagtaga caagtaccac 100021 tgtgttacaa ctgcctacaa tattcaatac agcaacatca ataggatata ccatatagcc 100081 taggtgtgta gtaggctaca tcatctaggt ttgtgtaaat acactctatg atgtttacac 100141 aaagatgaat cacctaaaga cacatttctc agaacatatc cccatcatta aacgatgaat 100201 aactgcattt gaaggcgaca gaaataacca aagaaagatt tacttttaat tatagaaatt 100261 gcaatgagta agaattccaa gtctggtcat ttttgtctaa tggcactcac taccacaact 100321 cccattccaa gtcccataga tacagactta cattctggag atggctgaga acaggaatca 100381 gggcaggcag agtagttatt tcacataaag ctggggacga ggggtcagtg ggtggaatcc 100441 tgagagccac aggagcaaga ccaaatcccc gccttatcat taacatacat atgcgtggca 100501 gagatccaat ggggatttgg agaaaaaagc aatcgaaagt cagagaggaa tctaccctta 100561 caagatctgt aaaatgagtg agtctgctgc tttttcagac tgatagcatt ccctaaacag 100621 tgcacagcta taatggcaga aagctggggc cttactaact tgcggtgtca gaggactgag 100681 ctcatggctg gcacggcaac tgaaaggata gggaagaaac acacaaagga aaaagaggta 100741 agggccataa ttttagtacc aactttgcca gaatccttgg cgatagctaa ctacacaggt 100801 acagggaaaa ccataaggta ctacactccc aaagtggcag ctgtaagatt aaagaactga 100861 gcaaaaccag caactacaca atgcagggaa aataatgtac agtttaagac cagacaagtt 100921 aaatttgttc tacaggctag aacaaaaaca acatgagcaa tcctcagaga acataacaga 100981 atctacaatc aataaaatgt actatccata atgtctagtt ttcaaccaaa atttatttca 101041 catgcaaata aatgagataa catgatccat atgcatgaaa aggaaaaaaa aagtcaatag 101101 aaattgattc agagagggct aatatgtggg acttagcaaa gacttcaaag cagctactat 101161 ttcacctgaa gtaattcaac actaagtaga ctgtggtaat taaagatgca tgttgcaatc 101221 tccagtgaat ttctaagaat tacagtgtaa ttcttagaaa aatagtacaa gcacagagct 101281 aaaaagtgca caaacaaaat aaaatagaag acacaaaagt atttgattaa cataaaagaa 101341 agcagaaaat tccaagaaag gaaatcccag actagatgac ttcacaggtg agtttggcca 101401 aatactttat ttcatttatt ttttatttat ttttactcat tttttaatta tttttttttt 101461 taagagacaa ggtctcgcta ttctgctcag gctggtcttg aactcctgag ctcaaacgat 101521 cctcccacct tagcctcctg agtacctgag attacagtca catgccacca tacccagttg 101581 gaccgaacat ttaaagagta aatatcaata cttctacaac tcttcccaaa aattaaagag 101641 aacggaatgc ttcctaactc agtctatgag ggccagcatt tgcctgacat caaagtcaaa 101701 gacactacaa gaaaacaaca gcccaacatc ctgtataaat acagatgaaa aaaatcctca 101761 acaatatact agcaaaccaa attcagctgt gtattaaaag gatcatacat catgaacaac 101821 taggatctat ctctggaata caagagtagt tcaacataca aaaatcaatc aactgactgg 101881 gcacagtggt tcatgcctgt aatcccagca ctttgggagg ctaaggcagg tagatcactt 101941 gaggtcagca gttggagacc agcccagcca acatggagaa acccccatct ctactaaaaa 102001 cacaaaaatt agctgggttt gatggtatgc gcctataatc ccagctactt gggaggctga 102061 ggcatgagaa ccgcttgaac ccagaaggca gaggttgcag tgaaccaaga tcgtgtaact 102121 gaattccagc ctgggtgaca gtgagactct gtctcaaaaa aaaaacaaaa aacaaaaaac 102181 aaaaaaacaa tcaatataat ataccacatt aataggatga aaggaaaaaa acacatgacc 102241 atttcaactg atgcaggaag agcaccccaa ctcaacacca tttcatgata aaaacattta 102301 ataaacagaa ataaaagaaa aacttcctta acacaataaa ggtcatatat gaagaacaca 102361 cagctaacaa aatactcagt tgcgaaagac tgaaagcttt ttccctaaga tcaaaaacaa 102421 aacaaggcca gacacagtgg ctcacactta taatcccagc acttctgcaa gcaaacatag 102481 gagaatcact tgagcccagg agttcgagac cagcctggac aacatagcga gactccatct 102541 ctaccaaata aaaattaaaa attagccagg catggtggca tgcctgtgtc ccagccactc 102601 aagtggctga ggcaagaggc tcacttgagc ccaggagttt gaggttacaa tgagatatgt 102661 ctgcgccact gccactgcac tccagcttga atggcaaagc aaggccctgt ctctacaaaa 102721 caaacaaacg aacaaacaag tattagcctg cacctgctcc accaaaactt ctatagtgtt 102781 ggaaatccta gccagagcaa tcaagtaaga aaaagaaata aaaggcattc aaatcaaaag 102841 gaaagtagta aaactgtccc tgtttgcaga taacatgatc taatatttac agaaatgcat 102901 aaagacccca ccaaaaaact attaacataa aaaattcagt aaagttgcag aatacaaaat 102961 caacacacaa aaatcagtgt cattcctatg taccaacaac aatctatatg aaaaagatat 103021 taagaaaaca atctcattta caattgcatc caaagaataa aatactagga acagacttaa 103081 gtaagaaagt gaaagacctg tatattcaaa actacaaata aaagaaacta aagaactaaa 103141 gaagacaaac aaatggaaat acttctaatg ttcatagatt agacaactga atactgttaa 103201 aatttcccta ctattcaaag tggtctacag attcaatgca atccctatct aaatcccaat 103261 gtcatttttt agagaaatag aaaaaaaaat ctaaaattca tatggaacca cagaagaccc 103321 tgaatagtca aaagaactct gagaaacaaa taatatagct ggtatcatca tacttcttta 103381 tatatacata cacaaagcta taataattaa atagtatggt actggcaaaa agaaagacaa 103441 actgatgcaa tagagtagag agcccagaaa taaattcatg catatatgat aaaccaatgt 103501 gtaacaaggg tgccatgaat acacaaggag gtaatgacag tctcttcaac aaatacaagg 103561 aaaactagat atacaaatgc aagggaatga aattggaccc ttagtcttac accatacaca 103621 aaaatcaact cagaatggct taatgattta gccgttagac ctgaaactgt aaaactccta 103681 gaacgacacg tgaggaaaaa ctctgtacca ttggtcttgg caatgatttc acagatgtga 103741 ccaaaaacac agacaacaaa tgcaaaaaca gacaagtgcg actacatgaa actaaagagg 103801 ttctgcacag caaaggaaac aagacagtga aaatgcaacc tatgaaatgc gagaaaatat 103861 tcacaaacca tgtatctgat aaagggttaa tttccaaaac gtataaggga ctttacaact 103921 cgatagcaaa aataaaaacc caattaaaaa tgagctaagg acttgaatag acatttctcc 103981 aaagaataca tacaaatagc caacaagtat ataaaaagat cattaatatc tgaccagtca 104041 tcagagtaat gcaaatcgaa accacagtga gataccacct cacatccgtt agaatggtta 104101 ttattaaaaa aaacaaacaa gcaaaaacaa caaaaaacag acacgtgctg gtgaggttgc 104161 ggagaaattg gaatcgtttt acactgttag tgggaatgca aaattgtaca acccctatag 104221 aaaatagtat ggtgattcct caaagaaatt aaaaatagaa ttactatacg atccagcagt 104281 cccacttctg ggtttttatc caaaataact gaaataggat cttgaggagg tatctgcact 104341 ctcatgttca cagcagcact atttacaaca gccaaaatac agaaataacc taaatgtcca 104401 ccagctgatg actgaatttt ttaaatgtga tatatataca tagaacagaa tattattccg 104461 acttaaacaa gaaagaaatc ctgaaatatg caacaatgtg gatgaatctt atggatatta 104521 tgcttagtaa aataagccag tcacagaaag acaaatactg catgattcca ctgacataag 104581 gtatctaaaa tagacaaatg catggaagcg aataatggaa ttgtggctgc caggggctgg 104641 gggagaggaa gaaatgagga gttgttaatc aatgggcata aaattttagt aaagcaagat 104701 aagtaagttc tagagatctg ctgttcaata ccatgcctat agataacaat accatattat 104761 acatttaaaa atctgttaag agggtagatc tcatattaag cgtacttacc attctaaaat 104821 ttaaaaaaga aaatcaccac actaacgtaa ggtattccta ttaacctatc cagaaaagtt 104881 taaaaaacaa agggaagagg agtaaatttt tttttttttt tgagatggag tctcgctctg 104941 ttgtccaggc tggagtgcag tggcacaatc ttggctcact acaacctctg cctcccaggt 105001 tcaagccatt ctcctgcttc agcctcctga gaagctagga ttaaaggagc acgtcaccat 105061 gcccagctaa ttttttgtat tttcagtaga ggtggggttt caccatgttg gcccagctgg 105121 tctcgaactc ctgacctcaa gagatcaggc tgcctcagac ctccctaagt gctaggaggt 105181 gtcagccacc atgcctagcc tgaaattttt aaaatgagat gacagaatgg aatgttaaag 105241 agggaactgg agtctgttaa ttcagtgcaa ataggcgaat agtgtatctc aatttcaaag 105301 tttagaagta gggatagtat ctaacctcca gctctaattc ctcctttgca tctacatttt 105361 ttttctgcaa acctttctaa aacatccccc agctcacttc tcctcaagtc tcctgagccc 105421 agatgagtcg catgtccacg tttaaaccat tacgtaaagg aatgaaacta tcatgatcta 105481 tcatgattta agcactggag caattaagta gtcccttcct tgagcccaaa gatattatta 105541 ggaagaagag cactcaacca aactgagacc ctgtaaaaaa gaaagaatgg aaacaaagga 105601 agtgggaagt caatcagtaa tgtatgctac actgaatata gatccacgtg ccatggaaac 105661 caatttaaga tgtaagagtt ggcatcctgt cttgaaaaat aagattctat ccaattgttc 105721 tttcctggaa catatgaagg gggttttcag gaacttttgc tgacaaattc tgcattctcc 105781 tcaagaagcg actctggata aacttgagaa gtgaacaact gaacttccag agccctagac 105841 catgcaattt agatacggat ttctttttca aagcagatgt tactaatctc tgtaattgct 105901 acagaaacaa cttcataaat ctctgtgtac tacctcattc tgaggatcat gccttcttca 105961 tgtgcccacg aaaataaatt caccataaga tatatcccta gatatctaat ttggcacttt 106021 ttagttacat agtataaact gattggattt tagagaatgg tttaccagat ataataattt 106081 cagggttttt tccaacagtg aaataacttt agttttaata taattaataa ggaaataata 106141 tagattatca ccactggcaa caggatatat tcctagttgt aataataaaa ccatgtcatc 106201 agtatgtatc ttcctttggg atatgatatt aaaagattca tcaaatcaca tttctgattc 106261 atcaaaagag atttcaaatc tctaacacag aagttaaaaa caaatgaata aaactgcaag 106321 ctattttact gaatatgttt ctgagaagat catgcctccc tgccctgttg tttatacaat 106381 atgaaagtgt aaaattctac aggtacacct ataatcataa atttacattc aaccattcta 106441 acagccttta gcaatcttaa tttgccatat ttaccaaaac aaagactaat aaaaactgtt 106501 tcacattgtt actggctttt ctatatataa aggaataaat tatcttggca atgaaaagta 106561 gaacaaccca cctaattttc ctgtacaata ttatacttag agacccaaag ctgtaattca 106621 tcaagaagaa agctagcata attttaaagg aacagtcaac aatagaataa gcctgtagtt 106681 ctgaggcact ttgtttagat taagacacat ttttttcaaa ttaatattat tagtaatata 106741 tactagaagg gaaaagaact tgagaactat tggcataagc atccaaatga aaaagaaaat 106801 cccttagaac caagaaataa aaaactttga tacattacta gttggaagta ccttaagcta 106861 aataaattgt tctcaaactt tataagtctt ccagcaaggg gtgatagcaa ggtaaaacac 106921 agagcttcca tgaacattat taaacaacca tatgaaacag aacacatatc tagaaatgct 106981 tgcaacttta aaatgtacga aaaacaacat actaatgttt acagtaaaag agctccttgt 107041 ttttgtctag atgttaccct gaatttttta cactatatga taaaagaatg aatagaatca 107101 taaatgtggt atgagtatta cctgttaaat ataaatttta gcaacattaa cttcaattag 107161 ttttaaaaaa ggtttgtgtg cttgtgtgta taatttctat atgcttatct ctaccacata 107221 caaacacata tctaagcata agcctttctg gccatttgga tttgagcaca acatcacccc 107281 catctggtgg tggctgaacc cagtcacata tgtaagcaat ttagagacca tttaaatcaa 107341 atgtacaaac agaaaatctt tataatctgt cattacactg tagatataca catatacatt 107401 acattaacta tgaagaagtt gtcacacaca aaaaacttca atgcatttaa gaaacaagtc 107461 acttcagact cacaataaag ggaaaacgag aaacagtttc atggtatttt tttcttttga 107521 tttttatttc aatagctttt ggagtacaag tggttgtgat tacataaata aattatttag 107581 tgatgaattc tgagatttta gtacacctat cacacaagta gtgtactttg tacctaatat 107641 gcagtttttt atcccacatt ctcaataccc ttcctcttct gaatctccaa agtccatttt 107701 atcactctaa gtctttgcgt attcttagat tagctcccac ttataatgag aacatttgat 107761 gtttagttct ccattcctga gttatctcac ttagaataat ggcctccagc ttaatccatg 107821 ttgctgcaaa acacattact tcattccttt ttatggctga gtagtattac atggtgtgta 107881 tatacatata catcacgttg tctttattca ctcattggtc aatgggcatt taggtgggtt 107941 ccatatcttt gaaattgtga attggtctgc aataaagata tgtgtgcacg tgtctttttc 108001 atataatgac ttcttttcct ttggatagat acccagtagt gggattgctg gattgaatgg 108061 tagatctact tttagttctt taaggaatct ccatactgtt ttccataaag gttgtattaa 108121 tttacattcc caccagtagt gtataagcat taccacatcc acacaaacat ctattgtatt 108181 ttgactttct aataatggcc attctagacg gaataagctg ataatctcac tgtggtttta 108241 atttgcattt ctctaatgat tagggatgtt gaacattttt tcatatgttt ctcgaccatt 108301 tgcatatctt cttttgagaa atgtctattc atgtcatttg cccacttttt ggtgggatta 108361 tttatttttc ttgctgattt gagttccctg tagcttctga atactagttc tttgtcacat 108421 gcacagttta caagtcttat cttctcccgt tctgtgggtt gtctgtttat tttgacaatt 108481 atttcttttg ctgtagagaa gctatttaac caggtcccat ttatttattt ttgttgcatt 108541 tgcttttggg atttcagtca tgaatacttt gtctaggcca atgtctagaa gagtttttcc 108601 taggttttct cctacggttt gtatggcttt gggtcttaga ttgaagtctt tgatccatct 108661 tgggttgatt tttatataaa gtgagaggga tacagtttca ttcttctaca tgtggctagc 108721 cagttttctc agcagcattt atttaatagg gtgttctttc cccaatttat gtttttgatg 108781 ctttgttgaa gatcacttgg ttctaagtat ttggctttat ttctgagttc tctattctgt 108841 ttcattgggt ctatgtgcct acttttatac cagaaccatg ctgttctggt aactgtatac 108901 agccttgtag tataactaag agtccagtaa tatgatgcct ccagatgtgt tctttttgct 108961 taggattgct ttggttattt gggctctttt tggttccaaa tgaattttag gatttttttt 109021 cccattcggt gaaaaataat gttggtattt tgatggggaa ttgcactgaa tatgtagact 109081 gctttgggca gtatggttat tttcacaata ttgattcttc caatctatga acacaagatg 109141 tgtttccatt tgtttgtgtc atccacgttt tctttcagca gttttgtagt tcaccttgta 109201 gagatctttc acctccttgg ttaagtatat tcctagtatt ttatcttttt tgcagatgca 109261 aaaggggcta agttcttgat ttgattctca gcttggtggt tggtgtgtag cagtgctact 109321 gatttgtgta cattgacttt gtaacctgag actttacttg tttatcaaat ctaggagtct 109381 tctggagcag tctttattta gggtttttca tgtatacaat cacatcatca gtgaacagtg 109441 atagtttgac tttctttttc caatttggaa gccctttact tccttctctt ccctgattgc 109501 tctgactagg actaggactt ccagtactat gttgaataga agtgatcaaa gtgggcatcc 109561 ttgtcttgtt ccagttctca gggggaaggc tttcaacttt tccccgttca gtatgatgtc 109621 ggctgtgggt ttgtcatata tggcttttat taatttgaag gaaatccctt ctatgcctag 109681 tttgttgagg gttttttatc ataaaaggat gctggatttt accggatgct ttttctgcat 109741 ctattgagat gatcatatag tttttattta taattctgtt tatgtgatat atcacattta 109801 ttgatttgca tatgttaaat catccctgca tccctgaaat aaaacccctc gatcaccatg 109861 cattatcttt ttgatgtgat gttggatttg gttagttagt attttgttga aaatttttgc 109921 atcaatgttc atcagggata ttgctctgta gttttctttt tttgttatgt cctttctgag 109981 ttttggtatc agggtaatac cggcttcaaa gaatgattca gagagaattc cctctttctc 110041 aatcttttgc aatagttgca gtaagattga taccagttca tctttgaata tctggtagaa 110101 ttcagctgtg aatctatcta gccctgggct ttttctttgt tggcaatttt ttttattact 110161 gatttaatct tgctgcttgt tattggtctg ctcagggttt ctatttcttc cggatttaat 110221 ctaggagggt tgtatgtttc caggaattta tccatttcct cgaggttttc tagttgatat 110281 gcataggggt gttaatagta gtctcaaatg atctattgcg ttctgtggca ttggttgtaa 110341 tgtctccagt ttcatttcta attgagctta tttgaatctt cttttttttc ttggttcatc 110401 tagcttaatg gtctatccat tttgtttatc tctcaaagag agaattcagt tcttggtgct 110461 ttcaggggtg gaaactctga gttccttggt tattaagagt ctttgtatga tagctttctc 110521 tgctgctggt tgtagtagca atatgttggt catgtgagca agttcactgt ctcccatagg 110581 gttgaaatgg tagaggtctc ttgacgctta atctcattcc cctgtgatgt gcactttttt 110641 atttattttt tccccagtat tttatttact ggattgaata ttttaggctt caggccaata 110701 caggaggtgt ccacaggtaa aaaccagctt tggctaaagc aggtgggtaa atgcaatacc 110761 caatggtggg gaaaggtccc agccttgaca gaggtggccg aggaagctct cagtgaaaca 110821 cactgaggtc ttaccagagg gagggactgg agccacctta gctcccttgt caggctggca 110881 gaaaattcat ctgcaggaat gttgatgttt caagtggaga ggaattgtgc ctctgcctct 110941 catgcaatcc tgcacttgga aagtgctcct cctgtgggga tgcagtcacc ctgaagtgtt 111001 tcagagaggt tgtctatagg tatacccatg ccaagctccc atgggaaaag ccccacctgt 111061 gtctgcagtg gtgaatgagg gggaaaagaa gtcatcttct ccaaggctat tcacaagcac 111121 caggactgcc tgactgttgg ggcagagctg cagactttcc ttgctgagcc cagcatggca 111181 actgtatttc tcctgaaaga aacttcccac cagtggaaag atctgggact caaggcctcc 111241 cgcctgggtt cttttgtgcc atgggtggtc ccttgatgtg gtacactctc ccttccccta 111301 gaaacaggtg tccctaaggg caagaatact gtgaatgctg ttgttcctct gggtctagat 111361 gcccagtgtg gctgccatac tctaggctgc tgcttggaat gcctgcaagg aatccagaga 111421 tatggcctgt cctcaagtct cccagcagtg ggtaccagta ccagctctga tgggggtggc 111481 aggaaagtga cacagactgt gagattcctt ggttatggat agtcttaatg tgttggcttt 111541 ctgaaatgat ggttgtagta gtaatgaaaa ggttatgtgg acagactcag gacctcctgg 111601 ttggccaggg tagtagaggc aatggtgaca ggtaaggtca tgcacaagtt ttctccttcc 111661 tgggtgcagt attattctac ctggagatga tgtaatggac tgtgttagtt ggcctccagc 111721 caggaaatga tgtgtgcaaa agagcaccag ctgcagtagt taacagtggt atttgtgttg 111781 gtcttatgtt acccaggggc agtgctttgg tttctcagga atgggcaggg ccataaagct 111841 cccaaaagtt tctgtccttt gtgttaaact accagggtgg ctggaggggt aaagccaggt 111901 gggggctggg gtcaggcagg tccctgctct gactctccac aagtggggca agcagcagcc 111961 cctgggcagg agtttgaggg aggtttctct ggcactgggg taatgttcca gagaggtgta 112021 ttaactgtct ctgctgcaca gagaattgta tgtagggagt gggaagtaac aggtgatagt 112081 aaaccccacc cagctcaccc agctccccgc atacttcgca aggcagatct cacacccaca 112141 gtttcccagt agcagcatcc agctaagttc tagaccttct actcaaaact gccccaagcc 112201 atatgccttc cctgcgggaa aacaggaacc acaggtttca ggccacgccc ctcccagttg 112261 gctcacacag ctggggcacc cagctcccta gcttgtggct acaggacact tgccactcac 112321 cctctggttc tggcctagga aattcttcct cattcgaggt tatcatgaaa ttcagttggg 112381 aggttctttc aacctgtgac tgcttcctga gttagctggc ggacttccat gagatacact 112441 gtgaggcaga ataatgaatg gctccccttg gtctactctg cagactggga aagcacgcaa 112501 ggctgctccc actgccgttc ctacttttat atatgtcacc acttcctaaa tcagttccag 112561 tgttgagtag gataaaggtc ttcccctgtg ggttggattg tcaggttccc ctgtaggggt 112621 gtgtatcctg gaggcagtct cttccactct caccctctgg ggacttacag tttttcacct 112681 ggctcacagt ataggctgta gcctgcagct tctttcaaag ctcatggttt attttcaacc 112741 tttcctatca atgaacaatt ttaatagcct tgaaataaaa cttccttttc ttttcttttt 112801 tttttttttt tgagatggag tctcgctgtc acccaggctg gagtgcagtg gtgcaatctc 112861 cgctcactgc aagctctgcc tcccgggttc acaccattct cctgcctcag cctcccgagt 112921 agctgggact acaggcaccc accaccatgc ccagctaatt ttttgtattt tttttttttt 112981 ttttttagta gacacagggt ttcaccatat tagccacgat ggtcttgatc tccagacctc 113041 gtgatccacc cgcctcagcc tcccgaagtg ctggattaca ggcatgaacg accacacccg 113101 gccgaaataa aactttctaa aatcttcacc aactgcaaca tattatagac atctcaaaaa 113161 acattaatta cttcagtaat taatgggtcc ctattgtatg gcaaggttat ataaatcata 113221 ctttcatcta tacataagtg agaaacgttt gcctataagt attgttttat cagattaaga 113281 aatatcatag ctacaataag aacatcagtg gatgaaacaa gtttaaacta cccctctctt 113341 ccacaataaa tgtatcataa tgcttctatg acgtatttgg ttttaagaat attctctcat 113401 ggggtgggtg gagctaagat ggctgaacag gaacagctcc cagcgtgagc gacgcagaag 113461 acgggtgatt tctgcatttc catctgaggt atcgcgttca tctcactaga gagtgccaga 113521 cagtgggtgc aagacagtgg gtgcaacgca ccatgcacga gccgaagcag ggcgaggcat 113581 tgcctcactc gggaagagca aggggtcagg gagttccctt tcctagtcaa agaaaggggt 113641 gacagaaggc acctggaaaa tcgggtaact cccaccctaa tactgcactt ttccaatggg 113701 cttaaaaaaa cggcacacca ggggattata tcccgcacct ggctcggagg gtcctatgcc 113761 cacagagtct cgctgattgc tagcacagca gtctgagatc aaactgcaag gcagcagcaa 113821 ggctggggga ggggcgcctg ccattgccca ggcttgatta ggtaaacaaa gcagccccga 113881 agctccaact gggtggagcc caccacagct caaggaggcc tgcctgcctc tgtaggctcc 113941 acctctgggg gcagggcaca gacaaacaaa aagacagcag taacctctgc agacttaaag 114001 gtccctgtct gacagctttg aagagagtag tggttcttcc actacgcagc tggagatctg 114061 agaatgggca gactgcctcc tcaagtgggt ccgtgacccc cgagcagcct aactgggagg 114121 caccccccag taggggcaca ctgacacctt acacggccgg gtactcctct gagacaaaac 114181 ttccagagga acgatcaggc agcagcattt gcggttcacc aacatccact gttctacagc 114241 caccgctgtt ctgcagccac cgctgctgat acccaggcaa acagggtctg gagtggacct 114301 ctagcaaact ccaacagacc tgtagcagag ggtcctgtct gttaaaagga aaactaccaa 114361 acagaaagga catccacacc aaaaaccctt ctgtacatca ccaacatcaa agaccaaaag 114421 tagataaagc cacaaagatg ggaaaaaaac aaagcagaaa cactggaaac cctaaaaatc 114481 agagcgcctc tcctcctcca aaggaacgca gctcctcacc agcaatggaa caaagctgga 114541 aggagaatga ctttgacgtg ttgagagaag aaggcttcag aagatcaaac tactctgagc 114601 taaaggagaa agttcgaacc aatggcaaag aagttaaaaa ccttgaaaaa aaagtagacg 114661 aatggctaac tagaataacc aatgcagaga agtccttaaa ggagctgagg tagctgaaag 114721 ccaaggctca agaactacgt gaagaatgca gaagcctcgg gagctgatgc gatcaactgg 114781 aagaaagggt accagtgatg gaagacgaaa tgaatgaact gaagcgagaa gggaagttta 114841 gagaaaaaaa gaataaaaag aaatgaacaa agcctccaag aaatatggga ctatgtgaaa 114901 agaccaagtc tacgtctgat tggtgtacct gaaagtgatg gggagaatgg aaccaagttg 114961 gaaaacactc tgcaggatat tatccaggag aacttcccca atctagcaag gcaggccaac 115021 attcagattc aggaaataca gagaacgcca caaagatgct cctcgagaag agcaactcca 115081 acacacgtaa ttgtcagatt caccaaagtt gaaatgaagg acaaaatgtt aagggcagcc 115141 agagagaaag gtcgggttac ccacaaaggg aagcccatca gactaacagc ggatctctcg 115201 gcagaaactc tacaagccag aagagagtgg ggaccaatat tcaacattct taaagaaaag 115261 aattttcaac ccagaatttt catatccagc caaactaagc ttcataagtg aaggggaaat 115321 aaaatccttt acagacaagc aaatgctgag agattttgtc accaccaggc ctgccctaaa 115381 agagctcctg aaagaagcac taaacatgga aaggaacaac cggtaccagc cactgcaaaa 115441 acatgacaaa atgtaaagac catcaaggct aggaagaaac tgcatcaact aacgagcaaa 115501 ataaccagct aacatcataa tgacaagacc aaatacacac ataacaatat taaccttaaa 115561 tgtaaatggg ctaaatgctc caattaaaag acacagactg acaaactgga tagagtcaag 115621 acccatcagt gtgctgtatt caggaaaccc atctcacgtg cagagacaca cataggctca 115681 aaataaaggg atggaggaag atctaccaag caaatggaaa acaaaaaaag gcaggggttg 115741 caatcctagt ctctgataaa acagacttta aaccaacaaa gatcaaaaga gacaaagaag 115801 gccattacat aatggtaaag ggatcaattc aacaagaaga gctaactatc ctaaatgtat 115861 atgcacctaa tacaggacaa cccagattca taaagcaagt ccttagtgac ctacaaagag 115921 acttagactc ctacacaata ataacgggag actttaacaa cccactgtca acattagaca 115981 gatcaacgag acagaaagtt aaaaaggata cccaggaatt gaactcagct ctgcaccaag 116041 cggacctaat agacatctac agaactctcc accctaaatc aacagaatat acattctttt 116101 cagcaccaca ccacacctat tccaaaattg accacatagc tggaagtaaa gcactcctca 116161 gcaaatgtaa acagaaatta taacaaactg tctctcagac cacagtgcaa tcaaactaga 116221 actcaggatt aagaaactca ctcaaaaccg ctccactaca tggaaactga acaacctgct 116281 ccagaatgac tactgggtac ataacgaaat gaaggcagaa ataaagatgt tctctgaaac 116341 caacggaaac aaagacacaa cataccagta tctctggcac acattcaaag cagtgtgtag 116401 agggaaattt atagcactaa atgcccacaa gagaaagcag gaaagatcca aaattgacac 116461 cctaacgtca caattaaaag aactagaaaa gcaagagcaa acacattcaa aagctagcag 116521 aaggcaagaa ataactaaaa tcagagcaga actgaaggaa atagagacac aaaataccgt 116581 tcaaaaaatt aatgaatcca ggagctggtt ttttgaaaag atcaacaaaa ctgatagact 116641 gctagcaaga ctaataaaga agaaaagaga gaagaatcaa atagatgcaa taaaaaatga 116701 taaaggggat atcaccaccg atcccacaga aatacaaact accatcacag aatactacaa 116761 acacctctac gcaaataaac tagacaatct agaagaaatg gataaattcc tcgacacata 116821 caccatccca agactaaaac aggaagaagt tgaatctctg aatagaccaa taacagactc 116881 tgaaattgtg gcaataatca atagcttacc aaccaaaaaa agtccaggag cagatggatt 116941 cacagccaaa ttctaccaga cgtacaagga ggagctggta ccattccttc tgaaactatt 117001 ccaatcaata gaaaaagagg gaatcctccc taactcactt tatgaggcca gcatcatcct 117061 gataccaaag cctggcagag acacaacaaa aaaagagagt tttagaccaa tatccttgat 117121 gaacgctgat gcaaaaatcc tcaatacaat actggcaaac cgaatccagc agcacatcaa 117181 aaagcttatc taccatgatc aagtgggctt catccctggg atgcaaggct ggtttaacat 117241 atgaaaatca ataaatgtaa tccagcatat aaacagaacc aaagacaaaa accacacgat 117301 tatctcaata gatgcagaaa aggcctttga caaaattcaa caacacttca tgctaaaaac 117361 tctcaataaa ttaggtattg atgggacata tctcaaaata ataagagcta tctatgacac 117421 atccacagcc aatatcatac agaatgggca aaaactggaa gcattccctt tgaaagctgg 117481 cacaagacag ggatgccctc tctcaccact cctattcaac atagtgttgg aagttctggc 117541 cagggcaatt aggcaggaga aggaaataaa gggtattcaa ttaggaaaat aggaagccaa 117601 attgtccctg tttgcagatg acatgattgt atatctagaa aaccccactg tctcagccca 117661 aaatcacctt aagctgataa gcaacttcag caaagtctca ggatacaaaa tcaatgtgca 117721 aaaatcacaa gcatccttat acaccaataa cagacaaaca gagagccaaa tcatgagtga 117781 actcccattc acaattgctt caaagagaat aaaataccta ggaatctaac ttacaaggga 117841 tgtgaaggac ctcttcaagg agaactacaa gccactgctc agtgaaataa aagaggacac 117901 aaacaaatgg aagaacattc catgctcatg ggtaggaaga atcaatatcg tgaaaatggc 117961 catactgccc aaggtaattt atagattcaa tgccatcccc atcaagctac caatgacttt 118021 cttcacagaa ttggaaaaaa ctactttaaa cttcatatgg aaccaaaaaa gagcccacat 118081 tgccaagtca atcctaagcc aaaagaacaa agctggaggc atcatgctac ctgacttcaa 118141 actatactac aaggctacag taaccaaagc agcatggtac tggtaccaaa acagaggtat 118201 agaccaatgg aacagaacag agccctcaga aataatgccg catatctaca accatctgat 118261 ctttgacaaa cctgacaaaa acaagcaatg gggaaaggat tccctattta ataaatggtg 118321 ctgggaaaac tggctagcca tatgtagaaa gctgaaactg gatcccttcc ttacacctta 118381 tacaaaaatt aattcaagat ggattaaaga cttacatgtt agacctaaaa ccataaaaac 118441 cctagaagaa aacctaggca ataccattca ggatataggc atgggcaagg acttcatgtc 118501 taaaacacca aaagcaatgg caacaaaaga caaaattgac aaatgggatc taattaaact 118561 aaagagcttc tacacagcaa aagaaactac catcagagtg aacaggcaac ctacaaaatg 118621 ggagaaaatt tttgcaacct actcatctga caaagggcta atatccagaa tctacaatga 118681 actcaaacaa atttacaaga aaaaaacaga caaccccatc aaaaagtggg caaaggatat 118741 gaacagacac ttctcaaaag aagacattta tgcagccaaa agacacatga aaaaatgctc 118801 atcatcactg gccatcagag aaatgcaaat caaaaccaca atgagatacc atctcacacc 118861 agttagaatg gcgatcatta aaaagtcagg aaacaacagg tgctggagag gatgtggaga 118921 aatagcaaca cttttacact gttggtggga ctgtaaacta gttccaccat tgtgcaagtc 118981 agtgtggcga ttcctcaggg atctagaact agaaatacca tttgacccag ccatcccatt 119041 actgggtata tacccaaagg attataaatc acactgctat aaagacacat gcacacgtat 119101 gtttactgtg gcactattca caatagcaaa gacttggaac caacccaaat gtccaacaat 119161 gatagactgg attaagaaaa tgtggcacat atacaccatg gaatactatg cagccataaa 119221 aaatgaagag ttcatgtcct ttgtagggac atggatgaag ctggaaacca tcattctcag 119281 caaactattg caaggacaaa aaaccaaaca ccatatcttc tcactgatag gtgggaactg 119341 aacaatgaga acacatggac acaggaaggg gaacatcaca ctccggggac tgttgtgggg 119401 tgggggaaga ggggagggaa agcattagga gatataccta atgctaaatg acgagttaat 119461 gggtgcagca caccaacatg gcacatgtat acatatgtaa caaacctgca cattgtgcac 119521 atgtacccta aaacttaaag tataataata ataaaataaa ataaaataaa agaatattct 119581 ctcatgacac gtggtgttaa tatattgcag ctaggcaata aaaatgtaaa actcagaaaa 119641 tagaccataa gcgattaaaa ttagtagagg aaatattttc tatcatatcc aattgcttga 119701 ttatactttt aaaaaatgat tcttcaataa tgcaataaag ctttaaaata gaatattaag 119761 ttactaacaa cagtcaaccc acaatcctaa cataaaatac attttctata taaacatggc 119821 ctattttaca cacacacaca cacacacaca cacacacaca cacacacata ttttatattt 119881 tggtttatta tgaggttata gctcaatcag aagtcatttt ctcattttca cttaattata 119941 atatacattt aaacctattt tgtataaagt attcttagaa agtacaataa taatgtgatt 120001 tttttaaagg acactgggag acctaccaat agcagaaatc ccttttatga agcaattaaa 120061 tgctgccaca gagtttaaga aaagagtata gcattggtgg tttaccagct caggctttag 120121 aatcagattt ttacagatac accagatggg tgtccttagg acccactgga atatacattc 120181 tacaaagggt agggggtttc tgttttgttc agtgtcttgt gttttcagtg tctggaatag 120241 cacatagcac acatagtaga tattcaataa acaatggctg aataagtaaa tgaatttttc 120301 caagttttat gataaaagaa tgaccaaagt tttttttatt gttacttcat gcaaagtgca 120361 gatgagccaa aataggaaga ctaggcagag actggtttac cccatggata ttatcatcat 120421 catcatcaat atattgttgt tgtcattatc agttttatgc tgagtttata gataaggaca 120481 gatcttacac ttctcaaaca cagaaactgc attttactca ctctgtagga gattctaaca 120541 agctacaggg cttatgtata ttgaaaccta attgtttgat aaataaatta aacgaattat 120601 aaatcatgat tttcataaac agactaaaga gtattcattc atgaaggtat ggtctggtta 120661 ctttatggga aatttttcct cctttaatct ctatttgtag gataagtcaa aaatagatta 120721 gataatacta aatagctacc tagaattaat gttatcatga gactcaatat gtgggatata 120781 gatatattat tgatgaaaac taaaaatctt gctagatata taagcagact tcataaaacg 120841 aagagattga ttcagccaag aacattaaaa aaaagataca aagtttatca cggtaaccta 120901 tacatttcaa ccataaatat actaacaatt ctgttttcac aagtgggatt agtagagaat 120961 cgttctaaca ttcgagagac taaatgcatt aatatacaga attgggaaaa tttcagaagg 121021 atgagttgtc attatggtgt ataagagata cagtcatcct caccctcctg tgaatcccat 121081 attccatttc ccctcaaaag cagagtatcc atcgatggga ggtggagagg agaatgatga 121141 atccttcctg cctcaattga ggactttaat gtaaaagctg ggctgtcaaa gaaactccct 121201 acaaatcatc cttcttttac tcttgttgct agcctctttt tcattatctt ttgatttact 121261 gttagtcaaa ttaatgccct ctttagaaca agtcaggcaa taaaatatct ataaacaatt 121321 attaaatcaa ttctcagtat agcaaaataa agtttctaag ccagaaagag taagaaagag 121381 aacagatcct ggattatgac aaatttaccc aggtaaatac tgtctcttgg cagccatgaa 121441 tgtagttaag tacattacct tactcatgct taaataacat caccactttg tagtaatgcc 121501 tatctaaatt ttcatctagt cccaagactc ttcttaaaga aaaatcatga atctatgtga 121561 tataacttaa tcattcaatc tacactttgt agttttattt tttaaactat tacataataa 121621 cctgcattaa aaagtacaat aaatatgtct gcactaaaat aattctgctt attctctgac 121681 aataagataa tagtatactt ttcattttca tttccctaat gcctaacgat gttgagcatg 121741 tttacagatg cgtatctgtc atcatcttgg gtgaagtgtc tgttcaactc ttttgcctat 121801 tttttaagtg ggttgttgga gttcctatta agttttgagt cttttgatat gtctggatgc 121861 aagtcctcta tcaaatatac tttgcaaata ttttctcctt gtccacagtt tgtcttttta 121921 ttctcgtaac agtatcattt gaagagcttc ttaaattttg atgaaatcta atttatcaac 121981 tttattctct tgtgaattgt gctttgggta atatctaaga atcactgtct aattcaaagt 122041 cacagaaatc ttctatgttt ctgggagaga cctcccaaca ggagttgaca gacacctcat 122101 acaggagagc tccagctggc atcaggccag tgaccctctg ggacgaagct tccagaggaa 122161 ggagcaggca gcaatctttt ctgttctgca gcctccactg gtgataccca ggtgaacagg 122221 gtctgcagtg gacccccagc aaactgcagc agacctgcag aagaggggtc cgttggaaga 122281 aaaactaaca aacagaaagc aacaacaaca acaacaaaaa gacccccaca caaaaacccc 122341 acccaaaggt catcagcctc aaagatcaaa ggtagataaa tccacaaaga tgaggaaaaa 122401 ccagtgaaaa aatgctgaaa attccaataa ccagaatgcg tcttctcctc caaatgatcg 122461 caacagctct ccagcaaggg cgcagaactg gacggaggat gagatgaact aattgacaga 122521 agtaggcttc agaaggtggg taattacaaa ctctgctgag ctaaagaagc atgttctaac 122581 ccaatgcaaa gaagctaaga accttgataa aaggttacag gagctgctaa ctagaataac 122641 cagtttagag gggaacatta atgacttgat gcagctgaaa aacttagtga agcatacaca 122701 agtatcaaca gccgaataga tcaagtggaa gaaaggatat cagagtctga agatcacctt 122761 gctgaaataa ggcatacaga caagattaga gaaaaaagaa tgaaaaaaaa atgaacaaaa 122821 tctcggagaa atatggaact atgtaaaaat aaagaaccta tgattgattg gagtacctga 122881 aagagatggg gtgaatggaa tcaagttgga aaagactctt caggatatta tccaggagaa 122941 cttccccagc ctagcaagac aggccaacat tcaaattcag gaaatacaga gaacaccact 123001 aagatattcc atgagaagat aaaccccaag acatataatc atcagattct tcacggttga 123061 attgaaggaa aaaatgttaa gagcagccag agaaaaaggt caggtcacct agaaaaggga 123121 agcccatcag actaacagcg gatctctcag cagaaactct acaagccaga agagagtggg 123181 ggccaacatt caacattcct tttttttttt tttgtgatgg agtctcactc tgtcgcccag 123241 gctgcagtgc agtggcgcaa tctcggctca ccacaacctc tgcctcccag gttcaagtga 123301 ttctcctgcc tcagcctcct gagtagctgg gattacaggt gtgcaccatc acgcctggct 123361 aatttttcta ttttcagtag agaaggggtt tcactatgtt ggtcaggctg gtctcgaact 123421 cctgacctcg agatctgccc gcctcggcgt cccaaagtgc tgggattaca ggcatgagtc 123481 actgcgcctg gcctcagcat tcttaaagag aagaattttc aacccagaat ttcatatcca 123541 gccaaactaa gcttcatgaa caaagcagaa ataagaaagg aaatcctttc tagacaagca 123601 aatgctaagg gatttcatca ccaccaggcc tgctttgcaa gagctactga aggaagcact 123661 aaatatggaa aggaaaaacc agtaccagcc actgcaaaag cataccaaaa tgaaaacacc 123721 aatgacacta tgaagaaact gcatcaacta gtgtgcaaaa taacaagtta gcatcatgat 123781 gacaggatca aattcacaca taacaatttt aaccttaaat gtaaatgggc taaatgtccc 123841 aattaaaaag gtacagactg gcaaattgaa taaagagtca agatcaactg atgagctata 123901 ttcaggagac ccatctcacg tgcaaagata tacataggct caaaataaga ggatagatga 123961 aaatttacca agcaaatggg aaacaaaaaa aagcaggggt tacaatccta gtctctaata 124021 aaagagactt taaaccaaca aagatcaaaa aagacaaaga aggtcattat ataatcataa 124081 atggatcaat tcaacaagaa aagctaacta tcctaaatat atatacaccc aagagaggag 124141 caccaagatt cataaaacaa gttcttagag acctacaaag agactcagac tcccacacag 124201 taatagtggg agactttaac accccactgt caacattaga cagatcaacg ggacagaaaa 124261 ttaacgagga tattcacgac ttgaactcac tctggatcaa gtggacctaa cagccatcta 124321 cagaactctc cattctaaat caacagaata tacattcttc tcagtgccac aaggcagtta 124381 ttctaaaatc aacctcataa ttggaaataa acacttgtca gcaaatgcaa aagaatggaa 124441 atcataacaa acagtccaca atgcaatcaa attagaactc gggattaaga aactcactca 124501 aaaccaaata actacatgga aattgaacaa ccgctcctaa atgactcctg ggtaaataac 124561 aaaattaaag gaagaaatca agaagttctt tgaaaccaat tagaacaaag agacaacata 124621 ccagaatctc tgggacacat ttaaagcagt gtttagaggg aaatttatag cactaaatgc 124681 ccacaggaga aagcaggaaa gatctaaaat cgacacctta acatcacaat taaaagaact 124741 agagaagcaa aagcaaacaa acccaaaagc tagcagaaga caagaaatga ctaagatcag 124801 agcagaactg aaggagacac agacacgaaa aatccttcaa aaaattaatg aatccaggag 124861 ctggtttttt gaaaaaatta acaaaataga gcattagcta gactaataaa gaaaagagag 124921 aagaatcaag tagacacaat aaaaaatgat aaaggggata tcaccactga ccccaaagaa 124981 atacaaacta ccaccaagag aatactataa aaacctctac gcaaatgaac tagaaaatct 125041 agaagaaatg gaaaaatact tggacacata cacactccca agactaagcc aggaagaagt 125101 caaatctctg aatagactaa taacaagtcc caaaattgag gcactaatta atagcctacc 125161 agccaaaaaa gcccaggacc agacggattc acagccgaat tctgccagag gtacaaagag 125221 gagctggtac cactccttca gatgatattc caaacaactg aaaaggaggg actcctccct 125281 aactcatttt atgaggtcag catcactctg ataccaaaat ctggcagaga cacaacaaaa 125341 aaagaaaact tcaggcgaat atccctgatg aacatcgatg tgaaaatcct caataaaata 125401 ctggcaaacc gaatccagca gcacagtaaa aaacttatcc accacgatca actcggcttc 125461 atccacggga tgcaaggctg gttcaacata tgcaaatcaa taaacataat ccatcacata 125521 aacagaacca atgacaaaaa aacataagat tacctcaaca gacacagaaa agggctctga 125581 taaaattcaa cacaccttca tgttaaaaac tctcaataaa ctaggtactg atggaacata 125641 tctcaaaata atgagagcta tttatgacaa acccatagcc aatatcatac taaatgggca 125701 aaagctggaa gcattccctt tgaaaacagg cacaagacaa ggatgccctc tctcacctct 125761 cctattcaac atagcattgg aagttctggc cagggcaagc aggcaacaga aagaaataaa 125821 gcgtattcaa ataggaagag aggaagtcaa attgtctgtt tccagatgac ataactgtat 125881 atttagaaaa ccccatagtc tcagcccaaa aactccttaa attgataagc aacttcagtg 125941 aagtctcagg atacaaaatc aatgtgcaaa aatcacaagc attcctatac accaacaaca 126001 ggcaagcaga gagcccaatc atgaatgaac tcccactcac aactgctaca aagagaataa 126061 aatatctagg aacacagcta ctaagggaag tgaaggacct cttcaaggag aactacaaac 126121 cacagctcaa ggcaataaga gaagacacaa actaataaga aaacattcca tgctcatgga 126181 taggaagaat caatactgtg aaaatggcca tactatccaa agtaatttat agattcaatg 126241 ctattcctat caaactacca ttaacattct tcacagagtt agaaaaaact actttaaatt 126301 tcatatggaa ccaaaaaaga gctcatatag ccaagacaat cctaagcaaa aggaacaaaa 126361 ctggaggcat catgagatct gacttcaaac tatactacaa ggctacagta accaaaacag 126421 catggtcctg gtaccaaaag agatatgtaa ctaatgaaac agaacagaga cctcagaaat 126481 aacactacac atgtacaacc agctgatctt caacaaattg gacaacaaca agcaatgggg 126541 aaaagattcc ctatttaata aatggtgctg ggaaaactgg ctagccgtat gcagaaaact 126601 gaaactgcac tccttcctta taccttattc aaaaattaac tcaagataga ttaaagactt 126661 aaatgtaaaa cccaaaagta taaaaacgct agatgaaaac atagacaaga ccattcaggg 126721 catgggcatg ggcaaagatt tcatgatgaa aacgccaaaa gcaaatgcaa caaaaacaaa 126781 aattgataaa tggtatctaa ttaaactaaa gagcttctgc acagcaaaag aaactatcat 126841 cagactgaac aggcaaccta caggatggaa gaaaaatttt gcaatctacc catctgacaa 126901 atgtctaata tccagcatct acaaggaact tacacaaatt tacaaggaaa caaacaaaaa 126961 atcacaaagt gggcaaaggt tatgaataga ctcttctcaa aagacgacat ttatgcagcc 127021 aacaaacatt ttaaaaagct taacatcact gatcattaca gaaaggcaaa tcaaaaccat 127081 aatgagatac cacctcacgc caatggcaat tactaaaaag tcaagaaaca atagatgctg 127141 atgaggctgt ggagaaatag gaatgctttt acactgttgg tgggaatgta aattagttca 127201 acagttgtgg aagaaagtgt ggcgattcct caagtatcta gaaccagaaa taccatttgc 127261 cccagcaatc ccattactgg gatataccca aaggaatata aatcattcta ctgtaaagac 127321 acatgcacat gtatgtttac tgcagcacta cttacagtag caaagacatg aaaccaacgc 127381 aaatgtccat caacgataga ttggataaag aaaatgtgct acacacacca tgcagccata 127441 aaaaaaaatg aaatcatgtc cttggcaggg acatggatga agctggaagc cttcatcctc 127501 agcaaactag cacaggaata gaaaaccaaa cactgcacat tctcactcgt aaatgggagt 127561 tgaacaacga gaacacttgg acacagggag gggaacaaca cacaccaggg actgtcaggg 127621 ggcagggggc aggggagaga gagcattagg ataaataact aatgcatatg aggcttaaaa 127681 cctaggccag gcgcagtggc tcacgcctgt aatcccagca ctttgggagg ccgaggcggg 127741 caggtcacct gaagtcagga gttcaagatc agcctgacca acatggagaa gccctgtctc 127801 tactaaaaat acaaaattag ccaggcgtgg tggtgaatgc ctgtaatccc ggctactcag 127861 gaggctgagg caagagagtc gcttgaaccc aggaggcgga ggttgcggtg agcggagatt 127921 gcgccattgc actccagcct gtgcaacgag agtgaaactc cgtctcaaaa aacaacaaca 127981 acaacaaaac ctagatgaca ggttgataga tgcagcaaac caccatggcg catgtataac 128041 tatgtaacaa acctgcatgt tctgcacatg tatcccggaa cttaaagtaa aattaaaaat 128101 ttaagaaaag aagaaaagaa aaaaataaaa ataaaaataa aaaaatcttc tgagtctcct 128161 actttaagtt ctgtagtttt agattttaca tttaggacac gaattcattc tgagttaata 128221 tttttatatg aaatgaagta tggatacaag ttcactgttt tgcatattga gatccaattg 128281 ttcctgcacc atctgttgaa aagactattt ttttctctac taaatgtgtc aaaaatcagt 128341 tgttcatata tatgtgggtc tatttctgag atctctgttc cattgatctt tatgtcttga 128401 ttacttcaag gagtctttgt aataagtctt caaatcaggt agggttattc ctccaactct 128461 gttttctctt aaaagttatt ctagctattc gagatcctct gaatttctat atgaatttta 128521 aaatcagctt gtcaatttct aaaagccccc ctggaattta aactgaaatt ctgttgaatt 128581 catagatcaa gttgggaaga acatacatct taataaacaa tattaagtct tccaactgat 128641 gaacaacatt gactgattga ttgatttagg tcttctttgt tctcaacaat attttgtagt 128701 tttcagtgta caggttttac acatcttttg ttagacttat ccttaagtct gtcatatttt 128761 tatgctgtta taagtggtat ttttgaattt caatttccaa ttgcttgttg ctaagagtat 128821 acagaaatac aattaatttt ttgtatattt atctagtata ttgaaacctt actcaactct 128881 tttattaact acagcttttt ttgaaaactc aaaacaagat accacgacat acctattaga 128941 atgactaaaa ttaagacagt gtaacatgtg ttggcatggt tgtaggggga agtagaactt 129001 tcaaatacta ctggtaagaa tgtaaacatt ggctgggcgc ggtggttcac gcctgtaatc 129061 ccagcacttt gggaggccga gaagggtgga tcacctgagg tcagaagttc aagaccagcc 129121 tggtcaacat ggtgaaaccc cgtctctact aaatatacaa aaattagccg ggcatggtgg 129181 tgggcgcctg taattccagc tactcaggag gctgaggcag gagaattgct taaacccggg 129241 aggcggaggt tgccgtgagg cgagattgcg ccattgtgct ctagcctggg caacaacagc 129301 aaaactttgt ctcaaaaaga aaaaaaaaaa aagaatgtaa acattacaac aactttgaaa 129361 aataatcttg gcaatttttt aaaaagttaa acatatacct accatatgat ctagacattc 129421 caatcctagg tatttatcca agagaaatga aaggatatgt tcctacaaag acttgtacat 129481 gaatgttcat agcagcttaa tttataatat taaaaaatta ggaacaaccc aactgtccat 129541 cagcatgtaa atggataaac cagctatgat ctatcaatgg aatactagtc cataataaaa 129601 aggcaaaggt attaatacac acaacactga cgaatttcaa atttacgtaa cataaaactc 129661 tccctttcaa agtgtacaat gcagtggttt ttagtacact gacaaggtgg cacaaaagtc 129721 acgactatct aattccagaa cattttcatc accctaaaaa caaatctgta tccactagta 129781 gtcactccac atcccagtct catagaatat acttaatttc cagacaacaa tgtatttcat 129841 ctgggtgtcc caaatcaagc ctcaaacctt tcaagtttga aacattgctt ttaagttaat 129901 ttcaagggtc tatggtataa acactgttac tcttcattat tttattcttt ctaatgcttt 129961 ctaatgaaaa acagaaccac actgataaag acactgacaa ttttagagaa gttccaatca 130021 cagaagtgtc atttagtgta catgtgtata actgtaatag tgactgcaaa attattactt 130081 ttactcaaat attaattcca aatgtagcaa cttatttttc cttggcttct ccttattata 130141 aatgccagga ctggtcaaga ctgaactaca gtggtgaata aaaaagacat gatgtctttg 130201 caaatattaa atcacacatc atctgcaagg cttttaaaaa ttagattaac acaatccact 130261 ttgagatcat tccaccatga actatatctt ctcaaaaact ttttaacaat ctttataaaa 130321 actttaggta aagccaacag cacattggaa ggataatggt tttacactgc attatactaa 130381 ctcagaacat tatgatcaat acttgtgcct gtttttttgt ggggtttttt gtttgtttgt 130441 ttgttttgac agagtcttgc tctgtcacca ggctggagtg cagttgcaga tctcggctca 130501 ctgcaacctc cgcctcctgg gttcaagtga ttctcctacc tcagcctcct gagtagctgg 130561 aattacaggc acacaccacc acgcccagct aatttttgta tttttagtag agacggggtt 130621 tcaccatgtt ggccaggatg gtctcgatct cttgacctca tgatccgccc gcctcagcct 130681 cccaaagtgc tgggattaca ggcgtgagcc accatgcccg gctgtgcctg tttttgtttt 130741 tttttttcga agacatgacc atttgtatta cagaatctgt caattataga agaaattata 130801 atcataatcc aatcaaaatt tagcactttt ttaatattga aggaaagtca gtgatgacaa 130861 acatccataa taagccttgg aataatatgc atctatgttt aatgattcca taatcctaca 130921 aactattgtt ggtgcacaga tattaaatgt gatagagcaa aacagtggct gatacttgga 130981 tttttaccct tctatgttta gataatactg tgaaaacaac acacaagcat tattcattgt 131041 ttaaagttaa cttttttttt tttttaaaga gtcaagatct tgctctgtca cccagggtgg 131101 agaacagtgg tgcaataaca gctcactgca gcctctaact tgtgggctcg agtgatcctc 131161 ccatctcagt ttcccaagta gttgagactg caggcacatg ctatgatgct caacttttat 131221 tttgttttaa tttcctttac ttctcaattt ttacttttta gagccaagag gagatctgtt 131281 ctgttcaccc aggctggtct caaactcctg gcctcaagca attctgcctg cctccacctc 131341 ccaagtaact gggcctccag gtaccagcca ccacacccag cttaaaagtt caacttttta 131401 tatgacattt tccaagccta ctaaatcaca agttagagat attcccaatg attttcctaa 131461 gcaaaaccta tatttcagca gtatacaatt gcaatgctgt tgaagcatta tttgtgaagg 131521 atttccttga ttaacaaaaa atatggaata agcattacag catagcatga ctatataatc 131581 aaaaaatcaa agacaggaaa attcagaaag aacatgtaat gtacatttct ctgatactat 131641 tttggcataa ctgctttcaa agcacaatag caagagtaag cagactcttg atatctgaaa 131701 tggatataat taaagacttc tgagcctaca aagtacaaat gccactacat atatccactt 131761 cagaaaaata agtgggtacc cagtgaaaag aagaaaacca ccaagccaca ttaaaagtgt 131821 tgggcagttc tatgtgcact cactagctat tcaaaatttt gcctgaaaaa ttatgaactg 131881 ctgcatcatg gactttgcag ctggttaata gtcaaaacca tgaacattaa actcttgact 131941 gtcaaggtcc tgcaataaat gtctgagaaa agaatgaaca aagaaataaa tttttaaaat 132001 cttctaggca tagtatcagc atgggatttg tcatgttaaa aacagcattt ctctagaata 132061 tttcctagag aaaagaaaac ttcacataaa gaactgtaca cctattcata atcaccaaaa 132121 actggaaaca acaacaaatg tccttcaatc agtgaatcaa taaacaaact gcaggctact 132181 catacaatgg tattcaatca aacacaaagt aacaaactat tagtacaccc caaaacttga 132241 ataaatctca aaggtattat tttgagtaca agaagacagc ctcaaaaggt cacctacttt 132301 gtaattccat ttatgtgtca ttctcaaaaa gacaaaacta caatgatgga gaaaagatca 132361 gtggctgtca ggactagcag aagcgggtag gcatcaccgt gaagggataa tgagggaatc 132421 tggaggagtg atggaattgt tctgtatcat gactgtaatg gtgcttacat aaatctgtat 132481 atgtattaaa atttacagac ctgtatatga aaaaggacaa ttttactata taattaaaat 132541 aaatagcatt tcagtgaaat caactacttg tacttactca ttctgagaag tgttacaata 132601 ttgtatacag ccaattattt aaaaatttgt ttaatagtac ttgctaatgg tgataacaaa 132661 agtcaaccaa tcaagtgact taaaaatatt taaatctcta ctaacttttt atttaaaaaa 132721 aaaaaaaaac ttaccaagtt cattgagact ctgagcagct tttgttatct ctctacttcc 132781 cccttgcagt tgtccttgga gtctcttact gagattcaag atacgaataa tcctccgaag 132841 caaatcacag gcaacctgtg caaacatcac aatgttaagt tagttgatcc atgaagtttt 132901 attactatta ataaatagta ataacaagca cagtgggttg ataggtgcag caaaccacca 132961 tggcacatgt ttacctatgt aacaaacctg cacatgtatc ccggaactta aattacaaaa 133021 aaaacacagt ggctcacatt gtgcttaata tgtgaatcta cccattcttt ccagatgcta 133081 aagattcact catgcttttt ttgtttccca ccaaccacaa acctaaataa tcgcaagtct 133141 taaaaagcat caaatgtcct agggaacaca cggaagtact atatttacat tttacaataa 133201 actccaattc atcccaagtg gagagaactt ggaattgagg cactgtgaga cccaaggatt 133261 ctttcagtcc taaaccagct ttcttctcta ggccagtggt tctcagtctt gattacaaat 133321 tcaagtgggt agactgaaaa tgttccaatg cccaggccac agccctgacc aattaaaaca 133381 gcatttctat gggttagaac caagcatgag ttatgttttc atgtttttaa tattcccagg 133441 tggtaccaat gtgcagccaa aattgacaat cactatgcta ggttgctcag cttcctacta 133501 ctgcctcctg ttattttcag tcccagcttc tctttcctat atcctgcagg agtagtctta 133561 aaacttctct caatttcatt attttcctaa tctttttttt ttttttgaga acagggtctc 133621 gctcagttgc ccaaactgga gtgcagtggc atgatcacac ctcactgcaa cctctacctc 133681 ctgggctcaa gcgatccttc tgcctcagcc tcccaagtag ctgggactac aggcgtgtgc 133741 caccatgcct ggctaattgt attttttgtg gagatgaggt tttgccatgt tgcccaggct 133801 gctctcaaac tactgggctc aggtgaacct cccacatcag actcccaaag tgctaggatt 133861 acaggtgtga gccaccatgc gtggtgattt ttaatcacta ataaattata aaagagaaag 133921 cacataaaca acaaaactat ttcacacact tcctattcct tttcccagat ctaatactag 133981 gtctttagct atcaaaaggc tgagataaga aataaaggtt catgtatttt aggacacttt 134041 attacagaag ctggattcac agcagttgca tgagggagag tgtggtagag ggtgacatac 134101 tgaataacca acatgtagtg atcttcagcc tcagtagtga ttacaaagga aagagagaca 134161 ttccatactg ctgcatcaca accatttttg aatggcccca aggctaactg agaagcaagg 134221 acatctcccc tgtttaagga gaaaaattct acatggtggc aataagaatt aaaagctctg 134281 agaagccagg gtccctgaac ctccaggtat aaacattctt ttatgggtga aataaataca 134341 tacaccagag ccacacacag ccacacccat gcactcacaa gcaatacacc gtatgtatgc 134401 atacaccata ctcaaacata aaacaaacat atatttacaa acacatgtaa gtcaaagcca 134461 atcaagtatt aaacatgttt acctgactgc ttttcagaga aactaatctg aaatacttaa 134521 ataaaaaaaa aaaaaaagga atgccaaagg aaaatttttc ccatggaaat tgacaattgg 134581 ggcatcttta tcttttaatt aaatattatc cgttttctga atagataata caagcacatg 134641 attcaaaatt caagaaataa aaagaaataa cattgaaaag tcaccctcgt atctctgtcc 134701 caactatgca cttaacaaaa acaatcactg ttattgttgc aggtagttag acagtcaaga 134761 gtggggcagg agagggctac tgccccaccc actaggaacg tcggttgatg atttggaagt 134821 tatcacattg cctctctaaa actgataaat tggcagccag tgccagggag aggacatttc 134881 ctgatggtcc acgcctggtg cattaaagtg ttaattgaat gcaaatgcca gggagaagca 134941 acttccaggg catgtgcttt aagacacaaa atggtggagt atgaccttcc gggggcactc 135001 cacctgaaaa aggaagaaag cctcaggtgg gcatgcatag aaattcctaa acacactgcc 135061 tgtgctcacc tcttaagggt aaggagggca ctggacatgt gggcagccca ccctaaggga 135121 agaatcttgg gaaaggagcc aggctataaa gtcctcggat caaggttaaa cactgcacct 135181 gacctcagtg cccacgtggg actcttccaa gcgtactttc ctttctttct ttcctgttct 135241 aaagcctttt taaataaact tccactcctg ctctgaaatg taccttggtc tctttttctg 135301 ccttatgccc ctcagtcgaa ttctttcttc tgaggaggca agaattgagg ttgctgcaga 135361 cctgtacgga ttcgacacca gtaacttgga tatcttgcac tggaaacatt atcagcacct 135421 tccctgttag aactgacatt tttagataaa caatttggaa cgtaaaatct cattcctctg 135481 agattttaaa tacatcgtaa gaagaaatta agctcttact ctaagactgt catccagaat 135541 tccctaatcc tacccaacaa agggtatgct ctcaaaacat gcaatcttta tttctactgt 135601 ttttattttt agcaaccata cttaatctgt caccaagtct taccaactat cctactgcat 135661 aaaacaaaag ttgttatttt acctctattc ctacaagcat caccctaatc cacatcttta 135721 tcaccattac ctttacatag tggtttctcg gctagctttc aagtctctca ttgaaatcca 135781 cccagccaga tttatcttct taaaaatgca gtttttgggc cggacgcagt ggcttatgcc 135841 tgtaatccca tcactttggg aggccaaggc gggcggatca ccttaggtcg agagttcgag 135901 accagcctga ccaacatgga gaaaccccat ttctactaaa aatacaaaat tagctgggtg 135961 tggtggcata tgcctgtaat cccaactact tgggaggctg aggcagaaga atcgcttgaa 136021 cccgggaggc ggaggttgcg atgagccgag attgtgatat tgcactccag cctgggcaac 136081 aagagcgaaa ctccatctca aaaaaaaaaa aaaaaaaaaa aaaacacagt tttcagcact 136141 ccaaccccct ttttagtaaa ctttaatggc tctctactta cagattatgt tgaatcttct 136201 gggaatacca tttaataaaa tccttagtta ataagtctaa gctattataa gttgatctat 136261 aaggcatcag ttctcaccat ttcccaatat agctgcctcc cctccaatca atttcctttc 136321 cctccctgtt tttttatctt agacttttac tattaaatat aacacaagca tggaaaacca 136381 cactcagaga tacacagcta aaggaactta gaagcaacac acagatcagg agacaggatc 136441 atgtgagcca ccacagaagc cctcatttac tccctcccaa tgccattacc cttcatgcca 136501 accaagagaa accactactc ttttacatta atcacttcct tacttttttt ttttattact 136561 gaagtacaaa tacctaagtg ctatggttta atttttcttg tttttaatct gttctctcac 136621 cttctctttt tttcattgca atttgttgaa tatattggat catctgtcct aaagtttccc 136681 acagactggg ttttgttgac tgtattccct tgatgcagtt taacaagttc ctctatcttc 136741 tatattacct gtaaattagt agttggatgc gggcagcttc atcagattta agtttagtat 136801 ttttggcaag attaaacatt agagggtggt atattttaag agatgtatga ttttctctct 136861 ttttgtgttg agtatcagcc actgatattc aatgtcaaaa tccactgaaa cataagtgac 136921 tgcaaaatag taataatcca attctgtcat tttgtattta taagcatatg aaatttctct 136981 ataaaaataa actttacctc attgactatt tgtttaccta ggagtattgt tcatactgaa 137041 aactaaaaat aaatacttaa tctttccctt attagttttc aaaatgaaga aataatttcc 137101 taatattcac caataatgaa aaattagtcg gtgagggcgg gggggaggtg ggtgttctgg 137161 ggttttgctt tttttttttt ttgatatcat tatgaaattg atttaaacat aataagggat 137221 gcccaaccca tcatggctct tatccttatt gatgctcaaa gtgttccatt tcggccagca 137281 gaagcctctg agagtgatta actgtaccac tttgcctgag actgagaagt ttcccaagtt 137341 ttaggacttt cagtagtaaa actaaaatag ctggtcaccc aaagcctatt caagtcagtt 137401 cctgagttcc tttgtcatga accgagcagt cgttgatagc ctccttggta tctggtataa 137461 aaatgttcca ggatcatttt atacatttca tgccccagac ctggaatcaa agatttctca 137521 aagaagccct ggtttctttt agtagaaaac agcattactt tttttctttt tttgagacgg 137581 tttcgctttt gtcgcccagg ctagagtaca atggcacaac cttggcttac tgcaacctcc 137641 ccctccaggg ttcaagcgat tctcctgcct cagccacctg agtagctggg attacatgtg 137701 tctgccacca tgcctgacta attttagtat ttttagtaga gatggggttt caccatgttg 137761 gccaggctga tctcgaactc ctgacctcat gtgatccacc tgccttggcc tcccaaagtg 137821 ctgggattat aggaatggac cactgcaccc aggcagaaaa cagcatttca agaccacatt 137881 cacagaacta ggtttgttca tttctgggac ttttccatag acatagcaag gaaatgccag 137941 cattaaaata tcacataaat attcatttga tttatgccac aatacacata aaacattctc 138001 agattaacat acatatacat atgcacaaat atatatatac acacaaatat aaggtatata 138061 ctatttttaa atgacattaa agaatctcaa gttcatactg atacttctaa ttaaaaatca 138121 gaagtagagg atttagcctt cttacatcta tataaatata aacagtatat aatttcttcc 138181 acattaagat tcctagctct caaggacaca ggagataaca aaattagaat aatgtataat 138241 tattcatttg ctctgctgca caatacacag aaaatacctg agcaacaaaa caaacactac 138301 agtcaaaagt atgatggcag aaaatagttt ttaaaatttt ttgcatatgc tcttcccact 138361 gttttttaca gctgaattat atttctttgc caaaacatat agtcattaca cactattctt 138421 tataactttc atttaatact agttctcatg cttaacacta gtctcacggt gatttttttc 138481 tagtcagtct ttgctgttct gcagcctcca ctggtgatac tcaaggacac agggtctgga 138541 gtggacccgc agcaaactcc agcagacctg cagcagaggg gcctgactgt tagaagaaaa 138601 actaataaac tgaaagcaat agcatcaaca ccaacatcat gcaaaaactc catccaaagg 138661 tcaccaacag caaagaccaa aaggtagata aatccacgaa gatgaggaaa aaccagcaca 138721 aaaaggctga aaatgccaaa aaccagaata cctcttctcc tccaaaggat cacaacgcct 138781 caccagcaag ggaacaaaag tggatgggga atcagtttga caaattgaca gaaggagact 138841 tcagaaggcg ggtaataaca aactcctcca agctacagga gcatgttcta acccaatgca 138901 aggaagctaa gagctttgaa aaaaaggtta gaggaattgc taactagaat aaccagttta 138961 aagaacataa atgacctgat tgagctgaaa aacagcagga gaacttcgtg aagcatacac 139021 aagtatcaat agccgaatag atcaagtgga agaaaggata tcagagactg acgatcaact 139081 taatgaaata aagcatgaag aaaagatcag agaaaaaaga atgaaaaaga acaaacaaag 139141 cctccaagaa atacaggact atgtgaaaag accaaaccta catttgattg ctgtacccga 139201 aagcgatggg gagaatggaa ccaagttgga aaacacactt caagatatta tccaggagaa 139261 cttccctaac ctagcaagac aggccaacat tcaaattcag gaaatacaga gaagaccact 139321 aagatactcc tcaagaagaa caaccccaag acagataatc atcagattca ccaaggttga 139381 aatgaaagaa aaaatgttaa gagcagccca agagaaaggt caggttaccc acaaagggaa 139441 gcccatcaga ctaacagcag atctctctgt agaaacccta caagccagaa gagagtagca 139501 gccaatactt aacattctta aagaaaagaa ttttcaaccc agaatttcat atccagccaa 139561 actaagcatc ataagcaaag tagaaataaa atcctttaca gacaagcaga tgctgagaga 139621 ttttatcacc accaggcttg ccttacaaga gctcctgaag gaagcactaa atatcaaaag 139681 gaaaaactgg taccagccac tgcaaaaaca aaccaaaagt taaagaccat caacactatg 139741 aagaatctgc atcaactaat gagcaaaata acaagctagc atcataatga caggataaaa 139801 ttcacacata acaatattaa tcctaaattt aaatgggtta aatgctccaa ttaaaaggca 139861 cagactggca aattggatag agttaagacc cattcatgtg atgaattcag gagatccatc 139921 tcatgtgtaa agacacatat aggctgaaaa taaagggatg ggtgaatatt taccaagcaa 139981 atggaaagca aaaaaaaaaa aagtgggggt tgcaatccta gtctctgact gtttagactt 140041 taaaccaaca aagatcaaaa aagacaaaga agcccattac ataatggtaa agggatcaat 140101 gcaacaagaa gagctaacta tcctaaatat atatgcacgc aatacaggac cacccagatt 140161 cataaccaag ttcttagaga tctacaaaga gatttagact cccagacagt aacagtggga 140221 gacattaaca ccccaatgtc aatattagtc agatcaacga aacagaaaat taacaaggat 140281 attcaggact tgaactcagc tctggaccaa gcagacctaa tagacatcta cagaactctc 140341 caccccaaat caacagaata tacattcttc tcagtaccac acagcactta ttctaaaatt 140401 ggccacataa ttggaagtaa aacacccctc aggaaatgta aaagaatgga aatcataacc 140461 aacagtctct cagaccacag tgcaaccaaa ctaaaactca ggattaagaa attctctcaa 140521 aaccgcacaa ctatatagaa actgaacaac ctgatcctga atgactactg ggcaaataac 140581 aaaattaagg cagaaataaa tcagttcttt gaaaccaatg agaacaaaga cacaacatac 140641 cagaatctct gggacacagc taaatcagtg tttagaggga aatttatagc actaaatgcc 140701 cacaggagaa agtgggaaag atctaaaatt gacaccctaa catcacaatt aaaagaacta 140761 gagaagcaag agcaaacaaa tgcaaaatct agcagaagac aagaaataac taagagcaga 140821 actgaaggag atagagacat gaaaaaccct tcaaaaaaaa tcaatgaatc caggagctag 140881 ttttttgaaa agactaacaa aataaaccac tagccagact aacacagaag aaaagagaag 140941 aatcaaatag acacaatcaa aaatgataaa ggggatatca ccactgatcc cacagaaata 141001 caaactacca tcacagaata ccataaacac ctctatgcaa ataaaccaga aaatccagaa 141061 gaaatggata cattcctgga catatacacc ctcccaagac taaaccagat agaagttgaa 141121 tccctgaata gaccaataac aagttctgaa attgaagcac taatagccta ctaaccaaaa 141181 atagcccagg accagacaga tcaacagctg aattctacta gaggtgttgc aaagaggagc 141241 tggtaccatt ccttctaaaa ctattctaaa caatagaaaa agagagactc ctccctaact 141301 cattttatga ggtcagcatc actctgatac caaaatctgg cagagacaca acataaaaaa 141361 tttcaggcca atatccctga tgaacatcaa tgtgaaaatc ctcaataaaa tactggcaaa 141421 ctgaatccag cagcacatta aaaagcctat ccaccatgat caactcggct taatccctgg 141481 gatgcaaggc tggttcaaca tatgcaaatc aataaacata atccatcaca taaacagaac 141541 caatgacaaa aaccacatga ttatctcaac agatgcagaa aaggccacaa aattcaacag 141601 cccttcatgc taaaaacact caataaacta ggtattgatg gaacatatct caaaataatg 141661 agagctattt atggtaaacc catagcattc cctttgaaaa ctggcacaag tcaaggatgc 141721 cccttctcac cactcctatt caacatatta ttggaagttc tggccagggc aatcaggcaa 141781 gagaaagaaa taaagggtat tcaaatagga agagaggaaa tcaaattatt tccgtttgca 141841 gatgacatga ttgtatatat agaaaactcc atcgtctcag cccaaaaact ccttaagctg 141901 ataagcaact tcagtgaagt ctcaggatac aaaatcaatg tgcaaaaatc acaagcattc 141961 ctatacacca ataatagaga gccaaatcat gagcaatctc ccattcacaa tttctacaaa 142021 gagaataaaa tacctaggaa tacaacttac aagagatgtg aaggacctct tcaaggagaa 142081 ctacaaacca ctcctcaaga aaataagata ggacacgaac aaatggaaaa acattccatg 142141 ctcatggata ggaagaatca atatcatgaa aatggccatt ctgcccaaaa taatttatag 142201 attcaatact attcccatca agctaccatt gacattcttc acagaattag aaaaaactac 142261 tttaaatttc atatggaacc aaaaaagagc ccatatagcc aaggcaatcc taagcaaaaa 142321 gaacaaagct agaggcatca cactgcctga cttcaaacta tactgcaagg ctacagtaac 142381 caaaacagca tggtactgat accaaaacag atacatagac caatggaata gaaatataga 142441 tcaatggaac aggcctcaga aataacacca catatctaca accatctgat ctttgacaaa 142501 ccagacaaaa acaagcaatg gggaaaggat tccctattta ataaatggtg ctgggaaaac 142561 tggctagctg tatgcagaaa actgaaactg gagcccttcc ttatacactt tacactgttg 142621 gtgggagtgt aaattagttc agccattgtg aggctatttc tcaagaatct agaacaagaa 142681 ataccatttg accagcaatc ccattacagg gtatataccc aaaggattat aaatcattct 142741 atacccaaag gattataaat cattctacta taaagataca cacacacgta tgtttactgc 142801 aacactgttc acaatggcaa agacttggaa ccaacccaga tgtccatcaa cgttagactg 142861 gataaagaaa atgtggcaca tatacaccac ggaatactat gcagccataa aaaagaatga 142921 gctcatgtcc tttgcaggga caaagatgaa gctggaaacc atcattctca gcaaactaac 142981 acaggaagag aaaaccaaac gctgcatatt ctcactcatc agtgggagct gaacaatgaa 143041 tacatggaca cagggagggg aatatcacac accagggcct gtcagggggt ggggccaagg 143101 gggagggata gcattagaag aaatatctaa tgtagataat gggttgatgg gtgcagcaaa 143161 ccaccatggc acatgtatac ctatgtaaca aacctgcaca ttctgcaaat gtatcccaga 143221 acttaaagta taatttaaaa aaaaaagaaa gaaaaaaaaa agagcaaagg attatttgaa 143281 tagacatgtc tccaaggaag atacataaat ggccaaaagg caccacagaa gatgctcaat 143341 atcattagcc attaggaaaa tgcaaatcaa aatgacaatg agatactact tcatacccaa 143401 taatgataca ggagatagaa attatttagg cagacaataa gggcaacaga gtccttggca 143461 gaatttccct tttaacaaaa agcagctccc aaatcatttc ttttctgaca aagagcagcc 143521 tgaaaaatcg agctacagac atagataagc aagctggaaa ttgaacaggt gaatgccagc 143581 agctgtgtca atagaaaagg gctacctgga agccaggtat gttcaacatg gaggctccat 143641 ctttgctttt ctttgtaacc acatgtacag taaagaagca ggcaacacag caccagccag 143701 ccagagaatt catctgcata ataaaagatt agggcagggc ggccagcttt ttcacatgct 143761 atgcaagtgg cacacctagc cctaactagt tttttgcacc ttaggcaaat agcacacctg 143821 gtctgaccaa tctttcatgc cctatgtaaa ttagacactg cctcctcaag ctcatctata 143881 aaactcaact gcatttcaca ataaaagcag caacccagtt ctccaggacc cctctctgca 143941 gcagagagag ctcttctctt tctttcgcct attaaacttc cactctgaac atcactattt 144001 gtgtgtctgc gtcctagttt tgtgtggctg taagacaaca aaactcagct atttactcaa 144061 gacaacgatg ccacttcaat aagatggcag tattaaaaaa aagtacagaa attagaatct 144121 tcatacattg ttggtaataa tggaaaacag tccagttgcc ttggaaatag tttggcattc 144181 ctcaaaatgt taaacataga gttagcatat gacctagcat aatatgctct caggtatata 144241 cccaagaatt gaaaaaaaaa aaagcacaca caaatattat ttatgcaacg tttgttctta 144301 gcagtctact caaaatagtt aaaaagtaga aacaacctaa atattcatca actgatgaaa 144361 ggataaacaa attgaatgga taaacaaaat gtgatatatt cataaaatga tatattattt 144421 gaccatataa gtaaagaagt acctaagtcc tcattaaaca tcatcaactg gttcttttga 144481 ctttaagtga aaacaatgga ccaggcggct cactcctgta atcccagcac tttgggaggc 144541 tgaggaagat ggatcacttg agtccagaag tttgagacaa gcctgggcaa catagtgaga 144601 gacactgtct aaaaaaaatt aaaatttaaa aaaaaattag ctgggcatgg tggtgcgtgc 144661 ctgttgtctc agctactcaa gaggctgcac tccagcctgg atgacagagc aagaccctgt 144721 ctcaaaaaac aacaacaaca aagcaacgta taatgaaacc cagttttttc ctcctcagtg 144781 ttataatgaa atgacatgtt attcaaggac ctgctgtatg ttcttttact taaagttgca 144841 gtttccaaga acctatcaat gacattaaat aagaacttac tgtactgaca tatgctacaa 144901 tatggatgaa ccttgaaaac attacattaa gaaggcaaat actacataat gaatgatcaa 144961 tttccatgaa atgtcctgaa tagggaaatc caaaaaggaa gactgcagat tagtgcagat 145021 tagttgagag cttggcgtag agaagtggat agagggagta actgctaatg ggcatagagt 145081 ttcttttttg ggtaatgaaa atattctggc attagataat attatagtat aattagatat 145141 tgtattcaca agatgcaatc ttgtgaatat agtaaaaacc cctaaactgt acatgttaaa 145201 gtgattattt tatggtatgt aacttataac tcaattttta ataatgagtg tgacttctaa 145261 caatgatgga atagggtcag acttactttt ccattttaaa caactagaaa attgaacaaa 145321 atacacgaaa taattttaga acaacaggaa tgaataattg gacaataatt ggataacaga 145381 aaacacaggt ctgtgatctt tgagagaaac aagaaaatga gcttaacaat tgtctcagat 145441 taccactggg agacaatttt caggccacag caaagggaag gcaagggaaa cagagcatga 145501 cagtctctct gaagttggga agtagagatt acatttcaag aaagccaaaa aagctagtgt 145561 ttatgggcag agaattggag aaaagctgca aaaacagagg ttgtgcagat aagcagaaaa 145621 atagcttcca gtcttctgag taccaatctg tgcatgtgtg agaggaagca caggaaaaga 145681 aacactgaaa caacagtaga aagacctaat agtaaacagg gctaaactag ccctaggcta 145741 aaggctgttc tggacctgta ctaacaaaga tttaaaacta aatctcaaaa agattaatgg 145801 tggctcatgc ctgcaatccc agcattttgg gaggccaggt aggagtatca cttgaggcca 145861 ggagtttgaa atcagcctag gcaacatggt gagactccca tctctacaat aaaaaaaaaa 145921 aaaaaattag ctgggcatgg tagtgtgtgc ctttagtccc agctacttgg gaggctgggg 145981 caggaggatt gcttgagccc agcacttaga ggttacagtg ggtcaagact gccccactat 146041 actccagccc aagtgacagg gcaaggccct gtctcaaaaa ataaataaat aaataaaaat 146101 taaaattaaa taaaaaaaaa aagatcaaac tgattccaag taacttaagt gtgaaccaga 146161 aaaaagtcca acattattta aaagaaattt taaataatct aggaacccta aacataaaat 146221 acacatttct aatatccaat aaaaaattac caggtatgta aagaagcaaa acacataaaa 146281 cccataacaa gaaacaagaa gaaaagtcaa tcaacaaaaa caaacctaga aataataaag 146341 aattaaatgt taaaatatat tttataggcc gggcacggtg gctcatgcct ataatcccag 146401 cactttggga ggctgaggca ggcggatcac ctgaggtcaa gagttcaaga ccagcctggc 146461 caatatggta aaaccccatc tctactcaaa atacaaaaat tagccaggtg ttgtggcaca 146521 tgcctgtaat cccagctact caggaggttt aggcaggaga atcacttgaa cccaggaggc 146581 ggaggttgca gtgagccaag atcctgccac tgcaccccag cctgggcaac agagcgggac 146641 tctgtcttga aaaaaaaaaa aaaatttata aacatgtata tgttcaagaa ggtaaatgaa 146701 aacatgaatg tcagccggac acagtggctc atacttgtaa ttccagcact ttgggaggcc 146761 aatacaggtg gaatgcttga cctcaggagt tcgagaccaa cctgagcagc aggacagaac 146821 ccggtctcta caaaaaatac aaaaattggc tgggtgtggt ggtgcatgcc catagtctca 146881 gctactcagg aggctgagat gggaggatag cttgggcctg ggaggaagag attgcagtgg 146941 gttgagatcg tgccactgca ctccagcctg ggcaacagag tgagaccctg tctcaaaaaa 147001 aaaaaaaaaa aaaaaaaaaa aatatatata tatatatata tgtatttatg cgtgtgtgta 147061 tttatataaa tgaaattgaa atataaatgt cataacagaa agagaagata taaaaagaca 147121 caaatataaa ctagagatga aaactatagg ccggggatgg tggctcacac ctgtaatccc 147181 agcactttgg gaggctgagg cgtgcggatc acgaggtcag aagttcgaga ccagcctggc 147241 caatatgttg aaaccctgtc tctactaaaa atacaaaaat tagctgggca tggtggcagg 147301 cacctgtact cccagctact caggaggctg aggcagaaga attgcttgaa gccaggaggc 147361 gcggttgcag tgagccgaga tggtgccact gcactccagc ctgggcaaca gagtgagact 147421 ccatctcaaa aaagaaaaaa aaaaaaaaac tataaaaaca cacaggatgg gattaacagc 147481 agattacata cttaattaaa aagattagta aatttacaag caatttcagg tagattatag 147541 atctaaacat atgaggcaaa acaatacgtg cttttgaagg caatatacaa tatcttcata 147601 gcttcagggt agagaaagtt atcttgtgag aatttttcat aaaagagaac taaccataaa 147661 agaaaaatta gataaattta acttctctcc aacaacagac aagagcgaga aggagaaaaa 147721 aatggatagg cagaggaata tctgaaacac acataataga tgacttacaa gaggttcaat 147781 ctccttagta attagctctt agaaaaacaa aaccacaaca agataccacc acacacctaa 147841 aatattagca aaaactaaca agtttgacaa cactaagtgt tgacaaggat acaaactaaa 147901 aggaactctc atatactgat agcatatgtg gtataaccat ttggtaaaat agtttggcat 147961 tatctaataa agttgaaaat cagaaaccct ataacccaga aatctcactc ccaggatata 148021 acttacacaa aggggtatta gcatgagcca acatatattt gtaaaaaagt tttcacagcc 148081 atattactca taatggtccc aatctggaaa caatccaaat tccataagca gaatattcat 148141 ataatgacat attctacagc aataaaaatg aaacaaaaca agcaaggcca ggcatggtgg 148201 ctcacgcctg taatcccagc actttgggag gccgaggcag gtggatcact tgaggtcagg 148261 agttcgagac cagcttggcc aacatggtga actctgtctc tactaaaaac acaacaaaaa 148321 aatagccaaa tgtggtggca ggtgcctgta atcccagcta cttgggaagc taaggcagga 148381 gaatccttga acctgggagg cggaggctgc agtgagccaa gatcgctcca ttgcattcca 148441 gcctaggcaa cagagcacaa ctctgtctca aaaaaaaaaa aaaagggaaa agcaatatag 148501 atgaattcaa aaaaggataa tactttccaa aaaacagggt attaaagcat aatttttagt 148561 taaaaaaaat caatgtgata tgtttatgag tgacattagt aagatggcag aggaagactt 148621 tccaggactc cagcttgcag acacattaat ttgaaccact atccatacac aaaaacatct 148681 tcacaaaagt taaagaaacc aggtaagaca ttacagcacg tgaatacagc acagaaataa 148741 aaagaggcac attgaagaga gtaggaagga cagtattaca ttacctgcgt cccccaacac 148801 cagccagcta gcaaggagac agatgctcgt tgaggagaga aaagagaagg aattgagcac 148861 tggactttgc ctcagcccca acactaggcc tgccccatta aaattgtgtt gggcaggctg 148921 cccccgcccg actccaggcc agtacttcca gactacatct cctggcccac tccaatacca 148981 ggctggtccc agtgacccca ggccccagac cagccctcgg cttcaggcta atcccaactc 149041 aggctccagg cccaacccag caccaggcaa ccctgcatag cccaagatgt taggctagcc 149101 ctagtgccag gttagcactc agacttccag caccaggcca gcccctgaat actcatgctc 149161 cagaccagcc caagtggccc caaacaccag gtctgccccg catcaggttg gcacctgtgg 149221 cctcagacac caggccagca cacaaggaca caggctccaa gcctgcccta cgggtccaca 149281 ttccaagtca gcccctgtgg ccccacactc caacagacac aaagttcagg ctcctctggc 149341 agagccaggg tccaggtcca tcccagcaga acctggcctc ctctggcaaa cccagggtcc 149401 aggctcatca cagcagaacc cagtgccagg cctgccccca agacctgatg actcacacct 149461 gcctcagtgg cccagtctcc aggcaagccc ttgcagaccc agcctccagg ccagcactca 149521 ctcacccagc ctctaggaag accctcatgg ctccaggcac aaagcaagca cccacagact 149581 gcagccttca gtggacccag agtccaggct tatttcagca gaccaaggct ccaggaccac 149641 ccttgcaaac ccaagctcca ggctaaccct ggtaaaccca agaccaggcc tgactcacag 149701 actccaggcc cacccccatg gtcccaggtg ccagtgaaag ccaggcattg tggactcagg 149761 ctccaggccc atccccaagg tcccaggtgc cagcaactgg ttggctcctg cagattcaag 149821 ctaaaggccg gtccaagtgc caggtcagct cccaagggac caggcttcag accagctctt 149881 gtggactagg gccctaatag tcaacgtgtc cactccacaa ggggccagct tggcactgtg 149941 aaccctggat ctaagtctgc tccagcaaac tcatgaccca catttgcccc agtaaatata 150001 gtctgcagac caaacctcat ggacccaagc accagacctg tgcacctgct aacctaggta 150061 ccacaacagc ctgcctgaag acaccagcag taagcctgcc cacagaccac accagaaagc 150121 ctgcctagga tctctggtca ggctaattgg tgaagaactt tgccagccaa agccagtctg 150181 caaagagtgg agtaagttcc tacttcttca aatgtgcagg caccaataca aggcaacaag 150241 aatcacaatc agggaaacat aacaccacca aaggaacaaa cattaaagat attgaggttt 150301 atgagctgcc taacaagcaa ttcaaaataa tcatctttta aaaactccat gaactacaag 150361 acaatacaac tagacaagta cacaaaatca ggaaaacaat aaacaaacaa aatgagaagt 150421 tcaacaaaga aacagaaacc ataagaatga accaaactga aattctggaa ctaaagaaca 150481 caatgactga actaaaaaaa ttccaaagag agcttcaaca gtagactcaa tcacataaaa 150541 gaatcagtgt gcttgaaaag gggttatttg aaatgatcca gtcagaggaa caaaaagaaa 150601 aaagaaaaag agtaaagaaa gcctttgaga attacgggat gccatcaagc aaaccaattt 150661 atatgttatg ggtgtcccag aaggagcaga gagagaagaa agtttatttt aaaaaataat 150721 aacaaaaatt ccccaaatct gtggaatgta aaaaatatcc aggtatagga aactcaaaga 150781 tctccaatca ggttcaatgt aaacaagact accccaagat atattacaat caaattgtca 150841 aaaatcaaaa acagagagga tcttgaaagc aacaagagaa aagaagcata tcacatacaa 150901 aggaatccca agatggctat gaacagattt ctcagtagaa atcttataag ccaggagaga 150961 cagggatgat ctattcaaag tgctgaaaga aaataggctg ggtgcagtgg ctcacacttg 151021 taatcctagc acgtggggag gccaaggcag gcagatcacc cgaggtcagg agttcgagac 151081 cagcctggcc aagaaggcga aatgccatct ctaccaaaaa tacaaaaatt agccaggcgt 151141 ggtggcgggc acctgtaatc ccagctactc gggaggctga ggcagaagaa tggcttgaac 151201 ctgggaggtg gaggttgcag tgagcagaga tcaggccatt gcactccacc ctgggtgaga 151261 gagtgagact ccatctcaaa aaaaaaaaaa aaaaaaagaa aagaaaagaa agaaaaaaac 151321 caacaaatca attttgaaac aacagacttt aactacactc taaaacaaat aaactgaaca 151381 aatacagaac actccgtcca acagcagcgg aatacacatt cttctcaaat gcacatggaa 151441 cttttttcag gatagatcat atgttagacc agaaaataag gcttaacaaa cttaagaaga 151501 ctgagatcat atcaaatatc ttttccaacc acaatggtat taaactagaa atcaataaag 151561 gatggatttc gaaaaattca ccaaaaattt ctcaaaaaaa ataaacaaca tgcttatgag 151621 caaccaatgg atcaaaaaag aaattaaaag agaagtttta aaatgtcttg agacaaatga 151681 cacaacatac cgaaagttat gggatgcagc aaaagcagtt ctaagacgaa agtttacagc 151741 aataaatgtc tacatcaaaa aagatttcaa ataaataacc tagtattaaa cctcaagaac 151801 tagaaaaaga acaaacaaag cccaaagtta gtagaaggaa gaaaataata aagatcaggg 151861 cagaaataaa ttaaatacag actaggaaaa caataaagat taataaaacc aagagttgtg 151921 ttttggtttt ttgttttttt tttttttttt tgagacaagg tctcgctctt ttgtccaggc 151981 tccagtacaa tggtgcaatc ttggctcgct gcaacttcca cctcccaggc tcaagcaatt 152041 cttctgcctc agcctcccga gtatctggga ctacaggcgt gtcaccatgc cttgctataa 152101 gagttggtgt tttgaataga taaatcaaat caataaaact ttagctagac taagaaaaaa 152161 aagaaatctc aaaaccagaa atgaaaaaag aaacattaca actgatatta caaaaatata 152221 aagaattata agaaactact atgaacaaat atatgccaac aaactggata acttagaaga 152281 aactgataca tttctagaca tatacaactt acaaaaattg aatcataaag aaacaaaatc 152341 ttaacagatc aatagtgagt acagagatta aatcagtaat aaaaagtctc ctaacaaaga 152401 aaagcccagg acttgatggc tttactgttg aattctacca tacatttaaa gaaaaactaa 152461 taccaatcct tctaaaactc tccccaaaaa aatcaaagat ggaaaagtat ttccaaactc 152521 attttattag gccaatatta cccttctgtt aatgacaaag acactacagg aaaagaaaat 152581 tacaggtcaa tattcctgat caacacagat gcaaaatcct caacaaaata ctagccaatc 152641 gaattcagta gcacattaga agaatcattc accatgatca agtgagattt atctctgggg 152701 tgcagggata gttcaacatg tgcaaatcaa taaatgatat accacactaa cagaatgaaa 152761 gacaaaaatc atatgatcat tttgatagat gcagaaaagg cacttggtaa aattcaacaa 152821 cacatcgtgt taaaaactca aaaaaaactc tatagaagga acgtacctca acgcaataga 152881 ggccatatac gaccagccca caactaacat cataatcaac agtgaaaagt tgaaagcttt 152941 tcctctaaga tcaggaacaa aacaaggatg cacactctca ccactactat tcatcaacat 153001 agccactgga agtcctggcc agagccatta ggcaagtgag aaataaaagg catccaaatc 153061 tgaagggaag taaaattatc cctgtctgca gacagcatga tcctttagac agaaaactag 153121 ataaaggacc ctaaagactc caccaaaaga gtgctagaat aaatgcatac agtaaagtca 153181 gaggatacaa aattaacata caaaaatcag tagcatttct atacactaac aatgaactat 153241 ccaaagaaat aatcagaaaa acaatcccat ttcccattta caatagctac aaaaaaaaaa 153301 agttagaaat aaatttaacc aaggaggtga agcatctata cattgaaagc tataaaacat 153361 tgatgaaaga aattgaagaa gatataagtt aataaacaat tatacactgc aatgacagtg 153421 aaaagaattt gctttagatt tgtatttttt gagctagaca tcttaaggct ttttagtatt 153481 tttttctcct tttctcttac cttctgttcc cattctctta cttcttccaa actcatactc 153541 ttaaattttg tccacttatg aaaacaaaat gaataataaa gtataaagaa ataaaagtaa 153601 gaaactacct gaagtcttgc tagttgtgca gtccgggcaa ctatcttatt gtatggttca 153661 acaatttttg cttttatcct acaggaaaag agaggagtga gaataaaatc attttcagaa 153721 tatcaatgga taaaaacatt cccattccat ttatttataa aattaagaac agtacctatc 153781 aacagctccc tgtaaagccc caattctcgt ctgcatcatc tgaagaacac ctaggaaaac 153841 ataagtttac attggcttta caaatttaca gggcatccaa ctgggtaaat agttaaatta 153901 aaagaaaata catgcatata taattttaac tttttttgtc accatcctaa tccaagcaac 153961 cagtatcttt tacccacact gctgtaaaag cctgtaactg gtctccccct ccaatctttg 154021 cctcacaaaa gaagcctgac agggaatggg aactgactgc ttaatggata tggggtttcc 154081 atctggggtg atggaaaagt tatggaccta gatactgagg atggttgcac aacactgtaa 154141 atgtacttaa tgccactgaa ttgtatgttt taaaatgcta aaaacaataa attttatgtt 154201 atgtatattt taacacatta ccaaaaaagt atgactgatc ctccatgata tcccctttgc 154261 cccatcactc ctatattaaa actatagtcc acagcttagt ctacaactct ctaggcttac 154321 aacatgagga aattttcaca aattggaaaa atatttccag gaagatcaag cataaatttt 154381 attatttagg ctttcaatta gtacagatat ttaattattt tgaatgccat tctttaaaat 154441 tcagtttcta tatccaactg gcatagttgt tttgaaagca tagtattcta ttaatatcta 154501 aatttaagta tagacatgtg gtcatatttt aaatgaaata atattggaac aaataaacat 154561 tcctatcagc atacaagtat acaataacag agttcagttc aatagaatcc tctctgaacc 154621 aaagattccc acttttctgc tggttttgtg gtgtattcac catacactgc caagaataca 154681 ctgccattac ttataaccat tttcttacat catattctct ccacaaccca gtctcaatgg 154741 gtttccacca atactccacc aaaacagatg ttaccaaggt caccagaaat ctatgagttg 154801 ccgaattcta ttgttggttc tgtcttaacc ttgttccagc tctcagaaat tcacagcagt 154861 tgactactcc ctttctaaaa cattttttgt ctaggctttc tttttttgag acagagtctc 154921 gctctgtcgc ccaggctgga gtgcagtggc ttgatccccc ctcactgcat gctccgcctc 154981 ccgggttcac gccattctcc agcctcagcc tcccgagtgg ctgggattac aggcgcccac 155041 caccacgccc agctaatttt tttttttttt tttgtatttt tagtagagat ggggtttcac 155101 catgttagcc aggatggtct cgacctcctg acctcgtgat ccacccacct tggcctccga 155161 aagtgctggg attacaggcg tgagccacca cgcctggcct tgtctaggct ttcaaatcat 155221 cacacttcct gggtctccta ttatctcatt agttgttccc acttctcctc tgtcagacct 155281 ctacatgtgc tggagtaccc caagacgtga tcctcagcca tctgctcttc tccatctaca 155341 atctttcagg aatttcattc agtcccaaag ttttaaatag tatctgttca ataaaatact 155401 gattttatac acacacacac atatacagac gtatacacat acatctatgt aaaaagattt 155461 tctttatata tctgcctatg ttctcattcc ctatctctct gtcctctttc tctctcttcc 155521 cctgtactgt tacatcaaat gtaagatacc caagacagtt tatgccctac cccccaacct 155581 atcaaaaatc acacttctct ccaggctatg tcatctccat tactcatacc atcatctatc 155641 caatttttga gactaaaatc ctaggaataa ctcacggtga cttttttcca tcagctcttt 155701 catctgccat caacaaaccc tatcagcttt gcctacattt tactccatct gccaccactc 155761 taatccaatg caccatcaac ttcaactatt ctaatagctt ccaaaccagt gtccctgatt 155821 ccaccattgc ctttttctac agagctatca gagtaaacta ttccgagtat taatctcatc 155881 atacaataac ctgattaaac tcttccaatg gtttaccatt gcatttagaa ttaaatccaa 155941 attccttccc aagaacctta tctacgatcc tctccctcct tcactaaaca tcagctattc 156001 tgggctactt tctgcccctc aaacatgcca aattaattcc tgctttccca tttattgttc 156061 agtcttctcc tcaaatcttc acatgatggc tcctcctcat cattagatct tggctcaaac 156121 gtcatcttct caaaaaggcc ttccctctcc ttcatccatc cccatcacct gttttatttt 156181 ctctatagaa tttacaaata aaatttttct ttctgttttc tgagatacag tctcactcta 156241 ttgcccaggc tggggtgcac tggcatgatc tcggctcacc gcaacctctg cctcccaggt 156301 tcaagcaatt ctcatgcctc agcattccaa gtagctgggg ttacaagcat gatccaccat 156361 gcccagccct gttttctact tgtactttgt tttttttttt ttatctacct ccaggaccag 156421 aacatagttc caaaaggaca gtgactttgt ctatcattca caagtgccta gtacttaaca 156481 agttattaat tactatttga ttgcctgaat gaacaaataa atcaatgagg ctacctttgt 156541 tttttgtttg tttgttttga gatggagtct cgctctgtcg cccaggctgg agtgtagtgg 156601 catgatctcg gctcactgca agctccacct ccggggttca cgccattctc ctgcctcagc 156661 ctcccaagta gctgggacta caggcaccca ccaccacacc tggctaattt tttgtatttt 156721 tagtagacac ggggtttcac catgttagcc aggatgggag gctacctctt tttgacaaaa 156781 agaagcttaa taaatattcc tcaaaattcc atgaaaacaa aaactctatt gctatagctt 156841 tctgaaacct agcaaattct caaaagtaat atttggaaga atatttttaa aaagaacatt 156901 tttaaataaa ataggcttaa acatacctcc aaactactta catgctaaaa aaattttaca 156961 tatttaggag aataaagcag atgaagtaga agctagaata atattcaatt tttctaatat 157021 gtatacacac cacacttact tttcagcatc tcaaaataac atctagacac atcatgctaa 157081 aatgaatgct aaccaattca catttaaagt tttttattta ttttttgaga cagggtctag 157141 ggtcttgctc tgtcacccag gctggagggc agggacacaa tcacagctca ctgcagcctc 157201 aacctccttg gctcaagcga tcctcctaca tcagcctccc aagtagctgg gactagaaac 157261 atgcaccacc acgccaagct aatgttttat tttttataga gatagagtct cattatgttg 157321 cccaggctgg tcttgaactc tgggatcaag tgaacctccc accttggcct cccagagtgc 157381 tgggattata ggcgtgagcc actgcaccct ggcctacact tcaatctttt atgttcaaat 157441 aatctagtga atatcctgag ataatcatag gaaaaaagta gccagccttt gtactctaca 157501 cactcacaca agagataatc attctaattc agtctagcag tgtatttcaa atattttaat 157561 ctttgaacat cattctggga aaaacaagca cattataaac taaaacacat atttaaaaac 157621 cacaatgaga aatatttctg ttgcggtgat tcatctagac ctatacacac tcaatgtact 157681 accatgccat ccttattaat aaaaaactag catgttttca agattgccat taaaatatta 157741 catgctctca aaaatgcata aaaatattat gacatttaaa aacataatta aaactagtga 157801 gcttctgcac agccaaagaa actatcaata gaataaacag acaacctaca gaacgggtga 157861 aaatgtttgc aaactatgca tctgacaaag gtctaatatc cagaatttat aagaaactta 157921 attcaacaag caaaaacaca actccattaa aaagtgggca aaagacataa agagacattt 157981 ctcaaaagaa ggcatacatg caggcaacac acatgaaaaa aaaatgctca acatcaccaa 158041 tcatcagaga aatgcaaatc aaaaccacaa tgagatacca tctcacacca ttcagaatga 158101 ctattattaa aaagtcaaaa aacaacagat gctggtgagg ctgttgagaa aagggaatgc 158161 ttatatgctt ttggtgggaa ggtacagcca ctgtggaaag cagtttggcg atttctcaaa 158221 taactcagaa ctaccatttg acccagcaat cctgttactg gatatatatt caaaagaaaa 158281 taaattgttc caccataaag acacatgcac ttgtatgttc atcacagcac tgttcacaat 158341 aacaaagaca tagaatcaac ctaggtgccc aacaacagtg aattaaagaa acatggtaca 158401 tatataccat ggatactaca cagccataaa aaaagaatga aactatgtcc tatatagcaa 158461 tatggataga actgaaggtg attatcctaa gcaaattaag gcaggaagag aaaaatcaaa 158521 tactgcacag tctcacttat aagtgggagc taaacattgg gtactcatag acctgaagat 158581 gacaacaaca gacaataggg actactagag gagggagtac ctaactgttg ggtcctatgc 158641 acactacctg ggtgacagga tcatttgtac cccaaacctc agcattatgt aatattctca 158701 ggtacaaacc tacacatata cccccccgga tctaaaataa aagttgaaat gactaataat 158761 agtaattaat ttccagtact aagtcaatcg gtacttagag cttcctaatc ttcctaagaa 158821 cctagtcaga tatgtatccc agatttcaaa taacaaacta ttatttccaa aaatactaag 158881 ttcttcatca gggtatctta gaaatattgt acatgctgag aataagttag tatctttgag 158941 tacatgagaa atactggggg aagcattaaa gaatactgca tcacagaaaa caatatcctc 159001 tctccttaag aatttgatca acatcgagaa aaacccctta cctttcttct gatgtacaat 159061 gattaataat agtaatcaca agtaataatg taaaacacaa tagtaacatt tcatatttat 159121 gagatgtagt ttaatttaat caagagggaa tgaaaaattc ataaaagatt tagtcaaaca 159181 aaaataagca gatttgtaaa aataaaaatc caattattac ttgactatat tttcattaaa 159241 aagagttgca aaaaaggctc caagaaaggc aaaaataaga aaaaaaattc atgccttctt 159301 ccactactag tcactctggt attgttgatg tttttattgt tttaattaac taacacttac 159361 aggtgcacta aatatgtgca ggcattgttc taagtatctt gctaattcat ttaatcctca 159421 gaacaatgct gtgttagtac acattatact cctgttttac aaataagaaa actgaggcag 159481 aaaagggtta actaacttgc tcaaagtcaa acacctagta agcagtaaaa caaggatttg 159541 aacccagatc atttagctca ggggccagca aagtttcact aaagtcttac tgaaacatag 159601 ccacacacat ttgttttcat attatctgcc tactttcact ctacaatggc agagtttagt 159661 agttgcaaca gagactgtat ggctggcaaa gtctaaaata cttgccatcc aaagtagctt 159721 gccaaagctt gccaatccct ggtctagctc tacataatct ctagcagtta ttagaaatat 159781 accttccaac gactcaatcc cagttgcttg tgccagtaaa tcttcatgtc ttgcaacaac 159841 ctgaaaatca aaaataaatg attagctttt gctctgttag gtattcttcc ttgtaaccac 159901 aatgtagaat ggacagtctc tcttggtgta gaaacgaggc aaatacaaat acaaaagaaa 159961 aaaccacagg tttttaaggt gtgtctgaga tgagttgtaa gcgggaacac acatacattt 160021 acagttcaac acagagcaag aggagttaca gccttttctt agccctgtct tagtctgttt 160081 ggactacaaa cacaaaatac cttagactga gtaatttaca aacagaagaa atttattgct 160141 cacagttctg gggtttggga agtccagcat caaggcattg gcaaattcag catctggtga 160201 gaacccattc cttacagatg gtaccttgta tctatcttca catggtggaa ggggcaaaca 160261 gtcttcctcg ggcctctcca atatgatatg atagtattaa gagccctgca ctaaacccta 160321 gtaagagccc cgagctctta atactatcat attggagatt aagtttcaac atatgaatgt 160381 taggggaaca caaacactca gacaagcccc ttgctatctt aaagagatgt acaagtggag 160441 ttgtgccaaa aaatgtaaaa aacagctcat tattgccaca ccattattat gttttccctt 160501 taaagaacat tttcaaacat caggttctct ttgttcttga atttgattca gcaccaccaa 160561 ggctacatcc tactctggaa agatacagca ttcagtgctc acctatacct gaacatctct 160621 ctgcttccag tttcccagtg tcacataccc acagcaaatc cttttcccta cagatgtttg 160681 atacttttat gaaaattagg gcactgcaga tggccagaac aaaaaaacca aggttactaa 160741 aggcgacatg caagacattc ctagaaacat cacaaaatca tgcaaatctt tagaataatg 160801 caggaatcct tattattgtg tgccttttat tccaatgaat acttagtttc atgtctagtc 160861 tgctgactga agttcttaat caccaacgac tggaacccct aaatcctcca ctttgtcttg 160921 tttcagatat cactgtcatt ctgtcaagtt tctttctatc atatctactc aaagtattac 160981 tcttggtcac ctcggtcttt tattcgtaag gttgccagaa gatgtctgaa tttttcccag 161041 cctactgata cggtcatcaa tatgtccatc ttaacaagac taacttttta aaaattatct 161101 attctccttt atcactagta atactttggt aaggttaaaa agcctaaaac tatagtttca 161161 tggactccct tgcaaccaat taaatgattt ttccaagagt tgtggaagac aagagagagg 161221 aaagctaggc tgggcacggt ggctcacacc tgtaatccca gcactttggg agaccgaggt 161281 gggtggatca tttgaggtca ggagtttgag accagcctgg ccaacatggt gaaatcctgt 161341 ctctactaaa aatacaaaaa aattagctac gtgtggtggt gcacacctgt aatctcagct 161401 actcaggagg ctgaggcaca agaatcattt gaacccggga ggcagaggtt gcagtgagca 161461 agatcgtgcc acttatacac cagcctgggt gacagagcaa gactctgtct caaagaaaaa 161521 aaaaaaaaag gagagaggaa agcttactgt ttcagctagt aataaaggtc ctgagacctg 161581 agaatttaga gacagttaag agaaccacat tgatcactcc aggatttagt catcagatat 161641 ctagaggcag aattttagca ctggcatgga aggtaagaac aggtacttcc aggcctctgt 161701 attgcatctg tggtagttcc ttcaactccg tccactagct attcacccaa caattttata 161761 agcatccaaa ctctctgtat gaagtgtctt tcttagagtg atttctagtt cctgcactaa 161821 accctaatag taaatgtttt acaattttag tcatctcact gatatagact actgggactc 161881 ggagacttcc tctacctcac atcctaggcc actacaactt ctcccctcta agtgatggta 161941 aatacatttt aagttcttat tgattcctgc agagtagatg ctgcatacaa ggaataagga 162001 aaataccaag cataaaagga ctaaaagaat ttcaggtggt tcttgcaaaa agccaccccc 162061 tgagaactct agacaaaact aaatacatta tccccgctct cataaacaac ttacactgtt 162121 ctatttaaaa caggcctcgc ttgcctacca ttatttctgt ctgtgtctgc cactggtctg 162181 tgggtccttg gagagccaag aaatgtattc tatttatttg gctatcccca cagtctaacc 162241 caaagcttga caattaagcg tgttttcttt cttttttttt tttttgagac ggagtcttgc 162301 tcttgttgcc caggcttgag tgcaatggca cgatctcggc tccctgcaag ctccaactcc 162361 caggttcaag tgattctcct gcctcagcct cccgagtagc tgggattaca ggcgcccacc 162421 accatgccca gctaattttt gtatttttag tagagatggg gttttgccat gttggccagg 162481 ctggtctgga actcctgacc tcatgacctg cccgcctcag cctcccaaag tgctggggtt 162541 acaggaataa gccactgcgc ccggcctaaa cgtgttttct aaatatctgt caataaagat 162601 tgaaatcaaa tgatacataa tcagaatttt agagttaaaa taaccttaga aggcatccca 162661 gtcctatcta aggcactcaa agaaactaag acacaaaaat gatcaagtgg atatgtctca 162721 ggttatacca aatctatgtt tcctggtaaa caatctatgt ttcatgcata tgttcattca 162781 tttaaatatg tattaaatac ctactcaatc aaaggcaaca ctagaaatta tgccataaaa 162841 tcgccctata aaatttacag aattttagta gaggagggaa tcttcaaagt tatctactac 162901 agtctcctct tcctcttgct tattccaacc catctgatag attaaaaaac aaaaaagcaa 162961 gactcaaaga ctttaagtga tattaaagtc agttaacaac agcatagctg aaattaggac 163021 ccgtcgccta acatctcata ttgtatccta ctcacaataa catttatcaa ctttctgtaa 163081 tttcagcaaa tgctacactt tgtgctaata aagtatgaat tgcgaatgta acatcccaat 163141 ttttcttatt catattgttt tacttccaaa attccatttt ctattatctt tatcgaacac 163201 atcctatttt ttgcccacta tattcgttgc acatcagtta aggtggtgaa caggaagtat 163261 gaatgaaagg ctttggtgag gtggtatatt agtttttcca caaacaaaaa gtggtttaca 163321 gaaaatttca aaaatttact ttgaaaaata agtcacactg gtaagtatgt acttcttcat 163381 gtaaatatgc atttgtattt aataagatta caaaaatgct aaattatcac tttaggctga 163441 ataatccata gggacccaga agatcagaat tacctgtaag tgtagttctc tgtccaactg 163501 actgattcct tgggcaagtt ttgctagttg ttcagcaatt acagcttgat gaatagattg 163561 agaagtataa gtctttacat caaagtcttc gtttaaaaag tcactataac accctggatt 163621 ggggaaaaaa taaggaaaag tccttaatta ccctgtacta aaccagttat taaacaacag 163681 aacaattata atcataatta acatattttg atcccatact ctgaaccact ccactatact 163741 gcttaatgct tacttttatc atgaccctaa aggaagtata acattgagaa agagattaaa 163801 aaacataacg actggccagg cacactggct catgcctgta atcccagcac tttgggaggc 163861 cgaggtgggt ggatcacctg aggtcaggtg ttcaagacca gcctggccaa catggtgaaa 163921 cctcaaaccc catctctaca aaaatacaaa aattagctgg gcatcatggc gggtgcctgt 163981 aatcccagct actcgggagg ctgaagtgga agaatagctt gaacccggga ggcacaggct 164041 gcagtgagcc gagatcgtgc cattgcaatc cagcctgggt gacagagcaa gactttatct 164101 ctggccgggc gcggtggctc acacctgtaa tcccagcact ttgggaggcc gaggcaggtg 164161 gatcacgagg tcaggagatc aagaccatcc tggctagcac agtgaaaccc cgtctctact 164221 aaaaatataa aaaaaattag ccgggcgtgg tggtgggcgc ctgtagtccc agctactcga 164281 gaggctgagg caggagaatg gcgtgaaccc gggaggcgga gcttgcagtg agctgagatt 164341 gcgccattgc actccagcct gggcgacaga gcaagactct gtcccaaaaa aaaaaaaaaa 164401 agacttcatc tcaaaaaaaa aaaaaaaaaa aaaccataac tactagctat aactattaat 164461 ccaagaaaac aagatcattg tcagtgaaaa gacaacagga aaattgactc tatatatata 164521 tatacacata tgtaatattt atggcctcag atatctctag atcaatacac aaatactgtt 164581 atagcaaagt gaatctggaa ttttagtaac aataggaaat ataaatttat aggtgttact 164641 gaaatttagg atttatgggt gaaatatgtt aatggaaaga aaagctgttt aaaataggcc 164701 cctaatagac agaaaacaga agtaatacca gatgttaaaa agatacccct caatatagaa 164761 atccacaaat atgtggatga aagaaaaatt aagagaattg aaagctaaaa atagataccc 164821 agaagggagc tcatggtaac catgtaccaa atctcctaat gatacagtag atacaagtaa 164881 agctgccttg agagagatta tgaaacaaac aaaatagagt acttttaaga aaacctaaat 164941 tgtctacatc tgctataatg tatgctccaa gaggggaaga aggtctttgt tttgtttgct 165001 aactcttctc cagtgtctga acaataccag acacatggca ggttccatga attaactaat 165061 ctatacatca gctgatgtat cattagtatc attctggtaa caaccaacaa caaaaagtaa 165121 aggctcatac tcttgattat tcttaaaatg tagtggaatg aaagaaggaa actgctactt 165181 tggctgcttt taagtctgag aaccacattt ccttctgatt tgccaagtgc ttctcagaat 165241 taagagtcct gtcagccaga caaatgaaaa aagccattcc aagcaaaggg aaggtttagc 165301 agtggcaaat gccatggcat attttgagaa acaaggggaa tactgatttc agtgtgtcta 165361 aatcatagca catgtatatt ggacagaaat ggtaaagaat gacagatgtg gttccaaagg 165421 ttggagcgca gtttgaaaag aggtttgtaa gacatttgaa atgtttaacc atatctcatt 165481 taacccaaga aattgcacta tcagctttat gttacagaac aataatcctg gcagctgtat 165541 agaggatgat ggaaaaagga ttagtcacac agcaatgtgc tgaagttgcc tcttattggc 165601 tcttgagcac agactgttac attttcagga attctgtgag ccagttgacc tttcacacag 165661 atatcttaaa atctacgatg gtgagagtat ttacaccatg gtaatcagca aacactacaa 165721 aaatcagggc tttttctatt ttcccacaaa gccaattaac caatacaccc aagaaataga 165781 accacagagg ccagtaaagt actatgttaa cagcctaaga cataatgaag gcccgactaa 165841 gacctgaata agtggcacta agaatgtcaa agtgacaaat ttgacaaata ttcacactgc 165901 acaggatgta ggacttagtg acaagctgaa aatgggtgat taggtagaac aaacaaggat 165961 tcccaaattt ctggattaag caactgatac aggaaattta gttttacact aacttagacc 166021 actagacact gaatatacca agatgaacat aaacttctgc ctccaaggaa ttcataattt 166081 acaaggggaa caagcacaca tttacctaac aacatagtaa acacagataa cagaaactac 166141 cagaggaaaa gattaggaga gaagaacaga aaaagtgaat tcatttcatt ttttcatatg 166201 taaaccttgt actatttgtt catatgtagg ttttcatgtg tattatagaa ctagcactgc 166261 ttgtttacta tcaaaataga aatgtccata caggttatgg aatggcatat cataaaaaaa 166321 ttaagataga aatgtccagt aggcagttaa aaatatagtt ttataattca gaagacaggt 166381 aaggagcata ctgaaaccac agacaagacc agatgattta ggaaaaaaaa gttttagaat 166441 gcacaaaaag gcaggaaaat gaggaagata ggaaagtggg gggtggacag agaaaatgaa 166501 agcaatcaaa gagactaaag agtgctcagg gagaaagaga aacaaaaaag aatggcaaaa 166561 gccaaaagaa gggataaaga aagtaagtat tagtattgtc atagagctta caaagatcaa 166621 acacaacggg atgtggcatt tgagggctga cagcctttaa agagatgaat tatataatgc 166681 ttctctttta tagtttcctc attcccatag agggcatatc agaaacacct caccaatttt 166741 catactatgc atctgtgctc accaagccta tcaatccgat actctgtcat tggccaaatc 166801 cccgatatag tgaatcaaag tttctgctgg aaatgcttaa ttcctcagga atgttggggg 166861 tggaggggca acagagaccg cttaattcag aacaaactga actgacaagt atctacagga 166921 acctggaaaa gagcacctag agaattaaga atttttaaaa agcacggcac ccataaaccg 166981 catgaggaaa gtatttcaag aaaggaatca aatgttatta caagttaaag gaccaaaaaa 167041 gtgtccattt gtcgtatgtg caagaaagtc attgaaaacc ttgcaaagtt acttcaatgg 167101 catagattga gggtgactga gtccccagga aacaggaaga aaaaatccag agcaaagtag 167161 gaggcactat ccttagaacg gagcagagac tcggccgggc gtggtggctc acgcctgtaa 167221 tcccagcact ctgggaggcc aaggcaggca gatcacgagg tcaggagatc gagaccatcc 167281 tggctaacac ggtgaaaccc cgtctctact aaaaatacaa aaaattagct aggcatagtg 167341 atgggcgctt gtagtcccag cttctcagga ggctgagcca ggagaatggc gtgaaccccg 167401 gaggcagagc ttgcagtgag ccgagactgc gccactgcac tccagcctgg gcaacagcgc 167461 gagactccgt ctcaaaaaaa aaaaagaaag aaagaaagaa aggagcagag gctccacttc 167521 cgttgcaatt gaaaaggcaa gggaaaaaag gtgcaacaga tggttatacg ttgaggttaa 167581 attcataggt ttgatgggca agaaatggag aaagttcaca tctgatggat cctctttttc 167641 tccccaggga agcttaaagt catttgttga gcttgaagcg aaaggtttga aaagggtaaa 167701 gaagatttaa acaatcactg tggagaatgg cagagaacca aggaggaaaa tacagaaggc 167761 agaactgaag agtatttacc gtccaagaac tgattacaat tcctgaaata catcctactg 167821 aaacatatat ccaatgttcc actgttcacc tggggaagtc ctgtctacag tttaaaacgt 167881 ctccacctag aaagcctttt ctgacatttc ctaaccagtt tccaaccccc agttatcatt 167941 ctcctctgta ttaatgctct actttatatg caatcctact gctatcttac atcactatat 168001 tacagcacct acctttatag tctacaagat tatttcctct actagatggt gaactacttg 168061 agggcaaaaa ttgcacgtgt atcaggagca actgcaaata tatcagggtc tctgccttgt 168121 atctaaagtt taaattcagt ttagccaagt tttagacaga gacccggtcc tagagtttcc 168181 aacatgaaat agttcagaaa acatcattgc ttttgacaaa agtacataaa ttatcatgaa 168241 aaggcacgta attcaggatg acaataaaat atttaacagt cagagaaaat aaacaataat 168301 ttggaacttt atttcattta ttacccactt cggtcttcaa atgtgtgcat gtgcttctaa 168361 ctgcgatgat gaatttaaaa aacattctgc acctactgta tgcaagatac tgctattcac 168421 actgtgggat attaaaaaaa gtttgcaata cagaagagat taaaacaagt atacaaatga 168481 ttttagtaca aggtagaatg cgatggccac tagagaaaaa attaagaaca ggtgagaagg 168541 gctttatttt tatgcttata aatgtgaact ttattctgaa ggataagaga ttcgggacat 168601 ggtaactgga ttgacacaac tgaatatgct tttcaagtat attatatcta acttttctta 168661 ttttgggcaa cttctctcca aactgactag caataaaatc tgactgctag attctgtaca 168721 aatggttaat cccaacctga acaaatgtct caaaccctcc cccacgtgtc actgtagata 168781 actgcctcat aaaggaggaa aaaaaaaacc agcctgtaag aggggtcacc tgtacagtcc 168841 atttgttaaa ggagtaagcg tttccagctc tactttaaaa caaggcccta tttcattatg 168901 cgtctgccgt ctgttctaga gcttacactt ttgaagagct cgaacgctac accccttcgt 168961 caagaactgc ggcagcagcc ggtggctgcc aacgcggtcc acagggtttg ggctcccgaa 169021 acttcaggga agctggaggc atgggggggg ggggggtcga gttgaaatga ggaacacaca 169081 gaaaccaagt gtccccaaga ctccagaaaa gagggagcca gtcccagagt aaataggttg 169141 gctagaacag tacccaagcc agggggccgg gcggagggga gtggtcacgt ccagactgga 169201 aaactttctg caaagcaacg ggttgggtcc acctcgctgt ccaggtgcgg agcagcgcag 169261 acccccaacc ccacgacctg gtcagacccc gtctccttac cgtcctgcag aagttcccgg 169321 actgtagctg cagccgctcc agagcctcga gctccgaggc cagctacagc gacgctgccg 169381 ccgccacctt ccatgttggc aggtgccggg ttgatgtcgt cagcagcaga acggctccgc 169441 ccaggtggtg acgcagaatc ccggcgccgc ccgcccaccc agcccatggc tccaggccca 169501 cctggcgaac tgactctcag cccgcgcctg ggctaagcct ggctaggagc cgcgcaggta 169561 ctcgagcagt gggcgcccag ggtccgagtg ctctgcgccc agcgcaccga gggagccaag 169621 gccgtcgggc cggcgctttc agctgtctct cgcagcagct cagggccgcg ccccctgggc 169681 tggcgtcgtg ccctagccaa ggagaaggtg ggcaccggaa cgtggccgca gcgcttatct 169741 gtgaagcaag tattaaacct cagaacaggg gccgcgcggc gtaaggcagg agagcagagc 169801 ggaggtttca gggagcttcg tttgccctcc acactcacaa gagtcacgcg ttagttgcta 169861 cacagcttta tctgttccac gccttgattg gcactcctgg ggtaataaaa gcccgcagga 169921 aatgcttggg aagaaaagtt gctggggatg aacagtgaag agaattaggg agagtcgatg 169981 aagaaggaag tctgaaccat aaaaggacca ataaaactca ctgtctctac cctccttcct 170041 tcccatgaga cctttgacct tttgcctttt ctacttcttc ctcagtttca gtcacgtgcc 170101 agactttttc ttaattacaa actgatagat ggtccacatc tcaactaaca gtattgacgc 170161 tgtaagagca tcaggttggt tcccattgta tcttctttac taatacgttc gcgtttttga 170221 cctagatttc tttcttctaa gataaatact gataaattaa cagggtcagg caaccacaag 170281 tatgcaagac agtcttaagc ttcttcgcaa aaagaagctg gaagatgact ttttcatttt 170341 gtacagtagg tatgtatcca ggaaaccaat tttatccagt tccagagcta cgtggttttc 170401 tacattctgt ttcaaaaagg tgctattact tatcctgtgc ttatcgctgt tctaagtatt 170461 ttataactca atgcatttca tactcataac acctcaggta agcaagtctt atgctacact 170521 gcattatgct ctctttgtct cagaggattt tccctagtat gtaagatctt agtcatgatg 170581 ggaatagggg tgattcattc attacttgtg aattaggaat tgtttttcct tccttggctt 170641 tattcccccc agtggaagat tttagaagag tgagtaatat tcattcatac agcgaaagtt 170701 ttgacccctc tactctgcat gctctcatga gtttacaaac ataagactca atccatattc 170761 taaaaggagt taataggtaa ggtaggaacc catttaaaca aacaattgta ccaggtgctt 170821 tggaatcttt ggatttactt attccagtaa ttccaagctt agctgcacat tgaaatcacc 170881 tgaagagctt taataaaaac actgatgcct gcatcctgag cccagacgtt tggatttaat 170941 tggtctaggg tatggcttgg gcatgctaat tcttaaaagc tcctcgggtc attgtagtgt 171001 acaactaagg tagaaagcta ctgattgttt tgttttgttg agttggggtc tagctctgtt 171061 acccaggctg gagtgcagtg gcaggatcgt ggctcactgc agcctcaacc tcctgggctc 171121 aagggatcct cctgcccaag cttctggagt agttgggatc acagtcgctt gctaccacgc 171181 ccagctgatt tttataattt ttttagagat gggggtctca ctatattgcc caagctggtc 171241 tcaaactcct ggccttaagc aagcctccca ccttggcctc ccaaaatgct gagtttacag 171301 gtgtgagccc agtcccctga aacctactga tttattcttt cagcaaataa atattgaggc 171361 cttctgtatg ccatttacta tgactgcagg agtggataaa aatgaagtaa ttcttgctaa 171421 aatcactaat gaccttgttg ccattgcgct ttatttttgc cttactttga tgtttaagat 171481 acctatcccc tccttccttc ataatggtct ctctcgtttt ctgagtatca ctctttaggt 171541 ttattcctcc cagctccaaa gttcatcttc agtctctttt gctctcccct tttcctctgt 171601 tacttaaata ttggtgcttc cagcggtttc ttgcaagatc ctgacatcct taagtacctt 171661 ggctattttt aatttcaact ccttgcctaa aggttaaaga ctctaaagtt tgagcaatta 171721 gaccattcta aagttggccc atatctgatt cttttttttg ttgttgttat tttctgcttc 171781 atttttcctt catacatacc tgtttctaag ttactgaggc caaggactgg aatatgaaaa 171841 gtagccaaca ctgtgaaata gtaggatttt aagtatagca aatgtaaaac atttacaaac 171901 tccataattt catcactcgt acctgtacct cttggggtat aaaagtgatt tacaatgcaa 171961 ttgtagaatt ataattgaaa ttttcactag ataatatcac attaaaattt ttaaaaataa 172021 ttaacgtttg tttatcttct gattcacaga cagaacagga ttcaagaatt acaataaaag 172081 caccagcata tcttttgaaa cactgactac ttaagagttg ctctattaat gccatttaac 172141 tttcctgaat aaaacaagat atttccaaat gccagtcgag tatcatatat aaaataattc 172201 aaaccaaatt tcttggctta ccaatcccaa caaagtaaca tgtaaaaaat ttttaaataa 172261 catgaaaact tacaacatgg taaactaggg aaaaaaaaaa aatctcgttt tctgcatctt 172321 gtgtttccac aactatcttg ggtagttttc cacatctatt tatcttacaa atagaggcag 172381 cattctatag tatcatattg gatactgtga aaataagcta ttaataaaaa aatttctgag 172441 cccacaagcc tgataaaaaa tatatgttgt tggcagttga aattgttaga atggaattaa 172501 tatagtgaaa acatattttc tctatacaag ctt // LOCUS AC002389 46275 bp DNA PRI 28-JUL-1997 DEFINITION Human DNA from chromosome 19 specific cosmid R28461, genomic sequence, complete sequence. ACCESSION AC002389 NID g2282012 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 46275) AUTHORS Lamerdin,J.E., McCready,P.M., Adamson,A.W., Burkhart-Schultz,K., Garcia,E., Kyle,A., Ramirez,M., Stilwagen,S., Garnes,J., Danganan,L., Bruce,R., Quan,G., Montgomery,M., Ow,D., Kobayashi,A., Olsen,A.O. and Carrano,A.V. TITLE Sequence analysis of a 1Mb region in 19q13.1 JOURNAL Unpublished REFERENCE 2 (bases 1 to 46275) AUTHORS Lamerdin,J.E. TITLE Direct Submission JOURNAL Submitted (28-JUL-1997) Human Genome Center, Lawrence Livermore National Laboratory, 7000 East Ave., Livermore, CA 94551, USA FEATURES Location/Qualifiers source 1..46275 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="R28461" /chromosome="19" /map="19q13.1 between D19S208 and CAPNS" /map="overlaps F21246 to the left and F14121 to the right" /map="oriented from centromere to telomere" /cell_line="5HL2-B" /clone_lib="LL19NC03 chromosome 19-specific cosmid library" /note="LL19NCO3 library constructed at LLNL from flow-sorted chromosomes from hybrid 5HL2-B, which carries chromosome 19 as its only human chromosome" misc_feature complement(120..164) /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: excellent, score: 76.000" misc_feature complement(935..1117) /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: excellent, score: 100.000" misc_feature complement(1287..1400) /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: good, score: 52.000" repeat_region complement(1523..1812) /rpt_family="Alu" misc_feature complement(1945..2001) /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: good, score: 74.000" misc_feature complement(2574..3206) /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: good, score: 73.000" misc_feature complement(3435..3460) /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: marginal, score: 45.000" repeat_region complement(4601..5178) /rpt_family="Alu" repeat_region complement(5273..5564) /rpt_family="Alu" repeat_region complement(5976..6038) /rpt_family="MIR" repeat_region 6116..6391 /rpt_family="Alu" repeat_region 6665..6920 /rpt_family="Alu" repeat_region complement(7248..7567) /rpt_family="Alu" repeat_region 7841..7972 /rpt_family="MIR" repeat_region 8079..8362 /rpt_family="Alu" repeat_region 8426..8699 /rpt_family="Alu" repeat_region 9617..9883 /rpt_family="Alu" repeat_region 10170..10450 /rpt_family="Alu" misc_feature 10958..11093 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: good, score: 71.000" repeat_region 11262..11390 /rpt_family="Alu" repeat_region complement(11389..11683) /rpt_family="Alu" repeat_region 11686..12022 /rpt_family="Alu" repeat_region 12358..12561 /rpt_family="MIR" repeat_region 13438..13706 /rpt_family="Alu" misc_feature 15018..15043 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: good, score: 71.000" repeat_region complement(15218..15497) /rpt_family="Alu" misc_feature 16307..17764 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: excellent, score: 89.000" repeat_region complement(19868..20134) /rpt_family="Alu" repeat_region complement(20319..20592) /rpt_family="Alu" repeat_region 21294..21600 /rpt_family="Alu" repeat_region complement(21699..21955) /rpt_family="Alu" repeat_region 22479..22762 /rpt_family="Alu" CDS join(23053..23119,26338..26515,27832..27928,28102..28208, 31844..31934,32015..32133,32443..32551,32865..33016, 33190..33352,34434..34531,34623..34695) /note="glyceraldehyde-3-phosphate dehydrogenase-like; BLASTX similarity to PIR I49681 (1..440) [Mus Musculus] GLYCERALDEHYDE 3-PHOSPHATE DEHYDROGENASE Pval= 7.3e-219; 89% identity" /codon_start=1 /product="GAPDH-2 like" /db_xref="PID:g2282013" /translation="MSKRDIVLTNVTVVQLLRQPCPVTRAPPPPEPKAEVEPQPQPEP TPVREEIKPPPPPLPPHPATPPPKMVSVARELTVGINGFGRIGRLVLRACMEKGVKVV AVNDPFIDPEYMVYMFKYDSTHGRYKGSVEFRNGQLVVDNHEISVYQCKEPKQIPWRA VGSPYVVESTGVYLSIQAASDHISAGAQRVVISAPSPDAPMFVMGVNENDYNPGSMNI VSVRAHLGCFSNASCTTNCLAPLAKVIHERFGIVEGLMTTVHSYTATQKTVDGPSRKA WRDGRGAHQNIIPASTGAAKAVTKVIPELKGKLTGMAFRVPTPDVSVVDLTCRLAQPA PYSAIKEAVKAAAKGPMAGILAYTEDEVVSTDFLGDTHSSIFDAKAGIALNDNFVKLI SWYDNEYGYSHRVVDLLRYMFSRDK" repeat_region complement(24910..25211) /rpt_family="Alu" repeat_region complement(25298..25554) /rpt_family="Alu" repeat_region complement(25705..25783) /rpt_family="MIR" repeat_region 25834..26113 /rpt_family="Alu" repeat_region 27144..27604 /rpt_family="Alu" repeat_region complement(28446..28710) /rpt_family="Alu" repeat_region 29556..29648 /rpt_family="MIR" repeat_region 30401..30679 /rpt_family="Alu" repeat_region 30715..30986 /rpt_family="Alu" misc_feature 35203..35252 /note="BLASTN similarity to AA453931 (43..92); match: 0.94, score: 7.6e-94; database searched: month.na; zx32g04.r1 Soares total fetus Nb2HF8 9w Homo sapiens cDNA clone 788214 5' similar to WP:ZK418.5 CE00807 ," misc_feature 35242..35366 /note="BLASTN similarity to AA453931 (81..205); match: 0.96, score: 7.6e-94; database searched: month.na; zx32g04.r1 Soares total fetus Nb2HF8 9w Homo sapiens cDNA clone 788214 5' similar to WP:ZK418.5 CE00807 ," misc_feature 35265..35341 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: good, score: 65.000" misc_feature 35351..35385 /note="BLASTN similarity to AA453931 (189..223); match: 0.77, score: 3.7e-55; database searched: month.na; zx32g04.r1 Soares total fetus Nb2HF8 9w Homo sapiens cDNA clone 788214 5' similar to WP:ZK418.5 CE00807 ," misc_feature 35362..35485 /note="BLASTN similarity to AA453931 (198..321); match: 0.91, score: 7.6e-94; database searched: month.na; zx32g04.r1 Soares total fetus Nb2HF8 9w Homo sapiens cDNA clone 788214 5' similar to WP:ZK418.5 CE00807 ," misc_feature 35413..35482 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: excellent, score: 100.000" misc_feature 36049..36113 /note="BLASTN similarity to AA468166 (41..105); match: 0.98, score: 3.2e-125; database searched: month.na; nc73b09.r1 NCI_CGAP_Pr2 Homo sapiens cDNA clone 782969 similar to WP:ZK418.5 CE00807 ," misc_feature 36049..36113 /note="BLASTN similarity to AA453931 (317..381); match: 0.98, score: 6.2e-35; database searched: month.na; zx32g04.r1 Soares total fetus Nb2HF8 9w Homo sapiens cDNA clone 788214 5' similar to WP:ZK418.5 CE00807 ," misc_feature 36051..36110 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: good, score: 54.000" misc_feature 36196..36341 /note="BLASTN similarity to AA468166 (102..247); match: 0.97, score: 3.2e-125; database searched: month.na; nc73b09.r1 NCI_CGAP_Pr2 Homo sapiens cDNA clone 782969 similar to WP:ZK418.5 CE00807 ," misc_feature 36196..36256 /note="BLASTN similarity to AA453931 (378..438); match: 1, score: 6.2e-35; database searched: month.na; zx32g04.r1 Soares total fetus Nb2HF8 9w Homo sapiens cDNA clone 788214 5' similar to WP:ZK418.5 CE00807 ," misc_feature 36197..36333 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: excellent, score: 97.000" misc_feature 36215..36246 /note="BLASTN similarity to T50919 (9..40); match: 0.87, score: 2.6e-71; database searched: est; yb31h02.r1 Homo sapiens cDNA clone 72819 5'." misc_feature 36240..36302 /note="BLASTN similarity to T50919 (33..95); match: 1, score: 2.6e-71; database searched: est; yb31h02.r1 Homo sapiens cDNA clone 72819 5'." misc_feature 36307..36420 /note="BLASTN similarity to T50919 (98..211); match: 0.78, score: 2.6e-71; database searched: est; yb31h02.r1 Homo sapiens cDNA clone 72819 5'." misc_feature 36419..36471 /note="BLASTN similarity to T50919 (211..263); match: 1, score: 2.6e-71; database searched: est; yb31h02.r1 Homo sapiens cDNA clone 72819 5'." misc_feature 36467..36556 /note="BLASTN similarity to AA468166 (237..326); match: 0.97, score: 3.2e-125; database searched: month.na; nc73b09.r1 NCI_CGAP_Pr2 Homo sapiens cDNA clone 782969 similar to WP:ZK418.5 CE00807 ," misc_feature 36471..36499 /note="BLASTN similarity to T50919 (264..292); match: 0.82, score: 2.6e-71; database searched: est; yb31h02.r1 Homo sapiens cDNA clone 72819 5'." misc_feature complement(36481..36556) /note="BLASTN similarity to AA453425 (329..404); match: 0.92, score: 4.9e-145; database searched: month.na; zx32g04.s1 Soares total fetus Nb2HF8 9w Homo sapiens cDNA clone 788214 3' similar to WP:ZK418.5 CE00807 ," misc_feature 36643..36704 /note="BLASTN similarity to AA468166 (324..385); match: 1, score: 3.2e-125; database searched: month.na; nc73b09.r1 NCI_CGAP_Pr2 Homo sapiens cDNA clone 782969 similar to WP:ZK418.5 CE00807 ," misc_feature complement(36643..36766) /note="BLASTN similarity to AA453425 (208..331); match: 1, score: 4.9e-145; database searched: month.na; zx32g04.s1 Soares total fetus Nb2HF8 9w Homo sapiens cDNA clone 788214 3' similar to WP:ZK418.5 CE00807 ," misc_feature 36677..36765 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: excellent, score: 84.000" misc_feature 36692..36766 /note="BLASTN similarity to D45521 (1..75); match: 0.98, score: 3.5e-101; database searched: est; Human adult lung 3'directed MboI cDNA, HUMGS02699, clone lg2367. >gb|G21435|G21435 human STS WI-16652." misc_feature 36725..36972 /note="BLASTN similarity to AA476161 (23..270); match: 0.83, score: 8.1e-80; database searched: month.na; vh21h05.r1 Soares mouse mammary gland NbMMG Mus musculus cDNA clone 876153 5'" misc_feature complement(36738..36766) /note="BLASTN similarity to Z39690 (203..231); match: 0.96, score: 2.8e-81; database searched: est; H. sapiens partial cDNA sequence" misc_feature complement(36745..36761) /note="BLASTN similarity to T50759 (277..293); match: 1, score: 9.4e-83; database searched: est; yb31h01.s1 Homo sapiens cDNA clone 72817 3'." misc_feature complement(36749..36798) /note="BLASTN similarity to T50759 (239..288); match: 0.86, score: 3.6e-91; database searched: est; yb31h01.s1 Homo sapiens cDNA clone 72817 3'." misc_feature complement(36806..36875) /note="BLASTN similarity to T50759 (160..229); match: 0.94, score: 3.6e-91; database searched: est; yb31h01.s1 Homo sapiens cDNA clone 72817 3'." misc_feature complement(36849..37051) /note="BLASTN similarity to Z39690 (1..203); match: 0.99, score: 2.8e-81; database searched: est; H. sapiens partial cDNA sequence" misc_feature 36849..37051 /note="BLASTN similarity to D45521 (75..277); match: 0.99, score: 3.5e-101; database searched: est; Human adult lung 3'directed MboI cDNA, HUMGS02699, clone lg2367. >gb|G21435|G21435 human STS WI-16652." misc_feature 36849..36972 /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: good, score: 73.000" misc_feature complement(36849..37051) /note="BLASTN similarity to AA453425 (6..208); match: 0.99, score: 4.9e-145; database searched: month.na; zx32g04.s1 Soares total fetus Nb2HF8 9w Homo sapiens cDNA clone 788214 3' similar to WP:ZK418.5 CE00807 ," misc_feature complement(36872..37033) /note="BLASTN similarity to T50759 (1..162); match: 0.99, score: 3.6e-91; database searched: est; yb31h01.s1 Homo sapiens cDNA clone 72817 3'." misc_feature 36973..37051 /note="BLASTN similarity to AA476161 (270..348); match: 0.78, score: 8.1e-80; database searched: month.na; vh21h05.r1 Soares mouse mammary gland NbMMG Mus musculus cDNA clone 876153 5'" misc_feature complement(39866..40384) /note="BLASTN similarity to B16457 (1..519); match: 0.99, score: 1.7e-204; database searched: month.na; 342E17.TVE CIT978SKA1 Homo sapiens genomic clone 342E17" gene complement(40140..>45337) /note="GASTRIC H+/K+ ATPASE ALPHA SUBUNIT" /gene="ATP4A" /map="19q13.1" CDS complement(join(40140..40168,40359..40450,40535..40636, 40972..41105,42562..42707,43231..43354,44447..44601, 44691..44859,44965..45115,45201..>45337)) /gene="ATP4A" /note="POTASSIUM-TRANSPORTING ATPASE ALPHA CHAIN (PROTON PUMP)" /codon_start=1 /product="ATHA_HUMAN (partial)" /db_xref="PID:g2282014" /translation="VIMVTGDHPITAKAIAASVGIISEGSETVEDIAARLRVPVDQVN RKDARACVINGMQLKDMDPSELVEALRTHPEMVFARTSPQQKLVIVESCQRLGAIVAV TGDGVNDSPALKKADIGVAMGIAGSDAAKNAADMILLDDNFASIVTGVEQGRLIFDNL KKSIAYTLTKNIPELTPYLIYITVSVPLPLGCITILFIELCTDIFPSVSLAYEKAESD IMHLRPRNPKRDRLVNEPLAAYSYFQIGAIQSFAGFTDYFTAMAQEGWFPLLCVGLRA QWEDHHLQDLQDSYGQEWTFGQRLYQQYTCYTVFFISIEVCQIADVLIRKTRRLSAFQ QGFFRNKILVIAIVFQVCIGCFLCYCPGMPNIFNFMPIRFQWWLVPLPYGILIFVYDE IRKLGVRCCPGSWWDQELYY" repeat_region complement(41580..41727) /rpt_family="Alu" repeat_region complement(41846..41987) /rpt_family="MER5" repeat_region complement(41986..42132) /rpt_family="MIR" misc_feature complement(44962..45098) /gene="ATP4A" /note="BLASTN similarity to AA473622 (11..147); match: 0.69, score: 1.3e-59; database searched: month.na; vg78g12.r1 Barstead MPLRB1 Mus musculus cDNA clone 872134 5' similar to gb:J05096_rna1 SODIUM/POTASSIUM-TRANSPORTING ATPASE" BASE COUNT 10705 a 12764 c 12560 g 10246 t ORIGIN 1 ggagaacctg agctgctgcc caccgcaggc cgggggtggg aacccctgga gcccggccgc 61 gcagtagccg cgatcctgcc accagggggc agcaagtgga agggacccta gagccctcac 121 ctgggagcgg ccgtagctgc cttggtcacc attcccagaa ctgccctaga tgggggagag 181 gaggatcata cggggagcgg cgcatgagag cagctcaggt tattcggctt attgtgtcca 241 cgcacctcct cctgtctgac acttttcccg acccccgagg aagccagatg tggctccctc 301 tccagctcga ctccaagaca cgggatggga gaggccgtca gccaaggggt tggacaggcc 361 attgaccagg aggctggaga gactgccggc tctggagtca gtgaggcccg gggccctggg 421 gtggcagctg cggggtcaca gcgccccttc ctgccctcac acccatcgtg caggggcccc 481 atgtgtgaat gtcgtctgca ccgtcgctct gtcagcgtcc gtaggtctcc cggatcacag 541 tcctgccccc attctcctgg gagcttccca gacaagtgtc atattctccc atgtccccct 601 ctttctccgc caccccacgg tctcctgaga cttgccaagt ggacttgact gcaagagctt 661 agacaggcgg gataacgtcc ccaccatcac agagtggaaa atagggatac agagtcagaa 721 gcagtggggt ggctctggca gaaacgatag tgcgcgcatc aggggcacag catgggtggg 781 gaacaggagg ctgagggagc ctcagggtga cgactatcag cgaggccgcc catcctcggg 841 cagcggcagc tttcagagaa acttggagcc ctctcccggc agggaccact agggcctcac 901 cccatctcag ccttccacag aggtgccaaa ctcacccagg aggactcact gccgctgtca 961 cctctgctgc caccactgtt gccactgctg ccaccactgc tgccgccact gctgccgcca 1021 ctgctgctgc cactgctgct gccaccactg ctgctgccat tgttgttgtc accattgctg 1081 ccactgccac tgctgcccga ctgtgagccg ctgcctccct gaggggcagg aagggagcag 1141 ggctgggatt aaagacagag caccaagacg ggcccctcct ccctccctct ccacccaggc 1201 catattccat gatgcccagt ctaaggggag cagaacttct ccatttcctc ctggtgcaca 1261 actcctgggt tgcccctctc cactcacccc agagttgctg gagcctccac ctgagccaga 1321 tggtggggga ttcgtgcact gtcgagggaa agggatggtg agtttgggga cggtgagttt 1381 ggagacgttg gcctcggcca tggacacagg ccaggcctct cccattcttt ggggcttggc 1441 ttcctttctc ctaatgtgtt tctcccacct ccccttccca tctgcctcct ccccaggccc 1501 aggcccgtct cttcctcctt tttttttttt tttttttttt gagacagggt ctcactctgt 1561 tgctcaggct ggagtgcagt gatgcgatct cggctcactg caacctccat ctcccaggtt 1621 caagtgattc tcctgcctcc gcctcccaag tggctgcaat tacaggcgca tgccactatg 1681 accggctgat ttttgtattt ttagtgaaga cagggtttca ccatgttgac caggctggtc 1741 ttgaactcct gacctcaggt gatccacgtg ccttggcctc ccaaagtgct gggattacag 1801 gtgtgagcca cctcgcccag ccccatctct tcctctcacc ccaatccctc cactccccca 1861 tcttgctttc ctctttgtct ttccccagct ctctgtagcc tgcacccagt ggcttcttca 1921 tttgttccca gccccttcct cttacccctt cattctggtt gctggctctc actgaaccat 1981 agccaggctg ggccacagct ccctgcagag agggtgagac tgagagtagg atccagaggg 2041 accagggagg cacgcggagc atgtgggctt ggagggaggg aggtggcagg tagcaggtct 2101 gggggtgact ttacctgagt gttggtccca aagtttggtg gccctccatt gcctccttga 2161 ccccagggag ctccctgagg attcattcca aagctgcctg ctgagtttcc ggggtatccg 2221 tggacccacg gagtccccag acctccagga ttgccctggc cctggcctcc aaggccacct 2281 tgagagccaa agatgccatg gcctccagaa gtttcctgca agaacaacat acctgcactt 2341 agtgggacca tctctgaccc tggtagagaa gagccagggc aaatcataaa gagaccagga 2401 gagagacagg cagagggact ccagggtggc ataggagggg ggcagaacat agaaggaagc 2461 tgccccagaa gcctgcaagt aagagatcca tccatccttt caaccgtttc ccaagaatct 2521 cagcccagca gcccacagcc tcccatttcc acatgcagcc ccacagccac tcacccaagc 2581 accattgtgg ccaggcaccc cctgccagga gccgcggaca gcatctgctc cgtgtcgaat 2641 gacatcttct gcctgtctgc caatctcgtg cccagtgttt cccagagcat gggctgcttc 2701 cccgaccctg ttgcccaaag catctgctac gccaaagcct ggaacctgcc tgactccagt 2761 gccaactgct tctctggtcc cttggccaag ggcctcactg actttagagc cagctgcccc 2821 tccggcctct ttgccaatgg cctttcccac cccttcgctc agggcgtctc ccaggccatg 2881 tccaagggcc tccccaatat ttgtcccagt gctttcctct ccgctctgca gggggccagc 2941 ctccccactg cccaggcaga gggccagcag gaggcaggcc aggggcccct ggaacttcat 3001 ctctgcccag ccccctctct ctccagagtg tcttcctccc accagggtct cctccttgcc 3061 gcccttgctc tgcgtctctg tgccctcctc tgtcctcctc cttccgactc cctgtcctcc 3121 ctccctctgg gtctgcagcc ttctctctcc tgtcctcaac tgcctgggct tctcagagtc 3181 cacctccctg tcactactcc ttatactcta ggttgggaga aggtgacagg gctgggcaac 3241 ctgaggtccc tccccgcagt gactcagggc ccagccacca atagggtaat cagctgttat 3301 tgtgagtctg agagtctctg tcgaaacaga gagtgagccg catcagggag ggagggagaa 3361 atggatgagg tgatgcaaaa gtgctcccat ttcccaaccc caggcattcg gggagctggg 3421 tgggggcagg ggcatctaag gagacaggtt gtcgttggct cctggaggcg gtggggtggg 3481 ggcagactta gaagagagga gttactggca ggaccccacc aaaggagtaa ggacagggaa 3541 gccccagggg acaggaccca cccaggctca gccagcctgg ctcaggggct ccagagtgac 3601 aggcattgcc cgtcctcggc ctggcctttc tcggtttcag cacctcagcc tctcctctcc 3661 ctctcccctc ccgcattgcc acctacaggt gactgccttc ttgttcctcc acaccttcca 3721 gcccacagag agagactggc aaacacgggg ctgcgtgtca gtttctgaga ttctggggag 3781 agcccccggg gggaccctcg gtgtcgggga aggaggtggc catagccctt tgagtggtca 3841 gcagggtgtc tcagcgttca ggctgctccc ggctgagaaa gctgggcgcg gagcagggca 3901 gccaggcgcg ccagccaccc ccagggaacg agtgggaggg aaggccaagg gagggagccg 3961 ctgacccagg cacccatcct gtttacgcag ccctagaggg gcacgtgtga gtataagcgc 4021 ctatgagggg cgggactggg ccagctcacc ccacccatcc cccaggccag gggccagggc 4081 agcaatggca agcgagaaga gggtagcgca aggtcaggag ccatccgtga gagccacggc 4141 cctctcccca gtcccagctc gggatacaga acctgggagc ctccccccgc ctcacccctg 4201 cccagcaagt gtggctgctt ctctattccc aggggccaca gggcctccca ctgtttcgta 4261 tcactggggt tcatcgccca gttaggaggg gccagtttcc atcagccttt ctcaatgtct 4321 ctcagtcagt ctctctcttc tctccacccc aatatccgca ccccagctta taacatccaa 4381 cacttcttac cttgtaaaca ctatccccat ctgtcctcct cagctcctac ttccctgtta 4441 ttccatctct gactcgttaa tgatgataat agcgatggtg gtaataataa taataataat 4501 aataataatt acagctccag cttatcagcg gttctgtatg tacccagcat tggactaagc 4561 acaagcagtt taagaattat acaactctgg gagatggttc tttttttcta gagacagggt 4621 cttgcccagg ctggagtaca ctggcacaat tgtagctcac tgcagcatcg acctcctggg 4681 ctcaagcgat cctcctgtct cagtctccca agtagctggg actatagatg cacaccagca 4741 cacccagcta attttttttt tctttttgag acagagtctt actctgttgc ccaggctgaa 4801 gtgcaatggc acgatcttgg ctcactgcaa cctccgcctc ccaggttcaa gtgattatcc 4861 tgcctcagcc tcccaagtag ctgggattac aggcgcatgc tgggcctgta atcccacgct 4921 gggctaattt ttgtgttttt agtagagaca gggtttcacc atgttggcca ggctggtcgt 4981 gaactcctga cctcgtgatc tgcctgcctc ggcctcccaa agtgctggga ttacaggcgt 5041 gagccaccgc acctagcctc cagctaattt tttaattttt atttttgtgg agatgagatc 5101 ttgttgtgtt gcccaggctg atctcaaacc cctgggctca tacgatcctc ctgccttagc 5161 ctcccaaagt gctgggatga caggcatgag caaacagtgg tattacatta tgcttgctgc 5221 agctcctgca ctctctttgt tttttgtttg tttgtttgtt ttgtttcgag atggagtctc 5281 cctctgttgc ccaggctgga gtgcaatggc gcaatctcgg ctcactgcaa cctccacctc 5341 cagagttcaa gcaattctcc ttcctcagcc tcccaaatag ctgggattgc agatgcacac 5401 caccatgccc agttaatttt tgtattttta gtagagacgg agtttgacca tgttggacaa 5461 gctggtctcg aactcctgac ctcaagtgat ctgcctgcct tggcctccca aagcgctggg 5521 attacaggca tgagccacca cgcccggcca tgtgcccttt ttgttaagaa ggaagattag 5581 caagtatgaa agaggtgttc tgactcacca gcacagctgg acatggggtc tttctcaaag 5641 gtaggtccag gcctcaagaa aataacacta ataataaatt ttttcaaaaa ctaaaatgaa 5701 gtctttctct ttaacatagt cagagagctt aaaagaaaat tgagaagaaa gtaagtttct 5761 ccctgtgtga ttcagttatt caaagccgtg tccatggatc acctcaattc tcccatgcca 5821 ccaacaggcc tgtgttctac ctgctaggaa aaaattgccc ctcagacatc gctgggcctg 5881 aggagccacc tcagatcagc cctagggcat ccttattaaa taactagtgt gaaccaaata 5941 aaatattagg tgctttcaca tacatgatct cattgaatcc tcaatatgac cacaagaggt 6001 taatattatt cccattatac agatgaggta actgaggccc aagtggctca gcatcacaca 6061 caaaatcata tgtgaaaaga ctgtggtaaa atagagttgt caaaacttga gccaggcact 6121 ttgggaggct aaggcaggtg gatcatctga ggtcaggagt tcaagaccag tctggccaac 6181 atggtgaaaa ccccctctct actaaaaata caaaaattag ccgaacgtgg tgacaggcac 6241 ctataatccc agctactcag gaggctgaag caggaaaatt gtttgaacct ggaaagtgga 6301 ggttgcagtg agccgagatt gcaccattgc actccagcct ggacaacaga gcgaaactcc 6361 gtctcaaaac aacaacaaca acaaaaaaaa acttgagcca ggtcagtctg actccaaaca 6421 gctttctctc ctccatcacc atgccaccca tgttcttatg cgtgatggca cccacaaatc 6481 tgatatgcct atgtcatgtc caaggccctc tggtctcagc tcttctctct aattcacatc 6541 attcattcag caaatattta tgactacctg ctccgtgaca cgcctattcc agacactggg 6601 gatacagcct agaaaccatt ctccagctga gtgacacctc ataatcccag ctcatgcctg 6661 caagcccagc actttgggag gccgaggcaa gtggatcact tgaggtcaaa agttcgagac 6721 cagcctgggc aacacagaga gaccccaact ataaaaaatg tttttgaaaa agtagttgga 6781 tgtggtggtg cacacctgta gttctagcta ctccagaggc tgaggtgaga ggatcgcttg 6841 agctctggag gttgaggctg ctatgaccac accactgcac tccagtctgg gtgacagagt 6901 gagaccttgt ctcaaaaaaa gcattctcct tgcagaactt acattcatgg tgggggaagt 6961 taggcaatag gtgacagaga ggtgttttaa atagggtact cgggaaaggc tctttgagat 7021 gtggcctgtg agctgacccc tgcagggagt ggaagagtga gccatgggga ttagctgggg 7081 ctagtggatt ccaagcaggg ggaacagcca tgcaaaggcc tggaggcagg agcctgccta 7141 ctgtatttga gaaccattgt ggctggagca gtgttgggat gggagaagct aaagtcaaag 7201 agggaaagat cctggggtct tgcggactat tatgaagact ggcttttttt tttttttttt 7261 tttttgagac tgggtcttgc tctgtcgctc aggctgaagg tcaatggcgc aatctcggct 7321 caccacaacc tccgcctcca gcgtacaagt gattctcctg ccttagcctc ccgagtagct 7381 gggattatag gcgcctgcca ccacgcctgg ctaatttttg tatttttagt agaaacgggg 7441 tttcaccatg ttgtctaggc tggtctggaa ctcctgacct caagtgatcc acccgcctag 7501 gcctcccaaa gtgctgggat tacaggcgtc agccaccaca cctggccaac tttgactttt 7561 atttttactc tgaatgaggt agaaccagcc aagggttctg ggctgccacg ggtgctaaca 7621 gagttcccct agggaactga ttgttggggg ttgtggagta gagagaccag ggaggaggga 7681 gattggaggc tggactcagt ctagtgggaa gatgaggctt tgggggtggg tgatgtctct 7741 gctcccttct ggggggtccc agatctcaaa ctggcaccta cagtcttacc tcttccacgg 7801 tgcctatccc agagcctaca ttcaaaccct agctccaccg cttactaacc atgatcttaa 7861 cccatctgtg caacagtttt ctcatctgta aagaggggtg aatgtgagga ttaagtgaag 7921 taatacatgt aaattgctta gagaagtgtc tgccactcag aagcactcaa tatatgttct 7981 cggcagtgct ggtcacattg ctgtggctgg aatgctttct tggcagttgt ttaatcttgt 8041 gacacacaca tgcacacatc ctgaaaatgc cagatgcatg gctcatgcct gtaatcccaa 8101 aaccgtggga ggccaaggag ggaagattgc ttgagcccaa gagttcccga ccaagctggg 8161 caacatcgtg agaccctgtc tctacaaaaa atacaaaaat tgccagggtg tagtgacgtg 8221 tgcccatagt cccagccact tgggaagctg aggcacacgg atcacttgaa tccaggaggc 8281 tgaggttgca gtgagccatg attacgccac tgcactccag cctggacaac agggtgagac 8341 cctgcctcaa ttaaaaaaaa aatgccagat gcagaaactt acttttaaag aacactcatt 8401 tggagccagg agagggtagc tctcacctgt aatcccagca ctttgggagg ccaaggcagg 8461 cagatcacct aaagtcagga gttcgagacc agcctggcca acatggtgaa acctcatggc 8521 tactaaaaat acaaaaatta gccaggcgtg gtgtgcgcct gtaatcccag ctacttagga 8581 ggctgaggca ggagaatctc ctgaacccgg ggttcggagg ttgcagtgag ccaagatcgc 8641 accactgcac tccagcctgg gcaacagaac aagactctgt caaaaaaaaa aaaaaaaaaa 8701 cccactcatt tggatgtagc cccctcccgc ttcctggggg tcgcaagggc agcatgccac 8761 ctcactgcta gagatgttca gcagggaagt gcccttgaac tggagcctgg ccatcaccac 8821 tgactagcag ggtggcctca ggcaacacgc tccgttccgt gcacctcttc ttcaacatct 8881 gtgagggcaa tgatcatggt acttgcctcc cggatggttg tggggatgaa atgaatcatg 8941 cctgtaagtg cttagcacag accctgtaaa agatgcccgg tgagagccac acccaaactc 9001 ctctccctgt tcttcaacat cctcagaaac cggcccttcc tctctgtgtt tctccctagc 9061 ctggctccac ccatctcagg caaagcagcc tgtgtcttgc tcacactcgt tccatggccc 9121 ttcccaacct gctcaagtca gtggcttccc ttctgctcct tgccagacat tcatgcctgg 9181 ccttgacatc actgcttgaa cctctttgtc tggggggaaa tggatacgat atttgtaata 9241 tgattacaga ttgtcataac caacaagaaa aacaagatca gcgctatgag agagagtgac 9301 agggtggccg tagctcaggt tgagcggtca aggtagctga gatgcaaagg gtagggaaga 9361 gccagtatga ggggccgagg ggcatggaga gtgttcctag tagagggaac agcacagaag 9421 acctccaggc aggagagagc caggcgtctt ttaccctcaa aaccgactca aagctctcct 9481 ttttcagaag cttcccagga ctgactcagt gcctatcaat tatcttgtat ctctttggta 9541 ttttcaggac ccccatccaa gcaaacccag caagaaaatg tgcctaaaaa atagtcagtc 9601 agccgggtga tggccaggcg cggtggctca cgcctgtaat cccaatactt tgggaggcca 9661 aggcgggcga atcaactgaa gtcaggagtt caagaccagc ctgaccaaca tggtgaaacc 9721 ccgtctctac tgaaaataca aaaattagct gggtgtggtg gcgcgcacct gtaagggagg 9781 ctgaggcagg agaattgctt gagcctggga agcagaggtt gcagtgagcc gatatcaccc 9841 cactgcaatc cagcctggat gaaaaagcaa gactctgtct caatatatat atatacatac 9901 acatatatac atatatatac acgtatatat atacacacac atatatgtat atttttttaa 9961 ataaaaataa agtcaatgaa agcttttact tcaagaaaga aagagtccct tgaatgttgc 10021 cttttcgaaa gccgccacta taggattctt ttaagtaggg cctcccccaa tacctgtgag 10081 atgatgtcca gcacggagca gaccctcaat cagtatctga catgaatgga aggttctatt 10141 tatatcagtg cagcaagcag ctgggtgctg tggctcacac ctgtattcct agcactttgg 10201 gagcctgagg caggaggatt gcttgagctc aggatttcaa gaccagcctg ggcaacatta 10261 tgagaccctc gtctctacaa aaaaaaatta caaaattagc caggcatggt ggtgtgtgct 10321 tgtagtccca gctacttggg aggctgaggt gggaggattg cttgagccca ggagattgag 10381 gctgcagtta gccatgactg tgccactgca ctccagcctg aacgatagac caagactctg 10441 tctcaaaaaa taaataagac aaaatgcagc aaatataatg tttacagttt actttgaaat 10501 acatcaaaaa tgtttgatat ggggctaaat ggacagatgt gtgaccaagc aagtatggta 10561 aaatctaagt ggtgagaatt ctggtgttca ctataaaatt ctttcatctt ttcatgatgc 10621 ttgaaatgtt ttcaaaataa aatatcgcag ggggagggaa ggagcatatg taatgaaatg 10681 atggtattcc acaaaataag ggaaaatgca gtgactataa agtgaggaac gtcccatttt 10741 ttgcacaaat gcaggcccct ccccttgccg gtcattgagc tcttattgat gcgtggcttt 10801 gtgtgtttgc attcacattc tgtttatagg ccttgcctcc caggaaggct gtgaattccc 10861 caagggcaag aacactgtct attattttca gtcccccatt gatggaattt tgtagctgga 10921 agcaacagtt gtcgaatccc tgcctccctt tatacaggag gaaatcgtgg cccctagagg 10981 ggaagtaggc ggccttcagg catcttgctc ccgattggaa cgcaagtctc ccagcaccca 11041 ggctagtgcc tgggagtgtt acagagactt caccgggaac cacgagaagt caggtgctgg 11101 cgacagacgg agccagaccc aggcctgctg ctttctctgt gggcgccctt gtcccactga 11161 gccgcagcat cctcatccat acaacggtga cgataacagg gcttctgcct ttgggttgtg 11221 ggagacttaa agagttaaag aacatgaaca ggccagatga agtggctcat gcctgtaatc 11281 ccaccacttt gggaggccga ggcaggagaa tcccttgagc ccaggagttt gagaacagcg 11341 tgggcaatat agtgaggccc tgtctctacc aaaaaaagaa aaaaaaaatt tttttttgag 11401 acagggtctc accctgtcac ccaggctgga gtgcagaggt gcaatctcgg ctcactgtaa 11461 cctccacttc ccaggttcaa gtgattctcc tgcctcaacc tcctgagcag ctgggattac 11521 aggcacgcgc caccacaccc agctaatttc ttttttggta tttttagtag agacagggtt 11581 tcaccatgtt ggtcaggcta gtctcgaact cctgacctca tgatctgcca gcctcggcct 11641 cccaaagtac tgggattaca ggcatgagcc actgtgcccg gccaaaaaaa taatttttaa 11701 ttagccaagt gtggtggtgt gtgcctgtag ttccagctac tcaagaggct gaggtgggag 11761 aatcccttga gcccaggagt ttgagaacag cctgggcaat atagtgagac cccatctctt 11821 aaaaaaaaaa aagaaagaaa aagaaaattt gatgagccaa gtgtggtggc gcgtacctgt 11881 agtaattcca gctactcaag aggctgaggt gggaggattt cttgagccca ggaggtcgag 11941 gcagcagtga gccatgatcg catctctgca ctctggcctg ggcgaaagag caagattctg 12001 tctcaaaaaa aaaaaaaaaa aaaaaaaaaa aaagaatatg aacaactctg gacacggtac 12061 cttgagtcta gcaagtgctc tatgctcagt aattgtaatc attgttcctc tgggctttga 12121 atggtccaca gatctgaaaa tggggccttc tgtgccccta ctcaggtggc tcctgtcacc 12181 agcatgtctt ccacctgagc cttccatcct tctccctctt tcccctgaga ggatccccag 12241 ccccatcccg tgtctcactt ttcttattca ccaattctcc ttctctgctg acgccaaagg 12301 gcaaagacct ctgtttttcc agaatatcct ttgtagctct tcactgtttc tctccttcca 12361 cttactgact gtgtaaactt ggtcatgttt cttaatgtgt gcaagcgccc acgtgcttct 12421 ctataaaatg gggcaataac agtccctttc acctaactgg ggggttggga ggagtcaatg 12481 atgtcatgca ggtaacgcac ctggcaggca gcctggcagg tcttaactgc acaataaata 12541 caagttattg ttgttattat tttggcgccc cccaccccct tgactgagga tgctacctcc 12601 tgggcccaga ccattctctc ccggtgtgtc tgtccccacg agtggcctgc agtggttttc 12661 cctgcacgag tcctcctgcc agtagccagc acgaactttg caccccagca aagcgtggtt 12721 tgtccccatt acccgccctc ttacagctaa tcagcctgaa atctccagct caacccaccc 12781 tatgcctcag acactccgac tccttcagca aagtaacaag aaaagacaag gccctcctgc 12841 ctccctgtcc atccctcctt agggagccca ggggaccaca atgagacagc atagtgtatc 12901 aagtttattc acaaatccca gtacaacccc cttcagggat ttcagaaacc tgtcccccac 12961 ccccaacccc tccaggtcat gtcagctgat gtgacaacgg cgacattatt ctcccagcaa 13021 ggccggatgc cagtttaggg catgatgttg gcgacgctct ggaagagaga aggagagcag 13081 gggtgaatgt agggtccaca gtcatgacgg gacccgagac ctcagccccg cccctcggga 13141 gaagcacaag gggactggct gctctggccc cggagttatg ttctggggaa gtttcaggaa 13201 ggtagcatct aaacttaaga acacaaggaa gccagtcccc acaggcagag gggctggggc 13261 ctctgcagga accaacaagc ctcgtgtgtt tagcctgaga ggatgctctg ccgcccccat 13321 agcctgggat acacaagggt gggccttcct cctccgtgat acaccctcac cctggcctgc 13381 accaggtctc gggcagcaga tacaataagt gcaaaaagag gctgggtgca gtggcttacg 13441 cctgtaatcc cagcactttg ggaggctgag gaaggagggg atcacttgag atcaggagtt 13501 cgagaccagc ctggccaata tggtgaaacc cctgtctcta ctaaaaatac aaaaattagc 13561 tgggcatgtt ggcgcacacc tgtaatccca gccactcagg agactgagat aggagaatca 13621 cttgaaccca gaagacggag gttgcagcga gctgagattg caccactgca ctccagcctg 13681 ggagatagag tgagactccg tctcaataaa ttaaaataaa gaaagagagt caggagttta 13741 ctttgagccc caactgggaa ccaggattcc ataggagcct ctgcactggg gtgaccttgc 13801 accagggaga ggacaaagac acaggctccc cctcccagca tgaagcaaag gcatttgcag 13861 gagggttggg acaggaggga agccagtcag caaatgggaa tcgaggaatg acaaggagag 13921 agagggaatg agaacagtca gcagcccctc ctccctccct ggctcccagg ggcaccaggg 13981 cctgggaggt cttttctgcc agcttgtgtt tgccttggtg gaaatgaaag gcacccaaaa 14041 gatgagggca gggtgcaaag gggagggagc aggtggagag agggaagccc aggactgggt 14101 atcaggttgg agggtctttg cagaggctgc aggtctgccc gcccggggag ctgtccaaac 14161 agacaccggg aaggggtcgg ggtgagcgta tctatcagga cgcagagcag aaaagagaca 14221 ccatggggca ctcaccctcc acagggcggg aaggttgatg aaaggcgtgt tgaccgaggc 14281 ctgcaattca aggacaagaa ggtcagtctc cagcgtctga cccacagcca tgctagggaa 14341 ctcccagggg cctcctctcc cctaccccag agtccgcgct taccccagag gctaacggcg 14401 tggttgtggc ccctccttga tggctggaag atccgctttg atggttgccc tgtggacaaa 14461 gcccagactc ttgttacaac cccacaaacg cggagggtct cccagttcct tttccccagg 14521 gacctggtcc cctccagggt gactggggct gcggtgggag gcacagctta acccactgcc 14581 tcctcactgt ccagaagaaa acccaaagcc cgtgctcgcg tgggcattct gcctcctgct 14641 gccttttcct ccccaaagcc ccttttaatt tccttaatga agtgctctca gtcatggagg 14701 aggggctggt ggaaagggaa ggtagaggac ccgtgacaaa caatatcagg acctagttta 14761 acctcagccc tgacctcaaa tctacccttc tatgccctgt ctagctgcat ccctaactcc 14821 actgatcctt gtccttaacc cagccccaga cactcagcca tgcaccccag cagtgccccc 14881 aaccccggct gctgttgcca gccccaaatc ctacagtgca ccgtcctctg ctcgcggcac 14941 cctgcagcct gccgtgcccg ctcctctgct gcgccgttcc tccccaccca cccatccctg 15001 gctctgtctg gcggcagatg tggctctgga cttcgacgtc tgagccagtc acatcttgca 15061 gatccaggac ccctcatgga gcctggagag tgagggcgca caggagtcca ccctctcctg 15121 ctccccacgc tcccttcttg gtgcctgttc tgttgtgagt ggtgctattt gcctgcctcc 15181 agtctctgtt ccctctacat attttatttt attattattt tttttgagac agagtctcac 15241 tctgttgccc aggctggagt gcaatggtgc aatctcagct cactgcaaac tccgcttctc 15301 gggttcaagg gattctcctg cctcagcctt ctgagtagtt gggattagag gcgccagcca 15361 ccacacctgg ctaatttttg tatttttagt agagataggg tttcaccatg ttggccaggc 15421 tgctcttgaa ctcctgagct caagtgatcc acctgcctcg gcctcccaaa gtgctggggt 15481 taacaggcat gagccactga acctggcctc tgttccccct acataaatgg gtaagagaaa 15541 ccttccagaa aacctctcct ggctgggcag tgcaggggaa gagaaccctg gcctgccctc 15601 tgccaaagct gggcactagc cacaggactt tggcacgtcc cctaacctct ctgagccttt 15661 gtttccttat ctgagaatca gggcgtcagc agcctcatct tgaccacatc agagggtagt 15721 tacaaagatt acacaggcca aggctcatct ccgaaagcta ttcctcatgc tcactctcct 15781 aaccaggttt tgtcaacact cagtcccttt gcactatact agatcttgat ctcaggaact 15841 gtttcccatt ccccaggaag ggcttcatgg gccccacttg aggccccctt aaccaggagg 15901 gcagctccct gaaggtggga cagggtgccc gccaccctgt ctgatacgtg ctttgcagat 15961 aaggtggctc accccatctg accagctgcc accagctcct ttgtaaggga gatggggaag 16021 ggggaggaac ccagtgatat agctgcccaa gcttctttca ccaaacaaag gctacgcgga 16081 cccccccgcc ccgccaaatg tcccaggggt ttaacctttc cctcccccaa cccccctgtc 16141 ccccatctcc ccatccctgt tcacctacat tcagcagctg gttggcctcc ttgctggctt 16201 ggttgacccc attatgagca ttctgcagct ccttcccggc ctggccagca gcatggtggg 16261 caccttggcc aagcttctcc acttcctttc cagcctggtc agcagcatgg ttgacccctt 16321 ggccaagttt ctctgcttcc ttcccagcct ggtggacccc agtgtggaac ccttggaccg 16381 ctttgtctgc ttccttcccg gcctgttcaa gggtatggtg aactccctgg ccaaactgcc 16441 ctgcctcctt cccggcctcg tggaccccag gttggacacc atgaactgct atgtctccct 16501 ctttcccagc ctgccctgct gtgtggtgaa tgtcgtggcc aaactgcccc gcctcccttc 16561 cagcctgact aacgccatga tggaggcctt ggaccactct gtcttcctcc ttccccacct 16621 gcgaggcagc atggtggaca ccctggccaa acttctctgc ttccttccag gcctcattaa 16681 ccccatggtg gaccccatgg ccgagcttct ctgtttcctt cccaaactga ctggcagcat 16741 ggtggacccc ctggccaaac ttctctgtct ccttccagcc ctcactgaga ccatggtgga 16801 ccccctggcc aaacctccca gcctcatttc cggcctgccc cgcagcatgg tgggccccct 16861 ggccaaatct ccctgcctca tttccggcct gccccgcagc atggtgggcc ccctggccaa 16921 acttctctgc ctccttccca acctgaccag cagtatggtg gaccccctgg ccaaacttct 16981 ctgtctcctt ccagccctca ctgagaccat ggtggacccc ctggccaaat ctccctgcct 17041 catttccggc ctgccccgca gcatggtggg ccccctggcc aaacttctct gcctccttcc 17101 caacctgacc ggcagcatgg tggatcccct ggccaaactt ctctgtctcc ttccagccct 17161 cactgagacc atggtgggcc ccctggccaa atctcccagc ctcatttccg gcctgccctg 17221 cagcatggtg gactccctgg ccaaacctcc cagcctcatt tccggcctgc cctgcagcat 17281 ggtggactcc ctggccaaac ctcccagcct catttccagc ctgccctgca gcattgtcga 17341 ctccctggcc aaacttccct gcctcacttc ccgcctggtt ggccccgtga tggaccccat 17401 gatggatcag tttgtctgcc tccttcccaa cctgtccagc agcgttgttg accccatggc 17461 caagcttctc tgcttccttt cctgcttgtc caataccatg gttgatctca tgggcaacct 17521 tgtccatgcc gtggttgagc ccctggacgc ctttgtccaa ctccttgccg gtgtggctcc 17581 ccatgttgct aagtccgttg aaaaccttct ccacttccct tccggcatgc gtgattccac 17641 tgttgatgcc atccagggcc ttgcccacct ctctctctgc attgctcagc cctcggttga 17701 tcccttcaat gaccttctca atggggtcat cgctggccgc ccatccagac agggccccca 17761 gtagcagaag gagggagcag gagccgacca gacgtgcaag atgcatattg ctgggaaggt 17821 cgggaaggat gcagagagga gccagggaag ccacgctgct atttatcctc tctccctctc 17881 gccctcttct ccacccctca tgactcaatt acccctgtga ggcactgagt tggtcttcag 17941 tcaaggaggt gtttcccagg gagatttgga gttgagcaat ggagaggagg aggaagagag 18001 ggtgagtcat ggatggagaa gaactctggc atctttgctc cttgccctcc accccagaga 18061 tgggtgtctg cccttccaac cacctcctct ttcattcctc acccactgtc caggcctcct 18121 ctctcgcaaa cccaccttcc accatgctcc atccctggga tgcaggcaat tcttcaacac 18181 ctcactcctc tcaggcatct aagcctccag ctccccaggc agtgaggggc tagaatgatg 18241 ggaggcaccg tctggagagg ggagaacact aggtgggatg aaggagtcat cagacctcgt 18301 tcttgtgggc cagcccctgc cctcactcat tccattccag ccacacaggc cttctccctg 18361 ttccacaaac acaccaagct cattcctgct gcagggcctt tgcacatgct cctctctctg 18421 cctggaatgc agaacccaga tcttcacgtg gctggctcct cctcatccgt acttgactta 18481 aatgccaact tcgcaagaaa acctctcctg accataacat ggcattaatt atactctgta 18541 gtatcaccct tttgttttca cagaaaaaaa aaatctgaaa ttacctttat tttttattgc 18601 ctgtctcacc tactagcaat gtaaattggt cgtgttgatc actgctgtat tcctagcttc 18661 ttgaacactg cctggggttc agtaaatatt tattgactgg acaaatggat ggattcgaac 18721 aggccaagga aagagactcc gagcccgagt ctgggtctca cagggtgctg gagggaaggg 18781 cgcctgggtc gccagggcag catagagtcc cttgtggagt gccgcagatt ttggaggcag 18841 gatcccattc ctgtcgcaga acccagacct cctgcctccc agcccttggg caagggcagg 18901 gggctgattg ccacgagctg gaacaggacg cagtgtctcc tgaaaagccg gttgaggcag 18961 ccacggggag gggccgaaga cccagggcag caggagcctg tgagccagaa cctgggctgg 19021 ggccaggccc tcatccttcc tgccctcccg acatggggcc tcctaagccc caacctgaag 19081 ccccaagtgt tgacatattg gctcgggttg gaggggaaac agctgagagt ctggcctcag 19141 gcatgaaaag ggcatcggaa ggaagggcag gtggtggtgg gcaggcttgg ggcagcagat 19201 gcatctagaa gccagttggg agggaggcta agaccagaac gagagccagt gggtgggtac 19261 agtaggtgca ggggtgaaag atatttgggt tttggtgcca aggggagaga ctgagagagg 19321 ggtggaggtg ggattgctgg tgggaaggcc aggcaggaac aatgtgcagg gtgtgagcag 19381 ctggaggtgg cagagctatg tgtgtgggtg cccaggtgtg gcgagaatct gcacgtgttt 19441 tcaggctttg ctcaaacaaa ggcgccaagg cccactcccc tcagaggacc cggggttcct 19501 ctccagctcc tggacacttg actccaccca aagacaggca tcccaggaca gatatccaga 19561 agcttctgcc ttctctgggc tctgcctata gagggtgact aagtgtgcca cctctccaag 19621 cttgagctca tcaaatctgt ttcccggttt cacactccag ggacaaagcc aaattcctgg 19681 gcaaactcag gtgtggtttt tctgccctgg tccagggccg aaatgatgga gtttcccata 19741 ccaggaggtg gctgggcaga gaagagtggg tgatgagtgt cagaatccta acccacccag 19801 agatcaagag gtgggatttc tcccactttg aactgtgttc catcttcttt ttgtgttttt 19861 ggagggagag tctcactcta tcccccaggc tggcatgcag tggcatgatc tcggctcact 19921 gcaacctccg cctcccgggt tcaagtgatt cttctgcctc agcctcccga gtagctggga 19981 ttagccgggc acctgccatg atgcccagct aatttttgta ctttcagtag agacggggtt 20041 tcaccatgtt ggccaggctg gtctcgaacc cctgacctca ggtgatccgc ccacctcggc 20101 ctccccaagg gcagggatta caggcgtgag ccactgtgcc tggcctgaac tgtgttccat 20161 cttctctatc cctctccacc ccatcctttc atcttctgcc agcctcagtc cctctcctct 20221 gtctctctat tcccatgtgg cacgattcga ttcaggactt cagcatctgg ctccataact 20281 gagaatggcc tggagttgtc ctttctttat tttatttatt ttttttatga agtatcactc 20341 tgttgcccag gttggagtgc agtggcaatc tcggctcact gcaacctccg cctcccaggt 20401 tcaagtgatt cgcctgcctc agcctcccga gtagctagga ttacagatgc atgccactac 20461 acctggctaa tttttgtatt tttagtagag atggagtttc accatgttgg ccaggctggt 20521 ctcaaactcc tgacctcagg tggtctacct gcctcggcct cccaaagtgc tgggattaca 20581 tgtgtgagcc actgcaccca gccaaaatca gccccttttg aatcctggtt ttgttgaggt 20641 tatatttttc acctccttga ttgtacacca aaatgtagct ccctttgttt aattaatcat 20701 ctgtgttttg gtctcggttc gttaaaacct cctcacctgt acacttagtt tactttcttc 20761 tcccaggcca tcaaagcagg agagcctcac cctgccctgg tcccatggca ggtgcttggc 20821 aaatgaattt ctcatggatg ttggcggaag gaagaggtta tgtcattctg ccccttcttg 20881 catccatccc tctctgggat ggtgactcct tcctggcaca aacccccact ccccctgaac 20941 cttggtcaca gctgcgtgtc aacctgctgt gcaggtgatg gtgcctgcct gtgagtcact 21001 gaggtgcaag caaaccccta ccccggggct gcaggggagg gagctgtttg tccacttgtt 21061 ctttgtcaga cgttcattga aaacctgctt tgtattgttc taagtgctgg aggcacagtc 21121 gtgaacaaac aaacaagaac aggagccctg tgaggggctt agagcccaag ggggagacat 21181 cgaacaaata aaccaatcag tagatcatga caaactacat gaaaagctac aaggaaaaga 21241 atgggtgctg cgagaagata taacgtgaag agtttgataa gattggcgat cagggccggg 21301 cgaggtggtt cacgcctgta atcccaccac tttgggaggc cgaggcagat ggatcacctg 21361 aggtcaggag tttgagatca gcctggccaa catggtgaaa ccccgtctct actaaaaata 21421 caaaaacaaa aaaaaaatta gctaggtgta gtggcgcacg cctgtagtcc cagctactca 21481 ggaggctgag gcgggagaat cacttgaacc caggaggcgg aggttgcggt gagctgagat 21541 cgcaccattg cactccagcc taggcaacaa gagcgaaact ctgtctcaaa aaaaagaaaa 21601 gaaaagaaaa gattagcgat cagatgaaac ctgaggaatt gacatttatt cattcaggta 21661 tagtttattg ggctaacgcc tttttcagac agggtcttgc tctgtctccc agctggagtg 21721 cagtgacaca atcacagctc actgcagcct tgacctccta ggctcaagtg atcctcccac 21781 ctcagcctcc ctaataattg ggacaacagg catgcaccac cacacctggc tacttttttt 21841 attttttgta gatacagggt tttgccacat tgcccatgtt ggtctcaaac tcctggactc 21901 aaatgatcct cccacctcag cctcccaaag tgttgggatt acaggcatga gccactgcgt 21961 ccagcccaca aacacctttc aggtgccaaa cacttttctt gtcaatggga tacagaagtg 22021 acatagatga cgtccctgtc ctctcagagc ttacagtctg gtggacagag acagttggca 22081 agaaaacaaa caatttttag agagacatgt gctctaagga aaataaaaac atagacagtg 22141 tagccctaga gaaattttgc acactgaagc ataactcaga atggagaagt cagacaaggc 22201 ttcctggaag atgggcagaa attgggaagg gggtgacggc tgacagtgac agagaaagga 22261 gggaacttag atgcagagtg tagaaagcct ctcttaaagg gtgggaaacc aagaggcagc 22321 caagggaagc cctgggtggg ccttccaggc atacggaggc agcaagagtc actccactgg 22381 agaatccaag ggacagcatg gaacctgtgt gcttcaagag acagagaaga ggaaagggct 22441 ttcaggaaca gtgtaaaagg tgggattcca ggctgggtgt ggtggctcgc tcctgtaatc 22501 ccagcatttt ggaaggccga ggtgggagga tcgcttgagc ccaagggttc tagaccagcc 22561 tgagcaaaag agtgagaccc tctctctaca aaaaaataaa aagtaaaaat agcctgcagt 22621 ggtggctgca catgtagtcc cagctactca ggaggccgag gcgggaggat cgcttgagcc 22681 caggagttca aggctgcagt gagccatgat tgcaccactg cactccagcc taggagaaag 22741 agtgacaccc cgtctcaaaa taaataaaca aatgataatt ttaaaaagtg ggattccatt 22801 ccagacggca gtgcgtggcc gtggaacggt gatgcatagc gtgctgacgt cccatgcctc 22861 tacggggatg ggatgcgatg gtgtcaggcc tctggcttgg gcgccgcccc tcggcgacgt 22921 cactgggtac tgtgacgtca ctgggtgctg tgacatcagg gcaattagcc caggacccac 22981 agccctggcg ctccgcacgc acctcggtaa catcacagca ggtccaggcc aatgataacc 23041 ttataagagg ccatgtcgaa gcgcgacatc gtcctcacca atgtcaccgt tgtccagttg 23101 ctgcgacagc cgtgcccggg tgagggaggc agcggagggc gcgggggagg ggtggaaagg 23161 gtagtgggga gcgtctgcac cctcacacct gtgccgtatc cctaccgccc ttcctggcgt 23221 gtgcaccatc tagggaccag gggatcagcc gctctccctc cctccacacc cgccagtgtg 23281 ctggcagagt ggggtttact gccatggcgg ggagaccggg cctccgcccc cagggctggg 23341 tgggcttcgc gccgctatca cgcggtagaa agagcagaaa cgagggccat cggaggctgc 23401 agcctgaaag tggcgggaag tgggcgggct taggaggggc ttagggagaa cccaggggtg 23461 ctgggcttcc atgtaaaagt gatgttcatg gcgcttcgcg tgaggtgagc tttgccctcc 23521 ctgtgatgca caaaacaagg acacttggaa ccttgccggg gacgtgctga ccacttcgca 23581 ggcgagagct cagggaagcc tcaccaccac ctgagaaggc cagagtgtgc ccgcagttta 23641 cagagcggcc aacagaggtt aagagagggc aaactacttc cttagtctct caacaagggc 23701 ggcgcgcccg cgcgcgcgca cacacacaca cacacactca ctcactcact ctagccggag 23761 gaaccaggtc ttccggtggg ctgcctccat tcccacactg ttccctcact tcccaagggc 23821 gggggccgca ggggtaggtg gggctggggc tttggtccag gagaccctgg aagacacagg 23881 agcagacagg cttaccagcc tcatgagact tgaacccctg ccctgcgcat cactgctccg 23941 tggcctcagg cttcattttc ccctctgtaa agtgggttta tagcgcatgc ctctcgggat 24001 tcagtgggag gacgcagtga ggagctgggg gcagcaccca gctagccccc aggcttttct 24061 ccaaccgggg gaccccaccc ctacagtgga ggggaagaga gtgcaaggct gggagtgctg 24121 ggagaggagg accatctgag ttttcttcgg aagggtcacc cattgggctc agattcccac 24181 ctgaaccagg tggacaaatc ccctttctcc catagcagaa acaaccccca aggatgcaaa 24241 catcctcaag ttcaccaata aacactaaca ccttcatgaa cttggacagc cactggaaac 24301 acctatagtc cagcctcagg ccacaaggtc tgtggcaaca gggagaggtc agggagacag 24361 acaggccagc tggcccccgg ctcccagccc caccaccccc caggtgcctt tgatttggtg 24421 ccaaaaactt ggactcccac tacccccttc cagcccttct ccaccccctg ctgtctccct 24481 ttcctcagga atctgtgtag gatcagactt ccctcctcta ttaaaatcag ctcctatgtg 24541 tgggcctcga gatctcactg cgccctcaca gcagccccta ggtgggggat cagccccaag 24601 aggttcatta acttgctcaa atccacagag cagcaggtct gtgtggactg agccaggctt 24661 gactttccag cttgaaactc tctgccacca ggtcccctga gtctccactt ctcccctctt 24721 ctcaagccct cctctgtagc tcaggcggct gctacctcca ggcgcaggtg tttttaagtg 24781 cagtctggga gcaggagtgt gtgtggagag gtgggagcca atgtgagctt ccatgggttt 24841 cacagagtat gatacacggt actctctaag cttccctgag gataagaaat gcctggagtg 24901 cttattttat tttttatttt tttttgggag atggagtttc gctcttgttg cccaggctgg 24961 agtgcagtgg cacgatctcg cccaccacaa cctccgcctc ctgggttcaa gcgattctcc 25021 tgcttcagcc tcctgagtag ctgggattac aagcatgtgc caccacgcct ggctaatttt 25081 tttgtatttt tagtagagac ggggtttctg catgttggtc agactggtct caaactccca 25141 acctcaggtg atctgcctgc ctcggcctcc caaagtgctg ggattacagg cgtgagccac 25201 cgctcccggc ctggagtgct gattaaaaac aggatttctg ggcctccata cagacccact 25261 gagtccgatt tttctaggaa gagccctcct caggtgattt tttttttttt ttgagacaga 25321 gtctcactct gtcgcccagg ctggagtgca gtggcgcaat ctcggctcac tgcaccctcc 25381 gcctcccagg ttcaagagat tctcttgcct cagccccccg aggagctggg attacagtat 25441 ttttagtaga gaacaggttt ctccatgttg gccaagctgg tcttgaactc ctgacctcaa 25501 gtgatccacc caccttggcc tcccaaagtg ccaggattac aggtgtgagc caccacgcct 25561 ggccactccg acgactccta tcatggaaca gtttggggac cactgggcta aatcaattat 25621 tggacaaatt cggggaacaa tctacaaaat actaatgcat gtcaaacatt tcctcggctt 25681 ggtcataagc aatccatgta ttacctcatt taatcctcac aacaaccctc taagaaaagt 25741 actattatct ccccttacag aggaggaaac tgaggcacag agaatgtcac cccaagcgat 25801 ctatgttgaa agtacatacc ccggccaggc gcagtggctc acacctgtaa tcccagcact 25861 ttgggaggct gaggcgggtg gatcacctaa ggtcaggagt tcgagaccag cctggccaag 25921 tgaaaccctg tatctactaa aaatacaaaa attagccagg caccttggcg ggtgcttata 25981 atcccagcta ctcgggaggc tgaggcagga gaatcacttg aacccaggag gcggaggttg 26041 cagtaagcca agatcagtcc actgcactcc agcctgggtg acagagcaat actctgtctc 26101 gggaaaaaaa aaacaacgga agtacataac cagtgaatta gaatctccag gaaagagccc 26161 ttgggattga cattttaaac aagttcattc tgtagagaaa gtgggtttgg gaaacactgg 26221 gctaaatggt tggtcacgag gggccaggag ccagcaacag attctgggca gcaaagacag 26281 tttgttctct ttcactttgg tggtcatgca acccattccc tcccttcccc cgcatagtga 26341 ccagagcacc gcccccacct gagcctaagg ctgaagtaga gccccagcca caaccagagc 26401 ccacaccagt cagggaggaa ataaagccac caccgccacc actgcctcct caccccgcta 26461 ctcctcctcc taagatggtg tctgtggccc gggagctgac tgtgggcatc aatgggtgag 26521 tctactccag cccctgatca gtctgaagcc ttagagccca gcccctcctc tggctgctaa 26581 ctgggggcac ctccacattt ccccccatca atgctttcgt tccgacgacc ctggagaaat 26641 agcccggagc aaggcagaca gggccccatg ctcagcagca tgctggtggg gaagacagta 26701 atcaaataaa gaaattagat tgttcagtag agtgtgctgg gaacagtgta gaatccagtg 26761 gaaaagtcag gctggagagg gcagcctttg ccagtgatca gggaaggcct tggggagaag 26821 gtgtcattgg agcgggacca gagtgatgag acacagccag ctcccagggc accttgctgg 26881 ggaaggagtc ttccaggcag cgggacaagc aatgattctg gggcagaggc gagctcagta 26941 tgttcaggga cagccaggaa gtcagagtgg ctggtgtcca ccaagcaaag ggaaaaggag 27001 ggaggccagt gagaagcccg gagggggctg ctacaccaca gtaaggagtg tggatcttgt 27061 tttactgatg gacagtcgtg gatgttttca tgccaagaga cgtaatctgt cttgaagttt 27121 taagatccat ccaggctctc acacctgtaa acccagcact ttgggaggcc aaggtaggag 27181 aatcctttga gaccaggagt ttgagaccag cgtgggcaac atagcgagac tctgtctcta 27241 cagaaaaaaa ttgttttaat tgtttaatga aaaaataaaa gatccatcca gctacaaggt 27301 taagaatagg ggttaggagc caggcgcagt ggctcacgcc tgtaatccca gcactttggg 27361 aggccaaggc gggtctatca acttgaggtc aggagttcga gaccagcctg gccaacatgg 27421 caaaaccccg tctctattaa aaatacaaaa attagccagg tgtggtggta cgcgcctgta 27481 gtcccagcta cttgggaggc tgaggcagga gaatcacttg aacccaggag gcagaggttg 27541 caatgagccc agatagcgcc actgcactcc agcctgggcg aaagagctgg attccatctc 27601 aaaatatata tatatatata tgtatatatg ggttagggaa cggattggga gagggagctg 27661 aaaaagcagt taaaggggct ggtgaggtca tcttggcaag agatgcccat ggccagcaca 27721 gaggttgttg ccttcttccc ctggattaag gattcccaac caggctcctt gagggagcct 27781 cctctccatt acccccttct ctcccatccc ttcctgtctg tccctggcca gatttggacg 27841 catcggtcgc ctggtcctgc gcgcctgcat ggagaagggt gttaaggtgg tggctgtgaa 27901 tgatccattc attgacccgg aatacatggt cagtagctgg cagagggcag gaacagcagg 27961 gtgggactgg ggtgggaaag ggactcaggg aagctgtatg gaaggctctc agctgtaatg 28021 tgagaccttc accaaagagg ggactctcta gggatcctca ccctgccacc ccaagactag 28081 gagccattcc atcccccaca ggtgtacatg tttaagtatg actccaccca cggccgatac 28141 aagggaagtg tggaattcag gaatggacaa ctggtcgtgg acaaccatga gatctctgtc 28201 taccagtggt aaggaaagca tctgtctgat gcacggctgt gacatattga ggcagggcat 28261 gtggaggtgg ctaaaatggg attccagcct ctcacatggg gtgctgaaac cacccccaaa 28321 attgcgtgtg catgcctcta ggtactaatt tgaagaggga agtctcaggt ccttcacccc 28381 taaaatgatt aagaagcact aatctaaaaa gatgcctcac tgtactgaag aatgcttttt 28441 ctttatattt tttttaggga cggggtatca ctcttttgcc cagtctggag tgcagtggtg 28501 taatcatagc tcaccgcatc cttgacctcc cgggctcaag tggtcctccc acctcggcct 28561 cccaagtagt gggcaccagc caccacacct ggcaaatttt ttgtagagat gaggtcttgc 28621 catctggccc aggctggtct caaactcctg gccccaagca atcctcccac cttggcctcc 28681 cagagtgctg ggattacagg caggagccac tgcattttgc tttaacgcac atttgcatgt 28741 tttctcgtgg gcatatcaac atgtccagaa ctcaactctt gattcccctt caacccacac 28801 ctattcagct cccattgacc actgctggtc aactatgcct gaattccaaa gggaggaagg 28861 tataatgagg cttgtgcgaa catccgaaca tccgttccca tcatggcctg aaccagtgtt 28921 tcagttttac tttagaatgc cctgggccga gagaagggct acattcagtt agttggggga 28981 gtttagaatt ttatttttgg ctgacattca cctttctaat gtaaactatt agcgactcct 29041 ggctctgtgg ctgcctgggg cttttacaca tacacaccca caggcccaca cacatgcaaa 29101 tacacacagg cacactagtg aaccctgtct gggcccaggc agggtctgca gcctccccag 29161 cgcccctccc cacagatcta atctcagccc ttctccttgt acttactggg ggtagggtga 29221 gtgataaagg ggcagggatg ggtgcaggaa gtttgtgggc agggggcaga gactggagag 29281 tcagtgaaaa ttgaagaaag ctgcttaacc caagtttgaa atcatgacac tagaaacaac 29341 tccagcacca tagttatcac gcagacttca acgtggccac caccattacc gccacttgga 29401 gtcatccctc ttccccacct agctgccccc tccgtccacg gtcccgactc aggtcttgcc 29461 ttctctggtc acttgtgcaa tgcagttccc cgcccctcca gccccctctc catccccgca 29521 ctgttgggct gcacaagcca ggctgtgtgc ctgcactctg ccctgcacct ccgctagcta 29581 gctgggtgac cttgggcaag gaaccaaact tgcctgagcc tcagttcccc gctgtatctg 29641 atggggatga tgagcaaacc caccacagcg gtagctggga gcacgaatgg agtgaacatg 29701 tgtttgtgga cacttaaacg ggcacttccc acaaggccat ggccttgcac ggagggctct 29761 gtcaactggt accaccactg ttgctctggc attagcaggt acaggccctg atagccattc 29821 agaacttcct ttgaatgaat gaatggaatc ctcaaaacag cctggtgggc aggaggctgg 29881 ggcccatgac agccttgggg agggcctggg gtcttctggg ggcacagtct ttgcctcagg 29941 ccaactcagc aattgttgca gaggaatcgc aagtgtccct aatttggaag gcaggaaatg 30001 gggggtatct gagaagagct cgttgggcca gatgatggaa agtggactga cagagcagga 30061 gacaggaaga ggataaccat gcatgtgtga tggctttgca gggcagccca ttcattcatt 30121 cattcaggca gtggaaacag gagtgcagac ccgaggtggg accagccacc tgcactccag 30181 caggctctcc ctctcactcc cctcccctct gccctgacag cagtaaaaaa atgcaaaaaa 30241 tgacagcggt agcacttcct catcgggttg tgaaggttac atgagtttaa tgcaggtaag 30301 gcacttagaa tatgggcaca ggacaggtgt tcagtaaaca cgtataaaat ggctaactaa 30361 aaaaaaaaca cgaggccagg tgcagaggct catgcctgtc atcccagcac tttgggaggc 30421 tgaagtgggg atcacctgag gtcaggagtt tgagaccagc ctggccaaca gagcaaaccc 30481 cgtcaccact aaaaatacaa aaaacaacaa caataaaaaa ctagccaggt gtggtagcgc 30541 atgcctgtaa ccccaactac tcgggaggcg gaggcaggag aatcgcttga acccggaagg 30601 tggaggttgc agtgagccaa aatcgtgcca ctgcactcca gcctgagtga cagagtgaga 30661 ctctgtctca aaaacaaaac aaaacaaaac aaagaaaacc catgaggcca ggcacggtgg 30721 ctcacacctg taatcccagc actgtgggag gtcaaggatc acttgagccc aggggttcta 30781 aagcaccctg ggcaacaaaa gtgagacccc atctctacaa aaaaatttaa aaattagctt 30841 ggcatggtgg tgtgcgcctg tagtcccagc tactcgggag gccaaggcag gaggatggcc 30901 ggagccagga agatcaaggc tgcagtgagc tctgattgca ccactgcact ccagcctggg 30961 cgacagagga agaccctgtc tcaaaacaaa caaaccccac aaacattgat tgaacccccc 31021 attgagggga gcaggctcaa tggcaaagga aacagaccga cactggccaa gaatcacaga 31081 agcaaagcac ccggttactc atgataaacc tgggaaggaa gaaagtggaa cagtttattc 31141 acttaatgct tgtttgctaa agccaagtag aaaacggcaa tactagatgt ttagtgtgct 31201 tcaaatacaa atgtttgtca aaataccaag taaaaacata aatttataaa acaggaaacc 31261 agaaacctcc cttacctttc caatgtaaac catgagccac tgctgcccca aggctgcctc 31321 aggcttcagg agtcctccag gagctttttc tttaatgcag tgtctgtttt cagggaccct 31381 cctggccttt ggcagccagc ttcctattgc aggagctgag ggtctaggag ggttcatggc 31441 gaggttgtca gagctgggaa gcagaaagca gccgggcagg cctcacagtc acagagtcca 31501 cgtcccgggc agcatatgga gaaagggtct gggctggctg acagcagtca gcattggcgc 31561 ctctgaggag gttgagggct gagttgggga ggtcaccacc tgtacccagg cataacaggg 31621 gtggtgcaat cagcaaaggg aaagtggtct gaagtggggc agtggtgaaa agctaggcca 31681 gcggggtgga gcccatctgg aacaagtagg tgtgtggggt aggggacgca ggtggttgca 31741 ggtgggccgg tacctgctca gcctgtcaaa tggtgaaagc catggactat gaaggtgggg 31801 ctcaagcagg ggagactgac tcagccctgc cacttcccca cagcaaagag cccaaacaga 31861 tcccctggag ggctgtcggg agcccctacg tggtggagtc cacaggcgtg tacctctcca 31921 tacaggcagc ttcggtaagc tggggagagg tgcccagggc tagctggggg gatgatggtg 31981 ccagaagccc ctgacacctg cgcttcctcc ccaggaccac atctctgcag gtgctcaacg 32041 tgtggtcatc tccgcgccct caccggatgc accaatgttc gtcatgggtg tcaatgaaaa 32101 tgactataac cctggctcca tgaacattgt gaggtaatgt gggcagtgac atcctgcaat 32161 gtgtggaagg gagggtagac tcgtcctccc caccctcagc cccactggat tcctggtcgc 32221 ttggcttctg cttctccttc ccgaaactat ctgctaaaaa cgcatgactt ccagaggaca 32281 agcttgggga gcctccccag ctgcacctca gtgcctcctt cagtctgaca gtgtccccac 32341 agatccctct caccctcatc ctgacgtttc ataaaaccaa gtctgcgtgc ataccccaag 32401 aggggtaagg gtggaggggt ggctctgcga ctcacctcac agtgtccgtg cacaccttgg 32461 ctgtttcagc aacgcgtcct gcaccaccaa ctgtttggct cccctcgcca aagtcatcca 32521 cgagcgattt gggatcgtgg aagggttgat ggtgagttga ggatgagggg ctggggcagg 32581 aaggatggca gggaaaccca acttcttccc gggccttgct tactgtatgg agttaagagg 32641 gagagactgg tttcgggagg agaggcccac cagtgcagaa gtcacttaaa acactgtgca 32701 accctcaggc aaggctggac cctggcctgc acacatcccc tcctgtggtc tgtggtggtg 32761 gccccaccag cctccacacc taggccacca acttagtcct ggaaaaaaga ggcatgggag 32821 cttaggagca tgaaggcctc atcttggtcc ttctcttccc caagaccaca gtccattcct 32881 acacggccac ccagaagaca gtggacgggc catcaaggaa ggcctggcga gatgggcggg 32941 gtgcccacca gaacatcatc ccagcctcca ctggggctgc gaaagctgtg accaaagtca 33001 tcccagagct caaagggtat gaggacaaga agctgcaacc agggtggggg catacgccag 33061 gaggactgga ctggcccggc cctcagtcct taagaggaaa gcaggggcct ggcccagcca 33121 cagggaaagg gggaatggag ggcaacgtcc ctaagttctg actcctgttc ctcatggggg 33181 attctccagg aagctgacag ggatggcgtt ccgggtacca accccggatg tgtctgtcgt 33241 ggacctgacc tgccgcctcg cccagcctgc cccctactca gccatcaagg aggctgtaaa 33301 agcagcagcc aaggggccca tggctggcat ccttgcctac accgaggatg aggtaggggc 33361 tgaggagagg agaccctggg aggagccctc tgggaaggga catgatttcc acttgccagg 33421 gagctgctct caatgtgcca agtcagaaac tgcagggcag gaagggagat ctccctgcct 33481 cagggccttt gcacttgctg ttcctttagt ctggaatgct tcccttgcca ggtgaccata 33541 cagtctgtcc ttacttcctc caagtctctg gaggaacctc cctcttcagc aaggcacccc 33601 ctcaaaatat gttctttttg tggccatatt cctcacctga aatttcattt catagtcact 33661 gttggcccca gtggtgtgtg cactccaaga aaacagggat ttaggggctg ccttttcact 33721 tttgtggctc caacctggca tacagcaggt gactaataca tgcttagtga acaaaagaca 33781 acagctccca tttttttcaa tacttgctca gggccaggcc atggccagat gccttctctg 33841 tgaggagtgg atgccatggt catggtggac ccctgtacat ccaccttggt gctgtctccc 33901 tgcattaact cctgcaggtc cactttccca cagtcactgg gatcttttgt tttaatggaa 33961 tcacattgca tcacccgttt tcaggattga aaacactcct gtggcctttc atgacactag 34021 ggctaaaacc ctcaggtcct cccttcctgc agctccttct acctcagaca ctgtgctgag 34081 ggccccagcc ggtcagatct ttctagtcat ctagacccag gctcatggcc acccctcaga 34141 gaggccctcc atgctggccc attccaaacg cttgcattat tcctaggaaa gggatgtact 34201 ggtacagcat gagggtgttt gtgacctgtg ccctttggcc agaccctgac acagtgcctt 34261 ggtcataccc tgcacttggt cataccctcc agtccagaga ctggacacag tagggctaca 34321 tcaaatatta tagatgaatg catggatggg cgatagagtt aagagtcggg gcctcagctc 34381 ctggaggtcc ttgctctgcc ggacacactt atctttgaaa ttctgacttc caggtcgtct 34441 ctacggactt cctcggtgat acccactcgt ccatcttcga tgctaaggcc ggcattgcgc 34501 tcaatgacaa tttcgtgaag ctcatttcat ggtaaggggg aaggagctgg agacttagag 34561 ggaggggaac taaggggtgg tcggaaggaa cccccttgaa cctcccgacc cctcctccac 34621 aggtacgaca acgaatatgg ctacagtcac cgggtggtcg acctcctccg ctacatgttc 34681 agccgagaca agtgaaacgg gaaggtcctt tctttccttc ccaggggccg gggccggaac 34741 atgtgcctcc cgttccagca tctggctgcc cgggggagga aggacacccg gggcgggcgc 34801 cccacgccga tgggtccatg gtgaaataaa aaacagtgct cacggctgcg tcccgtatct 34861 ctgcgccggt cagggcgggt tctgatccgg gtttgaggcc cgccccaccc ttactcgatc 34921 gcctgcgccc acgggcgagg ggtcgcgctc gactccaagc cgggttccac ttcaggagac 34981 cgggaccgcg atggcagcgg tagaggcccc gcatggccgg aagtcactcc ccaaagcgct 35041 gggccggcag cggtgggacc cagggccggc cacgggctct ctgacgtcac tgggcgcgac 35101 gccccgcgcc gggactactg ctcccagaag gtcgcgcgcg ggcccccgcc agtcaggtgg 35161 gtgccaggcc ctggccgtgg cgaaagagcc ggcggagccg gagacccgct cccggagacg 35221 ccgcctcgcg atccccgcgc gggcgggacc gggcggccgg catcatgacc ctgtttcact 35281 tcgggaactg cttcgctctt gcctacttcc cctacttcat cacctacaag tgcagcggcc 35341 tgtgagtgcg ggaagggcgc ggggcggaga gggcgcgggg cccgggccga ccctcacctc 35401 ccgcttctcc aggtccgagt acaacgcctt ctggaaatgc gtccaggctg gagtcaccta 35461 cctctttgtc caactctgca aggtgagggc caccgggaag ccacgtgttc tggcccccag 35521 gctctgcaga cccagggacc cgcccccgtt gcctatccgc gcccccgccg ccccacggtg 35581 ggaccgccct cgggactccg cactgggagg cgtcaggata cctagagagg atggacttta 35641 aagagggcac gacctgagaa gagacctaga agcaactttt gcgtagcact tagtaaaact 35701 gagaaaacct cagtagtgtg gttggctaac aggttttttt tggtgcggtt aagagtgatc 35761 actgattccg tacgtgctgc taacactgcc caaggagcag caccttcaga cctgactcag 35821 atgccctgtg atcacccaga actcaaactt ccagccccct cggggacccg gggacctatc 35881 ctctatctcc ccgattcctg taattttgcc accatctggt ggttcctctt ccagatgggc 35941 ccccagaacc tgtcctgccc tcttcctacc cacgaacttc ccacaaatcc cacgtggttt 36001 atcttgatcc cctcaccttg aagtgacctc tttctgcttt ctgttctcag atgctgttct 36061 tggccacttt ctttcccacc tgggaaggcg gcatctatga cttcattggg gtgagagggg 36121 ccagggaagg gaagggagtt caggaatggg gctccctgtc cccctgtgct tacttaagcc 36181 tcaacctgac ccgcaggagt tcatgaaggc cagcgtggat gtggcagacc tgataggtct 36241 aaaccttgtc atgtcccgga atgccggcaa gggagagtac aagatcatgg ttgctgccct 36301 gggctgggcc actgctgagc ttattatgtc ccggtgcgta cagcagcctg gagcccagac 36361 ccctgagaag ggacacctgg gttccacggg ggtgctggag ggcaggggct caaagcctgg 36421 tgctgaaggt gtctgagtac tggagaatcc catcctttgc cttcctcagc tgcattcccc 36481 tatgggtcgg agcccggggc attgagtttg actggaagta catccagatg agcatagact 36541 ccaacatcag tctggtaggc agtcgtgctc tcccacatac acatttctgc tggcggccat 36601 actcctcccc aaggcctggc cccgactttc tgcctccctc taggtccatt acatcgtcgc 36661 gtctgctcag gtctggatga taacacgcta tgatctgtac cacaccttcc ggccagctgt 36721 cctcctgctg atgttcctca gtgtctacaa ggcctttgtt atggagtgag ttgggtgggg 36781 tttagggctg ggtccaaagt ggggtgggtt atctagtctc ccttccttat tgtgacattt 36841 tcctgcagga ccttcgtcca cctctgctcg ctgggcagtt gggcagctct actggcccga 36901 gcagtggtaa cggggctgct ggccctcagc actttggccc tgtatgtcgc cgttgtcaat 36961 gtgcactcct aggcttggtg tctcagacat tgatgtacct tttccctgcc tcactccagg 37021 ttttagtgaa gtaaacagta tttggaaagt tgttgctgcc tccatttctc tctcttggga 37081 actgtctccc aataccgtgt ccacctgggt ctcagaggcc ctggttctgt ctcaggagcc 37141 aggtagacaa gctggaagct agccagtcac tgacttgtcc catgtcttgt tcctcaggct 37201 cctggtttgc caggagtaga cagaaggttt ggatgatctt tgagcagtgg cagaggccag 37261 ggccctcagg gaacagatga tagaggggag ctagaatcca agagaaggcc cttggggggc 37321 tcttcctcct cacagcccca acctgggcct cctcacatgg gcccttcccg ggctggttgc 37381 ctctgaggct cctggcccca gtgtccccct ccaatccatc ctctgtatgg cagccagggg 37441 atctatctga aacccgtcta accaggtcat cctcccactt gcagccactt gcggcccctt 37501 gttacacagt ggacagtcca attgcttggc ctttggtttt acccaaccag caaaaccagc 37561 ttttctgaaa ctgctcccta gaataatttg tccagccaga ttcttaacat cgttcaagcc 37621 atggggtcag catggggctg gggggagagt tgtgactgtg agctcctcag ttctatggcc 37681 ctcagatcag atcccagcct gagccttctc agggtggagt ggagagacag ctgcagccaa 37741 gaagcagagc agtggggccc ccagcttgga cattgtcctg gatgcccccc tccaccctca 37801 gccttccttg gtccactgaa gccggctccc cgcctttctc tgagggggat gtgtcagaag 37861 ttttgaatgt catgtttaag ctccactttg atgtacaccc tccccactca ggaagatgct 37921 ccccatcctt ggtgttccca gtggggtacc ccagggagta ggatagcacc attgcccctc 37981 ctcctccacc aggccgttga agttccacct gatttttttt aagcttaggc ctaggaaagc 38041 tcactatgac catctcgatg tttccagagg gagcatttgc cttcagacgg cagctgccat 38101 ctaggccact cttcctcatt gtttggagta aagacagggt ccatgggtct cttgggagct 38161 ggaagtgatt gatcaccttg actttgatgt agaaaggaag tagatggggc agtctatctg 38221 ggtggacttg tgacaaggtc actttctccc acacttccat gccccacata gctcttcaca 38281 catattgaga caagtgtagg atgcaaaatt accaactgga ataatcccag cttacatggg 38341 gttcagggag agagactgga gtggctgggc ctgagttggc agaggacggt gaagcctggg 38401 cggttggact gtggggagcc aggcttcagg tgatcggggt tatgatggga aaaccccggg 38461 ttatggggac ccagaggagc cgcctggcct gccgtggaag cagtgcgggc tcccctgagc 38521 agaggacatg tgagctgaga cctaaagaat gagtgattgg ggcaaggcag aggtaatggt 38581 cttgaggtca gagagggggc ctgtttttcc caagggaaca taactggttg gaatgagttc 38641 aaggggactt gtaagactac agaagctggc aggggtcaga tcacaggagg ctttgggaac 38701 caaggagagc tctaggcaga ggaaggacag ggtcagattt ggctttagga agctctgcat 38761 ggctgttgtc tgaaatggga aagctgaggc cacgaggcca gggctggacc agggcagggc 38821 tttgtggggt taggagagtg actagaagcc aagcccaggt gttggtgatc agtggctgtg 38881 tgtatttatt taccggcagg gggcgctaac aggtaaggga ggagcaaaga gttccaggat 38941 aactggagtt tggggcttgt aagcccgagg gctcaccacg tttccaaggg agaagggacc 39001 acactgatgg gacagtggga ctatcagatt gggtcagtgc atgggtacag gcatgggggg 39061 tgatgaagag gccagtgtag gtcaccaaca ctgggggaag ctggggacag aggtgggtct 39121 cactgcagaa acaactgaag ccatggggga ctggggggct gtggtcacaa ggaaggagaa 39181 gcccaggccc tagccagaga aagccacata agaaagagga agagaggtga acagaggggc 39241 tgggaaagga gcggcagaga tgggaggtag ggggatgggg ttaacctttc aggtgagggg 39301 tgtgaggagg ggacctccag gagttgttca ggaactggcc attgccggga cagaggagta 39361 agcaggtagg ctcagaaagg agacaccgct ggggggtggg ggatgcttga tgtggaaatg 39421 acttgggccc ccttgtgtct gggagcctcc aggagggaga agggtgctga gaacagagaa 39481 agcccagagg tggtaggggg tgcctatcgc cacggcccag cttgcgttta gatgccatga 39541 ccctgggcaa aggaatagga atcatttccc caaggcaggt ttattgagga cctactatgt 39601 gccaggctgt gtactaggca ctggaggtgc agccctggat aaacagcccc catgtacagg 39661 actgtgggag atcacaagca ctaatgagag acgctgagga cagtgcctgg caccagatac 39721 agcacagtca gcatgagctg gttttattac catcgcccag gacacaagcg tgtctttaac 39781 gaagggccct caggcagccg tccagcctgg actggagtcc taagaacaga aacaccctcc 39841 agaagcggtc agctgtactc cctgtcagag ccccaccgcc accacaggtg gcggctttcc 39901 cgaggccagc ccagaggact gcccaggcgc tgctgctcca ggaggtgcga accttgggaa 39961 tgggggaggg gagtgggcag gtccctccaa gtttggggtg ccgtgggcta cagaagcaga 40021 tactggtggg gctgggactc ttggttgctc agatatcttg gtggctgtcc agagggtccc 40081 acgagccctg cccccacctg ctgtggcagt tgcagggatg cttgaaggca gtcgtccctc 40141 taatagtaga gttcctggtc ccaccagcct gggagagaga gggagaaagg agactcagtg 40201 ctgggggtgc cacttaggca gggccagggg ctcagaggtc agtgctggtt gctatggagg 40261 gtcaagggct tagaggccaa gtgttgagga tcaaaggtgg gtgcagctgc tcaaacgttt 40321 cagtcccctg cccccaggct cccagtctcc acactcactc cctgggcaac agcgaactcc 40381 aagcttccgg atctcatcat agacgaagat gaggatgccg tagggcaggg ggaccagcca 40441 ccactggaac ctgaaggcac atggcaaggt gaaggccatc ccaggcctgt gccctacagc 40501 cccctccctg tccttgcccc tcacagcctc tcaccgaatg ggcatgaagt tgaagatgtt 40561 gggcatgccg gggcagtagc acaggaagca gccgatgcag acctggaaca cgatggcgat 40621 caccaggatc ttattcctgg gggtgggcag aatgggacag gccattagga attggggacg 40681 tgatggaaat caggatactg tgggtcaaag taggtcagat gtcagcagca gcagggaggt 40741 gttctacaca ctgaacactg gagtggcccg ggcctctcag caccacccct ttagtgaaat 40801 gtggaggggg ccgtgggggg agttattggc cagttagaaa ggtttttttg gcatgtcacc 40861 atctggcact ggcacccgct ggcatttgcc ggctgttcta aaccacagtc aggatctgat 40921 gggagttggg accaggggtt ggagggcagg aagagggcca gccagggaca cctgaagaag 40981 ccttgctgga aggcagagag acggcgcgtc ttgcggatga ggacatcggc gatctggcac 41041 acctcaatgc tgatgaagaa cacggtgtag caggtgtact gctggtacag gcgctgcccg 41101 aatgtctgca ggccaggggc aaacggaaac agcctgagtc cagcctgagt cccggcggag 41161 agcctctgca gcccaccggg catagggtcc cagggccgtg aacccctaaa tgctctctct 41221 gccttgcatc agagtgtggg ggtgggggga aggagagagg cagggtctgc caggtgtgca 41281 ggaccccaga acgtggggcc cagaaacctc agcctagcag agcttggatg acagtggccg 41341 ggaaaactgt aggctgggcc gactgctttc atagtgaaag gggggaggca cccaaaagaa 41401 agggcctggg aaaggggaat tgaaatcctc agatgggatt tgcaaagtcc agtgactcca 41461 ggggccaggc agatggcaaa aataggtgag gcagtcaccg cgggcctttg ggaaaagagg 41521 gcagatgccc catttaaagg gggcagccac tatgcagccc atttcttcaa cacagctact 41581 tttttttttg agatggaatc tcgctctgtc acccaggctg gagtgcagtg gtgcaatctc 41641 agctcactgc aacttccgaa tcccaggttc aggagattct cctgccttag cctcccaagt 41701 agctggaatt acaagcatgt gccaccatgc ctggcaacac aactcctaat accagctaac 41761 tgtggggatc actgactgct ccaggtgccg agtgttgcac gtgcattagc tcatttaatc 41821 cgctgggtaa ccctaggagg taaatcagtg gtgctggaac ttgaacatgc atgagaatcc 41881 cagcaagggc tcccagaggg ctgggataga gcctgagaat ttgcactgca aacatgttcc 41941 caggagatgg tgatgctgct ggtctgggag caaactgaaa accactgagg tagactttga 42001 tattctctcc actttgcagg tgagaaaaca ggcacagaga ggcaaagcaa cctgcccagg 42061 gttacacagc ttgtgagaag cagtaccaga ccataagtcc aggcagtctg gcttcagtgt 42121 ctaccttaac catgatagta aatgtttaag ggaagtatcc tcgtctgtat gtaactctac 42181 aggtgaacta aacacatctg ctggccagat ctagcctcac tgtgctccaa accttcatta 42241 aaatagacca catgtgcaaa taccctgtgg gagtgatgct gtagggaaaa tagcagaata 42301 ggggcctggt tccacatctg agatgatgca ttagtgtcag ggcagagaca cccatctgca 42361 gttgcgatca tcagaagcag gagtttggag ctggagatgg tgggaaaggc ccttgccccc 42421 ccgaactggc agggagtctg ggggtccggg atgctctggc tgaactcagt cacacgtgga 42481 ggaacaagtg ggccgcacac aaggcaagtt gccctgggaa ggagggagcc cagggatggg 42541 atggggcggg gcagggctca cccactcctg gccgtagctg tcctgcagat cttgtaggtg 42601 gtggtcctcc cactgcgccc gcagccccac gcacagcagt gggaaccagc cctcctgggc 42661 cattgccgtg aagtagtcag tgaagccagc aaaggactga atggcacctg gagagagaca 42721 aggggacaca gggagacaga gatggacaca gagacaggga cacaggaaga gagggacatg 42781 gagagacagg gacacagata cacaaaggtc aaggacacac agacagggac acgggaagag 42841 agacagggac acagatacac aaggtcaagg acacacagag ccagggacac agaaacagtg 42901 acagagacac agaggcagag agagaagcag ggacagagag acagaggcag ggacagagag 42961 aaaaagacac atccaggacc caaaaagaca gaaataggca acaactcaga gacagggtca 43021 tggagagcaa ggtcagagag agagcaggga cacagcagaa ccagagagac tgtcacaaac 43081 agagagggac acggaggaca acggagaccc aggaccaaat gcagacactg aggtccagaa 43141 ccagccggag cccaaggcag ggcctggggc aggcgagcag cccagcgcag agccccccac 43201 ctccaggctc cccggcccac gggcacccac caatctggaa gtaggagtag gcagccaggg 43261 gctcgttgac caatctgtca cgctttgggt tgcgtggacg caggtgcatg atgtcactct 43321 cggccttttc atatgccagg gacacagatg ggaactggcc aggagtggaa ggaactggga 43381 ctgagggttt ggctgggccc ttgtcccctc cacttcgggt cccccttccc ccacttgtga 43441 gcaggaacag gactgaggtt agcaggcagg acctgcaggg acggcacagc cacaccagcc 43501 tggacagcct gggccccact ggcccctggg tgttggcatc ctggtgccca tcttcgtaag 43561 aacatctgct tttatcttct ccctctgtct ccctccctct ctttctctgt tcctcactct 43621 gtttctctgt ccctgtctcc agtgtaattc tctgtgactg tatctgctct tgtgtatttc 43681 tgtgtcactc cctatctctg tgtatcttca gtctctcagt gtcgccgtgc cccttcatct 43741 gcttgtgcct agagtggcag cagagcatac tgaccacgac cacaggctgt gggacctgtg 43801 actgctcaag ctcaagtcac agccgggcca cttatttgcc cagtgacctg ggacaatcca 43861 ctctacctgt ctatgccttg atgagctcat cagtaaagta cgggtgatag taacaaaact 43921 catctcatag tgtggttatg agtacaaaat taaggaaatg atggtgtgtc tggcccagag 43981 aaagcactgt gtaaatgtta ccgaataaat acctgtcaac acagctgcgc ctctggctgc 44041 attcgtgact gtatctgtgg ttcttttctg tctgtctgcc atggggtcta tctgaggggg 44101 tttatttgtc cgtgtctgtt taggggcacg tgtctgtgat tcaagtttgg ccagatacct 44161 gttctgtttc catctctgtc tgagtatctg tatctgagtc atgtgtgtgc atgggtcggc 44221 ccgggcctac ctgagtgtcc gcctgtgtgt gcatggctgt atgtccattt aggtctgttc 44281 acctgcctgc cacccatgtg tcctgcactg gctgtcattg cacacacagg tcttgtctgt 44341 cactctgtgc ccaagagtgt ctgtggtggt ctctgtccct cctgtccatc tgcaagcaag 44401 tgtctctggg caccctgtgg atgggtaccc tgggctgtgg acttacaatg tcagtgcaga 44461 gttcgatgaa gaggatggtg atgcacccga ggggcagggg cacgctgacg gtgatgtaga 44521 tgaggtaggg tgtcagctct gggatgttct tggtcaatgt gtaggcaata gacttcttca 44581 ggttgtcgaa gatcagtcga cctgtggggt agggtgggca cctcagcctc ctcacagccc 44641 tctccctcct gtgcccacac tgcctgccct ccccctggcg tggctcggac cctgctccac 44701 gcctgtcaca atggaggcaa agttgtcatc cagcaggatc atgtcagctg catttttggc 44761 agcatctgag ccagcgatgc ccatggctac tccgatgtct gccttcttca gagctgggga 44821 gtcattcaca ccatcccccg tgacggccac aatcgcaccc tgcaggcagt gggtgcaggt 44881 ggtgggtggg tggtcagtga gaggccggtc caagaccagc cccgcctgtc tgcccgcctg 44941 cccaccctca tcagcagggc tcaccagccg ctggcagctc tccacgatca ccagcttctg 45001 ctgggggctg gtgcgcgcaa acaccatctc ggggtgggtg cgcagggcct cgaccagttc 45061 cgatgggtcc atgtccttca gctgcatgcc attgatcaca caggcacggg catccctggg 45121 gaggagatgg gaggacctcg ctgggacctc ggtctgtgcc agatgtgggg agaaccccgg 45181 ggaggtctgg gggggcttac ttgcgattaa cctggtctac gggcacacgg aggcgggcag 45241 cgatgtcctc cactgtctcg ctgccttccg agatgatgcc cacactggct gcaatggcct 45301 tggcggtgat ggggtggtca cccgttacca tgatcacctg tagggggaac cagtggatca 45361 ctgacccctt cagatcagcc caatctccct gtcctccctg ggagacatct gctgatacac 45421 gtgttcattt acttgaccaa gcgtgaccac ctcctacgtg cctgggatat agcagagaca 45481 aaagagacaa aaatccctgt ccttgaggag ctctagtgag ggtgacagaa aataaagaaa 45541 cgaattatat agtgtgttag aaagtggaaa aaaacatatt aagcttggtg aaggaaacgg 45601 tgactcgatt ttaagtgtac ccagggaagg gcttattgag aaggtgatat ctgagcagag 45661 acctgtaaga ggtgagagag tcagccacgt ggaaagctgg gagaagagtg ctcctggcag 45721 ggagaacagc aggtgcaaag gccctgaggt gaggacaggc ccgaactgtc ggaggaacag 45781 cagggaggac accataactg cagaggagtg aacaaggaaa gagggctaac gaacaagctt 45841 agaaaggtaa caaggcccga tgtaacaggg ccgtagagac acgggaaggg cttgggcatt 45901 gcctgcctgg ccttggcttt cccacccttg ttttgtgttt ggggaaccac tcctctcccc 45961 tacaccatca gttgcagtga ggtccctggc taagattggg taagtaagcc cagacctggc 46021 caatcaaggt cttccatccc ccagccccag tgactggctc agggataaat gagtgaccca 46081 atccagtgct actgcaaaag aggcactctc tttttgtggg gtggctaagc aggtgggagt 46141 taagtttgag ctgctggagc aggggtgtgt gtgtccccaa gtgaagggag tgcctgcaaa 46201 taaagccagc aaagagaaca gcagagctga gacatggaga aaggaaaatt cctaatgaca 46261 ctgagctcct ggatc // LOCUS AC002428 143875 bp DNA PRI 19-AUG-1997 DEFINITION Human BAC clone GS039E22 from 5q31, complete sequence. ACCESSION AC002428 NID g2335068 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 143875) AUTHORS Strong,C, Biewald,T, Tin-Wollam,A and Duckels,G. TITLE The sequence of H. sapiens BAC clone GS039E22 JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 143875) AUTHORS Waterston,R. TITLE Direct Submission JOURNAL Submitted (19-AUG-1997) Department of Genetics, Washington University, 4444 Forest Park Avenue, St. Louis, Missouri 63108, USA COMMENT SUBMITTED BY: Genome Sequencing Center Department of Genetics Washington University St. Louis MO 63108, USA http://genome.wustl.edu/gsc mailto:sapiens@watson.wustl.edu NOTICE: This sequence may not represent the entire insert of this clone. It may be shorter because we only sequence overlapping clone sections once, or longer because we provide a small overlap between neighboring data submissions. This sequence was finished as follows unless otherwise noted: all regions were double stranded or sequenced with an alternate chemistry; an attempt was made to resolve all sequencing problems, such as compressions and repeats; all regions were covered by sequence from more than one subclone; and the assembly was confirmed by restriction digest. MAPPING INFORMATION: Mapping information for this clone was provided by Dr. John D. McPherson, Department of Genetics, Washington University School of Medicine, St. Louis MO. SOURCE INFORMATION: This clone is from the first BAC library from Genome Systems, Inc. (http://www.genomesystems.com). Cell line: lymphoblastoid Haplotypes: two VECTOR: pBeloBAC Selection: chloramphenicol NEIGHBORING SEQUENCE INFORMATION: The actual start of this clone is at base position 1 of GS039E22; actual end is at 143875 of GS039E22. The orientation of this clone is unknown. FEATURES Location/Qualifiers source 1..143875 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="5" /clone="GS039E22" /clone_lib="GSBAC1" /map="5q31" repeat_region complement(5925..6381) /rpt_family="L1" repeat_region 6395..7261 /rpt_family="L1" repeat_region complement(9580..10034) /rpt_family="ALU" repeat_region 15719..16011 /rpt_family="ALU" repeat_region 16686..16718 /rpt_family="L1" repeat_region 17851..17899 /rpt_family="L1" repeat_region 22012..22036 /rpt_family="L1" repeat_region 22113..22221 /rpt_family="L1" repeat_region 24193..24236 /rpt_family="L1" repeat_region 30275..31887 /rpt_family="L1" repeat_region 33104..33227 /rpt_family="ALU" repeat_region 33244..33528 /rpt_family="ALU" repeat_region 34799..37445 /rpt_family="L1" repeat_region complement(36012..36430) /rpt_family="L1" repeat_region 38537..38815 /rpt_family="ALU" repeat_region 38955..39081 /rpt_family="L1" repeat_region complement(40756..41047) /rpt_family="ALU" repeat_region complement(41119..41151) /rpt_family="L1" repeat_region 48982..49122 /rpt_family="L1" repeat_region 49941..50794 /rpt_family="L1" repeat_region 50979..51144 /rpt_family="L1" repeat_region complement(51255..51279) /rpt_family="L1" repeat_region 51518..52173 /rpt_family="L1" repeat_region 52352..53446 /rpt_family="L1" repeat_region complement(57647..57935) /rpt_family="ALU" repeat_region complement(58385..59161) /rpt_family="L1" repeat_region complement(59607..59887) /rpt_family="ALU" repeat_region complement(59979..60046) /rpt_family="L1" repeat_region complement(61870..62160) /rpt_family="ALU" repeat_region complement(62446..62494) /rpt_family="L1" repeat_region complement(62565..62857) /rpt_family="ALU" misc_feature 64970..65148 /note="match to human EST H53181 (NID:g993328) yq83f08.r1" gene 65103..72605 /gene="WUGSC:GS039E22.2" CDS join(65103..65148,66969..67065,68568..68713,72439..72605) /gene="WUGSC:GS039E22.2" /note="similar to chicken myeloid protein-1 precursor; 60% similarity to P08940 (PID:g127095)" /codon_start=1 /evidence=not_experimental /db_xref="PID:g2335070" /translation="MFSTKALLLAGLISTALAGPWANICAGKSSNEIRTCDRHGCGQY SAQRSQRPHQGVDILCSAGSTVYAPFTGMIVGQEKPYQNKNAINNGVRISGRGFCVKM FYIKPIKYKGPIKKGEKLGTLLPLQKVYPGIQSHVHIENCDSSDPTAYL" repeat_region 65477..65504 /rpt_family="L1" repeat_region 66438..66725 /rpt_family="ALU" misc_feature 66968..67065 /gene="WUGSC:GS039E22.2" /note="match to human EST H53181 (NID:g993328) yq83f08.r1" misc_feature 68566..68694 /gene="WUGSC:GS039E22.2" /note="match to human EST H53181 (NID:g993328) yq83f08.r1" repeat_region complement(68868..69019) /rpt_family="L1" repeat_region 69424..69451 /rpt_family="L1" repeat_region complement(70815..70874) /rpt_family="L1" repeat_region 70962..71252 /rpt_family="ALU" repeat_region complement(72301..72364) /rpt_family="L1" repeat_region 72683..72819 /rpt_family="L1" repeat_region 73775..73842 /rpt_family="L1" repeat_region complement(76149..76437) /rpt_family="ALU" repeat_region complement(77820..77864) /rpt_family="L1" repeat_region complement(79481..79517) /rpt_family="L1" misc_feature complement(80014..80341) /note="match to human EST Z45085 (NID:g574280)" repeat_region 83779..83817 /rpt_family="L1" repeat_region 85736..85928 /rpt_family="ALU" repeat_region 85960..86088 /rpt_family="ALU" repeat_region 86112..86266 /rpt_family="ALU" repeat_region 86269..86302 /rpt_family="L1" repeat_region 88547..88584 /rpt_family="L1" repeat_region complement(89933..90211) /rpt_family="ALU" repeat_region complement(90265..90556) /rpt_family="ALU" repeat_region complement(91084..91505) /rpt_family="L1" repeat_region complement(91775..92785) /rpt_family="L1" repeat_region complement(93324..93694) /rpt_family="MER" repeat_region complement(95492..95806) /rpt_family="MER" repeat_region complement(95891..96119) /rpt_family="MER" repeat_region complement(96360..96566) /rpt_family="MER" repeat_region 96678..96758 /rpt_family="MER" repeat_region 96870..97502 /rpt_family="MER" repeat_region complement(97277..97569) /rpt_family="MER" repeat_region complement(98097..98139) /rpt_family="L1" repeat_region 102458..102484 /rpt_family="L1" repeat_region 106261..106544 /rpt_family="ALU" repeat_region 109130..109171 /rpt_family="L1" repeat_region 112301..112857 /rpt_family="MER" repeat_region complement(112870..113161) /rpt_family="ALU" repeat_region 113182..113255 /rpt_family="MER" repeat_region 113451..113500 /rpt_family="L1" repeat_region 115252..115561 /rpt_family="ALU" repeat_region complement(116482..116720) /rpt_family="L1" repeat_region 119909..119940 /rpt_family="L1" repeat_region 123014..123303 /rpt_family="ALU" gene 124120..127545 /gene="IL-9" CDS join(124120..124233,124346..124381,124464..124496, 125785..125916,127426..127545) /gene="IL-9" /note="GS039E22.1; match to P15248 (PID:g124362)" /codon_start=1 /product="T-cell growth factor P40 (P40 cytokine)" /db_xref="PID:g2335069" /translation="MLLAMVLTSALLLCSVAGQGCPTLAGILDINFLINKMQEDPASK CHCSANVTSCLCLGIPSDNCTRPCFSERLSQMTNTTMQTRYPLIFSRVKKSVEVLKNN KCPYFSCEQPCNQTTAGNALTFLKSLLEIFQKEKMRGMRGKI" repeat_region complement(126315..126606) /rpt_family="ALU" repeat_region complement(126826..127046) /rpt_family="MIR" repeat_region 128933..128964 /rpt_family="L1" repeat_region 129752..130041 /rpt_family="ALU" repeat_region complement(130202..130352) /rpt_family="L1" misc_feature 131299..131693 /note="match to human EST H05645 (NID:g869197) yl75a10.s1" misc_feature 131299..131632 /note="match to human EST R40604 (NID:g822927) yf72a09.s1" misc_feature 131626..131740 /note="match to human EST R40604 (NID:g822927) yf72a09.s1" misc_feature complement(136449..136763) /note="match to human EST H22496 (NID:g891191) yn69f04.r1" BASE COUNT 43727 a 29465 c 29485 g 41198 t ORIGIN 1 aagcttaaaa aggcttggct cagtgattaa agtacaacac aataagcaac tttaaaatgt 61 gtttatttag atacaaactc catccaagac cccataagtt agaccttggc ctgttttcgt 121 gaagtggtgt gacggcagtg gccttttaaa aatttagttt ctgtaactta ctctggggac 181 ataggttgat cactctaaat cagtatgtta ataaatagag cagcagaaag gaaggctgca 241 gggtcatgag acccaggctt gcatgcctct tctttcttga taactgtttc cagatgtggg 301 tgttggcaca gtatcagctt gatgacaaaa acgaatgaaa gtcctttctc cccagctaac 361 agcaggcggc tactggttcc ggcagaaagc ctgcttgtgt tttcctttca gaggtctcac 421 gtggctgaaa gcatttgcct cttgctgaat gtttacctgg catacgctag gaacagcaaa 481 tgacacaaga acctcagtgc aagatgtctc ttgaagaaga aggtggtaga aaagtgcata 541 catctaataa tcctaaacaa cagggaaaga cacggctgga tttggggtaa aaacatctca 601 tcaccaagga aatgatatgc aggaagtcgt cagctcaggg tcctattctt atcatagatg 661 aaatggttgc ttccgctagt gttctggagg ggagaggtga gagaagattg ttcctgactt 721 cagggtgcct aagaggagtc acatacgcat gacacaaaac agacatatta aaggactcac 781 taaggacagt tgggtctgct ttcagtagag aaaagatgtg cacttaggaa gtggtagttt 841 ctggcatggt tcaactggcc tgtctccatg tgtgtggaat ctcacctttg acctcagacc 901 ttggtcacgt accaggcagg ctgcatgttg cccaaggctt ggagaccagg ctagctgagc 961 tcagacttct cacattctgc ttccatcctc ctttgacttt tgggaaagtc tcctcctgct 1021 gattttcatt tgctggccaa cccgcttgcc acaaaccagg gaaaatcagc tctgaatttt 1081 gaaaactgga aacataaatt catttggcat aaacacagct ttctccagca gacacacagg 1141 aagtaagctt acctatgtat gcgatggtat catggaccac cttagagcct atcataacta 1201 gcgttcagtt ggtgtggatg gtgtaagaat aaatttcatt gctctctagg acacaaggag 1261 cacttaactt atggatttcc gggaaaagtt tccagctagg aatgctctaa aagattacca 1321 aaaatctttg ttttcatgcc tcaaaagaaa tatccgatct cagctggttt tgacttgttg 1381 gtgaccacag cttatttccc agtgtcattt gcacgagtca attttgtgga cttgatcaga 1441 aagccttgaa gggtgtgaga agaggagaac agatggctag caagggccac tttgggacag 1501 caagaaaggc cgagtccatc tactggtgac tggatctgac ctgtccatga tagagccaga 1561 gtctgaatgg acaatcctgc tctctcatcc ctcctgccaa ccctgtgcct ttccttggtc 1621 ttggacagga aatgcagaat gagacaagcc atccattgcc attcatccat cccacaggat 1681 cagagcccca gccccaaggc cactggttat gaatatccct tgcaggtgaa atgctcctat 1741 taatacataa ccctgagttt gggataatgc atgaaaagat atttcagagc atggcttcct 1801 gtagatacat gtcttaaggg ccagtaggtg gattctgtta gcccctttct ggccagaagg 1861 gattctatac agtgctgccc aggcccaggg aatgtatgaa atgagctcca aggaagcttc 1921 ggagaggtct ccttttctag taagttccac acccactaag catttattaa agtgtcagct 1981 gtggatattg catagtcaag caaatacagg gtaaacacaa aagatgtaga aggtccttca 2041 tgtcttccag aagcacacca ttgtctgtga aggccaagga aaggcaagaa atcaagttca 2101 agactggaaa acatacattt ttatagccaa tttactaaaa tattttcagt tctctgaacc 2161 tagcatgctt tttttccacc tccaaacctg tgaacacaga cctttattct tgctgtctgg 2221 cttccatgtg tccatgagat tttaatttag accacacttt ctctgggagg ccttccttga 2281 tgccttcagc ctgggtaggt gccctttctg tgggctttta ctggctcctt ttcttctccc 2341 accatagcat ctatcagtga catgtctgtt ttctctacca gacggcaagt ttgatgaggg 2401 cagggataat gcctgccttc ttcactaccc cattcccggt attcattcat tcattggtta 2461 attaactatg gagtgcctgg catgagccag tcaaatgatg aaagagaaag gtcactgtga 2521 agctgtatta gtcagggaag gcaacttggg ggaggtgcat tttgagatgt tctttgagga 2581 agttgaaaag ggaaaacaat gggaggcatg aaacttgggg gcctcatgct accccctccc 2641 tctccacaag atgtctaggc ttccatggga atcaaatctg ggaaccatta ctaaggggaa 2701 gggcactcac attcatggag ctcttaccac atgccaagat ttgccagacg ccaaccacat 2761 taataatact tgccattcac cctgagttgg ctgcaggcca ggccacttct tgggctgtct 2821 catttgatct tttcaacaac cctgctgaga tggttgtccc catttttaca gatgagagac 2881 taagcctcag aaagagtggg atttggccaa gatggcatag tgactaagcc aagagtcata 2941 gccagggctg tttggctcaa aatcaatgat cttcaaatat tttgctcaca acccaactca 3001 aagaattgac cttttttgca cattttaaag ttgacttata aagatttttt atcatgtgtt 3061 taaataactg taaaagatgt cagagcattg tattttgaat atgagcatta cattatgttt 3121 ttgtcaaaat attggaactt tagattgagt ttatgcagtt gatagaaaat ctctgccata 3181 tctcaactct gtccctcata ccagccttgg atgagttact taactgcttc tatgtcaatg 3241 ttttctgcaa attgagaata agaataatac ctacttacag gactattagg aggtttatat 3301 gagataattt tatgtaaagt acttagaaaa aggactgctc cacacttctg tataaatgtt 3361 tgctattatt ataaactcta attgaacatt gagaggctgc cttttctaat tcccctttgt 3421 tagaagtcct ggcacaatgc cccatggtaa gctggtaaga gggaaatctt gccttcatga 3481 agaagggagg ttgaggaaag ggaagtagga gcatcagcag aggggaggcg catgcttagg 3541 ctgatctgtt ctaggagagg gcacttgtgg aaaatgggga cttggaagtt gattctggta 3601 aaaattatgc atgctcccta aaaggtcttc ctgtagcccc aaggatgtgt gtgttccagt 3661 ttgaagtctc catttgtcct tgaagagcaa caaccaccag tgaatgaggc agccctctct 3721 tagcagggcc caacatccca gggaatagca gatgaaaaaa ggtcaaagta tttttggaat 3781 ttctcctccc tagagaagtg aaaatgcagg caagtgctgt gctgatgtgg catggtcaag 3841 tcctactcaa aaagtggtct atgggccagc atcactaaca ttgcctcgag ctttctagaa 3901 caggcctcag cccattgaat cagaatctgc tacaaccagt ggttctcaac cctggctgca 3961 acttagaata agtaatcatg gggagctttt aaaaaatcct gatgcccagg gccacatccc 4021 agaccactta aaccagaatc tctggtcatg tggtccaggc atcactagtt tttaagtttc 4081 cctggtgact ctgatgtgca gccacaatgg agaaccactt tcatttgtag caatgttaat 4141 acagaacttt aaatgtgggc tccttccctt ctaaataata gccttatctg cagtagctct 4201 tctgtgcatt ctgcccaccc gttcgccgct gcatgctcat tctctcacct gggctactgt 4261 ggcagcctcc agcccttccc acccaaagga agaccttcca caacattgcc ttgacttttc 4321 ttccagactt ttctccagac actcccctcc attccaatcc aacctgatga ttctctcttc 4381 cctaaacatg cctcaaagtt tcccacatgc ccaccacagt ctggtgtctt cttctccatt 4441 tcttgcccaa atttctgctc ctttggggga tcatgttaag atacctcctc cgttggatcc 4501 cctctgcctg gttgtgaatt ccctgtagtg ttttcttccc ttatactttc tactttagat 4561 tcacattgta tgccttcttt ccccacttca ccccatccca ttagcctgtg ggctacaaga 4621 gcagagtagg ggcccacttc tctttattta tcctgaaaga cctgccactt gcctgggcac 4681 acagagctgc acccccgccc tgctccctcc ccaggcagag acaggctcag ggaggcttgc 4741 aaaccatcgc ccctaccgct ggcgattact tggcatacta attgttccat taaattaaag 4801 ctctttaaaa attactaccc taaggggagc gggcaaaacc cacatcaggc aagttgcttt 4861 aaacgttgcc tagaaatgaa aagttatttt tatcatttat agcaatgtta atacagaact 4921 tcaaatgtgg gctccctccc ttctaaataa tagccttact cttcctagaa caagatgggt 4981 tcatcagcac agccggcagc agctgaccac caaacttcaa aggaaatatt ctccaatcgc 5041 tccttccaga tccatctggc accctgagcc tggcctcagc tcaagtccat gataagtgtt 5101 cctttaatgc agagacttgc atttgggctt ggactcctcc tctgttttcc cttcatgcta 5161 tggaaatcat ttagtaattt gccctcataa aaatcaagca gagaaccaga tatttaatgt 5221 ctacagacag tgcacagggc agagacgtca cctcaccatc attgccattt atgtgccttt 5281 cagcagggct gacaaaatgt cttcctctcc aaggttctac aatccaagct gctgttcttt 5341 ctagttgctg gtgaccttca ctttcatttc ttgcccttgc cagacaccta tttctaatat 5401 ggatctgcct tcccattttg tagacaggat tctggtaagt atcagtgcag attgcattca 5461 agactcattt cttctctgag ataagaatct gtaccgatgg ggttgcaggc atatggcagg 5521 atttattaaa gagccagata acttctttca ttattatctt cttagtccca aaccacaggt 5581 ctcaagagaa atgttagaaa gtgttgcttc cctagtacac ataagagaat agatatttag 5641 cagagacaga aatgagtcag cctctctccc tgtgtctcac ctaggcatgt gctgttaagg 5701 ccagtacccc gcagggcccc aggttagagc ggggcaaatt tatattacac atttaaaagg 5761 gcctcttgag tgttaggctt ggaacaggct gcagaggctg ctgggcggct tcccaaggag 5821 agctttaaga acatacagcc atctattaca cttggctgag gacaagccct gcaaagaggg 5881 aggatgggga aaaagatttc tgaaggctgc tgctaccttg gggttttttt ggggtcattt 5941 ttgttttgtt tttatttgtt atggatacag aatagttgtg tgtatttatg gggtacatct 6001 gatgtttgga tacagggcat acaatgtgta atgatcaaat tgggataatt gtagtatccg 6061 ttacctcaag ttttcatcat ttctttgtgt taggaacatt tcaattccac tcatttagtt 6121 atttaaacat acacaataaa tcattgttaa ctacagtctc cctgtcgtgc tactgaacac 6181 tagatcttat tcatgctatc taactgcatt tttgtaccca gtaaccatcc ccactttacc 6241 cccccattcc tgctgctctt ccttacctct ggtaatcatc tgtatctcca tgagatcaat 6301 tttttaaaaa ttttaattct tacatatgag tgagaacatg tgaaaaatgt gtccttctgt 6361 gcctggctta ttccacttaa cgaaaccaga cccttatctc aagacttaaa tctaaacctg 6421 aaactatgaa actactaaaa gaaaacattt gggaaactct tcaggacatt ggtatgggca 6481 aagatttctt gagtaagacc tcgaaagcac aggaaaccta atcaaaaatg gacaaatggg 6541 atcatatcaa gctaaaaagc ctctgcagag caaaagaaaa aataacaagc aaaaggacaa 6601 cccacataat gggaaaaata tttgcaaact acccacctga caagggatta ataactggaa 6661 tatataagaa gctcaaacaa ctcaatagga aaaaaatcaa ataatccaat taaaacatga 6721 ggaaaagatc tgaatagata tttctcaaaa gaaaacatac gaatggccaa caggtttgtg 6781 aaaaaaatgc tcaacatcac taatcattag agagatgcaa atcaaaacta caatgagata 6841 tcatctcacc ccagttaaaa tggctttcat ccaaaagaca ggcaataata acaagtgctg 6901 gtcaggatgt ggagaaagga gaaccctcat acactgttgg tggaaatgta aatgagtaca 6961 accgctatat agaacaatcc tcagaaaact aataatagag ctaccattca atccagcaat 7021 cccactgctg gaaataattc caaaagatag gaaatcagta tattgaagag atatctgcaa 7081 tctcatgttt attgaagcac tactcacaat agccaggata tggaatcaac ccaaatgtcc 7141 ctcagtgggt aaatgaataa agaaaatgtg gtacatatgc agaggaatat tattcagcca 7201 taaaaagaat gaggtcctgt catatgcaac aacatggttg gagctagagg acattaccct 7261 ggggttttgt gatgtgtaca gctggcctca ggtcagagcc agtagaagtt gcagctcttc 7321 tctgattttt gcttaagtgc ctgtatctcc acaaatgttt ccaacttcca ctctactatg 7381 aggtctcact ctacacagga acttcagctc agaggagaaa aaaatagaac cttctagacc 7441 aaacaccaag aaacttttgt ttgctcatgt ctgtacatgc atctcttcat agaataaaac 7501 agttgtggaa tatctaaatt ctgatctgcc tcttgtctca acagatgttt aagaaaatga 7561 caccttaaag ttaactaaga actcagagat tcatattttg agtacaaaat ttttgatgag 7621 atctgcagtt ttgccgttta tagagaaatg catgaattag acacttaggg taagaggaag 7681 tggcttacta aatatcagcg ttaattcttg agattccatt taattgcaca acattctact 7741 tcctcttcct caagtaacat gaacaagcaa ttgtctgaaa tattaaacac ataaaactac 7801 agcaaatatt aacaaattct aaaattgtat atgtgtgttc ttcatatgca aaaaaatatg 7861 gtatcatagg aagaaaatat catatacctg agagagcctt ccccctctcc cctattcctt 7921 agagattcta ggaaaccatt tttcaagttc ttttctgatt ttcaaaacca caggctgaca 7981 acaataagta gttagctttt aaatctttta ctacaaccac ccagattaaa ttgttccctt 8041 attaatctag ggaaaataat taccaggcaa aagatcctct tttgtccatt agccatgatt 8101 tatttcaaat gcgtgctttc agtccacatt caattttttt cacatatgtg aacacttaga 8161 tattgaaatt cttttcctgt cagctgtcaa tgtgcattgc tttttaatcc aaaaaagact 8221 taaaagatta tcagataaaa agtatttcaa agcttaaaac gaatcctcct attgtattcc 8281 caactacctt atatctccta tcgtaaggtt atattaagtg cttttggctg ttttcccaaa 8341 catccagaca tgttttatgt gaagtgtgag tcatggcagc tttgctttca tggttcagga 8401 cagtctcctt gaagcgggaa ctaaatggtg gaatcttaag ttgttggtga tgatggattg 8461 ggaactggac atttatccaa gtgcagaaaa tgaaatagga aaatgtttct tgccagtttt 8521 aaatcaaagg agctaattgc cttactcgag ttgtgtctac taagtagcat gggattgtga 8581 cagagcctgg aaagaacaga agttagagag aagctccctc atgaatgaaa taatttaaac 8641 tctagactaa aatgtgaatt tgttagagtt tcaggaaaaa aaaatttatt cctcttcctc 8701 catttgatgt tgtttctata tggtaaagag ttatgatggc aatttccaca gactgtgaat 8761 ctagttattt cctaaaatgt caatttgaat ccaaatgcaa gttaacatct gtgtgatggg 8821 acaaaaatga tgaatgcatt tgtttctagg gtctttgctc tttgacatcc tatctaagaa 8881 gagtttcatt tctgccagac cagtaatagg agattttgtg tttttaaatc aagatatcag 8941 ggcaaagaag aaccacagag caaggagaaa aacgtgaaga cttctaggta gtcttaattg 9001 atctgatcca tggctgtcca gtagaaatac aacctgagct acatatgtaa ttttatactt 9061 tataaatata aaatcatatt ttataagtag ccatattacg aaagtaaaaa gaaacaggta 9121 caaattaatc ttaaaaatat ttctatggaa gccaatatat ttaaaacatt atcattttaa 9181 catgtaatca atatgaaaaa cattagtcag acattttata tttctttttt tctcatccca 9241 agtcttcaaa ctccagtgtg cattttacac ttatagcaca tctcaaatca aactagccat 9301 atttcaagtg cttaattagc cacatatggc agtggctacc atattagaca gcacaaatct 9361 agataattaa ttcctcggag ggaggcatag aaataaagaa gaaaagccat aagagctgtg 9421 gatttaggaa gagttttggc aagggtaaga actttgaaga gagctggctg gagcagctgt 9481 ggactgtctg gtgaatgtat ggaattccca tggagcatgt acttagatcc cttatataac 9541 tttcttttct tttttttttt tttttttttt tttttttttt tttttttttg agacagagtc 9601 tctctgtctc ccagtctgga gtgcaacaga gagatcttgg ctcactgcaa cctccatctc 9661 ccaggttcaa gagattctta tgtctcagcc ttccgagtag ctgaggttac aagcacacgc 9721 caccgtgccc agctaatttt tttttttttt tggagatgga gtctcgctct gtctcccagg 9781 ctgaagtgca gtggcatgat ctcggctcac tgcaacctcc gcctcctgga ttcaagtgat 9841 tcttctgctt cagcctcccg agtagctggg actacaggtg cacgccacca tgccaggcta 9901 atttttgcat ttttagtaga gatggggttt cgccatgttg gccaggctgg tctcaaactc 9961 ctggcctcaa gtaatctgcc tgcctcagcc tcccaaagtg ctgggattac aggcatgagc 10021 caccacaccc agccatataa ttttcaagga gcaaaatact tgagttattt tgaaattaat 10081 tagggtggat ggactgtgtt tctttgaata gcacctcctt gaaactggac cacttggtgc 10141 tggctgtata agggttatat atccaatcat ggatgaagga gaggtaaata aaaatgcagg 10201 gtagaaagat gctgatggtc gtgggccatt ccccatgtgg gagcggtggg cagaggcctg 10261 acctctatta ttatctggat ctgcacaagg aacctctgtc cttgatttgg catgaatgac 10321 aaccagccca agaagacatg aagccagcac atcaatcata tgcaagagag gacatcaatt 10381 tgaagggccc attggtgctt tgaagcaacg atctggaaac tcttggagga aacaggcaca 10441 gcccacatgg tatagatcct gacctgtgga gcagtgtggg aggttacctg gctgcagcag 10501 caggaagcac acaggataaa ggactccggc caccacctcc accaccctct ggtgccccat 10561 aatcagaaca tgaatgaaca ggattgactg gcacttgggc cttcacacac agtagaagcc 10621 caagggcgag tatagcctag agctctgaac tcagatcaga tggaagtaag aggattccaa 10681 atcagcttgt cagcaccatt cttgaggccc aatctcatgg gatcctctga aagaaataca 10741 ctgcctcgtg agagtgatct attcaccaaa ccatctttct ggaactgggg cctggcccca 10801 gggtccctcg ataatcccat ggatgtgatt cctaaaattc ctcagaatat ctggttgacc 10861 cagcataggt gaaagagagt cagatgtccc actggggaga catgaggacc agggcccata 10921 tgagtcctgg tctaggtttg gggcaccctg cctcgctgtc aatgatggct ccagctggtc 10981 ccaggcatcc aaggggctta gaaaatttca aggaagttac cccaaagcac agggtcccca 11041 gaggcatgga gcccaggaca gagatgccac ttcctgtgtc taaagatatt tgggaggaag 11101 ccctcctaaa agagaataac atctacagaa acagagaagc agagtgggaa agaccaagtg 11161 acagagacaa tcctgccaac actgagccct ccatgccagt tgtgccctgg gaacgaccag 11221 ttttgatagc caacaaattc tttttctgac tttagctagt ttgtattatg cctctgtcac 11281 ttgcaactaa aagagccctg gccaatgaag aacaattcaa cgctttcaag accaaatttg 11341 tgtcttctat cctcatggga gccactctgt tcctccctgc cccacagcca aggaagtacc 11401 tctccccatg tgtccaaaca tgcaccagat ggtgaagcag gcgaggttcc tctgcctcct 11461 ggaaaacgac aagagtggat gtattaggta catcctgtac atgaaggatg ggaccctggt 11521 tttaccaaaa ggtcttttca aaatgcctcc atgaacagat ttgtcccacc ctcctctgcc 11581 cagctcaaag taccttagtg gttccccaat gcagtcagag taacccgtgt gatctggccc 11641 tagcctacac ttcagccctc cactccccag tccccactct ccactcccac tctgtgctct 11701 tcagccactc tggattcctg tttctttaac aagaaagttt agttccacct ctgggccttt 11761 gcactagcta tctcctccat ctgcagtgct ctctcccctt ttcattattc agatctcact 11821 ttaaagcaga cagaaaggtc tttcctgatc ccgatcactc tctattatct taagtctctt 11881 aatgactgct actatttgat ttgttttgtt aggtgtttat tgtctgtctc cccctacatt 11941 taaactcttg tgggtaggaa tctcactggt gttgttcact gctatatccc cagtatctag 12001 acgaatgcct gacacagtaa atgctcaata aatatttatc aattatgtaa actgaatgca 12061 cacaggctac gtttgttgac ttgtcttctc aggaatatca tcctttcagc tctttagcaa 12121 agccttgggc tgacttattt atttggttca acaaatcact gctgagcatc cactctgctt 12181 aggttctgtc caagatgata atgagccctg gttcttaacc tttggaagtt cagacccagg 12241 gaaagataga cactaggcaa atgccagaca atgtggaaaa tgctatggtc aggctgtgca 12301 ctagcactgg tactggggaa acaaagaaga atgtgatctt tttgttgggt acaagagttc 12361 aggcagtgaa gtcccagaaa gccacagaga aagatccttt gggacctgta cacgttcccc 12421 tcttgcaaca cctcacagtt gggaattatg aaagcagttc tccctgccct tctttctccc 12481 taaagtattg gtatatgtga gtgggataag tgtgagcaat ttgtcctgaa aggagaggga 12541 gctggaacaa ccagtacttg ggagaacttg tgattatcct gtttgcgtga caacagccaa 12601 ttaaattatt tacgcctctc aagtgcattt tgcatcaagt agaatatctg gtcttggcgc 12661 tattttataa tctgctaagt tagctaactg cccctcctgt cctgggttgc ggcaaggaga 12721 tgtcttggga cagagtagag ggtatcgggg ggctcctgcc tttagagaac tcgggacgac 12781 aggcagccag ccgtgcccat gtgtggaggg acagagtaca ggagtccctg aggtcccagg 12841 tcttcctttt attgccccaa gggggttcac acctaagacc agccgtgcca aacagcaggg 12901 tgccactctg ctctttacca tgaggctgac tcacctctac atgtcagaca gtagagaatg 12961 tggaattctg aacagaagag aaaaaaatcc acttggaggc tccagactag gcctcaaggt 13021 ctctggaact caaggagtct ctgcacaccc tgatgcatca ggagttactc acttgtgcct 13081 tagattcctt gtctaaactt ggcctaacaa agggatcatg gctcattgtc agaatgaagt 13141 aacgtccagc tccttatggc taaattatga atcagtcata agagggataa ttgacttctt 13201 ttaaaaggaa gcctgaaatc cattttctgc tatattaggt gcacacatac acagaggcaa 13261 gatgacttga gtagatcaaa tgtttgtaaa gtgaaaagtt atacaatgta tgggccagct 13321 cctctgagga aatcgaacag agacaagctg gagaagtggg aatggggcat aagagacagc 13381 agccctctgg gggcacacga gattctccct ggcacaaagg cgggacactg ggaagtatct 13441 gctgtttctg ttcctttggc ccctggagct gaccattcag ctcaaggtca gccagtccct 13501 catttctgca gtgttcccac tgaatcacag cagcttagaa agatgatgaa tgttggagct 13561 ggaggcacac tggaaactag accaatcccc tcatttgtaa acaaggcaag agagttccag 13621 aagggagctc tgcctcatct gaggttacac agcaagtccg tattggagtc acgaaagggg 13681 agccggtgtt cctgattccc ggcccaccac gctcccaggg caccctcgtg atacaccgtc 13741 tggtattgtt agttattttc tatgtgtgtc tgtcttagcg ccctaatgag actactccct 13801 cggttgctta tagcatttta ttctttgtga tcccccgtgg ccccttagtt agaattttcc 13861 tcaagacttt agtttttaaa tgattcttgg ttaatagcct cctaccaagc tgctcccaac 13921 ttctcctgaa aatatccaag atgatgttga aaaaagtaaa aacgttattc tgtgagttaa 13981 gatcaagatt caacctaagg ccaggcaaga agctggattg aaaatctcat ttcagggcca 14041 cccacactgt gtgccaggga acctgcccct ataaccagga acacagggct gtgccatgca 14101 aaactcgggt gctttcccgg ggggccccct ggctgcacgg gaccccctcc tcatggcctt 14161 ggagatgcac agggaggaac tatgggttcc acagtagcca ggaggagagt cccatgggca 14221 cggcactgca gccacagccc acgatggcaa cgttggaggg aggcatgcca gggcactgcc 14281 ctttctacac attccatttc tgtatcatgt caaggagcta catggtgggg tagaagaaga 14341 tggatttcag tttaacagac gttcagtaaa caaaaacgtg tggtcactaa gaacagcact 14401 agaagctgca caccacaatc ggctgctcaa ctgtttattt aacaaggctt ttgatagaca 14461 cccaagatgt caagggcagt gacaagtaca aacccctaaa acttctcttt tataaatcag 14521 tgggtctaga tagacctttc actatttgag gaggggtgga ggggtgggga ctggggggag 14581 aaaatcatga aacttctgaa aaagtactca ggtttgcaat taaaaatgtt tttaatgcaa 14641 tatctgaaaa aaaaaattaa gtcaagaagg agcagagctg acaccataga aaagtgaatc 14701 agtaatgtgg agggcaagaa tgagatcaaa ggagaagagc aaaatgatga aaatgatgag 14761 gaaggacagg atgattcggt cctccttctc tgtcctgcta tatccaagaa ttacagaact 14821 tctgaagaaa gaaacccact caagtgggaa gaaagcaata atcaaagata tcattagaga 14881 aaacagccct tagtatgcaa atcgaaatgg cccatagcat tctaaaaaaa ataaatttaa 14941 agggggctgt ctagaaaaga gttcaccgag caggcctgaa gactgcaata cttagaaagg 15001 cctgctacta aacacctgct ctcttggaat tttagttcat gctagacaga aggacctaca 15061 tgatcatgat cagaccccag taaaaaaaac ttgtgtaatg ggtctctgat gggcttccct 15121 agacagatac atcatgcata cgttgtttca tttttctcac tggtgaagag taagctgtgc 15181 gtgactccta atgggtgaga gagagcatga ggaaacctgc acatgaattc ctccagactg 15241 cctgtgcctt ttcccttcat gagccactca tatatcctta caacactgct gtaataaatc 15301 ttggctatga gtacacttta agctgagtac actttaagct gagttttgtg agtcatttta 15361 gtgggtttcc aaacagcgat ggacctgtgg cctaggggat ccctgacaga gaggcctcac 15421 tgtgtcacag gggctaaaac acattctagc aacatatttg aattataaga ctaatttgtt 15481 ttcatgtcat gagcattgag ggaaaaaaaa tacctctttc aaagtttaca aaattggttg 15541 gcttcagact tgtcctttgc caccctgtat agagcaggct ttagaaagtt ttgaaggaag 15601 aaagttagga tccagtaatt ttgcagcctt atgaaggcaa cagacagaca ttcttagaca 15661 ttcaaggact tgaaatagac ttatactagt gactgaggta ataaacttga agaacacagg 15721 ccgggcgcag tggctcacgc ctgtaatccc agcactttgg gaggctgagg caggtggatc 15781 accaggtcac aagatcgaga ccatcctggc taacaccgtg aaaccccatc tctactaaaa 15841 atacaaaaaa aaattagcct ggcctggtgg cgggcacctg tagtcccagc tacttgggac 15901 gctgaggcag gagaatggcg tgaacccggg aggcagagct tgcagtgagc cgagatcacg 15961 ccactgtgct ccagcctggg caacagagcg agactctgtc tcaaaaaaac aaaaacgaaa 16021 acaaaaaaca acaacaacaa aaagggaatt ctggatatgc agctcttctt ttatggagca 16081 tagatcccaa attcaaaagt ccagttggac tgtgttatca gagaattctg agattttatg 16141 cttagccctt tgctatttcc ctaagctgtc cttggcttct gaaattttat aagaagacca 16201 agaaattaat ttcttatgcc atatctttca ccctacaatt taggaagtat cacctgcaat 16261 gacggatggc caccaaagct attacgcagt gtaggtcttt cttaatgatt gttaaacata 16321 aaggatgatg agacctatgg gtctcaattt gtaggagtac ctgtgttgag gattgtggca 16381 ttgaggatta ctgtattgag caagattctg agccccttct gagctctgcg ggagagaatg 16441 caggggtcag tgagtgacac ccaacctggg gaaaaaggga gcagggtggt acattcattt 16501 cttacctgct attcttgatg tatatattca gaagattgtg gtttaaaaat taataaacaa 16561 aaatgaccaa aaatgaatat aaatactttc tgtaattgtt gtgctagaaa ataaaataac 16621 cgcaacttca attttaaaca ataggacaat atctgtattt aatgctaagg tttacttaat 16681 attttaggta agacagaaaa aaaactttaa aaaataaaat actgtaggaa gaaagtacca 16741 agaaaaagat tgatagatta gctgcatagc atttttaaac ctctatttca caaaaataca 16801 aaacatcata agcaagataa gaaggcagat aacaaaccga gagacatgtt tgcaacacag 16861 gtggcaaagg atcaacatcc ttaatgtaaa aggcttttgc aaatcaacgt aaaaataaat 16921 tacctaataa attacattgt gtgaagagca tgaacattga gtttacagaa ggttaagtta 16981 aaattgacaa tgaatgaaaa agtttcctat ttggatcact aaaaaaaatg aaaggtataa 17041 aaaggaatca tgcatttaaa aaaatcttct ttgctatttt cgtaaatgaa agattataat 17101 gcttagtgtt gcaaagagta aaacttttct taaaggcaat ttccctcatc tttcaaaact 17161 tcttaaagta tcatgcatgt ctattagctc aaattccact actggcaaat caatggatat 17221 agccttaagg gtatctgtat tgaatgatca tctcagcatt ataatgacaa catttggaac 17281 aacccaaagt caggggattg gctaaacagg ttcagctaaa tctatgtaat gagatatacg 17341 cagctgttaa aatcatgcgt acaggactat tggttgaata gtccccatct actgcttagt 17401 gaaaaacagg acacaaactg ccttgcattt catgtatagt gtgtacggtg gaggtagtta 17461 tgagctgtga aatttggagt tacccaacct gcaactgagt cctgagtcct ggtcccacca 17521 caccttcctg ggtgatcttg agcggagctc ctgcccctct taactgcagt gtcttcattt 17581 gtaaaaatgc tgttaatgaa attatttact catagggttg tactatatgc aacctgaaat 17641 tatcccctgc gaagtactta gaatactgcc taactcagca gcgtgcatta ggtcccgata 17701 ttttttttta atgatgctaa aaaataatca agaaaaatag accactgaca gaacacatca 17761 aaaaaaaatc ttgataatgg ggctgttact ttttgcttct tttcattctt ctaggttttc 17821 tgcattaaac gtagaataca tttacactgt tcataacaca cacacaggca gacacacaca 17881 gacacacaca cagtaaatat cgactttact aggaggttac tatgagtctt gggaaaacag 17941 ctgaaatcca tctttctaga aagaaaagga agaagaaacg gccaaatgag ggggaaacca 18001 cccccacact ctaggaaaca caagctcccc agcctcttgg ccttcaaggc agaggagggc 18061 cactccccag cctgccagcc agcaggccat ggggtcaccc tgtggggcca caccccatgc 18121 aggctgcata ctgccccatt gtctaagcat ggccccactg aagtcaacga gaaacagatg 18181 gtgagaaaca tttctgctct tccacaccta ggatgcggtt ctgctctgag catccagcac 18241 cagtgaggca atctgctttt catttatggg cagagaaaag ccacagagga aggaatgagg 18301 gaggccatag ttctcagagt gtcggaggag ttgatgcaat aataattcaa ttccccctac 18361 tctgaaaagc aaggcagtcc aacccggcag gatgcctgca gccagggctg taaactccag 18421 ctcacctcag gctacctcca gctgagcaag cccacaccag aacctgtggg cctgaactct 18481 tgcaggggaa gagttcaaac agggcctttt ccagtttgca tcatgtggca gagccctgta 18541 gccaatcagc tggaagcgag ttccaggtag gtgggatggg aaggatctgg actggggggc 18601 ctcctcacaa gggaatcatc atggacactg gcaactaggg ggtagctcct ctggccagtg 18661 tagttctttc ttccctttgc ctgggtagaa gcatggactt tgaagccagc tgtctgggac 18721 cctatcccag ttctctgctt acttgctgtg taccttaggt gaactgcttt aactttctga 18781 gcccctgctt cctcatccga attgtgggca tgagactagt gttacccaac aggaaatatg 18841 aagactgcag gagataatcc ataggaagtg tctacgctgt gcttgggtca tggaaagtgc 18901 tcattaactg ttagttatta ttattattat aattgccaga ttctctctgt gtcttatttc 18961 ctctttaaac catggcataa tttacactat gacacgctct tgtgcataat ttcttctcat 19021 agtcctaacc agttgtagtc tctgcttgag ggtgatctat gcctgtgtac cctctatcgg 19081 tccccctggc agtcaccatc cctctggaca aaaatttgcg tggtctttgg aatgagtctg 19141 ggtaattgag tggcagccaa cctctgccag catcctccct catcgggaga ctgtggagca 19201 ggggcaatgg tctcctgagc caaagaacag cttgcagatg gccccagtga gaagtgccac 19261 tctgaagccc taaaagatga taagggagag taggagaagc aagcccagag aagaaagaaa 19321 gtggagaatg gtccccagag gaagggaggc aaacagggag caacagttac acaggtgact 19381 gcaatgtgag aagcatccaa gaacggacag agtttcaaaa tgctcaggaa aaagcaaggt 19441 gcagtcttct ggatgtggtg gcagaaaaga ccccaagaga aagacatcat caacataaaa 19501 gcctggaaaa aatagggatc ataaagatgc ctgtgtgact gacaaacaaa ttagaggtgt 19561 aggaaaagaa aagagcggga atggcaggtt gcaggtaatt gagatttttc agaacaagat 19621 gacaagtaaa ctacatcaaa gcatgtccaa gtcaggtttg actggcagtt ttccctgaat 19681 gtgctaatta ctcttatttg ttctcaaaca aaaattattt atttatcatg agggctgagg 19741 gcccagggtg tgggagagca tcttctctca gcaagcaggt gagtagctca gtttgaatga 19801 cggaagatac atgactatag agcaacacag tcacttttca tcaagattga accctatggc 19861 tatatctttg agaagccact gcatcaaaga cctggctcag ataagagttt ccggagaaaa 19921 agagaaggtg gcagccagaa taactgtgat actgtccttt ttgtaccagc caggataggc 19981 taggtgatgc tgcagtaaca aacaatccta aatcttaggg gctttaaaca acaaagattt 20041 atttctgtca tctgtgccac aagtccattt cgggttggct ggagttctgt tctgcatccc 20101 ttcagggacc caggataatg aagcagccac cgtctcaaac tgctagtcac tatagcagag 20161 gacaagggag ttgagacaac acacacactc acacacacac acacacacac acacacacac 20221 acacacgggc tcttaaaact tcagctcaga atgatagaca tcatccctgc tcagatttca 20281 ccagccacag tcaggcacat gcctatactg aattgcaaag gggatgggga agtaaaatcc 20341 cacttccgct gctcagaaga agaaacagaa tgtatgtgaa cagccctaat gactcaccaa 20401 acatccattc caggagcagg cctccctgcc cgttactcag ctgcccctcc cttggcctgc 20461 gaccccttca gtggttctca gtcatctctg cccctgtccc catcactatc ttcaaggcac 20521 attttggatg agatgattgg gcatagtgct tccaaatgga ctctaatcat aaaatgaagg 20581 agaacacaga ctcagagtat tggtgcagag cagaatctct gagaggacct agcccacagc 20641 ccctgccttt ctagatggag aaataggaag tcaaaggaaa aaccatgtgc aatgcgcaga 20701 attctaagat gggtcctaag attcctggtc cctgctgtgc aggcatcatc tcctcccttt 20761 gagtgtaagt gggacccgta aataagactg gctgtcactc cagtgacatg ttactttgta 20821 tggcaaaagg gactttgcag atgtaattca ggcactctcc tgctggcctg aaggaaagta 20881 aacagccatg ctgtgaactt cccgtggagg ggacctgcac ggccctctag gacctgagcg 20941 cagcccctag attggctaga aaatgggacc tcagccctac aactgcagga actgaattct 21001 gattaccggg aaccctgagc tccaaatgaa aacatagccc tggttaacac tttgatttca 21061 gcctgatgag attgtgagct gagaacccat ctcctctatg tccagactcc tgacccatgg 21121 aaactgtgag attaaacaaa aaaagccact gtattgtttt aagccactaa gttgaaggta 21181 atttgttacc cagcaataaa aacaagtatg tgaaaatggg agactgtgta gcagggccat 21241 ggtctcctga gccgaagaac agcttgcaga tggccccagt gagaagtgcc actctgaagc 21301 cctaaaagat gataagggag agtacgagag acaaacacct ctaatttgtt tgtcagtcac 21361 actggcatct ttatgatccc tattttttcc aggcttttat gttgatgatg tctttctctt 21421 ggggtttttc tgccgccaca tccagaagac cgcaccttgc tttttcctga gcattttgaa 21481 acactgtcta ttcttggatg gcttcttaca ttgcagtcac ctgcgtagct gttgctccct 21541 gtttgcctcc cttcctctgg ggaccattct ccactttctt tctcctctgg gcttgcttct 21601 cctactctcc cttgtcatct tttagggctt cagagtggca ctcatcatgg cagtggcagg 21661 tcctgcaatt tgagcagtat tcaaggcctt ttcgtgattt ctgtcccact tgctgaacac 21721 aaactctcta ttccaggtct ggcggcatct ttgcagcctc tcacctgcca ggattgctcc 21781 ccctgtggtc ttcctctgcc tgcttcccag ggcagggctc catccagcgt catttgatgg 21841 agctttcaca tgccattccc caataaaata aaacgccctg aagtgacccc ggccccctgt 21901 actcttccat tctctgccgg ttccatagca tccagcacaa ctacaagcct ccctgcagtt 21961 gattatatcc caatcatatc caataataaa attccacgag gattaaatca tttaaattgt 22021 ctctttttgc caagacttta agctctctaa gggcaacaga cgtgtttact ccaggtgcta 22081 catcaggaat cagaaaaggc atttgtactt gtaatacgcc aaactggaaa cagcccaaat 22141 gtctatcact gagtgaagag agaaacaaat tatcttattt ccatacaatg agatactgct 22201 cagctaaaaa aaagaacaaa ttatccataa catgctataa catgggtgaa tctcaaaatc 22261 gtaggctgag tgtaagatat aaatgatatc tactgtatga taatttacat aaaaatctag 22321 acaaagcaaa actcatctat attgacagga agcaggtggg tgtttgctca tggctggggt 22381 tggggggata atggcaaacg ttgcattctg gggtgatgga gatgttctgt gtcttgattg 22441 tcgtggcggc tatgtggatg tacacgtttg tcacaactca tcagattgta tacttaaaat 22501 atatacattt tattacacat aaattatgcc tcaataaagt tgacttggaa aaaaatcttt 22561 gtatattctg agtcagctaa agctttgttg gaaatgcttt gagacatgat ttctcatgtc 22621 tcatgatttc tcaacatgat ttgagaaggg tgagcccaag aaaacatcta cagacttctg 22681 tttaggtaga gccatctccc agagacagat gagctcccag gctgagagca aagatatgcc 22741 tcaaaaggta agttaaccga cttaaagaga gattgcttgg ccagaaaaac ccagaaaaag 22801 aacagagagg atggcaaaaa ggagccatgt attcaggcgg aaaggcctag attggggggt 22861 tcaaggatag tctggctgtg agctctcccc atgctgtgtg gtggggtgga gctctttcag 22921 cggtctcacc atggcccaga gaaagagctt ttgggaccag ccacactgct ttgctctgaa 22981 tccacagcca ctccactggc caccagcctg attaacttct agggaagccg ggtcctggga 23041 agaaggcatg aggtttctaa caaatatgaa ggcatcatca tctggatgaa gtgtcagagg 23101 accatagtcc aggcaatgct gcctgctgtg gcttttgatt agtcaggcat gacccattcc 23161 tcccactgtc actcaagaca ctgcccggag agacaggcac agccagggca gcagccagaa 23221 tgctgaggat tttgtgggtt acctaaagct gtggagaagc cagcagggct ccagacctgg 23281 agagagcccc ataaaaagaa aacattccct tctttggtag aaaagaaatg atacccgagc 23341 agctagcaac cctgactgtg cagaggcaaa aaccaagttc tgaggtagga aaagaaagtg 23401 tttgtgtgaa aagcccgggg gcttggaaga gcctcggaaa tgaaacagga aaggagggaa 23461 gagaacagga aagctcagct ggctgctgtg gggctcagtg gcgaggcatg agcagcagca 23521 gaggctgaaa ggggcagaga gggtgctcac aaggaaaggc aaggggagga gagctctgca 23581 gtcttaatgg gggcatgggg gttgtgtgta caatggctgc agggtggacc ctcctcttga 23641 ccaaacccca agtgccctcc gtccccctga attccaagca tgattatcag cactactttc 23701 cactggagtc acccagccca acgcttgtgc gcactgccct gactccagca ctccataacc 23761 cagccgtcta gactctgcag tcttctggcc ctaagtcagg ggttgtccac tcctctctgg 23821 ggagatgact agaacctctt aattagcttc tctccctccc cacttgttaa accttatcag 23881 caggtactcc atcatggaga aggaaaccag tgcatgttaa gcatttatta tcctaggcat 23941 tttaattata attctcataa cagccagatg aagaaggcag cactatctcc atcttatgaa 24001 gaacccgagg ctcacaaggg tttagtgatt tccccaaatt caaacagcta ggaaggggcc 24061 agccgaacca ggatctcaac tttctttcat ccactgctcc cacgtggatc ttaaatacaa 24121 atccattcaa ttcatttccc tggttaaaat ccttcagtgt cttcccaccg cttacagaca 24181 tgtttctttc aatatttttt gtccttatag aactactttc taaaaaaaaa aacacttgtt 24241 ttactttatt tgtgctaata taacaaaaca ccacaagcta ggtaatttat aaacaataga 24301 aatgtatttc tcacagctct ggaggctggg aagtccaaga tcaaggtgct ggtaggtttg 24361 gtatctggtg agggctgctc cccactccca agatggtgcc ttgagcactg catcttccag 24421 aggcgaggta tgctatcttt gcatggcaga aggcggaagg gcaaaagggg caaactcctt 24481 ctgccaagcc cttttataag ggtatctaat cttattcatg aaggcagagc cctcgtgact 24541 taatcacctc ccaaaggcat acctcccaat actgttgcat tggggattat gtttcaacat 24601 agattttgga aggaacaaaa acattcaaac cacagcaaat ctttacatga aatgctaccc 24661 tatcacacag aaaacagcag agctgaccca gtgtgagtgg gggtaaggca cagcatcctg 24721 tctctacttt catagttgac atgtccactt gaagtttcag caccatgtca gtttgtttaa 24781 ggttgaactc ttgatctaac tcccagtatc tcccagagtt ttctatctta aagaatggtt 24841 ggaccatcca ttccagagcc tgtgagccac ctctctcacc atcttcaccc tcatcctcct 24901 taagtacatt caattttgcc ttctacatct cccttgaaga agactttgtc cttcatttgc 24961 actttcatca cccaagcatc caaatccaag ctcttacagc ctcttaccta gaataggggc 25021 acagcctcct tacttgcacc acagagccct ggcagtccac ccccaacaca gtactcagac 25081 tgtgctgaca gaaaattccc agcaaaagga gagggccact ctgacatgga atgtcctgac 25141 atccaagtgt cattatcatt tagcaacctc ccccatcaaa tacgacagta tttcctctta 25201 cttattattt tattaaaaaa tgttataact ttgagatgaa actatgcact aaaaagtaag 25261 ctaagggtca aaagaattga gggctggttg tttgagttgc tgtggggcca ctgagtttcc 25321 ccagctatag agcagagtta gttactagaa tctgcttctt ctctcttgcc cacagttgtt 25381 atggaaataa cttatgtaaa atgttcagaa ttaattacaa tgaacatata caactaaagc 25441 actcttatgt ctcactgaca tgaaataata aatttggata tatggacatc tggttgacaa 25501 tgtgacattc caagatctaa aacaaatttg atgatgtaat taaataccaa tctttctact 25561 tttacttgta tataggacaa gatccctcgc agaacctatt tatatgtata cttggagctg 25621 gcctcatggt cacataaatc ataggaaaat ttggagcaga tgttgcaggg tagctggaat 25681 tctcctgtta ggactgcgtt ttaatggaaa aatgacacag agcagacttc ccacatgctt 25741 cacagataga gtaatagagc tacttgaaag gaacatttat attctagata ggctgatttt 25801 taatcaacaa acgtgattgc tttattattg tctggtctcg gatgagtcat tcagcagagt 25861 gcacaggcat ttccggacag gaggggaatg ggccacaagg gagaaagctt tacagtacct 25921 tgtctgtctg tctgtctgtc agtgtggcct tgagctcttt ctcctcttgc tggtagtaga 25981 gatcttccct ggggcctttg acagtggcct cccatgggct tgcagcctca gaaactttta 26041 tttgcgagga aaggggtaaa tgtgagctaa acctgaaatc ccacgaatta cagaaaagaa 26101 cttgttggca gcacacacgc aaagattttc gaaatcctct gtttcgctgg tgtttatgca 26161 tttgagcctg tttcatctga agtcattgtt ttgagccttg tttggaagaa ttttccagac 26221 atgattttct agggaggaag aatgtcattc agaagacaaa ggcctgtgcc tctgagataa 26281 agtaggtgat gttctgcttc agatggcctc tggctccctg gcttttatcc tccctttcat 26341 gttaccaagt agagcttcaa gagccgtagt ccagattttc ttctgcttct tcttttgtaa 26401 cttggaaaat gaagacgtag aactagagtg ttcattgtgg gaggtctagt ggataaatgg 26461 atgagaaatg acttaagcca aacaatgact caattagatt ttctgacata gtatcagttt 26521 gtttaaccct gctgatccaa tgaggttagg tgttctgaat aactgagaag ggtgttctga 26581 catagaatca ggttgtttaa ccctgctggt ctcagtgagg ttaggtgttc tgaataactg 26641 aaaacacctc cttagtcggg cagccatctt tgatttcagg ggaggtttgg ggtgggtgct 26701 ccgaagaaca gggactgtac caagtgcccc ctgagtgtgt gtggggcttg aggaaggggg 26761 agacacagaa gacacgcttc tcttcctgct ctgagcccaa gggtgcagaa ggccccgcca 26821 ttggtgttca tcaggagaaa ggcagcggca gccccaactc ctggcttaca ctgcctcaga 26881 gtagcagggg ttcagtgtgg cggagtgaat gataagatga agaagaatga caagcatgcc 26941 ttgaatcgga caatcacttt tctcgttgag gtaattacaa gaatcatgaa aagcaggtta 27001 gatattatca actagtttta tacagagaga cttgtgcatg attcaaaaga gtcagtgcca 27061 aagctagacc atgttgcctt tgtcatttag atggatcgct tactgcattt accattggaa 27121 gctggttggc ctacatgctt ctgcatccta ggaagctccc acacttagat cctctgaggt 27181 atcttcatag ctttgcagta catgattcct agatagacaa gtttggagca accgataacc 27241 cctatgtcaa agctgaattt gtttagatat attttatgta catcaaaatg gaatttacaa 27301 tacatgtcaa aaccagaggg aaaaccagtg gctgtgatct gaatttgttt ccccaatgtg 27361 atagtactag tagatggggc ctttgggagg tgattaggtc atgaaggtgg agcccttgtg 27421 aatgagaaaa gaggcccaag gaaacttgct tgccccttcc accacatgag gacacagcaa 27481 gaaggctcca ttctatgaac aaggaggtga gccctcacca gacagtgaat ctgccagcac 27541 ctggatcttg aactccctag cctccagaac tgtgagaaat aaaatttctg ttatttgaaa 27601 gtcaccagtt tatggcttat ggcattttgt tatagcagca caaacagact aagataccag 27661 tccttctgtt ttctgccaga aaattgcacc acactattgc ttttaatctt ttttcttggg 27721 ccaagtttag acaagaggca tcaactctag gtaattctga atgcagcaag acatacagag 27781 acacctgctg ctggaaacac tgggaacttc actgaagagc aggagggcca aggactcaac 27841 tctttataaa tactaattgc attaagtaag catttttgga tttcacatgt ttatagctta 27901 aaatacttgc agaacttgca agaaaaacaa caataaaaat gaaatatttg atgaactaga 27961 tattaccaag gggcacctta ctgcagggaa acggtgtgta gacttgttct aaatgaaatt 28021 tgacgttgtc atgagttatg gatacacctc aaaatcatgt gagagtgtga taattatccc 28081 ggtctgagaa agcctgcttc actattgcca taaagttcaa ggcacaaagt gcatgagtat 28141 tatttttgcc ttgcagtcaa ccagcggaaa atactggttt tgaacttgga gctaccaaag 28201 tgtacctcta agttactttt tgattatctc agcataaatt ttcttcacat gtttgctaaa 28261 cagggcaata ttgccatgga taccttccac catcccatag cttccctgac acattctggc 28321 acttactctc aaagctacca cgtatgacac tgattgaaca catcagcatg tcatttgttc 28381 tcagcgcgga tgggaaatcc ggagaaaagg ctgcctaaaa acataatgca gagagaggaa 28441 acataaagtg gagaagcctg ggaggggtgg agacacaatc ggccttagaa aacagctggg 28501 gggtactgaa ctctttccta gctccgggag cctccatatt caggctagaa aaggtgaggt 28561 gatacaacgt ggggcagagg gcagagaggg aggagactgt ggcacacaca gaaggtgggg 28621 gcagctctcc ttgcacctgg actggcacta agagccagca gcctcgtcac cccaaggcgc 28681 cactttcagg gggctgctga gcagcctttg gtgtcttcac acaccggtgt tgtgctgaat 28741 gacaatatct tctctgcgct cactttggag gcctctttgg gaactgacaa ttctgaattg 28801 tgcgtttggc tgctgagaga ggctgtcagg actccagata tctgtctgat gctccctgcc 28861 cgactgtgac ggagagacta caagggagtg cagcacaggg gcgacaagtg ctcctgggga 28921 tttgctgcct gcatggcctc cctaaagccc ctgccatctg cccactgctc tccacttccg 28981 ctgccaggct gagcccagcc gcggattccc aaacgcccct catggccctt cctcggccct 29041 gattcaccag tctcccttct cttctcctca ctgcggctgc agtggcttca ctccgcccat 29101 acgccttcag tgggttcccc ttttattcta agacagaatt cagcattttt cctacgctgt 29161 gtgcagtcct acggggtggg cttctctgct cttccccacc cctggcccca cctccggctc 29221 ctctgacgcc acattttctg ctctgtctca gtccctcggg gcacactggc tgtcttcttg 29281 cagggccgtg gtgcctgctg cgccctgtgc ctggactgcc tggctctcct ttcccctggc 29341 gctgctccag caccctcagc acctccagca cccccagcac ctccagcacc tccagcaccc 29401 tcagcacctc cagcgcccca gcccgtcttt cagatcttgg ctcaggcatc acttcttcag 29461 agaagccttc cctgaccctt ggcctaggcc aggtcccctc attatgcgct aagatagaac 29521 caccttcttt ttctttatgg tacttcccca agttcctaac ggagacatta agcacataat 29581 cacactgctg agtccagggc tcttggcctg gaacctaagc tccacaaggg aagggcaggt 29641 gctgctgttg agtaaccact taatcctcag ctcctagcac agggcctggc acagagcagg 29701 ggcttaataa atattcgcat tttctcaact atgccctcta acttagttag ataccaactt 29761 ctcaacctcc acaaccagct gtgatgagca gttcactttt acttgcatta tccttttctt 29821 tcttctttta tatttaacct ccaaaaagct gacttcatca aaacctaaat aaatgctttc 29881 ttttataaac gactctcctt aatgataaat catgcataga gcaaggaata ggaaaatgta 29941 acgtgctttt atttctttat ctctatttgc caccgtgaat tgacaacaat ttctgggcaa 30001 ctctcaacca cttggtgacg ccagatcata gagaattcaa aaggggccaa ggtcagaaac 30061 cagaacctca tcagttaagc tccttgtcat aattagagct tcccagagct gactgtcctc 30121 tgtgcctgga cagccagaga tgtgtacacg attccaaaca aatataagcg aatgggtcgc 30181 aagtgctact cctcagcact ccacattctc aggagctgtg acactgggac agtgcaggtc 30241 ctgaatggtg agagacctta aggcttaaaa ccacacatgg cattacctga cttcaaatta 30301 tactacagag ctacagtaac taaaacagca tgatactggc ataaaaacag acacatagac 30361 caatcaaaca gaacagagaa tttagaaaca aatccatata tctacagtaa actcattatt 30421 gacaaaggtg ccaagaacat atattgggaa aagaaccttc agttcaataa atggtgctgg 30481 gaaaactgga catctatatg cagaaaaatg aaagtagact cctacctctt gccatattca 30541 aaattaaaat caaaatgaat taaagactta aatccaatac tttaaatgat gaaactacta 30601 caagaaaaca ttggagaaac tctccaggac attggtctgg gcacagattt cttgagtaat 30661 accccacagc cacaggcgac caaagcaaaa atgaacaaat ggaatcacat caagttaaaa 30721 agcttctgca catcaaagga aacagtcaac aaagtgaaga gacaacccac agtatgggag 30781 gaaatatctg caaactaccc atctgacaag agattaatcc ccaggatata taaggagctc 30841 aacaactcta taggaaaaaa aactaataac caattaaaaa tgggcaaaag atctgaatag 30901 acatttctca aaataagaca tacaaatggc aaacagatat atcaaaaggt gttcaacatg 30961 actgattatc agagaaacac aaatctaaac tacaatgaga taccatctca ctccagttaa 31021 aattgtttat atcaaaaaga caggcaataa caaatgctaa cgaggatgtg gggaaaagag 31081 aacccttgta cactgttggc ggaaatgtaa attagtacaa ccactatgta gaacagcttg 31141 gaggttttcc aaaaactaaa aataaaactg ccatatgatc cagcaatcaa tctcactgat 31201 aggcatatac ccaaaataaa ggaagtcagt atattgaaga gatgtctgca ctcgtatgtt 31261 tattgcagca ctattcacaa tagtgaagat ttggaagcaa cgtaagtgtc cctcaacaga 31321 tgaatggata gtgaaagtgt ggttcatata cacaatggaa tactatcagc tctcaaagga 31381 atgagatcct gtcatttgca gcaacatgga tggaattgga ggttccatgt taagcaaaat 31441 aagccagaca cagaaaaaca aactttgtat gttctcactt acgtgtggga gttaaaattt 31501 aaatcaactg aaaccaccga gttacagagt agaaggatgg ttactagatg ctgagaaggg 31561 tagtgggggg tgcggtgatg ggagggttaa tgggtacaaa atataatgga aagaatgaat 31621 aaaacctagt atttgctagc acaacagggt gactatagtc aataataatt taattgtaca 31681 tttaaaaagt aactaaaaga gtataattgg gttttttgta atacaaggga caggtacttg 31741 aggggataga tgccttattt gccctgaggt gattattatg cattgtatac ctgtatcaaa 31801 atagctaatg tatcccataa atatatacag ctactatgta cccacaaaag ttaaaaatta 31861 agaaaacaca cacttggcca caaagaatat tattgtgaat aaacctaatt gacattcacg 31921 cataagaaaa atatcagagc atctcagctt agaacttgcc agcagacaag cacactgaaa 31981 gagcaaatct ttctgtttga gggctggcta tagtgagggt tgttgacttc ttacattcat 32041 tcacagaaca tttgggaaag tggtggtgag tcggcgaaag agactcagag acacacggtg 32101 aattgcccag cttctcagat cgagctgatg gaagaaccaa gatgaaaact tgctccaatt 32161 acagctagtt tagggttctt gtgttcagtc cacaattata aaaggacttg ataagatgcg 32221 gactagggac cacttgcatc aaaaccactc cagggagttt gcgggaacac cctgactcca 32281 aaatttctgg ggatgggacc cagacatatg tatgttcaac acccttacaa ggattctgat 32341 gtctactacc aggcgagaat cacatagctg gacttagctt tctgtcttaa aagctcacgt 32401 gtgtagagta aagcctatga tcttcttgaa tctcttggtt gaagtcactg gtcctcgggc 32461 tgagtaaagg gacttccaca ttatgagaac ttggggagag aaggagtata ttgctaaaat 32521 tagatgaatt attttggata agagatgagc attcttctca gtagtttcta gtggaattga 32581 tccaacccac agcttagact taaagtctta acatagttct tgtagcttag ggctcagcta 32641 gtagaatcca gggcctgcca gttggaagag gatagaggca ctgaagggct acaccccagc 32701 agtaggggga aacagaacta ggtaagcctt tgcagggacc atagcccagc tttcagtcat 32761 ctccatctct gaagttcttt tgagctcatc ctggcttgtt gcaaggctta ggagtctggc 32821 aaaaggaaac acaatccaca atcctctctg gaattataac ctcctatgag tcttctattt 32881 ctacagacat tttgtaaata caacgtcccc acaagattaa aaataaccag ggccactgaa 32941 agacaaggca acccgagaaa aaaataagac aaactagaga aacgaaacag acccataaga 33001 atttcagaag ctagagtcac cacactgact ttaaaataac tatgttatta aaagggaagc 33061 ttaagagttt tatcagagat ctaaaaatta taaaaatgaa tcagaaaggc atggtggctc 33121 atgcctgtaa tcccagcact ttgaaaggcc aagactggag gactgcttga ggccagaagt 33181 ttgagaacag cctggctcgg caagacagtg agaccctgtc tctacaattt ttttttcttt 33241 tttaaattag ataggtatag tggcgtgcac ttgtcatcac agctacttgg gaggctgtgg 33301 taggaggatt gcttgagccc aggagttgga aaccagccag gacaacatag caaaaccctg 33361 tctcttcaaa aaatacaaaa attagctgga cacggtggct tgtgcctgta gtcccagcca 33421 cttggggggc tgaagtggga ggattgtttg agcctaggaa ttggaggctg cagtgagcta 33481 tgatcatgcc actgcactcc agcctgggtg acagagcaag accctgtccc ctgctccccc 33541 caaaaaagaa tcaaatggaa attttggaaa ttaaaaatac aataactgaa attaagaact 33601 cagtgaagag agaactagtc agtcttctca aggttcaaaa tgacttgaga gagcagaaaa 33661 ggagactgtg gctgagggac tgatctgggg tgaggagtcc catgcaggaa gggacttgtg 33721 tcatttgaat ctcctattgg tgccaaagca ggaaagaccc tggtttctta tcagctttcc 33781 cagatgtgga gaagaaaggg aagagggaag ggtgaggctc aaaaactgcc agcaaacatc 33841 agaaagtgga gtcagactcc ttattacaca gaacaaatta cttaatgtaa agaatgatat 33901 attttcttct aaatttaccg tcttattagg tgctattttc ctgcctattt tacatttcct 33961 tgtccctcct ttcttgccct ctttttgatt atttctatat tacacttgcc cctctattaa 34021 ttgggagttt tatatgattt ttaagtcatt cctttaatat tcatcctaga gattatggca 34081 tgcattacaa cctatcaaaa atttaatatt aattggtgtt tttgtttcat gtgggataat 34141 gcaatgattt tataaaacac ttatataact ctctgctatt atttgtattt taattgttat 34201 atcttttaaa caccacaata cattattttt attttatgca gttgcaaatc atttagattc 34261 actcagacat attcaatttt tatcattctt tgtttctttc tgcatctcca gacacccatc 34321 ttcaaaccat tcctcttcag aaggaatctt tccttctcct agaaagatgc ttttcaacat 34381 ttcctttagt gaaagacttc tggagacaaa ttctctcatt ttttgtttct ctgaaattta 34441 tcatgtcacc acttcttaaa aacatttatt ggtagctata ggattccaga ttagcggtta 34501 tattttttca gcactctgaa gatataattc caatacattt tggctcccat tatttctatt 34561 ttgaagacag ctgtcagtct aatagtggct acttttagtc tgtttttact taattttggc 34621 tgcacttaca agttttgttt gcttgctttt gtctctggtt tttaatagat tttcttatgt 34681 gcctaagtgt ggatttcctc ttatttctct ttagggttcc tagagcttta gaatctgtgc 34741 tttgatatct cttattagtt ttgcaaagtt ctctgccagt gtcccttaaa gtattgctac 34801 cgattcacag ccgaattcta ccagaggtac aaggaggaac tggtaccatt ccttctgaaa 34861 ctattccaat caatagaaaa agagggaatc ctccctaact cattttatga ggccagcatc 34921 attctgatac caaagccagg cagagtcaca acaaaaaaag agaattttag accaatatcc 34981 ttgatgaaca ttgatgcaaa aattctcaat aaattactgg caaaacgaat ccagcagcac 35041 atcaaaaagc ttatccacta tgatcaagtg ggcttcatcc ctgggatgca aggctggttc 35101 aatatatgca aatcaataaa tgtaacccag catataaaga gagccaaaga caaaaaccac 35161 atgattatct caaaagatgc agaaaaggcc tttgacaaaa ttcaacaacc cttcatgcta 35221 aaaactctca ataaattagg tattgatggg acgtatttca aaataataag agctatctat 35281 gacaaaccca cagccaatat catactgaat gggcaaaaac tggaagcatt ccctttgaaa 35341 actggcgcaa gacagggatg ccctctctca ccactcctat tcaacgtagt gttggaagtt 35401 ctggccaggg caatcaggca ggagaaggaa ataaagggta ttcaattaag aaaagaggaa 35461 gtcaaattgt ccctgtttgc agatgacatg attgtatatc tagaaaaccc cattgtctca 35521 gcccaaaatc ttcttaagct gacaagcaac ttcagcaaag tctcaggata caaaatcaat 35581 gtacaaaaat cacaagcatt cttatacacc aacaacagac agagtgccaa atcatgagtg 35641 aactcccatt cacaattgct tcaaagagaa taaaatactt aggaatccaa cttacaaggg 35701 atatgaagga cctcttcaag gagaactaca aaccactgct caaggaaata aaagaggata 35761 caaacaaatg gaagaacatt ccatgctcat gggtaagaag aatcaatatt gtgaaaatgg 35821 ccatactgcc caaggtaatt tacagattca atgccatccc caacaagcta tcaatgactt 35881 tcttcacaga attggaaaaa gctactttaa agttcatatg gaaccaaaaa agagcccgca 35941 tcgccaagtc aatcctaagc caaaagaaca aagctggagg catcacacta cctgacttca 36001 aactatacta caaggctaca gtaaccaaaa cagtatggta ctggaaccaa aacagagata 36061 tagatcaatg caacagaaca gagccctcag aaataacgcc gcatatctac aactatctga 36121 tctttgacaa acctgagaaa aacaatcaat ggggaaagga ttccctattt aataaatggt 36181 gctgggaaaa ctggctagcc acatgtagaa agctgaaact ggatcccttc cttacacctt 36241 atacaaaaat taattcaaga tggattaaag acttaaacgt tagacctaaa accataaaaa 36301 ccctagaaga aaacctaggc attaccattc aggacatagg catgggcaag gacttcatgt 36361 ctaaaacacc aaaagcaatg gcaacaaaag acaaaattga caaatgggat ctaattaaac 36421 taaagagctt ctgcacagca aaagaaacta ccatcagagt gaacaggcaa cctacaaaat 36481 gggagaaaat tttcgcaacc tactcatctg gcaaagggct aatatccaga atctacaatg 36541 aactcaaaca aatttacaag aaaaaaacaa acaaccccat caaaaagtgg gcaaaggaca 36601 tgaacagaca cttctcaaaa gaagacattt atgcagccaa aaaacacatg aaaaaatgct 36661 catcatcact ggccatcaga gaaatgcaaa tcaaaaccac aatgagatac catctcatac 36721 cagttagaat ggcaatcatt aaaaagtcag gaaacaacag gtgctggaga ggatgtggag 36781 aaataggaac acttttacac tgttggtggg actgtaaact agttcaacca ttatggaagt 36841 cagtgtggcg attcctcagg gatctagaac tggaaacacc atctgaccca gccatcccat 36901 tactgggtat atacccaaag gactataaat catgctgcta taaagacaca tgcacacgta 36961 tgtttattgt ggcattattc acaatagcaa agacttggaa ccaacccaaa tgtccaacaa 37021 tgatagactg gattaagaaa atgtggcaca tatacaccat ggaatactat gcagccataa 37081 aaaatgatga gttcatgtcc tttgtaggga catggatgaa attggaaatc atcattctca 37141 gtaaactatc acaaaaacaa aaaaccaaac accgcatatt ctcactcata ggtgggaatt 37201 gaacaatgag atcacatgga cacaggaagg ggaatatcac actctgggga ctgttgtggg 37261 gtggggggag gggggaggga tagcatcggg agatatacct aatgctagat gacgagttag 37321 tgggtgcagc gcaccagcat ggcacatgta tacatatgta actaacctgc acaatgtgca 37381 catgtaccct aaaacttaaa gtataataaa aaaataataa aataaataaa taaataaaag 37441 gagagtaaat tgaaaaaaaa aaaaaagtat tgctaccaat ctattttctc tctaatccgc 37501 tttggggact ccaactacac tcatgtaaga catgagtctc attgctttca ctttgcttct 37561 ttacgttttc ttccatattt tttattcttt ttttctctct atgcttcctt ccgaaaattt 37621 ttcttttgtt aaccaacact tctcaaactt taatgtcttt aaaatcattt gaggatctta 37681 ttaaaataca attcagattc actaagtctg tggtggggcc taagattttg catttctgaa 37741 aagctgccct acagaaagag gacactaggt aaaaactaag gaaatacgag taaagtatgg 37801 acttcagtta ataaaaatgt atctgagcta tgtttgcaac ttttctgtaa atctaaagct 37861 agtctattct aaaattaaag ttttatttta aaaaatgttt tgagattatg tcaatgaggc 37921 tggtctacag atcacaggtt aagtgattgg ttgagattct aaccgatctt ccaagattga 37981 cctcttcttc cagacaacct taattatata agtagtaaac ctttaacctt cttggtgtgt 38041 atgtatggtg tcatcagttt caacattcaa accaaatttt ggatagggat gcattctgct 38101 tctgcaggct ggctgaaaca gactacatgt gtcttagcca attttgtgct gctattatag 38161 aatagcaaag gtgaagtaat ttataaagaa cagaaattta tttctcacag ctctggaggc 38221 tgggaagtcc aagattaagg tgccagcagg ttcagttgtt tgatgagggc ccagtctttg 38281 ctcccaagat ggcaacttga acaagtgcaa gtcctcacat agcagaagag tccaaaccca 38341 cttctgcaag ccctttttgt agcggaatta atccactcat gaggactctg cctcctatta 38401 ggccccatat cccaacactg ttgtattagg gattaagttt ccaatacatg aattttgagg 38461 gacacattca aaccatagca atgttctatt taggagacca agattatgtg acttgataaa 38521 aaaatacaaa actctaggcc aggcacagta gctcacacct gtaatcccaa cactttggga 38581 ggccaaggca ggcagatcgc ttaaactcag gagttcgaga ctagcctggg gaacacagtg 38641 agaccccatc tcaacaaaac atacaaaaat tagccaggca tagtggtgca cgcctttagt 38701 tccagcactc aggaggctga ggtgggagga tcacttgagc ccagatggtc agctgagatc 38761 acgccactgc tctccagcct aggtgacaga ctgacaccct gtctcaaaaa taaataaata 38821 aataaataat gcaaaaccaa aatatatacc gtttacaaga gctgtgtcta aaacacagag 38881 acacagaaag attgtaaata aagggttgaa aaagataaca gggaaaccac taggttaata 38941 ttcaaagata aaggacttta aggtagactg cattatcaga gataaatagt gacattacaa 39001 aaagctaaat tattcaactc agaggatgat gtcactattc taaaattgca tgcatctaat 39061 aaaagagcct caaaatataa atgcatgaat caagtttcca acactaacag aaacagaaat 39121 gtgcaatcat agtgggaaat tgttacacaa ctatttcaac aactgacaga acaaataaac 39181 aaaaatccag tagagatata gaaaatataa acaacgtgat tagcaaactg acagaatgga 39241 cacattttct acattgcata ccaaaaatgt aaaagatgca catggaatat ttacaaaaca 39301 ctgaccaagt ggtagttaca tagagctttc ctttttgata aatcatcgaa ctgtacattt 39361 tcattttatg cacattactg tttgtgtact gtttctcata atagtctgga agttccctgt 39421 ctagaactat taagaccctc catggtgtca aagacacagg ctctttctac cttgttattc 39481 cactatatat atcttccatt tttaaggcga tctcctgtcc aaatgcctgc tcaaatttct 39541 agccattgaa acacatccca atcacaggaa ggagaaaaaa ggaagcgagg ggattcacaa 39601 cctcccttta aaaaaccctt tgtagaatct gcaggtacta tttctgctta tatctcagta 39661 gccagaactt agttactttc acacctagct caacaaaaat ttgagaatgg agtctttatt 39721 tcagttgcca tgtgcccaga aaaaaataaa ataaaataaa gggaggacga agattctatt 39781 gctacacaag agagataaaa atgttaccta gatattaaaa taaaggggga ttttgatctt 39841 gtttaatgga tttttccctc agacccagac aatcgtcaac atagaaaaag acagatttta 39901 ctcccaagga ctcctctttg cctatattcc ttttgtgtag tatgaggggg gattgaatgg 39961 gtcagggaga aacaggcaag tttatttact tgacttggct ttataattgc acccataccc 40021 taaacatact acttttaatg aataggctgc agaggtcttg cttttttgag ttaagatgag 40081 tgagttgcat ggaaactgag gtcattaata aatacagcat ctattccaag tatgatgtcc 40141 aagtctaaat caaacacaac tgttgaaagg ccagatgtta ttcccattgc tatatttcag 40201 tgttttcctg gaagtgatct caccctgatg gtaatgccgt tgtccaactg tgtatgctat 40261 ttatatctag aaccccagct ttcaggttta aaatgcacca aactgcaaat agctattgct 40321 ttgagtgctg agcctgtcat ggggaattta agaagaatcc ttgagttgtg atgtgctaac 40381 ttggatggct atcaggaagg aaagaattta tgtaacttgt aaacatgtag atatattgtg 40441 atagagttct aatgcatggc tctgagaagc accttcattc atattcacca atgcagggaa 40501 tcaaagactt cactgtcccc tctgtgaaga agaaatttaa ctgcaattaa ttatctccta 40561 cagactaaaa agtgtataat cccaaagacc aaaggaacat ttagtcagag ccataaacaa 40621 tgccaatact taaaaagaaa acactctcag aaaaaagtgt gatgatgcct tttatctggt 40681 gtattaaaac aagaaaatac accagataaa aagcatcata acactttatt tatttattta 40741 tttacttatt tatttattta ttgctgagat ggagtcttgc tctatcgtcc aggctggagt 40801 acagtggtgt gatctctgct cactgcaacc tccactccct gggttcaaac aattctcctg 40861 cctcagcctc cagagtagct gggattacag gtacccgcca ccacacccag ataatttttg 40921 tatttttaat agagatgagg tttcaccatg ttggccaggc tggtctcgaa ctcctggcct 40981 caagtgatcc acccacctca gcgtcccaaa atgctgggat tacaggtgtg agccactgcg 41041 cctggcccac actttaaaat ttggaggcaa agagagtcat cacgagaaaa taatctgttt 41101 ttgaagaata tcatagactt ttttttttac aaataatttg taaaatattt tggtttatgc 41161 attcatctca caaaaacatt atttgtatag aggtgattac gatgcagtta ctaacacatc 41221 tatagctgca tacactttta atagtacatt tacataatag atgtgaaata gaacatttag 41281 ctggctccta tgaatatcaa catgaagaca actagtgcgt aagactgata aaaaaaacaa 41341 aagctaacat tcctgaggta cttcctatca ggcaggttct gtgctaagtg gtttatatgc 41401 tttatctcac ccaatgggag attctactat ggtctccaat ttcagcagag gaaagggagg 41461 tgtgggaagg ggatgttact tgcccaaatc cttgtggctc ttgagaagtg gagctgggac 41521 ttgagtccag gtctgctggc tccagatcct gtctctgtag caccagatgg cacggcctcg 41581 gattaagtga gacatacatg ctatgagagc ttcgtccact gcttgaccca tactgagcag 41641 ccaataaagg cttacacaaa agcagactaa ggcaggctgc aatccacacc ccatccacag 41701 cactgaaggc tcccagagca cagttaaggt aatgaaaggt gctgggaacc tctggttgag 41761 ggtccctttt tggtgctacc tctgcctgtc agttatctac gagtttcaac cctgcctaag 41821 tcactgaggg gctggatggc tacgggacca atcccttctt tctggactgg actttcttct 41881 gtaaaagaag aggatttgac tagaacgatt cctaagcttt ctccccgctc tgaacttcta 41941 ccacccaagt catctctaag cttttgcgtg tgaaaattag tccagaagcc cagggaggcc 42001 cgtacccgcc ctgaacatgt aggaagggat taaccatcct aataaaaaag acagcagtgc 42061 tctatgaaga tcagcatttt ccaccaagga tgaactttgt ttcaatggca acaggccatg 42121 gtggcaaaac tgcaaacttc tagtgtgggc atgactcaaa accagagggc cagggcagac 42181 ctttagaata gctgatggaa tcgaattatt ccaccgactt tcacaatcct ggccatttca 42241 gccccacatg cgtcagtcag gattggaaat tgacgcaggc tgtggggctc tggagtgctg 42301 aggcagaagc gccgaagtcc gctcagcggc gcccactagc ggaaggccac agagcggcgg 42361 tgcaacggga cggctcaggg ctccaactgc tctgcactgt gaagctgcaa gtcctccctt 42421 gcatcatcgc acagggaaac ggagggagaa aggccaccag ctccttagag gcagagttga 42481 cctcttggcg gccattatgg tccatgaaga aattccctct gtacaataca ctaatttttc 42541 ttcatcataa cctaacacat agtatttatt tattgcaaga tgctggtcac cgtattaaaa 42601 gataatacaa atttattcaa ttttttaatc caagaaaatg taaaacatct gaagaaaatc 42661 ttggttcttt tgattctgtc atagtttaat aactgtgcaa atgaatgctt gggatccctg 42721 ctcccatgaa tattttatta ggggcaaatg gtttatagag ataatattgg aaataagaat 42781 tatttgacag atattttaaa catttcatgg gtgactggga cgcatattgc aaattctcaa 42841 acatcaaaaa tctgacaaag ttcaccaaat agtctcaaaa taagtttatg gtagtaatgt 42901 catttattta ctggataact aaccaacact taaaaaagtt aagcatgttt gggttaaagt 42961 agcgtttgac catgttaggg aagcccagtt cttatcctta gagaggttgc attctcacag 43021 gtaaatgaga tctatacaaa ggatacaaca taagactata attctgaaat aatttaagtg 43081 tcctagaaaa atgttctgaa ggagttcaga gaaggagtgc agaaaatcta aaaacaagta 43141 ggagaacctc atggagtgag taggacttga gttgaaactt aaaggatggg aagggcattt 43201 aagtagaagg gaaggaagag aagagcgtgt tcggggtagt caaaactcat aaggacgtat 43261 ggtgatgatt ggtacaggaa ggtgtttcag gaaagaaagt gggtattcag agccctcctg 43321 ccatggctta cctcatttat ggctttagcc tctaaaactt ctcccttgtt gtcagttggc 43381 ttctttcttg ccttttattt atttatttgc ctcttttcta gtctagaact catggtcaag 43441 cttgctgaac atgtgcttta gcgaccttgc tacctccttg tgctgtggct cctgggcagc 43501 caaacactag aggagagggc gctgggaagc caggctgtgt ggcaattctt ttgaccagtt 43561 gccatacatc agctgcagtg accagcccaa acgtgtttaa ctcaaactca cattcatttc 43621 ctcgccatcc tcacaccttc tacttacctc atctcgggag actggggcta ttctacatga 43681 gctccccaac gtacctcgtc tccactttaa attgtttctt gattttcatt ctttcttcac 43741 ctgtgtccca aaccactccc tcgcatttac acatcacctc ctcaggtgag gctcaatcgt 43801 taggtcctct gactctccca tttgaagtgt ctcacaaagg agagcactgt ctttgctttc 43861 aggcattctg tgctttaggt gaatgtgaaa catacaatgg aaaaatctct gttacacgta 43921 ttcagccccg gagctcccgt gattcccaga gagtgggtat tagaagtgat cccaaagaga 43981 ccgcccctgg gaagggagaa tggatataga tgacagcctc ctggagtcaa ggtcaaacag 44041 tcatcactta acttgtctga gcttcagttt tagcacaagc aaagagcaag aacaaatgaa 44101 gcagtaagtg cagaagcgaa aaggtgcttt gttctcttta aaagaagcac actgaggaat 44161 gaagtcccct gaactgctaa atggaagatt tacatgcaat ccttctgcct ttggcgaggc 44221 aaagtttttt atttttttaa acaacaaact tttgggaaga aaggaacaaa ctgcaacttt 44281 tgacaatgaa aaaatattcc aaaatatttg atattgagca acaaaagcac tcagccattt 44341 aataggtgat tggtgtttca aaaaccatgg gtaaaaaaat ctctttcagc taacatggct 44401 cttaaatgat ccattggaac aggctttgtt attcgttatt gaaaatgtgg agatctcgct 44461 aacctgagtt tctctccctt ggtgggagtc tctctgcttg ctccagcata ccacccactc 44521 ttcgcttttc tctggtagcc cttctgtttc tgtatggttt ggccctggga aaataaagac 44581 acatgcagag tgctcacaaa atgcaaaaca tgctaatgaa tgtgggaatt agggatgttt 44641 ccctggctca atcctttcat tacgtcttat ttttcctgta actcctcatc aaaactttaa 44701 attaaacagg ccatatggaa aaagctttca atgagattgt aattaggtat ataaagcttt 44761 cataaattaa accaagaaca ggaaattttt tttgaaatcc catgaaggca ctcacagcgg 44821 caggctccta tggcacgcca agctctgctc ctgttctagt tcaactcctg aagtcaaagg 44881 acatagtcaa agtcttgctc ctgaaccaaa ggcactacaa tggtttgttt gacggttagc 44941 tctctgtgtc aacttcgctc cgccatggta cctagatatt tggtcaaatg acagtctaaa 45001 tgtcactgtg aaggcatttg taagatgaga ttaacattta aatcagtagg ctttgaatga 45061 agccgattac ccttcatgat gtgagtgggc ctcatcccat cagttgtagg caataagaga 45121 aaaaagatgg aaggccccca ggaaagaggg aattcttcct ctagactgcc tttagattcc 45181 aagatgcaac atcacctcct ccctgggtct cctagcctga cacctgccct gcagagtttg 45241 ggcttgccac ctcccacaac tgcatgagcc aattccttaa cataactgta tttatctatc 45301 tacctatcta taaatatgta tagctcctat tggttttgtt tctctagaga accctaatac 45361 agatgggtgc tcttttgtta attgcaagag aaaacatcat ctccaacaga aagaaactgt 45421 ctgaacttga aatcacagaa gatgtttcca ggctaggact gttacatttt ccctgaaaat 45481 ttctgtttac cctgttgcct ggctaaagtt caatttaggt agggtgatgc actctgaagg 45541 tcacattttc tctcttttat gtccagcaga tccaaccaag tttctttcct tgacagttct 45601 tatttagcaa taacagaagg gattgtttta tatctgatat atgtgctcat aaaaaatttt 45661 aaagtaaact ttgaaaacag ataaaaacat gaatgtattt agttcttgcc caaatgcttt 45721 taaacctcca ataattagtt ccctattctc tctcatagca gccagcagaa atattcctgg 45781 aacaattcca gatctcccta gagtcaaatg aaattcagaa actccaactt tattgttgga 45841 aacagggaag tagggggctt aggcttggct cccaatgccc tctacaatgc ttcacaggct 45901 cagtgagcta caggacaccc agcacttcct ccacttgtct cccttttcct gggtccatgc 45961 tcaccctcat tctcacctca gggcttgacc cctcccaggt tctcagtctt tactggcttg 46021 ctttcctcct tacccacctt catgttcttt ctgtgcagag ttcagtggca tgcgcactgc 46081 acactgggtg tgggaatgac tgagttgttg aaaggtgaag tagtattgta accaactgtg 46141 acagtccttc ttgccagttt cttctttcag gaggtatcag tagcaggaga gggattttca 46201 tgcctgaact ggctaccttc atttcttcca ttatcattgt tgagttacca ctagcttaag 46261 atgttccact ggcctgagtc aatcagaaca atggtgccaa gaatgtgtgt gttgtaacct 46321 ctggcaactc tctgactatg tctgtgtgtg gtgtaatttc agggttcaaa gctcagtgga 46381 ccaaacacat ctgtttacca tggtctggag taatgtaagt ctgtttccct aggtggcccc 46441 gggctgtttg ctttctgctg gtcaaagtac tagcacatgg tgaagctaag tgaggaacta 46501 tgcccactac agttctgtaa gtatttgata gaatgaagat tttctattga ctggggctta 46561 gtacattctg aaatcttaga accacatggt cagtttccac acaaaattct cagacagaat 46621 tttaaaacta agaaactctc atttgaatac aacagtttta aataacgagt aattgtagag 46681 tctcttgatt gcctatttgc acccaatggg atgtaaaaag ccatttttgg taaaagtttg 46741 tgtctaattt aaacatcata ggctagagta atattatttt atttcatgta aatgatcatc 46801 agtcagtttt tttgggccaa aagatatgac agagacagag agttaatcag ttcacaagaa 46861 ttcttgagag caactttagg taaatatgaa tagttgtaag ctgtaaaggt agacgatgta 46921 atttaccaag ttataagaga taatctcctt ttggttctct tcctttagca ctcggtaggt 46981 agaaagaggg aggacactta cattctacat atgttctaca tatgaacatt tgtacatttt 47041 acatgcatac tttccgtagg gttgtggatc tggttgtgag atacagtcac acaacttcat 47101 gtaagtactg aatacagagt ggagatgtct actgctagta gagttcaatg tctttgaagt 47161 tttatattac atccttacta agttatgtag atttttgcat cctattccat aaaatgtcca 47221 aattagtttt aatataaaaa taatggtcat ggataccagc aaaaatggta gagtaagaac 47281 ctctaaaaat tccatctttc ataaaaataa taagaaacag gcaaaaattg tcagcatcaa 47341 ctttttcaga gctctggaaa tgaactaatc ctgggagcac ttactgaaca aaaatggctg 47401 aatcttgcta agaacagtaa gttttgtggt attttgactc ggtaaaaacc tacccactgc 47461 ttaccagcct acctttgaaa ccagtagcct ggcagcaact ggcagaagca gaatagagct 47521 tgagctcttt taaagctcat ttccaagaat tgtcattatt tgactgtcag tatgtctttc 47581 aaggacccca cttgaaggct gtctgttttt gaactgactt ggagcttgcc ctgtccaaat 47641 agcctttccc tcagagtggt tgttgaaaaa caattacaaa caactgctta aatccacagc 47701 tgcttgagat ggtgaataaa agttggggta aacaatagac taactaaaaa gcataaaaat 47761 aaaaaataag ctgtccatag ggactttgaa aaactcctac atattcctgg gaattaaaag 47821 aacaaagtca tgtgtccatg cctagagctg tgcaggtgct caggaaagac atgagaaggc 47881 cctaaggcat taacttgggc tgactttgat gctttgcaca agcagggagt gaaggcttag 47941 acagagctgt aaactgcctg gctgaatgtt gaaggcatcc cctagcacac acaataaacc 48001 cctaagcaaa gacagggata cttattgatt ccaggtgttt aggaaaagtt ctgcctaaat 48061 attaactggc caaacataag cagacacttt agtggccaca catgacaaag attatggacg 48121 ttatagaatt ggttcagaaa ggtcagtaaa caaactaaat caacaactac aataaaacag 48181 aaacaacaac aaaccttgga gacttgggaa gggtggggga gggaatatga tttccagaat 48241 tgccacatga tattatttta aatattgagt tttcaacaaa gatgtatgag acacacaaag 48301 aggcaagaaa ttatggcaca tacaggaaca aaagcagcca atagacattg tccctttgga 48361 aaccagatgt tgcacatact agacaaaata ctgtcagtta cttaaaatat attcaaagaa 48421 ctaaaggaaa ccatgtatag agaactaaag gagagtgtga gaaaaatgtc tcacaaaata 48481 gataatacca ataaatagaa attacatata aaaaagaatg aaataaaaat tctaaagttt 48541 gaaagcacaa taactgaaaa accttcccga gagaagctca atagaagatt tgagcaaata 48601 aaagaaagaa tcaacaaact tgaagatgtg ttaattgaga ttatccagga tgaggaacag 48661 aaagaaaaaa caatgaagaa aaataaagag agcctcagag acctgtgcga cactctgtag 48721 cacaagaata tatgcataat gggaggtgtc tagaaagata ggagagagag aaagcagtag 48781 aaagagtatt taaagaaata atggttcaaa gttttccaaa tttgataaaa accattaatc 48841 ttcatatcca agaagctcga gaaaattgaa gtagggtaaa ttcaaaaact caaagatgac 48901 acattataat cacatcattg aaagacaaat cttgaaaaca agagagagag actcatctca 48961 tatgagagat tttcagtgaa gttaacagtt gatttatcat cagaatccat agacgccagc 49021 agtgagatga catagtactg aaaatactga aaggaggaaa aaatctttca gccaagaatt 49081 ctatatctag caaaattatc tttcagaaat taaagataaa caaaaacaga taatttcgtg 49141 caagcagatc ttgcctgcaa gaaatgctaa ggaactcctt taggcataaa tgaaacaaca 49201 ctagatagta acttgaatct acatgaataa agaacaccgg taaaagtaag tacatacgta 49261 aataaagcaa taattataaa ttcatgtgtt gatgggtata caacatagaa agatgtgatt 49321 tttatgacaa agcacaaagg aagggagaag gaatgaagct atgtagggac aaagttttta 49381 tgtatgatta caattaagtt ggtattaatc tgaattatat tatcataaac taagtaataa 49441 tcaccagtgc aaccactaag gaaataactc aaaaacatat agtaaaacaa tgacaaggaa 49501 gtgaaaagat acaaaaagtg tgtattgaac ataaaaaagt cagttatgga ggaatggagg 49561 aacaaaaagt catgaccttg aatacagaaa atttgaaaga atccaccaag aaataaaaag 49621 tgttagtgct aatggttacc tagcttgaaa aggttatagg atataagaca tacagttgac 49681 ctttgaaaaa tatgggttta aactatgtgg gcccacctaa aggcagattt ttttcaataa 49741 atacaatcag cctttgatat ccataggttc tacctctgca accaaatgca gataaaatat 49801 acagaattca tgggatgtga aacctgtgga tagaggccag atgttttgta taaatgggtt 49861 ccaaagagcc aagtgcagga cttgagtatg ttaaaatttt gatatttctg gggatctgag 49921 aaccaatccc ccacagatac caatgcacaa ttgtatatac aaaaagcaat tgtattttta 49981 ttccctagaa atgaacattc tggaaatgaa attaagaaaa caatttcatt tacaacagca 50041 tcaaaaagaa taaaatatta gtaataatgc caacaaatga agcacaaagt atattctaaa 50101 agcaataaga ttattgaaga aatttaagag aacctaaaca gatggaagta cacctcatgt 50161 tctttgagat gggaagaact tagtattgtt aaaatggcaa tacttaatac tgattgaaaa 50221 acttaatatt gtcaagacag cagtattccc taaattgatc tacagattca attcaatccc 50281 tatcaaaatt caagttgctt ttgttcccag aaacccacaa gatggctcta aaattcatac 50341 gggaatgcaa aagatccaga ataaccaaag ccattttgta aaagaagagc aaagtgggag 50401 gactcccact tctagatttc aaattttcct acaaagctac attaatcaag actctgtggc 50461 actggcatgg aatagaaata tagatcaatg gaacagaatt gagaatccag aatggtcgat 50521 tgatttttga caagggtgct aagataattt aatgagagaa agagtaatct tttttaacaa 50581 acggtgctgg gaaaactgga tatctacatt aaaaaagtta atttggaccc ttactttata 50641 ctatatacaa aaattaactg aaaatggatc aaagacctaa acacaagaga taaaactata 50701 aaactcctag aagaaaacat aggcataaat cttaataacc tttcttgaca tttgctaaga 50761 gagtagatct taagtattct taccataaaa aaggtaattg tgtgaagagg tgaatacatt 50821 aattggcttg cctgtagcat tcagtttact gtgtatatgt atatcaaaac atcacatcac 50881 acaccttaat tctatgcaat tttaataaaa cataaaaaat tttaaataga aatataaaat 50941 gaagatttct gatggtaaac gtagtgcccg ctgattataa atattcaaat agacattttt 51001 ccaaagaaga catacaaatg gccaatgggt atatgaaagg ttgctcaata tcactaatca 51061 tcaggaaaat gcagatcaaa atcacaatga gatatcacct gttaggatgg ttattatcaa 51121 gaaaaaagcc aaatgataac aagttgtatt agtcaatttt catgctgctg ataaagatat 51181 acctgagact gggtaattta caaaagaaag aggtttaatg gactttacag ttccacattg 51241 ctggggaggc ctcacaatca tggcagaagg tgaaaggcac atcttacatg gtggcagata 51301 agagaagaga gagcctgtgc agggaaactc ctgtttttaa aaccatcaga tctcatgaga 51361 cttattcact atcataagaa cagcatggga aagacctacc cccatgattc aattctctct 51421 cactgggtcc ctcccacaac acatgggaat tatgggagct acaagatgag attcaggtgg 51481 agatacagag ccaaaccgta tcacaagtgt tcatgagaat gttgagaaat tgaaaccctt 51541 tttactgtgt tggtgggaat gtagagaatg gtgcagccac tatggaaaac agtataaagg 51601 ctcctcaaaa aattaaaaat agaatgatca tgtgatctag ccatcccact tctgggtata 51661 catccaagag aattgaaatc aggatcttga agatatatct gcactaccat gttcactgta 51721 gcattattca caacagccga gatatggaaa caacctaaat gtctgtcaac agatgaatgg 51781 ataaaaaaag tgggggcata tacatgcaat ggaatattat tccaccctaa aaaaaggaag 51841 gaaatccttc catttgcagt aacattaatg aacctagagg acattatgct aagtgaaata 51901 agccagtcac agaaggacaa acactatgtg attccactta tatgagatat ttaaaatagc 51961 caaacaccca gaagaagaaa atagaattgt ggttaccagg ggctgggatg aggtgaaaat 52021 gggcaattgt tgctcaatgg atatgaagct tcagtcatgc aagatgaaaa agttctagag 52081 gtctgctgta caacatagta cctatagtta aaatacagca ttgtgcactt aaaaacatgt 52141 taggaatgta gatctcatgt tgtgttctta ccatacacaa atataataat aaacagagac 52201 cacacacata aaataataca agaaagtttt tgaaaattgt gaatatgttt agtaccttca 52261 ttgtactaaa tggtatcgca tgtatacata cgttcaaatt tatcaaaata catagattaa 52321 gcatgtgcag ttttttgttt ataaattata cctcaataaa cctaaaaaaa tcctatgacc 52381 ttggattaga caatggtttc ttagatatga cagctaaagc acaagcagcc aaaggaaaaa 52441 aatacataat aaaacttcat taaaattata aaacttgtgt gcttcaaaga acattattga 52501 gaaagtgaaa ataaaaccca cataatggaa taagatattt gctaattata tatctcacaa 52561 aggtctagta tccagtatat ataaagaatt cttacagctc aacaataaaa agataagtag 52621 cccgattaaa aaccgggcaa aggatttaaa tagacatttc tcctaagaag ataaataaat 52681 ggttaacaag catgagaaaa gatgctcaat atccatagtc attggagaaa ttcaaataaa 52741 aacaataaga taccacttac acccactagg atccctgtaa ttgaaaatac agataacaaa 52801 tactggtgag aatgtggagg aattggaact ctcatacatt gctagtgcaa atgtaaattg 52861 atttggaaaa gaacttgaca gttccttgga aagttcaaca tagagttacc atatgactca 52921 gcaatttcac tcctaggtat atacccaaga gaaataaaaa cgtatatcca cacaaaaact 52981 tgtacacaaa cgttcgtagc agtattatta ataatggccc caaagtaaag acaacccaaa 53041 tgtccatcaa ctgatgaatg aataaacatg ttagtatatc cattcaatgg tattgtccag 53101 tcataaaaaa aaagaatgag tactgataca cagatgaacc ttgtaaacat actaagcgaa 53161 agaagctgga cacaaaaggc cacatactat atgattccat ttatatgaaa tgtctagaat 53221 aggcaaattg atggacagaa aagtagatta ctagttgcca ggggccagga aagtggagaa 53281 tggggaatgg ctactaatgt gcatggtttt tcatttgggg tgacaaaagt gttctgcaat 53341 tagatagtga taatggttgc atgacttcgt gaatatactt aaaaaccagt gaattgtact 53401 ttaaggggtg gatgttatgg tttgtgaatt aaatctcaac ttaaaacaga ttaaatacta 53461 attagcagat ataaaaggca tatgagatat ataatagttc ttccttttcc tcttgtccta 53521 ttcccagatc ctaaggaaac actgacaccc tgcgtcacca tgtttatcct gtggcttctt 53581 gtattgcccg gcccacctcc aggcactcgc agttgaaaaa catgagccta ggaagccaaa 53641 aaggaaatca aagagtcaga ggttctcagg tcccttaggt gatgcccggg agaaacactt 53701 tctactccat ttttttcatg ggaaagtaaa tatatttcag attagaccac acagttagtt 53761 ggaaaagccc ttaaaattct tcaccttttt ttttttgccg tcatttcata ttcataatcc 53821 caaggattat aacaacagga agagagcatt tccccccatt atagaagagc tattgccact 53881 gaaaatcata tccatcttct atcttgtcta tctatcatct atctatctat ctatctatct 53941 atctatctat ctatctatca tctattcatt tatcactgta ttcatttatt tggtccccag 54001 taacttgatg aacttgaact actctcccta agtctgagtc tgggacactg aaaaaaacca 54061 agggtcaaga ctctggatta aggtaaatta acctccatca gggatgccaa gtagatttta 54121 atctatatgc taacttcaat acattaacag ttgagaaagt ttctgtgctg agcaagatcc 54181 tagggacatg tctgcatcta gtttgacaga atggtgtgct tgattagtaa tgtctgcttg 54241 aattcatgta ggaaaatcag ggcctgtgta cagcagatac gccaccctga cttagcgcat 54301 gctccctcag gggcctctag ccagaaaacc tcctggtcat atcactcaaa ccttgaaata 54361 ttgcattcgg cccaatatga aattgtattg tatgtgtaca tttcaccttc agcagcctgc 54421 agcttaattt aaagcaaatc aaaccaaatg cccactctgt tgtaatgatt acaacctgtc 54481 ctatccatta ttccatcaat gacaaaggaa atgacttatt ggagaatgct tctttaggtt 54541 cctgtcttct aggataagtc aaccaccctt tacattagac cccatacatt agataatgca 54601 tgttcagggt ggtatggctg tagacacacc ccattcatta gaaaatttag aaatgatttc 54661 ctatcttaaa attattgttg ccatcaaatc ccccttgctt tttcacttcc caaattggtg 54721 atgactaatc tctcatactt gtaattattt ccattctttc tcactcagca cagaggattt 54781 gtgtgtgtgt gtgtgtgtat gtgcatgtgt gaaggcagaa ggatggggac aggtagaatg 54841 gaatgattta aatgtttact ttgcattttc aagtcagaga tggaagatac caagtgggga 54901 ctcattatta agctaaggct cccactggcc aaaacatgtg gatttaatgc cttctctact 54961 gggatcatga gtatgcagac tttattaata gagcagttct ggactttctt caggtttctg 55021 aatgcatttg gttaataagg cagtggtcta gcacagcctg ctgtttctgg gtaggttctt 55081 tccactgaaa cacaggagga ggagagaggg agggatccct gcgtatttag tcaggacacc 55141 cagccaacaa ccctttacaa agcacatgtt ctccatcctt ttctcaaagg cttccaaact 55201 gcagatcaaa taaacaatca aagatactga agagatttct gatttccata ccctctcacc 55261 cattattccc acagctaagt aaaaccaaat ggtgttatgt aaattttatt caaataaaag 55321 cacccagaca agtccaatac tttaactgtt ctgttattta acattctata caaatcttta 55381 ctcttttctt gcattttact tctagtgctg tgctcagttg aagacagtgc acatctaata 55441 aaacactgat attttcctga taatagacac aagttaaaaa gaaaaagaaa aatacataca 55501 ccaaaaacat tagtggggat ttgttttata cttttttgca tgttggggta atttttgcaa 55561 taagaatgta ttgctggtat tattaaaaca atcaatatag ctatgtctta catatcaata 55621 ggcaggttat caacaaaaga cacaattagc tttaacagaa gaaaaggaaa tgattttttg 55681 gccaacggct taccagaaag aaataacaga ataggaaaaa ctaatgacag gggaaaatgt 55741 ctgcctaccc acagcagagg tgtaaacatc tttgaccttt attaacgaca cagcaagggc 55801 cttacaatca gaacatggac ctgagtcttt ttgaggtaca catcccgttt agcatatggg 55861 ggggggcctc cagaatttgc cctataaaca aggtcacctt attatgaaat cttctggttt 55921 ctgtaaccta ttccagccac aggacattgg aaacagtttg ggatcataat cctgaaagac 55981 acaatccaaa tgccataatc ctaaaggttg aaattccaaa agatcaaaat ctgtaaagtc 56041 caaatccttc aagtctaaaa tccctcacat cttaaaatcc tgaagatcac aataacagaa 56101 tagttgtgtc atattaggcg gaactattac ctcgttagtg tctctatttg ggaattaagt 56161 atggtttaaa gagaagaaat tggtggaaaa ttcagatgat tggtcatgct atatggcaac 56221 aatgaaaact tcagtttaaa aagccatcgg ttgcctgcat tagcattcct tccatctcat 56281 tacattccgg gggcttttaa tgaattaaag ccacgtttgc ctgaagaagc cagtgaagtt 56341 tctgcctggt tagaaaataa ttatgtgcat gataggataa gaagacccgc aatagttttg 56401 caccagtatt atttcgagaa tggacttcta tgtacccaaa acaacatgga accaaggcac 56461 caaagatggg aaaacttaat agggaaggct attatgttgt tgtatacaga atcatagaag 56521 aatttttaaa agagcagtgc cacatagaaa atgaaggtga acatattctc tgaggaaagc 56581 catgtcctaa aaggaaaaaa gcagctattc actgaaatgc aagatttcaa aatacagtta 56641 atgcctgtga aagtcagtca ttttttacag actacctttg cacaatgcct ataatctatt 56701 cctgtaacat agttcttcat atgcgaaact tcctttttag ttttttcttt tttaagtttt 56761 attttattat tttaaatttt cagcattatt tttttttaca attcactatg cacatttcat 56821 ctttgcatca tttccaatac tggaggtata aattgtgtag agactttcag aaagttctaa 56881 ttcattgcat acatttttgc aaatttgact tcgcaaaagt atattatcac aacaatgact 56941 ttgtgtgtaa acatcgtgcc tgtacgtaaa acgttgaaac atcttcagta aataaggaga 57001 tgttcctttt gtgcatctgc atttgtgaaa gataaaattt ctcaagatct cagatctttg 57061 gacaactgta tgttatgatg gtgattcact gtggtttttg attgatccca tcaaaagact 57121 tgggttgtgc atcacagtat ttcagataac tgcaatgtta aatctggctg cacacaattt 57181 ccaacaatag taatatgtac ttatacattt ccctttttga cctatttctt tatgcatatg 57241 attcattttc tcatcactat tatattcata tgaccgttgt tagtataccc aagtgcttat 57301 gcttgcaaaa atgtgtattt ttgcctattt tattgtgtaa agtagcctat agtgctctgt 57361 cacgttttca tgtgtctcaa gtaaatctcc ttttgaaaat gtaaatcata gctttaaaaa 57421 aattttaatt ttttttccag aattatagtt tcaggatttt gatcttttgg aatttcaaca 57481 tttcgaatta tggcatttgg ggttgtatct tggtgattat atttggctcc cataggaaaa 57541 taactgagtt tacatctaat tccattttag agtagataag aaagcaggtg agggtgaatt 57601 ttattaagtg ttttattttt atttttaatt aattattatt tttattttta ttttttgaga 57661 cagagtcttg ctctgttacc agcctggagt gtagtggcat gctctcagct cactgcaacc 57721 tccacctgtg gggttcaaac aattctcctg ccttagcctc ccaagtagct gggactacag 57781 gcatgcgcca ccatgcccag ctaatttttg catttttagt agagacaggg tttcaccatg 57841 ttggccagga tggtctcaat ctcttgacct tgtgattcac ctgccttggc ctcccaaagt 57901 gcttggatta cagacgtgag ccactgcacc cagccttagt gttttgtatg tgcttggcct 57961 aatgtaatta ttttaggaat actacttcat ttaattctta caagaatcct atgaaggttg 58021 atatcatcat cttcatttta gaggtagtaa aatgaaactg atgtattgat tgtcaagctt 58081 caaattctga aatatgagaa aggctgtgta tgaatgatga gtcattcatt gaaccatata 58141 acaaaatatg taaaagtatg aattctagag tctagtagag tttgagactt ggctttggca 58201 catccagctg tggaacactg aacatgtcac ttcagttctc tgagcctcaa gcttcctcag 58261 ctttattgct cagctttttg cgaggattaa attatacagg gaatgcactt gttacaatac 58321 actctgtaca gtaagcttgc ctccactgac ttctgcaaaa aatctttaaa caatttttat 58381 tgcataatat tttttattat ttaaaaaatg atttacatat attgagctag tcttgtatcc 58441 caggaatgaa gccaacttga ttgtggtaga taagctgttt gatgtgctgc tggattcagc 58501 ttgccagtat tttattgaga atttttgttc tcaataaaat ttttgttctt tctgatgaat 58561 caatgttcat cagagatatt ggcctgactt tttttgttgt tgtatctctg ataacttttg 58621 gtatcaggat gatgctggcc tcatagaatg agttaggcaa gagtccctcc ttttcaattg 58681 tttggaataa tttcagaaga aatggtacca gctcctcttt gtgcctcttg tagaattcag 58741 ctgtaagtcc atctgatcct gggctttttt ggttggtagg ttattactgc ctcaatttca 58801 caaattgttc tgatctattc aggaactcaa cttcttcctg gttcagtctt gggagggtgt 58861 atgtgtctgg gaatttatcc atttctcgta gattttctag tttattgcta tagagctgtt 58921 tatagtattc tctgatggtt gtttgtattt ctgtggggtc agtagtgata tccgtttatt 58981 atttttaatt gtgtccattt gattcttctc tcttttcttc tttattagtc tagctaataa 59041 tttatctatt ttattatttt aaaaaaaaca gctcctggat tcactgattt tttgaagact 59101 ttttcatgtc tctatctcct tcagttccac tctgatcttg ttttttattg taattttttt 59161 gtcacttaaa attgtgtgtg aatgttgaaa tactggaaga taaaaatctg gaaaatataa 59221 aatgtataaa acatatatgt acagtttaaa gggtaataac cagatgaaca cctgtgccca 59281 gcaacaggtt tataggagca ttatcagccc ccatgaagcc caggtgcctg cccctgatca 59341 catccccttc ctctctcctg aaggtaacca gcatacaaat tgtgtcaact cttctcttgc 59401 ttttctttat attaatagtt ttaccatcta tgtatgtgtt cctaaacaat atatgcttta 59461 gatttgcatg tatctgaatg ttttagaaac agcttcatct tgagatataa tttataaagc 59521 ttactgtttc aagcatgcat ttgaactttt aataagcata tactgcatat tgttctgtga 59581 tttcctcttg tgtgtgtgtg tgtgtgtgag atggagtctt gctctgtcac ccaggctgga 59641 gtgcagtggc gcaatctcgg ctcactgcaa cctccgcctc ctgggttcaa gcgattctcc 59701 tgcctcagcc tctctagtag ctgggactat aggcgtgcac cagcatgccc agctaatttt 59761 tgtattttta gtagagacag ggtttcacca tgttagccag gttggtctcg atctcttgac 59821 cttgtgatcc gcccacctcg gcctctcaaa gtgctgggat tacaggcgtg agccaccacg 59881 cccagccgat ttcctttctt ctaatcacca ttatgttttt gagagtcatc taagttgatg 59941 tatttggttg tattttattc attcccattc ttgtagggta tttcattaaa caaaaatacc 60001 agaatttgtt cccattctac tgttgatgca catataggtt gtttcctgtt ttacactact 60061 acaaatgata ctactatgaa cattttgaac gtgtacacat gcaagagtat aactaagagt 60121 gaagctgttc tgtcgcaggg tatggacagg atcaactatg tttcttagga ggacgagacc 60181 atcactcaac ccactatctg aaataaaaag acaaagctga atgagtgaca atagttagga 60241 agagctgtgt cggtcaggca caatctttcc aagcagagca gcctgttcat cttatttatg 60301 tatggatttg ggaagctggg agttcggggg tttgtgaact ttcggaatgt tagaggtttg 60361 acggattaaa agacatggtg ctcaggtaag actgttgttg attggctggg aatcacttac 60421 tagagtgagt ctgtccctaa ttggctgaaa gtcagaagca aggggctgtt attgattggt 60481 tgttttgaag agcatgttta ccagacgtgg gttgtcattg actgattaaa gggtttaaaa 60541 tataggtggc taaatattac caaggttaca gaaaaatcag tttttccttg ctgtatggga 60601 acttttttat gttcaaaaga tattgccaac aaattaattg tccaacaaat gtctttattt 60661 aacttgttgc tatgggaaaa ctttcaaata taaacaaact ggagaggata gtatagtaaa 60721 cttctgcgta ctcattaccc agcttcagta attagcagtt catagccaat tgtgtttaat 60781 ctatatgcta cctactctcc taactcctgg attattttga agcaaatctg aaacatccta 60841 tgatttcata cccataaatg tttcagtatg tatttctaaa agataggaac tttttacaaa 60901 cacctaatgt attatgctta aaattacaca gaataatttc taatatcatc aagtagtgtt 60961 aaaattgtcc ctattctctg tttctctctg tctatcattt cctcatttga ttcagactcc 61021 aagtaaagtc catacacggc attttgttga taaatcttgt aaatctcttc taatccacag 61081 attgcccctc tctctatttt cctttcctca gaatttattt gttgaagaaa ccaggttctt 61141 tatctggtag aacttcctac agcctgcatt tgtctgactg cattcttgtg gtacatttaa 61201 catgcatccc tgtgctgcat agttcctgta aactgaaagt tagagctaat ggccttataa 61261 gactttagcc tgattttttg acaagaatac tttataggtt gtgttgtgca tttccaacag 61321 gaggtataaa atgtctgata tactctctcc ttctctctct tcttaacaat acgattgatt 61381 aggtatcatc agcctgatcc atccattaca aagtttctca tcagcttttc agcattgatg 61441 acattgctca gatcctatcc cattacgggt ttcaaaagag tctcctacca tctcttctgc 61501 agtttagagg acggcactgg cttcagaggg gaagatacat tggaaacagt taatgcagaa 61561 ggcagggaaa ataggaggaa agcaggccac aggtaggaaa acaatagcat tgctggagca 61621 aaggttttgt gcatttaaaa atgtgagggg tattgcaaaa ttgtactcta aagaggttaa 61681 accaacttac acttttcaat aaatgtttct acataccttt gtcaatttga tgggtgaaaa 61741 actacacctc attctggttt acaattgtta attttaagtg aagtggatca tattttcaaa 61801 tacatattgg taatctgtat tttttctttt aggaaagtct gctgctatct tttgcccatt 61861 tttctttttt ttttcttttg agacggagtc ttgctcttgt tgcccaggct ggaatgcaat 61921 ggcacaatct cggctcactg caaactccgc ctcccaggtt caagcgattc tcctgcctca 61981 gcctcccaag aagctgagat tacaggtgtc tgtcaccaca tctggctaat ttttgtattt 62041 ctagtagaga tggggtttcg ccatgttggc caggctggtc tcgaactcct gacctcgtga 62101 tccgcctgcc tcggcctccc aaagtgctgg gattacaggt gtgagccacc gcgcctggcc 62161 tttgcccatt tttctatcaa tttgtttaca ccttttattt ttttaaagaa ttctttatat 62221 attagggaga tgagttcctt atctgtcata ttatatgggt tatagatatt tttttcagat 62281 tattatcttt tgactctgat caggatactg ttggctttgt agaaagtttt ttttattatt 62341 gttataacgc atttatcaaa tatttccttc agcacttctg ggttttttaa cattctttga 62401 aaagtccaac ccataccaag accattaaaa atcattcata atttctccta gtatttctat 62461 ggtttcagtt catttctttt ctttctttcc tcctcctccc tctttctctc cttttgcata 62521 caaatgtttg gttcactttg atatggattg aacttttatt tatttattta ttttgagaca 62581 gggtctcact atcttaccca gggtggagtg cagtggcatg atcgtagctc actgtaactt 62641 caaactcctg gattcaaata ttccttctgc ctcagcctcc caagtagcta ggactgcgta 62701 cacacacacc atgcctgact aagttttaaa aatgttttgt aggaatgggg ttttgctatg 62761 ttgcctaggc tcgtctcgat ctcctggcct caagtgatcc tcccgcacag gcctcacaaa 62821 gtgctaggat tacatgcatg agccaccacg cctagactgc acttttattt cttgccagtt 62881 gttgtctcaa tacagttaat agaatgactt atatttcctc taatcattta aagtatgact 62941 tctattgtat actacatgct gtatatattt gtgcctattt ctgaatcatc tgttttgttt 63001 cactaaccta ttctttcagt acattgactc agatccttga tgtattcaat gcgttggcat 63061 tctatattgt atacattaga tgcctccttg aaaggttgtg tgtgaagcaa aaacaattag 63121 gtgcataatc tcattggcat cgtttagtta ttgttgtgtg acataccaca ccaaaactca 63181 gtggcttcca atgacagcca gttaagccca tacattcata gggtatacta agctggattt 63241 tgtaggggca actgtgctct gcatggcatc cattcccttc ccagggcagc aggatcatct 63301 gcctcacggc gatgtagaaa cacaggaagg agtgtgcaaa gactaaggcc ccagaagacc 63361 caggctttcc cgacccactg ccacttctgc ctcatcttag tggtcaaagc tacacagcta 63421 aacccaaaat caagggtgag gaaagttccc tttgccccca tgaaacatgg caaaggtgtg 63481 gatgcagaga gggtgaagaa ctggggctac aatcccaatc tcccacagct actggcacat 63541 aatttaaatg gggaagcttc catagaaaat tccaggtgct catctaatat taagctaaaa 63601 aggagctgct aacatatatt tgaaacacac tgaaaattta attccccaga aaagtacagg 63661 gtaacataac atccatgtgt ctatctccaa taattaacaa attaatattt taacatattt 63721 gcttctgatt tttttctttt tttagaaata aaatatttta gagtttatgt actctgtata 63781 tccttccacg gttttatttc tctcactatc cagaagtaat tagtacctta aaggtagtct 63841 gtgcatattt ttattataca tacagcattc attaagatta atatacttgt tttcagattt 63901 tacatattat aaatagtatg atactatata tataattcta aaaatagctt tttaattcaa 63961 cagtatatct agagatctag ctgtgttaat acatgtagat tgtcttcatt taaattgtta 64021 tattttactc tgttgtatga ctgtattatt ctgttcttgc attgctataa agaaatacct 64081 gagattgggt aatttataaa gaaaagaggt acagaaagca tggtagcttc ttgggacgtc 64141 tcaggaaact gaaaatcatg gcggaaggca aagggcaagc aggcatgtct tacacggctg 64201 aagcaggagg aagtgagaac aggggaggtg ctacacactt ttaaacaacc ggatctcacc 64261 ataactcact cactcactat caggacaaca gcactaaggg gatgatgcca acccattcat 64321 gagcactcta gccccataat tcaatcacct ctcaccaggc cccacctcca acactgggga 64381 ttacaattga acatgagatt tgggtgggga cacagatcca aaccatatca atgactatat 64441 tataattaat cttttttcct ttattgatgg atgcttttaa tggtgttaac ttaccattat 64501 cataaacaag aaaacattta aaaatcacat atttaaggtg aatacatgca attttattaa 64561 aatattaaag ttatttgcaa atattacaca ggtgaaatat aattaaactg agaattcaaa 64621 aacacctaag ggatatatta aggaatatgt tatgaagggc tacacattat taatgaccaa 64681 ttttcttctg aaatatgtgc tatcaaggaa ataaaagaaa gatctaccag caggcagcac 64741 taaatattcc atcattcgca cgaacattct cctacaagcc acagcccaga agactgtcga 64801 tatgcaccta gccttatcat tttctgttaa tatttagtca gtatttccaa tataattggc 64861 ttcttcttca gatcaaaggc ttttacatag gccctacctg caaatcaaat agctatccat 64921 ggaatattag aacttgactt gctccatcct cttaaacttt ttgtgtctca cactaaagaa 64981 atgagagatg cagaattcta aggctaaata gctaggaagt attcattcaa acttgaatat 65041 tcttcaaaga gagtgtgggg gcaactctaa tcagaggaag aaactaaagg aagtaaaacc 65101 agatgttttc caccaaagcc ctccttttgg ctggtctgat ttctaccggt aagtcgaaag 65161 tgcaatttta gaatattgat taacttttaa aaactgattt caggggaaga ctaaaaagat 65221 taaaaagtca ttaatagcat gatcacaaca aagacttgtg tatttggcta ttaaaaatga 65281 aataggaatg acacatggtc ctgaggatgt tttaagtaat ttaaatggaa tgttccggtt 65341 ctttaatgag ggaaacataa tagtgggttg agaagacatt acaaagcctt tcagtctgag 65401 tctgggtctg ggacttaact agaacaaaac tagataataa aaagttgata taaatatttg 65461 attctgattt tgatcaaaat caaattctga tacagatact aatacagaga ctggtatgga 65521 ttaagacaaa gggatggctc ccttaatact caccgaagga ggaaaaagtg aataaattac 65581 tacactgtct gatatttgcc tcaattccca aggataaata ttagttgcca aagtgatggg 65641 attatatctt gatcaaggtg ttgatttgga gcttggctcc aaggcataaa acttttgaaa 65701 cccacctttt aaaagtattt taaagaagag gcttaatggc catattctgt aggttctctt 65761 cacaggatag gtattaatga agagtaaaat gtctttaaat aaaatcaata gtcaaattac 65821 caggctctcg cttttgcttg tttgtttcaa agtgcaaggt tgagtgtgct cactgtatct 65881 ctctgtgaat gttaacactg gaaatccagg acatttaatt ccattgaaca tttattaagt 65941 gcttactata gtaaagaaac aaattgtgcc tgaaacaaag tttgcaccaa gtctgctgag 66001 aaagcagccc ctcctccacg tttcataatt tttcatataa ttggttctac tccttctcct 66061 accttcaccc ctaatctcag tctaggctaa attggtttga catttcagat tatttttaag 66121 aatctccctc ttccatgttg gcaactatga gagatttatc tagtgttaaa caaacttggc 66181 tagtacagag ttcgtttctc ctttaattca aacatttgct tggatttgat tagtggaact 66241 aaccatttca aaattacaat ttctaaaatt attttctgag ccatgctatt ttattatttt 66301 tgcttaatct aaaaataatg atccccatgg cagataatga gcaaagaaaa atagaaaagc 66361 acaacttata ccatttacta atggatctga agcaaagtat aatgatggag gccaatgctt 66421 ataaagaaac acattcagct gggcgcagtg gctcacgcct gtaatccctg cactttggga 66481 ggctgaggca ggcagatcac gaggtcaaga gatcaagacc gtcctgacca acatggtgaa 66541 actccgtctc tactaaaaat acaaaaatta gctgggtgtg gtggtgcgca cctgtagtcc 66601 cagctactca ggcggctggg gaaggagaat cacttgaacc caggagatgg aggttgaagt 66661 cagctgagat cacgccactg tgctccagct tggcaacaga gtgagactcc gatccaaaaa 66721 aaaaaaaaat tgtattcact gatactctaa cgtcacaaag cagaattctg tggccaggga 66781 aaaagccaat tttacatctg aatcaggaaa taaaattctg gaaggacgat cttcccataa 66841 atacataaag tccagggaaa actacagtga aaacagaatt gtcaggatca actgtgtgac 66901 tgataaggca aatggggata ggaaatgtct gaggccagga gctctgctct cacttttctc 66961 ctttttagca ctggcagggc catgggctaa tatatgtgct ggcaagtctt ccaatgagat 67021 ccggacgtgt gaccgccatg gctgtggaca gtactctgct caaaggtatg aagttgcaca 67081 accacccacc caaccctttg gtccctaacc tcttttctta gtgtccttgc ccaacgttgg 67141 ttgcctcaag ggctcacagt tctgacagca tgaaaaacca gagatgtaac cagttacatc 67201 actgtctttg gagcagcttc caagaactct gcttcaagta tgagctataa attgggatca 67261 gaatgtggaa gaaggaaaag atcagggagc aaacacaatg ttttccttct gaatgtcttc 67321 acacattcac ttatctcaac tgtttttttc caaagggtta ttaaacacaa ctatcccatt 67381 ttatagaaga gaaactttaa ctgaggtacc aaagggttaa gggacctgcc aagcacagtt 67441 ggagatggtg aagcctggac ctgaattcag gcttttggtt cctcttccca ccaaaccacg 67501 cggttactgg ctcaggctct ggatgttgaa tattagaggc tggccctaag caggagagaa 67561 agcaccacta tctcttccag aaaatgaggg tcacacttgt gaattttcag ttatacccaa 67621 catcaccttg gggtgccact ttctaatgaa atagccatgt gtccagtcat atcactgatc 67681 tccatcctaa ctagagggga actctcacat atacagatac tacttacaaa gcaacctcaa 67741 atactgaaac aggactaaga gaagggaaat atgggaaacg tcccttgaat tgccaaaatt 67801 cagactcagc ttatactgca aagtttatgc acttgattga tacagtaaat tcttggctct 67861 tgagcttgct cagtgttgct ttgtacataa ttgcagcaat cctggtgaac ctgatcgttc 67921 aggtgacagc ctattagaag gtaggggagg ctggtgagag caggcatagt gacagcagcg 67981 ggaagcaaga gctgaggggt tgctaagacg aactcaaggc acgtcacaat ctagaatagt 68041 ttgatttggg ctccatgcac tttaataaac ccaggaggta tcactcattg tattaaaata 68101 ccatatacag taagtgacgt ttacaatcac ttttctagat cactgcaaga aaaataagac 68161 tagggtaaca tgtgatactt ttatgatagg ttcccataaa aagacagctg ttttcaaagt 68221 gggcttgaag atgtcaagat ctaggtatct agaaaatgca cttacatgga ctgcctcatt 68281 cccccaacat gccataaaat aatagtagct aatatttgct taaggcttat tatgtaccaa 68341 gtaccatttt aaatgctttc atctttatga tatggttatt agcacctgcg gggagggagg 68401 ttaaaattgg gctaaacaga gtgaggcatc ctgggacatc ccaacatgtt ctggcagctt 68461 caagctttgg tacctcgtct gcaaacagtg ggctaacacc atgttcccag gcagtaaagc 68521 tcccctaagg cagttcctca gtctgttcgt tcttatttac atcctagaag tcagaggcct 68581 caccagggtg tggacatctt gtgctctgct ggatctactg tgtacgcacc attcactgga 68641 atgattgtgg gccaggagaa accttatcaa aacaagaatg ctatcaataa tggtgttcga 68701 atatctggaa gaggtaagag aggtggagtc tacacctggg gtgcctcctg gagatgttga 68761 tgctcattta tttaatgcag gttccatacg tagagatttt gtacttcttt caaaatgata 68821 aaacctggaa cagaaagaat atagagtgta cccaaactag ctgatgcatt ttttaaaaaa 68881 cttttatttt aggttcaggg gtacatgtgc acatttgtta tataggtaaa ccacatgtca 68941 cagacatttg gtgtacagat tattttgtca cccaggtaat aagcatagta cccgatagat 69001 atttttttct gatcctctcc cagctgaggc actttcaagg acgtttcacc attttctctg 69061 aaaacttctg cttcatgcac acacttcacg aaaccccaca gctgattgga ggcattcctt 69121 cctgtggccc atgtggtttt gcttaacaat ctataaacaa agccactttc agcaatgcta 69181 accataatgg atagatactt gcatttactc aaatcccaca aatcataaat gctggcactt 69241 ctgcaagagt tgaatgtgaa atatttttga aagcttctaa gttttaggta aagctagatc 69301 tcctggtact ccaggtattg gttttgttgt ttggttactc catcaaggta taaatacaca 69361 cataaaagtg cacaacacat ccatgtatag ctccgtgaat ttcaaaaagt gaacacacct 69421 gtgtaatcag cacccaaatc aagaaaaaga acatcaccca catcccagag atcccttgtg 69481 ctaccactta gtcattattc ccagcaagat aatcactatc tcaagttcta acatcatgta 69541 ttagttcttc ctgtttctaa aaatggtata aatagaatta aacagcatgc actctttggg 69601 tcttctttca ttccacatcg tgattatgca gtttatgttt ttgtatccac gttctttctg 69661 atgcatgata ttccattata tgcgtatact ttcatctatt tatctattct tcctcagatg 69721 gacttttggg cagtttccag tttaagatga ttataaaatt gctgctacga acttccttat 69781 accttttaat gaacatatat atgcattcct attgggagtg agtccagagg agaccctcct 69841 caccaagcct gtgagaggca tcttttgagc gaacccacat ccttgaggag ctacatagtg 69901 gttgtcctct gtaggccaat gtcacaaggg aaacttctgc cattgaactg tgtcctctgt 69961 agtggcactt aacagttaat gccaaggtca gtgagagcag aagcagtaat cagattcgtc 70021 taaatctcag agatttgtga cactggctag taatcatggt gactcaggaa ctgaatagat 70081 gagcagccaa ctaaagcctt attggacttg tatgaatgga aatgctccag gtctgatgaa 70141 caaaatgctg acttcagtca tcacaataat gggtcccaat tcccaataaa ttcccaggct 70201 tgaattggtt cacagtccca gagcctcttg aacaaaaggg aggctcagtc cttttgcgta 70261 agggtcctgc tacgcagcca aacatttata ctgtaaacct tacttctagc tgtccccaaa 70321 gggacctgca acggtttagc aggatgacag cacattaggg aagaaataat taggctcttc 70381 agagattact ggacacttgc tcaaaactga cactaattcc tggagatcta aaaggccact 70441 gtggttcacc agtcagacta ggggcttaga gtcaggtggt aaatatagct tttcactagg 70501 gtccatctta cagtgagccc agtgcatcct ccaatctacc aagaggttat ttccccagtt 70561 ccagaatgca taattggaat tgactttctt cttctttcag actgccttgg caattcttgg 70621 ttatttgcat ttccaaataa attacagaat ctgtttgcta atttccacca aaaatcctgc 70681 tggggctttg actgaatctg cagtaaatat acagatcaaa ttgagaacta atatttttaa 70741 agtagtgagt cctccaatca atgaacatat tatgtcctcc acttacttag gtcttccttc 70801 actgttatat taatggttgt tgtagttttc agtgtagagg tcttgcacat cttttgttta 70861 atatttattc gtaaaaccta cctgatggag ttttagctct tctacagcag aactatccaa 70921 tagaaatata acagaagtca tatatataat tttaaatttt cggctgggca cggtggctcg 70981 tgcctgtaat cccagcactt tgggaggccg aggtggatgg atcacgaggt caggagatcg 71041 agaccatcct ggctaacacg gtgaaacccc gtctctacta aaaaatacaa aaaattagcc 71101 agatgtggtg gcgggcgcct gtagtccctg ctactcagga ggctgaggca ggagaatggc 71161 gtgaacccgg gaggcggagc ctgcagtgag ccgagatcgc gccactgcac tccagcctgg 71221 gtgacacagc aagactccgt ctcaaaaaaa taataataat aattattatt attattattt 71281 taaattttct agtagccatg ttgaagtaaa aaaacatggt attttagtat tttatttaac 71341 tcaatatttt caaagtatta tatcaatatg taatacaaaa attattgaga tattttacat 71401 ttttaaatac taagtcttca aaatccactg tgtattttac cattaaagca catctcaatt 71461 catattaact acctttcaag tgctgcataa ccacatgtgg ctagtggcta ccatattgga 71521 cagcataatt ttacagcctt atttttatgc tcacttgacc ggtttgcagt gccactagtg 71581 tatcaaagtc tcctattatt ggtgggtttc tatctatgtt tcctggcatc tcttgtagtg 71641 tctgctttat atagatgact tgtgctattt agtgcatgga tagttgtaac tgttatatct 71701 tcattgtgac ttgtggcatt taccctctta cattatcttc gtcttgtttc aatgcctttt 71761 ggcctgattt ccactcttct ggtaacagga caatagccct tgctttctta ttgcttggca 71821 aatctttgtc catgtgcttg gcaaatcttt gtccagctct tcactttcag ccttttcaat 71881 cgctgtgttt tattccatac agcattcaat tggggtttgc ttttcaaccc aatgggacat 71941 cttttaatag gcaagataag gccattcaca tttatagaca tgactcctgt atttgtaccc 72001 aacaccattt tattgtataa ttattatatt gtactctatt tctctacttg ggtctttgcc 72061 tttttaattc ttttgatatt tagcaaagtt tgtatttttg ttccagttat ttctattttt 72121 ctttatctct aaagtccaat cattctatca gaatatgttt tggagttgac attgttcagg 72181 gctcaccttt tctaactagc tagtattttc ctgctattat tcttcaatgc ctttattagc 72241 gtttcatttg aatctattct tcctttggca tcttgtaatt tagtcttcct ttttaatatc 72301 attttgtcat tttcatctat tcctctcctg agttcattca acattctttt ttttattttt 72361 tgtccatttg ttattttttt aatttcagat tcaagattac tgaagcccag ataaataatt 72421 atacttattt tcttttaggt ttttgtgtca aaatgttcta cattaagcca attaagtata 72481 aaggtcctat taagaaggga gaaaaacttg gaactctatt gcccttgcag aaagtttatc 72541 ctggcataca atcgcatgtg cacattgaaa actgtgactc gagtgaccct actgcatacc 72601 tgtaaatcga aggccaatgg tcagatcttc aaaataaaaa gtcatcttaa aaacctggat 72661 gcataccctt ctcttcaaga aatttgtgtt cacaaaggaa aaatgcatga agggatggat 72721 accccatttt ccatgacatg attattacac attgcatgcc tgtatcaaaa catctcacgt 72781 acctcataaa caaatacacc tatgtaccca caaaaatttt ttaattaaaa aaaggaaatt 72841 tgagtttaaa tagaaacatg ataaatgcaa gaaagaaaac attttgattt taactcattg 72901 tcactctgat gttcatgtga actggttgct tcgggctctt tgatctgtca cctatggaat 72961 ctgagtggtt ttatttttta gatttctcag tcccgaagat ctaagataaa taaacaagag 73021 aacttacgct gggccaggct gcatttttga cacctctaaa actagcaacc tttctgcttg 73081 aaaataagaa aaagcctgaa agtgtttcta tggtacaaga acagtgagtg gtttataatg 73141 ctagatttag tatacattta atacatattc aatgaataag ttgagtttag gtagtagact 73201 ctcagcaata tccttaggtt tatcatattg gttggagaga gagctgcttt catggataat 73261 aaaggatttg ctggtatttc catccactca cccacatgat tatgaatgat ggcagtccaa 73321 gaaaacatat cccaagaaaa cagtgaggct gcactacagg aaggactaga gtgagaatga 73381 ggcctacctt tatctgaagt gtctcatgga attgggcctg tcaacttcat ctagccatat 73441 gcatattaca gctacttatt tctacacatc attgtattat atctatgtta ttctgtctta 73501 gttatattga caacaggtac tatgaggaaa caagaccata caaatgtcca aaggaccaga 73561 aataattctg gctccttaga gtttagccag ttggtaagaa atgaatacat tgaactaatc 73621 attctggaaa cttcatgaat ttctgcttaa atttgttcgg aatgcagaaa atgccagcag 73681 aatgattata ttcttatgtg tagattgtta gaggaaaaca taattaactg aagtttattg 73741 ttagagaaat ggctgcattt gtgtatcagg gcatatgtat gagaatgttc atgatatcac 73801 tatttttcat agcaaaacct tcagtgacct aaatgcccat caacagtgaa atccaataaa 73861 taatctcaaa gttatagaat gtaatagtca acagcagtca caataaatga acttcagcta 73921 aacataatga cctgaatgaa tcatggcaat ataattttaa gttttaaaaa gtccagaaga 73981 ttacataatg gatgataatt ctttttataa cattaaaaca gttaaaatac aatttccttt 74041 ttatataaca aaatcatata cgaaggcgag caagaaaata atgaactcta gttaactcat 74101 aactagatag agagaaccac acagaaagat ataagttact atcagtgctg tagttttgca 74161 gttgggaggt gatttcatgg gtatttatta ttatgaataa catgtcacag aagcaacagg 74221 acttgcagtg ggtggccagg aagaaagagt tatcatgatg actctcaggt aattgagaat 74281 ctatgtcaca catttagcac cttcttatat ctattaggtt ggtgcaaaag taatcacggt 74341 ttttgctatt actttcaatg gcaaactata tctagcactg tctgatttga atatgcatct 74401 ccttggtacc aaataacttt aggttgtttc cagaaatttg gctttcaaaa aataaatgct 74461 gtacaccact gaggacacac aaaaaggaaa tagcacatca gctcttaaag gcaattttaa 74521 aagaaactct gaaatatttg cacaataaaa tataagctta gttttccaaa taatcatcct 74581 tctgattatt taggaggcaa aacacaggtg gatatggata ttgttttaaa aaccagactc 74641 atgacctgga atggagaaat tcctcaaggt agagcaggga cattgccttc ccctcagcct 74701 tctcacatct gctcaatgga aatgcaaact ttggaccaaa ggacttattt ctcaggaaga 74761 actggaatac agcgccctct agtgcaacaa caaatccgtg ttattttccc ccttgtcttt 74821 taggtttgga tgtcaaaacg tttgacttaa gcataaaaat gttacaaagg tttgaatggc 74881 aaaggggtaa cagtatggag aaacttactt ttcattagta accatgatga taatatacac 74941 ttcaaaaatt gtgatattat tctgtcaggt ttgatagcta cacattaata atgtgaataa 75001 aaccaatatt tactttgaaa atgttttgaa ttatctgcaa gagataaatt taatagaaga 75061 aaaagaagga ggtaagtggc caatttaaga ggccaggcta caaaaggaag aatccttgca 75121 tccatttgac atttatcaca ctcagtggca tattacagta atcacatacc tgttgcctct 75181 gtgacatgtt attaataata atacccatga aaaataggct tgcttcctga ctcacctctc 75241 ccatatttgg tacaatacaa atgaggccag caaggatcac aaggaaggaa aaacaaaaaa 75301 tatgagcatc ctatttttgt gtttcttcag cacaacctac agtaagacct ttcccccatt 75361 ctctccattt accaaatttt tgcccacaaa attatccaaa cttctccaga atcatctaac 75421 aaatgtaaac acttttaaca tgaaaaagtt tgcttacaga actaaaatat tctccaaaga 75481 ctaaagtttg aaacccactt aatgtcaaat cctttatcta gaatccacta gttttaagta 75541 aacagtactt ttcatagact tccaaagtag agacattttc attagataaa acatttaaaa 75601 tactctataa gttcttgtct tatttttggc tgttgtagga atgttttact cttccattta 75661 cttacccagt aagtcaggcc aatcaaggta atttacatga gttcatgaac ttagatttac 75721 tctgagtttt gctctaagat tcattatagt gcttagttca ctaggttcaa ctactgggcc 75781 actaaaaagc acaatttaga gcagcacctt cttattctat gtagaagagg ccaatggtac 75841 tcatcatctc tccatactag tcgacaatgg ggagaaactc acccccaacc tcccttctct 75901 agtccctctg ttttgcttca ttatcctcag tgcaaactcc actggcattc tatcacctaa 75961 ccattcgggt atgtatactc ccaactcctc ctagaatata tcatggtaga gtgatcccag 76021 gtgagaccag aggcttatcc cttcagaatg tccaccactg ctctgatccc caactccatc 76081 tctgttaagc tatttctttt gtttgtcctg ttatcctaat caggttttgt ttcattttgt 76141 tttgttttgt tttgttatga gatggagtct tgctctgttg cccaggctgg aatgcagtgg 76201 catgatcccg gctcactgca acctccgtct cctgggttca agcaattctc ctgcctcagc 76261 ctcccaagta gctggaacta caggcgacca ccaccaaccc acgctaattt ttgtattttt 76321 agtagaacgg ggtttcacca tattggccag gctggtctca aactcctgac cttgtgatcc 76381 acctgcctcg gcctcccaaa gtgctggaat tacaggcgtg agccaccaca cctggcccct 76441 aatcaggttt taagtcctgt attatatgat gcttctgtat cctcctgaaa tcttaaagta 76501 taaattgtat tatctgtaat tgtagtacca tccccgtcct caagaaattg gttttatttt 76561 ttttaaccag taaaatcagc tccactagca tttatggagt aattactaat gtagcaatac 76621 tgctctgaga gggaggaaca cataggtaga tatgagaagc cttacagagc ttatagtgag 76681 aaataacatg gcacagtggt gagaatgtac acacaaacat agtctctaaa tacagggata 76741 tgaaagtgct attttttaaa ctgccattta cacatcacaa tgtattctaa ttgcttaata 76801 ggtaaaaata cttataattt tttcctaata ttccaaactg gaaagtgatg aacatattta 76861 attctctctt tttattttaa ttgatttatt caaaatttaa gatactgtgg ggcaaactga 76921 gtggaacaag cttgaaaacc aaagtactta cgtaaaattt attgagtgct tactacgtgc 76981 taaccactag catggcagat tctggggata aaacagtgag caaaaaacag caaggatatt 77041 gcactgatgg agtttacagt gcaatggaac aaacaggtaa taatcagacc agtaagtaat 77101 gaattataac agggataagt gccgcacagt aaagccaggg tgctgagtgc cctgcacaga 77161 gtgcacatcg tggacctgac ctaagagcag ggaatgggct gggtgtccag gaaggcttcc 77221 atgatcaagg acatacagct agaggaaaga tctgaggagc acagagagta agccaggcaa 77281 agagagatac gagtgaattc cttgtgacta gagggtgcag gacagttcta aggaactgag 77341 aggaaatggg aactgaaggg caaggggcag cttgatggga aatgaggcaa aagaagcagg 77401 gagggaccag accttgcagg accttgtggg ttattttaag gctagatctt caccctcaag 77461 acaataagca gccactaagg agttctaact gtgttgctaa ttaaattcta agatcactta 77521 tacaaaggat ctggcacagt tccagatcca tagcaggagg cagcaaatgc tgattccctc 77581 ccaaacacat cctgcaacat ttactgcttt attcaaatag tctaccaact aacaatcaga 77641 atgactatct cttgtctttt cacctcagta tcattatgaa atgaattagt atttttaaat 77701 tttaatcgaa tgtccattat ctctcaatat aataaagtag aatctgaagt actttggaaa 77761 ttagaagttt tttactgaat cacaagatga tctcataggg tgctaaaaat gttatagtta 77821 aataagcttt aaaatatttt atattttata aaatatatta ataccaagat aaaatagaat 77881 caaataaaac aatgtaataa agcaaatgac ataactttac agaatccaag ttcaatatta 77941 cacacttact tagcaagcta caatatgcct ccaaggaaag gtgagaaatc tacttagaaa 78001 ccatctgtag tttttggcat atcgaccatc taatccatat acagggcttg tgccctattt 78061 acagcaattt cttgcttttc aaatttctta tttttttcca aatgtttctg cagcaatcca 78121 tcacgacagt ttgcaaacaa tcaaatattt gacctaaaat acagcagcaa cacattttaa 78181 aatttacatt tgcttagata gaaagcagct aatcatactg attaaaaaaa caacaacaac 78241 aatttcagaa tctttggcag cccattacca gagaggcatc acatcaggga accatactct 78301 tcccaggtat ttggagactt cagtgtgaat ttcatctagg ctataatcct catcagggat 78361 caaaacttcc tccattacag agagctgtgt taaccttctc tcacacagtc ttacaaacct 78421 gatgaaggca ctgcagctaa cttcacattt gctgaggccc aaggctgtta ggtttgtaca 78481 gtgttcagca atacaaataa gttcattatc aagaggctga agatcattag cacacaccac 78541 taactcaatc agtcgaggac agttgagacc tacccgtcct aaaaccactt tgctgactga 78601 acgaccaaaa taaaggtgag taacaggggt ttcttctttg aagaacgtct cgaattcctc 78661 ttcatataga aagaagtgca taacaacatt aactctaggg gaatgtttaa taagtgcatc 78721 ccaactgtgt tttttaacag catgaaattt aatctgtcca ggattttcac tcacaacatc 78781 aattcgaaga tgttcaaggt taacatgagt ctcgcttgag agtgcaagga aaagttcatc 78841 agttaggatg taataattca acgccagttc tctaaggcct tgacaacggt cagctacaca 78901 aagaattcct gtgggaagtt gggaaaaatt ctcaagaaca aagaatcata ctgatcctaa 78961 tataatattt tgtaaattta aatttaatat taattattca ttatttgtga aaatagctat 79021 ctaataacaa cttctgagag catggctgcc aggaggtatt atgtatataa tgaaaagcat 79081 atgtatatat gtaaaattaa aatatctcag gagaaactta ctataactat acctaccttg 79141 cagattaaga aatcttttga aaaaacttaa ataatttatg cttcataggt atataatttt 79201 caaaaggaaa ttgcatcttt atagttattg taataaaaat atcatatcat tctagcttta 79261 aaaaatgtaa cccatcatca gatgaaacat gaggacagct actcatcttt gggagtctta 79321 gagtgtcact attattggcc acaagaatct tcaatgaagg atcatccact ggtgtatctt 79381 caattttgat tgatgataat gattttgagt tgataaaaac aactgtaagt gctgacacaa 79441 aatgagactt gcatggcaat aagaaaaaat tatgaaaaca ttattttaaa atgtacagct 79501 ttaagtaatt cattaaaact atatcaatac atgctaatga taacatagta agaaccatga 79561 acatacggct gtttccacag aatcattttc tagtagaaga tatttttata atgaagtata 79621 gttttatact ttttataatt gataaatgaa atagtgagaa aaaaaagctt caaagatttc 79681 ataaaacatg tcttccaatt acttcaatta aggatttggt tttcaatatt agaaggacat 79741 gattattgta tatttatttc ctagaacagg ggttggtaga ttacagacaa cttgccaaat 79801 ccagcccatg gtctgcttct gtatggctca ctaagaatgc cttttgtatt tttaaatggt 79861 tggaaaaatt ttttaaagaa taatatttta tgacatgaaa taaacttgaa tttcagtgtc 79921 cctaaataca gtctgatggg aacacagcca tgccctttat ttaaatattg tctgtgtctg 79981 ctttcagact ccagtgttag agttgagtag tcacagcgta actacttggc ccccaaagcc 80041 tgaaatattt gctatctgag tatttttgac ccctgcccta gaagaaacag aatttttatc 80101 tgatttatcc actcccgtat ctctagcacc taaaatagta tctggcacat agcacggcaa 80161 atggatgata aaaatttgtt gaatgaataa atgaggtttt aattcatttt ataagatttc 80221 acaatcttca gtgacatgct tttcctagcc aattatccaa agtaaatcca agtcgtatat 80281 gctttgaaga ctgacagact aaagcttact tctagtttga tctgaaaatg agtagaattt 80341 cagaaatttt cagattcagt attttttggc aaagatagaa agtctggaaa ccaaatctgt 80401 acatgatcac ttttcccttt ttaaaatcaa tcaatttgga ttctaccaaa ggttctatta 80461 tgcctaacag aatgaccaaa ttggttataa atgttcatta atttataaga aaacatattg 80521 atttatacta attgtaaacc ggtttgcaca tattttcaaa tggtacaagg ctaaacttct 80581 tctgtaaatt ttatggtgcc aaaattatta ggggaatata aatacaaggt ttagtttccc 80641 tttaatcaat tgtcaactga ctgctttagt cctgaaggac gactttaaaa aacattagag 80701 tacatatttc attataatag caacatatta gtttagtact atttatgctt agcaaaatgc 80761 tatactttta tttttcctta gttaacagat taaaaaaact gagtctcaga gagatctgtg 80821 gtcttgccag aagccacaga gattacttta aacagtagag gcagagcttg aaatgaagct 80881 tgctaattaa ttcctggtat ctagctctgg tgactgcatc agctaaccta gaagctggcc 80941 ctggaagctg atgtgcaaca ttagagacag cttttagaat ataagaagaa tttcagggta 81001 atcggattct gtagttcata aacaaggaaa taactagaag tggtaaattt catcagatta 81061 cttacattac tctaatccct aaatgaccca ttttttttca gcacatccct ttagaagata 81121 aacaggccag caaaaatact tgctagaaga gtgccatgac tcagtttata agagagaggc 81181 tgctattcct ctatcctcct ccataggaaa atttccatta gccagcttta ccaactacca 81241 taatttaatg cctatcatat atcaggtact attctgactc tgggtttata atggtaaaca 81301 aaatagaatc tctgccttca atgagtttat gtttggtata aggagaagag aataaacaca 81361 ttaaataaat aaggtggttt caagtactga tgaatgacat gaagaagata aaaataatgt 81421 aatggtagag tacattatta ctggagcagg aggtgccatt ttcacaatgg ttaagggggg 81481 gctctccaag gagatggcat tcaggttgaa gcctgaatga ggagaatacg gtatgagttt 81541 ctatggcaga gcgtggctgg ctgcttgctg gcaaccattt ttcctttgtt tgttaatcca 81601 gtcccaattt ttagctgggc acactgccac ccagctcaaa gattccccag actccctgga 81661 atctggtgtg gccagactta gaggtaaaca gaaacactgt gtggtactta cgggtagtct 81721 ccttgataag gagaaggcac acccttcttc ccccatttct aacatcctag cccccagagc 81781 aaggatgtga tggtcagagc tcagatggtc atcttgaaga aacaccctgg agagagtgga 81841 gtggctggaa gaaacctagg ttcctaccac cttcatacca gctatggact gtctcaccta 81901 tcaatttctt taatgtgaga gaaatatact ttgatcatgt ttttaaacca cttttgtttt 81961 ggagtttcta ctgcatgaat gtaaacatca tgctaacaag cgtagttcat ttagttaaat 82021 attatattgt tccacctcca aaagacaggc cacagaatca caggaaaaac acccactatt 82081 gcctctacat gctctatgtt caaagtaact acataaattg ttatttaagt agaataagca 82141 caatatccaa attttatatt aatgacactg tggtaaccat gactacagtt acaaaatata 82201 tctaaagaat gaaaacatgc ataaactatt tttgcaaaaa taagtgctag tcaacagaaa 82261 acagctttaa ataatacttt gaatacaaat aggagatggt ctgtctttag gaaaaaaatg 82321 gcttttgcct agtttggcag tctttaggga ttttagtata gctcaccaaa aacgcataat 82381 atccttacct ccgacacatt catgaaactt ggcttggctg ttgaaatcaa gcccaaggtc 82441 tggatggaac aatttaccag ctgagagagt atatcacagg cagcttctgc tgactcagcg 82501 ctactgtcaa cctagacaga caatttttaa tctcgaataa tttcctgaag ataatttatc 82561 agaaatactg gcaatgtaat tatttagtag aattcagttg caaaatttta ttcattcctt 82621 tgagctatca tataaatgag tattttttct tcatacaaaa tgtctgagat gataaaacga 82681 aaaacagaga tgatgacaaa ttagctttcc aagcagttca tattttcaaa tcttttacaa 82741 ttacctgaat tttaataaca gcatacaaat gatggaaact gcattttaag tttttcaatt 82801 acggttcaag ttctacactt caatcgatgc aaataactgc agaaaaacag gaaacctctt 82861 tggataaaat gatttgcaaa attcactgac tgcctcaaaa taaggctcta gatatacaaa 82921 agcattgtta acgaaatgta agtcatggtt tccatttaat gcttattttt cttaccttaa 82981 agctgacata ctgaagatga gcaaaatgct ttttaatgat ctgctgaatg agatcaggat 83041 gagtggactt aaaagatgaa gtagctgact ggttcagttc aaattcaaac tttctccaaa 83101 ggtcagaaat atgaaaaact tcattccacc tcctacatac agaagatgca caggcccgat 83161 ctagtaaagg aagttactga aaaatttgta atactacatg gtgaggcaaa ctcccccagt 83221 ctagaagaac cgtgtgtgta tgagtctggt tgagagaaga gtagaaccca acttttggct 83281 gtttcgctgc tcctgacaac tggacaattt tattctcaac agataaactg ttcctcttca 83341 ttctgtaaac attagaaaat gttcagtttc tgatagtttt ttcacatttc caccaaatat 83401 tggcactcta actgtacaga ataacattag aaaatgaaaa attgtttaat acaattttag 83461 taggtatctt ccactttcat gcatggatag gtgctacctc atgctgggga tacagagata 83521 aataagtaca aacctgttct caaagagcta tggtaaaatg gtgaaataca gtgataaatg 83581 ctatgaccaa agttgggaca agtttcaaca gaaggaaaag aggatgcatg tgattgagtt 83641 tgtgtgtgga agaggtggct tgggaggctt ccaggaggaa atgtccaagg aagcagtgta 83701 tgtcccaagt tttttttttc ctagtcaatt attgtatttt atctctttca aagaaggaag 83761 tatcacattg cgattgtcct taaaaaaaaa aaaaaacatg ggaggtctgg gagatagtaa 83821 aacacaatta aaagcatagg tctagataag tccctactct gctacttact aaccatatga 83881 tcttgtcaga acaaattact ctaagcctca gtttcctcat ctccaacaag aaaagtacct 83941 atatgatcat tgtgagtatt aaagtgccag cagatagttt agttgcagtc tgtgttgcta 84001 tattaaattg tccagtgcag ttttctgaac tgttttcatt acttaagtca aagcacaatt 84061 ctaacaaaaa ctcaaatgat ctgcaacaat tttcagacga cagccaggac acaggagctc 84121 agagaagtta gatatggcag ggacaaagtc ttagttcaag caacacacct gatcctctcc 84181 agtaaagcag acagggaatg agcattctta ccctccaaac agaaatgatg aatctctagg 84241 tattagccag cttgggacgc aattaagttc cagcaaattt gaagtccaga tacgtccaag 84301 tatatatggt catcttgtac gtatatttta tgtaagaata caaagtattt taagtataca 84361 attatttgaa aggcaaaata atggtaagag tggttctgaa ctttattgaa ctttggttaa 84421 ctatataacc ctcctttaac ccctgcaatg ttaggaatgt acacatgaaa gattcagtgt 84481 cataattgaa acataccttg aaatctaata aaatatttat gtgataggta aatatctctg 84541 tcttgtgatt cactctgtct gatgtgacac atttcttaag cacaacatag agttggatcc 84601 cagaagcagg aatgacctat aaagtcattg aacaaaatga aaaatcatct ttagggtttg 84661 gaaaattgaa atgaaaagca aaaatagtta caaagctgca tctcaagcat caggcaaaag 84721 agctactaca cacatttcag tctgactaaa ccagttacat gcatgactta ggccctgtcc 84781 agcagtacta cgtagactat gtgaaactgc tgactaccaa gaattcaaag catgtgctat 84841 tttaaaaggc ccgatctact actcttcaat gctaactcct taatgctgta aggtacaaga 84901 acagatgctg gattctaaac tctgatgtgg agcacagtca ttgacatttt gcttgcaacc 84961 gtttcatttg gaaatcaaca aaatggataa attttcacta agtttttata aagaagttac 85021 ccttaataga atcctttact tgaaaaccag tgatgctaac ataaaacaaa gacatcactc 85081 aaatgccagg ttagaaaact accttgcagt ccttaacaga gggactgtgc tatatattaa 85141 atctcattgt atgtggcatg gattgttaaa attatttaat gatatcatta aattgccaat 85201 tcactgatag tagaattgca ataagtaatt taaggtatta tattaatacc agtaacttgt 85261 taaaaataat attttaattt tttttaagtt gtgtaatctt ccgagcataa agattatgct 85321 catccacaaa tcaaacactt catatgattt tatagctatg tacattgtaa aatatatttt 85381 agacaaattg atctttatta gcttgaaaac gttaatctaa acattctagc ttaaagatat 85441 gtgaccattt aaaaagtcga tcatttcttt ttcaagcttt tcttacctaa ggaaggcttc 85501 cagattagag ttcactctga accatgaaac gaagcaatgg ttgccgtggg gtactcacca 85561 acctccattt gatttcactt cctatcttgg tgaaatacca caaaaacaca ttcatcagaa 85621 tacataaata aacattcaat aatttctatg cttaggctct ttatatatca ctctatatat 85681 tgaaggattt tgttagaaat tattgattct gaaactattt tttgttgttg ttcattgatg 85741 aaaccttgtc tctactaaaa atacaaaaat tagccaggca tagcggcggg tgcctgtagt 85801 cccagctact tgggaggctg aggcaggaga atggagtgaa cccgggaggc ggagcttgca 85861 gtgagcagag attgcaccac tgcacgccag cctgggtgac agagcgagac tccgtatcaa 85921 agaaaaaaaa aaaaaaacca cgcaaaaaaa cctagccctg ccgggcgtgg tggctcacat 85981 ctgtaatccc aacactttgc gaggccgagg caggtgcacc atttgaggtc aggagttcaa 86041 gaccagcctg gccaacatgg tgaaacccca tttctaaaat taaaaaaact ttttaaaatt 86101 ttttaaaata caaaaataca aaaattagct gagtgttagt ggcgcatgcc tgtaatccca 86161 gctacttggg aggctgaggc aggaaggcag cggttgcggt gagctgagat cacaccactg 86221 cactccagtc tgggcaacag agtgagaccc tgtcttaaaa acaaaaacaa aacaaagtaa 86281 aacaaaacaa aaacaaaaca aattccaaaa aattagcatg gttagtggaa atcttgttat 86341 gattcccagg aattattctt gaggctttac caaggctatg gatcaaagag tgaatggact 86401 ctgagaatct gtattttaat taatgtctat agagagatac tacaaaaaat gtgagggaat 86461 tcatgactac aattatctgt ttacaggctg tgtaaatctt ccagttccag attttagtag 86521 cacataatac caccacagtt atagtaaaga ggcagacatg agtgattttt attattttat 86581 cattcccata tgaatatagt tggactagtc tgctgttaat agaatatatg cttcatattt 86641 aaaaatcatt tcatatatat tgatagagat ggcattgatt cagtcatact ctttgaggtt 86701 tgccaaattt caggaaaatg acttgtcaga gctagcagga aagtctatgg aaactgaaga 86761 aaatagtaat ataggtctac agcaggataa aactcacacc cttggcatca tgaataaatg 86821 tcgtaaggta aacataaata aatacattct actacttttg taaaccaaac tcaactacac 86881 agaggagtta aacatggcaa cctattaatt tactaaacag ttcccttaaa cctctactgt 86941 ctaccccctg ttaattttcc atttctctct gaatccaact cctaaagcag aactggggag 87001 gacaggctag gggaatgtat ttgagctcac gatctaccat ttgatttcct cactgccacc 87061 agctgttaca ctgttttcca gataatgtca cggttggctg tccttccttt attccagatg 87121 tagagggaaa ggaactccag ggcacacttt cagtgaaaca ccaaaaataa aagacaagtc 87181 gagttactag ttaactcaaa tcagatgaca cacaatttta tttactgcag aaacttttct 87241 cctgagttac acttatggct aatttctgtg ccaataggcc attcttcatg tgagaatact 87301 gacataatac tgacatatca atagggagtt gctacccaag accagtaaga agatggtgca 87361 ccactaccta tctcagatca gttcactgaa gtcttccgaa tcagatgaat tacttccctg 87421 gttgctgaag gaacatgatg gctctcatag tcacgagaat tttgaggaca ggtagggaac 87481 aaaaccagaa tacaggactc catcaagtga taagccattc ttccaaaaag ggagaaaaga 87541 gcaagctgga cacggatctt cagcacaatt ccagagtatg aaacaagttt attagcattg 87601 aggagctcca ctaaccacct taactcaaat gccttttttg atgtgttttc taatttggta 87661 gatcaggtga ctacgtgaat gggatcctct agaagtatct aagagattgc cacagtattc 87721 ttgagggcaa gattatcaaa gaagatgaga tgaagactgt ctgaacatcc acagccaact 87781 cactaaaggc catggaagca gctctcttct gcaccatcac taggttcttc cagattggtg 87841 aaatgaatac ccagaagtga ctttcaacag tttcaagttt tcagtgatca caacaataga 87901 aataataggt gaataccaca gataacagaa catagatgca aaattatctg aaaagggtgg 87961 acaaatttgt tgggagcagc aaaagtaaag ggaaataatt attaaatgtc ctttatttaa 88021 tctccttcct aaagtttcat gataagcaat cagattgttg ccccaggcag acggtccgtg 88081 ttaaggagcc tggaatccta ctcaccagtg caagggcagg ggaagcatgg cctggacagg 88141 agggctgaag cctcgcgctc caaaacacgt aaaaacacac agttttccaa caaaagaatg 88201 gagcaacacc tctttctact aaatataatc caagcaaagg acaaaacata atttgttgga 88261 gacgttgtca aacagattta tgtactttgc agcccataag actctaagtt cccttctaag 88321 tctaaaagtt acaattcttt ttaaaatatc aaatcaaata aaaatatcac cgagatatta 88381 aggagacaaa acatgaaaaa tatttatgtc aaatatttcc taaagcagtg gttgccatct 88441 ttgcagtgtt ctggcatttt gtcactggtt tagcaatttt ctacactcaa gtctctctgt 88501 ggtactagga catatctact taaaaagaaa atttttactt cctaataaac ataatatgta 88561 tgcttatttt taaataaatc atatgaaaac tgaagttttc caaatatatg tagattgaaa 88621 acacctttgg agacgcaagg gacaaagctg aactgggcag tttgtgaaaa tccctgaatg 88681 gggggcgcct agggcatctg tttctctctc cacatcgacc ccaaagtcgc gtctctccga 88741 ctctggaagc gaaaagcgag cggacagcaa gccaggccct ccgcggggag gccttccgag 88801 ccagggagag cccgcctccg gatgggcatt ctaagggggc taacaacgaa cacaggtgtt 88861 tccagaaatc atttctgcaa gatttttctg ggagaaaaca atcatcattt cttattcaat 88921 ttatcttgga agcgtcttaa aaggcacact cgccaaagac aaagacggta caccttaaag 88981 cgcacaggtg agcgctttca gctcgccaga aggctacttt ttgacatttg tggcaactat 89041 aaactccggt gaacttaact gcaagcagtc atgcaaaaag caacctcggg tagcactgat 89101 cactggacga acgccgccct ttctaggctc actccaccga agagccagac gcgcggctca 89161 tggggtggcc gggtgggcgc ccggcactgc agaaacccag cccagcgaga aagcggacgc 89221 ccgggggctg cgagtcctca ctggcggccc gagccaggcc ccgctcaccc cggcttcggg 89281 ggcggccggg cccgggccgc gggaggggtc gctccagggg gcgctcagcc gcccgagacc 89341 gccccgacgt ggcccagggc gcgagcgcac gcctcccggg cccaaagagc gagcccattc 89401 cgccccacgt cgccaccgcc ggcggtgccc ccgcgacacc cgaaaaatgt gctaccttta 89461 aagggtttcc gcgcgtgctc agcgttcgcg ggccgggcca aggtgctgga gtccggccgt 89521 ccggtcggcg caggaacacg cccgagcgag gccggctccg cgcactggag gcacaggagc 89581 gcctaggcac gggggtccgc gcagaccgca gccgcccaag gccccgcccc gcctggcccc 89641 gcctcgcgcc cgctcctcaa aggccgcctc ccccagcagg ccgcggcgat ggcgcccacc 89701 tggcgcagca ccaagtcacg cacggcgatc tgagacccca gccgctcctc ccgcaggggc 89761 ggcggttccg ggagcgcccc ttaggtggca tgtgggaggc ttcctgcctg ctggtcttca 89821 gctaattccc cgtggccacc tgcctgtgac tccacacagg catgagccaa cgcttgtggt 89881 ttctgggact tactcccgat gaagggcgcc cctttatcaa tgcacaaatg caatggggtt 89941 tcgttcttgt tgcccaggct ggagtgcaat ggcgcaatat tggctcactg caacctctgc 90001 ctcctgggct caagcgattc tcctgcctca gcctcccaaa tagctgggat tacaggcatg 90061 caccaccatg tctggctaat tttgtatttc tagtagagac tgggtttctc tatgttggtc 90121 aggctggtct caaactcctg acctcaggtg atccgcctgc cttggcctcc caaagtgctg 90181 ggattacagg catgagccac cgtgtccggc ctcattttct tttctttctt tctttctttc 90241 tttctttttt tttttttttt tttttttttt ttttgaggtg gagtctcgct ctgtcgccag 90301 ctggagtgca gtggcttgat cttggctcac tgcaagctcc gcctcctgga ttcaacagat 90361 tctcccgcct cagcctcccg aatagttagg attacaggcg tgctccacca tgcccggcta 90421 agtttttgta ttttcagtag agacggggtt tcaccatgtt ggccaggctc gtcttgaact 90481 cctgatctca aagtgatcca cctgcctcag cttcccaaag tgctgggatt acaggcatga 90541 gccacggctc ccggccccag cctcattttc ttaatggtgt cttttgaaaa gcagaagttt 90601 tgattttgat gaaatccagt ttatcaattt gtttttttat ggatcatttg tttgatgtta 90661 aacttaagaa atgtttgtct aagtcaaggt cacagtttct tttatatttt cttctacatg 90721 gtttatagat ttagatttta catttaggtc tatgttctat ttgagttaat gtttgtgtat 90781 gctgcaagat agaagtcaag gtttattttt ttcatgtgca agttgtatcc aatttaccta 90841 gcaccagttg ttgaaaaggc tactcttttt ctccattaaa ttgcctttgc acttatgtgg 90901 gaaatcagtt gtccatgtat gtgtaggtat attttcagac cctctgttct atttcgttaa 90961 tatattttcc catctttaca ccaatatcgc accgtcttga tttctgtagc tttataacaa 91021 gtacttgtat tgaaatcaca tagttttagc cctctaactt gttcttcaaa attgttttgt 91081 ctattttgtc tattctagat tctttacatt cctgtatgat ttttttagaa caagcttgtc 91141 aatatctaca aaaatattgc tgggatttta ctgaagattg ccttgaatct agaaatcaac 91201 ttagggagta ttggcatctt tacaatgttg tgtcttcgat aaacatgatt gatatagcgc 91261 tctatttact taggtcttct ttaatttctc tcaacaacat ttgcagtttt cagggtataa 91321 gtcttacaca cattttgtca ggactattct tagatatttc atagttttta aatgttatgg 91381 taaatagtat tgttttttaa attttaattt ctatttgttt gttgccagta tgaagacatg 91441 caattgattt tttatattgg tcttgtatcc tacaaccttg ctaaaatcac ttgtcagttt 91501 tattaataga aatttttatt agaatttagt atattccatt ggattttcta catagacagt 91561 catgttgtgt gcgaacaaag acagtttttc tttcaatata aaatgcattt ttattttttt 91621 ctttccccat tgcactatat tgccggtaca gtgttaaata attaacatta ataaagagat 91681 atatcaatga cattcattgg tttgaagact ccgtattttt aagatgtgaa ttctccccaa 91741 actgatttct acatcaaggt gagaacagac acttttgtct tgttcctcat ctaagaggga 91801 aagtgcttca tctttaatca ttgacatatt gcatggatgt gcatattgca tagatgctct 91861 tcatcagatt gaggaacttc cgttttattc ctcctttgct gagaattttt agatgaagaa 91921 tggatgttag attttatgaa gtacttgtgt gtgtcaattg aaataatctt attgttttcc 91981 tttttagttt attaaaatga tggactacat tgactgatct tcaaaagtta aaccaacctt 92041 gcatttctgg caaaagactt attttatcat aatttattat cctttttacc tattgtttga 92101 gtcaatttcc taaaattttg tttagaattt ctgcatctag gttcatgggg gatattagat 92161 tgtacttttc ttttttagta atatctttgt aaaatgttga tatcaagtaa tacaagcctc 92221 ataaagcaag ctgagaagta tttcctccta ttcaattctg gaagagtcta tatagaaatg 92281 gtattattta ttctgaaatg tttggttgaa ttcaccaata aagccatctg ggtctggagt 92341 ttattttgtg ggaaggcttt taaccagaaa ttcaattact ttaataaatt tagggctatt 92401 tcttctatta ggttagttta tctatttctt cttaaaggag ctttggtagt ttacatcttt 92461 aaataaatct atccatttca tctggattgt taaattgata gacataatat tgtttataat 92521 attctcttat tattcttcta atgtctgtag tatctataat gatgtcactc tctcattcct 92581 gatgttggtc atttgtatct acactatttt ttatctcatc agtctggttt aaggattatc 92641 aattttattg atcttctcca agaaccagcc tttggtttaa ttaattttct ctattatttt 92701 tcctgtttta tattgcattg gtttctgctg tcatctttat ttcctttctt ttgaatttaa 92761 attgctattt attttctagt ttctcagggt agaatctgag ttcattgatt tgagaccttt 92821 cttctgttct aacataggct tgcagtgcca taaatatacc cataagtact gctttagcag 92881 catattgtca actcctaaac tcagggtgtc cactaaactt agtctggatt ccccctctct 92941 gtgtagcaga ctagacattc ttttaaggca ataaaataag ggtaatcata ggacttatct 93001 catttatttc ttatctcttg gagatctttg tcttttgttg cctgatatct tgtttctcaa 93061 aagtgttggg gctctgaaaa tgactccaaa gcgaagacct tagaagcagc ctcaaaagca 93121 aagtttctct ctgatctttt gcccttctat ctctcacccc tcattcttcc ctgaggcaaa 93181 ccatagaaac tagaattcct cttccccaag gtaggtcata gaaaccagaa cccttttccc 93241 ccaaagccag ccgtaaaacc taaaagtgtt gctctaacct tcctctgcct ttttgggtaa 93301 cagctggcta taaagaaatt aagattccag aagggtccta ccctataccc aggaggaaga 93361 aatgctacaa tatttcttcc tcaggaagaa cctaatcaga cagaccttgc taggtttttc 93421 ccaatcagcc tattaccatt agctcatagc cttttgttca tctaattaca tttctataca 93481 ctgtccctgc ttcatcaaac ctaaacaaaa atcagatagc tcccctgtat ctttgggtct 93541 tcgttctgaa gattcccata tcacataaag ctatgatcaa ataactttgt tatgcttttc 93601 tcttgttaac ctgttgtcat agggatgttg gctgtgacct atataatgat gaggaggaaa 93661 aaaatcattc cctttctatt cctacaaaag tcattgtttc aaatactttg tgtgtttttg 93721 ttgttgttgt tgttgatttt gttctttggt tgttttgtcc cggttttgtt tttttgttct 93781 tttatgcaga gagggaattt attccctatt tctctatctt ggttgtaatg agaaatctct 93841 tcctcactta gtaatggcca aatgagcaat ccccttatat ggaaacattg caattcttta 93901 accttcttct attgctgaac atttagtttt tactattata aagtaggctg tgaaaatcct 93961 gtatcttggt gcatttctgt gaatattact gaagaatttt tctagaattt caattattag 94021 tataaatatg ttggattttg atggaaagag ccatatctaa aatgatgtcg gtcagataat 94081 tatgatatac tttacaatac aggaatacac aatatgccat ttgtgataca aaaaagaaca 94141 tctctgtata gtcctttcaa agaaaatatt attattaagc tttgagcaaa caaagatttt 94201 ggttgtttaa gtcagaatag gctaggttat gctgaaataa caaacacctg tgatttaatg 94261 caaaaaaaag tttatttctc agtcatgcta cacttctatc acacttcagc tagaagctgt 94321 catccctgtt gtgttcattc tgagaccaag gacgttagag cagccactgt ctggaaaata 94381 gtctgtgacc atgggaaaga gacactgagg tacatacact gcaccttaac ttctaactgt 94441 atgtcaattc tgttcttact tcattggcca aagctagcta aaatggcaac acctaactta 94501 agttaggtaa agaagtgcag tcctacctta tgcctggaag aagaaacctg gaatgtttgg 94561 gaacatctgt actgactgtt acaagtaaaa ttgtgcaaca aaaatcacat caaaaatatt 94621 ttcacacata aaaagatgtc tctgttttgg tggggggggg tgggatggga gggtggaggg 94681 ggagtgcatg ttggacattt ctgggaaacc ctcaagacag ctttaacttc tgacaccaac 94741 ttcagagctt gtggattctc aagatcaacc tcaaatgtga taatttgtca gaaggactca 94801 cagaactcat ggaaagctgt tatattcatg gttgtagttt attacagtga aaggatacag 94861 attaaaataa tcaaaggaaa gagaggcatg gagaagaatg caaaaaaaaa aaaatccata 94921 tgtagagctt ccaagtgtcc tcttccagtg aaatcatgag cagtgctaaa tttctccagg 94981 caacaatgta tagcaataca caacagagta ctgtcaatca gggaagctct ttcatgtttt 95041 ggttggagtt tggtcacata gacatggttg actgcctgtg tgtggttgct ctttagtttc 95101 cagccttccc cacccctgag gtcaaggtga tatcacgtag cccaaaacct ccaccataaa 95161 ctacattatt agactgtccc atgtggcccg aggttcccag gtaaacaaag acactcctat 95221 caggtaagac actccaagga ctaagagatc acctcctagg agctaaatgc aaaagtcaga 95281 cctttgtttg ggtaaggtta actctttact acactaggat caaattgtat ttgactcaac 95341 atcatttcag aactctgcaa tgaaattgtt tatgtatctt attgtgaaga aaaatacagg 95401 taaactctac caataatttt tctatttaat aaagaatctt tatttttggc actgtggtgg 95461 gataaagtaa tgctatacag gcatatttca ctttattgca cttctcagat gttgtgtttt 95521 ttacaagttg aagttttatg gtaacccagg atcaagcaag tctatcggca ccattttccc 95581 aacagcacgt actcactttg tgtttctgtg tcacattttg gtaattcttg caatattttg 95641 aactttatta ttattatatc tgttatggtg atctgtgatt tgggatcttt ggtgtgtcca 95701 ttgtaattgg tttgggatga cacaaactgc acccatatta gaaggcaaac ttgataaatg 95761 tgtgtgttct gagttccaat cactggctgt ttcctcatct ctctccccct ccttgggcat 95821 ctctattccc tgagacacaa cagtatcgaa attaagccaa ttaataaccc tagaatggcc 95881 tctaagtgtt caagtgaaag gataagttcc ttgtctctca ctttaaattc aaatctaaaa 95941 atgattaagc ttggtgatga aggcatgtca aaatcggaca tacgctgaaa gttaggcttc 96001 ttgtgttaaa caattagcca agttttgaat gcaaaggaaa agtccttgaa ggagattaaa 96061 aagtgctact ccagtcaaca cgcaaatgac aagaaagtga aacagcctta ttgctggtat 96121 ggggaaagtt ttcatggtct ggatagaaaa tgaaatcagc tataacctgc ccttaagcca 96181 aagcttaatc cagagcaagg ccctaacctc tttaattcta tgaaggttga gagaggtgag 96241 gaagctgcag aagaaaagtt tgagattagc agagtttggt tcatcagatt tagggaaaga 96301 agccatctcc ataacataaa agtgcaaggt gaagcagcag gtgctgatgg agaagctgca 96361 gcaagttatc cagaagatct agccaagatc attgataaag gcagctacat tcaaaacaga 96421 ttttcaatgt agattaaaca gtcttctatt ggaaggagat gccatctagg actttcataa 96481 ctaaagagga gaaatcaatg ccttgtacca aagcttcaaa gaacagctgg cctcttttgt 96541 taggaataat gtgggtggtg actttaaatt aaagcattgg ctttacttta atgttcattt 96601 actattctag aaatcctaat tccattaaga attatgctta atctgctcct tctctgttct 96661 agaaatatga caaagccatc tgtttacagc atattttact attttaagcc cactattgag 96721 atctacttct cagaaaaaga gattcctttc aaaatattac tgttcatgga cagtgctcac 96781 ccaagagctc tgatagagat gtacaaggaa attagtgttt tcaggccggc taacacaata 96841 tccattctgc agcccatgca tcaaggattg atttccaagt cctactattt aagaaatata 96901 ttttgtaagg ctattgctgc catagatagt tattcctctg atatatctgg gcaaagcaaa 96961 ttgaaaacct actggaaagg attcaccatt ctagatgcta ttaggaacat ttgggcttca 97021 tgggaggagg ccaaagtatc aatattaaca ggagtttgga ataagttgat tccagcctaa 97081 atggatgact ttgaggggtt taagacatca tggaggaagg aagtgcaggt gaggtggaaa 97141 ctgcaaggga actagagtaa gaagtggagc ctgaagatgt gactgaattg ttgcagtctc 97201 acaagaattg aacagatgag gagttgcctg tcatgggtgg gcaaagaaag tggtttcttg 97261 agatggagtc tatttgtggt gaagatgctg tgaacattgt tgaaatgaca acaaaggatt 97321 tagcatattc cataaactta gttgataaag cagcagcagg gtttgagagt atttattcca 97381 attttgaaag aagttctaat gtgggtaaaa tgctaccaaa caacattaaa tgctacagag 97441 aaatctcttg tgaaaggaag agtcaatccg tgtgcaaact tcactgttgt cttattttaa 97501 gaaattacca cagacacccc aatcttcaac aactactacc ctgatcagtc agcggccgtc 97561 aacatcaaga cgagatcctc caccagcaag aagatgatga ctcactgaag gttcagatga 97621 tggttagcat ttttttttca gtcaagcatt tttaaattaa ggtacgtaaa gattttaaac 97681 ataatgctcc tgcacaccta gactacagta tagtgtaaat ataacttgta tatgcactga 97741 gaaaccaaaa attcatacaa attgcttcaa tgaggtattt gctttattgc agtggtctgg 97801 aactgaaccc acaatatctc tgaatcatgt ctgtatgtgg cttaatatgt aaactgccca 97861 cttccgagag aaaagttcag gtacattttg cacatgtttt cccttggaca ctgcaaaaca 97921 ccttgccagc ataaaaagac actgtaagat ttccctctgt tctaattcat ttcatttaaa 97981 cctgcaatgg attaattatg tcctgaagat tgatcatatt tttcctttaa gaatttgttt 98041 tttactattt aaaatataaa tagcctatta tctacatcag tcaagatttc ttgcctgtgc 98101 aaaaatctct caacattttt gagatacagt ttttacttca tatttcttta gattaagttt 98161 tccctaaagg aaattagtaa aggagctatt taactttact tcaagattaa ggctattcat 98221 tttaaaatca caagtcccag ctagtgtgtc tctaaatgta gaagagcttg ctgttctttg 98281 cattctatcc acagtatgaa gccctcgtaa atggggaatt atgaaataag catttctaaa 98341 tgggtcaggc ttagtcatca ctgacaacat tgattcaaag ccattactct gtgttgaaaa 98401 gtgaataaag tcaagggcat tgtatggagc tcaagatttc attaacaaca tgttcttctg 98461 gtttatggtg gacttttttt gcacagcttc atggatctga agttatcact ttttaaaata 98521 aggaataaaa ataattttta acctattgct ttttactatt gctcttctag taaagctttc 98581 acattgcaac atatttaaaa taaaatgtaa cactgaatca tgttttccaa cgttgcagat 98641 tacttggttt caaaagctga aattcaagga aaataattct acaaatatat tttgtttaaa 98701 aagaagcaag tgattttatt gccatgttta actgaaaatt tcataggaat actttgttaa 98761 tctcgagata attttctgta gtcattttat ttatgaaaga caaaactctg tattattact 98821 ttgattgtca gcataagtct gggatcaaga gaaacagggt tagtccttca aaatcaccta 98881 ataaaagtag gttagtacgt catgttttat agatgtcctg gtgatccgtc attttaaagt 98941 gcatttcaaa aaggagtcat taccaaaaaa aaatttcata gtgtttttag caaaaggaaa 99001 atgttaacac aaagatattt cttataaaga acagggaact gtttcaagct actaaatttt 99061 gatttgtttg tttatggcat gtcctcccat gttaatttac aacctatgaa aactgacata 99121 ttttaagatt aaaaattaag aacttaaaat taagtggttt tgattaattt gatttactgg 99181 ttctagaaag aaaatgctct gaagtgattt tttaaaaaag attatagtta attgaaggaa 99241 tactctgtat taccaattac cctaaaatca taaaatagag tacttcccag aggaaataga 99301 aaagagctac aatagttaac agatataata tttaaaatgc cataggatag tggtatactg 99361 ccagcaatcc caccaatcaa gtgccaacag aaataaggca actctttata aacaaggtat 99421 tgagaagact ttaagaaatc agacttactg atatcaacaa ggtttgccag tgaaggacta 99481 tgcagccaat ccaggacagg gaacagtgca agagaacaaa ctgtgggccc cacacttata 99541 cgtacaatgc agctaagtgc ttgcaacaat aaatatttac tactggtcat atcatctctg 99601 aacacaccag cacctaccac cgtgtcttct agatggtggt gttaaaagtt attacaaaaa 99661 tagaaattaa aatagaggac aggatgtctg acttttacag ttcccagtac ggactccctg 99721 ttttgagtaa aataataaag aacagcactt gtagggagaa aggcatcttc agatccttca 99781 cgtgcatttg tatttgaacc ttacgaaaat gggatgaggt gtgcagagca tatacaaata 99841 aaaatttaag tgaaacccac ttgcctgact atatgaatat ctggcacttg gaatgatgtg 99901 ggcactaact tgggctttga gaaaaagtcc attctttaga gaatgcagaa cattttgtgt 99961 caagggaaag atttctttcc agaagtgggg agcagatctt gtgtatgaag aagtattttc 100021 ttcccacagt gatgatcctc agcatatata ttcagtttgg attgaatcat agtcagagta 100081 tactttaaat tcccaaatcc ttgtcaagca ttgttgccct tatatttaat aaagaaagtc 100141 tttagaaaaa actttgatat tcatttttct ttatcaacat ttaccaccca ttcttacctt 100201 tagggaactt ttcaaatttt tgaaaggaaa tcatcatact atccactgtt gattgaagta 100261 tatttcacca cgagtgaatt gctattgagt cacatcctaa gagtgtgttg ttacttttag 100321 tgggataatt atacaggtgc ttcagaagag aaccctgtca aactcttatt gtattgacac 100381 atttcccttt caaacagagc tttcattttc aatccaggaa aatttactaa tattaaactt 100441 gtatatatat atatatgtat atatatatat atatatgtct ccaatctgac ctcctcccct 100501 cctctttgac tttgtcttct gctacctcct acatatatgt gtgtgtgtgt gtgtgtgtgt 100561 atatatgtat atgtatgtat gtatgtatgt tgtctttatc aaatagtggt agtgatgtgc 100621 aaatacttca ttgacccctg agcataatac tccagtttca cttgtttgaa tttttactta 100681 actagaaaat tgtgctagaa actgaattaa tacatcacaa cattgcttat accacattca 100741 tagcatccct agtatttttc ttaattgctt tcaggttgga gttaaagcat tatttaatta 100801 taaaaggtaa tacatttgag caaatgatca ttttcacaaa ttcagattcc tctaaattct 100861 ggaaggcata caaaacctat tttcacacaa cctacattta attcagaata ctttttggta 100921 ttatttggta gaagtatgtt gagaactgta attattatat tcttgaatta agcagaaaaa 100981 atatatccaa atagttttta aatcaaatgc tttggtttta aggctttggt gcgcatcctc 101041 ttttaaagta gaaaacatga tggaaggcat atctaaatag agttccgtgg ttgcgtgatg 101101 caatagaatc tcccttagca ttgctttgtc tgactctcag gccaaatcgc acacattctt 101161 ctaaatagag atcaattaac gtaaaatggt tttggggtat gctacttaat tttgtgctat 101221 ttcatggtaa aagacggtga acttagcctc tgacattccc cccactgctc tagccagtct 101281 tctgcttggc tgatcccctt caagtcccca acctctaaat actggagtgc tctgcaactc 101341 agtcccagga gcatttctgt ctacctgcgc ccattccctt ggggtgccgt cccatctcat 101401 ggctttaagt gcagtgtcct gtgtgattga tcccagttct gtatctccag ccctggagat 101461 acagttcact tgaactgcag atgtttatat ccagctgtct actggactcc tcgttgattt 101521 taatagtcac ctcatacatg tccacagtga attccagatc tcctttaaaa cttgtcttcc 101581 tacctcagtg aatagcaatt ccatgttttc agttgctcag gcccaaaact ttggaatcgt 101641 ctctgactcc tctattctct ttcacattct acttctcatc tgtcagcaga agatgggttg 101701 tagaattctg aagacttctc aatgggctga ccaccactat ctgggtccaa ggcatcaccg 101761 tctctcacct ggagtgtgca gcaggctcct agctggcctc accacctcca ccttgctccc 101821 tacagcgtct tcttagtaca cagctaggag tcctgttaaa atgtcagctc atgccacaca 101881 tctgctcaaa accctccaaa agcttcctct cattctgagt aaaagccaag taagtcctta 101941 gaaggactta tgcaacctct cccaatctga cctcctcccc tcctcctctc ctctttgact 102001 ttgtcttctg ctacctcctc cctccttccc tctgccctag tcataggggt ctcctccttg 102061 ctgttcctca aaccttccag gcccaatcca cccttggggc cttcatactc actgctccca 102121 ctacctggaa tgctctttcc acagttatct gtgagggtca ctccctttcc tccttcaggg 102181 ctctactcaa aggatatctg ctttatttcc tgcttagaac taactgctgt tttgcagtta 102241 gttctgtctc tctccttaga aggtaagctc caggcaggta gggattttgt ctgttttgtt 102301 cactgccgta tcctgactac ttagagtagc gcctggcatg gtgggtactc aatgcataat 102361 tatagaggtg atgaatgaat gaatgctcta tcatctacca aaattgttcc agtagacaga 102421 ggataggtag ggagatagat acatcaatcc atcgatgata gatagataga tggatcaata 102481 gatagataaa tagattgatt gattgattgg tagataatag cacttatgga caggttgtta 102541 taacttgtca ccaccactaa ccccctccac ccccatctag gaaaaacagc acaagtacca 102601 atatagactt acagatagaa cgctcccatt ttagactaaa gacagcatat aatcattata 102661 cattttccat ttctcagaaa ttaagtatta aataacacct ttcatttgaa catcttaaaa 102721 atgtgttacc gaaaaaagag tactgtaatt acatggattt atacatcact gttatacatc 102781 aggtagcaat gaaaggctta atagattcaa gatctcagtg gctcttgaca catttgtatt 102841 gaaacataaa aatgcaagat aagctcaagc agcagtttga catttacaat atgtcattga 102901 agcctgggaa agtctatcgt tgcctggctt attagcataa gaaaaaaagt ggcatgacat 102961 gtagactatg agcacaatct cacggtgacc ttgggccagt cagtcacctc gcttaacttt 103021 acttggttct tcctgtctcc gtgggcatat catggttgaa cagactattc agggaaggcg 103081 ctgagaaaaa agtgagcctc aatcttggcg gttaaccaaa agtgttttcc tgaggtcaca 103141 atgcgagtaa cacatgtgct ctgtgttgac cacacttaga atgtaaaggc aagtgttttt 103201 tatgagactg agccttcaca cacttgccag atgtttagca tgtggcttgg atgccttggt 103261 ctcttactgc catgcccctc agaaccttaa atctcctctg ctgatgtcca gagccattgt 103321 gcctcttcct ttcaaaacca acaccaggaa agcctctgat cacagcttat acaagctgcc 103381 tcttccaagg ttgttgtttg tctttttgga gagggtgaaa cgaatgaagt ccccatgctg 103441 aaatttcagg aatcaaaaag ccaggttcag cctctgtggt ttggggtcag gactctgact 103501 gccaatgccc tggagataaa gttgagtacg acagacacaa cccatgcacc ccaaccatct 103561 tcagcatata acttctctgc ttattaaaat gaactctgct ccatgggtct agagactgaa 103621 agaagctcca acaggcctta atcgtgttca cagactccta gaagacctta gaagtcattt 103681 agtctaactc cttcatttta tacatgagga aattgaggtt tcaaaaagtc atctgctgat 103741 ctgcgagctt ctcggtacag attccatgtc ttaatttttg tgtgtacttg gaaatgccca 103801 gactaccttt cttgcaaaag atcctgtaac tcattctgca cttcctctta gtctagacta 103861 gtaatatttg actgagcttg tcggtcagga tgctctaaaa atactcatgt tttaggaata 103921 taaataatat agaatactta agaagattta agttgggcct ggaggaacta acttatcact 103981 gcctctaaac tggagtaccc ccatgaccac aggcgccaaa atggtaaaac agtattctct 104041 gctcaaccaa agagaaagca tggtagtgtg gagaggaccc tggttggggg aaggtagtgt 104101 gagttgtagc cctagacttt gtcacaatac agacatttca ctttgggtaa gtactttatc 104161 tctttgtaaa atagaaacct tcctagtgag taacaaaact tactattatt ccttagaaca 104221 tgacagatat atgagcaaag gacatgagta ataattttgc tttatagata aaatgcctca 104281 ctggttccgt gtctcctaat ctatcagagg cacggagaaa cattgcctgg aaaggctgac 104341 ggcaaaactt gcgctatctt aatgctttct agttatttct gtggccagaa aatggacaat 104401 agatcaggaa ccaagaaaat tcatccccca cccaattaag aggtggacat aagttccaca 104461 tttgaccaac ggtaggaccc tactccccta gcaacagaga tagtttcaga ggataggagt 104521 gtgaacaggg cacagccaac ctgagttctt ccctcagact gagccacaga tcttgggaat 104581 taaagtcttt tcctctagga attaagaagc tgggatgata acttaaagtt accagtggct 104641 gcttccctta ggcccatagt ggaagcccat ttacaggagg tgagaaatgg ccaacacaga 104701 aatatgcaca gctgagaaca agacaatgtt ctagtgccgt ggtttaaatt cctggatcca 104761 gccatgtctg aagatgttta gttgaaaaat tctctttttt gcctgaacta gttttagctg 104821 ggttcctgtc acttttgttt aaaagaattc caagtaatca tatatcttct cattcttcaa 104881 atgtggaaag tgaggctcat atggagggag tgactgattt aaggttatac agcctgttgg 104941 aaacaaaatc ttgatttgaa gtcgggtcta ctgatttcca ttccaacgca tttgctacta 105001 cactgagctt tatacaccct ctcttgggaa ttatcccaca ttttctgaat aatagtacag 105061 ttctcttccc ctaccatact acaacacgga aaaatccatt catttaagca aaagatagag 105121 gaccatttct taaggactct aagatatgac acgagagctt agattagttg gttggtgtct 105181 aggtaagttc aaataaaaga ttcaatcatg tcagcatcaa aggtagaaac catgtgtttc 105241 attctttttt ttccccctcc aatggaagat ctactctgtt ctaggcatag caacagggtt 105301 tttcatggtt cataagtagt tgtctcatgc tagacttcct gcataatgga aacacaatag 105361 atatttgaag gaaagatgaa aagaaggaaa gaaaaaaaga aggaaggatg aagaaaagaa 105421 ggtagaaaag tagagtggat ttcaagtaaa aattgttttt cctactgttc aacagagctt 105481 acatttaata accagtatct tacagtgatc tgaaaatatg tctacaagac ctagcccact 105541 gtgcaactac tgcttttcct aaattgaaca gaagaaactg tgagaagaga aacctcaagg 105601 cagagcctca gaccacaggg tcatcatggt agggctgaag atttggagca aatgaggtaa 105661 cagaagatgg tgacattcga taaatgccct cactgtagta gttctgtgaa ctggtcatga 105721 cgaggctcag tcatcattag taatattcgt ttgcaagggc tgtatattag agtaaattgt 105781 atataagtaa gcagttcctg gatttggtaa caccatgaat aatgttttag taaacttgaa 105841 catttaatga tgaataattt ttaagatgat tctaactatt gtttagtatg tgttgtatca 105901 ttctttgtga tcagatttta tttaaaccac aaacaggatg cacataaatg gtgaataaag 105961 tttcagagtc agaagaaaag atggttatca aaagaatttg ttttctatga cttcgtaaaa 106021 gcactcaacc acttgaaaac ttgctatttt ctagtttcaa attcagtaat gaaaggcaac 106081 accctaaatg attccttagt atcataatca cataactgaa gagtgtcttt tctcatgctg 106141 tatttagtta aaatttagac attgtgaaaa agggaaggga gggatgtttc attattaccc 106201 taattgaatt aaccaaatag aatatgccat gtgaattcag atattaaaat agggttttca 106261 gactgggtgt ggtagctagc acctgaaatc ccagcacttt gggaggtgga ggtggaggtg 106321 ggaggatcac tggagcccag gagtttgaaa ccagtctggg ccacataggg agaccccatc 106381 tctacgaaaa cattaaaaat tagttgggca tagtggcatg cacctgtggt cccagctact 106441 caggaggctg aggcaggagg atcgtttgag ccctggaggt tgaggctgca gtgagttgag 106501 atcatgatac tgtactctag cctgggcaac agagtgagac cctgggaaca gagtgaaaaa 106561 aacatatttt ctttgggtaa gaaatcttta gtttaacatg tggtaatcta gaagatccaa 106621 ttttagttat aaatgggttt tattgatatt ttcatgaaca gttttttatt ccaagcatag 106681 tacattatag aaactatttt tgaacttttg aaatatttag aatactgaag tatgaaatct 106741 gttaagatgc actcacagca ctgacatcaa cttgccatgc aatttattgc tgaaaactgg 106801 ataacctgag tggctggttc actgtccagt gaatttttaa tgatttctta cttctattct 106861 tccatctgtg tgtctctaga tccacgtcac tgtctttctg agaaatacta aagccttggc 106921 tttgctggtt agcaacaact taactgtctt tttatgagac acctacaata acatcacatg 106981 aaattttcac cattgattct ttaagtcaag tgttcattcc acaaatttgt gccattcagt 107041 gtttaagatc ctgaaaataa aggcggacct gacagaaatg gtccctgctt ttgaaatttt 107101 tattcttgca gaattaaaag aggaagaaac aagaaatacg cgaatgaata aatgatttta 107161 ggggttgata aacatcaaga aaaaaacatg taatatgaca gaagatgtgg gaggaggata 107221 atatgaatgg cgtcagttag gaaaatcctt actgaggatg tgacgtttga gcccaaacct 107281 agatgagaag aaggagccag cctgtgtatt ccagtttggc aaaacagaaa aagcaaaagc 107341 aatggtctgg tgtcatgtgg gaggaaaaga gggtgatagt gagggagaat ggcaaggcta 107401 gagaaggaag cagggtccca gtgtgcaggg catcacaggc catggtggga agttgaaata 107461 cggttctaag agtgttggga agacattgag gatattcagc agggcagtga gatctaattt 107521 atatctttaa aagatccctc tggctgctgg tggatactga atcatagggt gcaagcagag 107581 aagcgaggaa gactgctggc tgggttttga taggttggag tccacacaat ggtgccgggc 107641 agagcagatg tgtagaggtt gagatccagc ccttgcccat tcaccaagct ccatgtctag 107701 cctgggactg tgctcacgtg tagaaagggc tgtctttttt ttattgcata caaagcacca 107761 tatggaagct gcctcagagg tgggaacaag gaagaggaca ttgcagtaat ctttgtaagg 107821 ataatagcag tggtggagaa agagagaacg gggtggattt aggatgtgcc tggaggtaga 107881 gctggcaaga catactgatg gacgagatgt gcagggtgac gggaaaggag gaatcaaccg 107941 ggagacttaa actcatggga ggatacgctg ccactgccta aggcagggaa aattaagagt 108001 ggagtggttt ttttgttgtt attgttgttg tttagtaggc agggagggaa gaagaattca 108061 gtttaagcta ggtttggtct aagattcatg ttagatgtca acaatcagat ttatgagttg 108121 cagaaatttg tgactgcatt aagtgtaacc tagaagtgca atacaatata tcctcttgga 108181 tcttcttcat gactgctgga cccggtggga tgggggttga gtcggggagg gaactagagt 108241 tagtccagcc tgggattctt ctcctgaggg cagcatttaa acaggtgcaa tctctcagat 108301 gttcagccca tcactcctgt tctccaagat gccaggagcc tatagtgccc tccagcaggg 108361 cccggaaaga gtcagtgaat tttagtgtct cgcctcctct gaactccata gtcatgcacc 108421 ttatttgcaa agtcctggtc atcactggcc ccgctggcat tcactgttga agttggcagc 108481 cagtgtggct tggtgagggc ttggtggatg cttgggactc cctggctgac tcagccccgt 108541 gttggattgc cctgtgcacc cctctagctg tctgtcatat tctccaccca gcccctggtt 108601 caatggccca ctgtgcccct tgcagccaca ggccctgtga atgtccacta gctttccacc 108661 tagatatggt ccactgacac tttggcaaca tgtgggaatt tctgcatccc atccttacca 108721 caggggacct taaggtagct tagctacaat ccagccctgt ctctgcctga cccagctgtg 108781 ccactgattg tactctgata tatttgcctg ccacctcctt cccactgccc aggtcggtgg 108841 caaaaaggcc agttgggtga tgatctcctc ccagagctcc taaaataaat caggcaaaaa 108901 ctctcctctc agcattcccc tgaagagcta taacccatct tcccctgatt ggccagaagt 108961 tcggagaggg gaggtcagca cccacatgtc tgcctctgat cccttggtac cagataagtg 109021 gggcttccaa cttttattcc tcctggttca ttcttctcct gcctgggggc agcaagacca 109081 cagcccaact atcagagtca gaccctcgtt tcttccttaa aaagatactg aggatgagtt 109141 gaaaggaaaa aaaaaaaatc tcataattta tctgagaacc attgttcacg aaatttcata 109201 aaatctcttt gtatattaaa tcctaaatat attaaggatt gtaaacattg ttttcagtct 109261 cacagctaca tcttgattgc tgaggaagtt agagaaaagg tgatctgatt attgataagg 109321 gaaagaaaat ctgttcctat ttaaacatgt cacacttgtg attttgtagc tctgggaagc 109381 tactgatgct gcagatacaa tttgagtcat cagcacgtgg atgagtaaag ccatgggctg 109441 aatgggaaca tctatatcat ggtagagaag aacagcaagc tcaggactga gtcctggggc 109501 aatccggtag gtagagcctg aaaaaagtga aggagcccca cagaagactg agagggagca 109561 gtgcataaga taggaagaga gttattagca agaggtgccc ctggggccag gagactagca 109621 ggtttccaga gggggaaaat gctcgttgaa ttaattgtat ctaggatgcc aaacctgtag 109681 aagaatccta acctatactc tgatcctaac cctgaccctg accctgaccc agaccttcat 109741 tttttcctta gaaggataca ctgaaaatga gttgaaaagg aaaaaaaaac ccacaactaa 109801 taatttatct aagaggacca ttgatcacaa aactttgtaa aatcatgtca ccatttttaa 109861 ttatgatgat actctattta gcaaatgcag ggctctttga gcccagctgg agccccaccg 109921 ggatgactga agtagtgaac atgggcgctg gagttcctgc actggtactg agaactttta 109981 gaagtgctgc acaagatccc caatctgctt cttatgaagt agcatgagga gcagaatgac 110041 tgtttctttc tttgctttgc ttatcttttt ctgaaatcaa cttggaaaac acgtttgtct 110101 ttcatccttt caggcagacg catggttttg tgggtgaggc tcctgctggg tcctgtgctt 110161 ggtgtctcag gcattctggc aatggcagga acataaccag aggtagagac accttttccc 110221 ccaagaggag cagtggaatt aaacccctgg ggtgtgcgtg gtgcctgcac cctgctgcct 110281 ctccctacca ttgctccgag ttcctggtgc ccaccactcc tatcctgacc tccactccca 110341 acagtgacat ccctacagcc atcataggaa tgttattcat ttcttccact ccattttaca 110401 tatttcatca aactgagatt atttagacca tggagagaat aatccgaaaa tagtgtttga 110461 gttcagagtt tgaattaagt gtttactttg atggtcaacc agggagccag aatgaggtag 110521 gacaaagaag gtgggttgaa tcctggctct atctctcacc agctcagtga gcttagagta 110581 atccagtggg ttagagcctt ctctctaact tggactgagc ttctagaacc cattacaagt 110641 agtgaaggaa gaacacagtt tgtcatgact gtggggagaa ggagtgagtc aggacccagg 110701 gagagagtca ccatgaagca ctccatccaa gggagcagga tggagaggtg gaaacagaca 110761 gaggccaata gtatgcattt ggggccaacc ccagagagag ttagcagatg cgaggggtgc 110821 gtgggaagtg gaagctacca ctgtccctga agtgctgaag cacagagccc tgtccccttt 110881 gagatggaga ctgcaattgg atgagtcaag caatagcggg aatgcggggg agttgcttag 110941 ggtactggga tgctcagaca ctgaaaagtt acctaagtcc tctctacctc tgtttcctca 111001 tccacaaatt tcaggtaaca atgccttcct cgtagaattg ttacaaggtt taagatgaga 111061 taatcctata aaatatctaa gtctagagct ggctcataag aaacgttaaa acaactctca 111121 tcgagcaaca actgactgaa tgttttcatt taagctcctt cgtaatttca aactcagcat 111181 attaaaaatc aaaatcacag taccaggact aaggtagtgt tccaagcctt gttctatgca 111241 cattatttcc cgatatccca cagaaacaac ccagcctaca ccagatgaaa aactatgtac 111301 cttctctggg gactggagta ggcaggtctt atccctatct gtatgagcag acttctaagg 111361 tggcccccaa ggacccacat ctcctagcgt tcatattctt gtataatcct tcccccgaag 111421 tgtgaggtgg acctagtgag aggatgccac ttcagcgatg cggttataaa agattccaac 111481 ttctctcttg ctagcagatg tccctattgc tttctcagtg tgcacacttt gatgaagcca 111541 gctgccacat tagagaagtc catgtgacaa aggagccttc agcgcatagc cagggaggaa 111601 tcaagatcct cagcccaaca agcttggcag gaggaactga atcctgccaa agccatgtgg 111661 gcttggaagc agatctttcc cccaacggag cctttagagg agaccacact gcctctgtca 111721 acaccttaac tacagccttg cgggagaccc tgaatcagag gaacacgtaa gctgtgctca 111781 attcctgaca gacgcaaact actgtgcaat aatagatatg tgctgtttta agctgctagg 111841 ttttggaata gtttgaatgc agcaataagt aactaatcca tcaaccctgt tatgctatga 111901 agttttagtc tgggatggca cagtcaggaa gaggatcagg ggagggctgg ttagacatca 111961 catcataaca ggcaggttac ctggagtaac tgtttgttgt attacatcac taaagcaaaa 112021 gcaggttgtt atggaaagga agcaacagtc aaacaattcc taaagataaa atgtaccata 112081 gcacaactaa aactatctca agagtaataa taggcacgcc ctagctgcca ttttgattta 112141 taataaaatg gttccaataa aaggaacctg ggctctttgg agaaatgact gcttctagga 112201 ctggggcagg aaatagacaa gatgtgcctg gagtgtcttg tagtgccaga aagtaataag 112261 tgctcaaaaa aaaaaatccc aaaagccaca ttgaaggggc caaagggaac acagaagcca 112321 actcaaaggg gccccagtgg ccagcactgg aaaaatttga gcaacaaaat aaacaaagca 112381 gtactggatt ataacccgaa gtataaaatt aatatccatg aacccttaat gatataaatg 112441 atagaataaa taaataaata aatggggggg acaaatatcc tatgcagaag aattccacat 112501 aattatgtgg atactcagct ctcgaggaag tggagcatac ctcccttcca cttaagtgtg 112561 agacttcctt ccaaagagtg cagcatggaa aggaggggga aagatgaact ttgcagtgga 112621 gaaacctgat aaacacgacc ttggccagat gatcaatgtc acatcagcag tgataaatta 112681 tgtggataat atgtaccttt gatatgctat aacaaaaata gcactttact tctgtggtct 112741 ttctcccaaa acacataacc ccagattaat cctgagaaaa acatcaggca aatcccaatg 112801 gagggacagt ctaccaaaca cctgaacagt gtgccttgta actgtcaagg tcatcaagga 112861 aaatctaaga attttttttg tgacagggcc tcactctgtc gcctaggctg gagtgcagtg 112921 gcacgatctc ggctcactga aacctctgcc tcccaaggtt caagtggttc tcctgcctca 112981 gcctccagag tagctgggat tacagacaag cgccaccaag ccctgctgat ttttgtattt 113041 ttgtagagat ggggttttgc catgttggcc aggctggtct tgaactcctg acctcagttg 113101 atccacccac ctcagcctcc caaagtgctg ggattatggg tgtgaggcat cgtgcccggc 113161 cctaaagtct aagaaattat ggccaagagg agcttaaaga gacatgatga ctaaatgtaa 113221 tatggtgtcc tggaaaggat cctggaacag aaaaattacc agctatcatc cagtaatagg 113281 tatgtggtct tgggcaagtt acttgacatt aggtaaaatt aaggaaatct gaataaggca 113341 tggacttaat taataatcat atgtcgatgt tggttcatta agtgtaacaa atgtaccctt 113401 ctagtggaaa ctgcatttgg ggtatgtgag aattctctgt agtgccttta taattttttg 113461 gtaaaaaaaa gctgttctaa aataaaaatt taattaagta acaaaaaaga agatggggag 113521 atgctgagga atgaggaaga gtgtggagaa gaacagctgg aaccagatga ctggttttag 113581 gatctggcag tggttgccag gtcttggaca aagagaagtt gaatggctgt atctgaggta 113641 gaagagtttt catggaaact agagtttaaa ttgaattttt attaacttaa ttttagtagg 113701 aaattcaaag accactttat gggactttgt cattcgtata tcatctgtct ttcccggggc 113761 ttcatgtgct tgttatgtgc tttttggaat gcatatttac attttaaaag tatttaaagt 113821 taaaatgttc actggtttaa aaaaaaaaac ggaatagagg ctgttaaaca gcaaatggga 113881 atctttggat aattgcactg ggaatgaaaa cacccaatcc aaatgttctc ccagaaatca 113941 gtactaaaaa gaataatgat aagacattta agagaaaaaa aaatacgtga aaaataggtt 114001 taggaaattt tgacacacat attataggaa ttataggaat tatagtagga ggaaaaagga 114061 gattggaggt caaagaacta atggaaaaaa atttattgag ttcgagaatg tcatcttaag 114121 atctaataag ccaattgcat gctcgtcagg aggaatcaaa gaacagccac aactagacat 114181 atcctggtga agtttctgaa tttcaaggat gaatatcacg gaattacagt cttaatcctg 114241 aaattacaga ctattaagga atcaataaga atgagcaaac tacttaacaa aatctatgca 114301 gtgctgtcca tgctgtgttc aaaagacagc tccttatcca gctatagcta tgtgatgctg 114361 ggcaagttac tagacttctt aaagcttcca tttcctcatt ttaaaataat attagtttgg 114421 ttcaaaagta tttgtggttt ttaccattac ctttaatggc aagagcagca attacttttg 114481 caccaatcta ataatatcaa ctacttccac tgtgctgtgt gtgggtcaga cactgctcta 114541 ggggctttcc aggcagtaat taatttatct tcctccacag ccccatggtg tagcttatat 114601 aattatgccg ttttgcaaat gagaaaacca aaatgcagag aagctatttg cacaaggtta 114661 tggaatggta agtggcagag actttctcaa gtctgcactc ataattactt tgctataaat 114721 acctcttata tggcctggtg tggtattatg aggattcagt gacataatgc atttaagatg 114781 tgcagcattg gtgcctacca tacaatgtat tagataattg aagctagtat ttgtttaaaa 114841 ggtgaagcaa atattcaact caaggagcta gaaaaggaat gataagataa atttaaatca 114901 agtagaaatt aacaatgaaa aagcataaaa tgctaagtta ggaagataaa aatcgtacac 114961 agaattgata aataaaagca aaacttgaat ctgaaaaaaa tcaaatatac aaatttctta 115021 caagttttat tacagagaaa gatgggtaaa atgcacagta ttacactttg tcaaatctgg 115081 gattatttcc tacaaaaatg cacacaaaaa tattaagcag aaaatgtaat agagtaatta 115141 aaacagaaga aatagaaaac accaacaaat gtacctccaa aagtaatcat tcccagattt 115201 tacaaatgag ttctgtcaga acataattga ttcagcaaat attctaatta tggccaggtg 115261 tggtaactca tgcctgtaat cccagcactt tgggaggctg aggctgtagg atcacttgag 115321 gccagaggtt ggagaccaac ctgagcaaca tagcaatagc tatgtgaggg agaccccatc 115381 tctatacaaa atttttaaaa aattagcttg ttgtggtggt gcgtgcctgt ggtcccagct 115441 actcaggagg ctgaggcggg agaatcacct gagccctgga agtcaaggct gcagtgagcc 115501 aagatcatgc cactgcactt cagcctaggt gacagagcaa gacactgcca aacaaaaaaa 115561 ttctaattac tatgtgttgg gaactgtctt gtcactggct ttgtggaaat aaaagaagaa 115621 aaatccttgc tatggaggag tgaggaaata ttgaaaattc tcttactatt taaactcttc 115681 ctaactcact ttttatgtgc cccaaaaaga aggcacataa aaagaatatg aagaatgaac 115741 tcccttaggg atggacacta tctaagtaaa gttgttaatg cagcctcttc ccttctgaaa 115801 gagaacctgg cttttccatt cattatttag ctctgttccc ctacacacca gatgaagatg 115861 aaatatcaag taggcctggg tggcaaagtg tctgggctgg gatgactata aatgaactat 115921 ccatttttag tggagtgtca cagccttggg ctctcctgag tgcaggggcc ttggaagctg 115981 tctgtgatct ctttcaccag atattgactc ctgtggggca caaagcctta ggccaggaca 116041 tatcttgggc ctcccaatgg aagtcgtgct tctccacacc tgagtcatca cctggagcac 116101 taaagagaat ggtgcctgga cgaccttgag ggccattcct actgaagagc agcacctaat 116161 gctgccaccc tgggaacctg tgtttcatcc tctgttcaaa ataaaggaag aacaaattga 116221 ttccatcagc gtattaaaac cataatatag tatgaccaaa tagaacttat tccaggaatg 116281 tgaggatggc tcaacattta gtaatcagtt aaaataatgt atcatatgag ttagtggaaa 116341 gaaaactcat gatcctgata aaaacttgaa aggcgtttgt aaaattcagc tatcatttct 116401 aataaaaatt tggagaacta aatattcgaa actgttccct attaggaaat caacagcaaa 116461 catgatattg attgatagat gattatttta gattcaggag gtacatgtgc agctgtgtta 116521 tatgaatata ttgtgtaatg gtgaggtctg ggcttctgat gtacccatca tccaaatagt 116581 gaccattgta cctaataggt aatttttcaa cccttgcccc ctcctcccct cccctatttg 116641 gagtctttgg cgtctagtgt ttccatcttt atgtccatgt gtacccattg tttagctccc 116701 acttataagt gaggtcatgc aagcaaacat gatatttaaa tgaaaagcat tgaaagcatt 116761 cccaataata tgaaggaaat gtcgacaatc attttattac cacaattgca ttctaaggtc 116821 aatctagagc tgtaagatag ctcattggac taaattagag tccagaaaga tacttaagaa 116881 aaattaaaat ttagtgttca ataaggtagt atttcaaatc agtaggtaaa agagatattt 116941 agttagtaat agtatcaacc aaacatggaa gaaaaataaa ttagatttct tctataccct 117001 caaataaact ctaaaaagat taaagattta attgctgaaa ataaaatcat tcacagactg 117061 gaaaaataga gataaatatg ttatatgttt gaccacatga aaacacagtc agaaaacact 117121 acaagcataa caataaacag aagaaaataa atgtaatata tttgaaaaac aaagatacac 117181 aaagaactct tagtaatcag taagcaaaag attaatatac caacagaaat ctgggcacta 117241 acaggtaact cacaagtgaa gaaatataaa aaagcaagaa ttcattaatt tatgaattca 117301 atacatatgt attagcatca taaatttgcc agacacacgg ccaaggttaa taaaaaatca 117361 ctgtccagtg ttgataagga tgtcagaagt ggacattctt aaacgtgact aatgggatta 117421 ttataaatta taaattctat tattatatgt tagaacatat ttctggagga cagtttggca 117481 aaacgtattg agagtgaaag ggttgcaaat acttgaccat tcattccaaa tgtaggattt 117541 atattgtcat gagtggacgt gagccctgga actgggcagt catcttatag ccccaagggg 117601 agggaagtca acccacagag tggcaaagca gcctgtccac aagatctcag agctgcagat 117661 ccgaccaccc caggacccac cgtatccaca gatgcagtgt tttgtgagat ggcaaatttc 117721 cttatcactc cagtcagtgt taactggggt tttgtgattt gtggtcaatg caatcttgat 117781 gacacagggc tctgtccttc cgttcagaac ttctagcctc atcttccact tcttcccaac 117841 agccacttct gcagtagtca ggctgcttcc aaccctgtca acaccttgat ttgcacttct 117901 ggttccaaga ttgtgcgaga ataaatttcc gtggctttaa gtctcccagt ttgtatactt 117961 tgtgatggca gccccaggac acagatatag ctgaggccta gaatccacct ccctcttccc 118021 caaaggcata ctttgatttt ctgcttgtat taagtagact tggagtacag cctatagagt 118081 atcggtcaac agaagtggag attgatttat ggccttgcag ataagaacac aggctctggg 118141 gtctggtaaa tctcattcaa gttgccagct gtgtgtctat ggcaactctc ctagcttctc 118201 tgagcttcca tttggatatc tacaaatagg gttaataaag attgttgagg gggctgagtg 118261 agatgatata tataaagtgc ttaataagtt atcttgtgca caggaagatt taacaaattc 118321 taatgcctta aggctccagt gtaatccaga acaacagttc tcatcctccc ttaattagat 118381 ggacatcaat gaattggcca cagcaaaggg tggggagttg ggggagtccc atagtctctg 118441 agaaatacag gggcatgtct caagtaatgg tggactatgc caatgggagg atgtgaaatt 118501 ccatttacaa ggatagccca gattgttatt aaccaacaac agtgaggaaa gagaagtgag 118561 aggaaggagg aaagaaatag atcctaggtc ttggagctca gagaaaactg tgtggggcca 118621 cacagggaac agtagcttcc aagggcagca gacacagggt ttgctcaggc aggatgtgag 118681 ggaaggggtc ctgggttccc cttccacgga gagaagaccc aaccgaagga cagcatcacc 118741 agtggggctt accccatcat gcttacttag aaagtagtga tctagaaata actacttgat 118801 taggaaatta gcacttgagg attctcataa ctagttacct cttggctgct tggagtaaaa 118861 ggcagaacag gttttacttg gaaatgcagt gccttttttt tttttttttg gtcattttat 118921 gctgtgcctt cctagaacac ataaccattt tgttttattc ctagagaaat aatccatgac 118981 atatgagctt ttatcacaaa ataaaaacat caaactttat ctttggaaca aaataaatgg 119041 ttgatcattg tcttttgtca tttattgtaa aatatgttat ctatgataac tcttggtatg 119101 catctcactc taatcttgat atttaaaata ggtgagtaat tgatttctga gcgtggtgta 119161 tttgctctgt agcttatatg catacttaca ttttgatact gaaagaaatc ctaggagtta 119221 aacctcaaaa gtcatagact ttgcctagat aaacatctgc tgttttaatg gtattttgaa 119281 ctgcagagct accgtttaac aaccctgtat tcttttcttt tttataactg agtgggtggg 119341 gaaatgggaa agaattctaa agaaccagtc acctcacctg tctccttctc tctgtttcat 119401 cctttgcaga ggagctcctg caccacgcat gccttcctca tgcagttagt ataagagtcc 119461 aaagaaagaa gcggttgaaa agcacttcag aaattctcaa accctgcacc aatgtaagga 119521 actacagaac tgcaggaggc tttactccct tgttggttaa tcacttttca aatgtgttcc 119581 ttgttcaggt aaatgtgaca tctgcttttt gtttgttacc tgtctactgg gaaccatgtg 119641 ttggtcattt catcagtaac acaaatttaa taaatgttca caaaattaaa tgtattagtc 119701 atttattcat ttaaccagga attcctgagc actctgtata gactatgccc tgggaatagt 119761 gatgagcaag agagtccgtt gtagtgtcca ttgtcatgga actgacattc tcctgggaga 119821 accagacact gaaagaggca caggcagatg gcagtgattg ctatggtcag agaaggtctt 119881 gctgaggagg ttgtagttga gctgaaaact aaatgatgag aacaggtcac gtagggagga 119941 gctgaggaag gagccagcag gccaagctta gcaagggtaa aggcctggag gatggaaatg 120001 aaaagaaact gatgtgacgg gagggtagta atgagggggt atcaaagagc gaggcacctg 120061 gaccagacca gaccacattt ccaaacagaa acatgcctca ttcctgagag tatatcccaa 120121 atggagcact cctctctttc atgaggaagg gctgggtgaa ggcttacatt cctttctcta 120181 agtgagtgag cactgggggt gcattggtgc atggaaaggg taagtaacct gcccaaggaa 120241 caccagtggc catggatcac ttgaccctga gccagctcct gagggtagca gggaaggtgg 120301 gtgacccaag gcctgctggt caagccaaga cttaacttgt taattcattc acttttctcc 120361 taacttcaaa atgtgggact tttcctagta aaccagagcc tgcctgagat atctttgctg 120421 cagctaccgg gtttcattga gagaagatct ttggttcatt ggaccagcga gctcactcca 120481 caaaaagcct caggaaatgg caggccccca acacttcgga taatctggca gtctgtggcc 120541 caggtcccct gctcccttcc ctctgtacag atacctgacc taaataacca gatggggaaa 120601 gggagaagag acacctgagc aaccaacccc tagctctacc cttcccagga ctggaggata 120661 atagttgatg tcttcagaaa atgtggggtg gggtgtggga actcagccag ggggtggcca 120721 acttttcctg taaaggggcc agataataaa tgtttaggct ttacgggcta catacggtct 120781 ctatcacaca tgcttgtttt tatttgttaa ttatttcttt ataaacttta accctctgaa 120841 cccttaaaaa ggtaaacact gttcttagct tgtagacctt accttcaacc tgaaaagtgg 120901 attacagttt gttgaccgct aacctcggct gctttgccct aacaaacttc aaaccggatg 120961 ccacttaaac caagttaggg atggagcagc tgtttctttt aatctctatg ggagtaccct 121021 gaggggagcc ttcactgaca actgcatccc tagatcccca ttcccacttt ccacatcacc 121081 atcctcactt acgagtgagt gagaactgac aactgcaact gcatccctag atccccattc 121141 ccactttcca catcaccatc ctcacttacg gtcaggaaat ggaagctggg agaggccaag 121201 caatccccca aggccatgga gctagcaagt gatagaggca ggacttgaac tcaggcattc 121261 tgattctaga gtcaccggtc ttaaccactg cactgtacta tcccctgcta gaaactagag 121321 ctgtggcttt ctcttcttgg gcaaagcccc catctgtaca acaggagggg tggggtggtg 121381 ttttcctcaa tatgaaaggc atacccgatg gcaggtgaaa tgatttccac ggttcacaca 121441 gagagcatca gataatattg attcacagaa tgaaaagatg attgcctttt cacatctctc 121501 tcaagtcttc caataatgtc agggaaggag gtttagtctg atggtaatgt atctttaaaa 121561 tctctccaaa gcatgctaat tttcttttct tttcttttta aacaagaaca ggttttataa 121621 ctcaggatcc tcagtaggca gtgatagcta gctagaattt tagtatcact gttgtactat 121681 tattattact tctcttgtta gtctcatggt tttatttttt attgcctttc tgaatatttg 121741 ctatttatga aaaatgttag tgttgccatt gctgtaattc tcttgtccat ttcttcatag 121801 ttataatatt gctgctgaag ctcctgccat cacacccaaa ttggaattaa ggggaaagac 121861 agggttcagc cacttttgtc tcctattacc aggaaagcaa aataatttta catactttgc 121921 attgatcgaa acatgctgca tggctacatg tagcagcaaa agaggctgaa aatgtgggtg 121981 tttggctgga tacattggta ccccaaacta atcagaattc taccatcaag aaaaaggcat 122041 acatccagtg ctttgtaaca aaagtgtttg ccacaggctg ggaatccttt gagttttctc 122101 ttcagtgtct gcaaagagca gggatggaag tcagagatgt gggatggctt aagggaattt 122161 agcaaaacac ctgtttctct ttaccttgtt taccaccaag acactccctt ccccatagtg 122221 cagtggaaca ctgacgagcc atcatcactc ctaagaaaca atacaaaatg aaaaaaaaca 122281 gtattgccat actcttacag tggaggcgag gaaatcttat gagtttgtcc ccactggcta 122341 ttatagttca gagtgctaag ctaaagaacc cctcaccttt tcaatctact gtgtagaagc 122401 tatatttact tacctgcaca tacctccaaa tatgtatcct agaaatcaga tttgaaatct 122461 tataaatccc agaagtagga agttcgtatt tcaagaacta cttttcaaat tagtgtttta 122521 attcaacaag catttagtga gtgcccacag ttccccaggc acctccttaa gtgctgagaa 122581 tataggggtg aactggaccc ataaagccct ggagtatata tcctggtggg agaggcagat 122641 aaaagatctg cagaaaaaca ggagttcagg aggattaaga gctgaaatca ggaggcagac 122701 ttctgggctt gaatacaaat tctgcggctt cctggctttg tggttttggg caaattacct 122761 gcctctctat gcctcagttt ctccatctgt aagataataa gaataattat agtacttcat 122821 agagttacac tgagattgct tttttgactc atagtgatta aacacagcca tactgaaggg 122881 ccaggaagct aagatgtaga cagtgtttta aagggtaaaa aatggtgatg acaaattact 122941 ccccacctca tttggcagcc gagtgtatta catggcttaa tgcatgtaaa gtacttagaa 123001 atgttgctag tatggtcagg cccagtggct catgcctgta atcccagaac tttgggaggc 123061 caaggcgggc ggatcacgag gtcaggagtt caagaccagc ctggccaaca tagtgaaacc 123121 tcgtctctac taaaaataca aaaaattagc cgggcgtggt ggagggtgcc tgtaattcta 123181 gctactcggg aggctgaggc aggagaatcg cttgaacccg ggaggcagag gttgcagcga 123241 acctggattg tgccaccgca ctccagcctg gtgacagagg gagactctgt ctcaaaaaaa 123301 aaaaaaaaaa aaaaagaaat gttactagta tgtagtaagt tctcagtaaa tgttagctac 123361 tatactcttt caagtgctgg gtttttactt gatgtcatac agtgttatat aagatctcca 123421 aagatactga ggagtcctca aggccaattt taacaagcat ggttgccgca ttcttgtgct 123481 tatagttgaa catttcttct ttcagacact tgcacaaagg gatacttcta agatgcattt 123541 gcattaggtg gcaaacttca tcctgggtat gaaaaacatt gagatttggg aataaagcat 123601 agtaagactg aggttgcaat tactaaagga aaaccccaac agagataagt gaagttctgc 123661 aatatcatgc accctccccc aacccgctct gtctccccag gccccccttc gttagaacac 123721 ccatgactgg ctatattata tcagcatttc ccataatgta aaaagggaaa atacagacct 123781 gggcgttcat ggaaagtatt ctaactctca caaccagaat ccctgtcttt gaattttttt 123841 tcttggtttt tagatcttta acttttcctt cagcatttca gtactcaact ttttgaaaat 123901 catcttttct gaggaatgat atttcctggc acagcatcat ctctgtcaag tgactcagtt 123961 tgattttttt gtttgttagt ataaagtggc cccaacttac agagaaaaag tgggctcttg 124021 gtatcagttt gatgtcaggg tttttccgtg tttgagaggg agctttaaat accactcgat 124081 ttgaaggtgt ctgcaagcga gctccagtcc gctgtcaaga tgcttctggc catggtcctt 124141 acctctgccc tgctcctgtg ctccgtggca ggccaggggt gtccaacctt ggcggggatc 124201 ctggacatca acttcctcat caacaagatg caggtaggct gcagggggag cccatgggaa 124261 agacagctac tgacaaagtg aaatatgtat gaggatgaaa aaactcgggg ctgactaaag 124321 gttcttatct ctctatctac tttaggaaga tccagcttcc aagtgccact gcagtgctaa 124381 tgtgagtgaa tgctctttaa gaactttcca aattaatttt aattttcaca tctggaatct 124441 tcactctgaa atttcccttg caggtgacca gttgtctctg tttgggcatt ccctctgtaa 124501 gtatagtgaa ataacataat gttgaccttg gatttttttg gtttgttttt aagtaaaaat 124561 aagttgcttt atttaatatt taatgttata cattgttgct taatttaatt gttacagatt 124621 agtattccct gttaaaacca cattgttaca aattattccc ttttaaaact acgatcttga 124681 aatcctatat tatgaacatt tctttgtatt taattaactt tatgcctctt gagaagtttg 124741 aacacttttc aacattaaaa aaagaatcct gaatatcttt ttagataggt ggccatgtgc 124801 acaattaaat aaaactggaa ctaaggatat aataattgct gtagctcata tcatattgct 124861 ttctaactca tttactgata actctagagt tgtgaaacaa tgtaaataaa atgacaactc 124921 cttatctttc atctgtcatg aatgatctat gcgctatacc tccccctccc tgcctcctcc 124981 cttcctcccc accaccctgt tgtctgtcta gctgattaga gtgactgttg gtttgaatgc 125041 tgccctctgg gcaggtagag gatctgaggt tgtgagtgga aggagggctt ccagagggcc 125101 actgcccact acggcaggaa ggatgggtgg caggaaagtt ctgattccta attcaaactc 125161 ctggttaggg tgaggaggag gcacttctcc aaggtgcagt gctttattct ttctcatgca 125221 aggcctggga gaatctgaag aatctgagct tcttgccctg gctagggtaa gacatcgcac 125281 ccatcgcggt ccatccatta gatgagaaga ggatagagtg ccttctgggc aggaaccagg 125341 cagacagcac agcccctgtc ccttggagta ccgtccatgt ttttagctgc tgctgaaata 125401 ccagctgcat tcaattgtca catcccatta gctggtgtga aaaggctttt cctcactctg 125461 cactttcaga cttacaagcc ttgaagccgg gaagcacccg ttgaaaagaa cattcagagc 125521 cgactatttc agggcccaga gccctcatgt ttcctggatg taacatacag gaagtctcct 125581 ccaggggatg tcactgtgga aaaatggcat cccctttaaa tacgggagat cacttcctac 125641 attggcaagg gacctgtcta aaaataatgc aagtttgagt aatggtgatt aaataaaaat 125701 catctctatt atattgctct ttgtgatata tttccaaagc tgtcctcaga atatttcttt 125761 gaataaatcc ttactattta ccaggacaac tgcaccagac catgcttcag tgagagactg 125821 tctcagatga ccaataccac catgcaaaca agatacccac tgattttcag tcgggtgaaa 125881 aaatcagttg aagtactaaa gaacaacaag tgtccagtaa gtttgttttc atatgtgata 125941 tgttcctgtt ggtgatttct atgtgaatgg tgatgccaac cctgtttgaa cacaaaagga 126001 tgataaagtt ggaattggta gttcaaggtt gataaaagac atctaagaat tttaatcaga 126061 agtaatataa ttaaagtgag atccactgaa acaatagaat taaagtgaga tagatcattg 126121 ttcctgacga ggccatttac ttctctctac tatggaataa tgaaagaatc ctttctgagt 126181 gtaattagaa gctacaatct agagaatcag ggatgtagct cacataatac taaattatcc 126241 tagagattca atgtactaac tgaatggatg ttgttaacag ggattttttt ttcctgttgg 126301 ttaaggaggt tttgttttgt tttggagaca gagtcttgct ctgttgccca ggctggagtg 126361 cagtggtgcc atctgagctc actgcagcct ctgcctcccg ggttcaagtg attatcctgc 126421 ctcagcctcc cgagtagctg gcattacagg tgcgtgccac catgcctggc taatttttgt 126481 atttttaata gagatggggt ttcaccatgt tggccaggtt gctctccaac tcctgaactc 126541 aagtgatttg cccgccttga cctcccaaag tgctgggatg acaggtgtga gccaccatgc 126601 ctggcctgca ttaaggaggt atttaaaggg caatgcaccc aggtcaaggt ggaagcttgc 126661 tactcatcct gaatgcccat ccacacattc ttttcttcag catataccct agtccctgac 126721 agcagactgg gatggcaagt tgggtagagg tgacctccct ctgttttttg ggtattagca 126781 tctccacaca agatcctaga aggctgaaag ccctgagctc agctgtttag ctgcatgcgt 126841 ttctaccatc aatggcatct agttctaagt gcttaatata tgctgtctca ctgaataaat 126901 acatacctta gggacaatta ttcaatttat tactctcagt gaggttaact aatttgccta 126961 aggctgcata tttgataagt ggcagagctg agatttgaac tcaggcctat atgacctcag 127021 agccccactc ttagccattg tactgtcaaa tgaccttgga aagacaacct aaaaggataa 127081 tgatacaatt ttaggcctca aagagtcccc agaaaaggct ttctctaatg cagagattta 127141 gggccactta ataggggtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 127201 gtaaagaccc ctgaaatcca atttgaggtc aaccacctat gctgtcttta caccacatga 127261 gctagcctgg acctgcccac ctatttgctc tgtgtctcaa gccacttccc ttcccatccc 127321 cacaatcctc accaccgact ctggctcttg gcaggtaggc ttctggggct gcttggctct 127381 acatcatttg agtcactctg tccttatcaa ctttcatccc cacagtattt ttcctgtgaa 127441 cagccatgca accaaaccac ggcaggcaac gcgctgacat ttctgaagag tcttctggaa 127501 attttccaga aagaaaagat gagagggatg agaggcaaga tatgaagatg aaatattatt 127561 tatcctattt attaaattta aaaagctttc tctttaagtt gctacaattt aaaaatcaag 127621 taagctactc taaatcagta tcagttgtga ttatttgttt aacattgtat gtctttattt 127681 tgaaataaat acatatgtgg aaaaaacaac atgagctggt ctcttggcaa ttattcattt 127741 cttgctgctc agaacaaaga aagctacaag tgttgttaag gggaagaata gatcagagac 127801 tcctgtagga gtctctgtga taagactcct gatgctgaat acagaccctc aggctcatag 127861 gctgtggctg gagctgcagg agggctgcag ggagcagtta caaagaatgt ctatgaggac 127921 catctgaggg ttgccactag gggaggctga aggcatgtct gacactgcta agagggctag 127981 ggtcctgaga aactggcctt gtgcctgaaa gaggagaagg cagcagcttg gaagtgggag 128041 actgctcaag aatgtgggca tgcagaataa atgcttaata attaagtgct taacccacta 128101 gttatggtag gaagtagata ttattattat gcatttctgt tttatatata ctgaaaccaa 128161 agcttagaaa gtctgcatgg cattcccaag atcccacagg tggtgagatt tagaagaagg 128221 attcacccct cgtgctctct gcttccaaac cctttgctgt ttgcattctg cccctttctt 128281 attctcttta taccaagggt gtgcattgtg cttcttccaa ccacagcctg agggtaaaat 128341 agacccatca taattccagt ccacactctc atatccagaa acctagggag tcaggtttca 128401 gaattcaagc tttttagcct tttagaaagt taaaacaatc cacgtgccat atattacata 128461 acacccacag cagggcctgg ggcaagtttc cataattaaa catatcaatt ctgcaacagt 128521 cactgaaatg gtcaggtttt gcagccaaat gagtcctaac aaatcttgag ttttcgcagc 128581 tgtttggatt tcagaataat gataagaaat tagtaagctg cattaacaaa acagcctttc 128641 tctagctcat gccctattta acaagggagg tgggggtggg gggcactctt acgaagacat 128701 tttatgtggg gctgggctgg gtgtctccag gagggataaa ggtttctgta gcttcctgga 128761 cagccagcca catggagttc cagatcacag ctgaagggcc ttctcttctc ctctcctcca 128821 tgtgggtttt ctgaatgact ggcgccagac atagttgaat catgtgacac tatgtggttt 128881 accacctgtg gcatggggag acaccagccc aagcaggatc ccagcaggga cagatactga 128941 tggttggtga tggtttcaga atataactcg gaaatgcaat gtcttgccag aggagacggg 129001 cagagacgag cttccactca ctgatgcaga ttaatcacca gcatagcttg gtgctttccc 129061 taggcttttc tgcaaaagtc acataacctg cagtgttcat tcctgcaggc aagctccaag 129121 gagggttcac ttacagtaaa gtgtgctata attaccatct tctctgtgcc aggagagttg 129181 agattcatca agaagccacc ctggagaaag gagaactaat atttattgtt tactctgtac 129241 caggcagtcc tctaagccta ttttgtagaa tatctggttg gatcctcaca acaaccccat 129301 gagatattta tataattgtt gtatgtaatt tttcccactg cacaaatgag ataagtaggc 129361 ccaacgaggt tgcataactt ggccagaagc acgttgctca caagcagtga agttgctacc 129421 ctctccaggt tgccaggccc caggcagagt tgtttccagc agattccaga tcaaagcaca 129481 cactgcaggc caagatgtgg ccagccataa agctagcttg cccatctatc catccatccc 129541 tccctcttta catccaccca gtttgaagca ggaagatgct gtgatgttca aacctcacaa 129601 ttcctctcct cccattgtca cattgtcact ctgcttcatt gaccccagac caagacctcc 129661 ctttctctgc aaaggaaggc ctcagcagaa actcaaagag gacctaaagg tcttcattct 129721 ttatttttat cactttaaaa acatggcttt agctaggcac agtggctcat gcctaaaatc 129781 ccagcacttt gggaggctga cgcaggctga tcacttgagc ccaggaattt gagatcagcc 129841 tggtcaatat ggtgaaaccc agtctctatg aaaaatacaa aacttagctg ggtatggtga 129901 catgcaactg tagtcccagc tactcggagt ctgagatggg aggatcactt gagcctagga 129961 ggtcgagggt gcagtaagcc atgattgcac cgctacactc cagcctgggc aacagagcaa 130021 gatcctgtct taaaaaaaaa caaacaacaa caacagcagc aacaacaaca acagcaaaca 130081 gatggcttca ttgagatacg cttcatatac cacatgcaat acaattcatc catttgaagc 130141 gtataattca atatatttta gtacattggc aggttgcaca accctcactt tttaaactgt 130201 gtcatattat ctgtaaatag aggtaactga cttctttttt tccaacttgg atgccttttc 130261 tttcacttcc ttggctaatt tctggctaga acttccaata caatgttgaa tagaagtggt 130321 gagagtacac attcttgttt tgttcctgat tttagtggga aaggatccag tcattcacga 130381 ttaaatatta atatgatgtg agctgtggac ttttcataaa tgctcagtat caggctgaga 130441 aaattccctt ttattctcag tctgttgagc acttttatca tgaatgagat tggactgtgt 130501 ctgatgtatt ttctgcactt attggtatag aggttattga tacagaagtt aattcttggc 130561 tttagagttt ccagtggtaa ccttcctagc cctgggatag aaagtaactc cttggactat 130621 ggaagagaag tccgagagga gaggatgatg gcagacacta ttgagttccc tggagaagag 130681 ctctctgcct ggcccccagc agagcctcgg ggtagagggc agaactcctg gggagggtag 130741 aaaaccaaga gtgaagggca catgagctgt atcccctgaa tggcctggga gatgacacga 130801 aaagagaatg cctcttgcca ccacacctaa ggtgtggtat agggaggacc acccactggc 130861 cctaggaagc tgggcttgat gcagccccct gaatagtttc aacggagaaa ccaaagattt 130921 tcacacaggg cagagagggg cactgatgtg tcttggagga tggtaaggct ggagaggcct 130981 tgggatgggc acatatgtca ggctcggacc acgggacagg gaccaaatgg gactgaagat 131041 gcttatacag aagccaccct ggagggaaag ccatgcccag gactagccag gaagaacacc 131101 agggtaggac agagcacctc ccaggaccct gaagcctctc cctaaatccc aaacctccca 131161 gatctaaagt caaccccagg ggaaggtgga gaaggaaaaa atcttgaact gggcatttaa 131221 cttggaattt cgaagtttta ccctgaaatg gcaaaattat ttttttctgt catcagcttg 131281 aaaaaaatct caatttagtt aataaaaaat aaagaaactt tcatttattt gctgcagacc 131341 tgaggatgtg atttaatcac tcaacagaca ctaataagtg cacagtgagg agcacgcact 131401 ggactcagcg tgcggtgagg aaggcgctac cgctgtcctc aggatccacg ggtgcaggag 131461 cggggaaaaa cctggtcggt gctgagagga gactggaggc acggggacgt gttacttgag 131521 actggaggga gtggctgagc ccagagctca ctggcccctc cccatcctca gacagaggag 131581 tgaagcaacg tttctctggg tggtttctgg ctccctggtg ctcatggtca ggacttctca 131641 gctggaccaa acatttcttg gaaagctctt tgatgccagc ctagcagagt gggcattggg 131701 gatcagaacc ttggagggct tgaatcagtg actcatgagg catcccaggc atcagaaggg 131761 ccacgtcagt gtctgcacca tcctagccca cgcttgctaa ttgatgtcca cttggaggtc 131821 caaaggccaa acatctgcag ctctcagggc ttcacatgca gagatgaggc ccagggaaca 131881 ctgtagtcat cctgtgttca cctcctggaa acagcaaaca acccagactc agagaagaag 131941 gcaggcaaag aacagaaacc agaaaaatca caccaggcca ccccttcctc taacccctat 132001 gttttgcact ttacagttta caaaaacact tttgcacttg attcttgtgg aagctgtgca 132061 aagtaggtaa gggagaccag tgtggccatt ttacagatgg ctcagggggc actgctgcct 132121 gcaagagcac agggggcact gctgcctgca agagcacagg tggctgatgg cctgcgtgaa 132181 actggaaccc aaggggaagg cctccctaat gtgaagttct tgccaccaca gagaaggaaa 132241 agggcaacgc caagggagat ggtgagaccc aggaaacagg aaaagaagtg tcacggggcc 132301 caggtgttct caccacaggg ccaagatcgt gcactcccag acgcggattc ctgcggaggg 132361 tttagttgac ctctactgcc accaagtggt ggttctgcag ggtatcagcc tttgggcgag 132421 ggatatgtat atacacacat atatatgtgt gtgtgtgtat atgtacacac acacacacac 132481 acatatgtat tttttcccct gtagggcagc agttctcaaa gaggacccac gaggtcaaaa 132541 ccactttcat agaacactaa gatgttatct gctcttttca atcttatgtt ctcacaagca 132601 tacggtggag tttcccaaag gctataatgc aagtgataac ccatagacta aaagcaaaag 132661 cagacagagg aacacaattg ccctctgttt aagccagact tcgccgcaaa aacggaaaac 132721 gcaagtcttc tcgttaatta tattggttta gaaagaatcg ttattttcat agaatgttat 132781 ttatgttaaa atgtcatggg tttattatta tacctaaatg gataaatatt tttgaaattc 132841 tgtttgaatt tccagtaatt gttggtaaat atataagcag ttgttcttcg aggtccttaa 132901 taattttcag gagtgaaagg ggctgctgtt gaggactgct gctgtatggt aatgattttt 132961 gttcttcttt tcatctttcc tggtaaggca aattggccaa ccctgaagcc ccccaactcc 133021 aaggcacaga caccaaaact ggtgcttggc ttggaattct gttttgtttt catccttctt 133081 aaaaatagga ttccatttgt ttaaagactt tcctgaccat gacaacagaa attaatgcct 133141 gatactaagc tccctgaggg cagggtccct gccattgtca cccaatgcct gtaacagcac 133201 tggccacagt cagactacac acacacacac acactctctc tctctctctc tctctctcac 133261 acacacacac acacacacac atatatatat tttatatata tatatatttg gttaaataga 133321 accataatgt gttgatggag gaagactcac aagctgttgg cccaactatt ccacaagatt 133381 gccctctcca gaaaaatcta agaggaagca gatccattgc tggctgaccc ctcccagatc 133441 tggctacctc atgagatttg tttgggctgg agctgagttt cttcaataaa gccgagtttc 133501 attcattgca gcccaggtgc tcctgggtgg gagtgaaggt gcagaagatg tgctgaggca 133561 ttgtctgccc agcatggagg agggcggggg gagggggtgt ggcaggggct gttactcaga 133621 gctgggctga ggctgggcag cgaggaagag ccagcccgca gcctggaaaa caagtatcca 133681 gcagggagcc ctagctctgc aataaggtgt gcttttattt taagaaatta taaaaacaat 133741 atatgtgtgc tgtaaaaaaa ttaaaacatt acaataatca ccatcatcaa atcaccatca 133801 ttatagtgat gtacaaagaa atcaccaaca tcaaattcgg aactgaaagc cttctcccaa 133861 tcccacatcc cagaaatagc cactattaaa tttggtgtgt atccttcaag ggtttttcaa 133921 tgcccatata attacccata tacgatatat attttaaaat gaaaatgaat aacatcatat 133981 atgttattct gcccctccat ttctgcactt acaatgttgc tgacatattt ccatgacaat 134041 atatgtgaga tctatctcat tctttttagt ggctgtttgt tcaaccccct cttctattga 134101 taggcattta gaaactttct gaattttcat tattacaaac aattctcaat taacatggtc 134161 atgcttgcat ttttgcccat taatttatgc ttcctttatt ctaaaccctg ttcattcttg 134221 ctgctatgct gaatctaagg attgggaaga ctcagccctg gctctgaagg ggggcacagg 134281 ccagtgagaa agagagggtg cacagacatg actgtagcac agggagggac acagtgccag 134341 aggggtcacg ttggtggaag gaccatggat gctgctggag gggagagaga gagagaagca 134401 aggaagccct aacatctaag ggcttaaagc atctgagctg ggtcgaaaac agggagcatt 134461 cctgttcctg gtgatgctgg gggtaagagt gtcctgtagg ttaaagaaaa gcatagaggc 134521 agggaagcct ggagggtgtt gggcacacag cagctggccc agcaggcttg tgggatcact 134581 tgggaaccct ttcagcaaca tttaaatgcc cattgtatgc ctggcactgg atgggatggg 134641 ggaaatgatg agagcacagg gaacctggac cagagcaatg ggaggtgggt ccagaagggc 134701 agttctgatg tggttgatgg gaggccttca atagcagttt cagacgccat acttatgtgt 134761 cccaacaact ttgtcttttt tacaccgtaa aatagccatc atcatggctg ctggcgatgt 134821 accctgggct ggacagcagg gaagtgtagc ctcaggaaga ttatcaaatg ggattggtgg 134881 taaagtcctg agctggggct caggtatgtt ggagcctcag ttccttcatc tgtgaactgg 134941 atcagcaatg cttctctctg catgcctcct tctaccatca agaggtgcca tgatgattaa 135001 tagaaactag gaaaagctca gacacctgaa gaaggtggtt gttgcatctc agcctcatca 135061 ggagcaccca gtagaagacc ccagttggcc ttgtggacac agcaccactt ctaaacaaac 135121 tgtgcaagca tgcctcgcac cgggaaatgg tccagttgag gcagacttaa gaaacacatt 135181 ctaggatgtg ttagagcaga gacttcaggt ctacccaggg atgaaccagg tgccagaagc 135241 atctcttcac tcgcatcctc cgtgcccacc cagacccagt gatactgggg cagacaggca 135301 ggctggtctc tcccatcaat agagcctctc caatgcacca gaatgacatg ccactggagt 135361 ccctcatggc tgtccccacc acctaggtca gcatctgctc agcaaatgtt tgatgactga 135421 acaaatgaat gaatgtccca ctgctggagg gtgggcagct tgaacagtaa cttgcagaga 135481 tacccagggg accctgcagt cccaggggct gtgaaggaga aatattgagg agggcatgct 135541 gagggtttta tttcaagcat ggaaccctcc gctccagtaa caggtttcac accgaagaga 135601 cggaacactt tcaactcttt tgcaccatag aattgctttg gtcttcggtc tattaagaca 135661 gagatttttc agagacatgg cccaaatcac tcaaaggcag ggggtggttg gggaacagcg 135721 tagtggagaa accaaggaag accttttcaa gagaaagaaa aaagaacaaa tttgaggaca 135781 tttccttcac tgggatttaa gccagggatg gcaaatagat tcattttgta tgacagttct 135841 gcacggctgg tggtggctgc cacggcactg tgctgagaaa gactctgagg cgccactcag 135901 ctcatcaaga aagggctcta tgattgacta ggaatgcttg ttgtgggcac agaattggaa 135961 gaggctgcag ggacacaagc cacataacag caaaatagca atggcctccc cacgcggctc 136021 tagtaaggaa taaaatttag aacaatgaag tcactctgga agccagactg tggatcaagt 136081 gttttgtttt gttttgttct ccatgtaaaa ataaggccca gagagtgccc accccaacca 136141 agcccagggc caaactctct gaagcaagtc accaacgtgt gccactgctc agggtgggag 136201 aacccctaga aggtttctgt ccctcatcac cctgttcaac cactacgccc tctgctgccg 136261 accttaggca tctgggaaaa aagaagccag gctgccccgg ctcagtttct ctcctcacct 136321 gtgtggaggg gttttccctc tgggcctgca acacccacag aggcctgaac agagagagag 136381 gaatgacagt ggccaggtgc ctcttggaag ctaggcagag gcatctgaca gagaaacaat 136441 gtgaagaggt ttttaatgct tttattttga aacttagagc caactccagc tcttccatga 136501 attcacagtg agtttacaat ggccatgggg ggagggtcta ttttactcca gttttgtttt 136561 gttctgttcg tttgtttgtt accaggtgca ggtccaaagc tggaatagtt ggtgatcagt 136621 tgaccaatgg tcagttgaat ccctccaagg cagaagctcc gctgccccag ccggggtggg 136681 gatgagaatg cttgtaattg tgaaggaagg aagaggttct gaggacagcc tgcctggctg 136741 aaagcccacc tggtggagag tctttctctt cagtaaattc acatcatggg ttctgcagac 136801 acttttagca ttgacctgtt aatgggatga gaccctggga aaaaggagga aagacaagaa 136861 catcttctaa cgggagtctt gaaatcttgg atcttggggt caaacaaccc atttggagac 136921 tgggtttgtg aagcagtaag aggcaaagta cagactggca gaggctctca gggttgccgg 136981 ggctatttga aaatgttcag caaaggtatc ataagagcac tatgtcttct ttccttgaaa 137041 atcagttttc ctgaaaccga aagtaggcta gtgagacctg cagcatgcat gtcaggaccg 137101 agggcagacc agcactattc cagggagctc tgggggctgg tcttggacag gtagggatgg 137161 cagagttgga ttcatgagcc acagacaagt ctgagagcca gccactggta agtggtcggg 137221 atgccacttc taccaccaaa ggtcactgat cttcccgcat ggaacttgga aaagctgggg 137281 gatatctcca tgcctggttg ttggaaggca cctggcaact gctgggagct gggtgatcta 137341 tcccctgcag gctcttagaa acaattctca agtcatccct gtcatctgta ctcaggaact 137401 ctcgagatgg tccccatccc gccacttctt agggacagct tggcacccta gatctctaat 137461 atacatcatc tcactctgtc ctgggtctag acaatgaggg aggcattcaa agagtaggtg 137521 tgactgctgc ctgagcatgg gcttccccaa atctcagagc tacttggtaa attaaaatcc 137581 agtcttgtta aaaggccttg gtatttgggg gttagagcag gaagttgaat gccatgggca 137641 agacttgttt aagatgaatc tgcagaccag tggctgggaa cctccctctc ccaccaggag 137701 gcaatagcct cctccctgcc ttagcagtcc tccctcctaa gggagacaga catctatgcc 137761 ccttgcaaga aagaacgctt ccagaaagaa aagcttccaa ttgggttggt cactgcttgt 137821 gaaaggctaa atgcaggaca atcgtgattg ggtagctaga aatttcctcc tttgtcctga 137881 agcagttaat caccagggtg gtcacagtgg caggtacgac aggttgggca ggcatccagc 137941 cactctggga tctgggaggt cttggcatta aaaccatctt gtagctgaag atttcatgac 138001 atgggaaaat ggtcccaaac tactaaatgg gaaaagaggg ctgtaaaaca ctgtgtagaa 138061 taggatccca atttgaaaat acacataaga catacttgga agaataacaa caaggtcttg 138121 tgatgacagg tcttattatt ttgttctttc tgtttattta aattttctat ttttgtctac 138181 aataaaaatg cctggctttt gttataggat ttttaagaaa ttacttttct ctgttaaaag 138241 ggggaaatgg ccaatatctc agcacagctt taagaataag atgaattcct tccctatcct 138301 ggatgccacc caaatgcagt aattcttcac tcaagatgtt taattatgta aactgtgttg 138361 ctaccctttt ccttttataa gaaagggcaa atagccagag aagctgtgaa tgttgccact 138421 ttagcaaaaa gtggagagca ttttatttga agccagcttc tggggcttgt cctgggctga 138481 ctacagtctg cctgattggt caggccccct ccctttgtcc aggttctggc accacccgtg 138541 caccctggca gggctctgcc gtcccgagaa tgcccctgcc agctcacggg ggaggcagca 138601 gtgaggtttt atagacaaag atcatccttc ccaggagaat ttgtgctaac caagactgat 138661 gtcaaatggc ttcttggttg actagaaagt tcagtccaca cttactttgt cttctgagct 138721 ttgtgggtag ccggtatggg gccagacaat gaggaagaaa aaacacaggg aggaagagga 138781 ggcctgaggg gatatggggg ttggtcatcc acccatccga tgacactgct gagtggccct 138841 ttgtgtgctg ggctctgtgc caggcaccat tagtgcaaag gtagggacag cgcagcagcc 138901 ccacctgggg agtgagggtg tcctctggca tttaggatgc agtgtgggtg ggggtggggg 138961 atggtctgca gaaggcgcga ctcaccctag ctgagaaagc aagccggcat cctggaggag 139021 gagacttcta cagcctggat ggcaaagaac tggtgaagcc aaggacaagt aggaaaggag 139081 cgagaagagg atgtcaggca gaggggacag cctgagcaaa gacctagggg tgagagagac 139141 ggcgggacat tcagaacagt ggtgaggagt gaggctggcg gaagcagggt gaaggcttta 139201 tctagagggc aatggggagc caaggagggc atgtgagcac aggagtcaga ctgctgtctt 139261 tagaagagtc ctcaggctac agtgatcttt ctgcaatgag aaagaaaaag agggtatggg 139321 aggcaaagaa tggtggcaga gacctgtcct ctagggctct ggcagcctgt gggtggctct 139381 cagagtcagg ctgtgcagtt cagtagggag gtggggagac ggtgccccct gcaggtaccc 139441 tggttgcagt cctcacttgt gagcctgggt catttgcatc tgcacagtga aggtaaaagt 139501 acttctctga aagggctgtt gaagctgctg gaaggggaaa cggagaggcg agctgtctgg 139561 aaacaggtat attctgtgcc agacttggtc cctggggacc cagcacacca ccagtcactg 139621 gtgggcattt ctgaactgct gtttggtgac gaaaagggtg gcagccccag cctaccacag 139681 attagatatg tgacaaacaa caggacattt cagaaggcca caacattctc atgataaaaa 139741 aaaaaaaaag tttcaagttc ctgggaggca caattggcca ggatggaagc acactgacct 139801 ggcgcttatg ggctcgtcac tgcgtggtcc ccgcggatag cctgcagcga cagctcgtac 139861 ccaaggaaca tggccgcact catggggaag ccccgcaccg cgttcacagt gatgcctctg 139921 aaaaacacct ttgcacaggg gccaggttat tgcaggggac tgacccacac tctctgcagg 139981 gccaccaagg aggggcagag aatagctagt ggtgctgcct acccaggccc ggtcccccac 140041 cagagcccaa aggacctgtg caggcagaga ccacacaggg aagtcccagt catgccctgc 140101 cctgacccgc gtcatgcctg gtggtgcttt tcctcccaga ccctgataca catgcactaa 140161 atactatagg taagctgttg ggccaggcaa acacctacat atgcttttta aattgccctc 140221 tctctccctc tctttcttct tctctcacat gtacacacac acacacacac acacacacac 140281 acacacacac acacacacac tctctctctc tctcctcggg aatctagaag ccctgagtag 140341 acagagcacc cagccatgcc caatccctgt agcaggggct tcacaggaca tcccaaaagt 140401 ggcagagaac agaactcgag ctgtgcttaa gactctgcag acatggggtt gaggcacggc 140461 tttgccattg agtcacttgg gccaatggca aactctctag aactcaattt tcccgactgt 140521 aaggggcggg taataataga attgacctga taagaatgtt ggagggacta aatgagatga 140581 ggcttttcag gcacttgcac ggtgcctggc atacagtaca agcatgtatt aactgctgat 140641 tatcatcatc accctccttg ttcttatcaa tagtctatca cctaataggg agtaggactc 140701 aacctccatc actaggatgc atgctcagga agaatggcct gtgctgatga gctgatgaac 140761 tccgcctttg ctactccatg acacctgtat ttgtcttcca tccctaggga gtattatgtg 140821 atgcttccca aacactgcct atgatcttcc tttcctccct caatcttctt gcttggtgtt 140881 gtgtgtcggc tcataaatgt gatgggattc caactctcaa gcagttataa cccattagca 140941 gagacaaggc cccttcagga acaatgacaa catgacccag aaaatgatga gagcagataa 141001 aagctttgga gagggggatt gtggggagga gctggccttt aatgcatcct ctggtaagaa 141061 acgaccacct aaaagcaagt tcccacacaa acgtaaatga cttgagaact gtcaggcaca 141121 gatagaaacc tactctgatt cctgtataat tttctaagtt aaagaaacag aagtggtcat 141181 tttgtaaagg tagaaaccaa cccttttcat ttccccagtt tcctcagtct ctgggcactg 141241 tggtatcaga gttggtgcag gcctctgtca ctggctagga tatcaggtgg ccttccttca 141301 ggagaggtgc cagcacctcg cagcagcctc cagggtgcaa catgtccctg ggaaaggggc 141361 tgatggacaa ggatccctag agatgaacaa gtggctcagc caagccctcg agctgttcag 141421 tcagggtgta agtgagatcc tggtaatatt aacactcttc tctaatgtct aacctttctg 141481 gaattccaca ggctcatcca gagtgactga ataacagcca ggaactgggt caaactctga 141541 gaggaaaatt ccatgccggt gcccacagag gttcccctgt gggtctgggc ttggctctgc 141601 tctgcaaggc ggaggcagcc agggcccagc ttggagtgtg agccttctgt cccacagccc 141661 cgggccctgc aggagctgtg ggccacttca gaccagcaga cagctcaccc tccagcagct 141721 gtggcctgct ctgctgcaga gcccacagaa ggcaaacaaa atgcagacct gccagctttt 141781 acccagagct ttcagagcat atgacctcca gaccgagtcc cccctcactc ccaccctaca 141841 aagcctcaat gtctccctat ggcctagggt cactctttcc tggcttgcaa cgacccacat 141901 gctctccctt gcactttctc tggacggcct ctcacagctg acactctggg aaggttaaca 141961 gcctgtcatt cccttaaggg gccagccttt ttcctccact gggcctcttc tggtcccctc 142021 tgcctggagt gcttcccagg cctggcttgc agcctggcag tgtccccatg gccaccccca 142081 ggtggatgat ggcccctcac cttctgtgca cttggtatct ccccatcaca gcactcctca 142141 cactgttctc caagtgtcac tgacttatct gtcaggaagg acagataaca gaggtggggg 142201 tgggattagc aatcacaatc agggcagtga atgcagggct gtgtggatgg tttgcctctg 142261 agaccccagc acctgcgagc gcttcgccca gagtagacct tccataaaca cattaattga 142321 atcagtgaat ggatggatgg atgagtaaat gaatgaatga ataaatgaat tttggacgtg 142381 aaaaaggaag agcgtctgaa ctgaaagagt cctggtgcct gcagttcagg tttccaggag 142441 tgacaggaag gctccagatt ccccagggac agcaacccag ggcagctggc cctggtgcct 142501 ccagaagctg tctacctaca aatgccccct ctgtctgcca ttcagcctca tctcactgcc 142561 acagcccctc cacagcagct gcttccttgg agccaagatt accaaggtgc tcaccagcct 142621 ggttatttca gatggcaact ctccaagtca gagctatggt ccatgtagcc caagacgacc 142681 ctaacagtgg gcagagaatg gctccaaaac taccagactt tccttctgtt tctacttggg 142741 gatgcctcga caagtccatg gacttttgat tttccccacc acaaaggctg caccactgat 142801 cattcctaag ttggttacaa agagctactc aaattgatca ccatcgtcat tattaatgcc 142861 tttctaactc tttcctgtca caaacttctt cctccatagc aaggtgagaa cagattggaa 142921 tgatgatggg ctttgggcct tgcactctat ctctgcgagc cacagtcaag aatgcttggg 142981 gaggggtgct ccattcagcc agagcccaag ccaactggga actggagcca cccctggggc 143041 actggaggaa tcctctctac cttctcctgc tgcggctgcc agaaaactgc ctttgagaca 143101 acctgattgt tttagatggt gtgattattt ttgtttatgt tactatttat gttcataagt 143161 aaacagtcac atatacaagc atacatagat atgtacatac acatatattc ttttatacac 143221 tgtaaatgtt tttgaaatgt atcacaagaa actttacggt aattgccttt gagcgaggac 143281 agggaccaca ggcagggcat cgagacagta tatcctcatg tagttttgga agcttcctat 143341 gttcatacat taccaatttt tgactgatta atacaaattt taaacaatta accttgaacc 143401 gtctgatcat gcagtggtta ctcaagctga gtattattat cttgacttaa aacctgtctc 143461 tgaaccaaca gtgaggacca aacgctatcg agattaagaa accaatcgtc tgacctcagc 143521 tgacttccag gacttcattt ttttaaaaga gcacagtaat tttcagagga aataatatca 143581 gcaatatatt ttgcctctac agacttttta cgaattatga cacattaata tttggggaag 143641 tagacaaaaa catctcaaat aaaaatgaat catccacttg aaacactttt atatccctct 143701 atactcattg gctgaatcac aaactgtatt tctgctaaaa aaaaaaaaaa aagaagaaaa 143761 aaaagataca aaagacaatt gaggccgcac cagggctggg cttcaccctt ctgtcagagc 143821 agctcttgtt aaaaatttaa aaaaggaatt taaaataatg agaataggaa agctt // LOCUS AC002455 29314 bp DNA PRI 20-AUG-1997 DEFINITION Human cosmid clone LUCA13 from 3p21.3, complete sequence. ACCESSION AC002455 NID g2337875 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 29314) AUTHORS Dante,M, Kramer,J, Smith,A and Elliott,G. TITLE The sequence of H. sapiens cosmid clone LUCA13 JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 29314) AUTHORS Waterston,R. TITLE Direct Submission JOURNAL Submitted (20-AUG-1997) Department of Genetics, Washington University, 4444 Forest Park Avenue, St. Louis, Missouri 63108, USA COMMENT SUBMITTED BY: Genome Sequencing Center Department of Genetics Washington University St. Louis MO 63108, USA http://genome.wustl.edu/gsc mailto:sapiens@watson.wustl.edu NOTICE: This sequence may not represent the entire insert of this clone. It may be shorter because we only sequence overlapping clone sections once, or longer because we provide a small overlap between neighboring data submissions. This sequence was finished as follows unless otherwise noted: all regions were double stranded or sequenced with an alternate chemistry; an attempt was made to resolve all sequencing problems, such as compressions and repeats; all regions were covered by sequence from more than one subclone; and the assembly was confirmed by restriction digest. MAPPING INFORMATION: Clones from the 3p21.3 region were contributed by Michael Lerman, M.D., Ph.D., at the National Cancer Institute, FCRDC, Building 560, Room 12-64, Frederick MD 21702 USA (mailto:lerman@ncifcrf.gov) SOURCE INFORMATION: This clone is from a chromosome 3 specific library described by Wei et al., Cancer Research 56:1487-92 (1996). VECTOR: pWE15 NOTE: this clone was originally named cos3. NEIGHBORING SEQUENCE INFORMATION: The clone sequenced to the left is LUCA12, 200 bp overlap; the clone sequenced to the right is LUCA14, 200 bp overlap. Actual start of this clone is at base position 197 of LUCA13; actual end is at 4280 of LUCA13. FEATURES Location/Qualifiers source 1..29314 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="3" /clone="cos3" /clone_lib="LLNL3" /map="3p21.3" misc_feature 120..339 /note="match to EST W48574 (NID:g1337098) zc42f05.r1" misc_feature 120..333 /note="match to EST W48574 (NID:g1337098) zc42f05.r1" misc_feature 311..424 /note="match to EST W48574 (NID:g1337098) zc42f05.r1" misc_feature 325..424 /note="match to EST W48574 (NID:g1337098) zc42f05.r1" misc_feature 443..836 /note="match to EST D81812 (NID:g1180443)" misc_feature complement(655..1060) /note="match to EST W48875 (NID:g1337004) zc42f05.s1" misc_feature 707..1060 /note="match to EST AA327812 (NID:g1980054)" repeat_region complement(1129..1189) /rpt_family="ALU" repeat_region 1406..1990 /rpt_family="ALU" misc_feature 2637..2711 /note="match to EST N36019 (NID:g1157161) yy01a10.r1" misc_feature 2833..2892 /note="match to EST N36019 (NID:g1157161) yy01a10.r1" misc_feature complement(2833..2893) /note="match to EST N24898 (NID:g1139048) yy01a10.s1" repeat_region 3930..4218 /rpt_family="ALU" misc_feature complement(4369..4491) /note="match to EST N24898 (NID:g1139048) yy01a10.s1" misc_feature 4369..4491 /note="match to EST N36019 (NID:g1157161) yy01a10.r1" misc_feature 4645..4776 /note="match to EST N36019 (NID:g1157161) yy01a10.r1" misc_feature complement(4645..4852) /note="match to EST N24898 (NID:g1139048) yy01a10.s1" misc_feature 4645..4803 /note="match to EST N36019 (NID:g1157161) yy01a10.r1" misc_feature 4726..5161 /note="match to EST H12210 (NID:g877030) ym12d09.r1" misc_feature 4796..5073 /note="match to EST Z42475 (NID:g565892)" misc_feature 4839..5277 /note="match to EST AA428700 (NID:g2110278) zv50d05.r1" misc_feature 4897..4986 /note="match to EST AA225784 (NID:g1847092) nc17f10.s1" misc_feature complement(4931..5330) /note="match to EST AA244358 (NID:g1875083) nc05h10.r1" misc_feature 5030..5449 /note="match to EST AA309740 (NID:g1962089)" misc_feature 5042..5528 /note="match to EST AA031739 (NID:g1501804) zk14e09.r1" misc_feature complement(5448..5938) /note="match to EST H29039 (NID:g899949) ym59d07.s1" misc_feature 5482..5896 /note="match to EST AA044002 (NID:g1521860) zk58a06.r1" misc_feature 8672..9899 /note="CpG_island (%GC=69.9, o/e=0.80, #CpGs=109)" misc_feature 10309..10525 /note="match to EST F11962 (NID:g706283)" misc_feature 10309..10656 /note="match to EST AA453909 (NID:g2167578) zx32e04.r1" gene 10358..12718 /gene="LUCA-2" CDS join(10358..11278,11802..11891,12308..12718) /gene="LUCA-2" /note="match to U09577 (NID:g1209015); match to U09577 (PID:1209016)" /codon_start=1 /product="human PH-20 homolog (LUCA-2)" /db_xref="PID:g2337876" /translation="MRAGPGPTVTLALVLAVSWAMELKPTAPPIFTGRPFVVAWDVPT QDCGPRLKVPLDLNAFDVQASPNEGFVNQNITIFYRDRLGLYPRFDSAGRSVHGGVPQ NVSLWAHRKMLQKRVEHYIRTQESAGLAVIDWEDWRPVWVRNWQDKDVYRRLSRQLVA SRHPDWPPDRIVKQAQYEFEFAAQQFMLETLRYVKAVRPRHLWGFYLFPDCYNHDYVQ NWESYTGRCPDVEVARNDQLAWLWAESTALFPSVYLDETLASSRHGRNFVSFRVQEAL RVARTHHANHALPVYVFTRPTYSRRLTGLSEMDLISTIGESAALGAAGVILWGDAGYT TSTETCQYLKDYLTRLLVPYVVNVSWATQYCSRAQCHGHGRCVRRNPSASTFLHLSTN SFRLVPGHAPGEPQLRPVGELSWADIDHLQTHFRCQCYLGWSGEQCQWDHRQAAGGAS EAWAGSHLTSLLALAALAFTWTL" misc_feature 10400..10764 /gene="LUCA-2" /note="match to EST R51257 (NID:g813159) yg70d12.r1" misc_feature 10400..10766 /gene="LUCA-2" /note="match to EST R51257 (NID:g813159) yg70d12.r1" misc_feature 10413..10898 /gene="LUCA-2" /note="match to EST T80285 (NID:g698794) yd03g12.r1" misc_feature 10491..10597 /gene="LUCA-2" /note="match to EST AA248196 (NID:g1878785)" misc_feature 10608..10766 /gene="LUCA-2" /note="match to EST AA248196 (NID:g1878785)" misc_feature 10785..11096 /gene="LUCA-2" /note="match to EST AA039550 (NID:g1515828) zf07b12.r1" misc_feature 10847..11278 /gene="LUCA-2" /note="match to EST W05774 (NID:g1278516) za89c04.r1" misc_feature 11800..11892 /gene="LUCA-2" /note="match to EST W05774 (NID:g1278516) za89c04.r1" misc_feature 12307..12867 /note="match to EST AA148827 (NID:g1721664) zl06c06.r1" misc_feature 12307..12676 /gene="LUCA-2" /note="match to EST H77861 (NID:g1055950) ys09a07.r1" misc_feature 12307..12424 /gene="LUCA-2" /note="match to EST W05774 (NID:g1278516) za89c04.r1" misc_feature 12307..12513 /gene="LUCA-2" /note="match to EST AA039550 (NID:g1515828) zf07b12.r1" misc_feature 12466..12977 /note="match to EST W07074 (NID:g1281087) za93a03.r1" misc_feature complement(12503..13034) /note="match to EST AA453401 (NID:g2167070) zx32e04.s1" misc_feature complement(12651..12738) /note="match to EST H82914 (NID:g1061584) yq46g10.s1" misc_feature complement(12671..13041) /note="match to EST H82914 (NID:g1061584) yq46g10.s1" repeat_region 14565..15155 /rpt_family="ALU" repeat_region 16401..16811 /rpt_family="MER" repeat_region 17049..17342 /rpt_family="ALU" repeat_region 17510..17799 /rpt_family="ALU" repeat_region complement(17956..18257) /rpt_family="ALU" misc_feature 17978..19669 /note="CpG_island (%GC=74.8, o/e=0.70, #CpGs=129)" misc_feature 18669..18738 /note="similar to EST AA043704 (NID:g1521580) zk50h09.r1" misc_feature 18685..18787 /note="similar to EST AA043704 (NID:g1521580) zk50h09.r1" misc_feature complement(18692..18762) /note="similar to EST AA147631 (NID:g1717002) zl52e05.s1" misc_feature complement(18692..18762) /note="similar to EST AA043705 (NID:g1521581) zk50h09.s1" misc_feature complement(18692..18762) /note="similar to EST AA047386 (NID:g1525450) zk69d10.s1" misc_feature complement(18728..18811) /note="similar to EST AA047386 (NID:g1525450) zk69d10.s1" misc_feature complement(18728..18811) /note="similar to EST AA043705 (NID:g1521581) zk50h09.s1" misc_feature complement(18728..18811) /note="similar to EST AA147631 (NID:g1717002) zl52e05.s1" misc_feature 18734..18821 /note="similar to EST AA043704 (NID:g1521580) zk50h09.r1" misc_feature complement(18777..18822) /note="similar to EST AA047386 (NID:g1525450) zk69d10.s1" misc_feature complement(18777..18822) /note="similar to EST AA147631 (NID:g1717002) zl52e05.s1" misc_feature complement(18777..18822) /note="similar to EST AA043705 (NID:g1521581) zk50h09.s1" misc_feature 18783..18823 /note="similar to EST AA043704 (NID:g1521580) zk50h09.r1" misc_feature complement(18919..18985) /note="similar to EST AA047386 (NID:g1525450) zk69d10.s1" misc_feature complement(18919..18985) /note="similar to EST AA043705 (NID:g1521581) zk50h09.s1" misc_feature complement(18919..18985) /note="similar to EST AA147631 (NID:g1717002) zl52e05.s1" misc_feature 18919..18995 /note="similar to EST AA043704 (NID:g1521580) zk50h09.r1" misc_feature complement(18951..18996) /note="similar to EST AA047386 (NID:g1525450) zk69d10.s1" misc_feature complement(18951..18996) /note="similar to EST AA147631 (NID:g1717002) zl52e05.s1" misc_feature complement(18951..18996) /note="similar to EST AA043705 (NID:g1521581) zk50h09.s1" misc_feature 18957..18998 /note="similar to EST AA043704 (NID:g1521580) zk50h09.r1" misc_feature complement(19091..19161) /note="similar to EST AA147631 (NID:g1717002) zl52e05.s1" misc_feature complement(19091..19161) /note="similar to EST AA047386 (NID:g1525450) zk69d10.s1" misc_feature complement(19091..19161) /note="similar to EST AA043705 (NID:g1521581) zk50h09.s1" misc_feature 19091..19137 /note="similar to EST AA043704 (NID:g1521580) zk50h09.r1" misc_feature 19093..19186 /note="similar to EST AA043704 (NID:g1521580) zk50h09.r1" misc_feature complement(19127..19210) /note="similar to EST AA047386 (NID:g1525450) zk69d10.s1" misc_feature complement(19127..19210) /note="similar to EST AA147631 (NID:g1717002) zl52e05.s1" misc_feature complement(19127..19210) /note="similar to EST AA043705 (NID:g1521581) zk50h09.s1" misc_feature 19134..19235 /note="similar to EST AA043704 (NID:g1521580) zk50h09.r1" misc_feature complement(19176..19259) /note="similar to EST AA043705 (NID:g1521581) zk50h09.s1" misc_feature complement(19176..19259) /note="similar to EST AA047386 (NID:g1525450) zk69d10.s1" misc_feature complement(19176..19259) /note="similar to EST AA147631 (NID:g1717002) zl52e05.s1" misc_feature 19182..19269 /note="similar to EST AA043704 (NID:g1521580) zk50h09.r1" misc_feature complement(19225..19270) /note="similar to EST AA043705 (NID:g1521581) zk50h09.s1" misc_feature complement(19225..19270) /note="similar to EST AA047386 (NID:g1525450) zk69d10.s1" misc_feature complement(19225..19270) /note="similar to EST AA147631 (NID:g1717002) zl52e05.s1" misc_feature 19231..19272 /note="similar to EST AA043704 (NID:g1521580) zk50h09.r1" repeat_unit 19276..19281 /note="poly C, ambiguous" misc_feature 19367..19396 /note="similar to EST AA043704 (NID:g1521580) zk50h09.r1" misc_feature complement(19496..19562) /note="similar to EST AA043705 (NID:g1521581) zk50h09.s1" misc_feature complement(19496..19562) /note="similar to EST AA047386 (NID:g1525450) zk69d10.s1" misc_feature complement(19496..19562) /note="similar to EST AA147631 (NID:g1717002) zl52e05.s1" misc_feature 19496..19589 /note="similar to EST AA043704 (NID:g1521580) zk50h09.r1" misc_feature 19496..19540 /note="similar to EST AA043704 (NID:g1521580) zk50h09.r1" misc_feature complement(19530..19590) /note="similar to EST AA043705 (NID:g1521581) zk50h09.s1" misc_feature complement(19530..19590) /note="similar to EST AA147631 (NID:g1717002) zl52e05.s1" misc_feature complement(19530..19590) /note="similar to EST AA047386 (NID:g1525450) zk69d10.s1" misc_feature 19536..19595 /note="similar to EST AA043704 (NID:g1521580) zk50h09.r1" misc_feature complement(19724..19794) /note="similar to EST AA147631 (NID:g1717002) zl52e05.s1" misc_feature complement(19724..19794) /note="similar to EST AA043705 (NID:g1521581) zk50h09.s1" misc_feature complement(19724..19794) /note="similar to EST AA047386 (NID:g1525450) zk69d10.s1" repeat_region 20282..20574 /rpt_family="ALU" repeat_region complement(21062..21158) /rpt_family="ALU" repeat_region complement(21374..21664) /rpt_family="ALU" repeat_region 21819..22391 /rpt_family="ALU" repeat_region 22421..22712 /rpt_family="ALU" repeat_region 22753..23043 /rpt_family="ALU" repeat_region complement(23157..23448) /rpt_family="ALU" repeat_region complement(23476..23939) /rpt_family="ALU" repeat_region complement(23951..24242) /rpt_family="ALU" repeat_region 24826..25010 /rpt_family="MER" repeat_region 25015..25595 /rpt_family="ALU" repeat_region 25668..25840 /rpt_family="MER" repeat_region complement(26315..26506) /rpt_family="ALU" repeat_region complement(26563..26859) /rpt_family="ALU" misc_feature 27295..27383 /note="match to EST AA223264 (NID:g1843909) zr08c03.r1" misc_feature 27865..28242 /note="match to EST AA223264 (NID:g1843909) zr08c03.r1" gene 27891..>28790 /gene="LUCA1" CDS 27891..>28790 /gene="LUCA1" /note="match to U03056 (NID:g532973)" /codon_start=1 /product="LUCA-1" /db_xref="PID:g2337877" /translation="MAAHLLPICALFLTLLDMAQGFRGPLLPNRPFTTVWNANTQWCL ERHGVDVDVSVFDVVANPGQTFRGPDMTIFYSSQLGTYPYYTPTGEPVFGGLPQNASL IAHLARTFQDILAAIPAPDFSGLAVIDWEAWRPRWAFNWDTKDIYRQRSRALVQAQHP DWPAPQVEAVAQDQFQGAARAWMAGTLQLGRALRPRGLWGFYGFPDCYNYDFLSPNYT GQCPSGIRAQNDQLGWLWGQSRALYPSIYMPAVLEGTGKSQMYVQHRVAEAFRVAVAA GDPNLPVLPYVQIFYDTTNHFLPL" repeat_region complement(28887..29176) /rpt_family="ALU" repeat_region complement(29222..29314) /rpt_family="ALU" BASE COUNT 6597 a 8122 c 8137 g 6458 t ORIGIN 1 gacatcagaa agcccagtcc tagcttgtgt gaacatgagg tgctagtctt ctctggggag 61 ggtctgctgg cttggccatc ccttctgcag cctgtacact ccccttttgc cccttgcagt 121 gggacgcctt cagcatgcct gaactacata acttcctacg tatcctgcag cgggaggagg 181 aggagcacct ccgccagatc ctgcagaagt actcctattg ccgccagaag atccaagagg 241 ccctgcacgc ctgccccctt gggtgacctc ttgtaccccc aggtggaagg cagacagcag 301 gcagcgccaa gtgcgtgccg tgtgagtgtg acagggccag tggggcctgt ggaatgagtg 361 tgcatggagg ccctcctgtg ctgggggaat gagcccagag aacagcgaag tagcttgctc 421 cctgtgtcca cctgtgggtg tagccaggta tggctctgca cccctctgcc ctcattactg 481 ggccttagtg ggccagggct gccctgagaa gctgctccag gcctgcagca ggagtggtgc 541 agacagaagt ctcctcaatt tttgtctcag aagtgaaaat cttggagacc ctgcaaacag 601 aacagggtca tgtttgcagg ggtgacggcc ctcatctatg aggaaaggtt ttggatcttg 661 aatgtggtct caggatatcc ttatcagagc taagggtggg tgctcagaat aaggcaggca 721 ttgaggaaga gtcttggttt ctctctacag tgccaactcc tcacacaccc tgaggtcagg 781 gagtgctggc tcacagtaca gcatgtgcct taatgcttca tatgaggagg atgtccctgg 841 gccagggtct gtgtgaatgt gggcactggc ccaggttcat accttatttg ctaatcaaag 901 ccagggtctc tccctcaggt gttttttatg aagtgcgtga atgtatgtaa tgtgtggtgg 961 cctcagctga atgcctcctg tggggaaagg ggttggggtg acagtcatca tcagggcctg 1021 gggcctgaga gaattggctc aataaagatt tcaagatcct cctgctgttg gaatctttta 1081 tacatataaa gtttttgtag agacatgagt ctctctgtgt tgcccaggat cctcccaact 1141 tggcctccca aagtgttggg attacaggtg tgagccaccc tgcccagcct ggactcttta 1201 ttattatagg cgcagagctg cagttgcccc tcatggtgcc agaagttgcc aagggtgatg 1261 gacaggctcc caggtgtctt gcaaagtcac catggaccaa tttgtgaaga tgtagtatgc 1321 atacatactt ggtcatcact cagctccctg gggctcaggt tgtggtggag acaaaaatgg 1381 actgcagtta gaacttaggg aaactggctg ggcatagtgg ctcacacctg taatcccaac 1441 actttggttg ggctaggtgg gcagatcact tgaggccagg agttcgaggc cagcctggcc 1501 agcatggcga aaccccatct ctaccaaaaa tacaaaaaaa atttagctgg gcgtggtggt 1561 gggcgcttgt agtcccagct actcagaagg ctgaggcagg agaatcgctt gaacccggca 1621 ggcagaggtt gcagtgagtg gagatcacac cactgcactc cgatagagca agactccaac 1681 tcaaaaaaaa aaaaaacggc cgggcgcagt ggctcaggcc tgtaatccca gcactttggg 1741 aggccaaggc gggtggatca cctgaggtcc ggagttcaag actgcctgac caacatggtg 1801 aaaccccgtc tctactagaa atacaaaaaa attagccggc atggtggcag atgcctgtaa 1861 tcccaagtac tcgggaggct gaggcaggag aatcgcttga accctggagg cagaggctgc 1921 agtgagccga gatcgtgcca ctgcacatta tcctgggcga caagagtgaa actccatctc 1981 aaaaaaaaaa aaaaacaaaa ccatcccttc aacacacaca caccacgctc tgggagaagg 2041 tgtggcataa ctccttcacc aaatacagag ctgccaccgt ggaccagaca ctgctcgtga 2101 taccgagggt atagctgtta acaattcttg ctttcattaa gcatggactc tgctgggttt 2161 gaaaacactg aattcgaagt tcttcagaac tgaatgtaac tatgtgaatc tggccagttc 2221 cttaattttc tttcaacttg gttagttcac ataagcgtgg caatcgcaaa aatacagctg 2281 tgaaaataga agccagatgg gcacccggcg gtctggcctt aggccctgaa gtgcaggttt 2341 gaggattggt gcttgcgaag tcctgctagg cctgaactca ggtgttgggg gacgtcagag 2401 ccgccaaata cacccaaaag accgggagga ctcacggcca ccactttcct cggtgggagc 2461 tgtcccagct ggtcagatcg cgcttgctgg gacctgggat ctcgcaacgc atgctgggat 2521 gcccagcatc taagggcgcc cattggtccc gcccccacga cttgagcaac agccaatcag 2581 aggtggcagc gtgcggaagc ggaagtgagg tttccgtgga gacagccgag cctgcggaag 2641 gcggcggcgg cggcacctgc gatcagcggc tggggcaggt tatggtagtg cggactgcgg 2701 tgtgagcaga gcggccacgg ggcccgccat gcgccggcgg ccctgacatg ggcgccagcg 2761 ggtccaaagc tcggggcctg tggcccttcg cctcggcggc cggaggcggc ggctcagagg 2821 cagcaggagc tgagcaagct ttggtgcggc ctcggggccg agctgtgccc cccttcgtat 2881 tcacgcgccg cgggtaaggg catgggttcc accctggcgg ggggaacagg cgggcggcca 2941 ggcgtcccgc gccacggggg aacttccacc gctgtacccc actacagcca agccaggacg 3001 acccccatat tttgagcctc attggagctg ggggtggaga aagccgggca gtggtctcct 3061 ggcggcctgg ccactctgaa agtctcccta gggaagagtg ggcctgaggc ttgtcactgg 3121 tcggaccttc cgcacggtaa agctaagcta gggcctccat aagacctcca taagatctct 3181 tcctttcatc ccttccggct cagcctgcat actgaagggt cctagcctcc ctttcccccc 3241 cagttattcc aaggggataa taaccctctg tttaggctgc ttttcctgac agtagtgggg 3301 ctgaaccttt gagcgtagtg gctgggaaag cagggatagg cctcagtatt gggggtgagg 3361 ctgaggaaaa aagaccctca cccagttatg ttctccagat ccagagcgct gacctaccca 3421 cccaaccccc agcccagtct cccagaggcc cgcacaagag taccagattg gcttggctct 3481 agtgggctcc tggagatggg cgctccctgt gctagagatt tggctgtaga caccaggaag 3541 acccaacagt ggcaattcac cagccacctg gggtgtgggg tcaggggaga cgcagccttc 3601 attcctttgt gcttcttcct agcttaggct tctgctattt cagtctttga cactgggcca 3661 gcagcatctt ctggagtgag accaacatgc ccaggccacc accctgccca ctctagaagt 3721 cctgggtgaa ggcaaagtcc tgatattaga atggctgccc ttgctgaaca aatcaaggat 3781 gagaactggc cttggtggct gcctggatgg tgtgctctgg tgtgggctct cttcttgggg 3841 gtctgacata gtagcactgc ccacccatca cctcacttcc cactttcttc tagtagaccc 3901 caaaccccag gcaaaggatt ctccaggcag gccaggcacc gtggctcacg cctgtaatcc 3961 taacactttg ggagatgaag acgagtggat tgcctgagtt caggagttcg agaccagcca 4021 gagcaacatg gcaaaacccc gtctctacta aaatacaaaa aattagcagg gtgtggtggt 4081 gcatgtctgt agtcccagct acttgggagg ctgaggcagg agaattgctt gaacccagga 4141 ggcagaggtt gcagcaagct gagatcgcgc cactgcacct ccagcctggg caacagagtg 4201 agattctgtc tcgaaaaatt ttttttaaaa aaaggattct ccaggcatat tatcctcttt 4261 tcttcccact tggggttggg aagaccacag acttccccgt gacctgtgac atttgccatg 4321 cccaggtgtg gggctccagc ccagcctgtt tccttctccc tctacacagc tctatgttct 4381 atgatgagga tggggatctg gctcacgagt tctatgagga gacaatcgtc accaagaacg 4441 ggcagaagcg ggccaagctg aggcgagtgc ataagaatct gattcctcag gtgaggggct 4501 gggcagcggt cacagagcct acactaagcg tgtggaggtg gacccagatg ggatctgttg 4561 tcatggtggg ggcttttgcg gggggatctg ggaggaacaa ggaagcttcc tgaagtagag 4621 gccctgagct gacccttact agcctctgtt tccatggcag ggcatcgtga agctggatca 4681 cccccgcatc cacgtggatt tccctgtgat cctctatgag gtgtgaccct gggaggtggc 4741 agacagaagc accccctgcc ccggcaagaa actcccaggc tcaatcaagg tgtggcttcc 4801 attgaggagc ccaggctggg gccacaaccc tgaataaact ctgttggccc ataaccttca 4861 gctgtgagcg ggtcggtccc acagtattgg ttgggtgttg gtttgtgtgt ggacaagagg 4921 tggttggtgg gtggtgaagg ctaatggcag agttagcacc ccactctccc aagccacccc 4981 tgcaagcagc acagcagggc atataccagt caggaatgcc cgttacctgg ttccttgcct 5041 ggtctgcttt cttccaagtt tgcctggggc ctagccctgc tagaggctac agcactttac 5101 aagcaaggta tgctttcttc cagcccctag gctgtgggca ctgtatacaa gtaggaactt 5161 cctttccttc acttcccttt taacccctag tcagagcatt tcagccgttt gctacctcga 5221 ttcctcctgt gttggacaga ggctgggggc agtgccagcc tgattcttcc gacctacctg 5281 ccatttgttc ccgccttcag atggatggac agtttgctgg ctattgatag gagtggggac 5341 tgggtggggg cttctccctc tacccagggc tgggctgatc cccctactgc aactaactgt 5401 tgccccccaa ccccgaaccc ccagttgagg agttgagaga gtgcaggctg gggtcaggac 5461 aggctgcgga tgcttgtgcc tatggggagt tactccaacc cacctattct gtctaatctc 5521 catggctttg caccaaatcc tccacccctc caattgggag gggactgttc accaccttgt 5581 ggtaagggac aacaccctaa ggctggtgcc agtagttatg agtagcctac caccccctcc 5641 cttacagtaa cccccacccc ttcaggatca gtcaagggaa agcactagaa cccctgggta 5701 gggaaagaaa ggagggaaaa accataaaag gaatacttat aatgtgaagg tttgtaaata 5761 gtccatgatg atgtcgtggc agagtctgat ttctatatag aggtgacttt ttttttaagt 5821 actgtgcaag ctctgtgctt ctataatgtg ggaaatggct tggggaggat ggcccctagc 5881 ttaggaagac tgttgtgtta tttgttcaat ttcaataaaa tgatttgtag atcctgcaca 5941 tgagagtggc atcctgtggg ggctctgaca cccagtgtct agactaccac ctgggttcaa 6001 taggtaggcc tggcacagga ctctggcagg ccccctatga ccagcaaagc ccaagagtaa 6061 gtctgagccc agcctgtggc catcccaact gtggccagca ggtggctgtg tgttcacaca 6121 gacctcactg caggccccct gcagggtggg agggcactca tttatctgtt gtgatccatg 6181 tggtggtgac ccttggaatc agaaggccct gggatacttc actaaggatt ggccccacct 6241 gcctgatcta aggtgagaga gtgtgaccca agatagttct ggaatatctg tcttagctgt 6301 agcatcgcca gctgatccta agtggggtca tggcctgacc cccatggacc tcaaaggccc 6361 agctttaatg gacagcagcc agataggtgt ccccacgcct ttggcctact gccaggctca 6421 gcaacccaaa ctcatgtgcc tcctctgagg cctgcctgcg gacaccatgg gagaatgagc 6481 tcaggctgct gtttcagcaa gcaaggctcg gcccactagt attgcccctg ctgtctgttg 6541 cccccatctt ctcatcacct gcagcctgcc cactgtggtc attctgcctg cacttctcag 6601 gggctgctat aaggcagatt ctctactagg cttaggcttt gactatcagt ccaactccca 6661 gttaaagagg ggagatttca gggacaggct tctgagtgta gggagctggt ctgccagtct 6721 ttcggaggtt tgaacttgtc aaggctaggg caggatcacc atatccagcc tggacttgca 6781 gttctgtggg gtgcctcccc atacccccat aagatgccaa acatgaggcc ctgtcatcct 6841 ccatggtccc cctctactgg ctgttcaagg cccagggctc tcccatgcca gatagcatcc 6901 tgtctcctac caccactgtc ccagcctgag ggaactccct gtgctgggcc tacccagctg 6961 accccatcgc tggaaacaat gggggtcagg caacacttcc ccactctctc ccgccgggct 7021 gtgctcactt ccttcctgct ggctgcctga ggaagtgtcc ctgccctggg acagtctggc 7081 ctagcctttg tttccccggg ggtccccacc catggagctt tcaaggcttc tggcccctgt 7141 gaagccagca cagtggtaca gggacactgc accttcccct accatgtctt ggactccttg 7201 gtcctcaggg ccagactcct gggtattcac ctacccctca cacagccttc tgttggggga 7261 gaggctcctg ggcattgggc cattgggttg ttgaggggtt ggctctgggt gatttgaggg 7321 taggtagatt tgcacccctg gagaggtctg ctggaacctt gctgcagtcc tccctagggc 7381 cagccacagg aacacccctc ccagaggagt ataccccttt cctcatccat gttgtaagga 7441 gccccctact ctgtatagtg gactctgaaa tccatccagg aatgaactcc agcaggaaag 7501 cactctgcag aaccctccca gtccatcccc tcctccccaa cccacacagt cacactcact 7561 aacacattgt cctgtcacac aggatgcgga agctctcagt ggaaaaaaac ggactcagct 7621 actggaagtc cccccgaccc tccccccaag gctagttccc ttcttgggca cctgctctgg 7681 gggaccatca gctgaacgac ccccaagtat tttgactccc aaaagcacca ccacctgacc 7741 ccatcctctc acaccctact ggatttgagg atgggcccca atcctaggga aggagtgaag 7801 aggttcccta gtgttggaag ctgtgggtgt gggggagatt ggcacctgat cctgagccca 7861 tagccttcct gtcacctggc gcagctggcg gggccagatc ctactcggga agggtgggga 7921 gggcagccag ccagcagggc attctggagg gaaacagggt caaggcgatc tcctccccca 7981 cgcctgttcc tggccctttc ctctcagggg gcagcaggaa gtgaggagaa agggctggga 8041 tgggaggcgg gagcggatgg gagggaatgg ggtttatcaa gtcctcggcg agctgcccaa 8101 cgggcagcag ctggcgcaag tagcctagct ggagaggctc accccaggaa ggagggaggc 8161 caccgaccta ctgggccgac ggactcccac acaggtgagc ccagagcaga cggctggtct 8221 gcacccccac agatgcgctc gcagttgcac tccctccctc tcctggcgcc cgggagggtt 8281 aggggctggt ggtgcagacg cgggcccttt tgggagttga gtctcgcaca agggagcgga 8341 cctaggaaga gccgaggtgg tttcgcacgg ggctcgccag ggtctaagcc tgccccccac 8401 cgggagaggc ctgtggagcg tagggggcgc tggatacggg atggaggccc tgggagaccc 8461 ctcttgctgg ctttctcgga ggtccagcca gaaactgctg caaggaatgg aggcctcctc 8521 ggggttgaga gggagccggg ctcccaaagg acctcagaga ctgggcagaa aaggacggga 8581 tctcagggat gactgtcccg ccctgatgcg agtcagggag aggggcggcc aacctcctag 8641 acccctctga gcttccctga ccccaaccct gcggccacgc cgcgacccag agccgggctg 8701 ccaggataac gactgcctcg gcccttcctg ggccggctaa gaagcggtgc ttggcccctt 8761 ccctcagtct ggcagggggc ggggcctccc tttagacggc ggaccagaga agggggcccc 8821 tgattcgtgg gaggcggggc actactctcc aggagaccag aggtcgcctc aggtcaaagt 8881 ccctttttcc acacaaaggg gacccacggc tggcgtctac gttagggggt gcagagccag 8941 atctggtgct gccccctgcc aacctcggag taccacagca cctcctgatg gccgaacggg 9001 gcaacgcctc ctcctattcc cccccccccc cttcccgtcc ccctgccttg tccctcacac 9061 tgtctcttta aagggctggc ggcgccgcgg agctgggagg actgaaccac cggcctcggg 9121 ctgcagggga aacatttcag gctgactggc gctcgtggct gagactccca tagaaagccc 9181 ggctcagagg ggcattaggg tcctaaatgg gcggccacgt ccctctgcag aggacctggg 9241 gctcttcgag cccgaaacga ggcaccggca ccgagaaagg tggaccacac cttcccgccc 9301 cgtccgcaag tccaatcccg ggcccacctc cgcactggag tcttaaaggg ccagcgtgcc 9361 tgggggcgga gccagcagag gcgctgagcc gggccgcgcc tgggcgaacg gccggagcgg 9421 gctgggctgg gcccgggatg gcggtggccc tggcgccggt cccggtggcg ccccgcgcga 9481 ggtgagggcg ggcggtgcaa acctggcggc tctctcccct tgggctgggg tctgaatccc 9541 cgggggtgct cgcggagagg cgtcccagaa accccacccc cacccgaccg ggcgcaggcc 9601 ccacgtgtgg ggcgggggcg gggtcccgca caaagacccg gcggagcgcg ctctagccct 9661 gagcggccgg gcgggggagg cgagcgcgcg cattcccggt ggcggtggag ggaagggccg 9721 ggcggccggc gccgccgtgg gaggtccgct gcccctttgt ccctacgggg cctcctccaa 9781 gccgggagag tgtcagcgct cgagagaaag tccggagagc ctcactcttc tgccgggcga 9841 gtgttacacg gatagaagcc tcccggcagc gttccttcca gtttcgtagc ctcttgacga 9901 gctgttccct gctttaccca aatgctgtcg tttctctgga tcaagggttc ttcacggtgt 9961 acagggtggg catcagctgt tcagggttct ctgaaaccat gtactggatg gtatgcaggc 10021 atatatgtga ctctggtgag cccaaattat ctagttctta gaagggtcac agacctaata 10081 gagatttgtg ccgagccaca ggctacctgc tccagaaaag agccctgtgc ttctggcagt 10141 gagttcccag ggtgccttgt ctgccctgca gtggcctgtg gtctctcaaa cctattacag 10201 ccatgagcac agtgccccca cacagcatgg gctttgggag catagatggg cttgaggtgg 10261 gcacttccag tattccctag actgacttgt tctccccaac ccttcttcca gttcctgagc 10321 tggtgccagg caggtgacac ctcctgcagc ccccagcatg cgggcaggcc caggccccac 10381 cgttacattg gccctggtgc tggcggtgtc atgggccatg gagctcaagc ccacagcacc 10441 acccatcttc actggccggc cctttgtggt agcgtgggac gtgcccacac aggactgtgg 10501 cccacgcctc aaggtgccac tggacctgaa tgcctttgat gtgcaggcct cacctaatga 10561 gggttttgtg aaccagaata ttaccatctt ctaccgcgac cgtctaggcc tgtatccacg 10621 cttcgattct gccggaaggt ctgtgcatgg tggtgtgcca cagaatgtca gcctttgggc 10681 acaccggaag atgctgcaga aacgtgtgga gcactacatt cggacacagg agtctgcggg 10741 gctggcggtc atcgactggg aggactggcg acctgtgtgg gtgcgcaact ggcaggacaa 10801 agatgtgtat cgccggttat cacgccagct agtggccagt cgtcaccctg actggcctcc 10861 agaccgcata gtcaaacagg cacaatatga gtttgagttc gcagcacagc agttcatgct 10921 ggagacactg cgttatgtca aggcagtgcg gccccggcac ctctggggct tctacctctt 10981 tcctgactgc tacaatcatg attatgtgca gaactgggag agctacacag gccgctgccc 11041 tgatgttgag gtggcccgca atgaccagct ggcctggctg tgggctgaga gcacggccct 11101 cttcccgtct gtctacctgg acgagacact tgcttcctcc cgccatggcc gcaactttgt 11161 gagcttccgt gttcaggagg cccttcgtgt ggctcgcacc caccatgcca accatgcact 11221 cccagtctac gtcttcacac gacccaccta cagccgcagg ctcacggggc ttagtgaggt 11281 atgtgtctcc agggccctgc cttttttctc cttccttgag gatccactgc acatttgggt 11341 tatcaggggc cactatagga ctatacctgt agtttattgg ctgctccaca tcccctcaaa 11401 tcattctgtc tataacctga aagggtgaca tcattatccc catttttcag atgagaacaa 11461 ttgaggcaca gcgatgctga gaagtttgcc caaagcttcc cagcaaatcc agagcagagc 11521 tgggactggg gcccaggcct tctaactttg gaatccatcc tgatagctgc acctaggctc 11581 aagtgtgccc ttggcttggg aaactgagcc ccaaaagtgg ggtggaaatg gacttaaatg 11641 tttaaaaagg aagcaaaaca aatgagcctc tgcctggacg tgaaatgggg gttgagctgg 11701 gagttcagca ggttgaatca ggctggggtc tgccctcctt gtctgttcct gtcaccctgt 11761 ggtctcagtc ttccccagtg accatccctt ttcctgcata gatggacctc atctctacca 11821 ttggcgagag tgcggccctg ggcgcagctg gtgtcatcct ctggggtgac gcggggtaca 11881 ccacaagcac ggtaagcgag acccagcctg ccagagctca gtctatggga cagacagaga 11941 gttagggcct ttggctacca acctcaggtg ggcaatccta gtctggtccc cttgtctgtc 12001 tcccacccga ggcatccctg gtggcaacag gaggaggtca ccccggccct gctctcagga 12061 aagaggaaat gctgtcctga acagctgttg agttccgccc ttccccatct gcccaactgg 12121 gttccggtct gtacccccac ccacttctca gcaggagcag ggactgtgtg gctcaggtgc 12181 agattgaatg tatgaatgtg tgcagttaaa ataaggtata gtcatcacag tgtggacaca 12241 gtttccatct gggcctgtgg gctgaggcag ctgaccctga cctgtggtcc ttgtcttctg 12301 cctccaggag acctgccagt acctcaaaga ttacctgaca cggctgctgg tcccctacgt 12361 ggtcaatgtg tcctgggcca cccaatattg cagccgggcc cagtgccatg gccatgggcg 12421 ctgtgtgcgc cgcaacccca gtgccagtac cttcctgcat ctcagcacca acagtttccg 12481 cctagtgcct ggccatgcac ctggtgaacc ccagctgcga cctgtggggg agctcagttg 12541 ggccgacatt gaccacctgc agacacactt ccgctgccag tgctacttgg gctggagtgg 12601 tgagcaatgc cagtgggacc ataggcaggc agctggaggt gccagcgagg cctgggctgg 12661 gtcccacctc accagtctgc tggctctggc agccctggcc tttacctgga ccttgtaggg 12721 gtctcctgcc tagctgccta gcaagctggc ctctaccaca agggctctct taggcatgta 12781 ggaccctgca gggggtggac aaactggagt ctggagtggg cagagccccc aggaagccca 12841 ggagggcatc cataccagct cgcacccccc tgttctaagg gggaggggaa gtccctggga 12901 ggccccttct ctccctgcca gaggggaagg agggtacagc tgggctgggg aggacctgac 12961 cctactccct tgccctagat agtttattat tattattatt ttggggtctc ttttgtaaat 13021 taaacataaa acaattgctt ctctgcttgg attttgtacc tgggctaaag tgtatgcgag 13081 attccagaat ttcccatggg gtttgagctc agctctgtct ctatgaaacc ctctgctggg 13141 gacttctcgt gtaagatgta ttcaccaaga cttgggggcc cagagctgcc cactctggag 13201 ctgttctgct ggggcaggag ccaccccagt gggagtggag ggacccacat tccatcccaa 13261 caccaaggac gttgctttct ccgtattggt tgcaagggag tcaggccccg ccttgtgtga 13321 gggacagggc tcattgtgtt tctgccactt gcccctgtcc tcccagctgg accagtgcca 13381 gcctccttgc caacccaagg gagaaacgtt tgatcttccc tgctgttccc acaggcaagg 13441 gtgtctcctc cagtccctgc tcccaagggc agtggacacc agccacatac acctctcatc 13501 agggtcacag ccctccctcg gatctggtgg tgggaggtgg gcaggggatt gggatctgaa 13561 tcgtgaggtt cagtccctgt cccattctag agatgcctga tggggctggg aggtggcatc 13621 ttgggcctgg gagcaggaac atcttctaag gtctgctccg tcctggctgt gggagagtat 13681 ggctagtctc tggctgtccc tgaggaaaga aggagcagct ggagttgtca gaccagctct 13741 gagctgcttc actagccctg cctctgacgg gagtgcattc tgcctctccc ctcagccagc 13801 aaccccacac ctggggtccc gcccatgtcc gtgatgactc accacgcagc cctgcttcag 13861 gaggctctgg ctctggggag acagactcaa gaagccgttg agtggtgagg gcagccagca 13921 gcaggggagc aggggtggtg cccttggaga tgcttcccac attgggttct aaaaggtgag 13981 ggcggcagct cgatgctcag gaacaaatcc tgggcagagg ctggaggaag ttggaggtgg 14041 tgaaggtgaa cagttcaggg ctggggtgtg gcttagtggc tgctgtgggc agaggcagga 14101 ggggcagccc tagagcagca ccaggcctga cttcacctaa tactgactta ctcattcagt 14161 aaacgctgag cacacagtgt gcctgtgctg ggctaggtgc tggggatgca tcgatgatgc 14221 atctatgaac cacaccaaca aagacccccg tccttgtggg agtgacaaaa tagaggtaga 14281 tgcacactag atgcaaagac cgagatgcca gatgggagaa acaagcccgg ccaagccagt 14341 gccaacccag gctgtaaaga tatccttccc tcacctcttt cgccggtgat gggtagagaa 14401 tggagaggag agagtgaggt ggtaacactg gaaaggtgac gtggtaaatt cacacagtga 14461 aggaggaggt gcacatacaa gtggaggggt cccaggtgct catggccatg gtggggggag 14521 gccaggctgg agagaagatt caggtgttgg tcaataaaat gatgggccgg gtgcggtggc 14581 tcacccctgt aatcccagca ctttgggagg ccaaggcagg tggatcactt gaggccagga 14641 gtttgagacc agcctggcca acatgctgaa accctgtctc tactaaaaat acaaaaatta 14701 gctgggcatc cgggcctggt ggttcacgcc tctaatccca gcactttggg aggccgaggc 14761 ggacagatca caaggtcagg aattagagac cagcctgacc aacgtggtga accccatctc 14821 tactgaaaat aaaaaattag ccgggtatgg tggtgcctgc ttgtaatccc agctactcag 14881 aaggctgagg caggagaatc gcttgaaccc gggaggtgga ggttgcagtg agccaagatc 14941 acaccactgc gctccagcct gggcgacaga gtgagactct gtctcaaaaa aaaaaaaaaa 15001 aattagctgg gtttgttggt gcacacctgt agtcctagct gcttgggcag ctgaaacaca 15061 agaatcactt gaccccagga ggttgcagtg agcggagaat gggccactga actccagcct 15121 gggtgacaga gcaagactct gtctcaaaaa aaaagttgcc acaataagat gatggcactt 15181 agcaaaagga cagcaggaaa ggagaaagga gctgtggcct gtcccccata cctgtgactt 15241 cctgcacctg tgacctcccc cacacagatg tgacccatgc atacctctga gctaccccac 15301 ccacacccaa cacacagagg tgatccagac cacaccactg tgacctttcc cacctagcta 15361 tgacccctac atacctgtaa cccctcccat gtgtgacccc atacagctgt taccgtctcc 15421 aaataactga ctcctccgca actgtgacct ctcctgcata gctgtgacca gtacccatcg 15481 agctgtgacc ccctccatat aaacctttat gaacctcaca tttcttcccc cggaggacct 15541 aaaagatggt cctagggaaa gtgcaactca gggagagcca gcactgtcct gtggggaaag 15601 cggggagtgg gtaggaataa ggtaatgggg agtgatgcgg ctcagagatc tgaagtgttg 15661 aacaaccgga gacagcaggc gggactggac catcaagctg atggacccca gacatatgcg 15721 cacacacaca ccatttgagc caatcctact ggaccagccc agtaggagca ggcctgtggg 15781 cagggccctg gggtggagga ggatttcagg caccctcaga ttcttcctat ggttccacca 15841 ttagagcttc tcccttggac tgaggccagc tggagaatgc tggcctggtg gttacctgga 15901 ttgggccaac tgcagcacta ggtttctggg ggcaccagga ggtggcatga atccttgagg 15961 aggggggcca ggggttgcct tgatgttctg ttccctcagc cctcaagagt tgtgccaggc 16021 tctctgggcc tacatggtca tcactctggg accctgcttc ctgaactgga tcagcagtca 16081 ccacttctgg ggcttgatct tgtgtacaag tctcatgctg tccccagcat cctgggccct 16141 tctttccctc agccctgcca gcttttctat cccaggcagc ctgtaggacc caacccagat 16201 ggtagttaac tgtgagtttc gttgcttcag tggccaccgt gccaggggtg gcctggctac 16261 agtggctcag tctcagactt gactcagatc caggtagcac tttcacttgg gcagggacat 16321 ggtgtttcct tcaacctact ctggcaggca aggctgggcc caagcaccta gcaggtgtca 16381 ataaccagct tctgaatcag ctgtgaaccc agaaaatctg acaaaggtct cagttaactt 16441 agaaagttta ttttgccgag gttgaggact tgctggtgac acagcctcag gaagtcctga 16501 ggacatgtgc ccagggcggt tggggcacag cttggtttta tagattttag ggagacatga 16561 gacattaatc aatatgtaag aagtacatta gttccagaaa gaaaggtgga gactgctcaa 16621 atcaaggctc ccaggctcaa agcactgggg gcttccaggt cacagatagg tgagagacag 16681 atggttgcat tcttttgagt ttctggtaag tctttccaaa ggaggcaatc agaatatgca 16741 tctatctctg tgagcaaaag gatgacttga atagaatggg aggcagattt gtcctgagca 16801 gttcccagct tgaagaggcc caagatactt tcctttcaca tttaccccat tttctttttc 16861 aaaatctttt ggagaaagca ttttgcaaga aaatgagtat ctggtctcag gtttcatctg 16921 atctctcatt gctagataag taggtccgga aagctcattt ttagcaggtt gtaaagtctc 16981 atgcagtgtg aagagaaaat agggagaagg aaggaagaga aaaaaaaaac agcaaaagaa 17041 caatcccagc cctggcgggg tggctcatgc ctgtaatccc aacaatttgg gaggctgagg 17101 cgggtggatc acctgcggtt gggagttcga gaacagcctg accaacatgg agaaactctg 17161 tctctactga aaatacaaaa aactagccag gcatggtggc tcatgcctgt aatcccagct 17221 actcaggagg ctgagacagg agaatcactt gaacccagga ggcggaggtt gcagcaacct 17281 gacattgcgc cattgcactc cagcctgggc aacaagagtg aaactccatc tcaaaaaaaa 17341 aaaaaaaaaa aatcctggga aaatataggc cacattactc tgaagtccat acattggtag 17401 gcaggtatga aagtggctta tgtatgtaca taaacaggtt actgttactt tcttctgaag 17461 tgtaagttgt ctgactttag ttgacacgct tttaagaaac cacagctagg gccgggcgca 17521 gtgactaacg cctgtaatcc cagcacttcg agaggccgag gcggccggat cacaaggtca 17581 ggagttcgag accagcctgg tcaatatggt gaaaccctgt ctctactaaa aatacaaaaa 17641 ttagccaggc atggtagcaa gcgcctgtag tcccagctac tcgggaggct gaggcaggag 17701 aattgcttga acctgggagg cataagttgc agtgagctga gatcgcgcca ctgcactcca 17761 gcctgggtga cagagcgaga ctccatctca aaaaaaaaaa aaaaagaaac cacagtggcc 17821 gattgcagtg gctccccctc cccctccccc tcccctcccc ctccccctcc ccctctccct 17881 ccacggtctc cctctgatgc cgagcggaag ctggactgta ctgctgccat ctcggctcac 17941 tgcaacctcc ctgcctgatt ctcctgcctc agcctgccga gtgcctgcga ttgcaggcac 18001 gcgccgccac gcctgactgg ttttcgtatt ttttgggtgg agacggggtt tcgctgtgtt 18061 ggccgggctg gtctccagct cctaaccgtg agtgatctgc cagcctcggc ctcccgaggt 18121 gctgggattg cagacggagt ctctcaatgg tgcccaggct ggagtgcagt ggcgtgatct 18181 cggctcgcta caacatccac ctcccagcag cctgccttgg cctcccaaag tgccgagatt 18241 gcagcctctg cccggccgcc accccgtctg ggaagtgagg agcgtctctg cctggccgcc 18301 catcgtctgg gatgtgagga gcccctctgc ctggctgccc agtctggaaa gtgaggagcg 18361 tctctgccca gccgccatcc catctaggaa gtgagcagcg cctcttcccg gccgccatcc 18421 catctaggaa gtgcggagcc tctctgcccg gccacccatc gtctgagatg tggggagcgc 18481 ctctgccctg tcgccccgtc cgggatgtga ggagcgtctc tccctggccg ccccgtctga 18541 gaagtgagga gcccctccgc ccagcagccg ccctgtctga gaagtgagga gcccctccgc 18601 ccggcagcca ccctgtctgg gaagtgagga gggtctccgc ccggcagcca ccccgtccag 18661 gagggagatg ggggggtcaa ccccccgcct ggccagccgc cccgtcctgg agggaggtgg 18721 gggggtcagc cccccgcccg gccagccgcc ccgtccggga gggaggtggg ggggtcagcc 18781 ccccgcccgg ccagccgccc cgtccgggag ggaggtgggg gggacagcct cccacccggc 18841 cagccgcccc gtccgggaga tgaggggcgc ctctgcccgg ccgcccctac tgggaagtga 18901 ggagcccctc tggccggcca gccaccccat ccgtgagggg aggggggatc agccccccgc 18961 ccggccagcc gccccgtccg ggagggaggt gggggggtca gccccccgcc tggccagccg 19021 ccccatccgg gaggtgaggg gcgcctctgc ccggccaccc ctactgggaa gtgaggagcc 19081 cctctgccca gccagccgcc ccgtccgtga gggaggtggg gggatcagcc ccacgcccgg 19141 cctgccgccc cgtccgggag ggaggttggg gggtcagccc cccgcccggc cagccgcccc 19201 atccgggagg gaggtggggg ggtcagcccc ccgcccggcc agccgccccg tccgggaggg 19261 aggtgggggg gtcagccccc cgcctggcta gccgccccat cccggaggtg aggggcgcct 19321 ctgcccggcc gcccctactg ggaagtgagg agcccctctg cccggccagc cgccccgtcc 19381 gggagggagg tggggggggg tcagcccccc gcctggccag ccgccccatc cgggaggtga 19441 ggggcgcctc tgcccggccg cccctactgg gaagtgagga gcccctctgc ccggccagcc 19501 gccccgtccg ggagggaggt gggggggtca gccccccgcc cggccagccg ccccgtccgg 19561 gaaggaggtt ggggggtcag ccccccgccc cggccgggag ggagttgggg gggtcagccc 19621 cccgcccggc cagccacccc gtccgggagg tgaggggcgc ctctgcccgg ctgcccctac 19681 tgggaagtga ggagcccctc tgcccggcca ccaccccgtc tgggaggtgt acccaacagc 19741 tcattgagaa cgggccatga tgacaatggc ggttttgtgg actagaaagg tgggaaaggt 19801 ggggaaaaga ttgagaaatc ggatggttgc catgtctgtg tagaaagatg tagacgtggg 19861 agacttttca ttttgttcta tactaagaaa aattcttctg ccttgggatc ctgttgatct 19921 gtgaccttac ccccaaccct gtgctctctg aaacatgtgc tgtgtccact cagggttaaa 19981 tggattaagg gcggtgcaag atgtgctttg ttaaacagac gcttgaaggc agcatgctcg 20041 ttaagagtca tcaccactcc ctaatctcaa gtacccaggg acacaaacac tgcggaaggc 20101 cgtggggtcc tctgcctagg aaaaccagag acctttgttc acttgtttat ctgctgacct 20161 tccctccact attgtcctgt gaccctgcca aatccccctc tgcgagaaac acccaagaat 20221 gatcaataaa aaaataataa ttaaaaaaaa aaaagaaaaa aaaaagaaaa gaaaccacag 20281 tggccgattg cagtggctca ctcctgtaat cccagcactt tgggaggccg aggcgggcag 20341 atcacgaggt caggagatcg agaccattct ggctaacacg gtgaaacccc atctctatta 20401 aaaatacaaa aaaaaaatag ccgggcgtgg tggtggttgc ctgtagtccc agctactcag 20461 gaggctgagg caggagaatg gcgtgaacct gagaggcgga ggttcagtga gctgagaccc 20521 tgccactgca ctccagcctg agtgacagag cgggactctg tctcaaaaaa aaaaaaaaaa 20581 acacacagct tagttttcag tgactccaaa ttaggaaaat gaaaaaaaag aaggaaaaaa 20641 attgaaaact taatttgaag acttgtagcc aagaaaaatt cattccaaac tgtagaaaat 20701 cataaaaatt ggaagcaaac aaacaaaaaa acagttaaga ctagaatcta acaacaggtg 20761 aacattcgtt ttgaaacatg attttattct ctctccaatt tcccatttta ctaaagacaa 20821 atcatggtat gacttgtttg cttattatac ttggcctaaa tatttgtata cagtgcagca 20881 agaataattt attttgtccc ataagaggaa tctcaggtaa gactttttaa agccctgccc 20941 agccatggat ttgtgccatc aaatacccat gagttggatg gaatttcctc tcctttcaag 21001 ttccaagata aacataaacc tggggcctct gtgcctgtca gaaagtgaca tccttttttt 21061 tttgatcagg ctagtcttga attcctgacc tcaggtgatc ctcccgcgtt ggcctcccaa 21121 agtgctggga ttacaggcat gagccaccac tcctggcccc agaaagtgac attctttact 21181 taccacaggt aagaaatcct gtacagggac tgtgtacaca aaatgtgagg ccagttttcc 21241 caagggcttt attggctcca taagttaagt ttgattcctt aaacgaaagc acaccattcc 21301 tgtcaaagtc ttggtcaaat aatcaattcc tccaattgtg tcctgttaca aatgaaaaca 21361 gattcttttt tttttttttt tttttggtgg agtctggctc tgttgcccag gctggagtgc 21421 aatggtgcaa tctcagctca ctgccacctc tgcttcccag gttcaagcga ttctactgcc 21481 ttagcctctg gagtagctgg gattacaggc atgtgccacc atgcccagct aatttttgtg 21541 tttttagtag agatggggtt ttgccatgtt ggccaggctg gtctcgaact cacaacctca 21601 ggtgatttgc ccaccttggc ctcccaaagt gctgggatta caggagtgag ccaccatgcc 21661 aggcaaaaac agattcttat ggcacttatg caaataactg tattgccata agttaagaat 21721 actcacaaat agtttccaaa ttctggagaa atcaggtaga gaaacaaata tgctccaaat 21781 tttgttcata ggactgtact aaattgttaa aagctgttgg ccaggcacgg tggctcacgc 21841 ctgtaatccc agcattttgg gaggccgagg caggcagatc acgaggtcag gagttcaaga 21901 ccatcctgac caacatgctg aaactctgtc tctactaaaa atacaaaaat tagccaggtg 21961 tagtggtgtg cacctgtaat cccagctact cgggaggcta ggggaggaga atcgcttgaa 22021 cccaggagcc agaggttgca gtgagccgag atcctgccat tgcactccag cctgggtgac 22081 agagtgagac ttcgtctaaa aaaaaaaaaa aaggccaggc acggtggctc atgcctgtaa 22141 tcccagcact ttgggaggcc aaggtgggca gatcacgagg tcaggagatc gagaccacgg 22201 tgaaaccctg tctctaccaa aaatacaaaa aattagccgg gcgcggtggt gggcgcctgt 22261 agccccagct actcaggagg ctgaggcagg agaatggcat gaacctggga ggtggagctt 22321 gcagtgagct gagatcgcgc cactgcactc cagcctgggt gacagagcga gaatccatct 22381 caaaaaataa taataataat aatatataat acgacaacgt ggcctgacat ggtggctcac 22441 gcctgtaatc ctagcatttt gggaggccaa ggcaggcgga tcacctgagg cgaggagttt 22501 gagaccagcc aagccaatat gctgaaaccc catgtctacc aaaaatacaa aaattagcca 22561 ggtgtggtgg cacatgcctg taaacccagc tactggggag gctgaggcag gagaatcact 22621 tgaacccagg aggcagaggt tgcagtgagc caagattgca ccactgcact tcagcctggg 22681 caacaaagca agactctacc ctgaagaaaa aagtatacga caacttgatt atataaaagt 22741 ttttgggttt ttggccgggc acggtggctc acacctgtaa tctcagcact tctggaggcc 22801 gaggcaggtg gatcacgagg tcaggagatc aagaccatcc tggctaacat ggtgaaaccc 22861 tgtctctact aaaaacacaa aaaaattagc caggcatgat ggcgggcccc tgtagtccca 22921 gctactcaga aggctgaggc aggagaatgg cgtgaacccg ggaggcaagc ttgcagtgag 22981 ccgagattgc gccactgcac tccagcctgg gtgatagagc gagactccgt ctcaaaaaaa 23041 aaaaaaaagt ttttgggttt tttgtgactt acactgactg ttcatgacat ggttggactt 23101 tccaatttgt tctgaacatc cctccttttt atttattttg ggtttttttt tttttttttt 23161 ttttttcaga tggagtcttg ctctgtcacc caggctggag tgcagtggca tgatctccgc 23221 tcactgcaag ctccgcctcc cgggttcacg ccattctcct gcctcagcct cccgagtagc 23281 tgggactaca ggcgcccacc accacgccca gcttaatttt ttgtattttt agtagagaca 23341 gggtatcacc atgttagcca ggatggtctt gatctcctga ccttgtgatc cgcccgcctc 23401 ggcctcccaa agtgctggga ttacaagcgt gagccactgc gtccggccct attttggttt 23461 attttttatt ttttattttt ttcttgagac tgagtcttgc tgtcacccag gctggagtgc 23521 agtggcatga tctcagctca ttgcaacctc tgcctcccag gtcaagcgat tctcctgcct 23581 cagcctccca agtagctggg attataggca cacaccacta agcctggcta atttttgtat 23641 ttttagtaga aacagggttt cgccatgttg gccaggttgg tttcaaactc ctgacctcaa 23701 atgatcctcc tgcctcagtc tcccaaagtg ctaggattat gtgtgagcca ctgcacccgg 23761 cttatttttt ttttttttta gcagagtctt gctctgtcac ccaggctaga gggcagtggc 23821 gcaatctcag ctcactgcaa cctccgcctt ccaagtttaa gtgattctcc tgcctcagcc 23881 tcctgaatag ctgggattat aggtgtgggc caccatgcct ggataatttt tgtgtttttg 23941 ttgttgttgt tgttgttgtt gagatggagt cttgctctgt cactgaggct ggagtaacag 24001 tagcatgatc tcggctcact gcaaccccca cctcccaggt tgaagcaatt ctcctgcctc 24061 agcctcctga gtagctggga ttacaggcac ccaccaccat gccgggctaa tttttgtatt 24121 tttagtagag ataggtttca ccatgttggc caagctggtc tcaaactccc gacctcaggt 24181 gacccgccca cctcggcctc ccaaagtgtt gggattacgg gtgtgagcca ccactcccag 24241 ccaaaaaaaa tttttttaat cttattacca tatttcagct agaacaaaat gctgctaaag 24301 taacaatgat cacacaaagt atatgatttc tgagtgctgt aagtgtaagc agaagttaac 24361 accagctggt tgttaaatgc taactttagt catttaaagg aaattgcaag gcagaatccc 24421 aagccagttt cttacctagt gatgggtctc aggctgtaga ctgctctcta ccatcccaga 24481 agcaggaaaa aaaaactaat tttccctgtt ggaagcgagc tcaaactcca taaaggagtt 24541 acctgccttc catcgtcatg gaagcagaaa aacttgcttt ccttttggga gcaagtaaaa 24601 ctccaaaaaa aaaaaaaaaa aaaaaaagag ttgtacagca aaataaactt tagatctcaa 24661 ccaattgaga aatcaaggat tctctggagg gggtgctccc agacctcagc aaattgtcct 24721 attggtttga gccataaagt tagctcctgc tggtaccaag cacagatagg caattagtca 24781 aaagtcaagg acatctccac acagaatcca tccatagtta ccaaatgtga acccagaaaa 24841 tctgagacag ctctcagtta atttaaaaag tttattttgc tgaggttgaa gatgcatcca 24901 agacacagcc tcaggaagtc ctgatgacat gtgcccaagg tgttccgggc acagcttggt 24961 tttatacatt ttagggagat atgagacatc tatcaatacg taagaagtac cttagccaag 25021 cgtggtggct caagcctgta atcccagcac tttgggaggc caaggtgggc agatcacaag 25081 gtcaggagtt caagaccagc ctggccaata tggtgaaacc ctgtctctac taaacatata 25141 aaaattagct gggtgtggtg gcacgcgcct gtagtcccag ctactctgga ggctgaggca 25201 ggagaatcgc ttgaacccgg gagccagagg ttgcagtgag ccgagatcat gccattgcac 25261 tccagcctgg gcgacagagt gagacttcat ctaaaaaaaa attaggccag gcgcggtggc 25321 tcacacctgt aatcccagca ctttgggaag ctgaggcagg cagatcacaa ggtcaggaga 25381 tcgagaccat cctggctaac acggtgaaac ctcgtctcta ctaaaaatac aaaaaattag 25441 ccaggcgtgg tggcgggcgc ctgtagtccc agctactcgg gaggctgagg caggagaatg 25501 gcatgaaccc ggggggcaga gcttgcagtg agctgagatc gcaccactgc actccagcct 25561 gggcgacaga gcgagactcc gtctcaaaaa ataaaaataa aaaaaaaaca gaagtacata 25621 agctctgtcc agagaggtgg agacagctca aagcacggcc ccccactggg ggcttccagg 25681 tcacaggcag gtgacagaca gatggttgca ttcttttgag tttctggtag tctttccaaa 25741 agaggcaatg gaggcaatca taatatgcat ctatgtctgt aagcaaaagg atgacttgaa 25801 tagaatggga ggcagatttg tcctgagcgg ttcccagctt gaaggggccc aagatatttt 25861 cctttcacat attgaacaag tgaaaccaca aatgaatgaa caagcatgtg ctcactgaca 25921 gctcatatgc atggtcacac aaacacagga tctcatgctc acaagacagc agggctcaca 25981 cacacccagt cccaggcact gggttccaca ctcacaccta cttgccactg gaatttgtgc 26041 agaggctcag atggtcacta aatggggctt cacaaagaca cagtcactaa cccagcacat 26101 atacaaaggg tgatgacaca gccctgactt catgaacaca gtgcctttca gaaacaggtc 26161 cctacaacag ggatgactta tccactaagc acaatagacc caaggcctag ggccacagta 26221 ctttcagggg cccacaaaat gtttttttaa tcagaagaag aatataaatg taaatatatt 26281 aataattaat atataatagt gtaacccaga catgacctcc tgggctcaag cactcctcct 26341 gcctcagtgt cccaagtagc tgggactaca ggtacatacc accatgccca gctaattttt 26401 aatttttttg tagagatggg gtcttgctac attgcctagg ctggtctcga actcctgggc 26461 tcaagcaatt ctcccacctc agcctcccaa agtgctaaga ctatagccat atatatatat 26521 atatatatat atatatatat atatatatat tttttttttt tttttttttt ttgagatgga 26581 gtcgctctct gtcacccagg ctggagtata gtagcacagt ggagcagtct tggctcactg 26641 caacctctgc ctcccgggtt caagggattg tcctgcctca gcctcccgag tagctgggat 26701 tacaggcgcc tgccaccacg cccggctaat tttttgtatt tttagtagag aaggggtttc 26761 accatgttag ccgggatggt ctcgatctcc tgacctcgtg atctgccacc tcggcttccc 26821 aaagtgctag gattataggc gtgagcctcc gcacctggca tgcctggcca tattttaaaa 26881 tatttttttt aaggaggaaa agatccacga aagtaaagtg cctagggccc atcaatgtca 26941 gatgcagcca cgtggacaaa gaacactccc tgagcacaag acacgaaaga acccggtctc 27001 cagtgtcaca aacaggcctg cccagcaggg tctgcccctg cccacctcct ccagcctagg 27061 taaggccaca aagccctgcc ctagtgctga cagcgtggga ggggccactg ttggcccagc 27121 cccaccccca accaagatcc ctttgccagg gattcagagg tagccctgat gcctggctgg 27181 cctggcctcc taatccaagg cccgcccctt ggcccgcccc tggcctgggt ggggtgaccc 27241 cctacaaaag ctcagaattt ccagcagcgg ctccttcctc caggagtctc tggtgcagct 27301 ggggtggaat ctggccaggc cctgcttagg cccccatcct ggggtcagga aatttggagg 27361 ataaggccct tcagccccaa ggtcagcagg gacgagcggg cagactggcg ggtgtacagg 27421 agggctgggt tgacctgtcc ttggtcactg aggccattgg atcttcctcc agtggctgcc 27481 aggatttctg gtggaagaga caggaaggcc tccccccctt ggtcgggtca gcctgggggc 27541 tgagggcctg gctgtcagcc actcttccca gaacatatgt catggcctca gtggctcatg 27601 gggaagcagg ggtgggcgag cttaggctag agcaagtcct gtgggagatg gcagaggcct 27661 ggtctgagag gcaactcgga tgtgccctcc agtggccatg ctcccctcca tgcgtctccc 27721 ctgccctcct ggagccctgc aggtcaatgt ttaacagaaa ccagagcagc ggtggattaa 27781 tgcgcaaggg ctcagccccc cagccctgag cagtggggga atcggagact ttgcaacctg 27841 ttctcagctc tgcctcccct ggccaggttg tcctcgacca gtcccgtgcc atggcagccc 27901 acctgcttcc catctgcgcc ctcttcctga ccttactcga tatggcccaa ggctttaggg 27961 gccccttgct acccaaccgg cccttcacca ccgtctggaa tgcaaacacc cagtggtgcc 28021 tggagaggca cggtgtggac gtggatgtca gtgtcttcga tgtggtagcc aacccagggc 28081 agaccttccg cggccctgac atgacaattt tctatagctc ccagctgggc acctacccct 28141 actacacgcc cactggggag cctgtgtttg gtggtctgcc ccagaatgcc agcctgattg 28201 cccacctggc ccgcacattc caggacatcc tggctgccat acctgctcct gacttctcag 28261 ggctggcagt catcgactgg gaggcatggc gcccacgctg ggccttcaac tgggacacca 28321 aggacattta ccggcagcgc tcacgggcac tggtacaggc acagcaccct gattggccag 28381 ctcctcaggt ggaggcagta gcccaggacc agttccaggg agctgcacgg gcctggatgg 28441 caggcaccct ccagctgggg cgggcactgc gtcctcgcgg cctctggggc ttctatggct 28501 tccctgactg ctacaactat gactttctaa gccccaacta caccggccag tgcccatcag 28561 gcatccgtgc ccaaaatgac cagctagggt ggctgtgggg ccagagccgt gccctctatc 28621 ccagcatcta catgcccgca gtgctggagg gcacagggaa gtcacagatg tatgtgcaac 28681 accgtgtggc cgaggcattc cgtgtggctg tggctgctgg tgaccccaat ctgccggtgc 28741 tgccctatgt ccagatcttc tatgacacga caaaccactt tctgcccctg gtgagtcttc 28801 tgtaacctgc ccaccttgtc aacctgcctt gtcagatatt aggtccagcc ttggggtact 28861 ttagctttgg gacattttct tcgttttttt ttttttgaga cagagtctca ctctgtcacc 28921 caggctggag tgcagtggca cgatcttggc tcactgcaag ctctgcctcc cgggttcatg 28981 ccattctcct gcctcagcct ccgagtagct gggatacagg cgcccgccac cacacccagc 29041 taattttttg tattttttag tagagtcagg gtttcactgt gttagccagg atggtctcaa 29101 tctcctgacc tcgtgatcca tccgcctcgg cctcccaaag tgctgggatt acaggcgtga 29161 gccaccgcgc ccggccccca accttgggac attttcatcc attcattcat cctttttttt 29221 tttttttttt tgagacggag tcttgctctg tcacccaggc tggagtgcag gggcaagatc 29281 tcagctcctg caccctccac cttccggatt caag // LOCUS AC002481 28244 bp DNA PRI 21-AUG-1997 DEFINITION Human cosmid clone LUCA12 from 3p21.3, complete sequence. ACCESSION AC002481 NID g2340092 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 28244) AUTHORS Miller,N, Kramer,J, Elliott,G and Keppler,D. TITLE The sequence of H. sapiens cosmid clone LUCA12 JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 28244) AUTHORS Waterston,R. TITLE Direct Submission JOURNAL Submitted (21-AUG-1997) Department of Genetics, Washington University, 4444 Forest Park Avenue, St. Louis, Missouri 63108, USA COMMENT SUBMITTED BY: Genome Sequencing Center Department of Genetics Washington University St. Louis MO 63108, USA http://genome.wustl.edu/gsc mailto:sapiens@watson.wustl.edu NOTICE: This sequence may not represent the entire insert of this clone. It may be shorter because we only sequence overlapping clone sections once, or longer because we provide a small overlap between neighboring data submissions. This sequence was finished as follows unless otherwise noted: all regions were double stranded or sequenced with an alternate chemistry; an attempt was made to resolve all sequencing problems, such as compressions and repeats; all regions were covered by sequence from more than one subclone; and the assembly was confirmed by restriction digest. MAPPING INFORMATION: Clones from the 3p21.3 region were contributed by Michael Lerman, M.D., Ph.D., at the National Cancer Institute, FCRDC, Building 560, Room 12-64, Frederick MD 21702 USA (mailto:lerman@ncifcrf.gov) SOURCE INFORMATION: This clone is from a chromosome 3 specific library described by Wei et al., Cancer Research 56:1487-92 (1996). VECTOR: pWE15 NOTE: this clone was originally named cos14. NEIGHBORING SEQUENCE INFORMATION: The clone sequenced to the right is LUCA13, 200 bp overlap. Actual start of this clone is at base position 1 of LUCA12; actual end is at 8943 of LUCA13. FEATURES Location/Qualifiers source 1..28244 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="3" /clone="cos14" /clone_lib="LLNL3" /map="3p21.3" gene <2..3548 /gene="WUGSC:H_LUCA12.1" CDS join(<2..316,455..678,3344..3548) /gene="WUGSC:H_LUCA12.1" /note="match to U09584 (PID:g1209020)" /codon_start=1 /product="PL6 protein, unknown function but deleted in small cell lung cancer" /db_xref="PID:g2340093" /translation="IWTLATHGLMEQHVWDVAISLTTVVVAGRLLEPLWGALELLIFF SVVNVSVGLLGAFAYLLTYMASFNLVYLFTVRIHGALGFLGGVLVALKQTMGDCVVLR VPQRHSRGRGDMADHFAFATFFPEILQPVVGLLANLVHSLLVKVKICQKTVKRYDVGA PSSITISLPGTDPQDAERRRQLALKALNERLKRVEDQSIWPSMDDDEEESGAKVDSPL PSDKAPTPPGKGAAPESSLITFEAAPPTL" misc_feature 3488..3815 /note="match to EST AA323108 (NID:g1975433)" misc_feature 3515..3904 /note="match to EST R46303 (NID:g805700) yj53e02.r1" misc_feature 3708..4135 /note="match to EST AA226710 (NID:g1848035) nc27g09.r1" misc_feature complement(3833..4140) /note="match to EST H99532 (NID:g1124200) yx29d12.s1" misc_feature 4816..5050 /note="match to EST W86852 (NID:g1400581) zh59d04.s1" misc_feature 4822..5244 /note="match to EST R15090 (NID:g769363) yf48d05.s1" misc_feature 4822..5228 /note="match to EST R15090 (NID:g769363) yf48d05.s1" misc_feature 4824..5316 /note="match to EST AA447529 (NID:g2161199) zw81b11.s1" misc_feature 4825..5158 /note="match to EST AA421268 (NID:g2100093) zu06b06.s1" misc_feature complement(4863..5114) /note="match to EST F20845 (NID:g2060021)" gene complement(5147..7457) /gene="WUGSC:H_LUCA12.2" CDS complement(join(5147..5650,6845..6882,7331..7457)) /gene="WUGSC:H_LUCA12.2" /note="match to AA456453 (NID:g2179029)" /codon_start=1 /evidence=not_experimental /db_xref="PID:g2340094" /translation="MALSAETESHIYRALRTASGAAAHLVALGFTIFVAVLARPGSSL FSWHPVLMSLAFSFLMTEALLVFSPESSLLHSLSRKGRARCHWVLQLLALLCALLGLG LVILHKEQLGKAHLVTRHGQAGLLAVLWAGLQCSGGVGLLYPKLLPRWPLAKLKLYHA TSGLVGYLLGSASLLLGMCSLWFTASVTGAAWYLAVLCPVLTSLVIMNQVSNAYLYRK RIQP" misc_feature complement(5426..5650) /gene="WUGSC:H_LUCA12.2" /note="match to EST AA456453 (NID:g2179029) zx74f03.r1" misc_feature complement(5492..5650) /gene="WUGSC:H_LUCA12.2" /note="match to EST N79667 (NID:g1242368) yz81g06.r1" misc_feature complement(5592..5650) /gene="WUGSC:H_LUCA12.2" /note="match to EST R92257 (NID:g959797) yq06e02.r1" repeat_region complement(6196..6367) /rpt_family="ALU" repeat_region complement(6395..6692) /rpt_family="ALU" misc_feature complement(6845..6887) /gene="WUGSC:H_LUCA12.2" /note="match to EST N79667 (NID:g1242368) yz81g06.r1" misc_feature complement(6845..6887) /gene="WUGSC:H_LUCA12.2" /note="match to EST AA456453 (NID:g2179029) zx74f03.r1" misc_feature complement(7330..7552) /note="match to EST N79667 (NID:g1242368) yz81g06.r1" misc_feature complement(7330..7485) /note="match to EST AA456453 (NID:g2179029) zx74f03.r1" misc_feature complement(7331..7676) /note="match to EST R92257 (NID:g959797) yq06e02.r1" misc_feature complement(7905..7962) /note="match to EST AA456453 (NID:g2179029) zx74f03.r1" misc_feature complement(7907..7948) /note="match to EST N79667 (NID:g1242368) yz81g06.r1" misc_feature complement(7995..8200) /note="match to EST AA421315 (NID:g2100191) zu06b06.r1" misc_feature 8113..8318 /note="match to EST AA233630 (NID:g1856833) zr44c02.r1" misc_feature 8125..8465 /note="match to EST AA302916 (NID:g1955267)" gene 8239..11285 /gene="WUGSC:H_LUCA12.3" CDS join(8239..8316,8869..8960,9058..9226,9397..9505, 9881..10017,10109..10206,10328..10364,10481..10574, 10650..10767,10990..11132,11218..11285) /gene="WUGSC:H_LUCA12.3" /note="similar to nitrogen permease regulator; similar to P39923 (PID:g730170), match to AA233630 (NID:g1856833) and AA399402 (NID:g2053147)" /codon_start=1 /evidence=not_experimental /db_xref="PID:g2340095" /translation="MGSGCRIECIFFSEFHPTLGPKITYQVPEDFISRELFDTVQVYI ITKPELQNKLITVTAMEKKLIGCPVCIEHKKYSRNALLFNLGFVCDAQAKTCALEPIV KKLAGYLTTLELESSFVSMEESKQKLVPIMTILLEELNASGRCTLPIDESNTIHLKVI EQRPDPPVAQEYDVPVFTKDKEDFFNSQWDLTTQQILPYIDGFRHIQKISAEADVELN LVRIAIQNLLYYGVVTLVSILQYSNVYCPTPKVQDLVDDKSLQEACLSYVTKQGHKRA SLRDVFQLYCSLSPGTTVRDLIGRHPQQLQHVDERKLIQFGLMKNLIRRLQKYPVRVT REEQSHPARLYTGCHSYDEICCKTGMSYHELDERLENDPNIIICWK" misc_feature 8866..8960 /gene="WUGSC:H_LUCA12.3" /note="match to EST AA233630 (NID:g1856833) zr44c02.r1" misc_feature 9050..9226 /gene="WUGSC:H_LUCA12.3" /note="match to EST AA233630 (NID:g1856833) zr44c02.r1" misc_feature 9399..9465 /gene="WUGSC:H_LUCA12.3" /note="match to EST AA233630 (NID:g1856833) zr44c02.r1" misc_feature 9933..10017 /gene="WUGSC:H_LUCA12.3" /note="match to EST AA399402 (NID:g2053147) zt59h09.r1" misc_feature 10109..10208 /gene="WUGSC:H_LUCA12.3" /note="match to EST AA399402 (NID:g2053147) zt59h09.r1" misc_feature 10328..10364 /gene="WUGSC:H_LUCA12.3" /note="match to EST AA399402 (NID:g2053147) zt59h09.r1" misc_feature 10474..10575 /gene="WUGSC:H_LUCA12.3" /note="match to EST AA399402 (NID:g2053147) zt59h09.r1" misc_feature complement(10517..10575) /gene="WUGSC:H_LUCA12.3" /note="match to EST AA447620 (NID:g2161290) zw97b01.s1" misc_feature 10619..10990 /gene="WUGSC:H_LUCA12.3" /note="match to EST AA340305 (NID:g1992543)" misc_feature complement(10648..10768) /gene="WUGSC:H_LUCA12.3" /note="match to EST AA447620 (NID:g2161290) zw97b01.s1" misc_feature 10648..10768 /gene="WUGSC:H_LUCA12.3" /note="match to EST AA399402 (NID:g2053147) zt59h09.r1" misc_feature complement(10937..11133) /gene="WUGSC:H_LUCA12.3" /note="match to EST H09156 (NID:g873978) yl98e08.s1" misc_feature complement(10989..11133) /gene="WUGSC:H_LUCA12.3" /note="match to EST AA447620 (NID:g2161290) zw97b01.s1" misc_feature 10989..11133 /gene="WUGSC:H_LUCA12.3" /note="match to EST AA399402 (NID:g2053147) zt59h09.r1" misc_feature complement(11071..11133) /gene="WUGSC:H_LUCA12.3" /note="match to EST R41569 (NID:g816869) yf88e06.s1" misc_feature complement(11215..11401) /note="match to EST H09156 (NID:g873978) yl98e08.s1" misc_feature complement(11215..11399) /note="match to EST AA447620 (NID:g2161290) zw97b01.s1" misc_feature complement(11215..11399) /note="match to EST R41569 (NID:g816869) yf88e06.s1" misc_feature 11215..11245 /gene="WUGSC:H_LUCA12.3" /note="match to EST AA399402 (NID:g2053147) zt59h09.r1" repeat_region 11532..11718 /rpt_family="ALU" repeat_region 11754..12045 /rpt_family="ALU" repeat_region complement(12372..12661) /rpt_family="ALU" misc_feature 13193..13403 /note="match to EST W07770 (NID:g1281782) zb03g01.r1" gene 13312..17481 /gene="WUGSC:H_LUCA12.4" CDS join(13312..13403,13659..13767,15041..15157,15275..15328, 15447..15584,16378..16494,16751..16876,16960..17081, 17406..17481) /gene="WUGSC:H_LUCA12.4" /note="match to W07770 (NID:g1281782), AA383562 (NID:g2035879), and AA399396 (NID:g2053141)" /codon_start=1 /evidence=not_experimental /db_xref="PID:g2340096" /translation="MGDLELLLPGEAEVLVRGLRSFPLREMGSEGWNQQHENLEKLNM QAILDATVSQGEPIQELLVTHGKVPTLVEELIAVEMWKQKVFPVFCRVEDFKPQNTFP IYMVVHHEASIINLLETVFFHKEVCESAEDTVLDLVDYCHRKLTLLVAQSGCGGPPEG EGSQDSNPMQQKLSKLDGQVWIALYNLLLSPEAQARYCLTSFAKGRLLKLRAFLTDTL LDQLPNLAHLQSFLAHLTLTETQPPKKDLVLEQIPEIWERLERENRGKWQAIAKHQLQ HVFSPSEQDLRLQARRECQVKHWEKHGKTCVLAAQGDRAK" misc_feature 13657..13773 /gene="WUGSC:H_LUCA12.4" /note="match to EST W07770 (NID:g1281782) zb03g01.r1" repeat_region 14274..14565 /rpt_family="ALU" misc_feature 15038..15159 /gene="WUGSC:H_LUCA12.4" /note="match to EST W07770 (NID:g1281782) zb03g01.r1" misc_feature 15276..15329 /gene="WUGSC:H_LUCA12.4" /note="match to EST W07770 (NID:g1281782) zb03g01.r1" misc_feature 15440..15480 /gene="WUGSC:H_LUCA12.4" /note="match to EST W07770 (NID:g1281782) zb03g01.r1" misc_feature 16402..16494 /gene="WUGSC:H_LUCA12.4" /note="match to EST AA383562 (NID:g2035879)" misc_feature 16749..16876 /gene="WUGSC:H_LUCA12.4" /note="match to EST AA399396 (NID:g2053141) zt59f10.r1" misc_feature 16749..16876 /gene="WUGSC:H_LUCA12.4" /note="match to EST AA383562 (NID:g2035879)" misc_feature 16958..17083 /gene="WUGSC:H_LUCA12.4" /note="match to EST AA399396 (NID:g2053141) zt59f10.r1" misc_feature 16958..17083 /gene="WUGSC:H_LUCA12.4" /note="match to EST AA383562 (NID:g2035879)" misc_feature 17400..17602 /note="match to EST AA399396 (NID:g2053141) zt59f10.r1" misc_feature complement(17459..17774) /note="match to EST N80709 (NID:g1243410) zb03g01.s1" repeat_region 19223..19517 /rpt_family="ALU" repeat_region 20312..20586 /rpt_family="ALU" misc_feature 20693..22057 /note="CpG_island (%GC=67.9, o/e=0.90, #CpGs=139)" misc_feature 22400..22711 /note="match to EST AA361965 (NID:g2014286)" repeat_region 22825..23566 /rpt_family="ALU" repeat_region 23644..23933 /rpt_family="ALU" repeat_region 25112..25406 /rpt_family="ALU" repeat_region complement(25787..26078) /rpt_family="ALU" repeat_region complement(26090..26386) /rpt_family="ALU" gene <26737..>28203 /gene="WUGSC:H_LUCA12.5" CDS join(<26737..26841,27035..27332,27442..27551, 28164..>28203) /gene="WUGSC:H_LUCA12.5" /note="60% similarity to Z49912 (PID:g872093), match to Z41881 (NID:g564108)" /codon_start=1 /evidence=not_experimental /db_xref="PID:g2340097" /translation="DEPVEWETPDLSQAEIEQKIKEYNAQINSNLFMSLNKDGSYTGF IKVQLKLVRPVSVPSSKKPPSLQDARRGPGRGTSVRRRTSFYLPKDAVKHLHVLSRTR AREVIEALLRKFLVVDDPRKFALFERAERHGQVYLRKLLDDEQPLRLRLLAGPSDKAL SFVLKENDSGEWDAFSMPELHNFL" misc_feature 27260..27332 /gene="WUGSC:H_LUCA12.5" /note="match to EST Z41881 (NID:g564108)" misc_feature 27440..27557 /gene="WUGSC:H_LUCA12.5" /note="match to EST Z41881 (NID:g564108)" misc_feature 28166..28207 /note="match to EST Z41881 (NID:g564108)" BASE COUNT 6503 a 7604 c 8106 g 6031 t ORIGIN 1 gatctggacc ctggccaccc atgggctgat ggagcagcat gtgtgggacg tggccatcag 61 cctgacaacg gtggtggtgg ccgggcgttt gctggagccc ctctgggggg ccttggagct 121 gctcatcttc ttctcagtgg tgaatgtgtc tgtagggctg ctgggggcct tcgcctacct 181 cctcacctac atggcttcct tcaacctggt ctacctgttc actgtccgta tccacggcgc 241 cttgggcttc ctaggtggcg tcctggtggc actcaagcaa accatggggg actgtgtggt 301 cctgcgagtg ccccaggtgc gcgtcagtgt gatgcccatg ctgctgctgg cgctgctgct 361 cctgctgcgg ctcgccacgc tgctccagag cccggcgctg gcttcctatg gcttcgggct 421 gctctccagt tgggtatatc ttcgcttcta ccagcgccat agccggggcc gaggggacat 481 ggctgaccac tttgctttcg ccactttctt ccctgagatc ctgcagcctg tggtgggttt 541 gctggcgaac ttggtgcaca gcctcctggt gaaggtaaag atatgccaga agacggtgaa 601 gcgctacgat gtgggtgccc catcctccat caccatcagc ctgccaggca cagaccctca 661 agacgccgag cggagaaggt actgtgaaat tttccaaaat cttgtcaagc taggagcata 721 ctggtcgagt cacatgccat ctgttgaggt gttcgctgta cataggtgga gacctacagt 781 caggtattgg ggcacatccg gaaagacaaa atcagtcttt ggccttaagg aactcagcag 841 ttaggtgtct ggggtccacg aggcactttt ctcaccactc gggccactta ataagtctca 901 agaaatgcag tttcctttgg tgctgtgaag tcccagtgga gggacacaga accctggcct 961 tgcctcagaa tctcagaatc ctagacattg ggcctgtggg agccagaagc aacctgctgg 1021 tactttcccc tgtgaggaag ggacagggaa ggaggcctgg ggtcctcaaa aggtgatctg 1081 ggccactcca atttccaaat gctgtgtcct cttcctgggc tttaactcag atgctgtgct 1141 tctctgccac ttacttattg agagggaagc ttggggtatt accacattta cctctgtctc 1201 cagcaagcat gtgtcaccaa aaccagggac aggctacatg acccactgag ggagcctgct 1261 gacaccaggg agtggcgatg accctttcat tatctgtaaa ccaagctatc cacagcatca 1321 ggagtcccag ctgaattatt cactgaatgc actttggcca gggcaggagg gatcttttta 1381 aattcagctt actctctctc ccagatctgt ggagccagca gagtggtccc agccttcata 1441 agatccacag ctccatttcc aacattattc ctcactgttt tgaggcaatt actcgagtag 1501 taaaatctgg gacgggcttt gtcttttgca ttgccaaggc tcctgctttt gctcacctgt 1561 gaatggggtt gtggcaggaa gctctatgaa tttagctgca gttatccctg tatttactct 1621 ccactgggct atggaattga gtgtgggaca cactcctttc ttgaaggagt tccagctggg 1681 gaagggccac agatcactat ggcccaaggt atggtaggtg ttgaaaagta tatacagcat 1741 gctgcagggc ataattctgg gccctagagc ttgggtcttt ataggtgata agccctaagg 1801 acagaagtct ctcttaacct tagctggtat taaagatggg gccaggtgag ctgggcctgg 1861 taggaattga gctgtccctc agagccattc aggtggggcc aggccagtgg ggacccctcc 1921 tccactgaga ggtgttggaa tgttagggtt ggctggacat tcaaaaacct ctgcctaggc 1981 ccaagaaaat ggacaggggg gaagcaggtt ggtggggagg aagggagggt caagaggcgg 2041 gttcttggca tagccttatg gttgttagct tacatggggg tggcggttat tcctccagcc 2101 agtttgacat ctgactgcat ggccacgccc aactccaaag ggctaccagg agagggaggc 2161 agggtttcct atctgtctct ctggtccttg tgaggaagtg actagcgtaa cactagctct 2221 ctgccttggc aggctttaga tggaactggc aagggcatca tcaggatggc agcctgggta 2281 gcagcaggga aagactgtcc tgtctatacc aacttgtcat gtgtgagact gacgggcagg 2341 ggggaagcgt gctagggctt tctacacctg tgcccttgtt gcctctcttg tcgtgttccc 2401 tgacatggtg gagattttgg ttctcagctg gaacctgaaa ggctacacag gctgtcccca 2461 cattggtggc acagagaggc tgaggtatag gagtccacag tggtcactgt tgaaacggtt 2521 ccctggcagc tgcagcactc caccttctgg tgggtcaggg tcccaagcac atgtgtaaac 2581 cttcttctgt tggactttcc cacctgccct caaagtgggg cctctaaggg tccaccctgt 2641 ggaagcagta ggctctggcc tgtgcctacc catgtgagct gtttctcctt cttctcctct 2701 tgcaagggct ccgctaagtc cctgctctaa caagggcctt ggtggctttg ggggttcttc 2761 tgagagggag aatattagca tcagttagga ggggtagact cacccaagtt ggccctgctt 2821 cctgggctga gagccaaggg aggtgttgaa ttccagcctt ccccattgac ccttgccagg 2881 aggctcaaga tacccctaac caggagaggg ggagaattaa taacccttcc cttggatgtg 2941 atgccagcac ccaggcagaa actcctggaa actccagaac ttgccgttcc acccatctgc 3001 cctagggcac tagttccact gacccttaga taccaccctc agaaccattc ctggtggctg 3061 gcctcgtggg cagtctggca gacacccagc tgaccttccc atcccggagg tgcctcagct 3121 atggagggaa gattcttaga gcctttgggt tatcattctg atggcagagt gggtcctaaa 3181 ccaatctctg gggagtccca ggctctgcat ggtgaagggg aaacatagga ttctggggga 3241 ggagtctagg tggtacggtg aggagtcagc ctgctgcgga agggtagagg gcatgagcca 3301 aggccaaacc cagagtgacc tttcctctct cctctggccc caggcaactg gccctgaagg 3361 cactcaatga gcggctgaag agagtggaag accagtccat ctggcccagc atggatgatg 3421 atgaagagga gtctggggcc aaggtggaca gccccctgcc ctcagacaaa gctcccacac 3481 ccccagggaa gggggctgcc ccagaatcca gtctaatcac cttcgaggca gctcccccga 3541 cgctgtaact ccagaccacc ttgagtgtgg cacctcccct cccaagcccc ccttgacatc 3601 ctctcagcta ctccagggca cctgactgct ctgaggagag ggaagaaggc ctgctggggc 3661 tttccatggc cttctgctgt ttctcgccaa cactacccag gactcttgct acctggttcc 3721 aactccagac aaccactatg ccaggcccgg agcctctgag gcatcggcca gtccaggccc 3781 tcatctgagg taagaatgta catcagctgg cagccccaag caagtggctg cagggacact 3841 gatgccacag ctcctgggcc ggccctcaca tctgaaactg gttgccgaga gccctgagcc 3901 aaggcaagga tttgccaaaa atgttctggg ggcccagcaa atgcaggagc cgacctgggg 3961 ctgcacatcc ctgcccatcc ccagaaagac tgttcctgtc aggatttgtt tccctctgct 4021 gtggcggtga ctgcttctgg accagaacag ctccagctcc caggtatttt ctacaggacc 4081 acttgagtgg gcagccaagc ccaggctcgc agtatcaata aagcagttct ctgaggaatg 4141 actcctgggc tcttgtcagg ccaccttgtg cagccccctc atcacgttca actgcaatgt 4201 tagacaggga cttgattgtc tccatggtaa ctccagctgg ataagaagcc cattgcccca 4261 gctccccacc accgccacca ccaccaccac atgcgcactg ccaagcagcc ttccaagagc 4321 tattgcctca gactcttact ccttttttct ttccctccaa accccaatct gggaggcctt 4381 ctgtccctgc ctagatctga tgataagggc ctctccatca taaggcgctg ttcttaccca 4441 tggtttatct ctaagacaga atgctcagtt cttgccaccc ctggagacaa ctcaccccca 4501 gctcccagtt attgtatgta tctaaccccc acagtctgga gactccagca cagtagacag 4561 ttggggggac agaaaggaca cacagctgta ccattcacag ctacagctct agggcctgcc 4621 agggcccgga gtggcctggg aggctcagca gctagaccct ggggatttgt gtgctaggag 4681 agcttcagcc cctcagggag cagccagggg tcgggggtgg ggggtcccca tcttgcccag 4741 ctgcccattt tcctgtgctg atggcattaa caagaaccat ccccaactct gggctgcttc 4801 tcccatcaat cttttcagtc attttttaca aattttttac ttttttctct cctaaaattt 4861 ctgaacatac agtacaagta tgagctgcct tgcaggagag gcagtcttgc agccccagct 4921 tcatccagac acatgatgcc tatgttcctc tgggatgaca gagaaagggg aaggaaatgg 4981 gaggcactcc agcatgagga gcaggaattc acacacgggc ctccaaatgc ccaactgtcc 5041 cagtgcctga ggtgtcccct gactcagctg agagagttca acaggtccct aggcccagct 5101 cctacatgga ggggcaaatc caggcttccc ctaggctggg aagagctcat ggttggatcc 5161 tcttgcggta taggtaggca ttgctcacct ggttcataat gaccaagctg gtgaggacag 5221 ggcataatac agccaggtac caggctgcac cagtgacaga ggcagtgaac cagagtgagc 5281 acatgcccag caagaggctg gcactaccca gcaggtagcc caccagccca gaagtagcat 5341 ggtatagctt gagcttcgcc aggggccatc ggggcagcag cttggggtag agcagcccca 5401 ccccacctga gcactgcagc cctgcccaca gcacagccag cagccctgcc tgcccatgcc 5461 gcgtaaccag gtgggctttg ccaagctgct ctttgtggag gatgacaagg ccgaggccca 5521 gcagtgcaca cagcagggcc agcagctgca gcacccagtg gcagcgtgct cggcctttcc 5581 gtgagaggga gtgcagcagc gaactctcag gagaaaacac cagtagtgcc tcggtcatca 5641 ggaaggagaa ctgaggaaac aggggatggg aagctcaagt gagggtgctg ccccagaaga 5701 ggagaagagg gctttatgta acccttagct gccagtgagt ctgggctgat tccccttttt 5761 ctttctctcc ccactgacaa gactatccat ttggtggggc aagagtggaa aaccccatct 5821 gggatgatat atgtgccctg aataatctcc acacttcctc ctaggtgcca gtctgaaatg 5881 tcatctgtcg tatgttacca agcaaacaca gggtttgaga tgtggcagtt ataaatgatt 5941 cacaagcttt gtttccctaa cgtggggctg gatacaattt cttggcaaga gctggccaca 6001 tgggtgcact gtttcaggga aagttaggtg catcaccaga aattaaagta aaaagggaca 6061 gaggtacatc caaatgcaat gtctcagctt tctgcctaga ggtgggtgaa gcctcattgt 6121 gcttagacaa atcttaccca tttgacctgg gtcctgggtc ttcattactc tgattcaagt 6181 ctcttttttt tttttttttt ttttttggag atagtctctc tctgtcaccc aggctggagt 6241 gcagtggcct gatctcggct cactgcaatc tccgccttcc gggttcaagc aattctcctg 6301 cctcaggctc ctgagtaact gggattacag gcacctgcca ccacacccag ctaatttttg 6361 gttttttggg gggttttttt gtttttgggt tttttttttt ttttgggaca gagcctcgct 6421 ctgtcgccta ggttagaggt tagagtgcag tggcgcaatc ttggctcact gcaacctccg 6481 tctcctgggt tcatgcgatt ctcctgcctc agcctcccga gtagctggga ctacaggcgc 6541 gtgccaccat gcccggctaa ttttttgtat ttttagtaga gacggggttt caccatgttg 6601 gccaagatgg tctcgatctc ctgacctcag gtgatgtgcc cgcctcggcc tctaagtgct 6661 gggattacag gcatgagcca ccgtgcccag cctcaagtct gaaatatata ccatgggcag 6721 gagtagttgg agcatgcaac tcttccaaac agatattaaa tgtggacaaa gattcagagg 6781 ctgccagggc caccaaccat gcccatgctc ccacccctaa gaactgccca ggcccactac 6841 ttacagccaa agacataagc accgggtgcc aggagaacag gcctggaatg agaagaggag 6901 gcctcatggg gctgaggcaa gaagaaacca gccataccaa tccttaaggg ttgaggaata 6961 ccagcagccc atcacccaag gaggtgctgc caaactgtga gccccttctt gctctgcaac 7021 aagctgaccc cacacctatc ctctcttcct ggcctcttcc tggatcagag gccatgcttt 7081 gccactcctc tggcagtcct ccctccttcc cttacacctc ctgagaggca gccaaatgga 7141 gcacccagag aaggaaagac aaaccaggaa ttctgtgtgc cctggccata cctacctgct 7201 ccacctcact catcgctcca gcacagaatt ccctcaacca gccaggcctg ttcccggaag 7261 tcccactgcc cttcctcccc aggaacagtg gcccttccct tcccagaagt cagctatgaa 7321 ttctacttac tggagccagg cctggcaagc acagccacaa agatggtaaa gcccagggcc 7381 acaaggtggg cggcagcgcc agaagcagta cgcagagctc ggtagatgtg tgactcggtc 7441 tccgcagaaa gggccatcgt cagccgtgct agtggttgta gcctgaaagc atagtagcgt 7501 gggccaggat ctcaggtgcc cactgttcac tcctcactgg agaaaaggct gcaaggcagg 7561 aatgccaccc tcttccaata agttatggta ggaaagaaca gtaccaagac cagcaccagg 7621 aatggtcaca acatgccagt cgacggccgg cggcctgtca acgtgtaccc atgtctgaac 7681 tggtaccaat cgctggcccg ccttccaggt aggaggcgca aagccatgta agactacaaa 7741 tcccagcgtg taccacgccg tcgccggcag tagagccagc tgggagggcg cggagcacta 7801 tggaaattgt agttccctgc tgcggtccca gttacagcgt gaatccctta gcgcaccgcc 7861 tccccaagtg ctgccagcat gctgcccctc cacgctaggg acttgcctgc ctctgccgca 7921 gcgcagatct gatgcggtgg tttcctccaa agaaagcggg atactccggg ccctgccgcg 7981 cgcaccgccg ctccgtcacg gagccgcttt acttccgggt gcgacgtcta cccacccagg 8041 acttcctggg acgagccgtc ccccgcccca gccacgcctc tgagtcgcgc tgcgcaggcg 8101 ccattgggca gagggattgg caagcttgcg tgccccagcg acacaggcct cgaggctgtc 8161 tctgacaagt gttcacagga ggtggggacg cctctgcgcg aggaacgagg agctacgggc 8221 ctgggcccgg ttattgccat gggcagcggc tgccgcatcg aatgcatatt cttcagcgag 8281 ttccacccca cgctgggacc caagatcacc tatcaggtgc cacccgggct cgcgggactg 8341 ggcgggaaga gggacgttct cgagagctca actggctgcc tgaaggcagt agtttcccgt 8401 gctgcacgca ggcatgctgc ttcattcgca gacaactgtg aggctagagg atagtcagac 8461 ccaggccccc ggctcccaat gtggcaggga agacatacgt cacattggca gtgtttgtag 8521 gatatcgggc tgtagggatc cggttggggg gttccaggtc ccacagaagg tgagttgggt 8581 gacccatgag tatccctgaa ctgagtgaag agtaaatgat gcataggaag tttaggtaag 8641 gcaggagaaa gtgttccagg tagagggaac agcatatgca gaagccagga aggaggtgag 8701 atcatagcgt atgtttggaa acagaaagaa gtaataggcc tgagtggatg ggagagaggg 8761 gaacgctgga gaagaggact tgggcttggg gggcatgagg aggtcaccat ttgtactaca 8821 cccagtacag atgtccactg aggggcccat tgccactccc ctccccaggt ccctgaagac 8881 ttcatctccc gagagctgtt tgacacagtc caagtgtaca tcatcaccaa gccagagctg 8941 cagaacaagc ttatcactgt gtgagaccct agctcggggt gagcggtggg tggcaggggt 9001 tgtcccagga ggaggaaggg acaggcttgg gggcctgact gtgttccaat tatccagcac 9061 agctatggaa aagaagctga tcggctgtcc tgtgtgcatc gaacacaaga agtacagccg 9121 caatgctctc ctcttcaacc tgggcttcgt gtgtgatgcc caggccaaga cctgcgccct 9181 cgagcccatt gttaaaaagc tggctggcta tctgaccaca ctagaggtct gcaatgagat 9241 gggcttgggt ttaccgacag gcagagttgg ggaaggtgtt cccatgtctc caaagtcccc 9301 caaacctggg accctgcaaa ccagctcagg actcagggca gccaaccagc caggggtcct 9361 gggatgtgcc caccatgctt tccatgctct gtctagctag agagcagctt cgtgtccatg 9421 gaggagagca agcagaagtt ggtgcccatc atgaccatct tgctggagga gctaaatgcc 9481 tcaggccggt gcactctgcc cattggtatg gccagcctct gcaaggacag cagaggggta 9541 gaggagagat gggaaagtga acccagctag tcagggagaa gggcaagact catgaacaca 9601 ttggtgggca ggaaatagcc atgggctgag gtgcccatgt ccagggtcca atgcatgatg 9661 gagctgggac tgggggtagg gggaaagcag agctctccag ctctgcccct tgggcaattc 9721 cttccccttt gtgggcctcc gtttctcact ccataaaaag agaaagatta gccctaccca 9781 gaggcacctg cttgccccaa gggatgctgg gaggcctgga tggtaattga ggacctcttg 9841 gaagaagtgg cactgaggat atggatgggg ccccctgcag atgagtccaa caccatccac 9901 ttgaaggtga ttgagcagcg gccagaccct ccggtggccc aggagtatga tgtacctgtc 9961 tttaccaaag acaaggagga tttcttcaac tcacagtggg acctcactac acaacaagta 10021 tgccatccct ccctgggtat catcacctgg ggctggccac agccctggcc tcatcctgtt 10081 tcttctgacc ttgtcccact ctgcccagat cctgccctac attgatgggt tccgccacat 10141 ccagaagatt tcagcagagg cagatgtgga gctcaacctg gtgcgcattg ctatccagaa 10201 cctgctgtga gtgggcctac agtcatacct gggacaaggt caccagccag ggaagacctc 10261 aggggccctg ggtgtgaggg ttgggaaggc atggtcctca gaagctgagc acagctctct 10321 ctctcaggta ctacggcgtt gtgacactgg tgtccatcct ccaggtaggt gagttgggtt 10381 tggttaagtg gagagtggat catgtggctg gaccagacca gggctgtgtc agcttcccaa 10441 gcccctcact caggccccta tgccttctgc taccctctag tactccaatg tatactgccc 10501 aacgcccaag gtccaggacc tggtagatga caagtccctg caagaggcat gtctatccta 10561 cgtgaccaag caaggtagtg gtgggttctg ggggaagcca tcttggggag ggcagggcca 10621 catgatgagc tgatccctgg cacccacagg gcacaagagg gccagtctcc gggatgtgtt 10681 ccagctatac tgcagcctga gccctggcac taccgtgcga gacctcattg gccgccaccc 10741 ccagcagctg cagcatgttg atgaacggtc agaggagaat ttgctggggc atttgggagt 10801 tacctgaggg aagctagacc ctttatgtct ctcaggagcc ctggatcatg gggcactgcc 10861 aatccaagca ggcttcctgg agatgatggg ctacagagac aaaattgaag ggagactaca 10921 ggaaagggtt ggcctgcctg aaagaaggcc tggccagggc gtcaccccgt cctctgatcc 10981 tcaccctagg aagctgatcc agttcgggct tatgaagaac ctcatcaggc gactacagaa 11041 gtatcctgtg cgggtgactc gggaagagca gagccaccct gcccggcttt atacaggctg 11101 ccacagctat gacgagatct gctgcaagac aggtggaggc aggcgggcag tcagggtggg 11161 ttcagggcag ggtggccagg ccaaggccac tgacctctcc tccaccaccc cacccaggca 11221 tgagctacca tgagctggat gagcggcttg aaaatgaccc caacatcatc atctgctgga 11281 agtgaggctg gtagtgactg gatggacaca ttgctgtggg tagtccctcc tactaggagg 11341 cttgtcatac tgtctagagg ttgactctta gttctgtaaa taaagacatc catttcaaac 11401 agcctttatt gagtgctgtt tctgggccag ccgtgggaga cccagcagtg aatgaaatag 11461 ccacgagccg tgggctagat cgccatctgg tgggcaggac cgacaatcga caaataaaag 11521 tgccggttaa ttacaaaaaa aaaaaaaaaa aaaaaaaaaa atcgtcctga tgcggtgact 11581 cacctgtagc cccagctact caggaggctg aagtgggggg atcatttgag gccacgagtt 11641 ggaggctaca tttagctgtg atcccatcac tgcaccccag cctgggtgac agagcaagac 11701 cccatctcaa taataataat aataataaat aaacctagaa agaagcagat gtagccaggc 11761 gcggtggctc atgcctataa tcccagcact ttgggaggct gaagcgggca gatcacctga 11821 ggtcagatgt tcaaaaacag cctggccaac atagtgaaac cccatctcca ctaaaaaaaa 11881 atacaaaaaa ttagccaggt gtggtggtgc acacctgtaa tcccagctac tcaggaagct 11941 gaggcaggag aatcacttga acctgggagg cagaggttgc agtgagccaa gatcatgcca 12001 ctgcactcca gcctaggtga caggagactc aatttcaaaa aaagatgtga tcaagagtga 12061 gttgggttgg aggcagggac tgctctggat taaatggcca agaaaggcca tgctgaggtg 12121 taaactgagc cctgaaggaa aagtattcca ggcaaagtgc aaaactctga ggcaaaaacc 12181 tctttggaat gtatgaggga ggttcaggga agcaaggagg agaggaaaga atccaagtcc 12241 cgaggtacag gagaggacaa gaagtagatc ttgcagagct gctagtaagc ggcatcccac 12301 ctaacatttt gagggccaga agtagccaca gagcccctct cacgcaaggg agtaatgtct 12361 atatgctttt tttttttttt tgagacggtc tccctttgtt gtccaggctg gagtgcggtg 12421 gtgcaatctt gcctcactgc aaactccacc tcctgtgctc aagcgattct cctgcctcag 12481 tctcccatgt agctgggact acaggtgtgt gctaccacac ctgtctaatt tttgtacttt 12541 tagtagagac agggtttcac catgttgccc aggctggtct caaactcctg acctcaagtg 12601 atccgcctgc ctcggcctcc caaagtgctg agattacagg tgtcagccac cgcgcccggc 12661 ctgatataca cttttaaggc tccgggctgt ccatgaggat caagcgtgga agtaggtcta 12721 cagaaaaggc tgccgcagtt cacccatgac aggtgatgac agtctggaca tagggaggat 12781 ggagagaagt cgtgcgccct ggcagtattt caggagccaa ctagtggagc cgggacgcga 12841 gaaatagagg aaccaagaag ccataaggag gaggaagagc agcgggtagg tcaagatgcc 12901 acagccgcag gaggggctgt tggggtcggg gttaccccca tccctgtgca ggcggacttg 12961 gcgtctgagg acagagtcca gaccacaagg atctggagct caggagagac tcgtgggcca 13021 cagcccgaga aagcgctggg aatccaaata ctatggcgat tggcagtcgc gtaggcgagg 13081 cgggctagag acccgcccgg atttaggcgc gagccacctc caggggcggg gcccaggccg 13141 cactgcgcag gcgcggctaa cccgtttcca tggctgcgag aactgacgct ccccaaccgt 13201 cccgcaactg tcctgtccca gactttggca ccgtcggggt ccgtcgtccc cgaatgtgac 13261 agcatcccca ccccggctgc tgcccaggat ccgccggacc ccggcctcga tatgggagac 13321 ctggaactgc tgctgcccgg ggaagctgaa gtgctggtgc ggggtctgcg cagcttcccg 13381 ctacgcgaga tgggctccga agggtgaggc acccgggtca ggcggagtcc cggagtcatt 13441 gtccttgagt cggggagctg gggcctgact cgggggaggg gctgcccagt gtggaggggc 13501 tcccaaatgg gggagcagag cgttccgaga caggagtatt actgctcctg agccccctgt 13561 gtcccctcag gatcaggtta ggcttcagta ggatccagcc cccatcccca ctcctaatgc 13621 acacacgtgg acgcacatgc acttaccctc tgaggcaggt ggaaccagca gcatgagaac 13681 ctggagaagc tgaacatgca agccatcctc gatgccacag tcagccaggg cgagcccatt 13741 caggagctgc tggtcaccca tgggaaggta ccccgaggtc acaggcaggg ttcctgcctt 13801 cccccatacc tcacctactc tacccctccg gagtcccctg tgtgcccttc ccctctggcc 13861 tggtacacct gttctccctg aaggacaaag aggaatgtgt tacatgtttc attttgtatc 13921 cctattggac aggactctgg cacaccaggc tgggtgcagg gcatgagttg attagggaga 13981 aagctgtagg tcctagaaca gcttaggctt caaggggaag gcccaaatgc taaaggcatc 14041 tgtgaattga ctgtaaggct ggtggtgggg aaggggtggg gaggggttgg ggagggcggg 14101 agggagggga gataacctaa ctggaggtgg aacttcggca tggaaggaag cagccttccc 14161 aacatgaaag ggggaagtta gaaaccaggg agatgcctgg ctggaacatg gaccagggag 14221 tgtcaccagc agatgacctg agatatcaat tgaccaaaaa aaaaaaaaaa aaagccgggc 14281 atggtagctc atgcctgtta tcccagcatt ttgggaggcc aagacgggtg gatcatctga 14341 ggtcaggagt tcaaggccag cctggccaac atggtgaaac cccacctcta ctaaaaatac 14401 aaaaatttgc agagcatggt ggtgcacacc tgtaatccca gctactcggg aggctgaggc 14461 aggagaatcg cttgaacctg ggaggcagag gttacagaga gccaagatca tgccactgca 14521 ctccagcctg ggtgacaaga gtgaaactcc gtctcaaaaa aaaaaaaaaa aaaaagaaag 14581 gtcctttgca aaagaaagat ggaactcata ctaggggata ggataggaac agggcactgt 14641 gaagggtcct gagtaggagt gaggccaagg cacacaagag ctttggagga ccacagacag 14701 ggactagagg gagggcatga ggagaagggc tggcttgaaa gggatgcctg aatgggcggg 14761 cagaggataa gggtgcaggt gcaggcaggg caaggcagtc tgggaactgg gcaggagcca 14821 gtcacataag catgagggac atccacagag gtgttgggag cagctggtaa tgaaggtcca 14881 aggtgcaaga gagaagtcag gaaggatact catgggtctg gaatagttta ggggcccagc 14941 agtgtttggg gatatcaggg ttgagctgag ccggggatgg gagggttgcc aggcaaaggt 15001 aggcccatct catccctgtc ctttacccta cccttcaaag gtcccaacac tggtggagga 15061 gctgatcgca gtggagatgt ggaagcagaa ggtgttccct gtgttctgca gggtggagga 15121 cttcaagccc cagaacacct tccccatcta catggtggtg agctgggccc ctggttcata 15181 cctcttctca ctccttcaga gggctctgga ccggggagga gagctggtag cccctatccc 15241 ttcctcaggc cctgtccttc tctttatctg acaggtgcac cacgaggcct ccatcatcaa 15301 cctcttggag acagtgttct tccacaaggt gagggactat ctctgcccat gggccacagt 15361 tccgggtcag ggcctggcag gaagggagat tgtgtctgtg tggggaaggc atcagacaca 15421 gaaagtttcc ctcctccttt tcccaggagg tgtgtgagtc agcagaagac actgtcttgg 15481 acttggtaga ctattgccac cgcaaactga ccctgctggt ggcccagagt ggctgtggtg 15541 gcccccctga gggggaggga tcccaggaca gcaaccccat gcaggtgggt tgaggttacc 15601 tagggttgtg aaagcctagg tctgggttcc ccaaggcctg cgcaggtgag ggtggcccag 15661 cgtgaacact gtgtgacctc ccaggagctg cagaagcagg cagagctgat ggaatttgag 15721 attgcactga aggccctctc agtactacgc tacatcacag actgtgtgga caggtgagca 15781 gtccgactgg gcctgggcct actgtggagg gctggaagac cgggcctgta gcctgcctct 15841 actcacctcc ttcacaacgt ccctgcccct agcctctctc tcagcacctt gagccgtatg 15901 cttagcacac acaacctgcc ctgcctcctg gtggaactgc tggagcatag tccctggagc 15961 cggcgggaag gaggtagggt cctcccccac cagcctaagc cccaggctac tgcttcaggg 16021 tatctttttg atagaggggg gcagcttgca cacacgaaga caaaccctgt ccccaagccc 16081 actgaggata ccaggatgcc tcagccaagg ttggcctaga cctgagctct gcagcaggcc 16141 aggcccatgt gtccactact gaggctcacc ctgctctggg gtcagcagcc ctatagcctg 16201 ggcaagtcct gcagcccagg ttctcccatt cccaggcagt ggtcagtctc ccagccccca 16261 cagctggctc acttgaagag aattcaacgt ctgcacccag tgtgctggtt cctctcccca 16321 ggcaagctgc agcagttcga gggcagccgt tggcatactg tggccccctc agagcagcaa 16381 aagctgagca agttggacgg gcaagtgtgg atcgccctgt acaacctgct gctaagccct 16441 gaggctcagg cgcgctactg cctcacaagt tttgccaagg gacggctact caaggtcaga 16501 ctccctccgc accagccccc acagccccag taccgccctc cccatcctac cccgactgcg 16561 tccctgctgt ttatctttgc ccacccacct caaccccagt gctcttttca gtccttgggc 16621 ctcaggtgac acaccagcta gtgggacatg ggcccccaca ggcattctca gcccaaccca 16681 gccccttcct tttccttggc cccctggcca gcacctgcat cacactggcc tccactggac 16741 acccttgcag cttcgggcct tcctcacaga cacactgctg gaccagctgc ccaacctggc 16801 ccacttgcag agtttcctgg cccatctgac cctaactgaa acccagcctc ctaagaagga 16861 cctggtgttg gaacaggtag gcactggaaa gttagctgct caggaccact gtcccacttt 16921 accagcacct tcctgccact ctccacttct ctctcctaga tcccagaaat ctgggagcgg 16981 ctggagcgag aaaacagagg caagtggcag gcaattgcca agcaccagct ccagcatgtg 17041 ttcagcccct cagagcagga cctgcggctg caggcgcgaa ggtaaggcct gtggaaatgg 17101 cagggagggt ggaggggatg caggaggcat ggatgtgggt ggggtgcccc caccttccag 17161 ggccagtcag accttcctga ctttccccca ggtgggctga gacctacagg ctggatgtgc 17221 tagaggcagt ggctccagag cggccccgct gtgcttactg cagtgcagag gcttctaagc 17281 gctgctcacg atgccagaat gagtggtatt gctgcaggtg agggtatcct agaaccttgg 17341 acctctaagc cctactccca catcccccac atgcattgcc atcctcaata cccacctgcc 17401 tgcagggagt gccaagtcaa gcactgggaa aagcatggaa agacttgtgt cctggcagcc 17461 cagggtgaca gagccaaatg agggctgcag ttgctgaggg ccgaccaccc atgccaaggg 17521 aatccaccca gaatgcaccc ctgaacctca agatcacggt ccagcctctg ccggagcccc 17581 agtctccgca gtggagagca gagcgggcgg taaagctgct gaccgatctc cctcctcctc 17641 accccaagtg aaggctcgag acttcctgcc ccacccagtg ggtaggccaa gtgtgttgct 17701 tcagcaaacc ggaccaggag ggccagggcc ggatgtgggg accctcttcc tctagcacag 17761 taaagctggc ctccagaaac acgggtatct ccgcgtggtg ctttgcggtc gccgtcgttg 17821 tggccgtccg gggtggggtg tgaggagggg acgaaggagg gaaggaaggg caaggcgggg 17881 ggggctctgc gagagcgcgc ccagccccgc cttcgggccc cacagtccct gcacccaggt 17941 ttccattgcg cggctctcct cagctccttc ccgccgccca gtctggatcc tgggggaggc 18001 gctgaagtcg gggcccgccc tgtggccccg cccggcccgc gcttgctagc gcccaaagcc 18061 agcgaagcac gggcccaacc gggccatgtc gggggagcct gagctcattg agctgcggga 18121 gctggcaccc gctgggcgcg ctgggaaggg ccgcacccgg ctggagcgtg ccaacgcgct 18181 gcgcatcgcg cggggcaccg cgtgcaaccc cacacggcag ctggtccctg gccgtggcca 18241 ccgcttccag cccgcggggc ccgccacgca cacgtggtgc gacctctgtg gcgacttcat 18301 ctggggcgtc gtgcgcaaag gcctgcagtg cgcgcgtgag tagtggcccc gcgcgcctac 18361 gagagcggaa ggggcagcca aggggcagcg cagtcgccgc gggtcaagtc gcggcagagg 18421 gggtcggcgg ggacagctcc cgaggactag gtccgttact ttcgccccat cgctgaagag 18481 tgcgcgaaaa tggtttatcc cttgtcgcac tccactcgta tctgggccac agatgagcag 18541 aggtggctgc ttatatgtaa aaatacgctg attttaagtt tcttatcttt aaaatgcctt 18601 ggcccttctt gagaaagggt ttgtgcctac tgtcctcgga gtccatcttc ccaggcttgc 18661 ctcttctcaa acactcatga ccccctccag aacctttagg gtgaagggaa attaccacct 18721 atgggaggga gcctggaaaa atttagaacc tttggtgggc cccctgcaag caggagtttt 18781 gttgagtctt tatttagcaa acaccctttt ctgacccagt gaatcagatg ctaaaatatg 18841 cacgcagcca cacacccagc agtccttctg cacccctggg aatcgccagc aagcaaaggt 18901 tgctctcccc tgggtagaca ccagctggaa tcaccagggg tgcttttaca gtcctccccg 18961 ctagcctgga tcccaccgca gacctgttga atcaactgct gggagtggac cctaggcatc 19021 agtaaatttt aaaaactccc caaattattg taacatggag tctgggttga gcatcactgc 19081 tctggcctat ttaggaactt gtggatggat agtgtcccag gtctgtgtgt gcatggagac 19141 cctctcatcc ggtacaagag gacatcacaa attcagctgg ggggagcaca aagttgtgac 19201 agaatgcaaa gaatgaacaa ggggccgagc gcggtggctc atgcctgtaa tcccagcact 19261 tcggaaggcg gaggcgggtg gatcacctga ggtcaggagt tcaagaccag cctggccaac 19321 atggtgaaac ctcatgtcta ctaaaaaata aaaaaaaatg agccaggcgt agtggcgggt 19381 gcctgtaatc ccagctactc gggaggctga ggtgggagaa ttgcttgaac acaggaggcg 19441 gaggttgcag tgagccgaga tcgtgccact gccctccagc cttggcgaca gagtgagact 19501 ctgtctcaaa aaaaaaaaaa aaaaaaaaaa gaacaaggct gggacattgc agcgttctca 19561 aagagaaata aagtagccat ggagataaga agcaggatga tttgggcatg tttatcagag 19621 gtagagacaa gggagaaatc aaagataagt ttgggctttt gtctccagta actgggagcc 19681 tagtggccat ttttgctgca aagaggaagc tgggcaagtg tagcagtgag gctgaagaaa 19741 agggaattaa attttggcca tgttcacttg aaacgtcttt tagacatcct agtgaaggta 19801 ctggcacgga ggatctagtc tgagggttta ggtcagtgtt tcagccgtgg atctggggca 19861 gatgaatgta gacagaccag gccagtgatc aggactgagc ccagacttca tcgtgagata 19921 tggaagttga gtcagaatct gcaaaggagc tgagcaggag ctgcaggggg taggaggaaa 19981 actgggagag tgtagcccct gggagtcaaa gggagcaagc ttcaaatgat gctgaggggg 20041 tgagaatgga gaatggaaca ctggattcca tttggtagta cacagatcgc tgaggaccct 20101 gtcccgggca gtttcctgga ggaagaggca agcctggctg gagtgggtag aggggagagt 20161 gaaggcgaag gattagagtg tatagagacc agtgtcttgg tctgagggga gtagagacag 20221 gtgacaacca cagggcagac gtaggttaaa ggtgtttagt ttttccttca agtaaatggg 20281 cagatgtatt ccatatacgt tcccagtgaa gggccgggtg cggtggctca agcctgtagt 20341 cccagcactt tggaaggccg aggcgggtgg atcacctgag atcaggagtt tgagaccagc 20401 ctggctaaca tggtgaaacc ccgtctctac taaaaataca aaaattagct gggcatggtg 20461 gcgggcgcct gtaatcctag gtactcagga ggctgaggca gaagaatcgc ttgaacccag 20521 gaggcggagg ttgcggtgag ccgaaatcgc gccattgcac tccagcctgg gtgacaaaag 20581 caagacgcag ttttttgttg ttgttttttt aattgccaat gaggaaaggg gaagttctgt 20641 gctaggcgat agagatccaa ctgttgagca ggcctctctg cctgtggcct tccggccggt 20701 ttccagacgc ccaggtggcc aacattagag tccgcgtagc agtgtgaggt aacccactga 20761 gataggtcgg gcctgcggag cctggcgagc agcggccctc tccctggggc ttcccttcaa 20821 tctccgggac atttccccga cctggagctc ctccgcctca ccgccaggcc tctctgcaga 20881 ttgcaagttc acctgccact accgctgccg cgcgctcgtc tgcctggact gttgcgggcc 20941 ccgggacctg ggctgggaac ccgcggtgga gcgggacacg aacgtggtga gcgcggggcc 21001 gagggcgtat gggaagggcg aggatgggca ggccacagtg caggcattct cgagggctgc 21061 ctgggtgccg cgcgcaagga gcgttctaat tgccgatttc ccggcggcac agagaggcta 21121 attctgcgcg ggggctggga ggggagcctg gattgccggc tccgcaagta ctccacccgc 21181 tgcaagcgga cccgggccca ggctgaccca ggctccgcgc acgcgcactt cccgcacctt 21241 cccgccctcg cctccggcca gaggccactc ttgtgcgctt gcccggacgc tggcacccgc 21301 ccccgttccc tgtggtaggt ggggtctgtg agtggagctc cggagcgatg aggtcattcc 21361 tgggggcgaa gcgtgcgtgt ccccgccccg gcgttcctgc cccaatgaga caagagctag 21421 atcccggcga tctacgtttc agtcttaacg gttgcggcgc ggctctggcc cgggcgcacg 21481 cgcacactga cacgcgtaca cgcacgcacg cgaccggggc ggtggttggc ggctacggac 21541 gcgcaggact gggggacggg cgggtacggc tatgggcgag gcggaggcgc cttctttcga 21601 aatgacctgg agcagcacga cgagcagtgg ctactgcagc caagaggact cggactcgga 21661 gctcgagcag tacttcaccg cgcgaacctc gctagctcgc aggccgcgcc gggaccaggt 21721 gggagccagg gggtgccggc gggcgggagg ggaagcggtc gctggagctc cgccctcccc 21781 ggtccgttgc cgcgtcctgg gtcggtgggc agccccaccc tcctggctac gtggctcccc 21841 gcgggtcctg gccggggacc tgcccgcgga accgtgcgta agaccccgat tccaccgcct 21901 agatgctggg tgccggggcc cccttggttt ctgtcacaga caggttgaac acggaaaaag 21961 cagctgtatg gcttgtggta gacctgagcc gggcattatc cagctatgac taaagccgac 22021 cgagcagttt ggactagcac ctcgatttcc gcgttcgaat gctcctgctc cctccttggg 22081 gagactaggg gaggatgtgg agagggaaga gtcctcgcca ggaattgaga agtatgttta 22141 ggaaaacttg agaggcagag agagatcctg ctcctccatc tgcactcctg tatggagcca 22201 gctgagccct cacctcttcc ctgttctggc ctgtcaccag ctgctggaat gtggaagatt 22261 ctgttccctt cctctagggt ggatctggag aaagatttgg gaatagatag gaaagaagtc 22321 ttgttttgga ccataagcat tcaggagcac tttacccaca ggaaggggga aagctagatt 22381 ataaaatgcc taaagaggtg gaaaaagaga tccaggttac taacccagga ctgtaaggtg 22441 tctcggaacc tcctaggtat ccccattatc ggagaactgt gtgccagatg ccattggtgt 22501 gaccaccagg ctcagagaac caggcctagg caccaggaaa aagaaacagg gactgtgaag 22561 ctcagtatgc ctggcagaaa tggggcggaa atccttattt aagtaaagaa agtggagttg 22621 tgagtgatgc ttcagataaa attttacaaa attccttaca aaatgggtgg tgctcagcac 22681 gccaaaatct tagcccagag cttgggtgca agggttgagt tgagtgtaga cccctgggct 22741 tgtcttcatg tcagtcagtc ctgagccatt ttccactgtg gaaaggtggg aaaaccacaa 22801 gacactaacc aattgaaaag gagggctagc cacggaggtg cacacctgta atcccagcta 22861 cttgggaggg tgaggcagaa ggatcacttg aacctgggag gcagaggttg cagtgagcca 22921 agatcgtgcc actgcactcc agcctgagtg acagagtgag actctgtctc aaaaatagaa 22981 aaggaagcca agtacggtgg ctcacacctc taatgccaat gctttgggag gccaaggcag 23041 gtggatcatt tgcaatcagg aattcgaggt cagcctggcc aacatggtga aaccctatct 23101 ctactaaaca tacaaaaatt agccgggcat ggtggtgtgt gactgtagtc ccagctactt 23161 gggagactga atcacttcaa ccgggaggca aaggttgcag tgagccaaga tcgtgccact 23221 gcactccaac ctgggtgaca gggtgaggct ctgtctcaaa aaaaagaaag aaggctgggc 23281 ttggtgactc atgcctgtaa tctcagcatt ttgggaggcc aaggcaggca gatcacttga 23341 ggccaagagt tcgagacctg ccaggccaac atagcaaaac cccgtctgta ctgaaaatac 23401 aaaaaaatta tctggccatg gtggtgtgtg cctgtaatcc cagctactgg ggaggctgag 23461 gcaggagtat cacttgaacc cagaagacag aggttgcagt gagtcgagac tgggccactg 23521 cattccagcc tggatgagag agcaagactc tgtctcaaaa aaaaaaaaaa aaaaaaagaa 23581 agaataggag gctgagaagt cccaagttat atgttaaaaa aaaagaaaaa aacatcagtt 23641 ttaggccagg tgcagtggct cacaccttta atcccagcac tttggaaagc cgaggtgggt 23701 ggatcatgag gtcaggagtt caagaccagc ctggccaaaa tggtgaaacc ccgtctcgac 23761 taaaaataca aaaaattagc cagttgtggt ggcaggcacc tgtaatccca gctacttggg 23821 aggctgaagc agagaattgc ttgaacccag gaggcagaga ttgcaatgag ccaagatcgc 23881 accactgcac tccagcctgg aaaacagagc gagactctgt ctcaaaaaaa aaaccatcag 23941 tttttatgga cagtggtaga gtggagggtg ggtccctatg gtgcagaagg gaaattccat 24001 ggtcctgctg tgcatccgac tgggatggct gttgaaatcc tcttccagca ggcagctttg 24061 gaaacagaaa aagaaactct tcctccttta gaatcctgga agggctgtgc agtgcctcta 24121 atccaagtct gttttctgag tgaagatagg gaggttcatc accagaaggg aaggggctgg 24181 aaatgaggtc actgcatccc agcccagggc tcctgggtca tccaggaagg gaagaaggag 24241 caagctttct cattgttagg taggagctca gagccatcac aagaacaagt tagcaccatc 24301 cctgtgccct ccctgttctg caaacaaaat gatcttcctt cttgccctgg cactagagtc 24361 tgtctggcat ttctcctgcc cctagtactc ctcccatctg ggtacttctt cccgttggtg 24421 tactgaacaa acacatccac tgctttattc acagcctcca gccctcattt tccagggccc 24481 acaccatttg tttttactaa cccgacaagg ttgcccactg tccccagtaa ggtttgtact 24541 ggggttttta ctccagtgct cttctccatc caggagacct ttggatactt ggggaagaaa 24601 atgagcttaa attcccaccc ctcccccttt acctttttcc tgtaaggccc tggccttagt 24661 tcttagcccc acatccttgc tggctgcaga atagcagcgg gttctgggta aggagcattc 24721 tgctaaaacg ctccaccctg ctccctcatc tgtcctctcc atttgtcccc atcagatggt 24781 ttaagtgctt aaggggactc cagggcggag tcagggagaa ccctggctct cctgggctag 24841 gcacaagatc attctacagg aaaccttgtg ggaattcttc tgggacaaag tattggtcag 24901 cgctgagctt agctgtgtct gtgacactcg cattctaact agggcctatc tgacgtcaac 24961 aggaagtaag gctgatgcag tggggccaag ggagtctggg agaagaaagt cggttcagag 25021 ccctggctgc cctgtcccac actccaccct tccggcaaga atccagtccc tagatgaggt 25081 ggggagtgag tggtcgagtt aaaaatctct gggtcgggta cgatggttca cgcctgtaat 25141 cccagcactt tgggaggtga aggcaggcgg atcacttgag gtcaggagtt caagaccaac 25201 ctggccaatg tggtgaaatc ccatctctac taaaaataca aaaattagcc gggtgttgtt 25261 gtggcacgcg cctgtagtcc cagctactcg ggagtctgag gcaggagaat cgcttgaacc 25321 caggaggcag aacttgcagt gagccaagat ccagccactg cactacagcc tgggcgacag 25381 agtgaggctt cgtctcaaaa aaaaaaaaaa tctttgggcc aaatctccag acagcacagg 25441 caggtgcaga aacccaccag gaagctgcct gtgtacctct ggcagattgg agcctggcct 25501 aaagctgcct tttatgcagc ttgggtcaag gttaaacatc atgtcacagt gatttttctc 25561 actatgtgtg agacatggag aactggctcc aagtactact ctgtccactg gtggctggac 25621 tactgatgtg caccactctc cactcctctc accctgcagt gggtcatggc cccgtgccgg 25681 ggcagaggag aaaaatgggc tgccttctcc aggacaaacc ctcactccaa ctcaactagg 25741 gtgctgtgat cagaatgtgc aattgaggtg tgattttact gatttttttt ttttttgaga 25801 ccgagtttcg ctcttgttgc ccaggctgga gtgcgatggc acgatctcag ttcactgcaa 25861 cctccacctc ccgagtttga gcaattctcc tgcctcagcc tcctaagtag ctgggattac 25921 aggcatgtgc caccacgcct ggctaatttt gtatttttag tagagacggg gtttctccat 25981 gttggtcagg ctggtctcaa actcctgacc tcaggtgatc cacccgcctc ggcctcccaa 26041 agtgctagaa ttacaggcgt gagccaacgt gcccagcctg tttttgtttt ttgtgttttg 26101 aagcagggtc tcactcagtt ccccaggctg gagtgcagtg acacgataat agcttactgt 26161 agctgcaatc tcccgggctc aaacgatcct cccacctcag cctcctgaac agttgggact 26221 acaggcacac caccacacct ggctaatttt tttttttctt tttttagtag agatgaggtc 26281 ttgctatgtt gcccaagctg gtctcaaact cctgaggatc aagtgatcct cctaccttag 26341 cctcccaaaa tgctgggatt gcagatgtga gccaccacac ccagcctgat tttactttaa 26401 atgagagtcc ctcttcagag tccctcagct gttcctggcc cctggccatg tgccttcagt 26461 tgcccctgct tctgtggtat ccttaaggct acattcagtg ctgaggccct aggcaggcag 26521 cagagagaag ccaaatgatt ctgtctttcc cttatccacc cagagcatgc aaaaccagga 26581 gcagtggtgg gttcagggtg ggcaccagct atgtatatgt acatcaggga cagggggcca 26641 aaggcagtca gtttccaaag actgccccag aggccatttt tcagagaagc cctgggttcc 26701 tcaagggccc tgtgtccatg ctggcccatc ttgcaggacg agcctgtgga gtgggagaca 26761 cctgaccttt ctcaagctga gattgagcag aagatcaagg agtacaatgc ccagatcaac 26821 agcaacctct tcatgagctt ggtgagttga ctgctcagga agggggcgtg gggaggagca 26881 ggtacccagc tatgtgcctg atactcagag ggtcacaact gaggttatct tgggtgggcg 26941 caagcagtaa tttgtgcata cccagcctag ccccaagtag actgacatct cacctggaac 27001 ctattatcaa ggtttggttt ctctatttct ttagaacaag gacggttctt acacaggctt 27061 catcaaggtt cagctgaagc tggtgcgccc tgtctctgtg ccctccagca agaagccacc 27121 ctccttgcag gatgcccggc ggggcccagg acggggcaca agtgtcaggc gccgcacttc 27181 cttttacctg cccaaggatg ctgtcaagca cctgcatgtg ctgtcacgca caagggcacg 27241 tgaagtcatt gaggccctgc tgcgaaagtt cttggtggtg gatgaccccc gcaagtttgc 27301 actctttgag cgcgctgagc gtcacggcca aggtgggctt cccaccccac cctgccctat 27361 gtgagggtat atacgcatgc acctgagcat gcaggggctg agcagctggc cctgtctctg 27421 atcattactt ccccttcaca gtgtacttgc ggaagctgtt ggatgatgag cagcccctgc 27481 ggctgcggct cctggcaggg cccagtgaca aggccctgag ctttgtcctg aaggaaaatg 27541 actctgggga ggtgaacgtg agtacatagt tcttagtttc ttggttgtca ctagacagga 27601 ctgatgggct gtagctacag taaggcttgg aggaggaatt gtgctggaag acaagccctg 27661 caaaacagtt ccaggagtgt ataggcattg taactaaagc aaaggcttcc agaccactca 27721 tgccaaagcc tagggttgtc ccaagaagcc aggaagaatt gccttggtgc tttgatcttt 27781 cctggtgtgg aaaatcttct ggagatgcag gagtccatct aatgacatga ggaggccccc 27841 ttcagacttt ttacctggaa gctttctggc tccaaggtat taggcctgtg gagtgaaatt 27901 agactcagaa tatgcctgac ctgtccacag gtaattgggg aacatctgac ttggttgtct 27961 cagtaaggtg accgttttgt agggcccatc ttccatacaa actgctgtca gggatcctac 28021 cagagatcat tcagccaaga gcctgacatc agaaagccca gtcctagctt gtgtgaacat 28081 gaggtgctag tcttctctgg ggagggtctg ctggcttggc catcccttct gcagcctgta 28141 cactcccctt ttgccccttg cagtgggacg ccttcagcat gcctgaacta cataacttcc 28201 tacgtatcct gcagcgggag gaggaggagc acctccgcca gatc // LOCUS AC002511 98713 bp DNA PRI 28-AUG-1997 DEFINITION Human DNA from chromosome 19-specific PAC PC28130, genomic sequence, complete sequence. ACCESSION AC002511 NID g2347082 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 98713) AUTHORS Lamerdin,J.E., McCready,P.M., Adamson,A.W., Burkhart-Schultz,K., Garcia,E., Kyle,A., Ramirez,M., Stilwagen,S., Garnes,J., Danganan,L., Bruce,R., Quan,G., Montgomery,M., Ow,D., Kobayashi,A., Olsen,A.O. and Carrano,A.V. TITLE Sequence analysis of a 1Mb region in 19q13.1 JOURNAL Unpublished REFERENCE 2 (bases 1 to 98713) AUTHORS Lamerdin,J.E. TITLE Direct Submission JOURNAL Submitted (28-AUG-1997) Human Genome Center, Lawrence Livermore National Laboratory, 7000 East Ave., Livermore, CA 94551, USA FEATURES Location/Qualifiers source 1..98713 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="PC28130" /chromosome="19" /map="19q13.1 from D19S208 to CAPNS" /map="overlaps cosmid F16632 to the left and cosmid R26667 to the right" /map="oriented from centromere to telomere" /cell_line="HSF7" /cell_type="fibroblast" /sex="male" /note="see Ioannou et al.(1994) Nature Genetics 6: 84-89 for more information regarding PAC library construction." repeat_region 8..67 /rpt_family="Alu" repeat_region complement(69..169) /rpt_family="Tigger2" repeat_region complement(261..730) /rpt_family="Tigger2" repeat_region complement(760..1013) /rpt_family="Alu" repeat_region 1057..1274 /rpt_family="Alu" misc_difference 1257..1258 /note="polymorphism in length of Alu tail" /clone="ch19 cosmid f16632" /db_xref="U62631" /replace="two fewer A's in cosmid at base 29187-29188" repeat_region complement(1320..1550) /rpt_family="Tigger2" repeat_region 1479..1550 /rpt_family="MER8" misc_difference 2741 /standard_name="polymorphism" /clone="ch19 cosmid f16632" /db_xref="U62631" /replace="C in cosmid (base 30668) with T in PAC clone" misc_difference 2780 /standard_name="polymorphism" /clone="ch19 cosmid f16632" /db_xref="U62631" /replace="A in cosmid (base 30707) with G in PAC" repeat_region complement(2925..3242) /rpt_family="Alu" repeat_region 3453..3740 /rpt_family="Alu" misc_feature complement(3990..4054) /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: good, score: 70.000" misc_feature complement(4210..4312) /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: excellent, score: 92.000" misc_feature 4688..7614 /standard_name="duplication" /note="duplication flanking ~10 kb DNA insertion in PAC clone relative to ch19 cosmid f16632~locations: ~4688-7614 on 5'-side of insertion and ~17160-20068 on 3'-side" misc_feature complement(5499..5778) /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: excellent, score: 85.000" misc_feature 5566..6063 /note="BLASTX similarity to P34976 (50..215); match: 0.26, score: 5.8e-13; database searched: nr; TYPE-1 ANGIOTENSIN II RECEPTOR (AT1) pir||A48857 AT1 angiotensin II receptor - rabbit >gi|299615 (S59041) AT1" misc_feature complement(5988..6380) /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: excellent, score: 77.000" misc_difference 6225 /standard_name="polymorphism" /clone="ch19 cosmid f16632" /db_xref="U62631" /replace="G in cosmid (base 34153) with A in PAC" misc_feature 6247..6333 /note="BLASTX similarity to P34976 (290..318); match: 0.31, score: 5.8e-13; database searched: nr; TYPE-1 ANGIOTENSIN II RECEPTOR (AT1) pir||A48857 AT1 angiotensin II receptor - rabbit >gi|299615 (S59041) AT1" misc_difference 6559 /standard_name="polymorphism" /clone="ch19 cosmid f16632" /db_xref="U62631" /replace="G in cosmid (base 34486) with A in PAC" misc_difference 7341 /standard_name="polymorphism" /clone="ch19 cosmid f16632" /db_xref="U62631" /replace="T in cosmid (base 35268) with C in PAC" misc_difference 7350 /clone="ch19 cosmid f16632" /db_xref="U62631" /replace="C in cosmid (base 35277) with T in PAC" misc_difference 7484..19281 /standard_name="polymorphism?" /clone="ch19 cosmid f16632" /db_xref="U62631" /replace="insertion of ~10 kb in PAC clone" repeat_region complement(10667..10998) /rpt_family="MER1" repeat_region complement(11157..11490) /rpt_family="L1" misc_feature complement(11607..11717) /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: marginal, score: 43.000" misc_feature complement(12401..12451) /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: excellent, score: 82.000" repeat_region 12779..12916 /rpt_family="MER25" repeat_region 13127..13375 /rpt_family="MER25" repeat_region complement(13601..13884) /rpt_family="Alu" repeat_region 13900..16647 /rpt_family="L1" CDS 17927..18967 /note="hypothetical 38.8 kDa protein similar to G-protein coupled receptors; BLASTX similarity to (U66578) putative G protein-coupled receptor [Homo sapiens]; gi|2231669 (U90323) purinergic receptor P2Y9 [Homo sapiens]; Pval= 3.7e-20~BLASTX similarity to P34976 (33..68); match: 0.33, score: 2.4e-13; database searched: nr; TYPE-1 ANGIOTENSIN II RECEPTOR (AT1) pir||A48857 AT1 angiotensin II receptor - rabbit >gi|299615 (S59041) AT1~predicted exon, program: grail2exons_human_1.3, frame: 1, quality: excellent, score: 93.000" /codon_start=1 /product="PC28130_1" /db_xref="PID:g2347083" /translation="MDTGPDQSYFSGNHWFVFSVYLLTFLVGLPLNLLALVVFVGKLR CRPVAVDVLLLNLTASDLLLLLFLPFRMVEAANGMHWPLPFILCPLSGFIFFTTIYLT ALFLAAVSIERFLSVAHPLWYKTRPRLGQAGLVSVACWLLASAHCSVVYVIEFSGDIS HSQGTNGTCYLEFWKDQLAILLPVRLEMAVVLFVVPLIITSYCYSRLVWILGRGGSHR RQRRVAGLVAATLLNFLVCFGPYNVSHVVGYICGESPVWRIYVTLLSTLNSCVDPFVY YFSSSGFQADFHELLRRLCGLWGQWQQESSMELKEQKGGEEQRADRPAERKTSEHSQG CGTGGQVACAEN" repeat_region 20319..20595 /rpt_family="Alu" misc_difference 21328 /standard_name="polymorphism" /clone="ch19 cosmid f16632" /db_xref="U62631" /replace="C in cosmid (base 36784) with T in PAC" repeat_region 21849..21978 /rpt_family="Alu" repeat_region 21979..22490 /standard_name="VNTR" /rpt_family="minisatellite" repeat_region 22166..22270 /rpt_family="Alu" repeat_region 23162..23449 /rpt_family="Alu" misc_feature 23544..23665 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: good, score: 61.000" repeat_region 24021..24330 /rpt_family="Alu" repeat_region 24520..24800 /rpt_family="Alu" misc_feature complement(24848..24943) /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: excellent, score: 81.000" repeat_region 25659..25928 /rpt_family="Alu" repeat_region complement(26232..26521) /rpt_family="Alu" repeat_region complement(28145..28453) /rpt_family="Alu" gene 28691..30619 /note="EF-1-ALPHA-1 pseudogene; BLASTX similarity to HUMAN ELONGATION FACTOR 1-ALPHA 1 (EF-1-ALPHA-1) (EF-TU) >pir||EFRB1 translation elongation factor eEF-1 alpha chain - PVal= 1.1e-275" /gene="EEF1Ap" /map="19q13.1" /pseudo misc_feature 28691..28902 /gene="EEF1Ap" /pseudo /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: excellent, score: 95.000" repeat_region 29017..29502 /rpt_family="LTR2" misc_feature complement(30439..30478) /gene="EEF1Ap" /pseudo /note="BLASTN similarity to T15563 (439..478); match: 0.9, score: 2.1e-162; database searched: est; IB1554 Infant brain, Bento Soares Homo sapiens cDNA 3'end similar to Human elongation factor 1-alpha (EF1A)." misc_feature complement(30476..30498) /gene="EEF1Ap" /pseudo /note="BLASTN similarity to T15563 (418..440); match: 0.91, score: 2.1e-162; database searched: est; IB1554 Infant brain, Bento Soares Homo sapiens cDNA 3'end similar to Human elongation factor 1-alpha (EF1A)." misc_feature complement(30496..30914) /note="BLASTN similarity to T15563 (1..419); match: 0.92, score: 2.1e-162; database searched: est; IB1554 Infant brain, Bento Soares Homo sapiens cDNA 3'end similar to Human elongation factor 1-alpha (EF1A)." repeat_region complement(31006..31295) /rpt_family="Alu" repeat_region 31632..31913 /rpt_family="Alu" repeat_region complement(32245..32529) /rpt_family="Alu" repeat_region complement(32737..33013) /rpt_family="Alu" repeat_region complement(34285..34408) /rpt_family="Alu" repeat_region complement(34529..34914) /rpt_family="Alu" repeat_region 34994..35680 /rpt_family="LTR8" misc_feature 35796..35975 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: good, score: 56.000" repeat_region complement(36419..36594) /rpt_family="MER4" repeat_region 37030..37301 /rpt_family="Alu" repeat_region 37778..38170 /rpt_family="LOR1" repeat_region 38190..38477 /rpt_family="Alu" repeat_region 39381..40004 /standard_name="VNTR" /rpt_family="microsatellite" repeat_region complement(40008..40405) /rpt_family="MLT2B2" repeat_region complement(41482..41758) /rpt_family="Alu" repeat_region complement(41786..42097) /rpt_family="Alu" misc_feature complement(42468..42578) /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: good, score: 52.000" repeat_region complement(42894..43167) /rpt_family="Alu" repeat_region 44074..44352 /rpt_family="Alu" repeat_region complement(44379..44669) /rpt_family="Alu" repeat_region 45452..45751 /rpt_family="Alu" repeat_region complement(46395..46657) /rpt_family="Alu" repeat_region 46949..47043 /rpt_family="Alu" misc_feature 47588..47684 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: marginal, score: 44.000" repeat_region complement(47914..48215) /rpt_family="Alu" misc_feature 48236..48403 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: marginal, score: 48.000" repeat_region complement(48474..48748) /rpt_family="Alu" repeat_region complement(49339..49650) /rpt_family="Alu" repeat_region complement(49663..49953) /rpt_family="Alu" repeat_region 50386..50682 /rpt_family="Alu" repeat_region complement(50977..51711) /rpt_family="Alu" repeat_region 52302..52575 /rpt_family="Alu" repeat_region 52750..53029 /rpt_family="Alu" misc_feature 53347..53439 /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: good, score: 50.000" repeat_region complement(53629..53912) /rpt_family="Alu" repeat_region complement(53989..54270) /rpt_family="Alu" misc_feature 54291..54471 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: excellent, score: 100.000" repeat_region complement(54680..54954) /rpt_family="Alu" repeat_region complement(55414..55670) /rpt_family="Alu" misc_feature 55791..56026 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: good, score: 68.000" repeat_region complement(56092..56336) /rpt_family="L1" misc_feature complement(56203..56313) /note="BLASTX similarity to (307..343); match: 0.32, score: 3.7e-18; database searched: nr; hypothetical protein (L1H 3' region) - human" misc_feature complement(56321..56413) /note="BLASTX similarity to (275..305); match: 0.45, score: 3.7e-18; database searched: nr; hypothetical protein (L1H 3' region) - human" misc_feature complement(56518..56583) /note="BLASTX similarity to (211..232); match: 0.45, score: 3.7e-18; database searched: nr; hypothetical protein (L1H 3' region) - human" misc_feature complement(56608..56679) /note="BLASTX similarity to (177..200); match: 0.5, score: 3.7e-18; database searched: nr; hypothetical protein (L1H 3' region) - human" misc_feature complement(56715..56807) /note="BLASTX similarity to (135..165); match: 0.41, score: 3.7e-18; database searched: nr; hypothetical protein (L1H 3' region) - human" misc_feature complement(56833..57006) /note="BLASTX similarity to (37..94); match: 0.31, score: 3.7e-18; database searched: nr; hypothetical protein (L1H 3' region) - human" repeat_region complement(57178..57549) /rpt_family="THE1" repeat_region complement(57747..57996) /rpt_family="Alu" repeat_region complement(58655..58943) /rpt_family="Alu" repeat_region complement(58981..59302) /rpt_family="Alu" repeat_region complement(59666..59965) /rpt_family="Alu" repeat_region 60636..60951 /rpt_family="Alu" repeat_region 61276..61535 /rpt_family="Alu" repeat_region complement(61561..61842) /rpt_family="Alu" misc_feature 61874..61980 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: good, score: 68.000" repeat_region complement(62257..62523) /rpt_family="Alu" misc_feature 63046..63083 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: excellent, score: 89.000" repeat_region complement(63517..63689) /rpt_family="LTR3" repeat_region 64382..64630 /rpt_family="Alu" misc_feature 65411..65558 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: good, score: 55.000" repeat_region 66068..66352 /rpt_family="Alu" repeat_region complement(66632..66914) /rpt_family="Alu" repeat_region 67381..67829 /rpt_family="Alu" repeat_region complement(68952..69233) /rpt_family="Alu" repeat_region complement(69238..69511) /rpt_family="L1" misc_feature 69583..69807 /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: marginal, score: 44.000" repeat_region complement(70193..70468) /rpt_family="Alu" repeat_region complement(70492..70748) /rpt_family="Alu" repeat_region complement(71392..71708) /rpt_family="Alu" repeat_region complement(71765..71888) /rpt_family="MIR" repeat_region 72673..72954 /rpt_family="Alu" repeat_region 72997..73278 /rpt_family="Alu" misc_feature 73219..73251 /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: excellent, score: 75.000" repeat_region 73541..73818 /rpt_family="Alu" repeat_region complement(74330..74567) /rpt_family="Alu" repeat_region complement(74685..74844) /rpt_family="MLT2B2" repeat_region complement(74907..75049) /rpt_family="Alu" repeat_region complement(75231..75500) /rpt_family="Alu" repeat_region complement(75612..75780) /rpt_family="MIR" misc_feature complement(77418..77577) /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: marginal, score: 46.000" repeat_region complement(78275..78533) /rpt_family="Alu" misc_feature 79378..79519 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: excellent, score: 77.000" repeat_region complement(79833..80097) /rpt_family="Alu" repeat_region 81341..81590 /rpt_family="Alu" repeat_region complement(82200..82468) /rpt_family="Alu" repeat_region 84309..84772 /rpt_family="Alu" repeat_region complement(85043..85128) /rpt_family="MLT1" misc_feature complement(85435..85618) /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: marginal, score: 49.000" repeat_region complement(85795..86090) /rpt_family="Alu" repeat_region complement(86102..86242) /rpt_family="MLT1" repeat_region 86860..87270 /rpt_family="Alu" repeat_region 87562..88014 /rpt_family="Alu" repeat_region 88821..89097 /rpt_family="Alu" misc_feature 89216..89287 /note="BLASTX similarity to (397..420); match: 0.37, score: 3.2e-06; database searched: nr; hypothetical L1 protein (third intron of gene TS) - human >prf||1510254A L1 repetitive element ORF [Homo sapiens]" misc_feature 89286..89321 /note="BLASTX similarity to (426..437); match: 0.41, score: 3.2e-06; database searched: nr; hypothetical L1 protein (third intron of gene TS) - human >prf||1510254A L1 repetitive element ORF [Homo sapiens]" misc_feature 89327..89401 /note="BLASTX similarity to (488..512); match: 0.4, score: 3.2e-06; database searched: nr; hypothetical L1 protein (third intron of gene TS) - human >prf||1510254A L1 repetitive element ORF [Homo sapiens]" repeat_region 89430..89717 /rpt_family="Alu" repeat_region complement(89835..90134) /rpt_family="Alu" repeat_region 90174..90273 /rpt_family="L1" repeat_region 90278..90553 /rpt_family="Alu" repeat_region 90576..90679 /rpt_family="L1" misc_feature 90627..90710 /note="BLASTX similarity to (1134..1161); match: 0.53, score: 3.1e-12; database searched: nr; hypothetical protein (L1H 3' region) - human" misc_feature 90710..90946 /note="BLASTX similarity to (1163..1241); match: 0.26, score: 3.1e-12; database searched: nr; hypothetical protein (L1H 3' region) - human" repeat_region complement(91002..91279) /rpt_family="Alu" misc_feature 91399..91499 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: excellent, score: 85.000" repeat_region 91874..92170 /rpt_family="Alu" misc_feature 92961..93094 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: good, score: 64.000" repeat_region 94219..94497 /rpt_family="Alu" repeat_region complement(95368..95659) /rpt_family="Alu" misc_feature 96276..96770 /note="BLASTX similarity to 190698 (7..171); match: 0.27, score: 6.1e-22; database searched: nr; (M88177) platelet activating factor receptor [Homo sapiens]" CDS 96282..97274 /note="hypothetical kDa protein similar to G-protein coupled receptors; BLASTX similarity to 190698; score: 6.1e-22; database searched: nr; (M88177) platelet activating factor receptor [Homo sapiens]" /codon_start=1 /product="PC28130_2" /db_xref="PID:g2347084" /translation="MLPDWKSSLILMAYIIIFLTGLPANLLALRAFVGRIRQPQPAPV HILLLSLTLADLLLLLLLPFKIIEAASNFRWYLPKVVCALTSFGFYSSIYCSTWLLAG ISIERYLGVAFPVQYKLSRRPLYGVIAALVAWVMSFGHCTIVIIVQYLNTTEQVRSGN EITCYENFTDNQLDVVLPVRLELCLVLFFIPMAVTIFCYWRFVWIMLSQPLVGAQRRR RAVGLAVVTLLNFLVCFGPYNVSHLVGYHQRKSPWWRSIAVVFSSLNASLDPLLFYFS SSVVRRAFGRGLQVLRNQGSSLLGRRGKDTAEGTNEDRGVGQGEGMPSSDFTTE" misc_feature 96282..97274 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: excellent, score: 92.000" misc_feature 96753..96812 /note="BLASTX similarity to 190698 (167..186); match: 0.3, score: 6.1e-22; database searched: nr; (M88177) platelet activating factor receptor [Homo sapiens]" misc_feature 96843..96914 /note="BLASTX similarity to 190698 (195..218); match: 0.29, score: 6.1e-22; database searched: nr; (M88177) platelet activating factor receptor [Homo sapiens]" misc_feature 96936..97010 /note="BLASTX similarity to 190698 (229..253); match: 0.36, score: 6.1e-22; database searched: nr; (M88177) platelet activating factor receptor [Homo sapiens]" misc_feature 97050..97127 /note="BLASTX similarity to 190698 (277..302); match: 0.26, score: 6.1e-22; database searched: nr; (M88177) platelet activating factor receptor [Homo sapiens]" repeat_region 97517..97806 /rpt_family="Alu" BASE COUNT 26842 a 23219 c 23463 g 25189 t ORIGIN 1 gatcgcacca ctgcactcca gcctgggtga cagagcaaga tcctgtctct ggaaaaaaaa 61 gaaaaaagac atattttgta tgttatatat attatgtgct gtattcttac aataaagtaa 121 gctagagaaa agtacatgtt actgaaaaca ttattagaaa gagaaaatac cttcacagca 181 ctgtgctgta tttatcaata ctgtaagttt tgttgtctgt ttataagaga agttgtctgt 241 ctcaagcggc agtaaccgcg gctgcagatc tcaatctgtg gtagatatca agcagttcaa 301 ccctttttta taatgtcatg acttttcttt gcttcttgga agcagctctg gcatcactag 361 tggccccgca tttggggcca agggggttat tcaaagttta cggtattgca ttaaacatgg 421 tgaaacatat gcaatgacta tgaaagatca ctttttactt ctgtacacag tgtactggcg 481 agaactgctc accaggaggt gattagcttc acgtggtgtt ggaagcagat actcaacagt 541 tgagctcacg gagaaagcac aggaggtggc tacgaaatta ttataggagt acagtatgtc 601 tacagtaaat atgcacttat gactttaata ctgcacatct ttatgtttgc ttgcatttct 661 cttgacttca aatggcatca aatagagttt gtgtttgcgt gcatatattt ttgataaatt 721 ttaacttttt tgttttgttt tgtttttaga cagagtcttg ctctgtctcc caggctggag 781 tgcaatggtg tgatcttggc tcactgcaac ctccgcctcc caggttcaag caattctcct 841 gcctcagcct cccaagtggc tgggattaca ggcacctacc accacacctg gctaattttt 901 tgtattgtaa gtagagatgg ggtttcgcca tgttggccag gcctgtctcg aacccctgac 961 ctcagttgac ccacccgctt cggcctccca aagtgctggg attacaggcg tgaaccacgg 1021 taccctgcca attttaactt tctataagag atttgcccag cctgggcaac atggagaaac 1081 cctgtcttta ccaaaaatac aaaaaattta gctgggcatg gtggcgtgtg cctgtagtcc 1141 caggtacttg ggaggctgag gtggaaggat ggtttcagcc tgggacgcag aggttgcagt 1201 gagccgagat catgtcattg cactccagcc tgggtaatag agccagatcc tgtctcaaaa 1261 aaaaaaaaaa aaaaaaagct ttgtatccat tttggcatac atagactagt atgtatggca 1321 gtaaatagac tagtggcagt aaatagacta gtacctacat atattttttg tattcgtggc 1381 atacttaact tttgcttaat tttaaaaata tatttctagg cttgtagttc ctctgtaagt 1441 tttttcaaat tgttgcaaat ctctgaaaat gtttccaatg tatttattga aaaaaatcca 1501 caagtaagtg gacctgcaca gttcaaacct gtgctattca aaggtcaact atatagtctg 1561 taatacgtga tgggtgtcac gctgagtgtc tacagtgagt gttgtctcac ttcagcttca 1621 cacaacgctg tggtttagcc ttgccactgc cgtacggatg agcgggtgtt cactacacga 1681 aggtactggg cagaggagca aacactgact aaattctagc ctgtgctttg ctcaacaagc 1741 cacatgcctg ccttgcttgc atcaagggaa ggagtattgt gttctaattt gaagaaatac 1801 accagccact ccttaggcag cacttggtgc gggatgggct cctctggaaa gggagcacct 1861 aatttgcaca aaggtgctgc accaggtagc tacgatcctg gaatgggatg aagaaacttg 1921 cctaaggccc tacagcaagg aaggggtaaa agcaggatgt gatccttcca agcagaaccc 1981 ccaagagcca ggggacattt gggaaacctg ggggtgtggg ggttaaccct gtatttgtgg 2041 tccatttcgc aaaggatagc atcccacatc catgagaggg aggtcagaag ggagtggctg 2101 tggctgtgaa ggagacagac acctgggcac ctgctctaaa ttacagcaat gccatgcaca 2161 tgatgatgta ttggagcaca acaaatggca gccaccagct ggggtgcacg caccaatttc 2221 tacaaatgag ctctgttcag gctgaaatgt ctacgcatgc aaggattttt cttgtaattg 2281 atgatatatc cccagcgccc agaagacaga gtggcacgca gtaggtgctc agtaaatgtt 2341 cgctcaatga atgaatgaat gcctctacta gccttccaaa gaaagttcta tgtctgtttt 2401 atcaatatgg aaacaggcca gagaaattaa gtaacttctt ttggaagcca gaaacaagcc 2461 cggcacttgt gcagtgacaa caccagggaa tttcagtaaa aacccaaact ccaccccagg 2521 gagccccagt gcctcctccc gcatcacagg gtgccagggc ccactctctt tgactcctgc 2581 tccatagccc ctgctgggta ctgggctgct tctcccagtt cctcctgcct ccccccactt 2641 agggcaaatc cagtgcctcc ctggggccca ggagactcag tgtgatccac cccaggcgcc 2701 tctcactcct gtgagaccca aacctcctat cagctcccct agcaggccaa gtctgtgcac 2761 ccgtgactcc cctgccccag tgttccctcc agtggctctt tgcctcttca gtatccacgt 2821 cttcatgatc cttcccctac tccatccccc aagccctcac caccaaatgt aatgtaaaca 2881 gcatgacagc aaaaactttt tgttcacaac tgcgtgttgt tttttttttt tttttttttt 2941 tttagacaca gtctcactct gttgctcagg atggagtgca gtgacacagt catggctcac 3001 tgcagccttg aattcctggg ctcaagcgat tcttccattt cagcttccca agtagctggg 3061 accagaggca catgccacta aacccagcta atttcttttt tctttctctc tttttttttt 3121 ttttttcccc tgtggacata gggtctcctc atgttgccca ggctagtctc aaacaacctg 3181 tgctcaagca atcctcccac ctcagcctgc caaagtgcta aaattacgga cctgagccac 3241 cgtacccagc ctactgctgc acttttaaca gcttaattga gatattatgt acatatcatc 3301 aaattcacca atgttaagtg tacaattcaa tgatttttag taaatttact gaggtgtgag 3361 tccattacca tcagctagtt ttagaacatt ttcatgaccc tagtaaagga ttttacttca 3421 tgttcattta aacttaatct tgggccaggc ggggtggctc acgcctataa tctcagcatt 3481 ttgggaggct ggggtggacg gatcacctga ggtcagtagt tcgagaccag cctggccaac 3541 atggcgaaac ctcatctcta ctaaacatac aaaaattagc caggcgtggt ggcgggcacc 3601 tgtaatccta gctacttagg aagctgaggc atgagaattt cttgaacccg ggaggcagag 3661 tttgcagtga gccaagattg caccattgca ctccagcctg ggcgacaaca agaccctgtc 3721 tcaaaaaaaa aaaaaaaaaa aaaaaaacaa cttaatcttc atttctacct ccatccccca 3781 gacaaccacg agtctacttt ctgactctat aactttgcct attctggaca tatcatataa 3841 ataaatgaaa tctgacaata tatgtacatg gtcttttacg tctggcttgt ttcactcaac 3901 ataaggctgt agtgtgtatg ggtagctcat tccttttctt tgccaaatag aattccagta 3961 tatggatgtg ccactttttt attcactcac cagctgatgg gctgccaggt aacatttcca 4021 ctgttggtga ctatgaccaa cactgctgtg aacatttagg actgagtctt taagtagacc 4081 tgagtttcca cttctcttaa gtaaatacct acggtggaat tgttggatcg catggtgaat 4141 ttatgtttaa ctttttaagc aactgccacc tgctttccaa agcagggaca ccacctccca 4201 ttattctcac cagcagtgaa tgaggcccct atctctctac attctcgccg accataggca 4261 ttgtctgact ttttgcttgt ggccattcta gtggatgtga aatcatatct cattgtgccc 4321 tgtgattgtt tgaatgaact aatagatcac ctgtgaaact ggacagcact tgggtttgaa 4381 ccctagtggt gacatttcac aagctgtatc acctggagca agtcacttcc ctacggtgaa 4441 ggacaaggat tttatgacat aaaagcgttg ccctagaagc agtgtctgct atggattatg 4501 tgcccatgaa aaggcagctg ctgtcatcat cgccaccacc attattattg ttattattat 4561 tacattgtca ccccagagat aatcctgcac cagtgacagg gaaaatagca gctggcatcc 4621 gctgcctgct cacgaccaca cgccaggctt ccgtcaaacc actcaacatg tattagtaat 4681 cttttaatcg acctacatat tgttttaatt tgcatgtgtt aatgcgttga tctatgagat 4741 gggtattatg atgagctctg ttctgcaggg gagaaagcag aaacatggag aatttaagtc 4801 atttccccca aatcacaaag tcaggaagaa acagacctca cggagctcgc tctctgtcat 4861 tgcatcacac ttcctgccct tacaaggcaa attggataaa tgccattcta gagaagcaga 4921 caaaattcaa gtgaagaagg ggagaggaag acgtcggctg gggcctgctt agagcatccc 4981 agctgagact gcatgaggag ggaggcacgc agttgtggaa tttgttcccc ttttagcatg 5041 ctgaccagcc ctggcaacgg agctcaaggc atctatgtgc cactgctcaa cagtgagtga 5101 cgtcatgggc acggccaggt ctttatcagt tctgccggat aaatagccaa ctgcactagg 5161 tctggagaga cagcaaggtg ctgtgcggca gagcatttgg ggtctcaaag aagcaggtga 5221 gcctgggccc gaggggctgg gtggaggagc accttggtgc ttctctgctg gggaagggac 5281 aggggacagg gcatgctcag gaagacaggc aggctgaccc cgcctggaag gcacccagag 5341 acaagagggg tgggcgtagt gacctcgtgc ccttttaggg gagatgctgc tggccagagg 5401 ccgttagggc ccccactacc aactccatgt tactctctct caccagtggc caccaccatg 5461 gatacaggcc ccgaccagtc ctacttctcc ggcaatcact ggttcgtctt ctcggtgtac 5521 cttctcactt tcctggtggg gctccccctc aacctgctgg ccctggtggt cttcgtgggc 5581 aagctgcagc gccgcccggt ggccgtggac gtgctcctgc tcaacctgac cgcctcggac 5641 ctgctcctgc tgctgttcct gcctttccgc atggtggagg cagccaatgg catgcactgg 5701 cccctgccct tcatcctctg cccactctct ggattcatct tcttcaccac catctatctc 5761 accgccctct tcctggcagc tgtgagcatt gaacgcttcc tgagtgtggc ccacccactg 5821 tggtacaaga cccggccgag gctggggcag gcaggtctgg tgagtgtggc ctgctggctg 5881 ttggcctctg ctcactgcag cgtggtctac gtcatagaat tctcagggga catctcccac 5941 agccagggca ccaatgggac ctgctacctg gagttccgga aggaccagct agccatcctc 6001 ctgcccgtgc ggctggagat ggctgtggtc ctctttgtgg tcccgctgat catcaccagc 6061 tactgctaca gccgcctggt gtggatcctc ggcagagggg gcagccaccg ccggcagagg 6121 agggtggcgg ggctgttggc ggccacgctg ctcaacttcc ttgtctgctt tgggccctac 6181 aacgtgtccc atgtcgtggg ctatatctgc ggtgaaagcc cggcatggag gatctacgtg 6241 acgcttctca gcaccctgaa ctcctgtgtc gacccctttg tctactactt ctcctcctcc 6301 gggttccaag ccgactttca tgagctgctg aggaggttgt gtgggctctg gggccagtgg 6361 cagcaggaga gcagcatgga gctgaaggag cagaagggag gggaggagca gagagcggac 6421 cgaccagctg aaagaaagac cagtgaacac tcacagggct gtggaactgg tggccaggtg 6481 gcctgtgctg aaagctaggt cctccggggg aggagggtgt agctggcgtg tcatcctcag 6541 ggcgcttcct cgctcacacc aggagggact tggagtggcg agctggggcc cgatggggct 6601 tgggggcaga gtagacatct agcctcccta agggtatgcg cgctaaagcc cagctctcga 6661 tctcacctcc atccccatcc acccacacac tatggattgg gctctgggaa ggggtcaggg 6721 tgagaggctg ctctggagaa caatgaggtc ctcatagcag caggcagctc ctgtgttttc 6781 ttgagggtgg cagaggagct aagagcagtg cccagggtct gagggggctg cccagtgagt 6841 ggcaggggca ggagagggga gaaccccatc ctcagagctg ctcccagcca gcgagtcagg 6901 agcgggggag acagggctcc agggatgagg ccgcattctg ctcccacagc gccttttcca 6961 gaaagttccc attgctcaat aaatgtggat catcagagac atttatgaac aatgacagaa 7021 gaaaaattac ccaaataaat gtggaagcaa gcaaaagaga acagtgtttc cttcttctcc 7081 tgttttgttc tggtggtgtg cttgggccgg gtgggactgg tggatggaag gagaaaacac 7141 cagactctgg aggaaaaggg ccaaacacca ggatgcctgg atgctgggag aggatctggc 7201 ttgcagggat gaaaataaca gctgccctgt ctaaaggact tggcctgaca catcatctcc 7261 ttctatctct caatagccct gtgagaggta ccagcattat ccccagtttc agatgaagga 7321 gtggcccaga gaggtgacac ctcctacctg agatcccata gctggtgggc gattgaagtg 7381 ggaccagaag ctggtgtcag ttgactctga cacccatgcc cctaagccac tctgctgttc 7441 tccatctgtg tgtcacctgt ggtcacctgg cccagtcaaa tgtcccaagt cagataagtc 7501 tgtctgagca ggacaaatat aaacaggcca tgataagaaa cagaactaaa tgcaaagcgt 7561 attgactatt tgtgtgggtt caaccttttc ccagtaaaaa agccctcttg gaaaaaaaga 7621 aaaaaagata attagtcagc acaatggact tcaggtttct gcaaagtcag agagatgttc 7681 ctcagagctc tattcatcaa aatatggcca ttcaggaagt tttcatgttc cctgagtggt 7741 ggcccatgag aaatagctgg tcaggacctc ctccccaagc aaggccacct gagactcgcc 7801 agcccctgtt tgcatgctgt cagcagtatt tcagaagcat ttgttgaaca cctattgcat 7861 gcagatggct gggctcagaa cttcggggag ttcgacacta aggtctttaa atatctgctt 7921 cctccctggg gttaagtgtg ccgggagaat gaagactggc tgtgtgtggg gaagtggatt 7981 tcaggtggaa ggagagggac tctgtggaga gttgaaacca tgagctgtct ctctgtccca 8041 cggctcacct gtccatcccc agggcactca gccctggtgg ctgcaaagaa gttgatgcag 8101 gtaggggtgt tcatggctca gaaaggggac aaaggtgcca tggaatcctg caacttcaga 8161 atgggcaccg ccaccccaag gcagagaaca gaaaaactga aaacctcaca gtcgaaagag 8221 agccatgaag agctggagtc tccctcctcc cccaggttta taagagggaa taagaggcag 8281 cttcatgggc tttgctgtcc ccagctatct cccagcccag cccagtctgt ataagcaaaa 8341 gcccaggggt ggggaagatg aaaagccatc agggtgtggg tgatcgtggg agtgggggtg 8401 aggaggatgt tggagcattt tccatacatc tcagaagccc agggaacggg gctgtcatag 8461 aacgggagat gcatctgggg ctgccgcttg gtgccatcta ggtggggtgg gagaaaggca 8521 gagagaggag gtgggagaag actgtggggg gtgcagcagc tggcaccctc tccatcgggc 8581 agagcaagta ggtgtgaatc tcccatgggt gaagcgatca cacgcaaaaa ggagggagat 8641 ggcatctttg gagggaggtg gtgataggct gagaccccct cgttgtcccc cccgaaacct 8701 ggtgaccgga gctgccagga ggccacacag agagaggctg atgatggggg acctggaacc 8761 tggaaggggc agagtggtca gatgacccac acacgcctga gaaaacagat gtggagaggc 8821 tgtgttgtca ttctgaggca ggaaacagag acagacagac agacagacag acaccggcac 8881 agtgagagac agagacagat tcacagagca gagctggctt ctcactggca ggcatgggca 8941 cacctgagag agcatccaga gaggttagaa atcttcctta ggtcctcctg ggaggctcaa 9001 gtggccccac cggggagctg aacagctggc tggatttcca gggttgagag aaggaagctg 9061 cgcagccggc cagggccaca tcccctgtgc ctcagggagg gacccccagt gtccccacca 9121 cactccccag gaggagtcct tggagttgag gccactgcct gagtcctcca gtgccagcca 9181 gcggaggttg ctcccgtccc ttgtcttcct cctccccatc cttctgccag ccctggaaag 9241 gtcagaaacc accattgcgg agctagaggg gaaggagagg cagaacagag gggcctctcc 9301 tcgcaaaaac taaaataagc ccacagcctg tgcagaaccg cacccgggtg tgatcactcc 9361 ttaccccaca cacccactgc cacttggaga tggtttcacg gctgcagtca cagggcaccc 9421 tgtgttgtca ggggtgagcc agcaaggccc ggggcctgcc caggggtaca ttgagggtgg 9481 agaagacctg gccccagtgc aggtgtgaac aggcaggagg gctggggagg gaatagcgct 9541 gggttgtggc cgagccccca agccctgctc ttgagaggca ctggctacac actcggcccc 9601 tacccagaca ggacgctgat tcctccctgg cccttctgtg ttcctgcatc ttccaagacc 9661 ctcacctcct ccagtgccgc atcagctcca gccaggatgc cccatcagct tctcacccaa 9721 gctgctagaa cctcaggcta atagccctgt cattccagct cactcccccc gtccccttct 9781 ggctcactgc ccatcctggg catcctccat cagcccgctg aggggactgc tgagcctcag 9841 ggacatctgg cagcaccagg aggtgggtcc ctctctctgt ggctcctctc ttcacctgac 9901 tgttccccag acccccagac cacccctgcc caggtggccc ctgagaactg ctaagctcaa 9961 ccgtgacatg gatggggccc agctgaccct tgggctctcc ctgggggacc cccttttcct 10021 gtccctggcg acaccctcac cttattcaca cccgtctcct ccctgctgcc ccgccggctt 10081 tccactgtgg gtgtcggggg gtcagctcct gccttgccag ggaaccaggg aaggcagtgg 10141 gctcaggagt caaactccca ttcctagaga agactcaggc agtgggccca gccctgttct 10201 gaattctctc cacccctcct ccccacagag tctcagaaat caagaagctt tcttcctctg 10261 ggacccacgc gtgaagtcct atctcctgtg accctgcagg tgctcccctc attttaagag 10321 atgatgtgga gagaaagcac aggactgggg gccacagctt ctcatccagg ctccgctgct 10381 ccctggctgc tgcattcctc tgtgggcctc cgtttcctcg gtgatcccac caggcgaatt 10441 gtaatcctcc aggcagggct ggaacccatg agtgatccca ggagcccctc ctctctgggg 10501 cctcaagatc tgaggacggt gagccagttg tacacaggag ccgtgcaggc caggacggtg 10561 gccccatgga cctgcccccg cagctctcct tagccctcta tgggaccacc tggcttgttt 10621 cacttagcgt aatattttaa agattcactc gtgatatagt acagcagtcc ccaaactttt 10681 tggcaccagg gactggtttt gtggaaaact atttttccat agacctcagg ggagtggggg 10741 gatggttttg ggatgaaact gttccacctc agattgtcag acattattta gattgccata 10801 aggagcgcgc aacctagatc cctcgcatgt gcggttcaca atagggtttg cgctcctgtg 10861 ggaatctaat gctgccgctg atctgacagg agctcaggcg gtcatgccag cttgtctgcc 10921 cactgctcac ctcctgctgt gtggctgggt tcctaacaga ccactcacct gcacgggtct 10981 gcagcccggg gttggggagc tctgacgcag taggtatagg cagctcattc ctttttattg 11041 ccaaatagaa ttccagtgta tggatgtgcc actattttta tccactaact agcagaaggg 11101 cctccaggtg aggtttccac ttttagttct ttttcttttt ctttcttttc ttcttctttt 11161 ttttttttat actttaagtt ctgggataca tgtgcagaac atgcaggttt gctacatagg 11221 tatacatgtg ccatggtggt ttgctgcacc catcaacccg tcatctaggt tttaagcccc 11281 acacgcatta ggtatttgta ctaatgctac ccctcccctt gccgcccatg ccccgacagg 11341 ctgcggtgtg tgatgttctc ctccttgtgt ccatgtgttc tcattgttca gctcctgctt 11401 atgagtgaga acatgcagtg tttggttttc tgttcctgtg ttagtttgct gagaattact 11461 tttggttctt atgatgtatg ctgctacaaa cattgaggac tgagtctttg cttagacaaa 11521 tgttttcact tattttaaat aaatacctat ggtggaatta ttggatcaca tgttaaattg 11581 atgtttaact ttttaagaaa ctgccacctg ctttgcaaag cagctgtacc atctgacatt 11641 ctcagcagca atgagaggcc cctgtttctc cacatcctca ccatcacttg ttattgtctg 11701 ccttttgctt atagccattc tagtggatgt gaaatggcat cattgtgctc cgtaagtatt 11761 tgaatgaaga aataaattac ctctgcactg gagagcactt gggtttgaac ccgggtggtg 11821 atatttaata agctgtatca cctggagcaa gtcacttccc cactgtgagg cacaaaggat 11881 ttcatgacaa taggggatgc cttgagaagt ggagtatctc tggatctgtg gattatatgc 11941 ctaccgaaaa gcagttgtta tcatcaccac cgccaccatt attattatta ctgtcacccc 12001 agagacatgc ccacacctgt gacctggatg ctggcagctg acaggtgctg aacacacacc 12061 atcacatgcc aggcctcctt caaagtactt gacacagctc ttatatggga tttcacaaag 12121 ggcggcaccc aactctccct cactacagga gcagcagggt ctcagcaatg taggacagtt 12181 cacagagttg catgctctgg acttggggta gagactctgc catgatgcct atcttaattg 12241 tagctggcag agaggcggat ctacagtcca caacagcact ttggctggga accaaaggat 12301 gaagacttta taaccagaag gtcatgtgcc atgtaacagg ggtgtaggga agtagatcat 12361 gcttctgttg gcttaggaca aggagctggt gcacccctta cccctttctg tgtcacttca 12421 gagcacccca tcacgatctc ttcctgccac tcctgtcaag gaaggtgtga ccactgggcc 12481 ctggcctacc tgccagctct tcttaagtgc catctactag acttcagctt aaattgcacc 12541 acgaaacaaa aatatattgc tacaacatgc ggcatctgag aaagcaaccg catgaaccta 12601 tctgcaacca agaacttgta cagagccttg gccctccgaa agcatccaga aatgaagcca 12661 gttgatcata cacaacacac accatagtca taccttcaac agaaaaaaag aataaaaaaa 12721 ttaaaaagcc ccacccaaat gaaagcaaat tttgaaaaag aagcatcagc cctctcagat 12781 gagaaggaat cagcacagga ctctggcagt acaaaaagcc agaatgtttt gtcacctcca 12841 aaggatctca ctagctccct agaaatgtat cctgaccaaa ttgaaatgct gaaaatgaca 12901 gatctagaat tcagaatgtg agtggcaagg aaactcagca agacccaaga gaaagttgaa 12961 atgcaatact aagaagccgt aaaaatgagc caggatgtga aagatatata tatagggggg 13021 gaaaaaaact atatatagag agaaagaacc aaaacacaaa aagaatcaag cagaacttct 13081 ggaattgaaa aatttactac aggaatttca aaatactgtt ggacacctca acaatagact 13141 agaccaagta gaagaatttc agaggcagaa tgaatttttt aaattaacct agtcacacaa 13201 aaataaagaa aaataatttt aagaaatgaa caaaggcttt gagaagtata ggattatggg 13261 aaatgaccaa actatgaccc actggcattc ctgacagaga agaagaaaaa ataagcaact 13321 tggaaaatgc attttaacgt ataattcagg aaaatgtccc taatcttgct aaagaggtca 13381 acatgcagat ataagaaatt cagggactgc ctgagaggta ctatacaaaa tgaccatccc 13441 caaggcacat agtcatgaga ctgcatgaaa gaaaaatatc tttaaggaag ctagaaaaaa 13501 gagccaaatt acctataaag gaagtccact cagaataaca gtggatttct tagcagaagt 13561 cttacaattt agaagagatt gggtgcctat ttttggcttc ttttttttga tggagtctta 13621 ctctgttgcc caggctggag tgcagtggtg cgatcttggc tcatggcaaa ctctgcctcc 13681 cgggttcatg caattctcct gcctcagcct cccaagtagc tgggattaca ggcatgtgcc 13741 accatgcccg gctaattttt ttgtattttt agtagagacg gggtttcacc atattggcca 13801 ggctggtctc aaactcctga ccttgtgatc cacccacctc agcctcccaa agtactggga 13861 ttacaggtgt gagccaccac gcccagcctt ctaggcttct taaagaaaaa atgccagcca 13921 agaatttcat atcctaccaa actaagtgtc acaaaggaga aataaaatct ttcctagaca 13981 atcaaatggt aagagaattc atcaccacca gactggtcca acaagaaatg ctcaaagaag 14041 ctgtaagcat gaaaacaaaa gaatgatact tgctaccaca aaagcacaca taagtacaaa 14101 gcccacagac cctataaagc aactacccaa ccaagactac aaagtaatta actaacaaca 14161 cgaagataaa tcacaacagg gacaaaacct cacatattaa ccttgaatgt aaatggccta 14221 tacactccac ttaaaagaca tagtgtggaa aattggataa aaatatgaga cccaaccttc 14281 tgctgtctta aagagaccca tctcacatat aatgataccc aaaccctcaa agggttggag 14341 aaagatctct catgcaaatg gaaaacaaag ggttgctatt gttgtatgag ataaaacaga 14401 ctttaaacca acaacagtaa aaaagcacaa ataagggcac ttcctaatga taaagggttc 14461 gattaaacaa gaagatttaa catcctatat atataaacac ccaacgctgg agtacctaga 14521 tttataaaac aattactacg atacctaaga aaagagatgg acagccacac aataatagtg 14581 gggttcttca acagcccatt gacagcatta gacagatcat tgaggcagaa aactaacaaa 14641 gaaattctgg acttaaattg gacacatgat caaatggacc taatagacat ctacagaata 14701 ctctacccaa caaccacaaa atatacgttc ttctcatctg cacacagaag atactctaag 14761 attgacccaa tgcttagtca tgaagcaaat ctccataaat ttaaagaaat gaaaattata 14821 ccaaccttct tctcagacca cagtggaata aaaatagaat tctcaaaatc acacaaatac 14881 atggaaacta aacaactatt tcctgaatga ctttggggta agtcattaag gaagaagtca 14941 aaaaattaag gcagaagtca aaaacttctt tgaaacaact gaaaatagag gcacaacata 15001 ccaaaacctc tgggatacag caaaagcagt tttaagagga aattttatat cactaaatgc 15061 ctacatcaag aagttagaaa gatctcaaat taataaccta acattacacc tgaaggaact 15121 agaaaaacaa gagcaaacta aatccaaagc tagcagaaga aaagaaataa ctaaaatcag 15181 agtagagcta aacgaaattg tgacaaataa gccattcaaa ggatcaatga aatgaaaaat 15241 tggttctttg aaagaataaa caagatttat aggctgcaag ctagattaac aaaggaaaaa 15301 aaaaaacctc caaaaaagca caatcagaaa taacacagac aagattacaa ctaattccac 15361 agaaatacaa aagatcctca gagactgcga tgaacatcta tattcacatg aatcagaaaa 15421 tctagaggaa acaaataaat ttctggaaac atacaaccaa gattgaacca gaaagaaatt 15481 gaaactctga gcagaccaat aatgagttac aaaattgaat cagtaataaa atatctacca 15541 cctttaaaaa gcccttgacc agatggattc acagccaaac tctaccagac ctgcaaagaa 15601 gagatggtac caatcctact gaaactttac cagaaaattg aggaggagag attccctact 15661 cattctacaa aaacagtctg attctgatat cgtaatctgg caggacacag tgaaaaaaga 15721 aaactgtagg ccaatagccc gatgaacata gatccaaaaa tcctcaataa aatactagca 15781 aaccaaatcc agcagcatga caaaatcata attcagcatg atcaagtggg ctttattcct 15841 gggatgcaaa gatgtttcaa tatatgcaaa tcaataaatg tgactcacca cataaaatta 15901 aaaacaaaaa ttatatgatc atctcaatag atgcagaaaa aaacatataa taaaatccaa 15961 catcccttca tgataaaaac cctcaaaaaa ctaaacatcg aaggaacaca cctcaaaata 16021 ataagagcca tccaggacaa actcacagcc aacatcatac tgaatgggca aacgttggaa 16081 gcattccccc taagaaatgg aaaaagacag ggatgcctac tcccactact cctattcaac 16141 atagtagtgg aagtcctagc cagagcaatc aggcaagaaa aaggaataaa aggcatccaa 16201 atagaaaggg gaagtcaaat tatctctgtt cactgatgat atgattctat acctagaaaa 16261 ccctacagat tcctccaaaa tactcctaga cctgataaat gactggcagt ttcaggatgc 16321 aaaatcagtg tataaaaatc agtagcatta ctatatacca acaattgtca agctgtgagc 16381 caaataaaga acagaatccc aattacaata gccacacaca aaaataaaat acctaggaat 16441 acatctaacc aaggaagtga aagtgaaaga tctctacaag gagaactaaa aaacacaaat 16501 aaatgaaaaa cattccatgt tcatggatta gaagaatcaa tatcattaaa atgtccatac 16561 tgctcaaagc aatctacaga ttccacccaa ttcctaccaa attatcaatg tcatttttca 16621 taaaatggaa tcattctacc aaaaagacac ctgcatgtgt atgtttgtca cagcactatt 16681 cacaatggca aagtgatgga atcaacttag gtgcccatca atggtggact agataaagaa 16741 aaaatggtac atacacatca cgaaatacta tgcagctata aaaaagaatg aaatcatgtc 16801 ctttgctgca acataaatgc agctggaggc ctaagtgaat gatcctaagt gaattaacac 16861 agaaacagaa aatcaaatgc tgtatgttct cacttttaag tgggaggtaa acaatgggta 16921 cacgcaaaca ttaaagatgg aaaccataga cagtggggac tccaaagtgg gggagggagg 16981 gcggaggaca agggtgggaa aactgcctat tgggtactat gttcacaatt ttggtaatgg 17041 gttcaataga agcccaaacc tcagtattat gcaatatacc catgtaacaa acctgcacat 17101 gtgctcccga atctaaaata aatattttca aaaagcaatt gacctatatt attaacattt 17161 taatctacct gtatatatgt tttaatctgc atgtgttcat gctttgaaat atggtacagg 17221 tactatgaca atccctgttc tgtaggggag gaagaagaga cacatctctt cccccacatc 17281 acaaagtcag gaaggaacag acctcacgga gtccagtctg tcactggctc atgctttgtg 17341 cccttaaaag gcaaactgga taagtgccat tctagagaat cagacaaaat tcaagtggaa 17401 gagaggggag gggaagaaga cgtcagctgg ggcctgtttg gagcatccca gctgaggctg 17461 cgcaaggagg gaggcacgca gttgtggaat ttgttcccct ttttgcatac tgaccagccc 17521 tggcaacgga gctcaaggca tctttgtgcc actgctcaac agtgagtgat gtcatgggca 17581 cggccaggtc tttatcagtt ctgccggata aatagccaac tgcactaggt ctggagagac 17641 agcaaggtgc tgtgcggcag agcatttggg gtctcaaaga agcaggtgag cctgggcccg 17701 aggggctggg tggaggagca ccttggtgct tctctgctgg ggaagggaca ggggacaggg 17761 catgctcagg aagacaggca ggctgacccc gcctggaagg cacccagaga caagaggggt 17821 gggcgtagtg acctcgtgcc cttttagggg agatgctgct ggccagaggc cgttagggcc 17881 cccactacca actccatgtt actctctctc accagtggcc accaccatgg atacaggccc 17941 cgaccagtcc tacttctccg gcaatcactg gttcgtcttc tcggtgtacc ttctcacttt 18001 cctggtgggg ctccccctca acctgctggc cctggtggtc ttcgtgggca agctgcggtg 18061 ccgcccggtg gccgtggacg tgctcctgct caacctgacc gcctcggacc tgctcctgct 18121 gctgttcctg cctttccgca tggtggaggc agccaatggc atgcactggc ccctgccctt 18181 catcctctgc ccactctctg gattcatctt cttcaccacc atctatctca ccgccctctt 18241 cctggcagct gtgagcattg aacgcttcct gagtgtggcc cacccactgt ggtacaagac 18301 ccggccgagg ctggggcagg caggtctggt gagtgtggcc tgctggctgt tggcctctgc 18361 tcactgcagc gtggtctacg tcatagaatt ctcaggggac atctcccaca gccagggcac 18421 caatgggacc tgctacctgg agttctggaa ggaccagcta gccatcctcc tgcccgtgcg 18481 gctggagatg gctgtggtcc tctttgtggt cccgctgatc atcaccagct actgctacag 18541 ccgcctggtg tggatcctcg gcagaggggg cagccaccgc cggcagagga gggtggcggg 18601 gctggtggcg gccacgctgc tcaacttcct tgtctgcttt gggccctaca acgtgtccca 18661 tgtcgtgggc tatatctgcg gtgaaagccc ggtgtggagg atctacgtga cgcttctcag 18721 caccctgaac tcctgtgtcg acccctttgt ctactacttc tcctcctccg ggttccaagc 18781 cgactttcat gagctgctga ggaggttgtg tgggctctgg ggccagtggc agcaggagag 18841 cagcatggag ctgaaggagc agaagggagg ggaggagcag agagcggacc gaccagctga 18901 aagaaagacc agtgaacact cacagggctg tggaactggt ggccaggtgg cctgtgctga 18961 aaactaggtc ctccggggga ggagggtgta gctggcgtgt catcctcagg gcgcttcctc 19021 gctcacgcca ggagggactt ggagtggcga gctggggccc gatggggctt gggggcagag 19081 tagacatcta gcctccctaa gggtatgcgc gctaaagccc agctctcgat ctcacctcca 19141 tccccatcca cccacacact atggattggg ctctgggaag gggtcagggt gagaggctgc 19201 tctggagaac aatgaggtcc tcatagcagc aggcagctcc tgtgttttct tgagggtggc 19261 agaggagcta agagcagtgc ccaggtctga gggggctgcc cagtgagtgg caggggcagg 19321 agaggggaga accccatcct cagagctgct cccagccagc gagtcaggag cgggggagac 19381 agggctccag ggatgaggcc gcattctgct cccacagtgc cttttccaga aagttcccat 19441 tgctcaataa atgtggatca tcagagacat ttatgaacaa tgacagaaga aaaattaccc 19501 aaataaatgt ggaagcaagc aaaagagaac agtgtttcct tcttctcctg ttttgttctg 19561 gtggtgtgct tgggccgggt gggactggtg gatggaagga gaaaacacca gactctggag 19621 gaaaagggcc aaacaccagg atgcctggat gctgggagag gatctggctt gcagggatga 19681 aaataacagc tgccctgtct aaaggacttg gcctgacaca tcatctcctt ctatctctca 19741 atagccctgt gagaggtacc agcattatcc ccagtttcag atgaaggagt ggcccagaga 19801 ggtgacatct cctacccgag atcccatagc tggtgggcga ttgaagtggg accagaagct 19861 ggtgtcagtt gactctgaca cccatgcccc taagccactc tgctgttctc catctgtgtg 19921 tcacctgtgg tcacctggcc cagtcagatg aggctgtctg agcaggacaa atgcaaacag 19981 gctgtgataa ggaacagaag taaatgcaaa gcacagtgag tatctgtgtg agtgcatcct 20041 ctccccagta aagaagactt cttgaaaagt atagattacg atacttggag tcaatgattg 20101 ggcaggtgac tcacagtggg cacccgatgc tcctcgatct tagctaatgc aggatctctt 20161 caacttttta catccttgtc cccttcagaa gcctctttgg atatacatct ttcctacaca 20221 gcacattcct ctgcctgctt atgcaatttc agtgcaacca acacactgtc ggtttatgtg 20281 ttgtggccct ttacaaagac agaaactggc tgggcgcagt ggctcatgcc tataatccca 20341 gcactttggg atgccgaggc aagagaatca cttgaggtca ggagtttgag accagcctgg 20401 ccaacatggt gaaactttgt ctctactaaa agtacaaata ttagccgcat atggcagcat 20461 gcacctgtaa acccagcttc ttgggaggct gaggcaggac aatcacttga ggccaggagg 20521 cagaggttgc agtgagccga gattacacca ctgcactcca gcctgggtga cagagtgaga 20581 ctctgtctca aaaaacagaa ggaaggaagg aaggaaggaa ggaaggaagg aaggaaggaa 20641 ggaaggaagg aaggaaggaa ggaaaagaaa gaaggaagga aagagagaaa gaaagaaaga 20701 gagaaagaag gaaagaaaga gagagagaga aagaaagaaa gaaggaagga aagaaagaaa 20761 gagagaaaga aagagagaaa gaaggaaaga gagagagaga gaaagaaaga aggaaggaaa 20821 gaaagaaaga aggaaagaga gagaaagaaa gaaagagaga aagaaggaaa gaaagagaga 20881 gaaagaaaga aagaaagaaa agaaggaaaa gaaagccaca aaccattgtg agattttttg 20941 ttgttgttgt tccccaagaa ctagttttca cctatttggg atcaacattg ctgctgctga 21001 gaatgtctga gctgcagaaa tggtcccatc taatatactg atagagcctg tctcacagaa 21061 ctgtagccca acaaaattag gctgaggacc ttgctgcagg aatggagatc acactacgga 21121 agaagtgtga ggtctcgtac aagagaagag acccagttat catcgaattt gagctaagag 21181 tagcgtgaat ttagatgaag aggtgtgtga taggctcaac acaaagcagg gatgtgtgga 21241 aagaggccag tgttagatgg aggccaggaa gattactcaa ggtcctgttt ctctggaaaa 21301 taccaagcaa agataaaggt agaacttcgt attggaaaac tccttatctg aagctctggt 21361 gggttgaaaa tcaaggttgc atctctgtgt cagagactct gatcctctaa gcagaagtat 21421 atgttccttt cttactcatg atctcatgca gctaagtttc tgactatctt tgacttttga 21481 gaacaaggtt tcttagcaat atccttttgt taattagcaa aacaagtttt taaaggttta 21541 caagagaaac ttttttagaa ggagagatcc tccttcaccg aaaccaaggg caggtcaacc 21601 cagaaggaca agaggaaggg aagattcaac ccatctcggc tacagcttcc tgaggccata 21661 ggtcaagtct cagcttacag gggagtgaga ggaaacaaga aattttaaat cacccagtgt 21721 tggttgggtg cgatgactca cgcctgtaat ctcagtgcct tgagaggctg aagtgggagg 21781 actgcttgaa gcaagactcc gtaatctaag aaaaaaagtt agtcgggcat agtggcatgt 21841 gcccatagtc ccagctgctc atgaggctga ggcaggaaga tcgcttgaac ccaggaggtc 21901 gaggctacag tgagccgtga ttgtaccact gcactccagc ctgggtaaca gagcaagacc 21961 ctgaaaaaaa aaaaaaaagg aaagaagaaa ggaagaaaga aagagagaaa gaaagaaaag 22021 gaaagaagga aagaaagaaa gaaagaaaga aagaaagaaa gaaagaaaga aagaaagaaa 22081 gagaaagaaa gaaagaggaa agaaaggaaa ggaaaggaag aaaagaaaag aaaagaaatg 22141 agtaaatcac caagagtgcg ggaagatcgc ttgagcccag gaggaggtcg aggctacagt 22201 gagccgtgat tgtaccactg cactccagcc tgggtaacag agcaagaccc tgaaagaaaa 22261 aaaagaaaaa ggaaagagag gaaagaaaaa agaaagaaag gaaggaagga aggaagggaa 22321 agagagagaa agaaagagag aaagaaagga aaagaaaaga aagaaatgta taaatcacca 22381 agcgtgacca gaaacctttg gaagccacct aagaaactga caaagatggt tcaatccatc 22441 ccatggaagg cagtgagacc attgcaggct tgagcccagt acgctcagac tactcagtaa 22501 accagtggca gtggcagaca ggagagagag caggaattcc ccactgggag acaggatgcc 22561 agtgcctttt tatgagttaa acatgtatcc actagagagt gcagtaagtg gcctctcagg 22621 ctgtgcatat gcctggtggg gcatgggaag ctgtgctgca agacaggatc cgtcagccgg 22681 cttcaactgg gttcagctgg gagggggcag acatagacct ggtcagggcc ggcggaggtg 22741 gaagggaggt aggaggctgg acttgacttt ggaggtgggg ctcagacact ggaccaaatt 22801 gaggactagc taaaacaggg aaacaaggac ggagcttaag cgacttccca taggacatgc 22861 ccaccagtgt cccctgtcag tttaccatgg tcatagcaac agctggtaat taccaccctt 22921 tttctaaaac tttgggcatt aaccgcccct taatctgcat gtaatgaaaa atagatataa 22981 atatgaatgc aaaactgccc taagcggcta ctcttagcac attgctatgg ggtagccctg 23041 ctctgcaggg gcagtcacag agctgttgca ctgccagaac tataccacca tcactttaat 23101 aaaaccactt tcttctacca ctggctagcc tttaaattct ttcctgggtg agctgggtgc 23161 agtggctcac acctgtaatc ccagcacttt gggaggctga ggcaggcaga tcatttgtga 23221 tcaggagttc gagaccagcc tgggcaactt ggtgaaaccc agtccctact aaaaaataca 23281 aaatttagcc aggcgtagtg gcccgcacgt gtaatcccag ctactcagga ggctgaggca 23341 ggataattgc ttgaacccag gaggcggagg ttgtagtgag ctaaaattac accactgcgc 23401 tccagtctgg gtgacagagt gacacaccat ctcaaaaaaa aaataaaaat ttaaaaatta 23461 aataaataaa ttctgggttc tgggcaaagc caagaacctt cccaggctaa acccctattt 23521 agggctcatc tgccctacat cagaagggaa acaaagacca gacaaggaga gcaagaagcc 23581 tgggagtcta caaaataagt tgcgggactc ttccaaaaaa atgaagctga gcatcgataa 23641 aagctcaaag acacctggaa gaaaagtgag accaccgttt ctgggaatta caagtaggag 23701 acagaggttt ttattggggg agaatagtgt gagaaggtgc tcaacttttg accttcctct 23761 ctgaggacat gactcttctg gtttagaaaa caacagtaaa aagtccccaa gagatactgg 23821 tggttgtggt tcccaaaact tcattaacat ctgtgggaag ccttctgact ttcctctgag 23881 gccagttctg aaaagccatt catatggagg caaatcaaaa tatcaaagca gcagcctgtg 23941 ttctgggaaa atgttgctgg gatcacattc cagagaaaag atgggataga aggaatgaaa 24001 gctgatcaac atggctgggt gcggtggctc acacctgtaa tcccagcact ttgggaggct 24061 gaggcaggca gatcacctga ggtcagaagc tcaagaccag cctggccagc atggtgaaac 24121 cccgtctcta ctaaaaatac aaaaattagc cgggcatggt ggcgtgtgcc tgtaatccca 24181 gctactcagg aggctgaggc aggagaatcg cttgaacctg gtaagcagag gttgtgaggt 24241 tgcaaagtta cagtgagctg agattgcagc actgctctcc agcctgggca acaagagaga 24301 aaactccatc tcaaaaaaaa aaaaaaaaaa aaaagaagaa gaagaagaag aaagctgatc 24361 aacactggta gaagtgggga agggcagagc tcacctccca tgggagccag ggtggagcta 24421 gcagagagaa aatgttgtga aagaggaaat gggatacgga aatatttttt gaggagaggg 24481 tgcttttaat ggatttagaa gaatatagtg gctggatatg gtggctcatg cccataatgc 24541 cagcactttg ggagtctgag gcaggtggat agcttgagcc caggagttcc agaccagcct 24601 gggcaacatg gctaaaccct gtctctacta aaaatacaaa aattagagga gtatgttggt 24661 gcatacctgt agtcccagct acttgggggg ctgaggtagg aggatcacct gaactttggg 24721 aggtcgaggc tgcagtgagc catgatcagg ccactgcact ccagcctgag tgataaagtg 24781 agacactttc tcaaagaaaa ttaaaattaa aattaaaatt aaaagaagag tatagtgaga 24841 taacccaatc aatcatcccc tgttgcagat cccccagtct caagggtcaa attgtgattg 24901 gcactaatca tgatgctgtt gaccgccttt actggtgact ggtcctaaga tggaagtgtg 24961 acccaattcc agccaacaaa gggaaataag tctgctgggg gcctctggga aagatttttc 25021 tctatctcat taaaaaaata gatgtgtaag gagctgcgct cctacttcta atctggaact 25081 ttctatggaa ggatgtgata cttggagcta cagcagccat attggggcca tgaggttaca 25141 aacttgagga agaaagcaaa caagctaagg acagcagagc agcaggatgt aaagggcctg 25201 cgtccttgac cctgttgagg tgggaccacc aacctccaaa gtcactgtag taaatgttcc 25261 ttttgttgac tcactattgg ccagataccc tctaacttgc agcttaacat aatagatcag 25321 cttatatcct actagaacct acccattgac atcttctctc acgttaaaat attttaggtt 25381 cctggagttt ctaaactctg cctatttgcc ttataggagc tatgtgatgt tagcatttgc 25441 cttatttgtt acagctgaac tcaggagatg tcttacagtt ctagacacta gctctagcac 25501 caagatgtct ggactctcct cattcagaag aaggcttttg agagctgact ggctggggct 25561 gcatgaacat tgctttccaa gacaaaaaaa cacacgtgct agatgggacc tttttaaggt 25621 aaaatgttca tctgactgac aatcttaatt gcaagcctgc acttttatta ataaaaattc 25681 tcatacttga gcagatcacc tgaggtcagg agatcgagac cagcctagcc aacatggcaa 25741 aaacctgtct ctactaaaaa tacaaaaatt agctgggtgt ggtggcacag gcctgtaatc 25801 ccagctactt gggaggctga ggcaggagaa ttgcttgaac ccaagagatg gaggttgcag 25861 tgagccaaga tcatgccact gccttccagc ctgggcaaca gagtgagact atttcaaaaa 25921 aaaaaaaatt ctcatactta ctaagggctc ttggatgcca tggaaacaag agagggctga 25981 aagacctctc tttcttgatg tggaccatga gcgatgttat cctgtggaaa agtacttgtc 26041 tctgttcagt gttcttgggc acaggacagc tacccagtcc ttgtgcctgt ggagtcactc 26101 aaacacacac aatgcccaca tgaagatggg gatatgcggt cttggatgcc taatttacta 26161 cttaaaatta acacacacaa ttagatgatt cccttttccc tcccctcccc tttttttttt 26221 ttttttgttt cttttttttt ttttctgtga cagggtctgg ctctgtcacc caggctggag 26281 tgcagttacg caatcatggc tgactgcagc ctcaacttct caggctcaag caatcctccc 26341 gcctcagcct cccaagtagc tgagactata ggtgcatgcc accatgcctg gctaattaaa 26401 aaaaaaaaaa aactgtagag atggggtctc gctatgtggc ccagactggt ctggaactcc 26461 tgggctcaag cgatcctccc accttggcct cccaaagtgc tggcatgaca gggatgagcc 26521 agtgcaccta gccttgataa tttccagtgg ttctctatgc ttagagcatt actctcatct 26581 tttttttaaa cagtctagaa tgctctctta aagcctccat aaaaataact ccttgtttgt 26641 gtatcctttg gtgaacaatc caatgacttg caggagcact tccaggcatc cccagccgtt 26701 gcaggcctgc agacttcttc cctggcccac acttgctggt ctgtttgttc actctgcagt 26761 gaacccttct gcttacagct tttccagcta accttttgac tcacaatgta gcctctgaca 26821 aaaagcagac cagtagttac caggggctgg ggatgggaga gaggagggtg ttgactggga 26881 agataacaga gggacatttt caggtgatgg aaatgttaga tcacttgatt gcagtgaggg 26941 tcacacaaga agtcaaaact tattgaactc tctccgtcca gtgggtgcat ttcatggtgt 27001 gtcaatgaag cctcaataat gttgttttta aaaaatggtg tcacctcacc tcctggcttg 27061 tagtgtttgt gtacctaata tcaatcttgc taagtttcct ccagccaagc aagtctctgc 27121 ttttgctggg ctgctctgca gaggctgtga ctgctggatg ataacccagg ttacataaaa 27181 gtgtagaaac catgtgcaaa caaatgatac gtggcaacca gcttttatct tggtgctgga 27241 gatctgcagg gcgtggtgga aagggcggca actggacagt gcaatggttc atttcacctc 27301 ctgtcaccac atggggctgt gggggcagct ctgtcatacc ttccaatttt ccaagctgaa 27361 aatccaattg tcattgctgt ttttgaaatc ttacattttt aaaaaactga taacaaattc 27421 aaaatgttta agtatctatg tggggaaagt gaatcaggcc tgagggttgt ggactgcttt 27481 ctaggtttcc agcctgtgac ctctgcagca gtggaggcag aggagggctc agaggtcggg 27541 agggaagggg aaggagcatg aagaagtgtg attctgcacg taagcacggc ccatactttt 27601 actcagaacc caagacaggc agttgacaag aacgtcggaa gatgttcttc ctcatccctc 27661 gttcttattt tgcagtccct ccctgcccct acagctgttc tctgaaagct gtcaccctca 27721 tccacaaaaa gatcttggag gccacaggca aggcagggag ggaagctccg aggactgatg 27781 ctggctctcc ttccttctcc tcccaactcc ccctcccaac tggcactgaa ggcaggtgtt 27841 cctcaggctc ggaccctgcc tgcctttgtt cgttttctct ctcctgtctg caggcattct 27901 gatccatcac aatggctgtt aacattcatg actcccacat ctccagcacc agacccttgt 27961 gtatccaact gtatagagag acatctcaac ttctagactt tcaagaccaa acctaactcc 28021 ttacttcgtc ttcctaatct ggccccacca cccagcctac tccacctcta taaatggcaa 28081 taccacccac ggtctgccca catcagaaac ccagatgtca ttcattaatt cattcattca 28141 ttcctttttt ttttttaaac agagtcactc tgtcacccag gctggagggc agtggcacaa 28201 tctcagctca ctgcaacctc cacctcctgt gttcaagcga ttttcctgcc tcagcctcct 28261 gagtagctgt gattacaggc atgcaccact atgcctggct aacttttgta tttttactag 28321 agatggggct tcgccatgtt gtccaggctg gtcttgaact cctgggctaa agcaatctgc 28381 cagcctcagc ctcccaaatt gctgggatta caggcatgag ccaccatgcc cagcccattc 28441 tttctttttt tagagatttg attctttttt taatttttaa aatgatcata caataaaatg 28501 aatgttttca ggaaaaggtg tacagttctg tgaatcaaac agacagagag attcccgtaa 28561 ccaccaacaa aatcaagata aagagcaacg atagtcacat cttcccttca cccctagcca 28621 ctacgaaaca cagatctgct atcattaggg ttttatcttc tcaagaatgt cacatagctt 28681 ttcgcaacag gtttgcaaca gaacacaggt gtcatggaaa ctacccctaa aagccaaaat 28741 gggaaaggag aagactcata tcaacattgt cgtcaatgga cacatagatt cgggcaagtc 28801 cacctctact ggccatctga tctacaaatg tggtggcatc aacaaaagaa gcattaagaa 28861 atttgagaag aaggctgctg agatgggaaa gggctccttc aagtaagcct gggtcttgga 28921 taaactgaaa gctgagcgtg aatgtggtat caccattgat acctccttgt ggaaatttga 28981 gaccagcaag tactgaatga ctatcattaa tgcccctaag gaatgagacc accacttctc 29041 ctgttgtcct tcccagcttc tacccaacct tcccttttcc ctagtttata agacaggaga 29101 aaagggagaa agcaaaaagt tggaaagaaa caaaagtaag ataaatagct agacgacttt 29161 ggtgccacca cctggccctg gtggttaaaa taataataat aatattaacc cctgaccaaa 29221 actactggtg ttatctgtaa attccagaca ttgtatgaga aagcactgta aaactttttg 29281 ttctgttagc tgatgtatgt agcccccagt cacgttcctc atgcttactt gatctattat 29341 gaccctttca catggacccc ttagagttgt aagcccttaa aagggctagg aatttctttt 29401 ttggggagtt tggctcttaa gacgcgagtc tgccgacgct cccggccgaa taaaaacttc 29461 ttccttcttt aatccggtgt ctgaggagtt ttgtctgtga ctcgtcctgg tatatcccag 29521 gacacagaga cttcatcaca aacatcttta cagggacatc tcaggctgac tgtgctgtcc 29581 tggttgttgc tgctggtgtt ggtgaatttg aagctggtat ctccaagaac tggcagacct 29641 gagagcatgc ccttctggct tacacattgg gtgtgaaata actaatcact ggtgttaaca 29701 aaatggattt tattgagcca ccctacaacc agaagagata tgaggaaatc attaaggaag 29761 tcagcactat attaagaaaa ttggctacaa ccctgacaca gtagcatttg tgccaatttc 29821 tggttggaat agtgacaaca tgctggagcc aagtgctaac atgccttggt tcaagggatg 29881 gaaagtcacc cgtaaggatg gcaatgccag tggaaccata ctgcttgagg ctctggactg 29941 catcctacca ccaactcgtc caactgacaa gccctgcttc tccaggatgt ctacaaaatt 30001 ggtggtattg gtactgttcc tgatggccaa gtggagactg ttgctctcaa acctggtgtg 30061 gtgatcacct ttgctccagt caacactaca actgaagtaa aatctgtcaa aatgcaccat 30121 gaagctttga gtgaagcttt ttctggggac aatgtgggct tccatgtcaa gaatgtgtct 30181 gtcaaggata ttcattgtag taacaatgct ggtaactgca aaaatgaccc accaatggaa 30241 gcagctggat tcactgctca ggtgatctat cctgaaccat ccaggccaag tcagtgctgg 30301 ctatgcccct gtactagatt gccacatggc tcacattgca tgcaagtttg ctgagctgaa 30361 ggaaaagatt gatcgctctt ctggtaaaaa gctggaagac ggccctaaat tcttgaaatc 30421 tggtgatgtt gccatcattg atatggttcc tggtaagccc atgtgtgttg agagcttctc 30481 agactatcca cctctgggtt gctttgctgc ttgtgatatg agacagacag ttgcactggg 30541 tgtcatcaaa gcagcggaca agaaggcagc tggagctggc aaggtcacca agtatcccca 30601 gaaagctcag aaggctaaat gaatattatc cctaatacct gccaccctgg tattaagcag 30661 tggtggaaga atggtcttag aactgtttgt ttcagttggc tatttaagtt tagtagtaaa 30721 agactggtta atgataacaa tgcatagtaa aactttcaga aggaaaggag aatgttttgt 30781 ggaccacttt ggttttctct tttgcatgtg gcagttttaa tttattagtg tttaaaatca 30841 gtgcttttta atggaagcaa cttgaccaaa aatttgtcac agaattttga gacccattaa 30901 aaaagtttaa tgagaaaaaa aaaaaaagaa agtcacataa acagaatcct acagtgtgta 30961 accttttgag ctggcttctt tttatttatt ttatttattt atttattttt ttgagacaga 31021 atctcactgc aatgcccagg ctggagtgca atggtgcaat ctcagctcac tgcaacctct 31081 gcctcccagg ttcaagcaat tcttctgcct cagcctcccg agtaactgga attataggca 31141 tgcgccatca tgcctggcta atttttgtat ttttagtaga gacagggttt caccatgttg 31201 gccaggctgg tctcaaactc ctgacctcag gtgatatgcc cacctcagcc tcccaaagtg 31261 ctgggattac aggcatgagc caccgcaccc ggctagctga cttctttcat tcagcacaat 31321 gtaactgagc ttcatccatg tcaccatgcc tggctaattt ttaaattttt tgtagagaca 31381 ggatgtcact atgttgcaga ggctgatctc aaactcttag gctcaactga tcctcccatc 31441 tcagcctcca aaagtactgt aattacaagc atgagccacc acactcgacc tcctatttac 31501 ttcaccttgg aagctacagt gttacccaga gcctataaac aaacaaacaa aaacaccatg 31561 gatattttgt tcctggtaca caaattaatg tctttaaatc caccaacacc actatacatt 31621 ctatgcaatt aaaaaattaa cttgaggccg ggtgtagtgg ctcacacttg taatcccagc 31681 actttgggag gctgaggagg gcagatcact tgaggccagg agtttgagac cagcctggcc 31741 aacatggcaa aaccccatct ctactaaaaa tataaaaatt agctgggcat ggtggtgtgc 31801 acctataatc ccagctactc tggaggctga ggcaggagaa tcacttgaac ccaggaggca 31861 gaggtagcag tgagccaaga tcatgccact gcactccagc ctgggtgaca gagtgaaact 31921 gtgtctctaa aaaacaaaaa gaaaaagaaa ttcacctaag gcacatgacc agtaagtccc 31981 ttagtgctag taccgtctat gcaaaatagc aaatacagag tgaaacaaac caatgcaagc 32041 atttatgtga aattgggctg catgcgaaat ctggcttcat gcttaactac attaaaaaca 32101 attgccaaac tgtcaatgta tttcttcaca atatttctta ttttaacttc atcacaacta 32161 agagctttaa ctatgaacaa tgttaactag tcaaatttct gtaattttcc accatgtttt 32221 aaataatatt ttatttttat tttattttat ttttttgaga cagagtcttg ctctatcatc 32281 caggctggag tgcagcagca tgatcttggc tcactgcaac ctctgcctcc caggttcaag 32341 cgattctcct gcctcagcct ccctcaagta gctgggacta caggtgccca ccatcacgcc 32401 ctgctaattt ttgtattttt agtagagatg ggtttttacc atgttggcca ggctggcctt 32461 gaactcctga cctcaggtga tccccttgcc tcagcctccc aaattgctgg gattacaggc 32521 ctgagccact gcgtccagtc aaaaatatct tattaactaa actttttcaa cttcctgttt 32581 tttctgtatg ttcatgaaca cagacactta gagaaacaga aaaaaataaa aaactgtgta 32641 tgacttacat agaccatcta tgacatgctt ggacttcctg ttgtgtccta attttattta 32701 tttatttatt tatttattta tttatttatt tatttatttt ttgagacaga gtcttgctct 32761 gttgcccagg ctggaatgca gtggcatgat cttggctcac tgcaatctct gcctcccgga 32821 ttcaagcaat tctcctgcct cagcctccca aagagctgac attgcaggtg tgtgccactt 32881 cgcccagcta atttttgtat ttttagtaga gacggggttt caccatgttg gccaggctgg 32941 tctcgaactc ctgacctcag gtgatctgcc tgccttagcc tcccaaagtg ctggaattac 33001 aggcatgagc cactgggccc cacctgtcct aaatttttta tattttaaat aacccctcat 33061 tttactttag gacagaaatt tacaatgcaa gatcctttct cctacaaaag tattttcttt 33121 ataaccttcc ttaccaaaaa cacatcttta tatccataac tgtcttcaca tctctctccc 33181 ctttgctttc taccttgttt cataaataac tttcccaagc ccataatgtg aatcaacctc 33241 tagataactt ctgaattaga caaaattttt cttattctca ataagaacgc atcttctttg 33301 gcacatttta catacagaat tatatgttaa ctagaattct tattcttagt aatcttaaat 33361 tttagtgaaa acctaggaag caagaaatcc tgaactgcca aacagatgtt agcattggca 33421 agacaattct acaattttta aaaacatgtt tgcctacatc ataacccttt ctgaattgga 33481 aatgtcccag acatccaata aacacccaat ataatttcaa gattttaaat tacatgaaaa 33541 atttccctac tgcatttatc ccatttacat ttcattcatt tttagcaatt tatcctacta 33601 atgaaaattg ggatattaaa caaagctagt cattatttct tcattaacca tttttataac 33661 ctgtgaatat caggtgttca cctaagtaag aaatttcgag ttaaacacat gggtattttc 33721 acccgtaact cagaagattc ctctgccttt tttaaactaa caacattaaa ttagtcttac 33781 ctatcaaaaa atcacacaaa gatcattttg ttcttggcta gatttacaat cttataatct 33841 tttgtgccaa actctgacac cttaaaatat ctagcagaga caaagataaa acccagacaa 33901 aaatgtatgc tgacaattca gaagacattt ctatttttat tttaccgatt atcttaaagc 33961 cagcttattt attaaacatt tacctaagtc acgtaaactt gaaaaatgct tggacccata 34021 cacttaattt atgagtgctc ctttatttac tggccaattt tggtaccctg caaaaacaac 34081 atataacatc caggaacata tgtacttgaa aagatccaat agcttttcct agggattctt 34141 gctgtgagat agcaatacaa attcaccagt ttataaacat actcacacag gtaaactttg 34201 ttttccctga taggtaattc aatgaaggct atgaaccaaa attttgagta aagcactttc 34261 tatgatagtt ttatttttat tttattttta ttttttgaga cagggtcttg ctctgctgcc 34321 caggctggag tgcagtagtg caatcacagc cacagcccac tgcagccctg acctcccaag 34381 ctcaagctat cctcccacct cagcctcctg tatatctggg actacaagcg catgccacct 34441 tgcccagcta atattttaat tttttttgta cagaaaggat cttcctatgt cgtctagcct 34501 ggctgcaaac tcctaagctc aagttatcct cctgctttgg cctctcaaaa cgctgagatt 34561 ataggcatga accactgtgc atgacctata gttttatttt tatttattta tttattgaga 34621 tggagtctca ctttgttgcc caggctgaag tgcagtggca tgatctcggc tcactgaaac 34681 ctccacctcc cagattcaaa caattctcat gcgttagcct cccaagtagc tgggattaca 34741 ggcgcctgcc accacatcca gctaattttt gtatttgtaa tagagacagg gtttcaccat 34801 gttggccagg ctggtctcaa attcctgacc tcaagtgatc tgcccacctc agcctcccag 34861 agtgctggga ttacaggcat gagccaccac gcccagccta tagtttgatt tttaaaggcc 34921 caacctcccc agactccaaa gaacactgag gccaaacagt accacagaag aacatcatgt 34981 actaataagg ccttgaaacc acctttgcaa agatgatgac agtgagagaa gtctagcatg 35041 actgactccg tcttgcttct agccttacag gctggctgtc ctcattattc ctgggcataa 35101 gccaagctaa ccattggagg aatttatagt ttaactttga agcaaggatg ataacaatct 35161 ttccctaaaa ctataccctc cttgctcagt gactgaagcc acatttataa aactaatgaa 35221 aggccacaac attaggatta tgggaggggc ctggattctg ttaaaatgta ggcatagttt 35281 caataatctc ttactgctca ggagtttgtg gccagagatc acaagattcg tgacttcccc 35341 gattgctcct atagacaaca ttgctattgt agaacgtgat cggtcttttg agatgttttt 35401 cagactttgg cactctggca accaactgac cccacctgaa cccatgactc atgactcaac 35461 tggtcctgtg gatcccaccc agaggcagac tcagtgcaca agaacaattt tctacaccct 35521 tacagtttta tccccagcta atcagcagca cccatctcct agacctctgc caccaaataa 35581 tccataaaaa ccctagcttc tgaatcctca gggagactga tttgagtgat aactccagtc 35641 tttctgtttg gctagctctg tgttaattaa attctttctc tactgcagta tcatggtctc 35701 agtgaattgt ttttttctgt gcagtgggca agaaaaatcc ttcaggcaat tatagcccaa 35761 cctggctttg aacagcaaca taaaaacctg gatacatgga actccatccc actttcccat 35821 tcaacagcaa aatgagaccc atggaggggc cagagcgttg caaaagaata tccattgatc 35881 aaattctcgt ttctcacaac tatattgaca cacacacaca taaacaaaca attatcaaaa 35941 gcaattcaac tgctacagca acaaacaagc cccaagtgtg tccatactaa aacagctggg 36001 gtgccttcct ctctcagttg gtttttaaag gccaaaccaa acctttggtg ggcagaagcc 36061 tagggagtgg gttctgctaa ttggttgggg gtaaaactgt aggggtgtgg aaaatggtcc 36121 tcgtgcactg agtctgcctc tgggtgggaa ccacagaacc agttgagtca tgagtcatgg 36181 gttcgggtgg ggtcagtgcg ttatcagagt gcaaaagttg atcttgttcc atctgcaaac 36241 agaaatatct tctgagtttc ccaaatcaag agaagccctc tggtacccac cagaaatcac 36301 tcacatgtcc agacgcagtt gtgcacacac acaattacta atgtcaacaa aaagagtcaa 36361 actctgtaaa atatttgaag agatttattc tgagccaaat atgagtgacc atggcccatg 36421 acacagtccc caggaggtcc tgagaacatg tgcccaaggt cgttggggtg cagcttggtt 36481 ttatacattt taaggaggca tgagacatca atcaaataca tttaagatat acatttgttt 36541 ggcccagaaa ggcgatacaa ctcaaagcag ggtggggctt ccaggctata ggtaaattta 36601 aacatcttct ggttgacaat tggttgagtt tgtctaaaga cctgggatca tagaaaggaa 36661 atgttcaggt taaaagattg tggagaccaa ggttcttttg aagtcttaca gtggctgccc 36721 ttagaaacaa tagttgacaa atgtttccta ttcagacctt taaaaggtgc tagactctta 36781 gttaatctct tcaggattgg gagggcctga aagaaaaaga tctagctatg ttaatagaga 36841 ttctttacag atgcacattt cccccccaca aaggacaact ttgcagggcc atttcaaaat 36901 atggcaaaga aacatgtttt ggggtaaaat attatgactt tcttctttgt cgtgtaatgt 36961 tataccagag taagattgga aagtaagtca caatatgtaa gttaaataaa acccatctgg 37021 gctgggcatg gtggctcacg cttgtaatcc ctgcactttt ggaggctgag gcaggcagat 37081 cacctgaggt caggagttcg agaacagcct ggccattatg gtgaaacccc atctctacga 37141 aaaatacaaa aattggcagg gcgtagtggc aggcacctgt aattccagct actcgtgagg 37201 ctgaggcagg agaattgctt gaacccagga ggcagaggtt gcagtgagct gagatcgcac 37261 cattgcactc cagcctgggc aacagagtaa gactgtctca acaacaacaa caaaaatctg 37321 atgagaattt atggtttgta gagcatgact tctctagacc ccttagatag gaatttgggc 37381 aagattttaa aaatcagagc ttagttctca ctaacaagcc ccaagagtgt ccaaactgaa 37441 acagtcatgg tgctttcctc cctccatcat ttgggcttat tcaacctgca aatggaaatt 37501 ccttagaaaa ttcccaaatt gagaggagcc cttcctgctg tctggaaccc agaaaagaca 37561 ctcatccatc cagacacaga tatcaaattt caaagtctgt tcttcctagg caatcagcaa 37621 tacatttggg gctggcagca gcagagccaa agagaaaaag atggaaaccc acctccagcc 37681 aaaaaaaggt cagacagctg tacagggggc ttctgaaagt cccagtctgc agcagctgag 37741 ccacaagcaa tgtgttccca ggtagggaac caaaatctgt tactgaaatg ccagggctcc 37801 agtctaggtc ccgttgctca ctgcacagaa agccaatcac caagacaatg agtactgcct 37861 gggaagaagg ctttaattga gtgctttagc caaggaaaat ggggatcagt ctcaaatcca 37921 tctccctagt caactaaaat cagggggtta tacagcaggg aagaaatgta actacatgtg 37981 ggaaaaccag agttagggag aggtaaggaa gaggagttgg tcaaaggaat tagggaggga 38041 taaggaaatc aggagggatg aggggtctgg catctcattg tctggatgca gtgatctggt 38101 gagtttcaat tcgttgattt cctgtggcaa gaactcagat aagacaaatg caggcttcaa 38161 actttaagac caggaggggg ccaggcacag tggctcatgc ctgtaatccc agcactttgg 38221 gagcccgagg tgggtggatc acgaggtcag gagatcaagg ccatcctggc taacacagtg 38281 aaaccctgtc tctactaaaa atacaaaaaa ttagccgggt ttggtggcat gtgcctgtag 38341 tcccagctac tcaggagact gaggcaggag aattgcttga acctgggaag cagaggttgt 38401 agtgagccaa gatcgcacca ctgcactcca gcctgggcaa cagagaaaga ctccatctaa 38461 aaaaaaaaaa aaaaaaaaaa aaagaagggt gaatttctac gttgatccaa aagaagcatc 38521 tatgggacaa tcaggccagt ttcatgagga atataaattg gatcctgtca ttgccctgaa 38581 taaaactttt caatggcttt ccattctggc ctcctcctca cagcctttga ggccctgcct 38641 ctctctccaa gcttctctgg ggccatcccc agagaagtgt cactgacatc gctcttctgg 38701 cctggacttc tctctgtccc ttgagcaggc caagctctga atgtctgcat gcaccatgcc 38761 ttccactctt tacctggcta gttccccact tcttccaagt ttctgtccaa tgtcacgcct 38821 tagagaggct ttccctgatc atccaatatc aagtagcccc cttctgatat tatctcacac 38881 tgcgctctca tggtttgatg tttgtatgtg tgtgttcatt tgtttaacgt gactcctact 38941 agactgagaa gcccatgaca gcagtcacca cgtcagtttt ctttgtcact atatacccag 39001 tgggtgccta aaaccgtgcc tagcacagac tgggcatttg atggtgaata aatgaatgaa 39061 tgatttttta aaaatatctc taattttcta ctattataaa taatgctatg agaacatcct 39121 tttacataaa ggcacgtgca ctgtgcattt cctacaagtc gtactgttga attaaatgat 39181 atgtacatgt aaagctttga taaatatcga tgaattacct tccccggctg agtcaaccct 39241 ttcttcccca acagtctgta agagacccca taactctgct gacaagagaa aatattgtat 39301 tttaattttt tgagatgata tttagaattt agaaacttaa gtattagtct gggatttcca 39361 gagaaacaga accaatagag tgtgtgtatg tgtgtctgtg ggtgtgtctg tgtgtctgtg 39421 tctctgtgca tgtgtctgtg tgtgtatgtg tgtctctgcg gtgtgtgtgt gtctgtgtgt 39481 gtgtctgtgt atgtatctgt gtgtgtgtct ctgtgcatgt ctgtgtgtgt ctctgtgtgt 39541 gtgtacctct gtgtgtctct gtatgttttt gtatgtgtgt ctctgtctgt gtctctatgt 39601 gtgtctctgt gtgcatgtct gtgtgtgtgt ttctctctgt gtgtgttttt gtgtgtgtct 39661 gtatatgtgt ctccatgtgt gtgtctgtat gtctctgtgt gcgtctgtgt gcatgtatgt 39721 ttgtggctct gtgtgttaat gtgtgtctct gtgtatgtgt ttctgtgtgc atgtttgtat 39781 gtgtttctgt gtgtctctgt gtgtttttgt gggtctgtgt gtctctctgt gtgtgtgtat 39841 atctctctgt gtctgtgtgt gtctctgtgt ttctgtgtgc atgtctgtgt gtgtgtccat 39901 gtgtgtgtct gtgtgttact gtgtgtctcc gtgtgtgtgt ctgtgtgttt ctgtgtgtgt 39961 gtttgtgtgt gtgtgtgtgt gtgtctgcat gtctgcatgt gtgtataaag agatttattt 40021 aaggaattgg ctcatgtgat tgtgaatgct tgatgagccc aaaatctgac ggaggaagct 40081 ggctgactcg aggctcagga agaagttgtg gttgagtcta aagagagcct gctggtgagc 40141 caggaagacc cactgctgca gacaaagtct gaatccagca tgcaggagaa ttctcttttg 40201 catgtggagg tcagcctttt gtgctgttca ggccttcagt gactggagga gtcccccaca 40261 ttttggaggg caatctgctt tgctcagagt ccaccaatgt aaatgtcaat ctcatccaaa 40321 aacaccctca cagcaacacc cagaataacg tttcaccaaa tatctgggca ctgtggcgtg 40381 gccaagccaa cacataaaat taaccgtcat actctgtatt attttcatat taatctccca 40441 gattaaaaat gagatagaca atatttacat attgtgtaga gtcatttgtg tttctcccaa 40501 tacttgcttt gttcttattt gacttacagg agttctttga atattggctt acaaacctca 40561 tatcaggtgt ataagtagaa ggatagtggc cttttacttt gttgattgta atgtttttca 40621 taggttatct ctttttatgt gctcaaattt ctcatcattt tttgatgact tctacattat 40681 gtcatactgg gaaaaagcct ccttatttat aagactttta aaattacaca atacctacac 40741 aacatttatt atttttaaaa acatttttga tcttgaatgc acctgccttt tattcttgtt 40801 acctggctgt aattcttccc agtggatagg cagctatccc aacataattt actgaatagt 40861 ccatcctttc ctacagtggt ttgaaatgaa cctggatcac ttaagtagga agaatcctct 40921 aaatgtaatc taaaacaaca agatttcatc atttttaccc atccttaccc ataatgagga 40981 aaaacattaa agattcaaac atccagtgtt ggagaaggtg caggggaagg gattctgcca 41041 tgtacagctg gtggaaatgt aaattaccaa acactatcag gaaggtaata tgcaaatatc 41101 tatcaacaat ttcaaggcat acctctttga actagcaatt acataaatta tagatctaaa 41161 gattcatcta accaaaatac tcacacctgt tcagatagct agctagctag ctagatagaa 41221 agatgataga tagatagata gataggtaat agataaattt gcagaggcac tgtttgttat 41281 accaagagaa ctgtaaacta tgtaaattaa tcctaaatgg ttacataaat tataagacag 41341 tcctaaaatg aaataccatg tagccattag aaagactcta gattggtata ggctgccatg 41401 aaaagatcac taaagcatat tgtaaggtga aaaataaaat ttcaaaaaaa atatggagta 41461 tgattccatt tatgcttttt cttttctttt tttgagatgg agtcttgctc tgtcacccag 41521 gctggagtgc agtggtgcca tctcggccca ctgcaacctc cgcctcctgg gttcaagtga 41581 ttctcctgcc tcagcctcac gagtagctgg gactacagac gcctgccacc atgcctggct 41641 tatttttgta tttttagtag agacagggtt tcaccatctt ggccaggctg gtctcgaact 41701 cctgacctca ggtgatccgc ctgcctcagc ctcccagagt gctgggatta caggcgcgag 41761 tcaccgggcc cagccccagt tatgcttttt ttttttttga aacggagtct cactctgtcg 41821 cccaggcggg agtgcaatgg cgcgatctcg gcttactaca acctctgcct cctgggttca 41881 agcaattctc ctgcctcaga ctcccgagta gctgggatta tgggtgcaca ccaccacact 41941 ggctaatttt ttgtattttt agtagagacg gggtttcact atattgccca ggctggtctc 42001 gaactcctgg gctcaagtga tctgcctgcc ttggcctccc aaagtgctag gattacaagt 42061 gtgaggcact acacccggcc ccgtgtatgc atttttaaac caaatatttg tatacgcaca 42121 tatatggaca taaacaaaat agaggtttcc tctgggaatg gagaagtgtc agggagaagt 42181 gaaacggcat ttcacatttt acttttgtac cgttgggttt ttttaaaggc tatattcaca 42241 cattataatg caacttttta aaattagaga aataataatg ggaaaccatt cagcaccttg 42301 attctgacca agtgagagca gataaggcag cacagggatt caagctgcct ggagacctcc 42361 agggtcactg tgagggtggc tggaatcatg ggaataaata aatgatttaa catgctcagg 42421 cttctaagtg cccatccccg gggctgcgga ggagcgaggt gcttaaaccg ttcccaagcc 42481 tcaggcattg aaagcaggag tgttttctct aagaaggagg gtgctgtgct gtaattgacc 42541 tctttggcat tcagcgtctg cacgacatcc agtgcagctc tgggccggcg gaagagacca 42601 ggctgcttgt gctgggaggg gccgctgccc aaggacaagt cctgtactca cagcaacgct 42661 gaatcgcagt cctgggccat ctggccccgg cctcataatc actcttctgt catgaatact 42721 gaatgaaggt aaacatacag tcacagggct gtcttggcag cacccaactt taaacctgtg 42781 gcctatttct agctctccac tgtgctgaat tctgtcaaaa acaaacaaat aacatgctaa 42841 aactattttg tgaacattac ttatttcaca ttaagaatgt tgctctgtta ttgttttttt 42901 ttttttttga gatggagtcc cgctctgtcg cccaggctga agtgcagtgg cgcgatcact 42961 gcaacctctg cctcccgggt tgaagcaatt ctcctgcatc agcctcccga gtagctaaga 43021 ttacaggtgt gagccaccac gcccggctaa ttgttgtatt tttggtagag acagggtttt 43081 gccatcttgg ctaggctggt ctcgaactgc tgacctcagg tgacccgccc acctcggcct 43141 cccagagtgc tgggattata ggcgtgaacc actgcaacca gccaacagca tggttttggt 43201 cacctcaaat gaggacacga atatataaat ctcacggttt tggtattgca tcatattttt 43261 catacactgg gaaacaaaaa atggaagagg tgtaggctgt cctgacctct gaaaggccct 43321 gaggggacag tggccatctt ttggtttcct ggcttctgaa ccccttctgt gtttagggaa 43381 ctggccaagt atgagccttg tgaggaggca gagcccatct cccactagag aagctactga 43441 tgcagggacc ttgttttcca agcatccctg atgcctgcgt ttagccaatc aggtgcacct 43501 gccccaaatt ctgaatctga ggctaatggc acaaaggagg aaaagcagaa aattatttct 43561 ggcagtggca gtagtggtgg cagacagaat gtggctctag gtcctagtgt gcggtggcgg 43621 ggggactccc acgtccttcc tggactgagt taatggcatg acttttaatg tggccctggc 43681 tgtgcagctt ccttgattct agaagcctgg ttttccagcc ttcctggcac atctctgagc 43741 tacccaatat cttatcatta aatccctaaa tgagccagag ttggtctctg ttgctgcaat 43801 taacagcgat tgctgagaca cgagtccagc acaatgatga tgagcaagga agcctgcgtt 43861 ctgttgagag atgtgcaaac cattagtaat cagaaaaatg agactacact aaaacggcat 43921 ggagacgcta cttgatatcc atgtgactgg caaaatctca ggtgctggat aatgccaaga 43981 gtgggcaggg gtgtggctca gagatccagt tgccttgcag ctggtggcta tacccttttc 44041 ccttttaaaa aatttgtttt ctcggctggg cgtggtggct cgcgcttgta atcccagcac 44101 tttggtagac cgcggcaggt ggatcacttg aggtcaggag ttcgagacca gcctggccaa 44161 catagtgaaa ccctgtctct tctaaaaata caaaaattag ccaggcatgg tggcagccac 44221 ctgtaggctc agctactcgg gaggctgagg caggaaaatc gcttaaaccc aggaggtgga 44281 ggttgcaatg agccgagatt gcaccactgc actccagcct ggtgagagtg actctatcaa 44341 agaaaaaaaa aaggtttgct tttttacatt tttttctctt tttttaattt ttaattttta 44401 ttttttgaga cagtcagtag cctgggctgg agtgcagttg tgcaattgta gctcactgca 44461 gcctctacct ccgggctcaa gtgatcctcc tgcctcctcc tcccaagtaa ctgggactac 44521 aggcacatgc catcacgccc agctttttaa tttttatttt gtagagacag ggttcttgct 44581 gtgttgccca ggctggtctc aaactcatgg gctcaaatca ttctcccact tcagcttccc 44641 aaagtgccag gattataggt gtgagccaca gcacctggtc cctttttctt ttttatgtct 44701 atttgggtcc catacactca acagctcaac agaagaaagg gctgtggctg gacagagtgg 44761 gccatgccgc ttgcctctta caggaactat cgccttaatc actttcttct tgcttttaat 44821 ttattttttg agtaacatta aacattcaca cacatactca catacttaga aaatttcttc 44881 aacagcagaa aaggatatac ggggaaaggt gagtttccct cccgcgccaa cgcccagagc 44941 cccttcccag agacagccgc ggagaatagg ttcatgggtg acccttcccc aagtcttttc 45001 ctcactcagc atccagatgg gtccttcgga aatgtgacac agattacggc acaacctctg 45061 ccctccaggg gcttcccatc attctccgga tatcattcta catcctactg tggcgacaaa 45121 gccctacgtg agctggaccc agtcgcctgc ccctgcctca ttcccacggc tctcccccac 45181 cctgtcccct tgcagcctct ctggcctccc cggcctcctc tgacacccca tggctcatct 45241 catgacttgc tgtacctctg cctagaatgt ccctagagcc tccgactctc ctgctggagt 45301 tctctcctta aataccacct cctccgaggg ccttcctggg ctgcctgcgc cgccttcctt 45361 cctctgttct cccaatctga ggcttcctac ccgactccag cccacagccc caggctttgt 45421 ggtaatgtaa ctgttgtaaa cttaaaattc tggccgggcg tggtggctca tgcctgtaat 45481 cccagcactt tgggagttcg aggcgggcag atcacttgag gccaggagtt caagaccagc 45541 ctggccaaca tggccaaacc ccgtctctac taaaaataca aaaattagct gggcgtggtg 45601 gagtgcgcct gtagtcccag ttactctgga ggctgacgtg ggaggatggc ttgaactcgg 45661 gaggcagagg gtgcagtgag ctgagatcac accactgcac tccagcctgg gtgacatagc 45721 aagactctgt ctcccaaaaa aaaaaaaaaa atcctaagcc cccccagtga ctgaacagac 45781 tccctcttgg ccaaggggac cccagaaaaa ccttaaaact tgagttcctg gccatgatag 45841 gatgggaggt cagacatgcc tcattatacg tctttacctt ttacagttca gacactgacc 45901 agcattcatg ttaaaataga gatcataaga ctgacagaac ggacccttta tggaagtaag 45961 ataccaaatt ataaatagaa ctaaggtcat gccaggcaag tggtaagtca cgcacctcta 46021 cacttaaaga ataacttatg ctctaactgc cacgaggctt ttctttttct ctagcagata 46081 aacaagcact ggcctcagga taagcagtgt tgaaacattt ggaagctcct gcagatgctg 46141 aataactgac ctccagcctc tgttccacca gccacaacta cagctttgat tggagaagag 46201 actgatttca gccactttct cctggtaaga agaacaggga ctggtcctgg ctggtttaca 46261 gaggttgctc acaggttgcc ttcctgtctt gaaaatacct tttgatattt agggcctaat 46321 tgtaataaca tttaaatgct aagtctccac tccaaggtga acataggttg gttttttgtt 46381 tatttgtttg tttgtttttt gagacagagt ctcattttgt cgcccaggct ggagtgcagt 46441 ggcgttatct ctgctcattg cgtcctctgc ctcccaagtt caagtgattc tcatgcctca 46501 gcctcccaag cagctgggac tacaggcgcg cccctccgcc tggctaaatt ttagtagaga 46561 cgggtttagc catgttggcc aggctggtcc taaactcctg gcctcaagtg atttgcccgc 46621 cttggcttcc caaagtgctg ggattacagg cgtgagctac tacacctggc caacataggt 46681 tgtatgttac atgcatattt gttcaatatg catgtgtcat gactacctca tgaatattca 46741 tagcttctcc tgtaatctgt tgaatatgta tgtttagcca acccattcag tataaaactc 46801 ctaccccaaa ccctcttcct tcgaagtacc tgtctctggt cttggccgaa ggcacacttc 46861 ccagcctgtg ggacagccac cttgcaggct gtaacccttt ataagaaata aagttttcgt 46921 caggcatggt gctgtgtacc tgtagtctca gctactcagg aggctgaggt gggaggacca 46981 cctgagccta ggaggttgtg gctgcagtga gctatgatta tgccactgta ctccagcctg 47041 ggcaacacag tgagatcctg tgaaacaaag gaaagaaaaa agaaggaagg agggaaggaa 47101 ggaaggaaga gaaagagaga gagaaaagaa aagaaagaga gagacagaaa gaagaaggaa 47161 aagaaagaga gaaaggaagg aaggaagaaa gaaagagaga gagaaagaaa aagaaagaaa 47221 gagaaagaaa gggagagaaa ggaaagaagg aagggaagga gggagggagg aagggaggga 47281 gggacgggaa agaaaagaga ggaaggaagg aagaaaggaa ggaagaaggg gagggagaga 47341 gaaagaaaga aggggagaac agaagagaaa gaaagaaaga gaaaaagaaa atcagagtct 47401 cctccccttt tctacatgta taaattctgt gagttttaag taatcactat tcatcaatct 47461 ctggcccatc ttgctgcaat ggtcagctcc atggagccag gccttgtcta ccgctgaatc 47521 cccagcaccg gcgagtgctg tgatgggggc tactcaggaa gtcctcgctg aatgggtgtg 47581 gagacgcatg gccatcctca tgccttgctt ttttccgctg aacagtacat gtcagggcct 47641 gatctacatt ggcatacgtg gacctgcctc gttctttgta acaggtgtgt gggattccta 47701 ccatgtggat gctccatgat ctttatttaa ccagagcctt tactgataga caggttgctc 47761 ccaatttttt attatcacaa aacaaacaac aaacaaaaaa aaaccagaaa agtatatcca 47821 tacagtaaat ctgtagatat cagcttcttg gatcaaataa tatggaatgt ttcattttgg 47881 tttttttgtt ttcttttgtt ttgttttgtt tccttttttt tctctatttt tttgacacgg 47941 agtctctctc tgtcactcag gctggaatgc agtggtggaa tctcagctca cttcaacttc 48001 ctcctcccag gttccagcga ttctcctgcc tcagcctcct gagtagctgg gatttcaggt 48061 ttgccccacc acgtctggct aatttttgta tttttagtag agatgaggtt ttgccatgtt 48121 ggccagactg gtcttgaact cctgacctcg agtgatctgc ccacctcagc ctccctaagt 48181 tctggaatta caggcgtgag ccaccatccc cggccggggt gtttcatttt gaaagatgtc 48241 atcaaatcgc agtccccaga gaaatcccca gagaatcgca gtccccagag tcacagggag 48301 atgggagaaa agagaagcag atgaggccag agagtgctgc agacagaagg cccgtggagc 48361 agaacttgca agttctggga ggactctagc tttgatcaga gaggtgggga gcctctgtgg 48421 ggttctgagt agggaagaga cgtagcctac ttatgttttt atttatttat ctattttgag 48481 acagggtctc actctgtcac ccaggcagga gtacagtggt gtaatcacga ctcactgcag 48541 cctcaatctc ctgggctcaa gtgatcctcc caccttagcc tcttcagtag ctgggactac 48601 aggcatgtgc caccacacct ggttaatttt tttatttttt gttgagacag ggtttcacta 48661 tgttgcccag gctggtcttg aactcctggg ctcaagtgat cctcccgcct tggcctccca 48721 aagttttggg attacaggtg tgagccactg cacccagccc tgcttatgtt tttataagac 48781 cattctggct gctgcgtggt gtgaagtagg gaaccagaga gcagatgctg cagagacaca 48841 gctgagaaag tgctgagagc ctggaccagg gctgtgctaa aaagtggctg gattggggag 48901 ctattctgaa gacagggtgg aaaggatttg ctgatggggt aaagagaaga atcagagatt 48961 tctcagggcc tgagcaactg gaaggatggg gtggcagagt ctgtgtaatt ggggccctat 49021 tcccaagatg ttggcaccat gggcctgtag gggcagcagc tggtggaggg gaggcatctg 49081 ggaaacttcc ctttcccacc agatagggtt tcctgatgtt agcctgtcag ggtggtttca 49141 aaacaggtac agctgggctg cagcaggttg aactgtaagg aagggccagc ctacaggcaa 49201 atgagggcag tggctaactc aaatcctgtc ccccggctag gcccacactc ttcctttccc 49261 tccatttccc ctgttggtgc atgctggtga ccaggtgccc aggtgcccat acagctggag 49321 acatctgcat tcccttgctt tttttttttt tcctttttct tttttctttt tgagactgag 49381 tcgctcactc tgtcgcccag gctggagtgc agtggtgcaa tctcagctca ctgcaacctc 49441 cgcctcctgg gttcgagcaa ttcttctgcc tcagcctgct gggtagatgg gattacaggt 49501 gccagccatc acacctggct aatctttata tttttagcag acatggggtt tcagcgtgtt 49561 ggcaaggctg gtctggaact cctgacctca agtgacctgc cgcctcagcc tcccaaagtg 49621 ctgggattac agcactttgg catgagccac tgggcccagc tttttttttt tttttttaaa 49681 gaaatggagt ctggctctgt cgcccaggct ggagtgcaat ggcgtgattg cagctcacta 49741 cagcctggaa ctcctgggct taagcagtcc tcccacttca gcttcctgag tagttgcaaa 49801 tacaagcatg caccagcaca ctcagctaat tttttaattt ttttagcgat gggctaccgc 49861 tatgttgccc aggctggtct caaacaactg aattcaagca atccttctgc tttagcctcc 49921 caaagtgctg ggatttccag catgagccac cacatctggc cccctttcat cttaaaatgc 49981 ttcccattct gttttgggtg ccccatatct gacctttacc ctatctgtca ttgcgggatg 50041 ctgacaagga cagtgtggct caatgtgggt ttctgttgtc aagataccac aagggcatgg 50101 aaccagaagg gggctgtcag tttctgactg gctgcagcag cttatcagaa cgtatcacag 50161 atcaggagag gacgatatga tcaacggatg aggtaaccat cagctgaggg caaacagctc 50221 ctaactaaag tcagcacttt acagctggct ccagtgcact ctccctgtgg ctgcaggcag 50281 agcagccagg gcccctcctc cagcctcctc ccacctcaga gcttcagagc ttcagacatc 50341 cttggtgctg aggcctaggt tccatgaggt caaagggaga attcaggccg ggcacagtgg 50401 ctcacgcctg taattccaac actttgggag gccaaggtgg gttgatcacc tgagctcagg 50461 agttcgagac cagcctgacc aatatggtga aacctcatct ctactaaaaa tacaaaaatt 50521 agccaggtgt ggtggtgagt gcctgtaatc ccagctactc aggaggctga gtcaggagaa 50581 tggcgtgaac ccaggagaat ggaggttgca gtgagccgag atcgcctctg cactccagcc 50641 tgggcaacag agtgagactc tgtctcaaaa aaaaaaaaaa aaggagaatt cagactaagc 50701 tgttttgcag catcccaagg ctgtgttgac ctcacgtaca gctgcatcat gggagtgggc 50761 ggtgcgacat ctcaagtcca actcaaccga gccatggttc aggctaagtt cccgctggtg 50821 cccctacctg gaacgctctt cagatagtgg ctccttctca cccattcaaa tgtcccaggc 50881 ctttcctgac aggccctgcc catctctctc tctctctctc tcaatttccc tgcttgcttt 50941 ttgttgttgt tgttgttgtt gtttgatttt tggtcctttt tttttttttt ttttgaggca 51001 gagtcttcct ctgtcgccca ggctggagtg cagtggcatg atctcggctg actgcaatct 51061 ctgcctccca ggttaaggtg attctcctgc cccagtctcc tgagtagctg ggattaccag 51121 tgcccgccac catacctggc taatttttgt atttttagta gagacagagt ttcaccatgt 51181 tggccaggct ggtctcaaac tcctgacctc aagtgatgtg cccgcctcag cttcccaaag 51241 tgctggggtt acaggcatga accccaccac gcctagccga tttattttct ttcccttttt 51301 tttcccccag acaggaccac tggattggag tggtgccatc atagttcatt ttaatttcca 51361 actcctaggc tcaagtgatc ctcccgcctc agcctcctga gtatctagga ctacaggtgt 51421 gcaccaccat gcctggctaa ttttttattt ttatcctttt ttgtagagac aggggatctc 51481 actatgttgc ccaggctggt ctcaaacacc cagccttaag caatcctcct cctgcctcag 51541 cctcctaagt agctgggatt ataggtgtga gccaccatgc ctgacttttt atctcacttt 51601 ttgtagagac ggggtatcgc tatgttgacc aggctggttt tgaactcctg gcctcaagca 51661 gtcctcctgc ctgggcctcc caaagcgttg agatcacaga cgtgagccac cttgcctggc 51721 ctacttcctt tcttcctgta aaattgcctt gttggtttgt ctacttgttt tctgttcatc 51781 ctgttcactc ttgtctccag ctcctagaac agtacctcac acagagtaac ggctccatgt 51841 gttttgtgac agggaagcag acagtgtggc ttgttggaga ggagccacat acccccgccc 51901 cccgattcat acatggtgtg actcagcccg aaccattgct cagttgagtt ggatttgaga 51961 tgatgcactg cccactcccg tgatacagct gtgtgtgagg tcagcacagt cttggggtgc 52021 tgccagacaa cttagtttga attctccctt tgactcattg gaacctagac ctcagcacca 52081 aagatgtccc cataccagta acccgggcaa cagaatgcag ataaaggaca ggagacaaaa 52141 ggcgcaggtt ctgacacatt ctttgcttgt gtcatcagct agcaagtatt cacaggcaca 52201 tccaacctcc atccttacat tccacaagga tttgagtgtc tgttataggc aagggacaat 52261 gctaggtttt tttttctaaa aacaaacttg gggctgggca tggtggctca cacttgtaat 52321 cctagcactt tgggaggcca aagcgggtgg actgcctgag ctcaagagtt cgacaactct 52381 tggtaacatg gtgaaacccc gtctctgcca aaaatacaaa aaattagcca gatgcagtgg 52441 cgtgcacctg tggcccagct tctcgggagg ctgaggtgtg aggatcactt gagcccgaaa 52501 ggtgaaggct gcagtgggcc gcgattgcac cactgaactc tagcctgggc aacagagtgg 52561 gaccctgtct caaaacaaca acaacaaaat aagaaagaaa gaaagaaaaa gcggaagtga 52621 gagatggatg tgcgtgtagt cgtgtgggat aattgcccat gggaaggaaa cagaattcaa 52681 cctcttggaa tgtaggggga ggaggaagcc tgaaaaaact gttcaaattt ggattctggg 52741 ctgagcacag tggctcacgc ctgtaatccc agcactttgg gaggcaaggc aggcggatca 52801 cttgaggttc ggagttcgag actagcctgg ccaacatggt gaaaccctgt ctacaaaaaa 52861 tacaaaaatt agtcgggcgt ggtgttgcac acctgtaatc ccactacttg gaaggctggg 52921 gtaggagaat cacttgaact agggaggcag aggttgcagt gagctgagat cgcaccacta 52981 aactccagcc tgggtgatag agcaagactc tgcctcaaaa aaaaaaaaat tggactctgg 53041 gtgaaataag tctgttgggg acaaaatctg gtgggagacc cagaaggtca agctgcttta 53101 gaaagagctc aaccactgac tttcctcttt ttgaccacat gactcttcca ggtataaaag 53161 cctgccgttg gtggaaaagg aaaaggtggt ccagttagac aagatgccac cggcttctga 53221 aaccagacat ctttgaaaac ccacccatct acctgcagga aaaggcaagg gacctgacct 53281 cgcctgggct cagggaagag gttgctggga tcataacccc aggaagagac gggatggaag 53341 gagagcatgc agggagcaga ggacaggcta ctctccaagg aggagaggga agaggccaca 53401 gaaatggaaa aggggccagg acaagtcagt ggagagatgg tcagggcctg gcgaggagga 53461 gcaggacaaa agcggtggag ctcagccccg tggcggcagg gccactccca ttcatctggt 53521 cctctacttc tgaagttctc gaatgcagaa tgagatgtac agatgggcca gcaggtgata 53581 tgctatctgt cataggttaa ggcctcatta ttttattcat ttttaaactt tttttttttt 53641 gagacagggt cttgttctgt cgcccaggct ggtttgcagt ggtgccacca tggttcactg 53701 taaccttgaa gcccctgggc tcaagtgatc ctcctgcctc agcctccaga gtaacgagga 53761 ctacaggtgc acaccgccat gcccagctaa tttcttaacg ttttgtagag aaaggctctc 53821 actgtgttgc ccaggctggt cttgaactcc ttgactcaag caatcctcct gtcttggctt 53881 cccaaagtgt tgggattaca ggtgtgagcc actgctccca gccacaagtt ttattttaaa 53941 tgtttacctt tcaatgatca cttcgaggag atttaagctt actctttctt tttttttttt 54001 cttgagatgg agtcttgctg tgtcgtccag gctggagtgc agtggtgcta tcttggctca 54061 ctgcaacctc tgcttcccag gttcaagtga ttctcctacc tcagcctccc gagtagctga 54121 gactacaggc atgtgccacc actccctact aatttttgta tttttagtag agtcggggtt 54181 tcaccatgtt agccaggttg gtctcgaact cctgacctaa agtgatctgc ctgcctcagc 54241 ctcccaaagt gctgggaggc atgagccacc atgcctggct cattgtctag tttttaccaa 54301 gaagatgatg agatatgcca gagggctccg tatctgctat gagaacgtcc cccaagagca 54361 gctggccgtg gtgctgcagc tggagctgtg cttggagctc ttcttcaccc ccatggcggt 54421 caccaccttc tgccactggc actttgtgca gatcatgctc tctggcccca agtggggacc 54481 tggaggtgct gaagagccaa gagactggct gccgtgaccc tcttcaactt cctggtctgc 54541 ttggaccctg tgctttgtct cccatggtgg gttcccccag tggaagagtc aaattggtgg 54601 gcaggcgcag tgttgttcag tgctctcgat gctgcattga ccccttgatt ttctttgttt 54661 ttgtttttat tttgttttgt tttgtttttg agatggagtc ttgctctgtc acccaggctg 54721 gagtgcagtg gtgtgatgat cttggctcac tgcaacctcc gcctcctggg ttctcctgcc 54781 tcagcctccc gggtagctgg gattacaggc acccaccacc acacccggct aatttttgta 54841 tttttagtag agacgaggtt tcaaccccta ctcaggctgg tgtcaaactc ctgaccgcaa 54901 gtgatctgcc caccttggcc tcccaaagtg ttgcaattac aggcgtgagc caccatgccc 54961 agcctaaccc cttgattttc tgtttctctt cctccaccgt gcgcaaagcc tttgacagag 55021 ggccacggag gtggcagcac ggcgggggtc tcactgtctg ggtggtggtg aaagctctgg 55081 agagccagct gcagagagaa gatgtggatc ttagggacgg ccaatgtcac cttcacagga 55141 gattagaggc gccagcttgg ggtaaccttt tggggaagga gggctcaaaa agaggaacaa 55201 tgaaacagaa ctaaggtgag ttctcctgtg cccagtttca gaggtcaagg agagtagtag 55261 atagctcaaa cccaactcag ccattgagtg agtgttttca ttttgttcct tggggtctat 55321 gatattttct attttctttt ctttcctttt gttttggttt gtttgtttgt ttggttggtt 55381 ggttggttgg ttggttttag agacagggtt tccctctgtc acccaggctg gaatgcagta 55441 gtgtgatcac atcacactgc agcttctaac tcatgggttc aagccatcct cctgcctcag 55501 cctcctgagt agctgggact ccaggtgcac gccaccatgg ccggctaatt gtttattttt 55561 tatagagaca gggtttcact atgttggcca ggatgttcaa caacttctgc cttcaagcga 55621 tcttccctcc tgggcctccc caagtgctga gtttacaagc atcagccacc acacccagcc 55681 tatgacattt tcttcattat ggctggtact gcctcatggt aagccacaag cctttttatc 55741 ttgtaatatc agagtcttgg aattctttat cttccctact ctatatctag aagacagtag 55801 cactgactta tctgctgcag agctcttaat tctatccttg gccccgattc agaaaagggg 55861 gttcctcatg ctggatctga acaacccagc acaggcaagt ctggggagca aggcaggagc 55921 ctgcatgtgg actcaaagga ccggcacccg aaacaaagca gtggctggca caggtggaca 55981 aaccaggcaa gaaaagcaac tgtccttcgg agcagagtcc acgtgatggg aagaaacagg 56041 tcatgatact tttgagtggt tacatcctac tgtgtttgtg ccttcttttt tcctttctca 56101 gtcttgccaa gagacccatt aatttgtctt ttcagaaaac caagatcctc tccagcattt 56161 atctttgttt tcttgttgca ttgatttcta atcttatctt tattgtttcc ttccttttac 56221 tttattgagt cttctctgtt gctccttttc taacatgtta acttggatgc tcgttcattt 56281 tagcctttct tctttgctaa tgtaagcatt tagtgtgttg gatagatttc cctctaagaa 56341 cagctttagt gggatgtcac aagttttgat ttgcagcatt ctcattattt tgagttctaa 56401 ataggctcca aattccatgg gattttttta tttgactctt gactttggaa atgtacctta 56461 attttcaaac acagatgttt ccccccttta tccttttgtt attcatttct aactacagag 56521 tattgtttca atggtatcag ttctttgaaa tttgctgaga attgtttttt ggcttagaat 56581 ataaacagta ttctagtatt ccagtaatga aaagaagata gattctgtaa ttgttgggtg 56641 ctatgttctc tgtatatacc ttagatcaaa cttgttaatt tatgatgttc aaatcttcat 56701 aacctttata tacattttca tctatttaat ctatcagtta tatgaagagg tgtgttacat 56761 gctccttcta tcatggtaaa tttgcctatt tctccttgta gttctgtctc cctggtgaat 56821 tgaaactttc aaaattcttt ctgccttaac attgattttg tctgatatta acagagccac 56881 atcaacttgg tttttgctta ttatttgcca gtccatctct gtccatcctt ctacattcaa 56941 ccttgctata tctttatgtt ttaaatgtgc ttttcctaag cagtgtatag agtggttgga 57001 tttttttgta atctggtctg acaactggac aatatttgtc ttttaactgt accattttgt 57061 ccatttatag ttataaaata ataatttcca atatatttga atttatctct gtcatcttat 57121 tttaaatact tttagtgaag tatgtgcaca tatattgaaa tatacatgca taccgtgtgt 57181 attagtccat tctcacgctg ctagtaaaga catacccgag actgggtaat ttacaaagaa 57241 gagaggttta attgactcac agttccacgg ggctgggcaa gcctcagaaa acatagaatc 57301 atggtggaag ggaaagcaaa caggtccttc ttcacatggc agcagcaaga agtgcagagc 57361 aaaatgggag aagccatctg ataaaaccat tagatctcgt gagaactcac tcactatcag 57421 gagaacagca tgagggtaac ccaccccatg attcaattac ctcccacctg gttcctccca 57481 tgacacttgg gattatggga actgcaattc aagatgagat ttaggtgggg acacagccaa 57541 accatatcac catgcatggc caaataataa gtgtacaggc caaataaact atcacaaata 57601 tcacagagtg aacacaccta tttaagaaaa agcaaaaagt ctaccatatt attttgtgct 57661 ttttatttcc ctcatctttt attttatttt attttatttt attttattct attttatttt 57721 attttcagac acagtctcac tctgttgccc aggctgaagt caacggcatg atctcagctc 57781 actgcaacct ccacctcctg ggttcaagca attctcgtgc ctcagcctcc caagtagctt 57841 ggattacagg tgtgtgccac tatgcctggc taattttggc atttttatta aagacggggt 57901 ttcaccatgt tgaccaagct ggtcttgaac tgctgacctt aggtgatcca accacctcgg 57961 cctcccaaag tgctggtatt acaggtgtga gccaccttgc caagccatcc ctcacctttt 58021 aatatttctt tttttcttct ctctagcctt cttttgagat tcagttcgtt tatttctttt 58081 ctttctcatt tcatttattc ctccactgac ttggaaactg tatgttctgt ttccattatt 58141 tagtggttat tttaaacact gtaagatggt aggagttcct caatgagtta ctccctccct 58201 cccttccttc ttccctttaa taccacacat caaggaaaac ccaacccaag tagcttcgaa 58261 ctatgtgaat ctagggcagc agttttcaaa gcactgtctg tgaggtcaaa actattcata 58321 atgctattaa aatgctgatt gctctcttca ccttcattat ctcacaaata atgtgcacag 58381 tggaggtttt gagagctacg tgacgtgata tcatgacgga atgaatgcag accagatagg 58441 aaaatacagc tgtcttctat taagccaggc attaaagaga tctgcaaaaa tgtaaaacaa 58501 tgccactgtc acgaattttt tgaagatata gttgtttttt ctaggaacgt gtcatttata 58561 ttaacatgta atggttttat ggttattatt attataagat aagttaatac acatttttaa 58621 atttttctgt tttaactttg tttttttttt gttttttttt tttgtttttt ttgatacgca 58681 gtttcactct tgttgccaag gctggagtac agtggtgtga tctcggctca ccgcaacctc 58741 tgcctcctgg cttcaagcga ttctcctgcc tcagcctccc aagtagctga gattacaggc 58801 acaagccacc atgcccggct aatttttgta tttttagtag agacagggtt tcaccatgtt 58861 ggtcaggctg gtctcgcact cctgacctca tgatctgccc acctcagtct cccaaggtcc 58921 taggattata gatatgagcc accacgcctg gcaggaattt tcatttttgc ttttctttcc 58981 ttttttttct tttttctttt ttttttcgag acagtgtctc actctgccac ccagggtgag 59041 tgcagtggca tgaacacaac tcactgcagc ctcgtcttcc caggctcaag caaccctccc 59101 acctcagcct cccaagtagc tggaactaca ggtgcacgcc accatgctca gctaattctt 59161 gtattttttg tggagatggg gttttgccat gttgcccggg ttggtctcaa actcctaagc 59221 ttaaagcaat ccacctgcct cagcctccca aagtgttggg attagagggg tgagccacca 59281 cacccatccc ttcagtaatt ttcaagagta taaaccgacc ttgagatcca caagtttgag 59341 aattgctgac ctggtgagat gacagacagg attactctgc taagtgcaat tgtgcacata 59401 taaaaccttc tgctacctga tgtgtcttta atgattcgaa atttcataaa gtctaatgcc 59461 acttaacctc tcaaagtgag tttaaagttt gcctgataaa atgcaaagct tgttgagata 59521 ttaaatatga tacttaacaa tcagaactat tccatgagga gataaagaag taacaaagcc 59581 ggctcagatg gatgggatgt aattgcgccg cacagacctg tattcaccat gttatgttag 59641 ttcctgttct aaagatgctt tttttttttt tttttttttt tttttgagac agagtctcgc 59701 tctgtaaccc agactgctgc agtgcagtgg catgagcttg gctcactgca acctccgcct 59761 cccaggctca agcaattctt atgcctcagc ttcccgagtt gctgggatta caggcatgcg 59821 ccaccacacc tggcaaatgt ttgtattttt agtagagatg gggtttcacc atgttggcca 59881 ggctggtctt gaactcctgg cctcaagtga ttcatcaacc tcagcctctc aaagtgctag 59941 aattacaggt ttgagccacc gcgcctggcc taaggattgt ttaaattgga gtttttcttc 60001 ttgttttatg gggaaggatt acttcatcta caattaggtg gaatattaac ccaaactata 60061 tgtaggattt ccaatcttag aggctatagg aactggggga gaaggaaagc ataatactaa 60121 ttatatttgt cctatgtcat cctagggaaa gtgcatattt aattcctgtc agcaaagatt 60181 aaagaaagat tttcagggca aaagttataa acaggcttag tgtgttttat tttgaacggg 60241 ggaattcaca aagcctttgt gaaagcctca attttactgt tagactactt gatagcacct 60301 agttaatata agctagggcg attcaactag ttccattgca cctcccggta aataaatggt 60361 tcctgaggca ttgtgttaag cgaggaatgg aagtttgaag gttaccggca aggctccttc 60421 agccctctcc ggttttcagg ttaatggtgg ggagagtgag tgtccaccag aaagggactg 60481 tgatgtgacg ttaggtcact tgatttctgt aaacttatgg gagacctttc tcctaaagat 60541 gggctcaaga tgccaccatt tacaatgtta ctctcaatga ccttggggaa gtcctgctcc 60601 taggggtggg acacagtgat aagagggagg ggtgtgccgg gcatggtggc tcatgcctgt 60661 aattccagca ctttgggagg ctgaggcagg cagatcacct gaggccagga gttcaagacc 60721 agcctggcca acatggcaaa accccatctc tactaacaat ataaaaatta gctgggcgtg 60781 gtggtgtgca gctgtaatcc cagctacttg agaggctgag acaggagaat cacttgaatc 60841 caagaggcag aggctgtagt gagccaaaat catgccattg cactccaacc tgggcaacaa 60901 gagcagacac tccgtctaaa gcaaacaaca accaaaaaaa aaaaaaaaaa acacaagagg 60961 aaggggtgga agagtgtact gattgttgga ccctgaggat tgcaatttta tcataacatt 61021 ctctacagtc tcttttcaaa aatgaaccag caatcagaga tggcaccgat tggtttggcc 61081 ttcttagcaa catctctccc ttgtcctctg ccatgtggtt tggtggagct gactttaccc 61141 ccagctcctg ctccaggggc aggtgtgaaa ccaggaccct gctcatctac agttcatcct 61201 ctgaccacag tgatggttca cacttgggtg catgactcaa tccagaccaa tgaaagatga 61261 acccaggctg ggcatggtgg ctcacatctg taatcccagc acatgggaag gctgaggcag 61321 gcgaatcagt tgaggtcagg agtttgagac cagccaggct aacttggcga aaccctgtct 61381 ctactaaaaa tacaaaaatg agctgggcgt ggtggtgcac acctgtaatc ccagctactt 61441 gggaggctga ggcggaagaa tcacttgaac acgggatgtg gaggttgcag tgagccaaga 61501 ccatgccact gcacactcca gcctgggtga cagagtgcac ccactgtgga tccagccttt 61561 tttttttttt ttttttttga gacagagtct caccaggctt gagtgcagtg gcgttaatct 61621 tggctcactg caacctctgc ctcccaggtt caagcaattc tcctgcctca gcctcctgag 61681 tagctaggac tagaggcgcg caccaccccg cccagctaat ttttgtattt ttagtagaga 61741 cggggattca ccatgttggc caggatggtc tcgatctctt gacctcatga tccacccacc 61801 ttggcctccc aaagtgctgg gattataggc atgaaccacc gcacctggcc agcccttctc 61861 ttttaaccac caggagaaag aacctctctt ctactgatct tgaacttgaa ggctggtccc 61921 tctcttctgg gggccattgt gaagagaggc tgcctaagcc tgaagtcaac ttaaaggaag 61981 gtgaaacaga gaaaaaccac acattcaggc atttgatgtc ctaaagcaga tctactctgg 62041 ggatttttag atttgaaaac caaatattct ccttccttag cttaaagcaa gtttgaagct 62101 gagatggaca taatgtaatt gctctgcaca agcctgtata ttcattgctt tatgctcatc 62161 cttgttctaa agattagcct tttgcccatt ttctaattga attgcttgat tgtgttttgt 62221 tttactattg agttttgtga gacatttatt tagagtgtct cgctctgttg tccaggctgg 62281 ggtgcagtgg catgatctca gctcactgca acctccgcct cccgggttca agtgatcctc 62341 cctcttcaac ctcctgagta gctgggatta cgagtgtaca ccatcacacc tggctaattt 62401 ttgtattttt agtagaaatg gggtttcacc atgttggcca ggctggtctc aaactcctga 62461 cctcctcaag ttgatcagcc caccccggcc tcccgaagtg ttgggattac aggcgtgagc 62521 cactgcacca gccaacactg agtcgtattt ctttagattt ttgtgattga tgttcccatt 62581 atgcttctct ctcactgaat ctcctacctt cttcatcaga ggcagcgtca tactggactt 62641 taagtgcatc cttcctgtcc atgattttaa attgtacctt gcaaatatac tttgcaccta 62701 aacactgttt agtattcttt tgcgtttttt caaatatcca taaataccta tagtacaagt 62761 cattctgcaa ctagctactt tctaccccaa atcatccctt tgctatctgg cctccatcac 62821 taaaccttac atatctcatt gatttatttt gtctgctcta gcagtccatc atattaataa 62881 gacgagtttt attttttcca ttctactatt gatagacatt taaatcactt cccattcttt 62941 gccagcatga caatgctgct ttgaacatcc ctgtgtgtgt ctcttcgcgc acacgtggaa 63001 gagttcttcc actggtaggc acagcatttg cattctccat ttcagtaaaa gctgccaagg 63061 tcctcttcaa cgtgatggcg taaatccagc tctttctgca agaaggctca ccagccttcc 63121 caccagctct tctccccacc ccagccctgc agccctgcag cctgacccta ggtaatgagt 63181 ggtggggagt gacccaggca tgcacgacct gcccagagga aagggagttc aaagaaggac 63241 ccgagaccct cacaggaacc tagcagattg gcacaagtgc cagatgtatc caggacagac 63301 actttcccca ggactccagg ggtaaaggag tgccccacct ctccactcct tgcctttctt 63361 aactgaattc agattgtcgt gaacattcaa gaaaagcatt gtggctgact tccaaggtta 63421 ctacttgagt ccatcactac agcagttact actcttactg ctccagaccg tcattacagc 63481 agttactact gttaccactt gagaccatcc ttacaagact gaaggaaggg aggaacgtag 63541 aaatgaaaac caaggaaaaa aagaaactgc tttaagtaaa ggctagcatg gggaagaaga 63601 gagctccccg cttctagtga gcaaaggcag cccccttatt tattgggtaa caagagcacg 63661 aggaggtggc aacaattggt cagctgctta attgatcaga ggttcatgtt gttactgaca 63721 ggcttctatt atgcctaatc ataagaaaca tttgttcagc ttccaacaaa gctgaagttt 63781 aaatgatgca attacattca gcacagagga gtatctcccc caatcccacc atataaagag 63841 aaagcacatg catgcaggag agaaacagtg aatggaaggc taggaaatct tggtcttgga 63901 ggagagtgca gattggagat agagggaagg ggaaaataag atgctgggat cttgagggag 63961 gatgtggggt ggtggccaag ctccctggtt catggggatg agaatgaagc ctctgttggc 64021 tctgggtcat gtgacatggg ccaagataag acatggttcc tgcagagaga tggagcgtgg 64081 tctgtctgtc ttgtgctcat gtgtctgtgt caaatggctt ccgagggaac tagagccact 64141 taggtctatg tggtttaagg aaagcacaag tgaaaagaag ccaggaatag tgcagaggta 64201 ggggatggga caccaagaca tgaggccaga gtgtgagaca gaatcagagc tcctgcctgc 64261 agagaatgat gggctgtgtg aggagccctg gccctctgcg gacattttaa taagtggccc 64321 cactttctta ggtgctcatg tgtccgggga ttctagggag gttttgtgag gctgagctaa 64381 aaaaaataac caattccagc taggcacagg ggctcacgcc tgtaatccca gccctttggg 64441 aggctggggt gtgaggattg cttaaggcca ggaggtcaag gctacagtga gccatgattg 64501 tgccgctgca ctccagcctg agcaagaggg tgagaccctg tctcaagaaa caagcaaaca 64561 aaaataacct attccacttt gaccctgaat agggaaaact gaccaaacaa ggagagtagc 64621 cagaggcagg gatggaagca taggagaaac tggcagtgga caagctcctt tggaataatc 64681 caccaagaga ggaaatgaaa cagaatgcct gaatgaatta tgaaagcaga acaaagtctg 64741 aaagaataat cagggcttga gatctagaaa ggatcaatga gatttgtggg aacagacttt 64801 caggggaaac acagagggct aagcggcctg agaaacagct caactcttga ccttcctctt 64861 cctgacacat agcaagctca agtgtaaaag ttcttggttc ctgggggatc ccgaaatggc 64921 agaggtggct gtatctatgc accttattca acaagttggt aaaagtccac ttaaggctga 64981 tcaccaagag ctgcctgctt agaaacagga ggaagctaga tacctaaaat actcccagca 65041 cgtcctggga gatgacaccc taagctctca cctgggaaag gaaaaggatg agagacccag 65101 gtgactgcag gttcaccttg ctggggaaca aggttttctc accttcattt ccagtgcggg 65161 agggaaacca gaatagcaaa agatcaagaa aactgaacaa aaactagaag agaaattggt 65221 ggagggatga cagaggttga aatgggaagg caccaagcag aatcaggtat cctctacaga 65281 gcagagaatt gccctgagca cagaatgcta tatcattttc tcaaagtcac agagctggga 65341 attgccaaca gtgggactgg gatccagatc tgctcccctc attacttatt ctccctggat 65401 ttctcttcag agtggagcaa tgccctggca ttcaactcgt ctgtgatatg acagtcctca 65461 cactctgtgt agtgacctac tatgagtggt acctgggcgc tgcctgccta gaattcatcc 65521 ctccttcttc tggtagcagc atcccaattt tcctctagtt aaccactccc cctctactct 65581 tggtctgtgg gatgtgggga gggggacagg gggagaaggg gaggttggcc ccattatcac 65641 tgcagggcat gcaacccaga cttggcaatc tcagtccatc tagagtcaat cttttctaga 65701 ctcttctaga ctaattcaga aagcagcttt ctctttccac agggtggtta ggaaggtaga 65761 aggtaatcct ggagcttccc gccaccacgc aggaggaatt tgcctgagag caaagccagc 65821 atagagaagc agagctgagt catggagaca ggtggctgaa agtttcattt cggcatatgg 65881 atctcaccat tcctgaagcc actttcaact cttcaatttc atagctaata aattctcatt 65941 tttgcacaat tagctgaagc aaggtttttt gttacctgga agcaatgaat gcaaatgttt 66001 tttaggtaat ttgttaatag taataaaaga taaagaatat aataataaag aataggggca 66061 tttccagggc cgggcacagt ggctcacacc tgtaatccca gtactttggg aggccgaggc 66121 gggcagatca cttgaggtca ggagttggag aacagcctgg caaaaccccg tctctcctaa 66181 aaatacaaga attagccagg cgcagtggca cacacctgta atcccagcta cttgggaggc 66241 tgaggcagga gaatcgcttg aacccgggag gcagaggttg cagtgagtca agatcacacc 66301 actgcactct agcctgggag atggagcaag actacatctc aaaaaagaaa aagaataaag 66361 tcatttctga attgtatctt ttatgatttt aaaaaatgaa tcagagaaga ttcaaggcac 66421 attctgacat gcaagggtta cagtcacagc ccatgggcca ggcatggcac ttcaccccag 66481 tatgctggcg acggccctct ctgcaccaat aaatcagatg cctgcttaac aactatttca 66541 atcagaagtg cattctgcag ttttgaaagt ttccacacgt attatcactc ctcagtcata 66601 aaacaatctg atgaagttag tttctaatgc cttttttttt tttttttttg agacaaagtc 66661 tcggtctgtc acccaggcca gagtgcaatg gcatgatctt ggctcactgc agcctccgca 66721 gttcaaacaa ttctcctgcc tgagcctcct gagtagctgg gattacaggt gtgcaccacc 66781 atgcccagct aatatttgta tttttagtag agacggggtt tcaccatgtt ggccaggctg 66841 gtctcgaatt cctgacctca aatgatctgc caacctcggc ctcccaaagt gctggtatga 66901 caggcctgag ccactgcacc tggctctgac cacacgtctt ccatcaagga cttacatcat 66961 tctttttggc cacccagaac ctctgaactc cctgcctgtg aatctacgcc tgcccaaggg 67021 agaagcagaa acttgctttc tcagagtcgg ctcccatgta gggcacaggc atgtgaccaa 67081 agtccagcca gttaccaagc ccaggtgggg gcagtgcagc atagctgtga cgttaggccc 67141 ggaggccact ctagggagct ggcagcacac agccagctgc cagggcggca gagagaggtg 67201 ccgagggtgt ggcagcagtg cccagcgtgt gaaccgcaga gtgtgcgcca gtggtggagg 67261 tggtggtgtc tgcaagggta tcatccaggc ataccctcca agcctggtgt tctggctggc 67321 ctggagattc cacaagctac ctaataccct ttaaaaacaa aattcctagg ctgggtggga 67381 tggctcacgc ctgtaatcac agtgtctggg gaggccgaga tgggaggatg gctccaagtg 67441 agaggatctc tggaggccag gagtttgaga tcagcctggg caacatagtg agatcccatc 67501 tctataaaaa taattttaaa ataaaataaa taggccgggc atggtaagct cacacctgtg 67561 atcccagcac tttgggaggc caaggcaggc agatcacctg aggtcaggag ttcgagacca 67621 gcctggccaa catgatgaaa ccctgtctct actaaaaata caaaaattag ccgggtgtga 67681 tggtgggcgc ctataatccc agctacttga gaggctgagg tgggagaatg cttgaacctg 67741 gaaggcagag attgcagtga gccaagttcg caccactgct ctccagcctg ggcgatagag 67801 caagactcca tctcaaaaaa agtaaaaaat aaataaaata aataagaaca aaattccttt 67861 tccgcttaac ctgacaagag gagagtctgc tgtctgtagt gggaaacctg acagatacaa 67921 tcaggagagc atccaaggtc tccaggcgga cagtcctctg cagctgcatc ctgtctgtgc 67981 cattggtccc tctgccagcc tcttggccat gtgtgtagcc tgggaagaat gctgccctgc 68041 tccacctgct gtggcaactg ccactatgcg agatggccag gctgctccag ctcccacagt 68101 ggctcgtggt ctgtgcctcc gtcagtctgg cttctcctgg ggcagcggct accggctgga 68161 ctctggccca ggaacagggc tttccctgtg ctcgcctgca cctcgcacca tggtcctggc 68221 cccacgacga ctgaatctgc cagtctccac tcccctgtgc actttctgca ctacagcagt 68281 gggatgtggt gacaggaaca ctaaaggatt gaatgccggg atcctggagg gaaattgaag 68341 ctgctggtgc ttctgcctgg gtcctagggg agcaggaaac actgtggagt acttactgtg 68401 ggctcagcac agttccattt taattgacaa ataataattg tatctatgta tgggatacaa 68461 tatgatgttt tgatccatgt ttacaacgtg gaatgattaa atcaggccaa ttaaacacat 68521 ccatcacctc acatactcat ccttttcttg cagtgtaaac atttaaaatc tcttttcacg 68581 gctttgaaat atacattaca ttattattta ttataaacac cattatgtgc aattgatcac 68641 taaagctgac tcctcctaac tgaaactttg aagtcttcag tcagcgtctt ccctttcccc 68701 atccagcccc ttcccccagc ctctggtaac cagcatactt ctcgccatga gatctgcttc 68761 tagatcggct ttttaagatt ctccatgtaa gtgagatcat gcagcacttg cctttctgtg 68821 cctggcttat gtcgctgagc agaaggccct ccaggctcat ctatgctgcc acaaatgaca 68881 gggtttcttt ctttttcaag gctgactagt attccactgt gtatctacac cacattttct 68941 tttcttttct cttttgagat ggagtttttg ctctgttgcc caagctggag tgcagtggca 69001 cgatcttggc tcactgcaac ctccacctcc tgagttcaag tgattctacc acctcagcct 69061 cccaagtacc tgggattaca ggtgcccgcc accacacccg tctaattttt gtatttttag 69121 tagagatggg gtttcaccat gttggccagg ctggtctcca actcctgacc tcaagtgatc 69181 tgcctgcctc agcttcccaa agtgctggga ttacaggtgt aagccaccgc gcctggccca 69241 cattttcttt agcgattgac ctgatgatgg atatctaggt tgcttcctca tcttggccat 69301 tgtgaataac actgcaatgg acatgggtgt gcaggcatct ctttgacata ctgatttcaa 69361 ttcctttgga tttacaccca gcagggggat ttctggatca tatggcagtg ctacttttac 69421 ttttctgagg aaactccgta ctgttttcca taataactgt attaatttac atttccgtca 69481 acagttcggc gctatttcaa aggtttcctg tgctctaaag cactctatta ggctctcgtt 69541 gcagggattt ggggaaaaaa agacactccc ttgtccagag acatggacac ctgctcgcat 69601 cccaatttgc ctgtcattca ctccttcatc aactgctccc ggccaggact tgttcaagct 69661 gctgagaatg ttttggtgaa caagacagga agcggcagct ctcgtgaagt tcgcgttctg 69721 gaaaagcacc ggtgttggag gaacagcggt tggacctttt ccggcctgga atttgggctc 69781 cccacaaagg agatgcactt tagatgatta ttacttgatc aaactaataa aaggcccgag 69841 atagttaata ataatgaaag cgtggcttcc tcctccctgt ttctaggcca agaggtggca 69901 aggggtagca aattaaaaac atacatattg gtttaacctg aaagtatgtc tatgtgcaca 69961 gctctttttg ctaaagacag caaaaagttt ttactagagc aacctaaaac caattctgtg 70021 acctgagtag cacctcttac caggaattgt tgacaaaaga tgaataaata gaacaatttc 70081 agattctggt aaagcaccgt gaagaaaaca aattgattag tgtaataaga agaggggaat 70141 tcctttttta tatttattta tttatttatt tatttattta tttatttatt tatttttgag 70201 acacagtctc actctgtcgc ccaggctaga gtgcaatggt gcgatctcgg ctcactgcaa 70261 cctccacctc ccaggttcaa gtgattctcc tgcctcagtc tcccaagtag ctgggactct 70321 aagcacccgc caccatgccc gactatgcgt atttttgtag agatggggtt tcaccatgtt 70381 ggccaagctg gtcttgaact cctgacctca ggtgatctgc ccgcctcagc ctcccaaagt 70441 gctgggatta cagtcatgag tgagccactg cacctgacct atttttattt attttttgat 70501 acagggtctc actctgttgc ccatgctgga gcgcagtggt gcgatcttgg ctcactgcag 70561 cctcgaactc ctaggctcaa gccttagcca ccagaatagc tgagattcta agtgtgcacc 70621 accacccctt tttttccttt tttattgtta taggcatggg ggtttcagta tgttgcccag 70681 gttggtcttg aacccctggc ctcaagtgat cttcccacct ccgtctccca aagttctgag 70741 attacaggtg tgggccacca cacctggccc aggactcctt tagatggagt ggccaggaag 70801 ggctctctga ggacagacat tggaaagatg acagagaggc agctgtgaga gggtgtgggg 70861 caaggacgct ccaggcaaag gcaacagcag agacaaagat ttccacgcaa gacaagctca 70921 gtgtgggctg gagacagtgt taggagacac ggtgcatcca cagaggtctc ctgcaggtgc 70981 actggagaga cgtggcccag aagcagggca aggtgtgctg ggaaagagaa cgtggaggtc 71041 agcaggccat caattcccct ggaaccacac ctggatgaag ccgaactagt aaaccttttc 71101 caagcagaag aaaagatctg gggtccaagc caagtgatcc cactgcccaa acacatacct 71161 ggtgggtctt cacctggtgg gtctgcagct gaaccctgga gtgcaaatga aaacccacta 71221 ggactcagga taggagggaa caaatatgag agtagtccca ggtgtgatga aggagtggac 71281 ctaccgggag atcaaggaat ccaggttacc atgaggaagg acaaacacaa aacacacagt 71341 actgactgtg tcccagatcc tgacccaaga catttattta tttatttttg cgacggagtc 71401 tcgctctgtc acccaggctg gagtgcagtg gcatgatctt ggctcactga agcctccgcc 71461 tcccgggttc gagtgattct cctgcctcag cctcccaagt agctggaatt aaaggagccg 71521 cccccaccac gcccagctat tttttttttt tttttttgag acggagtctc attctgtcgc 71581 caatttttgt atttttagta gagacagggt ttcaccatgt tggccgagct ggtctccaac 71641 tcctgacctc aggtgatctg cccgccttgg cctcccaaag tgctgggatt acagacgtga 71701 gccaccgcac tcagctgccc aaaagtttta gatacatcaa cccattcagt cttcacaaaa 71761 gcccatgagg tgggcactat catttcctcc atttagaggg aaggaaacca cagcccagga 71821 gactacagga cttgtccaag cacacacagc tggagggagc ccagcaaggc tggaatccag 71881 gcagtctgca tcccagtcca tgcccttgcc tgccactcta catggcctca cgagatgcct 71941 ctccaggtac ctaaatgtag aagtcaaagg aatactgtgg aagcagccaa ggtctgcatt 72001 tgggaatcca tacgttagat tcgtggaatc agttctggag aaaaggtgga atttttcagg 72061 tttttttttc atagaggtaa tcaactacca actttccaat ggacacagcc catctcagtg 72121 tgaaagcctg tccctagagc agagccagag gaggtacagc cagctccgac tctgcaactt 72181 cagtaaggag caagaagatt tctgtttaga ggttaccact ccactcccca acaaaaaatg 72241 agagtagagc aaggtttcca ggcagctgcc tatactccgg gaaaggatgt tggtggagaa 72301 ggaatcgaga gagggccaca ctgttggggg caaggcaagg ggctcactgc caagggacag 72361 acagaaccag ggaaggcaaa gatcacagaa agggaacggg agtgcagagc ttctaatgga 72421 gagactgcat gatgctctcc tgggctggag ctggggagct gggaacagct aaggtatgcc 72481 cacaggcaca gaggcatctc ttctgcccct tggaacttct gactcggagg ctgtcaacat 72541 tccctctctt gagcaggagc aggatgggag aagttgcagc tagattgctt tggagtctat 72601 ccagcatttg gacataacct gtcccttgtg tggtcactgg aaagaagtta agaactgctc 72661 ctaggctgga catggtggct cacacctgta atcctagcac tttgggaggc cgaggcaaga 72721 gcatcgcttg aggtcaggag ttcaaggcca gcctggccaa catggtgaaa ccccctctct 72781 cctaaaaata caaaacttag ccagccatgg tggcacatgc ctgtagtccc agctactcgg 72841 gaggctgagg caggagaatt gcttgaacct gagaggcaga gggtgcagtg agccgaaatc 72901 acgccattgt actccagcct gggcgacaga ctgagactcc atcttaaaaa aaaagagaac 72961 tgctcctgtg tagagttcag ttgaggccag gcacaatggc tcatgcctat aatcccagca 73021 ctttgggagg ccaagccggg cagattgctt gagcccagga gttcaagaac agcctgggca 73081 acatggcaaa gcctcaactc cgcaaaaaat ataaaactaa gccaggcata gtggtgtgcg 73141 cctgtagtcc cagctacttg ggaggctgag gtgggaggat tgcttgagcc tgggaggtca 73201 aggctgtagt gagccgtgat ggagccattg cacaccggcc tggatgacag agtgagaccc 73261 ggtctcaaaa aaaaaaaaga gttcagttgt ctggttacaa ctgcagaaaa taacctatgt 73321 gtggtggtcc attttcaatt ctaagtgcct taatataggt ttaaacaggc tacaaagaga 73381 taaagaagca gaatgtacta agtcaccccc ccacccccac cccctcctgc ccatcctgct 73441 tcttgctttc cctttcatct agctgccagg cgcctatcag tcaggacctc cttaaccatc 73501 cccttccacc cctccaaaga atttagttgg gccaggcaca gtggctcatg actgtaatcc 73561 cagcactttg ggaggccaag gcaggcagat cacctgtggt caggagttca agaccagcct 73621 ggccaacatg gtgaaacccc atctgtgcta aaaatccaaa aattagccgg gcacctgtaa 73681 tcccagctac tcgggaggct gaggcaggtg aattgcttga accgggtggg cagaggttgc 73741 agtgagctga tatcacgcca cttcattcca gcctgggcaa caagagtgaa actccgtctc 73801 aaaaaaaaaa aaaaaaaaag aatttagttt aggctagctt gcaaagtaaa taattgtact 73861 ctttcttatc agctaagtcc agtcactatg gccataactc aaatgtttga agagtcctga 73921 gacagttgca atgcattatg ggctgcaata aaatgcagca gaaagaccct aaagaacata 73981 ctgaaatcct taacccaaat atcaataggt gacatgcaga aagattgtaa cccaatagta 74041 ctcagccagt gaggaactag gggagggact tgtgcacttg ggaataaatt gcttgttgaa 74101 atcgttgcag gtgtgcccgc atgccagaca ccccatcttg caaggcagcc actaaggtct 74161 cgcttctgct gttctcccat ccctaagtcc attctttggt ttggacaagt gagtgtgttt 74221 gttttttctt tttctttttc tttttctttt ttccctttta tctagctgcc aggcacctat 74281 cagtcagggc ttccttaacc accccctccc acaccaccac cgaggctgca gtgcagtggc 74341 gtgatcttgg ctcactacag cttccaccta ctgggttcaa gcgattctcc tgcctcagcc 74401 tcccgagtag atgggattac aggcacccgc caccacaccc ggctaatttt tgtattttta 74461 gtagagacgg gtttcaccat gttagtcagg ctggtctcaa attcctgacc tcaggtgatc 74521 cacctgcctt ggcctcccaa agtgctggga ttacaggtgt gagccacggc acctggcaga 74581 gtttgtttct cacacaattt acaatagagg agtattctgc cattttgcaa agtgttctca 74641 catatgttac ctcatttatt ctgagagaga gggaggatta ttgtgggaat tggctcatgt 74701 gattatggag gctgagaagt cccatctgtc tgcaagctgg ggaaccagga gagctggtgg 74761 tgtaattcag tccaaatctg aaggcctgag aactgatgtc tgagggcagg agaagatgga 74821 catcccaact caatagagag agcagagaca gaaaacttgc tcttcttctt tttttttttt 74881 tttttttttt ttgatatgga gcgcctgcca ccacacccag ctaatttttt gtgttttttt 74941 gtggagacag ggtttcgccg tgttggccag gctagtctcg aactgctgac ctcaggtgat 75001 gcactcagct cagcctccca aagtgctggg attacaggtg tgagccacca cacctggcag 75061 agccaccaca cctggcagag ccactggttt tttttttttt aattgactta tttttcttac 75121 atataattta gccttaccct aaacaaaaat aaatattaat aacagtaaca gccagcaatt 75181 ctaagtactt gctatgtacc aggcatatac atacatacgt acatatggga ttttttgaga 75241 cagagtctgg ctctgtcgtc caggctggag tgcagtgaca caatcacagc tcacttcagc 75301 ctcaacctcc tgggttcaat caatcctccc acctcagcct caaaagtggc tggaaatata 75361 ggtacatgcc accacaccca gctaattcgt ttgtatattt tgtaaaggcg agattttgcc 75421 ctattgccca ggttggtctg gaactcctgg gctcaagcga tctgcccacc tcggcttccc 75481 aaagtgctgg gattacaggc ataagtcact gtccccagcc cagcccagta cgtatgttta 75541 tagacaacta taaacatatg tatacacaca acatgtggtt gtgtaaagat gtgtaataca 75601 catacacaca aacatataca tattcatttc aatcctcaca gcaaacctac aaggtaggta 75661 tattagtctc ctcgttctac aaatgaggaa actgaggccc agagaaatga gtgcccttgc 75721 ccaaggtgac acagccaaca agtagccgcc aggactgcag cccacagggc cagctccaga 75781 ttccacgctt ttaacccctg aaatcatgag tttaatggct tagctacagg ttaagtcact 75841 tgccagggaa cccagagcta gcaatgggat gggaactgag atgtgaatcc cagtagtcta 75901 acctcaaaac ctgcgtgttt agctcctaca tgcgttccca catgggttat ggacagctct 75961 actaatggaa gagggcgctg gactggggtg gaattgtgaa agaactagcg acgtggagag 76021 aggcttgaag ctcagcctaa ataaatgttg gggacagccc ccactgacag ctggacttgg 76081 cccgccatct tcccttcttg gtacctctca atctttcgta aaaataaacc aaaagctgac 76141 gttcaccact tcatgccatg caaacaactg tttttggttt tgttcttcaa atataaaata 76201 acatgaaaca tgctttgtca atgtacttac ccccatccca tcccacactc tctcttccca 76261 aatacccgca tcaccctcca ctcttcagag atactagacg gggtggctca gcagggctgg 76321 gaacaaacac tggccaatgt aaaggcttct gaatactccg cagcactgga gagtctgtct 76381 tcagcacgat cccctgttaa ccctaggaaa gatgtgcacg tgacaggggc ggggtgcgtc 76441 atggaagcct gacgttcctc atgccagatc agacgggaag gctgtaaaag ggggaaggtg 76501 aggccagctg gcagaaagca ggaggaacgc agccttgctc caggttcgta ctcccatacc 76561 tgaaacgcct cctccttctc ccgcctaccc aggctgaaca cctggagagg ggcagggtag 76621 gaatgccccc taggtgacca gagtggagag gaggacactg aaggggtcta aggatcacga 76681 gaccgcgtgt ggcttgactt attcgccaag tggagattcc tcgatcatcc tcagagcctg 76741 tggtcccttc catctagtca agaatgggca ggaggccagg cgagacaaaa ctcaagggga 76801 aagccgtact catttttatc aacatctcct tgccaggaag cttgtggtat tcactcaaat 76861 ccaggacatg ataaacattt acagaaaata ataaaagcag atgatgtgag gacagtcaag 76921 agacgcaaag tacacttatg ctccccctcc tcatctagaa agtgggtctg catggagtgt 76981 aacagcagaa tctgacatag ctgactccac cctgcttcta acctcacaag ctaatggtct 77041 ttgtcattcc tgcacatcag ccaagctaat catgggaaga atttagttta cagtttaact 77101 ttaaagcaag gatgacaata atcccttccc aacactcacc cccaaggaga taaggagggt 77161 gtacacacta gcagctacat catattaaag atttatagga acattgtgac ctcagcagga 77221 caaagaagtt gcacagtgcc cctcctcgga cactcactgc cacccagatg tccgctatca 77281 tcggtcacct cttgatctta aacctcactc tcttccccct tccctaatgt aaaaggagcc 77341 caaaattcta ttacttaaga ttgttctcaa gacactaggt ctgctatctc ctcagtctgc 77401 tggctctctg aaataatgtc atttttcctt cccccaatat ctcgtctctt gatttattgg 77461 ctgttgtgca gagattggta tgagctttgg attcaactgc aggaggacgg tgaagccaga 77521 agagatagag agagagagag aaagagaaat gagcagctgc tgaatcatga gggcttccct 77581 aatggcaggg aaagaagaaa gttttgacag acagtggggc aagacacttg cacaaaacag 77641 gaagccagag gaatccttaa aacccacaca tcatacaaaa ataaagaaag aaaataaaac 77701 atgataaaag aaatgaaacc catacatcag atcacttact cctctgcctt atacccttca 77761 agagtttcct atgactttgg gaataaaatc caaatcctca ccttggacaa cagggccctg 77821 caccatgtgt tttccctcca ccccctccac ccccagctgt cacgtctctg acctggtttc 77881 ccatcacttc cctcatgctc attcccaccc caccccccca gccacactgg cctcctttcg 77941 gtttctccat cacaccaagc ttcgtatcgc ctcagggcct ttgcaaaaaa tgtcccatct 78001 gccccgagtg ttcctctccc agctcttcca tggagagcac ctacaggcct cggctcaaat 78061 gtcacccctt caaggacatt tccctgctgc ccccagcaca gattacccta ttctatcatc 78121 caattctatt tccttccgaa agcgtgcccc acattacctc ctatattact gtagagactg 78181 tactagtgtc tcacccctcc ccacaactac aatgtcaatt tcatgaaggc aagaaccatg 78241 tctgccttta aaaaaaaaaa aaatcagggt ctcactctgt cccacaggct agagtgcagt 78301 ggcacaatct aggctcactg cagctttgat ctcctgggct caagtgatcc tctgacctca 78361 gcttcctgag tagctggaac taaaggcatg cactaccaca tctggctaat ttttgtattt 78421 ttttgtagag atcaggtgtc accatgttac ccaggctggt cttgaactcc tgagctcaag 78481 caatccacct gccttggtct cccaccaagt gctgggatca taggcatgag ccaacacacc 78541 cagccacaat gcctgtcttg tgatgctcta tccccaacca tccatggaga gatgaaaatg 78601 catgatctgg ttgttacctt gcactcagct gcctaatcca agagacatgc aggtgacttt 78661 gagaaggctg gggctgtcct gggcccacag aattcagaca gaagacactg gggaaaggag 78721 aggaggtggg acagggacca gacacagagc actggaaagg agaggatgtg caggagcaag 78781 agctgagccg gattttccag acggcggcag actctggggg gctgcctgga cttggggcat 78841 tttttctggg ctggccctgg acttagagaa tatttttgtg atttagcggc cccaattttt 78901 gtgaaatgtc ctctgctgac tgagagatta acctgctttg actgggccat gctaaaaaat 78961 gaacaggaat ttgatatagt ccagtgtgga ggtggagagg gagggaggtg aagaaagaac 79021 agaaggaagg ggtaagacac actgaggagc agatggaagt tacctgaggg gatcctccaa 79081 agaagaaagg gtgggctatg caaacagtgg atgtaaaaag acgcctggaa gaagttagga 79141 cttggatttc taggagagat taattcgact tgtaggagca gggatttggg gagaggcaga 79201 gggctctgct gcctctgaaa gagctcaccg agtgcgctcc ctctctctga ctaaggaccc 79261 ctgtggataa gaaagctctc tgccctagag aagaagagca gtcaaggtgc ttacaggctc 79321 tttcccgaag actgaagacc tttcccaggg tggactgtgg tgcccttccc tgtgcaggca 79381 ccctgaccac tgtgtgttca gagaagaggg atgtggcaca cacacctgag aagcactggg 79441 actcctgcga gaccaagtgg gaggttgcca ctcctgggga ccatctgatg ggaagcagcg 79501 gtgaagaaaa gggaatggcg tctggatgtg gatggaggca ggtgcttcta ctgctgcccg 79561 gccagagagg agcatcccca gagggtccgt ccctagccag gctggaagct tgtctctgcg 79621 catctggtcc cctctcgaag cagttctcaa aagccgttga gacacactgc tgggatgtgg 79681 caacacacca agaggcaaga aatgtaagtc catggttcct ggtaatactg aaaaaactag 79741 gctttggaag tttaagtaaa caaagaaatg agaggatata aggtaagctg ttacaacgtc 79801 actgtcagtt atgagtattc tgttatgctt tcttttgagc cagtctcgct cttttgccca 79861 ggctagagta cagtggcatg atctcggctc actgcaacct ttgcctcccg ggttcaagcg 79921 attctcctgc ctcagcctcc tgagtcactg ggattacaca tgcccgccac cacacccagc 79981 taaatttttt tgtattttag tacagacagg gtttcaccat gttgcccagg gtggtctcaa 80041 actcctgagc tcaggcaatc cgcccacctc agcctcgcaa agtgctagga ttacaggtgt 80101 gggccatcat gcctggcccc cattacactt tctagagaga ggaagaagta ctaacctcac 80161 agtgcagact ggagactgca cagcacatgc gcagatggtc ttgacagagg atgcctccct 80221 ctgtgctggg gaactcatcc ttgcccattc agtttccgtg gctcaagcag gctgatgtca 80281 gcccttggcg atgctgttgg cgtgtgtgac ccacaaatca ttcagaggca gcagagccag 80341 cctggagatc cgggctgtga cttcaggaca ggagaccctc tctctgcaca ggttattatc 80401 ctaaactgcc aaaggtcact gcgtgaagca attttactaa gagccaccaa acccctctct 80461 tccctccaac caagttcatt cacccttctc agaagagccc aagagacctc tgtgagagta 80521 agacaggagt tggcgatggt gcaatttgca gaacgccatg tttgttagcg ccttgtcttc 80581 atgtggccgg agggcactcc tgaagcttca aactcctgta aaaacgcttc ctgtccaggg 80641 agggtggtat gggaagtctg aggtggctgg gagaggaact ggcgaagtaa ggagttggag 80701 caggagtcag agagacatgg gatggggaaa ctcacaagca gactccaccc cagtcaggga 80761 acctcactca ccgttccatt aagggctgga aaaggaggca ccgacaggtg tctcacagaa 80821 gcgaatggta gctgttcagg aggctggacc caggaaaaag gacataaaat cggggaggtg 80881 cctcacccta atttttctca actccagcag aaccttctta gagagcctca ctcaattcca 80941 catttctcct cctctcagac aagcccggca gctccgtcca cgtcctttgc tgaatggagc 81001 cagcgtggtc acttctgccc tatcgaggaa gctgaaggcc ctggcccatc tctgacagct 81061 cagggcacag gaaagccttg cctctgccta gtctgccctc tgcttcccca ccagcgtcat 81121 ctgtggccca cggctcttgg ctggacctgc actgaaggac acagctgctc cgagctacac 81181 aacagggtgg ctgtccctcc gccctccatg tgtggtggct gctgcagcag gatacagcca 81241 gcctcctggc tgctatccca cagtgctatg gttgattgcg ttcttacact ggaagataaa 81301 cagtgcttgt gtcaaagaaa atagaaacaa ggccgagcat ggtggctcat gcctgtaatc 81361 ccagcacttt gggaggcgga ggtgggcaga tcacttgagg tcaggagttt gagatcagcc 81421 tgtccaacat ggcgaaaccc catctctact aaaaatataa aaattagctg ggtgtgttgg 81481 catgcgcctg taaccccagc tactcgggag gcagaagcag aagaattgct tgaaaccggg 81541 aggcagaggt tgtagtgagc caagatcgca ccactgcact ccagcctggg agtgctcata 81601 tgagcactca ctctacagag accctgtctc acaaaaataa ataaataaat aaataaataa 81661 aataaataga taaaaaaaag aggtagaagc tggaaaagtg gttatcttag gagtggaagg 81721 tttgctgact gagagggaat aaaagggcct ctagggtgct gaaaatggcc tgcatctcaa 81781 cctgggtggt catacacaaa tgtacagata tgttaacatg catccagctg tatccctaag 81841 ttgagctcac ctttttgtat gtatgttacc tgttaataac acaattgata aaagccccct 81901 ctaaaatccc acatcttcct atggctacca tagatatttt aggcttataa ttggttttgc 81961 tccccatatt tgctattcaa gatctgtata attttggaac tctccccttt ttccttgcca 82021 acagggtccc aattttgtta ggttcgggaa gtgcccaatt aatactcatt tgcccaaact 82081 ctcttgcggc taggaattgc catgagatac aatgtcagcc agttggaaat aagtacacat 82141 ctgctgggta gggcttccag aaaagctatc attttccttt ttccttcttt agagacaagg 82201 tctcgctgtg tggcccaggc tggagtacag tggtgccatc acagctcact gcagcctccc 82261 attcccaggc tcaagcgatc ctcccacctc agcctcccaa gtagctggaa ctataagagt 82321 gcaccaccac acccaactag attttttttt tattttttgt agagataggg ggtctcactc 82381 tgttgcccag gcaggtctcg aacccttggc ctcaagcgat cctcttgcct cggcctctca 82441 aagtgctgga attacaggcg tgagccactg catctagaaa agttgtcatt ttcctgcttg 82501 aaaaagggat agatccatct ctcatacgtc ttttgctctc tgtcctccct gccccctgtt 82561 tttctgctca gaatgtagat gcttcttagg accatgaagc gaaagccaag atctggggat 82621 agtgaaacag atagccagag gaccccaggc tgtggatgga tgagtgatcc tgagcagcag 82681 ccgtggacat gggttcccta attccttcat gtcccatgag aaaaataaac ctctatgtgt 82741 ttaagtctct gttatttggg catatttgac agtcctcaaa tgagatataa gggttttatc 82801 ttcataaatg gcaaagactt ctagagacag aagggaaggc ttctgggtgc taattcactg 82861 tggcttcata tcatcaagta gcaagggctc aaggaaaaaa aaaagtgggg gaaggaatcc 82921 ccaagacatt cccaaaatta agactacatt ctcccctccc accttctcct gagagcagtc 82981 ttccctgctt cctatggaat ctgaattctt ctggcattgg aaaaggctca gggaagatgg 83041 ctcaacgtga agaggatctc agttcacgta aactttacca agagcgggtc ttagagtgtt 83101 atgtccactg catcactgtt tctcaatcta tttttaacca cttcccactt ctgataaaca 83161 gaaactttac tttctggttt gcattaagaa aaccagggct tttctctctc tcttggaaaa 83221 tttccaagca ttttgaaacg tgctgccaac tccccgggtg tgacacccgg gagtttctga 83281 tatcctccct gcacttcctg aaaatctgcc gtctaattta agtttcctaa gattctgcaa 83341 agcagacaga aaccctggtg actccagcga taccacaact cttaggggaa gatgagtcat 83401 gtatgtggca gaagcaggta atggaggaaa acagagacaa ggttgggctg gggaaaggaa 83461 cacagagcaa ctagttttgc cccccatctg gtgtgtcttt tgcaggatca gccagcacct 83521 gactgcaagt ttccgcctcc agacactctg ctgcctctcc tccctggctc agggtcagca 83581 ctgaaaatca gcccaaggaa acgaccagct gccttctggc ccttatcaag ggtggtgtga 83641 acccgcgcta gctacccagg ggtctggccc ggaagcggca gggggagggc tttttagtcc 83701 tgtgtctagt ccagtggtca cctctgcacg gacagcagat ggagaagggg gctcggggaa 83761 ggaggaaaag gtcaggcaac ctctccagga ttcctggaat aaacattcct cctcccaagc 83821 tcttcattct gtttcacaat caaatccaga gtcttataag cattcatgga aaccatgaaa 83881 gcaaacaact tggtaaacgg tagacagaaa cagtaaatag aatcttccct ctacttcctg 83941 tgtatccaga atgtttagag acagggagtg gggactgacg acagattgca gcatctggga 84001 tctggggcta tggcaggaac gaggacatgt ggaaggctgg gagttgcctg ggtcttggga 84061 gtctggggtt gaaggttact ccttcatggt ttcagggaaa gagtgaagac cactggcaag 84121 aagcctggta accacacaga cacaccctgc aggaagcagt tgacaccatc tcacaccctg 84181 acctgtcccc catgtcctcc tctgtgggat gcacaggact tccaggggcc accaaagaag 84241 ttgggtctaa ccggggccct cacagtacag tggcgagtgt agtgcggtgg taaagaccat 84301 ggactccagc cgggcgcggg ggctcacgcc tgtaatcccc acactctgga aggccaaggt 84361 gtgcagatcg cttgagtcca aaagttcaag accagcctgg gcacatggtg aaaccccgtc 84421 tctacaaaaa atacaagcat tagccgggca tggtggcgca ctgtaatctt gtaatcccag 84481 ctactcggga ggctgaagtg ggaggatttc tgaagtcgca ggtttcagtg agccaagata 84541 gtgccactgt actccagcct gggtgacaga gtgagaccct gtctaaaaaa aaaaaaaaaa 84601 aattagccag gcatgctggc gtgcacctgt aattccagct actccagagg ctgaggcagg 84661 agaatcactt gagcgtggga ggcagagatt tcagtaagcc atgattgtgc ccctgcactc 84721 cagcctgggc aacagagcga gaccctgtct aaaaaaataa aaaataaaaa aagcccatgg 84781 actttttagt tcagagttca gttcctccgt ccaccagatg catgactcca ggcaaatcac 84841 ttctttctct acatacttca gcttcctcgt ccgttaaagg ggatgagtgt gtaacttcct 84901 aggatgtttt aagatttaca tgagttcatc tttgcaaagc tttttgcatg gcacttgata 84961 cacagtgtca tacaatacaa taaatgttgg ctgccatgtc tgctatgtcc tctgtctgtt 85021 gtaatgaatt gccaaaaact tggtggctta acataacaga aatttattct ctcacagtac 85081 tgaagactag aagtccaaaa ccagtgtcac ggggctgaaa tcaaggtgca gggtctccta 85141 attttccctc ctttttcctc tgtcctgacc aagaaacaga gtgccttgac caacctgcga 85201 cccagccagc tgcgtgtttt ctctgcagac ttgaacccaa gccagggctt gaacattccc 85261 aggcactgat aaaggtgttt agacactgaa agaaaccagc cctggccctg ggccaaattc 85321 cttaaaccct catgcaaacc cctcccttgc cccctccctt ttctctcatt gttcgtcttg 85381 aggatgctgc agcccactct gcaaattccc ctccgaaatg ctttgcacta atcaccctgg 85441 catttggtgc ttcttgcttt gggaacccat ctggccccct cttgggatgg tttggggaac 85501 tcctcgacag aaactccctt gccacagctt ttcggggtga ctccagccag atccggctgg 85561 gatggaacac aggtgttcag caggattcca tttctccgga ggctcttgag aaaatccatt 85621 ctttgcctct tccagcttct ggtgggcgct agcttgcagc tacatcactc cggtcctcca 85681 ggatcacagc atcttctccc cttctgtctg atttctccct ctacctcttt ctttttgtga 85741 ttacatttag gaccatctca gatagtccaa gattatctcc ccatctaaag attctttttt 85801 tttttttgag acggagtctc actctgttgc caggctagag tgcagtggca cgatctctgc 85861 tcactgcagc ctccacttcc cgtattcaag tgattctcct gcctcagcct ctggagtagc 85921 tgggattaca ggattacaga tgcacaccac catgcctggc taagtttttg gatttttaat 85981 agagatgaga cggggttttg ccatattggc caggctggtc tcgaactcct ggcctcaagt 86041 gatctgcctg cctcggcctc ccaaagtgct gggaacacag gtgtgagcca atgcgcccag 86101 ttctgcctcc tcctttttgt gattatgttt agggccatct cagatagtcc aagatgacct 86161 ccccatctca agcttcttaa tcccaatgca aagtcccttt tggccgtggt tccagggatt 86221 aggatatggg tctcttttgg ggggaccatc atttggccac cacagctggt gtgattggcc 86281 aggagtgaag tctaataaac aggactcagg tggacggaga gtggaggaag gagaggtctc 86341 tgcagagaga ggttggggta tggctaccgg cagtgggtgg gaacaaccac agaggcccac 86401 caccaaggcc acagagacag agtctgcaga gttctcccaa gaagctagcc ttgagacgct 86461 ccttttaatt ttcttcttca gcaatgtacc agcccgggga tgggttctgc agggactgtg 86521 gaagtgaagg accccacgac tggagtgaca ggggaggctg cataatagac aactaaaaga 86581 gaagatgcac ggaggggggt gggcataaac tgttaccttc acagtgggtg agctcctctg 86641 gtgaggcctc tgcagggctc aaaaaaggtg atacgcaaat gaactcaggg gaactgcact 86701 ggcacagggg ccattacgtc tcaaagtcca tcagcacgtg ctggtctaga ttctccttcc 86761 ctcttcatcc ctaggagagt tccatttctc tctggcatcc tgtacttcct tattaattgt 86821 catttattta aaattgcaca ctgggtgcaa tggcccacac ctgtaatccc agcacttagg 86881 gaggccaaag caggagaatc gcttgaggcc aggtgtttga gaccagcctg agaaacacag 86941 caagaccctg tctctacaaa aaatagcaaa ttagccaagt gtggtggcca agcccgtagt 87001 ctcagctact tgggaggccg aggtgggagg accgctggag cccaggaggt cgaggctgga 87061 gtgagctatg atcacaccac tgcactccag cctgaagtga cagagcgaga cccccacctc 87121 taaaaataat aataatcagc ccagacgtgg tggttcacac ctgtaatccc agcactttgg 87181 gaagccaagg tgagagtatg gcttgaggcc aggagtttga gaccagcctg ggcaacataa 87241 caagacccca tctctaccaa aaataaaaat aataaaaaca tttagaaata ataataataa 87301 aacaaagtta catatctcaa atcctaagtg atcctctctg ccccgggtgc acattctccc 87361 ttatccatgc catcctcaat cccctcaacc cctggcgact gcaacaccct cctagcggat 87421 ccctctgctt ccagttctgt ccttctaggg agcattcttc acacagtagc aggcttgatc 87481 ttctagaatc atatatcaca tcacatcatt cccctcttat ggccctaagt gacttcccac 87541 tgctcaaaaa tggagtctta aggccgggtg cagtggctca cacctgtaat cccagcattt 87601 gggaggctgg caattgtttc attcattcat tcatttaaca aatatatatc caccccccag 87661 gatgtgccag gatctgtgcc aagcattaga aatatcatgg ttgaaaaata cagatgggtc 87721 caggcgcaga ggctgacgcc tgtaatccca gcattttggg aggccgaggc aggcggttca 87781 cctgaggtca ggagttcaag accagcctgg ccaacatggt gaaaccccgt ctctactaaa 87841 aatacaaaaa ttagcagggc atggtgacag acacctgtaa tcccagctac tcaagaggct 87901 gagacgggaa aatcaattaa acccaggagg cagaggttgc aatgagccaa gatcacgcca 87961 ctatgctcca gcctggatga cagagcaaga ctccgtctca aaaaaaaaaa aaaaggtata 88021 tagcctggat catcagtgct gaccatgagg aaaaaggtcc tgtggctttt gctgcaagtc 88081 ccttaaacca caagacagct ctcaagtgat tacacgtcca ggatgtctta acaggtccca 88141 gaaaaagata aatgaggcat ctgaggacaa gcgtcctctg agtttttgaa atggacttat 88201 ctccattctg aagtcactct catccatcta tcctccactg tcatccagga aatagttact 88261 aagggcctaa catgttccag gctgcacgcc aggtcctgag gatgcaatgg tgagcaaagc 88321 agctgtgttc cctgctttgt ggacagtctc ataaggggag gtagacttca catccatctt 88381 aatgcaaatc ctcatgctaa tgaacacatt ataataaatt gagacgtggg acataggaag 88441 gaatagtaaa agatgctatg agagcaacca acaatgagaa aataaactct aaaaaaatta 88501 aatttacagc tgcctccaaa atatttagaa tcatgcaaaa cctctatgct gacaactaga 88561 aagcattgct gagaggaatt aaagacctaa ataaatggag agagatgtcc agtgtcaatg 88621 atcccaaatt gatctatgga ttcagtgtga gcccattcaa aattccagca ggcttttctg 88681 tagaaattga ttttaatatt tatatgaaca tgcaaatgac ccagaatagc cacagcaaat 88741 ttagaaaaga agaaggaagt tggaggactt actctatttc aagacttatt atgaaaccac 88801 agtaatcaag actgtgtgca ggccgggcac agtggttcgt gcctgtaatc ccagcacttc 88861 gggaggccgg ggcacgtgga tcacttgagg tcaggagttg aagacttgcc tggccaacat 88921 ggcgaaaccc tgtctctaat acaaatacaa aaattagccg ggggtggtgg caagcatctg 88981 taatcccagc tactcaggag gctgagacag aagaattgct ggaacccagg aggaggaggg 89041 ttcagtgagc tgagatcacg ccacaacact gtagcctggg agacagagca agactccatc 89101 taaaaataaa aattttaaaa aataataaaa ataaataaat tttaaaaata ccgtgtgcaa 89161 atgatgaaag gcaagacaca gagtctctca gaagaataga gcgtccagaa atagtaccac 89221 acacacatac aaccaatggg tttttagtac aggtgccgcg gcaattcaac agggaaagga 89281 aaatctggaa taattggata aggaggtaaa agcttcttag gacataaaag cacaaaccat 89341 agaagaaaaa ccagataaat tagaccttgt caaaatgtaa cacttttttg cttttcaaga 89401 gacaattaag aaaatgaaga ccaggcacag tggctcatgc ctgtaatccc agcactttgg 89461 gaggccgagg tgggcagact gcttgagctc aggaggtcga tgccagcttg gacaaaatgg 89521 caaaacctca tcagtacaaa aaatagaaaa attagtcagg catggtggca ggtgcctgca 89581 gtcccagcta cttggagggc tgaggcggaa aggtcacctg agcctgggag gtcaaggctg 89641 caatgagctg tgatggagcc actgcactcc agcctgggca acaaagtgaa actctgtctc 89701 aaaaaaaaaa aaaattaaaa gaagagaaaa gtcacagaat ggcagaaaat attggtaata 89761 tatacatctg acaaatgact agtaccaaga atataataaa gaactctcag aactcaacta 89821 tatgaaaata atccaatttt tctttttttt tttttttttt ttgagacaga gtttcactct 89881 gttgcccagg ctggagtgca gtagtatgat cttggctcac tgcaacctct gcctcctggg 89941 ttcaagcgat cctcctgcct cagcctccct agtagctggg attacaggcg cccgccacca 90001 cgcccagcta aattttgtat ttttagtaga gatgaggttt cactatgttg gccaggctgg 90061 tctgaaactc ctgacctcaa gcaatccacc cacttcagcc tccaaaaatg ctaggattac 90121 aggtgtgagc caccatgccc agcctaaaat tttttaaatg agcaaaatat ttaaacagac 90181 attccacgaa agaagatata caaatggctc ataagtactt gaaaaaatgc ttaacaccaa 90241 taatcatcaa gaaagtataa attaagacca caacaaaggc cgggtacggt ggtccacgcc 90301 tgtaatccca gcactttggg cagctgaggc gggcagatca cctgaggtca ggagttcaag 90361 accagtctgg ccaacatggc aaaaccccat ctgtattaaa aatacaaaaa ttagccaggt 90421 gtggtggcac gtgcctgtaa tcccagctac tcagggggct gaggcacgag aatggcttga 90481 acccaggagg tgaagtttgc agtgagtgga gatggcgcca ctgcactcca gcctgggcga 90541 cagatcgaga ctctgtctca cacacacaca caaaaatacc atggcatact tattagaatg 90601 cccacaatta aaaactaaca attgtcaagt gttggtgagg atgtggagca actgaaatgc 90661 tcgtaggctg ctggtgggaa tgcaacatgg cacagctact ttggaaaact ggtagtttct 90721 taaaaagtta ataaaacgtt ttaccatacc acctggccaa tacactgcca ggtatttgcc 90781 caacagcact gaaaacattt ccacacaaga atgttcttaa cagctgtatt catactagac 90841 aaaaactttt taaaaaaaac ccaaaccaac caaatatcca tcaacagatg aatggataaa 90901 taaaatagga aatatcctta aaacagaata ctattcagca atagaaaaga tgcttaacct 90961 gtaagagggg aatcgaaatc taggaggtca gggccttttt tttttttttt tttttttttg 91021 agacagagtc ttgctctgtc gcccaggctg gaatgcagtg gcgcaatctt ggctcactgc 91081 aacctccgcc tcctgggttc aagtgattct cctgcttcag cctcctgact agctgggatt 91141 acaggcgtgc gccactatcc cggctaattt ttgtatattt agtagagacg ggatttcgcc 91201 atgttggcca ggctggtctc gaactcctta cctcaggtaa tccacctgcc tcggcctccc 91261 aaagtgctgg gattacaggt gtgaaccact gtgcccagcc agacagggat ttcttaagga 91321 agtaaatggt tatctgaaag aaagctacaa attaactgtg aaagttggaa tgggtaacaa 91381 gaaataaaga cataccaggc aggggctggt tcaagctccc catggcagat ggaaaacatg 91441 acacattcac agaagtggaa aatggccctg cccagacagc aaagagagag aagagtgatg 91501 tgagtggaag ctggcaaggg ccatgctagg caggcctttt taaggatgtt gatatttctc 91561 ctaaaagcag tgagaagtct ctgaaaggac aagaaggtcg tataattgga ttgacatttt 91621 taaaatgatt ccactaatac agtatggaga actgactgcc agggaagcag atggatgaga 91681 ggaagtgagt cagggccctg ctgaaatcct ggtagggatg atggcactga gatgggatga 91741 tggagatgga gacggatgga tggattcata tttggaatcc ataagtattt ggggagtaca 91801 tcagactgca ttctagggat aaggaagaag ataatattga gattcacttt aagactgact 91861 ctagtcaggc tcagtggctc acgcctataa tcccagcact ttgggaggcc aaggcgggcg 91921 aatcacttga ggtcaggagt tcgagaccag cctgaccaac atggtgaaac cccgtttcta 91981 ctaaaaatac aaaaattagc cgagtgtggt ggcacatgcc tgtaatccca gctactcggg 92041 aggctgaggc atgagaattg cttgaatctg ggaggcagag gtggcagtga gccgagacta 92101 tgtgccactg cactccagcc tgggcgacag agtgagattc agtctcgaaa ttaaaaagaa 92161 aaaaaaaaaa gactgactct aagataatgg tgcaggagat gtgatgggaa gagatcttac 92221 gccccaagac agggaagggt ggaagaagga agatctgagg ccaggtctgg acacatttga 92281 tttggttgat ttgggacacc tgtgagacac ccaatggcag tgtcaactag gcagttgagc 92341 gtgtacatgt gaagctcaca gaaacctagg gtgaagattc aaaccagggg tgtcatgaag 92401 gtggtaactg gaactacgga aacagttgca gtcgactaga aggagagcac aggcgagaac 92461 aggaaagaac ctggagccaa gcttgaagga agtcaccatt tcggggatga gtaggggaat 92521 ggtgagctag caaaggggta cgagaagtga ccaaaggaaa gagagggaag ctagcagcgt 92581 aacgtatcag aatttgtgaa aagaggatgt ttccagaagg gcgtggaaat tgtgttgaat 92641 gctaataaaa taagaaaagc gtaagtccac tgaatttagg aacatggagg tcactgctaa 92701 catcgggaag agaaaatccc atggaaaata ggcacagaac ccagattagg gtggactgag 92761 gggcgagtga gaattaagaa tgtggaagtg attacacaga actttagata agggtccaac 92821 ttcctgtgat gacaggctgc attataaaga cgatgagaga aaactgaact tggaaggaac 92881 tgtgaaggat attgctcttg tgaaagttta cccccttgaa ctcggtctga gaggttgggt 92941 ttccttttcc tgtgacgtag gtggacatca tgaactgtga gactgaaagc ctccttggtg 93001 atgctccctt cataactgcg gtgaacattc ataactgctg tgaacccaga gagggggttt 93061 cgttggggcg tcttccacag ggggactcac gacagtgacg gcgaaaggca ggaggtgggg 93121 ggagaagaac gtgcagagac agagaggcag tcccactcct caacacttgt tttccaatga 93181 gtttcctgga ttctgcaatt aaatccagat cctcgtgcat gtttatagga aatcataaaa 93241 gcgaaatgac acaaataaat ttggtggcaa caagagagaa tggtctcttt cttagttctc 93301 ctgttgatct gagaagcgac gtgtgctggg ggaaggaaat aaagaactag aacgccttca 93361 gaggagagta gagtggacga gaaagaaaag ctaaatatca aaatgtctag accctggggt 93421 tacagaccac aaaaagaata aatgactgac tgactgaatg actgaatgaa tggcaaacaa 93481 taggaagtag gtatgtccta gggaaagaat ggggattctg tgagcaggaa tcaggccaca 93541 gatggcagaa cagagacttc tttctgagac ttcctcccta gttcaagagg aaaacacgtt 93601 ccaacgatcc ttgctgcatt aggtgtgtat ttcaaaggga ccttcaggag ctcctaacct 93661 ggggctacca aggtcagaga gggggaaaca gtgagacttt atcagggaaa ggaatggagc 93721 ctgagtgcca ctccctgacg aaaagccaca tgggcacgtg aactcgggcc tcctgaattc 93781 ctgacacttt cccaagaggg acagcacatc catgccctga gcctctggag gctgcaccaa 93841 gtggctcact ctctgtgaag ctgcgttaca ggcttgagac gaggcagcac ttacgggcat 93901 ctgctaagtg ctttatatac acaacctcac tgaatcctcg cagaagtcct aggatgtcag 93961 aattcgcatg ctcacttgac tacagcaaac tagacttcag aaaacggatc tgcccccaag 94021 gttgtaatga agggacctgg atttgagccc atatctgcat gactataact cccacaccat 94081 ttccatcaca acaagatgca ggccaggacc agcctgtttc agctgagacg ggcaaaagaa 94141 atgtgttggg ggcagagggc aggagaccat agagaaagta aaaaaattga agaagaaaag 94201 gaaggagtgc cgggtgcagt ggctcacgcc tgtaatcgca gcactttggg aggccaaggc 94261 aggcggatca cttgaggtca ggagtgcaag accagccaac atggtgaaac cctgtctcta 94321 ctaaaaatac agaaattagc tgggcgtggt ggcaggtgcc tgtcatccca gggaggctga 94381 cgcaggagaa tcgcttggac tcgggaggca gaggttgcag tgagctgaga tggcgccact 94441 gcactccagc ctagacaaca gagcaagatt cagcctcgaa aaaaaaaacc aaaaaaaaag 94501 aggaaaaata aaaaggaagg aggaaagctc atttctgaga atgagcagaa tgacagaaga 94561 aactgcctaa gaaagagaaa agaaaataat ctgacttggg aagtaaaaaa agaaaaatgc 94621 ctgggaattt tttttttaaa tcaggaatta aggacagaaa aatctgaatt tggaagtcaa 94681 gagtgaggat tctctaggga gagaataaag atgctgggcc tttcctttgg gtttgagagg 94741 aaatagtgct taaagactga ccttcctttc tctggtcacg tggctggtgc tagtataaat 94801 gcttactacc agccaggttc ctcaccattt tttcctgcat ggcatggaca gaagcagggg 94861 ctaaagctct gttcctctct cctggaagct tgcagacctc ccttcagaac caatcccaag 94921 aagccaccta tccggaacaa cacaaggcaa ggcagctagt gtagtgcttg tgctctggga 94981 agagaggcca ggatcaggtt tgaggggaag gttctgggag actggaggaa ggatgtcaga 95041 accaaatgga agactcatac ccagagacag gagctagtgt gaggagcaag gaaacagcca 95101 ggaaatcaag gctggagagt ggacagaggg acaggacctg ggttggggcc agcaagggtg 95161 acaccccaag tggaaactga gttaccccgt gaagaagacg agttctattt atggtcagaa 95221 tgccacgctc aagggccagt gaaggcctgt gggcttttct gaatatttct ctaaggtctt 95281 ctagggacaa gccttcctcc taaggctagt gctggttcat ccttcttccc ctccaccctc 95341 ctctaagcct cccagcttcc ccttttcttt tctttttttt tttttttaga cggagttttt 95401 gctcttaccg cccagggtgg agtgcaatgg cgcaatcttg actcactgca acctctgctt 95461 cccggttcaa gcgattctcc tgcctcagcc taccaagtag ctgggattac aggtgcccgc 95521 catatcaccc ggctaatttt tttttttttt agtggagatg gggtttcacc acgttggcca 95581 ggcttgtttc gaactcctga cttcaggtga tctgcccgcc tcggcctccc aaagtgctgg 95641 gattacagaa gtgagccact gcgcccagcc ccagcttccc cttttctaga gagtcccaga 95701 gtttggtgtt ggacaggaga ggacaaggtg ggggagaatc tgtgctctgc aaaacagacg 95761 aacggaagta ggaagctggg ctggtggctg ggcacttggg tcctttggag agctggcact 95821 gggtatccgc gcctggggcg ccacagcctc tttatactga taagcatctt acagctctct 95881 cccggcactg atgcctttct ctcctgcagg tctctgtctg agaaagggac ttgagtaaag 95941 ttagaaactg gcaacagtaa cttcctgtat ctgtgaggac aggaggagaa aggctaggga 96001 aactggaggg ggaattccct tgggaagcca cgagttggtc tcctccagag acacatgatg 96061 gcaaacataa tcactactgc ttaatatccc gccccaggca aagtcaataa ttgctctggg 96121 tagttcgagc aggtggtgtg agcaagccgt ggtggcatca gaaagcacag tcctgggaag 96181 gggacggtgc cggggaggat gtccgcatcc tgaaggagag ctggctgcgc gggctgtgag 96241 tgagacctcc ctgaccccgc ccttttttgt tcccctccag gatgctgccg gactggaaga 96301 gctccttgat cctcatggct tacatcatca tcttcctcac tggcctccct gccaacctcc 96361 tggccctgcg ggcctttgtg gggcggatcc gccagcccca gcctgcacct gtgcacatcc 96421 tcctgctgag cctgacgctg gccgacctcc tcctgctgct gctgctgccc ttcaagatca 96481 tcgaggctgc gtcgaacttc cgctggtacc tgcccaaggt cgtctgcgcc ctcacgagtt 96541 ttggcttcta cagcagcatc tactgcagca cgtggctcct ggcgggcatc agcatcgagc 96601 gctacctggg agtggctttc cccgtgcagt acaagctctc ccgccggcct ctgtatggag 96661 tgattgcagc tctggtggcc tgggttatgt cctttggtca ctgcaccatc gtgatcatcg 96721 ttcaatactt gaacacgact gagcaggtca gaagtggcaa tgaaattacc tgctacgaga 96781 acttcaccga taaccagttg gacgtggtgc tgcccgtgcg gctggagctg tgcctggtgc 96841 tcttcttcat ccccatggca gtcaccatct tctgctactg gcgttttgtg tggatcatgc 96901 tctcccagcc ccttgtgggg gcccagaggc ggcgccgagc cgtggggctg gctgtggtga 96961 cgctgctcaa tttcctggtg tgcttcggac cttacaacgt gtcccacctg gtggggtatc 97021 accagagaaa aagcccctgg tggcggtcaa tagccgtggt gttcagttca ctcaacgcca 97081 gtctggaccc cctgctcttc tatttctctt cttcagtggt gcgcagggca tttgggagag 97141 ggctgcaggt gctgcggaat cagggctcct ccctgttggg acgcagaggc aaagacacag 97201 cagaggggac aaatgaggac aggggtgtgg gtcaaggaga agggatgcca agttcggact 97261 tcactacaga gtagcagttt ccctggacct tcagaggtcg cctgggttac acaggagctg 97321 ggaagcctgg gagaggcgga gcaggaaggc tcccatccag attcagaaat ccttagaccc 97381 agcccaggac tgcgactttg aaaaaaatgc ctttcaccag cttggtatcc cttcctgact 97441 gaattgtcct actcaaagga gcataagtca gagatgcacg aagaagtagt taggtataga 97501 agcacctgcc gggtgtggtg gctcatgcct ataatcccag aactttggga ggctgaggca 97561 ggtggatcac ttgaggtcgg gagattgaga acatcctggt caacatggga aaaccccgtc 97621 tctactaaaa atacaaaaaa attagctggg catggtggca catgcctata atcccagcta 97681 ctctggaggc tgaggcagga gaatccttga acccgggagt tggaggttgc agtgagctga 97741 gatcacgcca ctgcactcca gcctagcgac agagcaagac tccatttaaa aaaaaaaaaa 97801 aaaaaaaaaa gaagcacctt caggctggag aagcagcgta gctaacacaa gtccagtcct 97861 tgtgatgtgg ctggtagttg gggatggcca ggctgaagca gagagtccta gagaaatctc 97921 gatacaagct tcaaagcaac acctagacac tgctctagcg gttgatcctg gagataaacc 97981 aacaagagag agatggaaga gaaatactaa atgaggtcaa agaagactca gaaaggttct 98041 gagcctggag atgagcaggg aggcctcagg gcttagacct ttaatgatag gggtttccct 98101 gcattggttt gacctgttgc ctttttgatg tgctctgttt gttttcatgt gttgtcttgt 98161 ctcccctgct aaactgggag ctgccagggg tctgggtctt atctccttcc tccatggtac 98221 cccacacagg ccaggatgtg gtttggtacc cagcaatcag agattggcac tccctcatac 98281 aggggaaagc aacctggtct agcaaattga aaataaagat gataaaactc tgaagtgaat 98341 gtccacaatt tttgtaacac tgctgcaagc acagggaggg caaaaatagg agagagaacc 98401 agattcagca gcaaagggga aaagtgatgc atctgaaacc acagaaacat cttaagataa 98461 agaggtggtg tgggcaaagg aagtgagagc actgacggaa tcgtctttcg agggatgaat 98521 agagagtccg gacttagaaa ggatgcaggg tgattccgat tcacagtcat ctgtctgcca 98581 cagggcctta aaacatgtca ccctcctgct cagaaacctg cagagctttc ctatcatcaa 98641 aatgtttcct tggccagttg caggggctca tgactgtaat cccagcactt tgggaggctg 98701 aggtgggagg atc // LOCUS AC002563 136436 bp DNA PRI 26-SEP-1997 DEFINITION Human PAC clone 127H14 from 12q, complete sequence. ACCESSION AC002563 NID g2439515 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 136436) AUTHORS Connell,M, Goela,D and Harper,M. TITLE The sequence of H. sapiens PAC clone 127H14 JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 136436) AUTHORS Waterston,R. TITLE Direct Submission JOURNAL Submitted (26-SEP-1997) Department of Genetics, Washington University, 4444 Forest Park Avenue, St. Louis, Missouri 63108, USA COMMENT SUBMITTED BY: Genome Sequencing Center Department of Genetics Washington University St. Louis MO 63108, USA http://genome.wustl.edu/gsc mailto:sapiens@watson.wustl.edu NOTICE: This sequence may not represent the entire insert of this clone. It may be shorter because we only sequence overlapping clone sections once, or longer because we provide a small overlap between neighboring data submissions. This sequence was finished as follows unless otherwise noted: all regions were double stranded or sequenced with an alternate chemistry; an attempt was made to resolve all sequencing problems, such as compressions and repeats; all regions were covered by sequence from more than one subclone; and the assembly was confirmed by restriction digest. MAPPING INFORMATION: This clone was originally isolated in the laboratory of Professor Graeme Bell, Howard Hughes Medical Institute and Departments of Biochemistry and Molecular Biology, and Medicine, The University of Chicago, Chicago, IL, USA. The clone was provided by the laboratory of Dr. Roger Cox at The Wellcome Trust Centre For Human Genetics, Oxford, UK. Some contig information was also obtained from Yamagata et al., Nature 384:455-8 (1996). SOURCE INFORMATION: This clone is from the first release of the human BAC library. The library contains cloned DNA from a human male fibroblast cell line 978SK. See: Shizuya et al., Proc. Natl. Acad. Sci. USA 89:8794-7 (1992); Kim et al., Genomics 34:213-8 (1996). The clone is available from Research Genetics, Inc. (http://www.resgen.com). VECTOR: pBeloBAC11 Selection: chloramphenicol NEIGHBORING SEQUENCE INFORMATION: The clone sequenced to the left is 7E17, 200 bp overlap. The actual start of this clone is at base position 1 of 127H14; actual end is at 136436 of 127H14. This clone contains polymorphic bases with 7E17 and 162B15. This clone contains STS WI-13962 (NID:g1343786). FEATURES Location/Qualifiers source 1..136436 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="12" /clone="127H14" /clone_lib="CITB-978SK-B" /map="12q" repeat_region 220..509 /rpt_family="ALU" repeat_region 586..657 /rpt_family="L1" repeat_region complement(1375..1768) /rpt_family="THE" repeat_region complement(2196..2490) /rpt_family="ALU" repeat_region complement(2502..2537) /rpt_family="L1" repeat_region complement(2538..2829) /rpt_family="ALU" repeat_region 3158..3449 /rpt_family="ALU" repeat_region 7328..7626 /rpt_family="ALU" repeat_region 8360..8652 /rpt_family="ALU" repeat_region 9665..10178 /rpt_family="MER" repeat_region 10191..10263 /rpt_family="MER" repeat_region complement(10514..10647) /rpt_family="ALU" repeat_region 12209..12501 /rpt_family="ALU" repeat_region 12785..12902 /rpt_family="ALU" repeat_region 13366..13661 /rpt_family="ALU" repeat_region complement(14193..14485) /rpt_family="ALU" repeat_region complement(15520..15546) /rpt_family="L1" repeat_region 16203..16487 /rpt_family="ALU" repeat_region 16825..16843 /rpt_family="L1" repeat_region 17517..17807 /rpt_family="ALU" repeat_region complement(18517..18808) /rpt_family="ALU" repeat_region complement(20096..20388) /rpt_family="ALU" repeat_region complement(21096..21225) /rpt_family="ALU" repeat_region complement(21635..21907) /rpt_family="L1" repeat_region 23046..23337 /rpt_family="ALU" repeat_region 23907..24197 /rpt_family="ALU" repeat_region complement(24681..24972) /rpt_family="ALU" repeat_region 25012..25065 /rpt_family="L1" repeat_region 25567..25865 /rpt_family="ALU" repeat_region 27142..27431 /rpt_family="ALU" repeat_region 28072..28104 /rpt_family="L1" repeat_region 28514..28670 /rpt_family="L1" repeat_region 28700..28935 /rpt_family="ALU" repeat_region 28951..29040 /rpt_family="L1" repeat_region complement(29084..29496) /rpt_family="ALU" repeat_region complement(29517..29659) /rpt_family="ALU" repeat_region complement(29679..29966) /rpt_family="ALU" repeat_region 29968..30817 /rpt_family="L1" repeat_region 31559..31676 /rpt_family="ALU" repeat_region 31685..31936 /rpt_family="ALU" repeat_region complement(32096..32465) /rpt_family="MER" repeat_region complement(32508..32621) /rpt_family="MER" repeat_region 32938..33043 /rpt_family="ALU" repeat_region complement(33738..33866) /rpt_family="ALU" repeat_region complement(34802..35179) /rpt_family="L1" repeat_region complement(35189..35481) /rpt_family="ALU" repeat_region complement(35496..35543) /rpt_family="L1" repeat_region 36972..37263 /rpt_family="ALU" repeat_region 37895..38289 /rpt_family="THE" misc_feature 42009..42429 /note="match to EST R20494 (NID:g775275) yg46e04.r1" misc_feature 42029..42302 /note="match to EST T78033 (NID:g696542) yc97e08.r1" gene 42275..54355 /gene="WUGSC:H_127H14.2" CDS join(42275..42433,46331..46494,47994..48087,48370..48520, 50567..50700,53963..54031,54278..54355) /gene="WUGSC:H_127H14.2" /note="AMP-activated protein kinase beta; 95% similar to X95577 (PID:g1185269)" /codon_start=1 /evidence=not_experimental /db_xref="PID:g2439516" /translation="MGNTSSERAALERHGGHKTPRRDSSGGTKDGDRPKILMDSPEDA DLFHSEEIKAPEKEEFLAWQHDLEVNDKAPAQARPTVFRWTGGGKEVYLSGSFNNWSK LPLTRSHNNFVAILDLPEGEHQYKFFVDGQWTHDPSEPIVTSQLGTVNNIIQVKKTDF EVFDALMVDSQKCSDVSGMNTVILYHMRAELSSSPPGPYHQEPYVCKPEERFRAPPIL PPHLLQVILNKDTGISCDPALLPEPNHVMLNHLYALSIKDGVMVLSATHRYKKKYVTT LLYKPI" repeat_region 42996..43156 /rpt_family="ALU" repeat_region complement(43175..43289) /rpt_family="ALU" repeat_region complement(43306..43598) /rpt_family="ALU" repeat_region complement(43619..43904) /rpt_family="ALU" repeat_region 45417..45498 /rpt_family="ALU" repeat_region 45517..45737 /rpt_family="ALU" repeat_region 46977..47269 /rpt_family="ALU" misc_feature 48447..48484 /gene="WUGSC:H_127H14.2" /note="match to EST AA131038 (NID:g1692666) zo16h03.r1" repeat_region complement(51803..52093) /rpt_family="ALU" repeat_region complement(52419..53023) /rpt_family="L1" repeat_region 53012..53158 /rpt_family="L1" misc_feature complement(53202..53411) /gene="WUGSC:H_127H14.2" /note="match to EST H88397 (NID:g1069976)" misc_feature 53963..54032 /gene="WUGSC:H_127H14.2" /note="match to EST AA131038 (NID:g1692666) zo16h03.r1" misc_feature 54276..54537 /note="match to EST AA131038 (NID:g1692666) zo16h03.r1" misc_feature 54276..54804 /note="match to EST H64787 (NID:g1023527) yr58g07.r1" misc_feature complement(54383..54742) /note="match to EST R52329 (NID:g814231) yg75a10.s1" misc_feature 54755..55072 /note="match to EST D59775 (NID:g960881)" misc_feature 54755..54962 /note="match to EST D80363 (NID:g1178240)" misc_feature complement(55028..55654) /note="match to EST W72204 (NID:g1382653) zd69g12.s1" misc_feature complement(55247..55381) /note="match to EST AA074330 (NID:g1614198) zm15e01.s1" misc_feature 55279..55654 /note="match to EST W68041 (NID:g1376910) zd39f01.r1" misc_feature complement(55290..55647) /note="match to EST AA074330 (NID:g1614198) zm15e01.s1" misc_feature 59821..60076 /note="match to EST R39283 (NID:g796739) yc91a11.s1" misc_feature complement(59838..60181) /note="match to EST W69508 (NID:g1378779) zd45a02.r1" misc_feature 60155..60525 /note="match to EST W42440 (NID:g1326940) zc22e04.s1" misc_feature 60165..60631 /note="match to EST H10788 (NID:g875608) ym04c10.s1" misc_feature complement(60775..61155) /note="match to EST T75443 (NID:g692205) yc91a11.r1" misc_feature complement(61034..61390) /note="match to EST T84907 (NID:g713259) yd52h06.r1" misc_feature complement(61182..61585) /note="match to EST R24294 (NID:g779182) yg32a08.r1" misc_feature complement(61244..61729) /note="match to EST H10809 (NID:g875629) ym04c10.r1" gene complement(62262..>135084) /gene="WUGSC:H_127H14.1" CDS complement(join(62262..62285,64181..64484,71689..71868, 72006..72084,74775..74855,75654..75782,75880..75988, 78393..78485,82209..82348,84281..84416,84570..84650, 86208..86377,86621..86725,87233..87324,87498..87678, 88227..88364,92275..92439,92729..92846,94508..94615, 95339..95479,102532..102636,104525..104660,108194..108387, 109190..109387,116441..116494,126080..126277, 131400..131574,132620..132729,134968..>135084)) /gene="WUGSC:H_127H14.1" /note="putative RHO/RAC effector protein; 95% similarity to P49205 (PID:g1345860)" /codon_start=1 /evidence=not_experimental /db_xref="PID:g2439517" /translation="VLDNQIKKDLADKETLENMMQRHEEEAHEKGKILSEQKAMINAM DSKIRSLEQRIVELSEANKLAANSSLFTQRNMKAQEEMISELRQQKFYLETQAGKLEA QNRKLEEQLEKISHQDHSDKNRLLELETRLREVSLEHEEQKLELKRQLTELQLSLQER ESQLTALQAARAALESQLRQAKTELEETTAEAEEEIQALTAHRDEIQRKFDALRNSCT VITDLEEQLNQLTEDNAELNNQNFYLSKQLDEASGANDEIVQLRSEVDHLRREITERE MQLTSQKQTMEALKTTCTMLEEQVMDLEALNDELLEKERQWEAWRSVLGDEKSQFECR VRELQRMLDTEKQSRARADQRITESRQVVELAVKEHKAEILALQQALKEQKLKAESLS DKLNDLEKKHAMLEMNARSLQQKLETERELKQRLLEEQAKLQQQMDLQKNHIFRLTQG LQEALDRADLLKTERSDLEYQLENIQVLYSHEKVKMEGTISQQTKLIDFLQAKMDQPA KKKKVPLQYNELKLALEKEKARCAELEEALQKTRIELRSAREEAAHRKATDHPHPSTP ATARQQIAMSAIVRSPEHQPSAMSLLAPPSSRRKESSTPEEFSRRLKERMHHNIPHRF NVGLNMRATKCAVCLDTVHFGRQASKCLECQVMCHPKCSTCLPATCGLPAEYATHFTE AFCRDKMNSPGLQTKEPSSSLHLEGWMKVPRNNKRGQQGWDRKYIVLEGSKVLIYDNE AREAGQRPVEEFELCLPDGDVSIHGAVGASELANTAKADVPYILKMESHPHTTCWPGR TLYLLAPSFPDKQRWVTALESVVAGGRVSREKAEADAKLLGNSLLKLEGDDRLDMNCT LPFSDQVVLVGTEEGLYALNVLKNSLTHVPGIGAVFQIYIIKDLEKLLMIAGEERALC LVDVKKVKQSLAQSHLPAQPDISPNIFEAVKGCHLFGAGKIENGLCICAAMPSKVVIL RYNENLSKYCIRKEIETSEPCSCIHFTNYSILIGTNKFYEIDMKQYTLEEFLDKNDHS LAPAVFAASSNSFPVSIVQVNSAGQREEYLLCFHEFGVFVDSYGRRSRTDDLKWSRLP LAFAYREPYLFVTHFNSLEVIEIQARSSAGTPARAYLDIPNPRYLGPAISSGAIYLAS SYQDKLRVICCKGNLVKESGTEHHRGPSTSRSSPNKRGPPTYNEHITKRVASSPAPPE GPSHPREPSTPHRYREGRTELRRDKSPGRPLEREKSPGRMLSTRRERSPGRLFEDSSR GRLPAGAVRTPLSQVNKVWDQSSV" repeat_region 62909..63422 /rpt_family="ALU" repeat_region complement(63716..64008) /rpt_family="ALU" repeat_region 64922..65211 /rpt_family="ALU" repeat_region 66198..66335 /rpt_family="ALU" repeat_region 68552..68845 /rpt_family="ALU" repeat_region complement(69447..69543) /rpt_family="MER" repeat_region 69544..69825 /rpt_family="ALU" repeat_region complement(69831..70040) /rpt_family="MER" repeat_region complement(70565..70853) /rpt_family="ALU" repeat_region 72338..72629 /rpt_family="ALU" repeat_region 73003..73296 /rpt_family="ALU" repeat_region complement(74422..74709) /rpt_family="ALU" repeat_region complement(76143..76261) /rpt_family="ALU" repeat_region 76434..76727 /rpt_family="ALU" repeat_region complement(77464..77755) /rpt_family="ALU" repeat_region complement(78951..79242) /rpt_family="ALU" repeat_region 79653..79811 /rpt_family="ALU" repeat_region 79838..80129 /rpt_family="ALU" repeat_region complement(80306..81004) /rpt_family="L1" repeat_region complement(81555..81829) /rpt_family="ALU" repeat_region complement(82529..82817) /rpt_family="ALU" repeat_region complement(85714..85983) /rpt_family="ALU" misc_feature 85984..86339 /gene="WUGSC:H_127H14.1" /note="match to EST AA131000 (NID:g1692491) zo15f04.s1" misc_feature complement(86621..86728) /gene="WUGSC:H_127H14.1" /note="match to EST T87377 (NID:g715729) yd83e05.r1" misc_feature complement(87233..87329) /gene="WUGSC:H_127H14.1" /note="match to EST T87377 (NID:g715729) yd83e05.r1" misc_feature complement(87497..87679) /gene="WUGSC:H_127H14.1" /note="match to EST T87377 (NID:g715729) yd83e05.r1" misc_feature complement(88227..88292) /gene="WUGSC:H_127H14.1" /note="match to EST T87377 (NID:g715729) yd83e05.r1" repeat_region 88380..88973 /rpt_family="L1" repeat_region 89326..89744 /rpt_family="L1" repeat_region 90126..90418 /rpt_family="ALU" repeat_region 90858..90883 /rpt_family="L1" repeat_region complement(91090..91373) /rpt_family="ALU" repeat_region complement(91414..91458) /rpt_family="L1" repeat_region complement(91459..91750) /rpt_family="ALU" repeat_region complement(91879..92169) /rpt_family="ALU" repeat_region complement(95006..95296) /rpt_family="ALU" repeat_region complement(95457..95484) /rpt_family="L1" repeat_region 96938..97094 /rpt_family="ALU" repeat_region 98039..98120 /rpt_family="ALU" repeat_region 98688..98970 /rpt_family="ALU" repeat_region 99172..99465 /rpt_family="ALU" repeat_region complement(100257..100522) /rpt_family="ALU" repeat_region 101381..101639 /rpt_family="L1" repeat_region 101673..101964 /rpt_family="ALU" repeat_region 101974..102482 /rpt_family="L1" repeat_region 104876..105166 /rpt_family="ALU" repeat_region 105223..105515 /rpt_family="ALU" repeat_region complement(105874..106125) /rpt_family="ALU" repeat_region 107279..107569 /rpt_family="ALU" repeat_region complement(108676..108979) /rpt_family="ALU" repeat_region 111119..111346 /rpt_family="ALU" repeat_region 112633..112923 /rpt_family="ALU" repeat_region 115240..115315 /rpt_family="L1" repeat_region 118471..118754 /rpt_family="ALU" repeat_region 119175..119253 /rpt_family="L1" repeat_region 119676..119842 /rpt_family="L1" repeat_region 121171..121460 /rpt_family="ALU" repeat_region 121469..121757 /rpt_family="ALU" repeat_region 121914..122204 /rpt_family="ALU" repeat_region 127485..127775 /rpt_family="ALU" repeat_region 129108..129167 /rpt_family="ALU" repeat_region 129232..129293 /rpt_family="ALU" repeat_region 130876..131156 /rpt_family="ALU" repeat_region complement(133155..133390) /rpt_family="MER" repeat_region 133395..133684 /rpt_family="ALU" repeat_region complement(133708..133831) /rpt_family="MER" repeat_region 133934..134225 /rpt_family="ALU" repeat_region 134380..134665 /rpt_family="ALU" BASE COUNT 40415 a 30319 c 29949 g 35753 t ORIGIN 1 gatcatacat tgcatataat tgttaggttc atctagtttt ctttaatcta ggagagtctc 61 ctgatgtgtt tgggtgggtg ggtgtgcatg tctttcacga cattgacttt ttaaaaataa 121 tttcaagcca tttgtcttgc agaaagttgc tcaaatagaa tttaccctat tattccctct 181 taattaaaaa aagctaggta tattgagaaa actacatagg gccaggtgca gtggctcatg 241 cctgtaatcc caacactctg agaggccgag gcaagtggat tgcttgaggc tggaagttcg 301 agaccagcct ggccaacatg gtgaaactct ctctgctaaa aatacaaaaa tttgccacgc 361 gtggtggcag gcacctttag tctcagctac tcaggaggct gaggcgggag aatggcttga 421 acccaggagg cagaggttgc agtgagctga gatcacgtca ctgcactcca gcctgggaaa 481 caaagcaaga ctccgtctcc ggaaaaaaaa aaaaaagaag aaaaaggagg aggaggagga 541 ggaggaggag aagggagaag ggagaaggga gaagagaaga gaagagaaga gaagagaaga 601 gaagagaaga gaagagaaga gaagagaaaa aagaaaagaa aagaaaacta cacaggtaaa 661 atgtccttgc tagggtattt tatcagaagg cacatgatgt catttggtca agattctctt 721 attattaaag ggagaagaac agtatctggc cttcatttca gcagcaagca gtctctgaat 781 acctcttctt tgcctggcag tgtactatgt cctgcagacc tcaaagatgc cctcaaaaac 841 ttcacattct gtggggaaac agaaatattc atgcactcat tcagtcaata acatatttct 901 tgagctccaa caatgtgcca gacaccactc taagtactag aaacaaggat gatatttaaa 961 tatttaacaa gtttggcatg agcacagaat cagagaagat gctctgctga gcccaataac 1021 ccttaagttg ctggatcatg gggaaaccgg ggaaaaggca gaggagcaag tgggatggct 1081 gggatattgg ggaaggcgga caagcttcag agtagctgct atttatcagt gagtacagag 1141 tatttcaaca ttttaacagc tgggaatgac agccgtggct tgggcatgga gatgaaaaat 1201 tcagccaagg tccctgacct catggattga ccttctagtg caggagtggg cgaacttctt 1261 tggtaaatga ccaaatagta aatatttttg gccttgcagg ccctacattc tctggacaga 1321 gaggcaaagt cccaactact caactctgtc tttgtagcac aaaagtagtg tgtcagtcca 1381 ttttgcactg ctataaagaa atacctaaca ctgaatgatt tataaagaaa ggaggtttgt 1441 ttggtgcatg gttctgcagg ctaaacaaac atggcgccag tatcttcttc cggtgaggac 1501 tcaggaagcc tttactcatt gtggaaagca aggggaaaac aattgtgtca tgtggcaaga 1561 cagagagaga gagcaagaga acgatgccag gctcttttaa acaaccagct cacacatgaa 1621 ctaacagagt gagaactcat tactgcaggg agggcaccaa gccattaatg agggtctgcc 1681 ccctgaccac aacatctccc accaggcccc acctccaaca ttggggatca cgtttacaca 1741 tgagatttgg agggaacaaa catccaaact gaggtaggag gcgggacttg actccagacc 1801 agattgaaga ccggctgaaa cagggaaaag gcactcaaag ctcctctcca taagacatgc 1861 tcaccagtgc catgacagtt taccattgct atggcaacac ccagaagcta ccaccccttt 1921 ccagggcaat gactcggaag ttgtcacccc ttttcagggc cacgacccag aagttactac 1981 cccttatcta aaaatttctt aacaacctgc cccttaattt acatatagtt aaaagtgagt 2041 ataaatctga ctgcagcact gccctgagct gctgctctca acacgctacc tatggggtag 2101 ccttgctctg caggagcagt catggagctg taactcaacc agggctctaa tgctgctgcc 2161 tcaatcaagc tgcttttata ttttattttt tttattttta tttttgagac ggagtttcac 2221 tcttgttgcc caggctggag tgcaatggca aaaatctcag ctcactgcaa cctctgcctc 2281 ccaggttcaa gcaattctcc tgcctcagct tcccgagtag ctgggattac aggtgcctgc 2341 catcacaccc ggctaatttt ttgtattttc agtagagaca gagttttacc atgttggcca 2401 ggatggtctt gaactcctga cctcaggtga tccacctgcc tcgacctccc aaagtgctgg 2461 gattacaggc gtgagccagc acgcccggcc taaagctgct ttcttctacc accagctcac 2521 tctttttttt tttttttttt tttttttgag atggagtctt gctcagtcgc ccaggctgga 2581 gtgcagtggc acgatcttgg ctcactgcaa gctccgcctc ccagattcac gccattctct 2641 tgcctcagcc tcccaagtag ctgggactat aggcgcccgc caccacgccc ggctaatttt 2701 tttttatttt tagtagagac ggggtttcac tgtgttagcc aggatggtct tgatctcctg 2761 acctcgtgat ctgcccatct catcctccca aagtgctggg attacaggca tgagccacca 2821 cgcccggccc accagctcac tcttaaattc tttcctgaga gaggtcaaaa aacttcccag 2881 gctaggcccc agttctgggg ctcacctgcc ctacaacaaa actatatcaa atagccacag 2941 acagtagtaa acaaatgagt gtgactatgt tccaaataag ctttatttac aaaaacaggc 3001 agcaggctgg atttggccca tgggccatga tttgcaaatc ccattctagt gaacaagaca 3061 gacatactca tttgtgtttt tatgctacaa atacttatcc agcatcttct acatggcaag 3121 tattgcactg gacactggta tgccgaacaa gacagaaggc cgggcgtggt ggctcatgcc 3181 tgtaatccca gcactctggg agaccaaggt ggatggatta cttgaggtca ggagtttgag 3241 accagcctgg ccaatgtggt aaaaccctgt ctctactaaa aatacaaaaa ttagccaggc 3301 gtggtggcgg gcacctgtaa tcccagctac ttgggaggct gaggcaggag aattgcttga 3361 acccgggagg cggaggttgc catgagccaa ggttgtgcca ctgcacttca gcctgggtga 3421 cagagtgaga ctctgtctca aaaaaaaaaa gaaaagataa aagacagacg atgacccctg 3481 cccacgttgc acttacatgc tggtgggaga gacaaaatct tgttacgtaa atgctaagac 3541 aaaagtcatc aggaaataca gagggggcag ttaactcatc cttagagttt agggaaggat 3601 tcctggagga ggggatgact gtgttgtttg cttagcaggc tgtcagtgag atgagcaaag 3661 tcaccagaga gctcttggaa ccttttcatc aggacagcct gaggacctcc atccctttcc 3721 ctgtgggtta tctgtcatca ccagctcttc ccttctgtcc cccagtctct gaacagctac 3781 aacgatggag actacgaagg agccaggcgg cttgggcgga atgctaagtg ggtagccatc 3841 gcctccatca tcattggcct tctcatcatc ggcatttctt gtgcagttca cttcacaagg 3901 aagtaagtag gctttttgaa tccctcccca tgtcaaacgc cctttctcga acgcccgttg 3961 gaatctgtta gggtagatta ggttatgctc cagtaacaac cccaagccaa agggttttca 4021 tgttttcttc tcagaataaa tttgtcccag tccagaggga catatcaaca gtgtggatct 4081 tctagtgcgg gagtgggtga actcctttgg taaatgacca aatagttaat atttctggcc 4141 ttacaggccc tacatcctct ggacagaggg acatagtccc aactactcaa ctcagtcttt 4201 gtagcacaaa agtagtgtgt tagtccattc tgcattgcta taaagaaata cctcagactg 4261 agtgatttat aaaaaaagaa ggtttggatg cttctttgga tcaggtcaac aatatggatc 4321 cattacatgg atcatatagg tcagtggaag gtctctcctc cacacagtca tgcagggtcc 4381 cacgctatac aaagctgcac catcttgtag ctgctctatc taggacagaa gtcaacaaac 4441 tttgtctgta aagggccaga tagtaaatct tttaagcctt gcaggccata tggtcccgat 4501 catggtctct ggcacagcta cttagctctg ccattttagc acaaaagcag ccatagacaa 4561 tatagaaatg aacaggcgtg gctgtgttcc aataaaactt tatttacaac aacaggcatc 4621 tgtctgtggg ctgtagtttg ccaacttcag atctagaaca tgaggtgtcc tcaggcaaag 4681 gcgggggaga ggaagactgg agaatcccac atgggctttt tactgcctag cgacaaatgt 4741 cgcttaggct tacattgact gaaatcagtc acgtaatccc acctaactgt aagagggtag 4801 gaaatgtggg agaataaatg ggatatttag gattgctatt tgctctgccc tagcacatca 4861 tctgctaaat aaactcctac ttgcatcaaa ctagcataac tgttttagtt taagggcaga 4921 aaggtcacta catatttttg gtattgacaa ggagctctca tctcatgaag agatggatgc 4981 tgctgatgtg ttgtccctct ggactgggac aaatttattc tgatttctct aaccagctca 5041 gatgggctcc atctgctaga ctcaataaca ctttccgtca gaggtttgac agacagcaag 5101 gtcatgtgat aggcaactca gacccagaaa tgacaagtcc agatccatct ctgtcttgct 5161 gtgtaatttc tggtaagaca gataagaatg ctgagaaatt ctttcacttt tctgccacag 5221 cagcatggct actcctatac ccttgaccaa agtcctacac ccatcaaact gagaaggtca 5281 acctctttct cttggttgac aaaggaggct cacccagatg ggtgagggag gaaggagacc 5341 tcttttgaca ttccagatca gccaacccaa ccttggcaac caagacagta gcagatgtca 5401 tggtagatcc acttcctatc agagaaggca gacccaggaa caaccagtgg cagctgggta 5461 gaggggaaga tgtcaaaaca gaagtgtcac aaaagaaaat taaatggctt tggccaaata 5521 aaagagaaaa ggacactcaa acatgaaagg aaactcaagg tttcttgcct gcaggatggc 5581 aaatttgatg ttggcccatg aagaggaaga tggttggaga atacacgtga atgaacctga 5641 ctatattctt aagagaaggt gtaaggtgct tgtgaacttg tgagcagggg agccacagag 5701 tgtgggttta atggtgcagt tttggcacca gactgcctgg acccaggtcc cagtcaccac 5761 ttattcactg tgagaccttc agccaattgt ttaacattcc tgaacctcag tttcctcacc 5821 tgtaaaatga gaataatagt agtacccatt tcataaagct attgtaaagg ttaactggat 5881 gacccacaga atgtgtaggt tggtgtctct gttctttagt gggtgcacaa aaatgtaaac 5941 catttattaa tcccattatt actaatatta tcaatgttac tgggaaagga cctcagaatt 6001 caagatgcca ttaaatagaa ttcaaggctt tatatttatg agccaaacaa aggacaatgt 6061 tctaagggaa tgagaccaag ccttggcttg actcctggga acttgggaaa atgcatactg 6121 cctagattag tgtttgagct aagtgggtta taataatagc agctgacact aacatagacc 6181 ttgacatatg ccaggtactg ttctaagctc ttaacattaa ttaatcctca ttgagtcttc 6241 acagcaaccc aaagaggtag ataggtatta ttatccctat tttgtagatg aggaaaccga 6301 gactcagaga ggtaaagtga cttgcccaaa gcctcacagc tagttagtgg aaaaggtagg 6361 atttaaaacc aggatagtgt cgctcttcag gctgtgttct tagccaaact ctgcactgcc 6421 gctaagtcaa tatgttcctg ctagtacacc cagtaacaaa ggccagtaag tggtacttgc 6481 tttgtggcta actccaggat tgatttacgg gggaatggta tggtgaggtt tcatcctaca 6541 gaaatgtgtg atgagtcagg gctaaggttc acaaaatcaa aggagtacag gggtcagggg 6601 aaggacatac atgaggtagc ctggttatag gacaatagga agtggtgggg accatggtaa 6661 tttcaaaagc acagtttaaa ggaagcagcc accactctgc tccaattgta accacttagt 6721 atgcaggcct catggggctc atttttcaag agatgttgga aatctgggat tttatgtaaa 6781 atctccaaat ttttagatgt tgacaaccaa ttcacattac taaaacgcat ggttagcaat 6841 tcacattact aaaacacaaa aacaatattt gcagcctaga cactatctac tttgcagtct 6901 ctagtggaca gtttgctttc aaagatcagt ttgctcatgg caactccatc caagaagtgt 6961 tggaagggat ctagaagcat tcatatgtta gactttctcc agccttaata taagtgtttt 7021 tatctgggga tgaagtgttg atgtctctgc ttgtgattga agctttgtta ggaataaaag 7081 gtatgagaag gagttgcctc tttgatcctt cttcctaaaa ggcactgttc acaaaggtta 7141 gagcctatga tttaatacca gctttatcag gattcagatc ctacttctac cacacaggct 7201 atctgggcaa attattaaac tctctgtgcc tcaatttccc taactataaa atgagaattg 7261 aattatttcc ccatctgtaa ggtcaaattc cttctagatg cctttccctc tgttattaaa 7321 gaaaacaggc caggcacagt ggctgatgcc tataatccta gcactttggg aggcaaagat 7381 ggaaggatct cttgaggcca ggagttcctt gaccagcctg ggcaacatag ctagacccca 7441 tctctaaaaa aaaaaaattt tttttaatta gctgggcatg gtggcacaca cctgtattcc 7501 cagctatttg gaaggctgag atgggaggat cacttgagtc caggaattca aggctgcagt 7561 gagctgtggt catgccatcg cactctagcc tacatgacag agtgagaccc tgtctcaaaa 7621 actaaataaa taaattttaa aagagaccca aatttaaagg ctttgtcata tgcatgtagg 7681 ggcaagagag agagagacag agagaaagaa ggaaagaggc agaaaagaac tagctgttag 7741 agaatgaaat caattctctc ttccaggtaa catggccaat ttctcagtaa tcctttgaaa 7801 ttaaatctcc cagccctcat ttacaggagt aactaaatgg caatatgaaa cttttaaggc 7861 aaaagtaccc aagctttaga ctccataaga gttacctgga gaggttttaa aaaagttagc 7921 atccgagcct cacctccaaa gattccaacc tgctaaatca gaagtggggt tcaagaacat 7981 ctgcatcttt cagctgcccc tgctcccaac ctgaggattc agatgcaggt ggcgggcaga 8041 agggtctttg ccaaaccttg gcaaatatcc tgccaccaaa gtccacaagg aagaacgtgc 8101 gaatggtggc cattcatgga ctggtcggtt ttcatttaca gcaaactcat catcctaatc 8161 tgctgtctaa cccaatgcta gggaggggaa tgaagttgcc acctagcagg tctacccaca 8221 gggagcctcc tttgtaacaa tctatttcca cagaggggct ggaaaatctg acattagagg 8281 agtcgaagcc tgaagatgag cccaaaaccc atgaatctgt gagttttctg aaagtattca 8341 ttccaaaatg ctgtctcggg gctgggtgtg gtggctcaca cctgtaatcc aataccttgg 8401 gctgaggtgg gaggatctct tgaggccagg agtttaggac cagcctgggc aacatagcaa 8461 gaccctgttg ctccaaaaaa tatttatttt aaattagctg ggtttggtgg cgcatacctg 8521 tcatcctagc tatttgggag gcggaagcag gaggatcttt tgagcccagg agattgaggc 8581 tgcagtgagc tgtgatcaca ccactgcact gcagcctaag caacagagtg agaccctgtc 8641 tcaaaaaaaa acaaaaacaa aaacaaaacc tgtctgaagt aaattactgc cttttaaaaa 8701 ttgtaaattc cctgtcttta aaaaaatcat tctttatcct tattaagaaa gctgtaacca 8761 atgtgttaga gaacagattg aaatctcaaa tggttgaaat cccgggtagt ttacttcctt 8821 ggcgtagtct cagatcatga tagtctttac ccaaactcat aaaaagctct cattcgtcct 8881 attttacagg atatctatgt aagccacatg gtggctgggg caatgaaaca cggggtttca 8941 tgacatcaaa ttcctaagag ctgggaatta ttcaactgaa atattaaacc attttggata 9001 gaatcgtgca ttgacagcag ggaattgttc ttgtccagac agaatcatga acttgcagtc 9061 tttcaaagtc agaaaactct ccgccctcaa gctgctaggg aaagtgaaga aataaatcac 9121 acaggtgaaa ggtttaaata tagtagctgt ccatttatca atcctagtaa ttcaaacccc 9181 tagtcaaaac caacctattt gcaggctttt gcttcctgct aataccttga ggactcaagc 9241 aaggttaggt tattctagtg atgtttctca aagcacaaat ctcagagcca cctacgacag 9301 attcatctgg gatgtttata aaaatgcaag ttcctggacc ccatttcctg agtagcctgg 9361 aaatatgaca agccaccttc cccaaaaaat aattctaaat attctaaagt tgcagaacta 9421 ctatcctagg gcttgcattt aatgaagaac attcttgcct gtgtggacat gtgccaacaa 9481 acaagacttt ttgcatccct aagataaaag ataaacagtt tttcttctct tatacactca 9541 acacaacaca cagctgtgag caggtgtgtt ttgtttcccc cacacaccaa gcaattgtcc 9601 agtgactacc aactaggtat tctgtaattt agcccgatta tgacaccatc tactgggagt 9661 tagagtcaga tcccacaggt taatggtttg gtcccccaag actgcctccc atttcagatg 9721 ctaattgcaa gccccaggtt gtgacctgcg cttctgactg gctggctctt agttggagat 9781 tttcaccacc ctcttctcag tttctattaa tgcgctaaag tggctcacag aactcaaggc 9841 aacacttact tatgtttacc catttattac aaaggatatt acaaaggata cagatgaaca 9901 gtcagatgga agagatgcat agggcaaggt atagaggaag gggcgtggtg tttccatgtg 9961 tgccaccttc caggtcccac cacacattca gcaatctaaa acctctccca accctgtcct 10021 tttgggtttg catgggggct tcattacata ggcatgacca attacatcac tggccattgg 10081 tgcccaactc aaaatcaacc ttcaggccat ttgtcctccc tggagtccag ggggtgggac 10141 tggaagttcc aaccttctaa ttacatggct ggttcccctg gcaacccact cccaacctga 10201 ggctatccag gagcccacca agagttgcct cattagaaca aatatagtcc catcacccag 10261 gaaattccaa gaaactgggg agctctgtgt cagaaactgg agtcaaagac aaaacatcaa 10321 aacaaaagat tctcctagta cccagtctac aagggtgtta ggagctctgt gtcaagaacc 10381 agatgcaggg atcaaatatt tcttattgtt attatagcac aacatcacac ctaaggtacc 10441 tggtagaagc ttcagttatt gctgcttccc tggaaagttt acacacctcc caaattttct 10501 cttcctccct cccttttgtt tttggttttt gagacagggg tctcattatg ttgccccggc 10561 tggtcttgaa ctcccagcct caagtgatcc tcccacctca gcctcctgag ttagctggga 10621 ttacagggtt gagccaccac acccagctct tttcatctct cattgagatt aggctcatgg 10681 tcacttatct gatgctctgt ggaattcagc ggaacgtcct tagcagcaga aagagttgac 10741 caggacaaca tttgggtatg agacaaatgt cctctgtgac aaatatgaca ttcccggatg 10801 accctgctaa tacagtggag cttcatgtta ctcaggagta aggttctcat gtcagggcac 10861 aaggtgagct ttgagtataa tcaaacttaa tcttggaagt atccaaggag ggcgccctac 10921 tggagactca cacatggtat cttgatcagc ccagcatagc cagtgaatgt ttccagtgtt 10981 ggaacaggtg actgttcctt gggttgagag cccactgaag tatgtggctg ggtttagggt 11041 tcatggcatc tgtttcaaca ctggaaccac actctgggct cacatagaag gaatagatgt 11101 gacatcccca tccgtgctca tgatcattga gcggggcggc agtcagaaac tccagctcta 11161 tttaaacagt tgcttttttt gtttttgagg aacataaata gttgatctgc cacatgttaa 11221 atgcacatac aatgtaactc ccctgaaggt acatctctga gtggtcttgt tgaggtgtaa 11281 acagccaggt acacacctgt gtgttaaaaa tgcttgcttg ccattcttaa caacaacagc 11341 ctttgtgctc caaggttttc attaaggaaa aacttacctc taaacagggg ttagggtccc 11401 atgactgtgt ctcattagca ggcacagagt tagaagacct tgacccattc tgtactggga 11461 aagcgtctct ctcataagca aattgaaatg aatcctttcc tggtctcaaa atatccttca 11521 attgcatcag aattggaagg agacatacat atcaaggtgg aacttgacct ttttgcaaaa 11581 gcttaaaagc aaatccctct tcttttctat atttttggct ggggaaaggc tttctgggaa 11641 ttgttttgat cagagaattt ttgtgattcc ataatgaaaa atgggaatca ttacctattt 11701 ctagaaggtt agaagggtct ttctctgttg gttcagataa gttcaggtag agacaaatca 11761 gacctaagtt gccagcaaca atggaagaat ttttgcttgg aaaaggatat aagctagcag 11821 cgcattgatg aagaaatcag aatgaacatt aaaaagatgt tcaacctcac tagtaatcaa 11881 gaaagcacaa attaaagcaa taacaacaaa aatcctacat atttttgtct ggcagattgc 11941 aacaattaaa aagatgggta atatcaaggt tgaccaggat gtgaaaaatt tgacactata 12001 ctggtggtga aagagaaaat tgggacagtc ttttggaaag tcagtttgga tgtatctaat 12061 ataaaatcca ggcacccacc aacccagcaa ttccactgct cagtatccac cttagagaca 12121 ggtgcacaga aatgcatgta acacagtaat gcttcaccct agctacgtgt tagaatcacc 12181 tggggagctt taaacatatg ccaatgccgc tgggcactgt agttcatgcc tgaaatccca 12241 gcactttggg aggcagaaac aggaggatca ctagagctca ggagctggag gccagcctgg 12301 gaaacatagc aagatctcat ctctactaaa aatttaaaaa aatatatctg ggcttggtgg 12361 catgcacctg tagtcccagc tacttgggag gctcaggtgt gaggccagct tgagcccaag 12421 aggtcgaggg tgcagtccgc tataattgtg ccactgtact ccagcctggg tgacagagca 12481 agaccctgtc tcaaaaaatt aaaacaaaaa aatttttaaa aacaatgccc agggcccacc 12541 ctggaccaat taaattagaa tctccagtaa cagggccagg aattggtaga ctgtagggct 12601 ctccaggtaa ctctaatgtg cagccagggt tgggaaccac agacgtgcaa cgatgttcat 12661 tgcagctttg tttgcaatag gagaaacctt gaaacaacct aaatgtccat cagtaggcag 12721 gttaaattag gagtgatgta ttcatatgat gggacgccac actgcagttt aaacatgaag 12781 acagggccag gtgccctggt tcatacctgt aatcccagca ctttgggagg ccgaggtggg 12841 aggatcactt gagcccagga gttcaagacc agcctgggcg atatactgag atcccgtctc 12901 tatttttttt taagtataca ggtggagaca tccttcttac tatgacttta caagccctgc 12961 atgatctggc tgccacctcc ccatcctcat ctcccatccc acatcctcat ctcccctctc 13021 tctccctttc actccctcca cttcagccac atcagagtcc tctctgagca atgcaagcat 13081 attcctgcct cagggcctct gcacttgccg ttccctctgc ctggaattct cttccttcta 13141 aaagccaaac agctggttcc catttctcct tctggtactt gttcatatat tacattagca 13201 aagagactat ccctttagcc agttttactt ctcttcctta ctactccctg acaaattata 13261 tatctatgtc tgtttccccc ctactggcat gagaactcca gggacatttt catttctgtt 13321 attgcccata ataatagagc atggccctta agagtgccta tttatggcca ggtgtggtga 13381 ctcacacctg taatcccagc actttgggag gctgaggcgg gtggatcaca aggtcaggag 13441 atcgatacca tcctggctaa catggtgaaa ccccgtctct actaaaaata gaaaaaaaaa 13501 aattagctgg gcatggtggc aggcgcctgt agtcccagct acttgggagg ctgaggcagg 13561 agaatggcat gaacccagga ggtggaggtt gcagtgagcc gagatcatgc cactgcactc 13621 cagcctgggc aacagagcga gactccatct caaaaaaaaa agaaaaaaag aaaaagagga 13681 cctatttaca agttagactg ctggagttcc agtccaggtg cttcctctaa gtagctgtgt 13741 gcccttgggt atatctttta agcagtctgt gcctctgttt tctcatacat aaaatgggga 13801 taataacagt gcctatctca tagggctgtg agggttaaat ggcacaaact acagggcctg 13861 gcactctgta ctcactaggt aactgttggg tattattact atcatcagca cctaaagcag 13921 tgcccagcat gtaaataagc attgaataaa tgaatgagta aaacaaaaaa agaaattgca 13981 gcacgatcac attgtgtaaa cgcatacaca tgatgagtat ataatttcta catgcttatt 14041 gatcagtttc aatgcagaga aaaacgcctg gagaacttac accaaactgc taacagcact 14101 actatactaa tccttgggga ggagagtgag gttcggaaat gctcaagagg catttttcca 14161 tatccgcatt ttgtttgttg ttgttgtttt gttttgtttt ttgagatgga gtcttgctca 14221 gtcgcccagg ctggagtgca gtggcacaat ctcggctcac tgcaagctcc gcctcccggg 14281 ttcacgccat tctcctgcct cagcctccca agtagctggg actacaggca cccgccacca 14341 cgcccggcta attttttttg tatttttagt agagatgcag tttcaccatg ttagccagga 14401 tggtctctat ctcctgacct cgtgatccac ccgcctcagc ctcccaaagt gctgggatta 14461 caggcgtgag ccaccatgcc tggcctgttt ttgcttttgt ttttttgcac gtaccacatg 14521 taagataaaa taagttttac aaatagaata tgtttctcaa aaggaagccc ctgcttagag 14581 tcaaaagtgt gcacgcaggc tggtacattt caacatgtgt ttaaccagag tcatcatcat 14641 caaataagga aggcatcgag ctgggaaagc caaccacaga agctcagccc acggtctttc 14701 tctctctctc tctgtctgta ccacacagtg cctgaggaac cagcggtcag tgggctgtga 14761 gcgtggagga tggacctcat ccacacacac cccaaaggag tttctaagga atggatcctt 14821 gacttcagac tgtgagatct tttcctccag gactctccag aggcaggtcc ctggcaaatg 14881 aacaagaaaa aaaaaaaaaa aaagtccaaa atttaggcaa tccaagctgc acagccggat 14941 cagccaaagt cattgatttg taaaaatgaa aagaaaacag aaaaaagaaa aatgaagtct 15001 cactgtctca gtttagcgaa tcccgttgtg tccactcctg tcctccagag gcgagcctca 15061 ggaaatcaca taacttttca ctgaggggat ccagggggtc tccatatagg gggagatgga 15121 ggtttctagg aagagcagca ggtgctggta tttacaatgt tgagcacaaa catttgcagc 15181 atgtttaaaa ttgtctagta gagttcaagt tgtggatttg cttttccttt tattcttata 15241 accttcagta actcctcctc tgggagtcag cactcccatg cccagagttc acccatctgg 15301 tcatcaaaca ctcaaagaag gggcttttct ggccttttgt cttgatgctt atatttccaa 15361 ataggccccc tcccttgctt gcatccacgt tggtcaactt gaccaaaacc tcactcttca 15421 ctcaaacagg ctctgagaat ggacttagtg gccaattcta ggtacatgag cacttcctgt 15481 atcccagttt tgggaataaa ctggctgtat ttatagaatg tgcttttttt ttttcaattt 15541 ctcactctct ctcctatctc tagcaagtct caggcaagat ctttgatttt cctggatgcc 15601 acctggaaat gccacccatt gtgtttcttt tctgtcaaat gtaaaccctt tagatgtgaa 15661 tgtactggtt taatgatgcc attattctgc ctgccagaac gcagtaaccc agtgtctcac 15721 agagcacaag gggtgtgcca ctggtggtac acaagataat ttttaagtag tttctaggaa 15781 caacattaag taataccaaa tcacaaagaa tgtttcccct tttctattct tttttcatcc 15841 tgattacagc aaggaaaaag tctctgttta gtgctagcag gtcctttaca cctttcagac 15901 actatggctc ttttcccttt ttagcaaaga aagagcaggc ctcagagtct tctgtctaga 15961 tagaatttaa tgatattgtt ttgtgtcatg gtatttattt tatttattac cttccattta 16021 cagcttccca cagtggggga tgtgacatat tgtttctgtt caaataaatt aagaaaaaca 16081 agagaactca agaaaatatc aagtaattaa cacaccagat aagtatatgt ggcaaaagtc 16141 acttcaaaga attaatgtca gaaagatggt gataatgaag caaaagaaag gcagattatg 16201 ctggccgggc gtggtggctc acgcctgtaa tcccagcaat ttgagaggct gagatcactt 16261 aaggtcagga gtttgagacc agcctgacca acatggtgaa actccatctc tactaaaaat 16321 acaaaaatta gccaggcgtg gtggtgcatg cctgtaatcc cagctaatag agaggctgag 16381 gcaggagaat cacttgaatc cagaaggcgg aggttgccgt gagctgagac tgtgccacta 16441 cactccagcc cggggtgaca gagcaagact ccatctcaaa aaaaaaagaa aagaaaacga 16501 aagaaaaaga aaaaagaaag gcagattatt ctaattgaat gaataagagg ctgagagtca 16561 atacctagtt ccaacgccct tttgtttttg cagggttttt attggccact aactagctat 16621 gcaagctgat tttaggcaag tttcatagcc tctctgtgcc taaaattcct tcacctgcaa 16681 aatggggata agaatgcctg cctacttacc tcacagtgct gttatgagaa ttaattgttg 16741 taatagttgt gactaaccat tggcggggag aattctaagt actataccaa tacaaatttc 16801 agcatttttt catatactga accctagcaa aaaaaaaaaa aattctgatg aacatcagat 16861 tcaacctgtc attttacccg gatgttgtga tgagccagca cttgtcccag gaggacgttg 16921 aatcagggac acgaaatctc agcacacaga gattctagca tcattccaat agattggacc 16981 ctgatttctg tttctgacat cctgtttttt gtttgtgcgt gtttgggtcc ctggtctttt 17041 acccaaatca aatgaaaagt gttgctcaga ggcagaataa aaatggtttt aagtaagcct 17101 gtgtaccaca gtgtgtgaat tgtgttttct tggggttccc caccttccag agatttccgt 17161 aaccaaaata agctcatcta gtccagcatc agtccccagg gaagagccca gagaggccag 17221 aggtggcgct agagaggcat cactctgaac taatttatgg ccagaaatgt ccctctatga 17281 ggcactgaaa atgttggtcc tattaaagta aaaaaataaa ataagaaaaa tcttgtcatt 17341 aaatgtgctc cttggggtca agagggccac tgatttttgt tgggcgtccc ccaggtccta 17401 agctctaaga aggacaaaat ggtttttgtt gggatgatgg gcttgtccag agcttcctcc 17461 aaagctaaga ctgcccctgc tccactcgca aaagcacagt ttaaaaagag attggtggcc 17521 gggcgcggtg gctcacgcct gtaatcccag cacttcggga ggccgaggca ggtggatcat 17581 aaagtcaggc tagcgagacc atcctggcta acacagtgaa acccagtctc tactaaaaat 17641 acaaaaaatt agccgggcgt ggtggcgggc gcctgtagtc ccagctactc gggaggctga 17701 ggcaggagaa tcgcttgaat ccgggaggcg gaggttgcag tgagccgaga tcatgccact 17761 gcactccagc ctgggcgaca gagtgagatt ccgtatcaaa aaaaaaaaaa aggtgtatta 17821 tcaaccaaga gactccttca tcagccagtt acaaaaactg gctaaataga gcagaaaaag 17881 tagagtttac cctcacaaga cggaaacttc cagggatagc ttcagaggca gctggatcct 17941 ggtgcgcagg gtttttggga acgtgtccct ttccaggtct cagccccaac ttgctgagtt 18001 cattttagtt taatagggac aatacgactg ccaacagctt tggacttttt cctaccagcc 18061 tgtggaaaga cagtgtttct tccctaatag ctccaggggg gaaaaatggt tcaattcaat 18121 ataatgaaca actgatacaa accagaaacc gacctatcaa aatggatacc agtcattaac 18181 aaacaggtgt ggccactgta ccaacataaa tggatgctgt ggccaggggt gtgtggtgcc 18241 ctcattggcc aggcttgaag tcaggattca atcactggaa ctgaatgtga ggtcagctcc 18301 actcaaacct tatagattga gggtgcggga aggatggttc cccaaaggaa aaccaaagta 18361 ctcttaccag aagaaggaga gagatttgct aggcaaaaaa acaaaaacaa acaaacaaaa 18421 aaaacagcgt ccatattaag catgctgctt cttactatac cttccatatg cagggcaaag 18481 taatctaatg gcttttcttt ttcttttctt tttttttttt ttttttgaga tggagtttca 18541 ctcattgccc aggctggagt gcaatggcac gatctctgct cactgcaacc tccacctccc 18601 gggttcaagc gattctcctg cttcagcttc ctgagtagct gggattacag gtgtgcacca 18661 ccacacctgg ctaatttttt gtatttttag tagagactgg gtttcaccat gttggccagg 18721 ctggtctcga actcctgacc tcaggtgatc cacccacctc ggcctcccaa agtgctggga 18781 ttacaggtgt gagccaccac acccagcctc ctcatagctt ttcaatttcc ccgtggtgtt 18841 gtcactttca tccagctcct tctctgacca gtatttatag agcacctgct caaccacaga 18901 aactgggctg gtggaaattc cagaatgttt aagagagacc cctgagaagg tggcagtagt 18961 ctagtggggg gaacagacac acacaaacaa tgagaaggaa acatgagagt gcccagacag 19021 aggcataaac aaagtattgt gccatcactt tgcaggagaa cacactggct gtactaaagg 19081 gaaggaagac agtccctgtc ctaccattct tcgcagtgct gaggccaata gggctacagc 19141 tcttgaagtt gccctgtctc tgcccacaag ttatgggcag aaacagatgc agatataaaa 19201 tcgctggcct cagctcctgg atcagcaaat tacctttggc ctacagcagg acagctgcat 19261 tcagcaaaac agcatgatgc acatatcttg atacgtaacc ctagattcag gcttctaaat 19321 ataagagaca gttctgggcc accaattacc aaaaaaaaaa aaaaaaaaat cttgtaagat 19381 aacgttcata ctcatgggtt acaaccaatt cttaaaaaat actcacgaat attgaatatt 19441 taagtctggg gaattgccct ataaaaatat gtccctgcac caaaacactt tgaattctgg 19501 tttggacaag cacaaagaga ggcaggcata aagatgactc aggcatcacc tacacttgct 19561 ggagaagcca cagaaagaag gaattccctt tccaatacat tcattcattc attccagtag 19621 acatctattt atcaggggtc aggagaggga agtatgaagt tgcctctgtg cattgcctgc 19681 atgaggcatc agagagtttc atctcgagta cctgttccaa acctcgggtg agcagccagt 19741 cagacctgca atgacccggt cacggcgttc atctctcttc ctgtcccagt gtagaatttc 19801 ccagtttgca cccacactgg ttcactgatg cacccaagct gtctgctcct ggaactgtga 19861 ccaagtgttt actcccccag gtcacgtgga atcccatgca tggggactag ttgtactagt 19921 tgtactagtt ggagtgaaag tattgggagg aactcataaa aggccaaatt ctgcctatat 19981 agaaatggcc tgatcacccc caagaaagag gacctaaagc acgcctagaa gggaaggtaa 20041 ataaatattt cttgagccat taccaaagct catctaaaca attacagtag cttttttttt 20101 tttttgagat ggagtcttgc tctgttgccc aggcttgagt acagtggcat gatcttggct 20161 cactgcaacc tccgcctccc aggttcaagc aattcttctg cctcagcctc ccgagtagct 20221 gggactacag gctagcacca ccatgcccag ctaatttttt gtatttttag tagagacgga 20281 gttttgccat gttggccggg ctggtctcga actcctggcc tcaagtgatc cgcccacctc 20341 ggcttcccaa agtgccaaga ttaaaggcat gagccacggc accagtccta cagtagcttt 20401 cttacagggc tctctggtat aacaaccatt acaaacttaa tgagttaaaa caacccaaat 20461 ttattatctt gcagttctgg aggtcagaag tctaaaacgg gtcttgctgg accaaaatca 20521 agatggcggc agggttggat tcctttctgg aagttcgtag agagaatccc ttttcttgcc 20581 ttttccacct tctagaagct atccacatgc cttgactcag ggcctccttc catcttcaac 20641 ggcagcaaag tcaccattta cttatttatt atcttcttta ttatctacct cctcccctta 20701 agatgccttt ttcacactgc atctcttttg ttctgactcc ttttcctctc tcttccccat 20761 ttagggaccc ttgtcattac tttgggtcca cctggatgat ccagaaaaat ctccttattt 20821 taaggtcagc tgattagcaa ccttaattcc atctgctccc ttaattcctc tttaccatgt 20881 aacccaacat attcacaagt tccaggtgtt tgtatgggga tatcttttgg cgagcattat 20941 tctgtctact acatcttgct ttgacatcca ttcgcttaaa aaacacacac agcagccagg 21001 gttatctcac aacctagact taatcacatc attcctctgc tgaaacccaa cccacacacc 21061 cagtgaccac ccacggaatt aaaactcaaa ctttcttttt ttttggaaga gatgggagtc 21121 ttgctatgtg gcccaggcta gtctcaaact tctggcctca agcaatcctc ccacctctgc 21181 ctccgaaagt gctgggattg taggtgtaag ccaccatacc cagcctttaa aatccaaact 21241 ttttttgtgt gtccttaaaa aggtctttat tactaaaagc tggaaggtca gtctttaacc 21301 cccaatcaca aaatgctact ttgcctcgac aagcaggctt tttagttgag ttggtgtctc 21361 ttttgtagtt aagacaagca tttgcccttg ttgaccagtg atcaggaatc cgttcttgta 21421 attttgccct ttcatccttg ataggtgaaa tcagtacctt ttccacaact ttgtttattc 21481 ttcatttttt ttgtcttcgg agtaaaagac caagggacat taacaagttg atggtcccct 21541 agatatgggc tctagtgcca gcgagtctgt tctcttcttc agtgtcaaaa ggactgctca 21601 ttctggagca tctactcttc tggactaaaa tccaaacttt tttttatttt ttaacttcaa 21661 gttctgggat acatgtgcag aacgtggagg tttgttacgt aggtatatgt gtgccatggt 21721 ggtttgctgc acctatcaac ctattatcta ggttttaagc cctgcatgca ttaagcattt 21781 atcctaatgc tctccctccc tttgcccccc acccccccga caggccccag tgtgttatgt 21841 tcccctccct gtgtccatgt gttctcattg ttcaactccc acttatgagt gagaacatgc 21901 ggtgttttaa aatccaaact tttaatgata ccccgcaaag ccctacatga tcagaccctt 21961 ggctgcctct cagatctcat cttctacctc gcccccactc tactcagctg cagccacaca 22021 ctggccccaa acatgacaag atcattctgg tctcaaggcc ttatgtgtgc tgtttcctcc 22081 attcataaca ccattctcca gattttcaca tagctggtcc tttcttcccc atgccctagg 22141 tcacagtttc tcagagagga tcaacctaac ccccatctct aaggcaacca ccccacccct 22201 catcacttgc tattacacca ccctgtttta cttcctccat ggggtttgcc accactaaca 22261 atatttttat ttaactgttt acttatttat tatcttcttt attatctatc tcctccccct 22321 aggatgttga ccttgtgagg gcagagacct tgtctgttct gttcactgat gtgtccccag 22381 catctagact agggcttggc ccatggcaga tactctatag ctatttgctg aatgagtgaa 22441 aggttctttt ccccatgtgc ccttcctcct ggctaatgtc cactccttta tctcagatta 22501 agtgcctctt gttcagaggt gccttctctg atatgccaac gtttcaagaa catgagggtc 22561 tcccttttgc atgctcacac ttccccgtgg tttcactcaa ggtcatatca caatggtagt 22621 tagatgcagt ttgcacaatt accatttaat atctgtcttt cccagcagat catgagcccc 22681 catgagggca gggaccatgt tcatcttgtt cacctctgtc ttcctggtcc ctagcttggt 22741 acctggcaca taagtttcca atagttattt aatgaatgaa tgaattaatt aattaattaa 22801 tgaggtagag agaaatggaa gaagaaaact gagtcaaagc ttcctccatc atcatagctt 22861 tatttttaca ttgttatctg acaagattta aacttgaatt ttaaagcaag atgaagctaa 22921 ccaaggtatt ggctgtaaat cctgtctcaa aactttctca catattcctt atcgtgatgg 22981 tttagaaaaa ggaagaaatg aaaatcgact tataccaaaa aataatttca agaagataga 23041 gacccggctg ggcgcggtgg ctcatgcctg taatcccagc actttgggag gccgaggtag 23101 gaggatcacc tgaggtcaga acttcgagaa cagcctggcc aacatggtga aacccaatgt 23161 ctactaaaaa tacaaaaatt agccgagagt ggtggtgcat gcttgtaatc tcagctactc 23221 gggaagctga ggcaggagaa tctcctgaac ccaggtggcg gaggttgcag tgagccgaga 23281 ttgtgccact gcactccagc ctgggccaca agagcaaaac tctgtctcaa aaaaacagaa 23341 gaaaagaaaa agaaaaggag aagagaaaag aagagaaaaa gaaaagatag agacccaaat 23401 gaaccccagc ctcctttgtg cctgggatgc cccctagtgg aaaaaatgga gaaagacaat 23461 ttagagtgtt ccctaacaat ttgacaaata tcgtatttct ttgcttttta tactctctga 23521 gttagtgtag gatccagaaa caaactggaa gaaaatcaca cagctttgat gaagcaaaga 23581 agcaagatgt tctagagtaa taaactttca gtaaaacaaa aactgagcaa ctgttgcttt 23641 ggaaccagat ttttggctga tctctgatgg taatttttgg gctatctcac tctaagactt 23701 atttctgata tggaactttt caaaaatgat gtagattcaa ttagtcacac ctctgttcct 23761 tagaaattca aggtggataa gatcatcaaa tgccaacact ggctcagcgg ggcttgttct 23821 tcagtctcta tattctacaa ttctaaactt agctcttctg tcccacggta gttgtttaac 23881 tattaaaaaa aaaaaaaaaa aaaaagggcc gggcgcgttg gctcacgcct gtaatcccag 23941 cactttggga ggccgaggtg agcggatcac gaggtaagga gatcaagacc accctggcta 24001 acacgttgaa accccgtctc tactaaaaat acaaaaaatt agccgggcgc ggtggcgggc 24061 gcctgtactc ccagctactc aggaggctga ggcaggagaa tggcgtgaac ccgggaggcg 24121 gagtttgcag tgagcagaga tggcgccact gcactccagc ctgggcaata gagcaagact 24181 ccatctcaaa aaaaaaaaaa aaagtgggta actataataa caaacattat ggcttgctat 24241 ttgccagaca ttctgctaaa cctttgacat gatctctcat ttgatcatta ttaacaaccc 24301 tagatataaa taccattaac ttctctttgc atctgaggaa actgagtcat ggaagagtta 24361 aataatttgt ctgaggcaca actgctgaat gttgggccat ttgattttaa aatccatgat 24421 cttaatcatt acactcaatt aactcccata aatgataaac taactctcag agtttagttc 24481 ataactaagc aaaaataaca gaactacttt ctacgcataa cagcgtgcaa tgacctaata 24541 gcctattacc cattgtccct aattcgttgc aaacttgaca taaatttctg tgagatgaac 24601 ttttacggcc gatttgcatt aattataaca cattcctttt gggattgctt atttgatttt 24661 gttgttgttg ttgttgttgt tgttgttttt gagacggagt ctcgctctgt tgcccaggct 24721 ggggtgcagt ggcacaatct cagctcactg caaccttcac cccccagatt caagtgattc 24781 tcctgcctca gcctccccag tagctgggat tacaggcatg agccaccaca cccagctaat 24841 ttttgtattt ttagtagcca tcgggttttg ccatgttggc caggctggtc ttgaactcct 24901 gaactcaagt gatctcccca ctttgacctc ctaaagtgct gggattacag gcacaagcca 24961 ctgctcccgg cctgcttatt cgatatttaa gaagctctga ggtctcactt caatgtctaa 25021 gatatatttt ctatgcacaa taattaatat acacaggaaa gaaaaaacag ctacatttct 25081 actgttgaag gatatgagtc catccgagta tacgatttag gatgctttgg gttgtaagca 25141 acagaaaatc caactggctt taataagggc agttaccagt tctcacaatg gaaaatttca 25201 gatagaagac aaacttcaag gttggtttga ttctgcacat cacaaattta ataggctatg 25261 gctctgtttc cctttgattt tctcagcttt gtccaccttt gtgtattctt tgacctcaaa 25321 ctggtttctc tcatgggagt agctcttcag gctacaaact tcctcatcaa tgtcctgaga 25381 gaaggaaggc ctctctaccc taaaccttca gacaaaagcc ccactgtatt cttactgtaa 25441 cagttaggtc atacaaccat tcctgaaccc agtgctccag tctgggttat gtgaccattc 25501 cgaaaaaaac tgcagcaaga agggtaggat tgtccagtca ggacctgcct taagaattgc 25561 tcagtaggcc aggtgcagtg gctcacacct ataatcccag cactttggga ggcagaggtg 25621 ggtggatcat gaggttaggc ggtcaagacc atcctggcca acatggtgaa accccgtctc 25681 tactaaatat acaaaaaaaa aaaaaaatta gctggatgtg gtggcacatg cctgtaatcc 25741 cagctactcg ggaggctgag gcaggaaaat cacctgaacc agggagtcgg aggttgcagt 25801 gagccgagat cattccactg cagtccagcc tggcgacaga gtgagactcc gtctcaaaaa 25861 aaaaaaaaaa aaaaaaaaag aattgctcag taaacagtag ctcttagaaa tttttattct 25921 aataaatgaa aatttctgta cccaagtaaa gtataagtag aatgactaat aatgaaaagt 25981 tagtccacct cactcataaa tcaacaaaag cataggaaac caatggcata caacttttat 26041 taatcagact accaaaaaaa tttaaaaatt aataatgctt agtgttgagg aggatgtgag 26101 aaacagatat cacaaacatc attgctgcaa cacttttgga gagaaatttg gcaatttcta 26161 gcaaaagtgt aaaggtatgt acccttggac ctagtgattc catattagaa attattccca 26221 ctattttact cagaaaacta tgatatgtga gctaggatat gtgaatcagt cacattgtat 26281 ggatatacca aggcagtgtt atgtgtgaaa gtttaaaatg agaagcaaaa atgtccatca 26341 atagagacta attagatttc tggattttag tagtggagga gtaacttata ttgggcaaat 26401 tccattgcct agtacagcta caaactctga acaaaacatt tttacaaatt aaagacacct 26461 gagaacaact aaaaactgga gggtcagtaa gaatcctcat aaaatgggaa tcatccacat 26521 ttacccggct tttcctctga agggtcaccc agtctgcata gcacacagag aacagagctc 26581 aagccaaaag caatagtctt attaggcagc agaatcacag gttagagttt ggggccccca 26641 tggtggctga aatttgatga gagaaatccc agaaatgata gagccacaga aaagggacct 26701 aaaaaatctg aatataaact cccctcaaat ccttggctga ctcaaactgt atttgtgaag 26761 cccgacagaa aacaaaacca aaccaaaaag ctggaggaga atgcaaagaa ctgcattgat 26821 cagtgctggg gaggcagagt ttggagctca agttctacca aattagagaa acttaaactc 26881 ctcagtcttt ctgttgaaac ccagaggggt cacaaattag tcagaaagac cacatccaag 26941 aactagtatg tagaaggcac aaaaccaaaa tggatccacc caaaagaagt cccaaaccat 27001 accttcgtaa cataagaacg agtcactagt aatttcacta ccagttaaaa caaaactcaa 27061 caattttctg agaaagataa tagaatccag agcctctgct atattatcca caatgtgcag 27121 tataaaattt taaaatcact gggccgggca cagtggctca cacctataat cccagcactt 27181 tgggaggccg aggtggatgg atcacgaggt caggagatgg agaccatctg gctaacacgg 27241 agaaaccccg tctctatcaa aaatacaaaa aattaaccag gcaaggtggt gggcatctgt 27301 actcccagct gcttgggagg ctgaggcagg agaatggtgt gaacctggga ggcagaggtt 27361 gcagtgagcc gagatcacac cattgcactc cagcctgggt gacagagcga gacgccacct 27421 caaaaaaaaa aaaaaaaaaa aattaatcac taaacatgtg aaggagcagg aaactgtgat 27481 agtcaagaga aaaaatagtt aatagaaaca gacccaaaga caatgcagat gttgaaataa 27541 gtaggcaaat attttacaat aactccaata catatgttaa agaatctaga ggaatagatt 27601 aataggtgaa gaggtgatga gtttcatgaa gctatgcaat gtaaaaaaaa attaaaaaat 27661 tttttaaaaa cccctgaaaa tcctagaact aagaaaacaa atcctgaaat aaattatttg 27721 tatgggatta agcatacaga agaaaggagc aatgatcttg aagacaggta aataaaaatt 27781 atccaaactg aagcagcagc aagaaaaaag atttaaagaa aaaaataagc agagcttcag 27841 tgatttgcag agaagtatca agcattctac catatatgta attagagtcc taaaaataaa 27901 aggagagaaa gaatggagca tttaaaaagt ttgcttaaga aaaattgatt taaatgtttc 27961 caaatggtat taaaaatatc aacctacata tccaagaagc taaaaaaaaa aaaaacaaaa 28021 acccaatcag gataaatgca aaggaaaaca cacctaggca aattactgga aaaagaaaaa 28081 atctaaaagg cagccagagg gaaaaatgat acatcacgta catggaaaca aaggtaaaaa 28141 tgataattga cttctctttt ttttcaggtc ttggcatcaa gacttgattc tcatcagaaa 28201 caattaaata cagaatagag tagaatgaca actttaaaat gctgaaagaa aaaaggtgtc 28261 aagctctagt gaaaatattc ttcaacaatg aaagcaaaat aaagatattt ttagttaaaa 28321 acttagagac tttgttgcca gcagatctgc attacaagaa atgctaaagg aagtcctttg 28381 gcctacaggg aaatgaaacc agatgggaat tagtatttaa aggaaaaaaa atgaaaagta 28441 gtaggaatat taaatttgtg aataaatatt aaagaccata aatctatggt ctaatcaaaa 28501 tctcagcaag actttttgta gaaactgaca agtggggtct aaaatttata tagaaaggca 28561 aaagaactgg aattgctgaa acaatattga caaagaaaag caaagttgga gaagttacat 28621 tgcctgatgt gaagacttgc cataaaatta cagtaattga gacagtgtga agctgggcac 28681 agtggctcac acctgtaagc tttaggaggc tgaggtggga ggattatatg aggccaggag 28741 tttaagacca gcctagacaa catagcaaga ctccatctct acaattttct ttaattagcc 28801 agacatagtg gtacatgaat atagtcttag ccacttagga ggcttgagcc caggagttca 28861 aggttacagt gaactatgat catgcactgc actccagcat gggtgacaaa gcgagacgct 28921 gtctcaaaaa aaaaaaaaaa aaaaaaaaaa aagacagtgt ggtattgata agacagacac 28981 atagatcaat ggaacaaaat agaaacttca caaatagacc cacacatata tggttaattg 29041 tttttgtttt gttttgtttg tttttttgtt tttttgttgt tgttgttttt ttgagacgga 29101 gtctcgctct gtcgcccagg ctggactgcg gactgcagtg gcgcaatggc tcactgcaag 29161 ctccgcttcc cgggttcacg ccattctcct gcctcagcct cccgagtagc tgggactaca 29221 ggcgcccgcc accgcgcccg gctaattttt tgtattttta gtagagacgg ggtttcacct 29281 tgttagccag gatggtctcg atctcctgac ctcatgatcc acccgcctcg gcctcccaaa 29341 gtgctgggat tacaggtgag ccaccgcgcc aggcctgttt tgttttgttt tgagacagtc 29401 tccctgtgtt gcccaggctg gtctgaactc ctggcctcaa gtgatcctcc cacctcagcc 29461 tctcaggtag ctgggataac aagtatgagc caccactaga ttaagtcaat agattgtttt 29521 ttggttgttt gtttgtttgt ttgttttgag acagagcctc cctatgttgc ccaggctggt 29581 ctgaactcct ggcctcaagt gatgctccca cctcagcctc taaggtagct gggacaacaa 29641 gtgtgagcca ccactagact aagccaattg attttttttt ttttttttga gacagggtgt 29701 ctctctgtct cccagctaga gtgcaatgcg atcatggctc actgaagcct caaccacctg 29761 ggctcacatg atcctcctgc ctcagcccca caagtagctg gggccacagg tgcatgccac 29821 catgtccagc taattttttt actttttgta gagatggggt ctcactatgt tgctcagggc 29881 tggtctcaaa ctccttggct caagcaatcc tcccacctcg acctcccaaa gtgctgggat 29941 tacaggcata agccacggca cctggctgta aattgatttt tggcaaaggt gcaaaggtca 30001 tccaatggag aaagaatagt gttttcaaca aatgatgctg gaataattgg acatccatgt 30061 gcccacagat gaacttcaac ccatagttca caccatacat aaaaattgat tcaaaataga 30121 tcatagagct aaatgtaaaa cctaaaatga tataatttct gaagaaaata taggaggatg 30181 tggagaaact ggaactcttg tgtactgtta gtgggaaatg gtacagcggc tatgaaaaac 30241 agtatggcaa ttccccaaaa attaaaaata gaattaccat aggatctggc aattccattt 30301 ctgtataccc aaaacaactc aaagggacat gaagaaatat ttgtacacct gttttcttag 30361 aagcattatt cacaatagcc aaaaggtaga agcaacccaa ctgtccatca atagatgaat 30421 ggataaacaa gctgtggtgt atacatacaa aatatgattc agccttaaaa aggaaggaaa 30481 ttctgataca tgctacagca tagatgaaac tcaaagacat tatgctaagt gaaataagcc 30541 agttacagaa gcacaaatac tgtttgattc catacatctg agatacttgc agcagcaaaa 30601 tttatacaga cagaaagtaa aatggtgctt gctggggtca agggtaagat ggaatgagga 30661 gttactgttt aatgggcata aagtatcagt tttgcaaaat gaaaaaactg tggatggatg 30721 gtgatgatgg ctgcaaaaca atgtaaatgt acttaatgtc actgaactgt acactttaac 30781 atggataaaa taataaagtt tatgctatgt atatgtatta atcgcatttt tcattttcta 30841 tatcttatga ggcttgacat cttggggact taccaacctg aaagagactg tttgtcccag 30901 ggttagctca ttccaagaga tagcagatga ccttcctgca atatgcagtc caaccaattc 30961 agagcccata ctctgaacca cctcctctct ctggcttttc cactccgaga ggcaagtttc 31021 ccctgcccta atcatcccag ggtcaggtat cagacagtta gagactaccc ctatagccca 31081 cagctggcct gaattattca aactatccaa tcctaagccc taatcatccc agggtcaggt 31141 atcagacatt tagagactac ccctatagcc cacagctggc ctgaattatt caaactatcc 31201 aatcctaagc cctaatcatc ccagggtcag gtgtcagaca tttagagact acccccccca 31261 accccctata gcccacagct ggcctgaatt attcaaacta tccaatccta agccctaatc 31321 atcccagggt tgggtgtcag acagttaggt acgacaccta tagcccacag ctggcctgaa 31381 ttattcagac tatccaatcc taagcctgtt cagctgctta ccttgcctcg cccattcctc 31441 ctggcaaaaa ccacaataaa ggctttttcc taggctttct tctcatcctc tctggctgtg 31501 accaacctca aagttctctc ctgtggttct gcatggcatg gcgggctcct gccccttggg 31561 aactgtatta gactgttctc acactgctgt gaagaaatac ctgatactag gtaatttata 31621 aagaaaaggg gtttaattga ctcacagttc cccatggctg ggaagcccca ggaaacacaa 31681 tcatggcaga aggagaagca aacaaatcct tcttcacaag gaggcaggag agagaagtgc 31741 caagcaaagg gggaaaagcc ccttataaaa ccatcagatc tcataagaac tcactcacta 31801 tcaggagaag agcatagagg taatcggctc catgattcaa tgaccgcccc cgccaaattc 31861 ctcccacgac acgtgggaat tatgggacct acacttcaag atgagatttg tgtggggaca 31921 cagccaaagc atatcaggaa ctgtgagcaa taaaactata tttttaatgg caatggtctt 31981 ccgatccatt ggccttacca tacatgaata aacccaaagt cccaggtaca ttctagaaca 32041 gtatatccta caacatttta aaaaatacat taaggaaatg atagattcag taccttaagg 32101 cagcagtccc caaccgccag gccacggacc agtaccatgg tacgttagga accaggccgc 32161 acagcaggag gtgagcagta ggtgagcaag taaggcttca tctgtattta cagctgctcc 32221 ccatcgctca cattactgcc tgagctctgc ctcctgtcag atcagtggtg gcattagatt 32281 ctcacaggag cacaaactgt attgtgaact gcgtatgcga gggatctagg ttgcatgctc 32341 cttatgagaa tctaatgcct gataatctgt cactgtctcc catcacccct agatggaagc 32401 atctagctgc aggaaaacaa gctcagggct cccgctgttt ctacataatg gggagctgtg 32461 taattatttc attatatatt acaatgcaat aataatagaa aagtacgcaa taaatataat 32521 gtgcttgaat catcccaaaa ccatcccccc aaccctagtc catggaaaaa ttgtctttca 32581 cgaaacctgt ccctggtgcc gaaaacactg gggaccactg caatatagta catctgtgca 32641 gcagtagttt tgcatggtgg ctaagagcat agactcaaga attaaatttc ctgtatctaa 32701 atcctggctt caccacttac tagctttgtg ccattggaca agttacttct ccacttgtgc 32761 cttagttttt tcttatgtaa agtggggata ataacaaaat gcacctctta aagctttata 32821 agaagtacat ggcttaaagc aaggaaagtg cttaggacag tggcaggtgt acagtaagca 32881 ctctatacat agacctcttc ttgttatgaa atctgcagtc attacaaaac tgaggtggcg 32941 ggacatggtg acacacaact gtagtcccag ctactcagga ggctgaggca ggaggatcac 33001 ttgagcccag gactttgagg ctgcagtgag ccatgattgc acctgtgact agcctctgca 33061 ctccagcctg gcagcatagc aagaccctgt ctcttttaag aaattttttt tttacttaaa 33121 aattttttta attaaaattt taaaatgagg tggagctgta tatgctgata tggaaagata 33181 tgctaaggaa aaaaagtact aattgcttgc atttttaaag aagagcgatg taagtacttt 33241 ttctatagtt ataattatag gaaattataa ttataataat tgtaacaaaa tataccccca 33301 aaaagtatac aagaaactct caacaagtta tttgagggga aaggaattaa tgatgtagga 33361 gaaggtgtaa gggagacttt tgtttttgat tctgtacccc ctcccatact gcttgaagtt 33421 tttaaccatg ttcatccata ctttaaaata catacattta attttttgaa aggccatagg 33481 aataatcttt gttgctcttt attgccacct aggggatgct aggggcattg cctgccatga 33541 tcactcattg ctgttaaccc tattgactat ttaataattg attccacctt cttgattagt 33601 acgttacacc cagcaagtca tggcttccaa ctttcagatt tttattgagt tttgtaacaa 33661 taagaaacat ggttgggaaa ttgatgctct tttgaaaaaa aagaaaagaa aaaacaaaca 33721 tggattcgta gttagaaaat tttttatttt tttaattttt attttttaaa atataataaa 33781 gatggggtct cgacagcttg ctcaggctgg tcttaaactc ccggactcaa gctatcctcc 33841 cgcctcggcc tcccaagctg ctgagagcca ctgcatctag cctagaaaat ttaatagata 33901 ttaaaacaat tgaacaccta gtttgttctc cctctctgaa tgaccattcc agctttatac 33961 ttataatcta gatcagccca atccattggc ccacccaaaa tgatttcatc cctcactcaa 34021 tgatctgtcc gtggtggcta aaatgttgac atctgtctgg ttgcctgcat ttgaatgctg 34081 gtttcatgac tttctagttg tgtgacctcg aataagtcac ttcatctctc tgagcctcat 34141 tttctcctga aaaatgcaag tggtaataat agtagatccc tcaaaggact gttggtggca 34201 ttacatgagt tcatacatca aacacaatgc cgagtgcata ataaggattg ataaacaact 34261 ttcactttaa cacctacaat ctccaaatct caacttgatt gtcacttcct tcagcaagcc 34321 tcctctgacc tcaggacttg gtcacatcct tctcactagg tcacacttct ccttcttagc 34381 gtttgtcaca gaagactgtt ggatgattgt tcgattaatt tctctcttcc ctactagatc 34441 gtaaacactc aggacagagc ctgggtttga ttttgcccac aattctctcc ccagtgtcta 34501 ccacagtgcc cggtggatag gagtttttgc catgaaaaag ttattgataa agaaatgaat 34561 taatgcaatt agaaaactga aggcctctga aaaaatcaat attaactaga acaagtgcca 34621 tgaactagtt aggttctgat gggctagatg acatgagtca tgagacactt acagggaagg 34681 acagggtgcg gagaggttta ttttgtaaga cagttgcaga aaataaattg gaaaaaagat 34741 aaactgtaca cctctaccat ttgttctagg actaacccta ttttctttga ggtaaaattc 34801 atacaacata aatgttaatt ttaaagtaga catttttaag tgaacaattc ataggcattt 34861 agtacattga caatgttgtg gaaccaccat ctatgccaag tcttaaaaca ttttcatcat 34921 tccaaaagaa agccctcatt ccctcttccg ccaagcccgt ggtcaccacc agtctgcttt 34981 ctattgacat ggatatatct attctgtata tttcatataa atggaatcat ataatatatg 35041 accttttgtg tctggcttat ttcacttagc ataatgttaa tatttttgag gtcatccaca 35101 ttgtaacatg taacagtact ttattccttt ttgtgataga agaatattcc actgtatgaa 35161 cataccacaa tttgtttata attcattttg ttattgttgt tacagagtct ttctctgtcg 35221 cccaggctgg ggtgcagtgg cacaatctca gctcactgca accttcgcct cccaggctga 35281 agcaagtctc atgcctcagc ctcccaataa actgggatta caggtgtgcg ccactacaac 35341 cagctaattt tttgtatttt tagtacagat ggagtttcac catgttagcc aggccggtct 35401 cgaactcctg gccccaagtt gatctaccca cctcggcctc ccaaagtgct aggattacag 35461 gtatgagcca ctgtgccagc ctacaaactc atctcttggt ggacatttgg attctttcca 35521 ccttctgtct attatgataa tgctgccgat ccctgttttt taaaagtctc catggttcat 35581 catatcaaga agaatgataa tctcacagta ccctggacca gaaaccaaga cataagatcc 35641 aaattatgat ctatgaccac taactttttc tataattcag caagctattg tccctcctca 35701 tctgtaacct atgtggttga actaggattt tcatgcaatt agtgtgtctt gggaggtgtt 35761 tctgagtatt tatttggaag gtgctcctaa aaagagataa agaaaaggaa aggaagataa 35821 tcaatgaatt ccttgatcca attactgctg tggacatcta gggtgcattt ccactgggga 35881 cctctgagac acggtaggga atatatctta gttatacgag agctaaaaag aaattattta 35941 gacagttagt gagggtaaga gagtcctcag taaggttttt cttttaataa aaacgcagtc 36001 tccaaatcat ttcttttttt aacaaaaagc agcctgaaaa atcaagctgc aagcatagat 36061 aagcaagcta gaggcttgca taggtaaata ccagcagctg tgccaataga aaacagatag 36121 ctggaaacca ggtatattca acatggaggt tccctcttcc cttttctttg ttgccacgtg 36181 tgcagataac atggcaccag ccaggttttt taaaattcca tttgcataat aaaagattag 36241 agtgggatga ccagcctctt cacgggctat gaaaatggca catctggtcc aaccaatcca 36301 ctacacccca tgtaaatcat ataccatctc ctcacgctca tccctaaaac caaccacacc 36361 ttgccccaaa ccagggaaac ccactcagga ccccttcttc tgcatgagga agctctcttc 36421 cttctttcat ctatttaact ttccactctt aaacccactc cttgcatggc catgtcttca 36481 atttccttag catgagacat cgaacctcgg gtgttacccc agaaaaatga cgctgcttca 36541 ttagagttag cacatccaag gggcaaggga actggctcct ctgtggttgg ctgagggctt 36601 ctctcagggt gtcttctctg gtccccttgg gttcccactt gtgtcgacaa agctttctcc 36661 agaaggccct cagagttgca cctgctcaca gttaggaagt ttggggcatg cgcagaaggg 36721 cacatgcaga tggggtatga tggtcccaac agcatatgtt aaagtgatta tgaaactttg 36781 atattcaaat ccctgggagc ttacatttta aagagatgag acagagacag acttcctctc 36841 attgttttca cctccgttgg atctttgcct ggatactgct ttagtttgtc acttgtcctc 36901 tagaaaaggg gtagaaagtg tacatccttt tctcctccat tccttcaaag ctcttaaaag 36961 aggatagaga tgccaggtgc agtggctcac acctgtaatc ccagcacttt gggaggccga 37021 ggagggtgga tcacctgagg tcaggagttt gagaccagcc tggccaacat ggtgaaaccc 37081 cgtctctact aaaaatacaa aaagtagcca ggcgtggtgg ggtgcgcctg tggtcccagc 37141 tacttgggag gctgaggcag aagaatcgct tgaacccggg agacggaggt tgcagtgagc 37201 cgacattgca ccactgcact ccagcctgag tgaaaagagt gaaactccat ctcaaaaaaa 37261 aaaaaaaaga ggatagactt gaggtgaact ttattaagaa cattgtgtta ccttgaactg 37321 ggcacttctc ccttctaggt ttcagcttac cctgtgagtg aagctattta tttaaaaaaa 37381 actttattga aaataaaata aagatacagg aaactacaca aaagaaatgt atggcttagt 37441 aagttactac aaggcaaaca tgttaataat taccaaccca ggtcaagaaa cagaattttc 37501 tcagacaacc taaaagtcac aaaatgaggc tatttgccaa ttaaggttgt agaccaaatg 37561 ctgtagaatg aacttctttt aattctatgt tgatgactct ctgtcacaat attcacatct 37621 cagtgcaaag agaacacaag aatgtctagc aagaaaagat ggacaagttt ctcaaacatt 37681 ggcagtgtta cttacattgg ctaggcaaaa ttcagaaggc ctggtggaaa tctggttaag 37741 ccctaaaagt aagcttgtaa ttctatccta gcagcccaca cacatttgag tctgtcttag 37801 tcagctcaga ctgctataac aaatcatcaa ctgggtggct taaacaacag agatttactt 37861 ctcccaggtc cgaaagctgg aagtctaata tcagtgatat aatttggata cttttccccc 37921 taaatcttac actgaaatgt aatcccatgt tggaggtgag gcctccaaca tggtgggagg 37981 tgactgcatc atggaggtgg acttctcatg aatagtgtag caccatcgtc ttggtgctgc 38041 cctcacaaga tctggttgtt taaaagcgtg cggcacctcc tcccactttt tttcttactc 38101 ccactcttgc catgtgatac atcaatacat cggctccccc ttggcttcct gaggcctcac 38161 cagaagcaag cagataccca gagccatgct tgctgtacag cctacagaac cataagccaa 38221 gtaaacctct tttctttata acttactcaa cctcaaatat ttctttatag caatacaaga 38281 atggcctaat acaatcaggg tgccaggatg gttgatgtgt ggtgagggct ctttccctga 38341 gttgcagatg gccgccttct tgctttgtca tcacatggcc tttccattga gcatgggcgt 38401 gaatagacag catgcgagca agctctccag cgtctcttct tcccataagt accctgatct 38461 catcatgggg gccccactct catgagctca tctaacccta attacccctt gaaggcccca 38521 cctccaaaca ttataaaatt gggagtggga gtgcaaatgt atgaattttg gggggataca 38581 atttagtcta cagcatagtc caaatgtgaa atttcttcct ccagctaccg atgtttagtg 38641 agaacactgc ctgttctcac cacatgtgca tatgtgtgtg gactctttcc tctggagggc 38701 aagaacttgg ttttgttcac tgttgtaacc aagagttaca acagtgcctg gctcataatt 38761 ggcactcaat attgccattg agttatttgt gatcagagcc ctggctgttc ttagatgggg 38821 agggttcttc agaacagtac aaatgtttgt agacaggcca gctggggaac caaaggtctt 38881 tttgatgagg tgccatgacc atgatgtagt ggggagtggc caggtgatcc agaattcagt 38941 ctcagtatga cctatgatca tctgagtgac ctcaggcaaa actgttcgat ctctgggact 39001 tagactcttc cttatataaa acaaatgggt tggtcagttc agaagtgcta tactgattct 39061 aatccatcct atggaggagg ccccaaaaca ttatccttaa cccttttaat cataaattta 39121 aggataaatg agattttatc tcactctggg tgacttttaa ggacttgaaa acctacttct 39181 ttttgtgctt ggagctggag tgaatttgaa gccagggttg cgggggtggg ggaggggtac 39241 tcctagctgt ggctgtgtta aaattgccct ggtttgtttg ggggagaatc tagggtattt 39301 atcatttgtc tttgccctgg tgccagacag acagcagtga gtagatgatg ccttttacac 39361 agcaatgtgc taacagttat acttcctggc acacctccag cccccaagat ggaatggtct 39421 gacatccata tcagcagaga atcagagaaa gatgagacgc tctgatgctt aggaaagtca 39481 caaaccattg ttacacagat aagtaacaca aaatgaccca agagagacac agctgggacc 39541 caagtagaca cttgtaattg gcgagtctac cctacccatg attttcaata tggggtcaat 39601 agctgataga taccaatgtt agtttctttg tcatgaaatc ttgatacaat gtggtaagag 39661 tttccaatca gcagttggtg gcatcctcag gatattgttg attgtatttg acatcaaggg 39721 ccacccatgc tttcactgtt atcttattca agctagtgca gatgtcttaa aactttcttc 39781 atgcatcaaa gcatctgcat agctcagata aagaatccac agctcatgtg ttctttctta 39841 tctctccggc tgtttgggag agatttcaaa actcaacacc ccactgtctc atgtaaccaa 39901 gtcagggcca ctattagaaa agcagctcaa agaagatatc ctgggtcatc cacagtatct 39961 tagcttattt aattcagcaa gcacattcaa catttctgat tatactgaat acattcaaat 40021 tctcagtcat cagcttcagg aagaccctcc tggctcccat cccttttcct ggctcccatc 40081 ccttttcctg gcctggctta agtcccttct gggctcccac aggatccctt acctagctat 40141 attattgctt ttctgtgggt cggtgccagc ctgtactgta ttaattcctt tggggctgga 40201 accatatctt atttttttaa attaaaaagt agtacatcct cacaatagaa aacttttcaa 40261 atacagaact atataaatga aaagtgaaag tcccttttct gtctgctcct atttgcaatc 40321 tccctcctct ccccacagaa gaccactatt aacaatgtgt tgtataccct tccagaactt 40381 tccatttgct agcctaatag tgctaataaa agtgttaaca gttattgagt gtttactaag 40441 agtcaagtgc tatgctcagc atttttatgc acgttcccaa ttaattcaca aaacaagcct 40501 atgaagtagg tgttattttt gttcccattt cacaaaagaa gagactaagg tgacagttac 40561 taaatggtaa agccagaatc tgaacatggc caggctgaat tcagagctcc tgcccttgaa 40621 cttgttctct cacttgcttc cggaccttgc aatatcatag acacttcccc atagtgacct 40681 aaaaatggtt tctccctttt taaaagttgc ttggtattct acaaaatgaa tgtattccat 40741 tcattttgcc attctcctac tgaacaactt ttaggatttt gtagtttttc atttccttat 40801 ttcaccaagt gtaagattcc attattgtaa aaagcatcca gtttcagaaa ttaaaatata 40861 tgggggcatg gggggtgaca agtgcatctt agaatatatt tttgcatgta aatattttgt 40921 agataatgct tgtaaataat tttcatgtgt gtatttctgc acaagtttcc taggagtaaa 40981 attgtgtgtc aaaatgaagg tgcattttta aattttgatg catactgctg aattcccccc 41041 aggaaacctt atcttagtca tcttgtgttc ctcatttcct tagcacaatg tctggtactt 41101 ttttaggtac taaataaatg ttcagtgggt gaaatgcatg acctgacccg gcaaatgcaa 41161 agtattttga gatattcaca caccaaaaaa cccacaaaat atgacaatga ctgacaatga 41221 gaagatgaga atattttact accgtatttg tatgcacttc ttgttgaagc actgaccaga 41281 gtaaatgaca aacgctaatg catcttatgc agaaggatgt ttccgaagtg attacaagat 41341 atcttgcaaa attacctatc aaaagattgc acactcaata aaaatacacg aataactcca 41401 gcattttaca acccccaacc tactagacat aagaaatttc cagagatctt atggtctgcc 41461 attttttcaa atacctgcaa gactgcctgc acccccgacc ctaatctgta gaaatcatga 41521 ttctgcaggt ctgcagtggg ggatgcattt aacaagcgct ccaggttaac ctaatgcact 41581 gtaaagcgtg agaacacgtg cattaaaagg gctttaagaa gtggagtttc tgaatcagat 41641 ttgcatttta gaaagatccc tctggctgca gagtgtagaa tcaactataa gtgagcaaag 41701 atagctaaat ggcattattc agcccagcgc atgaacggct cagattgccc gcgcgctggg 41761 gtgccccctg ccggccctcg gcagcgcctc gtcctgggcc tggccctggg cggccggctg 41821 ctgactgcga ctgcgcgctc cgaggcctgc agagacggcc cggggcacct gttaccgcac 41881 cgctcaagac cggaagcgga aatggaatcg agatagcctc gcgcgtttag ctggccgccg 41941 ccacctccac gccctaggcc gggccgactt acggagtcgc cggaagcgga agtcgctgag 42001 gggtggtgaa gcggttggga aagtgtcggt ttatcttcgc gccccttgcg ttcttgccgc 42061 ggcttgcctg ggcaggtaaa gcgcgattgc gagagctcgg caaccctgcc gactcagccg 42121 gaaccggctc ccggcccgag gggcgtggtg tcctggtgct ccgactcctt ccgcaggctc 42181 cttgggaccc gcggttccgg gagtcccttg ctcagggtcc ctttcctgca gtgaggcgcc 42241 gtccgccttc cctgtgtccc cgcagacccc catcatgggc aataccagca gtgagcgcgc 42301 cgcgctggag cggcatggtg gccataagac gccccggagg gacagctcgg ggggcaccaa 42361 ggacggggac aggcccaaga tcctgatgga cagccccgaa gacgccgacc tcttccactc 42421 cgaggaaatc aaggtgcgag cggtgtggag gaacccgatt ccccttgact aatgttgagg 42481 agaggaaccc tccactagat tcccgtcaca tcctttcgaa aaacaccaga acggagaggg 42541 aggtggtaca ttacccggtg cacttaaaag ccagatggtc ctgtaagagt tctcttagcg 42601 gtccaagaac ctgtgtctaa ttaccaatat gagaaagtca ccctgaggga ggaggtgctc 42661 acttggttta acatcattaa ggagaaagca gctctgggac agagtttcaa attcttaatt 42721 aattcagagt cgagctaaca gtaattgagc cccagcttct gcttttctct taggtgtaat 42781 taattgcact gctttgagac aaattccctc acctttacac aaactggagc ttttgtaaaa 42841 gttagtttgg gaatttatcc attttggagg caaaccaggt acatatacaa gctgaaagta 42901 gagcggaaca gctgacggca gttgccagag gtgtttagtt ctgcccaaca cttttgattc 42961 accctactta ttttttaaaa gacggtggtc tcactaatta gccgggcgtg gtggcgggcg 43021 cctgtggtcc cagctactcg ggaggctgag gcaggagaat ggcgtgaacc cgggaggcgc 43081 agcttgcagt gagccgaggt cgcgccactg cactccagcc tgggcgacag agctagactt 43141 cgtctcaaaa aaaaaatata aataaaaata aaaaagacgg tggtctcaca atgttctcca 43201 ggctggtctc gaactcccgg gttcaagcaa tctttatgcc ttggcctccc aaagtgctgg 43261 gatcgcaggc gtgagctacg gtgctggccg gttcagccta cttttttttt tttttgagac 43321 cgagtctcgc tctgtcgccc aggctggagt gcagtggcgc gatctcggct cactgcaagc 43381 tccgcctcct aggttcacgc cattctcctg cctcagcctc ccgagtagct gggacgacag 43441 gcacccgcca ccacgcccgg ctaatttttt atatttttag tagagacggg gtttcaccgt 43501 gttagccagg atggtctcca tctcctgacc tcaggagatc cgcctgcctc ggcctcccaa 43561 agtgctggga ttacaggcgt gagccactga acccacccta cttatttttt ttttttcctt 43621 ctttgagatg gagtctcact ctgttgccca ggctggaggg cagtagcaca atctcggggc 43681 actgcaacct tcgtctcccg gactcaaacc atcctgcctc agcctcccga gtagctggga 43741 ctacaggcgt gtgccaccat gcctggctaa ttttgtgtat ttttggtaga ggtggggttt 43801 cgccatgttg cccaggttgg tcttgaactc ctgacctcaa gtgatccgcc cgcctgggcc 43861 gcccaaagtg ctgcgattac aggcatgagc caccactccc ggcctcaccc tgcttttttt 43921 taaagcgtac atagaaaagt ctttcagctt ctagaggtca gtgggcggtg agggtctgtc 43981 tgaaggaggc agccagcagc tactgaactt ctctctagtg ttgtttaaat atgacttaat 44041 cgtgcttaag caggccagtg caaacatttt tcttcctgct gcagcctgtt taaattcaaa 44101 aactcaggtg acttaactcc tgcttcaacc acactcctgt taaccacctt catatggtgg 44161 atatttccag ccgcagggct cagtccaagg acacttagtg attctggcag ctggtattta 44221 tgtgttatgt ccaaaaggaa ctaattgttt caaggaacaa agtaggttta ggaaagataa 44281 agctgccctc tgcccaacat accctgttag tatccaattt catgttaatc atcatccatt 44341 taattgcttg gcagctcatt ccatctgtgt ttgtcctcat gttgagcgag gtgatcacat 44401 ctcacacctt cttttggatc ttagttttga gcctgactac aaagaggctg cttttatgtg 44461 gccctacaac ctgtggtttg aacttgctct ctgaatttgt tgcattcttg aagagctgtg 44521 tgcaaagtga actttagaat catgtttctc taatgacttg acgtcctttc cccaggaggt 44581 aattttggtc cctggtccca agaaggctat ttaagacacc aaaggaatca aaactacttt 44641 cttcagaagt atttgttctg ttactattcc aaaactcctc caggaaatgt tgagcacaca 44701 catcatctat ggggaatgat ttcaagcagc gtccattctc acttcattag aagttactga 44761 aatccccaga taccaaaaaa gtcgtgaatt cttttaacta cactagggtg ctattttggt 44821 cataattagt tatccttagt gtgaagtgat tccagtctgt tatttaatat gtagatcttg 44881 ttagaattca aagaatacag gcatcatttg gataagctga ggtcggtatt tgagatgaag 44941 gtgctagaag cctgcttaac tccctgaaga gtcttttcca gcctgaacct tctgctcttt 45001 ttaagtcttt ggcttctgac tgggaaaaaa tctaaaaggt gcatgactac atatgtcatc 45061 cagatagatt tggatgagtt tatttcttat tctggaatac ctcagttata cggagcttac 45121 cacactggag gttatcattt atcctaatga gggttcagtc tttgtcttgc tttattcccc 45181 agtccctaga cagaactttg tgagtagttt ttcccttcct caaaatctgg atttgcctat 45241 tttcaagttg acagctctag aattgttcct cttaaatttt catgtacagt catacatcac 45301 ttaatgacag ggacatattc tgagaaatac atcattaggt gattttatca ttgtgtgaac 45361 atcatagagt gtacttacac aaacctagtc tgttctcatg ctactatgta gaaatacccg 45421 agactgggta atttataaag aaaagaggtt taattgactc agttccgcat ggctggggag 45481 gccttaggaa acttacaatc atggtaaaag gcacctcttc acaggacggc agaagagaat 45541 gagagcaagc aggggaaatg ccagatgctt ataaaaccat cagatctcat gagactcact 45601 cactgtcaga gaacagcatg gggaaaccgc ccccatgatc gatacctcca cctggtcctg 45661 cccttgacac gtggggatta tggggattac aattcaaggt gagatttggg tggggacgca 45721 gagccaaacc gtatcaccta gtctatatgg tgtaacctgt tgcttatgga ctataaacct 45781 gtacagcatg ttattctact gaatactgta gacacttatt ccacaagtat atttaaggat 45841 ttgtgtactt aaatgtatct aaacatagaa aagataacaa taaaaatatg gtataatctt 45901 atgggactac catatacaca gtccgtcgtt gactaagaca tcatgcacat gactgtactt 45961 caaatcttgc ttgattaagg gaggaacatg acattaagct agtcttggat tcgtagtagc 46021 cccttccaag tgtgtttcat agcttcttct agaattttga tgtcaacttc tttaatgaca 46081 acaatagcta ataattagca ctcactgtgt cagacactat atagagcact tacatggact 46141 gtctcagtct tcacaacctt aaaaaatata tttctatttt gcaggcaagg aaactgaggc 46201 accactagta agtttccgat cctaaccatg aaccaagata gtaacagctg cgtctttaga 46261 atgaattgtc aatattgtct gttgtaggga gctcgcttat ggctttctga gtaaactgct 46321 ctgcttctag gcaccagaga aggaggaatt cctggcctgg cagcatgatc tggaagtgaa 46381 tgataaagct cccgcccagg ctcggccaac ggtgtttcga tggacggggg gcggaaagga 46441 agtttactta tctgggtcct tcaacaactg gagtaaactt cccctcacca gaaggtaatt 46501 gcctggggag tgttcacata tttgtcttaa cataaattct cttctttcta aaacatctct 46561 gagagagaac agaaaatgga tgtttctata aaatggatgc ctagtggaaa aataattttg 46621 cttaataagt cttaaggttc aaatgttaaa ccttggtggt ctctaggttt agagtacaga 46681 agaaagagaa tatttctgtg tcacagacat ctttcacttc agatcctcag ctatctgttc 46741 ggtcaggact tttccacaga aacacaaaaa tctgtcttaa tgcagtaagg gagcactggg 46801 ctggaatctt agaaacccac atttgagccc acattggcca tgctgtcagt cacctaacgt 46861 tagttaacct gagcctcagc gttcgtttct gtaaagaggg aatattggac cagatcatct 46921 ctgaggtttc atctaacttt gactctgcgt aatactttgt ttgaaaagta catgttggct 46981 gggcatgggg gctcatgcct atgatctcag cacttttggg ggctgaggca ggaggattgc 47041 ttgaggctag gagttcaaga ccagcctggg caacatagtg agacccccat ctctacaaaa 47101 aaattaaaaa ttagctggac atggtggtgc acacctgtat ccccagctac tcgggaggct 47161 gagatgggag gattgtttga gcctgagagg tcagtgctgt agtgagcagt ggttgcacca 47221 ctgcactcca gcctgggcaa cagagcaaga ctttgtctcc aaaaagaaaa gaaaaataca 47281 tgctatgtag gtgtcattag caaggattat agtaagacat tttatcagga accccagcat 47341 ggacttctgt cttaagttgt cagtgagaga acacctttag tgagtattct tgataaaatg 47401 agggagaaag tgtcaggcct attgtcaggg gctgcctgct ttccagcgtt tccattggag 47461 aaggagggag ccactcagca cagcagtcac aggtcagttc aagccagggg acaaaacaac 47521 ccagtgaagg ggctcttccg gagtttggtt tttcagtcat taggcaagtt tctttagaga 47581 acatttgagc ttcctattca attgatccct tctcctgccc aaggcaaagt gacaaagggt 47641 aaaacgaggc agcttctgtt tatcttgtgt tgtcatcctc cttggcatcc aggagccttg 47701 cgagcttgag acgcacaaat gcctgactgt tgacctttta acattaggga tgtggtttcc 47761 tggtatttga gtaaggtgag ttttcaggaa actgtgtcgc actgctgctt ccttatagcc 47821 ttttcagatt gaggtcatta tgctgaattg tctaaatggg ctaaaaatgt ttgttctcct 47881 gcagctgaac tcttggtttt atgtggaaac gttttgttct gtagctggtt tggcaagtaa 47941 gctcgggggg cagcccaccc cacggaagtc ctctgcttcc tttttccttg cagccacaat 48001 aactttgtag ccatcctgga tctgccggaa ggagagcatc agtacaagtt ctttgtggat 48061 ggtcagtgga cgcacgaccc ttccgaggta ctcttcctcc cacctctggt cctctgggtg 48121 cccgcacatt ccaaacaaat caccttccca agagattgcc gctaggtccc tttgcccagc 48181 tagtaaaagt ccccgtgtgt ggcagagctg agtagcagca ctacctgtca gacagttggc 48241 atacttgacc aagatgagca gggtggctag ccaggagatg aggccttcca gccaggaatt 48301 ccaagtcctc tgaagaataa ctccgcagac cttccacgtt atgatttctg cctatctgtc 48361 tcttcccagc ccatagtaac cagccagctt ggcacagtta acaacatcat tcaagtgaag 48421 aaaactgact ttgaagtatt tgatgcttta atggtggatt cccaaaagtg ctccgatgtg 48481 tctggtatga acacagttat tttataccac atgcgtgcag gtgggggctg tacagtctag 48541 acatactctt gtttctcttg cctctcttga gctgaagctg cccagtcaga taggcattta 48601 tagcccccac ttaaaggcca cagaacttaa ttcctgtcag gttgttgaaa tttagcccta 48661 agggagctac aagtcatttc cctggtcatc tcaggtattc agtaggtctg cttggccaca 48721 gaacgcagac agcagaaatg gaaccatagc ttgatctcgt gccaagggca gggctgaaga 48781 ggccctggag ggtggcactg agtcagtgac agcccagcac tggggaagcc tgtgtctcat 48841 cccacttgtg gtgcctgaag aatttcagcc tgacaggtgt aaatggacac ctcagtgacc 48901 ttagcagtca cttatgtggt acattagcac cttgtcagaa tccctttggg gaggatgcgg 48961 ctccccacgg gaaacagcct gcacagccca actctccttg atgggaagct atttggtaca 49021 gaaatacaga cccagtaaaa cttctccatc ttgtaagagc ctttcaggca tcgaggtttc 49081 aataaattgc tttccctcca agccccacag gaagaaacta cctggaacat gattcataca 49141 gttcggggga agagggttgc gatctaaagt gttccccttc ttggaggcgg gggcttaaca 49201 attcctggac tctgagggag gcagtggaaa gatgccctgg cagaatatct ggcgccctaa 49261 gaaaacagat tccgttcgtt gacaggagag ttcttcatca ttgtctagtt aggcctggaa 49321 gttctgccac caggatttct tgttaccagt tcagaaatca cagtggtaaa atgatcacaa 49381 tgggctcact gtggttgaca gtttgttgag caggacttgg aaatccgaaa tctgtgcttt 49441 tatctgggta attagtcttt tagagatatc atctagtttt tttgcagata gttgagaatc 49501 agggaatgga ggaagtgtgc ccaaatatcc tccaaaatat ataacttact gatacatgta 49561 attttaagcc acttacaagc tttttagtca aatttccacc tagggaaagg aaacggaccc 49621 ctcactgata cgaagctaat tccacgttct ctacagactt ctgtttcgac acttacttcc 49681 tcaaaatgtt atttactcct tatggctcag ttttctgcac ttcatgcttt tctcagtttc 49741 ttgctgtgat atggttgggt cacataaacg gggcttgtcc cctgaaaaat gacattgagg 49801 gaattggcag agctggggca tagcagattt tccctggacc agtcccagga tctggcgacc 49861 agaaccatcc cacgcgttct ttggatgatg gcttggtacc ttctggctgg ccaaacgtca 49921 ctatccagtg ttctagcatt atgaaatcct gctcttgtag cagcttctca gtgtatctga 49981 ataatccaga tactcaagca agagtttgga aggaacttga gtgtaatgga gttaaagagc 50041 ctttagatat tgggctttca attttccccc ccattttgaa atgtgcctcg tttagcctgt 50101 tttctccacg gcctctgtgt aaacagactg cctcatcttc tgttgcattg cggccggggg 50161 tgcccataac ttggacttag ctactcaatc ctcacttagc ctccagcttc tttgtggttt 50221 tgccagccca gcctgaaggg agaaggaagt cttattctgc cgtgcgtgtg ggattaagtt 50281 atcgagaata gcctccttgt gatctgacgc agactcactg gctgggccgt gtcttccacc 50341 agggcatctg agttgactac tgagttgtat gctagtcacc cagattccaa aagggttttc 50401 tcctctcctc tattctgcac tcttggaacc agtgcatcct tcaagagaat gtaaatctgc 50461 attgagggaa caagagacaa ggaaaatgag atgatgacaa cataagtaag gctgctccta 50521 gaatgcctga ttttcaaagt aaagtcctgt taccatctcc caacagagct gtccagttct 50581 cccccaggac cctaccatca ggagccctac gtctgcaaac ccgaagagcg ctttcgggca 50641 ccccctattc tccccccaca tctcctccag gtcatcctga acaaggacac ggggatttcc 50701 gtaagtatgt gggcatctgc ccggaccatc cgccgtgggt catgttcagt tgctttcttt 50761 cctgtgtcca cctcttgcaa agcagctggt gagcaagctc agtgtcaccc cctagtgtga 50821 actgtgccgt tcacctctgc tgtggggatg ggctgataac aagaggttcc cagtgtggac 50881 accgctgcct cgtcttgcaa agcctttctg cactactctt ccttgtcctt cacgcctgcc 50941 tcagtctaag ccctgttcaa tttacatttc atgacaaact gtatcaccat ctcttcttag 51001 taagctacat gctctggccc ccacatttta gaatctagtt ttaatcattc aattgctttt 51061 agaaatatca gtactcctgg atcaaaattt gggatatcta ggtaacacac atttctcctc 51121 cgggaaatta ttacatgcag aaaagaagtg tgtgtcctcc tggatcttga agaacagcca 51181 gttctgttgt gattctgttt tctgttccat ctaacctgga ttgcctgccc agtccgccta 51241 gctctgggat gcatggcaca gccgtaattc ttgggaatcc aggaattcag cttggtccaa 51301 gctacctctt aagaggcatt gtttacgagg tgcttaaagt aaccatgctg ctactgaaca 51361 ctccaccctg aagtggaggg catgtgtgct gagacgagga atgcacgtgt acacatgcag 51421 gtgtccctac agtggtccct gccgggctgc caggacaggg tctcagtgct tgttacccta 51481 cagggtaccc aacagggctt gcctttctca ggtcacctcc tgtgctggga gtgcccctgc 51541 agctgggtag ccatcctggt ataggcccag aagaggccag tggtaaggac agcacgcctc 51601 agctggcagg gcccaccatc tttccagcac tgagttggca acagtctcaa ataggtcaca 51661 ggtgaagagt ttcaaggaca tgtagcttct gccttagaag cacttagttg acactgcttc 51721 ccttagctct aactttacaa aggttggtgg gtaagatgac tttcagtgcc ctcctaagct 51781 agtgaggttt gttttttttt tttttttttt ttgagacgga gtcttgctct gttgcccagg 51841 ctggagtgct gtggtgccat cttggctcac tgcaagctcc gcctcccgag ttcacgccat 51901 tctcctgcct cagcctcccg agtagctggg actacaggcc cccgccacca cgcccggcaa 51961 attttttgta tttttagtaa agatggggtt tcaccatgtt agccaagatg gtctcgatct 52021 cctagcctcg tgatccaccc gcctcggcct cccaaagtgc cgggattaca ggcgtgaacc 52081 accgtgcctg gccaggtttt gttttttcta gaaaagggag gcatcactag ccatcttctc 52141 tgatggagac tgctgctggg aatagacaag ctgagagtgg cccagggggc tgggaatatg 52201 agaatgtcag gacccctctg tccaaggtag cactaagctc taatgtaaac tgtggttaca 52261 gttgtcacta ctaaagggtt aacatttatt ttgccccaaa ctaatcttaa gtgcttttcg 52321 tgtattctta ttcaagcctc acaacaacca tgtgaagtag gtgcaattat tacccccatt 52381 ttagggatgg ggaaactgag gcacagagca gtagggagaa cacttaaaat ctactgtctt 52441 agagattttc aagaatacat tgttactaat agtcaccgtg ttgtacagtg gatctcttga 52501 acgtattcct cctgtctaac tgaaatgttg tattctttga ccagcatctc cctaccccca 52561 ccactcctag ccgctggcaa ccaccactct actctctact tctgtgagtt cgactttctt 52621 ggattccaca taagtgacat cacatgggat ttgtctttct gtgcctggct catttcactt 52681 aatgtcctcc aggtttatcc atgctgtcac agatgacaag attccttttg taaggctcag 52741 cagcactgca ttgtgtatca gtacctcatt tctttatcca ttcatccatt gtggacacct 52801 acgttgattc cacgtcttgg ctattatgaa tagagctgca gtgaacgtgg gagtacagat 52861 atctcgacat gctgatttca tttccttgga gtatctaccc tgtagtggga ttgctagacc 52921 atgtggtaat tttagttata actttttagg aaccttcatg ccattttcca taatggttgt 52981 actaatttac atgcccatca gcagtgcact aaaagtagct tttaagtgtt ctcaccacaa 53041 aaacatggta agcatgttct gcgtgtgttg atgagcttga tttagccatt ccacaatgtg 53101 tccatatttc aaaacaacat attgtacatg gtaaatatac ataattttat ctgtcaatct 53161 ttgaaattga ttaatttttt aaaaacccct cttaggctca cacagcaact ggctgagcca 53221 gcagtcacat caggcttggc tggtgccaga tccttggtag ttggtgctct cagcagccac 53281 cctgaatggc ctcccagcgg gagcttgctg agtatttgtt gacacaatga ttggcccagc 53341 ctctgggatc ctaagcctac agcagttgtt ctacactttt ttccatttga attgcattta 53401 taacagttgg gggtatacaa tgaaaaaata aatcccgatt tctggcttct cttaaaacat 53461 aggaaggtgc ggcgggcagc tctgagtgcc agcccggtgg cagattttcc agagtagtgg 53521 tgtcttgccc aggtatccgt ttcattcttc aggttcattg agtcttacag atacaagttt 53581 tgaatccatt gctcaggagc gtttagtgca gatggtctct tcctgtgttc tctgaaagtg 53641 ctgttaatag ctgctttctt cctgccccac cctccataag aggacagggc cgtggcccac 53701 acttgaaaga ggccggggta aatgcctggc cagagacaca caccgatgcc tccagcaggc 53761 atgcagggag ctcccttcag gtcagaaggc gtgaccttca tctcacctgt cgtcttggac 53821 aagcccttgc gctgcctgat ttgggaagag aggtcggcct gagcgctgcc tcctgtcctt 53881 tgatatctgg cactgggaag taaaggggag aatcttggtt tccaaatccc aaatgctcac 53941 cgctgccttt gttccctcac agtgtgatcc agctttgctt cctgagccca atcacgtcat 54001 gctgaaccac ctatacgcgc tgtctatcaa ggtaatgaca tgtctgtccc catgagagct 54061 gtgttgccca gtgtgtgctc tgaggacccc cagcagtgga atgacctggc gggactgact 54121 taaagggcag cttccagggt ctggcacagg ccatcaggct caggttctga agctggcccg 54181 ggaatttgtg gttttatgaa gcccttaatg attctcatgt cctctgaggc ttgagcccct 54241 tagtccagcc tttgatcatt tccaccttgt tccttaggat ggagtgatgg tgctcagcgc 54301 aacccaccgg tacaagaaga agtacgtcac caccttgtta tacaagccca tatgaagagc 54361 tgggggcgga tggtggccca ggagacagca caccaccagg ctccacacgt gcatgctttc 54421 cccaagaggg aatggactgt acattgctca tttcacactc ttcagaagac atttcatacc 54481 tgccctggtc ctgcttgaag gtttgtccag gcagagcagc tcctgcagcg cctcggtctg 54541 tgacagtcct cctagcaccc ccatggcttt gagcctcggg gactcatcaa gtccaagaaa 54601 agagggaggg gtggcagagg atctgcagcc ctggccccgc ggtgcatgag gctgggtgca 54661 gttctaaacc tacattctcg atttttctta agccaaaaat gaatgctaac tcctttgcca 54721 gtaaaattct gggaaacagg gactgaggcc acacatcatt tccagtcatc tgtgtgtttt 54781 taaggccagc cacttgtccc tgttgaggcc tggctatgga actaaataca gtgttggtct 54841 tgcctgtcct tcaaaatcaa caacagattg tctctcggct ccagggaggt gtcatttcta 54901 tagaaattag aagctttctg atttctagat gaggttttac aattgtttct tacagtcatg 54961 tgcactaagt actctttttg taagcagagg tggctggctc tgcagcctta aggccatttt 55021 ttaagtcacc acgtctagaa gtcacatgaa ctctgctcag caataatctg ttctcagaac 55081 agacttttca acctgctgcc ggatttctcc attcagctgg atgatcctca ggactgacca 55141 gttagctggc aggttgtcca gctttttatt ccagtcataa taggtgacag tgttaaccgt 55201 gaaaacttga gaggcactct gccctcttcc ctataaaatc acacagcgtg attttacaag 55261 gtcccgtggc accttgctca ggacctctgc ccctagttag caagactgca gcagttgctg 55321 ttgcttattc tgaaaggaat gtagaacttg acagcagcct tctgagtctg ggtcaggaag 55381 atgtcctttg gaccaaagca gacttcttta tacgcagctc agtttcccgg gagtcgccac 55441 agatgtaccc actagcccag gttgctgtga gtcagcggaa gctcccgtta tgccctttgc 55501 tcctggtggg agagggagga gtgagctccc tgggttccag tatttacttg gtatacctga 55561 gtttgggggt accctttttt gtgacttttc aaaacagtga attactgtca ccttgatgga 55621 caagtttcaa taaaactttg taaaaataat tggacatgtg cctggaggag ctctgatttt 55681 attcagtccc ttggagaaga gactggaact cttcgagagt tgcgtttcag taccgttctt 55741 gtggctctga gcggacaggg ctactccagc tgcccaggcc tgagaaccag tcctgtgact 55801 cacacccatt cacacagttt ggaaccttaa gggaccccta aaactgagcc cctctgaaca 55861 gctcaagtca caaggggaag ctgaagaggg gtaacagcag aggctcctgg cttcggccag 55921 ttgttggtgt ccccaggcca gagggtttgt ttccttttgc tcctaaacaa gcccggtggg 55981 acagcctggg atcgtgaccc tggcgtgtga gagagcagga cgttgtggag tcagagctgt 56041 tttcatccac taaaagaggc cgtgtccctc ctccaaaaag gtggaaggtt atgggtgttc 56101 ctttaagcct tctcaggtat caggaaacat tttaaattta gcagaaaagc atgctaatga 56161 gctctaccca ctggtgcaga aaacccaagc tgaaaatgct atgtaaaata agactggttt 56221 actggagcca gtcaccccat gagttctcca agccctgggg ccacagggga cttccaggtg 56281 aggtcactgc tgagatgatg gaggagacac aaaacaagag gacatggtca gagacaacac 56341 atctctgtgc gtattcaaag gatttcattt tctttaaaaa gcacattata gtcggaaagc 56401 ctgcccacca agtgaaagcc tccataccca agaagctaaa ggttaatggt gattttacaa 56461 aggtggaaaa ggagaccttg ttttgcccaa atcacagggg caaagccata tgtcaaatcg 56521 tgtcctatga ccaagtaatc ctttcttcat tagcatttcc tttatgggtg aggacttagt 56581 taaagcttaa ctcaaattgc ttttcaatga agatacattt ttccctctag aagctgatat 56641 acagatgaat ttatagagcc aaattaaagc ctaaaaaatt actaaaatgt aaatcagtat 56701 tgcagggcta aggccacaaa caggaagtca aaaggccaga caagaaaatt gtgcctgcac 56761 ctcagctagc tcaattagga aatctccaag gctcctaaaa gactgagaga aaaaacctgg 56821 aacatcatga tacgatctgt ggctgaggtc agagactgaa cgagccagtc taggaaagaa 56881 caaaatttca ttttaattca gttccttttg tatgagcaca gtttcacttt caagcattta 56941 cttcccatat ctgatcctcc tctgctattg tgtcaccttc aaaattcaaa tgtccccaat 57001 gtgatagtat taagaggtag ggcctttaag aagagattag gtcatgagag cccctcctct 57061 catgaatgaa attgaggttc ttatcaaaga ggcttcaggc agcattcagc tcatgtgccc 57121 tggcaccttc tgccatgtga ggacacacag cttttctccc ctgcagagga cccaagcaca 57181 aggcgccatc tgggaagcag agagtagccc tcactagaca agcaaacctg cccatgcctt 57241 aatcttggac ttcccagcct ccagaactgt gagagaatga atttttgttc tttataaatt 57301 acccattctc aggtattttg ttaaagcagc actaaatggg ctaagacatc caccaatgtt 57361 tacaaatggt ttgaatgaat tgatgaagac ccaagacatg gaccactcat caaatcataa 57421 agtttctgag gaataaaagc ttcagcactc atcttggcca gcaaattagt ggagaaacaa 57481 actagagcct tctgccctgg tatgccaact ataattgtct tcaggagaaa acatggaagt 57541 tgtagatacg tatacatttc ttctaaaacc ccagtcatcc ctattaattg taaaaacatc 57601 cacaggatcc tcctaagagt catcgtcact atagtgagtc agacatttgg agaagatccc 57661 aaaagctgga gcagcagccg gagccccgtc tctgggaccc cagccactgg ctctagaacg 57721 agccaccagg ctctgccctc agggtgggct cccacaacta cattgtcgtc ttgtctttca 57781 tacaactaaa gagcctttcc ccagatgagg atgccagctc ttactgctac ctccccctaa 57841 aagaattcac ccatcccatc tgaaatactc agtggccagt gagaacaggc caagatgtgt 57901 atacccttgg tcgatttttt gtttcctttt ttgtggctct ctgaaagaca gcctgactta 57961 tgcagagtaa taaagtcatt ttcaaggttc tttcatctac ctctaacaat ctctctaggg 58021 caaatagagt tccgttcatc attaacagac tcaagggaaa acaaagtgat caacactgga 58081 ctctatttcc actaccccca ccctccccag cttcaggaaa cagcttcagc atttgagtgg 58141 gcaaataaga aactattttt ctcatttaac actgtaggcc tagagccacg gtctacatgc 58201 tttagacatc attagcagta tttaaattgc caaggaagac atgggactaa aactagaaca 58261 gaccacggtt tctcaacctt ggcactattg aggctttggg ccagataatg ctttacagtg 58321 gggggctgtc ctttgccttg taggatgttg agcagcatct ctgacctcca cccattggat 58381 accagtagta ctcccgagtt gtgacaacca aaaatgtctc cagacattgt caaatgtcct 58441 ctgggaggag ggaagcaaaa tctccccatg gcttagaacc agtggaacaa gccagtgtaa 58501 ccactgtatt ctgagaatgg aatctgagct ggcccccacg aatgctttct atagcaaatc 58561 acattcaaga ggtccaattc tgaagacagt cctattaagc aaattctctt ttggcctaac 58621 cctccaaatc ctaaagttcc atggttatgg aatttgctat gcaatcaacc tctttagtcc 58681 ggtgacaatt aagttgtaag cgcagaaaac atgacatcta attttgtgct tatgaaacag 58741 gagtggccag catgaagagt taggcctcat tttacggtgt ggctgtggaa ttggttggtt 58801 gtttctttcg ggctattcct tagggttcaa acactcttcc ttggcacttc cttttgaaac 58861 caaactatgt tgacttagct ttggaagttt tctgtgacct cccacaatcc tctagacaat 58921 cactgaaatg cttggttgat atttaacaga tgtttttcta atgctacaag gaaagaaagg 58981 ttcacatttc accacttcat cttctgaatt attttcattc ttacaactac gaccttgtgg 59041 cagattttca tgcccaacag taacatcgaa aagaacagaa ttgcatctgt ggttcccatc 59101 acaagcaaca aagacagata cgactctgcg agtccaattt tgaccgtggt cagacgactt 59161 cattagaaaa taaaaattca aagactacaa cccacaggaa ccatgaaaac aaaaacatca 59221 ccatttgcca aaacccgttt gtctggatcg agcaagttta tactaacaaa ctaaagaatg 59281 agaaaaaatt tatcttgcag actgtaaaag ttcctctact taaaaatgga agtattttca 59341 tttttaatta cctttgattt gaattcttct ctacaaagaa atctacatta ttaaattgta 59401 gttggccttt taatagatga aagatatctt gagctgacct gttcctggtt agctgattat 59461 agaactgggt aatccattct tcagtaatcc tttccaaatg tcctgtgtga ctgcttgaag 59521 agctttccag ctctgcagtc ctataaaatg accttaatag ccagttcatg gttcactgga 59581 agtattgacc aaatttccac agtcaaactc ttggcagaga atgtcaccaa tcgcttaatt 59641 ccaggatagc agtgtgccct aaccaccacg tggcttcctc tatcccactc cagaggaaaa 59701 ccctccccac aacgtaaaga aggtcccatc catgataata cacctacact gacacgctgg 59761 tgatagtttt tctttcacag tctctcaaaa aaagaagagg tcattgcact aaagcacaca 59821 ttagtttaga tcattgcttt atttcactaa ttgttcaaca acaaagttca ttcctctcaa 59881 ggtggatctt gaatcattcc ttgtgttctc ttcctcgttc tgaataaaaa aggtaatgaa 59941 gaaaaagcct gtacttttgg agacctagaa tcttgattga tgctaataag cttttgacag 60001 caatttcgtg ttacggtata tcctgctggc atcttggtgg tcacacggag ctccccaccc 60061 tccagtgaag atggctgtcc acaactacca ccattgaaac caaaagtaag tgttgaaaaa 60121 tgcacctttt acaataaaaa agtagaaacc aattcaattt cctctttttt tttttacgaa 60181 tataaagttt cttgtaaata tgtacagtct tttgagctag ttctatatag cagaaagcag 60241 ttcacagatg agacacacaa tatacatttt cagggctcaa gaagcccatt tctcatggag 60301 atcctaaatg aaatgccaag actgaaagac ccattttcag tgacctttcc aaatactgtg 60361 gaccaagaga caaaacttca gcaaacattc aatcaaatct gccctgggga cgggagggga 60421 gggagtacga ccccacagac tccaagcaac acataaaacg ccacagcagg acatttgcca 60481 aaggagctac cacatggagg tttgtaggcg tttcaaacaa gaaagcactt aggttaagac 60541 acgagcagga aggagctggg atgcaggcca ggttgcaatc tggaaaggga tctgaactgt 60601 ggacacagtt gaaaacacgg tcatgttcac ctgctgcggt ggaaacgggt ctacctttga 60661 gcctacctag gaggtgcctg ggttgttgag gcgggcctaa gagtcgtgga ggacccaaag 60721 caacagatgc cttgtgggag attaaaggca gaaataaggc cgcgagttag cttgccaact 60781 gcaaagcctc taggttggtt ctgtaccagt tgccaggaat ttcgtactct aataggaggg 60841 tccgcagacc cagtgggagc cgcactgcac ctaaggactt cccaaccctg gcggagctgg 60901 aagagttgag ctgcggctgc tttctcaaac ttcctggtct taggtctagc cctagggaaa 60961 ggctcatttg gaaagtcaca gtttcgtggg gtagaaccgc gctggattgg gtgtgaacag 61021 agtcatcctc tgcaggcagg gaagtgtgaa gtttggaggg gacactgagt gggaggtggt 61081 ggtttgctga aatccttctc tggtcgtaag gcttgagcgt tttgtttttt gtttgttcag 61141 tgctacagac tgcagcttgt ggtggccagt tagtcggcaa gtcgtcagag ggtctcgatt 61201 tggcaggagc tatgggatgg tattaataca ttggcagagc aacccaaggg ggcagcacat 61261 gcagtgaact gccatgcaga actcccgacg ggcctcttcc ccatcccaga gtggggaaca 61321 acacgccgtc acagacaagg aagtgggtgc ccccgtcccc tccccgaccc cgagacccag 61381 gagtgctggg ctccgagcaa gtctattgca tgctttcctg gccaaagcta tatggaaagc 61441 gggaacagca ggctggggag atgacgctgg ggggtgggga aggaaagcgt ctcgaggtcc 61501 cttggcccca agtcactgcc tccgagctga ggccccgatg acctccatac attctcagct 61561 ggtgggaggg gaagtcgtca aggccataaa aggcaatgaa aacagtaaac atttggctac 61621 gatgtcaccc tggggaaagc agggccatct atgagaatga acaggaacaa ggatgaacct 61681 acttggctgc gggaggttgg ccaactgacc gggactgttc tacactatgc ctcggtggca 61741 aagacctgat gatgtggtcg agctaaaaat gactacaaca ggaaatacgt cgctattcaa 61801 atctattaag aattctgttg tcttaaaaaa aaaaaaaaaa aaaggaaaga aacaaagaga 61861 aaaaaaaaag aaaaaccaac caatcctagg atcttaaagt agctaattag gattctaacc 61921 atgttgtagt tagcatcccg gttggtttcc tttgatgaac taactggtac aggctagagc 61981 taggtacaaa agtttgtgaa tgctttgaaa gagtaacaaa gtgcaaagag atgactgcag 62041 ggaggtgccc agggcaggca ccgggcgctg acagctccga agagcctcag ccacctgccc 62101 ctcctggaga caggggtgtc cgtgccgagg tgggtctggg ccccgctgag ccagagggtg 62161 gctgagcacg tgggcgcttg ggtccccatc agcagagttc catagtgtgt ttggtgtttt 62221 cctgcagatc aagatgagga gttggttttt ctggctgaga tttatactga agactggtcc 62281 cagacctagg agtgaaataa ggaggagtgt tagccatccg aggccagaag ttgctgtgct 62341 caccttgcat gatgcaaatg ggaatctgca gcccctgggc tatccccttc agcagctagc 62401 agtgagtgct tggcagggca gcacccgctc cagagcctgg ggaagccagg gagtgaaact 62461 cactggaaaa tcccaaagcc gtgtgtgtac ctgcgatgct gtaagcatgt gatacacaca 62521 catgcacata cacagtttca tcggtcaatt gggagggata agaagccata tggatggtgt 62581 ttccttcttt ctttaccagc aggtgaactg caggtggcat cagaaggaga atgccaccca 62641 gaaaatcagc cccaaatggc agaaaggtgg ctccttgacc tgcttagcct gctaaggagg 62701 caccaagcac ttgagccatg ccctgcctgg gggagcccct ggtcttccac acccatccct 62761 ctcctcgatc gcaaacctaa aattacacag atcattcatg cacatgattt aaacatttcc 62821 aaaagttcaa agttcttctg gaaggcaacc actgtgaaca ctttctgtgt agccttccag 62881 aaaaacaatt tatatataca catgtgtggg ctgagcgtgg tgactcacgc ctgtaatccc 62941 agcattttgg gaggccaagg tgggaggatt gcttgagccc aggagttcaa gaccagcctg 63001 ggcaacatag cgacattatg tctctaaaaa aaataaaaag ggccgggggt gggggtggga 63061 atcagacaga tggtggcttg agcctgtagt cgcagctatt caggaggctg aggctggagg 63121 atcacttgag cccagaagtt caaggccgtg gtgagctatg atcacaccac tgcactccag 63181 cctgggcaac atagtaagac cctgtctcta aaacaaaaca aaacaaacaa acaaacaaac 63241 aaacaaaaag tgggggaaaa agttagccag acatggtagc tacagcccat attcccagct 63301 actcaggagg ctgaagtgag aggatcactt gagcccagga gtttgaggct gccgtgagct 63361 ataatcacgc cactgcactc cagcctgggg taacagagca agacgctgtc tctcaaaaga 63421 aaaaaaaaaa aagaaaagca tatgtgtgta tgtgtgtgaa tctatatcta tgtttgaaaa 63481 aagcatgcaa catttcatat aggatgctat catttgggtt tcaaaaggag agcacacata 63541 tatatttgct ggtacgcaca gaatatctct agaaactgct accacttgcc tctggagaag 63601 ggaaaacatt ttttggtagc tattatattt tgagttacat gcatacatat tttccttttc 63661 aattacacac acgttatcca tttgcttagg aagctctttt tttttttttt tttttttttt 63721 tttttaagac agagtctcgc tctgtcaccc aggctggagt gcagtggcgc gatctcggct 63781 cactgcaagc tccgccctcc gggttcacgc cattctcccg cctcagcctc ccaagtagct 63841 gggactacag gcaccagcca ccacacccgg ctaatttttt gtattttttt cgtagagacg 63901 gggtttcacc acattagcca ggatggtctc gatttcctga cctcgtgatc cacccgcctc 63961 ggcctcccaa agtgctggga ttgcaggcgt gagccaccac gcccggccag gaagctattt 64021 cttaaggtac attttcttcc taaattcctg tgtctcgcaa agtctgaatt acagtcatgg 64081 agttcctttt ttaaacaaca gtggccccgt gggggcctca gttccccaag tcactcctgg 64141 cctccgcaac agacacacag gcctcggaat gctgcctcac cttgttcacc tgggacagcg 64201 gggtcctcac ggctcccgca ggcagccggc ccctgctgct gtcttcaaac agcctcccgg 64261 gggaccgctc tctccgcgtg ctgagcatcc ggccggggga cttctctcgc tccagggggc 64321 ggccaggaga cttgtccctg cgcagctcgg tccgcccctc gcggtagcgg tggggtgtgc 64381 ttggctctcg cgggtggctg gggccttcgg gcggcgctgg gctggaggcc acgcgcttgg 64441 tgatgtgctc gttgtacgtg ggtgggcctc gcttgttggg gctgctggcg acacaagagg 64501 aacgtaggga gctgcgaggc cacaacccca gaggggcatt ttccttctta cctgagcaac 64561 cccctccttg acccagcctc accactgatt atcaagagat tagacctgga gttctttgga 64621 ctttgcctgc atttgatttt ggtagcaaag acagggaaga gagcaaagat ggcaacaaaa 64681 gacaacccac cttcctcagt gcaggctttc atttactgag cacctactgt gtgccaggcc 64741 tgtgctaagt tcttcatctg ccatctctca ttatctcatt aaatcatcac agcagctcct 64801 gagctatgaa caagcatgac cctcactgta caggcaagaa aatggggaca gacaggagtt 64861 aggtgacttg cctgagctaa tgagtggtag agctgggatt ccaataaaga ccagtttgat 64921 aggccaggcg cagtggcgca cacctgtaat cccagcactt tgggaggccg aggcaggtag 64981 atcacgaggt caggagatcg aggccagcct ggccaacatg gtgaaacccc gtctctacta 65041 aaaatacaaa aattagccgg gcgtgttggc gtgtgcctgt agtcccagct actcaggagg 65101 ctgaggcagg agaattgctt gaactgggga ggcggaggtt gcagtgagcc aagatcggac 65161 cactgcactc cagcctaagt gacagagcaa gactccgtct caaaaaaaaa aaaaaaaaaa 65221 aaagaccagt ttgatgtcat aagccccata taatatccta ctgacctact atttcccaga 65281 gcctagaagg cacagaagga agtcatcaac taggaggtca gcctactaag acccagaagt 65341 cagggaccca caaaggccct cagtgcaagg aagcagtttg gagaaaccct gtactgatca 65401 ctgttaactg ccatgtctgt cccactcctg ggcctcgtgt ttggtcacct ctgcccccat 65461 gtggctttcc aagagtctct gacacctgat gcaaatgcag atgctcagaa ggtggatggt 65521 tctcagtcat gacaataaga cgaccgagag gtgtagggat caggcctgat agagtcagga 65581 aactgagcag ggcagacaaa gtaaagcggg ctcagaaaac gtcccaggcc tcaaaggttg 65641 ataattagta accaagcagt tgaaaacact gcggcctttt tgtacaaccc aacccaatcc 65701 tccacccact cctggcctgg atatttcaaa tgaagatttc agcagaccca ggggccaaac 65761 tccctctctg aagatgtagt aagacctaca aacacagaga gaactgaaat gtggccttct 65821 gagtcaattc tttcccaccc ttgatcagct ctaaaagtac tagggtccct attatctcag 65881 cataatttgt gcttggaaaa ggccctactt aaaatggagg aaaaaaatta cagtgaatat 65941 tacttgaaaa catattttgt ttgattgagg ctttatcttg gaatataaac ttattttatg 66001 ctagatattt ttagtattta aattacaatg caattacatg ctgattccaa aaatgaacca 66061 agcatgaatc acagaatgat ttgttttgtg ctaattgtct caacaaaatt gctgctaaca 66121 tgctctgaca ccaacgttgg acacattgga cttttgcagt ggatcctaaa gaaggctctt 66181 caaaaatatt tgaaaaagcc gggcacagtg atgcatgcct gtagtcctag ctattaggaa 66241 ggctaaggaa ggagaattgc ttgagcccag gagttcaaga ccagcccggg caatatggca 66301 agaccccgct tctaaaaaac taaaactaaa aattaaaaca agaaaagtgt tttcccaaag 66361 gactatctta atccttgata tttaaatcct tctcattgga agaggctata attttatcaa 66421 cttgttcatt tctttcttca attattggtc taactgtatc agaacctctt ctgtcaacgt 66481 ccataattga gaagcgacag caaagacagt gtttattgtg gaaggagaga ctctagaaca 66541 ggcttgaggg ccaaatccgg aagatgcttg tttttataaa tacagtttta ttggaatgaa 66601 gccatgccca tctggttatg cattgtctta cagctgcttc cgcatgacca caacagggtt 66661 gagcagctga gacagagacc atatggctca caaagccgaa aatatttacc atctaaacct 66721 gtgcagaaaa agtttgccaa ttcctcctct acaatataac agagaagttg gtgtgcagag 66781 ataatttcat aagggctcag ttcatagtga atagtttcat gccatgctct tgctttccca 66841 tgcaactgta acgtttggag attcagacaa cgaatgctca gaaggccaga acccctgtct 66901 ctgcagcagt aagatttaag ataactcaga tttcaacatc ttgaaatcaa ggtttatatt 66961 catctctagc catatggctt tgaaatctgg aagacaaaca aaaaaataaa ttccgagtag 67021 gagaactaca ttaagcctac ttgtaatgga aaagaaatga taagatgtgt ccacaagaaa 67081 ctatcataaa acaaatgaga gggtgctcac tggccagggg aagaaggatt gctcaaggga 67141 aggccattca ggacgtcagc tgagattaac gggaggaagg aacacacccg tcattccaag 67201 ccaagtgggc agtcttccat ctccttatcc attaatgacc cataatctac cacatcaatt 67261 ttatggtttt ttggcaagga gttacgtttc ctgctctgct aaacagacca gggcaacgaa 67321 gcctgcagtg aatcagaccc agaaagccaa ccttctacat tctcaactgc ccaaagaagt 67381 cagcgaccaa aagatcattc tataagccaa gagataacca actactgatt ttccagacat 67441 gccacaggaa agcaaacact tttctgcccc tcgttctcag gaactggaac tgttttgatc 67501 gcacagtaag caactccttt gagagactat ataatctcaa aagcaaggtg cctggccacc 67561 tatccagtga ggcaacggag gaaaaaaatc taattacaat atttcaatgt gtttctggca 67621 tgacttggga actacttaaa aaagatctag acattttcag gggaggactg tggtgcactg 67681 agcagtcact cttgaagcag ggccaagcca agagttagga gcatttaatt attggcactt 67741 ctggtttttc agcaagttcc tcaggttaaa accacactaa ccaaacattt cagattaatt 67801 aactcctata catcaacaaa agagagatta ttcattagga cagtcttgcc caagatcttg 67861 gtattttgta atattagttg tggttcttcc accaaattat gggctcctgg aatataagaa 67921 catgctttct acatctctgt atccccagcc atagacatta tgtcgggcac atggtatgga 67981 gtccacataa ctatttagaa ggtagacaca ccttataata atccagaaga ttatcttagg 68041 tcatttattc ccggggttaa ataatctcag accttctttg caggccaagg tttcaaactt 68101 cctacacctc gagctggatc tagctcctag cactgtgtca agacgaggcc agtcccactt 68161 ccacacgcag ctggtgggat ggcaaatggg agaagccttc agggcagaaa actccacact 68221 gtccatcaga tttacaaatc cacatcagcc atcacccatt cacccacact tttctaggaa 68281 tttatcctac agatagattc ctcagcatgt aaaatgacgt aaggagaaag tcatttattg 68341 cagcaacaga ttggagacaa ctcgaacagc cattaatgaa gaagcagtta aaccaattaa 68401 ggtacagccc tgataaggaa ggttgtgcag gcatctgggg agaaatgaat gaataactac 68461 ttctcacact gacacggagc aatcgccaag acacatcaag gtacaggaga gtatgcagaa 68521 tatgctgcca cttgtattaa aaaggagggg tggttgcgtg tagtggctca cacctgtaat 68581 cccagcactt tggggggcca gggcaggcaa attgcttgag cccaggagtt caagaccagc 68641 ctgggcaatg tggtaaaacc ctgtctctac aaaaaataca aaaattagcc aggtgtggtg 68701 gcgggcgcct gtagtcccag ctactcgggg ggaggctgaa gtcgggggct ctcttgagcc 68761 caggaggcgg aggttgcagt gagccgagat cacgccactg tgctccagcc tgggcacaga 68821 gtgagaccct acctcgaaaa aaaataaata aataaaaata aaaaataaat aaacaaggag 68881 gggaagggca cttccagaat gccagactaa ggacctctca caacctgttt ctccataaaa 68941 gcaacaagat cactggcagt tgtcaaaatc aacttttcag agctctagaa attaacccta 69001 gccctttgcg cggccactct cgaggcaggg tcaaaccaaa agagtcagga gcacgtgggt 69061 atttggcacc tctgcttttc cagcaagttc ctgggatcaa aaccacactg aacaagcatg 69121 ttgcaaaaaa ggcctgcaac aattcaaaga gtgtttatgc aagaaaaaca aaggaatctc 69181 agcaagaacc gaatgttttg tgacattcaa cttgccccat acccactccc ctctccccag 69241 atctgtggaa gagttgaaac caatgtcctt ggaaccacag tagctgtgaa aaccagcagt 69301 ctagtagcca ctagagggag cagatcagga ttgaagctcc ccaaaaagcc ccatccccag 69361 aaaactgttg ctatctgacc tgcctagcga cgctccgtgg aaaagaccca ttctcagggc 69421 agtggatctg actgtacatt tggttgtaca gttgtccctt ggtggcctgg gggactgctt 69481 ccaggacttc ctgtgggtac caaaatctat ggatgttcac atctcttata ttaaatggtg 69541 taggaccaga cacagtgctt caagcctata aacctagcac tttgggaggc caaggcgggc 69601 agatcacttg agcccaggag tttgagacca gcctgggcaa catggtaaaa ctccatctct 69661 acaaaaaata caaaaattag ctgggtgtgg tggcacacac ctgtaatccc agctgctcat 69721 caggctgagg taggaggatc acctgagccc agaggtcgag gctgcagtga gctggtacca 69781 ctgcattcca gcttagatga cagagaccct gtcccaaaaa aaaagtaggg ggagtgtatt 69841 tgcatataac ctacacacat cctgttgtac actttaaatc atctctagat tacttataac 69901 acctaataca atgtaaatgc tctgtaaata gctgttatac tatattgttt agggaataat 69961 gacaatttgt aaaagcttga acatgtgcag tacaaacaca attatttttt caaatatttt 70021 caatccatgg tttattgaat ctacagaggc ataacctata gacagagagg gctggctgta 70081 ttggcataaa ggatccctaa acggaaacac aaaaacttca aaaaattggt tccttcccag 70141 gtggggacct ggggagaggg agtgaggaga gttaggaggc aagagagacc cagaagagag 70201 gctctgtact gtacaccttt cttatccttt gaattctgtg ccctgtggtt agattatcaa 70261 ttcaaaaaga agtaaaattt aaaaagaatg ccttttaaga agaggccttt ttttggaaca 70321 tgcttgttca atgtggtttt gatcccagga acttgctgga aaagcagagg tgccaaatac 70381 ccatgtgctc ctgactcttt tggtttgact ctgccttgac agagaccact cagagggcta 70441 gggttaattt ccagagctct gaaaagttga ttttgataat tgccagtggt cccgttgcct 70501 ttatggagaa acaggttatc agaggtcctt actctgaaat tctggaagtg cccttcccat 70561 ccttttttga ttttgagata gggtctcact ctgtgcccag gctggagtac agtggtgtga 70621 tctcagctca cggcaacctc cgcctcaagg gctcaagcag tccccgccac ctcagcctcc 70681 agagtagccg ggattacagg cacacaccac cacacctggc taatttttgt attttttgta 70741 gagatagggt tttaccacat tgcccaggct ggtcttgaac tcctgggctc aagtgatctg 70801 ccagccctca cctcccaaag tgctgggatt acaggtgtga gccaccgcac cacctcccct 70861 cctttttaat acaagtggca gcatatctgc atactctcct gtaccttgat gtgtcttggc 70921 gattgctcca tgtcagtgtg agttcttcat tcttttcccc agatgcctgc acaacattcc 70981 ttcccagggc tgtactttaa tttatttaac tgcttcttta tcaatggctg tttgagtcca 71041 atcctgttca ggaggtctcg cattccaggg aaggcacgag gctggccagg agtccctgac 71101 ctgctctcac gacctcacat ctctggacct ccaggcatgg cttgccttcc ctcctgcctg 71161 agcacccctc actggtctcc ctgcccccac ctcgtatctg atgggaaatc ctccctcacc 71221 tccacgtcca ggctttcaac caccccacca gggcctcatc cttccatgcc ttaatctctg 71281 attgccagca tctctcccac tgggctgtga gtgccatcag ggaggggcta catctgcttt 71341 attcaccatt ctccctcccc cacaagccca acaactagca ccgcagaata cctgctaaag 71401 caataaatgc ctaatttgtg ctccatttct caccaagcca atgaaaatac ccaggagaga 71461 aagatcagac ctctcccctc tgacttttgc aaggtggtat tttctaattc tcctgatgct 71521 cataatggtc tgtgttattt cagtgccgca gatcgcaaac aaatgaatgg tttgagctta 71581 agaacaaagt gaagattggg aaacattcta ctggtttagt atcacttcct tctgccttgc 71641 agaaaagcac gttgcccctg gtactgagga agaggagaag ctggttacct gcgggaggtg 71701 gacgggcccc ggtggtgttc agtgccggac tccttcacga ggtttccctt gcagcaaatg 71761 acccttaatt tatcctggta tgaggacgcc aagtaaatcg ctcctgagga aatggcaggg 71821 cccaggtagc gcgggttcgg gatgtccagg tacgctcggg caggggtcct gcagagtccc 71881 agagttccag ttaccttcat tgcaggctac ctccattgct aggcaagtta ggaaggaaca 71941 tgcggccggc cccaacacct accccaatag ctgaagtttt ccacgcatgg gaggacaatg 72001 cttaccctgc tgaggagcgt gcctggatct caattacttc gagtgagttg aagtgggtca 72061 caaacagata gggttctctg taggctgtgt gatcaccatg caggcacagt gtgggccaaa 72121 gaatggtgaa aagagggaaa atcaaaatct gactgcaatg cattctttga cccagcaatc 72181 ctgtgtaaag caactcaccc aacaaataca tatctatgcg ccagaacagt catgcatgtt 72241 cgaggatatt cactgccgtg tggtttgtta cagcagaaga tggaaaacaa ctcaagtgta 72301 cagtgtaggg attggctaaa tacattataa tacatctggc tgggcgtggt ggctcacgcc 72361 tgtaatccca gcactttggg aggccaaggc aggtggatca cctgaagtca ggagttcgag 72421 accagcctgg cctacatggt gaaaccccat ctctactaaa aatacaaaaa ttagccaggc 72481 atggtggtgt acacctgtaa tcccagctac tcaggaggct gaggcgggag aatcacttga 72541 acccaagagg cagaggttgc agtgagctga gatggcacca ctgcactccg acctgggcaa 72601 cagagggaga ctctgtctca aaaaaaaaaa aaaaaaatta taatacatct atggaattct 72661 gtgcagctgt aaacatgcat gaaaaggctc tctttgtaca gatatgaaag aatgcccaag 72721 gtacactgct cagtgcaaaa agcaaggttc agaatcatgt gggaagtatt ctgccttact 72781 gatttttaaa aagctggaaa aaagaatata tatattttat gtgctagtat ttgcataaag 72841 tactggaagg acaagaaact catatgagct tttttttctt tggaggaata gcgatggaga 72901 atttgggaga tggactctct gggtgagact tttcattgta taccttttaa aactgattgt 72961 aaacatcttc aaagaattca atttttaaaa agactaggtt cgggctgggc gcggtggctc 73021 atgcctgtaa tcccagcact ttgggaggcc gaggcgggtg gatcacctga gatcaggagt 73081 tcaagaccag cctggccaac atggtgaaac cctgtctcta ctaaacatag aaaaaattag 73141 ccgggtatgg tgacaggcgc ctgtaatccc agctacttgg gaggctgagg caggagaatc 73201 acttgaaccc gggaggtaga ggttgcagtg agctgagatt gtgccattgc cctacagcct 73261 gggcaacaag agtgaaaatc cgactcaaaa aaaaaaaaaa aaaaaagact gggttctatt 73321 ttaaaaaaaa gactgggttc acagctattt agacattgct ggtcttctta atactctatc 73381 tctgcatgac agtgtcactg aaactagagc acacaggact ggaaatattt tctgccttaa 73441 agggaaaaag cactattatt ccttgtggca agacttggta ttagactttt aaagtcatgg 73501 gctctggtat gcgtgccccg gtgcagttcc ctggccttga atctgggctg actctgcggt 73561 ctgctctaac cagtagagcg tggtggaagt gagcacgctt gccagcttcc gcttctatat 73621 tcttggaaga agccagcgtg gaaagcaaca aggcctccag ccaccagccg cagctaagcg 73681 cccacattat ggccagcact cattgcgtgc catgtttgtg agcctgtctt gggctttcta 73741 gttgtcccag tggccctgct gataccacat gaagcagaac tgtccagtca agccacggag 73801 ttggggccaa taataagctg ttgtttgaag tcactaagct ttcagatggc ttgttatgca 73861 gcaagagaga aacaaaatag cctcgaaggg tatctgggcc acgacggcat ttcagaacat 73921 tcacacagat acgtatgtac atacttgtat gtgtatgtgt gtttagtctt tgatccaaca 73981 gttccacttt taggagctta taatcagggg gtgctatcta acaagtgcaa aaagattagt 74041 acaaggatgt tcatcgtagt gttgtttaca cttacaaaca aaacgggcat ggccttactg 74101 ctttgcctaa ttagaactgg ttaagtgaac aggtgttcag ctatgccatg ggacactctg 74161 cagcctttgg aaatgactga aataaatcta caggaattgg gctcctttct atcatgagag 74221 cctagcaaat atttcccagg gctgggactg tgggctggtg aatctggcgt aggcagctct 74281 tcatcgagcc tcagaatctt tccatgaaca gttctccggg tgactgttac acatggcact 74341 tggtatctaa tggaattcac atgcacagtg cttaggccaa aaaagaaaaa aatttccact 74401 tgcaattgcg gttttttttg ttgttttttt gagacagagt ctcgctttgt cacccaggct 74461 ggagtgcagt ggtgtgatct cagctcactg caacctccgc ctcctgggtt caagcaattc 74521 tcctgcctca gcttcctgag tagctgggac tacaggcaca tgccaccatg cctggctaat 74581 ttttgtattt ttagtagaaa tggggtttcg ccatgttggt caggctgggc tcaaactcct 74641 gacctcatga tccacccgcc tcggcctccc aaagtgttgg gattacaggc gtgagccacc 74701 gcacccggat gcaattcttt tccaggacac cagctgctgg caaaagagaa gcccccacac 74761 tgagcctcac gtaccaaagg ccaaaggtaa gcgactccac ttgagatcgt ctgtgcggct 74821 acgtcttccg taagaatcca cgaacactcc aaattctgca aggtgtcaag agcacgtggg 74881 cattagcaca gccaagagca ggggcatcag cgacacacat ggtgatggaa gaatgaggag 74941 cagagcccag ggacccctgg ccagatggga ctgactgtga aaaccagagg ccctctgctg 75001 agccagctct cagctttatt aaaaatgagg gactttttaa atttaattta ttttaattta 75061 ttaaatttta ttaaagaaag agaactcaag tgaaaagaag gaaaggaata agaggaggag 75121 agggaaagag aaagagacag agagagaggg gaatgaagcg atccggagga aggaggacag 75181 acagggaaag acggaaagac tcaggaaaga agggaagggg gaggggaggg aggggatggg 75241 gagggggagg ggagggaggg gatggggagg gggaggggag gacccaagaa agaaagaaaa 75301 gaaagaatgc aacaaagaaa aggacagata gaaggatgga aagggagggg aagggagggg 75361 aggacgaaat gaggacagag taagagaaag gaaagggaga gaaagaaagg caagggatgg 75421 atgaggaaaa aactgcctgc tctttggaga gaatgtgata aagttagacc acgctagtat 75481 ctggtgatgg agaggagcat ccccaagggg acttggtggt gctggaagga cattctctgt 75541 acccctagtc aaacctatcc tgccccagag ctccatgtac cctcctccaa cccctcatgg 75601 gtccctagtg tccatgagga cccaaaaggg cagtgggcgc agccacgact caccgtggaa 75661 acacagcaag tactcctctc gctgccctgc gctgttcacc tgcacgattg agacagggaa 75721 gctgttggaa gaggcggcaa acacagcagg tgccaaggaa tggtcattct tatccaggaa 75781 ttctgtaaag gcaggcgaga agcggggctt caggagtgag ttctgagtct cagcgcggcc 75841 tgagccatgg gtgggtgcga aagtggaggt gggtcctacc ctcgagcgtg tactgcttca 75901 tgtcgatttc gtagaattta ttggttccaa tgaggatact gtaattggtg aagtggatac 75961 agctgcaggg ctctgaggtc tctatctcct gagacatgag agcagaagag aaggatgtga 76021 gaggtaacac agggcagcca aggtccacag ctgttctgct tctgcctaca gcatctagcc 76081 aagccctgga gaacattgtc accaagcttg ttcacgttgt catagagagc cattcatctt 76141 tttttttttt tggagacaag atctcgctct gtcacctagg ttagagtgca gtggtgtgat 76201 catagctcac tgcagcctca aactgctggg ctcaagcaat cctcccacct cagcctccca 76261 aagcattaga attacaagct tgagccactg cacacgcccc attcaccttt catgcctctc 76321 aggatgaaca gtgatccatt tgttttctct tcctattgtt ccatgcaacc caaaactaag 76381 aaaacaaggt caaactgcag ccaaaaaaag gtggggtcag aaacaccatt ccaggctggg 76441 cgcagtagct tacatctgta atcccagcac tttgggaggc tgaggcaggt ggatcacctg 76501 aggtcaggag ttcgagacca gtgtagccaa catggtgaaa ccccgtctct actaaaaata 76561 caaaaaatta gccaggtgtg gtggtgtgca cctgtgatcc cagttactca ggaggctgag 76621 gcaggagaat cgctcaaacc cgggaggcag aggttgcagt gggctgagat agtgccattg 76681 cactacagcc tgggcgacaa gagcaaaact ctgtctcaaa aaaaaaaaga aagaaaagaa 76741 agaaacagca ttccaatgaa gagacagatc ttgaacaata caacattctt atcacaggaa 76801 aaaaaaatac aggaaaaata caacaactat taacagtggt ggtcagcagc aatggttttg 76861 gtttccttct ttttgctagc ttctaagttt ctagtttgaa cctatattac ctatgcagtc 76921 atcatcatga tcatcattat tgtcatctta attacctgct tcctgattat ttttagttgt 76981 ttttaggcca atcctaatgt catcaaaaaa ccaggcacac ctattcaaaa tgtgtggccg 77041 ggaatcataa agaagcaatg atttctgaga attttcctga agctaacagt tcaaattcta 77101 gtgagcatta tcatcatcac tgaatatttc ttggtcacct actaagtatg ggaaaacttt 77161 gaggaagcac gatatagtga aatgctagga aaaaaaatca aacttcagtt ctggcctcaa 77221 ctttgtaact gaggattttc tgccagcctc agttgtctta cctgaaaatt gggaataact 77281 gtctctgctt ttcagaactg tggtgaaagg tgcttgtaaa acagccagtt cagtccctta 77341 cagggtgggc acccaatgaa cacaccaact aaactaacac caccctgccc agaatggaac 77401 agtttctcta acaggtaatg acccttatat attattcacc cttcatttca cttttttttt 77461 tttttttttt tttgagatgg agtcttgctc tgttgcccag gctggaatgc ggtagtgcaa 77521 tctcggttca ctgcaacttc cacctcccac gtttatgcaa ttctcctgcc tctgcctccc 77581 acgtagctga gataacaagc acccgccacc atgcccggct aatttttgta tttttagtag 77641 agacagggtt tcaccatgtt ggccaggctg gtctcaaact cctgacctca agtgatccac 77701 cctcctcggc ctcccaaagt gctgggatta taggcctgag ccaacatgcc cagcctcatt 77761 tcatttttga gactaatcat gaaagcacca gaaatatatc aggcttagga ggagtgggac 77821 actccgaggt agggacatca gacagcattc tccagaagcc aggcagattt aaggatctaa 77881 cacacatctc gtccatattg cagtcatcat cacagctcac tctgctccga aaagaaggag 77941 ctccctaatt acatatcaaa atactgctag cactggttat tgcagaaggt tatattaata 78001 gacacctcta gaatttgaga aagattctga taagtgctac ctgggactgc taggatctta 78061 gaaagagccc cccctttttt ttttccactg gtttttccta tcagtcaact ttctgttcta 78121 acaaaaggct gaactcactg ggctggatgg ggaggcagtt cgcaaatgtg agaagaaaag 78181 caggagggaa accgcagcta ctagtggccc tggggaatcc tccacaaagg aaggatctcg 78241 ctgacgacag aggtcacccc aggctaaagg cccgcagggt agaaggaacc cagtacaggc 78301 tgtgtcccag gacagtgcac tttccacact ggctcgctga gagagcagga ctggctttcc 78361 cacgttcaac caaggcaagg cccaggactt actttccgga tgcagtattt gctgaggttt 78421 tcgttgtagc ggagaatgac gactttgctg ggcatggctg cacagatgca gagcccgttc 78481 tcaatctgaa aagcaaccaa gaaaagaaaa agactgtcac cagtgtgata ccggctgtgg 78541 ggcaaaacta ggaaaagctc tcaggctcag cgccattgat gattcagacc agatcattct 78601 gtgtggtggg gggagggctg tcctgggcat cgcaggatat gagcagtatc cctggccttg 78661 acccactaga tgcctgcagc acttcaatgt gtgacaatca aaaatgtctc cagccattac 78721 caaatgtccc ctggtaaggc agaattaccc ctggtcgaga accaaggctt tacctgggat 78781 ccagagaagg cctcagggag gcacctgaca ccacacacct ttcccacaca tgtgccttct 78841 gctggagaca gaatcttttg acaaacttcc cttgtgagga gggagaagac agaatgaggg 78901 aagggagagg aaaggtggta gagactgcta aacgaattta gagaaaattt cttttcttct 78961 gagacggagt ctcactctgt cgcccaggct ggagtgcagt ggcatgatct cggctcactg 79021 cagcctccac ctcccggctt caagccattc tcccacctca gcctcccaag tagctgggac 79081 cacaggtgcg caccaccatg cctggccaat ttttgtattt taagtacaga tggggttttg 79141 ccatgttggc caggctggtc tcgaactcct ggcctcaagt gatccgcctg cctcggcctc 79201 ccaaagtgct gggattacaa gtgtgaacca ccacacccgg cccacattta gagaaaattt 79261 caatccaaaa ggaaaagttc ttagctaaga actcatacac gtgcagagtt ctacatcaaa 79321 acgctctttt tctagaacag ctgaaaccct tttaacgtct cctgctacaa ccagatgttt 79381 atcattcacc aaatgaggct tactgctcaa attagtcctg gcacgaaggt cagcagctgg 79441 tacagtggga gaatgaagat gggccgggat aagggaggaa cacacaagaa aagggaaatt 79501 gttagcaatg ttctatttct tggactgggt agtgagtgtg tggatgctta cgacattatt 79561 aggctttatg atttacatat atgttccata cattttttga acagttcaga tattagaatg 79621 aaaataattt ctaatttaaa aaactaattt ttattagctg ggcatggtgg gcatgcctgt 79681 aatcccagtt actcaggagg ctgaggcaga agaatcactt gaacctggga ggcagaggct 79741 gcagtgagcc aagatcacac cactgcattc cagcctgggt gacagagtga gactctgtct 79801 caaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaattttggc caggtgcagt ggctcatgcc 79861 tgtaatccca gcactttggg aggccaaggt gggtggatca cttgaggcca ggagttcgag 79921 actagcctgg ccaacatggt caaaccctgt ctctactaaa aatacaaaaa ttggcctgct 79981 gtgatggtgc gtgcctgtag tcccagctac ttgggaggct gaagagggag aatcgcttaa 80041 agccgggagg cagaggctac agtgagccaa gatcgtgccg ctgtgctcca gcctgggtga 80101 cagagcaaga ctctgtgtca aaaaaaaaaa aaaaaaatgt tgattgagcg gctgaatgaa 80161 caaatgtgca gtgtgctctc ttctgagggt caaaatgcga tggttttcaa cagcagaacg 80221 ccagtagcct cagggggtct ataaacacct gattcagact taattaaagt tatacatgta 80281 catgtatata tacccgcaaa cactcttttt tattattttc tactttaact tttaagttca 80341 caggcacatg tgcaggtttg ttgtatcaat aaactcatgt catgggggtt tgttgtacag 80401 gttatttcat cacctgcgta ttaagcctag tatccattag ttatttttcc cgatcccctc 80461 cctcctccca ccctccaccc tctgacgggc cccagtgtgt gtgttcccct ctatgtgtcc 80521 atgtgttctc atcattcagc tcccacttat aagtgaaaac atgtggtatt tgtttttctg 80581 ttcctgcatt agtttgctta ggataatggc ctacagctcc atccatgttc ctgcaaagga 80641 catgatctca ttctttttta gagctgcata atattccatg gtgtatatgt accacatttt 80701 ctttatccag tctaccattg atgggcattt aggtggattc catgtctttg ctattatgag 80761 taggctgcaa tgaacataca catgcatgtg actttataat agaatgattt atattccttt 80821 gggtatatac ccagtaatgg gattgctggg ttgaatggta tttccatttt taggtctttg 80881 aggatcccca cactgtcttc cacaatggtt gaactaattt atactcccac caacagtgta 80941 taagtgttcc tttttctcca cgacctcacc agcatctgtt attttttgac tttttgacta 81001 aagaagcaaa caaaggtatt ttatgtgcca cttttcaata tgccagaagc tacaaatgaa 81061 ttaacttgag ttctcatgaa ctttgttatg taaatgtcat aaattcgaat ttttaaattt 81121 ctagtaatga aacatacttt catctagttc agacatggag atccacatta gatttttttt 81181 aaggttcatc tctctctcac agacacagca gacatacaca cgcacatgca aacacactta 81241 tttttaaatt atggatcatc cttctagttt tgttaatttt attcccattt tgctctcaga 81301 catttatttt agattcaaaa cctgtacaaa tttcatggct acacaaatta tttcacgtgc 81361 ctaaaggagg caacagaaat ttaacctagt ttgaggatgt acaattgtga aagttgtcag 81421 aatcaaaatg gagctgccaa tgttaacaaa acgctgacat tagagcagga ggaggccatg 81481 aagagaggat tctcacactt gtgtgcctga taaaaaagag actcgacgaa aaccacaaag 81541 gccattgcaa tctgtttttt ttttccctgt caccaggctg gagtgcggtg gcgcaatctc 81601 ggctcactgc aacctccgcc tcctgggttc aagtcatttt cctccctcag cctcccgagt 81661 agctgggatt acaggcgccc gccaccgtgc ccagctaatt tttgtatttt tagtagagac 81721 agggtttcac catgttggcc aggatggtct tgatctcttg acctcgtgat ccacctgcct 81781 cggcctccca aagtgctggg attacaggca tgagccaccg cgcccggcca ggccattgca 81841 atcttacaca aaaaatactt ctgcaaggac atccgcccag tgactatctg tccaacctta 81901 gacaagcgtc agtcttgtta ttaatctttg aagccaagga taattatttc aaaaagacta 81961 tggaatcctc acttacaaac ctctgtcttc ctttatctcc ctagataccc acatagttta 82021 caatggcacg tgtattccca ttgcaaagcc ctgttcccaa atcaatctct tttctttcaa 82081 gagagcccct ctctgtttgt tatttaagtt gacactatct tgaaggtcaa atcaacgagc 82141 tctgtgggaa gagaggtagt gtcaatttcc agaacattcc accagctagg actctgaggg 82201 gagcttacct tgcctgcccc aaacaagtgg cagcccttga cagcttcaaa aatgttgggt 82261 gagatgtcgg gctgggcagg caggtgggac tgggccaggg actgtttcac tttcttcacg 82321 tccacaagac acagtgcccg ctcttctcct gaacaggaaa aggaacaacc tctcgtcagt 82381 gtgaagccgt taagtaaagg tacgggatgg tggttctagt tctagtcaca agtaaaagtc 82441 acaagaggcc acacagatga tctgagtttt ccacaaatcc atataaacac attttaggtc 82501 catgatttat ttattttttt tttttaaatt tttgagacag agtcttgctc tgttgcccac 82561 actggagtgc aatggtacaa tctcggctca ctaaaacctc tgcctcctgg gttcaagcaa 82621 ttctcctgcc tcagcctcct gagtagctgg gattacaggt gcatgctacc acgcctggct 82681 aatttttttt tgtattttta gtagagatgg ggtttcgcca tgttggtcag gctggtctca 82741 aactcctgac cttgtgatct gcccgcctca gcctcccaaa gtgctgagat tacaggcgtg 82801 agccaccacg cccggcccaa gtccatgatt taaatctaag acaataacat ttggcctttt 82861 agtcaagatt ctgaggaggg tggaattctg cagacataaa attggagaat gtgaactgta 82921 ctttcaaaca tgctccccag caaagtaaag aacgaatgtg tgagggagga atgaggaggc 82981 attctatgaa aaaacagact tggactatcc aaatatgtgg atgtcaggag agacaaaagg 83041 ctgaggaacc gttcagatta aaggagacta gagagacatg attaaatgca atgcaaattc 83101 tgggacatta tagaagccac aggtaaaatc caattatggg cagcatagcg gatacttgta 83161 ttatagtaag ggtaaatttc ctaaatgcag aaattatagt atgtgaaact attctatatg 83221 atactctatg tgtggataca tgtcattata tgtttttcaa aacccataga atgtatacca 83281 ccaagaaaga actctataat gtaaactgta gacttggggt gacaatgatg tgtcactgtc 83341 aatgtgtcaa tgtaggttca ctgactctaa cgaacgtatc actctggtgg gggatgttgg 83401 tagtggggga ggctatgcat atatgagaac gggggtatat gggaactctc tgtacctttc 83461 aattttgctg tgaatctaaa actgctctaa agaataaaat aaggaagaga ggaagagacg 83521 gaggaaaaag agctgagatt aagaaaggcc agttcttgtc catagtatgt cagtccccgg 83581 atccagccat gcctgaaaca agacctaact gttgttgttt tgaacactga atcgatatat 83641 tactcttttc ccctttaaaa ataatgtcac taacacagac agagagaaat aaactgtacc 83701 ctggttatgt gacagaatgt cttcaatcct taggttgata tactgaagaa cttggcgtca 83761 tacctgcaac taactctcaa atggttcagg aaagagagag agagagagag agagagagtg 83821 tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgttaa gaagacagag 83881 aaaaaaatta tgacaaaata ttagcaaatg gtgacttgag agagatcaca gagttaacag 83941 agcaagcaag ctctcagaag gtcagggccc acggtggttt cgttgcaatc aggacaaagg 84001 gtttggcttt agatagtctt cgaggcttag cactcttcac ttggcctaac agtgggatgt 84061 tctttgggaa tgtcagactg tgctaaactg gatactggaa gggaactgaa tctggacaga 84121 gggggtccat tagctgaggc ttgggcatct atggggacac agagataaga gctacaacgg 84181 ctcctctcta ctattttgtg ttttacgagc atgaaacgtg gcttcaacat attggctccc 84241 ttgaaagtaa agaaaaaaca ccatgtcctt ggcctcacac ctgctatcat gagtagcttc 84301 tccaggtcct tgataatata aatttggaag actgctccaa ttcctgggac atgggttagg 84361 gagtttttca agacattcag ggcgtagagc ccttcctcgg tgcccaccaa caccacctgc 84421 aagggcagaa gtccgaaggc agaacataag cacggtcacg tcaccagaac aatctctagt 84481 aagagcaaca aggcctccac agccctttct ttcttgatgg ggtgtggctg taaccagaca 84541 ccagctggcc gtgcccatgc aagcattacc tggtcactga agggcagcgt gcagttcatg 84601 tctagacggt catcaccttc cagtttcagc agggagtttc caagcagttt cttttcatag 84661 gtgaaagaaa agaaaaacag aagacatcgt gaggctgatc tgtttatgac caaaccatcc 84721 agagaaacca agagaaggtt aaggccaagt tattcctgga aatgctcaga aacgcacata 84781 tgctcttagc tcagcccgta ttttgatctg agctccaaat gcaaaatgcg agtgctattg 84841 gtatctctgg ggcagctgtt aattagcatc tgagttataa tttggctggg atgcagaaat 84901 aagggagggt agtagacatg gaaagctatt ctgtaataag aaataagctg aagctaagga 84961 gatttcacct gcgcagggtt aataagacac atgaaagact ccttccccca taaggctgca 85021 gcagaagtgc aaaggcgccg tgttagcagc gagccagcac gcctctgcat ctctttacct 85081 gaacccaggc aggcagaagc tcgtaagaaa cacagtcgcg ggcctattgt acacatatta 85141 acaaggattt cagtaaaagc tcacgtgtga aacgaccaac aagtcatcca aacaggtcta 85201 accactgagt aaagcccttt tcaggaggct tgtggttaag aatctgccaa ttgctaacac 85261 ataaagtgtt tattaggagt gaagagtctg aaaagagctg tttggtcttc gctggagcaa 85321 acgtcaatga gcaatgagca ctgtttgtat cctggaacat tttaggtggg gtgatgccaa 85381 tcacagcaca cctacgtttc taacagctta aggataagta ggactccaaa cctgcctacg 85441 gaagagcaaa tactgcaaaa acaactgaga tgcttatgaa aagccttcct ggaggggaca 85501 cagctgccgg cagcctattg tgctgcttcg tggccaaagt gcttctaact caagccttca 85561 aaccagacaa actcatttga gtaaacagga gagaggattt ccttataagt acatttgctg 85621 ctgcatcacc acaaccaagg tctacaatgc caccagagta ggctactgat gccgattccc 85681 ccattctttt tcctttgctt ttacagtcca tgcctcgctc tgttgcccag gctggagtac 85741 agtggcacaa tcgtagctca ctgcagcctc aaactcctgg gcaatccgcc ggcctcagcc 85801 tccctactag ctggaactac aggtgcatgt caccacacct ggctaacttt ctttctttct 85861 ctttttttaa gagatgggat cttgctatct tgcccagtct ggtttcaaac tcctgggctc 85921 aagtgatcct cctgccttgg cctcctgggt cactgggacc acaggcatga gccaccgcat 85981 ccctgattcc cacattctaa atggatcatg atgccacaca agctaccttg gtgcttgtgc 86041 cttcatcacc accagactcc tggccatgac acagtctaga gccatcagcc ctcagagaga 86101 gagcctgagg ggaatcaaaa tggccaatgg gattctcgtc gttaacacca gttaccaagt 86161 cagccccctt tacaaagaca acctccaggg cccgctttca tactcacagc atcagcttct 86221 gctttttccc tagaaactct cccacctgcg acaactgatt ctaaggcggt gacccagcgc 86281 tgtttgtcag ggaagctggg agctagcaag tagagggttc tcccgggcca gcaggtggtg 86341 tgcgggtgag attccatctt cagtatgtat gggacatcta ggagatttca gagagcacag 86401 gattgggtgt ggatttcccc cgttcccact gggagggacg ggcctgagag atcaaagatg 86461 cccaccaaac cacgcaaatc ccagttaccg ccaaggcggg gagcggagga agatgggcct 86521 cctttgcaga gccaatcttc cctgtaccac cccttctgtc cctgctgatt ggccaagccc 86581 ggcccacctc cagggcgggg ctcctccggc tcctcctcac ctgctttggc tgtatttgcg 86641 agttcggaag caccaacggc accatgaata gatacatccc cgtcgggaag gcacagctca 86701 aattcttcca ccggcctctg tccagcttgg tgcaaagagg aagggcagaa agaaaaacaa 86761 aagaacagga acaagaacaa ggggagaaga gagagcgaga gagacagcaa gggagagaga 86821 gacagggtac gtgtgtaaag agaggcgcac gagaacaagg aagggacaga ggtgtgcaga 86881 gaaggcagag agggaagata aaaaaaaata aagtgtgtca gatgagtagc acagtcaaat 86941 ttctgatgac gatgcggatt tatgggttag ttgactgagg cagaagttct cggaaggtgt 87001 attagcaggt ggaatcttca gggcagcgcc ctgcgggttc ttcagggggg agacctatag 87061 atgtgagtat cgcaccaata acctttagag aagtcatttg gtttgtaagt ttcaaaaatg 87121 gaaaagcgta gaaggcttgc aaggcagtcc aaaaaacaaa gccttcgcgc cagcactggg 87181 gagacctggg ttagccacgt gcaatgacct tccccctgaa ttattaattt accttctctg 87241 gcttcattgt cataaatgag gacttttgat ccctccagga caatgtactt cctgtcccag 87301 ccttgctgtc ctcgtttgtt attcctgggg aaagaaagat ggaaaagaaa aatgtctgaa 87361 ctcagaagaa acatccggct ggaggatagg atgagggggt ggcagagttc ctagaaaaga 87421 cacttcccta gaaaacatca gggacagagt ggcttgtgcc agcacctgct ccagcccaag 87481 ccaccctgac atggtacctg ggcaccttca tccacccttc caggtgcaag ctgctgctgg 87541 gctccttggt ctggagacct ggggagttca ttttgtcacg gcagaaggcc tcggtgaagt 87601 gtgtggcata ttcagcaggc aagccgcagg tggctggcaa gcacgtggag cacttggggt 87661 gacacatcac ctgacattct agggaagaac agtgagcaac ctgagggcat gctcagctga 87721 ccctggaccc ttcccttcct ctgccagcca acgcctgggc ttctctaccc cagccgggcc 87781 cagagagtct ggccctggct tgcaggtagt ctcctgagct tcccctggtg ccaggcccct 87841 gcagaaaggg aaaacaaaag tgccaagtgc tcttgaccca gttttccagg cttcaatcca 87901 gaccgcccac aaacaggtgt gtaaagaaca cgtttgcatg tttcagttgg agtcagcgga 87961 aaggtaggga gagaccagcc tgacaaaaag gggctggaaa ttagcaaagc ctaaggcact 88021 ctgagcagca ggctggagtc atgagctgca gacatatata aactcacagc ttcaacaggg 88081 gaaaaatagc atcctcctac tgaagtaaga agctcgaaaa tgaatgcact atatgatgag 88141 ttagaaaaaa atcacctcca cctgatagac tttttaagct ggagatgcac aggtgcccag 88201 cacagatgca caactactga tcttaccgag acatttggat gcctggcgtc caaagtgcac 88261 ggtatccaga cacacagcac actttgtggc tcgcatgttc agtcctacgt tgaatcggtg 88321 aggaatattg tggtgcatgc gttccttaag acgccgacta aattctggag gaaaatgttt 88381 taaaaaatca ttagagtact gcaaatgatc ccatcaggaa agtgaaaaga caacccacag 88441 aaagggaaaa aatacttgca aatcaaatct ctgataaggg actcatatct agaatacata 88501 aagaactctt acaactcaat aataaaagga caacccaatt ttaaaatggg ccaagaattt 88561 aaatagacac ctccccgaag aggataaatg gccaacgagc aaatgaaaag atgctcaaca 88621 ttatcagtcg ttagagagat gtgaatcaaa accccaagca gacaccactt cacaaccact 88681 aggatgtcta cagtctaaaa aaaggaagat aagtgttgat gaggatgtgg agaaactgga 88741 accctcacac attcctggtg agaatgtaaa aatggtgcag ccactttgga actcagtctg 88801 gtagttctgt aaaaggttaa acatagttac cccagaaccc agcaattccg ctcctggcta 88861 tatatccaag gggactgaaa acatatgtac ccacaaaaat ctgtacataa atgttcacag 88921 cagcataatt tatgacaact acaaagtgaa aacaacacaa atgtccacca atggtgatat 88981 ggtttggctg tgtccccacc caaatctcat cttgaattgc agctgccata attcccatgt 89041 gttgggaggg acttggtggg agataactga atcatgcggg tggtttcccc catactgttc 89101 ttgtggtagt gaataagatc tgatagtttt ataaggggaa acccctctcg cttggttctt 89161 attctctttt gtcaccgcca catgagacat gccttttgcc ttccaccatg attgtgaggc 89221 ctccccagcc acatggaact gtgagtctat tacacctctt tttctttata aattacccag 89281 tcttgggcat gtctttatca gcagtgtgaa aatggactaa tgaaaatgga ttaatggata 89341 atcagatgtg ccatctacac aatggagtat tattcagcca tgaaaggaat gaagttctga 89401 tacatgctac aacatgggtg aaccttgaaa acattacatt ccatgaaaga agccagacac 89461 aaaggccgca tgcagtatga ttcatcttta tgaaatgtct aagagagcag atcgacagag 89521 acagaaagta gattggtggt tgctaagaag aacattagtg gaaggagaat ggcgggggac 89581 tgataatgtg tatggagtat tttttgggtg ggggggtgtc aaaattgttc aaaaattaga 89641 atgtgatgag ggttacacaa ccctgtaaac ttactgaaaa agactgaatt gtacattttt 89701 aatgagcgaa ctgtacggta tgcaaattac atctgaatga aactgttaaa accattagaa 89761 caggttagtt acagattttg aaagtattaa cactaacaga tgcatctaga tttataagac 89821 tcaacttaaa atctccatta tccagtgatt aggaatcagg ctttaagaac atgttactgg 89881 taggagattg actgactata cggtaggctc tccttacaac atcacccccc ttcctttctt 89941 ggaaaggtgc aaggaataaa taacaggaat gcactttgaa actgtgtggt gcagtataca 90001 caaaggagca atatactgta gtactatcac tattcaggga catttcagtg gacaccacag 90061 ccagaagcaa tgctttcctc tatcttgtca tcaccatttt aatgagttta aaagtaaagc 90121 tcttgggctg ggcacagtgg ctcaaacctg taatgccagc actctgggag gccaaggcaa 90181 gtggatcgct tgaggtcagg agttcaagac cagcctggcc aacatggtga aaccctatct 90241 ctactaaaaa taaaaatatt agccaggcat ggtggcgcat gcctataatc ccagctactc 90301 aggaagctga ggcatgagaa tcgcttgagc ctgggaggca gaggttgcag tgagccaaga 90361 tcacaccact gctctccaag cctgggcgac agagcgagac tctgtctcaa aaaaaaaaaa 90421 aaaaaaaaaa aaaaaaaaaa aaaagtaaag ctctttgatt caaggagtta tataaacaca 90481 ggaggaagaa aatgacttca ggctaaagaa aaacaacagc aacaagaaaa caccatctgg 90541 tttacaacca aaacaggttt gattgcaatg ccatgattgc taaaccacag ctctacccta 90601 gtgttcctaa gacattccaa acagcttcct tggccttttc ttaagtcctt ttaatctcta 90661 ggtaaatgga gtacatttga gaccagcaag aaaactcctc acactgttgg tagaaataaa 90721 cattggcaca aactaccagc actcaaaatg tacatgacct ttaacccagt gattcatttc 90781 ccaggaattt ctccaggcac aacacccaca gaggcacagg atgtcctctg caaggtgttt 90841 acagtaacag gggattggaa acaacctaaa tggccatcaa caggagcctg gcttaacagg 90901 tgaattgcat gctcacacca cggaatactg tgtggttctc tgaaagaatg gggtaggtcc 90961 ttatgtacta atgtgaaacc tctaagatag cactttaatg aaacgaaggt gtcttgcagt 91021 gtatactgta gtgtgctatg atatttataa gagaaaaaag gcctgaaaga taagcttacg 91081 ttttcttgtt tttgtttttg agacagagtt tcagtcttgt tgcccagact ggaatgcaat 91141 ggctttatct tggctcactg caacctctgc ctcccaggtt caaacagttc tcctgcctca 91201 gcctcccagg taactgggat tacaggcatg caccaccaca cccggctaat tttgtatttt 91261 tagtagagac agggtttctc catgttggtc aggctagtct cgaactccca acctcaggtg 91321 atccgcccac ctcagcctcc caaagtgctg ggattacagg cgtgagccac cactataata 91381 tttataaaag aaaaaaggcc tgagagagag atgtgcttat tttttcatat tcttttttct 91441 tttcttttct tttttttttt ttttttttga gatggaatct cactttatca cccaggctgg 91501 agtgcagtgg ggtgatctcg gctcactgca acctccacct cctgggttca aatgattcta 91561 ctgccttagc ctcccaagta gctgggactt caggagaatg ctaccacacc cagctaattt 91621 ttgtattttt agtagaaacg aggtttcacc atgttggcca ggctggtctc aaactcctga 91681 cctcaggtga tccgcctgcc tcagcatccc aaagtgctgg gattactggc gtgagccacc 91741 atgcccagcc tattttctca tattctttaa atataatcac acaatgtaag atacatgaat 91801 ggtatagaca tgtggtcaca tatatgaacc aacactgagg atggggagcc aggagactga 91861 cttctttctt tttttttttt ttttttttga gatggagttt ccctcttgtt gcccaggctg 91921 gagtgcaatg gcgcaatctc ggctaccgca acctccgcct cccggattca agcaattctc 91981 ctgcctcagc ctcccgagta gctaggatta caggcataca ctaccatgcc cagctaattt 92041 tgcattttta gtagagacgg ggtttctcca tgttggtcag gctggtcttg agctcccaac 92101 ttcaggtgat ccgcccacct cggcctccca aagtgctggg gttacaggtg tgagccaccg 92161 tgcctggcca agactgactt tttaatcttt aacccctagt attacttgta atttttacca 92221 tatgcccaca tgtcttagtc gactaaaaga aatttccata caactccaac gtaccctctg 92281 gagttgaaga ctcctttctg cggctggatg gcggggccag caggctcatg gcactgggct 92341 ggtgctctgg cgaccgcacg atggcggaca tggcgatctg ctgcctcgcg gtggctggcg 92401 tggatgggtg tgggtggtcc gttgctttgc ggtgggcagc tgcagagaga ccaggacaat 92461 gccttttggt tagctgggtc gcctagagga ccaaactaag ccgaatgtcc ctgggctatc 92521 tttcaaggac ccaagaccaa aagaatatgc gtcacatcaa cttggcaatg cacaggggcc 92581 atacgttttt cagacatggg atgtctggtc tgaaagcgta tgggccataa acgaagacac 92641 tgagctagtt agtcttggct gatctcactg agatcaatcc tctgcctttt ccacatcctt 92701 gagcattttg gagagagtga gcccctacct tcctcccggg cggaccggag ctcgatgcgg 92761 gtcttctgaa gggcttcctc tagctctgca cagcgagctt tctccttctc cagggccagc 92821 ttcagctcat tgtactgcag aggaacctgt gtgggtaaag cagggtcctc tttccgtcga 92881 ctaaataaac cctagcaatg gaaacagaga tatctcctaa ctcctggaaa ctggcacttg 92941 aaactagcta aaggaaatct gactcgggag gagcaaacca caaaagccag gagcatactc 93001 actgtaagtg tatttagtaa aaatggcatg tgaagggggg gaagggtagg ctgagccaca 93061 gcccgttcca gccggctgaa aaaaataggg gtgcggaagt tttcaccaag aggattaaag 93121 atgcttgaat ggccaccacc tcttcctcta gcattgccaa gacaggagag tcatgtcctg 93181 agtctttccc attcaaaaaa tggaatcagg agagtatgga gtatggagtc aggagagatg 93241 gttcagaggt cactcttgcc ttggaatatg tctcatgatg ctcagttaca agacacagag 93301 atgagtgttt tgtaaaagga agcaaaattt taaaaggaat gctaaaatga ggtcaaaaca 93361 aggggcttac tcaaaaaaaa atgcaaatga aaatatttac tatacatgga aaaaaaagga 93421 aaacagaata atgcaactta aaaaaaaggt aactaaaaaa aataacaaat ggaaacaccc 93481 taactaaatc aagtgacata accaggtaac agttcactgc taaaataaaa aagattaaat 93541 atcataagca taacaaccat atacagtcac gcagcatgca agaaaatcct tcccgtgcag 93601 ccaacgtcta gtgggaggcg gcctctgaat gttgtgctca tgcccttaca gcgaagatgg 93661 caccggaaat attgagacaa ctgtcaactt cagccctcca ctccttcctc ctaggtattc 93721 caagtcatca aggcctctcc tgcaaaacca ctgccactgc tgctatggta tgcttctgca 93781 tttggaagga aactgcaagc ttcttcaaga cttttctgga actgagaaca cctggccaaa 93841 cctagtatga aggctttagt ggtttgggaa agcatgtata gatcaggctt atgaaaaaca 93901 atcttctcat gtgcactatg tataaccctg cccctagttt gtaatttgtc tcctgcaatc 93961 taaaagggtt gttcagtact tggcagaggg gagaagtgtc cagtcagaca cttcaggcag 94021 gtcaccccag ctccctgagt tctagtctct ttgtgagccg atcaaaagga ctggactaga 94081 tcactgttaa agtcctttct ggagtgtggt acgagagacc tgaactgctg aagccccata 94141 cttgtgactt gctaactccc tctagagtca tgtatgcatg acttggtata aatctttctc 94201 tctgcataca tccctctgcc ccaactgcta tgtgtgtggg ctggggaggg attacttctt 94261 taacaaagat actatgattt gaatggtcat aaacaaaaag tagtttgtta gacttgatca 94321 ctgatatctt tcagtaagac ctattataat tatctcaaaa tctatttttc aacccaaaat 94381 aaaacaacaa aaaaagtata ctttgttttc ttctatgttc ctatgatcat atatcacagt 94441 ggtgtccact gagccatgaa tgatgagtga aactgacaat tcccactttt caaacacttt 94501 gactcacctt tttcttttta gcaggttggt ccattttggc ttgcagaaaa tcaatgagtt 94561 tggtttgttg agaaatagtg ccttccattt tcaccttttc atgagaatag agaacctaaa 94621 attcaagaaa acaaactaac ctcaaatcca gacagaagta tacttttaaa gttggtactt 94681 taagaaacta aaaagtgaag ttgttaggac actaatatat gaaaatccaa cacaaacagt 94741 actgagtcag tatgtattta gtccagtttt actgcaaaag agaaaaaaga tatatttata 94801 gataaatatg tgaaaaagta aaaatagcaa caggttcctt gtaggtcatg gtcactatag 94861 aaatcttcca acttctctga atgtttgcaa tctttgaaaa taaaatcttg gggaaaatac 94921 acacaatatg cagcacaaag atactgacca ttttaccaga tgactgtaca atcatgcttt 94981 gcctctgcag acttatttat ttatttattt tcttgagaca aagtcttgct ctgttgcccg 95041 ggctggagtg cagtggtgtg atctcagccc actgcaacct ccacctcccg ggttcaagcc 95101 attctccttc ctcagtctcc cgagtagctg ggactacagg cacccgccac cacgccaggc 95161 taatttttgt atttttagta gagatggggt ttcaccatgt tggccaggct ggtctcgaac 95221 tcctgacctc aagtaatctg ccctcctcag cctcccgaag tgctgggatt acaggcatga 95281 gccactgcgc ccagccgtgc agaccatttt taagcagatt ccctctgcca gtcctcacct 95341 gaatgttttc cagctgatac tccaagtcac ttctttctgt cttcagtaga tcagcccgat 95401 ctagagcttc ttgcagtcct tgagtcagac ggaaaatgtg atttttctgc aggtccatct 95461 gctgctgtaa tttggcttgc tagtggggag aagggccagg gaaatgcttg atactgcaca 95521 caatacaatt tgttcccctg ctgttcctat ttcaaactaa tttcaggcat ggcaccagaa 95581 gtacagcttt tgaaatcatc tcaattcaga attaccaatt gtcatctatt ccaaaaggat 95641 gcaagcacag ttaaaatgaa tttgcacaga aacctctgct ggccaacaag tccgaggatc 95701 cctaccgtca caggaagtcc aggctgcagg gatggccttc ccagacagag ttccaattga 95761 ggatattttg gtacattaag gtgaggatat tttggatggc atgagccatc tgagttgaca 95821 tctaaggcag ttcgactcag tgccatgaca actaagttcc aatgtcttta ttttctgttt 95881 tcaagatctt aattgttgct atgagaaatt actccatgca ggcagtgtgg gatacacaaa 95941 gagaaatcat atcatccctt acttaataaa ggtcccatta gggtaagaga ccccaacaag 96001 tacatgaata accacaatca tgagacatat ttctacaacg ttgggggaaa agaagaagat 96061 cccagttata gtacagagtg ttggtaaaac ccagaaaaga aattcaatta agtatattga 96121 aaatagtttt gtgctgcaaa tctcgctcca atttttataa tggaagttaa ccagaaaata 96181 ggacaggcta taaagacgct ggttttgtgg ttctttttcc taaaatgcat ccgtttctaa 96241 aaaagcagga gagagcaggc atcctgatga gacagtgctt gattttatcc aaaccctgga 96301 ttatagctta actacactta gtgtggccca aataatctct tccctggtag cctgggtcct 96361 ctgcagactt agggacagag agctactcaa agcaataaac atcatcactg gagattccaa 96421 aaggcctatg gcagccccca aactcctcca tcccagcagg cccagagcca ctgataatct 96481 cagcatttcc tggccctctc tgtctctttg cttctctcta cctctgtttt tctttccatt 96541 tatattcctc acctgccctt cctcttaaca tgtagctgat tccctaaggc atcgtgttgc 96601 agtagaaaga cctggatgct ggattcttac agaccctggt ttaaatcctg acttttacac 96661 ttatcatatc actgatacct gttaaaatct gtatttatca cctctcagag cctcagtttc 96721 ttcatctgaa agtgggtata ctagcttgcc tcattggatg acatattgaa caaagtgccc 96781 agtatacagt aggtgcttat tgaatgttta tcctctccat ctcctttgtc cttctttctt 96841 ccctccattc agttaaggtg ctccctctca tcctcccgtc tgcctttccc taacagtact 96901 gccacaaagg aagagctacc aactaaagat cacctggggc caggcacagt ggctcacacc 96961 tgaaatccca ggactctggg aggctgaggt gggaggatcg cttgagccca tgagttcgaa 97021 gtggcaatga gctatgatta tgccactgca ttccagcctg ggcgacagag caagaccctg 97081 tctctaaaaa aataaaaacg aaaatttaaa aaaaaaatca tttggcccgt gtgcgtaatc 97141 tggaatttct caatcaaacc acagagcccc ctaaaataca ggaaaaaagg aaagaaacag 97201 ctaagatacc ctcctccgca gcatgagtga tttcttctgc aaattaccct ctttagtttc 97261 ttcccactat tttaggacaa cattaattac taggcagaaa ccagagaaac aagtgctggg 97321 gacgagaatc tgtttggtga cagccgagcc caaaatattt ttctagaata aacgcatcaa 97381 ctgtccatta gctttccaag aaataggctg gttattccct atacattaaa aaaaaaaaaa 97441 aaaaaaagta agcggggcat ttgtactcta tgaaatgttt ttcagaagca tttacctgct 97501 tcccagaaag aagcaaacag aggtatattc atgggccagt gttaggctga aagggggatc 97561 ttagctgtgt gtttagacct gtccaacaat ttcacgagga aactgtaagg aaaaagacag 97621 gaaactggga ggaaaaagaa agggcctgga tgatagaaca agagcatgaa acaacccccg 97681 tgggctggaa gaatggatca agaccaacaa gaaaaacttc agtagggcgg tattttttaa 97741 attgtcaatg gaataaatct agaacataaa gatgttttta agaaaaagca aacttgtgga 97801 agggcaggta gtgatttaac agaccagtgg tgtcaatgat gtgatcaaaa ctacccaaag 97861 agcacaggta attggcagag gtttagagcc cagtatggag acggaaagat cccaatgtat 97921 tcagtgctat tcagaccaca tggacttgta gaaactctct gtggaaagac aaatctacca 97981 gaaaagaaca tccaaagctt gaggggatct aaaatgtttc acttaaaaaa gagaaggtgg 98041 ccgggtgcag tggctcactc ttataatcct agcactttgg gaggccaagg caggaggatc 98101 gcttgagccc aggagtttga gggcaacata gggagacgcc gtctctacaa aaatacttga 98161 aaaaataaaa gagaaggcaa tttggagtgg gagagctatt tggatatcta gttgactgtc 98221 acagagataa taaaactagt tatctcccag gaacattgca aggactgtaa gagttaattc 98281 atgtactcag agagtcagtc acatggtaag tgctaattaa acattagtta ttttaattat 98341 taatcaagta atctattaat cagtattaat ctcccatttt ggttggagat atcagactac 98401 tttctaattt catgtgtttt aatgtaatag aaccaatagg taaaagtgga aggtttttag 98461 ttcaaaataa gaatgttcta agaatgagag ttgctgaaaa atgggttggg atgtccaaag 98521 tcattgtaag tatttaggcc aaggttcaac ggggcctacc tatgaggagc gttggggaag 98581 ggatcttcta ccttggatga gagatgaatt tgaccaggac tagcaaatgg actttgatca 98641 ctgatattac ttccaatgaa cagatagtgg ctataaagtg tcatgttggc caggtgcagt 98701 ggctcacgcc tggaatccca gcactttggg aggccaaggt gggtggatca cgaggtcaag 98761 agatcgagac catcctggcc aacgtggtga aaccctgtct ctactaaaaa tacaaaaaat 98821 tagccgggcg tgatagcagg cacctgtagt cccagctact cgggaggctg aggcagaatg 98881 gcgtgaaccc gggaggcaga gcttgcagtg agccgagatc actgcactcc agtctgggcg 98941 acagagcgag actccatctc aaaaaaaaaa aaaaaaaaaa aaaaaaagtg tcatgttgag 99001 aaagcttctg aagccatttt gggcccagga ggaaaaatgt gaaagatcca ttagtaaagt 99061 ttgtcaggag caaggataaa gggaatagtg ctggaaattc agactggatg atataaaaat 99121 tctttcaatt ctgagacttt gattctggct ccatttaaaa gtctccttat aggccaggca 99181 tggtagctca tgctataatc ctagcacatt gggaggctga ggtgggcgga tcatttgagg 99241 ccaggagttg gagacaagct gggccaacat cgtgaaacct tgtctctact aaaaatacaa 99301 aaaaatttag ccgggagtgc tggtgtatac ctgtaatctc agctactcgg gaggctgagg 99361 caggagaatt gcttgaccct cggaggtgga ggttgcagtg agccaagatc acgccactgc 99421 actccagcct gggcgagaga gcaagactct gtctcaaaaa ataaaaataa aaataaaaaa 99481 taaaagtctc cgcacctacg aagaggatac ttaggtggaa gccttgggca catatgctta 99541 tacagcacac accaaccagg aggcccaatc acaatgtgga caataaagag cattctttgt 99601 tccagaatgg tttcaactgt cactgaaatc tgcctccggc tcaatgtcag ccagaagtct 99661 gtgctcaaaa atctttctgt gaggaacatc ctgggaaact ggaaatgttc tcatgtaggt 99721 tagaagtgag aaggagggaa ctgaattggg atcccggcac tctattccta gagtcgtcac 99781 tgactcacca cataaccttg gaccaaagta ctgggctgtc agggccaact cgccctctca 99841 gaaggaaaat aggacaggca agagcattga ttatttctgt tctttctgtt tgcaggatgt 99901 gcctaagtgg gaacactgtc tattcactgc agataaaaga ccctaattct ctatttgacc 99961 acgaggtaca tttaacattt ccactgtagt gttgttctta aggccacggg gtgaattcct 100021 atgtgtgttc accaaatata cgtgtgtgtg tgtgtgtgtg tgtgtttagt attaatttta 100081 ttttttaaat tgtttacaga ggcaaaatac acgacataaa atttaccatt ttaaccattt 100141 tttaagtgta tagttcagtg gcattaagta cactcatgtt attgtgtaac caccaccacc 100201 atccatttct agaactttct caaactccca cccatatttt tcattttaat ttttaatttt 100261 tattttgaga caggctctca ctctgtcacc caggctacag tgcagtggta ccatcagagc 100321 tcaatgcagc ctcaaacacc tgcgctcaag tgatcctccc acctcagcct cctgagtagc 100381 tgggactaga actgcacacc accacacttg gctaattttt tttttttttt tttttttttt 100441 ttgtagagat ggggtctcac tatgttgcct atactggtct caaactcttg gcctcaagca 100501 atcctcacgc ctcagccacc catatttttt aaagtatcat tcaataaaat attttatact 100561 cacccgcaaa aatgtcacta tacttttcaa acttaacagc ataacaatat tatttctatc 100621 agcaaagtgc ctcaggagct atgggtatca cgtgacaaaa cctcatcagc cagaacaggg 100681 cattctggtt aattaaactg tcaggctcac taagggttaa acagaatact ctttggagtc 100741 caatttttca ttataaagtg catttgtcct tgcaggtcaa tcctaaagaa caaagcggaa 100801 tcatcagtga ttcaacacac agggctctgg ttggggagtg ggaggtagtc atcaatttaa 100861 atagccgttc tcagagttct gtgaaaaggt accagccatc acgtaaggga tgggtttaga 100921 agatcagtct gagcttgtct attatgcttc aaaagcaggg cacacattat ttttaactaa 100981 attacccaat gttcaaaaac cttcatgaaa tttggtactc agaataatct gataaaattc 101041 tgccctgtgc aggaagctaa tttgcatgga tgtcctatcg gcatgaaact taatatactt 101101 tggatattag ctctaaagaa caatttaact caatattcat tcacccatgt atttaataaa 101161 cttctatttg gcacacactt actatgtgct tgacactcca ttaagcatca catagaagga 101221 taaaaatgac agagaccctc tggccttcat gacatagtct tttaatgtgt taaatacctt 101281 aagacaagta ttacatgacc cccatactct tacttgaaaa ttctaatttc aaaatatcta 101341 atttggatct gacaaattgt caatttcaaa attccaccta gaacggaaag ctaactactg 101401 cacattctca cttacaagtg ggagctaagc tatgagtatg aaggcataca agtggtataa 101461 tagacactgt ggattcagaa gggggaaggc gaggagggga gtgagggatg aaaaattacc 101521 tatcgggtac gatatacact atttgcgtga caggtacacg aaaagcccag acttcaccac 101581 cacacaattc acccatgtca ccaaaaaaac acttgtaccc ctaaagctat tgaaataaaa 101641 aaatctttta aagaaatttg tttacaagtc caggcagggc acggggactc atgcctgtaa 101701 ccccaacact ttgggaggcc aagacaggta gatcacttga gcccaagggt ttgaaaccag 101761 ccagcctgag caacatgggg aaaccccatc tctacaaaac aatacaaaaa ttagtcgagc 101821 atggaggcat atacctgtag tcccagctac ttaggaggct gaggtgggag gatgacctga 101881 acccaggaag tcaaggctgc agtgagtagt gattgctcca ctgcattcta gcctgggcaa 101941 cagagcaaga ccaaaaaaaa aaaaagactc cagagagaaa tgagctacca gctatgaaaa 102001 gacatggagg aaacctggat gcatatgact aagtgaaagc agctgatctg aaaggctaca 102061 tcctgtgtga ttccagatat gcaacattct ggaaaaggga aaactatgga gacagcaaaa 102121 agatcagtgg ctgccaggag ttgagggtgg ggagggatga acaggcagag tacagagggt 102181 ttttagggca gtgaaactat cctgtatgat acagtaaagg tggataactg tcattacaca 102241 tttgtcaaaa cccatagaac atacaacacc aatagcaaac ctaatttaaa ctatgcattt 102301 ggggtgatga cgatgtgtca gtgcaggccc attgattgta acaaatggac actctggtac 102361 agggtgttga tagctgggga cactgtggga ggaggagaca gggagtatgt gggaactctc 102421 tgtgctttct gctcaatttt gctgtgaacc taaagctgct ccaaaaaaaa gaagcctatt 102481 aaaagaaaaa aaaaatccac ttggcccttc tcccaggtat ttcacactca cctcttccag 102541 aagcctctgt ttgagctctc gttcagtctc cagcttctgc tgtaagcttc gggcattcat 102601 ttcaagcata gcatgcttct tctccaggtc attgagctag acatttggaa agattggcat 102661 aatgccacac ttaggaatgt tagatgctga caacgtttat gaatgtccac gaaccttcac 102721 ggagaaagaa tattgagttc taaaggtcag aacgtaaagg ttgctttttt ctaaatacag 102781 aagtccctgt cactggtgag tcttccatag gttattatat gaggatggaa attatttcta 102841 agggatggct cttaaatatg gctgtgacaa gttctaggag gataactgct caaggactta 102901 tttatctgac atgctttggc tgaatttaat taaataatca tacgctgaat gcttactacc 102961 tgcattatag cttttaggat gaaaggataa ccaagctctc aaggatctta cgtcaagtca 103021 gaaaggccca taactagaca tgtatgcaaa taatgactgc tctgtcagct tataagtttg 103081 ataagaaagg cttgggaaag aaattcaacc caattaggag aagatcaaga caataaggta 103141 ttgcaaagtc tggagcttta aaagatatga aggactttgc tgcttaacat tttaagaaca 103201 gtcaactaga aaaaaatatt tttcacatta aacataaaag gcctcacata gctctaataa 103261 atctagtttc ctcatgctta aaaaaaatgg taacccagtt tcagggaact caggagcatt 103321 cctgactaga gagctctggc cttaaaccta ataattctgc ttctattaat ataagtatcc 103381 ggtaagaaaa taatcatgga tgtccataaa gattcatgtt tatcatattg ttatttatat 103441 ggtaaaatac tggaagttca aatttccaat acaagtataa tgattcaagt aggtacttcc 103501 atgcaattta ttttttttag aatcacatat tcaaaaaata ttcaactcag aaaaaacctg 103561 aagttaaatg atttttaata aaaacaggta aagctgtata tataatgaaa atctaattta 103621 taaaaataca tacgtatgtc tatgtatatg tgtgtatgca taaacataag cacagagact 103681 gcaaataaat acaccaataa aacaacagcg atgggtgacc agccattatt tcttttactt 103741 ttctgtggtt tctatatttt ccacaatgac ccaaataatt atttttctat catgaaataa 103801 caaagttata gaaaatataa aatcagatcg ttattttatt ttgtgtgtgt atgtgcatga 103861 aaaagagtgt ctatgtatgc atctgtgtgt tcatacataa aagtcctctg atcttcactt 103921 tggtgatctg taacgctgaa gcgcctattc tgtatctctc tcaagattat tagctaaaat 103981 ctccaaaaca ttgtttatga ctataaatgt aatatggaga ggcacatgga tggaacacgt 104041 gtatttccga tgtgaaatac tgagaaccag catgcgtcat cagaccctat cacgaccctg 104101 aacttggaac acggagaagc tgagctgcct attactgcag caattggatg agcattagtc 104161 actgccacca agaacttccc aaagatgcac agcattcaat ccataaagcc ttctggaaat 104221 aagatgccag acaaccagag agtagaacac ccagatcaaa attcgaactt caagggccta 104281 gacacacaga ttttaaaaag tcactggccg actgtgatca ctgggtaccc accgcaccca 104341 cccacgctac tgccaaccaa aaatgtccca tggtaaggca aaaccatggg acatttttgg 104401 agtgcatgaa catatgttta tgaaatatgt ttgccatgga ttgcatttga atttttatca 104461 agcgtgcctt taaaagaagg aaacgaaaat gttttcctaa aaagcagaag gggtgggcgc 104521 tgaccttgtc agagaggctc tcggccttca gcttctgctc tttgagagcc tgctgcagag 104581 cgagaatctc agccttgtgc tccttcactg ccagctccac cacctggcga gactcggtga 104641 tccgctgatc ggctctcgcc ctgccagaaa gacacaggtc agcttttggt agccccccaa 104701 cagctgacct gcggaaagaa ggctggtccg tctctaatcc tgacgagcaa ctaaacaggc 104761 tgcaggccag aaagatgccc aaccttgact cttagttcca aactgtcttc ctttccaaca 104821 gcttacaaag caaagatttg acccatgtta ggcttgggta aggaaaaaca aaacaggcca 104881 ggcacggtgg ctcatgcctg taatcccagc agtttgggag gccaaggtgg gtggattacc 104941 tgaggtcagg agttcaagac cagcctggcc aacgtggtga aatcccatct ttactaaaaa 105001 tacaaaatta gctgggtgtg gtggcgtgcg cccctaaacc cagctactcg ggaggctgag 105061 gcaggagaat cacttgaacc caggaggtgg aagttgcagt gagccgagat cacaccaccg 105121 cactccagcc tgggtgatag agcaagactc ggtctcaaaa caaaacaaaa acaaaaacaa 105181 agttaaatgt ttttgggtac tcatcaagaa actgaaactt gaggctgggt gcagtggctc 105241 acgcctgtaa tcccaacact tcgggaggct gaggtgggca gatcacctga ggtcaagagt 105301 tcgagaccag cctggccaac atggtgaaac cccgtctcta caaaagatac aaaaaattag 105361 ccaggcatag tggcgcatgc ctgtaatcct agctactcag gaggctgagg caggagaatt 105421 gcttgaactc gggaggcaga gttgcagtga gccgagatgg caccactgca ctccagcctg 105481 ggcaacaaag caagactcca ttctcaaaaa aaaaaaaaaa aagaaaaaac aaaagtgaaa 105541 cttgaatctg atgagtgtct ccataaataa tgctaaagaa tagaaaaacc agaagcaaaa 105601 taggaaaaaa aaatcacttc cctttcccca tttaggtatc atgcttccta cttccacact 105661 ttggcttata caatctcctc cactggccat tccctccttt gctactctgc aaactcaaat 105721 cttactcagc cttcaaggat aaattcaaat gctatgatct ccataaaact attcccactt 105781 tctcttctca ggagcaacct caccttgttc tgtaacaaaa gcataactgt atttctctcc 105841 taaatatata tatatatata tatatatata cttttttttt tttgagatgg agtcccactc 105901 tgtcgcccag gctggagttc agtggcgtga tctcagttca ctgcaacctc cacctccaag 105961 attcaagcga ttctcctgcc tcagcctccc aagtagcttg gactacaggc acgtgccacc 106021 aaacccggct aatttttgta ttttttgtaa agacaggatt tcgccatgtt ggccaggctg 106081 gtctcaaact cctgacctca agtgatccac ccacttcggc ctccccatct ctcctaaata 106141 tctgaatcca tatctcctcc agtagactct taggctcttg aggacaaaga ttaacaatta 106201 tattttactc atctcagtat cccatataat accaaaacag aaaaaagaaa aaaaaggaaa 106261 agaaaggcac gatcttccac aattgccctg aaagcagaca tgaaatgaaa gcagactaat 106321 gctagtcttc ctgagaacag ttcataatta taccttccat gtatggtcgt gtatttcttg 106381 atgctgaagc cagatcctac cccaaaaatg atctctgtag gtccctaaaa acagctagga 106441 taggagctgg acttgagggt gggagggaag gggtctgtgg gaaaacaacc agaacttaca 106501 atgaggatgg gggagtctaa ggattcagaa ggaagctgga ataggagact taaacagcag 106561 tccctcacat caacaaatac cctgagtggg gaagctggga ggatctacaa aaaggaacta 106621 tttctaccat ttcttttgaa atgtaagggt tttcatcttt tacctccaaa cttttggagc 106681 tcaactgaac tttgcgtaat ccatctgaca tatttagtta agcctactat ggccaagcaa 106741 aagtgagaga tagataccca agtgagtgta atacagacta attcattgat cgatatttct 106801 atggatagct aatcttgctt aaggagttca gagaatcaca ggaaggtaag tttcagagga 106861 aggtgacatg agcaagttga tgaaaaatgg atccgattca gctgtgcaga caggaagtca 106921 cggggcacac actagcccat atcatgcctt tgtacaaact ggaaagcggc acccgtcctt 106981 ctagaccatt tctttccatc tgtgaaatgt cacaacgtac caattttgtc agaaagaaaa 107041 cacttttctg ccatgagctg tgtaaaactg tcataacaac tcttccagtc tatgacatct 107101 ttaatgtgac cagcaaccct tagaactatg gagtatgcaa ttgtgccact gtggccccac 107161 aggaagggaa atctatactg gggaactgtg gaaacaaaag tgagaaagca gaggaacgat 107221 actgcctgaa ctgattatta tcatttccta ctaagaaagg agaaaaaaaa aaaaaccagc 107281 caggtgtggt ggctcacgct tgtaatccca gcactctggg agaccaaggc aggtggatca 107341 cttgaggtca ggagtttgag accagcctgg ccaacacggt gaaaccccgt ctctacaaaa 107401 atacaaaaaa aaaattagcc gggcatagtg gtgcacacct gtattcccag ctacttggga 107461 ggctgaagaa ggagaattcc tcccaccctg gaggtggagg ttgcagtgaa ctgcgatggt 107521 gccactgcac tccagcctgg gtgacagaga ctctgtcaca aaaaaaaaaa aaacaaaaaa 107581 aacccactaa actattcttg tacagcagtg gttctcaact ggggacaatg gagacacagg 107641 ggatattggg ggcaatgtct agagacatat ttgttgtcac aacgtgggaa gggggttgtt 107701 ggtggcatct agtgagtaga gactagggat gctgtcccgt cctacaatgc acaggacagc 107761 ccccataaca aagaatgacc cagcctgaca tgtcaatagt gccaaggttg aggaaccctg 107821 gggtggaggg aagaccagga ttaaaatggt aatggttttg ctatggcctt gctatggtca 107881 ctctgacaag aaacctcatg agagttttct gcacagaaaa cagaggtatt tgccctgatg 107941 cccaggcctc tgcatgggga aaacagatac cagtcatagc cacagaatcc aactcacact 108001 atctacgttt acaactaggg tgaccaccct atcctggttt gcccaagagt ctcctggttt 108061 ttagcaccag gaggcttgtg acccaggaaa cttctcagtc tcggcgggaa tagctgatca 108121 ccctgtccac aactctcagg ccgatcctcc tgcggtcacc tgtcatgagc tttccacaaa 108181 cacaaagccc cacctgctct gtttctcggt gtccagcatt ctctgcagct ctcgaacccg 108241 acactcaaac tgggatttct catcacccag gacgctcctc caggcctccc actgccgctc 108301 tttttctagc agctcatcgt ttagggcctc caaatccatg acctgttcct ccagcatggt 108361 gcacgtggtc ttcagagcct ccatcgtctg caaatcagta gcactgattt gtgccttgtc 108421 tttaaacaag gatttcctgg agattacttt ggtcgggtca gtttcagaaa agcacaagtt 108481 tctaactaca gtttgaaatg cggtttgtct tttgcttctg cagcccaagg ggaccaggca 108541 agaatggatg aagggccagt tctgcagcca ccttagaggc aggatccaaa gcaaagccag 108601 gggaagtgac caagcttaaa atcaagccat ttacattgat cccccaaagc atttgatttt 108661 gtttttatct atctttattt tttcgagaca gggtccagct ctgtcaccca ggctggagta 108721 cagtggcatg atcacagttc actgcagcct cgacctcctg ggttcaagcg ttcctcccgc 108781 ctcagcatcc tgaaaagctg gaactactgg catgcaccac caagcccagc taatttttat 108841 cttttttatt tttatttttt ttaagagatg gggtctcact atattgccca gactgatcct 108901 gaactcctgg gctcaagtga tccttctgcc tcggcctccc aaagtactgg gatgacaggg 108961 aagagctgcc acactggccc tattttttaa aatcaaaatt aataagaact acacactttt 109021 aaaatacatt tttaaaaccc taaaggcttg gaaaaaagac atgattttaa aaagagccca 109081 attactgtat gaggttcagt gttggaaccc ttgggagaac gcaccaagca ctcctaaaat 109141 ccttctaaaa gtcctccagg gatctcacga ccttgttaac cctccttact tgcttctggc 109201 tggtaagctg catctctcgt tccgtgatct cccggcggag atggtccact tcacttcgca 109261 gttgtacaat ctcgtcgttg gcgccagaag cctcatcgag ttgtttggac aagtagaagt 109321 tttggttgtt gagttcagcg ttgtcctcgg tcagctggtt tagctgctcc tccaggtctg 109381 tgattaccta aaagaggaaa ggaatccagt tacacgccaa gcacattcat ttcaaataag 109441 aaattgcttc tagggctatg gatttctgaa cccacgtgac actggcaatc aacgtgccgc 109501 tcatgagttt gcccacctga ccgtgagagc catgtccaag ctcctatgcc cagccactta 109561 actctaaaca cgagcagtgg aagaaaaaga ctcagtttct ctctaactca ggacacatct 109621 aaaaaatatg gcaaaagcaa tcaaaagtaa tcactgcatg aaacggatga cttcgaatga 109681 gatcaaattg acggaaataa tataatgtgg aaagaaaatg caaattaaga atagactggt 109741 atcaccatcc actaattaag ctaaaaatgc ccatgacact atcactagga ccttattatt 109801 ggagatggcc atagtcttag aaagcaaaat gaagttctta tcaaggaaga ctgacaggga 109861 aagacgccag gagttatttc gataatatca caagcaataa atttaatttc atacaacaaa 109921 ggaaaaaaac ttcagggaaa ctcagaaaaa ccctgaaaat acaatacact gtactatggc 109981 atgactttaa tggattttac tatacagttg gctatcagac aatgggaagc catgtatatg 110041 aaattgttat tttattgaga gagatttaat cggggtttgc taaattactc tcctgctcag 110101 tagcagaatg acttcaattt cagaacattt agctttccta ttctctttgc tgagatttag 110161 tattatggct tgcagagagc ttttatgtat aataccttgg aagtttaagg gcaattttct 110221 aataatagtt ggactgaggc acacaggcag taaattaaat tttacgtaac aaaaagaagg 110281 aaattaaaag gaaatgccgt taagcctaca gttattaatt gtttccaatt ttcaaaccta 110341 gagtcgattt cgtttatgaa atgttgccat tgctttcatg actaattccc tattttaacc 110401 ctctttgatt atcaatttaa aaaaaaatct tcccaggcac atgaaattaa agaacatcaa 110461 ttaagtacac ttgtccacaa atgtactcag aggagataca acttttcatt ttgttaactg 110521 agctgataac cttttagtgt atggtctaat ggtctagtaa tttgcttttt aatgtcaaaa 110581 gccacattaa ttcctatgta cgattttgaa gtacactgtc agctacaaga gggggaataa 110641 aagttatctg gtcaggttct tagctcagat acattcttta tcaaggaaga caaataagga 110701 aaatctgtac gtttgccatt cctaacagac acttatgact tgcagggtcc ctgcctttcc 110761 ctttctgtgg catgttaaat ctcctaaaac cagaaacaat gaataaacaa tgcccagtag 110821 ttcaactggt ctccttcctg tacgtagtaa acccattcct taagatgttt ttcataatgt 110881 ctccatttaa ctactaattc ctccccctac ctctttccca aagaaaaata agacaggcaa 110941 gctggcttgg aagatataaa atggattctg gaacccccaa taagccatct gacctttgct 111001 gggttggttt cttcatctgt gacatgggag tgaacactat atttactcat ctgcttttac 111061 ctttttaatt attttaattt tcttgcatgt tatttgacct taatgcatca tgtaaacatg 111121 aggtcaggag atcaagacca tcccggctaa cacggtgaaa ccccgtctct actaaaaata 111181 caaaaaatta accaggcgtg gtggcgggca cctgtagtcc cagctactcg ggaggctgag 111241 gcaggagaat ggcatgaacc cgggaggcgg agcttgcagt gagccacaat tgcgccactg 111301 cactccagcc tgggtgacag agcaagattc catctcaaaa aaaaaaaaaa aaaaaaaaaa 111361 aaaaaaaaaa aagcctcttg gggaataaat agactagact atagaaagtc tactcacatg 111421 aaaatacatc attgccaaaa taatttactt gccattcaag gatgtactta agactttggg 111481 tacctgcttt ctttgattca cgaaaaaaat gcaatttatc atttgtgatg gtagcccatt 111541 tctgccacct cttcgttttc tcccaaagtt atattataaa atgctaaaga agaaacactt 111601 gaaaaagggc aattttgtat cactcaattg agatgggccc attggaataa gaagggtctt 111661 tctatttccc tgttgtcatc cagttttcaa gggttctcac gcctctaatt atatttccgg 111721 tttgtgattg ctggttctat atttggtcat catctctatt ttgcccattt gacaagagaa 111781 gtgttacatg tttttgtaaa gtacccatgc cgagattgaa ggcattttaa ttgatttgga 111841 ctttttatac acatattcta tctacccaag ccataacaca tttagcttgt gaccttgttt 111901 caagttgctg gagtcatgat attaaacgtt ggacacaccc acacacacac acatacgtta 111961 ctgagtttac agatattaaa ctgacttatg acataggcag taaattaatg gtcttcgtgg 112021 attttttaaa aaaggcataa caacaggcaa tgttcactac tgtcctttga agtaggtatt 112081 attttatttc tattttatag atggggaaat tggggaaaca gagatgctag gtaacttgcc 112141 caaggtcact gctagtaaat aacaaagcca gaatttgaac ccaacactaa aactcaagaa 112201 tctacattct tgccccctga accatactgc ctgtctaaaa tagggataga agtgtttatt 112261 cactttttta ttcagatctt tgaaaactga aatgatggag gtaagatata ttgaccattc 112321 aggacaaccc ctcttgattt tgatccaggg aatgtgcaag gaaaattgtt aggtctactt 112381 tctggtcctg ggtatattct ctacccacca aggacaaatg atttgtttac tatgagtctg 112441 ttaacctttt aaccccaaag atagagagta aaaaagagaa actgaaaggc tgattccatc 112501 ttagcagcaa aaaaagatga gcaaaatatc ttcacaacgc aggcgccagc agagctttct 112561 tatatgctgt ttttacagat gagaggtctt ggtacagttc taagacagaa gatattaaaa 112621 aaaaagatct aagctggacg tggtggctca cacctgtaat cccagcactt tgggaggctg 112681 aggcgggtgg atcacctgat gttgagagtt caagaccagc ctgaccaaca tggagaaatc 112741 ccgtctctac taaaaataca aaattagctg ggcatggtgg cacatgccta taatcccagc 112801 tacccaggag gctgaggcag gagaatcact tgaacctggg aggcggaggt tgtggtgagc 112861 caagatcacg ccattgcacc ccagcctggg caataagaga gaaactcaat ctccagaaaa 112921 aaaaaaaaaa aagatctact ctcagtgtca aaaagaaaaa aaaacagctc tgtttgggct 112981 tatctgttgt ttttcagtgc tatcatggtt ctgagaagaa gatgtgaacc tctcacacct 113041 aactggaaga ccaggagagg aaaattacca ggtaaaagtc caatggcatg ttaggggtga 113101 agaaaccagt cagataatga gcatcaggct aaacagacac aggactgaaa tcctatttta 113161 atgcctggtt ctcttctgaa ggcagctatt tgcatattat ttatttttaa tcttatagct 113221 gacagattga ttcataatta aaactgtgtc atatgtttgc ccaccaactc ccctctaaaa 113281 cagacttagg tagggtttta ggtgccagat tgaatcaaat cctttttaaa tgtcttcaaa 113341 atatagaaat ggtctgagct tgaaaatctt actgtgcatg agctttttga aagctgaaga 113401 gagctcctat caattgagtt attatgggag ttgtgggaac tcccataata actcattttt 113461 aaatgacaaa gatgacattt cctagcggag acaatcttgg gcaatgccaa gaaagaaagg 113521 aacctccaac acactcctct gaagtgtctt taaagcggtc ttgaacttgc aaaagtataa 113581 ttgagaccta ctattgtggc caaatgttgt tctaaagtag tggtattcaa actttttaaa 113641 gtagcagcag agccattttc caaacaaaag attatattat aagcccagga gaataaaata 113701 atagtagtaa ccaatgttgt ttagtacttt ctatggatca ggccctgtgc aagcagtttg 113761 cacgtatctc atttaattct cgcagcaacc ttatgaggga ggtactacta ccaaatccca 113821 tgactcagtg ttccataagt tgtattgttt ctgaaatgtg gatacgtggt atactggaac 113881 cagcagctct ctctcttccc tcacccagcc cccaaagagc agttattaaa taaattatgc 113941 atcctacaat caatgagcgt actgattgga actgagggta tacaatgtta tctccatttt 114001 acagacaaag aaacagaggc ttagtaggat aatacacgta ggttactaag aggccaattt 114061 ataatctgaa ctctatctga ttccagagcc catgacttag actactaagt tacccagctc 114121 ctctggctga aatgtgggtg aagggcatct ctctcaggtt cacacacaca cacttgggaa 114181 ggcccccata ggcttcgcca cagaacacag aggtaaatga agcaaagatt agaagtcacc 114241 gttccagaaa tgcaaatttg gagagcatta gagttacatg agaaagtaac caaatgcatt 114301 ctacatacta gtgtattcca ctaaaactaa acttccagtt tttaaaatgg gggacaagat 114361 gaattacaaa ggggtataag gaaacttttg agggtcatga atacaataaa tattttgttt 114421 ttatcgctca aggataaaat ttggataacc ttattatcca aaataataac aaaattcatt 114481 attttgattg ttttgctgtt tcacagttac acacacattt taaaacgcat cagattatag 114541 tacaccaatt atacctcaat aaaactgtaa aaaaaagttt tttagaaagt tcttatgcca 114601 aaagactgaa aaaggaaaaa agaactccag atgaatggaa caggattact aatgcacaca 114661 ttttttaaaa tccaagctta tataaggaaa aaatagtaat ataaaaaatt ccagtcataa 114721 aatgatctac attttattca gtgatcattc acgcgcatgt atgtaatata agcaaataat 114781 tcttgtgtaa aataagaaac gaagacaaca agaaactaaa atcctagggc agaaaactac 114841 cagaatgtca ccccccaact cctcaaggaa gggaggaggg atggaggaaa tccacagaca 114901 ttttaaggca ggcccatccc gatccaccac cacatcagga aatacagacc aaaaaactgg 114961 tctgtcagtt ccctactaaa gtcaaacaga ggcaaatctc atgacctagt aattcttggg 115021 gatatgttca ccagacgtaa gtgctcacag ccaccaaagg ccttgtgtaa cggggtttcc 115081 taacagcttg atccattatg gaaacaatcc aaatgtctac caacaaggag aatggataaa 115141 caaactatag ttctattcat acaccagaaa caccacacag tgataaaaaa aaaaaagaac 115201 aaattattga taggtacagc aacatggagg aacctcacaa aacatttaag cgaaaaaaaa 115261 agctaagccc cagagagcac atcctctgtg attccattta tatgacattc aagaacaggt 115321 aagactaaac tatggttatc agaggtcagc atagtggtta ccctttgggt gaaggatgat 115381 gatgggaagg ggacatgagg gagccttcag ggactaagaa agttctgtat cttgatctac 115441 attggggtta cccagttgtc tatctgtgta aaaattttat tgagctgtat acttaaaatt 115501 actacattat tatgtgcttt actagataca ttttatatct aaattaagaa aaaaatacca 115561 ttaggactta atatttcatt tacagtagct aaagaggaaa gaaagggctc ataacaagaa 115621 aataaaatat gtatgtgtgt ggaggctggg aggtgtctct ggaaattcac aaatgcaaat 115681 aatacaaaca aggacaagat cactgaggac taagaaggta ataaagtgac aaaaaggcca 115741 aagaaaagag agggcacatg aataaggaga acatccccat cttctgaaag gggcctattt 115801 cagttcttca ctctcatcaa aaatagactg aattcccact gaccctactc tattatataa 115861 cctatggaag atgcttatcc actcatagaa tttacttctc tcaatataaa taacatctcc 115921 caaagagtcc tagattttga atttgcatgt tgggaagcca aattttgcct ggtagaaagt 115981 tacaatttct aggtctccag gagagctcac tggtctttcc tccgcagggg cagcaatctg 116041 gacaagtctc acagatacaa ctgccctggt gacaggggag tctctgacct gggctagcca 116101 tggaaccttc tacattggga caggaggtgg aactgccttg ggatttttgt tgcggctgct 116161 gaaacctatt agttgttcca ctaatagaat tattttgacc cttccaacta tcctcttccc 116221 caaaacatga agaatggttt cagtttttga aaactggaag aacccccaaa aacaggttat 116281 ctagggctca aaaactccaa ataaggttga ttgcattatc ttcttcaagc tgcaagttcg 116341 tatcaaaact cagtcaccaa tattttaaag agacgtcccc ttttcctcct ctgttttagt 116401 ttcatgttat tttaatcgtc tttgggtaat taatactcac agtacagctg ttacgaagag 116461 catcaaattt gcgctggatt tcatctctat gtgcctaaaa ggtaagaagt tgattataaa 116521 tttttcatag ggtagaatac aattctacat gctacgggtc aaatatagag aatagcaatt 116581 tgcctggaaa gccgagaatg cggcaccagc atctgttttt ctaagaacta attaatcaaa 116641 tcatccccaa caccagaaat aagttacagg aatacacaat gctgtgttaa taacaattaa 116701 ttctttagac catacatgca caccctgtac aaatattttt aacatttata ttaacaccta 116761 caggctatta ggagaaaagg gaaaaaaagc aataaaggat gaagttttac tctccttcat 116821 ttgtcctatt cccattctga aaagatgagt attttccctc ttccctccaa ctgactcctc 116881 cccagcctac tccctgacac acacacacac acacacacac acacatacac acagaaacgc 116941 acacacagga caatagccaa gacaaaggga ctttgtgtgt gtgttgtttt ttcattaaaa 117001 gctgcggaca ttcagaaaac tctacatcgg ccttgaaata aatatatata ggctttctcc 117061 aggaaaaagc tgccttaaac cagcagcatt ttttcagttg acaaggactt ttttagtgca 117121 tactatatgc ttgacactgt ggaaggatat aggcatccat gacccatgcc ttcattctgg 117181 tgacggaaga tcggaaatga ataagtgcgt gattagagat tgggacaaat gttataaaga 117241 aaacaaatag ggaccagttt aaaaaaaaaa aaggaataag aaaggtaggt agaaacttac 117301 ttagatggga tggtcaagga gggcttctct gaggaggtgg catgtaagct gaaacctata 117361 ggatgatgag gagccagcca tgcaatggac aggagacggc agagggagca gcatttgcaa 117421 agaccctgag gcaggaaagg gtctggtgca tttgatgaac aacaacaacc aaaaaaactg 117481 agtggctaga gcagagtttt tcaaactgca agctgtgacc caatagcaag catgacaagt 117541 ggggtttttt gttttgttgt atccagtgaa gtagaataaa aaagatagaa ttgaaaacat 117601 cagaatgtgc cttgaggagt aaacatgagg tctgttatag gaaacttctg ctttagatac 117661 tataactaat caaatttaag gtactttcca tggtaaaacc taattttatg taacattaag 117721 aaagaaaaag tgctgttaaa ctatgacatg tttatgtctg tatatgactg taatatgcac 117781 cttgatttca gatgccttag aaacagtgaa atacagtata tatactccct tctattggat 117841 tgcaagtaaa atgtattctg tgggttgtgg tcaacttttg aaaaagcaca gcaaagaaga 117901 aaaaaaagta gcccaagatg atgacaggga ccaaggcaga gaccaagaat cttgaaggtc 117961 cagtaaagaa tttaggtttc attcaaaatt caattaattg aaaaccaatg gaaaatttta 118021 agaaagaaag tgacggattt tactgttgtt ttaagatcac ggtagatgtt acagggagaa 118081 gaggatcaaa atagaacaaa ggtagaacgt agcagggaaa ccaatgaagg tactgttact 118141 gtcattcaag cgagagatga tagtggcttg gagaaggatg gtggcaaaaa caaacaaaca 118201 aacaaaccaa acagagaaag tggatgaaac tgagatctcc tttgaaaaca taaggaaaag 118261 gacttactga tagatgtgtg gccaagagga aagaggagtc aaagacgccc taacagactt 118321 ttcacttgat agacaaagtg gatatggtcc catctgctta tgtatgaaaa cccaaataag 118381 aacccagaac ccagtttagg gcagatgtaa gagtgatcct tctatttaat tataaaaaaa 118441 aaaaaagtct ccataagaag gtctatagga ggccaggcgc agtggctcat gcctgtaatc 118501 ccagcacttt gggaagccga ggcgggcaga tcatgaggtg aggagattaa gaccatcctg 118561 gctaacagga tggtctctac taaaaaatac aaaaaattag ctgggtgtgg tggcaggtgc 118621 ctgtagtcac agctactctg gaggctgagg caggagaatg gtgtgaaccg gaaggcagag 118681 cttgcagtga gctgagatca cgccaccaca ctccagcctg ggcaacaggg cgagactccg 118741 cctcaaaaaa aaaaaaaaaa aaaagtctat aagataactg agtcaaaaat tttagagctg 118801 tgatgttttt tctaatatgt ctatttttcc ataaatacta ttgtcagttt ataatggcag 118861 attagatact ttagaagaaa agattagtga acttgaagac aagtagaaac aaagaagtag 118921 aagacaagaa gtagaaacca aaatgaaaca caatgagaaa agaaaaaggc attttcaaaa 118981 aatgcagagc atcagggagt tgtggacaac ttcaagtagc ctaacataca tgcaattaga 119041 gacccagaag gagtacagga gacagaaaaa atatttcaag aaataatggc cacaaatttt 119101 ccaaagttga tgaaagccat aaacccacag atgcaaaata ttcaacaagt aagtaccaag 119161 cataagaaat gtgaagaaaa ttaacctaaa gcacactata atgaaactgt ttaagtccaa 119221 taataaaaag aaaatcttaa aaaaaaaaaa aacagccaga gaaaaataca tgttacatta 119281 agagaaacaa aaatgagaaa gacagatttc tcaccaagat ggggagaagg agtgggcatg 119341 aaaagctaga acacagtaaa gcaatatctt caaagtgctg aaataaaaaa cagaagaaac 119401 aagcaaaaaa aaaaaaaaaa aaaaaaaaca cctgtccaca tagaattcta tacccagcaa 119461 aaatacctct taaaaaatga aggtaatata aaaacttttt caaacataca aaagttgaag 119521 gaatttgaca ccagcaaacc tgtgctataa gaaatgttaa agaaagtcct tccgacagat 119581 ggaaaatggc aaccagaaat ttggatctac acaaatgaat gaagagccct ggagagatgg 119641 taaaacaaaa caaaccaaaa atagattagt tgctgaaaat agccagaaga gaattataac 119701 gttcccaaca caaagaaaag ataaatgttt gaggtgatgg atatcccaat taccctgatt 119761 tgatcattac acattgtagg catgtatcaa aatgtcacat gtatctcaaa aatatgtacg 119821 actgtgatat accaataaaa aaatacataa aataccaaaa atagatagtt gcatagacag 119881 aatcacagac agatttataa ataaaaagtt attgtttaga attgacttcc taaataacta 119941 gcctatattc tatggactaa tttaacagat cttaattttt tttctaaaaa tctacctgct 120001 ttttgcccag caataatgag agatttattc ctttgttaag ataagaatac tgcttaccca 120061 aagtaagact gtggctatct gcactatgtc aaaatctcat ctgtgaccaa gcagcagcct 120121 agacccaagg accacacaac cccctttagt gtcttccatg atcaccaaaa accaaaggca 120181 aggattcttc ctagagaaag aataaggccc ttatgttaac agactcttgg tcaggaatcg 120241 gcaaattttt tccttaaaat gccagataag gaaatttcct taaaatttcc agagtatcca 120301 gtcttggtta gaactactta actctaccct tctaaagcaa aagcagccac aaacgatatg 120361 taaacaaaca gacatgtttg tttctaatgt gttctaataa aactttattt acagaaaaca 120421 gtgacagact aaaatttggc ctgtaatttg ccagtctctg ctctaaatga ctatttggtg 120481 tgaaatctct caaaaacaga aacaggaagc tgagagtgga gttagctcct tcaaagcata 120541 agtggtgacc ataaaaaagc tcactggctt atccacaggg acagaagccc tattccactt 120601 tcttattaac agcaaagttt caaataacac agcaccatta aaaactcatt taactctccc 120661 actggagtct gaaatttcag tcattaatac tgatccattt ccattaaaat taaccattat 120721 ggggactttg gatcatgtca agcaatcctc tctgaaagtt aacataatga accacagtca 120781 ttatggcttt tgatttatta ctgtttatga aatataaaca caaaatgtag aactgaccct 120841 gttgtggttg ctagacctac tgtaagcact taactactga actagtcaca gtaagaaaat 120901 ggtaagtcat atgctttatg ttttttgaaa taagttataa aacaaataca caaaaaggca 120961 aaagctcatg caatagaata tatacataaa gacagcaaaa cgcataagac aagctttgga 121021 tgctctcaat gaaatgtgtg agtgttaatt agatattagg tgctacagag aataaatatc 121081 tacactctta aaaattactg aggaccccaa agagctcttg tttatgtagt atccacctac 121141 acttgcaaca ttagaaatta agactgagat ggccaggcac agtggctcac acctgtaatt 121201 ccagcacttt gggaggctga agcaggtgga tcacaaggtc aggagttcga gaccagcctg 121261 accaacacgg tgaaacccct tctctactaa aaatttaaaa attagctggg agtggtggcg 121321 catgcctgta atcccagcta ctcaggaggc tgaggtagga aaatcgcttg aacccaggag 121381 gtggaggttg cagtgagcca agatcgtgcc attgcactcc agcctgggca acagagggag 121441 actccatctc aaaaaaaaaa aaaaattagg cctggcgcag tggctcaagc ctgtaatccc 121501 agcactttgg gaggcagagg cgggtggatc acaaggtcaa gagatcgaga ccatcctggc 121561 caacatggtg aaacccgtga ctcactaaaa atacaaaaat taactgggcg tggtgggcga 121621 gcacctgtag tcccagctac tcgggaggct gaggcaggag aatcgcctga acccaggagg 121681 tggaagttgc agtgagccga gatcacgcca ctgcactcca gcctggcaag agagtgagac 121741 tccgtctcaa aaaaacaaac aaacaaaaaa attaagacag atatttttca aatatttatt 121801 ttttatttct tttaaaataa cactaacatt gcttttgcac catcagtgca aatattaaca 121861 cagtgaaaag tgttaaatga catcttagta ttgttatgaa aacagtctga caggcaggca 121921 taatggctca tgtctgtaat cccagcactt tgagagacca aggcaggcga attgcttgag 121981 cccaggagtt caagaccagc ctgggaaaca cggtgaaacc tcttctttac aaaaaaatac 122041 aaaagttaac caggtgggtg gcgagcacct atagtcccag ctacttggga ggctgaggtg 122101 ggagtatcac tcttgagcct gggagtcaag gctgcagtgt gctgacattg caccactgca 122161 ctccagcctg ggtgacagag tgagaccctg tctcaaaaaa aaggaaaaaa aaccatctga 122221 tcttgggaaa cccaggagtc cacagatcca ctttgggaac cactaaccta ggagttgggt 122281 ggcaaaaatg tggacggaca ggttgcagcc agtctgcccc taagaccaga gagatacttt 122341 cttttggaag tatttatgcc ttaattttca ggaaagttac aagaagaaga aagaaaagta 122401 aactacaatg tagccttgag ggagcaaaat atctgcagaa ctgaagaaag gaaacaagcg 122461 tgtagatgtc tgaccacctt ctgggatcca tggtcaggat tctagaacgg ttaggcttct 122521 gtgcacccac cccctttcag ggtgaaggca ggcaccaggc aagaatgtca ccttacatgc 122581 cccgatttca catcttggct atcatgccga tggcttacta attaattaaa cagagggctt 122641 gatatttatg tacgagaatg agaacgtgca ctgactcatg attttaacaa atcaaagagg 122701 atgactttgg gatcccatca tcacgctatg cagggagaag gattcctttc attagtgtct 122761 ctagctttca actcttgatt attcaacatt agtcacttga cagaatcggt aaaagaacag 122821 aggagtgaaa acacaaaatg tgttcatgag gcctggccgg tgcacacaga gcacacagca 122881 cgcctcttgc ctggtggtcc tcttcagagt gatgcttata ccacatcaca catcgtgtgc 122941 tataaggtgt gcagactcaa attatctccc ttgagatgga gtcttaaata tcaagttcag 123001 gctgaaatgt ctcttttcca gcaataagta cggaagtatc ttaaaagcat gtttctgttt 123061 aaatttgaca ggatgggtga agtgaaagtg gagaagaggc aagcattaga aacggcaaca 123121 aaggcctccc agctgcaccg ccaagaccct tcagtcgaga gctttaattt aactgtgctc 123181 catctcattt ggcccaggga tggaggccaa acaaggtaac tgctcccttt cacaggctag 123241 gagatgactg cctcttacaa ttaaacaatc atagcacatt tgactattcc cttttattga 123301 caaaacagcc tgctgtgaaa ctcacaaatc tcctaggcga tcattataga tccagctgat 123361 ttcatttgac tgatgcttgt gcagggtctc cactcaggtc tcaaacaatg agaaagttgg 123421 aaagtatggc attttgggac cattttcgct ttttcttgga aaagacgttt ttcagaacaa 123481 attcaactaa cccattcagg caacaaacac atgctgtcca taaccaggtt tatggaacat 123541 aatcaagaga agggggaagg aaaagcccac agccttgaaa ctgaatggaa gcaaatggaa 123601 atggagtttg gatttccgta tgatgggttt gcaaacagct gttggaggaa ggcacttctg 123661 ccatcatttg aagaagccct gttttttcca cagcatcttc atctgcaaaa gcaacagagc 123721 cctagtttct aaaccaaggg aggtttgttt ccttattttt ttttttaaga agatgctttg 123781 aggcagtgat ctcatctaac taaattaagt cgtgacttgc tgctgtgctc agcttccagc 123841 agccatcaga cacagcacat tccaaccagc agcaagaaag gatttaaagg actaatttcc 123901 ccatagataa agagcagtga cacgtgctct attttatgcc ctcttccttc caaacagggc 123961 caggataaga tacacagtca cccctccttc ccaattccaa cagagcagca agataaatgt 124021 gtccttctcg gctgctctga aataaacgga tttgaggaat tacagaccaa tcccctgggg 124081 agaaaaagct aaagacacag cattgggccc atagcaaggg cttattaaat atggattaaa 124141 ttgctgatgt ctctttttct ttcagaggtt gggtataggt acaggaggaa aatattaata 124201 tacattgcgt taaatataat agtggttgaa atttaggtgc ctccctttaa aagctaagaa 124261 atgttatctg tctctctcca cccacccaac tcccaactcc agcaagtcta taagccttct 124321 tggcatgaac tcttatatat caacttcatt cctagctcca ttttttcttc tacagactta 124381 ttttctcttt gccttttgtt tcttattttg ttctgtttta ctgtgttgca tctttccaag 124441 tggttttcag tccactgtga aacaagacag tatacacaaa gacatactgt ggttattttt 124501 tgccaaagca aattttgggg gcatgatgtg gcatcattta atgagtgttc tcatagagac 124561 tatggaacct aaatgtgcag tcaatatctg aacttggtgt aagtcctaga aaatctagga 124621 tgcacaaaca tactggaagc cactaaacct tggactggtc ctatacagac cctggcactg 124681 tcagatcaca ttcccagttt ggttcagata ctgggatccc caaacccaac aatgggcaat 124741 aacttctaag aatgaaacta cgttaaatat atatagatat agatagatag atagatagat 124801 agatagatag atagatagat agatagagat atagagatgt acacgtgcat tacatcgagg 124861 taaagtgtaa gcaatgaatg tacatggcca ttcattccac ggaagtaacc ccaatcaaag 124921 agaggtttag agatgttcac tgacggaaat ggggacagca tctaagttga gtcagaatga 124981 ataaggtacc tatttaggtg gcttattaaa aaaacactaa acccaatcag ctgtattgtg 125041 actggcccac aggagagggc accctgacca gcaggcaggt gtgtgcctgt ctgcttggcg 125101 ggtacagtat gatactatga gaaatgtatg agcttggctg acaccagcca ttaaaattca 125161 tttggccttc tatgctcaca gtctaggtcc tccggggatc cttccaagat taacgggatt 125221 ttctaagtat acttccctga atgggaaaaa gaaaatgtga agattggcca cccagagtac 125281 cctagtcaac ttggcagttg catttccccc ttcaagttaa gttggcattt cacatatcag 125341 atttttgtga gttttgttat tctgggtcag tctctgtggg aatgatcaaa tgtggggaaa 125401 atggccccac aaaaaaatcc atttcagact tctaaaaaga actaaggttt caatccagta 125461 acacccccgc tccaagttct acctcctgag aatactctcc agctaagtaa actagtctct 125521 cagaaagaac aaggaagcca ggtccctttt agagaaagaa tctttcattc agtagaattt 125581 ttcccatcca gttcatacca gatgcataca aaagacagga agcagaatct acagcttgaa 125641 aaaaaaaaaa agtaacatga gtaaggttaa ctaaaacaaa aagtaaaaat atggtagaaa 125701 aactactata agattattat tattatttta aatgattatt gaaaactaaa gtagaaagcc 125761 aaggtaaaca aacttttaaa caaactgaaa ccaaacttta aaatataatg atgatttttt 125821 caaaagaagg aaaatatata agtaaaggag gaactattca agatactaag aaaagtttga 125881 atataaccaa ccaagctcat ctactagttc caatgaagca gcagcctatg taagccaatt 125941 tcctggccac tgaatatctt tgtggatcat gttttccctc ctggaatttc taagggtgag 126001 ccagtatact cagtgccagt tccacactca tcctagctga ggcctttagc aacctgaaga 126061 tgttgctccc cagacctacc gtgagtgcct ggatctcctc ttcagcttct gctgtggtct 126121 cttccagctc tgtcttcgcc tggcgaagct ggctctccag ggccgcccgt gcagcctgca 126181 gggctgtcaa ctgtgactcg cgctcctgca gggagagctg tagctctgtg agctggcgct 126241 tgagctccag tttctgctcc tcgtgctcta gactgacctg agacagagag agagagagaa 126301 agagagataa acccttgctt ggtctgaaca aaagagaaag tgaagacatg acctgctact 126361 caacctgcag ctgaaacctg gtcttggaac caccttctcg gcactcgata gaggagagga 126421 aaatgcaaca aaatgggcct gagttccatg catctctgaa ttcaacagga gtaagaacag 126481 gtaagggcta tatctcctaa ccacccctta cttacatata ttcaaatgcc tgggaagact 126541 ttaaaaggaa aagccacaca gcaagaatca ctaatgaaaa tgatgatgat tactcttatt 126601 agaaaaacat ttaaagctaa gcagatataa ataacaacat attagtaaca aggacatctt 126661 ttcaattcga ttgtaaatga aacaacagac tgaattctat ttattttaaa cacgctttct 126721 ctgatgtgtc ctcattaggt aatctgcttt aatgtcacat tagaagttaa tcattcttcc 126781 ctccatctgc tcatccatcc acctatctat tcaatctttc atcattcatt taacaaaaat 126841 ttactcagca atttctgtgg aaaatggaat atggagatga aagacagtct ctgccttcaa 126901 gtggggagaa aaatgaaaac caaccatgac catatatcat ggtcaagtgc aacggtcagg 126961 acatgtaggg atgttgcagg aataccgcag gtgggtagag gagctaactc tggatctgtt 127021 aaattctgtt gaagaatatg caggagtcaa acaagcaggt aagaggtgaa agaaaattcc 127081 aggcagggga agcaagacat ccaaaggtat gggggtgagg gagaccacga cacatctatt 127141 gacctgacaa acatggacag tgtggtccta aagatcaatg tcaagcaact cagagatgct 127201 tgctgccctc agagagcctg ctgtctaagg ggactgccaa gaagaacaca ctgcgatgag 127261 ctatgccctg aaagccatgg ggataaaaag gagacctcca acccagcttc agagggtcgg 127321 agaaggctcc tccctagaaa aggtggcacc cacatcccga agccataggg gtgaagagaa 127381 gaaacacttg caaaagatgt aacctgcaaa accaacagga tttagtggtg aatttgatgt 127441 ggggcagaga acaaaggaca taagggagtc aaagatgact cccagccggg tgcaatggct 127501 cacacctgta atcccagcac tttgggaggc cgaggtgggc agatcacgag gtcaggagtt 127561 cgagaccagc ctggccagca tggtgaaacc ccatctctac taaaaataac aaaaaattag 127621 ctgggtgtgg tggcacacac ctgtagtacc agctactcag gaggctgagg caggagaatc 127681 acttgaacct ggcaggcaga ggttgcagtg ggccaagatt gcgccattgt actccagcct 127741 gggtgacaga gcgagactct gcctcaaaaa aaaaaaaaga aaagaaaaaa agatgactcc 127801 cacatttctg gacgggtgaa ttacacaaat agaccattgc tatgacagaa taggagagaa 127861 agggccagtc tgaggcaaaa gataagaaat tcagtcttaa catgttgaac ttgagccctg 127921 cgggagacat ccaagtggag atgtctaacg tctactaata agtagtctac tagtaagaca 127981 atgggtcaga aactcaaaag aaaggtctgg attggcaacc tggatccagg aggccttaac 128041 aaatcagtga cagctgaagc catggctatt atctctctct agtgatgatg gtgatgacga 128101 cgatgtcaca gcagcaaata cggtcctcac aacacactgg gccctgaatt aactcatttc 128161 attttcagaa caactcacta agggaggtac tactattacc tccattttac agataagcaa 128221 actgtggcac tgagagatta ggtaactgag caaatgtcac acagctataa aaatcttttt 128281 aaaaggcagg actggcattc aaacccagaa agtctggctc aagagtccat gtaacatgtt 128341 gctgtattcc tgcaacatcc ctacgtgtcc tgaccattgc acttgaccat gtacagggta 128401 aacactgtac ccttctgtct tactaaaaca ccatcagtca cagcagagtg gacctctact 128461 agcaagcctg aaaatgcttc cgatgccatt tggcaaaacc aagagaggca ggtgacagtg 128521 actcaagggg aggagcggga gctggatgct atacaatgaa ggaagaaccc gggagcctgc 128581 atctttgtga ctccacacac ctggctagag aagagctaac gctgcacaca caaagttggg 128641 ggaaggccca ggtcctttaa ggcggatcaa actccaggtg gaggcctctg ccctcaatct 128701 cagtagcagc atgtggccaa aagaagaggg actgatctgg ttatatttag agctaaactg 128761 caaaactccc agggagaata tagagcctga acctctcgtg aaacgtttga agtttctctt 128821 accatccttg aggggctggc tctactgatt caagtgaaaa acaacagctt ccctgggaca 128881 ctttaatagc ccagatgagc tacagcagca gggcaggctc ctaagaggaa catttgtgca 128941 aagggaaggc aagagacagc cagtcaaggc ccccaagcta gagaagtcca tctggattgt 129001 ttagactcaa acttaagggc gagtctcccg tgatcttgct gttaaactta cagaagagtg 129061 aaagtatagc ctggatgata aaagttaaac catcgcctaa ggcttaaggc tcatgcctgt 129121 aatcctagca ctttggaggc cacggaggga ggactgctta aagccagacg tcctccttct 129181 gagaaatcct gacctaagaa ataataaaag aaaaataaat aaataaataa agccaggagc 129241 tcaagaccag cctgggcaac atagcaagac cccattgcta gaaaaaatga aaataaaacc 129301 agacactcca ggtgaaccta atggaggagt tcacaccttg acaaccccta aaaaacatga 129361 gataacacat tgttactgtc ccttggagaa ctgacttcat tttaatactc caagacaaat 129421 aaagacatgc ataaacatga gtactaccca aggcttagac ctcagaccgc tgtctgctta 129481 acaggacctt ccttaagaaa gatcatctgc tttttgaaga attccatcac aaatatttga 129541 gcaactattt tacatgagac attgtatctt ccctcccagc ctgggtgttc agcttccaac 129601 aaccactatc ctgaactcca gttgtgtctc tgctggtgcc agcgaaccct gctgctaggc 129661 taaaagccct cgtgtcagga acaggtctgt cttgtcgcca ctgtagaccc aggctacaaa 129721 ttggcacaca gcagacatct gacaaatata cagtgactac ataaatgtaa ccttatggct 129781 tcaatcaacc tcaacatgtc tacaacccaa tttgttgtat tttcctcccc aaaacctggc 129841 cactcttccc agcctcctta tgtttttgaa gagagctacc tttcatcaat cacacctaaa 129901 tttggttttg tcttttaatc tcccaccgtc attatgagac agacctctgc caccagcact 129961 gggccaccat ggctaagctc tgactaggtc acggggctgt cattacatct tgataaagga 130021 ggaattgctc tgctcccatt tgacagatgc acaaattgac acgctcccca caggacccta 130081 taaaggagat acattatcac tgacatttta cagatgaaga aactgaagcc cagcagagag 130141 atgaggcagc ttgcaaaact agtaagaggt cataaaacca gtaaggagtg gaggtggaat 130201 tctaacccag atccccatgg cccagtctgt taccagaagg agggcctggg gtctaagttg 130261 tctattttga acaaagaatt ggacaaaacg cacaaagtaa caaaggaacg aaacacagga 130321 acgaagcagc gaaagcaggg atctattaaa gcaagaacgc actccacagg gtgggagtgg 130381 ccccagcaag tggctcaagg gcccagttac aaagttttct gggttttaag tactcctttt 130441 ggggttccta ttggttactc cttatctgaa taaaggattc ggtctgtggc taattacagg 130501 ctgaggcgaa ctggtgccct atgcagatga agggatagtc cctccttggc ccgcggccca 130561 tccaaggtgc tcttcctttc catctgagca atggtggaag gtggggggag ggggttgcag 130621 ggagagtagt cttccatctt ttggagcatt gcagggagag tagcctgttc catctttcac 130681 tactggggcg tgggaagatg aggattttcc ttttggttca gctttaggaa gtttgtgtta 130741 attggcctca ggttccctgc ctccagcccc acgtgttttc ctttcgatcc agctctggga 130801 agtcagcagg aatgggcttt aggttccctg cccacagaac ctggtgtttt ctcttttaga 130861 aagctagcac aaattggcca ggcatggtgg ctcacgcctg taaccccagc actttgggaa 130921 gctgaggcgg gtggatcact tgaggtcaga agttcaagac cagtctggtc aacatggcaa 130981 aaccctgtct ctacaaaaat tggctgggtg tggtgccagg cgcctgtagt cccagctact 131041 cgagaagctg aggcacgaga atcacttgaa cccaggaggc ggaggctgca gtgaaccaag 131101 atcgcaccac tgcactccag cctggacaac agagcaagac tctcttaaaa aaaaaaaaaa 131161 aagaggttag cacaagggcc tcagattccc tgccataaac cttggtgttt ttccttgatt 131221 cagcatgaat tggccttagg ttccttgcct ccagaccctt ttctcctgcc tcaagtctgc 131281 ccccttaacc accaagcaca atgtctcttg tatagatatt gatttgtgct cgaaagctga 131341 cttgttcaga gcccgaatcg cccagcaaat cctcccagga gatgactcct cgctctcacc 131401 tcccgcaatc ttgtctccag ttccagcagc cgattcttgt cactgtggtc ttggtggctg 131461 atcttctcca gctgctcctc cagttttcgg ttctgggcct ccaacttccc agcctgtgtc 131521 tccaggtaaa atttctgttg cctgagttca gaaatcatct cttcttgggc cttcctggtg 131581 ggggtggtgg gaggatccaa acaaaatcac agtctcatta gggttgagcc cgtttccacg 131641 ggaggtagac ggacacggag ttagacatgt ctcctcacta agtggactag ggccattcct 131701 gggttagctg tctcctgatc tggcctcata agcatgatcc cggcatgacg aacgtgggga 131761 ccatttactc atcaccactc agtgtcagtc cctcacatat gggattaagt aaggtccagg 131821 aggatgcaag agatactaac atttcactcc tttgacaggc ctgtttattt cttctgtgag 131881 tctgtgtagg gacagaaggg ggcctcctta gattaatgta caagggcacc ccagtaactc 131941 atgaaggcag gccctggaga aaaaggacta aggagaaaga atggtattcc ttctagttct 132001 ctgcagatct ccctcacctc tccagcaaac tgcttagaag tatcaaaacc tagaacgact 132061 caaagaccca aaacacagcc tactaagctg gtttagatta agatgggact gttttgggag 132121 atctctgaag agagaaacct aagggggaaa aaaatcctgg agattctaat tctctggggg 132181 aaaagattca gctcaaagtg ctcctggaac acagcccttg ctgaacccca aacttccttc 132241 catagggtct aacctagtga caactaaatg gatggcccaa ctgggcacac acaatttcac 132301 ctcttactaa ttacaggatg caagtctagg gaacaagaac tgcagctttc cacaagcaaa 132361 ttataaatag cctcattaga gagagcaaaa cactctgtaa tgtgatggta attgaaggca 132421 tttaagaaaa cccgactcaa aagtggcaga gtgagaccac ttaagacagg gcaagatgac 132481 gtaagtatct gagggctggg ctgagctaca gaagtcattc aatccactcc agctccatca 132541 aacagaagaa gggagtttat gccacaggag aggagaccaa agtgtaataa tgtagggcag 132601 ttagaatttg aatacttaca tgttcctttg ggtaaaaaga ctgctatttg ctgcaagttt 132661 attggcttca gacagttcca caatcctctg ttccagggat ctgatcttgg aatccatagc 132721 attgatcatc tgaaacacag ggcacctatg aaacttcaca caccagaaca ggagtaacag 132781 gggcagtgcg ggccagggta aaagtttcat ctgaaatgac ggcaatttgg ctagaggtaa 132841 caaggcactc aactagtctg tactctgagg atcacagatg gtaccactca aatggatggt 132901 acttggttgt tttttcagtg atgaaatctc cccacactgg ctaagtggct tgcatttaaa 132961 acagacataa ggacctattc aagaacctaa catcctggga gtaatcaggc cttgttaaac 133021 cccatctcat ctacatgtga cctgggcctg caagtctctt aactcaaatc tgcacattcg 133081 catttatcac cacctcctcg agggttgtca ttttctgagt gattccttct cacccagcca 133141 ccattgctct agactccaca gcaggggtct ccaaccccag ggccacagag cagtacccca 133201 ttaccggtct gccacctgtt atgaaccagg ccgcacagca ggaggtgagc ggccagcgag 133261 cgagcattac tgcctgagct ctacctcctg tgggatcagt ggcattagat tctcctcaga 133321 gtgtgaaccc aatcatgacc tgcgcaagcg agggatcgag gttgcgtgct cctaatgaga 133381 atcgaatgcc gccagccggg cgcggcggct tagcctgtaa tcccaacact ttgggaggcc 133441 aaggcaggtg ggtcacctga ggtcaggaat tccagaccag cctggccaac atggagaaat 133501 gctgtcttta ctaaaaatac aaaaattagc caggtgcagt ggcacatgac tataatccca 133561 gctactcggg aggctgaggc aggagagtca cttgaactcg ggaggtagag gttgcagtga 133621 gccgatattg cgccactgca ctccagcctg tgtgacagag caagactccg tctcaaaaac 133681 aaaacaaaac aacaaaaaaa tctaatgcct taatgcctga tagtctgagg tggtacggtt 133741 ttatctcaaa accatccccc cacccctgtg gaaaaattgt cttccacaaa accggtccct 133801 ggtgctaaaa aggttgggga cagcttctct acagcaaagt ttccccaact taaaaaatca 133861 taagaattgg ggtgtggggg tactaactaa aaatttaatt tcccaggatc tcccctcaga 133921 gatcctgatg tggggtcggg cgcggtggct cgtacctgta atcccagcac tttgggaggc 133981 cgaggtgggc agatcacttg aggtcaagag tttgagactg gcctggccaa catagtgaaa 134041 ccctgtctct actaaaaata caaaaattag ccgggcacgg tggcacatgc ctttaattcc 134101 agctactcgg gaggctgagg caggaaaatt acttgaacct gggaggcgga ggttgcagtg 134161 agctgaaatc atgccactaa actccagcct gggtgacaga gtgagattcc atctcaaaaa 134221 aaaaaaaaaa aaagagagag attctgatat ggtaagtctt ggtggagtgc agataaataa 134281 gcaccacagg taaaccttag gggcaacaag tttctgaaac aactgcctat gtggattaga 134341 aaagagacaa acaggcaagt aaaaagaaga gaagtgtagg gccaggagtg gtggcctgta 134401 atcccagcac tttgagagac caagatggga ggattgcttg aggccagaag tttgagacca 134461 gcctgggcaa catagcaaga tctcgtctct acagaaaact taaaaattag gtaagcatgg 134521 gggcatgcac ctgtagtctc agttacttgg gaggctgagg tggaaggatc acttgagccc 134581 aggagttgga ggctgcagtg agccatgatt gcaccaccac attgcagcct gggcttcaga 134641 gcaaaacccc acctcaaaaa aaaaaaaaag agcaggaaga ccaagagcca ctggacattt 134701 tgtcaggggc atttagaagg acagtatttt agcagacagc agaacagcat gagttcttga 134761 aaaacacaaa actggccatc cagagccacc acctcagttg cttttaggaa aggaaacagt 134821 ttcagagaaa gcctcctatt accaacaact ccacaggatt caacagaagg aatttcacca 134881 ggtgaaatag agtcttatca ggggtgattc ttggcatatt tcaaaaaccc tcctttgaac 134941 actggcttgg caactccttc tacctaccgc cttctgttcg ctgagaattt tgcccttctc 135001 atgggcctcc tcctcgtgtc tctgcatcat gttctccagt gtctccttgt cagccaggtc 135061 tttctttatc tgattgtcca acacctacaa aaacaaaagc agcagcaaat gttctcagga 135121 gccaagctga ggagggaaga ctaaacgggg acacttgaga aaatctagga aagaggagaa 135181 aagcagggga gaggcccgag gtctccaaat tctggggcaa agagaaggca aaaactggga 135241 gtccgcttat caatcaggat aagattgaaa agtcaaatat taccatccaa tggcaagtaa 135301 caggttgact tgaaccaaga acttatttta aaatcagcag cccaaagacc agagagatat 135361 attcaaaatg aggcagactc cagaatgaga agggaggagg aaacaagatt gctctagatc 135421 cagacctaga gaataaattt cttataatct gcagatatta cctggggaca actttttctt 135481 cgccagtgca cccgtagttt tcagaatatt cctcactcct aatttataca gattaattct 135541 agatactcct gggctgtgtg caagagggag agaagccgaa gacaggggaa gtctatgtga 135601 ccaaggatga tgctcagaaa caaccaggga caagaggctt ttggccacca ataacgtagg 135661 tggatggtca acagattaaa aagacaacat aaaagaaaga aaaaaatgcc tctttccctg 135721 gttctttcct ctgatccaac tagggggaga aaaaaaagtc actgggaaaa gaaacctttc 135781 ccaggggaag agagatcatg tgttctcagt gctcagaatg ccactgtaag cacctactgt 135841 tcccaagagg aaacataaaa cagtaatggt atcaccaatc gccaacttga tctaatgttc 135901 tttggggcac acacagccct acgagttgcc aaaacaacaa gcaaaaagga aggaaaaaag 135961 aagacgacat cagaacatgc ccaagccttt gaccacaacc ctacgttcct caccacagag 136021 gctccattcg acttttccaa agaacaccaa caacatcaag gaggacaaca ggatcctggg 136081 aataaagctg aaagagctgg ggtcacagag agggctaaaa tctgcctttc caacaatgtc 136141 tattgtgaaa ggactgtcac agctcatggg gacgacgtcc ctaaaaagag ccagtcagcg 136201 gggagggagt ccacaggaga aagaaaggaa agaacagatt tcaaagaggt ctttaaacta 136261 gaactgactg atgaaaaaaa atagaatgga agttggaaga gaagaccaaa aggtgggaac 136321 agggtcatga gagaaaggaa gaagctcaaa aggatgagaa atggcccact ctcttggatg 136381 tctgagcatg attctaaaat tatgagcaag aacagggtag aatcgagggt atgatc // LOCUS AC002984 40946 bp DNA PRI 22-OCT-1997 DEFINITION Human DNA from chromosome 19-specific cosmid R33853, genomic sequence, complete sequence. ACCESSION AC002984 NID g2443872 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 40946) AUTHORS Lamerdin,J.E., McCready,P.M., Adamson,A.W., Burkhart-Schultz,K., Garcia,E., Kyle,A., Ramirez,M., Stilwagen,S., Garnes,J., Danganan,L., Bruce,R., Quan,G., Montgomery,M., Ow,D., Kobayashi,A., Olsen,A.O. and Carrano,A.V. TITLE Sequence analysis of a 1Mb region in 19q13.1 JOURNAL Unpublished REFERENCE 2 (bases 1 to 40946) AUTHORS Lamerdin,J.E. TITLE Direct Submission JOURNAL Submitted (29-SEP-1997) Human Genome Center, Lawrence Livermore National Laboratory, 7000 East Ave., Livermore, CA 94551, USA REFERENCE 3 (bases 1 to 40946) AUTHORS Lamerdin,J.E. TITLE Direct Submission JOURNAL Submitted (30-SEP-1997) Human Genome Center, Lawrence Livermore National Laboratory, 7000 East Ave., Livermore, CA 94551, USA REFERENCE 4 (bases 1 to 40946) AUTHORS Lamerdin,J.E. TITLE Direct Submission JOURNAL Submitted (22-OCT-1997) Human Genome Center, Lawrence Livermore National Laboratory, 7000 East Ave., Livermore, CA 94551, USA COMMENT Map and sequence oriented from centromere to telomere. Cosmid R33853 overlaps cosmid R24590 to the left, and is right end of current sequencing tiling path. FEATURES Location/Qualifiers source 1..40946 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="R33853" /chromosome="19" /map="19q13.1 from D19S208 to COX7A1" /cell_line="5HL2-B" /clone_lib="LL19NC03 R chromosome 19-specific cosmid library" /note="cosmid library constructed at LLNL from flow-sorted chromosomes from hybrid 5HL2-B, which carries a single chromosome 19 as its only human chromosome" repeat_region complement(670..919) /rpt_family="Alu" repeat_region 1369..1671 /rpt_family="Alu" repeat_region 1800..2422 /rpt_family="Alu" repeat_region complement(3283..3587) /rpt_family="Alu" repeat_region complement(3726..3997) /rpt_family="Alu" repeat_region complement(4013..4298) /rpt_family="Alu" repeat_region 4581..4869 /rpt_family="Alu" repeat_region 4956..5063 /rpt_family="Alu" misc_feature 5611..5896 /note="BLASTN similarity (1..286); match: 0.98, score: 5.0e-109; database searched: CpG; bases 83 to 368 (SL to QR)" misc_feature complement(6585..6706) /note="BLASTN similarity (22..143); match: 1, score: 1.8e-43; database searched: CpG; bases 114 to 256 (SL to QR)" gene 6714..15541 /note="calpain, small subunit" /gene="CANPS" CDS join(6714..6922,8002..8035,8354..8443,8611..8668, 8766..8830,11508..11576,11678..11756,11898..12014, 15280..15338,15515..15541) /gene="CANPS" /EC_number="3.4.22.17" /note="CALCIUM-DEPENDENT PROTEASE, SMALL SUBUNIT; Human EST matches: T12356, T19758, T11809, T12355, AA555256, AA553834, T54940, AA543034, T19759, T54774, and others ~Mouse EST matches: Z36352" /codon_start=1 /product="CANS_Human" /db_xref="PID:g2443873" /translation="MFLVNSFLKGGGGGGGGGGGLGGGLGNVLGGLISGAGGGGGGGG GGGGGGGGGGGGTAMRILGGVISAISEAAAQYNPEPPPPRTHYSNIEANESEEVRQFR RLFAQLAGDDMEVSATELMNILNKVVTRHPDLKTDGFGIDTCRSMVAVMDSDTTGKLG FEEFKYLWNNIKRWQAIYKQFDTDRSGTICSSELPGAFEAAGFHLNEHLYNMIIRRYS DESGNMDFDNFISCLVRLDAMFRAFKSLDKDGTGQIQVNIQEWLQLTMYS" repeat_region 9157..10053 /rpt_family="Alu" repeat_region complement(10763..10812) /rpt_family="Alu" repeat_region complement(10907..11196) /rpt_family="Alu" repeat_region complement(12556..12835) /rpt_family="Alu" repeat_region 13196..13800 /rpt_family="Alu" repeat_region 13994..14525 /rpt_family="Alu" repeat_region complement(14930..15057) /rpt_family="Alu" gene complement(16684..18109) /note="cytochrome c oxidase subunit VIIa [Homo sapiens]" /gene="COX7A1" CDS complement(join(16684..16736,17164..17248,17371..17457, 18095..18109)) /gene="COX7A1" /EC_number="1.9.3.1" /note="CYTOCHROME C OXIDASE POLYPEPTIDE VIIA-HEART PRECURSOR; Human EST matches: AA541775, T28203" /codon_start=1 /product="COXK_HUMAN" /db_xref="PID:g2443874" /translation="MQALRVSQALIRSFSSTARNRFQNRVREKQKLFQEDNDIPLYLK GGIVDNILYRVTMTLCLGGTVYSLYSLGWASFPRN" repeat_region complement(18814..19077) /rpt_family="Alu" repeat_region complement(19145..19216) /rpt_family="Alu" repeat_region complement(19291..19553) /rpt_family="Alu" repeat_region 19673..19970 /rpt_family="Alu" repeat_region 20228..20506 /rpt_family="Alu" repeat_region complement(20552..21130) /rpt_family="Alu" repeat_region complement(21177..21609) /rpt_family="LTR7" misc_feature 22762..23048 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: marginal, score: 45.000" misc_feature complement(23481..23796) /note="BLASTN similarity (1..316); match: 0.92, score: 1.7e-106; database searched: CpG; bases 85 to 400 (SL to QR)" misc_feature 24291..24558 /note="BLASTN similarity (1..268); match: 0.93, score: 8.7e-105; database searched: CpG; bases 82 to 400 (SL to QR)" misc_feature 24347..24464 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: good, score: 65.000" misc_feature 24557..24601 /note="BLASTN similarity (268..312); match: 0.97, score: 8.7e-105; database searched: CpG; bases 82 to 400 (SL to QR)" repeat_region complement(26294..26725) /rpt_family="LTR7" misc_feature complement(26653..26859) /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: marginal, score: 45.000" repeat_region complement(27215..27509) /rpt_family="Alu" misc_feature complement(27791..27898) /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: marginal, score: 48.000" repeat_region 28008..28615 /rpt_family="Alu" repeat_region 28634..29252 /rpt_family="Alu" repeat_region complement(29516..29665) /rpt_family="Alu" repeat_region complement(29804..30086) /rpt_family="Alu" repeat_region 30230..30866 /rpt_family="Alu" repeat_region 31090..31709 /rpt_family="Alu" repeat_region complement(31879..32028) /rpt_family="Alu" repeat_region complement(32102..32404) /rpt_family="Alu" repeat_region 32722..33296 /rpt_family="Alu" repeat_region complement(33860..34726) /rpt_family="Alu" misc_feature complement(35021..35110) /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: good, score: 53.000" repeat_region 35541..35810 /rpt_family="Alu" repeat_region 35950..36008 /rpt_family="Alu" repeat_region complement(36841..37158) /rpt_family="Alu" repeat_region 37318..37925 /rpt_family="Alu" repeat_region complement(38490..38775) /rpt_family="Alu" repeat_region complement(38803..39092) /rpt_family="Alu" repeat_region complement(39623..40222) /rpt_family="Alu" repeat_region complement(40262..40550) /rpt_family="Alu" misc_feature complement(40789..40946) /note="BLASTN similarity to D20326 (1..214); match: 0.99, score: 6.1e-60; database searched: est; Human HL60 3'directed MboI cDNA, HUMGS01300, clone pm1820." BASE COUNT 10107 a 9730 c 11109 g 10000 t ORIGIN 1 gatcaatatg taaagtacac agggctgggc gtggtggctc atgcctgtaa tctaagcact 61 ttgggaggct gaggtgggtg gatcacctga ggtcaggagt tcgagaccag cctggccaac 121 atgatgaaac cttgtctcta ctaaaaatac aaacaaatta gccgggtgtg gtggtggaca 181 cctgtcatcc cagctacttg ggaggctgag gcaggagaat cgcttgaacc caggaggtgg 241 aggttgtagt gagctaagac cgtgccattg ccctccagcc tgagcaacaa aaatgaaact 301 ccatctcaat aataataata ataataataa aacaaaaaaa aaagtacaca gtacgtgagc 361 gggtgataaa agacaggacc ttcccgtgat cactagatgt gtagacatga aagatctgct 421 tcccagacct ggggctggag aaggtcttgg taagggagtg gtggcctcat gtcttcatgt 481 ctgccctgag agaggctgtt tggatgttgc tgtgcttgca gggcctccat gctgagtttt 541 gtctctgttc agctgggaac atgcaggtga ccttgccagg caggcatctt ctcactttct 601 ccagggcgaa gagatgaggg cagggaggtg acaggacaga tcactagggg tggctgtcat 661 gagaacttta ggctggagtg cagtggtgca atcaaggctc actgcagcct ctacctcctg 721 ggctcaagtg atcctcccac ctcagcctcc cgagtagctg ggactacagg cacatgccac 781 catgcctggc tagtttttgg tattttttgt agagatgggg tcttgccatg ttgcccaggc 841 tggtctcaaa ctcctaggtt caagcgatcc tcctgcctca gtctcccaaa gtgctgggat 901 tccaggcatg agccaccgca cctggccctg ttgtgagaac ttctgactct ttcctggagt 961 ggggtgcagc cagggcaggg ctctgagcct acagggccct gacctgactc aggtggtctc 1021 aggcttcctc tggctccaca agaggagcag acagcaggga gcagacagaa gggagcaggc 1081 tgtggggtgt ccaggtggga aaggatggag gctgtggcag ggtggggaca gtggagggga 1141 agagtttagt ttctgtaggt ggagccaata atatctgctg acaggttgga gtgggtgtga 1201 aggagagagg tgaaggacca ccccggagtg tttggcctca gtcactggaa ggacagagct 1261 gccactgacc taggtgggga aggtagtgag ggagcaggtt ctggaggaag gtcagaagct 1321 cagtttagga tatgtaaagc ttaaaatgcc tctaagggcc aggtgcattg gctcacacct 1381 gcaatcccag cactttggga ggctgaggcg ggtggatcac gaggtcaagg gattgggacc 1441 atcctggcca acatggtgaa accccgtctc tactaaaaaa tacaaaaatt agctgggcat 1501 ggttgcgcat gcctgtagtc ccagctactc gggaggctga ggcaggagaa ctgcttgaac 1561 ctcagaggtg gaggatgcag tgagccgaga tcgcgccact gcactccagc ctggtgacag 1621 agcaagactc cgtctcaaaa aaagaaaaga aaaagaaaaa ggaaaaaaaa agcctctgag 1681 atcttccatc ctttcccatg gaaatggtga atatgatgca ggatgtagga gcctggagtt 1741 cgcaggaggt gtctgggctg gagatagaaa tttgggggtt agggttaggg ccaggtgcag 1801 tggctcacgc ctgtaatcct agcactttgg gaggctgagg tgggaggatc acttgagctc 1861 agagttcaag accagcctgg gcaacacagt aagacctcat ctctacaaat aattaaaaaa 1921 gaaaaaagat aaaggaaatt tggatgtttg ccgggcgtgg tagcttgtgc ctgtagcagt 1981 agctattcag gaggttgagg caggaggagg attccttgag gccgggagtt acaggctgca 2041 gtgagctaca tcacatcact gcattccagt ctgggtgaca gaacccctgt ctctaagaac 2101 aacacaaaaa actgggcgtt ggccgggtgc tgtggctcat gcctgtaatc ccagcacttt 2161 gggaggccaa ggcgggtgga tcacaaggtc aggagatcga gaacatcctg gttaacacgg 2221 tgaaaccccg tctctactaa aaatacaaaa aattagccag cgtggtggca ggtgcctgta 2281 gtcccagcta ctctggaggc tgaggcagga gaatggcgtg aacccgggag gcggagcttg 2341 cagcgagcag agatcgcgcc actgcactcc agcctgggcg acagagtgag actccatctc 2401 gaaagcaaaa caaaacaaaa aactgggcgt tgatagccta gagacaggat ttaagctgtg 2461 catgtagggg aggcacccca tggcatgagt ggtacagaga ggaggataga gacctaacac 2521 ctggggctgc cgggaataag gggttggggg ccagaactag atgacaatat ggttacgatt 2581 ggcagtgagt gccatgagaa agtacaagtc attttgaaag catataactg gaagcctcac 2641 ctagtctcag gttgggtgga gacagggagg tctttctgga agagatgact cctgaggtga 2701 cacctgcagc ctgagtagct acacaatcag gccagaaaaa aaaaggcagc aggggagttc 2761 ctggccgagg gcatgagggc tggcagcaag tgagcctcgg gagagggcag agagctggag 2821 cagtcagtct ggcagggagg ggtgtggtga gaagggaatc tggattagtc atcaccaaaa 2881 gatctgtgcc aagcccgggt tcagatctgg ttctgccacc ctccgtgacc ctgcctttca 2941 gaatctccat tctctctcct gccctgcaga caaggcttgt gtagcagtta ccttggcctg 3001 gccccagcaa ggcctcagga aacatcagct gcaactagtt ttgtctgttg tgatgttgtt 3061 atttgaccta ttatattaat tatattatta acgcataagg gtagtggtta aggatacagg 3121 ctagggagag tgactgcttg ggtttaacct cagagtcggt ttacttatct gtaaaatggg 3181 gataataaat agcatctatg tcacagggct gttaggagaa caagggaact gatggacgtc 3241 aagcacttgg aggagtggtt gatggttact tgtttttgta tctttttttt ttttttgaga 3301 cagtttcgct cttgttgccc aagctagagt gcaatggcac gatctcgagt cactgcaacc 3361 tctgcctccc gggttcaagc gattctcctg cctcagcctc ccgagtagct gggattacag 3421 gcacacgcca ccacgcctgg ctgatttttt gtatttttag tagaaacagg atttcaccat 3481 gttagtcagg gtggtcccaa actcctgacc tcagtgatct gcccacctcg gcctcccaaa 3541 gtgctgggat tacaggcgtg agccaccaca cctggcctgt ttttgtacta tttaaggaag 3601 actaagagtc taccagctga ggaatacagc tggcctcgag gctgagggaa cagcatgagc 3661 acagacacag agctaatggg ggcatttaca aagcatttat caccaattta tttatttatt 3721 tattggagac ggagtttcac tcttgttgcc caggctggag tgcaatggtg caatcttggc 3781 tcactgaaac ctctgcctac tgggttcaag cgattctcct gtctcagcct cccgagtagc 3841 tgggattaca ggtatgcacc accacatccg gctaattttg tattttttag tagagacagg 3901 gtttctccat gttagtcaga ctggtctcga actcccgacc tcaggtgatc tgcccgcctc 3961 agcatcccaa agtcctgggg ttattggcat gagccactgt gcccagtatt tatttgagac 4021 agagtctcgc tctgtcaccc agattgtagt gcagtggcat gatctcggct cactgcaacc 4081 cccgcctcct gggctctggt gatcccccca cctcagtctc tagagtagct gggactaggc 4141 gcatgccacc acattcagct aatttctgta ttatttgtag agacaaggtc tcactatatt 4201 gcccaggctg gagtcaaact cctgggctca agagatcctc ctgcctcagc ctcccaaaat 4261 gcagggaata caggcatgaa ccacttcacc cagccaccac ttaagttttt tttttaataa 4321 tcagtattgc tgcagatatc atttaatttt gatagggggc ctagtggtat ggagcaaccc 4381 atgttcccat tttacagatg agaaaatcgc tgctcaggaa tatcaagcca ctcttttggg 4441 gttgtgcagt tattacaggg cacagccagg attcaactca agcctagcca gcctgcaggg 4501 taggggataa gagccagcca gcctgcctgg gggccagatt tagctccccc cgatgagtcc 4561 tgaaaaagta ctggactccc gtggctcatg cctgtaatcc tagcactttg gcaggccgag 4621 gcaggtggat cacctaagat caggagtttg agaccagcct ggccaacatg gtgaaacccc 4681 gtctctacta aaaatacaaa aattagctgg gtgtggtggt gcatgcctgt aatcccagct 4741 actcgggagg ctgaggcagg agaatcgctt gaacctggga ggcagagact gcagtgagct 4801 gagatggcgc cactgcaccc cagcctgggc aacagagcaa gacgctgtct caaaacaaaa 4861 aaaaaaaaaa agaaaagaaa agaaaagaaa ccaggctatg taaaaatatt attgtaaaaa 4921 attaaattaa atcaaaaaaa ggccggtagc ccatacctgt aatcccagta cctcacagta 4981 ctttgggagg ctgaggcagg aggacggctt gaggcccgga gttcaagagc agcctgggca 5041 acatagcgag acctccgtct ctattatgta aaaacaataa tattttttaa gtaaataaag 5101 aaaccaggcc gtggccaaag gaatcttcct gtgtctgggg ccttctttag aggctagcat 5161 actgaggaca ctctttgggc cagtccagga aggaatgatt ctggtggccc ctgactgggg 5221 cgggggcgtg gtaaggttcg cctgggaggc ggcgcaggcg ctgcctgcgg tttgactgcc 5281 atccgggctg ctgcaggcac tgggaaaggc aatccagcct caatcctcgc cacagacacg 5341 ggcgactgtc ccccagcgtg tgagtcaaac catactccag gcctccagga ggtgtttgtc 5401 gcgacaacac ctgcagaggg cccgtgcgga gtcccttagt gagcggaccg aaaaccgcca 5461 ccctggaagg atattggcat gccctagggt gaaaattcac actacgacac tcggggggtg 5521 ggtcccctcc gagttctcgt ccgcgggtgc ctccacccag acctgagggg tgtcgggaag 5581 acccccgcct accgagtcag ggcgggatta agacctccgg cgctggaaac gcgtagggcg 5641 gggcccagac ctggatccag ctagccgggc ggtgtggggt gcgcatgcgc aatgtccgct 5701 tcggctctag gaccgcgcgg gcgacagcag ggccgcggtg cagtgtccga cccgagagtt 5761 gcggcctgag tcaccggccc cgccctccgg agccggacgc tgcgggaggc ccgggagcgg 5821 cagtggaacc gactcccaga actccggacg tgtgcggcgg taagcgcccg gcccgtaccc 5881 cctccgcacc ccgcaactcc gactttggcg gcccccagac ccggcgaaac cgtgaagtct 5941 cggcctcagg agccccccag atccatggtc tcagattcga accctcagac ctgttcgatc 6001 tccttttatg ccgtcccggg gacccttctc caccctctgt ctcatgatcc aaaatacagg 6061 cccataggtt tcagagatac cctgtgaatt cttccttggg gtgtcctgaa ccgcgatcgt 6121 agatctgtgc ccccagttcc agcggggaac cccattgtcc gggaacccag agctcacagc 6181 cacgatctta gacccgagcc cacagagcca gaggtgacac tggaatcctg ccgctgggaa 6241 tacctaaatt ctcaagccca catttgggat ttcctaaacc ccgccctggg gccctttagg 6301 ccgtggatgt ttgctccgcg cccatgacct gccaaggacc tctacattca ggctgggctc 6361 tggatctcac attctagcgc ccagacatgt cgggaacgcc ttacgccaac ttggggtccc 6421 caccaagaga acccccacca gatctgcacc ctccccttca cgcgtgcacc cagtccaggc 6481 tccctcaagc cccacgggtg ccttttagac ctgaggaggt tgcaaacctg atcccccata 6541 cctgccccac ccatccgcgg acaacccgcc ctcgcaaact cagaccccca cccggaggct 6601 tcagattcct cccaggtcca gctgccggaa atgcgtgttt gaagggaggg tgtgggctca 6661 ggggcgaagc acccactggt cccctttttt ccccccagca gtgagtcgca gccatgttcc 6721 tggttaactc gttcttgaag ggcggcggcg gcggcggcgg gggaggcggg ggcctgggtg 6781 ggggcctggg aaatgtgctt ggaggcctga tcagcggggc cgggggcggc ggcggcggcg 6841 gcggcggcgg cggcggtggt ggaggcggcg gtggcggtgg aacggccatg cgcatcctag 6901 gcggagtcat cagcgccatc aggtaaggcg gagactatca gaggggcggg gcctgggaat 6961 gggaggagcc tcagtgaggc gtggtctggg aggggcgtgg tctaaaaata gaataggatt 7021 aacctggagg ctaacctggg tacatgaatt aggccgggga ggcctggttt gagagttctg 7081 ctgtaagggg gtggacccca gtgaggcgga tatcagtcat tgggggcggt gcttgatatg 7141 ggagtagtct gattgtgtgg gaccaggaca atgtggtcct gaggggactg atgtggagtt 7201 ttggcgggtg gggcttatgg gtctgggctc agcctgcatt ggctaacctg gagatgaacg 7261 ttaaaggtgc ggagccaatg cgtctgagtg acattttacg gaaggggctt gacttatgag 7321 tgtggtcaga ggttttaggt gggttaatga ggaggtgagc ccaggaggaa gcaagaccta 7381 atggggtagg gtaaatggcc cagaatgggc tgggcttgct gggaactggc tggattaatg 7441 gagaaaaatg ccctggtagg gatagggcag gttggcttaa ttagccacgg ccacttagca 7501 gtgatccctt gaaggggcgg ggaccaatgg ctggggtagg atgttgggaa gacgggcatt 7561 taggaagcgg gccatggagt agggtcttgg gcagatggca cctgaggagg ataaggcctt 7621 gggaagctgg ggcagagcct ctatgaagtg agccctccaa ggggcggggt cttttgattc 7681 tacgtgggat tttttaggtt tagggtggcc aagatgactg aaatctgcca ctgggtaggt 7741 gtgcctggca ggaggggagc ctcccagggg accggtctct gggtttcctc gagggtgggg 7801 ttggcctgag gaagggagaa gaggggcacg accagggcag tgtggattgg gacagatgag 7861 gacaagaaca aatgaaaggc acagcaacca agtaaggaag ataacggctg gggtctggag 7921 cgttggggct gatggttctg tagtgctgcc cgttggaggc cccgcccctg gcactaaccc 7981 ctccccctta tctcttcgca gcgaggcggc tgcgcagtac aacccggagc ccccggtaag 8041 ccccctctgc aaccagaccc ccttctcctg ccaaggcctc ttcgaggtcc catccctgtt 8101 cctgtagaga agccccacct tcctcccctt cttgtgaaat tcctctgcca gttcctccca 8161 tgccgtgtct gcagcttcgc catgggtctt agccatgccc cacatacgtg caccccattc 8221 actaccaccc ctcattcttt tcctcatcaa gctgcccagc ccactctgac ttccccaccc 8281 agggtacctg ggtttgggga gccgtcctgg ccgggttccc ctccccctgc tctgagctct 8341 cctccctttg cagcccccac gcacacatta ctccaacatt gaggccaacg agagtgagga 8401 ggtccggcag ttccggagac tctttgccca gctggctgga gatgtaagta acctggggtc 8461 cctggccccg tcctaaccgt tccatccctt cccttgtggc tgcccttgca cacacaccct 8521 tgaccatgac aatcccagtg ttcccattct ccatgacatt ctcagacccc tttcagtcac 8581 ccctgacctg cccctaactt ccgcccgcag gacatggagg tcagcgccac agaactcatg 8641 aacattctca ataaggttgt gacacgacgt aagtgaccgg ggttaaggaa tagggtagat 8701 tcagaggcag aggggtcaga gaggatttga cctctggcct ctgactttca acctgttacc 8761 cacagaccct gatctgaaga ctgatggttt tggcattgac acatgtcgca gcatggtggc 8821 cgtgatggat gtatccttgg gggcagtgtg ggagaggccc tgggtggaca gagagttttt 8881 tgagagtatg tttgggaagc caacatgtag ctttcatatc cacagatgtg gaaatgcatt 8941 catcagaaca cagtcacctg cagacatgtg gcggtaaatc ataacaatgc cagctgtcac 9001 agctggcact ttaattatat aaagtgaccg aaccgtcata accaccctgt tggacaggca 9061 caattattat attccatttt acagaagagg aaactgaggc tcagtttcct gaaagaagca 9121 gtcatttgtc tcaagttata gataaaggtc aggcatggtg gctcatgcct gtaatcccaa 9181 cactttggga ggctaaggtg ggcagatcac ctgaggtcag gagtttgaga ccagcctggc 9241 caacatggtg aaaccccatc tctattaaaa atacaaaaat tggccaggca cggtggctca 9301 cacctgtaat cccagcactt tgggaggccg aggcgggcgg atcacgagat caggagattg 9361 agaccatcct ggctaacaca gtgaaaccct gtctctacca aaaaaaaaaa aaaaattagc 9421 cgggcgtagt ggcgggcgcc tgtagtccca gctattcggg aggctgaagc aggagaatgg 9481 cgtgaacccg agaggaggag cttgcagtga gccgagatcg cgccactgca ctccagcctg 9541 ggcgacagag ccagactctg tctcaaaaaa taaaataaaa ataaaaataa aaataggccg 9601 ggcgccgtgg ctcacgcctg taatcccagc actttgggag gccgaggcgg gcggatcacg 9661 aggtcaggag atcgagacca tcctggctaa cacggtgaaa ccctgtctct actaaaaata 9721 caaaaaaatt agccgggtgc agtggtgagc gcctgtagtc ccagctactc gggaggctga 9781 ggcaggagaa tggcgtgaac ctgggaggcg gagctttgca gtgagccgag atcgcgccac 9841 tgcactccag cctaggcgac agagagagac tccgtctcaa aaaaaaaaaa aaagaaaaaa 9901 aaagaaaaat aacaataaca aaaattagct gggcgtggtg gtgcacgcct gtagtcccag 9961 ctactcggga gtcccagcta ctcaagcagg agaattgctt gaacccggga ggcggaggtt 10021 gcagtgagca gaggtttcgt cactgcactc cagtctgtgg actcaggtct tttgggcttg 10081 agaacccacg tttataacag ccacatgcgc atatacacac acacacacct aagagcacac 10141 ccacatacgt gaaggtgtat gaggcagtgt gcacagggac acacgtgtta tctgtacatg 10201 ggccagccat tagctgatct tggatacatg catattttta tttatttttg ttttaaaaaa 10261 tattttatag agaaaagctt tcactatatt gacctggcaa gtctcaaact ccagacctta 10321 agcaatcctt ccgccttggc ctcccaaagt gttgggatta caagcttgag acactgcgcc 10381 tggcctcatg cacatttaac acatagggca aacagacaca tacataatat acatgtcatg 10441 taatcacaga catattgcat agagcaaagc aatcccaact ctacctctta accaggttat 10501 tgaaccgctc tgtgcctcag tttctctctc tgtaaaatgg gtgagtgata atggtgatac 10561 cgaatcggaa ctacatccaa aaggtttagc acagtgtata gcgcatagta tgtggtctgt 10621 acacattagc tcttaactat gatcttgggt agagttttca gctttgaatg ttttcttctc 10681 tggaaatagt cctggtagaa ctggtgtccc aaggatttag tgagaaagtg atgttcctga 10741 gtggacatca ggtggatcct gagttcaagc gatccacctg ccttggcttc ccaaagtgct 10801 gggattacag gcatgaacca ccatgcctgg tcacacactt gtttttcaca tatcctaaca 10861 tttttatgaa attgagttac tgtcaagaat tttcttttct ttctcttttt tttttttttt 10921 ttttgagaca gagtctcact ctgtttccca ggctgaagtg cggtggtgcc atcttggctc 10981 actgctaact ccacctccta ggttcaagca gttctcctcc ctcagcctcc tgaatagctg 11041 ggattacagg tgtgcgctac tacagctggc caatttttgt atttttagta gagacgggat 11101 ttcatcatgt tggccaggct ggtctcgaac tcctgacctc aagtgaaccg cctgcttcgg 11161 cctcccaaag tgctggattg caggcgggag ccaccgtgcg tggcctatta agaattttct 11221 gaatgagcaa ttaacaactc ttcaggcaag gaatttgtac tttagagtag ttgaatataa 11281 ttcctcatgg ttatgtttaa agagaatccc tatttagaaa tgtaagtgac aatatgtatg 11341 gataaaatga aaaataaaac cgcacaccag gtgatatgac taggaccaag cgggagctgg 11401 agtttcagcc ctggctgccc tgcttgctgt atcaccatcg tgtccacagt gcatgtgatg 11461 catgcatctg ggtgtagctg tcactcttct taacaccctc ccaccagagc gacaccacag 11521 gcaagctggg ctttgaggaa ttcaagtact tgtggaacaa catcaaaagg tggcaggtgt 11581 gtagaaacct ctgaaatccc tggcacccag acccccaagc catggagaca ctatgcccca 11641 caatgctcac ttggacccaa tgtctctgtc ctcacaggcc atatacaaac agttcgacac 11701 tgaccgatca gggaccattt gcagtagtga actcccaggt gcctttgagg cagcaggtat 11761 ggctggcagg gacatgctgg ggctgggagt gggatgggtg agaaaacctc tactcagctg 11821 gtctgggcct gactcctggg catgggagtg ggttgggcca tgtggccccc gcacctgaag 11881 gactacatcc atcttagggt tccacctgaa tgagcatctc tataacatga tcatccgacg 11941 ctactcagat gaaagtggga acatggattt tgacaacttc atcagctgct tggtcaggct 12001 ggacgccatg ttccgtgagt gacaacccag ctgtcttcct gggtggggat tcctatgacc 12061 tctatggacc aaggtctcct ctaagcctga ctttgaggtg ggcgatgcta ggggcacagt 12121 ggtgattgaa tcagttccag tttctgccct catggagctc acagtctggt ggaggagaca 12181 caagtcccca gacagtgaca gttcagactg gtgatggctg ggagagggga agctcagggc 12241 agagtgatgg gttgtgatgg gggcagcaca ggccgagggt ctagggctct ggtgagggga 12301 agagtgagca gagcgtgctc agattgtgct tggggaagcc tagagggcaa atccaatcca 12361 ggaggaacct gatgcagccc aaggggtcag ggagggctca ctggagaggg aacattgaga 12421 cctgagggga tgaggagaga acttagtgag aaggtaggga atactgttgc aggctgaggg 12481 aacagcctgt gcaaaggcct agaagggaaa gaagatgggg cttctgatgc tttttgttag 12541 atttctttct ttctattttt ttgagacaga gtctgctgct cttgttgccc aggctggagt 12601 gcagtggcat gatcatggct gactgcagcc tcaactccta ggctccagca gtcctcctgc 12661 ctcagcttac cgaatacctg ggagtacagg cacacaccgc catgcctggc taatttttaa 12721 aattttttgt agagacaggg tcttgctatg ttgcccaggc tggtcttgaa ctcctgggct 12781 ccagcaatcc tcctacctca gcctcccaaa gctttggtgt tacaggcatg agccattgca 12841 tgtggccttc tgagtgttag tgtactgaat gctgaagaca tggctgggaa gtggggatta 12901 ctgtccttat tttgtgaagg aggaacttag gattcaagga ggcaaatttt catgctcata 12961 gtcacccagc taggaagttg taaagccaga atttgaactt ggatctatct gagatctgag 13021 ctaagtaaga gaaggtagga gcagtgagca tgattaggtc atgcaggatt gtgggtgcca 13081 agtttggggg tgctcaaact ctaccagaga cattggggag ccatggcagg ttctaggcag 13141 agggaggaca gggtcaggtt tggctttaga aagatccctt ggcctcggct gggcacggtg 13201 gctcacgcct gtaatcccag cactttggga ggccgaggag ggcggatcac gaggtcagga 13261 gttcgagacc atcctggcta acacagtgaa accccatctc tactaaaaat acaaaaaatt 13321 agccaggtgt ggtggcgggc gcctgtagtc ccagctactt gggaggctga ggcgggagaa 13381 tggcatgaac ccaggaggcg gagcttgcag tgagctgaga tctgccactg cactccagcc 13441 tgggtgacag atcgagactc cgtcttaaaa aaaaaaaatc ccttggcctc caggccgggt 13501 gcagtggctt atgcctgtaa tcccagcact ctgggaggct gaggtgggcg taaaacctga 13561 ggtcaggagt tcaagaccag cctggccaac atggcgaaac cccgcatcta ctaacgatac 13621 aaaagttagc caggcgtggg ggcgtgtctg tagtcccagc tactcaggag gtttagggag 13681 ggagaattgc ttgaacccag gaagcggagg ttgcagtgag cagaggtcat gccattgcac 13741 tccagcctcc aacctgggtg acaagagcga gattccatct caaaaaaaaa aaaagaaaaa 13801 aagatccctt ggcctccaga gagcccacaa tgcatgaaaa gtaacagagg aaaagaagca 13861 ggtatgtgag atagcaggga ggtgaaggga gtggacagac aaggttgatg ccacaagtca 13921 cataagaggg gccaggaaat agccactgga tctaccagca agaaggctga tagtcgggag 13981 gatcatttga gtccaggagt ttgagaccag cctgggcaat gtagtaggac tccatctcta 14041 caaaaaaaaa aaaatatatg tatatagaca tagccgggca tggtggcaca tgcctgtggt 14101 ctcagttgct cgggaggctg aggtaggaga atcacttgag cccgggaggg tgaggctgca 14161 gtgagctatg atcttgccac tggactccag cctgggccac agagacaccg tctaaaaaaa 14221 aaaaaaaaag ggctgggcat ggtggctcac tcctgtaatc ccagcacttt gggaggccga 14281 ggtgggcgga tcacctgatg tcaggagttt gagaccagcc tggccaacat agtgaaaccc 14341 cgtctctact aaaaatacaa aaaaattagc caggcgtggt ggtgcacgcc tgtagtccca 14401 gctactcagg aggctgaggc aggagaatca cttgaacctg ggaggcaaag gttgcagtga 14461 gccaagatcg tgccactgca ctccagcctg ggcaacagag tgggactctg tctcaaaaaa 14521 aaaaagaaag aaaggcagcc agggtgtgca ggacgctgtg ggtatggaca cctgagggtg 14581 gtggtaactg agcggagata gtgtagtcat catgtctaga ttccctgtgg gtggggaaca 14641 tggagtcaag tttcagggca ggcgtgggga cctacctgta agactctagg ggcaggaagt 14701 agagtgtgca gagcctccct ctagtagcct gttaggctta gaactggaga ggatggagca 14761 tagatgtcag taggaggctg gggctgggca gaggtgatat cagtacacag aggccattat 14821 tgaggagggt aacagattgg attccaatcc tgatttcttc actttcttgc tatgtgacct 14881 ggggcatggc ccctcccttc ctgcctcatt tctttttttc ctgtttttaa tttttagttt 14941 ttttaaaaag cagagatgag gtctctgttg cccaggttgg tctcgaactc ctgggcggct 15001 caagtgatcc tcccatcttg gccttccaaa gtgttgggat tataggtgtg agccaccacg 15061 cccagcctgt gcctcatttt aaaatgacac ttatctcaca gagttggatt aaatcagcca 15121 atacctggaa agcactttac cccatgcttg gcacacagta aatactcagt gaagtgccag 15181 ctgctatgat tgtcatccca tgtatttgtg taatgttatt atgccaaaaa ccagtctcct 15241 tccttccctg ccgccaaacc tctgcatctc ctcctccagg tgccttcaaa tctcttgaca 15301 aagatggcac tggacaaatc caggtgaaca tccaggaggt aaggaccccc atattggggt 15361 atgggtgcct gggaggaccc cacccctcag cccttcatac cagctctgag ctgcagtccc 15421 cttcctccta ttttgccagc cgtccctggg tgagggcaaa ggggctggtg ctcttggggt 15481 tccctgtcct cactctccac cctcctctcc ccagtggctg cagctgacta tgtattcctg 15541 aactggagcc ccagacccgc cccctcactg ccttgctata ggagtcacct ggagcctcgg 15601 tctctcccag ggccgatcct gtctgcagtc acatctttgt ggggcctgct gacccacaag 15661 cttttgttct ctcagtactt gttacccagc ttctcaacat ccagggccca atttgccctg 15721 cctggagttc cccctggctc taggacactc taacaagctc tgtccacggg tctccccatt 15781 cccaccaggc cctgcacaca cccactccgt aacctctccc ctgtacctgt gccaagccta 15841 gcacttgtga tgcctccatg ccccgagggc cctctctcag ttctgggagg atgactccag 15901 tccctgcacg ccctggcaca cccttcacgg ttgctaccca ggcggccaag ctccagaccg 15961 tgccagaccc aggtgcccca gtgcctttgt ctatattctg ctcccagcct gccaggccca 16021 ggaggaaata aacatgcccc agttgctgat ctctatggga tctgcctcct gatctgaccc 16081 gagagaggag gggtggctcg ggctctgggt cctggggagc aggggaggta gtagattcgt 16141 ccccagggct ggatccgtgg atttcctcgt ctttttctcg tcggcatttt ttgtctcatc 16201 ctgagctgtg gaggctgaga caggacccag aagccccagt gatctctgct tctgcttcca 16261 gagtctgggc tggagggagg tctcaatgcc actcctggcc ctgtccccca aggccatgtt 16321 tgcccctctt gcagcttctc catccacact tggcctctgc ctctgttgtg tcatggacca 16381 tgctcagtgt cttccttccc acttgaggac accaccccac gtccccgggt cccgaggggg 16441 agcctatggg tgctgtttgg ggtactcagc agagggctga gaggtagcaa catcctagga 16501 gtatccctgg tccagcatga ggcaccgagg gctgggggca gtgggcagtc cacagggcag 16561 agatccctgg ctggtgggag gatgggcttg gtggggaggg gtggacacaa acacagacac 16621 acacagaggc cagcgtttat tgacacttgt tcaagtctct caggcccccc aggcttcttg 16681 gtcttaattc ctggggaagg aggcccagcc aagggagtac aagctgtaga cagtgcctag 16741 aaaggggaag cgttggagac agaatgagac ctaatcccct ccctggacat cccaggcttt 16801 taaggccttc aggctccctg gttcccagcc tggaccccag ctctagttcg gactccagct 16861 cccttcccag attggactcc tattgtaaag ccccctggct ttgatctaga ccgggctggg 16921 agctttctcc ccgactcctc cctgggcctc agccccacca ctaacccagc ccccgtcctg 16981 ggacaagagc ccaaacccta gttactcccc tccctgcagg cctgactcca gctcagtact 17041 cgaccccctt cctaaacgcc aacccccaac ccagatcctg gtcagtcctg cccagaaacc 17101 agccacctct tactctgggt ccagccccgc ctcccccgca gcccagacgg gccctgcgct 17161 caccgcccag acacagcgtc attgtcactc ggtacaggat gttgtcaacg atgccgccct 17221 tcaggtacaa cgggatgtca ttgtcctcct ggatgtgggg gtgtctgatg agaaaggggt 17281 gctggctgct ccttagagcg cgccccgcct gccccggctc ggcgcgctcc cggctggcgc 17341 gtcggatccc cacccccccc gaccccccac ctggaagagc ttctgtttct cgcgcactcg 17401 gttctgaaag cggttccggg cggtggagct gaaggagcgg atcagcgcct gggacacctg 17461 cggggtggga ggtgggcggg tattggaggc acctggaggg gtgtcctgaa gtgggacccc 17521 gctctctgag gcctggacgt ggggacagcc ccatccctag cgagcgtccc aaaggcgagg 17581 agggtaatag cttggattct gttacccgcg aactgccagc tgggttggga ggggactcgc 17641 cctggagtct ggcctggggg tgaccgggct gcctagaggg tcctggaagg gagaggaccc 17701 agaaggactt tgttctgggg ggcctaagtc ctgagatttt tgttggggga gggtttcctg 17761 gattaaaggg aggaaaccag gtcccgggaa cgtggggacc ggaatggagg tccccgggag 17821 cgtccaggct gcctgagaag gcccgggtga ggccagtgtt aagggttccc gagatgtcaa 17881 gttcaaaccc ggggctttgg gccggggtgg gggtccccag ttgtctccga agagtggatt 17941 gtgggggaag ggacccaggc tgtttagcgg ggagtcccag gccctgaagg tggctcacgt 18001 tccaaggccc tagatgcggg ggcacttgga gagtcgcgta ttagggcggg gttttctggg 18061 tggggggctc cggcccagcc catgggggcc tcacccgaag ggcctgcatt ctgccttgtc 18121 ctcttccgcc ggagtcacct cccttctccg cccaaggaca cgcgtagtcc tcctctgtcc 18181 aggaacccca ggccccaccc gctgcgcgca gcgtgacagc tggggacgcg gcggggctgg 18241 ggctggggct ggggctgggt tgcggggagt cctgctcctc ggattcgtcc accacggagg 18301 aggcggtcca gacccgcacc ctgggccgca gcgcgcatcc gtttcggtct cggaatttcg 18361 cttggtccca ccggagccca aagcggtacc agggaataca tttttgcaga tttttggaga 18421 tttccgccgg gggcaagtgc aactcctccc cggatctctc cctgcccgct ggtatgtgag 18481 agtgaggaaa cgcataaggc cgtgttccaa ctccctgttc tgtctacaca gccagagatt 18541 gcacctccga gcctgtttcc gtatctgtaa cgtgagggct aaggcaggat ctttctcaga 18601 gatttgttga gacattaaat gaccttgttt ttaggaaaca caagctaaag tctttgggag 18661 taaaacggta ttttatgtta ccctaagaaa cacatcatat ctatttatag ctctctaatt 18721 ctggcagtca acggggtgca gcacagttgt cctaatggtt ttagcaacca ggaaagatcc 18781 tggcctagtc tacaaggttt tctttttctt ttcttttttt tctttttttt ttagacaggg 18841 ttttgctctg ttgcacaggc tggagtgcag tggtgcggtc atagctcact gcagcctcga 18901 cctcctgggt tcagcgatcc tcctgcctca gcctcctgag tggctgggac tacaggcgcc 18961 caccaccatg ctcgaactaa ttttcgtatt ttttgtagag gctgggtttc accatgttgc 19021 ccaggctggt gtcgaactcc tgagctcaag aaatttgccg gcctcatcgt gagccacgca 19081 cccagccagc agacaatgtt tgtttgtttg tttttgaggc agtgtcttgc tatgttgccc 19141 agggtggtct ccaactcctc caagtaatgt tcctacctct atcacccaaa gtgctagaat 19201 tagaggcatg agccactata cccagcttgt aaaaggtttt aacaatctct tttatttatt 19261 tatttaattt atatttttag acagagtctt gctctgttgt acacgttgga gtgcagtggt 19321 gtgatctcgg ctcactgaaa ccttctgcct cctgggttta ggcaattctt gtacctcagc 19381 ctcctgagca gctgggatta caggcacccg ccaccacacc cagctcattt ttgtattttt 19441 tagtagagat ggggttacac catgttggcc aggctggtct caagctccca acctcaggtg 19501 atccgcccac tttggccttc caaagtgctg gaattacagg cctgagccac cgcttccagc 19561 cacggtctct tttaaagagg gcttttatga ttattaaatt cccctaacac tttcaaacta 19621 tgcttgtaaa tttaatgtta ggtttttcca tttaagagcc acaagtaggg ctgggcgtgg 19681 tggctcacac ctgtaatccc ggcactttgg gaggccaagg caggtggatc acttgaggct 19741 gggagtttga gaccagcctg gtcaacatga caaaaccctg tctctactaa aaatactaaa 19801 attaaccagg tgtggtcatg ggcgcctaca atcccagctg cttgtgaggc tgaggcacaa 19861 aaatcgtttg aacccaagag gcacggaggc tgcagtgagc caagattgca ccactgcact 19921 ccagcttggg tgacagaacg agattctgtc tcagaaaaaa aaaaaaaaaa aagctgactt 19981 accagataaa cagatgactc tttcttcaat tcttaaataa cttttatgaa taaaaaattc 20041 ataaactgta cataaacaga tttcaagtaa gtcttagtca ccagctgaat tccacatttt 20101 acaatggaat tacaagattg acctgttatt tgaataggcc agtttacctt aaatattaca 20161 aattcttaaa ataccaagta ttcacagatt ctctaacaca aagttaatac catgggggcc 20221 aggcactgtg gctcacgcca gtaattccaa cactttggga ggccgaggtg ggtggatcac 20281 ctgggtcagg agttcgagac cagcctggcc agcatggtga aaccccgtct ctactaaaaa 20341 tacaaaaatt agccaggcgt ggtggtgggt gcctgtaatc ccagatactc cggaggctga 20401 ggcaggagaa tcacttgaac ctgggaggcg gaggttgcac tgagcagaga ttacgccatt 20461 acactccagc ccgggcaaga gagcgagact ccatctcaaa acaaaacaaa acaaacaagt 20521 taatactatg gttttgactg tcttaatttt cttttttctt ttttgagacg gagtcttgct 20581 cagtcaccca ggctggagtg cagtggcgcc atctcggctc actgcaagct ccgcctccca 20641 ggttcacgcc attctcctgc ctcaggctcc ccagtagctg ggactacagg cgcccgccac 20701 cacgtccagc taagtttttt ttttgtattt ttagtagaga cagggtttca ccgtgttagc 20761 caggatggtc tcaatctcct gacctcgtga tccacccgcc tcggcctccc aaagtgctgg 20821 gattacaggc gtgagccacc acgcccagcc tctttttttt ctatttgaga tgaagtctcg 20881 ctctgtcacc caggctggag tgcaatggcg caatctctgc tcactgcagt ctccgcctcc 20941 cgggttcaag tgattcccct gcctcagcct cccaagtagc tgggactagg tgcacgacac 21001 cacgcctggc taattttttg tattttagta gagacgaggt ttcaccatgt tggccaggat 21061 ggtcacgaac tcctgacctc aggtgatcca tctgtctcgg cctcccaaag tgctgggatt 21121 acaggcgtga cccactgtgc ctgtcctgtt tttctttcat tttctttatt tgtttatttc 21181 atgcgcgtcc gtgtgaagag accaccaaac aggctttgtg tgagcaacat ggctgtttat 21241 ttcacctggg tgcaggcggg ctgagtcaga aaagagagtc agcgaaggga gatagggatg 21301 gggccatttt ataggatttg ggaaggtaat ggaaaattac tgtcaaaggg ggttgtgctc 21361 tggtgggcag gggcgaaggg gtcacaaagt gctcagtggg ggagcttctg agccaggaga 21421 aggaaattca cagggttaat cactcagtta aggtggggca ggaacaaatc acaatggtgg 21481 aatgtcatca gttaatgcgg ggcagggcct tttcactttt gtgattcttc agttacttca 21541 ggccatctgg gtgtatacgt gcaagtcaca ggagatgcga tggcttggct tgggctcaga 21601 ggcctgacat tcctgcctta tgttaataag aaaaataaaa tagtgtcgaa gtgttggggt 21661 ggcgaaaatt tttggggggt ggtatgaaga gagaatgggc gatgtttctc agggctgctt 21721 caagcgggat taggggcggc gtgggaacct agagtgggag agattaagct gaagggagat 21781 cttgtggaaa ggggtgatat tgtggggatg ttagaagaaa catttgtcgt atagaatgat 21841 tggtgatcgc ctggatacgg ttttgtatga attgaaaaac taaatggaat aagaaaagga 21901 gcaaaacagg tataaaagga ctaagaattg ggaggaccta ggacatctga ttagagagtg 21961 cctaaggaga ttcagcatag tcctgccagc aaagattatt tatttacttc aagagttaag 22021 agtggcagtt tggggatagc atgaggagat atcagctgtg atggcttgga gaaactgtgt 22081 aaactggcag tgtaaacaag agcagggcat gtatgagtag ttgagaacgg tgaataggag 22141 tatgactaga cagaagatag tagggatgac aagttattcg ggggcacagt ctaagttggt 22201 ctggtgtctg gaatgagact ggggcctaat aaaaaggagc gtctatacag gagcttaaat 22261 gggctgtagc ttgtagcatt ctgaggacag gtctgacttc tgagaaggga aagtggtaaa 22321 agtattgtcc agtccttttt aagttggtgg ctgagcttgg tgaggtgtgt ttttaataga 22381 ccattagtct gtcactgaat actaagagct ggaaaaaatg cttggctgat ttgactaata 22441 aaggctagtc tgttaccaga ctgtatagag gtgggaaggc taaactgagg aattatgtct 22501 gacagaaggg aagaaatgac tgtggtggcc ttctcagacc ctgtaggaaa ggcctctccc 22561 tatctagtga aagtgtctac ttagactaag aggtatttta gtttttgtga ctcggggcat 22621 gttgagtaaa gctaatttgc cagtcctggg ctggggcaaa tgctcaagct tgatgtgtag 22681 ggaagggagg gggcctgaat aatccttgag gagtagtaga ataacagatg gaacactgag 22741 aagttatttc cttgaggata gatttctaca atggaaagga aatgaaaggt tctaagaggc 22801 gggctagtgg cttgtactat agcatagcct gcctttgctg gtgtgtggcg attaggcctg 22861 gtggaactgc catcaataaa tcaagcgtga tcagggtgag gaacagggaa gaaggaaatg 22921 tggggaaatg ggatgaacat caggtggatc acagagatgc agtcatgggg gtcaggtgtg 22981 gtatccggaa taatgtggga ggctggattg aagtccgggc caggaacaat ggtaattgtg 23041 ggacttaaca aaaagtgaga acagctgaag gagtcaggga gcagaaagta tatgcgtcag 23101 gtgtgaggaa gaaaatagat tttggaagtt atgagaaatg tagagagtga gttgagcata 23161 gtttgtgatt ttgagggcct ctaatagtat taaagcagtg gcagccgcta cacgcagaca 23221 tgagggctag gctaaaacag taaggtcaag ttgtttgcac agaaaggcta cagggtgcgg 23281 tcctggctct tgtgtaagaa ttctgaccgc actaaccatg cctaggaagg gaaggagttg 23341 ttgttttgta agggatcgag gtttgggaga ttaatcggac acgaacagca gggagagcac 23401 atgtgttttt acgagaatta tgccgagata ggtaacagat gaggatgaaa tttgggcttg 23461 actgaagtaa tggggtctgt ctgtgaagcc ttgcggcagt acagcccagg taatttgctg 23521 agcctgatgg gtgtcagggt cagtccaagt gaaagggaag agaggctggg aagatgggtg 23581 caaaggaata ggaaagaaag catgtttgag atccagaaca gaataatgga ttgtggaggg 23641 aggtattgag gataggagag tatatgggtt tggcaccatg gggtggatag gcaaaacaat 23701 ttggttaata aggcatagat cctgaactaa cctgtaaggc ttttccagtt tttggacagg 23761 taaaatgggg gaattgtaag gagagtttat aggctttaaa aggccatgct gtagcaggcg 23821 agtgataaca ggctttaatc ctttcaaagc atgctgtggg atgggatgtt ggcactgagc 23881 cgggtaaggg tgattagctt ttaatgagat ggtaaggggt gcatgatcgg tcaccaagga 23941 gggagtagag gtatcttata cttgtgggtt aaggtggggg gatacaagag gaggatgcaa 24001 aggaggcttt ggattgggaa gaagggcggc aatgagatgc agctgtagtc caggaatagt 24061 cagggaagca gataatttgg ttaaaatatc tcggcctaat aagggaactg ggcaggtggg 24121 gataactaaa aatgagtgca taaaagagta ttgtctaagt tggcaccaga gttggggagt 24181 tttaagaggt ttagaagcct ggctgtcaat acctacaaca gttatggagg caaaggaaac 24241 aggcccttga aaagaaggta atgtggagtg ggtagcctcc gtattgattg agaaggggac 24301 ggacttaccc tccactgtga gagttaccta aagctcggca tctgtgatgg tctacggggc 24361 ttccgaggca atcgggcagc gtcagtcttc agccgctaag ccaagaagga gtcagtcaga 24421 gagccttggg caagagttcc aggggctctg ggagtggctg ccaggtgagt tgaacagtcc 24481 gatttccagt ggggtcctgc acagatggga cacggcttag gaggaatcct ggactgcagg 24541 cattccttgg cctggtggtc agatttctgg cacttgtagc aagctcctgg gggaggaggt 24601 tctggaggaa cgcctggccg ctgcggttca ggcgtttgga agttcttgtg tgctggagat 24661 gtggctgggg tttgtctcac agtggaggca aggaattgca actttttttt attattgtac 24721 accttaatta agtcctgttt tggggtttga gggccagatt ccaatttttg gagttttatt 24781 taatgtcggg agcagattgg ataataaaat gtatattgag aataagatgg ccttttgacc 24841 ttttagggtc tagggctgta aagcatctca gggttgctgc cgaacaagcc atgaactggg 24901 ctgtgttttt atatttgatg aaaaagagcc taaacgcttc tgatttggga taaagaaaaa 24961 gcattaacct tgactatgcc tttggctcca gccacctttt taagagtaaa ttgctgggca 25021 ggtgggggag ggctagtcac agaacgaaac tgtaagctgg accaggtgtg aggaggggag 25081 gtgataaaag gattataggg tggaggagcg gaggctgagg aagaattggg acctagctcg 25141 gcccggcgag gaggggagag gtcagatggg tctgtagaaa aggaagatta gaaagactca 25201 gcaatgcttg gggttgggag tgaggggaca ggtgggaggg aaagaaggaa gatttgggac 25261 gagttgcact gggcacagag actaggaaag gactgatgtg taaaagaatg cctggacatc 25321 aggcacctca gaccatttgc ccattttacg actagaatta tttagatctt gtaggatgga 25381 aacattgaaa gtgccgtttt ccagctattt ggaactactg tggagtttgt attggggtca 25441 agcggcattg cagaagaaaa taagacgctt agattttagg tcaggtgaga gttgaagagg 25501 tttataagtt cttaagaata caggctaagg gagaaggagg aggaatggag ggtggaaact 25561 tgcccatagt gaaggaggca agcccagaga aaagagtaga gacacggaga aggggcgggg 25621 gtttcttgcc cttcagaaaa gcagagaaag ggttggggca tggaaataag ggattggggg 25681 ttcttgcccc ctagaaaagt gggacttacc actaacggtg aagaaggggt tgaggggttc 25741 ttgctcctgc cccagaagag cagagaaggg gtagagacac ggagagaagg ggttggggta 25801 cttgcccctc cccgagaaaa gcgggacttg ccactaaggg tgaaggacca aggcaggtgt 25861 ccctgcgtgg tttgacacct ttgaaacgtg ggggaataat cagagaggtg gccctgcaat 25921 gattaaacac caagggaagg ctgccttccc agtccgtgac cggcgccgga gttttgggtc 25981 cacggataaa acgtgtctcc tttgtctcta ccagaaaatg aaaggaattg aaattaagag 26041 aatggagaga ttgaagtgtg gcgccaagat tgaaaggaga aagaggttga gggataggga 26101 ggttggagaa gagagtaaaa agaggccgct taccggattt gaaattggtg agatgtttct 26161 tgggctggtc agtctgagga cctgaggtca taggtggatc tttctcacgg agcaaagagc 26221 aggaggacgg gggattgatc tcccaaggga ggtcccccga tcagagtcac ggcaccaaat 26281 ttcatgcgcc acatttcatg cgcgtccatg tgaagagacc accaaacagg ctttgtgtga 26341 gcaacatggc tgtttatttc acctgggtgc aggcgggctg agtcggaaaa gagagtcagc 26401 gaagggagat aggggtgggg ccgttttata ggatttggga aggtaatgga aaattacagt 26461 caaagggggt tctctggtgg gcaggggcag ggggtcacaa agtgctcagt gggggagctt 26521 ctgagccagg agaaggaaat tcacagggtt aatcactcag ttaaggtggg gcaggaacaa 26581 atcacaatgg tggaatgtca tcagttaatg cggggcaggg ccttttcact tcttttgtga 26641 ttcgtcagtt acttcaggcc gtctgggcat atacgagcaa gtcacagggg ttgcgatggc 26701 ttggcttggg ctcagaggcc tgacagttta tttttgcctc ttggttcagc gacacaagga 26761 cctcaaaatg ttcaggtgga agtcttatct cccagttgtt ggatccattg ttgtgtgccc 26821 agtggaagca ttcctctcca tgaggaccaa aatctctaaa ctagcagagc tcaaagctgt 26881 tgaggagaga agcaattatt caagttactg tgtgtaacag agagaggaac catttctact 26941 cactttcccc ctgactccca gtcccataaa ttctcactat ggggaaacat ggtagagtgg 27001 tctctcattc aaagcacagg ccatgtcctg taagacaatc ccattttttc agggtattgt 27061 ttcccagttg gtactttaat ggagtcattg ctgactcatt ctacctttca gttaggccag 27121 ttgcttctgg gtgatgggga agatagtaag attctataag attccatggg caggagccca 27181 ttgctgcact tccttcgctt tcaaatgaaa tttttttttt tttttttttt agagataaag 27241 tacttggtct gtcactcagg ctggggtgca gtggcaccat catagctcac tgaagccttg 27301 atctcctggg ccaaagtgat cctcttacct cagcttctta agcaactgga actacaggtg 27361 ggtcccagta tgcttggcta atttttttgg tgttcttagt agagacaggg tctcactgtg 27421 ttgcccaagc tggtctcaaa ctcatggtct caagtcatgc tcccacctca gcctcccaaa 27481 gtgctgggat tacaggcgtg agccaccgca gccagcctca gatggatttc ttgttctgac 27541 gcgatgctgt gtggaatgcc atgatggtgg gtgagcatgc tttgagtctg cagatggtag 27601 tgacagcaaa cgcattaaag gcagggaagc acgttcgtgt ctgggatatc tgtctattcc 27661 agtgaggaca aatctttgcc ctcttcatga cggaagaggt ccaaggtaat caacttgcca 27721 ccgggtagca ggctgatccc cagggaatgg tgccatattg gggctcagtg ttggtctctg 27781 ctattggcag attaggcgct cagcagtaac agtagccagg tcagccatgg caaggggaag 27841 ctgacgttgc tgaacccatg catagcctcc atctctgcca ccatggccac ttttttcatg 27901 gacccattga gcaagcatta gggtggctgg gaaaagagcc tgactctcat gcacacaagt 27961 gtcatcttgt ctgcctgatc attaagaatt tttgggccag gaccggacgc ggtggctcac 28021 gcctgtaatc ccagcacttt ggggggccga ggcgggcaga tcatgaggtc aggagatcga 28081 gaccatcctg gttaacacgg tgaaaccctg tctctattaa aaatacaaaa aattagccgg 28141 gcgtggtggt gggcgcctgt agtcccagct actcgggagg ctgaggcagg agaatggcgt 28201 gaacccagga ggcagagctt gcagtgagcc gagattgcgc cactgcactc catccggcct 28261 gggtgacaga gcaagactcc gtcttaaaaa aaaaaaaaaa aaaaaaaaag aatttttcgg 28321 ctgggcatgg tggctcacac ctgtaatccc agcactttgg gaggccgagg caggtagatc 28381 atgaggtcag gagatggaga ctatcctggc taacacagtg aaaccccacc tctactgaaa 28441 atacaaaaaa ttagctgggc gtggtggcgg gcgcctgtag tcccacctac tcaggaggct 28501 gaggcaggag aatggcgtga accctgggag gtggagcttg cagtgagccg agatggcgcc 28561 actgcactcc agcctgggca acagagctag actccttctc aaaaaaaaaa aaaaagaaaa 28621 aaagaaaaat ctgggccggg cgcggtggct caagcctgta atcccagcac tttgggaggc 28681 tgaggcgggt ggatcacgag gtcaggagat cgagaccatc ctggctaaca cagtgaaacc 28741 ccgtctctac taaaaataca aaaatttagc cgggcatagt ggcgggtgcc tgtagtccca 28801 gctactcggg aggctgaggc aggagaatgg cgtgaacccg ggaggcggag cttgcagtga 28861 gccgagattg caccactgca ctccagcctg ggtgacagag ccagactctg tctcaaaaaa 28921 aaaaaaaaaa aaaaaaaaaa agaaaaatct gtggctgggt gcagtggctc acacctataa 28981 tcccagcact ttgggaggct gaggcaggtg gattgcctga agtcaggagt tcaagaccag 29041 cctggccaac atggtgaaac cccgtctcta ctaaaaatac aaaaattagt cgggtgtggt 29101 ggcgggtgcc tgtaatccca gctacttggg aggttgaggc gggagaatca cttcaacccg 29161 ggaggcagag gttgcagtga gccgagatcg tgccattgta ctccaacctg ggaaacagag 29221 cgagattcca tctcaaaaaa aaaaaaaaaa aaaggaattt tcacttcagt ggctgccctt 29281 gggtgggcac tcctatggga cacaaatatc ttcacagtct gtgcagagac tgagaggccc 29341 atccatatgc ctctccaagc ttccttgtca ttaatctttc aatcttcttc cttccacgta 29401 cttgaccatc cacccaaacc attagcaact gcccataaat tagtatagat ctatacttct 29461 ggccacttct ccctccaggc atactggaca actaaatgtg cagcatgaaa ttctcttttt 29521 tctttttgaa acagagtctc gctctgtcgc ttaggcaggt tgcggtggca caatctcagc 29581 tcgctgcaac ctccacctcc caggttgaag caattctccc gcctcagcct ccgcagtagc 29641 tgggactaca ggtgtgtacc accacacccg acctacatga agttctgacc actgccctgt 29701 aggttcatcc ctaagggagg ttgtaatgct gccattgtcc actttcgagc ggtaccagca 29761 tgatgtacag actagctgta agccaggctc aagtttataa ttattttttt tgaaacggag 29821 tctcgctttg ttgccaggct ggagtgcagt ggtgcgatct cagctcactg caccctccac 29881 cttctgggtt caagagattc tcctgcctca gcctcccgag tagctggtac tataggtgcc 29941 tgccaccatg cccagctaat ttttgtattt ttagtagaga cggggtttca ccatgttggc 30001 caggatggtc tcaaactcct gacctcgtga tccacctgcc tcagcctccc aaagtgttga 30061 gattacaggc gtgagccacc acgcccagcc aagtttcttc ctcatttcac tgggcataag 30121 gaacatgcca gaggaaaagt ttggcttgaa gtctaggaac caatgcaaca gaggttagtg 30181 ttgaaggact ctgaccttgt taatggtaaa gcagcttcaa aagccggcac ggtggctcaa 30241 cgcctgtaat cccagcactt tgggaggccg agatgggcag atcacgaggt caggaaatcg 30301 agcccatcct ggctaacacg gtgaaacccc gtctctacta aaaatacaaa aaattagccg 30361 ggcgtggtgg tgggcacctg tagtcccagc tactcgggag gctgaggcag gagaatggtg 30421 tgaacccggg aggcgcagct tgcagtgagc cgagatcgcg ccactgcact ccagcctagg 30481 ggacagagtg agactccgac tcaaaaaaaa aaaaattttt ggctgggtgt ggtggctcac 30541 tcctgtaatc ccagcacttt gggaggctga ggcgggcaga tcacctgagg tcagcagttc 30601 aagaccagcc tgaccaacat ggtgaaaccc cgtctctact aaaaataaaa aattagctgg 30661 gtgtggtggc gcatgcctgt agtcccagct acttgggaag ctgaggcagg agaatctctt 30721 gaaaccggga agcggaggtt gcagtgagct gatatcacac cactgcactc cagcctggga 30781 ggttgcagtg agctgagatc gtgcccctgc actccagcct gggcgacaga gtgagacttc 30841 gtctcaaaaa aaaaaatttt aaaaaagagc agcttctact gcagcctcct cttaccctat 30901 tgccttctct tgctctggtc tccactcaaa gcatgcagcc ttctgggtga ttttgcagat 30961 gggtcaaaac agcatactca atgttgcctc ccaaataaaa aaacctaccg accattgtac 31021 ttctttcttt gtggtaggta ctgcaacttg cagcaacttg tctttcacct tagaaaagat 31081 atctttcttg gccgggcgca gtggctcacg cctgtaatcc cagcactttg ggaggccgag 31141 gcggcggatc acctgaggtc gggagttcga gaccagcctg accaacatgg agaaacccca 31201 gtctctacta aaaatacaaa attagcaagg cgtggtggcg catgcctgga atcccagcta 31261 ctcagccggc tgaggcagga gaatcgcttg aacctgggag gtggaggttg cggtgagccg 31321 agatggtgcc attgcattcc agcctgggca acaagagtga aattccgtct caaaaaaaaa 31381 agaagaagaa aagaaaaaat atctttcttt cttgctgggt gtggtggctc atgcccgaaa 31441 tcccagcact ttgggatgcc caggcaggcg gattgcctga gctcaggagt tcgagaccag 31501 cctgggcaac aaagtgaaac cccttctcta ctaaaataca aaaaattagc tgggtgtgct 31561 gctgtgcgcc tgtagtccca gctacttggg aggctgaggc agaattgctt gaaccccaga 31621 gatggaggtt gcagtaagcc gagatcatgc cactgcactc tagcctgggt gacagagcga 31681 cactccatct ccaaaaaaaa aaaaaaaaaa aaagtagaaa agatatcttt atttccctcc 31741 ctccctcctt tccctctttc cttccttttt gagacagggt ctcactttgt cacccagact 31801 gaagcgcagt gataaaatca cagctcagca gcttcaacat cctgggctca agtgatcctc 31861 caatctcaac ctcctgaata gctgggacaa cagttgtgga ccaccatgcc cggctaattt 31921 tatagagatg aggtcctact ctattgtcca ggctggtctc taactcctgg gctcaaacaa 31981 tcctcccgcc tgagcctctg aagggctgag attataggca tgagccactg tgcctggcct 32041 gaagataatt ttagatcctt atcagactgg agatggtacc agaggaatct acattaacga 32101 gttttctttt ttttttgttg agacagagtc tcactccgtt gcccaggctg gagtgcagtg 32161 gtgtaatctc ggctcactgc aaccttcgcc tcccgggttc aagcgattct cctccttcag 32221 cctcgtgagt agctgggatt acaggcgcca ccaccgtgct cagctttttt tgttttgctt 32281 tgtttttttt gtatttttag tagggacggg tttcaccatg ttggccaggc tggtcttgaa 32341 ctcctgacca cagatgattt gcctgcctcg gcctcccaaa gtgctgggat tacaggcgtg 32401 agccgccatg ccgggccaac aagttttcaa actaggcttc ctctgctttt atttggcttc 32461 tcacaagtta ctaccccttt tcctttgttc tataatttct ctaaaaattt tatgcttttt 32521 gttgaagacg ctgtataagc tggaattcga agccacctct ttgagaacta ctcattccct 32581 gggtgtctcc catgtgtata agaaatatac atgttaataa gcttctgttt gtttttctct 32641 tattatcttt tgttacaggg gtacattcca actaggaact tatgaggatt aaagaaaaaa 32701 tatttttcag gccaggcacc atggctcatg cctgtaatcc cagcactttg ggaggccgag 32761 gcaggtggat tacctggggt caggagttca agaccagcct ggataacatg gcgaaatccc 32821 gtctctacta aaaatacaaa aattagctgg gtgtggtggc atacacctgt agtcccacta 32881 cttgggaggc tgaggcagga gaatcacttg aacccaggag acggaggttg cagtgagccg 32941 agatcgcgcc acttcactcc agcctgggca acagagtgag actctatctc aaaaaaaaaa 33001 aaaaaaaaaa agccaggcat ggtggcttac acctgtaatc ccagcacttt gggaggccga 33061 ggcgggcata ttacaagctc aggatttcga gacccgccta gccagcatgg tgaaacccca 33121 tctctactaa aactacaaaa aattagatgg gcatggtggc acgcgcctgt aatctgagct 33181 actcgggaga ctgaggcagg agacttgctt gaacccggga ggcagaggtt gcagtgagcc 33241 gagattgtgc cactgtactc cagcctgggc gacagagtga gactctgtct caaaaatata 33301 tatatttttc ttcccctgta accccccaaa acttcactga ggtgggaggg ggtcgctgaa 33361 ttttgatggt gactttctcc aattttctca ttcgtacata ccttactaaa gcatttaaag 33421 tacttgctat ttgttgctcg tcaggaccaa tcagcataat gtcattcttg tagcagaccg 33481 atgtgatggc ttgtgaaatg tcaaaatgat caagatttag tggacaatat tggggcaaag 33541 agcaggagaa ctgctgtagc ctggaaatac cactctcaag gtatactgtt gggtctgcca 33601 attaaaagca aactgcttct ggtggtcctt gaaaattgat atggagaaaa aggcattagg 33661 caggttaata gctgcatacc aataccagat tctgttgatt tgttccagca agcatattac 33721 atctggtaca gcagctacaa ttagaataac tatctgatta tgtttacaat agcccacagt 33781 cattcctcaa gttattcttt tttgacacgc ttatttattt atttattcat tttttcaaga 33841 cacggtcttg ccctgccgtc caggctggag tgcagtggtg caatcttggc tcactgcaac 33901 ctccacctcc tgggttcaaa tgacccttcc acctcagcct cccgagtagc tgggatcaca 33961 ggcatgggcc accacacctg gctaactttt gtatttttaa agagatggga tttcgccatg 34021 ttgcccaggc tgatctcgaa ctcttgagct aaagtgatct gcctacctca gcctcccaaa 34081 gtgctgggat tacaggagag agccaccttg cctggccgtg atgcttatta ttttaaacca 34141 ctgttttggg tggcggtgtt tttgagacag agtttcgctc ttgttgccca ggctggagcg 34201 cagtggtgcg atctcagctc actgcaacat ccacctcctg gattcaagtg attctcctgc 34261 ctcaggcctc tccagtagct gggattacag gtgcctgcca ccacgcccgg ctaatttttt 34321 gtatttctaa tagagacagg gtttcctcat gttggccaga ttggtctcga actccagacc 34381 tcgggtgatc cacctgcctc ggcctcccac agtgcgggga ttacaggcgt gagccaccgt 34441 gccaggcctg tttttgtttg ttttttgaaa cagtttcatt ctgtctccca ggcttgagtg 34501 cagtggtgcg gtcttggctc actgcaacct ccacctccca ggttcaagca attctcctgc 34561 ctcagcctcc cgagcagctg ggattacagg catgtgccac catgcccagc taatttttgt 34621 atttttaata gagatggggt ttcactatgt tggccatgct agtctcaaat tcctggcctc 34681 cagtgatcca ccggccttgg cctcccaaag tgctgggaat tacaggggtg aggcaccgca 34741 cctggccttg ggtgtttttt aatataacaa aagctaattg acatagggac atgcatttct 34801 gcttcaccaa agtccccacc aacccctatt gtcttccatc tatttcactt actcctattt 34861 ctggcatggt ccttgtaggc atttgagttt gccactcatg gatagggcct ggcatggcaa 34921 actgcctgga ccttccgcac tgctccctac catcaccctc caatcccagt tcctgggagg 34981 gaatcccggg tctcattcca cccatttctc ctccccagca ccttccccag ttcctgccca 35041 ccatcatctt ctcgcttctg gctcactgca cccatctctt ctctgttctc cgtgtataca 35101 gctgtaatca ttatctagtt aattgctccg actctgagag agtaataaaa tttctcttat 35161 ctcaacacaa agtctttggt tacagcctgg acaagtttcc tggattaagc tccaaggtgc 35221 caggaagtct ggaacaatat ctccatatca aaaacaatga agcagctggc acaggggctc 35281 acgcctgtaa tcccagcctg ggcgacagag cgagtccttg tctctaaaca aaacaaaaca 35341 aaaactgctg ctgccatagc acaggtctca tttaggaatg gttttgtcaa cttgcgggac 35401 aaatgagctc aaactcagta taaatggcca gctgggggat gccatctctg ctgctctgta 35461 tcggtttaca tgaggactta gctgctagaa caaaaaagca ccaacatatg tggtttaaag 35521 aacacagaat tgccaggcat ggtggctcac gcctgtaatc ccagcacttt gggaagctaa 35581 ggcggatgga tcacttgagg tcaggagttc aaaatcagcc tggccaacat ggtgaaatct 35641 cgtctctact aaaaatacaa aaatattagc cgggcatggt ggcgggtgcc tataacccta 35701 gctactcagg aggctgaggc aggagaattg cttgaacctg ggaggtggtg gttgcagtga 35761 ggtgagatcg agccactgca ctccagcctg ggcaagactc cgtctcaaaa taaataaata 35821 aataaataaa taaaaacaga atttacatac caacggatct caatataggg gtcttatttg 35881 gatcttgata taaacaaata ttttaaaatt tatgactttt tttttttttt ttttttttag 35941 acaaggtctc actccagcct gggtgaggtg acagagtgaa actatgtctc aaaaaaaaaa 36001 aaaaaaaaga gtccttatat cttagaaaca cataaacaca tactcatttt ttttttttgg 36061 tggcatggtg tgatgtatag tgtattagct atctattgct aagtaacaaa ttttaccccc 36121 aaatgtagca gttcagttga ctttctcttg agagggagtc tcattctgtc gcccaggctg 36181 gagtgcagtg gcacgatctt ggctcactgc aacctccgcc tcccggattc aagcgattct 36241 cctgcttcag actcccgact agctgggatt acaggcgccc gccaccacgc cctgctaatt 36301 tttgtatttt tagtagagat gggatttcgc catgttggcc aggctggtct cgaactccta 36361 aattcaggtg atccgccagc ctcaccttct caaagtgctg tgattacagg catgagccac 36421 ggtgcccggc cacgcagttc agaacaatac ttatttatta tctcagtttc tgcgagtgta 36481 gaatgtgtgg atggttcagc cctgccacaa tcacacagcg ggtgggaaga ccacaggagt 36541 tgcgggttgt aggtgggaag gggacctgcc ctcgggcgga ctttggtgcc gctctgctgc 36601 catcttgtgg cgggcacaga aacaactgcc ctttccacct ggtccttccc atactctaac 36661 cattcttggt attttagaac ttgctattca aaatttttag tttttattct cccagttgct 36721 gaacagatga taaagtcaca tggtttaaaa ttcatgagga taaacagggt cttcctctcc 36781 ctggggactc taggtttctt tccaggggaa ctattctatg ttatttcttt ttttcttttc 36841 tttttttctt tttctttttc ttttttgaga tggagtctcg ctctgttgcc caggctggag 36901 tgcagtggtg cgaactcggc tcactgcaac ctccgcctcc tgggttgaag cgattcttct 36961 gcctcagcct cctgagcagc tgggattaca ggcgtgcacc accacacccg gctttttttt 37021 tttttttttt tttgtatttt tagtagagaa ggaagtttta ccatgttagt caggctggtc 37081 tcaaactcct gaccgcaggt tatccgcccg cctcggcctt ccaaagtgct gggattacag 37141 gcatgagcca ccgcgcccag ccacttactt gtttcttgtg tatctttctg ggaaaagttc 37201 atgtatatac aaaagcaaac aacacacaca tacacgacca acaacaacaa gagcaccccc 37261 caaaacatgc ccagcctttc ccagtaataa aaggataatt tgttcctggc caggtgtggt 37321 ggctcacacc tgaaatccca acattttggg aggctgagac aagaggacca ctgaagccca 37381 ggaattggag accagcctag gaaacacagt gagatccctt ctcaacaaaa ataaaaataa 37441 aaaaattagc tgggcgtggt ggcccatgcc tgtggtccca gctactggga aaggctgagg 37501 tgagggaatc acttgggcct gagaggcaga ggttgcagtg agccaggatt gcaccactgc 37561 actccaccct gggtgacaga atgtgaccct gtctcaaaaa taaaaaataa ggccgggtgc 37621 ggtggctcac gcctgtactc ccagcacttt gggaggccga ggcaggcgga tcacgaggtc 37681 aagagatcga gaccatcctg gctaatgcgg tgaaaccctg tctctactaa aaatacaaaa 37741 aattagccag gcatggtggc gggtgcctgt agtcccagct acttgggggg ctgaggcagg 37801 agaatggcgt gaacccggga ggcggagctt gcagtgagcc gagattgcgc cactgcactt 37861 gggcctgggt gaaagagcca gactccatct caaaataaat aaataaataa taaaaaataa 37921 aaaaaggtta gttattatga tcctgccagc atctcagcta catgaatgcg tggagttttt 37981 cagtattgtt ccctgctgat ctccagtgtc tggaacagtg cctgacacac agtaggtgct 38041 caggaaatat atttcagttc gtaatgatgg gaaatgtcaa acacacaaaa acacagcaag 38101 tagaataatg aaccccctgt acctatcacc gagcctcagc aaacttgcct catctatacc 38161 ccacccacct tttcccctca atggattttt aaatcaaagc caaggcatca tcatatcctt 38221 tcatccatga atacttcaaa ttctatctct aagagacaac tccatatcac acctaaacaa 38281 ttaacatgaa ttccttaata caactaaata tccagtcagg gttcaatacc catttttttc 38341 cattgggtca tataatattt tttgtttgtt caaaacataa atatattatt tgattacttg 38401 ttcaaatcag tatcaataaa tactgttcca ggcatagtgg cacatcacga tagtcccagc 38461 cactcaggag gctgaggcag gaggacttct tttttttttg agacagggtt ttgctctgtt 38521 ccccaggctg gagttcagtg gtgtgatcat agctcactgt agccttgatg tccaggctca 38581 agtgatcttc ccacctcggc ctccctagta acttggacca caggtgcaca tcaccatacc 38641 tagctaattt ttttattttt atttttgtag agatgggggt ctcactatgt tgcccaagct 38701 ggtcttaaac tcccaggctc cagtgatcct cctgtctcgg cctcccgaag tgctgagatt 38761 acaggcgtga gccactgtgc ctggtcaaat gtgactacta cttttttttt tttttttttt 38821 tttagacagt ctcactccat tacccaggct ggagcacagt gtttcgatct tggctcactg 38881 caatctctgc ctcctgggtc aagcaattct tgtgctttag catcctgagt agctggaatt 38941 acaggtgtgt gtcaccacac tcggctaatt tttggatttt tagtagagac agggtttcgc 39001 catgttggct ggctggtctc gaactcctga cctcaggtga tctgcctggc tcagcctccc 39061 aaagtgctgg gattagtggt gtgagccacc gcatctggcc caaatgcgat ttcttgaagc 39121 tctcccttac ctcaaagaac agagtctggc tcttccattc tgacagtaag cacaggtcta 39181 tcattgcctt cagttgcctt caggaccact ggggtctgtt gcagtcactt taggggattg 39241 agtctttggc cacctctcag agtccttgaa cagtctcagc cagctgctta ggttgaatga 39301 tcaaagcagc aagtcctggc ttcactgaac caaaaatgtc ccttccagaa ttggaagctg 39361 ttctggtgtt ttctgcacaa aagtgtaaac atagattcaa gttccagtgc tttctgcatc 39421 caacagaatt ccatagaagg ccctggctct gacctcgatg gaatgcatgg aagacaggaa 39481 gatactctta aacccagcct gctgcaaaca caggttactt ccagcacgaa aaagttacac 39541 ccgaggaggt ggccattctc ttctcaagag gtggtgagtc taggcccttc ttgcagaagc 39601 agagctgcac atacacatct tttttttttt tttttttttt aagacacggt cttggtctgt 39661 tgcccaggct ggagtgcagt ggtgcaatct cagccttctg gtttcaagca gtcctcccgc 39721 ctcagcctcc tgagtagctg ggactacggg tgcatgccag cacgtctggc taatttttgt 39781 attttttgta gagatgaggt tttgccgtgt tgcgtatgct ggtcttgaac tcctgggctc 39841 aagctatctg cctgcctcag cctcccaaag tgctgggatt acagacgtaa gccatcatgc 39901 ccggccaaat tccttatcct tatctctttc tctctctttt tctttttctt tttttctttt 39961 ttgtttgtta agacagggtt tcactctgtt gcccaggttg gagtacaaga ggcaggatca 40021 cagctcacag cagcttcgaa ctcctgggct caagtgatcc tccctcctca gcatcctgag 40081 tagctgggac tataggtgtg ccactacacc tggccaattt ttaaattttt tggagatagt 40141 gtcttgctgt gttgcccagg ttggttttaa ctcctggcct caagcgatcc tttcactggc 40201 ccctcccaaa gtgttgggat tataggaatg agacactgca cctgacctcc ttattattct 40261 cttttttttt agacggagtc tcgctctgtc acccaggctg gagtgcagtg ctgcgatctc 40321 ggctcactgt aacctctgcc tcccaggttc aagcgattct cttgtctcag cctcccgaga 40381 agctaggatt acaggtgcac gccaccacgc ctggctaatt ttgaattttt agtagagaca 40441 gggttttacc atgttggcca ggctgatctt gaactcctga cctcaggtga tctgcccacc 40501 ttggcctccc aaagtgctgg gattacaggt gtgagtcacc gcacccggcc ccttctcttt 40561 taaagtagaa aaaaaaaagc cacccaccag agacataaat ggcaatgcat gcatcaggca 40621 tggtgctagg aactttatac acatgtaatc ctttcaaaaa tagctctttt aagtaagaaa 40681 aatatgaccc ctccccatat taaatgaaca agggatatga acatacaagt aatttattca 40741 attaacaaat atttattgag cacctactat attttaggca ctgtgctaaa atcaattcca 40801 atggtcaaca aatatgtgaa caaatggtaa tcattattga aggaatgcaa atgtacaaaa 40861 agtgactttt ccatttttca cctattcaat ggtctatatt caaaataaat gattctccag 40921 gattggatga gcgtggcttt gggatc // LOCUS AC002985 38041 bp DNA PRI 22-OCT-1997 DEFINITION Human DNA from chromosome 19-specific cosmid R27090, genomic sequence, complete sequence. ACCESSION AC002985 NID g2443868 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 38041) AUTHORS Lamerdin,J.E., McCready,P.M., Adamson,A.W., Burkhart-Schultz,K., Garcia,E., Kyle,A., Ramirez,M., Stilwagen,S., Garnes,J., Danganan,L., Bruce,R., Quan,G., Montgomery,M., Ow,D., Kobayashi,A., Olsen,A.O. and Carrano,A.V. TITLE Sequence analysis of an ~1 Mb region containing the MEF2B gene in 19p13 JOURNAL Unpublished REFERENCE 2 (bases 1 to 38041) AUTHORS Lamerdin,J.E. TITLE Direct Submission JOURNAL Submitted (29-SEP-1997) Human Genome Center, Lawrence Livermore National Laboratory, 7000 East Ave., Livermore, CA 94551, USA REFERENCE 3 (bases 1 to 38041) AUTHORS Lamerdin,J.E. TITLE Direct Submission JOURNAL Submitted (22-OCT-1997) Human Genome Center, Lawrence Livermore National Laboratory, 7000 East Ave., Livermore, CA 94551, USA COMMENT Map and sequence oriented from p telomere to centromere. Cosmid R27090 overlaps cosmid R32469 to the left and cosmid R31317 to the right. FEATURES Location/Qualifiers source 1..38041 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="R27090" /chromosome="19" /map="19p12 between UBA52 and D19S451" /cell_line="5HL2-B" /clone_lib="LL19NC03 R chromosome 19-specific cosmid library" /note="LL19NCO3 cosmid library constructed at LLNL from flow-sorted chromosomes from hybrid 5HL2-B, which carries chromosome 19 as its only human chromosome." gene complement(<202..12590) /note="clathrin-ordered protein; identified by sequence homology to Cricetulus griseus epsilon-COP mRNA (Z32554)" /gene="Epsilon- COP" /map="19p12" CDS complement(join(<202..354,4213..4313,6226..6288, 12465..12590)) /gene="Epsilon- COP" /note="clathrin -ordered protein; Identified through sequence homology to epsilon-COP product from C.griseus (87% identity; Z32554) and B. taurus (89% identity; X76980).~Human EST matches: AA143411, AA205346, AA312499, AA488261" /codon_start=1 /product="epsilon-COP" /db_xref="PID:g2443869" /translation="MAPPAPGPASGGSGEVDELFDVKNAFYIGSYQQCINEAQRVKLS SPERDVERDVFLYRAYLAQRKFGVVLDEIKPSSAPELQAVRMFADYLAHESRRDSIVA ELDREMSRSVDVTNTTFLLMAASIYLHDQNPDAALRALHQGDSLE" repeat_region 1392..1671 /rpt_family="Alu" repeat_region 1699..1756 /rpt_family="Alu" repeat_region complement(1764..2058) /rpt_family="Alu" repeat_region 2064..2226 /rpt_family="Alu" repeat_region complement(2762..3043) /rpt_family="Alu" repeat_region complement(3095..3219) /rpt_family="MLT1" repeat_region 3583..3821 /rpt_family="Alu" repeat_region complement(4798..5674) /rpt_family="Alu" repeat_region 5723..5998 /rpt_family="Alu" repeat_region 7500..7777 /rpt_family="Alu" repeat_region 7892..7970 /rpt_family="Alu" repeat_region complement(7998..8258) /rpt_family="Alu" repeat_region 8300..8471 /rpt_family="Alu" repeat_region 9272..10204 /rpt_family="Alu" repeat_region complement(10234..10412) /rpt_family="MLT1" repeat_region complement(10611..10904) /rpt_family="Alu" repeat_region 11160..11445 /rpt_family="Alu" repeat_region complement(11721..11961) /rpt_family="Alu" CDS join(12984..13098,13822..13945,14905..14990,15085..15206, 15568..15755,15846..15986,17475..17550,17865..17941, 18124..18221,19545..19619,21008..21096,21207..21278, 21370..21558) /note="Hypothetical 56kDa human ATP-dependent RNA helicase; Putative ATP-dependent RNA helicase of DEAD box family. Most similar (57% identical)to probable ATP-dependent RNA helicase Dbp45A (S38329)- fruit fly (Drosophila melanogaster)~Human EST matches: AA534472, H08289, AA464032, AA196836, AA569862, AA573466, AA464741, W46150, AA535538, AA488261, W46162, R00974, AA378518, R26194, R15246, AA359638, etc.~Mouse EST matches: AA271508~Drosophila EST matches: AA540379~Rat EST match: H35240" /codon_start=1 /product="R27090_2" /db_xref="PID:g2443870" /translation="MAGFAELGLSSWLVEQCRQLGLKQPTPVQLGCIPAILEGRDCLG CAKTGSGKTAAFVLPILQKLSEDPYGIFCLVLTPTRELAYQIAEQFRVLGKPLGLKDC IIVGGMDMVAQALELSRKPHVVIATPGRLADHLRSSNTFSIKKIRFLVMDEADRLLEQ GCTDFTVDLEAILAAVPARRQTLLFSATLTDTLRELQGLATNQPFFWEAQAPVSTVEQ LDQRYLLVPEKVKDAYLVHLIQRFQDEHEDWSIIIFTNTCKTCQILCMMLRKFSFPTV ALHSMMKQKERFAALAKFKSSIYRILIATDVASRGLDIPTVQVVINHNTPGLPKIYIH RVGRTARAGRQGQAITLVTQYDIHLVHAIEEQIKKKLEEFSVEEAEVLQILTQVNVVR RECEIKLEAAHFDEKKEINKRKQLILEGKDPDLEAKRKAELAKIKQKNRRFKEKVEET LKRQKAGRAGHKGRPPRTPSGSHSGPVPSQGLV" repeat_region 14112..14397 /rpt_family="Alu" repeat_region 16503..16734 /rpt_family="Alu" repeat_region complement(17052..17340) /rpt_family="Alu" repeat_region 18380..18802 /rpt_family="Alu" repeat_region 20044..20821 /rpt_family="Alu" misc_feature complement(21577..21650) /note="BLASTN similarity to Z41188 (216..289); match: 0.85, score: 3.5e-33; database searched: est; H. sapiens partial cDNA sequence" misc_feature complement(21611..21871) /note="BLASTN similarity to T16014 (1..261); match: 0.94, score: 1.1e-90; database searched: est; IB2449 Infant brain, Bento Soares Homo sapiens cDNA 3'end." misc_feature complement(21615..21825) /note="BLASTN similarity to Z41188 (42..252); match: 0.92, score: 8.5e-89; database searched: est; H. sapiens partial cDNA sequence" misc_feature complement(21799..21865) /note="BLASTN similarity to Z41188 (1..67); match: 0.94, score: 8.5e-89; database searched: est; H. sapiens partial cDNA sequence" CDS complement(join(22685..22876,24567..24653,24751..24867, 25218..25374,26166..26287,27506..27652,31595..31726, 31970..32126,32211..32224)) /note="Hypothetical 41.3 kDa human protein most similar to Vesl and GLGF proteins of rat; Residues 1-128 of hypothetical protein R27090_3 are 81% identical to Rat GLGF protein (U92079) and 83% identical to Rat Vesl protein (AB003726); both proteins are expressed in brain and upregulated during seizures.~Human EST matches: W42820, AA121538, W42730, AA127702~Mouse EST matches: AA407944, AA408331, AA013888, AA035853, AA212542~Drosophila EST matches: AA391781, AA202338, AA201147, AA202832,AA246370" /codon_start=1 /product="R27090_3" /db_xref="PID:g2443871" /translation="MSTAREQPIFSTRAHVFQIDPATKRNWIPAGKHALTVSYFYDAT RNVYRIISIGGAKAIINSTVTPNMTFTKTSQKFGQWADSRANTVYGLGFASEQHLTQF AEKFQEVKEAARLAREKSQDGGELTSPALGLASHQVSTPYSPMPAWAPVPPSPLVSAN GPGEEKLFRSQSADAPGPTERERLKKMLSEGSVGEVQWEAEFFALQDSNNKLAGALRE ANAAAAQWRQQLEAQRAEAERLRQRVAELEAQAASEVTPTGEKEGLGQGQSLEQLEAL VQTKDQEIQTLKSQTGGPREALEAAEREETQQKVQDLETRNAELEHQLRAMERSLEEA RAERERARAEVGRAAQLLDVSLFELSELREGLARLAEAAP" repeat_region complement(23068..23216) /rpt_family="Alu" repeat_region 23231..23507 /rpt_family="Alu" repeat_region 23533..24266 /rpt_family="Alu" repeat_region complement(26954..27247) /rpt_family="Alu" repeat_region complement(27925..28084) /rpt_family="Alu" repeat_region complement(28109..28363) /rpt_family="Alu" repeat_region complement(28377..28665) /rpt_family="Alu" repeat_region 28980..29921 /rpt_family="Alu" repeat_region 29950..30567 /rpt_family="Alu" repeat_region 30657..30931 /rpt_family="Alu" repeat_region 31163..31420 /rpt_family="Alu" repeat_region 32436..32693 /rpt_family="Alu" repeat_region 34254..34547 /rpt_family="TAR1" repeat_region complement(35537..35821) /rpt_family="Alu" repeat_region complement(36251..36361) /rpt_family="Alu" repeat_region 37314..37783 /rpt_family="Alu" BASE COUNT 8745 a 10624 c 10743 g 7929 t ORIGIN 1 gatctgcagc ctggtgcgca gatgggagtg cctcccacac tgggcagggg agctggagcc 61 tggacagcag gccagggcct catttcagca ccacctttct ctgctcccat gacgacagcc 121 agcgaggccg tgcgcagcct ggagatgtgg cagggggcac ttgcctccct gggctggccc 181 cagagcaggg aggccactca ccactccagg ctgtccccct ggtgcagcgc acgcagggcg 241 gcatccgggt tctggtcgtg gagatagatg gaggcggcca tgagcaggaa ggtggtgttg 301 gtcacgtcca cgctcctgct catctctcgg tccagctcgg ccacgatgct gtccctgcaa 361 gacagagatg ctcacgaccc acagctgggg tggcctctgt gggacctcag agggtcccct 421 tccttggaag cagacaggtc cagagagggt gagccaggcc ctgggggtgc agagcagcag 481 caggccgagc atcgagctgt cccgacatgt cctccctccg ttcctatgaa gcacctctca 541 tgctgggcca ggcaaggacc cgaggccggg agatgtgggg tggggcgctc caaggggaca 601 tgaggaggca ccgtcagtct cacaccaacc tcacccgcct tcctgaggag gctggtgggc 661 ccagctggga ctcctgacca taacccatca accctgtggg cttcaaggcc acgtcttcta 721 gcaggaggtc ggcctgggtg ccgaggggcc gtgcagaagg atgggagccc tggggcaccc 781 ccactgccag ggcagctgcc ctcagtccct gagagcccag ggtgctggct gcaccctgaa 841 gatggccagt gtgctgcggg ccagggccag gcttgagccc aggcaaggct gaggaccggc 901 tcagctccga gggcacaggt cacccatgcc tccatgtccc taactggacc cagaccctcc 961 attccccatg gctattgagc agctaaaaag agaaggcagg tgagggggcc accctagacc 1021 aaagctggtg ttcttctggg ttcaaacgtg gctctgtgac cccgtaagca tccacgcgca 1081 tctgggggct gtctgcccag ggcccgtgtc tgaggtgtca ctgagggcat ggcatgtctg 1141 ataccaacca ggctccactg agggctcgcc acagtgccag gtccaacaga caagtgactt 1201 caggtctgag tctgtttccc cgtgctgagg gaccatgtta cagggatcat gcaggggagg 1261 agcaaggagc tcccaagacc atcttacaaa gcttcctgag gctccagcac acaagggtgt 1321 ataaacactg ccagttgtcg tagcgttggt actgtaatgt tgtcatccta aggtgttaca 1381 attaccaatg gggccgggta ctgtggctca cacctgtaac cccagcgctt tgggaggcca 1441 aggcaggatc gcttaagccc aggagttcaa gaccaccctg ggcaacatag taggacccct 1501 ttttctacaa aaaagtaaga aagtcagctg ggcctggtgg cgcctgtagt cccaactact 1561 tgggaggctg aggcaggagg atggcttcag cccgggaggt tgaagctgcg gtgagctgtg 1621 gtcacacctc tgcactccag cctgggtgac agagggagac catctcaaaa acaagcaaca 1681 gcaacaaaat caattactgg gcggatcgct tgagcccagg agtttgagac cagcctgggc 1741 aacatggcaa aaccccatct ctcttttttt ttttttgaga cggagtctcg cactttcgcc 1801 caggctggag tgcagtggtg cgatcttggc tcactgcaag ctccgcctcc cgggttcatg 1861 ccattctccc gcctcagcct cccgagtagc tgggactaca ggcacccgcc accatgcccg 1921 gctaattttt tttgtatttt tagtagagat ggggtttcac gtgttagcca ggatggtctc 1981 catctcctga cctcgtgatc cgcccgcctc ggcctcccaa agtgctggga ttacaggtgt 2041 gagccactgc gcccggccag gcaaaacccc atctctaaaa aatatacaaa ttagccaggc 2101 atggtgatgc gtgcctgtag tcccagctat tcgggaggct gaggtgggat gatcacttga 2161 gcctgggaga tcaaggctgc agtaagccaa gattgtgcca ctgtatgcca ggctgaacga 2221 cagagcaaga ccgtatctca agaaaacaaa atcaaaacca aaaccaatta ctaatggatg 2281 ctccttgtcc ccggggcgtc ctgccaatga ggtcgcggct gtgcagccag atgaacggac 2341 cacaagcccc tgcctggacc gcctgctctc cggggctgct tttctgtgat gctcacagca 2401 gctgctgctc ctgggggacc cagggttccc tgccttcact ccagctggtg aagtaagtgc 2461 agggtgggag aaacagtggc cctgggaggt cacctgtctc aatgtcctgg agctgctgtg 2521 gcaaagcacc acagcccggg cagttaagga caacagggat tcgttcctca cagctctgga 2581 ggccgtaagt ctgaagtcaa ggtggcggca gggctggctc cagctggagg ctggagggca 2641 aggccgtccc aggcctctct cccagctctg gtggctgctg gccactcgtg gtgctccttg 2701 gctagcagac atgtcactcc agtctgttgt cacatgggct taactctgtg ggcctctttc 2761 cttttttttt tttttttttt aagacagtct cccgctgtca cccaggctgg agtgcactag 2821 tgcgatcttg gctcactgca acctctgcct gccaggttca agcgattctc ctgcctcagc 2881 ctcccaagta gctggaatca cagacatcac atcctgttaa ttctgtattt ttaatagaaa 2941 tggggtttca ccacgttggc caggctggtc ttgaaaactc ctgacctcag gtgatctgcc 3001 cacctcgtcc tcccaaagtg ctgggattac aagtatgagc cactgcaccc agccctcctt 3061 ttatctctta aaaggacaca agtccttggc tttggggccc accctaaatc cagcatgtct 3121 tcatctcctt atctcttaat ctgatcggca aagacccttt ttccaaatga tatcatattc 3181 acaagcatgt gggataggat atggacgtac cttttggggg acaccattca ccccaccatg 3241 ctgcctctgc acaggagtca gctcccagga accccaggtt tctttgagtg aagggaggcg 3301 gtagttggga cactgcctcc gtgggccaag gaagagcagg atcacagcag tctccccatg 3361 ccagaccctc actccgaccc cacagcagga gggggcaagg ttcctgggtc cccacagccc 3421 agccccctct ccccttggct gccgcccatg ggcgtttgac caacgctggc ctgtggctca 3481 ggggcaggac aggcgtggcc tcacccaccc tcccactggg ctccagggga gatactggca 3541 ggttgcagtc cccaactagc tgggactaga aaactgatag gatttgggag gctgaggtgg 3601 gcagaccgcc tgagctcagg agttcgagac caccctgggc aacacggtga aacccagtct 3661 ctgctaaaat acaaaaatta gctggacgtg gtggcacgca gctatagtcg tagctactca 3721 ggaggctgag gcacaagaat cacttgaacc caggaggcag aggttgcagt gagccgagat 3781 ctcaccactg caactccagc ctggtgacag agtgagactc catctctaaa taaataaata 3841 agaaaactga tgggagggcc ctgcccccgc tgcgccctgg gctgtcttgg ggctctgggt 3901 gttgcagagg gctgctcggt aattgatatt ttcagttcta ggttcagaga gtgccttgtg 3961 gtccatgagg gcgtgtgcac acacagcact gtctctgacc ctcccctttg aagcaggatc 4021 catcaccagc cccccactgc tgtgcaggtg agggaaactg aggcacaaag cagcagggga 4081 gcagagggga gaggcgccta atctgaggga tgctgtgcgc tgcacgccat gcccccaccc 4141 acccaggttc ataaccaaca ggccttgaag atggccccgc gcctcaggcc aacccattct 4201 ctggggcctc acctccgact ctcgtgggcg aggtagtcag caaacatgcg cacggcctgg 4261 agctcagggg ccgaggaggg cttgatctca tccaggacca caccgaactt cctctgcagt 4321 agggacgagg cgtcagctgc acccgtccac cccagaggat ttccatgtct tcagcttggc 4381 agggagagcc ccagacagga agtccctgcc ccaccaggga cccctcagcc cagcatgggc 4441 cacctccact gcagtacact gtggagggtg gcatggcgtc acctgtcgtg agcagccagg 4501 tgtgccctca tctgcagggc cacactgccc atgccttgag ctagggaccc tggtaggtta 4561 ctgtctatcc cctggcagct gtccacctct ggtggtgagg tgaagggcgt gtcacctggg 4621 ggcagggcag cagcagttat gggggttgtg acatgcgctt ctataaagca ggacatgccc 4681 ccagccagcc ctgcccacct actggccctc tgaggatgcc cagactgata ccactgtcac 4741 atctgcagct gggggcccgt ccctcattgt gttccctcct agatggttcc ttccccattt 4801 tttttttttt cgagacggag tctcgctctg tcgcccaggc tggagtgcag tggcgtgatc 4861 tcggctcact gcaagctcca cctcccaggt tcacaccatt ctcctgcctc agcctcccga 4921 gtagctgaga ctacaggcgc ccgccaccac gcccggctaa tttttttgta tttttagtag 4981 agacggggtt tcactgtgtt agccaggatg gtctcaatct cctgaccttg tgatccaccc 5041 accttggcct cccaaaatgc tgggattaca ggtgtgagcc accacgcccg gccttttttt 5101 tttttttttg agatggagtt tcgctcttgg tgcccaggct ggagtgtaat gctgtgacct 5161 cagctcaccg aaacctctgc ctcccaggtt caagtaattc tcctgcctca gcctcccaag 5221 tagctgggat tacaggcatg caccaccaca cctggctaat tctgtatttt tagtagagac 5281 agggtttctc catgttgagg ctggtctcga actcctgacc tcaggtgatc tgcccgcctt 5341 ggcctcccaa agtgctggga ttacaggcgt gaaccaccgg gcccagcccc cattcttatt 5401 tttttagaga caaggtcttg ctatgttgtc caggctggag tgcaatggcg tgatcacggc 5461 tcactgtaat cttgacctcc tgggctcaac tgatcctcct gccacagcct cccaattagc 5521 tgggactcca ggcacatgcc accatgactg tcttattttt taattttttt ttgtagagac 5581 gggggtctca ctatgttgcc caggctggtc ttgaactcct ggcttcaagt gatcctccca 5641 cctcagcctc ccaaagatta caggtgtgag ccactgtgcc tggctcctac caattatttt 5701 aggaaaaaaa aaaaaatcgg ctgggcgcgt tggctcacgc ctgtaatctc agcactttgg 5761 gaggctgagg caggtggatc atgaagtcag gagttcaaga ccaccctggc taatgtggtg 5821 aaaccacgtg tttattaaaa atataaaaat tagccaggcg tggtggcgca cgcctgtagt 5881 cccagctact tgggaggctg agacaggaga atcacttgaa cccaggagac ggaggttgca 5941 gtgagctgag atcgagtacg acagagtgag actccgtctc aaaaaaaaaa aaaaaaaaat 6001 caagaagctg tgacccaggc cccctggcct ggcagtctca ggcctccgtg acacagccac 6061 cttcacatgc tggctgcgct ggctgttcac agtcactgaa cactgcaggc ctgggtccct 6121 gctttgttta ctgtccccgc cacccactct gtggtgcctt cctgtcccca gccccaccct 6181 ctactctgtt catcactctt gcccccggcc aaggcccgca ctcacctgcg ccaggtacgc 6241 tctatacagg aagacgtccc tctccacgtc tctctctggg cttgatagct gtgggaacca 6301 atgtgagtca ggacgcacag tgggaggccc agcgcagccc acacagggtg agcatgacag 6361 acaggcccct gtacactggg gacaggtggc ttctccggca gggttggagg tctgagtccc 6421 agccctgatc ctccgtagcc cgagtggcct ctgctggggg taccctacac ggggggtgcc 6481 tgatctcatc ccgcgctgac accaccccag gaaggaggct tgctcccaag ttcagtgtct 6541 gccccagtca cacttaagaa gccagggtgg cacccgcacc tctaacctct ggacatgggg 6601 ccactgtggc ttgattctgt tctgcttttg tctcaagaca ttttgagctc agtttggctg 6661 ccccagtgtg gtgacatctg tgcccagccc tggccagggg agaccctggg aatggctctg 6721 tacttgggcc acacccccac agcccagcca ttgcacagaa ctgaggagca agagggaggt 6781 aacctgtggc aacatccatt ttgggatgat aatgaagctg cccctgagcc caggccaggg 6841 agggaggctg ggaccgctct ccagaacaca gctttgcatg ctgcccccgg aggctggctg 6901 acttcccagg gtgcccatgg caggtgggca cagtgcagct cagcccatgg cctgagggtg 6961 ctcgcccagg cccctggggg cttcgagagg aaagtacagt tgggaagggg cccagccacc 7021 ttggcatcgc cactccaaca acctctaaac agaggggact gagaaggctg tcagcaagag 7081 ggcctgccag ctgtcaaggc acagggtcaa aggacccagt gcccaaacca gggggagtca 7141 ctctgccctg tcctctgaag ccgagaacca cagattccaa gccacatctc ttccttctcc 7201 ccagcctgag ggaggagcgt ggatgtcaaa cgtgtcctca tcccaggccg ccgcagacct 7261 ggccccagcc tgactcaggc tgccattgcc aagaggaccc agcagcttct gctcaggccc 7321 tggtagccag agtgggtatc tggctatgat ttggaacaag atgccatcta aataaataac 7381 tcctagaaga atgtcaaagg catgcggagg gtggtggctg cagaaatcga gccccacctg 7441 caaatgggat ggacaggtga aatatgtacc ttttcaagta tgcaaagaaa catattatag 7501 gccgggcatg gtagctcatg cctgtaatcc cagcactttg ggaggccgaa gtgggtggat 7561 cattagaggt caggagttcc agaccagcct ggccaacatg gtgaaacccc gcctctactt 7621 aaaaatacaa aaattagcca ggtgtggtgg cgggcgccta taatcccagc tactcaggag 7681 gctgaggcaa gagaatcact tgaacccggg agatagaggc tgtagtaagc cgagatcacg 7741 ccattgcact ccagcctagg tgacagagca agactccatc tcaataaaaa ataaataagt 7801 aactctcaat caaccttctg aggtggactt cagttaggtg actagcccag ggccacatgc 7861 caccctcagt taaaaaagac aggccaggtg tggtggctca cacctgtaaa cccagtgctt 7921 tgggaggcca aagcaggagc actgcttgag cccaggagtt tgagaccagc ataggcaaca 7981 tagcaaaaag catagtcttt tttttttttt tttttttgat ggagtcttgc tctgtcaccc 8041 aggttggagt gcagtggcgt gtatctcggc tcactacaac ctctgccgcc tgggttcaag 8101 agatttttct gcctcagtct cccgagtagc cgcgactata tgtattttta gtagagatgg 8161 ggtttcacca tattggccag gctggtcttg aactcctgac cttgtgaacc agccgcctcg 8221 gcctcccaaa gtgctgggat tacaggcatg agccaccgtg cccggtctga gaccctgtgt 8281 ccataaaaaa attataaata ttagccaggt gtggtagcat gcacctgtag tcccagcgac 8341 tcaggaggtt catgaaggag gatcgcttga gcctgggagg tcgaggctac agtggactat 8401 gatcatgcca ctgcactgca ctctggcctg ggtgacagag tgagaccctg tctatcttta 8461 gggaaaaaaa agagacaaaa gctggctgtt gggctttcaa agtgaggaca cagtggggcg 8521 ccatcggggg caccagggtg gcctcagccg cccctgtgtg aaatgcaggt gcccaggaga 8581 ctgtgcgtgc ggaagtgctc agaggctgca gagtgcactg gagagacaag agtgccggga 8641 agcataacaa caacaaatgt gcagaccgtg accctgacgt gaggctctgg aaccgttgtg 8701 gttttcctcg acttccaggg acgatcctac catcaggacg catgtgtgtc cccggggtgg 8761 gagagcattg ctcttcaacc ccaagggctc actacgatca cctgggagcc caggaaccac 8821 caaagccagg aggccacatc caccaaggtc acctggccct aggatggccc cagcactggc 8881 cccaggtggc ttaagatgtg gccatggttg aggaatcagg gtactgtttg ggaaggcggg 8941 tgacaggatg ggggccctgt agccagaaac cttgggggta gtaactcctc ccttctgtcc 9001 ctgtagccag tgatctgacc gcactgggcc gtcagcaact caaggcctat gagctgcttc 9061 aaccccaaca gctgggcggt gcctgccatc ttgtgggcaa actggcaaat gtcagaaacc 9121 cagaaacgtc atgatggtag cctagtgctg ggggcagcaa agggcaaaga ctgccaggtg 9181 ctctccctgg ggccacctca ggtggggctg aggttgctca ggccactgtc atgaactacc 9241 acagaccaga tggcttaaaa aatgtattcc tcaaaaatta gccaggtgtg gtggcgcacg 9301 cctgtaatcc cacctactcg ggaggctgag gcaggagaat cgcttgaacc caggaggcag 9361 aggttgtggt gagctgagat catgccacta cactccagcc tgggcaacag agcaagactc 9421 catctcaaaa acaaaaaaac aaaaattagc tgggcgtggt ggcaggcacc tgtaatccca 9481 gctactctga aggctgaggc aggagaatgg tgtgaaccca ggaggcggag gatgcagtga 9541 gctgagattg tgccactgca ctccagcctg ggtgacagag ttagactctg tctcaaaaaa 9601 aaaaagtatt cctggccagg cgtggtggct cacacctgta atcccagcac tgtgggaggc 9661 tgaggcgggc agatcacttg aggtcaagag ttcgagatca gcctgaccaa catgataaaa 9721 ccctgtctct actaaaaata caaaaattag gccaggaacg gtaactcatg cctataatcc 9781 cagcactttg ggaggctgag gcaggcggat catgaggtca ggagttacag acgagcctgg 9841 ccaacacagt gaaaccccat cactaggaaa aatacaaaaa ttagctgggt gtggtggcac 9901 acgcctgtag tcccagctac tagggaggct gaggcaggag aactgcttga acccgggagg 9961 cagaggttgt ggtgagctga gatcgtgcca ctgcactcca gcctgggcaa cagggcaaga 10021 ctccatctca aaaacaaaaa aacaaaaaaa aaattagctg ggtgtggtgg tgggcacctg 10081 taatcctagc cacttgggag gctgaggcag gacaatcgct tgaacccagg aggaggagga 10141 tgccactgca ctccagcctg ggtgacacag agtgagatcc tgtctcaaaa aaaaaaaaaa 10201 aaaaaaaaaa agtattcctt cacagtcaca gtcctggagg ccagaagtcc caaatcaagg 10261 tgtcagcagg gccacactcc ttccaaaggc tctaggggag gatacttccc tgcctctccc 10321 agcttttggt gactgttaca ttcctcagtt tgtagccaca ttactcccat ctctgcttcc 10381 atcatctctg tgtctcaaat ctccctctgc cttttttttt ttctttttcc agtggcgggt 10441 tctggtgaag acaagaaagc ctactagaaa cattcttttt ttcttttttt tttttttttt 10501 ttgagatagg gtctcacttt tgacatagta agagcaccaa catctcagac aaacactgcc 10561 actttaagtt ccagccccct ttctagcctc atgaatttta aggaaatttc ttttttcttt 10621 ttttttaata gagaagtctc gctcttgttc cccaggcttg aatgcaatgg ctggatcttg 10681 gctcactgca acctccacct cctgggttca aacgattctc ctgcctctgc ctcccaagta 10741 gctgggatta aggtgcctgc caccatgccc ggctaatttt tgtatttttt agtagagacg 10801 gggtttcacc aagttggtca ggctggtctc gaactcctga cctcaggtga tccgcccgcc 10861 tcccaaagtg ctggaattac aggtgtgaac caccgcgccc ggccaggaaa tcacttctaa 10921 ctacaagcag ccagaaagag gagacagtaa aacacagaaa gacagctcgg acagagagag 10981 agtgggaaga aaatttcttg ggtaactgcc aaacttcacc ttcatacaac gggccccagt 11041 aaaacagtgg gccttaataa gcacattcct ttcccttcag gtgcactaaa ataggcaagc 11101 taaaagcaga gtaggggggt atgcctgcag ctgcagaaag atgtatggga ggctgggcac 11161 ggtggctcac acctgtaatc ccaacacttt ggtaggccaa ggcgggcaga tcacaaggtc 11221 aagagatgga gaccatcctg gccaacatgg tgaaaccccg tctctactaa aaatacaaaa 11281 attagccggg ccatggtggc gcgcgcctgt aatcccagct actcgagagg ctgagaaagg 11341 agaattgctt gaacctgggg gcggaggttg cagtgagccg agatcacgcc actgcactcc 11401 agcctggcaa cagcgcgagt ctccatctca aaaaaaaaaa aaaaagaaaa gaaaagaaaa 11461 gaaaaaagaa agatgtatgg gaacagacac acaactctcc ctcccagata agcacaacga 11521 agagacacag aagcagttca agcctctgat aaactctccg accctgaatc cttaaaaact 11581 cttagtctgt aaaagagtgt tactctgacc caactccgcc agaaggcgcc tctcaggttt 11641 gttttctcta aaataaacct gtcttgactg gcaagccacc tttcgtgttt ctttcctctt 11701 taattcttac aactttgtca cccaggctgg agtgcagtgg cgcgatctcg gttcactgca 11761 acctcggcct ccgggttgaa gcgattctcc tgtctcagcc tcccggtagc tgagattaca 11821 agtgcccgcc accacgcccg gctaattttt ggtagactca tggtttctcc acgttgccca 11881 agctggtctc gaactcctgg gcgcatgcga tctgcccgcc tcggcctccc aaagcgccag 11941 aattacaggc gtttgagcca ctgcgctggg aaactgtgtt tatcattctg agtccagcgg 12001 aatacagtgc ccgtccaaaa aagtgctcaa tgaagaatga atgaatgaat gggtaagctc 12061 tgggagaaat aaccaggctg gttgggcagg gaaacgtcct tctctgaact ttagggttct 12121 catccgtaaa atgaggataa gcaacagtct ccatctcacg ggaactgaag gtatggctga 12181 acaaaatagg cttattgtca ctacagtggc ataaacacct cccttactca aaacactttc 12241 aagggcgcag agagaacgct gtacgagtga gcagtaccgt aaactgttac tgctctgggt 12301 tctgtcacgg gacagtgact actcctcact gaactgtgga agaggtggcc caaaaaacgc 12361 gagaagaaaa gagaaagttt ttagacgcag aacgcggctc gcggccacca gaaagcgtcc 12421 tccgcctccg cccccagcgt ccccgcgccc ctgcgcggcc gcaccttcac ccgctgcgcc 12481 tcgtttatgc actgctggta gctgccgatg tagaaggcgt tctttacgtc gaacagctcg 12541 tctacctccc cggagccgcc ggaggccggg ccgggggccg gaggcgccat ttcgctgtct 12601 tctcaccagc tcctcttcct gaaagacacg tcagccggaa gcaagacacg ggcacgctag 12661 gaaatgtagt ttactttttc cgagcggcgg cgaagtaagc caatgggaag ctccacaata 12721 atatgcaagg gggcggaagg cgtaagtgcg tcacggagag catctcggga attgtagtgt 12781 gtttctgcaa gccaatggga tccggggaaa catgataagg atgcggtcga ggctgggaat 12841 aggctgaggg agctggagag ggtgggacca cgcaggaggc agatccaata aacaggaaga 12901 tttttctcgt gacgtcgtcg gcgcgcgccg gaagcgcgga tcacacgggc ccctacaagg 12961 ggcccctaca agcggccaca aggatggcag gcttcgcgga gctcgggctg tcatcgtggc 13021 tcgtggaaca atgtcggcag ctgggtttga agcagcccac gcccgtgcag ctcggctgca 13081 tccccgccat cctggagggt gagtatggcc cagggccttc cccaagaggc ctctccgctc 13141 ctctcgactc ctttcccttc tcgcaacctt tgcttattcc ggtcccacga ggcgaggggc 13201 aacctcgggc gttacaggag atccggggtt ccggaggacg actggacagt gttggcattt 13261 aggttgcatg actgactcgc tttgtgttcg tacgcaagtg atgtaatctc tgaaattttc 13321 atagtctcag cacagggagg gtttgatgga tgactagtcc gatagtatcg caaaatcctg 13381 cagacctggt actgctatct ttattttaca aatagaagtt gaggctgtca aaattgctgg 13441 tagatccagg actccttggt cctgtgattg ctcctctagg agaaggtctt ctctctgctc 13501 ctctagccca acccagtttc gccattaaag agaggcccac aggaagccta agatgtgata 13561 agtcagggat taaataaaca aatgaagggg tcaacaggga gcacaagatg catatgaggg 13621 aagggaattc tggattcccc aaattcacgt gctcctggac ggagttagaa cgaggtggta 13681 ggaagctgtg ggcatctcgg gtcaaatgga tatgggttcc agtccagcct gccactcact 13741 ggcagtgtgt cctgggcaag gtctctgaaa cccagctgtc cacacatgga gggtgttgat 13801 gctgctttgt ccccaaccca ggtcgagact gcttgggctg tgctaagaca ggcagtggga 13861 agacagcagc gtttgtcctt cccatcttgc agaagctgtc tgaggatccc tatggcatct 13921 tctgcctcgt cctgacaccc accaggtaag cccccagcag gcctcctggg tatgggttaa 13981 ccagctgtgc tctcttgggc aagcaccttt ccctctctga gcgccctttt cccacgtgtg 14041 agatgagcat gaaaatcttg ctacccgctt cttaggggtc ctgcagaatt aaacaagatg 14101 gccccggtgc agtggctcac acctgtaatc ccagcacttt gggagctcca ggtgagcgga 14161 tcacttgagg ccaggagttt gagaccagcc tgaccaacat ggagaaaccc cgtctctact 14221 aaaaatacaa aaaattagct gggcatggta gcgcatgccg taatcccagc tactcgggag 14281 gccgaggcag gagaatcgct tgaacccagg aggcggaggt tgcggtgagc caagatcgtg 14341 ccattgcact ccagcctggg caacaagagc aaaactctgt ctcaaaaaaa aaaaaaagaa 14401 agaaagaaag aggcaggcag cagaaagggt actgggacct gggcatattg gagccagttt 14461 gagaaaggcc ttgaatgcca ggcggagggt tagaattgat tctgggggtg tgtgagcagg 14521 aaaggtcatg agccagggag aggccaggac agagttggtt gggggcagaa aggaggctga 14581 tgcttgagac cctctggctg gggcaactgg ctggtggtga ggccattgct gaaaggcgaa 14641 ctgggggaag gcacaggttt gggataggaa gagatgctga ggtcagcttg ggtcgggctg 14701 gtgaaggagg gatgttccag gctgagcctg gcgagggggc aggggccaca atggcctcca 14761 aggctcagca tcagagcctt tgtgaacagg actgggcaag tcaggagcct gggggctcag 14821 ggaggaaccc tggggatgga gaggggaatg gggggcaggt ccagaaccac tacctgaccc 14881 agcctggccc cttcacaccc acagggagct ggcctaccag atcgcagagc agttccgggt 14941 cctggggaag cctctagggc tgaaagactg catcatcgtc ggtggcatgg gtacgggagc 15001 tgggaggcgg gggaagcccc agcatgggac cctagcagct ttggaccacc tgcccagccc 15061 tgcctctcat gctctgtccc ccagacatgg tggcccaggc gctggagctc tctcggaaac 15121 cacacgtggt catcgccacg ccggggcgcc tggcagatca cctgcgcagc tccaacactt 15181 ttagtataaa gaagatccgc ttcctggtga gttcgccccg cccctgcaga cctcaggagc 15241 tgggctcgga gcctccaggc ccaatgtcag agcctggggc accatctcat ccatttagga 15301 aacgtatgtc cgttagactg tccagtaaaa cgctaggaac agtgacaagg accgttcaag 15361 gaccaacaag agggggccca gtcctagcaa tgttattcgg gggtagggcc ttgctgagac 15421 ttgggtgagg ggcaggcctc ccttggaagg agggtactcc agggcctagg gtccatgtcc 15481 atgcctctca aagaggcctg cttcaggggt ggccaagacc cggggccatg gacggcaccc 15541 tcacccctgc ccccattggc acggcaggtg atggatgagg cagaccggct gctggaacag 15601 ggctgcactg acttcaccgt ggacctggag gccatcctgg cggctgtgcc ggcccgcagg 15661 cagacactgc tgttcagcgc cacgctgacc gacacactcc gggagctgca gggtctggcc 15721 accaaccagc ccttcttctg ggaagcacag gccccgtgag tccacagccc agacagcgtg 15781 gggagggcag ccccatccta cagacaggga cactgaggcg tggcggtcta tctgtccatc 15841 cccagggtga gcaccgtgga gcagctggac cagcgctacc tgctggtgcc tgagaaggtc 15901 aaggacgcct acctggtcca cctgatccag cgcttccagg atgagcacga ggactggtcc 15961 attatcatct tcaccaacac gtgcaagtga gcggggcccg cctctcccct cccaccgccc 16021 ttcaaaggag gaggtggccc gacgtctttg gtctgggaca cacagccagt ctcagcactc 16081 cccagcctct gctcagcctg gaggtcatgg gggcttccct caaggagcca aggtccctca 16141 atctgaaggt gcagccggcc tttggggttc ctcagcccag ccgttgatgg aagggccagg 16201 aaacccctgc aagcagctgt ttgagcagcc catttgctgt gttctggggg tgtcactgag 16261 ctttcctggg cctttagtcc catctcagag gaagttgtga caactaacag gcctggtgtg 16321 cgtctgcaca gacagagacc cctggggagg gcagttcata tcttcattga attgtttggg 16381 aaggtcattg agaagctgga ggctcttcca gcatgagcca ggatagaagg gcagaaatca 16441 aggcgcttta aatgcgttac ctcccagaag agaaaagttg ggactgggtg cagtgactca 16501 cacctgtaat cccagcactt tgggaggccg aggtaggcgg atcacctgag gtcaggagtt 16561 cgagaccagc ctgcccaacg tggtgaaact tcatctcgac taaaaataca aaaattagcc 16621 aggcatggtg gcgagcacct gtagtcccag ctactgggga aggtgaggcg ggagaatcac 16681 ttgaacccgg taggtggagg ttacagtgag ccgagatcac gccactgcac tccatcctgg 16741 gggacaaagt gagactgtct cagggaaaaa gagagaaaag tttggctgca gggtggctgt 16801 caggcgtggc aggatccagc agcttatgtg atggtcccca tcctttctcc ctgtgcccct 16861 ctttctcctc tgttttcctc tgctggctcc ttcttggcct tcttggtaac agtgaccacc 16921 cctagcaagt cagtttccag ccttccagcc ctgcagtccc agggccaact cttgtcagtt 16981 ccagctccag ggagggatca gcagagagga catcaggcag gcaagccgtc tgtcaaccct 17041 catcctgcct cttttttttt tttttttttt tgagagtctc acgctgtccc ccaggctgga 17101 gtgcaatggc gcaatctcgg ctcactgcaa cctctgtctc ctgggttcaa acaattctcc 17161 tgcctcagcc tcctgagtag ctgaggctac agtcatacac catcatgccc ggctaatttt 17221 tgtgttttta gtacagatgg ggttttgcca tgttgggcag gctggtcttg aactcctggc 17281 ctcaagtgat ccacccgcct cagtctccca aagtgctggg attacaggca tgagccaccg 17341 tgcctggcct gctgcctctc ctttctaagc tgcatgaggc cactgtgcta ggtgtctcag 17401 gggcgggtgg gggcaggaac accttcccag aacctgagag ctggaggggt tgacaggcac 17461 atccttcccg ccaggacctg ccagattctg tgcatgatgc tgcgcaaatt cagcttcccc 17521 accgtggctc tgcactccat gatgaagcag gtgaggccac cctggggccc gccagcctca 17581 ccctgggata ccttccccgc ctcagacatt ggccccgatc cttccttctg ccgggcgcct 17641 tcttcctgcc acctggtctc ttctggtccc agaatatcgc agctcaagag gccctccctg 17701 accgcatctc ttttagccgt cattcatttg tcccggtctg acttctcatg gccaccttcc 17761 tcaatggacg gctggggact gtcccggcct ccaccctgtc ccgaggcctc acacagaaaa 17821 ccctgccttc atgcattccc aattttcttc ccaaccctgt gcagaaagaa cgctttgccg 17881 ccctagccaa gttcaagtcc agcatctacc ggatcctgat cgcaacagac gtggcctccc 17941 ggtgagcagc ccccagtctc ctgccaaggg cactccctct tttactacaa ggccccacag 18001 atgagaaggc tggcctcagg catgtcaggc agccctagca tccctgctga gtgaccctgg 18061 gtgagtcctt gcctcggttt ccccacatgg acagtggagc tgaccagcca cctctgcctc 18121 caggggcctg gacatcccta cggtacaggt ggtcatcaac cacaacaccc ccgggctccc 18181 caagatctac atccaccgag tcggccggac ggcccgtgca ggtgagcagt ggagggggag 18241 gccgagcctt gggcctctgt ccctccagcc tgcccagcaa attcaggtgg taggacgtgg 18301 gtagggtgca gcccacagat gagagctact cagggccatg tttgcaagtt ggggagttgt 18361 cctttctaaa gcaaaggtga atcccagcta ctcgggaggc tgaggcaaga agatcacttg 18421 aacccgggag gttgcggcaa gctgagaatg cactattgca ccccagcctg ggcaaaaaga 18481 gcaaaactct gtctccaaaa aaataaaata aaataggcca ggtgcggtgg ctcacgcctg 18541 taatcttagc actttgggag gccgaggtgg gtggatcacg aggtcaggag ttcgagacca 18601 gcctggccaa catagtgaaa cactgtctta ctaaaaatac aaaaaaatta gccaggcgtg 18661 gtggagggtg cctgtaatcc cagctactcg ggaggctgag gcaggagaat tgcttgaacc 18721 cgggaagcgg aggttgcaat gagtggagat cgcacaattg cactccagcc tgagcaacag 18781 tgtaagactc tgtctcaaaa aataataata aactaaaata agtaaataaa ataaaattgt 18841 aaaataaagc aaatgcaagt caggtgctgc cttcacccaa gccaggtgat gaactgagca 18901 gtgtggggct gccccctggg agggtgcaca cacagctccg gcccccacta cttgctgctg 18961 cctcgtcagg gaggggcatg gtgtccatga gatgctgtca gttctgaagc caggaaagat 19021 gtggttgctg gggtcggact gggcctggac cttctatcct gaccctgtgg tgctgggcag 19081 gtcacttcca ctccctagac ctcacatggt ggataatgag aatggtgacg tcctcgtggg 19141 gctgccatga gggtcacgga ccttgtccac catcagtgca gtttcggccc tagtcaacgt 19201 cacagaaaac agcaagcaga gatcttgggc tgaggcccgt cagggctaat ggctggcatt 19261 acttagactg ggttccctgg cagccaagcc aaaaagggaa atcgggcagt aactaactgc 19321 tgtgtaaggc cagactggca gacctgcctc tccccagcga cccgtcctcc acacagtggt 19381 cctgcttggg gtcaagccca gacactctgg gctgatccct cccaagccca agcccttgtt 19441 cagcatctcc aggacccctg agctgtgtag ctgtcaggat ctacctttat tcgccccatg 19501 tcctgacgcc cagcacatag cagataaccg gcacatcccc acagggcggc agggtcaggc 19561 catcacgctg gtgacacagt acgacatcca cctggtgcac gccatcgagg agcagatcag 19621 tgagtggggt tggggtgggt ggtagagaag gaggggtggg gtgggcagag gtgggcacct 19681 cggggtgaga cccattttcc cgtcagacac cagccggcct gctcccgtga gcagatttag 19741 gctggggctt ccttccctag gcacccccaa aagaagttct tttgcccatt tattgaaaca 19801 aactccccat agttgtcgcc agcagctgaa agccacatca gtcaggaggg gacttgggtt 19861 tcttagactg agatgtatgg aaggtccctg agatgtcaga ggccccattg acagggcctg 19921 gcctgctaaa aacccctggg aatggtttca tatgcagcca agacagagac acttcctaag 19981 gtatctctga tcactgcata gactgtcctg aaatttagaa acatctggac ttggccaggc 20041 acagtggctc acacttgtaa tcctagcact ttgagaggcc gaggtgggcg gatcacttga 20101 gcccaggagt tcaagaccaa cctgggcaac atggtgaaac cccgtctcta ctaaaaatac 20161 aaaaattagc caggtgtggt ggcgcacgcc tgtaattgca gctacttggg agggtgaggc 20221 atgagaatta cttgaacccg ggaggcaaag gttgcagtga gccaaggtcg cgccattacg 20281 ctccagcctg ggtgacagtg agactctgtc tcaaaaaaaa aaaaaaaaaa aaaaagagaa 20341 acatctggac tttaaggccg ggcacggtgg ctcacacctg taatcccagc actttgggag 20401 gccaaggtgg gtggatcact tgagcccagg agtttgagat cagcctgggc aatgtggaga 20461 aaccctgcct ctccaaaaaa tatagaaact agcagcacac gcccgtagtt ccaactactc 20521 aagaggctga ggttggaaga ttgcttgagc ggaggctaca gtgacctgat tgagccacca 20581 cgctccagca tgggcgacag agtaagaccc tgtcttcaaa aaaaaagaaa gaaatatctg 20641 gactttaaag gacactgttt gcaagaaatt aatattcatg gttaaactca taatgctgct 20701 aacaccgaaa aaaaatggcg gggggaggct gaggcatgag aattacttga acccgggagg 20761 cggatgttgc agtgagccga gatagtgcca ctgcactcca gcttgggtga ccgtgagact 20821 ctgaaagaca gcggtccata ccttccttac ctcagaaatc cctgatcttg ctcagctgac 20881 ccttacctga tatggcccca gatggccttg cataccccac agcatgggga actcactact 20941 tggctgaact ccctctccat gtcacgtctc ctccccaccc ccacccccgg ctttgctgtc 21001 cccacagaga agaagctgga ggagttctcc gtggaagagg ccgaggtgct acagatcctc 21061 acacaggtca acgtggtgcg aagagagtgt gagatcgtga gtgtcagagg cgggcaggaa 21121 ctaaagtgct ctccagggcc gggggtgctc ccttccaggt ggggcccccg tgaccagcat 21181 ctccttaccc cacttccctc caccagaaac tggaggcggc ccactttgac gaaaagaagg 21241 agatcaacaa acggaagcag ctgatcctgg aggggaaggt gagggccgag cccgcaggta 21301 gggggtgggt ggccaggttc cctggcgggg gccgccagct cagccatccc ctggtcctcc 21361 ctgtgccagg accctgacct ggaggccaag cgcaaggctg agctggccaa gatcaagcag 21421 aagaaccggc gcttcaagga gaaggtggag gagacgctga agcgacagaa ggctggcagg 21481 gctggccaca aggggcgtcc acccaggaca ccgtctgggt cccactcagg cccagtcccc 21541 tcccagggcc tggtctgagc cccacacggc catctgccca gtccttgact cgtccatgga 21601 gctgagggtc ggaggaacct tccttggggg cagcagccct tcccgggggc ctacccagtg 21661 ccccacagca gaacccgtgg gcgctcgtgt tgtgcgggcc ctgctcctct gccccgaaac 21721 cactggctgg tcccttccct gagccctggc caagattcag gctgcagggg aagaaagaac 21781 atgaccggga ggttgtgacc ccaacccaag gtcacccccc aggggtgccg catacaggag 21841 gtgcttaata aacgggtctt ttgacttcct cagtctgact ttcgaagagc agggggacag 21901 gagaggtggg gtgcagccgc tgttgtttct caggtgtctg cccagaacac catgtccatt 21961 tccaccaggc aggcccaaag ttttgcagac accagctgct cgaatggcca tgggattttg 22021 ggccctgaag ccccttctct gggcttccag tttcctcatc tggagaatgg gcatagcaac 22081 tgccctgctc tccagaggca gctgtgaggc tttggtgagg tcagtcctgt gccgaatggg 22141 tgcagctata agcttccctg gccacccacc tggccacatc tggggtttgt ggccagagtg 22201 aaggggccat gaaagtccaa atttggccag acaatgtggg ctaagcccaa ggggacaggc 22261 agagagacac cagagccagc ttcctggctg tgtggcctgg acccccctcg gactccaagc 22321 ctcacccctc atctgcaagc ggcttataac tgctacatgg attcaggggc acccaaaaag 22381 acgcagggaa aggcgccggc gacatgttaa gtgcccagat acccacatac cacacacaca 22441 cagccacgct tagaaatgta atcgggggat ctagaaattc tacacaatga gaagctcaaa 22501 aacagcccca aagctgccaa caaccagagc cgactggggc ccaccccagc ccagcccggc 22561 ccggcccacc cagggctaag ttgggacccc ccagtccctt tccaggacga atgggcccaa 22621 ctatgccgcc tgcagcctgg cccgcatccc aggccggaat cgttcataga aaaccagccc 22681 cggctcaggg cgcagcctca gccaggcggg ccaggccctc acgcagctca ctcagctcaa 22741 acaggctgac gtccagcagc tgcgctgccc ggcccacctc agcccgcgcc cgctcccgct 22801 ctgcccgtgc ctcctccagg ctgcgctcca tcgcccgcag ctggtgctcc aactccgcat 22861 tgcgggtctc caggtcctgc caggaaaggg tgggcagggt tgggggcccc aacaaatcac 22921 cagtcctgcc cgaggcatcc agagtctgac caccttcatg tccctcttct gcccagagcc 22981 atcctagggc gcccaacaca ccaggaataa aatgcacacc ccacaggggc ctagcttctg 23041 ccctcatttc ttttgtttgt ttgtttgttt ttgagaagga gtttcgctct tgtcgcccag 23101 gctggagtgc aatggcgtga tctcggctca ctacaacctc cgcctcccag gttcaagcaa 23161 ttctcctgcc tcagcctccc aagtagctgg gattacaggc atacaccacc acgcccagtg 23221 atggcttaca cctgtaatcc cagaactttg ggaggcggag gcaggtggat cccctgaggt 23281 caggagttca agaccagctt ggccaacatg gtaaaactcc atctctacta aaaatacaaa 23341 aattagctgg gcatggtggt gcacacctgt agtcccagct actctggagg ctgaggcagg 23401 agaatcgctt gagctcagaa ggcggagctt gcagtgagcc gagatcgcgc cactgcactc 23461 cagcctgggt gacagagtga gactccatct caaaaaaaaa aaaaaaaggc tgggcgcagt 23521 ggtttacgcc tataatccca gcactttggg aggccgaggc gagtggatca taaggtcaag 23581 agatcgagac tatcctggct aacatggtga aaccccgtct ctaccaaaaa atacaaaaaa 23641 ttagccgggc atggtggcga gcacctgtag ttccagctac tcgggaggct gaggcaggag 23701 aatggagtga acctgggagg cggagcttgc gccactgcac tccagcctgg gcaacagagt 23761 gagactccgt ctcaaaaaaa aaaaaaagta ccccctgaac atttctcaga taaagccttt 23821 agcaaaagcc aagccaaggc cagtcgtggt gactcaagcc tataatccca gcactttggg 23881 aggctgaggg cagattgctc acggtcagga gttcgagacc agtctggcca acatgaggaa 23941 accccatctc tactaaaaac acaaaaaatg cccgggcacg gtggcgcatg tctgtaatcc 24001 cagcactttg ggaggccgag gcgggcaaat catgagatca ggagttgaga ccagcctggc 24061 caacatggtg aaaccctgtc tctactaaaa atacaaaaaa ttagctgggt gtagtggcgg 24121 gcgcctgtaa tcccagctac tcaggaggct gaggcaggag aattgcttca acccgggagg 24181 tggaggttgc agtgagctga gatcgtgcca ttgcactcca gcctgggcaa cagagcgaga 24241 ctccatctca aagaattaaa aaaaaataaa gccaagccaa gccagaccta gttccaatgc 24301 aggctctgtg atctcaggct gagaacagga ggaattgacg gggtctggca tgggaataca 24361 gcaggaactc aagcattgct tgttgaaccc atagatggat atgaaactgt accaacagcc 24421 actgttcacg cggcacctgc tgggcatcag gccttccaca ggactgggcc tcccaatact 24481 ttatcatctt aggtagatat tgtcctgagt gtctgttgag gtgactgtca cccaccgtca 24541 ccccacctcc ttgtgcccac acacacctgc accttctgct gagtctcctc acgctcggca 24601 gcctccaggg cctcgcgggg ccccccagtc tgactcttca gggtctgaat ctcctagagg 24661 agaagagttc ccagggtctg ttgcgggggt cctggctttc ccacaggagg tgagggaagg 24721 cgaggcccag gaaccacact tggcactcac ctggtccttg gtttgcacca gagcttccag 24781 ctgttccagc gactggccct ggcccagccc ctccttctca ccggtggggg tcacctctga 24841 agctgcctga gcctccagct cagccacctg tagggcagga atagcagccc ctgacgcccc 24901 cgcctgcact ctcccttgct cgcccaacct ctagggaatc tgcctcctgt ctggccacgc 24961 ccaggaccga gcacccatct cccttggttg gtgccaccac ccccctctgc acagcttgga 25021 ctttagtgtc tgatcaccct ctgaggctgg cattgtctca aacccatttt acagagcaag 25081 aaactgaggc tgggtcagtg acctgctgag gtcacacagc tagtccatga acccagggca 25141 gatgggaact gccgaggcgc ggtctccaca gttgcccggg ccaggtgggg gtctctcgga 25201 gggaggggtg ccctcacccg ctgccgcagc cgctcggcct ctgcacgctg agcctccagc 25261 tgctgcctcc actgggctgc ggcggcgttg gcctctcgca gggcgcctgc cagcttgttg 25321 ttgctgtcct gcagtgcgaa aaactcggcc tcccactgta cctcgcccac ggagctgcag 25381 agacacgagg gtcggcagtg ggtgacacgg ggctgggggg cggagtcggg cgatgctggg 25441 ctagggggcg gggccagggg tgggatagct ctgggcttag gtcagggcga aggctggcga 25501 gggtgcggag tcgtgcgcga agttggacta gggggcgggg caggggtgga gtggcactgg 25561 gctgggggcg gagtcagcgg caatgctcgg ttagggggcg gggccagggt gggtggcgct 25621 gggctgaggc ggagtcacgg gcgatgctgg gctagggggc ggggccagag gaggggtggc 25681 tttgggctag ggcacggagt cagggccacg cagggctggg gatgtggcga cactgattgc 25741 tgagcttggg gatggggcag tcccgcagaa gcggggtctt gtcgaggacc gagtcaaaga 25801 agggggcacg tggagtccgc ccagtgacac tgtgggaatg ttgagggagg gatgggaggt 25861 tggggagagt cagccgatgg gggcgtggtc agggtggtta tcctaagacc agagaggcat 25921 cgacggggcg aggttagcag cgagaagaga cagggatggg gtcaaaacag aggggcgtgg 25981 ccaacgcagg gtaggggcgg ggtcggctct ggatggggca gcattcagat cctcatcccc 26041 ttccccaggt cttgagtcca cagttagatc ccgccgtcct ggcactccct tggcttccaa 26101 gaaccgcctc ctcgtccccc acccctaccc ccgcccctgc cacgccccca agtcccgccc 26161 ctcacccctc agacaacatc ttctttagcc gctcgcgctc tgtggggccg ggggcatcag 26221 cgctctggct gcggaacagt ttttcctcgc cggggccgtt ggcactgacg agagggctcg 26281 ggggcaccta ggcacgggga aaagaatagg tcacgacccc acatcagctg ggatcaaggc 26341 tgatgtgact ggggtcgtca catcgggggt gcaattgacc tatgcaccct aagcaggtcc 26401 ttcacagcac aggatccaag tgtgtccctc cccacctcac ccaccctcca tggctccctg 26461 tggcccctaa gctgggagtt ccatgcccac gccctcgacc ttgccttcct gggctttcac 26521 cacccccagg agcaagtctg atgcaggtta gtctaaacca agatacttag aacctgtaag 26581 gtaggggagg ggtacaatcg gacacctccc tgctgttccc ttccctgagg ccgaacttca 26641 acatcacgtt taggatttca gcaggaagcc ctgcccaggc cgaattcgtg tttcgtgagc 26701 aaccccactg cctggaccac tggctctgag ggtagacaag ctggcgggtg ccctgcgaga 26761 ggccagcgcc accgcagccc agtgcaggca gcagctggag gctcatgctt tttttcatct 26821 ctccaacccc tgtgcacagc cctcggcctg gtacacagca ggtgctcaat aaataatgct 26881 gaataaatta atgtgggaaa acacgcaggg acagttcgtt gttctgcctc acacttagcc 26941 atttctttct ttcttttttt ttttgagaca gagtcacact ctgtcaccca ggctggagtg 27001 cagtggcacg atctcggctc actgcaacct ccatctcctg ggttcaagcg attctcctgc 27061 ctcagcctcc cgagtagctg ggactacagg cgcatgccac cacgcctggc taatttttgt 27121 attttttagc agaggtggag tatcaccatg ttggccaggc tggtctcgaa ctcttggcct 27181 caagtcatcc tccctcctca gcctcccaaa ttgctgggat tacaggcgtg agccaccgca 27241 cccggccaca cttagccatt tcaatttcta gtcttccatc ctgccatctg agctggtttt 27301 gctactggga cctactgggt gtaccccacc ggccccaggc aaggtccaga ccctctatga 27361 cacactccac accactcctc tgtggtccct gctgtgggtc taaccttagg tcatctggaa 27421 gcattgggtc ttgacgacca gcagagtgcc ccggtctccc aatatcaagg tcccccaata 27481 tcagttgagc gcctggctga ctcacaggtg cccaggcagg cataggggag tagggagtgc 27541 tgacctggtg ggaggcgagc cccagggctg gactggtgag ctccccgcca tcctgagatt 27601 tctccctggc cagcctggct gcttccttca cttcctggaa cttctcggca aactaaggga 27661 gtggggagga aaagacagta aaaccccagc ccatcctccc aaacaggtca ccatcactca 27721 gtgacagccc cgccactttc gaaccgtacc catcccctta tcacacacct tggaagaagg 27781 agagaaactt tgaaaaatga tgtaccatgg tgggtgggca ggtggcatca gatcacagag 27841 gtggcattta aatacattgg gccacattga gaaagggaaa ggaattccca tgtcctcttt 27901 ttaggaaagg gtctcacttt gtcacccagg ctgcagtaca gtggcatgat ctcggctcac 27961 tgcagccttg acctccctgg ctcaagtgac ccttccacct tacccctcca ccaaggagct 28021 gggactacag gtgtgtgcca tcatgcccgg ctaatttttt ctagagacgg ggtctagcca 28081 tgttccccag aatggccctt ttcctgtttt ttgttttttt tttttttgag atggagtctt 28141 gctctgttgc ccaggctgga gtgcagtgac atgatcttgg ctcacggcag cctctgcttc 28201 agcctcatga gtagctggga ttacaggcat gcaccagcat gcctggctaa ttttcatatt 28261 tttagtagag atgtggtttt accatgttgg ccaggctgat cttaaattcc cggcttgaag 28321 tgatccgcct gcctcagcct cccaaagtgg tgggattaca ggcacagcac ccagcctttt 28381 tttttttttt ttttgggggg ggacagaatc tcgctttgtc accctggctg gagtgcagtg 28441 gcaccatctc agctcactgc aacctctgcc tcccaggttc aagcgagtct tctgcctcaa 28501 cctcccgagt agctaggact acaggcccgt gccaccacgc ccagctaatt tttgtattta 28561 gtacagatgg ggtttcacca tattggccat gctggtctcg aactcctgac cccgtgatcc 28621 acccacctcg gcctcccaaa gtgtgagaat gacaggtatg agccaacatg tggggtccct 28681 tttaatcttt atcagggcta tttcaacaaa gagagcctca ggtgggtgct aatctgtctt 28741 taacacctgt tattattcct ttataacaaa gagattaggt ctcaggctct gagagacatc 28801 agaagtcaat gagaaagtaa tcatctcatt cttgtttttg ttgtattaaa tgtttacatg 28861 tttacattta ccacctattt atgacaaagc aataccagtt ttccagttag gataggatat 28921 gtttcccccc accaccccgg ttttaaaata aatttcctta gtttaaaaat aattttttag 28981 gccgggtgcg gtggctcaca cctgtaatcc cagcactttg ggagaccaag gcgggcggat 29041 catttgaggt caggagttcg aaaccagcct ggccaacatg gtgaaacccc atctctacta 29101 aaaacacaaa aattagccag gcatggtggt gggcgcctgt aatcctagta cttgggaggc 29161 tgaggcagga gaattgcttg agcccaggag gtggaggttg cagtgagctg agattgtgcc 29221 attgcactcc agcctgggtg acagagcaaa actctgtctc aaaataaata gataaggcca 29281 tgtgtggtgg ctcacgcttg taatcccagc actttgggag gccgaggcag gtggatcatg 29341 aggtcaggag ttcaagagca ggctgaccaa catggtgaaa ccccatgtct actaaaaata 29401 caaaatcagc caggcatggt ggcacatgac tgtaatccca gctacttggg aggctgagga 29461 aggagaattg cttgaaccgg ggaggcggaa gttgcggtga gccgagatcg caccattgca 29521 ctccagcctg ggggacaaga gtgaaactct gtctcaaaaa aataaataaa taaataaata 29581 aataaatgat tttttaggct gggtgcagtg gctcacacct gtaatcccat cactttggga 29641 gcctgaggca ggcggattac ctgaagtcag gagtttgaga ccagcctggc caacatagtg 29701 aaaccctgtc tctgctaaga atacaaaaat taaaaaaaca aaaattttgt aaaaatacaa 29761 aaattagcca ggtgtggtgt gtctgtaatc ccaactactt gggaagctga ggcaggagaa 29821 ttgcttgaac ctggagagaa gaggttgcag tgagccgaga tcgggccact gcactccagc 29881 ctggcaacag agctagactc tgtctccaaa aaaaaaaaaa agacctaaat tagaaaggta 29941 aattgttggg gccgggcgcg gtggctcaca cctgtaatcc cagcactttg ggaggccaag 30001 gtgggtggat cacgaggtca ggagatcgag accatcctgg ctaacacggt gaaacccccg 30061 tctctattaa aaatacaaaa aaaattagca gggcgtggtg gcaggcgcct gtagtcccag 30121 ctactaggga ggctgaggca gaagaatcag aagaatggtg tgaacctggg aggcagagct 30181 tgcagtgagc caagatcgcg ccactgcact ccagcctgtg cgacagagcg agactccgtc 30241 tcaaaaaaaa aaaaaaaagt aaattgtctg ggtgcagtgg ctcaagcctg taatcccagc 30301 actttaggag gccaaagtga gaggctcgct tgaggccagg agtcaagaac agcctggaca 30361 acatagtgag atcctatctc taaaaaaaat aagattaaag atattagcta ggcatggtgg 30421 catacgccta tagtctcagc tagtcaggag gctgaggtgg gaggatccct tgtgcccagg 30481 agtttgaggc tgcagtgagc catgattgca ctgcttcact ccagcctgga cagcagagca 30541 agatcttgtt aaaaaaaaaa aaaaaaagaa gggagggagg gagggaaaaa aaaagaagaa 30601 aggtaaattc taaaccttct agaagaaaac aaagaaagcc tggtgtggta gctcatgcct 30661 gtaatcccag cactttggga ggccgaggca ggtggatcat ctgaggtcag gagttcaaga 30721 ccaccctggc caacaaaatc ccatctccac taaaaataca aaaattagct gggcgaggtg 30781 gtgggcacct gtaatcccag ctactccaga ggctgaggca ggagaatcgc ttgaacccgg 30841 gaggtggagg ttgcagtgag ccgagatcgt gccactgcac tccagcctgg gggacagagc 30901 aaggcattat ctccaaaaaa aaaaaaaaaa aaagaagaag atcttcgtga aagcactcat 30961 catcaacaaa accacggata aagataataa tggaagtata gagacggcac agggtgcagg 31021 gaggtgagga tgaaggtatt gctggggcat gtgggaaagt gcccagccca gcccagccac 31081 ccatttgatg tcccaaagag gagcagggtg tgtccaaaat tagagctaca ggcagtagaa 31141 ctaggactca agagcaggta cagtggctca tgcctgtaat tgcagcactt tgggaggccg 31201 aggcaggcag atcatttcag gtcaggagtt cgagaccagc ctggccaaca tggtgaaacc 31261 ctgtctctaa taaaatacaa aaaattagct gggcgtggtg gcggactcct gtagtcccag 31321 ctactcagga ggctgaggca ggagaatcgc ttgaacccgg gaggcggagg ttgcgtgagc 31381 caagatcact gcagcctatc tcaaaaaaaa aacagaaaaa tgaactagga atcaaaccgg 31441 ggaagcttaa aggggcttca gtctcaactc atcctacaat cccatcctgc agatgggaag 31501 ataggggacc tgcccaagat cccagagtga gcccaggaga agacttgggc ccagtctcat 31561 cggttccctc cactctcccc ctcacccggc ccacctgtgt cagatgctgt tcagaggcaa 31621 agcccaggcc gtagactgtg ttggcgcgac tgtcggccca ctgcccgaac ttctgggaag 31681 ttttggtgaa ggtcatgttg ggagtgacag tgctgttgat gatggcctag ggtcggggaa 31741 caaagttcaa ggtagatggg acagatacta acaaacatga aaccccccag gtattacctt 31801 gtattagggt cttgaaggtc tttactctga accctttcag gagatagaga ctagggcctg 31861 ggaaggctta gggtttcctg tcccttgggc acacagcaag ccagccacca gacccaaggt 31921 tgtcaccagg actcctggtg cctctgcccc acccagaccc caccctgacc ttggcgcctc 31981 cgatgctgat gatgcggtac acattgcggg tggcatcgta gaaataggag acagtgagtg 32041 cgtgcttgcc cgctgggatc cagtttcgct tggtggctgg gtcaatttgg aacacgtgcg 32101 cccgtgtgct gaagattggc tgctccctgc gtggggagag ggtgtttggt gggggccagg 32161 cacccccact tagaaagctc agggccaggc tgggggaggt gggagctcac ctggctgtgg 32221 acattggtca ggctgggatg ggcaggctct agggggcagc cagagaggtg gcaggagcac 32281 tggtttggcc cctagggaga gaggagggac tatgagtgtc cctcaggcca ggctgaggga 32341 ggccaagcag gttccatctc aggacccatc accaggtgtg cccgtcatgg aactttcaat 32401 caaaaaggca ggagaggccc aaagtacacc tggtgctttg ggaggctgag tcaggaggat 32461 cctttgagcc caggagttcg agaccatcct gagcaacatg gcgagacccc atctctacca 32521 aaaatttaaa caattagctg ggtgtggtgg catgtgcctg ccgtcccagc tactggggag 32581 gctgaggtgg gaggatcact tgagcccagg aggtcaaggc tgcagtgcag tgagctatgg 32641 tcaggccact gcactccagc ctggacaaga gagtgagacc ctgtctcaaa aaagaaagaa 32701 attataaaaa gaaacaaaaa ggtaagataa ttattataga taactcagag caagcagacc 32761 ccatctttct ttgtctaggg gaagccccca caacaccctg tacaggagat ggccctctga 32821 gaggagcctg gtggccgctg acatccagat gtaaatttct cattgtcctg gttattgcac 32881 ccccgtgaag tcccctccat cccaggctga gctgtggagg ccaaatctct gctgagttct 32941 ttccaccaca ttcacccatg actggccaga gcggggctct ccaccaaaaa atggggcgtt 33001 gagaggtgtg tagtaccagg agaccagttc cccactacac cctgcagagg gaccccaccc 33061 ccagcagctc ccaccctagg gcagaagcac accttgaagt ccagagtcca aatccccagg 33121 gagcccccag ccaagttaaa aagacaagat gtagacaaac agccaatccc aatgcccacc 33181 atgagggcag agcagaatcc acactgggag accagggctc cccaacctcc ccattgggcc 33241 attcagcaac cccgaatctg caggggacca aggaggagtt tggaggaggg ggaagacagg 33301 tgggggaggg gaggccctgg cccagcggcc tctgtggtca aaggggggat gcccccccta 33361 tcctgtcctg ggcagctgct gtcctggtca cagggtctga aaccggagcc cctgagtggg 33421 ggtggggcgg cagctctggc ctggactggg ggcgggtcct gcgccgggtg ggagttgggg 33481 aacacagggg tccccggaac ccaggccctt ccggtccggc agctcctgct gcgacttgca 33541 gcgtttgggg aaagttactc accaactagt catgcgcctc ggatgccgcc ggccacgggg 33601 ctcccggagg ggggtccgac accgcccggt gcccgggatc cccaatctgg ccggtggatg 33661 aggccccatc ccatgcaccc gagtgcgggg gacggggggg ttaatattaa ttcaggatcc 33721 catccccacc cgaggcagga cacccagtgc acgccccaac tccgggacgc gggctctgac 33781 cctggaactt ccgagattcc cgcgacgccc cctgtctctg ggtccagaac ttcgggggac 33841 acctcttctg cctcctccaa ccccaggtcg gtaactccgg ggactccctt cgtgactcct 33901 cctctaactg ggtccgggac cccggtcctc cgcgttgtcc tagactccta aacttcgagg 33961 accccctgcc acccctccag tccctgggtc cgggacgtcc ggacccctcc caccccagta 34021 ccccacccca ggtccgcagc tttggggacc tcttcgcagc ctcctccaac gccgggtcca 34081 gaactttggg aacctcccac gcagtcccct ctcagaccct gaatccagaa cttcgggacc 34141 cccgcgctcc ctccagccct ggaacttggg gacccctcgc gccgctccca gtcccgggtc 34201 cgaaacttct ggggaacccc cacgccctcc ccttggccct gggacttcga ggcccccgcg 34261 ctcccagcgc aggctcggca ggccgggctc accctgccag gtcggcgccc cgcgagtgtc 34321 caggcgcgag tcgggtgcgg cccccgcggc gctgactccg cgctgccggg cgccccgctc 34381 gctccctggc tcccgggccc gcgccctccg cgccgccctc cacgccgccc gtgcctttgt 34441 ctgcgccgcc gccgcccgcg ccgcctccgg tcccatcgcg gccggtcccg acgcggccgc 34501 cgcccgcccc gcctccccgg gccgccaagg ccgccccgct gcgccccctg ccggcttttt 34561 gcactgggag agggtatggg ggaggggata agggggaggg gcgggggccg ggactgggga 34621 agggacggag gattgaggga gggggttagg ggcggtggag ggggagtggg ggggggggtt 34681 gcgtggggtc gggttgggag cagagggtgg ggacagagag acagacggag agggacatct 34741 gggatgagaa agagacccag aaacagagac acgggggaga caggcggaga gataggaagc 34801 agatggagag agttgcacgg acagcgatgt gacctcaccc tcctgtcacc ttctcaggag 34861 acagctgcct ctgtccccac tacctgaggt cataactttg ggcctgtgag ggaggtcagg 34921 tccctcccgg gctgacatgg gtggcccagt gaggttgggg agggggcagt gggtctcagg 34981 gtctgcatac agtaagtgct caatgggtgt tgccaggaaa cggaagagat caacttcggg 35041 agggaacggg aagcaagaca aagagagaga agcctctgtt cccctgcccg tcccatccct 35101 aagcagggac gtctgtccac ctctgtccct cccaagacga tggtaaaaat gcccacaatg 35161 gttggattaa ggttatcttt gctgttgttt gagggttcca aaatccaaag gctgaggaaa 35221 tgtctccaac gcatggttgc atcccctcag cctccactgg cctgcgtttg tcacctctgg 35281 tcacagggag cacccccttc ccacacttgc caggagagca tccttccgga gttgaactag 35341 gatccacctc cccagcccca agcccatgtg tcccacttgt caatcctggt gccgccccct 35401 ctctggcacc gtctccaaaa tcacacggtg cggcgccgtt ggttcgtttg tggacagagg 35461 tcctgcagct ccctggaacc aaggcctgtg gcgccctaaa gctctgatct tacatttgag 35521 tttttctttt cttttctttt cttttttttg agacaaagtc tcactctgtt gcccaggctg 35581 gagtgcagtg gcacaatctc ggcttactgc agcctctgcc tcccagattc aagcaattct 35641 cctgcctcag cctcccgagt agttgggact acaggtgcat gccaccacac ctggctgatt 35701 tttgtatttt tagtagagat ggggtttcac catgttggcc agtctggtct cgaactcctg 35761 acctcaagtg atctgccccc ctcggcttcc caaagtcctg ggattacaga cgtgagccac 35821 cacacctggc cgacacttga gtgttttgac aacttcagag aggcttttgc ttttcatctc 35881 caagacctct gctcctgtct cctttccctg ttcctttatt tctgtttcct tctaagaaga 35941 atagtatatc tgagctggcg gttacattac attcaaatta cctttaggtg aattccaggt 36001 caggttgttt ttggagattg actcctgcgc caagatctga gtcttccccc atccaggcct 36061 ttgcccaaac tgttccttcc acctggaatg ccctcgctgc aggcatctct tctgccttgt 36121 ccttccaggc tcagcgccac caccctgtct cctcctccag gaagcctctc tgactaccct 36181 ggccttggaa gggatcttgc cctcccctgt cccaatccct gtgactcttt tttttttctt 36241 ttttaaagtt tatttttaga gacgaggtct cactatgttg cccaggctgg tctcaaactc 36301 ctaagctcaa gtgatcctcc cacctcggcc tcccaaagtg ctgggatgac aggtgtgagc 36361 ccagaccctt tctgtgactc ttatgttacc tgtttattac atgaccaggt ctgggtgctc 36421 ctctagggct ggcccatcca tcctgaagcc ccagtctgac ccagcacaag agcgactgtt 36481 catcccacag gggaaactga ggcacagact cggtttccct ggctgcttct aacttcacga 36541 ggcatctgca caagggaagg atttggctct gttctcaggc cggaagccac gaaatgcttg 36601 aacattgtgg ggagcaaaga aaataccgtc tgttctgtgc aaaaaatgac ttttccttat 36661 taccaacaaa ataatggaaa aaataaaaaa gtatctggga aaggtgctgg catggccaag 36721 gcttctggaa gctgcttaga gtttgccaag tgcacaagtg gggccctgtg gctattttac 36781 aggggcggct gggcgagggg cccaggcaga ctttgccctt tgtgtgggca gaaagcaaca 36841 gtgaaagaga cagtgaaggg ttgagttgct ttcctgaaca ccctctccgg tctctgggag 36901 gggctgggac agggctggcc ttgagcagcc ccaacctgaa gctggactag ccaagtctac 36961 actgaggcac tggcttcctc agtgtgggcc tcagttttct cacctggaaa atgggccgat 37021 gcttgcaggg cttctagtag agggagaaac atgaaggatt tagtcaatat cagcagctgt 37081 cctctggggt gcctgccatt aggttcgtgg gcaaaggccc tgccactctt tgctgcagct 37141 gggaaaacag agtctgtgtc tcagtttccc tgctgggatg aatagtcact cttatgctgg 37201 gtcagactga ggcttcagga tggatgggcc agccctagag gagcacctag acctggtcat 37261 ataataaaca ggtaacataa gagtcacaga aaggggctgg gctcatgcct gtcatcccag 37321 cactttgaga ggccgaggct ggaggatcgc ttgagctcag gagttcgaga ccagcctggg 37381 caacatagtg agacctcgtc tctaacgagg gctgccctga gctgctgccc ccaccccagt 37441 ctgggccctc ttcttcctag gtctggaggc ctctcttcta aagaactggg ctgggtgcag 37501 tggctcatgc ctgtaatcct agcactttga gaggctgagg caggcagatc acctgaggtc 37561 aggagtttgg gaccagcctg gtcaaaacat ggtgaaaccc cgtctctact gaaaatacaa 37621 aaattagccg gacgtgatgg tgcacacctg taatcccagc tactcgggag gctgaggcag 37681 gagaattgct tgaactcagg aggcagaggt tgcagtgaac tgagatcagg tgactgtact 37741 ccagcctggg tgacagaagg agactctgtc tcaaaaacaa aaacaaaaaa gggtgggggg 37801 gagggaggaa ggaaggaagg aacagacttg agatgtcctt gcagggcagg cagaggccac 37861 cccaaggtgt ctgactagga aagcaacttt cctttggggc tgaaaaatgg tgtgggagaa 37921 tgagcttccc accttaaagg ggaaaccaag cgggagcctg aacagctcat tcagcactaa 37981 gaaaactgca tttgcagaat ggcttttatg gctcaggctc tggtctctgc ctagttagat 38041 c // LOCUS AC003002 81786 bp DNA PRI 08-OCT-1997 DEFINITION Human DNA from overlapping chromosome 19-specific cosmids R29515 and R28253, genomic sequence, complete sequence. ACCESSION AC003002 NID g2494139 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 81786) AUTHORS Lamerdin,J.E., McCready,P.M., Adamson,A.W., Burkhart-Schultz,K., Garcia,E., Kyle,A., Ramirez,M., Stilwagen,S., Garnes,J., Danganan,L., Bruce,R., Quan,G., Montgomery,M., Ow,D., Kobayashi,A., Olsen,A.O. and Carrano,A.V. TITLE Sequence analysis of a 500 kb ZNF gene family- containing human contig in 19q13.4 JOURNAL Unpublished REFERENCE 2 (bases 1 to 81786) AUTHORS Lamerdin,J.E. TITLE Direct Submission JOURNAL Submitted (08-OCT-1997) Human Genome Center, Lawrence Livermore National Laboratory, 7000 East Ave., Livermore, CA 94551, USA COMMENT Map and sequence oriented from centromere to telomere. Clones overlap cosmid R30217 to the right. FEATURES Location/Qualifiers source 1..81786 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="R29515 from bases 1- 42,876 and R28253 from bases 41,773- 81786" /chromosome="19" /map="19q13.4 from D19S303 to ZNF134" /cell_line="5HL2-B" /cell_type="fibroblast" /clone_lib="LL19NC03 R chromosome 19-specific cosmid library" /note="cosmid library constructed at LLNL from flow-sorted chromosomes from hybrid UV5HL9-5B, which carries chromosome 19 as its only human chromosome." misc_feature 754..1071 /note="BLASTN similarity to Z62624 CpG clone HS70G1R (1..318); match: 0.99, score: 8.9e-125; 99% identity; database searched: nr" misc_feature complement(1608..1798) /note="BLASTN similarity to Z62625 CpG clone HS70G2F; P = 1.2e-70; Identities = 191/199 (95%)." misc_feature 2392..2472 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: excellent, score: 88.000" misc_feature 3462..3588 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: excellent, score: 90.000~~BLASTX similarity to (5..48); match: 0.61, score: 6.5e-08; database searched: nr; KRAB-domain- containing zinc finger protein ZNF45 - human (fragment) >gi|186633 (M67509) ORF [Homo sapiens]." repeat_region complement(3735..4029) /rpt_family="Alu" misc_feature 5882..7687 /note="DPS similarity to P52740|Z132_HUMAN ZINC FINGER PROTEIN 132; gi|488551 (U09411) zinc finger protein ZNF132 [Homo sapiens] (1..589); Score: 1751 Identity: 323/589 (54%).~" misc_feature complement(9636..9873) /note="BLASTX similarity to (152..235); match: 0.53, score: 4.2e-07; database searched: nr; probable pol polyprotein-related protein 4 - rat gi|56590 (X53581) ORF4 gene product [Rattus norvegicus]" repeat_region complement(9898..10089) /rpt_family="Alu" misc_feature complement(10194..10261) /note="BLASTX similarity to (1101..1152); match: 0.38, score: 2.8e-07; database searched: nr; line-1 protein ORF2 - human" repeat_region complement(10317..10572) /rpt_family="Alu" repeat_region complement(10893..11193) /rpt_family="Alu" repeat_region 11670..11966 /rpt_family="Alu" misc_feature 13244..13439 /note="DDS similarity to N24366 yx14c04.r1 Homo sapiens cDNA clone 261702 5' similar~ to SP:YB9B_YEAST P38334 HYPOTHETICAL 19.7 KD PROTEIN IN SRB6-RIB5 INTERGENIC ; (1..195). Identity: (97%).~~Other overlapping matches:~T89537 ye04c07.r1 Homo sapiens cDNA clone 116748 5' (1..167); 100% identity." repeat_region 14197..14483 /rpt_family="Alu" misc_feature 14551..14946 /note="DDS similarity to N24366 yx14c04.r1 Homo sapiens cDNA clone 261702 5' similar to SP:YB9B_YEAST P38334 HYPOTHETICAL 19.7 KD PROTEIN IN SRB6-RIB5 INTERGENIC; (196..485); 99.7% identity.~~Other overlapping matches:~T89537 ye04c07.r1 Homo sapiens cDNA clone 116748 5' (167..554).~~N40075 yx98e03.r1 Homo sapiens cDNA clone 269788 5' (1..365) ;.Score: 709 Identity: 361/365 (98%)~~H99119 yx14c04.s1 Homo sapiens cDNA clone 261702 3' (1..429).Score: 836 Identity: 426/429 (99%)~~AA195608 zr38a02.s1 Soares NhHMPu S1 Homo sapiens cDNA clone 665642 3' (1..395); Score: 713 Identity: 384/395 (97%).~~T89449 ye04c07.s1 Homo sapiens cDNA clone 116748 3' (1..426); Score: 681 Identity: 399/426 (93%)." CDS 14570..14992 /note="Hypothetical 16.6kDa protein most similar to hypothetical C. elegans ORF; DDS similarity to gi|1946954 (U97552) W05H7.3 gene product [Caenorhabditis elegans] (1..141).~Score: 389 Identity: 76/141 (53%)~~BLASTP similarity to sp|P38334|YB9B_YEAST HYPOTHETICAL 19.7 KD PROTEIN IN SRB6-RIB5 INTERGENIC REGION; Expect = 8e-21, Identities = 57/170 (33%)" /codon_start=1 /product="R29515_1" /db_xref="PID:g2494140" /translation="MSGSFYFVIVGHHDNPVFEMEFLPAGKAESKDDHRHLNQFIAHA ALDLVDENMWLSNNMYLKTVDKFNEWFVSAFVTAGHMRFIMLHDIRQEDGIKNFFTDV YDLYIKFSMNPFYEPNSPIRSSAFDRKVQFLGKKHLLS" misc_feature 14934..15045 /note="BLASTN similarity to D20534 (1..112); match: 0.98, score: 8.1e-38; database searched: est; Human HL60 3'directed MboI cDNA, HUMGS01509, clone pm2818." repeat_region complement(15246..15556) /rpt_family="Alu" repeat_region complement(15599..15887) /rpt_family="Alu" repeat_region complement(15955..16446) /rpt_family="Alu" repeat_region complement(19551..19635) /rpt_family="THE1" repeat_region complement(19642..19918) /rpt_family="Alu" repeat_region complement(20687..20976) /rpt_family="Alu" misc_feature 21491..21644 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: excellent, score: 84.000~~BLASTX similarity to (8..59); match: 0.53, score: 2.7e-07; database searched: nr; KRAB-domain- containing zinc finger protein ZNF45 - human (fragment) >gi|186633 (M67509) ORF [Homo sapiens]" repeat_region complement(22785..23157) /rpt_family="THE1" repeat_region complement(23212..24758) /rpt_family="MSTAR" repeat_region 24816..25415 /rpt_family="Alu" repeat_region complement(25421..25610) /rpt_family="THE1" misc_feature 25502..25568 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: excellent, score: 79.000" repeat_region 25617..25902 /rpt_family="Alu" repeat_region complement(25875..26074) /rpt_family="THE1" repeat_region complement(26080..26738) /rpt_family="PAB" misc_feature 27148..28419 /note="BLASTX similarity to P51522 (45..395); match: 0.52, score: 6.7e-136; database searched: nr; ZINC FINGER PROTEIN 83 (ZINC FINGER PROTEIN HPF1) >pir||A32891 finger protein 1, placental - human~~predicted exon, program: grail2exons_human_1.3, frame: 0, quality: good, score: 62.000" repeat_region complement(29075..29136) /rpt_family="MER5" repeat_region complement(29736..29819) /rpt_family="MER1" repeat_region complement(29838..30055) /rpt_family="MER1" misc_feature complement(30554..30678) /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: good, score: 61.000" repeat_region complement(31101..31395) /rpt_family="Alu" repeat_region complement(31405..31691) /rpt_family="Alu" repeat_region complement(31894..32164) /rpt_family="Alu" repeat_region complement(32277..32579) /rpt_family="Alu" misc_feature complement(32792..32928) /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: excellent, score: 87.000" misc_feature complement(33256..33322) /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: good, score: 73.000" repeat_region 33683..34118 /rpt_family="Alu" repeat_region 34378..34688 /rpt_family="MER1" repeat_region 35245..35532 /rpt_family="Alu" repeat_region 35731..36020 /rpt_family="Alu" misc_feature 39657..39736 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: marginal, score: 48.000" misc_feature 40539..40672 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: marginal, score: 41.000" repeat_region complement(40898..42495) /rpt_family="L1" repeat_region complement(41530..41830) /rpt_family="Alu" repeat_region 43115..43374 /rpt_family="Alu" misc_feature 43924..43959 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: excellent, score: 76.000" repeat_region complement(45112..45494) /rpt_family="THE1" misc_feature 46757..46910 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: excellent, score: 100.000~~BLASTX similarity to (8..59); match: 0.53, score: 3.6e-11; database searched: nr; KRAB-domain- containing zinc finger protein ZNF45 - human (fragment) >gi|186633 (M67509) ORF [Homo sapiens]" misc_feature 48579..49625 /note="BLASTX similarity to P51522 (84..395); match: 0.53, score: 4.6e-149; database searched: nr; ZINC FINGER PROTEIN 83 (ZINC FINGER PROTEIN HPF1) >pir||A32891 finger protein 1, placental - human~~predicted exon, program: grail2exons_human_1.3, frame: 2, quality: excellent, score: 87.000" misc_feature 48808..48893 /note="DDS similarity to AA205091 zq71g04.r1 Stratagene neuroepithelium (#937231) Homo sapiens cDNA clone 647094 5' similar to SW:ZN17_HUMAN P17021 ZINC FINGER PROTEIN 17 ;(1..96); Score: 164 Identity: 89/96 (92%)." repeat_region complement(50165..50484) /rpt_family="Alu" repeat_region complement(50701..50893) /rpt_family="L1" misc_feature 50995..51067 /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: good, score: 57.000" repeat_region 51045..51310 /rpt_family="MER2" repeat_region complement(51475..51754) /rpt_family="Alu" repeat_region complement(51791..52008) /rpt_family="L1" misc_feature 51907..52250 /note="DDS similarity to T94161 ye28g12.r1 Homo sapiens cDNA clone 119110 5' (1..346). Score: 666 Identity: 342/346 (98%)." misc_feature 52492..52955 /note="DDS similarity to AA282963 zt15h09.s1 NCI_CGAP_GCB1 Homo sapiens cDNA clone 713249 3' (1..461).~Score: 901 Identity: 461/461 (100%)" misc_feature 52810..53007 /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: marginal, score: 47.000" repeat_region complement(54460..54750) /rpt_family="Alu" repeat_region 55286..55576 /rpt_family="Alu" repeat_region complement(55589..55867) /rpt_family="Alu" misc_feature complement(56201..56489) /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: good, score: 60.000" repeat_region complement(57156..57441) /rpt_family="Alu" misc_feature complement(58267..58345) /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: good, score: 62.000" repeat_region 58377..58639 /rpt_family="Alu" repeat_region 58742..59029 /rpt_family="Alu" misc_feature 60754..61085 /note="BLASTN similarity to Z64322 CpG clone HS9F12R (1..331); match: 0.99, score: 6.8e-125;Identities = 327/332 (98%).~" misc_feature 60915..61142 /note="DDS similarity to AA333113 EST37150 Embryo, 8 week I Homo sapiens cDNA 5' end (1..229); 97% identity.~" misc_feature 61073..61216 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: good, score: 54.000" repeat_region complement(62185..62479) /rpt_family="Alu" repeat_region complement(62615..62911) /rpt_family="Alu" misc_feature 63332..63372 /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: excellent, score: 78.000" misc_feature 63332..63372 /note="DDS similarity to AA333113 EST37150 Embryo, 8 week I Homo sapiens cDNA 5' end (230..270); 100% identity." repeat_region complement(63779..64061) /rpt_family="Alu" repeat_region complement(64073..64361) /rpt_family="Alu" repeat_region complement(64450..64734) /rpt_family="Alu" misc_feature 65177..65242 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: good, score: 68.000" repeat_region complement(66240..66543) /rpt_family="Alu" repeat_region 67303..67598 /rpt_family="Alu" misc_feature 67648..67774 /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: excellent, score: 100.000~~BLASTX similarity to (3..42); match: 0.55, score: 1.6e-07; database searched: nr; KRAB-domain- containing zinc finger protein D19S19 - human (fragment)." misc_feature 67648..67693 /note="DDS similarity to |AA333113 EST37150 Embryo, 8 week I Homo sapiens cDNA 5' end (271..317); 94% identity.~" repeat_region complement(68620..68905) /rpt_family="Alu" misc_feature 69783..71276 /note="DPS similarity to Accession: gi|1769491 (U66561) kruppel-related zinc finger protein [Homo sapiens] (221..718). Score: 1555 Identity: 269/519 (51%)~~predicted exon, program: grail2exons_human_1.3, frame: 0, quality: excellent, score: 89.000." misc_feature 71011..71484 /note="DDS similarity to AA418246 zv96b07.s1 Soares NhHMPu S1 Homo sapiens cDNA clone 767605 3' 91..474).~Score: 936 Identity: 471/474 (99%)" misc_feature complement(71056..71543) /note="DDS similarity to AA418360 zv96f07.r1 Soares NhHMPu S1 Homo sapiens cDNA clone 767653 5' similar to gb:X52354 ZINC FINGER PROTEIN KOX23 (HUMAN);~Score: 938 Identity: 481/486 (98%)." repeat_region 71654..72997 /rpt_family="MER7" repeat_region complement(72074..72359) /rpt_family="Alu" repeat_region complement(73231..73488) /rpt_family="Alu" repeat_region 74156..74440 /rpt_family="Alu" repeat_region 74806..75099 /rpt_family="Alu" repeat_region complement(75157..75468) /rpt_family="Alu" repeat_region complement(75655..75955) /rpt_family="Alu" misc_feature complement(77624..77929) /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: good, score: 52.000" misc_feature 77744..78141 /note="DDS similarity to D81878|HUM417H05B Human fetal brain cDNA 5'-end GEN-417H05; (1..396). Score: 762 Identity: 392/396 (98%)~~Other overlapping matches:~(77796..77947) AA454141 zx45h09.r1 Soares testis NHT Homo sapiens cDNA clone 795233 5' (1..152); Score: 304 Identity: 152/152 (100%).~" repeat_region complement(78065..78125) /rpt_family="THE1" repeat_region complement(78160..78332) /rpt_family="Alu" repeat_region complement(78762..79050) /rpt_family="Alu" repeat_region complement(79247..79541) /rpt_family="Alu" repeat_region 80505..80792 /rpt_family="Alu" misc_feature complement(81102..81399) /note="DDS similarity to AA229025 nc50c11.s1 NCI_CGAP_Pr3 Homo sapiens cDNA clone 5709 (1..300); Score: 550 Identity: 290/300 (96%)." repeat_region complement(81556..81783) /rpt_family="Alu" BASE COUNT 21056 a 17431 c 19441 g 23858 t ORIGIN 1 gatctttgat ttatcttcct aaagcagtgt atgtgagaac cttgtgttga ggaagactga 61 taacagcttg tatgtattta gcacgatgtg ccaggcctgg tgttaagtac tttttacctc 121 acttcccata ttcctaatga ggttggttgt agtgtttttc atcttactga tttttttttg 181 tttgtttgtt tttgttttgt tttgtttttg gctaatgact ttaaaagacc tccagttctt 241 gaacttcagc tagggtccca tccccagagt ttatcagaag gtataggttg ggtccaagaa 301 ttcacaggtt ttttaagttc atagtgaagc tgatgctgta aaactacata tcaaattggg 361 aagcaatgaa ttagaagagg ctcatgtttt cgtcatgtta catgtaagtt cctccttcac 421 ttttcgccat gaataaaagt tccctgtggc ctcaccagaa gcagatacgg gcaccatgct 481 tcctgtacag ccagcagaaa ggtagctaat tacatctctt ttctttatat gttaggaaaa 541 ttagaaggac attcaatgtt ttacaaaaag gcccacaaac gaaatacaga caacgttcaa 601 tttattaata cgtttcaatc cccaaatctg agagaggtga agcaggtttc ccagggcctc 661 agaagcagga ttgaagaatg ggtcccttgc tggcgacttc agtggcgtgg cattgtccca 721 ggtggccatg gaaatcaggg tgcataccct ttaaaagcag ctgtggcctc cgagatatcg 781 ttttcccagt cttttctccc cctccctgca cgacgcctct tggcagacat ccgggagaac 841 ccgaaaggcg ctcgttgcct ggttggatgc agggtacaaa ctacgttacc cagaaatctg 901 tgcccagctc attgttagcc ggcgtgcagc caatgacagc ccagaaactg ggcgtttcct 961 gctgctctgg gctgcagggg cgagacttct ggcgtcgccg tcgtgacgta tttttcctat 1021 gcccggtccg tgcattctgg ttgtgaaggc tgagttctag agatcgggtc ggctttctac 1081 gcggctctcg tggaacctag caaagaaaga cagtgaagac tgcaggacct tccttcgcgc 1141 ttttgttaca atccatgacc cctgtcgtgg gacgggcggc ctctcgcgga ggtgtctgcc 1201 ggggctgggc tcttaccgag gcctccacac acgtcctctt gtccttgtct cccccagaag 1261 cagccgcctt agtcttgtga gcgtttttac accgggtaga ttgagacttg gagtgctaca 1321 ctcagcccga gggcgtccag cgcggtggag gcgtggggtt tcggctgagc ccacagggca 1381 cagactgttc atccgcttct catggcagcg gcggtgctga tggaccgggt tcaggtgagt 1441 gggggcatcc ctcaagcgca ccccggcctg gttggtgtgt cctgggatgt tcgctctcat 1501 cgcgcacttc agggtgctca ctccgtcgca ttatgtggag gggttccgct cccctcatca 1561 gtggcataag gtgtggaacg gactcgaagg cgcaggggca gttcgtacaa atgcatgcat 1621 ggagaggaga atgagtttgg ggagttccgg gaactctctg tgcctggtga gaacagggac 1681 tagggggtca gaggtaggcc tggaatggcc gcccgaggag ctggtcctct cttctgaggg 1741 tgctgggagt catgcaaggt ttggtacaga gaagggaggt cctctgacgc agttcttaaa 1801 aggctccctc tggctgcaga gtgggtagac tagggattat agggtagacg gggagatcaa 1861 gatgaagtct actgaaatag tccaggtgag cgttgatggt ctgggccaga gtggtggcag 1921 aagaggttag agaatgtatg ggttttgggt tttgtatgta ttttataagt aaggctgacc 1981 agattctctg atgagacata gtgatgatat cctgggattc ttcacatgac tgtcttcact 2041 tgagacaggg ggcagtgaat ggaggtcatt tctagtttag tgtcctgaat ccaggccatg 2101 agttgtatgg agttatgtgg agaagtcccc acctggcgac caatgggatg agtactgcaa 2161 aagtatatca gacctaggaa caggtacgct caaatgcttg taggattgag tttggagcgc 2221 tcagggaatc tcaggtctta attacctttg ttggaaacag gtatcagagg aaggagttag 2281 gcaggaatgg ctggccgagg tgtctcgagt gtggtaaaag tcgtgggaga tttggagcag 2341 aggagggaga tgatctgact cagattttaa catgctctct ctggctttca ggttgagaac 2401 agactgtgag catgagggag cagctactgc agtagtgttg atggtgtctg gtctagaggt 2461 gtggctgtgg aggtgagaga agttggtaaa ttttctgttt tttaaagtag ggtgattgga 2521 ttttttgatg cgaaaggtga aagagaggag agtcgtggat gaccctaagg ttggtttgtt 2581 tttgctgtag ctggtaaaat tatgaaaatg ccatcaactg agatagagaa gttggcagca 2641 ggtatgattt ttagtggtta tatcaggagt taggctttgg tcatgtttag gttaagaggt 2701 ctcagttgtc atcttagttg aggcctcagg aaaacacaca ggtttttagt tctggggaag 2761 ggttctgttg aagaagttga aaaaaatggt ggccagtgtg tggagtggga tggggtatct 2821 tcaaatcatg gtggtaatgc tgttattcag taagaaagtt gaaactggac actgagagaa 2881 aactgggtct gttgcttagg cacctggacg gcagtggtgg atagtgtgaa gacgcgagcg 2941 cttcctggac actgcttgca gagtggcttg ggggatgagg gatgtcatgg aggtgcatat 3001 ggcatgggcc agtggcggtg gggatctcag atgtgaggaa ggtgtagtcc aaagagtaat 3061 gagcacgttg ggaggagtgt gaactgatgc ccttaagact ctagtattgg cagctcatgg 3121 gagaagatgg agctgaagtt tggttgtgtt tagaggcatc gtctggacaa caccaggaga 3181 actggttggt aggcgagtat ttgggaccct tcatgccaga cccagagttg agacagcatc 3241 tgtctgcaca cttgcctggt tctgctctgg gctggacaca gtggtgagag ggacatgagg 3301 agagagcaga cactagtggt gccaaggtct gttatctcct cttagttggg aggctgggga 3361 ggcgagggac tagagagtgg gtgtgtgagt gtgtagactg gggaggaggg ggttctgggg 3421 agagatgctg actgtggact cagctgtact catcatggca gagttgtgtg accttcgagg 3481 atgtgttcgt gtacttctct cgggaggagt gggaacttct tgaggaggca cagagattcc 3541 tgtaccgtga tgtgatgctg gagaactttg cacttgtggc tacactaggt aagtctgtgt 3601 tttagttatc taatgctaca tggcaaatta tcccaaagct tagtaactgc aaacaataaa 3661 aatcaattat ctctccagtt tcagtgcatt ggtaatctgg ttagagcatt ttttgttgtt 3721 tattattttg agggtttttt tttgtttttt gttttttgtt ttttcagaga gtcttactct 3781 gtcacccagg ctacagtgca gtgttgtgat catggttcac tgcagcctca aattcttggg 3841 ctcaagtgat cccccccacc tcaccctcct gagtagctgg gactataggt gcatgccacc 3901 acgcccggct attttacttt ttgtagagat aaggtcttgc tacgttgccc aggctgatct 3961 agagctcctg acctcaagcg atccttcctc cttggcctcc caaagtgctg ggattacagg 4021 cttgagccag tgtgcccagc ctgatcacac cttcttatgc aggtgctttt ggctctcatg 4081 aagttttaat caagctctca gccagtgttg tggtcttacc tgagaagaat ccacttacaa 4141 gctcactcac attgcttttt gcaagattta gtttcttgtg gcttgttgga ctgtgagcac 4201 tgttccttgc tggctgttgg ccaaagtcct ccctcaggtg gcctgtctgt attgtacctt 4261 atgacacaga agctaacttc tctttgagca agtgaccaaa agaatgttca agatggaagc 4321 ccaggattgt aacctaatct tgaaagtgtc atttcattat ttttgccata ttctattcct 4381 tggaaataag taagtccatc cagcttacaa tcaagtgagg aggcttacac aggggcaaga 4441 gtaccaagag gtagggatca ctgggaccat ctcagaggct gtgttccaca ggcctttaca 4501 cccactccgg tgtccttggt ggggtttgag tcttcttctc ttccccatgg gccactcttt 4561 ctgtccatcc agatccatga ctcttctcct tcccttcttt gtttccaaga gtaggtgctg 4621 tgggttctag ggctggctgt gttcagtgtt tctgccttca ctgggtaatc tgaacatctg 4681 ttgccctaga gctttgcagg aaaaggttag gattcgggag tttttcagtc tgctaagcag 4741 attctacaca gctcaggtgc ccactgcctg agtccgtgtc atgatctttg cacctctttg 4801 tcctagattc tgccctcatc ctctggctga catttcctgg ggcctgactg tgccaggaat 4861 ttcaggggct gacacagtca cttcttgtgg tgattacagg gctgtcctgg tacatccgcc 4921 ttggagagta ttttttctaa acattctatt ttctctgtgg agtgggtcta cctgcgccac 4981 tcacctgccc tttgtttgcg ttacagcttt cattttccca gtcccatgca gttgcgcagt 5041 tggagggggc agaaaacctt gggtgcctga cagggcagac atcactgcag ccacagtaaa 5101 ggagacctac agagggcatg gccctgctga atgggagctg gaagagggta tctgttatgg 5161 ttgggttcaa atcgaacctt caacttgttt tgtgtttgtc agtgttttgt tgccaaggcc 5221 catagtgatt cttcacatct gctttacttt cctcattctt ggcacttttt ccatttgtca 5281 tttctcacgt gtcttccatc ctttgggtgt tttcccactt ttctggcctc cgtgtttcaa 5341 ttgctctctt caacactagc ttactttaat gtgtcatgag ttctgcacat taaatagtcc 5401 gtgagaatga gctacttatt tggagtcaga ttttcgtggg catggacata cccatctgca 5461 cagagctcac ctgacaccag cggccaaggc cctgctgaaa ctctcttgca gaggaatcag 5521 gtagaggctt cacctatact ccctatactc tatgccattg tttttaggtc tttacctcat 5581 agtcagggtt cctccattct gtgaggccct atactgacaa ccagcccttc tttacactga 5641 tcctggccct cttgtcctgc caacaagcca gtggactcgc actggtgtta gacacacatt 5701 tgtgatgggg ctgctgcctc ccaccaaagt cagcatgcac ttcaccagct cttttttgct 5761 ttcaggtttt tggtgtgaag cagaacatga ggcaccttct gagcagagcg tttctgtaga 5821 aggagtgtca caggtcagga ctgctgagtc aggtcttttc cagaaagcac acccatgtga 5881 gatgtgtgac ccactcttga aagacatttt gcacctggct gaacaccagg gatcacacct 5941 tacacagaaa ctgtgcacac gtgggccgtg taggagaaga ttctcgttca gtgcaaactt 6001 ttaccagcac cagaagcaac ataatggaga gaattgcttc agaggggatg atggaggggc 6061 ctcatttgtg aagagctgta cagtccacat gttagggaga tcctttacgt gcagggagga 6121 agggatggac ttaccagata gctctggcct tttccagcac cagaccactt acaatagggt 6181 gagtccatgc agaaggactg aatgcatgga gtctttccca cacagctcca gtctcaggca 6241 acaccaagga gactatgatg gacagatgct tttcagttgc ggtgatgaag ggaaagcctt 6301 cctggacacc tttactcttc ttgacagcca gatgactcat gctgaggtga gacccttcag 6361 atgcctacca tgtggaaatg tgttcaagga gaaatcagct cttattaatc acagaaaaat 6421 ccacagtgga gaaatatctc atgtgtgtaa ggagtgtgga aaagccttca ttcacttgca 6481 ccacctaaaa atgcaccaga aatttcacac tggaaaaaga cactatacat gcagtgaatg 6541 tgggaaggcc ttcagccgca aggacacact tgttcagcat cagagagttc acactggaga 6601 aagatcttat gactgcagtg aatgtggaaa agcctacagc agaagctccc accttgttca 6661 gcaccagaga attcacacag gagaaaggcc ttataagtgc aacgaatgtg ggaaagcctt 6721 tagccgtaaa gacacacttg ttcagcacca gagatttcat actggagaaa ggccttatga 6781 gtgcagtgaa tgtggaaaat tctttagcca aagctcccac cttattgagc actggagaat 6841 tcataccggg gcaaggccct atgaatgcat agaatgtgga aaattcttta gccataactc 6901 tagcctcatt aaacatcgga gagtccacac aggagcaaga tcctacgtgt gcagcaaatg 6961 tgggaaggcc tttggctgca aagacacact tgttcagcac cagataattc acactggagc 7021 aaggccttat gagtgcagtg aatgtgggaa ggccttcagc cgtaaagaca cacttgtgca 7081 acaccaaaaa atccacactg gagaaaggcc ttatgagtgt ggtgaatgtg gtaaattctt 7141 cagccatagc tccaacctta ttgtacacca gagaattcac actggagcaa agccttatga 7201 gtgcaatgaa tgtgggaaat gctttagcca caactccagc ctcattttgc accagagagt 7261 tcacacagga gcaaggcctt atgtgtgcag tgaatgtggg aaggcttaca ttagtagctc 7321 ccaccttgtt caacacaaga aagttcacac tggagcaaga ccttatgagt gcagtgaatg 7381 tgggaaattc tttagccgca actctggcct cattctgcac cagagggttc acactggaga 7441 aaagccttac gtatgcagcg aatgtgggaa agcctatagc agaagctccc atcttgttcg 7501 tcaccagaaa gctcacactg gagaaagagc tcacgagtgc aacagttttg gtggcccttt 7561 agctgcatct cttaaacttg tttaacacca gaaaattcac acaagagaaa ggccttatga 7621 atgcagaaaa tatgtcatct tgttcatcct cataggactc acaccagagc aatgctctgt 7681 gagtaccctt tgtgagggaa ccatcagcta gcagatgagc accgtatatt cattccaccc 7741 tggggagatt cctgataagc accacatatg tgggaggctt tcatgaggtg tgttgcactt 7801 tgtaactgtc tagagctctt gatggaatta tatcactgcc agtgcctgtg gcggaagcca 7861 tcttattgct accagctgtg tgtgtcaatc actccatttt gctcagggaa ggcagacttc 7921 tgtgctttct ttcctgttcc ctacaggtaa tcatgaatat tttcaaggac ttcccccccc 7981 ccccacttca ccccctacca ttgagggtcc tcatcttttc cctcatgatt aggttctgag 8041 caaacatgat ctagctctca ccaaaaggac ctgagctagg gtctgctggg atttcctgac 8101 acgattttcc atcttcatgg acaatgttaa ctgtaaacgt gatagctgtg acttacttgt 8161 cttactgcca aatcgcccaa atttggaact gcttgtccca tgctgctctg atttatacag 8221 tgataagggc ctattgtggc agtctttact cttggggatt ttgttactgt gtagagtgga 8281 ttgagaaaag aattgggttc tgtcatgaag ggagtagcac cttttgtggg caccaccttt 8341 atgtgcctca gaggggacca aaggatggca gaaaactgtt ctcagtctag tttgacctaa 8401 tttacactat tgccctaggc ttgctaggaa agactgaaaa aatttttgtg cctaactttg 8461 tggcctggct gccaggattt cctgtgagcc cagtgaggag ggcatagttg gtgcgaaaat 8521 ctcctgtgct tgtggaagca acataagttg ggtcccttaa ctgtcacgta ctccccagca 8581 ctgttgaggg attgctgtct tctgtacctg tacaggatca agtccctgat atccagaacc 8641 ccataggcaa gtaacattga ctgtaagacc tctgtagggc aatgtgaaaa tcatgattgc 8701 tacagaagca ctgagataat ggagtgggga tgactgtggc aaacaaaagg agattcgaac 8761 attctagagg ggcatccaca agtctcggtg cagttatgat ggtgtgggat gggagtgcat 8821 agaaattctc aggaccttca gacatctgta tctcctgtgg ccaaggccac tgtcagagga 8881 aataatctaa atgtttgtgg attcctgtgt ctccctgggc agttgacctg cacagacagg 8941 aacctcagca gtgacagaag agagatccag ggatgggtct tctctgaccc agccagcatt 9001 ccgagtgact gaggtggatg tggacatcat tgcttaccaa tttttaccaa cattgaggca 9061 gggctcactc tcctaaattg tagggagagg aatatggtaa agccaaaata gttgacagat 9121 ctaactttcc tagtggaaca gtatatatag attttaatga ggaacattta tttaatagct 9181 ttatggaggt atgattaaca tgcaataaac tgcaaatatg taaagtgtaa aatgtgctgt 9241 ttcagcatgt gtgtacacct atgaaaccac cacagtcaag atatccaaca caacaaaaga 9301 ttgtcccttt ataatcctca atttttcctt atcttgtttt ccacaattca caagcaacag 9361 caagcatttt ctataatttt ataaatgaaa ccatatggtg tgtacttttt agggggtggg 9421 gctctggctt tttcagtaaa cataactatt ttaaggttaa tccattatct tgttgcatga 9481 atcaattgtt tgctcatttg tattgctgag tagtatttca gtatttcatt gtataccaca 9541 atttttcttc tttgtcaact taattgttat ggatgtttgg gtaactttca gtttttggct 9601 attacaaata aacctctgaa gatttgtgtg caaattatgg ctgaatgata ttccattcta 9661 taaatacacc tcagtttctt tatccattta cttattgaag gacggttgct ttcaagtttt 9721 ggcagttatg aataaaccta ctacatattt gtatagtttt tctgtggaga tgttttcaac 9781 tcatttggat atgtgccaag gagtgcaatt ttctctgtca tatagcatgt ttagtcttgt 9841 aagaaactgc caagctgtct tccaaagcgg ctgcaccatt tattttattt atatatattt 9901 tttgagacag agtctccact ctgtcaccca ggctggagtg cagtggtatg atcttggttc 9961 actgcaacct ccacctattt tagtagacat gggtttttac catgttggcc aggctggtct 10021 caaactcctg acctcaagtg atcaccctcc tcggcctccc aaagtgctgg gataacgggg 10081 gtgagccact gcaacgggct gtggctgtac catttttgca tccaactaac aatgattgaa 10141 cattcctctt gcatctcatc ctaatcagca tttggcattt tcagtatttt ggattttaaa 10201 atgccattca aataggtgtg tagtggtatg ccattgttgc ttttatttgc aattccctaa 10261 taacacacaa taagtatctt ttttaaaggc agggttgttt tgttttgttt tgaaatggag 10321 tcttgctctg ttgcccaggc tggagtacag tggcatgatt tcggctcact gcaacctcca 10381 cctcccgggt tcaagcgatt ctcccacctt agccttccaa gcagctggga ctacaggcac 10441 ccgccaccat gcccagctaa tttttgtatt tttagcagaa atggggtttc gccatgttgg 10501 ccaggctggt ctcgaactcc tgacctcagg tgatctgtct gcctcggcct cccaaggtgc 10561 tgggattaca ggtgtgagtc actgcgccca gcctggctgg ggttttttta tgtgagtttc 10621 acacacctct gagtaagtgg cttgcagaca gagaagttaa caactttggc tgttagcttg 10681 aacctgtgat ttatgaatac caaaaagaca aatacagaat ctaaggaaac acataacagt 10741 cctgtataca cacagcagct ggcagcccta ttcccggtgt gtcccatata cctaccattg 10801 acctgaaacc cagtttgtac gtgctcattg aacattttgt agagcagtaa atacacatca 10861 tcacacagat tgcctactca tttttatttt tatttttcga gacggagtct tgcactgtca 10921 cccaggctgg agtgcagtgg cgcgatctct gctcactgca acttccacct cctgggttca 10981 agcgattctc ctgcctcagc ctcccaagta gcttggatta caggcacccg ccaccatgcc 11041 cagataattt ttgtattttt agtagagaca gggtttcact atgttggcca ggctggtctt 11101 gagctcctga cctcgtgatc cgcccacctc ggcctcccaa attgctggga ttgcaggcgt 11161 gagccaccgc acccggcctt catttttatt ttttgacaag tgtttccatt tgttatctct 11221 atccctccaa acagttaata agcgatgttt tcaagtatga aataagaaat acatatagga 11281 gacagaagca taatgtgtcc aggttccagc acagccagtc tcaacttcat tctttgactt 11341 ctcagactct tgctcaaact ctaccgtaac cttgcttttt ccccccttca catgtcacag 11401 agagatttct gtaggactat ttctgccact agttttgaaa taacataatc actgttagtc 11461 caaactgcac catcctgtaa gccccctgcc atctcacaga ccttggtcag agtgaagcat 11521 tccacggagt gagggccttg agaaacatcc tgcccaactg cctgactttc ttatcacatc 11581 gttctgggaa aagatccaag gaaggtcact atcacatcct gccggataaa aggccaaact 11641 gcctcaggaa catcttacgc acatcctttg gccgggtgct gtggctcacg cctgtaatcc 11701 cagcaatttg ggaggccgag gcgggcggat cacctgaggt caggaattcc agactggcct 11761 gaccaacatg gtgaaaccct gtccctacta aaaatacaaa aaaattagcc gggcgtggtg 11821 gcacgcgcct gtaaagccaa ctactcggga ggcggagttc aagaatcgct tgaactccgg 11881 gaggcggagg ttgcagtggg ccgagatcac gccattgcac tcaaggctgg gcgacaagag 11941 cgagactcct ccaaaaacaa aaaaaacccc accatcctcc tgggcagcaa gtcataccct 12001 cccgccgcgc cccctgcccc ccacccctga cccctctcat ccaggcctat aattgcccca 12061 gcctgtaagc agtgaggggt tctggcacta agctagttct ccccatcaca ggtctcgtgc 12121 tggacataaa acctgcattg ctgtagagct gccaactctg cctttcttta accctcgcct 12181 tcccttcaaa acctaacagt tattattaat agatttctgt aattttcatt tcacaagtat 12241 tttcacagat aatttcaact ccctgtaaaa tgataactca cttctcagga aggagataag 12301 atcatgaaag ttcacagttt ttattgggga taatacgttc cctgtgtaca tatactccaa 12361 ctatggaatg gcacaaaatg tcctgtcagg aaatggcatg tatcagtcac tctcattttc 12421 agacaggatt gagactttta aaataaatgt aaaatatttc agacatgaag taataagtat 12481 tattagaata aaagcattgc actcttggta ggaacacaat cactgactct acagacagta 12541 atgatgtttt cttctggatc cagaggcaca gcgtcaccct ttcaatctac gtcggtatct 12601 cctgtcttct tttgtgcaaa ttcccagcca tgtagtcttt ctttgttctc ttggaccaac 12661 catattgaag cttaataaat agaagtcaaa gtttaactgt atgtgtattt gaaaagcaac 12721 atatggctga gaggcatgag tagagacaga gtgcctttgt ggtatagtgt gtccctttgg 12781 cggtgtagcc gttgagaact tgctgtgctt cacaatccga tatcataggt ccctcacatt 12841 tctgtgagta tcctagacct tacttcaaga aaatctggtt atgtcgcctt taagagcaac 12901 ccaagccccg tcctgtccgt ggagtccggc actctgttta ccagctcccc tgacggtgcc 12961 actgagcctc tatccgccat ataaggactg accaacgctc taaggcaagt ccagaagcgg 13021 tgtcgaaact tcataaccca gaagacactg cggttctcgg cagcgcgact gacccaatga 13081 atgtgcagga aggaaccttt gcgtgcgtgc gtgcgtgcgt gcgtccgtcc tcgtgctcgc 13141 gcatcgtagg agggcgggac ttccggcgtc ctcttgccgt ggttgatttg attttctctg 13201 gtgttttcac tagttccggc ctttggcgct ctatgacgtc accgaagtga cggagcggaa 13261 aagcgcgaga agcggcttgg ttccttgtac gcagaggcgg tagtgacaca ggcacaactg 13321 acagtggcag aagctcagct gacaaggact ggggacggcg gtgtccttgt cttgcctttg 13381 tcgcccccgc ccctctcttc cctggctgga cttgcggagt ccccgccgaa gaacccgagg 13441 tgggtgcccc gtcccaggcc cccccccacc tccgcccgac ccctccttcc agtgctgaag 13501 cccatagaag gggccctgca ggtcaggccc cttgtctcga agagaggggc gttcctgtgt 13561 ggggtccccg tatcagctgg tttgatgcag cctcagagct cccttcgggg atctcaggtt 13621 cagagcatgg aggtgctgcc aagtgggccg cgtcggggag cccggagtgt gggttgtttc 13681 tgcgggaaag agagggtttt gtgaattttc tcttggaatg gcagcctgag cagccggtat 13741 aaccatggaa ggttgtcaag ggaagaccag tgggaggggc aggtccagag ttagaacgag 13801 caggatgggg aattgacaca gagagaccca gaagaggcca ctgcagtgga tcccagaggc 13861 tgcaccagag ccgtggtata agagggacaa tgtgatggga ttctggatag acctggaaga 13921 tagagccaac aggttttcct gccgtatcag atatgtctgt gagcaaaaat agggagtgaa 13981 gggtcatttt tttttttttg cccgaacatc tgaaagaatg aagttgggga agtcagcagc 14041 atgagaagta ggtttcatgg agaagatcat gggttggagt ttggccagtg tgattctcag 14101 gcgtctcaag ttgaaaatgt cacatgggta gggggacaga aaggtgtgga gtctacagag 14161 gagaactggt gtagattctt caaaagaata gagggtggcc gggcgtggtg gctcacgccg 14221 gtaatcccag cactttggga ggccaaggcg agcggatcac aaggtcaggc gttcaagacc 14281 agcctgacca acatggtgaa accccgtctt tagtaaaaat acaaaaatta gccgggggtg 14341 gtgacgcgcg cctgtaatcc cagctattcg ggaggctgag gcaggagaat tgcctgaacc 14401 caggaggtgg agattgcagt gagccaagat cgcgccattg cactccagcc tgggcgacag 14461 agcgagattc cgtgtcaaaa aaaggtgcgg agcgcgggtc tcttccgcgg aaactgacat 14521 tgcgtttccg ttgtcggcct cccgctgcag gagccatata ttgaagacca tgtctggaag 14581 cttctacttt gtaattgttg gtcaccatga taatccagtt tttgaaatgg agtttttgcc 14641 agctgggaag gcagaatcca aagacgacca tcgtcatctg aaccagttca tagctcatgc 14701 tgctctcgac ctcgtagatg agaacatgtg gctgtcgaac aacatgtact tgaaaactgt 14761 ggacaagttc aacgagtggt ttgtgtcagc atttgtcacc gcggggcata tgaggtttat 14821 tatgcttcat gacataagac aagaagatgg aataaagaac ttctttactg atgtttatga 14881 tttatatata aaattttcaa tgaatccatt ttatgaaccc aattctccta ttcgatcaag 14941 tgcatttgac agaaaagttc agtttcttgg gaagaaacac cttttaagct gaatggagaa 15001 aattccaaaa taaattatat caccacaatg gtgtatactc aggaatgtgt acattgtaaa 15061 ttacttgatt aaatagcctg gaaatctttt gtgtattctc agcttatcta aacttaatga 15121 aatttctttt atatttaaaa atagtacatt ctgtctcatg tcacatatca gtagatcaat 15181 tagtatttcc ttgtgaacaa tgttatttat aaagaactca ttatcaataa taattaattt 15241 ctttcttttt tttttttttt tgagacggag ttttgctctt gttgcccagg ctggagtgca 15301 gtggcacagt ctcgtctcac tgcaccctcc gcctcccagg ttcaagcaat tctgcctcag 15361 cctcccgagt agctgggatt acaggctccc accaccatgc ctgcctaatt ttttttgtat 15421 ttttagtaga gacggagttt cgccatgttg gtcaggctgg tcttgaactc ctgacctcgt 15481 gatctgcctg cctcggcctc ccaaagtgct gggattacag gcatgagcca ccactcccgg 15541 ccatgaaata tttttactta aaaattggga ataagctttc tttttctttc tttctttctt 15601 tttttttttt cttttgagat ggagtctggc tcctgttgtg caggggctgg agtgcagtgg 15661 cacgatcttg gctcactgca acctccacct cccgggttca agcaattctc cttcttcagc 15721 ctcccaagta gctaggatta caggcatgca ccaccacgcc tggctaattt ttgtattttt 15781 agtaaagacg ggatttcacc atattggtca ggctggtctt gaactcctga cctcgtgatc 15841 cgctcgcctc ggcctcccaa agtgctggga ttacaggtgt gagccactgc gcctggccat 15901 gaaatatttt ttacttaaaa attgggaata agcttttttg tgtgtgtgtg tatgtttttg 15961 tttttttgtt tttgagatgg agtcttgctc ctgtcatgca ggctggagtg cagtggcacg 16021 atcttggctc actgcagcct ctgcctcccg ggttcaagca gttctccttc cgcctccaga 16081 gtagctggga ttacaggcat acgcctggct aatttttgta tttttagtag agacagggtt 16141 tcgccatttt ggccaggctg gtcttgaact cctgacctca ggttatctgc ctgcctcggc 16201 ctcccggagt gctgggatta caggtgtgag ccactgcgcc tggctgggaa taagctttcg 16261 acttgcccaa ttcagataaa ttgttttttt tttttttttt tttgacgtgg agttttgctc 16321 tgtcgtccag gctggagtgc agtggcgctt ggctcactgc aacctctgcc tcctgacttc 16381 aagcgattct cctgcctcag cctcccaaag tgctgggatt acaggtgtga gccaccatac 16441 ccggcccaga taggttgatc ttataacaat ccagaaacaa atgtcatagt caagatttgg 16501 tagatagatt taaaactaaa atattctgcc attggaagtg aaatgtcata gcacatacgt 16561 ggacgttatg catttagaga tgttataaaa atgtattggc agtatacata gcactactca 16621 agaagccaaa gaaacacttg tgcagtgcta agtgtcacat gtctgcttct gccagaggct 16681 aggaatagtg atcttgctat aatgtgagaa cctgaatcat gtttgtaaaa taggaggctg 16741 ggagcagttc caggccacag tgaagtgtgt tgcttctgtc actttatatt cttatatttc 16801 ctttccctca gacaatcaac tgttgatgcc tgtacttgtg aaaagttgta aggcaatttt 16861 tagggttgtt gtcaaagaaa agcttggatt acattaacat ttgtactcag ccttttaggc 16921 aatacaccag agggggctgg gaattgtctg tttgttccta tgataagtag atcttactgt 16981 aaaatagtaa atgtccattg aaaagcaagt aacacaacct gtgctaactg ggagcacttg 17041 aacaattttg ttccattctg aataactttt cagcaagtaa ttctaagctt tgtcttatat 17101 ccttgtgtaa aattgctacc ttcattttta ccctatttga tttcttaaat ggtttgttca 17161 agaaaaaaaa aaaaaaaaaa aagaatagag ggtgtagatt ggaccgggtc ctgaggatcc 17221 aatttggtag ggaagtctaa aggtcattgc ctgcaagcta tgtggtgagg gaggagagag 17281 ctgatacttg gggttcacac cataggtgga gccataagcc agatgcttac tttagaaagg 17341 cgcagacctg ccataggaag gattttgtcc tctgcctaga aaaagggagg cgtgggtggg 17401 ataaggaatg ggcttgggat ggagactaag gtttgagatg gtttatattc tgcacctaca 17461 ggcttggctg accctcaggg ccactctttg tgggaaaaga aaaaagttat ccaggctagt 17521 ggagcagctg ggttctaaac tagggcactg ctttttcccc ttctccacat atcaaggcaa 17581 gattggatga tctactgccc cagtcaaggt ctgccagcaa gctcccctca gagccttatg 17641 taacaggcag gcctggttgc agaggaccag tagctgacct acctgctcag cccttcccta 17701 caactcatta agaatatatg aaacacttat tggtataagc actaaccagt gtccttctaa 17761 atgtgtacac ataggcagtt agaagtgaag ctattcattg agttaaattc ctgcaaacac 17821 tacataattg atttgaccca aatttgccta taattatcaa cataccatgg aaaattttag 17881 tatcctctaa aattttaaaa gctaaaaata tacacctcag tttttgaaat tctactacat 17941 catttatact gaaaacatat agtaaaccat taatccaaga acagtaatag ctgatatata 18001 tgatgagcaa cgagtgtttg acaccggtaa gagtcctatg aggtgggagc cattcttaca 18061 acctatttac agatgagagc actgaagctc cagcaggttc agttcttttc tagagtcaca 18121 gtgctgctaa gagccaacag tgaagtcgga ggagctgagt ttatacaatc aacttgagat 18181 cagtttgctt tagggtaagg aagaagagat ggtggtccaa ctctgcctgt aagatcttcc 18241 catcgttgcc catttctcac ggtttccctc ttccttcagg gtcccctggc gatggcagaa 18301 atgaaccctg cacaggtgag tggagtgttt tctacctttc acctaccagt tatctcatag 18361 atggttttgg cacctccatg taaagaaaag tggtgtccag agactctctt tctgtctttc 18421 tccctctctt atgtctgaaa agtgagtgat tccaccagtt ctagtatctt aaattaggtc 18481 tgggattttt gtttgattgt tttaagttta ctgtgacacc ttacctggat ggccctggat 18541 tatgtggagt ggctcccatt tttgacagtt gacgtggtaa catctgtcag agtgtcatgg 18601 gcacaagatt aagaatgttt cagtgtttag aggagagact gaactggatt tttagggaac 18661 tgcaagtgtc tgttatatcg gataggaata gggagcagag tgggaggtgg ctagagacac 18721 acaaacatca ggcctggaat gacagcctag gtcctgggtc tctaatctaa gggaggtggt 18781 agatggggaa gatttgaagc agaagagggc agtgatagga ccctagctta aggaactttc 18841 tttggcgtct gagtggagga cagactgtgg aggtgagggt aatggcaggg agatctgaga 18901 ggatgttcct gcagcagtct aggtggatgt tggtggtgac ttgtcaagag tagcggtggc 18961 ttcaggatgt gttttgaact ggagctgccc acattttatg atgtagagag tgaggcagct 19021 gggggggtgg gatagaaggg gaatcaggag tggccccatg gtttttgttt ggaagcatag 19081 atgctccatc agctaagatg ggagaagtca gcagtattag gaattttcag agtgagtgtg 19141 aggagtcagt gtttgttcaa gccacagatg tctcagagac tcctggtgga ggagttcagg 19201 ttggagacgt ccacactatg gtggctgtcg tatggaccag gatgaagcca tggattcagt 19261 ttaggtgaca gcatctctag gttatggtgg cagtgctgtt cgaagacaga gaattggaat 19321 ggggcatgag aactgggtct cttactgggt agctgtgcag tggtggagga gtcactagac 19381 aaatgaggga atggagtgtt tgtggggatg agtgtatgtg agatggtggg gtataggagt 19441 gagaaagact tctgaaggag agtggacatg atggggttgc tgaggagggt aatagggtga 19501 ggtcggagag ttgctaagca tcagttggaa tgagatgagg atgggaattc tattagtctg 19561 ttttcatatt gctacaaaga actacctgag actgggtact ttatgaagaa aagaagttta 19621 attgactcac agttcttttt tttttttttt tttttttttg agatgtggcc caggctggag 19681 tgcagtggca caatctcggc tcactgcaat ctctccctgc caggttcaag cgattcttct 19741 gcctcagcct cctgaatagc tgggacgaca gactcacccc actacgccca gctaattttt 19801 gtatttttag tagagatggg ctttcaccat gttggccagg atggtcttga tctcctgacc 19861 tcgtgatcca tcctcctcag cctcccaaag tgttgggatt acaggcgtga gccaccgcac 19921 ccagcccata gttcttcagg cttaacagga agcataactc ggaggcctca ggaaacttac 19981 aatcctgtgt ccatgaggca ggacacagct gagagtatgt gaagtcctca catggtttgg 20041 gcacctgaca gtgaagtgag aaacagggcc atgtagggaa ccttcaaccc agggcttagg 20101 gctgcccccg gtgctaggga actttggtgt tctggggcag gactgggagt gaattggatg 20161 ctgagcgtat ttgccgtgca ccacctggtt tctgtgtgca gagtggcctg ttaggaggtg 20221 aaggaaatgc ctgtggtgtg ggctgggagg tgtctgtgga attgactgga gtggaggtgt 20281 gagagatggg attggagtgc tgtccaaaga gtgatgtgca ggcaggagga gtgtgaatgg 20341 agggcctcag gccctggccc tggtagctca agggaggagg atgagtctga gtttggttgt 20401 cttgggaggc aagatctggg tgactccatg gaaactggct ggcaggagag gatggatccc 20461 cccacgggag gcccacattc cttttctacc ttatcttcca tctgtttcct ctgtgtgctg 20521 gacacggtgg ccaatgtcaa tatggagaga aaagtcactc atgccacctg gatgtggttt 20581 ctcttccaca ttgggagggt gtagagattg aggaaggaaa ggaggtgtag gggagaggga 20641 tgagttcctg gagagagaaa agtagcagtg attgtaccca ttggggtttt ttgttttttt 20701 ttttttagat ggagtcttgc tgtgttgtca cccaggctgg agtgcagtgg catgatcttg 20761 gctcactgca acctgcacct cctggattca agcagttctg cctcaatctc ccaagtagct 20821 aggactacag gcatgtgccg ccatgcccac ctaatttttg tatttttagt agagatggga 20881 tttcactgtg ttggccaggc tggtcttgaa ctcctgacct caggtgatct gcccgcctca 20941 gcctcccaaa gtgctgggat tataggcatg agccactgtg cctggccccc atggagtttt 21001 aatttatgga tgtgagtgag tgatgccagg gacaggccaa tgacactgaa agaagatatt 21061 tattccttac gtttccctaa ggaggttaca taccacataa cacagggcca cacggtgaaa 21121 caccagattt gatcaggagg aaaattggag tgagtttagt ttagtccaca gatgctgttg 21181 gggtttccaa gggaaacaag gcagggctgg ctgtcaggat agtttggcta gttttagtaa 21241 ttccaggaca cctgggctat tgggactgtc cctagttgtg cagtacctgt ccctgggttg 21301 atttagagca tgggaaatac tgacttggtt tgagaaagtt agagatggag atggttaaga 21361 atatgtgccc aggatggtag gggagatgga aacacattta gctgttagtt tgtccctgtg 21421 tttaatgggt ggtaaatata agtagaaaat aaatagagaa cttaagtaaa tacagtttga 21481 aaagctgctc atgtattcat ctttcccaat ttggcagggt catgtggttt ttgaagacgt 21541 ggccatatat ttctcccagg aggagtgggg gcatctcgat gaggctcaga gattgctgta 21601 ccgtgatgtg atgctggaga atttggccct tttgtcctca ctaggtaagg ccctcacact 21661 tgcccagtgt cctgggttgg gctgtgttgt ctccttttac ctgaaggcag ctctgcgttt 21721 cccacagtga gaccatgggt gctgcttctt ttccttgttt cctgacatat gttccatgag 21781 agtcaggact gcaatatgtg ctgtgtgctt cctttttcct ggcagcccca tcctctgctg 21841 ttctgaggct tgcaagaaag ggctcaggat ccagaaatgt tgaaggtgac atagagaccc 21901 actggccctg tgttctattc aaatgcatga gacctctgtg ctctgttgtt cccttgctct 21961 tgccaatatt tcttggtccc ttcttgttcc tggaatttct ggcactgacc tgatcactaa 22021 tttggtgaac cactggcttg attgtgtagg atgtagaaat cttctgtgaa gttctggtga 22081 ttatcgaatg tgagatatca tcaggtgatc tctctgggaa tgtgctgtct tgtctctttc 22141 tgtgtctttc ccctaggact ggtctctttc aggtctcaca gactgtcacc ttaagcaatg 22201 gggagggccc tggtacccaa gagggtggat attactccct gaagggctgt agagactcag 22261 agggacatga tcctggttcc tgggagttgg gaaaaggtgt aatatcacag ctggggtcat 22321 attactagga atctgtctgg tgacccctgt tgtttagttg aagactggtt gccacaactc 22381 actcttcttt tccttccccc gttgtcattt ttactctgac ttctaatccc tggcttccag 22441 ccgttaccta ttttcttcaa ctgctccctc tgcagtgcca tccactgcat gacctgtact 22501 taaggtgagg tctgcatgta tttgtcagta gtccctgaac accatctcct ctgtcacaat 22561 cagatctctc tgggtttaga cacagccttc tacacactgc tcactagtca ccagcagcca 22621 tggcctgtgg aaagccattt gcagagaagt gacttggaga ccctcctcct ctctgcactg 22681 tacccctctc ttgtgttctt acccatcatc agggctctcc cattatgggt atttgccagg 22741 tccaactctg cacagcaggc cttcttctca ttggccttag tgtttgtatt acttcattct 22801 cactctgcta ttaggaaata cccaagactg ggtaatttct aaaggaaaga ggttaattga 22861 ctcacaggtc cccattgctg gggaggcctc aggaaactta cagtcatggc ggaaggcaaa 22921 ggagaactag gcatcttctt cacagggcgg caagacaaat gagtccaagc agggaaaatg 22981 ccagacacat aaaaccagca aatttcatga gaatttactc agtttcatga gaacggcatg 23041 ggggaaactg ttcccatgat tcagttctgt ccacctggtc ccacccttga cacgtgggga 23101 ttatggggat tacaattcaa gatgagattt gggtggggac agagagccta accatattat 23161 tccaccccgg ccccccccac ccaattttat gtcccttcca catttcaaca ccaatcatgc 23221 cttcctaaca gtcctccaaa ctcttaattc actctagcat taacccaaaa gtccaagtcc 23281 aaagtctcat ctgagatgag gcaagtccct tctgcctatg agcctgtaaa atcaaaagca 23341 agttagttac ttctaagata taatggggat agaggcactg gataaatgta cctattccta 23401 atggaagaaa tgggccaaaa caaagcctac aggccccatg caagtccaaa acccagcgag 23461 gcagtcatta aatcttaatg ctctgaaatg atatcctctg actccatctc tcacatccag 23521 ggcacagtga tgcaataggt gggctcccac tgtcttgggc aactctgccc ctgtggcttt 23581 gcagggtaca gaccccccct cagctgcttt catgggctgg cattgagtgt ctctggcttt 23641 ttcaggcaca cgttgcaagc tgttggtgga tctaccattc tggggactgg aggacgtggc 23701 cttcttctca tagctccact aggtagtacc ccagtaggga gtctgtgtgg gggctccgac 23761 cccacatttc ccttcctgca gtgccctagc agaggttctc cgtgagttcc acccctgcag 23821 caaacctctg cctggacatc aggcatttcc atacctcctc tgaaatctag gtggaggttc 23881 cctcccaaat atcagttcat gacttctttg cacccacagg cccaacacca tgtgcaagcc 23941 accaaggctt ggggcttaaa ctctgaagca atggctggag ctgtaccttg ccccttttag 24001 ccatggctgg agttgaagca gctgggatgc agggcgccat gtcccgaggc tgcatagagc 24061 agggggaccc agggccagca gctgggatgc agggctccat gtcctgaggc tgcactgaac 24121 agcctgggcc aggcccacaa aaccattttt ccctcctagg cctctgggcc tgtgatggga 24181 ggggctgccg ggaaggtctc tgacaggcac tggagacatt ttccccatca ccttcgtggt 24241 taacattcca ctcctcgtta cttatgcaaa tttctgcagc agccttgaat ttctccccag 24301 aaaacgggtt tttcttttct attgcatagc caggctgcaa attttccaaa cttttctgcc 24361 ttgcttcctc ttgaacactt tagcacttag acatttcttc tgccagatac cctaaatcat 24421 ctctctcaag ttcaaagttc cacagatatg tagggcaggg gcaagaagct gccattctct 24481 ttgctaaagc atagcaagag tcacctttac tccagtcccc aacaagttcc tcatctccat 24541 ctaagaccac cacagcctgg gcttcatggc ccatattgct aaatcagcat tttgatcaaa 24601 accattcatc aagtctctag gaagttccaa actttcccac atttcctgtc ttctgctgag 24661 ccctccaaac tgttccagcc tctgcctgtt acccagttcc aaagtcactt ccacattttc 24721 gggtatcctt acagcagtat ttcactacct cagtaccagt ttactatatt tgtccgttct 24781 cacactgcta ttaagaaata ccctggctgg gcacagtggc tcactcctgt aatcccagca 24841 ctttgggagg ctggggcagg tggatcatct gaggtaggag tttgagacca gcctggccaa 24901 tatggtgaaa ccccatctct actaaaaata caaaaattag ccgggtatgg tggcaggtgc 24961 ctgtaatcct agctactcgg gaggctgaag caagagaatc acttgcaccc agggggtgga 25021 agttgcaatg agctgagatc gtgccactgc actccagcct gggtgacaga gcaaaactct 25081 gtctcaaaga gatacgcgat acttggccag gcttggtggc ccactccggt aatcccagca 25141 ccttgggggg gctgaggtgg acgggtcact tgagatcagg agtttgagac cagactggcc 25201 aacatggcga aaccccgtct ctactaagaa taacaacaac aacaaaaatt agccgggcat 25261 ggtggctcat acctgttatc ccagctactt gggaggctga ggcaggagaa ttgcttgaac 25321 ccaggaagcg gaggttgcgg tgagctgaga tcaagccact gcactccagg ctcggggaaa 25381 gaatgagagt ttgtctcaaa aaaaaaaaaa aaaaaaaaaa aaagacatac ccgagactgg 25441 ataatttata aaggaaagag gttgaattga ctcacagttc cgcatggttg ggaggtctca 25501 ggaaacttac catcatggta gaaggcaaag gagaagcagg catcttcttc acaggacagc 25561 aggatggagt gagtgcaagc aggggaaatg caagatgctt ataaaaccat tagattggcc 25621 gggtgcagtg gctcacacct gtaatcccag cactttggga ggccgaggag ggcggatcac 25681 ctgaggtcag gagtttgaga ccagcctggc aaacatggtg aaaccccatc tctactaaaa 25741 atacaataat tagccaggcg tggtggcacg tgcctgtaat cccagctact caggaagctg 25801 aggcaggata atcacttgaa cccaggatgc agatgttgca gttagcagag atcgtgccac 25861 tgcactccag cctgggcggc agagcaagac tcagtctcaa aataaataaa taaaacccat 25921 gagatctcag gagaactcat gatcatgaga acagcatggg gcaaccgtcc ccatgattcg 25981 gttaccttca cctggtccca cccttgacac gtggggatta tggagattac aattcaagat 26041 gagatctggg tggggacgca gagcctaacc atattgtttc cagaccaaac cgagggtggg 26101 gctgcttatt cttgcagccc aataatgaga tgcagatgaa ctggggaaaa agagagtttt 26161 tatttctgta accggttaca gggagaaggc ctggaaatta ttgccaaacc aactcaaaat 26221 tacaaagttt tgcagagctt atataccttc taagctattt gtctacatgt gggtttgcat 26281 tcatctaaag atataagtga ttaacttctc tgtaaccaag atctgagtcc tgaagacctt 26341 cctctggagc ctcagtaaat ttacttaatc taaatgggtc caggtgctgg ggtgattacc 26401 cttatcttgt ctccttttaa atcatggagg tttggggagt ttccttagaa ccccaataaa 26461 cttattcgtg gaggcctggg gagtttcttc agacccccaa taaaatgtat ttaatcctaa 26521 acgagtcctg ttaagaattc cttcattatc ttttcatcct ttaaggccca ggaaaggcct 26581 aggcaaaact cttggtgggc ttttgttaca ttccagcctg tacatgaggg cactggctct 26641 atcagctttt aatcaactta accactcagt cagtgctgaa accgttgtca tggaagcctg 26701 cctgctcagc tgttagtgag acctggcctg ccacagtacc agggttcatg cctgtcacca 26761 attccatgat cattttcatg gggtgtctga ctgacatttg tgatggggtt gcctccgctc 26821 atcacagtca acatgcacct caccagcatt tttattctta caggttgttg ccatggagct 26881 gaggatgagg aggcaccttt agagccaggt gtttctgtag gagtgtcaca ggtcatggct 26941 ccaaagccct gtctatctac ccagaatacc cagccctgtg agacatgtag ctcacttctg 27001 aaggacattc tgcgtctggc tgagcatgac ggaacacacc ccgagcaggg actgtacacg 27061 tgtccagcac atcttcacca gcaccaaaag gagcagatta gagagaaact ttctagaggg 27121 gatggaggaa gaccgacatt tgtgaagaac cacagagttc acatggcagg gaagaccttc 27181 ttgtgcagtg aatgtgggaa agcctttagc cacaaacata aactttctga ccatcagaaa 27241 atccacactg gagaaagaac ttataagtgc agcaaatgtg ggatattgtt tatggaaagg 27301 tccacactca atagacatca gagaactcac actggagaaa ggccttatga gtgcaatgaa 27361 tgtgggaaag cctttctttg taagtctcac cttgttcgtc accagacaat ccactctgga 27421 gaaaggcctt atgagtgcag tgaatgtggg aaattgttta tgtggagttc cacactcatt 27481 acacatcaga gggttcacac tggaaagagg ccttatggtt gcagtgaatg tgggaagttc 27541 tttaagtgca actcaaacct ctttaggcat tacagaattc atacaggaaa aaggtcttat 27601 ggttgcagtg aatgtgggaa attctttatg gaaaggtcta cactcagtag acatcagaga 27661 gttcacactg gagaaaggcc ttatgagtgc aatgaatgtg ggaaattctt cagcttgaaa 27721 tccgtcctca ttcaacacca aagagttcac actggagaac ggccttatga atgcagtgag 27781 tgtgggaagg ccttccttac aaagtcccac ctcatttgtc atcagacagt tcacactgca 27841 gcaaagcagt gcagtgaatg tgggaaattc tttaggtata actctacact tctcagacat 27901 cagaaagtcc acactggata aggcccttat gaatgcagtg gatatgggaa agccttcagt 27961 caccaacata ttgtggctgg acagcaggca gtacacactg gagaaagact gaatgccgtg 28021 aacgtgggta attatgtagg tacagctctc cagtcgctat gtatcagaga attcacactg 28081 cagaaatgtg tgttcagcaa actcgggaca ttattttggt ttgactctca tctcattaga 28141 cattggagag tttacactga agaagagtct tttcaataaa gtagaaagtg gtaaagattc 28201 aacatgcaag attgtactta ttgggcttca gaatatccac actagtgaaa gtcttctgag 28261 tacagcaaat gtgtgacatt attttgctac tactccacac tacttagaca tcatgtagtt 28321 cacactggaa aaaggccacg tatgtgcctt gaatgtagcc aaaatgacga acaacaccca 28381 gaaatctgtg atttagcact gagaactagt attatatggt ttttaaaaaa caatggtgaa 28441 gtacatgcca cataaaattt gccatcttaa ctattgtaat gtcttgttta atacttgaag 28501 tacattaaca ttgttgagca aagaatatcc tgaactcttt atcttgtaaa atgaaactct 28561 ataaccacca ttaaaaaaac aactcattcc cacattcttc agtccctggc gaccaccata 28621 tttttaagtc attatgactc tgactattct tggtacttct cagaagaaga atcttagagt 28681 gtttttcatt attttttttc acttaggata atatcctcaa agttcattca ttttttagca 28741 tgtgtcagaa attttaaggc tgagcaatat ttcattgttt gcatttacca catttccttt 28801 atttctgaca tctattcatg gacacttggg ttacttctac ctctggctat tgtgaatatt 28861 gctactacaa gcataagggc acaaatatct ctttgagacc ctgctttcaa ttcttattgg 28921 gtaccccaaa aagtggaact gctggatcac atggtacttc taactttaat tctttgatga 28981 actgagaaac ttttccataa aacttgcacc gttttacatt cctaacattc cacaaaagtt 29041 gtgatttgtc ctcattctaa tccatagaag gcaacatttc taacaggttt ctaggtgatt 29101 ttgttgttgc tgatgtaggg actacatttt gagaactgtc ctatcccatt tttataacaa 29161 tgcagagctt ctctgtaaat accccttgtt tctatgttct atcctgtgag ccttttcatt 29221 ttgttaattt ttcccagtgt ttacaaccag gaaatagctt gtcatctttg atacaaaaca 29281 gaaatctttg caaaatacca attgcattct gtagcccttg gatcattttc tcccccaatc 29341 tatgtagata tttagtggtg tataatgttg tatgagaata catcatggta agaatacatt 29401 ttaaaggact ttaagtcagg tcacaaaata cagcaattta cctgtggacc tgccctctac 29461 ccaggccatc cagtcccaat gactgtgctc ttactgctgc tttccaggtg tgtacctatc 29521 atggggtgtg catgccactg accattcctc tgtgctctta ctgctgcttt ccaggtgtgt 29581 acctatcatg gggtgtgcat gccattgagc attcctctta atgggttcta ggcaacagat 29641 agaatagctt aggtgccaaa gccacagaac ctatcagggg ttgtgtcatt ttgaaaatta 29701 tgccaaattc taaatgatct tgtatgacag cagtccccca acctttttgg caccaggaac 29761 tggtttcatg cgagacagtt tttctgtgga tggcagtgca gggggctgga tggtttcgga 29821 atgaaactct tccacctcag atcatcaggc attagttaga ttcacataag gagtgcacct 29881 aggtccctcg catgtaacag ttcacagtag ggtttgtgct cctatgatgt ctaatgccgc 29941 cgctgatcag gaggtggaga tcaggcagga atgctcgctc ctgctgtgcg gcactgtgtg 30001 gcctagttcc tgacaggcca cggactggta cggtctgtgg tttaggggtt ggggattgct 30061 gttaaatggg atttggggaa ttttatttgg gttctctctt tgactagtct attgtcaaag 30121 ttattccatc taatttttct agagtaagcc aggttctgtg tcatgtaata gagccacata 30181 aacagcaaag gtgctattgt gaactgtgca tacgagggat caagttgtgt tccttatgtg 30241 aatctgagct aaaataagaa aaaaaaaaat ttagcctgtg aggtcagaag caagacagtc 30301 atgttagatt tctgtcatta ttcatagttc tgcaaaggtg gtttcaggtt ctgcatcaac 30361 tgaaacgtgc tcttccatcc agcagcattt aaaaccaaag atgctgtccc ttaattagtt 30421 aactctcaca atccatttta gtaatggttc ctcaggaaac tgactacttc aatctataaa 30481 gtaaaacaag ttcttcacat tatactctct ggtttgaata gttacttggt tttgttcttc 30541 cccacaatga ctaccttctt ggtagccgta ggtctcagag gcattttctg ttgtccctaa 30601 atgattctgc ccttggtggg aggtggcttt taggctggct ggtggctttc gttcagatgg 30661 actgaaatct aagcttgatc tagcatcaag tttggaccca gcactctcta attttttttt 30721 ttttagctgt tatagataac aaaggattga gtaatttgct ttttgcttat tagcttgcat 30781 tttcttatgc atccagtgaa ccagagattg aatattttgc tttttgcata ttagtttgca 30841 ttttcttatg catccagtgt accaactccc tgggagctgg atccatctct aaatttccat 30901 ggtaactctt acctttagta actgagcaca gacagccaca gttccatacc atgtatggcc 30961 atttggccac taaggaatca aaggtttttc atcctccacc catttttaaa ttgaggatta 31021 ttcaaaagtt aaagaaaaaa ctttattatc tcttattaat atatgtaaat tgtgttcaaa 31081 atagaaaatg aacttctact tttttttttt ttttttttga gatggagtct cactctgttg 31141 cccaggccag agtgcaatgg tgcgatcttg gctcacttca acctctgcct cccgggttca 31201 agtggttctc ctgccacagc cccctgagtt gctgggatta caggtgtgca ccaccttgcc 31261 tggctaattt ttgtattttt agtagagacg gggtttcacc atgttggcca gactggtctc 31321 aaactcctga cctcgtgatc cgcccgcctc tgcctccacc tcccaaagtg cttggattag 31381 aggtggagcc accgcaccca gctttttttt tttttttttt ttgagacaga gtcttgctct 31441 gttgcccagg ctggagtgca gtgatgtgat ctcggctcac tgcaacctca gcctcctggg 31501 ttcaagcaat tcccctgcct cagcctcctg agtagctggg actacaggtg cgcacaacca 31561 cgcccggcta attttttata ttttagtaga tatggggttt caccatatta gccatgattg 31621 tctcaatctc ctgaccttgt gatctgcccg ccttagcatc ccaaagtgcc aggattacag 31681 gcatgagcca ctgcgcccag ccgaaattct atttttatat ttgtgtattg tcaatactaa 31741 agctaatttg aataaagtct tataaacaaa tccatccaat tttaatcagg ttttgaccac 31801 acaaggtaag atttttccgt aaacctttta taccttctta caaatttttt tctatttttc 31861 tctttcccca atttttagat ctatttacct ttgtttgaga cagaatctca ctctgtcacc 31921 catactggag tgcagtggtt tgatcttggc tcactgcaac ctccacctcc caggttcaag 31981 tgattctcgt gcctcagccc cccacatagc tgggattaca ggcacacacc accatgtccg 32041 gctaattttt gtattagtag agatgaggtt tctccatgtt ggccaggctg gtttcaaact 32101 cctggcctca agtgattcac ccacctcagc ctcccaaagt tctgaaatta taggcttgag 32161 ccactacgcc cagtcctaga tctatttact tttatctaca ttattttcct ttcattttga 32221 aacgaccttt acataacccc taaactagac agaattcctt tttttttctt tctttctttt 32281 tttttttttt ttttgagaga agtctcactc ttgttcctca ggcttgagtg caatggcccg 32341 atcttggctc actgtaacct ctgcctcccg ggttcaaaca attctcctgc ctctgtctcc 32401 caggtaactg ggattaaggc acctaccatc acgcctggct aatttttgta ttttttttta 32461 gtagagacgg gttttcacca agttggccag gctggtctcg aactcctgac ctcagatgat 32521 ctgcctgcct tggcctccca aagtgctggg attacaggcg tgagccacca cgcccggccc 32581 agaattactt tttctttagc aaaaaccaca tcttcatttt ttaaatataa gcttcttcac 32641 aaaaaacaca tagtaaacat catggcactg ggctgcacaa tactttgata gagcacgttg 32701 atgtaaagac atttttgtaa gtgtttagag catgcctttt atatctaaac atgcaaagaa 32761 atgagtagcc tcctgtcgta ataaccattt actgtaaaca actgccacca gctgcttctg 32821 acactgcagc tcttgcttgt gacagccatt acgcacaaaa acgtcaagtt ctttcacagg 32881 acaaagtaat ctctggtacc ccccaaaacc aatgatatca ggtaatgcac tacaaaagaa 32941 ggcagaattt tagacctgag ataaatctgt cctcttaaaa ctcttgagtg agagagagag 33001 agagagagag agagagaggg agagagattt cctcatctgg ttagtgtaga tagcaggttg 33061 gcttcctgag ctggtcagtg cagtagtggg ttagagttct gttttatatt tggcttggcc 33121 attgttgatt tctatagtca atttcttaca tgtaaacaag gaagatagct ataatattga 33181 gatttcttgt tttcttaact ggtcttaggt tgaaaattga tttttcattc tcctccccgc 33241 tcccccctcc ccccaccatc ccattcttct gcctctgccg gatcttcagc tgggtagcat 33301 tttggttttg tttgtttttt aacctggagg atgtccccaa atatcaaaac ctaaacttct 33361 gattcttaga agtcctccct tcgatgagtt gctagtttta tttctgaaca tgaacttacc 33421 aaaacatccc ccagaaatcc cctggtctca gacataactc tctatgcttc ccattccatg 33481 gcagcaactc agcctaactc atccaaagca aaaaagcaat cccttcttgc ttaagtgaca 33541 caaggatccc cagatgaaca agtagctgta tcacccttca attaaatgta ataatattta 33601 cttctggtgg aacctctcct tccttggaac aaagggcccc caaccagcag aatttacaat 33661 taagaagaag tggccagaca cagtggctca agcctgtaat cccagcactt tgggaggccg 33721 aggcaggcag atcataaggt caggagttcg agaccagcct ggccaacatg gtgaaaccct 33781 atctctacta aagatacaaa aaattagctg tgcgtggtgg tgcgtgcctg taatcacagc 33841 tactcgggag gctgaggcag gagaattgct ttaacccggg aagagggggt tgcagtgggc 33901 tgagatcatg ccattgcact ccagcctggg cgacagggca aaaaaaaaaa ttagccgggc 33961 atggtggcgg gcgcctataa tctcagctat tcaggaggct gaggcagaag aatcgcttga 34021 agccgggagg cagaggttgc agtgagctga gatcgcgcca atgcactcca gcctgggtga 34081 cagagcaaga ttctgtctca aaaaaaaaaa aagaaaaaaa gaaaaaagaa gaagggtagc 34141 agaaattctg tgaatgagtt actaggttca aaaatgagga tgctcaatat aatgcatagg 34201 ttcagaaaaa aattacgatg aaaagaattg tcaaattttg taactgatac ttgacctggt 34261 taaatttgaa ttatttctaa ttccctatgt atatgtgcag tgagataaaa gataaatatt 34321 gtatgtgaca tactcatcat aaaattattt gttgtttatc tgaaatacat tctaaatcag 34381 gggtccccaa cccctggggc cacagatgag tactatggcc tgttaggaat ggggccatat 34441 agcagagggt gagtggcaga tgagtgagca ttgctgcctg atctccgcct cctgtcagat 34501 cggcagcagc attagattct cataggggtg caaaccctac tgtgaactgc acaagagagg 34561 gatctaggtt gtatgctcct tatgagaatc taatgcctct gaaaccatct gcccccaaac 34621 ccctctccct tatccatgga aaaattgtct tccacaaaac cggtccctgg tactaaaaag 34681 cttggggatc acagttctaa tgacataact ctttccctga taatttgctt ctacttaaca 34741 attccttgtt ttatataatt tgcttacaga attcctgcag ttacagtctt gatgccattt 34801 atgttgtgac aaaccttcgt catttgtcat attcaactta tttcattcct ttatctattg 34861 gacattcatt aatgagaaac acactcaaca gtagcattct gagcctaaag actcatgatg 34921 ataaacagtg tagaaaacat tacaaactct attataatta ttctaccatc tccatctgat 34981 tttaacaaaa aagttatttt gtctctttct ttggaactag ttgagaaaat catgcacctt 35041 ccaaagttaa tatatatttg actgtccacc attttcatta gtcaatgggc aattacatag 35101 agtcgaagac cttcaaaggg aattttaaat actcctcagt tcatcataaa ttataaactt 35161 gcattttgac attttttgca ttcaatatac tcaaaatata gacaatagca acttctacaa 35221 gagttgaaaa acattgtggg tacagtggct catgcctgta atcgcagcac tttgggaggc 35281 ctaggcggac agatcacttg aggtcaggag ttcgagacca gtatggccaa catggcgaaa 35341 ccccatctct aatagaaata caaaaaatta gccaggcatg gtgccacgcg cctgtaatcc 35401 cagctacttg ggaggctgag gcatgagaat cacttgaacc tgggagccag aggttgcagt 35461 gagccgagat tgcgccactg cactccagct gggtgacaga gtgaaactgt atcttaaaaa 35521 aaaaaaagaa aagaaagaaa aaagaaaaca ctggagaaaa cgttggattt tacaatgtat 35581 gtctgggtca gttagtctac catttcatat ccaaactcat aactaattac aaacagatgg 35641 aaccattgaa agcttgcaaa cttttaaatt atattcatat aattaaaata taagtaataa 35701 tttatattca attaagaata aatggggcca ggcgtggtgg ctcatgcctg taatcccagc 35761 actttgggag gccgaggcag gcggatcatg aggtcaagag atcgagacca tcctggccaa 35821 catggtgaaa tcccgtctct actaaaaata caaaaaaatt aactgggcgt ggtggcgtgc 35881 atatgtagtt ccagctactt gagagtctga ggcaggagaa tcacttgaac cggagaggcg 35941 gaggttgcag tgagccgaga tcacaccact gcactccagc ccagcgacag agtgagactc 36001 cgtctcaaaa aaaaaaaaaa gaataaatag attttgatat ctcaagggaa aaaaagtgat 36061 attcctattt gtctgcaggg cagagctgta gtggtcaggc agtctcctca aacaaagcca 36121 aaactgctat tatatgagct tttaggaagg ctggccttca gggactgagg actttcagaa 36181 gcagcagagt tgaaagactg atggactttc aggtatgtta gtgtaaatgc aaccatttac 36241 caactggctt tcataaacag gattactgac aggaggcact tgtggattag ttgttgccag 36301 cagcctgggg ctgccactga ttgaccatct ttagtactgg gccttacccc cattggaaaa 36361 attttagaat cttggcctaa caatgatttg ctaagtttct ggttaattac tggggtgaag 36421 cttttgctga ctagctgact tttaaagcat aggtacacat cccaagttat ctctaattaa 36481 tggtggtttc ttgtttatga ttatggattt ggggcagaga taaatttttt ccttacttga 36541 acataagcca gcaacttaat tctcattccc catttttggt ttcccaacag ccagaatttg 36601 taagaaatca tatgctatgg tttggatgtt tgtctcttcc aaagctcatg ttgaaattta 36661 atttccaatg tttgaggtgg ggcctaatag gaggtgttct ggtcatgagg gtagatccta 36721 cctgaatacc cccacttgaa gatgtgtaaa ttctcactct attagttccc aagaaagctg 36781 gttgttgaaa agagcctgat acactcccca cccactgtcc tcctctcttg caatatgatt 36841 ccttcgcatt atagctctct ttcgttttcc accatgaata gaacccgccg aagaccctca 36901 ccagatgcag atgcccaatt ttaaactttc cacacatcaa cattttgagc caaataaaca 36961 tttttttctt tataaattac cagcttcagg tattcattta tagcaacgct aactggacta 37021 agacatcata agtattatct agatatttgt ttgtgaggat atgattttca actcatgttg 37081 ccaatgagta atttaacggc cagttacagt ctgagaaaaa tgtgtctccc actggtgatt 37141 ctttgtaata tattacaacc agctgatatg taggatatgt aggccattct gataagtcaa 37201 tacaggattc aggaaaaatc gttatatata ggtcatattc ttaaactgtg cccaggtctt 37261 aattaataat tagcaatgac acagagggga cactgccatc aaaagaaaag attaatgtta 37321 cagttccctg gaaataagag gcactgaaca ccacacaggc cacaagaaac cttattacat 37381 acagagactg gaaagcagaa agagaaaaca gagtaactgg aagtatgtct ttataactaa 37441 agtggcagag aatgatgagg tagagatacc gtcgatgggc aggaaggaaa aacacatttt 37501 agtgtttgtg gttggggggt tttatatgag tttcatgcac ttgagtaagt ggcttgcaag 37561 aaaaaaagta aacaactttg gttctattag tttgaccctg tgattaatga atatcaaaca 37621 gacaaataca gaatctaaga aaacatagaa cagtcctgta tacacatagc agctgggaaa 37681 cctattccca gtgtgtccca tgcacctatc attgatctga aacccaatac atatgtgctc 37741 attgaacatt tactagagaa gtaaatacac attatcacac agcttctcta ttcattttta 37801 tctgtggcaa gtctttcagt ttatcatctg tacttacccc tctaagcagt taatgagcaa 37861 tgttttcaag tgtgtaaaaa aattacattt tggaaacaaa agcatagtgt gtccaagtgt 37921 ccagcagagc cacaggttca acttcattct ctgacttctc agaatttgct gtctctcact 37981 caaacattga gatcttatct ttttctcaca ttacagatcg atttctctag ggcttctggt 38041 tttttttctt ttggaataac ctacataatt actaatagat ttcggtaatt tttttatttc 38101 acaagtgttt ttcacatttc gactccctgt gcaaatacag ttgtaaacac aggtgaaatt 38161 ttaaatcact tctgaggaag aagataacat catggaattc cacacctttt ttgaggataa 38221 gatgtgcata tgcttcaact atgaaaacgg tagacaatgt tgtccagaaa tggcatgtat 38281 cagctgctct tgagaaataa aaataaaatt ctaagcaccc cctaactgac tgaatggatg 38341 ccctcttgac catgagaacc ctagaataac tttggaagct gaattcacca ctatagagga 38401 acgggaaatc agacacacct cattatatcc cctaccacac tacaatcatt aggttttctt 38461 ccctaagggc taaatagaaa tcaacctttt cagaagacta ctagcttatc ttccaaggta 38521 cagaacagag agaagatgag attattcatt ccttcatcct tctctgagac atcttcttta 38581 ttcccttttc ccctcacgtg tttactctat cttatgtaaa atgtagattt actgggcact 38641 aactaaagtt tcacatgtct gtaatcattt gtctcactgc caacccctct tcctttttaa 38701 ggaaaatgta taataaatac taaacctcct aagaacctct ttggaaaaac cagccacaga 38761 tgcttctgtg acttacattt ttctggatgt gcccttaggc tggtccagta atcctccatg 38821 atttgtgact tatgcctcaa tcactcattt tggtcgtcac tctcattttc agataggatt 38881 gtgattctta aaataaatgg aaaatatctc agatatgaag tagtaagtat tattagaata 38941 aaagcgttgc acccttcgta ggaacacaat cgctgactct acagtcaggt aagatgtatt 39001 ctctttgatc cagaggcata gcttcaccct ttcaacctac ttcagcgtct cctgtcatcc 39061 ttcctacaaa ttccagtcca tagccctaac tgcgtagtct ttctttcttc ttctggacca 39121 aacatattga agcttactaa acagacgcca aagcgtaact gtatgtgtat ttgaaaagca 39181 ataatatggc tggcaggcat gagtagaggc agactgcctt tatggtatgg ggtgtctctt 39241 tggacctgta gttattggga attagctgtg cttaatagag tcttaagtaa taagttgcta 39301 acatttctgt aaatatccta tactacaatc caaaaaaata ccgtatgttg cctttaagag 39361 caagccaagc cccctcctgt ctttggagtc ggggactccg tttcccagct cccctgacag 39421 tgccactggg cctttatctg ccatataagg actgatcaac gctctaagac aagtccggaa 39481 gcggtgtcga aacttcataa cccagaagac actgcagctc tcggcggcac gacttaccca 39541 ataaaggctt agaaggggac gtttgcgtgc gtgcgcaccg tcggagggcg ggacttccgc 39601 cgtcctcctg gtggtggtcg ttttggttct gtgtggtgtt tcaccaactt cggcctatgg 39661 ctctgtctga cgtcaccgaa gtgacggaac ggaaaagcgc gagaagcggc tcggttccca 39721 ccacggagag gcgggagtga gtcaactgac aagcgctggg gacagtggcg tccttgtctt 39781 gcctttgtcg ctcccgcccc gctcttccct ggctgggctg gcggaggcct tgctgatgaa 39841 cctgactgag gtgggtgtcc cgtcccaggc tcccccgccc gaccggtcct cccagtgctg 39901 aagccccctg aaggggccct gcaggtcagg ccccttgtcc cgaagagagg ggcgttcctg 39961 tgtggggtcc ccgtatcagc cgatttgatg cagcctcaga gctcccgtta gggacctcag 40021 gttcagagca tggaggcgct gccgagtggg ctgtgttggg gcgtcagggg tgtgtgttgt 40081 ttctgcggga aagagagggt tttgtgaatt ctcgcttgga atggcaacct gagcagccag 40141 taaccatgga aggttgtcaa gggagggaca gttggagggg gcaggtccag agttcgaagt 40201 ttcaagttag gaaaaccagg ttaggaatgg acacagggag acgcgaaaag gccctgaggc 40261 cactgcagta gacccaggag aattatggtt ggggctggac cagtgacagt agagggagat 40321 gtgatgagat tcttcatagg cctgaaagat agggcccgta ggttttcctg gtgcttcagt 40381 tatgtctgaa caaaggcagt gaagaaccgt ttcaagtgtt tgttttgttt tttgttttct 40441 gtttgaacgt ttggaagaat ggagttgggg aagtcagcat aaagagcaga tttccgggag 40501 aagatcacag gttggatttt ggacatggtg attctcaaat gtctcaagat ggatatgtca 40561 cacagacagg tggacataaa ggtgtggagt ctgaggaagg ggattgggtt ggcgatacca 40621 gacggatgga gggcttggac tgggccagag aatggtctga ggatcagatt gagtaggtcg 40681 gtctcgggtt ttttttgttt gtttgtttgc ataaatttaa ggagtcaagt gcagttttgt 40741 tatatggata tatttgcata gtggtcaagt ctgggctttt aatgtatcca tcgcctgaat 40801 aatgtacatt gtatccatta agtaattttt catcactcat cttcgtctca tccttctacc 40861 tgtttggagt ctccaatgtt tgtcattcca cactctttgt ccatgtgtac acattattca 40921 gcttccactt gtaagtgaga acatttggta tttgactttc tgtttctgag ttgtttcatt 40981 taataatggc ctccagttcc atccacattg ctgtaaaaga catagtttca tgcttttttt 41041 tttttttgtg gcctagtggt attaccttgt gtatatatat gccacatttt ctttacccag 41101 tcatacattg atggataatt ggttaattcc ctatctttgc tattgtgaat agtgctgtga 41161 taaaaataca tgtttttttt aatgtaatga tttattttcc tttgggtaga tgcccagtag 41221 tggaattggt ggatttaatg gcagttctat tttcagttat ttgaaaaatc ttcatactgt 41281 tttccattga ggttgtactt atttacattc tcaccaacag tgtgtaacct ttttctcttc 41341 tccatatcct gggcaacatc agttgttttt tgacatttaa atgatagcca ttctgactgg 41401 tgtaaggaga tactgtggtt ttaatttgaa tttgtctgat gattaatgat gttgagcatt 41461 tttttatatg cctgcttgcg atttatgtgt cttctttttg aaaaatgtct gccctgtgct 41521 tactgttttt tttttttttt tttttttgaa atggagtctc gctccgtcgc ccaggctgga 41581 gtgcagtggc gcgatctcgg ctcactgcaa cctccacctc ccaggttagt tcaagcaatt 41641 ctgctgcctc agcctcccaa gtagctggga ttacaggtgc ccaccaccac gcccagctaa 41701 ttttttgtat ttttagtaga gacggggttt caccatgttg gccaggctgg tcttgaactc 41761 ctgacctcag gtgatccacc tgcctcggcc tcccaaagtg ctgggattac aggtgtgagc 41821 caccgcgccc agccctgtgc ttactgttaa tgagattttt tgttgttgtt tttttgttat 41881 gttttgtttg tttgttgttg agttctttgt aaattctgga tattagtacc ctgtcagatg 41941 catagtttgc aaatatttca tcccattttt caggttgtac atttacttgg ttgattagtc 42001 cttttgctat acagaagctg tttagtttaa ttaggtcctg tctacttttg tttatgttgt 42061 ttatgctttt gaagtcttat tcatgaattt ttttgcctag accaatgtcc agaaggggtt 42121 tccctgagtt ttcctttagt attcttcgtt caggtcttac atttaaatct ttaattcatc 42181 ttgagttgat ttttgtatat ggcaagagaa attggtccag attcattctt tttcatatgg 42241 caatccagtt ttcccagcac attttattga aaagtgtgtc ctttccgtag tgtgcatatt 42301 tgttgccttt ctcaaagatc agttggctat agatatgtgg ctttatttct aggaactttg 42361 ttctgttcca ttgatctatg tgtctatttc tgcagtcaaa tgctttacca ctgagctgta 42421 caccttatgt gtctatttct atccattacc atgctgtttt ggttactaac aacttgtagt 42481 ataatttgaa gtcagatgga cttaatttga gggaaatctt gaggttgtca tttcaggtgg 42541 gctctgtgtt gagagaacag ggggctgaga cttgggggcc agaccattgg tggctccaca 42601 gacatcatac tctgtccaga acggtgcaga cctggcataa cagggattct ggattgcatc 42661 tgtaaaaagg gaggcctgga ctgggtaagc atatggcctg ggatggaggc tgaagtgtga 42721 ggtggtttcg atcctgcacc tgcaggcttg gttgaccctc agggccactc tctgttgaac 42781 aagagaaatg ttattgaggc tggtggagta gctgggttcc agatcagagc actgcttttt 42841 ccccttctct acatgtcagg gctacagggg atgatctact gctccagcca aggtctcctg 42901 gcaagctcac tttagaagat tatgtagcag gtagacctgg ttgcagaaag gaccagtacc 42961 tgacctgcct actcttccct tctctataac tcattaactt aataatgtat gaaacactta 43021 ttgatttaaa ccccaattag acagtatctt tctaagtgtg tgcaaacaga taggcagtta 43081 atagtgaagt cattatgtta aatttctaaa agtgagccgg gtgtggtggc tcacacctgt 43141 aatcccagca ctttgggagg ccgaggtggg tggatcacct gtggtcggga gttcgagacc 43201 agcctgacca acatggagaa accctgtctc tactaaaaat acaaaattag ccgggtgtgg 43261 tggcgcatgc ctgtaatccc agcttctcag gaggctgagg caggcgaatt gcttgaaaac 43321 aggaggcgga ggttgcagtg agccaagacc acgccattgc actccagcct gggcaagaag 43381 agcaaaactc tgtctcacaa tagaggagag gagaggggag gggaggggag ggcagaggag 43441 gggagggggg agggcagggg aggggagggg agaaagtggt tcataattgg tttgatgtta 43501 atttatctcg tttgctttcc ttggaaaaag ttaatatcgc atcctctaaa atcttaagct 43561 aaaaacatac atgaaaattt caaaacccca ctctttcatt tgcattgaac tcatgtacaa 43621 aaccattaat cttggaccat agccgctgat atatatagta catgctactg atgtttgata 43681 ctaatatgag ccctgtgagg gggcagctat tcttgcaatc tatttacaga tggagacact 43741 gaagctcagg tgggtttggt tcttttccat ggtcacagag ctgcagcact gaggcctgag 43801 gaactttatt caatgaattg agatcagttt gctcccaggc agggcagacg aggtggtggt 43861 ccatctcagc ctgaaggatc cttccatggc tgtcaatttc tcacggcctt tctcttccct 43921 cagggtcccc tggcgatggc agaaatggac cctacacagg tgagtagagt gtttcctact 43981 attcacccac cctggttatc tcccagatgg ttttggcagg tgacaaaacc tcttttcctg 44041 tctttctctt atgtctgaag ggtgagtggt ttcaccagtt agtttcagta ttacaagttg 44101 gctgctgggt tacatagact ggtccccgtt tttgagaatt gatctgataa gatatgtttg 44161 ggcctcttgg gcacaggatt cagaatgttt caatgtttgg aggagagact gagctaggtt 44221 tttagggaac tgcaaatgtc tgttacatct ggtaggaata cggagcagaa tgggaggtgg 44281 ctagaggcac acaaacatca ggcctggaat gacagcctag gtcctgggtc tgtaatctaa 44341 gggaagtggg agacggggaa gatttgaagc aaaataggta gtgataggac cccagtttaa 44401 gagactttct ttggcttctg agtggaggac agactgtgga ggtgagggta atggaaggga 44461 gatctgagag gatgttcctg cagcagtcta ggtggatgtt ggtggtggct tgtcaagagt 44521 ggtcatagct tcagttttaa ctagagctgc ccacatttta tgatacagag agtgaggcag 44581 ctggggaggt gggatagaag gggaatcagg agtggcccca tggtttttgt ttggaagcat 44641 agatgctcca tcagctaaga tgggagaagt cagcagtatt aggaattttc agagtgagtg 44701 tgaggagtca gtgtttgttc aagccacaga tgtctcagag actcctgatg gaggagttca 44761 ggttggagat gtccacacta tggtggctgt cgtatggacc aggatgaagc catggattca 44821 gtttaggtga cagcatctct aggttatggt ggcagtgctg ttcgaagaca gagaattgga 44881 atggggcatg agaactgggt ctcttactgg gtagctgtgc agtggtggag gagtcactag 44941 acaaatgagg gaatggagtg tttgtgggga tgagtgtatg tgagatggtg gggtatagga 45001 gtgagaaaga cttctgaagg agagtggaca tgatggggtt gctgaggggg gataataggg 45061 tgaggtcgga gagttgctaa ctatcagttg ggaggaggtg aggatgggag ttgtattaat 45121 ccgtttccat actgctgtaa agaactacct gagactgggt aatttatgaa gaaaagaggt 45181 ttaaccaact cacagttctt caggcttaac aggaagcata actgggaggc ctcaggaaac 45241 acaatcctgg cggaaggtga aggggaagcg aggcacatct tcccatgatg gagcaggaga 45301 gagggagaga atgaaggggg aagagccaaa cacttttaaa caaccagatc tcgtgagaac 45361 ttactattat gagaaaggca aggggaaaat atgccctcat gatccagtca catcccacca 45421 ggtccctccc ctgacacgtg gggattacaa tttgacatga gatttgggtg gggacacaga 45481 gccaaaccat atcaggagtg gaaatatgga ggtcagtgca gctgggcttg gaatgaggtt 45541 ttggtggaga gcaattttcc tgactgttta tgggacagag tgtaccatga ggcaggacac 45601 agctgagagt atgggaagtc ctcacatggt ttgggcagct gacagtgaag tgagaaacag 45661 ggccgtgtag ggaaccttca acccagggct tagggctgcc cccggtgcta ggggactttg 45721 gtgttctggg gcaggactgg gaatgaattg gatgctgagc gtatttgccg tgcaccacct 45781 ggtttccgtg tgcagagtgg cctgttagga ggtgaaggaa atgcctgtgg tgtgggctgg 45841 gagttgtctg tggaattgac tggagtggag gtgtgagaga tgggattgga gtgctgtcca 45901 aagagtgatg tgcaggcagg aggagtgtga atggagggcc tcaggccctg gccctggtag 45961 ctcaagggag gaggatgagt ctgagtttgg ttgtcttggg aggcaagatc tgggtgactc 46021 catggcaact ggctggcagg agaggatgga tccccccacg ggaggcccac attccttttc 46081 taccttatct tccatctgtt tcctctgtgt gctggacacg gtggccaatg tcaatatgga 46141 gagaaaagtc actcatgcca cctggatgtg gtttctcttc cacattggga gggtgtggaa 46201 gagattgagc aaggaatgga ggcgtagggg acagcgatga gatcctggag agagaaatgt 46261 agcagtcatt ttacctattg ggttttaatt tatggatgtg agtgattgat gctggggaca 46321 ggccaatgac tgaaagaaga tatttattcc ttgcatttcc ccaagggggg tacataccac 46381 ataacacagg gccacatggt gaggcaccag atttggtcag gaggaaaatt ggagtgaggg 46441 gaaagtttag tttagtccac ggacgctatt ggggtttcca agggaaagca atgcagggct 46501 gggcgtcagg atagtttggc tagtttaaat aattccagga cacctgggct attgggactg 46561 tccctagttt tgcagtacct ggccctgggt tgatttagag tacgggaaat attggcttgg 46621 tttgagaata tgggcccagg gtggtagggg agatggaaac acatttggct gttagtttgg 46681 ccctgtgttt aatgggtggt aaatataagt agaaaataga gaacttaagt aaatacagct 46741 tgaaaagaag ctgcttatgt attcatcttt cccaatttgg cagggccgtg tggtctttga 46801 ggacgtggcc atatatttct cccaggagga gtgggggcac cttgatgagg ctcagagatt 46861 gctgtaccgt gatgtgatgc tggagaattt ggcccttttg tcctcactag gtaaggccct 46921 cacacttgcc cagtgtcctg ggttaggctg tgttacccct ttttacccaa aggcagctct 46981 gcgtttccca cagtgagacc gtgggtgctg cttcttttcc ctgtttcttg gcatatatgc 47041 tgtgggaggc agggctgggc cgtgtgtttt ataccccctt tttcctagca tccccatccc 47101 tgctgctctg aggcttacaa gaaagggctc aagagccaga aatgttgaat ttggcataga 47161 gaccccacca gccttgtctg tccctggtca ggttactctg tccaaacaca tgagacctgt 47221 gttctgttgt cccctttctc tcactgatat ttcttggtgc cttcctgtgc ttggaatttc 47281 aggcactgcc ctggtcactg attttgggga ctgctgagtt gactatgcag gacgcaggac 47341 tcttctgtga agttcttacg attatagaat gtgggatgcc atgggcttga tctctctggg 47401 catgtgctct ttactctttt tctctttctt tcccctagag agggccccag gcacctgaga 47461 gggtggatat tacttcaaca atggcattag aggctcaggg acatggccct ggtgcctggg 47521 aattgggaaa agggttgatg tcacagctgg ggtcaaatca ccctgggaac ctgtcctgtg 47581 ttcaccattg tttgttgcca gggccactgg ccacagattt ctaccttttc ttccccatta 47641 tcatttctag tatgactcct ggcccctgca cctccaatgc cccacagtct gtacttctgg 47701 ggccaccacc cttcaaccat tcttttccac tgctcccttt gaatgcctgc caacacgtgg 47761 cctgtattta acatggtgtg aggtctgcat gtatccctga acacaagctc ctctgtcaca 47821 gtctgatctc tctgggtttg gagagaacct tctacacact gcttgccagt ctccagcaac 47881 catggcctgt tgatggctgt tttcagagga gtgagttgga gaccctgcct tctctctctg 47941 taccataccc ctcccttgta tgcttaccat catcagggct ctcccattat ggggccttgc 48001 caggcccagc cctgcatagt ggaccctcct ctcactggcc ttaatgtcca tgatgtcccc 48061 aatcccatgg ccttatttgt gtgtgtgcct gacacacatt tgtgatggag ctgcttcctc 48121 ccaccagagt caacatgcac ttctccagca tttctgttct gacaggttct tggcatggag 48181 ctgaggatga ggaggcacct tcacagcaag gtttttctgt aggagtgtca gaggttacag 48241 cttcaaagcc ctgtctgtcc agccagaagg tccaccctag tgagacatgt ggcccaccct 48301 tgaaagacat tctgtgcctg gttgagcaca atggaattca tcctgagcaa cacatatata 48361 tttgtgaggc agagcttttt cagcacccaa agcagcaaat tggagaaaat ctttccagag 48421 gggatgattg gataccttca tttgggaaga accacagagt tcacatggca gaggagatct 48481 tcacatgcat ggagggctgg aaggacttac cagccacctc atgccttctc cagcaccagg 48541 gccctcaaag cgagtggaag ccatacaggg acacagagga cagagaagcc tttcagactg 48601 gacaaaatga ttacaaatgt agtgaatgtg ggaaaacctt cacctgcagc tattcatttg 48661 ttgagcacca gaaaatccac acaggagaaa ggtcttatga atgtaacaaa tgtgggaaat 48721 tctttaagta cagtgccaat ttcatgaaac atcagacagt tcacactagt gaaaggactt 48781 atgagtgcag agaatgtgga aaatccttta tgtacaacta ccgactcatg agacataagc 48841 gagttcacac tggagaaagg ccttatgagt gcaacacatg tgggaaattc tttcggtaca 48901 gctccacatt tgttagacat cagagagttc acaccggaga aaggccgtat gagtgcaggg 48961 aatgtgggaa attctttatg gacagctcca cactcattaa acatcagaga gttcacaccg 49021 gagaaagacc ttataagtgc aatgattgtg ggaaattttt taggtatatc tccacactca 49081 ttagacatca gagaattcac actggagaaa ggccttatga gtgcagtgta tgtggggaat 49141 tgtttaggta caactccagc cttgttaaac attggagaaa tcacactgga gaaaggcctt 49201 ataaatgcag tgaatgtggg aaatcattta ggtaccactg caggctcatt agacaccaga 49261 gagtccacac gggagaaagg ccttatgagt gcagcgaatg cgggaaattc tttcgttaca 49321 actccaacct cattaaacat tggagaaatc acactggaga aaggccttac gagtgcagag 49381 agtgtgggaa agcctttagc cacaagcata tacttgttga gcaccagaaa atccacagtg 49441 gagaaagacc ttatgagtgc agcgaatgcc agaaggcctt tattagaaag tctcacctgg 49501 ttcatcacca gaaaatccac agtgaagaga ggcttgtgtg ctccatgaat gtggggaatt 49561 ctttagctaa aactccaacc tcattaaaca tcagagattt cacaatggag aaagtttacc 49621 attgactatt gtaattgggt agtaatgtta tataaattcc acatttttat gcaactaatc 49681 tccagaacat ttttcctctt accaagaagt aaaatgctgt acccattaac aacaactcat 49741 tccccttccc tacttcccca gaaatgtctc aactatattt ctatactcta tggtacttat 49801 atgaggtacc aatagatatc tatgaatttg atatatattt gtacctcata taagtggatt 49861 ctacagtatt tatcttttga gactggctta tttcacttag gataaggtct tcacggttca 49921 cccatgttgt ataatgtgtc agaatatcct tcctttttag gtgaaataat attctatggt 49981 atttatatac cacatttatt tatccattca tctgttagtg gatacttggg ctacttccac 50041 cttttgccta ttgaaataat gctgctatga agatgagtgt acaagtgtct attcaagatt 50101 ctactttcaa ttcttatagg gtatatactc agaaatggtg gtgctggatc atataggatt 50161 tctatttttt ttttttgttt gtttttgaga cagagtcttg ctctgtcacc caggctggag 50221 tgcagtgctg tgatcttggc tcactgcaag ctccgcctcc caggttcatg ccattctcct 50281 gcctcaccct cccgagtagc tgggactaca ggtgcctgcc accacgcctg gctaattttt 50341 ttgtattttt agtagagacg gggtttcacc gtgttagcca ggatggtcct gatctcctga 50401 ccttgtgatc tgcctgcctt ggcctctcaa agtgctggga ttacgggcgt gagccaccgc 50461 gcctggccag gatttctatt tttaatattt ttgggaaaat ttttccatag tacctgtgcc 50521 attttacatt cccaccagca gtgcacaagg attgcaatct atatacatcc tcaccaacat 50581 tgttcatttt ctatttctgt ttttggggtt ttttgtagtg ccttttgttt tggatagcag 50641 ctatcttgtt ggatgtgagg tggaatctat agtgtctttc atttttattt tgtgaatgat 50701 tgatgatgtt gaggatcttt tcatgtgctt gttaggcatt tgtgtatctg gaaaaatatt 50761 caagtctttt ttttccattt ttaatgggac tatttgcttt ttgttgttga gttgtagttc 50821 tttatacatt ctggatatta actccttacc aaatatatgc tttttacata ttacctccca 50881 gtccataggt tgctttttcg ctctgttgat tgtgtccttt gatgaaattt taagttttga 50941 tgtactgttg actctttctg tctgtgggtt ctgtattcat ggatcgaagc aaccatggat 51001 caaaagtatt tggagcatcc atggattgca gtgatcatta atcaaaaata tttggaaaac 51061 aaaaagggta gttgcatctg tactaaacat gaacagacat tttttcttgt cattattccc 51121 taaactatat agtataataa atatttacat agcatttaca ttgtattaga agttataaat 51181 aacctaatga taatctatat aggaagatgt gtgtaggtta tattcaaaca ctatgccttt 51241 ttatgtgagg gacctcttga gcatcagatg attttggtat ccacaagggg tcctggaatc 51301 agtcccccac agacaccaag ggatgactgt agtgcatttt atctattttt acttctgtta 51361 cctgggcttt tgatgttata tattaaaaaa aattagtatc aaatccaatg ccaagcattt 51421 tccctatgct ttattctaag aattttatat ttgaaggtct tacatttagg tctttttttt 51481 ttttttcttt tggaggcaga gtcttgctct gtcacccagc ctggagtgca gtagtggaat 51541 ctcagctcac tacaacctcc gcctcctggg ttcgagccat caccccacct cagcctccca 51601 agtagcttgg attacaagtg tacaccacca cacctggcta atttttgtat ttttagtaga 51661 gatggggttt tgccatgttg gccaggctgg tcttaaactt ctggccttaa gtgatccccc 51721 tgcctcggcc tcccaaattg ctgagattac aggcaggagt tgtaatgcac tgtgcctggc 51781 tacatttaga tctttaatct acttggggtt catttttgca tatggtttaa ggcaaaagtc 51841 cactttatgt ggctatccag ttttccaagc accatttttt gaaaagagca tctttcctct 51901 gttgagtagt cttggcacac ttgtcaaaat catttgtcca tatatgccat ggtttatatg 51961 tggattctct attttattgg tcatatgtct gtctttatgt cagtaccaca cattttaggt 52021 gtgtgtgtgt gagactcagt gttgaggaca aggctagtgg gctttcacac tccagactgc 52081 tgtattccag cccaaattac tcaaattagc caatccatgg ggaacatgga aaacgtagct 52141 aatgcaatcc gcttgcctta cctaagttgt cccctgcagc ctcaggttgc tgttactgtg 52201 tttcagatgc aaccctctgt gggaccctac ccaagttctc tcattcttag ctataggtaa 52261 taaattgttc tgattttgtg tatccaagtg acattgggtt gtttcttgct atcagaagaa 52321 cccagaaaag tattatgaat ctagtgaatg ttggaaaatc tttagccatt agcataacct 52381 catttcgtgc cagcatgttc accctagaga aaagtagaag tgaaggcaat gtcatctttc 52441 cttgttaaca tgataactca gtagagcaat gctttggaga ttagcttttt agggagagag 52501 ccagcagttg agcctcctgc atctgaacat ccacactagg gatattgtct gtactgccag 52561 atatatggga agattttgtg aactgtgttg cagttttcaa cttgactggg gcctttccca 52621 gagttatgcc cctgccagtg cctatttaaa aatgtcatct cttttctacc aactggcaaa 52681 gagccatggt gtgtagcatt ttagtcacat taaaatgcag ttatggcagc atgctgtgtt 52741 ctctatctga agaattcact agtcacttga acactttgga gtcctcaccc ccctcctatg 52801 attgattagg gcatagacat cagccttgga cagaagacat ggactggctt tggcaggcag 52861 attgcagaaa tttcctttct ccaggggaaa gtattggtta tctcattgat agtggtagga 52921 ggcagataca ttgctaggca gactaaggac gggtccctgg tgaaacccaa acttcaagcc 52981 aacgacagtt taaagcctga aaattgagct gccagttcca agtagagtcc atgactggag 53041 tgagaacttc ctcaatgcct tttagccaat caaatggtgc tttttccagg cccacccatg 53101 gaccaatcag tatgcagtct ccattctgag cccataaaaa ccctgggccc agctacacat 53161 tgggctaccc actttcaggt cccctcttgt tgagagcttt tctgccactc aataaagttc 53221 cctgccttgc tcactctctg gtgtccacat aacatcattc ttcttggtca tgggacaaga 53281 actcggaaac tgccaaatgg cgggtgtgaa aggagctgta acactgtagc cctcctgcct 53341 tccaccagcc ccaggcagcc accccatgtg acaggaagca gcggcagcag ggccaggcca 53401 gcccatgagc catgggctgg agcagggtgg caggaccaaa caagctgtga cacaccccca 53461 ttcactgaag tgtgtggatg gtgggaacag acaagctgta acacaaatga gctgtaatgc 53521 ttccttgggg ctcagacctt gggattccct gagcaaaagc tgtaacaccc cttggggctc 53581 tgtggttgct ggcgtctctg agtttttggg cgctgccatg tcccccttgt ccagatacca 53641 gcttccaagg cagaagccgg tcgcagcacg cctggaccag ctgtaggcca agcacagagc 53701 catggtgggc acaggatccg gctggtaaca tgagccaagc acagcctgtc ggactgagtt 53761 agttgagtga gtccagcagg ccaagtgatg tctgggcaga agtgcttcag ccatggaggt 53821 ttctgcctgg tgaagtggca ctgaaagtat cttgtgtcat catgacactt gggatggaat 53881 tttccaacct gccagtcacc cacactgtga actccttctc acccctaatg cacacacata 53941 ccctggttgg ttttgtgata ataaaggtca cattgtttaa gctaccttaa ctcttggaaa 54001 tctagtcacc gtatgcagtg ggttttgaac agaattggtc tctgctagaa taaaagcatg 54061 aatggttttt tgtgggacct ttactttgtg agctccagag ggactagtag gaagcaaaag 54121 atcagctcgt atgcagattt gggccaattt gcattgccat gagaagcctc ctgggaaagt 54181 ctgaaggact tctgcacaaa atttcaagcc ctgagtacaa agattatttg tattcagaaa 54241 agtacaattt gaggagaaaa cagttgcctt gatgtttaag gcattgggca acagtatgca 54301 ttgggtatcc ctacccatat ttcccatatt tcccctgcat tgatgaggga tagctgtttc 54361 ttgtgacctg ctggaagcta tgggtcctga agtaaaacac tatctaggta tatgttcctg 54421 gctgttgacc tttagggcta gatggaagcc atattctttt tttttttttt ttttcttgag 54481 acgtagtctc gctctgttgc ccaggctgga gtgcagtggg atgatctcag ctcactgcaa 54541 cctcgcttcc tgggttcaaa caattatcct gcctcagcct tccgagtagc tgggactata 54601 ggtgcacgcc accacacccg gctaattttt gtatttttat tagagatggg gtttcaccat 54661 attatattgg ccaggctagt ctcaaactcc tgacctcgtg atccgcccac ctcagcctcc 54721 caaagtgcca ggattacagg agtgagccac tgcacctggc cagaagccat attctataat 54781 aaatagtggt tggattaata ggacatggga gagactgcag tagagtggat gagtcccctc 54841 taaaggagca ctcacaaatg ccctggtagc tatggctgtg tggggtgggg tattacagga 54901 attccaaaga cctaaggagc tgtgtacctc ctgtagccaa ggtcaatgtc agaacaacta 54961 aatcaatgat tttatgaatt cctgtagcct ccctggacta taaatttcaa accatagttg 55021 catcatctac atagtgatgg gccttgagcc ttccctaaag ataaaaagcc tacatagccc 55081 gtgctgcccc atcacctgtg tgtcggaaca tcttccttgt ttcaagctac atgagtgctt 55141 ttttattctg ttgaagtgtg tcatgtcatg cctggtgaat taataaatct gtcctctgca 55201 actgacaggg ttcttccact gtgaaacaag agggttgtat ataggttgcg tttaactaac 55261 agagttaatt aaaccttttt actttaaaaa ttactcagtc ttgggccagg cgcggtggct 55321 cacgcctgta atcccagcac tttgggaggc cgaggcaggc ggatcacgat gtcaggagat 55381 cgagaccatc ctggctaaca cagtgaaacc ccgtctctac taaaaataca aaaattagct 55441 gggcgtggtg gcaggcacct gtaatcccag ctactcagga ggctgaggca ggagaatcac 55501 ttgagcccag gagttggaga ttgtggcgag ccgagattgc gccattgcac tacagcctgg 55561 gcaacaagag tgaaactcca tctttttttt tttttttttt ttttttgaca cagagtcttg 55621 cactgtcacc caggctggag tgcagtggtg tgatctcagc tcactgcaag ctctgcctcc 55681 caggttcaca ccattctcct gcctcagcct cccgagtagc tgggactaca ggtgtccgcc 55741 acgacgccca gctaagtttt tgtattttta gtagagacgg ggtttcaccg tgttaaccag 55801 gatggtctcg atctcctgac ctcatgatct gcccgcctcg gcctcccaaa gtgctgggat 55861 tacaggcatg aaccactgtg cctggccact ccatcttaaa caaatttaaa aaatagttta 55921 tctctctatt ttttaattta ctgttgtttg tgggggtttt gaagatgtat caatgatttt 55981 ggttgagtag ggcactttag ctttacttct gagtgcatgc agcagtaaag tttttttttt 56041 atgatttcct tggctttaaa cagttcaagt ggttttctca agtgtgttag ggtgcacact 56101 gttagttatt agtggaggtt ttggtaaagt tgtgctggga acagaatgcc agatgagctt 56161 gtcttcaggc ttcagcagta ctggtggtga gctatgtatg tttatccttg tactttgcgg 56221 tgctgaatgt tggtacctgt gttggcagtt ctaggcaggc cgattcttgg gcctctggtt 56281 ggctttctta gatgctggtt gtgatagcag tgtaccaagc atgtgagtgc actcttgagc 56341 ccctgggctg ctggtgtggc atccgtgatg gtggtggcag tggtgcgaaa ctcttctggg 56401 tcgcacttgc tgtgctcatt agcagtggtt gcagcgggct ctgtgggcca gccactagac 56461 cagcagatgg cgcttgtagg caggagatgg ctgaagtggg tgcagtaggg tatttaggcc 56521 taacctcagc gccctaggag tgctcaggtg tttcacttgg tggactaggt tgtgcaatct 56581 ctgggggatt ttaaaagttt attttgcaat tgagaatggg atgtttagaa tttttgtctc 56641 ctactgatta aaggtagagt atggggaaaa aaacaggcaa aactaggagt ttaaagaaaa 56701 ggagagagaa gaaaaagaaa tgaagataca ataatacaga aagaacagag ggaaaaactt 56761 tatttatatt taatacctat gatgtgacat gcccggagtt aggagcattc ataagcacca 56821 tctcatatta tcctccaact aagctactca tgtcagaaaa aattaaaatt aacaaaggtc 56881 aacagggagt aagaaataga aaatggaaag gaaacaattt tggagtgaga cctacaaaat 56941 gatctcagaa tttaccctct gataagtctt tctttgatcc cagagtgcac aaaatgttac 57001 atgaaaaaac aaattaatat aattacatct aaatttcaag ctttctattt cataaagatt 57061 atgagtcaaa actgagtgag ttcagggact cagagaagac atttaaaata tctaaatgtg 57121 cccagtttta cggttttagg catgcgagtt ttagattttt ttgctttttt attgagacag 57181 ggtctctttc acctaggatg gagtgtagtg gcgtgatctt cactcacagt agcctccacc 57241 tctcaggttc aagcaatcct cccattcagc ctccccagta gctgggacta caagtgcgca 57301 ctaccatgct tggctaattt ttaacttttt tggtagagac ggggatctca ctatactgcc 57361 caggctggtc tcgaactcct agactcaaat gatccttcca cctccacctt tcaaagtact 57421 gggattacag gcatgagcca ctgtgcccag ccaaagtttc agattcttct taaaggaata 57481 aagtcagaga tcttagaaat gaactaaagc acgagcatac aataaacaag agagaaaaat 57541 cagatggtta tcaagaatat gaatggatat tcaaactcgg tgtcagagaa atgcaaatta 57601 gatattattg catgttactt taagttctat aaattactat actatggaaa gaaatatggg 57661 gtgatgtagc tgcaatcatg tatatttcat ggaagagtca aagactgtag cagtttcata 57721 aagcaatctg cacaggtact gtaaaaaaaa tgattttcta gaatatcatg tgacacagaa 57781 agcctgtttc tggtatatct ctcagtcaag tccataaatt cttatgtgag agaagttttc 57841 tcacaatact gttcaaggca gcagggacat ggagacaata tgttgtagca ttaagtgtaa 57901 gaggaggacc aaagacatct atgagtgaag tagaagactg aatgtccaca ctaaagtaca 57961 acagggcagt tactagcatt taagtagact ttgacacagg agtatatata catattaaac 58021 acacattttg agtaccaaca aagggagaaa gaagtctata gtacagtaca attaaactac 58081 acaaaaagta acagtgctcc tctttctaaa acacatgtaa atacaaaaac acacaacaga 58141 cctttagaat tgttgtctac aagtggtggg gtgatgaagt gtgaggaaca ggaatgagtg 58201 gaaacacgtc ggaacattga acaagacgag gtccttgtgg ggacataagg agagaatgtc 58261 cactcactgt ccagagcctc ctgggccatt gtcaccatac agccgctcat tacatgaaag 58321 tggaatttca aaaggtagcg aaacacctaa taaaatatgt tgaaagggct gggcatggtg 58381 gctcatgcct gtaatcccag cactttggga ggccaagccg ggtggatcat gaggtcagga 58441 gttcaagacc agcctggcca acatggtgaa accctgtctc tactaaaaat acaaaaatta 58501 gccaggtgtg gtggtgcaca cctgtagtcc cagctactcg ggagctgagg caggagaatt 58561 gcttgaaccc aggaggcaga ggttgcagtg agctgagatc gtgtgactgc actccagcct 58621 gggcaacaga gcgagactct ctctctctcc ctctctccat atatatatat atatatacat 58681 acacatacac aaagatatgt ggcctgtcac caagtgttag atggatgttt gggccaggcg 58741 aggtggctca tgcctgtaat cccagcactt tgggaggctg aggcgggtgg atcacctgag 58801 gccaggagtt caagaccagc ctggccaaca tggtgaaacc ccgtctctac taaaagtaca 58861 aaaatcagcc aggcgtggtg gccatgcctg taatcccagc tactcaggag gctgaggcag 58921 gagaatcgcc tgaacctggg aggcagaggt tgccgtgagc cgagatcgtg ccattgcact 58981 ccagcctggg cgacagagag agactctatc tcaaaaaaag aaaaaaaaag tttgtttagt 59041 ttctgttggt aaatatctag aaatagaatt gtaggatggt agggatgtat tgttggtaaa 59101 tgattaacat gtttttaatt ttcagttttt tcagagacgg ggatctcact atattgccca 59161 aactggtttt gaacttatga actcaagtaa tcctcccgtc ttaacttcca tgtagctggc 59221 actactgaac agctttaact cttccagacc ttcagttggg tggttttttc ttttcctttc 59281 gtttccttct ttctctctcc ttccttcctt ctttttgtaa tttgataaac caaatttcaa 59341 gactttggcc aaaaaacagt cttctccttt actgaatatc tcagtgctcc attttatttt 59401 tccctcacca attctgcagg agagatcaca agactatgtc catatcttta tttgtggggg 59461 tatgattttc agctaacata gctgctaagt gatttaattt acaattatag tggtagaaaa 59521 atagctctaa cttctgaggt tttgtaattt attgtaaaaa gcggaataca ggacatttaa 59581 agtgacctga caaggtcaat acaggactga ggactaattc attaaatacg tcttacgtgt 59641 cttagaggac cacagacagc tagcaaccct actcctaatg tgtcccatgc acttagtatt 59701 gtacctggca tcgaatgagt gttcatcaaa tatgttagag taataaatac gacagcacag 59761 tttccctatt cagctttact tttctggtaa ctcttttaat atatagtctt tacctctccc 59821 ttcaagctgt taatgcacaa tgttttcaag tttgaaaaag aatcagattt tggacaccaa 59881 acttgagtgt gtccaaggtt ccaggacaac cttagctttt ttcctcctat cacaaatatg 59941 tttctctata actatttttg ccactttctt tggatatcta cataatcgtc attaatagtt 60001 tcagtaatag ccatttcaca agagttattc agagacaatt tcaactccct gtgcagacac 60061 tgttgataca atttcaggcg aaatgataaa tcactttcca ctaatgagat gagatcatgc 60121 attcccaaac ctttatctca ggagtaagat gctttctttg cacaagggct ttgactggtg 60181 gaaggccaca ctattctggc agataatggt agataagtcg cttttccttt tcagttaggt 60241 ttcagtctct taaaataagt ggaaaatatc tcagatataa agtaataagt atcatgtgaa 60301 taaagtgttg ctgggtaggc tgaaagacaa taactgatta ttcagaaagg aaagatatta 60361 ctcctcaaac tccccatcat gccatgtctg ggtcttttca atgtgcccag cagagcacat 60421 tgaagatcat aaaacagtct ccagagcttc gttctgtgtg tatttggcaa gcacaaaaga 60481 gctgaccgct atgcgtggaa gaccagttct ttatggttgg ccatgttcct tcggatgttc 60541 gtcttttggg aacttgctgt gctttacaga atctgaaagg acaggtctct gaaacatttc 60601 tctgacttgt tctagactgc cttccgggaa aatctggatg cttgccgttt aaaagccaat 60661 gaagcccctc cacgtccttg gagccccgca actgcatttc tcaaagcctc aggaggatcc 60721 tcctggcttt catcctgaac gcgagttaac ttaagttcag aagcggggca ggcagtggcc 60781 tgggaactac attacccaaa agacacagcg gcggacacaa gcagcgagtg tagccaatga 60841 aggcctagca gagcggcgtc tacgggggtt cgcaatgcgt gtgggcggga cttcctgcaa 60901 cgcctcctgg ggttgtcaat atggctgcgt tgggatctgt tcaccttcag gctgagtcga 60961 gactgaggtg aaaaagcgga aaaacgcgag aaaaggtttc cccgttgtac agaggctaga 61021 gtgaggctcg gttgaatcgg ttgcaggcgt tggtgcctct gtcagcgtcc aggtcactgc 61081 cgctcccgcc ccgctcttcc ctggctgtgc tggcggaggc tgcgccgatg aacctgactg 61141 aggtgggtgc cgcgtcccag ggcgccccgc ccgatccctc ctccgagtgc cgaagccccg 61201 aggaggggcc ctgcaggtca ggcccctgtg tcccaaagag aggagcgttc ttgtgtgggg 61261 taccggtgtc cgcggtgctg tgaggcgggg gagctcctgt cagggacctg cacgtgcgag 61321 gcttagaggt gctgcagagc gggcggcact ggggagcccg agcgtctttg tctccacgga 61381 gctgagggtg aggcggagtc tcgcctggga gggcagcgga gctgctgaaa gccgtagaag 61441 gctgtgcagg aagggttatg cccagaggta gacgtttaaa gttgggaaaa acaggatgtt 61501 aaaccaggac aaggggaccg ctgaggagac cccttgagtt atggtaggga tgggccagag 61561 gggttgcagt agaggggaag tgtgatgaga ttctggatgg atctcagaga tagagtcgac 61621 aggatttcct gctgcatcaa atgtagcagt gagggtagtg aagggtctat tccatggatt 61681 ttgttttggc ctgaaaatct gaaagaatgg agttggagaa gtcagcttag ggagcaggtt 61741 tcagagagga ttatgagttg cattttggag ggctgagttt tggagggcag atcatgggtg 61801 ggtccataag cctgacgcta cctttagaaa ggcgcagacc tgtcatagga gggattctgg 61861 ctcatatctg caaaaacgga gttctgaatg ggttaaggaa ctggcctagg ctggagactg 61921 aggtgtgagg tggtttagat ttctgcacct gcagacccag cagcccccca ggcccactct 61981 gtggaacaat aggaaggtca tccagactag tggggacatt gggtccagac gggggtaggg 62041 ctttctcaca tctagtcaag ggcctccagc aagcttccag caactgaatt tgtagcaggc 62101 aggcctggtt gctgatggga ccagtaactg acctgcttgc ttatcccctc cctgcaactt 62161 atttatttat gaatgtgtaa aacatttttt ttttttttga gacagagtct cgctccatcg 62221 cccaggttgg agtgcagtgg cgcgatctcg gctcactgca agctccgcct cccgagttca 62281 agcaattctc ctgcctcagc ctcccgagta gctgggacta caagtgcccg ccaccacgcc 62341 aggctaattt tttgcatttt tagtagagac ggggtttcac cgtgttagct agtatggtct 62401 ctatctcctg acctcatgat ctgcacgcct cagcctccca aagtgctggg attacaggca 62461 tgagccaccg cgcccggccg aatgtataaa acatttattt aaccactaat gaaccagtat 62521 ccttctaaat atgtacacat aggtcggcag ttagaaatga atgtaaccaa tagttaaatt 62581 tattcactta ataattgttg aatgttgatt tttctttttt cttttttttt ttgagacgga 62641 gtctcgctct gtcgccaggc tggagtgcag tggcgtggtc tcggctcact gcaacctctg 62701 cctcccgggt tcaagcgatt ctcctacctc agcctcccga gtagctggga ttacagacat 62761 gcgccaccat gcccatctaa tttttgtatt tttgatagag acagggtttc accatggttg 62821 gccaggatgg tcttgatctc tgcacctcgt gatccgccca cctcggcctc ccaaagtgct 62881 gggattagag gcttgagcca ccatgcccgg ctgaatgtta atttttctat ttgcttgcaa 62941 tggaaagttt tatgtatcac atttccttaa caccgtaaga agtgggaaaa atacacaccc 63001 aaaatttcaa aacccttcca tgtcatttgt accgaaaaca tgtaaaaaat atttaatcca 63061 gtagaaagaa attgctgata catagtgtgt actacttata tttgatgccc atataagccc 63121 tattagttgg caatcctttt aatctctatt ttacacataa gggcaccgag acttaaggaa 63181 gttcagtcct catcaagaca ctgaaccctg aggacttact gtgtaccctc agataacttt 63241 gctctagttg cagggccaga ggtggttgta cagtcagcct gtaggatgct gcaatgctgt 63301 cttaatcctc atggcctgcc tcttcccaca gggttcatag cagtggcagc aatgcttatg 63361 gatgctggac aggtgagtgg agagtgtttc cagctttcac ccatcccaga tggtttcagc 63421 atgctgatat agggaatggt gttatcctga gactgttcac cttttcttct ctctcccttg 63481 ggtccaagga gagcctgtag ttccaccaga cctgggttcc aaactcagtg cctggttata 63541 tagagtggtt ctagttctca caactgatgg tttgagattt ggcaaggcat ctcaaacaca 63601 ggatctagaa tgttcaaatg cttggtggtg agacttagat gggatgctta gagaactgta 63661 gcttaatgtt ctagctggca aggataagga gcacagcagc agggcaggag gtcggggcat 63721 gcagggatca ggcatggaat ggcagcctga ggttgtctag gtttttgttt ttgttttgtt 63781 ttgtttttga gacagaatct tgctctgtct ccaggctgga gtgcagtggc gcgatctcag 63841 ctcactacaa cctccgcctt cctggttcaa gcgattctcc tgccccggcc tcccgagtag 63901 ctgggattat aggcacccgc caccatgccc ggctaatttt tgtatttttt agtagagatg 63961 gggtttcacc atgttggcca ggatgttctc aatctcttga cctcgtgatc tgcctgcctc 64021 ggccctccca aggtgctggg attacaggct tgagccacca cacctggcct tgtttttttt 64081 tttttttcca gacagaggct aactctgtcg cccaggcggg agtgcattga tgtgatctcg 64141 gatcattgcg acctctgtct cccaggttca agcaattctc ctgcctcagc cttccaagta 64201 gctgggatta caggcatgaa ccaacacacc cagctaattt ttgtattttt agtagagaca 64261 ggatttcacc atgttggcca ggctggtctc gaactcctgg cctcaagtga tctgcctgcc 64321 tcagcctccc aaagtgctgg gattacaggt atgagccacc gtgcccagtg gttctaggtc 64381 tttaatctga ggggggtagt agctagggaa ggtttgagaa agaaaaaaga ggtgatttga 64441 cctagattct tttttttttt ttttgagacg gagtctccct ctgtcgccca ggccagagtg 64501 cagtggcgtg aactcggctc actgcaagct tcgcttcctg ggttcacgcc attctcctgc 64561 ctcagcctcc caagtagctg ggattacagg cacctgccat catgcccggc taactttttt 64621 ttgtattttt tagtagagcc agggtttcac cgtgttagga tggtctcgat ctcctgacct 64681 catgatctgc ccgcctcagc ctcccaaagt gctggaatta caggcctgag ccactgcacc 64741 cagccagatt tgacctagat tctaagaggc ttcctctgcc tgcagaatgg aggacagagt 64801 gagagttagg gaagaaccaa ggatacctgg gtgataattc ctgccacagt ccagtggagg 64861 ttggtggtgg ctggacacga gtggcggctg tggaggtggg aggggtgggt ggcttctgta 64921 ggttttgatc tagagctgct cacatttttt gatgtacaga gcgaggaagc tcgggaggta 64981 ggatgcaaga gggagtcagg aatggcccca ttgttttggc cttgttgaga aggttggatg 65041 ttccagcaac taagatgtgg taaattttgt agtaggggga gtttaataga gcatatgaag 65101 aacgaggttt gggctaagtc aaaaatgttc cagagactcg agtagaggtg ccagactggc 65161 agttgaacac acaggaatgg agttcaatgg aggtgttcag gatggagaca tctactatat 65221 gatagccagc gtgtacacca tggtaaatct gtgggtcaga tttaggaata gacatcttta 65281 gattctggag gcaatgctgt tagggagaag gaaggggaag gaaggggctg aggactggat 65341 ctttctgttg ggcacctgga tggggttgct gggcatcaca aagacagatg agggagtgga 65401 tggactgagg atatgtgtgt gtgtatttgt gtgtttatgt tggagatggc ctttggaaaa 65461 taaaggcttt tgagggggag agtgcacata aaagttggat ggtgaggcat ctccacaaaa 65521 gaggtgagat gatagtgagg tctgggaggg gcttagtacc ctattatgga ctcaacaggg 65581 ttttgtggca aatattggtt gtacctgggt agtttcacgc gaactctgag actattttgg 65641 cagaagatgg gatcatctag aagcattgct agtcctgggg tggggatggt gttggggagg 65701 tcagtgtatc tgggctttgg attagagttg ggggatagag aaagtcgtga ctgcttatgg 65761 gacaacatat acatgtagaa gggggagaca gccaagatct ggaactcaaa ggaggtgtgc 65821 aagaatatgt gggtttggaa agctgaggtc aaagaaagaa acaggattgg gttgggagac 65881 gttcaaagca atgtttccta aacattgcat tctggtcatt ctaggggcaa atacgattct 65941 ggttgcattt gtcatgcact gcaggatttc atgtgcagag tggcatggta acacgccatg 66001 gaaatactcc cagttgggct gtggtggttg ttagtgttgc caagagtgga ggtgttggat 66061 acatgaggga ggtgccatcc aaagcctgag tggaggagca tgaatagatg gttccacctc 66121 tggcactggg cacttaaggg aggagggtga gtccaagttt ggttatctga gaggcaagat 66181 ctgggtgact ctagggaaat tgagtgacaa aagaaaggta ggcccccatg tgccaggtgt 66241 tttttttgtt ttgttttgtt ttttgagatg gagtctagct ctgttgccca ggatggagtg 66301 cagtggcgcg atctcggctc actgcaacct ctgcctccca ggttcaagca gtactctgcc 66361 tcagcctccc gagtagctgg gattacaggt gcctgccacc atgcccggct aatttttgta 66421 tttttagtag agatggggtt ttactatctt ggccaggcag gtcttgaact cctaaccttg 66481 tgatctacct gcctcagcct cccaaagtgc tgggattaga ggcatgagcc accatgcccg 66541 gccatgtgcc aggttttgat ggcgcttttc agtctctttc ccccacccat ctttgttcta 66601 tactggagtc agtgatcagt ggcaatgagg ggagagcagg tacaggtgcc atgaagacct 66661 agtgtctctt cctccttcag aagtgggtgg aggctagcag ggtgtggatg tgtgtgggtg 66721 tggtgaggag cactgaaggt cctggagagg gaagtgtatc agtgattgta cccacaggat 66781 ttcaatctgt gaacaagagt gagtgattac agaaaagcca atgacattga aataattttt 66841 tttttcaatt ggtgagaaat catacctgga tgaaatgatt tttattcctt tcatttcctg 66901 agtgcaggga gcatgccaca tcacataaag ccacagagga aatagcggat ttggtcaggt 66961 ggcagatgca cagttgaagg gagagcatat atcagtggct ttattggggt tttggctgga 67021 aaggcatgca ggggagagtg aagagcttaa gactgggtag tttgcatgat tttggcagcc 67081 ttggggtata gggactatcc ctccttttgt ggtacaagcc ttgtgttgat ttagggcagg 67141 agaaagaatc atgtgtgatt gttagatagg aggcagttca gtctatggga tctggattat 67201 aagagaaatg ttaaaatgct ggttgttatt ttggtcctct aattcttaga tttcaagtag 67261 ataaatacag atctaaggaa acagaataag aagttgctca tgggccgggc gtggtggctc 67321 atgcctgtaa tcccagcaat ttgggaggcc aaggcaggca gttcacgagg tcaggagttc 67381 tagaccagcc tgaacaacat ggtgaaaccc catctctact aaaaatacaa aaattagcca 67441 gtcatggtgg cacgcgcctg taatcccagc tactcaggag gctgaggcag gagaatcact 67501 tgaatccggg aggcggagct tgcagtgagc cgagatcaca ccactgcact ccagcctggc 67561 aacagagcga gactccatct caaaaaaaaa aaaaaaaaga aagaaaaaaa aagttgctca 67621 cagactaaac tgttataatt ttggcaggat tatatggttt ttgaggacgt ggccatacat 67681 ttctcccagg aggagtgggg aattcttaat gacgttcaga gacacctgca cagcgatgtg 67741 atgctggaga actttgcact tttgtcctca gtaggtaagg ccctgacacc tacgtcagtg 67801 tcttgtgctg ggcaatgttt tttcactttt ttccttggca tctccgtctc acaccaggtc 67861 atggatgctg catcctctcc tggtttcctg gcatatgtgt tgtggcagct acagctgggc 67921 tgtgtgatct gtatcaattt tccttaagaa gcttagctct tgtcactctg aagccttata 67981 aggctcaaaa gtcagaagtc ctcaggtttc tcaagaatat cctaatgacc ttgcctgtcc 68041 ctggttagtt tactctgtcc aaatgcatga cactttctct gtgcaaggct ttctccattc 68101 ttcttgctga catagggata ccctgtgtca ggaatttcag ggacctacct cgtcactgct 68161 tgtgaggact gtgggcttat ttctgcagga ctctccacta gaagttctgg taactttaga 68221 atacagcatg tcctgggttt attgctctgt gtacatgctg ttgacaccct ccctttccct 68281 gtctttcccc tagcttggct aacgccacac ctagatagtt gctcaccttg agcaaggtag 68341 aggtccctgg gtgcctgaca gagtggacat tactccattg aaggcaagag atgctcaaaa 68401 ggacttggcc ctggtggatg ggagctcaga aggggtgtga tgacagggct gggttcacat 68461 cacactggga atcaatccag tttgatgccc ctgccagtgg ccacaccgca tgtctctctt 68521 ttcttcccta ttgtgatctc tcttttgact ttcatctcct ggctcccacc tctgacccta 68581 cgtctctggc ctccacctct cacctgttcc cgttttttgt tttttttttg tttttgtttt 68641 tgtttttgag atggagtttc gctcttgcaa tggtgcgcgc gatcttgact cactgcaacc 68701 tctgcttccc gggttcaagc gattcttctg cctcagcatc ccaaatagct gggattacag 68761 gcatgcgcca ccacacctgg ctaattttgt atttttagta gagacggtgt ttctccatgt 68821 tggtcaggct ggtctcgaac tcctgacctc agctgatccg cctgccttgg cctcccaaag 68881 ttctgggatt ataggtgtga gccactgcac ccagccacct gttcctttta ctgttccctt 68941 tgtagtgccc tcaacatatg acctgtactt aaggtgttgt gaggtctgga tgtatatatt 69001 tcagcagtcc ctgaacacca gctctactgt cacatcagat ctctctgggt ctggacacag 69061 tcttctacac attgttcacc agaagctatg gcctgttgaa agccattcac tgaggagtaa 69121 cttggggact ccacttctgt gctctgcact gtaccactcc ctatgttgtt ccccatcatc 69181 aggactcttt cgttatgggt ttctgtcagt cccggttctg cccagtaggc tttcctctca 69241 ctggctttaa tgtcccatgc ctgtcaccaa tcccatggtc ttatttgtga gttggtctga 69301 cagacatttg ttatggggct gcctcttccc tccaaagtca tcatgcactt catgagcatt 69361 tctgttttag gttgttggca tggagccaag gatgaggagg caccttccaa gcaatgtgtt 69421 tctgtaggag tgtcacaggt cacaacttta aagccagctt tgtccaccca gaaggcccag 69481 ccctgtgaga catgtagctc acttctgaag gacattctac acctggctga gcatgacgga 69541 acacacccca agcgtacagc caagctttac ctgcaccaaa aggagcatct tagagagaag 69601 ctcaccagaa gtgatgaagg gaggccttcg tttgtgaatg acagtgttca cctggcaaag 69661 aggaacctca catgcatgca gggtggcaag gattttactg gtgattcaga tcttcaacaa 69721 caggctcttc acagtgggtg gaagccacac agggacactc atggtgtgga ggcctttcaa 69781 agtggacaga ataattacag ctgcacccaa tgtgggaaag acttttgcca ccaacataca 69841 ctgtttgagc accagaaaat ccacacagag gaaaggcctt atgagtgcag tgaatgtggc 69901 aaattgttta ggtacaactc cgaccttatt aaacatcagc gaaatcatac tggagaaagg 69961 ccttataagt gtagtgaatg tggaaaagcc ttcagcctca aatacaatgt tgttcaacac 70021 cagaaaattc acactggaga aaggccttat gagtgcagtg aatgtgggaa agcttttctt 70081 agaaagtctc acctacttca gcaccagagg attcacacca ggccaaggcc ttatgtgtgt 70141 agtgaatgtg ggaaggcctt ccttacacag gctcaccttg ttggtcacca gaaaattcat 70201 actggagaac ggccttatgg atgcaatgaa tgtgggaaat actttatgta cagttcagca 70261 ctcattagac atcagaaagt tcacactgga gaaaggcctt tttattgctg tgaatgtggg 70321 aaattcttta tggacagctg cacactcatt attcaccaga gagttcatac tggagaaaaa 70381 ccttatgaat gcaacgaatg tgggaaattc tttagatacc gttccacact cattagacat 70441 cagaaagttc acactggaga aaagccttat gagtgtagtg aatgtgggaa gttctttatg 70501 gacacttcca cactcattat tcatcagaga gttcatactg gagaaaagcc ttatgaatgc 70561 aacaaatgtg ggaaattctt taggtattgc ttcacactga atagacatca gagagttcac 70621 tctggagaga ggccttatga atgcagtgaa tgtggcaaat tctttgtgga cagctgtaca 70681 ctgaagagtc atcagagagt tcacactgga gaaagacctt ttgaatgcag catttgtggg 70741 aaatccttta gatgtcgctc cacacttgat acacatcaga gaattcacac tggtgaaagg 70801 ccttatgagt gtagtgaatg tgggaaattc tttaggcaca actcaaatca tattagacat 70861 cggagaaatc actttggaga aaggtctttt gagtgcactg agtgtgggag agtttttagc 70921 caaaattccc acctcattcg gcaccaaaaa gttcacacta gggaaagaac ttacaaatgc 70981 agcaaatgtg ggaaattttt tatggacagc tccacactca ttagtcatga gagagttcat 71041 actggagaaa agccttatga gtgcagtgaa tgtgggaaag tctttagata caactccagc 71101 ctcattaaac atcggagaat tcacactgga gagagacctt atcagtgcag tgaatgtgga 71161 agagtcttta accaaaattc tcatctcatt cagcaccaga aagttcacac cagataaaga 71221 atgtatatat aaagcagatg gggaaagact tcacacagaa atctactctg atttagcact 71281 gggacctacg ttttaaaaaa agtattcttg tagaatacag ataacataaa atctaacatc 71341 ttaaccatgt taaagtgtat agttcagtac tgttaagtca ttcacattgt gcaatgaata 71401 tctagaagtc ttttcaactt atgaaactaa gtctatacct tttaaaacct tattcctcac 71461 tccatccagc ctcttgacaa gcaccgctct gtatgaattt tactagtccg ggtacctcat 71521 ataagaaaac ttaagttttg gtcttcttgt ggtttatttt gtggcttatt ttgcttaacg 71581 ttatattttt aaggtttcat gttctaatcc attagaattt ccatcctttt taaaggctga 71641 ataaaattct gttagtcatg tgttgcttaa cagtggggaa gtgtcctgag aaaagtgtta 71701 ttaggtgatt ttctttcttt ttttggtggt gggggggttg cgtgaatgcc taggctgtat 71761 ggtatatcct atagcacctt gctacaaact tgtatagcat attactgtac tgaatactgt 71821 aggctgttgg aacacatggt aagtaattgt ttttaagtat atctaaacag aaaaggtaca 71881 gtaaaaatac agtataaaag aaaaaatgat agactcacag agaacttacc atgaatgaag 71941 cttacagtac tgcaagttgc tctaggtgag tcagtgagtg gtaagtgaat gtgaaggcct 72001 aggttgttac tgtgctgtag actttataga cattgtgtac ttagacgaca atacattttt 72061 atttttatta ttatttttga gacagaatct tgctctgttg cccagactag agtgcagtgg 72121 tgcaatcttg gcttcctgca acctcctcca cctcctggtt caagcagttc tgcctcagct 72181 tcccaagtgt ctgggattac aggcatgcac caccatgccc cgctaatttt tgtattttta 72241 gtagagaacg gggtcttacc atgttggcca ggctggtctc aaactcccga cctcaagtga 72301 gccactcgct ttggcctccc aaagtgctgg gattacaggc atgagccacc gtgcccggct 72361 ggacaaaatt aaatttatag aaatttttct ttaatatatt aaccttagct gaccattttt 72421 tactttataa gcttgtattt ttaaaaactt tttgactttt gtaataatgt tttgcttaaa 72481 acattgtaca actgtacaaa aatatttttt atatcctaac ttcttaaatt ttttttgtta 72541 aaaactaaga tacacacatt tgtctaggcc ggcacaggat caggataatg tcattttatt 72601 ccaccttcac aacctgttcc agaagatctt ctgggacagt aacacacatg gagctgtcat 72661 ctaaaataac aatgcctttt tctggaatac ctcctgaagg acctgcccga ggctgtgtta 72721 cagtttgtgt gtatgtacac ctgtatattt atagacatac acacaactcc atatatactc 72781 tactgttgga gaacacctaa tgataaaaag tgtagtatgc taagaacata agctagtaac 72841 tcgtaagttt tatctactgt acattattgt atatgttata cttctatatg actggcctca 72901 cagtaggttt gtttatacca gcatcaccat gaatatgtga ttaatatatt gccttactac 72961 tgctatcatg tcattaggca ataagcattt ttcagcttca ttgtaatttt atggaaccac 73021 tgtcatgaat gtggcccatt gttgactgaa aagtgaggtg catgattata tatgtatgtg 73081 ccacttttcc tttcattagt ggacatttgg gttgtttcca ctcagctgtt gttaatagca 73141 gatgagcatg aatgtacaaa tgtttctttg aggctgtgct ttaaattcct tttgaggatt 73201 tactcagaag tagaattggt gctttctatg tttttttttt gtttgtttgt ttgtttgttt 73261 gttttttgag acgggagtct cgctctgtcg ccaggctgga gtacagtggc ctgatctcag 73321 ctcactgcag cctctgcctc cggggttcaa gcagttctcc tgcctcagcc tcccaagtag 73381 ctgggactac aggcatgcac caccatgccc agataatttt tgtatgttta gttgagacgg 73441 ggtttcaccg tgttggcaag gatggtctca atctcgacct catgatccac tcacctgtcc 73501 ccctatgttc atttaaaaaa tatttttaat tttttaaatt tctgactcac aatgttccca 73561 ttctatgtta attctgtttt aaaatttttg aggactcttc tatagcagct gcatcatttt 73621 ttaatttaac tttattatta attttttttt ttagtagttt tactgggttg agtttttttt 73681 ttttctttag ggttatttgg ttttgagatg taggagtttt ttagtatatg tggggacttc 73741 agaaagtttg tggaaacatg aaagtaaagg atacaaaaag aaaacaagtt ttatttctca 73801 acataagctc catcaagttc aagacaattt tataagtgat gataccaggc atttagtcca 73861 tccctaaaga agtgaggttc ctgggaattt aaccttgtct atgcaatctt ttttaatatt 73921 aactaaagaa aaatgggtgc cctttaaaga ttttttaaaa gtaggaaaca agaagtcaga 73981 aggagccaaa tccggactgt aaggtggatg ccttagggtt atcagaattc ttgtaaagtt 74041 gcagttattt gatgagagga atgagcagga gcattgaagt gaaggactcc tggtgaagct 74101 ttcccatggg ttttctgcta aagctttagc taactttctc aaaacactct cattgcgggc 74161 gccgtggctc acgtctgtaa tcccagcaat ttgggaggcc gagttgggcg gatcatctga 74221 gctcaggagt ttgagaccag cctgaccaac atggagaaac ccccctctct actaaaaaat 74281 acaaaaatta gcctggcgag gtggtgcatg cctgtaatcc cagctactca ggaggctgag 74341 gcaggagaat cgcttgaacc cgggaggcgg aggtggcagt gagccgagat cacaccattg 74401 cactccagcc tgggcaacaa gagtgaaact ccatctcaaa tagtaattaa taataaagcc 74461 atgtttcatc tgtacaattc ttcaaagaaa tgcttcagca tcttgatcct acttgtttaa 74521 catttacatt gaaggttctg ctcttgtctg cagctgatct ggattcagtg gctttggcac 74581 ccattgagtg gaaagttcgc acaactttaa tttttcagtc agaattgtgt aagctgaacc 74641 aattcagatg tctgtggtgt cagttactgt ttctgctgtt aatcatcagt cttcttcaat 74701 aaggtcatga gcaagatgaa tttcttcctc gcaaattgtt attaatggtt tgccattgtg 74761 ggctgcgtgg tcaacatcat ctcatctctt cttaaaacgt tatcaggccg ggcgcggtgg 74821 ctcacgcctg taatcccagc actttgggag gccgaggctg gtggatcatg aggtcaggag 74881 atcgagacta tcctggctaa cacagtgaaa ccccgtctct actaaaaata caaaaaatta 74941 gccgggcgtg gtggcgggcg cctgtactcc cagctactcc ggaggctgag gcaggagaat 75001 ggcgtgaacc cgggaggcgg aggttgcagt gagtggagat cgtgccactg cactccagca 75061 tgggcgacag agtgagactc cgtctaaaaa aaaagaaaac aggttatcag tttgtaaact 75121 ctgatttctc tgggtcattg tgcccataaa cttttctttt tctttttttt gagacggagt 75181 ctcactctgt cgtctgggct ggagtgcagt ggcgcgatct cagctcactg caacctccgt 75241 ttcctgggtt caagcgattc ttctgcttca gcctcctgtg tagctgggat tacaggcatg 75301 cgccaccacg cccggctaat ttttgtattt ttagtagtga tggggtttca ccatattggt 75361 cagaatggtc ttgaactcct gaccttgtga tccgcccaaa ctcggcctcc caaagtgctg 75421 ggattacagg cgcgagccac cgcacccggc cggttttacc atttttagag ccaagcttta 75481 ctatatattt gatatttgtt ctttcttcaa ccttagctga attcacattc ctctgataga 75541 aggtgttttc aaactgatgc cgttcttagt gcctcaaact agatcctgtt catacttgtt 75601 agaacaagtt attacaaatt cactttggtg taaaaaattg aaatccatac ataatttttt 75661 tttttttttt tgacagagtc tcactaacgc taggttggag tgcagtggca tgatctcggc 75721 tcattgcaac ctccgcctcc tgggttcaag caattctcct gcctcagcct cttgagtagc 75781 tgggattaca ggtgcccaca atcacgccca gctaattttt gtatttttag tagagatggg 75841 ttttcactct gttggccagg ctgctctcga actcctgacc tcaggtgatc cacctgcctg 75901 ggcctcacaa agtgctgtga ttacaggctt aagccaccac ccctggccaa ttttttcata 75961 atatacattt ttttctcatt tttcatgaaa cttttgaaga cccctcatat tctagatatt 76021 ccttctcaga tatgtggttt tcaaatactt tctcccattg agtctttttc cttttcactc 76081 tgtccattat gtcctttttt acacaggaat tttgaattta aatggagtct aatatatctg 76141 ttttataagt ctttgatgca tttgagttca ttttagcaac ttaattcttt tgcgtgtgga 76201 tatccagttt ttttttttta acatcaaaag aataatgttt ttgcctagca ttaaggccct 76261 tggtagaggc ttgtcagtta caattttgga gcagcagatt aagtccacac tcccaaccat 76321 tttccttatc aggctctcaa actctgggcc acaatatgta agacccaatc accccaggat 76381 caggaatcag atatctaggg acagcttctg tgcccaggag cttgtaaaat tattccattg 76441 gtcaatgcac aggggtccct gaaaacctag ctaaccccaa tttacatggc acacacaagc 76501 tgccccctaa gctccagctt gctgttatct tgggttccct cataactctt gcagccctgc 76561 ctatgtcctt aggtttcaag ctgtaagtag caaagtggtc tacattttat gattatcatt 76621 gtgacatgtc ctgacatcag aaaaacacct ttgtatgtta ttactataca accagcagaa 76681 tattatgagt gcagcaaatg ttagaaagta ttcagcctaa cttcactgag caagagtaag 76741 ttcatcctgg agaaagtcct taggaatgca ggcaatatac tttttttcct ttgtcaacag 76801 gtcaaaaaca gcaaagctct atcgagcttg tcttactcac cctatttttt tgttgctctg 76861 ttttgtttta ggcttttagc ctgaagccat ggttttgttt ctgtctctag tggtaggtgg 76921 acaagaggaa tgagatgaga aaggagcttt actggcccag ctagaaacaa actaagaacc 76981 catgactgta ttctttccct tggatgaccc tgtgttagct tgttgaggga gatctcagcc 77041 tgaaattgaa tctcacatcc aaacatccac gcaagggaga tttgttgtaa ttgtcagata 77101 tatggtaaat ttttgtgaat gatgttgcac tttctgaccc tgcctggggc ctttccagag 77161 ttaagttgct gaaagtgtgc attacagaag actcctgcta ttagctgtca tggtgccaca 77221 atgtgcatca ccttagtcac cttaaattac ttagagagtg ataaggtctg gacttctggt 77281 taaatgtttt taaaaaatgg ggggtggggg ggtggtgcat agattgctgt gttctctacc 77341 tttatctgga atattcagtc atttgttccc tttgggggcc tcattcccag tcccctgact 77401 ggtttgggtg tggacatcac ccagctttgg acagagaaca cacgccaact tcagctggca 77461 gcttgtagag atttcctttt ttcagaggta ttattagttg tctgatactg ataatgttga 77521 tgataaattt tctaccttcc aagcttccca acccagtcaa tttccaccta agcattgctg 77581 ttttcttctg atgataaagg tcatattgtt taagctacat ttactcttgg ggttctcttc 77641 actgtgtgct gcgggttgag aacaaaatta ggctttgcca gaatgaaaaa gtgaatggtt 77701 tttggggcct tcaacttttt gtgctcttga agaaataaga agacaaaata gctttcaatc 77761 cacatcaggc ccaatttgca ttgcttcggg agttcctggg aaagtgacgg acttctatcc 77821 aaaatcgcgc cgtgaatttg attattggta gttctacagt cagcttgagg gttgttggtt 77881 tgacagttgt cagagcatgt tgcagctgta tgaggtgggt atctgtacat atggatgtcc 77941 catattctcc agcattgcat aggaatagct ggtgtctaga tcctgcctca ggagctatgt 78001 gtcctgaatt taaaaatcag gtatgttata tccctggggc atgtcagaca tacaacaaca 78061 gtgcattagt ccattttcca ttgcttacaa caaaatacct taaactgggt aatttgtaaa 78121 gaaaataaat ttcttactgt tgttttgttg tgctttcttt tttttttttt tttttttgtt 78181 tactttgttt ttagagacag gatctcactc tgtcccctaa gctggagtac agtggcatga 78241 tcatagctca ctgccacctg gaacccctgg gctcaagtga tcctcctgcc tcagcctctc 78301 aagtagctgg gactacaggt gagcaccacc acacttggct actgttatta ttattttgac 78361 aagatttaat gtagaaagta tagcacctgt tatctaccag gaatagatga ggatgatcag 78421 gtacatttat gtttgaatct gagcgtttaa gtgtatggga agatattaga cactaccttt 78481 cctcataaga cactcagtga tctgaattga ggaacttcta gtttttgctc tcccacttct 78541 tggaaacccc tatctacttt cctatgtatt tgactacttg aggtagctta tacaagtaga 78601 atcatacaaa tgtattgttt tgtgactagc ttacttcact tacaataagt cctcaagttc 78661 ttccatgttg ctgtgtgtgt caaaatttcc ttccttgcta aggctgaata atattccatt 78721 gtatatatat tttctttatc cagtcattca tcaatgaata cttttttttg tgtgtgtgtg 78781 tggtagagtt tcgctcttgt tgcccaggct ggagtgcaat ggcgatctca gctcactgca 78841 acctccacct cctgggttca agcgattctc ctgccttagc ctcctgagta gctgggatta 78901 caggtgccca ccaacacgcc tggctaattt tttgtatttt tagtagagat ggggttttgc 78961 catattggcc gggctggtct cgaactcctg acctcaggtg atctgcctgc ctcggcctcc 79021 caaagtactg ggattacagg cgtgagccac tgcacccagc cacatcaatc aacactttga 79081 ttgtctccac attctggcta ctgtgaacat gggtgtggaa ctatcttcat gaagccctac 79141 tttaatttct ttttgtatat acccagaagt aaaattgctg gatcacatat aattctattt 79201 taaaattttt tcttagtttt aaattatctt ttggagacag ggtattgctc tgtacccagg 79261 ctggagtaag tggcacagtc acagttcact gcatccttga catcctgggc tcaagagatc 79321 ctcctgcctc agatttccaa gtagctagga ctaaaagtgt gccaccacca tgcctggtga 79381 atttttttta ttttttattt tttgaggcag gtcttcctct ggcactcagg ctggagtggt 79441 gcagtggtgt aatctcagct taccacagtc caataccagg ttcaagtgat cctcctacct 79501 cagcctcctg agtagctgag accacaggtg cacagccacc atggtggctg catcatttta 79561 cattcttatc aaaaggcaca tgtttctaat ttttccacat ccttgacaac acttgttgtg 79621 ttcttctgtt tcttttaatt gtagctactc tgttcgaatg gattttgatt gaaaacattt 79681 gaggtatggc ttttaggaga cttgatctta atttcctagc ctttttgacc tatagtttta 79741 ttgtggttta ttgtggaaaa aggtgccctg tagctgccta tccaggtcag tctaatcacc 79801 tttcatcatt caggatttat catctgaggt tcacaccaac atgtatacaa gtggcatgtc 79861 ttagtctgtt ttgtgcttct ataacagaat accacaggct gggtaattta caaagaagag 79921 acatttattg gctcgtagtt tggaggctgg gaagtccaat accaagatac tagcatctgg 79981 tgagggcctt ctcgctgcat cataacatga cagaagccat caaatggtag aagagcaaag 80041 agacagcaag agggaataag aacccattct tgcgatgata gcattagtcc acctatgagg 80101 gtggagccct catgatctga agcctcttaa aggttccacc tcttaatatt gttacagtgg 80161 caattagatt tcagcatgag tttggagaag acaaaggttt aaaccataac atggtgtaag 80221 tcacataact ctgggtcata tgcatatgaa agaattcgaa atttctctat gaacacatgc 80281 tagtcttatt tgggtttaga gaaatcttta gttgcagctg ttaattcatc ctgggtccaa 80341 ggtttacatc tgacagttgc ctacatactt ggctcttgaa agcatagatc tttagaggta 80401 attggcattt tggaatttta ggatattttt tgggaaaata aagggggcct gggcaaagga 80461 tagaagaggg aggaggtgca gaaggagaaa gattaggcag ggtacggtgg ctcacgcctg 80521 taatcccagc actttgggag gccaaggtgg gcagatcacg aggtcaggag attgagacca 80581 tcctggccaa catggtgaaa ccccgtcttt actaaaatac aaaaaattag ctgggcgtga 80641 tggtgcgtgc ctgtagtccc tgctactcag gaggctgagg caggggaacc acttgaaccc 80701 gggaggcaga gattgcagtg agccgaaatt gtgccactgc actccagcct gatgacagag 80761 caggactctg tctcaaaaaa aaaaaaaaaa aaagagaaag attagaaggg taaaggggaa 80821 gtagaagagt tagatataag gcagcaacta gaggaatagg agggggagga tattgcttaa 80881 gtttgccaaa ttgttcaatg gctttattca aggaaatttt tactgaaaca attttttatt 80941 cagaaccttg gagattgtac aaatacgatt cattccaata aaaaactact gcttactggg 81001 cctgagaaat tttgttttct ttcttttcta atgcatctct taaattaaca gtttttgttt 81061 tcttttcaac tgaaatgttc ctcctagtag ccactgaagg caaaaattat ccgggtgaag 81121 ttatgccact tagaaaccta aacactagaa ttgtgattgt aatcagagta catataacat 81181 gcaggagact ttcctggaaa ggagacgatt tcaatttaga cgatccaata agtgctggca 81241 ctgtctgaga ggagtgtaat gattatgtac cttaggcctg gttctacagc atgataggtg 81301 gctacttccc cacaagaaca cgacaatctt cacacctcag catgcagaaa atctgagaat 81361 actctttctg aatgaatgcc taattctctg gacaaaaaag aaaaaaaaaa aagaaaaaga 81421 agataagaga atcttatccc attttatcag tgacccaaag caaagtctgt ccaaatggtc 81481 accagtctgt tgagaatcag aaaacccttg gtctgtgata ttagctgaaa caagtcacta 81541 gagcctatgc ttgagttttt tttgtttttg tttttttaac ggagtttcac tcttgttgcc 81601 caggctgaag tgcaatggcg tgatcttggc tcactgcaac ctctgcctcc tggattcaag 81661 caattctcct gcctcagcat cccgagtagc tgggattata ggcgcatgcc tggccaattt 81721 tgtatttttt agtaaagatg gagtttttag taaagatggg gtttctccat gctggtcagg 81781 ctgatc // LOCUS AC003682 118033 bp DNA PRI 16-DEC-1997 DEFINITION Human DNA from chromsome 19-specific cosmids F18547, R27945 and R28830, genomic sequence, complete sequence. ACCESSION AC003682 NID g2689440 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 118033) AUTHORS Lamerdin,J.E., McCready,P.M., Adamson,A.W., Burkhart-Schultz,K., Garcia,E., Kyle,A., Ramirez,M., Stilwagen,S., Garnes,J., Danganan,L., Bruce,R., Quan,G., Montgomery,M., Ow,D., Kobayashi,A., Olsen,A.O. and Carrano,A.V. TITLE Sequence analysis of a 500 kb ZNF gene family- containing human contig in 19q13.4 JOURNAL Unpublished REFERENCE 2 (bases 1 to 118033) AUTHORS Lamerdin,J.E. TITLE Direct Submission JOURNAL Submitted (16-DEC-1997) Human Genome Center, Lawrence Livermore National Laboratory, 7000 East Ave., Livermore, CA 94551, USA COMMENT Map and sequence oritented from centromere to telomere. Sequence derived from cosmid F18547 from bases 1 to 37,736, cosmid F11133 from bases 36,861 to 40,261, cosmid R27945 from bases 40,194 to 77, 593, and cosmid R28830 from 74,683 to 118,033. Sequence overlaps cosmid F25419 to the left and cosmid R32804 to the right. FEATURES Location/Qualifiers source 1..118033 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="F18547-F11133-R27945-R28830" /chromosome="19" /map="19q13.4 between D19S303 to ZNF134" /cell_line="UV5HL9-5B for F library clones, 5HL2-B for R library clones" /note="cosmid libraries LL19NC02 and C03 constructed at LLNL from flow-sorted chromosomes from human-hamster hybrids UV5HL9-5B and 5HL2-B, respectively, which carry chromosome 19 as their only human chromosome." repeat_region complement(6..62) /rpt_family="Alu" repeat_region 817..1114 /rpt_family="Tigger2" repeat_region complement(817..879) /rpt_family="MER8" repeat_region complement(1117..1317) /rpt_family="Alu" repeat_region 1329..1750 /rpt_family="LTR3" misc_feature 1734..1808 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: excellent, score: 78.000" repeat_region complement(2612..4770) /standard_name="HERVK" /rpt_family="LTR/Retroviral" misc_feature 4283..4395 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: marginal, score: 43.000" misc_feature 4766..4877 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: excellent, score: 85.000" misc_feature complement(6022..6471) /note="BLASTX similarity to 1196429 (1094..1243); match: 0.34, score: 3.4e-283; database searched: nr; (M14123) pol/env ORF (bases 3878-8257) first start codon at 4172, Xxx" misc_feature complement(6483..6719) /note="BLASTX similarity to 1196429 (1247..1325); match: 0.29, score: 3.4e-283; database searched: nr; (M14123) pol/env ORF (bases 3878-8257) first start codon at 4172, Xxx" repeat_region 6786..7215 /rpt_family="LTR3" repeat_region complement(7216..7280) /rpt_family="Alu" repeat_region complement(7355..7500) /rpt_family="Alu" repeat_region 7528..7791 /rpt_family="Alu" repeat_region complement(7828..7900) /rpt_family="Alu" repeat_region 7923..9480 /rpt_family="Tigger2" repeat_region complement(8733..9023) /rpt_family="Alu" misc_feature complement(9210..9310) /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: excellent, score: 80.000" repeat_region 9812..10109 /rpt_family="Alu" repeat_region complement(10337..10630) /rpt_family="Alu" repeat_region 11021..11316 /rpt_family="Alu" misc_feature 12642..12966 /note="DDS similarity to AA324357 EST27166 Cerebellum II Homo sapiens cDNA 5' end; Score: 593 Identity: 323/332 (97%)." repeat_region complement(13440..13909) /rpt_family="Alu" repeat_region 13941..14244 /rpt_family="Alu" repeat_region 15009..15288 /rpt_family="Alu" misc_feature complement(15451..15604) /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: excellent, score: 83.000" misc_feature 16168..16318 /note="DDS similarity to AA312440 EST183110 Jurkat T-cells VI Homo sapiens cDNA 5' end; (1..151); 98% identity.~~Other overlapping matches:~AA306860 EST177786 Jurkat T-cells VI Homo sapiens cDNA 5' end similar to similar to transcription factor (GB:L32162); (1..151); 99% identity.~" repeat_region complement(17738..17912) /rpt_family="MER7" repeat_region 17941..18036 /rpt_family="MER44C" repeat_region 18328..18603 /rpt_family="Alu" repeat_region 18653..18792 /rpt_family="MER44C" misc_feature 18984..19142 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: good, score: 51.000" repeat_region complement(19143..19459) /rpt_family="MER21" misc_feature 19782..19891 /note="DDS similarity to AA312440 EST183110 Jurkat T-cells VI Homo sapiens cDNA 5' end (152..261); 99% identity.~~Other overlapping matches:~AA306860 EST177786 Jurkat T-cells VI Homo sapiens cDNA 5' end similar to similar to transcription factor (GB:L32162); (152..261); 100% identity.~~(19782..19820) predicted exon, program: grail2exons_human_1.3, frame: 1, quality: excellent, score: 76.000" repeat_region 20126..20631 /rpt_family="MER44C" repeat_region complement(21667..21823) /rpt_family="Alu" CDS join(23850..23978,25861..27639) /note="Hypothetical Kruppel-type Zinc Finger Protein; Most similar to zinc finger protein ZNF132 (U09411)[Homo sapiens]" /codon_start=1 /product="F18547_1" /db_xref="PID:g2689441" /translation="MQGHVTFEDIAVYFSQEEWGLLDEAQRCLYHDVMLENFSLMASL FPSTKVCIYFTCIFMLLGCLHGIEAEEAPSEQTLSAQGVSQARTPKLGPSIPNAHSCE MCILVMKDILYLSEHQGTLPWQKPYTSVASGKWFSFGSNLQQHQNQDSGEKHIRKEES SALLLNSCKIPLSDNLFPCKDVEKDFPTILGLLQHQTTHSRQEYAHRSRETFQQRRYK CEQVFNEKVHVTEHQRVHTGEKAYKRREYGKSLNSKYLFVEHQRTHNAEKPYVCNICG KSFLHKQTLVGHQQRIHTRERSYVCIECGKSLSSKYSLVEHQRTHNGEKPYVCNVCGK SFRHKQTFVGHQQRIHTGERPYVCMECGKSFIHSYDRIRHQRVHTGEGAYQCSECGKS FIYKQSLLDHHRIHTGERPYECKECGKAFIHKKRLLEHQRIHTGEKPYVCIICGKSFI RSSDYMRHQRIHTGERAYECSDCGKAFISKQTLLKHHKIHTRERPYECSECGKGFYLE VKLLQHQRIHTREQLCECNECGKVFSHQKRLLEHQKVHTGEKPCECSECGKCFRHRTS LIQHQKVHSGERPYNCTACEKAFIYKNKLVEHQRIHTGEKPYECGKCGKAFNKRYSLV RHQKVHITEEP" misc_feature 23856..23978 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: good, score: 72.000~~DDS similarity to AA312440 EST183110 Jurkat T-cells VI Homo sapiens cDNA 5' end (262..359); 99% identity.~~Other overlapping matches:~DDS similarity to AA306860 EST177786 Jurkat T-cells VI Homo sapiens cDNA 5' end similar to similar to transcription factor (GB:L32162); (262..384); 100% identity.~~BLASTX similarity to 1572600 (9..54); match: 0.63, score: 1.8e-10; database searched: nr; (U69133) Zik1 [Mus musculus]" repeat_region 25552..25814 /rpt_family="Alu" misc_feature 28143..28588 /note="DDS similarity to AA463280 zx71a09.r1 Soares total fetus Nb2HF8 9w Homo sapiens cDNA clone 796888 5'~Score: 853 Identity: 440/443 (99%)." misc_feature 28460..28498 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: good, score: 53.000" repeat_region 29198..29480 /rpt_family="Alu" misc_feature complement(29225..29586) /note="DDS similarity to AA463192 zx71a09.s1 Soares total fetus Nb2HF8 9w Homo sapiens cDNA clone 796888 3' similar to contains Alu repetitive element;contains element MER7 repetitive element ; Score: 724 Identity: 362/362 (100%)." repeat_region 29665..29954 /rpt_family="Alu" misc_feature 30564..30802 /note="DDS similarity to R31607 yh76h05.s1 Homo sapiens cDNA clone 135705 3'. gi|1592846|gb|G29294|G29294 human STS SHGC-30341; Score: 436 Identity: 232/241 (96%)" misc_feature complement(30664..31611) /note="DDS similarity to overlapping ESTs:~(31031..30664) R31347 yh76h05.r1 Homo sapiens cDNA clone 135705 5'; Score: 632 Identity: 351/374 (93%).~~(31270..30874) H84590 ys69b07.r1 Homo sapiens cDNA clone 220021 5'. Score: 741 Identity: 389/400 (97%).~~(31611..31160) AA406399 zv12b06.r1 Soares NhHMPu S1 Homo sapiens cDNA clone 753395 5'; Score: 904 Identity: 452/452 (100%)." misc_feature complement(31412..31548) /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: excellent, score: 84.000" misc_feature complement(31657..32117) /note="DDS similarity to AA130381 zl30d05.r1 Soares pregnant uterus NbHPU Homo sapiens cDNA clone 503433 5';~Score: 905 Identity: 458/460 (99%)." misc_feature complement(31821..31892) /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: excellent, score: 83.000" misc_feature complement(32080..32302) /note="DDS similarity to AA370512 EST82251 Prostate gland I Homo sapiens cDNA 5' end; Score: 430 Identity: 219/223 (98%)." repeat_region 32333..32666 /rpt_family="Alu" repeat_region 33563..33837 /rpt_family="Alu" repeat_region 34323..34711 /rpt_family="Alu" misc_feature complement(35368..35731) /note="DDS similarity to AA385612 EST99282 Thyroid Homo sapiens cDNA 5' end similar to zinc finger protein family; Score: 651 Identity: 357/371 (96%)." misc_feature complement(35654..36370) /note="BLASTX similarity to 387079 (216..454); match: 0.56, score: 8.2e-125; database searched: nr; (M36516) zinc finger protein (mkr5) [Mus musculus]" CDS complement(join(35687..36705,44943..45069)) /note="Hypothetical Kruppel- Type Zinc Finger Protein; Most similar to PID|d1024590 (AB007873) KIAA0413 [Homo sapiens]" /codon_start=1 /product="R27945_1" /db_xref="PID:g2689446" /translation="MLVTFKDVAVTFTREEWRQLDLAQRTLYREVMLETCGLLVSLGD RAQVHTREPTTYPPVLSERAFLRGSLTLESSTSSDSRLGRARDEEGLLEMQKGKVTPE TDLHKETHLGKVSLEGEGLGTDDGLHSRALQEWLSADVLHECDSQQPGKDALIHAGTN PYKCKQCGKGFNRKWYLVRHQRVHTGMKPYECNACGKAFSQSSTLIRHYLIHTGEKPY KCLECGKAFKRRSYLMQHHPIHTGEKPYECSQCRKAFTHRSTFIRHNRTHTGEKPFEC KECEKAFSNRAHLIQHYIIHTGEKPYDCMACGKAFRCSSELIQHQRIHTGEKPYECTQ CGKAFHRSTYLIQHSVIHTGEMPYKCIECGKAFKRRSHLLQHQRVHT" misc_feature complement(35832..36705) /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: good, score: 68.000" misc_feature complement(35954..36247) /note="BLASTX similarity to 387079 (88..185); match: 0.48, score: 3.3e-37; database searched: nr; (M36516) zinc finger protein (mkr5) [Mus musculus]" repeat_region 37036..37615 /rpt_family="L1" repeat_region 38003..38160 /rpt_family="Alu" misc_feature 38928..39137 /note="BLASTX similarity to (1144..1213); match: 0.35, score: 3.2e-06; database searched: nr; reverse transcriptase homolog - human transposon L1.1 gi|339771 (M80341) ORF2 contains a reverse transcriptase domain., ORF2 [Homo sapiens]" misc_feature 40919..41361 /note="DDS similarity to R35149 yg62a03.r1 Homo sapiens cDNA clone 37228 5'. Score: 669 Identity: 412/458 (89%)." misc_feature complement(41214..41434) /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: marginal, score: 47.000" repeat_region complement(42148..42391) /rpt_family="Alu" repeat_region 42617..42898 /rpt_family="Alu" repeat_region complement(44055..44338) /rpt_family="Alu" misc_feature complement(44941..45063) /note="BLASTX similarity to (1..41); match: 0.68, score: 8.0e-11; database searched: nr; finger protein ZNF133 - human (fragment)" misc_feature complement(44944..45072) /note="BLASTX similarity to 688395 (11..53); match: 0.65, score: 3.2e-12; database searched: nr; (S73488) zinc finger transcription factor, Kid-1 {KRAB A and B regions} [rats, liver, Peptide Partial, 110 aa] [Rattus sp.]" misc_feature 45016..45333 /note="DDS similarity to AA357230 EST66191 LNCAP cells I Homo sapiens cDNA 5' end similar to similar to zinc finger protein family; Score: 601 Identity: 311/319 (97%)." misc_feature 45209..45336 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: excellent, score: 81.000" repeat_region 45420..46412 /rpt_family="LTR5" misc_feature 46710..46859 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: excellent, score: 79.000" misc_feature 46903..47105 /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: excellent, score: 75.000" misc_feature 47329..47509 /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: good, score: 59.000" misc_feature 48723..48967 /note="DDS similarity to AA338353 EST43276 Fetal brain I Homo sapiens cDNA 5' end; Score: 414 Identity: 229/245 (93%)." misc_feature complement(49472..49769) /note="DDS similarity to AA338352 EST43275 Fetal brain I Homo sapiens cDNA 3' end; Score: 568 Identity: 291/298 (97%)." misc_feature complement(49874..50240) /note="DDS similarity to N20152 yx41a03.s1 Homo sapiens cDNA clone 264268 3'. Score: 914 Identity: 457/457 (100%)." repeat_region complement(51339..51606) /rpt_family="Alu" repeat_region 52007..52283 /rpt_family="Alu" misc_feature complement(52853..54042) /note="BLASTX similarity to 2323287 (292..601); match: 0.54, score: 6.3e-44; database searched: nr; (AF009668) polyprotein [multiple sclerosis associated retrovirus]" repeat_region 53078..53362 /rpt_family="Alu" misc_feature 54656..54782 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: good, score: 54.000" misc_feature 55193..55318 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: good, score: 51.000" misc_feature 55506..55541 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: good, score: 53.000" repeat_region complement(57350..57684) /rpt_family="MER1" repeat_region 58438..58740 /rpt_family="Alu" misc_feature complement(59259..59303) /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: good, score: 71.000" misc_feature complement(60771..61922) /note="BLASTX similarity to P52740 (192..575); match: 0.54, score: 2.3e-183; database searched: nr; ZINC FINGER PROTEIN 132 pir||I38598 zinc finger protein ZNF132 - human >gi|488551 (U09411) zinc finger protein ZNF132 [Homo sapiens]~~Other overlapping matches:~(61254..60869) DDS similarity to AA384293 EST97797 Thyroid Homo sapiens cDNA 5' end similar to zinc finger protein family; Score: 745 Identity: 381/387 (98%).~~(61538..60931) AA312672 EST183346 Jurkat T-cells VI Homo sapiens cDNA 5' end similar to similar to zinc finger protein ZNF132; Score: 1176 Identity: 601/608 (98%)." CDS complement(join(60831..62201,64514..64549)) /note="Hypothetical Kruppel- Type Zinc Finger Protein; " /codon_start=1 /exception="Most similar to Z132_HUMAN ZINC FINGER PROTEIN 132 (U09411); 39% identity." /product="R27945_2" /db_xref="PID:g2689445" /translation="MLENFALITALGDQKHHSAEKPLESDMDKASFVQCCLFHESGMP FTSSEVGKDFLAPLGILQPQAIANYEKPNKISKCEEAFHVGISHYKWSQCRRESSHKH TFFHPRVCTGKRLYESSKCGKACCCECSLVQLQRVHPGERPYECSECGKSFSQTSHLN DHRRIHTGERPYVCGQCGKSFSQRATLIKHHRVHTGERPYECGECGKSFSQSSNLIEH CRIHTGERPYECDECGKAFGSKSTLVRHQRTHTGEKPYECGECGKLFRQSFSLVVHQR IHTTARPYECGQCGKSFSLKCGLIQHQLIHSGARPFECDECGKSFSQRTTLNKHHKVH TAERPYVCGECGKAFMFKSKLVRHQRTHTGERPFECSECGKFFRQSYTLVEHQKIHTG LRPYDCGQCGKSFIQKSSLIQHQVVHTGERPYECGKCGKSFTQHSGLILHRKSHTVER PRDSSKCGKPYSPRSNIV" misc_feature complement(61352..61759) /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: marginal, score: 44.000" misc_feature complement(62006..62201) /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: excellent, score: 81.000" repeat_region 62536..62743 /rpt_family="Alu" misc_feature complement(64383..64457) /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: good, score: 59.000" misc_feature complement(64514..64645) /note="BLASTX similarity to 1572600 (11..54); match: 0.75, score: 7.5e-13; database searched: nr; (U69133) Zik1 [Mus musculus]~~Other overlapping matches:~(64642..64516) predicted exon, program: grail2exons_human_1.3, frame: 0, quality: excellent, score: 86.000" misc_feature 65617..65737 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: good, score: 67.000" misc_feature 67225..67356 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: excellent, score: 94.000" repeat_region 68969..69611 /rpt_family="L1" repeat_region 69718..70033 /rpt_family="Alu" misc_feature 70264..70371 /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: good, score: 50.000" repeat_region complement(71402..71651) /rpt_family="Alu" misc_feature 72471..72602 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: good, score: 53.000" misc_feature 73063..73190 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: excellent, score: 91.000" misc_feature 73664..73702 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: excellent, score: 100.000" repeat_region complement(74453..74754) /rpt_family="Alu" repeat_region 75003..75292 /rpt_family="Alu" repeat_region complement(75456..75578) /rpt_family="MSTAR" repeat_region complement(75612..75967) /rpt_family="THE1" misc_feature 76441..76477 /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: good, score: 62.000" misc_feature 77248..77379 /note="BLASTX similarity to 1572600 (11..54); match: 0.72, score: 8.6e-224; database searched: nr; (U69133) Zik1 [Mus musculus]" misc_feature 77251..77377 /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: excellent, score: 85.000" CDS join(77344..77377,78723..79987) /note="Hypothetical Kruppel-Type Zinc Finger Protein; Most similar to mouse Zik1 (U69133); 66% residue identity" /codon_start=1 /product="R28830_1" /db_xref="PID:g2689442" /translation="MLENFALVASLGCGHGTEDEETPSDQNVSVGVSQSKAGSSTQKT QSCEMCVPVLKDILHLADLPGQKPYLVGECTNHHQHQKHHSAKKSLKRDMDRASYVKC CLFCMSLKPFRKWEVGKDLPAMLRLLRSLVFPGGKKPGTITECGEDIRSQKSHYKSGE CGKASRHKHTPVYHPRVYTGKKLYECSKCGKAFRGKYSLVQHQRVHTGERPWECNECG KFFSQTSHLNDHRRIHTGERPYECSECGKLFRQNSSLVDHQKIHTGARPYECSQCGKS FSQKATLVKHQRVHTGERPYKCGECGNSFSQSAILNQHRRIHTGAKPYECGQCGKSFS QKATLIKHQRVHTGERPYKCGDCGKSFSQSSILIQHRRIHTGARPYECGQCGKSFSQK SGLIQHQVVHTGERPYECNKCGNSFSQCSSLIHHQKCHNT" misc_feature 78723..79163 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: good, score: 56.000" misc_feature 79157..79981 /note="BLASTX similarity to 1572600 (188..462); match: 0.89, score: 8.6e-224; database searched: nr; (U69133) Zik1 [Mus musculus]" misc_feature 79371..80054 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: good, score: 67.000" misc_feature 80018..80478 /note="DDS similarity to AA130267 zl30g01.r1 Soares pregnant uterus NbHPU Homo sapiens cDNA clone 503472 5';~Score: 897 Identity: 460/464 (99%)" repeat_region complement(81760..82043) /rpt_family="Alu" repeat_region 83176..83479 /rpt_family="Alu" misc_feature complement(84329..84467) /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: good, score: 50.000" repeat_region 84970..85329 /rpt_family="THE1" repeat_region 85778..85988 /rpt_family="Alu" repeat_region 86252..86530 /rpt_family="Alu" misc_feature complement(86604..86760) /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: good, score: 55.000" repeat_region complement(86817..87107) /rpt_family="Alu" repeat_region 87214..87696 /rpt_family="MLT2B2" misc_feature complement(89501..89743) /note="BLASTN similarity to Z54932 (1..245); match: 0.94, score: 1.2e-72; database searched: nt; H.sapiens CpG DNA, clone 175d10, reverse read cpg175d10.rt1a ." misc_feature 89853..89997 /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: good, score: 62.000" repeat_region complement(90402..90543) /rpt_family="THE1" misc_feature 91375..91661 /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: marginal, score: 44.000" repeat_region complement(91407..92068) /rpt_family="THE1" repeat_region complement(91614..91889) /rpt_family="Alu" misc_feature 93034..93118 /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: excellent, score: 100.000" CDS join(93085..93118,94398..96037) /note="Hypothetical Kruppel-Type Zinc Finger Protein; Most similar to human ZNF84 (zinc finger protein HPF2; M27878); 38% residue identity" /codon_start=1 /product="R28830_2" /db_xref="PID:g2689443" /translation="MLENFAVMASLGCWCGAVDEGTPSAESVSVEELSQGRTPKADTS TDKSHPCEICTPVLRDILQMIELHASPCGQKLYLGGASRDFWMSSNLHQLQKLDNGEK LFKVDGDQASFMMNCRFHVSGKPFTFGEVGRDFSATSGLLQHQVTPTIERPHSRIRHL RVPTGRKPLKYTESRKSFREKSVFIQHQRADSGERPYKCSECGKSFSQSSGFLRHRKA HGRTRTHECSECGKSFSRKTHLTQHQRVHTGERPYDCSECGKSFRQVSVLIQHQRVHT GERPYECSECGKSFSHSTNLYRHRSAHTSTRPYECSECGKSFSHSTNLFRHWRVHTGV RPYECSECGKAFSCNIYLIHHQRFHTGERPYVCSECGKSFGQKSVLIQHQRVHTGERP YECSECGKVFSQSSGLFRHRRAHTKTKPYECSECEKSFSCKTDLIRHQTVHTGERPYE CSVCGKSFIRKTHLIRHQTVHTNERPYECDECGKSYSQSSALLQHRRVHTGERPYECR ECGKSFTRKNHLIQHKTVHTGERPYECSECGKSFSQSSGLLRHRRVHVQ" misc_feature 94398..94798 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: good, score: 57.000" misc_feature 94892..96034 /note="BLASTX similarity to P52740 (154..534); match: 0.59, score: 4.6e-204; database searched: nr; ZINC FINGER PROTEIN 132 pir||I38598 zinc finger protein ZNF132 - human >gi|488551 (U09411) zinc finger protein ZNF132 [Homo" misc_feature 95373..96011 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: good, score: 69.000" repeat_region complement(96060..96346) /rpt_family="Alu" repeat_region 97519..97750 /rpt_family="Alu" repeat_region 97772..98054 /rpt_family="Alu" repeat_region 98065..98356 /rpt_family="Alu" misc_feature 98182..98812 /note="DDS similarity to overlapping ESTs:~(98812..98293)AA412283 zu10f05.s1 Soares testis NHT Homo sapiens cDNA clone 731457 3'; Score: 1036 Identity: 519/520 (99%).~~(98182..98749) AA470015 zu10f05.r1 Soares testis NHT Homo sapiens cDNA clone 731457 5' similar to contains Alu repetitive element; Score: 1136 Identity: 568/568 (100%).~~(98513..98226) AA380471 EST93438 Supt cells Homo sapiens cDNA 5' end similar to EST containing Alu repeat; Score: 565 Identity: 287/289 (99%)." repeat_region 101255..101368 /rpt_family="Alu" repeat_region 101625..101915 /rpt_family="Alu" misc_feature 102987..103215 /note="BLASTN similarity to U09412 (1..228); match: 1, score: 6.9e-76; 99% identity; database searched: nt; Human zinc finger protein ZNF134 mRNA, complete cds." misc_feature 103962..104268 /note="BLASTN similarity to T05443 (1..309); match: 0.98, score: 9.1e-112; database searched: est; EST03332 Homo sapiens cDNA clone HFBCY52." repeat_region 104200..104479 /rpt_family="LTR10" repeat_region complement(105231..105309) /rpt_family="MIR" misc_feature complement(105593..105951) /note="DDS similarity to M86035 EST02560 Homo sapiens cDNA clone HFBCY52. Score: 673 Identity: 349/358 (97%)." misc_feature 107504..107589 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: good, score: 65.000" gene 108181..110115 /note="Human zinc finger protein ZNF134" /gene="ZNF134" CDS join(108181..108220,108872..110115) /gene="ZNF134" /note="Kruppel-Type Zinc Finger protein ZNF134; Note: discrepancy between genomic sequence and ZNF134 mRNA at base 109105 (base 558 of cDNA) extends amino terminus of protein 78 residues relative to published sequence, and changes the sequence of the first 12 residues of the published protein" /codon_start=1 /product="ZNF134" /db_xref="PID:g2689444" /translation="MTLVTAGGAWTGPGCWHEVKDEESSSEQSISIAVSHVNTSKAGL PAQTALPCDICGPILKDILHLDEHQGTHHGLKLHTCGACGRQFWFSANLHQYQKCYSI EQPLRRDKSEASIVKNCTVSKEPHPSEKPFTCKEEQKNFQATLGGCQQKAIHSKRKTH RSTESGDAFHGEQMHYKCSECGKAFSRKDTLVQHQRIHSGEKPYECSECGKAFSRKAT LVQHQRIHTGERPYECSECGKTFSRKDNLTQHKRIHTGEMPYKCNECGKYFSHHSNLI VHQRVHNGARPYKCSDCGKVFRHKSTLVQHESIHTGENPYDCSDCGKSFGHKYTLIKH QRIHTESKPFECIECGKFFSRSSDYIAHQRVHTGERPFVCSKCGKDFIRTSHLVRHQR VHTGERPYECSECGKAYSLSSHLNRHQKVHTAGRL" misc_feature 109068..110112 /gene="ZNF134" /note="BLASTX similarity to P52741 (1..348); match: 1, score: 1.6e-259; database searched: nr; ZINC FINGER PROTEIN 134 pir||I38599 zinc finger protein ZNF134 - human >gi|488553 (U09412) zinc finger protein ZNF134 [Homosapiens]~~Other overlapping matches:~(109444..110002) DDS similarity to AA206670 zq51a09.r1 Stratagene neuroepithelium (#937231) Homo sapiens cDNA clone 645112 5' similar to TR:G488553 G488553 ZINC FINGER PROTEIN ZNF134. ; Score: 834 Identity: 511/543 (94%)." misc_feature 111366..111701 /note="DDS similarity to N88383 K3206F Fetal heart, Lambda ZAP Express Homo sapiens cDNA clone K3206 5' similar to REPETITIVE ELEMENT. Score: 624 Identity: 327/336 (97%)." misc_feature complement(112209..112510) /note="BLASTN similarity to Z30173 (1..302); match: 0.99, score: 1.0e-117; database searched: est; H. sapiens partial cDNA sequence, clone HEA96F" misc_feature 112209..113090 /note="DDS similarity to overlapping ESTs:~(112209..112538) Z30173|HHEA96F H. sapiens partial cDNA sequence; clone HEA96F; single read. Score: 615 Identity: 322/325 (99%).~~(112486..112908) R79262 yi84b04.r1 Homo sapiens cDNA clone 145903 5'. Score: 787 Identity: 418/426 (98%).~~(112365..112922) W27746 37a9 Human retina cDNA randomly primed sublibrary Homo sapiens cDNA. Score: 859 Identity: 517/555 (93%).~~(113090..112708) N35314 yy22c04.s1 Homo sapiens cDNA clone 271974 3'. Score: 752 Identity: 383/385 (99%).~~(113073..112663) R79163 yi84b04.s1 Homo sapiens cDNA clone 145903 3'. Score: 701 Identity: 405/424 (95%).~~(113090..112708) N35314 yy22c04.s1 Homo sapiens cDNA clone 271974 3'. Score: 752 Identity: 383/385 (99%).~" repeat_region 113659..113783 /rpt_family="Alu" repeat_region complement(113920..114200) /rpt_family="Alu" repeat_region 114760..115045 /rpt_family="Alu" repeat_region 115248..115488 /rpt_family="Alu" repeat_region complement(116471..116763) /rpt_family="Alu" BASE COUNT 33165 a 25596 c 27076 g 32196 t ORIGIN 1 gatctgcccg cctcagcctc ccaaagtgct gggattacag gcgtgagcca ccatgcccgg 61 ccaacaattt tataaataag aataaatttg aaaaccttta tgacttcaag acttatgaat 121 gtacggtaat tgagagagta aagtattagc taaaggacag acatagagat gaacagaaga 181 gaatagagaa cccggaaata cactcacaca acactctatt gtcaatctct tttcaacaaa 241 gttgcaaaag tagtaaaaca acttcagaaa acagtgtagc agctttaaag gtagcataaa 301 cttaccatat agcttaggaa tcttacctat agggatatcc ccaagagaac aaaaacatgt 361 gtcctagaaa tacttacact tgcatgctca gcagttttat ttataaaagc caaacactag 421 aagctgtttt tcacacacac acacacacat acacacatac acacactcaa aactaaaatg 481 tccatgaaca gaagaaggaa taatcaatta tggtatatcc ttaaaatacc attcaacaac 541 ataaaaaata aactatgaat acaagcaaaa acagcataga taaaaaaaag ccctgagctg 601 agtggaagaa gccagacaca aaagttgaca tacagtatga ttccatttat atgaaatgaa 661 gaagctgcaa aactagagta aaagaatgca gatcactggt tattgggcct gtggtctgag 721 ggtgaaggtt ctgactgaaa gagtacagga aaatgttatc aggcaatggt aattttctct 781 atcatgactg tgttggtggt ttcatgacag tacagctgac ccttaaataa cacaggtttg 841 aaatgtacag gtccaattat atgtgaattt tatttcaata aatatattgg aaagtttttt 901 gaatatttga gacaatttcc aaaaaccaga attcattgaa ataagtatgt caagaatgca 961 taatacaaaa tatatgtagg tactatttta tcatttacta ccataaaata tacataaatc 1021 tattatacaa ggttaaaatt agtcaagagt cacataaaca cagactgtac ttggtgcttc 1081 cagagaaatg taaacaaaca caagatgcag tatttttttt tttttttttt ttttgagatg 1141 gagtctctct ctgtcaccag gctggagtgc agtggcatga tcttggctca ccgcaacctc 1201 tgcctcccaa gttcaagtga ttctcctgtc tcagtctcct gagtagctgg aactacaggt 1261 gtgcaccacc acgctgagct aatttttgta ttttttagta gagacagggt ttcaccacgt 1321 tggtctgtgt tggaggtcaa aagaattagg gtcatgacca actcagtatg ccactggaga 1381 ctatatgagc aaacaacaaa ctagtctcat gaatgcagta tgttggcaag cctacaactg 1441 tgtctgcagc cagatggaat gctaagggca gtcacacccc aggtgcagtg ttccttgtgg 1501 ttatctacag gaacatctgg agtctgttgt ataaagaaag caattatgtg agcctgtgat 1561 aaatcaagca gctgaccaac tgttacgtct tcctctctgt ggattctacc taataaatac 1621 aaagggctgt agaagctcag ggcccttgtt ccctagaagc aaggagccct ctgacccctt 1681 ctttaaaaca gatctttttg tcttcatttc tgcatttgtc cttcttcatt cagtcccgaa 1741 ccgacagcca caagtggcgc ccgaacaggg acgtgagtga agaaggtctg ctggagcaga 1801 gaaagtgaaa ctgaccagaa gaatgagaaa ccccaggaca agtctgctgg cagtggatat 1861 aaggtcagtg ccctaaagag gtactgggag tgggaagttt ctgaatcagg gtaacatagg 1921 gcagaatttg tctgctgaag agcaacatta tatgcagtca cttaaaggtt tacttaaaca 1981 atctggtgct caggttagtt ctcaaacact agctaagctg ctgcaggagg ttatcacgca 2041 taacccatgg tttccacaga cagacactct tgatgtggaa aactggggtg aagatttgct 2101 tacggcatag gatatgagac ttacaaatta aactattggc aatccaggat ttaaaatgtt 2161 aaagaaaatg agatatcaga gcagacaagg cttaggaaag tccctacagg aaaaccctga 2221 tcctatatca ataactgggc aaacagatag aaaagggcta ggtcgtcaga tttctgatgt 2281 gggtcattga tatttctcct ccacccactg ctttgccact agagtggcta acaaacctgt 2341 atgggtggat caatggcccc tttcacagga gaaactaact caactccatc agctagtaaa 2401 agagcgattg gatgcaggac atattgaaga gtcagcccct ggaattcacc agtatttgta 2461 ataccaaaaa agtcaggaaa atggggactg ctacatgatt tgagagctat taatgcacag 2521 attaaaccaa tgggtgcatt acagcaaggt ctgccatccc tggcaggcat ttccaagaga 2581 ctggcctctt gtagtaatag atcttattgt tttttttact atactattac atcagcagga 2641 taggcctgga tatgccttct ctgtgccttc tgttaatcaa agggagcctg tctcttgtta 2701 tcaatggaaa gttttgcccc aaggcgtgct taacagtcct acattatatc agaattttgt 2761 aggacaggca ttaaaggagc cttgtaatgt gtttcccact ggctatgtca ttcattatat 2821 ggatgatatt cttttggccg ctcctacaga tcaaatctta catcagttat tcagagaaac 2881 aaaacaggct ttgactaaat ggaatctcaa aatagctcct gaaaaggtgc aaaccacctc 2941 cccataccag tacttaggca ctattattac tgaaagaagt gttcagcctc agaaagtagt 3001 catctgtagg gacagattac aaactttgaa tgatttccaa caattatcag gggacattaa 3061 ttggctgcac ccaatgctag gtattgctac ttattaactc aaacaccttt atcagaccct 3121 ccaaggagat tcttcattag actctcctcg gcaacttact aaggaggcag aagctgaatt 3181 atagcttgta gaacagatgc ttcagcaaca acatgccacc tggttacagc cacaaaagcc 3241 tttgcttctg tttattcttc ctacctccca ttctccaaca ggacttttag gccaattcat 3301 acacaaatct gtaatagtat tagaatggct tttttatcca atcagacagt gaaatctttg 3361 caaatttatc tttctttaat tactcaactt ataacaatag gtaggcatag atcaaaaatg 3421 tttatgggat gtgatccaga caaaattatt gttcccttgg attcccaaca acaggccaca 3481 gcatgggaaa tgtcgactgc atggcaaatc actctcacag attttgtagg aataatagat 3541 aaccattatc catcagacaa aattttgcaa ttttatgaag ttcacccttt tacccttcct 3601 gtaatcactc atcacaagcc tattccaggc agacagacct attttactga tgactcttct 3661 aaaggctgtg cagctattta tggacctaag catactgaaa caataaagac ctctggagtt 3721 tcagctcaat gctcagaatt agtggcagtt attcaagttt tacagctcac cgctttatct 3781 cctattaaca ttgtctgtga ttcagcctat attgtaaatg tagccagtca cattgagact 3841 gccactatta aaagcaccct agaaccagag ctgtataatt tgtttctaag acttcaacaa 3901 gctgttcact ctcatgctgc tccttttcat atttctcata tttgctatca cacacaactt 3961 cctggaccac tatctctagg taacaataaa gcagataaac taatcggttc tgtatttcaa 4021 caagcccaat cttctcatgc attcctgcat caaaacacct ctgcccttac tcgtatgttt 4081 catctgcctt gcagccaggc ttgagctatt gcgcaagcct accccacttg ccagcctgtc 4141 cctggcgttg cacccatgga aggatgtaac ccactaggct tggctccata tgaaatctgg 4201 cagatggatg ttacacatat agcagccttt ggcaaactca gctatgttca tgtgactaca 4261 gacacttact cccatatgct acatgtcacg tgccaaactg gggaaacagc tggtcatgtc 4321 tgatgacatt gtctgtcacc ttttgcccat atgggggtcc ctacacaatg aaaaactgac 4381 aatggacctg cttatgttag tcatgctttt caaaattttt tacagttatg ggcaatcact 4441 cataaaacag aaattcctta caatcctcaa ggacaaggaa tcatagagtg ggcacatcaa 4501 acattacaat gcatgttgaa aaaacaaaaa tggggactag gagaccagct accacctcaa 4561 acaaaattac atttagcctt atttacttta aattttttga ctcctggtat ggatattaag 4621 actctggcag aatgacattg gcaaatgtta gagggaaaaa ggaaagttta cccaaaggta 4681 ttacagaaat ccccggaaga aggacaatgg aaaggcccgg tagacttact gatatgggga 4741 cgagggtttg cttgtgtttt tacaggagat ggacaatccg tgtgggtgcc ctcaaggtgt 4801 gtgtgaccat ggaacgggag actggagaaa tccatggatc tcaactgtgg gcccagctcc 4861 tccagtacga gccatgagct agttgaatct gaatgcaaag acagaacaag gaccgactgg 4921 agtcacactg acatcaaccc ccataaaatg gggacagctc aagaaaacca cacaggaagc 4981 tgagaaactg ctggagcaaa acccctgact ccatgtttat ggccatgcta gctgtaatat 5041 cctgtgcggt atgttttccc tgtgcagagg caaaaactta ttgggcatat attcctaacc 5101 caccagtagt acgaccggta ctcgggagca acactcctcc tgagatatat catgatcagg 5161 gagcatggac accaggaccc ctaactcccc ctgacagaga gtgattagat tctcagaaca 5221 atggtatcaa ttatactgca ccattggagg gacttcttta tgtgtcaccc aggatgcatc 5281 actcaaccgc agttgccttg caattcagtc ccaagcatgc ttgagttacc atggaaaaat 5341 tatgtaccta ttaggcctta gctctattaa tattactggt gtagttacta atcactcctg 5401 gccccatcac ccaaattgta ttgattgtac agaatgggct ccctttgata attctcaccc 5461 ctgtccttgg actcagtgtc ttggcccctt agctaaacaa cagtccatgt taatgggaga 5521 cattatcgac tggggtcccc atggtcattt agatgggaga agtgagaatc agacctcatg 5581 gcataaattt tgctggcact ggtggcgaaa cttgaacatc tcttcgctac atcccaatct 5641 gctacacaac ttgcttggca cagaatgggc tttagcccac ctttgcctca atggcattat 5701 caaggaaaga gaggtccaat tcaggagtca atatggaaga cagcactccc atttatgaat 5761 ggcagcattt gggttgggac actatccaat aatagtaata atgctcaatg cagttttaat 5821 gttacctttg tagaaaatac taccacacag tttacaattt gtatttttaa tccgtatgtt 5881 tttctagcag caaaaaagga ccaactccgg gtaaacaatg cccaatcgat ttgtgatgcc 5941 tgtcaactgt atcattgcct taatcatagc acaatacaaa cacacagcat atccacccta 6001 ataattctag gtcacattcc tggattatag attcctgtaa atctgtccga gccttgggag 6061 gccacccccg ctttacattt tgtaaaactc cttcttattc agcttactca tcgtgctcgt 6121 agagctttag gcatgataat ttttgctata atttccctag tcacactaat aacctctgct 6181 gtgttgtcct cagtagcact gcacagctcc attcaaacag ctcaatgtgt gaaaaattgg 6241 atgtgcacag ccgaccagga atggatgctt caaaataaaa ttaacaccaa gatacaaaca 6301 gaagtggcaa tgttaaagac tactgttctg tggctaggat aacaaataca aagcctgaag 6361 ttgcagcagc aattgcattg tcattttaac cctattcata tttgtgtaac taatttggaa 6421 tataatcaaa gtgaatatcc aaggaacttt gtaaaggccc atttacaggg agttttacat 6481 tcaatgttac ttttgatatt aatgattcac aaagtaaaat ccttaacttg aataagcaga 6541 ctcaagtgct tcagccctct ttaaaagctt gggcagaatt ccagcaaggt ttagggagcc 6601 ttaacccttg gacctacttc aaacagcacc tcaatgtctt ttttgtgatt atcggaataa 6661 tgttattatg tttctgtttt ttgttcatag tctgtaaaat cagctggacc accaaccggc 6721 aattgagagc tgcacagcct gcaattacct ttattcaatt aatacaaaaa cagaaaaggg 6781 gggaggttgg aggccgaaag aatgagggtc gtgaccaact cagtatacca ctggaggcta 6841 tatgagcaaa cagcaaactg ttctcatgaa tgcaggatgc tggcaagctg acaactgcat 6901 ctgccaccag aaggaatgct gagggcagtc ataccccacg tgcagtgttc cttgtggtta 6961 tctataggaa catctggagg ctgttgtata aagaaagcaa ttatgtgagc ctgtgataaa 7021 tcaagcagct gaccaaccat tacctcttcc tccctgttga ttctacctaa taaatacaaa 7081 gggctgtaga agctcagggc ccttgttccc tagaagaaag gagccgcctg tctccttctt 7141 taaaacagat ctttttgtct ttgtcttcat ttctgcattt gttctctttt gttcagtccc 7201 aaaccgacag ccataggcca ggatggtctt gatcttttga cctcatgatc tgcccacctc 7261 ggcctcccaa agtgctggga gccaccacac ccagccaaag atgcagtatt aagtcataat 7321 cgtgcatgtg tgtatgtgtg tgtttgaata gacagagtct ccctctgtcg cccaggctgg 7381 agggcagtga catgatcata gattgctgca tactcaaact cctgggctca agtgatcctt 7441 ctgcgtcggc ctccccagca gctgagacca caagcacatg ccaccacact ggctaatttt 7501 aaaatgtttt ctgtacaggc tgggcatggt ggctcacacc tgtaatccca gcattttggg 7561 atggttggca ggtcccagaa gtttgagacc agcctgggaa atatagtaag accccatctc 7621 tacaaaaaaa caaagacaaa ttagctgggt atggtgccac gcacttgtgg ttccagctac 7681 tcggggagct gaagtaggag tatcacctga gcccaggagg ttgaggctgc agtgagctgt 7741 gattgtgcca ctgcactcca gcctgggtaa cagagtaaga ccctgtctca acaattttct 7801 gtagaggcag gggtctcgct atgttgccca ggctactctc caactcctgg cctcaagcaa 7861 tcttcccacc ttgagctccc aaagtgctgg gattacaggc ataagccatc atgctgggct 7921 gataaaatta actacaggac atactacagt accataataa ttttgtagac atctcctgtt 7981 gttcacagtg ttgtgagtat tcacttaaag aaccccatga tgttaatcat ctccacatga 8041 gcagttcatc tctccagtaa atcacatatt gcagtaaaaa gtgatctctc aaggttctaa 8101 tatattattc atagtgttta gtgcgatatt gtaaacttta agtaacgcta tatggcctat 8161 atgaagtgcc agtagtgatg ctggaagtgc tcccaagaag cagggaaaag tcatgacatt 8221 ataagaagca gctgaactgc ttgatatgta ctatagattg aggtttgcag ctgtagttgc 8281 ccaccatttc aagataaatt aatccagtat gaggatcatt gttaaaaaca aacaaaaaag 8341 gaaataaaga cattgctgca gctatgccag caggcacaaa aatcttgcat tttctgcaaa 8401 atatatcttt tatctcattt aaaaatgcag ctttcatgtg ggtgcagaat tgccatgggt 8461 aaggcataca gagagactca aatacgattt gagaaacagt cattatatga caactcaaag 8521 cattaggaag gtgaaggatc taaagctgga gaatttaatg gcagcaaggg aaagtttgat 8581 aattttagaa agatgtttgg ctttgaaaat gtcaagataa taggagaagc aacttctgct 8641 gactacaagg cagccaacaa tgtcccaggt gccattaaga gaaccactga caagaaatta 8701 tatcaacctg aacagatttg ttttttgttt tgttttgttt ttgagacaga gttttgctct 8761 tgttgcccag gctggagtgc aatggcacga tctcagctca ctgcaatctc cacctcccag 8821 gttcaagcga ttctcctgcc tcagcctccc aaagtagctg ggattacagg cacccgccac 8881 catgcccagc taattttttt gttttttttt agtagagatg gccatcttgg ccagggtggt 8941 cttgaactcc tgacctcagg tgatccaccc gcctcagcct ctcaaagtgc tgggattaca 9001 ggcatgagcc actgcacccg gcctgaacag gcttttaatg caaataaagt gccctgtttt 9061 gggaaaaact gccacaaagg acatttatta gtaagtaaga gaagaacata ccaagattta 9121 aaacagaaag ggataggcta actccagtgt tttgttcaaa tgtagttggt tttatgataa 9181 ggactgccct tatctataaa actgctaacc ccaaaccctt gcacaagctg tcagtctttt 9241 ggttgtacaa caagaaggcc tggacaatga gaaccatttt ttctgggttg ttcccattaa 9301 tgctttgttc ctgaagtcag gaagtgcctt gacagaaagt aattgccttt aaagttcttt 9361 tgacattgta caatgcccct ggccagccag aaccccatga gatcaacacc aaagttgtgg 9421 aagtggtcta cttgatccta aacaccctct aatttggcat aaagatgatg gagtcataag 9481 aatctttaaa ttaggatccc taatccccac attgttcaag ggtcaactgt atatattgac 9541 cagtattcat tgaactgtac attttaaact ggcaaatttt attgtgtgta aattacactt 9601 cattaaatct gcttaaagag aaaaaaataa tccatgagtc cacagacata aatgaatgaa 9661 tgaatgaagt aggaaagaaa aagttatact ttatagtaaa atgtcagtta aaatagatgg 9721 agatagaaaa tcatcaatga gagctaaact ggtgaagtaa aagattaatg aggaaaagga 9781 gttttatata tttctaaaga atctttcttc tggccgggtg cggtggctca cgcctgtaat 9841 cccagcactt tgggaggccg aggtgagcgg atcacgaggt caggagatgg agaccatcct 9901 ggctaacacg gtgaaacccc gtctctacta aaaatacaaa aaattatctg ggcgtggtga 9961 tgggcgcctg tagtcccagc tactcgggag gctgaggcag gagaatggcg tgaacccggg 10021 aggcggaact tgcagtgagc cgagatcgtg ccactgcact ccagccagcc tgggcgagag 10081 agagagactc cgtctcaaaa aaaaaaaaat tttcttctat ttatttgcat ttatatagaa 10141 aaaacagtag agagaatgta aacaacccag tcaggaggag actccaacac tgaaaaaaat 10201 tcatgagtac catatttaca tgttttaaaa tcatatttcg tcagaattat attttcacaa 10261 cgatagatat ggtaaaaatg attttgcact ttcttacagt accacaaaac attttaatgg 10321 agcaagagat ttcttttttt tttttttttt ttttgagaag gagttttgct cttgttgccc 10381 aggttggagc gcaatggccc tatctcggct cactgcaacc tccacctcac cggttcaggc 10441 aattctcctg tctcagcctc ccaagtagcc gggattatag gctcctgcca ccatgacagg 10501 ctaatttttt gtattttcag tagagacagg ctttcactat gttggccagg ctggtcttga 10561 actcctgacc tcaggtgatc cacccacctc ggccccccaa agtgttggga ttacagacat 10621 gagccaccgc acctggcgga gtgagagatt tctaaccata actgagattt tgcttattca 10681 tataacttaa tggtttatgt tacaagataa ataaaaatga ttagtatttg gttaaaaaat 10741 tttaggctgg acacggccta ttatctaaga cattagatat atatcttgca tacaaattca 10801 ttactagctg tgttatttgt agatatttta tgctaatcat ttgtagctcc tctttttaat 10861 gttaacaaga tattttgaag agcattgcat ttaattatga gaaaatctaa tttcttaatt 10921 ttcctgttac agtcttgttt ggtgttttta tgtaagaaat atttgcttca ctatgatcac 10981 aaagattttc ctccattatc tcctagaagt tgtatggttt ggccgggcgc agtggctcac 11041 gcctataatc ccagcacttt gggaggccga ggcgggcgga tcacgaggtt aggagatcaa 11101 gaccatcctg actaacatgg tgaaaccccg tctctactaa aaatacaaaa aatttagccg 11161 ggggtggtgg cgggcgcctg cagtcccagc tactcaggag gctgaggcag gagaatggtg 11221 tgaacccagg aggcggagcc tgcagtgagc cgagatcgcg ccactgcact ccagcctggg 11281 cgacagagca agactcagtc tcaaaaaaaa aaaaaagaag ttgtatggtt tttggtttta 11341 catttatgtc tatgaattat ttcaagttaa tttttagcat agaacttgct ggaacatggg 11401 ttaaatggtc aggaaaagtg gtaaattaag gataaaaatt gctacattct ttggaaaatg 11461 aaagcaaata gcaagatgat agtgttattc atagagatga aagtgactgt tagacttggt 11521 gataaaagct gagaaaactg ctcattttgg agcacattta aatatctatc acagaggcaa 11581 attggaaggt cagggaaaaa aattggaaaa atgaatctag aactcaggta acagtaagga 11641 tggttgggaa ggtttaaatt agtaaattgg aagatttcat agaaaactca tgaaggttaa 11701 gtgacctgag gatgcagaag tgaccactgg gctaaagtga agaacaaata tggagagaag 11761 tggcaggctt gcatgataca catgcgttca gtaaagcaaa cacatcacaa cccagtgagt 11821 attctcaata tcagcacatc tcttaaacta gcaccacaat caataatgaa aatccccttc 11881 attcaccaat gggttctaac tcctagaatt tatgatttct agtgctctgt attctaacat 11941 atattttgct catctttgca ttttgtactc ttttgcttaa ttctcagtgg catatcttca 12001 tttggagaga tgtcttttaa gtctagtggt ggcttcttgt catcctggtc ctactcctcc 12061 tcttcctcgt gttcctcttc ctccttctgt tctttctctt cctgctcctt gtccctttac 12121 atctttatct ttatttttac cttcatcttc atattctttt tcaccagaaa tccctcaggt 12181 tgaccaagat ggtggtatgt agctatatgt gccacatgtg gcctggagtg taggactggt 12241 aagatgtgac tcataacaat atggatagac gatgtaattc tgggtttaaa aaaaaaaacc 12301 tctgcatgac ctacacttca gaggccctta gagcttcctc agtaaatttc tgttcatttg 12361 tgtctctttg ggttggtttc ctacctgggg atgtgacaac atggatcaca ggcgttacag 12421 tttcaaggta catttccata gaagaggatg acaaaataat tgcctttaca tagaaaatga 12481 gtcataggag gtttctcttt ggaattctta attgtttacc aacagtttcc ctaaatgcac 12541 caagattcca ccctcacata tgtggcacta ggctacacga acttccagct atgacaggaa 12601 gcatttaagt ttctgctgta aaggaaagaa gagattcggg gcagaaagcg tcctgactct 12661 ggggcattac tgaactcccc tgctgaggaa ggaattttgg ggccaaacag caacaataag 12721 tctatcaagc gtatagcaaa aatattcctg atagtgtcac atgccccaca atatagagat 12781 atctaaggat aattaatatc aaggtctgaa gtgagataat tatggtgggc ctagaatatg 12841 tgaaaacaat gtcacaggaa tggcaagaac caaatgaaca gaaataatag agactgataa 12901 attatttcaa gtcctactct gagtaattac tcatgcatca caacaagaat gattacttca 12961 aactttctga taactatata aacacatgta aagaaggaag caatattttc cgtattatgg 13021 attctttgcg tagctttctg aaattcaaaa ctaaatattg caagctaact aaaagagatg 13081 gaatttcttc tagaatatca accaatactt caagaaagtt gtttaaaaat agcagaaagg 13141 tcaatataat aaaataagtc aatatcaata ttccactaaa tcatatttaa tgcagatatt 13201 catttatatt ttaaacagtc aatacgtaaa gtacctacct gattgttcac tttaaagctg 13261 gtgcaaatta aagtaatatg aaggataata atgctttgca gattactctt taaaaataaa 13321 tgaaagagat atttagagaa tatgtgaact tttagaaaaa cacaaataag aaaaaatgtg 13381 acaaagtttg aatgaagtaa gatttattca ttcgtttatg tatatattta cttctgaggc 13441 ggagtctcgc tctgtcgccc aggctggaat gcagtggcac ggtctcggct cactgcaagc 13501 tccgcctcct gggttcacac cattctcctg cctcagcctc ctgagtagct gggactacag 13561 gagcccgcga ccacgcccgg ctaaattttt ttgcgtttgt tttgttttgt ttttgttttt 13621 gttttttttt tgagacggcg tcttgctctg tcgcccaggc tggagtgcag tggcacgatt 13681 tctgctcact gcaagctctg cctcctgggt tcatgccatt ctcctgcctc agcctcccga 13741 gtagctggga ctacaggcgc caccaccacg cccggctaat tttttgtatt tttagtagag 13801 acggggtttc accattttgg tcaggctgtc tcgaactcct gacctcaggt gatccgcctg 13861 cctcggcctc ccaaagtgct gggattacag gcgtgagcca ccacgcccga cccgtaaaac 13921 tcttaaaata agaggatcat ggccgggtgc agcggctctt gcctgtaatc ccagcatttt 13981 gggaggctga ggtgggccga tcacctgagg tcgggagttc gagaccagcc tgaccagcat 14041 ggagaaatcc cgtctctact aaaaatacaa aattagctgg gcgtggtggc acatgcctgt 14101 aatcccagct actagggagg ctgaggcagg agaatcgctt gaacctggga ggtggaggtt 14161 gcagtgagcc gagatcgtgc cattgcactc cagcctgggc aacaagggcg aaactccatc 14221 tcaaaaaaaa aaaataataa aaaaataaaa ggatcagaaa taaataatag tggagacaaa 14281 cgattgtgat gccctacctt gttttaacct gattgtctct ctcagctgag agagccaaac 14341 agactccatt tttgtttctt cacttgcagc ccccttatcc ccctccctta aggacataac 14401 tagtgcaagc tgactccaag cacatccagg aatgcactta ctgatagaca ctgaggcacg 14461 ctgtaccagc agctcctagg aacgcactca gttaatggta cccaaagccc ctgcgtttat 14521 cactttgtga taattaagcc cctgcacctg gaactgttta ttttcctgta aatgtttgtg 14581 taaccattta tcttttaact ttttgcctgt tctgcttctg taaaaattgc ttcagctaaa 14641 ctccccctcc cctatttaga tcaaggtata aaaagaaatc tagccccttc ttcggggcca 14701 agaattttga gctctagctg tctctcggtc gctggcaata aaaggactcc agaattagtc 14761 tcagagtgtg gcatttctct ataactcgct cggttacaac actatagcca aagattacat 14821 tcataaagat atttttacaa tataggtgtc ttatttaata tgactccaat ctaaatggta 14881 atttttctaa cctttcttaa gtaattataa aatgtcaaaa ttataaacaa ttaaaaacta 14941 aagagaaatg aagacaatgg atagcaacaa aagctaattg ctaaatatgg gggaaaaaag 15001 ccaggcatgg tggctcatgc ttgtaatccc agcactttaa ggcaggcgaa tcacttgagg 15061 ccggttctag accagcctgg ccaacatggt gaaaccctgt ctctactaaa aatacaaaaa 15121 ttatccgggg gtggtggcgc atgcctgtag tcccggctac tcgggaggct gaggcaggag 15181 aatcgcttga acctgggagg tggaggttac agtgagcaga gattgcacca ctgtactcca 15241 gtctgggcga cagagtgaga ctctgtcaaa aaaaaaaaag gaaaaaaaga aaagaaaaag 15301 aagatataat gtttctgcaa tctcatttca gttccactta tgataataga tacattatgt 15361 aatttaagtc actaacactt gagggcaaaa aaacagaata aacacaaacg tgtgggataa 15421 ggggatgctg accccactgt tagtgatcac cttggttgga gacccaggtt ttatcataca 15481 gatgaagcct gcaggtagca agtttcagag aaatagattg taaatgtttc ttatcagact 15541 taaggtctat gctgatgtta ttgctgtagg ggtagaatga ggcatgtcca accccttctt 15601 ccataatggc ctgaactaga ttttcaggtt aactctataa tgctgagagg agggggtcca 15661 ttcagatagt tgggaggcct ttgaatttta tttttggtta tagtttctac attaaaaaaa 15721 agaaaaaaat taagggtcgc gtcaattctc aattgtcaac atttttaatg tattgtatgc 15781 aatatatgtc aacttttgtg ttgagaaatg aagatcaaat cgtattcttt tatcctgtgt 15841 atgtaatagg ctttgctgac cctagtttga tgggtttttt cctttgtcct ctctttcttg 15901 gattgagtcc tcacagcgcg gcggactgcg gcgtggtagg aactacacca cccagaatac 15961 tgtgcgccga gcgtgccggg gccttagacc aatcattgcc caggaaaggg gcacctccgt 16021 aagtgcgcaa tacgccagaa tcttccggcg tctttccggt ggtggtcgtt tttgctgcct 16081 gcgagcgcgt ccgcgggctg ggcgtttccg gctcgctggg tccgggccag gtaactggag 16141 ccggaaaccg gtggaggtgg tgtccgcccg cagaggagct tgcctggtct cggtctgagc 16201 gtcgcccagc gatttgccac cgcacgcacg ccggatcccg ggctttaccg cccgcctttc 16261 caggccccgc cccgcctaaa gtcccatggc cgaggcagcg ctagtgatta cgccgcaggt 16321 gagagcggag tcctcggatc ctcacctggg tcctgaggcc ccggactggg gtcacctgcg 16381 tgggtctctg cagttcaggc ctcttctcca ggacagcgag gagttctggg tggggtcctc 16441 agagctgacg ggcggtgagg tgaagaaggg agagcgttgc gtaccgtggg ggctgcacgc 16501 gcagagcctg ggaggtgcgc cggggtcgcg agcattcgga aacttggggg cggacctgtg 16561 tctgcaggga gcagagagaa tgaggagaag aaggcacggc tgccatggca gcctgagcaa 16621 ccggggtttt taattctgag tctttgggag ccatggaggg ttgtgaacta aaggagaaca 16681 cgatctgagt gagggtctga aaaaatattt caaaggcaat gagtggagaa aacagactgt 16741 tacggagtga tgggggtgga cgcagggata ctggtgagga gactgctgta gtaggctcca 16801 ggaggattat ggtggattct ggaccagacg ggttgctgaa gggacctaag taataagaat 16861 cttggtagat cttaaaggta aagccagcag gattttctgc tccatcattt gtgcctatga 16921 gcaaaagtgg gagtgaagta tgatcccaag tttttctttt acctgaacat ctggaagagt 16981 tgacaaagaa atctggtgag ggagcaggtt tcagggaaat agtaggagtt gaattttgga 17041 cgtggtgttt ctaaattgcc acagagtgaa gttgttacat gggcagttgg aatcataagt 17101 agaaaattca tgggagggga ctgactcagt gatgtgtaaa tggagacagg ctggggcgta 17161 gactcaagcc aagccttaaa tttagacggg aaatcttaag gttttacttt gagatgaaga 17221 gagggagggc tgaggctgga ggctgggaaa gtgtgggtcg gccttggcga agctgcctgc 17281 agataggagg gccatggggt tccgggggta cttccaaagg gaatgtgtaa ggaatacaca 17341 ctgcatagga tggatttgag ggtgggacac tgtgtagagt caccattcag aagacttggc 17401 gaactgttca ctctatgtgg agccagagaa aggtcagcca gactagcaga gaagcccaag 17461 cccagcaaga gcagctctcc caggaaaata caagtttaga ctgtatagat ttaacaaatt 17521 aaatttatgc aagcaattag ttaaccgaat aaaatgtcac tataactctg tgctctaata 17581 aaaaataaat gtttgttttt tgtcatgctt tcctggcaca gaattcctaa aatcgttgta 17641 atttcctcag tgaaaagtgt cttgtgataa gtcatggacc gcataacaac agtgttgacc 17701 acatcagtga cggtggtccc ataagattat gataccatat ttttactgta ccttttcatt 17761 ttctttcttt ctgatcagga ggcaaactgc attttctaag tttataaatg tttaaatgca 17821 caaataatta tcactgtgtt acagttgctg acagtattta gtatagtaac ttgctgtaca 17881 ggtttgtagc ctaggagcaa taggctatac catttagtag ttgaacaatg aatgtctgaa 17941 cagtagttcc cccttatcct cagaggatac ctttcaagaa ccccagcgga ggcttgaaac 18001 tgcagatagt acccaaccta tatatactat gtttcttcct agactacata cctgtaataa 18061 aatccagttt ataaatgagg cagtatagga gatgaacaat aataactgga aataatatag 18121 aacagttata ctttataagt taataaaagt taagtaactg cagttgagac aataaaaaag 18181 caaaagttaa attataaaaa acatgtgaat gtggtatctc tctaaatatc ttactgtact 18241 attcaccctt gtgatgaaga agggactaaa ctcgatagag tgagatttcc tcacactact 18301 tacaatggtg cactgttggc cagcgcagtg gctcacacct gtaatcccag cactttggga 18361 ggctgaggca ggcagatcac aaggtcagga gatcgagacc atcctggcta acatggtgaa 18421 accccgtctc tactaaaaat acaaaaaatt agccaggtgt ggtagtgggc gcccgtagtc 18481 ccagctactc cggaggctga ggcaggagaa tggtgtgaag ctgggaggca gagcttgcag 18541 tgagccgaga ttgcaccact acactccagc ctgggcgaca gagcgaggct ccgtctccaa 18601 aaataaataa ataaaataaa taaataaata aataaataaa ggatggtgca ctatttaaaa 18661 catgaattgt ttatttctgg aattttaata ttactatttt cagactgaag ttgaaattta 18721 atacttataa ttttcacact gaagttgagt gtgggtaact gaaactgcaa atagcaaaac 18781 agctgataag ggaagacttc tatacttgag ttttatgaga tggcttaggg tgagcagaca 18841 aggtggagtc tctagatagc ctcagagtgg agtcagtcac cagaaaaatc aagtgattat 18901 agaatttgaa ctttgagccc cacctctggc atccagggag gggagcagag atctggagat 18961 taggttatat aaaatcttga acaatgagtt ctagaagctt caaggttgtg aacacttgga 19021 agtgctggga gagtgacgta agagagcgtg gaagtactct gccttacccc ctctcccagt 19081 accttgccat atgcatcgct tccatttggc tgttcctgag ttacatccat tatcataaac 19141 tggtaaacat aagtaaaata ttttcctgag ttctgtaagt ggttctagca aattattgaa 19201 cgtaaggaga gaggtcatgg cagcccccaa atgtgtagcc aagtggagaa aattacagtg 19261 gtctggacat tggcttgcaa ttgccatatg aagtgagggc agttttgtac gactgagccc 19321 ttaaacctgt tggtttagct gcaactccag gtagttagtg ttagaatagg attgaattgt 19381 aggacaccta gttggtatca gaatcagaga atggtgtggg aaaagacact acatatttat 19441 tatcaaagaa gcacagaagc ctctcataat ttgaaaagat accacattca cataaagact 19501 taaatggcaa gtaagaaaca ccagagtttt aaaaaatgtt tacagaaatc ttaatgcaat 19561 agtcagtact agctaaatct tagcattgga cacgtatgaa tgagtttgat gcctacgatg 19621 gcactacgag atagccacca ttaagtgttt ccattttaca gatgaaggca acctcagaga 19681 ggttaggtct ttgtctagtg tcacacagtt ggcaagagca acttaccctt tgtaaatgcc 19741 accttgaaca cctgtatttc atagcctccc tcttcctaca gattcccatg gtaacagaag 19801 agtttgtgaa accatcacag gtaagtggaa gaagtcccag gtcttcactg gccctgttcc 19861 ctaaagccat gggtctggcc cacagatgca ggtaacaata atgtcctggg actctttttc 19921 cttcatcttg ggtccctggg gccacctcag gcccaggatg attaatcagg gtcaggagtt 19981 atacagagat gttcccattt ctgcatccaa taggatgaag tgatgaaagg tttcctgggc 20041 ataggcgcca gcatgtccaa atgcttagag atgagaatga tatgggcata tatagggaac 20101 tgcaagtctg tgcactgtgg ccaaaacttt tgcagtttga gctgtgacag aactaacatg 20161 agtttctttt tccttcttta caatctcaca gatacatgat ttgtcctttt cgtagatctt 20221 agcaaactca gagtacaata ttttttttct tgttaagaac tctcaccttt tcacttaaag 20281 gaagtacttc acaggttgtc ttttacatat ctgaatggct ggtattatcg ctcttgtgct 20341 ttgggggtat taagtaaaat aagcgttact tgaacacaag cactgtgata atgagacagt 20401 tgatatgata accaagatgg ccacatagtg attaacaggc aggtagtgta ttcatcatgg 20461 atctattgga caaacggatg attcacgtct tgggcagaac agcgtgggac agtgtaagat 20521 tttatcacac tactcagaga agcacgtcat ttataatttt tgggttgttt atttctggaa 20581 ttatccatgt aatattttca gaccacagtt aaccagtagt aagtgaaacc atggaaaaca 20641 aaattgtggc taagatcggg gacgactgta tatgtattca gtgctataca ttttattcta 20701 agcaccgctt cgactgcatc tcacaaattc taagttgtgt tttcatttca tttagttcaa 20761 aatattttta aatttatctt gagtatttct tctttgttcc atgcgttgtc ttgtttacat 20821 ctcccaggta ttttgggatt tgccagctct ctttccatta ctgctttcta gttgaattcc 20881 attgtgattg cagagcagac cttgtattat ttctagtaaa tttattaaga tgtgttttat 20941 aactgtgtat tgtaaactgg ggagtaaatg cccttcgtgg ggtctctagg atcattccaa 21001 cactgccgtg gaaatgaaac tctgagaagc ataaaggatt tattatattc ataggtcctc 21061 atagagggtg gcacagcaaa ccatgcatgg agccatatgg aaatagctct cagggagtgg 21121 actcaaccaa gcaggcaggg agccaagagg gagagcaagg acctgtgggc aggtgccttt 21181 cctactgtgt ttgaatgtca ttcagtcaca gtcaaaggat gaaaagaagg ggaggttgtg 21241 acagggacca gtcttagcat gctggggacc tggtcaccgg ggtgggttgc tcataatgta 21301 ttggggatgt tgaagcatca ggaaaatagg aagttttaaa gatatataaa acaggcccag 21361 aatgtggtct gtcttgttga atattccgtg tgagcttgag aacagcatgt attctgttct 21421 tgttggacaa agtaatctaa agatggtgat tatgtccagt tgattgatgg tgttgttgaa 21481 ttcagttaag tttagtggtt ttctgcctac tgtatctgtc agtaacagaa aagagtgtgg 21541 aagacttcaa ttatgatggt ggattgatct gtttctcctt gaagatctgt cagtttttgc 21601 ctcatatagt ttgatgctct gtggtcaggc atgtatgtgc taaggattct tgtatctttt 21661 tggaaactgc ctcaacctcc cgagtggctg ggattacagg cgcccgccac cacgcccaac 21721 taatttttgt atttttagta gagacagggt ttctgcacat tggccaggct gatctcgaac 21781 tcctgacctc aggtgatcca cccgccttgg cctcccaaag tgcctgtgat gttttcttga 21841 tagctacaca tgatgagctc agtaaaagga attgctgtaa ataggccttt aataatgtgg 21901 tggtgaggtg tgtcagcggg caggggaagc attttaagtc ccacagttag gtctcagtgt 21961 tttagtgagc ctgttcctct tgactgaaat tcacatgtgt ttctaaattg tgtttattat 22021 gctttttctc tcccattatg tggaagagga atactagagt gggttgtagt tggttatttc 22081 ccttccccca gatagattag gctctgataa tacctctgca ggttaggctc tgattaactg 22141 ttgtctccca agggcaggct ttgtttagag gaacacattg ctatggcata ttttaaaatg 22201 attcattttc ccctctctgc tggaagcaca aggggatttt tttttttcca gtattcacca 22261 tgagaacttg gttaagtttc tggaggtaaa tattgcaaga ttgtgggggc cccctatcag 22321 agtgcctcct gaattttcag ctcacagaat tgtccacaca gaccttccag caatttatgc 22381 attatagatt gggttttcct aacctagcac tggtttgcat gatggtttcc tttcatgagt 22441 ttctgctcct gtgagccttg attctcctgt atgccctcgt ttgtctctcc aatcttgtgg 22501 gcagcagtgt gtgctgtgcc cttatctttc ctgtgtggat cctagaattg tccatttttc 22561 agtctgttta actctttact tgctattgtg atagagtggc cacttccaag ctccttacat 22621 gcagaataga aaccaaagtc caggagttag gccttcacaa tgtggagtct catgttggga 22681 cttcttggtg gtggtgttag gtgcacagca agagacagac acgtggagtt cagggaaggg 22741 atccaagtga aagctatccg tcagccagtg gctggcctgt ggaccatggc agatttaaga 22801 ggaacactct tcaggtcata gtggcagtcc catcagtgag gaaggatgta gagagtattc 22861 tctaatgact gggtctcttg catggtcacc tagatagaag tgggaaccat cactaagaca 22921 gataaagaag agtagtaacc atgggtgtgt gggggtagga tgaaacgggt gtccagctgt 22981 tggcacagtc tttaaatggg gaatgttaag agattcattc agaggcaaag cagtaatgga 23041 ggtcaaataa caccaaggtc agggctggtg cttttcctcc cttttgctgg ctcagacttt 23101 tgttggtaac tcttggtagg gcctaagtag tttcagacgg tgtgacagat taggttaaaa 23161 gcaaatgaga ttgactggga gagtcattca gcctgtggtg ggagcagctg ggggccaagc 23221 tttgactgag tataggtgga gagaggaaag atatgactgg tgtggggaca gcacaggcaa 23281 cactaaggag gaagtggact ttggcagcct gagctcacac aaggtctgag tggctgaaat 23341 tggaacaaag agccaggcca catggggacg tcttcataac agtcctgaga ctgcctctga 23401 cttgggtcta acaaagatgt gaggtggtgc tggatccttg ttgcatttag caagcgctgc 23461 ctggatccct tgagcagagt ggcatgtgga gaagccaggt gaagtacctg tgtcatggat 23521 gctgagggca gtggagttca aggtgtagca gtcacgatct ggagatgtgt tcccatgaat 23581 ggtgtgcaac cggggaggga gtatggctgt actcaggatc ctcgtactgg cactttaaac 23641 caagatggta agcacaggtt tcagtatctg ggtctccagg gaggtgaact agcaggagca 23701 tggatgagat tccatgatgt aaggcccaga gcccagtggg agacaccata aggacctggt 23761 atctcctctg agttggagaa ctagggagga gcgagattta ctggggttcc tggggattga 23821 gtctgctcat gatttcatct gcccccatca tgcagggcca tgtgaccttt gaggatattg 23881 ctgtgtactt ctcccaggag gagtggggcc tccttgatga agctcaaagg tgcctgtatc 23941 atgatgtgat gctggagaac ttttcgctta tggcctcagt aggtaagatc tcttaaaacc 24001 acaccatcaa cctctgctgg actctgctct tccccttttt ccaggagttt cctgtctttc 24061 ccagagttgg acatgggccc ttcttctttc cttggttctt gacctgtgga agacataggc 24121 gttgtggttg ccaggcctgg gttttttctg ccccaacatc tactgtcctg aagcctttta 24181 gttaagagtt tgggttcaga ctcttgtaac ttgcccaacc gacaccactt ggttttgcca 24241 ctccttgcta aggtgactgt tcagttctgt ggtggacctg tgttacaggt tctccccctg 24301 cctttagtag acgtgtcttt ggacctgcct ttctcaggaa ttacaggtac ttacatggtt 24361 gctatttgta gggaacccta gactatgaca gcaagacccc ctcccaaggg tttttttgta 24421 ataaatttcc ctcctgtagt ctgtcatggg ttggcgcacc ctgggccagt gttgttgacc 24481 cacctccctg tttttccctt aggatttctt ctaccttcca ggacccagat agtctcctga 24541 cttaagctgg ggacagggga gaggcctgtg ttcctgaaag cgtagtgaag tcttcagcca 24601 cagcaagggg ggttaacagg gtttgggacc tgtgagtagg agcagtggga gggcatgatt 24661 tgagggctga actcaaatca taccaggcaa atgtcctatg ttccccacta ttttggtgcc 24721 agggctcatg gccatacctc atttttctct tttctctgcc tgcttgacaa tttgcctact 24781 tgtcgtttct actgtgattg tcccctacct ctgttctcca catctctcct gttcctgttc 24841 tgcagtgttc tccaacatta gcctatacct aatgtattta atattatgag ctgtgcagat 24901 acagatgttc ctcagctcgt gatggggtta tatcctgata aacccattgt aagttgagaa 24961 tatcgtaagt cagaaatgca tttaatacac ccaacttatt agacatccta gcttagcaac 25021 acagtagagt gtcagttgtt tagcctggtg attgcatggc ttactgagag ctgcagcttg 25081 ctaccactgc atcacaagag tatacacact gcgtatcact agcctgggaa aagttcaaaa 25141 ttcaaagtat ggtttctact gaatgtgtca ctttcgtatc atcataaagt ggaaaaatca 25201 taagttcaac cattataaat caaagactgt ctacttaagc tctcaaacat cagctatcca 25261 gttgtaattg cattttcctg gggtgacaca cattctatgc aggtctcccc tgtctccatc 25321 agatgacgtc ctattgaaaa ccattgacag agaagggagt tgtatgcccc ctgcctctct 25381 tcccaccata gtaccccatt ccttgttctc ttgctggtta tgggtcttcc cctacatggt 25441 aaccttaaga ggcctaatac tgcaagtatt atcttccttg acctgactcc agctcccttg 25501 tcctgaaaac aaacccatgg cctgggtgca atggctcgtg cctataatcc tagcacttgg 25561 ggaagctgag gtgggaggat cacttgagct aaggaatttg agaccagcct cgacaacata 25621 atgagacctg tatctccaca agaaattcta aaaaaaaatt agccaggtat ggtggtatat 25681 gcctgtagtc ctagctactt gggaggctga gatgggagtg tcactggagc ttggaaggtc 25741 aaggctgcag tgagatatga ctgcaactgc actcccatct gggtgacaga gcaagaccct 25801 atctcaaaga aaaaccatgg actcgtccct ccctgtgttc cacaggtatt tgtaatggag 25861 ctgttccctt ctaccaaagt ctgcatatac ttcacttgca tttttatgct tttaggttgt 25921 ttgcatggaa tagaggctga ggaggcccct tctgagcaga ctctttctgc gcaaggagtg 25981 tcacaggcca ggactccaaa gctaggtcct tccatcccaa atgctcattc ttgtgagatg 26041 tgtatcctgg tcatgaaaga cattttgtac ctcagtgagc atcaggggac acttccctgg 26101 cagaaacctt atacgtctgt ggccagtggg aaatggtttt catttggttc taacctgcaa 26161 cagcaccaga accaggacag tggagagaaa cacatcagaa aggaggagag cagtgccttg 26221 cttctgaata gctgcaaaat tcctctgtca gacaatcttt tcccatgcaa agatgttgag 26281 aaggattttc caaccatcct gggccttctc caacaccaga ccacccacag cagacaagag 26341 tatgcacata gaagcaggga gacctttcaa caaagacgtt acaaatgtga gcaagttttc 26401 aatgagaaag ttcatgttac tgagcatcag agagtccaca ctggagaaaa agcttataag 26461 cgtagggaat atgggaaatc cttgaactct aaatacttat ttgttgaaca ccagagaacc 26521 cataatgcag aaaagcctta tgtgtgcaat atatgtggga aatcattcct ccataaacaa 26581 acactcgttg ggcaccagca gagaattcac actagagaaa ggtcttatgt gtgcatcgaa 26641 tgtgggaaat ccttgagctc caaatactca cttgtggaac accagagaac ccataatgga 26701 gaaaagcctt atgtgtgcaa tgtatgtggg aaatcattcc gccacaaaca aacatttgtt 26761 ggccatcagc agagaatcca cactggagag aggccttatg tgtgtatgga atgtgggaaa 26821 tcttttattc attcctatga ccgcattcga caccagagag ttcacactgg agaaggggct 26881 tatcagtgca gtgaatgtgg gaaatccttc atatacaaac agtcacttct tgatcaccat 26941 agaatccaca cgggagaaag gccttatgag tgcaaagaat gtgggaaggc cttcattcac 27001 aaaaaaagac ttcttgagca ccagagaatt catactggag aaaagcctta tgtgtgcatc 27061 atatgtggga aatcatttat ccgctcgtct gactacatgc gacaccagag aattcacact 27121 ggagaaaggg cttatgaatg cagtgactgt gggaaagcct tcatctccaa acaaacactt 27181 cttaagcatc acaaaatcca cactagagaa aggccttatg aatgcagtga atgtggaaaa 27241 ggcttctacc ttgaggttaa acttcttcag caccaaagaa tccatactag agaacaactt 27301 tgtgagtgca atgaatgtgg aaaagtcttc agccaccaaa aaagacttct tgagcaccag 27361 aaagttcaca ctggcgaaaa gccctgtgag tgcagtgaat gtgggaaatg ctttagacac 27421 cgcaccagcc tcattcaaca ccagaaagtt cacagtggag agaggcctta taactgcact 27481 gcatgtgaga aggcctttat ctataaaaac aaacttgttg agcatcagcg aatccacacc 27541 ggagaaaagc cgtatgaatg tggtaaatgt gggaaagcct tcaacaaaag atattccctt 27601 gtcaggcacc agaaggtaca tataacagaa gagccctagc aattgttggg atgtgtaatt 27661 gtcttattca ctgtaggaca ccagagagct gatttttcaa gggatccaac agacagaaat 27721 tcaccctcat acatctgcat atcactagtt gaaagattca ctacaaggtc caagtacttg 27781 ggaagctttc tagagattac ttgtactttc taatctgccc agtgttacaa cagacactac 27841 catgtggcat atcctcaccg ttttcatcag tcactcacat gtgctcaagg aatgcagacc 27901 actgtgctta ccaaattctg aagggaataa tataataaaa gcctgttgag ggtcctcttc 27961 catcttgctg aactgttcga ggcaaagaca aaacccaact ttgaccaaga agagcaatct 28021 ggattcttgt agcccatgga ggtttacctt cttcataggt tttttttttt tttatgtgat 28081 tgccatagtg ggatattccc tctcccacat tacccaagag ggaaatgggc tgtcttacca 28141 tgtgctgatg tatgtgctct gccagtgtga agggtgaaca gctccaactt tatgtccccc 28201 cagggacata gtatgagatc ggaaaatagt tcttagtcct gtttgtctta atttttattg 28261 gtttccaagg tctcctagga aggagagcag agctgtgtgg tcctaatcat ggtgtcgatg 28321 ccagatttat ttgtaggtcc agaggtctcc ctgagaagag gcatttgtga caaactcaag 28381 tgctgggacc tgcatgaatt gtttatttgc agtgatgctt catatttccc agcactgttc 28441 atggtatctt atttcccagg tccttcattg tagccatggg cactgaagga ctgaattggt 28501 aggtaggtca ctggttgtaa gacctactgg ggccaggtaa gaacagtaat tggcccagac 28561 aggtaggtgt gataaacaga gtagaaaaga ttgtggccaa gaagagtgaa tgtgtcccat 28621 ccaaaggtgc atccagatat gtctgtgtta ctatggcttc atggtttggg ggtatataga 28681 aattctcagg acctcagacc tttgattctc ctttagcaac acacatcgtc agcataaggc 28741 atctaaagat tttgtggact cataacttat gtactgtttt tgagatgatt gctgcacggg 28801 caggaaagca ggatttgaga gaacatacac cttgccttcc agatatgtgg tttttgtgat 28861 tcacccagat ctgttgtgcc agttaggaat tatctttatt acttcccgtt tattgccact 28921 gctgagaagg ctgtccttcc taacttgtga ggagatgaat ccttaataat aagtaaatct 28981 tgtgttatcc tatagtctgg ttttccaaaa ggagaggtag atgcatattt ttgtgtggaa 29041 ccatttatct taaaacattg aggtgcagtt ttcatacaca aaaccttcta tctttttaag 29101 aactgtctgt atggtttatt gcccagttaa tgggttttga tagtcatgtt atctcagtaa 29161 gtgtcttgtc ttagaaatat gtgtgtatag gctgggtgcg gtggctcatg cctgtaatcc 29221 cagcactttg ggaggccaag gcaggtggat cacctgaggt caggagtttg agaccagcct 29281 gaccaacatg gtgaaatcca gtctctactt aaaatacaaa aattagccag gcgtggtggc 29341 gtgcatttgt aatcccagct acacaggagg ctgaggcagg agaattgctt gaacctgggg 29401 ggcggaggtt gcaatgagcc aaggtcgcgc cattgcactc cagcctgggc gacagagtga 29461 cacttctcaa aaaacaaaaa caaaaacaaa aagaacagat aagaaatacg tgtgtatata 29521 ttctattttg tgtgttttca ttcaccttat ttcattcagt cttccaaatg aaatgttctt 29581 aattttgatg aagtctaatt tgtctgtttg ttcatttact gttattttgg tgctattaaa 29641 aaaattttat tccttggctg ggcacggtgg ctcacgccca taatcccagc actttgggaa 29701 gccaaggtgg gccggatcac ctgaggtcag gagtttgaga ccagcctggc caacatggca 29761 aaaccccatt ctactaaaaa tacaaaaatt agctgggcat ggtggtgggc gcctgtagtc 29821 ctagctagct acttgggagg cctaggtggg aggactgctt gaacccagga ggcggaggct 29881 gcagtgagcc aagattgcgc cactacactc cagcctgggc gacagagcag gactcccatc 29941 tcgaaaaaaa aaaattgtgt aattatagga actttggaaa gattgttatt caactgatat 30001 tacataaggt atttattttg taaattattt gcaatattag aatttctgtc agcctacagt 30061 atttctctgc tgttattctg accttaacac attgaaagag ttcttaattt tgcttcgcca 30121 tcccttacgc atacttaatt tttgttattt tgccaaattc caattatctt tacaagatca 30181 ttttattttt taaaagagct aaccagacaa gaaaatttgc ccttgttatt cccacttcct 30241 atatagcttg tctgccaaaa cctttatgcc attattctca tcagtctcat tctcaggggt 30301 ttctctgagc aaatccatat aaaagtggaa ctatatgaca tttcctgccc cgttctcatg 30361 cttcatcttt aataggtaaa tttaccattt taaatagttt taaaattatc ttcactgtgt 30421 gtactttgga tttaatcctt actaatgatt aagatgctct aagaatcaat gtaaaactaa 30481 ttttaaagcc agcagcataa ttttgtgagt tatgcctgag tattcaaaaa tcctaacaag 30541 tcaaaaatat taattaaaat caatatgttt aatctttcca attaaatact tccattccat 30601 aaacttcaga accaaagtta gataccaaca agagactgaa gataaataca gtgtcaatag 30661 tatcaaggga ctagcccata taatatactt gaaaatcgta ttaatcacca ataaagtacc 30721 ccaccataaa caaaatacac aataaaaagt caagatacaa ataaagacag gccaacatat 30781 gagtagacca ttgacagaag aggaaaaaca aaaggcaatg aaaggatatg aactaatgtt 30841 caaatgtatt tcaatatgag aaataccaga tatattctat atcactttaa acttctaaag 30901 taaaaactgg aaagctgggt aattctaaat attgtctggg acacagtgat acagaaacac 30961 ttttgtccat cttatgggag aggaagggca atgtataaca taattccagt tacatactag 31021 atcaccatat gacctagcca ttcagctgca aggtacattt ctcatgtatg gccataagtt 31081 tatttagaac ataatactga tgaccttatt tttaaggttg aatcaagctg gtggtgatat 31141 atttcatagc aggaaaatgg aggaaataca tgtccataat cagaggattt gaatcaagga 31201 tgaaagcttc atgatgtaaa atattacatg gcagttaaaa acaattggct agaggtaagc 31261 ccatgaaata taggatagaa atataatatt ttgctgtagt ttgagtgaaa aagaaaaaaa 31321 aagaattgtg aggtctacaa taaaacataa tttaaaaata atttaatacc acaaaaactc 31381 caaagaaaaa aaaccaaaac aacccagaca cttcttacta gaacatatgt tccatgcttc 31441 tatctcggct tctggtagcc ccgtgtgttc cttggcttga agatgttctt cctgtgtctt 31501 cacatcgtct tccctctgta catctgcctc tgtgaccaaa tttcccattt ttctaaggac 31561 acagtcatag tcaattagga actaacctaa tggattaact aacatgatta cctgaggttc 31621 tgagggtaac gactccaaca tcttttttgg agacacaaac acttaacatt gccctatgaa 31681 tttctgcagt tcttcaaagg gcagagtgaa taataaccac agcaatttca agacacacag 31741 gcttccattg tgagatggct tgtttaaaga aactattaat tccctacact cttctgatca 31801 ctctgttgtt cacattcatt ttaggatttc tcttccaaat cacttctggt gctttccttc 31861 ttgcttcttt catttccttc ggtgttgtag ttctattatg tttttattcc gtcagcaatt 31921 tcttaaattt ctcttttcaa ccttaagcag tttttctgaa attataaaat ttactgtgaa 31981 catttcaaac atttccacat gtgcatgaag ttaaagtgaa ctctgacatt cccctctacc 32041 ccatcccttc cctccctcta cacccaagga gcaaatacca tttaactggg tgaggatcct 32101 tccagacaca ttttttgtct gcttacaaat attaagattt ctaattcgtc tttgcatgac 32161 cgacttgccg agttaattta ccgtccacat ttcctctgtg acatttactg attgaagtgc 32221 accaactgtg aatggtcata actagtccca cccccacaat catttctcac agattaagca 32281 gcataagagc atagtgcttt ttatctaatt tgttcattaa atgtaacaaa gttacaaaaa 32341 catgaacatg aaaacgtttt acctctgccg ggcacggtgg ctcacgcctg taatcgcagc 32401 actttgggag gcagaggcgg gtggactaca tgaggtcaag agttcaagac cagcctggcc 32461 aacatggtga aaccccgtct ctactaaaaa tacaaaaaaa ctagccgggc atggtggtgg 32521 gcgcctgtaa tcccagctac tcgggaggct gaggcaggag aatcccttga actcgggagg 32581 cagaagttgc agtgagctga gatggcgcca ctgcactcca gcctgggcaa caagagtgaa 32641 actctgtctc aaaaacaaaa caaaaaacat gttttacctc aatcataaac tgtggaaaac 32701 atacagaagt tcacaacttt tactgtaagt tcaaaacttc tactgactta aaaacgggtg 32761 ctaaacaagt ttaagtcaat atattgcacc tgttctcatg tgtttttcct atattagaca 32821 gtaagttcct tccctgctca agggcaggac ccatagccca gccatccact gcatcccact 32881 cttctaagta tgaccacaac agtgagtggc catggagcta aatgacaggt gtctcatgag 32941 attcttgcct atcacagcag caagttcata gtagatcctt gtaccaaaca cttcagtgaa 33001 gactgtatgg aaagagctgg aagctaagaa aaacctaggg tgtcctttat cccatggaaa 33061 acactcctgc tttgggacaa actggggcct aggtcctcag gcctggaaag aaccctttga 33121 tgacagccaa tatgtttcta gaggggtttc tccaactctc ttgaacccag ggccttaggt 33181 acaagtacag ccacagctac aatacatgca gccttccaga gagaggatgc aggagggtgg 33241 gggtcctcaa gagtcaactc tccttcccac aggagaggaa actgtgcact ctgtcctgaa 33301 aaaacaaaaa cagagaactc ttaactcacc aagaattcta tttataatat gtgcaaaagg 33361 accacagtaa aaggtacatg cccactatga aaacatgaat atatatgcag tgctttaaaa 33421 ggttgtgagg aaagctcaaa gtgagaaaat gataattttc tatttttcac acgttttaac 33481 ctaacattat ctcatgtagc ttgaaaacta aaagaaaatc tcacaaaagg aatttcaaat 33541 taccagattc aggctaagta cagtggctca cacctgtaat cccagctttt tggcaggctg 33601 aggtgggtgg atcacctgag gtcaggagtt cgagaccagc ctgggcaaca tggcaaaacc 33661 ctgtctctac taaaaataca aaaattagct gggcatggta gtacatgcct gtaatcccag 33721 cttcttggga ggctgaggca ggagaatcgc ttgaacccgg gaggcagagg ttgcaatgag 33781 ccaagatcgc actcctgcac tccagcctag gcagcggagt gagactccat ctcaaaacac 33841 aacaaaacta attaccatat tcagtgctag caccaaggga ctaaggtcat ggtggttaac 33901 tatgaaactc acgtaaacct gcccacgtat atcctgattc acctgttaaa atgctaggca 33961 ctttattttg tagcaaaact tcacgcaaat gtttataaga tcagacaatg agattcctct 34021 ccatgttgtt ttcttgggtt tactgggttt accatggatt catcttagtt gtgtgtttta 34081 ttttgcctgt tttagctata attcaatccc tagaaaaatg ctgaggggga cctaaaaata 34141 cccctctaca caggattaaa gaatctgata aacaccagaa agaattagga ctcatctgag 34201 aaggccaagc aagtacaaag tgatcactat gttgccagag aaaataaaga caacaaacag 34261 acagaatgga atcaggaaaa tgtgagagtg ctgagcattt gaaaccatat taggctaggt 34321 gtggtggctc acacctgtaa tcccaacact ttaggaggct aagcaggatg atgacttgag 34381 tctagcagtt tgagacaaat ctggggaaca cagcaagacc tcatctatac aaaaaaattt 34441 aaggctgggt gcagtgactc acacctgtaa tcccagaact ttgggaggcc aaggcaggtg 34501 gaccacttga agccaggagt tccagaccag cccagccaac atggtaaaac cttgtctcta 34561 ctaaaattac aaacattagc caggcgtagt ggtgtatgcc tgtatcccaa gctactcaga 34621 aacttgaggc aagaggacca tttgagccca ggagttcgag gttacaggga actatgatca 34681 tgccactgca ctctagcctg ggtgacagag caagtccctg tctcttaaaa aataatgagt 34741 ttattaataa aaatcaaagc cttaaacagt cttaggtatt gcacatttta atgtggtgca 34801 cattttaatg tggccatctg tacatgggtt cttctcaccc tagatataaa aaagaagtgt 34861 cacatgaaga tgttctcagt attctaattt ctactaaatt caagaatctc atgacacaag 34921 ccttcatatt tctcacaaag aattactctc ctcatggata tcctcccaga gatgtctgga 34981 tcaaaggcaa cttccaagaa atgcctgaag agctgccctc actgtctaac tctgacaaag 35041 gatggcagtg aatgtatgtt aaaaaaaaaa aaaaggaata aaaccccttt acctctcaaa 35101 cattccccta ccaaaaaata aaactgatct tacagggaag tctgtaagga cataaagcaa 35161 gaagatacct gtaactgaag gttttcataa gttcatagag aaaatcccaa ggtgggtcaa 35221 attccagcat ggctctttat aggacaagca agtgacaacc tggtcacaaa gttttctcac 35281 atcaataaca attatgtagt aacttttcat tatgattctc agatagtaag atttctgctc 35341 tggccaaaga ctgtccttaa tagttatatt tggaatgtct ttctccagca tgagttctct 35401 gatgttgatg gaatgccttc cacatttccc cctggtatga atccttctgc aatgagtgag 35461 aactgagtgg tgactgaagg gatccttaca ttcactgtat tcacagggct tctctccaaa 35521 gtgactgctc atgtgttagt taagaaagag ctgacaaaaa aaatgaaaag tttcctgaca 35581 ttctttgcat tcgaaggact tcttcctaca gtgtgggtcc cattatgaat gataaaataa 35641 cagtgggtaa aggccttgct gtattcattg cagcaatagg gttctctcat gtatggaccc 35701 tttggtgctg cagaaggtgt gacctgcgtt tgaaggcctt cccacactcg atacacttgt 35761 agggcatctc cccagtgtgg atgacagagt gctgaatgag gtacgtgctc cggtgaaagg 35821 ctttcccaca ctgggtgcat tcatagggct tctccccagt gtgaatccgc tggtgctgta 35881 tgagttctga gctgcatctg aaggcctttc cacatgccat gcaatcatag ggcttctctc 35941 cagtgtggat gatgtaatgt tgaattaggt gggctctatt gctaaaggct ttctcacatt 36001 ctttgcactc aaagggcttt tctccagtgt gggtcctatt atggcgaata aaagtagacc 36061 ggtgggtgaa ggcctttcga cattgactac actcataggg tttctctcca gtgtggattg 36121 gatggtgctg catgaggtac gacctgcgtt tgaaggcctt cccacactca aggcacttgt 36181 atggtttctc cccggtgtgg atgaggtagt gccgaatgag agttgagctc tggctaaagg 36241 ctttcccaca tgcattgcat tcatagggtt tcattccagt gtgaaccctc tgatgtcgaa 36301 cgagatacca cttcctgtta aaacctttcc cacactgctt gcatttgtag gggtttgtcc 36361 ctgcatgaat caaggcgtct ttccctggtt gctgtgagtc acattcatgg agaacatctg 36421 ctgataacca ctcttgtaat gccctggagt gcagaccatc atctgtcccc aaaccttcac 36481 cttcaaggct cactttccca agatgggtct ccttgtggag gtctgtctcc ggtgtcactt 36541 tacctttctg catttccaaa agcccttcct catccctagc tctccccaac ctcgaatcac 36601 ttgaggttga agattccaga gtcaggcttc cccggagaaa ggcccgctca gataagactg 36661 gcgggtaagt ggttggctct ctggtatgaa cttgtgctct gtcacctgaa gggcatcaac 36721 aaacaaagga ggggtctttt agtgataaaa tatagaaaaa cacaaaataa ccacttcagt 36781 aggaaaatga agatgaaaat agtgcctatg atacttgctg catttttaca tctctaaccc 36841 aggttaccaa atactgctta cggtccttgt atccttgaca ccattaaagg gaaaatctgg 36901 agggcaggct ggaccggggg acagaggatc cggagtaaag acatttctct ttacagttca 36961 gagtaaggaa gagataatag agttttgatg acattgtctg aagggcattt gctttaagaa 37021 ttgaacgctt acatccatct gacaagggat taataaccag aatatataag gagcttaaac 37081 cactctatag gaaaaaatct aagaatctga tttaaaaatg ggcaaaagat ctgaatagac 37141 acttctcaaa agaatacata caaatggcca acaggtatat gaaaaactgc tcaatataat 37201 tgaccatcaa agaaatgcaa atcaaaacta caataaggta tcatctcatc ctagtcaaaa 37261 tggcttttgt ccaaaagaca ggcaataaca aatggtggca aggatgtgga aaaaagggaa 37321 ccctcataca ctgttggtgg gaatttcaat tagtacaact gccatggaga acaatctgga 37381 ggttcctcaa aaaactaaaa atagagccac catatgatcc agcaatccca gtggaggtat 37441 atacccaaaa gaaaaaaaac atcagtacat caaagagata tctgcactac catgtttact 37501 gcagtactag gcacaatagc caagatttgg aaacaattta agggtccatc aacaatgaat 37561 ggacaaagaa aatgtggtgt gtatacacaa tggggtacta tttggccata aaaaaggaga 37621 tcctatcatt tgcaacaaca gggatggaac taaaaatcaa aacaactgaa ctcatggaga 37681 gagagtagta agatggttac tggaaactgg gaacggcagt aggatggagt aggatcctac 37741 tccatggagt agtgaggatg gttaatgggt accaaaatga agttagaaaa gaatgaataa 37801 gacctagtat ttgatagcac aaccgggtgg ctatagtcaa aaataattta ttctatactt 37861 tgaaataact aagggtataa cgggattgtt tgtaacacaa agaaaggata aatgcttgag 37921 gtgacagata cttcctttac cttgatgtga ttattatgca ttgtatgcct atattaaaat 37981 atcttatgta cccacataaa ttaaaaatta gccaggcgtg gtggtgcatg cctgtaatcc 38041 cagctactcg ggaggctgag gcaggagaat cgcttgaacc cgggaggcag aggttgcagt 38101 gagccgagat tgcaccattg cactctagcc tgggcaacaa gagcaaaact ctgtctcaaa 38161 gaagaaaaag aaaaataaaa ataaatgaag ccttttggtg ctataagagt ttagggaagt 38221 tgcacctaat gagactggaa agaaaaagag atcatggaag gctttctaga ggaggtgtcc 38281 cataagctga gtcctaaaag agaagccaga gtcattgcca gagagaggaa gtgggcaatg 38341 ggttgaaggt gaggaagtaa aaatgggtag attcaccctt tgggcaccag atgcctggtg 38401 aaaccaatat aaaagggtga ttagagttgg agcaagaaaa tgaagattcc agcacaaaag 38461 ggatggaagt acacacagga acaaatggta caaaaggcct agtgtagtaa gatctgacaa 38521 gtttctgctg tgtcaggagg cacaatgcat ggcagacaag agggtactgg agacagaatg 38581 actcagttca aatcctgcct gtgtcaagtg ggttgatact ggaggatgac tttctccacc 38641 aggtctccaa ttcctcaact aagagactag gagactcaat ttctcaggct gtcatgaggt 38701 taaatgagaa aatgcacagt tcttgatgca gcgcaagtgt tcattaaatt gtgggacatt 38761 acccttatta ttttcattgc tgcgctgggc aatttgggta atgccagtga cactagcaag 38821 aacaacttca gtgctgtgat ctgaggaagc tagcagctgt gcttcatgaa gtggccagca 38881 tcgcatgctt agaagacccg agtgcagcag cagtctcata cctggaatgg tggggatggg 38941 aaatggtgcg gctactctgg aaaaatattt gggggtttga cttaaaactg agcacagatc 39001 tgtctcttga ctccaccagt catctcctgg gtgtatatgc aacagaaatg agtgtttata 39061 tccactaaat attatggaaa atagtattca tagcagcatt cttcataatc tccaaacaag 39121 gtaaacaata caaatgtcaa ccaataggag agtgaagaaa cagatcctga tgttttcaca 39181 taattttatg gtacacagca tataggataa acccagctca tggacacaat gatggtgaaa 39241 agaagaaata cacaaaagag aaactgcttt ggggctccaa ttatgcccat tcacaaccaa 39301 ttaaaaacta gtccctaggg agagagtcta aacagtggct cccactgcag gaggagctat 39361 tgacaaagac agggtaacag aaactgccga ggtgctggaa atattgtaaa tcttgaccta 39421 gatgaagaat tcaatttgca gaaatgcaaa aatttaagct gagcccttca aatttgtact 39481 tattattgta tacataagat tttttttagt ttgagaaatt ttaaacaaaa aaatattaaa 39541 gtcaagtgta aacatctctt ccaaacatga gacaactcca agaagcttgg ctttacaagt 39601 ggggtcaaca caagttacag ggtattgtag gtgaaaaagc attttgtctg ttttcgccaa 39661 gtgtaagcaa tttgagtaca cttttttttg tttgtttaaa cttatattca ctttctggtt 39721 ttgtattgtg cacacttttt atttgaatat agataagttg aaggtacaga caaagcagag 39781 atttttttta aaaaaagcaa gttccctgag aacacagcag gtatgatgca ggcactggca 39841 aagacactga ccatgactag aggtaagatc tcctccatag aggacaagag aagccaagta 39901 tgagaccgtg cagatgaatc tttaagaaca ttaaatacac atcctatttt taacatgaaa 39961 ttaattatat actgaaacat cagctgctgg ggaagtggtg aaaatctgaa acacatctga 40021 gtggctgcag gagccggtgc agggacaaga gctgcccaga ggaatggggc agatgtgggg 40081 agacctgtca ggagcctctg gtatggccca ggcagaagat tatgagagcc cggactaggg 40141 tgttggtgaa ggagccaagg gaagcagtgg aagtgatgga cttaggagat atggatcatt 40201 taaaatgagc aggactaggg atgaatggat gctgggtttg atgaaggtag gtggagtcca 40261 ggagctatgt ggtgtgtttc cagccttcta acctgcttgg atggtaagac attctctgaa 40321 ctgagtgtct ttttaggcat caggagctaa aggtgcctct gagacactta agtggagatg 40381 tgggattgag ggaatgggaa gggaggtatt tggaacacag ggttgcagat ggggctggcc 40441 aagaagccca aggggagagc ccatgactca gtggtgatac aaatccacaa ggcaaatggc 40501 tccagctaca ctcatcagca cagatgtagg gcagagaagg ccaaatgtca ccttggtgtt 40561 tctcagggca agaggggcag gaagatatgg agcaaagggc tgcttccgag tctctgatgg 40621 tctaactgga ggatggatga gtggacggag agggtggaaa agagtgggat tcacagcata 40681 atgcagacag taatatcaag gggaggaaag tcccctttct cttgagtcca gaaagaaatg 40741 tatgacatta catttatgat gattaaaact gttcatgcat aatactaact agctgacatt 40801 tgttgatcct atggaccagg aactgaacta agcagtgtca tgtgttatct ctacatcctc 40861 taacaactca atgacacaga aattatatca ctctgatttt gttgagtaga ggagtctgag 40921 gtgatgtaaa ccaaacccat taattctttt ccatgggcac atagccaaat cacatttgtc 40981 agtcctgagc atctaggtgt ggccaaatgc ctagttcttg tcaagactat gtggttgtca 41041 agtatgtgta caactgggta tgacttctcc ccttcttctt gttccccttc taccagctgg 41101 attgagccag agcaaccata aagattcctg ttcacagcaa aaggtacctg tgtctctgaa 41161 taattgtgtg gctcagtgct cacctgatga tggacactat gacccaaagt gagtcagaaa 41221 caaccaggcc cccggaaagc cacaatgagc agtttcacac agaaattcag aggcagtggt 41281 gccctcacat tatctgaaga ctgccatgac tccacatgac cagctggtgg ctgcttacct 41341 gcacaggtag catgtgagag gcctctcttc actatccaca gctcctgccc atgctctagc 41401 aggtggacca actctggttt gggaacccga tgccctgttg atagcaaaag aaatagaata 41461 ggggtaattt aatgagaaaa tgaaagtctc tccatcttgc ctaggacttc atgacagggt 41521 ccagacatga atgcagctgc tcccaaagta agcaaaatgc tgatgtgagc cctgaaaatc 41581 aagggcattg ctctgagtga gctgtgttcc ccataaaaat catatgttga tgccctaaca 41641 cccaacgtga ctgtatatgg agacaggatc tttagaaggt aattaaaatt aaatgaggcc 41701 acaagggtgg agccttaatc tgataaaatt atggcctcat aaaaacagag agagatctct 41761 cctctgtccc tccaccttgt aagaacacag caagtcagcc atctgcaagc caggaagaga 41821 ggtctcaata ggaatcaaat ttgccgacac cttgatcttg gacttctcag cctccagaag 41881 cataagaaaa taaactctcg ttgtttaaac cacccaatcc atgacatttt attatggcag 41941 ccaaggagat aaaagacagg cacaaatccc actgttgggc tacaactgcc atggagggta 42001 gtgctatctg tctacacaac acccactgcc catgacctcc gtctatggaa gccaacgttg 42061 ttcaggcaca caacactttg acagccatgg gacacggcaa aggtgtgctg cttcctccag 42121 aaggcatttt ctttctttct ttctttcttt ttttgaaaca gagtctcact ctgttgccca 42181 ggctggagtg cagtgctcgg ctcactgcaa cctctgcctc ccaggttcaa gggattctcg 42241 tgcctcagcc acccgagtag cagggattac aggcatgcaa aaccatgcct ggctagtttt 42301 tgtattttta atagagactg ggattcacaa tattggtcag gctggtctcg agtgcctggc 42361 ctcaagtgat cctcctgcct cagtctccca aggtgctact gcacccggcc ccaggaggca 42421 ttttctaaca aatctaaatc attgcttctc aaacacctcg tgctttcaga accacttggg 42481 gatgcgtgat tccaacctca aaccaaatga aacaaaattt cttcgtgtga gacctgggca 42541 tcagtacttc caaaatgtgc cccccaaaat tctggtgccc agatggggct gcaaaccact 42601 ggctgaggct ggacatggtg gctcatgctg taatcccagc actataggag gccgaggcag 42661 gaggatcact tgaggccagg agttctaaac cagcctggcc aacatggtga aaccccgtct 42721 ctaccaaaaa tataaaaaat tagccaggta tggtggtgtg cacctatggt cctagctact 42781 caaggggatg aggttgggag gattgcttga gcctaggaag cagaggctgc agtgagctga 42841 gatcatggca ctgcacttca gcctgaatga cagagcaaga ctccatctca aaaaaaaagg 42901 aaaagaaaaa ccagtggttg aaataaattg gtaatgcaag agtcttcatt aggcaggtga 42961 ccaaagctgg attctcattg tgaaaagaaa gcatttattc caaggtggga ggatggattg 43021 aacatggggg cctgctgccc cagtcaaggc tgcagcaccc agcatcccta tgaggagcca 43081 gcctgaggac aatcagaagc ctgaaccaag aagaaggaac agagaaccaa gacatccttg 43141 atggcgtcac tgggctgcgg caccgaacaa gcctggagcc aaccacacct caggcttctc 43201 atcaggcgag ttagtgaatt tccttccagg tgaagccagt ttgatctggt tttctgttat 43261 atgcagccaa atgctttcca ctagatgtgg tgagtaaaag acaggaggat tctgtataga 43321 agaccaatga ctgaggtaca aggatccaca gaaacatcga ctggcagatt ctgaaccata 43381 gcccaggatt ccttcagaca tgtcacagta aggtccacca atgtgtgcta catgcctggc 43441 tttgagttct gggacagggc tgtgggaaag gtgaatcaga tgatcctttg gctctcccaa 43501 gagctgagct ccctgtcact tttcacacgc tggttcctcc acctggaaca ctgttaccct 43561 cttccacctc cttctatcca gctaactcct actcagcttt caggtctcag ctccactgtc 43621 acttcctgag atgccctctt tggaactccc acaataggtt atgacatcct gcttatttat 43681 tcctaaattc ccctcatttt tctttctaga acttatttca gttgtcatct atggattgat 43741 gtccaccttc ccaactgaac tcagagtgtg ccatcaattc taataacttc tttatcctga 43801 cccacttttg ttaaatattg tattctaaaa gggaaccttg ttttgctctg gtaagtgaga 43861 aagtcaattt cccttcctac aaatagaagt ctgcaatcac aacacaacaa aatcaaaaca 43921 atgccactaa gtcccaggaa acccttgtat gtttcttagg gctaagttgg aggttgtccc 43981 ctctgttaaa aaaaagaaat gaactgcact gggggagtga aaggaaaatg aaagactcag 44041 cgctgatctg agacttttct ttttttttga gccggagtct tgctcagtca cccaggctag 44101 aatgcactgg cgcaatgttg gctcactgca acctccgcct cccaggttca agcgattctc 44161 ctgcttcaaa ctcctgagta gctgggatta caggcgcgca ccactgcgcc cagctaattt 44221 ttgtattttt agtacagacg gggtttcacc atcttggcca ggttggtatt gaactcctga 44281 cctcgtgatc cacccgcctc ggcctcccaa agtgctggga ttacaggcgt gagccaccac 44341 acccagctga tgagactttt atgacaagaa ttagcgtgct ggacagagaa agtaaaagga 44401 atggctttat cagggtatca aaccatgcca tttgaggaca ttcctgtgta ccctctaaaa 44461 acacctcctg gatcaacaga gcatggatat caagcatcag aaaacaacta atgaccatgc 44521 aggcacccag atgccttttt acagagattg tattgcttag gaagatatcc gaggggcact 44581 gaggacaaat gtcttctggg gggctgcctg actcacccct ttctctaaga acagcccatc 44641 acccatggcc cctgcttaat agggctgtgg ccacagctga ccccaccagg atggccaatc 44701 agcttccttc ttatgcaatc ttgattaagg acctcagaat gggctgagca ctggagtgag 44761 ggtttgtgtt aaaaggacat tttctacccc gacaatctct gatgcagaca aagggagacc 44821 agtcaggaag aggtgggaac agaaaggtct gtcaagagaa aggtgggcag ttccagtgac 44881 ctttttgcct ggccaagaat cagggttagg gtcctcctca gggatgagag gtgagtcatt 44941 acctagtgaa accagaagcc cacaggtctc cagcatcacc tctcggtaca gggtcctctg 45001 ggccaggtcc agctgtctcc actcctcccg ggtaaaggtc acagccacat ccttgaaggt 45061 caccaacatc tggaaagtca aacagaagta atgctattgg ccttgtgttg tcgcatagga 45121 tcagaacctg tcactcagtc tctagtcaat gtgcagagag atccaggtct gaatcaacta 45181 ggtctccttt gggctgactt ccgtgcaggg ctcaaagggg aggcagatac tgccttttga 45241 caatgcaatt gtgcctgaga gaaaagcctt ctatggggaa gctgttgaag atgacacaaa 45301 caatggcaat cccacgagga ctatgggaag cagagagtaa gtactctggc atattaaaag 45361 aatggaggaa aatgactgat actaatccaa ccacaataac agcaacagca gcagctaagt 45421 gtgggggaaa gaaaaataga tcagaccgtt actgtgtcta tgtagaaaaa ggaagacata 45481 agaaactcca ttttgatctg tactaagaaa aattcttctg ccttgagatg ctgttaatct 45541 gtaaccctag ccccaaccct gtgctcacag aaacatgtgc tgtattgact caaggtttaa 45601 tggagttagg gctgtgcagg atgtgccttg gtaaacatgt gtttgcaggc agtatgcttg 45661 gtaaaagtca tcgccattct cccgtctcgg gtacccaggg acacaatgca ctgcggaagg 45721 ccacagggac ctctgcccga gaaagcctgg gtattgtcca aggtttcccc ccactgagac 45781 agcctgagat gtggcctcgt gggaagggaa agaccttacc ttccctcagc ccgacaccca 45841 taaagggtct gtgctgacga ggattcatga aagaggaagg cctctttgca gttgagataa 45901 gaggaaggca tctgtctcct gcttgtccct gggaatggaa tgtcttggtg taaaacccga 45961 gcgtacattc tatttactga gataggagaa agccgcctta tggctggagg taagacatgc 46021 tggcggcaat actgctcttt actgcactga gatgtttggg taaagtcaaa cataaatctg 46081 gcctacgtgc acatcgaggc acagcacctt tccttaaact tatttatgac acagagtcct 46141 ttgctcacat gttttcctgc tgaccctctc cccactcatt accctatagt cctgccacat 46201 ccccctcatg gagatggtag agatagtgat caataaatac tgagagaact cagagactgg 46261 tgctggtgtg ggtcctccat atgctgagtg ccagtcccct gggcccactt ttcttcctct 46321 atactttgtc tctgtgtctt atttcttttc tcagtctctc atctccacct tgcaagaaat 46381 acccacaggt gtagaggggc aggcaccctt cactaagtcc tctggtcctc actgtgtccc 46441 tgccaggcta aggcttcctt agccgtggta aagcaaataa aggcctgact taaatcaggc 46501 ctcctgaatc tgacacttgc cagctatgtg actatgacga tgtcacatga tctctctaag 46561 gctcgatttc cgaatcaaca caataactat tcccatctca gcaggcagct ctaagaacta 46621 aacagaaaat gttcacttag gcaatcagtg gagtgcctag cacatgtaag tgcctgatgt 46681 gtttgggaac tttattaatc ccctgacagg cccagaggta aatgtcgtta tcacttccat 46741 ttcacagtca gcaaaggctc agggagcttt agaaatcagg cacagcctca catttaaaaa 46801 gcaaaagcag catttgagca ccgctaaaac acctcaccaa gctgccagag agcaggcagg 46861 tgggagtgag gggtgctttt agctacagcc agcactgcca ggatgggtgt ggaactgcag 46921 ccccgcagag acctgagaac aatgtctcag acaaagggaa gtgcaagggc aaaggctgaa 46981 aatgggaaag agcttggcaa gctcaggatt ccctggatga atgcagtgag aaaggggaca 47041 atggaagagg agaaagaaga tgcggtcaaa agggcaaacc aagtacagcc ttgcagacca 47101 tgctggtagg agggggtgga tttattggaa atgtgatgag agtccctata agagaggcct 47161 attcaatcat caaccattta ttaagtacaa taaatgtact taagcctggc tgttctgggt 47221 cttgggagag ggaagcaaca ggcactcaca ttccagggtt ggaaaaagat cctaaacagg 47281 caactaaata aatgagtaag gcaatgtcag atggccttaa agattcagat gacaagaaaa 47341 cagagcaagg agaccaaaga gactctggtc agctttaaac ggtgcaacag acgagacctc 47401 tttaagcagc ggacaccaga gggagtcaga aataaagcaa aggagggaga ggtggatgag 47461 aaattaacaa ctacgacagg agctgacagt aatctgggct atcatgaaag taatcagagt 47521 gactgctaga gagtaaccct gggcctcatg tgttatctcc aaagggacac tacatcctgc 47581 ataccatcgc cactgaacat tttaatggac tcgtttttca cctgtaacta aatgtcttaa 47641 acaggaaaat atactgttag ttgcaaccca gcgtcccttg ccctaaactg aaagacaaat 47701 tcctcagcct gccctaccac cttttctgcc ctggcatagc agcaaccgca gaaacgctcc 47761 tcccacctct ttcagccgct gcccttcgtg ttgtagagac aggaggccaa ggaggccaga 47821 gggtcagact cctgctgggg ttgagtctcc aaacctagca caggcccagg gagcagctcc 47881 ggaaacaaca tcctgagcga cgtccccgga ccgtgctgtg cactgagacg gacacacaca 47941 catcacccag ggagccgttc aagcttcctc aaccccgggg gggggaagag taaatatttc 48001 tactttgcgc gcgcaaaaat gaaggacgag gacagtaacg ggacttgttc aaggtcgcag 48061 gcccgagaaa ggacatggtt ggaagagcca cgaggaggac tgcggtcgct cctctcacct 48121 gtgaggggtt ccgggaccct aaaagcccta tttcatacca caacgccgcg caccccagaa 48181 acggggacgc tccccacgcc accgcgggcc tgcattccgg acccgggacc gcgcgacccc 48241 cttctaggcc tctaagaagc aacggcaggg actgcactga ggacgaccct gaaacgccgt 48301 ccttccgctc gtcgggcccg ggacactgag gccgggtggc cgcggggagc cgagccagtc 48361 tgctcacctg cgctgcgtcc ttcgtctccg ccatccgacc gttggcagga cgaggccgcg 48421 tggggcagct cccacgggct caaaacccta ccccgggtcc tacaacggtg cggaggtgag 48481 ccccagtcgc cctaccatcc ttcccagcac agttctgaga gcgcggacca acacgcccca 48541 ccacaaacgt ccacgccagc gaggccggaa gtcccgcccc tgtgtcgcgg tcgcctgcca 48601 cgccttttcc tgcgccggcc cactgccttc taggtaatgt agtcttgtcc cgtaagaggc 48661 caggcttggc tctttttcct tttttattat gcacaccatg tttctgtata tctcagatga 48721 tcttagcgga gcaagtgtac acactggatt gtaaattttt aaaacttcct attattcgga 48781 tgcctcaacg tccctacaag caggctgtga gcactcgccc aggcacccag gtaaacgagt 48841 ttgataaagg gggccctgcc acgaggttcc ccgcttgcct cctctgagtg acccagtgac 48901 attcccatgt accagtaaaa tcttacgcct ggttactgcg tcctcctcca ccctgacgcc 48961 aggcactgtc cctgggactg gcgtggatct ggacccccgc ttcccgcgtg gagccattcc 49021 cgggacatgc tcgtggggtt ccgtctgtgg gacgtctgcc tcctctgtgt ggactggtgt 49081 tgcccagggc tcaggaaggt ctccaggggc cgtgagtccc cgcccgcttc cggagtccag 49141 cttcgtgaca ccatcgtgag aaagaatgca gggatgagtc aaaatgaagc caaaggcaag 49201 aagcttttat tgggaagaaa aagtacacat ttttgagaag agtgtacttt tcccaagaga 49261 agagggaaat gcggacgtac gagggagagc gggtcatgca caatggagtt tgggtttcta 49321 attttatggg ctcttctaat tagggggtgg gataatcatg aggtttttct aggaaaaggt 49381 gggaatttct tagaattggg gtgccaacca tttttatact aaatgtgggc gtgcttggat 49441 tggcctggca ctggtgggtg tgtgatttaa tgtggcaatg agtgtataat taggtctggg 49501 gtaggaaatg ggtcaaatct agcgtcatgg gacagcctag ccccaatcct gtttgttaag 49561 gtctcatcag cccagtcttc ttgttggagt agctaatttt aacagcttga tgttttcccc 49621 cttctcctgt gaccacacag cattcctatt ttgtgggtgt ttctttaatt agagagtgga 49681 ataattatta ggtattctgg aaaaggaggg aaattcaggg acccacctgg ttactgcccc 49741 ctttctctct tatttggctt tgcccagaag agtcatggac atgtcatcct gactgggatt 49801 ttggccattc tttctcttat tttgggtttt ctgttatcct atggtttctt tgcctagttc 49861 ttgtttttat ttgttgttcg aatttttcca tcctcctgca accacccagt gctattccta 49921 tatcactgtg agtataaaca ccctttaacc ttataggctt cttgaagtat aattcccaag 49981 accatgttgc agtggtcctt aaagaaccaa taaggcagag agaaacaaac atgttccaaa 50041 ttttgttctc aggagtaaac cttactcaat tgttaaaggc tgtagctagc tcaaaagaag 50101 tttccttgac tctgaaaaac aaaacaagga tcagcaatgt tccaagcaaa agtcaaaaag 50161 attatttcag tttttctatt agttcagcct attctgttaa ctcttgttct gcttgatatt 50221 tataaacatt ttagcttttc atgagtcctg tatgtttttc agttatcaga aacctgcatt 50281 taggaatacc tgttaaagta ttcctacatc tgattatgaa ccatattttg aagaggatta 50341 aaacaagaca acaattgtct gtaaatgaca aaatgtccag ggtggttagt caaaaacatg 50401 attgacaaaa aaaatttggt tatctccgtg gtttacaata acaacataac aaccttaatt 50461 gtggttgata gtgtatactt tcacattaga gttttctaaa tcccatgcag ttttggaaca 50521 catattaata gtatttgctg aaatataacc tgaagaagat tagacatcat tttggcaatc 50581 ccatgtaaca aaacatgtca aataatcctg cttgcctctt ttctggatgc cccagggacc 50641 ctctgtagca tccaaaacgt aggtgtcaag aaagacaatt gatttgtgga agcctgttaa 50701 atatgttaac aggcttaaaa tatttgatgt tatgtactag aattccagat tactataagt 50761 tatttatttt gccaaaatga tgactcaaaa atttgaaaaa gcaaaaacct ttcattagcc 50821 tttactatta catgaaaatt gtgttcaaga gagagaaagc caaatttcac ccttacatta 50881 gtgtactatt aatgtcaacc acaattttta atgaaacaat tataggcaat tctatccaat 50941 cttaaccagt ttgacaatga ggtgagattt tcacaaacct tttataaccc tttacaaatt 51001 ttgctaaaga gtagattcat gcattaagag ttctttgttg tgcttttatt tcaatgctca 51061 atttacagaa aaccacataa tacccttttg aatgtagtca atatattcac acagagtttc 51121 ctttgcaaga ttaattctta caattttttc cccagtttgc ttaaaccttc agttgtattt 51181 tatctacttt aagacaattc tttatcccta ggcaaaacgt acattgccat gccttcttat 51241 aattttttac aaaaaacaca ttttactgtt tttacacaca ttgcatgtaa atctattcag 51301 tagtctcaat tacatgtcat aatggtaact cttagcaatt ttttttttaa gatggaatct 51361 tgctctgtca cccaagctgg agtgcagtag cacagtcttg gttcaccgca accaccacct 51421 cccaggttca agcgattctc ctgcctcagc ctcccaagta gctgggacta caggcacatg 51481 ccaccatgcc aggctgattt tttgtatttt tagtagagat ggggtttcac cgcattagcc 51541 aggatggtct caatctcctg acctcgtgat ctgtccgcct cagcttccca aagtgctagg 51601 attacatgtg tgagctacca cgcctggccc tacctcttag caatttttaa ctttagtgta 51661 aaacctggta agttgtttta attatttgct gggtgcagat aaagtttaac tccttccaga 51721 ataagttagg ggcttggtta cttccatatg ttcccagact ttacccattg tgaagcaggc 51781 aagttgaaca gttcttaaag gccaaaggag cagtacaacc ttaaaacatt tagcaaacct 51841 accatctgac ctgcataatt tagaccacat atttacatct tgagtacatt tgtattttac 51901 caataatccc taagactgtt ttatttttaa agattaaagt cacatgaact aagagaaatt 51961 acagttttta ctattctttc aaaaaagatt tgatcggcca ggcgcagtgg ctcacacctg 52021 taatcccaac actttgggag gctgaggtgg gtggatcata aggtcaggat attgagacca 52081 tcttggccaa catggtgaaa ccccatctct actaaaatac aaaaaattag ccaggcatgg 52141 tggcacgtgc ctgtaatccc agctacttgg gaggctgagg cagagcaatt gcttaaacct 52201 gggaggcgga ggttgcagtg agccaagatc gcactactgc actccagcct ggtgacagag 52261 caagactcag tctgggaaaa aaagaatttt atttgatcta agtgcttata ggccaatcaa 52321 ttacagctct tttttataga catcacacaa cacatatatg actacacaga cagaagaaga 52381 tccagcagct cagggtggag ccctttaaga ataaggctac aaaagcatgc agtttctggg 52441 gcctaataaa caggcatagc tcgaaggcca aaacagattt tgagagggat ttatccatct 52501 ctaattcttg gggttccatg aggaaaacag atttccccca aattgaatct gtggcacctt 52561 gtctgttttc tcaaggagtc caaggccacc agaagtcatc cttgagcctt tcatgcatgc 52621 accaagattg gcaagacaga gtggagaaaa gtaattcagt tgactgagat acaaaccttt 52681 tccagaaaaa caagatctaa gaagagaaaa acataaaggc gtttcaagga cacctataac 52741 ttgggtatcc acttttaatt aagctgagtg ctctttcata aaattcttct ttactaatta 52801 aaactttaca gataatataa acaatgatcc ttatcttttt ttttttttta ctggtttgca 52861 tcactctttg ttcacaatca tgttcaggtt ctccagttta cttttgggga aagtggctgg 52921 attcaggcaa ggacaggttt ttaactggac tgtagatccc tctaacagca agccctgata 52981 tttgaagagg caattgtctg ttagccagag acttccctta gaggatagca gtcctgctat 53041 attatgtggg gtctaaacag ttattttccg gctgggtgcg gtggctcatg cctgtaatcc 53101 cagcactttg ggaggctgag gtgggtggat caaaaggtca agagatcaag actatcctgg 53161 tcaacatagt gaaaccccat ctctactaaa aatacaaaag ttagcagggc atggtggtgc 53221 acacctgtag tgccggctac tcaggaggct gaggcagaag aattgcttga acccaggagg 53281 cagaggttgc aatgagccga gatcacgcca ttgcactcca gcctggtgac agaacgagac 53341 tctgtctcaa acaaacaaaa aacaaacaaa cagttatttc ccattattaa cttggtggcc 53401 tctggcacga gcaaagccac caatgcaact tcttggagga aggctggcca tcctttaacc 53461 accaagtcaa gttccttgct taggtaaccc actggctgtt gagctggacc tcgagccttt 53521 gttaaaattc cctgggccgt ttcctttctt tctgatacat agagattaaa tgtgttccct 53581 atgagaagac tgagggctgg tgttttaagt aatgcttgtt ttaactggtt aaggcttttg 53641 agttttaaga ttccaagtat tagttgcttg agtttctttt atgaggtggt ataaagggtg 53701 aggtatttca ccatatccag gtacccacag tctgcaaaat tcagtaattc ctaagaatcc 53761 ccttaactgt taaaggaaat gggcttaatc ctttccttac ctggtgctct tgtcccttct 53821 gataagacta gacctaggta ctttactgaa gtctgacaga gctgagcttt agattttgaa 53881 gtcgtatatc ccctttctgc caagaaattg aagagagcct cagtgctttc ctgttacttc 53941 cttggttgga gcacagagga gactgtcatt tatatacttc aaagtttaaa cttgagggtg 54001 agaaaaatta gaaaaacatt tggacagggt ctgtttcagc aacttccttc tgatatcggg 54061 gggtgcctga gtaataaacc tgctctttat tgaattggga gacagggagg tgtgtttcac 54121 taaggcctct ctcagccttt ccataacggc tgagggattt tcatctgatt tttgatttaa 54181 catggattgt ttagagtaat taagaggttt ggttttgttt tttgctagcc ttttaatatg 54241 cacatcagaa agtgtattcc actcatctat gcgatgacta gtctccactt agggttttca 54301 aggggcactg tttatcttcc tactgggaat gggaatcctg tctctttttt accctctttc 54361 cccttttgac tgggtttctt ttttgactgg ctataggata catgctgttt gtcggtgaat 54421 ctctctgctg ccttagggct gcctgtttct cagcagcagt tagagtttgg cttaggagta 54481 acataacatc tccccatgca agatcaaaca tttgggttag attttagaaa gcttttatgt 54541 atctatcagg gtcatccaaa aacttgccta agaccccctt tgtttgtcta aggtactgca 54601 atgatatggg aacttgaact ctagtggcac catgtccatt gggcatttcc tgcagggtca 54661 acagtgaagt tgagttttct tgtgtggcac agctggagga gctgatgctt ggctattgga 54721 ggccccagat aagggatggg gaggtagagc atccagaagt tgcatttgag ggggttttct 54781 gactttgcgg tgttgtccac tgtgggcttg tctgatatga ctcttaagag ggcagagtca 54841 attttgcagt gcttacaaag agctgggctg tttcacaggg caaagaaggc ttgcacatag 54901 gggacctcag accgtttgcc ctcccacgtg cagaaaagat ctagttattg gatattattg 54961 aaattaacac ttccctcagc agcccaggcc tatccatcct ggagctggta agatggctat 55021 gtcctcgtac agaagaatat aagccatttt tcttcagtct cagggttaaa agtctcagtg 55081 tctcaggatg cactcaagag gagtgcagac tgaagatggt ttgttacctg ttttaaaaag 55141 agtaggagaa aaggcgtccc ttggtctttt tccttttttt ggtgtgaccc agggtggagg 55201 gaaagatcga aagtgcatcc cacttctctt tcttcccata ctctgggtcc tggccaccat 55261 cataggtgtc acccatggat gcaagcatga ccttcacccg tggatctgga agagctagtt 55321 gggtgtaata gtcatgctta cctgtatgag gacctgactc tccacgctgc tggtttccta 55381 gtcccaagtg gcccataagg ctcccagggt accctagtgg tctgggagat attgtatttg 55441 ggtgagaccc tttaatggag ggagtgtttt aatactatct ctggcttccc ttgctatggc 55501 cctagcaaaa cattgaaatc ccagagaatg ggaccgattg acttccaaat atgaaatctt 55561 ctttttattt aaatgccagt gtagttcaat gcagaacagg tgactcaaaa gaacataagg 55621 attgagtggc tgttctcccg actgttaaag gtacagcttt gctgttaaca gatggagcat 55681 ggagcttagt ccctaacaga ggcacacagg agggagagga attggggaac taatggtttt 55741 gcacaaaggg cagatagggc tccttatgga gaaaaaatcc tatttcactt ggtggcgctg 55801 taggatctga aatgttaaat aagaactctg actccaaatt ctttccagga agaagttaga 55861 aagagaggtt tgggcttaat aggctgtccc cattgtatgc cttccagaag aagaaaatta 55921 acttgtctca taatggaact gtctagattc attgggcagt cctgagcttt tacatggagg 55981 aaaaacacaa cccaaatgga gagggagaag ggtattcact cagcatgaaa tatcccctct 56041 atacagtgcc atgaatgtct gtcattaggg acaaaaagct ctcactaggt gaaagtttag 56101 acagaaatcc tctaattgtt ctaattagat caaacctcta tagtaaacag ttcaaaaatt 56161 atagtgtcag ggaatcatct tcatggttgc tggacttgtc atcaacatcc ccaggatcta 56221 gagttccatc ttctggccta tctggaaaac ctcacagaca tctacccagt acttctaact 56281 aaccatagca atcccaaagt accaaaaccc ctacttattc agtggaactc tccttcactt 56341 aatggattcc acctgaatgt ccccttccac acttgccatt atacctggac ttgaaaagtt 56401 gagtatatct ccagtgcatt aatgattctt ttgcactcac tactatgtta atgacacaaa 56461 agcagacgtt tccctgtgat gctctgatct aactatagtt accaagtttc caaggcaaca 56521 gcaaggtaaa tggtatttca cttgtaggca agaagaccat atatgatgtc ttctcccaga 56581 ccagaacaca cgggaaaaaa tgactgccta atctgggagg gtgggagtac tattcgcttc 56641 tggaaaacaa ggtttgtttg accaccccca ttggaacagg ggaagaatat ctctatagac 56701 ccaacttggc agcaaaggat agacattaca ccctgtagga cctctatgct ccaactggac 56761 tcatttttgt ttgtggccat gaatgagaag tcacatgctg taactactcc tggctcccca 56821 gggagccatc tgttctttta ggagtagctt tcccttatat atcaaaaact tggaacaaag 56881 ataaatgtat gttgaccacc cttgcccctc caagggtcac agtctataac ccaataaggc 56941 ccaggaacac cagaagtcaa caagcaatag gattaattct ggtgggaatc agggcagcca 57001 taggacaagc agcaccctag ggtggctttg cctaccacaa gtcaacccta aagaacttga 57061 ctcaaactct agaatcctta gccaccaaca caggttaggc actaccagca attcaagaat 57121 ctctagactc tttggcaaat gcagttctca ataacacact ggcattagat tatctactag 57181 cggaacaagg taaagtctgt gcagttaata aaacctgctg cacatatatt agcaactctg 57241 gataggttaa cattcaaaag atctataagc aagtgaccct gttatgtata ttttctttgg 57301 ctcttgcttg tttaacctct tagtaaagtt tgtgttttct agatcaatag tccccaacct 57361 ttttggcatc agggaccagt ttcatggaag acaatttttc tatggatggg gtatgtgggg 57421 ctggggctgg gggctggtct caggatgaaa ctcttccacc tcagatcatc aggtactaga 57481 ttttcataag gagtgtgcaa ccccagtccc ttgcatgcac agtttatgat aaggttagca 57541 cccctatgag aatctaatgc tgccactgat ctggcaggag gtggagctca ggcagtaatg 57601 ctcactcacc tgccactcac ctcctactgt gcagcccagt tcctaacagg acatggacca 57661 gtatccacag gagaggggtt ggggtatcct gttctagttc caggtaaaga caatgctggc 57721 acaaggcttc caacccatcc ttgtctacta ttccagagaa tggaagtgtc cttcctttgg 57781 ggaccttata tgtggtatcc agggatgttt actcctttag tgctaggcag ggcctaggcc 57841 cacaaaatca gcagcaagca gtcacagagc ccccttaaga ttaaagagga gtatctaatc 57901 tctgatggga gagtgagata ggaggcagga ctccatacca gactgaagac tggctgaaac 57961 agggaacagg caccaagagc atatctacat aatacacacc caccagtgcc atggcagttt 58021 accatcacca tgacaacacc cggaaattac catccctttt ctagaaattt ccaaataacc 58081 caccctttgc atgtagttaa aagcgggtat aaatatgatt gcagagctgc cccctgagct 58141 gatactcgac acactgccta tggggttgcc ctgctctgta ggagcagcca cagagttgta 58201 accctgctgc ctcaacaaag ctattttcca tctaccacta gcttgctctt gaattctttc 58261 ctgagtgaag ccaagaacct tcttggacta agccccaatt ttggggctca cctgccctac 58321 aacgccacct gatgtcctac ctctgtgctt tctaaaacca tctcccttac cactggaaga 58381 atataactta ggcctcagga aatgaggata attcatgctg acataaggag ctctttaaaa 58441 aataaaaaag cagctgggcg cagtggctca tgcgtgtaat cccagcactt tgggaggccg 58501 aggcagctgg atcacctgag gtcaggagtt tgagacaagc ctggccaacg tggtgaaacc 58561 tcatctctac taaaaataca aaaattagcc gggtgtggtg gcacacgcct ataatcccag 58621 ctactcagga gactgaggca ggagaatcgt ttgaacctgg gaggtggagt ttgtagtgag 58681 ccaagatcat gccactgcac tccagcctgg gcgagagtga gactccatca aaaaaaaaaa 58741 gaggcagcat tatataataa gatacctggg ggatattgat ggacatgcta ccttggagtc 58801 ctgcctagct cagccccttc ttcgagaata atccactacc cacagtcctg ttaaccaatc 58861 tctggggata cctgacccaa ccaggctaga tcaagttttc ttgcctgaat tctggaataa 58921 ggacctcaga cagagggcag agtgtgctgg gcactagggt gaggatttgt gtttaaaggg 58981 ctgtttctac cccagtgatc cccagtgcag acacagggag accaggcaaa agcatgtgag 59041 accaaaacag agaatacaac aggacctgtg ggtagaggga gaagggcttg agatgccttt 59101 ctcaccaggc agtatgtgca agaggatgga atcagaggac tagggtcacc cacatgggac 59161 agaaataaag ctttacccag caagaccaag agcccacagt tctgcagcct cacatcttgg 59221 tgcagtgtcc tctgggccag gcccaaatat ccacatttct accagatgaa ggtcagggcc 59281 atgaccttga aaggcaccaa tgcctggcaa gtcaaacaga aagttgacct cagactgtta 59341 gatattagaa cctattgctt ggtcactggg aaaggcacag agaaccagga catctgaggg 59401 ttgagctcca tgtagggcac aagggtgagg gacactgcct ctgaaaatag ggtaactgtg 59461 gcagaggata aagcctatga tgggggatgt catttccaag ttaaggtttc tgctacccag 59521 gctccttctg attctctacc tcccacatgt cctcctgcac tcacattaaa tcatcccaag 59581 taatcctcca ggttgcttgc ctggctccat gctctgacaa tcatgctgca gaatggagct 59641 gttccctcta agccaggcta tgcctacact gtcacacctt gtacaaggag attgtgtgtg 59701 ttgtaaaaat agacacagcc agttactaga tgtgtagatg cacttttacc acaggcacat 59761 ttgttttgct attacctcct ccactttcta ttaatcatac ccaaaatttc tgtaccaatt 59821 atgtcccatg cttagctccc atcagcttga gtaccaccat ctatggaatc atgggcatca 59881 gggcacatag gttcctttgc agaagctggg aaaaagcttc gtctcaacag tgctgggcat 59941 tacagcctgt agaacaatgc aataaatcca tcttattccc taaagcacag gagcctgtca 60001 taacagccga cttgtcctca ggatgacctc tggtccagag aaagtgtagg gatccaggcc 60061 ataaattagg acaacaaaat tcttgcattg tttcctagga ggccttgagc agcaaggcaa 60121 agtggactaa aaccagaaca agaactactg tcattttact tttcctgcac aggccagcaa 60181 cattggggcc ccacaaaagc tgtccatgcc tttatgtagg agcccaaatg tgttttccag 60241 ccacctcaca cagtaaacca aaccctaaga ccacacatag atttaaaaat aaggccttta 60301 ttaccacaca aaccagagca atgtgaggca gccagttact aggtggtgac ttggaggctg 60361 gaaaaactac caagtgttgt gtgaaaacaa gaatttccat gaagggaagc cttccacatc 60421 agcttccagc agaacccagg ctgggccctt ctggccaaaa ctggtcagat gtatgctgta 60481 atcagtcaga aaagggaagg aattcccctt aaaggctcag actgctcaag gaatttcctc 60541 caagggttgg gagaacacag gtgtatctgc ctcgccttag tatgcaaggg tgactaatga 60601 tttaactctg gcaaaggcct ccagcagttt agaacatgca aaggagctcc tgcaagacac 60661 atatgcctgg aactaatggg agtctgaccc atcgggaggt ctagaagtat gaggtttaac 60721 tttaggcaga cggctccttc acaaaggctc tctatagcca taacactgaa gaaagacatg 60781 gcacattccc tgcaggtcta aggcttttct caggtttgga gtttcaagag ttaaacaatg 60841 ttagatcttg ggctgtaggg ttttccacat ttgctgctgt cacgaggcct ctccacagtg 60901 tgagattttc ggtggagaat gaggccagag tgttgtgtaa aggacttccc acatttgcca 60961 cactcatatg gcctttctcc tgtgtgaacc acttggtgtt gaatgaggct agacttttgg 61021 ataaaggatt tcccgcactg tccacagtcg taaggcctta atccagtgtg aattttctgg 61081 tgttcaacga gggtatagct ttgtctaaaa aatttcccac attcactgca ctcaaaaggc 61141 ctttctccag tgtgagttct ctggtgccta acaagtttag atttgaacat aaaagctttc 61201 ccacattccc cacatacata aggcctttct gcagtgtgaa ctttgtggtg tttattgagg 61261 gtggttcttt ggctaaagga ttttccgcac tcatcacact caaagggcct agctccactg 61321 tgaattaact ggtgctgaat gaggccacac tttaggctaa atgatttccc acactggcca 61381 cactcataag gccttgctgt agtgtgaatt ctctggtgta caacaaggct gaagctttgt 61441 ctgaataatt tcccacattc accacactca tatggctttt ctcctgtgtg agttctctgg 61501 tgtcgaacaa gagtggattt ggacccaaag gcttttccac attcatcaca ctcataaggc 61561 ctttctccag tgtgaattct gcaatgttca ataaggttgg aactttggct aaaagatttc 61621 ccacattcac cacactcgta aggcctttct ccagtgtgaa ctctgtgatg tttaatgagg 61681 gtggctcttt ggctaaatga tttcccacac tgaccacaca cataaggcct ttctccagtg 61741 tggattctcc gatgatcatt cagatgagag gtttggctaa aagatttccc acattcactg 61801 cactcataag gcctttctcc agggtggact ctttgcagct gaacaaggga gcactcacag 61861 cagcaggctt tcccacattt gctagattca taaagccttt ttccagtgca gactctaggg 61921 tgaaaaaaag tgtgtttgtg gctggactct ctcctgcatt gactccactt gtaatgactt 61981 attccaacat gaaaggcctc ctcacatttg ctgattttgt ttggcttctc atagttagca 62041 atagcttgcg gctgaagaat gcccaatggg gctaggaagt ccttcccaac ctcactgctg 62101 gtgaaaggca ttcctgactc atggaacagg cagcactgca caaatgaggc cttgtccatg 62161 tcactttcca agggtttctc tgcactatga tgcttctggt cctggtgaaa gaccgcacat 62221 gccccagtca agtatagttc ctgcccaggc aaatcggtca ggtgcaaaat gtcggtcagg 62281 aatgggacac acatgtcaca ggattgaatc ttctgggtgg atggactggc ctctggagtc 62341 ctgacctgag gtactccttc tacagaaaca ctttgctcag gtgtctcttc atcctccatc 62401 ccatgccaac aaactgaaag cagagaaaca ctgatgaaat gcacgttgac tatgatggga 62461 gtgtattagt catttaaaaa actattttct gaataattgg tgttctgaca tcttaaaaac 62521 tttgctggcc tgagctcagg agtttgagac cagtctgggc aatatggtga aaccctggct 62581 ctactaaaaa tacaaaaaat tatccaggcg tggtggtggg tgctactcaa gaggctgagg 62641 cacaagaatt gcttgaatcc gggaggcaga ggttgcagtg agcccagatc gcaccactgc 62701 actccagcct gggagacaaa gcgaaactgt ctcaaaaaca aaacaaaaca aaacaaaata 62761 aaaaacattg ctggctgggg agagaatggt gcttccagag tgagccaact cttaaagaga 62821 tagcaaaggg ctcagcaagg agcataacaa atacaccatc aatccagagc ccatactccc 62881 aatcacattt tctatgtgac ttggatgctt caggagctaa tattcctctg ctctagtcat 62941 cccaaggtca ggcaggagac aactacgcct agtgagcccc tttgccccaa agcaaactac 63001 tcaaattacc caatcctaaa ctgtttatta tgtcttgtgt tacctgcaga aaccccaata 63061 aaggctgtgg cttaatattt tccctcctct cttgcttcta ccttctgacc actgagcact 63121 tcctcatgtg gccctgcatg gcatggtgtt cacatgctct tgctactgca ggtgataaat 63181 tcctatgtca ctgacattaa cctctctgta ttggcactct actacctcca ttaattaaaa 63241 tcctgtgggt acataacagg aaatgggaag caccttcaca tatacatcaa ggaatgaatt 63301 cacagtattg ctttgaagca atattgagga actgaaagca ggataagaaa gaggctgctt 63361 ggcagtagtg ggcctctaaa gatcacaaaa tggggaggcc tcaccaaaca caggacgtaa 63421 gaatggggta cagtacacaa ggagaggcag ggtctccaat ctattctttt gccaaatcct 63481 gtggcaagat cttgtcagct ggtggcacgt gtatgctgtg ctgaaagctg gcattattcg 63541 ggcccaagat aatgcaattg gaagtgagtg ggagattcca attgcgatct tgtgttcaag 63601 aaatatttaa ttggcatgtg cacaactcac aacatatcgt gagtgggtca gtgttggaaa 63661 acatgctggg gggcagtgca gagaaactgg tgacacatgg agggcaggaa gatgggggca 63721 gcacgaggcc ggatgtccca gatgaagcaa taagccagca aagtgtcaag gaagaggaag 63781 aaaacataga aatgagatgt gtccagtggc attagcacca aactctggat gacacagggc 63841 aaattcctgg taagattaga aaacaggctg gatgccatgt cctcacccag gtgccactga 63901 ctgaggccag gcttcctctg aagccatatt cccatcttca ccctttccaa caaccagggc 63961 tttcccacca cctctagatg tgccactact tgggacctga atgaagcaag tcctaaggga 64021 aagctgggac aggtgagtat cctgcagtgg cccagagcaa gtcagcagcc ttgagaacac 64081 aagaaagaaa attaatatca cacagatcct tgaaggaggt tctggcaacc acagctcatg 64141 gtccatgtaa atactgatga ggaaaattct gcaaattcct aacacaggta gtcaaaggaa 64201 atgtcatcta ggaggcagaa tttgagacac aaagtaccat gaatatggac acagccaccc 64261 aaagaggaag tgaagaagta aacggagatg cattaggctg actcaagaca cctaacagcc 64321 agacccatcc ctgtgcattt tgggacagca aatgtcaggg ctgattaagg agtgcaatga 64381 tttcattcct tacaccacag cacatacgcc aagacagtgg ggaagtcagc aaagcccacg 64441 gtccggcaga gctgcctctg aggaaaaaga ggatagacga agaccaggac acgagagtgg 64501 gtgtgagggc cttacccagc gcagttataa gtgcaaagtt ctccagcatc acatcgcggt 64561 acaggagcct ctgagcctca tcaaggagcc cccattcttc ctgggagaag taaatggcca 64621 cgtcctcaaa ggtcacacag ccctgccatg atggggatag atctttccat gatcagattc 64681 tctcctagga ccccaagttc atgcccccca cacatccttt ccctgcttat ccccatacct 64741 aagccccacc tcagaggaga taccaagacc tggtggcact gattcctgcc ctctcctcat 64801 ggtcccatca tcactgtgtc ttcccacagc accaacaggc aggtgggcca ggaaatgcca 64861 ttttaaactc tgggtctagc atatagggct ggattttgtc ttgtcagaca gttctcctgg 64921 ggtttccaaa attatgccac tcacatacaa acaatgatga ggtttgccat cattcttata 64981 tcctcaatgc catggccatg ggttgttcag ttcacacttc ctctccatat gcacatcact 65041 ctttgaagca cacactctct caaacaatct cactcccacg gctatttcca ccctgttgcc 65101 ccacaccata ggcatctctc tggcatcccc acgatagttg gaatgcaaca ttcagaaagt 65161 gcttggcaaa ctgaaatgag gtccggtgcc actcaggctg aatggtcccc actgccaaag 65221 gcagcccttc agagaccctg ctctgaaggc ctccctgcct gactctgtcc ctttctacat 65281 cctcagtcac ccagaaatac ctggggatct gacaccgatg gccctcttgc actagcgctg 65341 cacttgatgt cctctcttaa gtcacagcct tcctctcctg ctaaccctca ttcaaagtct 65401 gccccagggg acccccacac ctcaggccca ataactccca gatgaccacc tgtgcccctc 65461 aacttatcca cctccctgaa attcctgaca cctcaagggt tgccacaaaa ccctggtaga 65521 tatatccaga gtgctgagta agcactagtt ctggtctcag tgtcatcttg tctcaattac 65581 ttccaagcat tgatctcatg cccactcttc tcctagatcc ttggccactt gccagatcat 65641 ccttgctctc acaccctacc atagccacct ctctcccata cctgtctttg tgattcccac 65701 catgaccaac catgtcccca agcaggagag ccagtagtga cccccaacct tccttccctg 65761 ggctggtaat agcattacca aagtgatcac gattccacct cctaattgtg tcccaagcta 65821 tacccctagg caagtagtgg ccaccacctt ttaaatgtct ctatcctgaa tccctcaaca 65881 cagttctagt cctgtaggat aaactatcca cttcatgtct aaactaactg ttctttgaaa 65941 catctcagac tctaaaagat gtcccattgg gccatgttca gatgattttt attaaaaaat 66001 tttaaaaata aagatggtcc atatcaaaat ttctgattat ctctagagat aactgactct 66061 actaccaact ttacaatcct cgctagtgga gcttccatat ttccagctgc taaacccaaa 66121 caccttgctt ttacccctaa atcctacttt tctatctcaa aatccaggct tccctactta 66181 aaaaaactga tctaaaatcc aactaccacc tccacagcca ccacacaagt ccagccacca 66241 gtaacactca cctggactac tgcagcagcc ttcttcacta tcttcctatc tcatacctca 66301 cactcagcct gttctctcag cttggccaca cacagggaac ctgatgagac ctggatcaga 66361 tcacatacct cctctgcccc caaactgtca ttgctcatca cacactccag aagaaacacc 66421 cgacaactga cagccatttt ttgtctgatt cctgcatccc agctttcagt tcccatgaga 66481 aagaaaacag ctgtagtttc ctaaaccagc tcagcctgac cttcaagcac ttgctcatct 66541 ggtatctaag cctgagacaa cgttccacac cttactttac tggttcccag caataggaac 66601 cattctatac cagcttccag cctggatgta tgaccctggg actgatgttt gttcaggatt 66661 cctagaacca aggacccaaa gagtgccaag agaagagtcc cagaacacaa tctcagagac 66721 aaacactggg gtcagcatca gtgagggccc gaaacactct ccactcacct gtgtaaagcc 66781 cataagttta gcttctgcag tcacgggaac ctgcgggaag aggaaggcta tgagaggcag 66841 gtaactatgg tggaagctcc atgggctgat gcctccttgt ttccctgcgc agccctggag 66901 ccaggtgact tggtgatgac taaattatta ttgctcctca gttccgtgtg actcttacct 66961 gctgtgggac cttgcacaag gactcaactt ccctgtgcct cggtgttatc ccccttccag 67021 gctgttccaa ctctgagtaa ctttagaaaa aatttgaaaa gtctaactca gaccatgtgc 67081 ctcgtctgtt cacaaccacc catggctctc agcgtcttga aaagaaagag cctcatttta 67141 ccattcgggc gtgaacgtta ctggctcaat cgtcccagca agattcacag aaaccggccc 67201 ctccgcctgc cacaccctcc acagcccacg acaccaggct tgtaaatggg agccccaccc 67261 gcgagggcct catagtcctg gactctaggg tcgggctgca gggacgcaag caggcgcccc 67321 tcatgcaggg tgttggattt ccggcggaga gggcaggtga ggcccgggag acgcagcact 67381 caccgaagtc gaatccctaa gcacggccgc cgccatcgga ttgtgagcgg agcggggccg 67441 ggagcggcgg gcgacccggg gcgggaaccc aggcacggct gccaccagcg ccagcgaccc 67501 accggctgat gcgcagcggg gcgacccccg ctctgtgccg gaggcagcgt ttctaactca 67561 ggcggcgtgg gccgaggtag agaaccacca aaattaccat ctcggagcgg tgccggaagt 67621 cccgccttac cgtatgcaac aaacggagac aggtccgttc ctgaacgttg attggctccg 67681 cgtctttgca gcttggcgct cggcattatg ggtaatgtag ttctcagaag cccagggttc 67741 cggtgccctc cctgagattc acccggaaca agggctaagc ggcccgcgga gggcgctagg 67801 aaaaggcgcg acaccaagag cgagacaggg acttgctgct ccttaaaggg gccgctggcc 67861 gatttccgca gtcccatacc cactgaaggg agtgtctggg ttaagataag aggttgtaga 67921 gaccaaggtt cttattaagc agatgaagct ttcacgcagc aggcttcaga gggaatagat 67981 tgtaaatgtt tcctatcaga ctttgaaagg tgccagactc ttaattcttt cctggaccat 68041 gaaaataaaa aaaccctgga gaggaaaggg gattctcagc aaaatgcaga ttccacaaga 68101 cacacctttg cagggcctct tcaaaatatg tcaaataaat aaagtttaga gtaaaatact 68161 tcaattcctt tcagggcctg ctatctgtca tgtaatgcta tactacaatc aggctggaat 68221 tcactgtctt attgctacaa gaagtcttaa aatctctgtt ttaatgttaa tgctggtcag 68281 ttgtgcctga attccaaggg gaggagggca agtctgaccc tcacttccca tcacgtcctg 68341 aactagtttt tcagattaac tttggaatgc ccttggccaa gaggaaggat gcattcagat 68401 ggttgggggg ctgggaattt tatttttggt ttacacttct catgcctctt ggaatctctg 68461 aaccacatac aacggccatt tgttgagtaa aagcttccaa tgcaggaaac tgtctctttc 68521 agcctcatgc tatacaccat taaaacagta attactggca ataagactgg gctttcttaa 68581 acatacagga tctcacagac cataagccta tgatccttca taccagctgc ccataatact 68641 tttggttttg gaagcagcac cccataacct cagcatggct accaaggcct ccttaagaca 68701 gggtagaatg aataagatct agtatttgat agcacagtag agagactaca gtcaacagta 68761 atttattgta cattaaaaac aactaaaaga gtattataac tagaatgttt gtaacacaaa 68821 gaaatgataa atgcttgtgt accctgatat gattattaca cattgtatgc ctgtatcaca 68881 atatctcacg tatcccataa atatacatac atactatgta ctcataaaac taaaaatttt 68941 aaaagacaga taaatggcaa gcaggtagat gaaaacatgc ttaacatcat tgatcactat 69001 aaaaatgcaa atcaaaacta caatgagata tcatctcact ctagttaaaa tgacttttat 69061 gcaaaagaca gggaataatg aatgctggcc aggacagaga ggaaagggaa acttcatacg 69121 ctgttggtgg gagtgtaaat tagtaaagcc actatgggga acagtttgga ggttccccac 69181 agaagtaaaa atagaattac catatgatac agtttacacc acccagaaaa gaaatcagta 69241 tatctaatag agatctgcac tcccatgttt attgcatcac tattcacaac agccaagatt 69301 tggaagctac ctaactgtcc atagataaat ggataaagaa aatgtgatgc atatacacaa 69361 tggagtagta tttagccata aaaaaagaat gagatcccat caattgcaag aagacgggtg 69421 aaattggagg aaattatgtt aagtaaaata agccaggcac agaaagacaa acttcacatg 69481 ttctcactta tttgtgggag ctaaaaattt aaacaattga actcccggag attaagctta 69541 gaaggatggc taccagaggc tgagaaggat agcagaaatg gaagggaaag tggggatagt 69601 taatgggtgc aaaaatatag tcagatagaa ggaataaggc atagtatttg atagcacaac 69661 aaggtgacta cagtcaacaa caatctattg tagaattaaa cataactcac cgggcatggt 69721 ggctcacgcc tgtaatccca gcactttggg aggccgaggc gggcagatca caaggtcagg 69781 agatcgagac catcctgact aacacggtga aaccccatct ctaaaatcaa atctctataa 69841 taaatctcta aaaatacaaa aaattagccg ggcgtggtgg tggtcacctg tagtcccagc 69901 tactaaggag gctgaggcag gagaatggcg ttaacccggg aggcgttgct tgcagagagc 69961 tgagatcgcg ccactgcact ccagcctggg caacagagca agacttcgtc tcaaaaaaaa 70021 aaaaaaaaaa aaagaattaa acgtgactaa gagtataact ggaatgctta taacacaaag 70081 aaatgataaa tgcttgaggt gatgaatacc atttatccca atgtgattat tacacactgt 70141 gtgcctgtat caaactacac catataactc ataaatatat acacctacta tgtaccccta 70201 aagactaaag ataagaaatc aaaaaaataa agacaggatg gagctaaacc tgggccctct 70261 ggaatgttct acctgcagat ggagctggcc tccactatct tcagtccctt gctagatgtc 70321 atggtccttg aggtcaacac tccccacacc tgcggaatcc cttgggatta actgggtgaa 70381 cagtaatcga agttagtgga ctatatgaat gacgctgcca agctatactt gaatagctgt 70441 gccaaataga caaacacaga aagaaaacac aaaacaaccc atccacatcc taagggcttc 70501 acactttctt gcactctatc ttcaatgttt tttcagcttt ctctttaggt acttattgac 70561 tgttggtctc acataagttt agccatagat ggaatttatt actgctttgt gaccactggc 70621 agcctccctc atctctggct gggcttctcc ctgaaggact gggcctcagg agtaggtggg 70681 tcatttgtcc cctgggcatg ccacacagag caggttttcc gaactaggct atccctcctc 70741 cccaagttat ccaggcaatt ccgctagttc tatgcctaga tggacacaca tgcttgaagt 70801 ttggggagcc atgtttgatg tgatgtggca actctgaaag gacagtgagg gtgagggcag 70861 gtgttaggtg gccattgtgg gggctgctgg cctctgggac cagagggtaa tgaggtgcac 70921 aaccgtgtca cctagagggt gccagtgtta ataaggaaaa gggttgctgt ccgacgcaca 70981 cagaagccaa tgccatggca ccaggttttt gagaaaagca aacctttatt gcaagtcgac 71041 aaacaagaag acaggagtcc agctcaaatc tgtctcccca atatggggtt tggggcagct 71101 tttaaggatt aggaaggaca ggttggtaag cactggaagg gtggattttg attggaagga 71161 cttcaaacaa atctattgat ggcaaggcgg ggatggggtt gttatcaccc aacattcaga 71221 gacaacagac cccttgcttt cggaagtgtt tcagcacttg ggttccagcc atagcctgat 71281 attttggttt cctgtggtga cagggtggag tctggtaggt gagacttcag ctccgagtgt 71341 tgggtcaatg ttttcttttc tcttttcttt ttatttattt atttatttac ttatttattt 71401 attgagacag ggtctggctc cgttgcccag gctggagtgc agtggcataa tcataactca 71461 ctgtaaccgt gatctcctgg gctcaagcga tactcccacc tcagcatcct gagtagctag 71521 gattacaggt gcaccaccat actcgggtaa ttttttagtt tttgtagaga cggggtctgg 71581 ctatgttgcc caagtggtcc ttccacctca gcctcccaat gtgcttggat tacaggtgtg 71641 agccaccgcg ctgggcctga cattttcttt tctgcacatt ctctagctgt gtaacttgca 71701 actctttagc tctgtgcctg caaaataact ggacattatg ttaccaacag aggatggcca 71761 gatgagactg gtccagtggt taaacctgca ccttcatgga ggccattatg agtatcccta 71821 caggccgagt ctcagggtgc ccccagaagc ccataccagc cgctgtaagg cagagcatgg 71881 acttcccctc ggggactaaa tcccattctt gggctccaga ggccttgttc ctaccactgt 71941 acgccaccgc ttggggtgcc ctgggaaccc gcctacctcc ctactgccct gagggcggtt 72001 ggagatggtc ccagcaggtg tgggagatga ggtaggagac gtcttcaacg gggtccaagg 72061 ctcatctgga cccagttctc agccgagggg cggtggagag ggcgtcagaa aacgcaggaa 72121 cggctcctag acccaggagg ggactcaggg aagcagaggc gctttcctcg gcggctctgg 72181 ggccttccca gctcgcccgg cccccctgga ggagacgtcc cccggggtgg agctacggag 72241 aaccccttcc catgggacgt ctcggccccg cctcccgcct ctgggtacag tcaaagggct 72301 caggtcagag caggagcagc cgggggcgcg gcccccacgt ggcctcccgg gacacgtgcc 72361 cacagcgcga cacctaagtc gctcctttca cagaatagcc ttggccccgg cacggcctcg 72421 aacgccttga ccccgcaagg gaggggcacg aggctccttt gctcccgcag gagcgtcgaa 72481 tacgtatgga atacgcgtcg gtgtggatct ggcctcagct tcctgagcct gaatcaatgc 72541 ttctaatcac gtgggcgctc agccgcgacg ggaccacaga aatcgaccag cggcctgttt 72601 aaggagcgac aaggcccagc cctctcctgc attgtggcga cgccatttcc ctgcgccccc 72661 agcgcggccg cttggctttt gtttgaggtg acgctgggcg gcagcatcct tccctcagcc 72721 ctggggacca gcggggacta cagaacccag aaggttttgt ctccagccgc agggatgctg 72781 agccaatccg cgctgagaaa agggtcgttt ccgctttggg accaatgggt cggggaggga 72841 cttccggtat cacttcagtg gcggtcattt ttgcagcgct tgggtgcatc cagaccgtca 72901 gagctttggg agcgctttgt ttggcgacag tcggaaggcg cgaggggagg ggtcctcccg 72961 ctgaacagtg ggggttctaa gggtcggcgg cggcggggtt gacggctttg cctaggtccc 73021 tccgcccgta gctgtcgggt cccggccccg ctctgcccac agactccgat ggctgcggcc 73081 gcgctgaggg ccccgactca ggtgagcgct gcctctactg ggcctcaccc tccatcccca 73141 aattagtgcc ttcttgggtc actacggtcg agatcctcat gtccagtaca gtgggggctc 73201 gtgggtgggg tccctatttc cagaacttgg gtgatggggc acggaagagg gttacaggca 73261 aagggaccag cgtttctaaa ctcttggaga cacagtgaag aaggttcata cctggagtgc 73321 caaggtgagg agttttccct caattcttag gatgctagga gccgtggagg gttgtgtaca 73381 gaagcggcat atggtctgag ttaggtcttt cgaagttttc caaaaacctc atggagttag 73441 actagaatag aatggaataa cacagaggca cagggaggtt gagttcttgt gcaaggtccc 73501 acagttgcta agagttggca ctgaggactg gaacgctgag gctcagaaat aatgcaggca 73561 tcctcagtcc acctggctct ggcctggaca ggaatccagg gaagcctgaa tccgtagagc 73621 cctgtatggt cacctgcctt tcatgatctt tggctttcca caggttactg tgtctccaga 73681 aacacatatg gacctcacaa aggtgagtgg agggtgtccc aggccctcac tggtcctggc 73741 cccatcctgg ggttttttca tggatctttt tgccaccatc cagttctttg acctaaggac 73801 tcttcacaaa ctggaagctt gacatcctga ttctagactg gtggttattt gtagacgttc 73861 ttattgctgg cagacaatgg aatgaggtgt gaaaggttgc cccgcctacg tacaataatg 73921 tgtaaggttg gcagtgaagc tgagctgatt cagggaaaca caactctcct ttctttctca 73981 tgggaactca gagcagcagg gcaggactca ggcctggaat ggctctgaga aattgggtgt 74041 tttttctgga ggtgatgaga acaatgggag gtttagagaa gagaacggag gttatctgac 74101 actggtctca tcaggttcac tgttaatgca gcatagagaa tactctgtgg gtgtgatgga 74161 ggcgatagaa agaccaggga ggaggctact gcagtggcca aggtgagttt tgatagtggc 74221 ttcaccagtg tggtggctgt gaaggtagaa gaagtggctg gattctgggt cagaatttta 74281 attaggcctg cttgcatttt cagattttga gggtaagaga tactggtcaa gcattactcc 74341 aggcattttg aatttagcat ctggaaggat ggaatctcca tcaaatagca tgttatagga 74401 ttgattttat ttttatttta tttatttatt tgtctatttt tagacagtct cactctgtca 74461 ctcaggctgg agtgcagtgg ctcgatcttg gctctttgca acctccacct cccgggttca 74521 agcaatcctc ctgcctcagc ctcctgagta gttgggacta caggcacatg ctaccgtgct 74581 accatgccca gcgaattttt tttttttttt ttttttgtat ctttagtaga gacagggttt 74641 caccatgttg gccaggctgg tctcaaactc ctgacctcag gtgatccacc tgtcttggcc 74701 tcccaaagtg ctgggattac aggcgtgagc caccatgccc agccaattaa ttttcattaa 74761 ggataacagg agttttgtct ttagctatta tcatagtctg agatattcca aagaatgata 74821 tatggagaca tcaggtgagt agctgtccat gtacatctag aattctggaa gtggtttaag 74881 atggagaaat ccaaaaggtg gtggccagtg tttggcccag agtagctatg ggtcacaatt 74941 aggagtgggt aatcctggtc ttgaagataa tgctatttat aatgcaggaa agggctgggt 75001 gtggtggctc acacctgtaa tcccagcact ttgggcggcc gaggcaggcg gatcacttga 75061 ggtcaggagt tctagaccag cttggccaac atggtgaaac tccgtctcca ctaaaaatac 75121 aaaaattagc caggcgtggt ggcaggcacc tgtaatccca gctactaggg aggctgaggc 75181 aggagaattg cttgagccca ggaggcggag gttgcagtga gccaagatca cgccactgca 75241 ctccagcctg ggtgacagag tgagactcca tctcaaaaac aaaacaaaaa aacagaaaag 75301 ggaaggttgg ggctatgact gtgtctcatg cttggacacg tggtagaggt ggtgagcatc 75361 acaaacagtt gtgagagaaa agtggtcatg ggaagattgt aggaaagagg atgatctggt 75421 gtatcagtcc attttcacac tgctgataaa gacagcaagt ctctaggaag ttccagactt 75481 tcccatattt tcctgtcttc ttctgagccc tccaaactgt tccaacctct gcctgttccc 75541 caattccaaa attgcttcca cattttcagt tatctttaca gcacagcccc actttactca 75601 tactgattta ctgtattagt ccattttcat gctgctgata aagatgtgct ggagactaga 75661 taatttgtaa ggaaaaagag gtttaatgga ctcatgattc cgcgtggctg gggaggcctc 75721 acaatcatgg tggagggtga aaggcatgtc ttgcatggca gcagacaaga gagaacttgt 75781 tcaggggaac tcccccctta taaaaccatc aaatcttgtg agacttattt actatcatga 75841 gaacagcatg ggaaaggcct gcccctatgg ttcgattacc tctcaccagg tccctcccat 75901 gacatgggaa ttgtgggaac tacaattcaa gaagagattt gggtggggac gcagccaaat 75961 catatcatct gataagtggt caaggcttcc aaggcattag ggacataaga cagacatgga 76021 gataaagaaa taattggtag aagatgacac tgaggccagg tacagtggac tgtgtccccg 76081 caaaattcat atgcttaaag cctaattcca gtgtgatggt atttggaggt tgggcctttg 76141 ggagataatt aggtcatgag ggtggagccc taatgaatgg gattagtgag tgctcttaaa 76201 caaagaggcc agagagctac tatctagtgc tctttccaac aagtgaggat acaagaagtc 76261 agcagtctgc aacctggaaa gagtccctac cagaacctga ccatgctggc actctgatct 76321 caggcttcct gcctgcagat ctgtgagaaa taaatttctg ttgttgataa gccacccagt 76381 ttgtggtatt atgctagcag gccaaactga ctaaaacact aggactagtg cttattcacc 76441 atggtaaaaa ctcgccagga atttgttgtg acccttggtg agtctgggga gggtgtgagt 76501 caagcagttg aataagttta ggagtacaag tgttcatctg tatgagtcat taggttggag 76561 gtagagagcc actgagagtc tgtgccactg aggccactga ttttgaatga gggttaagag 76621 gtgagaagaa ggttgttgat tgaagagggg acatcaagtg cagcacaagt ggaggacggc 76681 catcggtgtc tgacatcagg tgattcttgg tggctgaggt tgaagcaagg aacagagtca 76741 ggaagatagc ccttcagagc agggtggggg ctgtctatga cagtaggaac catcagaggt 76801 ctgtgtagca ctggatgatg tctcaatttg gcaggcagag gggtggaggt agccgtggga 76861 gtgagactga gcgagtatgg ttcaaagaat gatgtgtaca tggagaggga gtgtgaactg 76921 aaggacccag gcccctggca ttgaggactt aagggtcaaa ggtgaagcca catttgattg 76981 tgtctaggag gcactatctg aaagaccctt ggggaactgc ctgacaagaa gggatacact 77041 cctacatacc gggcccaaag tttagatgtg gtgcatcctg cccacctcct ggttggtgct 77101 gtagaaggat acagtgatca ttgggaccat gaaaagagag tgggcatctg tgtcaccagg 77161 gcctgtattt ccttaaggtg gggatgattg cgtgggtgga tgaacttgtg gaggcgttga 77221 ttgtggagtg acctgtcccc atcatgacag ggctgtgtga cctttgagga catcgccatt 77281 tacttctcac aggacgagtg gggacttctt gatgaggctc agagactcct gtaccttgaa 77341 gtgatgctgg agaactttgc ccttgtagcc tcactgggta aggccctaat accaactcca 77401 gtgtcctggt ctgtctatgg tcttttgctg acttctccac tttcttggga tatgtgctat 77461 gggatccagc actgtgcact gcctgagtag ccctaacacc tgctgcccag tggtctggga 77521 acacctgctg cccagtggtc tgggaacacc tgccactggg cagcaggtgt tcttcagtca 77581 gccaaatgga tctctgttcc ttgctctctt cttggctggg tgtctgtgtt cagatccatg 77641 acaccttgtg tcccaagttc cctgcacccc catctgctag ctgacatttt cttgtggcta 77701 cctgtgtcat gattgtagtc attgtcattg tctatactta tggggacaat gggctcttct 77761 tactcactcc ttctagtttg tttgtttgtt tgtttgttta atgatattaa ctttcttgcc 77821 catgggctgt tgtgtgctga gtcgttctgc atcagtgctg atcactcaca tgttctgtct 77881 ttctggtagg acttggcttc atgaaggtcc cacatagcta gacaactgag ggtgtgcaga 77941 gaaacctggt tgctttatag ggtgaacatg gtctcagaga aggcctgggc ttgttgagtg 78001 gcaactggtt gggggcatga tatccgggct gttttccgat cacataagga gcctgtcctg 78061 tcttcaccac tgtttgggtg cacatgccac tggccaaatc tcatttctat attttcttct 78121 cagtacttga cacttctctc tcttattatt tttatctatg acttctagca cctggctacc 78181 tccacctttc tgatcttcat gcatcatctg cttctctgca ctgctccctc tgcaatgttt 78241 tccaacattg acacacacat aatgtctcat gaagtatgta tatatgacca catgaagata 78301 tgtgatcata tcacttgcta ctcagtttca attgtatttt cctgggcttg ggcacagtca 78361 accttctgca cagcatgcag gtgtcaccag cagacaaggt cctcctgaag actatgtggt 78421 gagttttagg aattggaagg ctactcaagg ataatttttg aggctgttgg tagaggaatg 78481 gtttgaaggc aatgccattc cctgtccttc atcccatcct tgtgtccttt atttggtgag 78541 gcctccctca ttctgtgatc tttagagatc cagtattgca aagcaacccc ttcctcaact 78601 tgcccccagc tccattatcc tcaaaacaaa cctggtgaat cattcgtttg gtatgtgtgt 78661 aatggggctt cttgctctca gcatagtcaa tgtatatttc atcagcggtt ctctgctttc 78721 aggttgtggc catggaacag aggatgaaga gacaccttct gaccagaatg tttctgtagg 78781 agtgtcacag tcaaaggcag gttcatccac acagaagact caatcctgtg agatgtgtgt 78841 cccagtcctg aaagatattt tgcatctagc tgatctccct gggcagaaac catacttggt 78901 tggagaatgt acaaaccatc accagcacca gaagcatcac agtgcaaaga aatccttgaa 78961 gagggacatg gacagagcct catatgtgaa gtgctgccta ttctgtatgt cattgaagcc 79021 ctttcgcaaa tgggaggttg gaaaggacct tccagccatg ttgcggcttc tgaggtccct 79081 ggtctttcct ggaggcaaga aacccggcac aattactgaa tgtggggagg acattcgcag 79141 tcaaaaaagt cattacaagt caggtgaatg tgggaaggct tccaggcaca aacacactcc 79201 tgtttaccat ccaagagtct acactggaaa aaagctttat gagtgtagca aatgtgggaa 79261 agccttccgt ggcaagtact cacttgttca gcaccagaga gtccatactg gagaaaggcc 79321 ttgggagtgc aatgaatgtg gaaaattctt tagccaaacc tcccacctga atgatcatcg 79381 gagaatccac accggagaaa ggccttatga gtgcagcgaa tgtggaaaat tatttagaca 79441 aaactccagc cttgttgacc accagaaaat acacactgga gcaaggcctt atgagtgtag 79501 ccagtgtggg aaatccttta gccaaaaagc cacccttgtt aaacaccaaa gagttcacac 79561 tggagaaagg ccttataagt gtggtgaatg tgggaattcc tttagtcaaa gtgccattct 79621 taatcaacac cgaagaattc acactggagc aaagccttat gagtgtggcc agtgtgggaa 79681 atcctttagt caaaaagcta ccctcattaa acaccagaga gttcacactg gagaaaggcc 79741 ttataagtgt ggtgactgtg ggaaatcctt tagtcaaagc tccatcctta ttcaacaccg 79801 gagaattcat actggagcaa ggccttatga gtgtggccag tgtggaaagt cctttagcca 79861 aaagtctggt ctcattcaac accaagtggt tcacactgga gaaaggcctt atgagtgcaa 79921 caaatgtggg aattccttta gccaatgctc cagcctcata catcaccaaa aatgtcataa 79981 cacatagagg cctcatgaat gcagcaaatg tggaagcgcc ttcaactcaa gatctatcat 80041 catttagctc ctgaaagtcc acacttaagt agagccttag acctacaggg aaagtgctgt 80101 ctctgtagta ttgtagcagt agagagcctt tgtgagggag ccatctgcct gaagttgaac 80161 ctcattcttc cttgtttctc tggtagaaac catctaccct ctaccacctt gcacagtggg 80221 cactggtcac tcctatgtgc taagacaagg cagacatctg tgtgttctct taagtctttg 80281 gaggaaatct tgagcagtct aagcctttag agaaaattca ttcttttttc tgactgatca 80341 cagcatacgt gtgacccagt ttgggtcagg agggcccagc cttggttctg ctggacactt 80401 atgtgcaagg attcccttca tgtaaattct tggtctcaca tgacacttgg tcattcttcc 80461 agcctccatg tcaccacgtg gtgaatggct gcctcacatt gctccagttt gtgcactaat 80521 aaaagcctta tatttgaatc tacctgtagt cttggggttc tgtttactgt gtggggtggc 80581 tgggagacag acttcaactc tatatgaagg aatggatggc ttttgtgggc ctctgcagga 80641 aagtaagatg acagagtaat tctaattctg gttttggtca tacttgcttt gctacctaaa 80701 atctcctagg aaaaaatgca aggttttggt tattctaatt tgtggcctgg atccctattc 80761 tttctgtgag actagaggtc atcctgagga gaggccagct gttatgacaa gcatgtgtgc 80821 ttcagggaat aggacaattt tattccattg tttccagagg atgtcatatg atgcccagtg 80881 ctgctgagaa gcttttcatg gggttctata aggaggcatg ccctgatatc aaacattcca 80941 taggccgatg tcacgcagaa gacaacgcga gtcacatgtg aactgtaatt ggtacagaaa 81001 tacctgggta tttctgtact gtgtgtactg tagcaaacta gttggaatgt gcctcttata 81061 aaagtacatt tacaaatctt cccgtgactg tggctttgag cagtcatagg acctagaaat 81121 ctgtgtatgt ccaatagctg aggttatttt cagcaaaaat aattaaaggg ttttattttt 81181 taattcttgt tggttttcta ggttgttcac ctcaagtgca ttgctgtaga ggcagaaaaa 81241 ggaggataaa gataacagaa gtcctatagg ccagggatgt attgatagct cttgtgattt 81301 ccaccagtgt tgctgttgtc tcaaattgcc acagccttca ttgcttgcca acatttcctg 81361 catggaggga ctcatggttg cccttcccca ggcctgaaga gagagtgcag tcaacatgag 81421 attgctaggc attctggttt ctgaaagttg ggtgatcaga tactttattg tgaaacatgt 81481 tttacaaact ttcttgatgt gtaagtgaca tgccatagtt tacatccatt tatggtgtat 81541 aatttgaaga gtttgtcata caagcctgtg aaaccataat catgatcatg aacatattca 81601 tgattcacct cttgctgttt tacaatctct gcgtgtactt tccaggcctt caggagtcct 81661 gtcatttact ttccctacag gagaatagtt tgtgttttct aggattttat gtgaattgaa 81721 acgtaaaata cttactccat ttttcctgat gatgcacaat tttttttttt tttttgagat 81781 ggagtctcag tctgtcacca ggctggagtg cagtggcatg atcttggctc actccaacct 81841 cctcctgggt tccagcgatt ctcctgcctc agcctcctga gaagctagga ctacaggcat 81901 gcactaccac gcccaactaa tttttgtatt tttagtagag atgtggtttt accatgtttg 81961 ccaggatggt cttgatctct tgacctcgtg atccgcccgc ctcagcctcc cgaagtgctg 82021 ggattacagg cgtaagccac cacaccagct gatgcacaat tacttttaag atttatcaat 82081 attgcttatg tgaatagttc atttgtattg ctgagaaata gtctgtggat atgtcacaat 82141 ttggataaac agttggatgc tttccagttt tgggctgttg caaataatgc tactatgaac 82201 ataggcatat aactcttaat atgcagaaat gctttatttc tcctgtgtca gtagtacagt 82261 gtacgtgaat gcgtcacttt taaagacaag gccagactat aacctaactg taccatttgc 82321 attctcatca acagtatgtg aaccttctta tttctctaca ttcttgccat acttggtatg 82381 gttagtcttt ttaattttag ttgttttaat agctgtataa tagcatctgt ctgtgatttt 82441 attgcatttt tctggtattt tatggtattg aacatttgaa catcttgtat ctagttgcca 82501 cctgtatatc ctctttggaa gaatgtttat taatgtcttt tgcttatttt taaatggatg 82561 ttggtgatac tttttcatta ataaatagac gtataacaaa gacttatgtt gaaactttac 82621 tcattcttca gtaatgtgat gcacttattt gctaaatgat aaaattgttt tcaaatactg 82681 ggaaaataat ttctcagttt tttggtctat tcacagtgta acaactacag acactatata 82741 cctgaaacac taccttcatt gacaaattct ccatcacttt cttaggtcta accattcaat 82801 gaaacaaata agccctggtt tttaatattt gccaatttcc ctggtaaaat actcccacca 82861 tggccaatta caaggtgtca aaataatgtc actgaacatg aattgagaag aagtgggtag 82921 cgtcacacca ctgtattcta ctttcaccat gccaatacag tagccgcaga taacctgaag 82981 atcatgtatg taggaaccat acagcctgtg gtgtggcaag actgatgcca tcttgaagtg 83041 aagctgtcat ggtcacctat gttctgaagc agtagcatag atatcatcaa aatgcttctc 83101 tcccccaatg gtcatgagtc ttggcaagaa agtctgaaga cgtgaagagc tgcacgtttt 83161 accctaaaaa ctccaggccg ggcgtgttgg ctcttgcctg taatcccaac actttgggag 83221 gccaaggtgg gcagatcacg aggtcaggag atggagacca tcctggctaa catggtgaaa 83281 ccccatctct actaaaaata gaaaaattag ccggttgtgg tggcttacgg gtgcctgtaa 83341 tcccagctac ttgggaggct gaggcaggag aatcacttga atccaggagg tggaggttgc 83401 agtgagccga gatcacgcca ttgcactcca gcctggcaac agagcaagac tccgtctaga 83461 aaaacaaaca aacaaaaaac tccagccttg ttgaccacta caaaaagggt cctataaagg 83521 atgtaaagga aactgccttc tggagggcag atacagggat ccattgtctc acagctgcac 83581 aagacatcac ttctgttcat gagtccctat gaaatgtttc tgaaaacaat ttggcctctt 83641 cctccttctt tggcctttca gcttcgttgg cctttgggat agattttcat acacctgctc 83701 actatagaac aatgtatata taagagtaaa acagtaaaat gattaagaat tgacttttga 83761 gtatttatta cttgtgtttt taaaagtggc ttaagtgaaa tctagttgtc ttaaattctg 83821 catatttaaa gtatacaatt tctattttaa tatatactcg aaaccataag tttccacctg 83881 ccctttttag tccatccctc ttaacaagta atgggtacaa aaacttattt ttcctcttct 83941 aatggatatt ttcaagtata tacacaaatg gaaaggactg tgttcagatg cactatgaaa 84001 cctttctaaa tcttccatta ttacttatca tttagtagcc attcttgttt caactctttt 84061 cacacataat tattttactt acaattccca catcttgtca ttcaatgctg cacctcttca 84121 gaatgcattt tcaagcataa gagctctttc ataattccta taactttatc aatcattatt 84181 taaggcatca taaaatatat catcagtgct cagtcacctc gaattgtctc caaaattgat 84241 ttttaagaca aattcattgc aatcagatcc aaaatgaatc tatatggcta ttagtagttt 84301 catctggtga gtctataata atacacagct accacttttt attattattc ttgaagttga 84361 tgagacattt gtccagaatt tgccacattc tggaatggtg cgtgtttttg ttttgtgtct 84421 ctctgatgtc atttaacatg ctgacccttt ctccatattc ctgtgcacta gatattaagg 84481 aaatgggtaa ctgaactctg gtcctgcaga taggtgtgca atagtaacac acttatgcag 84541 tgtgactgtg ttgtgtgtat gtgtatgtct attcaaaagg tgacattcac agacatctgg 84601 aggcttagag attattgaaa aagtaaaaaa ggaagaagca ctttgttttt gcttcctcta 84661 agatacctga gaaggtttct caactgattt gaaatgtgaa acaatgccaa aacaattacc 84721 atagatccag taatactgct tcttggtttt actgaaatga gttaaacatt tatgtccaca 84781 caaaatcttg tacatgaata tttatagcac ctttgttcat aattggcaaa atttggtagc 84841 aaccaaaatg tccttcagta agtgaacagg taagtgtggt acatttagaa aatggtatat 84901 ttatccacat taaaataaat aataaaagat atggaagaac ttaaatacat atttctaagt 84961 gaaaggatct gatatggttt ggctgtgtcc ccactcaaat ctcatcttga attataagta 85021 ccacatgtca tgggaggaac ccagtgggag gtgattgaat tataggagtg ggtttttccc 85081 acgctgttct catgacagta aattagtctc acgagatctg atggctttaa aatcgggagt 85141 ttagctgcac aagcttattc tctttgcctg ctgccatcca cctaggacgt gacttgctcc 85201 tccttgcctt ctgcaataat tgtgaggctt ccccagccac gtggaattgt gagttctcca 85261 ttaaaccgct ttcctttgta aattgcccag tctcgggtat gtctttatca gcagcgcgaa 85321 aacggactat tacaggatcc aaactaaaag gactacatac tgtgatttga actatatgac 85381 attccagaaa aagcaaaact atggagacag taaaaaatga gtggttgcca ggggtttggg 85441 ggaagggagg aagggatgaa caggtggagc acggaggact tttagggcag tgaaactact 85501 ctgtgatact accgtggtgt gtatacaagt cactatgcct ttgtcaaaac ccatataaca 85561 accagctaga gtaaatccta atgtaaacta tggatttggg atgataatga tatgccagcg 85621 taccatcaat tgtaacaaat atactattct agtggggaat atttatagtg gagtggctgt 85681 acatgcatgg gacaagggat atgtggaagc tctataattt tttgctcaat tttcctatga 85741 accaaaactg ctctaaaaaa taatctatta aaactttacc agcctgagca acatggcgag 85801 accccgtgcc tacaaaaaat caaaaaatta tcagggcatg gtggtgcagg tgcatgtctg 85861 ttaccagcta ttctggaggc tgaggtggga gtattgcttg agcccaggaa ttcaaagcta 85921 cagtgagctc tgattgtgcc actgcactcc agcctggatg acagagtgag accctgtctc 85981 taaaaaaatt aaaattaact aatttgcaaa ttagtttaaa atatgtattt tgcaatgttt 86041 gtgctgtgta tttgtataac tcaagggaga ttaattttaa caaattgaat ttggcaagcc 86101 agttatttcc atgagcagcc tagttaattt atctacttaa acaacaaaaa caagtagata 86161 cacagatttt ctggcacttt ccaagctcac tatttcccta ctttgactat tggcctaaga 86221 aattgccata agagctgtgt gtacctgtag ccccagctac ttgggaggct gaggcggaag 86281 gatcacttga gaccaggagt tttaaaccag cctgggcaac atagcaagaa ccctgtctgt 86341 aaaaaaaaaa tatatatata ttttaaaaat tagctgggca cagtagtgag tgcctgcaat 86401 cccatatact caggaagctg acacagaaat gtcgctgaag ccgaagagtt tgaggctgca 86461 gtaatctatg atcctgccac tgcactccag cctgggcaac agagcaagac tcttcctcaa 86521 aaaaaaaaaa ttaccattag acactcaaat atgcaggcaa aatgagtcct ttgtggagtc 86581 actcagaagg gctccccact tacctttcta tatgaaatca tgctggagcc acaccttctc 86641 cctttccttt tcagacagga cttaaattca gcaactgaag agctatctgc tctatgtgtc 86701 ttcagggtaa acatttccag agccaagcca aatgtactta tagacttata tatgactgca 86761 ctaaaaaaag caaagtaagt ctatgttcag ttcaattaat tatgttcaat attctttttt 86821 tttttctctc tactgagatg gagtctcact ctgtcaccag gcaggagtgc agtggtgtga 86881 tctcggctca ctgcaacctc cacctcctgg gttcaagcgc ttctcctgcc tcagcctccc 86941 aagtagctgg gactacaggc gcacgccacc acacccagct aatttttgta tttttagtag 87001 agacagggct tcaccatgtt ggccaggctg gtcttgatct cttgacctca tgagccacct 87061 gccttggcct cctaaagtgc tgggattaca ggcatgagcc accgcgcttg gcctagccac 87121 aagattatta atagttgtga cgtataatgt tttgtgtcga cttgcctgca ttaagggcta 87181 cccaaatagt tgggaaaacc ttagtacagg atatgtctgt gagaatgttt atggaagata 87241 ttagcagttg agtctgtaga caatgtaaag gagatctggc ctcatcagtg ttaggggcca 87301 acatccaatg ctttgagagc caagatagaa caaaaaggca gagaaagagc aagttactct 87361 cttcttgccc tggatcatcc accttctgtt ttccaacatt aaagttcttg gatcttgggc 87421 ctttggactc tggaacatac aagaatgacc ctcacccagt tctcaggcct tcagcctcca 87481 actgggaatt tcaccatagg ctcccttggt tttcaggcct tcagatgcag actgaatgat 87541 accacccacc ttcgtgtctt accaggttgc aaaaggtata tcctgggact tcttagcctc 87601 cataactgga tgagccaagt cccataataa atcccctctt gtacatctat gtatcctatt 87661 ggttctgtgt ttctctggag aatcctgact aatacaatag gtttcagtaa ttttcatttc 87721 acaactgttt ttctaagatg atctcaactc cccatgcaag cactgtccct gcagtaacag 87781 tatcaatgat caaaatttcc catcaaagag atgaaagcta ggtgtcccaa acctttattc 87841 tggtattctg atgtcccctt tgcacaagtg ctctgactgt gaacactaca gacactgttc 87901 tactttaaaa tggcatatgc taagtggctc ttctgtttag aaagttgcta caatcctaaa 87961 agaagcatat ctaagaatat ggaggaagaa atattatatg aatacaacac tgttccttac 88021 agggctagac acactcaccg attacacagg caggagaggc attttctgtg agacaaaatt 88081 cctctttcgt cctccagctt cggtgcgatc taacggatgt tttatcattc ctgtcatggc 88141 gctatgtaaa ctctatttcc cttattcctt actgagcata atgaaactca caaagctgac 88201 tctaacgttc tactttacat aaatgtgtgg agaaagaagg taatgtttac taggctctcg 88261 ccaagggtag aggcattttg tatctgaata actctgacta cctaggtgac tagttgtaca 88321 caataccctc caaaaatctc tgtatggtac cttaaaaaag cagcgaagct cctcccttgt 88381 ccaaggaggc tggagacacc acctctcagc cttcccctcc acctttctcc atagcaacgg 88441 ggtcattcct ccctgaggtc caggagaagg gcgacctctg ccagcccaag aagcggtgag 88501 aactacatta cccagaggcc cgtgagccag gttcttggcg tggagccaat gaggctcagg 88561 ggagggacat ttccgctctg tccaacttgt cggagcggaa cttccggcgt cctccctgtg 88621 gcgggcactt tggcttgtgt cagttccatc cgcgggtgcc ggatctggac ctaggtgctg 88681 acagcgagaa ggcgcgagga gagtcgtttt ctcagctgca cagccggggc ctgacggtcg 88741 ccggcggtgg tgacagcttt gctcttgtct ccgcccggat cgtccaccgc tcccggcccg 88801 ctccgcccag agtccgatgg cggcggcact gagggccccg acccaggtga gcgctgcgtc 88861 ctcccggcct cctctgccct ccccacccga aactgaggga cgcgtgaagg gttttctgca 88921 gcccggaccg cactgtccgg cacagtgagg cgctggtgct ggatctcgtt tctggtagtg 88981 tggcagggag cgggagagcg acacggtcag ggggctgcgc gggcaacagc ttggagctgc 89041 gcctgagccg ggaggctggg gagacccagg ccggcctcac gcgtgcaggt agcagagggt 89101 ggggcagcgc cgcgctgcat tggccccact gagtttcacc agcacactga gggccacagc 89161 gcgtaggaca gaagaggggc acggccccag tcacgcttct aaaagtttca caaaagctgt 89221 tcagaggaac gcgctggaag ggcagtgatg atagcaggca tggagggacc tagtcttttt 89281 acgaggtcac aaagctggta aaagtcagca ccaaggactg acacccagat acaaagcagt 89341 catccctgga cacctggctc tggtgttcca cattggaacc agaagtccat ggagccacac 89401 tttggtcacc tgcctctcag aacatgtttc ctcccacaga ttgccactag tacttacagg 89461 tccgtaagaa gcacttgaag accctacaag ggtaagtgga ggaaatccag aagctcattg 89521 attccagcaa tgggatctct aggattggga ctgtggtgac ctgggattct ttacttaccg 89581 cttttccagg ccttgagcct agggactctg aacaaaaata tgtccagggt gagtattata 89641 tgaggtggtg cctgttgctg gcaaataata ggatgaatgt ggaagtttgt cccagagata 89701 agtacaggtg tttggagctg cttggaggtg aggctgagct ggttaagaga acctcaagtc 89761 tcatttcctt ctcatgggaa cagagagcag aggtacagga gttagaaact aaatggctgt 89821 ctcagatgtc tttgttctgg agatgatggc agcaatgaag atttggagca gaagagggag 89881 gtgatcttcc ccaggcctca tgaggctcct gtggctgagc acagacaaca gactgtgggt 89941 ttaagggagt tggtagcaat accaaagagg aaagaggaga ctactgcaat agtccaggtg 90001 agtgtagatg atgtgggcca gagtagtggc cacagtgtgg cagaaatggt tgaattttgg 90061 attgtaggtc ggctctgatt ttctggtatt gggactgaga gataagaagg ggtgactcaa 90121 ggctttttga gtttagtagc tggaagcata gaagctccat ctactgggag agccaagtag 90181 atagtagaat tgtttttcac tcaggacatc aggaatgtgt tttgtccatg ttaaaaatat 90241 gagatgtttt agagaataag taagtagcta tatcaagtgg ctaggtggac acacaggtct 90301 tggactctga ggaagggtgc aggatagaga cttctacaaa gagtggagga tattattgaa 90361 ctaggatgga gccctatgtc tgttaggcca tttgcattac tataaagaaa tacctaaggc 90421 tgggtagttt ataaagaaac aaggtttatt acagcttata gttctgcagg ctgtacagga 90481 agcttagcgc tagtctccac ttctggtgag ggcctcagga agcttccaat cattgcagaa 90541 ggcaaaaagg gagccagcaa ggcaggaggg aggcaccagt ctctttttag caaccaggtc 90601 tcatgtgaac ttattactgt tggaaggata ctatgccatt catgagggat ccacctccat 90661 gacccagata acctcccact aggccccact tccaacattg gggatcacat ttccaggtga 90721 gatttgaagg ggacaaatat ccaggctata tcagtgtctt accttgggtt gataatgaaa 90781 taacctatat tggtggctta aacaacaaaa atttgtttct cacagttctg caggctagga 90841 aattctacat cagggtgcct ctagattcac ttatttgtga tgggctgctt cttgacttgc 90901 agtggctgcc ctcttgcttt ttcctcacat ggcctttcct gggagggagt atgtaaagaa 90961 agcaaggaat ctctttcttt ctattcttac aaaagcatta attccatcat gagggcccca 91021 ctcttttgac cttatctaaa tctaattacc acccaaagat tccatctcca aatactatca 91081 cattatatta ggcttcattg tgttagtgct tcaatgtaag cattttgtgg gtgggcgtgg 91141 ggcagaaaca ttaagcccat aacatcatag atcccattta gtagtgggat tcctggtcat 91201 gatgataata ttaatagcag gccaggaagg ggaagggtgg gagccacaag agggtaccat 91261 acttgggtat ctggatggat gttgtggaaa tcacaaatac aggttataga gggaagcagc 91321 cctaggaata atacagagag aggataccag gcaggtggcc aagggttatg aaggatgagt 91381 gaatataaga tggatgtgga ggctgcatta gtccattttc gtgctgctga taaagacata 91441 cctgagactg ggaagaaaaa gagggttaat ggacttgcag ttccacatgg ctggggaggc 91501 ctcacaatca tggtggaagg caaggaagag caagtcacat cttacatgga tggcagcagg 91561 caaaaagaga gtttgtgcag gggaactcct ctttttaaaa ccatcagttc ttcttttttt 91621 gagatggagt cttgctctgt tccccaggct ggagggcagt ggtgagatct cagctcccta 91681 cagcctccac ctcccagact caagggattc tcctgtctca gcctcccaag tagctgggat 91741 tacaggcatc caccaccaca cccagctaat tttcgtattt ttagtagaga cggggtttca 91801 ccacattggc caggctggtc tcaagctcct gatcaagtga tccacccgcc ttggcctccc 91861 aaagtgctgg gattagaggc gtgagccact gcaaccagcc ttaaaaccat cagttctcgt 91921 gagacttatt cactattgcg agaacagcac aagaaagacc cacccccatg attcaggtac 91981 ctcctactgg atccctccca tgacacatgg gaattgtggg agttacaatt caagatgaga 92041 tttgggtgga gacacagcca aaccatatta gagacataat gggatttcag acaagatgag 92101 gctgaagcca gacctaatgc tttactgcta tatagcctca ccaggatctt atgactggct 92161 cctatgccta gggagtttca tgcaggctgg tggatcagtt taggggcaga agtagtcatc 92221 tgggagcatc acttaacctt gggtggaggg ccaccaggca aggttttgaa tgaagggtag 92281 caagacagtt actgttactg ttactgttta gtgcagttac tgtttagtgc agtacaagtt 92341 ggaaagggcc atgtgtatct gagatatgta tgtggttctg ggtggctgag gttgatgaaa 92401 ggagtagact caggtaggaa gggcttcaga gcaggatatg gggctgcctg tggcagtagg 92461 gaccatcata ggtctgggta gtcctggaac ttgtttgaat ttgcaagcac tttctagatg 92521 tcttgtgctg agtaacatat aggaaggcag gagagatgcc tgtgttgtgg gatcggggtg 92581 gaggtggctg agggagtgag ttgggtaaga gaggtgtggt ggaaagagtg atgcacatat 92641 gaagacggag tgtgagcttt tgtttcatga gtctggacat tgaggacatg agggtgaaat 92701 gtgaacccag accttcagtt gtgtttggga gccatggtct gagggattcc aggagaattt 92761 atagacacaa gaatgtggtt tctatctgtc cacctgcctg ttgttgctgt gggagaacac 92821 agtgaccaat gagaccatga gagagtaggc atcagtggca tcagggcctg ttgtattcgc 92881 tgaggtgggg agatggggag gagggtgagc cagggatgga tgtttggggg aatcaacttg 92941 gggtcctagg aaagatgctg accatggaat gaactgtccc catcatagca ggtttttgta 93001 gcctttgagg atgtggccat ttacttctcc caggaggagt gggagctcct tgatgagatg 93061 cagaggctcc tgtaccgcga tgtgatgctg gagaactttg cagttatggc atccctaggt 93121 aaggccctcc tattcctccc agtttccttt gtgggtctct ccttttcccc tgtccctgag 93181 gcagctctgt cctttcttca gttagaccct gagctctaca gaccttccca ctttcttggc 93241 atatgtattc tgagggacat cgccaagcaa ggcactctca gactgtccta acacctgttt 93301 cctctaggta tgcagtgaag ggtctgggag acaggatctt tctttcaacc taaaggattt 93361 gtccttgcct tctccatgac aggatggctt tctccagttc taggatgctc tttgttccat 93421 gttctgcctt ttgcttctaa ctaaaatctc aagctgcctg tttcaggaat tgcaggtatc 93481 atcatgttaa ctacttaact ggtccatagg cttttcataa agatccacct gaaagatttt 93541 cttgtaacat tatttttcac tgctgtagtt tgtcatgtgc tgagttgctc tgtgccagtg 93601 ctggctgtgc acctctctgg tcttcccttt agggcataca tcttataggt ctggtatagt 93661 tgcacaactg ggattcgagg atggctctgg ttgctttaca gggtgcaaac agcaagatag 93721 tctcagagga ggcctggccc taatgagtga caactaaggg agggcaagac actagggctg 93781 tttctaatca caccataaac ctgtcctgag ttctcagtga ttgggtgctg attccatagg 93841 acatatatca tttctctctt ttttttcttt tctacacatg ttctgatttt tatttcatcc 93901 atgacttcca gcccttagct gctaccaatt tctggcctcc atgtgtcacc tgttcctttc 93961 cagtgcaact tctagagtgt tctctaacac tgacccacat gtaatatgtt gtcagatgtg 94021 cacatgtttt taaatagtcc ttccacacct agtattcagt tcccatgggt gtctactggg 94081 cctggacacc aataatcttc tgtgcagtgt accgaagtcc tctcttctca tcaggggagg 94141 aagtcctgat aaaagctttt ggcagaatag tgagtcagag accctacctg ttctctctga 94201 gttgtgcccc atctttgtgt ctttcatctc ataaggcctc ccttgttctg tgacttgtag 94261 aggctttttt ccacaaattt ccaccagctc tcatatccta aaaacatttt cttagactca 94321 ttctttgaca cacatttttg atgtaacttc tcacaccata gtaaatctgt gttttctcag 94381 catttctctt ctttcaggtt gctggtgtgg agcagtagat gaggggacgc cttctgcaga 94441 gagcgtttct gtggaagaac tgtcacaggg caggactcca aaggcagata catccactga 94501 taagagtcac ccctgtgaga tttgtacccc agtcctgaga gacattttac aaatgattga 94561 gctccatgcc tcaccctgtg gacagaaatt gtacttgggt ggagcatcaa gagatttctg 94621 gatgagttca aaccttcacc agctccagaa gcttgataat ggagagaagc tctttaaagt 94681 ggatggggac caggcctcat ttatgatgaa ctgcaggttc catgtgtcag gaaaaccctt 94741 cacgtttggg gaagtcggga gggacttttc agccacctca ggacttctcc agcatcaggt 94801 gactcccacc attgagagac cacacagcag gattagacac ttgagagttc ccactggacg 94861 aaagcctctc aaatacactg aatccaggaa atcttttaga gagaaatctg tattcattca 94921 acaccaaaga gctgactctg gagaaaggcc ttacaagtgc agtgaatgtg ggaaatcctt 94981 tagtcaaagt tctggctttc ttcgacacag gaaagcacac ggtagaacaa ggactcatga 95041 atgtagtgaa tgtgggaaat catttagtcg caaaactcac ctaactcaac accaaagagt 95101 tcacactgga gaaaggcctt atgactgcag tgaatgtggc aaatcctttc gccaggtatc 95161 tgtcctcatt caacatcaac gagttcacac tggagaaagg ccttatgagt gcagtgaatg 95221 tgggaaatct tttagccaca gcactaacct ctatcgtcac aggagtgccc acactagcac 95281 aaggccttat gagtgcagtg aatgtggaaa atcctttagc catagcacta acctctttcg 95341 acactggaga gttcacactg gagtaaggcc ttatgagtgt agtgaatgtg ggaaagcatt 95401 tagttgcaat atctacctta ttcaccacca aagatttcac actggagaaa gaccttatgt 95461 gtgcagtgaa tgtgggaaat catttggcca gaaatctgtc ctcattcaac accaaagagt 95521 tcacactgga gaaaggcctt atgagtgcag tgaatgtggg aaagttttta gccaaagctc 95581 tggcctcttt cgacacagaa gagctcacac taaaacaaag ccttatgagt gcagtgaatg 95641 tgaaaaatca tttagttgca aaactgacct cattcgacac cagacagttc acactggaga 95701 aaggccttat gagtgcagtg tatgtgggaa atcttttatc cgaaaaaccc acctcattcg 95761 acaccagact gttcacacta atgaaaggcc ttatgagtgc gatgaatgtg ggaaatccta 95821 tagccaaagc tctgccctcc ttcagcatag gagagttcac actggagaaa ggccttatga 95881 gtgcagagaa tgtgggaaat cttttacccg caaaaatcac ctcattcaac acaagacagt 95941 tcacactgga gaaaggcctt atgaatgcag tgaatgtgga aaatccttta gccaaagctc 96001 tggcctctta agacacagaa gagttcatgt gcagtgaatg tgggaaatta ttttgtcact 96061 tttttttttt tttttgagat ggaattttgc tcgtcaccca ggctggagtg caattgtgca 96121 atctcagctc actgcaacct ccgcctcctg ggatcaagtg attctcctgc cacagcctcc 96181 tgagtacctg agattacagg cacccaccac catgcccagc taatttttgt atttttaata 96241 gaaacaaggt ttcaccatgt tggtcaggct ggtgttgaac tcctgacctc aggtaatcca 96301 cctgcttcag cctaccaaag tgctgggatt acaggcgtga gccaccatgt ccggcttgtg 96361 ttgtgacatt taacacaaga tttcccaaag aagaaaggcc ttatatatgc tgtgaatgtg 96421 tttttgcttg ttagtttgtt ttaacttggt atgacggcac tggaggctag cctttgagaa 96481 gagccttctc tgatgtgact catgtatcca aacatccatg gatttcccat aaatttgagg 96541 tatgtgggaa gcatgtgtag ctatgttgta ctctctaaac ctgcccaggg accttaccag 96601 attcgtgtca ctgccagttt ctgcggctga agccttttca tcactacccg ctgggagacc 96661 cacagagtgt gcatcagtca ccaccccaac atgctcaggg aggcagagtt cttccattag 96721 ttggaggaaa gcacgagtag tctaagctct taggggggtt tctcatttcc tcctgtggct 96781 acatagtgca tataactacc ccagtgttgg ccccaggacc tgatctgagt tctgctggca 96841 gcttccagag gattgccttt tttgaggaca ttgttgatct caggcgatgc ttgtgatgaa 96901 gcatttttta gcttccacca cttaccagga actgcctgtt aactggcttg tgcaatgata 96961 aaggcctttt gtttaagtac ctttaatctt gtaagttgta ttaattgttg agtagtttgg 97021 aaacataata ggactctgtc agcttaaaag ggtatataac atttgtgggg attcaaattt 97081 tgtttgcctc agaggggaca gtatgctcac agaatataga ttttgctcca tttttggcct 97141 atttcgcatt gtttccaagg ccttctagga gacactgcag ggctttctgg ctctgatttt 97201 tgcctgcatg tgggctcaga ggtcattctg cagagggggc agttgtggca acttccattg 97261 tttctggaga caacacaaag tttttcttgt tgagtgagga tatcacataa tgccctgcaa 97321 tgtttagaat cccttaatga ggtcctgtga aggagctgtg tcccttgaaa cacagaattc 97381 agtgagctag tgtccctagc tataaaacca tggggctgga gcgaaacata attggtaaag 97441 aaacgctggg ttgaataata gggaatggag gagactgcag caaattagat ggactgttcc 97501 tcatataaag tttcatccgg ccgggcgcag tggctcacac ctttaatcct agcactttgg 97561 gaggctgagg tgggtggatc acaaggtcgg gagttcaata acagcctggc caaggtggtg 97621 aaaccctgga aggctgaggc agataattgc ttgaacttgg gaggcagagg ttgcagtgag 97681 ctcagatcac accactgcac tccagcctgg gtgacagagc gagactgcgt ctcaaaaaaa 97741 aaaaaaaaaa gatccattca ggccagtcac agtggctcat gcgtgtaatc cctacacttt 97801 ccgaggccat ggcgtgcaga tcacttgagg tcaggagtca gagaccagac tggccaacat 97861 ggtgaaaccc catctctact gaaaatacaa aaattaacca ggtgcagtgg tgcacgtcta 97921 taatcccagc tgctctgaag gctgacgcac caagaatagc ttgaacccgg aagtggaggt 97981 tgcagtgagc tgagaccaca ccactgcact ccagcccggg tgatggagtg agactctgtt 98041 tccaaaaaaa aaaaggctgg gcgtggtggc tcacgcctgt aatcccagca ctttaggagg 98101 ccgaggtggg tggataaccc aaggtcagga gttcgagacc agcctggtca acatggcgaa 98161 accctgtctc tactaaaagt acaaaaattg taaaagtaaa aagtaaaagt aaaagctggg 98221 tgtggtggca ggcaccagta atcctagcta ctcaggagac aggcagaaga atctcttgaa 98281 cctgggaggt gagccgagat cgcaccattg cacttcggcc tgggcaacaa cagcgaaact 98341 ctgtctcaaa aaaaaagtat ccacaaatgt tccagtgact atgactgcaa gatcactgtg 98401 tgcagaaatc tctagaaatg tgtgtcttgt agctcaagtt attactagag acaacaaggg 98461 ttttgtggat tcctgtagcc tctctggcct gtttacttca aggccattgc tgcacagaca 98521 aaaaaaaaaa aggatgaagt caagagagat tcggttgtcc ttggtgcggc ttttgtgatc 98581 cagtagcaat cctcttaacg ctgatggaaa tggccttcat tgctcacctg tatttcctgc 98641 cactgaagca ggtctgcctc tacgtggcct ggggagctag atggtagagt caatagagac 98701 ttagtaaacc agactttttg aagtgtcaat gaccaagatt gttattgcat ccattaaaaa 98761 aaaatcaagt tataattgat gtggaataaa ttatatatat ttaaagtgta caacttgata 98821 agttctgaca taaatgtaaa cccatggaag cattgttgta atcataagaa tgagcatatc 98881 tcacctgcat attttacttt tgctctttta taatacattc cagaatatgc aggcaacttt 98941 taacattcct ttacaattga ttgacttgaa ttttctggaa ttttatttta gacatcataa 99001 agtgtgtatt tttttgtgct tttatttata aaacttcccc ttaggtaagg gcctgaggtt 99061 aactttggag gtgacagata tgtccaacac cattgatagg gtgatgattt caaaagtgta 99121 catatgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tatacttact gttttaatat 99181 gtgcagtaca ttgcatgttg attacacctc aatgaaccta taaaaagtta ttccatttaa 99241 aaatctgggc ctcccacttc tattggaaac tggacaactt gacagcctgc agtgttttga 99301 aacagcatgg attgggtgtc ttgtttccag catatgtccc atgttcccca acactgttcc 99361 tcaggaaaca gggaggaaca gctgttcctc aggacctgct gagtggccat attccctgaa 99421 ggccagaaaa gtaacaaggt gaccgttttg gacaaaattc tgagcttaaa accttgcctg 99481 aatcagtagc ccccaaattc attgtgcaca tcttataatt tccaagaaca cacatattaa 99541 gtgaatattt ttatgtttat aattgtgaaa tggtctgttc agggctgtca ttcctcaacc 99601 ttttttttca accaccctct cattaaggaa gcatatgtat taatcatggt ttctgttttt 99661 tgtattatat gtcttgacat cttaaaattc ttgctggtga cagagagact taccctacca 99721 gaggtagcta atttctaaaa atagtaaaca actaccttga gagcaaggca agccaactac 99781 cctctttatc taactcttac tcaccaagcc aatactttcc ctgcactaaa tcatcccagc 99841 gttactgcca gtgattttgc agcagtctgc agcaattttg gtccttgcct cctcagaaga 99901 aaattcaacc gaggaggcag aggttggctt aaggcacagg aagagaccaa ggcaagtttt 99961 ttttagcagg agtgagagtt tattaaaatg ttttagagta ggagtgaaag gaatcaaagt 100021 acacttggaa aagggccaag tggataactt gagaggtcaa gtgccccact tgatccttgg 100081 cttgggactt tatacatttg cttatttccc aggctcttta ttctccttcc ttcggtaggc 100141 tgttgcttga ctactgcatg cacagtggtc tgccagtact tgggaggggc cgcatgcagt 100201 ttgtttactg aagtcataca tatactcatt tggggtgatc ttcccttaac aagtcatgca 100261 cccccagagg aaggtcaaat accagtcaaa ttttgccatc ttgccccatt ttgtgcatgc 100321 tcgaaacctt atcaggggtt gtggttcact ggctccaggt gttttctatc tgttggaaaa 100381 ctcttttccc tcctggtacc aatcatggcc acttatcatc tcagagggat agttttatgc 100441 tttcctgtcc ttcacccaat gagagctgga catctcgggg gtccctctct tgttttgttt 100501 attatctcag aggaaaagat ttatgattgc ctgtccttga ctccaaaatt gcctgacatt 100561 cctggggtcc ctctctcctg ccctactctt atctacctac tctaacacca gggtgagggt 100621 atgaggcaac tagagaccac ccctgtaaat cgaagccaac cagaattatt cagactagcc 100681 agtactcagc tgtttgctct gatctggcct aggctttccc ctagctcttg cttcttcctt 100741 tagactgaaa cctgttcctc ctgtggccct gcatggcata ctgtacttcc catttctagg 100801 gaaattctga gtaacattaa acttttattt cactggcatt gagctctcca tgtccttttc 100861 caataaagac ccatccacaa attggcctct cttggctatg taatattaca tatctttcca 100921 gcacttctca atgccttcac cttctctttt ctccattatg cctgccacat tctaacattc 100981 catataattt agcatttttt tcatatttcc tcctctttct tgagtacagc gtaagctatt 101041 ggggtctgtc tcttgttttg taaatgtccc atgcagttaa gatagtgtct gaaatacagt 101101 aggtgttcct tgaatatttc ttagaactat gaataagcat aatactacag catccctgtt 101161 caccttaata ttctgatagc actttctaaa tgttttcttt acttctcctt ccaggtatta 101221 ttgagtgttt tcaagaatga aaagaggcct ggcatggtgg ctcacatctg taatcccaac 101281 tttttgggag gctgaggtgg gaggatccct tgagcccaag agtttgaggc tgcagcaagc 101341 tatgatcgtg ccactacact ccagcctgtg caacacagtg agatcctgtc tccaaaatta 101401 aataaataaa taaaattaca ttttggagac taaacgctgc ttgtagtcat agttctaaca 101461 cagctgtgtg ctcagcttca tttgctcact tctcagaatt caatgtctcc tgctcaaatt 101521 ttgctgtcat cttaaccgtt ttcacttatc acagatatgt ttctccaggt ccatttcgtc 101581 atgtatctgt gaataactac agaattatta actttgagta atttggccgg gcgcgggggc 101641 tctcctgtaa tcccagcact ttgggaggct gaggcgggca gatcacgagg tcaggagatc 101701 gagaccatcc tggctaacac ggtgaaaccc catctctact aaaaatacaa aaagttagcc 101761 gggcatggtg gcaggcgcct gtaatcccag ctactccgga ggctgaggca ggagaatagt 101821 gtgaacctgg gaggcggagc ctgcagtgag ctatggtcgc accactgcac tccagcctgg 101881 gcgactgagc aagactccgt ctcaaaaaaa aaaaacacaa aacaactttt gagtaattta 101941 aacaattcac aaatgttttt gtctttcttt ccccatacac tcgccctttt cttttttaaa 102001 accataaata aatctcaatt ctcaattcag acactgtcgg tgtaattgca ggtgaaatga 102061 gaatcacctc ccagccaaga aatctgatcc caattttcca aatctttatt tctgggttca 102121 gacaccacat tttcagcccc tgtcttggca tataaaagca tttataatgg ttcttctttt 102181 cagataagaa acatcaccaa aaggaaatag atatatctca acttttgggc taggaactct 102241 catttggata aaagaatatt tctcctttgg ttggacatta gtgaccatac aggcaggaaa 102301 ggtgccttcc ttgaccacct tctctcgcct gtcttcctcc tcatggttct ctcaggtatt 102361 ttttttctga ggaaaccaga ggaagagctc tatattagtc cacttgcctc agggagctcc 102421 agcatccctc catagccact gtctggagac cattcttcca ctccatggcc ctgcctgggg 102481 acaattctct gttaactctg acccaactga ggctcacaaa actgactcaa agcttccctc 102541 tacagtacac atatttttta aatgaaggaa atttgactga actatgagtg gactcagagt 102601 gacattggat gttgtgtgtc catttggtgg tgtggcttca gggaacatct gtaagaatat 102661 atcccaatat ttcggggatc cctgcagacc tcactacaca agacctggtt gagggtcctt 102721 taaaaggagc gaagccacgc cctgacctca gagtctcagg gcataatttc tcagcgcccc 102781 tcgcagtacc atttggcctt tgcccagcgt gcaggggagt tagccgtcac cacggccaag 102841 tgcacatgag aactgcattg cccagaaacc tgtgcgccgc ccggcggcgg cactcttagg 102901 ggcgtctccc tgcggacgga agctctctgg gcgggacttc cggtatcttc ctcgcggtgg 102961 acatcttgtc ggctcttagg tggaaccatc ggagcagaag ctcggggttg ctgggcggtt 103021 ccgaggtgac ggaagcggga gggtgcggga gaagtcgctg ttcgctctgc ggagtggctc 103081 gccagcgaag accccgcctg cgcccccggg gacggacgac cgcggtgcca gggtcccgcg 103141 acctgggacc ccctcgcggc tccgggtggt ctacgaactg tgatggcggc ggccgcggtg 103201 atgggcccgg cgcaggtggg tgctgccttt cccagacttt cgcccgcccc aaatcctgaa 103261 gttccaaatg aggagcgcct gtctgagtcc ctgcagcgca ggccccagtg tccaaggcag 103321 cggggcgctg gtgggtgggg gcgagtgtga ctggcagagg ggcagcctga gcataggttt 103381 ggagctggac tgagcccgta gcagtcggga gcgtgtgtga accgtagtca ggcctgcaat 103441 gtcgagggga gaagttgctc cttcattgcg aggacgatag gagccatggc gggttttgaa 103501 tggtggaggg aagggatccg aaaaaggatt tttaaagtat tccaatgttt gctgaggagg 103561 aaaccgacta cagtgaggta gaaacgatga ggatggaggc aaggagacgt ttgaggaggt 103621 ccctgcaaca aactccagaa gtgttgcggt ggtggctggg ccagagcagt ggcaggaggg 103681 gttgggtggg gaagtcatga gattctgggt agatttttaa agatggaacc aatggggttt 103741 cctgccgcat cagatgtggt cgtgagtgaa tgtagggagg aaagggctat ccagggtttt 103801 tttggcctgt tttccttcct gaacgtgtga aagaatggaa attggtaagt cacagcaggg 103861 agcacgtttc aggggcatca ggaattgggt tttggacttg gtgaatctaa ggtgttctgc 103921 actggaggtt catcacgggt agctgaacat gcaggtctgg ggttctagag agggagttgt 103981 gctagagact tcaatgggtg aggcataagg tcaaagcctg tagtagatca gatttagagg 104041 gtggaatttc aaggaagtta ttttgtgtgg tctctgaagt gagaggtggg tttagggtgg 104101 aaggccaggg atgaaggggt ggacacttaa gattccttcc agaaaggagg ggtcctggca 104161 tgtgaaagca ttctagatgg tgttgttgga agtcaatgct tggtgctgca aagtgaaacc 104221 agcactcagg caaaagtttt ctcagcaagg caatttactt ctgtgcaagt gtgctgcctg 104281 tgttaatcac gatcccaaga gcacactgaa caaaggaggg aaggggtttt tatctctaac 104341 ccatagtccc taccttcctc tgtgtcactc ccccatgggc tggggtcgga ccgcttaatc 104401 taagctgagc caattgggta tttgtgagta ttttccaaat aaggaagagg gaaggggaag 104461 gtgagttaca gtggtgggac atgtgagttg tgtaggacgt gtggttttgg cgggggcaat 104521 gggtgcaaag tgagtaaggg aacagatgtg aattattggt tagtgcttac aggaaggttg 104581 tttatagtaa ctaggggcaa ggaggcatgg agaacaagaa aggtttgaga acaaagaaca 104641 aggaagttaa caggctaaac ctttgaagag gaattttatt atatcttata gtgtcagcaa 104701 gaggagaggg ctgtagagca taagtagcct ggcctgcact ggagaccaag gtgagggatg 104761 cagtaggttt gccaccctac aggctcagca tgccttcatg ggcgctgcct gtggagcagt 104821 cccatcccca accagtgcat tggtgcctca gttacccaaa ccctcaggta agcagaattg 104881 atgaacatct ctaatgaagg atcactgcag ctccccttca gaggaatatg tatcaggtag 104941 gctggtttag tgcaagggct agtgtacacc ttgcaagtac gcagaaggaa ctaaaaaatg 105001 tatgtacaaa gtaagggaaa tttatactag cacttaatga ttggtcaaaa tattaagttg 105061 ccttgaagta ttttattaca atggaaaaat cttaaccaaa ttttcttaaa agctttaaag 105121 caggcaaaac aaacctaatt tttcaaaaac tcatcagttc atttgcatgg atagtagtag 105181 ctgtttcaaa ggcacttgag atgtacagac aagcctgata caaacagtag ccctatgagg 105241 tgggtgccat tattatcccc acttttaaga tgaggacact cagacgctgg tctttatgtt 105301 cacacagctg gtgggagcca gcactgagat ctgagtatct gaagttagag aggtaaccca 105361 aaatcattcc tgccaaaggg cagggtaggc agggcttagg gaaacagttc agtcgatgag 105421 atcccactgt gggaacctgc cactcccagc ttccctcctc ttgtgaggtt gccatggtgg 105481 cagaaactta tgattcctac ctagatgaga gtagattgtc tcaggccctc acttagccca 105541 acacaccccc attatctcag caactcagtg tggagaatga tagtgttctg ggactcttca 105601 gtgtcccttc tgcctaccgt tcttggaggt cctgcagtca cccaaagctc tgtgttgtaa 105661 attctgtttt ggaattatgt ggaagggttt ctatttctgg caagcaatgg gatgaaatgt 105721 ggaacattat ctcaggcata ggaaccagca tgtgcaaata cttaacttaa ggctaaacta 105781 ggagtgttca gggaacctca ggcctccatt atgtctggtg ggatcagaca gcagggacag 105841 gattcagccc agaaatggca gcctgaggat ctgggtttgt atgctgagag aattgggagc 105901 cactgcaggt ttggagctag gagggagtga tctgactttg gtcttaacca acttcctctg 105961 tacagtgaaa aaagactgtg ggagtgagga tgaaggaagg aataccactg aggagcttaa 106021 tgcaacagtt caggttggac cagagtggtc gctgtagagg tgggagatgt ggatggattc 106081 tgaatcattt ttcttaaagt gggaccaacc agattttctg atgtggaaga tgagaaagaa 106141 aaggcaaggg gtgaaggaag accctggatt ttttagtctg agaagatata aagatgccat 106201 caactgaagt gacaaagttg gtggtaggag tacctttcag aggtaatatc aggagtgcag 106261 tttggctatg tggaatctac catgttccaa agacacttga gtggaggtgt caagtagaca 106321 gctggacaca gagatctgaa gttcagagaa gggattcagg ttggagctat ccacagggtg 106381 gcagccagca tgtggatcag gatggagctt tgcttccaat ttagagagag aatctttagg 106441 tcatggtgac agtgccatca gcacactgta agggcagaca cagcagggta gaggtagggg 106501 ccatcttaga gtctagaaga gggagggaaa tgtccctggg aggaagggga ataatagggt 106561 caggcaggtg gcaaagcgtt ccaggcagag agtggatgtg aggtgaagta ctgtagtaat 106621 agaagtgagg tggcagtgag gccagagcca gtgcttattc agctctttat gtgctcctca 106681 ggcttccatg gtgacctttg gtgggacctg tggagtttca gtcaagctag tgaattagcc 106741 cagggacaga ttgggtcagt aggcagaatc cctgtgcttg gggtaggtgc ccagtgagtg 106801 taggttttga atgagttaaa tgttgagaag cctgtgactg acctcagtag tacaacaaca 106861 aagcagaaaa gggctgagtt tatctgggat ctgcacatgg ttctggattg ctgaattgac 106921 ccatgaaata gagccaggag gggaagattt aataggtgga ttcagggcaa cccatggcag 106981 tgtggccgtt gatcaggacc agatgtaatc ctggatgcat tatccaagca ttctctggat 107041 gctgtgtgca gatttgcatg tggggaagcc atggaggtac cagtgttgtg ggtgtccata 107101 gatgtaaagg aggtatagcc aaaagggtga tgtgcacatg gagatggagt gtgaattcat 107161 gagcccagag ctttggtatg gagggtttct gaaagaatgc ttagctcagg tttgtgtatg 107221 aggaataatc tgttgatccc cagggagctg gctgatagga gaagggttgg gtcctacata 107281 ccaggcccat ggtgttttat ctgtccactt gcctgttctg agctggacaa aatgattaat 107341 ggaatatgag tcgagaacag gcaccaatca caccagggcc tgttatctca tttgagttag 107401 atgtgacaag gagtcctgga gagagaggct gtccttaagc tcagttgtcc acctcacaca 107461 gggctatgtg accttcaagg atttatttgt ttacttctct taggaggaat gggagctgct 107521 ggatgaggct cagagactct tctattatga aataatggct tctgtgcttc gcatttatag 107581 tctccctggg taaggcccca gtatcccatt tttctgggct gggtcctact cttccttttt 107641 gccctagggg ttgctctttc ctttccatag ctagagcatg ggtactgctc tcttctctga 107701 ttccctcaca taggtcccct gggtgcctag gctcagctag atgcattggc ccctctcttc 107761 cctgtgttcc tcaactcctg ctgccctgaa gccttgccag gaagggtttg gaggtcaggg 107821 gtcctaaggt gacccagtga attctgcttg gcctctccct ggccaggtga ctgtccacac 107881 tcatgacacc tctggactcc aggctccacc cgcttcctct aattgtaatt tccttgagtc 107941 tgcctgtgtc ctcaatttca gtcatcatgg tcaccacttg tgggcccact gcactgttcc 108001 tgcaggacaa ttttctgagg ttcattttgt agcatttgtt ttcttgcaat tttgcatggg 108061 tgggctactc tggctagtgc tctcagcctg tttcaccttt cccctatgct ttgtatcttc 108121 cagatcctct gtggttgttg aattgtaaca agagagagaa ctctggctgc ctgagagggc 108181 atgactctag tcacagcagg aggggcttgg acaggccctg gtgagtggga gctgagggac 108241 gccataacct cagggctggg ttcctgtagt tccggaaacc tatgttgtgc ccatcacaag 108301 tgccaaggcc agaggcccta acttatttct acctttttat ccccattctg tacactcttg 108361 tcatttctaa tctgacttct gacccagggc tgtcttctcc cattctctac cagaggggct 108421 ctagtagagc aatagctctc tctgcgatgc tttccaatat tgtactacac ttacactatt 108481 gtgtcataag atgtgtgcat atctgtaaat agtccctgaa cacgttactt atttttaatc 108541 acattttcct gggcctgtac tcacatttcg cataggagtc acctgtcacc cacagtcagc 108601 caaggtcctg ctaaaagtca ttgcagagga ttaagttgga gaccttgtac tacaccccat 108661 cctcgtgtct ttgtttcttg tgaggccatc tccattctgt gactcttaga ttccagtact 108721 gtacagcagc ccttttttca acctgatcct agctctgttc caccaggaaa cttaggtact 108781 cagtcctggg tacagcagac acatatttgt gatgtagttg ccccacccac caaagtcagc 108841 atgtacatca atagtatttt tctgccttca ggttgttggc atgaagtgaa ggatgaagag 108901 tcatcttctg aacagagcat ttctatagca gtgtcacatg ttaatacttc caaggcaggt 108961 ttgcccgcac agacggctct cccttgtgac atatgtggcc ccatcttgaa agatattttg 109021 cacctggatg aacaccaggg tacacaccat ggactgaaac ttcacacatg tggggcatgt 109081 gggagacaat tctggttcag tgcaaacctt catcagtacc agaagtgtta cagtatagag 109141 caacccttaa gaagggataa aagtgaggcc tcaattgtga agaactgcac agttagcaaa 109201 gaacctcatc cgtcagagaa gccctttacg tgtaaggagg agcagaaaaa cttccaggct 109261 actttgggtg gctgccaaca aaaggccatc cacagtaaga ggaagacaca caggagcact 109321 gagagtgggg atgcatttca tggtgaacaa atgcattaca agtgcagtga atgtgggaaa 109381 gctttcagcc gcaaagacac acttgtccag caccagagaa ttcatagtgg agagaagcct 109441 tatgagtgca gcgaatgtgg gaaagccttc agccgcaaag ctacacttgt ccagcatcag 109501 agaatccata ctggagaaag gccttatgaa tgcagcgaat gtggaaaaac cttcagtcga 109561 aaagacaacc ttactcagca caagagaatc cacactggag aaatgcctta taagtgcaat 109621 gaatgtggga aatattttag ccatcactcc aatctaattg tacaccagag agttcacaat 109681 ggagcaaggc cttataagtg cagtgattgt gggaaagtct tcagacacaa atctacactt 109741 gttcagcatg agagtattca cactggagaa aatccttatg attgcagtga ttgtgggaaa 109801 tcctttggcc acaaatacac cctcattaaa catcagcgaa ttcacactga gtcaaagccg 109861 tttgagtgca ttgaatgcgg gaaattcttt agtcgaagtt ctgactatat tgcacaccag 109921 agggttcaca ctggtgaaag gccttttgtg tgcagtaaat gtgggaaaga ctttatcaga 109981 acctcccacc ttgttcgaca ccaaagagtt cacactggag aaaggccata tgagtgcagt 110041 gaatgtggga aggcctacag cttaagctcc cacctcaatc ggcaccagaa agttcacact 110101 gcaggcaggc tttaggagtg ctttgaatac aacaggactc atcaatcaga tgttgaattt 110161 catgtatctg aacattgaca caaaggagat accttatggt gccaggtacg tgggaacctt 110221 ctagggatat gttgcacttt ctgacttgct caggtttttt gccagagtta tgtcactgtc 110281 aatccatgtg gccgaaacca tcttaactct accagctaag ataccccagc attggggaag 110341 gcagggtttt gtattgtcca gtccctggag aaaatcatga aatgcctgag ttcattgggg 110401 gtcctcattc ccttctgtat gacaggtata ggtatggata tgacccattt ttagccaaga 110461 gggtctgagc tgtatctgct ggtggcttat acaaaaagtt tactttcttc atggatattc 110521 ttggtctcac atacttgtaa tcaagttttt ccagcctcca agtcacctgg cctgggaaag 110581 tacttgcctc atgttgctct ggtttgtgat aataaaggct ttacagttta agccacattt 110641 aatcttgggg cttcttctta tggtctgggg tggattgaaa acaggctctg ccaaactgaa 110701 gacagccttt gtgcggtgcc tccaactttg cctcaaatgg gacagtgggt tgagggagaa 110761 cagttcttag tccagttttg atgttaactt ccatagctga caaagcttgt taagtaagaa 110821 ttaagatctt gtgtagacct gatttgtctg gattttagag ttatttgaga gcccatattt 110881 caccttgagg agggtgctgc tgctgtgaca gcctgcagtg ttttgaaaca gcatggattg 110941 ggtgtcttgt ttgcagcatg tgtcccatgt tccccaacac tgttgaggga aagctgttcc 111001 tcaggacctg ctgagtggcc atattccctg aaggcctgaa tctgtttcac aggccactgt 111061 tggtaagatc taaagcatcc agtagggaaa caaaattgat aaatattgag tgtgagtaat 111121 tgggattggg gagattgtgg caaactagag gggaagtgcc cattgtaaaa acacatccac 111181 agacagtcca ggcactaagg ctgaatggga tcagggtatc cagaaatctc aggatctcca 111241 gggccatgtt actgttaggt caaggtcact ggtgcagcaa cgaatgtagt ttttctagat 111301 tcctctccct ccctgggctc tttacctaat gtctttgcgg cacaggcggt aaccctggga 111361 gtaaagaggt gtggtccaag gaagtagctt ttgtgaccag ctggagtttc tggtgactct 111421 tttggcaatg gtcctcattg tttgccagtt tttcttacta ctgaggaaga gattgtcttc 111481 ccaagatatt tggagtgata agagtcacca tagtgaggta gagtcactct tagtgatgtc 111541 cagttgtcca gtttccaata gaagtgggag atccagattt ttaaatggga taacttttta 111601 taggttcact gaggtgtaat caacatgcaa tatattgcac atatttaaag tgtacgagtt 111661 aagtcttgat acacacacac acacacacac acacacacac acacacactt gaaactatca 111721 ccctatcagt ggtattggac atatctgtta cccccaaagt taacctgagg cccttaacct 111781 ttctctcagt gctcgccttc ccccagaatc cctaggcaac cactgagtag ttttcactgt 111841 aggtaaattt gtcttttcta gaattttatg taagtctgcc tataaagtgt taattttgca 111901 tgctgtcttt catgcaacat aatttgaaat ttgtccatgg tatgtatgaa cagtccattt 111961 taaaattact gagtcatatt ctgtatggat ataccacagt ttatccattt atcttaattg 112021 atgtctgtac tccaccccca tttttaaata ttaaaaagct gctgcaaata ttgatgcatg 112081 agtctttatg tggacactta gctgagaaca tttttaagca gaggttttga aggtgtgacc 112141 atatatctta ctaattatag taaaatatat tggaaaagag atacatacag ggaagaactg 112201 tgaaacagaa aagacctaag atttgatgat ttgggaaatt ctcagcccat cttaattgga 112261 aaaaaaaaac aaaccattga aattaagaga ttcattgtta ggaaagatag atgttttaga 112321 gagagagatg ttttagagag atagccaagg atgtttgctg ataccgggat caaaacatac 112381 atcagaacac tgtcacacaa aaagctcttt gaagagatta aatgtgactt gtagatcccc 112441 tcaatacaaa caatttataa gaagtttaaa ctatcactca tctcagcaaa agccaaaaat 112501 atagataggg gattccctag gaggaataat ctgcataaac ctcttttcta atgttgtgaa 112561 tttcagtgat gtacaggaga ctcacaaaat tcttgaaaat tttataccag tggaaatgct 112621 caccttgggt ctaaagggac cttgaaagta tgaagttaaa gggtgtaggc atgtaaagtg 112681 tgaagttaaa ggttgtaata ttctatacat gggactggct gcggaaacag gtgcaagccc 112741 ttgctacctt tcatgagaaa ggaaggatga ctcagagcag aggagagtcc agtgggcaga 112801 gtcctgaacc atgtgatatt cccatgtctt gactcccagt ggcatttgcc aaactagatt 112861 tcagattttc ttgggattgg tggttccatt ttttttgttt tccccccttc catttcctcc 112921 ccttttgaaa caatgcatat acatataact attatcctag gtctttccca accttttgtt 112981 tgggagcaga taactagttt tctagtttta cagatccaaa tgagacagga atagcactgg 113041 gtggtcacag gaggatggaa aatctcaaca ccctggtttt ggtgcagctg aaaaaaaaaa 113101 agaaaaaccc aaacaacagc taaaacggga actaggcaaa gaaaccgtgg gataacagaa 113161 aacccaaaac aaaagagaaa actgccagaa ccccagtcag ggtgacatgt ctgtaactct 113221 tccaggcaaa cccaaataag ggaggagggg tggtaatcag gggtccctga aatcccctct 113281 tttcccagaa tacctaatga ttacccctat taaaggaaca cccatacaat tagaaaccca 113341 aactttgttt tgcatgactt gctctcaggg gcactcccac acttctctct tgtgtgtact 113401 tttgcttcac aataaaagct gcttgccttt gcttcactgt gactcgtcac tgaattcttt 113461 caatggtgtt aagaacctgg atgctgctgg ggctgggatc tcaccaacaa tgtccagaga 113521 cccccccgaa ccccccagca acacagagaa gaattgtgcc ccaagatggg attataccta 113581 gagcctcatc catattggac ttagacaatt tagatgaaat tggagacttt agctcagtgt 113641 ggaggcgcac acctgtagtc ccagctgctc aggaggctaa ggcaggagaa tcgcttgaac 113701 ccaggagaca gaggtggcag tgagccaaga ttgcgccact gcactctagc ctgggcaata 113761 aagcgagact gtctcaaaaa aaatcataaa atagtaaaag gcatcaaatt tttgagggga 113821 aatacctatt ggtgaactga aaactcgaca acagttctgg tgtgtgggat aacatgttaa 113881 tagccttgat cattttgaat ttcataatca taaatgggat tttgtttttg agatggggtt 113941 tcactcttgt tgcccaggct ggagtgccat ggcactatct cagctcactg caacctctgc 114001 ctcctgggtt caagcgattc ttctgcctca gcctcctgag tagctgggat tacaggcatg 114061 caccaccacg cctggctcat tttgtatttt tagtagagac agggtttctc catgttggtc 114121 aggctggtct tgaacttctg acctcaggtg atccggctgc cttgacctcc caaagtgcca 114181 ggattacagg catgagccac tgtgcctggc cttttgggat tggtttaaag agctgtggaa 114241 gacaattgag caattaccac agagattagc aatggagttc ctagtatatt tgaaaatgaa 114301 taaaattgtt cctgggaaaa ttaacaatag tgatctcagt ctaaattaca gagtaagcaa 114361 ttcagagagg ataatatgcc acatatgcat agatttgttt aaaacagtaa attatatagc 114421 actgtaaaaa ttgaaaaaaa tactattaga acaacaaaca attaggttga tgaagcacaa 114481 taagagacta atgaaaagga aagaaatgaa attacttaat atcaaaatgt atgaaataaa 114541 tttaagccat atacaagcat gccaaattaa cttgtgagat taacaaagag tgtattaatt 114601 ggggaaaaag actaggagtc tgcaaatcct aagattaggc ttaacactaa gcctcttaca 114661 aacttcaagt tttcaaaaag gctcaaggtt ttctggtaca gaggtactat gggagcacag 114721 ttcagattta acttaacatt acgttattta ggccaggcac ggtggctcat gcctgtaatc 114781 tcagcacttt gggaggctga ggcgggcaga tcacgaggtc aagagatgga gaccatcatg 114841 gctaccacag tgaaacccca tatctactaa aaatacaaaa aattagccgg gtgtggtggc 114901 aggcgcctgt agtcccagct acccgggagg ctgaggcagg agaatggcat gaacccggga 114961 ggcggagctt gtggtgagcc gagatcacac cactgcactc cagcctgggt gatagagcaa 115021 gactccacct caaaaaaaaa aaaaagaaag aaaattacag tctttaggaa actagatatt 115081 catcagagag tgtatgtgac cagaataagc catcttacga cgtctagatg acacagtaaa 115141 tacttatgct gattcaaaat tacctgagct acttcaacat aggacatagc aaaacactga 115201 taattacatt actgtatact tataaaagat gcttgtggct gggtgcagtg gctcacacct 115261 ataatcccag cccttcggaa ggccaaggca ggcggatcat ctgaggacag gagttccaga 115321 ccagcctggc caacatggtg aaaccccatc tctactaaaa atactaacat taggcaggag 115381 aattgcttga atccgggagg cagaggttgc agtgagcaga gattgcgcca ttactccagc 115441 ctgggcggcg agagcaaaac tccgtcaaaa aaacaaaaac aaaaaaaaca aaccatgctt 115501 tcaaatctgc ctaacaatta tagcttcaga gtccttgtat tcaggattat ctatttgtaa 115561 cttactactt tttacaaata tttttcttac actaaattag aaagaatgtt tggtctcgca 115621 ggtgtctgag aatctttagt aaaatacccc tacagattct attttggaat catggtttta 115681 aaaaaagcaa aagtgtgtat ttccaattaa ttgctacaga gatgccaaat gtccactcag 115741 tagttgtgag taaaacaaat tgtcttattg ctcttcactt cacaggcaaa taaactcttc 115801 gagtgattgt gtgaagttca gagtattagg tcagggatga aatggaggct aatccagaca 115861 cgttataaga gaaaatatgg aatagagatt ctaggggcac taaagcaaat gcatgaggtg 115921 gtctatgctt ctgagtccct catagtatgt tccttctgag tatttgtatt ctaaacctca 115981 aaattacaat tttttgtata acatttctta ctggattctc caagatataa tctacttttt 116041 gaagaatatg acatttatgt agaaatagtg gttacttacc aaactggtat ttgctactag 116101 gaaagtagca tcagtttttg gcatgggcat ctgaactctg agaataaggt ttgggattcc 116161 atgacctcat ctccttgctg ggaagtgatg gtcattttat tcttccacct gtaattgcag 116221 tgacagtgtc tccaagagga ttggagatga tctctgaaaa aaatttttga agttagaaaa 116281 ttgtagaaag ttattaacat taattactgg agttatccat ttagaatgac atcataccta 116341 aaagagaaaa ttatctgtga aatgggaaca aataaaggta aatagaggaa acagttctaa 116401 gacatgggac atgtaagttg aacccttggt tatgctggaa tcttggctcc tatgaaattt 116461 tttttttttt tttttttttt ttttttttga gatgtcttgc tctgtcgccc aggctggagt 116521 gcaatggcgc gatctcggct cactgcaacc tccgcctccc aggttcaaac aattcttcag 116581 cctcagcctc tcaagtagct gggattacag gcgcccacca ctgcacctgg ctaatttttg 116641 tatttttagt agcgatgggg ttttgctatg ttggccaggc tggttttgaa ctcctgacct 116701 caggtgatcc gcccgcctcg gcctcccaaa gtgctgggat tacaggcgtg agccaccgcg 116761 cccagccgaa attcttattt ctacaaagtc tgacattttt tcattcatat gaatggtgag 116821 gaataacaaa cttggaagga aagaaatagt aaaaagagag tgagagtaca aaggtgaaag 116881 agttgaggga aagcattact caacatctta tctggttcag aggctcatct agatttgtat 116941 ttacagcagt gaacaaccta caaagaaatg tattctcaag taggctttat tcaaattaag 117001 agatgtggtc tggaaataat aaagaaaata ttggaaaata tacactaagt tataaagaga 117061 cagattctat ggaaaatatc agaataccag caccaagtaa taacgaacac gtgaagtagg 117121 tagaatttct ataggaaatt tctatagtca gggttgtcct tatagtggag ataatatttg 117181 agcatatatt taaagcacag cagagagaga gtgatttgta aacacagaga aaatgtgcta 117241 tggaaaagaa atagccagta cagtgtccaa aaagcctgga agtgaaaaat catgtggtca 117301 aagcacaatt caagagtgac tggtgatggt ttttatacca tcaaaaatct ccccaatgtg 117361 cacaattacc catgtatcca ttcacatgcc attatctcag acctccttgt caactgtctt 117421 ctcaaacata ttcttccttg cccttgatca cattgccagg tcatggactg ctatcctgta 117481 aaatacttgt tctttggaca cttattcctc ctcataaaaa ggtttacaag gtatgccact 117541 tgtgacttca cctatgggga aaattttgat ccccaactat ctttcaagag gatgcagaat 117601 atggctctga ggcagccaca gttttgtttt gttttttttt ttttgaaatt tatctttttt 117661 tcccaccctg ccttatggtg ctgacacaat cttttttatt actaggtttt taagagcact 117721 tttaggtttg cagtaaaatt aaggggaagg taaagagata ttccatatac tccctgcacc 117781 tacataagca tagcgcccct caccccaacc gcagtggcac atttgtagca attataaacc 117841 tacactgaca catcattatc atccaaagtc cctactttac tttagggttc actcttagtg 117901 ttgtacattc tatgtgtttg gacaaaggta caatgacacg tgtccactat tagagtgtca 117961 tctaaaatag taatcctctg tgctccacaa attcatccct cctacccttc taaaccccca 118021 gcaaccactg atc // LOCUS AC003972 42667 bp DNA PRI 05-JAN-1998 DEFINITION Homo sapiens DNA from chromosome 19, cosmid R33485 containing pNORF1, complete sequence. ACCESSION AC003972 NID g2739354 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 42667) AUTHORS Lamerdin,J.E., McCready,P.M., Adamson,A.W., Burkhart-Schultz,K., Gordon,l., Christensen,M., Kyle,A., Ramirez,M., Stilwagen,S., Garnes,J., Danganan,L., Bruce,R., Quan,G., Montgomery,M., Ow,D., Kobayashi,A., Nolan,M., Trong,S., Olsen,A.O. and Carrano,A.V. TITLE Sequence analysis of an ~1 Mb region containing the MEF2B gene in 19p12 JOURNAL Unpublished REFERENCE 2 (bases 1 to 42667) AUTHORS Lamerdin,J.E. TITLE Direct Submission JOURNAL Submitted (02-JAN-1998) Joint Genome Institute, Lawrence Livermore National Laboratory, 7000 East Ave., Livermore, CA 94551, USA REFERENCE 3 (bases 1 to 42667) AUTHORS Lamerdin,J.E. TITLE Direct Submission JOURNAL Submitted (05-JAN-1998) Joint Genome Institute, Lawrence Livermore National Laboratory, 7000 East Ave., Livermore, CA 94551, USA COMMENT Map and sequence oriented from p telomere to centromere. Cosmid R33485 overlaps cosmid F19807 to the left and cosmid R32469 to the right. FEATURES Location/Qualifiers source 1..42667 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="R33485" /chromosome="19" /map="19p12 between UBA52 and D19S451" /cell_line="5HL2-B" /clone_lib="LL19NC03 R chromosome 19 cosmid library" /note="LL19NCO3 cosmid library constructed at LLNL from flow-sorted chromosomes from hybrid 5HL2-B, which carries chromosome 19 as its only human chromosome." repeat_region complement(7..190) /rpt_family="Alu" CDS join(891..1121,14661..14800,16425..16514,18756..18923, 19369..19549,20816..20977,21668..21752,21933..22031, 23282..23390,23560..23719,23805..23923,24606..24770, 24867..24981,25558..25701,26001..26214,29002..29119, 29507..29663,30691..30833,32119..32293,32851..32932, 33970..34131,34242..34459,34725..34844) /note="human type 1 RNA helicase; DPS similarity to (U59323) type 1 RNA helicase pNORF1 [Homo sapiens]; Score: 5190 Identity: 1103/1118 (98%); ~gi|1575536 (U65533) regulator of nonsense transcript stability [Homo sapiens]; Score: 5170 Identity: 1100/1118 (98%); gi|1944407|gnl|PID|d1020441 (D86988) KIAA0221 [Homo sapiens]; Score: 5257 Identity: 1115/1129 (98%)" /codon_start=1 /product="pNORF1" /db_xref="PID:g2739355" /translation="MSVEAYGPSSQTLTFLDTEEAELLGADTQGSEFEFTDFTLPSQT QTPPGGPGGPGGGGAGGPGGAGAGAAAGQLDAQVGPEGILQNGAVDDSVAKTSQLLAE LNFEEDEEDTYYTKDLPIHACSYCGIHDPACVVYCNTSKKWFCNGRGNTSGSHIVNHL VRAKCKEVTLHKDGPLGETVLECYNCGCRNVFLLGFIPAKADSVVVLLCRQPCASQSS LKDINWDSSQWQPLIQDRCFLSWLVKIPSEQEQLRARQITAQQINKLEELWKENPSAT LEDLEKPGVDEEPQHVLLRYEDAYQYQNIFGPLVKLEADYDKKLKESQTQDNITVRWD LGLNKKRIAYFTLPKTDSDMRLMQGDEICLRYKGDLAPLWKGIGHVIKVPDNYGDEIA IELRSSVGAPVEVTHNFQVDFVWKSTSFDRMQSALKTFAVDETSVSGYIYHKLLGHEV EDVIIKCQLPKRFTAQGLPDLNHSQVYAVKTVLQRPLSLIQGPPGTGKTVTSATIVYH LARQGNGPVLVCAPSNIAVDQLTEKIHQTGLKVVRLCAKSREAIDSPVSFLALHNQIR NMDSMPELQKLQQLKDETGELSSADEKRYRALKRTAERELLMNADVICCTCVGAGDPR LAKMQFRSILIDESTQATEPECMVPVVLGAKQLILVGDHCQLGPVVMCKKAAKAGLSQ SLFERLVVLGIRPIRLQVQYRMHPALSAFPSNIFYEGSLQNGVTAADRVKKGFDFQWP QPDKPMFFYVTQGQEEIASSGTSYLNRTEAANVEKITTKLLKAGAKPDQIGIITPYEG QRSYLVQYMQFSGSLHTKLYQEVEIASVDAFQGREKDFIILSCVRANEHQGIGFLNDP RRLNVALTRARYGVIIVGNPKALSKQPLWNHLLNYYKEQKVLVEGPLNNLRESLMQFS KPRKLVNTINPGARFMTTAMYDAREAIIPGSVYDRSSQGRPSSMYFQTHDQIGMISAG PSHVAAMNIPIPFNLVMPPMPPPGYFGQANGPAAGRGTPKGKTGRGGRQKNRFGLPGP SQTNLPNSQASQDVASQPFSQGALTQGYISMSQPSQMSQPGLSQPELSQDSYLGDEFK SQIDVALSQDSTYQGERAYQHGGVTGLSQY" misc_feature 2165..2375 /note="BLASTN similarity to cpg266h4.ft1a (1..211); match: 0.97, score: 5.2e-77; database searched: Sanger CpG." misc_feature complement(2412..2605) /note="BLASTN similarity to cpg266h4.rt1a (11..204); match: 0.97, score: 6.4e-70; database searched: Sanger CpG." repeat_region complement(4029..4312) /rpt_family="Alu" repeat_region complement(4351..4463) /rpt_family="L1" repeat_region complement(5435..5721) /rpt_family="Alu" repeat_region complement(5811..6056) /rpt_family="Alu" misc_feature 6120..6633 /note="DDS similarity to AA313476 EST185361 Colon carcinoma (HCC) cell line Homo sapiens cDNA 5' end; Score: 1019 Identity: 513/513 (100%)." repeat_region 6893..7341 /rpt_family="Alu" repeat_region complement(7787..8107) /rpt_family="Alu" misc_feature complement(10130..10336) /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: good, score: 52.000" repeat_region 10617..10784 /rpt_family="Alu" repeat_region 11410..11684 /rpt_family="Alu" repeat_region complement(12200..12398) /rpt_family="Alu" repeat_region complement(12555..12707) /rpt_family="Alu" repeat_region complement(13002..13304) /rpt_family="Alu" misc_feature complement(14635..14794) /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: good, score: 71.000~~Other overlapping matches:~(14664..14800) DDS similarity to H13969 EST00061 Homo sapiens genomic clone D4-1 5' (1..137); 100% identity.~~(14687..14800) DDS similarity to H13971 EST00063 Homo sapiens genomic clone D4-16 5' (1..114); 99% identity." repeat_region 15399..15658 /rpt_family="Alu" misc_feature 16425..16514 /note="DDS similarity to overlapping ESTs:~~(16425..16489) H13969 EST00061 Homo sapiens genomic clone D4-1 5'(138..194); 96% identity.~~(16425..16514) H13971 EST00063 Homo sapiens genomic clone D4-16 5' (115..203); 96% identity." misc_feature 21933..22053 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: good, score: 52.000~~Other overlapping matches:~(21960..22031) DDS similarity to AA170942 ms50b01.r1 Life Tech mouse embryo 13 5dpc 10666014~ Mus musculus cDNA clone 614953 5' similar to SW:YAC6_SCHPO Q09820 HYPOTHETICAL 105.6 KD PROTEIN C16C9.06C IN CHROMOSOME I; (1..72); 90% identity." misc_feature 23282..23390 /note="DDS similarity to AA170942 ms50b01.r1 Life Tech mouse embryo 13 5dpc 10666014 Mus musculus cDNA clone 614953 5' similar to SW:YAC6_SCHPO Q09820 HYPOTHETICAL 105.6 KD PROTEIN C16C9.06C IN CHROMOSOME I; (73..182); 88% identity." misc_feature 23560..23876 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: excellent, score: 78.000" misc_feature 23560..23731 /note="DDS similarity to AA170942 ms50b01.r1 Life Tech mouse embryo 13 5dpc 10666014 Mus musculus cDNA clone 614953 5' similar to SW:YAC6_SCHPO Q09820 HYPOTHETICAL 105.6 KD PROTEIN C16C9.06C IN CHROMOSOME I; (183..342); 86% identity." misc_feature 23805..23915 /note="DDS similarity to AA170942 ms50b01.r1 Life Tech mouse embryo 13 5dpc 10666014 Mus musculus cDNA clone 614953 5' similar to SW:YAC6_SCHPO Q09820 HYPOTHETICAL 105.6 KD PROTEIN C16C9.06C IN CHROMOSOME I; (343..453); 85% identity." repeat_region 24318..24534 /rpt_family="Alu" misc_feature 24867..24981 /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: excellent, score: 100.000~~Other overlapping matches:~(24883..24981) DDS similarity to H14015 EST00042 Homo sapiens genomic clone D5-25 5'. Score: 173 Identity: 95/96 (98%)." misc_feature 25209..25677 /note="DDS similarity to AA309347 EST180284 Liver III Homo sapiens cDNA 5' end similar to similar to NAM7 protein; Score: 922 Identity: 465/469 (99%)." misc_feature 26001..26214 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: excellent, score: 99.000" misc_feature complement(28473..28784) /note="DDS similarity to AA309352 EST180289 Liver III Homo sapiens cDNA 5' end; Score: 620 Identity: 311/312 (99%)." misc_feature 29002..29119 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: excellent, score: 100.000~~Other overlapping matches:~(29041..29119) AA416671 zu18a07.r1 Soares NhHMPu S1 Homo sapiens cDNA clone 738324 5' similar to SW:NAM7_YEAST P30771 NAM7 PROTEIN ; (1..79) 100% identity." misc_feature 29396..29951 /note="DDS similarity to AA573895 nk09a05.s1 NCI_CGAP_Co2 Homo sapiens cDNA clone IMAGE:1012976 similar to SW:NAM7_YEAST P30771 NAM7 PROTEIN ; (1..557)~Score: 1081 Identity: 550/557 (98%)" misc_feature 29507..29663 /note="DDS similarity to AA416671 zu18a07.r1 Soares NhHMPu S1 Homo sapiens cDNA clone 738324 5' similar to SW:NAM7_YEAST P30771 NAM7 PROTEIN ;(80..235); 99% identity." repeat_region 30260..30521 /rpt_family="Alu" misc_feature 30654..30833 /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: excellent, score: 98.000~~Other overlapping matches:~(30691..30833) DDS similarity to |AA416671 zu18a07.r1 Soares NhHMPu S1 Homo sapiens cDNA clone 738324 5' similar to SW:NAM7_YEAST P30771 NAM7 PROTEIN; (236..378); 100% identity." misc_feature 30785..30833 /note="DDS similarity to AA393082 zt69a12.r1 Soares testis NHT Homo sapiens cDNA clone 727582 5' similar to SW:NAM7_YEAST P30771 NAM7 PROTEIN (1..49); 88% identity." repeat_region 31258..31530 /rpt_family="Alu" misc_feature 32117..32293 /note="DDS similarity to overlapping ESTs:~~(32119..32171) AA416671 zu18a07.r1 Soares NhHMPu S1 Homo sapiens cDNA clone 738324 5' similar to SW:NAM7_YEAST P30771 NAM7 PROTEIN; (379..431); 100% identity.~~(32119..32293) AA393082 zt69a12.r1 Soares testis NHT Homo sapiens cDNA clone 727582 5' similar to SW:NAM7_YEAST P30771 NAM7 PROTEIN (50..222); 99% identity.~~(32117..32293) AA410212 zv22h03.r1 Soares NhHMPu S1 Homo sapiens cDNA clone 754421 5' similar to SW:NAM7_YEAST P30771 NAM7 PROTEIN (1..176); 99% identity.~~(32119..32293) predicted exon, program: grail2exons_human_1.3, frame: 1, quality: excellent, score: 100.000" misc_feature 32851..32932 /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: excellent, score: 100.000~~Other overlapping matches:~(32851..32932) DDS similarity to AA410212 zv22h03.r1 Soares NhHMPu S1 Homo sapiens cDNA clone 754421 5' similar to SW:NAM7_YEAST P30771 NAM7 PROTEIN (177..257); 99% identity.~~(32851..32932) DDS similarity to AA393082 zt69a12.r1 Soares testis NHT Homo sapiens cDNA clone 727582 5' similar to SW:NAM7_YEAST P30771 NAM7 PROTEIN (223..303); 99% identity.~" repeat_region complement(33271..33715) /rpt_family="Alu" misc_feature 33970..34131 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: excellent, score: 90.000~~Other overlapping matches:~(33970..34131) DDS similarity to AA393082 zt69a12.r1 Soares testis NHT Homo sapiens cDNA clone 727582 5' similar to SW:NAM7_YEAST P30771 NAM7 PROTEIN (304..465); 100% identity.~~(33970..34131) |AA410212 zv22h03.r1 Soares NhHMPu S1 Homo sapiens cDNA clone 754421 5' similar to SW:NAM7_YEAST P30771 NAM7 PROTEIN (258..419); 100% identity.~~(34071..34131) |T03660 IB693 Infant brain, Bento Soares Homo sapiens cDNA clone IB693 3'end (480..420); 83% identity." misc_feature 34242..34459 /note="DDS similarity to T03660 IB693 Infant brain, Bento Soares Homo sapiens cDNA clone IB693 3'end (419..202); 100% identity." misc_feature 34421..34459 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: good, score: 52.000" misc_feature 34725..34844 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: excellent, score: 84.000" misc_feature complement(35866..36121) /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: good, score: 63.000" CDS complement(join(37278..38071,38664..38988)) /note="embryonic growth/differentiation factor; DPS similarity to gi|121105|sp|P27539|GDF1_HUMAN EMBRYONIC GROWTH/DIFFERENTIATION FACTOR GDF-1 PRECURSOR; gi|106077|pir||C39364 GDF-1 embryonic growth factor - human; gi|183052 (M62302) ~growth/differentiation factor 1 [Homo sapiens]~Score: 1940 Identity: 370/372 (99%)~~~gi|109860|pir||A39364 GDF-1 embryonic growth factor - mouse; gi|193460 (M62301) growth/ differentiation factor 1 [Mus musculus]; Score: 1111 Identity: 255/357 (71%)" /codon_start=1 /product="GDF-1" /db_xref="PID:g2739356" /translation="MPPPQQGPCGHHLLLLLALLLPSLPLTRAPVPPGPAAALLQALG LRDEPQGAPRLRPVPPVMWRLFRRRDPQETRSGSRRTSPGVTLQPCHVEELGVAGNIV RHIPDRGAPTRASEPASAAGHCPEWTVVFDLSAVEPAERPSRARLELRFAAAAAAAPE GGWELSVAQAGQGAGADPGPVLLRQLVPALGPPVRAELLGAAWARNASWPRSLRLALA LRPRAPAACARLAEASLLLVTLDPRLCHPLARPRRDAEPVLGGGPGGACRARRLYVSF REVGWHRWVIAPRGFLANYCQGQCALPVALSGSGGPPALNHAVLRALMHAAAPGAADL PCCVPARLSPISVLFFDNSDNVVLRQYEDMVVDECGCR" misc_feature complement(39258..39300) /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: excellent, score: 84.000" repeat_region complement(39776..40328) /rpt_family="Alu" repeat_region complement(41022..41620) /rpt_family="Alu" repeat_region 42107..42537 /rpt_family="Alu" BASE COUNT 8492 a 11177 c 12739 g 10259 t ORIGIN 1 gatccacccg cctcggcctc ccaaagtgct gggattacag gcgtgagcca ccgcgctcgg 61 cctaattttt atattttttg tagagatgga gtctcactct gtctcccagg ctggtctcga 121 actcctgagt tcaagcagtt ctcccgcctc agcctctgag tgttgggatt acaggtgtga 181 ccaccgcgcc gagccccact ccctttttaa accaatgcac agggagcatc tactatgtgc 241 gaggcaccct ccattttaca gatggggaaa ctgagttcca agcagctttt gccaggagtg 301 ggagcgcacc gtccctgccc agtctctcca gttgggcgtt ccgagaccac ctatgtaggg 361 tattcggcta gcgctcaatt aaaccgaaac accgtcgcag cgaccacagg tgcagaccac 421 caagtccaca ccgccagccg cccgggcgtc cccaattagc agttggggcg ggaccaagtc 481 ctcgagctgc cgccgcagcg gcggttgaaa gccccacccc ctgcgcgtgc gcggtgcgcg 541 tcccgtcttc ctgctgggca ccaggcgccc gcgaagagcg cgcgcgcttc cgccggcgtc 601 gggctcgcgc acgcgcacgg cgacggcggc ggtggcggca gttcctgctc taggctgcga 661 gcggctggcg gcttcgaggg gagctgaggc gcggaggggc tcggcggcag cggcggcggc 721 tcggcactgt tacctctcgg tccggctggc gccggggcgc gcggtttggt cctttccggg 781 cgcgcggggg cgacagcggc agcgacccga ggcctgcggc ctaggcctca gcgcggcggc 841 gggctcgagt gcagcgcgga accggcccga gggccctacc cggaggcacc atgagcgtgg 901 aggcgtacgg gcccagctcg cagactctca ctttcctgga cacggaggag gccgagctgc 961 ttggcgccga cacacagggc tccgagttcg agttcaccga ctttactctt cctagccaga 1021 cgcagacgcc ccccggcggc cccggcggcc cgggcggtgg cggcgcggga ggcccgggcg 1081 gcgcgggcgc gggcgctgcg gcgggacagc tcgacgcgca ggtgagcggc acatggcggt 1141 gccggaagcc cgggcccggc ctcggcgcct gagacctgcc ccgaactcgc ctcgggcccg 1201 gcctgtgttt ggccggagtc ccccatcgcg gccgggcctg gggcgatctg ccccgggtcc 1261 cctactctgg ctcggcctga gcttacctgc cccaggtcct gcaatctgcc cgggcctggc 1321 ctgatctgcc ccgggtcccc cgctccggcc gcgacctggc tgagttcctt cggcccccaa 1381 ctcctgggcc gggcagactt gcctcgagtc ctccacttcg ttcgggaccg agctaacctg 1441 accctgttcc tttacgccag ctgggtttgc tccccctcct gggcccgggc tgacctgcct 1501 tgtgtcccac aatcagtctg gcctggagta acctgccccc tgacttatct gagttcttcc 1561 attccatccg ggctagggtg acctgccctg gtccctcttt cgcaggtctc ggtattctgc 1621 ctcgagttct cctacgacct gggccgggtc ttgctcaggc cggagcttga gtgacttccc 1681 tcagtccctt tccgggagta tccagggtga cctgccctaa tcctgtctct ggcctggtcc 1741 gggggtcctg cccggtctcc ccattggtgg gtctggccgc ctgtcccacc tcccatctgt 1801 ggtcgcagcc atctgggcac cccacctttc acagtcacgg agcgtaggtt cgactttccc 1861 catgcatagg ggcgctggcc agcggaaggt tgaggcagag aagaattttt gcctcccatg 1921 gaggagcaca tattggctgg ctattaagat cattcaggaa gcaggttgta ttttgttttt 1981 tcttgcatcg cctgttgctg aaaccgacct gttacataac ttcacaactc ttgccttgga 2041 aaagttgggt ttgagttttg ttcttccaat aagacagcaa taatggcttt acataaacga 2101 gtctgccacg ttggaaactt atttttaatg atgtggatgt ttagagaagg ttacttaggt 2161 attttaaatg ggggcttggg ttgggagaag tatcgggcat gcatttcctg gtcctgggcg 2221 ctgcttgcct ggggtgggag aggggctggg ggggtggcgc gtgttgcacc ggggtcctgg 2281 ggaatgtcct catcaccctc tggatttcag gttgctttca ccttaggctt gctttgtgcc 2341 tgacctctcg ctcaggtttg ttttcctgaa tgctccgaca ggaagaagct ccgaggctta 2401 ccttggcaca ggtgtccctt ggcatatgct ggtgtttcag aactggggga accactcctt 2461 gctcatccag agtgagctca gaccctgtcc ttgcatgggg ggcttacgtg gtagtcgatt 2521 gagtcactct cgtgggtaga ccacccactt gggctttctt ttagccccta atggaagcaa 2581 aaaacatttg actcattcta accccatatt ccttaaaata gaaggtgggg tgtgaggaga 2641 acaggtgtgc acgtatgggt gtttttttca gttttgttta cattttgcga agggagggag 2701 ggagagagga gagttctcct gaagaaggga gtctgagctg cggaggcccc acctccttgt 2761 agttttcatg ccctgtagac ctttctgctt ccgcagctgt gaaatggggg tggcagttag 2821 atttttattc cttttaaaca taaaggaaac ctagatccat gtgtgagtcg actgtatacg 2881 ctgtctgtaa tctgacgtcc agcctgtcgg ggagactcgg taaagatctc tgtggagcag 2941 tgagccatga gcgtgacagc gcggatcggg gggttggctg ggtgtgaaag gtaatcattt 3001 tcttgtgaat ccttttaact gtagcgagca taaaacagca aacctggggc agcctttagt 3061 tctgcttttg tttctggcat tccgtttggg attaattgtg gggcatgtcc ttctctttca 3121 gttgaaacaa ttgatttttt ttcctgcacc tgaacacact gggctgtaca aaaggagagc 3181 attaaaacgc tcccctggag gtaaacattg cagttgagct cactagaatt ctgacgacat 3241 cagagggttt gtcctgtgct acttgtaatt gacccagatg ccccacccac ctgttacctg 3301 taatttggat ttgtttgtgg ggaatgggag tggtacccct ttgcatcaca aggggaagaa 3361 cggaatttgg gggcaaggaa gtctatgaat atcattcaag ggaaagcgga agaaagttat 3421 ctcttgtgtg aattgaaggc agtgtctgct aagatatttg ggaagacttt ccctgtttgt 3481 cctcaggagt tggccgtgta accagctgtt aagttggggg ctcctggaaa ctctcattct 3541 gcttttttaa tttttatttt actggggaag tggctatcag ctgtcaagtg ccctgcagtc 3601 acttttgctt agcggcttta ttgaaataga attcacttgc cacaacatac aatttaccca 3661 tataaagtgt acagttcggc ggcttttatt tagtatattc agagttgtgc agccattacc 3721 acagtaaact ttagaacatt tttgtcaccc cagagataaa ccctgcaccc cttagcccat 3781 ccgtcctacc tccacagccc ctggcaacca ctcacctgct ttctgtctca agatgtgcct 3841 gttctgggca tttcacatca acggaatcat atcatatatg gtgttttgtg tctggctcct 3901 ttcactctgc gtgatgtgtg agaagtagct acaaggtgag cttcgtagct gttccctggc 3961 cgagcattca ggaaaatgaa cctgatcgag gttcatccac gctgtggtat gtgtcatcat 4021 ttcattcctt tttttttttt gagacagagt ctcgctctgt tgcccaggct ggggtgcagt 4081 ggcgtgatct tggctcactg caacctgtgc ctcccaggtt caagcgattc tcctgcctca 4141 gcctccccag tagctgggat tacaggtgca caccaccatg ccccggctaa tttttgtatt 4201 tttagtagag acggggtttc accatgttgg ccagatggtc ttgatttcct gaccatgtgg 4261 tccacttgcc tcggcctccc aaagtgctgg gattacaagc atgagccacc gcacctggcc 4321 agcatttcat tcctttttaa ggctgagtaa tattccatgg gtggaatggt ccagatggat 4381 ggaccatgtt ttgtttatct cgtcatctgt tgatggacat ttgggttgtt tccacctctt 4441 ggcttttgtg aatagcactg ctgtggccat gagtgtacag gtttctgcgt ggatgtgttt 4501 atagacctgc tgggccgcgt agtaaccttg ggtttaatgg tttcagtaac tgcaagactg 4561 ttttccagtg tgtagccact ttccatacca accctacctg aactggtttt gcgcagttgg 4621 ttttccctta atgtgtgttg aaagacagtg tgtccccacc gggatgggaa agtggcagca 4681 ggggtgactc cctgcactca gagctggttc agataacact ccttttcttg tttatgaaat 4741 gaaactgata tttatcaaag cccttgagat cctgggatga aaagcgctga cacagtattg 4801 ccaagtgtcc ttaagataca gggacagttt tcgagcatct gaccttttcc tgggcccttc 4861 tctgctgcca cttctcctgc tcccaggaca gtgaaggtaa tttcagaagt cctttgcata 4921 atagctggtg ttacaagcct tttggtctgc ctggttggaa tatttcctct tgactgtagc 4981 ttctgccctg gatcctccaa acgggaacta gagtgaaact cttaagagtt ggaaatactg 5041 ctcagtatta agaaggaaca aactattgat acctgcaaca acatggatgt gtctcataat 5101 aattactttt ttgtatgtgt cattcatcat atgatttaat tcataaaaaa attctagaaa 5161 ttgccagcta gcctgcagca atagaaagca gaccagtggt ggctggacag aaaggtggga 5221 aaggggaggc aggtgacaaa gggcacagga aacttggggg taatggacgt ttttattatc 5281 gtgattgtag tgatagtttt atgggtgttt aagtacattg atgtatgtca gtttctaaaa 5341 tggcacacta aacatttgca gcttatggta tgtcagttat gcctcaatga aactggaaag 5401 aaaagtcagg gagataccac tatccaccca tttttttttt tttttttttt tggagacgga 5461 gtctcgctct gttgcctagg ctggagtgca gtggcacaat cttggctcac tgcaacctcc 5521 acctcctggg ttcaagcgat tctcctgccc tagcctctcg agtagctggg actacaggtg 5581 tgtgctacca tgcccggcta attttttgta tttttataga ggcagggttt caccatgtta 5641 gtcaggatgg tcttgatctc ctgacctcgt gttccgccct cctctacctc ctaaagtgct 5701 gggattacag ccgtgagcca ctgcgcccag cctgccatac tttttttaaa aggctgaaag 5761 tgctacttgg gttttttgtt tttgttttaa ggcaaagtct cattctgtca cccaggctgt 5821 agtgcagtgg cgttatcatg ggtcactgca gcctcgacct cctgggctca agcgatcctt 5881 ccacctcagc ctcccaagta gctgggacta caggtgtgtg ccaccacgtc tggctgattt 5941 ttcttttttt gtagagatgg ggttttgcca tgttacccag gctggtcttg aactcctggg 6001 ctcaagagat cctctcacct caggcctccc aagtgttgga attacagagt gagccattga 6061 gcctggcaaa agtgctgcct ggaactctca tgccatacac ctacctgtga cctagcagag 6121 atgaaggtgt gtgtacgcag ggctgcgcca aatgctgaca gcatctcttc atgagaaccc 6181 caaaccggaa gcagccccag tggccatcag cagatgaatg ggtaaacaaa gcagtgtgta 6241 catccagtgg gaaaaatgtg tgcctattgg tttgagttgc ttttcttagc tcacctgaaa 6301 gtatgaggtc tgtgggtagg tacagacctt ttaggcagca acacgcatgt ttttgtgcag 6361 gtgctctggg tgaccttgta gatagatgct gcttggtact ggtcccactt tgttccttca 6421 tcctcaggtc aggtctgcac tgcctgttcc gtctccactc ctggtagcct gtgggagtca 6481 cagctgaata gtcatctcgt actgggattt tactctgagt gcatgagttg tttccctagg 6541 aaaaacatgc tgtctttctg gatcgtaagg gacaactagc taaagctcct agccgacctg 6601 tcgtccagcc ctcactctcc tcctgccaga cttctctgta ttgcctccac ttctctgagt 6661 gtgtaaacct gggctcactt ttccttttga agtggtaaaa tactcaggat agaaagttgc 6721 tactttgtct tcaaagccat tgtcctgacc tatccttgac tgtccaggtg aacacaccat 6781 cttctgctga gccctgagat tatgaccaag gcggggtgag ttccatacag tcttgggttc 6841 cgaaaagtgg tgagtttaaa atctgcacat gtagctgggc acgatggccc atgcctgtaa 6901 tcccagtact ttgggaggct gaggcaggca gatcatttga gctcaggagt ttgagaccag 6961 cctgggcaaa atgtccatga tcgcaccact gcactccagc ctgggtgaca gagtgagacc 7021 ttgtcttaat aaaaagaaga aaatccaggc cgggcttggt ggctcacgcc tgtaatctca 7081 gcactttgag aggcagaggc aggtgaatca ccggaggtca ggagttccag accagcttga 7141 ccaacatgga gaaatcctgt ctctactaaa aatacaaaat tagctgggta tagtggcaca 7201 tgcctgtaat cccagttact tgggaggctg aggcaggaga atcgcttgaa cccgggaggt 7261 ggaggttgcg gtgagctgag attgtgccat tgcactccag cctgggcaac aagagcaaaa 7321 ctccatctta aaaaaaaaaa atgctcatga cttcattttg gctttttgaa tcctctggta 7381 gcccaggtgt ggttttaggg tgccagaaca ttggagataa gatgccgttt acctggagga 7441 gtgttggcaa gtgtttggtc agcacttacc ccttagtaga cttactgagt gtgttttcaa 7501 attgttagga gtttcccagt gattatgtga aagtgacctg agctgacact ttacagttag 7561 tggtgaatac acgaaacaat ttctattttt atcacaatta attacccgga ttcaggatga 7621 cggggtgggg gattatgact aatactatac caggactaac taaaaaaaga gttccctctc 7681 agcctagcta ccaccaccac aaaagcttac caccatcagc taccaccaca tttctctcat 7741 tgcttgattg aagtctgatg acctaccagg gaaccccctt cttttctttt tagacggagt 7801 ctcgcactgt cgcccaggct ggagtgcagt ggcgcgatct tggctcacta caacctctgc 7861 ctcccgggtt taagtgattt tcctgcctca gcctcccgag tagctgggat tacaggtgcg 7921 caccaccatg cccaactaat ttttttgtat ttttagtaga gacggggttt caccatgttg 7981 gccaggctgg tctggagctc ctgacctcag gtgatctgcc caccttggcc ttccaaggtg 8041 ctgagattac aggcatgagc caccacacct ggccccttta agtatagaac ttagtgctgt 8101 tttagtaaat ttacggaatt gggcagccac cacagcagtg ccttgctgtc tctgatttct 8161 ctttcgctcc tttccgctcc ctccccacca ccccttgtca gaccctgtgc tcatgcccaa 8221 ggtcacccgt tagggaggtg tgtccaccgc ctcagactga gccctgccct ggggaggtca 8281 gaaacatggc caagccctgg ccccatgctc ccccgccttt aagccccgtg tctgttaact 8341 tttgcaccct ctgcgagctg ctgggcaacc tcttcagcct ggccctggac ttcagccatc 8401 ccctgctgtt ttcttacttc ctggccttgc tgtgcttgtg ggactgaggg cagtgaggcc 8461 agcagaggga gagtgaagcc tttgaccccc gcggttgcac tgttagaaga cagcacatgc 8521 cgcctggcca gggtgctgtg agggccccag gagtaggagt gggctcccct gctaagggaa 8581 agcctgtggg aatggcgcct ctggagttgt tcagaagctg tacctgccag gcagaggggt 8641 gggcacagca ctttgcctag aaggagggag ggtggatgtg gggccagtgg ggccagggca 8701 ggggctgggc tgtgtcctgg agcctttagg agctgctggg gtgccttgac caaggctggg 8761 aggagggctc tgagatggca gtggcggggg ttaggggagg ggctgtggtg ttggaggttg 8821 ggcattgtgg cccaggttgg accccgaatg gatggagggg ccacagggca gtggagacgg 8881 aggacaccgg gatcagggct gtgtgtgtac atgtgtgtat gcatgagtaa gtgtgtaagt 8941 gtgtgtgcct gtgggaactg ccagggtacc ggctgccccc tcccagtcct cacagggagc 9001 cacagactta aaaagcagct ccagctcctg cctgtggtct catcagggtc cataattccc 9061 caggctgagc cggggctctc tccgctgggg aggcaccgtc ctgagggcct tgtgagaggg 9121 agaggcctcc ccagagcggc tgcaaggaag ctaaagggca catgctgggc agcagctcac 9181 aactgagggg cagtttgtgc ttgtcatggt ggtgagagtg ctttcatggt ggtcgggata 9241 gtctagcttg gcagagaagg gccgtgacac ccagctgtgc tctgtggcat tcagggccga 9301 ggggcctggc tggaggggct ggtgggagga gtgggtccgg gccctggcac cagtgctgtt 9361 cggggctccg gagggttcat ggccttgagc tctccttgct ggagcacata cctatgagag 9421 caaagttctg ctttctctga tacatggagg gctgctgggt cctggagtct ggtggctggc 9481 agcaggtcct tgtgcctttt tctgtgttgg tcgaatagca gcttaagagc cgggtaaccc 9541 agccgggctt ctcaggtccc gtgggccttt gccaggatgg tgtctagatg tgtaaaggtg 9601 gctcgttggg ctctagagag cccttgtgac agtgagcatg gcatcttcct gaagcatcag 9661 ctccttgcag agcgagtgca ggctcctggc ggccctggga gagggggttg cagatcccag 9721 gcagaggtgt ttggtcagag ctgcgctgcc tgcaccgtcc tcacctcttt ttcagtacgg 9781 ggatccatgc gcatagcttg cagccctgct gcatcctccg tgtagaaacg tctacgatga 9841 attctctaac atacaagagc tggtgagccc tgtgcctccg tctccctgat gagacagttg 9901 tgaaggtttt cttctccctg ttttcttttt ttttcttcac tggtgtgtgt ttttaaagca 9961 agtcttttgt gtcctttcac ctttgtgtct gaggtggagc cctgtgaatt ggcgatactc 10021 agccattttg acctgcagta tgtagtatgc atttctagaa gcagaaaagg gcgcctgtcc 10081 tgaatagctg cagcgccgtt atcacacaag cagcgtcaac aatggctttt cacgattgtc 10141 cgatatgtag ggacagttgg cctctctcca cagcagattc tttactggga tcctggaggt 10201 ccgctgttga gatgatggct gtctctgggc cctgcgttcc gagcggcctc tcccgctgct 10261 tttcatctcg ccaagctgca gtctggcctg ggttgttgtc ctgtgaggtg tgcaccttct 10321 ggactgaacc cctcattgtc acacttcagc ttcttgctga aggtgtggag tggatttcat 10381 ctcaggctgt gggctccagg gagagtggct gatccccggg ctgggaaggc aggggcggcc 10441 tgaggtgagc tggggatgcc ttgcacactc ggcccgttga aatgaacctc acgcggactc 10501 tagcacacct ggctggaaca agggtctgcc ggtccacaca ggtgccattg gtgagagggg 10561 aggcgagagg ccctttctct ttgggaagaa cgaatgtgag cctaggcaac atattgaaaa 10621 attaaaaaac tttctaggca ttgtgcacac ctgtagtctc agctacttgg gacgctgagg 10681 caagaggatc tcttttaagt tcaggagttt gaggatgcag tgagctatga ttctaccact 10741 gcactccagc ctgggtggca gagcaagacc ctgtctcaaa aaaataaagg aatggagagc 10801 ctggggaccg ctgtgtcggg tggtgggact atgtgaaaga tggtcaactc ctagcagttt 10861 cacgtggttc aaactaatag attaaaggtg gctaaagaga catcacaaat cacgtgcagt 10921 cattaactgg gacttggatt cttaaaaacc tgtaaaagac gttatgaaga cattggggtg 10981 atgggtgaat gtgtattaac caacagtgtg gcaggctggg tgcgctcatg agccagctgt 11041 tttgtgggga tgtggagaaa gccctcccct ggggccggcc actctgaaat gggtcaggaa 11101 gagataagtg tgtgagagca aggggagtcg gagtgcaggg ggatgtgatg gagagtgtcc 11161 agctggaggg gtgtgggggt cggatttctg tagctttgga atttttaaaa ataagaggaa 11221 atatcagggc agaatccttc cagtttgggg gccagacctc ctggcaaccc tctgccttga 11281 gagggctcag agaccctgca gcttcttggc agtcatcact ctgagggtgt ggtgaacttc 11341 ctattccctc cccttttttt cctttacaat ttttttaatt aaaaaaaatt tgggggggtg 11401 ctgggtgcag tggctcacac ctgtaatccc agcattttgg gaggccaagg tgggtggatc 11461 acctgaggtt gggagttcga gaccagccag accaacatgg agaaaccctg tctctactaa 11521 aaataggcgt ggtggcgcat gcctgtaatc ccagctactt gggaggctga ggcaggagaa 11581 tcgcttgaac ccgggaggca gaggttgcgg tgagccgaga tcgcgccatt gcactccagc 11641 gtaggcaacg agagtgaaac tccgtctcaa aaaaaaaaaa aatttaaata gagatggggc 11701 ctcactatct tgcccaggct attctcaaac cctgggctca agatcctcct gccttggcct 11761 cccagagtgc tgggattaca ggtatgagcc atcacgcctg gcctcctgcc ctctctttta 11821 tttaagcctt ggaattagag aatcaggaat gagctgcttt gccttttgag cttggccttt 11881 agccaacaat gttgagggcc tcctggggtg caggctccat ctaggtgctg ggctcatggc 11941 agggagcagg acagacaatg ccccacgggg gccttgaggt ggggtcagag gatgcctgcc 12001 tgagaggcac cctgtggacc tcaccctgag gctgagggaa cagcaggtgc aaggcctggg 12061 tgctggcaaa tgcagacagt ggtgcaaatg ttggagtaga tgtctgcggc ctgagagttt 12121 atgatggcac atttgggggg atttttgagt ttttgaattt ctgtggtttt cgtctgtggg 12181 aatgaagtga ctttctttgt tttttttttt tttttttgag acagagtctt gctctgtgac 12241 tcaggctgga gtgcagtggt gtgatctcgg ctcactgcaa cctccgcctc ctgggttcaa 12301 gcgattctcc tgcacgggga ttacaggtgc gcatcaccat gcccagctaa tcttttgtat 12361 ttttagtaga gatggggttt tgccatgttg gccaggcttg tgtgtgtgtg tgtgtgtgtg 12421 tgtgtgtgtg tgttgttgtt aagagagaca gggtcttggt ctgttgctca ggctaaagtt 12481 cagtggtacc atcatagctc acagcggcct tgacctgctg gggtcaagca atcctccggc 12541 ctcaacctcc tgattagctg ggactacaag tgtgcgccac cacacctggc tcgtttgtgt 12601 atttttttgt ggagacagga tcttggctgt gttgcccagg ctggtcttga gaactcccgg 12661 gctcaagcga tcctcccaaa gtgttgggat gacaggtgtg agccaccatt cctagccttg 12721 aagtgaccta ctgttgtgct gccctgatgc acggtctgag ttctggagca tgtctctggg 12781 cagaaagccc ctgtacagct cttagatgga cttccaggat gtggttttag tcgtagctag 12841 gttcacatcc agaactaaag gagttagaga ctccagctct ggggcctcgt gtatgactag 12901 gaagtgggaa ttctaggata cgtctttaaa tactgtgtcc agagcagagg tgatgggagg 12961 agctgctttt tttgggggcc gggggggggt ttgggttttt cttttttctt tttttttttg 13021 agatggagtt ttgctcttgt tgcccaaggt ggagcgtgca gtggcacggt ctcagctcac 13081 tgcaacctgc gcctcctggg ttcaagcaat tctcctgtct cagcctccca agtagctgag 13141 attacaggca cgtgccacca cgtccagcta attttttttt tttttgcctt tttagtaaaa 13201 acgaggtttc accatgttgg ccaggctggt ctcgaactcc tgacctcagg tgatccgccc 13261 tgccgaggag ctgcttcttg gtgaatgcct gggatgggta cagggaggcc ttgcccctcc 13321 agagcctccc tggtgagggg ataagttgac agcctcaaag gactttattg gagtggcagc 13381 tctgtgccag ctgaagatga tgaccctctt gtccttcacc catcatgtcc cctggcgtgg 13441 gctgtggaac ttctctgtgc caggcactgt cccctactgc tgttttacct catttcatga 13501 tagttcctag ggtcaaatat atttttgaaa ttaatgtttt atcttgaaac ttttcagaga 13561 gtgaagggag aagttaaccc agaagtcagc ttgtgttgtg accagatcac tgcagggcag 13621 cattgggccc agtgtgcatc tgtgcgtgca ggtagcacgt gcccatgcac cccagccagg 13681 cgggctggag gggcctctgt ggggtgctgc tccccatgag cacctgtccc taggctcgca 13741 caggtgctgc atcacacagc aagggcatta ggccgagggg ctgtctgggg tgctgggact 13801 ggcagtggga gatttggtgc cgtccctagg ggagctccca tcactcctcc ctggccctta 13861 attttgtgcc accttgtgtt ctgccatgac acagtgctcc cgctgcctga ccccgtggtg 13921 gggttaggtg gcctatgggg gccatgccag ggactgttgg ccatgtcatt tgaggtgctg 13981 gtacttggga gtggcatgag gagccccagt ggctgtgtgg tctccttgca agtgtcctgt 14041 gcagttgtgg cctgatgact tagagccatg cgctgtctgc ctttgctgaa catagaagag 14101 gcgccttcag tgagggtgtg gcctcggttg ccaagggtat ctcagtgttc agtttctact 14161 tcatcctcag gagagggctg ccttgaaggg atggggaggg tgaggtggca gtggatgtgg 14221 ccagtgacat ttccacatct gcatgggtgg cagcttcgta gtgtttgatg agccttacac 14281 tgtggtgggt ctggcatcat gaagctgtcc tcagggcaca ctgctgtcag tctgtaaaag 14341 tcaccctgct ccttgtggtc ctcagtcatg ggaaagaagc atcttcctaa gcatcctggt 14401 tgtgggcatg gacatggctc tgtggctagg cgggggttag accagctgtg tgggggaccg 14461 gggggtgtag aaggccacca gtggcccttt tttggccagg taggtttgct agagaagagt 14521 gaaatccagt cagaacaagg gaaactgtct caaagcagag atgaggccag ggtgtcacac 14581 cagagtcatc gaggctggtt cttctgcact gagtcctgga agctgcagcc tcactcaggt 14641 gacactgcgt tctgctgcag gttgggcccg aaggcatcct gcagaacggg gctgtggacg 14701 acagtgtagc caagaccagc cagttgttgg ctgagttgaa cttcgaggaa gatgaagaag 14761 acacctatta cacgaaggac ctccccatac acgcctgcag gtgagctgag ctcagctggg 14821 cctgggcatg tgctggacag gtgggtgcct ctggcatggg cctgggacac tcacctccca 14881 ggtacagggt ggcgcccttg tgctgtgact ctgggtggcg gtgtcctcgg gagtaaggaa 14941 attgctcagg ccatgtgcac agccaaggcc tgttaccggt caggaggtgg gctcagcaac 15001 ctccgacacc ctgaattcac cagctggtgg gaaaagtttt ccttcctttg gcctcctcag 15061 accctggggg cattgcctgg tacctgcgca ctcactcata gaaagaattt gtcatggcat 15121 tgggtgagca tctggggtgg gagggtgagt ggagcagctg agataccctc acaaagcagt 15181 tgctgcaggg ttctggggca tccctgcctc ccgctgcctt ggccgtgcct ggagaattgg 15241 atgtgaagac agtagttagt ggatagacct gcatgttgac aaatttagaa ttctcatctc 15301 gagtgggatc gtcccttgtt gaggcttatt tctgtatttt tgtgacacac aagtgaggaa 15361 atgtagatta gaagagggga gaaggttttg gccaggcacg gtggctcact cctgtaatcc 15421 cagcactttg ggaggccgag gtgggcagat cacgaggtca ggagatcgag accatcctgg 15481 ctaacacggt gaaaccctgt ctccactaaa aatacaaaaa attagccaag catggtggcg 15541 ggcgcctgta gtcccagcta gctacttggg aggctgaggc aggagaatga catgaacccg 15601 ggaggcggag gttgtagtga gccgagatcg caccactgca ctccagcctg agcgacagtc 15661 acggcaggtt tgggcctgag ccctgcctct gtctgtgggc acccacttgg ggccctccct 15721 gcagctcctt gcccgcgtgc tgcccagcgc tccgggactg aggctgctct tcgctaggct 15781 tggtgggcag gcactctttc ctgcatctca gctcttcact cattctcatt tacagatgtt 15841 ttcgtgtatg taattgtgtt ctcttaacac acagagatgg ccataagtcc tggtgcagct 15901 ctggggagct agtatttttg ggtttggtat gtggattttc agaaaaccac agccagtggt 15961 ggggtgaggc tctcccgagg ctcctcgctt gggcttcggg gcctgagctt cagggcgctg 16021 ctgctgcagg gtctgtgccc agctccctgg gagcacagtg ccctctgctg gagcccagag 16081 ttcaccatga ccactgggtc tgaggctgga ttcctgaatt tcaggccaaa gaacggggac 16141 tagaatggcc ctggaggagt gaggcctcct tccagcagga ctctccttgg aggtggcttt 16201 gctccttatc ccctcggagc ttctgccaca taccccttcc ctgcattctt ggatttgaga 16261 agccctggtt aactggtttg atttggttac tgggattgtt ggggttgagt atgtgtttaa 16321 tgaagaactt aaaaatgttg tcagaggaaa gaatttcaca tcttgccaaa tgaaaaccag 16381 agagcaagct tccagctgta actgtcgctg ttttgatttt ttagttactg tggaatacac 16441 gatcctgcct gcgtggttta ctgtaatacc agcaagaagt ggttctgcaa cggacgtgga 16501 aatacttctg gcaggtagat aatcagacca tgcatgtgtg tttaatcagt gctgtgctca 16561 gatttgtgtg taaattatgg aaatgttgaa gcaaagcgta atataaaatc acgctaacaa 16621 acctgggttt tttcctcctc tgtgactggt ttaacttgaa acagatggtg gtgtgtgaat 16681 gccttaatgt atttaccctt gatttattct ctgtggctct ttttattatg caagtagggt 16741 tggcttaatt ctccttccca gtttttattt cgatttttct gcaagatgag ctcttggcac 16801 acccgctgat cttcattaac gggaggatgt tcccaaggca gtaacgggca tccaggggcg 16861 ctggttcctg agcaagcttg ctctggcctg agctcatcgg agctgtccgg aggctaattc 16921 aggccagatc ctgaatcatc ggcttcaatt aagttcactt gaccgtcggc gctgggcttg 16981 ccttcctgat gttgatcagc tgcagctgga ctgtgtcgct tgttctgtgt ttgggacagg 17041 acagtgttct cccagtgccc aaattccaag gggggagttc gatggtcttg cccgagtggt 17101 gggactctcg tcctggtggg tcccgtgcta gcggctcccc ttcccctcca ggttgctgtg 17161 tacaggtcac tccactgctg gagaggatgg agcagagcct gggaggcaga gctccgaggt 17221 gctagcaagg aagtgtgggg tctttttctg aggttcacgt tctggagaaa gaaaatgacc 17281 aaagcccgga ggcaggctta cttgtgctga cacaagcccc tcctgtgcct ttgacagctg 17341 tgggtctcgg agggttacag atggagcagt cagatctcgt ggcatctctg agttagtaac 17401 ttgaaatatg gtgttatgag caaagccaga aaggggctgc tgtttttaaa agattttaaa 17461 gaagggtatc ttctggataa ccaaaagaca ttcttgtccc attttcagaa tcttctgggt 17521 aatttaagtg ttgctatgtt tgcatttagt tcctcttgtt caggaaaagc aactaaataa 17581 aacttctctt agctggcaac cccagcttcc ggtcagcata gccaagaaca gggaagtgag 17641 caaaagggcc tctgagaaac aacaggcttg agccaagcac cactgagcat tctgtgcgag 17701 gcccagggga aggtgaggaa gcagccgctg gcccttaggg atcttggctg tgacagggtg 17761 aggatgcatg agaagacacg gagacagctg attctgccca cgtggccggc agggctccag 17821 agtggagtcg ggctttggag atgctcagag tggccatggc tggttagaga atcctgggcc 17881 tcgtggcagg gagaagggtt agaagattag ctgagctgaa gcttttggca agaatggttt 17941 tccagagggg aagagagagg gcgcggtgag gaaggctggg aggccctgca ggcgtgttca 18001 gagctgctgg agacaccgga agagcccaaa ggcagagcag ctgggcatcc ttgttagata 18061 cggcagatag gtgctggagg gtggctggct tggcacaggg cagggttcag cccagaccgc 18121 ggtttaggtg agcagtgtcc tggccacgtc gcagcaagga aggagccgag ccagccaggg 18181 cttcacacac agccctgggc ttctgtagaa ccgcccccgc ctctgctcaa cctttttgct 18241 gctttttgcg ggggtttcca gtcgcctgct tatcagattt cccggcctgc agcaaagcag 18301 gagggactgt tgctcctgga agtaggcaca gggcctctga ggagtgattt cagaagaagc 18361 cacagaagat ggtggcgggg gttgaccttg gaaagaaggg gctggggcct tgtgggaaat 18421 gtctctgacc ccacagcagc cctgggccaa caatgggtta aattaggcct gataagttgg 18481 atttaagaaa gcaggatttt taaaaaaggc aggaactatg tgaaccgtgt taaggaagtt 18541 ttccttaaag tggttcaagt tgggattgat ttttgtgctc agtggggagc tagtttgggg 18601 gagggctggc ccccagagat gccagatgga tgaggtgtga ctgcctctgc taatggaccg 18661 tgaacggtac cggaactttt aacaggggcc cgaaaattgg aagtggtgaa aagccaaatt 18721 ttgggtgtta accgtttatc atttcctggt ttcagccaca ttgtaaatca ccttgtgagg 18781 gcaaaatgca aagaggtgac cctgcacaag gacgggcccc tgggggagac agtcctggag 18841 tgctacaact gcggctgtcg caacgtcttc ctcctcggct tcatcccggc caaagctgac 18901 tcagtggtgg tgctgctgtg caggtgagtg gtccccagat gtctcctggg ggtgaccttt 18961 aagctccagc cgtctcctca caagccttgg cccagcccag cccagccgtg gctctaactc 19021 cagggagttg tcctccaaag atggtttttg ctgaagggtg aggcatgaga gcgtttaggc 19081 gctgaggctt gttaaggagt cggcagtgcc gtgtaactct tttggggcgt tctggctaag 19141 tcataaaaac aattctgtaa aaacaaagtc tgttctgagt tttaagagtt cagagctcaa 19201 gtgcacaggg agatgcgatt tattactaac agtttgaagg taatgtgatc acataataaa 19261 atgcagggca tgcccctttg ggtgaaaggt cagcatggga gggggccctc cctgctccgg 19321 ggcttcaggg acgggagctg gtcctcacgg ccccctcccg ctctgcaggc agccctgtgc 19381 cagccagagc agcctcaagg acatcaactg ggacagctcg cagtggcagc cgctgatcca 19441 ggaccgctgc ttcctgtcct ggctggtcaa gatcccctcc gagcaggagc agctgcgggc 19501 acgccagatc acggcacagc agatcaacaa gctggaggag ctgtggaagg tggggctgcc 19561 cagcgggccg acccgtgcct tcgtgtggtt tctggttgcg gggaggggag tgtcttcaga 19621 gacggcttga cccagtgaga ccgctggaga ttctctgaaa ggaattcagg cagacctctg 19681 ccacctctac gtggaatgac ctcagggcac cttggtcacc ttccatccag gcagccttac 19741 ctgaccgaga agatcctggc cttggggaaa ggaaagctct ggctgctgga atttttcttg 19801 tcttgccggg gtgaggatgg gtggtgggag gggaagaatg gggctggctg tgggcagact 19861 tgtcctgcag agcctgaacc tgcccagaca ggtgggctgc gagaatgtcc tggggaaata 19921 ggttggtgtc gaggtcggca cagtcagctg agacagaggg gagaggagag ggtgccgggt 19981 caagtagtgg gtgcctggcc ctccttccac agacagccct cacttttccc cacggaagcc 20041 tttctgtgct tccagctgtt taggattttg tgataaagta gttctaataa ctaggctaga 20101 agttttcatt tgcctcccat tcagtcagaa atctcacata ggaaatgaag tgcaagttaa 20161 aatatttttt taagcattgt tcagccgggt acctttccac ttggattttc cggttgccag 20221 atgatttcat gattcccgtt tatatctgaa cagcccagaa aatgtcctgt ccccaccagc 20281 atggcactca ctgagggagc tggcccccag gggaagagct gcgggcccat gttggtctgg 20341 ctcagggtta gcagacgtgg gagacaggcc tcccgggctg tggacacagc gagagaaacc 20401 ggtggtcttt gtggcagcct gcgaggcggg ttagctgctc tcgttcacaa gctgggcata 20461 cttggaggaa gcagctggct ctgaaggtgc cctctgctgt ggacagtgtg tcaggctggt 20521 gcctggggct ttgtaggcag gttcacccaa gtaccggaac cccgctgggg tgcggagcag 20581 cagctgaggc ccaggcatgg gccaggccgg tgtgcagctc cctgtgtggc atggagttcc 20641 cgttcccagg gttctccttg caggtggggc tgcgccagaa cccctccatg ccacccaccg 20701 tggcccattc tgagaagcgg catgtgtgct ccatccctct ggtgcctctg cgccctcgtt 20761 catttgcatg tagggaaaaa caggacgagt gtggcgcggt gttgttgtct tctaggaaaa 20821 cccttctgcc acgctggagg acctggagaa gccgggggtg gacgaggagc cgcagcatgt 20881 cctcctgcgg tacgaggacg cctaccagta ccagaacata ttcgggcccc tggtcaagct 20941 ggaggccgac tacgacaaga agctgaagga gtcccaggtg atgtgtgcga gagggcttgg 21001 cctggggtgg gctctggctc tcacagctct ctcctcaggc ttccagaggg aggttttctc 21061 tccaggctgt gggagctgat gtgggctgca gccgcgacac ctgagagttg gaacttgcgg 21121 atccgggcct ggcagcccgg tggtctgagc ccccgccctc agatctgaca tcagcctcag 21181 tgttttctgt ttaaatccaa cacatctttc ccagtggggt ccacagcttt gcccacagtt 21241 gcgagagtta gactcgggcg tgatccctgt gaagaccaca tttaggcaac gggagccctg 21301 gagcgctgac ctcccctccc tccctcccct cctctttctg tccctccctc ccctcctctt 21361 tctgtccctc cctcccctcc tctctctctt cctccctccc acagcagctg ctccctggcc 21421 acggaggcct ctggtgtgcg gtgaagcctc cagacgggat tggcctgtgt gtgttccggg 21481 gactcctaat ggcttctctt tcagatgtgc ccttggtggc atttggggct gcaggtgcca 21541 ctcacggcgc ctgctctcac ggcagaggct tgctgtaggg cccgcctcat ggggcctcgg 21601 gcatgtggag gccaggcccg ggcctgtgct ggaggctaac cggggctctt gtttttcatg 21661 tgctcagact caagataaca tcactgtcag gtgggacctg ggccttaaca agaagagaat 21721 cgcctacttc actttgccca agactgactc tggtaatgag gatttagtca taatttggtt 21781 aagaggtgat tttaagtttt aaaatatttg tgaccataag tagcataaat tcctagttcc 21841 acccttgtaa agtgcccctt aatttgaact ctccctggtg gaagcgacgg cgtgggttaa 21901 aatggccacc tctctcactt ttttacctca agacatgcgg ctcatgcagg gggatgagat 21961 atgcctgcgg tacaaagggg accttgcgcc cctgtggaaa gggatcggcc acgtcatcaa 22021 ggtccctgat agtatccttc atgtgaagag ggtgtggccg gctggtggga gaggaaagtg 22081 ggggcatcag gtggaggcca ctgtggattt gatgctcact gctgggccag catctcatgc 22141 tctgtggtgg gtgctggttg gcatcgccct ccactgctct taggagaatc acagggcctt 22201 caccttcagg ggacagctgg ttgtatggtt gcttcccctc tgcccgaatg gtgatgttgg 22261 tgatgccact tgtgtggcgc gtccctgggc tgactctgga agttaatgta tgccgcttgt 22321 gtggcacgtc cctggactga ctctggaggt taatgtggct gcaggagcgc cttacctgga 22381 cgctgactct gcacacggag gttccagaag ccagtggtca catgcagacc ctctgagcac 22441 acacttgctc ccaggccatg tttgggaatg tgcagtgacc tcatggtgat gcagcctgcc 22501 ggccagagtg gggcagctgt aaggcacagg tgcagtcaga ggcttgggtg gggaggagtc 22561 ctgggagaag acaccgcggg tcagactcag gctgcctctt ggtgactggc caatgctgct 22621 ggggcctgac cattcagcgg tggcctccac aggggtaggg agagttgtct cccacaccca 22681 ccctgggcct gacaagctca tcctgaacac gtgagaccgg ttttggggct cggggtcgag 22741 ggaagggtgt tgcctaccgg ccacagcctc ctggagggca gtttgcttgt gtatgtggag 22801 ccttttgctt ggccgtactg ggaggggtgt tgtttctgtg gaactgtgag aagagctgct 22861 gtggccaaag ctgggttctg cgagctctgc ctgcccctgc tggccacatg tgcaggagct 22921 cccggtgagg tcctggggcc caggatgtgg ctgctccctg ctggtgagcg ctgctggctt 22981 cgctgccgcc ctctgcgtct ctgaaagggg cctagatggg tggctgggag tggcaaggag 23041 gaactccagg ctggctgggg gtgggacacc ggctcatggt gaggtagagt ctttcttcaa 23101 ggatgctgtc cttctggtct ctggtctggt gttggctgca cgtttttagc gtttggtgca 23161 gagccagcgg tgtcttaagt aactaaattt taaaaacgct gtttaaagag cgtgtaccaa 23221 gggtggccca gaaaggtcag cccggctttt gacaaggatg caaacttaac tcagcacaca 23281 gattatggcg atgagatcgc cattgagctg cggagcagcg tgggtgcacc tgtggaggtg 23341 actcacaact tccaggtgga ttttgtgtgg aagtcgacct cctttgacag gtacgtcttc 23401 tcccatcact gccccctgtt ccctggttgc cacctgtggc atctttatga gccctccccg 23461 tcactgtgga gtggggttcc ccacccagct ctgcaaactc aggatgtcgg agaggcggcc 23521 acagctgtgc gtgtcgccaa ccccaaaccc tcctcacagg atgcagagcg cattgaaaac 23581 gtttgccgtg gatgagacct cggtgtctgg ctacatctac cacaagctgt tgggccacga 23641 ggtggaggac gtaatcatca agtgccagct gcccaagcgc ttcacggcgc agggcctccc 23701 cgacctcaac cactcccagg tgcgcgccgt cctcagcgcg cggggcctcg cccatgggcc 23761 gggacgcaag cggaggctgc ccctaacggc cgcttgtatt gaaggtttat gccgtgaaga 23821 ctgtgctgca aagaccactg agcctgatcc agggcccgcc aggcacgggg aagacggtga 23881 cgtcggccac catcgtctac cacctggccc ggcaaggcaa cgggtagggc tgacacggcc 23941 cttgcgggca agacccggga gggctttagg gtggccagat ggaaggcctg gtgctgggag 24001 ccttgggctc tgtcacaccg aagagagcac gtggcgggta gtgtcgccat ggtgcctacc 24061 gctctctatg tgacattatt cgtgtcagct ccaaccttag gattgcattt tagtaaccag 24121 gtctgctact ggtttagaaa actgggggca gggggggcat ggctgcagca gcgtgagttc 24181 cgtgtgcggc actttatagt gtggcgggat ggtgcaagtt ccagccttgg cattgcttgc 24241 ggtgggtaga gctgtgaagc ctgggctgtc tggatctgag ctccttcggc atcgtgtcat 24301 tactgcctgt taaaaatgtc aggagtttaa ggccagcctg aacagggtga gacgtctcta 24361 caaaaaaaaa attcagaaat tagctggggg tagtggcgtg cacctgtggt cccagctact 24421 caggaggctg aggtgggagg attgcttgag cccaggatgt tgaggctgca attagctgag 24481 atcacaccac tgcactccag cctgggcaac agagcaagac cctgtctcaa aaaacagagt 24541 cttggcttac tacgttcacc gagcttcctc tgggtaagca ctgagctgcc ccaatggtgt 24601 tgcaggccgg tgctggtgtg tgctccgagc aacatcgccg tggaccagct aacggagaag 24661 atccaccaga cggggctaaa ggtcgtgcgc ctctgcgcca agagccgtga ggccatcgac 24721 tccccggtgt cttttctggc cctgcacaac cagatcagga acatggacag gtgtgtgtcg 24781 agtccatccc tcccagttgg tccctgagct tctgcgggtg acatgtacag aactcaggca 24841 ccctgctgac ctgcatgtgc ttccagcatg cctgagctgc agaagctgca gcagctgaaa 24901 gacgagactg gggagctgtc gtctgccgac gagaagcggt accgggcctt gaagcgcacc 24961 gcagagagag agctgctgat ggtgagtgcc cctcctgcct gcaaaagggc ctgtgggctg 25021 gcggcctgat ggtttttgtt tgggccaaag cacctatttg aattgttgct ttgcttctgg 25081 aaagagaact tagatgctct ctacttggcc tctctgcgct cagttgtaca gtaaattagg 25141 gacatctcgc tgcctggaac acggtcttct agggagtctc cactccctgc cagcctccac 25201 tgggatccgg cagtggatct agaacactcc tgctctttca taaccaacca gcctgaactg 25261 gaatcagcct gagtgcaggt tccagcagct cagcactgcc ccttggaggc tgtcgtgagg 25321 cgcagagctc ttggtgtgga gaggttgcct tctgtgaaac atgaccgttt aaggatctta 25381 aaagtttgta atcaccacag cctggaccat gtatcttgtc tgggagggac agcttgtttt 25441 ttatagtgtc agggagggtg agaaacagga gagcttctgt gttctgtccc acctgccctc 25501 cggggtctgg tggcgtttac agtgcaggtg ccctgatgcc tctgcaccct tccccagaac 25561 gcagatgtca tctgctgcac atgtgtgggc gccggtgacc cgaggctggc caagatgcag 25621 ttccgctcca ttttaatcga cgaaagcacc caggccaccg agccggagtg catggttccc 25681 gtggtcctcg gggccaagca ggtgggctgc ctcccctgcc ctcctgtgtg aaaactcgtg 25741 tgtgtgattc ttggtgtttg tcttttaaaa caccttgtat tgacgtagga tgcccagcag 25801 agtgtgcgca cactgggggc tccctgaggg gtttatagag ggtgggttct gccagctgca 25861 cgtccagtgt gcgtggtgag caatgcccct gtggccaggt ggtgtcctgt gcatcctggg 25921 gcccctactt ccttgctagt gtgtgttgag gctgattcac acctgagctt cttgacttgt 25981 gggggcccct gttcctacag ctgatccttg taggcgacca ctgccagctg ggcccagtgg 26041 tgatgtgcaa gaaggcggcc aaggccgggc tgtcacagtc gctcttcgag cgcctggtgg 26101 tgctgggcat ccggcccatc cgcctgcagg tccagtaccg gatgcaccct gcactcagcg 26161 ccttcccatc caacatcttc tacgagggct ccctccagaa tggtgtcact gcaggtaacg 26221 gggctctgcc cagggcaggg gcttctacag agaagcggca tcaagggaat gtggactggg 26281 gagaaggaac tggcccatca gcttcccctg agcggcttca catagccact gcttgaggtc 26341 agggcacttt gtgcccaagg tcttgatggt atggtcttgg atcaagtggt gtctctgggg 26401 catgtgggga gcctgtgtcc tctgtaaaac tcagtcattg cctgagaatt catccaagcc 26461 tcatgtggca gcatgcacag cagccaggac aggctgttag gggagctgag cccgggcctt 26521 aacgtgggct gtgcattcag tgcttcatga agaacagctg ttctcagggc ctgggcccct 26581 ggggcccgca agaccgtgtc cggcttccct ggataacaca gaggcctcac ctgccctttt 26641 tcactctgct gacgctcaca ctgctggtga ccccctggtg cttctgcaag agtcagccca 26701 gagcagccag actgtgcccg gatcacagta agaaacgccc tggatgagga agtgaacagt 26761 acatcttact aagcctcgag tgtttcactg tgaccagata aaatttctcg tgacaccttt 26821 gggtatgccg gcgcttgaag gaagtccttg tggacacacc ctgggtattc accccaaggt 26881 gttgggcaga ctttttccaa aaatgatcga acctgtcgct tcatggagaa ctagtggcaa 26941 catctgttac agtagtaaaa ttcaagtgaa aaccagaatt ttggaaaact taacgtcgct 27001 tgccatgagc ttggtgtctt cccaacacct agagactttt ctggtgaggt ctgtgggggt 27061 gacagatgga atgtttccac cattgccagt gaaatgtgtc agcctttgga agacctactt 27121 gactcggtgg gagatccttc ctgaatgacc agcagagtgt cacgaagtca catggggatg 27181 gagtggcaaa gagttgtcac aggcctgcag aaacccacct ttgtccagct tgatgtggcc 27241 tcagggctgg agactcccag ccatgcagaa aggctgtaaa aacaaaccca tctccctttc 27301 cagcctcagg catccatgag gcctggcttc cccacagcgt ccccagagcc caggagggtg 27361 gactgagcag aggtggaccc cagggcccag ctgcctccga ttcagccaga tagtaaggag 27421 atttgcagaa atgggagaca ggaccactct tcttacttag cttatttttg gagaaataac 27481 taacacgtaa tgggttattt ttaaatgaat taatatattt tttaaatttc caagtttaga 27541 gaatacagtt atccaataaa caaaagctgt ttggactccc taacttttga gagtgtaaag 27601 ggtcctgaaa gctgccgaca cagcgcctca tgaactccaa acctgtgagg ttggtattgt 27661 caggcctagt tacaggagaa agtgaggctc agagcccacc ccccgggttg ggccacacca 27721 ctctttgcct agcggtaggt cttgggagtt gatgccacta ttgacatcgc agggacaccg 27781 ctgttcccag cgcatggcaa ggctgaaaac acaaggccac agagacctag cttgtgctct 27841 ggggtgtctg agcactggct gggaaaggcg tggctcggcg caggctcaga cccctcatgg 27901 tccccagggc atcggcccag agggcagacg ctggtgcaca ctgctctcta attgatgcac 27961 agcacgtagt tcacacccag gaagtacatt ttaggaaatt ctctattgaa gtgtaaacat 28021 gtggccacat tacaagtgtc tggctcaatg agtggttact aagtgagcat atttggatag 28081 ccagcacctg aacccagaac cacctgacca gctgtgagaa gccgcctcat ccctgcaggt 28141 ggccactgtc caggggtgcg catccacagc agcccctggg ctgagggact tctgagctca 28201 tccggcggcc cctgtgctcg ctgtaacagc agtgatccac ttggcctcat tttcagctcg 28261 acgtgggatc atatgatgtg tcctggcgtg tctggcttcc tctgtccatg tcaggctggt 28321 gctgtgcaaa aacaccggtt ttctccactg atggaagctc ccagtagagt gaggagggca 28381 cctgtgaatt gagaaactgg ttttacagtt ttaaggaaac gcaacagggc ccagaaaagc 28441 caagatgagc atgaagagcc cgaagaaagt ggctgctcac atggcaggac gccccggccc 28501 ccaagctgca gtgttgggcc ccggcccagg agggagccca gaggcagagc caccggtcaa 28561 caggccctca gcaggaatgg acaaggctgc agggcagggg gacactgttg gctcaacagt 28621 gtcacacaca agtcagttgt aggggctgtg gcgcctttaa ggcaagcttc tggaaggaag 28681 caggagaata tcctgtatcc tggggttggc agagatccag gctgcaagaa agccctggct 28741 gagaccctgt tccctgttca ggggctgagc cagctctgtg ctggccatgg tgctctcggt 28801 ggcctctcct cagccttgcc caatcctggg catctctggc caggggaccg gctcaggtga 28861 cctcaccagg gcctcacaga tcgaagtctc ctgccccagg acctgcagca ctgtagcgta 28921 gcaactacat tgccctgtgt ctgaactcat ttgaggcggg ctagggcttt tgaagtgtta 28981 cttctttccc tcccctcaca gcggatcgtg tgaagaaggg atttgacttc cagtggcccc 29041 aacccgataa accgatgttc ttctacgtga cccagggcca agaggagatt gccagctcgg 29101 gcacctccta cctgaacagg tgagcaggga caggcccacc ggcgtctgca ggtcttgggg 29161 acagcttgag aggttgttga cccattctac ttatttttaa catgggactg aaactttttt 29221 tgccttcatt tcctgtttag cctgtttaga ctctaaaccg tgttgtttct gcctcctttt 29281 ccattgtact tgttttttat tgggcagata cttgctaatc ccacatggca gccgtttctg 29341 cccctcgaga ctcccctggt gaaggctggg gaagctcagc tgtgcaaacg ccagtgatgg 29401 cagctgcctt cccgcagggt gttgtgtctg cacccctcac ggcctccagc ctcagggctc 29461 gggatcccac aggtggagcc cagcactgac agcctgggtt tcttaggacc gaggctgcga 29521 acgtggagaa gatcaccacg aagttgctga aggcaggcgc caagccggac cagattggca 29581 tcatcacgcc ctacgagggc cagcgctcct acctggtgca gtacatgcag ttcagcggct 29641 ccctgcacac caagctctac caggtgcgct gcgccctcgg gcacacttgg tctcctgggc 29701 catgcaaggg tattgaccct tgacctttaa gttacccccc aagaggggcc cgtcctggct 29761 ggagctcaga atggcccagg aagcaccagc tggcccaccc tctggggagg gcacagacag 29821 cacatcccca cagaccctgc gaaattccac atgatcagga cagagacttt gagaaataca 29881 tcacagggat cagaaacatg ggctgagatt ggcacatcaa ggtgttttct cgggatggtt 29941 gaaacaagtc caaggtggaa cggcagtggt ctgtgcctgc cacacactgg atgttgtgct 30001 gtggagaccg tggtgttcac acctttattg acacccggat gaaacaccca ctgcgggtgt 30061 gtggggagga ggcacagctc tgcctccatg aggaaatgag cccaagtgct gagtctgagt 30121 ggccacatgg cagtggtttc tgttctgttt ttgagcacgg ctcaggttct ctgggatgag 30181 cacgtgtgct cttagtgctg agaaacaaat ctccgtggct aaaagagaac aggctaggtg 30241 tcatggccca cacctggagt cccagcactt tgggaggctg aggcagggaa gtcgcttgag 30301 gctaggagtt ccagaccaac ctgggcaacg tagggagacc ctgcctccac caaaaaaaaa 30361 taataactgg gtatggtggc atgtgtctat ggtcccagct actcaggaga ctgaggtggg 30421 atcagctgag cctgggaggt caaggctgca gcgagctgag attgtaccac tgtactctag 30481 cctgggtgac agagcaagat cctgtctcta aaaaaagaaa acaaaacgaa atggcagcag 30541 agccaggaca gcccctaggt gcggtgagca ggcgccaggc cccaagctcc cgggtgggat 30601 ttgagggtgg gtcttggtgt gtttcctgca ttttgggggg actgggaagg cagcctgctg 30661 gctgatagtg accacaaagc tcccttccag gaggtggaga tcgccagtgt ggacgccttt 30721 cagggacgcg agaaggactt catcatcctg tcctgtgtgc gggccaacga gcaccaaggc 30781 attggctttt taaatgaccc caggcgtctg aacgtggccc tgaccagagc aaggtaggga 30841 ggctgcctgc tgcatgctgc ccagccgctc atcggtcctc acttcccagg gaatttgggg 30901 ctagggttgg gggttgccag ggccagaggt cagaggactc tgagcagcag ttgagaaacg 30961 atgtctcact ggagagatct ttgtttccgt gtcctggggg tttgctgtgg cccctgtgcc 31021 tcagcccagc aggacgtatt tttccaggcc cttctggtct gattcaaaat aaatcttgtg 31081 tgtgtctgtc aagttgttta atctgagagg ctgtaagttt tggtcctgtc cactattttt 31141 ggttctaaaa tacaactttt gtgaactgat gatagcagaa gggaccccga gaacaaggtg 31201 tccattctgt catcccacat aaaactagac atcaggccag gcatggtgtc tcacacatgt 31261 aatcccagca ctttgggagg ccgaggcagg tggatcacct gaggtcaggt gttctcgacc 31321 agcctaacat ggtgaaacct cctctctact aaatacaaaa aaatgagccg ggcgtggtgg 31381 cacatgcctg taatctgagc tacttgggag gctgaaacag gagaatcgct tgtacctggg 31441 aggaggaggt tgtggtgagc cgagattgcg ccattacact ccagccactg ggcaacaaga 31501 gcaaaactct gtctccaaag gaaaaaaaaa aacactagac agcaaggacc cattcaccag 31561 ctggaagttg gggggacctg gcctgtggct ccacccagca cagggagaga gcctggtgct 31621 ccttccggct catatgcgcg gtggttctcc tctaggtcac catggctttg tcattggtta 31681 ctccctcttt ctaaggcgcc ctcttgtttg gtgggcagta ttgggtgggt ccccccacag 31741 cttcgtgagg tgggctagag gagctgggca tcgggtcagt gccccggcct gctgggggcc 31801 ctgtggggcc gcgtgtgccc cggtgcctgg aaggccgact ctcttgacag caggtcttct 31861 ctccaaacgt atccacccag ccaggtgtct gccatggggc tgcttagagt cggccacaaa 31921 atcaacccgt gtgcagggtc agtggcttgg cattgggctt tggggcctgt ccctgtggct 31981 ggcagcctgc ctgctgcccg gtccacgcct ctgttgcctt ggatttgggt tctgagtgaa 32041 tgcagccttg cctcttggac cgtcctgtga gacgggcagc tctccacctg cgtcctcagc 32101 actgcgccct tgttgcaggt atggcgtcat cattgtgggc aacccgaagg cactatcaaa 32161 gcagccgctc tggaaccacc tgctgaacta ctataaggag cagaaggtgc tggtggaggg 32221 gccgctcaac aacctgcgtg agagcctcat gcagttcagc aagccacgga agctggtcaa 32281 cactatcaac ccggtgagcg cctgcacagg acagcagggc agcacggaga aacccgggcc 32341 caaaacactg ctgggaacgt gccagcttgg cccgtgtgcc cgtgagctga gggagggctc 32401 tcctagggga gggtgggctc tcctagggga gggtgggcca gcccacttct tgtctcccga 32461 gccgtgtgtg gcatttttgg cttggggccg ctaagtcctg ggtggcctct gggaattggg 32521 ctcagggctg ctgtgctgca tgcagcagcc tgggaccttg gacacgggca gttggaagct 32581 ttgcaagccc ggggtgatct ggtagtgagg gggtgtgaag cttcctgagc tgaggcccaa 32641 gggtcagcat cagagcccag gtctgccgag gagggcaggc gagacccaga ggccatgaga 32701 cagcagctcc ctcctgtctc cagggagagc ccctggcaca gtgtcaacac ctcacagctg 32761 catacctgcc acctcccagg ccaccgggcc cgtgggaagt tgttccttct gttgagatga 32821 caaattcctc acctatctaa accttcgcag ggagcccgct tcatgaccac agccatgtat 32881 gatgcccggg aggccatcat cccaggctcc gtctatgatc ggagcagcca gggtgagtcg 32941 ctcagcaggg gacctggccg accccttgtc ctcacaccgg gatgctggcc gttcccatgg 33001 ctgcagcttt aggtgccccc atctttcctt cccctcaagg gcggtgcctg gctcccttgg 33061 cctccccgcc cctagggcat ctctagcccc ggaacactcc tggggtgttt gtctttaacc 33121 agagtttatt catttgtgtt tcctggtgat caagctagtt agctttttct tgccttttgg 33181 tagggcaggg actgcatgta gtacaaatag aacattttca taaaaaggat tttttttttc 33241 ctttggtaaa tgcagggtga ttttcattta ttttcttttt tttgtttaga gacagggtct 33301 ccctatgttg tccaggctgg tctcaaactc ctgcctcaag cagtccttct gctcagcctc 33361 ccaaagtgct gggattacag gcgtgagtca ctgcacctgg ccaatttgca ggtgtttttt 33421 tttttttttt tttttttgga gaccgagtct cgctctgttg cccaagctgg agtacagtgg 33481 tgtgatcttg gctcactgca acctccccct ctgggttcaa gcgattctcc tgccttggcc 33541 tcccgagtag ctgggattac aagcacccac caccatgccc ggctagtttt tgtgttttta 33601 gtagaaacga ggtttcgcca tgttggcctg gctggtctca aactcctgat ctcaggtggt 33661 ccacctcggc ttcccagagt gctgggatta taggcttgag ccaacacgcc cggccccaat 33721 tttcagtttt taagatctgt gtaggccaat gattcccagc agactctcct cggggaaggt 33781 gcctgcccaa gcacccagcc ccaggcaggc tgctgtagtt aacatggagt cctgcgaatc 33841 cgcatcttca gcctgggcag agccaggaca gatgtgcagc tccggctgac tggctggtgg 33901 ggtgggtggg gtatcgctgg ggtttgaccg aggcaggtga cacctgccgt gttccactgt 33961 gatttgcagg ccggccttcc agcatgtact tccagaccca tgaccagatt ggcatgatca 34021 gtgccggccc tagccacgtg gctgccatga acattcccat ccccttcaac ctggtcatgc 34081 cacccatgcc accgcctggc tattttggac aagccaacgg gcctgctgca ggtgagcatc 34141 tgtggctgcg gctgggtgtg gccctcctga gagctcttga gggtgtgctt gtctgcgagg 34201 ccctggcctc cttcggatca ccctggactg ctgtctttca gggcgaggca ccccgaaagg 34261 caagactggt cgtgggggac gccagaagaa ccgctttggg cttcctggac ccagccagac 34321 taacctcccc aacagccaag ccagccagga tgtggcgtca cagcccttct ctcagggcgc 34381 cctgacgcag ggctacatct ccatgagcca gccttcccag atgagccagc ccggcctctc 34441 ccagccggag ctgtcccagg tgagcccgcc cctgggacgg gacttacctg agtgagggtg 34501 gggctatgca cctgaaacat tccctctgaa gagccccaga gagctggcct ggcccatgtc 34561 cactgtctga attacctgtc cctgggctgg ggtcatcaga gtgggtctcc tgggtcttag 34621 tttggggacg ggttttccat tcttttctct ggggctgctg agggctgggt ggatgtgagc 34681 acccttggcc tgtggcttgc ttacctcctg accttgtctt tcaggacagt taccttggtg 34741 acgagtttaa atcacaaatc gacgtggcgc tctcacagga ctccacgtac cagggagagc 34801 gggcttacca gcatggcggg gtgacggggc tgtcccagta ttaaaaggca agcccccctg 34861 gagcaggcct ggccccaccc cagcctcaag gagaaggatg ggagggggct ctccccaggg 34921 agctgcactg gaggggtggt atctggaaat gtgtgctgtg cctggtgggg gtcataggcc 34981 ctcagggcca gcttggcctg tgcccttcac tgctagtcag ggtggctcct caccccaccc 35041 taccccatcc tgtctgctcc ggggaccacc gcgggacctc agtttcctca tcagagtcgg 35101 ggagggcagc tcgggctctc cctcactgtc ctgtgtgctg ggtacctgtg gggctcaggt 35161 cgcagcctct caccgcctcc tgcccttctc cctcctgaca ggtggcggcg gaagagctaa 35221 gcaacgtggc ttagtccatc agcatcttat tctgggtaat aaaaaataaa aataaacgga 35281 tacctgtttt ccactgctaa aactgaagca ccactgtgtg agcaacagga agggagagcg 35341 cacgagggag aggagccgag gccgagcgcc ccctgctggc ccgcggcggc gaggagcaga 35401 gggagcggag gaggggccgg cccgcgggag ccgcggccac caggaggccc cgctccgtcc 35461 catcggggct gcggccaggg cggagggagg aagaccctca tctcagagta gccctttcct 35521 ctgttctttt atttcttttt ctctttgatt gaaaggggac tacgtcttag caggaaaaaa 35581 aacttcgcat ttctgtgccc gagcaggctc cttgcaaaga cagcagcgtg cggggcagag 35641 ccccgggagg gcgcgtctgt ccacgcctac cggacgcgcc gaggtcgcgc tgcctgtgtt 35701 ctccgagggc cttcatttaa agaaaataag ggtgttttgg gtttttctct ttgttttttt 35761 caagattctt ttaaaggagt actgaagaat actttcctaa gtttgtctgt aaaatcttag 35821 cggtggacct gggagatttg agaagcttcc agaaacagtt taaacaagcc agcgctactg 35881 gagaagagga gcaacacctg tgccgcggcc ggaggagttt tgttgttggt tttagcttcc 35941 agtggcttct ttctgcgggg catcaggctg ctggggtagc cgcccgccga gcctggaagc 36001 tgctcgttct ccgctggact cagaagccaa gctgcttccc gcctagactc ggcgcagggc 36061 cccgcaccgg tgaggaaggt gcttttggcc ccattgcgag gggccttggc caggactggc 36121 cctgtggcca ggaggcgaga aggtggctgt tcccggattg acggcttttt cccgggggcc 36181 tttggaagat ttggtggaag gacaagaggg cctgtccctg tccccgtccc caggaggtac 36241 cgacagtccc tgtgctggtt agacacggag cgctgcacac cgaaagccca aattgggagc 36301 tctgcctgcc ggcaactttg ctgatggggt gattgctgct tctggggggt aaggaaacaa 36361 gttacagaaa ttaccgcgtt ctgtgtgaag ggactgaggg tgtggtgtca ttggcagagg 36421 gtcattttag gagagctgcc ccagcccctc gaacgcctgg cttggggtgt cattctgcct 36481 ggcggccagg cctccagctt cccctgcccc gggcctgggg ctgtcactgg ccctgatccg 36541 aacacctcca gattccggct tctacatggg acagacgggg acgcacaggc caccttcctt 36601 ctggcaggga ctcttattta ttcccattgc tctagggctt tcggtttccc cttcttccgg 36661 taggccgcgt agaggcatgc accgggtagg tttccgcggt gaccccgcgg cggcctgagg 36721 gacgctccct gccccatccc ggctgttggg ctgggccgct ttgcctctgc ttcgccctgt 36781 gctgtgttct ccagctttgt agcagcagcc ttgacaaacc caggcgcact gtaccaaggc 36841 aatgtaactt ttgattttcg gtcaatttaa gttcttttgt caccaaatat taataaacag 36901 ttttgacttc acaccaaggt tggataaact gcagggggtg gagggtgctg gggttttgcc 36961 ttggctgagt gacgggttgg ggtgtgaagg aggccagagg gagacaggca aagcccagaa 37021 gggacaggtc tgccttgcag gactgaaggg cgacccctgc ccctgggccc ttcccgtagg 37081 cagcacctgg gcctgtccgg gcggctcccc ggagtggcgc gttaggcctc tggcaggtgg 37141 cctggaggtg gaaaggccta tccgatcacc aagactgagg ggcgccctcc ccaccaccca 37201 ggcaaggagt ccaaggagac cagcggagca gaccacgcgg catttattgt tgggcccgcg 37261 tccctgcccg ccccgggtta gcggcagccg cactcgtcca ccaccatgtc ctcatactgc 37321 cgcagcacca cgttgtcgct gttgtcaaag aagagcacgg agatgggcga caggcgcgcg 37381 ggcacgcagc agggcaggtc ggcggctccc ggggcggccg cgtgcatgag cgcgcgcagc 37441 acagcgtggt tgagcgccgg cggccccccg gaccccgaca gcgcgacggg cagcgcgcac 37501 tgaccctggc agtagttggc caggaagccg cgcggcgcga tgacccagcg gtgccagccc 37561 acctcgcgga agctcacgta cagccgccgc gcgcgacaag cgcccccggg gccgccgccc 37621 aacacgggtt cggcgtcgcg ccgcggccgg gccagggggt ggcacaggcg cgggtcgagg 37681 gtcaccagca gcagcgaggc ctcggccagg cgcgcgcagg cggcaggggc ccgggggcgt 37741 agcgccagcg ccaggcggag gctgcgcggc catgaggcgt tgcgagccca agcggcgccc 37801 agcagctccg cgcgcactgg cggccccagg gcgggcacca actggcggag cagcaccggc 37861 ccggggtccg cgcccgcgcc ctggcccgct tgcgccacgc tcagctccca gccgccctcc 37921 ggggctgccg ccgccgccgc cgcgaaacgc agctccaggc gggcccggct cgggcgctca 37981 gcgggttcca cagccgacag gtcgaagacg actgtccact cagggcaatg ccccgcggcc 38041 gaggcaggct ccgaggcccg ggtgggcgca cctggggagg taggaacagg aactcggctc 38101 gcgctgcgtc cccggcctgc ccatggggtc tcggttaggg acagggagga aggtgaacgc 38161 tggggctcgg ggcgcaccgt ctgtgggaag ggtgtgaagc ggcggggtgg gctgatggag 38221 tgggggcagg gcatgagttc caaggaaggc gccctgcggg gtgcggcggg gggggtgggc 38281 tgccctcggc ggggggcata ggaggagggg tggccgaagt gggagacatg agccagaacc 38341 tgggcccctg gggaagccat gctagggtgt ggggatggca tctgcaggaa ggcattagtg 38401 tcaagaggtg ggggtgggca tggggtgggg acagggcttc agcatcaagc agaaggactg 38461 gtcttgggaa ccccctgaca tgtgtccccg gcgtgcagca gatgacgagg aaagggtgag 38521 gtgcggtggc cgaagttgct agtagcctgg acagggcggg tggggaccct cggagctgct 38581 cgggcatccc cgggccactc tcgaggctcg ccctgcgagg tctggcctcc aggaccagtg 38641 tccccagcga aagccccact caccgcggtc cgggatgtgg cgcacgatgt ttccggcgac 38701 ccccagctcc tccacgtggc acggttgcag ggtgacccct ggggacgtcc gccgcgagcc 38761 agacctggtc tcctgggggt cccggcgtcg aaacaggcgc cacatgaccg ggggaaccgg 38821 ccggagcctg ggggcaccct ggggctcatc gcgcagtcct agagcctgga gcagggcggc 38881 ggctgggcct gggggcacgg gggcgcgggt caggggcagc gagggcagca gcagggccag 38941 gaggaggagg aggtggtggc cgcagggacc ttgctgcggc ggtggcatct tcctcccagg 39001 cgatgaccag agagtgcgca gggtccgcgg cggcccggga ccagtgggct gagggcgggg 39061 ccggtgtccc cggaggggca ggggtcctgg ggggcgtggc cgggaactgg aggcaggatg 39121 agggggcggg gtcccagggg aggtggcggc ggccctagag gagcagagtt ggagggggtg 39181 gaggggcggc caaggacggg gagcgtggcc ggggtattcg gggtggggcc gggtccacgg 39241 gggcggggcc gaggggttca gaagcgcttg tccttcacca ggccgttcct cagtggcttc 39301 ctgggggtca gaaccggcgc aggttagcct gggagcccca cgcggccgcc tggccctctt 39361 tcccgcttct tctctggccg tttcacaccc cctggctcct tttacccggc caggcccggg 39421 cctcgccttg tggcttcctc ctcgccttca ccctgcccct ccttgccttc accctgcccc 39481 tccttcaacc tgccccgtag gcacgtatgt cccccctggc accctggcac ttcctccttg 39541 cttccccctc tccaattaaa ccttcctctt cccctcctcc ctgcctctcc aggccgctgg 39601 agggcaaaac ccacgtaccg gcctgggcct gacaactcca ccgcctccac agtgggaccc 39661 tgcacatcct cctctcctct gccacccacc ccacgaaggc tagtccttga ctccagtggc 39721 cctggctggc atctgctgtc tccacctact tcccttccat gttcttttat tttccttttt 39781 tttttttggt cttactctgt ggtccaggct agagctgtgg cacaacctcc aaggctcaat 39841 ggatcctccc acctcagcct ccagagtagc tgggattaca ggggcacgcc accactccca 39901 gcaatttttt tttttttttt tttttgagac agaatcttgc tctgttgcct agtctggagt 39961 gtggtggcac aatctcagct cactggaacc tctgcctcct gagttcaagc gattctcctg 40021 cctcagcctc ctgagtagtg tggtggcacc tgccaccaca cccagctaat ttttgtattt 40081 ttagtagaga cagggtttca ccatatcacc caagctggtc ttgaactcct gacctcaggt 40141 gatccgccca ccttggcctc ccaaagtgct gggattacag gcgtgagcca ccgtgcctgg 40201 ccccagctaa ctttttttat tgtttgttga gatgaggtcg ctatgttgta gaggtgaggt 40261 tgctatgttg tccaggctgg tctcaaactc ctgggctcaa gcagtcagcc ctcctccgcc 40321 tcccaaaggg cttctgtcac tttaattcgc tcttctctta ggctgaagct cagcttctct 40381 gcatgcattt cacagccaac tggctgtccc catccaagga ctgttggtct ttatctgaga 40441 ggagataagc cttgcttgga cctcccatct gtcacttgtc ttctgatttt gcctgtgatc 40501 cttgagaccg tgagtggcgt ttcttcatcc acccttgtgg cttctcgatt cttcccataa 40561 acaaacctcc gtgacacact gtcccctttt ggctctggga tgactccagg gattccattt 40621 ctgcatcgac acctttggta taaggtgggg gtttgcccta aggcttatgg gaaacaaaga 40681 cacaccctca ctgtgttttc cagagggtcc ccgtgctttc cattcctgcc ctccaggatg 40741 gtctttagta tttgctaaag gttttaaggt cttaggttag tttctccact cagttcccac 40801 agagagtata gattcctcta atggtaaagt ttagcatcct cggtgatggg tgcccccgcc 40861 ctggggcttt ttatcagagc tttcctgctg attcttgttt gttttttctg caaaagcttc 40921 aaagtgttct ctacccactc agaaacagct gtcagtattt gccttttgag gtgtgggtgg 40981 cggtgtaccc accttggggg agatagcttt tctctgttat ctttttttct tgttttgttt 41041 ttttttgaga tggagtctcg ctctgtcgcc caggctggag tgcagtggcg caatcttggc 41101 tcactgcaac ctctgcctcc caggttcacg ggattctcct gcctcagcct ccccagtagc 41161 tgggactaca ggcatgcaac accatgcccg gggtaatttt tttttttttt tttgagacga 41221 agtctcgctc ttgtccccca ggcgggaatg cggtggcacg atcttggctc actgcaacct 41281 ctgcatcctg ggttcaagtg attctcctgc ctcagccccc caagtagctg ggattacagg 41341 cgcctgccac cacgcccaga taatttttgt atttttagta gagatgcggt tttaccatgt 41401 tggccaggct ggtctagaac tcctgacctc aggtgatcca cccgcctcag cctccaaagt 41461 gctgggatta caggcgtgag cgccgcgcct ggctaatttt tatattttta gtagagatgg 41521 gatttcacca cgttggccag gttggtctcg aactcctgac ctcaggtgat ccatccgcct 41581 cagcctctca aagtgctggg atttatgggc gtgagccacc atgccgggcc gagatggcct 41641 tactctgatg gacacctttt ggcttctatc caccactgtg gtgaaaagag ccagagcgag 41701 tgtgcagaga gagaaggatg atgcaggcac tttctagatg tttcttcata gatgaggagg 41761 agagggagac agaggagaca agaccccctg ggctcttgca agggttttgc cccaaattgc 41821 tgtcaacacc gatagactag tgttgacact agtctatcac tagagaggga gtctggggtg 41881 aaattggggg ctgatttgag atattttaga agatgctgga aatgtcaaac acgggagatg 41941 gagaatggca ctcaggggag ctgcgtggct aggagagaaa tgtaggagtc ggtgtggatt 42001 aagtgtgggt agagaaaggg ccccagaaga gaaggaggcc agccaggagg ccagagagac 42061 agagcaggag tgcagagtaa ataaagggct tcaagggcca ggcgcagtgg ctcacgcctg 42121 taatcccagc actttgggag gctgagtcag gtggatcacc tgaccctagg agtttgagat 42181 cagcctgggc aacacagtga gaccccatct ctacaaaaat tttaaaaatt agccggggcc 42241 aagcatggtg gctgatacct gtaatcctag cactttggga ggccaagatg ggtggatcac 42301 ctgaggtcag gtgttcaaga ccagcctggg ccaacattgt gaaacctcat ctctactaaa 42361 aaaaatacaa ctattagcca ggtgtggtgt acacgcctgt aatcccagct actcgggagg 42421 ctgaggcagg agaatcgctt gaacctggaa ggcggaggtt gcagtgagcc gagatctcac 42481 cactgtactc cagcctgggt gacagagcga gactctgtct cgaaaaaaaa aaaaaaagaa 42541 gaagaagggg aggggactct ctggggttag caccaagaga gccacatggg agctggacag 42601 gagtggtctt cgggatgggg ggaggaagat ggggcacagc tgggtttgag gaggacccag 42661 gaggatc // LOCUS AC003982 122302 bp DNA PRI 13-JAN-1998 DEFINITION Human PAC clone 166H1 from 12q, complete sequence. ACCESSION AC003982 NID g2769695 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 122302) AUTHORS Bradshaw,H, Wu,X and Ozersky,P. TITLE The sequence of H. sapiens PAC clone 166H1 JOURNAL Unpublished (1998) REFERENCE 2 (bases 1 to 122302) AUTHORS Waterston,R. TITLE Direct Submission JOURNAL Submitted (13-JAN-1998) Department of Genetics, Washington University, 4444 Forest Park Avenue, St. Louis, Missouri 63108, USA COMMENT SUBMITTED BY: Genome Sequencing Center Department of Genetics Washington University St. Louis MO 63108, USA http://genome.wustl.edu/gsc mailto:sapiens@watson.wustl.edu NOTICE: This sequence may not represent the entire insert of this clone. It may be shorter because we only sequence overlapping clone sections once, or longer because we provide a small overlap between neighboring data submissions. This sequence was finished as follows unless otherwise noted: all regions were double stranded or sequenced with an alternate chemistry; an attempt was made to resolve all sequencing problems, such as compressions and repeats; all regions were covered by sequence from more than one subclone; and the assembly was confirmed by restriction digest. MAPPING INFORMATION: This clone was originally isolated in the laboratory of Professor Graeme Bell, Howard Hughes Medical Institute and Departments of Biochemistry and Molecular Biology, and Medicine, The University of Chicago, Chicago, IL, USA. The clone was provided by the laboratory of Dr. Roger Cox at The Wellcome Trust Centre For Human Genetics, Oxford, UK. Some contig information was also obtained from Yamagata et al., Nature 384:455-8 (1996). SOURCE INFORMATION: Clone 166H1 is from the first release of the human BAC library CITB-978SK-B. The library contains cloned DNA from the male fibroblast cell line 978SK. See: Shizuya et al., Proc. Natl. Acad. Sci. USA 89:8794-7 (1992); U-J. Kim et al., Genomics 34:213-8 (1996). This clone is available from Research Genetics, Inc. (http://www.resgen.com). VECTOR: pBeloBAC11 Selection: chloramphenicol NEIGHBORING SEQUENCE INFORMATION: The clone sequenced to the left is 278C19; The clone sequenced to the right is 15E1. Actual start of this clone is at base position 1 of 166H1; actual end is at 122302 of 166H1. FEATURES Location/Qualifiers source 1..122302 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="12" /clone="166H1" /clone_lib="CITB-978SK-B" /map="12q" repeat_region 1..238 /rpt_family="Alu" repeat_region 239..265 /rpt_family="AT_rich" repeat_region 268..565 /rpt_family="Alu" repeat_region 1001..1309 /rpt_family="Alu" repeat_region 1440..1745 /rpt_family="Alu" repeat_region 1909..1940 /rpt_family="MER1_type" repeat_region 1955..1983 /rpt_family="AT_rich" repeat_region 2063..2362 /rpt_family="Alu" repeat_region 2516..2650 /rpt_family="Alu" repeat_region 2674..2730 /rpt_family="L1" repeat_region 2735..2788 /rpt_family="(GAAAA)n" repeat_region 2790..3077 /rpt_family="Alu" repeat_region 3081..3139 /rpt_family="L1" repeat_region 3716..4020 /rpt_family="Alu" repeat_region 4196..4492 /rpt_family="Alu" gene 4560..14034 /gene="WUGSC:H_166H1.1" CDS join(4560..5056,13418..13748,13882..14034) /gene="WUGSC:H_166H1.1" /note="unknown function; 60% similar to Z50177 (PID:g927403) (PID:g927402); H_166H1.1" /codon_start=1 /evidence=not_experimental /db_xref="PID:g2769696" /translation="MKMSFALTFRSAKGRWIANPSQPCSKASIGLFVPASPPLDPEKV KELQRFITLSKRLLVMTGAGISTESGIPDYRSEKVGLYARTDRRPIQHGDFVRSAPIR QRYWARNFVGWPQFSSHQPNPAHWALSTWEKLGKLYWLVTQNVDALHTKAGSRRLTEL HGCMDRAYCSVSVFLGSRVLCLDCGEQTPRGVLQERFQVLNPTWSAEAHGLAPDGDVF LSEEQVRSFQVPTCVQCGGHLKPDVVFFGDTVNPDKVDFVHKRVKEADSLLVVGSSLQ VYSGYRFILTAWEKKLPIAILNIGPTRSDDLACLKLNSRCGELLPLIDPC" repeat_region 5366..5655 /rpt_family="Alu" repeat_region 5672..5798 /rpt_family="Alu" repeat_region 5802..5834 /rpt_family="(TA)n" repeat_region 5834..5857 /rpt_family="POLY_A" repeat_region 5870..6184 /rpt_family="Alu" repeat_region 6224..6268 /rpt_family="(CA)n" repeat_region 6269..6558 /rpt_family="Alu" repeat_region 6729..7029 /rpt_family="Alu" repeat_region 7051..7172 /rpt_family="Alu" repeat_region 7192..7500 /rpt_family="Alu" repeat_region 7518..7820 /rpt_family="Alu" repeat_region 8207..8500 /rpt_family="Alu" repeat_region 8576..8878 /rpt_family="Alu" repeat_region 9023..9154 /rpt_family="Alu" repeat_region 9165..9470 /rpt_family="Alu" repeat_region 9887..10188 /rpt_family="Alu" repeat_region 10221..10342 /rpt_family="L1" repeat_region 10408..10707 /rpt_family="Alu" repeat_region 10713..11014 /rpt_family="Alu" repeat_region 11020..11110 /rpt_family="Alu" repeat_region 11146..11386 /rpt_family="Alu" repeat_region 11208..11386 /rpt_family="Alu" repeat_region 11391..11515 /rpt_family="Alu" repeat_region 11568..11705 /rpt_family="Alu" repeat_region 11737..12032 /rpt_family="Alu" repeat_region 12034..12066 /rpt_family="7SLRNA" repeat_region 12075..12372 /rpt_family="Alu" repeat_region 12448..12753 /rpt_family="Alu" repeat_region 12763..12874 /rpt_family="L2" repeat_region 12999..13303 /rpt_family="Alu" misc_feature complement(13674..13752) /gene="WUGSC:H_166H1.1" /note="match to EST N80020 (NID:g1242721) za91a08.s1" misc_feature complement(13840..14195) /note="match to EST N80020 (NID:g1242721) za91a08.s1" repeat_region 14196..14229 /rpt_family="AT_rich" repeat_region 14240..14339 /rpt_family="U6" repeat_region 14349..14646 /rpt_family="Alu" repeat_region 14652..14940 /rpt_family="Alu" repeat_region 14949..15238 /rpt_family="Alu" repeat_region 15807..15937 /rpt_family="Alu" repeat_region 15941..16234 /rpt_family="Alu" repeat_region 16236..16408 /rpt_family="Alu" repeat_region 16434..16730 /rpt_family="Alu" repeat_region 16745..16791 /rpt_family="(TA)n" repeat_region 16792..17018 /rpt_family="Alu" repeat_region 17062..17313 /rpt_family="Alu" repeat_region 17326..17352 /rpt_family="AT_rich" repeat_region 17353..17662 /rpt_family="Alu" repeat_region 17792..17891 /rpt_family="MIR" repeat_region 19143..19398 /rpt_family="Alu" repeat_region 19484..19516 /rpt_family="(TAAA)n" repeat_region 19759..19795 /rpt_family="AT_rich" repeat_region 19824..19931 /rpt_family="MIR" repeat_region 20113..20184 /rpt_family="L2" repeat_region 20194..20489 /rpt_family="Alu" repeat_region 20648..20928 /rpt_family="Alu" repeat_region 20992..21291 /rpt_family="Alu" repeat_region 21360..21595 /rpt_family="MIR" repeat_region 21600..21637 /rpt_family="(CAAAA)n" repeat_region 21775..21940 /rpt_family="Alu" repeat_region 22011..22311 /rpt_family="Alu" repeat_region 22421..22587 /rpt_family="MIR" repeat_region 22751..23051 /rpt_family="Alu" gene complement(23191..28751) /gene="WUGSC:H_166H1.3" CDS complement(join(23191..23315,25932..26059,26859..27018, 28718..28751)) /gene="WUGSC:H_166H1.3" /note="match to protein P04054 (PID:g129404) and EST T29344 (NID:g611442); H_166H1.3" /codon_start=1 /product="Phosphatidylcholine 2-acylhydrolase" /db_xref="PID:g2769697" /translation="MKLLVLAVLLTVAAADSGISPRAVWQFRKMIKCVIPGSDPFLEY NNYGCYCGLGGSGTPVDELDKCCQTHDNCYDQAKKLDSCKFLLDNPYTHTYSYSCSGS AITCSSKNKECEAFICNCDRNAAICFSKAPYNKAHKNLDTKKYCQS" repeat_region 23429..23581 /rpt_family="Other" repeat_region 23582..23883 /rpt_family="Alu" repeat_region 24262..24559 /rpt_family="Alu" repeat_region 24641..24669 /rpt_family="(CA)n" repeat_region 24939..25027 /rpt_family="L2" repeat_region 25028..25315 /rpt_family="Alu" repeat_region 25383..25685 /rpt_family="Alu" repeat_region 25698..25816 /rpt_family="MIR" misc_feature complement(25952..26068) /gene="WUGSC:H_166H1.3" /note="match to EST AA366418 (NID:g2018768)" misc_feature complement(25962..26068) /gene="WUGSC:H_166H1.3" /note="match to EST AA366498 (NID:g2018837)" repeat_region 26231..26497 /rpt_family="Alu" misc_feature complement(26857..27038) /gene="WUGSC:H_166H1.3" /note="match to EST AA366498 (NID:g2018837)" misc_feature complement(26857..27031) /gene="WUGSC:H_166H1.3" /note="match to EST AA366418 (NID:g2018768)" repeat_region 27208..27255 /rpt_family="(TA)n" repeat_region 27255..27284 /rpt_family="(GAA)n" repeat_region 27343..27468 /rpt_family="Alu" repeat_region 27497..27575 /rpt_family="L2" repeat_region 27577..27623 /rpt_family="(TAA)n" repeat_region 27624..27674 /rpt_family="L2" repeat_region 27687..27988 /rpt_family="Alu" repeat_region 28074..28311 /rpt_family="Alu" repeat_region 28314..28343 /rpt_family="(TAAAA)n" repeat_region 28355..28488 /rpt_family="Alu" misc_feature complement(28718..28749) /gene="WUGSC:H_166H1.3" /note="match to EST AA366498 (NID:g2018837)" repeat_region 29124..29325 /rpt_family="Alu" repeat_region 29422..29506 /rpt_family="MIR" repeat_region 29661..29951 /rpt_family="Alu" repeat_region 29972..30277 /rpt_family="Alu" repeat_region 30288..30481 /rpt_family="L1" repeat_region 30487..30788 /rpt_family="Alu" repeat_region 30903..31074 /rpt_family="Alu" repeat_region 31089..31376 /rpt_family="Alu" repeat_region 31377..31499 /rpt_family="Alu" repeat_region 31501..31556 /rpt_family="L1" repeat_region 31557..31589 /rpt_family="(CA)n" repeat_region 31593..31740 /rpt_family="Alu" repeat_region 31744..32040 /rpt_family="Alu" repeat_region 32042..32137 /rpt_family="L1" repeat_region 32169..32454 /rpt_family="Alu" repeat_region 32459..32604 /rpt_family="MER1_type" repeat_region 32949..33206 /rpt_family="Alu" repeat_region 33278..33360 /rpt_family="MIR" repeat_region 33876..34174 /rpt_family="Alu" repeat_region 34234..34444 /rpt_family="MIR" repeat_region 34796..34955 /rpt_family="Alu" repeat_region 35020..35323 /rpt_family="Alu" repeat_region 35328..35348 /rpt_family="AT_rich" repeat_region 35354..35648 /rpt_family="Alu" repeat_region 35696..35722 /rpt_family="POLY_A" repeat_region 35792..35818 /rpt_family="POLY_A" repeat_region 35819..35941 /rpt_family="MaLR" repeat_region 35947..36309 /rpt_family="MaLR" repeat_region 36310..36675 /rpt_family="Retroviral" repeat_region 36673..37312 /rpt_family="Retroviral" repeat_region 37313..37632 /rpt_family="MaLR" repeat_region 37633..37968 /rpt_family="MaLR" repeat_region 37969..38891 /rpt_family="MaLR" repeat_region 38890..38961 /rpt_family="MaLR" repeat_region 38962..39167 /rpt_family="MaLR" repeat_region 39184..39525 /rpt_family="MaLR" repeat_region 39624..39950 /rpt_family="L1" repeat_region 39952..40260 /rpt_family="Alu" repeat_region 40263..40420 /rpt_family="L1" repeat_region 40422..40457 /rpt_family="Alu" repeat_region 40565..40780 /rpt_family="MER21_gro" repeat_region 40781..41079 /rpt_family="Alu" repeat_region 41088..41381 /rpt_family="Alu" repeat_region 41428..41500 /rpt_family="MER21_gro" repeat_region 41507..41811 /rpt_family="Alu" repeat_region 41812..41883 /rpt_family="MER21_gro" repeat_region 41885..42182 /rpt_family="Alu" repeat_region 43649..43707 /rpt_family="POLY_A" repeat_region 44199..44521 /rpt_family="Alu" repeat_region 44525..44640 /rpt_family="L2" repeat_region 44857..45153 /rpt_family="Alu" repeat_region 45165..45331 /rpt_family="L1" repeat_region 45419..45720 /rpt_family="Alu" repeat_region 45724..46031 /rpt_family="Alu" repeat_region 46073..46286 /rpt_family="Alu" repeat_region 46314..46334 /rpt_family="AT_rich" gene complement(46589..70089) /gene="WUGSC:H_166H1.2" CDS complement(join(46589..46630,47133..47320,48444..48512, 52342..52398,54297..54377,57900..58017,58814..58896, 60003..60051,64041..64133,65712..65753,69006..69090, 69190..69271,69831..69871,70031..70089)) /gene="WUGSC:H_166H1.2" /note="similar to murine RNA-binding protein; 99% similar to D49654 (PID:g1434857); H_166H1.2" /codon_start=1 /evidence=not_experimental /db_xref="PID:g2769698" /translation="METDAPQPGLASPDSPHDPCKMFIGGLSWQTTQEGLREYFGQFG EVKECLVMRDPLTKRSRGFGFVTFMDQAGVDKVLAQSRHELDSKTIDPKVAFPRRAQP KMVTRTKKIFVGGLSVNTTVEDVKQYFEQFGKVDDAMLMFDKTTNRHRGFGFVTFESE DIVEKVCEIHFHEINNKMVECKKAQPKEVMSPTGSARGRSRVMPYGMDAFMLGIGMLG YPGFQATTYASRSYTGLAPGYTYQFPEFRVERTPLPSAPVLPELTAIPLTAYGPMAAA AAAAAVVRGTGSHPWTMAPPPGSTPSRTGGFLGTTSPGPMAELYGAANQDSGVSSYIS AASPAPSTGFGHSLGGPLIATAFTNGYH" repeat_region 46866..46977 /rpt_family="L2" repeat_region 47694..47745 /rpt_family="L2" repeat_region 47943..48247 /rpt_family="Alu" repeat_region 48248..48338 /rpt_family="L2" repeat_region 49420..49543 /rpt_family="L2" repeat_region 49545..49578 /rpt_family="AT_rich" repeat_region 49600..49911 /rpt_family="Alu" repeat_region 49920..50036 /rpt_family="Alu" repeat_region 50094..50395 /rpt_family="Alu" repeat_region 50396..50531 /rpt_family="Alu" repeat_region 50532..50670 /rpt_family="L2" repeat_region 51051..51323 /rpt_family="Alu" repeat_region 52030..52115 /rpt_family="L2" repeat_region 52519..52607 /rpt_family="(GA)n" repeat_region 52668..52704 /rpt_family="(GAAAA)n" repeat_region 52706..53005 /rpt_family="Alu" repeat_region 53007..53034 /rpt_family="AT_rich" repeat_region 53035..53302 /rpt_family="Alu" repeat_region 53447..53580 /rpt_family="MIR" repeat_region 54515..54627 /rpt_family="MIR" repeat_region 54652..54897 /rpt_family="L2" repeat_region 54979..55272 /rpt_family="Alu" repeat_region 55423..55723 /rpt_family="Alu" repeat_region 55820..55906 /rpt_family="L1" repeat_region 55910..56043 /rpt_family="Alu" repeat_region 56054..56121 /rpt_family="L1" repeat_region 56122..56420 /rpt_family="Alu" repeat_region 56421..56565 /rpt_family="L1" repeat_region 56584..56624 /rpt_family="(CATA)n" repeat_region 56626..56768 /rpt_family="L1" repeat_region 57294..57337 /rpt_family="L2" repeat_region 57590..57680 /rpt_family="(TGGA)n" repeat_region 58136..58308 /rpt_family="MIR" repeat_region 58499..58802 /rpt_family="Alu" repeat_region 59034..59166 /rpt_family="MIR" repeat_region 59328..59434 /rpt_family="(CA)n" repeat_region 60252..60368 /rpt_family="MIR" repeat_region 60989..61282 /rpt_family="Alu" repeat_region 61642..61800 /rpt_family="MIR" repeat_region 61823..61951 /rpt_family="Alu" repeat_region 61954..62041 /rpt_family="MIR" repeat_region 62216..62243 /rpt_family="(TGAA)n" repeat_region 62692..62746 /rpt_family="GC_rich" repeat_region 65908..66208 /rpt_family="Alu" repeat_region 66236..66429 /rpt_family="MIR" repeat_region 66782..66923 /rpt_family="MIR" misc_feature 68984..70554 /note="CpG_island (%GC=77.2, o/e=1.00, #CpGs=204)" repeat_region 69319..69436 /rpt_family="(CGG)n" repeat_region 69695..69813 /rpt_family="(CGGG)n" repeat_region 70090..70183 /rpt_family="(CGG)n" repeat_region 70203..70483 /rpt_family="GC_rich" repeat_region 73216..73278 /rpt_family="L2" repeat_region 73321..73391 /rpt_family="(TGAA)n" repeat_region 73495..73782 /rpt_family="Alu" repeat_region 74213..74510 /rpt_family="Alu" repeat_region 74657..74683 /rpt_family="(TA)n" repeat_region 74689..74995 /rpt_family="Alu" repeat_region 75017..75369 /rpt_family="L1" repeat_region 75370..75663 /rpt_family="Alu" repeat_region 75665..75785 /rpt_family="L1" repeat_region 75812..76128 /rpt_family="Alu" repeat_region 76132..76309 /rpt_family="L1" repeat_region 76618..76894 /rpt_family="Alu" repeat_region 76906..77077 /rpt_family="MIR" repeat_region 77296..77597 /rpt_family="Alu" repeat_region 77726..77826 /rpt_family="Alu" repeat_region 77847..77970 /rpt_family="MIR" repeat_region 78646..78703 /rpt_family="MIR" repeat_region 79300..79445 /rpt_family="L1" repeat_region 79456..79735 /rpt_family="Alu" repeat_region 79762..80055 /rpt_family="Alu" repeat_region 80095..80151 /rpt_family="Alu" repeat_region 80160..80184 /rpt_family="AT_rich" repeat_region 80210..80435 /rpt_family="MER1_type" repeat_region 80449..80750 /rpt_family="Alu" repeat_region 80757..80807 /rpt_family="MER1_type" repeat_region 80820..81101 /rpt_family="Alu" repeat_region 81122..81166 /rpt_family="MER1_type" repeat_region 81175..81311 /rpt_family="L1" repeat_region 81324..81626 /rpt_family="Alu" repeat_region 81656..81952 /rpt_family="Alu" repeat_region 82067..82362 /rpt_family="Alu" repeat_region 82578..82814 /rpt_family="MIR" repeat_region 82840..82879 /rpt_family="(TAAA)n" repeat_region 83134..83173 /rpt_family="MIR" repeat_region 83618..83638 /rpt_family="AT_rich" repeat_region 83646..83943 /rpt_family="Alu" repeat_region 84006..84245 /rpt_family="Alu" repeat_region 84385..84430 /rpt_family="(TAAA)n" repeat_region 84461..84509 /rpt_family="L2" repeat_region 84566..84859 /rpt_family="Alu" repeat_region 84860..85029 /rpt_family="Alu" repeat_region 85043..85323 /rpt_family="Alu" repeat_region 85362..85652 /rpt_family="Alu" repeat_region 85964..86189 /rpt_family="Alu" repeat_region 86310..86611 /rpt_family="Alu" repeat_region 86612..86747 /rpt_family="Alu" repeat_region 86957..87032 /rpt_family="L1" repeat_region 87151..87450 /rpt_family="Alu" repeat_region 87500..87799 /rpt_family="Alu" repeat_region 87808..88123 /rpt_family="Alu" repeat_region 88131..88686 /rpt_family="L1" repeat_region 88711..88993 /rpt_family="Alu" repeat_region 89000..89256 /rpt_family="L1" repeat_region 89873..89930 /rpt_family="MIR" repeat_region 90579..90715 /rpt_family="(GGAA)n" repeat_region 90736..90934 /rpt_family="(GGAA)n" repeat_region 90938..91179 /rpt_family="Alu" repeat_region 91201..91508 /rpt_family="Alu" repeat_region 91523..91664 /rpt_family="(GGAA)n" repeat_region 91712..91850 /rpt_family="(GGAA)n" repeat_region 91873..92104 /rpt_family="L2" repeat_region 92179..92247 /rpt_family="L2" repeat_region 92338..92415 /rpt_family="MIR" repeat_region 92416..92436 /rpt_family="AT_rich" repeat_region 92437..92592 /rpt_family="Alu" repeat_region 92593..92898 /rpt_family="Alu" repeat_region 92899..93187 /rpt_family="Alu" repeat_region 93189..93327 /rpt_family="Alu" repeat_region 93401..93604 /rpt_family="Alu" repeat_region 93607..93906 /rpt_family="Alu" repeat_region 93915..93949 /rpt_family="Alu" repeat_region 93983..94281 /rpt_family="Alu" repeat_region 94483..94605 /rpt_family="MIR" repeat_region 94841..95011 /rpt_family="Alu" repeat_region 95015..95308 /rpt_family="Alu" repeat_region 96087..96214 /rpt_family="MIR" repeat_region 96319..96399 /rpt_family="Alu" repeat_region 96443..96492 /rpt_family="(CAGG)n" repeat_region 96498..96793 /rpt_family="Alu" repeat_region 96794..96895 /rpt_family="MIR" repeat_region 97086..97294 /rpt_family="L1" repeat_region 97308..97608 /rpt_family="Alu" repeat_region 97612..97794 /rpt_family="L1" repeat_region 97795..98095 /rpt_family="Alu" repeat_region 98096..98215 /rpt_family="L1" repeat_region 98240..98374 /rpt_family="Alu" repeat_region 98383..98416 /rpt_family="AT_rich" repeat_region 98453..98495 /rpt_family="(CAAA)n" repeat_region 98566..98760 /rpt_family="Other" repeat_region 99332..99378 /rpt_family="L2" repeat_region 99552..99753 /rpt_family="MIR" repeat_region 99993..100294 /rpt_family="Alu" repeat_region 100466..100789 /rpt_family="L1" repeat_region 100963..101014 /rpt_family="L2" repeat_region 101043..101153 /rpt_family="MIR" repeat_region 101154..101448 /rpt_family="Alu" repeat_region 101468..101748 /rpt_family="Alu" repeat_region 101949..102053 /rpt_family="MIR" repeat_region 102132..102425 /rpt_family="Alu" repeat_region 102429..102589 /rpt_family="L1" repeat_region 102592..102897 /rpt_family="Alu" repeat_region 102899..102989 /rpt_family="L1" repeat_region 103616..103712 /rpt_family="MIR" repeat_region 103910..104200 /rpt_family="Alu" repeat_region 104214..104291 /rpt_family="MIR" repeat_region 104297..104595 /rpt_family="Alu" repeat_region 104994..105083 /rpt_family="MIR" repeat_region 105096..105231 /rpt_family="Alu" repeat_region 105808..105845 /rpt_family="(TA)n" repeat_region 105846..106119 /rpt_family="Alu" repeat_region 106143..106441 /rpt_family="Alu" repeat_region 106456..106579 /rpt_family="Alu" repeat_region 106612..106741 /rpt_family="Alu" repeat_region 106813..106953 /rpt_family="(TA)n" repeat_region 107636..107761 /rpt_family="MIR" repeat_region 107916..108247 /rpt_family="Alu" repeat_region 108430..108564 /rpt_family="Alu" repeat_region 108568..108591 /rpt_family="AT_rich" repeat_region 108595..108742 /rpt_family="MIR" repeat_region 108896..109187 /rpt_family="Alu" repeat_region 109201..109372 /rpt_family="Alu" repeat_region 109380..109668 /rpt_family="Alu" repeat_region 110075..110373 /rpt_family="Alu" repeat_region 110377..110404 /rpt_family="(GAAAA)n" repeat_region 110464..110604 /rpt_family="MIR" repeat_region 110615..110914 /rpt_family="Alu" repeat_region 110923..111209 /rpt_family="Alu" repeat_region 111214..111245 /rpt_family="AT_rich" repeat_region 111367..111576 /rpt_family="Alu" repeat_region 111608..111644 /rpt_family="AT_rich" repeat_region 111660..111961 /rpt_family="Alu" repeat_region 112167..112187 /rpt_family="AT_rich" repeat_region 112195..112325 /rpt_family="Alu" repeat_region 112420..112717 /rpt_family="Alu" repeat_region 112748..113047 /rpt_family="Alu" repeat_region 113075..113163 /rpt_family="L1" repeat_region 113310..113641 /rpt_family="MER2_type" repeat_region 113696..113948 /rpt_family="Alu" repeat_region 114330..114371 /rpt_family="AT_rich" repeat_region 114403..114695 /rpt_family="Alu" repeat_region 115297..115420 /rpt_family="(CA)n" repeat_region 115420..115496 /rpt_family="(GA)n" repeat_region 115501..115670 /rpt_family="MIR" repeat_region 115704..116002 /rpt_family="Alu" repeat_region 116023..116108 /rpt_family="L1" repeat_region 116140..116442 /rpt_family="Alu" repeat_region 116475..116777 /rpt_family="Alu" repeat_region 117440..117735 /rpt_family="Alu" repeat_region 117969..118119 /rpt_family="MIR" repeat_region 118137..118409 /rpt_family="Alu" repeat_region 118413..118540 /rpt_family="Alu" repeat_region 118603..118899 /rpt_family="Alu" repeat_region 118967..119146 /rpt_family="MIR" repeat_region 119162..119333 /rpt_family="MER1_type" repeat_region 119697..119831 /rpt_family="Alu" repeat_region 120023..120311 /rpt_family="Alu" repeat_region 120434..120500 /rpt_family="L1" repeat_region 120501..120688 /rpt_family="Alu" repeat_region 120982..121164 /rpt_family="Alu" repeat_region 121168..121301 /rpt_family="L1" repeat_region 121336..121640 /rpt_family="Alu" repeat_region 121642..121922 /rpt_family="Alu" repeat_region 121950..122239 /rpt_family="Alu" repeat_region 122243..122302 /rpt_family="Alu" BASE COUNT 31712 a 30233 c 29632 g 30725 t ORIGIN 1 gatcacgagg tcaagagata gagaccatcc tggctaacac ggtgaaaccc cgtctctact 61 aaaaaaaaaa tacaaaaaat tagccgggtg tggtggcggg cgcctgtagt ccggaggctg 121 aggcaggaga atcgcgtgaa cccgggaggc ggagcttgta gtgggccgag atctctccac 181 tacactccag cctgggcgac agagcgagac tccgtctcaa aaataaataa ataaataaat 241 aaatacaata ataataataa taaaagtggc ggggtgcagt ggctcactcc tgtaatccta 301 gcattttagg agactgaggc aggtgggtca cctgaggtca cgagttcaaa accagcctgg 361 ccaacatgac gaaaccccat ctctactaaa aatacaaaaa ttagccgggc gcagtggcga 421 gcgcctgtaa tcctagctac tccggaggcc gaggcagcag aatcacttca gccgggaagg 481 cagaggcttc agtgagtgaa gatcgcgcca ctgcactcca gcctgggcaa cagagctaga 541 ctctctcaaa aaaaaaaaaa aaaaattaaa gttagtttta cttacaagtc ttactgagga 601 ttacagaccg agggctatag tgtgggagca gttccatcag actggtccaa cacagaattt 661 cagccactca tatacaagtg gtgagtgtac ggcgcctgca aaatcacatt aaacttgctt 721 caaagttaca ttaaagcaca atcatatcaa agtttgggtg caagagtata tctggttatg 781 gattacagag gtatcatcac taaccccatc agacattatc ttgtgcagga aaaggcaatg 841 accagagtca tttatctttt aatgaatata gtgacttgga agagacatgg agggctgtgt 901 gctctattct gttttgtctt caaagcatct ttctggagag ctgtatgttg tcacagagtc 961 aggggcttca tgaaacgatg ctgccaagtc gaaataagca ggcccggcat ggtggctcac 1021 acctgtaatc ccagcacttt gggaggccaa agcaagagct ttccttgagt tcaggagttt 1081 gagaccagcc tgagcaacat agtgagaccg tgtctcgaca aaaataaaaa ataacaaaat 1141 ttagctgggc gtggtggtgg atgcctgtga tctcagctac tcgagaggat gaggttggag 1201 gatcccttgg gcctgggaag tagaaactgc agtgagctgt gatgtcacaa ctgcactcca 1261 gcctgggtga cagaacgaga ccctgtctca aaaaaaagaa aaaaaaaaag cagaaatggg 1321 caaactacta aatggtcatt ttgccaatat tacctccaaa gatacaatga tagaaccaaa 1381 aacctaaaaa ctctttctca agcttccaca ttacttctct gcacttcagt tcatctttat 1441 ttattattat tattttttga agagatggga tctcactttg tcacccaggc tgaagtgcag 1501 tcgcacaatt atggctcact gcagcctcga tctcctgggc tcaaagtgat cctcccacct 1561 gagcctcttg agtagctggg actacaggca cacaccacca caccccgcta attttgttta 1621 ttttttgtag tgagtgggtc tcactatgtt gcccaagctg gtcttgaact tttgggctca 1681 agtgatcctc ctgccttggc ctcccaaagt gttggtatta caggtgggaa ccactgcact 1741 cagcccaatt catcttaaaa tgggggaagg gagtccttag gtggtcttca aggtccctgc 1801 tggccgccca cctttatgtc caaagacact tatacattta ttgatttagg tgatgctgca 1861 gagcggcagt tctggttgtc tacacattag gaacaattag gaaatttgta gatcagaatc 1921 cctgggagtg agacgcaggc cttgtttgat taacttaata aaaaaaatat gataaataaa 1981 attggagaag ccaagcaggt taagtacatt agtccaggta taatatattt ggcttcatat 2041 taactgtaaa aacatcactt ttggctgggc acagtggctt acgcctgtaa tcccagtact 2101 ttgggaggct agggtgggca gatcacgagg tcaggagatc aagaccatcc tggctaacat 2161 ggtgaaacct tgtctctact aaaaataaaa aaattaaccg ggcgtagtgg catgcgccca 2221 taatccctgc tactcaggag gctgaggcag gagaattgct tgaacccggg aggcggaggt 2281 tgcagtgagc cgaaatggca ccattgcact ccagcctggg caacagagcg agactccgtc 2341 tcaaaaacaa acaaacaaac aacaacaaca acaacaacaa actatcactt tttcagatat 2401 gtataaacac atttcttcac aagttcaagg tattttacca atagcaaata acttgatact 2461 gtcgttgtga atgaaaagga tcacaaactc tgatgtttag aatgagtcaa atctaggccg 2521 ggaacagtgg ctcaggcctg taatcccagc attttggggg gccgaggttg gcggatcacc 2581 tgaggtcggg agttcgagac cagcctggcc aacatggtga aactctgtct ctactaaaaa 2641 tacaaaaact ttaaaaaaaa gtcaaatcta gatccatgat taatgacgtg ctcatttcac 2701 attgcatgtt tgtatcaaaa catctcatgt gggtaaagca tagacaggca cagagaaaag 2761 aaaaaaagac aaaaaataag agaaaagatg gccgggcgca gtggctcacg cctataatcc 2821 cagcactttg ggaggtcaag gcgtgcggat cacctgaggt taggagttca agacgagcct 2881 ggccaacatg gcgaaacccc cgtctctact aaaaatacaa aaataatccc agctactcgg 2941 gaggctgagg caggagaatc ggttgaaccc gggagaggga ggttgcagtg agccgagatc 3001 gtgccactgc actccagact ggagtttttt caaaaaaaaa aaacaacaac tccgtcttaa 3061 aaaaaaaaaa gaaaaaagaa aaaaaatctt atgtacacta taaatatata cacctactaa 3121 gtgcccacag aaattgaaaa caaagaaaaa aatccatgat taaaagcttt tatattaacg 3181 ggggaacctc tgaccagaaa gagccgacag ggggcgttcc tggatgatct ggggctgtca 3241 attgcgagac gcggaggatt gtgggacttg tggtttttct cgcatcattt aagctttctg 3301 gactacatgt cccaaaatgc aaatgcaatc agacggtccc actgtggggt gtgaagtgtc 3361 cgtagagctg tgagaggtaa gtgtgttctg tgtgtgctgc ggatttctag aagttgtgga 3421 tttcttgtga gggattctat ttctcctctt tgccctcggt gggacctgga agagagtagt 3481 tgttccactg atgcacgttt tcagcggggg tttcccctga ccctttggtc tcgggaaacc 3541 ggactgtgtg tttgttcagc tgttcacgtt ttgctgtgga atgtcaagga gaaagcgggg 3601 tcctgatctc agatctggaa agccaaacat aaaataatgt ttacatacgg ggtgcatgtc 3661 actagaggat ttatgcattg cattgtgtca aataatatgt caaagttttt tctttttctt 3721 tttttttttt tttttgatac ggagtcttgc ttcttcgccc aggctggagt gcaacggcgc 3781 gatctcggct cactgcaacc tccgcctccc gggttcaagc aattctcctg cctcagcctc 3841 cctagtagct gggattacgg gcacgtgcca caatgcccgg ctaatttttt ttgtattttt 3901 agtagagtcg gggtttcacc atgttggcca ggcatgtctc gaactcctga tctcaagtga 3961 tctgcctgcg tcggcctccc aaagtgctgg gatttcagac gtgagccacc gcacccggcc 4021 ctctattgtt acagctagaa aagctgacgg gcaggggtct tgtctctctg atgtgtccaa 4081 gtagcagagc ttgcagtgga gggtaaacta aaataagggg gaaatgaaaa aataaaagga 4141 atgcgtgtgt gtgcttaatt acatgaacaa aatcaaacaa aacctgactc attagggcca 4201 gactcagagg ctcacacctg taattccagc acgttgggag gccgaggcgg gcagatcact 4261 agagctcagg agttcgagac tagcctggcc aacatggtga aaccctgcct ctactaaaaa 4321 tacaataatt agccgggcat ggtggccatg tagtcccagc tattctggag gctgaggcag 4381 gagaatcact tgaacccggg aggcggcggt tacagtgagc caagatcatg ccactgtact 4441 ctagcctggg caacagaggg agactctgtc tcaaaaaaca aaacgaaaac aaacaaacaa 4501 aaaatgagaa aaaaaaaaaa ccaccccagt ttctcacatg aggcttcttt ttctctagaa 4561 tgaagatgag ctttgcgttg actttcaggt cagcaaaagg ccgttggatc gcaaacccca 4621 gccagccgtg ctcgaaagcc tccattgggt tatttgtgcc agcaagtcct cctctggacc 4681 ctgagaaggt caaagagtta cagcgcttca tcaccctttc caagagactc cttgtgatga 4741 ctggggcagg aatctccacc gaatcgggga taccagacta caggtcagaa aaagtggggc 4801 tttatgcccg cactgaccgc aggcccatcc agcatggtga ttttgtccgg agtgccccaa 4861 tccgccagcg gtactgggcg agaaacttcg taggctggcc tcaattctcc tcccaccagc 4921 ctaaccctgc acactgggct ttgagcacct gggagaaact cggaaagctg tactggttgg 4981 tgacccaaaa tgtggatgct ttgcacacca aggcggggag tcggcgcctg acagagctcc 5041 acggatgcat ggacaggtgc aggagctgta cacggtttga acgcaaaccc gtgtgttgtt 5101 ttgggagagt agggaccttg gctgtcttga tcaccacagt cttagacctt gagaaaagga 5161 tttcagagca agttgtttat tttgcagttg attccagtaa attcattgac ggagcaagga 5221 tgtaaaacag gaaagagagg aaagccaaga aagtgtgtgt taatgagtgg gttactgcct 5281 tgggcactgg ggttcagtcc tgtgagggac cctccaactg tctaggacat gcctgaggat 5341 tgtctcaatg aaacttgagg actggggccg ggcgcagtgg ctcacgcctg taatcccagc 5401 attttgggag gctgaggtgg gcagatcaca tgagctcagg agttcgagac cagcctggcc 5461 aacatggtga aaccccattt ctactaaaaa tacaaaaatt agccagtcgt ggtggcacat 5521 gcctgtaatc ccagctgctt gggaggctga ggcaggaaaa tctcttggga ggcagaggtt 5581 gctgcagtga gccgagattg caccactaca ctccagcctg ggcaacagtg caaaaaaaaa 5641 gaaaaaaaga aaaaaaaaag ttgaggatgg tgcacagtgg ctcatacctg taatcctagc 5701 actttggaag gctgaggcag gaagattgct tgaggctaac agttcaggac caacctggca 5761 aacacagtga gactctgtct ctatataagt aaagtaaatt ttatatatat atatatatat 5821 atatatatat atattttttt tttttttttt tttttttaaa gaaagttgag gccgggtgtg 5881 gtggctcacg cctgtaatcc cagcactttg ggaggccaag gcaggcagat tacctgaggt 5941 caggagttca agaccaatct ggtcaacatg gtgaaacccc gtctctacta aaaatacaaa 6001 aatctgttgg aagtggtggc acgcacctgt aattccagct actcaggagg ctgaggcagg 6061 agaatcgctt gaacctggga ggtagaggtt gcagtgagct gagatcatga gctgagatca 6121 tgccaccaca ccccagcctg ggcaacagag caaaactccg tctcaaaaag aaaaaaaaaa 6181 aagaaagttg agcaagctag ggtgattttc ttttattttc ttttgtgttt gtgtgtgcac 6241 gtgtgtgtgt gtgtgtgtgt gtgtgtatga caggctttca ctctgtcaca cagactggga 6301 tgtggtggtg tgattatagc tcacttcagc ctcaatctct tgggctcaag tgatcctttc 6361 accccagcct cctaagtagc tgggactaca ggcaaacgcc accgtgccca gctaattttt 6421 aatttaattt ttattttttg tagagacgaa gtcttactat gttgcccagg ctggtctcga 6481 actcctgggc tcatgtgatc ctcctgcctt ggcatctcaa agtgctggga tgactagtgt 6541 gagccactgt gcttggcctt gaagctaggg tatttattca ccatctctca tccctcatgg 6601 gtccaggatc actctagggg aattaattcc ctaacatttt gactctgctc ctcccaagga 6661 ggcttccgtg ggcagagaac atctgaggca agagagacac aggaaactgc ttgtaagaaa 6721 attgtgtagg ctgggcgcgg cggctcacgc ctgtaatccc aacactttgg gaggctgagg 6781 caggcggatc acgaggtcag ttgatcgaga ccatcctggc taacatggtg aaaccccgtc 6841 tctactaaaa atacaaaaaa ttagctgggc gcggtggcgg acgcctgtag tcccagctac 6901 tcgggaggct gaggcaggag aatggcgtga acctgggagg tggagcttgt ggtgagccga 6961 gatcgcgcca ctgcactcca gcctgggtga cagagcgaga ctccgtctca aaaaaaaaaa 7021 aaaaagaaag aaagaaaatt gtgtccacca gggtgctgtg gttcatatct gtcatcccag 7081 caactcagga ggctgaagca ggaggattac ttgaggccag gagatcaaga cctgcctggg 7141 taacacagcc agactccgtc tctacaaaaa atttttttaa gaaaggaaat tggctgggca 7201 cggtggctcg cgcctgtaat cccagcactt tgggaggctg agttgggcgg atcacctgag 7261 gtcaggagtt cgagaccagc ctgaccaaca tggagaaacc ctgtctctac taaaaataca 7321 aaattagctg ggcgtggtgg tgcatgcctg taatcctagc gactcgggag gcttagacag 7381 gagaatagct tgaacccaag aggcggaggt tgcggtgagc cgagattgtg ccattgtact 7441 ccagcctgca gcctgggcaa caagaacaaa actccgtcaa aaaaaaaaac aaacaaaaaa 7501 acagagaaag gaaactggga cgggtgcggt ggctcatgct tgtaatccca gcactttggg 7561 aggcagaggc aggcagatca cttgaggtca ggagttcgag accatcctgg cccacacggt 7621 gaaaccccat ctctactaaa aatacaaaaa attacctggg catagtggcg cacgcctgta 7681 gtctcagcta tttgggaggc tgaggcacga gagtcgcctg aacctgggag gcagaggttg 7741 cagtgagcca agattgtgcc actgcactcc agcctggaca acagagtgag actctatctc 7801 aataaaaaaa caaaaagaaa ggaaactgtt cgaacgtggc ctttagcatg ttctgaggaa 7861 aatatgagct gaacaccaat agcatgtgct ataaactgct gggtttccaa tgcctaggat 7921 ggctagatac atagaccata acgtagccgg gcagtagtca gaactcaatc aataatgagg 7981 gaacagatac atgggagggt atcatgctta tgggtctggc aataattgta gctaattgga 8041 gacttatttc tttaagccct agagaggctc ctctgagggc tgaagtcagt ctgccaatgc 8101 tgaaatattt actgggtgcc tagaatgtat ggtgtacata cgttacacta tgtagtacat 8161 tataacacta actactgtgc ttcttagtgg tcagaaccaa agtataggct gggctcagtg 8221 gctcacacct ataatcccag cattttgaga ggctgagggg gcggatcact tgagctcagg 8281 agtttgagac cagcctggcc aatatgtcga aacccagtct ctactaaaaa taaaagattg 8341 gccaggcgtg gtggcgggcg cctatagtcc cagctactcg ggaggctgag gcaggagaat 8401 tgcatgaaca tgggtggcag aggttgcagt gagccaagat cgtgccattg cactccagcc 8461 tgggcgacag agggagattg tctcaaacaa caaacaaaaa ccaaagtata aaacatggcc 8521 cactttatat gcaagaactt tgtaatattt gggaagacac aaaaataaca taattggccg 8581 ggcacggtgg ctcacgcctg taatccgagc actttgggag gccgaggcag gggatcacct 8641 gaggtcagga gttcaagacc agcctgacca acacagtgaa accctgtctc tactaaaaat 8701 acaaaattag ccaggcatgg tggtgcatgc ctataatccc agctactcag gaggctgaga 8761 caggagaatc atttgaaccc gggaggcaga ggttgtaatg agctgaaatc gcaccattgc 8821 actccagcct gggcaacaag aacgaaactc cttctcaaaa aacaaccaag caaacaaaca 8881 aagaaaagca cataattaat gataaatgac aacgcatgat tgtgaaattg tgtggttctt 8941 gatttatatg ctgtgcatgt tcaaataaac agatttagta aagattggaa taatcagttg 9001 agaaagcttt ttaaaaaaaa aatttttttt tttaaataga gacaggatct tgctgtgttg 9061 cctagtctgg tctcgaactc ctaagctcaa gccctcctcc tgccttgatc ttccaaagtg 9121 ttgggattac aggtgtgagc caccatgcct ggcccaagaa agcttttttt tttttttttt 9181 ttttgaaaca cagtcttgtt cttgttgccc aggctggagt gcagtggcac ggtctcggct 9241 cactgcaacc tccgcctcct gggttcaagt gattctcctg cctcagcctc ccaagtagtt 9301 gggattacag gtgcctgcca ccacccccag ctaatttttt gtatttgtag tagaggtagg 9361 gtttcatcat gttggccagg ctggtctcga actcactcct gacctcgtga tccgcctgcc 9421 tcggcctccc aaagcgctgg gattacaggc gtgagccaca gcgcctgccc caagaaagct 9481 tttatgggta cagggggctt gagattggct caggtttgtc taggcaagag cgggaaacag 9541 ggctttccaa atgaagggaa gttgtagatc ccagaggagt ctggcctggc ctctgtagag 9601 gtttggacct gggacttgca gaaaccagat tgtctgggaa agttaggccc aaatcatggt 9661 tacatttgaa aatcaggcag tgcagttggg tctgctgtga gacaccaaag gggtttatta 9721 gtggtttctg agcaagcaag caacatgata aaaatagcat ttgaggaagg ctaaacagga 9781 ttaaaaagag caggtcaggg aggtcagcct gaaagaccta gactaaggtg gtatccaggg 9841 aatttgagtt aatgatgctt atggattcat taaaacaatt ttttccgccg ggcgtggtgg 9901 ctcatgcctg tcatctcagc actttgggat gccgaagcgg gtggataacc tgaggtcggg 9961 agtcaagacc agcctgacca acatggagaa accctgtcac tactaaaagt acaagaacaa 10021 caaaaaatta gccgggcatg gtggcacatg cctgtaatcc cagctactcg ggaggctgag 10081 gcaggagaat cgcttgaacc tgggaggcgg aggttgcggt gagccaagac tgttgtgcca 10141 ttgccctcca gcctgggcga caagagtgaa actctgtctc aaaaaaaact cacaattttt 10201 ttcatgaaga aaatctgttc ttttaataat gaaaatttca aacttcacaa aagtaaagag 10261 aattgtataa tgaattctca tataccatca cccagattcc aaggttttat caagattttg 10321 tcatgtctac tttatccatt ctgtctcttc ttaagcattt taaaacaatc caggacttct 10381 tgacatttca caggggcctt tttttctttt cttttttttt tttttttgag acagagtctc 10441 actctgtcgc ctaggctgga gagcagtggt gagatcttgg ctcactgcaa cctccacctc 10501 ccgggttcaa ccgattctcc tgcctcagcc tcctgagtag ctgggattac aggcatgcac 10561 caccatgccc ggctaatttt tgtatttttg gtagagactg ggtttcaccg tgttgggcca 10621 ggctggtctc aaactcctga cctcaggtga tctgcctgcc ttggcctccc aaagtgctgg 10681 gattacaggc atgagccact gtgcccgctt tctttttttt tttttttttt tgagatgaag 10741 tcttgctctg tcacctaggc tggagtacaa tggcatgatc ttggctcact gtaacctcta 10801 catcctgggt tcaagcaatt ctcctgtctc aacctcctga gtagctggaa ttacaggtgt 10861 gcaccaccac acccggctaa tttttttgta tttttgtaga gacagggtat caccatgttg 10921 gccaggcggg tctcaaactc ctgacctcaa ctgatccacc catcttggcc tcccaaattg 10981 ctgggattac aggtgtgagt caccgtgccc ggcctggcct ttttttcttt tttagacagg 11041 gtctggttct gacacccagg ctagagtgca atggtgccat cataactccc tgtaacctag 11101 aactcctggg taattagaac tccagctaat taaaaaattt tttttggcta ggcgtggtgg 11161 ctcacgtctg tagtctcagc actttgggag gccgaggtgg gcagatcaca aaaattagct 11221 gggcgtggtg gcaggcgcct gtaatcccag ctgctcggga ggctgaggca ggagaatcgc 11281 ttaagcccag gaggcggagg ttgcagtgag ccgagatcgc gccattgcac tccagcctgg 11341 gcgacagagc aagaccccgt ctcaaaaaac aaagacaagc caaaaaaatt ttttttgtag 11401 agagagcgtc ttgctgtcta cccagactgg tctggaactc ctgagctcaa gcgattctct 11461 tgctttacgc tcccaaagtg ctgggattac aaggtgtgag ccaccatgcc tggccacagg 11521 ggttctttta agagaagaaa atcaacataa tctgatggct gactagaact aaaaggagaa 11581 aaatgaggct ggttgtggtg gctcacacct gtagtcccag cactttggaa gactaaggca 11641 ggaggattgc ttgagcctag gagttcgagg ctgcagtgag ctatgatcat gccactgcac 11701 tccaggtaag agccaccacg cctggccaag accctgtctc tttttttttt ttgagacgga 11761 gtctcgctcg ttgcccaggc tggagtgcag tggtgctatc tcggctcact gcaagctccg 11821 cctcccgggt tcacgccatt ctcctgcctc agcctcccga gtagctggga ctacaggcgc 11881 ccgccaccac ggccggctaa ttttttgtat ttttagtaga gatggggttt caccgtgtta 11941 gccaggatgg tctcgatctc ctgacctcgt gatccgcccg cctcggcctc ccaaagtgtt 12001 gggattacag gcgtgagcca ccacgcccag cctgaccctg tctcttaaaa aaaaaaaaaa 12061 aaaaaaaaaa aataggccgg gcgcagtggc tcatgcctgt aatcccagca ttttgggagg 12121 ccaaggcggg cagaccacct gaggtcaaga gttcgagacc agcctcaaca tggagaaacc 12181 ccgtctctac taaaaataca aaattagccg ggcgtggtgg tgcatgcctg taatcccagc 12241 tactcgggag gctgaggcag gagaattgct tgaacctggg aggcagaggt tgcggtgagc 12301 caagatcgcg ccattgcact ccagcctggg caacaagagt gaaactccgt ctcaaaaaaa 12361 taaaaaataa aataaaaaaa taaaaacaga cacaaagtgc tctccctcac tggctcaagc 12421 tctggtgatt ggagaggtga gaaggggggc caggtgcagt ggctcacacc tgtaatccca 12481 gcactttggg aggccaaggc aggcggatca cctgaggtcg ggagtttgag accagcctta 12541 ccaacatgga gaaaccccgt ctctactaaa aatacaaaaa aaattagtcg ggtgtggtag 12601 agcctgcctg taatcccggc tactcaggag gctgaggcag gagaatcgct tgagaccagg 12661 aggtggaggt tgcggtgagc caagatcatg ccattgcact ccaacctggg caatgagagt 12721 gaaacttgct ctcaaaaaaa aaaaaaaaaa aaaaaaaaaa aagaggtgag aagagaacca 12781 ggagacactg tgtcctggaa accaagaagg agaagcattt tagaaggaat ggtaatagac 12841 agtggcaaat tcagcagatg agcaagggaa agtatagagg gaaaaacgca ggtttgggca 12901 ttaaggaaat cagcaggcaa ggcatttgca gtcatgttaa ggataaaagt gagctatagc 12961 agaaggttag gaaggagtgt ataggaagga aattgatggg ctgggcgcag tggctcacac 13021 ctgtaatccc agcactttgg gaggccaagg agggaggatc acttgaggcc aggagtttga 13081 gaccagcctg ggcgacacag tgaaacaccg tctctaccaa aactacaaaa attagctggg 13141 cgagatggcg cgtgtttgta gtgcagtccc tgccactcag gaggatgagg tgggaggata 13201 acctgagccc aggaggttga ggctgcagtg agctgagatc gtgccactgc attccagcct 13261 gggcaacagc cagaccctgt ctccaaaaat aaaaaaaata aaatagaaag tatggctaaa 13321 atggggtggg gattgtgtgt tagggaaaga aagcagcgat ggaagggctg aaaatggttg 13381 gagatgaaac gtctctgaca gctttgtgcc tcccaagggc atactgttca gtcagcgtct 13441 tccttggttc cagggtcctg tgcttggatt gtggggaaca gactccccgg ggggtgctgc 13501 aagagcgttt ccaagtcctg aaccccacct ggagtgctga ggcccatggc ctggctcctg 13561 atggtgacgt ctttctctca gaggagcaag tccggagctt tcaggtccca acctgcgttc 13621 aatgtggagg ccatctgaaa ccagatgtcg ttttcttcgg ggacacagtg aaccctgaca 13681 aggttgattt tgtgcacaag cgtgtaaaag aagccgactc cctcttggtg gtgggatcat 13741 ccttgcaggt atctgacttg gcaagagtgg taaccacccc ttgtgcggga ttgggagtcc 13801 tggagagaca ccctgtttgg tttaactttt tctagatcta gagaacctag acattcttat 13861 gtgtgtcttt tctccgtgca ggtatactct ggttacaggt ttatcctcac tgcctgggag 13921 aagaagctcc cgattgcaat actgaacatt gggcccacac ggtcggatga cttggcgtgt 13981 ctgaaactga attctcgttg tggagagttg ctgcctttga tagacccatg ctgaccacag 14041 cctgatattc cagaacctgg aacagggact ttcacttgaa tcttgctgct aaatgtaaat 14101 gccttctcaa atgacagatt ccagttccca ttcaacagag tagggtgcac tgacaaagta 14161 tagaaggttc taggtatctt aatgtgtgga tattcttaat taaaactcat tttttttaaa 14221 taaaaaattg ttcagcttta tatgaaatgc ttcatgaatt tgtgtgtcat tcttgcacag 14281 gggtcatact aacctctgta tcattccaat tttagtatat gtgctgctga agagagcact 14341 aaaactaatt tttttttttt tttgagacag gatctggctc tgtcacccag gccgaagtgc 14401 aatggcctga cctcagctca ctgcaacttc tgcctgccag gttcaagcca tcctcccacc 14461 tctgcctccc aagtagctag gactacaggc gtgcaccacc atacccaact aattgttttt 14521 attttttgta gagacagggt ctcactttgt cacccaggct ggtctcgaag tcctgagctc 14581 aagctctcca cccgcctcag cctcccaaag tgctgacttt ataggcacga gctaccacac 14641 ccggccaaaa cttttttttt tgagacagtc tcgctctgtt gcccaggctg gagtgcagtg 14701 gtgtgatctt ggctcactgc atcctctgcc tcctggattc aagtgattct cctgcctcag 14761 cctcccaagt agctgagatt acaggcacgt accactacac ctggctaatt ttttatagtt 14821 ttggtagaga cgaggtttca ccatattggc caagctggtc tcaaactcct gacctcaagt 14881 gatctgcctg ccttggcctc ccaaagtgct gggattacag gcgtgaacca ccgtgcccgg 14941 gtgttttgtt ttgttttgtt ttgtctttga gatggagtct cgctctgtag cccaggctag 15001 agtgcagtgg catgatcttg gctcactgca acctctgcct cccagattca agcaattctc 15061 ctggctcagc ctcccgagta gctgggacta caggtgccca ccaccatgcc cggctaattt 15121 ttgtattttc agttcaccat gttggccagg ctggtcttga actcctggcc tcaagtgatc 15181 cacctgcctt agcctccaaa agtgctggga ttacaggcat gagccaccaa gcccggccag 15241 tggtcatctt tccatgtcct aagtgctatg gtattgtgac tgataggctg atgttcacaa 15301 gcattccctg gaccctgctc ttcgtttatc tttgcattca cttgtttcat ttctttacag 15361 atccctaagg agcatccgtt ttttctcaaa ctaatcaaag ctcgctgagt gcagcctgta 15421 gggggcaaca aatcaacacc tctctgccag cccatttctc aacccggcag ctttcccagc 15481 caaagagtta aggtctaggc tgaggagatt tgccccagaa caaagacttt ttcaaagaag 15541 cagctgcatt catttccttt ttatctcacc tggcaggaga aatggcctgg gacatatgta 15601 ggccagagtg gtctctggag cgccaaccca aggatctcac taattttgtg gggtaagggt 15661 agaaggaaat gactaaattt ataatccaaa tggttaagtg tctttaactg aaccattttg 15721 atataatttt tgaaaaaata agcaatggcc atgaattaat tttttataac caagagataa 15781 aggaatcttt caaaaatagc acagcagctg ggcgtggtgg ctcaagcctg taatctcagc 15841 actttgggag gccaaagcgg gcggatcacg aggtcatgag atcgagacca tcctggctaa 15901 cacggtgaaa ccccgttttt attaaaaata caaaaaaatg ggccaggcac agtggctcac 15961 ttctgtaatc ccagcacttt gggaggccaa gacaggcaga tcacgaggtc aggagatcga 16021 gaccatcctg gctaacacga tgaaacccct tccctactaa aaatacaaaa aaattagctg 16081 ggcgtggtgg cgggtgcctg tagtcccagc tactggggag actgaggcag gagaatggcg 16141 tgaacctggg aggcggagct tgcagtgagc cgagatcttg ccactgcact ccagcttggg 16201 cgacagagcg agactcaaaa aacaaacaaa caaacaaaat tagccaggcg tcgtggcagg 16261 cgcctatcgt tccagctact tgggaggctg aggcaggaga acggtgtgaa cccgggaggc 16321 ggagcttgca gtgagctgac atcgcgccac tgcactccag cttgggcgac agagcaagac 16381 tccgtctcaa aaaaaataaa aataaaaaat aaaaaataaa aatagcacac cagggccagg 16441 cacagcggct cacgcctgta atccctgcac tttgggaggc ataggcgggt ggatcacaag 16501 gtcaggagat ggagaccatc ctggccaaca tgatgaaacc ccgtctctac taaaaataca 16561 aaaattagcc aggcatggtg ggatgtgctt gtagtcccag ctactcagga ggctgacgca 16621 ggagaatcgc ttgaaccagg gaggcggagg ttgcagtgag ccgagattgt gccactgaac 16681 ggcctgggtg aaagagaaag actctgtctc aaaaacaaaa caaaacaaaa caaaaaaacc 16741 acactaggta tgtggattgt atgtttatat atttaattat atatatatat atttttgaga 16801 cggagtctcg ctctttctct tttgcccagg ctggagtgca gtggcgcaat cttggctcac 16861 tgcaagctct gcctcctggg ttcatgggac tacgggcgcc agccaccacg cccagctaac 16921 ttttttgtat ttttaataga gacggggttt cgctgtgtta gccaggatgg tcttgatctc 16981 ctgacctcgt gatccacccg cctcagcctc ccaaagtgta cctgtttgtc ttttttaaaa 17041 aagattgcaa atgaggctcg gtgggaggcc gaggtgggca gatcacctga ggtcaggagt 17101 tccagactag cctggccaac atggtgaaaa ctgtctctac tgaaaataca aaaattagcc 17161 gggcgtagtg gccagtgcct gtaaccccag ctactcggga ggctgaggct gaagaatcgc 17221 ttgagcagag gaggcagagg ctgcagtgag ccaaggtcat gccactgcac tctggcccgg 17281 gcgatagagc aagactccgt ctcaaaaaaa aaattatata gacacatata tatacataaa 17341 ataataataa taggcccggc acggtggctt atgtgagtct gtaatcccag cactttggag 17401 ggctgaggcg ggtggatcac ccgaagtcag gagttcgaaa ccagccagcc tgaccaacat 17461 ggtgaaaccc tgcctctact aaaaatacag aattagctgg gcatggtagc cggcgcctgt 17521 aatcccagct acttgggagg ccgaggcagg agaatcgctt gaacctggga ggcggaggtt 17581 gcagtgagct gagatagcat cattgcactc cagcctgggc aacaagagca aaactccgtc 17641 tcaaaaaata aaataaaata aaaataatag taataaaaag taaaaaaata gaaaaagatt 17701 acaaatcaag gcttgcttaa tgtaataaag actttctcaa gaaaaattaa agtgcattat 17761 cctttcctct ccttcccctt aggaaaacag tatttatcga acatttacta tacaacaagc 17821 acttaggaag catttacaca acatatttgg acacttaatt cccacaaccc tatgataaga 17881 gcactattat ttcgggtaac ctgcccaagg tcaccaagag ggtgactagg ccaggaatga 17941 aggccagtct gtccagttct gagttcgctc tcggactttc tgctgtagaa acccggtggg 18001 gtaacagcag ctgtacctgc ggaagaagga caaagctcaa gaaacttcca gaagtgaccc 18061 aggctgggtt ctgaaagatg aacatgcctt cacgaggtag gaaaagggag gaaaccactc 18121 atgcaaaaag catggctgtt acaccaccaa atctcttcgg tttttttttt cccttcaatg 18181 tgggggtgga aatttaagtg tagatttggg gcgcctcaga ggcctgtggc tcgcccaaca 18241 tccccttgcc tctctgcttg gccacctctc ctctcagctc ccctttgttc cctgcacccc 18301 actttcccta ccccctaatc cgtccccggg ataaaagcaa tgcaggcgat gcccacagga 18361 aaccggcagg aggagctggg tgacagggaa gagcgacccg gagagtcaca tccacgtgag 18421 ttcgccctct tttcacactc tagcgcccat tctgtccttc ctgctggatt ctggctcagg 18481 agaagagcgc cgccaggacc ccaggaagct gcagacagga gccgctgaca cctccacctc 18541 acgcggacgc cacctagtgt gaccattacc agccactgcg cctgcgcaga ctcgggtttt 18601 ctgtgcgcgc cagaggggcg gggcctgggc cgggaagggg cagaagtaga ccgacggctg 18661 ggccgccatt caccaataga aaagagagat cgagagccac gtgctcataa gatatccaat 18721 cacaatcaca tcctgtggcg gcggattggg gggcgtggac tagaggttga tcgggttggg 18781 cacacagcat cacgtgacac gaaaaggtaa ataaggccac gtggcttttt ctccttggct 18841 tcggcgaaac attacgtcac actaccgtct ctgaacaatt atttaatgaa cttatgtgat 18901 gcgctagaca atgtgaagac aatgcggttg ctagaaattg taatctctcc atcaaattac 18961 tctgtccagc ttatttcctt gtttcttcat tcaaatcatc ctagacattc aactagtttt 19021 ccgtatccta ctttcataca aatgctcatt taaagataat taagggaatt tattctctct 19081 ttcccaagcc ttgataatga aaaagaggta ggttattttg ttgttggttt tttgttttct 19141 tttttctttt tttttttttt ttgagacagg gtctcattct atcacccagg ccagagtgca 19201 gtctcactgc agcctcaacc ttctgggctc aggtgatact cccatctcag cctcccgagt 19261 agctgggact acagccatgc aacaccttgc tgattttttg tagagacggg atttcaccgt 19321 gttgtccagg ctggactcaa gcaagcctca gcttctcaaa gtgctgggat ttacaggcct 19381 gagctaccac gcccggccca agagagattt ttacatcaga ggatatcttt agagccttta 19441 gtagccctgt gactgctctc tctctctcgt ttgttctgtc atctaaataa ataaataaat 19501 aaataagaag aagtaagcac cattttccta aaagcccact tttcttccat ccaacaggga 19561 cttgcaattt ctttccactg aagaaatatt ttttatggcc ttcacagttc gttgaagatt 19621 aaggccaggt ctaacttgat tttgcatccc ttgaaattct tcatagcaca atatgcaaag 19681 aagggttctt cggtttatgc ttgttgaatt tgaatttgag aatagtccca ttcttgcctg 19741 ccaaagaatt aaaaagactt aaaaagaatt aattttaatt aaaaagaatt aaaaagatag 19801 cacttctctg aattatctgg atttgggttc caaggtcagc tatgccaccc aggaattagt 19861 gacttttggc aatccattgc atctttaagc cagtttcctc atccatcaga tgtggatata 19921 acagtaccta caactataga acttcaagga tgacaagata attgtgcaaa agacatgaca 19981 ccgtctggag catacagcat aatatataca aggatgccag tttgaagagt agggaagaaa 20041 gtctcagagc agactgaatt gtagacagat actaacacta aatcattgtc ttgtggccag 20101 ccccgaatac attgtatcct cagcattttg tgtggtgcct ggcccagagt aagtgttcaa 20161 taaataatca ttgaatggat gaataaaatg agggaccggg cgtggtggct cacgcctgta 20221 atcccagcac tttgggaggc caagtcgggc agatcacgag gtcaggagat cgagaacatc 20281 ctggccaaca tagtgaaacc ctgtctctaa aatacaaaaa attagctggg cgtggtggtg 20341 cgtgcctata gtcccagcta ctcaggaggc tgaggcaggg gaatctcttg aacccgggag 20401 gcagaggtta cagtgagccg agattgcgcc actacactcc agcctggtga cagagtgaga 20461 atctgtctca aaaaaaaaat tataaataaa taaataaatg agggagaaat tatgccaaaa 20521 gataagtcca cacacactcc cttcttcaaa tggaaggtct tctattgccc accacatacc 20581 cagaaatggt ggctgttctg tcagtttcca gtggtctgaa actgcttaat ggtgcctttt 20641 gatagtggac agagtctcac tgtgtcaccc aggctggagt gcagtggtgt tatcttggct 20701 cactgcaacc tcttcctccc aggttcaagt gattcttctg tctcagcctc ctgagtagct 20761 gggattacag gtgcacgcta ccatgcccag ctaaattttg tattttttag tagagatgag 20821 gttttgccat gttggccagg ctggtctcga actcctgacc tgaggtgatc tgccctcttc 20881 ggcttcccaa agtactggga ttacaggcgt gagccaccat gcctggcctc aactttgctt 20941 ttctgcttca gttcccttct ctgcctctcc ccacttgctc tggtgaattt cttttttttt 21001 tttctttctt tggccgagta tcactctgtc acccacgctg gagtgcagtg gtgtgatctc 21061 agctcactac aacctcagcc tccctggttt aagccattct cctgcgtcag cctcctgagt 21121 agttgggatt acaggcctat gccaccatac tcggctaatt tttgtatttt ttagtagaga 21181 cagggttttg ccatgttggc caggctggtc tcaaactcct ggcctcaagt gatctgccca 21241 cctagacttc tcaaagtcct gggattacag gcgtgagcca ctgtccctgg cttagttctg 21301 gtgaatttca atgtaaatgt agtcataaat actcatagag gaattgggct gaagtgactt 21361 agtggttaaa agcttgggca ctgccaccag acagacctgg ccaaatcctg attttgccat 21421 ttgtgtgatc ttggacaaat gacttaactt ctctgaattt ttagttccac atccatgaaa 21481 tgggagcaaa aatatcaata tcacagagct ggtttgaaca tttaatgaga tgagatgtgg 21541 aaaactccga acaaggaagt atgcagtaag tgctccagaa atgcaagctg ttattggttt 21601 tttgttttgt tttgttttgt tttgttttgt tttgtttgtg ttttgtgtgt gtgtttgttt 21661 gagatagtgt ctctctctgt cactcaggct ggagtgcagt ggcgtgatca cagctcactg 21721 cagcctccaa ctcctgggct caggggatcc ttccacttca gcctccctag tagcttgaaa 21781 cagggtctca ctctttcacc caggctggag tgcagtggtg tgatcatagc tcactgcagc 21841 ctcaaactcc cgggatcaag agatcctcct gcctcagcct cttgagtagc tacgatcata 21901 ggcgcgtgcc gccatgctcg gcttgttaat gtttattata tttcatgact acagtgtcaa 21961 aattagcctg tagagtgtgg tgaagtcttt tatttagaaa agaaagtcgt ggccgtgtgc 22021 ggtggctcac gcctgtaatc ccagcacttt gggaggccga ggcgggcgga tcacgaggtc 22081 aggagatcga gaccatcctg gctaacacgg tgaaaccccg tctctactga aaatacaaaa 22141 aattagctgg gcgtggtggc aggagcctgt aaacccagct actcgggaag ctgaggcagg 22201 agaatggcgt gaacctggga ggcagagctt gcagtgagcc gagattgcag cactgcactc 22261 cagcctgggc gacagagcga gactccatct caaaaaaaaa aaaaaaaaag aaagtcattg 22321 tagaaaaatg gtttaataca gttgaaaatg gaggcagaat catgagacgg gggcaaggtc 22381 tggctcaaac taaagcctac cacttcaggc agggaagggc agtggttcag catgtgtttt 22441 ctgttgttag acttgccgag ttcaaattcc ttctctgctg cttactagat gatagcattg 22501 gaaaagtgac ctctcctatt gagcctcagc ttaggaaaat aagatttgct aatagcgcca 22561 atacctcaca gggctgtcaa gaggatttat ggacacacag tcctggtaca gagtaaataa 22621 tcagtagatg gtgatcgtta atcattactc aggtttcaag gtgcttgctt gcttcctgga 22681 agtgtctttc taattcaact tagcttccaa ggctggaaga aatgatactc acaaaagact 22741 aggcagagta gccaggcgcg gcggctcatg catgtaatcc caacactttg cgaggccgag 22801 ccaggcagat cacttaagct gaggagtttg agaccagcca gggcaacatg gcgaaactct 22861 gtctctacaa aaaatacaaa aattagccag gcatggtggt gcatgcctat ggtcccagct 22921 actcgggagg atgaggcggg aggattgctt gagcccggga ggcggaggtt gcagtgaggc 22981 aaggtggcat cactgcactc cagcctgggc cacagagcaa gactccacct agaaaaaata 23041 aatacataaa aataaaagag taggcagaga cctagtttat agacagagaa taaaacaata 23101 tccaaacatg aggtctttca acaaggtgct ttattggaga gtacagtgtg agatgaggca 23161 gatagaggtg atgcttttga gaggtgatat tcaactctga caatacttct tggtgtccag 23221 gttcttgtgt gccttgttat atggagcttt tgaaaagcag atggcagcgt tgcggtcgca 23281 gttgcaaatg aaggcctcac actctttgtt tttgcctgga gagggatgaa aggagaggac 23341 tgagccaagg tggaccctgt tttgtacaat ctggggcaag aagaattaac tggaataagc 23401 tggcagccat tccactgatg cacatttagg gctgacagat taagtgaata caagtacatc 23461 aggatgccca gttaaatttg actttcagat aaacaatgaa tgcaatattt gggacatata 23521 ccccccaaaa aattatccat tgcttatctg aaagtcatat ttatctgggc atcctgtgtt 23581 tttttgtttt gttttgtttt taagacggag tttccctttt gtcgcctagg ctggagtgca 23641 gtggcgtgat ctcagctcac tgcaacttct gcctcctgag ttcaagtgat actcctgcct 23701 cagactcctg agtagctggg actacaggtg ctcgccacca tgcccagcta atttttgtat 23761 ttttattaga gagggtgttt catcatgttg tccaggctgg tctcaaactc cggacctcaa 23821 gtgatccatt tgcctcggcc tcccaaagtg ctgggataca ggcgtgagcc accatgccca 23881 acctagtttc tctactttaa agcatgtatt ttaaaaatta tggaacatcc ttaaatggaa 23941 tactacatag acatttagaa aaattaggta tgcctgcaca tggaaatttt cccaaataaa 24001 aaaagctagt gtatgcccca aaaatctgga ggaatatgta actggttatc tttgggtagt 24061 aggattatag atgccttcca ctttccataa tagatattta tgtagagttt gaagttgcat 24121 tacttttggc ttgtttactt ttataaccaa aataaacaat aagattacag ttaaaatgca 24181 tttaaacttt attacatata atttacgtat ctctgcttac cttcaggaat cttattagtt 24241 ttgtttctct ccgatccttt attttattta tttatttttt tgagatagag tctcgctgtg 24301 tcgcccaggc tggagtgcag tggtgggatc tcggctcact gcaacctctg cctcccaggt 24361 tcaagcaatt ctcttgcctc agcctcccga gtagctggga ctacaggcat gcatcactac 24421 acccagctaa tttttgtatt tttagtagag atagggtttc accatattga ccaggctggt 24481 ctcgaacttc tcacctcaag tgatccaccc accttggact tccaaagtac tgtggttaca 24541 ggcttgagtc actgtgccca actgagttat ttttaattaa tatgaacaaa catcctctaa 24601 tgcaggtttc ttgtcttagt tgcaaacatt aaaaaaaaac tgtgtgtgag tgtgtgtgtg 24661 tgtgtgtatt tctctgggga gagggttcat agctttcatc agatttgtaa atttggcctt 24721 ctaaagtgga gcaggaggat gtataagcaa attaagccat attattcagt gactaaaata 24781 gtaacaatga aaaaacagta acaataaagg tttgaggcaa tatgaaaaag tcataacaag 24841 gtgataagtg aaaaagcaga atgcaagatt gtatttattc tgcaaagttg tcaagattgt 24901 tttaaaaaaa gaaaaagaaa gtttatatct aggctgtgca ttcatcaatt atttattgag 24961 cacctactat atgtcagaca ctgctctaga tctggggata tagcaataaa gaaaatagac 25021 agtaatcggc caggcacagt gactcacgcc tataattcca gcactttggg agactgaggc 25081 aggcagatca cttgaggtca ggagtttgag accagcctgg gcaacatggt aaaaccccat 25141 ctctgctaaa aatacaaaaa attagccaag catggtggtg cacgcttgta attccagcta 25201 ctcaggaggc ttaggcagga gaatcgcttg aacccgggag gcagaggttg cagtgagctg 25261 aacggaggtt gaggtgatga agtgagactc tgtctcaaaa aagaaaggaa gaaaataggc 25321 agtaatccct gttctgtgga tcttacattc tattggaaag atggaatgta aagaaataag 25381 tagccgcaca tggtggctca cccctgtaat cccagcactt tgggatgcca aggtaggagg 25441 attgcttgaa gccaggagtt taagacaagc ctggccaacg tagtaagacc cagtctctac 25501 aaaaaaattt ttttaattag ccaggcgtgg tggtgcacgc tcgtagtccc agctacttgg 25561 gaggctgagg tcagcagatc ccttgagctc aggaggttga ggctgcagtg acctatggtt 25621 gcaccactgt gctcatgcct gggtgacaga gcaagaccct gtctctaaac aacaacaaaa 25681 aaaaacccaa actgtcacca cttacaagct gtgtgatctt gggcaaatag ctcagccttc 25741 tgagcctcag ttttcttatc tgttaagtga ggataattgt acctatccca tagtgttgtt 25801 ctgaggatta aaggagacac tgcccagaac caagaaaatt aacactaaat catggctgtt 25861 gttactatta tttccccccg gcctactgag aaccaactag aattcatagg tcaaggaagg 25921 gataaaccta ctgctacagg tgattgccga gccagagcac gagtatgaat aggtgtgggt 25981 gtacgggttg tccagcagaa atttacagct gtccagcttc ttggcctggt catagcagtt 26041 gtcatgtgtc tggcagcacc tggaaagtgg gagggacagc tgagatagga gtaagtgcag 26101 agcaataggt cagccctcac ctgcccactc tcaggaacag gtggggatga cttcgccaag 26161 atgcttcagg agaaatgatc ctatggcttg aaaaaatata agtcctttta tcatatattt 26221 attgaaagca tttttttttt ttgagacaga gttgctctgt tacccaggct ggacagcagt 26281 gacacgacct tggctcactg aaacctccaa ctcagtctcc caagtagctg gggttacagg 26341 cgctcgccac catgtccggc taatttttgt attttttagt agagacaggg tttcaccatg 26401 ttggccaggc tggtctcgaa ctcctgacct caagtgatcc gcccgcctcg gcctcccaaa 26461 gtgctgggat tacaggcatg agccaccgcg cctggcctga aagcgatttt taacatggtt 26521 gcaagaatac agtaaaatcc tgtattcaga aggtgcagtg acaaccactg ctagaaaatg 26581 acacgtggat agttgctgtg atgtttactg ttgctacatt ctgccactgg agggcagcat 26641 cgcccactgt tttgcacgtc ggtcattgtt tccatctcta acggggtgcg ttcgctacag 26701 ggtcctcttg gaacatattt gtttatttga caagaggtga aaatatgtta tccttagcag 26761 atatgcaagt cccctttgta tgcctcgtga gatccttggc gtgtgcccca ccccgccccc 26821 ggcaggcact ccaattttcc tgcaggcgga tcacttactt gtccagttca tccacggggg 26881 tgcctgagcc ccccaagcca cagtagcagc cgtagttgtt gtattccaag aaggggtcac 26941 tccccgggat cacgcacttg atcattttgc ggaactgcca cacggcccga gggctgatgc 27001 cgctgtcggc ggcggccact gcaagaagac atagccagag ttcaaatcgg tctgccagca 27061 ccccggggac acactgcctt cctgctccct cgggtcccac gctgcctccc tgacccctgc 27121 cagctgcctc ctctgaagac cgttggaccc aggaacctgg taaagtgaaa gtgggtagag 27181 gttttttttt ttttttaatg aaaactttat atatataaag tttatatctt tgtgtgtgtg 27241 tatatatata tatatcttct tctttttctt cttcttcttc ttctgggacc ctaccgcctc 27301 agctaggatt acaggcatgt gccacccagc taggcctgat agtttttttt ttttttttga 27361 gatggagtct tgctctgttg cccaggttgg agtgcagtga cacgatttca gctcactgca 27421 aactctgctt cccgggttca cgccattctc ctgcctcagc ctccctaggt cagatagttt 27481 taaactatct gcactgttta tccagtgaat atttattgag cacttactat gtgccaggct 27541 ctgtcctagg agctagagat acagcagtga acagatataa taataataat aataataata 27601 ataataataa taataataat aatctctaac ctcatggagc ttacaaccta gtgggagaga 27661 aaaacaataa ataataaatg tgataaggcc gggcacggtg gctcatgcct gtaatcccag 27721 cactttggga ggccgaggca ggcggatcac aaggtcatga gatcgagacc atcctggcta 27781 acacgatgaa accccgtctt tattaaaaat acaaaaaaat tagcggggcg tggtggcagg 27841 cgcctttagt cccagctact ggggaggctg aggcgggaga atggcgtgaa cccagcaggt 27901 ggagcttgca gtgagccgag atcgcaccac tgtactccag ccttggcgac aaagcaagac 27961 tctgtctcaa taataataag aataataata aatgtgataa gtaaggaaac tggatactgc 28021 attagaaagg ggaagaatag agcaggatga ggagaatcta aaagggaggt cacccagcct 28081 ggccaacatg gtgaaaacct gtctctacca aaaaaaacac acaaaaaatt agctgggcat 28141 ggtggtgtgt gccagctact tgagagctac aagtcccagc tacttgagag gctgaggcaa 28201 gagaattatt tgaacccagg agatggaggt tgtagtgatc tgagatcacg ccaccgcact 28261 ccagcctggg cgacagagtg agccccgtct caaaaataaa ataaagtaaa ataaaaataa 28321 aaattaaaat aaaataaaat aaatgggaga ttggcctaac tacttagttg agtaaggtgg 28381 gaggatcact tgagcccacg agttggaagc tgtggtgagc tatgatcatg ccactgaact 28441 ccagcctggg tgatagagca agaccctgtc tggaaaaaag aaaataaagg caagctttct 28501 tagctcaggc aatctgagtt cagacttgca ggttgaaaaa gtgccagcct tgagggaacc 28561 ccaatagagg agcccttaag ccccactggg aacctcgaat tgagactgcg gggctggcca 28621 tccctgtttg ctgtggcctg tggcccccat tccagaggag tgagatctta gctcacttgg 28681 gagagaaagg cgggtggagc cggggagact tgcctacctg tgagcagcac agctagcaca 28741 aggagtttca tcttgcagtc aaggtgagaa aagaactgag atgaccagtc tcaggtatag 28801 tcttatagtc agtctccaca caaccctgcc cagataaggc tggagtgttt gctttgctct 28861 tggctgagtt tggggctggc cttgaatctg tctgttgcag gaatgaccct gacgatttgc 28921 cagcatggga tcactggaaa atatgttatg catttgtcct tagcaactgt taaccaaagg 28981 gatggtttct ggagttttca gatcttacaa tcttgaccct catacccacc actttgtact 29041 cagcccagct tcacgggctg gttggggaaa gcaaagccgt cagttcagtc ggacgctggg 29101 ataagggaga caagaaggaa gtgcagcctg ggcaacatag agagacctca tctctaaaaa 29161 aaaaaatgtt aaccaggtgt ggtggtggat tgcacctgtg atcccagcaa ctcaggaggc 29221 tgagatggga ggatctcttg agccagggag gtcaaggctg caatgaacag tgattgagcc 29281 tctgcagcag agtgagaccc tatttcaaaa aaaaaaaaaa aaaaagaaag aaagaaaaaa 29341 gatcgctgcc tgcagcctca atgtcttatc aagtgcctga atctgtgcac tgggtctggt 29401 tgaatagtcc catcagtcgt gcaccttgca gataaagaga caggttcaga gagttgcggt 29461 aacttgccca aggtcacaca gttggcaaat gacaaaatag ggctccagtt tagatctgtc 29521 tgattccaag gcagaggttt tatttttttc caccactccg ctctgcagtg tgggtttgtg 29581 agaaaaacct gggctttgga ttccaaaggc ctgagtctga attatagaat actaaagggc 29641 attagaactg tcatatcaga ggccaggcac ggtggctcat gcctgtaatc ccagcacttt 29701 gggaggctga ggctggcaga tcacctgagg tggagagttc gagaccagcc tgaccaacat 29761 ggaataaccc tgtctctact aaaaacacaa aattagccag gcatggtggc ttatgcctgt 29821 aatcccagct actccggagg ccgaggcagg agaatggctt gaaccctgga ggcggaggtt 29881 gcggtgagct gagatcacgc cattgcactc cagcctaggg aacaagagca aaactccatc 29941 tcaaaaaaaa acctgtcaca tcacatatct atttatttat atatattttt tgagacggag 30001 tcttgctctg ttgcctaggc tggagtgcaa tggcgcgatc ttggctcact gcaagctctg 30061 cctcccaggt tcatgccatt ctcctgcctc agcctcccaa gtagctggga ctacaggcac 30121 ccaccaccac gcccggttaa tttttttttt tgtattttta gtagagatgg ggtttcaccg 30181 cattagccag gatggtctcg atctcctgac ctcgtgatcc acctgcctcg gcctcccaaa 30241 atgctgggat tacaggcgtg agccaccgcg cccggccaaa tatctacata acattgttta 30301 gctgaaattc acataacatt aaccacgttt aagtaaacaa ttcgggggca tttagtacct 30361 tcacagtgtt gtgcaaccat cacctctatg gggttccaga acatttttat cactccagaa 30421 ggaaaccctg tacccattag cagtcacact ccctcagccc ctggtaactg ctaatctgct 30481 tctctctctc tctctctttt ttttttgaga cagagtttcg ctcttgtcgc ccaggctgga 30541 gtgcaatggc acgatctcgg cttactgcaa cctccgcctc ccaggttcaa gcgattctcc 30601 tgcctcagcc tctcgtgtag ctgggaatta caggcgcctg ccagcatgcc tggataattt 30661 ttgtattttt agtagagaca aggtttcacc atgttggtca ggctggtctc aaactcttga 30721 cctcaggtga tccactcctc agcctcccaa aatgctggga ttataggcat gagccactgt 30781 gcctggcctc tgatctgctt ctctctatat agatttataa ttggttaact tggtgctctt 30841 ctgttcctat gtcacctcta caacaaccct gtgggataag gagggcctgg ttatttaatt 30901 tatttatctg tttttttttt ttgagacaaa gtgtcgctct gtttcccaga ctgatgtgca 30961 gtgacacgat catagttcac tgcagcctca acctcctggg ctcaagcaat cctcccacct 31021 tagcctcccg attagtgggg actacagaca tgtgtcacca cacctggcta atttaaaaaa 31081 attttttggg ccaggcacgg tggctcatac ctgtaatccc agcacttggg gagaccgagg 31141 caggcggatc acgaggtcag gtgatggaga ccatcctggc taacacagtg aaaccttgtc 31201 tctactaaag atacaaaaaa taagctgggc atggtggcgg gtgcctgtaa tctcagcaac 31261 ttgggaggct gaggcagaag aatggcgtga acccaggagg cagagtttgc agtgagccga 31321 gatcacgcca ctgtactccg gcctgggcga cacagcgaga ctctgtctca aaaaaatttt 31381 tggagacacg gcgtctccct atgttgccca ggctggtctt gaactcctgg cctcaagtga 31441 tcctcccatc taagcctccc aaagcattgg gatcacaggc gtgagccact gtgcccagca 31501 accttgatag ttttgaggag tattggtcag gatgaccctc tcttggaatt tgtgtgtgtg 31561 tgtgtgtggg tgtgtgtgtg tgtgtttgtt gtagtctcac tctgtcgccc aggctggagt 31621 gcagtggcac aatctcggct cactgcaatc tctgtctccc aggttcaagt gattcttgtg 31681 cctcagcctc tgagtagctg ggattacagg cacgtgccac cacgcccagc taatttttat 31741 ttatttattt ttatttattt ttgagacgga gtctcactct gtcaccaggc tggagtgcag 31801 tggcgcgatc tcagctcact gcaacctctg cctcccgggt tcaaatgatt ctcctgcctc 31861 agcctcccga gtagctggga ctacaggtgc acaccactgc gtccagctaa tttttgtatt 31921 tttagtagag acggggtttc accatgttgg ccaggatggt cttgatttct tgacctcatg 31981 atctgcccgc ctcagcctcc caaagtgctg ggattacagg cgtgaaccac tgcacctggc 32041 tctctattgg aattgtcctg tgtttttctc ataattagat tacagttatg ggtttggggg 32101 aggaagacca cagaggcaaa gtgccatttc atcacatggc tgctttatgt ttaatcatct 32161 cttttctggg ccgggcgtgg tggctcatgc ctgcagtccc agcacttcgg gaggccgaag 32221 cgggtggatc acctgaggtc aggagatcga gaccatggtg aaaccccatc tctactaaaa 32281 atacaaaaaa ttagccgggc gtggtggcgg gtgcctgttg tcccaggtac tcccgaggct 32341 gaggcaggag aatggcgtga acccgggagg cggagcttgc agtgagccga gtgccactgc 32401 actccagccc gggtgacaga gtgagactct gcctcaaaat aaataaaaaa aaaataaaaa 32461 taaaaatgaa ataccattaa aagttcagtt tctcagtcac aatggccaca tttcaaggct 32521 catagacaca tgtggcgagt ggctgccaga ttggcagcca cagaatgttt ccatgatcac 32581 agaaggttct gttggacagg gctgctgtgg attgtcacag agcctgtctc agggctgccc 32641 catgggaggt gggggctatg aggactaatg gagggggact tggttaggtc agaatcccac 32701 atcagcttga ctgaatggtc ctgcctctgc agccccagaa tgagtggagt ctgcccaagc 32761 aaagggaagc cttgcccagc aaaatgttca gtccctctta tgtgggttcc ttggggtcat 32821 aacaagacag aggaccaggc actgctccta ggtttaggcc tgaaggatgc taagtgggtg 32881 attgggggtg gctcaaagaa ggagaagcag ctatggttgc aagtaaagtt attaattacc 32941 ctttttttcc aggctggagt gcagtggtgc aatcttggct cactgcaacc tccacctccc 33001 aggttcaagt gattcttctg ccccagcctc ctgtgtagct gggactatag gcaagtgcca 33061 ccacacctgg ctaatttttg tatttttatt agagatagtg ttttgccatg ttggccaggc 33121 tggtctcaaa ctcctgctct caagtgatcc acccaactcg gtctcccaaa gtgctgggaa 33181 tacagacatg agccaccgca cctggctaat gagttactat taactgagcg ctaactccat 33241 cccaggtccc atatgctaag aacctggcac aaaggtcgag gtaggtccag tacaaacttc 33301 attttacaga gaagacagct gaggctgaga gaggttaagt gactgcccca gagtcacaca 33361 tcaggaccaa gatatgagtt cgatctggtt ctcatgctac catctcaggc tagggagtac 33421 tgaaagtccc ttgaggaagg aagagattca gtccaaccta cgacttccct ccttccccag 33481 cccatgaaag ccttcctaag aacaatctgg acaaactcca cgtgctggag tttaggcaaa 33541 gcacctggtg gggagggggt ccggccgctg ccacaaccct ctccccctgg aaagagacat 33601 gctcactgtg acagcccaga cctctgtggc tcctgccaca aaggtgacag tataagggac 33661 ctggaagggc actaggtagc tccccaagcc cctacactct ttgtaccttg accttccacc 33721 accccagatc cctgcacagg tgtgcgaagc acagcctctc ccagcctggg aaaaacacac 33781 tcttcagtgg aaagaacagc taagcatagc tgcttggccc cttcaggtga gggcgtaccc 33841 acgcttcagc aaggccacat ctacttggct tctttttttt tttttttttt tttttgagat 33901 ggagtttcgc tcttgttgcc caggctggag tacaatggcg tgatctcagc tcaccgcaac 33961 ctccgcctcc caggttcaag caattctcct gcctcagcct cctgagtagc tgggattaca 34021 ggaaggcacc accacaccca gcaaattttg tatttttagt agagatgggg tttctccatg 34081 ttgaggctgg tctcgaactc ctgacctcag gtgatccgcc cgcctcggcc tcccaaagtg 34141 ctgggattac aggcgtgagc caccgcaccc ggccaggtct atttggcttc taaaggatag 34201 ggctcctgtt tctaggaagt gcttaacagc agctaccagc taccatctgc tgagtgtttc 34261 tatacgcttg gcactgtgct agcactgtgc tttacatatg ttgtctttca gcctaacaac 34321 actcttctga tatagttatt attactgtga tagttgtcat catcttcccc atcttataga 34381 tagatgatga agcagtttca gagagtgtaa gtaacttgct caaagtcaca tggctagaaa 34441 gtggagaaaa aatttgaacc tgtgtctacc tgcctctgat tcctgctctc tccaatgata 34501 acaacagcag caattaacat gctgtgaggc tgtgctagga aggatactaa gagccttgta 34561 taggctagcc caccatcagg tgccaatgag caaaatgagg cagtataccg gtgtaactaa 34621 accttttcca ctcccaaagc caccatcttc agtgtcagga tcttcctgac agcctcctaa 34681 ctgctataat ggcactgtct acctccatct ctccggagaa gggatagtgt agccagtggc 34741 tcccaaactt gactgatccc tggactcccc tggaaaactt aaagagtgat atagtccagg 34801 catgatggct caagcttata atcccagcta ttctggaagc tgaggcagga ggattgcttg 34861 agtcaggaat tcaaggccac attgtgctat gattgtatca ctctactcca gcctgggcaa 34921 cagagggaga ccctgtttca aaaaaaaaaa aaaaaggtaa tacagattat tggaatgtaa 34981 gatgtgcagg ttgggtataa ggatctgaaa tctgtggtct ctctctcttt tttttttttg 35041 agacgatgtc tggttctgtc acccaggctg gagtgcagtg gtgcaatctc ggctcactgc 35101 aacccctgcc tcccaggttc aagtgattct ccagcctcag cctcccaagt agctgggatt 35161 acaggcatgc gccatcacgc ttggctaatt tttttgtatt tttagtagaa atggggtttc 35221 gccatgttgg ccaggttggt cttaaacgtt tgacctcagg tgatctgccc gccttggcct 35281 cccaaagtgc tgggattaca ggcatgagcc accgtgccca gccgctcttt tttatttaaa 35341 ttttttttca gcaggccagg catggtggct cacgcctgta atcccagcat tttgggaggc 35401 cgaggccagc ggatcacccg aggttgggag tttgagacca gcctgaccaa catggagaaa 35461 ccccgtctct actaaaaata taaaaattag ccgggtgtgg tggtgcatgc ctgtaatccc 35521 agctacttgg gaagctgagt caggagaacc gcttgaaccc gggaggcaga ggttgccgtg 35581 aaccaagatc gcatcattgc actccagcct gggcaaccag agtgaaactc catctcgaaa 35641 aaaaaaaatt tttttttcta ggtgattctg atagttatgt agttttaaac cactgttttt 35701 ttgttttttt gttttttttt ttgaaatgac tattgggttg gatagaggtt actgggtgtt 35761 attatgcttt aaaactcaaa tatactatat attttttctc ttcttttttt ttttttttga 35821 gaacaccatg ggggaaaacg cccccataat ccaatcacct tccaccaggt cctttcttcc 35881 acacttgggg aatacaattt gagatgagat tttggtggga aaacagagcc aaaccatatc 35941 attcctcctt ggctcctccc aaatcttatg tccttttcac atttcaaaat caatcatgcc 36001 ttccccacag tcctccaaag tcttaattca ttccagcaat aacccaaaag tccaagttta 36061 aagtctcatc tgagacaagg taagtccctc cacctatgag tctgtaaaat aagaaacaag 36121 ttggttactt ccaagaaaca acgcagatac aggcattggg taaatgttcc cattccaaat 36181 gagagaaatt ggccaaaaca aaggggtcac aggccccatg caagtccgaa acccagcagt 36241 gcggtcatta aaccttaaag ctctgaaatg atctcctttg actgcatgtc tcacatccag 36301 ggcacactgt gtgggcgaca agccacccag gcgccgaggc aagagactga ggacatgagc 36361 tgttccagta taataaaata taaaacaaga atagttatac tagatataga tcttagatat 36421 gattatatat gaatatcaat aatcattagt tggtagcaat gactctttat tccaatatta 36481 taataatcct cgctctataa tcataactta ggaaaaacca gaccatgcag agatgggagc 36541 tgaggggaca tagtgaggtg tgaccggaag acaagagtgc gagtcttctg ttatgcccgg 36601 acagggccac cagagggctc cttggtctag cggtaacgcc agtgtctgtg aagacgcccg 36661 ttgccaggtg gaccgtggtc tagtggtagc ataagtgtca agggaaaaca cccactgctt 36721 agcagaccgg gaaagggagt ctccctttcc ctgggggagt ttagagaaga ctctactcct 36781 ccacctcttg tggagggcct gacattagtc aggctcgccc atggttatct ggaggcctaa 36841 ccgtctccct gtgatgctgt gcttcagtgg tcacgctcct agtccgcctt catgttccat 36901 cctgtacacc tggctctgcc ttctagatag cagtagcaaa ttagtgaaag tactaaaagt 36961 ctctaataag cagaaacaat ggcgtaagct gtctctctct ctccctctct ctctctgcct 37021 cggctgccag gcagggaagg gccccctgtc cagtggacac gtgacccacg tgaccttacg 37081 tatcattgga gatgactcgc actctttacc ctgccccttt tgctttgtat ccaataaata 37141 acagtgcagc cagacattca gggccactac cggtctccgt gacttggtgg tagtggttcc 37201 ccgggcccag ctgtcttttc ttttatctct ttgtcttgtg tctttatttc tacactctct 37261 cgtctctgca catggggaga gacccactga ccctgtgggg ctggacccta cacactgatg 37321 taaggggtgg gttcctaagg cctaaggcag ctccatcctg tggctttgca gggtacagcc 37381 cccatggctg ctttcagggg ctatgtacct gtggcttttc caggcacatg gtgtgagctg 37441 tcagtggatc tacctttctg gggtctggaa gatgatggcc ctcttctcag ctgcactagg 37501 caatgcccca gtggggattt tgtgtggggg ctccaacccc acgtttccct tctgcactgc 37561 cctagcagag gttttccatg agggctctgc ccctgcagca gacttctgcc tggacatcca 37621 ggcattttca tatgtattag tccattttca tgctgctgat aaagacactt ctgaggctgg 37681 gaagaaaaaa agatttaacg gacttacact tccacatggc tgaggatgct tcacaatcat 37741 ggcagaaagt gaaaggcatg tctcacatgg tggcaggtaa gagaagagac cttgtgtagg 37801 gaaactcccc cttatagaat catcagatct tatgagactt attcactatc atgagaacag 37861 cacaggaaag acctgccccc attattcaat tacctcacac caggtccctc ccacaacacg 37921 tgggaattca agatgagatt tgggtgggga cacagccaaa ccatatcacc atacatcctc 37981 taaaacctag acagaggttc ccaaacctca actcttttct tctgcacacc tgcaggccta 38041 tcaccacatg gaagctgaca atgcttgtgg cttgcaccct ctgaagcaat ggcccgagct 38101 gtaccttggc tccttttagc cacagctgga gctggagtgg ctgggatgca gggcaccatg 38161 tctgggggct acacagagca atggagccct ggatctgacc cacgaaacca tttttctgtc 38221 ctaggcctct gtgcctgtga tggtaggggc tgtcatgaag atttctgatg tgccttggag 38281 acattttccc ccatggtctt ggccattaac acatttggct ttttgttact tatgcaaatt 38341 tctgcagctg gcttgaatcc ccagaaaatg ggtttttctt ttctaccaca tggtcaggct 38401 gcaagttttc caaaccttta tgctctgctt cccttttaaa cataagtccc aatttcaaac 38461 catctctttg tgaatgcaca tgatagaatg ctttcagaaa aggccactca cctcttgaat 38521 gcttttttgc ttagaatttt tcttctgcca gataacctaa ataatctctc tcaagttcaa 38581 agttccacag atctctaggg caggagcaaa atgccaccag tccctttgct aaagcatagc 38641 aagaatgacc tttgctccag gaccaagtaa attcctcatc tccatctgag accatctcag 38701 cctggacttc attatccata tcactatcaa cattttggtc aataccattc aacaagtctc 38761 taagaagttc caagctttcc cacatcttcc tgtcttcttc tgagcccccc ctaaactgtt 38821 ccgacctctg cctgttatcc aattccaaag tcacatctac aagtttcagg ttatctgtat 38881 agcagtacac ctagcagagg ttccccatga gggctctgcc cctgcagctg acttctgcct 38941 ggacatccag gcattttcac atgtattagt cagttttcat gctgctgata aagacgtacc 39001 tgagactggg aagaaaaaaa gatttaatgg atttatagtt acacctggct ggggaggcct 39061 cacaatcatg gcaaaagttg aaaggcatgt ctcacatggt ggcaagacaa gagaagagac 39121 cttgtgtagg aaaactcccc cttatataat catcggatct catgagagct ggtaccaatt 39181 ctctgtatta gtctgttctc acactgctat aaagatacta cctgagactg ggtaatttat 39241 aaagaaagga ggtttgatag ttccacatgg ctgggaagcc tcaggaaact tacaatcatg 39301 gcagaaggca aaggggaagc aaagcacatc ttacatggca gcaggagaga gagagcagag 39361 tggggagtgt tacttttaaa ccactggatc ttgtgagaac tcactatcat gagaacagca 39421 tggggggaac cacccccata atccaatcac ctcccaccag atccctccct ggggatttca 39481 attccagatg agatttgggt ggggacacag agccaaacca tatcacccac atacttcttt 39541 tttattggct attttaaagc aaatcatagc tctcaagtca ttttattcat aagcacttct 39601 gtatgcattt caaactttaa gtatttttta attctggtaa aaaatatata aagtttacca 39661 tcttaaccat atttgaatag gcagttcagt agtgttcagt atattcacac tgataacatt 39721 tttcatcttg caaatccaaa actgcaccca ttaaacaact ctccttctcc cttgccccag 39781 gctctggtaa ccaccattct acttcttgtt tctgtgaatt tgactaatct cagtaccttg 39841 tataagtgga atcatatagt atttcatcca cagtcttatt tcatataaca cgatgtaagc 39901 tgatgtcctc aaggttcatc catgttgtag tatatgacag tattcccttc cttttttttt 39961 tttttttttt gagacagagt cttgctctgt cacccagggt ggaggctgga gtacagtggc 40021 actatcttgg ctcactgcaa cctccacctc ctgggttcaa gcgattcttc tgcctcagcc 40081 tcccaagtag ctgggattac aggtgcccac caccacgtct ggctaatttt tgtattttta 40141 gtagagacag ggtttcacca tgttggccag gctggtctca aactcctgac ctcaggtaat 40201 ccacctgctt tggtctccca aagtgctggg gattacaggt gtgagccacc acattcagcc 40261 caatttcctt cctttttaag gctgaataaa tagtccattg tatggataga tcacatttta 40321 cgtatccatt tatccattga tggacttttg ggttgtttct accttttggc tatcatgaat 40381 attgctgctg tgaacactgg cgtacacata tctgtttgaa gggctgggtg cggtggtgca 40441 cgcctgtaat cccagcaatg agtgtgaagt ggcatctcat tgtcaacact aagctctggg 40501 tcaagtggaa agtaaagacc ttcccaggag gaaagcagga gggagaaagt gcagagaata 40561 tttaaaaata tccactctga ttatttaagt taaaggcact tgaaaaacag caggaacaag 40621 atgatcactc tcacctttgt gctgtttctt aaaagcagaa gatgatgaaa ttcctgagtg 40681 aaagaccccc aacctgtact ggaaggaaag gcaacatcct tatcttcgag ggcgggaagt 40741 cggcaccgag atgattctgt gcaggccctt gttaaagtaa ttcttttttt ttttgagaca 40801 gggtcttgct ctgtcaccca ggctggagta cagtggtgca atctcagctc actgcagcct 40861 ctgtctccct ggctcaagaa atcctcccat ctcagcctcc caagtagctg agactacagg 40921 cctgcaccac cacacaaccg gctaattttt gtattttttt gtagagatgg ggttttacca 40981 tatcacccag gcaagtctcg aactgctgga ctcaagggat ccgccttcct cagcctccca 41041 aagtgtttcg attacagttg tgagccaacg tgcccagcca aaatagcttt tttttttttt 41101 ctttgagaca gtctcactcc gtcacccagg ctggagtgca gtggcgcgac cttggctcac 41161 tgcaacctct gcctcctggg ctcaaacgat tcttatgcct cagcctccca agtagctggg 41221 actacaggtg cgtgccacca cacccagcta attttttgta ttttagtaga gacggggttt 41281 catgttgcac agggtggttt cgaacacctg agctcaggcg atccgcccgc cttggcctcc 41341 caaagtgctg ggagtatagg cgtgagccac catgcctggc ccaaaatagc tgttatcttt 41401 taagcttctc cacataattg agtggctttt tcataattta ctattctttg tccaatccag 41461 tatatcagta actctaactg cttctttagg tcttcatttc tccctctctc tctcttcttt 41521 ttttttgaga atgagagcat cttgctctgt cctccaggct ggaatgcagt gtcaagaaca 41581 tggctcactg cagcctccaa ctcccagggc tcaagcaatc ttcccacctc tgtcccctga 41641 gtagctggga ccacaggctt gtgccaccat acctaaataa tttaatattt tatatagatg 41701 aaggtctcac taagttgccc aggctggtct tgaacccctg gcctcaagca ctccttttgc 41761 ctcagcctcc ccaagtactg ggattacagg tgtgagcaac cacactctgc cttcatttct 41821 ttatggaggc tcccatgcca tgcacaactt ttaataaata aatgtgtacg cttttctcct 41881 attctttttt tttttttttt tttgagacag agtctcactc tcacccaggc tagagtgcgg 41941 tggcatgatc tcagctcacc gcagcctcca gctccagggt tcaagagact ttcctgcctc 42001 agcctcctga gtagctggga ccacaggcac ccaccaccac actcggctaa tttttgtatt 42061 tttagtagag atggggtttc gccatgttga ccagcctggt ctcgaactcc tgacctcaat 42121 tgatctgccc acctctgcct cccacagtgc tgggattaca ggcatgagcc actgtgactg 42181 actggccctg ttcatctatc ttgtgtcaat ttaattcttg gatcctgcca ggaccctaag 42241 aggatggagg tgagtcctgc catccctaaa acaggaaaat gaggtatccc tgaatgcaca 42301 gagacacaag acattcacaa gcccagcaac gttttcaaat aatttattag gaatttaaaa 42361 ctgaaaataa aacctggaaa aagaagttac agatgtggag agaagagaca ccggaggatg 42421 gtaacttgct ggcttcgaaa caccatgtaa catcttaaaa aaaaaaaaaa tcccaaagca 42481 aatcagaaaa cggaattcca gggtcctgag cccatggttg ggcccagtgg ggtggaaggg 42541 tccgggaatg agggagaggg aagctaagtg tctcaggact cagctcaaac gtgtagaaaa 42601 ttaaaaataa aaaccaataa aatgcagctt ctcttttatt aggaaacatt aaaaaaaaaa 42661 aaaacccaaa acacgaacag ccgcgcatct cagtaacaaa gattattgct ttgtgttctc 42721 agggctgata ggttaagcac ctcacacaga caattaactc tccaaaggtg gggtttccgg 42781 gtgggggcaa tgctggggaa gagagctcag gccctgggcc tcataactgg gggagaggga 42841 cacacagaag ggggatggca gtgggtgggc cttggccctg ccacgccagc caggccactg 42901 ttcatgaagg tccaacgctc tgaacctctc tttttcctta gaaggggggt ctggggtgag 42961 gggctgccaa gggactctgg ctgtggggtc atcctggtgg gaaacttcag tgagagaatg 43021 tgggggttcc ctgagacgcc ctttgctttc ccctggggtc tgccctcgcc agagcgtccc 43081 ctggtgccca ctcattgcgg gggtcgaggg ggaaggggga cagactgagc agaaaccatg 43141 aagccccaac ctaggtaggg agcaggggag acctttggag agttaattcc tgtccagcag 43201 tgtcgctggg caagccctgc tcttcccacg ccctttgccc ccacactggg tttttggagt 43261 gggcaggtcc aaccaaggcc ttgaccctga gggcctgaga gagcgggcct gccgtggcca 43321 cagctgaggc ctgcaagctt acagtaacgg tgtctgaggg gacagaaaca ggggaggggg 43381 gagcccctca cccccgaggg gtcttagaga ggggtgggca catcacctca cagtatttac 43441 atatgataca ggacgggatg gttccagggg ctcggcctgg cctccccgca gccctgccct 43501 cctctccagg gctggaggga ggcggccagg ggcccacacc cagatagaca ctttgttctc 43561 agctggtggg gggcacctgg ccccttgccc cctgcggcct ggggcttcac attcacaaac 43621 ctagaaatag tttaaaaaag gtttctttaa aaaaaaaaaa aaaaaaaaaa agggaaaggg 43681 taaaggaggg gaaatctgaa taaaaaacag gggttgggct gcggcctgta gcaggctccc 43741 tccgccttca ctccagctat gcacaaatcc aaaagccttt tggggagggg gcggtcctag 43801 gggggcgcgg cacggaggga gggatggacc agagggctgg ggtgggcaac cggctaatcc 43861 aaaataaata aaagcagggc cgggagaggg gcgggtgggg atgggggcaa gggcgaagag 43921 ggggacgtta gtaggggagc cagacgctgt tgggggcagc agggcagggg ccgggcccgg 43981 ggagttgggg gcaggcagta gcgggtccga gtcgctgcag gggagggggc ggcggctgcg 44041 gctgaggtct cccgccccct cgctggctca ctcgtggtcc tcagtcagct gcaggctggg 44101 gcgctgggaa cacagccagg aggttatggg ggctcccgga gcacacgcag ccctccccag 44161 ttctctccca gaagagcttt gttcgtaaac tacctacttt cttcttctta tttttttaga 44221 gacagagtcc agctctgttg cccaggctgg agtgcagtga cagcgatcat agttcactgc 44281 agcctcaacc tccctaggct caagtgatcc tcctgcctca gcctcctgag actctgggat 44341 tacaggtagt atctgggact acaggtgaaa gcaaacacac tctgctcatt tttgttgttg 44401 tagcagagat ggggattcac tatattgccc aggctagtct caaactcttg tcctcaagcc 44461 atcctcctgc cttggcctcc caaagatctg ggattacgaa tgtgagccac cacacccagc 44521 cagactttgc actttctgtt ccctggttgg atgcctcttt ccccagattt cgaatggctg 44581 gatttattta aatgtcccct tctcagagat accttctcta aaccccacat ctaaaatagc 44641 cctgtcatgc acagtctgga attatctcat atgttcactt gatttttgtc ctgtgaaatg 44701 agagaggact ttgtctgtct tgttcatcct ctttcctgga atggggctca taaatgaatg 44761 ggtgaatgaa cagaatgtca tgggctcaca gtctcccacc atctggatag acactgtgat 44821 ggccgtgtac acagtcccac aagtcgaggt atcctgttgt tttttgtttg ttgaggtgga 44881 gtcttgctct gttgcccagg ctggagtgca gtggcctgat ctcggctcac tgcagccttc 44941 gctttctggg ttcaagcgat tcttgtccct cagcctcccg agtagctgag attacaagtg 45001 cgcaccatca cgccagccaa gttttgtatt tttagtagag atggggtttt accatgttgt 45061 ccaggctggt ctagaactcc tgacctcagg tgatccgcct gccttggccc cgcaaagtgc 45121 tgggattata ggcgtgagcc actgcgcccg gccaaggtat cctgttttga tagctgtgta 45181 ctataccact gtacacacac taatttgttt aaccactttc cttttgatgg atatttcagt 45241 tatttgcagt tttaaaaata aatgcctaga agtggaattg ttgggtccaa gggcatgaat 45301 gtttgaaact gggacaaaca ctgccagact gccccgggaa cccctgagcc actgtactta 45361 cttgcttctc ccatatcttg atcaatgctg ggcagtgttc aacttaaaaa atctttgtgg 45421 ctgggcccgg tggctcatgc ctgaaatccc agcactttgg gaggccaagg cagaggaaga 45481 tcgtttgagc caggagttca agaccaacct gggccacata atgagacccc atctctacaa 45541 aaaaattttt taaaaactta gccaggcatg gtggcatgct cttgtggtcc ccgctacttg 45601 ggaggctgag gtggaggatg gcttgagcct aggagttcca ggctgcagcg agctgtgatt 45661 gcacactgca ctccagcctg ggtggcagtg agaccacatc tctattaaaa ataataataa 45721 tagggctggg cacggtggct cacccctgta atcccagcac tttgggaggc cgaggcagat 45781 ggatcacttg aggccaggag tttgagacca gcctggccaa catggcaaaa ctcgtctctg 45841 ctaaaaacaa aaaaaatacc aaaattagct gggtgcggta gcacatgcct gtaatcccag 45901 ctactcggga ggctgagaca cgagaattgc ttgaacccag gggacgagtt tgcagtgagc 45961 caagattgcg ccactgcact ccagcctggg tgacagagcg agactccatc tacaaaaata 46021 aaataaaata atagtaataa taataaacga tcttaaaaaa tctttgtggc ctgcgctggt 46081 ggctcacacc tgtaatccca gcactttggg ggaccagggc aggaggatcc cttgagccca 46141 ggagtttgag accagcctgg gcaacacagt gaaaccctgt ctctacaaaa aaagtaaaaa 46201 aattagccag gcatggtggc aggtgcttgc agtcccagct actcgggagg agtctgaggt 46261 gggaggatcc cttgagccca ggaggtgtgg gtgacagaac gatatcttgt ctcaaaaata 46321 ataataataa taatctttgc caacctgata ggtgggatgt gaagccttac tgttgctttg 46381 atttctgtgt ttaaatggag tagggtgagc agcttttcat ctttattgac cacagctggt 46441 ataaagatct gtctcccctg ggactgtggt tcttgagggc agggatgggg tctgtgtccc 46501 agattcgatg gcccaggaca ggcttcccca gtgccacccc accacctgcc cacccccaca 46561 tcctcacctc ctgccaccgt cccctgcttc agtggtaccc attggtgaag gctgtggcaa 46621 tcaaagggcc ctgaaaagga aagaatttga ctcctagatt cctgggaaat agaggaagat 46681 caggggctcc ttctctctct gcaggacatc tgaagtgagg cgctgacctc tctctacctg 46741 tccggatcgc ccccctactg caggtctgcc tacccggatc accccccact gcaggtctgc 46801 tcaaagattc tccaacagag ggccagggaa tctgcacaaa ccgtaccctg gataaaacca 46861 gggcacccct ggaatcaaac ctccatgtga gcagagatgc catgtgtctc atttgcacct 46921 agagcccact atgcctggca ttcagaggcg ataactaagt gcccactgaa tgaatgataa 46981 cccccacaac tcttccttag tgatctttct ccattcatcc tccctcaccc tccttttgcc 47041 cccagagaca aagaagttct ctcccttccc caacttgggg gtctctcaag gtgtgggggg 47101 tgaggtgggg gccgctaggc ctggccactc accccaagac tgtggccgaa gccggtgctg 47161 ggggcagggc tggcggcgct gatgtaactg ctgacccccg agtcctggtt ggccgccccg 47221 tagagctcgg ccatggggcc ggggctggtg gtccccagga agccccctgt gcggctggga 47281 gtcgaacctg gagggggagc catcgtccag gggtgagagc ctggcaaccc agaaagaaga 47341 cggtgatagc cctggtgccc tctgccaccc cacggcaccc cctcaagcct gtcccaccca 47401 ccctaacctg gacccccaca tttcagaaag tcctgtgggc ccacctcctg gagatctccc 47461 catgcccatc cacccagtca cccctagccc tgacccccat catctcgtct gaccactcag 47521 cagcctcacc tcctgagccc cttcccaacc tctccccacc tagcagctga atgcatgttc 47581 taaaccctga cccgcgtccc tctgctactt gcaatttcca ctgtctcccc accacccggg 47641 ctcctcctgc ctccgttctc cactccccac ttccgcactt tccccaccca ctggcctcca 47701 agcctttgca tatgctattc cctcaaccag aaacactttt cctcctcttt gcctttgaat 47761 gcaaagttaa gaggcccttc ctcttggaag ccttccctcg cctttcctag aggccccagc 47821 tgggagctcc tctggtactg gactccccct agtgcagcag catcagccat cagcattatt 47881 gcctttttaa tacctgatgc caccaccagg ctgcaagccc tggaaggcag agaccctgtc 47941 agtctttttt ttttttcttt tgagacagac ggagtctcac tctgttgccc aggctggagt 48001 gcagtgacgt gatctcagct cactgcaacc tccgcctccc gggttcaagc gattctcctg 48061 ccttggcctc ccgagtagct gggattacag gcgcgcacca ccacgcccag ctaatttttg 48121 tatttttagt agagacgggg tctcaccatg ttggccaggc tgctctggaa ctcctgacct 48181 caggtgatct gcccaccttg gcctcccaaa gtgccaggat tataggcagg agccaccgcg 48241 cccagccgac cctgtcagtc ttgctgaggg ctggctcgcc catgctggca cgatgcccag 48301 cacagagcag gtaatcaggt gatacttgag tgaatgaaca agtgtccctc tgcaaagggg 48361 tggcagtggc cttctcccct cccctggact tctctgctcc ctgtgctccc tcatctctct 48421 tcctgcacct cagaagagct cacctgtccc tcgaaccaca gccgctgccg ccgctgccgc 48481 cgccattggt ccgtaggcag tgagaggaat ggctgaaagg aaagggatgg gcacatcagc 48541 atggcggtga ggggcagaga tgggtgaagg aggcccctac cttttaggca gagcctgtcc 48601 agcctggccc tacctcagag ggttccatgt agccccaggg tcaaggcaga aaaggctacc 48661 ttgggccaca ccatctgtgt cctgacctgt ggcatggatg gaaaaaggtg ggagagagga 48721 cagcttccca gtgagtagct gagcctcaat ttctcccctg gtctgggggg acatgcttga 48781 ataacccaaa gctcatggag ggtggaggca gggcacccaa ccctgagaag ctgcgaaggt 48841 tgcccgcagc tgcccccttc tccaggatgc ccttggcaaa acagggatca cttggcccag 48901 aggtgcctgg gagctcacgt ccttcttcgc agcccacgac acctggcgat agacagacat 48961 gggacaggcc cccaggcctg ctgcccccag gtagagcgct ctaggggtca ggctgaggct 49021 gggctcacac cgaggccata gcgttgcacc gcaccttctg ctgcctgtcc tgctcccctc 49081 gcccctgtct cctgagagca ccctaagggc ctccctgacc acccagaaat gaccactgcc 49141 tcccccaaat ccccccagcc tctttgtctg ccatctcctc ttcagacaca ggcacctatc 49201 cccttgtttg ctgtctgcca tcatcgccac ctctgcagca cctagcacct tgccttgtac 49261 acggcaggtg cttaatacct gctgaaagag tagaaaggga atctgcttaa aacacccagt 49321 ggcttgaatg gcttgagcct ggggctccct ggggctgaga gatgagcacc tcctactgtg 49381 aatctttcct aaaaacttca tacagtgggc tccttctcac tcagatgcca cctcctctaa 49441 gaagctttcc ctgaataccc ccactcaggc ggccccttgt ccccaaccat aacgccttag 49501 cacatttccc tgctttaatt tcctcatgga acttctcacc ctccaaaaat atcttttttt 49561 ttttaaatta atttttttgt aaagataggg tcttggaaag gctgggtgcg gtggctcatg 49621 cctgtgatcc cagcactttg ggaggccgag gcgggcggat cacctgaagt cgggagttca 49681 agtccagcct gaccaacatg gagaaacccc gtctctacta aacaaaaaac aaaaaacaaa 49741 attagccagg tgtggtggtg catgcctgta atcccagcta catgggaggc tgagacagga 49801 gaatcgcttg cacccaggag gtggaggttg cggtgagcca agaccgcgcc attgcactcc 49861 agcctgggca acaagagcac aacttcatct caaaaaaaat aaataaataa aataaaataa 49921 aagatagggt cttggtatgt tgcccaagct ggtctcgaac tcctgacttc aagctatcct 49981 cctgtgttgg cctctcaaag tgttgggatt acaggcatga gccacaacac ccagcccaaa 50041 aatatcttat ttgctttggg gatactcagg taagcctttc ttgttttact tgattttttt 50101 tttttttttt tgagacagag tctagctctg ttgcccagtc tggagtgcag tggcatgatc 50161 tcagctcact gcgacctctg cctcctggat tcaagtgatt ctcctggctc agcctcctaa 50221 gtagctggga ctacaggtgt gcaccatcat gcctggctaa ttttttttgt atttttagta 50281 gagatggggt ttcaccatgt tggccaggct ggtctcaaac tcttgacctc aggtgatctg 50341 cctgcctcag gttcccaaag tgctgggatt acaggcatgc accaccacgc ccagctaatt 50401 tttgtatttt taatagagat ggggtttcaa catgttggcc aagctggtct tgaactcctg 50461 acttcaagta atccacctgc tgcagcctcc caaagtgctg ggattacagg catgagccac 50521 cacccccggc ctgtttactt gtttactgct gtctccttcc tctggaattt cagttccagg 50581 aagacaggaa ccttgtctgt gtcactcact gttgtggccc cagtgcctgg agtggtgtct 50641 ggtacaatga atatttatcg aatgcatgaa aaaaccaccc aagtctcctt ttctccaagg 50701 ctagaggaaa gttcagatcc taggtgccca agcactgttg atcttgtacc tactcccttg 50761 cctgccaaga tatctctaaa agggacattt cttctgtcat ttctttaagg ccagggactg 50821 ggctctgagg tctgttgttt ggctcattta ggcatttcca taaagtcaga ggagccaacc 50881 ccagcatgtg cttaccacac tgtaaagaat gacttattta tataataagt atttgctccc 50941 ctctctaaac tgcgagttct gtgagatcag ggcctgcctc tgtcttatcc ccgctctgtc 51001 cccagggtct gggagggcat cctgtgcaca gtaggtccgt aacacatggt gctctgttgc 51061 ccaggctgga gtgcagtgat gcaatcatgg ctcaccgcag ccttcacctc ttgggctcaa 51121 acgatcctcc cacctcagcc tcccaagtag ctgggactac aggcatgcac caccaccaca 51181 cctggctagt tttttatttt ttttagtaga gatggggttt caccatgttg cccaggctag 51241 tcttgaactt ctggcctcaa gggatactcc tgctttggcc tctcagagtg ctgggattat 51301 aggcgtgaac caccacgccc agcacacaca gttcttaaat gagtaacaag gacctccttc 51361 cagccctcca ccctgacaac tagtggccaa actctccctc ttttcaggga agagcctctg 51421 ttacctgcat ctctagggtt ctcctcccca ccccttcctt cactccaggt ttctcttcct 51481 ctcccccatt ttgggggaac cctgagcacc ccttggaagg aaggtcttat ctaaagcagg 51541 gtccatgctc acgtcttcca cacatggcca ggttgagggg agagtctggg ggtggggtgg 51601 gggagaagca gcttggacag agaaaccccc tgcagcccac cccacgtgct tgcagtctcc 51661 tcggctcaag gctggctccg cagctgagct ggctggcccg agtgaaggtg ccagccagga 51721 aaccggcccc ctgtcacctt caccccactt gtccagaagt caccaaaggg ctggattcgg 51781 cccttggcaa tatttcattt ggctcacagt gttttaaaca tttagaatct gttgccaaca 51841 ttgaaaaata atggaaaaat tcatacgaaa gcaaaagcca ggtttctggc atctctggaa 51901 aaacatgaga tctgagcata gtgagtctgt gtcccctccc ggcaagagtc agctggcacc 51961 gagggcaggt atgcccacgc tggtgtgcag acgtccccac ccagcccact ttgcaatgag 52021 tttgagaccc tggcttgttc actgcgggat cccagagcct ccatggagcc tggcacacga 52081 gaggtgctag ataaatattt gccaaatgga tgaatgagtg actttgcctc catctgcttg 52141 gcttggccct atgccccacc cttaagggcc atgtagccgc ccagactggc cagtgcccag 52201 gcacagtctg ctgtgtcccc acagccggag ggctggcggg caggagaaaa ggatcccgct 52261 agctttcccc aggaaactcc ctgggcccct tctgtttctc cccatgtggc ctgagaggta 52321 tacaaaatcc gagcgactga cctgtaagct cggggaggac tggggcgctc gggagagggg 52381 tccgctctac acggaattct aaaatgaaac gtaaaacagc ttaggaagaa gcaggggagg 52441 ccccttggac atggcccagg gaggacaaat gcacccacag ggacactggg tgaggagaca 52501 ggccatgcac aggacccaag agctggggag ggggagagca cagttcccag aaggcagggg 52561 agggcaagga gacagacaca gaggagagac agagagaggg agcagagttg ccacctatac 52621 cctgggcacg gggcagggct ggagctacga ttccaggtgt tggcttgttt ccttcttttc 52681 tgtttctttt ctttttcttt ctttctctct tttttttttt ttttgaaatg gagtctcgct 52741 ctgtcgcccg gtctggagtg cagtggcgcg atctcggctc actgcaagct ccgccttccg 52801 ggttcacgcc attctcctgc ctcagcctcc caagaagctg agactacagg cgcccgccac 52861 cacgcccgga taattttttg tatttttagt agagacgggg tttcactgta ttagccagga 52921 tggtctcgat ctcctgacct cgtgatccgc ccgtcttggc ctcccaaagt gctgggatta 52981 caggcatgaa ccaccgcgcc tggccctttt tatttaaaaa aaaaaaaaaa aaaaagacag 53041 ggtctcactc tgctgcccag gctggaatgc agtggcatga acacagctca ctacacactc 53101 gacctcctgg gctcaagcga tcctcccacc tcagcctcct cagtagctgg gaccacaggc 53161 acacaacatc atgcctggct aacttgtaga gacaaggcct cactatgttg cccaggctga 53221 tctcaaactc atgggctcaa gccatcctcc catctcaggc tcttaaagtg ctgggattac 53281 aggcgtgagt caccatgtcc aggctcgggt gtctctaaaa ggcttacaat ttgactatta 53341 ccaaaaacat ctcccagaat cctcttcttc ttgtgtgtgt gtgtgtgtgt tgggggaagg 53401 cagcctgatg aaaagcgacg gccctagaat aggtagaata ggtcaggctt gggttcaagt 53461 cccagctctg ccacttccta gttggctcat cttagctttg tcattcttcc tctgagccta 53521 tgtcctcatc tctaaaatga ggataagcat ccttccccca tgagattatt gagagaatta 53581 ggcaacgtcc cgtacacaaa gttctccaag ctgaacagcc ctgaggcaga ggatgtaagt 53641 gcttaggggc cctccaaata aataagaccc ccaagtaaat aaataaataa atggccttgc 53701 aaaagtttgt atttcttatt tcaaagaaaa ctgcattgaa tagtcatcat gggaagtcca 53761 gaactgggct tcctgacatt tactttcagg tctagtgtta tctttatgtt ggcttacaaa 53821 tgagcgtggt aatgataatg cttttcaact ggttttcagc atttagggtc tcagcataat 53881 accttgtctg gacttgaaag agctcagcct agagcctggc cattataggt gtctgtgatg 53941 atgcaaccaa caatggccct cggagaggcc cccactacaa gcccaatgcc ctctctggtg 54001 gaagtggcaa ccctggagtc gggtggggag gagagacagg caggaggcag gcagaggatg 54061 gcctgggaga aaacggtgcc atgcagggcc acgctgggag cccctgtccc cagcttctgg 54121 ggaggggttt gggggggatg ggtggggaga ggccctcttt cctcagacat gcccttcctt 54181 ccaagcctcc agcaagcaga cccccagcca tgcaccccta gacaccaccc tgagaagtca 54241 gactgcccgg gaaccagggg catgctggca gcagagggat actgagcagg acttaccggg 54301 gaactggtag gtgtagccag gggcgaggcc tgtataactc cggctggcgt aggttgtggc 54361 ttggaaacct gggtaacctg atggggcaag ggggcagtgt cagatggctc atccacaggg 54421 cggatcctga tcatggagaa ggacctgagc cactcatgtc ccctcacctt cctttatcat 54481 gaataatagc tataccctcc aaataccttc ttcataccaa gccctgtgcc aagcacactg 54541 cacactatta tcccgttaaa ctcgccttag aggtaggtac tgttaggcac cccattttac 54601 agacaggaaa ctgaggctct gaaaggtgac actttttggc tgagaataat ccaatgctct 54661 ttgagctctg gctcctgctt ccctcctgtc ctcatctcct gcacactgct cctccctccc 54721 tctgctccag ccacattggc ttccttgctg tttttcagct ttgccaagct cattcccacc 54781 tcagggcctt tgcacatgct ggtgcctttg cctggagcac tctttccaga tgactacatg 54841 cctgagtccc tctcattgta caagactccg aaacatcttc ccataggtgt cctccttccc 54901 atcagctaca atttctatca tgtcaccttg gcttttctta gtactgatga tcattatctg 54961 aacttatata atgatttttt tttctttttt cttttttaga gacaggatct tgctcttttg 55021 cccaggctgg agtgcacagt ggtacaattg taggtcactg cagcctcaaa cttctgggct 55081 caagtgattc tcctgcctca gcctccctag tagctgtgac tacaaggtgt gtgctgctat 55141 gcctggctaa tttttaattt ttttgtagag atggggtctt gctgtattgc ccaggctggt 55201 ctcgaactac tagcctcaag tgatcctcct gccttggtct ggaattacaa gcatgagcca 55261 ccacacttgg ccagaaagta ctcatgattt ctttaagtgc tgattttctg tcttctcaaa 55321 gaaaaatcca atagttagca ataatggtgc aattaagatt aaaattcaga tctgacgggc 55381 tccaagcctc ttaaccaccg atctcactat ttgttagata atttcttttt ccactttttt 55441 ttttttgaga tggagtttca ctcttgttcc ccaggctgga gtgcaatggc atgatctcgg 55501 ctcaccgcca cctccgcctc ccaggttcaa gcaattctcc tgcctaagcc tcccgaggag 55561 ctgggaataa aggcatgtgc cactaccccc agctaatttt gtatttttag ttgagacagg 55621 gtttctccat gttggtcagg ctggtctcga actcccaacc tcaggtgatc cacccgcctc 55681 agcctcccaa agtgctggga ttacaggcgt gagccaaggt gcctaacttc tttttccacc 55741 tcttaaatga agggtgaaaa cagctcatga atcaccttag aataaaactt tctcaatgtt 55801 ctaaataaaa caaagtgaaa gagacaagac aactaactac aatacctgat tctagcctgg 55861 atcttattct gaaggacaaa aatattctat aaagagcatt actgggtctg gctggccatg 55921 gtggctcatg cctaccacca caacactttg ggaacctgag gtgagaagat tgcttgaggc 55981 caggagttcg agaccagctg ggcaacatag tgagaccccg tctctaccaa aaaaaaaaaa 56041 aaaaaaaaaa aaaagagcat tattgggtca attgataaaa ttggaaaata gaaagtagat 56101 tagataaaaa tattgtatca aggccgggcg cggtggctca cgcctgtaat cccagcactt 56161 tgggaggctg aggcgggcag atcacgaggt caggagatca agaccatcct ggctaacaca 56221 gtgaaacccc gtctctacta aaaatacaaa aaattagctg ggcacggtgg cgggcgcctg 56281 tagtcccagc tactcgggag actgaggcag gagaatggcg tgaacccagg aggcggagct 56341 tccagtgagc caagatcccg ccactgcagt ccggcctgcg aaagagtgag actccgtctc 56401 aaaaaaaaaa aaaaaaagaa agaaaaaaat attgtatcaa ttataaactt aggaagttga 56461 taactaaact gtggttacat gaaagaatat ccctaatctt aggaagcatc aaagggtaaa 56521 gggccacgat gcatgtaact agccctcaaa tgattcagtg aaaaatcaca agtctgtgtg 56581 tctatacatg tgtacataca tatttgcaaa cacacataca cacacatata tacacataga 56641 gtaaaactga tcaaccaaaa tgggggcaaa atgttaagaa caggtgaatc tgataacagg 56701 gtatatggat gttctttgta ctgttttatt tttgcagctc ctctgagttt gggattactt 56761 ccaaataaca tgttaaagaa aaaagaaaag cttcaaggaa gactgtgccc tatttggagg 56821 ctgctcgcct ccccgacttc tgaccctcta gcccatgccg tgccccctcc tctgaattcc 56881 cagagtacct ctctgtgtgc cactccccca tgcctggcat aaagaccccc ttgtactgtt 56941 ctctttctgt gtgtacacat ctcatctccc cagggagcct atgagcttcc tggggggcca 57001 aactcagatt taccaccctc agtcttggac ccaaagcaca gctgactcct gctcagcttt 57061 caaggctcat ggcagccttc acctcttcca ggaagtctcc tcaacctcca ggtgggttta 57121 ggggcctccc acagcctcca gtcactctgt agtggcagtg cccatttact cctgtctcca 57181 acttgaagga gagccccatg agagcaggct ctccttctca gaagttgctc aataaatatt 57241 acctgaatga ataagaccag gcagccttcc ttttgccccc tccaaaggcc tttggccttt 57301 gcaaatgctg ttccctccgc ctggaacact tttccccttc ccgtctctcc agcctcccca 57361 tcactggctc ctactccacc cgcagactca acttaaatgt aacttctttt cccacttctc 57421 caagtcacgg tcaagttccc tgtgaccaca ctccctcggc accctacact tctgcatctg 57481 aaatctattt tatcttctta cttgttttcc ctgccagatt gtaaagaaag tcattgttgc 57541 agttcctggc ataatgtctg gcatatagag ttgctcaata actgtgtact ggatggatga 57601 atggatagag gaatggacag atggagggat ggatggacga gagaatggac agattgacag 57661 gaccaatgaa tatggttgga cctttctgct ggctaccacc agagggcagc accccccaac 57721 caaaccaccc caccttgggt aggactgcca gacccgacag atcccttgtc cccactcctg 57781 ctcaatgact cagacaggag gagggcagac cacaggggag gtgcaacacc cacacagccc 57841 ctgctagtcc tgggagcagt tctggcagaa agaagccgca ggcgcaggct cgcgcatacc 57901 cagcatgccg atgcccagca tgaaggcgtc cattccgtag ggcatgactc gagacctccc 57961 ccgggctgag cccgttggcg acatcacctc ctttggctga gctttcttac attccacctg 58021 caatgagacc tggcggttag tctttcccac agagctagag tcattagccc tcttgttccc 58081 tggaaagccc ctttattaat tttattaatt tcactgtcca gctgaaaact atttcatggt 58141 taccactgat ggagcactca ctatgtgctg gggagtgtgt aaacggcttt gcatgcaatc 58201 gttcaatgaa tgcttacacc aatactacca atactatgat ggaggagctt ttccagtccc 58261 atttgacaga tgtggaaact gaggctcaga gaagggaggt catctgcctg attaactacc 58321 gtgtgacact gcctccctgt ctaggttttt aggggttgga agaactggct ctgcctccac 58381 tcactcgaag cccctctgcc tgcctttaga acctcatagc ttccctggca ggccccacaa 58441 ggagccccag tcttctctgt gcctgcgctc catccaatcc caactcacca tgtttttgtt 58501 tttgttttgt tttgtttgag acggagtctc actcactctg ttgcccaggc tgaagtgcag 58561 tggcacaatc tcagctcact gcaacctcca cctcccgggt tcaagcgatt ctcctgcctc 58621 agcctcccaa gtagctggga ttacaggcac gcaccaccat gcctggctaa tttttgtatt 58681 tttagtagag acagggtttc actgtgttgg ccaatctggt ctcaaactcc tgacctcatg 58741 taatctgccc acctcaacct ccccaaagtg cttggcttac aggcgtgagc cactgcgccc 58801 ggacccaact caccattttg ttgttgattt catgaaaatg aatttcacac actttctcca 58861 cgatgtcctc actctcaaac gtgacaaacc cgaaccctag aggttggaca aaggataaag 58921 gcaaggtcag aaccagagcg cagggtcaaa actcaacccc aggtcgtacc aatatccccg 58981 ctcctctctc tctggcgagt gagagttaat gaaaggctct gctcacagcc tggaatcctc 59041 tcaactatcc actgagacag ggtttcaagt cccagaatac agatgaggaa aataaggttc 59101 aggtaagtga agtaacttgc ccaaggtcac acagcaagtt agaaagtggt agagctgtga 59161 tgagaataca ctcaacatgc tttttctagt cctcaggtgt tttccaaaga gggaagaagg 59221 aacacacatg cacactcatt taaacttaca caaaggtgta catgtccaaa catacaaatg 59281 tacatagaca cacattcacg catgcataca tctagatacc tgcgtatcac gcagacacat 59341 gcagttaccc tgacatacca gatataaaca gatatttagg cacagatacc cccttttgca 59401 cagatacaaa tacacacaca cacagaacac ataccccact cccaaaatgg actaaaagtc 59461 ctgaaacaac tggagaaaca accctgtctc ctcctgggtg tgccccaggg actggaaagg 59521 gatttctcag aaggtaaggc cctcgggaga aaaacagcca ggtggcgcac acgcacacgg 59581 tgtggcccag atgctcacag gttcagaggc aacggggctg ggggctgggg gctgggcagg 59641 aggtgactgc ctggggctgt ggagtcagcc tgaccctggg cggcctggcc taagctctgg 59701 ttaccaggga caaggtaagc ccgggcagag gcagacacca ctagggtggg ggtaggagac 59761 tcagtgggtt gcagggactg ggggggtggg tactcctctg ttcgtctcgt gtgtctgcaa 59821 ttcggcaagt ttctagcaca ggaaatggtg ggacaaaagc cttctgtaag gccaggcctg 59881 ggagaggacg cagatcctgg gggataggat gtcagaaggc gagatgaaag ggcagctgtg 59941 acctgcagcc ccctggctga ccacggggcc cagcctggga agggggaggg ggccacactc 60001 acctcggtgc cggttggtgg ttttgtcaaa catcagcatg gcgtcgtcca cctgaaacac 60061 agcccgccat ggaggaccca gcagatacca gcggaaccca ctaccaccaa ggaacagggg 60121 ctctggcaac ccactgccct cttgcccact ccaggcttga ccttcaacct gctccctgca 60181 caggggacaa ctttccttct gcaaccccac tgaaaaccct tttcaatcac catcatctca 60241 atgcaccagc ctgcatttat taagctccat tgtcctccca acattccctg catggtaaga 60301 tccccatggt gtagataaga aagtaggctc agagaggtga agttgcctgc ccaagatcac 60361 acagcaagac cttgaccaaa ctggggaggc acagagaaga tgctagagaa ggaaatatgg 60421 gagggaggaa aatgacaaga tggggaagca aaagcctcat ccctttctgt aaaaggccag 60481 gaggacagca accagactgg actcccctct ccggctacag atcttttcct ctaattgagg 60541 tttccttttg ctagaggctg taattaaagc aagtagagtt cagtgcctgg gctgctttgc 60601 ctcccactga ctgggggaag ggaaggagac tgggaaaggg ggaggatcta catcatccat 60661 cctctcttgg accccagtct actcttgtta cttcctctag gtaccccttg gatatacggt 60721 caccaaaggc cttggtgcca ccttccattt gcctgcatct ggtgcctcct ccttgccctc 60781 tttcttagtt aaggtctgag tcacctttgg tataaatcag gggctgcctc tgatacccaa 60841 cccaatccta ctccaggccc caaccctccc ccaaatgacc agacatccaa aggattaaaa 60901 aaaaactttt ctcaagccat caagccatgg gtaaccagcc tcttccctac tctccttccc 60961 tctacttcca ggcccagcat ccaccacctt cttttttttt ggagacagag tcttactctg 61021 tcacccaggc tggagttctg tggcatgatc tcggctaact gcagcccccg cctcccaggt 61081 ttaaggaatt ctcctgcctc agcctcctga gtagctggga ttacaggtgc atgccaccac 61141 tcctggctaa tttttgtatt ttagtagaga tggggtttca ccatgttggc caggctggtc 61201 tcgaactcct ggcctcaagt gatccacccg cctcagcctc ccaaagtgct gggattacag 61261 gcgtgagcca ccaggcccga ccccatcatc taccttctaa aggccctccc ctgcctcatt 61321 aaacaccctg aactaccgtg atcctccagc tctccttagg catcaagcta cagtgggcct 61381 ctgactccca ctgggagaac caggacccca ctgagccagg aggcaacaag gtccctggag 61441 aatctcatga catctctctt ctcaaagagc tgcctgtact ggggcccacg caacagagtg 61501 gcactctcac tcccatccta tggacccact ctgagtccca atcccaaatc agctggacca 61561 cccgaatggg acaatcattt caggttcagc acatcttgcc cacccctcac ttaaccaatg 61621 aaaaacagaa acccagaaag aaggattgtg gtcatagcac aggctttggg taggacgggc 61681 ctgagctgga atcctggatc ctgcatttgc tggttgggta accttgggca ggtgacttga 61741 tgtttctctg gctttggttc ctcatctgtg aaataggcat aacagtgtct atctcacagg 61801 ttgaagttaa atgaaagaca cctttttttt tttttggaaa agggttctcg ctatattgcc 61861 caggctagtc ttgaactccc aggctcaagc aatcctccca tctcaccctc cttggtagct 61921 gggactacag gtgtgcacca ccaagcctgg ctatgagagg atttagtgag atagtgcatg 61981 taacactctt aactcacaga cccaggatat agtaagtgcc taataaataa tagctattat 62041 tattatcttt acaagggagg gacttgccca tagttacacg gggggaacgt ggcaaagctg 62101 gggagagagc agccagatac taatggcttg aaggttgcat gaggacatcg tctgtgtcat 62161 ttagggccat gaccaccatc ctattgagca ccttgactgg aggtcaatca ttttttgaat 62221 gaatgaatga atgaatgaac gaaccagcca caccctcccc cctaatccta cagctccgag 62281 tccctgcctg acccacaaat ggattcagcc caaatccaca acttctccca ccttcctcca 62341 cccaaccctg caaggaaggc caccatccct ggccccacta cacccttcaa cctgggcctt 62401 ggctggagag gcccctagtg ggacggtggg gttgaggcca gggcaggggg caaggaagat 62461 ggctgggggt ccgggttagc cccaccggtg tcccctcctg gggcgagctg aggggaggga 62521 ggctcccggc ccagtgtcct taataggctt tggtctccat gggaacccga gggcaggggg 62581 gatcgggcgg ggggagggga ggcggccgag gcctgctgac cagtaggagc cgagccgcca 62641 agttcggcgg ccgctaagcc tgagtggcag ccatggcggc tgaccattgt tcccccctcg 62701 gcggcggccc ctcgatccgg gcggggggcc ccccgccgcg gggcccttgg cattcccggc 62761 tgtccccctc gctccctggc agcctattct ccggctcccc ctggcctccc cgggccggtt 62821 tcacatgcca acgccgattt aggcccgaac aaaagacgcg gccgctgacg gcttctcctg 62881 gggcccggtt gccatggcaa cgccggcccc gggggcaggc ggcctccaat cacagctgct 62941 ggatcagatt agtgcgagct ctaatgctcg gcgctcagac acaaagacac cccgccgcag 63001 ccgagccggg cctgctgctt cccaggagcg ccctaccctc tggggcccgg gcccgcccga 63061 cctgccctct gtcccggacc caggtgccca ccgaggcacg aaagccgagg cctctggtcc 63121 cccacaccac cagcctcagc tggccagggg acacctcccc cccccacaca atcccaggcc 63181 catttcccag gccacgtccc ccttcgcttg cctgggggtg actcgaagtg tcttgatggg 63241 aaagtagaga cccagcctct cgcctgtctc acctaaaaga agaaagacca gggaccccgg 63301 ccccaagcaa aacaaccctc tttccctcct agacttcaaa cgtaggaccc tctctgccat 63361 ctccccatgg gactccaaag ttcacattgt ccacctttca gaggacaccc taatgtcgga 63421 tgtcgccaca acatctacct cttcagaggc ccagaaatta aagctccatg ccccttacag 63481 ctcacagcca atcctctggg cctgatcagg aattggggga tgctctcatc agttcccccc 63541 aatacccagg ggaccagcag ccccccagct ctggctccaa gctccccacc tgtgcaaata 63601 gggtgccgag tgcaattggg agtcatattt attgcctcct tatcatcctc gcctggaagg 63661 agtggggaaa ggttcaatag cctgtggcct ggacaggtgc aaactcccag gaggccacgg 63721 gctgacttga ttgaactggc tgtgggagca ctgcctccct tgtgagaatg aagataagta 63781 agagaaaaat acagtggaag ggcaacaaca gggccagcta ctgtgtggct tctctgtccc 63841 gagggatatc ctgagtctga agctcacaga tgagccagga ggctgcaagg tggattggcc 63901 tggtctgccc ccacagcagg atccagcccc actctcatct tttgcaggaa tggaatgacc 63961 tggctggatt cggcttgcta gtgccccatt ctacagtgtg ttcacagccc cagcagcaag 64021 cgcccccagt ttaaacccac cttcccaaac tgctcaaaat attgcttcac gtcctccacc 64081 gtggtgttca ccgacagccc ccccacaaag atcttcttcg ttcgagtcac catctgttag 64141 gggagggaat gagaaagtgg gcatctgagt cctgtggcct accgccacca gcaggggctc 64201 gagcccccct gccaggatgc cagctgacaa gctctctgtg gccccagcct ctttaaaggc 64261 cccccccccg caagctccct cttggacccc cgcctctagg cctgtcatgg gcagctgtgc 64321 acccacaccc tggctggctc gagtgtgcct ccttcctgcc tccctcccta gggactatgt 64381 ccagaaaatg ccttccctaa ggactaacac agggaagggg gcgggccagt gacccagacc 64441 cacagcctaa cactaagacc tcactggccc tcctgggaga ccccccaagg tccatcacag 64501 agtctccctg aactcagtgc cgacattttc ctcatcagcc tcctcccaac acaacctctg 64561 gcagccctgt ggccccagcc ctgttttagg aagaacacat ggtcctaatt accccaggag 64621 cgcatggcta atccgcggca attcccagcc ccatcctaac actaaagcac cttctgtcac 64681 caaactgctt aatggaatta acccagttcc aaccatgtgc tccctggtgc ggcttccttc 64741 gaatacctgc tccataattg ctttcaggcc ctctactgga cccttccaga gatgcccctt 64801 tcctgtctcc caaaactctt cctctccttc agccctggca ggagttgggg gacttgatgg 64861 aattcactgg cccggaatct taaagagctc tagttccacc tggcaaagga ggaaaccgaa 64921 ctctcctaga gagggacctc gtcctaggac ccatgataag tcagtaaact caggggtcct 64981 gaccccatgt tccgcaatca gtctctcctt ctgtcctcta tattcaattg cttcaaaagg 65041 aagacctggc tagttgccct gaggcctaga atagactctt aggcccctgc cctgcaccaa 65101 aagttttttg agtgttgtat cactcaactg ctcagaaatc cattttctgc gcaccttgac 65161 cctctcactg ggagaactcg ccagagcagc agggtaagag agcattgaga aagcagccac 65221 agacatctaa actgaaagac aacaggtgac agagtctgtg gtcccctctg cctctgcaaa 65281 gggtgactgg ggggatccaa gggtagctcc ctagcctgcc tgtagcgcgg ccttagggtc 65341 taagtctgtc ctgtatttca cttccagagg cctggacatc aaggctgtgg tctatgatgt 65401 tggaattaga ctcaggctcc aaactctact gggttcaaat acaggacgat gaatcctctg 65461 agtagctccc ctttctccca acacacacaa gcccctctcc agtccacccc agagctcacg 65521 tggaaagaca tatagtgtct tcaggccaca gagcaaggac gtcaagaatc ccctcttccc 65581 agcccagccc attccacaaa cactccaagc agaggttcca gaagcatctt ccagagcagg 65641 aaatactggg aaaatcactg cccagcattg cccatagtaa gagagctcct cccagacata 65701 gacacaccta ccttgggctg tgctcgccga gggaaggcca ccttagggtc aatctacaag 65761 aaaagggaga ggtagaaggg gtcttggtta tcgcattttt agaagagcga ccacacagcc 65821 ttcagccctc aggacctctc tcctcccagg acacaaaggt gacagaaatc acaaaagcct 65881 ctaggaggag agaatagtgg ttaagcattt atttttattt tttcttcaga cagagtctca 65941 ctctatcacc caggctggag tgcagtggca cgatctcagc tcactgcaac ctctgcctct 66001 cgggttcaag tgattctcct gcctcagcct ccctagtagc tgggattaca ggtgcgcacc 66061 accacaccca gctaattttt gtatttttag tagagacggg gtttcaccat gttggccaga 66121 ctggtttcaa actcctgacc tcaggtgagc cgcccgcctc agccttccaa agtcccggga 66181 ttacaggcac gagtcaccac gcccagccag ttaagcattt agaccctaac gtcccactgc 66241 tgggttcaaa tcccatttgc actgcttccc agctgtgaga ccatggacaa gttatttccc 66301 ctctttgaac gtcagggtca agttcacctg caaaatgggc aaataataac acccacctca 66361 aagggtcatc agcagattat tggagataat gcatacaaag catttggcat ggtgtctagc 66421 tcacagcaaa aacaaaaaca aaaaaaatgt aagccgtctt attccttcaa gtatcaggta 66481 aatcgagggc cttccttgcg taactgactt cacacagatg acctatctag gtaaggtatg 66541 caattcaggc tttggaaacc ctgcaaatgg gaatactctt tgtctgcaaa aggctacaag 66601 ctgagagttc ataaccatca ggagtaggct cagttctgtt ttcgcagtac tatcaacatc 66661 aaatagctct ccttgaggct gtgagcaaac agctccttga taatgcagct aggcttgggg 66721 caaggcagag tgcagtcttc ggtgacgtga aagggcatgg gacttgacct agaaactctg 66781 ggcctcttgc caagcctgtg atcttggtaa tgcactcgaa ctctgtgagc ctcagttcct 66841 tcatctgcaa agtggggaca atcacacccg cttttcccag gggcagctgc ggggaatcag 66901 atgagataac gcgggtgaaa gcggcccttt gtgaactgcc cttcattgca caaacaaagg 66961 tgctattacc agcaggccac ctttgccctt ggctgtgaac aaccagaggg aaaaatcaag 67021 atgcatctgg gtgtactcct gggtactata gctgtacgac aaacagcaaa agacagggca 67081 tgagacgctg tgatccatgc agggcctaca gggccaggtc aggccagagc tttttgccac 67141 ccacatgggt ggacgtcatt cataagttca ccagccaatc gcagggaaga tctatctttt 67201 cccaaagttc cttctgttca aaactcagcc tcgttctagg cagctggctc tggcttctct 67261 ctgaatggca cagaccagga actagcctcc tacaggctcc cccactggaa tctgtccaca 67321 gagatggaac ctccaaaatc cacactgggg caatgcctcc tctcctgcca agtcccactg 67381 ggctccaggg aaaaatcaca actgcccatt tctcccgcct gaagtttcaa ttccacacct 67441 ccacggacat gtccatcctt ctctttctcc cagaagcagc cgccccagcc accagcgcac 67501 cacacacagt ccccctacac agaccccgat ggggagatgg tacccaagcc acgatcccac 67561 agacccacgc acaccccatc cctgacacac cccacctcca ctgtgactca cagctggcaa 67621 cataaataaa acccaacaag taaataaaag gtgaaagtca gagctcaaaa tagactcccc 67681 acagtaaacc tccctcccac cccctttccc aggctggagg acgccaaaca cagacaaagc 67741 cctggccctt ttcagcccca tctcctaagg atgctcatct caccctcctt cctccctcat 67801 tctcttcccc tttatcaatc ttccactcct ccatccccca gcccctaaat ctcagcttcc 67861 tcagcactgc agaagccatc agtgcattct gaggacaggg gatggtagca gaacaggcag 67921 cttgatggca gataatcacc aaaaacaata ggccactgtc tctccatttc caggcttggg 67981 tcactcaggc tgcgccaaga agatcctttc agagccggca aagttgaggc ctggcctcat 68041 ttgctcatta ctgcgagcgg gagcgggagc cactgggcag agataacagg agctggggac 68101 aggcggtaat ttgacaaatt cttgggcaca agcagctatt gttgcaggta aacgggagcc 68161 agggactgac ccaagcagag aggcccaagg ctgggggaca gcagggaagc atggagagaa 68221 gagatccaca ctatccctgc ccccagcaga cagacacccc catcacatgg gagtaacccc 68281 ctcctctgac ttttctactt ttccatctgc cctggagcca gcgccccctg cagggtctaa 68341 gacacgaagg agctgtggga caacagcaaa atgttccatg agtttgaatg ccaaccaagc 68401 agcttctcct aagatctagt cttccgaagg gtcgtttcca ggcaagctag agcctttctc 68461 ataaatcccc tccgtgggca aagcaggtcc ctgtcttcca cacaaagcaa ggagctcatt 68521 gaccaaaagt cgggggagga gggaaggaga gagggagggg gcttcaaggg atgctttgcc 68581 attggtctgg atgaagaggg ggtgtcgcag agccaaatac tagttagagt tcacagaagc 68641 caccgggtgg gtcagtgcta ccctgactca cccaatttgg gctgggggag ggacctagcc 68701 ccaccccaag ctctgctctg aggcatcatg attagtactt ctagggagag ggtggcggct 68761 gcctctcccc tcaccaggac taggggaggg gggatgtagc atctcatggc aggccaggga 68821 tttttctcct ccctctcccc cccaactcta gtgacaaagt ttcagagcat ggcagccctc 68881 ttactcagaa aaactccctc cttcctcatg gaccccgtcc agctgtcccc tggggagagt 68941 cctgaccctc tcctccggca gcctccctct cccaaaggac ccccgaagcc ccgagggcca 69001 ctcactgttt tggagtcgag ctcgtgccgc gattgcgcca gcactttatc cacccccgcc 69061 tggtccatga aagtgacgaa gccgaaaccc ctgcgcgccg tagtgaggga gaggcagatg 69121 gttacaaggc agtgagtggc gggtggaggg gggcgagccg ggagcaggag gagggggtga 69181 ggggctcacc tggatctctt ggtcaggggg tcccgcatca ccagacactc cttcacctcc 69241 ccgaactggc cgaagtattc gcgcagccct tctgtaacca cacacccgcc ttcggaccag 69301 cccgggcccc gcgcccttcc cccccccccg tcctttgccc ccggtgaccc cggagcggcc 69361 cggccgcccc cgcgccaagc tgcccgcgcg ttctccactg ccgccgcccc ccaccgccct 69421 cgccccgttc ccgctgagcc tcctggcgcc caccggggcc ccgggaagcc gagggccgag 69481 ctgggctgga agggggacgg ctccggccgg gttcccgccg ctccgggagc agcctcacaa 69541 aagtttgagc cgcaggtgcg agcggagttg gcgctgccgc cggcgggtcc cgggggccca 69601 gcccaccccc cgataccccc tgaacccctc atctgctccc ctccacccgc tgggccgggt 69661 gcgctcgcgg atcgccctgc gctctcgggg tctccgggcg gggcgcgaaa gagggcgcga 69721 gggcgcccgg ggtcagcagg gcgcagggcc gggctggggt gtccgggtcc ggggcgccgg 69781 ggggtccggg gtgccctgcc ggaccggcgg gcgctcccgg gctcgctcac cctgcgtagt 69841 ctgccaactg agtcccccga tgaacatctt gctgcgggag gaggagagac acaaagggcc 69901 cgcgtgagcg ccgggcgcca gggcgcaggg ggcgcgggcc cgggctccgg ggaggcccgg 69961 ccggacccgg atcggccatg ttggcggggc cggggcgggc gcgggccaag caggccgggc 70021 cgggccgtac caggggtcgt gcggcgagtc cggggaggcg aggccgggct ggggcgcgtc 70081 agtctccatc gggagccgcg ggcggcgcgg gcagcggagc ggcggcggcg gcggcggcgg 70141 cggcgctcgg cgcggggcag atgaggagcg cggcgaaggg ggccggacgg acaggccatg 70201 ctgccccctc ccccgacccc gctcgggcgg gcgggcgggg acggccgagg ggagggcccg 70261 ccgggggccg acctgccggc tcctcccccc gccgccctgc gcgcataaag ccccgcgctc 70321 gcccgcgagc ccgggcgccc gcggaggcgg cggcgccggg acccccttcc cgcggggctc 70381 ctgggtctcc ggggccgagc gagacccccg aatcccggtg cgggccggga ccagggcgcg 70441 gccgcacccc ccacccaccc cggcgcgggg gcccggccgc gggaacgccc tcccggaatg 70501 aagcgcttcc cggtgccttc aaggtagctc ggttccggat cgggaacccc cccggtcttc 70561 cctctcaggt ccctgctgtc cagaattgaa gagaacccca atctcctcgg gtcgggcggg 70621 agatagccct agtgctcccc agtccccaca tgctccattt ccccagtcgg agcgagaagc 70681 cgccctcaga gcccccagtt tcctcccaac ctaatttaca gccccccacc cccaccccag 70741 ggctctaatt tagagccctc gctccgttcc caggatccca tatccagttt gggggaccca 70801 ctcagtgcct cgcgtcccga gtggggaacc tccatcagca ccccagtaga aaccgtatcc 70861 tcttccccaa tccgtcctca gcccttggtg tccaaagtga gagtccccct tagtttcccc 70921 aaatccctgc cgtccacatt agtggaccct atggtctgtt gagaaaatcc cgcgccccaa 70981 gcccgggttc tcagtttgcc gccaccccca cctcccggtg tgaggagtct cccctggttc 71041 tctccctccc tggccggttt ttgtgagatc cattcagact cctgtcctga ttcgggaaac 71101 ccggttccag acactcacca cccccaccac accgaggctg gaggaaaagg cagtgcctct 71161 tagagcctcc tgtttccagt ttaggagccc aacgcctcag ctgtgcaccg tcccctccga 71221 atgaccccct tcttttgcag aggtctaagc tccaccccat cccattggaa gagcctaaac 71281 tacctaccca ctgcatttgg aatgacttcc ctcagtccct cttattttga ggctactccc 71341 cgtgaaggca aacaccttcc tttctacccc catcttctgg gtctctggga tcccccagta 71401 cttgcaccca aggctccttg gagtcacctt acgtacccct aaaattccct ccctccagcc 71461 tgggaacaac cctgagaaag tattctggtt caccagggga gctgcggggc cactctcctt 71521 atccccacct ctccttatcc ccgcctctcc atatccccac ctctccatat tcccacctct 71581 ccatatccaa gccttaggac ccttcacttc cccagctgaa accagccaaa aaaggatcaa 71641 accaggaagc ttaggtggcc aacacccctt ctccccatcc ctgctggggg tctcagtgcc 71701 acacaagccc ctccctccca actcaccttc tactccttcc cccatcccta ccttaatcca 71761 tccggtgtaa tccagtgaca acccatgttg acaagccctt tgctgttcac cttcacattc 71821 catcctgaat tcaacacctt ttattcctgg gcatcataac tctgacctct ggaagggtgc 71881 ccatccccct cttgatagcc ctccttccct ctccccaccc caaccatcag gcagagaaga 71941 aagtgagaga cttacagtaa tggtttcctt ggggaccttc tgggaatgct cagcaggtat 72001 atcctgccac cctgtgagca taacactaga aacaaacttg ctttttcctg tttgtgtccc 72061 ctactctgcc ccacagtttc tcagcatccc ctggactggg gactggaggg ccacccacac 72121 aaggcatttc ttaccattcc gacacaaaag cattccagtt tgtgttgtag cagagacaca 72181 gatgagggtg agcaggtagc ctctgcggta ttagggaggt gactcttctt aggtctgaac 72241 ttaaaaaggg tcattggaga ggtgcgggga gtagggatct tccagtgttc ctggactttg 72301 gtttgtgtgc tggtgctgct gggcagatga gcttgaagga taggaaatgg gtcctttccg 72361 gcagtggcga cctacccacg agaacatgcc tctcgacagg gatctccttc atccttctcc 72421 agaagagaag aggaaacaga agaagaaacg cctggtgcag agtcccgatt cctacttcat 72481 ggatatgaca tgcccaggat gctataaaat caccatggtc tgtagccatg cacaaacggt 72541 agttttgtgt gttgcctgct ccactgttct ctgccagcct acaggaggaa aaggaaggct 72601 acagaaggat gttccttcag gaggaagcag cacccaaagc actctgaatc aagatgagtg 72661 ggaaaccatc tcaataaaca cattttggat taaaaaaaaa aaaggaaatg ggtaaaggca 72721 tctgttcaga gtcagccaat gctgtgtaag aagtagcttg agataggatc catgtattag 72781 catattcatt cagaatctga ggaggaagcg tctacctctt ttggtcagta gcttgtttgt 72841 atcacagtaa catgggttag acttagacaa atcctctttc agtgcaggtg tatctcagct 72901 gcagcagctt cagtccactt tgttcacaat aggggcgaca agcccttcta gaggtagccc 72961 tcacaggaca atgcagttgt aaactcatga tcctcatagt aaattgagga tctcagtggt 73021 attaaggaaa aatgatcttt agagcaacag gcccagactt cagaattaag agattagagg 73081 atgtggttcc agctcttttt ctctggtact acacacacag aacatcactc aaagaataaa 73141 agaagaattt gctttatagt ttttaaaaaa tcctcttgag aaagacagtg ggcaaatgat 73201 attatgcaac ttggcacagg tatttaccga acatcttcta tgtactagac actgtcctag 73261 gtgttgtagt cccagcagct gcctttgtaa agcttgcatt ctagagatga gagtcataca 73321 agtaaatgaa tgaatgaatg agtcaatgaa tgaatgagtg aatgaatgaa taaacaatag 73381 ataaataaat ggtgtgttca gttagatgtt gtaggtgcca cagagaagga agggtgatag 73441 ggagggtggg tactattggg aagaaggtga gatgaaattt tatatatata tatatttttg 73501 agatggagtc ttgctctgtc acccaggctg cagtgcagtg gcatgatctc ggctcactgc 73561 aatctctgcc tcttggattc aagtgattct cctgcctcag cctccccagt agctggggtt 73621 acaggcatcc accatcactc ctgactaatt ttttgtattt ttagtagaga tggggtttcg 73681 ccgtgttggc caggctggtc ttgaacgcct gacctcaggt gatctgccca cctctgccct 73741 tgaaagtgtt gggattacag gcgtgagcca ccacacctgg cctgagatgc aattttaaat 73801 caacaacatt ctcatgggga gggagacgtt ttagcaaaga cttgaaggag atgaggtaaa 73861 tgagtctact ctatgataca gaactctcat ttaagcagaa tgtctgcatc tcttaagtac 73921 aatgataatt aacgttcata ttttgacttt cactactcta aaactgtagg tcaagtggac 73981 taggggtgga tgggggctca gaaaatggca gtttttgaac agacacctct atcagatgag 74041 aaggctgaga gctgagggtt ttccttggcc actcttggag aaacacagct ttaagatata 74101 ttgaagagcc tttgagaatg acaacaacaa aattaaagat taaattatag tccatgtggt 74161 tttagcgtgc agagtatttc acagtgttct cattctttga aaatgcaata ttggccgggc 74221 gcagtgattc acgcctgtaa tcccagcact ttgggaggcc gaggtgggta gatcacgagg 74281 tcaggagttc gagaccacct ggccaacatg gtgaaacccg gtctctacta aaaatacaaa 74341 aattagcctg gcgtggtggc gcacgcctgt aatcccagct actggggaga ctgaggcagg 74401 agaattgctt gaactcagga gacggaggtt gaagtgagct aagattgcac cactgcactc 74461 cagcctgggt gacagaatga gactccgtct cagaaaaaaa aagaacaaaa gaaaatgcaa 74521 tatccaagca aagccctgaa ttacctggaa cttcccattt cttgtccatg caagacaagg 74581 aaaaatttac tgtaagctga ttaagagcca tgcatttata atcagcccat tttacaaact 74641 tcccccactt aaaaaaatat atatatatac ctatatatat atactttatt ttttctttct 74701 ttttttgagg cagggtctca ctctgttgcc caggctggag cgcggtggca cgatgagggt 74761 tcactgtagc ctcaacctcc tgggctcaag cagtcttccc aatgcagcct cccaaatagc 74821 tgggactaca catgcacgcc accatgccca gctaattttt gtttcgtttt gttttgcaga 74881 acggggtcta actatgttgc ccaggctggt ctcaagctcc tgcactcaag caatcctctg 74941 ccaccacggc ctcccaaagt gctgggatta caaatataag ccactatgcc cagcctaaaa 75001 agtagacttt atttttattg aggagaaagt acagagattt ccccataccg cctgacccca 75061 catgtgtgta acctttccca ctgtcaacat cccccaccag agtggtacat ttgttacaat 75121 tggtgaacct atactgacac attgttatca cccagagtcc atagtttaga gttcactctt 75181 ggttttatac attctatggg tttgggcaaa tgtataagga catgtatctc ccattatact 75241 gtcatacaga gtagtttcac tgtcctaaac attctctgtg ctccacctat ttgttctccc 75301 ctcccccaac ccctgacaac ccctgatctt tcactctcca tagttttctc ttttcaagaa 75361 tgtcatatag gccggacgcg gtggctcacg cctgtaatcc cagtactttg ggaggccgag 75421 gcaggcggat cacgaggtca ggagatcgag cccatcctgg ctaacatgat gaaaccccgt 75481 ctctactaaa aatacaaaaa attagctggg catggtagcg ggcgctttta gttccagcta 75541 ctcgggaggc tgaggcagga gaatggcgtg aacccaggag gcagagcttg cagtgagcca 75601 agatcgcccc gttgcgctcc agcctgggca acagagcaag acttcgtctc aaaaaaaaaa 75661 gaacgtcgta tagcttgaat catacagtag tagctgtttc agattggctt ctttcactta 75721 gtaacacaca gttaagattc ctctatggtg gctttttttg tgtgtgggtt ttattttttt 75781 gtttcctggg tttttttgct cgtttgtttc gtttgtttgt ttgtttgttt gagacaaggt 75841 ttcactctgt cgcccaggct agagtgcagt ggcatgttca ggctcactgc agcctcaacc 75901 tctcaggctc aagcaatcct cccacctcag cctccccaac agctgggact acagaatagc 75961 tgggactaca ggcacacacc accgcccctg gctaattttt gtattttttg tagagacagg 76021 gttgcaccat gttgcccagg ctggtctgga actcctagct tcaagtgatc cacctgcctc 76081 agcctctcaa agtgctggga ttacaggtgt gagccacctc gcctgacccc gtctatgtct 76141 ttccatagct tgatagcttg tctgttttta gtgctgcata atagtctgtg ctctagatgt 76201 accacggttt atgcatccct tcacctcctg aaagacatct tggttgcctt caagttctgg 76261 caattatgat taaagctcag gtttttgtgt ggacataggt cttcagtccc tcactttttt 76321 gtcctaattt ttctctcttt ttttttcctt ctctcagatt tctggactca accagaaggc 76381 tatgagctgg agtaatgaaa gcacttgact gagagtccaa agatccatat gctggtcttg 76441 ggactaccac tgtttctggt cgggtggttc tagggataca ccacttagcc ttctgagctt 76501 ccatttactt tcggagttgc tgcaagactt aagtgagatc atgtatttga aagcacctta 76561 gacactctaa agcactatgg gaaagtaaag gtagtccatc aaaagcacta tgggggtgcc 76621 gggtgtgctg actcacgcct gtaatcctag cattttggga ggctgaggtg gacagatcac 76681 ttgaggttag gagtttgaga ccagcctggc caacatggca aaaccccgtc tctacgaaaa 76741 atacaaaaat tgaccaggcg tggtggcaca tgcctgtaat cccagctact ggggaggctg 76801 aggtgggaga atcacttgaa cccaggaggc agaggctgca gcgagccgag atcctgccac 76861 tgcactcccg cctgggcaac agagtcagac tctgagcaat tattgtgtct caggcacttt 76921 gcgttgggaa actcattcaa tccttgcaac agctctctga ggcaggtcgc tattttacag 76981 atcaggaaat gaaagcattg agaggtcaac ttgctaataa agataagacg ggtggtaagt 77041 gagggggctg agatttgaat tcaagtctga taccagaatt cccatcatgc acttcatcat 77101 gaacataatt ttatgttatc tacctattta atgcacacac aatgcaccta ttattacaga 77161 tggttctgga atgtctgtta tgctggaatg ggagctctct aagggaaagt cccttaaacc 77221 tctttgcatt ctcctccacc accccttatc ctcaggacca taccttacac taagtactca 77281 atagttggtt tttgtttgtt tgtttgtttg tttttgagac agggtctcac tctgtcactc 77341 aggctggaat gcagtggcat gatctcggct cactgcaacc tctgcctcct gggctcaaag 77401 gatcctccca cctcagcccc ccaaatagct gcgactacag gctcaagcca ccacactcag 77461 ctagtttttg tattttttat agagacagag ttttgctatg ttgcccaggc tggtctcgaa 77521 ttcctgagct caagtgatct gcctgcctca gcctccccag gtgctgggat tacaggcatg 77581 agtcaccatg cccggcctca ctagttgtgc ttgcctagaa accatcactg tgctagggct 77641 ggaggatgag atggtgaatg agacagacag tgattcctac tttcacagat gtgacaatct 77701 ggagacacag tagttttaca tacatctttt ttgcctaggc tggtcttgaa ctcctgagct 77761 caagcgatat tcccgcctca gcctcccaga gtgctgggat tacatgcatg caccaccacg 77821 cctggctgat acacatcttt ttaaccaagt catttgaatc actgcaataa ttaaatgtag 77881 taggttctat taatgtgctt gccttacaga tgacgaagct gaaactcaaa catctgaagt 77941 gacatgccca aggtcatgca gctggccagt ctaaaaatct agctctacct gacccaaaca 78001 ttatgctaca ctctgtctcc attccccacc ccccgacttt ccccactgcc ctggcaggga 78061 gggaattaaa taaactattc tgcttgccca gctgagaaca cccacagtct gtccagacct 78121 gctgactctc ccactttctc actttctcac tgtcgcttcc aatgacagcc agggctggtt 78181 tgatgtgtcg gtgaaagggg ggatgcttca gagttctatg gtttcagcat ctgaagtcac 78241 ctctctgctg taattaactc tgtgatttaa tctgctcaac taatcattct acaattcagg 78301 ggccaaagac aaagatgtta agaggatgcc tcctttatgt tttattgttt ttaatgttta 78361 ttttgggaaa ctgaaactgt ggacctggag aagtgaaacc gcaggtctgt cagtgctgac 78421 ggtgggaaag gggtgccagg ctgcaggcag gggcttctct tacacttagg ccagacctaa 78481 aggtgaaggg ctggctagag gggaggttct catctctgag ccccatctgg agaacaaagc 78541 accctcccta gagcccaggt tttcagggac cagggaaata gtttttggag tttgtgaact 78601 tagtggaaaa agtattaggg agttaggagc ctggggctcc acctctgacc ttgaaacagg 78661 ctcttcccct ctctgagcgc tagtttattc atgtgcaaaa tggaaagtgg catattattc 78721 cagagatgga aaattggttt cagtttttct acttggcctc tcaaattacc tccaaaatga 78781 ctggtagtgg ctagggctgg gcaaggatag aaatcgggtg attgattagt gatgtctgcc 78841 ttggtgcagg tagtcatgga acagtagaag gtgtgctata gcgttgcctt ctctggacta 78901 gatgatcact acatcctttc cagttttggt cttctagaga ccttacaaca agaagggcaa 78961 ggattaagag ctacgcagca cccccaggag tcagacaacc aagggaggaa ccatcaaagt 79021 gacaatttta ttaataatga taatacatga ttctatatgt ggaggaaaag aatttacatc 79081 aaaccataag ccatttgcat caaaccatta atgtggtggg gaggcgtgtg tgttagtatg 79141 agtatgaatt ttttctgtga gtatagattt atcttaacaa ccagaaaaca gtaaaaactg 79201 tttaacgtgg tggggaggcg tgtgtgttag tatgagtatg aattttttct gtgagtatgg 79261 atttatctta acaaccagaa aacagtaaaa ataatttaaa tgagtctatt taaagactgt 79321 tcaagaacac tgatagcagc attattcata ataagcagaa attggaagta atccaaacgt 79381 ccatcaacag gagaatggat aaactatgat atattcgtac aatggaatac tatgtaaaat 79441 gaaaagggcc gggcgcccaa cactttggga ggccaaggcg ggtggatcac ttgaggccag 79501 gagttcagga ccagccttgc caacatggtg aaaccccatc tttactaaaa atacaaaaat 79561 tagctgggtg tagtggcaca ggcctgtaat cccagctact tgggaggctg agacaggaga 79621 atcacttgaa cccgggaggt ggaggttgca gtgtgccaag atcactccaa gatcacgcaa 79681 ctgcactcca gcctgggtga cagagattcc atctcaaaaa aaaagaaaaa aaaaaaagaa 79741 aaagaaaaaa agaccatgct tggtggctca cgcttgtaat cccagcactt tgggaggctg 79801 aggtggtcag attgcttgag gccaggagtt caagaccagc ctggccaaca tcgtgagact 79861 gtctctacta aaaatacaaa aattagccag gtgtggtggt gtgagcctgt ggtcccagct 79921 actggggagg ctgaggcagg agaattgcct gaatccagga agcagagatt ggagtaagcc 79981 aagatcgaga tcgtgccact gcactccagc ctgggcaaca gaacaaaact ccatctcaaa 80041 aaaaaaaaaa aaaaacacac aaatgacaga tacatataac aatacaaatg aatttctttc 80101 tttctttttt ttttgaaaca aagtcttgct ctgtcaccca ggctggagtg cctcccaagt 80161 tttatatata aatttttata tatagctaaa acccagatta agatacagat ccataaaagt 80221 gaccactcac cacgtgtggc tactgagcac ttgagatgca gctggcccaa actaaccagt 80281 gctgtaagtg caaaatatac attggatatc aaagaatgca gaaaatgaat gtaaaataca 80341 gtgataataa ttttacattg attacatgtt gaaacgtaat ttttgaatat attattaaaa 80401 taaatttcac ctgtttcatt ttatgtcttt ttatttttat ttatttattt atttattttt 80461 atttttttga gatggagtct tactctttca cccaggctgg agtgcagtgg cgcaatcttg 80521 gctcattgca acctccacct ccttcgttca aatgattctt ctgcctcagc ctcccgagta 80581 gctgcgatta caagcaccca ccaccacacc cagctaattt ttgtattttt agtagagaca 80641 tggtttcacc ctgttggcca ggctggtctc caactcctga cctcaggtga tccacttgcc 80701 tcggcctccc aaagtgctgg gattacagga gtgagccacc gcgcccggcc tttatgtctt 80761 tttaatgtgg ttcctagaaa atttttaaat tacatatatg gtttacagct gtagctggct 80821 tttttttttt tttttgagac ggagtcttgc tctgttgccc aagctggagt gcagtggcac 80881 gatcttgtct cactgcaagc tccgcctcct gggttcatgc cattctcctg cctcagcctc 80941 ccaagtagct gggaccacag gcgcccgcca ccacgcccgg ccaatttttt gcatttttag 81001 tagagatggg gtttcacagt gttagtcagg atggtctcaa tctcctgacc tcgtgatccg 81061 cctgccttgg cctcccaaag tgctgggatt acaggcgtga gtgtagctgg cattttattt 81121 ctattgggca gcactgatat tgaagctttc cttcctccag aaagttatct cctgcttcca 81181 agtcaatatc tgcttcccaa agataaatgc gactatgacc tgtcttccca tagattagtt 81241 ttgcctgttc cataatttca tataaatgga atttttgtat gtattttttt gcatctggct 81301 tcttttgctc atacagtttg ttgttgttgt tgttgttgtt ttgaaacaag gtctcacttt 81361 gtcacacagg ctggagtgca gtggcatgat ctcggctcac agcagccttg acctcaagca 81421 agttcaagcg atcctcctgc ctcagcctcc tgagtagctg ggactacagg catgcaccac 81481 catccccggc taatttttgt atttttcata gagatggggt tttgccatgt tgtccaggct 81541 ggtctcaaac tcctaggctc aagtgatctg ccctccacca cctcctaaag tgctaggatt 81601 acaggcatga gtcactgtgc ccggcctcaa cctacaattt ttgaaattca tccaagccga 81661 gcatagtggc tgacacctct aatcccagca ctttgtgagg ccaaggtggg aggatcactt 81721 gaggccagga gttcaagacc agcctgggca acataacaaa accccacctc tacaaaaata 81781 taagaaaatt agccagatat gttggtatga gcctgtagtc ccagctactt gagaggctaa 81841 ggcaggagga tctctttagc ccgggagttt gaggaaacag tgaaccatga ttgtgccact 81901 gcattccagc ctgggtgaca gaacaagacc ctgtcaaaaa agaaaaaaag aagtgagagt 81961 gagagtgata ctttacaaaa aaaaaaaaac agaaaaaatt aaaaggggaa taggatgagc 82021 gagtggagat agagtaacaa aggtaaagaa ggattgagct acatctggcc gggtgcggtg 82081 gctcacacct gtaatcccag cactttggga ggccaaggca ggtggatcac tagctcagga 82141 gatcgagacc atcctggcta acacagtgaa accccgtctc tactaaaaat acaaaaaatt 82201 agcatggtgg cgggcgcttg tagttccagc tactcagaag gctgaggcag gagaatggcg 82261 tgaacccggg aggcggagct tgcagtgagc cgagatcgcg ccactgcact ccagcctggg 82321 caacagagtg agactgtatc tcaaaaataa aataaaacaa aataagaaag gattgagcta 82381 catccgtgaa tctatttcca ccagcagaga aattcaccac ggctcagcgt ccccaagact 82441 gggtgagaaa atcacaaaag atcaaatttg gcagaagcta atcaagagtt tactttttgg 82501 tctcagcatc atatcccaat tttttagtgt gtcacatgta acatgtaact aataacaaca 82561 aagtagatat tgacaacttt actgagccct taccatgtac agcatggtgc taagagcttt 82621 cctgtctgat cgcatttaat cccctctaca gtcttctaaa gttagttcta tgatgatcct 82681 cccatgttgt agatggggaa actgaggctc agcaaactta cccaaggata ctacactcag 82741 aaatggcaaa gaggggattc aaaggcaagt gtctctggtt tctggtctgg tatttttaac 82801 cactacctct actgaagctt gagaatgctt tctttgcctt aaataaataa atgaagaaga 82861 aaaagaaata aacaaatgag gagaagtgga gtcaaggtta tgtctataaa taacgtcact 82921 cttgctccct ggcctttgga caagctctcc agagggagag aggaagagct ggggttacgc 82981 agaattcaga ctatcttatc tgaacaccat ggaagcatga gtcacgtggg aatgccattc 83041 atcaggagtg tcccaatttt aaaaaggttg agaaaggctg ccttggagag cctctgatgc 83101 ccccctcttg ctttatggat aaacttagag atccaaagaa agaattattt ttcccaaggt 83161 cacacagcta gtactcagcg gctacatctt tgcctcttct cttctcagtg aatgaatcaa 83221 atgtgccttt tccaccgtgt caggtactct ggaggtatcc acacctgaat aaggcttgct 83281 gacagttctt gaggcattta cgctccagta aagggacaaa ttatatccca aagcaaagta 83341 tgagagcctt aagagagctt tggaagctca agggagggag ggagtacagt gggttggaga 83401 ggaaagggaa ggctggcatc taagctgggt cggaaaagca gaaacagaga agtgggcact 83461 cctcaaagga gagccagtgg gtacttgtgt tggctgaact gtgtatttac caccggccca 83521 actattgcct ctagtgtcca ccatgatgtt gagtcaacat taaattattg gttaactgat 83581 gcagaataga actcattcag ggtggtttcg ggggcagaaa aaaaattaaa ttaaaaaact 83641 catttggcca ggctaggtgg ctcacgcctg caatctcagc acttcgggag gccaaggtgg 83701 gcggatcacc tgaggtcagg agttcgagac cagcctggct aacctcatct ctactagaaa 83761 tacaaaatta ctgggcatgg tggcacatgc ctgtaatccc agctgctcag ctgctcggga 83821 ggctgagcca caagaatcag ttgaaactgg gaggcagagg ttgcagtgag ctgagattgc 83881 accaccgcac tccagcctgg gcgacagagt gaaataccat ctaaaaaaaa aaaaaaaaaa 83941 aaaaaaaaaa aaactcattc atcagaaatg cagagataaa attagctgag agtggtgtgc 84001 atgtccctgt aatcctagct agacaggagc ccaagatagg aggaccactt gaggtcagga 84061 gttcaagaca gagcagccta agcaaaagac tgagaacccc acctctacta aaaataaaag 84121 attagcagag tatggtggct catgccttga gtatggtggc tcccagccac ttgggaggct 84181 gaagtgggag gatcccttga gtccagaagg tagaggctgt agtgagttat gattgctcca 84241 ctgcataaat acagacaatg tgtatttgtg tgtgtatatt gtaatgtctg tgtatcccga 84301 ggtcatatgc ttctcaaaag aaaggactgt tagccaggca ccgttacaaa cacctaggca 84361 acataacaag actctcacct ctaaaaataa ataaataaat aaataaataa ataaataaat 84421 aaataaataa taaggactgc tttcttcttg ttccccatta tggctggcac atagcaggca 84481 ttctttacat cttggctgaa tgaacaaata ggtaaattgt ggggaagaca cattggcccc 84541 tttaagcctt aagaaataat aaccagctgg gcgtggtggc tcacgcctgt aatcccaaca 84601 ctttgggagg ctgaggcggg cagatcacga ggtcaggagt tcgagaccag cctggccaat 84661 atggtgaaat cctgtctcta ctaaaaatac aaaaattagc cgggcgaggt ggcatgcgcc 84721 tgtaatccca gctactcggg aggctgagtc aggagaattg cttgaacctg ggaggtggag 84781 gttgcagtga gccaagatcg agccattgta ctccagttct gggcgacaga gcaagattct 84841 gtcatgggaa aaaaaaaaat tagcctggca tgggggcgga tgcttgtaat ctcagctgtt 84901 caggaggctg aagcaggagg attgcttgaa cctgggaggc agaggttgca gtgagcagag 84961 attgcaccat tgcactccag cctgagggac aagagtgaaa ctccgtctca aaaaaaaaaa 85021 aaaaaaaaaa ggctaagtgt aaggctcaca cctataaacc caacactctg gaaggccgag 85081 gtgggaggat tgcttaaggc caggagttcg agaccagcct gagcaacaaa gggagactct 85141 gtctctacaa aaatttaaaa attagccagg catggtggca cttgcctgta ggcccagcta 85201 cccggaaacc tgaggtggga ggatcgcttg ggcccgggag atcaagggtg cggtgagcta 85261 tgatcgtgcc actgcattcc agcctggata ccagaagaat accctgtcta aaaaaaaaaa 85321 aaaggctgtt aagactttgc tttttaatat agctatatgg cttttatttt gagacggtgt 85381 ctcactttgt tgcctaggct ggagtgcagt ggtgtgatct tggctcactg caacctccac 85441 cttccaggtt caagcgattc tcctgtctca gcctcccaag tatctgggat tgtaggcccc 85501 tgctagcaca tctggctaat ttttgtattt ttagtacaga cgggatttca tcatgttggc 85561 caggctggtc tcgaactact aacctcaggt gatccacctg cctcagcctc ccaaagtgct 85621 gcaattatag gtgtgagcca ccacacgcgg ccaaatacag cttaatatag tgacacacaa 85681 ctaggaaata gatggaacca ggattcaaac acccgtcctt ctgatgtgaa attttgtgac 85741 tcagaagagg gaagtcagac ccaacctgga gagcccactc ccatgcctgc ctgcagctgt 85801 cccagctgtg cctttagagg gtgggctgac ctcactctgt catcagcagc accagcaagc 85861 accagagggg tgagtgtctc tgggagcact caaaatcatc caccaagaaa agatcaggcc 85921 ttctatagta tttttttcta tttcatctct tttccccgct cttttttttt tttttttttt 85981 tttaagacag ggtgtgcctc tgttgcccag gctggagtgc agcagtgatg taatcataac 86041 tcaccgaagc cttgaactcc tgggttcaag ccttcctccc actttagcct cctgagtagc 86101 taggacaaca ggtgcgtgcc actatatctg gctaatttct tattttattt atttatttag 86161 acagaatctc actttgtcgc ctaggctaga gtgcagtggc ccaatcttag ctcactgcaa 86221 tgcaatctct ttttcccagg ctcaagtgat tcttttgcct cagcctactg agtatctggg 86281 attatatgca tgtgccacca cgcccagctt tttttttttt tttttttttg agatggagtc 86341 tcactctgtg gcccaggttg gagggcagtg gtgcaatctc ggctcactgc aacctctgcc 86401 tcctgggttc aagagattct cctgccccag cctcccaagt agctgggatt acaggcacac 86461 gccaccatgc ccagctaatt tttgtatttt tagtagagat ggggttttgc catgttggcc 86521 aggctggtct cgaatccctg acctcaggtg attcgcctgc ctcagcatcc caaagtgctt 86581 ggattgcagg tgcgtgccac tgctcccggc ctaatttttg tatttttagg agagactggg 86641 cttcaccgta ttggccaggc tgggcttgaa ctcctcgcct ccagtgatcc acccacctca 86701 gcctcccaaa gtactgggat tacaggcatg agacactgtg cccagcctat ttcatctttt 86761 taaaatttct tttgcataga tgtttgtaac atagatatta taacagtaga gtatgtatat 86821 attttgtaaa tatataaatg tatgtatgct agaaatatga gctcaaagat tgtttcctct 86881 tagggatgta tcatcacaaa aatttggaga tcatttctta agacagcaaa gacaaagatt 86941 aaaaacaatt attataggat atttacatgt gaaagaataa agttggacct ggcaccatgt 87001 aaaaacaaaa tgaactcaaa atggataaaa gaaacacagg ggtaaatctt tatgacctta 87061 aatttggcaa ttcttagtta tgtcaccaaa tgcacaaaca acacaagaaa acaatagata 87121 tattggactt catttaaatt taaaactttt ggctgggtgc agtggctcat gcctgtaatc 87181 ccagcatttt gtgaggctaa gcaggagggt tgcttaaact caggagtttg agaccagcct 87241 gggcaacata gtaagacctc gtctctacat aaaataacaa aatttagctg ggtggtggca 87301 cgagcctgga gtcctaggta cttggaaggc taaggctgga ggatggcttg aactcaggag 87361 ttcgaggctg tagtgagcta tgattgtgcc actgcactcc agcatgggtg acagagcgag 87421 accctgtctc aaaataaata aatgaataga taaataaaaa taaaaacttg ggcatcaaaa 87481 ggcactatca agaaagtgag gccaggtgtg gtggctcacg cctataatcc caacattttg 87541 ggacgccaag gcaggcgaat cacttgaggt caggagttcg agaccagcct ggccaacatg 87601 gtaaaacccc atctctatta aaaatacaaa aaattagctt agagtggtgg tgggcgcctg 87661 tagtcccagc tactcgggag gctgaggtgg gagaatcgct ggaacctggg aagtggaggt 87721 tgctgtgagc taagattgtg ccactgcact ccagtctggg caacagagcg agactgtctc 87781 aaaaaaaaaa aaaaaaaaag tgaaaagggc tgggtgcagt ggctgacacc tgtaacacct 87841 gtaatctcag cactttggga ggctgaggtg ggcggatcac ttgaggtcag gagtttgaga 87901 acagcctggc caacatggtg aaaccctatc tctacaaaaa aaaaaaaaaa tacaaaaatt 87961 agccaggcat ggtggcgtgt gcctgtagtc ccagctactt gggaggctga ggcaggagga 88021 cttgagccca gggaggtcga ggctacagtg agctgtgatc aagccactat actccagcct 88081 gggcaataaa gtgcgactct ttctcaaaaa ataaattaat aaataagtac aaataggcaa 88141 aggacttgaa tagatatttc ttcaaagaaa atacataaat agccaataag ctcgtgaaaa 88201 gatattcatc ttgagccatt agtgaaatgc aaggcaaagc cacaatgaga tatcattcca 88261 cacctactac cacagctatg aataaataaa taaatggaaa atgacaagtg ttggcaagaa 88321 tgtggagaaa ttggagcttt ctgtacgttg ctggtgtgag cgtaaaatgg tgtggcactg 88381 tggaaaaccg ttcgatggta tctcaaaaag taaaacagaa ttatcatatg acccagatcc 88441 taggcatata cccaaaataa ttaaaaacag gtattcagac aaaaacatgc atgatcatag 88501 cagcaccatt cacgactgcc aaaagattga aacaacccaa atgtccatca agtgatgaat 88561 gggtaaacaa gatgtagtat acccatgcaa gggaatacta tttagccgtt aaaaggaatg 88621 aaataggaca catgctacaa catggataaa cctgagggac atcatcctaa atgaaaaaaa 88681 ccaggctgag ctcaggggtg cattttgccg gccttaatcc cagcactttg ggaggccaag 88741 ccagatagat cgcttgaatc cagtagtttg agatcagcct ggacaacatg gagaaaccct 88801 atctctatca aaaatacaaa aaaattagtt gggtgtggtg gcggacacct gtggtctcag 88861 ctacttggga ggctgaggca ggaggatcgc ttgagcccgg gaggtggagg ttgcagtgag 88921 ctgagatcgc gccactgcac tccaacctgg gtgacagagt gagagctcat ctaaaaaaaa 88981 aaaaaaaaaa gaaggaaaag aaaaaagcca gatgaaaaag gtcacatact gattccattt 89041 ctatgaaatg ctataattcc atttctatga aatatctaaa ataggtaaat ctgtagagac 89101 agaaagcagg ttagtgattg ccaggggcgg ggggaataag gaaggagaag taattgctta 89161 ctggatatga tgttagaagt ttgcctttag ggtgatgaaa acgttgtaaa tatactaaat 89221 gcaactgaaa tgcaagcttt aaaatagtta acttcaataa aacaaaaata attatcataa 89281 aataaataca tgatcatagt aagaaaatat cttttttcaa acagcacaga atggtactaa 89341 gtgacttctc tccatgctct atttccactc ttcaaaggaa ataaaggata ctacaattta 89401 tgatgttttt tccccaaatt tattccatac agatagaaat gtgcgatatg tgtatacatg 89461 tttgtgtgta gatatttcct tttcctctta tgcaaatgag atcatattcc attcaatgta 89521 ctgccttttt ctcttggcaa tgtgtcaatt accattccat atcaatacag ttttcttaat 89581 ggcttcgtaa tatgggcttc ccatactccc tgtctcttca tcttcagacc tgagggcccc 89641 actgtttccc tccaagttct gagtcccaac gccatcctgc ctcggacttg tcagtgactt 89701 tgacctcaat cgttgctcct tcacgtggca ttgtcaggag tctctaatat ccagcaggtg 89761 cctggcccca tgggctttat aaaaatggca gctcccttct tctgcaccct gtgctatgct 89821 agggccagct gggggtacca tagaataatg cccgagggtg tagactccct gtcttaccag 89881 ccgtatgacc tcaggcaagt gccctaacct ctctgagcct cagtttcctc cgttgcattg 89941 aggctattca cagtcccgta caatgatgct cacacagcgc tgagcccagg gcccagctcg 90001 gagccagtcg gcatcacgtg atctttcggg aagaaaaggt gggcagagcc ctgggtcggt 90061 ggccacctgc ttttgggtag ggcactaata cagggtgctg aagcgggaag gtgaggccga 90121 gctgggcggc cctgggaacc cgggaaggag gcgaggctgg agtggaaacg attagaatga 90181 gagcaaactg gctcctgaat agaagctttt attcctgccg gagagggctt tgttccacaa 90241 acacgtgaca cacaaaagcc tctacttgaa ttcgggaacc ctaattggtt ctcaggtggg 90301 ggaggagagg ggtgtccgga tttagggatt ccttaaaggg acaatactgt ggggagctga 90361 gcccagggtt cctccccatt gtcagtggga gggggtggcg tggaggggag gggatggggt 90421 agaggggagg ggaagagctg gaccagtggg ggtggggcgg ctgttagaag aggaggggga 90481 ggggagagga agggagaagg gaagagggag agaagcacag agagactcag aggcctgcag 90541 ccccctgggc cagtgcttga gaaacagtag gcatggacgg agggaaagaa ggaaggacgg 90601 agggaaggaa ggaagagagg aaggagggga ggaaggaagg aggggaggga aggaaggaag 90661 gtaggaaggg agagaggaag gagggaggga gggaaggaga ggaggaagga agggagagag 90721 gaaggaggga gggagggaag gagaggagga aggaaggaag agagggagga ggggaggaag 90781 gaaggagaga aagaaggaag gaaggtagga aggcagggaa aaaggggagg aaggggggaa 90841 ggaagaaagg aaggagagag agaaaagaag gaagggagga acggagggag ggaagggagg 90901 aagaaaaaga ggaaggaaag aaggaaggaa ggaaggagcc tggcgcggtg gctcacacct 90961 gtaatcccag cactttggga ggctgaggtg ggcagatcac ttgaggtcag gagttcgaga 91021 ccagcctggc caacatggtg aaaccccgtc tctactaaaa acacaaaaat tagccagatg 91081 tggtggtagg tgcttgtaat cccagctact taggagactg aggcaggaga attgcttgaa 91141 cccgggaggc agaggttaga gtgagctgag atggtgccag aacaagactc tgtctctttt 91201 tttttttttt tttttttttt gaggcggagt ctcgctctgt cgcccaggct ggagtgcagt 91261 ggcatgatct tggctcactg caactgctgc ctcccaggtt caagtgattc tcctgcttca 91321 gtctcccaag taggtgggat tacaggcacc caccaccaca cccagctaat ttttgtattt 91381 ttactagaga cagggtttca ccatgttggc caggttggtc tcaaactctc ctgacctcat 91441 ctcaggtgat ccacttgcct cggcctcccc aagtactggg agtacaggcg tgagccacta 91501 tgcctggctg agactctgtc tcaaaaaaaa aaaaaaaaaa aaaggacaga ggaaggagag 91561 aaggaaaggg agggaagagg gaagaaggag ggaaggaagg aaaggagagg ggggaggaag 91621 gaacaagaga aggaaggaag gagggaagga aagtaggaag agagagagag attgagggag 91681 gaagtaagaa aaaaaggatg ggagggaagg agaaggaaag gaaggaaggg aagaaaaaag 91741 agaaggaagg caggaaggga gggagggaag aaaggaagga ggaggaggaa gggaaggaag 91801 gaagggaggg agagaagaag gaaaggaaga gggaggaagg gaaggaagga gttcttagga 91861 tgtcccttaa ggcccttcct ccctctccag cctggcccat cccactctca ctctccctcc 91921 cagtgctcca gccacactgg ctgcctttca gttcctgaga taccccaaac taagtcctac 91981 ctcaggacct ttgcacctgc tctcccgcct acctggaatg cttcccactt acctgccccc 92041 ctcttcccat ggcaatcctg cagttcaagt atcacctata cagaagaagt cttcctagac 92101 ccccaggcct ctagcctatc accctgtttt actgtcctca agcccttaac actcacagac 92161 attgatggaa aatggcgatt gtgctcagct ctatcccctg ccctcagcac aatgcttaac 92221 acacagcaga agatcaaaag atacctgaga ctatctggct ggctgagtgg aaagggagca 92281 gtggctgcaa taatgcccat cctttcctgg ctccagagcc attgttctta gggttagctg 92341 agctcaagcc tccctttgaa actttctagc cgtgtggcct tgggcaagtc actttgtctc 92401 tgacttcatt ttccttttta aaaaattatt attatattta tttatttatt tattttgaga 92461 cagagtctca ttcactctgt ggcccagtct cattcactct gagtgcagtg ggcttcccag 92521 gttcaagaga ttctcctgcc tcagcctccc aactagctgg gattacaggc gtgtgtcatc 92581 acacctaact aatttttttt tttttgagat ggcatttcgc tattgttgcc ccagctagag 92641 ggcaatggtg tgatcttggc tcacggcaac ctctgcctcc tgggttccag tgatgctcct 92701 gcctcagcct cctgagcagc tgggattaca ggcatgcacc accacgcttg gctaattttt 92761 ttttttgtat gtttagtaga gatggggttt ctccatgttg gtcaggctgg tctcgaactc 92821 cttacctcag gtgatctgcc tgcctcagcc tcccaaagtg ctgggattac aggcaggtgt 92881 gagtcactgc gcccggcctt tttttttttt aaacagagtc tcgctctgtc gccaggctgg 92941 agtgcagtgg cacgatcttg gctcactgca acttctgact tcccggttca agcgattctc 93001 ctgcctcagc ctcccgagta gctaggatta caggcatgtg ccaccacgcc cagctaattt 93061 ttgtattttt agtagagacg gggtttcacc atgttgacca ggatggtctc gatctcctga 93121 ccttgtgatc ggcccgcctc ggcctcccaa agttctgggg ttacaggtgt gagccactgt 93181 gcctggctct aattttttaa tatttttagt agagatgggg tttcaccatg ttggccaggc 93241 tggtcttgaa ctcctgacct caagagatca gcttgccttg ccctcccaaa gttctgtgat 93301 tacagtcatg agccaccaaa tctggcctaa aaatcattat tttttatttt aaaaaatttt 93361 gtttgtagag atgatgtctc accatgttgc ccaggctgat cttgcactcc tgggcttaag 93421 ccatcctccc acctcagcct cccaaagtgt tggtattaga gccgcaagcc accacgctca 93481 gccttaacat tttgaatata taattttgtt gaagaaacag ggtcttgcta cattgctcag 93541 gctgatctca aactcctggc ctcaagtgat cctcctgcct aggtcttcca aagtgctagg 93601 attattggcc aggcgcagtg actcacacct gtaatcccag cactttggga ggccgaggtg 93661 ggcagatgac ctgaggtcag gagttcaaga ccagtctggc caacgtggcg aaaccccatc 93721 tctacaaaat tacaaaaatt agctgggcgt ggtggcaggt gcctgtaatc ccagctactt 93781 gggaggctga gataggagaa tcacttgaac ccgggaggca gaggttgcag tgagccgaga 93841 ttgtgccact gcactccagc ctgggcgaca gaacaagact ctgcctcaaa caaacaaaca 93901 aacaaacaaa caaaaaagtg ctgggattat aggcatgagc cactgcacca ccgggcccct 93961 tttctttaaa aagcacagaa ctggcaggga gcggtggctc accccgtaat cccagcactt 94021 tgggaagccg aggcgggtgg atcacgaggt caggagatca agaccatcct ggctaacaca 94081 gtgaaatgct gtctctacta aaaaaataca aaaaattagc tgtgtgtgat ggcaggtgcc 94141 tgtagtccca gctacttggg aggctgaggc aggagaatgg catgaacccg ggaggtggag 94201 cttgcagtga gccgagatcg caccgctgca ctccagcctg ggcgacagag tgagactcca 94261 tctcaaaaaa aaaaaaagaa agcacagaac ttacctcaca agggcattga caggatggca 94321 gagagaacaa gcactgagcg ggtgcatgat ggggacctgg ggaacattag ccacgcgggt 94381 agaaaggcag tgggtatttc tacagaagag gaaggactcc cattccctga gccccaactc 94441 tgtggcagac gctctcatat gatattttaa gaaatacctg taaacaactt ttgaagcagg 94501 ttttgatcat ccccatttac agaagagaaa acagcctcag agaggttaag agacttcctc 94561 tggatcacac agctagtaag cagcagagcc aggattcgaa cccagctcta ccctctatct 94621 catcccaaag actatgttct gcctactctg atccaagtgc cctcttggac ttccagtgca 94681 aggaggctgt gggaagggga aggaagggct taggaccctg agggacctgc aaggctgggg 94741 ctgcccctgt accctccatt agcattcagc ctgggccctc attatatgag taggtagagt 94801 cctgagttct ggactatggg ctccatgctt cagtgagtta tcttatttta tttattttcg 94861 tgacaagttt ttgctctgtc acccagggtg gaatgcggtg gcaggatcgc agcttactga 94921 agcctcagcc tcccaggctc aagcgatctt cccacctcaa cctcccaagt agctgggacc 94981 acaggcacgc accaccacac ctggataatt tattttattt tatttttatg ttttgagaca 95041 gagtcttgct ctgttgcccg ggatggaatg cggtggggca atcttggctc actgcaacct 95101 ccacctcccg agttcaagcg attctcatgc ctcagcctcc tgagttgctg ggattacagg 95161 cgtgtgccac cacgcccagc tcatttttgt attgttagta gagacaaggt tttgccatgt 95221 tggccaggct ggtctgaaac tcaagtgatc ctcctgcctt ggcctcccaa agtgctagga 95281 ttacaggcat gagccactgc gcctggccca gttacttatt ttagaagtta tatttgagca 95341 cctattctgt gccgagccct ggcatgagct gtgaacaggc catatctatc ctagatgtgc 95401 actaatgggg ctttggaggg tggcaacagg aggcccggtg aaatccccgg tgagagcagc 95461 cttttcgccg tggcctgccg tgaagcacta tggcagcacc cacacctgcc aggactgaag 95521 gtattgtcgg gccctctcct gctcccagct gcagccaagg ccgtgtgtac agggcccttt 95581 aagtagtctg tgccctccct aattaaccag ctaaagaaag agctggcttg aaatgggatt 95641 gtgcagaccc agctttggga cccgaggacg ccggatcggg gattgttatg ctaatcgcct 95701 gagatcagca gttcccgtgc ccttcagatg gcaggtagcg aggccgggct ctgcccagcg 95761 gctgtggcta caggaggcca ggcttttcct ccagcttccg gcttcctctg ttcccctcct 95821 taccccgtca gacttgccct gcctgtcctc ccacctgccc gttggctgga aggcttggcg 95881 tctcctagaa atgtccaggc cagctcctcc caccctccaa agagaaacct gcttgccttc 95941 aagctgggcg ctaattgggg atgcagaggc aggaaggaga aggtacccag ctcctcctcc 96001 tgtataaccc attctcatct ccaggcatga gagcctcatt ggtccctcct gcccccttgg 96061 gctctggaaa acacactagg tggtgccttt gccgcttact aactatacca ttttgggcaa 96121 gtgactgcct ctcagagcct cagtttccta atctgtaaaa taggttgata ataacagtag 96181 ctacctcaaa gggttgaaat gagaatagaa tgagaacagg catctaacat agccagttca 96241 attgctactg cccatccagc actggataat agctccctgt gtagagcacc cactatgtgt 96301 cagccctatg ctgtatcctt tttttttttt tttttgagac ggggtctcac tatgttgccc 96361 aggctgatct tgaactcctg ggctcaagca actctcctgt gtgccaccat ggctggccct 96421 tttataaggt gttaccccat tttgccttgc ctgcccgcct gcctgcctgc cttccttcct 96481 ttcttccttc ctccttcttt cctttctttc ttttttgaga cagtctcact ttcttgccca 96541 ggctggagtg cagtggcacc atctcggctc actgcaacct ctgctgcccg ggttcaagca 96601 attctcctgc ctcagcctcc caagtagctg ggattaaggt gcctgccacc gtgcctggct 96661 aatttttgtg gtttgagcag agacagggtt ttgtcatctt ggccaggcta gtcttgaact 96721 cctgacctca tgatccaccc acctcagcct ccaaaagtgc tgggattaca ggtgtgagcc 96781 accgtgcccg gcccattttg ccttttcaac cactctgtga gataattaac attatcaccc 96841 cccattgcac agatgaagaa accgaggccc agagggactg acgcacttgc tcgagaccta 96901 ccccaagcca gtctgttctc ttaacgtcta ttctgcagtt tgatatgtga ccattcacag 96961 tgagcttggt caatacaaag atgttcccag aaacatgtgt catcgttata tatttaatgt 97021 gcaggagaaa aaaaattaaa gtagcatttt aagcctgtga gttcacaaat agtattgctt 97081 aggttaaata tatttaagtt catacagtga ccagtgcaag tatggtcata gcggctttat 97141 tcataatagc aaaaactgga aacaatcaag acacccatca acaagaaaat agaggaacaa 97201 accgtggaac agccatataa tggaatactc agccagcagc gaaagggaac aaacaactgg 97261 tcaatctaca atggggacaa atgtcacaga ctttttttgt ttgtttgttt tgttttttgt 97321 ttttttgaaa tgaaatctca ctctgtagcc caggctggag tgcagtggtg caattttggc 97381 tcactgcaac ctccgcctcc cggattcaag tgattctcct atctcagcct cccaagtagc 97441 tgggattaca gtcacctgcc accatgccca gctaattttt gcatttttgt agagatgggg 97501 tttcgctgtg ttggccagct ggtctcgaac tcctgacctc aggtgatact gatccaccca 97561 cctgggcctc tcaaagtgct gggattacag gcgtgagcca ccatgcccat cctcacaaac 97621 ataatactga gcgaaagaag ctagacttga gtgtacacat tatactatac gattccactg 97681 atacgaagtt caagaatagc taaagcaaat ctatggctgc caatggggag gtggatactg 97741 ctcaactgct cagggacgcc agggaacctt ctggggtgat gggaatattc tatatctttt 97801 tttttttttt tttgagatgg agtctcgctc tgtcgcccag gccagagtgc agtggcacca 97861 cttcggttca ctgcaacctc cgcttctcgg gttcaagtaa ttctcctgcc tcagcctcct 97921 gagtagctgg aattacaggc gtgcaccacc acccctggct aatttttgta tttttagtag 97981 agaggggatt tcaccttgtt ggccaggctg gtctcgaact cctgacttca agtgattcgc 98041 ccacctcggc ctcccaaagt gctgagatta caggcgtcag ccaccactgc tggccgggaa 98101 tattctatat cttgatgtgg gtgcggtcac atgagtgttc ttgtaggcaa aaatgcatca 98161 acaggtacat ttcatatttg ttcaagttaa tgtatttgca ctatgcatca ataaatcacg 98221 gttaataggt taagggagtt ttttgttttg ttttgtagag atgggatctc actttgttgc 98281 tcaagctgat cttgaactcc tggcttcaag tgatcctccc cccctcagcc tcccaaagtg 98341 ctgggattac aggcttgagc cacggagctc ggccaggtta agatttttaa aaaataaatt 98401 tatttaaaga aaaattctat ctaaatagca gttcggggtg tgcagatatg gccaaaaaat 98461 aaaaataaaa aacaaaaagc aagcaaacaa acaaaacttc aaccactgaa gttgggggta 98521 atataagtgc taggcaatgc agatgtattc agcttcccct tcaaagggtg ccagacaaaa 98581 tacaggacgc ctagttaagt tgaaatttca gataatcaac caatactttt tgagtgtaag 98641 tatgtcccat gcaatatttg ggacatactt atactttgtt ttttgtctgc ttttgttatt 98701 tatcagaaat tcacatttaa ctggaagtcc tgtattttta tttgctaaat ctggccaccc 98761 tgccctcctg ggatcccacc ccgtctcgca gcccggtgcc caccgcaggc gggtacaccc 98821 tgcgcggtgc agtcactccg ggcgcagcct ggcaacgcgg ccacaaaggg agcccgggag 98881 gcggaatctt tccacatgcc tgatcaatgg gggcgcggag acggcggcgc aaacaggcct 98941 gagacagccg cacaaagagg aggcccctcg gtcgctcccg ccttccattg atcctggtcc 99001 tttgtggctc ggacaacggg ggcgcgggag gcgccttgcg gggacttggc cgtttctcat 99061 tagagggaga aggcctgtga gcacccccgt cccgccagcc ctccccaccg ttcctcctcc 99121 cgtgggctcc aggcctcagg tctggggctg tccaaaccta tcaggggaac tgggcagggc 99181 acagggtggg gtcggaccag gctaggaagg tctggagcct gcctgtgggg gccagccacc 99241 cagcccccgc ctcctgagga ccccccagtc tcctctggcg ggccctctct gtccccttgg 99301 ttacaggcag gttttgcaca gtctctgttt ttttatttat tgagcaaata tttattgagc 99361 gtttgttcca agctgttccc actccctcct gcccccatgg ccacttaaac tagccaattc 99421 ctcctccaag catcacctct ctgggaaagc acctctcagt cctctctcac ctcaggttgg 99481 gtagtgaaac cccactacac atacacacac cccgtgtgcc tctccccacc tccctgctct 99541 tcattcatga taaaaatgac taacaattat gaagtactta ctatgcctgc catgctaaac 99601 attttattgc tgtctccttt aattctcata acacccctct agattggagc tactatgatc 99661 tccattttcc agaggtggaa actgaggttt ggcatgctaa aatggtcacc caaagtcata 99721 cagcaagcgg tagagtggag atttgaatgc aggtgatgaa ttgcagtttg agatcttcac 99781 atcccaagtt ccagactcta tgcaggattc attgctgtgg cccagtaatg aactgtttac 99841 tccatcagca catttaccca ctgtactgtg tctatttggc cttttccgtc tgtgttcccc 99901 actagactga ttggtctgtg accgcagcac ccagcacaga gaaggtgccc agggtatatt 99961 tgctaatctg aactattggg tttaagaatt ttggttgggc gcgatggctc acgcctgtca 100021 tcccaacact ttgggaggcc aaggcgggca gatcacctga ggtcaggagt tcaagaccag 100081 cctggccaac atggtgaaac cccgtcttta ctaaaaatac aaaaattagc cagatgtggt 100141 ggcagtcgcc tgtaatccca gctacttggg aggctgaggc aagagaatct cttgaacccg 100201 ggaggcagag gttgcagtga gccaacattg tgctgccaca ctccaacctg ggcaacatag 100261 tcagagtcca tctcaaaaaa aaaaaacaaa aaaaagaatt ttgctaggcc ctcaggctgg 100321 gaagtgaggg gaggacctaa agatgaatct attacagtct ctgcccacaa atgagcttga 100381 aatcagacac tcaaataata taggaagggt tgcagagctc tggagtctgg tataccttcc 100441 ccaccccctt gaaaagatca gcttcattga ggtataattt acatgcagga aaatacaccc 100501 attttaagtg tccagttcga tgaacattga caaatgcata cacccatgta actagtatca 100561 atgtcagatg taaaacattt ccatcttctc aaataggttc ctcaggctct ttgcagtgaa 100621 tgccccgccc accctgtccc acgcaacccc tgatctgctt tctgtcacta taagttagtt 100681 tgcattttct agagtttcat ataaatgaaa tcatacaatg tgtactcgtt tgcatctggc 100741 ttttttcgct caccatgcta atgtggtgat tcatccatac tgtggagtga cctttcagtt 100801 ctaaatcctc tctctgcttg tgacctcttt gagtttcacc ctcttcatct gtaagacagg 100861 ggtggattct gtgaaatttc gttaagagaa tgtctgccaa gtgcacagga cctgcacaca 100921 gttggcatgc aataaacgct cttgattagt ataaccttcc tgctagactg taagctccat 100981 gaggacaggg accttggtca ttttgttcac cactctccac taactgcagt ggctaccgtt 101041 tattacagac agtgaatagg agctcggaga cctacagcaa acctgtccaa gatcacacaa 101101 ctagtaagag gcagagctgg gatttgaacc caggcaaact agtgctggag tccttacttt 101161 ttaaattttt ttgaaacagg gtctcactgt gtcgcccaga ctggagagca gcagcacgat 101221 cttggctcac tgcagcctcc acctcccagg ctcaagtgat cctcccacct cagcctccca 101281 agtagttggg actacatgta cccaccacca cgcccagcta atttttgtac tttttgtaga 101341 gacagggtct cactatgttg cccaggctgg tcttgaactc ctgaactcaa gcaatccact 101401 cccctcagct tcccaaagtg ctggattata ggcgtgagcc accacaccac acctggcctc 101461 ttttttattt tatttattta ttttttttgg acagagtcac gctctgttgc ccatcttggc 101521 tcactgaaac tttcacctcc tgggttcaag caattctcat gcctcacttt cccaagtagc 101581 tgggactaca ggcaccctcc accatgcctg gctaattttt gtagttttag tagagacagg 101641 gtttcaccat gttgcccagg ctggtcttga actcctgaac tcaagcgatc tgcccacctc 101701 agcctcccaa agtgctggga ttacggtcat gagccaccgc agctgacctc ttttattatt 101761 attattacgg tattatgtat ggggtcctca tcttaatcac tatactaaac tacctcacaa 101821 tcattgttct ctctaacaat ctccccaaat cttctcttct tgtcaccttc accctttact 101881 gaccactcta tggagggagc ataatataaa aaaaaaaact ctacaactac taatatatat 101941 ttttagttta tgtgtgaggc agtattctaa gcacttgaca tttatttact tgtgtaactc 102001 ctcaataatg ctgtaaggta gcattattgg gcatctcccc attttacaga tgaatgagta 102061 aattgtagtg taactgagag gctgtggtgt tggagtcaag aaatcttcct tttttaaaaa 102121 agtgacacat gggccgggcg cggtggctca tgcctgcaat cccagcactt tgggaggcca 102181 aggcaggcgg atcacgaggt caggagattg agaccatcct ggctaacatg gtgaaacccc 102241 gtctctacta aaaatacaaa acatcagccg ggcgtgatgg cgggcgcctg tagtcccagc 102301 tactggggag gctgaggcag gagaatggcg tgaacccggg gggcggagct tgcagtgagc 102361 tgagatggcg ccactgcact caggactggg cgacagagcg atactccatc tcaaaaaaaa 102421 aaaaagttat acataataat ggtacatatt ttgggggcac atgtggtatt ttgatacata 102481 catgcaatgt gtgatgatca aatcaggata actgggacat ccatcaaccc aaatattcac 102541 cccttcttca tgttgggaaa catctccgtt cctctcctcc agcaaatttt tttttttttt 102601 tttttttttt tgagacggaa tctcattctg tggcccaggc cagagtgcag tggcgtgatc 102661 tcggcttacg gcaacctccg cctcccaggt tcatgtgatt ctcctgcctc agcctcccaa 102721 gtagctggga ttacaggcac ccgctacaac atccagctaa tttttgtatt tttttagtag 102781 agacgggttc tttcgccatg ttggacaggc tggtctcgaa ctcctgacct caggtgatct 102841 gcctgcctcg gcctcccaaa gtgttgggat tacaggcgtg agccaccgct cccagccaag 102901 caattttgaa ctctatgaca aattattaac tctagtctcc ctactgtatg gtagaacgct 102961 agatcttatt ctttctcaat gtattttcga aggacagctt ccttttgaag cccggctttg 103021 tcatgctttc cacgggactc gagcaaatct cctcccctga cagcttgtgt attcctcttg 103081 ccaggacgtc cccagagccc cttctttgac ctgacatttt tttctgggcg cttctctctg 103141 acccctcggc agggttttct cttcctcttc taagtttctt tccccagtcc cttcctgggc 103201 cttagcaggc caaggttaca atttcaaaag gagttgagtt tcttctggaa aggagctggg 103261 accgaacccc tcccctccat ggtctcttcc cataaatgcc ccctgtgtgc agcgcccagg 103321 ggccccgggg agcctggccg gctgatctgt aattagaaac ttttctgcct tcacgacagg 103381 aggagggagg gcccaggtct ggcccagtca ttttgttcgt catgaattat tcaggagatg 103441 ggaagtggcg tgaagccggc ccctctgctg accaaagcca tgcccaaggc agcccgacct 103501 actcagctct gcctctttca ccgtttcaga tccagagaat aaacaaacca tgtgccaggc 103561 acgtttatgg agggcgtttt catcagactc gatgagggag catgttcttc agctgagagg 103621 ggtgacaggg acttatccaa ggtcacaagc ctaggaagag gacggctgag ctgggagtca 103681 aacccaggaa catgatgcga ggttccgtgc tcatttggtg ctattacaga tgctgggcag 103741 aggaggaggg gaggagaccc caccaggccg gctaggagaa gataaaagat gctttccttt 103801 ctccgcttgc cttgctcttg ctaagtctgg gggatggaga agggggataa tatggaaagg 103861 agacaactaa cctcatggag accagctggg tgacaagcaa gaattattat tgttattgtt 103921 actattacag acaggatctt gctctgttgc ccgggctggc gtgatctcgg ctcaatgcag 103981 ccttgacttc cctggctcaa gagatcctcc catctcggcc tcctgagtac ctgggactac 104041 aggtgcacgc caccacaccc agctaatttt tcatattttt tgtagagatg aggtctttct 104101 atgttgccca ggttgatctt gaactcctga gctcaagcga tcctcctgcc tcagcctccc 104161 gaagtgctgg gattacaggc atgagccacc gcacccagcc aagcaggtat tattttgcct 104221 gcagtattta atagaatcct cgctaacctc tgaggaaggt gttattatgc ccattttatg 104281 gatgtagaaa caaagatctt gtttgttttt ttgagatgga gtttcgctct tgttgcccag 104341 gctggagtgc aatggcgcaa tcttggctca ctgcaacctc cacctcctgg gttcaagcga 104401 ttctcttgcc tcagcctccc aagtagctgg gattacaggc aagcaccacc acacctggct 104461 aatttttgta tttctagtag agacgaggtt ttgccatgtt ggccaggctg gtctcaaact 104521 cctgacctca ggtgatccgc ccactttggc ctcccaaagt gctgggatta caggcgtgag 104581 ccaccgtgcc tggccacaga gatcttaaag gagtttattc aagacacata acagctggtt 104641 agcggcaggt taggggctca atcttggcac ttctctgtgc gttgcactct ttctcctcca 104701 ctcctgtggg tagggaagca gggctaccga ggaatctcag ctcctgagtc cacaatggtc 104761 cttggatcag acttgccttg ttgtcacacc tctcagtgtc caggcactgc tccaggataa 104821 accctcatga gtgtgtggat gatatctgcc atcccacatt ggttaacctt gcaaggtggt 104881 cctttagaag gctcagggct tcctcattgg cacggagctg gcaattctga gagatagatc 104941 agctctaact gtagttcctg gagaaactaa agcacagaat ttggaaagga tgccctggct 105001 ccactcctta ctctctgtat aaacttgggc caattcctta acctctctgt caatttgccc 105061 attgatgaag tgagatgaag ataagaaata aacatggcca ggcgtggtgg ccatgcctgt 105121 aatcccagca ctttaggaag ccgaggtggg cagttgcctc agcttaggag ttggagacca 105181 gcctgggcaa catggtgaag ccccagctct agtaaaaata caagaaaatt actcctgatg 105241 acagcttctc attaaatttg gttaacagag aggcagcaac taccagatgg tgaacatgtg 105301 tcttctggaa acagcccccc agttttccct ggggaactct ccctctccca atatcagttc 105361 ctgtatcttg gatggaactt actctgccca cctgtgcctc cagggttggc cctgggatcc 105421 agcctggcca caggattggt tgaagaacag gcatatgacc caatgggagc caatcagtgg 105481 gagacttgag atttttgctg gagagaggca ctgtttttta accaacagga aaaagcatgg 105541 gtggagctcc tgagagcctt cttgtcaccc tggggaggag cgtgtctgtc tgagattgaa 105601 accacagaag aaagcagagt caagacatgg caaggacatg aattggaatc tttgaactcc 105661 ttgatccagt tgtaaatcag taaataataa taattattat gattctttgc tttgtttgca 105721 tcagtgagtt ttctgtcatt tacaactagg aatgctgagg aaacttgcca acaatgtgcc 105781 aatgtgtgtg ctctctctct ctctctctgt ctatatatat gtacatatat atatatatat 105841 atatattttt tttttttttt tgagacggag tcttgctctg ttgcccaggc cacagttcag 105901 tggccgatct cagctcactg aaacctccat ctcctgggtt caagcaattc tcccgtccag 105961 cctcctgagt agctgagact acagggcaca gcatcacgcc cagctaattt ttgtatttct 106021 agtagagatg gggtttcacc atattggtca ggctggtctc gaacacctga cctcaggtga 106081 tctacctgcc tcagcctccc ggagtgctgg gattatagga ttacaggcgt gtgctatata 106141 tgggccagac gcggtggctc acgtctgtaa tcccagcact ttgggaggct gagatgggtg 106201 gatcatgagg tcagtagttc gagaccagcc tggccaatat ggtgaaaccc tgactctact 106261 aaaaatacaa aaattagcta ggtgcggtgg cgtgagcctg tagtcccagc tactcaggag 106321 gctgaggcaa gagaatcgct tgaacccgga aggtggaggt tgcagtgagc caagatcacg 106381 ccaccgcact ccagcctgag tgacatagtg agactccgta tctacgaaac aaaacaaaca 106441 atatatgtat atttgttttg ttttgttttg tagagacagg gtctcgctat gttgcccagg 106501 ctgatcttca acacctgacc tcaagcgatc ctcccacctc ggcctcccaa agtacagatg 106561 tgagccactg tgcccagcca cacatctttc tctctctcca tatatatata tggtggtgca 106621 cacctgtaat ttcagcactt tgggaggctg agggaggagg atcacttgag cccaggagtt 106681 tgaggccagc ttgggcaaca tagtgagagc ccatctctat ttattgaaaa taaaacaaga 106741 aataaaacaa aatggggcca ggtgtggtgg ctcatgtctg ttatatatat atacacacac 106801 acacacacac acacatatat atagtgtata tatatgtata tgtgtatata gagtataaat 106861 atatatactc tatatacata tatacatata tacacatata cgtatatata gataaatata 106921 tatacatata tatttataat atatatgtat atagaggata tatactatag atcctctata 106981 tacatatata cacacacata tatagtgtgt gtgtatatct atctttgatt aaaataacag 107041 tatatgctga agagtcactt ctaattatta cttttgtttt acaaacttag gtcaagagag 107101 aatgtggagt aaggccttaa gattcataac aacgcctagg tgcaaatgaa gggcctttac 107161 tcactggact tatgctgtca tggagtcata gacttagagt catggattgg ctttaactat 107221 ggttttcctt tctgctttca ggaaaacatt taatttccac cctgggcgta gtctatgccc 107281 agcttccttg tggtgcgctg tttctgtctg ccttttgtct gtgtgctggt ggtgtcagag 107341 ccagtaacta tctgtttaca gtttcagaca gtagcattaa ttgccttttg aacctcccat 107401 tttcctccaa gaactccctg cctaactgtc tcaactgtct ccctggagtt tcttttctgt 107461 gtggaaatgt ctattttttt tcccataaaa tctgaccacc tgcaaatccc tacatattct 107521 cttgtgagta ggcaacatga gagaaagaca tgtaggtggc agataatctg cgcaaatcag 107581 agcttacagg atgaagcttc taggccaaac aacaaggcag ctgttgggtg gcagaaagag 107641 ccctggaatt ggaggcagac cagcaagtga aatctggttt tgttacttac ttgctctgtg 107701 acctgggaca agttactgca tactgctgag cctcagtctc ctcatccata aaatgggcac 107761 agactgggtg cagtggtgca cacctgtaat tccagcactt tgggaggctg aggcaggagg 107821 atcacttgag cccaggactt tgagactagc ctgggcaaca tagcgagatc ccatctctat 107881 ttctttaaaa caaaacaaaa aataaaataa aatggggcca agcgtggtgg cttatgcctg 107941 taatcccagc actttgggag gctgaggtgg gtggattgcc tgaggtcagg agttcgagac 108001 cagtgaaact ccgtctctac taaacataca aacatgtcta caacatggtg aatctccgtc 108061 tctactaaaa atacaaaaat taactgggca tggtggcaca cacctgtaat cccagctact 108121 tgggaggctg aggcaggaga ctcgcttgaa cctgggaggc gtaggttgca gtgaggcaag 108181 atcatgccat tgcactccag cctgggcgac aagagtgaaa ctctgtctca aaaaaaagaa 108241 aagaaaataa tacctgtttc acaaggttgg ataatttacc cgtatgacaa aggcactgtc 108301 ttctggaaaa gcacatttct ctgcccttta cttgtctagt atgggttcaa aatagtaata 108361 aaactaacat ttattgaaca ctgagtttga gctcagcatt atttcagcct tacgacaaat 108421 ccttcagggg gccagccact gtggctcaca cctgtaatcc caatactttg ggaggccgag 108481 gtggcaggat cacttgagcc caggggctcg agaccagcct aggcaacata gggagaccct 108541 gtctctatta aaaaaataaa taaaataaaa aataaaaata aatttaaaat accaaaccct 108601 tcaaagttgt tgctattata atactcccac ttcacaggga tggaagtaga ggatccgaga 108661 gtaagttgct gaaatcacac agctaagtgg caaagctggg atttgaaccc aacagtttca 108721 ctttacagct tactcttaac caggaccata tggcctccac tgtggtagta gatatcagcc 108781 acccacagtg gaggccagat agggctaaaa gctgtgggtg ctgcctcctg gggaaaacca 108841 cattgttctc cgacttccct ggcccatatg ccaactctac ttcataaaga gctgattttt 108901 tttttttttg agatggaatc tcgctctgtc acccagactg gagtgcaatg gcgtgatctt 108961 ggcctactgc aacctctgtc tgctgggttc aagcaattct cctgcctcag cctcctgagt 109021 aactgggatc acaggtgtgc caccatgccc ggctaatttt ttgtaatttt agtagagacg 109081 gggtttcacc atgctggcaa tgctggtctc caaatcctga ccttgtgatc cgcctgcctc 109141 agcctcccaa agtgctggga ttacaggcgt gagccaccgc gccctgcaag agctaatgtt 109201 ttattttgtt attttttata gagacaaggt cttgctctgt caccccggct agagtgtagt 109261 ggtgtgatca tagctcactg cagcctcaaa cttttgggct taagtgatcc tcctgcctca 109321 gcctctcaaa gtgctgggat tacaagtgta agccattgtg cctggccagc ttgatgaggt 109381 ttttttgttt gtttgagaca gaatctggct ctgttgccca tactagagtg caatggcatg 109441 atgtcagctc actgcaacct ccgcctccgg ggttcaagca attctcctgc ctcagcctcc 109501 caagtagctg ggaccacagg cgtgtgccac catgcccagc taatttttgt atttttagta 109561 gagacggagt tttgccttgt tggccaggtt ggtcccgaac tcctgacctt aagtgatcca 109621 cccacctcgg cctcccaaag tgctgggatt aagggtgtga gccaccgcct cccacccagc 109681 ttgatgtttt aagagtccag accctaacta ctcttgggag acatggtggg aataaggaag 109741 tggcagctgg gctttggaat ccaaaccctg gcccctcctg gacactcatg cctcctgtgt 109801 accaagggga aggtggagaa accaaggaga taggtcctgt gggagcctgg acaggggaat 109861 ccatgggcac atacatctct acgtgcttgg gcaccatgat gtcagcagag cactggttaa 109921 cacatgcatg cacccacgcc gtcagactca cggtgaacac gtgagtacac atgtgcatat 109981 ctgtggctcc acaagtacgc atatccctca ctcacacact cacaggtccg cccaccctgc 110041 actgaattcc tggattttgt cagaacccca ttagggccgg gcgcggtggc tcacgcctgg 110101 taatcccagc actttgggag gccaaggcag gcagatcact tgaggtcagg agtttgagac 110161 cagcctggcc aacatggtga aaccccatct ctactaaaaa tacaaaaatt agccgggtgt 110221 ggtggtgcac acctgtaatc ccagctactc aagaggctga ggcaggagaa ttgcttgaac 110281 tcaggagctg gaggttgcag tgagccgaga tcgcaccact gcattccagc ctgggcaaca 110341 gagtgagact ctgtctcaaa aaataaaaaa aaatttaaaa aaaagaaaag aaaagaaaaa 110401 agaacctcat taggcagaat agcagaatag ctaggtcaag tgtacaactc tgctcttgcc 110461 tgttgggttt gaatcctggt tctgccaaat accagccaag tctttgggca aatcctctca 110521 tctttctgtg cctctactta cccatctgca aatgggagta ataataatat ctgttccttt 110581 gagttgttgg gagtttaaag aattgtaggg tgtcgccggg cacggtgtct catgcctgta 110641 atcccagcac tttgggaggc cgaggcgggt ggatcacgag gtcaggagat cgagaccatc 110701 ctggctaaca tggtgaaacc cctgtctcta ctaaaaatat aaaaattagc tgggcatggt 110761 ggtgtgtgcc tatagtccca gctacttggg aggctgaggc aagagaattg cttgaacccg 110821 ggaggaagag gttgtggtga gccaagatca tgccactgca ctccagcctg ggtgacagag 110881 caagactttg tctcaaaaaa aaaaaaaaaa aaaaaaaaag gtcctggtgc ggtggctgac 110941 gcccgtaatc ccagcacttt gagaggccaa ggtgggtgga tcatgagatc aggagttcaa 111001 gaccagccta cccaatatag tgaaactccg tctctactaa aaacacaaaa cttagttggg 111061 catggtggcg tgtgcctgta gtcccagctg ctccagaggc tgaggtaaga gaatcacttg 111121 aacccaggag gcagaggttg cagtgagcta agactgcacc actgcactcc agcctgggca 111181 acagggcaaa actctgtctc aaaaaaaaat ttgtaaaata ttaaaaatgt ataaatatat 111241 atatagagaa aactagaagg aaataaatta aaatgttaat atggttgcct tgaataatat 111301 gggggattat attttattta atcaacacta tttctttaag aaactgtgtt tatttaatgg 111361 acattttttg tttttttggt ttttttttga gatggagtct tgctctgttg ccaaactgga 111421 gtgcagtggc gagatctctg ctcactgcaa cctccgcctc caaagttcaa gcgattctcc 111481 tgcctcagcc tcctgagtag ctgggactac aggtgcgcct caccatgccc agctaatttt 111541 tatattttta gtagagacag ggtttcacca tgttggatta caggtgtgag cctccgtgcc 111601 cggcctgtat tttctttaat aaatatatat tagaattata tatagcaaaa ccacatttct 111661 tttgttttgt tttgtttttg agacagagtc ttgctctgtc acccaggctg atatgcagtg 111721 gcatgatctc tgctcactgc aacctctgcc ttccgggttc aaacaattct cctgcctcag 111781 cctcccaagc agctgggatg acaggcgcct gccaccaccc ccggctaatt tttgtatttt 111841 tagtagagat agagtttcac catgttggcc aggcttgtcc tgaactcctg accttaggtg 111901 atccacccac ctcagcctcc caatgtgtag ggattacagg catgagccac tgcatccagc 111961 cctaaaccac atttctatta gctagttttt aaaaattaca aatgtaacat gtgctcacat 112021 taaaagaaaa aaatgcgtca gtacacaaag gtataagatg aaaaagtaaa agttctttac 112081 cccttagact ccttccctaa atgtaactgt taccaatttc atgtatatcc ttccaaaaat 112141 gttccatgcc tatatactta gatatgatta ttatatttta aaaaatagat ataggctact 112201 tgggaggctg agatgggagg atcacttgag cccaagaggt cgaggctgca gtgaggtata 112261 atcacgccat tgcactctag tctgggcaac agagtgagac cgtctcaaac aaaacaaaac 112321 aaaaatagat atagaatcac acaatccaaa tgattttgga acttgcttag ttatttacta 112381 tagtacagat ttttggatat ctttgcatgt atgatatact cttttttttt tttttttgag 112441 acggagtctc actgtcaccc aggctggaat gcagtgacgt gatcttggct cactgcaacc 112501 tctgcctccc agattcaagc aattctcatg cctcagcctc ccgagtaact gggattatgg 112561 gtgagggcca ccatgcccgg ctaatttttg tatttttagt agagacaggg tttcactatg 112621 ttggccaggc tggtctccaa ctcctggcct caagtgatac acccgcctca gcctcccaaa 112681 tggctgggat tacaggcgtg agtcgctgca cctggcctga tatactcttt ttttcttttt 112741 ctttctttct tttttttttt tttttttgag atggagcctt gctctgtcgc ccaggctgga 112801 gtgcagtggc acgatcttgg ctcactgcaa cctctgcctc ccgggttcga gtgattctcc 112861 tgcctcagcc tccctagtag ctgggattac aggtgcgcac caccacgcct ggctaatttt 112921 tgtattttta gtagaggcag ggtttcacca tattggctag gctgatcctg aactcctgac 112981 ctcgtgatcc acccgcctcg gcttcccaaa gtgctgggat tacaggcctg agccaccacg 113041 cccagccgat atactcttat aatgactaca aaacatttca ctgtaaagtt acatgatgat 113101 ttacttactt aatttcctac tgatgaccat caagtttgtt gctagccttt gcttttataa 113161 acaccttcaa aatgaacaaa tttgtttcta ctatttacct actcctactc acatgaatag 113221 aattttagat gtggaattgc tggaccccaa gggcttgtgt atttcatatt tcaaaataat 113281 gccatccagt aacatacata tatataggtg gtcctctgta tttattcatt ccacatccct 113341 ggactccacc aaccttgatg gaaaatattt gggaaaaaac cctgcatctg tattgaacat 113401 agacagactt ttttccttat cattattccc taaaaaacac agtataacaa tgatttaccg 113461 ggcattgaca ttgtattagg gatcataagt aatctagaaa tgacttagag tatacaggag 113521 ggtatgcata gataatatgc aaatactatg ccattttata tcagggactt gaacatctgt 113581 ggattttggt atctgaggga ggtgccagaa ctaatccccc agggacactg aatgacgact 113641 gtatataaag cacaagaaga gaaaagatga atggaagaaa aaaacctcag tgtgaggcca 113701 gatgccatgg ctcacgcctg tgatcccagc agcactttgg gaggctaagg cgggtggatc 113761 accaggtcag gagttcgaga ccagcgtggc caacatggtg aaacccggtc tctactaaaa 113821 atacaaaaat tagccaggtg tggtggcgcg tgcttgtaat cccagctact caggaggttg 113881 aggcaggaga atcgcttgaa cccaggaggc agaggaggca gtgagccaag gtcacaccac 113941 tgcaccccga ctcaaaataa ataaataaat aaaaagaata aaagaaaaaa aaagaaaacc 114001 tcagtgtgtt aagttgtacc caaagtgttt gtgcagtata agtatgcaaa atgaacataa 114061 aataaaaatg aactacttag ccaagaaata aatcatctat ggccatatac ttaaatatta 114121 aaagtgtttt tctttgattt cctttaatgc agaatcaagg aagaaaaaaa atgatattct 114181 ttcatggagc cacagatagt ttttatactt tattttctca atgtcttata tcaaatatta 114241 atttcttttc tgtaatcaga aaacagtacc atggatacta tgaaaaataa cgttgtgctt 114301 tttaattaaa aagcaataca tagtcattgt aataagttta aaaataaaaa atgtataaaa 114361 atgttaataa tgtccctttc tttacctcta aatttacccg ctggccaggc acagtagctc 114421 acacctgtaa tcccagcact tgggaggccg aggtgggcgg aacacctgag gtgagaccag 114481 cctggccaac agggtgaaac cccctctcta ctaaaaatac aaaaattagc caggcatggt 114541 ggcgcgcacc tgttatccca cctactcggg aggctgagga cacgagaatc acttgaaccc 114601 gggaggcaga ggttgcagga agctgaaatt gccccactgc actccagcct ggatgacaga 114661 gtgagactcc gtctcaaaaa taaataaaca aacaaactta cccctcgaaa taaacaatgt 114721 taacggcctg gataggaatc ttttcaaaca tttttctata atagagtgtg tgatttttac 114781 atcagaaatg aattttaaaa ccttaaaaat tttgtttaat tacccctgcc tggcacttca 114841 gcagatcacc cttgctatct ccaaccctgt caagctgtgt gccctctttc tgtgtccaaa 114901 caacttgccc tcaaagaaac ctacttgggg ccctgctccc tggaggcacc tgcgcctctg 114961 ccttggctca ggttcagcag aaatcaaggc cagacccagg aagtgttggc cccagctcta 115021 gcctgatccc ggccctggag agccagcgct ccccagcctg gggagcgagg ggcccggcgc 115081 caggcgaggc cctgggctcc cagagaggcc tccttcccgc ccgctttctc taggagtaat 115141 taagagtgaa atcctcctcc tggtgacccc gtgctccagg cccgggctga tttacacatg 115201 aaaggtctac ggagggcctg ggctttctcc ttttcacatt cacaatgagg cacagcctat 115261 aaataaaacc tctctgggaa acaagaggct tcaaaatgtg tgtgtgtgtg tgtgtgtgtg 115321 tctgtccccc agagggagtg tgtgtgtgtg tgtgtccctg tgtgtgtccc agagtgtgtg 115381 tgtgtgtccc agagggagtg tgtttgtgtg tgtgtctgtg agagggagag acagacccag 115441 aaggagagag accagaggga gagggggagt ttgaaagggc tagaaaaggg aaggagtgtg 115501 aataaaagct ggtgttaatt gagcgcttac tgtgtgccag gcaaccatcc tcacagcaat 115561 cctgtaaggg tggttttatg gttcccattt taaagatgaa gaaactgagg ttcaaagagg 115621 gcaagagttg cccaagataa cacagtgaat taaggggaag gcttggactc aggaagtggg 115681 atgaacataa gattcaaatg atgggccggg cgcggtggct tacacctata atcccagcac 115741 tttgggaggc tgaggtgggt ggatcacctg aggtcaggag ttcaagacca gtctgtccaa 115801 catggtgaaa ccctgtctcc actaaaaata caaaaattag ctgagcgtgg tggtgcgtgc 115861 ctgtggtccc tgctactccg gaggctgagg caggataatt gcttgaacct gggaagcaga 115921 ggttgcagtg agccaagatc acgccactgc actccagcct cggtaaaagg gcaaggttcc 115981 gactcaaaaa aaaaaaaaaa gattcaaatg atgacctgtg tctttgtgtc ttgattgtga 116041 tgatgattac aggactatgc atttgccaca tatgtactaa aaagggagaa ttttactgta 116101 tgtatactgc aataaccatg acttttatca tttttaattt tatttatgta tttatttttg 116161 agaaagagtc tcgctctgtt gcccaggcta gactacagtg gcacggtctg agctcactac 116221 aacctccacc tcccaggttc aagcaattct cctacctcag cctcctgagt agctgggact 116281 acaggtgtgc accaccatgc ccagctaatt tttgtatttt ttagtagaga tggggttttg 116341 ccatattggg caggctggtc tcaatctcct gacctcaggt gattcacctg cctcggcctc 116401 ccagagtact gggattacag gtgtgagcca ccgtgcccgg ccaatcctga cttttaaaaa 116461 ggtaaaataa aataggccag gtgtggtggc tcacgtctgt aatcccagca ctttggaagc 116521 cgaggtggga ggagctcttg atctcaggag ttcaagacca gcctgggcaa catggcaaga 116581 tcctatctct accaaaaaaa aaaaaaaaat taactgggca cagtggcaca tgcctgtagt 116641 ccttgctagt ggggaggctg agatgggagg attgcttgag gctagggaat tgaggctgca 116701 gtgagttata tttccaccac tgcactccag cctggacaac ggagtgagac catgtctcaa 116761 aatttacaaa ataataataa taaaataaaa tacatacaaa aaaacagagg ccaatgggca 116821 gtgattggtg aggtatcctg gatacttgag gcctctctat accattccct ctagcaggcc 116881 tgttaacctt attcacctct gtatccccag gctcagggct caataatggg gaatgtagat 116941 attagggaaa tgtttgctca attgattgaa tacatgggct aagtcattct ctctggactc 117001 attttgccat ctataaaatg agaaaatgat gaaacactat ggcaaacacc tcctaatcag 117061 cctcagaggt caccacctgc aggaagtctt catggattcc cttccacccg cagattgatc 117121 aagggcctct tctggcctct tacagagcca ggttgtgttc ccatccctgc actgactata 117181 gtgtacaata tttgtctggt tctgctctgt ggaggcacct gtgcctctgc cttggcccgg 117241 gttcagcaaa aatcaaggct aggcctagaa agtgatggga aatactccac gtgaatattc 117301 aggttctttg cagtagggcc aatgtccatt ttgctcctct ctgtacccca agtacctatc 117361 ccatagcatg tagaatgttt gagaaattaa tgaattaatg catttatgag ctctggtctc 117421 aggttcaaat gtgaatcatg gctgggcgcg gtggctcaca cctgtaatcc cagcactttg 117481 ggagaccgag gagggtgcat cacctgaggt caggagttgg agaccagcct ggacaacatg 117541 gcgaaaccct gtctctacta aaagtaccaa aattagccag gtgtggtggt gcactcctgt 117601 agtcccagct atttgggagg ctgaggtagg agaattgctt gaatctggga ggcagaggtt 117661 gcagtgagcc gagattacgt cactgcactc cagcctgggc aacgagaggg aaactccatc 117721 tcaaaaaaac agaaagtgaa tcatgctggt gttggctggg gctgggcaag ggtctcaggc 117781 agcagaaacc ggccaatgga gaaatccctg tgcagtaaag atgagctgaa aggcctgact 117841 tctagagccc caagaaaaca atcagagggg cagatggttt ggggaagggg ggaggctctg 117901 gtgagtgggc aatctgcctg catctgtttg aggctgagag tgaggtgggg cacatggata 117961 tggttaggtc ccagctctgc tacttcttag ctatcattca gccccaccaa ccctcagtct 118021 ccatatctgt gaaatgggca gggaatataa tgatacctgc ttctacttta acatgtttat 118081 tcggagggtg aaggtacata ataaggtcaa agtgcctagt cagaaatagt ttgtaattta 118141 tttatttaga gactgagtct cattctgttg cccaggctgg agtgcagtgg tgcgatctct 118201 gctcactgca gactccacct cctgggttca agtgattctc ctgtctcagc ctcccgagta 118261 gctgggatta caggtgccca ccaccacacc cgggtaattt ttatattttt agtagagacg 118321 gggtttcaca atgttggcca ggctggactt gaactcctga cctcaagtga ttcgcccacc 118381 tcagcctccc aaagtactgg gattataggt acttatttat ttttatagag atggggtcct 118441 actatgttgc ccaggctggt cttgaactcc tggactcaag caatcctcct gcctcaggcc 118501 tcccagagtg ctgtgattac aggcatgagc cactgttccc caccagaaat attttaaact 118561 ccataacatc ttttcctctt ttctcccaaa tgtttttctt tcttcctttt ttttttcctt 118621 tgagacaaag tctcactcag ttgcccaagc tggagtacag tggcacaatc acggctcact 118681 acggcttcaa tctcctgagc tcaagccatc ctcccagctc agcctcctga gtagctggaa 118741 ctacaggcac atactaccac atgcagctaa tttaaaattt tttgtggaga caggagtttc 118801 actatattcc ccagactggt ctccaactcc tggactcaag tgatgctccc acctcagtct 118861 cccaaagtgc tgggattaca ggtgtgagct accacaccct actcccatgt ttttcaccct 118921 aattagagct gccactcatg aattcctaaa aataatatta gctttcctaa catttagtga 118981 gtgtcttctg tgtgtcaggc actgtgccta cacatattat ttaatttaac cttcacaatc 119041 attctgtaag gtaggtctgc tattattctt atcagagaag aaataggttc agagaggtgc 119101 agtgacttgt ctgagatcac acagcctctg agtggtaaac ctgtgaggtg agccatctgt 119161 taatttgagg gaacaccaga atcctttgga gggcttgttc aaacacagat ttctgaaccc 119221 catcccaaag tatctgatgc agcagttctg gggcggctcc caagaattta catttctagc 119281 aagttcccag gcgatgttgc tgctgccagt cctggggcac actttgagaa acatggccta 119341 ataccaagat ttttccactg tcattatccc aattaggcat gttatgtcaa gtccagcctt 119401 ttacaatacg ctcgattcaa ggaatttgaa gtctagtata gttcattcag ccctaggcat 119461 atatgaccct atagtcttaa ttaagtcttc aaaacacaca cacacacaca catcccaact 119521 ttcatctaca ttaaaggtac aaaaagctgg aaaatataaa tgagtaaaaa ggacagaatc 119581 aaccctggtc acttcaatta ctccagtgag atcacttttt gttatacttt ttgtttatgg 119641 ttgtataggg aaagctttct ggataaacca actcactgga ttaaaaattc atgggaggcc 119701 agacgtggtg gctcattcct gtaatcccag cgacttgaga ggccaaggta ggaggattgc 119761 ttgaggccaa gagttcagaa gcagcctggg caacttggca agaccccgtc tctaccaaaa 119821 aaaaaaaaaa aaaaatccct ggaattggct cactagctta acaaatgctc gctttgcctg 119881 ctataaaatc tagaacaatg tgaggcccac agcagctggt gggttcattt ctagtgtgac 119941 aagccactaa ctcctgttga gtgaagatgt gaccatgtgg ttatcaagag tcaggggggt 120001 tggttctccc atcagttact cctttttttg agacggagtc tcactccgtt gccaagggtg 120061 gagtacagtg gcataatctc ggctcactgc aaccttcacc tcccagcttc aagcgattct 120121 tgtgcctcag cctccggagt agctgggatt acgggcacat accaccatgc ctggctaatt 120181 tttgtatttt tagtagagac aatgtttcac catgttggcc aggctggtct cgaactcctg 120241 acttcaagta atctgcctgc ctcggcctcc caaagcgctg ggattacagg catgagccgc 120301 tgcgcccggc catcagttac tcttgtaatt aaatcttata attatatagg acaaactgat 120361 aaaatatgag tcggaaaaga agacggatgt ttctatgaat gcagttgaat gctttggaaa 120421 gactccaaaa agttagttgg tacatatcat tatacatctg tccaaaccca cagaatgtgc 120481 agcatcaaga gtgagccata gctgggcgcg gtggctcacg cctgtaatcc caacattttg 120541 agaggccgag gcgggcgcat catgacatca gaagatagag accatcctgg ccaaaatggt 120601 gaaaccccgt ctctactaaa aatacaagct gggtgcggtg gctcacgcct gtaatcccaa 120661 cactctgaga ggccaaggag ggcggatcat gaggtcagaa gatagagacc atcctggcca 120721 aaatggtgaa actctgtctc tactaaaaat acaggctggg cgcggtggct catgcgtata 120781 atcccagcac tttaggaggt cgaggcaggt ggatcacctg aggtcagaag ttcaagtgcc 120841 tgtaatccca gctactcggg aggctgaggc aggagaattg cttgaacctg ggaggcagag 120901 gtttcaggga gctgagatcg caccgttgca ctctagcctg ggagacaaga gcgaaactct 120961 gtctcaaaaa acaaaaaaag aaaaaatata aaaattagct gagtgtggtg gcacgtgcct 121021 gtagtcccag ctacttgtga ggctgaggca ggagaagcgc ttaaacccgg gagtcggagg 121081 ttgcggtgag ccgagatcac gccactgcac tccagcctgg gcaacaggag cagaactctg 121141 cctcaaaaac aaacaaacaa acaaaacacc actcatgtag ggtacgttga taatggggga 121201 agctgtgcct gtgtctggag aggagagaga tatataggaa atctctgtac cttctgctta 121261 atattgctgt gaacctaaaa ctgctccaaa aaataaagtt tttgtttgct tgtttgttta 121321 ttaagacagg gtctcggctg ggcatggtgg ctcacgcctg taatcccagt actttgggag 121381 actgaggtgg gcggatccca aggtcaggag atggagacca tcctggctaa cacggtgaaa 121441 ccccatctct actaaaaata caaaaacaaa attagtcagg cgtggtggcg ggcgcctgta 121501 gtcccagcta ctcgggaggc tgaggcagga gaatggtgtg aacccggaag gcggagcttg 121561 cagtgagccg agatcgcgcc actgcactca agcctcggca acagagcgag actccgtctc 121621 aaaaaaaaaa aaaaaaaaaa aagacagggt ctcactctgt tgtccaggct ggagtgcagt 121681 gacgcaatca tggcttattc agcattgacc tcctgagctc ctgtgatcct cctgcctcca 121741 ccttccgagt agctgagact acaggtgtgc accaccatgc ctggctaact tttgtatttt 121801 ttgtagagac gtgggtctca ctatgttgcc caggctgatc ttgacctccc gggctcaagc 121861 aatcctcctg ccttggcctc ccaaagtgct aggattacag gcatgagcca ccatgcctgg 121921 ccaataaagc ttaattaaaa agaaaagaag ccaggtgcgg tggttcctgc ctgtaatccc 121981 agcactttgg gaggctgagg tgggtgcatc acctgaggtc aggagtttga gaccagcctg 122041 gccaacatag tgaaacctca tctctactaa aaatacaaaa cttggccggg catggtggcg 122101 ggtgcctgta atcccagcta ctcgggatgc tgagacagga gaattgcttg aacctgggag 122161 gcggaggttg cagtgagcca agatctgcca ctgctctcca gcctgggcaa cacagtgaga 122221 ctctgtctca aaaaataaac tactgggagc cgtgactcat gcctgtaatc ccagcacttt 122281 gggaggctga ggcgggcgga tc // LOCUS AC004030 39631 bp DNA PRI 23-JAN-1998 DEFINITION Homo sapiens DNA from chromosome 19, cosmid F21856, complete sequence. ACCESSION AC004030 NID g2804590 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 39631) AUTHORS Lamerdin,J.E., McCready,P.M., Skowronski,E., Adamson,A.W., Burkhart-Schultz,K., Gordon,L., Kyle,A., Ramirez,M., Stilwagen,S., Garnes,J., Danganan,L., Poundstone,P., Christensen,M., Georgescu,A., Avila,J., Liu,S., Bruce,R., Quan,G., Montgomery,M., Ow,D., Nolan,M., Trong,S., Kobayashi,A., Olsen,A.O. and Carrano,A.V. TITLE Sequence analysis of a 3.5 Mb contig in 19p13.3 between CDC34 and D19S342 JOURNAL Unpublished REFERENCE 2 (bases 1 to 39631) AUTHORS Lamerdin,J.E. TITLE Direct Submission JOURNAL Submitted (23-JAN-1998) Joint Genome Institute, Lawrence Livermore National Laboratory, 7000 East Ave., Livermore, CA 94551, USA FEATURES Location/Qualifiers source 1..39631 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="F21856" /chromosome="19" /map="19p13.3 between CDC34 and D19S342" /cell_line="UV5HL9-5B" /clone_lib="LL19NC02 F chromosome 19-specific cosmid library" /note="cosmid library constructed at LLNL from flow-sorted chromosomes from hybrid UV5HL9-5B, which carries chromosome 19 as its only human chromosome" repeat_region 326..601 /rpt_family="Alu" misc_feature 687..837 /note="DPS similarity to gi|1665807|gnl|PID|d1014090 (D87460) KIAA0270 [Homo sapiens] (49..98); 100% identity.~~predicted exon, program: grail2exons_human_1.3, frame: 0, quality: excellent, score: 100.000" CDS join(687..837,3765..3786,5611..5670,9944..10075, 15877..16406) /note="human protein of unkown function, partial coding sequence" /codon_start=2 /product="KIAA0270" /db_xref="PID:g2804591" /translation="LEKEIEVLERGDSAPATAKENAAAPSPVRAPAPSPAKEERKTEV VMNSQQTPVGTPKDKRVSNTPLRTVDGSPMMKAAMYSVEITVEKDKVTGETRVLSSTT LLPRQPLPLGIKVYEDETKVVHAVDGTAENGIHPLSSSEVDELIHKADEVTLSEAGST AGAAETRGAVEGAARTTPSRREITGVQAQPGEATSGPPGIQPGQEPPVTMIFMGYQNV EDEAETKKVLGLQDTITAELVVIEDAAEPKEPAPPNGSAAEPPTEAASREENQAGPEA TTSDPQDLDMKKHRCKCCSIM" repeat_region complement(1279..1531) /rpt_family="Alu" repeat_region complement(1618..1780) /rpt_family="Alu" repeat_region 2005..2295 /rpt_family="Alu" repeat_region complement(2522..2808) /rpt_family="Alu" repeat_region 3308..3567 /rpt_family="Alu" misc_feature 3765..3786 /note="DPS similarity to gi|1665807|gnl|PID|d1014090 (D87460) KIAA0270 [Homo sapiens] (99..106); 100% identity." repeat_region 3885..4415 /rpt_family="Alu" misc_feature 5611..5670 /note="DPS similarity to gi|1665807|gnl|PID|d1014090 (D87460) KIAA0270 [Homo sapiens] (107..126); 100% identity.~~predicted exon, program: grail2exons_human_1.3, frame: 2, quality: excellent, score: 89.000" repeat_region 6360..6651 /rpt_family="Alu" misc_feature 7477..7535 /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: good, score: 70.000" repeat_region 7710..8129 /rpt_family="Alu" repeat_region 9003..9571 /rpt_family="Alu" misc_feature 9944..10075 /note="DPS similarity to gi|1665807|gnl|PID|d1014090 (D87460) KIAA0270 [Homo sapiens] (127..170); 100% identity.~~predicted exon, program: grail2exons_human_1.3, ~frame: 0, quality: excellent, score: 98.000" repeat_region 10454..10701 /rpt_family="Alu" repeat_region complement(11471..11565) /rpt_family="Alu" repeat_region 11710..11753 /rpt_family="Alu" repeat_region 11923..12211 /rpt_family="Alu" misc_feature complement(13647..13991) /note="DDS similarity to AA340747 EST46017 Fetal kidney II Homo sapiens cDNA 3' end similar to EST containing Alu repeat. Score: 670 Identity: 340/345 (98%)." repeat_region 13711..14006 /rpt_family="Alu" repeat_region 14214..14488 /rpt_family="Alu" repeat_region 14499..14777 /rpt_family="Alu" repeat_region 15012..15294 /rpt_family="Alu" repeat_region 15379..15657 /rpt_family="Alu" misc_feature 15877..16403 /note="DPS similarity to gi|1665807|gnl|PID|d1014090 (D87460) KIAA0270 [Homo sapiens] (171..345); 100% identity.~~(15877..16380) predicted exon, program: grail2exons_human_1.3, ~frame: 0, quality: excellent, score: 90.000" misc_feature 16334..17921 /note="DDS similarity to overlapping ESTs:~(16334..16708) H14167 ym62f07.r1 Homo sapiens cDNA clone 163525 5'. Score: 724 Identity: 372/377 (98%).~~(16334..16735) H14179 ym62h07.r1 Homo sapiens cDNA clone 163549 5'. Score: 756 Identity: 397/406 (97%).~~(16584..16949) D56226|HUM420E08B Human fetal brain cDNA 5'-end GEN-420E08. Score: 699 Identity: 359/365 (98%).~~(17162..16845) AA569621 nm38g01.s1 NCI_CGAP_Pr4.1 Homo sapiens cDNA clone IMAGE:1062480. Score: 525 Identity: 301/319 (94%).~~(17098..17588) AA233201 zr69a10.r1 Soares NhHMPu S1 Homo sapiens cDNA clone 668634 5'. Score: 948 Identity: 488/489 (99%).~~(17384..17921) AA442399 zv70c01.r1 Soares total fetus Nb2HF8 9w Homo sapiens cDNA clone 758976 5'. Score: 1008 Identity: 529/539 (98%).~~(17920..17412) N25536 yx76d01.s1 Homo sapiens cDNA clone 267649 3'. Score: 882 Identity: 497/519 (95%).~~(17921..17542) AA232968 zr69a10.s1 Soares NhHMPu S1 Homo sapiens cDNA clone 668634 3'. Score: 711 Identity: 376/389 (96%).~~(17899..17544) H28126 yo78c03.s1 Homo sapiens cDNA clone 184036 3'. Score: 687 Identity: 351/355 (98%).~~(17921..17444) R60222 yh13e04.s1 Homo sapiens cDNA clone 42841 3'. Score: 745 Identity: 450/499 (90%).~~(17921..17598) AA232859 zr46c01.s1 Soares NhHMPu S1 Homo sapiens cDNA clone 666432 3'. Score: 636 Identity: 321/324 (99%).~~and others ..." repeat_region 18449..18737 /rpt_family="Alu" repeat_region 19160..19436 /rpt_family="Alu" repeat_region complement(19791..20081) /rpt_family="Alu" repeat_region 21935..22125 /rpt_family="Alu" misc_feature 22342..22853 /note="DDS similarity to AA577849 nn24h02.s1 NCI_CGAP_Gas1 Homo sapiens cDNA clone IMAGE:1084851. Score: 1002 Identity: 509/510 (99%)." repeat_region complement(22982..23253) /rpt_family="Alu" repeat_region complement(23377..23513) /rpt_family="Alu" repeat_region 23637..24227 /rpt_family="Alu" repeat_region 25247..25537 /rpt_family="Alu" misc_feature 26539..28318 /note="Overlapping exon prediction and ESTs matches:~(26539..28560) predicted exon, program: grail2exons_human_1.3, frame: 0, quality: excellent, score: 93.000~~Other overlapping matches:~(26809..26520) AA213683 zq92h01.r1 Stratagene hNT neuron (#937233) Homo sapiens cDNA clone 649489 5'.~Score: 432 Identity: 268/302 (88%).~~(26873..27134) AA552431 nk15c04.s1 NCI_CGAP_Co2 Homo sapiens cDNA clone IMAGE:1013574. Score: 504 Identity: 262/262 (100%).~~(27960..28318) AA160684 zo72g01.r1 Stratagene pancreas (#937208) Homo sapiens cDNA clone 592464 5' (1..358); 99% identity.~~(28000..28318) AA179443 zp45b09.r1 Stratagene HeLa cell s3 937216 Homo sapiens cDNA clone 612377 5' (1..325); 95% identity.~" CDS join(26539..28318,29501..29631,31217..31255,33093..33182) /note="hypothetical human protein of unknown function" /codon_start=1 /product="F21856_2" /db_xref="PID:g2804592" /translation="MDRVTRYPILGIPQAHRGTGLVLDGDTSYTYHLVCMGPEASGWG QDEPQTWPTDHRAQQGVQRQGVSYSVHAYTGQPSPRGLHSENREDEGWQVYRLGARDA HQGRPTWALRPEDGEDKEMKTYRLDAGDADPRRLCDLERERWAVIQGQAVRKSSTVAT LQGTPDHGDPRTPGPPRSTPLEENVVDREQIDFLAARQQFLSLEQANKGAPHSSPARG TPAGTTPGASQAPKAFNKPHLANGHVVPIKPQVKGVVREENKVRAVPTWASVQVVDDP GSLASVESPGTPKETPIEREIRLAQEREADLREQRGLRQATDHQELVEIPTRPLLTKL SLITAPRRERGRPSLYVQRDIVQETQREEDHRREGLHVGRASTPDWVSEGPQPGLRRA LSSDSILSPAPDARAADPAPEVRKVNRIPPDAYQPYLSPGTPQLEFSAFGAFGKPSSL STAEAKAATSPKATMSPRHLSESSGKPLSTKQEASKPPRGCPQANRGVVRWEYFRLRP LRFRAPDEPQQAQVPHVWGWEVAGAPALRLQKSQSSDLLERERESVLRREQEVAEERR NALFPEVFSPTPDENSDQNSRSSSQASGITGSYSVSESPFFSPIHLHSNVAWTVEDPV DSAPPGQRKKEQWYAGINPSDGINSEVLEAIRVTRHKNAMAERWESRIYASEEDD" repeat_region complement(28623..28904) /rpt_family="Alu" repeat_region complement(28992..29291) /rpt_family="Alu" misc_feature 29501..29631 /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: excellent, score: 100.000~~Other overlapping matches:~(29501..29631) DDS similarity to AA160684 zo72g01.r1 Stratagene pancreas (#937208) Homo sapiens cDNA clone 592464 5' (359..488); 99% identity.~~(29501..29641) AA179443 zp45b09.r1 Stratagene HeLa cell s3 937216 Homo sapiens cDNA clone 612377 5' (326..462); 90% identity.~~(29501..29631) AA308296 EST179126 HCC cell line (metastasis to liver in mouse) II Homo sapiens cDNA 5' end (23..154); 98% identity.~" repeat_region complement(29957..30245) /rpt_family="Alu" repeat_region complement(30682..30838) /rpt_family="Alu" repeat_region 30850..30928 /rpt_family="Alu" misc_feature 31027..31052 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: good, score: 67.000" misc_feature 31217..31255 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: excellent, score: 83.000~~Other overlapping matches:~AA308296 EST179126 HCC cell line (metastasis to liver in mouse) II Homo sapiens cDNA 5' end (155..193); 100% identity.~~(31222..31255) A133671 zl92e08.r1 Stratagene colon (#937204) Homo sapiens cDNA clone 512102 5' (1..34); 100% identity.~" repeat_region complement(31525..32136) /rpt_family="Alu" repeat_region complement(32246..32552) /rpt_family="Alu" repeat_region 32609..32884 /rpt_family="Alu" misc_feature 33006..33910 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: excellent, score: 95.000~~Other overlapping matches:~(33093..33302) DDS similarity to AA308296 EST179126 HCC cell line (metastasis to liver in mouse) II Homo sapiens cDNA 5' end (194..404); 99% identity.~~(33093..33454) A133671 zl92e08.r1 Stratagene colon (#937204) Homo sapiens cDNA clone 512102 5' (35..406); 96% identity.~~(33110..33517)AA148152 zo31e02.r1 Stratagene colon (#937204) Homo sapiens cDNA clone 588506 5';~Score: 676 Identity: 392/418 (93%).~~(33907..33428) A552295 nk06e08.s1 NCI_CGAP_Co2 Homo sapiens cDNA clone IMAGE:1012742; Score: 943 Identity: 477/479 (99%).~~(33910..33435) AA581989 nn36c08.s1 NCI_CGAP_GC5 Homo sapiens cDNA clone IMAGE:1085966; Score: 944 Identity: 474/476 (99%).~~(33909..33434) AA179299 zp45b09.s1 Stratagene HeLa cell s3 937216 Homo sapiens cDNA clone 612377 3'.~Score: 891 Identity: 472/481 (98%).~~(33476..33682) AA327165 EST30488 Colon I Homo sapiens cDNA 5' end. Score: 399 Identity: 205/208 (98%)~~(33907..33478) AA552252 nk06d08.s1 NCI_CGAP_Co2 Homo sapiens cDNA clone IMAGE:1012719. Score: 843 Identity: 427/429 (99%).~~(33910..33487) A159525 zo72g01.s1 Stratagene pancreas (#937208) Homo sapiens cDNA clone 592464 3'. Score: 840 Identity: 422/424 (99%).~~(33533..33682) AA367214 EST78261 Pancreas tumor III Homo sapiens cDNA 5' end. Score: 296 Identity: 149/150 (99%)." repeat_region 35336..35617 /rpt_family="Alu" repeat_region complement(35653..35820) /rpt_family="MLT1" repeat_region complement(36427..36733) /rpt_family="Alu" misc_feature 36816..36896 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: good, score: 61.000" repeat_region 37158..37426 /rpt_family="Alu" repeat_region complement(37505..37771) /rpt_family="Alu" repeat_region 37835..38111 /rpt_family="Alu" repeat_region 38131..38297 /rpt_family="Alu" repeat_region 38516..38588 /rpt_family="MIR" repeat_region 38692..38982 /rpt_family="Alu" repeat_region 39230..39525 /rpt_family="Alu" repeat_region complement(39570..39622) /rpt_family="Alu" BASE COUNT 8458 a 10781 c 12095 g 8297 t ORIGIN 1 gatctgcttg tcccgttatc tcttctaata cttggaaagt gacatggttt ttccagtttc 61 cctggtgaga taaagtttcc tgttaaaatg tgcaaatttg agcgggaaag gagagttcat 121 ttgatacaga cagtgacata aacagcgcag gccaggggtc aggggacaca gggagtcagc 181 cctggtttgg atcccgcctc tgctacctgc tggccatgtg gcttttgcac tccgtgcctc 241 agtttccccc atcatgaaag agcacgatgc ctcttggggc tgcctgggag ccttaaattc 301 agagagctag gccgagtgcg gtgggtcacg cctggaatcc cagcactttg ggaggatgag 361 gcgggcagat cacctgaggt cgggagttca agaccagcct ggccaacatg gtgaaaccct 421 gtctctgcta aaaatacaaa aaattagctg ggtgtggtgg tgcacgcctg taatcccagc 481 tactcgagac cctgaggcag gaggatcact tgaacccggg aggtggaggt tgcagcgagc 541 caagatcacg ccattgcact ccagcctggg caacaagagc gagaatctgt ctcaaaaaaa 601 agacactggg gagctgtcgt acagaaggtg cccagagccc caccggggat gcaggagtca 661 ccctcacagg cacaccctct ccccaggttg gagaaggaaa ttgaggtgct ggagcgtgga 721 gactccgccc cagccactgc caaggagaac gcggcggccc cgagcccagt ccgggcccca 781 gccccgagtc cagccaagga ggagcgcaag acagaggtgg tgatgaattc acagcaggta 841 agggggtgac tggggggagc ggatccccag gcacccactc ccgctggccc cagagcgagg 901 ccccagccct tccatgtcag cgtagctgag gggacaggca cagactagag ccaggcactt 961 gatttccagc tcaccggagc cacttcccag ctgtgtgagc cgggacacgt ggtttcgcct 1021 tgctgggcct cggtttcccc atctgtaaga tgggggagta atcagagcag ccaccgctgg 1081 gggcatcaga aggactcagg gaagcaggag acccacctgg gggcatcaga agggctcagg 1141 gaagcaggag acccacctgg gggcatcaga agggctcagg gaagcaggag acccgcctgg 1201 gggcgtcaga agggctcagg gagcagcagg agacccggtg ctggtgggtg tggtcatagt 1261 gtttttttgc ttttgttgtt ttttgagact cgttctgttg cccaggctgg agtgtagtgg 1321 tacagtctgc aacctccacc tcccaggttc aagcaattct cctgcctcag cctcccaagt 1381 agctgggact acaggcgcct gccaccatac ccggctaatt tttttgtatt tttagtagag 1441 atgaggtttc cccatgttgg ccaggctggt cccaaactcc tgacctcagg tgatccaccc 1501 aacctcggcc tcccaaattg ctgggattac acgcatgagt caccttactt ggcctggtta 1561 ttttgtttct gtgtttgttt ttgacacagg gtctcactct gtcacccagg ccagccagag 1621 tgcagtggca caatcacagc tcactgcagc ctcgacttcc ccagctccag cgatcctcct 1681 gcctccgcct cccgagtagc tgggaccaca ggcacgcacc accacgcccg gctaatttct 1741 gtatttttgg tagaaacggg gtctctgctg tccaggctgg ggttttcatg gtttctatcg 1801 atcacctgca tctgtttcat gtggccaagg ctcctgaaag cccttcctgt gggaacagta 1861 tccccgtgtg atggaagtaa aaatcgcagc ccacagaggc agagaggctg gctcaaggcc 1921 acacagctga gtggtcctta gtggagtcgg ctgttaattc tggcatctgc catctaagcc 1981 agacgatgtt aaaaccagac ccttggccgg gcttagtggc tcacgcctgt aatcgcagca 2041 ctttgggagg ccgaggcagg cacatcattt gaggtcagga gttcaagacc agcctgacca 2101 acatgataaa accccatatc tactaaaaat acaaaaatta accagatgtc gtggccggca 2161 cctgtaatct cagctgctcg ggaggctgag gctgagaatc acttgtaccc agggggcaga 2221 ggttgcagtg agctgagatc acgccactac actccagcct gggagacaga gcaagattct 2281 ttctaaaaaa aaaaacctgg ttctgggcac agtcctggac ctcactgagc tcacgtcagg 2341 tgggaagatg gcgtgtattg gggaccaccc agggagggtg gtcaggccga agaggtggga 2401 agcccagggc gctgtggaat ccccaaggaa gtgcctcatt cgtgggaggg ctgggcaggc 2461 gacctggagg atgtgggttt cgtgggaggg ctgggcaggc gacctggagg acgtggggtg 2521 gtttttttgt gttttttttt ttgagacaga gtttcactct tgttacccag gctggagtgc 2581 aatggcacga tttcagcaac ctctacctcc tgggttcaag tgattctcct gccttagcct 2641 cctaagtatc tgggattgca ggcgcccgcc accacgccca gctaattttg tatttttagt 2701 agagacaggg tttggccatg ttggtcaggc tggtctcgaa ctcctgacct caggtgatcc 2761 gcccgcctcg gcctcccaaa gtgctgggat tagaggcatg agccaccgtg cccagccaag 2821 gaagtgggtt ttgaaggttg aataggagtt ctccaggtgc ttttccctga aggttgtttg 2881 agcatggact agtgctgtca tgggagactc agcaatacac atgcacggcc cctgccttcc 2941 tgacgggcac acccagtgac gtgatgaggg aagagtgcac tgagggagca tcagacacca 3001 cagccctgag acaggagtgt gtcccagaaa cagcaagaag accagggatg ccagggcaca 3061 gttcagggag aggggcctgt gtgcagggtc tgaccatgca gggcagaacg ggttcagaca 3121 ggcgacacta gaccctgagc ctcgcacagt ggctcccagg aggccaaggg aggaggatcg 3181 cttgagagca ggagtggacg agactggagg gggcacggag gagccggctg gattcagggt 3241 ggcctgcggg agccgctgat gcgggggtat gtggggaggg agggaagaaa tgagatggag 3301 cggacgtggt ggctcagcct gtaatcccgg cactttggga ggccgaggtg ggtggatcac 3361 ctgaggtcag gggatcgaaa ccagcctggc caacatggag aaaccccgtc tctactaaaa 3421 atacaaaatt agccaggtgc gatgacgcag gcctgtaacc ccagctactg gggaggctga 3481 ggcacgagaa tcgcttgaac ccaggaggcg gaagttgcag tgagccaaga ttgccccatt 3541 gcactccggc ctgggcgaca agagcgaaac tccatctcaa taaaaagaaa aaagaaagat 3601 gacctctgtg tttctggcca gagtcccaag aggataggat gagaagcggg tgcgtatgac 3661 gtcacccact gtgccatgcc ctccccaggc tcctgacccc atggctcccc ttcctcttcc 3721 cggggatgct gacgcccctg acccttctct cttctttctt gcagacgccg gtgggcacgc 3781 ccaaaggtag gacctctgga aggaactcgt ggggcctcgg cgcactctgg tggccagaga 3841 ggggatggca gggcggggag tgaagctaca aagttgctcg gtgcctcacg cctgtgatcc 3901 cagcactttg ggaggccgag gcgggtagat ctcctgaggt caggagttcg agaccagcct 3961 gaccaacatg gcgaaactca gtgtctactg aaaaaatgca aagacaatta gccgggtgtg 4021 gtggctcatg cctgtaatcc cagcactttg ggaggctgag gcaggaggat cgatcgcttg 4081 agcccaggag ttcaagagca tgctgggcaa cacagggaga ccctgtctct gccaaacata 4141 aaaaaattag ccgccggcca ggcccagtgg ctcagacctg taaccccagc gctctgggag 4201 gctgagacag gtggatccct tgaggtcagg agttcaagac cagcctggcc aacatagtga 4261 aaccccgtct ctactaaaaa tacaaaaatt agccgtgtgt gctgcacgcc tgtagtccca 4321 gctgctcggg aggctgaggt gggaggatca cttgagccca ggaggttgag ctgtgatcgc 4381 accactgcac cccagcctgg gcgacagagc gagaccctgt ctgtaaactg catttcacgg 4441 aacgtgtccg cttctgtatc gatagcggga cagccaccag ccgggattga cgctggcgtc 4501 tgaagtgctt ctggcctggt ttgttgtgtg ttaattattc ataaggagag tgtcttcttg 4561 tcttactgta gaattaaaaa ttaatcatta aaaaaatcca gacaggcctc tcagggattt 4621 tgtgaaaatc actgggctga ggatgaggaa tgaactgtga ggctgcctgt gggattttct 4681 aggggttgcg gtgggggtcc aggtgcctgt gtctgtgcgg ggcctgtgtc tgggggccca 4741 ggtgcctgtg tcctggtgtc tgggtggggt ctgtgtgtct gggggcccag gtgcctgtgt 4801 cctggtgtct gggtggggtc tgtgtgtctg ggggcccagg tgcctgtgtt tgggtgtctc 4861 ggtggggtct gtgtgtctgg gggcccaggt gcttgtctgg atgtctgggg gcccacaggt 4921 gcctgtgtcc gggtgtctgg gtgggatctg tgtgtctgag ggcccaggtg cctgtctgga 4981 tgtctggggg cccaggttcc tgtgtccggg tgtctggatg ggatccgtgt gtctgagggc 5041 ccaggtgcct gtctggatgt ctgggggccc aggtgcctgt gtccgggtgt ctgggtggga 5101 tctgtgtgtc tgagggccca ggtgcctgtc tgggtgtctg ggggcccagg ttcctgtgtc 5161 tgggtgtctg ggtgggatcc gtgtgtctga gggcccaggt gcctgtctgg gtgtctgggg 5221 gcccaggtgc ctgtgtccgg gtgtctgggt gggatccgtg tgtctgaggg cccaggtgcc 5281 tgtctgggtg tctgggggcc caggttcctg tgtctgggtg tctgggtggg atccgtgtgt 5341 ctgagggccc aggtgcctgt ctgggtgtct gggggcccag gtgcctgtgt ccgggtgtct 5401 gggtgggatc tgtgtgtctg agggcccagg tgcctgtctg ggtgtctggg ggcccaggtt 5461 cctgtgtctg ggtggggtct gggtgtcaaa ggcccaggtg catgtgggag tccggatgca 5521 cttctgtgaa ggggcgttgg gctgtctggg gggtccatgt gacctctcct ctgaccctca 5581 tctctctctc cgcttccacc tcccgtgcag acaagcgagt ctccaacacg cccctgagga 5641 cggttgacgg ctcccccatg atgaaggcag gtgggttggc ccccaggctc tgggccccag 5701 atccagccgc tgtcagggac caagtctcgg tccggggggc cgggtggggc gggggcgaca 5761 ccgacctgct ctccgcagcg tctgacgggg ccgtggttat gggggtggca gctggaccga 5821 ggccgaggct ccgagcctgg ggtgggggcc ccctctgctt tgttgccctg gtcccacgcc 5881 ggcccacgct tccccgagaa gacgggcccc acgcatggcg gctctaaccg acgcgacctc 5941 acggctttca ttccacccta atctcacggc tccctcctgg ggcccgctgg gggtggcgct 6001 cacgcctgaa catgctcgct gcaaatctta cctgctccag tcctcgggca gtaggactgg 6061 ctgccccggg agaaagtggg tgccccgtcc ctggaggcaa ccaagccagg tccccagcac 6121 cgctgcgctt ccgagaagca ggggaaggtg ggggttgacg ctgagtgcca cacggtgctc 6181 aagtcaggct cgaacccctg gtctgccgcc ttgaatttct ccgcctgcca gaggggtggg 6241 acctcagtac gccatcctgg ggcccggcac aggacaggag cccaataatg agaaatcgtc 6301 gcagtgggca aaaccactgc tatgtgagtg agctggaaca gcacagaaag aaggcagagg 6361 gccgggcgtg gtggctcaca cctgtaaccc accccagcac tttgggaggc cgaggcgggc 6421 agatcgcctg aggtcaggag ttcgaaacca gcctggccaa catggcgaaa ccccatctct 6481 actaaaatac aaaaattagc cgggcatggt ggtgcacgcc tgtaatccca gctactcggg 6541 acgctgaggc aggagaatcg cttgaacctg ggaggcggag tttgtagtga gccaaggtcg 6601 tgccactgca ctccagcctg ggcgacaaga gcaagactcc gtctcagaaa aggcagcaag 6661 gcagggaggt gaagaaggac tctggggtgg ctgcccagga cgggacaggg ctggaaggaa 6721 ggagagccac ccggaggccc aggtctccag cacggtccag ggtgggggtt gccccaggat 6781 gcagcagaca gaggcatttg tcagagcagg tggtgcctgt gggcctctgt gccagtgaga 6841 ccagccctgg cagacaccga ggcccaggtc ccccgctgat ccccgtggcc actgactttg 6901 tgcagtgcac aacctgctca acagtgtggg cagctctgga ctagctgtct ggggtggagg 6961 gctgaggctg tgcagagcta tagagttgaa aaacagtcgt cccttcaccc agaatgcctt 7021 tcaagcgcct gctgcatgcc aggcattggg ggagagggta ttctaatgag ggccacaacc 7081 tctactacgg tagaaaagtg aggccaagga cgttgagctg gaggaaggag aatttgaacc 7141 agagggctca ggggagggga gcgttcaacg taagaggtga cgtctgagca aagacccaga 7201 ggtgctgagg gagcaaggca cgcagctatc tggaggaagg tgtccaaggc acgcagctat 7261 ctggggaaag gtgttccagg cagagggcac ggccagtgca aaggccccgg ggcaggactg 7321 cgcctggggt gttggaggaa cggccaggag atccgtgtgg ctggagcagc gtgaggaggg 7381 ggagagggcg gaggagagat gatggagaga ttgttgaggg ccctgtggca ggggcgggag 7441 gggacgtggg cttttcccgg gagggaggca ggaaccatgg ggggctgcag gcaaagccgg 7501 gactgccccg actccgtgct cacccacgcc ctctggtgac tgcagggaga acagaccgtg 7561 gacggtgcca ggtggagggg acagggctgc tgggggagga ggctgggatg ggccgtggag 7621 gggcgagcag tgggcggttc cttatgttct gaaggtggag cccccagggt ttgaggagaa 7681 acgaggtgtc agggaggact ggccaggcac ggtggctcac gcctgtaatc ccagcacctt 7741 tggaggccaa ggcgggcaga tcatttgagg tcaggagttc aagaccagcc tgagcaacat 7801 ggtgaaaccc cgtctctact aaaaaataga aaaattagcc aggcgtggtg gctcacccct 7861 gtaatcccag cactttggga ggccgagggg ggcagatcat gaggtcagga gatcgagacc 7921 atcttggcta acacggtgaa accctgtctc tagtaaaaat acaaaaaatt agctgggcat 7981 ggtggcaggt gcctgtaatc ccagctgctc gggagcctta ggcaggagaa tcggttgaac 8041 ctgggaggca gagattgcag tgatctgaga tcacgccact gcactccagc ctgggcaaca 8101 agagcaaaac tttgtctcaa aaaaaaaaag agggaggact gagctgcccc caacgggtgg 8161 gaagagccag ggtgggggca ggtgtagggg gcagattggg agtagtgtta tggacgtgtg 8221 agagtggggc ccctgtggat aggggctgag tccaggtcaa ggtccaggtg gggatgggtg 8281 tgcgggagat tagagccgag tgcctgggcg aggtccccgg cgcagagcag agtccagggc 8341 cgcagccggg cctaagcgtc gaggttgggt gggaggaggg aaacgaggtg gctatggggg 8401 ccgaggggcc aggcgatggc agggctgccc acgatgggga ccgatggccc ctcagaagtg 8461 gtgaggacat gcagctatct ggggggtgtg ccaggcattg ggaacagcct gtgcaaaggc 8521 cctgaggctg ggccgcacct ggggtgcagt gtcatgggtc aggccagtcc tggcagccct 8581 gttcagctcc cgctgcgggg cggggtctca gggaggtgcc tggtgatccc cagtgctcgt 8641 catcttcaag caggcctggg ggtggcgggc agccaccagt cgctgttagg atgcaggctc 8701 tgtgtggttt ctgcaggaat gatttttaca gagtggggcg aggtgctggc tgtggagtta 8761 gggccctgat gagtagcaag ttcgactatc tcaggagggt tccttcattc tgagtctaaa 8821 tagatgacgg cgtgtccaac cgttggctgg aaggaggcgt caagtcctgg gagcttggat 8881 ggtcggggcg ggggctgtgg gtctgcagac tgagccccag tctccgccgt gaccccgggc 8941 aaggcaggat ctctctggga ttcacctgct cctctgagaa accaggaatg gtggggctcg 9001 agggctcacg tttataatcc cagggctttg ggaggctgag gtgggaggat tgcttgagcc 9061 caggagtttg agaccagcct gggcaacata gtgagacccc catttccaaa aaagataaat 9121 aaattagccg ggtgtggtgg cgcacacctg tggtcccagc tacttgggag gctgaggtgg 9181 gagaatgggg tgagccagga aggtcatgac tgcagtgagc tgtgatcgtg ccgctccact 9241 ccagcctggc caacagaggg agaccctgta tctatttaaa taataggctg ggcgcggtgg 9301 ctcacgcctg taatcccagc actttgggag gccaaggcag gtggatcacc tgaggtcagg 9361 agtttgagac cagcctgacc aacatagaga aaccccgtct ctactaaaaa tacaaaaatt 9421 agctgggcgt ggtggcgggc gcctgtagtc ccagctacgt ggaaggctga ggcaggagaa 9481 tcgctagaac cgggaggtga gtccagatca caccactgtg ctccagccgg ggcgacagag 9541 caaggcggct ccatctcaaa aaaattaaaa agatccggga acaatgaagg ctttccagat 9601 gggctgggct ggaacctgag cttgacagac aatagtttgg tggcgaaaca gaggacacag 9661 tgcagagtac agggacgagg accatctggg cggacagcag gatgggcctc aaatgccaag 9721 gagaggtgtt cagactggtc gtctccaggc actggggagc tatggaggga tgtagagttg 9781 gggagggccg cgtcgagaag gtgcccaggc atccccggct ccgcctgccc catcgctgct 9841 tggctctgcg ctgctggctc ccccctccct ggcttggccg cagcccggcg ggggggtgcg 9901 ggggcgggca ggccgtggtg taaccccgtg actctcgtgc cagccatgta ctcggttgag 9961 atcactgtgg agaaggacaa ggtgacaggg gagaccaggg tgctgtccag caccacgctg 10021 ctccctcggc agccgctccc tctgggcatc aaagtctacg aggacgagac caaaggtacg 10081 agcaccccgg cccctgccct ccctccacct gggccagagc cccgaacacg tgtgctccca 10141 gcgtgtgggt ctggcgtctg ccccggaatc aggaccccga ttcgatgtga agcagatgac 10201 gctgttcagg ccagggcctc accccctgtg gcccacagct gggctcagag gtcgttcagc 10261 cctgggctca ctgcccaatc attcatccac tccagaaagc tgcctgaaag ctgagccggg 10321 caccaggtac ccagctgggc tctggaggga aatccggagg tagattcagc ttctgtcctt 10381 gccgggtggc aaaccgggcc cttccatctg gagctcctaa ataattcgtt taaataatta 10441 ctgctagccc aagggctcac gcctttagtc ccagcacttt gggaggccga gggaggagga 10501 tcgcttgagc ccagaagttt gagaccagcc tgggcagcat agcaaggctc cgtctctacc 10561 aaaaaaaaaa aaaaaaaaaa attaccctgg catggtggtg cgtgcctgtg gtcccagcta 10621 cttgggaggc tgagatggga ggatcacttg ggcccaggag gttgaggctg tagtgagccg 10681 tgatcgcacc actgcactcc aacctgggca acagagtgag accctgtctc ttaagaataa 10741 aaaataataa taaagaattg ctagcgtgac agaagccaca gaaaatgcac atccaggggg 10801 ctgtgagaag tctcacaggg tgaggatggg gccgggggag gcttcccaga ggaggcaagc 10861 aggggctcat cagagacacg cagtgcatgg gaaccacacc tggcagagcc gcagcccctg 10921 caaaggccct gaggtaggat ggagtcagct gggctggagg gctgagggga gttgggctgc 10981 aggagtgagg ggagacgggc tgcaggggtg aggggagacg ggctgcaggg gtgaggggag 11041 acgggctgca ggggtgaggg gagacgggct gcaggggtga ggggagctgg gctgcagggg 11101 tgaggggaga cggggtgagg ggagacgggc tgcaggggtg aggggagacg ggctgcaggg 11161 gtgaggggga accaggttgg tcagggagtt gaggagctgg gctcacgcag ggaggagaag 11221 ggtgctttgc ctggaggcag ggggacaggc ggggccgtgt cctgagggct ttgggaagcc 11281 ttgggtgcgt tgggtcctga cggtgctgag gtctaatttg gttttaaggg ctcctcctag 11341 aggtcaagag gcaaacaggc tgtgggagat gccagggacc ccaggaggag gagggaagct 11401 ccacactgaa gccccactct ctctctttct ttttcttttt atttatttac tttctgaaga 11461 aagagtttca ctctgtcgcc cggcccgcag tgcagcgatg ccatcacagc tcactgcagc 11521 ctccagctcc tgggctcagg cgatcctcct gcctcggcct cccgaaatgc tgagaccaca 11581 ggcatgagtc actgcacccg gcctcttttt cttttatgag atataattcg cagaccataa 11641 aattccgcct taatgggagg ctcatctggg cccgggacat tgaggctgca atgagctatg 11701 atggcacccc tgcactccag cctgggccac agaccaagac cctgtctcaa aaacaagaaa 11761 gaggagagaa gccccatccc catcagctgt cacttcctgt cccctcccca gtcagcgtca 11821 ctccctatcc cctccccagc tctggcaccc acgcatcccc tcctgtctct ctggattggc 11881 ctgtcctgga catttcatag aaatgagatc acacggccgg gtgcggtggc tcacacctgt 11941 aatcccggca ctttgggagg ccgaggcggg tggatcacct gaggtcagga gttcgagacc 12001 agcctggcca acatggagaa cccccgtctc tgctaaaaat acaaaaatta gctggacgtg 12061 gtggcggacg cctgtaatcc cagatacttt ggaggctgaa gcaggagaat cacttgaacc 12121 caggaggcgg aggttgcagt gagccgagat cgcgccactg ctctccagcc tgtgcgacaa 12181 gagtgaaact ctgtctcaaa aaaaaaaaaa agaaagaaaa gagaataaat ggggtcacac 12241 actgcacgtg gccttctgtg tctggcgtct ctcactgggc gtgacgtccc caaggtgcat 12301 cggcactgtg gcctgggtcg gagcctgact ccttttcatg gctgaatact attccacggg 12361 ttgggtgggc catggtgggt ttatcccttc ttccatctgt agacagttgg gctgtttcta 12421 ccctttggct gcagtgagta gcatgctggg gaatttgtgt gcaagtattt gttcacgccc 12481 ctgctttcga ggcctttggg tgtacacgta ggagtgaact gctgggtcct gaggcgactg 12541 tgtggagctc cgtgaggacc tgctgttttc cacggaggct gcaccgtccc gttcccacag 12601 gtcgtggatg agggcccagt ttctccacgt cctcaccagc gctcgtagcc cattctcgtg 12661 gctgtcctgg tggcgtgacg tggggtctcc ctgtggtttg gattcccaaa gccttctgtt 12721 ctcagaggca gcttggcctc tgcggtgccc cagatgctgt cagaggggtc cccgcagccc 12781 agctctggag tggctttggg gtgtggcgtt ggtggcatct cagctctagc tctccccagc 12841 tgtgtggccc tgggaagtcc ccatgcctcg tcttccggat gaggtcctca cccccatcac 12901 tccaccctcc tcagcctcac ctggtccctg gggattccag acctgccagg ctggacactg 12961 gttgcttagc aacgggcagg tgatccgttt ccatggcagc cgagggctcc aattcctgtc 13021 ttggggcagc ggccagtggg acacactgag ttcagttccg cctgatccag acacacccca 13081 ccctctgggc ctccagccca aagtcattga gatgccttga agcatggact cccctccctg 13141 ggggcttaga aactgaaagt ggagaacact ttccttctgg tctgtagcca ggaacaggag 13201 ccctgaccct tggccgagag gaggcgagtc cctgggaccc cccttgcgtc cctcccaact 13261 cccagcgtgg cccaggcagc cgtgtttcta gaaggaattt gcagggctgt gagctagggc 13321 tggtgttaga tcatagcagg gagccgtggg gtgctgaaat ccggtccttg gagagtccaa 13381 cacacctcaa gaaatgtcaa gtcagagaca cacagataag gaaatttgga aagacttgag 13441 aaacacattt agcaagattg attatatgta taaaatgcta cccagtggag aatacacatt 13501 cttttcaaac acagaacatt gtcagaaagt gaccaccgtg gcccaggctg ccaaggaggt 13561 tttaggactg ttaacagagt ttgtatcttc tgctctgtgt tcctctcctc cctgaaattt 13621 aaaaacaccc tctaaataaa ttgtggcttc ctgtggcaat caatcaaact aggaattaaa 13681 aactatttac acttgaaaat gaataacatt ggccgggcat ggtggctcac gcctgtaatc 13741 ccagcacttt gggaggctga ggcaggagga tcacgaggtc aggagatcga gaccagcctg 13801 gctaacaagg tgaaacccca tctctactaa aaatacaaaa aattagccag gtgtggtggc 13861 aggcgcctgt aatcccagct agtcaggagg ctgaggcagg agaatggcgt gaacccggga 13921 agcagagctt gcagtgagcc gagatggcac cactgcactc cagcctgggc gacagagcga 13981 gactccatct caaaaaaaaa aaaaaataga aagtgaataa catcatctaa ctcgtaggac 14041 acagccaaag cagacctcag aggaaaatgc atagccttta gggaagctcc tagaaaacaa 14101 aaataaatga atgaaacttt aactcaagtt agaatgaatg aaaatttcaa tttaagatca 14161 atcaaataca cccatagatg gaaaaaatca gaatccctgt aatggccagg catggtggct 14221 cacacctgta atcccagcac tttgggaggc caaggcgggt ggctcacctg aggtcaggag 14281 ttcaagacca gcctggccaa cgtggtgaaa ccccatctct actaaaaata caaaaatcag 14341 ccgggcatgg tggcgggtgc ctgtaatccc agctactctg gaggctgagg caggagaatt 14401 gcttgaacct aggaagtgga ggttgcagtg agctgagact gcgccactgc actccaccct 14461 gggcgacaga gggagtcaaa aaaaaaaagc agcaggcacg gtggctcacg tgtgtaatcc 14521 cagcaccttg ggaggccaaa gtgggtggat cacctaaggt ctggagttca agaccagact 14581 ggccaacgtg gtgaaacccc atctctacta aacgtacaaa aatcagcctg gtgtggtggc 14641 aggtgcctgt aatcccagct actctagaga ctgaggcagg ataattgctt gaacctggaa 14701 ggtggaggtt gcagtgaact gagatcgtgc cattgcactc cagcctgggc aacaagagcg 14761 aaactccatc tcaaaaagtg aaagaatccc tgtaacctcc agttccccag ggtgggtggt 14821 cctgtcttgt agaagagaaa caggccccga tgcggggaag gagcaggggt ttggctctgt 14881 gaccttggcc cagctgtgtc ccttctttga actttcattt ttgctatctt tacagcggaa 14941 atgaatccct acttttcagg gtgatcacgt gggagctagt tggctaaaga atatggaatt 15001 acaggccagg tgcggtggct cacgcctgtc atcccagcac tttgggaggc tgaggtgggc 15061 agatgacttg aggtcaggag tttgagacca gcctggccaa catagtgaaa ccccagctct 15121 attaaaaata caaaaaatta gccaagcatg ttggtgcatg cctgtaatcc cagctactca 15181 ggaggctgag gcaggagaat tgcttgaact agggaggcgg aggttgcggt gagccgagac 15241 agtaccactg cactccagcc tgggcagctc catctcagaa aaaaaaaaaa aaaatcccaa 15301 ttcattcctt tccactcttt cttggttcct aataaatgtt tttgtgctta taaattttcc 15361 tcattgtcgg ctgggcctgg tggctcacgc ctgtgatccc agcactttag gaggccaaga 15421 cgggcggatc acctgaggtc aggagttcga gaccagcctg gccaacatgg agaaaccccg 15481 tctctactaa aaatacaaaa ttagctgggc gtggtggcgg attcctgtag tcccagctac 15541 tcaggaggct gaggcaggag aatcgcttga acccgggagg tggagcttgc agtgagctga 15601 gatcacgcca ttgcactcca gcctgggcga cagagtgaga ctccgtctca aaaaaaattt 15661 tttttcctca ttgtacggca ttggctgcat ccacagatgc caacaggggg ctttaattgt 15721 ctgtcagttc tgacggtgta atctagttcg tgatttcctc tttagcctgg aggaggatac 15781 aagccttgcc aaggtttccc tcctgcctga gccaggtcac tctctctgtc cctcctgcct 15841 ggagctgggt cattctctct gtctctcctt gtacagtggt ccatgctgtg gacggcaccg 15901 ccgagaacgg gatccacccc ctgagctcct ccgaggtgga cgaactcatc cacaaagcgg 15961 acgaggtcac gctgagcgag gcagggtcca cggccggggc ggcagagacc cggggggctg 16021 tggagggggc agcccggacc acgccctccc ggcgggagat caccggtgtg caggcacagc 16081 caggcgaggc cacgtccggc ccgccgggga tccagcccgg ccaggagccc ccggtcacaa 16141 tgatcttcat gggttaccag aacgtggagg atgaggccga gaccaagaag gtgctgggcc 16201 ttcaagatac catcacggcg gagctggtgg tcatcgaaga cgcggctgag cccaaggagc 16261 ctgcaccacc caacggcagt gctgccgagc ctcccacgga ggccgcctcc agggaagaga 16321 atcaggcggg gcccgaggcc accaccagcg acccccagga cctcgacatg aagaagcacc 16381 gttgtaaatg ctgctccatc atgtgagccg gcccccgaga ccccggcccc caccccacac 16441 cacagacacc caccagcccg gcccctcccg gcgcctgccc accctccacc cacagcctca 16501 cgggtccagg acttggcgtg ttgttacatg ttccttccga gttttctttc gctggaaaga 16561 gggacagggg cccccacccg tcaccacgcc ccaacactcc ccccgaacca gagccgtgca 16621 cttgtgcctg gtaggagaga gacaggacag acccgctttt cccgagacaa ggacccccca 16681 tgtcacggca gcttcacaga cgcggctcgc gcccaccggg gtcctggcgg gtgggacccg 16741 cagcctccac gcggcccagg ccagcctgcc accctctggg cctcctacct gtgcctttct 16801 ctgaggggac accccgccag agagggcccc gggagccggg ggtgggtact gaggcctgct 16861 caggccctgg aagtgaggct ctatggggtt ccctggccaa ggcgctggcc ccccaatctc 16921 aggcagttgg ggtgaggccg tgcctctttg ggggctaaag gtcttgggtg gaggacaggc 16981 ccctctgctg tgcccctatg ccctgtgtgg gcccaaccag tggacaatgg agtctggggg 17041 agggggaacc ccggggacat gcccccaccc gggaggggcc ggtaacccct gggctatctt 17101 ctagacgggg cgaaccaggg gtcattgacc tgccccctgc acagggcagg gaccgagtga 17161 gccactcctt gtcccgagct cccgccccca ctgggccctc cttcctcctg gtgctaattt 17221 ggggacccca ggggccgccc ccggcctctt ctccatcctg cttggaccag ggtcctgggt 17281 cttcccaacc ataccccgag atcaggcccc acctgccagc tctactgggc ttggagcacg 17341 tccgggcagt ggagggaggg acacagcctg ggacaggaag cctcttgggt tggagcagga 17401 gaccctcatt tgccacccag accaatgtga gcctgccccc agccccctct cattggaagt 17461 ggcaaggggc ttccctcctg ggggcagcta cactcgtccc cagaggcaca ttcgtgcaca 17521 ttctcacaga caccgtctca cacgttggct ttggacaacc aggccccaac ttggtccctg 17581 ccctagggac ctccagcctg gtgcccagtg ctcaggccac ctcctggtcc agtcaccacc 17641 tgcagcctcg gcagggcagg tacaggggcc acctcagatg ggagcctggg tccctgcctc 17701 cgctctgccc ctgggtggct gggaggagag gccctctcgg gggtgacctg ggcgtcagcc 17761 gtggaacccc ctcctcctcc ctggagtctg cctgagtccc tcgagccgcg agccttcgct 17821 gaagtgccct tgctataacc ccctctgctt ctggtgtgtg acgaggcccc cgatgttctt 17881 gattttccca gagaagcaaa taaacagcgt gaacagcccc agttcctgga gttcttgcct 17941 ttcaacctgg cgaggaaggc gtggccaggc cagggggtgg ggcctgccgg cctttgagcc 18001 attaaaggga ggctgagcag ggttgcggtg tgctggcttc acctgcctgc acccgctcac 18061 cctgagcgcc ttggggtggt gggaggcgct ggaatcccca ctgtgcaggt aaggcctggc 18121 tgtaaggggc tgctgtgggg gggcaggact ggcttcccgg ggggccctcg ggttctgcgg 18181 ccgccccgtc ctgtcctttg ttcctccctt tgctgggctc acagcccctg ggtgatgaca 18241 gagggtcccc tggacaccca caagctctgt agagaagccg gcccctggga aatacacagg 18301 aggcctgggg aggaggtgtc tgtagagacc agtgtcatgc tcacgggaag gtgctgcccc 18361 gccagcccca ctctctcacg ctgtagtctt ggagaagtcg ctgcctcctc tgggcctcag 18421 ttttcccatc tgtaaaataa gccttcgtgg ccgggcacgg tgtttcacac ctgcaatcct 18481 agtactttgg gaggccgagg tgggcggatc acaaggtcaa gagatcgata ccatcctggc 18541 caacatactg aaaccctgtc tctactaaaa atacaaaaac ttagccgggc gtggtggctc 18601 acgcctgtaa tcctagctat tcaggaggct gaggcaggag aatcgcttga acctgggagg 18661 cggaggctgc agtgagctga gatcgtgcca ctgcactcca acctgggcga cagagcgaga 18721 ctctgtctca aataaaataa aataggcctt cggagccctg agtgggaggc cagcctcagc 18781 ctgggcgtgg gacgtctcca cagtgccccc agctcatggg cgatccagcc tccaccctgc 18841 tggggcctca gttttccccg atgaggtggt gagtgtggtg gagagcactg ctgagggaac 18901 tcaggttcca attccagcct aaacccccgc cattcctgag gcgcgatttt aagctttccg 18961 tacctccgtt tccctatcta caagccaggg agaaaaggac gtgaaggggg tggcgggagg 19021 cggcggtggg aggcgccgtg aggccgcgca ccctgagctt tcacacaccc agcagcgtta 19081 ggtgccacgc tgtcctttgg gggcccttta tccagatttg tccttaaaca cccccatctt 19141 ggccaggcgc cgtggctcgc gcctgtaatc ccagcacttt gggaggctga ggcgggggca 19201 tcacctgagg tcaggagttc gagaccagcc tggctaacgt ggcaaaaccc tgtctctact 19261 aaaaatacaa aaatagctgg gcgtggtggc tcatgccagt aatcccagct actagggaag 19321 ctgaggcagg agaattgctt gaacagggga ggttgccgtg agccgagatc gtgccactgc 19381 actccagcct gggtgacaag agcgagactc cgtctcaaaa aaaaaacaca aaaaaagcac 19441 ccccctcctg agcagccctg ctggattcgt agctgggggc ctttttactc gaagaagaga 19501 ctgaggttca gaggggctcc acaagcagcc cctctggcgt aggtcaggac agagccccag 19561 cctggggaac tgaagtccct gccccctcag ctgtgttccc tgccctcata gacggcgcta 19621 ggcccagtgg cccgacttcc ccacagatgc tgggctcccc acagtcccag caaggcctct 19681 gcaccttgac taagaatcca ggaggaccat gggcgcctga gggcctctgc gtccccatct 19741 ctcggggtgg cctgggtctt ttgacggcct cgtgcggttc tttttttttt tttttttttt 19801 ttttttttga gttggagttt tgctcttgtc gcccaggctg gagtgcagtg gtgtgatctc 19861 ggctcactgc aacctccaac tcctgggttc aagtgattct tctgcctcag cctcccaagt 19921 agctgggatt ataatagcgt gccaccatac cccgctaaat tttgtatttt tagtagagac 19981 ggggtttcgc catgttggcc aggctggtct cgaactcctg gcctcaagtg atccgcccgc 20041 ctcggcctcc caaagtgctg ggattacagg tgtgagccac caagtccggc cccctcgtat 20101 gggtttttat gtggatgggg cctagggaat tcctcccagc agccctgagg gagcccgggc 20161 gtgatgttcc cccacgtccc ggaggagaaa acagggctaa gagaggtcag gtgggatcag 20221 aacccaggcc acaccaaacc cccccgagtc tgccaaacct ggtttatgca agaagggggc 20281 catctagggg agttgcccca aggggtgggc tgggccagaa ggcacggacg ggattgggac 20341 tcagcccctc cggcccccca ggagacacag gccactgtgc acgctgagct gctgtgtggc 20401 cctggacacc tgctgaccct ctctgagcgg tgctgagagg cggtgggagg gccaggcagc 20461 caagggaggt ttcggctgag ttctggctcc tcggggccac ctggaggccc tgggggtgga 20521 ggggcagccg gcagcgggca cggtgcccgc ccttgcccag cctggtatcc tctttctccc 20581 tcctcctcct ctggactttg tttcctgatc ccaggtgggg ctggggggag ggggcacacc 20641 tgcctcccct gggtggggcc tctgttccct ggcaacctgg cgggcagggc ggagctggga 20701 ggcctctgtg cccatcgagg agtcagagtg gaggctgcag actgtggagc cgggagccgg 20761 caggtgaggc tgggggcgcc ccgggccggg ccgggcgggg atcctgtgtg gggcggttgg 20821 atccacattt ggcgtgggag cgtcatgtgt ctgtgggcgg ggtctgcttg cctggcggca 20881 cttgggatcc agggaggccc cctgccccac cccgtcaccc tcggagcctc cctgaggcac 20941 ctttccctct gcccccgcta ggtactctgg ggcctcaagg gcacccactg ggagagcaga 21001 gaggcccttg tccctgaact ctgctggggc ttttgaaccc caaatcccag agccctccaa 21061 gggaggggcc gggttggcac ccccatggtc agaggccaga ggggcccctc catggttctg 21121 ggaattccag aggcttctgg agagatgctt atccgtccca ccaggcccct gactgggagg 21181 ctctgggcag taaacacccc cgtggggctg ggggctggtt tggggacacg gaggaggggt 21241 ggctgagtac aggcggaagg agtctcaggc cggggctggg gacttggcct gggtcctcgg 21301 gtgggggtga ggtggtagcc tgggatcccg ggcagccagc agggactcgg cccagcgtct 21361 gccggtggag ctccgcccag gagcccaagg agctgagatg actcaagggt gactgcaacc 21421 ctgagggatg ggaaaaagga acggggccct cccttctgca tggcctcctt gtctgagtct 21481 cggccccaca agggaacctt tgcccctcac atagagccaa ctggctgatg aggaaggagg 21541 tgtggcacca gggtccataa acccagtttc attctggtat caacagagtt cagggtgagg 21601 ctgggaggag caacccaggg ggcttcctgt aggagggggt gatttggagc aaagtctgaa 21661 aactggcttc agacacttgc tctgtcatcg ctgtgtgacc tcagatgagc aggttctcct 21721 ctctgggtcc cagctccctc tctcgaaaac cggagtcgaa aacggttcca aggtgttgca 21781 ggaagagtct caggacgccg aggtccttag ccaccgccgg gccttgcagg ggccatcttc 21841 cgtcctgggg ctcttccccc gagggcagcg ccgctgccca agaatgccgg ccatgagagc 21901 cgccgtggat ccaggggttt ttttggccga cacagtgaaa ccccatctct actaaaaata 21961 caaaacttag ccgggcgtgg tggcgggcgc ctgtagtccc agctacttgg gaggctgagg 22021 caggagaatg gtgtgaatcc gggaggcaga ggttgcagtg aatggagatc gcgccactgc 22081 actccagcct gggcgacaga gggagactct atctcaaaaa aaaaagcggg gggcgggggt 22141 gggcactgag gcacaggcag caggggaggg ggcccagtgg ccttgaaccc aagagctgga 22201 ccccaggccc cacccgctgg cttcagtgaa tcactgggga ggaatgagca gaggaggatg 22261 cgtgccttca tccccaccca accccgaccc cactgggaga cccggcacct gctcgaggct 22321 gggcggacag ctccctttct cccggagttg acgtctgatg tgggttataa atcccgctgt 22381 gtaaacgtct gtccactcac tgggcaaaca ttcctacacc cgccctcccc ggggagctcc 22441 agggtgcagg gcgtggacct gcccagccct tcagggccag aaggcacggt ggagccggtc 22501 tgcattccgc aatgagcccg acccccacgt ctgagcagct gggatgagtg gtgtttgccc 22561 agccagtatt tattaggcgc ctgctgtata ccagactctg ggctgcagga agggtgctga 22621 cctgtggcca cagcaggcga gagcgaggcc tgggctgacg gtttcgggtg ggagaaggca 22681 aggccttccg tgagctgggc aggcggcgtc cacggaggag gaggcctcgg gtcaggggag 22741 ccatctgtgc gtcacgcttt gggaggaagc tcagctgttt gtgtgggatt tattcaccaa 22801 tgccgcctgg ttttcaatgt ttcccctatt ttagtttctg acatttgata cgcgcggaag 22861 attgcatgac acattgcgtg tagtttcaac acggctcccc acggtctctc cactttatgc 22921 tgtttgatgc cactttccag ttcagagagg ggctaagttt ttcttttttt ttcttttttt 22981 cttttttttt ttttttgagg cggagtctag ctctgtcacc aggctggagt gcaatggtgc 23041 gatctcagct ccctgcaagc tccacctccc gggttcacag cattctcctg cctcagcctc 23101 ctgagtagct gggactacag gcgcccgcca gcacacccgg ttaatttttt ttgtattttt 23161 agtagagatg gggtttcacc attttagcca ggatggtctc gatctcctga cctcgtgatc 23221 cgccaaagtg ctgggattac aggtgtgagc cactgtgcct ggccgaccct gtctgtttaa 23281 aaacaaaaaa aaatgctgct ggttggtgta aaggcagcag atctaaataa tagccttggt 23341 ggcccgggag cgacagtcct gctttgtggg ggtttgtttt tcttttgaga cagggtcttt 23401 ctctgtcacc caggctggag tgcaatggca caatctcagc tcaccgcaac ctccacctca 23461 caggttcaag tgattctccc acctcagcct cccaagtagc tgggattaca ggcctgagcc 23521 cctgtgccca gccgacagtg ctttttttag gagtcaggag ctcaagcttt tgagatccct 23581 gtgtcttcag gtgaggcttc ctttcctcaa ctatgaaaga tgatgtctgc tggacacggt 23641 ggctcacgcc tgtaatccct gcactttggg aggccgaggc aggcggatca cctgaagtca 23701 ggagttcaag accagcctgg ccaacatggt gaaacccgtc tctactaaca atacaaaaat 23761 tagccgggtg tcctggctgg gcgcggtggc tcacgcctgt aatcccaaca ctttgggagg 23821 ccgaggcggg aggatcacga ggtcaggaga tcaagaccat cctggctaat atggtgaaac 23881 cccgtctcta ctaaaaatac aaaaaattag ccaggcgtgg tggcgggcgc ccgtagtccc 23941 agctctaggg aggctgaggc aggagaatgg cgtgaacctg ggaggcagag cttgcaggga 24001 gccgagatcg cgccactgca ctccagcctg ggcgacagag tgagactccg cctcaaaaaa 24061 aagaaaaaat tagccgggtg tgggggcggg tgcccgtagt cccagctact caggaggctg 24121 aggcaggaga attgcttgaa cccgggaggt ggagattgca gtgagctgag atcacgctac 24181 cgcagtccag cctgtgcatc gcagcgagac tctgtctcaa aacaaaagac aatacctaag 24241 acgggtgctg ggtacaaggc ctgacatgaa ggaggtgttc agatgatggc tctgggggag 24301 gggcaatctg ggttacttcc tggaggaggt gtccatgggc ttcagcagag gaagccgtgg 24361 agaagcagaa acaggcagcc tcgagacaga gagagctgca ggtagaggag gagggcgagg 24421 tttggggtgg aggaagaggg cctggtttgg ggtggaagag gagggtctgg tttgggttgg 24481 aggaagaggg cctggtttgg gtggaagagg agggtctggt ttgggtggaa gaggagggtc 24541 tggtttgggg tggaagagga gggcctggct tggggtggag gaagagggcc tggtttgggg 24601 tggaagagga gggcctggtt tggggtggaa gaggagggtc tggtttgggg tagaagagga 24661 gggcctggct tggggtggag gtgcccacag aaattggggt ttcgtgctaa ggccaggagc 24721 agggtgtgag aactgtgccc tgcagatgcc aggagccaca gaagcgatgc gagcaggtgt 24781 gtgacctccg ggaaggctcc tggaactgag cccagcagca acctggaccc ttgggagagg 24841 gcgagagccg gaaggcactg ggtcagggtg tcacccgctc gctccacggc cccacctggg 24901 cccagcggga caaggtccac ggtgttcctt tgcccatcct ggctttatgg cgacaaggcc 24961 caggcgggga tgttgtctca gatgggacga ggtgtccagc tgggcagggc agggtggaga 25021 accaggatgt gggagccggg aaactggagt tgctgtcagt ccttctctct gggcctcagt 25081 ttctccattt gacatccctt agcaaggggc catctgtaat tgtcagatgg aaaaatccca 25141 caccaggagt ctcagagtga gctcagggtg gaaaggctca gtgcacagga gacacagata 25201 aatatggtgc ggccaaggaa ggcttcagag atcaaggctg aggccaggcg cggtggctca 25261 cgcctgtaat tccagcactt tgggaggccg aggcgggcag atcacctgag ctcaggagtt 25321 cgagaccaac ctggccaaca tggtgagacc tcgtctctac taaaaataca aaaaattagc 25381 cggacttggg aggttagcgc ctgtaatccc agctactcag gaggctgagg cacgagaatt 25441 gcttgaacct ggaagcagag gttgcagtga gctgacattg tgccactgca ctccaggctg 25501 ggcaacagag cgagactcca tctcaaaaaa aaaaaaatca aggctgagct ggactccaca 25561 gggcaggtgt gtgtgtctgg gagataaggt gggcagggca ccctgatttt attgggctgc 25621 aaatgttaga aacatataat gcgagttgtt ggggtagtaa ccgggacatt cagagattca 25681 ggtatggttg gaatcaggga gcttaaaagt tttcaggatg ctttgctggt gttggggttg 25741 ctgtcaagca agctgtaccc actgggggca tttacgtccc cagcaattca ctatctcatt 25801 ccatcagcct aacaagacca gcgggaagag ggtcccatct tctgcattaa ttgtaaagtc 25861 ccagggtctc atgtcattgg ctagactggg tcacatgctc atctctgaac caattacaga 25921 ggccagaggg tgaaatgctc tcactggctg gactaggtca cttacccatc tttgaaccaa 25981 tcacagaggc cagtgggtgg gatgctctga ttgatcagga tgagtcactt acccatccct 26041 gaaccaatca cagaggccag agaatgggct actctgactg gccaggctgg gccatttact 26101 catccttgaa ccaatcacag aggccagaag gtgggatgct ctgattggcc aggctgagtt 26161 acttacctat cccttagcca atcacagaga ttagacaatg ggctgctctg agtggctgta 26221 ctgtgtcacg cacccctaaa ccaatcacag aggttagaga atgggatgca ctgattggct 26281 ggactgtcac ttacgcaccc ctaaaccaat cacagaggtt agagaatggg ctgctctgag 26341 tggctggact gtgtcactta cccatcccta aaccaatcac aaaggccatt tgatggaccc 26401 tttgcccttt gggggactgg aggctgttcc tctccctagg tgctctgact tgatggccct 26461 catttccttc tcctccctca gtaagcccag aggtctccac cccacgggag gaaggctgag 26521 gccaagaccc cggaagagat ggaccgcgtg accagatacc ccatcctggg catccctcag 26581 gcacaccgtg gcaccggcct ggtgctggat ggagacacca gctacacata ccatctggtg 26641 tgcatgggcc ccgaggccag cggctggggc caggatgagc cgcagacatg gcccactgac 26701 cacagggccc agcagggcgt gcagaggcag ggggtgtcct acagcgtgca tgcctacact 26761 ggccagccgt ccccacgggg gctccactcg gagaacaggg aggatgaggg ttggcaggtt 26821 taccgcctgg gcgccaggga tgcccaccag ggacgtccaa catgggcact ccgcccagag 26881 gacggggagg acaaggagat gaagacctac cgcctggatg ctggggacgc tgaccccagg 26941 aggctgtgtg acctggagcg ggagcgctgg gccgtcatcc agggccaggc agtcaggaag 27001 agcagcaccg tggccacgct ccagggcact cctgaccacg gagaccccag gacccccggc 27061 ccacctcggt ccacgcccct ggaggagaac gtggttgaca gggagcagat tgacttcctg 27121 gcagcgagac agcagttcct gagtctggag caggcgaaca agggggcccc tcatagctcc 27181 ccggccaggg ggacccctgc aggcacaacc ccaggggcca gccaggcccc caaggccttc 27241 aacaagcccc acctggccaa cgggcacgtg gttcccatca agccccaggt gaagggggtg 27301 gtcagggaag agaacaaggt gcgtgctgtg cccacctggg ccagtgtcca agttgtggat 27361 gaccctggct ccttggcctc agtggagtcc ccggggaccc ccaaggagac gcccatcgag 27421 cgggagatcc gtctggctca ggagcgtgag gcagacctgc gagagcagag ggggcttcgg 27481 caggcaaccg accaccagga gctggtggaa atccccacca ggccgctgct gaccaagctg 27541 agcctgatca cagccccacg gcgggagaga gggcgcccgt ccctctacgt gcagcgggac 27601 atagtacagg agacacagcg tgaggaagac caccggcggg agggcctgca cgtgggccgg 27661 gcgtccacac ccgactgggt ctcggagggt ccccagcccg gactccggag agccctcagc 27721 tcagattcca tcctcagccc ggccccagat gcccgtgcgg ccgacccagc tccagaagtg 27781 aggaaggtga accgcatccc acctgatgcc taccagccgt acctgagccc cgggaccccc 27841 cagctagaat tctcagcctt cggagcattc ggcaagccca gcagtctctc cacagcggag 27901 gccaaggctg cgacttcacc aaaggccacg atgtccccga ggcatctctc agaatcctct 27961 ggaaaacccc tgagcacaaa gcaagaggca tcgaagcccc ctcggggatg cccgcaagcc 28021 aacaggggtg tcgtgcggtg ggagtacttc cgcctgcgtc ctctgcggtt cagggcccca 28081 gacgagcccc agcaggccca agtcccccat gtctggggct gggaggtggc tggggcccct 28141 gcactgaggc tgcagaagtc ccagtcatct gatctgctgg aaagggagag ggagagtgtc 28201 ctgcgccggg agcaagaggt ggcagaggag cggagaaatg ctctcttccc agaggtcttc 28261 tccccaacgc cagatgagaa ctctgaccag aactccagga gctcctccca ggcatccggt 28321 gagaaggggc tccagggagt ggctgcttgg ctcagggtct cagaggttgg agcaggggag 28381 ggtggggagg tggagtttga ggctgtcaaa gggctggggt tggggagaca ttgcctcctg 28441 actgttcatc ccctcagtcg ctggtttgtc tgcctgtcta ttaaaaatgt agattttgga 28501 gagaaatgtc aactttacct tagggctttt gctttatgga tttggatgcc ttgttgttag 28561 gcgcataaat gttcatatat gttttgtgtg ttttgttttt gtttttgttt tttgtttttc 28621 ggttttcttt tttgagacgg agtctcgctc tgtcgcccag gctggagtgc agtggcagga 28681 tctcaactca ctgtaacctc cacctcctgg gttcaagcga ttctcctgcc tcagcctcct 28741 gagtagctgg gattacaggt gcccaccacc actcccggct aatttttgta tttttagtag 28801 agacaggttt catcatgttg gccaggctgg tcttgaactc ctgacctcag gcgatccacc 28861 ctgccttggc ctcccaaagt gctgggatta caagcatgag ccactgctcc cggcctgttc 28921 atatatgttt atcctcttgg acagttgcac ctttttgaca atatgaaata ttccttgtgt 28981 cacttttaat gttttttttt tttttttttg agatggagtc tcgctctgtc acccaggctg 29041 gagtgcagtg gcgcgatctc ggctcactgc aagctccgcc tcccgggttc acgccattct 29101 ctcgcctcag cctcctgagt agctgggact acaggcgcct gcctccacgc ctggctaatt 29161 tttttgtatt tttagtagag atggggtttc accgtgttag ccaggacggt ctcgatctcc 29221 tgacctcgtg atccacccac ctcggcctcc caaagtgctg ggattacagg cgtgagccac 29281 cgcgcccggc cacttttaat gttcttcacc ttgcatgtcg gcccatctgt gcatccagtt 29341 tctcctgact ggctactttg gtcaatctgt cagtttcccg gatagtcatc tgctctgtct 29401 gtccatacat ctgtctcctt agctgctggc tagagactct gtctcccgag cagaggtctg 29461 acaaatgttc taatgttctg ccctccctgc tcccctacag gcatcacggg cagttactcg 29521 gtgtctgagt ctcccttctt cagccccatc cacctacact caaacgtggc gtggacagtg 29581 gaagatccag tggacagtgc tcctcccggg cagagaaaga aggagcaatg ggtgagtctg 29641 gaacccgtct ctgcagaggc caggctgagg tcagacagcc cctattaggg ccacaggaag 29701 ttcaagcagg actcaagcct gacctgggtt caagcactac ccccacccct ggccgtgcct 29761 cagtctcctg cagatgtcag gtagaagatg ataaacttgg tgtccttgta tctctgggct 29821 tctgcttgga gaggccatat aagctggggt ctttgtcccc atcaggaggt catctccagg 29881 atcaaagacc aaaggaccaa ctctgcttgg aggaggagga cactggcaat ttgtcatcgt 29941 caccagcatt agtttctttt tttttttttt tttttgagac ggagtctcac tctgtcgccc 30001 aggctggagt gcagtagcac gatctcggct cactgcaacc tccacctccc gggttgaagt 30061 gattctcctg cctcagcctc ctgagtagct gggattacag gcgtgcacca ccacgcccgg 30121 ctaatttttt tgtattttta gtagagacgt ggtttcacca tgttagccag gctggtctca 30181 aaccgatctc aagtgacctg cccacttggg cctcccaaag tgctgggatt ataggcatga 30241 gccactgcac ctggccacag atatttactt ctaagaaaga gagagagaga gagagaaggc 30301 aggagggagg gagggatgga gggaaggaag gaacaaagcg aggaaggaag ggagggaggg 30361 agagggagag acggagggat gtgtctgtca cacgctggct cggggcttcc aatgaaccac 30421 acctttagca ccctggtggt ctcacatgcc cagggttagg cagctcatga acccaggctt 30481 gactccagga cctacaggcc acagagcttt gccttagggg ttttacttta tgaattttga 30541 tgccttgttg ttagacatgt aattgttcat gtatgtttat cctcttggat gagagctcct 30601 tttgtcagta tgaaatattc ctcacgtcac ttttaatgac acgaggaata tttagtgcca 30661 tgaggaatat tatgtccttt tttttttttt tttttttttg agatggagtc tcgctctgtc 30721 acccaggctg gagtgcagtg gcacgatctt ggctcactgc aacctctgcc tcccgggttc 30781 aagcgattct cctgcctcgg cctcccgtgt agctggaatt acaggtacgt gccaccactc 30841 ctggctaaag ggaggctgct agggaggctc catccgggag gtggaggttg cagtgagccg 30901 agatcgcacc accgcactcc agcctgggtg atagagcaag actcagaccc aatataaata 30961 aataaattaa ttaattaaat taaataaatc atgaatgttg gacagaggct gcactgaggg 31021 gaacagcatg aggaacagca agggggatgg ctgtgagtct gggtgatcgt gggacacgtg 31081 ttgagaacac tcagggcagg ggatgccttc cactcttccc caaatggtga cagagagggc 31141 tgtgtgggag ctctggtcgg actctgcacc gggcagggca gaaaggcctg ggctgacttg 31201 tgttcctttc ctgtagtacg ctggcatcaa cccctcggac ggtatcaact cagaggtgag 31261 tatgctcctg ggcacgaaga ctcaagtctt tcccccccac atctgtgccc ctgcacacag 31321 gggccaacgg aagccccttc ctccaggtgt ggggatcgta ttgggtgtct tctcaattgg 31381 tagtgtctgc tcagggtgga ttttgctgtg atctgcttgt gatgtctgcc atgggcatgg 31441 attaggagaa gagggttcac atgccacctc tccaccacgt ggtgaaggtt tgtgaggtta 31501 gaccccaaac aaaggcaatg ttgctttttt tttgtttttg tttttgtttt tgtttttgag 31561 acagagtctc gctctgtcgc ccaggctgga gtgcagtggt gcgatctcgg ctcactgcaa 31621 gctccgcctc ccgggttcac gccattctcc tgcctcagcc tcccgagtag ctgggactac 31681 aggcgcccac caccacgccc ggctattttt tttttttgta tttttagtag agacggggtt 31741 tcaccgtgtt agccaggatg gtctcgatct cctgacctcg tgatccaccc gcctcggcct 31801 cccaaagtgc tgggattaca ggcgcgagcc accgcgcccg gccaaggcaa tgtttttttg 31861 ggaggatgga gtctcgctct gttgcccagg ctggagtgga gtggcttgac ctcggctcac 31921 tgcaacttcc gcctttcagg ttcaagcgat tctcctgcct cagcctccca agttgctggg 31981 attacaggca cccaccacca tgcccagcta atttttgtat ttttagcaga gatggggttt 32041 caccatgttg gccaggatgg tctcgaactc ctgacctcag gtaatcctcc cgcctccacc 32101 tcccaaagtg ctgggattac aggtgtgagc caccgcaccc agccaaaggc actttctttg 32161 aggaaggtat gcatctgaga ctattcaagc caagccccct cagtcggttt atggcacacg 32221 tgtgtggccc tcctttttat ttttattttt ttttattttt acttttttga gacaggctct 32281 cactctgtca cccaggctgg agtgctatga tgtggtccca gctcactgca accttgacct 32341 cctgggctca agcaatcctc ccacctcagc ctcccaagta ggaacatagg aatgtgccac 32401 tgcacccaga tagtttattt ttatttattt atttatttat ttttgtacag gcagagtctc 32461 actaggttgc ccaggctggt ctcgaattcc tgcattcaag ggatcctccc acctcagcac 32521 accaaatttc tgggattata ggcatgagcc actatcgcct gacccctctt tgagctgaat 32581 ccgaaatgcc aaatacattg gacaggcacg gtggctcacg cctgtaatct cagcactttg 32641 ggaggccgag gcgggaggat catgaggtca ggagatcgag accatggtga aaccccatct 32701 ctactaaaaa tacaaaaaat tagccaggcg ccgtagcggg cctctgtagt cccagctact 32761 caggaggctg aggcaggaga atggtgtgaa cctgggaggc agagcttgca gtgagccaag 32821 atcgcgccac tgcagtccag cctgggcaac aaagtgagac tccatctcaa aaacaaacaa 32881 aaaagccaaa tacagacatt tttcctgccc tggagctcag catgagttag catattcttt 32941 gatcatatta ctaggagtag agggtctgaa acagaggagg ggatcctact ttacccccgt 33001 tctaggcctc tctaatgaat tggcagttgg aggagacatc ttgggggtgt ggtaggacct 33061 ggatcggggg aattcatgcc tcctttttgc aggtcctgga agccatacgg gtgacccgtc 33121 acaagaacgc catggcagag cgctgggaat cccgcatcta cgccagtgag gaggatgact 33181 gagcctcggg atggggcgcc caccccctgc cctgccctga ccctcgtggg aactgccaag 33241 accatcgcca agcccccacc ctaggaaatg ggtcctaggt ccaggatcca agaaccacag 33301 ctcatctgcc aacaatccca ccatgggcac atttgggact gttgggtttt tcgtttccgt 33361 ttctatcttc ctttagaaat gtttctgcct ttggggtcta aagcttttgg ggatgaaatg 33421 ggacccctgc tgattctttc tgcttctaag actttgccaa atgccctggg tctaagaaag 33481 aaagagaccc gctcctccac tttcaggtgt aatttgcttc cgctagtctg agggcagagg 33541 gaccggtcaa agagggtggc acagatcgca gcaccttgag gggctgcggg tctgagggag 33601 gagacactca gctcctccct ctgagaagtc ccaagctgag aggggagacc tgcccctttc 33661 caaccctggg aaaccatcca gtctgaggga ggaggccaaa ctcccagtgc tgggggtccc 33721 tgtgcagccc tcaaaccctt caccttggtg cacccagcca cacctggtgg acacaaagct 33781 ctcacatcga taggatccca tgaggatggt ccccttcacc tgggagaaaa gtgacccagt 33841 ttaggagctg gaggggggtc tttgtccccc acccccaaac tgccctgaaa taaacctgga 33901 gtgagctgcc tgccttggtc cctgcctggt ccggacaaag tcccctgggc ctccactctc 33961 tgtttcattt tcttggcgac attaggttcc accccgctag gccgtgagcc tgtggggctg 34021 tcaggttccc atgggggggt tgccagcatc tgcatgcagt aggtgcttag ctgatgcttg 34081 ttgagtgact gacaattgat catggagagg caccttcagc aacaccaggc ccggggccca 34141 acctcaccag cccggctcct ccagccccgt ctcaccacgt ccatcgcagt acccagggtg 34201 cttattaagc atttgctgaa tccctcacca ctctggtctc agaggaggcc ggagggtcat 34261 taattcggct ccgcatgtat accaggctca catccaggga agccagatcc tactggtcag 34321 aggcgtgggc gctggaccca agtcctggct ctgctgcctg ctgtgtggcc ttgggtgggg 34381 gggggtcctc tcactgcccc atgcctcagt ttccctctga cacacaggga taacgatagg 34441 accaggctcc cgggcagtag gagttaattt acgtaaagct ctcggcacat ggggaccatt 34501 gacgttggat tcagtgcctg atggcagggg cgtgtcctca gacacccccc aaagcaggcc 34561 tcggccccca ggctccccca accctgctgg gagcagcggg tgttaattac acctaggcca 34621 gtgtctcctc cttggctgct cagtgtccgc tgcgctgcct cagggaaaac cgcgcagcca 34681 cgccgggccc aggcaggcag atgtggaacc gaaatagctg ggcagtgggg gctgtgggga 34741 ccctggggag gcgggggaag ggggctgagg ggctacatct gggcagagca gagactgagg 34801 gaggagggag agcccctgtc ttcagggaga ccccagtctg atggaggagg catctgcctt 34861 cacagagatc ccaggctgat ggaggaggcc cctgccccca gggagacccc agtctgacgg 34921 aggaggcatc tacccacaga ggggccacag tctgacggag gaggcctctg ccctcaggga 34981 gaccctatct gatggaggag gcatctgccc tcgggggacc ccagtctgat ggaggaggca 35041 ggttatgtga ccatagaggg aggctcagtg gccatagttg gtgacttctt tcatgtcggg 35101 gcaattctat tccaacccca agaaccaaac gtggggcctg cgggaaggga gggcctcaca 35161 cacagtcctg ggtgagtccc atcagtttct ccactgtctg ccaccccatc ttcccccgat 35221 gccccatgtt ctgtccctct gtccctgtgc tctgaggatg tgtgagcctc cactggctac 35281 tataacaaat caccacccac ctaatgactt aaaacagcag acatggttgg gcgcagtggc 35341 tcacgcctgt aatcccaaca ctttgggagg ccgaggcggg cagatcacct gaggtcagga 35401 gttcgagacc agcttgacca acatggtgaa agtccgtctc tactaaaaat acaaaaaatt 35461 agctgggcct ggtggcaaat gccagtaatc ccagctactt gggaggctga ggcaggagaa 35521 tcacttgaac tggggaggtg gagattgcag tgagccaaga tcgtgctatt gcactccagc 35581 ctgggtgaca gagtgagact ctgtctcaaa aaaaaaaggc aatctctccc tgtctctctc 35641 aggctcacct tcctgcctcc tctcgtgagg gccctatgag gacactgggc ctccatgtca 35701 cccaggatgt tctcctgtct cacggtccat tactcagtcc catctgcagg tgcttctgcc 35761 acgagaggtg acacggccac aggtcccggg gattaggaaa gggatgtctt tggggccatt 35821 gttctgccca ccagaggatc tcgacatccc ccttctccgt gactcttcag tcacctctgc 35881 tcagggccgg ccaggactgt tgcacaggct gtgccctgca caagccccct gccaagacca 35941 aagggcaagg ctctgtgtgc ggccagggtc cccgtcttca gtgtccctgg ccctggctgt 36001 ggcctgttgc cagacaccgc gtcaccaagg ggtgcattat tcccacatca tcccagttgg 36061 acggtcacct caccctgccc atcgtcctgc cagcaagatg gtcccacagc ggcctcagtt 36121 ctcagaggct cggcagggtc tcccctgacc ccgtaatgcc cctcactgcc cctgggcgct 36181 gggttcaaga agaaccgcct gcttcgttgc aggaagtgat gtccaccccc aactctggat 36241 cggaagccac gatctatcct ccagacccaa caccaggtcc ccagcccttt atctctcctg 36301 tctgtctgtc catttctggt gttcaggcct ctctctccac gtctatctcc ctgtttctgt 36361 ccctcctgac ctctctcttt ttccatgatt tttgctttcc tccctccccg tctcatccta 36421 aatttgtttt tttgtatttt gttttttttt tgagacgaag tctcgctctg tcgcccaggc 36481 tggagtgcag tgacgccatc tcagctcact gcaagctccg cctcccgggt tcaagggatt 36541 ctcccacctc agcctcccga gtagctggga ttacaggtgc ccgccaccac gcccggctaa 36601 tttttgtatt tttagtagag acggggtttc tccatgttgg gcaggctggt ctcaaacccc 36661 tgacctcagg tgatccgccc gcctcggcct cccaaattac tgggattatg ggcgtgagcc 36721 accacgcccg gccctcatcc taaattctta aaacctgctc tcctctcagc catgattttc 36781 tttgtcctgt gccataaaga attctcagaa gcaggatgca gcttctgctg ctcaagggtg 36841 agcccagcca gtgggatctg gtgtcgctgc ccgcagaccc ctcagtcaag agcggggtgg 36901 ggggcagcag gagctacagt aacagcccgg atggtcctgc tcctgaaggc accccaccct 36961 gctgtccaga ggagagaccg aggctcagac atttgagccc cacataaccc agcctgccac 37021 acaccgcaga ctccaacagt gtagcccctg taccccagaa gtcacttcag caaatcactc 37081 agggctccac ctcacatctg tccccaagtc cttggggact tgctcactca gcagaagacc 37141 aagagaagcc agatgcagtg gctcacacct gtaatcccaa cactttggga ggctgaagcg 37201 ggcggatcac ctgaggtcag gagtttgaga ccagcctggc caacatggtg aaaccccgtc 37261 tctactaaaa atacaaaaaa tagtcaggtg tggtggtggg cgcctgtaat cccagctacc 37321 cgggaggccg aggcaggaga atcctttgaa cccaggaggt ggaggttgca ctgagccaag 37381 attgtgccac tgcactccgg cctgggcgac agaacaaggc tccgtcctaa agaattaatt 37441 aattaattta aaaataaaaa gaaaatgtgc actttcatct tgtattattc tgtccttttg 37501 tttgtttgag ataaagtttc gctccatttg cccaggctgg agtgcgtgat ctcggctcac 37561 cgaaacctct gcctcctgaa ttccagtgat tctcctgcct cagcctctcg agtagctggg 37621 attacaggcc cacgccccca ccatacccag ctaattttgt acttttagta gagacagggt 37681 ttctccatgt tgggcaggat ggtctggaac tccagacctc agatgatcca cccgcctcag 37741 cctcccaaag tgctggggtt acaggcgtga gacactgcac ctggcctcaa ctccatttta 37801 attaaccctg caggggccag gcatggtagc tcatgcctgt aatcctggca ttttgggaag 37861 ctgaggcagg aggtggatca cctgaggtca ggagtttgag accagcgtga ccaacatggt 37921 gaaaccctgt ctctaggaaa tacaaaaaat tagctgggcg tggtggcagg tgtctgtaat 37981 cccagctact cgggaggctg aggcaggaga atcacttgaa cccaggaggc aaagatcgtg 38041 gtaagccgag atcgcaccat tgcactccag gctgggcaac aagagcgaga ctccatctca 38101 aaaaataaaa ataaataaat aaaaaataaa taaaaataaa aaaaattagc caggcgtggt 38161 agcgcgtgcc tagtcccagc tactcaggag gctgagacac gagaatcact taaacccggg 38221 agatggaggt tgcagtgagc tgagatcgcg ccactacact ccggcctggg caacagagca 38281 agacttcgtc tcaaaaataa ataaataaat aaataaataa aataaagaga tttaggagag 38341 gacacaagca aacataactc agactgactt tggcaaaact aggctttatg ggttcccgta 38401 actgaaacat caatgtaggg ccagcttcag gctcagctgg ttccaggtgc cacacagcat 38461 cctcaggttc cttcggtgtt gggagctcag gctccaagcc caggacagat tcccagctct 38521 gccacatgcc cctgagggac cctgggcaag gcctacccct ctcccagcct cagtgtcctt 38581 atctgtaagt gaagtcgcca cattgacatc cctgccctgg gggacgtcat gagggtcacc 38641 gggcacctgg tgaaaatcag tgaggaataa gaatgaatgt tgaggccagg tgcggtggct 38701 cacgcctgta atcccagcac tctgggaggc cgaggcgggc ggatcacgag gtcaggagat 38761 caagaccatc ctggctaaca tggtgaaact ccgtctctac taaaaataca aacaattagc 38821 cgggcgtggt ggtgggcgcc tgtaatccca gctactcggg aggctgacgc aggagagtgg 38881 ggtgaaccca ggaggtggag cttgcagtga gccgagatcg cgccactgca ctccagcctg 38941 ggcgacagag cgagactccg tctcaaaaaa aaaaaaaaaa aaaaaaaaga atgaatgttg 39001 gttctgacca agtaaattaa cgaattgtca ccagacaatg acttccatgt gactttctgg 39061 tgtgagccgg agaggagccg atgcacgctg ccaccagagg gcggcaaaca ccaaggatgc 39121 agccgcggcc ctcgccccgc cagggagaaa acgcacctgc cgccactctc accttctcat 39181 tcccgtaacc ggatggtgtt tgcttagtgc tccttgaaag cctggacagg gccgggcgcg 39241 gtggctcacg cctgtaatcc cagcacttcg ggaggccaag acgggtggat cacttgaggt 39301 caggagttca agaccagcct ggccaacgtg gcaaaacccc gtctctactt aaaaaaaaaa 39361 aaaaaaatag ccgggtgtga tggcgttcac ctgtaatccc agctactcag gaagctgagg 39421 caggagaatt gcttgaacct gggaggtgga ggttgcagtg agctgagatt gagccattgc 39481 actccagcct gggtgacaga ggaagactcc atctaaaaaa aaaaattatt tgtagagatg 39541 acagagtgag actccatcta aaaaaaaatt atttgtagag atggggttct caccatgttt 39601 cccaagctgg tcttgaactc ctcaagtgat c // LOCUS AC004079 102717 bp DNA PRI 29-JAN-1998 DEFINITION Homo sapiens PAC clone DJ0167F23 from 7p15, complete sequence. ACCESSION AC004079 NID g2822174 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 102717) AUTHORS Jones,K., Hinds,K., Hawkins,M. and Duckels,G. TITLE The sequence of Homo sapiens PAC clone DJ0167F23 JOURNAL Unpublished (1998) REFERENCE 2 (bases 1 to 102717) AUTHORS Waterston,R. TITLE Direct Submission JOURNAL Submitted (29-JAN-1998) Department of Genetics, Washington University, 4444 Forest Park Avenue, St. Louis, Missouri 63108, USA COMMENT SUBMITTED BY: Genome Sequencing Center Department of Genetics Washington University St. Louis MO 63108, USA http://genome.wustl.edu/gsc mailto:sapiens@watson.wustl.edu NOTICE: This sequence may not represent the entire insert of this clone. It may be shorter because we only sequence overlapping clone sections once, or longer because we provide a small overlap between neighboring data submissions. This sequence was finished as follows unless otherwise noted: all regions were double stranded or sequenced with an alternate chemistry; an attempt was made to resolve all sequencing problems, such as compressions and repeats; all regions were covered by sequence from more than one subclone; and the assembly was confirmed by restriction digest. MAPPING INFORMATION: This clone was provided to the Washington University Genome Sequencing Center by Dr. Stephen Scherer, Department of Genetics, The Hospital for Sick Children, Toronto, Ontario, Canada. This clone was isolated as part of a chromosome 7 mapping effort supported by the Canadian Genome Analysis and Technology program. For more information, see http://www.genet.sickkids.on.ca/chromosome7 SOURCE INFORMATION: This clone was derived from human PAC library RPCI-1, prepared by Pieter de Jong and coworkers at Roswell Park Cancer Institute, using the method described by Ioannou et al., Nature Genetics 6:84-9 (1994). The library is from one male donor. For further details, see http://bacpac.med.buffalo.edu/ The clone is available from Genome Systems, Inc. (http://www.genomesystems.com). VECTOR: pCYPAC2 NEIGHBORING SEQUENCE INFORMATION: The actual start of this clone is at base position 1 of DJ0167F23; the actual end is at base position 102717 of DJ0167F23. The orientation of this clone is unknown. This clone contains STS sWSS3140 (NID:g1113702). FEATURES Location/Qualifiers source 1..102717 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" /clone="DJ0167F23" /clone_lib="RPCI-1" /map="7p15" repeat_region 1..1096 /rpt_family="L1" repeat_region 1099..1402 /rpt_family="Alu" repeat_region 1408..2184 /rpt_family="L1" repeat_region 2187..2490 /rpt_family="Alu" repeat_region 2491..2910 /rpt_family="L1" repeat_region 3131..3332 /rpt_family="Alu" repeat_region 3355..3696 /rpt_family="MaLR" repeat_region 3698..3799 /rpt_family="Alu" repeat_region 4145..4322 /rpt_family="L2" repeat_region 4926..5064 /rpt_family="MER1_type" repeat_region 5073..5322 /rpt_family="L1" misc_feature 6111..6426 /note="similar to EST AA305266 (NID:g1957592)" gene 6133..7304 /gene="WUGSC:H_DJ0167F23.4" CDS join(6133..6227,6304..6389,6916..7304) /gene="WUGSC:H_DJ0167F23.4" /note="similar to TROPOMYOSIN, CYTOSKELETAL TYPE (TM30-NM), but note similarity to numerous pseudogenes as well; 69% similar to P12324 (PID:g136096); H_DJ0167F23.4" /codon_start=1 /evidence=not_experimental /db_xref="PID:g2822178" /translation="MAGIATMEEVKCKIQVLQQQADDAEERAERLQFEEELDHAQERL TTALQKLEEEEKAADEKEADRKYEEVPRKLVIIEGDLECTEERAELAESRCQETDEQI RLMDQNLKCLSDAEEKYSQKEDIYEEEIKILTDKLKEAETRAEFTERLVAKLEKTTDD LEYKLKCNKEENLCTQRMLYQTLLDLNEM" misc_feature 6148..6426 /gene="WUGSC:H_DJ0167F23.4" /note="similar to EST AA488595 (NID:g2216026) ab39h05.r1" misc_feature 6323..6426 /gene="WUGSC:H_DJ0167F23.4" /note="similar to EST AA339022 (NID:g1991333)" repeat_region 6427..6546 /rpt_family="Alu" repeat_region 6572..6870 /rpt_family="Alu" misc_feature 6847..7045 /gene="WUGSC:H_DJ0167F23.4" /note="similar to EST AA339022 (NID:g1991333)" misc_feature 6847..7104 /gene="WUGSC:H_DJ0167F23.4" /note="similar to EST AA488595 (NID:g2216026) ab39h05.r1" misc_feature 6847..7014 /gene="WUGSC:H_DJ0167F23.4" /note="similar to EST AA305266 (NID:g1957592)" misc_feature 6865..7062 /gene="WUGSC:H_DJ0167F23.4" /note="similar to EST AA299340 (NID:g1951672)" misc_feature 6884..7073 /gene="WUGSC:H_DJ0167F23.4" /note="similar to EST AA369774 (NID:g2022094)" misc_feature 6958..7178 /gene="WUGSC:H_DJ0167F23.4" /note="similar to EST T99878 (NID:g749615) ye68d02.r1" misc_feature 7089..7322 /note="similar to EST T39754 (NID:g647441) ya11b07.r2" misc_feature 7089..7274 /gene="WUGSC:H_DJ0167F23.4" /note="similar to EST T39831 (NID:g647513) ya12b07.r1" repeat_region 7942..8238 /rpt_family="Alu" misc_feature 8370..8755 /note="similar to EST AA427386 (NID:g2111290) zw21g04.r1" misc_feature complement(8458..8885) /note="similar to EST N99988 (NID:g1271219) zb86g11.s1" misc_feature complement(8467..8892) /note="similar to EST AA086054 (NID:g1629604) zl84g06.s1" repeat_region 8991..9295 /rpt_family="Alu" repeat_region 10407..10702 /rpt_family="Alu" misc_feature 10674..11042 /note="match to EST AA663135 (NID:g2617126) ab73g05.s1" repeat_region 11251..11503 /rpt_family="MIR" repeat_region 11814..11854 /rpt_family="MIR" repeat_region 12097..12395 /rpt_family="Alu" repeat_region 12503..12542 /rpt_family="MIR" repeat_region 12567..12687 /rpt_family="Alu" repeat_region 12705..12783 /rpt_family="MIR" gene 13114..30898 /gene="WUGSC:H_DJ0167F23.5" CDS join(13114..13216,30546..30898) /gene="WUGSC:H_DJ0167F23.5" /note="40% similar to yeast high mobility group-like nuclear protein, P32495 (PID:g417360); H_DJ0167F23.5" /codon_start=1 /evidence=not_experimental /db_xref="PID:g2822179" /translation="MKQKDLCFAVRKKKNCGKQCKNKNKQASVWGISAGFRGCCDDQN KGRFDGPEAQEEACSGERTYQELLVNQNPIVQPLASRRLTRNLYKCIKKAMKQKQLRR GVKEVQKFVNKGEKGIMVLAEDTLPIEVYCHLPVMCEDRNLAYVSIPLR" repeat_region 13560..13859 /rpt_family="Alu" repeat_region 14007..14163 /rpt_family="MIR" repeat_region 14402..14604 /rpt_family="MIR" repeat_region 15123..15178 /rpt_family="L2" repeat_region 15759..16057 /rpt_family="Alu" repeat_region 16134..16433 /rpt_family="Alu" repeat_region 16868..16975 /rpt_family="MIR" repeat_region 16995..17352 /rpt_family="MaLR" repeat_region 17436..17735 /rpt_family="Alu" repeat_region 17736..18005 /rpt_family="Alu" repeat_region 18498..18788 /rpt_family="Alu" repeat_region 18806..19101 /rpt_family="Alu" repeat_region 19105..19398 /rpt_family="Alu" repeat_region 19429..19726 /rpt_family="Alu" repeat_region 19945..20237 /rpt_family="Alu" repeat_region 20652..20951 /rpt_family="Alu" repeat_region 21688..21979 /rpt_family="Alu" repeat_region 21986..22071 /rpt_family="L2" repeat_region 22437..22551 /rpt_family="L1" repeat_region 23060..23140 /rpt_family="MIR" repeat_region 26001..26458 /rpt_family="L1" repeat_region 26479..27338 /rpt_family="SVA" misc_feature 26622..27985 /gene="WUGSC:H_DJ0167F23.5" /note="CpG_island (%GC=73.1, o/e=0.70, #CpGs=104)" repeat_region 27339..27511 /rpt_family="SVA" repeat_region 27579..28533 /rpt_family="SVA" repeat_region 28594..29134 /rpt_family="L1" repeat_region 29667..29971 /rpt_family="Alu" repeat_region 30075..30380 /rpt_family="Alu" misc_feature 30552..31110 /note="similar to EST AA481436 (NID:g2210988) zv45a01.s1" misc_feature 30552..30987 /note="similar to EST AA481993 (NID:g2209671) zv42d05.s1" misc_feature 30598..31042 /note="similar to EST AA122068 (NID:g1678087) zk93c11.s1" misc_feature complement(30928..31280) /note="similar to EST AA593078 (NID:g2408840) nm97h03.s1" repeat_region 31326..31570 /rpt_family="Alu" repeat_region 31654..31942 /rpt_family="Alu" repeat_region 31947..32251 /rpt_family="Alu" repeat_region 32929..33226 /rpt_family="Alu" repeat_region 33382..33542 /rpt_family="Alu" repeat_region 33543..33841 /rpt_family="Alu" repeat_region 33854..33984 /rpt_family="Alu" repeat_region 34235..34345 /rpt_family="Alu" repeat_region 34493..34789 /rpt_family="Alu" repeat_region 35213..35429 /rpt_family="Alu" repeat_region 35452..35537 /rpt_family="Alu" repeat_region 35559..35856 /rpt_family="MER4-group" repeat_region 35857..35934 /rpt_family="Alu" repeat_region 37164..37431 /rpt_family="Alu" repeat_region 37703..37998 /rpt_family="Alu" repeat_region 38056..38348 /rpt_family="Alu" repeat_region 38384..38673 /rpt_family="Alu" repeat_region 38835..39000 /rpt_family="MER1_type" repeat_region 39114..39282 /rpt_family="Alu" repeat_region 39283..39584 /rpt_family="Alu" repeat_region 39605..39772 /rpt_family="L2" repeat_region 39790..40088 /rpt_family="Alu" repeat_region 40128..40316 /rpt_family="L2" repeat_region 40354..40635 /rpt_family="Alu" repeat_region 40659..40792 /rpt_family="Alu" repeat_region 41023..41326 /rpt_family="Alu" repeat_region 42468..43849 /rpt_family="L1" repeat_region 43930..44224 /rpt_family="Alu" repeat_region 44227..44390 /rpt_family="Alu" repeat_region 44474..44776 /rpt_family="Alu" misc_feature complement(44735..44779) /note="similar to EST AA715640 (NID:g2727914) nv89h01.s1" misc_feature complement(44735..44777) /note="similar to EST AA694589 (NID:g2695527) ah19h07.s1" misc_feature complement(44735..44777) /note="similar to EST AA086054 (NID:g1629604) zl84g06.s1" misc_feature complement(44741..44782) /note="similar to EST AA617831 (NID:g2505036) nq01b11.s1" repeat_region 44880..45179 /rpt_family="Alu" repeat_region 45401..45520 /rpt_family="L2" repeat_region 45718..46016 /rpt_family="Alu" repeat_region 46170..46469 /rpt_family="Alu" misc_feature complement(46449..46485) /note="similar to EST AA723045 (NID:g2740752) zg83a08.s1" repeat_region 47016..47203 /rpt_family="L1" repeat_region 47411..47705 /rpt_family="Alu" repeat_region 47716..47879 /rpt_family="Alu" repeat_region 48921..49212 /rpt_family="Alu" repeat_region 49728..50023 /rpt_family="Alu" repeat_region 51581..51794 /rpt_family="Alu" repeat_region 51806..52116 /rpt_family="Alu" repeat_region 52444..52745 /rpt_family="Alu" repeat_region 53113..53389 /rpt_family="Alu" repeat_region 53432..53551 /rpt_family="(TA)n" repeat_region 53572..53858 /rpt_family="Alu" repeat_region 54873..55174 /rpt_family="Alu" repeat_region 56030..56330 /rpt_family="Alu" repeat_region 57564..57709 /rpt_family="L1" repeat_region 57710..58014 /rpt_family="Alu" repeat_region 58033..58199 /rpt_family="L1" repeat_region 58212..58340 /rpt_family="Alu" repeat_region 58376..58684 /rpt_family="L1" repeat_region 58757..59032 /rpt_family="L1" repeat_region 59091..59383 /rpt_family="Alu" repeat_region 59854..60151 /rpt_family="Alu" repeat_region 60154..60242 /rpt_family="MER2_type" repeat_region 60288..60574 /rpt_family="Alu" repeat_region 60575..60774 /rpt_family="Alu" repeat_region 60834..61134 /rpt_family="Alu" repeat_region 61135..61552 /rpt_family="MER2_type" repeat_region 61577..61633 /rpt_family="Alu" repeat_region 62262..62569 /rpt_family="Alu" repeat_region 62808..63107 /rpt_family="Alu" repeat_region 63169..63367 /rpt_family="L2" repeat_region 64265..64565 /rpt_family="Alu" repeat_region 65370..65483 /rpt_family="L2" repeat_region 65486..65710 /rpt_family="MER1_type" repeat_region 65820..65883 /rpt_family="L2" repeat_region 66149..66371 /rpt_family="MIR" repeat_region 67146..67439 /rpt_family="Alu" repeat_region 67507..67641 /rpt_family="MER2_type" repeat_region 67752..67841 /rpt_family="MER2_type" repeat_region 67985..68191 /rpt_family="MIR" repeat_region 68253..68543 /rpt_family="Alu" repeat_region 69104..69143 /rpt_family="MIR" repeat_region 69217..69518 /rpt_family="Alu" repeat_region 69605..69901 /rpt_family="Alu" misc_feature 70277..71144 /note="CpG_island (%GC=63.7, o/e=0.90, #CpGs=77)" misc_feature 75698..76035 /note="match to EST AA199817 (NID:g1795542) zq52a07.s1" misc_feature 75698..76073 /note="match to EST AA069960 (NID:g1577523) zm69c05.s1" misc_feature 75712..76213 /note="match to EST AA173290 (NID:g1753422) zp31f06.s1" gene complement(76835..78307) /gene="HOXA1" CDS complement(join(76835..77190,77656..78307)) /gene="HOXA1" /note="match to U10421 (PID:g500757); H_DJ0167F23.1" /codon_start=1 /product="HXA1_HUMAN HOMEOBOX PROTEIN HOX-A1" /db_xref="PID:g2822175" /translation="MDNARMNSFLEYPILSSGDSGTCSARAYPSDHRITTFQSCAVSA NSCGGDDRFLVGRGVQIGSPHHHHHHHHRHPQPATYQTSGNLGVSYSHSSCGPSYGSQ NFSAPYSPYALNQEADVSGGYPQCAPAVYSGNLSSPMVQHHHHHQGYAGGAVGSPQYI HHSYGQEHQSLALATYNNSLSPLHASHQEACRSPASETSSPAQTFDWMKVKRNPPKTG KVGEYGYLGQPNAVRTNFTTKQLTELEKEFHFNKYLTRARRVEIAASLQLNETQVKIW FQNRRMKQKKREKEGLLPISPATPPGNDEKAEESSEKSSSSPCVPSPGSSTSDTLTTS H" misc_feature complement(76954..77193) /gene="HOXA1" /note="match to EST AA173231 (NID:g1753364) zp31f06.r1" misc_feature complement(77655..77752) /gene="HOXA1" /note="match to EST AA173231 (NID:g1753364) zp31f06.r1" misc_feature complement(77691..77752) /gene="HOXA1" /note="match to EST AA070261 (NID:g1577621) zm69c05.r1" misc_feature complement(77954..78361) /note="match to EST AA070261 (NID:g1577621) zm69c05.r1" misc_feature complement(77954..78023) /gene="HOXA1" /note="match to EST AA173231 (NID:g1753364) zp31f06.r1" misc_feature complement(77962..78384) /note="similar to EST AA199907 (NID:g1795641) zq52a07.r1" misc_feature 78119..79512 /note="CpG_island (%GC=65.4, o/e=0.80, #CpGs=116)" misc_feature complement(78599..78785) /note="match to EST AA531291 (NID:g2273997) nj09e08.s1" misc_feature 80393..80581 /note="match to EST AA336435 (NID:g1988673)" misc_feature 80908..81267 /note="match to EST R46672 (NID:g806069) yj53b10.s1" misc_feature 80940..81372 /note="match to EST AA669105 (NID:g2630604) aa81g04.s1" misc_feature 80945..81343 /note="match to EST AA744848 (NID:g2783612) ny71e07.s1" misc_feature 81116..81382 /note="match to EST AA360916 (NID:g2013465)" misc_feature 81261..81642 /note="similar to EST R55138 (NID:g824367) yj76a08.r1" misc_feature 81445..81688 /note="similar to EST W25014 (NID:g1302869) zb66b10.r1" misc_feature 81477..81767 /note="match to EST AA489505 (NID:g2219107) ab41h04.r1" misc_feature complement(81559..82053) /note="match to EST N95621 (NID:g1267891) zb66b10.s1" misc_feature complement(81607..82053) /note="match to EST AA553617 (NID:g2324156) nk82a08.s1" misc_feature complement(81612..82053) /note="match to EST R55001 (NID:g819320) yj76a08.s1" misc_feature complement(81666..82054) /note="match to EST R46671 (NID:g806068) yj53b10.r1" misc_feature complement(82172..82361) /note="match to EST AA531291 (NID:g2273997) nj09e08.s1" misc_feature 82173..82263 /note="match to EST AA489505 (NID:g2219107) ab41h04.r1" misc_feature complement(82174..82645) /note="match to EST AA707122 (NID:g2717040) zj33b11.s1" misc_feature complement(82174..82653) /note="match to EST AA290602 (NID:g1938864) zs45c05.s1" misc_feature complement(82217..82384) /note="match to EST AA677273 (NID:g2657795) zj61c04.s1" misc_feature complement(82272..82654) /note="match to EST AA677950 (NID:g2658472) zi14a03.s1" misc_feature complement(82288..82532) /note="match to EST AA489506 (NID:g2219108) ab41h04.s1" misc_feature complement(82370..82660) /note="match to EST AA688076 (NID:g2674982) nv58e05.s1" misc_feature complement(82458..82654) /note="match to EST AA678296 (NID:g2658818) zi16a06.s1" misc_feature 83092..83562 /note="match to EST W72556 (NID:g1382193) zd63c10.s1" gene complement(83121..84895) /gene="WUGSC:H_DJ0167F23.2" CDS complement(join(83121..83860,84505..84895)) /gene="WUGSC:H_DJ0167F23.2" /note="human HOXA2; 94% similar to P31245 (PID:g399983); H_DJ0167F23.2" /codon_start=1 /evidence=not_experimental /db_xref="PID:g2822176" /translation="MNYEFEREIGFINSQPSLAECLTSFPPVADTFQSSSIKTSTLSH STLIPPPFEQTIPSLNPGSHPRHGAGGRPKPSPAGSRGSPVPAGALQPPEYPWMKEKK AAKKTALLPAAAAAATAAATGPACLSHKESLEIADGSGGGSRRLRTAYTNTQLLELEK EFHFNKYLCRPRRVEIAALLDLTERQVKVWFQNRRMKHKRQTQCKENQNSEGKCKSLE DSEKVEEDEEEKTLFEQALSVSGALLEREGYTFQQNALSQQQAPNGHNGDSQSFPVSP LTSNEKNLKHFQHQSPTVPNCLSTMGQNCGAGLNNDSPEALEVPSLQDFSVFSTDSCL QLSDAVSPSLPGSLDSPVDISADSLDFFTDTLTTIDLQHLNY" misc_feature complement(83394..83862) /gene="WUGSC:H_DJ0167F23.2" /note="similar to EST W76504 (NID:g1386806) zd63c10.r1" misc_feature 84683..85139 /note="match to EST AA627201 (NID:g2540245) nq61g03.s1" misc_feature complement(84721..85038) /note="match to EST W63822 (NID:g1371548) md78e10.r1" misc_feature complement(89428..89662) /note="match to EST D79340 (NID:g1179691)" misc_feature complement(90219..90260) /note="match to EST AA688076 (NID:g2674982) nv58e05.s1" gene complement(90310..93035) /gene="WUGSC:H_DJ0167F23.3" CDS complement(join(90310..91115,92510..93035)) /gene="WUGSC:H_DJ0167F23.3" /note="human HOXA3; 95% similarity to e307530 (PID:g1888441); H_DJ0167F23.3" /codon_start=1 /evidence=not_experimental /db_xref="PID:g2822177" /translation="MQKATYYDSSAIYGGYPYQAANGFAYNANQQPYPASAALGADGE YHRPACSLQSPSSAGGHPKAHELSEACLRTLSAPPSQPPSLGEPPLHPPPPQAAPPAP QPPQPAPQPPAPTPAAPPPPSSASPPQNASNNPTPANAAKSPLLNSPTVAKQIFPWMK ESRQNTKQKTSSSSSGESCAGDKSPPGQASSKRARTAYTSAQLVELEKEFHFNRYLCR PRRVEMANLLNLTERQIKIWFQNRRMKYKKDQKGKGMLTSSGGQSPSRSPVPPGAGGY LNSMHSLVNSVPYEPQSPPPFSKPPQGTYGLPPASYPASLPSCAPPPPPQKRYTAAGA GAGGTPDYDPHAHGLQGNGSYGTPHIQGSPVFVGGSYVEPMSNSGPALFGLTHLPHAA SGAMDYGGAGPLGSGHHHGPGPGEPHPTYTDLTGHHPSQGRIQEAPKLTHL" misc_feature 90366..91165 /gene="WUGSC:H_DJ0167F23.3" /note="CpG_island (%GC=68.1, o/e=0.70, #CpGs=67)" repeat_region 102658..102717 /rpt_family="Alu" BASE COUNT 30364 a 22423 c 22561 g 27369 t ORIGIN 1 gatcctgaaa aagcatttga taaagttcaa cgtcctgtta tgataaaatc cctcaaaaac 61 ctggggatag aaggaacata tctcaacata ataaaagcta catatgacag acccacggta 121 tcatactgaa tggggaaaaa ctgaaagcct ttcctctaag atctggaata tggcaaggat 181 gcccactgtc accactgtta tttaacatag tactggaaat cctaactaga gcaatcagac 241 aagagaaagc tataaagggc acccaaatta gaaaggaaga agtcaaatta tccttgtttg 301 cagatgatat aatcttataa ttggaaaaac ctactaaaga ctccacaaga aaactattag 361 aactgagaaa caaattcagt aaatttgcag gatacaaaat caacacacaa aaatcagtag 421 catttctata tgccataggt gaacaatatg aaaaagaaat ttaaaaagta atcccactta 481 tacacataaa attaaatacc cggaaattta cttgccaaag aagtgaaaga tctctataat 541 gaaaactaca aaatactgaa agaaattgaa ggggacacca aaaaatggaa aaatattcca 601 tgttcatgga ttggaagaac caatattgtt acaatgtcca taccacccaa agcaatctac 661 agattcaatg caatccctat caaaatacta atgatatcct tccccagaat acaaaaaaat 721 tctaaaattt atatggaacc acaaaagtcc cagaataacc aaagctatcc taagcaaaag 781 gaacaaaact ggaggaatta cattacctga ctttgaatta tactacagag ctatagtaac 841 caaaacagca tggtactggc acaaaaaaag acaatagctc tatggaacag aatagagaac 901 ccagaaacag atccacacgc ctagagtgaa gtcatgtttg acaaaagtac caagagcata 961 cactggggaa aagacagtct cttcaataga tggcactggg aaaactggat atccatatgc 1021 agaagagtga aactagaccc ctatctctca ccatataaaa aaatcaaata aaaagagtgg 1081 attaaaagac ttaaatctgg ctgggcaagg tggctcacac ctataatccc agcactttgg 1141 gaggccaagg caggtgggtc actggaggtc aggagttcaa gaccagcctg accaacatga 1201 tgaaactctg tctctactaa aaatccaaaa aaattagcta ggcatggtgg tgggcacctg 1261 tagtcccagc tacttgggag gctgaggcag gagaattgct tgaacccagg aggcagaggt 1321 tgcggtgagc caagattgtg ccactgcact ccagcctggg tgacagagtg agtctctgtc 1381 tcaaaaaaaa aaaaaaaaaa aaaaaaaaaa aacttaaatc taagacctga aactatgaaa 1441 ctgcttcaag tcaaaattgg agaaaatctc tagggcattg gtctgggcaa aaatttcttg 1501 agtcccacaa gtacaggcaa ccaaaacaaa catggacaaa tgagataaca tcaagttaaa 1561 aagcttctgc acagcaaaga atacaatcaa caaaatgaag agacaaccca cagaatggga 1621 gaaaatcttt gcaaaatact cctctgatag gggattaaca accagaatat ataaggaact 1681 catacaactc tataggaaaa aaaatctaac aacccaatca aaaaatgggc aaaacatttg 1741 aacagacata tctcaaaaga agacatacaa atggcaaaga agcatatgaa atggtgctca 1801 acattactga tcatcagaga aatgcaaatc aaaactacaa tgagatacca tctcacccct 1861 gttaaaatgg cttatataca aaagacaggc aataacaaat gtttgtgagg atgtggataa 1921 aagggaaccc ttgtattctg ttggtaggaa tgtaaattag tacaatcact agggagaatt 1981 tggaggttcc tcaaaaaact gatcattgag ctaccatagg atccagcaat tccactgctg 2041 ggtatatccc ccaaagaaag gaaatcatta tatcaaagag atatttccac tcctatgttt 2101 gctgcagcac tgtttacaat agctaggatt tggaagcaac ctaagtgtcc atcaacagat 2161 gaatggataa agaaaatgtg gtacttggcc gggaacagtg gctcacacct gtaatcccag 2221 cactttggga ggctgaggcg ggtggatcac gaggtcagga attcgagacc agcctggtca 2281 acatggtgaa accccatctc tactaaaaat acaaaaatta gtagccgggc atggtggctc 2341 atgcttgtaa tcccagctac tcgggaggct gagacaggag aatcgcttga acccaggagg 2401 gagaggttgt ggtgagctga gatcacacca ttgcactgca gcctgggcaa caaatgtgaa 2461 actctgtctc aaaaaaaaag aaagaaaaaa gaaaatgtga tacttataca caatgaagta 2521 ctattcagcc ataaagaagg atgataacca gtcatttgca acatggatgg aactggagat 2581 cattatgtta aggaaaacaa gccaggcaca gaaagacaaa catcgtatgt tctcacttac 2641 ttgtgggatc taaacatcaa aacaattgaa ctcatggaca cagagagtag aaaaatagtt 2701 gccagaggct gggaaagata gtgggggaat cgggggagat gaggatggtt aatgagtgcc 2761 aaaaaaatgg aacaaatgaa taagacctac tatttgatag gacaacaggg tgactatagt 2821 ctataataac ttaaatgtat attttaaaat aacgtatgaa gtgtatcgta ttgtttgtaa 2881 ctcaaataac aaatgcttga agggatggat tttttaaaaa agaatctcat ccactgggga 2941 aaaaatttca ctaaattaag taagtatgaa atacatttaa ataagtattt aaagggtgtt 3001 tttataagtc tgtagcattt atgcttctga aattattaaa acttaaaact acactaagac 3061 ttttgggaca ctgtgtttta actgtacaat ccccgtcatg gatgtatggg tacactacat 3121 aactacacag ttttgttttt tttttttgag atggagtgtt gctcagttgc ccaggctgaa 3181 gtgcagtggc gcaatcttgg ctcactgcaa cttctgcctt cagggttcaa gcgattcttg 3241 tgcctcagtc tcccgagtag ctgggattac aggtatgagc caccaggcct ggctactttt 3301 ttgtattttt agtagagatg ggttttcgct attgatatgg tttggctggg tctccaccca 3361 aatctcatct tgaattgtag ctcccataat tctctcgtat agtgggacag acccagtggg 3421 agataactga atcatggggg tggtttcccc catactgttc tcgtggtagt aaataagtct 3481 cacaagagct gatggtttta taagaggaaa cccctttcac ttggctgtca ttctcttctc 3541 ttgtctgctg ccatgtgaga cgtgcctttc accttctgcc atgattgtga ggcctcccca 3601 gccacatgga actgtgagtc cattacacct ctttcttttg taaactgccc agtcccaggt 3661 atgtctttat cagcagtgtt aaagcagact aatacagcta tgttggccag gctggtttca 3721 aactcttggc ctcaagtgat ctgcctgcct cagcctccca aagtgctgag attacaaatg 3781 tgaaccactg tgcccagtct acacaattct ttggactctt ttggacattt gaaatttttc 3841 acaataaaat attcgaaaaa agtcgggtca agagatataa tttaattatg agaaattaat 3901 aaagatatat gaaaaaggct aagcatagaa gcaaagtgat tatcaacatt attataccca 3961 agacaaattc ttgctacagt ttagtcctcc tttcaggaac ttttttccaa gatctatatt 4021 tatgaccaag aagtgtgtct tatatccagg aagctttttt ttctttcttt ctttcttttt 4081 tgccgtaaag aatgactgca acacttccta agatgcttgg gcatgttaga tagctacttt 4141 gtgaactcat tcagctgttc gtcatgtatt cagtaagtat ttcctatgtg ccaggcacag 4201 tacacagcag tggagatacg atcataaaac acacaatctt taaccaactg actttatagt 4261 ctagctgaag agagagagga gggcagaaaa atagagtctt atgacaaagt gtaaagtgct 4321 atatagcatg ttcagcatta tagcactatt gtattacagc atattatgga accacaaaag 4381 aatagaatgc aatccaacct tgcaaataca ataatggcct ccctgaataa gtgagaggtg 4441 ttagcggtgg ggagaaggta tagaaattaa ccaggtaaag aggggattga tgaatgtctc 4501 aattagaggg aatcccataa gcaaagtctc agaggtgcct ggtaatatgg tacatctgag 4561 aaagaacaag gaactactgg tgaaagcaag tagatgtctg gaataggatg ctgatgccct 4621 ggtaagagaa gaaatttaag acattcacaa aagtaatcca ttcccagctg aaagtggaaa 4681 aaaaagccaa cgtatttatt tgattatttc agttaaaaat aaaggttttt taaattttat 4741 ttggcttcac tttatgaatg ctttgaaaca aaattttaaa gtttgaatta tccaaattca 4801 gtataattgc tttaacttgc tattgcattc cgcaaatggg aaaatagggt gaatgatgaa 4861 aaacactaaa tacacacgga tgacatggat acacaatgag tgaggtacat aaggaaactt 4921 tttaccagtg atcctcaaac ttcagtgagc acaagtggac tactttaaaa gccattttct 4981 atctttcatc cttgagactc tgacttaaaa gatctgtggt ggagtccagg aatctgaatt 5041 tttaacaaga ccacaaatga ttcttttaac tatttgcaat agaaatttta aacctaaact 5101 aaattagaga gaatactata acatcacccg catactcctc actcaccttc cataaataaa 5161 catatggcca gtcttgtttc atctacattg ccactcattt acccttcatc ctaatttctt 5221 tttaaaataa tcccaaatat cttgacattt tatccactaa cacttcagtt tgtatctcta 5281 aaaagaatga atatttttag cattatcaca atacaataat cacacctaaa gacccaaggt 5341 gatcctaatg taactggtcc agtctcagaa aaactggact tgagaaagag ggtaacaagt 5401 cagtgtctaa gtggactact tgggaagaca taacttaatt tagttcatca tagattcaaa 5461 ctctttgttc aagaacaagc aagctattca gttatgtttc tgaaactgtt accttaaata 5521 ttcaaaaaat gtatttatgg aaagcataaa attaacatac tgaattgaat taaaaaaaac 5581 tttttcaggg ttattgatca aaatttttaa ggaaaaagaa gttgttttgt taaatacatt 5641 tttacacctt atcaaacctc aaagaggaaa gaacaaattg ctcaatttgc ttttcctcca 5701 gagttacttg gatttcacag agtttaaaaa tatttttctc aatgaacctc agaaaagttt 5761 ctcagagtaa tgaagaacac aaaatgtgcc ctgataactt cctcacaatt taattaactc 5821 caaatgctaa agaatcttgc tatttataat catatctata gaatacaatt tgaatatcaa 5881 ggttagagaa atctgattcc ttcagtcata cttcagctcc tttcttttac ctgctttatc 5941 atccccaatc agaaccaata tttataaagt tcaagtcagt atttccttga catttttatc 6001 tcctttcatc tgttgcttta aaaacaaaag ggaacaaaac ttaatacaga aaaataaaag 6061 aaaggagaag cagctgcaat gctgagcaga agaggcagga accagaactg gagcagtagc 6121 tgggtcggca ccatggctgg gatcgccacc atggaggaag tgaagtgcaa gatccaggtt 6181 ctgcagcagc aggcagatga tgcagaggag agagctgagc gcctccagtg agaagttgag 6241 gaagaaaagt gggccgggaa caaggaggct gaggtggcct ccttgaatcg taggatccag 6301 caggtttgaa gaggagctgg accatgctca ggagcgcctg accactgccc tgcaaaagct 6361 ggaagaagag gagaaagctg ctgatgagag tgaaagagat atgaaggtta ttgaaaacca 6421 ggccttgccg aggcccggga atggcgtgac ccccgggagg cggagcttgc agtgagccga 6481 gatcgtgcca ctgcactcca gcctgggcaa caaagtgaga ctccgtctca aaacaaacaa 6541 ataaaaccag gccttaaaag atgaagaaaa gggcccagcg cagtggttca aacctgtaat 6601 cccagcactt tgggaggcca agccaggcga atcacgaggt caagagtttg agactatcct 6661 ggccaacacg gtgaatccct gtctctacta aaaatacaaa aaattagctg ggcgtgatgg 6721 cacacgcctg tagttccagc tactcgggag gctgaggcag gagaatcgct tgaacccagg 6781 aggcggaggt tgcagtgagc tgagatcgcg ccaccgcact ccagcctggc gagacagtga 6841 gactctgtct taaacaaaca aacaaaaaga tggaactcca ggaaatccaa ctcaaagaag 6901 ctaagcatat tgcagaagag gcagatagga aatatgaaga ggtgcctcgt aagttggtga 6961 tcattgaagg agacttggaa tgcacagagg aacgagctga gctggcagag tcccgctgcc 7021 aagagacaga tgagcagatc agactgatgg accagaacct gaagtgtcta agtgatgctg 7081 aagaaaaata ctctcaaaaa gaagacatat atgaggaaga gatcaagatt ctcactgaca 7141 aactcaagga ggcagagacc cgtgctgagt ttactgagag attggtagcc aagctggaaa 7201 agacaactga tgacttggaa tataaactga aatgcaacaa agaggagaac ctctgtacac 7261 aaaggatgct gtaccagact ctgcttgacc tgaatgagat gtagaacacc ccagtcctac 7321 cctgctgctg ctcctccctt tgtccctgac tccacctgag gccggcctgc ctgaagctga 7381 tctttaactg agggctgatc tttaatttgg gggctgcttt ctactttcgc agccccctcc 7441 ttccctgttt cttttttgcc aaactgtctc tgcctcttcc tggagattcc agctgggcta 7501 aaggctgagc acctttggaa acaacattta agggaatgtg agcacaatgc atagtgtctt 7561 taaaagcatg ttatgatgtg caagtgtctt taaaagcatg ttacgatgcg cacattttgt 7621 aattaccttt tttgttgttt tgtagcaacc atttgcaaaa cattccaaat aattccacag 7681 ctctgaagca gcaatctaat ccctttctca ctttcggaag gtgacttttc agctaaatgc 7741 atattgccct ctccatagag gagaggaaaa agtataggcc tgccttactg agagccaaac 7801 agagcccagg aaaagactcc actatgagaa acctcgttgc tctgtacaaa ataccagcca 7861 aaccagaaag gtgattccag gaggagttag ccaaacaaca acaaaaaaaa aacaaaaaag 7921 atattttaaa gctgaaaacc tggctgggtg ccgtggctca cgcctgtatt cccaccagtt 7981 tgggaggcca aggcaggtgg atcacgaggt caggagtttg agaccagccg gaccaacgtg 8041 gtaaaacctc gtctctacta aaaatacaaa aattagccag gcgtggtggc gtgtgcctgt 8101 aatcccagct actcaggagg ctgaggcagg agaatcactt gaacccagga ggcggaagtt 8161 tcagtgagcc tagatcatgt actgtactcc agcctgggcg acagagtgag actctgtctc 8221 aaaaaaaaca aagaaagata tctttggaca acgttatttc aaattttttt cattagaagt 8281 gaccaaatta agatggtaag acctctgaga ccaaattttt gtcctcccaa ctgcttacag 8341 aatggatcac gtgcccctta ggttgaggtg actacttaat tgctttccta ccttcttgaa 8401 agaaagaaag attatgtttt cgccactgat ttagccatgt gaaactcatc tcattagcct 8461 tttctgggtt tgaagttgct atctctaaaa gtgccatctc attgtgcttt gtatcagtca 8521 gtgctggaga aaccttgaat accttatata caaaattttt ttaaattttg tattattttg 8581 aaactttgct tccttgggtt tgtggcaccc tggccacccc catctggctg tgacagcctc 8641 tgcagtccgt gggctggcag tttgctgatc ttttaaagtt tctttcccta cccagtcccc 8701 attttctggt aaggtttcta tgaggtctgt taggtgtaca tcctgcagct tattggcttg 8761 aaatgtactc tcctttgatg tggtctcttt ggggccgatt gggagaaaga gaaatcaata 8821 gtgcaactgt tttgatactg aatattgata actgtctttc tgatataaaa aaccagtccc 8881 tccaaaacaa cacctgagtt tatagctgaa tatcagaagc aatattaagt tttctttctt 8941 aaccacgcag atcagaaaat cttttgcaac attaacattt taaaatatca ggctgagcac 9001 agaggctctt gcctgtaatc ccagcacttt gggaggccaa gatgagagca tcacttaagc 9061 ccaggagctc aagaccagcc tgggcaacat agggagaccc catttctaca aaaaataaaa 9121 aattagctag gtgtggtggc atgcaactat ggtcctagct acttagcacg ctgagatggg 9181 aggatcgctt gagacaagga aattgaggct acagtgagcc ataatgatgc cactgcactc 9241 cagccttggc aacagagtgt gactctactc atctcaaaaa acaaaacaaa acaaaaaaaa 9301 cttacatttc actgtcaatc cctaaaatca catttgtgta acactggtgt aaattatata 9361 tttgcgtttt tcagctatat tccatttttt aaagtacagt gatgatctgt gatagcttac 9421 aagctatttc ttatttatca acagtatgaa accatgaatc catatcaaaa ctatgacata 9481 aaatcctaca tatgtgatta ttaaccttag aatgcctaag atgaaactgt tccttgatac 9541 tgtcattaaa agaagcattt aataaagtac taaacatcac atgtttaagg tatttaattt 9601 cttttttaaa ctctttaccc tttgctgagg ttttttattt gacatgtgtc caaggtaaaa 9661 agacaaatct actcttttgc tctttgataa ggtgtcagag cttacttaaa ggtttaggaa 9721 tagagttgtt gaactaattt ttttgtactg caagttatat atgagaattg gtagcagtct 9781 gttgaaagaa gtttacctgt ggttaatagc acttatctca cttctatcta tcatgccatc 9841 aatttcttta ttacaagagt ttacctttgg gcagctcata caccagctgt caatctgtcc 9901 tcaccccgag actttgcagt aaaccaagct caggtgtgtt gaggtttcac catgctagtt 9961 gttttccttc tttctccaac tagaacaggt gtagtcaaaa tgcaagcctc tggcactcca 10021 gatgtgcccc atcacaagac agagccagaa cttcttcatt cctgcctcct agatcacaac 10081 ttcgtatctg gagattttct agatgttgct tcttgtctta attttacctt tcctgcccct 10141 tcactgaagc attttaagtg gtagctttaa aaagactaaa aaactccaat gtccatcaat 10201 aaaggactgg tccaataaag tatactccac gaaacgaatt actatacctt gtaaaaaatg 10261 agaaaggaaa aaatgaaaat gctctctaag tagtaatatg gaaagatttc caagatacaa 10321 taagttaaaa aagtaagatg cagaatagta ttaccttttg tattttaaaa aaaatcataa 10381 ttgcttctat attcataaag aaacttgcca ggagcagtgg ctcatgcctg taatcccagc 10441 actttgggag gccgaggcgg acagatcacg aggttagcag atcgagacca tcctggctaa 10501 cacggtgaaa tcccgtctct actaaaaata caaaaaatta gacagttgtg gtggcgggcg 10561 cctatagtcc cagctactcg agaggttgtg gcaggagaat cgcttgaacc cgagaggcgg 10621 agcttgcagt gagccgagat cgcgccactg cactccagcc tgagcgacag agcaagactc 10681 agtcttgaaa aaaaaaaaga aactctgaaa agatagaata caaactaaga atactgatta 10741 ttggggttgg ggtgtttagg aactaagcat gatgaaagga gtggaaagga gaattttcat 10801 gacatgcctt cttattcttt ttatttttga actatattgc ctattcaaaa tattcaattt 10861 aaaaagaaag agataatagc ttcattcagt gcaaggaggc taaaagaaag aagaactaca 10921 ctaagtaaga aagtctcatt cttactcagg ctattggcac tatagtttgg gggaaagaat 10981 aaattatcag gaaactgaaa aataaaattt agaaaatcac cagtaaaagc agctaggact 11041 ggttctggtt gaaggacagg ctaaaagaca aaaacattag ctaccaatct tatggctttg 11101 ctgattcatc tgtgttttaa cagtgagcaa aaagtatttt gaagacaaat aacattacag 11161 caggatcaaa tgctctctcc ttaagcattt tcttcagttt tctaaaagat atcccatgtg 11221 ctgcttctct cacatcccag attacaaagg cagcatggtg aagaggcaag cactggagcc 11281 ctggtgcaat gctgactagc ctcagttcca gctttaacac ttaggtgctg tatggcctgt 11341 acacattatt taaccacatc gtgctttgtt tcttcacctg tacagtggcg ataagaagag 11401 tatctacctc ataaagtcac tgtgaggatt aatcaagtta atacaccaaa acatgtagaa 11461 cagcgcctgg ctcacagtaa gaactctgta aatgtttgct cttttacttg ctcttcaact 11521 gttttgctgt ttgtctttct ttaaaagtgc agcctccaag gcctaaaact taccctttcc 11581 acttcccatt aacccttgtt cttttttttt ctccattttc cttctacttt tcttttttaa 11641 agagggatac aatcggtcac taaaatactc ccacagcatt tttgagacaa gatcatctca 11701 aagaaaaata ttcataaagg aaatagaaag tgtggtgttt gccagaaaaa ttcagaattt 11761 aaagcattgg cataaagata attccttaca tttctatatc actctatata gtgatatagg 11821 ttatctcatt tgatccttac aacaaccttt tgagaccatt agggcagtaa aagaatacaa 11881 ggcaaatgag attaaaaggg ttgcctgagg ctaaattggt gacaatatct gaccagctca 11941 gttcgacaac aatttattga acacctgcca cagcttggcc acttgtgctg acctccaaat 12001 tctaaatgct ctctgctaca aagagtctac atggtataaa acacctctca agagcttcac 12061 tcaacttgat aatcatgcta cctaaaagcc actataggcc gggcacagtg gctcatgcct 12121 gtaatcccag cattttggga ggccgaggca ggcagatcat gaggtcagga gtttgagaca 12181 gcctgactaa tacaagtaaa accccgtctc tactaaaaat acaaaattta gccgggtgtg 12241 gtggcacagg cctgtaatcc cagctactca ggaggctgag acaggagaat tgctagaacc 12301 agggaggagg aggttgcagt gagccgagat agtgccactg cactccagcc tgggtgacag 12361 ggtgagactc cgtctcaaaa aacaagaaaa gaaaagagaa gagaagagaa aagaaaagaa 12421 aagaaaagaa aaaaaagaga aaaagctgct atagtatcag atagcagaaa ggaacaggga 12481 ttccctggca aaagtatatt gggctgcatg atcttgggca aattacttaa ctactctatg 12541 ccagttttct cttctttttt gttaaattct ttttctttta tagagatagg atctcctatg 12601 ttgcccaggc tgttttcaaa ctcctgggct caagtgatcc ctctgcctca gcctcccaaa 12661 atgctgggat tacaagcatg agccacctca accaccagct aagccagttt tctcatacct 12721 aaatatatgt attatactta ccaccaggtg gttgtaaggg ttaaatgaga gaatgaatat 12781 aaaatctgtt gtaaactgta acatcccatc gggtattagt ttttttggtg ttaatctgat 12841 ttagcagtgg aatgcaagat ggaacctagg cggggggcgg agggaaaaaa agtctgatag 12901 aaaaacatct ttaggtactc agcccttaaa agacctccca gggagagtag caacattaca 12961 aaaactcaca ctgtttcccc tgtggagggt acccactgag ctaacagagg taggcctcat 13021 gggtcagccc cattctacca aatgtattct ttcccatttg ctccagctgg tggtaacagg 13081 tacagaggga gcaaaccaga cagctggtgc cacatgaaac agaaagatct gtgcttcgcc 13141 gtaaggaaaa aaaaaaactg tgggaaacag tgcaagaaca aaaataagca ggcaagtgtt 13201 tggggcattt cagctggtaa gtgagagaga gaagcagccc caagacagat gcagtccacg 13261 ccctccatct gtaaccagag agtggctcac atgtcagaaa aaagtttgtc ctcccttagg 13321 ggaatcttga aggctgtcaa tgttctaacc atgaggaatc acaaaatcta tacggaacat 13381 tccaacatgt gcaggcctga taggtctcag agcaatgttc tatgggagtc tgagttttgc 13441 aggcttctca cttggtggac tgggaatgtc agtgagaggc gtgctttaca agggagggga 13501 agtggtgtct gaatctgctc agagacaggc ccactactta atttaagaag gcagtagaag 13561 gccgggcgca gtggctcacg cctgtaaccc tagcactttg ggaggccaag gagggcggat 13621 cacctgaggt caggagttca aaaccagcct agccaacatg gtgaaacccc gcctctacta 13681 aaaatacaca aaaattagct gggtatggtg gtgggcgcct gtaatcccat ctactcggca 13741 gctgaggcag gagaatcgct tgaacccaga aggtggaggt tgcagtgagc tgaggtcgcg 13801 ccactgcact ccagcccagg cgacagagtg agactccatc tcaaaaaaaa aaaaaaaaag 13861 ggcagtagaa cagggtggtt aagtaagcca actgtcagtc aaacaacctg agttcaaatc 13921 ccaaatccac tacttactta cttaacctat gtgatttagg gttgcttaat tttgctaaat 13981 tgcttaactt tagtagcaac atttcctaac tttggtagca aagttgctta acctgactaa 14041 gcttcatttt ttgatctgta aaataggggc gaatagtaga agaatgttgt aatgaataag 14101 atgcttgtaa aacaattagg aagtgcctgg tacaaagtga gcagccaata aatgctggct 14161 atttcagtat tgtttcaaaa ggcaagttta ttaggttacc tagttcctaa ggcctagaga 14221 acaggaagtt aaaagtgaaa gaaagtgaaa atgtttctgt agcatcaaac agtagcacat 14281 tcactggcaa gtgagataag tttcaaatga aaccttatga actattttag ataagttaaa 14341 catacacgta gaaagacaat gactaaatac tcagtcacta aatactagaa tcagagctct 14401 atttggattc cagctccatc tattaatagc tctgtgccct tggctttatg cctctgttcc 14461 acatttgtaa aatgaggatg ataatagtat atacctcata atgttggagt aaggataaaa 14521 ataaatgagg caatgcctgt ctcagtacag gccttagtac agtcactacc aatgaaaagt 14581 acttaataat tagtaggtac tgtttccttt attacaaaaa aaaaatcaga gggctatttg 14641 ttaaccaaat tgaaagccta actctctgaa cagagaaagt aaaatatagt gttttttgag 14701 aatcagttct tgctcttacc agaagtaact tttctatttg aagaaaatgt gtcagtcttt 14761 cctaggaaga actaatttac ccagggtatc ataacacttt tagattggtc caccctacaa 14821 tcttgctctt actattgtgc tctaactaaa ctaagcagaa aatttatctg aggtaattga 14881 aaaatctgaa ttgtctgaca gctatgatta atattctgcc catatttata attaagtcat 14941 agccataaaa taaggcaaat gtctcaaaaa attctagcaa aaattaaact acatttgctt 15001 aaagatttat cagtcttaca attcattcca catatttaaa aagtttgcca gttcagtaaa 15061 atttagtagt aacaaaataa aatgtatatt aagtacatca gaaattaaaa atacacccac 15121 tcacttattc attcaaaaaa catttctgaa tgcctactat gtgccaaaca ctgccctatg 15181 ttttaaatca aatatactgt atttgttgcc ttttacaaac ttctttcatt tatcattttc 15241 catatttcat tcaaactttc acataacctg ataacactgc aaactgatta aaagagttaa 15301 ctataaaagc cctgtcactt taagaatttg taattttttc cattaacatt cctctctggt 15361 tttatgtact tggtctgtgc tgtgggatgt agcataaatt atctgccttc agcaaaacga 15421 gcagctgaaa tgtgaggctg cctgacagtg cttgttaagc cattagctat tgtgagcaca 15481 cactactagg gatgtctggc tctcatgaga agtgtttaag tggttcaaaa tataggagtt 15541 tggtcatgca gtacaaaaaa aacaagccca gtattttagc accatgctta ggggttaatt 15601 aataaaaagg atgaaaacaa tcctgggatg tttattttta aagaaaagta ttacaagtga 15661 acactttgtg atttactgaa agccagtaac aggataggat gaaacaatat ttttagcaaa 15721 atccttctac cacagtccaa gaaaataagt gtatttcttt ttttcttttt tttttctgga 15781 gacggaatct cactctctcg cccaggctgg agtgcaatgg catgattttg gcttactgca 15841 acctccgcct cccgggttca agcaagtctc tgcctcagcc tccagagtag ctgggattac 15901 aggcacctgc cagcacaccc agctaatttt tgtattttta gtagagtcgg ggtttcacca 15961 ttttggccag gctggtcttg aactcctgac ctcatgatcc acccgtctcg gcctcccaaa 16021 gtgctgggat tacatgggtg agccaccgcg cccagccaaa aataagtgta tttctaaaac 16081 acaaaagtgg ttgtttgttt aactaaaaat tatttttaaa atattacctc tttggccggg 16141 tttggtggct cacgcctgta atcccagcat tttgggaggc cgagggaggt ggatcacaag 16201 gtcaggagtt caagaccagc ctggccaaga tgatgaaacc ccatctctac taaaaataca 16261 aaaattagcc agatgtggtg tcaggcgcct gtaatcttag ctactcggga agctgaggca 16321 ggagaatctc ttgaacatgg gaggcagagg ttgcagtgat ccgagatggc accactgcac 16381 tctagcctag gcgacagagc aagactccgt ctcaaaaata aaaataaaat aaaaataaaa 16441 aaatatatat atcctctttt attctcccca catccatttg cttgcactat ctgcaataac 16501 ctgctatacc cctccactgc tcccttatct ctataccagg ccttatggaa atccttttca 16561 gttttgtaca gcccaactca gaagcacccc ctcttccata tgcctcctta ccctcatctg 16621 ccaccccaga gtgttttcac atggccctgg ctgaaagaga tctcagcacc cagagaaatc 16681 tgggcactaa ttcattgtcc ttcttacctc ttctgtacct tctcttaggc catcttgttt 16741 tatacttact ggtgtctata tctatctcgt caactcgact ttgagtttat tcattttgca 16801 tttcacacaa tgcctagttc aaattctgtt aagcgtcagg tgaataagcc tgcttaatac 16861 attcatactt tatgtacctc aatttcctca tctagcaagt gagcctaata atatctgccc 16921 tgcctaagta gagtttgtta tgccatcaaa tcagataaca tatgtaaaag cgtttttgaa 16981 aactactgca catttgtctt agtccattca gtctattatt tttaaaaata ccttagactg 17041 ggtaatatac aaagaaattt atttctcata gttctagaag ctgggaagtc caagatcaaa 17101 atgccgacag attcagtgtc tggtgatggc tttttgcttc acagatgtgt cttcttgcta 17161 agtcctcaca tagtggaaga gcaaaaaggc tccctcaagc ctcttttata aaagcaccaa 17221 tcctattcat gagggcagag tcctcatgac ctaatcacct ccccaaagcc ccactattta 17281 atattattgc attggagatt aggtttcaac atacgaattt ggtgagggaa caaaaacatt 17341 cagaccatag cacattcttt ggcaagcaat ttggcaatgt ttatcaaaag ccttaaatct 17401 ttttcttttt tctttttttc taagagacat ggtcagccgg acacagtggc tcacgcctat 17461 aatcctagca ctttgggagg ccgaggcagg tggatctcct gaggtcagga gtttgagacc 17521 agcctgacca acatggcaaa accctgtctc tactaaaaat acaaaaatca cccaggcgtg 17581 gtggtgcatg cctgtaatcc cagctactca ggaggctgag acaggagaac tgcttgaacc 17641 caggaggccg aggttgcagt gaactgagat cgtgccactg cactccaggc tggcaacaga 17701 gcaagactct gtctcaaaaa aaaaaaaaaa aaaaaagaca gggtctcact ctatcaccca 17761 ggctggagta cagtggtatg ataatagctc actgcagctt caaactcccg ggttcaagca 17821 atcctcccac ctcagcctcc tgagtagcta ggcgtataag catgcaccat catgcctggc 17881 ttttttttaa tttttttgta gagatggggt ctcactatgt ttcccaggct ggtcttgaac 17941 tcctgggctc aagtgatcct cctgcctagg cctcccaaag tgctgggatt aaaggcatga 18001 gccacactgc acctggtctt attttttctt attcactaat tatacttcca gtttttaatc 18061 ctaaggaaaa aaatgtaaac atggaaaaat tttaagcaca acaatgttca gatattacat 18121 ctaataataa tgaatgtgaa ataatttaaa catccaacaa aggggaatgg ctaagtaaat 18181 tactgcatat tgacttgatg aaacattata caataatttt taaattataa acattatttt 18241 aagagatgat ataattacag aaatgaggaa tgtttaagat gaagtgttaa aagaaaaaat 18301 gtagcatcga aattgtacat ctatcttgac aatatataaa aaagaaacat caattcacca 18361 aaaaaaaaat gattgcaagt acactaaaat gctaataaaa gttattccaa ggtaagagaa 18421 tgagtagttt cttttctata ttattttatg ttttccaaat gttgttaagt acacattacc 18481 ttgaaaataa aaagactcca ggcatagtgg ctcacacctg taatcccaac actttgggag 18541 gccaaggcag gaggattgct tgaggtcagg tgttcaagac tagcctgggc aacacagcaa 18601 gaccctgtct ctacaaaatt tttttttaat tagctgggta tggtggtgtg tgcctatagt 18661 cccagctact ccggaggctt gagtcaagag gattgcctga acccaggagt ttgagactgc 18721 agcaagctat gattatgcca ctgcactgag gcctgggtga cagagtgaga ccttgtctct 18781 taaaaaaatt taaaatagga ccagggggca ggcatggtga ctcacgcttg taatcccagc 18841 actttgagag ggcaaggagg gcggatcatg aggtcaggag ttccagacca gcctggccaa 18901 cacagtgaaa acccgtctct actaaaaata caaaaattag ctgggcatgg tggcgggcac 18961 ctgtaatccc agctacttgg gaggctgagg caggagaatt acttgaatcc gggaggcgga 19021 ggttgcagtg agccaaatca cgccactgca ctccagcctg ggtgacagag ctagactcaa 19081 tctcaagaaa aaaaaaaaaa atagggctgg gtgcggtgac acatacctgt aatcccagcc 19141 ctttgggagg ccgaggcagg tggattcgag accagcctgg cttacatggt gaaaccccat 19201 ttctactaaa aaaaatacaa aaaaattagc caggcatgat ggcaggcacc tgtaatccca 19261 gctactaggg aggctgaggc aggagaatca cttgaacccg ggagggaggc agaggtcgca 19321 gtgaaccaag atcgcgccat tgcactcctg cttggacaac aagagtgaaa ctccatctca 19381 aaataataat aataataata ataataataa taataataat aataataagg ttgggcatgg 19441 tggctcacgc ctgtaatccc aacagtttgg gaggccgagg cagatggatc atctgaggtc 19501 aggagttcga gaccagactg gccaacatgg taaaacccca tctctactaa aaatacaaaa 19561 attagccagg tgtggcagcg cacctgtatt cccagctact caggaggctg aggcaggaga 19621 atatcttgaa ctcgggagat ggaggctgca gtgagctgag attgtgccac tgcactccag 19681 cctgggcgac agagtgagac tccacctcaa aaaaagacaa aataaataaa caaataataa 19741 aaagaaagat aaactttcaa cagtaattaa aaagctatat aaatacatga atttaagagc 19801 tatgaatatt gtatctttgg attgcatcac ttcactatta tataggctct aggatagctc 19861 attttcttaa ataaaaataa actaactaaa tcagactatg ttgtacttca ggcagtactt 19921 gttagacaac aaaaataaaa tttaggccag gcacggtggc tcacacctgt aatcccagca 19981 ctttgggagg ctgaggtggg tggatcactt gaggtcagga gtttgagacc agcctggcca 20041 acatggtgaa accccgtctc tactaaaata caaaaattag ctgggtgtgg tggtgcacgc 20101 ctgtaatccc agttacttgg gaagctgagg cacgagaatc acttgaaccc aggaggcaga 20161 ggttgcagtg agcagagatg gtgcctctgc actccagcct gggcgataga gtgagactct 20221 gtatcaaaaa taaataattt aatgtaattt aatttaaacc agatttcttt ggtcctaact 20281 taaaaataat ttcaataatt ttaaccagtg agtaaaaatg tcactccttt cctttttatt 20341 actcaaatat attctcactc actgtaagaa cctgtcttct gtctctatag gtgtgtcata 20401 aatgctaatg ttatctttaa gagtatttca gaatccaaag ctaccttttc tttcactcac 20461 ttctgaacat taaccttctt aagtaaactt gctctgtttg aaagaatcat atttgcccat 20521 gtgttcaaca agcctaagca tttttctaga tcatcatcaa atgtcaaggt tcacaagttt 20581 ttcaaatgta cttaatggta cttaacgttt ccatactggg ggaaattttt tttacaaaaa 20641 gaagaggggt ggggcatggt gactcacccc tgtaatccca gcactttggg acgccaagac 20701 aggtggatcg cttgagccca ggagtttgag accagcctgg gcaatatagt gagacttcat 20761 ctctacaaga aacaatttaa aaagtagcaa agcatagtgg caggtgactg tagtctcagt 20821 tactgaagag gctgaggtgg aaggactgtt tgagcccaaa aggtagaggc tgcagtgagc 20881 tgagatcgtg ccactgcact ccagcctggg taacagagca agaccctgtc aaaaaaaaaa 20941 aagaagagga agaaaagaaa atgtctacta cttgtcagaa aatctcattc cataaagtca 21001 tgtttctaaa tgtgaatgat tttgtgttac cttaaatata aaattaaaat acttgttttt 21061 cattttctgg taattttcat gcacttttta aaacacatct cttaccggag tttgcaattc 21121 ccattctctt tctagtaata tctatattga tttaatttac atagcaaata gtctttacca 21181 aggagaaaaa aatctaactt ttaaattttt tccaaaccca aatcttcctt ttagagaact 21241 tgaataaaaa aactgcctta tttgactatg tcattgctct ttcccaagtt aattttatga 21301 ctatgtcttt gtattgaatt tgttcactca catctaactg acccaaatgc cttatcaagt 21361 gcccaaatca ggtcaaggat aagagctgct agtgatgcta ttttgagaag aaacaaaatg 21421 tactgtaata taccagttcc cattatgcca aaatgcacca tttctttggc acatacatgt 21481 tatgttttac caataagaga aacagtgatg agctgagaag gtagttcatc tgtacagaat 21541 ggtttgacta ttttgtctat ggatattcac tgaaaccttt ctaaagtgtg gttagtatat 21601 gaaagcatat attttgattt acaaatgcag aattttactt gctacatata ggtgatcatt 21661 agtcctttca aggacgggaa atgtcccttt tttaattgag atggagtttt gctcttttgg 21721 tccaggctgg aatgcaatgg tgtgatctcg gctcactgta acgtctgcct cccggattca 21781 agcgattctc ctacctcagc ctcccaagta gctgggatta caggcatgca ccaccacgcc 21841 cagctagttt tgtattttta gtagaggcag ggtttctcca tgttggtcag gctggtctcg 21901 aactcctgac ttcaggtgat ccgcctgcct cagcctccca aagtgctgcg attacaggcg 21961 taagccaccg tgccggacca agaaatgtat ttcttttacc tttgtacact gagtatctag 22021 cccattgcct gcacattgta agtgcttaag aagtgtttgt taaaccagtg ataccacaac 22081 acagttgact tattgaaaag caactgccaa gttgtaccat actgcaacag gcactgtggc 22141 taattagctc agtttattag tgcaaggaag tgaggtcaag gactcaattc ccaaaaggat 22201 cttttggcct tattctagtc acaatttcat accttgttcc atccaacaag ccatctctcc 22261 tatgactgca aggagaatca gacaagcaga ggcagaaaga aatagtttaa cacaattcat 22321 caccattcta atacagataa tttagaaatt aaacaggtaa taaatgaact gtgcatcctt 22381 ttgatatgaa aaattccaat atttcagtat gtgagatttg gttataaggg taaaataaaa 22441 agttctggag ataatggtta caaaacaatg tgtatgacac taaattgtac attttaaata 22501 gttaaatggt aaatttaatg ttatgcatat tttgccacaa taaaaaataa agttaaatta 22561 ttaaaatgtg aaaaaaaatt ataagttaac acagcttaaa ttaacatcca gcaacctcac 22621 atgtaaaata aacaggtaaa aaatgaaaaa aaaagagtaa aataataaca ttaactgctg 22681 atccttaaag taatacaaac taatatagaa accctgtatc attttattat aactgtaaag 22741 tttgtcacag agttcttatg aggataatgt cttgtgattg ctataaaaag tcatttaatt 22801 caaaacattt ggctaataaa ggatttcttt tttttggaca agggaatttc aataaaatgc 22861 tttgcaaaat aaagaaaatg aatggataca agacatatag tcttaaaaat aaagcttttc 22921 tccaaggact ctcaaataaa cttctattat gaacagagcc tatttcagta gtttgaacac 22981 aggagaattt taatttgttt tcaagatttc actggctcta atgtctcagg aaagtaagcc 23041 agcttctata tacacatttt ctatataagt ggaaataata ctacttactg cttcatatgg 23101 ggtgctatgt gaattaattg gttaatggtc ttaaagcact gcctaagtac taaacgttat 23161 cattagtgtg cattgtatat ctctttgcaa ttatctgcca ccttaaaatg ctgtggcttg 23221 taaggctgcc tctctcttgg aatatttaca aaaggttaaa gaaagttcaa gaggaaaaaa 23281 agcatgtagg ctaaggtaag gcagaagaaa gggaaattca aaatctgaca tgttttatta 23341 catgcttttt gttacatcgt attacatgtt tcttgaaatt acagatcatt atgctagaag 23401 aaaaaattta gtcacacaca ttttgaattc aaaatatgct acatagtttt aagaaatcaa 23461 tctataatag aactaagcaa ataaagtata cattgcatta agataaagtt ataatttatt 23521 ttattataat ttaattgtta atcataatag ttacatgttt aatcttatta gttgaaagag 23581 cctatctgac atctgtgaga ccagatccaa taatgcctgt gttttttttt tttctgctgt 23641 taaagtcttc aaatattttt tggaagataa attcttaaat gcacagaata aaatttctac 23701 tgaattaata agaaaaacaa tagtagtagt agtaatacct tacacttaca cagtgattta 23761 taattatacc aggtagcaag aactgtgaag gtctgagact ttaccttacc agtctgcaaa 23821 ctagtctgct agtttcatgg gtgctgacag aagacacaag actcctggat caaagacaag 23881 tgacagttta ttactaatag caatggcagt agccagagca tcttcatttt cttgtggcag 23941 tagcccaagc cccagttccc acagggtgac acaagggcta cgagacatgt gcacacacac 24001 tgggttgtat tacagaagag taaccctgag cttagaaaac ctcagtcttt tccctttgct 24061 ccagagaaag acattctctc ttccaaggct gtttgctata ccaaaattca tgaaaaaaag 24121 tccagaataa gaggctagca gtgcctctgc tcacaagaca tgcataaata tttgagaacc 24181 aaggagagtt atgtcccaat actagggtga ccatatatcc tagtttgctt gtgacagtct 24241 tgaccaatac tagttgtctc agcttaatta ttaatagcac ccctttccac tctcgcaagt 24301 gtaccagttt ggacaagaaa ttatatagtt accacagtta taattcttta aaaaattacc 24361 tgcctggtca gagtactgct gatatctaca gttagggttg ggtcaatcaa atagcacaag 24421 agagtaaaag tcaaaagggg gaaaagcaaa tcatgcatat cattttaata atatgcatta 24481 ttatatgcat agaataatat gcattattct aaatgaagaa gacttcaagc acttatttag 24541 aagaaaagag actattccag gaaaaataaa agctggtttg attgtgtctt aaccacttaa 24601 gacaaaccac ttaaaaatgg caactgaagc tttgcataac tggtacagta attacactat 24661 tttgcctgaa tacttgtcct tctttactga aagatctcag gtaattttta ccctaagtat 24721 ccctgaaact aagaaatttt ctgacttcca ttttgagttg taaagaatat taaaacttta 24781 cagaatatta aaaattacca ggatatttca gccacatcca gacaatttac tcctggcagg 24841 ttccccattc ttctcacccc tccgcacagt tcaagctatt tgattttgac agcattccac 24901 acccgtcctc attctcttcc cctacttccc cacccttgag ctcaactctg acccctgtgt 24961 ctaccatcaa ttctgacctt agacaatagt gatttaatga gttttcactc aatactatgg 25021 aaccaagctc aatttatccc tggctcaaat agcccagttc tatctagagc actttggtct 25081 tattccatag atcatcagtg tccatccaaa tttccaatta cagtaagtaa atccttgatg 25141 gtctcattcc ctgaacttaa attggtactt cagtccttgg ggattgggac cctcccatac 25201 acttgtcagg cccctcgcag ggaagagaat cctgcagcag atcataagtc ctgaaccaag 25261 ggtttgccaa cctgaccatg ctggttttca aatatcacaa gcttctgggt tctctgtacc 25321 tatgttttgc atgcctgttg aggaccatta tcgccacatg ggactttcga gatacacgac 25381 attttactgg aatatcttcc accccatata cctaccatgg agtgcccatg agcatggact 25441 ttctctctta atttacccag agtctctcat gataatgatc atttcaaaga cctcaggtca 25501 attacagtct tattccttcc cattctgcca gttctgcacc aagtggcctg ttcatacttg 25561 actctattct gaaatgtcac tgagaatttt tgtattgctg ttctcctttc acaactatat 25621 ctaatgaatt aatttcatta ttccagcagg ttctctgtat gaaaacaaca aagtttaaca 25681 aagtgtctgt gttgcaaaat ggtctcagta tgtgatagat tatgagaatg cctagacttt 25741 gtttatgagt ttgcaaattt tcaattcata tgggtcctta ggaaattact taaatctcaa 25801 tgtaaaataa aataacaaaa ttccatagaa aaagttgtat gtacattata attgcaattg 25861 tgtaaagaaa aaatcatggg tgtgagcaaa gtctaaaaag caatccacaa acagcaatat 25921 atttcataaa caagtgaata gttaagttag agtaatgtaa ttttggagtt gttttctttt 25981 ttaaaaaata tccatgttgt gagatctaat taaaataaag agcttatgca cagcaaaaga 26041 aactatcaac agagtaaaca gacaacttac agaatgggag aaaatttttg caaactatgc 26101 atctgagaaa ggtctaatat ctagcatcta caaggaactt aaacaaattt acaagaaaaa 26161 aaacccataa aaaagtgggc aaaggacatc aacagacact tttcaaaaga agacatacat 26221 gtggccaaca atcatatgaa aaaagctcaa catcactgat cattagagaa atgcaaatca 26281 aaaccacaat gagatacccc tcataccagt cagtggctat tattaaaaag tcaaaaaata 26341 actgatgctg gtgaggttgt ggagaaaaag aaatgtttac acactgttag tgggagtgta 26401 aattagttca actattgtgg aagacagtgt tcctcaaaga cttaaagaca gaaataccgc 26461 tgtaagaaac atagaaaacc ctctccctct ccctctccct ctccctctcc ctctcccgat 26521 gccgagccga agctggactg tactgctgcc atctcggctc actgcaacct ccctgcctga 26581 ttctcctgcc tcagcctgcc cagtgcctgc aattgcaggc acgcgacgcc acgcctcact 26641 ggttttcgta tttttttggt ggagacgggg tttcgctgtg ttggccgggc tggtctccag 26701 ctcctagccg cgagtgatcc gccagcctcg gcctcccggg gtgccgggat tgcagacgga 26761 gtctggttca ctcaatgctc aatggtgccc aggctggagt gcagtggcgt gatctcggct 26821 cgctacaacc tccacctccc agccgcctgc cttggcctcc caaagtgccg agattgcagc 26881 ctctgcccgg ccgccacccc gtctgggaag ggaggagcgt ctctgcctgg ccgcccatca 26941 cctgggacgt gaggagcccc tctgcctggc tgcccagtct ggaaagtgag gagcgtctct 27001 gcccggccgc catcccatct aggaagtgac gagcgcctct tcccggtcgc catcccatct 27061 aggaagtgag gagcgtctct gcccggccgc ccatcgtctg agatgtgggg agagcctctg 27121 ccccgccgcc ccgcctggga tgtgaggagc gcctctaccc ggccgcgacc ccgtctggga 27181 ggtgaggagc gtctctgccc ggccgccccg tctgagaagt gaggagaccc tccgcctggc 27241 aaccgcccat ctgagaagtg aggagcccct ccgcccggca gtcaccccgt ctgggaagtg 27301 aggagcgtct ccgcccagca gccaccccgt ccgggaggga ggtgggggtc agcccccgcc 27361 agaccagccg ccccgtccag gagggaggtg ggggggtcag ccccccgccc ggccagccgc 27421 cccatccggg aggtgagggg cgcctctgcc ctgccgcctc tactgggaag tgaggagccc 27481 ctctgcccgg ccagccgccc cgtcagggag ggaggtgggg gggtcagccc cctgcccggc 27541 cagccacccc gtctgggagg gaggtggggg ggtcagcccc cccgcccggc cagctgcccc 27601 ttctgggagg cgaggggcgc ctctgcccgg ccgcccctac tgggaagtga ggaacccctc 27661 tgcctggcca gccgccccgt ccgggaggga ggtggggggg tcagcccccc acccggccag 27721 ccaccccgtc cgggagggag gtggggggat cagccccctg cccggccagc cgctccgtcc 27781 gggagggagg tggggggggt cagcccgccg cccggccagc tgccccgtcc gggaggtgag 27841 gggtgcctct gcccggccgc ccctactggg aagtgaggag cccctctgcc cggccagccg 27901 cccgtccggg agggaggtgg gggggtcagc cccctgcccg gccagccgcc ccatccggga 27961 ggtgaggggc gcctctgccc agccgcccct actgggaagt gaggagcccc tctgcccggc 28021 caccaccccg tctgggaggt gtacccaaca gctcattgag aacgggccat gatgacaatg 28081 gcggttttgt ggaatagaaa gggggaaagg tggggaaaag attgaggaat cggatggttg 28141 ctgtgtctgt gtagaaagag gtagacatgg gagacttttc attttgttct gtactaagaa 28201 aaattcttat cctgttgatc tgtgacctta cccccaaccc tgtgctctct gaatcatgtg 28261 ctgtgtccac tcagggttaa atggattaag ggcggtgcaa gatgtgcttt gttaaacaga 28321 tgcttgaagg cagcatgctc attaagagtc atcaccactc cctaatctca agtacccagg 28381 gacacaaaca cggcggaagg ccgcagggtc ctctgcctag gaaaaccaga gacctttgtt 28441 cacttgttta tctgctgacc ttccctccac tattgtccta tgaccctgcc aaatccccct 28501 ctgcgagaaa cacccaagaa taatcaataa aaataaataa ataaataaat aaataaataa 28561 ataaataaaa agaaacattt aaaaaaaaaa agacagaaat accatttgac ccagcaatcc 28621 cattactggg aatataccca aagtaataca agtcattcta ttacaaagac acatgtgtgt 28681 gtatgtttat tgcaacacta ttcacaatag caaagacatg gaatcaacct aaatgctcat 28741 cagtgataga ctggataaag aaaatgtggt atatatacac catggaatac tatgcagcca 28801 taaaaaaaag atcatgtcct ttgcagggac atggatgaag ctggaggcca ttatccttag 28861 caaactaaca caggaacaga aaaccaaata ctgcatgttc tcgcttataa gttggagcta 28921 aatgatgaaa acacatggac acattgtggg gaacaacaca cactggggcc tatcagaggg 28981 tagagggtgg gaggagggag aggatcagga aaaataactt agggtaatag gcttaatcac 29041 ctgggtgatg aaataatctg cacaacaaac ccacatgaca cacatttacc tatataacaa 29101 acctgcacat gtacccctga acttaaaata aaaggaaaaa aaatccattt tgcattttga 29161 aaaggaaaag aaaattatga tatgtattta tcataaggta atactttgta agcacactgg 29221 gataacaatg aaaaatactg tagttatcaa tgcttccttt ttaatgagct cttccttaat 29281 caaatcttgg tttagctttg tattgatttt agaacatgct ccctttccta tgcacatgtg 29341 tgtttttcaa aggctgcaaa aggttaacaa ttgttatata tcattatata ctaacagttg 29401 gtctaatgac taaggattga atcagcaatc ttggtaactt cctcttagct ctatgaccct 29461 agcaaggcac ttaacctctg tttctctttc aaaaggaata acaacaattg ctgaattcgg 29521 aaaatatcta tagactatat ttatattaag acctttcagt gaaaatgaga aatgaatgta 29581 aactcataaa gacccagttc agataatgta cttctgtagt ctaacacatg aatttttttt 29641 cttttttttt ttttttgttt cttttgtttt ttgggttttt tttttttgag acagagtctc 29701 actttgtcac acaagctgaa gtgcagtggt gcaatctcgg ctcattccaa cctctgcctc 29761 ccgggttcaa gccattctcc tgagtcagcc tcctgagtag ctgggactac aagcgcccgc 29821 caccataccc ggctaatttt tttgtatttt tagtagagac agggtttcac cctgttagcc 29881 aggctggttt cgaactcaag tcctcaggtg atccacctgc ctcagcctct caaagtgctt 29941 ggattacagg catgaaccac cgtgcctggt ctaatacatg aaatttttat ttgataaaaa 30001 tgtgttctta gattgtaact gagattactg actatggaat ttaagttaga ttgttattat 30061 attaaaacac tttgggccgg gcgtggtggc tcacgcctgt aatcccagca gtttgggaag 30121 ctgaggcggg tggatcacct gagatcagga gttcgagacc agcctggcca acatggtgaa 30181 accctgactc taccaaatac aaaaaattag ccggccttgg tggcgcatgc ctgtaatccc 30241 agctactcgg gaggctgagg caggaaaatc gcttcaccct gcgaggcaga ggttgcagta 30301 agctgagatg gcgccatcgc actccatcca gcctgggcaa caagagcgaa actccatctc 30361 aaaataaata aataaataaa actttgctaa atatcagaac aacattttga caaattttca 30421 atgaaaggca tagtatgaca tgtagaaaac atataatttc catgggccaa agttagaatt 30481 taccattata gagaaaataa tttctttata gaaatgttaa agttcttcgg cctccttggg 30541 actaggtttc cgcggctgct gcgatgacca aaataaaggt agattcgatg ggcccgaggc 30601 tcaggaggag gcgtgctccg gggagcgcac ctaccaggag ctgctggtta accagaaccc 30661 catcgtgcag cccctggctt ctcgccgcct cacgcggaac ctctacaaat gcatcaagaa 30721 agccatgaag cagaagcagc ttcggcgcgg ggtgaaagag gttcagaaat ttgtcaacaa 30781 aggagaaaaa gggatcatgg ttttggcaga agacacactg cccattgagg tatactgcca 30841 tcttccagtc atgtgtgagg accgaaatct ggcctatgtc tctatccctc taagatgaac 30901 ctgggtgcag ccgcaggctc caagtgcccc acctgtgtga taatggtcaa gccccacaag 30961 gagtaccagg aggcctacga caagtgcctg gaggaggtgc agtccctacc cctaccccta 31021 tgaggggctc cagtagcacc tgggcacctg cctctggaag ctactgggct gacggcggga 31081 cgaccggctg tcctcttgcc cacccacact gacggcatct tcctagttcc ccaaggcacg 31141 ccttcttccc aggcagctct tacagccctt tcatgaaggt aatgcccgcc ttctctccat 31201 cagtgccatt tcccatagaa ctaaagggta ttccaagaat ggggggtggg gaaagtaaat 31261 gctaagacta aaaagaataa aaaataaaaa taaaataaaa taaaattaat gttaaagttc 31321 ttctaggcca gacatggtgg tatgtgcctg cattctcagc tactcaggat gctgaggcag 31381 gaagatcact tgagtccaac ctgagcaaca tagtgaaacc ccatctctaa aaaaataagt 31441 aaatcagcca ggcacagtgc ctcacacctg taatcccagc actttgggag gccagggtag 31501 aaggatagct tgaagccagg agtttgagac caacctgggc aacatggcaa gaccccattt 31561 ctaaaataaa ttaattaaat taaattaact tattctaaat agtttaacat attcaccagc 31621 tgtaatggta aattaataaa aatcctgctt tcactgggtg cggtggctca cgcctgtaat 31681 cgcagcactt tgggaagctg aggcgggtga atcacctgag gtcgggagtt cgagaccagc 31741 ctgaccaaca gggagaaacc ccgtctctac taaaaataca aaattagccg gggtggtggt 31801 gcatgcctgt aatcctagct acgcgggagg ctgaggcagg agaatcgctt gaacccggga 31861 ggtggaggtc gcggtgagcc gagatcacgc cattgcactc cagcctgggc aaaaagagtg 31921 aaactccgtc tcaaaaaaaa aatccaggct gggcacggtg gctcacacct gtaatcccag 31981 cactttggga ggccgaggcg ggcagatcat gaggtcagga gatcgagacc atcctggcta 32041 acacggtgaa accccgtctc tactaaaaat acaaaaaaaa aaaaaaaatc agctaggcat 32101 ggtggcacgt gcctttagtc ccagctactc gggagtctaa ggcaggagaa tcgcttgaac 32161 ccgggagacg gaggttgcag tgatccgcaa ttgcgccact gcactccagc ctgagcgaca 32221 gagtgagact ccgtgtcaaa aaaaaaaaaa atcctgcttt ctcggaagtt attgaagaaa 32281 tattgctgca aagattaatt accattctac tgtctgaaaa gtgtaccttc aagctgttaa 32341 attagtgaat ggagatctat tatgacttta acaagaatat tactcaccag ggggattact 32401 gctattaaca gtaggatgta tcatttaaga cattaaaatt gaatggaaat gtttttcaac 32461 catttaaata aaactccttg ttctttactt cacaatgaag ctttcttttg ctagcttctg 32521 tattgtagga gtttatgcaa gattttcctg gtagtaccat ttcagattac ttttacattt 32581 gtaaacccct atattcccat ttctgtttca tagagacaga actagagaga tactaaattg 32641 atataattta gcaatgtttg caatctcata ggtgagaact ctattctagc tacaaaatta 32701 ccaccattac tataagccaa ctacatatct cacggttaga caaactattt gcattcaaag 32761 acctcagatg ctaaatctgt tagctttctt cctatatgtg ctagattcta tgtagggaac 32821 ttaatcaaat ataaagtata tgggtcacta cagaaagagg aagagaatga tctctgtgtg 32881 tgtgtgtgtg tgtgtgtgtg tgtgtatcta tctatctatc tatatatatt tttttttttt 32941 ttttggacac agagtttcgt tttgttgccc aggctggagt acagtggtgc catcttggtt 33001 cactgcagcc tccacctcct gggttcaagc gattctcctg cctcagcctc ccaagtagct 33061 gagactacag gcatgcgcca ccacacccag ctaattttgt agttttagta gagacagggt 33121 ttcaccatgt tggtaaggct ggtctcgaac tcctgacctc aagtgatcca cctgcctcag 33181 cctcccaaag tgctgggatt acaggcatga gccaccgcac ctggccctaa aatattttaa 33241 aatgataaaa tactatctcc aaaattcaaa ttagtacaga tttaaattta ttttaattct 33301 taatgctggc catggtgcaa aaagttggac acttatattc acttcatttt gcaaagtaaa 33361 ttagtacttt ttggagagca atttgtttgt tttgagacag ggttttgctc tgtcaccaag 33421 gcaggagtgc agtggcatga tcacagctca cggcagtctc gacctccagg gtacaagtga 33481 tcctcctgct tcagcctccc aagtggctgg gactacaggg gtgtgccacc atgcctggct 33541 aatttttttc tttttttttt gagacagagt ctcacactgt cacccaggct ggagtgcaat 33601 ggcacaagct tggctcactg caacctccgc ctcccgggtt caagccattc tcctgcctca 33661 gcatcctgag tagctggggt tacaggcatg gatcaccatg cccagctaat tttgtatttt 33721 cagtagagat ggggtttcac catgttagcc aggctagtct tgaactcccg accccagatg 33781 atccacccac ctcagcctcc caaagtgctg ggatgacagg cgtgaatcac cgcagccggc 33841 caatatatat atattttttt cttaagtaga gatgaggtct tgctatgtta cccaggctgg 33901 tctcgaactc ctgagttcga gcaactcacc tccctcggcc tcccaaagtg ctaggattac 33961 agatgtgagc cactatgccc agccaaaagc aatatggtta atatgaatca agagccttgt 34021 gaacaatctc accctttgac ccagtactac cacttctggg agtctatcct tagaaagtga 34081 cgcaaaatgc tgactacccc caaaaatgtg ccaaagatgt taataatagc aaaatgccag 34141 aaatgatcta agtccaacaa aaggaatgtt aaagcaaatt atggtaatgc aaagaatgaa 34201 gtattttgca gctattaaaa attatgcatg cattggccag gtgtggtggg tcacgcctgt 34261 aatctcagca ctttggatgg ccgaggtggg tgggtcactt gaggccagga gttcgagaac 34321 agcctggcca acatggtaag actccagcct gggcaacaga gcaagactcc gtttcaaaaa 34381 aaaaaaaaaa ttatgcatgc atttatatat atatatatat ataaaacata tatttgtgta 34441 tatatatata aatgcatgca taattatata tatatatgtt atagatatat aattttttaa 34501 gagacagggt ctcgctctgt cattcatgct ggagtgcagt gacatgatca cagctcactg 34561 cagcctgtaa ctcctgggct caaaggatcc tcctgcctca gcctcccaag tagctaggtc 34621 tatagctaca tgccaccatg cagggcaaat ttttttgttt ttaatttttt tgtagagaca 34681 aggtcttgct atgttgccca ggctggtctt gaactcctgg attcaagcaa tccttccacc 34741 tctgcctccc aaagtactgg gattataggc acaagccacc atacctggct tgtacaaata 34801 ttttttaatg acaagccaaa atgcttctaa tgaaatgtta agtgaaaaag gcaagataca 34861 aaattatatg tgctctatga tgtaaacagg aatggagggc aggaaattgg atttaaaaat 34921 aaaaagacca aatagaaatg atgacaacaa tagtagcttc tagtttgagg agaccgtggg 34981 tgaatttttt ctagttcttc acacttactt ttcagtactt tccaaacatt ttccaaacat 35041 tttatattta catttaatga gaatgtagta tttttataat gacagaaaat gggttgggaa 35101 ctgaaagcaa ataaatttta ctttttttag aaaatggaag gcatgaaggt attcagatat 35161 aagcaaattt tttaaaaagc aaatacctaa aagtgaacgt agattgatct tatttttgtt 35221 tttgtttttt tgagacaggg tctcactctg tcactcaggc tgaagtgcag tggcacaatc 35281 tttgctcatt gcaactgcca cctcccaggc tcaagcaatc ctcccacctc agcctcttga 35341 gtagctgggg ctacagacac cctgccatca cgcctggcta attttttgta ttttttgtag 35401 agatgagttt caccatattg cacagactgt tttgttttgt tttttaagta tgggcacagt 35461 gactggtgcc tataatccca gctactcagg aggctgagtt gggatgatcc tttgagccct 35521 ggagttcaag ttgcagttgt cataggactt tctccttagg gttcttgtca cacaaccagg 35581 aaagattagg ttctcagaca ctttgaaggg tgagaaaggc aggatttatt gggtgaaaag 35641 gaaaaaaaaa ggaaacaggg actctcagaa aagcaagagt cctgctggct ggcttcctgc 35701 ctcacagact gaatcccagg ttaccacaca ggaatagcag aggccagact cctcccccat 35761 gcaaagggca cgaacatccc agggctctac cccattctct gggtgcacag gccagtcaaa 35821 agttctccgt ggaccacttt atacttggct gtctcacagt gagctatgat catgtcactg 35881 cactctggcc tgagcggcaa agcaagaccc tgtctccaaa caaacaaaca aaaatattct 35941 cagtaagctg gtactactgt tacaattttt ttttaattgg actgagacct aggtccattt 36001 gcaatttctt gcttttaatt gctatagctt tcaataaaag aataatgagc taagtcactt 36061 gtgtccaatg taacaatcaa tagaaacatg cagcctgtaa atcaaagcac aaccgggaaa 36121 catttaaaca catggatcat ttccctaaga agttaccttt ttaaatcaac tttttaatgg 36181 caaagtgttg gaaaaagaaa agattattgt acttagggga aagaatggaa tagatgaaaa 36241 atatcaaaac tgcttacaaa tttaccagtc agttcacatt caagtacaag aataaaaatt 36301 taattggaaa atctctgtac ttactaaaaa ttatctatat aaaatgagct aatagcagaa 36361 acacaaccat ttgtaatcaa aattcgtatt tccattgaat gtggctgttt tatttacagc 36421 aacacagaga cagatataga gaagaatcca atcttcatgc tttggtaatg gtatctacct 36481 ggtcaaagtg ctgggaatca gtgtccaagc tctctggata tacttacaat gaacattcat 36541 tcatttattc actgaccatt tctataaacc tttattgaat agctactgct acctatccaa 36601 tatcatgtga gtctaaaaca actagaatat cagtttcaaa atattttgtg ggcactacct 36661 atgtatttag cactctgcta aatatggtag caattactat agaggaatgt ggagaggcaa 36721 ctatcacttc tatacaatca gcatttaaaa tagaatttta gaactgaaga aatattaaag 36781 cagtgagtcc ttactttttt gagttaacac tgtttaagtg gaatgataat tctgttagat 36841 ggtatctgtt ttcgttagtg gccttactta ggaaaataaa atatttgcaa acatatatga 36901 gtttcctaac tttattttac aatattacaa gtacatgaaa tccaaaagtt tagaaactat 36961 tgtcctaaag gccatagaga gaaaagtata catttaattc tataggcagt tgaaatggtc 37021 tctgagtatg gaataagact ggagtagtac aatgtaataa tagattagat ggtcttattt 37081 gttgacatct tcttaattga ctttagaaaa ggtaatttag taaaatattt ttttaaaagt 37141 tttaatataa aaagaagaaa cttggctggg cgtggtgtct catgcctgta atcccagcac 37201 tttgggaagc tgagatgagc ggatcaattg agctcaggag ttcaagacca acctgggcaa 37261 catggcaaaa acaaaaaaat tagcctgatg tggtggtgca agcctatagt cccagctact 37321 caggaggctg aggtgggaga acccattgag cccaggaggt gtggtcgcac cactgctctc 37381 cagcctgggt gacagaacga gaccctgtct tggaaaaagc aaaaaataaa aataagtagg 37441 ataaactatt gcataggaga aactattgaa tagagtgatt taatagagaa aatatgaaat 37501 acaaacagaa tgagcaggat ataagtttaa atggtccaaa ctattcccaa acacatacac 37561 atacctatat aacacagaga ttgtaaaact aataaggtta gtgtgtttgt actgaactga 37621 gatatcagaa tcaactcaat acattaagta tattctgtta aattctattg tcaaaatcta 37681 aggttaaaaa gtcattctct tttttttttt tttttttttt ttgagacagt ctcactctgt 37741 cctccaggct ggagtacagt ggtgctatct cgacttactg caacctctgc tttctgggtt 37801 caagtaattc ttgtgcctca gcctcctgca acagctggga ctacaggcac cctccaccaa 37861 gccgggctaa tttttgtatt tttagtaaag atggggtttc accatgttgg ccaggctggt 37921 ctcgaactcc tggactcaag catctgcctc agcctcacaa agtgctggga ttacagccat 37981 gagccaccgc acccagccta aagagtcatt cttgaagtgt tgcctctaaa ttttcattaa 38041 tatagttatt taatatttta ttgatttatt tttagagaca ggatctcact ctgtcaccca 38101 ggctggaatg cagtggtgtg atcatagctc actgcagcct ggaactcctg ggtttaagag 38161 atcttcttgt cccagtctcc acagtagctg agactacagg tacacaccac catgcccggc 38221 taattttttt tttttgaaag acagggtctc actttgctgt ccaggctggt cttgagctcc 38281 tggcatcaaa caatcctccc accccgggcc tcccaaaaag tgttgggatt ataggcatga 38341 gccaccacaa ccatccccac ctctgtgatt atcttttgaa tacttttttt tttgagacgg 38401 agtttcactc ttgttgccca ggctagagtg caatggcgca atctcggctc accgcaacct 38461 ctgcctcccg ggttcaagcg attctcctaa ctcagcctcc tgagtagctg ggattacagg 38521 gatgcaccac cacgcccggc taattttgta tttttagtag aggcggggtt tctccaagtg 38581 ggtcaggctg gtctcaaact cctgacctca ggtgatccac ccgcctcagc ctcccaaagt 38641 gctgggatta caggcatgag ccaccgcgcc tggacttgaa tacattttta taaacattaa 38701 cttgtagacg ttgcatgtgc tgtctggttc cagccttgat cttcaaataa acttttcagg 38761 gtgctgcatt tgtggttcct tgagtgaacc tgtctggaaa ctaccatctg cccctggaat 38821 caagcttttg tcctcagcag ttagcaactt cgatggcatt accatcccca tgcagcttct 38881 taaacatacg gaggcctgac cccagcccga ctcaattaaa atcagaaatt ttggtgatgg 38941 cacccagaaa tgtatagttc ctaactttct ctactttgca gccaaaactg agaactactg 39001 ataataggta gaaatcagtt atttccaaat ttatcacatt gacttgacat tatttctctg 39061 tattatttct ctgtatttct ggtaaaccat agcaggtacc caataatttc attaaaaata 39121 aaaacccagg ccaagcaagt tggctcacac ctgtaatccc agcactttgg gaggattgct 39181 tgagcctggg aagtcgaggg tgcaatgagc tgtaatcgcg ccaccgcact tcagcctggg 39241 tgacagagca agatcctgtc tcaaaaaaat aaaacaataa aagccgggtg aggtggctca 39301 cgcctgtaat cccagcactt tgggaggctg aggcaggcgg atcacctgat gtcaggagtt 39361 caagaccagc cgggccaaca tgacaaaact ccatctctac taaaaataca aaaattagcc 39421 aggcatggta gcaggagcct gtaatcccag ctacttggga ggctgaggca gaagaatcac 39481 ttgaacctgg gaagcagagg atgcagtgag ctgagatcgc gccattgcac tccagcctgg 39541 gcaacaagag tgaatctcca actcaaaaaa cataaataaa taaataaaat aacaaaataa 39601 aataaaaatc caaactccat accatggcct aatggcccac tgtgatcttt ccccagcctg 39661 tccaccttat acctccccac tgccccctga gtgtcttgca aacatgccaa atttattcct 39721 acctctaggc ctttgcattt atcatcccct ctacaagaaa cacttctccc agtcctttat 39781 ttctctggct cttttttttt tttttttttg agacagagtc tggctctgtc acccaggctg 39841 gggtgcagtg gcatgatctc agctcactgc aacctccacc tcccgggttc aagctatttt 39901 cctgcctcag catcctgagt agctgggatt acaggtgcct gccaccatgc ccagctaatt 39961 tttgtatttt agtagagaca gggtttcact gtgttggcca ggctggtctt gaactcctga 40021 cctcatgatc cgcctgcctc ggcctcccaa agtgctggga ttacaagcat gagccactgt 40081 gcccgccctc tctggctcat tttcatcaca gaagtcttta ctctgtactt agaaaagtcc 40141 tttcctgacc agcctatgta aaaataacca ctccctttat ggtctctctc acctcctact 40201 gtcttatttc cttcatggaa cttacctcta ccagatatta actttcttat ttattatcta 40261 cttatttatt gtctgttccc aaccacctcc acactagatt gcaaattcta agagagtcag 40321 aacggagaat cattgataaa atgtggaaat cacttttttt ttttttctga gacagggctt 40381 gctctgtcac acaggcccga gtacagtggt gcaatcatgg ctcactgcag ccttgatctc 40441 caggtctcaa gtaatcctct cacctcagcc tcccaggtag ctgagactac aggcacgtgc 40501 caccatgtcc agttaatttt taaatttttt gtatcaatgt tgcccaggct ggtcttgaac 40561 tcctgggctc cagcaatcct ctcatctcag cctcccaagt agctgggacc acaggcatgc 40621 accaccacac ctggctaatt tttgttgttg ttgttgttat tgttgttgtt ttggtagaga 40681 tgagtttttg ccatgttgcc caagatgttc tcaaactcct gagctcaagt gatcctccct 40741 cctggccctt ccaaagtgct aggattgcag atgtgaatga ccacacaggg cctcagtaat 40801 tatttctgaa ttaatgtgaa taagcattta aatgaatgaa tgatagacta ctgaaaagtt 40861 caagaacacc ctgacctttt agtaacatat gtatgatatt actctctgat caccctattt 40921 ctctttgaat atgtgccaat aaatgttctg tgccatttgt taaagtcaat gtcgatttta 40981 catcatctac caagatcaac ttatgtttcc caaagtcaac aatttctttt ttttgagatg 41041 gagtctcatt ctgtcaccca ggctggagtg cagtggcgtg ttctcggctc actgcaagct 41101 tcgcctcctg ggttcacgtc attctcctgt ctcagcctcc cgagtaggtg ggactacagc 41161 tgcctgccac catgcccagc taatatatat atatatattt gtatttttag tagagacagg 41221 gtttccctgt gttagccggg atggtctcga tctcctgacc ttatgatcca cctgccttgg 41281 cctcccaaag tgctgggatt acaagtgtga accactgtgc ccagccaagg tcaacaattt 41341 ctaaaggtac taaagaacaa gtctcaataa ttttcagcaa gatatgaaaa tcacatgcta 41401 tcctgctgaa aattattggg acttggatgt tgataaagta tgaagattat tcattctgct 41461 gtattatatt taataaagtt actgacatct ttttagttta acctttattt cattagatta 41521 gaaatagaaa taatctttat ttaaattcat atattaaata tatgtttata tattaaaacg 41581 agttaaatta gtatatcata ttttcatttc ctttcaatat atgctagaaa gtttcaattt 41641 ttttatctta aaaggtcaac agtccttaca gaaggatcaa tttctaattc tatatataat 41701 gaaatttagt gtggaaaaga cttaaaatac agaaattcac tttatagaga atatttaaag 41761 aatccatcta ttccgtaaag taagcttcag ctaacggctc atgtaagaaa caggataaga 41821 aaagtatcaa acaagatgga gtaaatgtat tatcatgtgg tatctattac tgcctactac 41881 tcataccata acatttaact gacttttttg agtggcttgc attatattta gatactcttc 41941 gatcacacac atcaaccctg ataaaatgat tatagagaac tttgcaaatc atgtaataga 42001 attcaaatct taagagcaat gcaatgccat tgaacaagca ttctaaaacg agaaactatt 42061 agatttggta ttttagaaat ttcactctgc aggatggaga atggagtgaa agatatcagg 42121 gctcaagaca gggagtcgca aatgagagct agctacccac ctaaagtaga atagtggcaa 42181 tgaacgtgag aaaaagcaag cagattctag aaagacttaa gaggtaaaac agaaattaat 42241 tagcactgtt tattatatct tgtccccttc tacttatact ttttgtgcat tttacatata 42301 gggaggggat tcaaagttta caaactaaag ctagttcaca ttaaatactg gtggaaataa 42361 tatcttgttt ctattcattt caaaatctta ggggatttat ggaagcaaag ctttcaacag 42421 attcacagaa tactgattag atgtagcatc aaactattta ccatgtatgc caagaacaat 42481 caccgaggaa aggacagtgt cttcaataaa tggtgctggg aaaactggat gttcacacgc 42541 aaaagaatga aactagactt tcacgtctca ccttatacaa aactcaactc aaaatggctt 42601 aaagacccaa agcataaaat tgctagaaga aaacataagg gacacacttc aggacattgg 42661 tctgggaaaa gattttatga atatgacttt aaaaacacag gcaacaaaag caaaaataaa 42721 cagatgagat tatatcaaac taaaaagctc ctgtacagca aaagaaacaa ccaagagagt 42781 gaaaagacaa cctacagaat gggggaaaat atttgccaac tattcatcct gcagggagat 42841 taatagccag aatatgcagg aaaatcaaac atctcaacag aaaaacaaaa taaaacaaag 42901 aatccaattt aaagatgggc aaacaatctg aatagacatt tctcaaaaga agatatacaa 42961 atggttaaca aatacatgaa aaaaatgccc aacgtcacta atcatcaatg aaatgaagat 43021 caaaactaca atgaaatatt atctcacccc agttaggatg gctgttatca aaaagacaaa 43081 aaataacaaa tgcaggtgag aatgcagaga aaaaggaact catacattgt tggtggaaat 43141 gtaaactagt acagctacta tggagaacag tatggaggtt cctcaaaaaa ttaccagtag 43201 aacgaccaag tgatccagca attccactac tgggcgttta tccaaaggaa aggaaatcag 43261 tatattgaag agatatctac actccatgtt tactgcagca ctattcaaaa ttgccaagat 43321 atagaatcaa cctaaatgtc caacaacaga tgaatggata aggaaaatat ggaatcccgt 43381 cattcacagc aacatgaatg gaactgaagg aaaattacgt taagtgaaat cagccagaaa 43441 cagaaagtta aacaccacat tttctcactc atatgtagaa aagctttaaa aaaaaaaaaa 43501 aaagatctcc tagaaataat aagtagaaca gaggttactg gagactggaa agagcatggg 43561 gaagaaaagg ataaggagag actccttaaa ggctacaaaa ttacagctag ataggaggat 43621 aagttctagt gttctatagc accttaagat gactataatt cgcaatcgta tgtattttca 43681 aatagctaga agagaagatg tgaaatgtcc caacaagaag aaataataaa tgtttgatat 43741 gatggaatgc taattgccct gatctgatca ctatacattg tatgtactga aacatcacat 43801 acaccccata aatatgtaca attaccatgt gtaaatttaa ataaataaaa gaatattcac 43861 catgtaattt ctttttatta ctttttttgt ctttttgttt ctttgttggt tttttttttt 43921 tttttttggt tttgtttact tttgtttgag atggagtctc ctctatcgcc caggctggag 43981 tgcagcagca tgatcttggc tcactgcaga actccacctc ccaggttcaa gtagttctcc 44041 tgcctcagcc tcctgagtag ctggggttac aagcgtgcac caccatgcct ggctaatttt 44101 tatattttta ctagagatgg ggtttcacca gtttggtcag gctggtctcg aactcctgac 44161 ctcaagtgat ccacccacct cagcctccca aagtgctggt aggcgtgagc cactaagccc 44221 agcccatttt ttaacattta gggacagggt ctcgctctgt cacccagtgc agtggcataa 44281 tcatagcaca ctgcagcctc aaactcaagg gatgctccca cctcagcctc ccaagtggct 44341 ctgactacag gcatgtgcta ccatgcctgg ctattgtcat gtcatttctt ttgtcccttt 44401 tacaactagt aagacataac ttagagataa cttatacatg atacatatgt attatataag 44461 acataagttg acagccaggt gcagtagctc acacctgtaa tcccagcact ttgggaggct 44521 gaggcaggaa gactgctgga gcccaggagt ttgagaccag cctgggcaac atggcaaaat 44581 cctgtctcta caaagaaaaa tacaaaaaat actagccagg agtggcacac gtttatagcc 44641 ccagctactc aggaggctga ggtgggagga tcacttgagc ctaggaggta gaggctgaag 44701 taaccatgat catgccacta tactccagcc tgggtgacag aatgagaccc tgtctcaaaa 44761 aaaaaaaaaa aaaaaaaaaa aaaaaggaga agaagaagac ataggcaagc tgacaaacaa 44821 caatatcact tgttcaggca ccagcctccc tatggctctt ccctaaaaaa tgcaatgcag 44881 gccaggtgtg gtggctcaca cctgtaatcc cagcactttg ggaggctgag gcaggcagat 44941 cacttgaggt caggggttcg agaccagctt ggccaacatg gtgaaacctt gtctctacta 45001 aaaatacaaa aattacctgg gcatggtggc acacgcctgt agtcccagct actcaggagg 45061 ctgaggtagg agaatcgctt gaacctggga ggcagaggtt tcagtgagct ggaatcgagc 45121 cactgcactc cagcctgcgt gacagagcga gactctgtct ttaaaaacaa caacaacaac 45181 aacaaaaaaa aaacaatgca atgcttcata ataattgcaa ggcctttatg ttctttcctg 45241 catatttctg catatccaaa tccagtatat attcctcagc ttatttcaac cccatgcagc 45301 ctcccagatc ccagaatctg ggagtaacaa attttctcct tttcagaatt cccattttcc 45361 tcttatggca cttcatattc tacttagtgt aagggcttat ctccagtaga tggtaagcat 45421 cttgtaacca gggttcttgt catttccact ttaagttacc cacagaactt ggcagcactt 45481 ggcatgtaac aaaagctcca aaaatattta tcaaatgaat acatcaatag ctatttccag 45541 ctgctctttc caccctgcct tgtttcaaca tcagtggaca tgctgatcac gatagacatg 45601 ctgctagaga tcaaaacctc aatgggacct tttgaattat aggctgcggg ggagtggaag 45661 gctcactgac aacttggtag aaactatgcc agatatttta cttaaaaaga tactgttggg 45721 caggtgcagc ggctcccgcc tgtaatccca gcactttggg aggccaaggc aggcggataa 45781 cctgaagtca ggagttcgag gccagcctgg ccaacgtagt gaaatctcat ctctactaaa 45841 aatacaaaaa atttgtcggg catggtggca gccacctgta atcccagcta ctcgggaggc 45901 tgaggcagga gaatcacttg aaccctggaa gtggaagttg cagtgagccg aaatcgcacc 45961 attgcactcc agcttgggca acaagagtga aactccttct aaaaaaaaaa aaaagatact 46021 gttaagccac aggtctaaac tttgatttat aatctgtcta acaaccgtcc taattgaaat 46081 ttttagtgta aaaattccaa tcaccagttt atcttggaac ctggttatcc cagcatatca 46141 atccgtacat aagaaattga tgcaggctgg gctgggcatg gtggctcatg cctataatcc 46201 cagcactttg ggaggctgag gcgggcggat cacgagctca ggagttcgag accagcctga 46261 ccaacatagt gaaaccccgt ctccactaaa agtacaaaaa ttagcccagc gtggtggcac 46321 ggacctgtaa tcccagctac ttgggaggct gagacaggag actcgcttga acccgggagg 46381 cggaggttgc ggtgagccaa gatcgcacca ttgcactaca gcctggggga cagagtgaga 46441 ctccgtttca aaaaaaaaaa aaaaagaaag aaagaaagaa agaaattagc tcaagctaac 46501 cagactgact gggttgcctg tattgaaact caagatagca ttttaaaaca ttaaaataaa 46561 ttttgagctt cacatgcaat ttataagtag ttttctaagg aattccgttt ccagaaaaca 46621 atcatgttga aatcccaaga gagaaaacat taatttcaaa aaaaaaaaag acaaccgtta 46681 aagacaatca gatccacaaa actggcagct gcagcagttt tcaaaatgtc taaactgctt 46741 caacaggaag ccagccaaag ttctgtccat tcaggcatag aacaaactta cccttttaat 46801 tcctttctta ctgtgtggaa tactgtaata ctgctgtatt gtgaatgctc tctaggcctg 46861 atactgggtc aaaggtgtat ttggcttcta gggttttagg aaaaatgagt ttaagcttct 46921 gataccccag catatttgga tttgtagctg aatcctttct agctgtatta aataactaac 46981 aaaacaaggc cattttttct ttaaaaaaaa agttgatact gtggtatgta ggggtaaaat 47041 ggcatggtat ctgggacttg ctttaaaata cttcagcaaa gggggaaaag gagatgacag 47101 atgaagaaaa tgacaaagtc tagacagtaa tttcatttgt gtggtgagta gatggcagtt 47161 cattgtacta cttctctaac atttgtgtat gtttggattt tttctttttg agtgttactt 47221 tattgcaata acaattttaa attaaaaaca gaaatctgga caaacagtgc ctagagtcct 47281 gaatcattct tgagaactgc agggtgtatt ccatcttagt gtatcttagt ttaattctag 47341 actcatgtcc ttaaagcttg catgttggct ctcagctttg ctgtttcctt taaaaaaaaa 47401 aaagggttca ctgggtgtcg tggctcacgc ctgtaatccc agcactttgg gagactgagg 47461 caggcggatc acctgaggtc agggttcgaa accagcctgc cctacatggt gaaaccctgt 47521 atctattaaa aatacaaaaa ttagccaggc gtggtggcgt gcacctgtaa tcccagctta 47581 ctcaggaggc tgaggcagga gaatcacttg aacctaggag gtggaggttg cagtgagccg 47641 agatcacgcc actgcactcc agcctgggca acagagcaag actccatctc agggaagaaa 47701 aaaaaggtgt cttttttagc aaggcatgat ggcacatgcc tatagtccca gctactaggg 47761 aggctgaggt gcaaggatct cctgacccag gacagaggct acagtgagct atgattgtgc 47821 cactgcactc tagcctgggt gacagagcaa gattctatct ctaataataa taataataat 47881 aaatttttaa aatgttttct ccgaaagtct aactcaacca agtgagatgt gcctgtggtc 47941 cagggcacca aaacacagtc agaaatttca ggatggttca tgacttcata tctcatttct 48001 aagtttctgc ggacacagta tcttcttggt agaggtagct tgctttctat ttgtggtgga 48061 catgtgttgc ttttgctgtc caggatttac actgcttccc gtaatttaaa aaaatcccta 48121 gatttccatt tggggattca ccctagcccc attctcagtc catgatgagg ccgattctac 48181 ctttggatgt gggggtggag tttaagatca agagtaagtc agagcatccc attcccccag 48241 acactgactt tttcatagat gggcacctga cctaagtcag gccaatcaga attaagggag 48301 gttcaggtct tagacttaaa cctatgtttg tgctgccagg aagtgaactc tctttctcat 48361 gtgatttggt attgggttaa tgtgttcttg tagggcttga agccaccatg gatggcctgc 48421 caattaatta aatacagcag aaagtagcaa tatagccaat ggatgtgaaa agacgatagg 48481 ttctaaagac aacatgtgag ctctgaccag ctgcccttga agactgttcc tgtcctttct 48541 gttatgtgag ccaataaatt acctgagtgc tggctaatcc atagtttaat cacttcctct 48601 tataattaag ccacactgta tagtcagata gcttgataat atagcacttt ctaaggtttg 48661 catgccaaaa caagatgcat aaacttaaag cataattctt acttatcaaa atacatgaat 48721 gagttctgga actgtggcgt catggtcatt aaattctgtg attaattcaa ccaatattgc 48781 atgcctattg tgtgactgtg ctaagtgctg attctagcaa gataagggca gctcagcaac 48841 ggaagataaa tgcagtccct aaatgtttta atttgctcaa ctgacatatc tagttatttt 48901 tgaaattttt gtaatgaaac tttttttttt tgagacggag tctcgctctg ccgcccaggc 48961 tagagtgcag tggcgtaatc taggctcact gcaagctctg cctcccaggt tcacgccatt 49021 ctcctgcctc agcctcccaa gtagctggga ctaaaggcac ccgccaccac gcctggctaa 49081 ctttttgtat ttttagtaga gacggggttt caccgtgtta gccaggatgg tctcgatctc 49141 ctgaccttgt gatccacctt ccttggcctc tgaaaatgct gggattacag gtgtgagctg 49201 ccgcgtccag cctataatga aacattttaa gagcatttca gtaacataac tgttactagt 49261 cagacatcct cctagatatt tgggggtcca ggtgacatac tcagtctagg tttttaagaa 49321 caggggtggg aatttgagta agaaactcta cggctgctaa gtctagaaac tatgaagatc 49381 ccaaagatag cttctggaaa gaaagctgag aacaagcaac acaatcccac caaagcaata 49441 atatgagggg actatacata gggtaagaga gaaataaatg ggagtcagtg gagaaattgg 49501 gagcaactgg ttggaagaga agatggaaga agacggagaa cctctgagac cctggttatc 49561 acatcttcaa tcaatacttg ggaaagggct atggaggtgc taagctgttt cagataatag 49621 tagaaaaata ttggagagcc atttttaaac ttttactact ttagtgaaaa aatgtactca 49681 tctaaatgaa aatcaaaggt cataggaaga ggaagagaac ccacagattt cctttttttt 49741 ttttttgaag acagagtttc actctggttg cccaggctag agtgcaatgg tgcaatcttg 49801 gctcatggca acctccgcct cccaggttca agcgattctc ctgcctcagc ctcccaagta 49861 gctgggatta taggcatacg ctaccatgcc tggctaattt tgtattttta gtagagacgg 49921 ggtctctcca tgttggtcag gctggtctca aactcccgac ctcaggtgat ctgcccgctt 49981 cagcctccca aagtgctggg attacaggcg tgagccactg cactggctac agattttact 50041 gttacatact tcttctttcg atggaaatga caaattagcc acaccattcc aacaaccttc 50101 caaaatgcct ttcagtactg cagaagctgg aacgctaaat tcatattttc cctatactcc 50161 cccgcagcaa gagttctaga attaggacct accaataaga tgcaatcacg tgagatctga 50221 aaggtggaag agacatgatg gcaatcaccc tgctgccctc tctgtatggt tagggaggag 50281 agatgctggg tttttctgca gcagagttcc agcatactgt cactagcttt gtgaagctac 50341 tgtggtgagt tcttatattc tacagttcca tggcagcctc ctgattcctc accttcatga 50401 tcaagtagca gctaaaatgt caggccagtt tatcagtatt attctgggag tcactcctgc 50461 aagcccaacc ttgagctcac tactccaatt ttatgaggac ctaattccct atatttaaat 50521 ctctttcttc atagaagatc tagaatgatt tctgtttcct gaaactaaat gctgaagcat 50581 tctactttct aaagcattat agatatgtga tgataaacaa gtcaaaccaa agatgcatct 50641 catttacagg gaggtgtcag ttggacacaa aaaaaaatta cttggaaaaa gcattggggt 50701 atatgcaagt gagcaaagcg gcagagcata taaatctctg aagcagccaa tagtaagact 50761 gtgtactgtc ctcaggtaga caggtttcta tggggcaaat ttccataatg ccaagccatt 50821 tgtttagcag tgttacccaa gttaatgagg tcactggtac tcacttgttg tcatttcccc 50881 attgtaggca cttaggccca atgtgaaaca ttaatgaatt gaataggtca ccaaatacag 50941 tggtctggtc aagatgtagc tgtttgtaaa aggttatgaa tcagattaaa atcctcattt 51001 gtcacagagc aagggaggca atgctcttcc cttagccaca gatccctcca cacttttcag 51061 agaagaacta ttatagaact gttataataa ttacaatatt attagaaagc attaccaatt 51121 agaatacgca gatcactacc tgtttctctc ataaaaagca cagcaaggaa taaaaaacca 51181 caaaatgcta tgctgggaaa aggatgggct tgaaggtaaa tatgaaagtt gaacttagca 51241 ttctctaatg accctggaag aaaggtgatt cagctaatac tggtatttgt taggatttaa 51301 ttaaaaacct ctaatcactt ttcagtgcta caagtatgag gtaagggctt tttcttcccc 51361 tagtggagtc aggctgggag ctccagctct atcctgtgtt tagcactaat ttagggtcat 51421 atacccccaa aagacctggg agtccagttc ccatggaact gcccatggaa ctcccaaagt 51481 gactccagca aattcccaat gcaggatcta cctgcaagct gcagtagcct ttgaaaaact 51541 gaaagagaaa cttgaggagc agagggttat tagttttttt ttctttcttt tttctttttt 51601 aagatagggt tctcttctgt catgcaggct ggagtgcagt ggcataatca tagctcactg 51661 cagccttgat ctcctgggct caagtgattc tccagcctca gtctcccaag tagctgggac 51721 cacaggcgca caccaccatg cctggctaat atttaaattt tttgtagaga tgggatctca 51781 ctatgtttcc cagggttgaa tgcttttttt tttttttttt tttttgagac agagtctcgc 51841 tctatcgccc aggctggaat gcagtggcgc catctcggct cgctgcaaac tccgcctccc 51901 gggttcatgc cattctcctg ccacagcttc ccaagtagct gggactacag gcacctgcca 51961 ccacgcctgg ctaacttttt tttttctttt gtatttttag tagagacggg gtttcactgc 52021 gttagccagg atagtctcga tctcctgacc tcgtgatccg cccgcctcgg cctccctaag 52081 tgctgggatt acaagcgtga gccaccacgc caggccaggg ttgaataatt tttaaatgca 52141 ttttacactt ggagaagatt gtaccaaaat catccatttg cattttcagt gaatgacata 52201 taaacatttt agtcattata tataagctga aagtataaaa agattgttta tatatttttt 52261 tagtatatct acttttcatt tgtgagaacc ctagaggaaa aagtgttaca aaaaagaaaa 52321 caccccccaa aatctctact ttaacgaata aaagctaatg aagtaggctg cattcctatt 52381 acttagttcc tctaaatgtc tattccctta ttaataactc tgctttatta aaataactgt 52441 ctcggccggg catggtggct gacacctgta atcccagcac tttgggaggc tgaggtgggt 52501 ggatcacctg aggtcgggag ttcaagagca gcctggctaa catggtgaaa ccctgccact 52561 actaaaaaca caaaaattag ctgggcatgg tggtgggcac ctgtaattcc agctactcgg 52621 gaggctgagg caggagaatt gcttgaaccc aggaggtgga ggttgcagtg agccaagatt 52681 gcgccactgc actccagcct aggcaacaga ccgagactcc gtctaaaaaa aaaaaaaaac 52741 aaaaaaaaac ttttttttga gggggtatat gaccctaaat tagtgctaaa tatatatcta 52801 aatatatata tatgtatgtc ttttgggggt atatgaccct aaattcgtgc tatcgtgcta 52861 aatatatatc taaatatata tatatatgtt tgtgcgcttc cgctactaaa ctatcatcat 52921 taatccagtt ggaaagatgt gagaaagttt aaatgtaagc acatcttatg gttaaagagc 52981 taccaaaaca aagcaaaggt atattttatc ataaaatgcc acatgtgact ttaagcactt 53041 tagagtatat cattccactg acataaaatt aattattgat atgtatgtgt aggtgtacaa 53101 aatgtattta caggccaagc atggtggctc acacctgtaa tcccagcact ttgggaggcc 53161 aaggcaggtg gatcacttga ggtcagaagg tcgagactag cctgaccaac atggtgaaac 53221 cccatctcta ctaaaaatat gaaaattagc caggcatggt ggtgcacgtc tgtaatccca 53281 gctactctgg aggctgaggc acaagaatca cttgaacttg ggaggtggag gttgcagtga 53341 gccgagattg cagagtacga ctctttctca aaaagaaaga aagaaaagaa aacaaaagaa 53401 agaaaggaag aaagaagaaa gaaaagaaag atatatatat atacgtatat acatacacat 53461 atatatacat atatatgtgt atatatatgt atatatatat gtgtatatat atacgtatat 53521 atatatatgt atatatatat atacgtatat atatatatat ttacatatat tggctgggtg 53581 cggtggctca cgcctgtaat cccagcactt tgggaggcag aggtgggcag atcacaaggt 53641 caggagttcg agaccagcct gaccaacatg gtgaaacccc atctgtacta aaaatacaaa 53701 aattatctgg gcgtggtggt acgtgcctgt aatcccagct actcagaagg ctgaggcagg 53761 agaatcgctt gaacctggga ggcagaggtt gcagtgagcc gagattgcac cactgcactc 53821 cagcctgggc gacagaccga gactccatct caaataaata tatatatata tatttacata 53881 tagtaccaat atttgaaaac atggttaaat tagaaaattt caaataatac tttaaaactt 53941 attaattgtt cttttataat gttagcttta tgaagcaaga agcaaataaa atattaacca 54001 ctcacaactc tgcttatttc aaaatatata ctaaaagagc atggcgtttg tatcgctgtt 54061 tatcacttat gtgtctatca gactttgcaa gtgattccgc agttgttgca caatgggagt 54121 ttctaggcct ctaagatatc tgattcctgt gctctgcaac agatctatct cctgcaccca 54181 gcagagctca gggtcattct taggtcatag tggttaaaga cggctgtcaa ttcactgctt 54241 gggctaggtt tcaagtctcc tttatctctg ttatattgat gattcccagt ggttaagaaa 54301 ttaaccatac tatcagtatg catcttaatg ttttgttgtg tagcaccaaa gtgactgtgt 54361 taaaagctca gtgtgctgtt atgtgaaagt cggctaccct ttaaaaacta agaactggaa 54421 caaatatatt tgaataaata gttatggttc tatgctgtgc tacacatttt taaaagggag 54481 gttataataa aaggttgaaa ctagctgatc ctgcctcttc tcagtgcaaa ggtaattatt 54541 attcagtgaa aaaatatgca gtgaaaataa gtaattgcat aagaagagat tatagtcctg 54601 agcgtataca cctggtgttt tgatttttat gaataccctt ttggatcccc atgcctgtgc 54661 aacttaatgt tcatcacaat tctccatggc aaaggaaaaa gacaaggctt aattgtaaat 54721 atgtgcatgc tttattgcca atgcaatggc atgaacacat ttaacaggat tgtcattcca 54781 tttaattact ctgattttga cagggaccta ttacagataa aattgatagc atgagtaata 54841 aggaccataa aaaaaaaaaa tcactgcttt ctggccaggc acagtggctc acgcctgtaa 54901 tcccagcact ttgggaggcc aaggtgggtg gatcacctga ggtcaggagt ttgaaaccag 54961 cctggccaac atggtgaaac cccatctcta caaaaaatac aaaaattagc caggtgtggt 55021 ggtgcctgcc tgtaatccca gctatttggg aggctgaggc aggagaatca cttgaactca 55081 ggaggtggag gttgcagtga gctgagatca cgccactgca ctccaggcta ggcaacagag 55141 taagactctg tctcgaaaaa aaaagaaaaa gaaaaaaaaa tcactgcttt ctcttcaaag 55201 agtgagtatt aagatcttcc atattctcta tggctacatg cattaggtga agaaaaaaat 55261 aaagcatagt aataggatag ccaccctcaa attgcttcct tccaagttga gttatgttta 55321 tgatatgaca gtgatgcatg gtcatttgtt tagactaact gggttagatc tgtgacatct 55381 aaagaaaagc aacaaatgaa taagtaggca agtggaaaaa attagagaaa agcactctcc 55441 caaattaatt attcatgtgt taattacatg ttctgtttct cagttttttc tcagtgatat 55501 ttaaagatat tcattacact tttgcctgag acaggtaata aaagttcagg agctgaacac 55561 taagcttaag atatgtacag atgaacagga gggatgagag gactatggta acagtcagtc 55621 attgccagaa atttgaaact ggcattaaga ggaagtcagc agctctattc aagtgacaaa 55681 gcagtggtgt ggttcttggc atcctggcat gtatagccaa atggaaaggt gccaggtatt 55741 ggctggaaga gcataaacta ctgattttct cttccataaa ttaagggaag gaagagtttt 55801 agcagcacag catcaatgtt ttatgtagtc acattaaact atctgctact tgagttcatt 55861 ttccacattc tagtttatga cttacttttc aaaaagctag tgtcttctta aaatgtctcc 55921 ataagcctca cacccctgct catgaagaaa aaaatctaaa tttctaaata cacctataaa 55981 tctatcaccc aaataggtga tgaatcaaag catctccttt ttggtctttt tttttttttt 56041 tttttttttg agacagagtc ttgctctttt gcccagtctg gagtgcagtg gtgcaatctc 56101 agctcactgc aacctccgcc tcctgggttc aagcgattct cctgtctcag cctcccgagt 56161 agctggaact acaggcatgt gccatcacgc ccggataatt ttttgcattt ttagtagaga 56221 cagggtttca ctgtgttagc caggatggtc tccacctcct gacatcgtga tccgcccacc 56281 tcggcctccc aaagtgctag gattacaggc gtgagccacc atgcccagcc tcctttttag 56341 tcttttaaac ttgaaactgt cattcaaata agctcatatt tatttgacta ttgatcaaat 56401 ggatatcttc aataaaaata tttttccaag ctacttttgt ctttcaatta agaaatggtt 56461 tatgctatgc caataggatg agatattcag aggattaaat caataggcta taaattatat 56521 gcaaaaaaaa aaaaatcagg tcactaccct gacacaacag atttattatt aaagtaggta 56581 actccatcgc aattcacatt agtttttata aagtaagaga cagggagaga tcaatgagaa 56641 ctttggctta ctgctgcttt taatggttca tttataatag ctacttatct cctattactt 56701 tagcttgcta taataatgcc aagccaagaa aaaagataat caccattatg ggaatatata 56761 tttaattcaa gtctgattga ctgttcttta atctgattca tgtatttgta gagtggccag 56821 aagtactcta aaagcatagt gattcttagg tacacataaa aaccccatta tacagtccac 56881 atcaggttta gagtaaaaaa catctggcag ttttggcttt attttcatag gcatattatt 56941 attagataaa caaccacatt gcagtgcaca cccaaaaccc accagctcaa acccaactgc 57001 tactatcaac atcaaggata acagccaaac cttaaatgga cctttcagcc ccactatgtt 57061 tcctggagac tactatcttt gcaaataaaa agcagatttt tgctctacac atgcacatgg 57121 atccacagtc cattcctgga ttctctacaa tcaccttgcc attcttgatg attaattacc 57181 ctgattgcag tggagatttg atacaggctt ccaaaggcca tctcaaatga aggagcaata 57241 agctcactga tttcatttaa gaaggttgaa caagggcagt cctcaccaac caatccccca 57301 tattttttct tccagggaaa ctgctattaa ataaatatta aaggagagga ttagggttac 57361 agaattaggg agggaaggct ccccagtact atcactatgg tggaatggta attaaggact 57421 caggtcttct tggcaatata aggtggctct accactgttc atccttactg gttgccttgt 57481 taagtctctg cccagaaaca taaggaacct gactgatcac tcctgattct ctacatctct 57541 aaatgacact attggtggtg tacatgcaat ttcactccaa aagcagtggt tacagtacac 57601 tctccaaaaa gctaaaatta gtaagactgg caataccaag tgttggcaag gatgtggagt 57661 aactggaact ctcatacatt gatggtgaga gtattaaatt gatacaaccg gctgggtaca 57721 gtggctcacg cctgtaatcc cagcactttg ggatgccaag gcagacggat cacctgaggt 57781 caggagttca agactagcct gattaatatg gtgaaacccc gtctctacta aaaaaaatac 57841 aaaaattggc tgggcgtggt ggggggcgcc tatagtccca gctactcagc aggctgagac 57901 aggagaattg cttgaacctg ggaggtggag cttgcagtga gctgagatca tgccacggca 57961 ctcttgcctg ggtgacagag caagactcca tctcaaaaaa ataataataa ataagtaaat 58021 aaataaataa acaaattggt gcaaccactt tagaaaagtg tttggcagcc tctatgaaag 58081 ctaaacacat acctagccta tgatctaaca attccactct taggaacata gccaagagat 58141 atggatacat attctcacca aaagacatta agaagagtgt tcttataagt tttttataag 58201 acttactcaa agccaggtgt ggtggcatgt gtctgcagtc ccagctactc aggaggctga 58261 agtggaatga ttacttgagc ccaggagatc aaggccaccc tgggcaacat agtgagactc 58321 catgtctaaa aaggaaaaga tttactcaaa atagccaaaa atacaaaaca accaaaatgt 58381 caccacagta gaagagataa atgaaacatg gtatattcat acaatggaat attacacagt 58441 tattttttta aaggacaact attattacta gctataacat gaataaatcc tacggatatt 58501 acattaagaa aatagacaca cagtatctac cgtatgaatc catttatatg aaatacaaaa 58561 ataggaaaac taatccagca gtagaagtca gaatagtggg ggtgaggcat gaggggtggg 58621 aattgactgg aaaaggacac aaaagaatct tctgggatgc tagaaaagtt ctatatcttg 58681 atctaagtaa tcaagataca catcagatac acatcacatc aagtatgatc gagatacaca 58741 tcaaatacag gtatgtgatc aatcttaaaa gcgttataga aaaaagacac aaaagtctat 58801 atgtagtatg atttcatttt ggaaaggaca aaactggagg gatagacaat aggtcagtgg 58861 ttaccagaag atttgggtaa aaggagggaa ctgcaaaggg ttacaagcaa acttttgggg 58921 atgatagaaa cagtctatat tttgactgtg gttgtggtaa tctgactcta catagttatc 58981 aacattcatc aaattataca tttaaaagga tgaatcttat agctcaatta taatttgttg 59041 cctagtttga aaatattcaa caatctaaac acttaagatt agtatacttt ggctgggcac 59101 ggtggctcac acctgtaatc ccagcacttt gggaagcaga ggcaggcgga tcacctgagg 59161 tcgggagttc gagaccagcc tgaccaacat ggagaaaccc catctctact aaaaatacaa 59221 aaattagccg ggcgtggtgg cacatgcctg taatcccagc tactcgggag gctgaggcag 59281 gagaatcact tgaacccggg aggtggatgt tgcagtgagc caagatggca ccactgcact 59341 ccagcctggg caacatgagt aaaattctgt ctcaaaacaa agattagtat actgtgtgta 59401 cttcctatac tatgtgtata ttatatttta attttaaaaa gtttgaaaag taaagaaaaa 59461 aaaaggaggt taagtgggtt gtaaggcatg cattccagtc cagactttgc tactggtgtg 59521 ttatgtgaaa tgctttaatt gttgctactg aataaaaata cagaaagtgt taggcctgga 59581 atgggccctg aaactcacat aatctcactt tgatctaccg aggagtaatg atctctgaaa 59641 tatcacacag ctagtcagtt gtggaggcag aaatgaaaac ctaggacaac tgactctcag 59701 actctggttc aattccagtt caagcgagat ttttaaaaag tcaaactgat catttgataa 59761 taaagaagta gcaaattttc aactaattgt gtaatactgg gcccctgttt gcacatacag 59821 tcatgtgcca cataagatgt tttggtcaac aatggtcggg cacggtggct catgcctgta 59881 atcccagcac tttgggaggc cagggcgggt ggatcacctg aggtcaggag gtcgagacca 59941 gcctggccaa catggtgaaa ccccatctct actaaaaata caaaaattag ccaggcgtgg 60001 tggcaggcgc ctgtaatccc tgctactcgg ggggccgagg caggagaatc acttgaaccc 60061 gggaggagga ggttgcagtg agccgagatg gcgccatcgc actccagcct gggggacaag 60121 agcgaaattt tgtctcaaaa aaaaaaaaag atattttggt caacaataga ccacatatac 60181 aatggtggtc ctataagatt ataatggagc tgaaaaatta ctatcaccta gtgatgtcat 60241 agctgtcagg atataatgaa actaattttt aaaagataaa tttagtgggc tgggcgcggt 60301 ggcttgggcc tgtaatctca gaactttggg aggctgagcg gggcggatcg cctgagctca 60361 ggagttcgaa accaccctgg gcaacatggt gaaaccccat ctctactaaa acacaaaaaa 60421 tgagctggac ataacggcac atgcctgtag tcccagctac tcaggaggct gaggcacaag 60481 gaacgcttga gcccgggagg tagaggttgc agacagccaa gatcacgtca ctgcactcca 60541 gcttgggcta cagagtgaga ctctatctca aaaacatggc gaaactccgt ctctactaaa 60601 aatacaaaaa ttagccaggc gtggtggcac acacctataa tcccagctac tcaggaggct 60661 gaggcaggag aatcatttga acccgggagg tggaggttgc agtgagccca gattgcacca 60721 ttgcactcca gcccgggcaa cgagagtgaa tctccgtctc aaaaaaaaaa aaaactttaa 60781 aaataaataa atttagtgta gactaagtgt acaatggctt ttgttttgtt ttgttttgtt 60841 ttttgttttt tttgagatgg agttttgcct gtcacccagg ctggagtgca gcggcatgat 60901 ctcggctcac cgcaacctcc gcctcccggg ttcaagtgat tctcctgcct cagcctcctg 60961 agtagctggg attacaggtg cctgccacca tgactggcta atttttgtat ttttagtaga 61021 gatggggttt caccatgttg gctaggctgg tcttaaactc ctgacctcag gtgatccacc 61081 cgccctggcc tcccaaagtg ctgggattac agccctgagc cactgcgcct ggccagtgta 61141 cagtgtttat taagtttaca acagtggata ataatgacct gggccttctc attcacccac 61201 cactcactcc tgactcatcc agagcaactt ccagtcctac aagctccatt catggtaagt 61261 gtcctataca ggcataccat ttttatcttt tatacgtatt tttattgtat cttttcaaca 61321 tttagataca caaatactta ccactgtatt acaattgcct acggtattca gcatagtaat 61381 aagttgtaca agtttgtagc ctaggaacat ctataccata tagcctatgt gtgtactagg 61441 ttatatacca tctatattac tgtaaataca ctctatgatg tttgcacaaa gatgaaataa 61501 tctaacgaca catttatcag aacatagtcc tgtcattaag caacatatga ctacatttgc 61561 ttttcacttc ccaaatgcca ctgtactcca gcctaggtga gagagtaaaa tcctgtctct 61621 aagaaaaaaa aaattattgt tatatatatg agttatccgt gatcatttga taatagcaat 61681 ttacccaaaa atgcagatta tgacagcctc tatttaggaa gctccagcta aatcttggca 61741 taccatctac cccttcctgt ctggggccat ctccaatgtc acttagtcat acaaagtacc 61801 caaactgctc taccactggg agaggtattt taggtacaat ccagattata tgactccttt 61861 attaactcct ctgaaaattc agttcctctg aaacaaagtc aacattttta agtccacaca 61921 ttcacaaaca ccccagatct tatcaaattc tataatataa gatggtgttc attgtgctac 61981 atagttccta acaatattct cataaaattg caaacaagag cagttatgcc atacttctct 62041 tgaaaattga aacattaagg aagtgctaaa tcagtgatct ctagtttctc tttctatgct 62101 ctgtagctat ctaaatgacc ttttgacccc cttcccctcc catccctcac taaaacccaa 62161 acactaactt ctctctgacc tccaccccct gcaaaacaga aaagaaacta gtatgttatt 62221 tacacttttt gattgactga ctagttacac ttaagttcct gttgttgttg ttgttgttga 62281 gacagagttt cattctgtca cctagcctgg agtgcagtgg ttcagtcact gctcactgca 62341 gccttgactt cccagcctca agcaatcctc ttgcctcagc ctcccaagta gctgggacaa 62401 tgggtgtgta ccactacact ctgctaattt tttttttttt ttaattttta gtagagatga 62461 ggtaacattc tgctgcccag gctggtctcc aacttctgtg ctcaactaat ccatccccct 62521 caacctccca aagtgctagg attacaggca tgagccaccg cacctggccc actttagttc 62581 tagtaagttt tagcagtatg ttaattcaag caatatttac attttaagcc attttatccc 62641 ttattgcata atgaaaggcc aattgcttct ccattaaaga attctgacct tcaatttaac 62701 ttcagtaagc aagtgagaaa tattttggaa acaaggaaga acttgcaact acactttgta 62761 ttcctagtct tggggagtca tgaaccacat tagaatagct gagaggagct gggcgcggta 62821 gctcacgcct gtaatcccag cactttggga ggccgaggca ggcggatcat gaggtcagga 62881 gatcaagacc gtcctggcta acatggtgaa accccatctc tactaaaaac acaaaaaatt 62941 agctcggcgt ggtggcatgt gccagtagtc ccagctactc aggaggctga ggcaggagaa 63001 tctcttgaac ccgggaggcg gaggttgcga tgagccaaga tctcgccact ggattccagc 63061 ctgggcgaca gagcaagact ctgtctcaaa aaaacaacaa caacaaaaag aatagctgag 63121 aggctataaa gaaagaaccc aaaggatcat ttgctcattt actcactcat tcgtgcattc 63181 agtaaacatt taccaagggt ctaccacgtg ccaggtattg tacaaggcag acagaattat 63241 aagatatggt ctctatcctg aagtagtttg caaggaggag tcgcaaacag attaaaaata 63301 tttaaagccc aaaagtgaga agcgctctga caagtagaaa acactgaaaa agaacaggag 63361 agatgggatt cacttgccct ggggagagtc agagcaggtt tcatacggac agacctgtga 63421 gctcagcttg aaggaatgtg acatgttgaa aggcaaagaa agggttgaaa tgggtttccc 63481 aaactttcag ccaaatatga aaggagacag gaggagggga atgggttacc tttctttgca 63541 cagagggaat ttaaggccag agaagctaat gacaagccca gtgtccaccg cagccacaaa 63601 gaaaaagggc tgtggcagca gctgggcccc ttcctcttat ctgtagtcac acagagaaca 63661 cagttgatga caagccttgc caacatgccc ttagccactt taggggagga caggtgcaaa 63721 cattttcata taactccata ctcagcctac actctttaaa aagttatgtt tgtatcagga 63781 tatatcattg gaacataaat tttatccaca tcctttaaaa tagttacaat ttggttttgc 63841 cctgtttttg gcctaagaat ttcaaagcct ttgttggtgt tgaatttagg atcacaaagg 63901 tttgtaatca aacaacgtcc ttgcctcaaa atagcttcct tctagccagg gaagtattag 63961 agggggtctt gcctagctat agcagataaa ggcttattcc tggaagcttg ctgaaggcag 64021 agaagctcct attccctcct ccccaaattc tagcaaagcc aggtgtgtct ctacaatatc 64081 cagtgctgac ctcttttgca caatttcttg gcctaagcaa ctttcttagg tctttaaccc 64141 tcccttgccc caaaatgtgc agatgaatag gttgtcataa agcatatttg atgttatttt 64201 agtcagtgtc taatattttt gtatcacaac cgtatttctt agagttctaa aatatagccc 64261 tcaaggctgg gcactgtggc tcacgcctgt aatcctagca ctttgggaca ccaaggtgga 64321 tggatcacct gaggtcagga gttcaagacc agcctggcca acatggcaaa accctctctc 64381 tactaaaaat acaaaaatta gccaggcgtg gtggcacacg cctataatcc cagctacttg 64441 ggaggctgag gcaggagaat cacttgaacc tgggaggcag aggttgcagt gagccaagat 64501 cacaccactg cattccagcc taggcaacaa gagtgaaact ctgtcacaaa aaacaaaaac 64561 aaaaacaaaa acaaaatagc cctcaaaccc ctacattatt ccaaactctg acaatgttaa 64621 caatgaatat ccattcttta ttctttttaa aaatatattc attctttcaa cagcatttat 64681 ccaacagtga ggcaacgggt tagttatccc ctcaatgccc attgtctcct tcttccactt 64741 tagctgggct catggccacc cacccacaga tggtatttcc tagtatccta ggtgtggcca 64801 catgactaca tttatcagaa tggaatgtca gtgaaatctc caactatatt gtaaattcct 64861 gatgatcaga aaccctgtct aacatatatt ttatttccaa cagtacccat ccagtattat 64921 gttaataaat gtttgttgat ctagcttttt atcttgacta atgtttcatc tatactatat 64981 attttgcatc agaaataaat tttatgattc tagaaacatg aaaaagaata gaatgtaaat 65041 ggaagtaatg tgtacaactt acaaatcatt tacttcacag gaaattgctt gctctccatt 65101 tttgctctac tcctcttcct aaaaactgaa atagggatgt ggagctgaca cagcttcaac 65161 catgtagaaa agactaggga atgtcagagc agcacagtga aagaaacaga ttctgaatga 65221 cttcacatgc tgaggtaccc caccagtgta gatgcttaca ttcctctaac atgttacata 65281 caagagaatt aaacttcagc catatttaag ccacttaatt ttttggtttc tttgtcacag 65341 tagcttagct ggaccctagt taatataaac atttattcaa tgtcttttga gcgtttacta 65401 tgtgacaaga actgtcttct gggtaaacat aaagtggcga ataagcggaa aagctcccag 65461 aacacaggtt gcttatagtc tagaacaggg gtgtccaatc ttttggcttc ctggggtcac 65521 attggaagaa gaagaattgt cttgggccac acataaaata cactaacact aatgatacct 65581 gatgagctaa aaatatatat cacaaaaaaa actcataatg ttttaagaaa gtttacaaat 65641 ttgtattggg ccacattcaa agccatcctg tgctacatgc agcccatagg ccatgggtta 65701 gacaagcttg ctctagaaag tgatattaac aaaaaatgta aaactaaatt tttaaactgc 65761 gctaagtatg aaaaggaaac aaagagaatg ctgaagtaga atataacaga gaaggataca 65821 ttctgagtag aatagtccac taaggcctcc ctgaggaagc atcatttagc taaggcctca 65881 aggtcataga aagaaatgga gagagggagt ccattctagc atgcaaagaa agacccaagg 65941 caagaaagag cttggctctt tgaatcaagc tcaaccactc acctaacaca attatagatg 66001 ctcaatcagg gtctccacag atgccttcat gacattctca aataaagaaa gaaagtagag 66061 agagagaaga gaagaaagag gaaaggtaag aaaaaaaaag aaatttctag tcaatcaaac 66121 cagttaggga aaagagtaga gagtgttatt aagaatgagc gcttgggagt caggcagctg 66181 agtttaaaag tggactttga catttcacag ctgggtgacc tctcatcttt ggtttcctct 66241 tctgtaaaat gtgggttata atggcaccta cacctcataa ggttgtgaag aggatttaaa 66301 tgagtcaatc catgtgaagt acttagcaca gtgcccagca caatgtaagt gatcaacaag 66361 tgtcaactac tgctatacat ataggagctt cagaagaata aatactcaag ctgatacgat 66421 gggaaggctt taaaataaaa tagaaaataa attatgctct ccaactcagc tggataaatt 66481 tgtgttttct tggtgggtaa agaaaagggc catcattcac tctttttata tatttgtata 66541 ggtttaacaa ggcaagagag tatgcatcaa aataaaccaa agctaggcaa agagaaaaaa 66601 tggaatatga agaagctggc ttctggtcca ccttctatag atacttacta aatcatatag 66661 tccaattttc ctcccataat tccagaccac agtccacagg ccccaaaaca tttcaaagct 66721 caaatttggt atattatatg gtacacatat caggtttctt agttgtaagc aacagcaact 66781 gactatgggt gttttaattg gaacaggaac agatgggtag ctcagaacat tgctacaaag 66841 actgggaatc aagcttgaaa aatgaaaagc gtggcagtca gatctagggc taactgttac 66901 cacaaagcca ggctggtgag tagacaactg cttctgcccc aaacctcatg ctgtcacctg 66961 cactgtcaag ccactagacc tggtcactac tggatgccat ctttagctcc tttatcatgg 67021 ctgtgctaga aataggtgac tctgctgaga aagagaggat atggacttct cagacttata 67081 atgaaaaatt atctaaacat aaaaggttca gacagtagca aacctcatca aaaaagaaaa 67141 aagaagccgg gggcagtggc tcatgcctgt aatcccagca ctttgggagg ctgaggtggg 67201 tgggttgcct gagatcagga gttcaagacc atcctggcca acatggtgaa accccatctc 67261 taccaaaaat acaaaaatta gccaggcgtg gtggtggact tctgtaatcc cagctactca 67321 ggaggctgag acaggagaat cacttgaacc cagggagcga ggttgcagtg agccaagatc 67381 aggccattgc actccagcct gggtgacgag aaactccatc tcaaaaaaat aaaaatgaac 67441 tttattctat gtgtgtatat agtcatatat gttttattgt acaaaactaa gatcatattg 67501 tatacacagg ttgagtatta catatctgag atgcttggga caaaagtgtt tcagatttca 67561 gattttttta gattttggaa catttgcata tgcctaacga gatatcttga ggatggaacc 67621 cacatctaaa cacgaaattc atttatgttt catgtacata ttatacacat aagcctaaag 67681 gtaattttgt gcaatatttt aaataacttt tgtgtgaaac aaagtttttg ttaagtactt 67741 acatatggaa ttttccactt gtggtgccat gttgacacaa aaagttcaag atttggcagt 67801 atttcagatt ttggattttt ggattaggga tgctccatct gtatagattt taacaaccta 67861 acaccaggct tctaaaatat aaatgtattt taaatatgaa gttaaattta ggattagttt 67921 taattacatt tctacttttg cagtcctaaa atatgagatt tcaacatgtg ataactgaat 67981 gaaatgcatg ggctgcaaat tccaactttg ctaatcactc actgtgtaat ggtgagaaac 68041 ttacagaact tccctaagtc tcagtttctc catgtgtaaa tgggaataat aaaacccata 68101 taataaggtt gctgtgagga aaaacaagat aattcatata aaacccttag ctcagcatct 68161 agtgccaaaa aaaagtcttc agtaaatgtt aacataaaaa tataagttgt cataattttc 68221 tgaagtctag atttttctca tagaaagagt actttttttt tttttgagac agggtctcac 68281 tctgttgccc aggctggagt gcagtggcac aatcacagct cactgcagcc ttgatcacca 68341 ggctcaggtg attctcccac ctaagcctcc tgggtagctg ggactacagg catgtgccac 68401 cacatccagc taattttctg tattttttgt agagacaggg tttctccatg ttgcccaaac 68461 tggtcttgaa ctcctgaggt caagtgatcc tcctgccttg gcctcccaaa gtgctgtgat 68521 tataggcatg agccaccgtg cccccaacag ggaagagtat attttatgta agataatgaa 68581 ctgcaagtaa agtctttatg gttcatccac gtagccagaa tgccctcaat ttccaagatt 68641 cttgaatcaa acagttgact agggagctaa catgccatct gagcagcttg ggcagggcaa 68701 ttataaagct tgctctcttg atttttttgt ctgtctcgtc tggcaaatca tgtaaggaat 68761 ttgtgaaatg gaagacataa aatagatcaa ccagtttctc tttacccatt gggcaggaga 68821 gatctaagct gttacccttc ctatagggtt catgaaagta aacaggaagg aaaaaaccct 68881 ctggccaaat atctgcttat tcaaacttgc ttatttttaa taactttaag ttcttcaata 68941 tttaatgata actataatta gcatgcttaa tatagtaatt aattattaat gttattaata 69001 tatctagcct ggaattaatt tttgcctgtt ttattaaata cgtacttaaa gtacatatta 69061 aagatataac taaatttttt ttctcaccat tattgctgct aacacccttc tgggcctcag 69121 tttcttcacc tgtaccatgg agaagccaga ttagattatt ttaaaagtct cttatttgat 69181 tctgtgattg tagatataat taattaatgt cagtaatttt tttttttttt tttttgagtg 69241 ggagttttgc tcttgtttcc caggcgggag tgcaatggca tggtctcagc tcactgcaac 69301 ctctgcctct caggttcaag cgattctcct gcctcaccct cccgaggagc tggaattaca 69361 ggcgcctgcc accacgtaca gctaattttt gtatttttag tagagacaag gattcaccat 69421 gttggccaag ctggtcttga acttctgacc tcaggaaatc tgcccgcctc ggcctcccaa 69481 agtgctggga ttagaagtgt gaaccaccgc gcccggccaa tgtcagtaat ttctaatgtc 69541 gataattatc ccaagtcctt gttcacttat gcaagtttaa aaaaaaagct ggcggggggc 69601 gggaggctgg gtgcagtggc tcatgcctgt aatcccagca ctttgggagg ccaaggcagg 69661 cggatcactt gaggtcagga gtttgagacc agcctggcca atatggtgga accccgtctc 69721 tactaaaaat acaaaaatta gccaggcgtg gtggcaggtg catgtaatcc cagctactcg 69781 ggaggctgag gcaggagaat cgctagaacc cgggaggcag aggttgcagt gagccgagaa 69841 cgcgccactg cactccagcc tgggcgacag acctagactc cgtctcaaaa gaaaaaaaaa 69901 agtgctattg ttaggagggg gaagccgaaa gagtaggaag cagtacttgg aaactttcta 69961 gaaagaagtg agacgaggca aggtgttaag gcattcgact cagcaattgg agtagcagaa 70021 ggcatctgat ctgagaggaa aggcaaggat gcgtccctga acatccaata ttcagaaacc 70081 cgcaccgcta ctgttgaaaa caactcaatt gaactaggag gagaatgaat atgttactta 70141 attaagtaca ttctgaagat atctggggca tggggcggac agggaatcaa atctcctttc 70201 ccctttcaaa tcttagagac agacgagggg cgaggcctct gacctgggtt tccttctctg 70261 aggaagtgtt gggtcccgaa gccctcttcg ggagagtcag gtggcggccc ggggcagctc 70321 agaccgcgcc tcaggagccc tcgaagctgg gcgacgctgg cagaaagaaa cgcgaggcct 70381 cattgtcccc cgagacaaag cggagctctt cgcagcctcg gggacctgtc cggactcact 70441 ttcccttccc tccgaaacca cccaccgacg cgggccggga aacgctcagg cccaagatgg 70501 gctctgagcg cgaaccaggg aagagcccgg atgctgctcc tttcccgccc cggaccggcg 70561 caggtccatc cagaggcgcg aagcctgcag gccagggaag agcccggatg ctgctccctt 70621 cgtgctctgc ctggggccca ggaagcagga gaaacggaac ggcgacgttg ggtcaagaac 70681 tcagagggtg aaggctggga agccaaggaa ggaaagccgt cctgacctcg gcgtcggggc 70741 tcctctacaa ccccgcaggg tcgcgaaggt cctgcggcat ctcctctcgc agttgccgca 70801 gcctagccgg ccgggggcag gcgctggtgc cccccgcccg ctccccgcag ccccagcaga 70861 gccggagttc ccgcggccgc cgctgcccga gcgactcgat cgcccgagcc gacctcttcc 70921 caagccttgg acagctgacc cctgcgcatc ctaaaggaaa gaccccatct gttcctcaga 70981 atgggaaaat tcccgtgcat actttgccag aaccgtgttg ctgatatttc gggggttggg 71041 gttaccattt taaatcgtgg ctttctgcat tcttgttacg gttctgtggc gacttagaac 71101 gaagtgggtc gtaaatcgta aaattgtcca acgcggctaa aacgtgtccc agtttgaaaa 71161 gaaaagtcgc tcaataattt ttcttttatt ttctgttgca agcatttgtg tattttttcc 71221 caaccgatta accacacaag cctaaatgaa aatataaaaa atctgatttc tatagattaa 71281 ccttgtgatt agacgtttta ccaacaggtt tttaatacaa gagtaaaatg taatgggctg 71341 attaaaaata gagacacgtg ttttagttgt gtttgaacat gttgcaaaaa gagacattgg 71401 tcttttcatt ttttccagta aactcaaaat gcagtcattt aaggatgttt tgcaggatat 71461 gatgagggtt tagagttgcc taaggaaaaa aaaaaaaggt gtttagaggc tagcgcctac 71521 tgaacatact tgtttaattg atattttgtt tgtagttctg ctatcaaagg tcagctagct 71581 gtatttgatg gctggtgcta ttaaaaataa ggcactaaaa cgctgaggat tccagctcac 71641 ctcttcaaac actgacaacc attgatgtcc ctttgggcat caggttctgc tgagatcagt 71701 aggaccccat ggggatgaga tggaggagaa tgtggaaagt tgtattgcaa ggagacttta 71761 tgttaatagt ttgggcttgg gtgtgtgtgt gtgtgttagg tacatgagtg agtcttggaa 71821 aaaattaaga tatgtgagtc agaacaattt agaaggtatg ctgcgccaca aactagccag 71881 aaaaagcaaa cacattttta aaaggttcag agttagaacc cctcctggaa tgtcccttgc 71941 atcttgaagc cccttctaga tgatacttct tgacaattta atggaggtcg aatttgtgtg 72001 taggcaactc ctctttcatt tattctttcc catccaagtt taaaaacaaa acagaaatat 72061 atgtgtgtgt gttgaatttt acttgccctc cagattttta tttttatatt ttaagaacat 72121 atgtatttaa aggtaagtag cacattttat tattcaaaca agatggcaga agcattcaac 72181 tcagaggttg tttcaaagat tcggtattcc taatggcctc aaagtcaagg acttgactga 72241 gtgggtggcc agtaaagcag aaatggaaaa ataaatcata tctagtctga aacaaattac 72301 atggaataaa tcagggaggg acaaaattga atcaggactt tagaatttaa aagtgcactg 72361 gactcaggtt ctaagctgcc ctgttttttt atttaggaag taagagtatg gatactgact 72421 gtgctctcag tcaaaccaat ccacctctgg gtgccttttt ctgatccata gagtgagaga 72481 ttagggatgt gggtcccatt caattctggt catttaggca caaggagtcc taaggcagaa 72541 tttttgtgtg ttctcacaaa tgcaggcaca aaggggactt atctaaggca gagggtcacc 72601 aaagttgacc agtgactagc ttatcccaat agtggcatgc taactaaagg gtgttcttgt 72661 atgtactgaa aaagccatag cgtttctttt acagccccat ccaatataaa gaaagatgtt 72721 gttcatgcac taatttcagt atttttaagc acaggagttg cctgtctctc aaaataaatt 72781 aaagcctgaa atgcctagag cttctgcttt aagaatgatt attcttaagg aggaaacaca 72841 aacagcagtg accatttttc acagacgtgc tcagccatca ggaaagctta ataaatccct 72901 atttagtatg catttatgca gtgattacat gaactttgac ctctgaagag ctgtagcaat 72961 tagcttcagt gtctagcatc ttctactcac cattccccac tggcttcctt gtgtcttcct 73021 ttttcatctc tcaatacttc atgcattaag aaggcgtaaa atttaaaagt tgactttcca 73081 cctgaggtat ttgcttctgc gcaagccaat tctccctccc ccctttccca ctcaacagcc 73141 atctcagatg gatattgggg ctctgattgt caatattact caaacacaaa tggatttcaa 73201 ttcaatcttg ccctgtactt tcacaaagat aaactctaat ggcccaaaga gcttcatttc 73261 atagcttgtg gcatctgaaa taaatcagat atcttctgtc agactaattt aacattttaa 73321 caatatttga tgtgatttat ttaaagtcaa atgaaactga atggattggc agaaaactag 73381 aggagggggc aggagaaggg cagctggaag cttccaaaaa acaaaaaaca aaaaagaaaa 73441 aaaggaagaa aagaaaagaa ccaagagggg agaggaaagt gttttaagtt tctagtaaac 73501 attgtttagc aaaagaataa taggaaagtt gccccttttc caaagaggga tggaactata 73561 attgttgcct ttagttgaga gctcagataa actgctggga ctcattctaa agtgacccat 73621 cactgcattc atctcttctt gaactttcgg tgaacctgag attaaacagg ggtgagcagc 73681 taagcagttt gttgttagca gccttttcaa caaccctttc agactgagta aatattgatc 73741 ggtttgaatc tgattgcccc agaggaaaac accaccgcat ttgaatagcc tccaaaagta 73801 aactttaaaa ggtgaaaccc agagcagact tcagagccaa agcaagacct caactcagcc 73861 tagtttggga acactctgta tttggggagt actgtggata aatgtagagg aagtgaggaa 73921 aaaaacattg gaaaactaga attttgtatc gcagttatcc caggtgccta actgagaccc 73981 ttgtttcctt tgtaaatgta tcaataaaat gttattttca gttttcccac tctgtgttgt 74041 ttcagttacc attgctgtca tttttcaagt agctaactgt gctccttaga gaccaaagca 74101 gagagacctt gaaataggat gtgttaagcg cctttgattt aatcgattga gacagctaaa 74161 aagatacaaa tgttttcctt gttgaaaggt gtattaatgt ttcctaaatg cattgcttgt 74221 taactgattt attccatgac ctagagaaag ctagagatgg ttagccataa tttattgttc 74281 tctcttgctg ctacattaaa accagcatct taaagactgg aatctctggg attccagtgg 74341 atctcgccct gcttttagat attgaaatgg agtcaatggc atgcaaatga cactttaaca 74401 cattaaagta tttggaagcc agccatctgt tttgctgtgt ggccctacac taataaacaa 74461 aagatacagt ataaggccag gcaggaaatg aaagaaaggg ttgaccgtct ttgaagttct 74521 tctccatcca cttgagtctt ttacaggcaa cctcagaacc cagcagctga ggggagggca 74581 cccggtgccc ttgatatggc cgcagcaact gcaccgcaaa gttcagcctt ctcttctttg 74641 acagggaagt gcaggaattc tcacaagtgc cctagacggc ctgcaaattc caaatttttg 74701 tagggttttt ctcctccagc ccgtcctttg taaaacaccc tttggggagg aaggaagtca 74761 tctgtctgtg tttgttttca atgctgtaaa ttttctgctg tactgttggg ctaaataaaa 74821 tgcctccaaa aatgtgtgaa agtaaaggaa agcaataaag aggcctggga ggagagccac 74881 tttgaactct atcatttgct ggcttcctct tttgagagca gatacatacc ttggtcagaa 74941 acaacttgga tattgaaaga cttcagggta gcttccacaa gttcacctgg aagtacctgt 75001 ggagcccaga gaagaaaatg actttttttt tctgggttat gtggcatgga attagaactg 75061 tggatccttt acagccatga cggctatgaa atcaaagctg agtctgaagg gaataagatg 75121 tgcaacccag agatggggaa gaaggggtga agagccgcag gaagggatgg atgttgacag 75181 gaagaatggg ggaagggatg caagggttta ctaaaaacac tccatttcaa gggtggaata 75241 tatcagaact ctgcctgaca atccaaaaag aaggaagagg agtggaataa gggccccatc 75301 ctcacaacaa ctttatactg gagctaatta aatgtgaatc tcttagctct tggttcactg 75361 atgtgaggaa agactgactg aagacagttt tggcttttga agggagttct gtttatatat 75421 acgtcaacat ccagttggag gtgaaaaggt tagcacttga cccaggaagt atccatgttt 75481 gtttcaaaaa taaatctgct tcataaattt cttcatcagt ctttttttcc attatgagct 75541 ttgattataa taaaggagct gttattaact tttattcaag aaaaggccca tctctttgaa 75601 aatatttacc acccttctcc ctttcccctc atgaaatgtg ccaacttcat aggaattaac 75661 aaattgtagc ccagccaaat acacggatgc ttaagcatac ctgaaacttg agtatattta 75721 tttattacag acatcctaag acccgtaaac tctgctctgg atcatatcac tccaggatct 75781 cagagctgtt catgattgta caggaaatgg ggaatatcat aggctcacaa aggataactg 75841 atagaactca gtgtggtact ttggggacat caaacattgt gcgacatgca aaagactatt 75901 cacgaataac acaaaatata cattcattgt gccatccatc acattaacaa ttgagctgaa 75961 aatacattat atccagctaa gataactgtg gaaggaagaa attggtttga ataatacttt 76021 taggttctga ataacccagc acaaatttta aacagagggt ggcccgagaa gaaaggggta 76081 gagattggga aagacttagc acaggaagcc gggtttctga agtttgtgct ctgcagggct 76141 tcttaactgt aagaacaaat caaggctacc ctctgaggca tctgattggg tttaaatgag 76201 ggaatttttt ctttcaccta taaaattgta ccagtttaga gagtttgccc accctgtttt 76261 agtaacctaa acatttctag aaaatctgta taaagataaa tctcttagga caaagtattt 76321 acaaccagca aactcacaca catgaaaatg acttaaatta agggatgaat taattgtgta 76381 aacatatagt gcatctcttc ttcctgagct cctggactcg cctttcgcta tatcctactt 76441 tcaaggacaa gggaggggag agctgtacat atagttagat aaaagatgag aagattcctt 76501 ctggcatgtt tctgttggca aagggaacta ttttccaaaa ggtcatctga aaggaacagt 76561 aggttctgtg aattctccta aaagcaggag ggatgttaag gcccaccaga aaatgtatgc 76621 tggcacccaa tctggatgaa ggtgttaacc ccgcaccaag tctctggtcc agaattatct 76681 gcaaatatat tatcctggcc aggagctccc cagataggat tagaaaggaa gaaagagact 76741 gtaaatggaa agaaagataa gctaagcatg tgctttgggt aagaagtccc agcccaagga 76801 gatgcctggg ctgttgtctg gggctggagc cgcctcagtg ggaggtagtc agagtgtctg 76861 aggtagaaga ccccggggaa ggaacgcagg gcgaagagct ggacttctct gaggattcct 76921 cggccttctc gtcgtttcct ggcggggtgg ccggagagat gggcaagaga ccctccttct 76981 cacgtttctt ttgcttcatt cggcggttct ggaaccagat cttcacttgg gtctcgttga 77041 gctgcaggga tgcagcgatc tccaccctgc gggcgcgcgt caggtacttg ttgaagtgga 77101 actccttctc cagttccgtg agctgcttgg tagtgaagtt ggtgcgcacc gcgttgggtt 77161 gacccaggta gccgtactct ccaactttcc ctggggcaaa gtgggaagcc atgagacgga 77221 aatgtaaaaa tttttaaatc gacttgagat tccccacacg cttcatggca acactcaggt 77281 aaagaaaaga tcaagaactc agcacaaatc gggctgtgga gggtgagtga tgaggtgtaa 77341 agtgttaacc tgatgtaaac cattagcatg gtcagaccgg tgattaatgg agcctcaaga 77401 tattaacaga acactaccgt cacaataacc acccccacat acttcctatt tcccaaatgt 77461 ataaaatcct tgaaaacaca ccaatccctg agacttcttt gccccaacac ctctgggcac 77521 cctctccatg cactacaaca ctagtctgat acaaaagcct tttaaaaaaa agatcattat 77581 taatttcctt ggaaattaag cataccagct ccttccagaa taatcaagga gcatccacca 77641 accagcagga ctgacctgtt ttgggagggt ttcttttgac tttcatccag tcaaaagtct 77701 gcgctggaga agatgtctcc gatgcggggg agcgacaggc ttcttggtgg ctggcgtgga 77761 gaggggacaa ggagttatta tacgtagcca gggccaggct ctggtgctcc tgtccatatg 77821 agtggtgaat gtattgaggc gagcccaccg cgcccccagc ataaccctgg tggtggtggt 77881 gatgctggac catgggagat gagagatttc cagagtaaac agcgggagcg cactgggggt 77941 acccaccact tacgtctgct tcctgattta acgcgtaggg gctgtaaggc gcactgaagt 78001 tctgtgagcc atagcttgga ccacaacttg agtgggagta ggacaccccc aggttcccgg 78061 aagtctggta ggtagccggc tgggggtggc gatggtggtg gtggtggtgg tggtggggcg 78121 aaccgatctg cacccccctg cccactagga agcggtcgtc gccgccgcaa ctgttggcgc 78181 tgaccgcgca cgactggaaa gttgtaatcc tatggtccga ggggtaggct cgggctgagc 78241 aggtccccga gtcgccactg ctaagtatgg ggtattccag gaaggagttc attcttgcat 78301 tgtccatctg tcactgagtg acctggtcct gcgaagcccg gcgtgactgt gccaactttc 78361 tcacttcctc catggggccg gagaagaaaa atgatatgaa tgtacagtgc gcaagagggg 78421 gggcgggagg gcggaggacg ctggaggagg ggcacgtgac ggtgtcagcc aatggctgag 78481 cctcctgcaa aagtttgccg gcttccgcag tgatggatca ccgttttagt ggcatttaaa 78541 tccccggcgc tccgccgtct aggtgacgcg cagtcgcccc cccaggcagc ctaggcggcg 78601 gcagctgctg cggcgactgc aaaggccgat ttggagtgct ggagcgaaga agagcaaaag 78661 ctgcgttctg cgcgcgcccg actccgctgc ccgccccgcc aggcctccgg gaggtggggg 78721 ctgggaggcg tcccccgctc ccgccccctc cccaccgttc aatgaaagat gaactggcga 78781 gaggtgagaa gggaagaggg ctcccggctc tctcggggcg ggaatcagtg ggccagagct 78841 cgccgggtgg ccgcaagtac gccggcccag cccgcagcgc gcccagccgg aaggcgggga 78901 atccggctga caccgcgccc cgggttccca ggccacctcc tctgttctga ggctgggctg 78961 ggagaccgtg gggctgtgag gagcgcatag aaccgtggtg gagggcgagg ctgggccacc 79021 ggctcttcaa gctcggaatg gagggggaag agcgcagagg gctggctggg aggaactcgg 79081 gtgggcgtga aggagacgag ggcaagaaaa gaaacttccc ttcttccagg agggtcttcg 79141 aaaccctctc cccacagccc ctctcgtcat tagcatggca atgaggagtt tctgtaattc 79201 gacttggagg ggcggatgag ccctggaaac tcagagctcg ccggaaaagg ccgggggcgg 79261 ccgggctctt cttccccacc ttccctctct cgtcgctctc cgcccctttc tctttcccac 79321 tcagttttgc accgggagcc ctccgggatg cggagctact cgaccgccgg atttttaggg 79381 gtaggaggcg ggggagagag atgacgctgg cggacgtggc cagcgcgggg gccgggcggt 79441 gcgctgcagg ccatctgccg gcgccctgag acccaggagc ctccgcgctc ccgcgtgggc 79501 ctcacagggc cggtccacag ctccaacata gtagctgaac tcccttcgtt gcgttcctct 79561 ttttctggag gggaatgtta gaagagagag agagcttcct tttataacct tcctcattct 79621 gctgcacgtc tagagtgggt gtgggggctg gcaggtggga ggggcggtgg acaaatggct 79681 gatggtggac gggacacttt accccaacga cacctcctcc cctttccaac tggctgtgta 79741 gttgcttatg agaaccttca agtccttccc tagagagaca catgcaaatc tgagcctcat 79801 cccaggccag gggtcctgtt cctcatcacc ctacttccct gaggctgctg aggtcgttaa 79861 attgttgttt actattaggt ttcacgtcaa ccctgggctt gtagagagaa aaagccaaac 79921 ggagaccaag aattgatgca gtcttgggta ggagaaatcg agagcttgtc caggaagctt 79981 tgctgtataa attataagca atgctgtata aattttactc caaccatgtg tacaatgttg 80041 gaatcagatg aaatttatag tgaatgaatg tatgtgggtt tggggtttcc actcctcttc 80101 agcctttcct cccgttagaa caaggaaggt tttttttttt tcaaggaagg tacatttcaa 80161 atatgttagt caccctttca gtcttctgta ttctgttctc cacgtaccga agttccccca 80221 aacctgtcct ctcaagagaa aaaacccatg ctgactctgg actccctcag taacaatgaa 80281 acattctccc caaacatttc ctttcagaat agtatttgtg actttgatcc atcccaagca 80341 tggttaagag ggagcacggg caggaaaggc ccactttctg gggttgggca gccacccctg 80401 ccccagtttc ggctcctggg aatcctccga ctggagaagg ggaaaggcaa ggcagtcctc 80461 ctggaggcgg cttccttggg agcaccagct tccagcggcg gggagagaag gagctcctgt 80521 gggagagggg gcaggatgtg agtaggtcgg tgctggctat gcgagcaatc ctccctccaa 80581 gcctgagcaa gtcggtacat tttcccccgc tgcctcattc ctgtaccttg gttgccctcc 80641 tcagcctggg tttgcaggac cccctcggct gcagggcgcc tgccacaaag ccgaccccgg 80701 caggagccac tctctctgct agttcgctgc ctcggccctg ctctccctca gcctctcttc 80761 ttctctcctg gcctcttttc tggggcatcc tgggtggagt gtttttcttg ggatcacgag 80821 cttgcactcg cacacaggcc cgcagacaca caggcccggc ggccgccctt ctccgcttac 80881 tgttcccggc tccccgcagg ccgggtgctc gcagccgggc tggctatgcc tcgcctggca 80941 gcccagagcg ccgctccccg ggaacagcac acaaaggcag cctcccctgg cctctagccc 81001 ttaggcttct gtagctcagt tctttcccca cacccctccc ccaagaaatt ctgggggccg 81061 ttccaccgag taggagatcc ttggccctct aggcaagtag gtcagcgccc caagactgga 81121 gctggtctct ttcaacgcct tgggagactg ggtgaaaggc gagcttggtt acgcttaaaa 81181 tgatcgccta caagcggttc tcttggctca aaacgcctct ttcagggctc ttatgctaga 81241 aaggaaagga ataaggagga gataaaatga cgccgaggcc ctgaactgtt catggcatcc 81301 gcggctcagc caagctgttg ttttaaaaga gcaataaaaa tgaattatga ctaaacgcct 81361 tctaacttaa tgctttcgga cggggatccc cggcaaataa cgtaagagga tttttatttg 81421 tgcatgtgtt cctgcaattg atctctttga tgacattctc attcatagaa agcgtttgat 81481 ttatgagcgt aggacgaatc gcatccagga gctgcgcagc cctggccgct gccgggacgc 81541 cctgctccgc gctgagcttg gggccagaaa ccagccatag tccccacact ccgccgccgc 81601 agctgagatt tagcggagga aggggcgagg gaaggtaggg agcaaaccta tgaagaaaca 81661 tcgcgttgtc attggaactt ccaagccttt gctgttaaga gccaggttct taaatcaacc 81721 cgccccacac acatgttgct tacatgctgc gttttctcac ggtaagtaag taggcatcag 81781 aagagagctc agaaagggaa aaataaacat ggggagacct aacaagggag gggggaaaag 81841 caacaaccca gtgacacacc ctagatcaca ttttattgat atttcattta aagtgctttt 81901 ctctctctct agaacctgcg ttaatttata acctagggtg tagaaacaca ccattttaaa 81961 tctcaacttg ctaacctgac ttcgaagcat taacgatcta ttattagcaa tgtaaacagg 82021 tttaagaatg aaatattgat ttttctgacc ctgaaaaaaa aaaaaaaatc ctagagtgac 82081 tgaaccagag gattggaaag aacattttct agggtgcatt ttccagatag taatcactgt 82141 tttgttttat ttttgttttt ctctatcttt taggtctgtt ttgcctgaac ccatcaacag 82201 ctgggagatt aatcaaccac actgaaaatg tggagggatt tatgggggag ggggttgaaa 82261 tgtgggtgtt tgaaacaaaa gtgtataaac aaatgaattg ttgataactt agttattgac 82321 ctggagactg gtagcttatt aaagaaactc cgtgttactc attcctggag ttgggggttt 82381 ctgtaggcac tttatttctc cactttcaag agcttgggct tggcccaaat cttagactgt 82441 ccaattctgc ctctattacc aatttaaatc tatggcttga acctgtgcac tgaaaatcaa 82501 atcctttaaa aagaaagagg agaagaagaa gcaaaaaaga aagaaaaaac acttattaga 82561 agccctagtc attttttggc tttctgtttt gttgctgtcc attgaagact ttgaacatgc 82621 cgccttaata aatgtattaa aattgaaaaa agaggataaa ttgtgacgaa ttttatttac 82681 gaagttagac taaaagaaga aaaagaaccc gttatgagcc ttgaaaaaag agggggagaa 82741 aaataagcta cttatagcaa aggagaattt attctacaaa aaatatgcat gacaatgcat 82801 cccaatgtaa tacaaaaata aaaagaaagt gaagatacaa ttatatgatc actttcttgc 82861 aggcctcata ctgctctcag gaatcacata aagggttgtt gagttaggtt taaaacaact 82921 aaaactccaa aataatctgt aaaagatggc tctgtttgaa actgggtcag ggaatcacta 82981 aacagaaaat cctcaacact taaaggaggg aaggggtagg tcaagaagaa aataaaaaat 83041 aaactcccaa ataaaagaag gcaaaaccac ctggtcaaag gagtttttgt ttggtgatgc 83101 tttgttttgc tttaatgttt ttagtaattc agatgctgca agtcgattgt ggtgagtgtg 83161 tctgtaaaaa agtctaagct gtcagctgaa atatctacgg gactgtcgag ggaacctggc 83221 aaactgggtg aaactgcatc tgaaagctgc aggcaggaat ctgtggagaa aacgctaaag 83281 tcctgcaaag aggggacctc aagggcctca ggactgtcat tgtttaggcc agctccacag 83341 ttctggccca ttgttgacaa gcagttggga acagtgggtg actggtgctg aaaatgtttc 83401 agatttttct cattgctggt taaaggcgag actgggaaac tttgggagtc gccattgtgt 83461 ccattgggag cctgctgctg agagagggca ttttgctgaa aagtgtagcc ttccctctcc 83521 agaagggccc cagagacgct aagggcttgc tcaaagagcg tcttctcttc ctcgtcctcc 83581 tctactttct cggagtcctc aaggctttta catttccctt cgctgttttg gttttccttg 83641 cactgggtct gcctcttgtg cttcatcctc cggttctgaa accacacttt cacttgtctc 83701 tcagtcaaat ccagcagcgc tgcaatctcc acccttcggg gtctgcaaag gtacttgttg 83761 aaatgaaatt ctttttccag ctctagaagc tgtgtgttgg tgtaagcagt tctcaggcgc 83821 cgcgatcccc cgccgctgcc atcggcgatt tccagggatt ctgcggaaag ggaaaccaac 83881 aagagacaca cgcacagttg gaggtggagg ggtccgagcg gggttattcc actggagaat 83941 aaatatagca gaaaagatca actgcaacaa aatggccgcc cctggatgca gtagagctat 84001 tgtgctgcct ttcctgggag cccagcctgg gcagaccccg tcccctccat ctctatcgaa 84061 ttcatgcctg tggctccccc caacctcttc atccgggagc aaactttata ttagccacaa 84121 cacaatttat aattaatgcg tcagctgctt agctgagcaa gagcgatcta tcactcttca 84181 ttactgtcaa aaagccaaac tctaggacaa ctagacagga ggaggtcagt tccaactcaa 84241 ataaatcatc ccacattaca caagttcggg aaagtttccc cccacttcct aaaaatatat 84301 atgtctcatt gtagagcgca ggatcccctt tcctctccat caaacccact cctgaaccca 84361 caggggcagg gacaaggcaa ccaagcatct ctccctctcc tccttctctc ccagcccact 84421 ctcccctccc cccacagcca ctctcggggc agcccggaga aggaagaggg tcccagagac 84481 ctggggccaa gtctttggac tgacctttgt ggctgaggca agcagggccg gtggctgcgg 84541 cggtggcggc ggcggcggcg gccggcagaa gtgcggtttt cttggccgcc ttcttctcct 84601 tcatccaggg gtactcgggc ggctgcaggg cgccggcggg caccgggctg ccgcggctgc 84661 ccgcggggct cggcttgggg cggccgccag cgccgtggcg agggtgactg ccggggttca 84721 ggctgggaat ggtctgctca aaaggaggag gaatcagtgt cgagtgtgaa agcgtcgagg 84781 tcttgattga tgaactttga aatgtatcag cgacaggggg aaaagatgtc aggcactcag 84841 cgagcgacgg ctggctattg ataaaaccaa tctctcgctc aaattcgtaa ttcatggcct 84901 tctccttgga gccccctcag agaaaaagtt ccctcttttg gaggggcttt gggggggcaa 84961 ggcctaggaa aaaggcgagc gcagaggaaa aaaaatctat catagaatat cgctgctagg 85021 gtgttttttt tctaattcac tgattacagc cgtatgggga ccgcgctact attaaactat 85081 tgaattcatg gagacaaggt tgaaattgga ccgaattggc tgtcacatga ttgcttctgc 85141 ccaatgacaa tttgggcttt aatcaaaaga agccactgtc tgtttgattg atccaaaaaa 85201 gtcgggaagg aacgcctcat tgggggccag cgaggcttta tttacacttt tttcagagca 85261 aaaatacata tatgttggtg tgggtgggga tgtcccggag tacgtggggg cgagggtgcc 85321 tgcgtgcctc ctgatctgca aggatctatc tagtgtgtgt gcctgggagt gtgtgtgtgt 85381 gtgtgtgaat gtgcgcgagt gtaagccctg ctgtcggtcc cgccggtggc tgccctctgc 85441 ctcccccgca cactccgcgc attgtttggg actgtcggga agacgcctcg cacctcacaa 85501 atcatttaag cacctcagtc tgacgcctgc agtcattaac aaagtaatcc attaatcttc 85561 aaagttttga cacccgaggg ccctgcgtcc cagccacata agttctgtta aagcaggaga 85621 aaggagcaga ggaagagagg agatgagaga gggagaataa agagagaggg aagaagagag 85681 agtttgagag atggagaaag agaagacaga aaagagagaa gaaaggaaag attttggttg 85741 ggaaggggtc ttccttttct ttcccttttc ctccttcact tttcctaaaa gcagttcctg 85801 gatctcaaaa ctgatctctc tcggtctgtt ttccttcctg tctgtccctt tccctctttt 85861 tcctcttctc cccctttctc cctccctctc tcttctgtcc gcctcccctt ctcccctcag 85921 cctctcgccc ccctcccagt gtccagccca gagtctgcgc cccgggccca ttgttagcag 85981 gctattccac ggcagctttg catctggctc cggcgggaag cggaaaacgg gaggcggctc 86041 tcgaagcttc ccgaccttcc tgcgccatca acttctcagg agtggctgga gaaagatgca 86101 tgtgccaagc gagacaacaa ccccaggtcc atgtgtccaa atccccgtag ccagggcggc 86161 ggccagccaa agaaatgccg ccccgagcag gcgcgtgcgg ctcctggcat tctgggtttc 86221 atacccgtag ggctcgggtg cggtgagtat ttccgatttc caggaagtct gttggaagtt 86281 accagcaggg aaagagaaga tgcttgctcc tctctttccc tccctctctc ttttttttct 86341 acccttcttc ttgtccttgt cgctctggtt gggggctggt tggtttagat acagcgagtg 86401 ctccctggca gcctgaactg cctcccacac cttcctgatc cttgggcacc ggggatgttt 86461 gcttccgggt tctgttgtgg gaaacagcac tgcgggggag aggaggcgac ccaaggagga 86521 attataaagg ttttcgttct ccttcattat cttttttcca attcggagac ttgagactga 86581 gcgcatcttg gacagtgcta gagggttcaa aagataacac ctttaggtac caaaaaataa 86641 tttttaaaaa atccaccaag ccagaaagag gtgaggaggc tgagaggtag atctggaggt 86701 acctctttcc cccccaaacc ctgaggatgg tgattcgctt ttgcttcctt tgaaatagac 86761 taactgatca gactccccca ggccggcagg catcaatggc aggaaaataa atcattctga 86821 tggtcaggga gagagaggtt ggagggggag aaaaatctac gtagaaaaca agaaaggcca 86881 accaaggagc aaaggcagga ccccatctct gaggccagcg tcaaagaggg tttggggtgg 86941 aaattagaca taagtggcct atatcattga ccagcccaga gcattgcctt gcagatagag 87001 atgctatttt ggaaaatgtg aggtttttta gttttcttgc ttttattcca aacaaagctc 87061 caaaggaagt ctgggcgcga tcaatcttgc tcaccaaaag ccttgacagc ttctagccct 87121 taagaacaca tttcagcttc cagctccttc caagataaaa tgtggccgag caaagaggag 87181 agcaggaaat gcaagtaaat aaggaaaact agtcttttaa aaaatatatt ttggcagtga 87241 aaagggttat agggctcctt ttaggaaggg tgctttgcga accctgggat ggcagctttt 87301 ggcatctgct tttaccagaa aatattacag gcttaaggga acgtgggcgg ggggtgcaga 87361 gagggagcag cgctgccccc atatctctga ggctttgtca ggggctgcgt gggtgagagc 87421 tttctgctgc ctcagccgcc ttaattcaaa taccatctgg aagaaaaccc ttttctttga 87481 ccactcaaca ctgaaatctt ccccaaatct attactttaa cttatgagat attttaataa 87541 ggatttcttt ttaatatcta ggaaactggg gggtgaagtc aatacttttt tctttgaaag 87601 ccaaaatgaa tgtcttgact ggtaagatgc gggaaatcac attagaatct cattttaaaa 87661 tcttagccct gggggaagga ggaagcctcc aaaccacccc tccctcagct aaagagtaga 87721 tttcagaaac gtgttcaaat agtgatatcc ctgaaggatc ttcatcctta ggtgaatctc 87781 tcccctagct gttccttatt tttccaggta attttacacg tgtctctaag ctgattactt 87841 aaaagaagac tcctgtatct gcttagcgtg acgaagaggt ggaagctcta gtcttcacaa 87901 agcctggttt tgcagcctct gcatttctga ggtgcgccat attttggcgc agaggttgag 87961 gtgcggagag gaaatctatt tctcccatag cccaaaccct ttaaaaagga tatatatatc 88021 agcagcctta ggacagatat ggggaaagca cctctgggtg tgagagaaac actgagctct 88081 ccacagcagc ctggcttggc atcctccgag ccttttcctt aaaccatgag gagtttctct 88141 aaaagcagct cgttgggacc cagcaggtac ccagaaatgg ctccagtgcc tgcttctaag 88201 gataatctga gggagcttca cagggggaag ataaaggcca tgatgatgga tattttggat 88261 tgcaaacaaa atggcgacat tttccacttc tgtctctgaa agatacaatc cagggagaat 88321 agggatttat ttctcacatc cgatgctcct gatccccaga cgaaagcagt tcctcgcttc 88381 tagctctgaa gggggtgatg atggggatgc aggtctgagg tctgcatttt ctgaatttga 88441 ttcctttctc gaggaatcct taaatctgaa gatgaaaata tttcccttta ctgtgccttc 88501 ttttgcctct tctagccagt cagtaagcag ccctatctta aatccaggag tggaaatgta 88561 atatttagct cctacaacat caaatgaaaa atttataaaa ttgcagtttt attacaaaga 88621 taccgatctc cttgctctct ccccctcccc ctggggccat aaaaagaaag ctgggtgggg 88681 gaaggagagg aaataaattc agagaatgcc aaatactaca ttcagcagga agctaatgct 88741 gggattccgg cagtccaggg aagggctggc tcagggtgac tctccccagt tcagtcctcc 88801 gtttgctgga gacctgggtg agcctcagtc cctttcaaac tgtttcgctc cgcgtgcaga 88861 tttttggagc aattctttcc tcctgacgcg ataacagacc gcgaagcatc cccggaggag 88921 accaggagtc cgattgctcc cggaaagtgc tcggcatcct ctgcgcggct ggctaatacc 88981 gaggtttagg cgttagttgg tccgctttga cccgaggaag agcatggagc agaagaccgc 89041 cctgcgttat cccccggctg ggccgcgggg cacctgtgcg ctcccgctgc gccttgggcc 89101 gggatgtctt tgtttcttcc gtgcacagca ctcggcagcc cctgcgccct cagcctgcct 89161 ccgcagcagc tgggaaccgg gcagcccagc tggcccgccg cacggcgtta ccagagctgc 89221 cgcagccgcc tggagcccca gccgccatcc tgagaattcg ccgagcttcc aggcgaatag 89281 cctggccagg ccccatgctg gagccagagg gctgctgcgt ccggcctgct ctgccggctc 89341 gcctcggcct gaattccacg cagacccacg tcttcggcca aagaacccct tcttaacacg 89401 gatattattc agaataaaaa ctttatttct ttttgtttct cagaatgtga gcagcaagga 89461 atgaagaact ctcaaaacaa atctagtatt tttcgcgttt acagattttt tttttcagtt 89521 ttgccggatg ttacaaacct aaatcactgt tttcaaaagc tggatcttct ctggttttat 89581 tgtatcgtta ccgtttaaag gaattatatt tctctttcaa ataaaaaaaa ttctaagtac 89641 gccaatacac aattgaacaa aaggcccaag aacccttctg aaccatcagg gtacagaatt 89701 ctttaatata aatacgaacg gatggggata aataatattt aaaacttaga ttattcaatg 89761 cttgcaaaaa aatataaata aatctgactg ttcaccagca tacacacacg gaaagacgta 89821 cacttagtca tccttgcaca gagagcccct gtttcaaagc gcctttgatt ttttttctca 89881 ctctttacaa gaagtgcaat taaaaaaaaa atttaaaatt aaaaaaagta atcgcttccc 89941 ctcgggagcc ataatgtaag aacaaattca cattgaaaac tcgactagaa aatttgttca 90001 tataccctgt ttctgatcaa agagtggaga aggtaaaggg tgcagggcca gtggcctatc 90061 gaggagcagg aagagataaa tatcgctatg atacagccat tccagcaacc aagattgcta 90121 cgtcacataa actataaaaa cgccttacca acgagggggg aaaccgggaa acggagtgcg 90181 gggcggagaa gagagaaaag gaaggaagga aagggcagga agaacctaaa aaaaaaaaaa 90241 aaaaaaagca accaaagaaa aaaggtgggt ggggggagac tctcctggcg cgtagcccca 90301 agcccactat cacaggtggg tgagcttggg tgcttcctga attcttccct gagaaggatg 90361 gtggccggta aggtccgtgt aggtggggtg cggctcccca ggccccggcc cgtggtggtg 90421 gccgctgccc agcggcccgg cacccccata gtccatggcg cccgaggcag cgtgggggag 90481 gtgagttaga ccaaagaggg ctggcccgga gttgctcatg ggctccacat agctgccccc 90541 cacgaagacg gggcttccct gtatgtgtgg ggtcccatag ctgccgttgc cctgcaggcc 90601 atgagcgtgc gggtcatagt cgggggtgcc ccctgcgccc gcccctgccg ccgtgtagcg 90661 cttctgtggg ggtggcgggg gtgcgcagct gggcagggac gcagggtagg aggcgggggg 90721 cagcccgtag gtaccctggg ggggcttgga gaagggcggg ggcgactggg gctcatacgg 90781 gacgctgttg accagcgaat gcatagagtt cagatagcca ccggctccgg ggggcacggg 90841 gctgcgactt ggagactggc cccccgatga cgttagcatg cccttgccct tctgatcctt 90901 tttgtacttc atgcggcgat tctggaacca gatcttgatc tggcgctcag tgaggttcag 90961 cagattggcc atctccaccc ggcgcggccg gcacaggtag cggttgaagt ggaactcttt 91021 ctccagctcc accagctgcg cgctcgtgta ggccgtgcgc gcgcgcttgg acgaagcctg 91081 ccccggcggg ctcttgtcgc cagcgcagct ttcgcctgcg aggacagaga gaggaagagc 91141 ggcgtcaggg gctgccgcgg ccccgcccag cccctgaccc agcccggccc ctccttccac 91201 caggccccaa aggttcctgc atccgtcagg tcccagagag aagtaggaac taggtgctat 91261 cctctctcct ggtgggtaaa acagtgatac tccctcattt agccaaggag caaatcacag 91321 ccttctcggg ggaaaggaca gagaagtaga acccccggtt tgccttcctg gtctggattt 91381 tctgtttaga gctattaact ttctgactca tttactggtc cctttggcct atcggacaac 91441 agcctccctg tgggccggtc tcagcatctc cattctcttc acagtaactg aaatgtggga 91501 aacctaatgt acacactcat ttccctccca cactgtctga caataattag gcagtagctt 91561 caactggcca cttttttctc tctacccaga ggagatgggc agactggaaa cctcaaactg 91621 gaaggccttc tctgcccaaa gaagagatcc cattgaaggt atttctggtg gggaggccta 91681 ggcccttcat gctaggtcct attctttagg gttccctatg tcatcaagca gccagcccca 91741 gcttttgtcc actaatcttg tacccttctg tcttccctgg ccccccagat cctagctctg 91801 gaagtcccag atgctcaggc tcagagcagg gcactggcaa ctggagaagg caggggcaaa 91861 ggtaaccaaa atgctaggaa gcctaaagcc tggggccttt cctctactgg agcccgcttt 91921 caggaggccc atgtggttcc cattaggatg ataagtctgc atccccctcc cccaggagat 91981 ctttggctgc tttcattgta ctaagtctga gtagggtctt cccagaggta aggcagccac 92041 tccatctgct ggagcaaaaa ttggctctga ttataggcaa agcattgatt ctgtcttgga 92101 gggtgattag gcctgttttt attctgccac atttcagtaa ggggaggatg tcactgtttc 92161 attagaagag ctgtgctaaa aagatagaca gtggagctca ggttgtttct gcacagaggt 92221 cctaactcca agagtggagc aagaagagat tccaactgca ctagacccta cttgcccttg 92281 gagaagtcca gagggactcc cagcaggtct ctggtgtttt ccagcctaca gaggcctatc 92341 acctccctaa gttgcaaaga ctatgaaaac ctttttggtg ggcagtggtg tgggagcagt 92401 gaattccaga catgctccat cgctcctagg ctgtgctgga tgtcgtgggc agtagcttgg 92461 ggaccaggag ccaggccaga ggaccctctt ccagaaggca ctcctttacc tgagctggag 92521 ctgctggttt tctgctttgt gttttgtcga gactctttca tccaggggaa gatttgtttg 92581 gccactgtgg gtgagttgag cagggggctc ttggccgcgt tggcaggggt agggttgttg 92641 ctggcattct gaggagggga ggcagaagag ggaggcgggg gcgcggcagg ggtaggtgca 92701 gggggctgag gtgcgggctg aggcggctgt ggggcagggg gcgcggcctg gggcggcggc 92761 gggtgcaggg gcggctctcc caggcttgga ggctggctag gtggggcgct cagggtgcgc 92821 aggcacgcct cactcagttc gtgtgccttg gggtggcccc cggcgctgga gggagactgg 92881 agggagcagg cgggtcggtg gtactcgccg tcggcgccca aagcggcgga cgccgggtac 92941 ggctgctgat tggcattata agcgaacccg ttggctgcct ggtaggggta gccaccgtag 93001 atcgccgagc tgtcgtagta ggtcgctttt tgcatcgcgt tgtttcacga tcttgatcgc 93061 acactctgac aggggtttga cacccgtgag ggcgcacatt ggcacgcccc cgcggtcacg 93121 tgacactccg ccgccaatgg ccgccccgcg cagacctggt ggggcgagaa gcgcagcgcg 93181 gtgagggctc cgcgcaaatc catcttactc tcaatagcta agtgacatga aagccataaa 93241 agaaaaagtg gtcagcaata tttagcagca cgacttggcc ccgggcgcag ggagccgtgc 93301 tataaaaaac cgctggaatt tactggcagc tacaaatatt tgcttaactt gcgtctggag 93361 ttgggggatt ttccggggag aaggagaatg agtgagggct gcaagctgat tctcaggagc 93421 cgggatccaa aaggagaaag gcttgatagg ctagaaagga aaaaggctgg gatctttctt 93481 ttccagggaa gaagaaactt ggggtgtcgc ttagtttctg ctccttggcc tcctccagag 93541 ggcccaagac tcctccactc tgggaatgtt gggaagggaa cgaggaggca aaggggagct 93601 tgggtcgcca atgttttctc cgctttagga ctgatgtttg ccaaaagagc cctgagatgg 93661 ggtaacttcc cacccagctc cttcttggac cttcctgctc ccaagagagg tttgcacaaa 93721 aaatttcagg caatttgccc catccaacca tgctggattt cagaagctga gcttgttagg 93781 aagttaatcc accttgttgg ggatatgact cacctcctcc aaatgaaccc cttgtggcca 93841 agccaagggg ggagggaaaa ctttgtgtgg aacacattgc gtgtgtgtgt gtgtgtgtgt 93901 gtgtgtgttt agggtgaggg accaacagta acaccccacc cagcaagtca caactaaaat 93961 cctggagagt tctttactcc tttcctctcc ctgtttctca ggcacttccc agcaagccac 94021 ccccacttct ttacttcttt cccccagtta cagaaggttc ccagaacctc ctgtcttgga 94081 ctttctaagg tgctgttatt cggggcaact atcaaattct acctgttaaa acatgatgga 94141 ttagaggggg aaaaaaaacc cctccaggaa cctgaaaaac cagcgttcat ctccatgacc 94201 acagtttgaa ctctccttcc aacttaacga taataggttc tgtcttagaa actgctatgt 94261 aatttgatgt atgggggtca tttggctccc gacgagggat ggaagcacaa gccataaaat 94321 cctgccagag tttcctgatc tttgtgtgct gttgttgttg ttgatatttt gttacttggc 94381 ctatttacct gcttcagaaa taccaagtaa aggataaccc tgaaaatcta aaaagcagtt 94441 gaaaacctcc tcaaccctct ttcatttaga aagcagtttg caaaagttaa atctgttttc 94501 tttttgtttt caaccactgc atggtcaacc tctagttctc agcacagaaa agttccatgt 94561 gagtttcaaa tattcaccca cacacacact cagaatctta gtccctctga ccactcaaga 94621 aaattataac aaattcaaaa gaaaatggtt aacatgcaat ccaactatgt atattgagtc 94681 ccagagttat tcaagttttc caggaatatt agatgctttg ttaactagaa aaaaaaagat 94741 aatgtattgc atctaccagt agatgattat aattccattg ttgataatgg actcctttag 94801 atttcattat taagccaaca atcttaccaa catctgcaaa acaggcacaa ttaattttcc 94861 agaacctcga ttaaaaatta cagcatcgct tatccagagt tattgtcagt gatcatgagc 94921 cctggagcat tgtttctctt ttggaaatag gaagctacac ataccatggt tgaaagtaaa 94981 taaaggataa agtaaaactt aaaactgaat agcaatttag tgtatccgtt tccttttctt 95041 ttactgaaaa cagttcctat tacatttggc agtgtcaatg tctgccatta caaatttttt 95101 agataccagg cttgtgtaaa ataaaatgtc aggttaagaa aattagacac tcgaacagca 95161 agatattaga taaaacaaaa ttaagctgat tttgcacatg caatatttct agtgacttgc 95221 taatccatta tcacgttata ccagcctgct acttagttaa gcctttttat caagttgtgc 95281 attgtcttac aaaataacaa aaatatcttt gaaaatcaaa gagcatacaa cagccctcgt 95341 aaacttaaaa tataaattcg attctgggat cccaccaaaa tccttcctat tgttcgaggt 95401 cagcagacca ccaccttgga gctcatacat ccccccaccc caacccagcc cccaaagttt 95461 gcaacctagt aaagaagttg gagatttgcc agccaggaaa ctttgctctg taccgccaag 95521 caagggctgg ggatgagggg agattcctcc acaaagatct gtttctggac aataatcccc 95581 aggtgcaaga gccgaaagag agttgaagag gctaaattga gtgcaagaag caagccctat 95641 tgtctctctg gaagatgtgc ccaaattcaa cctccgtcct tcagacatcg ctccctcttc 95701 ccattccctc ctctttctta gccttccctc accccaaatc tgagcgggtt cgtcggatag 95761 gagccccgag tcccctgccc acctctcctt ccctctgcaa cccagaaaag ttaaggctgg 95821 gctaggggag aggcaagagg tgggggcggg gatgggaaag cggcctgaac tcgaagtgta 95881 agggtatcag tcactgccgg gaaggaagct ccggcttgtg aactcttctt gaaccgagat 95941 ttaagtctgc agccataaat caccgagacg taaactccgc cattcagcgc cgggagcggc 96001 cggttgcggc cctgcttacc ttaacttctg acgagcgcag gctcgggctc aggctccgac 96061 acgggctctg gcccccggcc tggcttggcg gcccggccgg gtcccctcgg cgcgggatgg 96121 gcacggaccc tcagcatcag ccgagatggc aggcgggctg gagaccccga ggcccggtca 96181 ccgccgcgca gagcgtggcc gcctcgctgc tccagcacgg ctcgcccagg gccgactggg 96241 gcggcttgga cccccctccc acccacccca cccacccacc cctcccccac acccccgccc 96301 cctgcccttc ccgagggcag cagccgcggc ccggagataa gcgctagtgc ggcgccgggc 96361 tctgcctggg ctagtggcac cgacttgggt atgtttctta tgaatattac acgcggagca 96421 gcgtctggtc cgggggtgcg gtggggggtg ttggggcggg cgggagggga gaccaaggcg 96481 gctggggaag cgcgggctgg tgtgggggat ggggctgagg cgaggggagg gaccggaggg 96541 agaaggggag ctcggcggtg gcgggaggag ggcagcctcc ccccaccccc caacgccctc 96601 cctctcctgc tcccaggcgt cacgaaagtc caggaaaatt atgctaatga ggacaaacgg 96661 ctctcacaaa ggggctctct tgctgggctc tatttcttag gaattcgttt gagaaacttc 96721 gtattcctct gccctggaca cttccaattg tggggtgagg ggaaagagaa ccaagaaaag 96781 tgagtttccc agactttgcc cagtttactt cacctgcaac tcccaaacag atctttgggt 96841 gggggtgtgg ggagctggga taaggctttt ctctagagac cctcaagagc caactgggct 96901 ctctttggtt aaattttcag gcagaaactg ctgacatcct agcagccagg agaaaaatgt 96961 cctgggagag acaggtactt gttccagcag tgaaatggga tgctctgggc tcaggtacaa 97021 acaagggtgt ttagaaacgc ggatctccag ggctcatgtg cccatttcag agtgcagaca 97081 cgcagctcac ccacagggaa gacagagaaa gctctgaccc caagagaaac cgacatcata 97141 cacacacaca cacacacaca cacgcagggc agaggaatat gtggtttctc aaacggattc 97201 ctaaaacata catatatata taatatatat tatatatata atatatatta tatatataat 97261 atatattata tatatgtata tacatatttt cctcattagt gggatggagt aacgctgagc 97321 tgggcaaaag aaaggtccga atttgcctgg taatattgcc acaacttgaa agtggatgat 97381 tttcctccta aagcctcctt tgctcccagc ttcagaagca aactctgcaa gggtacgagt 97441 gtggcaaaaa gagaaatgga gatttctctc ttttgctgca atcccaaggc gtcaacggcg 97501 ggtccttctc aaaggaaggt gttaaaaata taagtagtat gtttaaactg tcaaggagcc 97561 aaagtgtcac cttagagcaa aaacaaagcc aggggaaaag ggagaaatca aaaagcactc 97621 cgggccaggg tggcctcatg cataccaatg gtttttgtca tttatggctg ggcaagattt 97681 atgactcggc gccccaaagc tgtaaacaga gcacaaaaca gcaaacactt gctccttatg 97741 gcatattgca gagcaacttg aaagggggag ctggccgcgt aggaggtgag agccagccgc 97801 aaaaatccgt tcggcggcat tttctcttca aacccgcccg gtcggatgtg gcatttgaag 97861 aaagaaggcg tgggtcaata atcaacccgc actttctcct ccaaactgct gacgcgactc 97921 tcaccgcctt ctcagctaag ccagctgccg ggaaggccag ctctccgtaa gaacctgcca 97981 ggggaaggga ttctctgggc caaggggacc gagggctccc gagcaagggc tggggcgccc 98041 tgaaagcata cctttccact ggcgccaaga ccctgccggg caaaacgcgg ggaccagacc 98101 tgaggagccg tgggagccca gggagatctg gggcgggagc ttcgttccgg ccagggcctc 98161 tatccgcccc gatacgtggc tcctgggccg cgggcctcag cgggcagaag tggggctccg 98221 acctcaagat gaaggtccag cgtcctggca tgcagatggt cggtgcccca gcctcaagcc 98281 gctctgctta aatccaaagg actggctttg gagcaagcgt aggcgagatt attttaaaat 98341 aataatttat gctggcggaa ggtttttatt tgttggtttg ctattggtgt tttgtttttt 98401 aagcgaaaat gccgattgtt gatttccacc tccagaaatt accagccgcc gccgctgcgc 98461 cctttctaga caaatcccac agcagttcac ccggctcccc caaaaccgca gagcttttct 98521 ttactgagag gaccccttgg ttccaccaat tcccccaaat ccccgtggct cccgatctct 98581 catccagggg gactagaaga aaagattttc tcagcactag aagaatactt ggagcagcaa 98641 aggtgcccgt ggtgtctttc taccattcaa ggggtgtcct cctgaaggcg tctgagaggg 98701 cctctggcgg ttccaaggct gtgcagccgg ggaggaaggt gaggaatatc gtgggtaaga 98761 agaggcacca aaatcacaga atagagagga aaattattga ggaaatgtac ttgcctgtga 98821 gggtgaagct ggcatttcga ggtagaaggg gactagcctt ttttgatatc taatagttta 98881 gcattatacg taatggactc actttagcac tttcatggga ggaaaaacca atctgagctc 98941 tataaatgtg gctcttggct tctcactgtg catgctttta tgtagaacca gagtccatcc 99001 ttagatattt attcagcttg ctggcactga agcaacaata aaaagatgcc tctaacaaac 99061 ctctattccc tctctcattc tttctagcaa aactatatta tttatgagtc tttaactggg 99121 tcaaaaatgc gccagaacta ggtcataaat catatgaaat gctgacaagt aattgaacag 99181 caattaaagc ttacatttta tttccaaggg gtgttttaga gcacttgaaa gggcctcagt 99241 taaacacttg gttactgccg aggccggcac tggggagggc agaagggccc tttcgttctg 99301 gggtagtaaa tcccactcaa gtaggggatg cctgatttga tttttttttc acgtacagaa 99361 agaaataatc tatattttaa gaagtgaaag gggcacatcc gacctctgac tccttataag 99421 catccctttc cctactgcaa ggagcataat agagtttcct gcacctttcc agggagttcc 99481 cacccttccc aggagtgtgg ggtccccaaa gcccaattag ctggggcttt cccaataatc 99541 tggaaacatg cagtttaact ttcatctgcc tccaccctgg ctacagctct gttttggagg 99601 ctctgaagct ggagagatgt ctgcctctgc ttcctgagga ctcttgtctc tggacttccc 99661 tccaacccag ggcccactgg ccagttctgg aactctccaa gccagctggg ggtacacaag 99721 gaatgcccag accaggcgca ctgatgaagt ccaaacctaa tcctggctcc ctccttcctc 99781 tgtctcccga atgttagaat aaggagcccc tgggcatcca gcatgctcac cccaccccaa 99841 ggctctccta ggtctgagct gtctttcttc aaccctctgg ggcagagggt gctcagggaa 99901 acaatggcgg gccctccggt tcagatcctt gggtctggct tcttgggtcc ggcttgtggg 99961 gaagcatgct ctgcaatctg gcactaaaca aagtgcctgg aagcatcatc aataaaatgg 100021 cagcgcatct gcacatttcc ctcatcgatt tccctcttcc agctgctccg gctgctcagg 100081 tcagccccta atcctccatc ccctgcctga aaacaccgcc atcctgggag agattcccct 100141 tctcacccac ccatccccca cccctcccag gcaatctagc cctgcaagcc ggtccacctc 100201 tgcctgcacc tggcttctgg accctctttc cgcctgggga tcagctcctg agactccagc 100261 cctgccctgc tgcagtggac aaagccaccc ttgcactggg ggccactggg cccacctagg 100321 agccccccta gtgccccctc ctttacaccc ccaacacttt ccagggaatt aaattcagag 100381 tctgaggcgc tactgggaat ctttcaaatg gtttcggttc taaaagagaa acaaaggaag 100441 agagaggctg agggtcctgt ccatgagcca gggtctcctc gcctgacctt tgtctctcac 100501 agccataact cactctagat atgaacaatc ggcggaaata ccttaaaata aattccatcc 100561 tcctggtata gcctcgactt ccatcggcta gctcgcccag aaccgctcca tgagtgctga 100621 aaactgcaat cggctcagag agcgctctcc ggagaagggg aaggagaagg aggaggtggg 100681 tgcgtggggg ttgggggaaa gctctggctc tgtagagaat ctagaaaacg taatcacagg 100741 cgaagcctgt ctgggtctcc gctccaaatg ccacaataat gcgctgtatt atgtgctcac 100801 gtggtcatac gtcaacttaa atccttttat ttgaaatagg gaaccactga aggaactagg 100861 tttccaaaca gaatatttag ctttggggga gggtaggtgg ggaaccttga ccagcagttt 100921 tgagtttaag agttctgtct aagcaatagg atgcttggag ccatcctccc atgaatgcca 100981 tataagacat acaagcttat ctccacttat atttctttac acaccctcca aacacaaggg 101041 ttacagcaat gctattcaca tttttagaat ccatttaact ttccttacag cctcccaccc 101101 acggacaagc tggatcctgc acacccatgc accccagtct cctctggccc acaggccagt 101161 atatcaaaag gcaaggggca ggcccagaaa taactggcct ggacagcgga cttcctcaca 101221 gcctgccatt gtatggagtt tgctaacacc cacaccatac actacagacg caagaaacac 101281 caaaaaggca cataattacc tgacaaaagg ttaaagcatc tcctccaagt ccagttggtt 101341 ttgttcagtg ggagcccact aggaaagatg tgcttggcta tagagtggtg ggtccgggtt 101401 ttgcaaaatc tagagaaacc tagtgccctg actaaacttg cacttaagca tttcctcagt 101461 gcacaactga gtgatacatg gaattataag tttctaaaca taaaaaatat cagaggaaat 101521 gatctaattg ctctgcaaga gacccccccc cccaaaaaaa ataagaaaaa aaagaaaaag 101581 aatcaacctt cactggagtg ccttaagtta acaaggcaat ctgggaccct ctccacaggc 101641 aggtctgtcc acccactttt ccaagacaat tctgtgtcca ccagcaaatt ctctccccac 101701 ccccacacaa gttcctatag agatatcaac tcaaagatga gaaagagccc ccttcagaac 101761 tctcaaaata atcttgagca gaaaaccagt agagaaccaa ctgtgctcag gaagaaccta 101821 actgaaacaa ataagctgct cccacccttg gttccacccc tactcctgcc ttctgcctga 101881 tgacattgcc cactaatttc aacattgtac cgtttcatga gctgtctaaa aatcttctct 101941 tttctggtct tagtttctga acttctgaga gtgctctgag gctagtgaga tcctcaggcc 102001 atattctttt tttcctggag aatgggtctt cagctggcta tgcctgagtg tgaggactca 102061 cttatagcag ctgcaggagg ctgtgcatga ttcctcccct tcactcctcc cctgcagagg 102121 aacagaaggg gtctgatgaa aatgtaaggc ctttcataag agaacttgtt catgggttgt 102181 ccaaatcaga aaggcaataa aagaagagac ctaatcccct ggaagcaagt cctggcctaa 102241 gctggtgaca atatttcagt aaggttcaat ctgtggtgta gctggaagag aacctccatg 102301 tgaacttttc cagctcctat ctaagttaca agctggaaag taagaactcc aagtgaaaac 102361 ctgcctctct caggtttggg ggatagagga caggagtaga agctggtttc caaaaagcgt 102421 tctcctgacc cccaagaact cactcctctc attcttactg ctgcctgcct catacccctc 102481 accaatcatt cacactgagc tctccagata ccctagaaag ggacctggag gtattgctgg 102541 caaattcctc cctaccccca agtttctgaa gaggcatcaa ctcaaagatg agaaggggcc 102601 cccttcaaag ctcttggaac agtcgagcaa atgaaaactc ttaaagccat aaattgtccg 102661 ggtgcggtgg ctcatgcctg taatcccagc actttgggag gccgaggtgg gtggatc // LOCUS AC004080 129354 bp DNA PRI 29-JAN-1998 DEFINITION Homo sapiens PAC clone DJ0170O19 from 7p15-p21, complete sequence. ACCESSION AC004080 NID g2822164 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 129354) AUTHORS Bradshaw,H., Hinds,K. and Keppler,D. TITLE The sequence of Homo sapiens PAC clone DJ0170O19 JOURNAL Unpublished (1998) REFERENCE 2 (bases 1 to 129354) AUTHORS Waterston,R. TITLE Direct Submission JOURNAL Submitted (29-JAN-1998) Department of Genetics, Washington University, 4444 Forest Park Avenue, St. Louis, Missouri 63108, USA COMMENT SUBMITTED BY: Genome Sequencing Center Department of Genetics Washington University St. Louis MO 63108, USA http://genome.wustl.edu/gsc mailto:sapiens@watson.wustl.edu NOTICE: This sequence may not represent the entire insert of this clone. It may be shorter because we only sequence overlapping clone sections once, or longer because we provide a small overlap between neighboring data submissions. This sequence was finished as follows unless otherwise noted: all regions were double stranded or sequenced with an alternate chemistry; an attempt was made to resolve all sequencing problems, such as compressions and repeats; all regions were covered by sequence from more than one subclone; and the assembly was confirmed by restriction digest. MAPPING INFORMATION: This clone was provided to the Washington University Genome Sequencing Center by Dr. Stephen Scherer, Department of Genetics, The Hospital for Sick Children, Toronto, Ontario, Canada. This clone was isolated as part of a chromosome 7 mapping effort supported by the Canadian Genome Analysis and Technology program. For more information, see http://www.genet.sickkids.on.ca/chromosome7 SOURCE INFORMATION: This clone was derived from human PAC library RPCI-1, prepared by Pieter de Jong and coworkers at Roswell Park Cancer Institute, using the method described by Ioannou et al., Nature Genetics 6:84-9 (1994). The library is from one male donor. For further details, see http://bacpac.med.buffalo.edu/ The clone is available from Genome Systems, Inc. (http://www.genomesystems.com). VECTOR: pCYPAC2 NEIGHBORING SEQUENCE INFORMATION: The clone sequenced to the right is H_DJ1200I23. The actual start of this clone is at base position 1 of DJ0170O19; the actual end is at base position 129354 of DJ0170O19. This clone contains STS sWSS1331 (NID:g454747). FEATURES Location/Qualifiers source 1..129354 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" /clone="DJ0170O19" /clone_lib="RPCI-1" /map="7p15-p21" misc_feature 1888..2352 /note="match to EST AA449704 (NID:g2163454) zx09a02.s1" misc_feature 1892..2325 /note="match to EST AA580334 (NID:g2355661) nn12h08.s1" misc_feature complement(1903..2126) /note="match to EST AA258329 (NID:g1893470) zr59a04.s1" misc_feature 1953..2227 /note="match to EST T81791 (NID:g704798) yd30f04.s1" misc_feature complement(2216..2636) /note="match to EST AA448557 (NID:g2162227) zx09a02.r1" misc_feature complement(2545..2839) /note="match to EST T82108 (NID:g705115) yd30f04.r1" gene complement(2595..4103) /gene="HOXA4" CDS complement(join(2595..2941,3488..4103)) /gene="HOXA4" /note="match to Q00056 (PID:g123229); H_DJ0170O19.9" /codon_start=1 /product="HXA4_HUMAN HOMEOBOX PROTEIN HOX-A4" /db_xref="PID:g2822173" /translation="MTMSSFLINSNYIEPKFPPFEEYAQHSGSGGADGGPGGGPGYQQ PPAPPTQHLPLQQPQLPHAGGGREPTASYYAPRTAREPAYPAAALYPAHGAADTAYPY GYRGGASPGRPPQPEQPPAQAKGPAHGLHASHVLQPQLPPPLQPRAVPPAAPRRCEAA PATPGVPAGGSAPACPLLLADKSPLGLKGKEPVVYPWMKKIHVSAVNPSYNGGEPKRS RTAYTRQQVLELEKEFHFNRYLTRRRRIEIAHTLCLSERQVKIWFQNRRMKWKKDHKL PNTKMRSSNSASASAGPPGKAQTQSPHLHPHPHPSTSTPVPSSI" misc_feature complement(2692..2754) /gene="HOXA4" /note="match to EST R02467 (NID:g752203) ye86a01.r1" misc_feature complement(3077..3356) /gene="HOXA4" /note="match to EST AA340685 (NID:g1992922)" misc_feature 3324..4389 /note="CpG_island (%GC=73.0, o/e=0.70, #CpGs=118)" misc_feature complement(14878..15140) /note="match to EST T29089 (NID:g611187)" misc_feature 14909..15281 /note="match to EST N80726 (NID:g1243427) za98a07.s1" misc_feature 14918..15458 /note="match to EST AA612758 (NID:g2463796) nq27b09.s1" misc_feature 14921..15323 /note="match to EST N75430 (NID:g1238008) za82g02.s1" misc_feature 14921..15340 /note="match to EST N89758 (NID:g1443085) zb14e02.s1" misc_feature 14921..15332 /note="match to EST N74705 (NID:g1231990) za79e11.s1" misc_feature complement(15021..15463) /note="similar to EST AA220389 (NID:g1838849) my27g09.r1" misc_feature complement(15177..15457) /note="match to EST AA081617 (NID:g1623675) zn18g03.r1" gene complement(15205..16977) /gene="HOXA5" CDS complement(join(15205..15455,16416..16977)) /gene="HOXA5" /note="match to P20719 (PID:g123225); H_DJ0170O19.8" /codon_start=1 /product="HXA5_HUMAN HOMEOBOX PROTEIN HOX-A5" /db_xref="PID:g2822172" /translation="MSSYFVNSFCGRYPNGPDYQLHNYGDHSSVSEQFRDSASMHSGR YGYGYNGMDLSVGRSGSGHFGSGERARSYAASASAAPAEPRYSQPATSTHSPQPDPLP CSAVAPSPGSDSHHGGKNSLSNSSGASADAGSTHISSREGVGTASGAEEDAPASSEQA SAQSEPSPAPPAQPQIYPWMRKLHISHDNIGGPEGKRARTAYTRYQTLELEKEFHFNR YLTRRRRIEIAHALCLSERQIKIWFQNRRMKWKKDNKLKSMSMAAAGGAFRP" misc_feature complement(16096..16145) /gene="HOXA5" /note="match to EST AA081617 (NID:g1623675) zn18g03.r1" misc_feature 16365..19313 /note="CpG_island (%GC=59.3, o/e=0.70, #CpGs=216)" misc_feature complement(16412..16545) /gene="HOXA5" /note="similar to EST AA220389 (NID:g1838849) my27g09.r1" misc_feature complement(16418..16890) /gene="HOXA5" /note="similar to EST W30090 (NID:g1310256) mc26f03.r1" misc_feature complement(16421..17029) /note="similar to EST AA036271 (NID:g1509456) mi72d12.r1" misc_feature complement(16571..17004) /note="similar to EST W07524 (NID:g1281527) za98a07.r1" misc_feature complement(16596..17006) /note="similar to EST W05093 (NID:g1277815) za79e11.r1" misc_feature complement(16610..17036) /note="similar to EST W05150 (NID:g1277872) za82g02.r1" misc_feature complement(16678..17034) /note="match to EST W17241 (NID:g1291621) zb14e02.r1" gene complement(19028..21119) /gene="HOXA6" CDS complement(join(19028..19287,20678..21119)) /gene="HOXA6" /note="match to P31267 (PID:g399987);but note possible polymorphisms; H_DJ0170O19.7" /codon_start=1 /product="HXA6_HUMAN HOMEOTIC PROTEIN HOX-A6" /db_xref="PID:g2822171" /translation="MSSYFVNPTFPGSLPSGQDSFLGQLPLYQAGYDALRPFPASYGA SSLPDKTYTSPCFYQQSNSVLACNRASYEYGASCFYSDKDLSGASPSGSGKQRGPGDY LHFSPEQQYKPDSSSGQGKALHDEGADRKYTSPVYPWMQRMNSCAGAVYGSHGRRGRQ TYTRYQTLELEKEFHFNRYLTRRRRIEIANALCLTERQIKIWFQNRRMKWKKENKLIN STQPSGEDSEAKAGE" misc_feature 20679..21443 /note="CpG_island (%GC=63.0, o/e=0.80, #CpGs=69)" misc_feature 24026..24866 /note="CpG_island (%GC=62.9, o/e=0.80, #CpGs=72)" misc_feature complement(25836..26045) /note="similar to EST H28870 (NID:g899780) yp03a11.s1" misc_feature complement(27304..27660) /note="match to EST AA557521 (NID:g2327998) ng50g04.s1" misc_feature complement(28149..28184) /note="match to EST R80374 (NID:g856655) yi96h02.r1" misc_feature 28178..28594 /note="match to EST AA126727 (NID:g1686291) zk95d06.s1" misc_feature 28200..28570 /note="similar to EST AA181926 (NID:g1765419) zp66b09.s1" misc_feature complement(28202..28610) /note="match to EST AA187553 (NID:g1773763) zp66b09.r1" gene complement(28279..29915) /gene="HOXA7" CDS complement(join(28279..28592,29537..29915)) /gene="HOXA7" /note="match to AF026397 (PID:g2739071); H_DJ0170O19.6" /codon_start=1 /product="homeobox transcription factor HOXA7" /db_xref="PID:g2822170" /translation="MSSSYYVNALFSKYTAGASLFQNAEPTSCSFAPNSQRSGYGAGA GAFASTVPGLYNVNSPLYQSPFASGYGLGADAYGNLPCASYDQNIPGLCSDLAKGACD KTDEGALHGAAEANFRIYPWMRSSGPDRKRGRQTYTRYQTLELEKEFHFNRYLTRRRR IEIAHALCLTERQIKIWFQNRRMKWKKEHKDEGPTAAAAPEGAVPSAAATAAADKADE EDDDEEEEDEEE" misc_feature complement(28896..29270) /gene="HOXA7" /note="match to EST AA570160 (NID:g2344140) ne57g06.s1" misc_feature 29353..30318 /note="CpG_island (%GC=64.8, o/e=0.80, #CpGs=100)" misc_feature 33413..34711 /note="CpG_island (%GC=62.7, o/e=0.70, #CpGs=94)" misc_feature 35808..36184 /note="match to EST AA496921 (NID:g2230242) ae32b09.s1" misc_feature 36213..36484 /note="match to EST AA577580 (NID:g2355054) nn15b05.s1" misc_feature 36275..36702 /note="match to EST AA541292 (NID:g2287726) nf97f01.s1" gene complement(36973..38827) /gene="HoxA9" CDS complement(join(36973..37211,38248..38827)) /gene="HoxA9" /note="match to U82759 (PID:g1778588); H_DJ0170O19.5" /codon_start=1 /product="homeodomain protein HOXA9" /db_xref="PID:g2822169" /translation="MATTGALGNYYVDSFLLGADAADELSVGRYAPGTLGQPPRQAAT LAEHPDFSPCSFQSKATVFGASWNPVHAAGANAVPAAVYHHHHHHPYVHPQAPVAAAA PDGRYMRSWLEPTPGALSFAGLPSSRPYGIKPEPLSARRGDCPTLDTHTLSLTDYACG SPPVDREKQPSEGAFSENNAENESGGDKPPIDPNNPAANWLHARSTRKKRCPYTKHQT LELEKEFLFNMYLTRDRRYEVARLLNLTERQVKIWFQNRRMKMKKINKDRAKDE" misc_feature 37667..40213 /note="CpG_island (%GC=61.2, o/e=0.80, #CpGs=201)" misc_feature complement(38607..38888) /note="match to EST AA497085 (NID:g2230406) ae32b09.r1" misc_feature complement(39186..39522) /note="match to EST AA035273 (NID:g1506785) zk24c05.s1" misc_feature 39562..39825 /note="match to EST AA579981 (NID:g2355308) ng51d06.s1" misc_feature 42623..43504 /note="CpG_island (%GC=68.3, o/e=0.70, #CpGs=76)" misc_feature 43970..44323 /note="match to EST AA599729 (NID:g2433354) ag11d04.s1" misc_feature 43977..44180 /note="match to EST AA195633 (NID:g1783712) zr38a07.s1" misc_feature 43977..44330 /note="match to EST AA026855 (NID:g1493064) zk02a09.s1" misc_feature complement(44183..44409) /note="match to EST AA508563 (NID:g2246066) nh67c09.s1" misc_feature complement(44198..44328) /note="similar to EST AA284178 (NID:g1928671) zs57h11.s1" misc_feature complement(44198..44319) /note="similar to EST R07021 (NID:g758944) yf13b03.s1" misc_feature complement(44227..44616) /note="similar to EST AA026854 (NID:g1493063) zk02a09.r1" misc_feature complement(44241..44492) /note="match to EST AA194034 (NID:g1783894) zr38a07.r1" misc_feature 44608..45020 /note="match to EST AA535695 (NID:g2279948) nf88b10.s1" misc_feature 44608..45116 /note="match to EST AA149433 (NID:g1719949) zl26b02.s1" misc_feature 44608..45187 /note="match to EST AA054590 (NID:g1545659) zk68c06.s1" misc_feature 44629..45018 /note="match to EST AA134158 (NID:g1691372) zo18b11.s1" misc_feature 44629..45030 /note="match to EST AA079375 (NID:g1618269) zm95d12.s1" misc_feature 44629..45064 /note="match to EST AA582417 (NID:g2359777) nn49h11.s1" misc_feature complement(44855..45158) /note="match to EST AA079374 (NID:g1618268) zm95d12.r1" misc_feature complement(44901..45374) /note="match to EST AA054778 (NID:g1545714) zk68c06.r1" misc_feature complement(45101..45354) /note="match to EST T29707 (NID:g611805)" misc_feature complement(45156..45447) /note="similar to EST AA151516 (NID:g1720003) zl26b02.r1" gene complement(45269..47691) /gene="HOXA10" CDS complement(join(45269..45543,46719..47691)) /gene="HOXA10" /note="91% similar to Mus musculus homeobox protein HOX-A10, P31310 (PID:g1708349); H_DJ0170O19.3" /codon_start=1 /product="homeobox protein HOX-A10" /db_xref="PID:g2822167" /translation="MFCTRNVSQKGLSAPFAKLSHNNVMLGEPAANSFLVDSLISSGR GEAGGGGGGAGGGGGGGYYAHGGVYLPPAADLPYGLQSCGLFPTLGGKRNEAASPGSG GGGGGLGPGAHGYGPSPIDLWLDAPRSCRMEPPDGPPPPPQQQPPPPPQPPQPAPQAT SCSFAQNIKEESSYCLYDSADKCPKVSATAAELAPFPRGPPPDGCALGTSSGVPVPGY FRLSQAYGTAKGYGSGGGGAQQLGAGPFPAQPPGRGFDLPPALASGSADAARKERALD SPPPPTLACGSGGGSQGDEEAHASSSAAEELSPAPSESSKASPEKDSLGNSKGENAAN WLTAKSGRKKRCPYTKHQTLELEKEFLFNMYLTRERRLEISRSVHLTDRQVKIWFQNR RMKLKKMNRENRIRELTANFNFS" misc_feature complement(45280..45547) /gene="HOXA10" /note="match to EST AA134157 (NID:g1691371) zo18b11.r1" misc_feature 46168..48146 /note="CpG_island (%GC=67.5, o/e=0.70, #CpGs=199)" misc_feature 51134..51562 /note="match to EST AA429135 (NID:g2111910) zw51c04.r1" misc_feature complement(52163..52595) /note="match to EST AA428199 (NID:g2111849) zw51c04.s1" misc_feature complement(53011..53120) /note="match to EST AA134157 (NID:g1691371) zo18b11.r1" misc_feature 54884..55273 /note="match to EST AA598674 (NID:g2432257) ae40b01.s1" misc_feature 54885..55206 /note="match to EST N26932 (NID:g1141280) yw67g12.s1" misc_feature 54885..55331 /note="match to EST AA114866 (NID:g1669897) zl02d12.s1" misc_feature 54886..55323 /note="match to EST AA042944 (NID:g1522460) zk53b05.s1" misc_feature 54886..55366 /note="match to EST AA039979 (NID:g1516256) zk45h04.s1" misc_feature 54886..55362 /note="match to EST AA029311 (NID:g1496715) zk10e04.s1" misc_feature 54886..55326 /note="match to EST AA152286 (NID:g1721486) zl03b01.s1" misc_feature 54889..55218 /note="match to EST T96674 (NID:g735298) ye52e08.s1" misc_feature 54890..55441 /note="match to EST H94842 (NID:g1102475) yu57a03.s1" misc_feature 54917..55084 /note="similar to EST H91677 (NID:g1087255) yv04a05.s1" misc_feature complement(55204..55741) /note="match to EST AA029865 (NID:g1496092) zk10e04.r1" misc_feature complement(55220..55624) /note="match to EST AA150295 (NID:g1721816) zl03b01.r1" misc_feature complement(55283..55741) /note="match to EST AA040021 (NID:g1516316) zk45h04.r1" misc_feature complement(55294..55634) /note="match to EST H91773 (NID:g1087351) yv04a05.r1" misc_feature complement(55325..55635) /note="similar to EST T96789 (NID:g735413) ye52e08.r1" misc_feature complement(55387..55741) /note="similar to EST AA043026 (NID:g1522579) zk53b05.r1" misc_feature complement(55503..55968) /note="match to EST AA114865 (NID:g1669896) zl02d12.r1" misc_feature complement(55805..56222) /note="match to EST H94900 (NID:g1102533) yu57a03.r1" misc_feature complement(55850..56139) /note="match to EST N39891 (NID:g1163436) yw67g12.r1" gene complement(56165..58513) /gene="HOXA11" CDS complement(join(56165..56397,57805..58513)) /gene="HOXA11" /note="match to AF039307 (PID:g2745851); H_DJ0170O19.4" /codon_start=1 /product="HXAB_HUMAN HOMEOBOX PROTEIN HOX-A11" /db_xref="PID:g2822168" /translation="MDFDERGPCSSNMYLPSCTYYVSGPDFSSLPSFLPQTPSSRPMT YSYSSNLPQVQPVREVTFREYAIEPATKWHPRGNLAHCYSAEELVHRDCLQAPSAAGV PGDVLAKSSANVYHHPTPAVSSNFYSTVGRNGVLPQAFDQFFETAYGTPENLASSDYP GDKSAEKGPPAATATSAAAAAAATGAPATSSSDSGGGGGCRETAAAAEEKERRRRPES SSSPESSSGHTEDKAGGSSGQRTRKKRCPYTKYQIRELEREFFFSVYINKEKRLQLSR MLNLTDRQVKIWFQNRRMKEKKINRDRLQYYSANPLL" misc_feature 57340..58346 /gene="HOXA11" /note="CpG_island (%GC=62.2, o/e=0.80, #CpGs=96)" misc_feature complement(58170..58562) /note="match to EST AA098502 (NID:g1644051) mo08d12.r1" misc_feature 60610..60899 /note="match to EST AA480677 (NID:g2208828) ne24e05.s1" misc_feature complement(65532..65927) /note="match to EST AA378801 (NID:g2031141)" misc_feature 65556..66847 /note="CpG_island (%GC=66.1, o/e=0.80, #CpGs=114)" misc_feature 67741..67982 /note="similar to EST R33638 (NID:g789496) yh82d01.s1" misc_feature 67759..68137 /note="match to EST R80259 (NID:g856540) yi96h02.s1" misc_feature complement(67877..68338) /note="match to EST R80374 (NID:g856655) yi96h02.r1" misc_feature 68434..68672 /note="match to EST AA088239 (NID:g1633751) zl82a04.s1" misc_feature complement(68665..69010) /note="match to EST R33750 (NID:g789608) yh82d01.r1" misc_feature 68952..69374 /note="match to EST AA088276 (NID:g1633797) zl82g03.s1" misc_feature complement(68973..69360) /note="match to EST AA088289 (NID:g1633863) zl82a04.r1" misc_feature complement(69285..69692) /note="match to EST AA088393 (NID:g1633906) zl82g03.r1" misc_feature 69566..69937 /note="match to EST AA595017 (NID:g2410367) no31g05.s1" gene complement(71567..73446) /gene="HOXA13" CDS complement(join(71567..71811,72525..73446)) /gene="HOXA13" /note="H_DJ170O19.2; match to U82827 (PID:g1832353)" /codon_start=1 /product="transcription factor HOXA13" /db_xref="PID:g2822166" /translation="MTASVLLHPRWIEPTVMFLYDNGGGLVADELNKNMEGAAAAAAA AAAAAAAGAGGGGFPHPAAAAAGGNFSVAAAAAAAAAAAANQCRNLMAHPAPLAPGAA SAYSSAPGEAPPSAAAAAAAAAAAAAAAAAASSSGGPGPAGPAGAEAAKQCSPCSAAA QSSSGPAALPYGYFGSGYYPCARMGPHPNAIKSCAQPASAAAAAAFADKYMDTAGPAA EEFSSRAKEFAFYHQGYAAGPYHHHQPMPGYLDMPVVPGLGGPGESRHEPLGLPMESY QPWALPNGWNGQMYCPKEQAQPPHLWKSTLPDVVSHPSDASSYRRGRKKRVPYTKVQL KELEREYATNKFITKDKRRRISATTNLSERQVTIWFQNRRVKEKKVINKLKTTS" misc_feature complement(71693..71819) /gene="HOXA13" /note="similar to EST AA160421 (NID:g1735215) zo64h02.r1" misc_feature 72441..74061 /note="CpG_island (%GC=72.2, o/e=0.80, #CpGs=192)" misc_feature complement(72525..72807) /gene="HOXA13" /note="similar to EST AA160421 (NID:g1735215) zo64h02.r1" misc_feature complement(75781..76191) /note="match to EST AA515492 (NID:g2255092) nf65g05.s1" misc_feature 78281..79047 /note="CpG_island (%GC=63.9, o/e=0.80, #CpGs=69)" repeat_region 81945..82000 /rpt_family="(CA)n" repeat_region 82819..82864 /rpt_family="7SLRNA" repeat_region 82869..83031 /rpt_family="Alu" repeat_region 85528..85662 /rpt_family="L1" repeat_region 85676..85765 /rpt_family="MIR" misc_feature 86432..87585 /note="CpG_island (%GC=64.7, o/e=0.70, #CpGs=84)" repeat_region 90158..90373 /rpt_family="MER1_type" repeat_region 91086..91380 /rpt_family="Alu" repeat_region 91430..91545 /rpt_family="(GAAA)n" repeat_region 91900..92076 /rpt_family="MIR" repeat_region 92108..92405 /rpt_family="Alu" repeat_region 93419..93568 /rpt_family="MIR" repeat_region 94317..94381 /rpt_family="L2" repeat_region 95119..95178 /rpt_family="MIR" repeat_region 95750..95952 /rpt_family="MIR" repeat_region 96059..96418 /rpt_family="L1" repeat_region 96673..96903 /rpt_family="MIR" repeat_region 101688..101980 /rpt_family="Alu" repeat_region 102232..102317 /rpt_family="MIR" repeat_region 102344..102399 /rpt_family="MIR" repeat_region 102897..102941 /rpt_family="L2" repeat_region 103010..103090 /rpt_family="MIR" repeat_region 103389..103511 /rpt_family="MIR" repeat_region 103747..103819 /rpt_family="Alu" repeat_region 103826..103957 /rpt_family="Alu" repeat_region 103988..104101 /rpt_family="(GGAA)n" repeat_region 104089..104378 /rpt_family="Alu" repeat_region 104512..104642 /rpt_family="Alu" repeat_region 104649..104777 /rpt_family="Alu" repeat_region 104784..105090 /rpt_family="Alu" repeat_region 105107..105266 /rpt_family="Alu" repeat_region 105270..105431 /rpt_family="Alu" repeat_region 105625..105700 /rpt_family="MIR" repeat_region 105759..105879 /rpt_family="Alu" repeat_region 105895..106194 /rpt_family="Alu" repeat_region 106226..106383 /rpt_family="Alu" repeat_region 106551..106690 /rpt_family="MIR" repeat_region 106854..107051 /rpt_family="MIR" repeat_region 107293..107584 /rpt_family="Alu" repeat_region 107935..108059 /rpt_family="Alu" repeat_region 108146..108283 /rpt_family="MER1_type" repeat_region 109510..109626 /rpt_family="MIR" repeat_region 111232..111346 /rpt_family="MIR" misc_feature 115837..116886 /note="CpG_island (%GC=65.2, o/e=0.80, #CpGs=87)" gene 116400..119794 /gene="EVX1" CDS join(116400..116826,118417..118673,119255..119794) /gene="EVX1" /note="match to P49640 (PID:g1352398); H_DJ0170O19.1" /codon_start=1 /product="EVX1_HUMAN HOMEOBOX EVEN-SKIPPED HOMOLOG" /db_xref="PID:g2822165" /translation="MESRKDMVVFLDGGQLGTLVGKRVSNLSEAVGSPLPEPPEKMVP RGCLSPRAVPPATRERGGGGPEEEPVDGLAGSAAGPGAEPQVAGAAMLGPGPPAPSVD SLSGQGQPSSSDTESDFYEEIEVSCTPDCATGNAEYQHSKGSGSEALVGSPNGGSETP KSNGGSGGGGSQGTLACSASDQMRRYRTAFTREQIARLEKEFYRENYVSRPRRCELAA ALNLPETTIKVWFQNRRMKDKRQRLAMTWPHPADPAFYTYMMSHAAAAGGLPYPFPSH LPLPYYSPVGLGAASAASAAASPFSGSLRPLDTFRVLSQPYPRPELLCAFRHPPLYPG PAHGLGASAGGPCSCLACHSGPANGLAPRAAAASDFTCASTSRSDSFLTFAPSVLSKA SSVALDQREEVPLTR" misc_feature 118390..119987 /note="CpG_island (%GC=68.0, o/e=0.70, #CpGs=136)" misc_feature 124870..125947 /note="CpG_island (%GC=65.3, o/e=0.80, #CpGs=89)" repeat_region 127149..127195 /rpt_family="L2" repeat_region 127622..127921 /rpt_family="Alu" repeat_region 128053..128574 /rpt_family="MaLR" repeat_region 128920..129083 /rpt_family="Alu" repeat_region 129201..129315 /rpt_family="Alu" BASE COUNT 31356 a 33395 c 33338 g 31265 t ORIGIN 1 gatcattttc tataatctgg gttttgtaaa actcaccaat accttaataa tcaatacagt 61 tgggacaaaa atgccatctt gtcctggaat atcagaaaaa tgatctcaat cctctctctt 121 ccctttctgg atggacttat aagtgcaagt gaaagaaagt cacccctttc atcctcatac 181 aactgcctgc agcatgaagc taatgaatat tcccataccc ctcttttccc acaaaggaga 241 aatataagaa tttaccttaa atgatgatag tctgccattt gagattttgc agcttgacac 301 tcaatttcca ctgagagtct aggcattttt tttttttagc agacagccaa gatgctgtgt 361 ccttgcctga gaccacatta tttgcaagaa gatccctctc tcctccacag acttttccca 421 gagaatgcct ttcagaattg ctgttgaatt acaaagaagt atttattgtt tgaggtgatt 481 aatgattcct gggaaaattg ctattttcag cgtgatttaa ttttgtttta caagtgtgtg 541 tagtaggggg agggttaata agtttcagca gcagttgcat tgcctccaga aatccctggc 601 acccgttcac accttataaa atgcacttta tgccaggttt atcctgattt gggaaggtaa 661 gtgtgtattt gattgctccc gaattttctt ggctcttcag agactggtat ctatcacttt 721 cctctcaaat ctgatttgta ggtatagaac ggatcaccct tgtttgtatc acacacaggg 781 cagcaggggt gaaaggggta ccagctctga cactggctaa agggttttgc cactaatgag 841 agaaatgtgt ttaaaaattc tctgatattg cagaggcagg tgagcactta ctcccatcta 901 tgctaaaatt actacattac ttaaaagttg cactttttca aggccaggtg acagttctaa 961 tattctgggg ctagtgtgca caggtttttg tggtcttatt gtagggaact ccctgcagta 1021 agcatcaaat ctaaatcagt aaacaaatac tagggttata gagatcgaga agtgggtttt 1081 gattttcagc cttttggatc tagatctctg gaatcgagca gaaaattcac aaagcatttg 1141 aggaaagtga cccagtttgg ctattttcca gcagtaaact ctcccaggtc tactttgcat 1201 ggggtgaact ttaggggtat attatgcttt ttcctcagaa gagacattgg agagtgtact 1261 tgtaaaattc agtttatttg aaagctcttt cttcaaaaaa gagaaacaaa caagcaaaca 1321 aacaaaaatt cccagtctgt agttagtgaa gaggacctta agaacagcca ggaaacaaaa 1381 actaatctca accctttgca caaactattt tttatctcct gggaaacagg cagattaagt 1441 gtgaaggtgc aacaaatacc aaataccaaa gagaccactt ctttccaagt gtgctcagag 1501 ttcagtgtgt agaacagctg cagaaactcc tgggcccaca gcactgcatg gtttcaactt 1561 tcattgcagg gaaggacacg tttctatgcc ttacagagac ttgaagcctg aaagcctggc 1621 caagtttggg aagaagagga gccctctcag agctcaagag ccttcctact ctttggaact 1681 tttccacagt aggccaagct tgacaagagt tcagctcaag ttgaacatac atacacacac 1741 tctcacacac aaattatgtg agccgtcaga atccaagtga atccagctca agctatctac 1801 aaggttttta catgcaaggt caagtatctc aatccagagg acttttgttt tcttaatgaa 1861 aagcttagaa aacacatgaa tcttagattt ttaatgtttt ttaaatggag tttattctta 1921 gcacatggct ttctatgtag ccacatcaca atttgtacag ttccacataa gtctaaatgc 1981 actcccctct ccccaaagac cgtgccccag aaggggacaa cagtatctct gtaacagtgt 2041 cttaaataaa tgcaagtaag aaaaactaac atgtcacacc taccatcaag gtctacacat 2101 cttaagaatt aaataatctt gtgaggtcca caatgtctac tcatttattc agttaaatac 2161 aagcttggat gagctgcctt aatgggggaa gaggaggtaa ggaaggtggg gaaagggtct 2221 gtcttcttct gctttcaaca aagatgcaag tgtatgtccc cactcagcat gggctgtcca 2281 gctagctgac tgtagccatc tcaaaagtat ttgataccaa gtagtccttc tcaggtatcc 2341 acacctggca gccttgtttc gggccagcag gttgttccac cagccagcat cctggacaac 2401 tgttctctct tgggtggcaa ccagcacaga ctcttaaccg gataatgtct tctttttgat 2461 tattcttttc accaatttgg gtttgttttg gtgtctatta tggtccagat ggggaggggt 2521 ggatgaggaa cggagcagga gaagagaaga gaaaagcagg taagggatag aaactggtta 2581 agatctctag aagattatat ggaggaggga acgggtgtgg aggtgctcgg gtgggggtgg 2641 ggatggaggt gtgggctctg agtttgtgct ttccctggtg ggccggcaga ggccgaggcc 2701 gaattggagg atcgcatctt ggtgttgggc agtttgtggt ctttcttcca cttcatcctc 2761 cggttctgaa accagatctt gacctggcgc tcagacaaac agagcgtgtg ggcgatctcg 2821 atgcggcgcc gccgggtcag gtatcgattg aagtggaact ccttctccag ctccaagacc 2881 tgctgccggg tgtaggcggt tcgagagcgc ttaggctccc ctccgttata actggggtta 2941 actgaaaacc cagaaccccg aaatagaagg ccaaggagga gggagcggaa gagagggaaa 3001 ggaggaggag agagaaggtg gggtggggaa gagcaattgg acatcatcat tatataataa 3061 cctgacacca agttcacgca agatacataa aacggcaaag aaaatgtttc acaggctatt 3121 gacaacggga aacacccgag agccataaag aatggtgctg ggcattgggg ctgaagaaaa 3181 gcttcaaaga cacatggcct gaagcctctg ccagggcaat taaatttatg ggggctataa 3241 ttactgccct aacagtttgg cgtctcgtaa atctcctgat aaagggaccc tgggtacaaa 3301 agggttctca tgttgggatc aggcggctgg ctggcgcgca catacccaca tctcaccgca 3361 gcccgggtca gatgggggct cccctcccga ggcccccttc ccctgagcct ctccctcctg 3421 accccgaccc tcgaacccag gcccagcccc ggcccacctc ccgcgcctcc caagcggcgc 3481 cacgtaccgg cgctgacatg gatcttcttc atccaggggt acaccacggg ctccttgccc 3541 ttcaggccca gcgggctctt gtcggccaag agcagcgggc acgcgggggc gctgccccct 3601 gccgggacgc ctggggtggc gggggccgcc tcgcagcgcc gcggggccgc tgggggcacg 3661 gcgcgaggct gcaggggcgg cggcagctgg ggctgcagga cgtggctcgc atgcaggccg 3721 tgcgctgggc ccttggcttg cgccgggggc tgctcgggct ggggcggccg cccggggctg 3781 gcgccgccgc ggtagccata ggggtaggcg gtgtccgcgg ccccatgcgc ggggtacagc 3841 gcggcagcag ggtaggcggg ctcgcgggcg gtccgcggcg cgtagtagga ggcagtgggc 3901 tctcggccgc cgcccgcgtg agggagctgg ggctgctgca gcggcaggtg ctgggtcggg 3961 ggcgctgggg gctgctggta gccggggccc ccgcccgggc cgccgtctgc gccgcccgag 4021 ccgctgtgct gcgcgtactc ctcgaaggga gggaacttgg gctcgatgta gttggagttt 4081 atcaaaaacg agctcatggt cattaatttg tgaagtgcaa aaatactaat ttttctcgcg 4141 ttgtcgtttt ttctgggctt gccgaggccc ctccccctcc tgcctcgctt cccatccccc 4201 tttcctctgc gcccttcccc tccccccgct gtcaagtgcc cactcctccc cctcccgcag 4261 acgccgccac caaagttcga gccgctcctc cccagcccag cgcgcgcccc gccccgtgcc 4321 ccacgtgcag cgcccccacc aatgggcgca ccgcgcgcgc ggacccggat caggaaacgc 4381 gcgggtgcgt gatggatgct gctgtccggc ccctgggctg ggggagggag caggagcttt 4441 ggaccccagc cccccagctt tggttcccgc tgggaattca ggccctgtca ggctgtaggt 4501 cctctcggga gccctctgcc tgccctactg ctggcctagg cctcgggctg tctggcggcc 4561 gcgactcagc gctgacctcg ggcgcaaccc agtcaggctt cgtgtccttc aggggttcta 4621 ggctaacagg cgaaaggaag ggcgttggga ccgaggggca tcctggtttt tatgtacgcc 4681 actgagaggc caccagacac attttctcaa ccgcagatcc cccttcccca caccctgctc 4741 cttgcgtgtc agcctgagag cccttgcttt gagaagcttg gcagaagctg caaagggtgg 4801 gcgggcagct aagagaaatc gacccaagga tgtaaatcga ggccattcca ttataactgg 4861 atggacactt ttcatttttt ccttctttca gagacaatct gtttcgtgtt ttcctaagaa 4921 aaattggaac cttcgtaata gcatctaatt tgacgggggt tgtcgatgtg agagctaaat 4981 atgcccgcat ttactaggtg cgattgtgag agagaaggtg gcccaaggat gggaatggat 5041 agaagcaaca cctccacaga accgagcttt gaaaacaata acttcctatt tcagaactat 5101 ccccaaacaa aaacaagcta agggtagaat aaacaccttg ccgggtctga tcgctgatgg 5161 gtcttttcca gctaagaatt tcatgttttc tcttttagat cctgctttct caggcagtat 5221 ctgaggctag agttatattt gcaggacagt ctataatttc tgaattgctg aaaattagcg 5281 tattaacgat atcagaagct ccggaaagga gggagaggag actgttgcct gctatttggt 5341 aattgaaatt tgatgggtac actaattacg ccattattaa caaataaatt acttattaat 5401 tccacctaat gttgatcttt gaagtaaata ctgatgcctt atttgtgctg tgtgctttct 5461 ccctttcttt tctgagtagt agacatatct agatcctcta cttttcagcc taaattaaag 5521 cagtgtaaac tagcatagtc accattctaa aaatattttc atattggcat gcaaaagcaa 5581 ggatttttca gctggtgcac cttagttgat ttttcaaaga gcagtataaa cagccttctc 5641 acaactgagt ctggaacgca gacaaggaaa attatttcct aagcctggag acacttgaaa 5701 aggaatgtca attctatctt cattcatact ggttactcat atgagttact aaatgctgga 5761 atatatccat ttgatggata gtcacttaat gcttagccac ataaagccta ttatatggga 5821 ctaatcttta aactaattta ggaaaagagg ttaaaaaggg gatcatatta gctttctaac 5881 tggaatcacc ctgaagaggt acaaagagat tttccacgtt aggtgtatat gagtgtgaag 5941 agtgctgtcc attcacatga ggcaccctga aaatttgttt ttaaagaaat ttgagccaca 6001 gacagaaatc aacactgagt gtaatcttta gccatcctct ctagactgga ggaaaaattt 6061 agaatgtgat acatctacct gaaccaatat ctctccctag caagaaaaaa taatatacac 6121 ataggttata taaaatgtaa tataaataat atatagacat acgtattaca aatctgaacc 6181 ctataagttt cagggggaca aaaagcatga caaggaaatc ttccctcctt ctcatgtcat 6241 cagccttgag tactagggtc ttaaccatat ctgtttaata tttacagaca ctaaaacaca 6301 aaattctgtt gtttagcctc agaaccttgt accaagtttc tatttttaag tattaacgag 6361 acataaacac tgttttgtat acggttaacc caaacgagtt agctgtgcct gtgttttgtg 6421 tgattctatt actttaggaa gatggcctta cacagaatcc cccaaggcct gtaacttgtc 6481 tttgtggttc gtatcataaa cacaaacgga gccaggacca ccaagtgtta tctcaacacc 6541 gacattttga cattttactg caagatttat ggctgtaata aacaatctca gtaccttttc 6601 tgaaccttcc tcaatctccc tttgcaaacc atagcatcat tccattgaat caaataatct 6661 tttgaaaaac atttaaaaaa aatacctctt gcctttacac aatatccaag acaccaaagt 6721 aaagccagga agaaactaac tcaattaata aacaaactga agtttaccag cagcatctcg 6781 cctgagaaaa gatgggatgc cctgaaatgt agcagagagg gagcatgcta atcctcacac 6841 accaactggc tccagtccca agcggggtga aagcgttatc ctttccttag gaaactggtg 6901 agcacgtttg ctcatttcca cgtgcaggga taacatatat tcccaacaaa agctttctta 6961 aaatcccatt aggtgaaata acttttcatc atgtcctcga atcccagatg gagaagagtg 7021 aagggagtcg gagggagagg gagggtgcaa gggaggcaat gttttgcagc ttggtttgaa 7081 tctgatttga atcattttga atatatttgt aacagcattc cctcttgaat gcaaccctgt 7141 cccaagtttc aaagtgaccg aacagtgaca ccgtgtgcat tttgtttctt attaatctta 7201 cacattgaca gtctttgtta aatcacaagg cgcgcccttc actagccgac attttcatat 7261 ttgttagacg cactgacctg aagttcacct cggccttgga ctttgcgctt ctaaaaggtc 7321 tatacagtgt cttttagaga gcagggtgct ttgcccaggt cactccttct caggaaaaac 7381 caaggggaaa agccaaagga aatgtaaacg ttatggaatg tattgactgt atttgtcctt 7441 tgttctttag agcgagagtc ccccagactg ttctctatct gatgcatgtc tctagagctg 7501 aacagtggaa tggcagaatt tcaaaacgcc tgatggtggc atttgaaggc ttccccacca 7561 cctacactag acacaagatt tgagaggaac aacactttac cagccatttg accaattaat 7621 tctttgggga taattttctt gtagtagttt aaaataatgc acacaacgca gggatgagga 7681 ctgatattca tattgggatt acacatgaat tttaactggg attgtttgag aggcctgagg 7741 ttcaaaatcc tccagataaa gcaagcacac taaaagcaat aaattctgca agtactcttt 7801 tcttttactt tgaagactag ctaagaggta tctatggttt ttgaagctga catgtctata 7861 aggtgtgtca catgttttta accaaaaagc acaataaaaa ggtttttccc aaagagacac 7921 gtaattgtct tgttgactca tcgaggggtt tcagttttcc tcatttcact agcccaaatg 7981 tggtgaaatg ttcactgctg caacagcaat caccacagtt gtttcctttc ttctgtttca 8041 tctggcaaac ccccatttgg cttcaagctc ttggccagag tgaaaacttt acacattgca 8101 cagaagcacc ctgattactt ccatgaaggc agtgtttgga aaatatttac tttaccactg 8161 aacatacctg gcaccattaa atccaatcaa ccaaaatatt gggatgatct taaacattcc 8221 tgcaagagtc cacattctga gtagatgaat tatttccaaa gttaaaaaag aaaaacctag 8281 ggaaaatatt tcacttttct cttctctgtt ttcctatact gatcccttga aggtcaattc 8341 atagaaaagg gaaatatgtc ctctggaaaa tagattctta cagcaccaac acttaaagcc 8401 attctagatg catgaaaaat aaaatattgt ttagctcttc agttgcaact cacacatgag 8461 gcatggttct agtcggcttc cttaatacac tattctcttt cttttctgct ctcccaccct 8521 tcttcttagg ttgctttatc ttctccttgg cttttttttt taattccacg tgtatcatta 8581 aaaagtacat tctgaagaat agaaaatatt ctattctgtc ctggtggtct tacagagtag 8641 cctgttattt gtggatttca cctttctgca ttccctacag tctagttatt cacttatagc 8701 ttgtagcatt tctcttacat tcaattgtgg tttaataata aacattaaaa aaatttccaa 8761 acaggaacat tttcatggca ccagtaagca ttttgtcact ggcagtggtg gtggaagggg 8821 tgaagggaga attctgtgtc tttcaggagg gtttccactt ccttccttcc ccttctcaga 8881 tctcagaacg ctttgcattc agcggactgt agtttcagaa aaagcatatt ctgtgtttga 8941 aaactgcaaa gattatattt tgcaagaagt gtctgtgttt gcatttattc ttacacactt 9001 taggggtcat catgtgtact aaaaagacaa aaaaccggcc aatcagaatc cctcttttca 9061 aataaaggag gtttcctgca ccattctgtt gcctttgaag gcataatgaa atattggaaa 9121 cttgtgacat tagtttttaa agctccacag atgagttttt agcattttta ttttgtgaca 9181 aacccacaga ctcctggttc tccaacacct aaggtgttga tgtttcagta atctatgcct 9241 atttacctgc tgctattccc tcagaatggg agcgataatt caagatgaga tacagcatgt 9301 attactcttg aaaagaggaa ttttctatcc tttcctccgt aattgaggtc attcaaccac 9361 tagggttcac ctggagtcca taccgtgata cacgcgtcac tctgagccat tttatctttt 9421 gtgctgatag tcaagatcac agctctaaca ttgacatcaa actctgtctg ggcagatgac 9481 taagagcact gcacaatgta aacttttgac cctcaacttt ttgacctgca gttgtaacgc 9541 actaaccgca aagatacaca aagccgagcc tcttctttca gggggaaggg gccccccagc 9601 atctcaggat gccctgcttc tgccactgcc atttgaaatt agagggtgaa atggatattt 9661 ttgtgtgttt gtgactgtac tttttgttaa atcagcctat gacctcttcg ttagcaccta 9721 ggaactagat taacttgaaa tcactcgtga ttctatttta caaggaaaat ttggagcaga 9781 atgggagaac cttgcaaaaa gtgaaagaaa agagaagatg ggggaaagca ggcaatggga 9841 ggtggagaca ctttttccct ttatttaaaa ctaaagacgc agccctaatt gttgggagag 9901 ctggcccaag cgggtgaatt gactgtgaac ttgtactaaa gcgtgctctg ctggcgattc 9961 ctagggtgtg cagatttatc ttctctgcat ttacttaacc cggcagtgaa ctgcgcgggc 10021 gtcatttgtt aggcgatgac agacttcacc tccagcaagg gctgcttcac aaaatcgcaa 10081 taattatcta ataaccttca taacaaatat tattattgaa aagactggtt tgtggggagg 10141 ggacctggtg ggagaacaaa tttatttgtg aacaacaaca aacaaaacaa acctgggcag 10201 accttcaagt tctggggctt agaatggctg gggctgtgga tcccctcccc tacttgggtg 10261 ggagcttagg ctgaccccct cagccctgcc tgggagcccc gtttatagtt ttgccattga 10321 ctagaaggaa actcctcctc agaaaccaaa gggagggagc ccacaatgct ctgcactctc 10381 catggtgggc aagccatgga cagaccccca gccaaggcag gggggaggct gagaagggca 10441 tcttttaagc taaaaggatt gttttcctct ttaattgcct atcttttaag atgtgatttg 10501 ctttccactc actaattatt tcgatataat actctcagaa tctcaacaaa tgaacaggac 10561 tctgtttttt ggtgggaaat tctgtcttgc tctctcagag ccgccaacaa tgaagcaggg 10621 gaaagagcag gagaaaggga atcttggcat aatgttgtga aattagacca tggaaaccct 10681 aacaaaccac taagtaagtg tgaccagaag cttcctgttg tatttatagt tcagaaatat 10741 tgtctcttca gcttgtggga acaaacgagc ccccgcacat tgccgctgag gaggagcaca 10801 gacacgcact tctgccaccg gctgaggctg gatgtcttca taaagccctc agtgacagac 10861 atattttttc ttagtaagtt cctctgcaag aacaacccaa aagaatccac aaaagaaata 10921 acttatctac agaatgagca gaaaaccagc catcctcttt attatgcttc ctatgaaaat 10981 aggaagaaag aaaaaaatct tccagtaaca cataggtctg actgcatgat gtatttttaa 11041 agtcatttta attccatgtg gccatgtggg tttgcctgct ctcttaaatt ctacttaagt 11101 tttgtgaaga ttaaaacaga cagaaataag caagctgaca atatttacag cctgtaattt 11161 ttctcattcc ttggaaagat tctctatgtt ctgtggtact ggatatgact tcaacaggct 11221 ttctgctcat tcccacaccc cagggtggaa tatggccatg aagtagtgtg gatattttct 11281 gtgtaagtaa ctcaaattaa actggcagaa tccccgtcac tctttttttt ttctaatttc 11341 aatcaccaag aaatcactca agcaagatca ccaaatcagt aactaaaatg gaaccataac 11401 gcaatatttt ccaataagga gcccaaaatt cagagcagca aaacaaggaa tccagtattc 11461 tcacagacac ataacattat aaaagagaac ccatacccat gtagagttta tatccttgtt 11521 cccactaaga tgtggacaca tcttcttgaa tgctgaaata ccaatgttta ctttaatagg 11581 ttacacacaa tgacttcagg attcttcacc ttgccactat tcatgagaag tagcacttgt 11641 gggagggttt tgatttttca aaaaaacttt ctaggttttg ctttctggac ctctgacttt 11701 agggacatct gttggactta tgttgagtgt aggtggcctc tgcacaataa gtttattgaa 11761 attccaaatc tatactttca attttttcac tttaagcact taataggtat ctttaccaat 11821 taatacttgc tgaaaactgc ccagctccta aggagaaaag cagatcctat tttttgtttc 11881 atttctgaat gcagtaggag aatttggctt aattcctaaa ataggattgg aggaaatcta 11941 ctgggtccct tgtgggtacc catccagaaa aagatcccag gacaggccac agtccccagt 12001 cactgggctt gggttttgcc attgaagaat atggggggtt ggggccagaa ggggtgactg 12061 gggccaatat ggaattgtgc ccaggataaa cttatttcac cttacttcac ccattggtgc 12121 aattttggag actgttctgg aaatcataga ttatgtaaat ttcctgggat caaacagaaa 12181 gagcaactaa caaaagaaag gcggaaatct cctactgaca aaggaccaat ttcttcccta 12241 aactaccgtt tatgatgtgt caggaaaaac aacctaatgg ctctggggac ttttaagttg 12301 ggcactgaag acacctcaat ttcccccaaa actttagagc acagtttgga acagagaatt 12361 cgcctgtatg ttgaggggga gtgaatttct ccaatcttaa tgttatccag ggggccgcct 12421 aagttgcctt ctgagggtcc tgtgcgtaga tgtttttaat tctacaaaga aggagaggaa 12481 caggaaagaa gagagggaga agaaaggcaa agcggaagaa aagaaagcgc tttaacccct 12541 ttcaattagc ctggggattc aaagactaaa gttaaatccg gccataaagt ttattgcttc 12601 agactcacaa gcggctgaga acagtcccgc cgaaataaaa agaacatgca ggcaaacagg 12661 gttcagggcc tggtcccggg tgcgggggag ggggtcctga acaccccccc acaccagggt 12721 ggggatcctt ggtcctcagg gtccagtggg cgctagcagc ccaggatcca ccttgcaacc 12781 cgggggccca gcctggaggt gcagccccag cctcgccggc ctctgccacc ctcccgctct 12841 cgcgagctag cctgaaaccc ggccccgaag gccgccgcct caattcagcc ctgccaaatg 12901 accccggccc gcgaagacat attgccacag ccccgtaagg aatcccgcca gagtccgcct 12961 cggccctgcc ccggcctttc tttcaaactc ctgagcgcag cgcggccctc ggcgcccgcg 13021 gccggcgccc cactgtctcc cagccccgac cccagggctc cgcgaccccc aggagctggc 13081 cccggccggc ccagcaattg cgcgggggac tgggggtgcg gccctgccag gtccccacac 13141 acaggcccat tcgcacacaa aaatcatctt tttgcacgcc ggcgggagca gcggaagtca 13201 ttaacatccg cggttgtgct gcaattaaag ttaggcctgg ggatgcggcg cggccacagg 13261 cgctgctcac tctgctgcct ccgcagagtt ggctcctggc gctgctcttt tgggcagagg 13321 gaaagtttgc tctgcctttt cgaattcaga ggcagcctga gttattgaac cagagagaga 13381 gagagagaga gagagagaga gaaagtttcc aaagtacaaa taaacttgaa agcgctcagg 13441 aggcgagctt accttaactc ggagggagcc atttttcaga gagttttgag aacttgtggt 13501 ttggacactt ctggacctaa aattgacagt ttgaatggcc aggcggcaca cgtagcctgc 13561 aaaagagtca aatggagtcc agcgttagtg agattatatg ttatgtggta tataatgttg 13621 gatgtcaact ccccaaaacc ataaaactta ctttaatggc cccacgtgac gttttatagc 13681 cagtgagccg atctgtctgt gctatggatg attttacgat ctaattcata gacaaaaccc 13741 tattcatttg gcacccaaat gtcatatagc cggaactggg gcttataaag tttactgttt 13801 tataactttt aaaaggaaag acggcatcag tgtaagcagt cggtaaatgt gcaaatctct 13861 agttgcgctt tagctgctct gaggagtttc ccaatcgagc taggatgggg taagtacctt 13921 caatttgtag caaattaatt gtagcaaaag aagccaactg ggtcccgggt gaagagtggg 13981 gaaggggtgc tgggatgggt taagggcaga gggtttgggg tccacagaca gacatagcag 14041 cgtcttcagc aagtggaggc ctaggacagc cttaggaaag aggcaggatc tgtgtggcct 14101 gagggcggct aacaaagccc tgggtttttt ctcctttttt cttgctcttt ctctcttttt 14161 tgtacccagc aagttaactt ggtttcctca gagatggaca gggtgttctg gggctttgga 14221 acagcctaca gctttttcca ccttctgccc tgaactttgc aatgggtcag aggtagggaa 14281 gcgatgggac agtgttggta tgaggtctcc ctgcacaggt catctgctca ggtagcctca 14341 gacccaacag cttccaagac tgcacagaca gacagaaaag cagacagagc cgctcactat 14401 ttggcacaaa ccagaccaag agaacttaca atagaaagtt tattttttgt tccagtcagt 14461 attttttcct taaaaacaaa tacaaaaaaa aaaaaaaaaa aaaaaaagct gatcacagtt 14521 tgcttaaaac agccagactt ggacaatatt tgtaactttg ttcacaaaaa catacatcac 14581 tgaagctgcg cttataagag ccacttccag agttcgtgca aagggtccta taaaggcacg 14641 cagggacaca ccgcttggag tcacagtttt catcacagag tcactagtca ctacacgtcg 14701 aacaagttgt gtctcatcaa gtcacctcta caacagcatt aattacacaa ggaatatagg 14761 tagtttgaat aaaaatatct ttaacagctt ggagctattg agacaggaac acttccacgc 14821 acatgcacag ttaaacaact tgagtgcaac acacaacatt ggcactaaac gagattgaag 14881 ggggactttt tgtgtgtttt tttttctctt ttcttttttt gttatagtta cttcaagtaa 14941 cacagcttgc ttcatataaa taagttaaaa catctatttt ttttcaagac aaagccattc 15001 aggacaaaga gatgaacaga aagcagatct acttatacag gcgctataat ggcaataaac 15061 aggctcatga ttaaaagatg aattagggca acgagaacag ggcttcttca cagaaggaac 15121 acaagggagt ttcagaaagt caccttagta ctgacactac gcgggatccg ctaatactgc 15181 tcagtacttt aaacgctcag atactcaggg acggaaggcc cctcctgccg cggccatgct 15241 catgcttttc agcttattat cttttttcca cttcattctc cggttttgga accagatttt 15301 aatttgtctc tcggagaggc aaagagcatg tgctatttca atcctccttc tgcgggtcag 15361 gtaacggttg aagtggaact ccttctccag ctccagggtc tggtagcgcg tgtaggccgt 15421 ccgggccctt ttgccttccg ggccgcctat gttgtctgca atagaaaagt cagcggttta 15481 gccaccaact cctgtcttcc aaagtccgcc agggggacaa gcttgggtca tgagcaggga 15541 acccaggcga aaagctcaac aagttctgcc taccagcccg cacacccctc ccgaatttcc 15601 ttctctcttc ctttctagaa agaaaacaat acgatttgga ccctgggaac aatctgccca 15661 tctgaggctg gggccgtgtc ccggcggact ccggctttcc ctggcccctc tcctgccccc 15721 tccgccctgc cccgggcgcc ccgatcggga ggcacagccc tcccaggctg cccaccgcac 15781 agaaacccag gaagcaaggc cctttcctga gcgcccaagt ggccttcggg tcaccctccc 15841 tcaaagttcc agccccgaga gccgcctccc gtttccagcc tgcagggttg gggagcctgt 15901 tttctttttc ttccctttcc ttctctctcc ctcctgcccc caaaattcag aatcctgcag 15961 gctctcgcct cgattctttc ccccaagccc cttttcgggg gctgtaatta gtaacgctgt 16021 ttccccagcg tagccctcct cataaattat ccgccgtgac aagcccgatt cacggctgct 16081 acagccatcc tctacctctc tgcgccttgc tcggctggcc tgacccggga gcgcgtccca 16141 aggcgtgggg ttccagaggg gttttttgct tcctccccct tccaacgtct aaactgtccc 16201 agagaacgcc catttccccc actatttgtg agcgcagggt gctcgcaaag aagaggagga 16261 aggaggaagg caggggaggg agaacggcaa ggagagctcc gcagggctgg gagaaatgag 16321 accaagagag actgggagag ggcggcagag aagagagggg ggaccgagag ccgcgtcccc 16381 gcggtcgcgt ggatttagaa aaaggctggc tttaccatga cttatgtgca gcttgcgcat 16441 ccaggggtag atctggggtt gggcgggcgg cgccgggctc ggctcgctct gcgcactcgc 16501 ctgctcgctg ctggcagggg cgtcctcctc ggctccggac gccgtgccaa ccccctctct 16561 gctgctgatg tgggtgctgc cggcgtcggc cgaggcgccg ctggagttgc ttagggagtt 16621 tttcccgccg tggtggctgt cgctgccggg cgagggggcc acggcggagc agggcagcgg 16681 atcgggctga ggagagtgcg tggacgtggc cggctggctg tacctgggct cggcgggcgc 16741 cgcgctggcg ctggcagcgt agctgcgggc gcgctctccg gagccaaagt ggccggagcc 16801 cgagcggccg acgctgagat ccatgccatt gtagccgtag ccgtacctgc cggagtgcat 16861 gctcgccgag tccctgaatt gctcgctcac ggaactatga tctccataat tatgcaactg 16921 gtagtccggg ccatttggat agcgaccgca aaatgagttt acaaaataag agctcatttg 16981 ttttttgata tgtgtgcttg atttgtggct cgcggtcgtt tgtgcgtcta tagcaccctt 17041 gcacaattta tgatgaatta tggaaatgac tgggacatgt acttggttcc ctcctacgta 17101 ggcacccaaa tatggggtac gacttcgaat cacgtgcttt tgttgtccag tcgtaaatcc 17161 tgcctgatga cctctagagg taaactcgtg cactaatagg ggagttgggt ggaggcgagg 17221 ggggtggcgc gcgcgccccg ggcgcgtgcc cgccgccagt tgccgccgtt cagccggact 17281 cgagcgccac ccgctggagg cagggctcat cgcccagctt ccgaccgggg gctgcaaggg 17341 ccggggtcga attgaggtta cagcccatta tggcaaaatt attgcatttc cctcgcagtt 17401 ccattaggat gtaccaattg ttaggccgtc agctgccgat cgcgcgcccg gcgaggatgc 17461 agaggattgg ggggaggtgg tgacttgcat tttatttaca acaactttat ttcccccgtt 17521 ttgcagcccc tcttattttt gtgtcgaggt tggggtcggt actgaccgtc ctgccagcag 17581 ctctgaattt tgaaaataca gatatcacct tcggggaagg gggaaagcca tttagccaat 17641 tggagaaata aatcctgccc gcagcagcag cagctacaat tacggctctg tttttgcgag 17701 cgcatgaggg acagtgtccc tgccgctctt aaatgacagg cgtctattaa agatagcttt 17761 tgtgtagtgt ttctccaagg cgaggtcaaa ttccatacac ttttataacc gtagtcgatt 17821 tttctttcgt gtgaatatgg ttttcgtgtc attagtttgc gatttgattt gcttacgtat 17881 ccagcctgga aaatcttcat cacagggtcc ggttcctcga gccagccggg ccccaagtcg 17941 gagggttctc cttgaaccca gcgagtgggc ccaggctccc tgcagccaca gaggctgcct 18001 ggggtctggg gatccgtggg gcgggttact ggggtcttgc ttagacctcc aggagtaaaa 18061 tgagggcgat aatggaagca ttccttggca gtgcctagta tctctgtagt tattttccac 18121 ggctccgaaa gactcaagta aatcacaaat atagctgaga ggcaagtgga gtctccccgc 18181 tggaggcccg gcgttgcagg cgcccctggc acgtctggaa gccaggactc tggcggctcc 18241 catggccctg ggcccctcgt tgggtcctga acgctgctgt ggcggcgacg cgggcgctat 18301 cggaggctgg gagcgggaat ccggagccgg gagcctaccc cgggctgtaa tgttccaccc 18361 gcgcccaggt taactcgcct cggctgaggc tgcttctctt ccactgacgg ttgcacacgc 18421 gggaccgaga gactgggctc tgttggggcc ccctttgttc ctcgagcttc cttcctgttc 18481 tgggaggcgg cttgggaggc cgcgacaagg ccgggctcca gctcttagac cccctctttc 18541 cactggccag agatgatttg atgatgccct tcgggactta ctggcgaggg acttaggcag 18601 agacgcccag acacgaaacg gggctcggcc cagggctctt tcctccccag cagccccgcg 18661 tcccgaggtc ggggagctca gagacactag cacaggagcc ccagacgcat tcagggcgca 18721 ccccagaact ccggagccgg tttgggcatc cttgtggagc gggactgggt gtgtgcagtg 18781 cgccccgctc caccgctggt attggctgtg tgtgaggttt tgttttgttt tgttttgttt 18841 tgttttgttt tgttttgttt tgttttgtaa gaaataaatg cacagacgct tgcaaagctc 18901 cgggctcccc tgaagctgcg gaagccccca gatgggagca ggcggggaga aaagttgggg 18961 aacaggcgag ggcaaggggg caaagccgaa ggaggttgca gcgctggcct ggtccctgcc 19021 caggcatcta ctcgcccgcc tttgcctctg agtcctcccc gctgggctgc gtggaattga 19081 tgagcttgtt ttcctttttc cacttcatgc ggcggttctg gaaccagatc ttgatctggc 19141 gctcggtgag gcagagcgcg ttggcgatct cgatgcggcg gcgccgtgtc aggtagcggt 19201 tgaagtggaa ctccttctcc agctccagtg tctggtagcg cgtgtaggtc tggcggcctc 19261 ggcgcccatg gctcccatac acagcaccta cgagcagaaa cggccgggcg ccggtaagcc 19321 agggcctgga gggttgaccc agtcagccca gtgcctccca caagaggcac ccagactaga 19381 aaccaccccg cctccactcc agcctctccc actatgtctg gggcccaaag catactgaat 19441 gggagattat ttcataccaa gcgagatgtt ttccacctat aacgacttgg ggtggacatt 19501 atcgtttttg aaaaatctgc ctcataggaa aaccattaca agcaaaataa cagaatgagc 19561 attcttaatt tgggatgccc aggtcgtgtt tttgtgggga tgcagggtgt gtgtgtgtgt 19621 ttgtgtgtgt gtgtgtgtgt gtgtctgggg tgggtggtgt ggaaccatgt aagtagtgta 19681 cacactctga attcatagtg gtctctctga ggatgctact gtgaccagcc atgtcggaac 19741 tcaacacatt ttttgaagag ttaaaaatgg tcagccctct tgaatcttta cacattctcc 19801 caaattatat gacccttctg agaaatgcag aactccccca aggataaagt ggatgttaat 19861 ggaaaacatt ataattttaa caatccgcat taggctctgt cctagggaat tcctaggagt 19921 ctttgggatt ccctgtggag tccttggaaa ccagaggcta agagatggag ctataggtag 19981 tctaggattt ctggcaatgt aggattgtgt tgggctgtct gatctggtgc caaagttccc 20041 caagggtggt ggggggtccc tccaagacag gtgtatgaag ctcaggcatg ggctcagaag 20101 agggcagaag ttggctaaga gtgggcagtt gagtagaagc cgggggagat gagggaggag 20161 agagaaaaaa atcctgagtc tggggctgtg gccctccaag gctttgggga gacactggaa 20221 gaggccaaat ggccacctct ctgtacgggt ttctgaaggg cagaaaggag agggggttgg 20281 gaagcaaagg gtatttatca gtgacgcagt gcaaaaggcc ctctggcttg ggaagctgag 20341 cagggtactc agggtgcatg gacaggctgt ccctccccac cagcctctct ctctcttcag 20401 gctgtccctg tcacaggtct catatacgtg ccagtgcccc catcctcccc taaggtgtca 20461 gcttacatgg acactggggc cttgcccttc ctctgccatg gcctgatagc cccattggga 20521 actgattttt tcttctcttt taagaagcca aagaaacttt gctggttctt tcatcctttc 20581 tttctctcta ttcctctttc tactctgttt cctccttctt tctttctttt cctgcctcct 20641 ttcccctctc cctccactgt cttgggatat gtcttacccg cgcaggagtt catccgctgc 20701 atccaagggt aaaccgggct cgtgtacttc cggtcggcgc cttcgtcatg gagtgctttg 20761 ccctgcccgc tgctgctgtc gggtttgtac tgctgctcgg gagaaaagtg caggtagtcc 20821 ccggggcccc tctgcttgcc actgcccgag ggcgaggcgc cactgaggtc cttatcagaa 20881 tagaaacacg aggccccgta ctcgtaggac gcccggttgc aggccaggac cgagttggac 20941 tgttggtaga aacaaggtga ggtgtacgtc ttgtccggga gactcgacgc cccgtacgag 21001 gccgggaagg gcctcagcgc gtcatagcca gcctggtaga ggggcagctg gcccaagaag 21061 gagtcctggc cgctgggaag gctcccgggg aaagtgggat tcacaaaata ggaactcatt 21121 tgcgcgcccc tctgcaggac tgtgatttgt tgtgtattag tacatctggc tataactatt 21181 agtagtcatc gaactggttt gtttctggat ggccgaacgc caaaagcgac agcagcaaat 21241 cgcaccagct gacgcggcgg cggccaatgg gagcgaacgc cggagcccgc tccccgggga 21301 ccgcggcgac cgaggcagca aagttacaaa cagcccccag cccgcccgcc cgccgcccgc 21361 ccggcagatt aatgggcgcg cacagctagc cggcccggct ctcttcccga acgccagcgg 21421 cgggcacggc cctttcagtg gcgagcagat caggccagag tagggacact aatgcttggg 21481 ccgcctcaaa ggccccgcca ggtcccctcc ttcctcctcc cctttttgct ttattgaatt 21541 ttgcgggttc cccgcccctc ctcctccaaa caggaaacct gaaggtcctg ctccaggagg 21601 ggcagtgaca aaggtggctt tgctccctgc cggcctgcac tccaggcacg gcccagagtt 21661 gcccctcccc tagcgctccc ctattcgtgg gtgcgagttc ttgagctgga gcaagggcca 21721 tgagaaacca ggccttcact ctatgctctt cactcgtgcg ccttaggaat cggtgaactc 21781 cctcattctc tccttctccg ctagcaccag gggcctgggc cttgcagccc agagctgaag 21841 gatcacctcg agctgaccag gactcagcca aacgtaggtt cttcccaccc attccctcct 21901 cccacataca catcctgttt gagtcccaag gggctcaatg gcccgcgcct gcctggtgtc 21961 aaagcactcc ttccagctgg gcgtccccat cccacctttg aacccaggga attgaaaggc 22021 cagaaaccac tggtttcaac aagaaaggac cgacagagca gtggccgtga gactgggctc 22081 ctaagacagc cgagggacct gctcccctgc ctagcgcctg gggactcagc atccactgcg 22141 agaaccgctc agagctgagg cagataaaaa tccagatgtg gaatctctct gtccacattc 22201 cagaggcaca gaagacgcag aggccctggc caagctgggc acagagggca gctctgcagg 22261 gcccaaggcc gcgggataat tgatgggctc ggtttccggg gcgcccgaca accggcactg 22321 tcctggggcc gccctcttta ctgcccttcc tgggaccgca gggagggctg ccgcccctca 22381 gtcgctccgg aaccgggaat agtctcaccg gctccagaaa tcctgaattg ctttcactgc 22441 tgtcccaggc ctggcggtcc ccagacagga cattaaaagc ctgtcctgag ctgctcactc 22501 ttgcctgaga ggtgtgcctt cgagatagtt gacttttaca acacatggat caatctcgtt 22561 ttgtaaaatg tgatagcagc ccaatggcat ttttacaact gtgtttgttg ggcttgtaaa 22621 accccttgaa ttagctttaa aatcctcaat tctgtggagg ttgaagaaag actatttacg 22681 tgatttggat ggatttctgg agactgttta caagatctgt aaaaccctgg ccaatgcagg 22741 ctcacccggt tcaggactta gtggggaggt ggcaattggc agtgttccag taggaatcta 22801 gtaatgactt agtctcttcc ttgcttttca atttttttcc tgaactctcc tcccacaaag 22861 ggaaaaaatt gggaaaggac tggcctaagg tcgacctccg aagagcccaa gaggcactcc 22921 tggaaggcgg ctgctagccc ggcaacgggg gctacccctg gaagccggat gcctgttccc 22981 cagtcgatcc gtcctggaaa gggtttactt tgcatataaa gcagggcctc gaatgagagg 23041 atagaattgg gcatttccca cacattactt cagggaaaga attcaatagg tgaaaatgaa 23101 aactcgagtc aaaaaattaa aagggacttt taaaaatagc tgtttgaaga aaaacatgtt 23161 tgtgttattt ttttcttaaa caaaattatc agacaacagt gattttaagc aagttattat 23221 ttaaaatgaa aacaaccgtt tggagatgac aatcaatgtt tacatttatg atttcctttt 23281 cattccgcca tgtcccacat agtggtgggt cagtgtcaat ttactattat tattatttgt 23341 catgccttga ccaggattct cgatctaatt attgttccag ctaacaagag aatttgattt 23401 ctcagctaag aaccagaaaa cattatattt ctagcttcta taggcactaa gatggaaaaa 23461 catatattac attggaatct ttacatgtta gccaaggcta gtgcatatca tttctccaac 23521 cagatattct ctctctctct ctctctctag agagagaggg ctattccggc aagaataggc 23581 accccagacc ctccaaaggg cctctcctgc gatggtgaga atggccaccc cagggctctc 23641 tctcctccct gccaaagcca cgcagaaagc gacttacctt tcagtttctc tccaaggggg 23701 gcaggacggc gagctgggcc ccactgtctg cccttgccct gggctgccca ttgaacagct 23761 ccacctctgc cagcctggag ggtggtctgg gagtatgtag gtgtgggacc ttgaaggcca 23821 cctcggtcca gccagctcgc tagcccagac cagagcccaa agcacacagg tgcggcctct 23881 gcccttcaca gccctctcct ggagccaaag ccgtccttct ttgccatagc caatcctcct 23941 cagagcagcc aggcctacag gcctcaggac tacaggtagt ctctccgatt gcagaatccc 24001 aaagccagag tgttccctga accagcgtgc gcccccagaa cgccttggga cgcgcagttt 24061 ggacagaagc agccatgacg tgccaggcgg cctgtgcaac cagtctccat ggtgctggag 24121 tgggggtggg ggaggacccg agtctataac ggcgtccagc tccggtcagg ggtagccgag 24181 gcggggagat ttcccggcca tataggctct gggtgatcgc cccccaggtc cccttgcgtc 24241 cccctctacc ctgcttgctg ggaaatctct gaggggccag tacagcagtc gcgccaggct 24301 gcaagaggcg ggctgcagct ggcgccggtc ccggcgacgg ccacggcgtg gcagcagcga 24361 gcgccagcac ggtcgcaata aataatcggc ccgcggcagc cgcagtcagg aaggcggcgg 24421 acctaggatg caaatgcgcc gctttatccg ccccgagcgc acgcagagca cgggctaacg 24481 tcctaaacat ccacagcccc ttcctattta tcgaatccat accaagatat tgtctcaagt 24541 tggggaaaca acaacccaaa caaaaaacaa acaaaaagaa cctcggactc ttttcttgtc 24601 gctgtccgag gagttctcgg ggcaaatctg gtctgatgtg cacttaggcc gagctacagt 24661 tcgagccacc gcgctacgca aagccaccgc cagcagcacg ctctgcgctt cctcccgggc 24721 agctccacca tcctctcccc gctcgcggaa ggtggggagt gcggtcgcca gtccgttgcg 24781 ccaaggtgga ttggggggtc cccctatcgc ccagactcgg tttgcgtcct tccttcaacc 24841 ttagctgcgg gaccctgcca cgcgcgctaa cagcattatg tcctctgcac cgaaacttcc 24901 caccaagtaa cacccaatta atcccatcct ctcctccaac acagaattcc acccacgcac 24961 ctattccccc tcccagcgct tcagacacct ctctcatcga aaaaccgggc gaagggaagc 25021 cggcaacgag cggagaactc tggctgaatt agtaagtttt tctgtttccc ccttctacct 25081 ttctctctga aaccaattcc cacggagtag aaagtgctgt gctgatgcta ataaaatgcc 25141 tgtggattaa tgcactttgc cctcacctct tcccaccttc ccaggtcaag gatggtctcc 25201 tcttgttggg gaatccaaca aaaccaaata gggccatgaa tgaaaacaca aaaatccaaa 25261 cccaaagggc tggcggggaa cacagtgtgc cgggccactt ccttttgggc ttcacggggt 25321 cgctagggaa ggggagagag acgacccaag ggatgctgcc ctgtgccttg gcgctgtgcc 25381 acggagggcc gaaggaaggc gggcacttac tgccaaggcg agccagggag gggagagaaa 25441 gggagtggtg tgtgtgtgtg tgtgcgtgcg cgcgtgtgtg cctgtgtgtg tgacgtgtgt 25501 gggctgtagt gtgtgatggg ggcaagtgtg aatgggtgag tatgggtaag ggggagtatg 25561 cccctcttaa agcctccctg ggtgccctct cccaccgcta ccgccgccgg ctgtcgcctc 25621 accacctttg ctcctatctc ctccccacat gtctcacctt cagacggtgg ctcccagaag 25681 ctcctgcccc tctgacagct gtcgcttggg cagcccgaga gaagaattgt cctctttcct 25741 ggtgccagag gacgcaggaa attagccagg ttgcgagttg caaagctgct gccgcggcgc 25801 cgggaacgga gcgcgcccaa tctccagcgg gagccgccag gcctggcctg gccggggctt 25861 cccttcgctc gccatctccg gacaaagcac agccgagccc ggctggaagg cagagctccg 25921 aagcaggcag gacggagcgg agcaaaagaa tgcggctcta ttctcgcaag ggaaattata 25981 aaaaagttca tgttcacggt tctcatccac atgaccgaca gcggccaatg gaagggccga 26041 acaactcata aagttgtatt gcaaagttgt aaattttcat aaacaacaac ggatttatga 26101 ccctttcccc atcactgaga ggaggcagct cttacaccgg cgccatctta ccaccgaggc 26161 cgccccgact tggggcctca ggttttacag acccttttgg gccaggtttt actaaaagag 26221 ccataagaag cgggcccagc ccaggcagga gactggagac gaggtcttgc aggcggaact 26281 caggatgctc tgagctgccc gcacaacccc tggaccttca cccctcgccc cttccccgca 26341 tccagctgcc ccagcccctg cccaggctgc gtagcctagc gggggtctgc ggtcctagcc 26401 cctccccgcg ccacctactg cagtgccgga ccctggggcc ccctcgcctg gtctgcaggc 26461 ggggtgggga ccttaaatcc catttcctag cctggggctg ggttcagggc gcatgcgaat 26521 ccggaatcag ctctgggtaa tgcccctttc caagcccact gctcagcctt agaggaaagt 26581 gtggatttga aatttcctca tggaattgat ggaggttttt aggtagattc atagaatata 26641 acgtatctac caaagattcc gttttcaagg gatctagaag atgttagtgc acacgcaaaa 26701 accagacaaa cgtctctaca cggataaagg cacatataca attatgcaca cagggaaggg 26761 catacactct attgtgggca cagaatgaca tgcaattatg gacacacaaa aacacatgca 26821 cccaattatg gacaccaaaa tatatacaat tgtggaatta ggtaaaaaca cacacacaga 26881 aatacataca cagaaaaata agcacatact catacaaata cacacataaa aatacattaa 26941 aaagatacat gacaccaata catgggtacc caacacttgg accatcacaa ggacagccac 27001 cccacttttg cttccccact gccccctgcc ctccagccat actcacctcc cctttcccag 27061 tcccctctgg ataaggcagt ccacattttt ctttgtcacc acgcatcttt attttcggtt 27121 acataaaaca cagctgggct gggaagtgtg ccttccctga accccaggat ggagctgagc 27181 agggtacagg acaacacagg agatgaaggg cattgcggag ggcattggac ctccccaccc 27241 actacagtta actcaagaca acataccatg ctacaaagtc accccattaa cacatccttt 27301 ccaagtcaag acactgcctt acaaatgaac tccaagacta tagaaatgat aaaaaaaaat 27361 cttgttcaaa tatacagtat ctgctattat aggaaacatc agggcgtaca tatttaacac 27421 agctgaacag taagatacag gagccagagg aaaggacagc gaagctggaa gcatctccac 27481 agtcctgcta agcagaagct aacccacaga tctgcagcca gctcaggaac attcccctcc 27541 agaagtgggg gttgatgggc ctgagctgtg ggtgccaagc cagagaagga gggattgatt 27601 ctagggtgca agcacttagg atgctttttg gaataaatat attatttttc gatttaaata 27661 gatgccaata ccctgatcct ggacctcagc acattctcag ggcagcctca gggaccccaa 27721 aagctgcggg ctgtaagcag caggggactt gcctgggagc agtcggcact aggtagcagg 27781 caagccagcc agcacaaaat aggtagtttt aggggagtag gtagtagtga gattcacttt 27841 cttgcgggtc tgggagggtg gtgctgggtg tctgccagtg ttgggataca tagggacttc 27901 ctgggaatgg aggccctctg gggctggata cataggtagt ttgggggtgc ctcgagcaga 27961 ggcctgtgct aggtagtatt ttggacgcgc cagagcaggg ccggctggcc tggggttggg 28021 ggtgtctttt ggggtcctcg gaggcagagg gaatccaagg cgacccagtc tctgcggccg 28081 ctcagtccac aaaagttggg agctggagta ggtgatgggg gtgggtagag tgcaggttgg 28141 ggactgggtt gcttttttgt ttttgttttt gttttttaca ttttctttta tttttcccat 28201 ttttgtaagt aaaaccagtg agtctcttaa agacgctttt ccgactgtcc ggtgcagaga 28261 gggccccgga tcggcccctc attcctcctc gtcttcctct tcttcatcat cgtcctcctc 28321 gtcggccttg tccgcggcag cagtggcggc ggcagagggc acggcgccct cgggagctgc 28381 ggcggcagtc ggaccttcgt ccttatgctc tttcttccac ttcatgcggc ggttctggaa 28441 ccagatctta atctggcgct cggtgaggca gagcgcgtgg gcgatttcaa tgcggcggcg 28501 ccgcgtcagg tagcggttga agtggaactc cttctccagc tccagcgtct ggtagcgcgt 28561 gtaggtctgg cggccccgct tcctgtcagg tcctgagaac agacatgcag acacatgaac 28621 acaaggacag acaagtagac agggcactcg ttaggctgct gtcccagagc ccgcaccttc 28681 ctcctggcct agtccccagc gagcatcccc ctctgcccca ggccccgaac tgagctaggg 28741 gaggaggggg agtgttaggg aaagacccca actgcagtgc cagacgcgca ggcagctctg 28801 taatgagcaa aggcacagaa tctcaacttt acaaccgacc tttccagccg gctaagcttc 28861 cacaatgtcc tgcttcctct gacaaaggaa aactgtaaat atagagtgtg agcaagtggg 28921 aaacgctgca cttttgccat tcaaagatga gcccggccat tcccctgcct tgctaggcaa 28981 gtgggcgact cttcccagca gcctgagccc tcatccccag gaccttccta gggcaccccg 29041 accctctgtc ctcattccct cgcccccatc ttgaaatgga ccctggcaca gggtcgggtg 29101 agaggccctg gagggcttgg ctctcctagc ttttgagaaa gaaatgtcag gcagcaagga 29161 aaatgaggag agagagaaga agaaagggag ggagggtgac agaggaggga gaaagagaga 29221 cagaatagcg aacaaactta atgttaaaat tccaagacaa atggagttaa ataaatttac 29281 gaggatcgaa cccattaatt gggccataaa aagttttatg agcctcattt acatacaatg 29341 ctatgggctc cacgcaatgg cgcctccgct ccaattaaaa ccagaaaggc tgcgccggga 29401 gtcacggggc taccggctcg caacagcctg gctccgctct tccggccccg cgccccgcgc 29461 tccgcgctcc ccagcgctgc gctccccgct cccggtcccg ctccgccagc ctggcccgcc 29521 tagcgactgc gcctacctga agaccgcatc caggggtaga tgcggaaatt ggcctcagcc 29581 gcgccatgca gcgcgccctc gtccgtcttg tcgcaggcgc ctttggcgag gtcactgcag 29641 agcccgggga tgttttggtc gtaggaggcg cagggcaggt tgccgtaggc gtcggcgccc 29701 aggccgtagc cggacgcaaa ggggctctga taaagggggc tgttgacatt gtataagccc 29761 ggaacggtcg aggcgaaggc gccggcgccc gccccgtagc cgcttctctg tgagttggga 29821 gcaaaggagc aagaagtcgg ctcggcattt tggaacagag aagcccccgc cgtatatttg 29881 ctaaaaagcg cgttcacata atacgaagaa ctcataattt tgacctgtga tttgttgtcc 29941 ggcagctttc agtgtcggtt ttacgaggta gagtgatata tgataacatt acacccccag 30001 atttacacca aaccccattt tcttttggac ggagctcgcc gcagcacgtg accgcccaca 30061 tgaccgcctc cgccaatctc agcagtcctc acaggtggtc tcgctccgca gggcccgcag 30121 ccgcctagaa tggaagggca agaggctcaa atatgcggcc aaagaatccg cccgcgcccg 30181 gcgggcctgg cgcgtcccgc ggaaaaagac ctggaggctc cgcgggagcg cccagctggc 30241 ggccaacctc cgcactgggg tctgcggacg ccaggcggcc cggccccacg cagcaccccc 30301 caccccgccc ccccgccgac tcctgctagt gagccctgga ccaagcttgg gatcctcccc 30361 atccctctcc tgtccgcctg cccagaccct ggaagggtct ctgtcccccg caacagcctg 30421 ccccgcggtg gccttgtggg caggactcag ctatgagcag atcgactctg cccaagtctt 30481 ctctcaccca ggtccagtgg gcgacaggcc ggacttagac tcggatccag acggggaagg 30541 cgcagcatct cttgcagctg cagagagatt gccaccgcaa actggagcca tgtggttcga 30601 ataaagtcaa cgtctcccag cttcctttcc ttaatcggag gcacactgtt tatccgccct 30661 aaaggaagca gtgaaatatt tatctattaa tgagactcat ttgccaacag atttattaac 30721 gtggggttcc cctccctcct cccggacgct gtagtgctgc aggctctgtg ccttcgctcc 30781 tgggcacctg gctggctcca gcagtccgat aaattgctaa agattccttt gtcctttcca 30841 caacttctgg ttcccctctg gcgcatgggg agccagggct gtttccccca gcttggaaaa 30901 atctcgggcc tgcacccttc caggcactcc caatactgga aggtttctgg ggtaggccgg 30961 ggtgcctggg aacaatacat gctttagagc ggatttggag agggggctct ggcgtctagg 31021 gactgcaacc cactgtggac ttccttttct tttgaagaca ccgaaaacaa ataggggaaa 31081 ccacccccta aaggccaccc agctgtggag gcttcaccta tccaagtacc agctcacatg 31141 gagctgagcg cacagtggtc tcacttcctc ccatccaggc ctccaattcc taccttccag 31201 gcctccgacc cctgcaccat gttgcaccag ctgaagcccc tgccgcccac cgctcggtgg 31261 aggatggtgg ggagaaccct tcccctaaac gcctcataaa ctgcccggct gcgctggaag 31321 cccggccacc ttttactgct cagctcgatt acacaccata aacccgacct cacaatggaa 31381 ttgccccgga aattctcctg taaattgtag agtttagttc cttgtgtact gagcacttcc 31441 catcactttc agagggtcca ggccgtcccc gtttaccgca ccctctcagg gggccccagg 31501 aatacaaaag gtggagggag ggttcagatc gggacgccac ggagccccag ttcctgcgca 31561 gtgaaccacc ggggcttggg taggaaggcg ggggctggtg gggcaggtgg gcgctgggct 31621 gtctcctccc gggccacgga agcctggggg taggggtgtg gagtgaggga caacatccgg 31681 ggacgccttt atggcggatc agatgatacc ttgtctccga tggagaatag agggagtaag 31741 acaaaaagaa aagacctaaa taatcccagt cgcatcgcct tctggaagga aataaaaacc 31801 caagaaagtt aaaggaagat gaagcaaaca agaaggctag gagatcaaag aggcagaatt 31861 agaccgtttt agaccaataa atttttcctg gggtgacaca cagcaagacg caaagagaaa 31921 agaccaaggc ccccgccgcc gccgtctgtc tagactcaag cgactgaagg ggccaacaga 31981 gctggtgttt aaagtagaac ctgcccagtc caacagcccg agcagggagc gatttcgggg 32041 atcgcgggaa ggaacgcact tcgccaaggg agggccgggt gccctcgcca ccggctcatt 32101 cctgctccgg ttttgcccga tgcgcgtcca ggaggttctg gcaggacgca ctgcccctct 32161 gccccggcca aggaggatgc ggatactgcc cgcaaggctt cggcctttat ggacccaagt 32221 cagccaactg ggccgagtcc tgcggacacc gaaacctccc tttcgtttcc aggcttcctt 32281 ctcccctctt gccctctgtg gtctgattta aaacgaaaag gtcggataaa atcaggcttt 32341 caataaggct tctttaactg tgtgttctct attcattggt tctctactta tttgactgaa 32401 aagacacaaa tgcactaggt tatgtgagat aattttcaca gaaatactca ttgaccctca 32461 gcctgaagca ggcacatgta ggcgggtttc taatcagtag agctatgttt agatagacat 32521 tttccactgc cgtccctgag cttccttcct acctactggc agagcgtgtt cactctgctt 32581 cttgttacca aataccaata tttaaattca attatgaagg taaagccagc tctaggcagg 32641 gaacagcgcc ttccagagat ttgggtggca ggcaaattgc ttcctaaagg tttccaccct 32701 tgcctctctc accagctgca tctgttcccc atccagtgag tgagtgatcc ctagcctggg 32761 tgcaaagaca aactcgtacc tgtcgtgtac aaatgagatt gggttggtgc ggggtcttgc 32821 ctgggtgaag gtccgcagcg gggaagccac tcccaggctt cccagaggaa aagcagactc 32881 ccagccacca gcctgccaag ggcaagggag gagcctttgg acagtccacc caggcctggg 32941 agaccctggg cggcctgcac ctacctgagg cctccgtgat cagtggagag acgcagtcgc 33001 caccgatctg tgacttcatt tatttgtagt tacacaaatc gagctttctc tctgcatcca 33061 cggatctgcg acttcactat ttatttgcag ttaacataaa ttgagctctc tatctgcggc 33121 ttgaatacaa agccacggtt gactcccagg gaagctataa aacccttctt acaatcagac 33181 gttgtgaaat tatatacgga aatgtaatga aggtgtgtgt ctgtaatatc tgtatctatc 33241 tatctatcta tccatatata tatgtatatg tatatatgag atgagagaca gagagggaga 33301 gagagaaagg agagatattg tgtgtgtgtg aaaggaaaga gagaacaaac acccgggaga 33361 gacatcaacc aaaatccagt ccccagtttt acagcgtgaa agcactggga tgcgggtcgt 33421 aaacattttg tgggcttggc ggagactatt acgacccaaa taaatgcact gtgtagcgtg 33481 ttcacagggc tccggggcct ttcgaaaggt tctctgtttg cttttgcgtt ttgcctctgg 33541 aaccattcga catccgtggc tgatgcgcgc acccgagagg aggccgaagc gtgttccccg 33601 cctagggtct gggagaggct gggcctggat tggggtcccc tcccttctgg cctcctgatg 33661 ggtgaacgcc agagcagcct cggttcctgt acagcggagg gcatgccgcg gccagagaca 33721 gcccgggcgg cttccacact gtgtgcgaca cttttggtgc tacggtgtga ttgttatatt 33781 aaacatcaca ttaaaaaaat aaccaaagca gtccctggtt tgtcccccag gattctcctc 33841 ccagccaact cggcccggcc cctataccta cagtggtcag acaccgagat tgcctttacg 33901 cccaattttc accgcctaat tttattgtct cgattcaagc cagccgagtc ccggggtcct 33961 ggctctgtct tgggttctgg ctcgccgggc cctgacccag gtcgcaggtt tcgggggcct 34021 cctgggcggc cgcggacccc gcggtcacgg tgtgagccct cggcacgcac cgtgcacacc 34081 ctctgggcgg tcatcaagtt ctggggctgc aggcgttcct cctgtgtccg ccagtcagcc 34141 ggggctctct ggcgcctggc tgtgtcaggc ctgtcccagc tgaggacggt gaccctggag 34201 ccgcgcccgc gcctggagct gcagccccgc ggccgcccgg acagctgcag cccggcagca 34261 ctgggccccc gggatggggt cagggaggcg cgagcggtga gggtcgggca agcccccgcc 34321 gattccttgt ttcccagcga ggcttctagc caggcttgag cagcccagaa aatatagggc 34381 ggctgttcac taaaatctgg ggctcccatt ccagaaaggc tcgtgtcaag tcgacatctt 34441 aggaacttca caagggtcgc ggaggcggga gatggcggcg cggaagcctc ttgcatggag 34501 ccacactgcc atctgctggc cgccgtttgg tactgcagcc tcagctacct ccgccggcct 34561 gagctttggg agccgccggc tagcccaccg caccccactc ccaaacgagg ggcgtgggcc 34621 taggtcccgg ggagtctgcg tggagcccgg aatctactgc agaggaggca atgccaataa 34681 aagaggtgtt tccgcagccg ctttatcggc ggcagacaga acaaatgaac aaatgaaagg 34741 gctttggtgg aatatcctaa ttggggctgg acagttgtaa actcattaag ggacgaattc 34801 cagacagttc gcagacccgg cctttatggc acactcttac tgcctattct ttgaagttcc 34861 ttctgtctgc aacaacagag ggcgtcccaa cctggcggcg gggtgggggt gggggttagg 34921 ggacgcctca tccacccgcg ccctggttcc ccttttaagc ttagaacaca tccaactttg 34981 cagaagtcgt ggagtggggg aagggggtcg ggcgagggaa gcaagggggt ggaagcgagg 35041 ccgaaccctc cggccacatc tggaaagcgc tcaggcctcc gaagttgtgg ccgttccccc 35101 ccccccgccc ctcgtagccc tttacacccc ataaacggta acagccctca ttttctttta 35161 tggcgcttag gggctcgtca attgcctcag ccctgcctct gttaggcctt gtgctcgaga 35221 ggtgcttgca gtgcaggacc cagattctct tgggccaggc tggcaggact ccgcgggggt 35281 ggcgggcggt gcgaggatgc agggctttgt tcctttgcaa gcccccaaaa tgtttgcctg 35341 catggccacg ttgggccggt ggggctccta tttccctctt cagagcctgc ttagaaggcg 35401 gctcccagct ctctatcctg tcaccctctc ccccttttcc tcaccaagct aacgaaggcc 35461 caggtttggg aattttctcg ggggcaccaa ggtgagagtg gtctttccac tgctactggc 35521 cagaccagac cagctcctca ggggcttctc cgccgtcccg cacatagaca caaggccggg 35581 agcctggctc ctccgccggg tttcccagct ttttctcgcc gccaggcagc tggcctgttt 35641 gggagagctt agctcggcgg actggttgtg gcaggtcccg ctcctgaaag ccagccagag 35701 gcaaaattgg ggctctttct caccacccca acccgcccct acccagagac tctacatagg 35761 ctcccctgac ccagagaaga aggtgcaaag agcagacccg caaaaaatag aaaagaatca 35821 atatatttta tttggcaaaa agttaaatat catctcaaca caacaatttg gtcagtaggc 35881 cttgaggtaa ctattgcaaa atatacagtg taagttcagt ctgatggaaa ccccagattc 35941 atcaaggata caaatctaca gtagcccaat ggcggtttca tagtgtataa tttattatca 36001 ataaaattaa ctccgttaca atcagcattc atttcctcca attaaaatta agcataaacc 36061 ctaggtagta accttctgca catatgtata gctccgaatt tcctcactgt tcgtctggtg 36121 caaaaacaat attcaagctt gtctgattat gcatattttc tttaatcata tagattatat 36181 atacaataga caagacagga ctatatagat aatggacaga cttaaatgcc cgcattttta 36241 aggtggagaa aatgatgaat ctatgcatcc ccgagaacac ttaaaatttt tttttatttc 36301 actgggaaat tcttacagct actttacaat cataggttaa cagcctagtt atacagaaga 36361 catattccac tacagagcta tactctatgc aactgttttt ttcccctcat aaacaacctg 36421 agttcaaatt gaattctatc ttccacaatc acaatgggtg catcacccag tacacagaag 36481 tttgaatcac aaaacataat taccacaata aaacacagtg ttcaagtatc ttggcagagc 36541 aatctgccgc acaaactgca aattaaatta actacacaga ctaaaaacta tacagcctac 36601 catcaacagt tgtgcattat aaaaaggtag tttctttcct tttgttttaa gtcaggaaca 36661 ggtagatttt taaaaatata tatacaagct aacacacaca gctatcagca ctaatgcccc 36721 cccctcaact tttccttttt cttatagaaa atggaaagct tacaatacct cctccatcaa 36781 agcggcaggc ctacgagcca gcctgaacag ggtttgcctt ggaaaagatg tggcctgagg 36841 tttagagccg ctttgtgcgg ggatggtgga ggctagggtg ggggtgagag aagggagaag 36901 gcggaagggg gacggacagt tctttctttt tctctctagc ttaccctttt ttctaaataa 36961 gcccaaatgg catcactcgt cttttgctcg gtctttgttg attttcttca ttttcatcct 37021 gcggttctgg aaccagatct tgacctgcct ctcggtgagg ttgagcagtc gagccacctc 37081 gtacctgcgg tccctggtga ggtacatgtt gaacagaaac tctttctcca gttccagggt 37141 ctggtgtttt gtataggggc accgcttttt ccgagtggag cgcgcatgaa gccagttggc 37201 tgctgggtta tctgcgggga agagaaacac tgggtttagg agcagaagac gcacatcccg 37261 ctggggcaaa tgagcctcct gcatggggtc tctggccgaa gtgcagaact ctgctgaact 37321 ggttaggaaa ggcagtcagg cctcggacac aatggaaccc tggcagacag acgcacagac 37381 agtcacttaa aattgcacgc agtaaaactt tggctcgccc tcccctccga gaccttcctt 37441 tctcctactc tgtcctcttg tcccccttct ccttcctccc actctcaaaa cgctgtatag 37501 aatgaaattt ggaaacaggt ctccttgcca cgcagaggga aagcatgctg ccttgtgttc 37561 tgtagcaaga ttaggattcc tctgtcccgt tcactgactt cgtctttctt tcccaacctg 37621 tccctctacg ccccccactc cttatttaac cttcctggaa ggccttcgga gctgggcaag 37681 ccgtcagggc gccctaaggc cgctgatcac gtctgtggct tatttgaata atctgtcatg 37741 gggacccttg tggcccgggt cgcccgcagc ctcatcttgg caggatttac gccgccactg 37801 gccgaaggca agaagtggaa ggaatcggcc gtctccccca gcgtcccagc tccggctgcc 37861 ctggctgccg ccgctcacgg acaatctagt tgtacaaaag gctctctggg ctgcactgct 37921 ttcgaagaac ggcccaaagt atctcggtcc tgggcctggg cagccaagga gaggggcggc 37981 cagtcttggc tcgtcccgaa gtgcccgccc cgccccctct cgctgcagca gccgcctcct 38041 ctcccgtagc cctgcgggcc gctcttcact gctctccaga cttggggccc tatctgaggc 38101 gtcccaaaca ccaacttctg gctcctggcc ccaactcgag aggcttccag cgaggacgaa 38161 ggcaggctcg agagaaacct ggcgggccag cagatccggg aggccggcgt ggaggcggcg 38221 gcggatttga agggaggaga cacttactgg gatcgatggg gggcttgtct ccgccgctct 38281 cattctcagc attgttttca gagaaggcgc cttcgctggg ttgtttttct ctatcaactg 38341 gaggagaacc acaagcatag tcagtcaggg acaaagtgtg agtgtcaagc gtgggacagt 38401 caccccttct ggccgacagc ggttcaggtt taatgccata aggccggctg gagggcaagc 38461 ccgcgaagga gagcgcaccg ggcgtgggct ccagccagga gcgcatgtac ctgccgtccg 38521 gcgccgccgc cgccacgggc gcctgggggt gcacgtaggg gtggtggtga tggtggtggt 38581 acaccgcagc gggtacagcg ttggcgcccg ccgcgtgcac tgggttccac gaggcgccaa 38641 acaccgtcgc cttggactgg aagctgcacg ggctgaagtc ggggtgctcg gccagcgtcg 38701 ccgcctgccg gggaggctgg cccagggtcc ccggcgcata gcggccaacg ctcagctcat 38761 ccgcggcgtc ggcgcccagc aggaacgagt ccacgtagta gttgcccagg gccccagtgg 38821 tggccatcac cgtgcccagc gcctggcccg cccggcccga cccacggaaa ttatgaaact 38881 gcagatttca tgtaacaact tggtggcacc gggggggaag tacagtcacc taataagttg 38941 ccggcgcccg cgcccccatt ggccgtgcgc gtcacgtgcc cgtccagcag aacaataacg 39001 cgtaaatcac tccgcacgct attaatggtc cgatgttttg cagtcataat ttttatagca 39061 aaagccatat gtttttatgt aaagggatcg tgccgctcta cgatggggtt tgttttaatt 39121 gtggccaacg acgattaaaa gatcaaatct agccttgtct ctgtactctc ccgtctcccc 39181 ccccatacac acacttctta agcggactat tttatatcac aattaatcac gccatcaaga 39241 aggcgcgggt cccgcgtgcg agtgcggcca gcggagcccc tcacataaaa ttagacaata 39301 attgaagcca taaaaaagca gccaaatcgc attgtcgctc tactgtattt aaatctatat 39361 ttatgatatt tcataaggag ttattgtttc agaagccaca caggctggcg ggaagtcgga 39421 aacgaccaac agattcgttt gcctcgccgt ggctcccagc tgtaaaaatt tacgaggact 39481 tggaaaggtt agactgttgt gtttggttgg cgagctccct gtaaataatc cctgcggtcc 39541 ccgggagagg cgagtttacc cgcggccgcc ctcgaaaagt caaattcaac gcaggatccg 39601 tcccaaacgg agccgccgcc ggccctacca gggcactcca ggcagggacc ggccgctcag 39661 ggagtaccgc gggtgtaggt ccccacagct acccgcctgg agcgaggggc gcccgggcaa 39721 cccttaaatt cgcctttgct acgaggaccc cacggaggag ctggccagga gggagcggcc 39781 agccgccacc agggcgaagg ttttgagggc ctggttggtt gtgcggcgcg ctcggtcccc 39841 ggccctcgac cccacgcaca cgcgcgccca gcccgccttt ctcatcagct ggcaatcagg 39901 attcccaggc gcaggcggct ggcgacccag ccctgtgctc cagcctcaga ggctctaacc 39961 atgagcgctg caagcctggt tgcgctccgt gaatcccagc tggggaaaaa actacaagtg 40021 gcatgaatgg aaggcaagtt cggtttggga aaaggcagcc tcgcctaaga gaccccgcag 40081 ctccggaacc tgggaggccc gcaccgatgt ggcctgtccc ggggccgcgt gagcctttca 40141 gggctccttc ctccctttcc agctgctact ccgggcctcg ccttggttac ctacggggcc 40201 cggagactcg gcggagaggt acaaggccca aagagaggca gccacagctc aaggccaggg 40261 ctggaaatta gaacggggag gggtaaaagg gcatcgactc cagtcccatt cctgggcctg 40321 gccacgttgg ggaagtttat ttctcacccg ttgggggtaa attaaaaggt cgccgccact 40381 ccgttaattg gaaggaaact ccccctgccc ccaattccta acagaaagca gcgactccta 40441 gaacaggggt aatcaaattc acgtgtggat actgtgcctg caacagtgtg tttttcatta 40501 gcccacttcc ctggcggcga ggctggcggc ctcgggcgct tccatctctc tctctctctt 40561 ttgccttcat cctcaccagc agttccagta atccccccct caaacaccct gacacacttc 40621 cggctgggac tcccaaatac cagcgaggct gccaagccgc gcggataccg actgggtgcc 40681 ccttcctgca cccgcgcctg gaagagggaa gtggccgaca caatgaactc cgaaatggcc 40741 ccgtcctgtc cgcctcatct ccctccccct aatattttcc ttgccccata aattcctcta 40801 ggtcatgccc acccccacca cagtccaccg tgtcctaaat accccgcagt ccgccaggcc 40861 tctgagattt tcatttaaaa aattaaccct ggaggaaacc ctggctccca attttaagtg 40921 tctgcaaatg ggctgggcat gtctggatgc cttttccacg ttttatgcct gagaagacac 40981 tgattatttc agtatttttt aagtaaaaaa gctgttcctt taaacagcct tagccccaaa 41041 ataagagagt tactgaacaa ttagcaggcg ttgagtgtta agcagatgtt actggtgcct 41101 agaaagcgga gaaaaatcag aaaaccaaat attgtcttcc ttcagcccaa aggtttggag 41161 ccagaatagt ctgacttttt tgtctgcttt tattttcgaa gtgaaagaga ttgcgtatgc 41221 ataaagaaat aataaatgag tataatttaa agcggcccct tctcgactga gatttctgaa 41281 actgtgatct ctgaaataaa tccaatgccc tggtccagaa aatggcaggg aggagttgaa 41341 gggaatgggg tgtgtggtcc tttcagacag caggtcggtg gctgtgcaac gaaagagttc 41401 ctgggcctcc agtccaggca gggaaaggaa acagctttgt gggcacgaat ctgtaactgt 41461 gtgtgttggg agtagggaga gatagtttgt tttctgtttt ctgtagggaa aatggtcaga 41521 ctggctccca tctgtagact acaatttgga acgatgactc aagtttatat caatggctct 41581 ggaatttggg attctttttg cccatttaaa acatactggc tcttggaagg gtccccttct 41641 ccagcaccca gctgcacaaa ggtgctgtat ggaacatatt ttgtacctaa tggaagccac 41701 tagtaagcaa gcagtcaagc tttgccctag ccagtctccc tcaatgcggt caaaccccaa 41761 gctgtgaccg gcaggccggg aagagccagc taagagcttc ccagcgaatg gccaggctcc 41821 agcgaggctg gttgggcctc agctccagtc cccagtgagg ctggcgaagg cctccctgcc 41881 ctcgatatgg gcacacaaag ctgcagcgaa tgtcccctaa tcagatctcc tagtcagccg 41941 ttagcgacag gcgaagaaac gcaaggctgc cgccatccgg tcgtgatcat aaccgaggcc 42001 ttgtctgcag agtaacacac caggcccaaa cccaccgccc tcggcagccg ccccacgcgg 42061 gcccttcctc ggcagacttc ccaacctcta cttgagccgc agaggaaagt gagaccccct 42121 aggctctcct gaagccagct ctgggcccct ccccaaggat gctttgggaa ggagataagg 42181 aggtgagata gaatctggca gagacggaag atgaaaaaaa gaccaacgga agaaaagaaa 42241 gcaggaagga gcaaaggaaa gaagaaaaaa agaaagaaga gataaaagag gaagggaaga 42301 cagggaaaag aaggaagaaa agagaggcga ctcctagcag cggccgagct tacagagaga 42361 agggtaagtg acaaggccag gatcccagcc tgccctatcc ttcctagtcc agcctgagtc 42421 tccactggaa ggaggttctt tcctgagctc acacccggga ctggtgtgtg tgtgtgtgtg 42481 tgtgtgtgtg tgtgtgtgtg tgttccagcc ttacaagaga atggcaggag gtccctgggc 42541 aggcgcaggc ctccagtggg aggctcagga tggaagcggg cgcccctgac cttgaatggc 42601 ccaaagccca gaattcctac cacgcccccg gcgtccgcga aggagcagcc aacctaaccc 42661 tacctgctgt gaccaggtgg aggtgtgtgg tggaagggga aagccggccg gctggcaaag 42721 cgctgcggag aaagacacga ggctcctgag cagggaaagc cgaggttgcc accgcaggcc 42781 tggcacgacc agggccgtga tgccccgccc ggcccgaccc ccgcgcgcag aggtacctgg 42841 agacgatttc aactgaagta atgaaggcag tgtcgtgctg tcgagagaaa ggtggatccc 42901 aacaacagga aactacctaa atcaccgacc agttctggtg ctgcccgcga agggctgcct 42961 cgcccgccgc cgccgccgcc tccgccgctg ccgccgccgc caaggagaga accctgccat 43021 cgcgcctggc ccggcccagc ccagccccta ggcaacctgc gcccgccagt gcaacagagt 43081 gccccaggcg gccgcaaatg cgtcaaggaa ggggaagcca caggccccag taaggtattc 43141 ctgggaggga gagggaggaa aagagaggga ggaaaggcag ggagagagga ataaaggcgg 43201 ggagcaggcg agacgagagc agctccgaga agcagtgtgc gcgccgcttt cccaaatctt 43261 gcagcccagc gagccggcgc caagaggcgg tagccgtgga aggctcgaaa gcgccaggga 43321 cggtacagat cccggctccc tgtctggccc cggcctcccg cctctcgctc tccccctctc 43381 cctcctagct gcccgcccgc cggggcgcgc gctcctgcgc cccctcccct tagcccccgc 43441 cccccccact ctggcaagca ggaaacgcgt ggccctaagg agtgccgcca ggtgggggag 43501 gccgtactgg gggaggggga ggcccccatg cagtccagcc cgggtccacc taaccgctgc 43561 ctccagcggc tagagatgcg ctgtgggcca ggcctgctcc ccgattcaga ttgagggtca 43621 tctgtgaccc agacacccgc aaaataaccg gcctctgcaa gcctccctgg ggctcccgaa 43681 agaaatcctg tttggcttcc tctgtctatg tagctcccct ctcaactgaa atcactggtc 43741 caagacagcc acaatccaga tatactagca agccataaaa ctgaacaaaa gccgacaaac 43801 cttacggcac cagggctata gggcccagac aaattacgac cgtctaggta atatttaaat 43861 agatgcctaa agtaattgca ggggctcggg ttagtcctcc tgttgtaaaa gtggtgaacg 43921 ggataaagtg gaaggtgaag ataagaagtc aaaaaagagg taaaattaaa aagcttcatt 43981 ccacagcttt tattctataa gaacataaac atcgtctttt tccacgcaca gcagcaatac 44041 aatattaatt tattctgatt taagattaga agtaaataga gctagaacta acatttataa 44101 taacgataga atgcatttgc ttaaaacaag attggcaatt cttcataata atagacattt 44161 atacagattt gtatttatag tttttttctt tttctgcatc tacaggtttg acccttttat 44221 gcatgtaact tcacagtata aaggaaatcc aaacaatgtc tcccttctct agtttattcc 44281 gcttacccca gtcctcctag gcttctgtga atttcagaaa gcaaaaacaa acaaaaaaaa 44341 aactttttgt tcaaggcgat ttaaaaaaac aacttcacaa gatagggaga attgtggtgt 44401 gcttgtcaca tgttaccagt ggtaacaatt ttaatgacaa aaaaatccac taattccaaa 44461 tgcataaaca aaactacatt atttatctac agccagaagg atatggaaat ttagaggtaa 44521 atgaaacatt ttagtcaggc aatgtaagac cttacagaaa ctggaagaga agtccccttc 44581 tcttggtaat tctttttttt tctttttaaa gctgggatat cttacagagg aaggaaaaat 44641 taaccttttt tactttcttt ctcacttttt aaatcagcca aagtcaagcc cgtttgccaa 44701 cctgcatgtc catgcctgta agcccttctc ttggccaagg aagaaaggaa gaaagaaaaa 44761 agaaacccag gggcctgtat cccctgatta aacacagcac agcactccag gcagacatgc 44821 ccggtggcgg ctcctttgca ccattgacct caggccagac acctcagcgc caacaatggg 44881 acctcggcct tccggctagg tttgccccag gctgggcagg aaaccagctc ggccgaagac 44941 aggggccatt tcgagcagtg ggaccccaag acagcaaacc cagcccagtc aggacttgac 45001 acttaggaca atatctatct ctatagaatt ttagcatgat atggcttttt cccccagaaa 45061 acaacaaata aaccagcacc aagcaaacac aaagaaacaa aaagtcagaa caaaccagcc 45121 ctgcacagat gtaacggccc aggagatggc gagtgtggga gggaggaaca gggctccagc 45181 acaggtgcga gttcctgggc agagcctgaa gacagaggga ggggaccagc gctcgggaag 45241 tgaaaaaacc gcgtcgcctg gagattcatc aggaaaaatt aaagttggct gtgagctccc 45301 ggatccggtt ttctcgattc attttcttca gtttcatcct gcggttctga aaccagattt 45361 tcacttgtct gtccgtgagg tggacgctgc ggctaatctc taggcgccgc tctcgagtaa 45421 ggtacatatt gaacagaaac tccttctcca gctccagtgt ctggtgcttc gtgtaggggc 45481 agcgcttctt ccgaccactc tttgccgtga gccagttggc tgcgttttca cctttggaat 45541 tgcctggcat gtaagagaat aaagagggga tgattaagtc gaggccacac gggctgcccg 45601 cgggggtgaa ttgcctccgt ttctcccata agagagatgt cccaagtcac agagaagaga 45661 gtcgtgcccc cgtttttggc ttgctgagag caagcagttc ctccaaaagt catgacaaaa 45721 attgagtggg ccttcaatct acatgaccct tccaatttac atttccccca cttttccaaa 45781 cacctgtttt agcgctaagc cgtagatgct tgcagaagga aaggcctggg agggcaggct 45841 gtacatcttg aacttaacgc ttttctttgc ctcttgccat atggcagaca agcatttcct 45901 gtagccccca ggctaggagc gcggggtgct tacttggaag atgggccagg cagcttggct 45961 gtctcacccc agggccctga ttgcccaaga ctcgataagg ggagaaagaa gggcatcatt 46021 ggtccaatgg ggaaggcagg aaaaaccgat tcgggggtca agggccctcc ctcaagtttc 46081 tccaggagcc agggatagat agctgggcga ttccgaggtc caggggaagg gaaatggccc 46141 ttctctggct ctcagctcag ggccccccgc ttccaggccg gatttgcctt ttcttcttcc 46201 cgcaacgaag attcccgccc ctcagcaact ttgaaaaaag catgggggat cgtaaactcg 46261 aacttcgccg gttaatgggc ttatttattg gcgctggcgg ctgcttattt tggatgcctt 46321 acaaacatcc gcgctatctg cgggcgagct actttccctc cctccccctc cccccgcgtg 46381 ggccgcgccg cgcaggctgg gcagggacca gggctctggg tcctcccggc cacaggaaag 46441 agcgcacagg agggggcctg ctcgctggtg tcctcgtccc tagtcagggg gagctgaggc 46501 cagcgccgag gacgtcttgc tgtggggcgc taagccggac atgaatttta ctgcgtcccc 46561 acgcccaaat attaaaaagc aagttcacaa ggtcagcctg cctgcagctt gggccaaggc 46621 cggccggctg ctgcgcgggc tcctagtttt ctgatccttc tcctccttgt gtctgcctgt 46681 ctgcccgcct gactgcagcc ctctgcagcc ctgcttaccc agggaatcct tctccggcga 46741 ggctttgctg ctctcggaag gggccgggga gagctcctcc gcggccgagg acgacgcgtg 46801 cgcctcctcg tcgccctgcg agcccccgcc gctgccgcaa gccagcgtgg ggggcggcgg 46861 cgaatcgagg gctcgctcct tccgggccgc atcggccgag ccggaggcta gcgcgggcgg 46921 gagatcgaaa ccgcgccccg ggggctgcgc ggggaacggg ccagccccga gttgctgcgc 46981 gccgccgccg ccgctgccat agcccttggc ggtgccgtag gcctgagaaa ggcggaagta 47041 gccaggcact ggcaccccgc tggaggtgcc cagggcgcag ccgtcgggcg gcgggccccg 47101 cgggaaggga gccagttcgg cggcggtggc cgagactttg gggcatttgt ccgccgagtc 47161 gtagaggcag taggagctct cttctttgat gttctgcgcg aaagagcacg aggtggcctg 47221 cggcgctggc tggggtggtt gcggcggggg cggcggctgc tgctggggcg gcggcggcgg 47281 cccgtcaggc ggctccatcc ggcaagaccg gggcgcgtct agccacaggt ctatgggcga 47341 gggcccgtag ccgtgcgccc cgggacctag acccccgcca ccgccaccgc tgcccggcga 47401 cgctgcctca ttgcgcttgc cgcccagcgt ggggaagagc ccgcagctct gcagcccgta 47461 gggcaggtcg gcggcgggcg gcaggtagac cccgccgtgg gcgtagtaac cgccaccgcc 47521 gccgcccccc gcgccaccac caccgccgcc tgcctcgcct ctgcccgagc tgatgagcga 47581 gtcgaccaaa aaagagttcg cggcgggctc tccgagcatg acattgttgt gggataattt 47641 ggcgaaggga gcagatagcc ctttctggct gacatttctt gtgcaaaaca tgctgaatac 47701 gattagcaat ccccccgcac cgcggcgggc gcccgcagcc aatcccgagc cagagtttcc 47761 gcgcgaccac tcccagtttg gtttcgtagg cgcggggccg ctctccgagg gcgccctcag 47821 agcccgcgat tgatataaat atgtaatctg tattgatggg ccaggagacg caccccgaca 47881 ccttggcccg aaggccggga gctgtggggg ctgccccaac gtggctggtg gggggcctgg 47941 ccattgggct cgccccgccc ctacccggac gtgagcccca taccggggtc ccttagaagg 48001 gcccttgggc cccgcgcagt taacaagtgg ggtgtttatg gtgcgcgccc agtctgcctt 48061 gggtgctcac catccctgtc gcagaagctg ccactagtcc ccggtgtact ctaaccactg 48121 aagcggccgt gtcggggact cacgcgcttc ccattcagct ctggatctgg aactggcccc 48181 ttgtctgaat tctgcctcct caaaagtggc gaacctggcc ctatggccgt caggatcctc 48241 agagtgtcag gagcccagag tgaactagaa gctgacttgc ctctacttcc agtatccaca 48301 gatttttccc caaaatgcag tggttgttcc ctagccccta acccccaaca cttttctccc 48361 ctagtccgtt gatccagtta ggatcttctc ctctggtgtt gtgctcaccc ggcttgcctt 48421 ctctacaact cctgtcagtg gactttggtg ggctggcggg actgggggtg ggaagcggct 48481 gccagccttg atggaaagac ggccccactc ccactcccaa ggccagagcc agatccaagc 48541 atcccctcat cacctaggga agactctaag cccacagtca tttcggggaa gtaaaatgct 48601 ggccttgggg tccttggctg gcccaaagtg agagacttgg aggggtgtgc ccggtgtaca 48661 cctccgctgc tttatgttcc tgctctcctg ttagctttgg cactcatatg ctcctggagg 48721 ctacaacaag aaccagtgtc actacactcc tacctctctg tcccctcccc tcttcccctc 48781 cctcccatac cctcagctag ggcagtcctt gctcctgcag tgtgtgcatt ttaacgaggc 48841 ctcggtgagg aaatcctttc cctgcaaggc cgggctgggc tcatggtcct gttataagag 48901 catgtggagg tcaggccctt tgctcagcag gctcaggggc tctgttttag ccaactggaa 48961 atggccggcc tggggtgcta gcccccaagt gtgagcctca tgctagtctc aggaaatgca 49021 accttttccg gccagatcca gtggagcccg ggaggtcgct gccttgcccc tggcccgtgg 49081 gctacaggtc tccaggatac tctaggcctg cccgcccctc cggccccagt cggagctggc 49141 attactgaac gccggcatcc caggaaatag accgctcagg ccgctgcctt cagctgggaa 49201 agactcttgc atccgggtca tacgcggcac tttgcctcct tcctcccagt gtgaatccag 49261 cccaaggtgg aagaggagcc tgagaggacc ctgaaagcgc agagagagct ggctaggtag 49321 agctccagct ctggcttctg agattcaagc agctgcggtc gctgcgggca gtggctgctc 49381 cgagctccgg gcccagaagg ccgctccacc gctagcgcgg cgctccagcc acttcaacct 49441 cggcgcccca cggtgacccg gccgtaagga ctgagcatca gggtgcggag gaggaggatg 49501 gagagaaaaa gagagagata gaaggagaga cggacagagg gagagaaaag agagagaaac 49561 aggaggggag agagaataag aaagccagtt ctctccagcc tgacagggtc ttgaagctgc 49621 agcagcggct ttagagggat caagacaggg tctgtgccaa gagatcatta actctccttc 49681 gctcttcgca tgctcctgcg cccaaggcca aggcaaattc tctaccctct aggcccatga 49741 gacggacgaa cctgagtgca gaaaagcttc agagcaccgc agccagtggc cctatcttta 49801 gcttcccgga tctgcctggt ccttaccctg ctcaccagaa gggtggaact ggctgggact 49861 gttgccctta gcaatgccac ctggaaatgc cacagagctg ggggctgcct aggaaggaag 49921 ctttgctctt cagcactgcc catgcctctc agaaaagctg tgtcatttgg caagtgggtc 49981 ccaagacctt cgaggtttcc tcaaggataa tgagccgtgg caggtcagtg ggccaggacc 50041 tgggcctaga gtcatgcctc tccaggctaa atgcagaaag caacttccca ggccgacatc 50101 tgggaaggtc cagtatatat ttgagatagt ggaggctata caaataaagt aaaagacaat 50161 gatcttaacc tatctttcct aaccttttag ggagtcagga cccttctgga aaatccaagt 50221 aagttttgac tcattctgtg aaaaatgcat atctgtatat tcccacaatt tgatccatca 50281 tttcgaagga gttcacacac ccctcagtct tttccttggg tcttccagag tcccccagtg 50341 tcctgggact tgggggccct gccaatttgg ctggcaagac aagctgggtg ccctgcagca 50401 gacagctcag gtcagggccc cgtagggcag ccaaatgggc ttctaggctg gatagaatcc 50461 agggacctgc acgtgccaac caaggttaca aaagtcagtt tgtatcgcat ctgctacaaa 50521 aggctgcccc agccgaagag tagaggagct gggaggcctg ctatgggcct cgcaagccgt 50581 ggcgttggca cttgggtctt cagtgacctt tctgtggtga gccagatgtg tttctttagt 50641 atgacttttt cccccttgta ataaaccaaa cgggatgttg ggcttgagct ccttctggct 50701 tttaaaaaga gtgtgtaagc tcatgtatgg cagcccctct ctggcccagg actgccttcc 50761 aactcccacc agggcttgct gagtgggaaa cgcagcgaga cactgccggt gagctggccg 50821 gctgcaggcg ggccggctgc tgctggttca aatgagagga tactaacgca gcagaagctc 50881 aagttaaacg cctccttggc cctctcagtt ggtctcctag cccccagcat ccctgaccga 50941 cccccaaggc tcccatgctc tctagacttt aacactggag tgttaggttt ctcaataaaa 51001 tattccattt attgagcgcc tcggctttcc tgagattccc caccccaaga gatgaagaaa 51061 gcgattcagt tgtagtgtgg gtcggcttcc aagcactttt cctccctcgt gctcatttgc 51121 cttctttctt ttttgccatt cttcccagct actaaatttc tccctctccc tcagggtaac 51181 tttgacctaa atcttcagtg cctgcttgcc tcaaccaggc cttatgcagc ggttacacag 51241 tcacctccag agcccatgtt ttatttagga ggaaatattg cccatttcta cccgggtaat 51301 aacctgccag ccctatgggg ccaaagtaaa caccaggccc atggattaag gccttcggca 51361 aggattaggc tgcagttcac taggtactcc ttctgagtag gcccttgtaa gcagtgagaa 51421 ataaactatt atgaaaaaca agttcctaat gaaaagatac caggtcctga gatcgaaact 51481 gctagtaaac aatttatatc tggagggcac tctcaatgct ttcaagactc tgaattatcc 51541 agcggtggag ttgggctgca gtcccacaga ggaaaaataa aggctgcttt tacatatctg 51601 gctttagcaa aaaaagaaaa aaaaatcaga gcttctcata tccatgcatt tatgcatagt 51661 tagaaaattt ttccacaagc aatgatgttc ttttcccatt catctttaag aatgtcaaca 51721 ttttcaggga tatttgatgt gtctgttttg aatctatcag gtgcaatcta ttactaaaca 51781 acatactttt aaagttttct aaatccaacc attaagatgg cccttcccct gttagggaag 51841 tgttggggcc ttttcacaca ccctttgaat tgcaatattc ttagatttgt tcataataaa 51901 cagcctctgg ccttagaact ggaggcggct ctggcattgt agggagggct cccgctgcag 51961 agagcagctc actgggtggg cgctcatttc ttgccttgtt gctttgctcc gtcatagttt 52021 gggagccagc cttcaacaaa gactagtaac tatgataggc aaaatttacg caacaaacag 52081 gcagtgccta ctttctctgg gattcacccc actcaccacg cactgtatat ttgggaaaag 52141 aaaaagatgt gggaaactaa tggaaaaata gtagaaaaaa ttaaatatag gagaaagagg 52201 gaatttgaat cgatccttca aaatgtaatg gaattttttc atttaaaatg tgtttgatga 52261 caaccccaat ggagtgattg ctccgtcaaa atttatcctc ggttccagct ttttaaaaaa 52321 gtattagtat ttaccaatac atttttgcat atttgcaaaa tatcttctac aaaagatcta 52381 attttcatat gtaagaaatt gctgcgaaat tgcagctggg tttataacag cgtactccaa 52441 gtatgcaaca cagagacttg tctggcactc tcaaaaatca catgcgactt ctccgaattg 52501 gcctttttcg gaacaaaggt tttctccaag ctgccgggca gacatagaga tgcacctagt 52561 aggaaaccag tgacctccag gacttctgcc cctgggaata agctcgcctt agcggccgca 52621 gacttctacc accccattgt ttagaatgga aactttgttt tcctcaaccc gcgatcacct 52681 tttttttttt ttttgccctc ccacaacccc ctatcgcagc acagctggaa cactttcgcc 52741 tagatgcctc tcatccctag cccattctat tctgtgtatt tctacaaaat cgaaagattc 52801 gccttgagtg aaatgctggc gcaaggagct aaaatcctca acttttctac ttaggcctcc 52861 ccctgctttc caaccttagg gagcaatggg gtgggggctc cctaccgcgt caccccacaa 52921 acccagccac gttcccgaga gccctgctta gacatcggcc acctccccac tcccgcccca 52981 ccagactgaa attgctaaac ttgtggcctc ttaccttgac acatttccga aatcactgcc 53041 aagggacagc tggcttctcc gcgcggcgac gctcgcgagg cctagcgaat gcgcgttgct 53101 ttaaattacc ataccaatca cttcttgagg gtgagtcccc tttttctgtt atgaagggga 53161 gcgggacaag tgaaataatg taccgtgctg ctcttagtat cagaagcgaa caaaggccaa 53221 gaatcatgct ggggttcccg gctccccggc ggctttgaca ttgatcggaa gtgcgccatc 53281 tcgtggcggc tgcgcgccta ggttgggccg gagttccagc cccgagccga gagacggaaa 53341 ccagctccgg gcagagagag aaggagagag gagaggatgt gcccagcccg ctgctattga 53401 gatctcattt ttacatctaa gaaatcgctg caaaacccca gccgggttta tagcggcgca 53461 ttccaaatat gcaaattggc cggccccgga cgggtttacg accacattgt cacagccatc 53521 ggaggatggg cttttatagg gctcagaaat caaacccgcg cccgcccgcc gcccgcccgc 53581 gagcagtcct cctggctaga ctctctctag caacttgaga gactttggtt aatctttaac 53641 catcccaaag gaagtctttc cctaaaccca ggcttcccag cccgccccct ccctgccccc 53701 caggagggcc cttgttcatg tctgcgtgtc tgcctatcaa cctagacatt catctctaga 53761 tctgtcccac taccctttca gctcgatttc cccagtcgcg ccagttagac aaacacacaa 53821 acaaataaac agagtggggt ctggggcctc tctccaaatg cgaccctatc tgctgctctg 53881 gccctgcctg ggtggctgaa gagagggtgg ggtgggcaac aaagggctct gtcctttcag 53941 cccttctcct caaggttatg ggtgatgtcc aattttaagg cagaagttca aaggcagcaa 54001 acaaagagga aatcggcatc cttttttttt cccccaaagg gaaaaagcag cctcagctgg 54061 gagctgggga gaaagcaccc ttagaggctc ctgggagtct gctctgcctt ggaacccaga 54121 ccaggctccc tcttgttgaa gcctccacgg gcccccggac ggtcccactg caggcctacc 54181 tgtcctgagg tgagcgaggg cagcctgggg tctgacctgt tggccacctc aggccccaag 54241 gccctctcca gggctgagat cagtctgagg ggataaagtc ctatattcca ggccctgaga 54301 taccacccag gtccccactt cccacaagga cgtagccaac caccttggtt tcctaagcct 54361 ggcttctgtt gtagccaatt cctgactcag tcaccttctc accccctcag gggcctgata 54421 acttgctcct caactaggta aggcattttt tgggggggtg gggggagggg ggcagacatt 54481 tggtagtaaa aggcgatcag ggagaggaat gtcacaccag gatactcaaa tttccaccgt 54541 ctttattttc cttgtgccca gttgcctgta taagtgctgc aacacacacg gtgggtaaga 54601 accagaattg aggacaggcc aacactccca gtacaaatgg agccaacaga catttcttaa 54661 cacagggagg caaaggagat tcaatgggag ggtgcctggc tttcccagat gagatcccca 54721 ggccggccag gccgactgcc tctgagcatt tccctaactc tttccaaatg ttgcaagaga 54781 ttaaaaagac cattctcaat accttttcca ccccctccaa caccctaaag gaaatcaact 54841 aatggcaaaa gaaaaaagaa aaagagaaag aaagaaaaga aaattgatta tggttccttt 54901 atttacaagt tttttgaaca ccatccctgt atgaagcata cattcaaata ttcttagcag 54961 tgagcgagtt taaccacgga acactgtaaa cagatagggc ctttaaatat tttcatcatc 55021 tcttttcccc tctacattgc atcttttaaa attcgctttg gttccttcag ggaaatatat 55081 atatatataa tataatattt atcttttaaa ataattccac ctcactctca ggctcttgga 55141 aggtcaccag aagcttccaa gctcagttca agagccaatg agggctgagt gtggtgctgg 55201 aacccggtgt tttgggggac tcaaggccct catagccaaa gctggggacc tgctggccag 55261 gcccactgct ctgcagccag acactgagca aatccaagtt tcattacgtg tggtggagag 55321 tttgccaaac tgccacactc cacagccatt tcctgcagag atctcagggg ccagtggcct 55381 ggctgcagcc tcttggcttt tccacatctc ccacctcagg gaacagtcca ctctgtgtcg 55441 aggctttggg cctgagtggc aggctggaac caagaccctc atttggtaga aagaaaagcc 55501 aggtgggctg taggaggcgg tggctaggaa ctcagggctg gatcagtgtg gtgggtggct 55561 aggaggagtg gaaagacact gatgccacct ggaatcatag atttcttttc ccacaaggat 55621 ttgcagccct cttccacctc aaagctacct ccaagtccag ccgctgttca cattggcttt 55681 ggaaagcagc ctcattctgg gcacctagta atcctacccc ttcctcctgc ccctgtcccc 55741 aagctgaact gggcttggag agcacacagc ttttttactc aggggtcctg ggggtgggta 55801 ggctcccagt agagggaggg tgtggtgggg ttagtctcca gggggtctgg caggggccca 55861 atccccagtg gagaccacac ccagcccgca gctctgcagg ctccaagagt gaaccaccag 55921 gggtcccaaa cctgtcattc tagctgatgc acagctctca gaatccaatg attattagga 55981 atcttaacca ctgagatctt aatcaagaga gtcccagacc acctcctgtg gggctatctc 56041 catgcatccc tctcttgcac acctcttttc aaaagtcacc atgtggcttg actttgtcaa 56101 gggcaaaatc tgcatattat ctcatgtgta tgaagccccc cacccaattc cagccgctgg 56161 agtcttagag gagtggattt gctgagtagt actgtaaacg gtctctgtta attttttttt 56221 ccttcattct cctgttctga aaccagattt tgacttgacg atcagtgagg ttgagcatgc 56281 gggacagttg caggcgcttc tctttgttaa tgtagacgct gaagaagaac tcccgttcca 56341 gctctcggat ctggtacttg gtataggggc agcgcttttt gcgggtgcgt tggccacctg 56401 tggagggaga aaaggcatgg ggtgagccag gtgtgggggc tgcaatccac cccagagccc 56461 atagctgagg agaagggctg ccatggggac tggtggtcca gcccaagccc cgtcaaagtc 56521 tgcccaggcc cctgccttta acgctccgac tgcatgcttg gaacggccgc agttggaggc 56581 tcagccacag ataaaagcca gctccctaaa taggcccccg gggaagctgc tcccgctgcc 56641 cccagaacag aaaggagagc ttcgcacagc aacaggcgag tttgcgctgc ctaagcctct 56701 ttgaaagcag ccaagtgggg gttgccggct gcctctcgca ggccagacta aacaaacagc 56761 cgcctgcgga catttcacct tccagcacct gggctgcccc ttcccaaggc agtaaaggcg 56821 gctccagtcc cagtctttcc cggctgcggc caagactggg cagtccttaa catctggggc 56881 ggctgccggg tctttgtcag cctgagtcct ggccaccagc ttctgcctcc ccggccagcc 56941 ttccggctcc gccgaggtgg ggaggtggcg gcggaagccc cctaccctag gccttcgctg 57001 agcaccacgg ccacacggcc atcctgcaca cggcagagca gcggctctat cttaggtagg 57061 gtgaagggaa agggggcttc ccgaagctgc gggcaggctc tacttgctcc ccctttaaaa 57121 aaaaaaaaaa aaaaaaaaaa aagacccaca agacaaaaaa aaaatcgctt ttgacagatt 57181 gaataacaat tgtgaataac atgcagttgg gcagaaggag gcacgtaatt gccaccacgc 57241 cagaggaaaa tggcttcctt tggacaaaaa ggccaacttt gggttaattt gttcttttaa 57301 tatattgcta gagctaagcg ggctacttta tctgttaaac gcgctcctag cgccgtcgtt 57361 aaacagcgac gcctttgaat cccggccggg actggagtcc cggcgcaaca catggctttt 57421 ataaaaatct ccggattacc tcgctatcaa aagctcccga agcccttgca gggggaattt 57481 acaggcgccc acctccggct ccccagcctg gagctggccc cgagcggggt cgagctgctg 57541 ggcttgggag ctgagaaggg aaaaaaggga aaaggggagt tgtttttttt tgcaggcatg 57601 ccttggccgg tgggtatttc acggccaatt tcagcactcg ccacgtgatc ccgcctttta 57661 taacaaagtt ttgttggggg aaacctaaag gcccttcata aaccttatat gcttataaaa 57721 cagcatataa aaatttaaca gcggtgctgc gctagatttc caactcccct ttcataaagc 57781 gcagggcgct gcctttatac gtactggagc cgccggcctt gtcctcagtg tggccggaag 57841 acgactcggg gctgctgctg ctctcggggc gccgccgccg ctctttctcc tctgctgccg 57901 ccgccgtctc ccggcagccg ccgccgccgc cgctgtccga acttgaagtt gccggcgcgc 57961 ccgttgcagc cgccgccgcc gccgcggagg tcgccgtggc cgccgggggc cccttctcgg 58021 cgctcttgtc cccggggtag tcggaggagg cgaggttttc cggggtgccg taggctgtct 58081 cgaaaaactg gtcgaaagcc tgtggcagga cgccgttcct gcccacggtg ctatagaaat 58141 tggacgagac tgcgggggtg gggtggtggt agacgttggc cgagctcttg gccagcacgt 58201 cgccaggcac gccggccgcg ctgggcgcct gcaggcagtc tctgtgcacg agctcctccg 58261 cggagtagca gtgggccaga ttgccgcggg ggtgccattt agtggcgggc tcaatggcgt 58321 actctctgaa ggtcacttcg cgcacgggtt ggacctgggg caggttggag gagtaggagt 58381 atgtcattgg gcgcgaagac ggggtctggg gcagaaaaga agggaggctg gagaaatctg 58441 gacccgagac gtagtaagta caacttggca aatacatgtt agaggagcag ggaccacgct 58501 catcaaaatc cattattggg ctaccttggg ctctccgcag tagccgagct taacatgatt 58561 ctccactgca gctgcctctt tgaagcggat ccgtgaagta gaaatttgga gacgtaagct 58621 gacgtggaaa tctatcccca tccttagcag ggaggtgctg gtcatgtgac ccgatgttga 58681 aattgacaag ctgctagcta gtccgggcct tttccccccc cctttccttt tttttttttc 58741 ctcctctccc ctccctcccg gcttcctttc tttgtagcca cctcagggga agcaacagat 58801 cgtcactcgg tgttctcacc gaaagcacgt aatcgccggt gtaactcatg ttggctgggg 58861 ggcctcccgg cgcgcgcgga gaggctgggg tgcgccccca tgcagcatgc ttgtgctcaa 58921 ttgcagggtc ctcgttctcg agtgtgcaga gggcggtgag agctcaactc tcgtccccac 58981 ctcccacccg cagctccccg ggtgggtgag ggatgccctg gactggggat agccaggtgg 59041 gagtccgtcg ctgtgtggcc tgtggtctcg gagtctgttc tcctggagtc tcgcatttgc 59101 acccccttct tcgcagtccc cctcccatag acttgctctg ggaagcgcct ctgcctccga 59161 ccctagccgg aaccccttcg gggccagagt ttgaagccgt ggatgtgcct gcctggtggc 59221 ttgtccgatt tgcacggtga cttgattaca ctctctcatt catggtcact tccgaagcgc 59281 tttagtgcct tccgtcccta aaccgccaac agccagaacg gcttctcccc gcggtttgtc 59341 actgatccgc agggcccgga agggccttcg tcttacccgg gatccacctc tcccctcatc 59401 ttccctgcct acctcttcat cccaccttct gtccttggag aaactccctc ctcctcgctg 59461 cctgccgggc ttcggagtga ctcggcagag acagaggcac aggggctgcc ctgctgctca 59521 ccggtccacc catctgcctg gtcttctgga gctgaggact cgggaaacca tgcaattgag 59581 gcaagccttg ggctgcttta gaggcgctga catccgagga gacttctcct gggtatgctg 59641 catcttcgtg gggggcccat caggctgctt acagagctcc aggggtgtgg ggagaatgga 59701 ggtggaaagg acgggctgag ggccaaggag aggtggctga gaaaggggta acccacctac 59761 tctcctactc tccttccttg caatgtgtga gagtgaaagc aagggatgtg cagggcaaaa 59821 actcagagag ctctgtgcct ttcctactgc tcattccaca ggaggaatac aataccagag 59881 aaaggagagg tgcttcccga actccctgca cctggggaaa cagggatgtc tttaaaaagc 59941 taggtttgtg ctaaggatag agagaggcca tagcaatact ctgagtctag ctttcttgag 60001 aagaggaaaa aaaataaatt aagaaggcaa aatatgctcc ccatcctgca ggattgaagg 60061 agcaatgttt ggaggaagcg aaagaaagga gaggacacag agcacagagc agccaggcag 60121 agccaggagc tgagaagggc ccagacctga ggcctcccaa caactctctt cttggaagga 60181 tctgggatgt tgctgaagga aaataaaaaa atatgtaaaa agataacctt ttgtttttcc 60241 ctctccagga aatagccaaa gttatttaca tatcttgggg agatttagag tataaactct 60301 aagatctttg gtatttaagt gtcaacatcg atttatttat ttattgctga gctgactgta 60361 actgactcaa taacaaatct aatcgtgtat tgcactggaa aagaaatatt cttattatgt 60421 attttctcca aataatggcc taccattgca tttgaatacc tgctgtaaat atcaataata 60481 tgaagtaatt actctgtagt cgagtaaact aatttattag cattaatgtt tatgtggctc 60541 tccacctccc cccaccacca cctttacaag gattgcttat cacaaatcaa gctacttgga 60601 cacattggtt taatgaactc tttattcaag atttgctaca agaattttca tgtcctttga 60661 atccctgaga actgaacttg aaattatttg tgcatcttca gcttgacata tttgtccact 60721 gtggcttgtc cagagggcag cagtgctgtg agtgagaagg tccagtggga aggaaggtta 60781 ttggagaaga gtagcctcag acctcctaaa gctgggaagc acatttacag aattgccctg 60841 cagcgaaaaa acttctattt ccagcagtga ccaacaaggc aaaatgtttg tttctccaca 60901 gcatctcttt tttagagaca gaatataaaa gacagaagga gaggtttcaa acagcggtat 60961 tcgagctgtt ccctggcttg attttggcta tcccaagctc tctctctgca gcccacccag 61021 tccacgccac ccctaccttc gaacaaaagg aatgcatgaa gggtttcagt gactttgcca 61081 taacaaaggc gccaccattg cggggctcgc cccgcccctg ggtgaaggca aacaaattct 61141 tgcacttgta ttagggcttt taagaccata attgaacccg ggggcgtcta ggaaaaccga 61201 aaacagttct agacagacct gcggttttat agcagttttg gcagtcaact tcagcttgtg 61261 cctgagcaga cggggttgtg gtggcccgcc agcgggggat gccaggccac ctcccccagc 61321 ggcacgcagc ccctctctta attagatcgg ttttcccctg gtgtccggga gagcggtccc 61381 ggcagaaagg tcggtatggg ggtgtgcgct gttccgcata accactgcct cccatgtcct 61441 cctcgagggc cgaaccgaga gggtgctggc agggctggat cccacgggtg tccgcaggag 61501 acaaaggcga attccggaga aaaggctggg gctgagaaag gcgctccggg agcggctggc 61561 agggcaattc ggcaggctgc accgaagccg agtgcccgga gggacttgcc gcccggaagg 61621 gggtgtgtgg gggcgctgcc gtgaagatgg atgagggaaa aggtttttga tatcagcaga 61681 agggaaaacg cctggagtgg ccgaacactt ttagttgccc agcaggaata ggagacgggt 61741 actcagctcc ccaaggctgc gcaatatccc agctttgccc gctcctgccc tcgtgttcgg 61801 aatatgctgg cggtgtgaat gtgaaggttt ctccaggcag gcggctgggg cgaggggtgg 61861 gcgcagctct gaggatcaga ccaaatctag gggaaaaagg ggaggcagac cttcgggtcc 61921 ttggttttga ctttgctctg agcctatgat tgactcttcc gctttgcagg aggtccaaca 61981 gccgagctta gcccaccggg ctctgggaaa gacccgactg aggctaaagc cgccccggaa 62041 ggccaagtcc gagttccatt tcttgaagag gccggcgcgc gtaaggctgt gacattggcc 62101 ctggcgactg gcttcccagg agctgttctt tctcaggagc tccacagcgc gggccatctc 62161 cagaaaactg tcttcagagt gtatttcctt ttatcgtcaa cccagagccc caccgcggct 62221 aatgcaagag gccaaaaaat gtttggagga agaaaaacaa aggcaggaag tggcggcggc 62281 ctgacggtgc gtgtgtgtct gcagagaagg gagggagccg gctcagtctc ttcttgtttt 62341 tccaaacttc aaggtccagg cagccctctg cagggccggg ccccattgct ccccgcgcgg 62401 cattggaggt ggccgcccgg agaggagaag gccaacgcct gcgccaggct tgtcaggcgg 62461 aaacggctaa caaggagatt tggtcagcaa aacagaccca gcctttccga ggcttcgtct 62521 gacttggccc gaaaggttgg ggaggggggg cttgcgcaga gcctcaggga ccctcctctc 62581 tggggactac catccctgag ccttacgctt ctttccacag cctttgcagg cggaatatcg 62641 gaataaagtg ggtccaggcg cctctgccgc ctccgcttct tcttgcagcc tgaatggtcc 62701 gggaggcagc gggagggcgc cggcgggcag tgcccaggcg gggtcgcctc gaggccgtcg 62761 gtgagcaccg ggcaatgcga ggccttgtta ccgaggttgt tgtgctgggg gcgtttccag 62821 cacagtcatt caagggacgt gagctgcaag gatcgtcact tgacaggcgg cttaacaccc 62881 tctcactcca agggaggata cagctgggtg gccgggagca cctccacact ccaggcttcc 62941 cagctccaac tggcaaagct gctgggccac cctcttctct ccctggggct ggcttgggag 63001 gcagcagctg ctccttgtcc cagaaatctg agaattcgaa gtccccagta cttgcacatg 63061 cagcccttta aatgcccagc atttgtccac agtctcaatg tcaatgccaa gctgccccag 63121 tcaccactgt ccttctcaag ggggtcctgt ccctctacag ttcctgatcc tgctcctgtc 63181 ccccctacag ttcctgtctc ctgccctgcc tccttccagc cattctcacc tgctgggacc 63241 tcctgtctct tccctacctc ttttgatgcc tctgctagac tgagcaacag ctgaggtgcc 63301 actggagggc atccaatact caggcgcaga gaagaggggc tgttaggtcc ctgactctcc 63361 cccaacttcg ggcttccacc agagcctgaa ctgagtctcc cacctccttc ttaggaatac 63421 tccacctcaa ggccccacca agtctggctt atctcccttc ctctcccctc ccttctctac 63481 cctcctttct tccccaacac ttcctccttc tcttctgcgg acccagctgc tgcttttcta 63541 ggctgggcaa cccagatggg ataggcctgg gaggggtcag gacccacttg acagactgaa 63601 tctcactccc agatggctac agttaagttt ctctcttggt ggattgctta gggcaccatc 63661 attattttgc ctcctcctgc ttgcctaatc ccttactttc ctctgtccac aaatgtcaca 63721 ctgtttctag aaagctacac agaagagtat ttcaaccctg gtgaccatac tgcacacagg 63781 tgcaagaact ggcccggagg ccccgggtag aaatggcaac agagctcttc tccattacag 63841 gaatatctcc ttccaaatca aagggttcct gtgggcacat ctgtcttatg tggtgagcta 63901 ggggagatgg gttgtctggg aaggcccagg agatctccac gtatttgttc tcagggcatg 63961 gcttattgcc agatcataga caatccatca cattttccag gcgctgccca gaggcagcac 64021 tgaagcttca attaaaaaat caagagatca cctctatcac taccaggtgg aggcaggggt 64081 taatttgtcc ctttgttgag acacctcacc ctccccgaag gtccccaaca cttcctaaaa 64141 gctcagaaac aggtaatgcc ctgaagtgga cctcataatt ctgcttggtc tgactggcag 64201 tttcagtcat ccttcaaagc ccctaaatat ctgaaaaggc ccataccctt cccctggcca 64261 ctgcagagtg gggcacaggg ctcagctgag ggtctcaagt acctaagcaa tctagttatc 64321 atttacttta gtggatccca aactgaaggg agaagagaag attacttcct ttaataacta 64381 tttttatttt tattgttgcc aattattaga cttctttttc ctatagcatg ggaacaaatg 64441 gggtggggga gactttgaag ggcgaagccc tagttctcag tgtctgcctt ccattcccca 64501 tgcagctttg gcccaaacag gtttcccagc agagtgtgtc tgagataaag aagacaaaag 64561 ggggtggccc aacccgaagg ttggctctgg tccttgggtc tcggtctcta cagccgtagc 64621 tcacaggaag cccccacgcc acaaagctcc aactgcttcc tagggcagct ccatttccac 64681 tcagtgctca tgttggctca cattccaggc ttgagaagag ccatcacccc atccagccct 64741 cctcccctac acacccttcc aggcagttct gcgatgggac ccattcacag atcattgttt 64801 ctgctatata tgcccccaca gcccagccca gcccatccac gcagccctag gggtacaatc 64861 cagatttttg gcttccgcga aaagaaatcc tttgcatgtc ctctttggtt cacctggaga 64921 gataggctca gcggcagatc acgcttaagc taacgtttac cgcagggcct gatgctctga 64981 tctgctggag gcaggctata ggcagggggt aaggggctct ctccctcgca gagcgcagtg 65041 aaggattctt gggggctggg agtcaggagt ttttaccttg gaagagtgcg cctcacatct 65101 gtgtggattt taggcaggag aagagcatca agggtctcac caaaggcttg ccactcactg 65161 agggagaggc ctggattctg ggaccccagg agagtctctc cagcagggtc ttggctcccc 65221 tatcctcttc cctgccaagt cggccctatg ttcctaccag gacccagaac tccgatccag 65281 ccacgtcctt tcccctgggc ttgcaggcgc tccctctcgc ccccgctagg ccttagcggg 65341 ctccgaagtc tgctggactg gggcaaaccc cgctcttctg ggacggggac acggcctggg 65401 ccacccgtgg ctgccgggac tcctttccaa cagggctggg gatgggagag ttggaaatca 65461 agaaagaacc tctgcctcca gtgcacgagc ccagagccca ggctaccaga acccgcttgc 65521 agcaaggtaa ctgtgcgtcc tgtgccttct tgaggcggcg gcagcgatcg gccctgacca 65581 tagaggcggc cgtggcgcgg aggacttgac ctttacggca cgagaggagg gcgctggctg 65641 agccgcaggg agggggacgc tgctctctcc agcctcctcc acccaatagc agtccgctcg 65701 acccacgcca gctgcgcgct cacccccttt ccccctccat tttgtgcagt gtccgcagac 65761 ccgcggccgg aaacaaagcc cccagccctg ctctcaccac ccttcttgca agctgctgtc 65821 gccaaccccc cagaccagag gctcagagct aacgctaagc cccttaaggc cctctccagg 65881 gccctcttct ctcgccagcc gccctcagcc agctcctgac cccagccgcc ctcagccagc 65941 tcctgggtac cactctgggt gccaaccggg acctgccggt gccacggtcc cgtgcgccgg 66001 gtcccggcct cgggctgcgg gcggcttcgg ggcctacagc ccagcggcgc ggagcgtggc 66061 agaggcgcgg ggcgcagcgc agcccgaggc tgcctcccgc ccgggcggac ggcgcggccg 66121 ctggggtgca cagcgtagcc ccgcgtgccc ggctcgctct ccgttccctc ggatagctcc 66181 caccgctcgg ctccgggctc agaaagcgga agtatttggt agagaaaaac acggtttctt 66241 ttcagctcgt ctcaaaatcc cttttagaga aaatgctcct gtcaagtttt atttcccgtt 66301 gcaaaccacc ttccacgctg ccaagaatta agaccgggag agattaaata cccgattatt 66361 ctcctgggga gggcggggcg gggccgggaa gtgggcaccc acaccaaaca ttccttgaag 66421 taggcttgtc ctgatccagc cgcgcccggg gagccccacg aaggcccgcg cggcctgcgg 66481 tgacgtcagc ctgcagttgc ggggccactc acccgcacag acacggccct tgctgttccg 66541 cgccgggacc tccgccagcc gtcctgccgc agccccgact ggcccgcgtg ccgtcagagg 66601 gaggcccgct ttacaccggc ctgagcctcc attttgagta gggggttctt gtagaagcgt 66661 ggagggtttg gggccgcgcc tctgtccccg aggggcgcga catcaggtag ctctccgagt 66721 tcacacccca gtacctgggg gaggacctat gtcgccgcat aaccgcccgc agaggtttgg 66781 aaatctctga gacccgttgt aattatttct gcccagtgga tccggctctt cagcgtcacg 66841 agagccgggc agaaatgaaa tcaactgtgg caaggccttg gctgctttca cggaggagtt 66901 tttctgcgcc agtgtctttt tccttccctt taaaataaaa ttaaaaatag caagcacttc 66961 tcaggcattc atcagagata gatagatgca cgaggattga gtgggcattt tcataaagaa 67021 tgaggccggc tgttatagac cggcggccta gcagatgaaa acttaattag cgtgcctgtc 67081 ctaaaaccta ggcataaatc tccctctgcc ttttggataa cgctatatct ttgcttatga 67141 gaaatgggat gtgagcaact cgctgcacat ttctctgatt ctccaggtct tggtcggctg 67201 acacgcattc gatcaagttt aaaggaatgc gcataaatca gcaagcccct agcgtctcct 67261 tgggagaggt ccgcaaatcc aggagggcgc ctctgaaccc accgggtctg gggattagca 67321 gtccagggca acctccgtct ctgctcctga actcgggaat tcacagagga agcaagacac 67381 tgcatcttca ccaaggcctc caaacacatg cagcagagtg caatctgcac ttacatgtat 67441 tacaaagtga aatctgtgtc aactctccgc acacaaatgt tgcatctgca gctgaatttc 67501 actgcctagt ggtgaatttt taagaaaaga tttcaactag gttgttttaa tttttttctt 67561 cccttttctg ttaatttttt ttaaaaaccc acaacttgaa taacttgaat gggtggcttc 67621 agctctgcat cagtcacaaa taggagtgaa atgcatagcg acatttaaca atcatccact 67681 taaaataagt aaataaatat gatagtactg agagcagata gaaaaagtag cgtttttttt 67741 taaagtccca tttttatttt cttaattcag gaagagtttt ctttttagaa aaaaatactt 67801 taatcaggct ttcaacaaca ttatccatgg gtcagtggct gatactatta ttcctatttt 67861 tcaggaggtg gctggtctct ccttgatttt tgtttttgtt tttgtttttg ttttaaggtt 67921 ttagactgat tgctatttgg gcattaaagg agccataata aataatccat gcccacttta 67981 ggttatctgg tagatccaca gaaattttaa ataggaggag agttaggtaa gatcgacact 68041 atcaatgacc attttagaac tggggggaaa aaatccccac aacaaccctg aaatgtcttc 68101 tgtcattaca gtttcaaaaa ctagagagag aaaaaaagaa ggctactact ttacccaggg 68161 ttcctgtagt ggtgatggct ttcgaaaggg gcgggatccc ggctggagag ctgctgttgg 68221 cctccttcct aggctcgagg ctcagaatat ttcttacatc taaagaaaaa tatcccctgt 68281 caacagaaga gtcccttttg gagctgttct taaacacaca gtttgatcca gctttgaggg 68341 gattttccac cactttaaac attttgggag aaagttgtta ctttggcttg atggcagctc 68401 atttggaaat ggagtactgt ttggaacaag aggtggagag gtgggtctga agcaacatta 68461 tcatttgttt ccacaagtgg agtgaaaatc ctcagggcag caaaatataa ttgaatttct 68521 cgagaccttt cgatatgtat gtttcaacac cagcctgttt ttgagacagc tttagagact 68581 ctttcgtaat tctcatctat aaagaagttg tgagtcctca ggagaggttg gagaggtttc 68641 cggcagccac ttttgtaacc aatcaatatt attttccata aaatgatgaa tctggttctt 68701 ccattcacta ttactttcct ctaacgtaaa gataaaatta gcctgcatct cacaattctg 68761 catcccacgg ctactgattc caccaacatt ttaatacata tgcgcatagc atagatttga 68821 caaaaacaca ttatcctatg tgtatattaa atatacaaac atacatatgt attctttatg 68881 actctgaata cagcacctga ttgttgttag aggtcaacag tagatataat attattatta 68941 tagttttgat ggaagttgtt aatattaaac atatttattt atcctacaaa tcaaatggca 69001 gattcacaag catcaatata agattgcaca ccaccctgga gcagtactca gctctgagtt 69061 atccaggata actctcgttc tgtatccaat cctgtggtca ttttgtagat atgaaatatt 69121 aagtcctctt tactgacctc tcaagaggtg ggtttcttag actttctgaa acataactct 69181 agaactctag acttagcaag gcatctgaat cttaatttgc tgtaaagact cctcaatagc 69241 tctaattaca gtgtgagacc accctggagg gtgaggacca ggaaggaatg gtgatttaag 69301 aacacatctt agactgtaac aaaaatgtac agctttaaag cagatctgca gaaatctgat 69361 gcagagcaag ccactgtgaa gacaaatcaa atttggccat tagagaaaag gaacaaggct 69421 tcatcttgta aagtacattt tgtacctcaa ttctagaaat caccttaaat tacaatatct 69481 tgcgtcatta gcaattaatg atatgatatc tgtacagatt ctacaaatat atatacaagc 69541 acacacaagc actaatatgc aaatgtatac atgtgtacac atattatata tatttacata 69601 tatacaggga catgtataat ttacatatat tcaaatttgc tccagctgga gccaaataat 69661 tgagtactga atctctaaag tcaaggaggt aagagctctt gattcccttg gttgaagaat 69721 tacaaggagg cttgtcaacg cgaggtggcg cccttgatct actaatccag ctaaggccaa 69781 ttcatgagct gtcaaaagtc aaggtcacaa attgtctttt ttttcgtcaa ggtcaaggtt 69841 taaggccttt cgtagtcctg tcatttcaac aaatatcaaa acctgccctc tcagacacat 69901 gcagacccaa cagcagattt taaaatacga cagcctaggc attgctgcta caaaacaact 69961 ggttttcatt ttccacaggg agcctgggaa ttcatactat ctcttcctcc tttcttttct 70021 ctcccttttg taatacctga cactttaatc tccttgaagc acttatcatt ttttaggatt 70081 ttagttatga gggatttctc ccttttctta aaaatttggt agcaaaagaa gtactgaata 70141 ataataaaga atggtccttg gtttatataa gataaacctg ctccttttct ttctgtgcag 70201 tgcacattta gcataggaaa gtaaagagta ttttctgcag attttaatgg cagtgactat 70261 tttattaaca attaatatta cgatatcact atctacaggt ctaagggttt ttttttttca 70321 tttttagtag aaatatttaa aaagcaggtg cacaaataca ttttcacagt gtgctgaatg 70381 tctttattta caagatatca ttctatagtg aatatgaaca aaacgaatgt gctggttgaa 70441 ataactgctt gattaaaaat gtgctgtgaa gatgaatcac taatctttct aatgcactct 70501 gataacacaa taaacatgga aaaatactaa tcccttaata gatcaaaata tagaatatag 70561 acacctaaat atttcagggg aatgaatttt cattctgagt tttctaaaaa agaaaaaaga 70621 aaaatgattt ctccagcaaa tgtttagcaa atattgggaa atgccaattc aaatgaaaaa 70681 acgaattgtg ttcaaaccaa agtccatcat gttgggatgg aaactctcgg gagatctcac 70741 ataaagtgaa ttctgtggat catctgatga tgtaaacatt ttcaaaaaga tacaaaatcc 70801 tttggaaaat aacggatatt ggagcagtaa atatagttca aatgccacaa aatatgcttg 70861 ttataagcaa ttaaaaatta ttcttaaaaa gacattaccg acccgggttc tcttgtagcc 70921 tgaaaggttc agctggttgg ttttactata ctaccaagac aaatgatcct cagaagagtt 70981 ttctctccct ttctcccttc acagcaaata ttccaagatc attatttcag taccttctcg 71041 tttcccctct aaatctatac ataaaatgtt ttgcagaacg tacaacaacg gtaggaattt 71101 cactaagatt tacctgagca gacgcttaac atgcaaaggg aatggcgacc taagaaggac 71161 aagcagatgt ttacaatggc ctcttgcgtt tgtaaccaac ttccagatta ttaccatcta 71221 acgcagtgtc ctaaaatgta acgatcactt aaatacttac aagatttcag taactgcgta 71281 acttatctga aattgcgtat tttgggggtt gacgtttgac atttaacggg ctgggctgat 71341 ggggttggcg gaagaactgg cagtctttac ctttcttaaa gttttaaaac agttgtagat 71401 tccattaaag agaaagagat ctccttcggg agaggaaaat gccagtctct gtctctttct 71461 ctttcccatt cttcaattat tattaagcat tattatcatt atctgggcaa agcaacgagt 71521 tctgaagcgt ttcttcaagt tgccttcttg ctctattttt aatccattaa ctagtggttt 71581 tcagtttgtt gatgactttt ttctctttaa ccctcctgtt ctggaaccag attgtgacct 71641 gccgctcaga gagattcgtc gtggctgata tccgcctccg tttgtcctta gtaatgaatt 71701 tattcgtggc gtattcccgt tcaagttctt ttaattgcac cttggtataa ggcacgcgct 71761 tctttctccc cctcctatag gagctggcat ccgagggatg ggagaccacg tctagaagac 71821 aagggagaag gcaagttaca cagggatcgg accccagcca gggcataggc gacagctcga 71881 tctgagccac ccgccttgca gcgcccggct gctctttgcc acccgctgta caatcctgtc 71941 ttctgctaaa gcctagaggg tcagtgggga aggtagttag ttctgaactg aaatgaaatc 72001 acccagggct ccagtgactt ccccaacccg gccatcctgc aggagcagcg cgtaggcagc 72061 ctcgagtgat agcctggtcc aacggcccac accttagcgc caggctcaag gtacaacact 72121 tctgtgcctg cctcctttct gggtgcccgt cccaatactc gaagcttcta cactgaagcc 72181 atttttgaga gaatgaatag ggaggggcaa tttggggagc tgttttctgg tcaatttctc 72241 atccttaaat tggtgtcagg ggctggggag ggcggggcgc agagggagag ggaagctagc 72301 cgaggtctcc acaagccacc cctcacccag ggagagccag ggaccagctc aactcgaggc 72361 agtgcaggca gacccagcca gccatgctcg ggctcccagg ggtgcagagc cgctagggca 72421 gaccaggaag agaacagaaa cgcacccggg atcgcccggg tgcgagcgga gaagaagctg 72481 gagcagagcc ggaagaccag ggctgggaat aggtcgtcat ttaccgggca gagtggactt 72541 ccagaggtgg ggaggctgcg cctgctcttt ggggcagtac atttggccgt tccagccgtt 72601 gggcagcgcc cagggctggt agctttccat gggaagaccc aagggttcgt ggcgcgactc 72661 gccggggccc ccgaggcccg gcaccactgg catatccagg tagccaggca tgggctgatg 72721 gtggtggtaa ggcccggctg cgtagccctg gtggtagaag gcgaactcct tagcgcggga 72781 gctgaactcc tcggcagctg ggccggcggt atccatgtac ttgtccgcga aggcggcggc 72841 ggcggcggcc gaggcgggct gcgcgcacga cttgatggcg ttggggtgcg ggcccatgcg 72901 ggcgcacggg tagtagccgc tgccgaagta gccatagggc agcgccgcgg gccccgacga 72961 gctctgcgcc gctgccgagc aggggctgca ttgcttggcg gcctctgcgc ccgccgggcc 73021 cgccgggccg ggacctcccg aggacgacgc ggcggcggcg gcggcggctg cagcggcagc 73081 cgcggcagca gcggcggcag ccgacggggg cgcctccccg ggggcgctgc tgtaggcgga 73141 cgcggctcct ggcgccaagg gcgccgggtg cgccatcagg ttgcggcact ggttggccgc 73201 ggccgccgcc gcagccgcgg ccgccgccgc caccgagaag ttgccccctg ccgccgcagc 73261 cgccgggtgg gggaagcccc cgcccccggc cccggcagcc gccgccgctg cagccgctgc 73321 tgcagccgcc gccgcccctt ccatgttctt gttgagctcg tcggccacca ggccgccgcc 73381 gttgtcgtag agaaacatga cggtgggctc gatccagcgg gggtggagga gcacggaggc 73441 tgtcatagcc cgagccgcat ggagaagacc ccagtggcgc tgttttaaaa agcccccaag 73501 aagtgaagag cgcgcggcgc ggggccgggg cccgagcgag gggggcggat cgcgccccgc 73561 ggggtcgcgc caggcggccg cccattggcc cggccccccc gctccgccca cggcgtatgc 73621 aaagcgggcg gctccgactc cagctctgca cggtgcaccc gtcgtccccg ccgcggcccg 73681 cgccgtcccc agcgccagcc gcgcggccgc gcaccattca cccgggggag aggcggagga 73741 acgtgcgggt ggggcaggga aggaaggcgg ccccatctcc gccccccttc ccttccttta 73801 tcccagtggg gcccagaccc gcgcaaccag gcggggaggg gaggtgggcg cgcgattggg 73861 ttgcgatctg gagcagtggg gacaggtcag gtaattctgc cggcggtcag ggatgggaga 73921 gtggggagga cgggaccagc ctagctcaga atgggtgttt cttggagcct ctgggactgt 73981 tttctttcca ggaaccggcg cgtatttctg cagtgagacc acaggacgga catcggcgcc 74041 ttcggcttcg atggagttgc gattttgctc tttccaggga aacagtggca gggtgtttgc 74101 tgcttatcgg ttcctgcgga tatgcctggg tcccaggaca ttccactgga ggcttggact 74161 gcatttagga gcccctatcc cttccctgtc cacactgtta gtgagcaatt tcatatgttt 74221 gcatttagac ccatagactc agaacgactc atcacacaca cacacagtgt acactgacac 74281 actcacattc gcacacttag gtatacagcc tgatccttgc tctgacctgg taacaacgct 74341 tcctcctcca gagactttga gatagagcga gcgatccctg tgcaccattc atccatgctc 74401 ccacctcgcc agtatggctg gcttagttct ggaaggggct taagaggaac aagccccagc 74461 tgtgcttctg gctgggactt aaacccccct tctgggccct aaagccacgc ttctttgtgg 74521 accggacctg actctccagg aatctgggaa cccgctattt cactctattt tgggacaaga 74581 aaaaggggct ctttggggcc acttcctgcc ttcccctcaa gtaggatctc cagcctgcag 74641 agggtgccta gtccttcttt gcccaagaac cagtccaaga agcctttcct ctgtgcctgg 74701 gaaatgcaac cttttcttgg gagcatggta gggtgttggt gctgaagaac caagcagcga 74761 cccgtcttgt agctgccatg ttttgtcgag gggttctggg ggtcctgctg ctttagagcc 74821 acatacttcc acttcctgat tcactactgt gagctggtca gatgcctaga agaggaacaa 74881 gcgttcaaag tgaaagtggg cacattaccg gaatagtgct ggggagagtg ctggattctt 74941 ttccacccca ggcggactgg tgagaagcca ggcttggacc tgtcctctgc tcctagcttg 75001 cacactcagc cctaaactca gagcagcacg cataccaccc ctcacacaca ccccaccatc 75061 tgctgtctaa ggcccctggg cttcctgcag gatccagacc aatgtggctg ggcttgggct 75121 tttatctgtc ctgatcctgg atttgtcctg accaatgtaa gtgtcgccca ataaaacctt 75181 ctatgacccc cacaccagcc acccccccac caagtgtgcc ctttccttct tgacttttta 75241 gcagttctgg gtaaatattg atttgccccc agtttacctt ctccctgact ggccatttgc 75301 agactcagga actagcctct gtagggactt gatttttctg ttactttctg gccgtttcac 75361 cacccccctt cctccctcca agtggcattg taaaactcac agtgacaaag agacagagta 75421 gggttctagg cccctgttcc tggggacttg aaggcggttt tacatactgg tcagacacgg 75481 ctggaggcca aggtcaagtt gaaagttgca gtccagccag catgagaact gccatgcgag 75541 cgtagagaca caggcagcag caaaaggccc attgcccaca tcccctcact cttaattttc 75601 tctctctttt taaaattctc gcctctgact ctgttcggct gcccagaatt ttttggtgcc 75661 ttcgtggggt ttttggggcg gtgtttaccg actcttctct gcctccgccc tgctcagcca 75721 gggctttgag cctcttcggt tttccggcca gacccggaaa aacgaaaaca cagcttgggg 75781 agcccccact agccggcgcc tgtgccagct cacctctggc catggcgcag ctgccggtgc 75841 acacggcggc caaggccagc tccacattct tccctccccc tcccacttca ccgtagcccc 75901 gaaccctgcg cgcagagaaa gggtctcagc tccacagacg actgggtccc tcctcaccaa 75961 aaatggtgag acaagatttc atctgtcggc cgaggagcca caagcaggtt tgtctgagag 76021 ggatggtgct gggggaaggc tttggattgc atctcaaatt aagctttgct ccttaaatgt 76081 ggcgctctcg ccaagaaaaa gcttggggac tgaattcttg agatttatgg tgcaccttat 76141 tgatcaaatt tatctggact ttttttagtt ccccgatgtg tccctatcat taaaaaaaaa 76201 aaaaaaaaaa aaaaacccct ctcaagacat gcttatttag aggaaggcac cctgcttgtt 76261 ctcttcttgt cttggatctc aacatgtatc ttatttttct accagacatg tccatggcgt 76321 cctggcatgc cccactgttt tcaaatgtct gcatgtcctg tcagtattga gttgtagcca 76381 ccaatcagtg gaggtcgttt atagaaggag aacgagctct gaataatttg ggtctgagtt 76441 tcagaatttc tactcacttt taagcaagaa taatcttcct tccctgtctt ctgctaaggg 76501 agataagata tttattaatg tttcccaata tgatttcttt ccctgccagg cagccaacaa 76561 actgacttgc tgtggcaaaa gaaagtttgg aggacgattg agacgggtgt gctctgggaa 76621 gcaagcacag accctcctag catttttcct cttttcccaa cggcaaggtc aagatttgct 76681 tgcctgccaa ggtagtgggt cctgatcttc tgagctgtct gtccctctga ttttttgttc 76741 tccaggctca ggaattccta taggattgca gaggcacctt caacctgata attttatatt 76801 ctcaggctcc tcctgggtta gactggggaa gccagggcac ccactaggat cggccggcag 76861 cggtggcacc ttaggcatga cagccaagca gagggctgca tgagtagagg tgggagaggt 76921 gggagaggct gcagtcactt cccaggtact gtctggtgca ctgatttccc cggaggagcc 76981 ctgagagggc agtgcaggca ggaccggagg gtagtttctg cagcacggcc tttatttcag 77041 gaatcaccag agcctttctc agtgttccaa aatgccttgg ctgacaaggc ttcttacttc 77101 ctggggtagg ccagggtgcc ctgcagggac tggttgccca aaaacctctc actgaagaaa 77161 aaaaccgcag ctaggttttt ctgggtgggg aactggatgc cagcctggtc cacgcggctc 77221 tgggctccca gcattggaga agaaaaggaa aaagggaaac caggttttaa tcgagcttga 77281 ttttcattag caaatgaaat ttacctgctc tcactttaac tctgaggagg ctgtcagcac 77341 tgctccctgg cactatttct ttgcatccat ccagcactag gataaacatt ccttggttga 77401 ttctaaagac actttgtatt tcaggatatt catgtgcttt taatgtatcc aagcagcttt 77461 acaagccact tttgcagaac atcttaaccc tctctctccc accctaggtg ggctggggca 77521 cagagataat ggcaaaggaa gaaagatcca gaggaaaaac tggttgaggg gctgaggaga 77581 gagaaagaga aaagacgaaa tcgtcaaaca gtgtagcacc tctgaagcca aacttgagca 77641 gatcatttgg gaaaggagca agtgggggac tgctctgctt atggagggaa ttttgagtgt 77701 ccccatcagg tagaagctgt accctctttc ttaaggggaa agggttggta ggtcctgaga 77761 tctgggctga agttggccag cggacaggga aatggtgcat cgagtcactc actcaacctt 77821 ttgttccaga tttttgtgca tttgaaacaa taataaggca gtaaaaggct tgaggctgtg 77881 ggatagtttt gggctattgt cactgtggtg accaaatgtt aacatggtga aagtggaagt 77941 gcaaggtcca ggaagatcta cagggacccc cactttgcct ttaagatccc ttcatggggt 78001 ggatctcaag gacagcaggt ttttggagga gcagctctcc tgctgcctgt agagtttttt 78061 gttttgtttt gttttttcag gtaccacaaa gccactagtg cacagggact cagaaaagac 78121 ggcaggagcc caaggaaaac tccaatttga gtacagccct gccttgtttc ccccagagag 78181 tccctgagca aggagacctc caccccacac acaccatttc agaacaacca ggttccagac 78241 tcccatgagg agcatctccc actgcagagc cttggccagc cgcgcccgga ctcctcagag 78301 ctggcgcaaa ctccgtcctc caaaactcgg ctctgggagg cctaagtgac tccgaagccg 78361 gcggcagccg cggcagcggc cgtggtggtg gaagagctct tttccccgac agtgccactg 78421 atcgctcttc actggagctg gaaacagcct tcgcggaaag gaccggagca tgcgttagaa 78481 gcagagggag cttggtgaag ggctcggctg gaaggaggaa acgccttctc gcagtgcgcg 78541 gccagcccgc gggggacacc ggcttgctgg actgcagggg cccgtgccac ccaggaagtg 78601 acctgcgggt cactcagccg gggcgctggg cgagcgcggg acggcccgga gaattccgtg 78661 cggctgcgac gggaaaagga cgaggggtct ctgtacccga cgctgccact ggcccaaagg 78721 aattttaccc gcgagcgccc accccaccct agcttgatgc ttacgcccgc aacaaaacag 78781 gaaaccagga ctgggcagtg cattctttaa gtcaacaaat acactgaaga cttcgagcgt 78841 ttgaaggaag gagggggttt gcacgtaagc ctggccccgc cgggctcggc tttctcgctg 78901 agaaagcggc gcaggcagcc aggcggcctg ggcccgcggg ggtccatctc gccctagact 78961 cctaagaact cccacggccc tgttcccagc tgcgaattct taatgcacaa cgcgacggag 79021 ggaaggaaat tcaccagcgc agcgacgagg aaggggaact caggacccct tcaagtacac 79081 actgaggtgt gatcagagtt ttatgggcac tttatatgct gtaatcataa cgatgtgtgt 79141 gccttgatat gcacgcatat tcacgcatca aacgtgcata cacacacaga gtgaatgtgc 79201 gcatccaatg tcatgtgggt gaaatacaag catcataccc agccctacga aaaaaaaatt 79261 caccctgtcg gaccaggctg gtgacatact tcgctggcgc atctccttac tcactcttac 79321 ttttccgacc cctcaccatt ccctctcctg tggcttggta aatacacctg ccctccgtgg 79381 aaggtgagtc ctggactggc gttgccaggt tcgcatgtcc tccccagaac ctccgtctgg 79441 ctccagggac tctcactgag cgggtctaga gcacccagca cttttcaagg aacagccgcg 79501 gttcctttgt cccgcggctc cagccccgtt cggcccagct ctcagggaaa cgaagcgctc 79561 agtaagaact tttgatatta gtttgtatgg gtatttacac tctggtgagg ggagctgagt 79621 acggaagttc cattaatcat actccaacct tgggtttaga tattcagttt atgggttggg 79681 agagggagtt tgccggaaag aaagcatcaa ggttggccgc tgactccaga gaaatgaaaa 79741 gggagcaagg tcgttttctg tttctggaaa tcaagaatta ggaatgggca actacaggtg 79801 ctaaccaaca gaccactttt ttgttttttg gtagcccttt ggcagggata gtttttccac 79861 ctttgcccga tacaatttaa aaaaaaaaat ccttttatta tggaatttgt caaacacaca 79921 cacaagcata acaaacccct aggtacccat ctccaagttt tgacccctat tataatttca 79981 tcttcagtgt tttattatcc acttcctctc tctctatctt tagtatttta aagtaaatcc 80041 cagatagcat cacatcattt cacccccacc ataggatttc aaagatctgt tatatttcaa 80101 gattgagtaa aagggcttga aattgggtta ttgcaatgaa actctagaaa aagcttgagg 80161 gttcacccag gagtaagctg gacaaaaaag gggtttgagg ggtggaccca tcttgcctaa 80221 aaatcttgtc tcatctttct aaaaattaca tatgaaagag gaagatttat gttacttttt 80281 tatatgagag aatcgtcctt taatagaaaa tttctattgc tgcatcagaa ttatggagga 80341 acacaaaaaa catacctcag tccttagtgt gtcctaaatt aacacatatt cacttattag 80401 tgggtaaatg actatatttc atttcagcac aacttctccc ctggtagaaa cacaaaagaa 80461 atttctaatg attaaactag gaaagtttgc actgaattga tggcttatca gagcaaccgc 80521 agttttcagg aagaaattca atgccatgcg ttgaaaatat ccccctagca ataagggatt 80581 atttttaaaa aagaatgaat aaagatgttc tggtttcttt tgttttaatc tggtagtctc 80641 atttacaacg agcatgattc tccctgtcga actctgaaag tgacttaact gaaaggcttg 80701 gcaacttcag aaagcaaaaa ggtaaaaaca gaaaatagca cacggttgaa tttgacaact 80761 tttacactac ccggctgctt aataaattct aaccccactt gtctgagtgg atactgatca 80821 tcttttctat ggcagtattt tgtatttggg ttgtttatgg tttcttaatt aatttttttg 80881 agtagtgatt aaatatctgg gatgctttta cactaagcat ataaattctc atggtattag 80941 aaaagagcta tttgatgaaa ctcataaggg gtatgtaaat taaaaaagag aagaaagtgt 81001 gtgtacatat tttacaataa tctcgaacga ctccaactat atgttgcaga agccatgagc 81061 tctctgccca ctgtctgtgg ggtatcagca agattgactc ctaccaagcc ttgggcacag 81121 tggtgggttt aagggtccca tgtctccaga tcctaagatg gagttcaccg caggggtgag 81181 ggctcgggtc agggtaaggg tggggactgc agaagtcaaa caattcggag gaagagggag 81241 aggggaagag agaagccaca gacaagccag gagacaaaga ggaagggaga gatgggaggg 81301 agaagaataa gaagcaatga aactgtaatt ctccacaaaa atttaagcta cagcagctca 81361 aagaaggcct gtttttaaag cttagcagaa gcccattctc cgctgaagcc tgttctttgt 81421 ggaagcctgg ctggggggca gtgggggcgg tgccactgtc gagtgttcct tgccatttag 81481 gtccaaagcc tccctgtgat ttctgtaact gcacaaaaac aatctgggct ttaagaacct 81541 gattattgtc cagagttaaa tgcaggctcg accttctgcc catagagagt ccaaacgcat 81601 gaattcctgc tggtatcagc ttcggaggca gaggtcgcga gctgagcttg ggacaaccct 81661 tcaagcctgc tagcaattac ctgctctcag ccacttttaa aaggccatat ttctttacat 81721 ttctaggatc taatgtgagg ttatgtcagg tcatgggacc ctaggctggt gcaaagtaat 81781 ttgatgtcac caagagcaag cagactcgtt tgtcaccctg aggaatagaa gtccctgtga 81841 tttctgctgg gagcagagca gggttgcttg gctgggggtg tggaatccag acaggggtcc 81901 agtgtgggcc taaccaggtg agctgggagg ggactttctg gatcgtgtgt gtgtgtgtgt 81961 gtgtgtgtgt gtgtgtgtgt gtgtgtgtat gtttgtctgt ctcaggttga gaggattaag 82021 ggatcagctc tgctccaagc tcacacagca gcccattgga actcagcatg tgacctggac 82081 tgatccaggc ccttggatcc caggttcact tgtgacatga tctgggtctc agtttttctg 82141 aagctaaatc tggttagccc acctgcagtc ctttcttccc ctagccctta cccactgcta 82201 ctttaatagt ctgtagctga ttaaataaaa caacggagtc ttcctacctg aggtttatcc 82261 agccattagg tgattggtgg gagcacagaa ggagtcatag cccttgtacc cctttgggga 82321 aacacttctg tgagtgataa gagacggcta atgtctcagt gaagggcttc aaggcatttg 82381 ctagaatggc ctcttccact atttccagtt cttccatttt ttcaaggggg gtgggggttg 82441 tagggggttg tgtaacaaag ccaaagaaca gggtccctta agccttcatt tgagaatcag 82501 ggagagtgcg aaccctggag gagtatgggt aggcaacttg atgggtaaaa tggaaaagga 82561 agaggccgct tcccagaggc ctggccgatg ctaggtccca gagacttgag catgacaggg 82621 tggggggcct cctacagaag ccctggaaac tctatcggta gagctgcctg tgggggctcc 82681 acaatgtcct gggattcata cattgaaatg tggagcatgg ttccttggcc ttctatatca 82741 ttcccaaaag gagtatactg gtgccactct tcctgacaat tctaggatga tctaatgcca 82801 caactaatag actgagagag cctgggcaac atggagaaac ccagtctctt taaaaaacac 82861 aaaaattagc caggtgtggt agcatgtgcc tgtggtccca gctactgggg aggctgaggt 82921 gggaggattg cttgatcccc agcaagttga ggctgcagtg agccatgata gtaccactat 82981 gttccagcct gggcgaaaga gtgagaccct gtctcaaaaa aaaaaaaaaa aaaaaaaaag 83041 actgaaggag agaactcaga ctcagactca gccaaagtgg tcagcaagga ggcacacccc 83101 tacaaggcgg accccatttg aagcaggcga aggcacccaa tagagaacct gcagtgagca 83161 tctcagaggc acatggagag gcctgcacag aattctctat cttttggcca taaggttagg 83221 aggtgctggt ggggccagag gagctcagat aagagccagc tctctatatg tggggctcat 83281 ttccatgggg ttgtgagaga ggccaatcct tcacagaaat tagaatctgg atgcaagaag 83341 gcagccagga tcagatctgg ccagatggcc ccacggcaga gatggggaaa atgtcacctt 83401 cgaatcctaa gaagctaggc tcctgtgctt tgccaaaggc ctcgtgggca gcagtctgta 83461 gctccagcgg ttttgcatgc cagcctctgg actcacttaa cttctgcagc aatctgttgg 83521 tccccagagc cactctgcac cctccaggag aacctacata tgggcgctcc agggatttgc 83581 ccgcattagg atcaggagcc ccaggaagag ttagatggga atccaggcat cctgggccct 83641 gtgctttcgg tggacagcta gatctttctt gtcaaggtgg aaatagcgtc tctgccctgg 83701 gtgggaaccc agacccagcc ctgcaatcac gcaggcaata aagtaagcac tccactgtcg 83761 acggggaggc cagggagaga gaggcttctc tttgaggcct cccaccccca tccaatccag 83821 gaggttgcag tcgctgtggg cggcggcgga ggatgctcgc aggacaccgc tgcagttgcg 83881 acctcttccc actagatgtc ttcccagtta atcggaactc aagggcgctg tctctgctcc 83941 tcgggccgag actcggtttc ccctgccgct ttttgaagtc tagcattcct ctgtgaaagc 84001 tggcttttcc ctttttttcc tatactcccc ttcccgccct gcgctctaaa tcctggctct 84061 ggaaatccac cagaatcaaa tgctggctcc ctgcgggccc gcgtcggagc ggctcacgcc 84121 attggaatct ttttatgacc tttgatgtgt ttaaataaca tggcaagtct gagcacacac 84181 ctcggattaa actttaatcg gctgcagtca aggtcgccca ggtggtaaaa gcgtttcctc 84241 cgacgtctcc tggcagacgc tgcggggtcc aaggtctggg gcaaagccct aggcccccgc 84301 cataccctcc tgcctctgtc aggcctgggt gggggccagg cccagggcca ccccatcgag 84361 ctccaaagag gactaaggaa atgggatcgg aggtagacac tcagggtggg ctaatgacct 84421 gagcacagtt tttatagctg ctagacagaa tagttaaaaa ttccaagaag cactttcaat 84481 ttcccctgac tccagaaact ggttgaaggt ctagtcatgg tcagtgcctt attgcctcct 84541 ccccatggct ctgctcccct agggagaaag ggggtgctgg taatgatgtt gcggccaagt 84601 tagctgactg tttttaaggt caccaaaccc attaactgtg ggccaggatc tagaacccag 84661 gctaaagcag cccttctgtg tgtcaggcta gacaaggttg gcttcaattt tatctgggat 84721 gagcatatct ttctgagcca gtatacctgt catggagaat tgtcatgggc agtatttagg 84781 aaaagctcca agaacagggc agcatatgta gtgagagctt accactgtgc ttttaaaaaa 84841 gaaggaatgt acaaaagggc tagaaggaca gacaagtcat cagtaacagt gcttcccctg 84901 ggaagaactg gtgttcagag gatctgagta gagaattttt tttctataaa atctttttgt 84961 agtgtttgtg aagtgcacca attgtatgta ttagctattg aggttatact tttcagaaaa 85021 ctggttaccc tatgtatctc atggccacat attcctttgg ggaatttcct ccacacatca 85081 taccagcctt ggattcaggg gagaacactg aaataagtta ccccaaggct tgaatcttag 85141 tcttacggac atcaactcag atttccatcc cctgggagag gttctcaggc ctctctctct 85201 gggcttaccc ctaacttgta aatgaacatg ccagccccag ccattcacaa aagagaatgc 85261 gttgtctcaa aacgaattaa agcctgatct tcaattttcc cccaatccct gagtgcctag 85321 gggtttgcag aaaaggccca caaatgctcc tccagaccag agattcccac agccccttga 85381 catgtatatt tggtttgtca attcttgtct tgaagacact atcaagtgtc ttcaagggcc 85441 acatactaga ctgacctgtt gatctcttga cttagagata acactatcac tgtctaaaat 85501 tagttctcat agtgcttctt catggccttc tgggggctgc taatactctt tatcttgatc 85561 tttgtagtag ttacacaaat gtatacatat ataaaaattc agtgaactgg gggcttaaga 85621 tttgtatgct ttgctatacg taagttatat ggcaataaaa aaatttttaa gtgcttcctc 85681 ctaacaacct gtgaggttgg tcccaaggtt attacaatca tctccatttt acagacaaag 85741 caactaagac tcaagggaag ctagggtccc aggcatctgt ggaagaaaat aaccagccag 85801 agactaacca gcagcttctt cctctgggtc ttgctgcttc aatatccagc tttcttttcc 85861 agcagatttc tcaggtgaca ggagccggtt cagtatagag aggctctttt tagagagatt 85921 gtgaggtagg aacatgtggt ctgagtcttc aaatttttta gtcttggcgt agggagtcag 85981 ggagggcacg cgcccagagg gtgtggggcc tagagggccc cacttgtgtg cctctaatga 86041 ggaagagggc cctcagggct tggaacatat tttcttctta ttccagtcct ccaagcccac 86101 acgcacccga ggctcaggag acctcttcag gcgaccgcac ccttccgctg aagtccccca 86161 cttctgcccc tccgaggtgg ggtgggggtg ttctgaaata acggtggctg cagttagatg 86221 gctgaggaca tagaaagcgg ttgttattta ttccaggcac catcacaaaa ggcttccaaa 86281 aatatcgagc cgagatattt tattgtccag cctgtctgca gcctgtaatt atgcgttcaa 86341 gatgtttaat gtgtgcaaat gcattagccg ctttcgctgg tgaaaaccct agctgggggc 86401 ggggtagagg caagtctgga gataattatg tcgtacagtc gcaaacatta ttccgttctt 86461 actgtaaacg gccccggcca cctttacgag aaaccaggaa acttctgaga gttactagca 86521 gcgtttacgc gggcaaactg agttcttttt ctttctctcc cggattgttc gaagtatcta 86581 tcgggcggct tcgatgccag gttcagaggc gcgccaggga gagggcgccc cgcagaggag 86641 cgcagcggag aggcctacgc aggtccccgg tgcccgcggc cctcggaggc cgggccctgc 86701 gtcttggcca ggcactgggt ggcagctgag gctggtggcc cggagccctc gcggccgcgg 86761 gcaggcccct tcttgggcag ggtcgggcac tcccgctgtc cagggctctt cggcaccctc 86821 cttccaatca ggtcgctctc ccctgctccc cagactcaac tcctccgaag ctgctccagg 86881 ttgaaatgtg accgctaggc cgactccctg ggcccgcgag cagttctcga aaggtgcgga 86941 ctgagccctt tctggggtgg ggtgcgggtt ggttctcgca agtgtgaccc agggtgaact 87001 tgctatttcg ggtcccgggt gctgcagggc caggagaaca gctgggatgg gggacccccg 87061 cctccaccct cgggccggca cgtccgcgcc ctgtcaggtc cccctccctc ctctatgatg 87121 gccaaggcgt gcgccagggc tatccgggaa ccttgtaagg cctcgtgctg gcacctaacc 87181 ccactcgcgg cacacttcct ctatgtagtc tgcggccccg cctgccaaat gagagtgacc 87241 agtgcaggga cagaatgcca ggctggtggc cgaccgcctg agggacaaag gcgagcattc 87301 acaagccaac agcagacccc tgccccccat atttccattt cgctcaggct tttaggacaa 87361 aatcaacaag gccgcagagt ggtgcaggcg ctcaccccgg gtgacagcct ggggagccac 87421 tggttccgcg accctgggca tgaaactcct caagggcggc cctcgagacg caggggagag 87481 gatgctgccg gcgcctgccc gagggcttct ctgcgggaag cgggcaggca ccccaccgga 87541 gtcattgccg ggaccctcag cgcaacgcgg gcctgtgtcc tctcgtttct ctttagaaaa 87601 agactggatt ttaagactcg ttttaggcca atgatttaaa acaaaaccaa accaaactac 87661 tgggcgctcg aggagtagtg tgggacacat ttaaaaaaaa acttgtgtgg gatccagggt 87721 gcattttctg gggcctggca gcgctgctgc agccccaggg catggagaac agggggacat 87781 ggaaagatct gggaggatga tgaaacccca gggagggaaa cgctcacaca gagaggaaaa 87841 acaaaaccca ctagacgtga tttgttcctg ctccaaatga atcatctaaa gagaatccct 87901 ctgggctaca tttactgggg cttagggtct ggggccctca gatccaccag tggtgggggt 87961 agttccctga aatttgcact cccaatctcc cccgtgaagg tgcaagctgt agctacaaag 88021 ggaaaaggag aggaagaggc tagtaggatt cctgggtggc tgcagcccca aagtaaaccc 88081 catgctttga gtgcctctgc ataacaaagt ttcccaggcc ccccagactt ggaggggggt 88141 ggttagttat gcagcaacta gacgtttggg aacagcaaga ccctggcagg aggtgcagac 88201 taaagctttc aggttcctac ctcttgttct ttcctacccc ctgtgtctgc ctttctctct 88261 gccacctttc cagagactca ggcctccagg agcaagtggc tttgctctga ggccctacct 88321 ggatgggcca gagagactgg gaagtttaag gacaggtctc acacttggtt cagatttcag 88381 acacctcaga ggtggctata acttaaagcc tgcaaaaata aactttaacc acattcgagc 88441 gtcccccaag cctatagttt tgaaggccta aaatgtgcga tcttttaact cagaggttgc 88501 tggttgcctg agaggctgag cagaatttcc caagcaagag gggacaggga aaaaggccat 88561 tgtttctgta gatatccagg atcagttcaa aatcagggct cactgaagag ggcatgtgtc 88621 ctgtgactgc tctcttgaat gtaatttttg tctctctctc tctgtctctc ttgctcccat 88681 agaaccgtgc cctcacagca atccagtgtt tgattctccc tatagaaagt gaatctgatc 88741 agggccttct tcactctcct cctccttctc ttctgttctg cagctcattc aaaagagccc 88801 atagaaacac agagagaccc atgaggagtc tcttgccact gggactgact ctgcctggga 88861 cccagagacc agaaatggtg agtggttttt cttcaaaggc tgctgggtcc tgaggctggg 88921 attgcctgca cagcgagagg gcctcccttg cagctgcacc caaccaggga aaggggtgga 88981 aagagttttt gaaggactct agaagatggt aagcagggcc acctcgaaga cctagatttc 89041 tcaactctac atctcctgga ggagggattc tctactggga gtgatcagtg ccaattccag 89101 tttccagatt tggtttgcct gcctaggaaa attggagcct catttaaatc ttttcctaga 89161 agtcttgtgc tcattctttg gctttctgtt ttctattagc cagagactca tgtaagataa 89221 tctgggtttc tccctccgac ctctgaaaca tgtaatttgg gggagatact gtatttaaaa 89281 tgccagtccc agtaatgagc ccagcttagc agtgtgagct cccaccctcc ccttaatgaa 89341 cagagctgtc gagaaataat ttcactttga tctagtttgt gacctgagtt tttggcttaa 89401 agaagactgg gcagctaatt ggattcttca gttgccttta atctccacct caacacttgg 89461 aacctgacca agctgaatta aaacaagtct gataaacaag atgtagctgg ctcccaggtt 89521 aacagtaaca agaaaatccc caccaccact gttgggttta atgatcacag catggttggc 89581 acctcacaaa aaacatgagg ctgtaggccc gcagaacctg cagcacggaa gtcttcagaa 89641 tcctttcccc tccctcctag ccccctggat gtagcttaat tagaaagaga aggaggggta 89701 gagggaggtg gggaagaaaa acttacagtt ggtgaaagaa tcaggttggg ggttcatctc 89761 aaagggaggc aggacgttga ggagactttg tccctgggag aggaagcagg ccgagccttt 89821 gagggctgga cagtaaagat tgcaatttga atttccaact gtgtttcaag aatgagggga 89881 tgcaggaggg aaataggata tgaggggcag ggttatgcag cctggcgggc tgggaatcat 89941 ggattgtatt taatcttttc agatgccagg gttttgtgtt ctaggagcaa agcagaagga 90001 ggaagaagtc tgctagacct ccccattctt agtttggggc ctgagaaggg attctcttgg 90061 actcccctct ggatgctcct gtctgctgcc tgcctaagtg gctggaggag ttctgaggga 90121 atgctttctc caaggcagca ggggtttggt ctaggctagg atttctcaaa cctccacaat 90181 attgatgctt tgggctggat gattaattcc ttgtcgtgga ggctgtcctg tgcgttgtaa 90241 gatgtttacc agtatctttg gcctctactt acagatgtca gtagtgccta ccccagttgt 90301 gacacccaga aaagtctcca gacactgcca aatgtcccct ggggggcaaa aattgccccc 90361 acttaagaac caccagtgta gacctctgtg aggcaggcat ttctgggaaa aaagtgtagt 90421 gctttgcttg aaatgatctt tgtatagttt cagagagcta ggggagcagg gcggggtact 90481 cccgctcctt ctttgaacct ctccagcagg gcagatcatg aattccttcc cagggctgtc 90541 ttgatctcct tactgtaggt gctacacctt cattaaggct ttgagcaagg acatagctgc 90601 cagggcatat aatcctaatc tttagacaca ctgaatccag gataagtgtg tgtgtgtgtg 90661 tggggaaggt ggtgggtggg taggtgtata tatgggtctg tataagataa atgctaggca 90721 aaagagatcc aggcctcacc ttttcctgca gctttgaaag ctctttcttt ggcatacttg 90781 gccttgaagt gggaaacagc tgggttttca taaaatggtt gttgaggtta tcagggaatg 90841 gaaagcctat tacacatatt gaggtgagag gtagtgtcta ggatgctgtt accagattct 90901 tatgcagtag gtggtcttac agctcctacc catctcttcc ctctagctct gactctcagc 90961 aagctcttct cacacaggct ttctggaaac tgttaagatc ctagaaagta ggagcttgac 91021 ataatttatt gaacacttgg tattactatt ataatgctga ttgaaaatat ttaaaaaaat 91081 aggaggctgg gcacggtggc tcacacctgc aatcccagta ctttgggagg cagaggcagg 91141 tggatctcaa tgtcaggagt ttgagaccag cctggccaat atggtgaaac cccgtcccta 91201 ctaaaaatac aaaaattagc tgggcatggt ggcgggtgcc tgtagtccca gctactcggg 91261 aggctgaggc aggagaattg tttgaaccca ggaggcagag gttgcagtga gccgagattg 91321 caccactgca ctccagcctg ggggacagag tgagactcca tctcaaaaaa aaaaaaaaaa 91381 aaagaatttt atcacaatcc agacatctca accagatcca cccatccctt ttctttctct 91441 cttctttctt tctctcttct ttctttttct ctttctttct ctctttctct ctttctcttt 91501 ctctttttct ttctttcttt ctctctcttt ttctttcttt ctttccctcc ttccctccct 91561 ccctccttac ctacctccct tcttccctcc ctccctccca tcctcccttt gtttcttcct 91621 ttcattttct ttccagaatg tgttcacata gaacacaaat gaaggatggt aaagattaag 91681 agaaactgat taactgacaa agacaattct atcaagtttg cattgcatta atagcaataa 91741 caatagggtt aagttatgtg acccctatat atatttttca catctagagt gggttatagt 91801 cacttagggc taggcatatt tagctgtcag cattgccaaa tgaattaagc tttaggtatt 91861 tgactaaaat cactggtcga atgcttacag agtggtcatc actttacaaa tattaactca 91921 tgtaatctgt tgaacacctc taggaactat tataatcatc ttcatctcac agatggtgaa 91981 acagaggcct agaaagctta agtaacttgc ccaaggtcat ccagctacta agggacaaca 92041 ctaggattgg actgaggcat tggatttagg gtccatattt tctattttct ttctttcttt 92101 cctttttttt ttttttcttt ctcaatacag agtttcactc ttgttgccta ggctggagtg 92161 caatggtgcg atcttggctc actgcaacct ccgcctcccg gattcaagca attctcctgc 92221 ctcagcctcc cgagtagctg ggattacagg catgcgacac catgcctggc taatttttgt 92281 atttttagta gagacagggt ttctccatgt tggtcaggct ggcctcgaac ttccgacctc 92341 aggtgatctg cccacctcgg cctcccaaag tgctgggatt acaggcgtga gccacctcac 92401 ccagctgggt ccatattttc aatgactaat ttctgctgcc aattgttaat atgctgggta 92461 gggacagtgg cccaggggac agctgttggg gaggtgggca tctctaagga tgtagccact 92521 gctctggaag cctcattctg agctttgggg ccggattcct tgtccacagt gtgtctatta 92581 atataagcct ttctgataat tgaaagtaag ttatgggaaa tgtaatatat atccccggta 92641 tttactgcct cgggaacaag ggagacataa tgaaggaggg aagtttgtaa ttgcagaaaa 92701 acatcagttc taggaacaga aatcagtggc agccagagac gttctcccca ggtttccggc 92761 cctgctggtg gcccctccac caggctccat acaggcagga acccccagaa gaggaaaggg 92821 gcctgtaata ggagtcatca acctgtcacc acggctaggg tgaatatggc ctctgagttg 92881 ggggaagggg ggaggtttac tctttagcac caccagctgc agggatgggg aggggctgtg 92941 gagggtgggg tggggagggg aggctggagg tggagctggc cttctctatc cacagcccac 93001 agcctcctca gaacagacac tgggctagta acgtgcgatg ctgcggtcta ggtccctggc 93061 cccacttggc actaaggtga gcatggtggt ccttaccctc aggcagcctt cacatctata 93121 agaatgtgca tacatgcgtg ggtttgcgtg tgaaggaaac atggagggtg acacagctac 93181 ataaatgatt gcaccaaagg ctgagataag aacctaggga tggagcagta tgaaccgggt 93241 gccaagtggt gtggggtgtc agggtctggc tgagggtggg cagcaaggag gctgagataa 93301 tgacctggac aggtggagag gggccgtccc aggtggagat gcaaggagga tgctgggaac 93361 aaactctggc tcttgggcta gaggggaagg gagggtaaag gaggcattgt ctggcaccta 93421 ccaggtgctg tgctttgcag ggatgatctc atcgaaccct cacaatatcc ccatgtggta 93481 ggtacttgtg ttacccaaat tctacagatg aggaaacagg cagagagggt cagtgacttg 93541 ggcaaggtcc cgtggctggg tggcagagag cctctgtgag ttgctctgaa ccggtccata 93601 ggaggaagcc cagcacctgc agtgaagccc tggtaaaggg gaaacttagt tgggaggctt 93661 tgattttgaa ttgaggcctt gcttgcttta agaaggttct cataaggaaa agctgcatgc 93721 gtcttacatt tctgtatttc ccccgtgcca ctgccagccc catcttataa gttatccgcg 93781 agtgatgaga agtcccccct cctccccggg gtgagtgaga gggcaccatc cctggggcaa 93841 gggcccccag gcgcaggatg cacgaaccga gggacacacc gccgccagac tgacctggtg 93901 tggcggtcgg gcggggccgg gccaggccgc gaccgcgaga aaccacagcc ccacggagga 93961 ggccgggccg cggggctggc ggggaccctg caggccgggc cgaggtgcgg tgaggcctcc 94021 tcccgacctg gccgcgtcct cagagttcgc tcggggcttc gtgtttgcag agcagcctcc 94081 cgcctgcccg gcttgcccgg ggatgtgggt ggacccgccc cgcgcggccg cggcccagtg 94141 caaaccgtga tccaccctct tccgctcggt gggaggaacc cggggctttg cgcccctaac 94201 cagcagcgtg accctcgcag tcagggaact tctcatctgt gaagaaagct taagaatgac 94261 acccctgaca ggagcatacg tgcttgcctg cccatagggg acaggaacaa acgagatcat 94321 gtattccatg aacacgcttt taatgcccac ctactatatg acaggtgctg tgctaggcac 94381 ttctgtattc attctaaaat attaattttg catcgtgtcc ttttggtata ctttattgaa 94441 atactatgta caaaaagtgc aaaagtcata agatcatcta acaccaacaa atatttattt 94501 tcattcaaca aataaatgat tctgtaaaac ttattggcaa tgtgtgagag tattgcacac 94561 tgcaataaag tgttgtcagt aagaggcctc ttctctacat tacctgctgt tgatctggga 94621 tgaatttacc tgagtacttg cttcttctgg gactggagat gggttttcgg ctccattctg 94681 tcactaaaca ccctggctac ctcctgtgat ctctctggaa ctcctcagtg gaatgaggcg 94741 gctggccttc acactaagga ccctacttcc tgtgactgat gagggtggct tgggtgtatt 94801 tcagaaataa taggacacaa tttgagtgtg gcttcttgta tttatgatta ctaacacaaa 94861 tgttatccac acattcatga taatacacat ttgcaggtga atgtctttgg cagagctgct 94921 ggaagaaatg tgaggcctgt gactcatcat gaaatcccaa gaaggtttca ctactttacc 94981 aactgagaag ttctcatagg tggctttaga gaagagcgtt tcttcttgtg caaatagaat 95041 ccttctactg agcagtcttc taaagcagtc atctcaaaaa tgaagataat aacactactc 95101 attttatatt ttatgcagaa agcttttaga acagtgcctg ggcacacact gagagtttaa 95161 taaaagttag ccacaattat tattcactct ttgacactcc cccaaaagct tatttgtctc 95221 cattcttccc tctttaagaa aggaggtagt aggttggaag gtctagccta attgttataa 95281 ggtgctgcca tgaatcccaa cctaaaaatt aacagtagga ggcatagatc cctaagtagt 95341 tttctcccta cacattcatc ccttgtgaat acaggatgat tgggtaaagg tccaacctcc 95401 cttcctgggc tccctggcct ggagaggcac cagggcactc cctaaggcaa cctcatcaga 95461 aatcagaatc ctcatcagca gacctaagtc cttcaggcca ccagtgcctt cccaaggggg 95521 ttaaaagttc tctaacaagg gcagtgtcag gtatggagaa aatgatggag taagaattcc 95581 ttccatatga gtggagattt acagttgaca aagcactggc catttctcca tgtttagtgg 95641 ggttgatagg aagacttccc tgggttttgt gtgtttgttt tcgtgatgat ggaagggtac 95701 tgagtgcaaa gagcctttgg tgttttgaat gcaaggtgac tctgtctttg gctctggagt 95761 tacagacacc tagatttgag ccaccactct gccatctatt tgctttgtga ccttcggcaa 95821 gtctgtttat atttttctta actttagctt cctcaactcg aaaatggatg taacaatatc 95881 acctagttag cccataaggc tgttgtgcag tttaatttct atgatgcaaa taaagcactt 95941 agctcagtgc ctaaagtaat gtagatgcaa tactagttag ctttcaatat atattatcaa 96001 tattcttata tgattctctt agacatggca gaaaaaattc attctgattt tataggtgat 96061 aaaaccgtat ggaactactg atatttacta tagcatggat gaacctcaaa aattgtgcta 96121 agtgaaagaa gccagaccca acagactacc tgttgtgtga ttatgtctgt ctaaaatgcc 96181 cagaaaaggt aaacttatta agatagaaca ttagcggttg cctcaagctg gacgtggaaa 96241 ggggaattaa cttaaaaaaa aaacgaggga tctcatgggt tgagaagatg aaaatgttct 96301 gaaatggatt tatcgtgatg gttgtaccac ttggtaaagc tactactaaa aatcattgtt 96361 ttgtacacct gaagtgggta aatgttatga tgcataacat atacatcaat aaagttgttt 96421 taaaaaccca aaatatcaca ggagaagagg gggaatagaa tctaggtttc ctattaggtg 96481 ttctttagac tctgctacct cccattgggt aggagagaga aggcaaacag caggggagga 96541 gctatggtta gaatctccgt taaataagag aatggaacta agacaattct gagaatgtgg 96601 aggtggtagt ttcttaaaga gggaagaatg gagagaaata agcttttggg ggagtgtaaa 96661 agagtgaata ataattgtgg ctaacttttg ttaaactctc agtgtgtgcc caggcactgt 96721 tctaaaagct ttctgcataa aatataaaat gagtagtgtc attatcttca tttttgagat 96781 gatgaacttg aagcatagag aatgtaaata ccttgaccac agtcataagc taatatatgg 96841 cagggtcagg atttgaaccc aagtagcttg ttccaaagca cctgcatttt accctgatgc 96901 tattaataca tataaaaaga aggagagaaa tgtcaggttt ctgggcaaaa atctggagtt 96961 acattgtgtg gatgcgtgag cctggaacat tctgagactt tgctttaaaa atgtgatgtt 97021 gttttatata taaatatgaa ccacaccaga tatcttgctg ggggggtgga tttgctccac 97081 catctattgc atggataata tgcagcatgg atatcatttt tgtatatgaa tatcttaaaa 97141 tgccagacaa tcccttcact tctgtgtttg aagtgtggct atgctcattt cccagagtat 97201 gaactgagct cacgttttgg gaacaacatt cagactgatg taaacaatct cagaattact 97261 tggaccatgt tgccaagctc catttcccct cccccagtct cattccgggt gcctccttca 97321 tccctgtgcc ctttgcctca ttcctgtccc tctttaccct tccaatgctc ttttctccat 97381 gaaataggga gtctttctgg attgaataga aaaattatgc cttaaagatc cattcttttg 97441 gtttgctgca ttgaggaaga atatgtgtgc agctagaaga aaaaacaggt catatctata 97501 cagaacttga aaagtaagga ttaaaaattt caatgcttcc aaggcccaaa aaatattgca 97561 ctattttttc atttaaatta tatgtataga ttaagtgaaa ataatattcc catttctttc 97621 attcttttgg tttaagtgat ttaaaactat ttgcatgtat aaaacaggca acttgtaaat 97681 ttaagtgaaa ggaatgcaga tttcttggat gctgtacagg gagggagagg ttgaaaagaa 97741 aattaacata gtaatgaaaa gattagagaa aagtggtcaa gtgatcacaa tctggtggcc 97801 ttgggatggg gaagaggtat aatttttaaa aacctttatt gttttgtgtg ggaggaaata 97861 cacaactaat gcatgttcat ttaaaaagat aatcagtttt tgaagaaata gtgtggcaag 97921 ttaatgcccc atatgccccc ataatcccaa agtaacagtt gaatgtatat tcctccatat 97981 ctatgtgcat ataaaatata catatatttc ctatataact atccatgaaa ttaagtcact 98041 aatcatatct gattgcttgc agacatttgc ttacaatttc tttctccatt tttacaaatg 98101 tggaatcata ttacacaaac tattcttttt gcaatttatc ctttttcatt taaagtaagc 98161 ttagacctgt ttgcctgtga gcacacagat ctcgtctgaa tatagagtat tcctttatat 98221 gaatgttcat cattttattt aaccatcatt ttatttataa attgatgact gtcagctatt 98281 tccacttttt ccttatcact aaaataattt agtgaatatt cttgaatgca ttcattcggg 98341 aatatttact agattatttt agtgataatg aaaactatgc aaattttcag aagtgacatt 98401 tcagggacaa accgaataat aactgtaaat tcataaatgt caaaattgcc ttcaaaaagg 98461 ttttgccgtt ttgttttttg tttgttttaa cgttccacca atggttttcg ggtgacggtg 98521 tccttgcacg atgtagtccg tcttcaaagt tcttttacac aacgtgaatt atattgaaat 98581 tgggctactt catagagctt cgcctcagtg gaggacagcc tggaataggt gagggggtct 98641 cgaaaggaat accaggccgg ggagtactgg caagtgtcgg aggggaaggg caggtcgggg 98701 agaggatccc caggggctgg gtgtctgcag ggctccccgt aactcagagg ggccggcggc 98761 cgccgcccga agcctgagct tttctcagca gatggagaat tcatttttaa cccccaggaa 98821 ggaatgggct gagacatcag accaccgctc ctagagggtc aaatttcctc cccataaatc 98881 caaaacattc tctttaggga gttccctccg tggcgcgcaa acccgggaag ccgcgcgccc 98941 tggggtttct tggcgggccg tgcgcgcagc ctgaaccgcc aaagtcgcgt catcccgcat 99001 ttgcctgcac ccaggagttt gcaacccgca gagcccgcag aggccaccgc caagagtgcg 99061 cgccttggcg ccccgtcggc ctctacttcc tcggatctga gccagcgccg ctaatccgca 99121 ggctgcagca gaacctgtcg ccagcttgga gagtccctcg cgctgccctg gcttcccggt 99181 ccgcgggcga ctcgcgctgc tcccgggttt gtcgggaccc caggagggcg ctcaagcctg 99241 tcggccccta cttgtgtgta gagaataggc ccgggtacac cacctcagaa atgcagagga 99301 agccggggac actgaagccc gggtggttgc agtgaggctg aggatctcag atgggtgacc 99361 cccagctctg gccagtatct gaggccactg cacgtggggg acactgcccc ccaggttccc 99421 acctagttgg tctcttcctt gggtcagtgg actgcagatg actccaattg gcctagtggg 99481 ccatttctgt cctagagtca ttttttggct gatacagaat tttataataa ttcaaattag 99541 ttgccaacat gtagaaacac ttgaaaatga aatatctaac aatgatatca gctggagctg 99601 aattctgtga ctccagtctt ctcaattccc aaatgtgacc atggccattt ctctcatttc 99661 tgttacattt ctgttaccag cccactcctg taggcattga tgtgtgtccc ctgttggagt 99721 catagaatac catggtgggg gcatgaaaat agaatatatt ttcatatatt tcagaaaaat 99781 agacccttta gctcccactg atatttgatg ctgtgaattg gtcaaggtta cccagggtgt 99841 ggcagagttg agattagcac tttattcact gatgagagcc cagaggaggg aagagacttc 99901 tgcaggacca cccagccaga gagcggcagg ggcacgccta gagcccacgt ttctgaccat 99961 tagccccttc tcctgaccgc acaggggccc cagtaaaaca cagcctatga ccaaagagcc 100021 cttgcttggg ctctcaggct ttgacccaac acctagtcat tggtcagacc aactgagctt 100081 ccactgcctc tgtctgtagc tatagaatgg agggagtcca tgagggctgg cttgtgtctt 100141 gctgtcagct tccaggttct gggttcttgc ctaagaaaca actgcatcga actgtgggca 100201 gatattcctt atcttcttgc atagtctaaa aatacacctg gccatactat agcaattgta 100261 tttgaactta ctgatccaat cccattggaa atcagtgcat ctagctatta ggggaatgtt 100321 tcccaaacct tcatacttgc cacctccagg gtaaagggag actggaaaag aaacagccat 100381 gtcacttgcc agccttgtca cagttctgag ttctgtccta catttttcat ttgggagcca 100441 gaaagaagtg gctagactat cagaaggccc ataggggtgg aaactcccta tcttaaaacc 100501 agtacaggga aacacagtga agggaaaatg aatggctgga gataacagcc agtgtgggga 100561 aaagaagtac gtctgaaatg gcttcttatt ttacttggga gcattcccta cccaagacta 100621 aagcatgcaa agcaggccag agactttgat gtttctcctt aaggaaatga aagtttgagg 100681 catttatgtt aacatatgga atatctatct gtgactgcat tagctcttca gacgtatctt 100741 cttacctccc taggagtaag tgaacagctt acaaaagctg cagtttctgt gggagaaact 100801 caatttggat ctgaaatcaa gaaggcttaa caacacccac tctgtatgaa attaccaccc 100861 cattttaata aagtccaaga tgtttaatag ctcacttaag gtcttacagc acattgggag 100921 cagaaataga agagttaact gtttttctca tttggtctcc tgagaaaggc ttgttcattt 100981 taattctatc tccagaaaaa cgttttattt gtcctcttca caatccctcc ttaatatctc 101041 acttaggtgt tatcactgtg caggtaaaat aatttatttt taaagtggca tgttgtacac 101101 ataaaaataa ttgagtctgc cggcctggaa aaaggttcag ctgcctgcct ggaaaaaggt 101161 ttactgtagc ctaaatttct atgaagatgt ataatttgtc agtgaaaaat atatatgaac 101221 acacatgcat acccctttat taccactcag aaagacttgg gaaacttaca ttttcttctt 101281 ctctctctct catttggaaa acatggctcc ttcatttccc attctcttgt agaccttcgg 101341 tccttgtgac cctgttctct ttacctttat gtttgcaagg cagagtacgg cacaggcata 101401 ccatattctc atcacctata ggccaagaac ttggtctcag ataatagtgc tagtaaggaa 101461 tagctctggc ttccagtctt agaaaactta aaataaccag ggcttaaaca ggataaaagt 101521 ttacagtttt ccacctaaaa gcctggatag gggctccagg gctggtgtga tgcttcatga 101581 tgtcagagat ctgagctgct actatcttgt tgctctacct tatccaggcc caggatgtca 101641 gtagacaagc tccaggcggc agaatggagg gataaagaaa accactggcc ggatgcggta 101701 gctcaggcct ataatcccag cactttggga ggctgaggtg ggtggattac ctgaggtcag 101761 gagttcgaga ccagtctggc caacatggcg aaaccccatc tgtactaaaa ctacaaaaat 101821 tagctggatg tgacatgcac ctatagtcac agctactcag gaggctaagg caggagaatc 101881 acttgaactg gggaggtgga tgttgcagtg ggccaagatt gcgccactgc actccagcct 101941 gggtgacaga gtgagacttt gtctcaaaaa aaaagaaaaa gaaaagaaag aaaaaaaacc 102001 atgcaaagag cttgtacatc tctgtctttt aaggagtgtg cctggatttt gctacaggac 102061 atttctgctg acattccaat gggccagaat ttatttggct ctaaatataa gtggccaccc 102121 actagtatat ttgttccagg tatggagttg ttgcttcatt cctcttcctt tcagtatata 102181 aacttactga aattccttag gctcacattt agggaagtaa catgcgacaa tagtaatagc 102241 taacatttat tgaggattag tgagtgctag atattcttgg aagtgctctg tgtgtactaa 102301 ctcattttac cctcatatgt ccccgtttta cagatggggc caaattagaa cccaagcagt 102361 ctaatttaag tatgggcact taaccattat tctaaattgc cactataatt aagacggcaa 102421 caattgtacc ctgttattat ttatggatca tcttacgcct ttactacaaa aggaacattc 102481 ccttatcaca ccgggcatgc cttgattttc ttctcactat tccaaatccc agtgaatgtc 102541 ttttgcaaaa aggacacaaa atcctggatg ccatttacta tagcactgca aagatagcag 102601 ccagataatt ctttatttca atttcacatt taccatctcc acatctgctg ctttccaaca 102661 aatgcattaa tgcaccctag ttttgctctg tagttattta tttctgggca tatgttaacc 102721 accacatagg gagagggggg caaacatgtg atgtgattcc ctgagcacca ttcaccctca 102781 cttctcatcc acagctgctt ttcactcaga agagtggaat gttgagaaaa agaatcacag 102841 acatagtaga gagtgaagta gggggaccgt caaggtgaga gagaagcagt aacttgcatt 102901 tgcggagtac ccattgtagg ccagacactg tgcaaggccc tttccattag aacggctctg 102961 aaaggtaggc cttgtattat tttagatctg acatcctgca gctttggtct gcccaaggtc 103021 actggtgagt aagctccaga gctgaggttc aagctcagga ctgtctggct tccaagtcca 103081 ggttcttgac atgtcatcaa ctacctctga agtaccagaa atcttggctg ggcaagtttg 103141 agagcactgc tccaccgttg attctgttct cctagctgtt acagaccagt ttccatcttc 103201 ccaaatttcc ctgggatctg tctgtcttcc tgctggactc aaaatccagg cctcattcca 103261 ccacacacca gctttttgct gctcctctcc tgctgggtca gaagatctca caatatatat 103321 tcatagagaa agtggattca tggctattgg aagtaagatt tggtgtttac aagcatgggt 103381 ttttccttga ctctcccact tactatctgt gtgagctcag gcaagttgtt taagttcttt 103441 aggcttcagt ttcctcattt gtaaagtggg gagacccata cctccctctg agggttgtta 103501 taagacttag agccagagca catgggattt ttctttttcc attgcctgta ctgggaaaag 103561 gtgagggtta tgagggaagg acagttctaa taaaaggaaa attggccaat ttttggagtg 103621 aatctggctt ttcagttgtt aaaggtccat atctaagaga aaatcatcac ctgactgttg 103681 ccacttgtaa aagtgatacc accctttcct cagggctcaa gaccaggaca gagttaaatc 103741 ctggatctgg gttcaagaga tcttcccacc tcaacctctt gagtagctag gactataggt 103801 gtgcaccacc atgcctggct aattgttttt aatgttttgt agaaacagga tcttgctatt 103861 ttgtccatgc tggttttgaa ctcctgtcct caagcaatcc tcctgctttg gtcttccaaa 103921 aatgccggga ttataggcgt gagtcactgt gtccagcctc atgcctattt ttgaaacaag 103981 gacctatctt tcatctttga ttccttcctt ccttccttcc ttccttcctt ccttccttcc 104041 ttccttcctt ccttccttcc tccctccctc cctccctttc ttccttcctt tcttttcttt 104101 caagttttgc tcttgttgcc caggctagag tgcagtggca caatctcggc tcactataac 104161 ttccgcctcc tggattcaaa caattctcct gcctcagcct cccgagtagc tgggattaca 104221 ggcaccaacc accacacctc gctaattttt tgtatttgtg gtagagatga ggtttcacca 104281 tgttggccag tctggtctcg aacccctgac ctcaggtgat ccacctgcct tggcttccca 104341 aagtgctggc attacaggcg tgagccactg cacccggcct gaattatttc taagaatagt 104401 aaggtaaact ccagaataat taggaaaacg aagttactat attgccaata atggaagtta 104461 tgaaacaact ttaagtccat tataataaaa atatacatta acaatacatt ggtcaggcat 104521 ggtggctcac acctgtagtc ctagcacttt gaaaggctga ggcgggtgga tcacctgagg 104581 tcaggagttc aagaccagcc tggtcaacat ggtgaaaccc tgtctctacc aaaaatacaa 104641 aaatttaggc caggcgcggt ggctcacgcc tgtaatccca gcaatttggg aggccgaggg 104701 gggtggatca cgaggtcagg agttcaagac cagcctggcc aagatggtga aaccccatct 104761 ctactaaaaa tacaaaaatt agtgctgggc acaatggctc acacctgtaa tcccagcact 104821 ttgggagacc aaggcaggtg gatcacctga ggtcaggagt tcgagaccag cctgatcaac 104881 atggagaaac cccatctcta ctaaaaatac aaaaaaaaaa aaaattagct ggtcatggtg 104941 gtgcatccct gtaatcccag ctacttggga ggctgaggca ggaaaatcac ttgaacctgg 105001 gaggcggagg ttgcggtgag ctgagattgc accattgcac tccagcctgg gcaataagag 105061 tgaaactctg tctctaaaaa aaaaaataaa ataaatacaa aaattagcca ggcatggtgg 105121 caggcgcctg taatcccagc tactggggag gctgaggcag ggaattgctt gaacccggga 105181 ggcggagttt gcagtgagcc gagattgtgc cactgcactc cagcctgggt gacagagcaa 105241 gactctgtct caaaaaaaaa aaaaaattag ccaggtgcgg tggtgcgtgc ctgtagtccc 105301 agctactcgg gaggctgagg caggagaatc acttgaaccc aggaggcgga gattgcagtg 105361 agccaagatt atgccattgc actccagcct gggtgacaga gcaagacacc atctcaaaac 105421 aaaaacaaaa acaaaaacgt tacgccatgt gtaattaaaa gtgttggtag catttcacaa 105481 ccatctgcct tttgtttgtg catctatcag tatttccagg cctcaaggat gaggcaggga 105541 ctaagaataa tttattcatt ctgtagaatt ctttacaaaa ttcatactct tgctccatga 105601 agagtctgag actgattctt gtcatggctc tgtgaacttg gccaagtcct ctgaactctc 105661 tgaatcttgg cttcctcaac tggaaactga aaagaacaat tctggcccct ccaactttat 105721 taggaacctg gagaggagct aataaaatgc tggacatggc cgggcatggt gactcacgcc 105781 tgtaatccca gcacttgggt aggatgaggc aggcagatca caaggtcagg agttcaagac 105841 cagcctggcc aacatggtga aaccccatct ctactaaaat acacaaatta ataggccagg 105901 tgcggtggct cacgcttgta atcccagcac tttgggaggc cgaggcacgt ggatcacctg 105961 aggtcaggag ttcaagacca gcctgaccaa catggagaaa ccccgtgtct actaaaaaaa 106021 atacaaaaaa ttagccgggt gtagtggcgc atgcctgtaa tcctagctac ttgggaggct 106081 gaggcaggag aatctcttga agccgggagg cggaggttgc aatgagctga gatcgtgcca 106141 ttgcactcca gcctgggcaa caagaacaaa actctgtctc aaaaaataaa taaataaata 106201 ataaaaaata aaaatacaaa aattagccag gggtggtggt gcacacctgt agtcccagct 106261 acttgggagg ctgaggcagg agaattgctt aaagatggga ggtggaggtt gcagtgagcc 106321 aaaatcatgc cattgcactc cagcctggtg acagagcgag actctgtctc aaacaaaaca 106381 aaacaaaaca aacaaacaag aaaactgctg gacacagaag tccacctgag cagcctaaag 106441 taatagatgg aagttaagga ttaatatgat gatctgtgaa tatgtgtctg aagtggaaat 106501 gaaagaaaaa taagaaacag agacaacgta gtcaaaagga gtaacggtgt tccctgattt 106561 gccatgtatt agctgagtga tcctaatctc tctgggtctc agcttcctca tctgtaaagt 106621 ggggattata ataatagctg ctctttccat ttgataagac tattgtatca aatgggtcag 106681 tgagtgtgaa aatgttactt ggaagagacc atctaactgt taattattat taatactgat 106741 agtagttctc ctagaagaag ttccagtgtt cttccagggc ttcaactagt ttggtcacct 106801 ttggaaatga aatctatctt ttgagatcac acggtagtca caggcaggga ctgagcctaa 106861 tgagtaggag cctggctcct ggcatgacac tgctaggatt caaatttcat ctctgccact 106921 aataggctat gtgatcctga gcaagcattt aacccttctt tgacttagct tctccatcca 106981 tagaatgggg atgataatta tgtagtctct tacctgctgg agttgtaaga attaagtgcc 107041 ataatacatg tgtgctatcc agtacacaga agatatttgt aggtatcttt attcttggat 107101 atagggtatt actctcgggt attcttggtg gcaccttggc cccatactgc ctaatttaca 107161 acatttgctg taactttaac ttatcaaata gcatcccaga gtaagcctga tactagaaaa 107221 atatgactga atgaaaaagc atctatatct acagctacat ggctgaggcc cttttggctc 107281 ctactaaaac catttgcttt ctttttgaga cagactctag ctctgtcacc caggctggag 107341 tgcagtggca tgatcttggc tgcaacctcc acctccctga ttgaagcaat tctccccgcc 107401 tcagcctccc gagtagctgg gattacaggt gcccactacc atgcctgcct aatttttgca 107461 tttttaatag agacagggtt tcaccatgtt ggccaggctg gtcttgaatg cctgagctca 107521 ggtgatctgc ccgcctcggc ctcccaaagt gctgggatta ccgccgtgag ccaccatgcc 107581 tggccttctc cattttagag ccagcaagcc aatcatattt caaggcctct ggtccccttt 107641 tgtgaaagaa atctctaggt agggcaagtt ttccaccctc ctccagctcc tgcagttcca 107701 gagcagtttc taacctgctg tcacagagtg aggatttcca tgtccttagg cagtggatga 107761 atgagagggt gagtgagaag ctaagttaga taatggcatc attaggtaat aagttagtgc 107821 acaaagatct acatggaggt gataaggtgg aaggagcagg gggtaacaga tgcataaatc 107881 tgtactaggg attttttttt tttttaaaga tacactttca aaaagccatt ccaggctggg 107941 tgcaggggct catgcctgta atcacagcac ttgggaggcc aaggtgggag gaatgcttga 108001 gcccaggagt tcgagaccag cctggacaat acagtgagac ccccatctca aacaaataac 108061 ataaaaaaaa aaataaagcc attccaggca gaatggagaa tcacagaaac ttatcaagtc 108121 gaataatgaa attcgtaggg ctgcagttct taaacttgag tgtgcattag aatcacttga 108181 agggcttgtt aaaacattgc tgggcccacc cccaaaattt ctgattcaat aggtccagag 108241 tggggcctca ggaattgcat ttctagttag ttcccaggtg acgtggggac cacaatcact 108301 gtcataggga atctgagaag agttgctgta aggctgtctt attagaaatg gtcctcagac 108361 gtaagtgagg cccaagaaag agtccaagga gctacagaga taggagagtc caggggcttc 108421 ccacagcctg gacttcctga gccacaacaa atggggccta ctagtgtgtt ggtggcatga 108481 ttgaacatat aggctgacca ggcactctca cccatgagac tctccttatc cagaccttgg 108541 ctgatatgtg gtggcggggg gagtttgggg gaaagtgttc agcgtttcct ctcaaacgac 108601 agggcggggt ggggtggggt tgggaggaag gagctcgcag gagaactgct tgatacttcc 108661 cgccccaccg cggttaatca ctgtctagag aagccgaggt taagtggctc tgacctatca 108721 cccatcccgc ttagggtcca gctcagagag cgggcgcata gggcctggtg gaacaggctg 108781 ccctctgggg atgggggatt ggggccgctg ctccgggcag agccgcggga tccgagccag 108841 caaggatgct gggactcgcc gcgacttccg cctctggcga tcggcagtgg ggggcgcacc 108901 aagacagcag ttcgggagct gcgctttgtc accgaggcgg cgaggggcgg agggcgcaga 108961 aaggggccgc ggagaaagga caaggcggcc gaccagctcc caggatggaa agggaccctc 109021 aaggggctcc aggcagcacc tggggcaagg ggagggactg atactcccgg ccggagccca 109081 acctgtggcg cgtgcttcag tcctgagcca gcggctccct ggcttctggg ctggcatccg 109141 tcctaccctc ctcccgctgc aacacgcccc tgatagccgc tcccccaccc tctgaaactc 109201 acccgcacag actcaccgcc aggcttctgg gctgggatcg cgcagtccct ccacgtcgcg 109261 gcgccctgag gcccctcccg gtgagccacc cggccttcgt gccttgctcc ccggatacca 109321 acctcacctg gctcccgcgc gtcgctcacc gcccgcgggg acagcggccg catccgccca 109381 acccacttgg gaagggggat ccctatctgg gtctttcaat catacccaag gcatcgcagt 109441 tttctgtctg ggtgttaggg aaagcagtcc atggtcctgg ggcacggcgt taatgtcaga 109501 ggagaatctg gcccttggag tctggacaac ttgcgttaaa atcctggctt ggcctcttcc 109561 ccactgtgta actctgagca aattcctcaa cctctcgggg agtccgtttt cccgctgcac 109621 gacgggaggg actgtgcctc gtcggacgct cagaagctca gtaggagggg agtggggact 109681 agtctgggcc aggaggggtc tctaggcaag tcagtcccca gaactccgca tagagagagt 109741 tttccatttc acgggaggga gaggctaagc cagcccaacc agctggactt ccttgcctgg 109801 ggtggggaaa gggggcgtcg gggcagttct ctcatcccac acctggtaaa ccgactcctc 109861 tactccttat taggcgccac cgagtcttcc gtcttccggg cccaggagag gggaaggggc 109921 tttggcctcc tcagcttggg aactgagaga gacacagggt cagagagcac acagagacag 109981 agagacaccc gcaccctgga ggaagaattt tgggaccatc gagttgtcct ctactactcc 110041 acagcccgaa atgatgcctc cacccagatt tgtggtcctg cattgccctt gttgggtggc 110101 cctgacaggc gctgcagtgg gcaggttccc tcttacagtg caggggtgga agagggtgga 110161 gatggtgccc catggcaggg gactgggagt agggggagca aggtggtcag gtttgtctgg 110221 cccttggata gcccagtgct tggaaaggat ttgtgcctgg acaaaatcta ttaaaaacaa 110281 acaaaagatc ctgaaaacat atcctcccct gaccccactc caatctcttc tcaataatgt 110341 aaatgaagca atgttgtaat acagtgataa taacaggaca gttccctgga ctctgggttt 110401 cctttatccc aaaggaaggt cagcccaagc cgagtcctgt cttttacagg agaaacccca 110461 gcggggtgat tctctccctg atgcaggtcc agagcctaga ggtttggatg gaggcacatg 110521 cagaacataa aggatgtccc cagccaggtt gcaggtgtga ccaggcctgg ccagactctt 110581 gggttttgcc ccttggacca atcaatcctg gaggtcaatt tgaagggggt agcagccatg 110641 taagggaaaa taccctggta aagggtactc tatgaaatgg ctcagggaat tgtgcctgga 110701 gtatggtggc tccaagggtg tgggggatct gggaactgaa ggccaacttc tctgcctctg 110761 cattttcgtg ggaagagggc tgatggctat gtgaaggggc catacctaag aggttaaaac 110821 tgcctgggtg accttggtga gtcctgctcc tttccagaaa tgttctcagg gcctcagtgg 110881 attgcaatgg tcagagaggg actgggcttt gtctcttagg gtctccagga tcagcagctg 110941 cactgatatt tctgacattc tggcctcagg ccctgagacc cttcccacat gcttcggcag 111001 agcagttctc ctaccctaca ttcctgaggg tggctctccc aaccccctcc atgagttaat 111061 ctagaaatga ggtagagaag aggccttggt cacaaaaaat cacaagataa atttttggaa 111121 gagaagtctc agaaggcaat gcttagcacc ttcctaattc tcaaattgtg aaagtggcct 111181 ttgagctgga gggaggggtt gaatcctgat gctcatttac acatcagctt tctgtctgtg 111241 ctttgggttt ctcagccaca aaatagagat gattcatgct gtcttcctca gagcctgttg 111301 tgaggctgga gagctaatac atgtgaaacg cttagagcag tgactgctct tatcatgtgg 111361 gaactccaga cttctaagag ggatttctct ttgccctcct ggaaaagtat gtgtggtgtg 111421 ggtgagggat gggggtggca gggtgaggac tagtccatcc agctgtgtgg gccctgagcc 111481 ttccagccat gcaggtgctt ctaagggcag gagggacact ctctgagagg cccacagacc 111541 tgcctaaacc ccagtaaagg ctgaggcttt gtggagggtt gccttatgga gagttaaagc 111601 attgctgggg gttgaggcat ccctgaaggc cagggaggga catggccttg gaagggagac 111661 tgagcttggt ttgacactca gccctgacct agctatctga gtcaagacca ctagtcattc 111721 tacaactgta ggaaaaaatg cactgtggac tcttgaggcc ttttaaagga ctgttaaatt 111781 aaatgagatt gttagattaa atgagaaatg ggtgttgatg ttctttgtaa taaaagcaaa 111841 cttcaggaca ataatgaaga agaagaatcc atagcatctc tgctagggtt gaggagggag 111901 atgtctgagc ctttgggtgc ttgggtgttg gagaattgct cttggacaca aactgcactg 111961 ggcccttcca gtcagtggct gaggtggcag caaaccttga gcttggggag gtcttcagct 112021 gccagtctca ccctgggtcc tccagcccta ggatgggcca gtggtgggaa tggggatgtg 112081 gaccagacat ccttagcaca gggccaggaa gctaccaggc tggccttaag gagcccaggc 112141 agacaggtgt acaggcagga gtgcagcagg gtcagctgcc tcccaggacc tggccctcca 112201 ttccaggagg gtggagagga gggggcactc ttcaaaatca gagttccaga agccgcagtt 112261 tccccctgtc tgaaggagaa agtgttccac ggtgcctgcc acagttacac agctgccacc 112321 gactttctca gcagattcca ctgcagtgca aggaggtgac aggagtggtc aaccctcatc 112381 tacctgcaag tgtgagatcc actcgacctt attccccagg ggtgcaccca gcagtggcac 112441 taggttccca aaagggcatc agcagctgcc aaggcagaag ggggaagcgg gtcccagaac 112501 cacccacctc cggctgtccc caccgcgagg acccagcagt ctggcgcccc caccacggcc 112561 tggaagatga cggagggccc aagactaata ttcacgacag ccagaccacg cttattgttt 112621 agaaggaagc tccctttgtt cttacttttt aaccaaagag aagcgaaaac atttttttcc 112681 tgatcacatt ttcaccgaca cctgagccga caagccagct cctggccccc ggctcaggac 112741 tcctcgctct ctcccttctc ggggccctgt cgccgttgaa aggcccgctg caggctgggg 112801 agggtgatcg gggccgcggg ccatctcccc cgagccgggc gggcagactg cggaggcagg 112861 ccccacacgc gccgcttttc cgagcccggt tttcttcagg agcgaagctg ttccagctga 112921 cccgcgcgtc tgggggccta tgcccggctt ccgattccat ttaaaacgac ccgcgcatct 112981 tatctccgtc gcctccccgg ggttcccacc cacccccctc cggcccgggc caggccagcc 113041 cagccccggc ggaagccaag ctgggagctt ttgaagtccg gagaatttca atccgagagg 113101 agccggctgg accggagccc gtcgccccag cgggggaagg gacggggggc ctgccgtgtg 113161 gcaggtgggg gatgggtgtc ccccgccgcg agaaatgaga agccgccggg cctggagcgg 113221 cctccacctc agctgctatc accccctctc cgctgtcatg ggattgccca ggcttaatgg 113281 ggttgtgcag ggcttcactt gctctcggaa tttaccttcc ttctcgagcc gtgccgcgaa 113341 taaccttcac gacacagtga taacgatgtt attaaatact ccccccacat tcaccgccca 113401 aaaaaccaaa acagtgttct tatccagtcc tctctttttt tgtcttcttt cctttaaaaa 113461 cccaaccgct cttaatgtga ggttgatgaa aggatgcttt tggaagaagt gacatttggt 113521 taaaacgttt tccccctaat gcgccggtgg aaaggggcgg gggtgggtgt ggttccctag 113581 gctcctaaga ctggccagtc agctttgaaa gagcggggca gaagtcggga gagggctggg 113641 gaaggctttg ggctgagggg atgtgtccct gaagcttcag atctggagcc caagggaggc 113701 aacctccctc tgaaaagccg gtcctcagct ccatccagcc ctctccgcgg gatgtaagcc 113761 ctgctccgta gggcccgacc caggaagcag actaagtggc actccttgat ctctcactcc 113821 tactcggctt tcatctcacc cagaagaaaa gcggatcaaa cagaggattc ccattgaatg 113881 cagagctggg ggcggggtgg ggagccagga gtccctctga ggacccagcc cccaggagat 113941 gggggcagca ggctactcac gagaaggtta agcgctcaag gaggaaagga gggtgggcac 114001 ctcaaaggcc cagtgctgga gggccctata gagagtaggc aaccttggaa gctctaagac 114061 agatgcccag ggggcttgcc attgacagga aagagagcaa gccaggtgct ccctcaggct 114121 gtggctgctg gtagggagga cagagaaaga tagaatacct ggacagctct ccaaggcagg 114181 aaaaggttct aggatttgac tcccctcact cccaattgat cctctcacct ctaaaacatt 114241 tccctggatt ttggggctag cagtgctcag catatagtgg cttcaggagt cccaaggagt 114301 agaaatatct atgtggacac taccttccgg atcccgctgc ctccgccccc aggcaagatt 114361 aatgaacatc gtcttagact gaaagagaag tggaaagccg gagtgaagag gaaactttgt 114421 ctttctttgg accccttact tcctctacct tccttaaccc tatggatagg aagatggagg 114481 agagaagcaa agaaggagaa gaatacagaa agagaaaaag gaggagagaa aagtaggaga 114541 taacagataa aaaggtcacg gaagaggact tacctgtaaa agctggggct ttgctaggga 114601 cacagaaagg agggaggtgc tgggctgccc tttgcagctg gctgcctagg gctagaagct 114661 cttcgggagg aggcggagaa aggggggttg agcatagaag gtgcccacct cctgctctca 114721 gcaatgtggg gatagactca gcccacccac ctggccccag cacacttgct tggagctcaa 114781 aaagggtcag tttgaaggag ttttatacca gcttcttccc tgggagagga gccaagagag 114841 gcctatggca gcttccagat attaagttcc acaacccctg cccccacctc cagcccttca 114901 tcagagtccc tacttacttg ctgttttagg gcattttcag cgacttcact ctcttctacc 114961 tcaaacgcca tccacttttg gagacaggta ccactgcccc taggcaggga cttgggagag 115021 gaccttatga gtcaaacctc tatgaacccc aacctttttg tactcgggga ggctgaaccc 115081 ctgcccaaaa tagcgcggtg aaagctactg ccttctccca agtaggggcc tccagtactg 115141 ccacagcagg ggccgcattc ctggcgcctc ttcattcgaa aaacctcttt ccaggagact 115201 tcgctgattc tgaacgaata ctttaaaata tgggcaaggg aaaaaaaaag acaactctta 115261 gaaactctcc ccacccaaca tacctggagt gcaggcgagg tccaggcaaa cgggaaagcg 115321 acttcttgta cgaatagcca tctcaagatg aagccaggct gtggaaaact taaggccatc 115381 tagatgggaa ggaggtattt aaaagaaatc agcacacact cttacagaag ccaaggcggg 115441 tctttaaatt aaaaaataat aataaggtat ttatttcttg ccccttaggg aagaaaaata 115501 gcaaagtttg gttcgcagga gtaaggagac agaggtgggt agggagaggg aggagaaaag 115561 aggaaaagat ggcccagcct caagctctcc agtcaacttg gggcggggga atacattttt 115621 ccgtagcttg taaaaagact tttcagccag tttgaggtta gctgccaggg gcagacagtt 115681 gtaataatgc cgacagaatt gaggcttgta agccgggctt cacacttcta aaatgcagtt 115741 atggattctt tcatcggtgt tagatcaagg caccaagtta agaaaacaaa agtcaaaaca 115801 acaataaaag aaaaaatcaa aagcttattt aattaccgag tcccgctgcc cctgcggaac 115861 tcgccgcgcg cgttaagctc tcttttcttt ctgcggaatt ccatttgcat cccctctgcc 115921 tgggtgtctc cctctctcag tgtgtgtgtc tctctgtctg ttttcacact ctcctcccca 115981 atcgagcgag gcccacacct ggcgcatcac tgccgagcca ttagctgcgg gtttcctttc 116041 atcttcgctg tggcagacgt ttctatttat ccacttgcgc tcgccgagtg gcgtcaccag 116101 cggtactgta atgacgattg cagcaggagg atgacagctt agaaagaaga gggcaatggg 116161 gcttcctccc agaggcggtg cggcacagag gagcgctcgc ttcacaaggt gaccctagct 116221 cccaccgcca ccgccgcggt cgcggtccag accgcgctcc agcagctccg cgccctccca 116281 ggcacccggc ctttctttct ccctcttgca accaagatcc gtccggccgc tggagaccca 116341 gggagccggg gttaggaact cacttggggc tttcccctcc cccaccggag agccccggga 116401 tggagagccg aaaggacatg gttgtgtttc tggatggggg tcagcttggc actctggttg 116461 gcaagagagt ctcaaatttg tccgaagccg tgggcagccc gctgccggag ccgcccgaga 116521 aaatggtgcc ccgtggttgc ctgagccctc gggccgtccc tccggccacc cgggagcgcg 116581 gcgggggagg cccggaggag gagccggtag atggactcgc aggcagcgcg gcggggccgg 116641 gcgccgagcc ccaggtagct ggggcggcca tgctcggccc aggacccccg gccccctcag 116701 tcgacagcct ctccggacag gggcaaccca gtagctcgga caccgagtcg gatttctatg 116761 aagaaatcga ggtgagctgc accccggact gcgccaccgg gaacgccgag taccagcaca 116821 gcaaaggtag ccaccgtgcc cctccgctcc ccgggcctcc cactgcgccc acccttcact 116881 tcggcgcagg ccaggaggaa gacactccct tcccctaggg caggatggct ggggggaccc 116941 acctgagcaa ctctctctgc tatctgcgtt ctggcggggg tctcctactg tgttctggca 117001 ttggcgggac tgagggtgac agcagtgcct tgagtgcggg gtgctgaggg ggcggatgca 117061 agtcctggac ttgggggatt cgaagctcac cccaagcacc cagtgtttca actgctcggg 117121 gaatgcttca attgctcggg gaagacactt tccccaggcg agggcaagat caaacgccga 117181 tccgggcagt ttgtggctgg cagggtgtaa gaggcatgga ggcgcggaag ccaggagtcc 117241 ataaaggacc gtaaaattgc ggcccacttg ggcagcccgg gtgctgcagc cctccgacca 117301 gtttgcacgt cggtcagagg tccaaattac cttgtcactt cccgggcttc gcggcgccag 117361 gtcggaaatg gtcccaatgg tctaattgcc tttggtctcc ggttgcattt gaaaaggcag 117421 agatcgggtc ctcccccctt cccctttcct tcctagtccc acttctccac ccaaaggaaa 117481 aggagctgca gggggctgga gccccaccct tctcagaggt aggcccaaag gggggctggt 117541 ttaactggag aacccctccc caccaaaggc taatgggaaa ggggtggata gcccggaagg 117601 gagtttccct ctgtgccaac aatcacctcc ccagaagggg gtagaaaact gggcgcgggt 117661 tggtgggggg gaggagaggg gagcccacca gcagacactc ctccacagaa ctgtaggagt 117721 gggtggaaag agcctggggg cgggggggag aaagaccacc ccctggtctt ggcagccaac 117781 gccttgttga atacctgcac ctacccctta ctatcttatc accgatttca cccagcctcc 117841 ttcccataac cctcagaaca acctggactc cactcacata tactaaggta ataaataaga 117901 tactcaacat gcattctcac tccagccctg caatacacgg aacctagtac acattcccac 117961 acctgttctg gcagcccgtc acacataaga gcaaggaaac tctctgaatg cccagcatac 118021 tacaatgcac ttgacccaga gtttacaacc ctctagcaca aaggtgcatc tcaactcatg 118081 tgcctgtcag aagtgcacgc cctgccaacg ggaggcagaa atctcaccta tgctccaggg 118141 caggtgggaa gggcggctgg gaacccctgt acccaggatg ccttagagga agggaaggcc 118201 tccccaaaga cctctaccta cccaatcaag ggcaggccct tattttccct tcttgggttc 118261 cccagaggcc gcagtaccct agcagaaaca gttactgagg tggctgacag ggtgtccctt 118321 cccaaatcac cctcccacct taggcctaca gccccacttc aatggcgttt gtgtgtctgt 118381 gtctgtacac gcctgtgctc tggactcgct gtgcagggtc cggctccgag gcgctggtcg 118441 gcagtccgaa cggagggagc gagaccccca agagcaacgg cggcagtggt gggggcggct 118501 cgcaaggcac cctggcgtgc agcgccagtg accagatgcg tcgttaccgc accgccttca 118561 cccgagagca gattgcgcgg ctggagaagg aattctaccg ggagaactac gtatccaggc 118621 cgcggagatg tgagctggcg gccgccctaa acctgccgga aaccaccatc aaggtatgcg 118681 gggtccaggc tggggaggcg ggtgtgcacc tatttagcgg gaagtaaatg ccaactgcca 118741 actccctgaa acgcaggcca ggaatctggg cctggggtct ccctgcccgg cgcgtgcaga 118801 ttgaccctcg tgacagctcc taggcaggca ttgctgccat gtggctgact ctgtcccttt 118861 cctggttaaa tagacaaggg gtgggcgtgg gggaagggga tagagtgcct gtgcgggtga 118921 caaggaagtt tctggggaca cgctctctgc ggccgcagac caattgagtc catgtccttt 118981 cactgctcct cccatacaca cactggcctc tggcaccccg ggggcctggc cacctgggca 119041 gaaggaagga gggagcgggc tggagtcact ccccaaacct ctctcggagg gattccagct 119101 ccaggtggtg gtggtggggt cggtctctac ccggctggtg tctgctttgg ctgttcctgc 119161 cctgcgaaca ctgtccccgg agcgggacca gactactggc ctctgagcat cgggccaagt 119221 ccagctactg aacctgctcc gctcctctcc ccaggtgtgg ttccagaacc ggcgcatgaa 119281 ggacaagcgg cagcgcctgg ccatgacgtg gccgcacccg gcggaccccg ccttctacac 119341 ttacatgatg agccatgcgg cggccgcggg cggcctgccc taccccttcc catcgcacct 119401 gcccctgccc tactactcgc cggtgggcct gggcgccgca tccgccgcct ccgccgccgc 119461 ctcgcccttc agcggctcgc tgcgcccgct cgacacgttc cgcgtgctgt cgcagcccta 119521 cccgcggccc gaactgctgt gcgccttccg ccacccgccg ctctaccccg ggcccgcgca 119581 cggactgggc gcctctgccg gcggcccctg ctcctgcctc gcctgtcaca gcggcccggc 119641 caacgggctg gcgccccggg ctgccgccgc ctcggacttc acctgtgcct ccacctcccg 119701 ctcggactcc ttcctcacct tcgcgccctc ggtgctcagc aaggcctcct ccgtcgcgct 119761 ggaccagagg gaggaggtgc ccctcactag ataaggggcc gccggctggc tgccggctcc 119821 atgacgcccg tggggtcacc ccccggcccc gggactcagc cagcctcgct cctcgctcct 119881 cgctcctcgc ccctaggacg ccaaggggga aaggagaggg cggaaaagga ccagcgggat 119941 ccggccgcaa gaattggaaa gcctaggaag tggcggtggc tggcgcgttt ggggagcagg 120001 agtggggata gggaagcaga gcttgagaga ccttcctccg gggcagcctc cggacccacc 120061 gccccccacc agggtcgagg ctgtagctcc aaagctaaac aaaacttagc agcaacagca 120121 accaatatcc agtccctcgg cccctcggcc cctcaccctc cacctcacac tcccttctca 120181 ccgggccccc tctccccagc caaggcccaa gcactggaaa gggaaattgc tgtctctctg 120241 aacaaaatgc tgtgtatgca gagcaggtag agattaatct ttgccagctt ttccaaggca 120301 tgacaagggg ctggtggatg gcaacatacc agtcatttgg aggagagagt gagagatgat 120361 ttactaccag ggagaatcca gccccttggc atgggacctg gagcctcgac tacacagcat 120421 cttctgggtc tggcgtctgc cagcacctga tctctttcct cattcccagc tttgtgacac 120481 ttctcaactt gcggctccat ctctccctgc ccccactttt ttgttggcca gggaggctgc 120541 agatgcccca ggagcccttt gccgcttcta tgaggccaag ccttttttcc ctgggcccag 120601 cacacaccct gattagcaag tgatgtgtgc gaggagggtt tgtgaatgtt gaatgtgtaa 120661 taatgatcac catggagctg gccactgacc ccagagctga gctgttaaca aggcgcccag 120721 ggaagagctt agggagtggg aacttcacct ccctctctcg gtatctggcg gtaaattaga 120781 ggcaattttc atcctttgct tgttcacctt cacttcacca ggaactttct ggccctaccc 120841 tttgcattgg gtattttaca actttctctc attttcttcc caagctacca ctggagcttg 120901 actttcagat accagtggga gccttctgtc ccttttgggg accctgtctg tggcctccac 120961 cagggtttgt ttagagccac tcccaaatcc tcactcccac actcatcctt gcagccagtt 121021 tttgaggaag aggagaacgt gtaaccccaa tgcaagcttc accctgactg agaggtagtg 121081 gttcttcctg tagggaatga atttggtttg atttggggtt ttcctttgaa gcccaaagaa 121141 cttgctgtta tgattcgtta accatattgc aataaaagct ggacataatt ctcactttac 121201 attttagttt ttgtggcagc aggcagctct taactgaatt ttaaagttgc tatttcagcc 121261 accatgggtt cgaataggga aggaaccaaa agatggaccc acagagtctt ccaagccact 121321 gtttgtcact ggtaggctcc aagcagccat tgcatggagg tggagaagac aggctccaag 121381 ctgccctggg gtgcatgttt ccttaatttt gggtatgttg ggaacagttc cttggtaaag 121441 tggcagcaga gtgctgccag gggcccaccc ctgtggtttc ttctccagtt gttccaatcc 121501 tttgagcaac agaaagggga cccaggttgt ggtctctgag caacctgcct ccaccccagc 121561 ccaaagggaa ggtgagtggg gcagtggcag tagagcctga gtactgcaga gacctgagtt 121621 ttctggtcaa actgggtctg cacttgaggg tggaggacag cagggaggaa ggctccttgg 121681 ttgggatcat cttggtctga attgtcctca tactggaagt catcttgagg gtgggattct 121741 gaggggttag gacaagagga aaggagtgct cagcagctac cttccagccc gcacctgcat 121801 cccccacagg agagccaaat gtggctggtt gtggagttcc acttaagcag gggagagatg 121861 ggtgccttct gggagtccag gaggctctgg attgcataga cacgaggagt cctctgccac 121921 tgtctttgcc agcacataga gactcctcaa atgccctcct ccctccttgg gttccacagc 121981 aaatcctcct gcacacagta gcaggtgccc tcccaagcaa attgtgccct gaggtttcca 122041 gcaccccctc tcctgcagcc aaaagctcct gagtgccaaa tcctggtaaa ttttgcagtt 122101 gctccatatt gggcagagat tgcttgcgac aggtaaatgg gctgcatgtt gccttaatgc 122161 caggcgtatt ttcagttaat agggagggaa aatgaggtgt ctgcagagat attggaaagt 122221 acaagatcga tgatccagct tcgtgtccaa tcagcagcct ttaattgact gtattttctg 122281 ggtaattgag cctctcttcc tccaaattga agtggaagat ataacaatac atcttgccag 122341 gaggaattac tcattacttc ataatgaaat ttcccttttg ccaaggtttg cggtttggcg 122401 ctaggcacga cttctcactc cccctctccc cccaaatatt cttagcccac ttgcatcagg 122461 ctaccccatc tggcatgaca cgttccattt gggtggtggc agggaggaag tgggagttga 122521 cttgaagaag atactgcccc ttcacaatca cagcctcctt atctgtaaaa tgagaaggct 122581 agaccagggc ctagcatgat gttgttttca tttttcaact ctctcagtgt ctctacttag 122641 ctttgcctca tctcggttcc attttgcagg agggaaaacg gaggctccag gtctacacag 122701 ctattagtgg ccgatgtagc tatgagctcc gggctcccac ttcctcttct cttaccctgt 122761 tctggggcac agattctgag gtaaccccca gtgaagcctg acacctcact cccgactcac 122821 gctttcccag ggctcttcaa tcccggccag ccggcagggc ctgggagcgc ggtccagcgc 122881 ccagcccaga ttcgcccagt tgggcaaggc tggccacccc gcagcccgag tgtgcaccgg 122941 cccccggcaa ttgctggctt gttagcgtgt gattgattgc cttattaaag cgtgttcttg 123001 taagtgtgac caaactgatt gcattgcata tgtttgggat aatgctcatt ttaaacaaca 123061 ggataaagag gatgagctcc gcagagcctt gaagacaaaa tgattctggc tagcagctcc 123121 ctaactatgg cattaaattg ctcagttaca tagcaaatcg ctacttggtt tctgcatcct 123181 tgtccgtatt tttttttttt ctgtcctaca gtttgatcct gtctcccaat gtggcatgcc 123241 ccacgaacgg tttctttagt gattatgttg actccacctt cctagtttca aaccctgcac 123301 ccgcactctc cagacaggaa gtccctctcc ctcgaccagg gattgaggga gaagagattc 123361 agaatgcatc tccttgccct aggcctttcg ggggggacat ggccttagac agtcaccccg 123421 aaattgggct caggtcctca cagtaagtcg agtgtccccc cacatcccca accccggccc 123481 tttcactaaa gtttaaaaga ttcttttgtt cctcaggctt ctcccgcttg gttgcttcca 123541 ttattcccca actgcggaaa caccagccac catatgtccg ccggagaata agcgtgagtg 123601 tcgcggtgga cagttgccgc caaaggtgtc cagacggatg gcattctcaa agtagggatc 123661 ttagcatcct ctccatacat catcacccaa gaatggtaga ggaaagagta ggagtggggt 123721 tacagatagg agagaaagga ggttgaggcc acaaaattgg catttctcta attatcaacc 123781 ccctcctcca ccatcctgcc accatttgtt atggcaaagt tctctttctg tgccccttcc 123841 ctaataaaga aatcctcagt cagcctccac ctactccttt tgatttgcag gaagccaaac 123901 tgccagcaga ttgagctttc taagggatac ctttgggata tgttggaatt ctgatttgtt 123961 caagatgatg acggtcatca gtacaaatca gaaacaatgc cactcttcag ggtgtcattt 124021 gtaagtgcaa taacacggat tcatttgcaa aacccagcta aactgcgtct ggataagtga 124081 ctagatccaa acgacaaatt tcttccatca ctcagctcct cccaagctct gcgggggcag 124141 cacagaagac tgctttaggg tgaaactaag gctcagctca ccatttgtgc cagcttgagg 124201 gaccctggaa atgttgactc tgagaaaaag acaaaacaaa acaaaacaaa acaaaaccca 124261 atcacaaaac atatcctact gtctacaaag caaacagaaa cacaaagttg tttccaaaat 124321 caccttttca gcaaataagt atgtgatatt ctgagcagga gcaggaggga gagaaagact 124381 gccacttaaa aaaacatgaa agcattaagt aataatttaa aacacggctg tttagaaaaa 124441 tatttatatt tattgcagct gctcaattgg cctcgttaaa gtcggcaagc atttaaattg 124501 tgttacaatc tcatttaaat cccgttccgt tccagcaggc tgaagagctt gaatagacca 124561 atcacttcat aatgatgatg aatgagaaat taattcagat acaagcaaga caatttaggc 124621 cttcatctgt ttaaatagcc ttccaatatt attcgccatc gcgggtcaca gattgaatca 124681 attgtccaat cttgtgaaag tcatatcctt gcgtcttcat cgagattatt tatagcgcag 124741 tgagcgctgc tgaaaggtat acgctgttaa cagggacaat tacttaaagg gaagcgtttg 124801 taaaatggaa aacaaatgcc ggggttggta acaaatggga tcaaaagcag ccattccaat 124861 ctggctcccc gagggaaagg agcgggcgtg gccctgtagg attaggggcg cttctgacct 124921 ccccaagacc caggtgaggc cgagggtccc cggcgcgcca gcctgcgccg tggctgtggc 124981 tgcggtggcg actcgggccg ggcctcgctc tcgccggctt caggttcccg cctccctcgc 125041 gcaggcagcg ggcgcgtgtg gcccgggctg ggcaagccga ggaacagcga gcccccggac 125101 gctgactgca ggacgtccca gtttgtgccc gggtctccgt ccctccccgt acggggctcg 125161 tacccccggg cctgggtctg acccacaggg cgctgaggcc tttgtagctg aagtcggaag 125221 gcctcgttgc gagcgcggca caggttgctg gtagcttctg gactctggag gcttggcctt 125281 ccttctaagc cgatggcggg gaaagaacct cgtttccaca gcttccccga cccccgccgc 125341 ttgccatttg gggacgggaa gcgcgcccgg gtcgcttcac gtccctctgg gccggagccc 125401 tttccatggc tggctcctct gggggccctt gggcctgtga gcagcgtcta cttccctcag 125461 agaagaatcc tttccttccc ccatcgaagt gtccctttct gtatcctgaa ataacccctc 125521 ctgggtgagg ccagttcccc tctgtcgccc tcctcccgca ggcgtccggg agcctcgtga 125581 ggaccccgtg cagttgagtc caggcgacag gtgcctcccc aggtgtctac ttgccctccc 125641 tcaacctgtt tagggaaaga ccagaacgat gtgcgcgggg aggtggtttt gtttctcccg 125701 aaactggcgc tcttgggtaa aaccgcgtgc cgcgtctacg cgggctctgt tagtgtgcgt 125761 gtcccagaag agtgggcctt ctaggcggca ctctttggag aagtaagtga ggtgaactca 125821 gaacagagaa gcgggaacga tcgtttgatg tgccgagccg gaaaaagaga ggagagggag 125881 aggctcccag cgagcgcgga gacagaggag cccacgcggc acagcacgag ccctactctt 125941 gcccgcgtag aggtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 126001 tttcctctgt atttctgtgt ttgaaacaaa agcccaacta acggattcct ggtgctctga 126061 ggactcactc ccgggagcga ctttgatgta ttgctgtcca attagtgcct ttaattggtg 126121 tcctcggaga caggcaggcc cggagctggg acagagcctc ctctcgcctc tcctgcacac 126181 tggaaaggta aaatttataa cacactgtgg ccaccccttg ggagagcgaa agagagtggg 126241 aggcagggaa gaaagggaga gatgcagctg tttgcagagg acggagcctc cagccccggg 126301 ggtcgtgggg aagggcccag accaccgggt tcgactcagt tttctgtgta agagggttcg 126361 ggccggctga gtcgctgagg attatgaatt tgctcgcaga aaggcctagg gagctctcat 126421 tgtctgtaca aacccacccc atgtggtcta ccatctcgtt aataatggct gtttgttaac 126481 aaggatgcgg aggttaaata atccgggtgg aagattaaac acgtctcatc aaggcgcttg 126541 gaagccgcgc ggtgcagaga cctgcgctgg agccgcctgg ctcactggag cctggaggga 126601 gatgggagca cagtctgcaa ggagccccgg ggtagaagag gaggcaaggt ctagtctccc 126661 tcttcagggt gtcctgctat gccaggacag gtctaactct tccacatgtc aaacagaaaa 126721 tccacataga agtcagcccc aatgaggtgt ggaggtcaaa ccctcggagt agttggctgg 126781 agggcagaca tggagcatct ggggtctgaa ccaaaggccc ccttttgagg aggggtggcc 126841 atcccctgcc tcccagtcct tacatgtcat gcatctggca atttgtgcct acatgggatt 126901 agcattggat tagaagacag aagatgtggt ttctagtctg ggttctgccc ctaaatcact 126961 gtagaatttt tattacatta ttctcccagc ttcatttttc tactgtgtaa gagaaaggag 127021 atggggcttg atgtagatga tgtacagcat ttcttctttt gtccagcttc ccaccagagt 127081 tagggaactg aacaccttat aagatgctct gtctccccta aaccttgcag gatgcagtca 127141 aaaagaattt tatttagcaa atatctgagg gcctactctg caccagtcat ggtgcagtat 127201 gtatttttta actgagagca gaatatcagt tcatattccc ctgttattct caaatataat 127261 gttttataac aaaagatgaa agagaaagaa cttcaaagag aatagaatag gatgggtatg 127321 tatgtacagg gtagagaaac tatggaggta aaggtaaagg cattgacaaa agctgtgaag 127381 aacttgttga actggcaagg aagtcagtgg ataaatactg cagaagtagc tccttaacag 127441 agatactatg taggagagta gagagaaggt cccctcctga atcctggtga tttggcagag 127501 aacaaaaggg taagggttgc agaatttgac taggaggccc aagttgtatg catctaatat 127561 tgccaatttt tggtttatat attctttcag tagttggaat ttctctctct ctctctctct 127621 ctctttttct ttttgagatg gggtcttgct ctgttgctca ggctggagtg cagtggcccc 127681 atcaaggctc actgcagcct taaactccca ggctcaagtg atcctctcat ctcagtctcc 127741 cgagtagcaa gggccacagg catatgccac catgcctgga taattttttt ttttggttga 127801 ggggtacaga tggggtcttg ctgtgttacc caggctggtc tcaagcttct aggctcaatg 127861 aaccctccca ccctggtgtc ccaaagtgtt gagattacag gaatgagcca ctgtgcccgg 127921 cctaggaagt ttactcttta gctaattttg aaacattact ttctgaatcc accaatcagg 127981 aaactcggct gctaggctgg gactcttcaa gcttctgggt gatgggcatt tagcctcaag 128041 gaaggcaatg atgcttaaat cactaaacca tgtctcagtt tctgtgagtc aggaatctag 128101 tgagcttagc tgtgtgtccc tggctccagg ctattccact ggggttgtgg tcttatctaa 128161 agactcagct gggtgaggat atgttccggg ctcactcatg tggttgtttg gcaggattca 128221 gttccttgct ggtgggctat ttggctgaag gcctcagttc cttcttggtg gttggccaga 128281 ggtttctctg ttccttacca tatgagcctc tctatagggt agcttataaa gtggcagctg 128341 tctttcctca gactgagtaa gcaagagtga gagtggaggg gggagaaagc acccaagaca 128401 gaagctacac tcttttggtc atctaatctt ggaagtgact ttccatcact atgccatatt 128461 atatttgtta gaagtgagtc actagatcca gcccacacta aaggaaaggg gattacagaa 128521 ggcatcaaat ccaggaggca gggatcactg gggactgtct tagaggctgc tgcccagacc 128581 ccttttatat aacagctctg aggcttatat ggttttatat ttatgaagag cttaggacag 128641 cacctgtttg ctctgatcat cagcatgagt gatccttctg atccaatgga gcacccattt 128701 cttcatctag accctcctat gtcaacagtt ctaatttttt cctgcaaagc cctgtaaaag 128761 agtgtaggaa aggcaagaca gtaactgctg caaggaaaga ggctgaggaa acagaataga 128821 aaggggatag gatgggagat cctgaaaaac tctgggggag ggtgcctaga gacaaaaggg 128881 aaagtaggag acccttgtct cctatggtgg gctttacact tttttttttt tttgagacaa 128941 tgacttgctc tgtcacccag aacccaggct ggagtatagt ggtgtgatct ccaatcactg 129001 caaccttcac cttcagagct caagcaatcc tcctgcctca gcctcctaag tagctgggac 129061 tacaggcatg tgccaccaca ctcaactagt tcatatatat atatatatgt acgtatatat 129121 atatatatgt atatatatat atatatatat atagagagag agagagagag agagagagta 129181 tgtatatatg tatattggta gagaccaggt tttgccatgt tgcccaggct ggtcttgaac 129241 tcctaggctc aagtgattcg cccacttcag cctcccaaag tgctgggact acaagcataa 129301 gccactgtgc ctggccagct ttacactgtt tagagtattt ccacatacat gatc // LOCUS AC004084 93273 bp DNA PRI 29-JAN-1998 DEFINITION Homo sapiens BAC clone RG158O17 from 7q22-q31.1, complete sequence. ACCESSION AC004084 NID g2822156 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 93273) AUTHORS Bradshaw,H., Tin-Wollam,A. and Hawkins,M. TITLE The sequence of Homo sapiens BAC clone RG158O17 JOURNAL Unpublished (1998) REFERENCE 2 (bases 1 to 93273) AUTHORS Waterston,R. TITLE Direct Submission JOURNAL Submitted (29-JAN-1998) Department of Genetics, Washington University, 4444 Forest Park Avenue, St. Louis, Missouri 63108, USA COMMENT SUBMITTED BY: Genome Sequencing Center Department of Genetics Washington University St. Louis MO 63108, USA http://genome.wustl.edu/gsc mailto:sapiens@watson.wustl.edu NOTICE: This sequence may not represent the entire insert of this clone. It may be shorter because we only sequence overlapping clone sections once, or longer because we provide a small overlap between neighboring data submissions. This sequence was finished as follows unless otherwise noted: all regions were double stranded or sequenced with an alternate chemistry; an attempt was made to resolve all sequencing problems, such as compressions and repeats; all regions were covered by sequence from more than one subclone; and the assembly was confirmed by restriction digest. MAPPING INFORMATION: The sequence of this clone was established as part of a mapping and sequencing collaboration between the NHGRI Chromosome 7 Mapping Project (Eric D. Green, Director), John D. McPherson in the Department of Genetics (Washington University), and the Washington University Genome Sequencing Center. For additional information about the map position of this sequence, see http://www.nhgri.nih.gov/DIR/GTB/CHR7 or send mailto:egreen@nhgri.nih.gov SOURCE INFORMATION: Clone RG158O17 is from the first release of the human BAC library CITB-978SK-B. The library contains cloned DNA from the male fibroblast cell line 978SK. See: Shizuya et al., Proc. Natl. Acad. Sci. USA 89:8794-7 (1992); U-J. Kim et al., Genomics 34:213-8 (1996). This clone is available from Research Genetics, Inc. (http://www.resgen.com). VECTOR: pBeloBAC11 Selection: chloramphenicol NEIGHBORING SEQUENCE INFORMATION: The actual start of this clone is at base position 1 of RG158O17; actual end is at 93273 of RG158O17. The orientation of this clone is unknown. This clone contains STS sWSS3609 (NID:g1923098) and sWSS1433 (NID:g1408087). FEATURES Location/Qualifiers source 1..93273 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" /clone="RG158O17" /clone_lib="CITB-978SK-B" /map="7q22-q31.1" repeat_region 43..327 /rpt_family="ALU" repeat_region 544..611 /rpt_family="MIR" repeat_region 689..851 /rpt_family="ALU" repeat_region 1063..1233 /rpt_family="ALU" repeat_region 1234..1515 /rpt_family="ALU" repeat_region 2131..2191 /rpt_family="MIR" repeat_region 2203..2359 /rpt_family="ALU" repeat_region 2364..2665 /rpt_family="ALU" repeat_region 2666..2824 /rpt_family="ALU" repeat_region 3385..3533 /rpt_family="ALU" repeat_region 3552..3843 /rpt_family="ALU" repeat_region 3992..4315 /rpt_family="ALU" repeat_region 4682..4981 /rpt_family="ALU" repeat_region 5568..5862 /rpt_family="ALU" misc_feature 5863..5914 /note="match to EST R20885 (NID:g775666) yh18a08.r1" repeat_region 5915..5993 /rpt_family="(TGAG)N" misc_feature 5994..6167 /note="match to EST R20885 (NID:g775666) yh18a08.r1" repeat_region 6711..7016 /rpt_family="ALU" repeat_region 8148..8449 /rpt_family="ALU" repeat_region 8460..8638 /rpt_family="ALU" repeat_region 8666..8842 /rpt_family="ALU" repeat_region 9069..9358 /rpt_family="ALU" repeat_region 10381..10714 /rpt_family="ALU" repeat_region 11019..11077 /rpt_family="L1" repeat_region 11098..11362 /rpt_family="ALU" repeat_region 11378..11610 /rpt_family="L1" repeat_region 11623..11908 /rpt_family="ALU" repeat_region 11910..12105 /rpt_family="L1" repeat_region 12106..12403 /rpt_family="ALU" repeat_region 13080..13163 /rpt_family="L2" repeat_region 13416..13546 /rpt_family="ALU" repeat_region 13565..13749 /rpt_family="ALU" repeat_region 13788..14083 /rpt_family="ALU" repeat_region 14454..14637 /rpt_family="ALU" repeat_region 14647..14933 /rpt_family="ALU" repeat_region 15239..15374 /rpt_family="ALU" repeat_region 15375..15667 /rpt_family="ALU" repeat_region 15809..15978 /rpt_family="MIR" repeat_region 15997..16296 /rpt_family="ALU" repeat_region 16446..16738 /rpt_family="ALU" repeat_region 17568..17867 /rpt_family="ALU" repeat_region 17870..17995 /rpt_family="ALU" repeat_region 18006..18299 /rpt_family="ALU" repeat_region 18300..18597 /rpt_family="L1" repeat_region 18608..18910 /rpt_family="ALU" repeat_region 18914..19222 /rpt_family="ALU" repeat_region 19378..19679 /rpt_family="ALU" repeat_region 19788..20067 /rpt_family="ALU" repeat_region 20068..20150 /rpt_family="(CATA)N" repeat_region 20186..20411 /rpt_family="ALU" gene 20774..54837 /gene="WUGSC:H_RG158O17.1" CDS join(20774..20838,26478..26603,28166..28279,28798..28859, 31628..31757,35206..35287,37223..37354,37657..37751, 41485..41602,41830..41980,42245..42309,43055..43167, 43372..43564,43770..43907,44476..44620,45160..45329, 47321..47458,49845..49980,53270..53390,54540..54604, 54716..54837) /gene="WUGSC:H_RG158O17.1" /note="similar to GTPase-activating proteins; 35% similar to JC5047 (PID:g2136083); H_RG158O17.1" /codon_start=1 /evidence=not_experimental /db_xref="PID:g2822157" /translation="MAKRSSLYIRIVEGKNLPAKDITGSSDPYCIVKVDNEPIIRYRP HPQDRGALSLSSARALPAKGTATVWKTLCPFWGEEYQVHLPPTFHAVAFYVMDEDALS RDDVIGKVCLTRDTIASHPKGFSGWAHLTEVDPDEEVQGEIHLRLEVWPGARACRLRC SVLEARDLAPKDRNGTSDPFVRVRYKGRTRETSIVKKSCYPRWNETFEFELQEGAMEA LCVEAWDWDLVSRNDFLGKVVIDVQRLRVVQQEEGWFRLQPDQSKSRRHDEGNLGSLQ LEVRLRDETVLPSSYYQPLVHLLCHEVKLGMQGPGQLIPLIEETTSTECRQDVATNLL KLFLGQGLAKDFLDLLFQLELSRTSETNTLFRSNSLASKSMESFLKVAGMQYLHGVLG PIINKVFEEKKYVELDPSKVEVKDVGCSGLHRPQTEAEVLEQSAQTLRAHLGALLSAL SRSVRACPAVVRATFRQLFRRVRERFPGAQHENVPFIAVTSFLCLRFFSPAIMSPKLF HLRERHADARTSRTLLLLAKAVQNVGNMDTPASRAKEAWMEPLQPTVRQGVAQLKDFI TKLVDIEEKDELDLQRTLSLQAPPVKEGPLFIHRTKGKGPLMSSSFKKLYFSLTTEAL SFAKTPSSKKSALIKLANIRAAEKVEEKSFGGSHVMQVIYTDDAGRPQTAYLQCKCVN ELNQWLSALRKVSINNTGLLGSYHPGVFRGDKWSCCHQKEKTGQGCDKTRSRVTLQEW NDPLDHDLEAQLICRHLLGVEAMLWERHRELSGGAEAGTVPTSPGKVPEDSLARLLRV LQDLREAHSSSPAGSPPSEPNCLLELQT" repeat_region 21454..21577 /rpt_family="MIR" repeat_region 23030..23119 /rpt_family="MIR" repeat_region 24272..24575 /rpt_family="ALU" repeat_region 25069..25117 /rpt_family="(TGAA)N" repeat_region 25120..25221 /rpt_family="L2" repeat_region 26850..27017 /rpt_family="MIR" repeat_region 27654..27955 /rpt_family="ALU" misc_feature 28119..28279 /gene="WUGSC:H_RG158O17.1" /note="match to EST AA078429 (NID:g1838113)" misc_feature 28791..28848 /gene="WUGSC:H_RG158O17.1" /note="match to EST AA078429 (NID:g1838113)" repeat_region 29352..29653 /rpt_family="ALU" repeat_region 29688..29987 /rpt_family="ALU" repeat_region 29992..30270 /rpt_family="ALU" repeat_region 30271..30390 /rpt_family="ALU" repeat_region 30813..30929 /rpt_family="ALU" repeat_region 30925..31039 /rpt_family="ALU" repeat_region 31079..31195 /rpt_family="ALU" repeat_region 31201..31342 /rpt_family="ALU" repeat_region 31453..31519 /rpt_family="MIR" repeat_region 31907..32201 /rpt_family="ALU" repeat_region 33434..33732 /rpt_family="ALU" repeat_region 33758..33946 /rpt_family="ALU" repeat_region 34047..34346 /rpt_family="ALU" repeat_region 34353..34648 /rpt_family="ALU" repeat_region 34822..34854 /rpt_family="MIR" repeat_region 35679..35973 /rpt_family="ALU" repeat_region 36011..36478 /rpt_family="L1" repeat_region 36479..36777 /rpt_family="ALU" repeat_region 36786..36975 /rpt_family="L1" repeat_region 37960..38259 /rpt_family="ALU" repeat_region 38306..38604 /rpt_family="ALU" repeat_region 38821..38952 /rpt_family="ALU" repeat_region 38956..39203 /rpt_family="ALU" repeat_region 39253..39409 /rpt_family="ALU" repeat_region 39411..39580 /rpt_family="ALU" repeat_region 39620..39889 /rpt_family="ALU" repeat_region 39892..40188 /rpt_family="ALU" repeat_region 40192..40243 /rpt_family="MER53" repeat_region 40251..40375 /rpt_family="ALU" repeat_region 40397..40445 /rpt_family="MER53" repeat_region 40640..40932 /rpt_family="ALU" repeat_region 42311..42368 /rpt_family="MIR" repeat_region 42443..42714 /rpt_family="ALU" repeat_region 42764..42859 /rpt_family="ALU" misc_feature complement(44098..44249) /gene="WUGSC:H_RG158O17.1" /note="match to EST AA076746 (NID:g1836430)" misc_feature 44496..44644 /gene="WUGSC:H_RG158O17.1" /note="match to EST AA077669 (NID:g1837143)" misc_feature complement(44852..44959) /gene="WUGSC:H_RG158O17.1" /note="match to EST AA076746 (NID:g1836430)" misc_feature 45155..45329 /gene="WUGSC:H_RG158O17.1" /note="match to EST AA077669 (NID:g1837143)" repeat_region 45480..45781 /rpt_family="ALU" repeat_region 45798..45873 /rpt_family="MIR" repeat_region 46317..46588 /rpt_family="ALU" repeat_region 46659..46958 /rpt_family="ALU" repeat_region 48050..48092 /rpt_family="ALU" repeat_region 48219..48357 /rpt_family="L2" repeat_region 48373..48659 /rpt_family="ALU" repeat_region 48772..48951 /rpt_family="ALU" repeat_region 48968..49056 /rpt_family="L2" misc_feature complement(49220..49392) /gene="WUGSC:H_RG158O17.1" /note="match to EST AA076983 (NID:g1836457)" repeat_region 49396..49470 /rpt_family="ALU" repeat_region 49480..49604 /rpt_family="ALU" repeat_region 49786..49817 /rpt_family="L2" misc_feature complement(49842..49946) /gene="WUGSC:H_RG158O17.1" /note="match to EST AA076983 (NID:g1836457)" repeat_region 50257..50481 /rpt_family="ALU" repeat_region 50496..50786 /rpt_family="ALU" repeat_region 51050..51105 /rpt_family="MER1_TYPE" repeat_region 51121..51424 /rpt_family="ALU" repeat_region 51455..51759 /rpt_family="ALU" repeat_region 51797..51858 /rpt_family="MER1_TYPE" repeat_region 52061..52367 /rpt_family="ALU" repeat_region 52479..52582 /rpt_family="MIR" repeat_region 52689..52985 /rpt_family="ALU" repeat_region 53692..53845 /rpt_family="MER1_TYPE" repeat_region 53987..54282 /rpt_family="ALU" misc_feature complement(54861..55032) /note="match to EST AA029228 (NID:g1496661) zk09g12.s1" misc_feature complement(54884..55032) /note="match to EST AA447206 (NID:g2159871) zw91f09.s1" misc_feature complement(54902..55032) /note="match to EST AA476264 (NID:g2204475) zw44h05.s1" misc_feature complement(54915..55032) /note="match to EST AA149282 (NID:g1719858) zl25d05.s1" misc_feature complement(54935..55032) /note="match to EST Z28929 (NID:g434814)" repeat_region 55033..55114 /rpt_family="L1" misc_feature complement(55115..55285) /note="match to EST AA149282 (NID:g1719858) zl25d05.s1" misc_feature complement(55115..55270) /note="match to EST Z28929 (NID:g434814)" misc_feature complement(55115..55285) /note="match to EST AA029228 (NID:g1496661) zk09g12.s1" misc_feature complement(55115..55289) /note="match to EST AA476264 (NID:g2204475) zw44h05.s1" misc_feature complement(55115..55287) /note="match to EST AA447206 (NID:g2159871) zw91f09.s1" repeat_region 56747..57049 /rpt_family="ALU" repeat_region 57075..57209 /rpt_family="ALU" repeat_region 57238..57361 /rpt_family="L1" misc_feature 57368..57526 /note="match to EST AA025913 (NID:g1491242) ze91b08.r1" repeat_region 57527..57658 /rpt_family="ALU" misc_feature 57659..57736 /note="match to EST AA025913 (NID:g1491242) ze91b08.r1" repeat_region 58222..58500 /rpt_family="ALU" repeat_region 58541..58837 /rpt_family="ALU" repeat_region 59029..59322 /rpt_family="ALU" repeat_region 60121..60442 /rpt_family="ALU" repeat_region 60445..60746 /rpt_family="ALU" repeat_region 61097..61396 /rpt_family="ALU" repeat_region 61489..61788 /rpt_family="ALU" repeat_region 61904..61937 /rpt_family="MER1_TYPE" repeat_region 62020..62364 /rpt_family="ALU" repeat_region 62468..62510 /rpt_family="ALU" repeat_region 62511..62775 /rpt_family="ALU" repeat_region 63120..64090 /rpt_family="RETROVIRAL" repeat_region 64101..64369 /rpt_family="ALU" misc_feature complement(65035..65107) /note="match to EST AA077570 (NID:g1837044)" misc_feature 65036..65115 /note="match to EST AA461132 (NID:g2186252) zx64f03.r1" gene 65057..>69567 /gene="WUGSC:H_RG158O17.2" CDS join(65057..65106,67653..67742,69393..>69567) /gene="WUGSC:H_RG158O17.2" /note="similar to DNA-DIRECTED RNA POLYMERASE II 13.3 KD POLYPEPTIDE; 98% similar to P5243 (PID:g1710661); H_RG158O17.2" /codon_start=1 /evidence=not_experimental /db_xref="PID:g2822158" /translation="MNAPPAFESFLLFEGEKITINKDTKVPNACLFTMNKEDHTLGNI IKSQLLKDPQVLFAGYKVPHPLEHKIIIRVQTTPDYSPQEAFTNAITDLISELSLLEE RFR" repeat_region 65279..65385 /rpt_family="MIR" repeat_region 65540..65619 /rpt_family="MIR" repeat_region 65633..65928 /rpt_family="ALU" repeat_region 65935..66100 /rpt_family="ALU" repeat_region 66248..66448 /rpt_family="MER1_TYPE" repeat_region 66512..66685 /rpt_family="MER1_TYPE" repeat_region 66911..67064 /rpt_family="ALU" repeat_region 67067..67134 /rpt_family="ALU" repeat_region 67265..67574 /rpt_family="ALU" misc_feature complement(67651..67867) /gene="WUGSC:H_RG158O17.2" /note="match to EST AA077570 (NID:g1837044)" repeat_region 67984..68285 /rpt_family="ALU" misc_feature 69393..69570 /note="match to EST AA461132 (NID:g2186252) zx64f03.r1" repeat_region 69997..70233 /rpt_family="ALU" repeat_region 70276..70659 /rpt_family="MALR" repeat_region 71006..71305 /rpt_family="ALU" repeat_region 71308..71605 /rpt_family="ALU" repeat_region 71735..72061 /rpt_family="RETROVIRAL" repeat_region 72149..72357 /rpt_family="ALU" repeat_region 72368..72587 /rpt_family="MER21_G" repeat_region 72597..72901 /rpt_family="ALU" repeat_region 72918..73079 /rpt_family="ALU" repeat_region 73193..73321 /rpt_family="ALU" repeat_region 73438..73737 /rpt_family="ALU" repeat_region 73816..73917 /rpt_family="ALU" misc_feature 74977..75219 /note="match to EST AA282160 (NID:g1925241) zt12a11.r1" misc_feature complement(75143..75219) /note="match to EST AA282209 (NID:g1925125) zt12a11.s1" misc_feature complement(75295..75459) /note="match to EST AA282209 (NID:g1925125) zt12a11.s1" misc_feature 75295..75400 /note="match to EST AA282160 (NID:g1925241) zt12a11.r1" misc_feature 75418..75608 /note="match to EST N44656 (NID:g1185822) yy33e11.r1" misc_feature 75604..75817 /note="match to EST N44656 (NID:g1185822) yy33e11.r1" repeat_region 76068..76279 /rpt_family="ALU" repeat_region 77203..77339 /rpt_family="ALU" repeat_region 77341..77638 /rpt_family="ALU" repeat_region 77640..77805 /rpt_family="ALU" repeat_region 78560..78861 /rpt_family="ALU" repeat_region 79594..79886 /rpt_family="ALU" repeat_region 79921..80202 /rpt_family="ALU" repeat_region 80909..81236 /rpt_family="ALU" repeat_region 81939..82255 /rpt_family="ALU" repeat_region 82332..82623 /rpt_family="ALU" repeat_region 82624..82783 /rpt_family="ALU" repeat_region 83475..83776 /rpt_family="ALU" repeat_region 83859..84112 /rpt_family="ALU" repeat_region 85277..85572 /rpt_family="ALU" repeat_region 85641..85938 /rpt_family="ALU" repeat_region 86761..87059 /rpt_family="ALU" repeat_region 87137..87269 /rpt_family="ALU" repeat_region 87270..87568 /rpt_family="ALU" repeat_region 87571..87749 /rpt_family="ALU" repeat_region 87767..87875 /rpt_family="L2" repeat_region 87963..88257 /rpt_family="ALU" repeat_region 88258..88289 /rpt_family="(CATA)N" repeat_region 88372..88419 /rpt_family="(CA)N" repeat_region 88462..88787 /rpt_family="ALU" repeat_region 89475..89761 /rpt_family="ALU" repeat_region 89796..89905 /rpt_family="(GAAA)N" repeat_region 89906..89982 /rpt_family="ALU" repeat_region 89989..90084 /rpt_family="ALU" repeat_region 90199..90358 /rpt_family="ALU" repeat_region 90386..90682 /rpt_family="ALU" repeat_region 90690..90974 /rpt_family="ALU" repeat_region 91300..91442 /rpt_family="ALU" repeat_region 91444..91736 /rpt_family="ALU" repeat_region 91737..91904 /rpt_family="ALU" repeat_region 91908..91984 /rpt_family="MALR" repeat_region 92021..92320 /rpt_family="ALU" repeat_region 92573..92872 /rpt_family="ALU" misc_feature complement(93000..93273) /note="match to EST R11672 (NID:g764407) yf40g09.s1" misc_feature 93090..93273 /note="match to EST AA324364 (NID:g1976608)" BASE COUNT 21492 a 25218 c 24834 g 21729 t ORIGIN 1 aagcttctgg accctgagag agattgtttc ttttctttac tatttttttt tttttttttg 61 agacggagtc tcactctgtc gccaggctgg cgcgatcttg gctcactgca acctctgcct 121 cacgggttca agcgattttc ctgcttcagc ctcctgagta gctgggacta caggcacacg 181 ccaccatgcc cagctcattt ttgtattttt agtagagatg gggtttcacc atgttggcca 241 ggatggcctc catctcttga ccttgtgatc cgcccgactc ggcctcccaa aatgctggga 301 ttacaggcgt gagccatcaa gtctggcgag agagattgtt tctagatgag ggtgggggcg 361 ggtgtcctta gcccaaagct tgtgccagtc tctatcagaa ataaatgccc ccaaaacctc 421 cctgcctgtc cgtgacatca tacacctggg tagtctttct gcaccactgc ccctggctcc 481 tcctcctgag gaggtcccca agggcgtgta ggaccagggc aaggctcagg atcctagaga 541 aattgtcacc tggggcagga caggcctctc tgccagcctt actgtcccta tctgtaaaat 601 gaggatggta accctgaacc cacaaggcca cagtaaggct aaaccaaggt gaacatgcct 661 tgggaactgt gaagactgtg gaccccaagc cggttgcggt ggctcaggcc tgtaatccca 721 gcactttggg aggctggggc aggaggatca cttgagccca ggagttcgag accagcctgg 781 gcaacatggt gaaaccccat ctctacaaaa atgacaaaaa attagctggg cttgttggcg 841 gactgtggac ctgagatagg tgctcctctt gtcctgccca gacagcctcc tggtaaccaa 901 aggccacggg gcagcatggc tggcacttgc tatctttcca cccaagcatc ctccctgtgg 961 cccctctccc cagcctgccc cctgttggct ggaggacttc aagccctctg cctccacttc 1021 ctccagtcct ttcctagagt acacaaaagg caaaaggcgc agtttctttt tttttttttt 1081 ttgaaatgga gtttcacctt atagcccagg ctggagtgca gtggcaccat catagctcac 1141 tgcagtctct aattcctggg ctccggtgat cctcccacct cagcctcctg agagtagctg 1201 ggactacagg cacatgacac cacgtccggc taatttttaa tttttttgta gacatggaat 1261 ctcactctgt tgcccaggct ggaaagaaca gtggtgtgat catagctcat tgcagcctca 1321 aactcctggg tgcaagcgat ccttccgcct cagcctccta agtagcttgg actataggta 1381 catggcacta tgtctggcta attttttaaa tttctttaga gacggggtct cactatgttg 1441 ctcaggctgg tctcgaactc ctggtctcaa gtgatcctcc tgctgggatt acaggcgtga 1501 gccactgtgc ccagctcatg caacacttat catcaggatt cccgctcagt gctcggaccc 1561 caggggccag agatgcccgc ggcaaggttc tgccctcaag gtgcccactg ctttaggggg 1621 aagacagtga tactaccaca tacgagagaa ggagaggaca caactgtcag ggagatggac 1681 tgttggagct tcagaggcag catcttgggg aggggtggct ttgaagaagg agcaggacac 1741 ttgggccagg tggaccttgg aagggtgcca ggcaggagga gcaatgtgtt gctgatgggc 1801 aggtagggga gggtgaggga tggagaggag accggaagac accctgtgcc atgtggaggg 1861 gccctgccct ctggggcaag aggtcaggga ggagggcgct ggctgggttc ccaggcgcct 1921 gcccctgaga ggcgcaaatg gccctaatgg ggttattagg tggagcagat gcgtccggct 1981 cctgctgtgg aggaaatgct caaatgatct tgaagctacc aggctgggag aggcaggcag 2041 gggatggggc agacaggctg ccaacctctg tgagataagc cctggcctct ccagaacctt 2101 ttcagaacag ctatgagccc caaccctggg gctgagtgac ccaagacaga tcactcacac 2161 tctctgggcc tcagtttccc cttacataaa aagcaatggt taggccaggt gcagatggct 2221 cacacctcta atcctagtgc tttgggaggc cgaggtggga ggattgcttg agaccaagag 2281 ttcaagtcca gcctgggcaa catagcaaca ccccatctct acaaaacata aatacaaaaa 2341 ttaaggccgg gcatggtggt taaggccagg cgtggtggct cacacttgga atcccagcat 2401 tttgggaggc agaagcagga ggatcacctg aggtcaggag tttgagacca acctggccaa 2461 catggtgaaa acccgtctct atacaaaata caaaaattag ccagccatgg tggtgcatgc 2521 ctgtaatccc tgctactgtg gaggctgagg caggagaatc acttgaaccc aggtgaagga 2581 ggtttcagtg agcccagatc atgccactgc actccagcct gggtgacaga gaacaagact 2641 gtctcaaaac aaaacaaaag aaaaattagc cagacatggt agtgtgaacc tgcagtccca 2701 gctactatgg aggctgaggt gggaggatcg cttgagccca ggagttggag gctgcagtga 2761 gctatgatgg cacccctgga caccagcctc tgtgacagag caagagcctg tctctaaagt 2821 gaaacatttt taaaagggaa ggtgaactgg gtgaactcta agaccctgca tttcacaaac 2881 ccttgcccac accctgtatg cgtcgggccc tggcagggaa cagatggtac gctgtctcac 2941 agggcaattg agagcgttta caaaggtgtg ggcagcatta ggggaatgga ccagagaggg 3001 tgacctggga cttggctgta atggggccct aagcactcct ggatctgatt gctgatggga 3061 gggagggagt gatgtttccc gatcagcagg cagtggccat gggtgagggg ccccgcagga 3121 gcaggggctg taagtgaaag gatgaagcca ccagagggaa ctgggctctg tcttcccgcc 3181 ccctgatctc atgccagtgc ctccactggc cacacctcac gagaagcaag agagtgtggg 3241 gtctcagtgg tgcagtccct ggaggccaat ctgctgggtc agagcaggac agagaaagcg 3301 aaatggacat acataggcac acgccgcttt catgctgatc cagtggaatt agatccttcc 3361 tgttctttct tttgtttctt tctttttttt tttttccttt tttagagatg gggtctcact 3421 ctgttgccca ggctggggta tggtggcgtc ttcatagctc actgtagcac tgaactcctg 3481 ggctcagacg atcctcccac cttggcctcc caaagtgctg ggattacagg cactacaggc 3541 acctagctaa ctttcttttt ttttgtagag gcagggtctc tgtcacccag gcttgagtgc 3601 agtggcacga tcatagctca ctgcagcctt gaacccctgg gttcaagcaa tcctcccacc 3661 tcagcctccc aagtagctgg gattataggt atgtgccacc acccagctaa tttttttatt 3721 ttttgtagag atggggtctc gccatgttgc ccaggctagt ttggaactcc tgagctcaag 3781 agatgctccc accttggcct cccaaagtgc tgaaattaca agcataagcc accatgcctg 3841 gcctcccttc ctggccttct ggtctccaga ctggagagag ggccacaaag tcctgcccag 3901 acagagggct gctaaggctg gggtggggct gggggtgtgc agaagggact ctgcaggggc 3961 tgacctctga gtctcagggg acagggcaaa tttttttttt tttttttttt tgagatggag 4021 tcttgctctg tcgcccaggc tggggtacaa tggcacgatc tcggctcact gcaatctccg 4081 cctcctgggt tcaagtgatt ctcatctctc agcctcccga gtagctggga ttacaggagt 4141 gcaccaccat gcccagctga tttttgtatt tttagaagag attgggcggg gtggggggtg 4201 gggtgcgggt ttcaccgtgt ggcccaagct ggtcttaaac tcctgacctc aagtgatcct 4261 cccgcttcag cctccccagg tgctgggatt acaggcatga gccaccgcac ccagcgggga 4321 caggacaatt ttgccagctg gagaaggggt ggcccccaaa tgtccccttc accaccctcc 4381 tgcctcttcc ttcagaacac tttttccctg aagccctcct gggtagcccc ctacccacat 4441 gcctcagcat tgggaggtgg ggagtatggg ggtccccctt ctcctcagcc ccatttggag 4501 taatgtcctg tgcagctgag ctcacaactc ctcctccacc tgtcccttca cccggtgtca 4561 ctggcaattg ctcacttcct gggcctgcac ccactcagct cccctatccc tgggcatcag 4621 cctcagccag ctcccatgtg gactaatgac ccttgtccct cccctcagct gcctctcttt 4681 tttctttttt tttttgtttt tgagacagag tctggctctg tcacccaggc tggagtgcag 4741 tggcgcgatc ttgactcatt gcaacctcca cctcctgggt tcaagcgatt ctcgtgcctc 4801 accctcctga atagctggga ttacaggcac ccgccaccat gcccggctga tttttgtatt 4861 tcggtagaga cggggtttca ccatgttggc caggctggtc tcgaactccc gacctcaggt 4921 gatccgcccg cctcagcctc ccaaattgct gggattacag gcgtgaggca gctcgcccgg 4981 caaccctcag ctgcctctca tatgtccttg gccccctggc catctctgac cagacatcta 5041 ggaggcccct cctgccaata tgcttccccg acagccgagc ccaaagctgt ctccagccct 5101 gcaccttgtc ttggttcccc gggtgctccc tttcccaggc tgaagcctgg acagccccag 5161 gcaacctctc tccctgacca cagtaccacc tggccacatg ccacactgct ttctacagca 5221 gacaccgcct ggccacaaag acttaagcca cttccttggg tcctgctcag cgctgagccc 5281 agagcccaat ctcagggact gggagagagc ccagtacatg gccagtggga ttctcccgat 5341 gaaaatccca gccatgtgag gctggcttct ccaggttctc tgaggcctgt gagtccaggg 5401 ctggctggag ggaaagcccc tggcccctta aagccactga tcatgttccc atagagggat 5461 cctgagaatc ccctgggaga gaaaatgtcg gagccccagt cctggcaccc ccctgaccag 5521 cttgggcctc agcctcctca tccatgaaaa gaatggacaa gaggggtctg ggtgcggtgg 5581 ctcacgcctg taatcccagc actttgggag gccgaggcag gtggatcacg aggtcaggag 5641 ttcaagacca gcctggtcaa catggcaaaa ccctgtctct actaaaaata caaaaattag 5701 ccggttgtgg tggtgcacgc ctgtagttgc agctatttag gaggctgagg caggaggatc 5761 actggaaccc gagaggcaga agttgcagtg agctgagatt gtgccacggc actccagctt 5821 gggcaacaga gccagactcc atcttaaaaa aaaaaaaaaa aaggagggac aagaggcacc 5881 agaagctcct gtccaggtct gggcctccct cgggtcactc agttctcccc cattcattca 5941 ttcattcact cactcactca ctcactcatt cccacactct gcagccactg acttcagcta 6001 gccatgctgg atgctaagac atagagtccc ggccatcaag gccagtccat ctcagtggca 6061 tctagacatg cacagagatg accagggagc ccctaaggga cagagaggca caccctgcct 6121 cctgatccca tctgggggct gcatgaagga ggggcacagg tgtggccttg aaggctgggt 6181 aaggggctgg gagggcttta ggggcattcc agggggcagg cacagaactg gaggctctgc 6241 cgggcacacc cacaggagag caatggctgg agggtgacat ggactgcagg gtatggccgc 6301 ggcagcccag ctcttaaggg gacagcctgg gaaaacagac ttagggacaa tgatcttgag 6361 ccatgaaatg atgtccaatg ccccttgtgc cacctgacat cagcctgatt aatgaacaaa 6421 cagaagagga agcaggggac ctgtgtgtcc ctctctctgc ttttctgtct ttctctgtgc 6481 cccctctctt atgtctctca cgctgtctct tgtccttgtt tttaactctg acacgccatg 6541 ggaacttggc tcacttccta gggggaagct gtctggactg gcccagagat gggcaccacc 6601 atttcccaat aaacacaaga taaaatcaga gagctggcgt cagttccgcg gctggactca 6661 gtggggaaaa gggcaaaaag cagcagaaat aagagcaggg aaagaacagg ggctgggtgc 6721 gatgggtccc atctgtaatc ccagcgcttt gggaggccta ggtgggagga tcgcttgagc 6781 ccagaagttc aagaccagcc tgggcaacat acggagaccc ccatttctaa aagttttaaa 6841 aattagctgg gcatggtagt gtgcatctgt ggtcccagct actcaggagg ccgaggtgtg 6901 aggatcgctt gagcccagga ggtcgaggct gcagtgagct atgattgcat cactgcactc 6961 cagcccgggc acagagcaag accctgagac cttggcagaa agaaagaaag agagaaagag 7021 agaggagagg gagagggaga aagaaagggg ggggggagag agagagagag agagagagag 7081 agagagatga aagagaagga aggaaagaaa gaaaggaaag aggaaggaag gaaggagagg 7141 ggaaggaagg aaggagggga aggaaggaga gagagaggaa ggaaggaagg agggaaggga 7201 aggaaagggg aggggagggg aggagaggga gaaaagaaag gaaaaagaac aagtggatat 7261 caggggacag aaaccattcg cagcccatcc ctcggcagat gcagagggag tcagcctgga 7321 acccccaccc ctgctggctg tgccccaccg ctgtctccac accccagccc ttcctccaga 7381 cacccagccg tccactgcag caggagcaca ggtgccccaa ggggaccact gttggaaacc 7441 aggcagccct ccactcaccc acacagtgtg tctctgcacc ccctctttga catgggagcc 7501 ggggcgctct tggaaataag ggtttacctg aggggtctga agacttgggc agtggggggc 7561 tgtgaagcct tggggaggga atcagcttgt ccaagagtct ggggcagcga gtggtcgggg 7621 gctacccggt ctgaccccat caaagtaacc ttcctggggg atatttccat tgcagggtca 7681 caaaaaggca agggctaagg ttcgagagag gggtctctct ggggtcccag ccaccctgct 7741 gtgggggccc ttcccaagca ggacactcca ggcaaagggg atggctttcc cttgctccca 7801 ggacccatag gtgaggtcct ggctgaaatg cagactgtcc ccagatgccc ctggggactt 7861 gtagggattt gggggattca gtgttgcagg agctggaacc cttgcttgct ggacgggttt 7921 gagctgtgct gtcaggccca aagctcctgg gtctccagga tggtgcatga gggctcacac 7981 cagctgctcc ctgccaccct tcccccacct cccacccctt ctcctggagc cccagtcctc 8041 tggtcctcca agctcagctt caaatgcttc caccacttct gtctcttcta cctcgagtgg 8101 tttgtcaacc tccaaatcac acggagttca agaatatctc aaatatcgac caggtgcggt 8161 gtctcacacc tgtaatgcca gcactttggg aggccgaggc gggtagatca cctgaggtca 8221 ggagttcgag accaggctgg ccaacttggt gaaactctgt ctctactaaa aatacaaaaa 8281 ttagccgggc gtgatggtgg gtgcctgtaa tcccatctac ttgggaggct gaggcaggag 8341 aatctcttga accagagagg tggagattgc aatgagccaa gattgtgcca ctgcactcca 8401 gcctgggcaa cagagtgaga ctctgtctca acaacaacaa caacaaaaag aatatctcga 8461 atacaaaaat tagctggccg tggtagttta cacctgtaat ccgagctact caggaggctg 8521 aggcagtaga atctcttgaa cctgggaggc agaggttgca ctgagctgag atagcgccac 8581 tgcactccag cctgggtgac agagcaagac gtcatctcga aaaaaaaaaa aaaaaagagt 8641 atatcgaatc caaaggaata tcttgaatac aaaaattagc tgggcatggt ggcacacgcc 8701 tgtaatccta gctactcaag aggttgagac aggagaatcg cttgaaccca ggtgttggag 8761 gttgcactga gctgagattg cgccactgca ctccagcctg ggtgacaaag cgagactgtg 8821 tctcaaaaaa taaataaata aataaaatac aaaattaaaa gaaagattat ctcctggcca 8881 cccatggtgc aggctgtaat ctgtctctgc tggggtgaag ggacggaggg agacagatga 8941 gaaatgagaa cacagacctg gagacccaga ccacccatgc cagctaccca ggtcacttca 9001 gcccagccca gggagcccct gcccagcaca agcctcatgc acagtagcag tgttaagatg 9061 ggcccttggg ctgggtgcag tggctcacac ctgtaatccc agcactttgg gaagctgagg 9121 tgggtggatc acttgaggtc aggagttcga gaccagcctg gccaacatgg cgaaaccctc 9181 gtctctacta aatatacaaa aattaactgg gcgtggtggt gagtgcctgt aatcccagct 9241 attcgggagc ctgaggcagg gcaattgctt gaactcggga ggcggaggtt gcagtgagcc 9301 gagatcgcac cactgtacga cagagcaaga ctccgtctca aaaaaacaat aaaaagaatt 9361 gctggaggag ggaagtgagg ggcggctggc gggggactgc tgggctggtg cagagggagt 9421 caaggatccc taagagagag gcaggggtgt tcagctctcc cacctctgct ggatgaactt 9481 gcagagcctc cttggtgacc cagcccgtgc agagactctg ctcaggacaa cacccttgcc 9541 tgggaccccc acatctctgg ccccagcagc agcccttggg ggggtccctt tatcttcctc 9601 cactcctacc caggcttcct cagcaggaga tgaggcagga catttccact ccctttctgg 9661 aaggttcaga aggttaatga gagctaagca cgtaagtcca tttaggggga tgaacttctg 9721 ggaagagagg aacctgggtc tgggctgacg tccaagggcg ggctgggtga cggtccctct 9781 gatcacggac cctgtccacc cactgcccag ggccctgcct cgacccctct gaccagccac 9841 cgagccccag agggatctcc atgaatgtca gagacattga ctggaggcct tatctccagt 9901 gggagacccc ttctcttccc actgtgggcc ggttccagcc tgggctgtcc aggaagtgac 9961 ctctcagggc ctgggaaggg tgtggccagt ggttcttggt tgtactcaac tcatctgcct 10021 tgggtctaag gctggggtga atggaagggc ccacctggac cctggaggga caccaggctc 10081 atactaaaat cccaaaaagt gaaaagcttt ccccaggccc aagcagagaa actggacctt 10141 gaagctacat ctctggactt agtcctcaaa gtaggagaca tttgcctcta agctgttctc 10201 tcccacccca cctttctgtg agccgccggt tccctgttgt ccacatcaag ctgtgtgctg 10261 ggcactgggt gcaggaatag cttgaccaca gtctctatcc tgggggtaaa ggggtgacca 10321 gcccacagag ggatggactg caaacagaca gtcccaaagt gccatgagag aagctctcag 10381 ggcctgggcg tgatggttca tgcctggaat cccagcactt tgggaggccg aggtgggtgg 10441 atcagttgag gtcaggagtt cgagaccagc ctggccaata aggtgaaact ccatctctac 10501 taaaaataaa taaataaata aataaataaa taaataatac aaaaattagc cgggcatggt 10561 ggcatgcacc tgtaaaccca gctacttggg aggctgaggc aggagaatca cttgaacccc 10621 agaggcagag gctgaagtga gccaagatta cgtcactgca ttccagcctg ggtgacagag 10681 cgagactcca tctcaaaaaa aaaaaagaaa aaaaaagaga gagagagaga cagagagagg 10741 ctctcagcct gcaggaaagg gctctctcct tgttggccaa accagtgggc cctttagctg 10801 tgacgggtgt gccctgggcc caccaggaca ggagcaaggt caggagggct gctctgctct 10861 gcaaaacaga ggctgatgga tctgaagttt ctgttactgg gagaataaag ggaggtgaag 10921 gagacacgtg gtaggtcccc tgggaaggtg gtgggaatgc gtgtgaaatg ctcggagagg 10981 cacagtaaga accggcctgt ttagaatcca gtgttctacg cctgccccag gccccagcaa 11041 ttctgcttct agacatctcc ccatgagaaa tgagcactga gcctcccaaa catgtctttt 11101 gggaggccaa ggggagcgga tcacttgagg tcaggagttc gagaccagcc tggccaacat 11161 attgaaaccc cgtctctact aaaaatacaa aaattagcca ggcctggtgg tgggtgccta 11221 tagtcccagc tactcgggag gctgaggcag gagaatcgct tgaacccggg aggtgaaggc 11281 tgcattgagc caagactgcg ccattgcact ccagcctggg ccacagagtg agactccgtc 11341 tcaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa gacatgtgca agaacattca 11401 gatcagcctt cttcataaca gacccaaaca ggaaacaaca gacgttcatc cacaggctag 11461 tgaagaggca ggttgtgagg cctccgccca caccgtagaa tgctattcag caacatgggt 11521 gactcccctg catgtaacat tggcaaagaa gccaggcaca agcaaatata cactgtagga 11581 ttccatttat ataaagagca aaagcaggca ggagggccgg gtgggcacag tggctcacac 11641 ctgtaatccc agcactttgg gaggccagtt gtggggggtg aatcgcttga gcccaggagt 11701 ttgagaccag cttgggcaac atagcaagac cccatcttaa aaaaaataaa aaattagccg 11761 ggcgtggtgg tgcaggccta tcatcccagc ttctcaggag gctgaggcag gaggatcact 11821 tgagcccaga agttggaggc tgcagtgagc tgtgattgtg ccactgcact ccagtctggg 11881 caacagagtg agaccctatc tcaaaaaagc aggtgggagt aatccatggc aataaattgg 11941 gatagaagtt attcctggag gtggtgatga ctggtatggg acacaaggaa gccacctggg 12001 gctgggaatg ttctagttct tgatctggga gatggtcatg cagtcgtagg cctgtgtaag 12061 acttcctcga gctgtccaca taagatttgt gcactttagc tgaatgtggt ggcttacacc 12121 tggaatccta gcactttgag aggctgaggt gggcagattg ctagaagcca ggagttcgag 12181 accagcctga gcaacatagt gagccccctg actctaaaaa aaaaaatttt ttttttaatt 12241 atctgagcat gctggcatgc acctgtagtc ccagctactt aggaggccga ggtggaaaga 12301 ttgcttgggc ccaggagatt gaggctgcag tgagctatga tcacaccact tcactccagc 12361 ctgggcaacg gagcgagacc ctatctcaaa aaaataaata aaatatttgt gcagttctct 12421 gtatggaaaa tgtccgcgtc tatccagcaa gcatccatgg agtagtccgg agaattgtgg 12481 gttagcaaga cactgggtgg cgtcaggcac tggagggatg ggggaacagc catgcagaag 12541 atcccaggca gagccaaggc agagatgcag ggggatggtg gggggcctac tggggacggc 12601 taagtggtgg ggggacacca gtgatccaag ggctgggggc atctagaggc cagggttcca 12661 gttacccaca gtgcccagga tctgtgggag tagtggatgg gggagtagtg gatgggcaag 12721 ggtcaagctg aaaaacaaga gccgtcaccc tcaccctccc atgcccccac tgtggctcca 12781 tcctggctcc tggttctcat cagccgagcc ggggcccttt ccccggctgg gggtggccat 12841 ggcgccagaa gccacgacag gtttacgttc cagccgcagc tgcactggcc ccatcccaag 12901 tcccctcgct tgcacacaca tttttctcct tgtggagtca gaagaggcct ggccaggtcc 12961 tgcacagggt tctggaggcg agagtcccct tggaatcacg ttgtctcccc cagccctggg 13021 ccgtctctgc tcggggagtg ccctcagagc tgcacctgga ctctttcccc cacttcccag 13081 cccttcctcc tgttctttct cccagcccca gctcttctcc ttcaagccca gctcaggtgc 13141 cgcctcttcc aggaagcccc ctgggaccat ctccagtctc agtgggagca gtgtgtcctt 13201 ggttgctccc agagtctata gcagagatct ggacccaaga ctctggactc ccagctttgg 13261 gctggcaggt aacacccact cctggggctt cacttgcctt gctacgtgtc agtttctcac 13321 tttgaggggg gcagcagtat tcacccacag gggctggggc tgaggggatt aagggactgc 13381 agctccagga ggcctcagag gaaagggtac aggtgggatg cagtggctca cagctctaat 13441 cccagcactt tgggaggcca aggtgggagg atggctcaag tccaggagtt ggagaccaac 13501 ctgggcaaca cagtgagaca tcatctctac aaaaaattta gaagtttttt tctttttctt 13561 ttcttttttt taattattat ttttttgaga cagagtctta ctctgtcact caggcactct 13621 gtcatctcag ctcagtcagg cactctgtca tctcagctca ctgcaacctc cacctcccag 13681 ggtcaagtga ttctcgtgcc tcagcctccg gagtagctgg ggttacaggc atgagccatc 13741 atgcccagcc ctccaaaaag aaaaacaata acaacaacaa caattttggc cgggcatggt 13801 ggctcacacc tgtaatccca gcaccttggg aggccgagga tcacttgagg tcagtagttg 13861 gagaccagcc tggccaacat ggtgaaaccc cgtctctgcg aaaaacacac aaaaaattag 13921 ctgggtatgg tggcaggtgc ctgtgatccc agctacttgg gaggctgagg caggagaatt 13981 gcttgaacct gggaggcaga ggatgcaatg agccaagacc atgccgctgc actccagcct 14041 ggggaacaga gcgagactct gtgtcaaaaa acaaacaaac aaacaaacat tttaaaagaa 14101 agaaagaaaa aagggaaagg agagcatcag agtctaagac cacacagaaa ccctctcttt 14161 actgcagtgc tgaagccctc gtggcagcca tgccctgctc acttttggga tacccatgga 14221 ctcagggcac gtcctaaggt gacacctgtc accagcccta tgcctgcctc tgtccacctt 14281 cgcctctgct ggagaccatt ctcagacacc cctgagatgc ccagagggga cctgcaacct 14341 cttcccaccc cacctcccac aaggggtccc tgggaaaact gggggtaaca ttccccccac 14401 cctcacccct acttcagtct gggtgttcag agcagccctc tctgtctctg aattttctct 14461 ttttttcctt ttggagacgg agtctcactc tgtcaccaag gctggaatgc agtggcgcaa 14521 tctcagctca ctgcaacctc catgtcccgg gttcaagcga ttctcacact tcagcctccc 14581 aagtagctgg gattataggt tcacgccacc actcctggcc tctgaatttt ctctcttggt 14641 gttttgtttt tgtttttgtt tttttgacag agttttgctc ttgttgccca ggctggagtg 14701 taatggctct atctccgctc actgcaacct ccgcctccca ggttcaagcg attctcctgc 14761 ctcagcctcc cgagtaactg ggatcacagg cacctgccac cacacctggc taattttttg 14821 tatttttagt agagacgggg tttcaatatg tttgccaggc tggtctcgaa ctcctagcct 14881 caggtgatcc acttgccttg gcctcccaaa gtgctgggat tacaggcatg agcttgaatt 14941 ttctctttct gatttcctgt ctctctgcct ctgactatgt ttctccatct gtttctctct 15001 gtctgtctgt ctgtctttca gagcagggtc tttctttctg acatctgcag ggctgggggc 15061 tgtgctagga ggggggtgaa tttgtcattg tgactttgca gctccttatc tatgaagtgg 15121 acaggacacc agcatctagt cctggagtcc ttggtgagga cctggtgtgt tgaggctggc 15181 cggtgcagag ttaagggttc agtccatgat ttctgcatcc atgaaagggg gatggtgagg 15241 ctgggcatgg ttgctcatgc ctgtagtcct agcgttttgg gaggccgagg cgggtggatg 15301 acttgaggtc aggagttcaa gaccagcctg gccaacatgg tgaaaccctg tctctactaa 15361 aaatacaaaa attagcaggg cacagtggct catgcctgta atcccagcac tttgggaggc 15421 tgaggcaggc gcatcacttg aggtcagaag ttcaagacca gcctggccaa catggcaaaa 15481 ccctgtctct actaaaaata caaaaattag cagggtgtgg ttgcacacac ctgtaattcc 15541 agctactcgg gaggctgagg caggagaatc acttgaaccc gggaggtgga ggttgcagtg 15601 agccgagatt gcaccattgc actccagcct gggccacaga gcaaaactcc gtctcaaaaa 15661 aaaaaaaggg ggggggggtg gggggtggat ggtaagaaca gccctgctcc tctgcttctg 15721 gggattaacg tgaggatgtc ctgagtgctt gccccaggga ggtgccctct gcggtctggt 15781 gaccaagtgg ctcaccagca cactttgttc ccagccactc tgcggggtgc agccaccatt 15841 gccccactct acagattagg cagatgaagc ccagagaggc tgaataactg gcccagtcac 15901 acagcagctc ctatcatagc agtggaggta ggatttgaac tcaattgtct ggctccagag 15961 cctgcgatct tcactgtgga cctcgttttt ttgtggtttt tgctttttgt ttttggagac 16021 agggtcttgc tctatcaccc aggctggagt gcagtggcac aatcacagct caccgcagcc 16081 gtgacctctg agctcaagcg atcctcccac ctctgcctct cgagtagctg ggactacagg 16141 tgtgcaccac catgcctggc taactttttt atttttcgta aaggcagggt ctcgctatgt 16201 agcctaggct ggtcttgaac tccaggcttc aagcgatcct ttgccttggc ctcccaaagt 16261 gctgggatga caggcattga gccaccaagc ccagcctgtg gaccttggct ttaaaactgt 16321 tattcaatgg tatctgtcta aatggtgcag taggagtgtc aagattggtg gggtaggggg 16381 aagctgggga tagagcccca tctggcctct gatgactccc agcctgcccc tctggccagc 16441 caggattttt tttgtttttg agacagagtc ttgctctgtt gcccaggcta gagtgcagtg 16501 gcatgatctc gactcactgc aacctctgcc tcccgggttc aagtgattct ctgcctcagt 16561 ctcccgagta gctgggatta caggcaagca ataccatttc gggttaattt ttgtattttt 16621 agtagagacg gggtttcatc atcttggccg ggctggtctt gaactcctga cctcgtgatc 16681 cacctgcctt ggcctcccaa agtgctggga ttacaggcat gagccaccgc tcccggccag 16741 gacttcttaa catacagctg tcacttggcc cagagcctgc cagggatcct tgtgtccctg 16801 gctcctgtgc ccagcttttg cgggttcaca gctggactcc tctggccttc ggagagggct 16861 gcccttgcca ccctctgcac ctgccaccca acttcctcct catcccctgc tcctgcctcc 16921 tatccagttt tctgggcttc tccttcttcc tccagcaatg ccttcggtct cattcttcca 16981 ggacctgcct cctcccaaag tcctctctgg gggctctggg ccatggcaca gcaggacagc 17041 caccagaagc cactgtcaga gtctgatggg tgaggacgga aggtggctca ggagtccttc 17101 cttagggcag ggacaatttt ttccaggaga ggtgtgaggc ctccagctcc taaaactcct 17161 cgaggtttaa gagcgggaaa tgctttggct ccaaaatatt aaaactgcaa cattaaaaaa 17221 tagtaaatgt ttactgaaat gtcttctaaa tataacttta tgtcagtttc atccattgtt 17281 aaatttagta tttcaaaatg catcactgcg catgaaaaca attcgtaggt cacgttttct 17341 cactttggaa gaatcctaag attctcaaga atgcatgatg ttggtagaga agtcaaggcc 17401 aggagcaaat caaatgagtt acttgccccc aaaccgcttt gaaagttgtt ataaaattct 17461 tttcaattgt tttatattgc caaaatatat tacagagttg gggtatgggt gcattttttt 17521 tattctggta aaaaaacaca taaggtttac catcttaact attttattta tttattgtta 17581 ttattttgag acggagtctc actgtgtctc cagcctggag tgctgtggtg ccatctggtc 17641 tcactgcaag ctccgcctcc tgggttcaag tgattctcct gcctcagcct cccgagtagt 17701 gggactacag gtgtgcacca ccacgcccag gtaatctttg tatttttagt agagacgggg 17761 tttcaccatg ttggccagga tggtctccaa ctactgacct caagtgatcc tcccgcctcg 17821 gcctcccaaa gtgctgggat tacaggtgtg agccaccacg cccggcctta tttctttatt 17881 ttttaaaatt ttatcatgtt gcccaggctg gtcttgaact cctgagctca agtgatctgc 17941 ccgcctcagc ctcccaaagt gctgggatta caggcatgag tcaccacacc cagcccatct 18001 taagcttttt ttttttttga cgaggtctct ctctgtcgct caggctggag tacagtggtg 18061 ccatctcagc tcaccgcaac ctccacctcc caggttcaag cgattctccc tgcctcagcc 18121 tcctgaatag ctaggactac aggcgtgcac caccacgccc agctaatttt tatatttata 18181 gtagagacag agtttcacca tgttggccag gctggtctca aactcccgac ctgaggcgat 18241 ccacgcacct cggcctccca aagtgctggg attccaggcg tgagccaccg cccctggccc 18301 atcttcacta tttttaagtg gacagcacag tagtattaac tatatgcaca tctctggaac 18361 tttttcacct tgcaatactg aaaccctaca ctcaacaaac aactcgcttt ccctgccccg 18421 ctccagcccc tgaaaacttc tactctcctt tctgcttctg tgaattgtac tattccagat 18481 acctcccata ggtggaagca accactattt gtctttcttt gtaactggct catttcactt 18541 agcattatgt cttcaagttt catccatctt ttagcatgtg taagaatttc cttccttgct 18601 ttttttgttt gtttgtttgt ttgttgagac tgagtttcac tcttgtcacc caggctagag 18661 tgcgatggtg cgaacttggc tcactgcgac ctctgcctcc cgggttctca agcaattctc 18721 ttgcctcagc ctccagagta gctgggatta caggcgcctg ccaccacgcc cggctaattt 18781 ttgtattttt agtagatatg gggtttcgcc atgttggcca ggctggtctc gaactcctga 18841 ccgcaggtga tccacccacc tcggcctccc aaagtgctgg gattacaggt gtgagccacc 18901 acacctggcc tcgtttttgt ttttgttttt ggagacagag ttttgctctg tcgcccacgc 18961 tggagtgcag tggtgcgatc atacctcact gcagcctccc ctcctaggct caagcaatct 19021 tcctgcctcg gctttctgag tagctcgaac tacagacaca cactaccatg cctggctaat 19081 atttttattt tttgtagcaa agcggggggg gcggtctcat tatgttgccc tggctgggct 19141 tgaactcctg gcctcaagtg atcctcccat ctcaagcctc ccaaagtgct gggattatag 19201 gcgtgagccg ctgtgcccag cctcaaacac tttttctttc aacgacattg acctatagtg 19261 aatatcaaaa gcatgcactg aaactcaaga gctgcctcct ctcctgtgcc aggatgcact 19321 gtaacttttg tgagccctag gcatgtttga cttatatggg ttaagatatt caaatacggc 19381 taggcacggt ggctcacgcc tgtaatccta acactctggg agaccgaagt gggcagctca 19441 cctgaggtca ggtgtttgag gccagcctgg ccaacctggt gaaactccat ctctactaaa 19501 aatatagaaa ttacccaggc gtggtggcag gtccctgaag ctctggctac tcgggaggct 19561 gaggcaggag aatgccttga gtcagggagg cagaggttgc agtgagctga gattgcgcta 19621 ctgcactcca gcctgggtga tagagcgaga ctccacccca aaaaaaagaa aaaaaaaaag 19681 atattcaaat gtgtatttta caactgcttt ggcataaaga ctgactgata ccagtcaggt 19741 cagattcatt attacatatt cattactatt atactcctgt ttgactaggc atggtgggct 19801 cacatctgtg gtcccagccc tttgggaggc caaggcagga ggataacttg aggccaggag 19861 ttctagacca gcctgggcaa cataatgaga cccccccatt tctactaaaa ataaaaaaaa 19921 ttagccagtc gtggtggtat gcatctgtag tcccagctac atgggaggct gaggtggtag 19981 gattgcttga acccaggaga ttgaggctgc agtgagctat gattgcacca ctgcacttca 20041 gccaacagag caagaccctg tttcaaatgc atacatacat acatacatac atacatacat 20101 acatacatac atacatacac acacaaacca accaatacaa aaaatacata ttttttttct 20161 tctgattcta aaaaacataa aaacagccag gcacgatggc tcatgcctgt attcccagca 20221 ctttgggagg ccgaggtggg cagattacct gaggtcagga gttcaagacc agcctggcca 20281 acatggtgaa accccgtatc taccgaaaat acaaaaatta gctgggcgtg gtggcgcgtg 20341 tctgcaatcc cagctacttg ggaggctgag acagaagaat tgcttgaaac cgggaggtgg 20401 aggttgcagt gtcccccacc cgctcctcca ggggcgcccc cctcgagcca ccgcgccggc 20461 tgcccggcga gtgtcagcgc ctcctcacag gccccgcccc tcacgcccca gacggccaat 20521 gagagctgcg gccccggccg gcccctcccc gccctgggga accccggtcg cggattggcc 20581 ggcccgggcg cagtgtcccc cgcacgtggg gcgggggcgg ggtgagtgga gtggcagcga 20641 agccccgccc ccgccgcccc gcccccgccc ccgcctcctc cccggcccgc cccccctgcg 20701 gcgcccagtc cagcgcccgc cgcccgccac cccggacccc ggtgtctggc ttcccccgag 20761 ccgggacccc gcgatggcca agcgcagctc gctgtacatc cgcatcgtgg aggggaagaa 20821 ccttcccgcc aaggacatgt gagcgcggcc gggggtggga gccccaactt tccggggagg 20881 ggcactgccc cccaggacgc ctgacctttc cggggtccac ccacaagagc agcggagagg 20941 gtgcgggagt gcggccgtgg tgggggatcg gcggggaaaa tggggacagt gacattttct 21001 gcgagcacca gacggggtga caaccggggg gcggggaagt gtcaccgagg gctgggggct 21061 gggaactagg gagggggcga gaatttgtca gcgatccgcg gctcacgagg ggaaggggct 21121 ttgatacgac aggagctgga gcctggtgac atgccagggg agacgtcttg gtgatctgcc 21181 cccggggtgg ggttggaggg ggtggggtgt cccccgcaaa ggaagggtct tcctcagtag 21241 ttccagagcc cttaatgccc cggagtgtgc acctgaagca catgctgggg cccaggggct 21301 gtccccgggc ttcctgctgt gcactcgggt aggagggact ggcatgaaaa gtgagaacag 21361 gtcaggaggc caggacaggt gcaggactgt gggggaggga aggagggtag taactgtcgc 21421 ttgtgccttg ggctttacac ctgagccatc ccgttaatcc tctcaaccac cccgaaaggc 21481 agttctcacc gttccatttt acagatgaga aaactgaggt tcagagaagt gaaatgctgg 21541 ctccaagtgg caggcagagc agggtcttga gtctagggct tctcacaggc actgacatct 21601 ggggattccc ctggtccctg gctcgatttg gggaggtttc aggggcccct tgttccactc 21661 ttctcagtag gtcccagggc tgagctgcct gacccctggc ccaccatctt ggtcttgcca 21721 ccaccttcct catgtacagg gtttaggcag ctccctgctc ctagagcctc agtttcccct 21781 ctttaaacta ggggttgaaa tccctctaga ggaagagggg gaaagctcaa aggtcaaggg 21841 gcactggctg cctggaggag gagggtggga gggggaaaag tggagcctgg aggaaacaga 21901 gtccaaagtc tgcggagctg cagttcccca ccccttgggc gccccgggct cctggatgca 21961 gagaagtggc tagcatgggc gagggggtgg agcagggtgg ggggcaggca ctctgctcac 22021 acatatgtgt ggtggcccct gtgagcttcc cagcaagctc agcgcgccac caccgagagc 22081 agtgccagtt ccaggccccc tgggtctgat gaggcccagc tcagcagtta ccatgacaat 22141 ggggaaggga gggcggtggg catccctagg tgggtttctg tgcttggagg cttggaggga 22201 ggtggagtgg gagctggcag tgaggtctgt gggctgggaa ggcagaggag ctctgagaag 22261 ggtctcagcc tccaggccca gcctttaagc atcccccacc ccctgccacc actggcacta 22321 ggaactaggt catcttggcc tcttctggct ggtggcagct gggaggagca gcccctccgt 22381 ctgtcgcctc cctcctcatc cccacagcag ctggccctag tgcctggctc caggctctct 22441 ccagcccagg gcagccaagc tcttggagta tgcccacctg ggagcccagt gcctggttcc 22501 ctgctcagtg ctttgcctct gggcaaaggg gttccatcta cagtggaggg gtgccttggg 22561 atctcgatcc tcaccctgcc cccctgcctg ccctactcac gtctcctttg caacccccca 22621 tcttcccctg tggcttgcat acacagctct gacctgtcag ccgcctctta aaccatccca 22681 taaacagtcc ctcacatttg ggctactacc cttcacatct ctgctttcgc ccatgtgtgg 22741 ggacttctgc cttcaagcac tccacttcag tcctccagaa aactcctatt caatgttcag 22801 gactcctctc aaacttctcc tctccagggc attccctggt gcacttgggc agagttcctg 22861 gctcctgggc tcgcccccca gcactctggg tggcaggagg aagcccctgg tgtggcagtt 22921 cctccgtcaa gtggtgcctg tcccagctga ctgtgagggc agggactgtg actcattcat 22981 cctggcaccg ggtccatagt agggagaata agaggtgttg aggacttccc ccacttaatc 23041 ctcttcttag ctttataaga tagacatcct cggtttgcac attgggaaac tgaggctcag 23101 gaagtgaaag ccatttgcca agagggtagg tagagaagtc aggattcaga ttgcctttgc 23161 tgccgacagg ctgagaattc tagcctgccc gcggctgtaa gcttgcggct gggctcagtg 23221 agactgagcc tcggaaatcc tatgcaaagc atttggctgc agagtcgggc aggggtttgg 23281 agatcatggg aggccctcac tgagtactgg agtttcactg tgcctcacca gcaccccgcc 23341 tatcttctga gtagcatctc ctccagccag ggcttgcata cttctgagga cggggagctt 23401 accacctctc atagtggctc atcttgactg agacagttcc aaggaggaag ctgtcctcac 23461 acagctgcca gctgccttcc tgccacttct actcatgggt ttttattcct ctaaagggca 23521 cagagcgcag ctgcattccc catcactgct agctttcaga gcctgcctgt tcctgccctg 23581 cctcttccca ctgtctctcg tccccacgta cctaggcatg agccaagtct cacgatgacc 23641 agtgacccac tgcctcccac accaagtggc cttggggact ggcccaaggc tattttgggg 23701 ttttccctga tcctcctcct tccacagcct agaaactgcc aaggccctaa atatagaggc 23761 cccgaaagta gccagacagg ccagtggcaa ctggctcggg aaaggagaaa tgagagaggt 23821 gggcagccta ggggcctcaa cactagcgaa gaaggaactg ggcagggctt gggtcccagc 23881 agcccactgg ggaccatggt tgggcccagg tccttaaatc tggcctcggc ctgctctcaa 23941 taccgtcctc ttgccccacc tctgatccag ccacccggct cttcatagtc cttgcacacc 24001 aggactcggg tcctgcttcc aggcctttgg aagcagtagc agcccagtag tgggggacag 24061 cccacggagg tcccctgttg cctcagtcag tactgacctc tccctgcctc ctctgagctc 24121 cttgtatggc tcctggatcc tgcagcccag cccagcacct ggggtgttca gaggcctgat 24181 ttcttagcac ctggagtccc ctggctttgg gttgatggga tgcaagctgg ggtattcaca 24241 gggggctccc tggaggtggg ggggctcttt attttatttt atttatttct gagacagagt 24301 cttgctctgt tgcccaggct ggagtgcagt ggcgcgatct tggttcactg caacctcctc 24361 ctccggggtt caagggattc tcctacctca gcctccccag tagctgggat tacagttgta 24421 caccaccact acattcggct aatttttgta tttttcgtag agacggggtt tcaccatgtt 24481 ggccaggctg gtctggaact cctggcctca cgtgatctgc ccacctcggc ctcccaaagt 24541 gctgggatta caggtgtgag ccaccacacc tggccaggag gcgggggctc tcaaatgatg 24601 gcagggggag gttacagtca gagatgcagg cagtgctaac atttactgag cacctccaag 24661 accttctcta tcgtctcagg acccagggaa gctcccttgt gagccggggt tacacttccc 24721 actttatagg taggaaaact gagtcttggc agacatttcc tgtctgtgtc caggagcttc 24781 tggagctgtt ggccccaaat gccttcctct gccggaagag ccccttttga gaagggaagc 24841 agagcttgga attccccacc ctccaccata gaaacacact tcctttttat ggacactcac 24901 tggaggaaca gctctagaga cagccctcac ttccccactt ttacccagta ccagccctag 24961 gcctggctgc aggcctctcc cagccccagg agccccccaa cacctgagac ccttggaact 25021 gagcgcgtgt ttgccttccc ccatccctct ttccccagat agctgttaca tttattcatt 25081 cattcactca tcagttcatt cattcgtctg ttcattcggc aaacactcat gggctgcctg 25141 tcttgtgcca agccctgtgg gggccctggg gacacaggca tgctccccgc ccccaggtag 25201 ctcagagcag gtgggaggca gctgatcact accaggctgg atgaagtggg caggcagtgt 25261 gagtccaggg aggccttcct ccccgtcccg tcctggggcg agggagggct ctggcagggc 25321 tgaaggccga ctgggcagga gggacagcgg tgtcggtgtt ggggagagca ggagcaaagg 25381 ctccagggag agtaaggggc tcgactttgg tgtggagggt aggggtccca ggagcttgcc 25441 tggggctgtt gcttggagct gttctggggg gaggcaggga tggggtagga gggcatagaa 25501 aggaggaaca ttctatcccc tgcgggagga acaggaggtt tccagtaccc cctccctctg 25561 tccctctccc tcccctccct gtctttcccc tctccctctc tacccctccc tctcaccccc 25621 tccctcagtc tccccctccc tctttactcc ctccctctta ctctcctccc tctctccccc 25681 tccctctcac cccctcactc agtctccccc tccctcttac tctcctccct cttactctcc 25741 tccctctctc cccttctctc tctcccccct ctctcacccc ctcagtctcc ccctctgtct 25801 tactctcctt cctcttactc tcctccctct ctccccctcc tactctcctc cctctctccc 25861 cctccctctt actcttctcc ctctctcccc ctccctctca ccccttccct ctcaccccct 25921 cactcagtct ccccctccct cttaactccc tccctcttac tctcctccct ctctctcccc 25981 ctccctatca tcccccttcc ctcttaaccc ccacctctct cctcctccct ctcagccccc 26041 tccctctgtc tcctcttccc tctctctccc ctccctgtcc gtccccttcc ctttctgccc 26101 ctccctctgt ccctctccca gtctctcccc ttcctccttc cccctccttt cttctccttt 26161 ctctctctac ccatccctgt ctcctccccc tccttctccc tcccccacaa ctctccctcc 26221 atctactccc acccctgccc gcgaaagggg agccctggtg cgggttcctg cccttccctc 26281 accatccgct cccgggctga gggcggggac ccgtgtttgg ccggaggagc ctgcggtctc 26341 caaaccagga cgcggagact cagcctccgc tctcttcccc cgcactcctc ccgaggggcc 26401 aaaagagaga ggagcctgcc gcgagtcccc gccccgatcc cgccctccag ccccgcccct 26461 gcctatttgt cttgcagcac tggcagcagc gacccctact gcatcgtgaa ggtggacaat 26521 gagcccatca tcaggtaccg cccccacccc caggaccgag gggcgctcag cctctcatcg 26581 gcccgcgctc tccccgcaaa ggggtgtgaa ttttgttgtg aattttgtac ggagatggga 26641 gggggcttgg agcacctggt cccctgcctg ggattcccca cctcgccctg acaccccagc 26701 taggagacga gaaagacctt cagggaactg gcctggccaa gggcctctca ccggactctg 26761 ttcatctttg caatcatcac aacctccggg agagacagat gctgacgatt tgtaacaaca 26821 actctcattt gtggagccag cccttttcca gcccgtttag ttcttgcaat taacgagata 26881 cgatatcatt agaccccatt tacagacaag gaaactgagg ctcagggaga ctaactaccc 26941 tggccaaggt cacatgccta gtaaaagtgg cagagccaca ctggagccca agtctggcca 27001 ccccagagcc tgagctccct tcccctccct ggtcctggct ctaacttggg tcctgtcccc 27061 tgagagtccc atagtaatat ccctcgaggc tgggcaaccc cagcctgcac cctgaagagt 27121 cacttcctcc ggatgtccct tcctccatgg tccccatgat ctctgggcag ggtctttgct 27181 gtcctctccc ttccttcacc cagactcctg cttccaagtt catggccaga gctgccctac 27241 tgttctgccc acctgagtgc ccgtggccta ctcggagccc agtccttccc tacctgcaag 27301 caggctccca gcctttctcg agggtctgca aaccacagac agcaagaggg agaggggtgt 27361 gatggctctg atgctcttga tacagaaggg aaacagatga cctgtctgag aagtggttgg 27421 catgaggtgg tcccctgaac tttcatccgg gtccttgggg gcttcagcgt ctccccaccc 27481 gctagcccac tgagtcccat gagttgaagg ctctgatcca aagggggttg accagagctg 27541 ggtgaggatg tgaggccagg ctcaccagtc gccctgaggc agctaccctg tgacctgagg 27601 ccaatgcctg ccaactgtga tctcagtttc cccaaatgta aaacagctgg agaggctggg 27661 cacggtggct cacgcctgtc atcccagcac tttggaagtc cgaggtgggc agatcacctg 27721 aggtcaggag ttcgagagca gcctggccaa cagggtgaaa ccctgtctct acgaaaatga 27781 caaaattagc tgggcgtggt ggcaggcgtc tgtaatccca gctactcagg aagctgaggc 27841 agggagaatt gcttgaactt gggaggcaga ggttgcagtg agccgagatg acgccactgc 27901 actctagcct gggcgacaga gcgagactcc gtctcaaaaa aataaaataa aataaataac 27961 agccagagag aggcagccct gtgccccatc cagagcagag aggaatgtca gggggtagtg 28021 attggtggtg aacagtaggg tctggcacag ggaaaacaca tcagtggcag agccagttgt 28081 tataggggta ggggaagata caaggaaggt cccagcccag ggcctgggag tggcctgggc 28141 acatgctgag ggttctctcc tgcaggacag ccacagtgtg gaagaccctg tgccccttct 28201 ggggtgagga gtaccaagtg cacctgccgc ccaccttcca cgctgtggct ttctatgtca 28261 tggatgagga tgccctcagg tgagtgcccc cctctccagc tgggacccag acctggccat 28321 ctgattgctc cctggcccat ttcaccacca ggactcctgg gtcctttttg gcatcctctt 28381 tgcagcctgg agggaggcag agcctggggg cctgggaggg cgaaaagctt gagcgtgtgg 28441 gtgtgcacat gcgtggctcc atggtgcatg cgccacatac acacgtgtgt gtgcaggcat 28501 gcaggcacaa gtgtgcatgg actacacgtg tgcgtgcagg tgtgagctgt gagatgggca 28561 cccagagagt gtgagcttgg catgtgtggg catgtgagaa acctatcaca tccccctaga 28621 gggtccagaa cccacagcct acagaagggc cacaggtcca gctctgttgg gttactctgg 28681 aaatgacagc ggtgtccacc accctgctcc ccccgggggg gccctgaact tggtgggagg 28741 tcccagaggg cagatactga agccctgccc agctctgcct ccgtctccct cctctagccg 28801 ggacgacgtt atcggaaagg tctgccttac aagggacacc atagcctctc accctaaggg 28861 taagttctcc cttccctccc acactggtct gcccagtccc tggcctccct cccactcgga 28921 gacctcccct ctaggctccg tctggtctcc tgctcaggga aagccatttc tactctcccc 28981 agaagccggg gccacgttct gtactcctgg cctctgttct gcagcatgtt cccaagcctg 29041 gttgttactg cctcttccct agggtgaaga ggggctgcta tgggtggaat ctgaggcctc 29101 tgctggcaga agaaggggcc tccttacact ctattgctga agcataggga ccccttctcc 29161 aaatcaggcc agctccttct gtaacccatg gggtcgcctc catctgggcc ctagtactac 29221 tgtgtcctaa gtctgaaggg ttggcctaga gccagtccag gctggagatc cctttcaatt 29281 atttctggga tgcagacatt gttttgtgtt attgttttta caatttttat atagtttttt 29341 tttaaaaaaa aggccgggca cggtggctca cacctgtaat cccagcactt tgggaggccg 29401 agacgggtgg atcacgaggt caggagatcg agaccatcct ggctaacacg gtgaaacccc 29461 gtctctacta aaagtacaaa aaaattagct gggcgtggtg gcaggcacct gtagtcctag 29521 ctacttggca ggctgaagca ggagaatggc gtgaacctgg gaggcagagc ttgcagtaag 29581 ccgagatcgc gccactgcac tccagcctca gcaacggagc aagactccac ctcaaaaata 29641 aataaataaa taaatgaata aaaaataaaa ataaaaaata aaatagaggc cgggtgcagt 29701 ggctcacacc tataatccca gcactttggg aggccgaggc aggtggatca cctgaggtca 29761 ggagttcaag accagcctgg ccaacatggc aaaactctgt ctctactaaa aatacaaaaa 29821 attagctaga cgtcgtggtg ggcacctgta atcccagcta ctcaggaggc tgaggcagga 29881 gaatcgcttg aatccaggag gcggaggttg cagtgagtca agatcacgcc attgcactcc 29941 aacctggatg acagagcaag actccatctc caaaagaaaa acaaagatgg aggccgggca 30001 tggtggctca cacctgtaat cccagtggca attcacttga gattaggagt tcaagaccag 30061 cctcgtcaac atggtgaaac cctgtctcta ctaaaaatat aaaaattacc tgggcatggt 30121 ggcactcacc tgtaatccca gctactgggg aggctgagac aggaaaatct cttgaacctg 30181 ggaggcagag gttgcagtga gccaagattg cgccattgca ctccagccta ggtgacagag 30241 caagactctg tctcaaaaaa aaaaaaaaaa tggagatggg ggtctcactc tgttgcccag 30301 gctggtcttg aactcctggc tttgacagat cctcccgcct tggcctcccc aaggtgctag 30361 gattacaggt gtgaaccacc atgctctgcc cagacatgat tttgagggtt cagccctgtg 30421 ccagcctcac tgtgtgtgac cttgggcagc tcctgccttc tctgggcttc tctgagggtt 30481 caaaccagtt cttggatgct atgggctctg ccatcagagc ctttgtgtcc cgcttactgg 30541 cttcccccca cacacacctg ctcctgtctg tgacgctggg ccagggatgg gggcagggcg 30601 aaggcaggtt ctgctgactc atggggcttc ttggcccctg ccaggccaaa ggcctttccc 30661 acctgcagcc ttagccagcg tctctggggt ctagtggccc cagtgaggct attcctccgg 30721 gtgccttgta gcttccccat ccaggcctca gaccctcctc ccagcagccc taggggaggg 30781 ccactctctt cctgttttaa gatggggaat ctggctgggt acagcggccc acactcataa 30841 tcgcagcacc ttaggaggct gaggtgggag gatggcttga ggtcaggagt ttgagaccag 30901 cctgagcaac atagcaagac tccatctttt tttttttttt ttgagatggg gtcttgctct 30961 atcacccagt ctggagtgca atggcgcaat cttgtagctt ctccatctgg gcctcagacc 31021 ctcctcccag cagccctagg ggagggccac tctcttcccg ttttaagatg gggaatctgg 31081 ctgggtacag cggcccacgc ttataatccc agcaccttgg gaggctgagg tgggaggatt 31141 gcttgaggtc aggagtttga gaccagcctg agcaacatag caagactcca tctttttttt 31201 tttttttttt tttttttttt gagatggggt cttgctctgt cacccagtct ggagtgcaat 31261 ggcgcgatct gagttcaagc aattctcctg cctcagcctc ctgaggagct gggattacag 31321 gcatgcgcca ccacacctgg ctcacagggt gagaccctgt ctctaaaaac aacaatgaca 31381 aaaatgacac aatagtgagg gcacaggccc cagtcactga cttagattct aatcccagct 31441 cttccaactg ctgtttcttc acctgcggag cctcagtttc cccatctgtc attggggata 31501 ccagcccctg cttcacgggt gcctagggga ttcccagatt atggaggtgc ctcaaggtgc 31561 ccggtggggc tgagggtgga gggtatgggg gttcagggcc gggtccctgg ctgagctgac 31621 cccacaggtt tcagcgggtg ggcccacctg acggaggtcg accccgatga ggaggtgcag 31681 ggcgagatcc acctgcggct ggaagtgtgg ccaggggccc gggcctgccg gctacgctgc 31741 tctgtgctgg aggccaggtg agactcaggg gcctgggggc gggcagtggg tcccctgcaa 31801 ctagagaaac ccaatgagga agctgagccc cccctcgccc cacctctacc tcctggtccc 31861 agagctggcc acctcccatc aaagcctgct ctcaagagag ggtctcgcca ggcacggcgt 31921 ctcacacctg taatcccagc actttgggag gccgaggcag gtggttcacc tgaggccagg 31981 agttcaagac cagcctgacc accatggtga gaccctgtct ctactaaaaa tacaacaatt 32041 agccgggcat ggtggcaggc gcctgtaatc tcagctactc aggaggctga ggcaggagtc 32101 ttgaacccag gaggcagagg ttgcagtgag ccgagatggc gccactgcac tccagcctgg 32161 gtgacaagtg agactgtatc tcaaaaaaaa taaaaaagaa agaaaaaaga gagggtctgg 32221 ggaggttttc tggacttgaa gatgtttctt gggtgatttc cactcaaggg gatatgtccc 32281 ttaagggaca gtctaatgtt ctcatggagg aactggagcc atcacagagg agtggagtag 32341 ggggtacggg tgaggagacc ccgaactctg atcacacagc ctcagtcccc cagtgctaag 32401 gccggcttcc caatgtttaa cgccagtgtg aacttggcct ggtgggagtg tgtgtaagtg 32461 ggtccctcgt ggggctgggg gggtgaaaag agttgctgaa aatctcccga tgggcaaatt 32521 gagggttccc ccaaggaggg acagtggttt ggatggttcc atgggcctga gtcacctgtg 32581 acagggccac cccccaaccc ccagggtatt tttagctgca ggctgcactc ccgtctgggc 32641 ctgggtctga gtcacactct gtccgtgaga agcgtcttcc cttccggctc caaagccacc 32701 tccccaagtc gctccctgcc tgtgaggcag gagggtggcc cccagccagg cagccctatg 32761 gagccagtgt tccccccacc cttgaggggt cagcctgtct aagaggaaga cttcccttgg 32821 gcagaggggt gtgggtcgga tgatgcccat gttgctcact ctggcttccc gggagttatt 32881 ttctgtcctg ggaaaataga aatggatgga aaatgtctgg gcttgggccg agcctcagcc 32941 acacccccac actcgctcac tctctggcct ctgccattca ttcccccccg ccccacccca 33001 actcacccgc ttcctccatc cttatccttt ccaggcacca ggctgtctgg gggacaggca 33061 tgcacacgtg tgcacccgct cacacacgtg gccgggctct gggacctcgg ggccacttct 33121 cccaggcacc catcgtcacc cacggagatg ggaggcatgg aagcatgtct ccctggccct 33181 cccctctctc caggaatctc ccctgtccca tcctctgggg ccacagtggt tggccttcta 33241 gtgagtccta gcaggggaga ggaatgtcca ggcctctctc agagtgaggg gacttgtccc 33301 ccgttgtcct cggcaatgag actcctgctg caattccaag tcagcctaag aaggtccatt 33361 tgctgcggag aaagaaagaa catttcctca ttttttctgg ggagattctc aatatttcag 33421 taaaaccttg gggttttttt gttttttgtt tttgagacaa tctcgctcca tcatccaggc 33481 tggagtgcag tggcccgatc tcagctcact gcaacctcca ccttctgggt tcaagcaatt 33541 ctcctgcctc agccttccta gtagctggga ttacaggtgt gcaccaccac gcccggctaa 33601 ttttttgtat ttttagtaga gacgggactt cactatgttg gctaggctgg tctcaaactc 33661 ctgacctcag gtgatctgcc tgcctcggcc tcccaaagta ctgggattac aggcgtgagc 33721 cactgcaccc gggctgtttt tttttttttt tggtgggttt tttttttttt ttttgaggca 33781 gtctggctct gtcgcccagg ctggagtgca gtggcgtttt cttggctcac tgcaacctct 33841 gcctccccag gttcaagcaa ttctcctgcc tcagcctcct tcccaaagtg ctgggattat 33901 aggaatgagc cgctgcaccc agcctcaatg ttggggtttt attagacagt cttgaggggg 33961 aagaaaggca ggtatgagag gcttaaaata tcaaagatga gtgggctttg gtgttcttcc 34021 acaaagagtt ctaagtaggg gatactggca gggtgcagtg gctcactcca gtaatcccaa 34081 tgctttggga agcagaggtg ggaggatctc ttgaggccca gatttcgaga ccagcctggg 34141 caacagcaag atcctgtctc tacaaaaaaa tttaaaaatt agccaggtgt ggtggggcac 34201 acctatagtg ccaactactt gggaggctga cgtgggagga ttgcttgagc ccaggaggtg 34261 gaggctgcag tgggctatga ttgcatcact gtactccagc ctgagcaaca gagcaagacc 34321 ctgtttaaaa aaaaaaaaga aaagaataaa agggccaggc gcggtggctc acgcctgtaa 34381 tcccagcact ttgggaggcc aaggcaggcg gatcacctga ggtcaggagt tcgacggcag 34441 cctggccaac ctggtgaaac cccatctcta ctgaaaatac gaaaattagc tgggcgtggt 34501 ggcaggcacc tgtaatccca gctactcagg aggctgaggc aggagaatta cttgaaccca 34561 ggaggcggag gttgcagtga gctaagattg caccattaca ctccagcctg ggcgacagag 34621 cgagactccg tctcaaaaaa aaaaaaaagc tggaggtacc gtgactagac tagggctgtg 34681 taaagaccca tctatggaga aagggcaggg ccgcccatca gggctgcacc tcactcctcc 34741 accaagtaga gcgacattac tggcagacct tccagaagca cactctctcc ctaaacctga 34801 gcccagtagc gattcacttt tccattttcc tgatgaggaa accgaggctt agggcagccc 34861 tgggaatgga gggccaggaa cattgccccg gcggttccct ggcagcactc acaccacccc 34921 cctgccaccc ccagagcaag gtgcagccac tggcctgcag ctgcctggtg ggaaggaccc 34981 atggcctcta ccttgggggg ttcttggcca catgggcgat cctggggaga tggctatggg 35041 agcagggccc agaaagggca gaggcgcact gtccacagtc agcttgccag ggacggaatg 35101 tgcccaggcc ttttggcccg ctgtggttac ataatggtgg tgacaagtgg gggagggggg 35161 gctcagggac aagtcgatga tccaatcttc ctccaccccc accagggatc tggccccaaa 35221 ggaccgcaat ggcacatctg accccttcgt ccgagtgcgc tacaagggcc ggacacggga 35281 gacctcggtg aggaggccag agggcaggga aggggcaggg ggctcggggg acgtgccagc 35341 cacgccccca ttgctcacct caccacctgc tcaccatccg ctggtgggac agattggggg 35401 ccacagtctt acagaggagg cccagggaaa ggggcgacct ctctgagggg acataatagg 35461 tcccaggacc tctggggcca acttcgctca ctcccggcca tgcttggggg taaggagggg 35521 attgagacag ccggagtggg gtttatttgg gaacacccat gctgccttcc ccacccccag 35581 gaaatgtcca gagtcgaggg cccagtcctg ggcacgaaga caccaaggtc tgctttctca 35641 gtatagtcgc tactgttttt gttttgtgtt tcgtggtgtt tttttttttt gaggccaagt 35701 cccaccctgt cacccaggct ggagtgcagt ggctggatct cagctcactg caggctctgc 35761 cttcccagtt cgagagattc tcctgcctca gccacacgag tagctaggat tacaggcaca 35821 caccaccaca ccaggctaat ttcttgtatt tttagtagag atggggtgtc accatgttgg 35881 ccaggctggt ctcgaactcc tggcctcaag tgatcctccc gcctcggctt cccaaagtgc 35941 tgggattaca ggcatgagcc accacgccca gccctgtgtt ttttgtttgt tttgctctgt 36001 ttttaaatat tttatttcca caggttattg gggaacaggt ggtgtttggt tacataagtt 36061 cttagtggtg atttatgaga ttttggttca cccatcaccc gagcagtata cactgcaccc 36121 aatttgtagt cttttatccc tcaccccctt cccacccttc ccccccgagt ccccaaagtc 36181 cactgtgtca ttgttatgcc tttgcgtcct catagcttag ctcccacata tgagtgagaa 36241 cataacgatg tttggttttc cattcctgag ttactacact tagaataata gtctccaatc 36301 caatccaggt tgctgtgaat gccattaatt cattcctttt ttatggctga gtagcattcc 36361 atcgtatata tataccacag tttctttatc cacttgttga ttgatggaca tttgggctgg 36421 tttcacattt ttgcaattgc gaattgtgct gctataaaca tgcgtctgca agtatcttgg 36481 ccaggcacag tggctcacac ctgtaatccc agcactgtgg gaggccaagg ctggcagatt 36541 gcttgaggcc aggagtttga gaccagcctg gccaacatgg tgaaaccccg tctctactaa 36601 aaatacaaaa attagccggg cgtagcggca ggcccctgaa atcccagcta cttgggaggc 36661 tgaggcagga gaatcacttg aacccagaag gcggaggttg cagtgagctg agatcgtgcc 36721 actgcactcc agcctgggcg acagagcaag actgtttcaa aaacaaaaca aaacaaacaa 36781 aacaaaaaca tatgtgtgca agtatctttt ttgtataatg acttctcctc tgggtagata 36841 cccagtagtg ggattgctga atcaaatggt agttctactt ttagttagtt aaggaatctc 36901 tgctgttttc catagtggtt gtactagttc acattcccac cagcagtgta gaggtattcc 36961 ctgttcacca catcctcata gtcagtactg ttttgctgtg tggccttgga tacatctttt 37021 ccccaccctg ggcctcagtt ctccctatct cagaaatggg tgggagcttg gacaggacat 37081 tttaacctct gctgtggcag aacctccaag gcgaggaggc tgtggggaac ccaggggagg 37141 tgtctagggc agctctgcct tttgcccagc ctggggagtc cagtggcatc agcctaaccc 37201 cctcccaccc tcctctgggc agatcgtgaa gaagtcatgc tacccacgct ggaatgagac 37261 gtttgaattt gagctgcagg agggggccat ggaggcgctg tgcgtggagg cctgggactg 37321 ggacctcgtc agccgaaacg acttcctggg caaagtgagc accaccgccc cgccccccct 37381 cctcccccgg caggctgcac ctgctgtccc ccagcacctg gttccattgc agccatccca 37441 tcctcaggga cctccctccg gcttccccaa agagggacac tcaggaagcc tgggactacc 37501 cccccacctc caccgtctgc cagaaccagc aacttccccc caccaccact ggcactaagg 37561 gacccttgga aggggcccct ccccactaag aacagtagtt ggcaaggagg agacgtaagt 37621 ctcccagctc caggaaaact gccactatgt ccccaggtgg tgattgatgt ccagagactg 37681 cgggtggtgc agcaggagga gggctggttc cggctgcagc ccgaccagtc caagagccgg 37741 cggcatgacg agtaagtgca tggagctggg cagccggcct caaggggtgg ggtgggggct 37801 gacatttggg aattggctac aggcaggcag ggcagggggg ctgctcaggg tgagaggtca 37861 ggagacacag ctccagggtc ccctcagccc ctgcccacct agccagcatc cttctcatcc 37921 tggctgggct ccacctcccc tttatttatt atttatttat ttatttattt atttattttg 37981 agatggagtc tcactctgtc cccaggctgg agtgcaatgg cgtgatctcg actcactgca 38041 ggctccgcct cccgggttca cgccattctc ctgcctcagc ctcccaagta gctgggacta 38101 caggcacccg ccaccgcgcc tggctaattt tttgcatttt tagtagagac agggtttcac 38161 catgttagcc aggatggtct caatctcctg accttgtgat ccacccgtct cggcctcccc 38221 aagtactggg attacaggca tgagccaccg cgcccggccc ctccccttta tttttcccca 38281 cagttctgca ttcagctcag ctctgtcttc tgtctttttt ttgagatgga gtcttgctct 38341 gtcacccagg ctggagtgca gcggcacaat cttggctcac tgcaacctcc acctcctggg 38401 ttccagtgat tctcctgcct caacctcctg attagctggg attacaactg tgtaccacca 38461 tgctcggcta atttttgtat ttttagtaga gacagggttt caccatgttg gccaagttgg 38521 tctcgaacac ctgacctcaa gtgatccacc cgcctcagcc tgccaaagtg ctaggattac 38581 aggcatgggc caccgcaccc ggccaggtct gtcttcttag ctcacaggtt gtacaggtct 38641 cctgctttct gcctctggaa gtttctgaaa caagtactca gacattttcc ctttgaaaat 38701 ttccctgcag acctcagacc agggttcaag tctccaaggt cagtgtccct gagccatagc 38761 tccaactccc tatgccagat tctccaggaa tagggttgtg agataagata ggacaccatt 38821 ggccaggagc ggtggctcat gcctgtaatc ccagctcttt gggaggccaa ggtgggagga 38881 tcacttgggg caaggagttt gagaccagcc tgggcaagaa agtgagactc cgtctctaca 38941 aagaaataaa aattaggcca ggcgcggtgg ctcacgcctg taatcccagc actttgggag 39001 gtcgaggtgg gcggatcacc tgaggtcagg agtttgagac cagcctggcc agatggcggg 39061 cacctgtaat cccagctact cgggaggctg aggcaggcag aattgcttga accccggagg 39121 cagaggttgt agtgagccaa gattgcgcca ttgcactcta gcctgggtga caaagctaga 39181 ctctgtctca aaaaaaaaaa gaacacccag ttaaaactta atttcagata aaagtgaata 39241 tatatatata tatttttttt ttttgagcag aatctcactc tgtcacctag gctggatgga 39301 gtgcagtggc acggtcttgg ctcactgcaa cctccgcctc ccaggttcaa gcaattctca 39361 tgcctcagcc tcccgagtag ctgggattac aggcatgcac ctgtaccacc agtacagtgg 39421 tgatcatagc tcactgtagc ctccacctcc tgggtgcaag caatcctccc atctcaacgt 39481 tccaggtagc taggaccacc tcacctggct aaatttttgt agttttagta gagatggggt 39541 ttcaccgtga tggccgggct tgatgttgaa ctcctgacct gtgcccggct gtgaatacat 39601 ttttagtata aaaataaccg ctgggcgtgg tgactcacac ctgtaatccc aacactttgg 39661 gagtttgagg tgggtggatc acttgaggtc acgagttcga gaccagcctg gccaacatgg 39721 tgaaacaccg tctctactaa aagtacaaaa aatggtggtg ggcacctgta atccaagcta 39781 ctgaagcagg ataattgctg gaacccagga gacggaggtt gcagtgagcc gagatcccac 39841 cactgcactc cagcctcggt gacaagagcg aaactccgtc tcaagaaaac agccgggtgc 39901 ggtggctcat gcctgtaatc tcagcacttg ggtggccgag gcgggtggat catgagatca 39961 ggagttcaag accagcctgg ccaacatggt gaaatcccgt ctgtactaaa aatacaaaaa 40021 ttacccggat gtggtggcgg gcacctctaa tctcagctac ttgggaggct gaggatggag 40081 aattgcttga acctgggagg tggaggttgc agtgagccga gatcgcgcca ctgcactcca 40141 gcctgggtga cagagcagga ctccatctca gcaaaaaaaa taaataaata aagaagaaga 40201 atgaccatgc actatttgag acatacctat actaaaaata ttcaaattta ggccgggtat 40261 ggtggcgcat gcctgtaatc tcagcacttt gggaggccaa ggcaggagga tcacttgagc 40321 ccaggagttc aagaccagcc tgggcaacat agcaagaccc catctctaaa aaaaatagtt 40381 aataagtaaa taaaatattc acatttaact gggtgtcttg ggtcgttttg ctaaatctgg 40441 cagcctgtct aggaagccct ctggcttgtt tgctgtggcc cacagccctc agtggttcag 40501 ctttgtgtct ctgtggaagg ccccagaccc ccattccccc agttcccatc cttaacaggg 40561 taggagcagg tggacggctg tgtgtccagc acaaggtgtt tccatagaac cagccaaggc 40621 ccatcttgcc atcttatttt ttatttattt attttttttg agacagagtc aggctggagt 40681 gcagtggtgc aatcctggct cactgcaacc tctgcttcct gggttgaagc gcttctcctg 40741 cctcagtctc ccgagtagct gcgattacag gcacctgcca caatgcctgg ctaattttta 40801 tatatatatt ttaatagaga tggggttttg ccgtgtcagc caggctggtc ttgaactcat 40861 gacctcaagt gatctgcctg ccttggcctc ccaaagtgct gagattacag gcatgagcca 40921 ctgcgccagg cccaggggcc catcttaaca ctgctctgtg catgaggctt gtgctgagca 40981 ggggcagtgg gagccttaga ggttcccact tggtccacac cctcctctgt tctgaacttc 41041 cctatcagaa cagggaagct gggaagctca atctaatgtg gcaagttgct gaactggctc 41101 gggtctcagt ttccctatta ggttgttatg agggttgggg atttgcttct gctgtaaagt 41161 gtacccatga tggtgccatt atctgggagt tgatcgattc aaggctggca gctccacata 41221 cccaagcttg gaggtggcag ggagccagac ctcgaagaca gatagaggtg ccttgagtgc 41281 agagacagga agggctttcc aaaaacagaa aataaatagc ccgggcagag cctgggaggc 41341 agagctatgc agggcacccg aatggagggc atgagcagag agatgggaag ccggccccaa 41401 gggaaaaaac tggaggccat gggagtggtg tggtccaggg tggcctgagt ccctggtgga 41461 gaggtcacag gctgcccatt ccaggggcaa cctgggctcc ttgcagctgg aggtgcggct 41521 gcgggacgag acggtgctgc cctccagcta ctaccagcca ctggtgcacc tgctgtgcca 41581 cgaggtcaag ctgggcatgc aggtgagggg gcctgggcag ggtgggaggg tccagtggca 41641 gtaggcctgt gcgcctgtcc ttctaaaccc attctgcaga gcagaaacct gaggtctggg 41701 gatgtgagga gctggccagt gtccagggcc ccgtccccag ccaccccttg ggaaaatgtc 41761 ctctttccct gggatgctct gaggtcctca gagggaaagg ttcgggatcc cccgtgcctc 41821 tccctgcagg gcccagggca gctgatccca ctcatcgagg agacaaccag caccgagtgt 41881 cgccaggacg tggccacgaa cctgctcaag ctcttcctgg ggcaggggct ggccaaggac 41941 ttcctggacc tgctcttcca gctggagctg agtcgcacca gtgaggcctg ggaaggagct 42001 gggcccagaa cactgggagg tgtttggggg tgggtgggct cctcctgtag gaggcagatg 42061 gggtcagaga ggagggaggc aaggaacaga gccaagggca gaggtgggac agtgggctgg 42121 gaggtggcga ggaggccctt tattgagggg gcagcttaga tcccccagac ccaaggtcct 42181 ctgtcctccc cctcccccaa aagctttgtg ccagccccct acccactggg acctctgttt 42241 gtaggtgaga ccaacaccct gttccggagc aactctctgg cctcaaagtc catggagtct 42301 tttctgaagg tgaggttgca tgctatattg tcctcatctt gcaaatgaca acactgagct 42361 tcagagagag cgacagaggg ggctgggact tcgactccag tgttctatct ccatattcag 42421 agccctttaa ggttaagaat gtggctgggc atagtggctc atgcctgtaa ttgcagcact 42481 ttgggaggct gaggcgggcg gatcacttaa ggtcaggagt tcgagaccag cctggccaac 42541 atggcgaaac cccatctcta ctaaaaatac aaaaattagc ctggcatggt ggtgggcacc 42601 tgtaatccca gctactgggg aggctgaggc aggagactca cttgaaccag gagccagagg 42661 ttgtagtgag ctcacgccac tgcactctag cctgggcccc gtgctagact ccgtgccccg 42721 cacccccccg ccaaagagaa tgtgtgtgtg attaaaattt aagttagaaa cagtctcact 42781 atgttgccca ggctggtctc aaactcctgg cctcaagtga tcctcctgcc ttggcctccc 42841 aaagtgctag gattacaagg gctaagaatt ctagtcattc attcactcac tcattcattc 42901 attcatgaat aaacagtttg ccctgcatgt gtacctggcc ccctcctggg cggatacggc 42961 aggcaaacgg tgcaagggcg gctgccggga gaaggtgggc agctggagtg ggactggggg 43021 agacaggatc aatgtgacct gcggtggtcc ccaggtggcc gggatgcagt acctgcacgg 43081 cgtcctgggc cccatcatca acaaggtgtt tgaggagaag aagtacgtgg agctggaccc 43141 cagcaaagtg gaagttaagg atgtagggtg aggccggggg taactccggg ggttgcgggg 43201 tgcagcggca gcgggttggg atcaggccct gtcagcatgt gtgtttgtgc ttctgcccac 43261 ccgtgtattg tcccctgtgt ccgtgtcctg gctgctgtga gagccactgt tcctgtcgtg 43321 gccctggcgc tgaccgcgac ctcctctgcc aacccgcccc gttccacgca ggtgctccgg 43381 gctgcaccgc ccgcagaccg aggccgaggt gctggagcag agcgcgcaga cgctgcgcgc 43441 ccacctgggg gccctgctga gcgcgctcag ccgctcggtt cgcgcgtgcc ccgccgtggt 43501 gcgcgccacc ttccgccagc tcttccggcg cgtgcgcgag cgcttccccg gcgcccagca 43561 cgaggtgcgc cctccacccg ctgggctagg ctgggccggg ttggactggg ctggggcgag 43621 gggtgaagag cggtggggaa ccagggagcg ctggggagga ccagggaggc tgtgtcccag 43681 cgagggtcag ggacggagcc taaggggagg gggctgccag gagccctggt ctgggggact 43741 ccagcccgcg ctcttccgca cccctccaga atgtaccgtt catcgccgtc accagcttcc 43801 tgtgcctgcg cttcttctct cccgccatca tgtcgcccaa gctcttccac ctgcgggagc 43861 gccacgcgga cgcccgcacc agccgcaccc tgctcctgtt ggccaaggtg cgggccttgc 43921 ggccacgggc gggatgcaca gctgggtgtc ctaccggggc cctggcctgg actgcagagt 43981 gttccctacc ggctccggcg aggccactgg cacgcccacg taggcactgc acttcacagt 44041 ttgcaaagcc ttcctgtggc caccttgtcc gagctccatc tcatggataa gggaacaggc 44101 ctgcgagggg tcgggggttg ccaaggccgc ccatcctgtc ggtggtgaat cgcgggacct 44161 tccaacccgg gtctgcttac tctcaggctg cacctatgtg cccggcaccg gggtggggta 44221 tccagtgcag cagcgtcagt taaacctaca gaggaaggcc caagtggtct cagttatcat 44281 aaggcatcaa acttcatgat cctctcctcc ctgatcccta gtggggctgg agggggcccg 44341 tgggcatggc ctctgtcgcc tctgcctcca tcgctacagc tgagtagcaa agacttgggg 44401 agggggggtg ggagggggtg cagtggctag gcccctaacc tagcagaacc tggtccttcc 44461 tgctgtcacc tccaggcagt ccagaacgtg ggcaacatgg acacgccggc ttccagggcc 44521 aaggaggctt ggatggagcc gctgcagccc accgtgcgcc agggcgtggc gcagctgaag 44581 gacttcatca ccaagctcgt ggacatcgag gagaaggacg gtgagcgccg ccctgcacag 44641 acccgcccac agcctactgc tgacgtttgc tccgcccttg agccccacgg ccccacccat 44701 cgctgtttgt tcctctcgtg gccccaccca ccgcctgctg ttggcatttg ccaggccccc 44761 aggccataga cccacgcacc actgtttgtg cttcctgtga ccctgcccac tgcctgccgt 44821 tgacgtttgc cccaccccta gacccatggc ctcacccccc catcatctgt gcctgttgac 44881 atttgctcca tccttgagct ccatagcccc gccgccacct gctactgacg cgtgtcccgc 44941 ctccaagccc atggccctgc cttcctcagt ccacctgtca ccccgctcac caccagctgt 45001 tgatgtttgc tccgcccttg agccccatga ctacgtccac tgcctgctgt tcagatttgc 45061 tcagcccttg agccccatgg cccctcccac tgctccgctc cccccagtcc cgccctcatt 45121 cactgggctc tctattcccc tccctccctg gcctggcaga gctggacctg cagcggacgc 45181 tgagtttgca ggcgccacct gtgaaggagg ggccactctt catccacagg accaagggca 45241 agggccccct catgtcctcc tccttcaaga agctctactt ctccctcact accgaggccc 45301 tcagcttcgc gaagacgccc agctccaagg tgggtaagga gggaggccgg caagtggggc 45361 tctgagaggc cggcaagtgg ggcacctgtc cccttccttg gagcccactt gaggtacccc 45421 cggggtcaga gaagaaagcc aggtgtgtgg catctgtact gcttgtcaga acctaggagg 45481 gccgggcacg gtggctcacg cctgtaatcc caggactttg ggaggccaag gcgggtggat 45541 cacttgaggt taggagtttg agaccagcct ggccaacatg gagaaacccc gtctctacta 45601 aaaatacaaa aaatcagccg ggtgtggtgg tgggtgcctg tagtcccagc tactcgggag 45661 gctgaggcac gggaatcgct tgaacctggg aggcagaagt tgcagtgagc cgagatcgtg 45721 ccactgcact ccaacctggg caacagagca agactccgtc tcaaaaaaaa aaaaaaatga 45781 atatggagga agttgatccg accttaccag gttgttgagg attaaatgag atgatctccc 45841 taagggtcct gacccagggc ctggcaggta gtactattta ggcaaatgat attattaaag 45901 atagtataac acctgggctg ctagatgagt aggaggctag ggcaaacagg ggaggcttcc 45961 tggaagaggc aggaatatag aagcacggaa tcttggcgtg ggaagggatc tcaaagatca 46021 ccttgtacag ccctctctga gtcatgggtc acctctcctg caagctctct tctcccagga 46081 agggctgagg acatacttcc tccatggacc tggtgactgc ttgctgtaca cctgccccca 46141 gtcatgcagg ctatggggat gatcttccag gagtacccag cctagccctg ggcaggtgag 46201 agcgggctag gagtaggtga aggagctcat cacccaccct gttctcaacc agctcagggg 46261 tcccaccaga tgaggggccc gccccttccg accaaagacc cctcagatga tgataattat 46321 tttttgagac agggtcttgc tctgtcaccc aggctgtagt gcagtggtgc gatcatagct 46381 gaccgcagcc tcaaactcct gggctcaagc aatcttccta cctcagcctc ccgagtagct 46441 gggactacag gcacgtgcca ccaagcccag ctaattttct ttaacttgtt tttttgcgga 46501 gatgggggtc ttgctatgct gcccaggctg ttctcaaact cctggcctca ggcaatcctc 46561 ctgccccagc ctcccaaagc actcagatga ttattataca tcttccttgc ccgggttgaa 46621 agatttgact acatgggcat tatttaattc tggtcccatt tttttttttt ttttttgaga 46681 tggagtttcg cttttgttgc ccaggccaga gtgcaatggt gcgatcttgg ctcaccgcaa 46741 cctccgcctc ccgggttcaa gcgattctcc tgcctcagcc tcccaagtag ctgggattac 46801 aggcatgcgc caccatgccc ggctaatttt gtatttttag tagacacgga gtttctccat 46861 gtttgtcagg ctggtctcga actcccaacc tcaggtgatc cgccttcctc ggcctcccaa 46921 aatgctggga ttacatgtgt gagccactgc gcccggtcag gatatgacta ttagagcctg 46981 ggactacaaa agcaagttgg aaaattaata aatgcaaagt cacaaagcct tggccacggc 47041 cggattgact ttgtagccca gggaatgctg gctgtcccga cactacccca aagctcacac 47101 ctccgaggcc ttctgggagt gccccccaga gaggccaggt cctcctctcc cagttgctgg 47161 gcacttggtc accatcagtc tgtgctctgc cctggctgag gtcactctag gtctcccatg 47221 cctacccctg gtgctgcaga aggggagact gaggcccaga gatggggaat ggaatgcccc 47281 tggctaggtc cctgagcctc ctcctccctg cctgccccag aaaagcgccc tcatcaagtt 47341 agccaacatc cgggcagcgg aaaaggttga ggaaaagagc tttggcggct cgcacgtcat 47401 gcaggtcatc tacacggacg acgccggcag gccccagact gcctacctgc agtgcaaggt 47461 gcggggaggg gagcgggtgc tgtgggagtg gcctgaacac tgtgggagtg gcttgggtgc 47521 tgtgggagag gcctgggtac tgtgggagtg gcatgagtcc tgtgggagtg gcctgggtat 47581 tgtgggagag gcctgagctc tatgggagtg gcctgggtgc tgtgggagtg ccctggatgc 47641 tatgggaatg ccctgggtgc tgtgggaatg gcctgggtgc tatgggagtg gcctgagctc 47701 tgtgggagtg gcctggctgc tgtgggaggg gcctgagtgg actggatgag aaaagagttg 47761 attgagcctc tcgccccgct accagctaca tcaggcaccg cagggtgtgt ctaatctggg 47821 aggtgtcagg gacaggtgtg tgtgtgttgt ccaggggcct gagccccagg aggccatggc 47881 ccatgtcttt ccctgggcat ccatgtgtag gtctgtgtgt gggccagtgc acagacaggt 47941 gcatccccgt gatgtgaaat ttgcacgttg cacccgtttg ttggaaacta tttccagata 48001 actgagatgg gggccaggca cgctggttca cacggataat cccagcattc tagccagggc 48061 aacagagcaa gaccctttct ctaaaacaga aagtctgagc tggcacctgt gtggttctgt 48121 gtctgtatca atgcctaata atcaaacatt atgctgtgtg tggctctggg tgcatgttct 48181 caagtaacct tctcatctgt tccttcatta actcattcat tcatttattt aaaaaatact 48241 tactgagcac ctcctaggtt ccaggcatga ttctaggtga ggggatacag tggcagatga 48301 ggcagagtcc tgccctcatg gagctgatat ccctgggaga agatgggcca ataaatacat 48361 gtttcaaatg tcggctcatg cctgtagtcc cagtactttg ggaggctgag gtgggaggat 48421 cactggaggc caggagtttg agaacagcct ggccaatgta gcaagacccc atctctacaa 48481 aaagaaaatt agccagccat gatggcatgc acctgtgatt ccagctactt gggaggctga 48541 ggcaggaaga tagcctgaac ccaggagttg gaggctgcag tgaactatga tcacaccact 48601 gcactccagc ctgggctaca gagtgagacc ctgtctctta aaaaaaaaaa aaaaaaaaaa 48661 aagcaaattg tgatagtgac ttgaagggaa ccagaaggga ctaagggagc aagggaggga 48721 agagagtggg gtatagcagg gagcggtctg tcctattaag agtggtcagg gagagtacaa 48781 aaagtagccg ggcgtggtgg cgcgtgcctg tagtcccagc tacttggggg actgaggcag 48841 gagaatcgct tgaacctggg aggtggaggt tgcagtgagc tgagatcgag ccactgcact 48901 ccagcctggt gacagagcaa gactccatct caaaataaat aaataaataa attagtaaat 48961 taattaaaga gtggtcaggg aggacctctg atgggtggca ttcgagcagg gaccttgtag 49021 gagggtgacc cagatgtgta tctagggcag ggacatgtgg gacaagtaag aagcctgcag 49081 ctggaggggg gttggtttct tccgggcttg accgtcctgg cctgtgggga ggaagtcaca 49141 gacctgtccc tctgtgccct tccctctagg agttcccttt tgtgtcaggg tccagagcca 49201 ctgggaaaag tagtgatatc agggccacca aatatatttg gcgggttctg gaatacctac 49261 gagcgtggcc aaggggccag cggcggctga aatccagccc acaccagcct cctgggctcc 49321 taccatcccg gtgccttccg tggggacaag tggagctgct gccaccaaaa agacgagaca 49381 ggtgggagag gagaggatga agtcttgctg tgttgcccag gctggtcttg aattcctggg 49441 ctcaagcgat cctcccacct cggcccccta taatcccagc tgtagtccca gcactttggg 49501 aggctgaggt gggaggatca ctggaggcca ggagtttgag aacagcctgg ccaatgtagc 49561 aagaccccat ctctaaaaaa aagaaattag ccagccatga tggcattata ggagtgagcc 49621 actgggatta taggagtgag ccactgcacc tgctctccag gccatgcgcc cgggcatggg 49681 tcatgttcca ctggaggaag ggcagcttct cctaattcac tcaaagctgc tgggttggta 49741 tggtggggaa catggcgtct ggcgtggtac aggagctccg ggttaggagg gcttcctgga 49801 ggaggtgacg tctgaacagg cctccccctc cgtgtcccct gcagtgtgtg aatgagctta 49861 accagtggct gtctgcgctg cggaaggtga gcatcaacaa caccggactg ctgggctcct 49921 accaccctgg cgtcttccgt ggggacaagt ggagctgctg ccaccaaaaa gagaagacag 49981 gtgggagagg aggcctgggt cccgggctct gcatccgtcc actgcgagat cgggcctctg 50041 gctgaacgat gggaccctga gccctgctgt gcacctgcag ggacctaacc caaggaatgc 50101 tggcccctcc aggagctctg ggcagaaagg gtcccgtgga ccaggtggtt tgggaaatgc 50161 cacgtgatgt gtgtgatgca tatgagtgaa ttcgctgaag gctctaagga ggcccgaaaa 50221 cacgttagaa tcccatttct ttttatttct tcttcttctt tttttttttt ttttcagaca 50281 cagactttca ctctgtcccc cagactggag tgcaatggca cgatctcagc tcgctgcaat 50341 ctccacctcc cagggtcaag caattctcct gcctcaacct ccccagtagc tgggatgacc 50401 ggcactcgcc accaagccta gctaattttt ttgtattttg actagagatg gggtttcacc 50461 atgttagtca agctgctctt gttttgttgt tgttgttgtt gttgttgttg tttgatactg 50521 agtctcgctc tatcatccag gctggagtgc agtggtgcaa tctcggctca ctgcaatctc 50581 cgtttcccag gctcaagtga ttctcctgcc tcagcctccc aagtactaca ggcgcaagcc 50641 actgcaccca gctaagtttt gtattttcag taaagacggg gttttaccat gttggccagg 50701 ctggtcttga actcccgacc tcgtgatctg ccctcctcgg cctcccaaag tgctgggatt 50761 acaggtgtga gccaccacgc ccggcctaga attccatttc taacgtggtt taacccagca 50821 cttcccaaac tcttttgaca gggaaccctt ctcttcccca aggagcttct tacagaacag 50881 gtgttctcag gaacacattc gaggagcccc ctgctaatga gagccagaga gcccctgggt 50941 tcgcacactc caatgctggc catctgaatc cggagctgtg ggtgggaggg gaccctgagc 51001 cttgggcact agagggatgg catagaggag cttctagaag gactccatac tgtgtagtac 51061 tctaggttct agcacctgtg gttcttgagc acttgaaatg tggcttgagg agctgagggg 51121 ttttttttgt ttaatttttt aaaacagggt cttgctctgt tgcccaggct ggagtgcagt 51181 ggcacgatca tagctcactg cagcctccaa ctcctgggct caagcaatcc tcctgcctca 51241 gcctctgagt aggctggcac taccggcatg catcaccaca cctggctcat ttaaaaaaaa 51301 ttgttttaga ggtggggtct tgctatgttg ttcaggctgg tcttgaattc ccaggctcaa 51361 gcaatcctcc tgcctcggcc tcccaaagtg ctgagattgt aggagtgagc caccacatct 51421 ggcctgagtt tcacattttt aaaaatgttg ttttggccgg gtgtggtggc tcacgcctgt 51481 aatcctagca ctttgggagg ccaaggtagg cggattgcct gagctcagga gttagagacg 51541 agcctaggca acatggtgaa accccatctc tgctaaaata caaaaaaaaa attagccagg 51601 catggcggcg tgggcctgta gtcccagcta ctcaggaggc tgaggcagga gaactgcttg 51661 aacctgggag gcagaggttg cagtaagcca agatggtgcc actgcactcc agcctggcga 51721 cagagcgaga ctccatctca aaaaaaaaaa aacaaaaaaa caaaaaaaaa aacagaaaag 51781 aaacaaaaaa cgttgtttta attttaatta actcaaatag cttcatgtgg ctagctgccg 51841 ccctgtagaa cagcacagtt ctagaacttt cgagaccttc tccctgttat ccacacttac 51901 tttacagagt agactcagca cttcgagtcc cctgtccttc aggccaggcc aaaccttggt 51961 ccccagagcc cagtgtggca gaggccatcg aaaactgacc cacgcactct agcccagccc 52021 tggatttaca gccaagcact gtatagggat gggtgactct tttgtttttg tttttgtttt 52081 gagttgggtc tctcactctc tcacccaggc tggagtgcag tggcataatc atagctcact 52141 gtagccttga cctcctgggc tcaagccatc ctcctgcctc agcctcctgc agaactggga 52201 ctacaggcac atgccaccac acccagctat tttttatttt atttttttgt agagtcaggg 52261 tctcactatg ttgcccagac tggtcttgaa ctcctggcct caagctatct tcctgcctca 52321 gcctcccaaa gtgctgggat tacaggtgtg agccactgtg cctggcctct tggtgactct 52381 ttgcaagggc attgctggct ggctgatatg gcctgcagcc tctgcctgta accatcagag 52441 cgatactctc attatcggca aggtgggacc caccctggcc caagagacag ggcctgttat 52501 tccactgtat ggaggagaag ctgaggctta gggaaggcag atgacttggc aaggtcataa 52561 agacagcaag ctgcaggacc agctcattct aaggcatgaa ccccctgtgg cccacctcac 52621 catgatgtta acatttcagc ctgctccctt ccaggcagac agtcttccag aaagttaccc 52681 ggctccctgg ctgggcgcgg tggctcacgc ctgtaatctc agcactttgg gaggccgaga 52741 cgggcaaatc acgaggtcag gagatcgaga ccatcccggc taacacaatg aaaccccgtc 52801 tctactaaaa atataaaaaa ttagctgggc gtggtggcgg gcatctgcag tcccagctac 52861 tcaggaggct gaaacaggag aatggcgtga acctgggagg cagagctcgc agtgagccaa 52921 gatcgcacca ctgcactgca gcctgcacga cagagcgaga ctccgtctca aaaaaaacca 52981 aaagacccag caacccgagt catatcctga tgatatccat gctcctcagt cacgcatccc 53041 gtggtgcagg ggctgacccc aagaggagct gctgccccca gagggtgggg agccgaggca 53101 gggcctgggt cagacttacc aggctatgct cccagcccag ccctcactag ggacccccga 53161 gtgcatctct ctcctctcca ggcctctgtt tctccatctg tgcaaccaca gtgttggaca 53221 tggtagtccc aagtgtctgc tcgtaacttt gccctctctg tgcccccagg tcagggctgc 53281 gataagaccc ggtcacgggt gaccctgcag gagtggaatg accctcttga ccatgacctt 53341 gaggcccagc tcatctgccg gcacctgctg ggcgtggagg ccatgctgtg gtgagtccca 53401 cccaggagag cctgtgcagc ctggggaacg tgccaggctt ggggctgctc catggtagct 53461 ggtcacctct cttgtaattg gccacatctg gggagggcac caggagtacc tctctgcagc 53521 acgaggccag ctcaggttag gacagaggcc ccaagggtca gggccatgcc tgcttcatgc 53581 catgtgctga ctgagcccac cttgcaacct gcccagctgt tcccctgggc ccttggagta 53641 caggcacagc ctcagccaat tccctgagtt ccctggccat cttgcttcgt ccagtggttc 53701 tcagccttgg cagaagctta ggaaaaacag gatgctttta aagatcagct ttaaaaaaca 53761 ggatgccaag gccccatcac tcagggtctc tggagatgtg gctggtgcca ggatacttca 53821 aggtgatcca gtgcttccca tgcgctcagc ttcccctgag ccaactggag ctggagctca 53881 ttcatctgtc acacccctac actctgcaag gaggaagggt gtggtcctcc ccattcctca 53941 tcgactcttc cagggtttat ctctgccctc aaaacccacg aaggtggggt gcggtgactc 54001 atgcctgtta tcccagctct ttgggaggtg gaggcaggca gattgttgga gatcaggagt 54061 ttgaaaccag cctggccaac atggcaaaac ctcttctcta ctaaaaatac aaaaattaac 54121 tgatgtggtg gagggtgcct gtaatcccag ctactcagga ggctgaggca ggagaattgc 54181 ttgaacccag gaggtggaag ttgcagtgag cttagattgc gccactgcac tccagcctgg 54241 accgcagaac aagactctgt ctcaaaaaca aaacaaaaca aatgcaaagg gccttctggt 54301 caacccacag tgggatgggg aggaggagga gacacgggga gggagaaggt ggaaggtgcc 54361 ctggtgggcc ttggcccagc ctcttcacag actgaaagag tcagccctgc catcaccccc 54421 agagccacct ctagcccagg aggcccatgc tgggccaccc ccagccttcc cacaaggaca 54481 ccagaaggtc tggcagggag agtggaggct gctggccctc actctgacct cccccacagg 54541 gagaggcacc gggagctgag cgggggcgca gaggcaggca cggtgcccac gagccctggc 54601 aaaggtaggt cttgcccacc tgggacttgc tcgtggcccg agccccctcc agagatgcac 54661 gggtggggtc ctcgggctca ctcccagggg ccctcacggc tgcctctgcc cacagtcccc 54721 gaggactcat tggcccggct gctccgggtg ctgcaggacc tccgcgaggc ccatagctcc 54781 agcccggccg gctccccacc ctcagagccc aactgcctcc tggagctgca gacgtgaggc 54841 ccgccctacg ctccccttgc tgagtcccct gccaagcgct cggagccccc ccaggacact 54901 ctgcaccccc tcaccccggt cctcctcatt agggtgcagg gcctaggtct cttccaggtg 54961 ggggaggggg gagagtcagg aataagggga tccccagaag tgcagagctg agcaggcttg 55021 ggcctgtcat ggctggccgg aagtgtcccc agctccctac agacgctgta gccatcactg 55081 cctctccagg gaccctcctc tcctgcccag gacagaccca gccagaacca ctgctaggat 55141 gggccgcacc caggggtctg gcctccaggg acctagagaa tgggagggag aacggggccc 55201 caggagaccc ggccgccacc ccacccgcta cccttgggtg ccacagggct gtgctgttgc 55261 caacagtaaa cctgctctta ctgtccaggc tctggggtct tgtgatgagg gtctggggag 55321 aaagtgggcc cggggggacc ccggaggctg tcggtggatg tgccgatgat ggggctgaca 55381 gtatgggctc tgggcatccc tgttcccccc tctttcttcc ccccactctt ctggggtcgg 55441 gggttccttt cccttcccag ttgctgtccc tgggtcccct ctttcatgtc ccacaggcca 55501 cagagcccag tgtgtccaac cagctgttct ctcctcaaag cagcccccaa gcaagtccct 55561 tctctagggt gtccctgagg acagcacaga ggcgggactc agagacccca ttcctcttca 55621 cgcagccctt accccaagcc ctctagctgt gtggctggca gtgttggcca cgtaggggct 55681 cccatccccc caccattgtg tcacatgggc tgccaggctc agctcccagc tgcgtccaca 55741 gtgacctgga tcagggtggg gacaaggact ggaccctcct tctccagaag gccttcagct 55801 cttgccttgc catgcagtca cctccttccc cctctgaccc cagatcccaa aggtgcaccg 55861 ttgccccagc ccctttctgg ccccatgggg tttctctgat gccttcatca tagaggcccg 55921 gggctggtcc gatggttggc aaaacttgac tccggcccag tccccactct tggggactta 55981 gaacccctgc tgtcctggga tctggcctgc ctttctttgg tcagtccctg tggtccccca 56041 ccagctcccc ctcccatagg gctgcccacc aagccctgcc cccagcccaa gaggagcccc 56101 cactgcctgc ggggcagtga tgtctggcca ccggctcaca ccaatgactt ggtcctgggg 56161 tggcagaagc agcaggtgac aggagcaggg cccctgtccc tctcttctgg ccctgtggta 56221 cccaggccac acgttgtgcc cgctcttggg gctgaccggc tgtagggacc accagccgct 56281 gctactgtgg gccgccccgg ggcagggtgg gcagggcttt tgtgggttat gaggacacag 56341 aagtccctga ggcccccaga cctggctcag ccaacctcct tcctcccccg gttgcccccc 56401 actctaaagc ctcctccctc ccagcgtcca ctggctccag gctcctcaca acagcagctc 56461 atagacacgg ggcgtctcca ggtggtccca gccctccaga tgtttctagc tctccaggtg 56521 ggcgctgttt tcacgtctgc ctgcatccat tcattccttc attcctcacc tttatcctgt 56581 tatctctatt tttttaagct accaggaagg aaagggaaga agagatcacg aaactgggac 56641 ccccagaagg gaggagtggg ctttgaactt agacatctac ctcagagctc aaataggttg 56701 tttaaaatca cattcaattt tcagatgaag gggaacttta tagttttttt tttttttttt 56761 ttttttgaga cagagtctca ctgtgttgcc caggctggag tgcaaatggc ttgatcttgg 56821 ttcactgcaa cctctgcctc ccaggttcaa gcaattctct tgcctcagcc tcccgagtag 56881 ctgggactaa aggcgtgtgc caccatgccc agctaattct tgtattttta gtagagacgg 56941 agtttctcca tgttggccag actggtctcg aactcctgac ctcaggtgat ctgaccgcct 57001 tggcctccga aagtgctgag attacagttg cgagccactg tgcgtggcca gaactttata 57061 ataagagact tgaagctggg tgtgacggtg cacacctcta gtcccagcta ctcgggaggc 57121 caagacagaa ggatcacctt gaggccagga gtttaaggcc agcctgggca acatagcaaa 57181 acctagtccc taaaattaaa aaaaaaaaaa aaaaaaaagg aaaataaagg agacttgaaa 57241 tttttgaact aaatagtggt gatggctaca cattgtgaat gtaattaaca ccactgagtt 57301 aaacacttaa aatggttaaa atggcaaatt gtatgttata cctattttac tacaataaaa 57361 agtataaaaa agagaagata tttaggtgac ttacagcaac caattgcaac aaaacaaaat 57421 gttaagaaat gatcttttta tgaggcaatt ggaaatttga acactgatca actataggat 57481 gattggaatt attaattttt aaaggtgtga taagatactg cacttggctg ggcacagtgg 57541 cacatgcctg taatcccagc tacttggcag gctgaggtgg gagaatcgct tgagctcagg 57601 agttcgagac cagcctgggc aacgtggcga aatccccgtc tttacaaaaa caaacaaaca 57661 aacaaaaaag atattgcagt tgtgttgtaa gcgtccttat ctttcagagc tacatagtgg 57721 aatgtttatg gaatatttag gataaatgat ataggcattt gggatttgct gcaaaatgac 57781 ccagaggcag gggtcagggg gagaggtaga gatgagacaa gaggtagagg ggagaggtag 57841 aggtagccac gagctgataa ttacagacaa gagatgcgga gtatgtgggg gctcattatc 57901 ctgcatagtc tatctttgta tatctttgaa cttttcaaga ataaaaaagc ttaaaaagta 57961 tacatggcct ggtcctacca gagactcacc caatgccagc ctccagccag ggagagccaa 58021 gtttgcattt tcacacgcat ctcacactcc tctgcactct caacttggag cgctccaaac 58081 agggaaaccc caagccttgc tggcttctgc caaccccctg agcagaagca tgggtccccc 58141 tgatcaccac ctcaccaccc tcatcctgat ctcactgtac acagcaagca aaccccagtg 58201 taattaacaa gactgaggat acctgtaatc ccagcacttt gggaggctga ggccggcaga 58261 tcacatgagt tcaggagttc gagaccagcc tggcaaacca tggccaacat ggcaaagccc 58321 agtctctact aaaaacacag aaattagcca ggtgcggtgg tgcgcatctg taacccctac 58381 tacttaggag aattggattg aacccgggag gcggaggttg cagtgaactg agatcgcacc 58441 actgcactcc agcctgggtg acagagtgat tctccatctc aaaaaaaaaa aaaaaaacaa 58501 caagacagag gagaggaggg gcagagtctt ccagaacatc tttttttttt tttttttttg 58561 agacagagtc ttactgtgtt gcccaggctg gagtggagtg gagtgatcat aactcactgc 58621 ggcttccaac tcctgagttc aagtgatcct cctgcctcag ccttctgagt agctgggact 58681 acaggtgtgc accatcatgc ccagctaatt ttttgtactt ttggtagaga cagggtctca 58741 ccatgttgtc caggttggtc tccaactcct gggctcaagc catcctccca cttcagcctc 58801 ccagagcgct gggattacag gaccaccgca cccagccacg aacacattct caacgctcat 58861 ccaagcaagc tctgagcctc ccctttgtcc ttagcctagg acagccccca gtgtcccagg 58921 gctcagacat gacccctgac ctggctgccc agctaggttc tggggcaggg ccagtggggc 58981 atcctgtata ccttggctct tccaggaata gtagggaaag gcaaagaggg ctgggcacgg 59041 tggctcatgc ctgtaatccc agtactttgg gaggccaagg tgggtggatc acctgaggtc 59101 aggagttaga gaccagcctg gtcaacatgg tgaaaccctg tctctactaa aaatacaaaa 59161 attagctgga catggtggca ggtgcctgta atcactactt gagaggctga ggcaggagaa 59221 ttgcttgaat ttaggaggtg gaggttgcag tgagccgaga tcgggccact ccactccagc 59281 ctgggtgaca gagagacgct gtctcaaaaa aaaaaataaa aaggtcaagg ggaaattaag 59341 attcaagggg cctgccctct ccctcttccc tatggctttt ggggccatac caaccttttc 59401 tcctggctgt ggccttggtg gataccattt cccccatagg cagcccacca gcccgacccc 59461 ttagagcttc atagcctgct gtgctctgcc gtcctatggg caactaagcc tttttgctgg 59521 attgggtccc tcccctccct tcctggccca aggccctggg tccagggaca gctgagcctg 59581 atgcagcccc taccccaccc caaatcacca gcacctggtt ctctggctac ttccttaggg 59641 gtacacaaaa atcctcatgc ctgttggatc tccttgggtt tcaaggacct tctctctagg 59701 aagtatggcc tccagggtct cctgagccta aacgcaggca gtgaaaggga aggagccgtg 59761 gacttattag ggctacaagt ggaaacctgg atcttgagtg cacacacaca gccacccttc 59821 tggcagcatc caccatgcct ctcctcccct ccccagatcc tgaccactgc agctggaagg 59881 gaagttaaca tcagtgcatg cctggcactg ccccctggtt ctcctctcag gccctgagca 59941 gagctgccca tagtccatca tgccacattg tttctggagg ccacctggaa aagcgagagg 60001 agcaggagag ggtatacctt ccatctgtcc cacaagtcta taccaccccc tcccagagac 60061 ttcaagccct ggcattcaac tggggcctgg ggcaggttcc cagatgtttt gtttgtttgt 60121 ttttgttttt gtttttgttt gagatggaat ctggttctgt cgcccaggct ggagggcagt 60181 ggtgtgatct caactcactg caacctccac ctcccaggtt gaggcgattc tcctgcctca 60241 gcctcctgag tagctgggat ttacaggcac atgccaccac tcccagctaa tattttgtct 60301 tttttttttt ttagcagaga tggggtttca tggagtttca ccatattggc catgctggtc 60361 ttaaactcct gacctaagat gatccttctg cctcagcctc ccaaagtgct gagattacag 60421 gtgtaagcca ccgcacctgg cctatttttt tttttttttt tttgagacag agtctccctc 60481 tgttacccaa gctggaggat agtggaacat tctcggctca ctgcaacctc tgccttccaa 60541 gttcaagtga ttatcccacc tcagcctccc gaatagctgg aattacaggt gcccatcacc 60601 acgcctgact aatttttgta ttttttagta gagatggggt ttcaccatgt tggccaggct 60661 ggttttgaac tcctgaactc aagtgatccg cccacctcat cctcacaaag tgctaggatt 60721 acaggcatga gctaccatgc ccagtcccag atgtcttgag gatctgattg gacccccagg 60781 cagatatctg cagcccaggg ggctggtgtt caccacaggg cagggcaggc accctgggaa 60841 ggcttgactg gacccagctc acccagtacc tgtttctctc aacctcttag gaaatctgaa 60901 caggtatgca gcaaccacct ccggacaagg ccttttcaaa gtataatttt caaacatgta 60961 aaaaagatga ctcatatata tcatgaacac atatcaaaga tctactacgc atcaattcag 61021 aaaaagagtg ggaactctga ttcatggatc attctgagaa gctgggtaca tgtatttcat 61081 gtatttgtaa gcattctttt gttttgtttt gtttttgaga cagggtctcg ctctgccacc 61141 caggctggaa tgcagtggtg cagtcatagc tcattgtagc ctgttcctcc tgggctcaag 61201 caatcctatt gcttcagcct cccaagtaaa tgcaactaca ggcacacact actatgccct 61261 gctgattttt taaattttta gtagagacaa gatcttgcta tattgcccag gctagtttca 61321 aactcctggg ctcaggcaat cctccctcag cctccctaag tgctgggact acagatgtga 61381 gccactgtgc ctggcccgta aacactttaa gagaagaatg ctagaataag attgtgatat 61441 tacatacgaa agtggttaat taatgaagtt ttcttttctt tttttttttt ttgttttttg 61501 ttttttttga gacggaggct cattctgtcg tccaggctgg agtacagtgg tgcgatcttg 61561 gctcactgca acttctgcct cccgggttca agcaattctc ctgcctcagc ctcccgagta 61621 gctgggatta caggtgctca ccgccacacc cagctaattt ttgtattttt agtagagatg 61681 gggtttcacc atgttggcca ggatcgtctc catctcctga cctcgtgatc tgcccacctc 61741 ggcctcccaa agtgctggga ttacaggtgt gagccactgc gcccggccat gaagttttct 61801 tttctacggg atagacaccc acaagctaat aagtaacttc agtgtctggg ctgtaaccaa 61861 acaataaata gcaaagtcaa cacaggcaca acccccaaga ttaggatcag caaacgtttt 61921 ctgtaaagga ccagacatgc agctctcagt cacaaccacc atgccatgta gcatgaaagc 61981 aaccacagcc aatgtgtaac aaattagcat ggtatgtgtg gccaggggca gggctcacgc 62041 ctataatccc agcactttga gaggccaagg cgagaagatt gcttgaggtc aggagttcga 62101 gaccagccta ggcaataaag ttggacccca tcgctacaaa aagtaaaaag aaaaaaacta 62161 gcagggcgcc gtggtgcctg cctgtggtcc caggtactgg ggaggctgag gcaggaggac 62221 tgcttgagcc caggagcttg aagctgcagt gagctatgat ctcaccactg cactgcagcc 62281 tagacaacag agccgagatc atgccactgc actccagcct gggcgacaga acgagatgct 62341 atctcaaaaa aaaaaagaaa aaaagagaaa aaaagaaaaa aagtgataag ggtaataatt 62401 ccctgaggat ataacctcca cacttctggg tcaccctgga aaagcttggg tcaaggatgc 62461 tacccttggc taggaacagt ggctcacgcc tgtaatccca gtactttggg aggctgagtg 62521 cagtggcgca atctcggctc actgcaacct ccgcctcccg ggttcaagca attctcctgc 62581 ctctgcctcc caagtagctg ggattacagg tgcccaccac cacgcccgac taattttttg 62641 tatttttagt agagatgagg tttcaccatg ttggccaggc tggtctctaa ctcctgacct 62701 caagtgatcc gcctgcctca gcctctcaaa gtgccaaagt gctaggatta caggcgtgag 62761 tcaccgcacc cagcctaccc atatcacttt ctgttcctca gcagttaaag aggtgcccac 62821 ccacttcgcc tggaaagtca gattctgtgt ggggctgttg cagaatgaga tttcctcctg 62881 ccagggtgca agaactcaag taggaaatgg gccttcccag aaggggtaga tgctccagat 62941 ttcctcagca ctgtagctgc tgctcagcct tggagaaaca gctgccaaac tgttagggcc 63001 ctggtagctc ctctgagacc ccagtccctg ttgcttcaaa cagcttaagg agggccttta 63061 tgctaaactg tagtctgcaa agcaaccaaa ccaaaaaggg taagatacac ttatccacct 63121 gtagggtcca gccccacagg gtcggtgggt ttctctccat gtgcagagac gagagagcgc 63181 agaaataaag acacaagaca aagagataaa agacagctgg gctgggggga ccactaccac 63241 caagacgcag agaccagtag tggccctgaa tgccaggctg cgctgatatt tattggatac 63301 aagacaaagg ggcaggataa ggagagtgag ccatctccaa tgataggtaa ggccacatgg 63361 gtcacgtgtc cactggacag gggcccttcc ctgcctggca accgaggcag agagagagag 63421 gagaaagaga gagagacagc ttacaccatt atttctgctt atcagagact tttagtactt 63481 tcactaattt gctactgcta tctagaaggc agagccaggt gtacaggatg gaacatgaag 63541 gtggactagg agcgtgacca ctgaagcaca gcatcacagg gagacggtta tgcctccggg 63601 taactgtggg cgggcctgtg gaggagtaga gtcttctcta aactcccccg gggaaaggga 63661 gactcccttt cccggtctgc taagtagcgg gtgtttttcc ttgacactaa cgctaccgct 63721 agaccacggt gggcttggca acaggtgtct tcccagatgc tggcgttacc gctagaccaa 63781 ggagccctct ggtggccctg tccgggcata acagaaagct tgcactcttg tcttctggtc 63841 actcctcact gtcccctcag ctcccatctc tgtatggcct ggtttttcct aggttatgat 63901 tgtagagcga ggattattat aatattggaa taaagaataa ttactacaaa ctaatgatta 63961 atgattcata tataatcata tctaagatct atatctagta taactattct tattttatat 64021 attttattat actggaacag ctcgtgccct cggtctcttg cctcggcacc tgggtggctt 64081 gctgcccaca tccaccaagt gcactttggg aggctgaggc tggaggactg ctggaggcca 64141 ggagttcaat accagcctgg gcaacatagg gagacccccc ccccccccac catctccaaa 64201 aataaaaaaa aaaatagcta ggtgtgatgg cacatgcctg cagtcctaac tatcgggagg 64261 ctgaggcagg aggatctctg gagcccagga gttggaggct gcatgatggc gccactgtac 64321 ttcagcctgg gcgacagagg gagacgctgt ctctaaataa taataataat atataaaaag 64381 atatacgtaa ccctacgggc tcccctctca ctgctgtata cacctgcctc cttctgcatc 64441 cctcacccaa ttgtccctcc cggctaacaa gccagctgca ccccccaggc agaggtcctg 64501 cagcagtcgg tataggcagc ctgggcaggg ggcttgtcat agatctgcct ctggccacga 64561 gatcttctgc tccccctacg gctgctagcc gcagctgtcc actacccctt agctttattc 64621 tctctattca ggggcctcct gctgcgcact ccaagggggt gcacgtttcc aaagagagat 64681 caagggacat ggggaaagga gaaaggtaca gcttgaggtt cccgctcagc caggactagg 64741 aggcctctgt aggggcagaa cagccctgcc ggcgctctga gcatgcgccg aaaagcgccc 64801 acggggaaag ccatgccggg gtgcggggtg gggggggggg gggggcggtg cgctcgccac 64861 tgacgcatgc gcagagacac gggtctctct cggggttttc tcgcgcctgc gcaagatcct 64921 ccagcccaac agggggcgat gagccgatcc ttgagcgggt ttgccccgcc ggagtaatcc 64981 ggaagaggcc tcttattagg gctctggtgg cgacggtggc ggacacttgg ggtctggacg 65041 caacggcggc gggagcatga acgcccctcc agccttcgag tcgttcttgc tcttcgaggg 65101 cgagaagtaa gtgacgccgg ctgcggaggg ccgaggtgcg cgggcctgcg gctgtcgcat 65161 tctggggtgt ctcctgccca tctcttgccc cgggagggaa acagcttttg cttgatcgtc 65221 tgctcggcct gtgtcagcca aatttaaggc cgttccgcga ggtagtcgcg agtgtgttca 65281 ttttgcagat gaggacactg aggcttggag gcgagcaagt tgccccggac acaggaagtg 65341 gcggcgcttg tattcgaacc cagagtgtct ggttctccat agcccttttc ctctgggatc 65401 tggcagcttc ctctttttgt ctctcgcccg ccacctccat ccttcatccc ctccgatctg 65461 tcagcggcct gcgtaaaagc cgcagtggct cctcgctgcc tgcgggttga aggagtccag 65521 gggagagagc ttgggctcac tctgtatctg tgggcgagtg gcttgatctg tcggaggccc 65581 tgtttcctca tctggaaaat gagtttatta ctgtacttaa acgtgggatt caggccgggt 65641 gcggtggctg acacctgtaa tcccagcact ttgggagccc gaggcgggcg gatcacctga 65701 ggtcgggagt tcaagaccag cctgaccaac atggagaaac cctgtctcta ctaaaaatac 65761 aaaattagcc gggcgtggtg gcagatgcct gtaatcccag ctactcagaa ggctgaggca 65821 ggagaatcac ttgaacctgg gaagtggagg ttgaggtgag cggagatcac gccattgcac 65881 tccagcctga gcaacaagag cgaaactccg tttcaaaaaa caaaaaaagg gatctggcca 65941 ggtgcgctgg ttcacgcttt taatcccggc actttgggag gctcaggtgg gaggattgct 66001 tgagctgagc acaggagttc caggctgcag tgagctttga tcgcatcact gcactccagc 66061 ctgggcgact gagcgcaatc ctgtctcaaa ataaaaagaa ttggatgagg taatttgtgt 66121 aaaactcttg gtccaaagct cagcccctgg aaaacctcat aaccattggc tcttattgtt 66181 cccaggttgg cgtgcagagg tcttggtctg gcccttgcca gccacttcgg cctcctttag 66241 agcagactct caactgggga ccatttggcc ccacaaggga catgtggcaa taactgagaa 66301 cattattggt tgtcaaaggg ggttatgact ggaatctact gggtagaggt cagggatgct 66361 gctgatgtcc tacaatgtcc acaagacagc accccacagg aaagaatgat gcaccctaaa 66421 atgtgtgtgg tgtcagcctg gagaaaccac cttagaaaaa aggatattca acttgatacc 66481 tctcacctgt gtgagatttt actgtcagca tagttgccct caaattgagt gtgcattaga 66541 atcacctgaa gggcttctta aaccatagat ccctgtgccc cacctccaga gttctgggtc 66601 cagtaagtgg gaggtgggag actcttaaca agttcccagg ggattctgat gctgcttatc 66661 tgggaactgc actctgaaaa ccccttgtct tggaaaatcc aatggaagtt tcctgtcttt 66721 gtccatctaa gcgttcacta aaatcatagt aaatgccaga gcatgacttg ttttggggaa 66781 tggatggaag gactgcacac tccagccagg gggatggttg ctttcggaac tcacagggtc 66841 aggaccagcc tcaccgtgta acttcagggc ctctcttctt caccttttgg ttgcttcatt 66901 atgttggcgt aagaaaaaag gaacaggcca ggcatgatga cgtgcgcctg tggtcccagc 66961 tactcaggag gctaaggcgg gaggatcact ttagccgagg agttggaggc tgcagtgagc 67021 tatgatcaca ccactgcact ccagcctggg cagcaggaga gatcatttga ggcgaggagt 67081 tcaagaccag cgaggacaac aaggcaaaac cccatctcta ccggaaaaaa aaaaggagga 67141 attcagaaaa ggccaaggtt ccatagacct ggggtagcag gaagtagggg accgggttgg 67201 cgaggccttt atgggatgaa gagattgaga aaggccttca ggaagaggca gagcttattt 67261 ttccttttta tttttatatt tttgatacag agttccactc tgttgcccag gctggagtgc 67321 agtggcatga tctcggctca ctgcaactcc acctcccagg ttcaggtgat tctcctgcct 67381 cagtctccca agtagctagg actacaggca tggatcacca cactcagcta attttttttt 67441 ttctttgtat ttttagtaca gacagggttt tgccatgttg ggcaggctgg tcttgaactt 67501 ctgacctcag gtcatccatc cacctcagcc tcccaaagtg ttgggattac agacacgagc 67561 caccacacct ggccgggagg agacagggtt tagactcagg gtgggaggag gggagcccct 67621 gctcctcatc tgtggcctct gtgtcttcct aggatcacca ttaacaagga caccaaggta 67681 cccaatgcct gtttattcac catgaacaaa gaagaccaca cactgggaaa catcattaaa 67741 tcgtaagttt cccgtcacag gctctcaggg gctatgttta atgggcccct ctgggcaaag 67801 cacaagttcc aagtcttcct cagtctctct gagctacagg aagtccttct gggggcttgc 67861 tctgggcatt ttataatctt ggtacctttt tatcaaaatg acaccgacct ccatgtcttc 67921 acataagcca cacttctgta gtggcttatt ctgtagatga acacgtgcat ttacattcac 67981 tgtttgtttg tttgtttgtt tttgagatgg aattttctct ctttgttgcc caggctggag 68041 tgcaatggca cgatcttgct cactgcaacc tccgcctccc gggttcaagt gattctcctg 68101 cttcagcctt cctagtagct gggactacag gcccacgcca cgatgcccgg ctaatttttg 68161 tgtttttagt agagacgggg attcactgtg ttggccaggc tggtctcgaa ctcctgacct 68221 caggtgatcc acctgccggc ttcccaaagt gctgggatta gagacgtgag ctactgcgcc 68281 cggccttaca ttcaccatta agtgcacatt ctgtaaatga acgagcagca gattccagca 68341 cagtaggcct gtttgttatt catgctgagg gacattgaaa agcagtcaag attagctccc 68401 ctcatgccac ccgtttgtga cctcagtgcc acctgggcct ctaagatgta tgtggcagag 68461 actagaactt ctatttaaca agagaaagaa gtaggagttc tgtggctatc cagtgcccct 68521 aatgggcacc ctgcattggt tggtgggtgt ttgtttatac cagtgaggtc ctagccaggt 68581 ggttggtcag ggatgtgtct gccattctgg cccagggact tgccatttta ctgtagactt 68641 ggctcttccc tctggtgcca ccagcgaagc caggcttttc cccgtctgtg agtggggtct 68701 gcccagtctc ctggtgtggt caggcacggg gatggaggaa tcctgcccgg agcccagcca 68761 gtggggcacc caagctctcc tggggccctg ctgctcccct gctcttgagc tcagtgttgc 68821 cacctatgca cgggtccccc ttgtggtgct cacgccactc tgagccgctc tagaggagac 68881 ccctgcgtgc ccagggcaca gtacgcgacg cagccccatg gcacaaggcc cagtgtgcgt 68941 ggccagccgc tggcggctgt gctgctcgga gggcagagtt ccatttgcat tccactcact 69001 gccgggagct ggccctcctc tcctgctgcc tttatttgct gcgtggaggc tctttgtggg 69061 ctttttgggt ctgagtcact gaacactggc tggacattca ctgcctggga gatctctttg 69121 ccagcactgc ggtgcccggt gggaagaggt tgctggaatc tctttccagc tcgggtcctt 69181 caacctcatg gatgcagagt gtgtgtgtgt ggcggggata gggtgtcccc acatgaggta 69241 tcttcctaaa agtgttactt tccttgaaga ctttttggga aaaaagttgg gggggatttt 69301 tggctagaaa gtctgttctt ctgctctgag ctggctgcta cccagcttgg gcgtgaggct 69361 agaccctcac actgtctgtc cctggacctc agacaactcc taaaagaccc gcaagtgcta 69421 tttgctggct acaaagtccc ccaccccttg gagcacaaga tcatcatccg agtgcagacc 69481 acgccggact acagccccca ggaagccttt accaacgcca tcaccgacct catcagcgag 69541 ctgtccctgc tggaggagcg cttccgggtg agggcagggc ctggaggggc agacggggtg 69601 ggctggacac tggcccgtgt gcccaggcct gggacagccc tggcctgttt cttcggaggt 69661 cctcagggag aggcggcggt gatggaagaa cagggacttc caccacaggc tccaggacat 69721 gtggactgag gggctgtgga gtctgggcct gtggctcccg tctgccccat gggacttctg 69781 tagtgctgca gggtccctcg ggtgctgtgg gccagatccg ggcggggacc tactgtcctt 69841 tgggggtgct cttctacgtc ccttgtcggt gattggcaag gcctggtcct tccaggcctc 69901 tgggaggcag ctcaccccag ggtggcccac acctgttcct agcagggcgc ctgggaatct 69961 agaacagttt agaggggaaa gagccacagc aaagaaaagc cgaggcaggg tgatcacgag 70021 gtcaggagtt caagaccagc ctggcaaaca tggtgaagcc ctgtttctac taaaaataca 70081 aaaattagct aggcatggtg gcatgtgctg tagtcccagc tactcgggag gctgaggcag 70141 gagaatcgct tgaacccggg aggcggaggt tgcagtgagc cgagattgtg ccactgcact 70201 ccagcctagg taacagagca ggactccatc tcagtcaatc aatcaatcaa tcaatctcag 70261 cggttgaact acccttgaca tggttcagct ctgtatccac acccaaatgt catgtcaaat 70321 tgtaattccc agtgttgtgg gagggacctg gtgggaggtg attggctcat gggggccgac 70381 ttcccccttg ctgttctcgt gatattgagt gagcgcttgt gggatctggt tgtttaaaag 70441 cgtgcagccc tcccacttca ctctctctgt ctctcctgct ccaacatggc cagacgtgcc 70501 tgcttcccct tcgccttctg ccgtgattgt cagtttcctg aggcctcccc agccacgctt 70561 cctgtacagc ctgcagaact gtgagtcaat taaacctctt ttcttcataa attacccagt 70621 ttctcatagt tctttatagc agtgtgaaaa cagactaatg gacccttctg gttgaaggaa 70681 tgcagccatt ctgcttgttt gactatgtcc tttctattca tctctatttc ctgggaggtg 70741 tttatccaag tgcaatagga ggtattggtg accgcacagt cccctcagtg ttctgctagt 70801 aaatagttga aggttgatca ttgatcttct gcgttttcag tctggcatgg aaaagcccct 70861 gtgcaactgg taaagatatc aataagcacc aggaggtatc taaatccacc aggagccata 70921 ggcatcacgt tgacgtccat ttaccagtct tccctggcaa gattcttctg aattgtgctg 70981 ccttggccaa aagaggtatg ggaggggctg ggcgcagtgg cttgtgcctg taatcccaac 71041 attttgggaa accaattcag gtggatcgtt agaggtcagg ggttcaagac catcctggcc 71101 aacatggtga catcccatct ctactaaaaa tacaataagt tagctgggtt tggtgttggg 71161 tgcctgtaat cccagctact cgagaggctg aggcaggata atcgcttgaa cctgggagga 71221 ggaggtggca gtgagctgag atcgtgccat tgcactccag cgtgggcaac aagagtgaaa 71281 cgtcgtctca aaaaataaaa aaaaagtccg ggcttgatgg ctcacacctg taatcccagc 71341 actttgggag gccaagacgg gtggatcacg aggtcaggag ttcaagacca gcctggccta 71401 gatggtgaaa ccctgtctct actaaaacta caaatattag ctgggcatgg tggcatgcac 71461 ctgtaatctc agctactcag aagtctgatg caggagaatt gctaaaaccc aggagggaga 71521 ggttgcagtg agccgagatt gcaccactgc actctagcct gggcgacaga gcaagactcc 71581 gtctcgaaag aaagaaagaa agaaaggaaa ttccccaggg aagtacctcg gctgatttca 71641 taaacaggta ccgaaggaag cagaggcatg tggaggactt ccccacctca tgcagctatt 71701 tgggccgtgg cgtctgaaat ttattatttc agagtcaccc ctttgatgac cttggcagtg 71761 aactgcagtc atctgtttag gcctttccat ggcccacgtc aatgccgtta tttctgtttg 71821 ttgcacattt gatttccttg ttgttggcat ttagaaggcc ccctgcttcc cagatcacac 71881 cacgggcatg gaccacagag attgcatctt gtgagtctgt agaaatggtc aaggccttgt 71941 cctctcttag gtccagagct caggtgaatg cagattttcc cggccatctg tgctgaagtc 72001 cctgtgggga ggctcctggc tggtttcctg taggtagaca gctacacgtc ctgcccttca 72061 ttggcttctt ttcatgaagc tcctgccatc tacaaaacat gtctcccttc ttgaatcaca 72121 tctctgttat tgaaactcta gaagtcaacc gggcatggtg gctatgccta taatcccagc 72181 attttgggat gccaaggcgg gtggatcacc tgaggtcagg agttcaagac cagcctggcc 72241 aacatggcga aaccccgtct ctaatacaaa tacaaaaatt agccaagcat ggtggtcact 72301 gtactccagc ctgggtgaca gagcaagact ccgtctcaaa aaaaaaaaaa aaaaaaaaag 72361 agaaagaaag tatcatgctt ttctgcattc tgtgaattgt tttagtgagt tatcgaactt 72421 gagggcatgg tgggaacctc caaatttgca gccagttggt gagaagtaca tgtggtctga 72481 ggacacccaa gcctgcaggt gtgtctaaag cgagggcagc ctagtgggga ctggtggcct 72541 taacctgtgg catttgaggt aacatcaggg agttgacatc agaattgcat cacataggct 72601 gggcgcggtg gctcacgcct gtaatcctag cactttggga ggccaaggcg ggcagatcac 72661 gaggtcagga gatcgagacc atcctggcta agacagtgaa atcccgtctc tactaaaaat 72721 tcaaaaaatt agccaggcat ggtggcgggc gcctgtagtc ccagctactc gggaggctga 72781 ggcaggagaa tggcgtgaac ccaggaggca gagcttgcat tgagccaaga tcacgccacc 72841 gcactccagc ctgggtgaca gagcgagact ccatcccccc gaaaaaaaaa agacaaaaaa 72901 aaaattgtgt cacacaggcc agatgcagtg gctcatgctt ataatcccag caatttgaaa 72961 ggcaaggtaa gaggatcgct tgagcttgag tctgaggccg cagtgagcta tgaccacacc 73021 actgcacccc agtctgggtg acagcgcaag accccaactc caaaaaggaa aaagaaaaat 73081 cacaaggaat tgcatgtcag agtgcctgtc tttcacagct ttaactgctg caggaacgaa 73141 cttttttttt tttttttttt ttgagagggt gtgaggagac acaatctctg ctagtgattc 73201 tcctgcctca gcctcccaaa tagctgggat tataggcgtg caccaccacg cctgcctaat 73261 ttttgtattt ttagtagaga cagggtttca ccatgttggc caggctggtc tcaaactcct 73321 gctgggatca tgggcgtgag ccaccacgcc cggccacctt tagagttttc ttaccacctg 73381 gttttcctct ctcaatatct ttctctcatt tcctgcctta aaactctagc ttggcatctg 73441 ggcgcagtag ctcatgcctg taatcccagc actttgggag gccgaggtag gtggatcact 73501 tgaagtcagg agttcgagac cagcctggcc aacatggtga aaccttgtct ctactatttt 73561 tacaaaagtt agtcggacgt acagacgggt gcctgtagtc ccagctactt gggaggctga 73621 ggcaggagaa tttgtttgaa cccagaggtg aaagttgcag ggagccgagg ttgtgccact 73681 gcactccagc ctgggagaca gagcaagact ctgtctccaa aacaaacaaa caaacaaaaa 73741 aaccctgtag cttgggatca gccttctctt ctattgtttt tctttaaaaa ataaaaatta 73801 aaaatggatg tagatgctat gttgctgagg ctggcctcaa actcctggcc tcaggtgatc 73861 ctcccgccat gacctccaaa accacaggga ttgtaggtgt gagcactgca cccagcctta 73921 tgtttttttc tacataaaaa acagcacagg attatcttcc agagctaata aatatgttca 73981 aataaccaca accccattaa ggaaaaatat cactgggcag caaataatca atccagacca 74041 atatgatcac aattgctgtg aaggtgagaa aagttcattt ttattatgtt tccccaagag 74101 acgcactcta ttgttctctt gaaaacacac agctcatgtc ctcctttaga acacacatcc 74161 tctttaaagt aacatacaaa catgccaaaa caaggtaaaa aattacatct gaattctcac 74221 atttcaaata tatatgaaat atcaaataaa aatttatttt tacaagaatt taggggaact 74281 actacatagc tataaatgta atatatatgt taactaagta tcatagataa aaaccatgct 74341 cccttcagca gcacgtgtaa taatagatgc aaagattgaa aggtaaaaga tttaggatga 74401 aaagaatcct ctcttaaaaa ggaaaacaaa attatatgta tgtgtatata acagttataa 74461 tatccatcac acagctttat agaaacagca tctattcaaa aataccagta tttccaaaat 74521 atttaaaata atatttaaag taataataat atttaaataa ataaatatat ttaataaata 74581 tttaaataaa taaaataata tttaaataat tctataccca tgtttttcaa aataaaccaa 74641 taaaatagat agtatatatt agacgtgtta gtatatatat ctgaggcatg ttaaaaatca 74701 caactgaatt ctcacaattc agtcacaaac ctaaacagca aataaaaatt tctatcacca 74761 gaattatgtt tttttctggt ggggaactac caatagctat aaatagaaga gattattatg 74821 gaagtatcat agataaaaag agtgctcgct tcaggagcac atgtaataat acagaaacaa 74881 atttaaagat aataaaatat ttaggataaa aagaattgtc tcttaaaaat gaaaagaaaa 74941 ttagctttat gtatatataa caactataac tctcatcaaa aaactacagg aacagcatgt 75001 tttcaaaagt acaacaattt ccaaactatt tgaaataaat ctatgaataa ttcaatggcc 75061 gacattttcc atacaaacca ataaaatgca gagtgtgcat gaagctatct gttacaatct 75121 gtggcactga tatttcacaa aagaattctg tcccaatctg agcccctgca ttgtgccttc 75181 aaatcctcct ggactgcaag tccgtaagaa acaggacctc caggttccgc cccagggagg 75241 ttggaattca gcaatataaa aagggtggtg gtgccgcagg aaagggtgga actggaaacg 75301 ctcctggttt cttacttttc tccaaggact cctagaagga ccccaccccc ctccccccac 75361 ccctgctcct aggaggacaa cgtgatcact gtattcagct ccatcaagaa tggtccaggt 75421 tcttctagat gatctgcaca aatggttcct ctcctcctgc ctggtgtctg ccattagcat 75481 tggaataaag ttcctgctga aaatccacat ctcccctggg tccggtgttc tggaagcgag 75541 agagacaatg tcacacttca aggaggcagc tctctagaca ggaaagttat tcacgtccca 75601 tgtcaattga gaaatgcaat tttatctgct gcctttcatt ctataccctg cttctgaacc 75661 atcgtgttca actgtgaaac tcacactttg gtgaccacga ctccaaaact cacttaatac 75721 acccaaggtc agccccagtg atctgcttca tagcaaggac tttgggtggg tctgcccagg 75781 gagtagggca ccctcagaga atgtggcttt ggactttatc acagctgggg ccttttgtgt 75841 cacttaagat ctaaacttgt aaccatgcta gatgtgtttc taatgtgaca acatcacgaa 75901 ccacgagtcc agaagcctaa tccttaatcc tacctcctca tgatgaagtc tcatgctctg 75961 tgctcaacgt ggttagctgc acaagatgta aaccaaagct tcactgaacc ctcgacccaa 76021 atcagtaact caagtgcgtc aatcataatg aacctcccca aactcagttt ttatgattat 76081 ttttgaggca gggtctcact ctgtcgccca gactggagtg cagtggcagg atcagggctc 76141 cgtgcggccc cgaccttcca ggctccagcg atcctcccgc ctcagcctcc tgagtagttg 76201 ggagtagaga tgcgtcccac atcgcctggc taatttttgt atttttgtgg agaggggatc 76261 tcgccacgtt gcccaggctt gaagccagat caagcaattg ggttcttcgg atttccgaaa 76321 tagaccccaa tattctgcct ttaccccaga ggatgcagat gtaccttctc tcaggccgat 76381 gacctcaggc ctccccggtc cctggagctc taggaaaggt gagcgcgatc tcgcgcccac 76441 acccagtgct ctgggtcata agcctggatc tggaaaaaca aacgcccttt gagaagacgg 76501 ggactcgcca ggatatccct ctctcccctc atccagcctc cagcccaccc aattcctccc 76561 cacctcctcc acctccccag gccccactca cctcctccaa ctcctctggg gaaacccaag 76621 ccctgcagct catggaacag aagaagtgga accgtcgttt ctggaacagg actatctgag 76681 agcggttctt cctggccctc gagttcatgg aacggtataa ctggaaccga cgcttacgga 76741 gcaagggtat gcgagagcgg ttcttcctgg ccctcgggtt cgtggaacgg tataactgga 76801 accgacgctt acggagcaag ggtatgcgag agcggttctt cctggccctc gggttcatgg 76861 aatggcctaa ctggaaccaa cgcttacgga gcaagggtat gcgagagcgg ttcttcctat 76921 acaggaagtg gaagatgttt tgtttggagt cctcgtcgtc ctcctccatg tcattggcca 76981 ggtagctgag gacagaaatc aggttgctgc tcaggggcac caccaggaga ggcctccggc 77041 tgaggtcagc ttcccagaga ggaaggtaag ggaccgtccc tagctcagga ctggcaccca 77101 ccctgcagag agccatgcct tcctcaggag ggctctgctg gacagagacc tgatcaaggg 77161 cgtctcccac tccttcagga tggagacaaa aacccaactg gtggccgaga gtggtggctt 77221 acgcctggaa tcccagcaca ctgggaggcc aaagcaggag gatcacttga ggccaggagt 77281 ttgagacggg cctgggcaac atagcaagac cctcgtctct attaaaaatg taagaaatat 77341 gccagacgcg gtggctcatg cctgtaatcc cagcacttta gaaggctgaa gcaggtggat 77401 cgcttgagac caggagttgg agaccagcct ggtcaacacg gagaaacccc atctctacta 77461 aaaatacaaa aatgagcctg gtgcggtggc acacccgtta ggcctagcta ctcaggaggc 77521 tgaagcacaa gaattgtgtg aacccaggag gcggaggttg cagtgagttg agattgggcc 77581 actccattcc agcctgagag gcagaacaag actctgtctc aataaacaaa caaacaaaca 77641 aactgtccag gtgtggtggc acagccctgt agtcggagct aataaagaag ctgaggtggg 77701 aggatcgctt gagcccagga tatggaggct gcggtgagct atgatctcac cactgcactc 77761 cagcttgggg gacagggcaa gtctgtctca aaaaaataaa agaaattgaa tacattgata 77821 ttttgccagg accctgcctt ctacaggcat ctagtctaat gggactggga gtaatcaagg 77881 cagatgacct aatcccagtg tccaggatgt aactagagag ctacgggcat gcagaagttg 77941 gaagatgagg gaaggcatca cagaggctgt ggggtgaact gatttcaagg aatgggtcct 78001 tcccttcaga gccacatgtg tgcgggacac ccagacagaa aacacaaaca caaagtcgag 78061 tggagggcat ttggaaggag cagtgaagcc gagccaggaa ataccaagat ggcgagccag 78121 tgtgcttgta gagattgtag agagggtaga attgacactg tggaccctgg cctcgataga 78181 gaaaggcatc agctaaggaa gttgttcagg tgggcagtga ggttgtcgtg ctttggaaag 78241 atgttcaggc tgcactagga agccccctgg cttggggaga gactccagga gaccccagcg 78301 gggagcattt gacagtaaat tcgagtgatg cgagggggac ctgaactgtg gcctctgtca 78361 tgggaaccca gaggaggtcg atggcatttg tggttgatgt gggaaggaga gagagagaag 78421 aaccagaaac gtctgcttgc tggaggaagc ggcatgtccg ctcctccact ccttttcttt 78481 tccccttagg agcggtttat ggttcctttt gttttattct tttatttgta cactggcatt 78541 ggagtttgtt tttttggctt tttttttttt tttttttttg agaaaaagtc tcactctgtc 78601 acccaggctg gagtacagtg gctcgacctt agcttactgc aacctccacc tcctgggttc 78661 aaagggttct cttgcctcag cctcccgagt agctgggatt acagatgcac accaccacgc 78721 ccagctaatt tttctatttt tagtagagac gggatttggc catgttggcc aggctggtct 78781 cgaactgctg acctcaggtg atccgcctgc ctcggcctcc caaagtgctg ggattacagg 78841 cgtattccac tgtgcccagc ctgagtttct gtttagaaac aacagtctat gatagtataa 78901 tcctctcttt tttgtacaca gagtaaagag gacaaatagg tgaaagaata aatgaaaggc 78961 tggaatccca cttcccccgc tgtcccaggg cattggatat tgacggatag gaggcagcaa 79021 accactcaca gagccaggaa gaaatgaatg cgttggtatt gccaggaggg gaagccggcc 79081 cggctgaaat acgctatgac catagccagg agatactgat ggagagaaag gaacacagag 79141 agggagaggt cacatcttgg aagaggaaga ttgtggagag ggggaatgag ggtctgggga 79201 ggggctgccc atcagagaag ggacctcagt gttggggtga ctactcattt ggaaattgcg 79261 ggatggaggg gtattcgaag gtcggatgca aatccgagaa gccagaggaa gggttttggg 79321 tgatgctccc aggatggtgg gctccgatgg gatctttgga gggggtgtgt ctaggtcggc 79381 tggtgtcagg agggtctttt gtgtgccagg cagagaactg tcccgaagag ctgagagtag 79441 aggggccagg agcttcaggg ctgcggccag actgtggccc agagctcaga tcccaaagga 79501 cccataggag aggcaggggc cactcattca ctctgcaaga gaccagcaga atcctgaggg 79561 agatgctgac aaatcataaa aagaccaaga atagccggga gtggcggctc aagcctgtga 79621 tcccagtact ttttgagagg tggagacagg aggatcatgt gagcccaaca gttcgagaac 79681 aacctgggca acatagtgag accctgtttc tacaaacatt tcaaaaatta gttgagcatg 79741 gtggcatgtg cctagtccca gctcctcagg aggctgagga aagaagattg cttgagccca 79801 ggaattagag gctgcaatga gctatgatca tgccactgca ctccatcctg gggagcagag 79861 ctagactctg tctcacaaaa aaaaaatttg tgggtgccaa gactcaagac catgggagct 79921 ggtcgggcac agtggctgac gtctataatc tcagcacttt gggaggccaa ggcgggtgga 79981 tcgcctgagg tcaggtgttc aggaccaacc tggccaacat ggcaaaaccc cgtttctact 80041 aaaaacacaa aaattagcca ggcgtggtgg ttcatgtctg taatcccagc tgcttggagg 80101 ctgaggcagg agaatcgctt gaacccagga ggcatcggct gcagtgagtc aagatcgaga 80161 cactgccctc cagcctgggc aacagagcaa gactctgtct cacacacaca cacacacaca 80221 cacacacaca aaaaaaaaaa aaaaaaaaaa gactgtagga gcatctggtg ggaggtggtg 80281 gagggagaac tgtgggtttg gaagctgcgc cctcccccca gccatgcgtt ggaacaggaa 80341 cagttacatg gagaacaacc ttaccttgtc cgacaccctc agatctttgt cccaggccaa 80401 gaatctttta atgacaggat cctctgtgat tagagagcag atgtcagtgt gagaagcagg 80461 acagggtttc cgtgggagca gcagggcagg gaggagaagt gtgcctcccg gggggaagtc 80521 tcaggattgt ggccgcgggt gaggtggatg ggagagggga gaatgacttt cactgggcaa 80581 gggagagagg ctcctgctct gagactcccc tgagaagagg ccgaaggagg ccctgggtgt 80641 gagaatctac aggatgtaga gctgggaatc agccaggacc ccctccagca gacacggagg 80701 gaccactgca gagtcataaa ggaattccca tcatttcctc atgagacagt cacatcaggg 80761 tgtgaccatg gccttggtat cccccactat ggatggagac acttaggttt agaaaagtca 80821 gtaagagaca ttaagtttca gagggcacag ctgaaaccac tttctttgtt tattgatttt 80881 gtttttcttt atttgatttt tatttttatt tatttattaa tttattttga gacagagtct 80941 tgctctgtgg gccaggctgg aatgcagtgg cctgatcttg gctcactgca acctctgcct 81001 cccgggttta agcgattctc ctgtctcagc ctcccgagta gctgggatta catgcatgag 81061 ctactgtgcc cagccttggt ttttcttttg agacagggtt ttgctctgtc acccaggctg 81121 gagtgcagtg gtgtagtcat agctcactgc agcctcaaag tcctgagttc aagcaatcct 81181 cttgcctcag cctcccaacg tgctgggatc tcaggcggga gccactgcgc ctggcccgaa 81241 accaagcttt cttatcccaa gcgctgacct ttatcaagtt gacctaatcc tttatcatct 81301 cctaagtgtc cctcatgagt gatcacttca cattcctccc acatggagag ctcacccact 81361 ggggcctatt tttcccattg gaaaagtgtg gttattggaa gtttcctgtt tttggaaaga 81421 acaggattgg aggtgctctc tggggtgtcc tcctaccaag cagcctgttg aaggcctcgt 81481 ggtgctcagg gagcacgagc gacactcgcc gtcgcttcag cttcatcttg aggccacaca 81541 gcatctccgc cacccagatc tcctcaggct caggggcgag caccttccgt ggctcctcct 81601 ccaacgactc ctcagattcg tcccaccact ccatcttcct tttccagcaa aaggacctat 81661 gcggggggct gggatctacc ccaggggctg agtaaagaaa ccaggccacg gtgtaatgct 81721 tctgcagttg atcacactag agcccgaccc aaaaccccaa accactctcc atcctcccca 81781 gcctcgcaga ctgctggctt ctccaagcca tctttccttc tgtctgtctc ctctgctgag 81841 ctgcatgtgc cgctccttct cctccccatt ctcccgtttt tctgtcctca gaacacttcc 81901 tcatatcctt ccctggtccc tggctctctg agtccctttt tttttttttt ttttttgttg 81961 ttgttgttgt tgttgagaaa cagtcttgct ttgtggccta ggctggagtg tagtggtgcg 82021 atcttggctc actgcaacct ctgcctcccg ggttccagtg attctcctgc ctaagcctcc 82081 caagtagctg ggattacagg tgcccaccag aacgcccagc tcatttttgt gcttctagaa 82141 gagacagggt ttcaccatgt tggccaggct ggtctccaac tcctggcctc aagtgatctg 82201 cctgcctggc ctcccaaagt gctgggatta caggtgtgag ccactgcacc ctgcctcagt 82261 acctccattc ttcccacaca ccctcctcat gtgctccttc ctgacttctg ggcccttcct 82321 tccttctttt tttttttttt tttttttttt tgagacagcg tctcactctc tcacccagaa 82381 tggaatgcag tggcgctatc ttggctcaaa gcaacctctt ccacctgggt tcaagcgatt 82441 atcctgtctc agcctcctga gtagctggga taacaggcat gcctggctaa tttttgtatt 82501 gttagtataa atgaggtttc gctatattgg tctggttggt ctcgaacaac tgacctcaag 82561 tgatccaccc atctcagcct cccaaagtaa tgggattaca ggcatgagct accacacctg 82621 gccttcgttt ttcttttgac acagggtttt gctctgtcac ccaggctgga gtgcagtggt 82681 gcagtcatag ctcagtgcag cctcaaagtc ctgagttcaa gcaatcctct tgcctcagcc 82741 tcccaacgtg ctagcatctc aggcgtgagc cactgcacct ggcccgaaac caagctttct 82801 catcccaagc gccaaccttt atcaagtcta gcctagtctt ctatcgtctc ctaagtgtcc 82861 ctcatgagtg atcacttctg agtcctcctg cgtggagagc tcagccactg ggggcgtatc 82921 tttcccattg gaaaagtgtg gttattggaa gtttcctctt tttagaaaga acaggattgg 82981 aggtgctctc tggggtatcc tcctaccaag ctgactgttg aagtccttgt ggtgctcagg 83041 gaggatgggt gacactcgct gttgcttcag cttcatcttg agcccacaca gcatctccac 83101 tacccaggtc tcctcaggct caggggcgag ctccttctcc ggctcctcct cagattcatc 83161 tgaccactcc ctcttccttt tccagccaag ggacctacat ggggggctgg gatctacccc 83221 aggggctgag taaagaaacc aggccaccgt gtaatgcttc tgcatctgat caccttagac 83281 cccgacccca aaccccaaac cactctccat cctccccaga cttgcagact gctggcttct 83341 ctaagccatc tttctgattt tctcctctgc tcaaccccat gtgccgctcc ttcccctccc 83401 cattcttctc tctctctgtc ctccgaacac tgcttcatgt ccttccctgg tccctggctc 83461 tctgagtccc tccttttttg ttttgttttg ttttgacaca gaatcttgct ttgtcaccca 83521 ggctggagtg tagtggtgca atctcagctc actgcaacat ccatctcctg gattccattt 83581 attcttctgc ctcagcctct caggtagctg ggattacagg tgcctgccat aatgcccagc 83641 tcaattttgt acttttagta gagacagggt ttcaccatgt tggccaggct ggtctcaaac 83701 tcctggcctc aagtgatccg cctgccttgg cctcccaaag ttctgggctt acaggtgtga 83761 gccactgcac ccagcctgaa tttctccatt cttcccacac accctcctca ggttctcctt 83821 cctgaccgct gacccttctt ttcttttttt tttttttttg gagtgcagta gtgtgctctc 83881 agctcactgc aacctcttcc tcccagtctc aagtgattct cctgtctcag cctcctgagt 83941 agctgggatt acaggtgtgc accactacaa cttggctaat ttttatactt ttagtagaga 84001 tggggtttca ccatattggc caggctggcc ttgatctcct gacctcaggt gatccgcccg 84061 cctcggcctc ccaaagtgct ggggttacag gcgtgagcca ccgcacccgg cccccttcct 84121 tcgtcttagt caatcctatc ccacctcttc ttccaccagt cccctcacct gatggtccca 84181 acatttcatc atccaccacc tcctggaggg ggtaccccga ggtgctccgc tggggactct 84241 gctcattctg ggggtgcggt tgacggctgg tcgtgatctt tcccgtaatc tgtcccctct 84301 tacggaacct agtctccgtt ctgtccatgg ccttcttctg gacacttcta ggatccagaa 84361 gagtatgtta tcaattctca agcctaggag aagtcaggag tagagaacag ctctgagaag 84421 atactgttgt ccaactgatc tccaggcacc acggagtccg gtccctccaa tcaggaaggt 84481 cggaatctct gatgtcatcg ttcatgccaa cctggcaacc agtttgaaaa aaaacacatg 84541 taactgccag gctgatctct tgtcctggag atcctgggtg aatggtatct cctgccactg 84601 tcccaacctc agaccattgt ccaaaagcat cttcagggac tccacatccc tctattccct 84661 gtcccagcag aggctgtgtc ctctccactc aaagcctgaa gcatgttggg gtctcttcgt 84721 ctctgtacat gcccatttca gagtccagtc tggtgggaga gggaacagag tgggaaagaa 84781 aactagggta agcagaaacg atgaaacctt ataagagtga gagtatcatg tacaagagtg 84841 agattatcat gtacaagagt gagattatca cgtacaagag atcccaggaa tactgacttg 84901 atgaaaaagt cacatcagag cactcagttt ggcagagctt ttctgccgaa tgtttactca 84961 cattcactgt ccgagattct atactggggg tacacacgtc ctctgcccta aggcaatttt 85021 gagtccaaga gacattttga ggcctaaaaa tcataggaaa ctgcccctga gctcacacat 85081 atttccaatg gtgtccccag tttcagggaa tccatggatt acctaagcca gcccctccag 85141 ttcggctaag aaactctagt ctatatatca agttttgtat catatgtatt gctctgaact 85201 cagaaatttc ccttccattt atggattcta tgaataaaat atcacatgta caaaaagact 85261 aagtcaaaaa atttcagctg tgcacagtgg ctcatgcttg taatcccagc actttgggtg 85321 gccaagggag gaagattgcc tgaggccagc agttcgagac cagtataggc aacatagcaa 85381 gagcccatct ctaaaaaaac aaaaccaaac caaattagcc aggtgtggtg gctggcacct 85441 gtgttccaac tacttgggag actcatgtga caggaagatc acttgagccc aggagttaga 85501 agctgcagtg agccgtgatc ttgccactgc actccagtct gggcaacaca gcaagatact 85561 gtgtcaaaaa aagttttttt gataaaaaat aaaagagtta catgacattc agagaccatc 85621 caaaaaacct gtgggttccc ggctgggctc agtggctcat gcctgtaatc ccagcacttt 85681 gggaggccaa agtgggtgga tcacttgagg tcaggagttt gagaccagcc tggacaacat 85741 ggtgaaaccc catctctact aaaaatacaa aaaattagcc aggcatggtg gtggatgcct 85801 gtaatcgcag ctactcagga gagggcgctg gagaatcact tgaactcatg gtgcgcaggt 85861 tgcagggagc caagatggca ccattgtgct ccagcctggg caacgagagc aaaactccat 85921 ctcaaaaaaa ataaagaacc tgcgagtgag ttcccacacg ttttcctaat gggctgctgc 85981 tttcctagga gtctctcgct catagaaaag gcacaaactg aaagaggaag cagatcccat 86041 tgctgtggaa gtcccattgt taggaagctc tgcttttctg gagttcaaat tcgcattcat 86101 gacgctttaa accgtcagag ctgggtgggt cctcctacaa caaaatagtt tgctctctct 86161 ctcctagtta acaggctttc aaatattaga agatcaatgt tctgacccca ttaaaatttc 86221 tcttttgtgg aatgaaaagc tctgatttaa cccatcttca aggctggttt gatggaggaa 86281 taggggctga gtcacctgca tttcccctcc ctgcacaaag tcctgggccc agatctgggg 86341 tctgtctctg ctgagggtgg ggtgaaccag gaagcacctc cttcttcatc tccttgatga 86401 atgggtataa tggttgccat ggaactgggg cttgtttgat gacctggggc tgggtgggcc 86461 tctgagagcc tttatagctg attgcctttt gggagagggg aggtgggagc cccaccctgt 86521 ctcatgagtc accccaaagg tgcatgggca ggcaggtgct ggggaatcgg ctactcccca 86581 gagcttggcg tggccatccc tgtggcccct ctgggagtct ggagcccatt ccctcacact 86641 ggtactcact gcagctgggg acatctgcac taggaagaca ggacacggca tggaagctgg 86701 cctctgccca gaagccatga cattctggtc accagcctga tgctataaaa cgagtgtcac 86761 ggccgggcat ggtggttcac acctgtaatc ccagcacttt aggaggccaa ggcgggtgga 86821 tcatgaggtc tggagttcga gaccagcctg gccaacatgg cgaaatcccg tctctactaa 86881 aaataagaac attagccagg tgtggtggca catacctgta gtcccagctc ctctggaggc 86941 tgaggcagga gaatcactta aacccaggag gcggagattg cagtgagccg agaccacggc 87001 attggactcc aggctgggca acagagcacg actccatctc aaaaacaaac aaaaaaaaag 87061 agtgtcacct ggggctactt ggccagacac agagagcaag gagacatccc tattatctgt 87121 caaaaataat tgttggggct gagcacagtg gctcatgcct ataatctcag cactttggga 87181 ggtcggggca ggaggacttg aggcctagag tttgagacca gcctgggcaa catagcgagc 87241 accccatctc cagaaaaaat ttaaaaattg gctgggcgca gtggctcatg cctgtaatcc 87301 cagcactttg ggaggccgag ggggatggat catttgaggt caggagtttg agaccagcct 87361 ggccaacgtg gtgaaacccc atctctacta aaaatacaaa aattagccgg gcatggtggt 87421 gggcacctgt aatcctagct acttgggagg ctgaggcagg agaatcgctt gaacccagga 87481 ggcggtggtt gtagtgagct aggatcatgc cattgcagtc cagcctggac agcaaagcta 87541 gactccatct caaaaaagaa aagaaaaagt aaaaaattta aaaattagat gggcatggtg 87601 acatgtgcct gtaatcccag gtactaagga agctgaggta ggaggatgac ttgagcctag 87661 gagttcgagg ctgcagtgag ctctgatcgc accactgcac tccagcctga gtgacacagc 87721 aagaccctgc ttcaaaaaaa aaaaaaaaat tactggacac aattattgtg ccagacccct 87781 aggttaacag tgaggattca gtgggagaac taaacagata aacagatcca gtccccgccc 87841 tcagagagtt acagtttaat gggaaaaaga gacattcacc aaagaaggac acagcccact 87901 agtgagttac actcgagagg gatcatttac agaacaaagc agattataaa aatacagcga 87961 ttggctgggt gcagtggctc acgcctgtaa tcccagcact ttgggaggca gaggcgggcg 88021 gatgacttga ggtcaggagt tctcaaccag cctggccaaa atggtgaaac cccatctcta 88081 ctaaaaacac aaaaattaac caggcgtggt ggtgggcacc tgtaatccta gctacttggg 88141 aggctgaggc aggagaatga atcgcttgaa cccaggaggt ggaggttgca gtgagccgag 88201 accacaccat tgcactctag cctgggcaac aagagcaaaa ctccgtctca aaataaatac 88261 atacatacat acatgcatac atacatacag ggattaaaat agtctagtac tgacacctga 88321 acagacagat tgatccaaga aatgaaacag aaattccaga agttgacctg aacacacaca 88381 cacacacaca cacacacaca cacacacaca cacacacacg aaggcgtgaa agactccatg 88441 accctcaagg tataagatgc attttttttt ttttttgaga cagggtctca ctctgtcacc 88501 cagactggat gcagtggtgc actatcccag ctcagttcta ccctccatcc ccccaacctc 88561 ccccaaccac cctgagctca agcaattctc atgcctcaac cctcagcctc atgagtaact 88621 gggactacag gcgtgcacca ccatgcgcag ctaatttttt gtatttttag tagagatggg 88681 tctaaccata ttgcctaggc tggtctcgaa ctcctgagct caagcgatcc tcttgcctca 88741 acctcccaaa gtgctgggat tacagctgtg agccaccgca cccggccgca ttcttctaaa 88801 tcacagtaca tctggttccc agtgcccagg ctctcagggc agagggtcca gtgtgatcac 88861 tttgcatggc ctctctcccc tcctgagctt gtgccagggc cccagggctg acctggagaa 88921 ggaaaatggc agagggtgaa gatggggtgt ctggtttggg gaccatcctg gccccccttg 88981 tcactgttgg catctcttct gcacagtggc attgctggga ggtgcttact gtgcctattc 89041 aaggggctgg cagccgcagc ctcactgcag atcagggact tggcttcccg gttgaccaca 89101 ggtccaagaa cctgcagggt ccagcctccc ccccatcccc agtcttcccc accctggccc 89161 ggccctccag gtgcagaaac atgcaggccc ctctccagga ctgtgggagg agtgtgtccc 89221 tcagactggc ctgtgtcctg gctcctctta ccacctcttc cagaggttgt cacctgcagc 89281 tgccccagga taaaggcaag gccagagagg actcctgaac tcctgtgtgc ctggggtggc 89341 aggggcaaac atagccaact ggtggcctga gcagggccat ggtgaggaca cccttggtgg 89401 cttgtcccac atcaagctgg gaggtgacac tgaggatgca ttagtctgca gcgtatgata 89461 aaaacggcat ttcaggccag gcgtggtggc tcatgcctgt caccccagca ccttgggagg 89521 ccgaggtggg cagatcatat gaggtcagga ctttgagacc agcctggcca acatggtgaa 89581 aactcatctg tactaaaaaa acaaaaatta tgtgggttgg tggtgtgcgc ctgtaatccc 89641 agctacttgg gaggctgagg caggagaatc acttgaacct gggaggcgga ggctacaacg 89701 agccgaaatt gcaccactgc actccagcct gactccgtct caaaaaaaaa aaaaaaaaaa 89761 aaaaggcatt tcagttcaaa tagggaaagg atacatcttt ctttctttct ttctttcttt 89821 ctttctttct ttctttcttt ctttctttct ttctttcctt tctttgtttt tctttcctcc 89881 cttcctttat ctctctctct ctctctcttt ctctctttct ttttgagatg gagtttcact 89941 ctcgctgccc aggctggagt acaatggtgc gatctcggct tattattatt ctccatgttg 90001 gtcaggctgg tcttgaactc ccaacctcag gtgatccgcc tgcgttggcc tcccaaagtg 90061 ctggtgtgag ccactgcacc cggcttagga tgcatttttc aatattttag tgtttgaata 90121 acgggctaac ttgagaaaaa aataatttga atcacacatc acaccaaaaa tctaggtgga 90181 ttttaacact ttcaaaaatt attattagtt tagagacagg gtctcactcc gtcgcctagg 90241 ctggagtgca gtggtatgat catggttcac tgcaacctta aactcctggc ctcatatgat 90301 cctccggcct cagcctctca aagtactgga actacaaaca tgcaccacca cgcccagcct 90361 aggtgggttt ttaaaatcca ttcaagggcg ggtgcagtgg ctcacacctg taatcccagc 90421 attttgggaa gccaaggtgg gaggatcact tgagcccagg agttcgagac cagcctgggc 90481 aacatagtga gaccacatct ctacaaaaaa tttaaaaatg agccaggcat ggtggtgcac 90541 acctgtagtc tctgctattc aggaggctga ggcgggatca ttgtttgagc ccaggagaca 90601 gattgcagtg agctatgatg gcaccactgc atggcagcct gggtgacaaa gggagactca 90661 gtctcaaaaa aaaaaaaaaa aaggtgacag gctgggtgca gtggctcaca cctgtaatcc 90721 cagcactttg ggaggccgag gcaggtggat cacctgaagt caggagtttg agaccagcct 90781 ggccaacatg gtgaaaccct gtctttacta aaaatactaa agttggccag gcgtggtggc 90841 gtgtgcccgt aatctcagct acctgggagg ttgagacagg agaatcactg gaaccccaga 90901 ggcagaagct gcagtgagct gagattgtgc cactgcactc cagcctgggt gacagaccga 90961 gactccaact caaattaatt aattaaattt taaaaaaaaa ggcaaagagg gagtcgtgcc 91021 tgggtacagg agacctgggg ttgggtccca gccctgccgc tgacctgccc tgggactgca 91081 ggaagtttct ttccctgccg ggccccaggt ttcttctcat ccgtataatg agattagtca 91141 taacacctgc cctgcccatc tctggggacc caaggagagg atgagtcgat gagcaagaac 91201 gtattttcaa aaaggggaaa caagcccttc ccacattctg atgtctgctt tcaactgcct 91261 tttgaaaggg agtgagagag ggaaaaattg tattatacag gccggacatg gtggctcaca 91321 cctggaatcc cagaacttca ggaggctgag gtgggaggat cacttgagcc caggaatttg 91381 agaccagcct gggcaacaca gtaagactct ctttctacaa aaagttaaaa aattagctgg 91441 gcagccggac acagtggctc acacctgtaa tttcagcact ttgggaggcc gaggcaggca 91501 gatcacatga ggtcaggggt tcaagaccag cctggtcaac atggcgaacc ccgtctctac 91561 caaaaataca aaaattaacc aggagtggtt gtgcacgcca gtaatcccag ctacgcagca 91621 ggctgaggca ggagaatcac ttgaacccag gaggtggagg ttgcagtgag ccgagatcgt 91681 gccattgcac tccagcctga acgacagtga gactctgtct caaaacaaac aaaaaattag 91741 ctgggcatgg tgccttatac ctgcagtctc agctactagg gaggctgaag tgggaggatc 91801 acttgatcct gggaggtcaa ggctgcagtg agctatgatc acaccactgc actccagcct 91861 gggcaacaga gtgagacccg tctcaaaaaa taataataat ataataaata aaagagaccc 91921 cagagagctt ccttgaccct tccaccatat gaggacacag ctagaaggca ctctctacga 91981 accaaaaaca gacaccaact cttcctagtt tttgtttttg ttcttgttct ttttttttta 92041 agatggagtc tcattttgtc acccaggctg gagtgcagtg gtgtgatctc ggctcactac 92101 ggcctcccca tcctgggtcc aagcgattct tgtgcctcaa cctcccgagt agctggaatt 92161 acaggcgcgt gccaccacga ctggctaatt ttgtatcttt agtagagacg gggtttttcc 92221 acattggtca ggctggtctc aaacttctga cctcagatga tccgcccacc ttggccaccc 92281 aaagtgctgg gattacaggt gtgaaccact gcgcccggcc agaagttttt cttgtgagtt 92341 gaggtgaaca tctgaatccc ctcggggaac tgaggtatgt ccctttcagg gaatggcccc 92401 tcttcatcca aaaatggggc ccacaggtga taatctgttg ggtttcccag aatgaagaca 92461 tctggcataa ttagagcgtg aaggtcaggc tgacgggggc tagtaccccg atccagggga 92521 gaggtggtca caggtctcgc tccacctgcc ctgagcccca tctcatcttt tttttttttt 92581 tttttttttt ttgaggggga gtttcactct tgttgcccag gctggaatgc agtgatgcga 92641 tctcggctca ttgcaacctc tgcctcctgg gttcaagtgg ttctcctgcc tcagcctcct 92701 gcgtagctgg agttacaaga acaagccacc acgtccagct aattttgtat ttttagtaga 92761 gatggggttt cgccgtgttg gccaggctgg tctcgaactc ctgacctcag gtgatccacc 92821 cacctcggcc tctcaaaatg ctgggattac aggcgtgagc caccatgcct ggacttccat 92881 ctggtaattt tctgtctcga ggaggaacta ccctgccctg acagaggctg gggatgaagg 92941 aggaagagac agccggtccc ctggataaag aggtgagggg gtgtgaagtc tgggagcccc 93001 cctcccaggg ccaggctaca gggggacagc agggctatac caggagtgcc acagagcttg 93061 catttgggtg tgatctaatc cccagtcctg ccctgacctc ctgggtgctc ttagccaagc 93121 cctcttccct gtgtgtgcta gaaagtgagg gtcactccct tccacacaag ggagtgaatt 93181 gagggtgggg atctgagctc tctgagcagc aaagctggac acaggccaaa ttgccaatcc 93241 tgagtccctc tgtcatgtgg acctgggaag ctt // LOCUS AF000237 1522 bp mRNA PRI 30-JUL-1997 DEFINITION Homo sapiens lysophosphatidic acid acyltransferase mRNA, complete cds. ACCESSION AF000237 NID g2282589 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1522) AUTHORS Eberhardt,C., Gray,P.W. and Tjoelker,L.W. TITLE Human lysophosphatidic acid acyltransferase. cDNA cloning, expression, and localization to chromosome 9q34.3 JOURNAL J. Biol. Chem. 272 (32), 20299-20305 (1997) MEDLINE 97390478 REFERENCE 2 (bases 1 to 1522) AUTHORS Eberhardt. C., Gray,P.W. and Tjoelker,L.W. TITLE Direct Submission JOURNAL Submitted (17-APR-1997) ICOS Corporation, 22021 20th Ave. S.E., Bothell, WA 98021, USA FEATURES Location/Qualifiers source 1..1522 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="9" /map="9q34.3" CDS 67..903 /codon_start=1 /product="lysophosphatidic acid acyltransferase" /db_xref="PID:g2282590" /translation="MELWPCLAAALLLLLLLVQLSRAAEFYAKVALYCALCFTVSAVA SLVCLLRHGGRTVENMSIIGWFVRSFKYFYGLRFEVRDPRRLQEARPCVIVSNHQSIL DMMGLMEVLPERCVQIAKRELLFLGPVGLIMYLGGVFFINRQRSSTAMTVMADLGERM VRENLKVWIYPEGTRNDNGDLLPFKKGAFYLAVQAQVPIVPVVYSSFSSFYNTKKKFF TSGTVTVQVLEAIPTSGLTAADVPALVDTCHRAMRTTFLHISKTPQENGATAGSGVQP AQ" BASE COUNT 263 a 492 c 502 g 265 t ORIGIN 1 cgcgggggag aagcgggagc gggagcggga gcgagctggc ggcgccgtcg ggcgccgggc 61 cgggccatgg agctgtggcc gtgtctggcc gcggcgctgc tgttgctgct gctgctggtg 121 cagctgagcc gcgcggccga gttctacgcc aaggtcgccc tgtactgcgc gctgtgcttc 181 acggtgtccg ccgtggcctc gctcgtctgc ctgctgcgcc acggcggccg gacggtggag 241 aacatgagca tcatcggctg gttcgtgcga agcttcaagt acttttacgg gctccgcttc 301 gaggtgcggg acccgcgcag gctgcaggag gcccgtccct gtgtcatcgt ctccaaccac 361 cagagcatcc tggacatgat gggcctcatg gaggtccttc cggagcgctg cgtgcagatc 421 gccaagcggg agctgctctt cctggggccc gtgggcctca tcatgtacct cgggggcgtc 481 ttcttcatca accggcagcg ctctagcact gccatgacag tgatggccga cctgggcgag 541 cgcatggtca gggagaacct caaagtgtgg atctatcccg agggtactcg caacgacaat 601 ggggacctgc tgccttttaa gaagggcgcc ttctacctgg cagtccaggc acaggtgccc 661 atcgtccccg tggtgtactc ttccttctcc tccttctaca acaccaagaa gaagttcttc 721 acttcaggaa cagtcacagt gcaggtgctg gaagccatcc ccaccagcgg cctcactgcg 781 gcggacgtcc ctgcgctcgt ggacacctgc caccgggcca tgaggaccac cttcctccac 841 atctccaaga ccccccagga gaacggggcc actgcggggt ctggcgtgca gccggcccag 901 tagcccagac cacggcaggg catgacctgg ggagggcagg tggaagccga tggctggagg 961 atgggcagag gggactcctc ccggcttcca aataccactc tgtccggctc ccccagctct 1021 cactcagccc gggaagcagg aagccccttc tgtcactggt ctcagacaca ggcccctggt 1081 gtcccctgca gggggctcag ctggaccctc cccgggctcg agggcaggga ctcgcgccca 1141 cggcacctct gggagctggg atgataaaga tgaggcttgc ggctgtggcc cgctggtggg 1201 ctgagccaca aggcccccga tggcccagga gcagatggga ggaccccgag gccagacgca 1261 cactgtccga gccctctgct cagccgcctg ggacccacca gggtgcagct gggctccagg 1321 gtccagccca caagctgcat cagggtctct gggagaggag gggcctccag ggccaggagt 1381 cccagactca cgcaccctgg gccacaggga gccgggaatc ggggcctgct gctcctgctg 1441 gcctggaaga ctctgtgggg tcagcactgt actccgttgc tgttttttta taaacacact 1501 cttggaagtg gaaaaaaaaa aa // LOCUS AF000364 2644 bp mRNA PRI 16-JAN-1998 DEFINITION Homo sapiens heterogeneous nuclear ribonucleoprotein R mRNA, complete cds. ACCESSION AF000364 NID g2697102 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2644) AUTHORS Chan,E.K.L., Mathison,D.A., Portman,D., Dreyfuss,G., Steiner,G., Tan,E.M. and Hassfeld,W. TITLE Molecular definition of heterogeneous nuclear ribonucleoprotein R (hnRNP R) using autoimmune antibody immunological relationship with hnRNP P JOURNAL Nucleic Acids Res. 26, 439-445 (1998) REFERENCE 2 (bases 1 to 2644) AUTHORS Hassfeld,W., Chan,E.K.L. and Tan,E.M. TITLE Direct Submission JOURNAL Submitted (18-APR-1997) Molecular and Experimental Medicine, The Scripps Research Institute, 10550 N. Torrey Pines Road, La Jolla, CA 92037, USA FEATURES Location/Qualifiers source 1..2644 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="T24" /clone="BL14" CDS 62..1963 /note="hnRNP R" /codon_start=1 /product="heterogeneous nuclear ribonucleoprotein R" /db_xref="PID:g2697103" /translation="MANQVNGNAVQLKEEEEPMDTSSVTHTEHYKTLIEAGLPQKVAE RLDEIFQTGLVAYVDLDERAIDALREFNEEGALSVLQQFKESDLSHVQNKSAFLCGVM KTYRQREKQGSKVQESTKGPDEAKIKALLERTGYTLDVTTGQRKYGGPPPDSVYSGVQ PGIGTEVFVGKIPRDLYEDELVPLFEKAGPIWDLRLMMDPLSGQNRGYAFITFCGKEA AQEAVKLCDSYEIRPGKHLGVCISVANNRLFVGSIPKNKTKENILEEFSKVTEGLVDV ILYHQPDDKKKNRGFCFLEYEDHKSAAQARRRLMSGKVKVWGNVVTVEWADPVEEPDP EVMAKVKVLFVRNLATTVTEEILEKSFSEFGKLERVKKLKDYAFVHFEDRGAAVKAMD EMNGKEIEGEEIEIVLAKPPDKKRKERQAARQASRSTAYEDYYYHPPPRMPPPIRGRG RGGGRGGYGYPPDYYGYEDYYDDYYGYDYHDYRGGYEDPYYGYDDGYAVRGRGGGRGG RGAPPPPRGRGAPPPRGRAGYSQRGAPLGPPRGSRGGRGGPAQQQRGRGSRGSRGNRG GNVGGKRKADGYNQPDSKRRQTNNQQNWGSQPIAQQPLQQGGDYSGNYGYNNDNQEFY QDTYGQQWK" BASE COUNT 815 a 476 c 665 g 688 t ORIGIN 1 gcgcctcgcg ctgattctca cgggcccggc tgccggcccc cgctctgccc tgcataataa 61 aatggctaat caggtgaatg gtaatgcggt acagttaaaa gaagaggaag aaccaatgga 121 tacttccagt gtaactcaca cagaacacta caagacactg atagaggcag gcctcccaca 181 gaaggtggca gaaagacttg atgaaatatt tcagacagga ttggtagctt atgtcgatct 241 tgatgaaaga gcaattgatg ctctcaggga atttaatgaa gaaggagctc tgtctgtact 301 acagcagttc aaggaaagtg acttatcaca tgttcagaac aaaagtgcat ttttatgtgg 361 agttatgaag acctacaggc agagagagaa acaggggagc aaggtgcaag agtccacaaa 421 gggacctgat gaagcgaaga tcaaggcctt gcttgagaga actggttata ctctggatgt 481 aaccacagga cagaggaagt atggtggtcc tccaccagac agtgtgtact ctggcgtgca 541 acctggaatt ggaacggagg tatttgtagg caaaatacca agggatttat atgaggatga 601 gttggtgccc ctttttgaga aggccggacc catttgggat ctacgtctta tgatggatcc 661 actgtccggt cagaatagag ggtatgcatt tatcaccttc tgtggaaagg aagctgcaca 721 ggaagccgtg aaactgtgtg acagctatga aattcgccct ggtaaacacc ttggagtgtg 781 catttctgtg gcaaacaaca gactttttgt tggatccatt ccgaagaata agactaaaga 841 aaacattttg gaagaattca gtaaagtcac agagggtttg gtggacgtta ttctctatca 901 tcaacccgat gacaaaaaga agaatcgggg gttctgcttc cttgaatatg aggatcacaa 961 gtcagcagca caagccagac gccggctgat gagtggaaaa gtaaaagtgt ggggaaatgt 1021 agttacagtt gaatgggctg accctgtgga agaaccagat ccagaagtca tggctaaggt 1081 aaaagttttg tttgtgagaa acttggctac tacggtgaca gaagaaatat tggaaaagtc 1141 attttctgaa tttggaaaac tcgaaagagt aaagaagttg aaagattatg catttgttca 1201 ttttgaagac agaggagcag ctgttaaggc tatggatgaa atgaatggca aagaaataga 1261 aggggaagaa attgaaatag tcttagccaa gccaccagac aagaaaagga aagagcgcca 1321 agctgctaga caggcctcca gaagcactgc gtatgaagat tattactacc accctcctcc 1381 tcgcatgcca cctccaatta gaggtcgggg tcgtggtggg gggagaggtg gatatggcta 1441 ccctccagat tactacggct atgaagatta ctatgatgat tactatggtt atgattatca 1501 cgactatcgt ggaggctatg aagatcccta ctacggctat gatgatggct atgcagtaag 1561 aggaagagga ggaggaaggg gagggcgagg tgctccacca ccaccaaggg ggaggggagc 1621 accacctcca agaggtagag ctggctattc acagaggggg gcacctttgg gaccaccaag 1681 aggctctagg ggtggcagag ggggtcctgc tcaacagcag agaggccgtg gttcccgtgg 1741 atctcggggc aatcgtgggg gcaatgtagg aggcaagaga aaggcagatg ggtacaacca 1801 gcctgattcc aagcgtcgtc agaccaacaa ccaacagaac tggggttccc aacccatcgc 1861 tcagcagccg cttcagcaag gtggtgacta ttctggtaac tatggttaca ataatgacaa 1921 ccaggaattt tatcaggata cttatgggca acagtggaag tagacaagta agggcttgaa 1981 aatgatactg gcaagatacg attggctcta gatctacatt cttcaaaaaa aaaaattggc 2041 ttaactgttt catctttaag tagcattttg ctgccatttg tattgggctg aagaaatcac 2101 tattgtgtat atactcaagt ctttttattt ttcctctttt cataaatgct cttggacatt 2161 attgggcttg cagagttccc ttattctggg gattacaatg cttttatcgt ttcaggcttc 2221 attttagctt caaaacaagc tgggcacact gttaaatcat gattttgcag aacctttggt 2281 tttggacagt ttcatttttt tggatttggg atagattaca taggagtatg gagtatgctg 2341 taaataaaaa tacaagctag tgctttgtct tagtagtttt aagaaattaa agcaaacaaa 2401 tttaagtttt cttgtattga aaataaccta tgattgtatg ttttgcattc ctagaagtag 2461 gttaactgtg tttttaaatt gttataactt cacacctttt tgaaatctgc cctacaaaat 2521 ttgtttggct taaacgtcaa aagccgtgac aatttgttct ttgatgtgat tgtatttcca 2581 atttcttgtt catgtaagat ttcaataaaa ctaaaaaatc tattcaaaaa aaaaaaaaaa 2641 aaaa // LOCUS AF000367 4152 bp mRNA PRI 07-DEC-1997 DEFINITION Homo sapiens cdc14 homolog mRNA, complete cds. ACCESSION AF000367 NID g2662416 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4152) AUTHORS Li,L., Ernsting,B.R., Wishart,M.J., Lohse,D.L. and Dixon,J.E. TITLE A family of putative tumor suppressors is structurally and functionally conserved in humans and yeast JOURNAL J. Biol. Chem. 272 (47), 29403-29406 (1997) MEDLINE 98037751 REFERENCE 2 (bases 1 to 4152) AUTHORS Li,L. and Dixon,J.E. TITLE Direct Submission JOURNAL Submitted (18-APR-1997) Biochemistry, University of Michigan, 1301 Catherine, Ann Arbor, MI 48109, USA FEATURES Location/Qualifiers source 1..4152 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 398..2140 /codon_start=1 /product="cdc14 homolog" /db_xref="PID:g2662417" /translation="MKDRLYFATLRNRPKSTVNTHYFSIDEELVYENFYADFGPLNLA MVYRYCCKLNKKLKSYSLSRKKIVHYTCFDQRKRANAAFLIGAYAVIYLKKTPEEAYR ALLSGSNPPYLPFRDASFGNCTYNLTILDCLQGIRKGLQHGFFDFETIDVDEYEHYER VENGDFNCIVPGKFLAFSGPHPKSKIENGYPLHAPEAYFPYFKKHNVTAVVRLNKKIY EAKRFTDAGFEHYDLFFIDGSTPSDNIVRRFLNICENTEGAIAVHCKAGLGRTGTLIA CYVMKHYRFTHAEIIAWIRICRPGSIIGPQQHFLEEKQASLWVQGDIFRSKLKNRPSS EGSINKILSGLDDMSIGGNLSKTQNMERFGEDNLEDDDVEMKNGITQGDKLRALKSQR QPRTSPSCAFRSDDTKGHPRAVSQPFRLSSSLQGSAVTLKTSKMALSPSATAKRINRT SLSSGATVRSFSINSRLASSLGNLNAATDDPENKKTSSSSKAGFTASPFTNLLNGSSQ PTTRNYPELNNNQYNRSSNSNGGNLNSPPGPHSAKTEEHTTILRPSYTGLSSSSARFL SRSIPSLQSEYVHY" BASE COUNT 1279 a 815 c 841 g 1217 t ORIGIN 1 atcactttgg aagccggggg gaacactttg ccctgccctg agagctggtc tgcgtttccc 61 aggcgcggcg gcggcggagc agcagctgca gcagccgagt ccaaatagga gcggccacag 121 ccaggggcgt gtgcgccccg cgcggagcga gctcgggttc ccctcggaat gtccccgggg 181 cgcccggcgc gctgaccccg aagccgcctc cgccttcggc gcctgctgcc tccctcggcc 241 aggcttgttg ttcgggactg tgagcttcct ggctcctggg cagtggggaa gcccccgggg 301 gcgagtgacc tcagctggcc acgacccagc cctcccccgt gcgtatctcg cttaagatgg 361 cagcggagtc agggaactaa tcggggcttg tgagttcatg aaagatcggt tatattttgc 421 tactttaagg aatagaccaa aaagcacagt aaatacccac tatttctcca tcgatgagga 481 gctggtctat gaaaatttct atgcagattt tggaccgctg aacttggcaa tggtgtacag 541 atattgctgc aaactaaaca agaaactaaa atcatacagt ttgtcaagaa agaaaatagt 601 gcactacacc tgttttgacc aacggaaaag agcaaatgca gcatttttga taggtgccta 661 tgcagtaatc tatttaaaga agacaccaga agaagcctac agagcactcc tgtctggctc 721 aaaccccccc tatcttccat tcagggatgc ttcctttgga aattgcactt acaatctcac 781 cattctcgac tgtttgcagg gaatcagaaa gggattacaa catggatttt ttgactttga 841 gacaattgat gtggatgaat atgaacatta tgagcgagtt gaaaatggtg acttcaactg 901 tattgttcca ggaaaatttt tagcatttag tggaccacat cctaaaagca aaattgagaa 961 tggttatcct cttcacgccc ctgaagccta ctttccttat ttcaaaaagc ataatgtgac 1021 tgcagttgtg aggctaaaca aaaagattta tgaggcaaag cgcttcacag acgctggctt 1081 cgagcactat gacctcttct tcatagatgg cagcacaccc agtgacaaca tcgtgcgaag 1141 gttcctgaac atctgtgaga acaccgaagg ggccatcgcc gttcactgca aagctggtct 1201 tggaagaaca gggacattga tagcctgtta tgtaatgaaa cactacaggt ttacacatgc 1261 tgaaataatt gcttggatta gaatatgccg gccaggctct attataggac cccagcagca 1321 cttcctggaa gaaaaacaag catcgttgtg ggtccaagga gacattttcc gatccaaact 1381 gaaaaatcga ccatccagtg aaggaagtat taataaaatt ctttctggcc tagatgatat 1441 gtctattggt ggaaatcttt caaaaacaca aaacatggaa cgatttggag aggataactt 1501 agaagatgat gatgtggaaa tgaaaaatgg tataacccag ggagacaaac tacgtgcctt 1561 aaaaagtcag agacagccac gtacctcacc atcctgtgca tttaggtcag atgatacaaa 1621 aggacatcca agagcagtgt cccagccttt cagattaagt tcatccctgc aaggatctgc 1681 agttactttg aagacatcaa aaatggcact gtccccttca gcaacggcca agaggatcaa 1741 cagaacttct ttgtcttcgg gtgccactgt aagaagcttt tccataaact cccggctagc 1801 cagttctcta gggaacttga atgctgcaac agatgatcca gagaacaaaa agacctcctc 1861 atcctctaag gcaggcttca cagccagccc gtttaccaac ctcttgaatg gcagctccca 1921 gccaactacc agaaattacc ctgagctcaa caataatcag tacaacagaa gcagcaacag 1981 caacgggggc aacctgaaca gccccccagg cccccacagc gccaagacag aggagcacac 2041 caccatcctc cgaccctcct acaccgggct ttcttcttct tcagcgagat tcctgagccg 2101 ttctatccct tcccttcagt ctgaatatgt tcattactaa ggccttgcca ctccagtgaa 2161 agctgttctt ctcttagaca caatttcttc atctggacga gcagtggaga gggaaagcaa 2221 cttcttgctg gaagaatatc tctgccttct taccttaaat taaaaagagc actaagataa 2281 caccttcaag agacttgaaa acagaaaact ggttaatgac tactataaat gcactgaaac 2341 tatgttatgg agatttccat acttttaaag acagttttaa tgttgaattt ggtattttga 2401 agggttattt ttaatgtatt ttggtaatac atttattatt atatttacat gtacagtgtt 2461 acattatata tgtattgtga actttaaaag actattttga taaatttata aatatataaa 2521 attatgtaaa aactacacta tattttgatt tagattttcc tgctgtttgc taccaaaaat 2581 ttgtatttta aatctgttta gttttagtat ggttttgtct ctaatgaata aataattcct 2641 tcttattaag aagaagtaag ggagaaagtt tttagaaagt gatttttatg ctcgcactat 2701 aaatatggca ggtcagttca ttcttttggg aagtcagttt agttacactg agtttatcca 2761 agtttatctc taccaagagt ataatggcat gggatggctt atttaggaca attccctttc 2821 ctattgtttt tgttgctgag ccaatttgag ttagttttgc atcctggggg gctttaaaat 2881 acagcatgca gtgaaagatc agaattcact gaatatttct tctgagagca tggtttcatg 2941 gtttttctct atgaaatgac tcaatattcc aaatgttttt ttttccttcc tcctttcaaa 3001 agagttctta acccaattag gatatcctgc tttgggtatg aggttgttgt tgcctgtaat 3061 cacacatggt ttgacatcag ttttaaatca atggagagaa aaaactgaaa aagatgctgc 3121 taagtagttc tctgtattaa aggagatatt tttaaaacag ggtacaaccc cctgctgcac 3181 acgctagcat atctggaacc tactatgaaa atgaaaggac ccttataggt actcacagcc 3241 ctttcatgta agtatgatct gatatttagg tcttcagaag cctgtaggtt tcatttctat 3301 gaggaatcga ggagcgttac atcctgatat ccttccaggc tgcttaagaa tggactgctt 3361 cgacactgaa agtgctagtt aaatggattc atatgaagtg ctttactccc aaccattgag 3421 ttatttataa tgtatttatt aggggagggt accttgagtc tattatatat gcttcatcaa 3481 aacatcttgt tcatgtttta tgtttttaaa aaaggcattt gaatgaatgt ttgactcagg 3541 tttgttaaat taacttcagt aactgcagta ccaaaaatta cactcaactg atgaaaaaaa 3601 cgaattgtat gatttaggaa tcaaaaacta aaataagtgg aattatgtat cttttctaaa 3661 gttaaaaaag taaaatattt tattatgagt tattataaaa attggttaat tgtataggaa 3721 gatgacagta tttttttcaa gttatcataa aaagtaattc agatgacatt tgagaagtag 3781 gggaaaggga atcatgttga cagttttagt tctgtgaaca ctaatttgtg tgaagctatt 3841 aaaatgattg taaagttgac tactgtaaat ttcccataat tatgtgtgta tatgtgtcat 3901 atgtatgtac atgtatatgt ctaaaaatta ctttacacat gtgcctacat agacacacca 3961 agaagtggat gtatataata tagaaagtat atagcaaagt aattttactc tgataataaa 4021 aattgtttga catgtatttt gttatgaata gtttatcttc caaaagatat tttgctctat 4081 tttaaagtgt agaagaatac actgctaata aataataaaa gttttattca atttaaaaaa 4141 aaaaaaaaaa aa // LOCUS AF000424 635 bp mRNA PRI 26-NOV-1997 DEFINITION Homo sapiens LST1 mRNA, cLST1/C splice variant, complete cds. ACCESSION AF000424 NID g2145063 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 635) AUTHORS de Baey,A., Fellerhoff,B., Maier,S., Martinozzi,S., Weidle,U. and Weiss,E.H. TITLE Complex expression pattern of the TNF region gene LST1 through differential regulation, initiation, and alternative splicing JOURNAL Genomics 45 (3), 591-600 (1997) MEDLINE 98035883 REFERENCE 2 (bases 1 to 635) AUTHORS de Baey,A., Fellerhoff,B., Maier,S., Martinozzi,S., Weidle,U. and Weiss,E.H. TITLE Direct Submission JOURNAL Submitted (21-APR-1997) Anthropology and Human Genetics, University of Munich, Richard Wagner Str. 10/1, Munich 80333, Germany FEATURES Location/Qualifiers source 1..635 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /cell_line="interferon-gamma stimulated U937 cell line" /map="6p21-p23" gene 1..635 /gene="LST1" CDS 226..501 /gene="LST1" /note="cLST1/C splice variant" /codon_start=1 /db_xref="PID:g2145064" /translation="MIYVSTGAWGWAGSCFWQWSFCPPACVGCIEEHLLSWSQAQGSS EQELHYASLQRLPVPSSEGPDLRGRDKRGTKEDPRADYACIAENKPT" BASE COUNT 134 a 208 c 168 g 125 t ORIGIN 1 acttcagccc tagcagcatc tgcctgtggg aagcagctct ccacaccagc caagggggcc 61 cccacactcc cgcgctgctc tgcggctcag ggagcagccc acctgctgga tgaggaactt 121 gaggcaagtc accagcccct gatcatttcg cctaaaagag caaggactag agttcctgac 181 ctccaggcca gtccctgatc cctgacctaa tgttatcgcg gaatgatgat atatgtatct 241 acgggggcct ggggctgggc gggctcctgc ttctggcagt ggtccttctg tccgcctgcc 301 tgtgttggct gcatcgaaga gcaccttctg tcctggtccc aggcccaggg ctcctcagag 361 caggaactcc actatgcatc tctgcagagg ctgccagtgc ccagcagtga gggacctgac 421 ctcaggggca gagacaagag aggcaccaag gaggatccaa gagctgacta tgcctgcatt 481 gctgagaaca aacccacctg agcaccccag acaccttcct caacccaggc gggtggacag 541 ggtccccctg tggtccagcc agtaaaaacc atggtccccc cacttctgtg tctcagtcct 601 ctcagtccat ctcgagcctc cgttcaaatt gatca // LOCUS AF000571 3129 bp mRNA PRI 07-OCT-1997 DEFINITION Homo sapiens kidney and cardiac voltage dependent K+ channel (KvLQT1) mRNA, complete cds. ACCESSION AF000571 NID g2465530 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3129) AUTHORS Chouabe,C., Neyroud,N., Guicheney,P., Lazdunski,M., Romey,G. and Barhanin,J. TITLE Properties of KvLQT1 K+ channel mutations in Romano-Ward and Jervell and Lange-Nielsen inherited cardiac arrhythmias JOURNAL EMBO J. 16 (17), 5472-5479 (1997) MEDLINE 97459933 REFERENCE 2 (bases 1 to 3129) AUTHORS Barhanin,J., Romey,G. and Lazdunski,M. TITLE Direct Submission JOURNAL Submitted (21-APR-1997) IPMC, CNRS, route des lucioles, Sophia-Antipolis, Valbonne 06560, France FEATURES Location/Qualifiers source 1..3129 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" /chromosome="11" /map="11p15.5" gene 1..3129 /gene="KvLQT1" CDS 111..2141 /gene="KvLQT1" /note="Delayed rectifier IKs alpha subunit" /codon_start=1 /product="kidney and cardiac voltage dependent K+ channel" /db_xref="PID:g2465531" /translation="MAAASSPPRAERKRWGWGRLPGARRGSAGLAKKCPFSLELAEGG PAGGALYAPIAPGAPGPAPPASPAAPAAPPVASDLGPRPPVSLDPRVSIYSTRRPVLA RTHVQGRVYNFLERPTGWKCFVYHFAVFLIVLVCLIFSVLSTIEQYAALATGTLFWME IVLVVFFGTEYVVRLWSAGCRSKYVGLWGRLRFARKPISIIDLIVVVASMVVLCVGSK GQVFATSAIRGIRFLQILRMLHVDRQGGTWRLLGSVVFIHRQELITTLYIGFLGLIFS SYFVYLAEKDAVNESGRVEFGSYADALWWGVVTVTTIGYGDKVPQTWVGKTIASCFSV FAISFFALPAGILGSGFALKVQQKQRQKHFNRQIPAAASLIQTAWRCYAAENPDSSTW KIYIRKAPRSHTLLSPSPKPKKSVVVKKKKFKLDKDNGVTPGEKMLTVPHITCDPPEE RRLDHFSVDGYDSSVRKSPTLLEVSMPHFMRTNSFAEDLDLEGETLLTPITHISQLRE HHRATIKVIRRMQYFVAKKKFQQARKPYDVRDVIEQYSQGHLNLMVRIKELQRRLDQS IGKPSLFISVSEKSKDRGSNTIGARLNRVEDKVTQLDQRLALITDMLHQLLSLHGGST PGSGGPPREGGAHITQPCGSGGSVDPELFLPSNTLPTYEQLTVPRRGPDEGS" BASE COUNT 579 a 1041 c 952 g 557 t ORIGIN 1 gggcggcggg gctggcagca gtggctgccc gcactgcgcc cgggcgctcg ccttcgctgc 61 agctcccggt gccgccgctc gggccggccc cccggcaggc cctcctcgtt atggccgcgg 121 cctcctcccc gcccagggcc gagaggaagc gctggggttg gggccgcctg ccaggcgccc 181 ggcggggcag cgcgggcctg gccaagaagt gccccttctc gctggagctg gcggagggcg 241 gcccggcggg cggcgcgctc tacgcgccca tcgcgcccgg cgccccaggt cccgcgcccc 301 ctgcgtcccc ggccgcgccc gccgcgcccc cagttgcctc cgaccttggc ccgcggccgc 361 cggtgagcct agacccgcgc gtctccattt acagcacgcg ccgcccggtg ttggcgcgca 421 cccacgtcca gggccgcgtc tacaacttcc tcgagcgtcc caccggctgg aaatgcttcg 481 tttaccactt cgccgtcttc ctcatcgtcc tggtctgcct catcttcagc gtgctgtcca 541 ccatcgagca gtatgccgcc ctggccacgg ggactctctt ctggatggag atcgtgctgg 601 tggtgttctt cgggacggag tacgtggtcc gcctctggtc cgccggctgc cgcagcaagt 661 acgtgggcct ctgggggcgg ctgcgctttg cccggaagcc catttccatc atcgacctca 721 tcgtggtcgt ggcctccatg gtggtcctct gcgtgggctc caaggggcag gtgtttgcca 781 cgtcggccat caggggcatc cgcttcctgc agatcctgag gatgctacac gtcgaccgcc 841 agggaggcac ctggaggctc ctgggctccg tggtcttcat ccaccgccag gagctgataa 901 ccaccctgta catcggcttc ctgggcctca tcttctcctc gtactttgtg tacctggctg 961 agaaggacgc ggtgaacgag tcaggccgcg tggagttcgg cagctacgca gatgcgctgt 1021 ggtggggggt ggtcacagtc accaccatcg gctatgggga caaggtgccc cagacgtggg 1081 tcgggaagac catcgcctcc tgcttctctg tctttgccat ctccttcttt gcgctcccag 1141 cggggattct tggctcgggg tttgccctga aggtgcagca gaagcagagg cagaagcact 1201 tcaaccggca gatcccggcg gcagcctcac tcattcagac cgcatggagg tgctatgctg 1261 ccgagaaccc cgactcctcc acctggaaga tctacatccg gaaggccccc cggagccaca 1321 ctctgctgtc acccagcccc aaacccaaga agtctgtggt ggtaaagaaa aaaaagttca 1381 agctggacaa agacaatggg gtgactcctg gagagaagat gctcacagtc ccccatatca 1441 cgtgcgaccc cccagaagag cggcggctgg accacttctc tgtcgacggc tatgacagtt 1501 ctgtaaggaa gagcccaaca ctgctggaag tgagcatgcc ccatttcatg agaaccaaca 1561 gcttcgccga ggacctggac ctggaagggg agactctgct gacacccatc acccacatct 1621 cacagctgcg ggaacaccat cgggccacca ttaaggtcat tcgacgcatg cagtactttg 1681 tggccaagaa gaaattccag caagcgcgga agccttacga tgtgcgggac gtcattgagc 1741 agtactcgca gggccacctc aacctcatgg tgcgcatcaa ggagctgcag aggaggctgg 1801 accagtccat tgggaagccc tcactgttca tctccgtctc agaaaagagc aaggatcgcg 1861 gcagcaacac gatcggcgcc cgcctgaacc gagtagaaga caaggtgacg cagctggacc 1921 agaggctggc actcatcacc gacatgcttc accagctgct ctccttgcac ggtggcagca 1981 cccccggcag cggcggcccc cccagagagg gcggggccca catcacccag ccctgcggca 2041 gtggcggctc cgtcgaccct gagctcttcc tgcccagcaa caccctgccc acctacgagc 2101 agctgaccgt gcccaggagg ggccccgatg aggggtcctg aggaggggat ggggctgggg 2161 gatgggcctg agtgagaggg gaggccaaga gtggccccac ctggccctct ctgaaggagg 2221 ccacctccta aaaggcccag agagaagagc cccactctca gaggccccaa taccccatgg 2281 accatgctgt ctggcacagc ctgcacttgg gggctcagca aggccacctc ttcctggccg 2341 gtgtgggggc cccgtctcag gtctgagttg ttaccccaag cgccctggcc cccacatggt 2401 gatgttgaca tcactggcat ggtggttggg acccagtggc agggcacagg gcctggccca 2461 tgtatggcca ggaagtagca caggctgagt gcaggcccac cctgcttggc ccagggggct 2521 tcctgagggg agacagagca acccctggac cccagcctca aatccaggac cctgccaggc 2581 acaggcaggg caggaccagc ccacgctgac tacagggcca ccggcaataa aagcccagga 2641 gcccatttgg agggcctggg cctggctccc tcactctcag gaaatgctga cccatgggca 2701 ggagactgtg gagactgctc ctgagccccc agcttccagc aggagggaca gtctcaccat 2761 ttccccaggg cacgtggttg agtgggggga acgcccactt ccctgggtta gactgccagc 2821 tcttcctagc tggagaggag ccctgcctct ccgcccctga gcccactgtg cgtggggctc 2881 ccgcctccaa cccctcgccc agtcccagca gccagccaaa cacacagaag gggactgcca 2941 cctccccttg ccagctgctg agccgcagag aagtgacggt tcctacacag gacaggggtt 3001 ccttctgggc attacatcgc atagaaatca ataatttgtg gtgatttgga tctgtgtttt 3061 aatgagtttc acagtgtgat tttgattatt aattgtgcaa gcttttccta ataaacgtgg 3121 agaatcaca // LOCUS AF000652 2193 bp mRNA PRI 21-JAN-1998 DEFINITION Homo sapiens syntenin (sycl) mRNA, complete cds. ACCESSION AF000652 NID g2795862 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2193) AUTHORS Grootjans,J.J., Zimmermann,P., Reekmans,G., Smets,A., Degeest,G., Durr,J. and David,G. TITLE Syntenin, a PDZ protein that binds syndecan cytoplasmic domains JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (25), 13683-13688 (1997) MEDLINE 98054294 REFERENCE 2 (bases 1 to 2193) AUTHORS Grootjans,J.J. and David,G. TITLE Direct Submission JOURNAL Submitted (22-APR-1997) Center for Human Genetics, Laboratory for Glycobiology, Catholic University, Campus Gasthuisberg, Herestraat 49, Leuven 3000, Belgium FEATURES Location/Qualifiers source 1..2193 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..2193 /gene="sycl" CDS 149..1045 /gene="sycl" /note="mda-9; pbp-1" /codon_start=1 /product="syntenin" /db_xref="PID:g2795863" /translation="MSLYPSLEDLKVDKVIQAQTAFSANPANPAILSEASAPIPHDGN LYPRLYPELSQYMGLSLNEEEIRASVAVVSGAPLQGQLVARPSSINYMVAPVTGNDVG IRRAEIKQGIREVILCKDQDGKIGLRLKSIDNGIFVQLVQANSPASLVGLRFGDQVLQ INGENCAGWSSDKAHKVLKQAFGEKITMTIRDRPFERTITMHKDSTGHVGFIFKNGKI TSIVKDSSAARNGLLTEHNICEINGQNVIGLKDSQIADILSTSGTVVTITIMPAFIFE HIIKRMAPSIMKSLMDHTIPEV" BASE COUNT 635 a 418 c 442 g 698 t ORIGIN 1 ggcacgaggc gggggcggtg catgacgcgc ctcgggggcg gtcctcgggc gcgcaccgct 61 ctcttacact cgggcctcag aagtccgtgc cagtgaccgg agcggcggcg gcgagcggtt 121 ccttgtgggc tagaagaatc ctgcaaaaat gtctctctat ccatctctcg aagacttgaa 181 ggtagacaaa gtaattcagg ctcaaactgc tttttctgca aaccctgcca atccagcaat 241 tttgtcagaa gcttctgctc ctatccctca cgatggaaat ctctatccca gactgtatcc 301 agagctctct caatacatgg ggctgagttt aaatgaagaa gaaatacgtg caagtgtggc 361 cgtggtttct ggtgcaccac ttcaggggca gttggtagca agaccttcca gtataaacta 421 tatggtggct cctgtaactg gtaatgatgt tggaattcgt agagcagaaa ttaagcaagg 481 gattcgtgaa gtcattttgt gtaaggatca agatggaaaa attggactca ggcttaaatc 541 aatagataat ggtatatttg ttcagctagt ccaggctaat tctccagcct cattggttgg 601 tctgagattt ggggaccaag tacttcagat caatggtgaa aactgtgcag gatggagctc 661 tgataaagcg cacaaggtgc tcaaacaggc ttttggagag aagattacca tgaccattcg 721 tgacaggccc tttgaacgga cgattaccat gcataaggat agcactggac atgttggttt 781 tatctttaaa aatggaaaaa taacatccat agtgaaagat agctctgcag ccagaaatgg 841 tcttctcacg gaacataaca tctgtgaaat caatggacag aatgtcattg gattgaagga 901 ctctcaaatt gcagacatac tgtcaacatc tgggactgta gttactatta caatcatgcc 961 tgcttttatc tttgaacata ttattaagcg gatggcacca agcattatga aaagcctaat 1021 ggaccacacc attcctgagg tttaaaattc acggcaccat ggaaatgtag ctgaacgtct 1081 ccagtttcct tctttggcaa cttctgtatt atgcacgtga agccttcccg gagccagcga 1141 gcatatgctg catgaggacc tttctatctt acattatggc tggggatctt actctttcat 1201 ctgatacctt gttcagattt caaaatagtt gtagccttat cctggtttta cagatgtgaa 1261 ctttcaagag atttactgac tttcctagaa tagtttctct actggaaacc tgatgctttt 1321 ataagccatt gtgattagga tgactgttac aggcttagct ttgtgtgaaa accagtcacc 1381 tttctcctag gtaatgagta gtgctgtcat attactttag ttctatagca tacttgcatc 1441 tttaacatgc tatcatagta catttagaat gattgccttt gatttttttt tttaaattct 1501 gtgtgtgtgt gtgtaaaatg ccaattaaga acactggttt cattccatgt aagcattaaa 1561 cagtgtatgt aggtttcaag agattgtgat gattcttaaa ttttaactac cttcacttaa 1621 tatgcttgaa ctgtcgcctt aactatgtta agcatctaga ctaaaagcca aaatataatt 1681 attgctgcct ttctaaaaac ccaaaatgta gttctctatt aacctgaaat gtacactagc 1741 ccagaacagt ttaatggtac ttactgagct atagcatagc tgcttagttg tttttgagat 1801 tttttagtca acacataatg gaaacttctt tcttctaaaa gttgccagtg ccacttttaa 1861 gaagtgaatc actatatgtg atgtaaaagt tattacacta aacaggataa acttttgact 1921 ccccttttgt tcatttgtgg attaagtggt ataatactta attttggcat ttgactctta 1981 agattatgta acctagctac tttgggatgg tcttagaata tttttctgat aacttgttcc 2041 ttttcctgac tcctccttgc aaacaaaatg atagttgaca ctttatcctg atttttttct 2101 tttttttggt ttatgtctat tctaattaaa tatgtataaa taaagttaca ttttagtctg 2161 tcaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa // LOCUS AF000959 1364 bp mRNA PRI 21-JUN-1997 DEFINITION Homo sapiens transmembrane protein mRNA, complete cds. ACCESSION AF000959 NID g2150012 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1364) AUTHORS Sirotkin,H., Morrow,B., Saint-Jore,B., Puech,A., Das Gupta,R., Patanjali,S.R., Skoultchi,A., Weissman,S.M. and Kucherlapati,R. TITLE Identification, characterization, and precise mapping of a human gene encoding a novel membrane-spanning protein from the 22q11 region deleted in velo-cardio-facial syndrome JOURNAL Genomics 42 (2), 245-251 (1997) MEDLINE 97336049 REFERENCE 2 (bases 1 to 1364) AUTHORS Sirotkin,H., Morrow,B., St. Jore,B., Puech,A., Das Gupta,R., Patanjali,S., Skoultchi,A., Weissman,S.M. and Kucherlapati,R. TITLE Direct Submission JOURNAL Submitted (23-APR-1997) Molecular Genetics, Albert Einstein college of Medicine, 1300 Morris Park Ave, Bronx, NY 10461, USA FEATURES Location/Qualifiers source 1..1364 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="22" /map="22q11" /note="between DSEG numbers: D22S941 and D22S944" CDS 121..777 /note="deleted in Velo-Cardio-Facial Syndrome (TMDVCF)" /codon_start=1 /product="transmembrane protein" /db_xref="PID:g2150013" /translation="MGSAALEILGLVLCLVGWGGLILACGLPMWQVTAFLDHNIVTAQ TTWKGLWMSCVVQSTGHMQCKVYDSVLALSTEVQAARALTVSAVLLAFVALFVTLAGA QCTTCVAPGPAKARVALTGGVLYLFCGLLALVPLCWFANIVVREFYDPSVPVSQKYEL GAALYIGWAATALLMVGGCLLCCGAWVCTGRPDLSFPVKYSAPRRPTATGDYDKKNYV " BASE COUNT 201 a 442 c 473 g 248 t ORIGIN 1 aggggactgg ggccaagagc cgggagcgcg ggcgcaaagg caccagggcc cgcccagggc 61 gccgcgcagc acggccttgg gggttctgcg ggccttcggg tgcgcgtctc gcctctagcc 121 atggggtccg cagcgttgga gatcctgggc ctggtgctgt gcctggtggg ctgggggggt 181 ctgatcctgg cgtgcgggct gcccatgtgg caggtgaccg ccttcctgga ccacaacatc 241 gtgacggcgc agaccacctg gaagggcctg tggatgtcgt gcgtggtgca gagcaccggg 301 cacatgcagt gcaaagtgta cgactcggtg ctggctctga gcaccgaggt gcaggcggcg 361 cgggcgctca ccgtgagcgc cgtgctgctg gcgttcgttg cgctcttcgt gaccctggcg 421 ggcgcgcagt gcaccacctg cgtggccccg ggcccggcca aggcgcgtgt ggccctcacg 481 ggaggcgtgc tctacctgtt ttgcgggctg ctggcgctcg tgccactctg ctggttcgcc 541 aacattgtcg tccgcgagtt ttacgacccg tctgtgcccg tgtcgcagaa gtacgagctg 601 ggcgcagcgc tgtacatcgg ctgggcggcc accgcgctgc tcatggtagg cggctgcctc 661 ttgtgctgcg gcgcctgggt ctgcaccggc cgtcccgacc tcagcttccc cgtgaagtac 721 tcagcgccgc ggcggcccac ggccaccggc gactacgaca agaagaacta cgtctgaggg 781 cgctgggcac ggccgggccc ctcctgccag ccacgcctgc gaggcgttgg ataagcctgg 841 ggagccccgc atggaccgcg gcttccgccg ggtagcgcgg cgcgcaggct cctcggaacg 901 tccggctctg cgccccgacg cggctcctgg atccgctcct gcctgcgccc gcagctgacc 961 ttctcctgcc actagcccgg ccctgccctt aacagacgga atgaagtttc cttttctgtg 1021 cgcggcgctg tttccatagg cagagcgggt gtcagactga ggatttcgct tcccctccaa 1081 gacgctgggg gtcttggctg ctgccttact tcccagaggc tcctgctgac ttcggagggg 1141 cggatgcaga gcccggggcc cccaccggaa gatgtgtaca gctggtcttt actccatcgg 1201 caggcccgag cccagggacc agtgacttgg cctggacctc ccggtctcac tccagcatct 1261 ccccaggcaa ggcttgtggg caccggagct tgagagaggg cgggagtggg aaggctaaga 1321 atctgcttag taaatggttt gaactctcaa aaaaaaaaaa aaaa // LOCUS AF001042 4998 bp mRNA PRI 25-MAY-1997 DEFINITION Homo sapiens RNA editase (RED1) mRNA, complete cds. ACCESSION AF001042 NID g2114492 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4998) AUTHORS Villard,L., Tassone,F., Haymowicz,M., Welborn,R. and Gardiner,K. TITLE Map location, genomic organization and expression patterns of the human RED1 RNA editase JOURNAL Somat. Cell Mol. Genet. (1997) In press REFERENCE 2 (bases 1 to 4998) AUTHORS Villard,L., Tassone,F. and Gardiner,K. TITLE Direct Submission JOURNAL Submitted (23-APR-1997) Eleanor Roosevelt, 1899 Gaylord Street, Denver, CO 80206, USA FEATURES Location/Qualifiers source 1..4998 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="21" /map="21q22.3; within 300 kb of the CD18 gene" gene 1..4998 /gene="RED1" CDS 16..2241 /gene="RED1" /codon_start=1 /product="RNA editase" /db_xref="PID:g2114493" /translation="MDIEDEENMSSSSTDVKENRNLDNVSPKDASTPGPGEGSQLSNG GGGGPGRKRPLEEGSNGHSKYRLKKRRKTPGPVLPKNALMQLNEIKPGLQYTLLSQTG PVHAPLFVMSVEVNGQVFEGSGPTKKKAKLHAAEKALRSFVQFPNASEAHLAMGRTLS VNTDFTSDQADFPDTLFNGFETPDKAEPPFYVGSNGDDSFSSSGDLSLSASPVPASLA QPPLPVLPPFPPPSGKNPVMILNELRPGLKYDFLSESGESHAKSFVMSVVVDGQFFEG SGRNKKLAKARAAQSALAAIFNLHLDQTPSRQPIPSEGLQLHLPQVLADAVSRLVLGK FGDLTDNFSSPHARRKVLAGVVMTTGTDVKDAKVISVSTGTKCINGEYMSDRGLALND CHAEIISRRSLLRFLYTQLELYLNNKDDQKESIFQKSERGGFRLKENVQFHLYISTSP CGDARIFSPHEPILEGSRSYTQAGLQWCNHGSLQPRPPGLLSDPSTSTFQGAGTTEPA DRHPNRKARGQLRTKIESGEGTIPVRSNASIQTWDGVLQGERLLTMSCSDKIARWNVV GIQGSLLSIFVEPIYFSSIILGSLYHGDHLSRAMYQRISNIEDLPPLYTLNKPLLSGI SNAEARQPGKAPNFSVNWTVGDSAIEVINATTGKDELGRASRLCKHALYCRWMRVHGK VPSHLLRSKITKPNVYHESKLAAKEYQAAKARLFTAFIKAGLGAWVEKPTEQDQFSLT P" exon 44..978 /gene="RED1" /number=2 exon 979..1093 /gene="RED1" /number=3 exon 1094..1262 /gene="RED1" /number=4 exon 1263..1411 /gene="RED1" /number=5 exon 1412..1531 /gene="RED1" /number=6 exon 1532..1700 /gene="RED1" /number=7 exon 1701..1882 /gene="RED1" /number=8 exon 1883..2061 /gene="RED1" /number=9 exon 2062..>2241 /gene="RED1" BASE COUNT 1140 a 1292 c 1457 g 1109 t ORIGIN 1 caaaagtatt ttgccatgga tatagaagat gaagaaaaca tgagttccag cagcactgat 61 gtgaaggaaa accgcaatct ggacaacgtg tcccccaagg atgcgagcac acctgggcct 121 ggcgagggct ctcagctctc caatgggggt ggtggtggcc ccggcagaaa gcggcccctg 181 gaggagggca gcaatggcca ctccaagtac cgcctgaaga aaaggaggaa aacaccaggg 241 cccgtcctcc ccaagaacgc cctaatgcag ctgaatgaga tcaagcctgg tttgcagtac 301 acactcctgt cccagactgg gcccgtgcac gcgcctttgt ttgtcatgtc tgtggaggta 361 aatggccagg tttttgaggg ctccggtccc acaaagaaaa aggcaaaact ccatgctgct 421 gagaaggcct tgaggtcttt cgttcagttt cctaatgcct ctgaggccca cctggccatg 481 gggaggaccc tgtctgtcaa cacggacttc acatctgacc aggcggactt ccctgacacg 541 ctcttcaatg gttttgaaac tcctgacaag gcggagcctc ccttttacgt gggctccaat 601 ggggatgact ccttcagttc cagcggggac ctcagcttgt ctgcttcccc ggtgcctgcc 661 agcctagccc agcctcctct ccctgtctta ccaccattcc cacccccgag tgggaagaat 721 cccgtgatga tcttgaacga actgcgccca ggactcaagt atgacttcct ctccgagagc 781 ggggagagcc atgccaagag cttcgtcatg tctgtggtcg tggatggtca gttctttgaa 841 ggctcgggga gaaacaagaa gcttgccaag gcccgggctg cacagtctgc cctggccgcc 901 atttttaact tgcacttgga tcagacgcca tctcgccagc ctattcccag tgagggtctt 961 cagctgcatt taccgcaggt tttagctgac gctgtctcac gcctggtcct gggtaagttt 1021 ggtgacctga ccgacaactt ctcctcccct cacgctcgca gaaaagtgct ggctggagtc 1081 gtcatgacaa caggcacaga tgttaaagat gccaaggtga taagtgtttc tacaggaaca 1141 aaatgtatta atggtgaata catgagtgat cgtggccttg cattaaatga ctgccatgca 1201 gaaataatat ctcggagatc cttgctcaga tttctttata cacaacttga gctttactta 1261 aataacaaag atgatcaaaa agaatccatc tttcagaaat cagagcgagg ggggtttagg 1321 ctgaaggaga atgtccagtt tcatctgtac atcagcacct ctccctgtgg agatgccaga 1381 atcttctcac cacatgagcc aatcctggaa gggtctcgct cttacaccca ggctggattg 1441 cagtggtgca atcatggctc actgcagcct cgacctcctg ggctcttaag cgatccttcc 1501 acctcaacct tccaaggagc tgggactaca gaaccagcag atagacaccc aaatcgtaaa 1561 gcaagaggac agctacggac caaaatagag tctggtgagg ggacaattcc agtgcgctcc 1621 aatgcgagca tccaaacgtg ggacggggtg ctgcaagggg agcggctgct caccatgtcc 1681 tgcagtgaca agattgcacg ctggaacgtg gtgggcatcc agggatccct gctcagcatt 1741 ttcgtggagc ccatttactt ctcgagcatc atcctgggca gcctttacca cggggaccac 1801 ctttccaggg ccatgtacca gcggatctcc aacatagagg acctgccacc tctctacacc 1861 ctcaacaagc ctttgctcag tggcatcagc aatgcagaag cacggcagcc agggaaggcc 1921 cccaacttca gtgtcaactg gacggtaggc gactccgcta ttgaggtcat caacgccaca 1981 actgggaagg atgagctggg ccgcgcgtcc cgcctgtgta agcacgcgtt gtactgtcgc 2041 tggatgcgtg tgcacggcaa ggttccctcc cacttactac gctccaagat taccaagccc 2101 aacgtgtacc atgagtccaa gctggcggca aaggagtacc aggccgccaa ggcgcgtctg 2161 ttcacagcct tcatcaaggc ggggctgggg gcctgggtgg agaagcccac cgagcaggac 2221 cagttctcac tcacgccctg acccgggcag acatgatggg gggtgcaggg ggctgtgggc 2281 atccagcgtc atcctccaga acctcacatc tgaactgggg gcaggtgcat accttgggga 2341 gggagtaggg ggacacgggg gaccaccagg tgtccacggt tgtccccagc atctcacatc 2401 agacctgggg caggtgcgca gtgtggggag gggatggggt gcgtcagggc ccagcatcgc 2461 cgcctggcat ctcctcgccg cagcatttcc ccttctgaac cgtccagtga ctgctttcaa 2521 tctcggttta cgtttagaaa ttgagttcta ctgagtaggg cttccttaag tttaggaaaa 2581 tagaaattac tttgtgtgaa attcttgaat aaataattta ttcagagcta ggaatgtggt 2641 ttataaaata ggaagtaatt gtgtcaggtc acttttatgc cacattattt taattgcaaa 2701 aaagcatcta tatatggagg agggtgggaa aatagaggta ggaaatagta gcctaaagga 2761 aatcgccaca cgtctgtcta aacttaggtc tcttttctcc gtaggtacct ccctgggtag 2821 ttccacacac taggttgtaa cagtctctcc ctgaggagca gactcccagc atggtgtagc 2881 gtggccctgt catgcacatg gggtcccgca gcagtgactg tgtgtcctgc agaggcgtga 2941 cccaggcccc tgtagccctc agcctcctct agaagcttct gtactccttg taggatcaga 3001 tcatggaaaa cttttctcag tttacttcta agtaatcaca gataatacat ggccagtaat 3061 cccaggctgg ccattcattc aggtttttta aaggatattt aacttttatg gactagaagg 3121 aatcacgagg gctactgcac aatacatggc ctaagttccc tctgttcctt cctctgaatc 3181 gaatggatgt gggtgaccgc ccgaaggcct tcacaggatg gaagtagaat gatttcagta 3241 gatactcatt cttggaaaat gccatagttt taaattattg tttccagctt tatcaaagac 3301 atgtttgaaa aataaaaagc atccaagtga gagctggtga gaccacgtgc tgctggcgta 3361 gtgtaggcca gacattgaca gtcctgacgg gagctcaggg ctgcccagcg cccagcgtgc 3421 acgggacggc cccacgacag agggagtcag cccgggaggt caggagcgcg gcgggcgagg 3481 gccctgtgtg gaccacctcc accaagctca gagatttgca ccaggtgcct tgttgcctcc 3541 gctcaggatg aaagaggagc tgagagaagt gctctgcctg ccagtgcagt gcccagctcc 3601 aaggctctag agggtgttca ggtgggtctc ctggggccat ggggagagat tggtgcagac 3661 cttaccccac agcatacacc tgccacagcg aaatccaggg tgttggcacc tgtgtgtccg 3721 tgatgagcct aggaaaccag agcaggggca gaggggcgtc atcctcccac cggacgctgg 3781 gagctcagac cccaaaactg aaacaccgtg gcttcggcgg ggggtgtgcc tcctgatgtc 3841 aggagcccca tccacgtgtg tccacacaga tctcgtcgca gcacggcagg aaggggtgct 3901 gcttagggct cattgttggg gacatgaccg ggttcagcgg ctaaaacatc tgccccacag 3961 cagcctcctc ctccaccgaa aagggtagtt gtctccctga agcagtcaca gcaggcgtct 4021 ctgccgctcc gtcaccacag tggggttttg ttcaggcaaa tcgcgctggg gttctgcacc 4081 tgcaaaagga gaggggtctg ttgtcgctgg ctttccccca agcaggctct tgcacactct 4141 agaaaaaaca ccttgtaagt ctgtgcattt ttattgtctt gataaattgt atttttttct 4201 aatggggatt gggagatgga cttcgttttt aaaaatatgt ggattttggt taccaagttt 4261 agtgttaata tattccatat acatacaaaa ctacccggta tgtctggctt ttcccttctg 4321 tcaggtaata gctaaagtca gcatgattgc tccctgtacc accccaaata agtgagtgcc 4381 tcaccttgtg gggcctgagc agctaccttg agaccatgtg aggtggcacc tttccggggt 4441 ggactcgtgc ggccttgagg acaggcacag ggcaccctat cccaagccgt ccaggcagga 4501 ggaaggcagc caaggcaact gggttctggg agccctgggt ggggcagctg tggggaggaa 4561 ctgggttcgg ggcagccctg ggcagggcgg ctgtggggca agaactgggt tcggggagcc 4621 ctgtccggcg gggggctgtg gggcggggag ctgggttcgg ggagccctgg gcggggtggc 4681 tgttgggggg aactgggttc ggggagccct gggcggggtg gcttttgggg ggaactgggt 4741 tcggggagcc ctgtgcgggg tggctgttga ggaggacttg ggttcaggga gccctgggcg 4801 gggtggctgt cagggggaac tggtttccgg gagccctggg ccggggcagg gggcggctgt 4861 aggaaggaac tggtttcggg gagccctggg cggggcggct gtggggagga aggtgacgtg 4921 caggggacca gaggctctgc actgctccta ggacagctca tctgtaatca gaaaaaaaat 4981 aaacaaaata cagaacgc // LOCUS AF001174 1310 bp mRNA PRI 02-JUL-1997 DEFINITION Homo sapiens p38beta2 MAP kinase mRNA, complete cds. ACCESSION AF001174 NID g2232137 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1310) AUTHORS Kumar,S., McDonnell,P.C., Gum,R.J., Hand,A.T., Lee,J.C. and Young,P.R. TITLE Novel homologues of CSBP/p38 MAP kinase: activation, substrate specificity and sensitivity to inhibition by pyridinyl imidazoles JOURNAL Biochem. Biophys. Res. Commun. 235 (3), 533-538 (1997) MEDLINE 97350815 REFERENCE 2 (bases 1 to 1310) AUTHORS Kumar,S. TITLE Direct Submission JOURNAL Submitted (24-APR-1997) Cellular Biochemistry, UW2109, SmithKline Beecham, 709 Swedeland Road, King of Prussia, PA 19406, USA FEATURES Location/Qualifiers source 1..1310 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" CDS 7..1101 /function="Protein kinase" /codon_start=1 /product="p38beta2 MAP kinase" /db_xref="PID:g2232138" /translation="MSGPRAGFYRQELNKTVWEVPQRLQGLRPVGSGAYGSVCSAYDA RLRQKVAVKKLSRPFQSLIHARRTYRELRLLKHLKHENVIGLLDVFTPATSIEDFSEV YLVTTLMGADLNNIVKCQALSDEHVQFLVYQLLRGLKYIHSAGIIHRDLKPSNVAVNE DCELRILDFGLARQADEEMTGYVATRWYRAPEIMLNWMHYNQTVDIWSVGCIMAELLQ GKALFPGSDYIDQLKRIMEVVGTPSPEVLAKISSEHARTYIQSLPPMPQKDLSSIFRG ANPLAIDLLGRMLVLDSDQRVSAAEALAHAYFSQYHDPEDEPEAEPYDESVEAKERTL EEWKELTYQEVLSFKPPEPPKPPGSLEIEQ" BASE COUNT 268 a 408 c 404 g 230 t ORIGIN 1 ccggacatgt cgggccctcg cgccggcttc taccggcagg agctgaacaa gaccgtgtgg 61 gaggtgccgc agcggctgca ggggctgcgc ccggtgggct ccggcgccta cggctccgtc 121 tgttcggcct acgacgcccg gctgcgccag aaggtggcgg tgaagaagct gtcgcgcccc 181 ttccagtcgc tgatccacgc gcgcagaacg taccgggagc tgcggctgct caagcacctg 241 aagcacgaga acgtcatcgg gcttctggac gtcttcacgc cggccacgtc catcgaggac 301 ttcagcgaag tgtacttggt gaccaccctg atgggcgccg acctgaacaa catcgtcaag 361 tgccaggcgc tgagcgacga gcacgttcaa ttcctggttt accagctgct gcgcgggctg 421 aagtacatcc actcggccgg gatcatccac cgggacctga agcccagcaa cgtggctgtg 481 aacgaggact gtgagctcag gatcctggat ttcgggctgg cgcgccaggc ggacgaggag 541 atgaccggct atgtggccac gcgctggtac cgggcacctg agatcatgct caactggatg 601 cattacaacc aaacagtgga tatctggtcc gtgggctgca tcatggctga gctgctccag 661 ggcaaggccc tcttcccggg aagcgactac attgaccagc tgaagcgcat catggaagtg 721 gtgggcacac ccagccctga ggttctggca aaaatctcct cggaacacgc ccggacatat 781 atccagtccc tgccccccat gccccagaag gacctgagca gcatcttccg tggagccaac 841 cccctggcca tagacctcct tggaaggatg ctggtgctgg acagtgacca gagggtcagt 901 gcagctgagg cactggccca cgcctacttc agccagtacc acgaccccga ggatgagcca 961 gaggccgagc catatgatga gagcgttgag gccaaggagc gcacgctgga ggagtggaag 1021 gagctcactt accaggaagt ccttagcttc aagcccccag agccaccgaa gccacctggc 1081 agcctggaga ttgagcagtg aggtgctgcc cagcagcccc tgagagcctg tggaggggct 1141 tgggcctgca cccttccaca gctggcctgg tttcctcgag aggcacctcc cacactccta 1201 tggtcacaga cttctggcct aggacccctc gccttcagga gaatctacac gcatgtatgc 1261 atgcacaaac atgtgtgtac atgtgcttgc catgtgtagg agtctgggca // LOCUS AF001294 760 bp mRNA PRI 21-NOV-1997 DEFINITION Homo sapiens IPL (IPL) mRNA, complete cds. ACCESSION AF001294 NID g2150049 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 760) AUTHORS Qian,N., Frank,D., O'Keefe,D., Dao,D., Zhao,L., Yuan,L., Wang,Q., Keating,M., Walsh,C. and Tycko,B. TITLE The IPL gene on chromosome 11p15.5 is imprinted in humans and mice and is similar to TDAG51, implicated in fas expression and apoptosis JOURNAL Hum. Mol. Genet. 6 (12), 2021-2029 (1997) MEDLINE 97472453 REFERENCE 2 (bases 1 to 760) AUTHORS Qian,N., Frank,D., O'Keefe,D., Dao,D., Zhao,L., Yuan,L., Wang,Q., Keating,M., Walsh,C. and Tycko,B. TITLE Direct Submission JOURNAL Submitted (25-APR-1997) Pathology, Columbia University, 630 W. 168th Street, New York, NY 10032, USA FEATURES Location/Qualifiers source 1..760 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11p15.5" gene 1..760 /note="Imprinted in Placenta and Liver" /gene="IPL" CDS 57..515 /gene="IPL" /codon_start=1 /product="IPL" /db_xref="PID:g2150050" /translation="MKSPDEVLREGELEKRSDSLFQLWKKKRGVLTSDRLSLFPASPR ARPKELRFHSILKVDCVERTGKYVYFTIVTTDHKEIDFRCAGESCWNAAIALALIDFQ NRRALQDFRSRQERTAPAAPAEDAVAAAAAAPSEPSEPSRPSPQPKPRTP" polyA_signal 744..749 /gene="IPL" BASE COUNT 140 a 276 c 209 g 135 t ORIGIN 1 agagccggcg ccgtcaccgc ccgcattgcc gctcccagtc ccgcgctcgg cacgacatga 61 aatcccccga cgaggtgcta cgcgagggcg agttggagaa gcgcagcgac agcctcttcc 121 agctatggaa gaagaagcgc ggggtgctca cctccgaccg cctgagcctg ttccccgcca 181 gcccccgcgc gcgccccaag gagctgcgct tccactccat cctcaaggtg gactgcgtgg 241 agcgcacggg caagtacgtg tacttcacca tcgtcaccac cgaccacaag gagatcgact 301 tccgctgcgc gggcgagagc tgctggaacg cggccatcgc gctggcgctc atcgatttcc 361 agaaccgccg cgccctgcag gactttcgca gccgccagga acgcaccgca cccgccgcac 421 ccgccgagga cgccgtggct gccgcggccg ccgcaccctc cgagccctcg gagccctcca 481 ggccatcccc gcagcccaaa ccccgcacgc catgagcccg ccgcgggcca tacgctggac 541 gagtcggacc gaggctagga cgtggccggc gctctccagc cctgcagcag aagaacttcc 601 cgtgcgcgcg gatcctcgct ccgttgcacg ggcgccttaa gttattggac tatctaatat 661 ctatgtattt atttcgctgg ttctttgtag tcacatattt tatagtctta atatcttgtt 721 tttgcatcac tgtgcccatt gcaaataaat cacttggcca // LOCUS AF001383 2115 bp mRNA PRI 19-JUN-1997 DEFINITION Homo sapiens amphiphysin II mRNA, complete cds. ACCESSION AF001383 NID g2199534 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2115) AUTHORS Tsutsui,K., Maeda,Y., Tsutsui,K., Seki,S. and Tokunaga,A. TITLE cDNA cloning of a novel amphiphysin isoform and tissue-specific expression of its multiple splice variants JOURNAL Biochem. Biophys. Res. Commun. (1997) In press REFERENCE 2 (bases 1 to 2115) AUTHORS Tsutsui,K., Maeda,Y., Tsutsui,K., Seki,S. and Tokunaga,A. TITLE Direct Submission JOURNAL Submitted (27-APR-1997) Molecular Biology, Okayama University Medical School, 2-5-1 Shikata-cho, Okayama 700, Japan FEATURES Location/Qualifiers source 1..2115 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /dev_stage="fetal" CDS 172..1620 /note="amphiphysin isoform II; subtype 2" /codon_start=1 /product="amphiphysin II" /db_xref="PID:g2199535" /translation="MAEMGSKGVTAGKIASNVQKKLTRAQEKVLQKLGKADETKDEQF EQCVQNFNKQLTEGTRLQKDLRTYLASVKAMHEASKKLNECLQEVYEPDWPGRDEANK IAENNDLLWMDYHQKLVDQALLTMDTYLGQFPDIKSRIAKRGRKLVDYDSARHHYESL QTAKKKDEAKIAKAEEELIKAQKVFEEMNVDLQEELPSLWNSRVGFYVNTFQSIAGLE ENFHKEMSKLNQNLNDVLVGLEKQHGSNTFTVKAQPSDNAPAKGNKSPSPPDGSPAAT PEIRVNHEPEPAGGATPGATLPKSPSQLRKGPPVPPPPKHTPSKEVKQEQILSLFEDT FVPEISVTTPSQPAEASEVAGGTQPAAGAQEPGETAASEAASSSLPAVVVETFPATVN GTVEGGSGAGRLDLPPGFMFKVQAQHDYTATDTDELQLKAGDVVLVIPFQNPEEQDEG WLMGVKESDWNQHKELEKCRGVFPENFTERVP" BASE COUNT 474 a 625 c 665 g 351 t ORIGIN 1 ctcgcccgtc cggcgcacgc tccgcctccg tcagttggct ccgctgtcgg gtgcgcggcg 61 tggagcggca gccggtctgg acgcgcggcc ggggctgggg gctgggagcg cggcgcgcaa 121 gatctccccg cgcgagagcg gcccctgcca ccgggcgagg cctgcgccgc gatggcagag 181 atgggcagta aaggggtgac ggcgggaaag atcgccagca acgtgcagaa gaagctcacc 241 cgcgcgcagg agaaggttct ccagaagctg gggaaggcag atgagaccaa ggatgagcag 301 tttgagcagt gcgtccagaa tttcaacaag cagctgacgg agggcacccg gctgcagaag 361 gatctccgga cctacctggc ctccgtcaaa gccatgcacg aggcttccaa gaagctgaat 421 gagtgtctgc aggaggtgta tgagcccgat tggcccggca gggatgaggc aaacaagatc 481 gcagagaaca acgacctgct gtggatggat taccaccaga agctggtgga ccaggcgctg 541 ctgaccatgg acacgtacct gggccagttc cccgacatca agtcacgcat tgccaagcgg 601 gggcgcaagc tggtggacta cgacagtgcc cggcaccact acgagtccct tcaaactgcc 661 aaaaagaagg atgaagccaa aattgccaag gccgaggagg agctcatcaa agcccagaag 721 gtgtttgagg agatgaatgt ggatctgcag gaggagctgc cgtccctgtg gaacagccgc 781 gtaggtttct acgtcaacac gttccagagc atcgcgggcc tggaggaaaa cttccacaag 841 gagatgagca agctcaacca gaacctcaat gatgtgctgg tcggcctgga gaagcaacac 901 gggagcaaca ccttcacggt caaggcccag cccagtgaca acgcgcctgc aaaagggaac 961 aagagccctt cgcctccaga tggctcccct gccgccaccc ccgagatcag agtcaaccac 1021 gagccagagc cggccggcgg ggccacgccc ggggccaccc tccccaagtc cccatctcag 1081 ctccggaaag gcccaccagt ccctccgcct cccaaacaca ccccgtccaa ggaagtcaag 1141 caggagcaga tcctcagcct gtttgaggac acgtttgtcc ctgagatcag cgtgaccacc 1201 ccctcccagc cagcagaggc ctcggaggtg gcgggtggga cccaacctgc ggctggagcc 1261 caggagccag gggagacggc ggcaagtgaa gcagcctcca gctctcttcc tgctgtcgtg 1321 gtggagacct tcccagcaac tgtgaatggc accgtggagg gcggcagtgg ggccgggcgc 1381 ttggacctgc ccccaggttt catgttcaag gtacaggccc agcacgacta cacggccact 1441 gacacagacg agctgcagct caaggctggt gatgtggtgc tggtgatccc cttccagaac 1501 cctgaagagc aggatgaagg ctggctcatg ggcgtgaagg agagcgactg gaaccagcac 1561 aaggagctgg agaagtgccg tggcgtcttc cccgagaact tcactgagag ggtcccatga 1621 cggcggggcc caggcagcct ccgggcgtgt gaagaacacc tcctcccgaa aaatgtgtgg 1681 ttcttttttt tgttttgttt tcgtttttca tcttttgaag agcaaaggga aatcaagagg 1741 agacccccag gcagaggggc gttctcccaa agattaggtc gttttccaaa gagccgcgtc 1801 ccggcaagtc cggcggaatt caccagtgtt cctgaagctg ctgtgtcctc tagttgagtt 1861 tctggcgccc ctgcctgtgc ccgcatgtgt gcctggccgc agggcggggc tgggggctgc 1921 cgagccacca tgcttgcctg aagcttcggc cgcgccaccc gggcaagggt cctcttttcc 1981 tggcagctgc tgtgggtggg gcccagacac cagcctagcc tggctctgcc ccgcagacgg 2041 tctgtgtgct gtttgaaaat aaatcttagt gttcaaaaca aaatgaaaca aaaaaaaaat 2101 gataaaaact ttcag // LOCUS AF001437 2320 bp mRNA PRI 09-AUG-1997 DEFINITION Homo sapiens dihydrolipoamide dehydrogenase-binding protein mRNA, complete cds. ACCESSION AF001437 NID g2316039 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2320) AUTHORS Harris,R.A., Bowker-Kinley,M.M., Wu,P., Jeng,J. and Popov,K.M. TITLE Dihydrolipoamide dehydrogenase-binding protein of the human pyruvate dehydrogenase complex. DNA-derived amino acid sequence, expression, and reconstitution of the pyruvate dehydrogenase complex JOURNAL J. Biol. Chem. (1997) In press REFERENCE 2 (bases 1 to 2320) AUTHORS Harris,R.A., Bowker-Kinley,M.M., Wu,P., Jeng,J. and Popov,K.M. TITLE Direct Submission JOURNAL Submitted (28-APR-1997) Biochemistry and Molecular Biology, Indiana University School of Medicine, 635 Barnhill Dr., Indianapolis, IN 46202-5122, USA FEATURES Location/Qualifiers source 1..2320 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11p12-p13" CDS 9..1514 /note="E3-binding protein; E3BP; dihydrolipoamide dehydrogenase-binding protein of pyruvate dehydrogenase complex; protein X" /codon_start=1 /product="dihydrolipoamide dehydrogenase-binding protein" /db_xref="PID:g2316040" /translation="MAASWRLGCDPRLLRYLVGFPGCRSVGLVKGALGWSVSRGANWR WFHSTQWLRGDPIKILMPSLSPTMEEGNIVKWLKKEGEAVSAGDALCEIETDKAVVTL DASDDGILAKIVVEEGSKNIRLGSLIGLIVEEGEDWKHVEIPKDVGPPPPVSKPSEPR PSPEPQISIPVKKEHIPGTLRFRLSPAARNILEKHSLDASQGTATGPRGIFTKEDALK LVQLKQTGKITESRPTPAPTATPTAPSPLQATSGPSYPRPVIPPVSTPGQPNAVGTFT EIPASNIRRVIAKRLTESKSTVPHAYATADCDLGAVLKVRQDLVKDDIKVSVNDFIIK AAAVTLKQMPDVNVSWDGEGPKQLPFIDISVAVATDKGLLTPIIKDAAAKGIQEIADS VKALSKKARDGKLLPEEYQGGSFSISNLGMFGIDEFTAVINPPQACILAVGRFRPVLK LTEDEEGNAKLQQRQLITVTMSSDSRVVDDELATRFLKSFKANLENPIRLA" BASE COUNT 730 a 446 c 522 g 622 t ORIGIN 1 ggcacgagat ggcggcctcc tggaggctgg gctgtgatcc gcggctgctg cgttatcttg 61 tgggcttccc tggctgccga agcgtagggc tggtgaaggg ggctcttggg tggtccgtaa 121 gccgcggagc taattggaga tggtttcaca gcacgcagtg gcttcggggt gatcccatta 181 agatactaat gccatcactg tctcctacaa tggaagaagg aaacattgtg aaatggctga 241 aaaaggaagg tgaagcggtg agtgctggag atgcattatg tgaaattgag actgacaaag 301 ctgtggttac cttagatgca agtgatgatg gaatcttggc caaaatcgtg gttgaagaag 361 gaagtaaaaa tatacggcta ggttcactaa ttggtttgat agtagaagaa ggagaagatt 421 ggaaacatgt tgaaattccc aaagacgtag gtcctccacc accagtttca aaaccttcag 481 agcctcgccc ctcaccagaa ccacagattt ccatccctgt caagaaggaa cacatacccg 541 ggacactacg gttccgttta agtccagctg cccgcaatat tctggaaaaa cactcactgg 601 atgctagcca gggcacagcc actggccctc gggggatatt cactaaagag gatgctctca 661 aacttgtcca gttgaaacaa acgggcaaga ttaccgagtc cagaccaact ccagccccca 721 cagccactcc cacagcacct tcgcccctac aggccacatc tggaccatct tatccccggc 781 ctgtgatccc accagtatca actcccggac aacccaatgc agtgggcaca ttcactgaaa 841 tccccgccag caatattcga agagttattg ccaagagatt aactgaatct aaaagtactg 901 tacctcatgc atatgctact gctgactgtg accttggagc tgttttaaaa gttaggcaag 961 atctggtcaa agatgacatt aaagtatcag taaatgattt tatcatcaag gcagcagctg 1021 ttacccttaa acaaatgcca gatgttaatg taagctggga tggagagggc ccaaagcaac 1081 tgccatttat tgacatttca gtggctgtgg caacagataa aggcttactt actccaatca 1141 taaaagatgc tgctgctaaa ggtatccagg aaattgctga ctctgtaaag gctctatcaa 1201 agaaagcaag agatggaaaa ttgttgcctg aagaatacca aggaggatct tttagtattt 1261 ccaacttggg gatgtttggc atcgacgaat ttactgcagt gattaaccct cctcaggcct 1321 gcattttggc ggttgggagg ttccgacctg tgctgaagct cactgaggat gaagagggaa 1381 atgccaaact gcagcagcgc cagctcataa cagtcacaat gtcaagtgac agtcgagtgg 1441 ttgatgacga actggcaacc aggtttctta aaagttttaa agcaaaccta gagaatccta 1501 tccgacttgc ctagtcctca aagataagaa gttggtgttc agcttagttg attcagtagt 1561 tgttaccaag aaacatatgt tataggaaaa caacttggta tttaagtatg aagtggatga 1621 aatgtttatt tatttaaggt gaaagcattt gacccagggt gtcttcatct tcaatttggg 1681 tttaatgtta tagaaataaa tgatgataaa ctctaactaa taaaggaaag agaatatttg 1741 gttactcaga tccattttta acctctggtg ctgtataaag ggaatattaa actagatgta 1801 aatcaaagta tatgtttggc tcatttgagc attttggaat atttgagaat gtatgataca 1861 tgtaaaatta aaaaaactat tagaactgta ccataattat gttgaaggta gaagtgatct 1921 tcaaagagat ggccattaac ttagcagtgg gacctcactt ttacaagcac tgctctagat 1981 atacttgaag aatttaatag gtacagaagt ttattctgga taataaataa ataaggatca 2041 cactgtatta ggggttatgg caacattatt gaatttttta tgtacataaa gccatatgtt 2101 tagggtggtt tctatctgtc ttgtttttca cttatataac actgtgaact tctaaagcaa 2161 gaggataaaa gaagcatgaa tgaaaagaat gacatttcaa aaaaatggtt caatgaaaaa 2221 ctatagctaa aatatgtaaa cctttctagg taaaccgctt gccttcatct tgagtcggaa 2281 tatatttaaa taaattgtgt tatctcttgc caaaaaaaaa // LOCUS AF001687 2794 bp DNA PRI 01-DEC-1997 DEFINITION Homo sapiens U4/U6 snRNP 60 kDa protein gene, complete cds. ACCESSION AF001687 NID g2653735 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2794) AUTHORS Lauber,J., Plessel,G., Prehm,S., Will,C.L., Groening,K. and Luehrmann,R. TITLE The human U4/U6 snRNP contains 60 and 90kD proteins that are structurally homologous to the yeast splicing factors Prp4p and Prp3p JOURNAL RNA (1997) In press REFERENCE 2 (bases 1 to 2794) AUTHORS Lauber,J., Plessel,G., Prehm,S., Will,C.L., Groening,K. and Luehrmann,R. TITLE Direct Submission JOURNAL Submitted (30-APR-1997) Institut fuer Molekularbiologie und Tumorforschung, Emil Mannkopff Str. 2, Marburg 35037, Germany FEATURES Location/Qualifiers source 1..2794 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 94..1659 /codon_start=1 /product="U4/U6 snRNP 60 kDa protein" /db_xref="PID:g2653736" /translation="MASSRASSTATKTKAPDDLVAPVVKKPHIYYGSLEEKERERLAK GESGILGKDGLKAGIEAGNINITSGEVFEIEEHISERQAELLAEFERRKRARQINVST DDSEVKACLRALGEPITLFGEGPAERRERLKNILSVVDTDALKKTKKDDEKSKKSKEE YQQTWYHEGPNSLKVARLWIANYSLPRAMKRLEEARLHKEIPETTRTSQMQELHKSLR SLNNFCSQIGDDRPISYCHFSPNSKMLATACWSGLCKLWSVPDCNLLHTLRGHNTNVG AIVFHPKSTVSLDPKDVNLASCAADGSVKLWSLDSDEPVADIEGHTVRVARVMWHPSG RFLGTTCYDRSWRLWDLEAQEEILHQEGHSMGVYDIAFHQDGSLAGTGGLDAFGRVWD LRTGRCIMFLEDHLKEIYGINFSPNGYHIATGSGDNTCKVWDLRQRRCVYTIPPHQNL VTGVKFEPIHGNFLLTGAYDNTAKIWTHPGWSPLKTLAGHEGKVMGLDISSDGQLIAT YSYDRTFKLWMLE" BASE COUNT 744 a 633 c 708 g 709 t ORIGIN 1 gggcggagca cttcccctct gctgggcgcg cggtggacgg tctgaaaggg agtgttcggg 61 tttcgctggg gcctcgcggc tccagagccc agcatggctt cctcgcgagc ctcttccacg 121 gcaaccaaaa ctaaagcacc cgacgactta gttgctccgg tcgtgaaaaa accacacatc 181 tattatggaa gtttggagga gaaggagagg gagcgtctgg ccaaaggaga gtctgggatt 241 ttggggaaag acggacttaa agcagggatc gaagctggaa atattaatat aacctctgga 301 gaagtgttcg agattgaaga gcatatcagc gagcgacagg cagaattatt ggctgagttt 361 gagagaagga agcgagcccg gcagatcaat gtttccacag atgactcaga ggtcaaagct 421 tgccttagag ccttggggga acccatcaca ctttttggag agggtcctgc tgaaagaaga 481 gaaaggttaa aaaatatcct ctcagttgtc gatactgatg ccttgaaaaa gaccaaaaag 541 gatgatgaga agtctaaaaa gtccaaagaa gagtatcagc aaacctggta tcatgaagga 601 ccaaatagct tgaaggtggc aagactatgg attgctaatt attcgttgcc cagggcaatg 661 aaacgcttgg aagaggcccg actccataag gagattcctg agacaacaag gacctcccag 721 atgcaagagc tgcacaagtc tctccggtct ttgaataatt tttgcagtca gattggggat 781 gatcggccta tctcctactg tcactttagt cccaattcca agatgctggc cacagcttgt 841 tggagtgggc tttgcaagct ctggtctgtt cctgattgca acctccttca cactcttcga 901 gggcataaca caaatgtagg agcaattgta ttccatccca aatccactgt ctccttggac 961 ccaaaagatg tcaacctggc ctcttgtgcg gctgatggct ctgtgaagct ttggagtctt 1021 gacagtgatg aaccagtggc agatattgaa ggccatacag tgcgtgtggc gcgggtaatg 1081 tggcatcctt caggacgttt cctgggcacc acctgctatg accgttcatg gcgcttatgg 1141 gatttggagg ctcaagagga gatcctgcat caggaaggcc atagcatggg tgtgtatgac 1201 attgccttcc atcaagatgg ctctttggct ggcactgggg gactggatgc atttggtcga 1261 gtttgggacc tacgcacagg acgttgtatc atgttcttag aagaccacct gaaagaaatc 1321 tatggaataa atttctcccc caatggctat cacattgcaa ccggcagtgg tgacaacacc 1381 tgcaaagtgt gggacctccg acagcggcgt tgcgtctaca ccatccctcc tcatcagaac 1441 ttagtgactg gtgtcaagtt tgagcctatc catgggaact tcttgcttac tggtgcctat 1501 gataacacag ccaagatctg gacgcaccca ggctggtccc cgctgaagac tctggctggc 1561 cacgaaggca aagtgatggg cctagatatt tcttccgatg ggcagctcat agccacttac 1621 tcatatgaca ggaccttcaa gctgtggatg ctggaataga tgacaatggg aaaaggactt 1681 gaacctcaag ctctctctaa ggagctgttt tcctcaaacg agaagaattg aagtgttagt 1741 tctatcatgt tttctgccaa ttaccatgca tagaccctca gtagaattgg atttccatgt 1801 cagcccccac tccaggaagg cagcccaatc cctaggtgat ggggaacccc tctcacggtt 1861 caaaatttat taccttttta cgccctgcca cgaactgtgt agacattgtt tttattaatc 1921 ttttgtttgg ccgggcgtgg tggctcacgc ctgtaatcct agcactttgg gaggccgagg 1981 tgggtagatc gcttgagctc aggagttcaa gatgagcctg ggcaacatgg caaatgccgt 2041 ctctgcaaaa aaatactaaa attagctggt cgcggtggct tctgcctgtg attccggcta 2101 cttgggaggc tgaggtggga gggattgctt aagcctggga ggtagaggtt gcagtgagcc 2161 gagattgcgc cattgcactc tagcctgtgt gacagagcaa gaccctgtct caaaaaaaaa 2221 aaaaaatttg ttcgaatgcc ttatagcctt cctcacagca cccaggattg tgactgactc 2281 tgcattttta attcttgaaa cttggctttc cataacatgg tacatgcttc aggcctacat 2341 atgacccaga gagcaaggtg gctgaactat agtctggaag ccctcaggta aagaggcaca 2401 tctcaccact cattgcttaa acaattgatt catagcgagc acttttcctt tccctggaga 2461 atgggatgtg aagcagtaga ccgcagccac gccgatggtt atacagtgaa gaagacttca 2521 cctcttccta ttgagtttgc ttggaatgct gacagctcag gcactctgaa ctgaacattt 2581 gctttgtcag aaaatatctt tttttttacc tttgaagttt ggcaaccttc atgttacccc 2641 aaagcaaaac cattgtgtca ggagtcaaac aaatgtttag aaagcaaaca tgacgtctct 2701 attgtacaac ctcctttctc ttggctgttt aaaggatgta cttcgtgtat taaagggtac 2761 tttatgttga gtacgaaaaa aaaaaaaaaa aaaa // LOCUS AF001787 1175 bp mRNA PRI 26-JUN-1997 DEFINITION Homo sapiens uncoupling protein 3 mRNA, complete cds. ACCESSION AF001787 NID g2198812 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1175) AUTHORS Vidal-Puig,A., Solanes,G., Grujic,D., Flier,J.S. and Lowell,B.B. TITLE UCP3: an uncoupling protein homologue expressed preferentially and abundantly in skeletal muscle and brown adipose tissue JOURNAL Biochem. Biophys. Res. Commun. 235 (1), 79-82 (1997) MEDLINE 97339440 REFERENCE 2 (bases 1 to 1175) AUTHORS Vidal-Puig,A., Solanes,G., Grujic,D., Flier,J.S. and Lowell,B.B. TITLE Direct Submission JOURNAL Submitted (30-APR-1997) Medicine, Beth Israel Deacones Medical Center, 330 Brookline Ave., Boston, MA 02215, USA FEATURES Location/Qualifiers source 1..1175 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 184..1122 /codon_start=1 /product="uncoupling protein 3" /db_xref="PID:g2198813" /translation="MVGLKPSDVPPTMAVKFLGAGTAACFADLVTFPLDTAKVRLQIQ GENQAVQTARLVQYRGVLGTILTMVRTEGPCSPYNGLVAGLQRQMSFASIRIGLYDSV KQVYTPKGADNSSLTTRILAGCTTGAMAVTCAQPTDVVKVRFQASIHLGPSRSDRKYS GTMDAYRTIAREEGVRGLWKGTLPNIMRNAIVNCAEVVTYDILKEKLLDYHLLTDNFP CHFVSAFGAGFCATVVASPVDVVKTRYMNSPPGQYFSPLDCMIKMVAQEGPTAFYKGF TPSFLRLGSWNVVMFVTYEQLKRALMKVQMLRESPF" BASE COUNT 248 a 362 c 337 g 228 t ORIGIN 1 aggaggggcc atccaatccc tgctgccacc tcctgggatg gagccctagg gagcccctgt 61 gctgcccctg ccgtggcagg actcacagcc ccaccgctgc actgaagccc agggctgtgg 121 agcagcctct ctccttggac ctcctctcgg ccctaaaggg actgggcaga gccttccagg 181 actatggttg gactgaagcc ttcagacgtg cctcccacca tggctgtgaa gttcctgggg 241 gcaggcacag cagcctgttt tgctgacctc gttacctttc cactggacac agccaaggtc 301 cgcctgcaga tccaggggga gaaccaggcg gtccagacgg cccggctcgt gcagtaccgt 361 ggcgtgctgg gcaccatcct gaccatggtg cggactgagg gtccctgcag cccctacaat 421 gggctggtgg ccggcctgca gcgccagatg agcttcgcct ccatccgcat cggcctctat 481 gactccgtca agcaggtgta cacccccaaa ggcgcggaca actccagcct cactacccgg 541 attttggccg gctgcaccac aggagccatg gcggtgacct gtgcccagcc cacagatgtg 601 gtgaaggtcc gatttcaggc cagcatacac ctcgggccat ccaggagcga cagaaaatac 661 agcgggacta tggacgccta cagaaccatc gccagggagg aaggagtcag gggcctgtgg 721 aaaggaactt tgcccaacat catgaggaat gctatcgtca actgtgctga ggtggtgacc 781 tacgacatcc tcaaggagaa gctgctggac taccacctgc tcactgacaa cttcccctgc 841 cactttgtct ctgcctttgg agccggcttc tgtgccacag tggtggcctc cccggtggac 901 gtggtgaaga cccggtatat gaactcacct ccaggccagt acttcagccc cctcgactgt 961 atgataaaga tggtggccca ggagggcccc acagccttct acaagggatt tacaccctcc 1021 tttttgcgtt tgggatcctg gaacgtggtg atgttcgtaa cctatgagca gctgaaacgg 1081 gccctgatga aagtccagat gttacgggaa tcaccgtttt gaacaagaca agaaggccac 1141 tggtagctaa cgtgtccgaa accagttaag aatgg // LOCUS AF001862 2578 bp mRNA PRI 02-JUL-1997 DEFINITION Homo sapiens FYN binding protein mRNA, complete cds. ACCESSION AF001862 NID g2232149 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2578) AUTHORS da Silva,A.J., Li,Z., de Vera,C., Canto,E., Findell,P. and Rudd,C.E. TITLE Novel T-cell Protein FYB binds FYN and SLP-76 and Regulates Interleukin 2 Production JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1997) In press REFERENCE 2 (bases 1 to 2578) AUTHORS da Silva,A.J., Li,Z. and Rudd,C.E. TITLE Direct Submission JOURNAL Submitted (30-APR-1997) Tumor Immunology, Dana-Farber Cancer Institute, 44 Binney Street, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..2578 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="tonsil" CDS 68..2419 /note="FYB; T-cell and myeloid cell specific signaling substrate; regulator of interleukin-2 in T-cells associates with src-family FYN kinase and SLP-76" /codon_start=1 /product="FYN binding protein" /db_xref="PID:g2232150" /translation="MAKYNTGGNPTEDVSVNSRPFRVTGPNSSSGIQARKNLFNNQGN ASPPAGPSNVPKFGSPKPPVAVKPSSEEKPDKEPKPPFLKPTGAGQRFGTPASLTTRD PEAKVGFLKPVGPKPINLPKEDSKPTFPWPPGNKPSLHSVNQDHDLKPLGPKSGPTPP TSENEQKQAFPKLTGVKGKFMSASQDLEPKPLFPKPAFGQKPPLSTENSHEDESPMKN VSSSKGSPAPLGVRSKSGPLKPAREDSENKDHAGEISSLPFPGVVLKPAASRGGPGVS KNGEEKKEDRKIDAAKNTFQSKINQEELASGTPPARFPKAPSKLTVGGPWGQSQEKEK GDKNSATPKQKPLPPLFTLGPPPPKPNRPPNVDLTKFHKTSSGNSTSKGQTSYSTTSL PPPPPSHPASQPPLPASHPSQPPVPSLPPRNIKPPFDLKSPVNEDNQDGVTHSDGAGN LDEEQDSEGETYEDIEASKEREKKREKEEKKRLELEKKEQKEKEKKEQEIKKKFKLTG PIQVIHLAKACCDVKGGKNELSFKQGEQIEIIRITDNPEGKWLGRTARGSYGYIKTTA VEIDYDSLKLKKDSLGAPSRPIEDDQEVYDDVAEQDDISSHSQSGSGGIFPPPPDDDI YDGIEEEDADDGFPAPPKQLDMGDEVYDDVDTSDFPVSSAEMSQGTNFGKAKTEEKDL KKLKKQEKEEKDFRKKFKYDGEIRVLYSTKVTTSITSKKWGTRDLQVKPGESLEVIQT TDDTKVLCRNEEGKYGYVLRSYLADNDGEIYDDIADGCIYDND" BASE COUNT 882 a 589 c 565 g 542 t ORIGIN 1 cttttgtctc tcagctattt tttgttccct atgtttgtag gatggaaagg cagatgtaaa 61 gtccctcatg gcgaaatata acacgggggg caacccgaca gaggatgtct cagtcaatag 121 ccgacccttc agagtcacag ggccaaactc atcttcagga atacaagcaa gaaagaactt 181 attcaacaac caaggaaatg ccagccctcc tgcaggaccc agcaatgtac ctaagtttgg 241 gtccccaaag ccacctgtgg cagtcaaacc ttcttctgag gaaaagcctg acaaggaacc 301 caagcccccg tttctaaagc ccactggagc aggccaaaga ttcggaacac cagccagctt 361 gaccaccaga gaccccgagg cgaaagtggg atttctgaaa cctgtaggcc ccaagcccat 421 caacttgccc aaagaagatt ccaaacctac atttccctgg cctcctggaa acaagccatc 481 tcttcacagt gtaaaccaag accatgactt aaagccacta ggcccgaaat ctgggcctac 541 tcctccaacc tcagaaaatg aacagaagca agcgtttccc aaattgactg gggttaaagg 601 gaaatttatg tcagcatcac aagatcttga acccaagccc ctcttcccca aacccgcctt 661 tggccagaag ccgcccctaa gtaccgagaa ctcccatgaa gacgaaagcc ccatgaagaa 721 tgtgtcttca tcaaaagggt ccccagctcc cctgggagtc aggtccaaaa gcggcccttt 781 aaaaccagca agggaagact cagaaaataa agaccatgca ggggagattt caagtttgcc 841 ctttcctgga gtggttttga aacctgctgc gagcagggga ggcccaggtg tctccaaaaa 901 tggtgaagaa aaaaaggaag ataggaagat agatgctgct aagaacacct tccagagcaa 961 aataaatcag gaagagttgg cctcagggac tcctcctgcc aggttcccta aggccccttc 1021 taagctgaca gtgggggggc catggggcca aagtcaggaa aaggaaaagg gagacaagaa 1081 ttcagccacc ccgaaacaga agccattgcc tcccttgttt accttgggtc cacctccacc 1141 aaaacccaac agaccaccaa atgttgacct gacgaaattc cacaaaacct cttctggaaa 1201 cagtactagc aaaggccaga cgtcttactc aacaacttcc ctgccaccac ctccaccatc 1261 ccatccggcc agccaaccac cattgccagc atctcaccca tcacaaccac cagtcccaag 1321 cctacctccc agaaacatta aacctccgtt tgacctaaaa agccctgtca atgaagacaa 1381 tcaagatggt gtcacgcact ctgatggtgc tggaaatcta gatgaggaac aagacagtga 1441 aggagaaaca tatgaagaca tagaagcatc caaagaaaga gagaagaaaa gggaaaagga 1501 agaaaagaag aggttagagc tggagaaaaa ggaacagaaa gagaaagaaa agaaagaaca 1561 agaaataaag aagaaattta aactaacagg ccctattcaa gtcatccatc ttgcaaaagc 1621 ttgttgtgat gtcaaaggag gaaagaatga actgagcttc aagcaaggag agcaaattga 1681 aatcatccgc atcacagaca acccagaagg aaaatggttg ggcagaacag caaggggttc 1741 atatggctat attaaaacaa ctgctgtaga gattgactat gattctttga aactgaaaaa 1801 agactctctt ggtgcccctt caagacctat tgaagatgac caagaagtat atgatgatgt 1861 tgcagagcag gatgatatta gcagccacag tcagagtgga agtggaggga tattccctcc 1921 accaccagat gatgacattt atgatgggat tgaagaggaa gatgctgatg atggtttccc 1981 tgctcctcct aaacaattgg acatgggaga tgaagtttac gatgatgtgg atacctctga 2041 tttccctgtt tcatcagcag agatgagtca aggaactaat tttggaaaag ctaagacaga 2101 agaaaaggac cttaagaagc taaaaaagca ggaaaaagaa gaaaaagact tcaggaaaaa 2161 atttaaatat gatggtgaaa ttagagtcct atattcaact aaagttacaa cttccataac 2221 ttctaaaaag tggggaacca gagatctaca ggtaaaacct ggtgaatctc tagaagttat 2281 acaaaccaca gatgacacaa aagttctctg cagaaatgaa gaagggaaat atggttatgt 2341 ccttcggagt tacctagcgg acaatgatgg agagatctat gatgatattg ctgatggctg 2401 catctatgac aatgactagc actcaacttt ggtcattctg ctgtgttcat taggtgccaa 2461 tgtgaagtct ggattttaat tggcatgtta ttgggtatca agaaaattaa tgcacaaaac 2521 cacttattat catttgttat gaaatcccaa ttatctttac aaagtgttta aagtttga // LOCUS AF001891 1389 bp mRNA PRI 19-NOV-1997 DEFINITION Homo sapiens clone lambda MEN1 region unknown protein mRNA, complete cds. ACCESSION AF001891 NID g2529720 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1389) AUTHORS Guru,S.C., Agarwal,S.K., Manickam,P., Olufemi,S.-E., Crabtree,J.S., Weisemann,J.M., Kester,M., Kim,Y.S., Emmert-Buck,M.R., Liotta,L.A., Spiegel,A.M., Boguski,M., Roe,B.A., Collins,F.S., Burns,A.L., Marx,S.J. and Chandrasekharappa,S.C. TITLE A transcript map for the 2.8-Mb region containing the multiple endocrine neoplasia type 1 locus JOURNAL Genome Res. 7 (7), 725-735 (1997) MEDLINE 97397562 REFERENCE 2 (bases 1 to 1389) AUTHORS Chandrasekharappa,S. TITLE Direct Submission JOURNAL Submitted (30-APR-1997) LGT, NHGRI, Bldg. 49, Room 3A76, Bethesda, MD 20894, USA FEATURES Location/Qualifiers source 1..1389 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11q13" /clone="lambda" /note="transcripts seen on northern blots are 1.4 and 3.5 kb" CDS 148..504 /codon_start=1 /evidence=not_experimental /product="unknown" /db_xref="PID:g2529721" /translation="MGLCKCPKRKVTNLFCFEHRVNVCEHCLVANHAKCIVQSYLQWL QDSDYNPNCRLCNIPLASRETTRLVCYDLFHWACLNERAAQLPRNTAPAGYQCPSCNG PIFPPTNLGWPRGLPH" BASE COUNT 288 a 464 c 386 g 249 t 2 others ORIGIN 1 gagggataat cggcggccgg ggctgaaggg agaggcgcag gagcctgggg agagtggtcc 61 ctgcccttcc gcgcctcgag ccatcgctac cgcccttcgg aaccagtgca gcggccgatc 121 agtaaacaca gagactgggg atcgatcatg gggctttgta agtgccccaa gagaaaggtg 181 accaacctgt tctgcttcga acatcgggtc aacgtctgcg agcactgcct ggtagccaat 241 cacgccaagt gcatcgtcca gtcctacctg caatggctcc aagatagcga ctacaacccc 301 aattgccgcc tgtgcaacat acccctggcc agccgagaga cgacccgcct tgtctgctat 361 gatctctttc actgggcctg cctcaatgaa cgtgctgccc agctaccccg aaacacggca 421 cctgccggct atcagtgccc cagctgcaat ggccccatct tccccccaac caacctgggc 481 tggccccgtg ggcttccgca ctgagagaga agctggccac agtcaactgg ggcccgggca 541 ggactgggcc tccctctgat cgatgaggtg gtgagcccag agcccgagcc cctcaacacg 601 tctgacttct ctgacttggt cttagtttta atgccagcag tacccctgga ccagaggagg 661 tagacagcgc ctctgctgcc ccagccttct acagccgagc cccccggccc ccagcttccc 721 caggccggcc cgagcagcac acagtgatcc acatgggcaa tcctgagccc ttgactcacg 781 cccctaggaa ggtgtatgat acgcgggatg atgaccggac accaggcctc catggagact 841 gtgacgatga caagtaccga cgtcggccgg ccttgggttg gctggcccag ctgctcagga 901 gccgggctgg gtcccgaagc ggccgctgac cctgctccag cgggcggggc tgctgctact 961 cttgggactg ctgggcttcc tggccctcct tgccctcatg tctcgcctag gccgggccgc 1021 agctgacagc gatcccaacc tggacccact catgaaccct cacatccgcg tgggcccctc 1081 ctgaagcccc cttgcttgtg gctaggccag cctaggatgt gggttctgtg gaggagaggc 1141 ggggtaatgg ggaagctgaa gggcacctct tcactgcccc tctccctcaa gcctaagaca 1201 ctaagacccc agacccaaag ccaagtccac cagagtggct gcaggccagg cctggagtcc 1261 ccgtgggtca agcatttgtc ttgacttgct ttcctcccgg gtctccagcc tccgacccct 1321 cgccccatga argarctggc aggtggaaat aaacaacaac tttattaaaa caaaaaaaaa 1381 aaaaaaaaa // LOCUS AF001900 2075 bp mRNA PRI 25-JUN-1997 DEFINITION Homo sapiens secreted frizzled-related protein mRNA, complete cds. ACCESSION AF001900 NID g2213818 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2075) AUTHORS Finch,P.W., He,X., Kelley,M.J., Uren,A., Schaudies,R.P., Popescu,N.C., Rudikoff,S., Aaronson,S.A., Varmus,H.E. and Rubin,J.S. TITLE Purification and Molecular Cloning of a Secreted, Frizzled-Related Antagonist of Wnt Action JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1997) In press REFERENCE 2 (bases 1 to 2075) AUTHORS Finch,P.W., Aaronson,S.A. and Rubin,J.S. TITLE Direct Submission JOURNAL Submitted (30-APR-1997) LCMB, DBS/NCI, 9000 Rockville Pike, Bethesda, MD 20892, USA REFERENCE 3 (bases 1 to 2075) AUTHORS Finch,P.W., Aaronson,S.A. and Rubin,J.S. TITLE Direct Submission JOURNAL Submitted (09-JUN-1997) LCMB, DBS/NCI, 9000 Rockville Pike, Bethesda, MD 20892, USA REMARK Sequence update by submitter FEATURES Location/Qualifiers source 1..2075 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="8" /map="8p11.1-p12" /clone="HS1" /dev_stage="embryonic" /tissue_type="lung" /cell_line="M426" /cell_type="fibroblast" CDS 303..1244 /note="FRP" /codon_start=1 /product="secreted frizzled-related protein" /db_xref="PID:g2213819" /translation="MGIGRSEGGRRGALGVLLALGAALLAVGSASEYDYVSFQSDIGP YQSGRFYTKPPQCVDIPADLRLCHNVGYKKMVLPNLLEHETMAEVKQQASSWVPLLNK NCHAGTQVFLCSLFAPVCLDRPIYPCRWLCEAVRDSCEPVMQFFGFYWPEMLKCDKFP EGDVCIAMTPPNATEASKPQGTTVCPPCDNELKSEAIIEHLCASEFALRMKIKEVKKE NGDKKIVPKKKKPLKLGPIKKKDLKKLVLYLKNGADCPCHQLDNLSHHFLIMGRKVKS QYLLTAIHKWDKKNKEFKNFMKKMKNHECPTFQSVFK" BASE COUNT 473 a 596 c 626 g 380 t ORIGIN 1 cctgcagcct ccggagtcag tgccgcgcgc ccgccgcccc gcgccttcct gctcgccgca 61 cctccgggag ccggggcgca cccagcccgc agcgccgcct ccccgcccgc gccgcctccg 121 accgcaggcc gagggccgcc actggccggg gggaccgggc agcagcttgc ggccgcggag 181 ccgggcaacg ctggggactg cgccttttgt ccccggaggt ccctggaagt ttgcggcagg 241 acgcgcgcgg ggaggcggcg gaggcagccc cgacgtcgcg gagaacaggg cgcagagccg 301 gcatgggcat cgggcgcagc gaggggggcc gccgcggggc cctgggcgtg ctgctggcgc 361 tgggcgcggc gcttctggcc gtgggctcgg ccagcgagta cgactacgtg agcttccagt 421 cggacatcgg cccgtaccag agcgggcgct tctacaccaa gccacctcag tgcgtggaca 481 tccccgcgga cctgcggctg tgccacaacg tgggctacaa gaagatggtg ctgcccaacc 541 tgctggagca cgagaccatg gcggaggtga agcagcaggc cagcagctgg gtgcccctgc 601 tcaacaagaa ctgccacgcc gggacccagg tcttcctctg ctcgctcttc gcgcccgtct 661 gcctggaccg gcccatctac ccgtgtcgct ggctctgcga ggccgtgcgc gactcgtgcg 721 agccggtcat gcagttcttc ggcttctact ggcccgagat gcttaagtgt gacaagttcc 781 cggaggggga cgtctgcatc gccatgacgc cgcccaatgc caccgaagcc tccaagcccc 841 aaggcacaac ggtgtgtcct ccctgtgaca acgagttgaa atctgaggcc atcattgaac 901 atctctgtgc cagcgagttt gcactgagga tgaaaataaa agaagtgaaa aaagaaaatg 961 gcgacaagaa gattgtcccc aagaagaaga agcccctgaa gttggggccc atcaagaaga 1021 aggacctgaa gaagcttgtg ctgtacctga agaatggggc tgactgtccc tgccaccagc 1081 tggacaacct cagccaccac ttcctcatca tgggccgcaa ggtgaagagc cagtacttgc 1141 tgacggccat ccacaagtgg gacaagaaaa acaaggagtt caaaaacttc atgaagaaaa 1201 tgaaaaacca tgagtgcccc acctttcagt ccgtgtttaa gtgattctcc cgggggcagg 1261 gtggggaggg agcctcgggt ggggtgggag cgggggggac agtgcccggg aacccgtggt 1321 cacacacacg cactgccctg tcagtagtgg acattgtaat ccagtcggct tgttcttgca 1381 gcattcccgc tccctttccc tccatagcca cgctccaaac cccagggtag ccatggccgg 1441 gtaaagcaag ggccatttag attaggaagg tttttaagat ccgcaatgtg gagcagcagc 1501 cactgcacag gaggaggtga caaaccattt ccaacagcaa cacagccact aaaacacaaa 1561 aagggggatt gggcggaaag tgagagccag cagcaaaaac tacattttgc aacttgttgg 1621 tgtggatcta ttggctgatc tatgcctttc aactagaaaa ttctaatgat tggcaagtca 1681 cgttgttttc aggtccagag tagtttcttt ctgtctgctt taaatggaaa cagactcata 1741 ccacacttac aattaaggtc aagcccagaa agtgataagt gcagggagga aaagtgcaag 1801 tccattatct aatagtgaca gcaaagggac caggggagag gcattgcctt ctctgcccac 1861 agtctttccg tgtgattgtc tttgaatctg aatcagccag tctcagatgc cccaaagttt 1921 cggttcctat gagcccgggg catgatctga tccccaagac atgtggaggg gcagcctgtg 1981 cctgcctttg tgtcagaaaa aggaaaccac agtgagcctg agagagacgg cgattttcgg 2041 gctgagaagg cagtagtttt caaaacacat agtta // LOCUS AF001954 2061 bp mRNA PRI 11-JUN-1997 DEFINITION Homo sapiens growth inhibitor p33ING1 (ING1) mRNA, complete cds. ACCESSION AF001954 NID g2183220 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2061) AUTHORS Garkavtsev,I., Kazarov,A., Gudkov,A. and Riabowol,K. TITLE Suppression of the novel growth inhibitor p33ING1 promotes neoplastic transformation JOURNAL Nature Genet. 14 (4), 415-420 (1996) MEDLINE 97099452 REFERENCE 2 (bases 1 to 2061) AUTHORS Garkavtsev,I. and Riabowol,K. TITLE Extension of the replicative life span of human diploid fibroblasts by inhibition of the p33ING1 candidate tumor suppressor JOURNAL Mol. Cell. Biol. 17 (4), 2014-2019 (1997) MEDLINE 97219991 REFERENCE 3 (bases 1 to 2061) AUTHORS Garkavtsev,I., Kazarov,A., Gudkov,A. and Riabowol,K. TITLE Direct Submission JOURNAL Submitted (01-MAY-1997) Medical Biochemistry, University of Calgary HSC, 3330 Hospital Dr. NW, Calgary, Alberta T2N 4N1, Canada FEATURES Location/Qualifiers source 1..2061 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="13" /map="13q33-q34" /cell_line="Hs68" /cell_type="diploid fibroblast" gene 16..900 /gene="ING1" CDS 16..900 /gene="ING1" /note="growth inhibitor; tumor suppressor candidate" /codon_start=1 /product="p33ING1" /db_xref="PID:g2183221" /translation="MPLCTATRIPRYSSSSDPGPVARGRGCSSDRLPRPAGPARRQFQ AASLLTRGWGRAWPWKQILKELDECYERFSRETDGAQKRRMLHCVQRALIRSQELGDE KIQIVSQMVELVENRTRQVDSHVELFEAQQELGDTVGNSGKVGADRPNGDAVAQSDKP NSKRSRRQRNNENRENASSNHDHDDGASGTPKEKKAKTSKKKKRSKAKAEREASPADL PIDPNEPTYCLCNQVSYGEMIGCDNDECPIEWFHFSCVGLNHKPKGKWYCPKCRGENE KTMDKALEKSKKERAYNR" BASE COUNT 602 a 439 c 515 g 505 t ORIGIN 1 gagtaacccg ataatatgcc gttgtgcacg gcgacgagaa ttcccagata tagcagtagc 61 agtgatcccg ggcctgtggc tcggggccgg ggctgcagtt cggaccgcct cccgcgaccc 121 gcggggccgg ctcggagaca gtttcaggcc gcatctttgc tgacccgagg gtggggccgc 181 gcgtggccgt ggaaacagat cctgaaggag ctagacgagt gctacgagcg cttcagtcgc 241 gagacagacg gggcgcagaa gcggcggatg ctgcactgtg tgcagcgcgc gctgatccgc 301 agccaggagc tgggcgacga gaagatccag atcgtgagcc agatggtgga gctggtggag 361 aaccgcacgc ggcaggtgga cagccacgtg gagctgttcg aggcgcagca ggagctgggc 421 gacacagtgg gcaacagcgg caaggttggc gcggacaggc ccaatggcga tgcggtagcg 481 cagtctgaca agcccaacag caagcgctca cggcggcagc gcaacaacga gaaccgtgag 541 aacgcgtcca gcaaccacga ccacgacgac ggcgcctcgg gcacacccaa ggagaagaag 601 gccaagacct ccaagaagaa gaagcgctcc aaggccaagg cggagcgaga ggcgtcccct 661 gccgacctcc ccatcgaccc caacgaaccc acgtactgtc tgtgcaacca ggtctcctat 721 ggggagatga tcggctgcga caacgacgag tgccccatcg agtggttcca cttctcgtgc 781 gtggggctca atcataaacc caagggcaag tggtactgtc ccaagtgccg gggggagaac 841 gagaagacca tggacaaagc cctggagaaa tccaaaaaag agagggctta caacaggtag 901 tttgtggaca ggcgcctggt gtgaggagga caaaataaac cgtgtattta ttacattgct 961 gcctttgttg aggtgcaagg agtgtaaaat gtatattttt aaagaatgtt agaaaaggaa 1021 ccattccttt catagggatg gcagtgattc tgtttgcctt ttgttttcat tggtacacgt 1081 gtaacaagaa agtggtctgt ggatcagcat tttagaaact acaaatatag gtttgattca 1141 acacttaagt ctcagactga tttcttgcgg gaggaggggg actaaactca ccctaacaca 1201 ttaaatgtgg aaggaaaata tttcattagc ttttttattt taatacaagt aatattatta 1261 ctttatgaac aatttttttt aattggccat gtcgccaaaa atacagccta tagtaaatgt 1321 gtttcttgct gccatgatgt atatccatat aacaattcag taacaaaggt ttaaagtttg 1381 aagattattt tttaaaaagg taaaaggtta aattttacat gacagatatt ttatctattg 1441 gcctgttccc caaatggcca ttttaaaatg cttgggtaca cttctcttaa gtggtctagt 1501 caaggaacct caagtcatgc ttttgctatc accaatcata gtgtacccat ctttaattta 1561 tatcaggtgt ataaatgtac atttccaaat gaacttgcac tgtaatatta taattggaag 1621 tgcagtcagc agtagctgtc ggagctaatg tcacaattat gtgcaaaggt gtgcttcctg 1681 ctgtatgtga gctgtaaaaa tgttacgtga agaaataaat gaaacttggc cagtttgttc 1741 ctctagtagt atatttaatt ttgacataag taacttttaa aatttgtctt aaaaatttat 1801 acaccagcaa tttagacaaa gccttaagca aattttgtat tattgttctc acttattatt 1861 aataatgaag tagaagttac ttaattgcca gcaaataaat acgtgtcaaa aaagaatctg 1921 tattcagacc cctggggtca ggaaattact gccccacttg tcaagttcag cccaccatct 1981 gtttgaacat tatatgaagt ttaaattcta gtgtccataa ataaagtttc agcggcaccc 2041 caaaaaaaaa aaaaaaaaaa a // LOCUS AF002020 4673 bp mRNA PRI 25-JUL-1997 DEFINITION Homo sapiens Niemann-Pick C disease protein (NPC1) mRNA, complete cds. ACCESSION AF002020 NID g2276462 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4673) AUTHORS Carstea,E.D., Morris,J.A., Coleman,K.G., Loftus,S.K., Zhang,D., Cummings,C., Gu,J., Rosenfeld,M.A., Pavan,W.J., Krizman,D.B., Nagle,J., Polymeropoulos,M.H., Sturley,S.L., Ioannou,Y.A., Higgins,M.E., Comly,M., Cooney,A., Brown,A., Kaneski,C.R., Blanchette-Mackie,E.J., Dwyer,N.K., Neufeld,E.B., Chang,T.Y., Liscum,L., Strauss,J.F. III, Ohno,K., Zeigler,M., Carmi,R., Sokol,J., Markie,D., O'Neill,R.R., van Diggelen,O.P., Elleder,M., Patterson,M.C., Brady,R.O., Vanier,M.T., Pentchev,P.G. and Tagle,D.A. TITLE Niemann-pick C1 disease gene: homology to mediators of cholesterol homeostasis JOURNAL Science 277 (5323), 228-231 (1997) MEDLINE 97362323 REFERENCE 2 (bases 1 to 4673) AUTHORS Morris,J.A. TITLE Direct Submission JOURNAL Submitted (02-MAY-1997) Dev. and Metabol. Neurology Branch, NINDS/NIH, Bldg. 10, Room 3D-04, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..4673 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="18" /map="18q11" gene 1..3960 /gene="NPC1" CDS 124..3960 /gene="NPC1" /codon_start=1 /product="Niemann-Pick C disease protein" /db_xref="PID:g2276463" /translation="MTARGLALGLLLLLLCPAQVFSQSCVWYGECGIAYGDKRYNCEY SGPPKPLPKDGYDLVQELCPGFFFGNVSLCCDVRQLQTLKDNLQLPLQFLSRCPSCFY NLLNLFCELTCSPRQSQFLNVTATEDYVDPVTNQTKTNVKELQYYVGQSFANAMYNAC RDVEAPSSNDKALGLLCGKDADACNATNWIEYMFNKDNGQAPFTITPVFSDFPVHGME PMNNATKGCDESVDEVTAPCSCQDCSIVCGPKPQPPPPPAPWTILGLDAMYVIMWITY MAFLLVFFGAFFAVWCYRKRYFVSEYTPIDSNIAFSVNASDKGEASCCDPVSAAFEGC LRRLFTRWGSFCVRNPGCVIFFSLVFITACSSGLVFVRVTTNPVDLWSAPSSQARLEK EYFDQHFGPFFRTEQLIIRAPLTDKHIYQPYPSGADVPFGPPLDIQILHQVLDLQIAI ENITASYDNETVTLQDICLAPLSPYNTNCTILSVLNYFQNSHSVLDHKKGDDFFVYAD YHTHFLYCVRAPASLNDTSLLHDPCLGTFGGPVFPWLVLGGYDDQNYNNATALVITFP VNNYYNDTEKLQRAQAWEKEFINFVKNYKNPNLTISFTAERSIEDELNRESDSDVFTV VISYAIMFLYISLALGHIKSCRRLLVDSKVSLGIAGILIVLSSVACSLGVFSYIGLPL TLIVIEVIPFLVLAVGVDNIFILVQAYQRDERLQGETLDQQLGRVLGEVAPSMFLSSF SETVAFFLGALSVMPAVHTFSLFAGLAVFIDFLLQITCFVSLLGLDIKRQEKNRLDIF CCVRGAEDGTSVQASESCLFRFFKNSYSPLLLKDWMRPIVIAIFVGVLSFSIAVLNKV DIGLDQSLSMPDDSYMVDYFKSISQYLHAGPPVYFVLEEGHDYTSSKGQNMVCGGMGC NNDSLVQQIFNAAQLDNYTRIGFAPSSWIDDYFDWVKPQSSCCRVDNITDQFCNASVV DPACVRCRPLTPEGKQRPQGGDFMRFLPMFLSDNPNPKCGKGGHAAYSSAVNILLGHG TRVGATYFMTYHTVLQTSADFIDALKKARLIASNVTETMGINGSAYRVFPYSVFYVFY EQYLTIIDDTIFNLGVSLGAIFLVTMVLLGCELWSAVIMCATIAMVLVNMFGVMWLWG ISLNAVSLVNLVMSCGISVEFCSHITRAFTVSMKGSRVERAEEALAHMGSSVFSGITL TKFGGIVVLAFAKSQIFQIFYFRMYLAMVLLGATHGLIFLPVLLSYIGPSVNKAKSCA TEERYKGTERERLLNF" BASE COUNT 1103 a 1154 c 1131 g 1285 t ORIGIN 1 tttgctcctg ctcctccgct cctcctgcgc ggggtgctga aacagcccgg ggaagtagag 61 ccgcctccgg ggagcccaac cagccgaacg ccgccggcgt cagcagcctt gcgcggccac 121 agcatgaccg ctcgcggcct ggcccttggc ctcctcctgc tgctactgtg tccagcgcag 181 gtgttttcac agtcctgtgt ttggtatgga gagtgtggaa ttgcatatgg ggacaagagg 241 tacaattgcg aatattctgg cccaccaaaa ccattgccaa aggatggata tgacttagtg 301 caggaactct gtccaggatt cttctttggc aatgtcagtc tctgttgtga tgttcggcag 361 cttcagacac taaaagacaa cctgcagctg cctctacagt ttctgtccag atgtccatcc 421 tgtttttata acctactgaa cctgttttgt gagctgacat gtagccctcg acagagtcag 481 tttttgaatg ttacagctac tgaagattat gttgatcctg ttacaaacca gacgaaaaca 541 aatgtgaaag agttacaata ctacgtcgga cagagttttg ccaatgcaat gtacaatgcc 601 tgccgggatg tggaggcccc ctcaagtaat gacaaggccc tgggactcct gtgtgggaag 661 gacgctgacg cctgtaatgc caccaactgg attgaataca tgttcaataa ggacaatgga 721 caggcacctt ttaccatcac tcctgtgttt tcagattttc cagtccatgg gatggagccc 781 atgaacaatg ccaccaaagg ctgtgacgag tctgtggatg aggtcacagc accatgtagc 841 tgccaagact gctctattgt ctgtggcccc aagccccagc ccccacctcc tcctgctccc 901 tggacgatcc ttggcttgga cgccatgtat gtcatcatgt ggatcaccta catggcgttt 961 ttgcttgtgt tttttggagc attttttgca gtgtggtgct acagaaaacg gtattttgtc 1021 tccgagtaca ctcccatcga tagcaatata gctttttctg ttaatgcaag tgacaaagga 1081 gaggcgtcct gctgtgaccc tgtcagcgca gcatttgagg gctgcttgag gcggctgttc 1141 acacgctggg ggtctttctg cgtccgaaac cctggctgtg tcattttctt ctcgctggtc 1201 ttcattactg cgtgttcgtc aggcctggtg tttgtccggg tcacaaccaa tccagttgac 1261 ctctggtcag cccccagcag ccaggctcgc ctggaaaaag agtactttga ccagcacttt 1321 gggcctttct tccggacgga gcagctcatc atccgggccc ctctcactga caaacacatt 1381 taccagccat acccttcggg agctgatgta ccctttggac ctccgcttga catacagata 1441 ctgcaccagg ttcttgactt acaaatagcc atcgaaaaca ttactgcctc ttatgacaat 1501 gagactgtga cacttcaaga catctgcttg gcccctcttt caccgtataa cacgaactgc 1561 accattttga gtgtgttaaa ttacttccag aacagccatt ccgtgctgga ccacaagaaa 1621 ggggacgact tctttgtgta tgccgattac cacacgcact ttctgtactg cgtacgggct 1681 cctgcctctc tgaatgatac aagtttgctc catgaccctt gtctgggtac gtttggtgga 1741 ccagtgttcc cgtggcttgt gttgggaggc tatgatgatc aaaactacaa taacgccact 1801 gcccttgtga ttaccttccc tgtcaataat tactataatg atacagagaa gctccagagg 1861 gcccaggcct gggaaaaaga gtttattaat tttgtgaaaa actacaagaa tcccaatctg 1921 accatttcct tcactgctga acgaagtatt gaagatgaac taaatcgtga aagtgacagt 1981 gatgtcttca ccgttgtaat tagctatgcc atcatgtttc tatatatttc cctagccttg 2041 gggcacatca aaagctgtcg caggcttctg gtggattcga aggtctcact aggcatcgcg 2101 ggcatcttga tcgtgctgag ctcggtggct tgctccttgg gtgtcttcag ctacattggg 2161 ttgcccttga ccctcattgt gattgaagtc atcccgttcc tggtgctggc tgttggagtg 2221 gacaacatct tcattctggt gcaggcctac cagagagatg aacgtcttca aggggaaacc 2281 ctggatcagc agctgggcag ggtcctagga gaagtggctc ccagtatgtt cctgtcatcc 2341 ttttctgaga ctgtagcatt tttcttagga gcattgtccg tgatgccagc cgtgcacacc 2401 ttctctctct ttgcgggatt ggcagtcttc attgactttc ttctgcagat tacctgtttc 2461 gtgagtctct tggggttaga cattaaacgt caagagaaaa atcggctaga catcttttgc 2521 tgtgtcagag gtgctgaaga tggaacaagc gtccaggcct cagagagctg tttgtttcgc 2581 ttcttcaaaa actcctattc tccacttctg ctaaaggact ggatgagacc aattgtgata 2641 gcaatatttg tgggtgttct gtcattcagc atcgcagtcc tgaacaaagt agatattgga 2701 ttggatcagt ctctttcgat gccagatgac tcctacatgg tggattattt caaatccatc 2761 agtcagtacc tgcatgcggg tccgcctgtg tactttgtcc tggaggaagg gcacgactac 2821 acttcttcca aggggcagaa catggtgtgc ggcggcatgg gctgcaacaa tgattccctg 2881 gtgcagcaga tatttaacgc ggcgcagctg gacaactata cccgaatagg cttcgccccc 2941 tcgtcctgga tcgacgatta tttcgactgg gtgaagccac agtcgtcttg ctgtcgagtg 3001 gacaatatca ctgaccagtt ctgcaatgct tcagtggttg accctgcctg cgttcgctgc 3061 aggcctctga ctccggaagg caaacagagg cctcaggggg gagacttcat gagattcctg 3121 cccatgttcc tttcggataa ccctaacccc aagtgtggca aagggggaca tgctgcctat 3181 agttctgcag ttaacatcct ccttggccat ggcaccaggg tcggagccac gtacttcatg 3241 acctaccaca ccgtgctgca gacctctgct gactttattg acgctctgaa gaaagcccga 3301 cttatagcca gtaatgtcac cgaaaccatg ggcattaacg gcagtgccta ccgagtattt 3361 ccttacagtg tgttttatgt cttctacgaa cagtacctga ccatcattga cgacactatc 3421 ttcaacctcg gtgtgtccct gggcgcgata tttctggtga ccatggtcct cctgggctgt 3481 gagctctggt ctgcagtcat catgtgtgcc accatcgcca tggtcttggt caacatgttt 3541 ggagttatgt ggctctgggg catcagtctg aacgctgtat ccttggtcaa cctggtgatg 3601 agctgtggca tctccgtgga gttctgcagc cacataacca gagcgttcac ggtgagcatg 3661 aaaggcagcc gcgtggagcg cgcggaagag gcacttgccc acatgggcag ctccgtgttc 3721 agtggaatca cacttacaaa atttggaggg attgtggtgt tggcttttgc caaatctcaa 3781 attttccaga tattctactt caggatgtat ttggccatgg tcttactggg agccactcac 3841 ggattaatat ttctccctgt cttactcagt tacatagggc catcagtaaa taaagccaaa 3901 agttgtgcca ctgaagagcg atacaaagga acagagcgcg aacggcttct aaatttctag 3961 ccctctcgca gggcatcctg actgaactgt gtctaagggt cggtcggttt accactggac 4021 gggtgctgca tcggcaaggc caagttgaac accggatggt gccaaccatc ggttgtttgg 4081 cagcagcttt gaacgtagcg cctgtgaact caggaatgca cagttgactt gggaagcagt 4141 attactagat ctggaggcaa ccacaggaca ctaaacttct cccagcctct tcaggaaaga 4201 aacctcattc tttggcaagc aggaggtgac actagatggc tgtgaatgtg atccgctcac 4261 tgacactctg taaaggccaa tcaatgcact gtctgtcctc tcctttttag gagtaagcca 4321 tcccacaagt tctataccat atttttagtg acagttgagg ttgtagatac actttataac 4381 attttatagt ttaaagagct ttattaatgc aataaattaa ctttgtacac atttttatat 4441 aaaaaaacag caagtgattt cagaatgttg taggcctcat tagagcttgg tctccaaaaa 4501 tctgtttgaa aaaagcaaca tgttcttcac agtgttcccc tagaaaggaa gagatttaat 4561 tgccagttag atgtggcatg aaatgaggga caaagaaagc atctcgtagg tgtgtctact 4621 gggttttaac ttatttttct ttaataaaat acattgtttt cctaaaaaaa aaa // LOCUS AF002163 4950 bp mRNA PRI 19-SEP-1997 DEFINITION Homo sapiens delta-adaptin mRNA, complete cds. ACCESSION AF002163 NID g2290769 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4950) AUTHORS Ooi,C.E., Moreira,J.E., Dell'Angelica,E.C., Poy,G., Wassarman,D.A. and Bonifacino,J.S. TITLE Altered expression of a novel adaptin leads to defective pigment granule biogenesis in the Drosophila eye color mutant garnet JOURNAL EMBO J. 16 (15), 4508-4518 (1997) MEDLINE 97447555 REFERENCE 2 (bases 1 to 4950) AUTHORS Ooi,C.E., Moreira,J.E., Dell'Angelica,E.C., Poy,G., Wassarman,D.A. and Bonifacino,J.S. TITLE Direct Submission JOURNAL Submitted (02-MAY-1997) Cell Biology and Metabolism Branch, NICHD, National Institutes of Health, Bldg.18T/Rm.101, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..4950 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 312..3773 /note="subunit of putative vesicle coat adaptor complex AP-3" /codon_start=1 /product="delta-adaptin" /db_xref="PID:g2290770" /translation="MALKMVKGSIDRMFDKNLQDLVRGIRNHKEDEAKYISQCIDEIK QELKQDNIAVKANAVCKLTYLQMLGYDISWAAFNIIEVMSASKFTFKRIGYLAASQSF HEGTDVIMLTTNQIRKDLSSPSQYDTGVALTGLSCFVTPDLARDLANDIMTLMSHTKP YIRKKAVLIMYKVFLKYPESLRPAFPRLKEKLEDPDPGVQSAAVNVICELARRNPKNY LSLAPLFFKLMTSSTNNWVLIKIIKLFGALTPLEPRLGKKLIEPLTNLIHSTSAMSLL YECVNTVIAVLISLSSGMPNHSASIQLCVQKLRILIEDSDQNLKYLGLLAMSKILKTH PKSVQSHKDLILQCLDDKDESIRLRALDLLYGMVSKKNLMEIVKKLMTHVDKAEGTTY RDELLTKIIDICSQSNYQYITNFEWYISILVELTRLEGTRHGHLIAAQMLDVAIRVKA IRKFAVSQMSALLDSAHLLASSTQRNGICEVLYAAAWICGEFSEHLQEPHHTLEAMLR PRVTTLPGHIQAVYVQNVVKLYASILQQKEQAGEAEGAQAVTQLMVDRLPQFVQSADL EVQERASCILQLVKHIQKLQAKDVPVAEEVSALFAGELNPVAPKAQKKVPVPEGLDLD AWINEPLSDSESEDERPRAVFHEEEQRRPKHRPSEADEEELARRREARKQEQANNPFY IKSSPSPQKRYQDTPGVEHIPVVQIDLSVPLKVPGLPMSDQYVKLEEERRHRQKLEKD KRRKKRKEKEKKGKRRHSSLPTESDEDIAPAQQVDIVTEEMPENALPSDEDDKDPNDP YRALDIDLDKPLADSEKLPIQKHRNTETSKSPEKDVPMVEKKSKKPKKKEKKHKEKER DKEKKKEKEKKKSPKPKKKKHRKEKEERTKGKKKSKKQPPGSEEAAGEPVQNGAPEEE QLPPESSYSLLAENSYVKMTCDIRGSLQEDSQVTVAIVLENRSSSILKGMELSVLDSL NARMARPQGSSVHDGVPVPFQLPPGVSNEAQYVFTIQSIVMAQKLKGTLSFIAKNDEG ATHEKLDFRLHFSCSSYLITTPCYSDAFAKLLESGDLSMSSIKVDGIRMSFQNLLAKI CFHHHFSVVERVDSCASMYSRSIQGHHVCLLVKKGENSVSVDGKCSDSTLLSNLLEEM KATLAKC" BASE COUNT 1141 a 1440 c 1421 g 948 t ORIGIN 1 ccaagcctga gtgttaattt aactctatgt tgtccgccgt gtaaacatcc gaggtcattt 61 gttgcgttga attatctgac catccttttt tactgtgact cttcccattc tctttggcaa 121 gaagtcccct tctcgccccc aaaccagcaa gggactcccc cacctgggtc tgtgccctgc 181 cccgcgctgg gggccgagtc cttgaatgtg gcttcagggg ctcctgtcct gacggttgcg 241 tccgggggag gggaaggaag ggccgctgtc gccaaggttt tctctcccag aacccacagt 301 gggccgccgc gatggccctc aagatggtga agggcagcat cgaccgcatg ttcgacaaga 361 atctgcagga cttggtccgc ggcatccgta accacaagga ggacgaggca aaatacatat 421 ctcagtgcat tgatgagatc aagcaggagc tgaagcagga caacatagcg gtgaaggcga 481 acgcggtctg caagctgacg tatttacaga tgttgggata cgacatcagc tgggccgcct 541 tcaacatcat agaagtgatg agtgcctcca agttcacctt caagcgaatt ggctacctcg 601 ctgcttccca gagctttcac gaaggcaccg acgtcatcat gctgaccacc aatcagatcc 661 gtaaggactt gagcagcccc agccagtacg acacaggtgt tgcactgacg ggtctgtcct 721 gcttcgtcac cccagacctt gccagagacc tggcaaatga catcatgaca ctgatgtcac 781 acaccaagcc ctacatcagg aagaaggctg tgctgatcat gtacaaggtg ttcctgaagt 841 accccgagtc gctgcgccct gcctttcccc ggctgaagga gaagctggag gaccccgacc 901 ccggggttca gtcggctgcc gtcaatgtca tctgcgagct ggccagacgc aaccctaaga 961 actacctgtc cctggccccg ctctttttca agctgatgac gtcctccacc aacaactggg 1021 tcctcatcaa gatcatcaag ctgttcggtg ctcttactcc tttggaaccg cggctgggca 1081 agaagctgat cgagcccctc accaatctca tccacagcac gtctgccatg tctctcctct 1141 atgaatgtgt gaacaccgtg attgcagtgc tcatctcgct gtcctccggc atgcccaacc 1201 acagcgccag catccagctt tgtgttcaga aattaaggat attgatcgag gactccgatc 1261 agaacttgaa gtacctgggg ctgctggcaa tgtccaagat cctgaagacc caccccaagt 1321 ccgtgcagtc ccacaaggac ctcatcctgc agtgcctgga cgacaaggac gagtccatcc 1381 ggctgcgggc cctggacctg ctctatggga tggtgtccaa gaagaacctg atggagatcg 1441 tgaagaagct gatgacccac gtagacaagg cagagggtac cacctaccgt gacgagctgc 1501 tcaccaagat cattgacatc tgcagccagt ccaactacca gtacatcacc aacttcgagt 1561 ggtacatcag catcctggtg gagctgaccc ggctggaggg cacacggcac ggccacctca 1621 tcgccgccca aatgctggac gtggccatcc gcgtgaaggc catccgcaag ttcgccgtgt 1681 cccagatgtc tgcgctgctt gacagtgcac acctgctggc cagcagcacc cagcggaacg 1741 ggatctgtga ggtgctgtac gctgccgcct ggatctgcgg ggagttctca gagcatctgc 1801 aggaaccaca ccacactttg gaggccatgc tgcggcccag agtcaccacg ctgccaggcc 1861 acatccaggc cgtgtatgtg cagaacgtgg tcaagctcta cgcctccatc ctgcagcaga 1921 aggagcaggc cggggaggca gagggcgctc aggccgtcac ccagctcatg gtggaccggc 1981 tgccccagtt tgtgcagagc gcagacctgg aggtgcagga gcgggcgtcc tgcatcctgc 2041 agctggtcaa gcacatccag aagcttcagg ccaaggacgt gcctgtggca gaggaggtca 2101 gcgctctctt tgctggggag ctgaacccag tggcccccaa ggcccagaag aaggttccag 2161 tccccgaagg cctggacctg gacgcctgga tcaatgagcc actctcggac agcgagtcag 2221 aggacgagag gcccagggcc gtcttccacg aggaggagca gcggcgtccc aagcaccggc 2281 cgtcggaggc ggacgaggag gagctggctc ggcgccgaga ggcccggaag caggagcagg 2341 ccaacaaccc cttctacatc aagagctcgc catcgccaca gaagcggtac caggacaccc 2401 cgggcgtgga gcacattccc gtggtgcaga ttgacctctc cgtccccttg aaggttccag 2461 ggctgcctat gtcagatcag tatgtgaagc tggaggagga gcggcggcac cggcagaagc 2521 tggagaagga caagaggagg aaaaagagga aggagaagga gaagaagggc aagcgccgcc 2581 acagctcgct gcccacggag agcgacgagg acatcgcccc tgcccagcag gtggacatcg 2641 tcacagagga gatgcctgag aatgctctgc ccagcgacga ggatgacaaa gaccccaacg 2701 acccctacag ggctctggat attgacctgg ataagccctt agccgacagc gagaaactgc 2761 ctattcagaa acacagaaac accgagacct caaaatcccc tgagaaggac gttcccatgg 2821 tagaaaagaa gagcaagaaa cccaagaaga aagagaaaaa acacaaagag aaagagagag 2881 acaaggagaa gaagaaggag aaggagaaga agaaatctcc caagcctaag aagaagaaac 2941 acaggaagga gaaggaggag cggaccaaag gcaagaagaa gtccaagaag cagcctccag 3001 gcagcgagga ggcagcgggg gagccggtgc agaatggcgc gccagaggag gagcagctcc 3061 cgcctgagtc cagctactcc ctcctcgctg aaaattccta tgttaaaatg acctgtgaca 3121 tccggggcag tctgcaggag gacagccagg tcactgtggc catcgtgctg gagaacagga 3181 gcagcagcat cctcaagggc atggagctca gcgtgctgga ctcactcaat gccaggatgg 3241 cccggccgca gggctcctcc gtccacgatg gcgtccccgt gcctttccag ctgcccccag 3301 gcgtctccaa cgaagcccag tatgtgttca ccatccagag catcgtcatg gcgcagaagc 3361 tcaaggggac cctgtccttc attgccaaga atgacgaggg tgcgacccac gagaagctgg 3421 acttcaggct gcacttcagc tgcagctcct acttgatcac cactccctgc tacagtgacg 3481 cctttgctaa gttgctggag tctggggact tgagcatgag ctcaatcaaa gtcgatggca 3541 ttcggatgtc cttccagaat cttctggcga agatctgttt tcaccaccat ttttccgttg 3601 tggagcgagt ggactcctgc gcctccatgt acagccgctc catccagggc caccatgtct 3661 gcctcctggt gaaaaagggt gagaactctg tctcagtcga cgggaagtgc agtgactcca 3721 cgctactgag caacttgtta gaagagatga aggcgacgct ggccaagtgt tgagagctgc 3781 ctgcgagccc cgcaccaccc cgcggagcac gtacccaggg accgcagccc tgacgtgtct 3841 cgcctctcct cagtcgtgtg tactgtaccc aagcctgagt gttaatttaa ctctatgttg 3901 tccgccgtgt agacatccga ggtcatttgt tgcgttgaat tatctgacca tcctttttta 3961 ctgtgactct tcccattctc tttggcaaga agtccccttc tcgcccccaa accagcaagg 4021 gactccccca cctgggtctg tgccctgccc cgcgctgggg gccgagtcct tgaatgtggc 4081 ttcaggggct cctgtcctgg gccagggcct gatgggcacc acgtgagggg cacttggtgg 4141 acagggcggg gctgacgtgg cctcctctgg ggtcgcctgc ttttgaccca aaggtcctga 4201 cggttgcgtc cgggggaggg gaaggaaggg ccgctgtcgc caaggttttc tctcccagaa 4261 cccacagtgg gaaagcggtc ttgccaggcg ttgtccattg tcagtgtgct cgtgggctgg 4321 tgactgggtc ttgggatccc aggccacgcg ccagccaggc tgtgggcagg gcggggccag 4381 ggacgccaaa gagaggttgc agtcagaacc gtggacgggg tgggttgagg cctctctgcc 4441 acccgtcttc ctggtcagca gaagtgcatc tcggcttggg tttggggtgg tccgcatccc 4501 ctgcttgcca ctatgcgcac caaggtttcc ccacatcctt cccagcaccc ttaggaaggc 4561 ccaggcaggg cctggaagca gcggacctgg gctgttctgt gttgaaggag tgtgcccagt 4621 gcccttgggc aggacctgtg agagccacct cacaggcaga gcccccacca ggcagggcaa 4681 ggagactccg ctcactcccc acggccagcg tgggcacagg actgaccctt cttcagagat 4741 aatgacattt tatcttctcc ttttgatgaa aactgtcact ttagcatgta atccattaca 4801 gaatcccatg cagtgattcc aggatttgaa attgtatgat gtgttacata agaatttatt 4861 tgctatcgac attcccgtat aaagagagag acatatcacg ctgctgtcat gattttgtgt 4921 caagatgatc caataaagtt gtaaaacagg // LOCUS AF002210 1068 bp mRNA PRI 24-SEP-1997 DEFINITION Homo sapiens copper chaperone for superoxide dismutase (CCS) mRNA, complete cds. ACCESSION AF002210 NID g2431867 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1068) AUTHORS Culotta,V.C., Klomp,L.W., Strain,J., Casareno,R.L., Krems,B. and Gitlin,J.D. TITLE The copper chaperone for superoxide dismutase JOURNAL J. Biol. Chem. 272 (38), 23469-23472 (1997) MEDLINE 97442401 REFERENCE 2 (bases 1 to 1068) AUTHORS Culotta,V.C., Klomp,L.W.J., Strain,J., Krems,B., Casareno,R.L.B. and Gitlin,J.D. TITLE Direct Submission JOURNAL Submitted (02-MAY-1997) Pediatrics, Washington University School of Medicine, St. Louis Childrens Hospital, 1 Childrens Place, St. Louis, MO 63110, USA FEATURES Location/Qualifiers source 1..1068 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" gene 1..1068 /gene="CCS" CDS 44..868 /gene="CCS" /function="proposed role in copper delivery to copper zinc superoxide dismutase" /note="encodes a putative copper-binding site; functional and structural homolog of S. cerevisiae LYS7 gene" /codon_start=1 /product="copper chaperone for superoxide dismutase" /db_xref="PID:g2431868" /translation="MASDSGNQGTLCTLEFAVQMTCQSCVDAVRKSLQGVAGVQDVEV HLEDQMVLVHTTLPSQEVQALLEGTGRQAVLKGMGSGQLQNLGAAVAILGGPGTVQGV VRFLQLTPERCLIEGTIDGLEPGLHGLHVHQYGDLTNNCNSCGNHFNPDGASHGGPQD SDRHRGDLGNVRADADGRAIFRMEDEQLKVWDVIGRSLIIDEGEDDLGRGGHPLSKIT GNSGERLACGIIARSAGLFQNPKQICSCDGLTIWEERGRPIAGKGRKESAQPPAHL" misc_feature 101..118 /gene="CCS" /note="encodes a putative copper-binding site" BASE COUNT 231 a 279 c 353 g 205 t ORIGIN 1 cgccggagga gttctgcgtc tcggggtggt gactgggtcc agaatggctt cggattcggg 61 gaaccagggg accctctgca cgttggagtt cgcggtgcag atgacctgtc agagctgtgt 121 ggacgcggtg cgcaaatccc tgcaaggggt ggcaggtgtc caggatgtgg aggtgcactt 181 ggaggaccag atggtcttgg tacacaccac tctacccagc caggaggtgc aggctctcct 241 ggaaggcacg gggcggcagg cggtactcaa gggcatgggc agcggccagt tgcagaatct 301 gggggcagca gtggccatcc tgggggggcc tggcaccgtg cagggggtgg tgcgcttcct 361 acagctgacc cctgagcgct gcctcatcga gggaactatt gacggcctgg agcctgggct 421 gcatggactc cacgtccatc agtacgggga ccttacaaac aactgcaaca gctgtgggaa 481 tcactttaac cctgatggag catctcatgg gggcccccag gactctgacc ggcaccgcgg 541 agacctgggc aatgtccgtg ctgatgctga cggccgcgcc atcttcagaa tggaggatga 601 gcagctgaag gtgtgggatg tgattggccg cagcctgatt attgatgagg gagaagatga 661 cctgggccgg ggaggccatc ccttatccaa gatcacaggg aactccgggg agaggttggc 721 ctgtggcatc attgcacgct ccgctggcct tttccagaac cccaagcaga tctgctcttg 781 cgatggcctc accatctggg aggagcgagg ccggcccatc gctggcaagg gccgaaagga 841 gtcagcgcag ccccctgccc acctttgagc aggacctcac cttggctctg ttgctgtcct 901 ccagggcgag cactttccac ttccagaggg ggccagaggg actttgcctg cccagtcttt 961 ggagagctca gtacagggca ggagctgctg tggtgttccc ttggcaaatg aaagttttat 1021 tttcgtttgg gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa // LOCUS AF002246 7642 bp mRNA PRI 12-JUN-1997 DEFINITION Homo sapiens neural cell adhesion molecule (CALL) mRNA, complete cds. ACCESSION AF002246 NID g2190957 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7642) AUTHORS Wei,M.-H. and Lerman,M. TITLE The CALL gene is a new neural CAM and a member of the Ig superfamily JOURNAL Unpublished REFERENCE 2 (bases 1 to 7642) AUTHORS Wei,M.-H. TITLE Direct Submission JOURNAL Submitted (05-MAY-1997) IRSP, SAIC-Frederick, NCI-FCRDC, Building 560, Rm 12-71, Frederick, MD 21702, USA FEATURES Location/Qualifiers source 1..7642 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="3" /map="3p26" /tissue_type="brain" gene 272..3946 /gene="CALL" CDS 272..3946 /gene="CALL" /note="member of neural CAM superfamily" /codon_start=1 /product="neural cell adhesion molecule" /db_xref="PID:g2190958" /translation="MEPLLLGRGLIVYLMFLLLKFSKAIEIPSSVQQVPTIIKQSKVQ VAFPFDEYFQIECEAKGNPEPTFSWTKDGNPFYFTDHRIIPSNNSGTFRIPNEGHISH FQGKYRCFASNKLGIAMSEEIEFIVPSVPKFPKEKIDPLEVEEGDPIVLPCNPPKGLP PLHIYWMNIELEHIEQDERVYMSQKGDLYFANVEEKDSRNDYCCFAAFPRLRTIVQKM PMKLTVNSLKHANDSSSSTEIGSKANSIKQRKPKLLLPPTESGSESSITILKGEILLL ECFAEGLPTPQVDWNKIGGDLPKGREAKENYGKTLKIENVSYQDKGNYRCTASNFLGT ATHDFHVIVEEPPRWTKKPQSAVYSTGSNGILLCEAEGEPQPTIKWRVNGSPVDNHPF AGDVVFPREISFTNLQPNHTAVYQCEASNVHGTILANANIDVVDVRPLIQTKDGENYA TVVGYSAFLHCEFFASPEAVVSWQKVEEVKPLEGRRYHIYENGTLQINRTTEEDAGSY SCWVENAIGKTAVTANLDIRNATKLRVSPKNPRIPKLHMLELHCESKCDSHLKHSLKL SWSKDGEAFEINGTEDGRIIIDGANLTISNVTLEDQGIYCCSAHTALDSAADITQVTV LDVPDPPENLHLSERQNRSVRLTWEAGADHNSNISEYIVEFEGNKEEPGRWEELTRVQ GKKTTVILPLAPFVRYQFRVIAVNEVGRSQPSQPSDHHETPPAAPDRNPQNIRVQASQ PKEMIIKWEPLKSMEQNGPGLEYRVTWKPQGAPVEWEEETVTNHTLRVMTPAVYAPYD VKVQAINQLGSGPDPQSVTLYSGEDYPDTAPVIHGVDVINSTLVKVTWSTVPKDRVHG RLKGYQINWWKTKSLLDGRTHPKEVNILRFSGQRNSGMVPSLDAFSEFHLTVLAYNSK GAGPESEPYIFQTPEGVPEQPTFLKVIKVDKDTATLSWGLPKKLNGNLTGYLLQYQII NDTYEIGELNDINITTPSKPSWHLSNLNATTKYKFYLRACTSQGCGKPITEESSTLGE GSKGIGKISGVNLTQKTHPVEVFEPGAEHIVRLMTKNWGDNDSIFQDVIETRGREYAG LYDDISTQGWFIGLMCAIALLTLLLLTVCFVKRNRGGKYSVKEKEDLHPDPEIQSVKD ETFGEYSDSDEKPLKGSLRSLNRDMQPTESADSLVEYGEGDHGLFSEDGSFIGAYAGS KEKGSVESNGSSTATFPLRA" BASE COUNT 2412 a 1423 c 1534 g 2273 t ORIGIN 1 cggaccctgc gcgcccccgt cccggctccc ggccggctcg ggggagaagg cgcccgaggg 61 gaggcgccgg acagatcgcg tttcggaggc ggcgcaggtg ctgtaaactg caaaccataa 121 tcctgtctta atactgcaaa caaatcatag tggaactaag gggaacttaa tttactgttt 181 ccaggttaac taaggtctca gctgtaaacc aaaagtgaga ggagacatta agattttcat 241 tcttaccggg ttgtcttctt cctgaagagc aatggagccg cttttacttg gaagaggact 301 aatcgtatat ctaatgttcc tcctgttaaa attctcaaaa gcaattgaaa taccatcttc 361 agttcaacag gttccaacaa tcataaaaca gtcaaaagtc caagttgcct ttcccttcga 421 tgagtatttt caaattgaat gtgaagctaa aggaaatcca gaaccaacat tttcgtggac 481 taaggatggc aacccttttt atttcactga ccatcggata attccatcga acaattcagg 541 aacattcagg atcccaaacg aggggcacat atctcacttt caagggaaat accgctgctt 601 tgcttcaaat aaactgggaa tcgctatgtc agaagaaata gaatttatag ttccaagtgt 661 tccaaaattc ccaaaagaaa aaattgaccc tcttgaagtg gaggagggag atccaattgt 721 cctcccatgc aatcctccca aaggcctccc acctttacac atttattgga tgaatattga 781 attagaacac atcgaacaag atgaaagagt atacatgagc caaaagggag atctatactt 841 cgcaaacgtg gaagaaaagg acagtcgcaa tgactactgt tgctttgctg catttccaag 901 attaaggact attgtacaga aaatgccaat gaaactaaca gttaacagtt taaagcatgc 961 taatgactca agttcatcca cagaaattgg ttccaaggca aattccatca agcaaagaaa 1021 acccaaactg ctgttgcctc ccactgagag tggcagtgag tcttcaatta ccatcctcaa 1081 aggggaaatc ttgctgcttg agtgttttgc tgaaggcttg ccaactccac aggttgattg 1141 gaacaaaatt ggtggtgact taccaaaggg gagagaagca aaagaaaatt atggcaagac 1201 tttgaagata gagaatgtct cctaccagga caaaggaaat tatcgctgca cagccagcaa 1261 tttcttggga acagccactc acgattttca cgttatagta gaagagcctc ctcgctggac 1321 aaagaagcct cagagtgctg tgtatagcac cggaagcaat ggcatcttgt tatgtgaggc 1381 tgaaggagaa cctcaaccca caatcaagtg gagagtcaat ggctccccag ttgacaatca 1441 tccatttgct ggtgatgttg tcttccccag ggaaatcagt tttaccaacc ttcaaccaaa 1501 tcatactgct gtgtaccagt gtgaagcctc aaatgtccat ggaactatcc ttgccaatgc 1561 caatattgat gttgtggatg tccgtccatt gatacaaacc aaagatggag aaaattacgc 1621 tacagtggtt gggtacagtg ctttcttaca ttgcgagttc tttgcttcac ctgaggcagt 1681 cgtgtcctgg cagaaggtgg aagaagtgaa acccctggag ggcaggcggt atcatatcta 1741 tgaaaatggc acattgcaga tcaacagaac caccgaagaa gatgctgggt cttactcatg 1801 ttgggtagaa aatgctatag gaaaaactgc agtcacagcc aatttggata ttagaaatgc 1861 tacaaaactt agagtttctc ctaagaatcc tcgtatcccc aaattgcata tgcttgaatt 1921 acattgtgaa agcaaatgtg actcacattt gaaacacagt ttgaagttgt cctggagtaa 1981 agatggagaa gcctttgaaa ttaatggcac agaagatggc aggataatta ttgatggagc 2041 taatttgacc atatctaatg taactttaga ggaccaaggt atttactgct gttcagctca 2101 tactgctcta gacagtgctg ccgatataac tcaagtaact gttcttgatg ttccggatcc 2161 accagaaaac cttcacttgt ctgaaagaca gaacaggagt gttcggctga cctgggaagc 2221 tggagctgac cacaacagca atattagcga gtatattgtt gaatttgaag gaaacaaaga 2281 agagcctgga aggtgggagg aactgaccag agtccaagga aagaaaacca cagttatctt 2341 acctttggct ccatttgtga gataccagtt cagggtcata gccgtgaacg aagtagggag 2401 aagtcagcct agccagccgt cagaccatca tgaaacacca ccagcagctc cagataggaa 2461 tccacaaaac ataagggttc aagcctctca acccaaggaa atgattataa agtgggagcc 2521 tttgaaatcc atggagcaga atggaccagg cctagagtac agagtgacct ggaagccaca 2581 gggagcccca gtggagtggg aagaagaaac agtcacaaac cacacattgc gggtgatgac 2641 gcctgctgtc tatgcccctt atgatgtcaa ggtccaggct atcaatcaac taggatctgg 2701 gcctgaccct cagtcagtga ctctctattc tggagaagac tatcctgata cagctccagt 2761 gatccatggg gtggacgtta taaacagtac attagttaaa gttacctggt caacagttcc 2821 aaaggacaga gtacatggac gtctgaaagg ctatcagata aattggtgga aaacaaaaag 2881 tctgttggat ggaagaacac atcccaaaga agtgaacatt ctaagatttt caggacaaag 2941 aaactctgga atggttcctt ccttagatgc ctttagtgaa tttcatttaa cagtcttagc 3001 ctataactct aaaggagctg gtcctgaaag tgagccttat atatttcaaa caccagaagg 3061 agtacctgaa cagccaactt ttctaaaggt catcaaagtt gataaagaca ctgccacttt 3121 atcttgggga ctacctaaga aattaaatgg aaacttaact ggctatcttt tgcaatatca 3181 gataataaat gacacctacg agattggaga attaaatgat attaacatta caactccatc 3241 aaagcccagc tggcacctct caaacctgaa tgcaactacc aagtacaaat tctacttgag 3301 ggcttgcact tcacagggct gtggaaaacc gatcacggag gaaagctcca ccttaggaga 3361 agggagtaaa ggtatcggga agatatcagg agtaaatctt actcaaaaga ctcacccagt 3421 agaggtattt gagccgggag ctgaacatat agttcgccta atgactaaga attggggcga 3481 taacgatagc atttttcaag atgtaattga gacaagaggg agagaatatg ctggtttata 3541 tgatgacatc tccactcaag gctggtttat tggactgatg tgtgcgattg ctcttctcac 3601 actactatta ttaactgttt gctttgtgaa gaggaataga ggtggaaagt actcagttaa 3661 agaaaaggaa gatttgcatc cagacccaga aattcagtca gtaaaagatg aaacctttgg 3721 tgaatacagt gacagtgatg aaaagcctct caaaggaagc cttcggtccc ttaataggga 3781 tatgcagcct actgaaagtg ctgacagctt agtcgaatac ggagagggag accatggtct 3841 cttcagtgaa gatggatcat ttattggtgc ctacgctgga tctaaggaga agggatctgt 3901 tgaaagcaat ggaagttcta cagcaacttt tccccttcgg gcataaacac aacatatgta 3961 agcaacgcta ctggttcacc ccaaccttcc atatttatct gttcaaagga gcaagaactt 4021 tcatatagga atagaaacat gctggccgaa gatttcatcc agaagtcaac atcctgcaat 4081 tatgttgaaa agagtagtac tttcttcaaa atataaaatg ccaagcactt caggcctatg 4141 ttttgcttat attgttttca ggtgctcaaa atgcaaaaca caaaacaaat cctgcattta 4201 gatacacctc aactaaatcc aaagtcccca ttcagtatat tccatatttg cctgatttta 4261 ctattcggtg tgtttgcata gatgttgcta cttggtgggt ttttctccgt atgcacattg 4321 gtatacagtc tctgagaact ggcttggtga ctttgcttca ctacaggtta aaagaccata 4381 agcaaactgg ttatttaaaa tgtaaaaagg aatatgaaag tcttattaaa acacttcatt 4441 gaaaatatac agtctaaatt tattatttaa attttactag caaaagtctt aggtgaacaa 4501 tcaactagta tttgttgagc tcctatttgc ccagagatgg tcatatttaa acagaagtat 4561 acgtttttca gtttcaacat gaattttttt atttctgtca gttatgacat ccacgagcat 4621 cactttttgt gtctgttttt ttttttttct tggactaaat tcaactgcat ggaagcggtg 4681 gtcagaaggt tgttttatac gagaacaggc agaaagtgcc cattgttcag gattctaata 4741 gctacatcta cttaatatct tcatttctaa attgactgct tttacctttt tctcatgttt 4801 atataatggt atgcttgcat atatttcatg aatacattgt acatattatg ttaatattta 4861 cacaatttaa aatatagatg tgttttattt tgaagtgaga aaatgaacat taacaggcat 4921 gtttgtacag ctagaatata ttagtaagat actgtttttc gtcattccag agctacaact 4981 aataacacga ggttccaaag ctgaagactt tgtataaagt atttgggttt tgttcttgta 5041 ttgctttctt tcaacagttt caaaataaaa tatcatacaa atattgaggg aaatgttttc 5101 atatttttca aaataggttt ttattgttga atgtacatct accccagccc ctcaaaagaa 5161 aaactgttta catagaaatt cctacacata cgtttgcgta tatgttattt taaacatctt 5221 tgtggtgaga attttttccc cgatattctc cttctgtcaa agtcagaaca aattcaggga 5281 atttattttc tggcagttgt gctccagtcc ttttaaaatt gtacatgaac atgttttaga 5341 aacaatatgg aggatgatgc atacatgtcg gtcaagttca gcgctcgaca ttttatggaa 5401 agattttttt aaccttacca cgaaatactt aactactgtt taagtgaatt gacttatttc 5461 actttagttt ttgaactgtg attattggta tactgttata tcctcaactt ggatttatgg 5521 taaccccttt tagttcatgg agaccaaaat ttggggtatt tataatagtc agcgcaggaa 5581 tgcacatgga atatctactt gtccttttga acctcacgag tcatccagaa tgtatagaca 5641 ggaaaagcat gtcttattta aaactgtaat ttatgggctc aggatctgac cgcagtcccg 5701 ggagtaagca tttcaaaggg ggaaggcagt gtggtcccta ccctgtgtga atgtgaggat 5761 gtagacatcc atcagtgcaa ctcgagctcc atcctcctcc gatttctaag gttccagttt 5821 tctggaggga cagtcatcat gttttgattt atctgggaga aaactgtggt gcacagcttg 5881 tgaggagggc aaggttgtga cgttcgagct tagttctggt gttattctgt ctcctcttct 5941 ttgtcatcag ccaaaacgtg gtttttaaag agagtcatgc aggttagaaa taatgtcaaa 6001 aatatttagg aatttaataa cctttaagtc agaaactaaa acaaatactg aaatattagc 6061 tcttcctaca cttcgtgttc ccctttagct gcctgaaaat caagattgct cctactcaga 6121 tcttctgagt ggctaaaact tatggatatg aaaaatgaga ttgaatgatg actatgcttt 6181 gctatcattg ttacctttcc tcaatactat ttggcaacta ctgggactct tcagcacaaa 6241 aggaatagat ctatgattga ccctgatttt aattgtgaaa ttatatgatt catatatttt 6301 atgaatcaga ataaccttca aataaaataa atctaagtcg gttaaaatgg atttcatgat 6361 tttccctcag aaaatgagta acggagtccc acggcgtgca atggtaatta taaattggtg 6421 atgcttgttt gcaaattgcc cactcgtgat aagtcaacag ccaatattta aaactttgtt 6481 cgttactggc tttaccctaa ctttctctag tctactgtca atatcatttt aatgtaattg 6541 attgtatata gtctcaagaa tggttggtgg gcatgagttc ctagagaact gtccaagggt 6601 tgggaaaatc caaattctct tcctggctcc agcactgatt ttgtacataa acattaggca 6661 ggttgcttaa cctttttatt tcaaactctc tcaactctaa agtgctaata ataatctcag 6721 ttaccttatc tttgtcacag ggtgttcttt tttatgaaga aaaatttgaa aatgataaaa 6781 gctaagatgc cttctaactt cataagcaaa cctttaacta attatgtatc tgaaagtcac 6841 ccccacatac caactcaact tttttcctgt gaacacataa atatattttt atagaaaaac 6901 aaatctacat aaaataaatc tactgtttag tgagcagtat gacttgtaca tgccattgaa 6961 aattattaat cagaagaaaa ttaagcaggg tctttgctat acaaaagtgt tttccactaa 7021 ttttgcatgc gtatttataa gaaaaatgtg aatttggtgg ttttattcta tcggtataaa 7081 ggcatcgata ttttagatgc acccgtgttt gtaaaaatgt agagcacaat ggaattatgc 7141 tggaagtctc aaataatatt tttttcctat tttatactca tggaagagat aagctaaaga 7201 ggggacaata atgagaaatg ttggtgtgct tttctaagca tttaaaacat aattgccaat 7261 tgaaacccta aatatgttta cataccatta agatatgatt catgtaacaa tgttaaatta 7321 attataatgg gattgggttt gttatctgtg gtagtatata tcctagtgtt cctatagtga 7381 aataagtagg gttcagccaa agctttcttt gttttgtacc ttaaattgtt cgattacgtc 7441 atcaaaagag atgaaaggta tgtagaacag gttcacgtga ttaccttttt cttttggctt 7501 ggattaatat tcatagtaga actttataaa acgtgtttgt attgtaggtg gtgtttgtat 7561 tatgcttatg actatgtatg gtttgaaaat attttcatta tacatgaaat tcaactttcc 7621 aaataaaagt tctacttcat gt // LOCUS AF002668 1375 bp mRNA PRI 02-JUL-1997 DEFINITION Homo sapiens putative fatty acid desaturase MLD mRNA, complete cds. ACCESSION AF002668 NID g2232173 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1375) AUTHORS Cadena,D.L., Kurten,R.C. and Gill,G.N. TITLE The product of the MLD gene is a member of the membrane fatty acid desaturase family: Overexpression of MLD inhibits EGF receptor biosynthesis JOURNAL Biochemistry (1997) In press REFERENCE 2 (bases 1 to 1375) AUTHORS Kurten,R.C., Cadena,D.L. and Gill,G.N. TITLE Direct Submission JOURNAL Submitted (06-MAY-1997) Physiology & Biophysics, University of Arkansas for Medical Sciences, 4301 W. Markham Slot 750, Little Rock, AR 72205, USA FEATURES Location/Qualifiers source 1..1375 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 72..1043 /note="putative fatty acid desaturase" /codon_start=1 /product="MLD" /db_xref="PID:g2232174" /translation="MGSRVSREDFEWVYTDQPHADRRREILAKYPEIKSLMKPDPNLI WIIIMMVLTQLGAFYIVKDLDWKWVIFGAYAFGSCINHSMTLAIHEIAHNAAFGNCKA MWNRWFGMFANLPIGIPYSISFKRYHMDHHRYLGADGVDVDIPTDFEGWFFCTAFRKF IWVILQPLFYAFRPLFINPKPITYLEVINTVAQVTFDILIYYFLGIKSLVYMLAASLL GLGLHPISGHFIAEHYMFLKGHETYSYYGPLNLLTFNVGYHNEHHDFPNIPGKSLPLV RKIAAEYYDNLPHYNSWIKVLYDFVMDDTISPYSRMKRHQKGEMVLE" BASE COUNT 393 a 294 c 302 g 384 t 2 others ORIGIN 1 gccgccgcca cctctgagca gccggctggg agcgagagcc gacagctagt ctgcaagcca 61 ccgctgtcgc catggggagc cgcgtctcgc gggaagactt cgagtgggtc tacaccgacc 121 agccgcacgc cgaccggcgc cgggagatcc tggcaaagta tccagagata aagtccttga 181 tgaaacctga tcccaatttg atatggatta taattatgat ggttctcacc cagttgggtg 241 cattttacat agtaaaagac ttggactgga aatgggtcat atttggggcc tatgcgtttg 301 gcagttgcat taaccactca atgactctgg ctattcatga gattgcccac aatgctgcct 361 ttggcaactg caaagcaatg tggaatcgct ggtttggaat gtttgctaat cttcctattg 421 ggattccata ttcaatttcc tttaagaggt atcacatgga tcatcatcgg taccttggag 481 ctgatggcgt cgatgtagat attcctaccg attttgaggg ctggttcttc tgtaccgctt 541 tcagaaagtt tatatgggtt attcttcagc ctctctttta tgcctttcga cctctgttca 601 tcaaccccaa accaattacg tatctggaag ttatcaatac cgtggcacag gtcacttttg 661 acattttaat ttattacttt ttgggaatta aatccttagt ctacatgttg gcagcatctt 721 tacttggcct gggtttgcac ccaatttctg gacattttat agctgagcat tacatgttct 781 taaagggtca tgaaacttac tcatattatg ggcctctgaa tttacttacc ttcaatgtgg 841 gttatcataa tgaacatcat gatttcccca acattcctgg aaaaagtctt ccactggtga 901 ggaaaatagc agctgaatac tatgacaacc tccctcacta caattcctgg ataaaagtac 961 tgtatgattt tgtgatggat gatacaataa gtccctactc aagaatgaag aggcaccaaa 1021 aaggagagat ggtgctggag taaatatcat tagtgccaaa gggattcttc tccaaaactt 1081 tagatgataa aattagccgg gcgtggcggc acatgcctgt aatcccagct acatgggagg 1141 ctgaggtggg agaattgctt gaacccagga ggcggaggca gaggctgcag tgacccaaga 1201 ttgtgccact gcactccacc ctgggcaaca gagcaagacc ccatcntcga gagatnagat 1261 gagatatata taaaaaataa aaagctattt ctagtttatt tcactataaa gttttgcttt 1321 attaaaaagc taataaacag ctattaatca caaaaaaaaa aaaaaaaaaa aaaaa // LOCUS AF002672 3380 bp mRNA PRI 12-JUN-1997 DEFINITION Homo sapiens breast cancer suppressor candidate 1 (bcsc-1) mRNA, complete cds. ACCESSION AF002672 NID g2190973 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3380) AUTHORS Monaco,C., Negrini,M., Sozzi,G., Veronese,M.L., Vorechovsky,I., Godwin,A.K. and Croce,C.M. TITLE Molecular cloning and characterization of BCSC-1, a candidate tumor suppressor gene at 11q23 JOURNAL Unpublished REFERENCE 2 (bases 1 to 3380) AUTHORS Monaco,C., Negrini,M., Sozzi,G., Veronese,M.L., Vorechovsky,I., Godwin,A.K. and Croce,C.M. TITLE Direct Submission JOURNAL Submitted (06-MAY-1997) Dipartimento di Medicina sperimentale e diagnostica, Universita' di Ferrara, via Luigi Borsari, 46, Ferrara 44100, Italy FEATURES Location/Qualifiers source 1..3380 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11q23-q24" gene 1..3380 /gene="bcsc-1" CDS 1054..2367 /gene="bcsc-1" /note="BCSC-1" /codon_start=1 /product="breast cancer suppressor candidate 1" /db_xref="PID:g2190974" /translation="MEEALGRVKLMQADLGGTEILAPLQNIYRGPSIPGHPLQLFVFT DGEVTDTFSVIKEVRINRQKHRCFSFGIGEGTSTSLIKGIARASGGTSEFITGKDRMQ SKALRTLKRSLQPVVEDVSLSWHLPPGLSAKMLSPEQTVIFRGQRLISYAQLTGRMPA AETTGEVCLKYTLQGKTFEDKVTFPLQPKPDVNLTIHRLAAKSLLQTKDMGLRETPAS DKKDALNLSLESGVISSFTAFIAINKELNKPVQGPLAHRDVPRPILLGASAPLKIKCQ SGFRKALHSDRPPSASQPRGELMCYKAKTFQMDDYSLCGLISHKDQHSPGFGENHLVQ LIYHQNANGSWDLNEDLAKILGMSLEEIMAAQPAELVDSSGWATILAVIWLHSNGKDL KCEWELLERKAVAWMRAHAGSTMPSVVKAAITFLKSSVDPAIFAF" BASE COUNT 909 a 787 c 802 g 882 t ORIGIN 1 gcacaccatg gtgcacttct gtggcctact caccctccac cgggagccag tgccgctgaa 61 gagtatctct gtgagcgtga acatttacga gtttgtggct ggtgtgtctg caactttgaa 121 ctacgagaat gaggagaaag ttcctttgga ggccttcttt gtgttcccca tggatgaaga 181 ctctgctgtt tacagctttg aggccttggt ggatgggaag aaaattgtag cagaattaca 241 agacaagatg aaggcccgca ccaactatga gaaagccatc tcccagggcc accaggcctt 301 cttattggag ggggacagca gctccaggga tgtcttctct tgcaatgtgg gtaacctcca 361 acctgggtcg aaggcggcag tcaccctgaa gtatgtgcag gagctgcctc tggaagcaga 421 tggggctctg cgctttgtgc tcccagctgt cctgaatcct agataccagt tctctgggtc 481 gtctaaggac agttgcctta atgtgaagac tcctatagtc cctgtggagg acctgcccta 541 cacactcagc atggtcgcca ccatagattc ccagcatggc attgagaagg tccaatccaa 601 ctgccccttg agtcctaccg agtacctagg agaggacaag acttctgctc aggtttccct 661 ggctgctgga cacaagtttg atcgggacgt ggaactcctg atttactaca atgaggtgca 721 tacccccagc gtggttttgg agatggggat gcctaacatg aagccaggtc atttgatggg 781 agatccatct gcaatggtga gtttctatcc aaatatccca gaagatcaac catcaaatac 841 ctgtggagag tttatctttc tcatggaccg ctcgggaagt atgcagagcc ccatgagtag 901 ccaggataca tctcgctgcg aatacaggca gccaaggaaa cactgatttt gctgctgaag 961 agtttaccta taggctgtta tttcaacatc tatggatttg gctcttccta tgaggcatgc 1021 tttccggaga gtgtgaagta cactcagcaa acaatggagg aggctctggg gagagtgaag 1081 cttatgcagg ccgacctagg gggcactgaa atcttggcac cactccagaa catttacagg 1141 ggaccctcca tcccaggcca ccccctacag ctttttgtct ttacagatgg agaagttaca 1201 gacacgttta gtgtaattaa agaagttagg atcaacagac agaaacacag gtgtttctca 1261 tttggtattg gagaaggcac ctccaccagc ctaataaaag gtattgcccg ggcatcaggg 1321 ggcacctcag aatttatcac aggcaaagac aggatgcagt ccaaggctct caggactctg 1381 aaacgctctc tgcagcctgt ggtagaggat gtctctctga gctggcattt gcctcctggt 1441 ctgtctgcta aaatgctttc cccagaacag actgtcatct ttaggggtca gagattaatc 1501 agctatgccc agctgaccgg gaggatgcca gcagcagaga caacaggaga agtatgcctc 1561 aaatatacac tccagggcaa gacttttgag gataaggtga catttcctct acaacccaag 1621 cctgatgtca acctcaccat tcaccgcctt gctgccaagt ccttgctcca gaccaaggac 1681 atgggcctca gggagactcc agcaagtgat aaaaaagatg cattgaacct tagccttgag 1741 tctggtgtca taagctcctt cacagctttc attgctatca ataaggagct caacaagccg 1801 gttcaggggc ctctggctca tagggacgtc ccaaggccaa ttctgttggg tgcttctgcc 1861 ccattgaaga taaaatgcca atcaggtttt cgaaaggcct tacactctga ccgtcctcct 1921 tctgcatctc agcccagagg ggaacttatg tgttataagg ccaagacatt ccagatggac 1981 gattacagtc tctgtgggtt gataagtcac aaggaccagc acagtccagg ctttggagag 2041 aatcaccttg tgcagctgat ttaccaccaa aatgcaaatg gttcctggga tctgaatgaa 2101 gatctagcca agatcctagg tatgagtttg gaagaaataa tggctgcaca gcctgccgag 2161 cttgtggatt cctcaggctg ggccaccatc ctggccgtga tctggctgca cagcaatggt 2221 aaggacttga agtgtgaatg ggagcttctg gaaaggaagg ccgtggcctg gatgcgtgcc 2281 catgcaggct ccaccatgcc ttcggttgtg aaagctgcta ttactttcct gaagtcatct 2341 gtggatcctg ctatctttgc cttttgaaga taccatccag aaaaagaagt gcctttaatt 2401 tgctactgtc atttcctcta gtatcacttt tgctgtgatg atgtgttctt gtgtattata 2461 actctttatt ttttgccata aaagtaaagg atgcttactc cacttcgctt ctctgctcca 2521 ggttcacttt ggatatgatc tttcttttcc caacatatgc cctcagaaaa gtgacagtgg 2581 tcccagaacc tattcccttt cttgagggag ttcaaaacat tcataggcag taatgttcct 2641 cccagggttt ccagggaaac aacatgaaaa acaggtgaca tgaactacag actaaagatt 2701 gcagcattta tgttagagaa tgcttgaatt agagaatttt ctgcattatc tttgtctgtt 2761 cactttctat cttatatact tatcagggcc atactggtaa gcttgcgtag gaggagttag 2821 agggaagttg aaagccaaca tctggatcaa tgtaatgtca agatcacaaa gacagagact 2881 gcaggggtcc actgtgagag gtgacactgt tggggacctt cctgattcat tcttcttggg 2941 ctttgctagc ctgtacaacc tacatgtctt ttcttccact gcctgaaaga cttgggttga 3001 actataactg ttggagagag atgttcctct ttaatcatga aacaccttaa gaagtctata 3061 atgcaatcct tagtcctacc ctgaacctat gtgtcctcta agtcaggccc tgatctagtg 3121 cagtaaaggg aagggtgggc ttaatgggag ctttgcctgg gacctgaacc tggagcactt 3181 accgcattag gaagaaagga gctccccgta atcgttcctg acccttgtgt ctcatatacc 3241 ctatcctggt ggaaatgacc ctatttgata tgctgtccct taaaataact tgtatcaata 3301 ttaaaatgac tatttctacc ctttaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3361 aaaaaaaaaa aaaaaaaaaa // LOCUS AF002697 1535 bp mRNA PRI 12-OCT-1997 DEFINITION Homo sapiens E1B 19K/Bcl-2-binding protein Nip3 mRNA, nuclear gene encoding mitochondrial protein, complete cds. ACCESSION AF002697 NID g2511528 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1535) AUTHORS Chen,G., Shi,L., Ray,R., Dubik,D., Bleackley,C., Gietz,R.D. and Greenberg,A.H. TITLE The E1B 19K/Bcl-2-binding protein Nip3 is a dimeric mitochondrial protein that activates apoptosis JOURNAL J. Exp. Med. (1997) In press REFERENCE 2 (bases 1 to 1535) AUTHORS Chen,G., Shi,L., Ray,R., Dubik,D., Bleackley,C., Gietz,R.D. and Greenberg,A.H. TITLE Direct Submission JOURNAL Submitted (06-MAY-1997) Cell Biology, University of Manitoba, 100 Olivia St., Winnipeg, MB R3E 0V9, Canada FEATURES Location/Qualifiers source 1..1535 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="B cell" CDS 127..711 /note="pro-apoptotic mitochondrial protein" /codon_start=1 /product="E1B 19K/Bcl-2-binding protein Nip3" /db_xref="PID:g2511529" /translation="MSQNGAPGMQEESLQGSWVELHFSNNGNGGSVPASVSIYNGDME KILLDAQHESGRSSSKSSHCDSPPRSQTPQDTNRASETDTHSIGEKNSSQSEEDDIER RKEVESILKKNSDWIWDWSSRPENIPPKEFLFKHPKRTATLSMRNTSVMKKGGIFSAE FLKVFLPSLLLSHLLAIGLGIYIGRRLTTSTSTF" misc_feature 616..678 /note="encodes transmembrane domain" BASE COUNT 454 a 324 c 323 g 434 t ORIGIN 1 cctccgctca gtccgggagc gcacgtgggc cgcggcgctc cgacctccgc tttcccaccg 61 cccgcagctg aagcacatcc cgcagcccgg cgcggactcc gatcgccgca gttgccctct 121 ggcgccatgt cgcagaacgg agcgcccggg atgcaggagg agagcctgca gggctcctgg 181 gtagaactgc acttcagcaa taatgggaac gggggcagcg ttccagcctc ggtttctatt 241 tataatggag acatggaaaa aatactgctg gacgcacagc atgagtctgg acggagtagc 301 tccaagagct ctcactgtga cagcccacct cgctcgcaga caccacaaga taccaacagg 361 gcttctgaaa cagataccca tagcattgga gagaaaaaca gctcacagtc tgaggaagat 421 gatattgaaa gaaggaaaga agttgaaagc atcttgaaga aaaactcaga ttggatatgg 481 gattggtcaa gtcggccgga aaatattccc cccaaggagt tcctctttaa acacccgaag 541 cgcacggcca ccctcagcat gaggaacacg agcgtcatga agaaaggggg catattctct 601 gcagaatttc tgaaagtttt ccttccatct ctgctgctct ctcatttgct ggccatcgga 661 ttggggatct atattggaag gcgtctgaca acctccacca gcaccttttg atgaagaact 721 ggagtctgac ttggttcgtt agtggattac ttctgagctt gcaacatagc tcactgaaga 781 gctgttagat cctggggtgg ccacgtcact tgtgtttatt tgttctgtaa atgctgcgtt 841 cctaatttag taaaataaaa gaatagacac taaaatcatg ttgatctata attacaccta 901 tgggatcaat aagcatgtca gactgattaa tgtctactgt gaaaatttgg tagtaaattt 961 tcatttgata ttagatataa atatctgaat ataaataatt ttaatatact agtcatgatg 1021 tgtgttgtat tttaaaaatt atctgcaacc ttaattcagc tgaagtactt tatatttcaa 1081 aagaatgaat aacattgata ataaaatcgc tactttaagg ggtttgtcca aaataaatat 1141 tgtggcctta tatatcacac tattgtagaa agtattattt aatttaaatg gatgcaggtt 1201 gtctactaaa gaaagattat atataactat gctaattgtt cataatcaac agaaaccaag 1261 atagagctac aaactcagct gtacagttcg tacactaaac tcttcttgct tttgcattat 1321 aaggaattaa gtctccgatt attaggtgat caccctggat gatcagtttt ctgctgaagg 1381 cacctactca gtatcttttc ctctttatca ctctgcattg gtgaatttaa tcctctcctt 1441 tgtgttcaac ttttgtgtgc ttttaaaatc agctttattc taagcaaatc tgtgtctact 1501 ttaaaaaact ggaaatggaa aaaaaaataa atctt // LOCUS AF002700 1526 bp mRNA PRI 28-MAY-1997 DEFINITION Homo sapiens TGF-beta related neurotrophic factor receptor 2 (TRNR2) mRNA, complete cds. ACCESSION AF002700 NID g2145079 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1526) AUTHORS Baloh,R.H., Tansey,M.G., Golden,J.P., Creedon,D.J., Heuckeroth,R.O., Keck,C.L., Zimonjic,D.B., Popescu,N.C., Johnson,E.M. Jr. and Milbrandt,J. TITLE TrnR2, a novel receptor that mediates neurturin and GDNF signaling through Ret JOURNAL Neuron 18 (5), 793-802 (1997) MEDLINE 97325791 REFERENCE 2 (bases 1 to 1526) AUTHORS Baloh,R.H. and Milbrandt,J.D. TITLE Direct Submission JOURNAL Submitted (06-MAY-1997) Pathology, Washington University, 660 South Euclid Ave, Box 8118, St. Louis, MO 63110, USA FEATURES Location/Qualifiers source 1..1526 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="8" /map="8p12-p21" gene 36..1430 /gene="TRNR2" CDS 36..1430 /gene="TRNR2" /function="mediates neurturin and GDNF signaling through Ret" /note="TRN receptor, GPI-anchored" /codon_start=1 /product="TGF-beta related neurotrophic factor receptor 2" /db_xref="PID:g2145080" /translation="MILANVFCLFFFLDETLRSLASPSSLQGPELHGWRPPVDCVRAN ELCAAESNCSSRYRTLRQCLAGRDRNTMLANKECQAALEVLQESPLYDCRCKRGMKKE LQCLQIYWSIHLGLTEGEEFYEASPYEPVTSRLSDIFRLASIFSGTGADPVVSAKSNH CLDAAKACNLNDNCKKLRSSYISICNREISPTERCNRRKCHKALRQFFDRVPSEYTYR MLFCSCQDQACAERRRQTILPSCSYEDKEKPNCLDLRGVCRTDHLCRSRLADFHANCR ASYQTVTSCPADNYQACLGSYAGMIGFDMTPNYVDSSPTGIVVSPWCSCRGSGNMEEE CEKFLRDFTENPCLRNAIQAFGNGTDVNVSPKGPSFQATQAPRVEKTPSLPDDLSDST SLGTSVITTCTSVQEQGLKANNSKELSMCFTELTTNIIPGSNKVIKPNSGPSRARPSA ALTVLSVLMLKQAL" BASE COUNT 333 a 491 c 419 g 283 t ORIGIN 1 gagaaagaca aaaaaaacgg tgggatttat ttaacatgat cttggcaaac gtcttctgcc 61 tcttcttctt tctagacgag accctccgct ctttggccag cccttcctcc ctgcagggcc 121 ccgagctcca cggctggcgc cccccagtgg actgtgtccg ggccaatgag ctgtgtgccg 181 ccgaatccaa ctgcagctct cgctaccgca ctctgcggca gtgcctggca ggccgcgacc 241 gcaacaccat gctggccaac aaggagtgcc aggcggcctt ggaggtcttg caggagagcc 301 cgctgtacga ctgccgctgc aagcggggca tgaagaagga gctgcagtgt ctgcagatct 361 actggagcat ccacctgggg ctgaccgagg gtgaggagtt ctacgaagcc tccccctatg 421 agccggtgac ctcccgcctc tcggacatct tcaggcttgc ttcaatcttc tcagggacag 481 gggcagaccc ggtggtcagc gccaagagca accattgcct ggatgctgcc aaggcctgca 541 acctgaatga caactgcaag aagctgcgct cctcctacat ctccatctgc aaccgcgaga 601 tctcgcccac cgagcgctgc aaccgccgca agtgccacaa ggccctgcgc cagttcttcg 661 accgggtgcc cagcgagtac acctaccgca tgctcttctg ctcctgccaa gaccaggcgt 721 gcgctgagcg ccgccggcaa accatcctgc ccagctgctc ctatgaggac aaggagaagc 781 ccaactgcct ggacctgcgt ggcgtgtgcc ggactgacca cctgtgtcgg tcccggctgg 841 ccgacttcca tgccaattgt cgagcctcct accagacggt caccagctgc cctgcggaca 901 attaccaggc gtgtctgggc tcttatgctg gcatgattgg gtttgacatg acacctaact 961 atgtggactc cagccccact ggcatcgtgg tgtccccctg gtgcagctgt cgtggcagcg 1021 ggaacatgga ggaggagtgt gagaagttcc tcagggactt caccgagaac ccatgcctcc 1081 ggaacgccat ccaggccttt ggcaacggca cggacgtgaa cgtgtcccca aaaggcccct 1141 cgttccaggc cacccaggcc cctcgggtgg agaagacgcc ttctttgcca gatgacctca 1201 gtgacagtac cagcttgggg accagtgtca tcaccacctg cacgtctgtc caggagcagg 1261 ggctgaaggc caacaactcc aaagagttaa gcatgtgctt cacagagctc acgacaaata 1321 tcatcccagg gagtaacaag gtgatcaaac ctaactcagg ccccagcaga gccagaccgt 1381 cggctgcctt gaccgtgctg tctgtcctga tgctgaaaca ggccttgtag gctgtgggaa 1441 ccgagtcaga agatttttga aagctacgca gacaagaaca gccgcctgac gaaatggaaa 1501 cacacacaga cacacacaca ccttgc // LOCUS AF002715 5445 bp mRNA PRI 03-SEP-1997 DEFINITION Homo sapiens MAP kinase kinase kinase (MTK1) mRNA, complete cds. ACCESSION AF002715 NID g2352276 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5445) AUTHORS Takekawa,M., Posas,F. and Saito,H. TITLE A human homolog of the yeast Ssk2/Ssk22 MAP kinase kinase kinase, MTK1, regulates the p38 and JNK pathways JOURNAL Unpublished REFERENCE 2 (bases 1 to 5445) AUTHORS Takekawa,M., Posas,F. and Saito,H. TITLE Direct Submission JOURNAL Submitted (06-MAY-1997) Division of Tumor Immunology, Dana-Farber Cancer Institute, 44 Binney Street, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..5445 /organism="Homo sapiens" /db_xref="taxon:9606" gene 143..4966 /gene="MTK1" CDS 143..4966 /gene="MTK1" /codon_start=1 /product="MAP kinase kinase kinase" /db_xref="PID:g2352277" /translation="MREAAAALVPPPAFAVTPAAAMEEPPPPPPPPPPPPEPETESEP ECCLAARQEGTLGDSACKSPESDLEDFSDETNTENLYGTSPPSTPRQMKRMSTKHQRN NVGRPASRSNLKEKMNAPNQPPHKDTGKTVENVEEYSYKQEKKIRAALRTTERDHKKN VQCSFMLDSVGGSLPKKSIPDVDLNKPYLSLGCSNAKLPVSVPMPIARPARQTSRTDC PADRLKFFETLRLLLKLTSVSKKKDREQRGQENTSGFWLNRSNELIWLELQAWHAGRT INDQDFFLYTARQAIPDIINEILTFKVDYGSFAFVRDRAGFNGTSVEGQCKATPGTKI VGYSTHHEHLQRQRVSFEQVKRIMELLEYIEALYPSLQALQKDYEKYAAKDFQDRVQA LCLWLNITKDLNQKLRIMGTVLGIKNLSDIGWPVFEIPSPRPSKGNEPEYEGDDTEGE LKELESSTDESEEEQISDPRVPEIRQPIDNSFDIQSRDCISKKLERLESEDDSLGWGA PDWSTEAGFSRHCLTSIYRPFVDKALKQMGLRKLILRLHKLMDGSLQRARIALVKNDR PVEFSEFPDPMWGSDYVQLSRTPPSSEEKCSAVSWEELKAMDLPSFEPAFLVLCRVLL NVIHECLKLRLEQRPAGEPSLLSIKQLVRECKEVLKGGLLMKQYYQFMLQEVLEDLEK PDCNIDAFEEDLHKMLMVYFDYMRSWIQMLQQLPQASHSLKNLLEEEWNFTKEITHYI RGGEAQAGKLFCDIAGMLLKSTGSFLEFGLQESCAEFWTSADDSSASDEIIRSVIEIS RALKELFHEARERASKALGFAKMLRKDLEIAAEFRLSAPVRDLLDVLKSKQYVKVQIP GLENLQMFVPDTLAEEKSIILQLLNAAAGKDCSKDSDDVLIDAYLLLTKHGDRARDSE DSWGTWEAQPVKVVPQVETVDTLRSMQVDNLLLVVMQSAHLTIQRKAFQQSIEGLMTL CQEQTSSQPVIAKALQQLKNDALELCNRISNAIDRVDHMFTSEFDAEVDESESVTLQQ YYREAMIQGYNFGFEYHKEVVRLMSGEFRQKIGDKYISFARKWMNYVLTKCESGRGTR PRWATQGFDFLQAIEPAFISALPEDDFLSLQALMNECIGHVIGKPHSPVTGLYLAIHR NSPRPMKVPRCHSDPPNPHLIIPTPEGFSTRSMPSDARSHGSPAAAAAAAAAVAASRP SPSGGDSVLPKSISSAHDTRGSSVPENDRLASIAAELQFRSLSRHSSPTEERDEPAYP RGDSSGSTRRSWELRTLISQSKDTASKLGPIEAIQKSVRLFEEKRYREMRRKNIIGQV CDTPKSYDNVMHVGLRKVTFKWQRGNKIGEGQYGKVYTCISVDTGELMAMKEIRFQPN DHKTIKETADELKIFEGIKHPNLVRYFGVELHREEMYIFMEYCDEGTLEEVSRLGLQE HVIRLYSKQITIAINVLHEHGIVHRDIKGANIFLTSSGLIKLGDFGCSVKLKNNAQTM PGEVNSTLGTAAYMAPEVITRAKGEGHGRAADIWSLGCVVIEMVTGKRPWHEYEHNFQ IMYKVGMGHKPPIPERLSPEGKDFLSHCLESDPKMRWTASQLLDHSFVKVCTDEE" BASE COUNT 1550 a 1199 c 1381 g 1315 t ORIGIN 1 aagatggccg cggcgcgcac ggctcctgcg gcggggtaga ggcggaggcg gagtcgagtc 61 actcccgcac ttcggggctc cggtgccccg cgccaggctg cagcttactg cccgccgcgg 121 ccatgcgggg ctccgtgcac ggatgagaga agccgctgcc gcgctggtcc ctcctcccgc 181 ctttgccgtc acgcctgccg ccgccatgga ggagccgccg ccaccgccgc cgccgccacc 241 accgccaccg gaacccgaga ccgagtcaga acccgagtgc tgcttggcgg cgaggcaaga 301 gggcacattg ggagattcag cttgcaagag tcctgaatct gatctagaag acttctccga 361 tgaaacaaat acagagaatc tttatggtac ctctcccccc agcacacctc gacagatgaa 421 acgcatgtca accaaacatc agaggaataa tgtggggagg ccagccagtc ggtctaattt 481 gaaagaaaaa atgaatgcac caaatcagcc tccacataaa gacactggaa aaacagtgga 541 gaatgtggaa gaatacagct ataagcagga gaaaaagatc cgagcagctc ttagaacaac 601 agagcgtgat cataaaaaaa atgtacagtg ctcattcatg ttagactcag tgggtggatc 661 tttgccaaaa aaatcaattc cagatgtgga tctcaataag ccttacctca gccttggctg 721 tagcaatgct aagcttccag tatctgtgcc catgcctata gccagacctg cacgccagac 781 ttctaggact gactgtccag cagatcgttt aaagtttttt gaaactttac gacttttgct 841 aaagcttacc tcagtctcaa agaaaaaaga cagggagcaa agaggacaag aaaatacgtc 901 tggtttctgg cttaaccgat ctaacgaact gatctggtta gagctacaag cctggcatgc 961 aggacggaca attaacgacc aggacttctt tttatataca gcccgtcaag ccatcccaga 1021 tattattaat gaaatcctta ctttcaaagt cgactatggg agcttcgcct ttgttagaga 1081 tagagctggt tttaatggta cttcagtaga agggcagtgc aaagccactc ctggaacaaa 1141 gattgtaggt tactcaacac atcatgagca tctccaacgc cagagggtct catttgagca 1201 ggtaaaacgg ataatggagc tgctagagta catagaagca ctttatccat cattgcaggc 1261 tcttcagaag gactatgaaa aatatgctgc aaaagacttc caggacaggg tgcaggcact 1321 ctgtttgtgg ttaaacatca caaaagactt aaatcagaaa ttaaggatta tgggcactgt 1381 tttgggcatc aagaatttat cagacattgg ctggccagtg tttgaaatcc cttcccctcg 1441 accatccaaa ggtaatgagc cggagtatga gggtgatgac acagaaggag aattaaagga 1501 gttggaaagt agtacggatg agagtgaaga agaacaaatc tctgatccta gggtaccgga 1561 aatcagacag cccatagata acagcttcga catccagtcg cgggactgca tatccaagaa 1621 gcttgagagg ctcgaatctg aggatgattc tcttggctgg ggagcaccag actggagcac 1681 agaagcaggc tttagtagac attgtctgac ttctatttat agaccatttg tagacaaagc 1741 actgaagcag atggggttaa gaaagttaat tttaagactt cacaagctaa tggatggttc 1801 cttgcaaagg gcacgtatag cattggtaaa gaacgatcgt ccagtggagt tttctgaatt 1861 tccagatccc atgtggggtt cagattatgt gcagttgtca aggacaccac cttcatctga 1921 ggagaaatgc agtgctgtgt cgtgggagga gctgaaggcc atggatttac cttcattcga 1981 acctgccttc ctagttctct gccgagtcct tctgaatgtc atacatgagt gtctgaagtt 2041 aagattggag cagagacctg ctggagaacc atctctcttg agtattaagc agctggtgag 2101 agagtgtaag gaggtcctga agggcggcct gctgatgaag cagtactacc agttcatgct 2161 gcaggaggtt ctggaggact tggagaagcc cgactgcaac attgacgctt ttgaagagga 2221 tctacataaa atgcttatgg tgtattttga ttacatgaga agctggatcc aaatgctaca 2281 gcaattacct caagcatcgc atagtttaaa aaatctgtta gaagaagaat ggaatttcac 2341 caaagaaata actcattaca tacggggagg agaagcacag gccgggaagc ttttctgtga 2401 cattgcagga atgctgctga aatctacagg aagtttttta gaatttggct tacaggagag 2461 ctgtgctgaa ttttggacta gtgcggatga cagcagtgct tccgacgaaa tcatcaggtc 2521 tgttatagag atcagtcgag ccctgaagga gctcttccat gaagccagag aaagggcttc 2581 caaagcactt ggatttgcta aaatgttgag aaaggacctg gaaatagcag cagaattcag 2641 gctttcagcc ccagttagag acctcctgga tgttctgaaa tcaaaacagt atgtcaaggt 2701 gcaaattcct gggttagaaa acttgcaaat gtttgttcca gacactcttg ctgaggagaa 2761 gagtattatt ttgcagttac tcaatgcagc tgcaggaaag gactgttcaa aagattcaga 2821 tgacgtactc atcgatgcct atctgcttct gaccaagcac ggtgatcgag cccgtgattc 2881 agaggacagc tggggcacct gggaggcaca gcctgtcaaa gtcgtgcctc aggtggagac 2941 tgttgacacc ctgagaagca tgcaggtgga taatctttta ctagttgtca tgcagtctgc 3001 gcatctcaca attcagagaa aagctttcca gcagtccatt gagggactta tgactctgtg 3061 ccaggagcag acatccagtc agccggtcat cgccaaagct ttgcagcagc tgaagaatga 3121 tgcattggag ctatgcaaca ggataagcaa tgccattgac cgcgtggacc acatgttcac 3181 atcagaattt gatgctgagg ttgatgaatc tgaatctgtc accttgcaac agtactaccg 3241 agaagcaatg attcaggggt acaattttgg atttgagtat cataaagaag ttgttcgttt 3301 gatgtctggg gagtttagac agaagatagg agacaaatat ataagctttg cccggaagtg 3361 gatgaattat gtcctgacta aatgtgagag tggtagaggt acaagaccca ggtgggcgac 3421 tcaaggattt gattttctac aagcaattga acctgccttt atttcagctt taccagaaga 3481 tgacttcttg agtttacaag ccttgatgaa tgaatgcatt ggccatgtca taggaaaacc 3541 acacagtcct gttacaggtt tgtaccttgc cattcatcgg aacagccccc gtcctatgaa 3601 ggtacctcga tgccatagtg accctcctaa cccacacctc attatcccca ctccagaggg 3661 attcagcact cggagcatgc cttccgacgc gcggagccat ggcagccctg ctgctgctgc 3721 tgctgctgct gctgctgttg ctgccagtcg gcccagcccc tctggtggtg actctgtgct 3781 gcccaaatcc atcagcagtg cccatgatac caggggttcc agcgttcctg aaaatgatcg 3841 attggcttcc atagctgctg aattgcagtt taggtccctg agtcgtcact caagccccac 3901 ggaggagcga gatgaaccag catatccaag aggagattca agtgggtcca caagaagaag 3961 ttgggaactt cggacactaa tcagccagag taaagatact gcttctaaac taggacccat 4021 agaagctatc cagaagtcag tccgattgtt tgaagaaaag aggtaccgag aaatgaggag 4081 aaagaatatc attggtcaag tttgtgatac gcctaagtcc tatgataatg ttatgcacgt 4141 tggcttgagg aaggtgacct tcaaatggca aagaggaaac aaaattggag aaggccagta 4201 tgggaaggtg tacacctgca tcagcgtcga caccggggag ctgatggcca tgaaagagat 4261 tcgatttcaa cctaatgacc ataagactat caaggaaact gcagacgaat tgaaaatatt 4321 cgaaggcatc aaacacccca atctggttcg gtattttggt gtggagctcc atagagaaga 4381 aatgtacatc ttcatggagt actgcgatga ggggacttta gaagaggtgt caaggctggg 4441 acttcaggaa catgtgatta ggctgtattc aaagcagatc accattgcga tcaacgtcct 4501 ccatgagcat ggcatagtcc accgtgacat taaaggtgcc aatatcttcc ttacctcatc 4561 tggattaatc aaactgggag attttggatg ttcagtaaag ctcaaaaaca atgcccagac 4621 catgcctggt gaagtgaaca gcaccctggg gacagcagca tacatggcac ctgaagtcat 4681 cactcgtgcc aaaggagagg gccatgggcg tgcggccgac atctggagtc tggggtgtgt 4741 tgtcatagag atggtgactg gcaagaggcc ttggcatgag tatgagcaca actttcaaat 4801 tatgtataaa gtggggatgg gacataagcc accaatccct gaaagattaa gccctgaagg 4861 aaaggacttc ctttctcact gccttgagag tgacccaaag atgagatgga ccgccagcca 4921 gctcctcgac cattcgtttg tcaaggtttg cacagatgaa gaatgaagcc tagtagaata 4981 tggacttgga aaattctctt aatcactact gtatgtaata tttacataaa gactgtgctg 5041 agaagcagta taagcctttt taaccttcca agactgaaga ctgcacaggt gacaagcgtc 5101 acttctcctg ctgctcctgt ttgtctgatg tggcaaaagg ccctctggag ggctggtggc 5161 cacgaggtta aagaagctgc atgttaagtg ccattactac tgtacacgga ccatcgcctc 5221 tgtctcctcc gtgtctcgcg cgactgagaa ccgtgacatc agcgtagtgt tttgaccttt 5281 ctaggttcaa aagaagttgt agtgttatca ggcgtcccat accttgtttt taatctcctg 5341 tttgttgagt gcactgactg tgaaaccttt accttttttg ttgttgttgg caagctgcag 5401 gtttgtaatg caaaaggctg attactgaaa tttaagaaaa aggtt // LOCUS AF002985 995 bp mRNA PRI 01-NOV-1997 DEFINITION Homo sapiens putative alpha chemokine (H174) mRNA, complete cds. ACCESSION AF002985 NID g2580585 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 995) AUTHORS Jacobs,K.A., Collins-Racie,L.A., Colbert,M., Duckett,M., Golden-Fleet,M., Kelleher,K., Kriz,R., LaVallie,E.R., Merberg,D., Spaulding,V., Stover,J., Williamson,M.J. and McCoy,J.M. TITLE A genetic selection for isolating cDNAs encoding secreted proteins JOURNAL Gene 198 (1-2), 289-296 (1997) MEDLINE 98036061 REFERENCE 2 (bases 1 to 995) AUTHORS Jacobs,K.A., Collins-Racie,L.A., Colbert,M., Duckett,M., Golden-Fleet,M., Kelleher,K., Kriz,R., LaVallie,E.R., Merberg,D., Spaulding,V., Stover,J., Williamson,M.J. and McCoy,J.M. TITLE Direct Submission JOURNAL Submitted (07-MAY-1997) Genetics Institute, 87 Cambridge Park Drive, Cambridge, MA 02140, USA FEATURES Location/Qualifiers source 1..995 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="PHA and PMA activated human peripheral blood mononuclear cells" gene 1..995 /gene="H174" CDS 88..372 /gene="H174" /codon_start=1 /product="putative alpha chemokine" /db_xref="PID:g2580586" /translation="MSVKGMAIALAVILCATVVQGFPMFKRGRCLCIGPGVKAVKVAD IEKASIMYPSNNCDKIEVIITLKENKGQRCLNPKSKQARLIIKKVERKNF" BASE COUNT 382 a 170 c 194 g 249 t ORIGIN 1 gaattcggcc aaagaggcct acttccaaga agagcagcaa agctgaagta gcagcaacag 61 caccagcagc aacagcaaaa aacaaacatg agtgtgaagg gcatggctat agccttggct 121 gtgatattgt gtgctacagt tgttcaaggc ttccccatgt tcaaaagagg acgctgtctt 181 tgcataggcc ctggggtaaa agcagtgaaa gtggcagata ttgagaaagc ctccataatg 241 tacccaagta acaactgtga caaaatagaa gtgattatta ccctgaaaga aaataaagga 301 caacgatgcc taaatcccaa atcgaagcaa gcaaggctta taatcaaaaa agttgaaaga 361 aagaattttt aaaaatatca aaacatatga agtcctggaa aagggcatct gaaaaaccta 421 gaacaagttt aactgtgact actgaaatga caagaattct acagtaggaa actgagactt 481 ttctatggtt ttgtgacttt caacttttgt acagttatgt gaaggatgaa aggtgggtga 541 aaggaccaaa aacagaaata cagtcttcct gaatgaatga caatcagaat tccactgccc 601 aaaggagtcc aacaattaaa tggatttcta ggaaaagcta ccttaagaaa ggctggttac 661 catcggagtt tacaaagtgc tttcacgttc ttacttgttg tattatacat tcatgcattt 721 ctaggctaga gaaccttcta gatttgatgc ttacaactat tctgttgtga ctatgagaac 781 atttctgtct ctagaagtta tctgtctgta ttgatcttta tgctatatta ctatctgtgg 841 ttacagtgga gacattgaca ttattactgg agtcaagccc ttataagtca aaagcaccta 901 tgtgtcgtaa agcattcctc aaacatttaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 961 aaaaaaaaaa aaaaaaaaaa aaaaaaagcg gccgc // LOCUS AF002986 1272 bp mRNA PRI 01-NOV-1997 DEFINITION Homo sapiens platelet activating receptor homolog (H963) mRNA, complete cds. ACCESSION AF002986 NID g2580587 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1272) AUTHORS Jacobs,K.A., Collins-Racie,L.A., Colbert,M., Duckett,M., Golden-Fleet,M., Kelleher,K., Kriz,R., LaVallie,E.R., Merberg,D., Spaulding,V., Stover,J., Williamson,M.J. and McCoy,J.M. TITLE A genetic selection for isolating cDNAs encoding secreted proteins JOURNAL Gene 198 (1-2), 289-296 (1997) MEDLINE 98036061 REFERENCE 2 (bases 1 to 1272) AUTHORS Jacobs,K.A., Collins-Racie,L.A., Colbert,M., Duckett,M., Golden-Fleet,M., Kelleher,K., Kriz,R., LaVallie,E.R., Merberg,D., Spaulding,V., Stover,J., Williamson,M.J. and McCoy,J.M. TITLE Direct Submission JOURNAL Submitted (07-MAY-1997) Genetics Institute, 87 Cambridge Park Drive, Cambridge, MA 02140, USA FEATURES Location/Qualifiers source 1..1272 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="PHA and PMA activated human peripheral blood mononuclear cells" gene 1..1272 /gene="H963" CDS 220..1179 /gene="H963" /note="seven transmembrane protein; G-protein coupled receptor" /codon_start=1 /product="platelet activating receptor homolog" /db_xref="PID:g2580588" /translation="MTNSSFFCPVYKDLEPFTYFFYLVFLVGIIGSCFATWAFIQKNT NHRCVSIYLINLLTADFLLTLALPVKIVVDLGVAPWKLKIFHCQVTACLIYINMYLSI IFLAFVSIDRCLQLTHSCKIYRIQEPGFAKMISTVVWLMVLLIMVPNMMIPIKDIKEK SNVGCMEFKKEFGRNWHLLTNFICVAIFLNFSAIILISNCLVIRQLYRNKDNENYPNV KKALINILLVTTGYIICFVPYHIVRIPYTLSQTEVITDCSTRISLFKAKEATLLLAVS NLCFDPILYYHLSKAFRSKVTETFASPKETKAQKEKLRCENNA" BASE COUNT 382 a 265 c 237 g 388 t ORIGIN 1 gaattcggcc aaagaggcct atgcttctct gaagacttgc agcaaggctt gctgaggctc 61 acagaagata gccccagtgt tttggagtgg ttttgaatgt gattctgaga tcagactgac 121 tgagctggaa tcctggcttt atatcttacc agctacacaa ccttggagtc ttagaaattt 181 tttcttttca ataagcagtc atccttactt tccctcaaga tgacaaacag ttcgttcttc 241 tgcccagttt ataaagatct ggagccattc acgtattttt tttatttagt tttccttgtt 301 ggaattattg gaagttgttt tgcaacctgg gcttttatac agaagaatac gaatcacagg 361 tgtgtgagca tctacttaat taatttgctt acagccgatt tcctgcttac tctggcatta 421 ccagtgaaaa ttgttgttga cttgggtgtg gcaccttgga agctgaagat attccactgc 481 caagtaacag cctgcctcat ctatatcaat atgtatttat caattatctt cttagcattt 541 gtcagcattg accgctgtct tcagctgaca cacagctgca agatctaccg aatacaagaa 601 cccggatttg ccaaaatgat atcaaccgtt gtgtggctaa tggtccttct tataatggtg 661 ccaaatatga tgattcccat caaagacatc aaggaaaagt caaatgtggg ttgtatggag 721 tttaaaaagg aatttggaag aaattggcat ttgctgacaa atttcatatg tgtagcaata 781 tttttaaatt tctcagccat cattttaata tccaattgcc ttgtaattcg acagctctac 841 agaaacaaag ataatgaaaa ttacccaaat gtgaaaaagg ctctcatcaa catactttta 901 gtgaccacgg gctacatcat atgctttgtt ccttaccaca ttgtccgaat cccgtatacc 961 ctcagccaga cagaagtcat aactgattgc tcaaccagga tttcactctt caaagccaaa 1021 gaggctacac tgctcctggc tgtgtcgaac ctgtgctttg atcctatcct gtactatcac 1081 ctctcaaaag cattccgctc aaaggtcact gagacttttg cctcacctaa agagaccaag 1141 gctcagaaag aaaaattaag atgtgaaaat aatgcataaa agacaggatt ttttgtgcta 1201 ccaattctgg ccttactgga ccataaagtt aattatagct ttgaaagata aaaaaaaaaa 1261 aaaagcggcc gc // LOCUS AF002999 2907 bp mRNA PRI 16-OCT-1997 DEFINITION Homo sapiens TTAGGG repeat binding factor 2 (hTRF2) mRNA, complete cds. ACCESSION AF002999 NID g2529439 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2907) AUTHORS Broccoli,D., Smogorzewska,A., Chong,L. and de Lange,T. TITLE Human telomeres contain two distinct Myb-related proteins, TRF1 and TRF2 JOURNAL Nature Genet. 17 (2), 231-235 (1997) MEDLINE 97467741 REFERENCE 2 (bases 1 to 2907) AUTHORS Broccoli,D., Smogorzewska,A., Chong,L. and de Lange,T. TITLE Direct Submission JOURNAL Submitted (07-MAY-1997) Laboratory for Cell Biology and Genetics, Rockefeller University, 1230 York Avenue, NY 10021, USA FEATURES Location/Qualifiers source 1..2907 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..2907 /gene="hTRF2" CDS 125..1627 /gene="hTRF2" /note="telomeric protein" /codon_start=1 /product="TTAGGG repeat binding factor 2" /db_xref="PID:g2529440" /translation="MAGGGGSSDGSGRAAGRRASRSSGRARRGRHEPGLGGPAERGAG EARLEEAVNRWVLKFYFHEALRAFRGSRYGDFRQIRDIMQALLVRPLGKEHTVSRLLR VMQCLSRIEEGENLDCSFDMEAELTPLESAINVLEMIKTEFTLTEAVVESSRKLVKEA AVIICIKNKEFEKASKILKKHMSKDPTTQKLRNDLLNIIREKNLAHPVIQNFSYETFQ QKMLRFLESHLDDAEPYLLTMAKKALKSESAASSTGKEDKQPAPGPVEKPPREPARQL RNPPTTIGMMTLKAAFKTLSGAQDSEAAFAKLDQKDLVLPTQALPASPALKNKRPRKD ENESSAPADGEGGSELQPKNKRMTISRLVLEEDSQSTEPSAGLNSSQEAASAPPSKPT VLNQPLPGEKNPKVPKGKWNSSNGVEEKETWVEEDELFQVQAAPDEDSTTNITKKQKW TVEESEWVKAGVQKYGEGNWAAISKNYPFVNRTAVMIKDRWRTMKRLGMN" BASE COUNT 735 a 717 c 758 g 692 t 5 others ORIGIN 1 ggaattcggc acgagggacg gcgggccccg cttccggccc gggcgtcgtg cgtgacccag 61 cggcgtcaca gccgaggaag cggcccggcc gggagggcgg ggaggcgcgc ggcgatcgga 121 cacgatggcg ggaggaggcg ggagtagcga cggcagcggg cgggcagctg gcaggcgggc 181 gtcccgcagt agcgggcggg cccggcgggg gcgccacgag ccggggctgg ggggcccggc 241 ggagcgcggc gcgggggagg cacggctgga agaggcagtc aatcgctggg tgctcaagtt 301 ctacttccac gaggcgctgc gggcctttcg gggtagccgg tacggggact tcagacagat 361 ccgggacatc atgcaggctt tgcttgtcag gcccttgggg aaggagcaca ccgtgtcccg 421 attgctgcgg gttatgcagt gtctgtcgcg gattgaagaa ggggaaaatt tagactgttc 481 ctttgatatg gaggctgagc tcacaccact ggaatcagct atcaatgtgc tggagatgat 541 taaaacggaa tttacactga cagaagcagt ggtcgaatcc agtagaaaac tggtcaagga 601 agctgctgtc attatttgta tcaaaaacaa agaatttgaa aaggcttcaa aaattttgaa 661 aaaacatatg tccaaggacc ccacaactca gaagctgaga aatgatctcc tgaatattat 721 tcgagaaaag aacttggccc atcctgttat ccagaacttt tcatatgaga ccttccagca 781 gaagatgctg cgcttcctgg agagccacct ggatgacgcc gagccctacc tcctcacgat 841 ggccaaaaag gctttgaaat ctgagtccgc tgcctcaagt acagggaagg aagataaaca 901 gccagcacca gggcctgtgg aaaagccacc cagagaaccc gcaaggcagc tacggaatcc 961 tccaaccacc attggaatga tgactctgaa agcagctttc aagactctgt ctggtgcaca 1021 ggattctgag gcagcctttg caaaactgga ccagaaggat ctggttcttc ctactcaagc 1081 tctcccagca tcaccagccc tcaaaaacaa gagacccaga aaagatgaaa acgaaagttc 1141 agccccggct gacggtgagg gtggctcgga actgcagccc aagaacaagc gcatgacaat 1201 aagcagattg gtcttggagg aggacagcca gagtactgag cccagcgcag gcctcaactc 1261 ctcccaggag gccgcttcag cgccaccatc caagcccacc gttctcaacc aacccctccc 1321 tggagagaag aatcccaaag tacccaaagg caagtggaac agctctaatg gggttgaaga 1381 aaaggagact tgggtggaag aggatgaact gtttcaagtt caggcagcac cagatgaaga 1441 cagtacaacc aatataacaa aaaagcagaa gtggactgta gaagaaagcg agtgggtcaa 1501 ggctggagtg cagaaatatg gggaaggaaa ctgggctgcc atttctaaaa attacccatt 1561 tgttaaccga acagctgtga tgattaagga tcgctggcgg accatgaaaa gacttggcat 1621 gaactgaaac aggctttcat ttccacagaa ttcacaggag catggttcct aataatagcc 1681 cctgatagtc tgctctttct ttctttttct tttttttttt tttttgagac agagtctcgc 1741 tctgtcaccc aggctggagt gcagtggcgt gatctcggct cactgcgacc tccgtctccc 1801 gggctcacgc cattctcctg cctcagcctc cgagtagctg ggactacagg cgcccgccat 1861 cacgcccggc taatgttttg tatttttagt aaanacgggg tttcaccgtg ttggccagga 1921 tggtctcgat ctcctgacct cgtgatccac ccaactcggc ctcccaaagt gctgggatta 1981 caggcatgan ccaccgcgcc tggcatctgc tgtttctttc agaagctggg ctgggatgag 2041 aattttgggc aacctccttc gacgtggggg aggtcccatt tccacttcat cactgttgga 2101 gatcatggag ctaagaagca gagccaagtc cacccatgtc cttggcagag atgacggcac 2161 acagcttgtg cagtgccaga atatcattag cgtttccctt ctttagtggt ttgcttaaat 2221 ttaaatccct ggtaatctgt agaaccttct cctaggaaat ggtgaagtct attaggagcc 2281 acttgtgact ccatgacctg ttaaaaccag caatgtgagt attatttgga gtaaatttgt 2341 tccacgtcaa gttctggcct tctgatgcaa atgcaaagga acttagtntg ttatgaaccc 2401 aggttgatga cagaccagtc cttgtggaat aagattccct ttaaaaactc tttagccagt 2461 cgtgacatca accctagacc tgtctgcctt ggcatttgct gtcaanatnt gctgggctat 2521 gtaggcaggt taatcctcca cttctcatgt ggttgaacca gtgtgttttt tggtaaaatg 2581 gtgattgtag ataagattag ttccctgatc ccctgccccc tgtcccctgc ctcttttccc 2641 aattcccttc cttatgctgg acttttaaag cttaaaaaaa atccgattga atataaatgc 2701 ctaatttcat tcttttgtga aatggttgct tcctcctgat tccctaattg tgctgtgttc 2761 gtgtcttgca ctggaattca acattccctt ctccttttgt actgtgttgt gcttgctgtc 2821 tctcccggac acccttaaag actgtctttt tagcaaaaaa tttcagtaaa gtgttttctg 2881 taatcttttt ttaaaaaaaa aaaaaaa // LOCUS AF003001 2626 bp mRNA PRI 16-OCT-1997 DEFINITION Homo sapiens TTAGGG repeat binding factor 1 (hTRF1-AS) mRNA, alternatively spliced, complete cds. ACCESSION AF003001 NID g2529443 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2626) AUTHORS Broccoli,D., Smogorzewska,A., Chong,L. and de Lange,T. TITLE Human telomeres contain two distinct Myb-related proteins, TRF1 and TRF2 JOURNAL Nature Genet. 17 (2), 231-235 (1997) MEDLINE 97467741 REFERENCE 2 (bases 1 to 2626) AUTHORS Broccoli,D., Smogorzewska,A., Chong,L. and de Lange,T. TITLE Direct Submission JOURNAL Submitted (07-MAY-1997) Laboratory for Cell Biology and Genetics, Rockefeller University, 1230 York Avenue, NY 10021, USA FEATURES Location/Qualifiers source 1..2626 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..2626 /gene="hTRF1-AS" CDS 16..1275 /gene="hTRF1-AS" /note="alternatively spliced; telomeric protein" /codon_start=1 /product="TTAGGG repeat binding factor 1" /db_xref="PID:g2529444" /translation="MAEDVSSAAPSPRRCADGRDADPTEEQMAETERNDEEQFECQEL LECQVQVGAPEEEEEEEEDAGLVAEAEAVAAGWMLDFLCLSLCRAFRDGRSEDFRRTR NSAEAIIHGLSSLTACQLRTIYICQFLTRIAAGKTLDAQFENDERITPLESALMIWGS IEKEHDKLHEEIQNLIKIQAIAVCMENGNFKEAEEVFERIFGDPNSHMPFKSKLLMII SQKDTFHSFFQHFSYNHMMEKIKSYVNYVLSEKSSTFLMKAAAKVVESKRTRTITSQD KPSGNDVEMETEANLDTRKRSHKNLFLSKLQHGTQQQDLNKKERRVGTPQSTKKKKES RRATESRIPVSKSQPVTPEKHRARKRQAWLWEEDKNLRSGVRKYGEGNWSKILLHYKF NNRTSVMLKDRWRTMKKLKLISSDSED" BASE COUNT 870 a 498 c 552 g 706 t ORIGIN 1 atcgagccat ttaacatggc ggaggatgtt tcctcagcgg ccccgagccc gcggcggtgt 61 gcggatggta gggatgccga ccctactgag gagcagatgg cagaaacaga gagaaacgac 121 gaggagcagt tcgaatgcca ggaactgctc gagtgccagg tgcaggtggg ggcccccgag 181 gaggaggagg aggaggagga ggacgcgggc ctggtggccg aggccgaggc cgtggctgcc 241 ggctggatgc tcgatttcct ctgcctctct ctttgccgag ctttccgcga cggccgctcc 301 gaggacttcc gcaggacccg caacagcgca gaggctatta ttcatggact atccagtcta 361 acagcttgcc agttgagaac gatatacata tgtcagtttt tgacaagaat tgcagcagga 421 aaaacccttg atgcacagtt tgaaaatgat gaacgaatta cacccttgga atcagccctg 481 atgatttggg gttcaattga aaaggaacat gacaaacttc atgaagaaat acagaattta 541 attaaaattc aggctatagc tgtttgtatg gaaaatggca actttaaaga agcagaagaa 601 gtctttgaaa gaatatttgg tgatccaaat tctcatatgc ctttcaaaag caaattgctt 661 atgataatct ctcagaaaga tacatttcat tccttttttc aacacttcag ctacaaccac 721 atgatggaga aaattaagag ttatgtgaat tatgtgctaa gtgaaaaatc atcaaccttt 781 ctaatgaagg cagcggcaaa agtagtagaa agcaaaagga caagaacaat aacttctcaa 841 gataaaccta gtggtaatga tgttgaaatg gaaactgaag ctaatttgga tacaagaaaa 901 aggtctcaca agaatctttt cttatctaag ttgcaacatg gaacccagca acaagacctt 961 aataagaaag aaagaagagt aggaactcct caaagtacaa aaaagaaaaa agaaagcaga 1021 agagccactg aaagcagaat acctgtttca aagagtcagc cggtaactcc tgaaaaacat 1081 cgagctagaa aaagacaggc atggctttgg gaagaagaca agaatttgag atctggcgtg 1141 aggaaatatg gagagggaaa ctggtctaaa atactgttgc attataaatt caacaaccgg 1201 acaagtgtca tgttaaaaga cagatggagg accatgaaga aactaaaact gatttcctca 1261 gacagcgaag actgattgtg tttgtaaaag cttgatgaaa ggacagttaa gtattttgat 1321 cactgcattt tgtttgaaac ttgtgtcatt gatgtaattt aaaacttttg tttaaagcat 1381 tacagtattt ttctgtgacc atcaattaat gagggtttgt gctaccagag ttaaagcata 1441 tgctatcatt gtattcttta agaaccttat tttgataaaa tgtaaatttg ttgaaccctg 1501 ccacatttag tatccccacc cccaaatcct gttccaatga aaaaattaaa acctgatacg 1561 aaaaaaaaaa aattccagtt aacctatttt gtgtctgtag gctgacctca accctgtaac 1621 gtaacccatt aaaatgaatt tctttttttt taagacagag tttctctctg ttgcccaggc 1681 tggagtgcag tggcgcaatt tcagctcact gaacctctgc ctcccaggtc aagtgattct 1741 cctgcctcag cctctgagta gctgggatta caggcacaca ccaccagcca gctaattttt 1801 gtatttttag tagaggcggg gtttcaccat gctggtcagg atgttctcca actcctgact 1861 tcatgatcca cccacctcgg cctcccaaag tgctgagatt acagacgtga gccactgcgt 1921 cctgcctaaa atgaattttc tagatgattg aataacagta gtagtccttt gatagaagat 1981 aatgacttgg tttatggcct taatatacta cttaattact taagatgttt attaatagaa 2041 tgataaatgt acagagtaac ctataagcat gacatacttt tgctttcagt agtttcatgt 2101 aaagaaaaaa acttgaaaat agtaatacct gagtacccat gggaataata gacactgggg 2161 aggtagggtg gggagcggga cgaagagctg aaaaacttac ctactgggga ctgtgctcac 2221 tacctgggtg acaggatcat acgtacccca aacctcaaca tcacacagta tactcagcta 2281 acaaacctgc ccatgtgttt cctgaatcta aaataaaaat cgaaataatt tttttaaaaa 2341 agaaaaagac aatagtatta cccatgggac aaaatttgta ctattagcaa gaatcatttt 2401 gtgtctcatt tagaaacaat ttgacttttg ttccagtgtt taaactttga caaaaatggt 2461 tttgaataga tctttataac ctgatgccta aatacaagat tctctgatac cttcatttaa 2521 tatatcaata ttggcctaaa acgtattctg taaagcttaa attggtatta actatgatca 2581 tcttgatgtc tatgatagat aataaacaag gtcatacata ccttaa // LOCUS AF003341 1506 bp mRNA PRI 02-AUG-1997 DEFINITION Homo sapiens aldehyde dehydrogenase 1 (ALDH1) mRNA, complete cds. ACCESSION AF003341 NID g2183298 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1506) AUTHORS Zheng,C.F., Wang,T.T. and Weiner,H. TITLE Cloning and expression of the full-length cDNAS encoding human liver class 1 and class 2 aldehyde dehydrogenase JOURNAL Alcohol. Clin. Exp. Res. 17 (4), 828-831 (1993) MEDLINE 94027752 REFERENCE 2 (bases 1 to 1506) AUTHORS Kathmann,E.C. and Lipsky,J.J. TITLE Cloning of a cDNA encoding a constitutively expressed rat liver cytosolic aldehyde dehydrogenase JOURNAL Biochem. Biophys. Res. Commun. 236 (2), 527-531 (1997) MEDLINE 97382470 REFERENCE 3 (bases 1 to 1506) AUTHORS Kathmann,E.C., Lipsky,J.J. and Weiner,H. TITLE Direct Submission JOURNAL Submitted (08-MAY-1997) Pharmacology, Mayo Foundation, 200 First Street S.W., Rochester, MN 55905, USA FEATURES Location/Qualifiers source 1..1506 /organism="Homo sapiens" /db_xref="taxon:9606" /map="9q21" /chromosome="9" /tissue_type="liver" gene 1..1506 /gene="ALDH1" CDS 1..1506 /gene="ALDH1" /EC_number="1.2.1.3" /note="cytosolic protein; class 1" /codon_start=1 /product="aldehyde dehydrogenase 1" /db_xref="PID:g2183299" /translation="MSSSGTPDLPVLLTDLKIQYTKIFINNEWHDSVSGKKFPVFNPA TEEELCQVEEGDKEDVDKAVKAARQAFQIGSPWRTMDASERGRLLYKLADLIERDRLL LATMESMNGGKLYSNAYLSDLAGCIKTLRYCAGWADKIQGRTIPIDGNFFTYTRHEPI GVCGQIIPWNFPLVMLIWKIGPALSCGNTVVVKPAEQTPLTALHVASLIKEAGFPPGV VNIVPGYGPTAGAAISSHMDIDKVAFTGSTEVGKLIKEAAGKSNLKRVTLELGGKSPC IVLADADLDNAVEFAHHGVFYHQGQCCIAASRIFVEESIYDEFVRRSVERAKKYILGN PLTPGVTQGPQIDKEQYDKILDLIESGKKEGAKLECGGGPWGNKGYFVQPTVFSNVTD EMRIAKEEIFGPVQQIMKFKSLDDVIKRANNTFYGLSAGVFTKDIDKAITISSALQAG TVWVNCYGVVSAQCPFGGFKMSGNGRELGEYGFHEYTEVKTVTVKISQKNS" BASE COUNT 441 a 293 c 391 g 381 t ORIGIN 1 atgtcatcct caggcacgcc agacttacct gtcctactca ccgatttgaa gattcaatat 61 actaagatct tcataaacaa tgaatggcat gattcagtga gtggcaagaa atttcctgtc 121 tttaatcctg caactgagga ggagctctgc caggtagaag aaggagataa ggaggatgtt 181 gacaaggcag tgaaggccgc aagacaggct tttcagattg gatctccgtg gcgtactatg 241 gatgcttccg agagggggcg actattatac aagttggctg atttaatcga aagagatcgt 301 ctgctgctgg cgacaatgga gtcaatgaat ggtggaaaac tctattccaa tgcatatctg 361 agtgatttag caggctgcat caaaacattg cgctactgtg caggttgggc tgacaagatc 421 cagggccgta caataccaat tgatggaaat ttttttacat atacaagaca tgaacctatt 481 ggtgtatgtg gccaaatcat tccttggaat ttcccgttgg ttatgctcat ttggaagata 541 gggcctgcac tgagctgtgg aaacacagtg gttgtcaaac cagcagagca aactcctctc 601 actgctctcc acgtggcatc tttaataaaa gaggcagggt ttcctcctgg agtagtgaat 661 attgttcctg gttatgggcc tacagcaggg gcagccattt cttctcacat ggatatagac 721 aaagtagcct tcacaggatc aacagaggtt ggcaagttga tcaaagaagc tgccgggaaa 781 agcaatctga agagggtgac cctggagctt ggaggaaaga gcccttgcat tgtgttagct 841 gatgccgact tggacaatgc tgttgaattt gcacaccatg gggtattcta ccaccagggc 901 cagtgttgta tagccgcatc caggattttt gtggaagaat caatttatga tgagtttgtt 961 cgaaggagtg ttgagcgggc taagaagtat atccttggaa atcctctgac cccaggagtc 1021 actcaaggcc ctcagattga caaggaacaa tatgataaaa tacttgacct cattgagagt 1081 gggaagaaag aaggggccaa actggaatgt ggaggaggcc cgtgggggaa taaaggctac 1141 tttgtccagc ccacagtgtt ctctaatgtt acagatgaga tgcgcattgc caaagaggag 1201 atttttggac cagtgcagca aatcatgaag tttaaatctt tagatgacgt gatcaaaaga 1261 gcaaacaata ctttctatgg cttatcagca ggagtgttta ccaaagacat tgataaagcc 1321 ataacaatct cctctgctct gcaggcagga acagtgtggg tgaattgcta tggcgtggta 1381 agtgcccagt gcccctttgg cggattcaag atgtctggaa atggaagaga actgggagag 1441 tacggtttcc atgaatatac agaggtcaaa acagtcacag tgaaaatctc tcagaagaac 1501 tcataa // LOCUS AF003521 4702 bp mRNA PRI 17-JUN-1997 DEFINITION Homo sapiens Jagged 2 mRNA, complete cds. ACCESSION AF003521 NID g2197066 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4702) AUTHORS Mann,R.S., Gray,G.E., Henrique,D., Ish-Horowicz,D. and Artavanis-Tsakonas,S. TITLE Human Jagged 2 JOURNAL Unpublished REFERENCE 2 (bases 1 to 4702) AUTHORS Mann,R.S., Gray,G.E., Henrique,D., Ish-Horowicz,D. and Artavanis-Tsakonas,S. TITLE Direct Submission JOURNAL Submitted (09-MAY-1997) Howard Hughes Medical Institute, Yale University, 295 Congress Ave, New Haven, CT 06510, USA FEATURES Location/Qualifiers source 1..4702 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 33..3749 /note="ligand for Notch receptor; Contains EGF repeats and DSL domain" /codon_start=1 /product="Jagged 2" /db_xref="PID:g2197067" /translation="MRAQGRGRLPRRLLLLLALWVQAARPMGYFELQLSALRNVNGEL LSGACCDGDGRTTRAGGCGHDECDTYVRVCLKEYQAKVTPTGPCSYGHGATPVLGGNS FYLPPAGAAGDRARARARAGGDQDPGLVVIPFQFAWPRSFTLIVEAWDWDNDTTPNEE LLIERVSHAGMINPEDRWKSLHFSGHVAHLELQIRVRCDENYYSATCNKFCRPRNDFF GHYTCDQYGNKACMDGWMGKECKEAVCKQGCNLLHGGCTVPGECRCSYGWQGRFCDEC VPYPGCVHGSCVEPWQCNCETNWGGLLCDKDLNYCGSHHPCTNGGTCINAEPDQYRCT CPDGYSGRNCEKAEHACTSNPCANGGSCHEVPSGFECHCPSGWSGPTCALDIDECASN PCAAGGTCVDQVDGFECICPEQWVGATCQLDANECEGKPCLNAFSCKNLIGGYYCDCI PGWKGINCHINVNDCRGQCQHGGTCKDLVNGYQCVCPRGFGGRHCELERDKCASSPCH SGGLCEDLADGFHCHCPQGFSGPLCEVDVDLCEPSPCRNGARCYNLEGDYYCACPDDF GGKNCSVPREPCPGGACRVIDGCGSDAGPGMPGTAASGVCGPHGRCVSQPGGNFSCIC DSGFTGTYCHENIDDCLGQPCRNGGTCIDEVDAFRCFCPSGWEGELCDTNPNDCLPDP CHSRGRCYDLVNDFYCACDDGWKGKTCHSREFQCDAYTCSNGGTCYDSGDTFRCACPP GWKGSTCAVAKNSSCLPNPCVNGGTCVGSGASFSCICRDGWEGRTCTHNTNDCNPLPC YNGGICVDGVNWFRCECAPGFAGPDCRINIDECQSSPCAYGATCVDEINGYRCSCPPG RAGPRCQEVIGFGRSCWSRGTPFPHGSSWVEDCNSCRCLDGRRDCSKVWCGWKPCLLA GQPEALSAQCPLGQRCLEKAPGQCLRPPCEAWGECGAEEPPSTPCLPRSGHLDNNCAR LTLHFNRDHVPQGTTVGAICSGIRSLPATRAVARDRLLVLLCDRASSGASAVEVAVSF SPARDLPDSSLIQGAAHAIVAAITQRGNSSLLLAVTEVKVETVVTGGSSTGLLVPVLC GAFSVLWLACVVLCVWWTRKRRKERERSRLPREESANNQWAPLNPIRNPIERPGGHKD VLYQCKNFTPPPRRADEALPGPAGHAAVREDEEDEDLGRGEEDSLEAEKFLSHKFTKD PGRSPGRPAHWASGPKVDNRAVRSINEARYVGKE" BASE COUNT 805 a 1472 c 1557 g 868 t ORIGIN 1 ggccggggcc gggcgggcgg gtcgcggggg caatgcgggc gcagggccgg gggcgccttc 61 cccggcggct gctgctgctg ctggcgctct gggtgcaggc ggcgcggccc atgggctatt 121 tcgagctgca gctgagcgcg ctgcggaacg tgaacgggga gctgctgagc ggcgcctgct 181 gtgacggcga cggccggaca acgcgcgcgg ggggctgcgg ccacgacgag tgcgacacgt 241 acgtgcgcgt gtgccttaag gagtaccagg ccaaggtgac gcccacgggg ccctgcagct 301 acggccacgg cgccacgccc gtgctgggcg gcaactcctt ctacctgccg ccggcgggcg 361 ctgcggggga ccgagcgcgg gcgcgggccc gggccggcgg cgaccaggac ccgggcctcg 421 tcgtcatccc cttccagttc gcctggccgc gctcctttac cctcatcgtg gaggcctggg 481 actgggacaa cgataccacc ccgaatgagg agctgctgat cgagcgagtg tcgcatgccg 541 gcatgatcaa cccggaggac cgctggaaga gcctgcactt cagcggccac gtggcgcacc 601 tggagctgca gatccgcgtg cgctgcgacg agaactacta cagcgccact tgcaacaagt 661 tctgccggcc ccgcaatgac tttttcggcc actacacctg cgaccagtac ggcaacaagg 721 cctgcatgga cggctggatg ggcaaggagt gcaaggaagc tgtgtgtaaa caagggtgta 781 atttgctcca cgggggatgc accgtgcctg gggagtgcag gtgcagctac ggctggcaag 841 ggaggttctg cgatgagtgt gtcccctacc ccggctgcgt gcatggcagt tgtgtggagc 901 cctggcagtg caactgtgag accaactggg gcggcctgct ctgtgacaaa gacctgaact 961 actgtggcag ccaccacccc tgcaccaacg gaggcacgtg catcaacgcc gagcctgacc 1021 agtaccgctg cacctgccct gacggctact cgggcaggaa ctgtgagaag gctgagcacg 1081 cctgcacctc caacccgtgt gccaacgggg gctcttgcca tgaggtgccg tccggcttcg 1141 aatgccactg cccatcgggc tggagcgggc ccacctgtgc ccttgacatc gatgagtgtg 1201 cttcgaaccc gtgtgcggcc ggtggcacct gtgtggacca ggtggacggc tttgagtgca 1261 tctgccccga gcagtgggtg ggggccacct gccagctgga cgccaatgag tgtgaaggga 1321 agccatgcct taacgctttt tcttgcaaaa acctgattgg cggctattac tgtgattgca 1381 tcccgggctg gaagggcatc aactgccata tcaacgtcaa cgactgtcgc gggcagtgtc 1441 agcatggggg cacctgcaag gacctggtga acgggtacca gtgtgtgtgc ccacggggct 1501 tcggaggccg gcattgcgag ctggaacgag acaagtgtgc cagcagcccc tgccacagcg 1561 gcggcctctg cgaggacctg gccgacggct tccactgcca ctgcccccag ggcttctccg 1621 ggcctctctg tgaggtggat gtcgaccttt gtgagccaag cccctgccgg aacggcgctc 1681 gctgctataa cctggagggt gactattact gcgcctgccc tgatgacttt ggtggcaaga 1741 actgctccgt gccccgcgag ccgtgccctg gcggggcctg cagagtgatc gatggctgcg 1801 ggtcagacgc ggggcctggg atgcctggca cagcagcctc cggcgtgtgt ggcccccatg 1861 gacgctgcgt cagccagcca gggggcaact tttcctgcat ctgtgacagt ggctttactg 1921 gcacctactg ccatgagaac attgacgact gcctgggcca gccctgccgc aatgggggca 1981 catgcatcga tgaggtggac gccttccgct gcttctgccc cagcggttgg gagggcgagc 2041 tctgcgacac caatcccaac gactgccttc ccgatccctg ccacagccgc ggccgctgct 2101 acgacctggt caatgacttc tactgtgcgt gcgacgacgg ctggaagggc aagacctgcc 2161 actcacgcga gttccagtgc gatgcctaca cctgcagcaa cggtggcacc tgctacgaca 2221 gcggcgacac cttccgctgc gcctgccccc ccggctggaa gggcagcacc tgcgccgtcg 2281 ccaagaacag cagctgcctg cccaacccct gtgtgaatgg tggcacctgc gtgggcagcg 2341 gggcctcctt ctcctgcatc tgccgggacg gctgggaggg tcgtacttgc actcacaata 2401 ccaacgactg caaccctctg ccttgctaca atggtggcat ctgtgttgac ggcgtcaact 2461 ggttccgctg cgagtgtgca cctggcttcg cggggcctga ctgccgcatc aacatcgacg 2521 agtgccagtc ctcgccctgt gcctacgggg ccacgtgtgt ggatgagatc aacgggtatc 2581 gctgtagctg cccacccggc cgagccggcc cccggtgcca ggaagtgatc gggttcggga 2641 gatcctgctg gtcccggggc actccgttcc cacacggaag ctcctgggtg gaagactgca 2701 acagctgccg ctgcctggat ggccgccgtg actgcagcaa ggtgtggtgc ggatggaagc 2761 cttgtctgct ggccggccag cccgaggccc tgagcgccca gtgcccactg gggcaaaggt 2821 gcctggagaa ggccccaggc cagtgtctgc gaccaccctg tgaggcctgg ggggagtgcg 2881 gcgcagaaga gccaccgagc accccctgcc tgccacgctc cggccacctg gacaataact 2941 gtgcccgcct caccttgcat ttcaaccgtg accacgtgcc ccagggcacc acggtgggcg 3001 ccatttgctc cgggatccgc tccctgccag ccacaagggc tgtggcacgg gaccgcctgc 3061 tggtgttgct ttgcgaccgg gcgtcctcgg gggccagtgc tgtggaggtg gccgtgtcct 3121 tcagccctgc cagggacctg cctgacagca gcctgatcca gggcgcggcc cacgccatcg 3181 tggccgccat cacccagcgg gggaacagct cactgctcct ggctgtcacc gaggtcaagg 3241 tggagacggt tgttacgggc ggctcttcca caggtctgct ggtgcctgtg ctgtgtggtg 3301 ccttcagcgt gctgtggctg gcgtgcgtgg tcctgtgcgt gtggtggaca cgcaagcgca 3361 ggaaagagcg ggagaggagc cggctgccgc gggaggagag cgccaacaac cagtgggccc 3421 cgctcaaccc catccgcaac cccattgagc ggccgggggg gcacaaggac gtgctctacc 3481 agtgcaagaa cttcacgccg ccgccgcgca gggcggacga ggcgctgccc gggccggccg 3541 gccacgcggc cgtcagggag gatgaggagg acgaggatct tggccgcggt gaggaggact 3601 ccctggaggc ggagaagttc ctctcacaca aattcaccaa agatcctggc cgctcgccgg 3661 ggaggccggc ccactgggcc tcaggcccca aagtggacaa ccgcgcggtc aggagcatca 3721 atgaggcccg ctacgtcggc aaggaatagg gcggctgcag ctgggccggg acccagggcc 3781 ctcggtggga gccatgccgt ctgccggacc cggaggccga ggccatgtgc atagtttctt 3841 tattttgtgt aaaaaaacca ccaaaaacaa aaaccaaatg tttattttct acgtttcttt 3901 aaccttgtat aaattattca gtaactgtca ggctgaaaac aatggagtat tctcggatag 3961 ttgctatttt tgtaaagtag ccgtgcgtgg cactcgctgt atgaaaggag agagcaaagg 4021 gtgtctgcgt cgtcaccaaa tcgtcgcgtt tgttaccaga ggttgtgcac tgtttacaga 4081 atcttccttt tattcctcac tcgggtttct ctgtgctcca ggccaaagtg ccggtgagac 4141 ccatggctgt gttggtgtgg cccatggctg ttggtgggac ccgtggctga tggtgtggcc 4201 tgtggctgtc ggtgggactc gtggctgtca atgggacctg tggctgtcgg tgggacctac 4261 ggtggtcggt gggaccctgg ttattgatgt ggccctggct gccggcacgg cccgtggctg 4321 ttgacgcacc tgtggttgtt agtggggcct gaggtcatcg gcgtggccca aggccggcag 4381 gtcaacctcg cgcttgctgg ccagtccacc ctgcctgccg tctgtgcttc ctcctgccca 4441 gaacgcccgc tccagcgatc tctccactgt gctttcagaa gtgcccttcc tgctgcgcag 4501 ttctcccatc ctgggacggc ggcagtattg aagctcgtga caagtgcctt cacacagacc 4561 cctcgcaact gtccacgcgt gccgtggcac caggcgctgc ccacctgccg gccccggccg 4621 cccctcctcg tgaaagtgca tttttgtaaa tgtgtacata ttaaaggaag cactctgtat 4681 aaaaaaaaaa aaccggaatt cc // LOCUS AF003522 3162 bp mRNA PRI 17-JUN-1997 DEFINITION Homo sapiens Delta mRNA, complete cds. ACCESSION AF003522 NID g2197068 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3162) AUTHORS Mann,R.S., Gray,G.E., Henrique,D., Ish-Horowicz,D. and Artavanis-Tsakonas,S. TITLE Human Delta JOURNAL Unpublished REFERENCE 2 (bases 1 to 3162) AUTHORS Mann,R.S., Gray,G.E., Henrique,D., Ish-Horowicz,D. and Artavanis-Tsakonas,S. TITLE Direct Submission JOURNAL Submitted (09-MAY-1997) Howard Hughes Medical Institute, Yale University, 295 Congress Ave, New Haven, CT 06510, USA FEATURES Location/Qualifiers source 1..3162 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 323..2494 /note="ligand for Notch receptor" /codon_start=1 /product="Delta" /db_xref="PID:g2197069" /translation="MGSRCALALAVLSALLCQVWSSGVFELKLQEFVNKKGLLGNRNC CRGGAGPPPCACRTFFRVCLKHYQASVSPEPPCTYGSAVTPVLGVDSFSLPDGGGADS AFSNPIRFPFGFTWPGTFSLIIEALHTDSPDDLATENPERLISRLATQRHLTVGEEWS QDLHSSGRTDLKYSYRFVCDEHYYGEGCSVFCRPRDDAFGHFTCGERGEKVCNPGWKG PYCTEPICLPGCDEQHGFCDKPGECKCRVGWQGRYCDECIRYPGCLHGTCQQPWQCNC QEGWGGLFCNQDLNYCTHHKPCKNGATCTNTGQGSYTCSCRPGYTGATCELGIDECDP SPCKNGGSCTDLENSYSCTCPPGFYGKICELSAMTCADGPCFNGGRCSDSPDGGYSCR CPVGYSGFNCEKKIDYCSSSPCSNGAKCVDLGDAYLCRCQAGFSGRHCDDNVDDCASS PCANGGTCRDGVNDFSCTCPPGYTGRNCSAPVSRCEHAPCHNGATCHERGHGYVCECA RGYGGPNCQFLLPELPPGPAVVDLTEKLEGQGGPFPWVAVCAGVILVLMLLLGCAAVV VCVRLRLQKHRPPADPCRGETETMNNLANCQREKDISVSIIGATQIKNTNKKADFHGD HSADKNGFKARYPAVDYNLVQDLKGDDTAVRDAHSKRDTKCQPQGSSGEEKGTPTTLR GGEASERKRPDSGCSTSKDTKYQSVYVISEEKDECVIATEV" BASE COUNT 705 a 873 c 902 g 682 t ORIGIN 1 gaattccatt ttaagttata caaaactgat taccataagt gcggtcgact gcttttattt 61 ttacgttgtg tgtgttggaa aaatgctaaa acatcagtct acaattctat atattgttat 121 taaagattaa tccaaccagc aacccaagga catataagcg atttccacta ttgcatcaga 181 gcactcggca ggaaaggcct agccacgggg aacattagaa gctacagaag cattgcagag 241 aagagaagat cccccgcgcg tccgccgctg ttctaaggag agaagtgggg gccccccagg 301 ctcgcgcgtg gagcgaagca gcatgggcag tcggtgcgcg ctggccctgg cggtgctctc 361 ggccttgctg tgtcaggtct ggagctctgg ggtgttcgaa ctgaagctgc aggagttcgt 421 caacaagaag gggctgctgg ggaaccgcaa ctgctgccgc gggggcgcgg ggccaccgcc 481 gtgcgcctgc cggaccttct tccgcgtgtg cctcaagcac taccaggcca gcgtgtcccc 541 cgagccgccc tgcacctacg gcagcgccgt cacccccgtg ctgggcgtcg actccttcag 601 tctgcccgac ggcgggggcg ccgactccgc gttcagcaac cccatccgct tccccttcgg 661 cttcacctgg ccgggcacct tctctctgat tattgaagct ctccacacag attctcctga 721 tgacctcgca acagaaaacc cagaaagact catcagccgc ctggccaccc agaggcacct 781 gacggtgggc gaggagtggt cccaggacct gcacagcagc ggccgcacgg acctcaagta 841 ctcctaccgc ttcgtgtgtg acgaacacta ctacggagag ggctgctccg ttttctgccg 901 tccccgggac gatgccttcg gccacttcac ctgtggggag cgtggggaga aagtgtgcaa 961 ccctggctgg aaagggccct actgcacaga gccgatctgc ctgcctggat gtgatgagca 1021 gcatggattt tgtgacaaac caggggaatg caagtgcaga gtgggctggc agggccggta 1081 ctgtgacgag tgtatccgct atccaggctg tctccatggc acctgccagc agccctggca 1141 gtgcaactgc caggaaggct gggggggcct tttctgcaac caggacctga actactgcac 1201 acaccataag ccctgcaaga atggagccac ctgcaccaac acgggccagg ggagctacac 1261 ttgctcttgc cggcctgggt acacaggtgc cacctgcgag ctggggattg acgagtgtga 1321 ccccagccct tgtaagaacg gagggagctg cacggatctc gagaacagct actcctgtac 1381 ctgcccaccc ggcttctacg gcaaaatctg tgaattgagt gccatgacct gtgcggacgg 1441 cccttgcttt aacgggggtc ggtgctcaga cagccccgat ggagggtaca gctgccgctg 1501 ccccgtgggc tactccggct tcaactgtga gaagaaaatt gactactgca gctcttcacc 1561 ctgttctaat ggtgccaagt gtgtggacct cggtgatgcc tacctgtgcc gctgccaggc 1621 cggcttctcg gggaggcact gtgacgacaa cgtggacgac tgcgcctcct ccccgtgcgc 1681 caacgggggc acctgccggg atggcgtgaa cgacttctcc tgcacctgcc cgcctggcta 1741 cacgggcagg aactgcagtg cccccgtcag caggtgcgag cacgcaccct gccacaatgg 1801 ggccacctgc cacgagaggg gccacggcta tgtgtgcgag tgtgcccgag gctacggggg 1861 tcccaactgc cagttcctgc tccccgagct gcccccgggc ccagcggtgg tggacctcac 1921 tgagaagcta gagggccagg gcgggccatt cccctgggtg gccgtgtgcg ccggggtcat 1981 ccttgtcctc atgctgctgc tgggctgtgc cgctgtggtg gtctgcgtcc ggctgaggct 2041 gcagaagcac cggcccccag ccgacccctg ccggggggag acggagacca tgaacaacct 2101 ggccaactgc cagcgtgaga aggacatctc agtcagcatc atcggggcca cgcagatcaa 2161 gaacaccaac aagaaggcgg acttccacgg ggaccacagc gccgacaaga atggcttcaa 2221 ggcccgctac ccagcggtgg actataacct cgtgcaggac ctcaagggtg acgacaccgc 2281 cgtcagggac gcgcacagca agcgtgacac caagtgccag ccccagggct cctcagggga 2341 ggagaagggg accccgacca cactcagggg tggagaagca tctgaaagaa aaaggccgga 2401 ctcgggctgt tcaacttcaa aagacaccaa gtaccagtcg gtgtacgtca tatccgagga 2461 gaaggatgag tgcgtcatag caactgaggt gtaaaatgga agtgagatgg caagactccc 2521 gtttctctta aaataagtaa aattccaagg atatatgccc caacgaatgc tgctgaagag 2581 gagggaggcc tcgtggactg ctgctgagaa accgagttca gaccgagcag gttctcctcc 2641 tgaggtcctc gacgcctgcc gacagcctgt cgcggcccgg ccgcctgcgg cactgccttc 2701 cgtgacgtcg ccgttgcact atggacagtt gctcttaaga gaatatatat ttaaatgggt 2761 gaactgaatt acgcataaga agcatgcact gcctgagtgt atattttgga ttcttatgag 2821 ccagtctttt cttgaattag aaacacaaac actgccttta ttgtcctttt tgatacgaag 2881 atgtgctttt tctagatgga aaagatgtgt gttatttttt ggatttgtaa aaatattttt 2941 catgatatct gtaaagcttg agtattttgt gatgttcgtt ttttataatt taaattttgg 3001 taaatatgta caaaggcact tcgggtctat gtgactatat ttttttgtat ataaatgtat 3061 ttatggaata ttgtgcaaat gttatttgag ttttttactg ttttgttaat gaagaaattc 3121 ctttttaaaa tatttttcca aaataaattt tatgaggaat tc // LOCUS AF003594 1935 bp mRNA PRI 15-JUN-1997 DEFINITION Homo sapiens growth-factor inducible immediate early gene product CYR61 mRNA, complete cds. ACCESSION AF003594 NID g2196781 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1935) AUTHORS Kolesnikova,T.V. and Lau,L.F. TITLE Human growth-factor inducible gene product CYR61, complete sequence JOURNAL Unpublished REFERENCE 2 (bases 1 to 1935) AUTHORS Kolesnikova,T.V. and Lau,L.F. TITLE Direct Submission JOURNAL Submitted (09-MAY-1997) Genetics, University of Illinois, 900 S. Ashland Ave., M/C 669, Chicago, IL 60607, USA FEATURES Location/Qualifiers source 1..1935 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1p22" /tissue_type="placenta" CDS 124..1269 /codon_start=1 /product="growth-factor inducible immediate early gene product CYR61" /db_xref="PID:g2196782" /translation="MSSRIARALALVVTLLHLTRLALSTCPAACHCPLEAPKCAPGVG LVRDGCGCCKVCAKQLNEDCSKTQPCDHTKGLECNFGASSTALKGICRAQSEGRPCEY NSRIYQNGESFQPNCKHQCTCIDGAVGCIPLCPQELSLPNLGCPNPRLVKVTGQCCEE WVCDEDSIKDPMEDQDGLLGKELGFDASEVELTRNNELIAVGKGSSLKRLPVFGMEPR ILYNPLQGQKCIVQTTSWSQCSKTCGTGISTRVTNDNPECRLVKETRICEVRPCGQPV YSSLKKGKKCSKTKKSPEPVRFTYAGCLSVKKYRPKYCGSCVDGRCCTPQLTRTVKMR FRCEDGETFSKNVMMIQSCKCNYNCPHANEAAFPFYRLFNDIHKFRD" BASE COUNT 488 a 481 c 510 g 456 t ORIGIN 1 gggcgggccc accgcgacac cgcgccgcca ccccgacccc gctgcgcacg gcctgtccgc 61 tgcacaccag cttgttggcg tcttcgtcgc cgcgctcgcc ccgggctact cctgcgcgcc 121 acaatgagct cccgcatcgc cagggcgctc gccttagtcg tcacccttct ccacttgacc 181 aggctggcgc tctccacctg ccccgctgcc tgccactgcc ccctggaggc gcccaagtgc 241 gcgccgggag tcgggctggt ccgggacggc tgcggctgct gtaaggtctg cgccaagcag 301 ctcaacgagg actgcagcaa aacgcagccc tgcgaccaca ccaaggggct ggaatgcaac 361 ttcggcgcca gctccaccgc tctgaagggg atctgcagag ctcagtcaga gggcagaccc 421 tgtgaatata actccagaat ctaccaaaac ggggaaagtt tccagcccaa ctgtaaacat 481 cagtgcacat gtattgatgg cgccgtgggc tgcattcctc tgtgtcccca agaactatct 541 ctccccaact tgggctgtcc caaccctcgg ctggtcaaag ttaccgggca gtgctgcgag 601 gagtgggtct gtgacgagga tagtatcaag gaccccatgg aggaccagga cggcctcctt 661 ggcaaggagc tgggattcga tgcctccgag gtggagttga cgagaaacaa tgaattgatt 721 gcagttggaa aaggcagctc actgaagcgg ctccctgttt ttggaatgga gcctcgcatc 781 ctatacaacc ctttacaagg ccagaaatgt attgttcaaa caacttcatg gtcccagtgc 841 tcaaagacct gtggaactgg tatctccaca cgagttacca atgacaaccc tgagtgccgc 901 cttgtgaaag aaacccggat ttgtgaggtg cggccttgtg gacagccagt gtacagcagc 961 ctgaaaaagg gcaagaaatg cagcaagacc aagaaatccc ccgaaccagt caggtttact 1021 tacgctggat gtttgagtgt gaagaaatac cggcccaagt actgcggttc ctgcgtggac 1081 ggccgatgct gcacgcccca gctgaccagg actgtgaaga tgcggttccg ctgcgaagat 1141 ggggagacat tttccaagaa cgtcatgatg atccagtcct gcaaatgcaa ctacaactgc 1201 ccgcatgcca atgaagcagc gtttcccttc tacaggctgt tcaatgacat tcacaaattt 1261 agggactaaa tgctacctgg gtttccaggg cacacctaga caaacaaggg agaagagtgt 1321 cagaatcaga atcatggaga aaatgggcgg gggtggtgtg ggtgatggga ctcattgtag 1381 aaaggaagcc ttctcattct tgaggagcat taaggtattt cgaaactgcc aagggtgctg 1441 gtgcggatgg acactaatgc agccacgatt ggagaatact tgcttcatag tattggagca 1501 catgttactg cttcattttg gagcttgtgg agttgatgac ttctgttttc tgtttgtaaa 1561 ttatttgcta agcatatttt ctctaggctt ttttactttt ggggttctac agtcgtaaaa 1621 gagataataa ggttagttgc acagtttaaa gcttttattc gtccttacaa aagtaaatgg 1681 gagggcattc catccccttc cctgaagggg gacactccat gagtgtctgt gagaggcagc 1741 tatctgcact ctaaactgca aacagaaatc aggtgtttta agactgaatg ttttatttat 1801 caaaatgtag cttttgggga gggaggggaa atgtaatact ggaataattt gtaaatgatt 1861 ttaattttat attcagtgaa aagattttat ttatggaatt aaccatttaa taaagaaata 1921 tttacctaaa aaaaa // LOCUS AF003837 5942 bp mRNA PRI 06-SEP-1997 DEFINITION Homo sapiens Jagged1 (JAG1) mRNA, complete cds. ACCESSION AF003837 NID g2228792 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5942) AUTHORS Oda,T., Elkahloun,A.G., Pike,B.L., Okajima,K., Krantz,I.D., Genin,A., Piccoli,D.A., Meltzer,P.S., Spinner,N.B., Collins,F.S. and Chandrasekharappa,S.C. TITLE Mutations in the human Jagged1 gene (JAG1) are responsible for the Alagille syndrome JOURNAL Nature Genet. (1997) In press REFERENCE 2 (bases 1 to 5942) AUTHORS Oda,T., Elkahloun,A.G., Meltzer,P.S. and Chandrasekharappa,S.C. TITLE Identification and cloning of the human homolog (JAG1) of the rat Jagged1 gene from the Alagille syndrome critical region at 20p12 JOURNAL Genomics 43 (3), 376-379 (1997) MEDLINE 97422615 REFERENCE 3 (bases 1 to 5942) AUTHORS Oda,T. and Chandrasekharappa,S.C. TITLE Direct Submission JOURNAL Submitted (12-MAY-1997) LGT, NHGRI, NIH, 49 Convent Dr., MSC4442, Bldg. 49, Room 3C36, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..5942 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="20" /map="20p12" gene 1..5942 /gene="JAG1" exon 1..540 /gene="JAG1" /number=1 CDS 460..4116 /gene="JAG1" /note="similar to R. norvegicus Jagged1 protein" /codon_start=1 /product="Jagged1" /db_xref="PID:g2228793" /translation="MRSPRTRGRSGRPLSLLLALLCALRAKVCGASGQFELEILSMQN VNGELQNGNCCGGARNPGDRKCTRDECDTYFKVCLKEYQSRVTAGGPCSFGSGSTPVI GGNTFNLKASRGNDRNRIVLPFSFAWPRSYTLLVEAWDSSNDTVQPDSIIEKASHSGM INPSRQWQTLKQNTGVAHFEYQIRVTCDDYYYGFGCNKFCRPRDDFFGHYACDQNGNK TCMEGWMGRECNRAICRQGCSPKHGSCKLPGDCRCQYGWQGLYCDKCIPHPGCVHGIC NEPWQCLCETNWGGQLCDKDLNYCGTHQPCLNGGTCSNTGPDKYQCSCPEGYSGPNCE IAEHACLSDPCHNRGSCKETSLGFECECSPGWTGPTCSTNIDDCSPNNCSHGGTCQDL VNGFKCVCPPQWTGKTCQLDANECEAKPCVNAKSCKNLIASYYCDCLPGWMGQNCDIN INDCLGQCQNDASCRDLVNGYRCICPPGYAGDHCERDIDECASNPCLDGGHCQNEINR FQCLCPTGFSGNLCQLDIDYCEPNPCQNGAQCYNRASDYFCKCPEDYEGKNCSHLKDH CRTTPCEVIDSCTVAMASNDTPEGVRYISSNVCGPHGKCKSQSGGKFTCDCNKGFTGT YCHENINDCESNPCRNGGTCIDGVNSYKCICSDGWEGAYCETNINDCSQNPCHNGGTC RDLVNDFYCDCKNGWKGKTCHSRDSQCDEATCNNGGTCYDEGDAFKCMCPGGWEGTTC NIARNSSCLPNPCHNGGTCVVNGESFTCVCKEGWEGPICAQNTNDCSPHPCYNSGTCV DGDNWYRCECAPGFAGPDCRININECQSSPCAFGATCVDEINGYRCVCPPGHSGAKCQ EVSGRPCITMGSVIPDGAKWDDDCNTCQCLNGRIACSKVWCGPRPCLLHKGHSECPSG QSCIPILDDQCFVHPCTGVGECRSSSLQPVKTKCTSDSYYQDNCANITFTFNKEMMSP GLTTEHICSELRNLNILKNVSAEYSIYIACEPSPSANNEIHVAISAEDIRDDGNPIKE ITDKIIDLVSKRDGNSSLIAAVAEVRVQRRPLKNRTDFLVPLLSSVLTVAWICCLVTA FYWCLRKRRKPGSHTHSASEDNTTNNVREQLNQIKNPIEKHGANTVPIKDYENKNSKM SKIRTHNSEVEEDDMDKHQQKARFAKQPAYTLVDREEKPPNGTPTKHPNWTNKQDNRD LESAQSLNRMEYIV" exon 541..846 /gene="JAG1" /number=2 exon 847..898 /gene="JAG1" /number=3 exon 899..1153 /gene="JAG1" /number=4 exon 1154..1214 /gene="JAG1" /number=5 exon 1215..1345 /gene="JAG1" /number=6 exon 1346..1465 /gene="JAG1" /number=7 exon 1466..1579 /gene="JAG1" /number=8 exon 1580..1693 /gene="JAG1" /number=9 exon 1694..1807 /gene="JAG1" /number=10 exon 1808..1854 /gene="JAG1" /number=11 exon 1855..2028 /gene="JAG1" /number=12 exon 2029..2179 /gene="JAG1" /number=13 exon 2180..2344 /gene="JAG1" /number=14 exon 2345..2458 /gene="JAG1" /number=15 exon 2459..2572 /gene="JAG1" /number=16 exon 2573..2686 /gene="JAG1" /number=17 exon 2687..2803 /gene="JAG1" /number=18 exon 2804..2831 /gene="JAG1" /number=19 exon 2832..2917 /gene="JAG1" /number=20 exon 2918..3031 /gene="JAG1" /number=21 exon 3032..3141 /gene="JAG1" /number=22 exon 3142..3375 /gene="JAG1" /number=23 exon 3376..3507 /gene="JAG1" /number=24 exon 3508..3658 /gene="JAG1" /number=25 exon 3659..5942 /gene="JAG1" /number=26 BASE COUNT 1520 a 1393 c 1544 g 1485 t ORIGIN 1 ccgggtcctt ctccgagagc cgggcgggca cgcgtcattg tgttacctgc ggccggcccg 61 cgagctaggc tggttttttt tttttctccc ctccctcccc cctttttcca tgcagctgat 121 ctaaaaggga ataaaaggct gcgcataatc ataataataa aagaagggga gcgcgagaga 181 aggaaagaaa gccgggaggt ggaagaggag ggggagcgtc tcaaagaagc gatcagaata 241 ataaaaggag gccgggctct ttgccttctg gaacgggccg ctcttgaaag ggcttttgaa 301 aagtggtgtt gttttccagt cgtgcatgct ccaatcggcg gagtatatta gagccgggac 361 gcggcggccg caggggcagc ggcgacggca gcaccggcgg cagcaccagc gcgaacagca 421 gcggcggcgt cccgagtgcc cgcggcgcgc ggcgcagcga tgcgttcccc acggacgcgc 481 ggccggtccg ggcgccccct aagcctcctg ctcgccctgc tctgtgccct gcgagccaag 541 gtgtgtgggg cctcgggtca gttcgagttg gagatcctgt ccatgcagaa cgtgaacggg 601 gagctgcaga acgggaactg ctgcggcggc gcccggaacc cgggagaccg caagtgcacc 661 cgcgacgagt gtgacacata cttcaaagtg tgcctcaagg agtatcagtc ccgcgtcacg 721 gccggggggc cctgcagctt cggctcaggg tccacgcctg tcatcggggg caacaccttc 781 aacctcaagg ccagccgcgg caacgaccgc aaccgcatcg tgctgccttt cagtttcgcc 841 tggccgaggt cctatacgtt gcttgtggag gcgtgggatt ccagtaatga caccgttcaa 901 cctgacagta ttattgaaaa ggcttctcac tcgggcatga tcaaccccag ccggcagtgg 961 cagacgctga agcagaacac gggcgttgcc cactttgagt atcagatccg cgtgacctgt 1021 gatgactact actatggctt tggctgcaat aagttctgcc gccccagaga tgacttcttt 1081 ggacactatg cctgtgacca gaatggcaac aaaacttgca tggaaggctg gatgggccgc 1141 gaatgtaaca gagctatttg ccgacaaggc tgcagtccta agcatgggtc ttgcaaactc 1201 ccaggtgact gcaggtgcca gtacggctgg caaggcctgt actgtgataa gtgcatccca 1261 cacccgggat gcgtccacgg catctgtaat gagccctggc agtgcctctg tgagaccaac 1321 tggggcggcc agctctgtga caaagatctc aattactgtg ggactcatca gccgtgtctc 1381 aacgggggaa cttgtagcaa cacaggccct gacaaatatc agtgttcctg ccctgagggg 1441 tattcaggac ccaactgtga aattgctgag cacgcctgcc tctctgatcc ctgtcacaac 1501 agaggcagct gtaaggagac ctccctgggc tttgagtgtg agtgttcccc aggctggacc 1561 ggccccacat gctctacaaa cattgatgac tgttctccta ataactgttc ccacgggggc 1621 acctgccagg acctggttaa cggatttaag tgtgtgtgcc ccccacagtg gactgggaaa 1681 acgtgccagt tagatgcaaa tgaatgtgag gccaaacctt gtgtaaacgc caaatcctgt 1741 aagaatctca ttgccagcta ctactgcgac tgtcttcccg gctggatggg tcagaattgt 1801 gacataaata ttaatgactg ccttggccag tgtcagaatg acgcctcctg tcgggatttg 1861 gttaatggtt atcgctgtat ctgtccacct ggctatgcag gcgatcactg tgagagagac 1921 atcgatgaat gtgccagcaa cccctgtttg gatgggggtc actgtcagaa tgaaatcaac 1981 agattccagt gtctgtgtcc cactggtttc tctggaaacc tctgtcagct ggacatcgat 2041 tattgtgagc ctaatccctg ccagaacggt gcccagtgct acaaccgtgc cagtgactat 2101 ttctgcaagt gccccgagga ctatgagggc aagaactgct cacacctgaa agaccactgc 2161 cgcacgaccc cctgtgaagt gattgacagc tgcacagtgg ccatggcttc caacgacaca 2221 cctgaagggg tgcggtatat ttcctccaac gtctgtggtc ctcacgggaa gtgcaagagt 2281 cagtcgggag gcaaattcac ctgtgactgt aacaaaggct tcacgggaac atactgccat 2341 gaaaatatta atgactgtga gagcaaccct tgtagaaacg gtggcacttg catcgatggt 2401 gtcaactcct acaagtgcat ctgtagtgac ggctgggagg gggcctactg tgaaaccaat 2461 attaatgact gcagccagaa cccctgccac aatgggggca cgtgtcgcga cctggtcaat 2521 gacttctact gtgactgtaa aaatgggtgg aaaggaaaga cctgccactc acgtgacagt 2581 cagtgtgatg aggccacgtg caacaacggt ggcacctgct atgatgaggg ggatgctttt 2641 aagtgcatgt gtcctggcgg ctgggaagga acaacctgta acatagcccg aaacagtagc 2701 tgcctgccca acccctgcca taatgggggc acatgtgtgg tcaacggcga gtcctttacg 2761 tgcgtctgca aggaaggctg ggaggggccc atctgtgctc agaataccaa tgactgcagc 2821 cctcatccct gttacaacag cggcacctgt gtggatggag acaactggta ccggtgcgaa 2881 tgtgccccgg gttttgctgg gcccgactgc agaataaaca tcaatgaatg ccagtcttca 2941 ccttgtgcct ttggagcgac ctgtgtggat gagatcaatg gctaccggtg tgtctgccct 3001 ccagggcaca gtggtgccaa gtgccaggaa gtttcaggga gaccttgcat caccatgggg 3061 agtgtgatac cagatggggc caaatgggat gatgactgta atacctgcca gtgcctgaat 3121 ggacggatcg cctgctcaaa ggtctggtgt ggccctcgac cttgcctgct ccacaaaggg 3181 cacagcgagt gccccagcgg gcagagctgc atccccatcc tggacgacca gtgcttcgtc 3241 cacccctgca ctggtgtggg cgagtgtcgg tcttccagtc tccagccggt gaagacaaag 3301 tgcacctctg actcctatta ccaggataac tgtgcgaaca tcacatttac ctttaacaag 3361 gagatgatgt caccaggtct tactacggag cacatttgca gtgaattgag gaatttgaat 3421 attttgaaga atgtttccgc tgaatattca atctacatcg cttgcgagcc ttccccttca 3481 gcgaacaatg aaatacatgt ggccatttct gctgaagata tacgggatga tgggaacccg 3541 atcaaggaaa tcactgacaa aataattgat cttgttagta aacgtgatgg aaacagctcg 3601 ctgattgctg ccgttgcaga agtaagagtt cagaggcggc ctctgaagaa cagaacagat 3661 ttccttgttc ccttgctgag ctctgtctta actgtggctt ggatctgttg cttggtgacg 3721 gccttctact ggtgcctgcg gaagcggcgg aagccgggca gccacacaca ctcagcctct 3781 gaggacaaca ccaccaacaa cgtgcgggag cagctgaacc agatcaaaaa ccccattgag 3841 aaacatgggg ccaacacggt ccccatcaag gattacgaga acaagaactc caaaatgtct 3901 aaaataagga cacacaattc tgaagtagaa gaggacgaca tggacaaaca ccagcagaaa 3961 gcccggtttg ccaagcagcc ggcgtatacg ctggtagaca gagaagagaa gccccccaac 4021 ggcacgccga caaaacaccc aaactggaca aacaaacagg acaacagaga cttggaaagt 4081 gcccagagct taaaccgaat ggagtacatc gtatagcaga ccgcgggcac tgccgccgct 4141 aggtagagtc tgagggcttg tagttcttta aactgtcgtg tcatactcga gtctgaggcc 4201 gttgctgact tagaatccct gtgttaattt aagttttgac aagctggctt acactggcaa 4261 tggtagtttc tgtggttggc tgggaaatcg agtgccgcat ctcacagcta tgcaaaaagc 4321 tagtcaacag taccctggtt gtgtgtcccc ttgcagccga cacggtctcg gatcaggctc 4381 ccaggagcct gcccagcccc ctggtctttg agctcccact tctgccagat gtcctaatgg 4441 tgatgcagtc ttagatcata gttttattta tatttattga ctcttgagtt gtttttgtat 4501 attggtttta tgatgacgta caagtagttc tgtatttgaa agtgcctttg cagctcagaa 4561 ccacagcaac gatcacaaat gactttatta tttatttttt taattgtatt tttgttgttg 4621 ggggagggga gactttgatg tcagcagttg ctggtaaaat gaagaattta aagaaaaaaa 4681 tgtcaaaagt agaactttgt atagttatgt aaataattct tttttattaa tcactgtgta 4741 tatttgattt attaacttaa taatcaagag ccttaaaaca tcattccttt ttatttatat 4801 gtatgtgttt agaattgaag gtttttgata gcattgtaag cgtatggctt tatttttttg 4861 aactcttctc attacttgtt gcctataagc caaaattaag gtgtttgaaa atagtttatt 4921 ttaaaacaat aggatgggct tctgtgccca gaatactgat ggaatttttt ttgtacgacg 4981 tcagatgttt aaaacacctt ctatagcatc acttaaaaca cgttttaagg actgactgag 5041 gcagtttgag gattagttta gaacaggttt ttttgtttgt ttgttttttg tttttctgct 5101 ttagacttga aaagagacag gcaggtgatc tgctgcagag cagtaaggga acaagttgag 5161 ctatgactta acatagccaa aatgtgagtg gttgaatatg attaaaaata tcaaattaat 5221 tgtgtgaact tggaagcaca ccaatctgac tttgtaaatt ctgatttctt ttcaccattc 5281 gtacataata ctgaaccact tgtagatttg attttttttt taatctactg catttaggga 5341 gtattctaat aagctagttg aatacttgaa ccataaaatg tccagtaaga tcactgttta 5401 gatttgccat agagtacact gcctgcctta agtgaggaaa tcaaagtgct attacgaagt 5461 tcaagatcaa aaaggcttat aaaacagagt aatcttgttg gttcaccatt gagaccgtga 5521 agatactttg tattgtccta ttagtgttat atgaacatac aaatgcatct ttgatgtgtt 5581 gttcttggca ataaattttg aaaagtaata tttattaaat ttttttgtat gaaaacatgg 5641 aacagtgtgg ctcttctgag cttacgtagt tctaccggct ttgccgtgtg cttctgccac 5701 cctgctgagt ctgttctggt aatcggggta taataggctc tgcctgacag agggatggag 5761 gaagaactga aaggcttttc aaccacaaaa ctcatctgga gttctcaaag acctggggct 5821 gctgtgaagc tggaactgcg ggagccccat ctaggggagc cttgattccc ttgttattca 5881 acagcaagtg tgaatactgc ttgaataaac accactggat taatggaaaa aaaaaaaaaa 5941 aa // LOCUS AF004022 1253 bp mRNA PRI 06-AUG-1997 DEFINITION Homo sapiens protein kinase mRNA, complete cds. ACCESSION AF004022 NID g2306914 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1253) AUTHORS Sanseau,P. and Prigent,C. TITLE Cloning of a new human protein kinase JOURNAL Unpublished REFERENCE 2 (bases 1 to 1253) AUTHORS Sanseau,P. and Prigent,C. TITLE Direct Submission JOURNAL Submitted (13-MAY-1997) Genomics, Glaxo-Wellcome, Gunnels Wood Road, Stevenage, Herts SG1 2NY, UK FEATURES Location/Qualifiers source 1..1253 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17" CDS 59..1102 /codon_start=1 /product="protein kinase" /db_xref="PID:g2306915" /translation="MAQKENSYPWPYGRQTAPSGLSTLPQRVLRKEPVTPSALVLMSR SNVQPTAAPGQKVMENSSGTPDILTRHFTIDDFEIGRPLGKGKFGNVYLAREKKSHFI VALKVLFKSQIEKEGVEHQLRREIEIQAHLHHPNILRLYNYFYDRRRIYLILEYAPRG MLYKELHKTCTFDEQRTATVRRIMEELADALMYCHGKKVIHRDIKPENLLLGLKGELK IADFGWSVHAPSLRRKTMCGTLDYLPPEMIEGRMHNEKVDLWCIGVLCYELLVGNPPF ESASHNETYRRIVKVDLKFPASVPTGAQDLISKLLRHNPSERLPLAQVSAHPWVRANS RRVLPPSALQSVA" BASE COUNT 290 a 355 c 333 g 275 t ORIGIN 1 cggccgggag agtagcagtg ccttggaccc cagctctcct ccccctttct ctctaaggat 61 ggcccagaag gagaactcct acccctggcc ctacggccga cagacggctc catctggcct 121 gagcaccctg ccccagcgag tcctccggaa agagcctgtc accccatctg cacttgtcct 181 catgagccgc tccaatgtcc agcccacagc tgcccctggc cagaaggtga tggagaatag 241 cagtgggaca cccgacatct taacgcggca cttcacaatt gatgactttg agattgggcg 301 tcctctgggc aaaggcaagt ttggaaacgt gtacttggct cgggagaaga aaagccattt 361 catcgtggcg ctcaaggtcc tcttcaagtc ccagatagag aaggagggcg tggagcatca 421 gctgcgcaga gagatcgaaa tccaggccca cttgcaccat cccaacatcc tgcgtctcta 481 caactatttt tatgaccgga gaaggatcta cttgattcta gagtatgccc cccgcgggat 541 gctctacaag gagctgcaca agacctgcac atttgacgag cagcgaacag ccacggtccg 601 gcggatcatg gaggagttgg cagatgctct aatgtactgc catgggaaga aggtgattca 661 cagagacata aagccagaaa atctgctctt agggctcaag ggagagctga agattgctga 721 cttcggctgg tctgtgcatg cgccctccct gaggaggaag acaatgtgtg gcaccctgga 781 ctacctgccc ccagagatga ttgaggggcg catgcacaat gagaaggtgg atctgtggtg 841 cattggagtg ctttgctatg agctgctggt ggggaaccca ccctttgaga gtgcatcaca 901 caacgagacc tatcgccgca tcgtcaaggt ggacctaaag ttccccgctt ctgtgcccac 961 gggagcccag gacctcatct ccaaactgct caggcataac ccctcggaac ggctgcccct 1021 ggcccaggtc tcagcccacc cttgggtccg ggccaactct cggagggtgc tgcctccctc 1081 tgcccttcaa tctgtcgcct gatggtccct gtcattcact cgggtgcgtg tgtttgtatg 1141 tctgtgtatg tataggggaa agaagggatc cctaactgtt cccttatctg ttttctacct 1201 cctcctttgt ttaataaagg ctgaagcttt ttgtaaaaaa aaaaaaaaaa ata // LOCUS AF004291 1842 bp mRNA PRI 22-JUN-1997 DEFINITION Homo sapiens germ cell nuclear factor (GCNF) mRNA, complete cds. ACCESSION AF004291 NID g2209118 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1842) AUTHORS Agoulnik,I.U., Cooney,J.A. and Kieback,D.G. TITLE Cloning and expression analysis of the human homolog of the orphan receptor GCNF JOURNAL Unpublished REFERENCE 2 (bases 1 to 1842) AUTHORS Agoulnik,I.U., Cooney,J.A. and Kieback,D.G. TITLE Direct Submission JOURNAL Submitted (14-MAY-1997) OB/GYN, Baylor College of Medicine, 6550 Fannin, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..1842 /organism="Homo sapiens" /db_xref="taxon:9606" gene 187..1551 /gene="GCNF" CDS 187..1551 /gene="GCNF" /function="nuclear receptor" /codon_start=1 /product="germ cell nuclear factor" /db_xref="PID:g2209119" /translation="MERDEPPPPRNGFCQDELAELDPGTNDRAEQRTCLICGDRATGL HYGIISCEGCKGFFKRSICNKRVYRCSRDKNCVMSRKQRNRCQYCRLLKCLQMGMNRK AIREDGMPGGRNKSIGPVQISEEEIERIMSGQEFEEEANHWSNHGDSDHSSPGNRASE SNQPSPGSTLSSSRSVELNGFMAFREQYMGMSVPPHYQYIPHLFSYSGHSPLLPQQAR SLDPQSYSLIHQLLSAEDLEPLGTPMLIEDGYAVTQAELFALLCRLADELLFRQIAWI KKLPFFCELSIKDYTCLLSSTWQELILLSSLTVYSKQIFGELADVTAKYSPSDEELHR FSDEGMEVIERLIYLYHKFHQLKVSNEEYACMKAINFLNQDIRGLTSASQLEQLNKRY WYICQDFTEYKYTHQPNRFPDLMMCLPEIRYIAGKMVNVPLEQLPLLFKVVLHSCKTS VGKE" BASE COUNT 452 a 470 c 501 g 419 t ORIGIN 1 ggcacgaggc ggcgcggagg ggcgcggagc ggcgcggaac cgggcggctc ggggcccaga 61 gagagccgcg gccgggagct cgcgggctcc tgacaacctc ctcccctcgg cggacgacga 121 ccacggcgac tagggcgccg gtcatggcgg agcaacaaac ccggcgcgga ccctaggcac 181 caccgcatgg agcgggacga acctccgccg ccgcgcaacg gtttctgtca ggatgaattg 241 gcagagcttg acccaggcac taatgatcgg gctgaacaac gaacctgtct catttgtggg 301 gaccgcgcta caggcttgca ctatgggatc atctcctgtg agggctgcaa agggtttttc 361 aagcggagca tttgcaacaa acgggtatat cgatgcagtc gtgacaagaa ctgtgtcatg 421 tctcggaagc agaggaacag gtgccagtac tgccgcctgc tcaaatgcct ccagatgggg 481 atgaaccgga aggctatcag agaagatggc atgcctggag gccggaataa gagcattggg 541 ccagtccaga tatcggaaga agaaatcgaa aggatcatgt ctgggcagga gtttgaggaa 601 gaggccaatc actggagcaa ccatggtgat agtgaccaca gttcccctgg gaacagggct 661 tcggagagca accagccctc accaggctcc acactgtctt ccagtaggtc tgtggaactg 721 aatggattca tggccttcag ggaacagtac atgggaatgt ctgtgcctcc acattaccaa 781 tatataccgc acctttttag ctattctggc cactcaccac ttctgcccca acaagctcgc 841 agcctggatc cccagtcata cagtctgatt caccagctgt tatcagccga ggacctggaa 901 ccattgggca cgcccatgtt gattgaagat ggatacgctg tgacacaggc agaactattt 961 gccctgcttt gccgcctggc cgacgagctg ctctttaggc agattgcctg gatcaagaaa 1021 ctgcctttct tctgcgagct ctcaatcaag gattacacgt gcctcttgag ctctacgtgg 1081 caggagctaa tcctgctgtc ttccctcacc gtttacagca agcagatctt tggggaactg 1141 gctgatgtca ctgccaagta ctcgccctcc gatgaagaac tacacagatt tagtgatgaa 1201 gggatggagg tgatcgagcg gctcatctac ctctatcaca agttccatca gctaaaggtc 1261 agcaacgagg agtatgcttg catgaaagca attaacttcc taaatcaaga tatcaggggt 1321 ctgaccagtg cctcacagct ggaacaattg aataaacgat actggtacat ttgccaggat 1381 tttactgaat ataaatacac acatcagccg aaccgctttc ctgatctcat gatgtgctta 1441 cctgagattc gatatattgc aggaaagatg gtgaatgtgc ccctggagca gctgcccctc 1501 ctctttaagg tggtgctgca ttcctgcaag accagtgtgg gcaaggaatg acctgttcca 1561 ggcgccctcc tcaggccaac cacagcgtct tgggtgggca ggacaggctc tggagggaaa 1621 agccagagag accaagatgg aggctgtgga gcagcatttc ccgttgcctc catagcaaga 1681 agagtttttg tttgtttgtc tgttttttta acctcatttt tctatatatt tatttcacga 1741 cagagttgaa tgtatggcct tcaacatgat gcacatgctt ttgtgtgaat gcagccaatg 1801 cattttctta cagtttacag aatgtgaaga tgtttgtaat tt // LOCUS AF004327 2269 bp mRNA PRI 16-JUL-1997 DEFINITION Homo sapiens angiopoietin-2 mRNA, complete cds. ACCESSION AF004327 NID g2257932 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2269) AUTHORS Maisonpierre,P.C., Suri,C., Jones,P.F., Bartunkova,S., Wiegand,S.J., Radziejewski,C., Compton,D., Aldrich,T.H., Papadopoulos,N., Daly,T.J., Davis,S., Sato,T.N. and Yancopoulos,G.D. TITLE Angiopoietin-2, a natural antagonist for tie2 that disrupts in vivo angiogenesis JOURNAL Science 277 (5322), 55-60 (1997) MEDLINE 97349327 REFERENCE 2 (bases 1 to 2269) AUTHORS Maisonpierre,P.C., Suri,C., Jones,P.F., Bartunkova,S., Wiegand,S.J., Radziejewski,C., Compton,D., Aldrich,T.H., Papdopoulos,N., Daly,T.J., Davis,S., Sato,T.N. and Yancopoulos,G.D. TITLE Direct Submission JOURNAL Submitted (16-MAY-1997) Discovery, Regeneron Pharmaceuticals, 777 Old Saw Mill River Rd., Tarrytown, NY 10591, USA FEATURES Location/Qualifiers source 1..2269 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="lung" /dev_stage="fetal" /clone_lib="human fetal lung cDNA library (Clontech)" CDS 350..1840 /note="Ligand for Tie2/Tek receptor tyrosine kinase; Method: conceptual translation with partial peptide sequencing" /codon_start=1 /product="angiopoietin-2" /db_xref="PID:g2257933" /translation="MWQIVFFTLSCDLVLAAAYNNFRKSMDSIGKKQYQVQHGSCSYT FLLPEMDNCRSSSSPYVSNAVQRDAPLEYDDSVQRLQVLENIMENNTQWLMKLENYIQ DNMKKEMVEIQQNAVQNQTAVMIEIGTNLLNQTAEQTRKLTDVEAQVLNQTTRLELQL LEHSLSTNKLEKQILDQTSEINKLQDKNSFLEKKVLAMEDKHIIQLQSIKEEKDQLQV LVSKQNSIIEELEKKIVTATVNNSVLQKQQHDLMETVNNLLTMMSTSNSAKDPTVAKE EQISFRDCAEVFKSGHTTNGIYTLTFPNSTEEIKAYCDMEAGGGGWTIIQRREDGSVD FQRTWKEYKVGFGNPSGEYWLGNEFVSQLTNQQRYVLKIHLKDWEGNEAYSLYEHFYL SSEELNYRIHLKGLTGTAGKISSISQPGNDFSTKDGDNDKCICKCSQMLTGGWWFDAC GPSNLNGMYYPQRQNTNKFNGIKWYYWKGSGYSLKATTMMIRPADF" BASE COUNT 742 a 495 c 518 g 514 t ORIGIN 1 tgggttggtg tttatctcct cccagccttg agggagggaa caacactgta ggatctgggg 61 agagaggaac aaaggaccgt gaaagctgct ctgtaaaagc tgacacagcc ctcccaagtg 121 agcaggactg ttcttcccac tgcaatctga cagtttactg catgcctgga gagaacacag 181 cagtaaaaac caggtttgct actggaaaaa gaggaaagag aagactttca ttgacggacc 241 cagccatggc agcgtagcag ccctgcgttt cagacggcag cagctcggga ctctggacgt 301 gtgtttgccc tcaagtttgc taagctgctg gtttattact gaagaaagaa tgtggcagat 361 tgttttcttt actctgagct gtgatcttgt cttggccgca gcctataaca actttcggaa 421 gagcatggac agcataggaa agaagcaata tcaggtccag catgggtcct gcagctacac 481 tttcctcctg ccagagatgg acaactgccg ctcttcctcc agcccctacg tgtccaatgc 541 tgtgcagagg gacgcgccgc tcgaatacga tgactcggtg cagaggctgc aagtgctgga 601 gaacatcatg gaaaacaaca ctcagtggct aatgaagctt gagaattata tccaggacaa 661 catgaagaaa gaaatggtag agatacagca gaatgcagta cagaaccaga cggctgtgat 721 gatagaaata gggacaaacc tgttgaacca aacagctgag caaacgcgga agttaactga 781 tgtggaagcc caagtattaa atcagaccac gagacttgaa cttcagctct tggaacactc 841 cctctcgaca aacaaattgg aaaaacagat tttggaccag accagtgaaa taaacaaatt 901 gcaagataag aacagtttcc tagaaaagaa ggtgctagct atggaagaca agcacatcat 961 ccaactacag tcaataaaag aagagaaaga tcagctacag gtgttagtat ccaagcaaaa 1021 ttccatcatt gaagaactag aaaaaaaaat agtgactgcc acggtgaata attcagttct 1081 tcaaaagcag caacatgatc tcatggagac agttaataac ttactgacta tgatgtccac 1141 atcaaactca gctaaggacc ccactgttgc taaagaagaa caaatcagct tcagagactg 1201 tgctgaagta ttcaaatcag gacacaccac aaatggcatc tacacgttaa cattccctaa 1261 ttctacagaa gagatcaagg cctactgtga catggaagct ggaggaggcg ggtggacaat 1321 tattcagcga cgtgaggatg gcagcgttga ttttcagagg acttggaaag aatataaagt 1381 gggatttggt aacccttcag gagaatattg gctgggaaat gagtttgttt cgcaactgac 1441 taatcagcaa cgctatgtgc ttaaaataca ccttaaagac tgggaaggga atgaggctta 1501 ctcattgtat gaacatttct atctctcaag tgaagaactc aattatagga ttcaccttaa 1561 aggacttaca gggacagccg gcaaaataag cagcatcagc caaccaggaa atgattttag 1621 cacaaaggat ggagacaacg acaaatgtat ttgcaaatgt tcacaaatgc taacaggagg 1681 ctggtggttt gatgcatgtg gtccttccaa cttgaacgga atgtactatc cacagaggca 1741 gaacacaaat aagttcaacg gcattaaatg gtactactgg aaaggctcag gctattcgct 1801 caaggccaca accatgatga tccgaccagc agatttctaa acatcccagt ccacctgagg 1861 aactgtctcg aactattttc aaagacttaa gcccagtgca ctgaaagtca cggctgcgca 1921 ctgtgtcctc ttccaccaca gagggcgtgt gctcggtgct gacgggaccc acatgctcca 1981 gattagagcc tgtaaacttt atcacttaaa cttgcatcac ttaacggacc aaagcaagac 2041 cctaaacatc cataattgtg attagacaga acacctatgc aaagatgaac ccgaggctga 2101 gaatcagact gacagtttac agacgctgct gtcacaacca agaatgttat gtgcaagttt 2161 atcagtaaat aactggaaaa cagaacactt atgttataca atacagatca tcttggaact 2221 gcattcttct gagcactgtt tatacactgt gtaaataccc atatgtcct // LOCUS AF004561 736 bp mRNA PRI 24-JUN-1997 DEFINITION Homo sapiens p21-Arc mRNA, complete cds. ACCESSION AF004561 NID g2209346 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 736) AUTHORS Machesky,L.M., Reeves,E., Wientjes,F., Mattheyse,F., Grogan,A., Totty,N., Burlingame,A.L., Hsuan,J.J. and Segal,A.W. TITLE Direct Submission JOURNAL Submitted (17-MAY-1997) MRC-LMCB, University College London, Gower St., London WC1E 6BT, England FEATURES Location/Qualifiers source 1..736 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="yy68c05" CDS 1..537 /note="Arp2/3 complex 21kDa subunit" /codon_start=1 /product="p21-Arc" /db_xref="PID:g2209347" /translation="MPAYHSSLMDPDTKLIGNMALLPIRSQFKGPAPRETKDTDIVDE AIYYFKANVFFKNYEIKNEADRTLIYITLYISECLKKLQKCNSKSQGEKEMYTLGITN FPIPGEPGFPLNAIYAKPANKQEDEVMRAYLPQLRQETGLRLCEKVFDPQNDKPSKWW TCFVKRQFMNKSLSGPGQ" BASE COUNT 239 a 148 c 161 g 188 t ORIGIN 1 atgccggctt accactcttc tctcatggat cctgatacca aactcatcgg aaacatggca 61 ctgttgccta tcagaagtca attcaaagga cctgccccca gagagacaaa agatacagat 121 attgtggatg aagccatcta ttacttcaag gccaatgtct tcttcaaaaa ctatgaaatt 181 aagaatgaag ctgataggac cttgatatat ataactctct acatttctga atgtctgaag 241 aaactgcaaa agtgcaattc caaaagccaa ggtgagaaag aaatgtatac gctgggaatc 301 actaattttc ccattcctgg agagcctggt tttccactta acgcaattta tgccaaacct 361 gcaaacaaac aggaagatga agtgatgaga gcctatttac cacagctaag gcaagagact 421 ggactgagac tttgtgagaa agttttcgac cctcagaatg ataaacccag caagtggtgg 481 acttgctttg tgaagagaca gttcatgaac aagagtcttt caggacctgg acagtgaagg 541 gagcccgggc agccactgtc tccagagccc tgggcagcat tttccagcaa gatgtacaca 601 atcttttgcc tttatttcgt aaagttttat acagaagaga gaagagcatg tctttacttg 661 aaaaactctt gatcaagaat ttgggtggga gaaaagaaag tgggttatca agggtgattt 721 gaaattttct gcagca // LOCUS AF004668 1522 bp mRNA PRI 01-DEC-1997 DEFINITION Homo sapiens Sia alpha2,3Galbeta1,4GlcNAcalpha 2,8-sialyltransferase mRNA, complete cds. ACCESSION AF004668 NID g2653773 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1522) AUTHORS Kim,Y.-J., Kim,K.-S., Kim,C.-H. and Lee,Y.-C. TITLE Molecular cloning and expression of human Sia alpha 2,3Gal beta 1,4GlcNAc alpha 2,8-sialyltransferase (hSTSia8 III) JOURNAL Unpublished REFERENCE 2 (bases 1 to 1522) AUTHORS Kim,Y.-J., Kim,K.-S., Kim,C.-H. and Lee,Y.-C. TITLE Direct Submission JOURNAL Submitted (17-MAY-1997) Molecular Glycobiology, Korea Research Institute of Bioscience and Biotechnology, P.O. Box 115, Yusong, Taejon 305-600, South Korea FEATURES Location/Qualifiers source 1..1522 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /clone_lib="Clontech # HL1162a" 5'UTR 1..52 CDS 53..1195 /note="hST8Sia III; exhibits activity toward gangliosides, GM1b, GD1a, GT1b and GD3" /codon_start=1 /product="Sia alpha2,3Galbeta1,4GlcNAcalpha 2,8-sialyltransferase" /db_xref="PID:g2653774" /translation="MRNCKMARVASVLGLVMLSVALLILSLISYVSLKKENIFTTPKY ASPGAPRMYMFHAGFRSQFALKFLDPSFVPITNSLTQELQEKPSKWKFNRTAFLHQRQ EILQHVDVIKNFSLTKNSVRIGQLMHYDYSSHKYVFSISNNFRSLLPDVSPIMNKHYN ICAVVGNSGILTFIQCGREIDKSDFVFRCNFAPSEAFQRDVGRKTNLTTFNPSILEKY YNNLLTIQDRNNFFLSLKKLDGAILWIPAFFFHTSATVTRTLVDFFVEHRGQLKVQLA WPGNIMQHVNRYWKNKHLSPKRLSTGILMYTLASAICEEIHLYGFWPFGFDPNTREDL PYHYYDKKGTKFTTKWQESHQLPAEFQLLYRMHGEGLTKLTLSHCA" 3'UTR 1196..1522 BASE COUNT 436 a 344 c 315 g 427 t ORIGIN 1 ggcgctcaat ggaccgattt ccccggtttc cctgaaccca gcccagcccg ggatgagaaa 61 ctgcaaaatg gcccgggtcg ccagtgtgct ggggctggtc atgctcagcg tcgccctgct 121 gattttatcg ctcatcagct acgtgtccct gaaaaaggag aacatcttca ccactcccaa 181 gtacgccagc ccgggggcgc cccgaatgta catgttccac gcgggattcc ggtcacaatt 241 tgcgctgaag tttctagacc cgtcattcgt gcccattacg aattctctca cccaggaact 301 ccaagagaaa ccttctaagt ggaaatttaa tcggacagcg tttttacatc aaaggcaaga 361 aattcttcag catgtcgatg taataaaaaa tttttctttg accaagaata gtgttcggat 421 tggacaactg atgcactatg attattccag ccataaatat gttttctcta ttagcaataa 481 cttccggtca cttcttccag atgtgtcacc cattatgaac aagcattata atatttgtgc 541 tgtggttgga aatagtggga tcctgacatt tatccagtgt ggacgagaaa tagataaatc 601 agattttgtt ttccgttgca atttcgcccc atcggaggct ttccaaagag atgttggaag 661 gaaaaccaat cttaccacct tcaaccccag catcctggaa aaatattaca acaatctctt 721 gactattcag gaccgtaaca actttttcct cagtttaaaa aagcttgacg gggccattct 781 ttggatccct gcatttttct tccacacttc agcaactgtg accaggacat tagttgactt 841 ttttgttgaa cacagaggtc agttaaaagt ccaactggct tggccgggaa atataatgca 901 acatgtcaac aggtactgga aaaacaaaca tttgtcacct aaacggctga gcacaggtat 961 tcttatgtac acccttgcat cagcaatatg tgaagagatc cacttgtatg gattttggcc 1021 gtttggattt gaccccaaca caagggaaga tcttccatac cattactatg acaaaaaagg 1081 aaccaaattt accaccaagt ggcaggagtc ccaccagctg cctgctgagt ttcagctgct 1141 gtaccgaatg catggggaag ggctcaccaa gctgactctg tcacactgtg cctaagaact 1201 ccaaacggaa agcgccaaat ggctgtttaa aaagtgcccc aaatcaaatt gaatagcctt 1261 cagaatagaa ccctagagaa tgtcttataa ggattgtctg ccatttaaaa ggaaagatgt 1321 cttttctctt ttgcactgct cttttaagag ttttagcaga tttagcagga cagatgcatt 1381 gaagccacat ggtttagact tgattgataa agggaatgtt gcatttggga ctatgctgct 1441 aacgaaatgg tttgaagtat tttcatgttt ggattttaat aataaactgc ctctcatttt 1501 tatgaagact agagctatat tt // LOCUS AF004709 1838 bp mRNA PRI 02-JUL-1997 DEFINITION Homo sapiens stress-activated protein kinase 4 mRNA, complete cds. ACCESSION AF004709 NID g2232213 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1838) AUTHORS Kumar,S., McDonnell,P.C., Gum,R.J., Hand,A.T., Lee,J.C. and Young,P.R. TITLE Novel homologues of CSBP/p38 MAP kinase: activation, substrate specificity and sensitivity to inhibition by pyridinyl imidazoles JOURNAL Biochem. Biophys. Res. Commun. 235 (3), 533-538 (1997) MEDLINE 97350815 REFERENCE 2 (bases 1 to 1838) AUTHORS McDonnell,P.C. and Young,P.R. TITLE Direct Submission JOURNAL Submitted (19-MAY-1997) Molecular Immunology, SmithKline Beecham, 709 Swedeland Road, King of Prussia, PA 19406-0939, USA FEATURES Location/Qualifiers source 1..1838 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="activated T cell" CDS 49..1146 /function="serine/threonine protein kinase" /note="SAPK4; similar to CSBP/p38 MAP kinase" /codon_start=1 /product="stress-activated protein kinase 4" /db_xref="PID:g2232214" /translation="MSLIRKKGFYKQELNKTAWELPKTYVSPTHVGSGAYGSWCSAID KRSGEKVAIKKLSRPFQSEIFAKRAYRELLLLKHMQHENVIGLLDVFTPASSLRNFYD FYLVMPFMQTDLQKIMGMEFSEEKIQYLVYQMLKGLKYIHSAGVVHRDLKPGNLAVNE DCELKILDFGLARHADAEMTGYVVTRWYRAPEVILSWMHYNQTVDIWSVGCIMAEMLT GKTLFKGKDYLDQLTQILKVTGVPGTEFVQKLNDKAAKSYIQSLPQTPRKDFTQLFPR ASPQAADLLEKMLELDVDKRLTAAQALTHPFFEPFRDPEEETEAQQPFDDSLEHEKLT VDEWKQHIYKEIVNFSPIARKDSRRRSGMKL" BASE COUNT 418 a 498 c 523 g 396 t 3 others ORIGIN 1 gcacgagcgc agccgccacg ccggggccgc cgagatcggg tgcccgggat gagcctcatc 61 cggaaaaagg gcttctacaa gcaggagctc aacaagaccg cctgggagct gcccaagacc 121 tacgtctccc cgacgcacgt cggcagcggg gcctatggct cctggtgctc ggccatcgac 181 aagcggtcag gggagaaggt ggccatcaag aagctgagcc gaccctttca gtccgagatt 241 ttcgccaagc gcgcctaccg ggagctgctg ctgctgaagc acatgcagca tgagaacgtc 301 attgggctcc tggatgtttt caccccagcc tcctccctgc gcaacttcta tgacttctac 361 ctggtgatgc ccttcatgca gacggatctg cagaagatca tggggatgga gttcagtgag 421 gagaagatcc agtacctggt gtatcagatg ctcaaaggcc ttaagtacat ccactctgct 481 ggggtcgtgc acagggacct gaagccaggc aacctggctg tgaatgagga ctgtgaactg 541 aagattctgg attttgggct ggcgcgacat gcagacgccg agatgactgg ctacgtggtg 601 acccgctggt accgagcccc cgaggtgatc ctcagctgga tgcactacaa ccagacagtg 661 gacatctggt ctgtgggctg tatcatggca gagatgctga cagggaaaac tctgttcaag 721 gggaaagatt acctggacca gctgacccag atcctgaaag tgaccggggt gcctggcacg 781 gagtttgtgc agaagctgaa cgacaaagcg gccaaatcct acatccagtc cctgccacag 841 acccccagga aggatttcac tcagctgttc ccacgggcca gcccccaggc tgcggacctg 901 ctggagaaga tgctggagct agacgtggac aagcgcctga cggccgcgca ggccctcacc 961 catcccttct ttgaaccctt ccgggaccct gaggaagaga cggaggccca gcagccgttt 1021 gatgattcct tagaacacga gaaactcaca gtggatgaat ggaagcagca catctacaag 1081 gagattgtga acttcagccc cattgcccgg aaggactcac ggcgccggag tggcatgaag 1141 ctgtagggac tcatcttgca tggcaccgcc ggccagacac tgcccaagga ccagtatttg 1201 tcactaccaa actcagccct tcttggaata cagcctttca agcagaggac agaagggtcc 1261 ttctccttat gtgggaaatg ggcctagtag atgcagaatt caaagatgtc ggttgggaga 1321 aactagctct gatcctaaca ggccacgtta aactgcccat ctggagaatc gcctgcaggt 1381 ggggcccttt ccttcccgcc agagtggggc tgagtgggcg ctgagccagg ccgggggcct 1441 atggcagtga tgctgtgttg gtttcctagg gatgctctaa cgaattacca caaacctggt 1501 ggattgaaac agcagaactt gattccctta cagttctgga ggctggaaat ytgggatgga 1561 ggtgttggca gggctgtggt ccctttgaag gctctgggga agaatccttc cttggctctt 1621 tttagcttgt ggcggcagtg ggcagtccgt ggcattcccc agcttattgc tgcatcactc 1681 cagtctctgt ctcttctgtt ctctcctctt ttaacaacag tcattggatt tagggcccac 1741 cctaatcctg tgtgatytta tyttgatcct tattaattaa acctgcaaat actctagttc 1801 caaataaagt cacattctca ggttccaggt ggacatga // LOCUS AF004715 2889 bp mRNA PRI 07-AUG-1997 DEFINITION Homo sapiens jerky gene product homolog mRNA, complete cds. ACCESSION AF004715 NID g2314828 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2889) AUTHORS Zeng,Z., Kyaw,H., Gakenheimer,K.R., Augustus,M., Fan,P., Zhang,X., Su,K., Carter,K.C. and Li,Y. TITLE Cloning, mapping, and tissue distribution of a human homologue of the mouse jerky gene product JOURNAL Biochem. Biophys. Res. Commun. 236 (2), 389-395 (1997) MEDLINE 97382443 REFERENCE 2 (bases 1 to 2889) AUTHORS Zeng,Z.Z., Kyaw,H., Gakenheimer,K.R., Augustus,M., Fan,P., Zhang,X.C., Su,K., Carter,K.C. and Li,Y. TITLE Direct Submission JOURNAL Submitted (19-MAY-1997) Protein Therapeutics, Human Genome Sciences, Inc., 9410 Key West Avenue, Rockville, MD 20850, USA FEATURES Location/Qualifiers source 1..2889 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 207..1535 /note="similar to Mus musculus jerky gene product encoded for by the sequence presented in the file with GenBank Accession Number U35730" /codon_start=1 /product="jerky gene product homolog" /db_xref="PID:g2314829" /translation="MLEWFNQQRAKGNPISGPICAKRAEFFFYALGMDGDFNPSAGWL TRFKQRHSIREINIRNERLNGDETAVEDFCNNFRDFIERENLQPEQIYNADETGLFWK CLPSRISVIKGKCTVPGHKSIEERVTIMCCANATGLHKLKLCVVGKAKKPRSFKSTDT LNLPVSYFSQKGAWMDLSIFRQWFDKIFVPQVREYLRSKGLQEKAVLLLDNSPTHPNE NVLRSDDGQIFAKYLPPNVASLIQPSDQGVIATMKRNYRAGLLQNNLEEGNDLKSFWK KLTLLDALYEIAMAWNLVKPVTISRAWKKILPMVEEKESLDFDVEDISVATVAAILQH TKGLENVTTENLEKWLEVDSTEPGYEVLTDSEIIRRAQGQADESSENEEEEIELIPEK HINHAAALQWTENLLDYLEQQGDMILPDRLVIRKLRATIRNKQKMTKSSQ" BASE COUNT 981 a 415 c 572 g 921 t ORIGIN 1 ccacgcgtcc gataataaag aaacttgaag acggaggttc ttccaaacaa ctggcagtga 61 tttatggaat tggtgaaaca acagttcggg atataagaaa aaataaggaa aagattataa 121 cttatgcaag cagttctgat tccacaagtc ttttggccaa gaggaaatct atgaagccat 181 ccatgtatga ggaattggac agggcaatgc tggaatggtt caaccagcaa agagcaaaag 241 ggaatcccat atctggacca atttgtgcaa aaagggcaga gttcttcttt tatgctttgg 301 gaatggatgg tgattttaac ccctctgccg gttggctaac tcgttttaag cagcggcaca 361 gcattagaga gattaacatt agaaatgaaa gattaaatgg agatgagact gcggtggaag 421 atttttgtaa taactttcga gattttattg aacgagagaa tttacagcct gaacaaatct 481 acaatgcaga tgaaactgga ctcttttgga agtgcttgcc ttctaggatt tcagtaatca 541 aaggtaaatg cactgtccct gggcacaaat caattgaaga aagagtcaca atcatgtgtt 601 gtgccaatgc aacaggttta cacaaactta aactttgtgt tgtggggaaa gcaaagaaac 661 ctcgctcctt taaatcaact gacaccttaa acctgccagt ctcttatttc agccaaaaag 721 gtgcatggat ggatctttcc attttccgac aatggtttga taaaattttt gtgccgcaag 781 ttcgagagta tttaagatct aaaggcttgc aggaaaaggc tgtgctcttg ttggataatt 841 caccaacaca tccaaatgaa aatgtcctaa ggtcagatga tggccaaata tttgctaaat 901 atttaccacc taatgtggcc tcattgattc agccttcaga tcagggagtc atagcaacga 961 tgaagagaaa ttatcgtgca ggtcttctcc agaacaactt ggaagaaggt aatgacctga 1021 aatcattctg gaagaagcta actctgttgg atgcacttta tgaaatagca atggcatgga 1081 acttagtaaa accagttacc attagcagag catggaagaa gattctccct atggtagagg 1141 agaaagagag cctggacttt gatgttgaag atatttctgt ggctactgtg gctgccattt 1201 tacaacacac caaaggattg gaaaatgtga ctactgagaa ccttgaaaaa tggcttgaag 1261 tagacagtac tgaaccaggc tatgaagtgt taactgatag cgaaatcatc agaagagcac 1321 aaggccaggc agatgaatcc agtgaaaatg aggaggagga aatagaacta attccagaga 1381 aacatattaa tcatgcagct gccctccagt ggactgaaaa tttattggat tatctagaac 1441 aacaaggtga tatgattcta cctgatagac tggtaatacg taaacttcga gccaccatca 1501 gaaataaaca gaagatgaca aagtcaagtc aataatgtca tttcaatttt attgttctgc 1561 tcattgtgtt tgtgacaaac tctttgcaat atggcttaat tttctttgtg ttctgaattc 1621 tcagacttgg tcctgtgaaa tacaggcaca aaatgtatct gaagtggttt gaggattatg 1681 tgttttcatc atctgtgtct tttgtccttt tatttgtaca gataatcaga agatgatact 1741 gaatagatat aaattacatg tacacatgta ttcacttttt agaatctgca attatacctt 1801 ctgtaacagt ggcattccct taattttcta gtgaaagtta gagataactg aacagactga 1861 agcacttttc tgaaatcttt tgcttgattt atgaaggctg ccatagttat cttttcttgt 1921 gttaaccatc ttaaatgatg ttttgtatat tttatagact gataggatga gaaagattta 1981 tattattaga tttcaggatg atttataata attcaaaaat gaaattcaat aatggggaaa 2041 taattatgaa ttatagaaat tatgccttca ttctcttaca tttgtgtggg ttgcaagagg 2101 gggagattat tctggaaatg aagtaaatat gggaagttat tgccaaatga gagagaaact 2161 atgggaaagc tgatctataa agaggcattc tgatcaattc atttgtagga aactgggaaa 2221 taaaaacctg gggaacttta ggttatttat acaaagggaa taaataggct gattttaatt 2281 tggtaagttg atctttttat tatgaatttg gtaatagtat aggtttatta tttattcatc 2341 taattttata gtacaggttt tgtaatgtta catgtgatga tatgagctcc caccttatat 2401 gggggaacat cttgggaatt tgagatttaa taagtttttt tttttttttt ttagtgtttt 2461 tactgcatac tcacaaatgt tgtctataat ttgaaaaata ttgtcatatc tggccctttg 2521 atgagaaaag gaaattacaa taataaagtt ttatgatttt aaataagtca tatgtttgta 2581 tcctgtttta tgaagaaagc agaaataatt actgaaagtg ccagacacta aggaatatta 2641 ttgtttatta ttttaataca tataaaaagg gattaatctg ctaaaatgta atctaaatca 2701 gaattttgat aaattttttt tgtaaactaa gtatgtttat tcaagacatt gaaactactt 2761 tgcacatatg aatattaatg taacttgtaa tttaaaagta aagtgtttcc atgctatttc 2821 atgttttggc caaaaatttt taaaaaataa attacaattg ttctctatta gtaaaaaaaa 2881 aaaaaaaaa // LOCUS AF004841 3979 bp mRNA PRI 18-SEP-1997 DEFINITION Homo sapiens CDO mRNA, complete cds. ACCESSION AF004841 NID g2406629 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3979) AUTHORS Kang,J.S., Gao,M., Feinleib,J.L., Cotter,P.D., Guadagno,S.N. and Krauss,R.S. TITLE CDO: an oncogene-, serum-, and anchorage-regulated member of the Ig/fibronectin type III repeat family JOURNAL J. Cell Biol. 138 (1), 203-213 (1997) MEDLINE 97362072 REFERENCE 2 (bases 1 to 3979) AUTHORS Krauss,R.S., Kang,J.S., Gao,M. and Feinleib,J.L. TITLE Direct Submission JOURNAL Submitted (20-MAY-1997) Biochemistry, Mount Sinai Medical Center, One Gustave L. Levy Place, New York, NY 10029, USA FEATURES Location/Qualifiers source 1..3979 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11q23-q24" CDS 1..3681 /note="immunoglobulin superfamily member; contains fibronectin type III-like domain" /codon_start=1 /product="CDO" /db_xref="PID:g2406630" /translation="MDLAPYFTSEPLSAVQKLGGPVVLHCSAQPVTTRISWLHNGKTL DGNLEHIKIHQGTLTILSLNSSLLGYYQCLANNSIGAIVSGPATVSVAVLGDFGSSTK HVITAEEKSAGFIGCRVPESNPKAEVRYKIRGKWLEHSTENYLILPSGNLQILNVSLE DKGSYKCAAYNPVTHQLKVEPIGRKLLVSRPSSDDVHILHPTHSQALAVLSRSPVTLE CVVSGVPAPQVYWLKDGQDIAPGSNWRRLYSHLATDSVDPADSGNYSCMAGNKSGDVE YVTYMVNVLEHASISKGLQDQIVSLGATVHFTCDVHGNPAPNCTWFHNAQPIHPSARH LTAGNGLKISGVTVEDVGMYQCVADNGIGFMHSTGRLEIENDGGFKPVIITAPVSAKV ADGDFVTLSCNASGLPVPVIRWYDSHGLITSHPSQVLRSKSRKSQLSRPEGLNLEPVY FVLSQAGASSLHIQAVTQEHAGKYICEAANEHGTTQAEASLMVVPFETNTKAETVTLP DAAQNDDRSKRDGSETGLLSSFPVKVHPSAVESAPEKNASGISVPDAPIILSPPQTHT PDTYNLVWRAGKDGGLPINAYFVKYRKLDDGVGMLGSWHTVRVPGSENELHLAELEPS SLYEVLMVARSAAGEGQPAMITFRTSKEKTASSKNTQASSPPVGIPKYPVVSEAANNN FGVVLTDSSRHSGVPEAPDRPTISTASETSVYVTWIPRANGGSPITAFKVEYKRMRTS NWLVAAEDIPPSKLSVEVRSLEPGSTYKFRVIAINHYGESFRSSASRPYQVVGFPNRF SSRPITGPHIAYTEAVSDTQIMLKWTYIPSSNNNTPIQGFYIYYRPTDSDNDSDYKRD VVEGSKQWHMIGHLQPETSYDIKMQCFNEGGESEFSNVMICETKVKRVPGASEYPVKD LSTPPNSLGSGGNVGPATSPARSSDMLYLIVGCVLGVMVLILMVFIAMCLWKNRQQNT IQKYDPPGYLYQGSDMNGQMVDYTTLSGASQINGNVHGGFLTNGGLSSGYSHLHHKVP NAVNGIVNGSLNGGLYSGHSNSLTRTHVDFEHPHHLVNGGGMYTAVPQIDPLECVNCR NCRNNNRCFTKTNSTFSSSPPPVVPVVAPYPQDGLEMKPLSHVKVPVCLTSAVPDCGQ LPEESVKDNVEPVPTQRTCCQDIVNDVSSDGSEDPAEFSRGDSCAHSETEINIVSWNA LILPPVPEAVLRRQCGLHLAFL" BASE COUNT 1081 a 948 c 943 g 1007 t ORIGIN 1 atggacttgg caccttattt tacttctgag ccgctctctg ctgtccagaa acttggtgga 61 cctgtagtac tgcattgttc tgctcaacct gtgaccactc gtatctcatg gctgcataac 121 ggaaaaacat tggatggaaa cctggaacat attaagattc atcaggggac tctgacaatt 181 ctttctctca actcctctct tttgggttac taccagtgcc ttgccaacaa tagcatcggt 241 gccattgtga gtggccctgc gacagtatct gtggcagttc ttggtgattt tggttcatcc 301 acaaagcatg ttattacagc agaagaaaaa agtgctggtt tcattggctg cagggtaccg 361 gagagtaacc ccaaagctga ggtgcgctat aaaatccggg gaaaatggct ggaacattcc 421 acagagaatt acttaatcct tccatcagga aatcttcaga ttttgaatgt atccttagag 481 gacaagggat catacaaatg tgcagcttat aatcctgtca cacatcaatt aaaagttgaa 541 cctattggcc gaaagctcct tgtgagtcgt ccttcttcag atgatgttca cattcttcac 601 cccacccatt cacaggcatt agctgttctt tctcgtagcc ctgtaacctt ggagtgtgtg 661 gtgagtgggg tcccggctcc tcaagtgtat tggctaaagg acgggcagga cattgcacca 721 ggaagcaact ggagaaggtt gtattctcat cttgccactg atagcgttga cccggcggac 781 tccggaaact attcctgcat ggcgggaaac aagtctggag atgtagaata tgtgacttac 841 atggttaatg tacttgaaca tgcttccatt tctaaaggac tacaggatca gatagtgtct 901 ctgggtgcca cagtacactt tacctgcgac gttcatggga acccagcccc caactgtacc 961 tggtttcaca atgcacagcc tattcatcct tctgcacgac atctaactgc aggaaacgga 1021 ctgaaaatca gtggggttac tgtggaagat gttgggatgt atcagtgtgt agcagataat 1081 gggattggat ttatgcactc tactggaaga cttgaaattg aaaatgacgg tggattcaag 1141 ccagttataa ttacggcacc agtaagtgca aaggttgcag acggagactt tgttactctg 1201 tcctgcaatg ccagtgggct gccggttccg gtcattcgtt ggtatgacag ccatggattg 1261 ataaccagcc atccatctca agtcctgaga tcgaaatccc gaaaatcaca gttatcaaga 1321 cctgagggct tgaacctgga gcctgtgtac ttcgtcctgt cccaagctgg tgcaagctct 1381 ctccatattc aggctgtgac tcaggaacat gcggggaaat acatctgcga agctgcaaat 1441 gaacatggta ccacacaggc agaagcatct ctcatggttg ttccttttga aacaaataca 1501 aaagcagaga cagtcacact tcctgatgct gctcagaatg atgacagaag taagagagat 1561 ggttcagaaa ctgggttact gagctcattt ccggtgaagg tccatcccag tgcagtggaa 1621 tcagcaccag agaaaaacgc cagcggcatc tctgttcctg atgcccccat catactgagc 1681 cccccacaga cccacacacc agacacgtac aacctggtgt ggagggcagg caaggatggt 1741 gggctgccca tcaatgctta ctttgtgaag tatcgaaagc tggatgatgg ggttggcatg 1801 ctgggaagct ggcacacggt tcgagtccca ggaagtgaaa atgagctcca tttagctgag 1861 ctggagccat ctagtcttta tgaagtcttg atggtagcaa gaagcgcagc aggtgaaggc 1921 caacctgcca tgattacctt ccgaaccagc aaagaaaaaa cagcgtcatc aaaaaacacc 1981 caggcatcct ctccacccgt gggcatccct aagtatcccg ttgtttcaga ggctgcaaac 2041 aacaattttg gagtggtact tacagattcc tctaggcaca gtggagttcc agaggcacca 2101 gatcggccta ccatctccac tgcatcagag acatcagtct atgtcacttg gattcctcgg 2161 gcaaacgggg gttctccaat cactgccttc aaagtcgaat ataaacggat gaggaccagc 2221 aattggctgg tggcagctga agacatccct ccttccaaac tttcagtgga agttcgtagt 2281 ttagaaccag gttcaacata caaatttagg gtcattgcca tcaaccatta tggtgagagt 2341 tttcggagtt cagcatctcg tccttatcaa gtggttgggt tccccaatcg cttttccagc 2401 cgtccaataa ctggacctca cattgcatac acagaggctg tcagcgatac tcagatcatg 2461 ctaaagtgga cgtacattcc atcaagtaac aataacactc ccattcaagg attttatatc 2521 tattaccgac caacagatag tgacaatgac agtgattaca agagggatgt tgtagaaggt 2581 tcaaagcagt ggcacatgat tggccacctg cagccagaaa cctcctatga cattaaaatg 2641 caatgcttca atgaaggagg agaaagtgaa tttagcaatg tgatgatctg cgagactaaa 2701 gtgaaacgtg ttcctggagc ttctgaatat cctgtcaaag acttgagtac ccctccaaat 2761 tctttgggaa gtggaggaaa tgtggggcct gcaaccagcc ctgccagaag cagtgacatg 2821 ttatatctga tcgttggctg tgtgctgggc gtcatggtcc tcattctgat ggttttcatt 2881 gcaatgtgcc tgtggaagaa tcgccagcag aataccatac aaaaatatga cccaccagga 2941 tatctctacc aaggatcaga tatgaacggg cagatggtgg actacaccac tctctcagga 3001 gcaagtcaga taaatggaaa tgttcacgga ggcttcctaa ccaatggcgg tctcagcagt 3061 ggctattccc accttcacca taaggtcccc aatgcagtca atggaattgt gaatgggagc 3121 ctaaatggag ggctttactc cgggcacagc aactctctaa ccaggacaca cgtggatttt 3181 gaacatcctc atcatctagt gaatggtggt ggaatgtaca cggccgtgcc tcagattgac 3241 cctctggagt gtgttaactg ccgaaattgt cgaaacaaca ataggtgttt caccaaaacc 3301 aacagcactt tcagcagcag ccctcctcct gtggtccctg tggtagcacc ttatcctcag 3361 gatggtttgg aaatgaagcc cctcagtcac gtgaaggtgc ctgtatgcct gacttccgca 3421 gtccctgatt gtggccagtt gccggaggag agcgtcaagg acaatgtgga accagtccct 3481 actcagcgta cctgctgtca ggacattgta aatgacgtca gctctgatgg ctcagaagat 3541 ccagcagagt tcagcagagg agacagctgt gcccattcag aaacagagat caacattgta 3601 agttggaatg ctcttatttt gccacctgtc cccgaggctg tgctgagaag acaatgtggt 3661 ctccacctgg cattccttta gacagcccga cagaggtcct tcagcagccc cgggaaacct 3721 gagacatgca acaaccagtc atgttccaac ttcaagccgg taacacacaa caggctggga 3781 gcgaactgtg tgaaggacct taattcaaat cagagaaaat cattatttat ttttttgtag 3841 tagtaatgtc atatgaatgt atcttaaaac gtgtgccctt ttatattatt tatgccttaa 3901 atgttttctt ccccattcct tcctccccct cggtaggaaa caaccttgtt ttgcatagta 3961 ttcagtcacc tggagggca // LOCUS AF004849 3657 bp mRNA PRI 02-DEC-1997 DEFINITION Homo sapiens serine/threonine protein kinase mRNA, complete cds. ACCESSION AF004849 NID g2627330 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3657) AUTHORS Begley,D.A., Berkenpas,M.B., Sampson,K.E. and Abraham,I. TITLE Identification and sequence of human PKY, a putative kinase with increased expression in multidrug-resistant cells, with homology to yeast protein kinase Yak1 JOURNAL Gene 200 (1-2), 35-43 (1997) MEDLINE 98038974 REFERENCE 2 (bases 1 to 3657) AUTHORS Begley,D.A., Berkenpas,M.B. and Abraham,I. TITLE Direct Submission JOURNAL Submitted (20-MAY-1997) Cell Biology and Inflammation Research, Pharmacia & Upjohn, 301 Henrietta Street, Kalamazoo, MI 49007, USA FEATURES Location/Qualifiers source 1..3657 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KB-V1" CDS 10..3657 /note="PKY" /codon_start=1 /product="serine/threonine protein kinase" /db_xref="PID:g2627331" /translation="MASQVLVYPPYVYQTQSSAFCSVKKLKVEPSSCVFQERNYPRTY VNGRNFGNSHPPTKGSAFQTKIPFNRPRGHNFSLQTSAVVLKNTAGATKVIAAQAQQA HVQAPQIGVWRNRLHFLEGPQRCGLKRKSEELDNHSSAMQIVDELSILPAMLQTNMGN PVTVVTATTGSKQNCTTGEGDYQLVQHEVLCSMKNTYEVLDFLGRGTFGQVVKCWKRG TNEIVAIKILKNHPSYARQGQIEVSILARLSTENADEYNFVRAYECFQHRNHTCLVFE MLEQNLYDFLKQNKFSPLPLKVIRPILQQVATALKKLKSLGLIHADLKPENIMLVDPV RQPYRVKVIDFGSASHVSKTVCSTYLQSRYYRAPEIILGLPFCEAIDMWSLGCVIAEL FLGWPLYPGALEYDQIRYISQTQGLPGEQLLNVGTKSTRFFCKETDMSHSGWRLKTLE EHEAETGMKSKEARKYIFNSLDDVAHVNTVMDLEGSDLLAEKADRREFVSLLKKMLLI DADLRITPAETLNHPFVNMKHLLDFPHSNHVKSCFHIMDICKSHLNSCDTNNHNKTSL LRPVASSSTATLTANFTKIGTLRSQALTTSAHSVVHHGIPLQAGTAQFGCGDAFQQTL IICPPAIQGIPATHGKPTSYSIRVDNTVPLVTQAPAVQPLQIRPGVLSQTWSGRTQQM LVPAWQQVTPLAPATTTLTSESVAGSHRLGDWGKMISCSNHYNSVMPQPLLTNQITLS APQPVSVGIAHVVWPQPATTKKNKQCQNRGILVKLMEWEPGREEINAFSWSNSLQNTN IPHSAFISPKIINGKDVEEVSCIETQDNQNSEGEARNCCETSIRQDSDSSVSDKQRQT IIIADSPSPAVSVITISSDTDEEETSQRHSLRECKGSLDCEACQSTLNIDRMCSLSSP DSTLSTSSSGQSSPSPCKRPNSMSDEEQESSCDTVDGSPTSDSSGHDSPFAESTFVED THENTELVSSADTETKPAVCSVVVPPVELENGLNADEHMANTDSICQPLIKGRSAPGR LNQPSAVGTRQQKLTSAFQQQHLNFSQVQHFGSGHQEWNGNFGHRRQQAYIPTSVTSN PFTLSHGSPNHTAVHAHLAGNTHLGGQPTLLPYPSSATLSSAAPVAHLLASPCTSRPM LQHPTYNISHPSGIVHQVPVGLNPRLLPSPTIHQTQYKPIFPPHSYIAASPAYTGFPL SPTKLSQYPYM" BASE COUNT 1103 a 803 c 792 g 959 t ORIGIN 1 taggaaggta tggcctcaca agtcttggtc tacccaccat atgtttatca aactcagtca 61 agtgcctttt gtagtgtgaa gaaactcaaa gtagagccaa gcagttgtgt attccaggaa 121 agaaactatc cacggaccta tgtgaatggt agaaactttg gaaattctca tcctcccact 181 aagggtagtg cttttcagac aaagatacca tttaatagac ctcgaggaca caacttttca 241 ttgcagacaa gtgctgttgt tttgaaaaac actgcaggtg ctacaaaggt catagcagct 301 caggcacagc aagctcacgt gcaggcacct cagattgggg tgtggcgaaa cagattgcat 361 ttcctagaag gcccccagcg atgtggattg aagcgcaaga gtgaggagtt ggataatcat 421 agcagcgcaa tgcagattgt cgatgaattg tccatacttc ctgcaatgtt gcaaaccaac 481 atgggaaatc cagtgacagt tgtgacagct accacaggat caaaacagaa ttgtaccact 541 ggagaaggtg actatcagtt agtacagcat gaagtcttat gctccatgaa aaatacttac 601 gaagtccttg attttcttgg tcgaggcacg tttggccagg tagttaaatg ctggaaaaga 661 gggacaaatg aaattgtagc aatcaaaatt ttgaagaatc atccttctta tgcccgtcaa 721 ggtcaaatag aagtgagcat attagcaagg ctcagtactg aaaatgctga tgaatataac 781 tttgtacgag cttatgaatg ctttcagcac cgtaaccata cttgtttagt ctttgagatg 841 ctggaacaaa acttgtatga ctttctgaaa caaaataaat ttagtcccct gccactaaaa 901 gtgattcggc ccattcttca acaagtggcc actgcactga aaaaattgaa aagtcttggt 961 ttaattcatg ctgatctcaa gccagagaat attatgttgg tggatcctgt tcggcagcct 1021 tacagggtta aagtaataga ctttgggtcg gccagtcatg tatcaaagac tgtttgttca 1081 acatatctac aatctcggta ctacagagct ccagagatta tattggggtt gccattttgt 1141 gaagccatag acatgtggtc attgggatgt gtgattgcag aattatttct tggatggccg 1201 ctctacccag gagccttgga gtatgatcag attcgataca tttctcagac tcaaggtttg 1261 ccaggagaac agttgttaaa tgtgggtact aaatccacaa gatttttttg caaagaaaca 1321 gatatgtctc attctggttg gagattaaag acattggaag agcatgaggc agagacagga 1381 atgaagtcta aagaagccag aaaatacatt ttcaacagtc tggatgatgt agcgcatgtg 1441 aacacagtga tggatttgga aggaagtgat cttttggctg agaaagctga tagaagagaa 1501 tttgttagtc tgttgaagaa aatgttgctg attgatgcag atttaagaat tactccagct 1561 gagaccctga accatccttt tgttaatatg aaacatcttc tagatttccc tcatagcaac 1621 catgtaaagt cctgttttca tattatggat atttgtaagt cccacctaaa ttcatgtgac 1681 acaaataatc acaacaaaac ttcactttta agaccagttg cttcaagcag tactgctaca 1741 ctgactgcaa attttactaa aatcggaaca ttaagaagtc aggcattgac cacatctgct 1801 cattcagttg tgcaccatgg aatacctctg caggcaggaa ctgctcagtt tggttgtggt 1861 gatgcttttc agcagacatt gattatctgt cccccagcta ttcaaggtat tcctgcaaca 1921 catggtaaac ccaccagtta ttcaataagg gtagataata cagttccact tgtaactcag 1981 gccccagctg tgcagccact acagatccga ccaggagttc tttctcagac gtggtctggt 2041 agaacacagc agatgctggt gcctgcctgg caacaggtga cacccctggc tcctgctact 2101 actacactaa cttctgagag tgtggctggt tcacacaggc ttggagactg ggggaagatg 2161 atttcatgca gcaatcatta taactcagtg atgccgcagc ctcttctgac caatcagata 2221 actttatctg cccctcagcc agttagtgtg gggattgcac atgttgtctg gcctcagcct 2281 gccactacca agaaaaataa acagtgccag aacagaggta ttttggtaaa actaatggaa 2341 tgggagccag gaagagagga aataaatgct ttcagttgga gtaattcatt acagaatacc 2401 aatatcccac attcagcatt tatttctcca aagataatta atgggaaaga tgtcgaggaa 2461 gtaagttgta tagaaacaca ggacaatcag aactcagaag gagaggcaag aaattgctgt 2521 gaaacatcta tcagacagga ctctgattca tcagtttcag acaaacagcg gcaaaccatc 2581 attattgccg actccccgag tcctgcagtg agtgtcatca ctatcagcag tgacactgat 2641 gaggaagaga cttcccagag acattcactc agagaatgta aaggtagtct agattgtgaa 2701 gcttgccaga gcactttgaa tattgatcgg atgtgttcat taagtagtcc tgatagtact 2761 ctgagtacca gctcctcagg gcagtccagc ccatccccct gcaagagacc gaatagtatg 2821 tcagatgaag agcaagaaag tagttgtgat acggtggatg gctctccgac atctgactct 2881 tccgggcatg acagtccatt tgcagagagc acttttgtgg aggacactca tgaaaacaca 2941 gaattggtat cctctgctga cacagaaacc aagccagctg tctgttctgt tgtggtgcca 3001 ccagtggaac tagaaaatgg cttaaatgcc gatgagcata tggcaaacac agattctata 3061 tgccagccat taataaaagg acgatctgcc cctggaagat taaaccagcc ttctgcagtg 3121 ggtactcgtc agcaaaaatt gacatcagca ttccagcagc agcatttgaa cttcagtcag 3181 gttcagcact ttggatctgg gcatcaagag tggaatggaa actttgggca cagaagacag 3241 caagcttata ttcctactag tgttaccagt aatccattca ctctttctca tggaagtccc 3301 aatcacacag cagtgcatgc ccacctggct ggaaatacac acctcggagg acagcctact 3361 ctacttccat acccatcatc agccaccctc agtagtgctg caccagtggc ccacctgtta 3421 gcctctccgt gtacctcaag acctatgtta cagcatccaa cttataatat ctcccatccc 3481 agtggcatag ttcaccaagt cccagtgggc ttaaatcccc gtctgttacc atccccaacc 3541 attcatcaga ctcagtacaa accaatcttc ccaccacatt cttacattgc agcatcacct 3601 gcatatactg gatttccact gagtccaaca aaactcagcc agtatccata tatgtga // LOCUS AF004900 1524 bp mRNA PRI 18-JUN-1997 DEFINITION Homo sapiens NHE3 kinase A regulatory protein E3KARP mRNA, complete cds. ACCESSION AF004900 NID g2198848 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1524) AUTHORS Yun,C.H., Oh,S., Zizak,M., Steplock,D., Tsao,S., Tse,C.M., Weinman,E.J. and Donowitz,M. TITLE cAMP-mediated inhibition of the epithelial brush border Na+/H+ exchanger, NHE3, requires an associated regulatory protein JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (7), 3010-3015 (1997) MEDLINE 97250481 REFERENCE 2 (bases 1 to 1524) AUTHORS Yun,C.H. TITLE Direct Submission JOURNAL Submitted (20-MAY-1997) Medicine, Johns Hopkins School of Medicine, GI Unit, 918 Ross Building, 720 Rutland Avenue, Baltimore, MD 21205, USA FEATURES Location/Qualifiers source 1..1524 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 17..1030 /note="NHE3 associated regulatory protein; NHE3 kinase A regulatory protein" /codon_start=1 /product="E3KARP" /db_xref="PID:g2198849" /translation="MAAPEPLRPRLCRLVRGEQGYGFHLHGEKGRRGQFIRRVEPGSP AEAAALRAGDRLVEVNGVNVEGETHHQVVQRIKAVEGQTRLLVVDQETDEELRRRQLT CTEEMAQRGLPPAHDPWEPKPDWAHTGSHSSEAGKKDVSGPLRELRPRLCHLRKGPQG YGFNLHSDKSRPGQYIRSVDPGSPAARSGLRAQDRLIEVNGQNVEGLRHAEVVASIKA REDEARLLVVDPETDEHFKRLRVTPTEEHVEGPLPSPVTNGTSPAQLNGGSACSSRSD LPGSDKDTEDGSAWKQDPFQESGLHLSPTAAEAKEKARAMRVNKRAPQMDWNRKREIF SNF" BASE COUNT 305 a 498 c 495 g 226 t ORIGIN 1 gtgggcagcg ggcgccatgg ccgcgccgga gccgctgcgg ccgcgcctgt gccgcttggt 61 gcgcggagag cagggctacg gcttccacct gcacggcgag aagggccgcc gcgggcagtt 121 catccggcgc gtggaacccg gttcccccgc cgaggccgcc gcgctgcgcg ctggggaccg 181 cctggtcgag gtcaacggcg tcaacgtgga gggcgagacg caccaccagg tggtgcaaag 241 gatcaaggct gtggaggggc agactcggct gctggtggtg gaccaggaga cagatgagga 301 gctccgccgg cggcagctga cctgtaccga ggagatggcc cagcgagggc tcccacccgc 361 ccacgacccc tgggagccga agccagactg ggcacacacc ggcagccaca gctccgaagc 421 tggcaagaag gatgtcagtg ggcccctgag ggagctgcgc cctcggctct gccacctgcg 481 aaagggacct cagggctatg ggttcaacct gcatagtgac aagtcccggc ccggccagta 541 catccgctct gtggacccgg gctcacctgc cgcccgctct ggcctccgcg cccaggaccg 601 gctcattgag gtgaacgggc agaatgtgga gggactgcgc catgctgagg tggtggccag 661 catcaaggca cgggaggacg aggcccggct gctggtcgtg gaccccgaga cagatgaaca 721 cttcaagcgg cttcgggtca cacccaccga ggagcacgtg gaaggtcctc tgccgtcacc 781 cgtcaccaat ggaaccagcc ctgcccagct caatggtggc tctgcgtgct catcccgaag 841 tgacctgcct ggttccgaca aggacactga ggatggcagt gcctggaagc aagatccctt 901 ccaggagagc ggcctccacc tgagccccac ggcggccgag gccaaggaga aggctcgagc 961 catgcgagtc aacaagcgcg cgccacagat ggactggaac aggaagcgtg aaatcttcag 1021 caacttctga gccccttcct gcctgtctcg ggaccctggg acccctcccg cacggacctt 1081 gggcctcagc ctgccccgag ctcccccagc ctcagtggac tggagggtgg tcctgccatt 1141 gcccagaaat cagccccagc cccggtgagc ccccatcctg cccctgccca ccaggtactg 1201 ggggcctgtg gcagcaagat agggggagag agacccagag atgtgagaga gagtcagaga 1261 cagagacaga gagagagaga gagagacaca gagagagaca gagagagagc gagcgagcgc 1321 gcggcagccg cggggcgagg gcctttgctg ctctgccggg gcctgctgac tgaaaggaat 1381 ttgtgttttt gctttttttc caaaaagatc tccagctcca cacatgtttc cacttaatac 1441 cagagacccc ccccttcccc tcccccttcc cctccccctt gggacgcgct ctaaataatt 1501 gcaataaaac aaacctttct ctgc // LOCUS AF005037 1139 bp mRNA PRI 10-JUL-1997 DEFINITION Homo sapiens secretory carrier membrane protein (SCAMP1) mRNA, complete cds. ACCESSION AF005037 NID g2232238 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1139) AUTHORS Singleton,D.R., Wu,T.T. and Castle,J.D. TITLE Three mammalian SCAMPs (secretory carrier membrane proteins) are highly related products of distinct genes having similar subcellular distributions JOURNAL J. Cell. Sci. (1997) In press REFERENCE 2 (bases 1 to 1139) AUTHORS Singleton,D.R., Wu,T.T. and Castle,J.D. TITLE Direct Submission JOURNAL Submitted (22-MAY-1997) Cell Biology, University of Virginia, Box 439 Health Sciences Center, 1300 Jefferson Park Ave, Charlottesville, VA 22903, USA FEATURES Location/Qualifiers source 1..1139 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa S3" /clone_lib="Stratagene #937216" gene 1..1139 /gene="SCAMP1" CDS 44..1060 /gene="SCAMP1" /note="similar to Rattus norvegicus SCAMP37, PIR Accession Number S37395" /codon_start=1 /product="secretory carrier membrane protein" /db_xref="PID:g2232239" /translation="MSDFDSNPFADPDLNNPFKDPSVTQVTRNVPPGLDEYNPFSDSR TPPPGGVKMPNVPNTQPAIMKPTEEHPAYTQIAKEHALAQAELLKRQEELERKAAELD RREREMQNLSQHGRKNNWPPLPSNFPVGPCFYQDFSVDIPVEFQKTVKLMYYLWMFHA VTLFLNIFGCLAWFCVDSARAVDFGLSILWFLLFTPCSFVCWYRPLYGAFRSDSSFRF FVFFFVYICQFAVHVLQAAGFHNWGNCGLISSLTGLNQNIPVGIMMIIIAALFTASAV ISLVMFKKVHGLYRTTGASFEKAQQEFATGVMSNKTVQTAAANAASTAASSAAQNAFK GNQI" BASE COUNT 324 a 249 c 231 g 335 t ORIGIN 1 aactagtgga tccccgggct gcaggaattc ggcacgagga gagatgtcgg atttcgacag 61 taacccgttt gccgacccgg atctcaacaa tcccttcaag gatccatcag ttacacaagt 121 gacaagaaat gttccaccag gacttgatga atataatcca ttctcggatt ctagaacacc 181 tccaccaggc ggtgtgaaga tgcctaatgt acccaataca caaccagcaa taatgaaacc 241 aacagaggaa catccagctt atacacagat tgcaaaggaa catgcattgg cccaagctga 301 acttcttaag cgccaggaag agctagaaag aaaagccgca gaattagatc gtcgggaacg 361 agaaatgcaa aacctcagtc aacatggtag aaaaaataat tggccacctc ttcctagcaa 421 ttttcctgtc ggaccttgtt tctatcagga tttttctgta gacattcctg tagaattcca 481 aaagacagta aagcttatgt actacttgtg gatgttccat gcagtaacac tgtttctaaa 541 tatcttcgga tgcttggctt ggttttgtgt tgattctgca agagcggttg attttggatt 601 gagtatcctg tggttcttgc tttttactcc ttgttcattt gtctgttggt acagaccact 661 ttatggagct ttcaggagtg acagttcatt tagattcttt gtattcttct tcgtctatat 721 ttgtcagttt gctgtacatg tactccaagc tgcaggattt cataactggg gcaattgtgg 781 gttgatttca tcccttactg gtctcaacca aaatattcct gttggaatca tgatgataat 841 catagcagca cttttcacag catcagcagt catctcacta gttatgttca aaaaagtaca 901 tggactatat cgcacaacag gtgctagttt tgagaaggcc caacaggagt ttgcaacagg 961 tgtgatgtcc aacaaaactg tccagaccgc agctgcaaat gcagcttcaa ctgcagcatc 1021 tagtgcagct cagaatgctt tcaagggtaa ccagatttaa gaatcttcaa acaatgacga 1081 ctgttacctt tttgcactgt acctttttcc tccagttact gtattctaca aatattttt // LOCUS AF005038 1243 bp mRNA PRI 10-JUL-1997 DEFINITION Homo sapiens secretory carrier membrane protein (SCAMP2) mRNA, complete cds. ACCESSION AF005038 NID g2232240 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1243) AUTHORS Singleton,D.R., Wu,T.T. and Castle,J.D. TITLE Three mammalian SCAMPs (secretory carrier membrane proteins) are highly related products of distinct genes having similar subcellular distributions JOURNAL J. Cell. Sci. (1997) In press REFERENCE 2 (bases 1 to 1243) AUTHORS Singleton,D.R., Wu,T.T. and Castle,J.D. TITLE Direct Submission JOURNAL Submitted (22-MAY-1997) Cell Biology, University of Virginia, Box 439 Health Sciences Center, 1300 Jefferson Park Ave, Charlottesville, VA 22903, USA FEATURES Location/Qualifiers source 1..1243 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa S3" /clone_lib="Stratagene #937216" gene 1..1243 /gene="SCAMP2" CDS 30..1016 /gene="SCAMP2" /note="similar to Rattus norvegicus SCAMP37, PIR Accession Number S37395" /codon_start=1 /product="secretory carrier membrane protein" /db_xref="PID:g2232241" /translation="MSAFDTNPFADPVDVNPFQDPSVTQLTNAPQALAEFNPFSETNA ATTVPVTQLPGSSQPAVLQPSVEPTQPTPQAVVSAAQAGLLRQQEELDRKAAELERKE RELQNTVANLHVRQNNWPPLPSWCPVKPCFYQDFSTEIPADYQRICKMLYYLWMLHSV TLFLNLLACLAWFSGNSSKGVDFGLSILWFLIFTPCAFLCWYRPIYKAFRSDNSFSFF VFFFVFFCQIGIYIIQLVGIPGLGDSGWIAALSTLDNHSLAISVIMMVVAGFFTLCAV LSVFLLQRVHSLYRRTGASFQQAQEEFSQGIFSSRTFHRAASSAAQGAFQGN" BASE COUNT 227 a 402 c 304 g 310 t ORIGIN 1 gcggagttcg ccgctggccc ccgatcacca tgtcggcttt cgacaccaac cccttcgcgg 61 acccagtgga tgtaaacccc ttccaggatc cctctgtgac ccagctgacc aacgccccgc 121 aggcgctggc ggaattcaac cccttctcag agacaaatgc agcgacaaca gttcctgtca 181 cccaactccc tgggtcctca cagccagcgg ttctccagcc atcagtggaa ccaacccagc 241 cgacccccca ggccgtggtg tctgcagccc aggcaggcct gctccggcag caggaagaac 301 tggacaggaa agctgccgag ctggaacgca aggagcggga gctgcagaac actgtagcca 361 acttgcatgt gagacagaac aactggcccc ctctgccctc gtggtgccct gtgaagccct 421 gcttctatca ggatttctcc acagagatcc ctgccgacta ccagcggata tgcaagatgc 481 tctactatct gtggatgttg cattcagtga ctctgtttct gaacctgctt gcctgcctgg 541 cctggttctc gggcaacagc tccaagggag tggactttgg cctctccatc ctgtggtttc 601 tgatcttcac tccctgtgcc ttcctttgtt ggtaccgacc catctataag gcctttaggt 661 ccgacaactc tttcagcttc tttgtgttct tctttgtatt tttttgtcaa atagggatct 721 acatcatcca gttggttggc atccctggcc tgggggacag cggttggatt gcagccctgt 781 ctacactgga taatcattcc ctggccatat cagtcatcat gatggtggtg gctggcttct 841 tcaccctctg tgccgtgctc tcagtcttcc tcctgcagcg ggtgcactcc ctctaccgac 901 ggacaggggc cagcttccag caggcccagg aggagttttc ccagggcatc ttcagcagca 961 gaaccttcca cagagctgct tcatctgctg cccaaggagc cttccagggg aattagtcct 1021 cctctcttct ctccccctca gcctttctct cgcctgcctt ctgagctgca ctttccgtgg 1081 gtgccttatg tggtggtggt tgtgcccagc acagacctgg cagggttctt gccgtggctc 1141 ttcctcctcc ctcagcgacc agctctccct ggaacgggag ggacagggaa ttttttcccc 1201 ctctatgtac aaaaaaaaac aaagctctct ttccttctct ggt // LOCUS AF005039 1433 bp mRNA PRI 10-JUL-1997 DEFINITION Homo sapiens secretory carrier membrane protein (SCAMP3) mRNA, complete cds. ACCESSION AF005039 NID g2232242 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1433) AUTHORS Singleton,D.R., Wu,T.T. and Castle,J.D. TITLE Three mammalian SCAMPs (secretory carrier membrane proteins) are highly related products of distinct genes having similar subcellular distributions JOURNAL J. Cell. Sci. (1997) In press REFERENCE 2 (bases 1 to 1433) AUTHORS Singleton,D.R., Wu,T.T. and Castle,J.D. TITLE Direct Submission JOURNAL Submitted (22-MAY-1997) Cell Biology, University of Virginia, Box 439 Health Sciences Center, 1300 Jefferson Park Ave, Charlottesville, VA 22903, USA FEATURES Location/Qualifiers source 1..1433 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa S3" /clone_lib="Stratagene #937216" gene 1..1433 /gene="SCAMP3" CDS 97..1140 /gene="SCAMP3" /note="similar to Rattus norvegicus SCAMP37, PIR Accession Number S37395" /codon_start=1 /product="secretory carrier membrane protein" /db_xref="PID:g2232243" /translation="MARSRDGGNPFAEPSELDNPFQDPAVIQHRPSRQYATLDVYNPF ETREPPPAYEPPAPAPLPPPSAPSLQPSRMLSPTEPKNYGSYSTQASAAAATAELLKK QEELNRKAEELDRRERELQHAALGGTATRQNNWPPLPSFCPVQPCFFQDISMEIPQEF QKTVSTMYYLWMCSTLALLLNFLACLASFCVETNNGAGFGLSILWVLLFTPCSFVCWY RPMYKAFRSDSSFNFFVFFFIFFVQDVLFVLQAIGIPGWGFSGWISALVVPKGNTAVS VLMLLVALLFTGIAVLGIVMLKRIHSLYRRTGASFQKAQQEFAAGVFSNPAVRTAAAN AAAGAAENAFRAP" BASE COUNT 282 a 455 c 370 g 325 t 1 others ORIGIN 1 tgaggcgagt gaagtggact ctgagggcta ccgctaccgc cactgctgcg gcaggggcgt 61 ggagggcaga gggccgcgga ggccgcagtt gcaaacatgg ctcggagcag agacggcgga 121 aacccgttcg ccgagcccag cgagcttgac aacccctttc aggacccagc tgtgatccag 181 caccgaccca gccggcagta tgccacgctt gacgtctaca acccttttga gacccgggag 241 ccaccaccag cctatgagcc tccagcccct gccccattgc ctccaccctc agctccctcc 301 ttgcagccct cgagaatgct cagccccaca gaacctaaga actatggctc atacagcact 361 caggcctcag ctgcagcagc cacagctgag ctgctgaaga aacaggagga gctcaaccgg 421 aaggcagagg agttggaccg aagggagcga gagctgcagc atgctgccct gggaggcaca 481 gctactcgac agaacaattg gccccctcta ccttcttttt gtccagttca gccctgcttt 541 ttccaggaca tctccatgga gatcccccaa gaatttcaga agactgtatc caccatgtac 601 tacctctgga tgtgcagcac gctggctctt ctcctgaact tcctcgcctg cctggccagc 661 ttctgtgtgg aaaccaacaa tggcgcaggc tttgggcttt ctatcctctg ggtcctcctt 721 ttcactccct gctcctttgt ctgctggtac cgccccatgt ataaggcttt ccggagtgac 781 agttcattca atttcttcgt tttcttcttc attttcttcg tccaggatgt gctctttgtc 841 ctccaggcca ttggtatccc aggttgggga ttcagtggct ggatctctgc tctggtggtg 901 ccgaagggca acacagcagt atccgtgctc atgctgctgg tcgccctgct cttcactggc 961 attgctgtgc taggaattgt catgctgaaa cggatccact ccttataccg ccgcacaggt 1021 gccagctttc agaaggccca gcaagaattt gctgctggtg tcttctccaa ccctgcggtg 1081 cgaaccgcag ctgccaatgc agccgctggg gctgctgaaa atgccttccg ggccccgtga 1141 cccctgactg ggatgccctg gccctgctac ttgagggagc tgacttagct cccggcccta 1201 aggtctytgg gacttggaga gacatcacta actgatggct cctccgtagt gctcccaatc 1261 ctatggccat gactgctgaa cctgacaggc gtgtggggag ttcactgtga cctagtcccc 1321 ccatcaggcc acactgctgc cacctctcac acgccccaac ccagcttccc tctgctgtgc 1381 cacggctgtt gcttcggtta tttaaataaa aagaaagtgg aactggaact gac // LOCUS AF005043 4069 bp mRNA PRI 25-JUN-1997 DEFINITION Homo sapiens poly(ADP-ribose) glycohydrolase (hPARG) mRNA, complete cds. ACCESSION AF005043 NID g2213921 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4069) AUTHORS Ame,J.-C. and Jacobson,M.K. TITLE Isolation and characterization of the cDNA encoding human poly(ADP-ribose) glycohydrolase JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 4069) AUTHORS Ame,J.-C. and Jacobson,M.K. TITLE Direct Submission JOURNAL Submitted (21-MAY-1997) College of Pharmacy, University of Kentucky, 800 Rose Street, Lexington, KY 40502, USA FEATURES Location/Qualifiers source 1..4069 /organism="Homo sapiens" /db_xref="taxon:9606" gene 167..3097 /gene="hPARG" CDS 167..3097 /gene="hPARG" /codon_start=1 /product="poly(ADP-ribose) glycohydrolase" /db_xref="PID:g2213922" /translation="MNAGPGCEPCTKATRWGAATTSPAASDARSFPSRQRRVLDPKDA HVQFRVPPSSPACVPGQAGQHRGSATSLVFKQKTITSWMDTKGIKTAESESLDSKENN NTRIESMMSSVQKDNFYQHNVEKLVNVSQLSLDKSLTEKSTQYLNQHQTAAMCKWQNE GKHTEQLLESEPQTVTLVPEQFSNANIDRSPQNDDHSDTDSEENRDNQQFLTTVKLAN AKQTTEDEHAREAKSHQKCSKSCHPGEDCASCQQDEIDVVPKSPLSDVGSEDVGTGSK NDNKLIRQESCLGNSPPFEKESEPESPMDVDNSKNSCQDSEADEETSPGFDEQEDGSS SQTANKPSRFQARDADIEFRKRYSTKGGEVRLHFQFEGGESRTGMNDLNAKLPGNISS LNVECRNSKQHGKKDSKITDHLMRLPKAEDRRKEQWETKHQRTERKIPKYVPPHLSPD KKWLGTPIEEMRRMPRCGIRLPLLRPSANHTVTIRVDLLRAGEVPKPFPTHYKDLWDN KHVKMPCSEQNLYPVEDENGERTAGSRWELIQTALLNKFTRPQNLKDAILKYNVAYSK KWDFTALIDFWDKVLEEAEAQHLYQSILPDMVKIALCLPNICTQPIPLLKQKMNHSIT MSQEQIASLLANAFFCTFPRRNAKMKSEYSSYPDINFNRLFEGRSSRKPEKLKTLFCY FRRVTEKKPTGLVTFTRQSLEDFPEWERCEKPLTRLHVTYEGTIEENGQGMLQVDFAN RFVGGGVTSAGLVQEEIRFLINPELIISRLFTEVLDHNECLIITGTEQYSEYTGYAET YRWSRSHEDGSERDDCERRCTEIVAIDALHFRRYLDQFVPEKMRRELNKAYCGFLRPG VSSENLSAVATGNWGCGAFGGDARLKALIQILAAAAAERDVVYFTFGDSELMRDIYSM HIFLTERKLTVGDVYKLLLRYYNEECRNCSTPGPDIKLYPFIYHAVESCAETADHSGQ RTGT" BASE COUNT 1229 a 829 c 930 g 1081 t ORIGIN 1 ggcgtctggg aagtgaggag cgtctctgcc tggcagaggc tgcaatctct gcactttggg 61 gggccaaggc aggcgctgag aaggacgcgc agtccatctc tctcaggtta gtgaaatgag 121 gctctccgcg gggccggccc ggggacagtg cgctgctggt cccagcatga atgcgggccc 181 cggctgtgaa ccctgcacca aagcgacccg ctggggcgcc gctacaactt cgccggctgc 241 ttcggacgcc cggagctttc cgagcaggca gaggcgcgtc ctcgacccca aggacgctca 301 cgtgcagttc agggtcccac cgtcctcgcc agcctgcgtc ccagggcagg cgggacagca 361 cagaggcagc gccacctcgc ttgttttcaa acaaaagact attaccagtt ggatggacac 421 taaaggaatc aagacagcgg aatcagaaag tttggatagt aaagaaaaca acaatacaag 481 aatagaatcc atgatgagtt ctgtacaaaa agataacttt taccaacata atgtagaaaa 541 attagtaaat gtttctcagc taagtcttga taagtcactc actgaaaaaa gtacacagta 601 tttgaaccag catcagactg cagcaatgtg taagtggcaa aatgaaggga aacacacgga 661 gcagcttttg gaaagtgaac ctcaaacagt aaccctggta ccagagcagt ttagtaatgc 721 taacattgat cggtcacctc aaaatgatga tcacagtgac acagatagtg aagagaatag 781 agacaatcaa cagtttctca caactgtaaa gcttgcaaat gcaaagcaga ctacggaaga 841 tgaacacgcc agagaagcca aaagccacca gaagtgcagc aagtcttgcc atcctgggga 901 agactgtgca agttgtcagc aagatgagat agacgtggtg ccaaagagtc cattgtcaga 961 tgttggctct gaggatgttg gtactgggtc aaaaaatgac aacaaattga ttagacaaga 1021 aagttgccta ggaaattctc ctccatttga gaaggaaagt gaacccgaat caccgatgga 1081 tgtggataat tctaaaaata gttgtcaaga ctcagaagca gatgaggaga caagtccagg 1141 ttttgatgaa caagaagatg gtagttcctc ccaaacagca aataaacctt caaggttcca 1201 agcaagagac gctgacattg aatttaggaa acggtactct actaagggcg gtgaagttag 1261 attacatttc caatttgaag gaggagagag tcgcactgga atgaatgatt taaatgctaa 1321 actacctgga aatatttcta gcctgaatgt agaatgcaga aattctaagc aacatggaaa 1381 aaaggattct aaaatcacag atcatttgat gagactgccc aaagcagagg acagaagaaa 1441 agaacagtgg gaaaccaaac atcaaagaac agaaaggaag atccctaaat acgttccacc 1501 tcacctttct ccagataaga agtggcttgg aactcccatt gaggagatga gaagaatgcc 1561 tcggtgtggg atccggctgc ctctcttgag accatctgcc aatcacacag taactattcg 1621 ggtagatctt ttgcgagcag gagaagttcc taaacctttt ccaacacatt ataaagattt 1681 gtgggataac aagcatgtta aaatgccttg ttcagaacaa aatttgtacc cagtggaaga 1741 tgagaatggt gagcgaactg cggggagccg gtgggagctc attcagactg cacttctcaa 1801 caaatttaca cgaccccaaa acttgaagga tgctattctg aaatacaatg tggcatattc 1861 taagaaatgg gactttacag ctttgatcga tttctgggat aaggtacttg aagaagcaga 1921 agctcaacat ttatatcagt ccatcttgcc tgatatggtg aaaattgcac tctgtctgcc 1981 aaatatttgc acccagccaa taccactcct gaaacagaag atgaatcatt ccatcacaat 2041 gtcgcaggaa cagattgcca gtcttttagc taatgctttc ttctgcacat ttccacgacg 2101 aaatgctaag atgaaatcgg agtattctag ttacccagac attaacttca atcgattgtt 2161 tgagggacgt tcatcaagga aaccggagaa acttaaaacg ctcttctgct actttagaag 2221 agtcacagag aaaaaaccta ctgggttggt gacatttaca agacagagtc ttgaagattt 2281 tccagaatgg gaaagatgtg aaaaaccctt gacacgattg catgtcactt acgaaggtac 2341 catagaagaa aatggccaag gcatgctaca ggtggatttt gcaaatcgtt ttgttggagg 2401 tggtgtaacc agtgcaggac ttgtgcaaga agaaatccgc tttttaatca atcctgagtt 2461 gattatttca cggctcttca ctgaggtgct ggatcacaat gaatgtctaa ttatcacagg 2521 tactgagcag tacagtgaat acacaggcta tgctgagaca tatcgttggt cccggagcca 2581 cgaagatggg agtgaaaggg acgactgcga gcggcgctgc actgagatcg ttgccatcga 2641 tgctcttcac ttcagacgct acctcgatca gtttgtgcct gagaaaatga gacgcgagct 2701 gaacaaggct tactgtggat ttctccgtcc tggagtttct tcagagaatc tttctgcagt 2761 ggccacagga aactggggct gtggtgcctt tgggggtgat gccaggttaa aagccttaat 2821 acagatattg gcagctgctg cagctgagcg agatgtggtt tatttcacct ttggggactc 2881 agaattgatg agagacattt acagcatgca cattttcctt actgaaagga aactcactgt 2941 tggagatgtg tataagctgt tgctacgata ctacaatgaa gaatgcagaa actgttccac 3001 ccctggacca gacatcaagc tttatccatt catataccat gctgtcgagt cctgtgcaga 3061 gaccgctgac cattcagggc aaaggacagg gacctgagga gccgagcgaa tagcatctcc 3121 tcccacctcc caccagagac gtcctgtttg agctgtcagg tgtaatatat gaattgactt 3181 aagttaatat aaatgtgtac ataatccaca tttgtagtca aggacgcaat ctcttccaca 3241 catgtgcagt tgtcagttgg tacatctaaa ctccctccat cctgactcac gtggacttag 3301 atatgttttg tttctatttt cttctatttc agtttttcat tctttgatgt ttatttcttt 3361 tgtccatcag atctcttgtg aaatcccatg gaaggttgtg ctcagctgtc gggtctcttt 3421 cttcctgccc atatattata ccagttgctt ctgcagcccg cagatgccca gcgatgccca 3481 ggaaacaagt tgaaatccca ggaatctctt taactgattt tgctaaaaat ctccctgtga 3541 gccttccact caactcttaa tatgcttgca ttgtttaagt ttttaaattc tgaaaattaa 3601 taattagggt ttttttcata tgtgttgcat aatgcaaacc tcctaggtta aaatagtttc 3661 tttatttaag atagaataat ttccagaaat tgtacttttg aggtatcatt tttatctgta 3721 atggtttgtc tgtctttttt cctctgatca gtattttttt ataccagttt tggagactgc 3781 ctgagatgaa aggaaatgtg gaataaaagg aggttttcct gatgtggtgt aaagaaaaca 3841 gattccaaga gaattgaaga ttttttttgt ttccttggta cttttttctt tttaaattag 3901 gactaatgtt tcttttgtgg tgcttgaggc atattcatat aaccaaagtt tgagaactgg 3961 gaacttcatg ctgatttgta catattgaag tttctctggt attcaaaggt tatatagtga 4021 atgaattttc attaataaat cactttgtca gaaaaaaaaa aaaaaaaaa // LOCUS AF005080 625 bp mRNA PRI 08-NOV-1997 DEFINITION Homo sapiens skin-specific protein (xp5) mRNA, complete cds. ACCESSION AF005080 NID g2589187 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 625) AUTHORS Zhao,X.P. and Elder,J.T. TITLE Positional cloning of novel skin-specific genes from the human epidermal differentiation complex JOURNAL Genomics 45 (2), 250-258 (1997) MEDLINE 98008911 REFERENCE 2 (bases 1 to 625) AUTHORS Zhao,X.P. and Elder,J.T. TITLE Direct Submission JOURNAL Submitted (22-MAY-1997) Dermatology, University of Michigan, 1500 W Medical Center Dr., C560 MSRB II, Ann Arbor, MI 48109-0672, USA FEATURES Location/Qualifiers source 1..625 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1q21" /tissue_type="skin" gene 1..625 /gene="xp5" CDS 52..384 /gene="xp5" /codon_start=1 /product="skin-specific protein" /db_xref="PID:g2589188" /translation="MSCQQNQQQCQPPPKCPPKCTPKCPPKCPPKCLPQCPAPCSPAV SSCCGPISGGCCGPSSGGCCNSGAGGCCLSHHRPRLFHRRRHQSPDCCESEPSGGSGC CHSSGGCC" BASE COUNT 128 a 192 c 147 g 157 t 1 others ORIGIN 1 ggacgtgtct gtgctcctgc gtgtgaccag ggttgactaa actctgccag gatgtcttgc 61 cagcaaaacc agcagcagtg ccagccccct cccaagtgtc ctcccaagtg taccccaaaa 121 tgtccaccta agtgtccccc taaatgcctg ccccagtgcc cagctccatg ttcccctgca 181 gtctcttctt gctgtggtcc catctctggg ggctgctgtg gtcccagctc tgggggctgc 241 tgcaactctg gggctggtgg ctgctgcctg agccaccaca ggccccgtct cttccaccgg 301 cgccggcacc agagccccga ctgctgtgag agtgaacctt ctgggggctc tggctgctgc 361 cacagctctg ggggctgctg ctgacctggg ctaagaaaaa ctctttggac agaatgttta 421 agaacctcct acagcctgat gcttaaccct ttccatttcc tctcattcca ttcatgggtg 481 gacagcgacc acaaagactc atggggcttc cctggganaa ctttgcactt gatggaacac 541 ctcaattgca ggttttgttt tcctccttta cctcatgttt gttaataaac tctgtttctg 601 actctcaaaa aaaaaaaaaa aaaaa // LOCUS AF005216 4161 bp mRNA PRI 30-OCT-1997 DEFINITION Homo sapiens receptor-associated tyrosine kinase (JAK2) mRNA, complete cds. ACCESSION AF005216 NID g2570358 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4161) AUTHORS Peeters,P., Raynaud,S.D., Cools,J., Wlodarska,I., Grosgeorge,J., Philip,P., Monpoux,F., Van Rompaey,L., Baens,M., Van den Berghe,H. and Marynen,P. TITLE Fusion of TEL, the ETS-variant gene 6 (ETV6), to the receptor-associated kinase JAK2 as a result of t(9;12) in a lymphoid and t(9;15;12) in a myeloid leukemia JOURNAL Blood 90 (7), 2535-2540 (1997) MEDLINE 97465498 REFERENCE 2 (bases 1 to 4161) AUTHORS Peeters,P., Cools,J. and Marynen,P. TITLE Direct Submission JOURNAL Submitted (23-MAY-1997) Human Genome laboratory, Center for Human Genetics, Herestraat 49, Leuven B-3000, Belgium FEATURES Location/Qualifiers source 1..4161 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="9" /map="9p24" gene 1..4161 /gene="JAK2" CDS 495..3893 /gene="JAK2" /note="Janus kinase 2" /codon_start=1 /product="receptor-associated tyrosine kinase" /db_xref="PID:g2570359" /translation="MGMACLTMTEMEGTSTSSIYQNGDISGNANSMKQIDPVLQVYLY HSLGKSEADYLTFPSGEYVAEEICIAASKACGITPVYHNMFALMSETERIWYPPNHVF HIDESTRHNVLYRIRFYFPRWYCSGSNRAYRHGISRGAEAPLLDDFVMSYLFAQWRHD FVHGWIKVPVTHETQEECLGMAVLDMMRIAKENDQTPLAIYNSISYKTFLPKCIRAKI QDYHILTRKRIRYRFRRFIQQFSQCKATARNLKLKYLINLETLQSAFYTEKFEVKEPG SGPSGEEIFATIIITGNGGIQWSRGKHKESETLTEQDLQLYCDFPNIIDVSIKQANQE GSNESRVVTIHKQDGKNLEIELSSLREALSFVSLIDGYYRLTADAHHYLCKEVAPPAV LENIQSNCHGPISMDFAISKLKKAGNQTGLYVLRCSPKDFNKYFLTFAVERENVIEYK HCLITKNENEEYNLSGTKKNFSSLKDLLNCYQMETVRSDNIIFQFTKCCPPKPKDKSN LLVFRTNGVSDVPTSPTLQRPTHMNQMVFHKIRNEDLIFNESLGQGTFTKIFKGVRRE VGDYGQLHETEVLLKVLDKAHRNYSESFFEAASMMSKLSHKHLVLNYGVCVCGDENIL VQEFVKFGSLDTYLKKNKNCINILWKLEVAKQLAWAMHFLEENTLIHGNVCAKNILLI REEDRKTGNPPFIKLSDPGISITVLPKDILQERIPWVPPECIENPKNLNLATDKWSFG TTLWEICSGGDKPLSALDSQRKLQFYEDRHQLPAPKWAELANLINNCMDYEPDFRPSF RAIIRDLNSLFTPDYELLTENDMLPNMRIGALGFSGAFEDRDPTQFEERHLKFLQQLG KGNFGSVEMCRYDPLQDNTGEVVAVKKLQHSTEEHLRDFEREIEILKSLQHDNIVKYK GVCYSAGRRNLKLIMEYLPYGSLRDYLQKHKERIDHIKLLQYTSQICKGMEYLGTKRY IHRDLATRNILVENENRVKIGDFGLTKVLPQDKEYYKVKEPGESPIFWYAPESLTESK FSVASDVWSFGVVLYELFTYIEKSKSPPAEFMRMIGNDKQGQMIVFHLIELLKNNGRL PRPDGCPDEIYMIMTECWNNNVNQRPSFRDLALRVDQIRDNMAG" BASE COUNT 1314 a 792 c 946 g 1109 t ORIGIN 1 ctgcaggaag gagagaggaa gaggagcaga agggggcagc agcggacgcc gctaacggcc 61 tccctcggcg ctgacaggct gggccggcgc ccggctcgct tgggtgttcg cgtcgccact 121 tcggcttctc ggccggtcgg gcccctcggc ccgggcttgc ggcgcgcgtc ggggctgagg 181 gctgctgcgg cgcagggaga ggcctggtcc tcgctgccga gggatgtgag tgggagctga 241 gcccacactg gagggccccc gagggcccag cctggaggtc gttcagagcc gtgcccgccc 301 cggggcttcg cagaccttga cccgccgggt aggagccgcc cctgcgggct cgagggcgcg 361 ctctggtcgc ccgatctgtg tagccggttt cagaagcagg caacaggaac aagatgtgaa 421 ctgtttctct tctgcagaaa aagaggctct tcctcctcct cccgcgacgg caaatgttct 481 gaaaaagact ctgcatggga atggcctgcc ttacgatgac agaaatggag ggaacatcca 541 cctcttctat atatcagaat ggtgatattt ctggaaatgc caattctatg aagcaaatag 601 atccagttct tcaggtgtat ctttaccatt cccttgggaa atctgaggca gattatctga 661 cctttccatc tggggagtat gttgcagaag aaatctgtat tgctgcttct aaagcttgtg 721 gtatcacacc tgtgtatcat aatatgtttg ctttaatgag tgaaacagaa aggatctggt 781 atccacccaa ccatgtcttc catatagatg agtcaaccag gcataatgta ctctacagaa 841 taagatttta ctttcctcgt tggtattgca gtggcagcaa cagagcctat cggcatggaa 901 tatctcgagg tgctgaagct cctcttcttg atgactttgt catgtcttac ctctttgctc 961 agtggcggca tgattttgtg cacggatgga taaaagtacc tgtgactcat gaaacacagg 1021 aagaatgtct tgggatggca gtgttagata tgatgagaat agccaaagaa aacgatcaaa 1081 ccccactggc catctataac tctatcagct acaagacatt cttaccaaaa tgtattcgag 1141 caaagatcca agactatcat attttgacaa ggaagcgaat aaggtacaga tttcgcagat 1201 ttattcagca attcagccaa tgcaaagcca ctgccagaaa cttgaaactt aagtatctta 1261 taaatctgga aactctgcag tctgccttct acacagagaa atttgaagta aaagaacctg 1321 gaagtggtcc ttcaggtgag gagatttttg caaccattat aataactgga aacggtggaa 1381 ttcagtggtc aagagggaaa cataaagaaa gtgagacact gacagaacag gatttacagt 1441 tatattgcga ttttcctaat attattgatg tcagtattaa gcaagcaaac caagagggtt 1501 caaatgaaag ccgagttgta actatccata agcaagatgg taaaaatctg gaaattgaac 1561 ttagctcatt aagggaagct ttgtctttcg tgtcattaat tgatggatat tatagattaa 1621 ctgcagatgc acatcattac ctctgtaaag aagtagcacc tccagccgtg cttgaaaata 1681 tacaaagcaa ctgtcatggc ccaatttcga tggattttgc cattagtaaa ctgaagaaag 1741 caggtaatca gactggactg tatgtacttc gatgcagtcc taaggacttt aataaatatt 1801 ttttgacttt tgctgtcgag cgagaaaatg tcattgaata taaacactgt ttgattacaa 1861 aaaatgagaa tgaagagtac aacctcagtg ggacaaagaa gaacttcagc agtcttaaag 1921 atcttttgaa ttgttaccag atggaaactg ttcgctcaga caatataatt ttccagttta 1981 ctaaatgctg tcccccaaag ccaaaagata aatcaaacct tctagtcttc agaacgaatg 2041 gtgtttctga tgtaccaacc tcaccaacat tacagaggcc tactcatatg aaccaaatgg 2101 tgtttcacaa aatcagaaat gaagatttga tatttaatga aagccttggc caaggcactt 2161 ttacaaagat ttttaaaggc gtacgaagag aagtaggaga ctacggtcaa ctgcatgaaa 2221 cagaagttct tttaaaagtt ctggataaag cacacagaaa ctattcagag tctttctttg 2281 aagcagcaag tatgatgagc aagctttctc acaagcattt ggttttaaat tatggagtat 2341 gtgtctgtgg agacgagaat attctggttc aggagtttgt aaaatttgga tcactagata 2401 catatctgaa aaagaataaa aattgtataa atatattatg gaaacttgaa gttgctaaac 2461 agttggcatg ggccatgcat tttctagaag aaaacaccct tattcatggg aatgtatgtg 2521 ccaaaaatat tctgcttatc agagaagaag acaggaagac aggaaatcct cctttcatca 2581 aacttagtga tcctggcatt agtattacag ttttgccaaa ggacattctt caggagagaa 2641 taccatgggt accacctgaa tgcattgaaa atcctaaaaa tttaaatttg gcaacagaca 2701 aatggagttt tggtaccact ttgtgggaaa tctgcagtgg aggagataaa cctctaagtg 2761 ctctggattc tcaaagaaag ctacaatttt atgaagatag gcatcagctt cctgcaccaa 2821 agtgggcaga attagcaaac cttataaata attgtatgga ttatgaacca gatttcaggc 2881 cttctttcag agccatcata cgagatctta acagtttgtt tactccagat tatgaactat 2941 taacagaaaa tgacatgtta ccaaatatga ggataggtgc cctagggttt tctggtgcct 3001 ttgaagaccg ggatcctaca cagtttgaag agagacattt gaaatttcta cagcaacttg 3061 gcaagggtaa ttttgggagt gtggagatgt gccggtatga ccctctacag gacaacactg 3121 gggaggtggt cgctgtaaaa aagcttcagc atagtactga agagcaccta agagactttg 3181 aaagggaaat tgaaatcctg aaatccctac agcatgacaa cattgtaaag tacaagggag 3241 tgtgctacag tgctggtcgg cgtaatctaa aattaattat ggaatattta ccatatggaa 3301 gtttacgaga ctatcttcaa aaacataaag aacggataga tcacataaaa cttctgcagt 3361 acacatctca gatatgcaag ggtatggagt atcttggtac aaaaaggtat atccacaggg 3421 atctggcaac gagaaatata ttggtggaga acgagaacag agttaaaatt ggagattttg 3481 ggttaaccaa agtcttgcca caagacaaag aatactataa agtaaaagaa cctggtgaaa 3541 gtcccatatt ctggtatgct ccagaatcac tgacagagag caagttttct gtggcctcag 3601 atgtttggag ctttggagtg gttctgtatg aacttttcac atacattgag aagagtaaaa 3661 gtccaccagc ggaatttatg cgtatgattg gcaatgacaa acaaggacag atgatcgtgt 3721 tccatttgat agaacttttg aagaataatg gaagattacc aagaccagat ggatgcccag 3781 atgagatcta tatgatcatg acagaatgct ggaacaataa tgtaaatcaa cgcccctcct 3841 ttagggatct agctcttcga gtggatcaaa taagggataa catggctgga tgaaagaaat 3901 gaccttcatt ctgagaccaa agtagattta cagaacaaag ttttatattt cacattgctg 3961 tggactatta ttacatatat cattattata taaatcatga tgctagccag caaagatgtg 4021 aaaatatctg ctcaaaactt tcaaagttta gtaagttttt cttcatgagg ccaccagtaa 4081 aagacattaa tgagaattcc ttagcaagga ttttgtaaga agtttcttaa acattgtctg 4141 ttaacatcac tcttgtctgg c // LOCUS AF005271 591 bp mRNA PRI 31-JUL-1997 DEFINITION Homo sapiens FMRFamide-related prepropeptide mRNA, complete cds. ACCESSION AF005271 NID g2232300 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 591) AUTHORS Perry,S.J., Yi-Kung Huang,E., Cronk,D., Bagust,J., Sharma,R., Walker,R.J., Wilson,S. and Burke,J.F. TITLE A human gene encoding morphine modulating peptides related to NPFF and FMRFamide JOURNAL FEBS Lett. 409 (3), 426-430 (1997) MEDLINE 97367936 REFERENCE 2 (bases 1 to 591) AUTHORS Perry,S.J., Burke,J.F. and Cronk,D. TITLE Direct Submission JOURNAL Submitted (27-MAY-1997) Sussex Centre for Neuroscience, University of Sussex, Falmer, Brighton, Sussex BN1 9QG, UK FEATURES Location/Qualifiers source 1..591 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="testis" CDS 164..505 /function="morphine modulating peptides" /codon_start=1 /product="FMRFamide-related prepropeptide" /db_xref="PID:g2232301" /translation="MDSRQAAALLVLLLLIDGGCAEGPGGQQEDQLSAEEDSEPLPPQ DAQTSGSLLHYLLQAMERPGRSQAFLFQPQRFGRNTQGSWRNEWLSPRAGEGLNSQFW SLAAPQRFGKK" sig_peptide 164..223 mat_peptide 356..394 /product="neuropeptide FF" mat_peptide 437..496 /product="neuropeptide AF" BASE COUNT 126 a 152 c 189 g 124 t ORIGIN 1 catgaagtcc tgggggcgcc atgggaggag atcccaggtg gctcctaatg agccctgcat 61 ttcatttgcc tgctctagat tcccctaagg ctactgtgag gctgggggtg ggggaacagc 121 aggtataaga ggttggggtg gctgtaggag ggtaggtggc agcatggatt ctaggcaggc 181 tgctgcactg ctggtgctgc tgctgttaat agacgggggc tgtgctgaag ggccaggagg 241 ccagcaggaa gaccagctct ccgcggagga agacagcgaa cccctcccac cacaggatgc 301 ccagacctct gggtcactgt tgcactacct gctccaggca atggagagac ctggccggag 361 ccaagccttc ctgtttcagc cccagaggtt tggcagaaat acccagggat cctggaggaa 421 tgaatggctg agtccccggg ctggagaggg gctgaattcc cagttctgga gcctggctgc 481 ccctcaacgc tttgggaaga agtgacatgt catcccttga tatgtctgca tgcaaggtcc 541 acacccaaaa gtgtcaatgt ttgcccccca aataaaattg tctggcttct g // LOCUS AF005418 1743 bp mRNA PRI 16-DEC-1997 DEFINITION Homo sapiens retinoic acid hydroxylase mRNA, complete cds. ACCESSION AF005418 NID g2688845 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1743) AUTHORS White,J.A., Beckett-Jones,B., Guo,Y.D., Dilworth,F.J., Bonasoro,J., Jones,G. and Petkovich,M. TITLE cDNA cloning of human retinoic acid-metabolizing enzyme (hP450RAI) identifies a novel family of cytochromes P450 JOURNAL J. Biol. Chem. 272 (30), 18538-18541 (1997) MEDLINE 97373542 REFERENCE 2 (bases 1 to 1743) AUTHORS White,J.A., Beckett-Jones,B., Guo,Y., Dilworth,F.J., Bonasoro,J., Jones,G. and Petkovich,M. TITLE Direct Submission JOURNAL Submitted (26-MAY-1997) Cancer Research Labs, Queen's University, Botterell Hall, Rm 355, Kingston, Ont K7L 3N6, Canada FEATURES Location/Qualifiers source 1..1743 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 39..1532 /note="hP450RAI; cytochrome P450" /codon_start=1 /product="retinoic acid hydroxylase" /db_xref="PID:g2688846" /translation="MGLPALLASALCTFVLPLLLFLAAIKLWDLYCVSGRDRSCALPL PPGTMGFPFFGETLQMVLQRRKFLQMKRRKYGFIYKTHLFGRPTVRVMGADNVRRILL GDDRLVSVHWPASVRTILGSGCLSNLHDSSHKQRKKVIMRAFSREALECYVPVITEEV GSSLEQWLSCGERGLLVYPEVKRLMFRIAMRILLGCEPQLAGDGDSEQQLVEAFEEMT RNLFSLPIDVPFSGLYRGMKARNLIHARIEQNIRAKICGLRASEAGQGCKDALQLLIE HSWERGERLDMQALKQSSTELLFGGHETTASAATSLITYLGLYPHVLQKVREELKSKG LLCKSNQDNKLDMEILEQLKYIGCVIKETLRLNPPVPGGFRVALKTFELNGYQIPKGW NVIYSICDTHDVAEIFTNKEEFNPDRFMLPHPEDASRFSFIPFGGGLRSCVGKEFAKI LLKIFTVELARHCDWQLLNGPPTMKTSPTVYPVDNLPARFTHFHGEI" BASE COUNT 423 a 442 c 477 g 401 t ORIGIN 1 gaattcggca cgagtggcgc gggaggtcgc ggcgcgccat ggggctcccg gcgctgctgg 61 ccagtgcgct ctgcaccttc gtgctgccgc tgctgctctt cctggctgcg atcaagctct 121 gggacctgta ctgcgtgagc ggccgcgacc gcagttgtgc cctcccattg ccccccggga 181 ctatgggctt ccccttcttt ggggaaacct tgcagatggt actgcagcgg aggaagttcc 241 tgcagatgaa gcgcaggaaa tacggcttca tctacaagac gcatctgttc gggcggccca 301 ccgtacgggt gatgggcgcg gacaatgtgc ggcgcatctt gctcggagac gaccggctgg 361 tgtcggtcca ctggccagcg tcggtgcgca ccattctggg atctggctgc ctctctaacc 421 tgcacgactc ctcgcacaag cagcgcaaga aggtgattat gcgggccttc agccgcgagg 481 cactcgaatg ctacgtgccg gtgatcaccg aggaagtggg cagcagcctg gagcagtggc 541 tgagctgcgg cgagcgcggc ctcctggtct accccgaggt gaagcgcctc atgttccgaa 601 tcgccatgcg catcctactg ggctgcgaac cccaactggc gggcgacggg gactccgagc 661 agcagcttgt ggaggccttc gaggaaatga cccgcaatct cttctcgctg cccatcgacg 721 tgcccttcag cgggctgtac cggggcatga aggcgcggaa cctcattcac gcgcgcatcg 781 agcagaacat tcgcgccaag atctgcgggc tgcgggcatc cgaggcgggc cagggctgca 841 aagacgcgct gcagctgttg atcgagcact cgtgggagag gggagagcgg ctggacatgc 901 aggcactaaa gcaatcttca accgaactcc tctttggagg acacgaaacc acggccagtg 961 cagccacatc tctgatcact tacctggggc tctacccaca tgttctccag aaagtgcgag 1021 aagagctgaa gagtaagggt ttactttgca agagcaatca agacaacaag ttggacatgg 1081 aaattttgga acaacttaaa tacatcgggt gtgttattaa ggagaccctt cgactgaatc 1141 ccccagttcc aggagggttt cgggttgctc tgaagacttt tgaattaaat ggataccaga 1201 ttcccaaggg ctggaatgtt atctacagta tctgtgatac tcatgatgtg gcagagatct 1261 tcaccaacaa ggaagaattt aatcctgacc gattcatgct gcctcaccca gaggatgcat 1321 ccaggttcag cttcattcca tttggaggag gccttaggag ctgtgtaggc aaagaatttg 1381 caaaaattct tctcaaaata tttacagtgg agctggccag gcattgtgac tggcagcttc 1441 taaatggacc tcctacaatg aaaaccagtc ccaccgtgta tcctgtggac aatctccctg 1501 caagattcac ccatttccat ggggaaatct gatgagcttg aatgttcaaa cctgagactt 1561 attggaagtg tacatatgag tttttaagga gtgttgtgtt gactttatat ttaatttcta 1621 aatgtatatt ataatattta tgtgttttga ctatactacc acaatcttta aatattaaaa 1681 taatgaattt gtatcatttc caaataaagt aaaatttgaa ggtaaaaaaa aaaaaaaaaa 1741 aaa // LOCUS AF005419 1113 bp DNA PRI 09-AUG-1997 DEFINITION Homo sapiens P2Y5-like receptor gene, complete cds. ACCESSION AF005419 NID g2240034 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1113) AUTHORS Janssens,R., Boeynaems,J.M., Godart,M. and Communi,D. TITLE Cloning of a human heptahelical receptor closely related to the P2Y5 receptor JOURNAL Biochem. Biophys. Res. Commun. 236 (1), 106-112 (1997) MEDLINE 97366605 REFERENCE 2 (bases 1 to 1113) AUTHORS Janssens,R., Boeynaems,J.M., Godart,M. and Communi,D. TITLE Direct Submission JOURNAL Submitted (26-MAY-1997) Institute for Interdisciplinary Research, Universite Libre de Bruxelles, Route de Lennik 808, Building C, C5-145, Brussels 1070, Belgium FEATURES Location/Qualifiers source 1..1113 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lDashII" mRNA 1..1113 /product="P2Y5-like receptor" CDS 1..1113 /codon_start=1 /product="P2Y5-like receptor" /db_xref="PID:g2240035" /translation="MGDRRFIDFQFQDSNSSLRPRLGNATANNTCIVDDSFKYNLNGA VYSVVFILGLITNSVSLFVFCFRMKMRSETAIFITNLAVSDLLFVCTLPFKIFYNFNR HWPFGDTLCKISGTAFLTNIYGSMLFLTCISVDRFLAIVYPFRSRTIRTRRNSAIVCA GVWILVLSGGISASLFSTTNVNNATTTCFEGLSKRVWKTYLSKITIFIEVVGFIIPLI LNVSCSSVVLRTLRKPATLSQIGTNKKKVLKMITVHMAVFVVCFVPYNSVLFLYALVR SQAITNCFLERFAKIMYPITLCLATLNCCFDPFIYYFTLESFQKSFYINAHIRMESLF KTETPLTTKPSLPAIQEEVSDQTTNNGGELMLESTF" BASE COUNT 276 a 262 c 201 g 374 t ORIGIN 1 atgggtgaca gaagattcat tgacttccaa ttccaagatt caaattcaag cctcagaccc 61 aggttgggca atgctactgc caataatact tgcattgttg atgattcctt caagtataat 121 ctcaatggtg ctgtctacag tgttgtattc atcttgggtc tgataaccaa cagtgtctct 181 ctgtttgtct tctgtttccg catgaaaatg agaagtgaga ctgctatttt tatcaccaat 241 ctagctgtct ctgatttgct ttttgtctgt acactacctt ttaaaatatt ttacaacttc 301 aaccgccact ggccttttgg tgacaccctc tgcaagatct ctggaactgc attccttacc 361 aacatctatg ggagcatgct ctttctcacc tgtattagtg tggatcgttt cctggccatt 421 gtctatcctt ttcgatctcg tactattagg actaggagga attctgccat tgtgtgtgct 481 ggtgtctgga tcctagtcct cagtggcggt atttcagcct ctttgttttc caccactaat 541 gtcaacaatg caaccaccac ctgctttgaa ggcctctcca aacgtgtctg gaagacttat 601 ttatccaaga tcacaatatt tattgaagtt gttgggttta tcattcctct aatattgaat 661 gtctcttgct cttctgtggt gctgagaact cttcgcaagc ctgctactct gtctcaaatt 721 gggaccaata agaaaaaagt actgaaaatg atcacagtac atatggcagt ctttgtggta 781 tgctttgtac cctacaactc tgtcctcttc ttgtatgccc tggtgcgctc ccaagctatt 841 actaattgct ttttggaaag atttgcaaag atcatgtacc caatcacctt gtgccttgca 901 actctgaact gttgttttga ccctttcatc tattacttca cccttgaatc ctttcagaag 961 tccttctaca tcaatgccca catcagaatg gagtccctgt ttaagactga aacacctttg 1021 accacaaagc cttcccttcc agctattcaa gaggaagtga gtgatcaaac aacaaataat 1081 ggtggtgaat taatgctaga atccaccttt tag // LOCUS AF005482 1981 bp mRNA PRI 02-DEC-1997 DEFINITION Homo sapiens histone deacetylase-3C mRNA, complete cds. ACCESSION AF005482 NID g2654534 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1981) AUTHORS Yang,W.M., Yao,Y.L., Sun,J.M., Davie,J.R. and Seto,E. TITLE Isolation and characterization of cDNAs corresponding to an additional member of the human histone deacetylase gene family JOURNAL J. Biol. Chem. 272 (44), 28001-28007 (1997) MEDLINE 98010646 REFERENCE 2 (bases 1 to 1981) AUTHORS Yang,W.-M. and Seto,E. TITLE Direct Submission JOURNAL Submitted (26-MAY-1997) Molecular Oncology, H. Lee Moffitt Cancer Center at USF, 12902 Magnolia Drive, MRC3037, Tampa, FL 33612, USA FEATURES Location/Qualifiers source 1..1981 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="fibroblast" CDS 254..1369 /note="HDAC3C; similar to yeast RPD3 protein" /codon_start=1 /product="histone deacetylase-3C" /db_xref="PID:g2654535" /translation="MCRFHSEDYIDFLQRVSPTNMQGFTKSLNAFNVGDDCPVFPGLF EFCSRYTGASLQGATQLNNKICDIAINWAGGLHHAKKFEASGFCYVNDIVIGILELLK YHPRVLYIDIDIHHGDGVQEAFYLTDRVMTVSFHKYGNYFFPGTGDMYEVGAESGRYY CLNVPLRDGIDDQSYKHLFQPVINQVVDFYQPTCIVLQCGADSLGCDRLGCFNLSIRG HGECVEYVKSFNIPLLVLGGGGYTVRNVARCWTYETSLLVEEAISEELPYSEYFEYFA PDFTLHPDVSTRIENQNSRQYLDQIRQTIFENLKMLNHAPSVQIHDVPADLLTYDRTD EADAEERGPEENYSRPEAPNEFYDGDHDNDKESDVEI" BASE COUNT 452 a 490 c 521 g 518 t ORIGIN 1 gaattcgcgc cgctggaaag gaaaagaatg gtcttcaggg ggtcatttgt aataacttac 61 ctggttaatc ggagtctctc tactgactta aggattgtgg gatgagggga atgctatgtt 121 tggcttcata tagtctccag gactgggtca gtatgggtgg tctgagcctg tggagtgggg 181 ctgaccctgc ctgggttcat cttgtgtctc catcccgagg ctcttcaagc cataccaggc 241 ctcccaacat gacatgtgcc gcttccactc cgaggactac attgacttcc tgcagagagt 301 cagccccacc aatatgcaag gcttcaccaa gagtcttaat gccttcaacg taggcgatga 361 ctgcccagtg tttcccgggc tctttgagtt ctgctcgcgt tacacaggcg catctctgca 421 aggagcaacc cagctgaaca acaagatctg tgatattgcc attaactggg ctggtggtct 481 gcaccatgcc aagaagtttg aggcctctgg cttctgctat gtcaacgaca ttgtgattgg 541 catcctggag ctgctcaagt accaccctcg ggtgctctac attgacattg acatccacca 601 tggtgacggg gttcaagaag ctttctacct cactgaccgg gtcatgacgg tgtccttcca 661 caaatacgga aattacttct tccctggcac aggtgacatg tatgaagtcg gggcagagag 721 tggccgctac tactgtctga acgtgcccct gcgggatggc attgatgacc agagttacaa 781 gcaccttttc cagccggtta tcaaccaggt agtggacttc taccaaccca cgtgcattgt 841 gctccagtgt ggagctgact ctctgggctg tgatcgattg ggctgcttta acctcagcat 901 ccgagggcat ggggaatgcg ttgaatatgt caagagcttc aatatccctc tactcgtgct 961 gggtggtggt ggttatactg tccgaaatgt tgcccgctgc tggacatatg agacatcgct 1021 gctggtagaa gaggccatta gtgaggagct tccctatagt gaatacttcg agtactttgc 1081 cccagacttc acacttcatc cagatgtcag cacccgcatc gagaatcaga actcacgcca 1141 gtatctggac cagatccgcc agacaatctt tgaaaacctg aagatgctga accatgcacc 1201 tagtgtccag attcatgacg tgcctgcaga cctcctgacc tatgacagga ctgatgaggc 1261 tgatgcagag gagaggggtc ctgaggagaa ctatagcagg ccagaggcac ccaatgagtt 1321 ctatgatgga gaccatgaca atgacaagga aagcgatgtg gagatttaag agtggcttgg 1381 gatgctgtgt cccaaggaat ttcttttcac ctcttggaag ggctggaggg aaaaggagtg 1441 gctcctagag tcctgggggt caccccaggg gcttttgctg actctgggaa agagtctgga 1501 gaccacattt ggttctcgaa ccatctacct gcttttcctc tctctcccaa ggactgacaa 1561 tggtacctat tagggatgag atacagacaa ggatagctat ctgggacatt attggcagtg 1621 ggccctggag gcagtcccta gccccccttg ccccttattt cttccctgct tccctcgaac 1681 ccagagattt ttgagggatg aacgggtaga caaggactga gattgcctct gacttcctcc 1741 tcccctgggt tctgaccttc ttcctcccct tgcttccagg gaagatgaag agagagagat 1801 ttggaagggg ctctggctcc ctaacacctg aatcccagat gatgggaagt atgttttcaa 1861 gtgtggggag gatatgaaaa tgttctgttc tcacttttgg ctttatgtcc attttaccac 1921 tgtttttatc caataaacta agtcggtatt ttttgtacct ttgatggttt agcggccgcg 1981 c // LOCUS AF005632 3904 bp mRNA PRI 24-OCT-1997 DEFINITION Homo sapiens phosphodiesterase I/nucleotide pyrophosphatase beta (PDNP3) mRNA, complete cds. ACCESSION AF005632 NID g2465539 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3904) AUTHORS Jin-Hua,P., Goding,J.W., Nakamura,H. and Sano,K. TITLE Molecular cloning and chromosomal localization of PD-ibeta (PDNP3), a new member of the human phosphodiesterase I genes JOURNAL Genomics 45 (2), 412-415 (1997) MEDLINE 98008933 REFERENCE 2 (bases 1 to 3904) AUTHORS Sano,K. and Piao,J.-H. TITLE Direct Submission JOURNAL Submitted (24-MAY-1997) Pediatrics, Kobe University School of Medicine, 7-5-1 Kusunoki-cho, Chuo-ku, Kobe, Hyogo 650, Japan FEATURES Location/Qualifiers source 1..3904 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6q22" /tissue_type="prostate" gene <1..>3904 /gene="PDNP3" CDS 90..2717 /gene="PDNP3" /EC_number="3.1.4.1" /EC_number="3.6.1.9" /note="ecto-enzyme" /codon_start=1 /product="phosphodiesterase I/nucleotide pyrophosphatase beta" /db_xref="PID:g2465540" /translation="MESTLTLATEQPVKKNTLKKYKIACIVLLALLVIMSLGLGLGLG LRKLEKQGSCRKKCFDASFRGLENCRCDVACKDRGDCCWDFEDTCVESTRIWMCNKFR CGETRLEASLCSCSDDCLQKKDCCADYKSVCQGETSWLEENCDTAQQSQCPEGFDLPP VILFSMDGFRAEYLYTWDTLMPNINKLKTCGIHSKYMRAMYPTKTFPNHYTIVTGLYP ESHGIIDNNMYDVNLNKNFSLSSKEQNNPAWWHGQPMWLTAMYQGLKAATYFWPGSEV AINGSFPSIYMPYNGSVPFEERISTLLKWLDLPKAERPRFYTMYFEEPDSSGHAGGPV SARVIKALQVVDHAFGMLMEGLKQRNLHNCVNIILLADHGMDQTYCNKMEYMTDYFPR INFFYMYEGPAPRIRAHNIPHDFFSFNSEEIVRNLSCRKPDQHFKPYLTPDLPKRLHY AKNVRIDKVHLFVDQQWLAVRSKSNTNCGGGNHGYNNEFRSMEAIFLAHGPSFKEKTE VEPFENIEVYNLMCDLLRIQPAPNNGTHGSLNHLLKVPFYEPSHAEEVSKFSVCGFAN PLPTESLDCFCPHLQNSTQLEQVNQMLNLTQEEITATVKVNLPFGRPRVLQKNVDHCL LYHREYVSGFGKAMRMPMWSSYTVPQLGDTSPLPPTVPDCLRADVRVPPSESQKCSFY LADKNITHGFLYPPASNRTSDSQYDALITSNLVPMYEEFRKMWDYFHSVLLIKHATER NGVNVVSGPIFDYNYDGHFDAPDEITKHLANTDVPIPTHYFVVLTSCKNKSHTPENCP GWLDVLPFIIPHRPTNVESCPEGKPEALWVEERFTAHIARVRDVELLTGLDFYQDKVQ PVSEILQLKTYLPTFETTI" BASE COUNT 1218 a 822 c 857 g 1007 t ORIGIN 1 cggacagttt cttagccata gagattcaca gcccagagca ggaggactac tttattctga 61 taaaacaggt ctatgcagct accaggacaa tggaatctac gttgacttta gcaacggaac 121 aacctgttaa gaagaacact cttaagaaat ataaaatagc ttgcattgtt cttcttgctt 181 tgctggtgat catgtcactt ggattaggcc tggggcttgg actcaggaaa ctggaaaagc 241 aaggcagctg caggaagaag tgctttgatg catcatttag aggactggag aactgccggt 301 gtgatgtggc atgtaaagac cgaggtgatt gctgctggga ttttgaagac acctgtgtgg 361 aatcaactcg aatatggatg tgcaataaat ttcgttgtgg agagaccaga ttagaggcca 421 gcctttgctc ttgttcagat gactgtttgc agaagaaaga ttgctgtgct gactataaga 481 gtgtttgcca aggagaaacc tcatggctgg aagaaaactg tgacacagcc cagcagtctc 541 agtgcccaga agggtttgac ctgccaccag ttatcttgtt ttctatggat ggatttagag 601 ctgaatattt atacacatgg gatactttaa tgccaaatat caataaactg aaaacatgtg 661 gaattcattc aaaatacatg agagctatgt atcctaccaa aaccttccca aatcattaca 721 ccattgtcac gggcttgtat ccagagtcac atggcatcat tgacaataat atgtatgatg 781 taaatctcaa caagaatttt tcactttctt caaaggaaca aaataatcca gcctggtggc 841 atgggcaacc aatgtggctg acagcaatgt atcaaggttt aaaagccgct acctactttt 901 ggcccggatc agaagtggct ataaatggct cctttccttc catatacatg ccttacaacg 961 gaagtgtccc atttgaagag aggatttcta cactgttaaa atggctggac ctgcccaaag 1021 ctgaaagacc caggttttat accatgtatt ttgaagaacc tgattcctct ggacatgcag 1081 gtggaccagt cagtgccaga gtaattaaag ccttacaggt agtagatcat gcttttggga 1141 tgttgatgga aggcctgaag cagcggaatt tgcacaactg tgtcaatatc atccttctgg 1201 ctgaccatgg aatggaccag acttattgta acaagatgga atacatgact gattattttc 1261 ccagaataaa cttcttctac atgtacgaag ggcctgcccc ccgcatccga gctcataata 1321 tacctcatga cttttttagt tttaattctg aggaaattgt tagaaacctc agttgccgaa 1381 aacctgatca gcatttcaag ccctatttga ctcctgattt gccaaagcga ctgcactatg 1441 ccaagaacgt cagaatcgac aaagttcatc tctttgtgga tcaacagtgg ctggctgtta 1501 ggagtaaatc aaatacaaat tgtggaggag gcaaccatgg ttataacaat gagtttagga 1561 gcatggaggc tatctttctg gcacatggac ccagttttaa agagaagact gaagttgaac 1621 catttgaaaa tattgaagtc tataacctaa tgtgtgatct tctacgcatt caaccagcac 1681 caaacaatgg aacccatggt agtttaaacc atcttctgaa ggtgcctttt tatgagccat 1741 cccatgcaga ggaggtgtca aagttttctg tttgtggctt tgctaatcca ttgcccacag 1801 agtctcttga ctgtttctgc cctcacctac aaaatagtac tcagctggaa caagtgaatc 1861 agatgctaaa tctcacccaa gaagaaataa cagcaacagt gaaagtaaat ttgccatttg 1921 ggaggcctag ggtactgcag aagaacgtgg accactgtct cctttaccac agggaatatg 1981 tcagtggatt tggaaaagct atgaggatgc ccatgtggag ttcatacaca gtcccccagt 2041 tgggagacac atcgcctctg cctcccactg tcccagactg tctgcgggct gatgtcaggg 2101 ttcctccttc tgagagccaa aaatgttcct tctatttagc agacaagaat atcacccacg 2161 gcttcctcta tcctcctgcc agcaatagaa catcagatag ccaatatgat gctttaatta 2221 ctagcaattt ggtacctatg tatgaagaat tcagaaaaat gtgggactac ttccacagtg 2281 ttcttcttat aaaacatgcc acagaaagaa atggagtaaa tgtggttagt ggaccaatat 2341 ttgattataa ttatgatggc cattttgatg ctccagatga aattaccaaa catttagcca 2401 acactgatgt tcccatccca acacactact ttgtggtgct gaccagttgt aaaaacaaga 2461 gccacacacc ggaaaactgc cctgggtggc tggatgtcct accctttatc atccctcacc 2521 gacctaccaa cgtggagagc tgtcctgaag gtaaaccaga agctctttgg gttgaagaaa 2581 gatttacagc tcacattgcc cgggtccgtg atgtagaact tctcactggg cttgacttct 2641 atcaggataa agtgcagcct gtctctgaaa ttttgcaact aaagacatat ttaccaacat 2701 ttgaaaccac tatttaactt aataatgtct acttaatata taatttactg tataaagtaa 2761 ttttggcaaa atataagtga ttttttctgg agaattgtaa aataaagttt tctatttttc 2821 cttaaaaaaa aaaccggaat tccgggcttg ggaggctgag gcaggagact cgcttgaacc 2881 cgggaggcag aggttgcagt gagccaagat tgcgccattg cactccagag cctgggtgac 2941 agagcaagac tacatctcaa aaaataaata aataaaataa aagtaacaat aaaaataaaa 3001 agaacagcag agagaatgag caaggagaaa tgtcacaaac tattgcaaaa tactgttaca 3061 ctgggttggc tctccaagaa gatactggaa tctcttcagc catttgcttt tcagaagtag 3121 aaaccagcaa accacctcta agcggagaac atacgattct ttattaagta gctctgggga 3181 aggaaagaat aaaagttgat agctccctga ttgggaaaaa atgcacaatt aataaagaat 3241 gaagatgaaa gaaagcatgc ttatgttgta acacaaaaaa aattcacaaa cgttggtgga 3301 aggaaaacag tatagaaaac attactttaa ctaaaagctg gaaaaatttt cagttgggat 3361 gcgactgaca aaaagaacgg gatttccagg cataaagttg gcgtgagcta cagagggcac 3421 catgtggctc agtggaagac ccttcaagat tcaaagttcc atttgacaga gcaaaggcac 3481 ttcgcaagga gaagggttta aattatgggt ccaaaagcca agtggtaaag cgagcaattt 3541 gcagcataac tgcttctcct agacagggct gagtgggcaa aatacgacag tacacacagt 3601 gactattagc cactgccaga aacaggctga acagccctgg gagacaaggg aaggcaggtg 3661 gtgggagttg ttcatggaga gaaaggagag ttttagaacc agcacatcca ctggagatgc 3721 tgggccacca gacccctccc agtcaataaa gtctggtgcc tcatttgatc tcagcctcat 3781 catgaccctg gagagaccct gataccatct gccagtcccc gacagcttag gcactccttg 3841 ccatcaacct gaccccccga gtggttctcc aggctccctg ccccacccat tcaggccgga 3901 attc // LOCUS AF005654 2677 bp mRNA PRI 21-AUG-1997 DEFINITION Homo sapiens actin-binding double-zinc-finger protein (abLIM) mRNA, complete cds. ACCESSION AF005654 NID g2337951 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2677) AUTHORS Roof,D.J., Hayes,A., Adamian,M., Chishti,A.H. and Li,T. TITLE Molecular characterization of abLIM, a novel actin-binding and double zinc finger protein JOURNAL J. Cell Biol. 138 (3), 575-588 (1997) MEDLINE 97392688 REFERENCE 2 (bases 1 to 2677) AUTHORS Roof,D.J., Hayes,A., Adamian,M., Chishti,A.H. and Li,T. TITLE Direct Submission JOURNAL Submitted (23-MAY-1997) Ophthalmology/Howe Lab, Harvard Medical School, Massachusetts Eye and Ear Infirmary, 243 Charles Street, Boston, MA 02114, USA FEATURES Location/Qualifiers source 1..2677 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..2677 /gene="abLIM" CDS 121..2457 /gene="abLIM" /codon_start=1 /product="actin-binding double-zinc-finger protein" /db_xref="PID:g2337952" /translation="MPAFLGLKCLGKLCSSEKSKVTSSERTSARGSNRKRLIVEDRRV SGTSFTAHRRATITHLLYLCPKDYCPRGRVCNSVDPFVAHPQDPHHPSEKPVIHCHKC GEPCKGEVLRVQTKHFHIKCFTCKVCGCDLAQGGFFIKNGEYLCTLDYQRMYGTRCHG CGEFVEGEVVTALGKTYHPNCFACTICKRPFPPGDRVTFNGRDCLCQLCAQPMSSSPK ETTFSSNCAGCGRDIKNGQALLALDKQWHLGCFKCKSCGKVLTGEYISKDGAPYCEKD YQGLFGVKCEACHQFITGKVLEAGDKHYHPSCARCSRCNQMFTEGEEMYLQGSTVWHP DCKQSTKTEEKLRPTRTSSESIYSRPGSSIPGSPGHTIYAKVDNEILDYKDLAAIPKV KAIYDIERPDLITYEPFYTSGYDDKQERQSLGESPRTLSPTPSAEGYQDVRDRMIHRS TSQGSINSPVYSRHSYTPTTSRSPQHFHRPGNEPSSGRNSPLPYRPDSLPLTPTYAQA PKHFHVPDQGINIYRKPPIYKQHRALAAQSKSSEDIIKFSKFPAAQAPDPSETPKIET DHWPGPPSFAVVGPDMKRRSSGREEDDEELLRRRQLQEEQLMKLNSGLGQLILKEEME KESRERSSLLASRYDSPINSASHIPSSKTASLPGYGRNGLHRPVSTDFAQYNSYGDVS GGVRDYQTLPDGHMPAMRMDRGVSMPNMLEPKIFPYEMLMVTNRGRNKILREVDRTRL ERHLAPEVFREIFGMSIQEFDRLPLWRRNDMKKKAKLF" BASE COUNT 717 a 712 c 677 g 571 t ORIGIN 1 tcgagcggcc gcccgggcag gtagacagta aggagccagg ggaacagaga gaattgtggg 61 gcagcaccgc tccttgggtc cccactcccc atttctcatt gcttggaaat ccatgccaag 121 atgcctgcct tccttggtct aaagtgtctg gggaaattgt gcagctctga gaaaagcaaa 181 gtcacctcat ctgagagaac cagtgccagg ggctcgaaca gaaagagact gattgttgag 241 gaccggaggg tctctgggac ctccttcacc gctcataggc gtgccactat cactcatttg 301 ctgtatctct gtcccaagga ctactgccca cgtgggcgtg tatgtaacag cgttgatcct 361 tttgtggccc accctcagga ccctcaccac ccatcagaga agcctgtcat tcactgccat 421 aaatgtgggg agccttgcaa gggtgaagtg cttcgggtcc agaccaaaca tttccacatc 481 aagtgtttca cctgcaaagt gtgtggctgt gacctggcac aagggggctt cttcataaag 541 aacggagagt atctctgcac cctggactac cagcggatgt acgggacacg ctgccatggc 601 tgtggggagt tcgtggaggg cgaagtggtg actgctctgg gcaagaccta ccatcccaat 661 tgctttgctt gtactatctg caagcgcccg tttccacccg gagaccgagt cacattcaat 721 gggagagact gcctttgtca actctgtgca cagccgatgt cgtccagtcc gaaagaaacc 781 accttctcca gcaattgtgc cggctgcgga agagatatca agaacgggca ggcgctgctg 841 gcgctggata agcagtggca cttggggtgc tttaaatgca agtcctgcgg gaaggtcctc 901 accggggagt acatcagcaa ggatggtgct ccgtactgtg aaaaggacta ccagggactc 961 tttggggtga aatgtgaggc gtgtcaccag tttatcacag ggaaagtcct ggaggcaggt 1021 gacaaacatt accaccccag ctgtgcacga tgcagcagat gcaaccagat gttcacagaa 1081 ggagaggaaa tgtatcttca aggctccacc gtttggcatc ccgactgtaa gcaatctacg 1141 aagaccgagg aaaagctgcg gcctaccagg acatcctcgg aaagtattta ttctaggcca 1201 ggctccagta ttcctggctc accaggtcat actatctatg caaaagtaga caatgagatc 1261 ctggattaca aggatttagc agccattccg aaggtcaagg caatttatga cattgaacgt 1321 ccagatctta ttacctatga gcctttctac acttcgggct atgatgacaa acaggagaga 1381 cagagccttg gagagtctcc gaggactttg tctcctactc catcagcaga agggtaccag 1441 gatgttcggg atcggatgat ccatcggtcc acgagccagg gctccatcaa ctcccctgtg 1501 tacagccgcc acagctacac tccaaccacg tcccgctctc cccagcattt ccacagacct 1561 ggcaatgagc cgtccagcgg ccggaactcc cctctccctt accggccaga cagcctccct 1621 ctaactccaa cttacgctca ggcccctaaa catttccatg ttccagatca aggaatcaac 1681 atttaccgaa agccacccat ctacaaacag catcgtgcct tggcagccca gagcaagtcc 1741 tcagaagata tcatcaagtt ttccaagttc ccagcagccc aggcaccaga ccccagcgag 1801 acaccaaaga ttgagacgga ccactggcct ggtcccccct catttgctgt cgtaggacct 1861 gacatgaaac gcagatctag tggcagagag gaagatgatg aggaacttct gagacgtcgg 1921 cagcttcaag aagagcaatt aatgaagctt aactcaggcc tgggacagtt gatcttgaaa 1981 gaagagatgg agaaagagag ccgggaaagg tcatctctgt tagccagtcg ctacgattct 2041 cccatcaact cagcttcaca tattccatca tctaaaactg catctctccc tggctatgga 2101 agaaatgggc ttcaccggcc tgtttctacc gacttcgctc agtataacag ctatggggat 2161 gtcagcgggg gagtgcgaga ttaccagaca ctcccagatg gccacatgcc tgcaatgaga 2221 atggaccgag gagtgtctat gcccaacatg ttggaaccaa agatatttcc atatgaaatg 2281 ctcatggtga ccaacagagg gcgaaacaaa atcctcagag aggtggacag aaccaggctg 2341 gagcgccact tagcccctga agtgtttcgg gaaatctttg gaatgtccat acaggagttt 2401 gacaggttac ctctttggag acgcaacgac atgaagaaaa aagcaaaact cttctaagtc 2461 ccactcgtgt aatggcaatt agagaaggac tgacagtgcg gtgccccata ggatgtcata 2521 ttgaggccca aacttgattg gagatttgca aactaccgtc gctcagcaac accaaaaaga 2581 gaaagtctgg ttaaaacacc atgagtcaaa tgtcgggcca gccaacagta acacttgcca 2641 agaagcatgg cgtagaaatt tctatgttcc gaaaccc // LOCUS AF005774 2040 bp mRNA PRI 31-JUL-1997 DEFINITION Homo sapiens caspase-like apoptosis regulatory protein (clarp) mRNA, alternatively spliced, complete cds. ACCESSION AF005774 NID g2286144 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2040) AUTHORS Inohara,N., Koseki,T., Hu,Y., Chen,S. and Nunez,G. TITLE CLARP, a novel DED-containing protein interacts with caspase-8 and regulates apoptosis JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1997) In press REFERENCE 2 (bases 1 to 2040) AUTHORS Inohara,N., Koseki,T., Hu,Y., Chen,S. and Nunez,G. TITLE Direct Submission JOURNAL Submitted (28-MAY-1997) Department of Pathology, University of Michigan Medical School, 1150 W. Medical Center Dr., C558 MSRBII, Ann Arbor, MI 48109, USA FEATURES Location/Qualifiers source 1..2040 /organism="Homo sapiens" /db_xref="taxon:9606" /map="2; between D2S116 and D2S307" /chromosome="2" gene 1..2040 /gene="clarp" CDS 436..1878 /gene="clarp" /note="CLARP; alternatively spliced" /codon_start=1 /product="caspase-like apoptosis regulatory protein" /db_xref="PID:g2286145" /translation="MSAEVIHQVEEALDTDEKEMLLFLCRDVAIDVVPPNVRDLLDIL RERGKLSVGDLAELLYRVRRFDLLKRILKMDRKAVETHLLRNPHLVSDYRVLMAEIGE DLDKSDVSSLIFLMKDYMGRGKISKEKISWDLVVELEKLNLVAPDQLDLLEKCLKNIH RIDLKTKIQKYKQSVQGAGTSYRNVLQAAIQKSLKDPSNNFRLHNGRSKEQRLKEQLG AQQEPVKKSIQESEAFLPQSIPEERYKMKSKPLGICLIIDCIGNETELLRDTFTSLGY EVQKFLHLSMHGISQILGQFACMPEHRDYDSFVCVLVSRGGSQSVYGVDQTHSGLPLH HIRRMFMGESCPYLAGKPKMFFIQNYVVSEGPAGDSSLWRVDGPAMKNVEFRAQKRGL CTVHREADFFWSLCTADMSLLEQSHSSPSLYLQCLSQKLRQERKRPLLDLHIELNGYM YDWNSRVSAKEKYYVWLQHTLRKKLILSYT" repeat_region 1879..2023 /rpt_family="Alu" BASE COUNT 574 a 467 c 543 g 456 t ORIGIN 1 agcgagcttg cagcctcacc gacgagtctc aactaaaagg gactcccgga gctaggggtg 61 gggactcggc ctcacacagt gagtgccggc tattggactt ttgtccagtg acagctgaga 121 caacaaggac cacgggagga ggtgtaggag agaagcgccg cgaacagcga tcgcccagca 181 ccaagtccgc ttccaggctt tcggtttctt tgcctccatc ttgggtgcgc cttcccggcg 241 tctaggggag cgaaggctga ggtggcagcg gcaggagagt ccggccgcga caggacgaac 301 tcccccactg gaaaggattc tgaaagaaat gaagtcagcc ctcagaaatg aagttgactg 361 cctgctggct ttcctgttga ctggcccgga gctgtactgc aagacccttg tgagcttccc 421 tagtctaaga gtaggatgtc tgctgaagtc atccatcagg ttgaagaagc acttgataca 481 gatgagaagg agatgctgct ctttttgtgc cgggatgttg ctatagatgt ggttccacct 541 aatgtcaggg accttctgga tattttacgg gaaagaggta agctgtctgt cggggacttg 601 gctgaactgc tctacagagt gaggcgattt gacctgctca aacgtatctt gaagatggac 661 agaaaagctg tggagaccca cctgctcagg aaccctcacc ttgtttcgga ctatagagtg 721 ctgatggcag agattggtga ggatttggat aaatctgatg tgtcctcatt aattttcctc 781 atgaaggatt acatgggccg aggcaagata agcaaggaga agatttcttg ggaccttgtg 841 gttgagttgg agaaactaaa tctggttgcc ccagatcaac tggatttatt agaaaaatgc 901 ctaaagaaca tccacagaat agacctgaag acaaaaatcc agaagtacaa gcagtctgtt 961 caaggagcag ggacaagtta caggaatgtt ctccaagcag caatccaaaa gagtctcaag 1021 gatccttcaa ataacttcag gctccataat gggagaagta aagaacaaag acttaaggaa 1081 cagcttggcg ctcaacaaga accagtgaag aaatccattc aggaatcaga agcttttttg 1141 cctcagagca tacctgaaga gagatacaag atgaagagca agcccctagg aatctgcctg 1201 ataatcgatt gcattggcaa tgagacagag cttcttcgag acaccttcac ttccctgggc 1261 tatgaagtcc agaaattctt gcatctcagt atgcatggta tatcccagat tcttggccaa 1321 tttgcctgta tgcccgagca ccgagactac gacagctttg tgtgtgtcct ggtgagccga 1381 ggaggctccc agagtgtgta tggtgtggat cagactcact cagggctccc cctgcatcac 1441 atcaggagga tgttcatggg agaatcatgc ccttatctag cagggaagcc aaagatgttt 1501 tttattcaga actatgtggt gtcagagggc ccagctggag acagcagcct ctggagggtg 1561 gatgggccag cgatgaagaa tgtggaattc agggctcaga agcgagggct gtgcacagtt 1621 caccgagaag ctgacttctt ctggagcctg tgtactgcgg acatgtccct gctggagcag 1681 tctcacagct caccatccct gtacctgcag tgcctctccc agaaactgag acaagaaaga 1741 aaacgcccac tcctggatct tcacattgaa ctcaatggct acatgtatga ttggaacagc 1801 agagtttctg ccaaggagaa atattatgtc tggctgcagc acactctgag aaagaaactt 1861 atcctctcct acacataaga aaccaaaagg ctgggcgtag tggctcacac ctgtaatccc 1921 agcactttgg gaggccaagg agggcagatc acttcaggtc aggagttcga gaccagcctg 1981 gccaacatgg taaacgctgt ccctagtaaa aatacaaaaa ttaaaaaaaa aaaaaaaaaa // LOCUS AF005887 2474 bp mRNA PRI 01-AUG-1997 DEFINITION Homo sapiens ATF family member ATF6 (ATF6) mRNA, complete cds. ACCESSION AF005887 NID g2245629 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2474) AUTHORS Zhu,C., Johansen,F.E. and Prywes,R. TITLE Interaction of ATF6 and serum response factor JOURNAL Mol. Cell. Biol. (1997) In press REFERENCE 2 (bases 1 to 2474) AUTHORS Zhu,C., Johansen,F.E. and Prywes,R. TITLE Direct Submission JOURNAL Submitted (29-MAY-1997) Dept. of Biological Sciences, Columbia University, 1212 Amsterdam Avenue, Fairchild 813B, New York, NY 10027, USA FEATURES Location/Qualifiers source 1..2474 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..2474 /gene="ATF6" CDS 43..2055 /gene="ATF6" /function="bZIP transcription factor" /note="CREB binding serum response factor; similar to CREB-RP/G13 and ATF1; binds serum response factor; ATF family member" /codon_start=1 /product="ATF6" /db_xref="PID:g2245630" /translation="MGEPAGVAGTMESPFSPGLFHRLDEDWDSALFAELGYFTDTDEL QLEAANETYENNFDNLDFDLDLLPWESDIWDINNQICTVKDIKAEPQPLSPASSSYSV SSPRSVDSYSSTQHVPEELDLSSSSQMSPLSLYGENSNSLSSPEPLKEDKPVTGSRNK TENGLTPKKKIQVNSKPSIQPKPLLLPAAPKTQTNSSVPAKTIIIQTVPTLMPLAKQQ PIISLQPAPTKGQTVLLSQPTVVQLQAPGVLPSAQPVLAVAGGVTQLPNHVVNVVPAP SANSPVNGKLSVTKPVLQSTMRNVGSDIAVLRRQQRMIKNRESACQSRKKKKEYMLGL EARLKAALSENEQLKKENGTLKRQLDEVVSENQRLKVPSPKRRVVCVMIVLAFIILNY GPMSMLEQDSRRMNPSVGPANQRRHLLGFSAKEAQDTSDGIIQKNSYRYDHSVSNDKA LMVLTEEPLLYIPPPPCQPLINTTESLRLNHELRGWVHRHEVERTKSRRMTNNQQKTR ILQGVVEQGSNSQLMAVQYTETTSSISRNSGSELQVYYASPRSYQDFFEAIRRRGDTF YVVSFRRDHLLLPATTHNKTTRPKMSIVLPAININENVINGQDYEVMMQIDCQVMDTR ILHIKSSSVPPYLRDQQRNQTNTFFGSPPAATEATHVVSTIPESLQ" BASE COUNT 758 a 554 c 524 g 638 t ORIGIN 1 aagatattaa tcacggagtt ccagggaaaa ggaacttgtg aaatggggga gccggctggg 61 gttgccggca ccatggagtc accttttagc ccgggactct ttcacaggct ggatgaagat 121 tgggattctg ctctctttgc tgaacttggt tatttcacag acactgatga gctgcaattg 181 gaagcagcaa atgagacgta tgaaaacaat tttgataatc ttgattttga tttggatttg 241 ttaccttggg agtcagacat ttgggacatc aacaaccaaa tctgtacagt taaagatatt 301 aaggcagaac cccagccact ttctccagcc tcctcaagtt attcagtctc atctcctcgg 361 tcagtggact cttattcttc aactcagcat gttcctgagg agttggattt gtcttctagt 421 tctcagatgt ctcccctttc cttatatggt gaaaactcta atagtctctc ttcaccggag 481 ccactgaagg aagataagcc tgtcactggt tctaggaaca agactgaaaa tggactgact 541 ccaaagaaaa aaattcaggt gaattcaaaa ccttcaattc agcccaagcc tttattgctt 601 ccagcagcac ccaagactca aacaaactcc agtgttccag caaaaaccat cattattcag 661 acagtaccaa cgcttatgcc attggcaaag cagcaaccaa ttatcagttt acaacctgca 721 cccactaaag gccagacggt tttgctgtct cagcctactg tggtacaact tcaagcacct 781 ggagttctgc cctctgctca gccagtcctt gctgttgctg ggggagtcac acagctccct 841 aatcacgtgg tgaatgtggt accagcccct tcagcgaata gcccagtgaa tggaaaactt 901 tccgtgacta aacctgtcct acaaagtacc atgagaaatg tcggttcaga tattgctgtg 961 ctaaggagac agcaacgtat gataaaaaat cgagaatccg cttgtcagtc tcgcaagaag 1021 aagaaagaat atatgctagg gttagaggcg agattaaagg ctgccctctc agaaaacgag 1081 caactgaaga aagaaaatgg aacactgaag cggcagctgg atgaagttgt gtcagagaac 1141 cagaggctta aagtccctag tccaaagcga agagttgtct gtgtgatgat agtattggca 1201 tttataatac tgaactatgg acctatgagc atgttggaac aggattccag gagaatgaac 1261 cctagtgtgg gacctgcaaa tcaaaggagg caccttctag gattttctgc taaagaggca 1321 caggacacat cagatggtat tatccagaaa aacagctaca gatatgatca ttctgtttca 1381 aatgacaaag ccctgatggt gctaactgaa gaaccattgc tttacattcc cccacctcct 1441 tgtcagcccc taattaatac aacagagtct ctcaggttaa atcatgaact tcgaggatgg 1501 gttcatagac atgaagtaga aaggaccaag tctagaagaa tgacaaataa tcaacagaaa 1561 acccgtattc ttcagggtgt tgtggaacag ggctcaaatt ctcagctgat ggctgttcaa 1621 tacacagaaa ccactagtag tatcagcagg aactcaggga gtgagctaca agtgtattat 1681 gcttcaccca gaagttatca agactttttt gaagccatcc gcagaagggg agacacattt 1741 tatgttgtgt catttcgaag ggatcacctg ctgttaccag ctaccaccca taacaagacc 1801 acaagaccaa aaatgtcaat tgtgttacca gcaataaaca taaatgagaa tgtgatcaat 1861 gggcaggact acgaagtgat gatgcagatt gactgtcagg tgatggacac caggatcctc 1921 catatcaaaa gttcgtcggt tcctccttac ctccgagatc agcagaggaa tcaaaccaac 1981 accttctttg gctcccctcc cgcagccaca gaggcaaccc acgttgtcag caccatccct 2041 gagtcattac aatagcaccc gcagctatgt ggaaaactga gcgtgggacc cccagactga 2101 agagcaggtg agcaaaatgc tgcttttcct tggtggcagg cagagaactg ttcgtactag 2161 aattcaagga gaaaagaaga agaaataaaa gaagctgctc catttttcat catctaccca 2221 tctatttgga aagcactgga attcagatgc aagagaacaa tgtttcttca gtggcaaatg 2281 tagccctgca tcctccagtg ttacctggtg tagatttttt tttctgtacc tttctaaacc 2341 tctcttccct ctgtgatggt tttgtgttta aacagtcatc ttcttttaaa taatatccac 2401 ctctcctttt tgccatttca cttattgatt cataaagtga attttattta aagctaaaaa 2461 aaaaaaaaaa aaaa // LOCUS AF005898 2014 bp DNA PRI 26-JUN-1997 DEFINITION Homo sapiens Na,K-ATPase beta-3 subunit pseudogene, complete sequence. ACCESSION AF005898 NID g2209237 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2014) AUTHORS Malik,N., Canfield,V., Sanchez-Watts,G., Watts,A., Scherr,S., Beatty,B., Gros,P. and Levenson,R. TITLE Direct Submission JOURNAL Submitted (27-MAY-1997) Pharmacology, Hershey Medical Center, 500 University Drive, Hershey, PA 17033, USA FEATURES Location/Qualifiers source 1..2014 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="2" /map="2p13-p15" CDS 1..2014 /note="Na K-ATPase beta-3 subunit" /codon_start=1 /pseudo repeat_region 242..326 /note="imperfect 24 bp repeats" /rpt_type=tandem CDS 357..548 /note="orf" /codon_start=1 /db_xref="PID:g2218164" /translation="MKKESLNQSLAEWKLFIYNPTTGEFLWRTAKSWGLILLFYLIFY GFLAALFSFTMWVMLQTQR" BASE COUNT 519 a 473 c 457 g 565 t ORIGIN 1 gggcccgtag atagcttcag gatggggcta gtcaccagaa agtccaggtg attagaggat 61 tagaggattg gaacttaagc cccacccacc aacatctgga agggctgagg gtgcaagaaa 121 tttagctcta ttaaaaattt aaacactgca gtcggctcga gtactcccag taacgaggag 181 gtgttctcgg tcgtcccacc ctcccctgcc gtcgccgggc tgcgccgccg gagccgggac 241 gcgcctccgc tgccctcgcc tcctccgtcc ccgctgccct cgccgccccc gtccccgctg 301 ccctcgccgc ctccgtcccc gcggccgccg ctcccctcgt cgtccgcgcg cacacgatga 361 agaaggagtc cctcaaccag agcctcgccg agtggaagct cttcatctac aacccgacca 421 ccggagagtt cctgtggcgc accgccaaga gctggggttt gatcttgctc ttctacctaa 481 ttttttatgg gttcctggct gcactcttct cattcacgat gtgggttatg cttcagactc 541 aacgatgagg ttccaaaata ccgtgaccag attcttaacc caggactgat ggtttttcca 601 aaaccagtca ctgcattgga atatgcattc agtaggtctg atctaacttc gtatgcaggg 661 tacactgaag accttaagaa gtttctaaaa ccatatactt tagaagaaca gaaaaacctc 721 acagtctgtc ctgatggagc actttttgaa cagaagggtc cagtttatgt tgcttgtcaa 781 tctcccattt cgttacttca agcatgccgt ggtatgaatg atcttgattt cggctattct 841 caaagaaacc cttgtattct tgtgaaaatg aacagaataa ttggattaaa gtctgaagga 901 gtgccaagga tagattgtgt ttcgaagaat gaagatatac caaatgtagc agtttatcct 961 cataatggaa ttatagactt aaaatatttc ccatattatg gggaaaaact gcatgtgggg 1021 tatctacagc cattggttgc tgttcaggtc agctttgctc ctaacgatac tgggaaagaa 1081 gtaacagttg aatgcaagat tgatggatca gccaacctta aaagtcagga tgatcgtgac 1141 aagtttttgg gacgagttat gttcaaaatc acagcatgtg catagtatga gtaggatgtc 1201 tccacagagt aaattttgtg ttgtctgtct tcattttgta tcagctggac cttccattct 1261 agaattatga gaccaccttg gagaaaggtg tgtggtacat gacactgggt tacatcataa 1321 cgtgcttcca gatcatagtg ttcaatgtcc tctgaagtaa ctgcctcttg cctctgctgc 1381 cctttgaacc agtgtacagt caccagatag ggaccggtga acacctgatt ccaaacatgt 1441 aggatggggg tcttgtcctc tttttatgtg gtttaattgc caagtgtcta aagcttaata 1501 tgctgtgcta tgtaaatatt ttatggctat aacactgtca tattttgatg tcaacagagt 1561 tttagggata aaatggtacc cggccaacat caagtgactt tatagctgca agaaatctgg 1621 tatgtggaga agttctgcat gtgaggaagg aaaaaaagac aataaaaatg tatttgaaaa 1681 atatttttaa aaaaatttaa acactaagat ttgatgaatt tcctttctgc tgaacacctg 1741 gaggtgctgg gagggaggca cacccacagg tcgtggcagc accacacccc tccccccgta 1801 ctttgccata tccatctctt catctggatg ttcatctgta tcttatgatt tttttttttt 1861 gagacagaat cttgctctct tgcccaggct ggattgcaga ggcacgatca tggctcactg 1921 caacctctgc ctcccaagta gctgggatta caggtacgta ccaccacgcc tggctaattt 1981 ttgtattttt tctacagatg ggatttcacc atgg // LOCUS AF006011 2889 bp mRNA PRI 02-AUG-1997 DEFINITION Homo sapiens dishevelled 1 (DVL1) mRNA, complete cds. ACCESSION AF006011 NID g2291005 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2889) AUTHORS Semenov,M.V. and Snyder,M. TITLE Human Dishevelled Genes Comprise a DHR-containing Multigene Family JOURNAL Genomics (1997) In press REFERENCE 2 (bases 1 to 2889) AUTHORS Semenov,M.V. and Snyder,M. TITLE Human Dishevelled 1 (hDVL-1) mRNA, Complete cds JOURNAL Unpublished REFERENCE 3 (bases 1 to 2889) AUTHORS Semenov,M.V. and Snyder,M. TITLE Direct Submission JOURNAL Submitted (29-MAY-1997) Biology, Yale University, 219 Prospect Street, New Haven, CT 06520-8103, USA FEATURES Location/Qualifiers source 1..2889 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..2889 /gene="DVL1" CDS 4..2016 /gene="DVL1" /codon_start=1 /product="dishevelled 1" /db_xref="PID:g2291006" /translation="MGETKIIYHMDEEETPYLVKLPVAPERVTLADFKNVLSNRPVHA YKFFFKSMDQDFGVVKEEIFDDNAKLPCFNGRVVSWLVLAEGAHSDAGSQGTDSHTDL PPPLERTGGIGDSRPPSFHPNVASSRDGMDNETGTESMVSHRRERARRRNREEAPRTN GHPRGDRRRDVGLPPDSASTALSSELESSSFVDSDEDGSTSRLSSSTEQSTSSRLIRK HKRRRRKQRLRQADRASSFSSITDSTMSLNIVTVTLNMERHHFLGISIVGQSNDRGDG GIYIGSIMKGGAVAADGRIEPGDMLLQVNDVNFENMSNDDAVRVLREIVSQTGPISLT VAKCWDPTPRSYFTVPRADPVRPIDPAAWLSHTAALTGALPRYELEEAPLTVKSDMSA VVRVMQLPDSGLEIRDRMWLKITIANAVIGADVVDWLYTHVEGFKERREARKYASSLL KHGFLRHTVNKITFSEQCYYVFGDLCSNLATLNLNSGSSGTSDQDTLAPLPHPAAPWP LGQGYPYQYPGPPPCFPPAYQDPGFSYGSGSTGSQQSEGSKSSGSTRSSRRAPGREKE RRAAGAGGSGSESDHTAPSGVGSSWRERPAGQLSRGSSPRSQASATAPGLPPPHPTTK AYTVVGGPPGGPPVRELAAVPPELTGSRQSFQKAMGNPCEFFVDIM" BASE COUNT 542 a 946 c 929 g 472 t ORIGIN 1 accatgggcg agaccaagat tatctaccac atggacgagg aggagacgcc gtacctggtc 61 aagctgcccg tggcccccga gcgcgtcacg ctggccgact tcaagaacgt gctcagcaac 121 cggcccgtgc acgcctacaa attcttcttt aagtccatgg accaggactt cggggtggtg 181 aaggaggaga tctttgatga caatgccaag cttccctgct tcaacggccg cgtggtctcc 241 tggctggtcc tggctgaggg tgctcactcg gatgcggggt cccagggcac ggacagccac 301 acagacctgc ccccgcctct tgagcggaca ggcggcatcg gggactcccg gcccccctcc 361 ttccacccga atgtggccag cagccgtgac gggatggaca acgagacagg cacggagtcc 421 atggtcagtc accggcggga gcgtgcccga cgccggaacc gcgaggaggc cccccggacc 481 aatgggcacc caaggggaga ccgacggcgg gatgtggggc tgcccccaga cagcgcgtcc 541 accgccctca gcagcgagct tgagtccagc agctttgtgg actcggacga ggatggcagc 601 acgagcaggc tcagcagctc cacggagcag agcacctcat ccagactcat ccggaagcac 661 aaacgccggc ggaggaagca gcgccttcgg caggcggacc gggcctcctc cttcagcagc 721 ataaccgact ccaccatgtc cctcaacatc gtcactgtca cgctcaacat ggaaagacat 781 cactttctgg gcatcagcat cgtggggcag agcaacgacc gtggagacgg cggcatctac 841 attggctcca tcatgaaggg cggggctgtg gccgctgacg gccgcatcga gcccggcgac 901 atgttgctgc aggtgaatga cgtgaacttt gagaacatga gcaatgacga tgccgtgcgg 961 gtgctgcggg agatcgtttc ccagacgggg cccatcagcc tcactgtggc caagtgctgg 1021 gacccaacgc cccgaagcta cttcaccgtc ccacgggctg acccggtgcg gcccatcgac 1081 cccgccgcct ggctgtccca cacggcggca ctgacaggag ccctgccccg ctacgagctg 1141 gaagaggcgc cgctgacggt gaagagtgac atgagcgccg tcgtccgggt catgcagctg 1201 ccagactcgg gactggagat ccgcgaccgc atgtggctca agatcaccat cgccaatgcc 1261 gtcatcgggg cggacgtggt ggactggctg tacacacacg tggagggctt caaggagcgg 1321 cgggaggccc ggaagtacgc cagcagcttg ctgaagcacg gcttcctgcg gcacacggtc 1381 aacaagatca ccttctccga gcagtgctac tacgtcttcg gggatctctg cagcaatctc 1441 gccaccctga acctcaacag tggctccagt gggacttcgg atcaggacac gctggccccg 1501 ctgccccacc cggctgcccc ctggcctctg ggtcagggct acccctacca gtacccggga 1561 cccccaccct gcttcccgcc tgcctaccag gacccgggct ttagctatgg cagcggcagc 1621 accgggagtc agcagagtga agggagcaaa agcagtgggt ccacccggag cagccgccgg 1681 gccccgggcc gtgagaagga gcgtcgggcg gcgggagctg ggggcagtgg cagtgaatcg 1741 gatcacacgg caccgagtgg ggtggggagc agctggcgag agcgtccggc cggccagctc 1801 agccgtggca gcagcccacg cagtcaggcc tcggctaccg ccccggggct ccccccgccc 1861 caccccacga ccaaggccta tacagtggtg ggggggccac ccgggggacc ccctgtccgg 1921 gagctggctg ccgtcccccc ggaattgaca ggcagccgcc agtccttcca gaaggctatg 1981 gggaacccct gcgagttctt cgtggacatc atgtgactcg tggcgcatgc cccagccctg 2041 cctgaggtgg ggagctggcg gtcctgccgc atgcagagct cgcgtgggct tgccttcgtg 2101 ggggccagga cgggaggcag ggtggggggc aggctggacc accaccatct gccctggcag 2161 cctggctgct ccagctcctg acagcacctg tgtctgagca gccgtgttgg gggcgctccc 2221 tctctgcccc tcagcgagag cctcggacct cccaacccct tgtgtctggt ggggatccct 2281 cctgggatga ggaagacccc ctcgggctct cggctgaccc ccacctcctg cacagctgtg 2341 cccaggcccc cagggtggtc cacgcgggga caccccctgc ggtgcacagg ccccctgtct 2401 ggagtaggga tctaatttat ttatttattg cctggccggt gactcggggg aggaggcgac 2461 cctgtcatct gtcccacctg ctgctgcccc ttggagcagc ctgcaccttc tctcctccca 2521 tccggcaaca gtctgaaagt acgtggagga cgggaccgga agacgagaga gggctggaca 2581 tcctgcccac cgtgtcccag ccagggcagg gagggaccat ggcccgcagg gtcaaggggc 2641 cccgatgtgc acagctgcca cagggaggga ggtcttgggg agatgggcag tcaggtggcc 2701 cgtccttggt gagtgcacac actgcgcgca cacatcgcgg ccttcctggc ttctctggcc 2761 cccacgtgtc tgtgctgtag atactgtatc aaagtcccag cgtttagatg gttaacatag 2821 agctgcttct gtgtaaatgc tgcttatttt aaacactaaa aagcgtttaa ttttatggga 2881 aaaaaaaaa // LOCUS AF006043 2478 bp mRNA PRI 11-DEC-1997 DEFINITION Homo sapiens 3-phosphoglycerate dehydrogenase mRNA, complete cds. ACCESSION AF006043 NID g2674061 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2478) AUTHORS Cho,H.M., Jun,D.Y. and Kim,Y.H. TITLE Direct Submission JOURNAL Submitted (29-MAY-1997) Microbiology, College of Natural Sciences, Kyungpook National University, Taegu 702-701, South Korea REFERENCE 2 (bases 1 to 2478) AUTHORS Cho,H.M., Jun,D.Y. and Kim,Y.H. TITLE Direct Submission JOURNAL Submitted (10-DEC-1997) Microbiology, College of Natural Sciences, Kyungpook National University, Taegu 702-701, South Korea REMARK Sequence update by submitter FEATURES Location/Qualifiers source 1..2478 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T cell" /cell_line="Jurkat" CDS 693..2294 /note="A10" /codon_start=1 /product="3-phosphoglycerate dehydrogenase" /db_xref="PID:g2674062" /translation="MAFANLRKVLISDSLDPCCRKILQEGGLQVVEKQNLSKEELIAE LQDCEGLIVRSATKVTADVINAAEKLQVVGRAGTGVDNVDLEAATRKGILVMNTPNGN SLSAAELTCGMIMCLARQIPQATASMKDGKWERKKFMGTELNGKTLGILGLGRIGREV ATRMQSFGMKTIGYDPIISPEVSASFGVQQLPLEEIWPLCDFITVHTPLLPSTTGLLN DNTFAQCKKGVRVVNCARGGIVDEGALLRALQSGQCAGAALDVFTEEPPRDRALVDHE NVISCPHLGASTKEAQSRCGEEIAVQFVDMVKGKSLTGVVNAQALTSAFSPHTKPWIG LAEALGTLMRAWAGSPKGTIQVITQGTSLKNAGNCLSPAVIVGLLKEASKQADVNLVN AKLLVKEAGLNVTTSHSPAAPGEQGFGECLLAVALAGAPYQAVGLVQGTTPVLQGLNG AVFRPEVPLRRDLPLLLFRTQTSDPAMLPTMIGLLAEAGVRLLSYQTSLVSDGETWHV MGISSLLPSLEAWKQHVTEAFQFHF" BASE COUNT 525 a 710 c 775 g 468 t ORIGIN 1 cacctttccg cgggccgcgg ggatggcggc gcagggcgta gggcctgggc cggggtcggc 61 ggcgcccccg gggctggagg cggcccggca gaagctggcg ctgcggcgga agaaggtgct 121 gagcaccgaa ggagatggag ctgtacgagc tggcgcaggc ggcgggcggc gctatcgacc 181 ccgacgtgtt caagatcctg gtggacctgc tgaagctgaa cgtggccccc ctcgccgtct 241 tccagatgct caagtccatg tgtgccgggc agaggctagc gagcgagccc caggaccctg 301 cggccgtgtc tctgcccacg tcgagcgtgc ccgagacccg agggagaaac aaaggcagcg 361 ctgccctcgg gggagcattg gccctggcgg aacgcagcag ccgcgaagga tccagccaga 421 ggatgccacg ccagcccagc gctaccaggc tgcccaaggg gggcgggcct gggaagagcc 481 ctacacgggg cagcacctag gatggggcag agacttgttg catctttgtc cccagcaaag 541 gctacatgtt acctccttca attgataata aacctttctg agatgcaaac tcgagaatac 601 tgcccagtta ctctagcgcg ccaggccgaa ccgcagcttc ttggcttagg tacttctact 661 cacagcggcc gattccgagg ccaactccag caatggcttt tgcaaatctg cggaaagtgc 721 tcatcagtga cagcctggac ccttgctgcc ggaagatctt gcaagaggga gggctgcagg 781 tggtggaaaa gcagaacctt agcaaagagg agctgatagc ggagctgcag gactgtgaag 841 gccttattgt tcgctctgcc accaaggtga ccgctgatgt catcaacgca gctgagaaac 901 tccaggtggt gggcagggct ggcacaggtg tggacaatgt ggatctggag gccgcaacaa 961 ggaagggcat cttggttatg aacaccccca atgggaacag cctcagtgcc gcagaactca 1021 cttgtggaat gatcatgtgc ctggccaggc agattcccca ggcgacggct tcgatgaagg 1081 acggcaaatg ggagcggaag aagttcatgg gaacagagct gaatggaaag accctgggaa 1141 ttcttggcct gggcaggatt gggagagagg tagctacccg gatgcagtcc tttgggatga 1201 agactatagg gtatgacccc atcatttccc cagaggtctc ggcctccttt ggtgttcagc 1261 agctgcccct ggaggagatc tggcctctct gtgatttcat cactgtgcac actcctctcc 1321 tgccctccac gacaggcttg ctgaatgaca acacctttgc ccagtgcaag aagggggtgc 1381 gtgtggtgaa ctgtgcccgt ggagggatcg tggacgaagg cgccctgctc cgggccctgc 1441 agtctggcca gtgtgccggg gctgcactgg acgtgtttac ggaagagccg ccacgggacc 1501 gggccttggt ggaccatgag aatgtcatca gctgtcccca cctgggtgcc agcaccaagg 1561 aggctcagag ccgctgtggg gaggaaattg ctgttcagtt cgtggacatg gtgaagggga 1621 aatctctcac gggggttgtg aatgcccagg cccttaccag tgccttctct ccacacacca 1681 agccttggat tggtctggca gaagctctgg ggacactgat gcgagcctgg gctgggtccc 1741 ccaaagggac catccaggtg ataacacagg gaacatccct gaagaatgct gggaactgcc 1801 taagccccgc agtcattgtc ggcctcctga aagaggcttc caagcaggcg gatgtgaact 1861 tggtgaacgc taagctgctg gtgaaagagg ctggcctcaa tgtcaccacc tcccacagcc 1921 ctgctgcacc aggggagcaa ggcttcgggg aatgcctcct ggccgtggcc ctggcaggcg 1981 ccccttacca ggctgtgggc ttggtccaag gcactacacc tgtactgcag gggctcaatg 2041 gagctgtctt caggccagaa gtgcctctcc gcagggacct gcccctgctc ctattccgga 2101 ctcagacctc tgaccctgca atgctgccta ccatgattgg cctcctggca gaggcaggcg 2161 tgcggctgct gtcctaccag acttcactgg tgtcagatgg ggagacctgg cacgtcatgg 2221 gcatctcctc cttgctgccc agcctggaag cgtggaagca gcatgtgact gaagccttcc 2281 agttccactt ctaaccttgg agctcactgg tccctgcctc tggggctttt ctgaagaaac 2341 ccacccactg tgatcaatag ggagagaaaa tccacattct tgggctgaac gcgggcctct 2401 gacactgctt acactgcact ctgaccctgt agtacagcaa taaccgtcta ataaagagcc 2461 tacccccaaa aaaaaaaa // LOCUS AF006082 2704 bp mRNA PRI 29-JUL-1997 DEFINITION Homo sapiens actin-related protein Arp2 (ARP2) mRNA, complete cds. ACCESSION AF006082 NID g2282029 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2704) AUTHORS Welch,M.D., DePace,A.H., Verma,S., Iwamatsu,A. and Mitchison,T.J. TITLE The human Arp2/3 complex is composed of evolutionarily conserved subunits and is localized to cellular regions of dynamic actin filament assembly JOURNAL Journal of Cell Biology (1997) In press REFERENCE 2 (bases 1 to 2704) AUTHORS Welch,M.D., DePace,A.H., Verma,S., Iwamatsu,A. and Mitchison,T.J. TITLE Direct Submission JOURNAL Submitted (02-JUN-1997) Cellular and Molecular Pharmacology, University of California, San Francisco, 513 Parnassus Ave., San Francisco, CA 94143-0450, USA FEATURES Location/Qualifiers source 1..2704 /organism="Homo sapiens" /db_xref="taxon:9606" /note="EST clone Id number 587272" gene 1..2704 /gene="ARP2" CDS 75..1259 /gene="ARP2" /note="one of seven subunits of the Arp2/3 protein complex; actin-related protein" /codon_start=1 /product="Arp2" /db_xref="PID:g2282030" /translation="MDSQGRKVVVCDNGTGFVKCGYAGSNFPEHIFPALVGRPIIRST TKVGNIEIKDLMVGDEASELRSMLEVNYPMENGIVRNWDDMKHLWDYTFGPEKLNIDT RNCKILLTEPPMNPTKNREKIVEVMFETYQFSGVYVAIQAVLTLYAQGLLTGVVVDSG DGVTHICPVYEGFSLPHLTRRLDIAGRDITRYLIKLLLLRGYAFNHSADFETVRMIKE KLCYVGYNIEQEQKLALETTVLVESYTLPDGRIIKVGGERFEAPEALFQPHLINVEGV GVAELLFNTIQAADIDTRSEFYKHIVLSGGSTMYPGLPSRLERELKQLYLERVLKGDV EKLSKFKIRIEDPPRRKHMVFLGGAVLADIMKDKDNFWMTRQEYQEKGVRVLEKLGVT VR" BASE COUNT 786 a 471 c 599 g 848 t ORIGIN 1 ggcacgagga gaaaacggcc gggcggcggt ggctgtaggt tgtgcggctg cagcggctct 61 tccctgggcg gacgatggac agccagggca ggaaggtggt ggtgtgcgac aacggcaccg 121 ggtttgtgaa gtgtggatat gcaggctcta actttccaga acacatcttc ccagctttgg 181 ttggaagacc tattatcaga tcaaccacca aagtgggaaa cattgaaatc aaggatctta 241 tggttggtga tgaggcaagt gaattacgat caatgttaga agttaactac cctatggaaa 301 atggcatagt acgaaattgg gatgacatga aacacctgtg ggactacaca tttggaccag 361 agaaacttaa tatagatacc agaaattgta aaatcttact cacagaacct cctatgaacc 421 caaccaaaaa cagagagaag attgtagagg taatgtttga aacttaccag ttttccggtg 481 tatatgtagc catccaggca gttctgactt tgtacgctca aggtttattg actggtgtag 541 tggtagactc tggagatggt gtgactcaca tttgcccagt atatgaaggc ttttctctcc 601 ctcatcttac caggagactg gatattgctg ggagggatat aactagatat cttatcaagc 661 tacttctgtt gcgaggatac gccttcaacc actctgctga ttttgaaacg gttcgcatga 721 ttaaagaaaa actgtgttac gtgggatata atattgagca agagcagaaa ctggccttag 781 aaaccacagt attagttgaa tcttatacac tcccagatgg acgtatcatc aaagttgggg 841 gagagagatt tgaagcacca gaagctttat ttcagcctca cttgatcaat gttgaaggag 901 ttggtgttgc tgaattgctt tttaacacaa ttcaggcagc tgacattgat accagatctg 961 aattctacaa acacattgtg ctttctggag ggtctactat gtatcctggc ctgccatcac 1021 ggttggaacg agaacttaaa cagctttact tagaacgagt tttgaagggt gatgtggaaa 1081 aactttctaa atttaagatc cgcattgaag acccaccccg cagaaagcac atggtattcc 1141 tgggtggtgc agttctagcg gatatcatga aagacaaaga caacttttgg atgacccgac 1201 aagagtacca agaaaagggt gtccgtgtgc tagagaaact tggtgtgact gttcgataaa 1261 ctccaaagct tgttcccatc atacccgtaa tgctttcttt tttcctttat tgccaatctt 1321 tgaactcatt caactccagg acatggaaga ggcctctctc tgccctttga ctggaaaggt 1381 caagttttat tctggtgtct tggggaagct ttgttaaatt tttgttaatg tgggtaaatc 1441 tgagtttaat tcaactgctt ccctatatag actagagggc taaggattct gtctgctgct 1501 ttgtttcttc taagtaggca tttagatcat tcctgtaggc ttcctatttt cactttactg 1561 ctctaatgct gctagtcgta gtctttagca cactaggtgg tatgccttta ttagcataaa 1621 acaaaaaaaa ctttaacagg agcttttaca tattactggg atggggggtg gttcgggatg 1681 ggtgggcagc tgctgaaccc tttagggcat ttcctctgta atgtggcgct ttcaactgta 1741 ctgctgcagc tttaagtacc ttaaagcttc tcctgtgaac ttcttaggga aatgttaggt 1801 tcagaactaa agtgttttgg gtgggttttg ttgcgggggg gagggtaaca atgggtggtc 1861 ttctgatttt tatttttgag gttttgtcaa ctggagtacg tagaggaact ttatttacag 1921 tactttgatt tggcaggttt tcttctactt gtgctctgcc tggagctgtt tccatatgat 1981 ataaaaagca agtgtagtat tccattacta tgtggcttag ggatttattt gttttttaaa 2041 atcaaccatg ttagctggga ttagactccc tacagtcctt caatggaaaa gtaacattta 2101 aaaatccttt gggtaattca aattacagat ttaaaagagc ttaagatctg gtgttttgtt 2161 aatgcttctg tttattccag aagcattaag gtaacccatt gccaagtatc attcttgcaa 2221 attattcttt tatataactg accagtgctt aataaaacaa gcaggtactt acaaataatt 2281 actggcagta ggttataatt ggtggtttaa aaataacatt ggaatacagg acttgttgcc 2341 aattgggtaa ttttcattag ttgttttgtt tgttttgatt tgaaacctgg aaatacagta 2401 aaatttgact gtttaaaatg ttggccaaaa aaatcaagat ttaatttttt tatttgtact 2461 gaaaaactaa tcataactgt taattctcag ccatctttga agcttgaaag aagagtcttt 2521 ggtattttgt aaacgttagc agactttcct gccagtgtca gaaaatccta tttatgaatc 2581 ctgtcggtat tccttggtat ctgaaaaaaa taccaaatag taccatacat gagttatttc 2641 taagtttgaa aaataaaaag aaattgcatc acactaatta caaaataaaa aaaaaaaaaa 2701 aaaa // LOCUS AF006083 2146 bp mRNA PRI 29-JUL-1997 DEFINITION Homo sapiens actin-related protein Arp3 (ARP3) mRNA, complete cds. ACCESSION AF006083 NID g2282031 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2146) AUTHORS Welch,M.D., DePace,A.H., Verma,S., Iwamatsu,A. and Mitchison,T.J. TITLE The human Arp2/3 complex is composed of evolutionarily conserved subunits and is localized to cellular regions of dynamic actin filament assembly JOURNAL J. Cell Biol. (1997) In press REFERENCE 2 (bases 1 to 2146) AUTHORS Welch,M.D., DePace,A.H., Verma,S., Iwamatsu,A. and Mitchison,T.J. TITLE Direct Submission JOURNAL Submitted (02-JUN-1997) Cellular and Molecular Pharmacology, University of California, San Francisco, 513 Parnassus Ave., San Francisco, CA 94143-0450, USA FEATURES Location/Qualifiers source 1..2146 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..2146 /gene="ARP3" CDS 173..1429 /gene="ARP3" /note="one of seven subunits of the Arp2/3 protein complex; actin-related protein" /codon_start=1 /product="Arp3" /db_xref="PID:g2282032" /translation="MAGRLPACVVDCGTGYTKLGYAGNTEPQFIIPSCIAIKESAKVG DQAQRRVMKGVDDLDFFIGDEAIEKPTYATKWPIRHGIVEDWDLMERFMEQVIFKYLR AEPEDHYFLLTEPPLNTPENREYTAEIMFESFNVPGLYIAVQAVLALAASWTSRQVGE RTLTGTVIDSGDGVTHVIPVAEGYVIGSCIKHIPIAGRDITYFIQQLLRDREVGIPPE QSLETAKAVKERYSYVCPDLVKEFNKYDTDGSKWIKQYTGINAISKKEFSIDVGYERF LGPEIFFHPEFANPDFTQPISEVVDEVIQNCPIDVRRPLYKNIVLSGGSTMFRDFGRR LQRDLKRTVDARLKLSEELSGGRLKPKPIDVQVITHHMQRYAVWFGGSMLASTPEFYQ VCHTKKDYEEIGPSICRHNPVFGVMS" BASE COUNT 643 a 387 c 504 g 612 t ORIGIN 1 ggcacgaggc ctgctgcttt cttgctactg cttcggcttc ccggctaccc cccggacggt 61 gaaggcggcc cagctgtgga tggtcagata gcccttgtct cccgccgcca atctctggcc 121 cctagcagca cggagcagac ggcggcagca gcagcagcag gcgaggagga agatggcggg 181 acggctgccg gcctgtgtgg tggactgtgg cacggggtat acaaaactag gatatgctgg 241 aaatacagaa ccacagttta tcatcccttc ctgtattgct attaaggagt cagcaaaagt 301 gggtgatcaa gctcaaagga gggtgatgaa aggtgttgat gacctagact tcttcattgg 361 tgatgaagca atagaaaaac ctacatatgc aacaaagtgg ccaatccgcc atggtatagt 421 tgaagattgg gacttaatgg aaaggtttat ggagcaagtg atctttaaat atttaagggc 481 agaacctgaa gaccattatt ttcttttgac tgaacctcca ttgaatactc cagaaaacag 541 ggaatatact gctgaaataa tgtttgagtc cttcaatgtt ccaggcttgt acattgctgt 601 gcaggctgtt cttgccttag ctgcatcttg gacctcaaga caagtaggag aacggacgtt 661 gaccggtacg gtaatagaca gtggagatgg tgtcactcat gtcattcctg tggctgaagg 721 gtatgtgatt ggcagctgta ttaaacacat tccaatcgca ggacgagata taacatattt 781 tattcagcaa ctgctgagag accgagaagt aggaatccct ccagaacaat ccttggaaac 841 tgctaaggca gtaaaggagc gctatagtta tgtctgccca gatttagtaa aagaatttaa 901 caagtatgat acagatgggt caaaatggat taaacagtat actggaatca atgctatctc 961 aaagaaagag ttttctatcg atgttggtta tgagagattt ttgggacctg aaatcttttt 1021 tcatccagag tttgctaatc cagactttac acaacctatc tcagaagttg tagatgaagt 1081 aattcagaat tgtcctattg atgtcagacg tcctctctac aagaatattg tcctctctgg 1141 aggttcaacc atgttcaggg actttggacg tcgcttgcaa agagatttga aaagaactgt 1201 agatgcccgg ctgaaattaa gtgaggaatt gagtggtggt agattgaagc caaaacctat 1261 tgatgtacaa gtcattacac accacatgca gcgatatgca gtttggtttg gaggatcaat 1321 gctggcttcc acgcctgagt tctaccaagt atgccacacc aaaaaggatt atgaagaaat 1381 tggacctagc atttgtcgtc acaatccagt gtttggagtc atgtcgtaaa attggcttca 1441 tagttattgg ggttagggag gtggggaaga gataatcttt ctgattacct gttttgtctg 1501 gatggctggt tttgaggttt taaacctgac ttgaaatagt aacaccaaac atgattatac 1561 aggaatattt taataagtgt atcaccatgc agatgtagaa gagagcgaaa gtgattgtgt 1621 ttttctttag attgaatatt tgaatcttat gtgtaacaaa aagaagtggg ttttagttct 1681 ttctgtgccc tgatattttg tatattaatg aattatccaa gattcgatgg gatttatcag 1741 tgtgtagata gctctataat gcttgaattg tacacttcta agtgtgcagt gcaagagctt 1801 gtttatattt catacttttt atactttgag gaaaaaaagt caaagaaaaa ttgtatttga 1861 gggaaaaaac catgaccaag taaaggataa attcaaaaaa tagcctcatg agacttggca 1921 tacacactcg tgggattcca gttattatgg agtgcttcca tccctctcca ccccttcccc 1981 ccaaaaggtt ttctttgcaa gtgcttttgg aactaagagc tagtatcttg gattaactga 2041 tgcctgctag tgctttctga ttactcgcat tctgtttctt gctttaaaag aagagtaaag 2101 acaagagtgt tggaccagta aaaaaaaaaa aaaaaaaaaa aaaaaa // LOCUS AF006084 1428 bp mRNA PRI 29-JUL-1997 DEFINITION Homo sapiens Arp2/3 protein complex subunit p41-Arc (ARC41) mRNA, complete cds. ACCESSION AF006084 NID g2282033 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1428) AUTHORS Welch,M.D., DePace,A.H., Verma,S., Iwamatsu,A. and Mitchison,T.J. TITLE The human Arp2/3 complex is composed of evolutionarily conserved subunits and is localized to cellular regions of dynamic actin filament assembly JOURNAL J. Cell Biol. (1997) In press REFERENCE 2 (bases 1 to 1428) AUTHORS Welch,M.D., DePace,A.H., Verma,S., Iwamatsu,A. and Mitchison,T.J. TITLE Direct Submission JOURNAL Submitted (02-JUN-1997) Cellular and Molecular Pharmacology, University of California, San Francisco, 513 Parnassus Ave., San Francisco, CA 94143-0450, USA FEATURES Location/Qualifiers source 1..1428 /organism="Homo sapiens" /db_xref="taxon:9606" /note="EST clone Id number 527050" gene 1..1428 /gene="ARC41" CDS 81..1199 /gene="ARC41" /note="WD repeat containing protein; similar to Sop2Hs; 41 kD subunit of the Arp2/3 protein complex" /codon_start=1 /product="p41-Arc" /db_xref="PID:g2282034" /translation="MAYHSFLVEPISCHAWNKDRTQIAICPNNHEVHIYEKSGAKWTK VHELKEHNGQVTGIDWAPESNRIVTCGTDRNAYVWTLKGRTWKPTLVILRINRAARCV RWAPNENKFAVGSGSRVISICYFEQENDWWVCKHIKKPIRSTVLSLDWHPNNVLLAAG SCDFKCRIFSAYIKEVEERPAPTPWGSKMPFGELMFESSSSCGWVHGVCFSASGSRVA WVSHDSTVCLADADKKMAVATLASETLPLLALTFITDNSLVAAGHDCFPVLFTYDAAA GMLSFGGRLDVPKQSSQRGLTARERFQNLDKKASSEGGTAAGAGLDSLHKNSVSQISV LSGGKAKCSQFCTTGMDGGMSIWDVKSLESALKDLKIK" BASE COUNT 312 a 405 c 445 g 266 t ORIGIN 1 ggcacgaggg agcccagagc cggttcggcg cgtcgactgc ccagagtccg cggccggggc 61 gcgggaggag ccaagccgcc atggcctacc acagcttcct ggtggagccc atcagctgcc 121 acgcctggaa caaggaccgc acccagattg ccatctgccc caacaaccat gaggtgcata 181 tctatgaaaa gagcggtgcc aaatggacca aggtgcacga gctcaaggag cacaacgggc 241 aggtgacagg catcgactgg gcccccgaga gtaaccgtat tgtgacctgc ggcacagacc 301 gcaacgccta cgtgtggacg ctgaagggcc gcacatggaa gcccacgctg gtcatcctgc 361 ggatcaaccg ggctgcccgc tgcgtgcgct gggcccccaa cgagaacaag tttgctgtgg 421 gcagcggctc tcgtgtgatc tccatctgtt atttcgagca ggagaatgac tggtgggttt 481 gcaagcacat caagaagccc atccgctcca ccgtcctcag cctggactgg caccccaaca 541 atgtgctgct ggctgccggc tcctgtgact tcaagtgtcg gatcttttca gcctacatca 601 aggaggtgga ggaacggccg gcacccaccc cgtggggctc caagatgccc tttggggaac 661 tgatgttcga atccagcagt agctgcggct gggtacatgg cgtctgtttc tcagccagcg 721 ggagccgcgt ggcctgggta agccacgaca gcaccgtctg cctggctgat gccgacaaga 781 agatggccgt cgcgactctg gcctctgaaa cactaccact gctggcgctg accttcatca 841 cagacaacag cctggtggca gcgggccacg actgcttccc ggtgctgttc acctatgacg 901 ccgccgcggg gatgctgagc ttcggcgggc ggctggacgt tcctaagcag agctcgcagc 961 gtggcttgac ggcccgcgag cgcttccaga acctggacaa gaaggcgagc tccgagggtg 1021 gcacggctgc gggcgcgggc ctagactcgc tgcacaagaa cagcgtcagc cagatctcgg 1081 tgctcagcgg cggcaaggcc aagtgctcgc agttctgcac cactggcatg gatggcggca 1141 tgagtatctg ggatgtgaag agcttggagt cagccttgaa ggacctcaag atcaaatgac 1201 ctgtgaggaa tatgttgcct tcatcctaac tgctggggaa gcggggagag gggtcaggga 1261 ggctaatggt tgctttgctg aatgtttctg gggtaccaat acgagttccc ataggggctg 1321 ctccctcaaa aagggagggg acagatgggg agcttttctt acctattcaa ggaatacgtg 1381 cctttttctt aaatgctttc atttattgaa aaaaaaaaaa aaaaaaaa // LOCUS AF006085 1198 bp mRNA PRI 29-JUL-1997 DEFINITION Homo sapiens Arp2/3 protein complex subunit p34-Arc (ARC34) mRNA, complete cds. ACCESSION AF006085 NID g2282035 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1198) AUTHORS Welch,M.D., DePace,A.H., Verma,S., Iwamatsu,A. and Mitchison,T.J. TITLE The human Arp2/3 complex is composed of evolutionarily conserved subunits and is localized to cellular regions of dynamic actin filament assembly JOURNAL J. Cell Biol. (1997) In press REFERENCE 2 (bases 1 to 1198) AUTHORS Welch,M.D., DePace,A.H., Verma,S., Iwamatsu,A. and Mitchison,T.J. TITLE Direct Submission JOURNAL Submitted (02-JUN-1997) Cellular and Molecular Pharmacology, University of California, San Francisco, 513 Parnassus Ave., San Francisco, CA 94143-0450, USA FEATURES Location/Qualifiers source 1..1198 /organism="Homo sapiens" /db_xref="taxon:9606" /note="EST clone Id number 547253" gene 1..1198 /gene="ARC34" CDS 85..987 /gene="ARC34" /note="34 kD subunit of the Arp2/3 protein complex" /codon_start=1 /product="p34-Arc" /db_xref="PID:g2282036" /translation="MILLEVNNRIIEETLALKFENAAAGNKPEAVEVTFADFDGVLYH ISNPNGDKTKVMVSISLKFYKELQAHGADELLKRVYGSFLVNPESGYNVSLLYDLENL PASKDSIVHQAGMLKRNCFASVFEKYFQFQEEGKEGENRAVIHYRDDETMYVESKKDR VTVVFSTVFKDDDDVVIGKVFMQEFKEGRRASHTAPQVLFSHREPPLELKDTDAAVGD NIGYITFVLFPRHTNASARDNTINLIHTFRDYLHYHIKCSKAYIHTRMRAKTSDFLKV LNRARPDAEKKEMKTITGKTFSSR" BASE COUNT 344 a 287 c 283 g 284 t ORIGIN 1 ggcacgagct ctccctccgt ccttgcctcc cttacccacc ctcaccggcc cttgtttctc 61 cttcccctgg gggcagccgc cgccatgatc ctgctggagg tgaacaaccg catcatcgag 121 gagacgctcg cgctcaagtt cgagaacgcg gccgccggaa acaaaccgga agcagtagaa 181 gtaacatttg cagatttcga tggggtcctc tatcatattt caaatcctaa tggagacaaa 241 acaaaagtga tggtcagtat ttctttgaaa ttctacaagg aacttcaggc acatggtgct 301 gatgagttat taaagagggt gtacgggagt ttcttggtaa atccagaatc aggatacaat 361 gtctctttgc tatatgacct tgaaaatctt ccggcatcca aggattccat tgtgcatcaa 421 gctggcatgt tgaagcgaaa ttgttttgcc tctgtctttg aaaaatactt ccaattccaa 481 gaagagggca aggaaggaga gaacagggca gttatccatt atagggatga tgagaccatg 541 tatgttgagt ctaaaaagga cagagtcaca gtagtcttca gcacagtgtt taaggatgac 601 gacgatgtgg tcattggaaa ggtgttcatg caggagttca aagaaggacg cagagccagc 661 cacacagccc cacaggtcct ctttagccac agggaacctc ctctggagct gaaagacaca 721 gacgccgctg tgggtgacaa cattggctac attacctttg tgctgttccc tcgtcacacc 781 aatgccagtg ctcgagacaa caccatcaac ctgatccaca cgttccggga ctacctgcac 841 taccacatca agtgctctaa ggcctatatt cacacacgta tgcgggcgaa aacgtctgac 901 ttcctcaagg tgctgaaccg cgcacgccca gatgccgaga aaaaagaaat gaaaacaatc 961 acggggaaga cgttttcatc ccgctaatct tgggaataag aggaggaagc ggctggcaac 1021 tgaaggctgg aacacttgct actggataat cgtagctttt aatgttgcgc ctcttcaggt 1081 tcttaaggga ttctccgttt tggttccatt ttgtacacgt ttggaaaata atctgcagaa 1141 acgagctgtg cttgcaaaga cttcatagtt cccaagaatt aaaaaaaaaa aaaaaaaa // LOCUS AF006087 820 bp mRNA PRI 29-JUL-1997 DEFINITION Homo sapiens Arp2/3 protein complex subunit p20-Arc (ARC20) mRNA, complete cds. ACCESSION AF006087 NID g2282039 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 820) AUTHORS Welch,M.D., DePace,A.H., Verma,S., Iwamatsu,A. and Mitchison,T.J. TITLE The human Arp2/3 complex is composed of evolutionarily conserved subunits and is localized to cellular regions of dynamic actin filament assembly JOURNAL J. Cell Biol. (1997) In press REFERENCE 2 (bases 1 to 820) AUTHORS Welch,M.D., DePace,A.H., Verma,S., Iwamatsu,A. and Mitchison,T.J. TITLE Direct Submission JOURNAL Submitted (02-JUN-1997) Cellular and Molecular Pharmacology, University of California, San Francisco, 513 Parnassus Ave., San Francisco, CA 94143-0450, USA FEATURES Location/Qualifiers source 1..820 /organism="Homo sapiens" /db_xref="taxon:9606" /note="EST clone Id number 187446" gene 1..820 /gene="ARC20" CDS 16..522 /gene="ARC20" /note="20 kD subunit of the Arp2/3 protein complex" /codon_start=1 /product="p20-Arc" /db_xref="PID:g2282040" /translation="MTATLRPYLSAVRATLQAALCLENFSSQVVERHNKPEVEVRSSK ELLLQPVTISRNEKEKVLIEGSINSVRVSIAVKQADEIEKILCHKFMRFMMMRAENFF ILRRKPVEGYDISFLITNFHTEQMYKHKLVDFVIHFMEEIDKEISEMKLSVNARARIV AEEFLKNF" BASE COUNT 201 a 201 c 231 g 187 t ORIGIN 1 cagccagcgc ccgcgatgac tgccactctc cgcccctacc tgagtgccgt gcgggccaca 61 ttgcaggctg ccctctgcct ggagaacttc tcctcccagg ttgtggaacg acacaacaag 121 ccggaagtgg aagtcaggag tagcaaagag ctcctgttac aacctgtgac catcagcagg 181 aatgagaagg aaaaggttct gattgagggc tccatcaact ctgtccgggt cagcattgct 241 gtgaaacagg ctgatgagat cgagaagatt ttgtgccaca agttcatgcg cttcatgatg 301 atgcgagcag agaacttctt tatccttcga aggaagcctg tggaggggta tgatatcagc 361 tttctgatca ccaacttcca cacagagcag atgtacaaac acaagttggt ggactttgtg 421 atccacttca tggaggagat tgacaaggag atcagtgaga tgaagctgtc agtcaatgcc 481 cgtgcccgca ttgtggctga agagttcctt aagaattttt aaaccatctg gctggatctc 541 gtggccttcc ccctcagact acccatgtct ccacgaaggc gtcctggagt cactccccga 601 gcagcgcggc ggcggcaggg agttgggttg gggtgggcat ttgatgcggg aggtgggtgg 661 tgtgcttgct agctgggcaa gaaagcagca gtggacctgc cccaaggcca cacgtgcctg 721 gtcaggctgg cttctgatgt tcagtcccct gggccgggac agattttttt taacgtcttg 781 aaacttaaac tctgtgcttg taaaaaaaaa aaaaaaaaaa // LOCUS AF006088 1833 bp mRNA PRI 29-JUL-1997 DEFINITION Homo sapiens Arp2/3 protein complex subunit p16-Arc (ARC16) mRNA, complete cds. ACCESSION AF006088 NID g2282041 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1833) AUTHORS Welch,M.D., DePace,A.H., Verma,S., Iwamatsu,A. and Mitchison,T.J. TITLE The human Arp2/3 complex is composed of evolutionarily conserved subunits and is localized to cellular regions of dynamic actin filament assembly JOURNAL J. Cell Biol. (1997) In press REFERENCE 2 (bases 1 to 1833) AUTHORS Welch,M.D., DePace,A.H., Verma,S., Iwamatsu,A. and Mitchison,T.J. TITLE Direct Submission JOURNAL Submitted (02-JUN-1997) Cellular and Molecular Pharmacology, University of California, San Francisco, 513 Parnassus Ave., San Francisco, CA 94143-0450, USA FEATURES Location/Qualifiers source 1..1833 /organism="Homo sapiens" /db_xref="taxon:9606" /note="EST clone Id number 300395" gene 1..1833 /gene="ARC16" CDS 25..480 /gene="ARC16" /note="16 kD subunit of the Arp2/3 protein complex" /codon_start=1 /product="p16-Arc" /db_xref="PID:g2282042" /translation="MSKNTVSSARFRKVDVDEYDENKFVDEEDGGDGQAGPDEGEVDS CLRQGNMTAALQAALKNPPINTKSQAVKDRAGSIVLKVLISFKANDIEKAVQSLDKNG VDLLMKYIYKGFESPSDNSSAMLLQWHEKALAAGGVGSIVRVLTARKTV" BASE COUNT 563 a 321 c 389 g 560 t ORIGIN 1 cctttgccgc tggtcgggat tgggatgtcg aagaacacag tgtcgtcggc ccgcttccgg 61 aaggtggacg tggatgaata tgacgagaac aagttcgtgg acgaagaaga tgggggcgac 121 ggccaggccg ggcccgacga gggcgaggtg gactcctgcc tgcggcaagg aaacatgaca 181 gctgccctac aggcagctct gaagaacccc cctatcaaca ccaagagtca ggcagtgaag 241 gaccgggcag gcagcattgt cttgaaggtg ctcatctctt ttaaagctaa tgatatagaa 301 aaggcagttc aatctctgga caagaatggt gtggatctcc taatgaagta tatttataaa 361 ggatttgaga gcccgtctga caatagcagt gctatgttac tgcaatggca tgaaaaggca 421 cttgctgctg gaggagtagg gtccattgtt cgtgtcttga ctgcaagaaa aactgtgtag 481 tctggcagga agtggattat ctgcctcggg agtgggaatt gctggtacaa agaccaaaac 541 aaccaaatgc caccgctgcc ctgtgggtag catctgtttc tctcagcttt gccttcttgc 601 tttttcatat ctgtaaagaa aaaaattaca tatcagttgt cctttaatga aaattgggat 661 aatatagaag aaattgtgtt aaaatagaag tgtttcatcc tttcaaaacc atttcagtga 721 tgtttatacc aatctgtata tagtataatt tacattcaag tttaattgtg caacttttaa 781 cccctgttgg ctggtttttt gttctgtttt gttttgtatt atttttaact aatactgaga 841 gatttggtca gaatttgagg ccagtttcct agctcattgc tagtcaggaa atgatattta 901 taaaaaatat gagagactgg cagctattaa cattgcaaaa ctggaccata tttcccttat 961 ttaataagca aaatatgttt ttggaataag tggtgggtga ataccactgc caagttatag 1021 ctttgttttt gcttgcctcc tgattatctg tactgtgggt ttaagtatgc tactttctct 1081 cagcatccaa taatcatggc ccctcaattt atttgtggtc acccagggtt cagagcaaga 1141 agtcttgctt tatacaaatg tatccataaa atatcagagc ttgttgggca tgaacatcaa 1201 acttttgttc cactaatatg gctctgtttg gaaaaaactg caaatcagaa agaatgattt 1261 gcagaaagaa agaaaaacta tggtgtaatt taaactctgg gcagcctctg aatgaaatgc 1321 tactttcttt agaaatataa tagctgcctt agacattatg aggtatacaa ctagtattta 1381 agataccatt taatatgccc cgtaaatgtc ttcagtgttc ttcagggtag ttgggatctc 1441 aaaagatttg gttcagatcc aaacaaatac acattctgtg ttttagctca gtgttttcta 1501 aaaaaagaaa ctgccacaca gcaaaaaatt gtttactttg ttggacaaac caaatcagtt 1561 ctcaaaaaat gaccggtgct tataaaaagt tataaatatc gagtagctct aaaacaaacc 1621 acctgaccaa gagggaagtg agcttgtgct tagtatttac attggatgcc agttttgtaa 1681 tcactgactt atgtgcaaac tggtgcagaa attctataaa ctctttgctg tttttgatac 1741 ctgctttttg tttcattttg ttttgttttg taaaaatgat aaaacttcag aaaataaaat 1801 gtcagtgttg aataaaaaaa aaaaaaaaaa aaa // LOCUS AF006259 1284 bp mRNA PRI 24-JAN-1998 DEFINITION Homo sapiens Rad51-interacting protein mRNA, complete cds. ACCESSION AF006259 NID g2804673 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1284) AUTHORS Kovalenko,O.V., Golub,E.I., Bray-Ward,P., Ward,D.C. and Radding,C.M. TITLE A novel nucleic acid-binding protein that interacts with human rad51 recombinase JOURNAL Nucleic Acids Res. 25 (24), 4946-4953 (1997) MEDLINE 98060891 REFERENCE 2 (bases 1 to 1284) AUTHORS Kovalenko,O.V., Golub,E.I., Ward,D.C. and Radding,C.M. TITLE Direct Submission JOURNAL Submitted (01-JUN-1997) Genetics, Yale University, 333 Cedar St., New Haven, CT 06511, USA FEATURES Location/Qualifiers source 1..1284 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa S3" CDS 52..1059 /function="interacts with human Rad51 recombinase" /note="Pir51; novel DNA-binding protein" /codon_start=1 /product="Rad51-interacting protein" /db_xref="PID:g2804674" /translation="MVRPVRHKKPVNYSQFDHSDSDDDFVSATVPLNKKSRTAPKELK QDKPKPNLNNLRKEEIPVQEKTPKKRMALDDKLYQRDLEVALALSVKELPTVTTNVQN SQDKSIEKHGSSKIETMNKSPHISNCSVASDYLDLDKITVEDDVGGVQGKRKAASKAA AQQRKILLEGSDGDSANDTEPDFAPGEDSEDDSDFCESEDNDEDFSMRKSKVKEIKKK EVKVKSPVEKKEKKSKSKCNALVTSVDSAPAAVKSESQSLPKKVSLSSDTTRKPLEIR SPSAESKKPKWVPPAASGGSRSSSSPLVVVSVKSPNQSLRLGLSRLARVKPLHPNATS T" BASE COUNT 438 a 237 c 277 g 332 t ORIGIN 1 ctgaagccaa caagatttga gaactgtaaa taccaagcct tgaaagggac catggtgcgg 61 cctgtgagac ataagaaacc agtcaattac tcacagtttg accactctga cagtgatgat 121 gattttgttt ctgcaactgt acctttaaac aagaaatcca gaacagcacc aaaggagtta 181 aaacaagata aaccaaaacc taacttgaac aatctccgga aagaagaaat cccagtacaa 241 gagaaaaccc ctaaaaaaag gatggcttta gatgacaagc tctaccagag agacttagaa 301 gttgcactag ctttatcagt gaaggaactt ccaacagtca ccactaatgt gcagaactct 361 caagataaaa gcattgaaaa acatggcagt agtaaaatag aaacaatgaa taagtctcct 421 catatctcta attgcagtgt agccagtgat tatttagatt tggataagat tactgtggaa 481 gatgatgttg gtggtgttca agggaaaaga aaagcagcat ctaaagctgc agcacagcag 541 aggaagattc ttctggaagg cagtgatggt gatagtgcta atgacactga accagacttt 601 gcacctggtg aagattctga ggatgattct gatttttgtg agagtgagga taatgacgaa 661 gacttctcta tgagaaaaag taaagttaaa gaaattaaaa agaaagaagt gaaggtaaaa 721 tccccagtag aaaagaaaga gaagaaatct aaatccaaat gtaatgcttt ggtgacttcg 781 gtggactctg ctccagctgc cgtcaaatca gaatctcagt ccttgccaaa aaaggtttct 841 ctgtcttcag ataccactag gaaaccatta gaaatacgca gtccttcagc tgaaagcaag 901 aaacctaaat gggtcccacc agcggcatct ggaggtagca gaagtagcag cagcccactg 961 gtggtagtgt ctgtgaagtc tcccaatcag agtctccgcc ttggcttgtc cagattagca 1021 cgagttaaac ctttgcatcc aaatgccact agcacctgag tgtggtacag gaggaatgtt 1081 tggttgggag aatcacagct ttacaagggt gtttatattt gatttgtgtt tatatttgag 1141 gcaggtattg taatataaag gaatccatta ccatgtccta taaatgacct ctagccattt 1201 tatgattatg ttctctgtaa aactcttcaa gacttcaatg agaagtttgt ttataagaat 1261 tatcttctca tacctttcct tgtg // LOCUS AF006305 1579 bp mRNA PRI 25-JUN-1997 DEFINITION Homo sapiens 26S proteasome regulatory subunit (SUG2) mRNA, complete cds. ACCESSION AF006305 NID g2213931 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1579) AUTHORS Li,Y. and Benezra,R. TITLE Human SUG2 JOURNAL Unpublished REFERENCE 2 (bases 1 to 1579) AUTHORS Li,Y. and Benezra,R. TITLE Direct Submission JOURNAL Submitted (30-MAY-1997) Cell Biology, Memorial Sloan-Kettering Cancer Center, 1275 York Avenue, New York, NY 10021, USA FEATURES Location/Qualifiers source 1..1579 /organism="Homo sapiens" /db_xref="taxon:9606" gene <1..1579 /gene="SUG2" CDS 16..1185 /gene="SUG2" /codon_start=1 /product="26S proteasome regulatory subunit" /db_xref="PID:g2213932" /translation="MADPRDKALQDYRKKLLEHKEIDGRLKELREQLKELTKQYEKSE NDLKALQSVGQIVGEVLKQLTEEKFIVKATNGPRYVVGCRRQLDKSKLKPGTRVALDM TTLTIMRYLPREVDPLVYNMSHEDPGNVSYSEIGGLSEQIRELREVIELPLTNPELFQ RVGIIPPKGCLLYGPPGTGKTLLARAVASQLDCNFLKVVSSSIVDKYIGESARLIREM FNYARDHQPCIIFMDEIDAIGGRRFSEGTSADREIQRTLMELLNQMDGFDTLHRVKMI MATNRPDTLDPALLRPGRLDRKIHIDLPNEQARLDILKIHAGPITKHGEIDYEAIVKL SDGFNGADLRNVCTEAGMFAIRADHDFVVQEDFMKAVRKVADSKKLESKLDYKPV" BASE COUNT 532 a 231 c 360 g 456 t ORIGIN 1 ggcacgaggc tcatcatggc ggaccctaga gataaggcgc ttcaggacta ccgcaagaag 61 ttgcttgaac acaaggagat cgacggccgt cttaaggagt taagggaaca attaaaagaa 121 cttaccaagc agtatgaaaa gtctgaaaat gatctgaagg ccctacagag tgttgggcag 181 atcgtgggtg aagtgcttaa acagttaact gaagaaaaat tcattgttaa agctaccaat 241 ggaccaagat atgttgtggg ttgtcgtcga cagcttgaca aaagtaagct gaagccagga 301 acaagagttg ctttggatat gactacacta actatcatga gatatttgcc gagagaggtg 361 gatccactgg tttataacat gtctcatgag gaccctggga atgtttctta ttctgagatt 421 ggagggctat cagaacagat ccgggaatta agagaggtga tagaattacc tcttacaaac 481 ccagagttat ttcagcgtgt aggaataata cctccaaaag gctgtttgtt atatggacca 541 ccaggtacgg gaaaaacact cttggcacga gccgttgcta gccagctgga ctgcaatttc 601 ttaaaggttg tatctagttc tattgtagac aagtacattg gtgaaagtgc tcgtttgatc 661 agagaaatgt ttaattatgc tagagatcat caaccatgca tcatttttat ggatgaaata 721 gatgctattg gtggtcgtcg gttttctgag ggtacttcag ctgacagaga gattcagaga 781 acgttaatgg agttactgaa tcaaatggat ggatttgata ctctgcatag agttaaaatg 841 atcatggcta caaacagacc agatacactg gatcctgctt tgctgcgtcc aggaagatta 901 gatagaaaaa tacatattga tttgccaaat gaacaagcaa gattagacat actgaaaatc 961 catgcaggtc ccattacaaa gcatggtgaa atagattatg aagcaattgt gaagctttcg 1021 gatggcttta atggagcaga tctgagaaat gtttgtactg aagcaggtat gttcgcaatt 1081 cgtgctgatc atgattttgt agtacaggaa gacttcatga aagcagtcag aaaagtggct 1141 gattctaaga agctggagtc taaattggac tacaaacctg tgtaatttac tgtaagattt 1201 ttgatggctg catgacagat gttggcttat tgtaaaaata aagttaaaga aaataatgta 1261 tgtattggca atgatgtcat taaaagtata tgaataaaaa tatgagtaac atcataaaaa 1321 ttagtaattc aacttttaag atacagaaga aatttgtatg tttgttaaag ttgcatttat 1381 tgcagcaagt tacaaaggga aagtgttgaa gcttttcata tttgctgcgt gagcattttg 1441 taaaatattg aaagtggttt gagatagtgg tataagaaag catttcttat gacttatttt 1501 gtatcatttg ttttcctcat ctaaaaagtt gaataaaatc tgtttgattc agttctccta 1561 aaaaaaaaaa aaaaaaaaa // LOCUS AF006386 921 bp mRNA PRI 03-SEP-1997 DEFINITION Homo sapiens axonemal dynein light chain (hp28) mRNA, complete cds. ACCESSION AF006386 NID g2352533 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 921) AUTHORS Kastury,K., Taylor,W.E., Shen,R., Arver,S., Gutierrez,M., Fisher,C.E., Coucke,P.J., Van Hauwe,P., Van Camp,G. and Bhasin,S. TITLE cDNA cloning and characterization of the human axonemal dynein light chain gene: a putative candidate gene for the immotile cilia syndrome JOURNAL J. Clin. Endocrinol. Metab. (1997) In press REFERENCE 2 (bases 1 to 921) AUTHORS Kastury,K., Taylor,W.E., Shen,R., Arver,S., Gutierrez,M., Fisher,C.E., Coucke,P.J., Van Hauwe,P., Van Camp,G. and Bhasin,S. TITLE Direct Submission JOURNAL Submitted (31-MAY-1997) Endocrinology Div., Charles Drew University of Medicine and Science, 1621 E. 120th St., Los Angeles, CA 90059, USA FEATURES Location/Qualifiers source 1..921 /organism="Homo sapiens" /strain="sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1p35.1" exon 1..140 /gene="hp28" /number=1 gene 1..921 /gene="hp28" CDS 57..830 /gene="hp28" /note="dynein arm of flagellum motor protein; similar to Chlamydomonas p28 dynein" /codon_start=1 /product="axonemal dynein light chain" /db_xref="PID:g2352534" /translation="MIPPADSFAQVRHPSAGEPEHGETEPQGRLLKVSPQQPGPSGSA PQPPKTKLPSTPCVPDPTKQAEEILNAILPPREWVEDTQLWIQQVSSTPTHQDGRGAP PGAVRLKAAAAAGQGNRHLPCPQELYSQCFDELIRESPSTVRRGGCCCCSRDEIRMTI AAYQTLYESSVAFGMRKALQAEQGKSDMERKIAELETEKRDLERQVNEQKAKCEATEK RESERRQVEEKKHNEEIQFLKRTNQQLKAQLEGIIAPKK" exon 141..283 /gene="hp28" /number=2 exon 284..453 /gene="hp28" /number=3 exon 454..629 /gene="hp28" /number=4 exon 630..672 /gene="hp28" /number=5 exon 673..794 /gene="hp28" /number=6 exon 795..921 /gene="hp28" /number=7 BASE COUNT 257 a 244 c 263 g 157 t ORIGIN 1 caaacaaggc ccacactgga cagggcagct gctgggttgc tactctcgcc tccgccatga 61 ttccgcccgc agactctttt gctcaagtac gacaccccag tgctggtgag ccggaacacg 121 gagaaacgga gccccaaggt cggctactga aagtcagccc ccagcagcct ggaccttcag 181 gttcagcccc acagccaccc aagaccaagc tcccctcaac tccctgtgtc ccagatccta 241 caaagcaggc agaagaaatc ttgaatgcca tactaccccc aagggagtgg gtggaagaca 301 cgcagctatg gatccagcag gtgtccagca cccctacgca ccaggatgga cgtggtgcac 361 ctccaggagc agttagactt aaagctgcag cagcggcagg ccagggaaac aggcatctgc 421 cctgtccgca ggaactctac tcacagtgtt ttgatgagtt gatccgggag tcaccatcaa 481 ctgtgcggag agggggctgc tgctgctgca gtcgggacga gatccgcatg accatcgctg 541 cctaccagac cctgtacgag agcagcgtgg cgtttggcat gaggaaggca ctgcaggctg 601 agcaggggaa gtcagacatg gagaggaaaa tcgcagaatt ggagacggaa aagagagacc 661 tggagaggca agtgaacgag cagaaggcaa aatgtgaagc cactgagaag cgggagagcg 721 agaggcggca ggtggaggag aagaagcaca atgaggagat tcagttcctg aagcgaacaa 781 atcagcagct gaaggcccaa ctggaaggca ttattgcacc aaagaagtga taatttccac 841 atgattaatt tccaacaaga cacttgggag ttatttactg tgttcctctg gcagccaata 901 aaatcatcat aagccctttg t // LOCUS AF006621 3129 bp mRNA PRI 02-DEC-1997 DEFINITION Homo sapiens embryonic lung protein (HUEL) mRNA, complete cds. ACCESSION AF006621 NID g2654558 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3129) AUTHORS Chow,V.T.K. and Sim,D.L.C. TITLE HUEL, a novel gene isolated from human embryonic lung cells JOURNAL Unpublished REFERENCE 2 (bases 1 to 3129) AUTHORS Chow,V.T.K. and Sim,D.L.C. TITLE Direct Submission JOURNAL Submitted (02-JUN-1997) Microbiology, National University of Singapore, 10, Kent Ridge Cresent, Kent Ridge, Singapore 119260, Republic of Singapore FEATURES Location/Qualifiers source 1..3129 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="lung" /dev_stage="fetal, 22-23 weeks" /clone_lib="Human Fetal Lung Marathon-Ready cDNA (Clontech) (pooled from 4 male/female Caucasian fetuses)" gene 1..3129 /gene="HUEL" CDS 107..1813 /gene="HUEL" /codon_start=1 /product="embryonic lung protein" /db_xref="PID:g2654559" /translation="MLPGLAAAAAHRCSWSSLCRLRLRCRAAACNPSDRQEWQNLVTF GSFSNVVPCSHPYIGTLSQVKLYSTNVQKEGQGSQTLRVEKVPSFETAEGIGAELKAP LKQEPLQVRVKAVLKKREYGSKYTQNNFITGVRAINEFCLKSSDLEQLRKIRRRSPHQ DTESFTVYLRSDVDAKSLEVWGSPEALAREKKLRKDAEIEYRERLFRNQKILREYRDF LGNTKPRSRTASVFFKGPGKVVMVAICINGLNCFFKFLAWIYTGSASMFSEAIHSLSD TCNQGLLALGISKSVQTPDPSHPYGFSNMRYISSLISGVGIFMMGAGLSWYHGVMGLL HPQPIESLLWAYCILAGSLVSEGATLLVAVNELRRNARAKGMSFYKYVMESRDPSTNV ILLEDTAAVLGVIIAATCMGLTSITGNPLYDSLGSLGVGTLLGMVSAFLIYTNTEALL GRSIQPEQVQRLTELLENDPSVRAIHDVKATDLGLGKVRFKAEVDFDGRVVTRSYLEK QDFDQMLQEIQEVKTPEELETFMLKHGENIIDTLGAEVDRLEKELKKRNPEVRHVDLE IL" BASE COUNT 942 a 577 c 662 g 948 t ORIGIN 1 cgaagccatc ggtgttcgct gatgtccagt ctatggagtc agttggtacc ggtggcggcg 61 cggaggcaga aggcggtgtc cgagtagggg cctctgcccc accaggatgt taccgggctt 121 ggccgccgcc gcggcccaca gatgtagctg gtcctccctg tgccggctcc gtctgcgatg 181 cagggcggcg gcctgtaatc ccagcgaccg ccaggagtgg cagaatttag tgacatttgg 241 aagcttttca aacgtggttc cctgtagtca tccatatatt ggtaccctga gtcaagtaaa 301 gttgtactcc acaaatgttc agaaagaagg acagggatca caaacactca gagtggaaaa 361 agtaccatca tttgaaacag cagaaggtat aggcgcagaa ctcaaagctc cacttaagca 421 agaacctctc caagtaagag ttaaagcagt ccttaagaaa agggagtatg gatcaaagta 481 cactcagaat aatttcatca ctggagtcag agcgataaat gagttctgcc tcaaatccag 541 tgatctagaa caacttcgaa aaatcagacg acgaagtccc catcaagata ctgagtcttt 601 tactgtatac ttgagatcag atgtggacgc aaaatctttg gaagtttggg gaagccctga 661 agctcttgcc agagagaaaa aattgcgtaa ggacgcagaa atagaataca gagaaaggct 721 atttagaaac caaaaaatat taagagaata cagagatttc ttgggaaata ccaagccacg 781 ctccagaaca gcatcagtgt tttttaaggg accaggaaaa gtggtgatgg ttgcaatttg 841 catcaatgga ttaaactgct tctttaaatt tcttgcctgg atttataccg gttcagcaag 901 tatgttctca gaagctatac actcattatc tgatacttgt aatcagggtt tactagcatt 961 gggcatcagt aagtctgttc aaacaccaga tccttctcat ccgtacggat tttcaaatat 1021 gcgctatatt tcttcgctaa ttagtggtgt tggtattttc atgatgggtg caggactatc 1081 ttggtaccat ggagtcatgg gattgcttca tcctcaacca atagaatccc ttctatgggc 1141 atattgtatt ttagcaggat cattagtatc tgaaggagca acacttcttg ttgctgtaaa 1201 tgaacttcgt aggaatgctc gggctaaagg aatgtcattt tacaagtatg taatggaaag 1261 tcgtgatcct agtacaaatg tgatattatt ggaggatact gctgcagtct tgggagttat 1321 aatagcagcc acttgcatgg gccttacttc tataacaggc aatccactgt atgacagcct 1381 aggttctttg ggtgtgggca ccttattagg catggtctca gcattcctca tctacactaa 1441 cacagaagca ctcttagggc ggtccatcca gccagaacaa gtacaacggc tcactgaact 1501 cctggagaat gacccatcag taagggcaat tcatgatgtt aaagccacag atctgggatt 1561 aggtaaagta agatttaagg cagaagtaga ttttgatggg cgagttgtta caagatcata 1621 tttggaaaaa caagattttg accaaatgtt acaagaaatt caagaagtga aaactcctga 1681 agaactagag acctttatgc ttaaacatgg agaaaatatt attgatactt taggagctga 1741 agtagataga cttgagaagg aactgaaaaa acgaaatcct gaagttcgac atgtagattt 1801 ggagatactg tgagtttgat ggaatgaatc acctgggtgg ggaccttgga aacaagtttg 1861 tccgtccact ctacaaagtt tcctcctctc ctacactgaa agactcagtg ccatgcagaa 1921 gccttttttt taagatgaag gaaatatttt atgtaaagag caactcagca ggacacagaa 1981 ctaaaactac tacttacatc taacagacac actacaagtt gaatcaattt gaaaatcatg 2041 tttttatgct tccataggga acattttggt tatttaaatt gttcataatg tcccatattt 2101 cacctgttca gtgtatactg tactttgcaa tcatctttcc ttttttcaca ttggtaaaaa 2161 taagtggcat ccataggatc atgattttta atttgttgcc tctgaagatt tcactccatc 2221 aagatctgcc aatcttcaat attctggcta aatcttggta tgtggttttt aaacagtcac 2281 tccgtttcaa agtctgtctt tccttataga atgtggaaat tatttctcca taccttgtga 2341 ttttgacctg agtgctaaga gaatcactct ccttacctag ttatctacaa atgttcattc 2401 cagaaatgtt tagttactga attgaatgaa gacatctcag tacactcttt taggtcatag 2461 tagttgccat tttgtaaaat ttcttttttc ttctttgctt ttttcccctt atttggttta 2521 atttttctaa tgttaggaga tatagtccta gatatttcca tgggccagtg tgatgacttt 2581 tttttaaatg aggttcagta ccataatgtt tatttactgg aagataatgc atttataagc 2641 attttaaaat tctgtaaagt gggttagaaa tatttataat tttacaggca ggacagcatt 2701 tgacttttat ttaaaaggcg gcactactta tgtaaatctg agctgtggga tatttcttgc 2761 tttaagagag agacagaatc tctcactgaa actcatggtc atgattttgt ataatatagt 2821 tcatactgtg tctgtgagta tattcaatta caaatgggca tttagtatag ttatattgac 2881 tataacatgt aagtaaatag ctttctactg accctaagtc atcaaggtgg aaaaaaaaca 2941 tgcaattcag taattgaaaa tgtggtgaaa agctgcagct gtcatcatca aaacaactca 3001 taacatactt taaaatgttc aggtagcagt gagcattgtt catatgagaa tggcggctgg 3061 gtgatctctt tgctgaatta atgagttctt aacatgtgga cccaactgcc tgtgtgagat 3121 ctgtgtctt // LOCUS AF006689 1392 bp mRNA PRI 28-JAN-1998 DEFINITION Homo sapiens MAP kinase kinase Jnkk2 mRNA, complete cds. ACCESSION AF006689 NID g2811125 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1392) AUTHORS Lu,X., Nemoto,S. and Lin,A. TITLE Identification of c-Jun NH2-terminal protein kinase (JNK)-activating kinase 2 as an activator of JNK but not p38 JOURNAL J. Biol. Chem. 272 (40), 24751-24754 (1997) MEDLINE 97460048 REFERENCE 2 (bases 1 to 1392) AUTHORS Lu,X., Nemoto,S. and Lin,A. TITLE Direct Submission JOURNAL Submitted (04-JUN-1997) Pathology, University of Alabama at Birmingham, 1670 University Blvd., Birmingham, Al 35294, USA REFERENCE 3 (bases 1 to 1392) AUTHORS Lu,X., Nemoto,S. and Lin,A. TITLE Direct Submission JOURNAL Submitted (27-JAN-1998) Pathology, University of Alabama at Birmingham, 1670 University Blvd., Birmingham, Al 35294, USA REMARK Sequence update by submitter FEATURES Location/Qualifiers source 1..1392 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 76..1281 /note="a new family member of MAP kinase kinases" /codon_start=1 /product="Jnkk2" /db_xref="PID:g2811126" /translation="MAASSLEQKLSRLEAKLKQENREARRRIDLNLDISPQRPRPTLQ LPLANDGGSRSPSSESSPQHPTPPARPRHMLGLPSTLFTPRSMESIEIDHKLQEIMKQ TGYLTIGGQRYQAEINDLENLGEMGSGTCGPVWKMRFRKTGHVIAVKQMRRSGNKEEN KRILMDLDVVLKSHDCPYIVQCFGTFITNTDVFIAMELMGTCAEKLKKRMQGPIPERI LGKMTVAIVKALYYLKEKHGVIHRDVKPSNILLDERGQIKLCDFGISGRLVDSKAKTR SAGCAAYMAPERIDPPDPTKPDYDIRADVWSLGISLVELATGQFPYKNCKTDFEVLTK VLQEEPPLLPGHMGFSGDFQSFVKDCLTKDHRKRPKYNKLLEHSFIKRYETLEVDVAS WFKDVMAKT" BASE COUNT 301 a 443 c 427 g 221 t ORIGIN 1 aattcggcac gaggtgtttg tctgccggac tgacgggcgg ccgggcggtg cgcggcggcg 61 gtggcggcgg ggaagatggc ggcgtcctcc ctggaacaga agctgtcccg cctggaagca 121 aagctgaagc aggagaaccg ggaggcccgg cggaggatcg acctcaacct ggatatcagc 181 ccccagcggc ccaggcccac cctgcagctc ccgctggcca acgatggggg cagccgctcg 241 ccatcctcag agagctcccc gcagcacccc acgccccccg cccggccccg ccacatgctg 301 gggctcccgt caaccctgtt cacaccccgc agcatggaga gcattgagat tgaccacaag 361 ctgcaggaga tcatgaagca gacgggctac ctgaccatcg ggggccagcg ctaccaggca 421 gaaatcaacg acctggagaa cttgggcgag atgggcagcg gcacctgcgg accggtgtgg 481 aagatgcgct tccggaagac cggccacgtc attgccgtta agcaaatgcg gcgctccggg 541 aacaaggagg agaacaagcg catcctcatg gacctggatg tggtgctgaa gagccacgac 601 tgcccctaca tcgtgcagtg ctttgggacg ttcatcacca acacggacgt cttcatcgcc 661 atggagctca tgggcacctg cgctgagaag ctcaagaagc ggatgcaggg ccccatcccc 721 gagcgcattc tgggcaagat gacagtggcg attgtgaagg cgctgtacta cctgaaggag 781 aagcacggtg tcatccaccg cgacgtcaag ccctccaaca tcctgctgga cgagcggggc 841 cagatcaagc tctgcgactt cggcatcagc ggccgcctgg tggactccaa agccaagacg 901 cggagcgccg gctgtgccgc ctacatggca cccgagcgca ttgacccccc agaccccacc 961 aagccggact atgacatccg ggccgacgta tggagcctgg gcatctcgtt ggtggagctg 1021 gcaacaggac agtttcccta caagaactgc aagacggact ttgaggtcct caccaaagtc 1081 ctacaggaag agcccccgct tctgcccgga cacatgggct tctcggggga cttccagtcc 1141 ttcgtcaaag actgccttac taaagatcac aggaagagac caaagtataa taagctactt 1201 gaacacagct tcatcaagcg ctacgagacg ctggaggtgg acgtggcgtc ctggttcaag 1261 gatgtcatgg cgaagacctg agtcaccgcg gactaacggc gttccttgag ccagccccac 1321 cttggcccct tcttcaggtt agcttgcttt ggccggcggc caacccctct ggggggccag 1381 ggcattggcc cc // LOCUS AF006740 2534 bp mRNA PRI 22-JAN-1998 DEFINITION Homo sapiens Coch-5B2 mRNA, complete cds. ACCESSION AF006740 NID g2801412 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2534) AUTHORS Robertson,N.G., Skvorak,A.B., Yin,Y., Weremowicz,S., Johnson,K.R., Kovatch,K.A., Battey,J.F., Bieber,F.R. and Morton,C.C. TITLE Mapping and characterization of a novel cochlear gene in human and in mouse: a positional candidate gene for a deafness disorder, DFNA9 JOURNAL Genomics 46 (3), 345-354 (1997) MEDLINE 98110569 REFERENCE 2 (bases 1 to 2534) AUTHORS Robertson,N.G., Skvorak,A.B., Yin,Y., Weremowicz,S., Johnson,K.R., Kovatch,K.A., Battey,J.F., Bieber,F.R. and Morton,C.C. TITLE Direct Submission JOURNAL Submitted (04-JUN-1997) Pathology, Brigham and Women's Hospital, 75 Francis Street Room 523 Thorn, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..2534 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="14" /map="14q11.2-q13" gene 1..2534 /gene="Coch-5B2" CDS 57..1709 /gene="Coch-5B2" /codon_start=1 /db_xref="PID:g2801413" /translation="MSAAWIPALGLGVCLLLLPGPAGSEGAAPIAITCFTRGLDIRKE KADVLCPGGCPLEEFSVYGNIVYASVSSICGAAVHRGVISNSGGPVRVYSLPGRENYS SVDANGIQSQMLSRWSASFTVTKGKSSTQEATGQAVSTAHPPTGKRLKKTPEKKTGNK DCKADIAFLIDGSFNIGQRRFNLQKNFVGKVALMLGIGTEGPHVGLVQASEHPKIEFY LKNFTSAKDVLFAIKEVGFRGGNSNTGKALKHTAQKFFTVDAGVRKGIPKVVVVFIDG WPSDDIEEAGIVAREFGVNVFIVSVAKPIPEELGMVQDVTFVDKAVCRNNGFFSYHMP NWFGTTKYVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGDSNFRLMLEFVSN IAKTFEISDIGAKIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAI SFTVRNVFGPIRESPNKNFLVIVTDGQSYDDVQGPAAAAHDAGITIFSVGVAWAPLDD LKDMASKPKESHAFFTREFTGLEPIVSDVIRGICRDFLESQQ" BASE COUNT 774 a 498 c 557 g 705 t ORIGIN 1 gcactcgggc gcagccgggt ggatctcgag caggtgtgag cagcctatca gtcaccatgt 61 ccgcagcctg gatcccggct ctcggcctcg gtgtgtgtct gctgctgctg ccggggcccg 121 cgggcagcga gggagccgct cccattgcta tcacatgttt taccagaggc ttggacatca 181 ggaaagagaa agcagatgtc ctctgcccag ggggctgccc tcttgaggaa ttctctgtgt 241 atgggaacat agtatatgct tctgtatcga gcatatgtgg ggctgctgtc cacaggggag 301 taatcagcaa ctcaggggga cctgtacgag tctatagcct acctggtcga gaaaactatt 361 cctcagtaga tgccaatggc atccagtctc aaatgctttc tagatggtct gcttctttca 421 cagtaactaa aggcaaaagt agtacacagg aggccacagg acaagcagtg tccacagcac 481 atccaccaac aggtaaacga ctaaagaaaa cacccgagaa gaaaactggc aataaagatt 541 gtaaagcaga cattgcattt ctgattgatg gaagctttaa tattgggcag cgccgattta 601 atttacagaa gaattttgtt ggaaaagtgg ctctaatgtt gggaattgga acagaaggac 661 cacatgtggg ccttgttcaa gccagtgaac atcccaaaat agaattttac ttgaaaaact 721 ttacatcagc caaagatgtt ttgtttgcca taaaggaagt aggtttcaga gggggtaatt 781 ccaatacagg aaaagccttg aagcatactg ctcagaaatt cttcacggta gatgctggag 841 taagaaaagg gatccccaaa gtggtggtgg tatttattga tggttggcct tctgatgaca 901 tcgaggaagc aggcattgtg gccagagagt ttggtgtcaa tgtatttata gtttctgtgg 961 ccaagcctat ccctgaagaa ctggggatgg ttcaggatgt cacatttgtt gacaaggctg 1021 tctgtcggaa taatggcttc ttctcttacc acatgcccaa ctggtttggc accacaaaat 1081 acgtaaagcc tctggtacag aagctgtgca ctcatgaaca aatgatgtgc agcaagacct 1141 gttataactc agtgaacatt gcctttctaa ttgatggctc cagcagtgtt ggagatagca 1201 atttccgcct catgcttgaa tttgtttcca acatagccaa gacttttgaa atctcggaca 1261 ttggtgccaa gatagctgct gtacagttta cttatgatca gcgcacggag ttcagtttca 1321 ctgactatag caccaaagag aatgtcctag ctgtcatcag aaacatccgc tatatgagtg 1381 gtggaacagc tactggtgat gccatttcct tcactgttag aaatgtgttt ggccctataa 1441 gggagagccc caacaagaac ttcctagtaa ttgtcacaga tgggcagtcc tatgatgatg 1501 tccaaggccc tgcagctgct gcacatgatg caggaatcac tatcttctct gttggtgtgg 1561 cttgggcacc tctggatgac ctgaaagata tggcttctaa accgaaggag tctcatgctt 1621 tcttcacaag agagttcaca ggattagaac caattgtttc tgatgtcatc agaggcattt 1681 gtagagattt cttagaatcc cagcaataat ggtaacattt tgacaactga aagaaaaagt 1741 acaaggggat ccagtgtgta aattgtattc tcataatact gaaatgcttt agcatactag 1801 aatcagatac aaaactatta agtatgtcaa cagccattta ggcaaataag cactccttta 1861 aagccgctgc cttctggtta caatttacag tgtactttgt taaaaacact gctgaggctt 1921 cataatcatg gctcttagaa actcaggaaa gaggagataa tgtggattaa aaccttaaga 1981 gttctaacca tgcctactaa atgtacagat atgcaaattc catagctcaa taaaagaatc 2041 tgatacttag accaaaagca acattcgttc tctaaccatt ctgtattgat tatataagca 2101 aaatgaaaag agaaacttaa atgaacacag ctctttaaca tggttcaggt acacatattt 2161 tgacccaagt ggatattttc ttaaaaccaa tcaataatag ctagctatta ctgcagacta 2221 taaaatctgg atatagaaag gagacctgta tcaaactgct tttgtagtgt gttttcataa 2281 caacttatga ctaaaaatat cacactgaat aagagagcag gattgccagg tatttttcta 2341 tttctctcct taattttata tgtatataga tatatttggc ttatattcta agtcacctaa 2401 gtacttaaaa gttaagttgg taaagtattt actgactgct tataaacatt taaagacaaa 2461 gacatttcaa ataactgcag aaaaaatatt gtagtttgaa tatttaagca ataaaactgc 2521 tagtgagtta ttgt // LOCUS AF006822 2308 bp mRNA PRI 09-JUL-1997 DEFINITION Homo sapiens myelin transcription factor 2 (MYT2) mRNA, complete cds. ACCESSION AF006822 NID g2246660 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2308) AUTHORS Kim,J.G., Armstrong,R.C., Berndt,J.A. and Hudson,L.D. TITLE Direct Submission JOURNAL Submitted (05-JUN-1997) LDN,NINDS, NIH, 9000 Rockville Pike, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..2308 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="muscle" gene 1..2308 /gene="MYT2" CDS 1156..2283 /gene="MYT2" /note="DNA-binding protein" /codon_start=1 /product="myelin transcription factor 2" /db_xref="PID:g2246661" /translation="MDSSLSIITYSMSVHSLQKNGEPMKQEKQQFIKTDWTIDMINAV GNLRNMPLIFLTATKDIHRGGWVERIDNKAWQYVRVYENGDIEVLITLQIIRFHKYFD TPVWLQFNPNHLLPADYVQLDHVMKYVDHSHLTRADLANDIYNINLQRYDFGLFGGTR DIYRSLSGDLETRYWGRRKSERQIRLYDKMREMKKHGKADDIPDGITDWWRLEFQFRG GKVESWQEEVMDKMQSFHVLAVDDNDDLSEIDKAILARVNADKFDFKRVGKRYAAKIR KMVRENVGFDTTVAELSLKTFNEQKDELQRQLDSMLAKYNIGAQTEEMTAYFEEELKQ TGNLDFSVVESESALERNVIRNIAKSWREENSLTHRTNTNS" BASE COUNT 747 a 382 c 509 g 670 t ORIGIN 1 tattttagtt agtgaaagga cataatcatc aatgtttata atctggagtg cgttatcatg 61 gtttatagca attagttcat ttgcatttat gtttttatat tcatcagcaa aattgcaatg 121 ggtatttgga ttgattggaa ttagtcagat acttttatgg ctactaatat ggcgtttttc 181 acgttcagac aagaacggta gcgattaaaa tatttcggca gacatgtgcg tagcacgttt 241 atagcgtccg ctttgttagc atgtattaga cgttaacgaa cgattcgctt atcaaatttg 301 aacgatacgg ttgctacatg gtataatttt gttagaagaa ttgtggtttg gagtcttatc 361 ttcaatccag agcttgtcgt ctgtcaacga tgagcttttt ttgtgcaaaa aaatagcccc 421 gtcctgtcaa acgaggcact tgtattagcg acgattgtcg aaatactttt gcaacaagat 481 taatactaag ccagcaaaaa tctgccaaag tgtggcttgg agctgttgca ttagaaggat 541 tgcggtttgc atggagtctc acctcccttc tgggcgtcta atctcaacgc cacgagaaat 601 agtataccat gactttaaaa tggcttagaa cgcaaataag agccgtttaa gagtatttaa 661 gtttgcggtg gtatcttcgt atcggtggcg cgaccaaagg gagcgacaga cgacacacag 721 caggacactc caagtgtcca gtgtgctggg gtgccgttaa gaaatgacga ccagtcattt 781 taggcatctc gtccttgccg atatggagat accacctcta ggcgaaatga ggagagctct 841 gctcgacgat tggagacttt ttgacgggag gcgatcagcc gaccagaaaa aagcgggctg 901 ttactaacag cccgctaaca gcccgctaga ggtttccttt aggtaaccgc acaaccaaaa 961 aagcagtcag aatgacaagc caaataacta gtaattcgcc cgtaaaatgg gaacgtagaa 1021 ctcgtcatga cacttgtggt aaaataacat tattcgcttg caaaaaacga tgtactttga 1081 caatttaaga cacaaaaaaa cacaacctcg acaatttagg agttagagag gtcgtgtttt 1141 gagattagtg agactatgga tagttcatta agtattataa catactcaat gagtgtacat 1201 agtctccaga aaaatggaga acccatgaaa caagaaaaac agcagtttat aaaaacggat 1261 tggacgattg acatgataaa tgccgtcggc aatttgagaa atatgccatt gatatttctc 1321 acagctacta aggatattca tcgcggtgga tgggtagaac gaattgataa taaggcttgg 1381 cagtatgtca gagtttatga aaatggcgac attgaagttt taattacact tcagataata 1441 agattccaca aatattttga tacaccagtt tggttacagt ttaacccgaa tcatctttta 1501 cccgcagatt atgtgcaact tgatcatgtg atgaaatatg ttgatcactc acatttaact 1561 cgcgccgatt tagcaaatga tatttataac atcaacttgc aacgttacga ttttggattg 1621 tttggaggaa ctagagatat ctatcgtagt ttatctggtg atttagaaac acggtattgg 1681 ggacgccgaa aaagtgagcg ccaaattcgt ctttatgaca aaatgcgcga gatgaaaaaa 1741 catggcaaag cagatgatat tccagacggt attactgatt ggtggcgttt agaatttcaa 1801 tttagaggtg gcaaggttga gtcatggcaa gaagaagtaa tggacaaaat gcagtcgttc 1861 catgttcttg ctgttgatga taatgatgat ttaagtgaaa tagataaagc cattttagca 1921 cgtgttaatg ccgataaatt tgattttaaa agagtcggta aacgttatgc tgcaaaaatt 1981 cggaaaatgg tacgtgaaaa cgttggtttt gatacgaccg ttgcagaatt atctctcaaa 2041 acattcaacg aacaaaaaga tgagttgcag agacaactag atagcatgct tgcaaaatat 2101 aatatcggtg cacaaacaga agaaatgaca gcatattttg aggaagagct taaacaaaca 2161 ggtaatcttg atttttctgt tgttgagagt gaatctgcgt tagagcgtaa tgtgatacga 2221 aatattgcta aaagttggcg tgaagaaaat agtttaacgc acagaactaa tactaatagt 2281 tgatgcgtat tttttatttt tctaatat // LOCUS AF006823 2590 bp mRNA PRI 07-OCT-1997 DEFINITION Homo sapiens TWIK-related acid-sensitive K+ channel (TASK) mRNA, complete cds. ACCESSION AF006823 NID g2465541 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2590) AUTHORS Duprat,F., Lesage,F., Fink,M., Reyes,R., Heurteaux,C. and Lazdunski,M. TITLE TASK, a human background K+ channel to sense external pH variations near physiological pH JOURNAL EMBO J. 16 (17), 5464-5471 (1997) MEDLINE 97459932 REFERENCE 2 (bases 1 to 2590) AUTHORS Duprat,F., Lesage,F., Fink,M., Reyes,R., Heurteaux,C. and Lazdunski,M. TITLE Direct Submission JOURNAL Submitted (05-JUN-1997) IMPC, CNRS, 660 Route des Lucioles, Sophia-Antipolis, Valbonne 06560, France FEATURES Location/Qualifiers source 1..2590 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..2590 /gene="TASK" CDS 126..1310 /gene="TASK" /note="pore-forming K+ channel subunit" /codon_start=1 /product="TWIK-related acid-sensitive K+ channel" /db_xref="PID:g2465542" /translation="MKRQNVRTLALIVCTFTYLLVGAAVFDALESEPELIERQRLELR QQELRARYNLSQGGYEELERVVLRLKPHKAGVQWRFAGSFYFAITVITTIGYGHAAPS TDGGKVFCMFYALLGIPLTLVMFQSLGERINTLVRYLLHRAKKGLGMRRADVSMANMV LIGFFSCISTLCIGAAAFSHYEHWTFFQAYYYCFITLTTIGFGDYVALQKDQALQTQP QYVAFSFVYILTGLTVIGAFLNLVVLRFMTMNAEDEKRDAEHRALLTRNGQAGGGGGG GSAHTTDTASSTAAAGGGGFRNVYAEVLHFQSMCSCLWYKSREKLQYSIPMIIPRDLS TSDTCVEQSHSSPGGGGRYSDTPSRRCLCSGAPRSAISSVSTGLHSLSTFRGLMKRRS SV" BASE COUNT 502 a 829 c 789 g 470 t ORIGIN 1 tgccctgcgc ggagagcggc gagcgcagcc atgccccagg ccgcctccgg ggcagcagca 61 gcggcggccg gggccgatgc gcgggccggg ggcgccgggg ggccggcggc ggcccgggcg 121 ggacgatgaa gcggcagaac gtgcgcacgc tggcgctcat cgtgtgcacc ttcacctacc 181 tgctggtggg cgccgcggtc ttcgacgcgc tggagtcgga gcccgagctg atcgagcggc 241 agcggctgga gctgcggcag caggagctgc gggcgcgcta caacctcagc cagggcggct 301 acgaggagct ggagcgcgtc gtgctgcgcc tcaagccgca caaggccggc gtgcagtggc 361 gcttcgccgg ctccttctac ttcgccatca ccgtcatcac caccatcggc tacgggcacg 421 cggcacccag cacggatggc ggcaaggtgt tctgcatgtt ctacgcgctg ctgggcatcc 481 cgctcacgct cgtcatgttc cagagcctgg gcgagcgcat caacaccttg gtgaggtacc 541 tgctgcaccg cgccaagaag gggctgggca tgcggcgcgc cgacgtgtcc atggccaaca 601 tggtgctcat cggcttcttc tcgtgcatca gcacgctgtg catcggcgcc gccgccttct 661 cccactacga gcactggacc ttcttccagg cctactacta ctgcttcatc accctcacca 721 ccatcggctt cggcgactac gtggcgctgc agaaggacca ggccctgcag acgcagccgc 781 agtacgtggc cttcagcttc gtctacatcc ttacgggcct cacggtcatc ggcgccttcc 841 tcaacctcgt ggtgctgcgc ttcatgacca tgaacgccga ggacgagaag cgcgacgccg 901 agcaccgcgc gctgctcacg cgcaacgggc aggcgggcgg cggcggaggg ggtggcagcg 961 cgcacactac ggacaccgcc tcatccacgg cggcagcggg cggcggcggc ttccgcaacg 1021 tctacgcgga ggtgctgcac ttccagtcca tgtgctcgtg cctgtggtac aagagccgcg 1081 agaagctgca gtactccatc cccatgatca tcccgcggga cctctccacg tccgacacgt 1141 gcgtggagca gagccactcg tcgccgggag ggggcggccg ctacagcgac acgccctcgc 1201 gacgctgcct gtgcagcggg gcgccacgct ccgccatcag ctcggtgtcc acgggtctgc 1261 acagcctgtc caccttccgc ggcctcatga agcgcaggag ctccgtgtga ctgccccgag 1321 ggacctggag cacctggggg cgcgggcggg ggacccctgc tgggaggcca ggagactgcc 1381 cctgctgcct tctgcccagt gggaccccgc acaacatccc tcaccactct cccccagcac 1441 ccccatctcc gactgtgcct gcttgcacca gccggcagga ggccgggctc tgaggacccc 1501 tggggccccc atcggagccc tgcaaattcc gagaaatgtg aaacttggtg gggtcaggga 1561 ggaaaggcag aagctgggag cctcccttcc ctttgaaaat ctaagaagct cccagtcctc 1621 agagaccctg ctggtaccac accccacctt cggaggggac ttcatgttcc gtgtacgttt 1681 gcatctctat ttatacctct gtcctgctag gtctcccacc ttcccttggt tccaaaagcc 1741 agggtgtcta tgtccaagtc acccctactc agccccactc cccttcctca tccccagctg 1801 tgtctcccaa cctcccttcg tgttgttttg catggctttg cagttatgga gaaagtggaa 1861 acccagcagt ccctaaagct ggtccccaga aagcaggaca gaaagaagga gggacaggca 1921 ggcagcagga ggggcgagct gggaggcagg aggcagcggc ctgtcagtct gcagaatggt 1981 cgcactggag gttcaagcta actggcctcc agccacattc tcatagcagg taggacttca 2041 gccttccaga cactgccctt agaatctgga acagaagact tcagactcac cataattgct 2101 gataattacc cactcttaaa tttgtcgagt gatttttagc ctctgaaaac tctatgctgg 2161 ccactgattc ctttgagtct cacaaaaccc tacttaggtc atcagggcag gagttctcac 2221 tcccatttta cagatgagaa tactgaggcc tggacaggtg aagtgaccag agagcaaaag 2281 gcaaaggggt gggggctggg tgcagtggct cacacctgta ttcccaacac ttttggaggc 2341 tgaggttgga ggattgcttg agcccaggaa ttcgagacca gcctaggtga catagtgaga 2401 ccccatctct acaaaaaata aaaaattaac caggtgtggt ggcacgtgcc tgggagtccc 2461 agcgacttgg gaggctgagg tgggaggatt gtttgagcct gggaggtcga ggctgtagtg 2521 agccctgatt gcaccactgt actccagcct gggtgacagg gcaagaccct gtctcaaaaa 2581 aaaaaaaaaa // LOCUS AF007111 2192 bp mRNA PRI 14-JUL-1997 DEFINITION Homo sapiens MDM2-like p53-binding protein (MDMX) mRNA, complete cds. ACCESSION AF007111 NID g2253390 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2192) AUTHORS Shvarts,A., Bazuine,M., Dekker,P., Ramos,Y.F.M., Steegenga,W.T., Merckx,G., Van Ham,R.C.A., Van der Houven van Oordt,W., Van der Eb,A.J. and Jochemsen,A.G. TITLE Isolation and identification of the human homolog of a new p53-binding protein, Mdmx JOURNAL Genomics 43 (1997) In press REFERENCE 2 (bases 1 to 2192) AUTHORS Jochemsen,A.G., Shvarts,A., Steegenga,W.T., Riteco,N., Van Laar,T., Dekker,P., Bazuine,M., Van Ham,R.C.A., Van der Houven van Oordt,W., Hateboer,G. and Van der Eb,A.J. TITLE Direct Submission JOURNAL Submitted (06-JUN-1997) Molecular Celbiology; Section Molecular Carcinogenesis, Leiden University, Wassenaarseweg 72, Leiden 2333 AL, The Netherlands FEATURES Location/Qualifiers source 1..2192 /organism="Homo sapiens" /db_xref="taxon:9606" gene 117..1589 /gene="MDMX" CDS 117..1589 /gene="MDMX" /codon_start=1 /product="MDM2-like p53-binding protein" /db_xref="PID:g2253391" /translation="MTSFSTSAQCSTSDSACRISPGQINQVRPKLPLLKILHAAGAQG EMFTVKEVMHYLGQYIMVKQLYDQQEQHMVYCGGDLLGELLGRQSFSVKNPSPLYDML RKNLVTLATATTDAAQTLALAQDHSMDIPSQDQLKQSAEESSTSRKRTTEDDIPTLPT SEHKCIHSREDEDLIENLAQDETSRLDLGFEEWDVAGLPWWFLGNLRSNYTPRSNGST DLQTNQDVGTAIVSDTTDDLWFLNESVSEQLGVGIKVEAADTEQTSEEVGKVSDKKVI EVGKNDDLEDSKSLSDDTDVEVTSEDEWQCTECKKFNSPSKRYCFRCWALRKDWYSDC SKLTHSLSTSDITAIPEKENEGNDVPDCRRTISAPVVRPKDAYIKKENSKLFDPCNSV EFLDLAHSSESQETISSMGEQLDNLSEQRTDTENMEDCQNLLKPCSLCEKRPRDGNII HGRTGHLVTCFHCARRLKKAGASCPICKKEIQLVIKVFIA" BASE COUNT 670 a 441 c 491 g 590 t ORIGIN 1 cggcacgagc taggatctgt gactgccacc cctcccccca cccgggctcg gcgggggagc 61 gactcatgga gctgccgtaa gttttaccaa cagactgcag tttcttcact accaaaatga 121 catcattttc cacctctgct cagtgttcaa catctgacag tgcttgcagg atctctcctg 181 gacaaatcaa tcaggtacga ccaaaactgc cgcttttgaa gattttgcat gcagcaggtg 241 cgcaaggtga aatgttcact gttaaagagg tcatgcacta tttaggtcag tacataatgg 301 tgaagcaact ttatgatcag caggagcagc atatggtata ttgtggtgga gatcttttgg 361 gagaactact gggacgtcag agcttctccg taaagaaccc aagccctctc tatgatatgc 421 taagaaagaa tcttgtcact ttagccactg ctactacaga tgctgctcag actctcgctc 481 tcgcacagga tcacagtatg gatattccaa gtcaagacca actgaagcaa agtgcagagg 541 aaagttccac ttccagaaaa agaactacag aagacgatat ccccacactg cctacctcag 601 agcataaatg catacattct agagaagatg aagacttaat tgaaaattta gcccaagatg 661 aaacatctag gctggacctt ggatttgagg agtgggatgt agctggcctg ccttggtggt 721 ttttaggaaa cttgagaagc aactatacac ctagaagtaa tggctcaact gatttacaga 781 caaatcagga tgtgggtact gccattgttt cagatactac agatgacttg tggtttttga 841 atgagtcagt atcagagcag ttaggtgttg gaataaaagt tgaagctgct gatactgaac 901 aaacaagtga agaagtaggg aaagtaagtg acaaaaaggt gattgaagtg ggaaaaaatg 961 atgacctgga ggactctaag tccttaagtg atgataccga tgtagaggtt acctctgagg 1021 atgagtggca gtgtactgaa tgcaagaaat ttaactctcc aagcaagagg tactgttttc 1081 gttgttgggc cttgaggaag gattggtatt cagattgttc aaagttaacc cattctctct 1141 ccacgtctga tatcactgcc atacctgaaa aggaaaatga aggaaatgat gtccctgatt 1201 gtcgaagaac catttcggct cctgtcgtta gacctaaaga tgcgtatata aagaaagaaa 1261 actccaaact ttttgatccc tgcaactcag tggaattctt ggatttggct cacagttctg 1321 aaagccaaga gaccatctca agcatgggag aacagttaga taacctttct gaacagagaa 1381 cagatacaga aaacatggag gattgccaga atctcttgaa gccatgtagc ttatgtgaga 1441 aaagaccacg agacgggaac attattcatg gaaggacggg ccatcttgtc acttgttttc 1501 actgtgccag aagactaaag aaggctgggg cttcatgccc tatttgcaag aaagagattc 1561 agctggttat taaggttttt atagcataat ggtagtacga acataaaaat gcatttattc 1621 agttcactta ccacattatt tgaaaatcaa tcctttattt aattttattt ccaacctgtc 1681 agagaatgtt cttaggcatc aaaatccaag gtagctgtaa gaaaaatact ggagctaaca 1741 atgaagaaca gaagtaatct gattagtcaa attattaagt gccatggatt actttatgca 1801 gcagtcaggt acatagttag gtgaacccaa aagaaaaact cttgaaaaca agagatttct 1861 tccatgcaca tttacaatat tgaggtataa ttaacatgat aaagtgtttc cttctaacga 1921 gttgtagaaa tctgagtaac cacccaaaaa agcaatagaa tgtttgtgtc accccaaaac 1981 actcccttct gcccctcttc agacagtcct tcagctattt catggctctc accctagttt 2041 tttttttttt ttgcactttt ttttttccgg gggtataggg gaggtgtggg gcgacagggt 2101 ctgtcttgtt ctgtctccca ggctgaagtg cagtgagtca agattgagcc actgcactcc 2161 agcctgggtg acagcgcgag actccatctc ag // LOCUS AF007165 1888 bp mRNA PRI 09-JUL-1997 DEFINITION Homo sapiens suppressin (spn) mRNA, complete cds. ACCESSION AF007165 NID g2246693 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1888) AUTHORS LeBoeuf,R.D., Blalock,J.E. and Tauber,J.D. TITLE Cloning and sequence analysis of the cDNA for human suppressin (spn) JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 1888) AUTHORS LeBoeuf,R.D. and Tauber,J.D. TITLE Direct Submission JOURNAL Submitted (06-JUN-1997) Physiology, University of Alabama at Birmingham, 1918 University Blvd. BHSB 850, Birmingham, AL 35294, USA FEATURES Location/Qualifiers source 1..1888 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11p15.5" /tissue_type="breast" gene 1..1888 /gene="spn" CDS 61..1554 /gene="spn" /function="proliferation inhibitor" /codon_start=1 /product="suppressin" /db_xref="PID:g2246694" /translation="MAADPGHMDMGAEALPGPDEAAAAAAFAEVTTVTVANVGAAADN VFTTSVANAASISGHVLSGRTALQIGDSLNTEKATLIVVHTDGSIVETTGLKGPAAPL TPGPQSPPTPLAPGQEKGGTKYNWDPSVYDSELPVRCRNISGTLYKNRLGSGGRGRCI KQGENWYSPTEFEAMAGRTSSKDWKRSIRYAGRPLQCLIQDGILNPHAASCTCAACCD DMTLSGPLRLFVPYKRRKKENELPTTPVKKDSPKNITLLPATAATTFTVTPSGQITTS GALTFDRASTVEATAVISESPAQGDVFAGATVQEASVQPPCRASHPEPHYPGYQDSCQ IAPFPEAALPTSHPKIVLTSLPALAVPPPTPTKAAPPALVNGLELSEPRSWLYLEEMV NSLLNTAQQLKTLFEQAKHASTYREAATNQAKIHADAERKEQSCVNCGREAMSECTGC HKVKYCSTFCQRKDWKDHQHICGQSAAVTVQADEVHVAESVMEKVTV" BASE COUNT 438 a 576 c 556 g 318 t ORIGIN 1 gaggaggacg cagactcgga ggcggagcgg gagacgccgc gggtcacggc agtggcggtg 61 atggcggcgg accccgggca catggacatg ggcgccgagg ccctgcccgg ccccgacgag 121 gccgccgctg ccgcagcctt cgcagaggtg accacagtga cagtggccaa cgtgggggct 181 gctgcagaca atgtcttcac cacgtctgtg gcgaacgcgg catccatctc aggacatgtt 241 ctgtctggta ggacggccct tcagatcggg gacagcctga acaccgaaaa agcgacactg 301 attgtcgtcc acacagatgg gagcatcgtg gagaccaccg ggctgaaagg cccggcagct 361 cccctcaccc caggtcctca gtctcctcca acccctctgg ctcccggcca agaaaaaggt 421 ggaactaaat acaactggga cccttctgtg tacgacagtg agctgcccgt acggtgccgg 481 aacatcagcg gcactctgta caagaacagg ctcggctcag gcggccgggg acggtgcatc 541 aagcaggggg agaactggta cagtcccacc gagtttgagg ccatggcagg aagaaccagc 601 agtaaggact ggaaaagaag cattcgctac gcgggccgac ccttgcagtg cctcatccag 661 gatgggatct taaaccctca cgctgcctct tgcacctgtg ctgcctgctg cgacgacatg 721 accttaagtg gcccactcag gctttttgtg ccttacaaaa ggcgcaagaa ggagaatgaa 781 ctgcccacaa ctcccgtgaa gaaggactcc cccaagaaca tcacattgct tccagccacc 841 gcggctacca ccttcaccgt gaccccctcg ggacagatca cgacctcggg ggcactgacc 901 tttgaccgag cgtccacggt agaggccact gctgtcatat cagagagtcc ggcccagggc 961 gacgtcttcg caggggccac agtccaagag gccagcgtgc agcccccatg cagggccagc 1021 caccctgagc ctcactaccc cggctatcag gacagctgcc agatcgcacc gtttccagaa 1081 gctgcgttgc caacgtcaca tcccaaaata gtgttgacat ccctgcctgc gctggcggtc 1141 ccacccccga ctcccaccaa agcggcacct cccgcgttgg tcaatgggct ggagctgtca 1201 gagccgcgga gctggctgta cctagaagag atggtcaact ccttgctcaa cacagcgcag 1261 cagctgaaga cgctgtttga gcaagccaag catgccagca cctaccgaga agctgccaca 1321 aaccaggcca agatccacgc tgacgcagag cggaaggagc agtcctgcgt taactgcggc 1381 cgggaggcta tgagcgagtg caccggctgc cacaaggtca aatattgttc caccttctgc 1441 caacgcaagg attggaagga tcaccagcac atatgcggcc agtcagcagc tgtcaccgtc 1501 caggcagacg aagtccacgt ggctgaaagc gtgatggaga aggtgaccgt gtgaggtcca 1561 tcggccgccc tgggagctgg ggcccctcgc actcctgtga ggcttttgca ggtcgaaggc 1621 ccccctgagg attttggggg gacgttgaga agaggggtgt gggaaggtaa agaaacttgc 1681 tggacaagtc attaacacac tttaagcgaa tggtgccctg ggaagcgcat tccccctgcc 1741 cgggccccct gcccgttcgc ggacagattt tttatccttg ggatcatgag cgtccggtct 1801 tgcccacagg gcctgtgctg cgacgcacat acatacgtgt tgtgtctgtc aataaagtgt 1861 aaataaggtc aaaaaaaaaa aaaaaaaa // LOCUS AF007216 7586 bp mRNA PRI 26-JUL-1997 DEFINITION Homo sapiens sodium bicarbonate cotransporter (HNBC1) mRNA, complete cds. ACCESSION AF007216 NID g2281471 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7586) AUTHORS Burnham,C.E., Amlal,H., Wang,Z., Shull,G.E. and Soleimani,M. TITLE Cloning and functional expression of a human kidney Na+:HCO3- cotransporter JOURNAL J. Biol. Chem. 272 (31), 19111-19114 (1997) MEDLINE 97382229 REFERENCE 2 (bases 1 to 7586) AUTHORS Burnham,C.E., Amlal,H., Wang,Z., Shull,G.E. and Soleimani,M. TITLE Direct Submission JOURNAL Submitted (06-JUN-1997) Nephrology, University of Cincinnati, 231 Bethesda Avenue, Cincinnati, OH 45267-0585, USA FEATURES Location/Qualifiers source 1..7586 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" /sex="male" /dev_stage="38 year old" /note="Caucasian" gene 1..7586 /gene="HNBC1" CDS 150..3257 /gene="HNBC1" /note="Na+:HCO3- cotransporter" /codon_start=1 /product="sodium bicarbonate cotransporter" /db_xref="PID:g2281472" /translation="MSTENVEGKPSNLGERGRARSSTFLRVVQPMFNHSIFTSAVSPA AERIRFILGEEDDSPAPPQLFTELDELLAVDGQEMEWKETARWIKFEEKVEQGGERWS KPHVATLSLHSLFELRTCMEKGSIMLDREASSLPQLVEMIVDHQIETGLLKPELKDKV TYTLLRKHRHQTKKSNLRSLADIGKTVSSASRMFTNPDNGSPAMTHRNLTSSSLNDIS DKPEKDQLKNKFMKKLPRDAEASNVLVGEVDFLDTPFIAFVRLQQAVMLGALTEVPVP TRFLFILLGPKGKAKSYHEIGRAIATLMSDEVFHDIAYKAKDRHDLIAGIDEFLDEVI VLPPGEWDPAIRIEPPKSLPSSDKRKNMYSGGENVQMNGDTPHDGGHGGGGHGDCEEL QRTGRFCGGLIKDIKRKAPFFASDFYDALNIQALSAILFIYLATVTNAITFGGLLGDA TDNMQGVLESFLGTAVSGAIFCLFAGQPLTILSSTGPVLVFERLLFNFSKDNNFDYLE FRLWIGLWSAFLCLILVATDASFLVQYFTRFTEEGFSSLISFIFIYDAFKKMIKLADY YPINSNFKVGYNTLFSCTCVPPDPANISISNDTTLAPEYLPTMSSTDMYHNTTFDWAF LSKKECSKYGGNLVGNNCNFVPDITLMSFILFLGTYTSSMALKKFKTSPYFPTTARKL ISDFAIILSILIFCVIDALVGVDTPKLIVPSEFKPTSPNRGWFVPPFGENPWWVCLAA AIPALLVTILIFMDQQITAVIVNRKEHKLKKGAGYHLDLFWVAILMVICSLMALPWYV AATVISIAHIDSLKMETETSAPGEQPKFLGVREQRVTGTLVFILTGLSVFMAPILKFI PMPVLYGVFLYMGVASLNGVQFMDRLKLLLMPLKHQPDFIYLRHVPLRRVHLFTFLQV LCLALLWILKSTVAAIIFPVMILALVAVRKGMDYLFSQHDLSFLDDVIPEKDKKKKED EKKKKKKKGSLDSDNDDSDCPYSEKVPSIKIPMDIMEQQPFLSDSKPSDRERSPTFLE RHTSC" BASE COUNT 2211 a 1473 c 1501 g 2401 t ORIGIN 1 gttctttgtg acacatcaca cagaattgga gtgctgtcct tctggagagt ggtggagaac 61 caagatacag ttcagaacca aaggaataga gaagggcttt gatttctttt tggctttaga 121 ttggggattt gggaggctta gcaggaaaga tgtccactga aaatgtggaa gggaagccca 181 gtaaccttgg ggagagagga agagcccgga gctccacttt cctcagggtt gtccagccaa 241 tgtttaacca cagtattttc acttctgcag tctctcctgc tgcagaacgc atccgattca 301 tcttgggaga ggaggatgac agcccagctc cccctcagct cttcacggaa ctggatgagc 361 tgctggccgt ggatgggcag gagatggagt ggaaggaaac agccaggtgg atcaagtttg 421 aagaaaaagt ggaacagggt ggggaaagat ggagcaagcc ccatgtggcc acattgtccc 481 ttcatagttt atttgagctg aggacatgta tggagaaagg atccatcatg cttgatcggg 541 aggcttcttc tctcccacag ttggtggaga tgattgttga ccatcagatt gagacaggcc 601 tattgaaacc tgaacttaag gataaggtga cctatacttt gctccggaag caccggcatc 661 aaaccaagaa atccaacctt cggtccctgg ctgacattgg gaagacagtc tccagtgcaa 721 gtaggatgtt taccaaccct gataatggta gcccagccat gacccatagg aatctgactt 781 cctccagtct gaatgacatt tctgataaac cggagaagga ccagctgaag aataagttca 841 tgaaaaaatt gccacgtgat gcagaagctt ccaacgtgct tgttggggag gttgactttt 901 tggatactcc tttcattgcc tttgttaggc tacagcaggc tgtcatgctg ggtgccctga 961 ctgaagttcc tgtgcccaca aggttcttgt tcattctctt aggtcctaag gggaaagcca 1021 agtcctacca cgagattggc agagccattg ccaccctgat gtctgatgag gtgttccatg 1081 acattgctta taaagcaaaa gacaggcacg acctgattgc tggtattgat gagttcctag 1141 atgaagtcat cgtccttcca cctggggaat gggatccagc aattaggata gagcctccta 1201 agagtcttcc atcctctgac aaaagaaaga atatgtactc aggtggagag aatgttcaga 1261 tgaatgggga tacgccccat gatggaggtc acggaggagg aggacatggg gattgtgaag 1321 aattgcagcg aactggacgg ttctgtggtg gactaattaa agacataaag aggaaagcgc 1381 cattttttgc cagtgatttt tatgatgctt taaatattca agctctttcg gcaattctct 1441 tcatttatct ggcaactgta actaatgcta tcacttttgg aggactgctt ggggatgcca 1501 ctgacaacat gcagggcgtg ttggagagtt tcctgggcac tgctgtctct ggagccatct 1561 tttgcctttt tgctggtcaa ccactcacta ttctgagcag caccggacct gtcctagttt 1621 ttgagaggct tctatttaat ttcagcaagg acaataattt tgactatttg gagtttcgcc 1681 tttggattgg cctgtggtcc gccttcctat gtctcatttt ggtagccact gatgccagct 1741 tcttggttca atacttcaca cgtttcacgg aggagggctt ttcctctctg attagcttca 1801 tctttatcta tgatgctttc aagaagatga tcaagcttgc agattactac cccatcaact 1861 ccaacttcaa agtgggctac aacactctct tttcctgtac ctgtgtgcca cctgacccag 1921 ctaatatctc aatatctaat gacaccacac tggccccaga gtatttgcca actatgtctt 1981 ctactgacat gtaccataat actacctttg actgggcatt tttgtcgaag aaggagtgtt 2041 caaaatacgg aggaaacctt gtcgggaaca actgtaattt tgttcctgat atcacactca 2101 tgtcttttat cctcttcttg ggaacctaca cctcttccat ggctctgaaa aaattcaaaa 2161 ctagtcctta ttttccaacc acagcaagaa aactgatcag tgattttgcc attatcttgt 2221 ccattctcat cttttgtgta atagatgccc tagtaggcgt ggacacccca aaactaattg 2281 tgccaagtga gttcaagcca acaagtccaa accgaggttg gttcgttcca ccgtttggag 2341 aaaacccctg gtgggtgtgc cttgctgctg ctatcccggc tttgttggtc actatactga 2401 ttttcatgga ccaacaaatt acagctgtga ttgtaaacag gaaagaacat aaactcaaga 2461 aaggagcagg gtatcacttg gatctctttt gggtggccat cctcatggtt atatgctccc 2521 tcatggctct tccgtggtat gtagctgcta cggtcatctc cattgctcac atcgacagtt 2581 tgaagatgga gacagagact tctgcacctg gagaacaacc aaagtttcta ggagtgaggg 2641 aacaaagagt cactggaacc cttgtgttta ttctgactgg tctgtcagtc tttatggctc 2701 ccatcttgaa gtttataccc atgcctgtac tctatggtgt gttcctgtat atgggagtag 2761 catcccttaa tggtgtgcag ttcatggatc gtctgaagct gcttctgatg cctctgaagc 2821 atcagcctga cttcatctac ctgcgtcatg ttcctctgcg cagagtccac ctgttcactt 2881 tcctgcaggt gttgtgtctg gccctgcttt ggatcctcaa gtcaacggtg gctgctatca 2941 tttttccagt aatgatcttg gcacttgtag ctgtcagaaa aggcatggac tacctcttct 3001 cccagcatga cctcagcttc ctggatgatg tcattccaga aaaggacaag aaaaagaagg 3061 aggatgagaa gaaaaagaaa aagaagaagg gaagtctgga cagtgacaat gatgattctg 3121 actgcccata ctcagaaaaa gttccaagta ttaaaattcc aatggacatc atggaacagc 3181 aacctttcct aagcgatagc aaaccttctg acagagaaag atcaccaaca ttccttgaac 3241 gccacacatc atgctgataa aattcctttc cttcagtcac tcggtatgcc aagtcctcct 3301 agaactccag taaaagttgc ctcaaattag actagaactt gaacctgaag acaatgatta 3361 tttctggagg agcaagggaa cagaaactac attgtaacct gtttgtcttt cttaaaactg 3421 acatttgttg ttaatgtcat ttgtttttgt ttggctgttt gtttattttt taacttttat 3481 ttcgtctcag tttttggtca caggccaaat aatacagcgc tctctctgct tctctcttgc 3541 atagatacaa tcaagacaat agtgcaccgt tccttaaaaa cagcatctga ggaatccccc 3601 ttttgttctt aaactttcag atgtgtcctt tgataaccaa attctgtcac tcaagacaca 3661 gacacccaca gaccctgtcc tttgcctcta ttaagcagag gatggaagta ttaaggattt 3721 tgtaacacct tttatgaaaa tgttgaagga acttaaaact ttagctttgg agctgtgctt 3781 actggcttgt ctttgtctgg tagaacaaac cttgacctcc agacagagtc ccttctcact 3841 tatagagctc tccaggactg gaaaaagtgc tgctatttta acttgctctt gcttgtaaat 3901 cctaatctta gagttatcaa aagaagaaaa aactgaaggt actttactcc ctatagagaa 3961 accattgcca tcattgtagc aagtgctgga atgtcccttt tttcctatgc aactttttta 4021 taacccttta atgaacttat ctgtggagta cattgaagaa tatttttctt cctagatttt 4081 gttgtttaaa ttatggggcc taacctgcca cttatttttt gtcaattttt aaaacttttt 4141 tttaattact gtaaagaaaa tgaatttttt cctgcagcag gaaacatagt tttcagtagt 4201 tctacctctt atttgtagct gccaggcttt ctgtaaaaat tgtattgtat ataatgtgat 4261 ttttacacat acatacacac acaaatacac aatctctagg gtaagccaga aggcaagatc 4321 agattaaaaa caccatgttt ctaagcatcc atttttccct ttctttaaaa gaaacttaac 4381 tgttctatga aggagattga gggagaagag acaaactcct atgtcatgag aataaccgat 4441 gttctgataa tagtagcatc taggtacaga tgctggttgt attaccacgt caatgtccta 4501 tgcagtattg ttagacattt tctcattttg aaatatttgt gtgtttgtgt atgtgctctg 4561 tgccatggct ggtgtatata tgtgcaatgt tagaaggcaa aagagtgatg gtaggcagag 4621 ggcaaagtca ttgaatctct tatgccagtt ttcataaaac ccaaaccaca tatgaaaaaa 4681 tccattaagg gtccaagaag tctgtccata tgaaaatgag ggtaaatata gtttatttcc 4741 caggtatcag tcattataat tgatataata gctctaacat gcaatataaa attcatagga 4801 gtattaatag cccatttaca catctataaa atgtaatggg attgcagagc tgcagagtac 4861 agtgtaacag tactctcatg caattttttt caggatgcaa aggcaattat tctttgtaag 4921 cgggacattt agatatattt gtgtacatat tatatgtatg tatatttcaa agtaccacac 4981 tgaaaattag acatttatta accaaattta acgtggtatt taaaggtaat atttttaata 5041 tgatacatta catattgtga atgtatacta aaaaaacatt ttaaatgtta aaattataat 5101 ttcagattca tataaccaca actgtgatat atcctaacta taaccagttg ttgaggggta 5161 tactagaagc agaatgaaac cacatttttt ggtttgataa tatgcactta ttgactccca 5221 ctcattgtta tgttaattaa gttattattc tgtctccttg taattttgat tacaaaaatt 5281 ttattatcct gagttagctg ttacttttac agtacctgat actcctaaaa cttttaactt 5341 atacaaatta gtcaataatg accccaattt tttcattaaa ataatagtgg tgaattatat 5401 gttattgtgt taaaacctca cttgccaaat tctggcttca catttgtatt tagggctatc 5461 cttaaaatga tgagtctata ttatctagct ttctattacc ctaatataaa ctggtataag 5521 aagactttcc ttttttcttt atgcatggaa gcatcaataa attgtttaaa aaccatgtat 5581 agtaaattca gcttaacccg tgatcttctt aagttaaagg tacttttgtt ttataaaagc 5641 tctagataaa actttctttt ctgatcatga atcaagtatc tgtggtttca tgcccctctc 5701 tatacctttc aaagaactcc tgaagcaact taactcatca tttcagcctc tgagtagagg 5761 taaaacctat gtgtacttct gtttatgatc catattgata tttatgacat gaacacagaa 5821 tagtacctta catttgctaa acagacagtt aatatcaaat cctttcaata ttctgggaac 5881 ccagggaagt ttttaaaaat gtcattactt tcaaaggaac agaagtagtt aaccaaacta 5941 acaagcaaaa cctgaggttt acctagtgac accaaattat cggtatttta actgaattta 6001 cccattgact aagaatgaac cggatttggt ggtggttttg tttctatgca aactggacac 6061 aaattacaac agtaaatttt tttataagtg cttctccctt ctccatgatg tgacttccgg 6121 agataaagga ttcaaaagat aaagacaaag tacgctcaga gttgttaacc agaaagtcct 6181 ggctgtggtt gcagaaacac tgttggaaga aaagagatga ctaagtcaag tgtctgcctt 6241 atcaaaagag caaaaatgcc tctggttttg tgtttgggag aaaaatatct tggacgcact 6301 gttttccttg ataaaagtca tcttctctac tgtgtgaaat gaatacttgg aattctaatt 6361 gttttgtgtg ccaggggcag taatgtccct gcctcttctc ccaatcaagg ttgaggagtg 6421 gggctgggga gaggacttaa ctgacttaag aagtaggaaa acaaaaacct ctctcctcag 6481 ccttccacct ccaagagagg aggaaaaaca gttgtctgct gtctgtaatt cagtttgcgt 6541 gtattttatg ctcatgcacc aacccataca gagtaaatct tttatcaact atatactggt 6601 gtttaataga gaatgattgt cttccgagtt ttttggttcc ttttttaact gtgttaaagt 6661 acttgaaatg tattgactgc tgactatatt ttaaaaacaa aatgaaataa tttgagttgt 6721 attacagagg ttgacattgt tcagggatgg gacaaagcct tcttcaatcc ttttcatact 6781 acttaatgat tttggtgcag gaacctgaga ttttctgatt tatatttcat gatatttcac 6841 atttgctctt cacagcatga gcatgaagcc cagtggcacc aaatggctgg gtacaatcaa 6901 gtgatatttt gtagcacctc actatctgaa aggccatgag ttttcagatg atttcattga 6961 gcttcattgc agcctgaaat tttaaaaaag ttgtgtaata cgccaaccag tcaagttgtg 7021 ttttggccag agatttagat atgtccaatt tcctggctca tttcattgtg ctctatgggt 7081 acgtataaaa agcaagaatt ctgtttccta ggcaaacatt gcaactcagg gctaaagtca 7141 tccagtgaaa cttttagagc cagaagtaac tttgtcccag tcctacaatg tgaaaagagt 7201 gaatagttgc ctctttttag ccattttcat ggctggtaca tattcgtacg cattactttt 7261 cagaatcaat acgcactttc agatattctt atttttattc tcttaagtct ttattaactt 7321 tggagagaga aatgatgcat ctttttattt taaatgaagt agatcaacat ggtggaacaa 7381 aatgataaag aacagaaaac atttcaatat attactaata actttttcca atataaatcc 7441 taaaattcct ataacatagt attttacagt tttatgaagc tttctattgt gacttttatg 7501 gaattaagag atgaagaaga tgagatattt tagcatttat atttttcaaa attatatgta 7561 tacttaaaaa taaagtaact ttatgc // LOCUS AF007217 6411 bp mRNA PRI 14-JUL-1997 DEFINITION Homo sapiens Trip230 mRNA, complete cds. ACCESSION AF007217 NID g2253416 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6411) AUTHORS Chang,K.H., Chen,Y., Chen,T.T., Chou,W.H., Chen,P.L., Ma,Y.T., Yang-Feng,T.L., Leng,L., Tsai,M.J., O'Malley,B.W. and Lee,W.H. TITLE A novel thyroid hormone receptor coactivator negatively regulated by the retinoblastoma protein JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1997) In press REFERENCE 2 (bases 1 to 6411) AUTHORS Chang,K.H., Chen,Y., Chen,T.T., Chou,W.H., Chen,P.L., Ma,Y.T., Yang-Feng,T.L., Leng,L., Tsai,M.J., O'Malley,B.W. and Lee,W.H. TITLE Direct Submission JOURNAL Submitted (07-JUN-1997) Molecular Mecicine, Biotechnology, 15355 Lambda Drive, San Antonio, TX 78245, USA FEATURES Location/Qualifiers source 1..6411 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="14" /map="14q31" /cell_type="fibroblast" gene 1..6411 /gene="Trip230" CDS 352..6288 /gene="Trip230" /function="coactivator of thyroid hormone receptor-mediated transcription" /codon_start=1 /product="Trip230" /db_xref="PID:g2253417" /translation="MSSWLGGLGSGLGQSLGQVGGSLASLTGQISNFTKDMLMEGTEE VEAELPDSRTKEIEAIHAILRSENERLKKLCTDLEEKHEASEIQIKQQSTSYRNQLQQ KEVEISHLKARQIALQDQFLKLQSAAQSVPSGAGVPATTASSSFAYGISHHPSAFHDD DMDFGDIISSQQEINRLSNEVSRLESEVGHWRHIAQTSKAQGTDNSDQSEICKLQNII KELKQNRSQEIDDHQHEMSVLQNAHQQKLTEISRRHREELSDYEERIEELENLLQQGG SGVIETDLSKIYEMQKTIQVLQIEKVESTKKMEQLEDKIKDINKKLSSAENDRDILRR EQEQLNVEKRQIMEECENLKLECSKLQPSAVKQSDTMTEKERILAQSGSVEEVFRLRQ ALSDAENEIMRLSSLNQDNSLAEDNLKLKMRIEVLEKEKSLLSQEKEELQMSLLKLNN EYEVIKSTATRDISLDSELHDLRLNLEAKEQELNQSISEKETLIAEIEELDRQNQEAT KHMILIKAQLSKQQNEGDSIISKLKQDLNDEKKRVHQLEDDKMDITKELDVQFVLLIQ SEVALNDLHLTKQKLEDKVENLVDQLNKSQESNVSIQKENLELKEHIRQNEEELSRIR NELMQSLNQDSNSNFKDTLLKEREAEVRNLKQNLSELEQLNENLKKVAFDVKMENEKL VLACEDVRHQLEECLAGNNQLSLEKNTIVETLKMEKGEIEAELCWAKKRLLEEANKYE KTIEELSNARNLNTSALQLEHEHLIKLNQKKDMEIAELKKNIEQMDTDHKETKDVLSS SLEEQKQLTQLINKKEIFIEKLKERSSKLQEELDKYSQALRKNEILRQTIEEKDRSLG SMKEENNHLQEELERLREEQSRTAPVADPKTLDSVTELASEVSQLNTIKEHLEEEIKH HQKIIEDQNQSKMQLLQSLQEQKKEMDEFRYQHEQMNATHTQLFLEKDEEIKSLQKTI EQIKTQLHEERQDIQTDNSDIFQETKVQSLNIENGSEKHDLSKAETERLVKGIKEREL EIKLLNEKNISLTKQIDQLSKDEVGKLTQIIQQKDLEIQALHARISSTSHTQDVVYLQ QQLQAYAMEREKVFAVLNEKTRENSHLKTEYHKMMDIVAAKEAALIKLQDENKKLSTR FESSGQDMFRETIQNLSRIIREKDIEIDALSQKCQTLLAVLQTSSTGNEAGGVNSNQF EELLQERDKLKQQVKKMEEWKQQVMTTVQNMQHESAQLQEELHQLQAQVLVDSDNNSK LQVDYTGLIQSYEQNETKLKNFGQELAQVQHSIGQLCNTKDLLLGKLDIISPQLSSAS LLTPQSAECLRASKSEVLSESSELLQQELEELRKSLQEKDATIRTLQENNHRLSDSIA ATSELERKEHEQTDSEIKQLKEKQDVLQKLLKEKDLLIKAKSDQLLSSNENFTNKVNE NELLRQAVTNLKERILILEMDIGKLKGENEKIVETYRGKETEYQALQETNMKFSMMLR EKEFECHSMKEKALAFEQLLKEKEQGKTGELNQLLNAVKSMQEKTVVFQQERDQVMLA LKQKQMENTALQNEVQRLRDKEFRSNQELERLRNHLLESEDSYTREALAAEDREAKLR KKVTVLEEKLVSSSNAMENASHQASVQVESLQEQLNVVSKQRDETALRFCLSGTVKQY RLSLANLQMVLEHFQQEEKAMYSAELEKQKQLIAEWKKNAENLEGKVISLQECLDEAN AALDSASRLTEQLDVKEEQIEELKRQNELRQEMLDDVQKKLMSLANSSEGKVDKVLMR NLFIGHFHTPKNQRHEVLRLMGSILGVRREEMEQLFHDDQGGVTRWMTGWLGGGSKSV PNTPLRPNQQSVVNSSFSELFVKFLETESHPSIPPPKLSVHDMKPLDSPGRRKRDTNA PESFKDTAESRSGRRTDVNPFLAPRSAAVPLINPAGLGPGGPGHLLLKPISDVLPTFT PLPALPDNSAGVVLKDLLKQ" BASE COUNT 2405 a 1094 c 1405 g 1507 t ORIGIN 1 gaattcgcgg ccggcggcgt cgagttggca ggagtaaccc acggaactga ggaaagtcat 61 tagagctgag aaagaagtgg cccaatctgg acggtgggaa ttcgtgggaa tgagcagaag 121 gccctccgta ggtgactgtg tcactagagg cgggcccctg gtaaaattcc aggccaggcc 181 tctgcgtttc taggcagaac ctggagtcgg ccttgcctga gaacccagct ttgtgttatc 241 gtatcctgtc tcgcgaaggc aggcgttcaa ggatatttgg tcggatcgcc cggcggcgct 301 aaacgttttc ttttttccga gcggaccggg tcgttctcta aactcgccgc gatgtcgtcc 361 tggcttgggg gcctcggctc cggattgggc cagtctctgg gtcaagtcgg gggcagcctg 421 gcttccctca ctggccagat atcaaacttt acaaaggata tgctgatgga gggcacggag 481 gaagtggaag cagaattacc tgattctagg acaaaggaaa ttgaagccat tcatgcaatc 541 ttgagatcag agaatgaaag gcttaagaaa ctttgtactg atctagaaga gaaacatgaa 601 gcatcagaga ttcaaataaa gcagcaatct acaagttacc gaaatcaact tcaacaaaaa 661 gaggtagaaa tcagccatct taaagccaga cagattgccc tccaggatca gttcctgaaa 721 ctgcagtcag ctgctcagtc agtaccttca ggagctggtg taccagcaac cactgcatca 781 tcttcattcg cttatgggat tagtcatcat ccttcagctt tccatgacga tgacatggac 841 tttggtgata taatttcatc ccaacaagaa ataaaccgac tctcaaatga agtttcaaga 901 cttgagtctg aagttggcca ttggaggcat attgctcaga cttccaaagc acaaggaaca 961 gataactctg atcaaagtga aatatgtaaa ctacaaaata tcattaagga actaaaacag 1021 aaccgaagtc aggaaattga tgaccatcaa catgaaatgt cagtactgca gaatgcacac 1081 caacagaaat tgacagaaat aagtcgacga catcgagaag aattaagtga ctatgaagaa 1141 cgaattgaag aacttgaaaa tctgttacaa caaggtggct ctggagttat agaaactgat 1201 ctctctaaaa tctatgagat gcaaaaaact attcaagttc tacaaataga aaaagtggag 1261 tctaccaaaa aaatggaaca acttgaggat aaaataaaag atataaataa aaaattatct 1321 tctgcagaaa atgacagaga tattttgagg agagaacaag aacagctaaa tgtggaaaag 1381 agacaaataa tggaagaatg tgaaaacttg aaattggaat gtagtaaatt gcagccttct 1441 gctgtgaagc aaagtgatac tatgacagaa aaggaaagaa ttcttgccca gagtggatca 1501 gtggaagaag tgttcagact acgacaagca ctgtctgatg ccgaaaatga aataatgaga 1561 ttgagtagtt taaaccagga taacagtctt gctgaagaca atctgaaact taaaatgcgt 1621 atcgaagttt tagaaaaaga gaagtcatta ctgagtcaag aaaaggaaga acttcagatg 1681 tcacttttaa aattgaacaa tgaatatgaa gtaattaaaa gtacagctac aagagacata 1741 agtttggatt cagaattaca tgacttaaga cttaatttgg aggcaaagga acaagaactc 1801 aatcagagta ttagtgaaaa ggaaacactg atagccgaga tagaagaatt ggacagacag 1861 aatcaagaag ctacaaagca catgattttg ataaaagctc agctatcaaa acaacaaaat 1921 gaaggagata gcatcatcag taaactgaaa caagatctaa atgatgaaaa aaagagagtt 1981 catcaacttg aagatgataa aatggacatt actaaagagt tagatgtaca gtttgttttg 2041 ctaattcaaa gtgaagtggc cctaaatgat ttacatttaa ccaagcagaa acttgaggac 2101 aaagtagaaa atttagtaga tcagctaaat aaatcacaag aaagtaatgt aagcatccag 2161 aaggagaatt tagaacttaa ggagcatatt agacaaaatg aggaggagct ttctagaata 2221 aggaatgagt taatgcagtc tctaaatcaa gactctaata gtaattttaa ggatacctta 2281 cttaaagaaa gagaagctga agttagaaac ttaaagcaaa atctttcaga attagaacag 2341 ctcaatgaaa atttaaagaa agttgctttt gatgtcaaaa tggaaaatga aaagttagtt 2401 ttagcatgtg aagatgtgag gcatcagtta gaagaatgtc ttgctggtaa caatcagctt 2461 tctctggaaa aaaacactat tgtggagact ctaaaaatgg aaaaaggaga gatagaggca 2521 gaattgtgtt gggctaaaaa gaggctgttg gaagaagcaa acaagtatga gaaaaccatt 2581 gaagaactgt caaatgcacg taatttgaat acctctgcct tacagctgga acatgagcat 2641 ttaattaaac tcaatcaaaa gaaagacatg gaaatagcag aactcaaaaa gaatattgaa 2701 caaatggata ctgaccataa agaaactaag gacgttttgt catctagttt agaagagcag 2761 aagcagttga cacaacttat aaacaagaaa gaaattttta ttgaaaagct taaagaaaga 2821 agttcaaagc tgcaggagga attggataaa tattctcagg ccttaagaaa aaatgaaatt 2881 ttaagacaga ccatagagga aaaagaccga agtcttggat ccatgaaaga ggaaaataat 2941 catctgcaag aagaattgga acgactcagg gaagagcaga gtcgaaccgc acctgtggct 3001 gaccctaaaa cccttgatag tgttactgaa ctagcatctg aggtatctca actgaacacg 3061 atcaaggaac atcttgaaga ggaaattaaa catcatcaaa agataattga agatcaaaac 3121 cagagtaaga tgcaactact tcagtcttta caagagcaaa agaaggaaat ggatgagttt 3181 agataccagc atgagcaaat gaacgccaca cacacccagc tctttttaga gaaggatgag 3241 gaaattaaga gtttgcaaaa aacaattgaa caaatcaaaa cccagttgca tgaagaaaga 3301 caggacattc aaacagataa ctctgatatt tttcaagaaa caaaagttca gagccttaat 3361 atagaaaatg gaagtgaaaa gcatgattta tctaaagctg aaacggaaag attagtgaaa 3421 ggaataaaag agcgagaact ggagattaaa cttctaaatg aaaagaatat atctttaact 3481 aaacagattg atcagttgtc caaagatgaa gttggtaaac taactcagat tattcagcag 3541 aaagatttgg agatacaagc tcttcatgct agaatttctt caacttccca tactcaagat 3601 gttgtttacc ttcaacagca actgcaggct tatgctatgg aaagagaaaa ggtatttgct 3661 gttttgaatg agaagactag ggaaaatagc catctaaaaa cagaatatca caaaatgatg 3721 gatattgttg ctgccaagga agcagctctt atcaaactgc aagatgaaaa taaaaaattg 3781 tccactagat ttgaaagtag tggccaagat atgtttagag aaactattca gaatttatca 3841 cgtatcattc gagaaaaaga catcgaaata gatgcactaa gtcagaaatg tcagacttta 3901 ttggcagttt tacaaacatc cagcactggt aatgaggctg gaggtgttaa tagtaatcaa 3961 tttgaggagc ttctacagga acgtgacaag ttaaaacagc aagtaaagaa aatggaagag 4021 tggaagcagc aggtgatgac cacagtacaa aatatgcaac acgagtcagc ccagcttcag 4081 gaagagcttc accaacttca agcacaggtt ttggttgaca gtgataataa ttctaaatta 4141 caagtggact atactggcct gatccaaagt tatgagcaga atgaaaccaa actcaaaaat 4201 tttgggcagg aattagcaca agttcagcac agcattgggc agctttgcaa taccaaggat 4261 cttcttttag gaaaacttga tattatttca ccccagctgt cttctgcatc attgcttact 4321 ccccagtctg cagagtgtct tagagcaagt aagtctgaag tattgagtga atcttctgaa 4381 ttgcttcagc aagagttaga agagctaaga aaatcactac aggaaaaaga tgcaacaatt 4441 agaactctcc aggaaaataa ccacagattg tctgattcga ttgctgccac ctcagagcta 4501 gaaagaaaag aacacgaaca aaccgattca gaaatcaagc agctaaagga gaaacaagat 4561 gttttgcaaa agttacttaa ggaaaaagac ctcttaatca aagccaaaag tgatcaacta 4621 ctttcttcca atgaaaattt cactaacaaa gtaaatgaaa acgaactttt gaggcaggca 4681 gtaacaaacc tgaaggagag aatattaatt ctagagatgg acattggcaa actaaaagga 4741 gaaaatgaaa aaatagtgga aacatacagg ggaaaggaaa cagaatatca agcgttacaa 4801 gagactaaca tgaagttttc tatgatgctg cgagaaaaag agtttgagtg ccactcaatg 4861 aaggagaagg ctcttgcttt tgaacagcta ttgaaagaga aagaacaggg caagactgga 4921 gagttaaatc agcttttaaa tgcagttaaa tcaatgcagg agaagacagt tgtgtttcaa 4981 caggagagag accaagtcat gttggccctg aaacaaaaac aaatggaaaa tactgcccta 5041 cagaatgagg ttcaacgttt acgtgacaaa gaatttcgtt caaaccaaga gctagagaga 5101 ttgcgtaatc atcttttaga atcagaagat tcttataccc gtgaagcttt ggctgcagaa 5161 gatagagagg ctaaactaag aaagaaagtc acagtattgg aggaaaagct agtttcatcc 5221 tctaatgcaa tggaaaatgc aagccatcaa gccagtgtgc aggtagagtc attgcaagaa 5281 cagttgaatg tagtttccaa gcaaagggat gaaactgcgc tgcgcttctg tctctcagga 5341 accgtaaagc agtatcgtct gtcactggcc aacctgcaga tggtactaga gcatttccaa 5401 caagaggaaa aagctatgta ttctgctgaa ctcgaaaagc aaaaacagct tatagctgaa 5461 tggaagaaaa acgcagaaaa tctggaagga aaagtgatat cattacagga atgtttggat 5521 gaagcaaatg ctgcattgga ttcagcatca agacttacag aacagttaga tgtaaaagaa 5581 gaacaaattg aagaacttaa aagacaaaat gagctccgac aagaaatgct ggatgatgta 5641 caaaagaaat tgatgagctt agcaaacagc tcagaaggaa aagtagacaa agtcctaatg 5701 agaaacctct tcattggtca tttccacaca ccgaaaaatc agcgtcatga agtgttacgg 5761 ttaatgggga gcatcctggg cgtcagaagg gaggagatgg agcagttgtt tcatgacgac 5821 cagggcggtg ttaccaggtg gatgactggg tggcttggag gaggatcaaa aagtgttccc 5881 aacacacctt tgagaccaaa tcagcaatct gtggttaata gttctttttc agaacttttt 5941 gttaaatttc tagaaacaga atctcatcca tccattccac caccaaagct ttctgttcat 6001 gatatgaaac ctctggattc accaggaaga agaaaaagag atacaaatgc accagaaagt 6061 tttaaagata cagcagaatc caggtctggt agaagaacag atgtaaatcc gtttttggct 6121 cctcgctcgg cagctgtacc tcttattaac ccagctggac ttggacctgg tgggcccggg 6181 catcttcttc tgaaacccat ctcagatgtt ttgcccacat ttacaccttt gccagcgtta 6241 cctgacaaca gtgctggggt tgtgctgaaa gaccttttaa agcaatagat gattctcaag 6301 ccagagacaa tctagcactt taaagaaacc atgaacacta tatgtatgta ctttatcaca 6361 aagtggcctt tggggagaaa gtcatgtatt tggttcggcg gccgcgaatt c // LOCUS AF007393 1479 bp mRNA PRI 30-JAN-1998 DEFINITION Homo sapiens P52rIPK mRNA, complete cds. ACCESSION AF007393 NID g2822276 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1479) AUTHORS Gale,M. Jr., Blakely,C.M., Hopkins,D.A., Melville,M.W., Wambach,M., Romano,P.R. and Katze,M.G. TITLE Regulation of interferon-induced protein kinase PKR: modulation of P58IPK inhibitory function by a novel protein, P52rIPK JOURNAL Mol. Cell. Biol. 18 (2), 859-871 (1998) MEDLINE 98107671 REFERENCE 2 (bases 1 to 1479) AUTHORS Gale,M.J. Jr. and Katze,M.G. TITLE Direct Submission JOURNAL Submitted (06-JUN-1997) Microbiology, University of Washington, 1705 NE Pacific St., Box 357242, Seattle, WA 98195, USA FEATURES Location/Qualifiers source 1..1479 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11q13.5" /cell_line="HeLa" CDS 1..1479 /function="repressor of p58IPK protein kinase inhibitor" /function="upstream regulator of interferon induced protein kinase PKR" /note="52 kDa" /codon_start=1 /product="P52rIPK" /db_xref="PID:g2822277" /translation="MPNFCAAPNCTRKSTQSDLAFFRFPRDPARCQKWVENCRRADLE DKTPDQLNKHYRLCAKHFETSMICRTSPYRTVLRDNAIPTIFDLTSHLNNPHSRHRKR IKELSEDEIRTLKQKKIDETSEQEQKHKETNNSNAQNPSEEEGEGQDEDILPLTLEEK ENKEYLKSLFEILILMGKQNIPLDGHEADEIPEGLFTPDNFQALLECRINSGEEVLRK RFETTAVNTLFCSKTQQRQMLEICESCIREETLREVRDSHFFSIITDDVVDIAGEEHL PVLVRFVDESHNLREEFIGFLPYEADAEILAVKFHTMITEKWGLNMEYCRGQAYIVSS GFSSKMKVVASRLLEKYPQAIYTLCSSCALNMWLAKSVPVMGVSVALGTIEEVCSFFH RSPQLLLELDNVISVLFQNSKERGKELKEICHSQWTGRHDAFEILVELLQALVLCLDG INSDTNIRWNNYIAGRAFVLCSAVSDFDFIVTIVVLKNEIKI" BASE COUNT 477 a 272 c 328 g 402 t ORIGIN 1 atgccgaact tctgcgctgc ccccaactgc acgcggaaga gcacgcagtc cgacttggcc 61 ttcttcaggt tcccgcggga ccctgccaga tgccagaagt gggtggagaa ctgtaggaga 121 gcagacttag aagataaaac acctgatcag ctaaataaac attatcgatt atgtgccaaa 181 cattttgaga cctctatgat ctgtagaact agtccttata ggacagttct tcgagataat 241 gcaataccaa caatatttga tcttaccagt catttgaaca acccacatag tagacacaga 301 aaacgaataa aagaactgag tgaagatgaa atcaggacac tgaaacagaa aaaaattgat 361 gaaacttctg agcaggaaca aaaacataaa gaaaccaaca atagcaatgc tcagaacccc 421 agcgaagaag agggtgaagg gcaagatgag gacattttac ctctaaccct tgaagagaag 481 gaaaacaaag aatacctaaa atctctattt gaaatcttga ttctgatggg aaagcaaaac 541 atacctctgg atggacatga ggctgatgaa atcccagaag gtctctttac tccagataac 601 tttcaggcac tgctggagtg tcggataaat tctggtgaag aggttctgag aaagcggttt 661 gagacaacag cagttaacac gttgttttgt tcaaaaacac agcagaggca gatgctagag 721 atctgtgaga gctgtattcg agaagaaact ctcagggaag tgagagactc acacttcttt 781 tccattatca ctgacgatgt agtggacata gcaggggaag agcacctacc tgtgttggtg 841 aggtttgttg atgaatctca taacctaaga gaggaattta taggcttcct gccttatgaa 901 gccgatgcag aaattttggc tgtgaaattt cacactatga taactgagaa gtggggatta 961 aatatggagt attgtcgtgg ccaggcttac attgtctcta gtggattttc ttccaaaatg 1021 aaagttgttg cttctagact tttagagaaa tatccccaag ctatctacac actctgctct 1081 tcctgtgcct taaatatgtg gttggcaaaa tcagtacctg ttatgggagt atctgttgca 1141 ttaggaacaa ttgaggaagt ttgttctttt ttccatcgat caccacaact gcttttagaa 1201 cttgacaacg taatttctgt tctttttcag aacagtaaag aaaggggtaa agaactgaag 1261 gaaatctgcc attctcagtg gacaggcagg catgatgctt ttgaaatttt agtggaactc 1321 ctgcaagcac ttgttttatg tttagatggt ataaatagtg acacaaatat tagatggaat 1381 aactatatag ctggccgagc atttgtactc tgcagtgcag tgtcagattt tgatttcatt 1441 gttactattg ttgttcttaa aaatgaaatc aaaatctga // LOCUS AF007545 1029 bp mRNA PRI 30-JUL-1997 DEFINITION Homo sapiens SIV/HIV receptor Bonzo (Bonzo) mRNA, complete cds. ACCESSION AF007545 NID g2253421 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1029) AUTHORS Deng,H.K., Unutmaz,D., KewalRamani,V.N. and Littman,D.R. TITLE Expression cloning of new receptors used by simian and human immunodeficiency viruses JOURNAL Nature 388 (6639), 296-300 (1997) MEDLINE 97373958 REFERENCE 2 (bases 1 to 1029) AUTHORS Deng,H., Unutmaz,D., KewalRamani,V.N. and Littman,D.R. TITLE Direct Submission JOURNAL Submitted (08-JUN-1997) Molecular Pathogenesis, Skirball Institute for Biomolecular Medicine, 540 First Avenue, New York, NY 10016, USA FEATURES Location/Qualifiers source 1..1029 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T cell" gene 1..1029 /gene="Bonzo" CDS 1..1029 /gene="Bonzo" /note="seven-transmembrane G protein-coupled receptor; potential chemokine receptor" /codon_start=1 /product="SIV/HIV receptor Bonzo" /db_xref="PID:g2253422" /translation="MAEHDYHEDYGFSSFNDSSQEEHQDFLQFSKVFLPCMYLVVFVC GLVGNSLVLVISIFYHKLQSLTDVFLVNLPLADLVFVCTLPFWAYAGIHEWVFGQVMC KSLLGIYTINFYTSMLILTCITVDRFIVVVKATKAYNQQAKRMTWGKVTSLLIWVISL LVSLPQIIYGNVFNLDKLICGYHDEAISTVVLATQMTLGFFLPLLTMIVCYSVIIKTL LHAGGFQKHRSLKIIFLVMAVFLLTQMPFNLMKFIRSTHWEYYAMTSFHYTIMVTEAI AYLRACLNPVLYAFVSLKFRKNFWKLVKDIGCLPYLGVSHQWKSSEDNSKTFSASHNV EATSMFQL" BASE COUNT 229 a 275 c 238 g 287 t ORIGIN 1 atggcagagc atgattacca tgaagactat gggttcagca gtttcaatga cagcagccag 61 gaggagcatc aagacttcct gcagttcagc aaggtctttc tgccctgcat gtacctggtg 121 gtgtttgtct gtggtctggt ggggaactct ctggtgctgg tcatatccat cttctaccat 181 aagttgcaga gcctgacgga tgtgttcctg gtgaacctac ccctggctga cctggtgttt 241 gtctgcactc tgcccttctg ggcctatgca ggcatccatg aatgggtgtt tggccaggtc 301 atgtgcaaga gcctactggg catctacact attaacttct acacgtccat gctcatcctc 361 acctgcatca ctgtggatcg tttcattgta gtggttaagg ccaccaaggc ctacaaccag 421 caagccaaga ggatgacctg gggcaaggtc accagcttgc tcatctgggt gatatccctg 481 ctggtttcct tgccccaaat tatctatggc aatgtcttta atctcgacaa gctcatatgt 541 ggttaccatg acgaggcaat ttccactgtg gttcttgcca cccagatgac actggggttc 601 ttcttgccac tgctcaccat gattgtctgc tattcagtca taatcaaaac actgcttcat 661 gctggaggct tccagaagca cagatctcta aagatcatct tcctggtgat ggctgtgttc 721 ctgctgaccc agatgccctt caacctcatg aagttcatcc gcagcacaca ctgggaatac 781 tatgccatga ccagctttca ctacaccatc atggtgacag aggccatcgc atacctgagg 841 gcctgcctta accctgtgct ctatgccttt gtcagcctga agtttcgaaa gaacttctgg 901 aaacttgtga aggacattgg ttgcctccct taccttgggg tctcacatca atggaaatct 961 tctgaggaca attccaagac tttttctgcc tcccacaatg tggaggccac cagcatgttc 1021 cagttatag // LOCUS AF007548 639 bp mRNA PRI 04-NOV-1997 DEFINITION Homo sapiens golgi SNARE (GS27) mRNA, complete cds. ACCESSION AF007548 NID g2316087 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 639) AUTHORS Lowe,S.L., Peter,F., Subramaniam,V.N., Wong,S.H. and Hong,W. TITLE A SNARE involved in protein transport through the Golgi apparatus JOURNAL Nature 389 (6653), 881-884 (1997) MEDLINE 98007979 REFERENCE 2 (bases 1 to 639) AUTHORS Lowe,S.L., Peter,F., Subramaniam,V.N., Wong,S.H. and Hong,W. TITLE Direct Submission JOURNAL Submitted (09-JUN-1997) Membrane Biology Laboratory, Institute of Molecular and Cell Biology, 15 Lower Kent Ridge Road, Singapore 119076, Singapore FEATURES Location/Qualifiers source 1..639 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..639 /gene="GS27" CDS 1..639 /gene="GS27" /codon_start=1 /product="golgi SNARE" /db_xref="PID:g2316088" /translation="MDPLFQQTHKQVHEIQSCMGRLETADKQSVHIVENEIQASIDQI FSRLERLEILSSKEPPNKRQNARLRVDQLKYDVQHLQTALRNFQHRRHAREQQERQRE ELLCRTFTTNGSDTTIPMDESLQFNSSLQKVHNGMDDLILDGHNILDGLRTQRLTLKG TQKKIPDIANMLGLSNTVMRLIEKRAFQDKYFMIGGMLLTCVVMFLVVQYLT" BASE COUNT 182 a 163 c 164 g 130 t ORIGIN 1 atggatcccc tgttccagca aacgcacaag caggtccacg agatccagtc ttgcatggga 61 cgcctggaga cggcagacaa gcagtctgtg cacatagtag aaaacgaaat ccaagcaagc 121 atagaccaga tattcagccg tctagaacgt ctggagattt tgtccagcaa ggagccccct 181 aacaaaaggc aaaatgccag acttcgggtt gaccagttaa agtatgatgt ccagcacctg 241 cagactgcgc tcagaaactt ccagcatcgg cgccatgcaa gggagcagca ggagagacag 301 cgagaagagc ttctgtgtcg aacgttcacc actaacggct ctgacaccac cataccaatg 361 gacgaatcac tgcagtttaa ctcctccctc cagaaagttc acaacggcat ggatgacctc 421 attttagatg ggcacaatat tttagatgga ctgaggaccc agagactgac cttgaagggg 481 actcagaaga agatccctga cattgccaac atgctgggct tgtccaacac agtgatgcgg 541 ctcatcgaga agcgggcttt ccaggacaag tactttatga taggtgggat gctgctgacc 601 tgtgtggtca tgttcctcgt ggtgcagtac ctgacatga // LOCUS AF007551 617 bp mRNA PRI 14-JUL-1997 DEFINITION Homo sapiens Bet1p homolog (hbet1) mRNA, complete cds. ACCESSION AF007551 NID g2253425 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 617) AUTHORS Zhang,T., Wong,S.H., Xu,Y. and Hong,W. TITLE Human homolog of yeast Bet1p (hbet1) JOURNAL Unpublished REFERENCE 2 (bases 1 to 617) AUTHORS Zhang,T., Wong,S.H., Xu,Y. and Hong,W. TITLE Direct Submission JOURNAL Submitted (09-JUN-1997) Membrane Biology Laboratory, Institute of Molecular and Cell Biology, 15 Lower Kent Ridge Road, Singapore 119076, Singapore FEATURES Location/Qualifiers source 1..617 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..617 /gene="hbet1" CDS 120..476 /gene="hbet1" /note="similar to yeast Bet1p; integral membrane protein" /codon_start=1 /product="Bet1p homolog" /db_xref="PID:g2253426" /translation="MRRAGLGEGVPPGNYGNYGYANSGYSACEEENERLTESLRSKVT AIKSLSIEIGHEVKTQNKLLAEMDSQFDSTTGFLGKTMGKLKILSRGSQTKLLCYMML FSLFVFFIIYWIIKLR" BASE COUNT 174 a 120 c 139 g 184 t ORIGIN 1 ggggaagaag ttggtgtttc gctgggccct ggtactgaag acgcggtccg ggtcgcccct 61 agctgtttcc tactcaccca aagccccgca cccgcctttt ctctctctcc tctggcagga 121 tgaggcgtgc aggcctgggt gaaggagtac ctcctggcaa ctatgggaac tatggctatg 181 ctaatagtgg gtatagtgcc tgtgaagaag aaaatgagag gctcactgaa agtctgagaa 241 gcaaagtaac tgctataaaa tctctttcca ttgaaatagg ccatgaagtt aaaacccaga 301 ataaattatt agctgaaatg gattcacaat ttgattccac aactggattt ctaggtaaaa 361 ctatgggcaa actgaagatt ttatccagag ggagccaaac aaagctgctg tgctatatga 421 tgctgttttc tttatttgtc ttttttatca tttattggat tattaaactg aggtgatgca 481 tgtaattgtg aatttggaat ttgttccaac ttaatggctt gcagtgcagt accactttga 541 taaaaatcag catcaaaaca ttcccagtgt tcaaatacgt ggcattttcc attgaaaatt 601 gctgaatttt agactta // LOCUS AF007748 3027 bp mRNA PRI 27-SEP-1997 DEFINITION Homo sapiens karyopherin beta2b homolog mRNA, complete cds. ACCESSION AF007748 NID g2440003 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3027) AUTHORS Bonifaci,N., Radu,A. and Blobel,G. TITLE Human Karyopherin beta2b: an homolog of human Karyopherin beta2 JOURNAL Unpublished REFERENCE 2 (bases 1 to 3027) AUTHORS Bonifaci,N., Radu,A. and Blobel,G. TITLE Direct Submission JOURNAL Submitted (09-JUN-1997) Laboratory of Cell Biology, The Rockefeller University, 1230 York Avenue, New York, NY 10021, USA FEATURES Location/Qualifiers source 1..3027 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 292..2955 /note="similar to the transport factor human karyopherin beta2" /codon_start=1 /product="karyopherin beta2b homolog" /db_xref="PID:g2440004" /translation="MDWQPDEQGLQQVLQLLKDSQSPNTATHRIVRDKLKQLNQFPDF NNYLIFVLTRLKSEDEPTRSLSGLILKNNVKAHYQSFPPPVADFIKQECLNNIGDASS LIRATIGILITTIASKGELQMWPELLPQLCNLLNSEDYNTCEGAFGALQKICEDSSEL LDSDALNRPLNIMIPKFLQFFKHCSPKIRSHAIGCVNQFIMDRAQALMDNIDTFIEHL FALAVDDDPEVRKNVCRALVMLLEVRIDRLIPHMHSIIQYMLQRTQDHDENVALEACE FWLTLAEQPICKEVLASHLVQLIPILVNGMKYSEIDIILLKGDVEEDEAVPDSEQDIK PRFHKSRTVTLPHEAERPDGSEDAEDDDDDDALSDWNLRKCSAAALDVLANVFREELL PHLLPLLKGLLFHPEWVVKESGILVLGAIAEGCMQGMVPYLPELIPHLIQCLSDKKAL VRSIACWTLSRYAHWVVSQPPDMHLKPLMTELLKRILDGNKKVQEAACIAFATLEEKA CTELVPYLSYILDTLVFAFGKYQHKNLLILYDAIGTLADSVGHHLNQPEYIQKLMPPL IQKWNELKDEDKDLFPLLECLSSVATALQSGFLPYCEPVYQCCVTLVQKTLAQAMMYT QHPEQYEAPDKDFMIVALDLFSGLAEGLGGHVEQLVARSNIMTLLFQCMQDSMPEVRQ SSFAFLGDFTKACSSHVKPCIAEFMPILGTNLNPEFISVCNNATWAIGEICMQMGAEM QPYVQMVLNNLVEIINRPNTPKTLLENTAITIGRLGYVCPQEVAPMLQQFIRPWCTSL RNIRDNEEKDSAFRGICMMIGVNPGGVVQDFILFCDAVASWVSPKDDLRDMFYKILHG FKDQVGEDNWQQFSEQFPPLLKERLAAFYGV" BASE COUNT 676 a 914 c 800 g 636 t 1 others ORIGIN 1 cgcggggaac cggaaaccta ggaatcccaa ccgctccttc agggtgtttt ccgcgattct 61 ccgagtggag ttaggggcgt gcagacactt ccctaacctt ggaggcccgg gagtggattt 121 tcatgaatga acggagccgg gactgacttt gcctacggca aagaaggaaa aacggtgccc 181 tactgtcttc agacgtcgca aggatcaaga aaatttcttt cttcatgacc ctgcccttgc 241 cagaatttcc tgcgctgttc cacctcattc aacttcagct tgccttgcgc catggactgg 301 cagccagacg agcagggcct gcagcaggtc ctgcagctgc tcaaagactc acagtcgccc 361 aacacagcca ctcaccgcat cgtgcgggat aaactcaaac aactcaatca gtttcctgac 421 ttcaacaact acctgatttt cgtcctgacc agactcaagt cagaagatga gccaacgcgc 481 tctctcagtg gcctcatcct caagaacaac gtgaaggcac actatcagag cttcccaccc 541 cctgtggcag acttcatcaa acaggagtgt ctcaacaaca ttggcgatgc ctcctcgctc 601 atccgagcca ccattggcat tctcatcacc accatcgctt ccaagggtga gctgcagatg 661 tggcccgagc tgctgcccca gctctgcaac ctgcttaact cggaggatta caacacttgt 721 gagggagcct ttggagccct gcagaagatc tgtgaagact catcagagct tctggacagt 781 gacgccctca acaggcccct caacatcatg atccccaagt tcctgcagtt cttcaagcac 841 tgcagtccca agatccggtc ccacgccatc ggctgcgtga accagttcat catggaccgg 901 gcccaggcgc tgatggacaa tattgacacc ttcatcgagc acctatttgc cctggctgtg 961 gatgatgacc ccgaggtgcg gaagaatgtg tgccgtgccc tggtgatgct tctggaagtg 1021 cggattgaca ggctcatccc ccacatgcac agcatcatcc agtacatgct gcagaggacc 1081 caggaccatg atgagaacgt tgcccttgag gcctgtgagt tctggctgac gctggccgag 1141 cagcccatct gcaaggaagt cctggcctcc catctggtcc agttgatccc catcttggtg 1201 aatgggatga agtactcgga aattgacatc atcctgctca agggggatgt ggaggaggat 1261 gaggctgtcc ccgacagtga gcaggacatc aagccacgct tccacaagtc acgcacggtc 1321 acactgcccc acgaggctga gcggcctgat ggctccgagg acgcggagga tgacgatgat 1381 gatgatgctc tgtccgactg gaatttgagg aagtgctcag cggctgcact ggacgtcctc 1441 gccaatgtct tccgggagga actgctgccc cacctactcc cactactcaa aggcctcctc 1501 ttccaccccg agtgggtggt caaggagtcg ggcatcctgg tgctgggcgc cattgctgag 1561 ggctgcatgc agggcatggt gccctacctg cctgagctga tcccgcacct gatccagtgc 1621 ctgtcggata agaaagcctt ggtccgctcc atcgcctgct ggacgctgag ccgctatgcc 1681 cactgggtgg tcagccagcc acccgacatg cacctcaagc ccctgatgac agagctgctc 1741 aaacgcatcc tggatggcaa caagaaggta caggaggcgg cctgcattgc ttttgccacc 1801 ctggaagaaa aggcctgtac ggagctggtg ccctacctca gctacatcct ggacaccctt 1861 gtctttgcct ttgggaaata ccagcacaag aacctgctca tcctctatga cgccattggc 1921 accctggccg actctgttgg ccaccacctc aaccagccgg aatacatcca gaagctgatg 1981 cccccactga tccagaagtg gaatgagctc aaggacgaag acaaggacct cttccccctg 2041 ctggagtgtc tgtcatcggt ggccaccgcc ctgcagagtg gcttcctgcc ttactgtgag 2101 cccgtctacc agtgctgtgt caccctggtg cagaagacac tggctcaggc catgatgtac 2161 acccagcacc ctgagcagta tgaggctccc gacaaggact ttatgatagt agcactggat 2221 ctgttcagcg gcctggccga gggcttggga ggtcacgtgg agcagctggt ggcccgcagc 2281 aacatcatga cattgttgtt ccagtgcatg caggactcga tgcctgaggt ccggcagagt 2341 tcttttgcct tcttgggaga cttcaccaaa gcctgttcat cccatgtcaa gccctgtatc 2401 gccgagttca tgcccattct gggcaccaac ctgaacccag agttcatctc cgtctgcaac 2461 aacgccacct gggccattgg tgaaatttgc atgcagatgg gggcagagat gcagccttat 2521 gtgcagatgg tcctcaacaa cctggtggaa atcattaacc gacccaacac acccaagaca 2581 ctgctggaaa acacagccat caccatcggc cgcttgggct acgtgtgccc ccaggaggtg 2641 gcacccatgc tgcagcagtt catccggcct tggtgcacgt ccctcaggaa catcagggac 2701 aacgaggaga aggactcagc cttccgcggc atctgcatga tgatcggtgt caacccgggg 2761 ggcgttgtgc aggactttat tttattctgc gatgctgtag cctcctgggt gagcccgaag 2821 gatgaccttc gggacatgtt ttataagatt ctccacggct tcaaagacca agttggggaa 2881 gataactggc agcagttctc tgagcaattc ccgccgctgc tcaaggagag gctggcggct 2941 ttctatgggg tctaggtgat cctcgagcac caccaccacc accactgaga tccggcttgc 3001 tacaaaaccc gaaaagcgct gagttng // LOCUS AF007833 2128 bp mRNA PRI 01-NOV-1997 DEFINITION Homo sapiens kruppel-related zinc finger protein hcKrox mRNA, complete cds. ACCESSION AF007833 NID g2257985 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2128) AUTHORS Widom,R.L., Culic,I., Lee,J.Y. and Korn,J.H. TITLE Cloning and characterization of hcKrox, a transcriptional regulator of extracellular matrix gene expression JOURNAL Gene 198 (1-2), 407-420 (1997) MEDLINE 98036076 REFERENCE 2 (bases 1 to 2128) AUTHORS Widom,R.L., Culic,I., Lee,J. and Korn,J.H. TITLE Direct Submission JOURNAL Submitted (11-JUN-1997) Arthritis Center, Boston Univ. School of Medicine, 80 E. Concord Street, Boston, MA 02118, USA FEATURES Location/Qualifiers source 1..2128 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="skin fibroblast" CDS 213..1832 /note="zinc-finger transcription factor" /codon_start=1 /product="kruppel-related zinc finger protein hcKrox" /db_xref="PID:g2257986" /translation="MGSPEDDLIGIPFPDHSSELLSCLNEQRQLGHLCDLTIRTQGLE YRTHRAVLAACSHYFKKLFTEGGGGAVMGAGGSGTATGGAGAGVCELDFVGPEALGAL LEFAYTATLTTSSANMPAVLQAARLLEIPCVIAACMEILQGSGLEAPSPDEDDCERAR QYLEAFATATASGVPNGEDSPPQVPLPPPPPPPPRPVARRSRKPRKAFLQTKGARANH LVPEVPTVPAHPLTYEEEEVAGRVGSSGGSGPGDSYSPPTGTASPPEGPQSYEPYEGE EEEEELVYPPAYGLAQGGGPPLSPEELGSDEDAIDPDLMAYLSSLHQDNLAPGLDSQD KLVRKRRSQMPQECPVCHKIIHGAGKLPRHMRTHTGEKPFACEVCGVRFTRNDKLKIH MRKHTGERPYSCPHCPARFLHSYDLKNHMHLHTGDRPYECHLCHRAFAKEDHLQRHLK GQNCLEVRTRRRRKDDAPPHYPPPSTAAAFPAGLDLSNGHLDTFRLSLARFWEQSAPT WAPVSTPGPPDDDEEEGAPTTPQAEGAMESS" BASE COUNT 462 a 683 c 616 g 367 t ORIGIN 1 tcacctgtct gaacctcagt ttccttgtct gtgaaacaga tgatgatcat acttcagagg 61 gttctggaag atgaaatgag tgtaaagtac tcagaagggt gactggcatg cagtcatccc 121 tcaataagta ttggctaaag ttggtcctct ttccccagac cctgtgggct gagcgctctt 181 aatctcccct ctacttgact ctgcaggaga agatggggag ccccgaggat gacctgattg 241 ggattccatt cccggaccac agcagtgagc tcctgagctg cctcaatgag cagcgccagc 301 tgggccacct atgtgacctc accatccgga cgcagggcct tgaataccgc acccacaggg 361 ctgtgctagc tgcctgtagc cactacttca agaagctttt cactgagggc ggtggcggag 421 ctgtcatggg ggccgggggt agcgggacgg ccactggggg agcaggggcc ggtgtgtgtg 481 agctggactt tgtagggcca gaggcactag gcgccctcct tgaatttgcc tatacagcca 541 cactgaccac cagcagcgcc aacatgccag ctgtgctcca ggctgcccgc ctgctggaga 601 tcccgtgtgt catcgctgct tgcatggaga ttctgcaggg cagtgggcta gaagctccca 661 gcccggacga ggatgactgt gagcgagccc gccagtatct ggaggccttt gccacagcca 721 cggcctctgg agttcccaat ggtgaagaca gtcctccaca ggtgcccctc ccaccacctc 781 cgccaccgcc acctcggcct gttgcccgcc gcagccgcaa gccccggaaa gctttcctgc 841 aaaccaaggg ggccagagca aaccacctag tccctgaggt gcccacagtg cccgcccatc 901 ccttgaccta tgaggaggag gaggtggcgg gcagagtggg cagcagtggg ggcagtgggc 961 cgggggacag ctacagccct cccacaggaa ctgcctcccc tcctgagggt ccccagagct 1021 acgaacccta tgagggtgag gaagaagaag aggagctggt atatccccca gcctatgggc 1081 tggcgcaggg tggcgggccc ccgctgtccc cagaggagct gggctcagat gaggatgcca 1141 tcgatcctga cctgatggcc tacctaagct ccctgcacca ggacaacctg gcaccaggcc 1201 tggacagcca agacaagctg gtgcgcaaac gccgctccca gatgcctcag gagtgccctg 1261 tctgccacaa gatcatccat ggggcaggca aactgcctcg ccacatgagg acccacacag 1321 gcgagaagcc ctttgcctgc gaggtctgcg gtgttcgatt caccaggaac gacaagctga 1381 agatccacat gcggaagcac acgggagagc gcccctactc atgcccgcac tgcccagccc 1441 gcttcctgca cagctacgac ctcaagaacc acatgcacct gcacacaggg gaccggccct 1501 atgagtgcca cctgtgccac agggctttcg ccaaggagga ccacctgcag cgccacctca 1561 aaggccagaa ctgcctggag gtgcgcaccc gacggcgccg caaggacgat gcaccacccc 1621 actacccacc accctctacc gctgctgcat tccccgctgg cctcgacctc tccaatggcc 1681 acctggacac cttccgcctc tctctagctc gattctggga gcagtcagcc cccacctggg 1741 ccccggtctc taccccgggg ccccctgatg acgatgagga ggaaggggca cccaccacac 1801 cccaggctga aggtgccatg gagtcctctt aaagagggac gagggccaga ctgaagcagc 1861 acaaggccgg ggacacccat gccaagcagt gggagcacgc aggacagaca cagcaggggt 1921 ctggggcacg gagccttgcc ggcatcagca tcagcccttc ctcccagagc cctcattcca 1981 attccaagct aagaaggtat tggggcagag gctccccaaa ttggggtgat cccccaagga 2041 gtgatacata tattgtgtat atatttacag ctgtattgta aaagtggggt ccctgtccca 2101 gctgctcctg gggagtagaa gcaaaaaa // LOCUS AF007871 2072 bp mRNA PRI 06-SEP-1997 DEFINITION Homo sapiens torsinA (DYT1) mRNA, complete cds. ACCESSION AF007871 NID g2358278 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2072) AUTHORS Ozelius,L.J., Hewett,J.W., Page,C.E., Bressman,S.B., Kramer,P.L., Shalish,C., deLeon,D., Brin,M.F., Raymond,D., Corey,D.P., Fahn,S., Risch,N.J., Buckler,A.J., Gusella,J.F. and Breakefield,X.O. TITLE The early-onset torsion dystonia gene (DYT1) encodes an ATP-binding protein JOURNAL Nature Genet. 17 (1), 40-48 (1997) MEDLINE 97434210 REFERENCE 2 (bases 1 to 2072) AUTHORS Ozelius,L.J., Hewett,J.W., Page,C.E., Bressman,S.B., Kramer,P.L., Shalish,C., deLeon,D., Brin,M.F., Raymond,D., Corey,D.P., Fahn,S., Risch,N.J., Buckler,A.J., Gusella,J.F. and Breakefield,X.O. TITLE Direct Submission JOURNAL Submitted (11-JUN-1997) Molecular Neurogenetics, Massachusetts General Hosp., 13th St. Bldg. 149, Charlestown, MA 02129, USA FEATURES Location/Qualifiers source 1..2072 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="9" /map="9q34" gene 1..2072 /note="early onset torsion dystonia gene; DQ2" /gene="DYT1" CDS 43..1041 /gene="DYT1" /note="has ATP-binding site similar to heat shock proteins/Clp proteases" /codon_start=1 /product="torsinA" /db_xref="PID:g2358279" /translation="MKLGRAVLGLLLLAPSVVQAVEPISLGLALAGVLTGYIYPRLYC LFAECCGQKRSLSREALQKDLDDNLFGQHLAKKIILNAVFGFINNPKPKKPLTLSLHG WTGTGKNFVSKIIAENIYEGGLNSDYVHLFVATLHFPHASNITLYKDQLQLWIRGNVS ACARSIFIFDEMDKMHAGLIDAIKPFLDYYDLVDGVSYQKAMFIFLSNAGAERITDVA LDFWRSGKQREDIKLKDIEHALSVSVFNNKNSGFWHSSLIDRNLIDYFVPFLPLEYKH LKMCIRVEMQSRGYEIDEDIVSRVAEEMTFFPKEERVFSDKGCKTVFTKLDYYYDD" polyA_site 1390 /gene="DYT1" polyA_site 2054 /gene="DYT1" BASE COUNT 530 a 489 c 510 g 543 t ORIGIN 1 cgcggtcggc gcgagaacaa gcagggtggc gcgggtccgg gcatgaagct gggccgggcc 61 gtgctgggcc tgctgctgct ggcgccgtcc gtggtgcagg cggtggagcc catcagcctg 121 ggactggccc tggccggcgt cctcaccggc tacatctacc cgcgtctcta ctgcctcttc 181 gccgagtgct gcgggcagaa gcggagcctt agccgggagg cactgcagaa ggatctggac 241 gacaacctct ttggacagca tcttgcaaag aaaatcatct taaatgccgt gtttggtttc 301 ataaacaacc caaagcccaa gaaacctctc acgctctccc tgcacgggtg gacaggcacc 361 ggcaaaaatt tcgtcagcaa gatcatcgca gagaatattt acgagggtgg tctgaacagt 421 gactatgtcc acctgtttgt ggccacattg cactttccac atgcttcaaa catcaccttg 481 tacaaggatc agttacagtt gtggattcga ggcaacgtga gtgcctgtgc gaggtccatc 541 ttcatatttg atgaaatgga taagatgcat gcaggcctca tagatgccat caagcctttc 601 ctcgactatt atgacctggt ggatggggtc tcctaccaga aagccatgtt catatttctc 661 agcaatgctg gagcagaaag gatcacagat gtggctttgg atttctggag gagtggaaag 721 cagagggaag acatcaagct caaagacatt gaacacgcgt tgtctgtgtc ggttttcaat 781 aacaagaaca gtggcttctg gcacagcagc ttaattgacc ggaacctcat tgattatttt 841 gttcccttcc tccccctgga atacaaacac ctaaaaatgt gtatccgagt ggaaatgcag 901 tcccgaggct atgaaattga tgaagacatt gtaagcagag tggctgagga gatgacattt 961 ttccccaaag aggagagagt tttctcagat aaaggctgca aaacggtgtt caccaagtta 1021 gattattact acgatgattg acagtcatga ttggcagccg gagtcactgc ctggagttgg 1081 aaaagaaaca acactcagtc cttccacact tccaccccca gctcctttcc ctggaagagg 1141 aatccagtga atgttcctgt ttgatgtgac aggaattctc cctggcattg tttccacccc 1201 ctggtgcctg caggccaccc agggaccacg ggcgaggacg tgaagcctcc cgaacacgca 1261 cagaaggaag gagccagctc ccagcccact catcgcaggg ctcatgattt tttacaaatt 1321 atgttttaat tccaagtgtt tctgtttcaa ggaaggatga ataagtttta ttgaaaatgt 1381 ggtaacttta tttaaaatga tttttaacat tatgagagac tgctcagatt ctaagttgtt 1441 ggccttgtgt gtgtgttttt ttttaagttc tcatcattat tacatagact gtgaagtatc 1501 tttactggaa atgagcccaa gcacacatgc atggcatttg ttcctgaaca ggagggcatc 1561 cctggggatg tggctggagc atgagccagc tctgtcccag gatggtccca gcggatgctg 1621 ccaggggcag tgaagtgttt aggtgaagga caagtaggta agaggacgcc ttcaggcacc 1681 acagataagc ctgaaacagc ctctccaagg gttttcacct tagcaacaat gggagctgtg 1741 ggagtgattt tggccacact gtcaacattt gttagaacca gtcttttgaa agaaaagtat 1801 ttccaacttg tcacttgcca gtcactccgt tttgcaaaag gtggcccttc actgtccatt 1861 ccaaatagcc cacacgtgct ctctgctgga ttctaaatta tgtgaatttt gccatattaa 1921 atcttcctca tttatactat tatttgttac gttcaatcag aatccccgaa acctcctata 1981 aagcttagct gccccttctg aggatgctga gaacggtgtc tttctttata aatgcaaatg 2041 gctaccgttt tacaataaaa ttttgcatgt gc // LOCUS AF008192 2700 bp mRNA PRI 02-AUG-1997 DEFINITION Homo sapiens putative GR6 protein (GR6) mRNA, complete cds. ACCESSION AF008192 NID g2291083 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2700) AUTHORS Pekarsky,Y., Rynditch,A., Wieser,R., Fonatsch,C. and Gardiner,K. TITLE Activation of a novel gene in 3q21 and identification of intergenic fusion transcripts with EVI1 in leukemia JOURNAL Unpublished REFERENCE 2 (bases 1 to 2700) AUTHORS Pekarsky,Y., Rynditch,A., Wieser,R., Fonatsch,C. and Gardiner,K. TITLE Direct Submission JOURNAL Submitted (12-JUN-1997) Eleanor Roosevelt Institute, 1899 Gaylord St., Denver, CO 80206, USA FEATURES Location/Qualifiers source 1..2700 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="3" /map="3q21" /note="isolated from the 3q21 leukemia cluster region" gene 1..2700 /gene="GR6" CDS 968..1417 /gene="GR6" /codon_start=1 /product="putative GR6 protein" /db_xref="PID:g2291084" /translation="MKEALHQIVVRCSELVSSTSLPRLSVSRLQGPPDSQPLGTLGQG GWKLLGIVGSLAPETLGGLGTEFGPCTHPLPFDMVRERERDDELRQGWLLQCPQCART LLCHCGPFLTPPSQTSSSGFQLCSLKPSGSLVTATEPLSNFAFSYFP" BASE COUNT 562 a 755 c 734 g 649 t ORIGIN 1 agcaattggg aacagaaagt gcttcctgtc ccggcctgag acaatggaca ataaacccag 61 cctggtgggc cccggcccct gaccccaagg cttcaggaag aggctcccac acgcacacac 121 ctgcggtgtg cgaagctcgc attgttctgg gctcatggcc gagtagggcc ccttgctctc 181 aggagcctcc agctctttct ggacctgagc cctgggagcc ttcactctgc ccagatgtgc 241 ttcctgtgga aggagacccc cagatgacac agatggagtc ttgctctgtc acccaggctg 301 gaatgcagtg gcatgatctt ggctcactga aacctctgcc tcccggttca agcaattctc 361 ctgcctcagc ctcctgagta gctgggatta cagaaaatct gagcagtggg tgaggccgag 421 ggactgccca gagccccaga gcacggcagt gggcaagcag tcgtccctgt cctgggcatc 481 cctgtggcca tggagctcct gggtgagcag ctgctctgag acaagtctgg tgaagaatac 541 agcaggactc ctgaacattc ttccttaaat gcttatctgt ggaaaccaga actgcccttg 601 gcgcgtggct cgtggacgtg ctgggtggct gggtggggcg ggtgcacggg tggagcctca 661 gctctgtttc tttctctggg tagtgtcccc actgccgggc ttctcccgct ctcagctctc 721 tggggaggtc ccctgtgaag ggcctgtctt tcctgtcccc tccctcggat cattaatgaa 781 ccactctctc tctctttttc cttttcagtg attgggctgt cggaatcaaa gaggccctcc 841 ggtgactcag gcgtttccca ttgccctatt ctagcatcct ctactcttag cactgagagt 901 ttttcacgtc ctatttggaa ctgataggaa accctttcat tgtttcgcta catggatatt 961 agcactgatg aaggaagcgc tccatcagat agtggtccgc tgcagtgagc tggtttccag 1021 cacgagcctg cctagactca gcgtctcccg cctccaggga cccccagact ctcagcccct 1081 aggcaccctg ggccagggtg gttggaagct tctaggcatt gtggggtctc tggcaccaga 1141 gacactcggg ggtctgggga ccgagtttgg gccctgtacc cacccactac catttgacat 1201 ggtgagagag agagagagag atgacgagct caggcaggga tggcttctcc agtgcccaca 1261 atgtgccagg actctgctgt gccactgtgg gcctttcctc acccctccct cccagacatc 1321 aagctccggt ttccagctgt gttccttgaa gccgtcgggc tctctggtga cagccacaga 1381 gcctctgagc aactttgctt tcagctattt tccatgaccc agtcctgggt aaggcttcat 1441 ttgaatgaag ggttctgctg ctaaaaaaaa aaaaaaagtt ggaaaaacca cctgatcaca 1501 actttctaag gcagctattt tgttttgcaa tgagatattt gcagtggttt cactaaagcc 1561 tgatcttccc actggactgt cggcctgtga ggatgaggac gtgtctatca cagtcagtat 1621 gccctgaatg aatgggtgtc accctccgcc ccctagtgag agataaggac ctcaggctca 1681 gagaggttat gtggttttct caaagccaca cagcaagatt tggcaggtga gccttacttc 1741 tcttcctggg ctagcctgcg gtcaggggca aggcttttgg atgataagag tgaggtgcag 1801 tcagctctat attgggggtg gcagagggaa aaggaggggg ggttcccaaa gcaccattct 1861 gctttctgtg agaagggatt tcccctcaac aacttctgtc ccatttttca gagctaaagt 1921 gacaagccac atccacgtcc ctggttaaag gtgctccccc tgcaggaaac acatgtgcac 1981 ttgggatgga gccctctgtg cgggagcggc ttgctcacct ctcagccagt gccagctggg 2041 tgggtctctc ccagctctcg aatcctaact gcagccacca tttattcagc acctgttatg 2101 taccaggcac tttatgtatc ttattgctaa ccctcaaaac aagcccttcc caacataagc 2161 gcttttgtgc caaacccata ggtctgaaac tgagacccag agaggctgaa tgacatgctc 2221 acagtcacac agcaggcagt ggcaggcagt agtgaagtcg gctctatgga ttgtatacct 2281 gggctcctcc aggcctctgc gctgcccggc ctgataccct agcaggttgg gtattcagag 2341 gtgcctgggg agcttgttaa aagtgtagat tccaggcctc cttaccctta gagactgtga 2401 ctcagtatcc ggggcagggc ccaggcatct tcattttgac aggccgtccc ggtggctgcc 2461 atgcgcaggc aggttttagg aacatcatgg gtggatgatt cctggtctcc ccagccctct 2521 tgacctggcc acagagagct gcctttccag cgaggtcagc cgagatgacg ctgccgggag 2581 agcagtcttc tctgtagtcc ccggagggga ctcctcggag gaaggatgag tctgagggtt 2641 ccggttgtct tggcctgaga ccacccaagg tcttttaaca gaaaaaaact tcacccaaaa // LOCUS AF008442 1277 bp mRNA PRI 20-JUL-1997 DEFINITION Homo sapiens RNA polymerase I subunit hRPA39 mRNA, complete cds. ACCESSION AF008442 NID g2266928 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1277) AUTHORS Dammann,R. and Pfeifer,G.P. TITLE Cloning and characterization of the third largest human RNA polymerase I subunit hRPA39 JOURNAL Unpublished REFERENCE 2 (bases 1 to 1277) AUTHORS Dammann,R. and Pfeifer,G.P. TITLE Direct Submission JOURNAL Submitted (13-JUN-1997) Biology, Beckman Research Institute of the City of Hope Medical Center, 1450 East Duarte Rd, Duarte, CA 91010, CA FEATURES Location/Qualifiers source 1..1277 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="keratinocyte" CDS 13..1053 /codon_start=1 /product="RNA polymerase I subunit hRPA39" /db_xref="PID:g2266929" /translation="MAASQAVEEMRSRVVLGEFGVRNVHTTDFPGNYSGYDDAWDQDR FEKNFRVDVVHMDENSLEFDMVGIDAAIANAFRRILLAEVPTMAVEKVLVYNNTSIVQ DEILAHRLGLIPIHADPRLFEYRNQGDEEGTEIDTLQFRLQVRCTRNPHAAKDSSDPN ELYVNHKVYTRHMTWIPLGNQADLFPEGTIRPVHDDILIAQLRPGQEIDLLMHCVKGI GKDHAKFSPVATASYRLLPDITLLEPVEGEAAEELSRCFSPGVIEVQEVQGKKVARVA NPRLDTFSREIFRNEKLKKVVRLARVRDHYIFSVESTGVLPPDVLVSEAIKVLMGKCR RFLDELDAVQMD" BASE COUNT 333 a 288 c 345 g 311 t ORIGIN 1 gagagagaga agatggcggc ttctcaggcg gtggaggaaa tgcggagccg cgtggttctg 61 ggggagtttg gggttcgcaa tgtccatact actgactttc ccggtaacta ttccggttat 121 gatgatgcct gggaccagga ccgcttcgag aagaatttcc gtgtggatgt agtacacatg 181 gatgaaaact cactggagtt tgacatggtg ggaattgacg cagccattgc caatgctttt 241 cgacgaattc tgctagctga ggtgccaact atggctgtgg agaaggtcct ggtgtacaat 301 aatacatcca ttgttcagga tgagattctt gctcaccgtc tggggctcat tcccattcat 361 gctgatcccc gtctttttga gtatcggaac caaggagatg aagaaggcac agagatagat 421 actctacagt ttcgtctcca ggtcagatgc actcggaacc cccatgctgc taaagattcc 481 tctgacccca acgaactgta cgtgaaccac aaagtgtata ccaggcatat gacatggatc 541 cccctgggga accaggctga tctctttcca gagggcacta tccgaccagt gcatgatgat 601 atcctcatcg ctcagctgcg gcctggccaa gaaattgacc tgctcatgca ctgtgtcaag 661 ggcattggca aagatcatgc caagttttca ccagtggcaa cagccagtta caggctcctg 721 ccagacatca ccctgcttga gcccgtggaa ggggaggcag ctgaggagtt gagcaggtgc 781 ttctcacctg gtgttattga ggtgcaggaa gtccaaggta aaaaggtggc cagagttgcc 841 aacccccggc tggatacctt cagcagagaa atcttccgga atgagaagct aaagaaggtt 901 gtgaggcttg cccgggttcg agatcattat atcttctctg ttgagtcaac gggggtgttg 961 ccaccagatg tgctggtgag tgaagccatc aaagtactga tggggaagtg ccggcgcttc 1021 ttggatgaac tagatgcggt tcagatggac tgagcttgga tgcttctgag gcaagctgaa 1081 gctttgggtt ctgactgacc caccctacag gactgctgaa cagagagccc agtgtgacta 1141 gggatcctga gttttctggg acaattccag ctttaatcaa tacattttgt taaatgtgcc 1201 ataaaatgag actttttacg cctttataag gccttagatg taaataaact cacccaaaca 1261 aaaaaaaaaa aaaaaaa // LOCUS AF008445 1445 bp mRNA PRI 30-JUL-1997 DEFINITION Homo sapiens phospholipid scramblase mRNA, complete cds. ACCESSION AF008445 NID g2282600 KEYWORDS calcium; phospholipid asymmetry; phospholipid flip/flop; activated platelets; procoagulant; apoptosis. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1445) AUTHORS Zhou,Q., Zhao,J., Stout,J.G., Luhm,R.A., Wiedmer,T. and Sims,P.J. TITLE Molecular cloning of human plasma membrane phospholipid scramblase. A protein mediating transbilayer movement of plasma membrane phospholipids JOURNAL J. Biol. Chem. 272 (29), 18240-18244 (1997) MEDLINE 97364751 REFERENCE 2 (bases 1 to 1445) AUTHORS Zhou,Q., Zhao,J., Stout,J.G., Luhm,R.A., Wiedmer,T. and Sims,P.J. TITLE Direct Submission JOURNAL Submitted (13-JUN-1997) Blood Research Institute, The Blood Center of Southeastern Wisconsin, Milwaukee, WI 53201-2178, USA FEATURES Location/Qualifiers source 1..1445 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="K-562" CDS 211..1167 /note="plasma membrane phospholipid; expressed in erythrocyte membranes, leukocytes, platelets, T & B lymphocytes, vascular endothelium, spleen, thymus, prostate, testis, uterus, intestine, colon, blood leukocytes; also expressed in the following cell lines: HL-60, Hela, K-562, MOLT-4, Raji, SW480, A549 and G361" /codon_start=1 /product="phospholipid scramblase" /db_xref="PID:g2282601" /translation="MDKQNSQMNASHPETNLPVGYPPQYPPTAFQGPPGYSGYPGPQV SYPPPPAGHSGPGPAGFPVPNQPVYNQPVYNQPVGAAGVPWMPAPQPPLNCPPGLEYL SQIDQILIHQQIELLEVLTGFETNNKYEIKNSFGQRVYFAAEDTDCCTRNCCGPSRPF TLRIIDNMGQEVITLERPLRCSSCCCPCCLQEIEIQAPPGVPIGYVIQTWHPCLPKFT IQNEKREDVLKISGPCVVCSCCGDVDFEIKSLDEQCVVGKISKHWTGILREAFTDADN FGIQFPLDLDVKMKAVMIGACFLIDFMFFESTGSQEQKSGVW" BASE COUNT 404 a 308 c 331 g 402 t ORIGIN 1 cgcggccgcg tcgaccgaaa ccaggagccg cgggtgttgg cgcaaaggtt actcccagac 61 ccttttccgg ctgacttctg agaaggttgc gcagcagctg tgcccgacag tctagaggcg 121 cagaagagga agccatcgcc tggccccggc tctctggacc ttgtctcgct cgggagcgga 181 aacagcggca gccagagaac tgttttaatc atggacaaac aaaactcaca gatgaatgct 241 tctcacccgg aaacaaactt gccagttggg tatcctcctc agtatccacc gacagcattc 301 caaggacctc caggatatag tggctaccct gggccccagg tcagctaccc acccccacca 361 gccggccatt caggtcctgg cccagctggc tttcctgtcc caaatcagcc agtgtataat 421 cagccagtat ataatcagcc agttggagct gcaggggtac catggatgcc agcgccacag 481 cctccattaa actgtccacc tggattagaa tatttaagtc agatagatca gatactgatt 541 catcagcaaa ttgaacttct ggaagtttta acaggttttg aaactaataa caaatatgaa 601 attaagaaca gctttggaca gagggtttac tttgcagcgg aagatactga ttgctgtacc 661 cgaaattgct gtgggccatc tagacctttt accttgagga ttattgataa tatgggtcaa 721 gaagtcataa ctctggagag accactaaga tgtagcagct gttgttgtcc ctgctgcctt 781 caggagatag aaatccaagc tcctcctggt gtaccaatag gttatgttat tcagacttgg 841 cacccatgtc taccaaagtt tacaattcaa aatgagaaaa gagaggatgt actaaaaata 901 agtggtccat gtgttgtgtg cagctgttgt ggagatgttg attttgagat taaatctctt 961 gatgaacagt gtgtggttgg caaaatttcc aagcactgga ctggaatttt gagagaggca 1021 tttacagacg ctgataactt tggaatccag ttccctttag accttgatgt taaaatgaaa 1081 gctgtaatga ttggtgcctg tttcctcatt gacttcatgt tttttgaaag cactggcagc 1141 caggaacaaa aatcaggagt gtggtagtgg attagtgaaa gtctcctcag gaaatctgaa 1201 gtctgtatat tgattgagac tatctaaact catacctgta tgaattaagc tgtaaggcct 1261 gtagctctgg ttgtatactt ttgcttttca aattatagtt tatcttctgt ataactgatt 1321 tataaaggtt tttgtacatt ttttaatact cattgtcaat ttgagaaaaa ggacatatga 1381 gtttttgcat ttattaatga aacttccttt gaaaaactgc tttaaaaaaa agtcgacgcg 1441 gccgc // LOCUS AF008935 1089 bp mRNA PRI 03-SEP-1997 DEFINITION Homo sapiens syntaxin-16A mRNA, complete cds. ACCESSION AF008935 NID g2352813 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1089) AUTHORS Simonsen,A., Bremnes,B., Renning,E., Aasland,R. and Stenmark,H. TITLE Cloning and expression of three forms of syntaxin-16, a new putative Golgi t-SNARE JOURNAL Unpublished REFERENCE 2 (bases 1 to 1089) AUTHORS Simonsen,A., Bremnes,B., Renning,E., Aasland,R. and Stenmark,H. TITLE Direct Submission JOURNAL Submitted (17-JUN-1997) Dept. of Biochemistry, The Norwegian Radium Hospital, Montebello, Oslo N-0310, Norway FEATURES Location/Qualifiers source 1..1089 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa S3" /note="vector=pGAD GH (Clontech)" CDS 141..1052 /note="putative Golgi t-SNARE" /codon_start=1 /product="syntaxin-16A" /db_xref="PID:g2352814" /translation="MATRRLTDAFLLLRNNSIQNRQLLAEQLADDRMALVSGISLDPE AAIGVTKRPPPKWVDGVDEIQYDVGRIKQKMKESASLHDKHLNRPTLDDSSEEEHAIE ITTQEITQLFHRCQRAVQPCRAGPGPAPSRRGGCLGTWCLVAQALQELSTSFRHAQSG YLKRMKNREERSQHFFDTSVPLMDDGDDNTLYHRGFTEDQLVLVEQNTLMVEEREREI RQMVQSISDLNEIFRDLGAMIVEQGTVLDRIDYNVEQSCIKTEDGLKQLHKAEQYQKK NRKMLVILILFVIIIVLIVVLVGVKSR" BASE COUNT 290 a 241 c 315 g 243 t ORIGIN 1 tctttttggc ttgagcctga ggccttgtcg agaagcttcc gtgaaagggt gggccagccg 61 ggccacgaga aagaaagtga ataaatcagg aatataagtg ggcggggggc ccctgagagg 121 ggggtcgcaa agggtgagac atggccacca ggcgtttaac cgacgctttc ttgttgttgc 181 ggaataattc catccaaaac cggcagctgt tagccgagca acttgctgat gaccgtatgg 241 cactggtgtc aggcatcagc ttagatccag aagcagcgat tggtgtgaca aaacggccac 301 ctcctaagtg ggtggatgga gtggatgaaa ttcagtatga tgttggccgg attaagcaga 361 agatgaaaga atcggccagc cttcatgaca agcatttaaa cagacccacc ctggatgaca 421 gcagcgaaga ggaacatgcc attgagataa ctacccaaga gatcactcag ctcttccaca 481 ggtgccagcg tgccgtgcag ccctgccgag ccgggcccgg gcctgctccg agcaggaggg 541 gcggctgctt gggaacgtgg tgcctcgtgg cgcaggccct gcaggaactc tccaccagct 601 tccggcacgc acaatcaggc tacctcaaac gcatgaagaa tcgagaggaa agatcccagc 661 attttttcga cacatcagta ccactaatgg atgatggaga cgataacact ctttaccatc 721 ggggttttac agaggaccag ttagttctgg tggagcagaa cacactgatg gtggaagagc 781 gggaacgaga gattcgccag atggtacagt ccatttctga cctgaatgaa atattcaggg 841 acttaggggc gatgattgta gaacagggta cagtccttga cagaattgac tataacgttg 901 aacagtcctg tatcaaaact gaagatggtt tgaaacagct tcacaaggca gaacagtatc 961 aaaagaagaa tcggaagatg cttgtgattt taatattatt tgtcatcatc attgtgctca 1021 ttgttgtcct cgttggcgtg aagtctcgat aagtggcatt gggttttcgt gtgtgccgcg 1081 cgtgtggat // LOCUS AF009005 2094 bp mRNA PRI 04-DEC-1997 DEFINITION Homo sapiens immunoglobulin-like transcript 2a mRNA, complete cds. ACCESSION AF009005 NID g2660701 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2094) AUTHORS Colonna,M., Navarro,F., Bellon,T., Llano,M., Garcia,P., Samaridis,J., Angman,L., Cella,M. and Lopez-Botet,M. TITLE A common inhibitory receptor for major histocompatibility complex class I molecules on human lymphoid and myelomonocytic cells JOURNAL J. Exp. Med. 186 (11), 1809-1818 (1997) MEDLINE 98044246 REFERENCE 2 (bases 1 to 2094) AUTHORS Colonna,M. TITLE Direct Submission JOURNAL Submitted (17-JUN-1997) Basel Institute for Immunology, 487 Grenzacherstrasse, Basel CH-4005, Switzerland FEATURES Location/Qualifiers source 1..2094 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="19" /cell_type="myelomonocytic; NK; T; B" /map="19q13.4" CDS 1..1953 /note="ILT2a; allelic form of ILT2" /codon_start=1 /product="immunoglobulin-like transcript 2a" /db_xref="PID:g2660702" /translation="MTPILTVLICLGLSLGPRTHVQAGHLPKPTLWAEPGSVITQGSP VTLRCQGGQETQEYRLYREKKTALWITRIPQELVKKGQFPIPSITWEHAGRYRCYYGS DTAGRSESSDPLELVVTGAYIKPTLSAQPSPVVNSGGNVILQCDSQVAFDGFSLCKEG EDEHPQCLNSQPHARGSSRAIFSVGPVSPSRRWWYRCYAYDSNSPYEWSLPSDLLELL VLGVSKKPSLSVQPGPIVAPEETLTLQCGSDAGYNRFVLYKDGERDFLQLAGAQPQAG LSQANFTLGPVSRSYGGQYRCYGAHNLSSEWSAPSDPLDILIAGQFYDRVSLSVQPGP TVASGENVTLLCQSQGWMQTFLLTKEGAADDPWRLRSTYQSQKYQAEFPMGPVTSAHA GTYRCYGSQSSKPYLLTHPSDPLELVVSGPSGGPSSPTTGPTSTSGPEDQPLTPTRSD PQSGLGRHVGVVIGILVAVVLLLLLLLLLFLILRHRRQGKHWTSTQRKADFQHPAGAV GPEPTDRGLQWRSSPAADAQEENLYAAVKHTQPEDGVEMDTRSPHDEDPQAVTYAEVK HSRPRREMASPPSPLSGEFLDTKDRQAEEDRQMDTEAAASEAPQDVTYAQLHSLTLRR KATEPPPSQEGPSPAVPSIYATLAIH" BASE COUNT 448 a 686 c 574 g 386 t ORIGIN 1 atgaccccca tcctcacggt cctgatctgt ctcgggctga gtctgggccc caggacccac 61 gtgcaggcag ggcacctccc caagcccacc ctctgggctg aaccaggctc tgtgatcacc 121 caggggagtc ctgtgaccct caggtgtcag gggggccagg agacccagga gtaccgtcta 181 tatagagaaa agaaaacagc actctggatt acacggatcc cacaggagct tgtgaagaag 241 ggccagttcc ccatcccatc catcacctgg gaacatgcag ggcggtatcg ctgttactat 301 ggtagcgaca ctgcaggccg ctcagagagc agtgaccccc tggagctggt ggtgacagga 361 gcctacatca aacccaccct ctcagcccag cccagccccg tggtgaactc aggagggaat 421 gtaatcctcc agtgtgactc acaggtggca tttgatggct tcagtctgtg taaggaagga 481 gaagatgaac acccacaatg cctgaactcc cagccccatg cccgtgggtc gtcccgcgcc 541 atcttctccg tgggccccgt gagcccgagt cgcaggtggt ggtacaggtg ctatgcttat 601 gactcgaact ctccctatga gtggtctcta cccagtgatc tcctggagct cctggtccta 661 ggtgtttcta agaagccatc actctcagtg cagccaggtc ctatcgtggc ccctgaggag 721 accctgactc tgcagtgtgg ctctgatgct ggctacaaca gatttgttct gtataaggac 781 ggggaacgtg acttccttca gctcgctggc gcacagcccc aggctgggct ctcccaggcc 841 aacttcaccc tgggccctgt gagccgctcc tacgggggcc agtacagatg ctacggtgca 901 cacaacctct cctccgagtg gtcggccccc agtgaccccc tggacatcct gatcgcagga 961 cagttctatg acagagtctc cctctcggtg cagccgggcc ccacggtggc ctcaggagag 1021 aacgtgaccc tgctgtgtca gtcacaggga tggatgcaaa ctttccttct gaccaaggag 1081 ggggcagctg atgacccatg gcgtctaaga tcaacgtacc aatctcaaaa ataccaggct 1141 gaattcccca tgggtcctgt gacctcagcc catgcgggga cctacaggtg ctacggctca 1201 cagagctcca aaccctacct gctgactcac cccagtgacc ccctggagct cgtggtctca 1261 ggaccgtctg ggggccccag ctccccgaca acaggcccca cctccacatc tggccctgag 1321 gaccagcccc tcacccccac caggtcggat ccccagagtg gtctgggaag gcacgtgggg 1381 gttgtgatcg gcatcttggt ggccgtcgtc ctactgctcc tcctcctcct cctcctcttc 1441 ctcatcctcc gacatcgacg tcagggcaaa cactggacat cgacccagag aaaagctgat 1501 ttccaacatc ctgcaggggc tgtggggcca gagcccacag acagaggcct gcagtggagg 1561 tccagcccag ctgccgatgc ccaggaagaa aacctctatg ctgccgtgaa gcacacacag 1621 cctgaggatg gggtggagat ggacactcgg agcccacacg atgaagaccc ccaggcagtg 1681 acgtatgccg aggtgaaaca ctccagacct aggagagaaa tggcctctcc tccttcccca 1741 ctgtctgggg aattcctgga cacaaaggac agacaggcgg aagaggacag gcagatggac 1801 actgaggctg ctgcatctga agccccccag gatgtgacct acgcccagct gcacagcttg 1861 acccttagac ggaaggcaac tgagcctcct ccatcccagg aagggccctc tccagctgtg 1921 cccagcatct acgccactct ggccatccac tagcccaggg ggggacgcag accccacact 1981 ccatggagtc tggaatgcat gggagctgcc cccccagtgg acaccattgg accccaccca 2041 gcctggatct accccaggag actctgggaa cttttagggg tcactcaatt ctgc // LOCUS AF009225 2273 bp mRNA PRI 15-AUG-1997 DEFINITION Homo sapiens IkB kinase alpha subunit (IKK alpha) mRNA, complete cds. ACCESSION AF009225 NID g2327068 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2273) AUTHORS DiDonato,J.A., Hayakawa,M., Rothwarf,D.M., Zandi,E. and Karin,M. TITLE A cytokine-responsive IkappaB kinase that activates the transcription factor NF-kappaB JOURNAL Nature 388 (6642), 548-554 (1997) MEDLINE 97394468 REFERENCE 2 (bases 1 to 2273) AUTHORS DiDonato,J.A., Hayakawa,M., Rothwarf,D.M., Zandi,E. and Karin,M. TITLE Direct Submission JOURNAL Submitted (18-JUN-1997) Pharmacology, UCSD, School of Medicine, 9500 Gilman Drive, La Jolla, CA 92093, USA FEATURES Location/Qualifiers source 1..2273 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..2273 /gene="IKK alpha" CDS 36..2273 /gene="IKK alpha" /note="protein kinase" /codon_start=1 /product="IkB kinase alpha subunit" /db_xref="PID:g2327069" /translation="MERPPGLRPGAGGPWEMRERLGTGGFGNVCLYQHRELDLKIAIK SCRLELSTKNRERWCHEIQIMKKLNHANVVKACDVPEELNILIHDVPLLAMEYCSGGD LRKLLNKPENCCGLKESQILSLLSDIGSGIRYLHENKIIHRDLKPENIVLQDVGGKII HKIIDLGYAKDVDQGSLCTSFVGTLQYLAPELFENKPYTATVDYWSFGTMVFECIAGY RPFLHHLQPFTWHEKIKKKDPKCIFACEEMSGEVRFSSHLPQPNSLCSLIVEPMENWL QLMLNWDPQQRGGPVDLTLKQPRCFVLMDHILNLKIVHILNMTSAKIISFLLPPDESL HSLQSRIERETGINTGSQELLSETGISLDPRKPASQCVLDGVRGCDSYMVYLFDKSKT VYEGPFASRSLSDCVNYIVQDSKIQLPIIQLRKVWAEAVHYVSGLKEDYSRLFQGQRA AMLSLLRYNANLTKMKNTLISASQQLKAKLEFFHKSIQLDLERYSEQMTYGISSEKML KAWKEMEEKAIHYAEVGVIGYLEDQIMSLHAEIMGLQKSPYGRRQGDLMESLEQRAID LYKQLKHRPSDHSYSDSTEMVKIIVHTVQSQDRVLKELFGHLSKLLGCKQKIIDLLPK VEVALSNIKEADNTVMFMQGKRQKEIWHLLKIACTQSSARSLVGSSLEGAVTPQTSAW LPPTSAEHDHSLSCVVTPQDGETSAQMIEENLNCLGHLSTIIHEANEEQGNSMMNLDW SWLTE" BASE COUNT 692 a 425 c 544 g 612 t ORIGIN 1 tcgacggaac ctgaggccgc ttgccctccc gccccatgga gcggcccccg gggctgcggc 61 cgggcgcggg cgggccctgg gagatgcggg agcggctggg caccggcggc ttcgggaacg 121 tctgtctgta ccagcatcgg gaacttgatc tcaaaatagc aattaagtct tgtcgcctag 181 agctaagtac caaaaacaga gaacgatggt gccatgaaat ccagattatg aagaagttga 241 accatgccaa tgttgtaaag gcctgtgatg ttcctgaaga attgaatatt ttgattcatg 301 atgtgcctct tctagcaatg gaatactgtt ctggaggaga tctccgaaag ctgctcaaca 361 aaccagaaaa ttgttgtgga cttaaagaaa gccagatact ttctttacta agtgatatag 421 ggtctgggat tcgatatttg catgaaaaca aaattataca tcgagatcta aaacctgaaa 481 acatagttct tcaggatgtt ggtggaaaga taatacataa aataattgat ctgggatatg 541 ccaaagatgt tgatcaagga agtctgtgta catcttttgt gggaacactg cagtatctgg 601 ccccagagct ctttgagaat aagccttaca cagccactgt tgattattgg agctttggga 661 ccatggtatt tgaatgtatt gctggatata ggcctttttt gcatcatctg cagccattta 721 cctggcatga gaagattaag aagaaggatc caaagtgtat atttgcatgt gaagagatgt 781 caggagaagt tcggtttagt agccatttac ctcaaccaaa tagcctttgt agtttaatag 841 tagaacccat ggaaaactgg ctacagttga tgttgaattg ggaccctcag cagagaggag 901 gacctgttga ccttactttg aagcagccaa gatgttttgt attaatggat cacattttga 961 atttgaagat agtacacatc ctaaatatga cttctgcaaa gataatttct tttctgttac 1021 cacctgatga aagtcttcat tcactacagt ctcgtattga gcgtgaaact ggaataaata 1081 ctggttctca agaacttctt tcagagacag gaatttctct ggatcctcgg aaaccagcct 1141 ctcaatgtgt tctagatgga gttagaggct gtgatagcta tatggtttat ttgtttgata 1201 aaagtaaaac tgtatatgaa gggccatttg cttccagaag tttatctgat tgtgtaaatt 1261 atattgtaca ggacagcaaa atacagcttc caattataca gctgcgtaaa gtgtgggctg 1321 aagcagtgca ctatgtgtct ggactaaaag aagactatag caggctcttt cagggacaaa 1381 gggcagcaat gttaagtctt cttagatata atgctaactt aacaaaaatg aagaacactt 1441 tgatctcagc atcacaacaa ctgaaagcta aattggagtt ttttcacaaa agcattcagc 1501 ttgacttgga gagatacagc gagcagatga cgtatgggat atcttcagaa aaaatgctaa 1561 aagcatggaa agaaatggaa gaaaaggcca tccactatgc tgaggttggt gtcattggat 1621 acctggagga tcagattatg tctttgcatg ctgaaatcat ggggctacag aagagcccct 1681 atggaagacg tcagggagac ttgatggaat ctctggaaca gcgtgccatt gatctatata 1741 agcagttaaa acacagacct tcagatcact cctacagtga cagcacagag atggtgaaaa 1801 tcattgtgca cactgtgcag agtcaggacc gtgtgctcaa ggagctgttt ggtcatttga 1861 gcaagttgtt gggctgtaag cagaagatta ttgatctact ccctaaggtg gaagtggccc 1921 tcagtaatat caaagaagct gacaatactg tcatgttcat gcagggaaaa aggcagaaag 1981 aaatatggca tctccttaaa attgcctgta cacagagttc tgcccgctct cttgtaggat 2041 ccagtctaga aggtgcagta acccctcaga catcagcatg gctgcccccg acttcagcag 2101 aacatgatca ttctctgtca tgtgtggtaa ctcctcaaga tggggagact tcagcacaaa 2161 tgatagaaga aaatttgaac tgccttggcc atttaagcac tattattcat gaggcaaatg 2221 aggaacaggg caatagtatg atgaatcttg attggagttg gttaacagaa tga // LOCUS AF009242 4490 bp mRNA PRI 21-AUG-1997 DEFINITION Homo sapiens proline-rich Gla protein 1 (PRGP1) mRNA, complete cds. ACCESSION AF009242 NID g2338289 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4490) AUTHORS Kulman,J.D., Harris,J.E., Haldeman,B.A. and Davie,E.W. TITLE Primary Structure and Tissue Distribution of Two Novel Proline-Rich gamma-Carboxyglutamic Acid Proteins JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1997) In press REFERENCE 2 (bases 1 to 4490) AUTHORS Kulman,J.D., Harris,J.E., Haldeman,B.A. and Davie,E.W. TITLE Direct Submission JOURNAL Submitted (18-JUN-1997) Biochemistry, University of Washington, Seattle, WA 98195, USA FEATURES Location/Qualifiers source 1..4490 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..4490 /gene="PRGP1" CDS 166..822 /gene="PRGP1" /note="potential vitamin K-dependent transmembrane protein" /codon_start=1 /product="proline-rich Gla protein 1" /db_xref="PID:g2338290" /translation="MGRVFLTGEKANSILKRYPRANGFFEEIRQGNIERECKEEFCTF EEAREAFENNEKTKEFWSTYTKAQQGESNRGSDWFQFYLTFPLIFGLFIILLVIFLIW RCFLRNKTRRQTVTEGHIPFPQHLNIITPPPPPDEVFDSSGLSPGFLGYVVGRSDSVS TRLSNCDPPPTYEEATGQVNLQRSETEPHLDPPPEYEDIVNSNSASAIPMVPVVTTIK " BASE COUNT 1331 a 802 c 847 g 1510 t ORIGIN 1 cacgcgtccg cgagtagcgg agcgggaccc gctgatgctc accccattta gagaaagaaa 61 tccacctccc caaccccaat caagaacctg ctattgtata tcatcataga gccagattac 121 ctagggaatc atcatccagg gacgtgccag aaaccacaag aaaacatggg gagggttttc 181 ctcacgggag aaaaagccaa ttccatatta aaacgctacc caagagctaa tgggtttttt 241 gaagaaataa gacagggcaa cattgagcgt gagtgcaaag aagaattctg tacatttgaa 301 gaagcaagag aagcttttga aaataatgaa aaaactaagg agttttggag cacctacaca 361 aaagcgcaac aaggggagag taaccgagga agtgactggt ttcagtttta ccttaccttt 421 ccgttaatct ttggcctctt cattatcctc cttgtcattt tcctaatctg gagatgcttc 481 ctaagaaaca aaactcgtag acagacagtg actgaaggcc acattccttt ccctcagcac 541 cttaatatta tcaccccacc ccccccacca gatgaagtgt ttgacagcag tggattgtct 601 ccaggctttc tgggatatgt agttgggcgc tcagattccg tctctactcg cctgtccaat 661 tgtgatcccc cgccaaccta tgaggaagcc actggccaag tgaacctgca gaggagtgaa 721 acagaacctc atttagaccc acccccagag tatgaggaca tagtcaactc caactcagcc 781 agtgccattc ctatggtgcc tgtggtcacc accatcaaat gaagctgcaa acttcttttt 841 actctaatca tttttaaaat actaatggaa gaactttcta gcactttacc actacataaa 901 tgttcattga cttattttat tggactctta ccgcatacca cttcacactt gttttatttt 961 ctttagtttt gtttcttgtt atagaatcat tatccatgct catttttgct aggggaaata 1021 tatgaagagg gaaaacatac taatgggggt ctttctgtga tgtgatgaga catacatgta 1081 agtgtatata tgtgtgtata ggcatatata cgtgtgtatg catcaacaca gtatatgtaa 1141 aactgtctta aaaatccatt aacttctacc taaatcacct ggaaggagag cattactcac 1201 caaaattgca aaacaagggt atcaagaatt tgtgtaatag ccagtgacat gctgtagatt 1261 tttgcaaact ggatgtactt agcatgtttt ctaattctga ctggcttttg ttaacttgat 1321 aattcttcat ctaccttaaa aagaaaaaaa ttacacatag tcattcttga tgttataaat 1381 agagaaaaag tgtgtgtgag caataatgca taagctactg ataacttgct tacagcagat 1441 agcaataagg tatttggtgg cattcggctt gttttgtaat agggattttt tttttggttg 1501 accactcccc cacacttcca aaattaaaca gtgttttctt agcatcttga atatctcctg 1561 cggtgtatat taacatcttg atgagacaga tttccaggca acaaaataat ttctaaaatg 1621 gatatatgtg tggattaatg acaggcagta aatacccatt actcctttac tcatagctgg 1681 taaaattatt cccactgttt tattgccttt tactgtacgt tctacactct gtcctactcc 1741 cacagaattt tcaagccctt aagagtttag ttaaaataaa atttttgaaa ttattgtctt 1801 aatattttta tataggctga tgtctttgcc tcaagattgt taggaggtaa ttttccattg 1861 aattatcaac tgtgattttt atattgccct ccaagtggta gaagaagatt gcaaagtcca 1921 tgttatgcta ggtgcacaat aaatctagta atagccccac acagatctca tcattgttgc 1981 tacttccttt tgtattttca tcaggtattt ttttaactgt agggttttta cttttttctt 2041 ggagcagaga gaacaggctg taaatgggtt gccaacataa gctggctgag aaataaaaga 2101 aaacaagaca gttgttcata aagtttcatt ttgtatgcac tgatggcaaa ttcattaggt 2161 cagttaaggg aaatatttgt accacttcca aacttttcag cgttggataa aatgattgat 2221 gaggcaggca gaaggaatgt aggtttcagg tgtgtcattt cctgctgctt ccagctccat 2281 ccctacagac tcctccccga gtcctgccct ggaaccaaag gaaggaggaa cactgagggg 2341 aatcctgaag taggagtcag atgacctgaa ctcagatctc cctctatcac atgcttgcca 2401 ccactgtacc ttgagcaaga atcatatctg accctcaacc tcctcaactc taaaatgggg 2461 ataacatcat ttgtcctgca cttctctaag ggctttacaa ggatcaaata agatggtgtg 2521 tatgtaagaa atttgtaaaa tgtgaagagc tatctacact aagttcgtaa tgttattatt 2581 attgtgcttc atggagaatt ttcccctctg ttttcctaaa ttgtatgaga gctttcacac 2641 agtgagaaat agagcaggct gccccataaa tgggtaacat attcctaatc tgagtgtgtg 2701 ggctgttaga gaacccctgc catgctctgg tctgttctga aactgtgcca actgaaagat 2761 gatagtccac acagcacaaa caggtttaag caaatgatag aaagggaagt aaggcgtgtg 2821 tgctagttaa taggtttagt agcttttatg gactaaaaat gattgattgt atcttgaccc 2881 tggtctcaga aatgacattt ttacttttgc catgagtaca catcagatat ctttggcttc 2941 tatttaaagc taaaggtaga agtgtttgat ccagtgaact gtgtatgtat gtgtggggtt 3001 tttttcttta tttttaaatg aaaattaaga caccttttgt gtggacatgt ttttgtcttt 3061 aatgtcaggc tttagattag accagcagtt ttcaaagtat ggccaatgga cccctgggtt 3121 ccttgagagc ctttcagggg ggactatgag atcaaaatat ttttattata atgtgaagac 3181 attgtctttt cactctatct caaaaaaagt gttgcaaatg taaaacattg ccatcttctc 3241 acaaatctct ttttttgttt ttgaaaatat ggccattttt cataaaatgt tatttgtgtt 3301 agcatgtaat gggtttacta tgtttaaata agttaatact ttaaaaattt tcaggttttt 3361 ttagtatggt gaatattgat agatataaac ctcatgaaca aaagtacttt ggcatccaga 3421 ttctcaataa atgttaagag cgtaagtgta agggggtcca gagaccaaaa gttttagagc 3481 tacaggatta gacataggag caggatattc tgttagtgtg atttcttgca actttatttt 3541 atattttaaa ctgctgatat tggatataat gctgcttttt agagacacct aaattgcagt 3601 atcagaatga atgttgatgt ttgaagccaa aaagccaaat gcttaaactg atcaatgact 3661 gtagcttttt agactgttgg tcaaagaaca ttctacttca cagtaatagc tctatcagcc 3721 acagatctca tggtggctgt tgcatgataa tgataggata aacaaaatac cactgtcttc 3781 aagaaacatt atcttaggtt tgtttgtttg gtttgagttt gatttggctt ttatattttt 3841 taaaatccct tttgctaccc catctggttt tataaactga gtttcttagc attcgttaaa 3901 attaaggggt ttgtttggaa taatatatat tttttatgct tttgtctttc ttacctgatt 3961 gatattacat tcacctttga ttgtttttta aaagtttatt tttacagaat atatttagta 4021 cctttcttaa ggagtaactg aattgaatca accagtttgc atttaaataa aagaacaggc 4081 tcagtggtct tcctgtagaa tggtttacat gcctgcatgt gcagtagttg tgtctggaat 4141 cctagaattg gcactttctg cctccttgct ctaaatgtca caaaaaatta tacttcctta 4201 aagtaaatgt aatgatttct tcttttccta ttgaccagta cagatagata tgttgtgttt 4261 gcttcatttt taatgatgac ttcaagattg atgatgtgat ccaataactg tggaggtagc 4321 tttaacttgg ttctgtgtaa atagtatgta ttttattata atatttctca ttttaagatg 4381 cttggtttac attaaattat ggtatttaac tatttttatg tttatactag gtagggtctt 4441 tcttatgttt ctgtgttttt ggtatgctaa ataaagctat ttttaaaccc // LOCUS AF009243 1167 bp mRNA PRI 21-AUG-1997 DEFINITION Homo sapiens proline-rich Gla protein 2 (PRGP2) mRNA, complete cds. ACCESSION AF009243 NID g2338291 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1167) AUTHORS Kulman,J.D., Harris,J.E., Haldeman,B.A. and Davie,E.W. TITLE Primary Structure and Tissue Distribution of Two Novel Proline-Rich gamma-Carboxyglutamic Acid Proteins JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1997) In press REFERENCE 2 (bases 1 to 1167) AUTHORS Kulman,J.D., Harris,J.E., Haldeman,B.A. and Davie,E.W. TITLE Direct Submission JOURNAL Submitted (18-JUN-1997) Biochemistry, University of Washington, Seattle, WA 98195, USA FEATURES Location/Qualifiers source 1..1167 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..1167 /gene="PRGP2" CDS 10..618 /gene="PRGP2" /note="potential vitamin K-dependent transmembrane protein" /codon_start=1 /product="proline-rich Gla protein 2" /db_xref="PID:g2338292" /translation="MRGHPSLLLLYMALTTCLDTSPSEETDQEVFLGPPEAQSFLSSH TRIPRANHWDLELLTPGNLERECLEERCSWEEAREYFEDNTLTERFWESYIYNGKGGR GRVDVASLAVGLTGGILLIVLAGLGAFWYLRWRQHRGQQPCPQEAGLISPLSPLNPLG PPTPLPPPPPPPPGLPTYEQALAASGVHDAPPPPYTSLRRPH" BASE COUNT 227 a 379 c 348 g 213 t ORIGIN 1 ctggaaaata tgaggggcca cccctctctg ctgctgctat atatggcatt aaccacctgc 61 ctggatactt cacccagtga ggagacagac caagaagtct tcctgggtcc cccagaggcc 121 cagagcttcc tgagtagcca tacccggatt ccaagagcca accactggga cctggagctg 181 ctcacaccag ggaacctgga acgggagtgt ctggaagaga ggtgttcctg ggaagaggcc 241 agggagtatt ttgaggacaa cactctcacg gagcgctttt gggagagcta catctacaat 301 ggcaaaggag ggcgtggacg agtggatgtg gccagcctgg ctgtggggct gacaggtggc 361 atcctgctca ttgtcctggc cggcctggga gccttttggt atctgcgctg gcgacagcac 421 cgaggccagc agccctgtcc ccaagaggcc gggctcatta gccctctgag tcctttgaac 481 cctctgggcc caccgacgcc cctgcctcca cccccacccc cacccccagg cctccccacc 541 tatgagcagg cgctggcagc ctctggggta cacgacgcac ctccaccccc ctacaccagc 601 ctcaggaggc ctcactgaag agctgctttc gagacccggc tctccgaacc gtgcccctga 661 ttcataccgg attccggaag ccgctaggcc tcatagacgc cgaagctgga cttggagtgg 721 ggaatggtgg gagtaggggt catccggccc gaggctgccc tggcacacgc gtttccgccg 781 cgtatggata tacacatgtt ttcggcaacg tgttcccgtg tcctggcccc tcacgggccc 841 ccacactctc ctgaccgtga gggcactggt cagttccgcc cccgtggtag gcagacgcgc 901 ggggaaattc ggacccagga gcccagcccc ggctgtgcca tcttgtgtat gggcagatat 961 gacctgacag ccccctccag tgccacaggg tacgcacacg cagagccccg cctgtgcaca 1021 cgcgtgtctt cgtgcactcc ccgtgcggta caggggcact tcgtaaccca gggaaagggc 1081 ggggggcata tttgcaagcg cgctcggtgc gggcaggctc gcattgcacc cagggagctg 1141 gagttgagct gttcccctaa ataaaaa // LOCUS AF009301 3323 bp mRNA PRI 18-AUG-1997 DEFINITION Homo sapiens TEB4 protein mRNA, complete cds. ACCESSION AF009301 NID g2331103 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3323) AUTHORS Simmons,A.D. and Lovett,M.L. TITLE High resolution physical and transcription maps of the Cri-du-chat critical region JOURNAL Unpublished REFERENCE 2 (bases 1 to 3323) AUTHORS Simmons,A.D. and Lovett,M.L. TITLE Direct Submission JOURNAL Submitted (19-JUN-1997) Department of Otorhinolaryngology, Molecular Biology and Oncology, and The McDermott Center, University of Texas Southwestern Medical Center, 5323 Harry Hines Boulevard, Dallas, TX 75235-8591, USA FEATURES Location/Qualifiers source 1..3323 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="5" /map="5p15.2" /clone="TEB4" /note="identified within the Cri-du-chat critical region" CDS 140..1612 /codon_start=1 /product="TEB4 protein" /db_xref="PID:g2331104" /translation="MFLHWLVGMVYVFYFASFILLLREVLRPGVLWFLRNLNDPDFNP VQEMIHLPIYRHLRRFILSVIVFGSIVLLMLWLPIRIIKSVLPNFLPYNVMLYSDAPV SELSLELLLLQVVLPALLEQGHTKQWLKGLVRAWTVTAGYLLDLHSYLLGDQEENENS ANQQVNNNQHARNNNAIPVVGEGLHAAHQAILQQGGPVGFQLYRRPLNFPLRIFLLIV FMCITLLIASLICLTLPVFAGRWLMSFWTGTAKIHELYTAACGLYVCWLTIRAVTVMV AWMPQGRRVIFQKVKEWSLMIMKTLIVAVLLAGVVPLLLGLLFELVIVAPLRVPLDQT PLFYPWQDWALGVLHAKIIAAITLMGPQWWLKTVIEQVYANGIRNIDLHYIVRKLAAP VISVLLLSLCVPYVIASGVVPLLGVTAEMQNLVHRRIYPFLLMVVVLMAILSFQVRQF KRLYEHIKNDKYLVGQRLVNYERKSGKQGSSPPPPQSSQE" BASE COUNT 864 a 670 c 725 g 1064 t ORIGIN 1 tagtggtggg taagaaaatt ggaagtattc cctcctcatt tggtgggttg gtggctggga 61 atatctgttc ccttggaaat gtttgatgct actctgaaag atcgagaact gagctttcag 121 tcggctccaa ggtactacca tgtttctgca ttggctagtg ggaatggtat atgtcttcta 181 ctttgcctcc ttcattctac tactgagaga ggtacttcga cctggtgtcc tgtggtttct 241 aaggaatttg aatgatccag atttcaatcc agtacaggaa atgatccatt tgccaatata 301 taggcatctc cgaagattta ttttgtcagt gattgtcttt ggctccattg tcctcctgat 361 gctttggctt cctatacgta taattaagag tgtgctgcct aattttcttc catacaatgt 421 catgctctac agtgatgctc cagtgagtga actgtccctc gagctgcttc tgcttcaggt 481 tgtcttgcca gcattactcg aacagggaca cacgaagcag tggctgaagg ggctggtgcg 541 agcgtggact gtgaccgccg gatacttgct ggatcttcat tcttatttat tgggagacca 601 ggaagaaaat gaaaacagtg caaatcaaca agttaacaat aatcagcatg ctcgaaataa 661 caacgctatt cctgtggtgg gagaaggcct tcatgcagcc caccaagcca tactccagca 721 gggagggcct gttggttttc agctttaccg ccgaccttta aattttccac tcaggatatt 781 tctgttgatt gtcttcatgt gtataacatt actgattgcc agcctcatct gccttacttt 841 accagtattt gctggccgtt ggttaatgtc gttttggacg gggactgcca aaatccatga 901 gctctacaca gctgcttgtg gtctctatgt ttgctggcta accataaggg ctgtgacggt 961 gatggtggca tggatgcctc agggacgcag agtgatcttc cagaaggtta aagagtggtc 1021 tctcatgatc atgaagactt tgatagttgc ggtgctgttg gctggagttg tccctctcct 1081 tctggggctc ctgtttgagc tggtcattgt ggctcccctg agggttccct tggatcagac 1141 tcctcttttt tatccatggc aggactgggc acttggagtc ctgcatgcca aaatcattgc 1201 agctataaca ttgatgggtc ctcagtggtg gttgaaaact gtaattgaac aggtttacgc 1261 aaatggcatc cggaacattg accttcacta tattgttcgt aaactggcag ctcccgtgat 1321 ctctgtgctg ttgctttccc tgtgtgtacc ttatgtcata gcttctggtg ttgttccttt 1381 actaggtgtt actgcggaaa tgcaaaactt agtccatcgg cggatttatc catttttact 1441 gatggtcgtg gtattgatgg caattttgtc cttccaagtc cgccagttta agcgccttta 1501 tgaacatatt aaaaatgaca agtaccttgt gggtcaacga ctcgtgaact acgaacggaa 1561 atctggcaaa caaggctcat ctccaccacc tccacagtca tcccaagaat aaagtagttg 1621 tctcaacaac ttgaccttcc cctttacatg tccttttttg tggacttctc tctttggaga 1681 tttttcccag tgatctctca gcgttgtttt taagttaaat gtatttgact tgtgttctca 1741 gcattcagag agcagcggtg taagattctg ctgttctccc tggatcttct gacattactg 1801 ctgtctgaga tttgtatatg tgtaaataca agttccttga taccctaaaa ccttggatta 1861 aacagaatgt gcattgtaca tctttaaaca aaatgtatat taatttatta aatctagttg 1921 tcactttatt ttggacctgc tgtgatctcg acaggaaacg tgccacagag cagtagtgcg 1981 caggcaagac ttttcagtga cgccttgtgg aacgcagttc atgatgtcct agcagctctc 2041 actaagggaa ctgtacattc tttctttctt ggctattcag accttaccaa gaacgttaaa 2101 ggaaacaagt agaaatcagc agtggagtgt ctgtggtaag aaaacatgaa ctttatgctt 2161 cactgttagt tgtttgtgga agttattttg tataacacca aagctgttgt acatttccta 2221 ctgcctgatt tttttcatgt gtctgtgttt gtaatattgt atagtatctt gtgctaggtg 2281 aggaaattat tttttaattt tgataattta atattcctag tgtgatcagc attgggagtt 2341 gggtttcagt ggggcatgtc tatacttaga gaaaaaaagt cccaatgaag attttcatga 2401 gtcagccccc ccgcccgccc ccaccccaca cccacatcct ctcttttcca cacacaacta 2461 tctgtttatt ttttgtagca gtggccgaaa gtcctgcaag gtcataaatc tttcagagtg 2521 acatcaccaa ctgtactgca tcttactgga tttaggactt ctgagatgct tgtgaagtat 2581 agatgtggtt gtggtcttag attgacagca ttagagaaga ctggttagaa catctggtct 2641 cgctggttag tgcctcgttg gctgaggact aggtgtgcat ttctcctagc ttttcatcag 2701 gaaatcccaa agtttccaaa gctttttgtt tacagaataa aacttcaaat aaaaccaatt 2761 cattatttgt ccagaaggaa gcttggctga gctggccttt taacatagga atgtatttcg 2821 ttggaaacat tctgaaaaat ctcagagaac tgaaccctta caaactttgt tttccctcat 2881 aaccaaagct tcaggttaga agtttagaaa aatagaatgg ttgggtacat gatctaaatg 2941 tttaatgcta aaggtatatc gtaagggtag tgtttgtttt tgaacgataa tttagaagtt 3001 ctcatagaaa gcgtataaca taggtcttca gaaactataa aagaattttc atatagtatt 3061 aaaatccata gactaaaatc tgagaatttt ttaacatatg caagtcagcc aaacataagc 3121 taccaaaata aagagcaatg tgttctggct gttttatact tcaacaattt tttccctaag 3181 tggtaagcaa ttactttaaa acatattttt aaaaacatcg gtatcgggag ctgcggtggc 3241 tccggccggt tgtcctggca cacaaggagg cgaggctatg cgttcgaggc caacctaggc 3301 aaaattggaa aaaaaaaaaa aaa // LOCUS AF009353 3039 bp mRNA PRI 22-JUL-1997 DEFINITION Homo sapiens transcription intermediary factor 1 (TIF1) mRNA, complete cds. ACCESSION AF009353 NID g2267584 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3039) AUTHORS Thenot,S., Henriquet,C., Rochefort,H. and Cavailles,V. TITLE Differential interaction of nuclear receptors with the putative human transcriptional coactivator hTIF1 JOURNAL J. Biol. Chem. 272 (18), 12062-12068 (1997) MEDLINE 97277352 REFERENCE 2 (bases 1 to 3039) AUTHORS Thenot,S., Henriquet,C., Rochefort,H. and Cavailles,V. TITLE Direct Submission JOURNAL Submitted (19-JUN-1997) INSERM U148, 60 rue de Navacelles, Montpellier 34090, France FEATURES Location/Qualifiers source 1..3039 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="estrogen receptor positive breast cancer cell" /cell_line="ZR75-1" gene 1..3039 /gene="TIF1" CDS 1..3039 /gene="TIF1" /note="interacts with nuclear receptors which contain hormone-dependent activating function (AF2); involved in transcriptional regulation" /codon_start=1 /product="transcription intermediary factor 1" /db_xref="PID:g2267585" /translation="MEVAVEKAVAAAARLGCAPGGPRGGGENEAESRQGPDSERGGEA ARLNLLDTCAVCHQNIQSRAPKLLPCLHSFCQRCLPAPQRYLMLPAPMLGSAETPPPV PAPARRSASSPFATQVGVIRCPVCSQECAERHIIDNFFVKDTTEVPSSTVEKSNQVCT SCEDNAEANGFCVECVEWLCKTCIRAHQRVKFTKDHTVRQKEEVSPEAVGVTSQRPVF CPFHKKEQLKLYCETCDKLTCRDCQLLEHKEHRYQFIEEAFQNQKVIIDTLITKLMEK TKYIKFTGNQIQNRIIEVNQNQKQVEQDIKVAIFTLMVEINKKGKALLHQLESLAKDH RMKLMQQQQEVTGLSKQLEHVMHFSKWAVSSGSSTALLYSKRLITYRLRHLLRARCDA SPVTNNTIQFHCDPSFWAQNIINLGSLVIEDKESQPQMPKQNPVVEQNSQPPSGLSSN QLSKFPTQISLAQLRLQHMQQQQPPPRLINFQNHSPKPNGPVLPPHPQQLRYPPNQNI PRQAIKPNPLQMAFLAQQAIKQWQISSGQGTPSTTNSTSSTPSSPTITSAAGYNGKAF GSPIIDLSSPVGGSYNLPSLPDIDCSSTIMLDNIVRKDTNIDHGQPRPPSNRTVQSPN SSVPSPGLAGPVTMTSVHPPIRSPSASSVGSRGSSGSSSKPAGADSTHKVPVVMLEPI RIKQENSGPPENYDFPVVIVKQESDEESRPQNANYPRSILTSLLLNSSQSSTSEETVL RSDAPDSTGDQPGLHQDNSSNGKSEWLDPSQKSPLHVGETRKEDDPNEDWCAVCQNGG ELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPEVEYDCDAPSHNSEKKKT EGLVKLTPIDKRKCERLLLFLYCHEMSLAFQDPVPLTVPDYYKIIKNPMDLSTIKKRL QEDYSMYSKPEDFVRDFRLIFQNCAEFNEPDSEVANAGIKLENYFEELLKNLYPEKRF PKPEFRNESEDNKFSDDSDDDFVQPRKKRLKSIEERQLLK" BASE COUNT 941 a 738 c 664 g 696 t ORIGIN 1 atggaggtgg cggtggagaa ggcggtggcg gcggcggcac ggctcggctg cgctccgggg 61 ggccctcggg gcggcgggga gaacgaggcc gagagtcggc agggcccgga ctcggagcgc 121 ggcggcgagg cggcccggct caacctgttg gacacttgcg ccgtgtgcca ccagaacatc 181 cagagccggg cgcccaagct gctgccctgc ctgcactctt tctgccagcg ctgcctgccc 241 gcgccccagc gctacctcat gctgcccgcg cccatgctgg gctcggccga gaccccgcca 301 cccgtccctg ccccggctcg ccggtcagcc tcgtcgccgt tcgccaccca agttggagtc 361 attcgttgcc cagtttgcag ccaagaatgt gcagagagac acatcataga taactttttt 421 gtgaaggaca ctactgaggt tcccagcagt acagtagaaa agtcaaatca ggtatgtaca 481 agctgtgagg acaacgcaga agccaatggg ttttgtgtag agtgtgttga atggctctgc 541 aagacgtgta tcagagctca tcagagggta aagttcacaa aagaccacac tgtcagacag 601 aaagaggaag tatctccaga ggcagttggt gtcaccagcc agcgaccagt gttttgtcct 661 tttcataaaa aggagcagct gaagctgtac tgtgagacat gtgacaaact gacatgtcga 721 gactgtcagt tgttagaaca taaagagcat agataccaat ttatagaaga agcttttcag 781 aatcagaaag tgatcataga tacactaatc accaaactga tggaaaaaac aaaatacata 841 aaattcacag gaaatcagat ccaaaacaga attattgaag taaatcaaaa tcaaaagcag 901 gtggaacagg atattaaagt tgctatattt acactgatgg tagaaataaa taaaaaagga 961 aaagctctac tgcatcagtt agagagcctt gcaaaggacc atcgcatgaa acttatgcaa 1021 caacaacagg aagtgactgg actctctaaa caattggagc atgtcatgca tttttctaaa 1081 tgggcagttt ccagtggcag cagtacagca ttactttata gcaaacgact gattacatac 1141 cggttacggc acctccttcg tgcaaggtgt gatgcatccc cagtgaccaa caacaccatc 1201 caatttcact gtgatcctag tttctgggct caaaatatca tcaacttagg ttctttagta 1261 atcgaggata aagagagcca gccacaaatg cctaagcaga atcctgtcgt ggaacagaat 1321 tcacagccac caagtggttt atcatcaaac cagttatcca agttcccaac acagatcagc 1381 ctagctcaat tacggctcca gcatatgcag caacagcaac cgcctccacg tttgataaac 1441 tttcagaatc acagccccaa acccaatgga ccagttcttc ctcctcatcc tcaacaactg 1501 agatatccac caaaccagaa cataccacga caagcaataa agccaaaccc cctacagatg 1561 gctttcttgg ctcaacaagc cataaaacag tggcagatca gcagtggaca gggaacccca 1621 tcaactacca acagcacatc ctctactcct tccagcccca cgattactag tgcagcagga 1681 tataatggaa aggcttttgg ttcacctata atcgatttga gctcaccagt gggagggtct 1741 tataatcttc cctctcttcc ggatattgac tgttcaagta ctattatgct ggacaatatt 1801 gtgaggaaag atactaatat agatcatggc cagccaagac caccctcaaa cagaacggtc 1861 cagtcaccaa attcatcagt gccatctcca ggccttgcag gacctgttac tatgactagt 1921 gtacaccccc caatacgttc acctagtgcc tccagcgttg gaagccgagg aagctctggc 1981 tcttccagca aaccagcagg agctgactct acacacaaag tcccagtggt catgctggag 2041 ccaattcgaa taaaacaaga aaacagtgga ccaccggaaa attatgattt cccagttgtt 2101 atagtgaagc aagaatcaga tgaagaatct aggcctcaaa atgccaatta tccaagaagc 2161 atactcacct ccctgctctt aaatagcagt cagagctcta cttctgagga gactgtgcta 2221 agatcagatg cccctgatag tacaggagat caacctggac ttcaccagga caattcctca 2281 aatggaaagt ctgaatggtt ggatccttcc cagaagtcac ctcttcatgt tggagagaca 2341 aggaaagagg atgaccccaa tgaggactgg tgtgcagttt gtcaaaacgg aggggaactc 2401 ctctgctgtg aaaagtgccc caaagtattc catctttctt gtcatgtgcc cacattgaca 2461 aattttccaa gtggagagtg gatttgcact ttctgccgag acttatctaa accagaagtt 2521 gaatatgatt gtgatgctcc cagtcacaac tcagaaaaaa agaaaactga aggccttgtt 2581 aagttaacac ctatagataa aaggaagtgt gagcgcctac ttttatttct ttactgccat 2641 gaaatgagcc tggcttttca agaccctgtt cctctaactg tgcctgatta ttacaaaata 2701 attaaaaatc caatggattt gtcaaccatc aagaaaagac tacaagaaga ttattccatg 2761 tactcaaaac ctgaagattt tgtacgtgat tttagattga tctttcaaaa ctgtgctgaa 2821 ttcaatgagc ctgattcaga agtagccaat gctggtataa aacttgaaaa ttattttgaa 2881 gaacttctaa agaacctcta tccagaaaaa aggtttccca aaccagaatt caggaatgaa 2941 tcagaagata ataaatttag tgatgattca gatgatgact ttgtacagcc ccggaagaaa 3001 cgcctcaaaa gcattgaaga acgccagttg cttaaataa // LOCUS AF009368 1406 bp mRNA PRI 09-SEP-1997 DEFINITION Homo sapiens Luman mRNA, complete cds. ACCESSION AF009368 NID g2367449 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1406) AUTHORS Lu,R., Yang,P., O'Hare,P. and Misra,V. TITLE Luman, a new member of the CREB/ATF family, binds to herpes simplex virus VP16-associated host cellular factor JOURNAL Mol. Cell. Biol. 17 (9), 5117-5126 (1997) MEDLINE 97415590 REFERENCE 2 (bases 1 to 1406) AUTHORS Lu,R., Yang,P., O'Hare,P. and Misra,V. TITLE Direct Submission JOURNAL Submitted (19-JUN-1997) Veterinary Microbiology, University of Saskatchewan, 52 Campus Drive, Saskatoon, SK S7N 5B4, Canada FEATURES Location/Qualifiers source 1..1406 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa S3" CDS 14..1129 /function="transcription factor; binds to cAMP response element CRE and weakly binds to CCAAT/enhancer element and activates transcription" /note="basic leucine zipper (BZIP) protein; binds to herpes simplex virus VP16-associated host cellular factor (HCF); member of CREB/ATF protein family; mouse LZIP homolog" /codon_start=1 /product="Luman" /db_xref="PID:g2367450" /translation="MELELDAGDQDLLAFLLEESGDLGTAPDEAVRAPLDWALPLSEV PSDWEVDDLLCSLLSPPASLNILSSSNPCLVHHDHTYSLPRETVSMDLESESCRKEGT QMTPQHMEELAEQEIARLVLTDEEKSLLEKEGLILPETLPLTKTEEQILKRVRRKIRN KRSAQESRRKKKVYVGGLESRVLKYTAQNMELQNKVQLLEEQNLSLLDQLRKLQAMVI EISNKTSSSSTCILVLLVSFCLLLVPAMYSSDTRGSLPAEHGVLSRQLRALPSEDPYQ LELPALQSEVPKDSTHQWLDGSDCVLQAPGNTSCLLHYMPQAPSAEPPLEWPFPDLSS EPLCRGPILPLQANLTRKGGWLPTGSPSVILQDRYSG" BASE COUNT 344 a 371 c 383 g 308 t ORIGIN 1 gtagttgtcc caaatggagc tggaattgga tgctggtgac caagacctgc tggccttcct 61 gctagaggaa agtggagatt tggggacggc acccgatgag gccgtgaggg ccccactgga 121 ctgggcgctg ccgctttctg aggtaccgag cgactgggaa gtagatgatt tgctgtgctc 181 cctgctgagt cccccagcgt cgttgaacat tctcagctcc tccaacccct gccttgtcca 241 ccatgaccac acctactccc tcccacggga aactgtctct atggatctag agagtgagag 301 ctgtagaaaa gaggggaccc agatgactcc acagcatatg gaggagctgg cagagcagga 361 gattgctagg ctagtactga cagatgagga gaagagtcta ttggagaagg aggggcttat 421 tctgcctgag acacttcctc tcactaagac agaggaacaa attctgaaac gtgtgcggag 481 gaagattcga aataaaagat ctgctcaaga gagccgcagg aaaaagaagg tgtatgttgg 541 gggtttagag agcagggtct tgaaatacac agcccagaat atggagcttc agaacaaagt 601 acagcttctg gaggaacaga atttgtccct tctagatcaa ctgaggaaac tccaggccat 661 ggtgattgag atatcaaaca aaaccagcag cagcagcacc tgtatcttgg tcctactagt 721 ctccttctgc ctcctccttg tacctgctat gtactcctct gacacaaggg ggagcctgcc 781 agctgagcat ggagtgttgt cccgccagct tcgtgccctc cccagtgagg acccttacca 841 gctggagctg cctgccctgc agtcagaagt gccgaaagac agcacacacc agtggttgga 901 cggctcagac tgtgtactcc aggcccctgg caacacttcc tgcctgctgc attacatgcc 961 tcaggctccc agtgcagagc ctcccctgga gtggccattc cctgacctct cttcagagcc 1021 tctctgccga ggtcccatcc tccccctgca ggcaaatctc acaaggaagg gaggatggct 1081 tcctactggt agcccctctg tcattttgca ggacagatac tcaggctaga tatgaggata 1141 tgtggggggt ctcagcagga gcctgggggg ctccccatct gtgtccaaat aaaaagcggt 1201 gggcaagggc tggccgcagc tcctgtgccc tgtcaggacg actgagggct caaacacacc 1261 acacttaatg gctttctggg tcttttattt gtacccatgt gtctgtcaca ccatgaatgt 1321 acctggggaa atcaactgac ctccctgaac atttcacgca gtcagggaca ggtgaggaaa 1381 gaaataaata agtgattcta atgctg // LOCUS AF009424 8494 bp mRNA PRI 23-JUL-1997 DEFINITION Homo sapiens clone 22 mRNA, alternative splice variant alpha-1, complete cds. ACCESSION AF009424 NID g2271468 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8494) AUTHORS Yoshikawa,T., Sanders,A.R., Esterling,L.E., Overhauser,J., Garnes,J.A., Lennon,G., Grewal,R. and Detera-Wadleigh,S.D. TITLE Isolation of chromosome 18-specific brain transcripts as positional candidates for bipolar disorder JOURNAL Am. J. Med. Genet. 74 (2), 140-149 (1997) MEDLINE 97275951 REFERENCE 2 (bases 1 to 8494) AUTHORS Yoshikawa,T. and Detera-Wadleigh,S.D. TITLE Multiple Transcriptional Variants and RNA Editing in Clone 22, a Positional Candidate Gene for Bipolar Disorder on 18p11.2 JOURNAL Unpublished REFERENCE 3 (bases 1 to 8494) AUTHORS Yoshikawa,T. and Detera-Wadleigh,S.D. TITLE Direct Submission JOURNAL Submitted (20-JUN-1997) Clinical Neurogenetics Branch, National Institute of Mental Health, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..8494 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="18" /map="18p11.2" CDS 470..1390 /note="alternatively spliced; alpha-1 form; possible membrane-spanning protein" /codon_start=1 /product="clone 22" /db_xref="PID:g2271469" /translation="MPEAGFQATNAFTECKFTCTSGKCLYLGSLVCNQQNDCGDNSDE ENCLLVTEHPPPGIFNSELEFAQIIIIVVVVTVMVVVIVCLLNHYKVSTRSFINRPNQ SRRREDGLPQEGCLWPSDSAAPRLGASEIMHAPRSRDRFTAPSFIQRDRFSRFQPTYP YVQHEIDLPPTISLSDGEEPPPYQGPCTLQLRDPEQQMELNRESVRAPPNRTIFDSDL IDIAMYSGGPCPPSSNSGISASTCSSNGRMEGPPPTYSEVMGHHPGASFLHHQRSNAH RGSRLQFQQNNAESTIVPIKGKDRKPGNLV" BASE COUNT 2285 a 1900 c 1907 g 2400 t 2 others ORIGIN 1 agcggttcaa gctctacgtt cgtgacatca aacctcctgt tgggccattt ccgagaactc 61 ccatcagttt ctgtatagtg taaaagtttc agaggcggac acgagagagc tgcggctggg 121 acaaggagca cccgcgtgca ggtgcgaccc tgcaggatgc tggcagcggc gtggccaggg 181 gcgcccgtgt tctgagggcc tgagggccag cccctccacc cgcgccatgg cctccgcgcc 241 ctggctggac ggctgacggg agcagggacc gccgccgccc aggtgccaca cccaggtacc 301 gcccgcccgc tgcgagagcc gggcaggtgg gccgcggatg ctcccagagg ccggcccagc 361 agagcgatgg acttggacag gctaagatgg aagtgacctg agcctcgccc ggcggcttcc 421 tcgacgggac agcgcaagag ttggagcaca ggcttgtccg gggagcagta tgccggaagc 481 tggttttcag gccacaaatg ctttcacaga gtgcaaattc acctgcacca gtggtaaatg 541 cttgtatctt ggttcgctgg tctgtaacca acagaacgac tgtggggaca acagtgacga 601 agagaactgt ctcctggtga ccgagcaccc gcctccgggc atcttcaact cggagctgga 661 gttcgcccaa atcatcatca tcgtcgtggt ggtcacggtg atggtggtgg tcatcgtctg 721 cctgctgaac cactacaaag tctccacgcg gtccttcatc aaccgcccga accagagccg 781 gaggcgggag gacgggctgc cgcaggaagg gtgcctgtgg ccttcagaca gcgccgcacc 841 gcggctgggc gcctcggaga tcatgcatgc cccgcggtcc agggacaggt tcacagcgcc 901 gtccttcatc cagagggatc gcttcagccg cttccagccc acctacccct atgtgcagca 961 cgagattgat cttcctccca ccatctccct gtccgacggt gaagagccac ctccttacca 1021 ggggccctgc accctgcagc tccgggaccc tgaacagcag atggaactca accgagagtc 1081 cgtgagggcc ccacccaacc gaaccatatt tgacagtgat ttaatagaca ttgctatgta 1141 tagcgggggt ccatgcccac ccagcagcaa ctcgggcatc agtgcaagca cctgcagcag 1201 taacgggagg atggaggggc caccccccac atacagcgag gtgatgggcc accacccagg 1261 cgcctctttc ctccatcacc agcgcagcaa cgcacacagg ggcagcagac tgcagtttca 1321 gcagaacaat gcagagagca caatagtacc catcaaaggc aaagatagga agcctgggaa 1381 cctggtctga ttccttccaa cgtgcacttc agctggagaa agaaaccaag aagggaagcg 1441 gccgctgggc ccctcctgcg cacagtgttg ttcagtttca catggtacaa ataagtaaaa 1501 ccaaatgagc aaacacggtc tttgtttctg attcctttta ggggaattgc atgcaaacta 1561 gactgaaatg atacaaactt ccatctggtc tgaccgcaaa cagtgtttat ttggggacag 1621 gggttgggat gggggtgtgg gcaggggaaa acagagaacg ggatgctttg aagataccat 1681 gaaataaaac ccacagaggt atttgatgta tttaattgtg aaaggagact ttgcagataa 1741 atgaggccag aatggcatgt tttataatta actgaataaa gaaggaagca ttattatata 1801 ttattgtggg gaagaaccag ccagttcgct ttttctccta aggtgtggac ttttattttg 1861 ttttaaaaat atgaatcaaa attcctgtgt tgtgtgccaa ggtataaagt ggagaagtta 1921 gatgagtgca aggagctcct ttgtgttgtg atgatgtgtt ttaaaagttg cactatctta 1981 atgttgaaaa tatttacaag ggaactgttt tacgtgaagt tctgtatgtt gtcttttcac 2041 ctgtggattg taatcaggcc caaggaatat cctggagtgg tccccagaag catccaagaa 2101 aagatatttg gggacgtagc ctaacatttt accaacttac gtaaatcaaa aaagtcatta 2161 ttgttgcagg agtttgcatc aaatagcagt gcatcgctga agcttttgga gacttttgga 2221 tggaagataa gatagggaag attaagttcc agcatttctg acttgttatt ttgagttact 2281 ctgctactct taggctgcat agtttatgag aaaatgaaca catgcattta tggatccagt 2341 atcatgcagt gctgccctca tcctccagca gtgcaatttc ttcagtaatt tagatttttt 2401 tcactatagc atgaaatata ttcaaataca taccttattt tatgcaataa attgtttaaa 2461 atgcaaggtg gttattctgc atactgttga aatatgtgac tcctcagtat attcccattg 2521 cctctccccc tttcctcgac agcttagttc agttctgcag ggctgctcag ttcacaggag 2581 gctcccagca gccaccccac atccagccta cacagaactt tcgtgtggga gtggtgtggg 2641 tggtggtttt cttatgcttt ggaagcccct agaaataatg acggaagaat gccatgttgc 2701 tgatcgtggt aataagccat tgtgggttat tgtatgtcac tagtattagc atagcattct 2761 taaaggaatg cagtgttcaa aacctaccca aattccccgc aggattttac caaacccttc 2821 cccaggccag ttttgtactg aaggcaagaa ctggacagtc agagaacagt ggagggggca 2881 agtgactgaa gagcaccggg taaaaagcac aacatgcagt taaaatgcaa actagaaaac 2941 taattttaaa tattgttagt tttaatattt cctgatattt acaaatattc attcttatat 3001 acaatgaaaa aaataacttt cttctgcaga tgtaagcact ggcttttata agagcagcag 3061 ccaacacgtt tagcagacac tgcgcgtgga gaagggctta tctgcagtac actctgccat 3121 gtggagggtg ggcctctgtg gcctcttcac ataacaagat gagctggaat gatgattcca 3181 tgactcccac ctatgcagcc ttaaagccaa atccgcgtgt gtgtgtttgt gtctgtctgt 3241 gggtctcgaa ggtgatccgt cggtgcggtg gctctgtgct gtaactggag agactgttcc 3301 aaaccccaag agttgtctga tcctagtctg ttcccttctg cttcttacct ctgtagatag 3361 gtcactggtt tttgtttgtt tgttttgagg attggaattt ccattacatt catcctttgc 3421 acacagtaac atccacagaa ctagtccaac tcttaaaagg agagaggaaa aacacaggca 3481 ccagttgtca gctcatgctt acaacctgtg tggaagtata tacagttgag agtcacagtg 3541 gaggttctga gactggattc agtcttgttc cagtgacagt tggaaggcct ctgctggaga 3601 gacaccagct ctcagggcag agattggctt ggggccagaa ggaccctccc caaccctgga 3661 gacaccctga aggttcactg gctctccaga ttagcctctc ttcctctgtc aggcaaagat 3721 gaggagcccg tgttcccatc gggccctgct ggcagggact tgcagtggat tcttggtcag 3781 gtgtgcccac agatgcggag gcgaggtgag tgattccatc atttcagttc tcacctgcag 3841 ttttggtgaa gcaggagatg caccccacag ctctagctct caaatggctt cacagtcctt 3901 acttctctac ctgcctcaag aaggggctca gagcagagac ttgtgaattc cttagtaact 3961 gtgagtatat gaatgtgttg cacatgtcca cagtattggc gagataatta cataattcag 4021 atacctttaa tcatctttca agaaagaggc tcctcccatt caaccaccct agagaactgc 4081 ctttgttaaa tagttattta aagactcata catatcaaac catgactttg aaaggtcttc 4141 gaggctgggg ctctgtaatg aattagttta aaagccaagg tcataacatg aattgatggt 4201 caatttccct tcagcagaag gaaaaggtga tttagatcag tagctctttt gaaggttgtg 4261 gctgacctgt tcataccgtg tcgcctcatg gctagtgtgg cgttgaaaga gtagcgactg 4321 ggaagataca acttacacag tggggcctat tgttctttca agaacccttt ttttagctta 4381 tagaacccat gggtccagtt tagtaacgag tgatttaggc aatcaatgat aggtttataa 4441 tcttagatta ttccagcaaa gtgtggattg cattgttagg aagaacattt ggtgggaatg 4501 aacactcctg ggcataccgc tgacttttgt cccttgttcc cggtgtagga gacccaaggc 4561 atcttgaatc ccatctataa gaacacaatc ttccagcata cgtttgcttt ttcagaaact 4621 ctagcattct ctttaaatac tgacgcaatc cttaatggaa aagagatttc atgaagcaaa 4681 ttatgtattt caatagttct tctattttta gtgtccaaaa tttactaata cagaagcttg 4741 acaagcatgt cctcaccctc cccaccacat aaacacatgg acacacaccc aagccacaag 4801 aaatcccaag agagcagaag cgaattttta aaagatttat cgtgaggact gcatttccat 4861 tcactaattt tggctcaaac ttatgaggca ggaaataggg gccaacagta aatgggggag 4921 gcctcctgac accagcagag gaattttgta cccaggcgag gacttcttga acttctgcgt 4981 atctccgttt gatctctttc acctttattt catcttcata agaatgagaa aggctcaaaa 5041 ggaagcactt ttagaaatct tctctgacct agaagaatcc atccaaatcc ctgccttcct 5101 ctctgaacca acagttccct tctctgacag ggggccatcc tctatcttcc atccagcggc 5161 tcttcctttt aggaaggctc tggtgcagag cacttcaaat atgtcctcag gccagatact 5221 gattgctagt agagagacac ccggcaccca gtccgaagcc ctccctcaaa ggaccggctt 5281 atggcgttgg tcactggcag gctcagagac attctactgt gggcgcaggg agcccggccc 5341 cccatgcagc catgactgga tgcgccccca tctcgggggc ttgctgcact gcttgtttat 5401 tgaattttgc tacttagaat ggcaacatta actttgtgta ccattcattt tttaaaaatt 5461 ttccaaagct cggcagtgta tgaaagaaaa aactgggaaa gatacttggt ttctgttaac 5521 ttttgtgttg cttgcttaag tgattaaagc cagtgcttgg agccaagcct tcatgccacg 5581 aacatgctcc acagcctgcc ctttgctctc ctgctcacac tgaccaagaa tgccgcgtgc 5641 ttggcctact gaggtgaaag gacaattgaa tgacaggtgg gcaaagggag aacttcccct 5701 tcttggtgcg aggaaagtca caaatttaaa aatgttgctt ccagcccaga tcctaaatgc 5761 tagttctcag cagctgcgtg gcttaccgtt cgccatttcc accaccgcca gctgccagca 5821 ccgctacaga tcacagagat gtgaacagac aatggaaagc actcttagcc ttgcagtggt 5881 ctacattttt taggaaccaa tatttcagca ttctttatta cccggcacgc tgtgtccttt 5941 gcagagttca agtttatgtt actgccaggg tcagacagtc atttgctgct gctgctgctg 6001 ctgctgctgc ttctcgaact ggatgcatta ggaagctgct gtctgagtgt aggaatgtct 6061 tgctaagaaa gcaatgtctt ccttcatcct tttctttctt ccctctgcgt gtccttgttt 6121 ttgtgtaatg cgggagaggg ttagagctat agagattata tatacactat ccgtgcacat 6181 tatatatatg tagatatacc cctatcatgt cagagatctg catgtcagtt tttcagcaac 6241 taaggtgcct catgttctga gttcagcaga tataggaacc aagccgcccc ctcctgcact 6301 tgatgctccc acctttgttg tgcctcactt aaaatggtgc ttttttcagt tgtctgtctt 6361 ttcttatgtt tttatttgta aggtgctgta tataagttga atatattatg cacatatcct 6421 acccaatggg tagaacaaaa agttgttaat actgtaatat aatgtataga tgataccaat 6481 tttaacagaa atggcataga atttgtgaat gcctatgtgc tttgtcctct tttgtaagga 6541 aatttgcaaa tggatgcata cagattaaag tctatgtagt ttattttcct attaaatatc 6601 aatattataa cacaagagaa agaagtgtga acaaacaagc aacagtttat gaccagcgta 6661 tatatagcaa tggaaagttg catctttgct gtgaaaacac tttaaagaaa atacttttta 6721 aaaaatccca cagctttttg gttgccacta gacgcttctt attttaatca ttttagtaat 6781 gctcagctgg accagtgtta gttatatttg agtcagaaaa atgttgtttt tcaacttgct 6841 ttataatctc ctgcatctat ctcctgctgt agcatcayga aggtgtcagg caacagtgaa 6901 aagtgcacat ttttgttgtt gcagaaactg tgtcagagga ataagtaaat cagcctgcag 6961 cagaagactt tgttcagctc cagaggcatc tgtgaccgtc tgtgtccaag tctctctgtg 7021 cctttttctt ttacaaactg aagctgtgga gccaatgaag taacagtaga gattgtaggg 7081 aaagaatacc tcaggaaaaa caaatacact tacaagaaga ccctgttctt agaaaatgtg 7141 tttagttatg ggttagcact agaagagact tggctgtcag ccagccaagt gaaggacctc 7201 tcatccattc ccattcatgt cccatcataa tacggacmca aaaagcaaac tcggttttgc 7261 catcagttag aaattacgtt ttggattgta tattgttaca tctctcttcc agcttagttt 7321 ttagtgtctg attgtgacct ctgcatttat cttcaaatac cctaatttta aaacaaaaga 7381 acaagaaaag tttataacac catgttcact aaaaccacgg ttgaatcttg ggtgtgggca 7441 tcctttcgag tgttgtccat aagagcagtt cgtggaattt tgcccatctg acccatatta 7501 tcagcttatt ctgccaccag agtagagtct aataaattcc aaagttttta tttgctccat 7561 ggtgtatgtt ctgactttga aaatgtcaga ttctataatc atacccctaa catccaggag 7621 acaaatgaca gattatcttt aaactgaaat tgactctaca atgcaaccct taatgctgaa 7681 tggattaaaa aagtcagccc ttttagtatc tgtttgaaag ggccgtaaaa agttgacact 7741 tttgttgttg tggatcctgc gtgtctagac ccacgtgttg tttccatcgt atactgtagg 7801 gtgcacccct tgggattcat cattaagaac tgaggctcac tgttgtcaga aacaaagctc 7861 ccacccccca ggttcaacct tgtgggagaa ctgttgagca tgagaatgtt ctagactcag 7921 aggtactaaa atttgttacc acatcattgc ttcctttcta caggacgaat tgaggcttaa 7981 actttactgt taatgatact ggttcatttt aatgtgcttg ttggtatgtt gctatttttc 8041 atttcatagc tttcaaaaat catgctaatt gtatacttgt ctagtttaag gctattttaa 8101 aatatgtaca atactattca cagcatttag ttcgtttaat ttttattata aagcaatcta 8161 ctaaaaaagt acaactgtat ttgaactttt caatagttgt ttgtgagcta tgataatcaa 8221 aagtcattaa agtctttttt aacaaacatt cgtgcttact tttcaacata attcccagtt 8281 atatacagaa aaagatttcc acctgtcacg tatctgcctc ttttacctga gcaatggtgt 8341 agttcttaga cctaaggtct gtaattgcaa tacttttaaa gaaagatgtt gctctaagtg 8401 ctgtttgtta gttatgaaat cagatttttc tgcttgttct taatgctgtg gtcaaaccat 8461 agcacaaaat cattaaaaat aatcagcggc atac // LOCUS AF009510 1423 bp mRNA PRI 11-SEP-1997 DEFINITION Homo sapiens tapasin mRNA, complete cds. ACCESSION AF009510 NID g2388665 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1423) AUTHORS Ortmann,B., Copeman,J., Lehner,P.J., Sadasivan,B.K., Herberg,J.A., Grandea,A.G., Riddell,S.R., Tampe,R., Spies,T., Trowsdale,J. and Cresswell,P. TITLE A critical role for tapasin in the assembly and function of multimeric MHC class I-TAP complexes JOURNAL Science 277 (5330), 1306-1309 (1997) MEDLINE 97419259 REFERENCE 2 (bases 1 to 1423) AUTHORS Copeman,J., Ortmann,B., Lehner,P.J. and Cresswell,P. TITLE Direct Submission JOURNAL Submitted (19-JUN-1997) Immunobiology, HHMI, Yale University, 310 Cedar St., New Haven, CT 06511, USA FEATURES Location/Qualifiers source 1..1423 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6p21.3" /cell_line="EBV transformed lymphoblastoid" CDS 17..1363 /function="physically associated with MHC class I molecules and the transporter associated with antigen processing" /note="immunoglobulin superfamily member" /codon_start=1 /product="tapasin" /db_xref="PID:g2388666" /translation="MKSLSLLLAVALGLATAVSAGPAVIECWFVEDASGKGLAKRPGA LLLRQGPGEPPPRPDLDPELYLSVHDPAGALQAAFRRYPRGAPAPHCEMSRFVPLPAS AKWASGLTPAQNCPRALDGAWLMVSISSPVLSLSSLLRPQPEPQQEPVLITMATVVLT VLTHTPAPRVRLGQDALLDLSFAYMPPTSEAASSLAPGPPPFGLEWRRQHLGKGHLLL AATPGLNGQMPAAQEGAVAFAAWDDDEPWGPWTGNGTFWLPRVQPFQEGTYLATIHLP YLQGQVTLELAVYKPPKVSLMPATLARAAPGEAPPELLCLVSHFYPSGGLEVEWELRG GPGGRSQKAEGQRWLSALRHHSDGSVSLSGHLQPPPVTTEQHGARYACRIHHPSLPAS GRSAEVTLEVAGLSGPSLEDSVGLFLSAFLLLGLFKALGWAAVYLSTCKDSKKKAE" BASE COUNT 238 a 487 c 423 g 275 t ORIGIN 1 aggaggtcgc agcgccatga agtccctgtc tctgctcctc gctgtggctt tgggcctggc 61 gaccgccgtc tcagcaggac ccgcggtgat cgagtgttgg ttcgtggagg atgcgagcgg 121 aaagggcctg gccaagagac ccggtgcact gctgttgcgc cagggaccgg gggaaccgcc 181 gccccggccg gacctcgacc ctgagctcta tctcagtgta cacgaccccg cgggcgccct 241 ccaggctgcc ttcaggcggt atccccgggg cgcccccgca ccacactgcg agatgagccg 301 cttcgtgcct ctccccgcct ctgcgaaatg ggccagcggc ctgacccccg cgcagaactg 361 cccgcgggcc ctggatgggg cttggctgat ggtcagcata tccagcccag tcctcagcct 421 ctccagcctc ttgcgaccac agccagagcc tcagcaggag cctgttctca tcaccatggc 481 aacagtggta ctgactgtcc tcacccacac ccctgcccct cgagtgagac tgggacaaga 541 tgctctgctg gacttgagct ttgcctacat gccccccacc tccgaggccg cctcatctct 601 ggctccgggt ccccctccct ttgggctaga gtggcgacgc cagcacctgg gtaagggaca 661 tctgctcctg gctgcaactc ctgggctgaa tggccagatg ccagcagccc aagaaggggc 721 cgtggcattt gctgcttggg atgatgatga gccatggggc ccatggaccg gaaatgggac 781 cttctggctg cctagagttc aaccctttca ggagggcacc tatctggcca ccatacacct 841 gccatacctg caaggacagg tcaccctgga gcttgctgtg tacaaacccc ccaaagtgtc 901 cctgatgcca gcaacccttg cacgggccgc cccaggggag gcacccccgg aattgctctg 961 ccttgtgtcc cacttctacc cttctggggg cctggaggtg gagtgggaac tccggggtgg 1021 cccagggggc cgctctcaga aggccgaggg gcagaggtgg ctctcggccc tgcgccacca 1081 ttccgatggc tctgtcagcc tctctgggca cttgcagccg cccccagtca ccactgagca 1141 gcatggggca cgctatgcct gtcgaattca ccatcccagc ctgcctgcct cggggcgcag 1201 cgctgaggtc accctggagg tagcaggtct ttcagggccc tcccttgagg acagcgtagg 1261 ccttttcctg tctgcctttc ttctgcttgg gctcttcaag gcactgggct gggctgctgt 1321 ctacctgtcc acctgcaagg attcaaagaa gaaagcagag tgagggcact cactgccatc 1381 ctgtggaagc caccatcatc tctggcccaa gcttctgtag tag // LOCUS AF009615 3410 bp mRNA PRI 27-SEP-1997 DEFINITION Homo sapiens ADAM10 (ADAM10) mRNA, complete cds. ACCESSION AF009615 NID g2393946 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3410) AUTHORS Rosendahl,M.S., Ko,S.C., Long,D.L., Brewer,M.T., Rosenzweig,B., Hedl,E., Anderson,L., Pyle,S.M., Moreland,J., Meyers,M.A., Kohno,T., Lyons,D. and Lichenstein,H.S. TITLE Identification and characterization of a pro-tumor necrosis factor-alpha-processing enzyme from the ADAM family of zinc metalloproteases JOURNAL J. Biol. Chem. 272 (39), 24588-24593 (1997) MEDLINE 97450992 REFERENCE 2 (bases 1 to 3410) AUTHORS Lichenstein,H.S. TITLE Direct Submission JOURNAL Submitted (20-JUN-1997) Inflammation, Amgen, 3200 Walnut St., Boulder, CO 80301, USA FEATURES Location/Qualifiers source 1..3410 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..3410 /gene="ADAM10" CDS 470..2716 /gene="ADAM10" /codon_start=1 /product="ADAM10" /db_xref="PID:g2393947" /translation="MVLLRVLILLLSWAAGMGGQYGNPLNKYIRHYEGLSYNVDSLHQ KHQRAKRAVSHEDQFLRLDFHAHGRHFNLRMKRDTSLFSDEFKVETSNKVLDYDTSHI YTGHIYGEEGSFSHGSVIDGRFEGFIQTRGGTFYVEPAERYIKDRTLPFHSVIYHEDD INYPHKYGPQGGCADHSVFERMRKYQMTGVEEVTQIPQEEHAANGPELLRKKRTTSAE KNTCQLYIQTDHLFFKYYGTREAVIAQISSHVKAIDTIYQTTDFSGIRNISFMVKRIR INTTADEKDPTNPFRFPNIGVEKFLELNSEQNHDDYCLAYVFTDRDFDDGVLGLAWVG APSGSSGGICEKSKLYSDGKKKSLNTGIITVQNYGSHVPPKVSHITFAHEVGHNFGSP HDSGTECTPGESKNLGQKENGNYIMYARATSGDKLNNNKFSLCSIRNISQVLEKKRNN CFVESGQPICGNGMVEQGEECDCGYSDQCKDECCFDANQPEGRKCKLKPGKQCSPSQG PCCTAQCAFKSKSEKCRDDSDCAREGICNGFTALCPASDPKPNFTDCNRHTQVCINGQ CAGSICEKYGLEECTCASSDGKDDKELCHVCCMKKMDPSTCASTGSVQWSRHFSGRTI TLQPGSPCNDFRGYCDVFMRCRLVDADGPLARLKKAIFSPELYENIAEWIVAHWWAVL LMGIALIMLMAGFIKICSVHTPSSNPKLPPPKPLPGTLKRRRPPQPIQQPQRQRPRES YQMGHMRR" BASE COUNT 1006 a 682 c 830 g 892 t ORIGIN 1 gaattcgagg atccgggtac catgggcggc ggcaggccta gcagcacggg aaccgtcccc 61 cgcgcgcatg cgcgcgcccc tgaagcgcct gggggacggg tatgggcggg aggtaggggc 121 gcggctccgc gtgccagttg ggtgcccgcg cgtcacgtgg tgaggaagga ggcggaggtc 181 tgagtttcga gggagggggg gagagaagag ggaacgagca agggaaggaa agcggggaaa 241 ggaggaagga aacgaacgag ggggagggag gtccctgttt tggaggagct aggagcgttg 301 ccggcccctg aagtggagcg agagggaggt gcttcgccgt ttctcctgcc aggggaggtc 361 ccggcttccc gtggaggctc cggaccaagc cccttcagct tctccctccg gatcgatgtg 421 ctgctgttaa cccgtgagga ggcggcggcg gcggcagcgg cagcggaaga tggtgttgct 481 gagagtgtta attctgctcc tctcctgggc ggcggggatg ggaggtcagt atgggaatcc 541 tttaaataaa tatatcagac attatgaagg attatcttac aatgtggatt cattacacca 601 aaaacaccag cgtgccaaaa gagcagtctc acatgaagac caatttttac gtctagattt 661 ccatgcccat ggaagacatt tcaacctacg aatgaagagg gacacttccc ttttcagtga 721 tgaatttaaa gtagaaacat caaataaagt acttgattat gatacctctc atatttacac 781 tggacatatt tatggtgaag aaggaagttt tagccatggg tctgttattg atggaagatt 841 tgaaggattc atccagactc gtggtggcac attttatgtt gagccagcag agagatatat 901 taaagaccga actctgccat ttcactctgt catttatcat gaagatgata ttaactatcc 961 ccataaatac ggtcctcagg ggggctgtgc agatcattca gtatttgaaa gaatgaggaa 1021 ataccagatg actggtgtag aggaagtaac acagatacct caagaagaac atgctgctaa 1081 tggtccagaa cttctgagga aaaaacgtac aacttcagct gaaaaaaata cttgtcagct 1141 ttatattcag actgatcatt tgttctttaa atattacgga acacgagaag ctgtgattgc 1201 ccagatatcc agtcatgtta aagcgattga tacaatttac cagaccacag acttctccgg 1261 aatccgtaac atcagtttca tggtgaaacg cataagaatc aatacaactg ctgatgagaa 1321 ggaccctaca aatcctttcc gtttcccaaa tattggtgtg gagaagtttc tggaattgaa 1381 ttctgagcag aatcatgatg actactgttt ggcctatgtc ttcacagacc gagattttga 1441 tgatggcgta cttggtctgg cttgggttgg agcaccttca ggaagctctg gaggaatatg 1501 tgaaaaaagt aaactctatt cagatggtaa gaagaagtcc ttaaacactg gaattattac 1561 tgttcagaac tatgggtctc atgtacctcc caaagtctct cacattactt ttgctcacga 1621 agttggacat aactttggat ccccacatga ttctggaaca gagtgcacac caggagaatc 1681 taagaatttg ggtcaaaaag aaaatggcaa ttacatcatg tatgcaagag caacatctgg 1741 ggacaaactt aacaacaata aattctcact ctgtagtatt agaaatataa gccaagttct 1801 tgagaagaag agaaacaact gttttgttga atctggccaa cctatttgtg gaaatggaat 1861 ggtagaacaa ggtgaagaat gtgattgtgg ctatagtgac cagtgtaaag atgaatgctg 1921 cttcgatgca aatcaaccag agggaagaaa atgcaaactg aaacctggga aacagtgcag 1981 tccaagtcaa ggtccttgtt gtacagcaca gtgtgcattc aagtcaaagt ctgagaagtg 2041 tcgggatgat tcagactgtg caagggaagg aatatgtaat ggcttcacag ctctctgccc 2101 agcatctgac cctaaaccaa acttcacaga ctgtaatagg catacacaag tgtgcattaa 2161 tgggcaatgt gcaggttcta tctgtgagaa atatggctta gaggagtgta cgtgtgccag 2221 ttctgatggc aaagatgata aagaattatg ccatgtatgc tgtatgaaga aaatggaccc 2281 atcaacttgt gccagtacag ggtctgtgca gtggagtagg cacttcagtg gtcgaaccat 2341 caccctgcaa cctggatccc cttgcaacga ttttagaggt tactgtgatg ttttcatgcg 2401 gtgcagatta gtagatgctg atggtcctct agctaggctt aaaaaagcaa tttttagtcc 2461 agagctctat gaaaacattg ctgaatggat tgtggctcat tggtgggcag tattacttat 2521 gggaattgct ctgatcatgc taatggctgg atttattaag atatgcagtg ttcatactcc 2581 aagtagtaat ccaaagttgc ctcctcctaa accacttcca ggcactttaa agaggaggag 2641 acctccacag cccattcagc aaccccagcg tcagcggccc cgagagagtt atcaaatggg 2701 acacatgaga cgctaactgc agcttttgcc ttggttcttc ctagtgccta caatgggaaa 2761 acttcactcc aaagagaaac ctattaagtc atcatctcca aactaaaccc tcacaagtaa 2821 cagttgaaga aaaaatggca agagatcata tcctcagacc aggtggaatt acttaaattt 2881 taaagcctga aaattccaat ttgggggtgg gaggtggaaa aggaacccaa ttttcttatg 2941 aacagatatt tttaacttaa tggcacaaag tcttagaata ttattatgtg ccccgtgttc 3001 cctgttcttc gttgctgcat tttcttcact tgcaggcaaa cttggctctc aataaacttt 3061 taccacaaat tgaaataaat atattttttt caactgccaa tcaaggctag gaggctcgac 3121 cacctcaaca ttggagacat cacttgccaa tgtacatacc ttgttatatg cagacatgta 3181 tttcttacgt acactgtact tctgtgtgca attgtaaaca gaaattgcaa tatggatgtt 3241 tctttgtatt ataaaatttt tccgctctta attaaaaatt actgtttaat tgacatactc 3301 aggataacag agaatggtgg tattcagtgg tccaggattc tgtaatgctt tacacaggca 3361 gttttgaaat gaaaatcaat ttaccccatg gtacccggat cctcgaattc // LOCUS AF009620 1395 bp mRNA PRI 23-SEP-1997 DEFINITION Homo sapiens apoptotic caspase Mch5-beta mRNA, alternatively spliced, complete cds. ACCESSION AF009620 NID g2429161 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1395) AUTHORS Srinivasula,S.M., Ahmad,M., Ottilie,S., Bullrich,F., Banks,S., Fernandes-Alnemri,T., Croce,C.M., Litwack,G., Tomaselli,K.J., Armstrong,R.C. and Alnemri,E.S. TITLE FLAME-1, a novel FADD-like anti-apoptotic molecule that regulates Fas/TNFR1-induced apoptosis JOURNAL J. Biol. Chem. 272 (30), 18542-18545 (1997) MEDLINE 97373543 REFERENCE 2 (bases 1 to 1395) AUTHORS Alnemri,E.S. TITLE Direct Submission JOURNAL Submitted (18-JUN-1997) Microbiology and Immunology, Thomas, Jefferson University, Kimmel Cancer Institute, 233 S. Tenth, Street, Philadelphia, PA 19107, USA FEATURES Location/Qualifiers source 1..1395 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="2" /map="2q33-34" /cell_line="Jurkat" /cell_type="T-lymphocyte" CDS 1..1395 /function="promotes apoptosis by interaction with the adaptor molecule FADD/Mort-1" /note="caspase-8; contains death effector domain; alternatively spliced form of Mch5/MACH/FLICE" /codon_start=1 /evidence=experimental /product="apoptotic caspase Mch5-beta" /db_xref="PID:g2429162" /translation="MDFSRNLYDIGEQLDSEDLASLKFLSLDYIPQRKQEPIKDALML FQRLQEKRMLEESNLSFLKELLFRINRLDLLITYLNTRKEEMERELQTPGRAQISAYR VMLYQISEEVSRSELRSFKFLLQEEISKCKLDDDMNLLDIFIEMEKRVILGEGKLDIL KRVCAQINKSLLKIINDYEEFSKGEELCGVMTISDSPREQDSESQTLDKVYQMKSKPR GYCLIINNHNFAKAREKVPKLHSIRDRNGTHLDAGALTTTFEELHFEIKPHHDCTVEQ IYEILKIYQLMDHSNMDCFICCILSHGDKGIIYGTDGQEAPIYELTSQFTGLKCPSLA GKPKVFFIQACQGDNYQKGIPVETDSEEQPYLEMDLSSPQTRYIPDEADFLLGMATVN NCVSYRNPAEGTWYIQSLCQSLRERCPRGDDILTILTEVNYEVSNKDDKKNMGKQMPQ PTFTLRKKLVFPSD" BASE COUNT 438 a 298 c 330 g 329 t ORIGIN 1 atggacttca gcagaaatct ttatgatatt ggggaacaac tggacagtga agatctggcc 61 tccctcaagt tcctgagcct ggactacatt ccgcaaagga agcaagaacc catcaaggat 121 gccttgatgt tattccagag actccaggaa aagagaatgt tggaggaaag caatctgtcc 181 ttcctgaagg agctgctctt ccgaattaat agactggatt tgctgattac ctacctaaac 241 actagaaagg aggagatgga aagggaactt cagacaccag gcagggctca aatttctgcc 301 tacagggtca tgctctatca gatttcagaa gaagtgagca gatcagaatt gaggtctttt 361 aagtttcttt tgcaagagga aatctccaaa tgcaaactgg atgatgacat gaacctgctg 421 gatattttca tagagatgga gaagagggtc atcctgggag aaggaaagtt ggacatcctg 481 aaaagagtct gtgcccaaat caacaagagc ctgctgaaga taatcaacga ctatgaagaa 541 ttcagcaaag gggaggagtt gtgtggggta atgacaatct cggactctcc aagagaacag 601 gatagtgaat cacagacttt ggacaaagtt taccaaatga aaagcaaacc tcggggatac 661 tgtctgatca tcaacaatca caattttgca aaagcacggg agaaagtgcc caaacttcac 721 agcattaggg acaggaatgg aacacacttg gatgcagggg ctttgaccac gacctttgaa 781 gagcttcatt ttgagatcaa gccccaccat gactgcacag tagagcaaat ctatgagatt 841 ttgaaaatct accaactcat ggaccacagt aacatggact gcttcatctg ctgtatcctc 901 tcccatggag acaagggcat catctatggc actgatggac aggaggcccc catctatgag 961 ctgacatctc agttcactgg tttgaagtgc ccttcccttg ctggaaaacc caaagtgttt 1021 tttattcagg cttgtcaggg ggataactac cagaaaggta tacctgttga gactgattca 1081 gaggagcaac cctatttaga aatggattta tcatcacctc aaacgagata tatcccggat 1141 gaggctgact ttctgctggg gatggccact gtgaataact gtgtttccta ccgaaaccct 1201 gcagagggaa cctggtacat ccagtcactt tgccagagcc tgagagagcg atgtcctcga 1261 ggcgatgata ttctcaccat cctgactgaa gtgaactatg aagtaagcaa caaggatgac 1321 aagaaaaaca tggggaaaca gatgcctcag cctactttca cactaagaaa aaaacttgtc 1381 ttcccttctg attga // LOCUS AF009644 1983 bp mRNA PRI 07-DEC-1997 DEFINITION Homo sapiens clone 41 immunoglobulin-like transcript 5 protein mRNA, complete cds. ACCESSION AF009644 NID g2662447 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1983) AUTHORS Colonna,M., Navarro,F., Bellon,T., Llano,M., Garcia,P., Samaridis,J., Angman,L., Cella,M. and Lopez-Botet,M. TITLE A common inhibitory receptor for major histocompatibility complex class I molecules on human lymphoid and myelomonocytic cells JOURNAL J. Exp. Med. 186 (11), 1809-1818 (1997) MEDLINE 98044246 REFERENCE 2 (bases 1 to 1983) AUTHORS Colonna,M. TITLE Direct Submission JOURNAL Submitted (20-JUN-1997) Basel Institute for Immunology, 487 Grenzacherstrasse, Basel CH-4005, Switzerland FEATURES Location/Qualifiers source 1..1983 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="myelomonocytic" /chromosome="19" /clone="41" /map="19q13.4" CDS 3..590 /note="immunoglobulin superfamily; inhibitory receptor" /codon_start=1 /product="immunoglobulin-like transcript 5 protein" /db_xref="PID:g2662448" /translation="MTPALTALLCLGLSLGPRTRVQAGPFPKPTLWAEPGSVISWGSP VTIWCQGSLEAQEYQLDKEGSPEPLDRNNPLEPKNKARFSIPSMTQHHAGRYRCHYYS SAGWSEPSDPLELVMTGAYSKPTLSALPSPVVASGGNMTLRCGSQKGYHHFVLMKEGE HQLPRTLDSQQLHSGGFQALFPVGPVTPSHRWRRV" BASE COUNT 428 a 662 c 551 g 342 t ORIGIN 1 ccatgacgcc cgccctcaca gccctgctct gccttgggct gagtctgggc cccaggaccc 61 gcgtgcaggc agggcccttc cccaaaccca ccctctgggc tgagccaggc tctgtgatca 121 gctgggggag ccccgtgacc atctggtgtc aggggagcct ggaggcccag gagtaccaac 181 tggataaaga gggaagccca gagcccttgg acagaaataa cccactggaa cccaagaaca 241 aggccagatt ctccatccca tccatgacac agcaccatgc agggagatac cgctgccact 301 attacagctc tgcaggctgg tcagagccca gcgaccccct ggagctggtg atgacaggag 361 cctatagcaa acccaccctc tcagccctgc ccagccctgt ggtggcctca ggggggaata 421 tgaccctccg atgtggctca cagaagggat atcaccattt tgttctgatg aaggaaggag 481 aacaccagct cccccggacc ctggactcac agcagctcca cagtgggggg ttccaggccc 541 tgttccctgt gggccccgtg acccccagcc acaggtggag gcgtgtctag gaagccctcc 601 ctcctgaccc tgcagggccc tgtcctggcc cctgggcaga gcctgaccct ccagtgtggc 661 tctgatgtcg gctacgacag atttgttctg tataaggagg gggaacgtga cttcctccag 721 cgccctggcc agcagcccca ggctgggctc tcccaggcca acttcaccct gggccctgtg 781 agccgctcct acgggggcca gtacaggtgc tatggtgcac acaacctctc ctccgagtgg 841 tcggccccca gtgaccccct ggacatcctg atcacaggac agatctatga caccgtctcc 901 ctgtcagcac agccgggccc cacagtggcc tcaggagaga acatgaccct gctgtgtcag 961 tcacgggggt attttgacac tttccttctg accaaagaag gggcagccca tcccccactg 1021 cgtctgagat caatgtacgg agctcataag taccaggctg aattccccat gagtcctgtg 1081 acctcagccc acgcggggac ctacaggtgc tacggctcac gcagctccaa cccctacctg 1141 ctgtctcacc ccagtgagcc cctggagctc gtggtctcag gacactctgg aggctccagc 1201 ctcccaccca cagggccgcc ctccacacct ggtctgggaa gatacctgga ggttttgatt 1261 ggggtctcgg tggccttcgt cctgctgctc ttcctcctcc tcttcctcct cctccgacgt 1321 cagcgtcaca gcaaacacag gacatctgac cagagaaaga ctgatttcca gcgtcctgca 1381 ggggctgcgg agacagagcc caaggacagg ggcctgctga ggaggtccag cccagctgct 1441 gacgtccagg aagaaaacct ctatgctgct gtgaaggaca cacagtctga ggacagggtg 1501 gagctggaca gtcagagccc acacgatgaa gacccccagg cagtgacgta tgccccggtg 1561 aaacactcca gtcctaggag agaaatggcc tctcctccct cctcactgtc tggggaattc 1621 ctggacacaa aggacagaca ggtggaagag gacaggcaga tggacactga ggctgctgca 1681 tctgaagcct cccaggatgt gacctacgcc cagctgcaca gcttgaccct tagacggaag 1741 gcaactgagc ctcctccatc ccaggaaggg gaacctccag ctgagcccag catctacgcc 1801 actctggcca tccactagcc cggggggtac gcagacccca cactcagcag aaggagactc 1861 aggactgctg aaggcacggg agctgccccc agtggacacc agtgaacccc agtcagcctg 1921 gacccctaac acagaccatg aggagacgct gggaacttgt gggactcacc tgactcaaag 1981 atg // LOCUS AF009746 2927 bp mRNA PRI 08-NOV-1997 DEFINITION Homo sapiens peroxisomal membrane protein 69 (PMP69) mRNA, complete cds. ACCESSION AF009746 NID g2343156 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2927) AUTHORS Holzinger,A., Kammerer,S. and Roscher,A.A. TITLE Primary structure of human PMP69, a putative peroxisomal ABC-transporter JOURNAL Biochem. Biophys. Res. Commun. 237 (1), 152-157 (1997) MEDLINE 97410133 REFERENCE 2 (bases 1 to 2927) AUTHORS Holzinger,A., Kammerer,S. and Roscher,A.A. TITLE Direct Submission JOURNAL Submitted (21-JUN-1997) Department of Clinical Chemistry and Metabolism, Dr. v. Hauner Children's Hospital, Ludwig-Maximilians-University, Lindwurmstrasse 4, Munich 80337, Germany FEATURES Location/Qualifiers source 1..2927 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="14" /map="14q24.3" gene 1..2927 /note="PXMP1-L; P70R" /gene="PMP69" 5'UTR 1..51 /gene="PMP69" CDS 52..1872 /gene="PMP69" /note="69 kDa peroxisomal ABC-transporter" /codon_start=1 /product="peroxisomal membrane protein 69" /db_xref="PID:g2343157" /translation="MAVAGPAPGAGARPRLDLQFLQRFLQILKVLFPSWSSQNALMFL TLLCLTLLEQFVIYQVGLIPSQYYGVLGNKDLEGFKTLTFLAVMLIVLNSTLKSFDQF TCNLLYVSWRKDLTEHLHRLYFRGRAYYTLNVLRDDIDNPDQRISQDVERFCRQLSSM ASKLIISPFTLVYYTYQCFQSTGWLGPVSIFGYFILGTVVNKTLMGPIVMKLVHQEKL EGDFRFKHMQIRVNAEPAAFYRAGHVEHMRTDRRLQRLLQTQRELMSKELWLYIGINT FDYLGSILSYVVIAIPIFSGVYGDLSPAELSTLVSKNAFVCIYLISCFTQLIDLSTTL SDVAGYTHRIGQLRETLLDMSLKSQDCEILGESEWGLDTPPGWPAAEPADTAFLLERV SISAPSSDKPLIKDLSLKISEGQSLLITGNTGTGKTSLLRVLGGLWTSTRGSVQMLTD FGPHGVLFLPQKPFFTDGTLREQVIYPLKEVYPDSGSADDERILRFLELAGLSNLVAR TEGLDQQVDWNWYDVLSPGEMQRLSFARLFYLQPKYAVLDEATSALTEEVESELYRIG QQLGMTFISVGHRQSLEKFHSLVLKLCGGGRWELMRIKVE" 3'UTR 1873..2927 /gene="PMP69" polyA_signal 2230..2235 /gene="PMP69" /note="potential polyA signal" polyA_site 2351 /gene="PMP69" /note="alternative polyadenylation site" polyA_signal 2891..2896 /gene="PMP69" /note="potential polyA signal" BASE COUNT 724 a 729 c 747 g 727 t ORIGIN 1 ccagtactga actaggggca gctgggctcc agagtccctc ggtctcaggt catggcggtc 61 gcggggcccg cgcccggagc tggcgccagg cccaggttag atctgcaatt tctccagcgg 121 ttcctgcaga tactgaaggt tttgtttcct tcttggtcat cacaaaatgc cttgatgttc 181 ctgacccttt tgtgcctgac cctactggag caatttgtga tctaccaggt tggcttgatc 241 cccagtcagt actatggggt cctgggaaac aaagacttgg aagggtttaa gactctgaca 301 ttcctggctg tcatgctcat tgttctgaac tccacgctga agagctttga tcagttcacc 361 tgcaacctgc tgtatgtgag ctggaggaag gacctcactg agcaccttca ccgcctctac 421 ttccggggcc gtgcgtacta caccctcaac gtgctgcggg atgacatcga taacccggac 481 cagcgcatca gccaggacgt ggagcgattc tgccggcagc tcagcagcat ggccagcaag 541 ctcatcatct ccccgttcac cctcgtctac tacacttacc agtgcttcca aagcacaggc 601 tggctcgggc ctgtgagcat cttcgggtat ttcatcctgg ggaccgtggt gaacaaaact 661 ttgatgggcc ccattgtgat gaagctggtg catcaggaga agctggaggg agattttagg 721 ttcaagcaca tgcagattcg ggtgaatgcg gagcctgctg ctttctacag agctgggcat 781 gtggagcaca tgaggacaga ccgcaggctg cagagactcc ttcagaccca gagggagctg 841 atgtccaagg agctctggct gtacatcggc atcaacacct ttgactatct gggcagcatc 901 ctgagttatg ttgtcatcgc aatccccatt ttcagcgggg tctatggaga cctgagtccc 961 gcagagctta gcaccctggt cagcaagaat gcctttgtgt gcatctacct catcagctgc 1021 ttcacccagc tcatcgacct gtccacgacg ctctcagatg tggctggcta cacgcacaga 1081 attgggcagc ttcgggagac gcttctggac atgtccctga agtcacagga ctgcgagatc 1141 ctgggcgaga gcgagtgggg cttggacaca cccccagggt ggccagcggc agagccagca 1201 gacacagcat ttctccttga gcgggtctcc atctctgccc cctcctctga caaaccccta 1261 atcaaggatc tgagcctaaa gatctccgag ggacagagcc tgctcatcac aggcaacacg 1321 ggcactggca agacctcctt gctccgggtt ctaggtggcc tctggacgag tacacggggc 1381 tcagtgcaga tgctgacgga ctttgggccc catggggtgc tattcctgcc acaaaagcca 1441 ttcttcactg acgggaccct tcgggagcag gtgatatatc ccctgaagga ggtctacccc 1501 gactcaggtt ctgccgatga tgagaggatc ttgaggttct tggaattggc aggcctgtcc 1561 aacttggtgg caaggacaga gggtctggac cagcaggtgg actggaactg gtatgatgtt 1621 ctgtccccgg gggagatgca acggctctcc tttgcccgac tcttctacct gcagccgaag 1681 tacgcagtgc ttgatgaagc caccagtgcc ctgacagagg aagtggagag cgagctctat 1741 cgcatcggcc agcagctggg gatgacgttc atcagtgtgg gacatcggca gagccttgag 1801 aagtttcatt ccttggttct gaaactctgt ggaggaggaa gatgggagct gatgagaatc 1861 aaagtggaat gaagctctgg cttttggaag gagaaccaca ctgtggcggg tcggcggccc 1921 tcaagagaca ccaggaggac tgacagcgaa gatcgagctc aggttcgcca catagatccc 1981 gtgcaggagc cgcatgggtc ctgtgcagga cccctagcag tggtgggctg agcccaggtc 2041 taggtttctg tgggggacat tgaatctccc agtgttcagt ctcccaggac tctgctgcct 2101 cagccagagc ctccatatgc ttgaagtgct gattacctac aaatgatttc agatcatgtt 2161 tgctaaagag aaatctggaa gtgtgagatc tgtaagaaat gaaagaaatg actcttggag 2221 tcaagagatc tggaaatctt ttaatcagtt aaattgtgca gcaatagatt tttaacttta 2281 actgaccatt taagtttttt aataagtttt ttacaaagaa aagttaaaca ttaaaaagaa 2341 ttacagcttt ctgtcttctc tatcatggaa tgattttttt tattgaatct ccagatttgt 2401 atttgacagc ttggtgggaa gggaagcaca ctctgctgtt ctggaatctt atgcccaggg 2461 tttttcactt ctccccacat ctccctttcc acttgccagt gttgtgtagt tagaacctga 2521 accactaact tctaggggcc tttggtctgc cctaccttaa cccaaatgaa agtaaatccc 2581 tttcccctta gccaaaataa ggttgggttt tctaaaaaaa tagtctatat tagggaacaa 2641 caacagcaaa ttagacaaaa cccagaaagc acaaagcatg aggtggagtt actgtgccca 2701 aagtcctcac tcagaccagt gcccctccag ttcagttgtc tatgtattac cttccttacc 2761 ttcataatgt ttgccaggct tctgtacttc tagtacttga gttacttaat atttttaaaa 2821 acaaaccttt taagtttaaa tggattttta agtgaaattt tacacttaac atagatggaa 2881 agtaactaaa aataaataca agaaaaacaa aaaaaaaaaa aaaaaaa // LOCUS AF010126 550 bp mRNA PRI 26-JUL-1997 DEFINITION Homo sapiens breast cancer-specific protein 1 (BCSG1) mRNA, complete cds. ACCESSION AF010126 NID g2281473 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 550) AUTHORS Ji,H., Liu,Y.E., Jia,T., Wang,M., Liu,J., Xiao,G., Joseph,B.K., Rosen,C. and Shi,Y.E. TITLE Identification of a breast cancer-specific gene, BCSG1, by direct differential cDNA sequencing JOURNAL Cancer Res. 57 (4), 759-764 (1997) MEDLINE 97178957 REFERENCE 2 (bases 1 to 550) AUTHORS Ji,H., Liu,Y.E., Jia,T., Wang,M., Liu,J., Xiao,G., Joseph,B.K., Rosen,C. and Shi,Y.E. TITLE Direct Submission JOURNAL Submitted (23-JUN-1997) Ped. Res., Long Island Jewish Medical Center, 270-05 76th Ave., New Hyde Park, NY 11040, USA FEATURES Location/Qualifiers source 1..550 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="breast cancer" /note="cDNA highly abundant in a breast cancer library but not in normal library" gene 1..550 /gene="BCSG1" CDS 12..395 /gene="BCSG1" /note="breast cancer-specific protein 1; synuclein-like; AD amyloid-like" /codon_start=1 /product="BCSG1 protein" /db_xref="PID:g2281474" /translation="MDVFKKGFSIAKKGVVGAVEKTKQGVTEAAEKTKEGVMYVGAKT KENVVQSVTSVAEKTKEQANAVSKAVVSSVNTVATKTVEEAENIAVTSGVVRKEDLRP SAPQQEGEASKEKEEVAEEAQSGGD" BASE COUNT 132 a 145 c 192 g 81 t ORIGIN 1 cacgagccac catggatgtt ttcaagaagg gcttctccat cgccaagaag ggcgtggtgg 61 gtgcggtgga aaagaccaag cagggggtga cggaagcagc tgagaagacc aaggaggggg 121 tcatgtatgt gggagccaag accaaggaga atgttgtaca gagcgtgacc tcagtggccg 181 agaagaccaa ggagcaggcc aacgccgtga gcaaggctgt ggtgagcagc gtcaacactg 241 tggccaccaa gaccgtggag gaggcggaga acatcgcggt cacctccggg gtggtgcgca 301 aggaggactt gaggccatct gccccccaac aggagggtga ggcatccaaa gagaaagagg 361 aagtggcaga ggaggcccag agtgggggag actagagggc tacaggccag cgtggatgac 421 ctgaagagcg ctcctctgcc ttggacacca tcccctccta gcacaaggag tgcccgcctt 481 gagtgacatg cgggtgccca cgctcctgcc ctcgtctccc tggacaccct tggcctgtcc 541 acctgtgctg // LOCUS AF010187 1137 bp mRNA PRI 02-JAN-1998 DEFINITION Homo sapiens FGF-1 intracellular binding protein (FIBP) mRNA, complete cds. ACCESSION AF010187 NID g2738519 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1137) AUTHORS Kolpakova,E.N., Wiedlocha,A., Stenmark,H., Klingenberg,O. and Olsnes,S. TITLE Cloning of an intracellular protein that binds specifically to mitogenic acidic fibroblast growth factor JOURNAL Unpublished REFERENCE 2 (bases 1 to 1137) AUTHORS Kolpakova,E.N., Wiedlocha,A., Stenmark,H., Klingenberg,O. and Olsnes,S. TITLE Direct Submission JOURNAL Submitted (25-JUN-1997) Department of Biochemistry, The Norwegian Radium Hospital, Montebello, Oslo N-0310, Norway FEATURES Location/Qualifiers source 1..1137 /organism="Homo sapiens" /db_xref="taxon:9606" /note="Clontech HeLa S3 cDNA library in pGAD GH" gene 1..1137 /gene="FIBP" CDS 19..1137 /gene="FIBP" /codon_start=1 /product="FGF-1 intracellular binding protein" /db_xref="PID:g2738520" /translation="MTSELDIFVGNTTLIDEDVYRLWLDGYSVTDAVALRVRSGILEQ TGATAAVLQSDTMDHYRTFHMLERLLHAPPKLLHQLIFQIPPSRQALLIERYYAFDEA FVREVLGKKLSKGTKKDLDDISTKTGITLKSCRRQFDNFKRVFKVVEEMRGSLVDNIQ QHFLLSDRLARDYAAIVFFANNRFETGKKKLQYLSFGDFAFCAELMIQNWTLGAVDSQ MDDMDMDLDKEFLQDLKELKVLVADKDLLDLHKSLVCTALRGKLGVFSEMEANFKNLS RGLVNVAAKLTHNKDVRDLFVDLVEKFVEPCRSDHWPLSDVRFFLNQYSASVHSLDGF RHQASGTATWAPSAAASCACIMTEVPPNRPPTLTIKLL" BASE COUNT 246 a 334 c 318 g 239 t ORIGIN 1 gggcttgcgg gcttcgccat gaccagtgag ctggacatct tcgtggggaa cacgaccctt 61 atcgacgagg acgtgtatcg cctctggctc gatggttact cggtgaccga cgcggtggcc 121 ctgcgggtgc gctcgggaat cctggagcag actggcgcca cggcagcggt gctgcagagc 181 gacaccatgg accattaccg caccttccac atgctcgagc ggctgctgca tgcgccgccc 241 aagctactgc accagctcat cttccagatt ccgccctccc ggcaggcact actcatcgag 301 aggtactatg cctttgatga ggcctttgtt cgggaggtgc tgggcaagaa gctgtccaaa 361 ggcaccaaga aagacctgga tgacatcagc accaaaacag gcatcaccct caagagctgc 421 cggagacagt ttgacaactt taaacgggtc ttcaaggtgg tagaggaaat gcggggctcc 481 ctggtggaca atattcagca acacttcctc ctctctgacc ggttggccag ggactatgca 541 gccatcgtct tctttgctaa caaccgcttt gagacaggga agaaaaaact gcagtatctg 601 agcttcggtg actttgcctt ctgcgctgag ctcatgatcc aaaactggac ccttggagcc 661 gtcgactcac agatggatga catggacatg gacttagaca aggaatttct ccaggacttg 721 aaggagctca aggtgctagt ggctgacaag gaccttctgg acctgcacaa gagcctggtg 781 tgcactgctc tccggggaaa gctgggcgtc ttctctgaga tggaagccaa cttcaagaac 841 ctgtcccggg ggctggtgaa cgtggccgcc aagctgaccc acaataaaga tgtcagagac 901 ctgtttgtgg acctcgtgga gaagtttgtg gaaccctgcc gctccgacca ctggccactc 961 agcgacgtgc ggttcttcct gaatcagtat tcagcgtctg tccactccct cgatggcttc 1021 cgacaccagg cctctgggac cgctacatgg gcaccctccg cggctgcctc ctgcgcctgt 1081 atcatgactg aggtgcctcc caaccgtccg cccacgctga caataaagtt gctctga // LOCUS AF010193 3111 bp mRNA PRI 18-OCT-1997 DEFINITION Homo sapiens MAD-related gene SMAD7 (SMAD7) mRNA, complete cds. ACCESSION AF010193 NID g2252821 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3111) AUTHORS Hayashi,H., Abdollah,S., Qui,Y., Cai,J., Xu,Y.-Y., Grinnell,B.W., Richardson,M.A., Topper,J.N., Gimbrone,M.A. Jr., Wrana,J.L. and Falb,D. TITLE The MAD-Related protein, Smad7, Associates with the TGFb receptor and Functions as an Antagonist of TGFb Signalling JOURNAL Cell (1997) In press REFERENCE 2 (bases 1 to 3111) AUTHORS Topper,J.N., Cai,J., Qui,Y., Anderson,K.R., Xu,Y.-Y., Deeds,J.D., Feeley,R., Gimeno,C.J., Woolf,E.A., Tayber,O., Mays,G.G., Sampson,B.A., Schoen,F.J., Gimbrone,M.A. Jr. and Falb,D. TITLE Vascular MADs: two novel MAD-related genes selectively inducible by flow in human vascular endothelium JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (17), 9314-9319 (1997) MEDLINE 97404392 REFERENCE 3 (bases 1 to 3111) AUTHORS Falb,D. TITLE Direct Submission JOURNAL Submitted (25-JUN-1997) Millennium Pharmaceuticals, 640 Memorial Drive, Cambridge, MA 02139, USA FEATURES Location/Qualifiers source 1..3111 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="18" gene 1..3111 /gene="SMAD7" CDS 296..1576 /gene="SMAD7" /codon_start=1 /product="MAD-related gene SMAD7" /db_xref="PID:g2252822" /translation="MFRTKRSALVRRLWRSRAPGGEDEEEGAGGGGGGGELRGEGATD SRAHGAGGGGPGRAGCCLGKAVRGAKGHHHPHPPAAGAGAAGGAEADLKALTHSVLKK LKERQLELLLQAVESRGGTRTACLLLPGRLDCRLGPGAPAGAQPAQPPSSYSLPLLLC KVFRWPDLRHSSEVKRLCCCESYGKINPELVCCNPHHLSRLCELESPPPPYSRYPMDF LKPTADCPDAVPSSAETGGTNYLAPGGLSDSQLLLEPGDRSHWCVVAYWEEKTRVGRL YCVQEPSLDIFYDLPQGNGFCLGQLNSDNKSQLVQKVRSKIGCGIQLTREVDGVWVYN RSSYPIFIKSATLDNPDSRTLLVHKVFPGFSIKAFDYEKAYSLQRPNDHEFMQQPWTG FTVQISFVKGWGQCYTRQFISSCPCWLEVIFNSR" BASE COUNT 648 a 862 c 882 g 719 t ORIGIN 1 ggcacgagcg gagagccgcg cagggcgcgg gccgcgcggg gtggggcagc cggagcgcag 61 gcccccgatc cccggcgggc gcccccgggc ccccgcgcgc gccccggcct ccgggagact 121 ggcgcatgcc acggagcgcc cctcgggccg ccgccgctcc tgcccgggcc cctgctgctg 181 ctgctgtcgc ctgcgcctgc tgccccaact cggcgcccga cttcttcatg gtgtgcggag 241 gtcatgttcg ctccttagca ggcaaacgac ttttctcctc gcctcctcgc cccgcatgtt 301 caggaccaaa cgatctgcgc tcgtccggcg tctctggagg agccgtgcgc ccggcggcga 361 ggacgaggag gagggcgcag ggggaggtgg aggaggaggc gagctgcggg gagaaggggc 421 gacggacagc cgagcgcatg gggccggtgg cggcggcccg ggcagggctg gatgctgcct 481 gggcaaggcg gtgcgaggtg ccaaaggtca ccaccatccc cacccgccag ccgcgggcgc 541 cggcgcggcc gggggcgccg aggcggatct gaaggcgctc acgcactcgg tgctcaagaa 601 actgaaggag cggcagctgg agctgctgct ccaggccgtg gagtcccgcg gcgggacgcg 661 caccgcgtgc ctcctgctgc ccggccgcct ggactgcagg ctgggcccgg gggcgcccgc 721 cggcgcgcag cctgcgcagc cgccctcgtc ctactcgctc cccctcctgc tgtgcaaagt 781 gttcaggtgg ccggatctca ggcattcctc ggaagtcaag aggctgtgtt gctgtgaatc 841 ttacgggaag atcaaccccg agctggtgtg ctgcaacccc catcacctta gccgactctg 901 cgaactagag tctccccccc ctccttactc cagatacccg atggattttc tcaaaccaac 961 tgcagactgt ccagatgctg tgccttcctc cgctgaaaca gggggaacga attatctggc 1021 ccctgggggg ctttcagatt cccaacttct tctggagcct ggggatcggt cacactggtg 1081 cgtggtggca tactgggagg agaagacgag agtggggagg ctctactgtg tccaggagcc 1141 ctctctggat atcttctatg atctacctca ggggaatggc ttttgcctcg gacagctcaa 1201 ttcggacaac aagagtcagc tggtgcagaa ggtgcggagc aaaatcggct gcggcatcca 1261 gctgacgcgg gaggtggatg gtgtgtgggt gtacaaccgc agcagttacc ccatcttcat 1321 caagtccgcc acactggaca acccggactc caggacgctg ttggtacaca aggtgttccc 1381 cggtttctcc atcaaggctt tcgactacga gaaggcgtac agcctgcagc ggcccaatga 1441 ccacgagttt atgcagcagc cgtggacggg ctttaccgtg cagatcagct ttgtgaaggg 1501 ctggggtcag tgctacaccc gccagttcat cagcagctgc ccgtgctggc tagaggtcat 1561 cttcaacagc cggtagccgc gtgcggaggg gacagagcgt gagctgagca ggccacactt 1621 caaactactt tgctgctaat attttcctcc tgagtgcttg cttttcatgc aaactctttg 1681 gtcgtttttt ttttgtttgt tggttggttt tcttcttctc gtcctcgttt gtgttctgtt 1741 ttgtttcgct ctttgagaaa tagcttatga aaagaattgt tgggggtttt tttggaagaa 1801 ggggcaggta tgatcggcag gacaccctga taggaagagg ggaagcagaa atccaagcac 1861 caccaaacac agtgtatgaa ggggggcggt catcatttca cttgtcagga gtgtgtgtga 1921 gtgtgagtgt gcggctgtgt gtgcacgcgt gtgcaggagc ggcagatggg gagacaacgt 1981 gctctttgtt ttgtgtctct tatggatgtc cccagcagag aggtttgcag tcccaagcgg 2041 tgtctctcct gccccttgga cacgctcagt ggggcagagg cagtacctgg gcaagctggc 2101 ggctggggtc ccagcagctg ccaggagcac ggctctgtcc ccagcctggg aaagcccctg 2161 cccctcctct ccctcatcaa ggacacgggc ctgtccacag gcttctgagc agcgagcctg 2221 ctagtggccg aaccagaacc aattattttc atccttgtct tattcccttc ctgccagccc 2281 ctgccattgt agcgtctttc ttttttggcc atctgctcct ggatctccct gagatgggct 2341 tcccaagggc tgccggggca gccccctcac agtattgctc acccagtgcc ctctcccctc 2401 agcctctccc ctgcctgccc tggtgacatc aggtttttcc cggacttaga aaaccagctc 2461 agcactgcct gctcccatcc tgtgtgttaa gctctgctat taggccagca agcggggatg 2521 tccctgggag ggacatgctt agcagtcccc ttccctccaa gaaggatttg gtccgtcata 2581 acccaaggta ccatcctagg ctgacaccta actcttcttt catttcttct acaactcata 2641 cactcgtatg atacttcgac actgttctta gctcaatgag catgtttaga ctttaacata 2701 agctattttt ctaactacaa aggtttaaat gaacaagaga agcattctca ttggaaattt 2761 agcattgtag tgctttgaga gagaaaggac tcctgaaaaa aaacctgaga tttattaaag 2821 aaaaaaatgt attttatgtt atatataaat atattattac ttgtaaatat aaagacgttt 2881 tataagcatc attatttatg tattgtgcaa tgtgtataaa caagaaaaat aaagaaaaga 2941 tgcactttgc tttaatataa atgcaaataa caaatgccaa attaaaaaag ataaacacaa 3001 gattggtgtt ttttcctatg ggtgttatca cctagctgaa tgtttttcta aaggagttta 3061 tgttccatta aacgattttt aaaatgtaca cttgaaaaaa aaaaaaaaaa a // LOCUS AF010309 1670 bp mRNA PRI 09-JAN-1998 DEFINITION Homo sapiens Pig3 (PIG3) mRNA, complete cds. ACCESSION AF010309 NID g2754811 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1670) AUTHORS Polyak,K., Xia,Y., Zweier,J.L., Kinzler,K.W. and Vogelstein,B. TITLE A model for p53-induced apoptosis JOURNAL Nature 389 (6648), 300-305 (1997) MEDLINE 97449378 REFERENCE 2 (bases 1 to 1670) AUTHORS Polyak,K., Xia,Y., Zweier,J.L., Kinzler,K.W. and Vogelstein,B. TITLE Direct Submission JOURNAL Submitted (27-JUN-1997) Oncology, Johns Hopkins Oncology Center, 424. N. Bond St., Baltimore, MD 21231, USA FEATURES Location/Qualifiers source 1..1670 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="2" /map="2p" /cell_line="DLD-1; colon cancer cell line" gene 1..1670 /gene="PIG3" CDS 528..1496 /gene="PIG3" /note="NADPH quinone oxidoreductase homolog; p53 induced" /codon_start=1 /product="Pig3" /db_xref="PID:g2754812" /translation="MLAVHFDKPGGPENLYVKEVAKPSPGEGEVLLKVAASALNRADL MQRQGQYDPPPGASNILGLEASGHVAELGPGCQGHWKIGDTAMALLPGGGQAQYVTVP EGLLMPIPEGLTLTQAAAIPEAWLTAFQLLHLVGNVQAGDYVLIHAGLSGVGTAAIQL TRMAGAIPLVTAGSQKKLQMAEKLGAAAGFNYKKEDFSEATLKFTKGAGVNLILDCIG GSYWEKNVNCLALDGRWVLYGLMGGGDINGPLFSKLLFKRGSLITSLLRSRDNKYKQM LVNAFTEQILPHFSTEGPQRLLPVLDRIYPVTEIQEAHSTWRPTRT" BASE COUNT 351 a 475 c 520 g 324 t ORIGIN 1 ccagccgtcc attccggtgg aggcagaggc agtcctgggg ctctggggct cgggctttgt 61 caccgggacc cgcagagcca gaaccactcg gcgccgctgg tgcatgggag gggagccggg 121 ccaggagtaa gtaactcata cgggcgccgg ggacccgggt cggctggggg cttccaactc 181 agagggagtg tgatttgcct gatcctcttc ggcgttgtcc tgctctgccg catccagccc 241 tgtaccgcca tcccacttcc cgccgttccc atctgtgttc cgggtgggat cggtctggag 301 gcggccgagg acttcccagg caggagctcg gggcggaggc gggtccgcgg cagaccaggg 361 cagcgaggcg ctggccggca gggggcgctg cggtgccagc ctgaggctgg ctgctccgcg 421 aggatacagc ggcccctgcc ctgtcctgtc ctgccctgcc ctgtcctgtc ctgccctgcc 481 ctgccctgtc ctgtcctgcc ctgccctgcc ctgtgtcctc agacaatatg ttagccgtgc 541 actttgacaa gccgggagga ccggaaaacc tctacgtgaa ggaggtggcc aagccgagcc 601 cgggggaggg tgaagtcctc ctgaaggtgg cggccagcgc cctgaaccgg gcggacttaa 661 tgcagagaca aggccagtat gacccacctc caggagccag caacattttg ggacttgagg 721 catctggaca tgtggcagag ctggggcctg gctgccaggg acactggaag atcggggaca 781 cagccatggc tctgctcccc ggtgggggcc aggctcagta cgtcactgtc cccgaagggc 841 tcctcatgcc tatcccagag ggattgaccc tgacccaggc tgcagccatc ccagaggcct 901 ggctcaccgc cttccagctg ttacatcttg tgggaaatgt tcaggctgga gactatgtgc 961 taatccatgc aggactgagt ggtgtgggca cagctgctat ccaactcacc cggatggctg 1021 gagctattcc tctggtcaca gctggctccc agaagaagct tcaaatggca gaaaagcttg 1081 gagcagctgc tggattcaat tacaaaaaag aggatttctc tgaagcaacg ctgaaattca 1141 ccaaaggtgc tggagttaat cttattctag actgcatagg cggatcctac tgggagaaga 1201 acgtcaactg cctggctctt gatggtcgat gggttctcta tggtctgatg ggaggaggtg 1261 acatcaatgg gcccctgttt tcaaagctac tttttaagcg aggaagtctg atcaccagtt 1321 tgctgaggtc tagggacaat aagtacaagc aaatgctggt gaatgctttc acggagcaaa 1381 ttctgcctca cttctccacg gagggccccc aacgtctgct gccggttctg gacagaatct 1441 acccagtgac cgaaatccag gaggcccata gtacatggag gccaacaaga acataggcaa 1501 gatcgtcctg gaactgcccc agtgaaggag gatgggggca ggacaggacg cggccacccc 1561 aggcctttcc agagcaaacc tggagaagat tcacaataga caggccaaga aacccggtgc 1621 ttcctccaga gccgtttaaa gctgatatga ggaaataaag agtgaactgg // LOCUS AF010312 1677 bp mRNA PRI 09-JAN-1998 DEFINITION Homo sapiens Pig7 (PIG7) mRNA, complete cds. ACCESSION AF010312 NID g2415299 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1677) AUTHORS Polyak,K., Xia,Y., Zweier,J.L., Kinzler,K.W. and Vogelstein,B. TITLE A model for p53-induced apoptosis JOURNAL Nature 389 (6648), 300-305 (1997) MEDLINE 97449378 REFERENCE 2 (bases 1 to 1677) AUTHORS Polyak,K., Xia,Y., Zweier,J.L., Kinzler,K.W. and Vogelstein,B. TITLE Direct Submission JOURNAL Submitted (27-JUN-1997) Oncology, Johns Hopkins Oncology Center, 424. N. Bond St., Baltimore, MD 21231, USA FEATURES Location/Qualifiers source 1..1677 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="DLD-1; colon cancer cell line" gene 1..1677 /gene="PIG7" CDS 80..766 /gene="PIG7" /note="p53 induced" /codon_start=1 /product="Pig7" /db_xref="PID:g2415300" /translation="MSVPGPYQAATGPSSAPSAPPSYEETVAVNSYYPTPPAPMPGPT TGLVTGPDGKGMNPPSYYTQPAPIPNNNPITVQTVYVQHPITFLDRPIQMCCPSCNKM IVSQLSYNAGALTWLSCGSLCLLGVHSGLLLHPLLRGCPAGRGPLLSQLQSSPGHLQA FVGLSQTWREPGAAGSPFHLSSSFTPGGGSALVVSPLQGAHLHVFFWGEYVAKLTNLQ TPEIAAWSRA" BASE COUNT 359 a 464 c 393 g 461 t ORIGIN 1 cacgcgcagc atagcagagt cgacactaga ggcatccaaa gaataccggc acgagcaggc 61 ggcgcgggcg gcggttaaaa tgtcggttcc aggaccttac caggcggcca ctgggccttc 121 ctccgcacca tccgcacctc catcctatga agagacagtg gctgttaaca gttattaccc 181 cactcctcca gctcccatgc ctgggccaac tacggggctt gtgacggggc ctgatgggaa 241 gggcatgaat cctccttcgt attataccca gccagcgccc atccccaata acaatccaat 301 taccgtgcag acggtctacg tgcagcaccc catcaccttt ttggaccgcc ctatccaaat 361 gtgttgtcct tcctgcaaca agatgatcgt gagtcagctg tcctataacg ccggtgctct 421 gacctggctg tcctgcggga gcctgtgcct gctgggggtg catagcggcc tgctgcttca 481 tccccttctg cgtggatgcc ctgcaggacg tggaccatta ctgtcccaac tgcagagctc 541 tcctgggcac ctacaagcgt ttgtaggact cagccagacg tggagggagc cgggtgccgc 601 aggaagtcct ttccacctct catccagctt cacgcctggt ggaggttctg ccctggtggt 661 ctcacctctc cagggggccc accttcatgt cttcttttgg ggggaatacg tcgcaaaact 721 aacaaatctc caaaccccag aaattgctgc ttggagtcgt gcataggact tgcaaagaca 781 ttccccttga gtgtcagttc cacggtttcc tgcctccctg agaccctgag tcctgccatc 841 taactgttga tcattgccct atccgaatat tttcctgtcg accccgggcc accagtggct 901 cttttttcct gcttccatgg gcctttctgg tggcagtctc aaactgagga agccacagtt 961 gcctcatttt tgaggctgtt ctccccagga gcttcggctg gaaccaggcc tttaggtggc 1021 cttaccattt atctctatat ccggctcttt cccgttccct ggatggacaa aaatcttgcc 1081 cttgacagga ctttaacagg gcttgggctt tgagattctg ttaacccgca ggacttcatt 1141 aggcacacaa gattcacctt aatttctcta aatttttttt tttttaaaat accaagggaa 1201 gggggctaat taacaaccca gtacaggaca tatccacaag ggtcggtaaa tggcatgcta 1261 ggaaaaatag gggccttgga tcttattcac tggccctgtc ttccccttgg tttctcttgt 1321 ggccagatct ttcagttgcc ccttttccat aacaggggat tttttttctt cataggagtt 1381 aattattatg ggaacagttt tttatggacc tcccttttgg tctggaaata ccttttcgaa 1441 cagaatttct tttttttaaa aaaaaacaga gatggggtct tactatgttg cccaggctgg 1501 tgtcgaactc ctgggctcaa gcgatccttc tgccttggcc tcccgaagtg ctgggattgc 1561 aggcataagc ttaccatgct gggcctgaac ataatttcaa gaggaggatt tataaaacca 1621 ttttctgtaa tcaaatgatt ggtgtcattt tcccatttgc acaatgtagt ctcactt // LOCUS AF010313 2177 bp mRNA PRI 09-JAN-1998 DEFINITION Homo sapiens Pig8 (PIG8) mRNA, complete cds. ACCESSION AF010313 NID g2415301 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2177) AUTHORS Polyak,K., Xia,Y., Zweier,J.L., Kinzler,K.W. and Vogelstein,B. TITLE A model for p53-induced apoptosis JOURNAL Nature 389 (6648), 300-305 (1997) MEDLINE 97449378 REFERENCE 2 (bases 1 to 2177) AUTHORS Polyak,K., Xia,Y., Zweier,J.L., Kinzler,K.W. and Vogelstein,B. TITLE Direct Submission JOURNAL Submitted (27-JUN-1997) Oncology, Johns Hopkins Oncology Center, 424. N. Bond St., Baltimore, MD 21231, USA FEATURES Location/Qualifiers source 1..2177 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="DLD-1; colon cancer cell line" gene 1..2177 /gene="PIG8" CDS 73..1029 /gene="PIG8" /note="p53 induced" /codon_start=1 /product="Pig8" /db_xref="PID:g2415302" /translation="MIWGHFSLLCVVDSLGGEEMADSVKTFLQDLARGIKDSIWGICT ISKLDARIQQKREEQRRRRASSVLAQRRPQSIERKQESEPRIVSRIFQCCAWNGGVFW FSLLLFYRVFIPVLQSVTARIIGDPSLHGDVWSWLGFFLTSIFSAVWVLPLFVLSKVV NAIWFQDIADLAFEVSGRKPHPFPSVSKIIADMLFNLLLQALFLIQGMFVSLFPIHLV GQLVSLLHMSLLYSLYCFEYRWFNKGIEMHQRLSNIERNWPYYFGFGLPWLFSQQCSP HILSVAASFLSSFLYSLSAPMKQRPLAKHISSSCAYSPWWSS" BASE COUNT 516 a 486 c 516 g 659 t ORIGIN 1 cggactggcc cgcggtgggc atggggcagg gccggagccg cggcggcgga gctgtggatc 61 cttcatgatg tgatgatttg gggacacttc tctctcctgt gtgtagttga tagtttgggt 121 ggtgaagaga tggctgacag tgtcaaaacc tttctccagg accttgccag aggaatcaaa 181 gactccatct ggggtatttg taccatctca aagctagatg ctcgaatcca gcaaaagaga 241 gaggagcagc gtcgaagaag ggcaagtagt gtcttggcac agagaagacc ccagagtata 301 gagcggaaac aagagagtga accacgtatt gttagtagaa ttttccagtg ttgtgcttgg 361 aatggtggag tgttctggtt cagtctcctc ttgttttatc gagtatttat tcctgtgctt 421 cagtcggtaa cagcccgaat tatcggtgac ccatcactac atggagatgt ttggtcgtgg 481 ctgggattct tcctcacgtc aattttcagt gctgtttggg tgctcccctt gtttgtgctt 541 agcaaagtgg tgaatgccat ttggtttcag gatatagctg acctggcatt tgaggtatca 601 gggaggaagc ctcacccatt ccctagtgtc agcaaaataa ttgctgacat gctcttcaac 661 cttttgctgc aggctctttt cctcattcag ggaatgtttg tgagtctctt tcccatccat 721 cttgtcggtc agctggttag tctcctgcat atgtcccttc tctactcact gtactgcttt 781 gaatatcgtt ggttcaataa aggaattgaa atgcaccagc ggttgtctaa catagaaagg 841 aattggcctt actactttgg gtttggtttg ccttggcttt tctcacagca atgcagtcct 901 catatattat cagtggctgc ctcttttcta tcctctttcc tttattcatt atcagcgcca 961 atgaagcaaa gacccctggc aaagcatatc tcttccagtt gcgcctattc tccttggtgg 1021 tcttcttaag caacagactc ttccacaaga cagtctacct gcagtcggcc ctgagcagct 1081 ctacttctgc agagaagttc ccttcaccgg catccgtcgc ctgccaaact gaaggctact 1141 gcaggtcact gagttgcctg ccatccaaag gggatgggcg ggattggaag aagctgtggc 1201 agctcttttc cctgttcacc tcccgcctgc cagggaaggc aggacccgct ctgccaaggg 1261 cccctctgcg tattcccttc tctctgagga attgaaattt ttgtctctgg tgcacgtaag 1321 gcagaatgtt ccctgacacc agtgtgtgga tttttaacat caccgtgagt ctgaaaggcc 1381 acaggttttt ctgcagctat tttctagcat ttgccagtcc ctgtgcctgg actgattgga 1441 acactttgtt tttctccctg tgccatttac ccttccacct ttccatcctg ccttctacca 1501 cccttggatg aatggatttt gtaattctag ctgttgtatt ttgtgaattt gttaattttg 1561 ttgtttttct gtgaaacaca tacattggat atgggaggta aaggagtgtc ccagttgctc 1621 ctggtcactc cctttatagc cattactgtc ttgtttcttg taactcaggt taggttttgg 1681 tctctcttgc tccactgcaa aaaaaaaaaa aaaaaaaaaa aaaagcctga agagatgaga 1741 taggaggaaa gaccttcaca gcccagatct gctgggtttt gaggagtgat tttctttcct 1801 ttcccttgaa ggggaaaaag ctattttcaa ttggtacatt taaagtcccc caactatggg 1861 gaggtaccaa ttctggacaa agtgccacta caacaacact aaacctgaac tttttcaact 1921 ccgttggtgg tgggaggcca gcgggcagaa atttactgtt ggccactgcc aggtctattt 1981 ccatatttca aaggaatatt gggtgctgca tataggaact gaaggggtca atgtattaaa 2041 cctgtgatta ggttgttttc ctgtcatttt tgagagacta aaattgtggg ggggcagatg 2101 ttcaaaatac ctggtacaag tttttaaaaa atggtcacaa ttaaacatga gctggtttcc 2161 caaataaaaa aaaaaaa // LOCUS AF010314 2326 bp mRNA PRI 09-JAN-1998 DEFINITION Homo sapiens Pig10 (PIG10) mRNA, complete cds. ACCESSION AF010314 NID g2415303 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2326) AUTHORS Polyak,K., Xia,Y., Zweier,J.L., Kinzler,K.W. and Vogelstein,B. TITLE A model for p53-induced apoptosis JOURNAL Nature 389 (6648), 300-305 (1997) MEDLINE 97449378 REFERENCE 2 (bases 1 to 2326) AUTHORS Polyak,K., Xia,Y., Zweier,J.L., Kinzler,K.W. and Vogelstein,B. TITLE Direct Submission JOURNAL Submitted (27-JUN-1997) Oncology, Johns Hopkins Oncology Center, 424. N. Bond St., Baltimore, MD 21231, USA FEATURES Location/Qualifiers source 1..2326 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="DLD-1; colon cancer cell line" gene 1..2326 /gene="PIG10" CDS 425..1975 /gene="PIG10" /note="p53 induced" /codon_start=1 /product="Pig10" /db_xref="PID:g2415304" /translation="MSVSVHENRKSRASSGSINIYLFHKSSYADSVLTHLNLLRQQRL FTDVLLHAGNRTFPCHRAVLAACSRYFEAMFSGGLKESQDSEVNFDNSIHPEVLELLL DYAYSSRVIHQLEGKCRNSLLGSLVTCWSFKDIRDACAEFLEKNLHPTNCLGMLLLSD AHQCTKLYELSWRMCLSNFQTIRKNEDFLQLPQDMVVQLLSSEELETEDERLVYESAI NWISYDLKKRYCYLPELLQTVTRALLPAIYLMENVAMEELITKQRKSKEIVEEAIRCK LKILQNDGVVTSLCARPRKTGHALFLLGGQTFMCDKLYLVDQKAKEIIPKADIPSPRK EFSACAIGCKVYITGGRGSENGVSKDVWVYDTLHEEWSKAAPMLVARFGHGSAELKHC LYVVGGHTAATGCLPASPSVSLKQVEHYDPTINKWTMAAPRPRRRYNCAQVVSAKLKL FAFGGTSVSHDKLPKVQCYDQCENRWTVPATCPQPWRIHSQASCPGGTQDFLLWGVIQ NFSACFCL" BASE COUNT 544 a 635 c 637 g 510 t ORIGIN 1 aggccggaga ggaggcggtg cggcggtggc cgtgcggaga cccggtccag acgcctggcg 61 gccgccggca cacaaggcgc tttctagctc cctcccccga gcgcacagcc cgcctccttc 121 cgcggcgcct gcagtggcac ggattgctct gccctaccgt gacgcgctcc ggagacgctc 181 tgcgggtcct ggacaccggg tccggcggcg tggggacgac agacggaggc gaacgcatcc 241 ggtagccggt ccgcgagcca tcgttcgggg cgcagtcctc tccccggctg gccctccttt 301 ctccggggca ttcgccaccg cttccctggg gctgagacga ccggttcgtc gcctccttgc 361 ccgtgaccgt cgctagaact cagttgtgcg ttgcggccag tcgccactgc tgagtggaag 421 caaaatgtca gtcagtgtgc atgagaaccg caagtccagg gccagcagcg gctccattaa 481 catctatctg tttcacaagt cctcctacgc tgacagcgtc ctcactcacc tgaatctttt 541 acgccagcag cgtctcttca ctgacgtcct tctccatgcc ggaaatagga ccttcccttg 601 ccaccgggca gtgctggctg catgcagtcg ctactttgag gccatgttca gtggtggcct 661 gaaagagagc caggacagtg aggtcaactt tgacaattcc atccacccag aagtcttgga 721 gctgctgctt gactatgcgt actcctcccg ggtcattcat caattggaag gaaaatgcag 781 aaattcgctc ctgggaagct tggtgacatg ctggagtttc aaggacatcc gggatgcatg 841 tgcagagttc ctggaaaaga acctgcatcc caccaactgc ctgggcatgc tgctgctgtc 901 tgatgcacac cagtgcacca agctgtacga actatcttgg agaatgtgtc tcagcaactt 961 ccaaaccatc aggaagaatg aagatttcct ccagctgccc caggacatgg tagtgcaact 1021 cttgtccagt gaagagctgg agacagagga tgaaaggctt gtgtacgagt ctgcaattaa 1081 ctggatcagc tatgacctga agaagcgcta ttgctacctc ccagaactgt tgcagacagt 1141 aacgcgggca cttctgccag ccatctatct catggagaat gtggccatgg aggaactcat 1201 caccaagcag agaaagagta aggaaattgt ggaagaggcc atcaggtgca aactaaaaat 1261 cctgcagaat gacggtgtgg taaccagcct ctgtgcccga cctcggaaaa ctggccatgc 1321 cctcttcctt ctgggaggac agactttcat gtgtgacaag ttgtatctgg tagaccagaa 1381 ggccaaagaa atcattccca aggctgacat tcccagccca agaaaagagt ttagtgcatg 1441 tgcgattggc tgcaaagtgt acattactgg ggggcggggg tctgaaaatg gggtctcgaa 1501 agatgtctgg gtttatgata ccctgcacga ggagtggtcc aaggctgccc ccatgctggt 1561 ggccaggttt ggccatggct ctgctgaact gaagcactgc ctgtatgtgg ttggggggca 1621 cacggccgca actggctgcc tcccggcctc cccctcagtc tctctaaagc aggtagaaca 1681 ttatgacccc acaatcaaca aatggaccat ggcggcccca cgtccgagaa ggcgttacaa 1741 ctgcgcacag gtagtgagtg ccaaacttaa gttatttgct ttcggaggta ccagtgtcag 1801 tcatgacaag ctccccaaag ttcagtgtta cgatcagtgt gaaaacaggt ggactgtacc 1861 ggccacctgt ccccagccct ggcgtataca cagccaagca agctgtcctg ggggaaccca 1921 ggatttttta ttatgggggg tgatacagaa tttctctgcc tgcttctgct tataaattcg 1981 caacagtgag acttaccagt ggaccaaagg tgggagatgt gacagcaaag cgcatgagct 2041 gccatgctgt tggcctctgg aaacaaactc ttacgtggtt ggaggatact ttgggcattc 2101 agcgatgcaa gactttggac tgctacgatc caacattaga cgtgtggaac agcatcacca 2161 ctgtcccgta ctcgctgatt cctactgcat tttgtcagca cctggaaaca tctgccttct 2221 taaatgcagt acattctaaa gagaagatga gcatgagctc actccatcac tcgatgagat 2281 aatatgagat ttctacttcg gagaggccaa gtctaatgaa gagaaa // LOCUS AF010315 2302 bp mRNA PRI 09-JAN-1998 DEFINITION Homo sapiens Pig11 (PIG11) mRNA, complete cds. ACCESSION AF010315 NID g2415305 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2302) AUTHORS Polyak,K., Xia,Y., Zweier,J.L., Kinzler,K.W. and Vogelstein,B. TITLE A model for p53-induced apoptosis JOURNAL Nature 389 (6648), 300-305 (1997) MEDLINE 97449378 REFERENCE 2 (bases 1 to 2302) AUTHORS Polyak,K., Xia,Y., Zweier,J.L., Kinzler,K.W. and Vogelstein,B. TITLE Direct Submission JOURNAL Submitted (27-JUN-1997) Oncology, Johns Hopkins Oncology Center, 424. N. Bond St., Baltimore, MD 21231, USA FEATURES Location/Qualifiers source 1..2302 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="DLD-1; colon cancer cell line" gene 1..2302 /gene="PIG11" CDS 1099..1632 /gene="PIG11" /note="p53 induced" /codon_start=1 /product="Pig11" /db_xref="PID:g2415306" /translation="MHTPEVASPSPGDRCREKSSPRPQTRSPITGIEWEDGWRVSIGT ARSGCCQVGPCQGSSPLSTLCLAVPAPRYSNMACTQQSGGNHHGYSGCPETLLGKRGR AEWEAGTLRKGPSPGICLLLPKPVPLPKELPPCCRALVWPRATTASHRHHCLPLLPTG GAANGQCLQAEAFNPPS" BASE COUNT 497 a 752 c 594 g 459 t ORIGIN 1 ctaaatcaag ctggagtcat gagggtagtg ggctaagtcg agggtccagc ctcttctgcc 61 aggaagccct tcttgctttt gagagagggc tgtgaccacc ccccatcctt ctccctacac 121 tcccagccaa cctagtgccc aagcagctaa acttggcttc cttctaatcc tggaaaaccc 181 tgtacccctc ctcctcaatc tggccctctc cacatgcaca ccctgagaac acacacagac 241 acacaacaca cacacataca cacccctgaa cacacacaca gacacacata cacccatgat 301 gtgagcaaac acacacacgt gcgccttcat agcccagcca aggcatcgca ggcagggtgt 361 gctgcctgag atggcacctc cctttcagcc attcttcaag aatgggccac acacagctag 421 aagtcctctc ccagctagaa gtcctgtccc actctcctgg cctgacaaga tgagctctcc 481 tgggaccttg ctctagggca ctctgcctct accctaggac actggaatgc cctgggagcc 541 ccctccctgc aaccagcctg agttcagccc cacggacaaa gggacacaca gcccccaatg 601 gagaccattg taagtggtgg ggctgggaga ggaggaacag aaggaaagcc atagcgctct 661 cttgcccctt ggcatgtacc ccaaggcctg atggccactg ggctcagcct gtcccccact 721 cctgcctgct tcccggtgag ctgcccccga cacgtgcagc ccgggctgcc tccagggtct 781 ggctgagtgg gatcaggtgg ccctccaact cagcacagga aataagtaga aacatttcag 841 caggccacct cccctcatct tccccgccct gtccagcgcc ctggcaaagg ctgacaactg 901 gctgtcttgg ggccgaacag ccctgcctgc tctgagggcc acagcctgtg ctgcataccc 961 accgcccagc ttctccctga gggcccacca gcctgtgctg catacccacc acccagcttc 1021 tccctgaggg cccaccagcc tgtgctgtac accccgttag tccctgatcc caaccttctc 1081 cctcctgcca gcacaccgat gcacacaccg gaagtggcga gcccaagccc tggggacagg 1141 tgtagggaga aaagcagccc caggcctcag actcgctctc ccatcactgg catagagtgg 1201 gaggatggct ggagggtgtc tataggtaca gcccgctctg gctgctgcca ggtgggcccc 1261 tgccaggggt cctcacccct gtccaccctg tgcctggctg tccctgcacc cagatacagc 1321 aacatggcct gtacccagca gagtggtggc aaccaccatg gttacagcgg atgccccgag 1381 actctgcttg gtaaacgtgg cagagcagaa tgggaggctg ggaccctgag gaagggcccc 1441 tctcctggca tctgtctctt gctacctaag cctgtgcctc tccctaaaga gctgcctccc 1501 tgctgccgag ccctggtctg gccacgagcc actactgcct cccacaggca ccactgcctc 1561 ccgctgctgc ccacaggtgg tgccgccaat gggcagtgcc tccaggccga agccttcaat 1621 cccccatctt gagccagggc ctaaatcctc ttaatagtga tggttggttt tgtcctccca 1681 ttaactgcag gtgggatttc cacctggggg aatgaggctt gcgttgttcg ggcgtctgct 1741 ggccctgaga catccagtct tccacactca actgtgggat gggagggtgg cgtggcttta 1801 ccccatggag gctgttccag ggctctgggc acacagctgt gctcacacaa aatactgggt 1861 ggcttggttt agagctaatt gtagtggaag cctgcaaggt tgaggggtga aggggagggg 1921 gcttgcaagg tccaggtaaa gatctggaaa gacagaacgt acagcttgga gggcaagggg 1981 gactctaaag tgcaaggaga tttacagttg ggaaaggagg cagtggcaga ggggttgagg 2041 gacaggggcc cttaagtcca gcgaggaaag ctcggtgtgg ggcccgctct acgctccgtt 2101 tggggtgacc tggaacgcct cttctcccag ctccctccag ccatcagcag cctcttgtca 2161 agcttctgcc tcgccccagt ctatccccaa ccccaaatca agaccacctt tcttcaacgg 2221 tcactattta ttctttgttc ctttttcttt tgtgtaagaa acattcacaa aaaccagtgc 2281 caaaaccatc aaaaaaaaaa aa // LOCUS AF010316 1729 bp mRNA PRI 09-JAN-1998 DEFINITION Homo sapiens Pig12 (PIG12) mRNA, complete cds. ACCESSION AF010316 NID g2415307 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1729) AUTHORS Polyak,K., Xia,Y., Zweier,J.L., Kinzler,K.W. and Vogelstein,B. TITLE A model for p53-induced apoptosis JOURNAL Nature 389 (6648), 300-305 (1997) MEDLINE 97449378 REFERENCE 2 (bases 1 to 1729) AUTHORS Polyak,K., Xia,Y., Zweier,J.L., Kinzler,K.W. and Vogelstein,B. TITLE Direct Submission JOURNAL Submitted (27-JUN-1997) Oncology, Johns Hopkins Oncology Center, 424. N. Bond St., Baltimore, MD 21231, USA FEATURES Location/Qualifiers source 1..1729 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="DLD-1; colon cancer cell line" gene 1..1729 /gene="PIG12" CDS 10..471 /gene="PIG12" /note="microsomal glutathione transferase homolog; p53 induced" /codon_start=1 /product="Pig12" /db_xref="PID:g2415308" /translation="MPAHSLVMSSPALPAFLLCSTLLVIKMYVVAIITGQVRLRKKAF ANPEDALRHGGGPQYCRSDPDVERCLRAHRNDMETIYPFLFLGFVYSFLGPNPFVAWM HFLVFLVGRVAHTVAYLGKLRAPIRSVTYTLAQLPCASMALQILWEAARHL" BASE COUNT 323 a 489 c 486 g 431 t ORIGIN 1 tggccagaga tgcctgccca cagcctggtg atgagcagcc cggccctccc ggccttcctg 61 ctctgcagca cgctgctggt catcaagatg tacgtggtgg ccatcatcac gggccaagtg 121 aggctgcgga agaaggcctt tgccaacccc gaggatgccc tgagacacgg aggaggcccc 181 cagtattgca ggagcgaccc cgacgtggaa cgctgcctca gggcccaccg gaacgacatg 241 gagaccatct accccttcct tttcctgggc ttcgtctact cctttctggg tcctaaccct 301 tttgtcgcct ggatgcactt cctggtcttc ctcgtgggcc gtgtggcaca caccgtggcc 361 tacctgggga agctgcgggc acccatccgc tccgtgacct acaccctggc ccagctcccc 421 tgcgcctcca tggctctgca gatcctctgg gaagcggccc gccacctgtg accagcagct 481 gatgcctcct tggccaccag accatgggcc aagagccgcc gtggctatac ctggggactt 541 gatgttcctt ccagattgtg gtgtgggccc tgagtcctgg tttcctggca gcctgctgcg 601 cgtgtgggtc tctgggcaca gtgggcctgt gtgtgtgccc gtgtgtgtgt atgtgtgtgt 661 gtatgtttct tagccccttg gattcctgca cgaagtggct gatgggaacc atttcaagac 721 agattgtgaa gattgataga aaatccttca gctaaagtaa cagagcatca aaaacatcac 781 tccctctccc tccctaacag tgaaaagaga gaagggagac tctatttaag attcccaaac 841 ctaatgatca tctgaatccc gggctaagaa tgcagacttt tcagactgac cccagaaatt 901 ctggcccagc caatctagag gcaagcctgg ccatctgtat tttttttttc caagacagag 961 tcttgctctc gttgcccaag ctggagtgaa gtggtacaat ctggctcact gcagcctccg 1021 cctcccgggt tcaagcgatt ctcccgcctc agcctcctga gtagctggga ttacaggcgc 1081 gtatcaccat acccagctaa tttttgtatt tttagtagag acgggttcac catgttgccc 1141 aggagggtct cgaactcctg gcctcaagtg atccacgcct cggcctccca aagtgctggg 1201 atgacaggca tgaatcactg tgctcagcca ccatctggag tttaaaagga cctcccatgt 1261 gagtccctgt gtggccaggc cagggacccc tgccagttct atgtggaagc aaggctgggg 1321 tcttgggttc ctgtatggtg gaagctgggt gagccaagga cagggctggc tcctctgccc 1381 ccgctgacgc ttcccttgcc gttggctttg gatgtctttg ctgcagtctt ctctctggct 1441 caggtgtggg tgggaggggc ccacaggaag ctcagccttc tcctcccaag gtttgagtcc 1501 ctccaaaggg cagtgggtgg aggaccggga gctttgggtg accagccact caaaggaact 1561 ttctggtccc ttcagtatct tcaaggtttg gaaactgcaa atgtcccctg atggggaatc 1621 ctgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgttt tctcctagac 1681 ccgtgacctg agatgtgtga tttttagtca ttaaatggaa gtgtctgcc // LOCUS AF011466 1734 bp mRNA PRI 01-JAN-1998 DEFINITION Homo sapiens G protein-coupled receptor Edg-4 mRNA, complete cds. ACCESSION AF011466 NID g2735848 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1734) AUTHORS An,S. TITLE Human Edg-4 cDNA JOURNAL Unpublished REFERENCE 2 (bases 1 to 1734) AUTHORS An,S. TITLE Direct Submission JOURNAL Submitted (28-JUN-1997) Medicine, UCSF, 533 Parnassus Ave., Rm Ub8, San Francisco, CA 94143-0711, USA FEATURES Location/Qualifiers source 1..1734 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="ovarian tumor" /cell_line="NbHOT" CDS 85..1233 /note="similar to LPA receptor" /codon_start=1 /product="G protein-coupled receptor Edg-4" /db_xref="PID:g2735849" /translation="MVIMGQCYYNETIGFFYNNSGKELSSHWRPKDVVVVALGLTVSV LVLLTNLLVIAAIASNRRFHQPIYYLLGNLAAADLFAGVAYLFLMFHTGPRTARLSLE GWFLRQGLLDTSLTASVATLLAIAVERHRSVMAVQLHSRLPRGRVVMLIVGVWVAALG LGLLPAHSWHCLCALDRCSRMAPLLSRSYLAVWALSSLLVFLLMVAVYTRIFFYVRRR VQRMAEHVSCHPRYRETTLSLVKTVVIILGAFVVCWTPGQVVLLLDGLGCESCNVLAV EKYFLLLAEANSLVNAAVYSCRDAEMRRTFRRLLCCACLRQSTRESVHYTSSAQGGAS TRIMLPENGHPLMTPPFSYLELQRYAASNKSTAPDDLWVLLAQPNQQD" BASE COUNT 302 a 543 c 506 g 383 t ORIGIN 1 ggcacgaggc gccgggccat gggcctcgag cccgccccga acccccgcga gcccgccttg 61 tctgcggcgt gactggaggc ccagatggtc atcatgggcc agtgctacta caacgagacc 121 atcggcttct tctataacaa cagtggcaaa gagctcagct cccactggcg gcccaaggat 181 gtggtcgtgg tggcactggg gctgaccgtc agcgtgctgg tgctgctgac caatctgctg 241 gtcatagcag ccatcgcctc caaccgccgc ttccaccagc ccatctacta cctgctcggc 301 aatctggccg cggctgacct cttcgcgggc gtggcctacc tcttcctcat gttccacact 361 ggtccccgca cagcccgact ttcacttgag ggctggttcc tgcggcaggg cttgctggac 421 acaagcctca ctgcgtcggt ggccacactg ctggccatcg ccgtggagcg gcaccgcagt 481 gtgatggccg tgcagctgca cagccgcctg ccccgtggcc gcgtggtcat gctcattgtg 541 ggcgtgtggg tggctgccct gggcctgggg ctgctgcctg cccactcctg gcactgcctc 601 tgtgccctgg accgctgctc acgcatggca cccctgctca gccgctccta tttggccgtc 661 tgggctctgt cgagcctgct tgtcttcctg ctcatggtgg ctgtgtacac ccgcattttc 721 ttctacgtgc ggcggcgagt gcagcgcatg gcagagcatg tcagctgcca cccccgctac 781 cgagagacca cgctcagcct ggtcaagact gttgtcatca tcctgggggc gttcgtggtc 841 tgctggacac caggccaggt ggtactgctc ctggatggtt taggctgtga gtcctgcaat 901 gtcctggctg tagaaaagta cttcctactg ttggccgagg ccaactcact ggtcaatgct 961 gctgtgtact cttgccgaga tgctgagatg cgccgcacct tccgccgcct tctctgctgc 1021 gcgtgcctcc gccagtccac ccgcgagtct gtccactata catcctctgc ccagggaggt 1081 gccagcactc gcatcatgct tcccgagaac ggccacccac tgatgactcc accctttagc 1141 taccttgaac ttcagcggta cgcggcaagc aacaaatcca cagcccctga tgacttgtgg 1201 gtgctcctgg ctcaacccaa ccaacaggac tgactgactg gcaggacaag gtctggcatg 1261 gcacagcacc actgccaggc ctccccaggc acaccactct gcccagggaa tgggggcttt 1321 gggtcatctc ccactgcctg ggggagtcag atggggtgca ggaatctggc tcttcagcca 1381 tctcaggttt agggggtttg taacagacat tattctgttt tcactgcgta tccttggtaa 1441 gccctgtgga ctggttcctg ctgtgtgatg ctgagggttt taaggtgggg agagataagg 1501 gctctctcgg gccatgctac ccggtatgac tgggtaatga ggacagactg tggacacccc 1561 atctacctga gtctgattct ttagcagcag agactgaggg gtgcagagtg tgagctggga 1621 aaggtttgtg gctccttgca gcctccaggg actggcctgt ccccaataga attgaagcag 1681 tccacgggga ggggatgata caaggagtaa acctttcttt acactcaaaa aaaa // LOCUS AF011757 405 bp mRNA PRI 04-AUG-1997 DEFINITION Homo sapiens RAGE binding protein (P12) mRNA, complete cds. ACCESSION AF011757 NID g2293532 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 405) AUTHORS Li,J.F.,., Hofmann,M.A., Drury,S., Qu,X. and Schmidt,A.M. TITLE Functional identification of a novel ligand for RAGE JOURNAL Unpublished REFERENCE 2 (bases 1 to 405) AUTHORS Li,J.F.,., Hofmann,M.A., Drury,S., Qu,X. and Schmidt,A.M. TITLE Direct Submission JOURNAL Submitted (30-JUN-1997) Physiology, Columbia University, 630 W. 168th St., New York, NY 10032, USA FEATURES Location/Qualifiers source 1..405 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="lung" gene 1..405 /gene="P12" CDS 1..279 /gene="P12" /codon_start=1 /product="RAGE binding protein" /db_xref="PID:g2293533" /translation="MTKLEDHLEGIINIFHQYSVRVGHFDTLNKRELKQLITKELPKT LQNTKDQPTIDKIFQDLDADKDGAVSFEEFVVLVSRVLKTAHIDIHKE" BASE COUNT 136 a 103 c 87 g 79 t ORIGIN 1 atgactaagc tggaggacca cctggaggga atcatcaaca tcttccacca gtactccgtt 61 cgggtggggc atttcgacac cctcaacaag cgtgagctga agcagctgat cacaaaggaa 121 cttcccaaaa ccctccagaa caccaaagat caacctacca ttgacaaaat attccaagac 181 ctggatgccg ataaagacgg agccgtcagc tttgaggaat tcgtagtcct ggtgtccagg 241 gtgctgaaaa cagcccacat agatatccac aaagagtagg aagctctttc cagcaatgtc 301 cccaagaaga cttacccttc tcctccctga ggctgggtta cccgagggaa gagagaatta 361 ataaacgtac tttggcaaag ttcttagcaa aaaaaaaaaa aaaaa // LOCUS AF011792 1603 bp mRNA PRI 03-SEP-1997 DEFINITION Homo sapiens cell cycle progression 2 protein (CPR2) mRNA, complete cds. ACCESSION AF011792 NID g2352901 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1603) AUTHORS Edwards,M.C., Liegeois,N., Horecka,J., DePinho,R.A., Sprague,G.F., Tyers,M. and Elledge,S.J. TITLE Human CPR (Cell cycle Progression Restoration) genes impart a Far- phenotype on yeast cells JOURNAL Unpublished REFERENCE 2 (bases 1 to 1603) AUTHORS Edwards,M.C. and Elledge,S.J. TITLE Direct Submission JOURNAL Submitted (01-JUL-1997) Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..1603 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" /map="7p13-p14" gene 1..1603 /gene="CPR2" CDS 175..1257 /gene="CPR2" /note="leucine rich; imparts Far- phenotype on yeast" /codon_start=1 /product="cell cycle progression 2 protein" /db_xref="PID:g2352902" /translation="MMKVGHLSEPLMNRLEDKCLELVEHFGPNELRKVLVMLAAQSRR SVPLLRAISYHLVQKPFSLTKDVLLDVAYAYGKLSFHQTQVSQRLATDLLSLMPSLTS GEVAHCAKSFALLKWLSLPLFEAFAQHVLNRAQDITLPHLCSVLLAFARLNFHPDQED QFFSLVHEKLGSELPGLEPALQVDLVWALCVLQQAREAELQAVLHPEFHIQFLGGKSQ KDQNTFQKLLHINATALLEYPEYSGPLLPASAVPLGPQPLTGRRPPCKRSCKTLKGLL GSADKGSLEVATQYGWVLDSEVLLDSDGEFLPVRDFVAPHLAQPTGSQSPPPGSKRLA FLRWEFPNFNSRRRTCWVALFWPGDT" BASE COUNT 331 a 477 c 466 g 329 t ORIGIN 1 gaattcggtc cgctggcgca tgcggaagct caagtacaag cacctggcct tcctggcaga 61 gtcctgtgcc accctctcac aggagcagca ctcgcaggag ctgctggctg agctgctcac 121 acactggaaa ggcgttggac agaaattgaa gattcccaca cattagtgac cgtcatgatg 181 aaggtgggac acctctcgga gccactaatg aaccgcctgg aagacaagtg cctggagttg 241 gtggagcact ttggccccaa tgagctgcgg aaggtgctgg tgatgctggc agctcagagc 301 cggcggtccg tgcccttgct gcgggccatc tcctaccacc tggtgcagaa gcccttctct 361 ctgacgaaag atgtgctctt ggacgtggcc tatgcctatg gcaaactcag ctttcaccag 421 acccaggtgt cccagcgcct ggccaccgac ctgctatccc tcatgcccag cctgacttct 481 ggtgaggtgg cccactgtgc caagtccttc gccttactca agtggctcag cctgcccctg 541 tttgaggcct ttgcccagca cgtcctgaac agagcgcagg acatcaccct gccccacctg 601 tgcagcgtac ttctggcttt tgcgcgtctg aacttccatc cagaccaaga ggatcagttc 661 ttcagcctgg tacatgagaa gctggggtca gagctgccag gcctggagcc agccctgcag 721 gtggacctgg tgtgggccct gtgtgtgctg cagcaggcac gggaagcaga gctgcaagcc 781 gtcctccacc ctgaatttca catccaattt ctagggggca agtctcagaa ggatcagaac 841 accttccaga agctgctcca catcaacgcc actgccctgc tggagtaccc cgagtactcg 901 ggtccccttc tgcctgcctc ggctgtgccc ctgggccctc agcccttgac aggaaggaga 961 cccccctgca aaaggagctg caagacgctg aaggggctgc tggggagcgc cgacaagggc 1021 agcctcgagg tggccacgca gtatggctgg gtgctggatt ctgaggtgct gctggacagt 1081 gacggcgagt ttctgcccgt aagggacttt gtggcacctc accttgccca gccaactggg 1141 agccagtcac cacctccagg gtctaagagg ctagcgttct tgcggtggga gttccccaac 1201 ttcaacagcc gaagaaggac ttgctgggtc gctttgttct ggcccggcga cacatagtgg 1261 ctgcaggctt cctgatagtg gacgtcccat tctatgagtg gctggaactc aagtctgaat 1321 ggcagaaagg cgcctacctc aaggacaaga tgcgcaaagc ggtggctgag gagctggcca 1381 agtgacttgt gccagcagca tggactgcgt gcctctccgc cggaggtcta gctgtgggcg 1441 gccaagaagg gtcacccttg aggacaaacc tctgtgcagg accttggcca ctctgaggga 1501 cagaacgtcc tcttgtgtat aataaacctt taattttggt gttggacccc tggggccttc 1561 ccaggcttgg tcaccctctg cactgtcaaa aaaaaaaaaa aaa // LOCUS AF011793 1701 bp mRNA PRI 03-SEP-1997 DEFINITION Homo sapiens DNJ3/CPR3 mRNA, complete cds. ACCESSION AF011793 NID g2352903 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1701) AUTHORS Edwards,M.C., Liegeois,N., Horecka,J., DePinho,R.A., Sprague,G.F., Tyers,M. and Elledge,S.J. TITLE Human CPR (Cell cycle Progression Restoration) genes impart a Far- phenotype on yeast cells JOURNAL Unpublished REFERENCE 2 (bases 1 to 1701) AUTHORS Edwards,M.C. and Elledge,S.J. TITLE Direct Submission JOURNAL Submitted (01-JUL-1997) Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..1701 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..1701 /gene="DNJ3/CPR3" CDS 13..1260 /gene="DNJ3/CPR3" /note="functionally rescues ydj1 mutant in yeast and imparts a Far- phenotype on yeast cells; Ydj1 homolog" /codon_start=1 /product="Dnj3/Cpr3" /db_xref="PID:g2352904" /translation="MANVADTKLYDILGVPAGASENELKKAYRKLAKEYHPDKNPQMQ ETNFKEISFAYEVLSNPEKRELYDRYGEQGLREGSGGGGWHGLIFSLTVFCGGLFGFM GNQSRSRNGRRRGEDMMHPLKVSLEDLYNGKTTKLQLSKNVLCSACSGQGGKSGAVQK CSACRGRGVRIMIRQLAPGMVQQMQSVCSDCNGEGEVINEKDRCKKCEGKKVIKEVKI LEVHVDKGMKHGQRITFTGEADQAPEWNPETLFFLLPGEKNMEVFQRDGNDLHMTYKI GLVEALCGFQFTLSHLDGRQIVVKYPPGKVIEPGCVRVVRGEGMPQYRNPFEKGGLYI KFDVQFPENNWINPDKLSELEDLLPSRPEVPNIIGETEEVELQEFDSTRGSGGGQRRE AYNDSSDEESSSHHGPGVQCAHQ" BASE COUNT 522 a 307 c 426 g 446 t ORIGIN 1 cggccggccg ccatggctaa cgtggctgac acgaagctgt acgacatcct gggcgttccc 61 gcgggcgcca gcgagaacga gctgaagaag gcatacagaa agttagccaa ggaatatcat 121 cctgataaga atccccaaat gcaggagaca aactttaaag aaataagttt tgcatatgaa 181 gtactatcaa atcctgagaa gcgtgagtta tatgacagat acggagagca aggtcttcgg 241 gaaggcagcg gcggaggtgg gtggcatgga ttgatatttt ctctcaccgt tttttgtggg 301 ggattgttcg gcttcatggg caatcagagt agaagtcgaa atggcagaag aagaggagag 361 gacatgatgc atccactcaa agtatcttta gaagatctgt ataatggcaa gacaaccaaa 421 ctacaactta gcaagaatgt gctctgtagt gcatgcagtg gccaaggcgg aaagtctgga 481 gctgtccaaa agtgtagtgc ttgtcgaggt cgaggtgtgc gcatcatgat cagacagctg 541 gctccaggga tggtacaaca gatgcagtct gtgtgctctg attgtaatgg tgaaggagag 601 gtaattaatg aaaaagaccg ctgtaaaaaa tgtgaaggga agaaggtgat taaagaagtc 661 aagattcttg aagtccacgt agacaaaggc atgaaacatg gacagagaat tacattcact 721 ggggaagcag accaggcccc agagtggaac ccggagacat tgttcttttt gctaccagga 781 gaaaagaaca tggaggtatt tcagagagat gggaatgatt tgcacatgac atataaaata 841 ggacttgttg aagctctatg tggatttcag ttcacattaa gccaccttga tggacgtcag 901 attgtggtga aatacccccc tggcaaagta attgaaccag ggtgtgttcg tgtagttcga 961 ggtgaaggga tgccgcagta tcgtaatccc tttgaaaaag gtgggcttta cataaagttt 1021 gatgtgcagt ttcctgaaaa caactggatc aacccagaca agctttctga actagaagat 1081 cttctgccat ctagaccgga agttcctaac ataattggag aaacagagga ggtagagctt 1141 caggaatttg atagcactcg aggctcagga ggtggtcaga ggcgtgaagc ctataatgat 1201 agctctgatg aagaaagcag cagccatcat ggacctggag tgcagtgtgc ccatcagtaa 1261 actctgcaaa caaattgcac aggtggattt tctttccaca tttgcctgat ttgttctcag 1321 caatccagct ggagtgtctt atcaatccag atgaactgag ggacatctgt tggtctatgt 1381 ataactttta aaattggtat agtatctaca gagtgtataa tttaaactaa ccacaaagct 1441 ttacatcttc attttgactg ttccatagca gaataaagca cttgaaagga aacaagactc 1501 cctttcacac atggattatt ataagtttca atcctggtat ctgtgcttga tttttatcag 1561 ttttgtgtag atttttatgt ttcatatttt aaatttaaat cccacattgt aaagtttgta 1621 caatttgtcc tgaagctttg tgtttggctg cacctgcata agctgctaca aatagaataa 1681 agaatttcat agcctgtaaa a // LOCUS AF011794 1856 bp mRNA PRI 03-SEP-1997 DEFINITION Homo sapiens cell cycle progression restoration 8 protein (CPR8) mRNA, complete cds. ACCESSION AF011794 NID g2352905 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1856) AUTHORS Edwards,M.C., Liegeois,N., Horecka,J., DePinho,R.A., Sprague,G.F., Tyers,M. and Elledge,S.J. TITLE Human CPR (Cell cycle Progression Restoration) genes impart a Far- phenotype on yeast cells JOURNAL Unpublished REFERENCE 2 (bases 1 to 1856) AUTHORS Edwards,M.C. and Elledge,S.J. TITLE Direct Submission JOURNAL Submitted (01-JUL-1997) Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..1856 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..1856 /gene="CPR8" CDS 13..1140 /gene="CPR8" /codon_start=1 /product="cell cycle progression restoration 8 protein" /db_xref="PID:g2352906" /translation="MLKRELERERLVTTALRGELQQLSGSQLHGKSDSPNVYTEKKEI AILRERLTELERKLTFEQQRSDLWERLYVEAKDQNGKQGTDGKKKGGRGSHRVKNKSK GTFLGSVKETFDAMKNSTKEFVRHHKEKIKQAKEDVKENLKKFSDSVKSTFRHFKDTT KNIFDEKGNKRFNATKEAAEKPRTVFSDYLHPQYKAPTENHSRPYYAKRWKEEKPVHF KEFRKNTNSKKCSPGHDCRENSHSFRKACSGVFDCAQQESMSLFNTVVIPIRMDEFRQ IIQRYMLKELDTFCRWNELDQFINKFFLNGVFIHDQKLFTDFVNDVKIILGNMKEYEV DNDGVFEKLDEYIYRHFFGHTFSPPYGPRSVYIKPCHYSSL" BASE COUNT 671 a 282 c 346 g 557 t ORIGIN 1 gaattcgcaa agatgctaaa gagagaactg gagagagaac gactagtaac tacggcttta 61 aggggggaac tccagcagtt aagtggtagt cagttacatg gcaagtcaga ttctcccaat 121 gtatatactg aaaaaaagga aatagcaatc ttacgggaaa gactcactga gctggaacgg 181 aagctaacct tcgaacagca gcgttctgat ttgtgggaaa gattgtatgt tgaggcaaaa 241 gatcaaaatg gaaaacaagg aacagatgga aaaaagaaag ggggcagagg aagccacagg 301 gttaaaaata agtcaaaggg aacatttttg ggttcagtta aggaaacatt tgatgccatg 361 aagaattcta ccaaggagtt tgtaaggcat cataaagaga aaattaagca ggctaaagaa 421 gatgtgaagg aaaatctgaa aaaattctca gattcagtta aatccacttt cagacacttt 481 aaagatacca ccaagaatat ctttgatgaa aagggtaata aaagatttaa tgctacaaaa 541 gaagcagctg aaaaaccaag aacagttttt agtgactatt tacatccaca gtataaggca 601 cctacagaaa accattcaag gccctactat gcaaaaagat ggaaggaaga aaagccagtt 661 cactttaaag aattcagaaa aaatacaaat tcaaagaaat gcagtcctgg gcatgattgt 721 agagaaaatt ctcattcttt cagaaaggct tgttctggtg tatttgattg tgctcaacaa 781 gagtccatga gcctttttaa cacagtggtg atccctataa ggatggatga atttagacag 841 ataattcaaa ggtacatgtt aaaagaactg gatacttttt gtcgctggaa cgaacttgat 901 cagttcatca ataagttttt cctaaacggt gtctttatac atgatcagaa gctcttcact 961 gactttgtta atgatgttaa gattatctta ggaaacatga aggaatatga agtagataat 1021 gatggagtat ttgagaagtt ggatgaatat atatatagac acttctttgg tcacactttt 1081 tcccctccat atggacccag gtcggtttac ataaaaccgt gtcattacag tagtttgtaa 1141 catttgtaga ttggatacga tttttatgat ttgatgagtt tcttgtaagg ttaccgtttc 1201 taagagttgt gctttatggc cactgagaga attcagaata aattgaaaga tggagtctaa 1261 aaattattag ctgttacaaa tggaacaatt tcattataac gtgatcactt tgacttgagc 1321 aaatggttta atttttatct taaaatcagt taagaatata taaaatccta cctttggcca 1381 agtttgtttc ttttcattat agtttatatg aaaagatcac cttaagtgaa attattttcc 1441 ttattttcct ttaatctttt atgtatttat tcacttctgg aagctaggaa tgagcaacac 1501 aaattttact ctgaagtcag aagagctcat atatataatt ctaatgtccc acctatgtcc 1561 attccatgta ccagcttagt tatatactag tcacataatt atctttgata aaggtagagg 1621 cacaaagagg caaactaaca agtcaaattc taatgtgtgt acttcataat aattttttat 1681 ccattttcat cttctttatc tttatattct gtaacatgaa acttacctaa tcttcaaatg 1741 ttagcttcat tttttacctt tgaaatactt aatctttctg aataaatata atggtctata 1801 aaaaaaaaaa aaaaaaaaaa aaaaaaaccg tcgaaaagcg gccgccaccg cgtgga // LOCUS AF012023 1420 bp mRNA PRI 11-DEC-1997 DEFINITION Homo sapiens integrin cytoplasmic domain associated protein (Icap-1a) mRNA, complete cds. ACCESSION AF012023 NID g2305237 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1420) AUTHORS Chang,D.D., Wong,C., Smith,H. and Liu,J. TITLE ICAP-1, a novel beta1 integrin cytoplasmic domain-associated protein, binds to a conserved and functionally important NPXY sequence motif of beta1 integrin JOURNAL J. Cell Biol. 138 (5), 1149-1157 (1997) MEDLINE 97428321 REFERENCE 2 (bases 1 to 1420) AUTHORS Chang,D.D. TITLE Direct Submission JOURNAL Submitted (01-JUL-1997) Medicine, UCLA, 10833 Le Conte Avenue, Los Angeles, CA 90095, USA FEATURES Location/Qualifiers source 1..1420 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="2" gene 1..1420 /gene="Icap-1a" CDS 169..771 /gene="Icap-1a" /function="binds integrin b1 cytoplasmic domain" /codon_start=1 /product="integrin cytoplasmic domain associated protein" /db_xref="PID:g2305238" /translation="MFRKGKKRHSSSSSQSSEISTKSKSVDSSLGGLSRSSTVASLDT DSTKSSGQSNNNSDTCAEFRIKYVGAIEKLKLSEGKGLEGPLDLINYIDVAQQDGKLP FVPPEEEFIMGVSKYGIKVSTSDQYDVLHRHALYLIIRMVCYDDGLGAGKSLLALKTT DASNEEYSLWVYQCNSLEQAQAICKVLSTAFDSVLTSEKP" BASE COUNT 439 a 291 c 338 g 352 t ORIGIN 1 gcggacgtgg gcaggagggc tggaaaagcc ggcgctggag cggaacggga gtagctgcct 61 gggcgccaaa ggccgcggca ctcccacgtg gaccccgaag tcccgaaccc ggggatgggc 121 ccgcggctgc gaggggatct tctctggatc aagcaatggt ggtgaaaaat gtttcgcaag 181 ggcaaaaaac gacacagtag tagcagttcc caaagtagcg aaatcagtac taagagcaag 241 tctgtggatt ctagccttgg gggtctttca cgatccagca ctgtggccag cctcgacaca 301 gattccacca aaagctcagg acaaagcaac aataattcag atacctgtgc agaatttcga 361 ataaaatatg ttggtgccat tgagaaactg aaactctccg agggaaaagg ccttgaaggg 421 ccattagacc tgataaatta tatagacgtt gcccagcaag atggaaagtt gccttttgtt 481 cctccggagg aagaatttat tatgggagtt tccaagtatg gcataaaagt atcaacatca 541 gatcaatatg atgttttgca caggcatgct ctctacttaa taatccggat ggtgtgttac 601 gatgacggtc tgggggcggg aaaaagctta ctggctctga agaccacaga tgcaagcaat 661 gaggaataca gcctgtgggt ttatcagtgc aacagcctgg aacaagcaca agccatttgc 721 aaggttttat ccaccgcttt tgactctgta ttaacatctg agaaaccctg aatcctgcaa 781 tcaagtagaa gtcaacttca tctgaaagtt cagctgtttt caaactgcaa tgctgaaatg 841 ttatgcaaat aatgaagtta tcccttgctc tagattttct gaagaaaatg gattgtgtaa 901 aatgctgatc atttgtttat taaaatgtgt cctattacac agtgagttaa ctctcaatga 961 agtcatctat tttctgggct aaaaaacttc atttgtcttt ttcaacttct aataagctta 1021 acctaagtgt cacgaagacg agatgtcaca gaggtccact cagtgacaca cactgaaggc 1081 ctgagggaag actgaggaca tgggctcagt ggtggcttcc cagtcatggt atcactggca 1141 tggacctctg tccggcagag gtgtggactg gagaccagga ttcatgctgg tctggaacaa 1201 tgacattgcc aacttaagac acacaaagca gattttcaga agtgtctggt caagataaca 1261 tgctggccaa ccacaattcc tagagttaag agaaccttaa aagattaccg ctcatgctaa 1321 aagtatgtaa agatcccatg tacagtatga tagtgtactt tttttaaagg actgtcaata 1381 tacaaaactt taaagattaa aaacattaaa aataaaaaaa // LOCUS AF012088 4215 bp mRNA PRI 04-DEC-1997 DEFINITION Homo sapiens eIF4G1 mRNA, complete cds. ACCESSION AF012088 NID g2660711 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4215) AUTHORS Imataka,H. and Sonenberg,N. TITLE Human eukaryotic translation initiation factor 4G (eIF4G) possesses two separate and independent binding sites for eIF4A JOURNAL Mol. Cell. Biol. 17 (12), 6940-6947 (1997) MEDLINE 98038763 REFERENCE 2 (bases 1 to 4215) AUTHORS Imataka,H. and Sonenberg,N. TITLE Direct Submission JOURNAL Submitted (01-JUL-1997) Biochemistry, McGill University, 3655 Drummond Street, Montreal, QC H3G 1Y6, Canada FEATURES Location/Qualifiers source 1..4215 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1..4215 /function="eukaryotic protein synthesis initiation factor" /codon_start=1 /product="eIF4G1" /db_xref="PID:g2660712" /translation="MSGARTASTPTPPQTGGGLEPQANGETPQVAVIVRPDDRSQGAI IADRPGLPGPEHSPSESQPSSPSPTPSPSPVLEPGSEPNLAVLSIPGDTMTTIQMSVE ESTPISRETGEPYRLSPEPTPLAEPILEVEVTLSKPVPESEFSSSPLQAPTPLASHTV EIHEPNGMVPSEDLEPEVESSPELAPPPACPSESPVPIAPTAQPEELLNGAPSPPAVD LSPVSEPEEQAKEVTASVAPPTIPSATPATAPSATSPAQEEEMEEEEEEEEGEAGEAG EAESEKGGEELLPPESTPIPANLSQNLEAAAATQVAVSVPKRRRKIKELNKKEAVGDL LDAFKEANPAVPEVENQPPAGSNPGPESEGSGVPPRPEEADETWDSKEDKIHNAENIQ PGEQKYEYKSDQWKPPNLEEKKRYDREFLLGFQFIFASMQKPEGLPHISDVVLDKANK TPLRPLDPTRLQGINCGPDFTPSFANLGRTTLSTRGPPRGGPGGELPRGPQAGLGPRR SQQGPRKEPRKIIATVLMTEDIKLNKAEKAWKPSSKRTAADKDRGEEDADGSKTQDLF RRVRSILNKLTPQMFQQLMKQVTQLAIDTEERLKGVIDLIFEKAISEPNFSVAYANMC RCLMALKVPTTEKPTVTVNFRKLLLNRCQKEFEKDKDDDEVFEKKQKEMDEAATAEER GRLKEELEEARDIARRRSLGNIKFIGELFKLKMLTEAIMHDCVVKLLKNHDEESLECL CRLLTTIGKDLDFEKAKPRMDQYFNQMEKIIKEKKTSSRIRFMLQDVLDLRGSNWVPR RGDQGPKTIDQIHKEAEMEEHREHIKVQQLMAKGSDKRRGGPPGPPISRGLPLVDDGG WNTVPISKGSRPIDTSRLTKITKPGSIDSNNQLFAPGGRLSWGKGSSGGSGAKPSDAA SEAARPATSTLNRFSALQQAVPTESTDNRRVVQRSSLSRERGEKAGDRGDRLERSERG GDRGDRLDRARTPATKRSFSKEVEERSRERPSQPEGLRKAASLTEDRDRGRDAVKREA ALPPVSPLKAALSEEELEKKSKAIIEEYLHLNDMKEAVQCVQELASPSLLFIFVRHGV ESTLERSAIAREHMGQLLHQLLCAGHLSTAQYYQGLYEILELAEDMEIDIPHVWLYLA ELVTPILQEGGVPMGELFREITKPLRPLGKAASLLLEILGLLCKSMGPKKVGTLWREA GLSWKEFLPEGQDIGAFVAEQKVEYTLGEESEAPGQRALPSEELNRQLEKLLKEGSSN QRVFDWIEANLSEQQIVSNTLVRALMTAVCYSAIIFETPLRVDVAVLKARAKLLQKYL CDEQKELQALYALQALVVTLEQPPNLLRMFFDALYDEDVVKEDAFYSWESSKDPAEQQ GKGVALKSVTAFFKWLREAEEESDHN" BASE COUNT 1093 a 1128 c 1211 g 783 t ORIGIN 1 atgtctgggg cccgcactgc ctccacaccc acccctcccc agacgggagg cggtctggag 61 cctcaagcta atggggagac gccccaggtt gctgtcattg tccggccaga tgaccggtca 121 cagggagcaa tcattgctga ccggccaggg ctgcctggcc cagagcatag cccttcagaa 181 tcccagcctt cgtcgccttc tccgacccca tcaccatccc cagtcttgga accggggtct 241 gagcctaatc tcgcagtcct ctctattcct ggggacacta tgacaactat acaaatgtct 301 gtagaagaat caacccccat ctcccgtgaa actggggagc catatcgcct ctctccagaa 361 cccactcctc tcgccgaacc catactggaa gtagaagtga cacttagcaa accggttcca 421 gaatctgagt tttcttccag tcctctccag gctcccaccc ctttggcatc tcacacagtg 481 gaaattcatg agcctaatgg catggtccca tctgaagatc tggaaccaga ggtggagtca 541 agcccagagc ttgctcctcc cccagcttgc ccctccgaat cccctgtgcc cattgctcca 601 actgcccaac ctgaggaact gctcaacgga gccccctcgc caccagctgt ggacttaagc 661 ccagtcagtg agccagagga gcaggccaag gaggtgacag catcagtggc gccccccacc 721 atcccctctg ctactccagc tacggctcct tcagctactt ccccagctca ggaggaggaa 781 atggaagaag aagaagaaga ggaagaagga gaagcaggag aagcaggaga agctgagagt 841 gagaaaggag gagaggaact gctcccccca gagagtaccc ctattccagc caacttgtct 901 cagaatttgg aggcagcagc agccactcaa gtggcagtat ctgtgccaaa gaggagacgg 961 aaaattaagg agctaaataa gaaggaggct gttggagacc ttctggatgc cttcaaggag 1021 gcgaacccgg cagtaccaga ggtggaaaat cagcctcctg caggcagcaa tccaggccca 1081 gagtctgagg gcagtggtgt gcccccacgt cctgaggaag cagatgagac ctgggactca 1141 aaggaagaca aaattcacaa tgctgagaac atccagcccg gggaacagaa gtatgaatat 1201 aagtcagatc agtggaagcc tccaaaccta gaggagaaaa aacgttacga ccgtgagttc 1261 ctgcttggtt ttcagttcat ctttgccagt atgcagaagc cagagggatt gccacatatc 1321 agtgacgtgg tgctggacaa ggccaataaa acaccactgc ggccactgga tcccactaga 1381 ctacaaggca taaattgtgg cccagacttc actccatcct ttgccaacct tggccggaca 1441 acccttagca cccgtgggcc cccaaggggt gggccaggtg gggagctgcc ccgtgggccg 1501 caggctggcc tgggaccccg gcgctctcag cagggacccc gaaaagaacc acgcaagatc 1561 attgccacag tgttaatgac cgaagatata aaactgaaca aagcagagaa agcctggaaa 1621 cccagcagca agcggacggc ggctgataag gatcgagggg aagaagatgc tgatggcagc 1681 aaaacccagg acctattccg cagggtgcgc tccatcctga ataaactgac accccagatg 1741 ttccagcagc tgatgaagca agtgacgcag ctggccatcg acaccgagga acgcctcaaa 1801 ggggtcattg acctcatttt tgagaaggcc atttcagagc ccaacttctc tgtggcctat 1861 gccaacatgt gccgctgcct catggcgctg aaagtgccca ctacggaaaa gccaacagtg 1921 actgtgaact tccgaaagct gttgttgaat cgatgtcaga aggagtttga gaaagacaaa 1981 gatgatgatg aggtttttga gaagaagcaa aaagagatgg atgaagctgc tacggcagag 2041 gaacgaggac gcctgaagga agagctggaa gaggctcggg acatagcccg gcggcgctct 2101 ttagggaata tcaagtttat tggagagttg ttcaaactga agatgttaac agaggcaata 2161 atgcatgact gtgtggtcaa actgcttaag aaccatgatg aagagtccct tgagtgcctt 2221 tgtcgtctgc tcaccaccat tggcaaagac ctggactttg aaaaagccaa gccccgaatg 2281 gatcagtatt tcaaccagat ggaaaaaatc attaaagaaa agaagacgtc atcccgcatc 2341 cgctttatgc tgcaggacgt gctggatctg cgagggagca attgggtgcc acgccgaggg 2401 gatcagggtc ccaagaccat tgaccagatc cataaggagg ctgagatgga agaacatcga 2461 gagcacatca aagtgcagca gctcatggcc aagggcagtg acaagcgtcg gggcggtcct 2521 ccaggccctc ccatcagccg tggacttccc cttgtggatg atggtggctg gaacacagtt 2581 cccatcagca aaggtagccg ccccattgac acctcacgac tcaccaagat caccaagcct 2641 ggctccatcg attctaacaa ccagctcttt gcacctggag ggcgactgag ctggggcaag 2701 ggcagcagcg gaggctcagg agccaagccc tcagacgcag catcagaagc tgctcgccca 2761 gctactagta ctttgaatcg cttctcagcc cttcaacaag cggtacccac agaaagcaca 2821 gataatagac gtgtggtgca gaggagtagc ttgagccgag aacgaggcga gaaagctgga 2881 gaccgaggag accgcctaga gcggagtgaa cggggagggg accgtgggga ccggcttgat 2941 cgtgcgcgga cacctgctac caagcggagc ttcagcaagg aagtggagga gcggagtaga 3001 gaacggccct cccagcctga ggggctgcgc aaggcagcta gcctcacgga ggatcgggac 3061 cgtgggcggg atgccgtgaa gcgagaagct gccctacccc cagtgagccc cctgaaggcg 3121 gctctctctg aggaggagtt agagaagaaa tccaaggcta tcattgagga atatctccat 3181 ctcaatgaca tgaaagaggc agtccagtgc gtgcaggagc tggcctcacc ctccttgctc 3241 ttcatctttg tacggcatgg tgtcgagtct acgctggagc gcagtgccat tgctcgtgag 3301 catatggggc agctgctgca ccagctgctc tgtgctgggc atctgtctac tgctcagtac 3361 taccaagggt tgtatgaaat cttggaattg gctgaggaca tggaaattga catcccccac 3421 gtgtggctct acctagcgga actggtaaca cccattctgc aggaaggtgg ggtgcccatg 3481 ggggagctgt tcagggagat tacaaagcct ctgagaccgt tgggcaaagc tgcttccctg 3541 ttgctggaga tcctgggcct cctgtgcaaa agcatgggtc ctaaaaaggt ggggacgctg 3601 tggcgagaag ccgggcttag ctggaaggaa tttctacctg aaggccagga cattggtgca 3661 ttcgtcgctg aacagaaggt ggagtatacc ctgggagagg agtcggaagc ccctggccag 3721 agggcactcc cctccgagga gctgaacagg cagctggaga agctgctgaa ggagggcagc 3781 agtaaccagc gggtgttcga ctggatagag gccaacctga gtgagcagca gatagtatcc 3841 aacacgttag ttcgagccct catgacggct gtctgctatt ctgcaattat ttttgagact 3901 cccctccgag tggacgttgc agtgctgaaa gcgcgagcga agctgctgca gaaatacctg 3961 tgtgacgagc agaaggagct acaggcgctc tacgccctcc aggcccttgt agtgacctta 4021 gaacagcctc ccaacctgct gcggatgttc tttgacgcac tgtatgacga ggacgtggtg 4081 aaggaggatg ccttctacag ttgggagagt agcaaggacc ccgctgagca gcagggcaag 4141 ggtgtggccc ttaaatctgt cacagccttc ttcaagtggc tccgtgaagc agaggaggag 4201 tctgaccaca actga // LOCUS AF012106 1355 bp mRNA PRI 02-NOV-1997 DEFINITION Homo sapiens DnaJ protein (HSPF2) mRNA, complete cds. ACCESSION AF012106 NID g2581951 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1355) AUTHORS Lloyd,S.E. and Thakker,R.V. TITLE Identification and characterisation of a novel human Dnaj gene, HSPF2, mapping to chromosome 11q13 JOURNAL Unpublished REFERENCE 2 (bases 1 to 1355) AUTHORS Lloyd,S.E. and Thakker,R.V. TITLE Direct Submission JOURNAL Submitted (01-JUL-1997) Molecular Endocrinology, Hammersmith Hospital, Du Cane Road, London W12 ONN, UK FEATURES Location/Qualifiers source 1..1355 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11q13" /tissue_type="placenta" gene 1..1355 /gene="HSPF2" CDS 761..1168 /gene="HSPF2" /codon_start=1 /product="DnaJ protein" /db_xref="PID:g2581952" /translation="MGPGEGQAKGDKGSPAPPQLHPDRDPGNPSLHSRFVELSEAYRV LSREQSRRSYDDQLRSGSPPKSPRTTVHDKSAHQTHSSWTPPNAQYWSQFHSVRPQGP QLRQQQHKQNKQVLGYCLLLMLAGMGLHYIAFR" BASE COUNT 298 a 362 c 416 g 279 t ORIGIN 1 gggaggcgtg agccaccgcg cctgtccagg aatcagtttt ttccaacaag tttttggact 61 ccactttgag aaatcctgac ctgcactatg cctgaaacac tccaaagggg gtgctgatct 121 ggagacagat ctgtcctggg acctgtttca gccctgtcag ctcctggtgg ctgagttgaa 181 gggtgtggcc caaaaggcag ggcaaagcta gcttctgggt tccttcagcc cctgcaaggt 241 ttgggactct gtccacaggt ccagacccag tacttattat gaactgttgg gggtgcatcc 301 tggtgccagc actgaggaag ttaaacgagc tttcttctcc aagtccaaag aggtacccgt 361 acccctgagg tggggagggg gagcaggaac tttgagccag tgggggtgtg cttcaaattc 421 ccaggagtag accagcatct ctggtgggtg ccacattttt tcagccctca tagggagtgg 481 caggtgctca atggtagggg aaaggctgtg ggcaggagcc aggggtcctg tctgatgggc 541 tgggcttggt tttcttgacc tgcctgcctg ctcttgagaa acctggacca gggcatatgg 601 tgaatgcagg gggctggcag gtggggaggg cttgaacaca ggaggaagca ggagtccatg 661 ctctctggcc ttctgaggag tgggcaggcc tgagtgaggg tcactgacca ggcagggcct 721 ctgggtatga ctggagccct ggtgggtgga tgtcatcagc atgggtcctg gggagggaca 781 ggcaaaggga gataagggga gtcctgcccc accccagctg cacccagacc gggaccctgg 841 gaacccaagc ctgcacagcc gctttgtgga gctgagcgag gcataccgtg tgctcagccg 901 tgagcagagc cgccgcagct atgatgacca gctccgctca ggtagtcccc caaagtctcc 961 acgaaccaca gtccatgaca agtctgccca ccaaacacac agctcctgga caccccccaa 1021 cgcacagtac tggtcccagt ttcacagcgt gaggccacag gggccccagt tgaggcagca 1081 gcaacacaaa caaaacaaac aagtgctggg gtactgcctc ctcctcatgc tggcgggcat 1141 gggcctgcac tacattgcct tcaggtgatg cctgttctcc ccggggtgat gggcaaaggg 1201 caggagggtg ggtgaattcc agtgtggtca gccagtcctc cacagcacct cgtggcccgt 1261 actcttgtgt ctctcaagct aaaactgctt gaagttggtc tctggacctc tccttactta 1321 ttaaaggcag cgctgcacca aaaaaaaaaa aaaaa // LOCUS AF012126 3467 bp mRNA PRI 05-FEB-1998 DEFINITION Homo sapiens zinc finger protein (ZNF198) mRNA, complete cds. ACCESSION AF012126 NID g2832227 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3467) AUTHORS Xiao,S., Nalabolu,S.R., Aster,J.C., Ma,J., Abruzzo,L., Jaffe,E.S., Stone,R., Weissman,S.M., Hudson,T.J. and Fletcher,J.A. TITLE FGFR1 is fused with a novel zinc-finger gene, ZNF198, in the t(8;13) leukaemia/lymphoma syndrome JOURNAL Nature Genet. 18 (1), 84-87 (1998) MEDLINE 98085877 REFERENCE 2 (bases 1 to 3467) AUTHORS Xiao,S. TITLE Direct Submission JOURNAL Submitted (01-JUL-1997) Pathology, Brigham and Women's Hospital, 75 Francis St., Boston, MA 02115, USA REFERENCE 3 (bases 1 to 3467) AUTHORS Xiao,S. TITLE Direct Submission JOURNAL Submitted (04-FEB-1998) Pathology, Brigham and Women's Hospital, 75 Francis St., Boston, MA 02115, USA REMARK Sequence update by submitter FEATURES Location/Qualifiers source 1..3467 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="13" /map="13q12" gene 1..3467 /gene="ZNF198" CDS 454..2727 /gene="ZNF198" /note="involved in stem cell leukemia/lymphoma syndrome" /codon_start=1 /product="zinc finger protein" /db_xref="PID:g2832228" /translation="MQSSPNGQFVAPSDIQLKCNYCKNSFCSKPEILEWENKVHQFCS KTCSDDYKKLHCIVTYCEYCQEEKTLHETVNFSGVKRPFCSEGCKLLYKQDFARRLGL RCVTCNYCSQLCKKGATKELDGVVRDFCSEDCCKKFQDWYYKAARCDCCKSQGTLKER VQWRGEMKHFCDQHCLLRFYCQQNEPNMTTQKGPENLHYDQGCQTSRTKMTGSAPPPS PTPNKEMKNKAVLCKPLTMTKATYCKPHMQTKSCQTDDTWRTEYVPVPIPVPVYIPVP MHMYSQNIPVPTTVPVPVPVPVFLPAPLDSSEKIPAAIEELKSKVSSDALDTELLTMT DMMSEDEGKTETTNINSVIIETDIIGSDLLKNSDPETQSSMPDVPYEPDLDIEIDFPR AAEELDMENEFLLPPVFGEEYEEQPRPRSKKKGAKRKAVSGYQSHDDSSDNSECSFPF KYTYGVNAWKHWVKTRQLDEDLLVLDELKSSKSVKLKEDLLSHTTAELNYGLAHFVNE IRRPNGENYAPDSIYYLCLGIQEYLCGSNRKDNIFIDPGYQTFEQELNKILRSWQPSI LPDGSIFSRVEEDYLWRIKQLGSHSPVALLNTLFYFNTKYFGLKTVEQHLRLSFGTVF RHWKKNPLTMENKACLRYQVSSLCGTDNEDKITTGKRKHEDDEPVFEQIENTANPSRC PVKMFECYLSKSPQNLNQRMDVFYLQPECSSSTDSPVWYTSTSLDRNTLENMLVRVLL VKDIYDKDNYELDEDTD" BASE COUNT 1117 a 615 c 687 g 1048 t ORIGIN 1 ggtgttggca gaggcaaaaa gtgtaatgaa aaattggtaa ttcttttcta gttaagttgt 61 agtcttaaaa attatctttt tatgtttcaa attttagaaa tatggaaaac tgacaacttg 121 tactggttgc cgaacacagt gcaggttttt tgatatgact cagtgtatag gtcctaatgg 181 atatatggag ccatattgtt caactgcttg tatgaacagt cacaagacaa aatatgcaaa 241 atcacaaagt ttgggaatta tttgccattt ttgtaagcga aactctttac ctcaatacca 301 agccacaatg cctgatggaa aactgtacaa cttttgcaat tccagttgtg tggctaaatt 361 tcagggtatt acccaaaggt tcaaacccgt gcataaagcc atgcttcatg agttaattca 421 tccaacaatt ctgctttatt ccaggctcta agtatgcagt catctccaaa tggccagttt 481 gtagcgccaa gtgatattca gttgaaatgc aactactgca aaaattcctt ttgttcaaaa 541 ccagaaatcc tggaatggga gaacaaagtg catcagttct gcagcaaaac ttgttcagat 601 gactataaga agttgcattg catagttaca tattgcgaat actgtcaaga ggagaagact 661 cttcatgaaa cagtaaattt ctctggcgtt aagagacctt tctgtagtga aggctgcaaa 721 ttattataca aacaggattt tgccagacgt ttaggattga gatgtgttac ttgcaactat 781 tgttctcagc tatgtaagaa gggagcaact aaagaactcg atggtgttgt gagagatttc 841 tgcagtgaag attgctgtaa aaaatttcag gattggtact acaaggctgc aaggtgtgac 901 tgttgtaaat ctcaaggaac tcttaaagag cgagttcagt ggcgtgggga aatgaaacat 961 ttctgtgatc aacattgctt actgcgtttc tactgtcaac aaaatgagcc caacatgaca 1021 actcagaaag gacctgaaaa cttacattat gatcagggtt gtcagacatc tcgaaccaaa 1081 atgacaggtt cagcaccacc cccttctcca acacctaaca aagagatgaa gaacaaagca 1141 gttctttgca aacctttaac aatgacaaaa gctacttact gtaaacctca catgcagacc 1201 aaatcttgtc agacagatga tacttggagg acagaatatg ttccagtgcc tatccctgtg 1261 cctgtgtata tcccagttcc tatgcacatg tacagtcaga atattcctgt tcctactaca 1321 gttcctgttc ctgtgccagt tcctgttttt ctgcctgctc cattggacag cagtgagaag 1381 attcctgcag caattgagga gctaaaaagc aaggtttctt cagatgctct tgatacagag 1441 ttgcttacaa tgacggatat gatgagtgaa gacgagggga aaacagagac aaccaacatc 1501 aacagtgtaa ttattgaaac agatataatt ggttcagacc ttttgaagaa ctctgaccca 1561 gagacacagt ccagcatgcc tgatgtacca tatgaaccag atttggatat cgaaatagat 1621 tttcccagag ctgctgagga gcttgatatg gaaaatgaat ttttattacc acctgttttt 1681 ggcgaagaat atgaggaaca gcccagacct cgatctaaaa aaaagggagc caagagaaag 1741 gctgtatcag gataccagtc tcatgatgat agttctgaca attcagaatg cagctttcct 1801 ttcaaatata cgtatggcgt aaatgcatgg aaacactggg tcaaaactag gcaacttgat 1861 gaagatcttc tggtattaga tgagttaaaa tcttctaaat cagtaaagtt aaaagaggat 1921 ctactctctc acaccacagc tgagcttaac tatgggttag ctcattttgt caatgagatc 1981 cgacggccaa atggagagaa ttatgcacct gacagcatct attacctttg ccttggaata 2041 caggagtatt tgtgtggaag taatcgaaaa gacaacatat ttattgatcc tggataccaa 2101 acatttgagc aagaattgaa taaaatactg cgaagctggc aaccaagcat acttccagat 2161 gggtcaatat tctctcgagt tgaagaagac tatctctgga ggataaaaca actaggatca 2221 cactctccag tagctcttct gaatacactg ttctacttta acactaagta ttttggcctg 2281 aaaacagtgg aacaacactt aagactttcc tttggcactg tgtttaggca ttggaaaaaa 2341 aatcctttaa cgatggaaaa caaagcgtgt cttcgatacc aagtgtcttc cttgtgtgga 2401 acagataatg aagataaaat tactactgga aaaagaaaac atgaagatga tgagccagta 2461 tttgaacaaa ttgaaaacac agccaatcct tccagatgtc ctgtgaaaat gtttgaatgc 2521 tacttgtcta aaagtccaca gaatcttaat cagaggatgg atgtttttta tttgcaacca 2581 gaatgctcta gttctacaga tagccctgtc tggtatacgt ctacttcact ggaccgaaac 2641 accttggaaa atatgcttgt acgggttctt ctagtaaaag atatttatga taaagacaat 2701 tatgaactgg atgaagacac agactaaaaa ggaacgttgc agaagcaatc gggataaaac 2761 agcattagat agtcatgctg ctagatcttt attatggaaa acatttcaag tttactcctt 2821 ctgttttgag ttttgtagca gtgtacccac gctgggtatt accatgtaaa taatctgtga 2881 gtgaaagttg ccattattct atgtagtggt tttaggatac ttaacaaata cattcaaatt 2941 ctttttttat tattatttat ttgattaggt atgtttgtaa ctttttacat tacagaatat 3001 gaatgagaat gtgccatgta taattttttt cttgtagtaa gaaacatcca tattgcacaa 3061 ctctactgtt gcaaagcttc cttggaaggg ggctctttta ctgggttctt aaccagatgg 3121 ttgtgtatgg gtagcactac taaaagttta gaacttgcag tgtctttcgg aatttttaaa 3181 ataaactgta aactaatagg ctggggtttt tgttttgttt tggggttttg ttttgtttgg 3241 ttttacattt tagttactga agccttacaa ggttatgtag agagatacca tcttctgtac 3301 caaaaataga caagagaatg ctgtcaatat tggtgtactg taatgtgaat ctatgctggt 3361 gaaaacaatt ttttgttccc cttattaaaa ccttagtgtc cttttctcat ttgtggcttt 3421 ctgcatcacc caatcaataa aaaacaaata tatatatgta aaaaaaa // LOCUS AF012128 1394 bp mRNA PRI 17-JAN-1998 DEFINITION Homo sapiens putative DNA methyltransferase (DNMT2) mRNA, complete cds. ACCESSION AF012128 NID g2627430 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1394) AUTHORS Yoder,J.A. and Bestor,T.H. TITLE A candidate mammalian DNA methyltransferase related to pmt1p of fission yeast JOURNAL Hum. Mol. Genet. 7, 279-284 (1998) REFERENCE 2 (bases 1 to 1394) AUTHORS Yoder,J.A. and Bestor,T.H. TITLE Direct Submission JOURNAL Submitted (01-JUL-1997) Genetics and Development, Columbia University, 701 W. 168th St., New York, NY 10032, USA REFERENCE 3 (bases 1 to 1394) AUTHORS Yoder,J.A. and Bestor,T.H. TITLE Direct Submission JOURNAL Submitted (21-NOV-1997) Genetics and Development, Columbia University, 701 W. 168th St., New York, NY 10032, USA REMARK Sequence update by submitter FEATURES Location/Qualifiers source 1..1394 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="10" /map="10p12-p14" gene 1..1394 /gene="DNMT2" CDS 8..1183 /gene="DNMT2" /codon_start=1 /product="putative DNA methyltransferase" /db_xref="PID:g2627431" /translation="MEPLRVLELYSGVGGMHHALRESCIPAQVVAAIDVNTVANEVYK YNFPHTQLLAKTIEGITLEEFDRLSFDMILMSPPCQPFTRIGRQGDMTDSRTNSFLHI LDILPRLQKLPKYILLENVKGFEVSSTRDLLIQTIENCGFQYQEFLLSPTSLGIPNSR LRYFLIAKLQSEPLPFQAPGQVLMEFPKIESVHPQKYAMDVENKIQEKNVEPNISFDG SIQCSGKDAILFKLETAEEIHRKNQQDSDLSVKMLKDFLEDDTDVNQYLLPPKSLLRY ALLLDIVQPTCRRSVCFTKGYGSYIEGTGSVLQTAEDVQVENIYKSLTNLSQEEQITK LLILKLRYFTPKEIANLLGFPPEFGFPEKITVKQRYRLLGNSLNVHVVAKLIKILYE" BASE COUNT 452 a 260 c 261 g 421 t ORIGIN 1 cgcggggatg gagcccctgc gggtgctgga gctatacagc ggcgtgggcg gcatgcacca 61 cgcgctgaga gaaagctgta tacctgcaca agtggtggct gccattgatg tcaacactgt 121 cgctaatgaa gtatacaagt ataattttcc tcacacacag ttacttgcca agacgattga 181 aggcattaca ctcgaagagt ttgacagatt atcttttgat atgattttaa tgagccctcc 241 ctgccagcca ttcacaagga ttggccggca gggtgatatg actgattcaa ggacgaatag 301 cttcttacat attctagata ttctcccaag attacaaaaa ttaccaaagt atattctttt 361 ggaaaatgtt aaaggttttg aagtatcttc tacaagagac ctcttgatac aaacaataga 421 aaattgtggc tttcagtacc aagaatttct attatctcca acctctcttg gcattccaaa 481 ttcaaggcta cgatattttc ttattgcaaa gcttcagtca gagccattac cctttcaagc 541 ccctggtcag gtactgatgg agttccccaa aattgaatct gtacatccac aaaaatatgc 601 aatggatgta gaaaataaaa ttcaagaaaa gaacgttgaa ccaaatatta gctttgatgg 661 cagcatacag tgttctggaa aagatgccat tctttttaag cttgaaactg cagaagaaat 721 tcacaggaaa aatcaacaag atagtgatct ctctgtgaaa atgctaaaag attttcttga 781 agatgacact gacgtgaacc agtatctttt accaccaaag tcattgctgc gatatgctct 841 tctgttagac attgttcagc ccacttgtag aaggtccgtg tgctttacca aaggatatgg 901 aagctacata gaagggacag ggtctgtgtt acagactgca gaggatgtgc aggttgagaa 961 tatctacaaa tcccttacca atttgtcaca agaagaacag ataacaaagc tgttaatact 1021 taaactgcga tatttcactc ctaaagaaat agcaaatctc cttggatttc ctccagagtt 1081 cggatttcct gagaagataa cagtgaaaca gcgttatcgc ctacttggaa atagtctcaa 1141 cgtgcatgta gtagctaaac taatcaaaat cttatatgaa taattttgaa ataactctga 1201 aagatggtca tatgatattc cttcattttc agagagtaat tctgaaattc tgttttgaac 1261 taattctggt gaaatttaac taaattattt taatctgtcc ttattaagaa atttggattt 1321 tattaaaaaa atccatgtgt ttcatcaaat ttatattact gtattttata aaatacgaac 1381 tattgttatg cttt // LOCUS AF012130 1482 bp mRNA PRI 01-JAN-1998 DEFINITION Homo sapiens brachyury variant A (TBX1) mRNA, complete cds. ACCESSION AF012130 NID g2735860 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1482) AUTHORS Chieffo,C., Garvey,N., Gong,W., Roe,B., Zhang,G., Silver,L., Emanuel,B.S. and Budarf,M.L. TITLE Isolation and characterization of a gene from the DiGeorge chromosomal region homologous to the mouse Tbx1 gene JOURNAL Genomics 43 (1997) In press REFERENCE 2 (bases 1 to 1482) AUTHORS Chieffo,C., Garvey,N., Gong,W., Roe,B., Zhang,G., Silver,L., Emanuel,B.S. and Budarf,M.L. TITLE Direct Submission JOURNAL Submitted (01-JUL-1997) Div. of Human Genetics, Children's Hosp. of Phila., 1 Children's Center, Philadelphia, PA 19104, USA FEATURES Location/Qualifiers source 1..1482 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="22" /map="22q11" /tissue_type="adult skeletal muscle; testis" gene 1..1482 /gene="TBX1" CDS 130..1326 /gene="TBX1" /note="variant A; similar to murine Tbx1" /codon_start=1 /product="brachyury" /db_xref="PID:g2735861" /translation="MHFSTVTRDMEAFTASSLSSLGAAGGFPGAASPGADPYGPREPP PPPPRYDPCAAAAPGAPGPPPPPHAYPFAPAAGAATSAAAEPEGPGASCAAAAKAPVK KNAKVAGVSVQLEMKALWDEFNQLGTEMIVTKAGRRMFPTFQVKLFGMDPMADYMLLM DFVPVDDKRYRYAFHSSSWLVAGKADPATPGRVHYHPDSPAKGAQWMKQIVSFDKLKL TNNLLDDNGHIILNSMHRYQPRFHVVYVDPRKDSEKYAEENFKTFVFEETRFTAVTAY QNHRITQLKIASNPFAKGFRDCDPEDWPRNHRPGALPLMSAFARSRNPVASPTQPSGT EKGGHVLKDKEVKAETSRNTPEREVELLRDAGGCVNLGLPCPAECQPFNTQGLVAGRT AGDRLC" BASE COUNT 314 a 478 c 462 g 228 t ORIGIN 1 ccggcagggg gagcgaggag gaagggaacc gcggccgggc cagcggaggc ggcggagcgc 61 accgcccacc agggctcagg gtcctccgac cgggtgaagc ttcgctggct gccaggatcc 121 ccggcaggga tgcacttcag caccgtcacc agggacatgg aagccttcac ggccagcagc 181 ctgagcagcc tgggggccgc ggggggcttc ccgggcgccg cgtcgcccgg cgccgacccg 241 tacggcccgc gcgagccccc gccgccgccg ccgcgctacg acccgtgcgc cgccgccgcc 301 cccggcgccc cgggcccgcc gccgccgccg cacgcctacc cgtttgcgcc ggccgccggg 361 gccgccacca gcgccgccgc cgagcccgag ggccccgggg ccagctgcgc ggccgcagcc 421 aaggcgccgg tgaagaagaa cgcgaaggtg gccggtgtga gcgtgcagct agagatgaag 481 gcgctgtggg acgagttcaa ccagctgggc accgagatga tcgtcaccaa ggccggcagg 541 cggatgtttc ccaccttcca agtgaagctc ttcggcatgg atcccatggc cgactatatg 601 ctgctcatgg acttcgtgcc ggtggacgat aagcgctacc ggtacgcctt ccacagctcc 661 tcctggctgg tggcggggaa ggccgaccct gccacgccag gccgcgtgca ctaccacccg 721 gactcgcctg ccaagggcgc gcagtggatg aagcaaatcg tgtccttcga caagctcaag 781 ctgaccaaca acctactgga cgacaacggc cacattattc tgaattccat gcacagatac 841 cagccccgct tccacgtggt ctatgtggac ccacgcaaag atagcgagaa atatgccgag 901 gagaacttca aaacctttgt gttcgaggag acacgattca ccgcggtcac tgcctaccag 961 aaccatcgga tcacgcagct caagattgcc agcaatccct tcgcgaaagg cttccgggac 1021 tgtgaccctg aggactggcc ccggaaccac cggcccggcg cactgccgct catgagcgcc 1081 ttcgcgcgct cgcggaaccc cgtggcttcc ccgacgcagc ccagcggcac ggagaaaggt 1141 ggacatgtcc tgaaggacaa ggaagtgaaa gctgagacgt ctaggaacac accagagaga 1201 gaagtggagc ttctgaggga tgcaggtggc tgtgtgaacc tggggctccc ctgccccgca 1261 gagtgccaac ccttcaatac ccagggcctg gtggctggga ggaccgcagg tgaccgtctt 1321 tgttgaatgc tgaggccggg ccatgggcac atggagttgt cgtgtttccc ttcactttgg 1381 ttcatgtttg aaatttccaa aattaaaaaa acagtgactt gttcagtaaa ttccaatatg 1441 aataaagtgc atgttttgta ataaaaaaaa aaaaaaaaaa aa // LOCUS AF012270 1374 bp mRNA PRI 18-SEP-1997 DEFINITION Homo sapiens visual pigment-like receptor peropsin (Rrh) mRNA, complete cds. ACCESSION AF012270 NID g2307009 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1374) AUTHORS Sun,H., Gilbert,D.J., Copeland,N.G., Jenkins,N.A. and Nathans,J. TITLE Peropsin, a novel visual pigment-like protein located in the apical microvilli of the retinal pigment epithelium JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (18), 9893-9898 (1997) MEDLINE 97420780 REFERENCE 2 (bases 1 to 1374) AUTHORS Sun,H., Gilbert,D.J., Copeland,N.G., Jenkins,N.A. and Nathans,J. TITLE Direct Submission JOURNAL Submitted (02-JUL-1997) Department of Molecular Biology and Genetics, Johns Hopkins University School of Medicine, 725 N. Wolfe St., Baltimore, MD 21205, USA FEATURES Location/Qualifiers source 1..1374 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="4" /map="4q" /tissue_type="retinal pigment epithelium; apical microvilli" gene 1..1374 /gene="Rrh" CDS 51..1064 /gene="Rrh" /note="G-protein coupled receptor" /codon_start=1 /product="visual pigment-like receptor peropsin" /db_xref="PID:g2307010" /translation="MLRNNLGNSSDSKNEDGSVFSQTEHNIVATYLIMAGMISIISNI IVLGIFIKYKELRTPTNAIIINLAVTDIGVSSIGYPMSAASDLYGSWKFGYAGCQVYA GLNIFFGMASIGLLTVVAVDRYLTICLPDVGRRMTTNTYIGLILGAWINGLFWALMPI IGWASYAPDPTGATCTINWRKNDRSFVSYTMTVIAINFIVPLTVMFYCYYHVTLSIKH HTTSDCTESLNRDWSDQIDVTKMSVIMICMFLVAWSPYSIVCLWASFGDPKKIPPPMA IIAPLFAKSSTFYNPCIYVVANKKFRRAMLAMFKCQTHQTMPVTSILPMDVSQNPLAS GRI" BASE COUNT 362 a 295 c 285 g 432 t ORIGIN 1 gaataagcct tcgataatta tgaagggtgt ttcggtatct tccctccaaa atgctaagaa 61 ataatttagg caacagttca gactctaaaa atgaagatgg ctcggtcttt tcacagactg 121 aacacaatat tgttgcaact tacttgatta tggcaggtat gataagtatt atcagcaaca 181 taatagttct gggcatcttc attaagtaca aggaacttcg gacacccaca aatgcaatta 241 ttattaacct ggctgttact gatatagggg tcagtagcat tggctatccc atgtctgctg 301 cctcagatct gtatggaagt tggaaatttg gatacgcagg ctgtcaggtt tatgctggat 361 tgaatatttt ttttggaatg gcaagcattg gattactcac ggtcgtggct gtggaccgat 421 acctgaccat ctgccttcct gacgtaggga gaagaatgac caccaacact tacatcggct 481 tgattctggg agcctggatc aatggcctgt tttgggcttt gatgcctatc atagggtggg 541 ctagttatgc cccagatcct actggtgcta cgtgtaccat aaactggagg aaaaatgata 601 gatcttttgt gtcttacacc atgacagtta ttgcgataaa ttttattgtg cccttgacag 661 tgatgtttta ctgctattac catgtcacgc tatccattaa acatcacact accagtgact 721 gcactgagtc cctcaacaga gactggtcag atcagataga tgtaacaaag atgtctgtga 781 tcatgatctg catgtttctg gtggcatggt ccccttattc catcgtgtgc ttatgggctt 841 cttttggtga cccaaagaag attcctcccc ccatggccat catagctcca ctgtttgcaa 901 aatcttctac attctataac ccctgcattt atgtggttgc taataaaaag tttcggaggg 961 caatgcttgc catgttcaaa tgtcagactc accaaacaat gcctgtgaca agtattttac 1021 ccatggatgt atctcaaaac ccattggctt ctggaagaat ctgaaataag agaaaaggac 1081 acgctatcaa aacactttag ttttttgaca atgcttttct tttaaatatg agcccattta 1141 gatcaagtgc agacatggat cattgtccta tgagagtgta agctcctcaa gcacagctcg 1201 tgcttccgtt tgtgcactct ggctgctgta gtgtatgctt ctctgtgtcc tgatatatca 1261 acttattgct catctccttt gatgaattag gcatcagagg ttaaggtccc ctttctttct 1321 ccctattatg gcatgcatta cactgtactg atgaccttta acttgcctgg ctcc // LOCUS AF012272 2247 bp mRNA PRI 03-JAN-1998 DEFINITION Homo sapiens rho-type GTPase-activating protein rhoGAPX-1 (ARHGAP6) mRNA, alternatively spliced, complete cds. ACCESSION AF012272 NID g2656132 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2247) AUTHORS Schaefer,L., Prakash,S. and Zoghbi,H.Y. TITLE Cloning and characterization of a novel rho-type GTPase-activating protein gene (ARHGAP6) from the critical region for microphthalmia with linear skin defects JOURNAL Genomics 46 (2), 268-277 (1997) MEDLINE 98086484 REFERENCE 2 (bases 1 to 2247) AUTHORS Schaefer,L., Prakash,S. and Zoghbi,H.Y. TITLE Direct Submission JOURNAL Submitted (03-JUL-1997) Howard Hughes Medical Institute, Baylor College of Medicine, 1 Baylor Plaza, Room T836, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..2247 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /map="Xp22.3" gene 1..2247 /gene="ARHGAP6" CDS 43..1485 /gene="ARHGAP6" /note="alternatively spliced; short isoform" /codon_start=1 /product="rho-type GTPase-activating protein rhoGAPX-1" /db_xref="PID:g2656133" /translation="MNSDTHRNFDPTATLRNQGDFTWNSMSGRSVRLRSVPIQSLSEL ERARLQEVPFYQLQQDCDLSCQITIPKDGQKRKKSLRKKLDSLGKEKNKDKEFIPQAF GMPLSQVIANDRAYKLKQDLQRDEQKDASDFVASLLPFGNKRQNKELSSSNSSLSSTS ETPNESTSPNTPEPAPRARRRGAMSVDSITDLDDNQSRLLEALQLSLPAEAQSKKEKA RDKKLSLNPIYRQVPRLVDSCCQHLEKHGLQTVGIFRVGSSKKRVRQLREEFDRGIDV SLEEEHSVHDVAALLKEFLRDMPDPLLTRELYTAFINTLLLEPEEQLGTLQLLIYLLP PCNCDTLHRLLQFLSIVARHADDNISKDGQEVTGNKMTSLNLATIFGPNLLHKQKSSD KEFSVQSSARAEESTAIIAVVQKMIENYEALFMVPPDLQNEVLISLLETDPDVVDYLL RRKASQSSTSSVLPAAVQACPQYPASMFTP" BASE COUNT 612 a 584 c 543 g 508 t ORIGIN 1 ttggagttgt atgatcttca gatcttaggc acaaaaccac cgatgaattc cgatacacat 61 cgtaactttg accccaccgc aacccttaga aatcagggtg atttcacctg gaacagcatg 121 tcaggccgca gtgtgcggct gaggtcagtc cccatccaga gtctctcaga gctggagagg 181 gcccggctgc aggaagtgcc tttttatcag ttgcaacagg actgtgacct gagctgtcag 241 atcaccattc ccaaagatgg acaaaagaga aagaaatctt taagaaagaa actggattca 301 ctaggaaagg agaaaaacaa agacaaagaa ttcatcccac aggcatttgg aatgccctta 361 tcccaagtca ttgcgaatga cagggcctat aaactcaagc aggacttgca gagggacgag 421 cagaaagatg catctgactt tgtggcttcc ctcctcccat ttggaaataa aagacaaaac 481 aaagaactct caagcagtaa ctcatctctc agctcaacct cagaaacacc gaatgagtca 541 acgtccccaa acaccccgga accggctcct cgggctagga ggaggggtgc catgtcagtg 601 gattctatca ccgatcttga tgacaatcag tctcgactac tagaagcttt acaactttcc 661 ttgcctgctg aggctcaaag taaaaaggaa aaagccagag ataagaaact cagtctgaat 721 cctatttaca gacaggtccc taggctggtg gacagctgct gtcagcacct agaaaaacat 781 ggcctccaga cagtggggat attccgagtt ggaagctcaa aaaagagagt gagacaatta 841 cgtgaggaat ttgaccgtgg gattgatgtc tctctggagg aggagcacag tgttcatgat 901 gtggcagcct tgctgaaaga gttcctgagg gacatgccag acccccttct caccagggag 961 ctgtacacag ctttcatcaa cactctcttg ttggagccgg aggaacagct gggcaccttg 1021 cagctcctca tataccttct acctccctgc aactgcgaca ccctccaccg cctgctacag 1081 ttcctctcca tcgtggccag gcatgccgat gacaacatca gcaaagatgg gcaagaggtc 1141 actgggaata aaatgacatc tctaaactta gccaccatat ttggacccaa cctgctgcac 1201 aagcagaagt catcagacaa agaattctca gttcagagtt cagcccgggc tgaggagagc 1261 acggccatca tcgctgttgt gcaaaagatg attgaaaatt atgaagccct gttcatggtt 1321 cccccagatc tccagaacga agtgctgatc agcctgttag agaccgatcc tgatgtcgtg 1381 gactatttac tcagaagaaa ggcttcccaa tcatcaacaa gttctgtgct tccggctgca 1441 gtgcaggcct gcccacagta ccctgccagc atgtttacgc cctgacctgg agggaccagc 1501 taaggaagcc ctgacatgct gcagtcggaa gtttcctttt ccgtgggagg gaggcattca 1561 tctacagact ccaacaaggc ctccagcgga gacatctccc cttatgacaa caactcccca 1621 gtgctgtctg agcgctccct gctggctatg caagaggacg cggccccggg gggctcggag 1681 aagctttaca gagtgccagg gcagtttatg ctggtgggcc acttgtcgtc gtcaaagtca 1741 agggaaagtt ctcctggacc aaggcttggg aaaggtaact ggagcctggc cagcaggcgc 1801 tggccaaaac aagcgaccct cctcttgttg catgtggcat ggtgtggggc tcttcggacc 1861 ttctcttcgt ctctccctta tttgatgttt ctgtaattca acctgcattt accgagtacc 1921 tgctgaacgc ctactctagg aagaggaagg tatggcgagc ccagtcgttg gctgggaggg 1981 tccctgcagg ccacactcca agtctcagat atctgggaaa taaacgcatg cccagaggag 2041 ccactcctat tcccagctga ttctttttcc ctccagcatc ctaggaactt tatgtatact 2101 ggatttgggt tgcatatttg ctagtctgag gtgccaccca attactgctg ttacatttct 2161 aaagaaaaca agctgaacct actatatgta tgaagacaac cttccagaac caaatagtgt 2221 tagtatccaa tgacaagaga cggaatt // LOCUS AF012535 1799 bp mRNA PRI 21-AUG-1997 DEFINITION Homo sapiens death receptor 5 (DR5) mRNA, complete cds. ACCESSION AF012535 NID g2338419 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1799) AUTHORS Sheridan,J.P., Marsters,S.A., Pitti,R.M., Gurney,A., Skubatch,M., Baldwin,D., Ramakrishnan,L., Gray,C.L., Baker,K., Wood,W.I., Goddard,A.D., Godowski,P. and Ashkenazi,A. TITLE Control of TRAIL-induced apoptosis by a family of signaling and decoy receptors JOURNAL Science 277 (5327), 818-821 (1997) MEDLINE 97390509 REFERENCE 2 (bases 1 to 1799) AUTHORS Sheridan,J.P., Marsters,S.A., Pitti,R.M., Gurney,A., Baldwin,D., Ramakrishnan,L., Gray,C.L., Baker,K., Wood,W.I., Goddard,A.D., Godowski,P. and Ashkenazi,A. TITLE Direct Submission JOURNAL Submitted (06-JUL-1997) Molecular Oncology, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA FEATURES Location/Qualifiers source 1..1799 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..1799 /gene="DR5" CDS 140..1375 /gene="DR5" /note="tumor necrosis factor receptor family member; mediates apoptosis induction by TRAIL/Apo2L." /codon_start=1 /product="death receptor 5" /db_xref="PID:g2338420" /translation="MEQRGQNAPAASGARKRHGPGPREARGARPGLRVPKTLVLVVAA VLLLVSAESALITQQDLAPQQRAAPQQKRSSPSEGLCPPGHHISEDGRDCISCKYGQD YSTHWNDLLFCLRCTRCDSGEVELSPCTTTRNTVCQCEEGTFREEDSPEMCRKCRTGC PRGMVKVGDCTPWSDIECVHKESGIIIGVTVAAVVLIVAVFVCKSLLWKKVLPYLKGI CSGGGGDPERVDRSSQRPGAEDNVLNEIVSILQPTQVPEQEMEVQEPAEPTGVNMLSP GESEHLLEPAEAERSQRRRLLVPANEGDPTETLRQCFDDFADLVPFDSWEPLMRKLGL MDNEIKVAKAEAAGHRDTLYTMLIKWVNKTGRDASVHTLLDALETLGERLAKQKIEDH LLSSGKFMYLEGNADSALS" BASE COUNT 459 a 454 c 490 g 396 t ORIGIN 1 cccacgcgtc cgcataaatc agcacgcggc cggagaaccc cgcaatctct gcgcccacaa 61 aatacaccga cgatgcccga tctactttaa gggctgaaac ccacgggcct gagagactat 121 aagagcgttc cctaccgcca tggaacaacg gggacagaac gccccggccg cttcgggggc 181 ccggaaaagg cacggcccag gacccaggga ggcgcgggga gccaggcctg ggctccgggt 241 ccccaagacc cttgtgctcg ttgtcgccgc ggtcctgctg ttggtctcag ctgagtctgc 301 tctgatcacc caacaagacc tagctcccca gcagagagcg gccccacaac aaaagaggtc 361 cagcccctca gagggattgt gtccacctgg acaccatatc tcagaagacg gtagagattg 421 catctcctgc aaatatggac aggactatag cactcactgg aatgacctcc ttttctgctt 481 gcgctgcacc aggtgtgatt caggtgaagt ggagctaagt ccctgcacca cgaccagaaa 541 cacagtgtgt cagtgcgaag aaggcacctt ccgggaagaa gattctcctg agatgtgccg 601 gaagtgccgc acagggtgtc ccagagggat ggtcaaggtc ggtgattgta caccctggag 661 tgacatcgaa tgtgtccaca aagaatcagg catcatcata ggagtcacag ttgcagccgt 721 agtcttgatt gtggctgtgt ttgtttgcaa gtctttactg tggaagaaag tccttcctta 781 cctgaaaggc atctgctcag gtggtggtgg ggaccctgag cgtgtggaca gaagctcaca 841 acgacctggg gctgaggaca atgtcctcaa tgagatcgtg agtatcttgc agcccaccca 901 ggtccctgag caggaaatgg aagtccagga gccagcagag ccaacaggtg tcaacatgtt 961 gtcccccggg gagtcagagc atctgctgga accggcagaa gctgaaaggt ctcagaggag 1021 gaggctgctg gttccagcaa atgaaggtga tcccactgag actctgagac agtgcttcga 1081 tgactttgca gacttggtgc cctttgactc ctgggagccg ctcatgagga agttgggcct 1141 catggacaat gagataaagg tggctaaagc tgaggcagcg ggccacaggg acaccttgta 1201 cacgatgctg ataaagtggg tcaacaaaac cgggcgagat gcctctgtcc acaccctgct 1261 ggatgccttg gagacgctgg gagagagact tgccaagcag aagattgagg accacttgtt 1321 gagctctgga aagttcatgt atctagaagg taatgcagac tctgccttgt cctaagtgtg 1381 attctcttca ggaagtgaga ccttccctgg tttacctttt ttctggaaaa agcccaactg 1441 gactccagtc agtaggaaag tgccacaatt gtcacatgac cggtactgga agaaactctc 1501 ccatccaaca tcacccagtg gatggaacat cctgtaactt ttcactgcac ttggcattat 1561 ttttataagc tgaatgtgat aataaggaca ctatggaaat gtctggatca ttccgtttgt 1621 gcgtactttg agatttggtt tgggatgtca ttgttttcac agcacttttt tatcctaatg 1681 taaatgcttt atttatttat ttgggctaca ttgtaagatc catctacaaa aaaaaaaaaa 1741 aaaaaaaaag ggcggccgcg actctagagt cgacctgcag aagcttggcc gccatggcc // LOCUS AF012536 1180 bp mRNA PRI 21-AUG-1997 DEFINITION Homo sapiens decoy receptor 1 (DcR1) mRNA, complete cds. ACCESSION AF012536 NID g2338421 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1180) AUTHORS Sheridan,J.P., Marsters,S.A., Pitti,R.M., Gurney,A., Skubatch,M., Baldwin,D., Ramakrishnan,L., Gray,C.L., Baker,K., Wood,W.I., Goddard,A.D., Godowski,P. and Ashkenazi,A. TITLE Control of TRAIL-induced apoptosis by a family of signaling and decoy receptors JOURNAL Science 277 (5327), 818-821 (1997) MEDLINE 97390509 REFERENCE 2 (bases 1 to 1180) AUTHORS Sheridan,J.P., Marsters,S.A., Pitti,R.M., Gurney,A., Baldwin,D., Ramakrishnan,L., Gray,C.L., Baker,K., Wood,W.I., Goddard,A.D., Godowski,P. and Ashkenazi,A. TITLE Direct Submission JOURNAL Submitted (06-JUL-1997) Molecular Oncology, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA FEATURES Location/Qualifiers source 1..1180 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..1180 /gene="DcR1" CDS 193..972 /gene="DcR1" /note="tumor necrosis factor receptor family member; inhibits apoptosis induction by TRAIL/Apo2L" /codon_start=1 /product="decoy receptor 1" /db_xref="PID:g2338422" /translation="MARIPKTLKFVVVIVAVLLPVLAYSATTARQEEVPQQTVAPQQQ RHSFKGEECPAGSHRSEHTGACNPCTEGVDYTNASNNEPSCFPCTVCKSDQKHKSSCT MTRDTVCQCKEGTFRNENSPEMCRKCSRCPSGEVQVSNCTSWDDIQCVEEFGANATVE TPAAEETMNTSPGTPAPAAEETMNTSPGTPAPAAEETMTTSPGTPAPAAEETMTTSPG TPAPAAEETMTTSPGTPASSHYLSCTIVGIIVLIVLLIVFV" BASE COUNT 338 a 326 c 298 g 218 t ORIGIN 1 gctgtgggaa cctctccacg cgcacgaact cagccaacga tttctgatag atttttggga 61 gtttgaccag agatgcaagg ggtgaaggag cgcttcctac cgttagggaa ctctggggac 121 agagcgcccc ggccgcctga tggccgaggc agggtgcgac ccaggaccca ggacggcgtc 181 gggaaccata ccatggcccg gatccccaag accctaaagt tcgtcgtcgt catcgtcgcg 241 gtcctgctgc cagtcctagc ttactctgcc accactgccc ggcaggagga agttccccag 301 cagacagtgg ccccacagca acagaggcac agcttcaagg gggaggagtg tccagcagga 361 tctcatagat cagaacatac tggagcctgt aacccgtgca cagagggtgt ggattacacc 421 aacgcttcca acaatgaacc ttcttgcttc ccatgtacag tttgtaaatc agatcaaaaa 481 cataaaagtt cctgcaccat gaccagagac acagtgtgtc agtgtaaaga aggcaccttc 541 cggaatgaaa actccccaga gatgtgccgg aagtgtagca ggtgccctag tggggaagtc 601 caagtcagta attgtacgtc ctgggatgat atccagtgtg ttgaagaatt tggtgccaat 661 gccactgtgg aaaccccagc tgctgaagag acaatgaaca ccagcccggg gactcctgcc 721 ccagctgctg aagagacaat gaacaccagc ccagggactc ctgccccagc tgctgaagag 781 acaatgacca ccagcccggg gactcctgcc ccagctgctg aagagacaat gaccaccagc 841 ccggggactc ctgccccagc tgctgaagag acaatgacca ccagcccggg gactcctgcc 901 tcttctcatt acctctcatg caccatcgta gggatcatag ttctaattgt gcttctgatt 961 gtgtttgttt gaaagacttc actgtggaag aaattccttc cttacctgaa aggttcaggt 1021 aggcgctggc tgagggcggg gggcgctgga cactctctgc cctgcctccc tctgctgtgt 1081 tcccacagac agaaacgcct gcccctgccc caaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1141 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa // LOCUS AF012549 2339 bp mRNA PRI 07-AUG-1997 DEFINITION Homo sapiens outer dense fiber protein 2 (odf2) mRNA, complete cds. ACCESSION AF012549 NID g2317718 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2339) AUTHORS Petersen,C. and Hoyer-Fender,S. TITLE Identification of the cDNAs encoding the human sperm outer dense fiber protein ODF2 JOURNAL Unpublished REFERENCE 2 (bases 1 to 2339) AUTHORS Petersen,C. and Hoyer-Fender,S. TITLE Direct Submission JOURNAL Submitted (06-JUL-1997) 3. Zoologisches Institut/Entwicklungsbiologie, Universitaet Goettingen, Humboldtallee 34a, Goettingen, Niedersachsen 37073, Germany FEATURES Location/Qualifiers source 1..2339 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="sperm" gene 1..2339 /gene="odf2" CDS 306..2222 /gene="odf2" /function="may help maintain passive elastic structures and elastic recoil of the sperm-tail" /note="ODF2; one of the major outer dense fiber (ODF) components of sperm-tail; located on the outside of the axoneme in the midpiece" /codon_start=1 /product="outer dense fiber protein 2" /db_xref="PID:g2317719" /translation="MSASSSGGSPRFPSCGKNGVTSLTQKKVLRAPCGAPSVTVTKSH KRGMKGDTVNVRRSVRVKTKNPPHCLEITPPSSEKLVSVMRLSDLSTEDDDSGHCKMN RYDKKIDSLMNAVGCLKSEVKMQKGERQMAKRFLEERKEELEEVAHELAETEHENTVL RHNIERMKEEKDFTILQKKHLQQEKECLMSKLVEAEMDGAAAAKQVMALKDTIGKLKT EKQMTCTDINTLTRQKELLLQKLSTFEETNRTLRDLLREQHCKEDSERLMEQQGALLK RLAEADSEKARLLLLLQDKDKEVEELLQEIQCEKAQAKTASELSKSMESMRGHLQAQL RSKEAENSRLCMQIKNLERSGNQHKAEVEAIMEQLKELKQKGDRDKESLKKAIRAQKE RAEKSEEYAEQLHVQLADKDLYVAEALSTLESWRSRYNQVVKEKGDLELEIIVLNDRV TDLVNQQQTLEEKMREDRDSLVERLHRQTAEYSAFKLENERLKASFAPMEDKLNQAHL EVQQLKASVKNYEGMIDNYKSQVMKTRLEADEVAAQLERCDKENKILKDEMNKEIEAA RRQFQSQLADLQQLPDILKITEAKLAECQDQLQGYERKNIDLTAIISDLRSRVRDWQK GSHELTRAGARIPR" BASE COUNT 686 a 557 c 673 g 418 t 5 others ORIGIN 1 tcgacactat tggatcaaag aattcggcac agcacttggn gccgagcgga tataaaaaca 61 ctntccgcac gtncgccggc gcctcaggtt tcccccggac agttgctgtg cgacttggac 121 agtagaggag cgcctcccaa gttttcatcc aactgccaan cccaaagctt ccacccttct 181 cccctcagag aggacgtttg atgccgggcc ccttgagagg ctcattgaca agcctgcccc 241 tctgggtccc cctgagcaga gcctgctgac ccaattgccc acctttgcgg ctttgatgcc 301 tagccatgtc tgcctcatcc tcaggnggct cccccaggtt tccatcgtgt gggaagaacg 361 gagtaacgag tctcacgcag aaaaaggtct tgagagcacc ttgtggcgca cccagtgtaa 421 ctgtgacgaa atctcacaag cgaggaatga aaggggacac tgtgaatgtg cggcggagtg 481 tccgggtgaa aaccaagaat ccacctcatt gcctggagat cacgccacca tcttcagaaa 541 agctggtctc agtgatgcgg ttaagtgacc tctctacaga agatgatgac tcaggtcact 601 gtaaaatgaa ccgttatgat aagaagattg atagtctaat gaatgcggtt ggttgtctga 661 agtctgaggt caagatgcaa aaaggtgagc gccagatggc caaaaggttc ctggaggaac 721 ggaaggaaga gctggaggag gtggcccacg aactggctga gactgagcac gagaacacgg 781 tgttgaggca caacatcgag cgcatgaagg aggagaagga cttcaccata cttcagaaga 841 aacacctaca acaggagaag gagtgcctca tgtccaagct ggtggaggcg gaaatggatg 901 gggctgcggc tgccaagcag gtcatggcct tgaaggatac catcgggaag ctgaaaacgg 961 agaaacaaat gacctgcacg gacatcaaca ccctgacaag gcagaaggaa cttctcctgc 1021 agaagctgag cacatttgag gagaccaacc gcaccctccg agacctcctg agggaacagc 1081 actgcaaaga ggattctgaa agactaatgg agcaacaagg agcactgctg aaacggctgg 1141 cggaggccga ctcagagaaa gcgcgcctgc tgttactgct gcaagacaag gacaaggagg 1201 tggaagagct ccttcaggaa atacaatgtg agaaggctca agcaaagaca gcgtctgagc 1261 tttctaaatc catggagtcc atgcgtgggc atttgcaggc acagcttcgg tccaaagagg 1321 ctgagaacag tcgcctgtgc atgcagatta agaatctgga gcgcagcggg aatcagcata 1381 aggcagaagt ggaggccatc atggagcagc tgaaggagtt gaagcagaag ggagaccgag 1441 acaaagagag cttgaagaag gccatccgag cccagaagga gcgagccgag aagagcgagg 1501 agtatgctga gcagctacac gtgcaactcg ctgacaagga tctttatgtt gctgaagctt 1561 tatccactct ggaatcctgg aggagccgct acaaccaagt tgtaaaagaa aagggagacc 1621 ttgagctgga aattattgtc ctgaatgacc gggtaacaga tcttgtaaac caacaacaaa 1681 ccctggagga gaagatgcgg gaagaccggg atagcctggt ggagagacta caccgtcaga 1741 ctgctgagta ttccgcattc aagctggaga atgagaggct gaaggccagc tttgctccaa 1801 tggaggacaa actcaaccag gcacacctcg aggtccagca gctgaaggcc tcagtgaaga 1861 actatgaggg gatgattgac aactataaga gtcaggtgat gaagaccaga ttggaggctg 1921 atgaagtagc tgcccagcta gaacgctgtg acaaagagaa caagatcctt aaagatgaga 1981 tgaacaaaga gattgaggcg gcacgaaggc agttccagtc tcagctggct gacctgcagc 2041 agctccctga catcctgaag atcacggagg cgaagctggc tgagtgccaa gaccaactgc 2101 agggctatga gcggaagaac atcgacctca cagccatcat atcagacctg cgcagccggg 2161 taagggactg gcagaaaggg tcccacgaac tgacccgagc aggggcccgc ataccaagat 2221 gagctgcacg ccccccaagg gaggactact tcctttttct tggctgctgc tttttaaaag 2281 gagtgagcta tcatcagtgc tgtgaaataa aagtctggtg tgccaaaaaa aaaaaaaaa // LOCUS AF013168 8600 bp mRNA PRI 19-AUG-1997 DEFINITION Homo sapiens hamartin (TSC1) mRNA, complete cds. ACCESSION AF013168 NID g2331280 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8600) AUTHORS van Slegtenhorst,M., de Hoogt,R., Hermans,C., Nellist,M., Janssen,B., Verhoef,S., Lindhout,D., van den Ouweland,A., Halley,D., Young,J., Burley,M., Jeremiah,S., Woodward,K., Nahmias,J., Fox,M., Ekong,R., Osborne,J., Wolfe,J., Povey,S., Snell,R.G., Cheadle,J.P., Jones,A.C., Tachataki,M., Ravine,D., Sampson,J.R., Reeve,M.P., Richardson,P., Wilmer,F., Munro,C., Hawkins,T.L., Sepp,T., Ali,J.B.M., Ward,S., Green,A.J., Yates,J.R.W., Kwiatkowska,J., Henske,E.P., Short,M.P., Haines,J.H., Jozwiak,S. and Kwiatkowski,D.J. TITLE Identification of the tuberous sclerosis gene TSC1 on chromosome 9q34 JOURNAL Science 277 (5327), 805-808 (1997) MEDLINE 97390505 REFERENCE 2 (bases 1 to 8600) AUTHORS van Slegtenhorst,M., Young,J., Halley,D., Povey,S., Sampson,J.R. and Kwiatkowski,D.J. TITLE Direct Submission JOURNAL Submitted (09-JUL-1997) Clinical Genetics, Erasmus University and University Hospital, Dr Molewaterplein 50, Rotterdam 3015GE, The Netherlands FEATURES Location/Qualifiers source 1..8600 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="9" /map="9q34" gene 1..8600 /gene="TSC1" CDS 222..3716 /gene="TSC1" /function="tumor suppressor protein" /note="tuberous sclerosis complex 1 protein" /codon_start=1 /product="hamartin" /db_xref="PID:g2331281" /translation="MAQQANVGELLAMLDSPMLGVRDDVTAVFKENLNSDRGPMLVNT LVDYYLETSSQPALHILTTLQEPHDKHLLDRINEYVGKAATRLSILSLLGHVIRLQPS WKHKLSQAPLLPSLLKCLKMDTDVVVLTTGVLVLITMLPMIPQSGKQHLLDFFDIFGR LSSWCLKKPGHVAEVYLVHLHASVYALFHRLYGMYPCNFVSFLRSHYSMKENLETFEE VVKPMMEHVRIHPELVTGSKDHELDPRRWKRLETHDVVIECAKISLDPTEASYEDGYS VSHQISARFPHRSADVTTSPYADTQNSYGCATSTPYSTSRLMLLNMPGQLPQTLSSPS TRLITEPPQATLWSPSMVCGMTTPPTSPGNVPPDLSHPYSKVFGTTAGGKGTPLGTPA TSPPPAPLCHSDDYVHISLPQATVTPPRKEERMDSARPCLHRQHHLLNDRGSEEPPGS KGSVTLSDLPGFLGDLASEEDSIEKDKEEAAISRELSEITTAEAEPVVPRGGFDSPFY RDSLPGSQRKTHSAASSSQGASVNPEPLHSSLDKLGPDTPKQAFTPIDLPCGSADESP AGDRECQTSLETSIFTPSPCKIPPPTRVGFGSGQPPPYDHLFEVALPKTAHHFVIRKT EELLKKAKGNTEEDGVPSTSPMEVLDRLIQQGADAHSKELNKLPLPSKSVDWTHFGGS PPSDEIRTLRDQLLLLHNQLLYERFKRQQHALRNRRLLRKVIKAAALEEHNAAMKDQL KLQEKDIQMWKVSLQKEQARYNQLQEQRDTMVTKLHSQIRQLQHDREEFYNQSQELQT KLEDCRNMIAELRIELKKANNKVCHTELLLSQVSQKLSNSESVQQQMEFLNRQLLVLG EVNELYLEQLQNKHSDTTKEVEMMKAAYRKELEKNRSHVLQQTQRLDTSQKRILELES HLAKKDHLLLEQKKYLEDVKLQARGQLQAAESRYEAQKRITQVFELEILDLYGRLEKD GLLKKLEEEKAEAAEAAEERLDCCNDGCSDSMVGHNEEASGHNGETKTPRPSSARGSS GSRGGGGSSSSSSELSTPEKPPHQRAGPFSSRWETTMGEASASIPTTVGSLPSSKSFL GMKARELFRNKSESQCDEDGMTSSLSESLKTELGKDLGVEAKIPLNLDGPHPSPPTPD SVGQLHIMDYNETHHEHS" misc_feature 2409..3116 /gene="TSC1" /note="encodes a coiled-coil domain" BASE COUNT 2365 a 2010 c 2137 g 2087 t 1 others ORIGIN 1 gtgctgtacg tccaagatgg cggcgcctgt aggctggagg gactgtgagg taaacagctg 61 agggggagga gacggtggtg accatgaaag acaccaggtt gacagcactg gaaactgaag 121 taccagttgt cgctagaaca gtttggtagt ggccccaatg aagaaccttc agaacctgta 181 gcacacgtcc tggagccagc acagcgcctt cgagcgagag aatggcccaa caagcaaatg 241 tcggggagct tcttgccatg ctggactccc ccatgctggg tgtgcgggac gacgtgacag 301 ctgtctttaa agagaacctc aattctgacc gtggccctat gcttgtaaac accttggtgg 361 attattacct ggaaaccagc tctcagccgg cattgcacat cctgaccacc ttgcaagagc 421 cacatgacaa gcacctcttg gacaggatta acgaatatgt gggcaaagcc gccactcgtt 481 tatccatcct ctcgttactg ggtcatgtca taagactgca gccatcttgg aagcataagc 541 tctctcaagc acctcttttg ccttctttac taaaatgtct caagatggac actgacgtcg 601 ttgtcctcac aacaggcgtc ttggtgttga taaccatgct accaatgatt ccacagtctg 661 ggaaacagca tcttcttgat ttctttgaca tttttggccg tctgtcatca tggtgcctga 721 agaaaccagg ccacgtggcg gaagtctatc tcgtccatct ccatgccagt gtgtacgcac 781 tctttcatcg cctttatgga atgtaccctt gcaacttcgt ctcctttttg cgttctcatt 841 acagtatgaa agaaaacctg gagacttttg aagaagtggt caagccaatg atggagcatg 901 tgcgaattca tccggaatta gtgactggat ccaaggacca tgaactggac cctcgaaggt 961 ggaagagatt agaaactcat gatgttgtga tcgagtgtgc caaaatctct ctggatccca 1021 cagaagcctc atatgaagat ggctattctg tgtctcacca aatctcagcc cgctttcctc 1081 atcgttcagc cgatgtcacc accagccctt atgctgacac acagaatagc tatgggtgtg 1141 ctacttctac cccttactcc acgtctcggc tgatgttgtt aaatatgcca gggcagctac 1201 ctcagactct gagttcccca tcgacacggc tgataactga accaccacaa gctactcttt 1261 ggagcccatc tatggtttgt ggtatgacca ctcctccaac ttctcctgga aatgtcccac 1321 ctgatctgtc acacccttac agtaaagtct ttggtacaac tgcaggtgga aaaggaactc 1381 ctctgggaac cccagcaacc tctcctcctc cagccccact ctgtcattcg gatgactacg 1441 tgcacatttc actcccccag gccacagtca caccccccag gaaggaagag agaatggatt 1501 ctgcaagacc atgtctacac agacaacacc atcttctgaa tgacagagga tcagaagagc 1561 cacctggcag caaaggttct gtcactctaa gtgatcttcc agggttttta ggtgatctgg 1621 cctctgaaga agatagtatt gaaaaagata aagaagaagc tgcaatatct agagaacttt 1681 ctgagatcac cacagcagag gcagagcctg tggttcctcg aggaggcttt gactctccct 1741 tttaccgaga cagtctccca ggttctcagc ggaagaccca ctcggcagcc tccagttctc 1801 agggcgccag cgtgaaccct gagcctttac actcctccct ggacaagctt gggcctgaca 1861 caccaaagca agcctttact cccatagacc tgccctgcgg cagtgctgat gaaagccctg 1921 cgggagacag ggaatgccag acttctttgg agaccagtat cttcactccc agtccttgta 1981 aaattccacc tccgacgaga gtgggctttg gaagcgggca gcctcccccg tatgatcatc 2041 tttttgaggt ggcattgcca aagacagccc atcattttgt catcaggaag actgaggagc 2101 tgttaaagaa agcaaaagga aacacagagg aagatggtgt gccctctacc tccccaatgg 2161 aagtgctgga cagactgata cagcagggag cagacgcgca cagcaaggag ctgaacaagt 2221 tgcctttacc cagcaagtct gtcgactgga cccactttgg aggctctcct ccttcagatg 2281 agatccgcac cctccgagac cagttgcttt tactgcacaa ccagttactc tatgagcgtt 2341 ttaagaggca gcagcatgcc ctccggaaca ggcggctcct ccgcaaggtg atcaaagcag 2401 cagctctgga ggaacataat gctgccatga aagatcagtt gaagttacaa gagaaggaca 2461 tccagatgtg gaaggttagt ctgcagaaag aacaagctag atacaatcag ctccaggagc 2521 agcgtgacac tatggtaacc aagctccaca gccagatcag acagctgcag catgaccgag 2581 aggaattcta caaccagagc caggaattac agacgaagct ggaggactgc aggaacatga 2641 ttgcggagct gcggatagaa ctgaagaagg ccaacaacaa ggtgtgtcac actgagctgc 2701 tgctcagtca ggtttcccaa aagctctcaa acagtgagtc ggtccagcag cagatggagt 2761 tcttgaacag gcagctgttg gttcttgggg aggtcaacga gctctatttg gaacaactgc 2821 agaacaagca ctcagatacc acaaaggaag tagaaatgat gaaagccgcc tatcggaaag 2881 agctagaaaa aaacagaagc catgttctcc agcagactca gaggcttgat acctcccaaa 2941 aacggatttt ggaactggaa tctcacctgg ccaagaaaga ccaccttctt ttggaacaga 3001 agaaatatct agaggatgtc aaactccagg caagaggaca gctgcaggcc gcagagagca 3061 ggtatgaggc tcagaaaagg ataacccagg tgtttgaatt ggagatctta gatttatatg 3121 gcaggttgga gaaagatggc ctcctgaaaa aacttgaaga agaaaaagca gaagcagctg 3181 aagcagcaga agaaaggctt gactgttgta atgacgggtg ctcagattcc atggtagggc 3241 acaatgaaga ggcatctggc cacaacggtg agaccaagac ccccaggccc agcagcgccc 3301 ggggcagtag tggaagcaga ggtggtggag gcagcagcag cagcagcagc gagctttcta 3361 ccccagagaa acccccacac cagagggcag gcccattcag cagtcggtgg gagacgacta 3421 tgggagaagc gtctgccagc atccccacca ctgtgggctc acttcccagt tcaaaaagct 3481 tcctgggtat gaaggctcga gagttatttc gtaataagag cgagagccag tgtgatgagg 3541 acggcatgac cagtagcctt tctgagagcc taaagacaga actgggcaaa gacttgggtg 3601 tggaagccaa gattcccctg aacctagatg gccctcaccc gtctcccccg accccggaca 3661 gtgttggaca gctacatatc atggactaca atgagactca tcatgaacac agctaaggaa 3721 tgatggtcaa tcagtgttaa cttgcatatt gttggcacag aacaggaggt gtgaatgcac 3781 gtttcaaagc tttcctgttt ccagggtctg agtgcaagtt catgtgtgga aatgggacgg 3841 aggtcctttg gacagctgac tgaatgcaga acggtttttg gatctggcat tgaaatgcct 3901 cttgaccttc ccctccaccc gccctaaccc cctctcattt acctcgcagt gtgttctaat 3961 ccaagggcca gttggtgttc ctcagtagct ttactttctt ccttcccccc caaatggttg 4021 cgtcctttga acctgtgcaa tatgaggcca aatttaatct ttgagtctaa cacaccactt 4081 tctgctttcc cgaagttcag ataactgggt tggctctcaa ttagaccagg tagtttgttg 4141 cattgcaggt aagtctggtt ttgtcccttc caggaggaca tagcctgcaa agctggttgt 4201 ctttacatga aagcgtttac atgagacttt ccgactgctt ttttgattct gaagttcagc 4261 atctaaagca gcaggtctag aagaacaacg gtttattcat acttgcattc ttttggcagt 4321 tctgataagc ttcctagaaa gttctgtgta aacagaagcc tgtttcagaa atctggagct 4381 ggcactgtgg agaccacaca ccctttggga aagctcttgt ctcttcttcc cccactacct 4441 cttatttatt tggtgtttgc ttgaatgctg gtactattgt gaccacaggc tggtgtgtag 4501 gtggtaaaac ctgttctcca taggagggaa ggagcagtca ctgggagagg ttacccgaga 4561 agcacttgag catgaggaac tgcaccttta ggccatctca gcttgctggg ccttttgtta 4621 aacccttctg tctactggcc tccctttgtg tgcatacgcc tcttgttcat gtcagcttat 4681 atgtgacact gcagcagaaa ggctctgaag gtccaaagag tttctgcaaa gtgtatgtga 4741 ccatcatttc ccaggccatt agggttgcct cactgtagca ggttctaggc taccagaaga 4801 ggggcagctt tttcatacca attccaactt tcaggggctg actctccagg gagctgatgt 4861 catcacactc tccatgttag taatggcaga gcagtctaaa cagagtccgg gagaatgctg 4921 gcaaaggctg gctgtgtata cccactaggc tgccccacgt gctcccgaga gatgacacta 4981 gtcagaaaag tggcagtggc agagaatcca aactcaacaa gtgctcctga aagaaatgct 5041 agaagcctaa gaactgtggt ctggtgttcc agctgaggca gggggatttg gtaggaagga 5101 gccagtgaac ttggctttcc tgtttctatc tttcattaaa aagaatagaa ggattcagtc 5161 ataaagaggt aaaaaactgt cacggtacga aatcttagtg cctacggagg cctcgagcag 5221 aaagaatgaa agtctttttt tttttttttt ttttttagca tggcaataaa tattctagca 5281 tccctaacta aaggggacta gacagttaga gactctgtca ccctagctat accagcagaa 5341 aacctgttca ggcaggcttt ctgggtgtga ctgattccca gcctgtggca gggcgtggtc 5401 ccaactactc agcctagcac aggctggcag ttggtactga attgtcagat gtggagtatt 5461 agtgacacca cacatttaat tcagctttgt ccaaaggaaa gcttaaaacc caatacagtc 5521 tagtttcctg gttccgtttt agaaaaggaa aacgtgaaca aacttagaaa gggaaggaaa 5581 tcccatcagt gaatcctgaa actggtttta agtgctttcc ttctcctcat gcccaagaga 5641 tctgtgccat agaacaagat accaggcact taaagccttt tcctgaattg gaaaggaaaa 5701 gaggcccaag tgcaaaagaa aaaacatttt agaaacggac agcttataaa aataaaggga 5761 agaaaggagg cagcatggag agaggcctgt gctagaagct ccatggacgt gtctgcacag 5821 ggtcctcagc tcatccatgc ggcctgggtg tccttttact cagctttata acaaatgtgg 5881 ctccaagctc aggtgccttt gagttctagg aggctgtggg ttttattcaa ctacggttgg 5941 gagaatgaga cctggagtca tgttgaaggt gcccaaccta aaaatgtagg ctttcatgtt 6001 gcaaagaact ccagagtcag tagttaggtt tggtttggtt ttggacatga taaacctgcc 6061 aagagtcaac aggtcacttg atcatgctgc agtgggtagt tctaaggatg gaaaggtgac 6121 agtattactc tcgagaggca attcagtcct gggcaaaggt attagtacaa taagcgttaa 6181 gggcagagtc taccttgaaa ccaattaagc agcttggtat tcataaatat tgggattgga 6241 tggcctccat ccagaaatca ctatgggtga gcatacctgt ctcagctgtt tggccaatgt 6301 gcataaccta ctcggatccc cacctgacac taaccagagt cagcacaggc cccgaggagc 6361 ccgaagttct ctgctgtgca gcatggaatt cctttaaaaa ggtgcactac agttttagcg 6421 gggaggggga taggaagacg cagagcaaat gagctccgga gtccctgcag gtgaataaac 6481 acacagatct gcatctgata gaactttgat ggattttcaa aaagccgttg acaaggctct 6541 gctatacagt ctataaaaat tgttattatg ggattggaag aaacacatgg tcatgaatag 6601 aaaaaaaaca aacccaaagg taggaaggtc aaggtcattt cttagatgga gaagttgtga 6661 aagatgtcct tggagatgag ttttaggacc agcattacta aggcaggtgg gcagacagtg 6721 acctctctag gtgtgtccac agagtttttc aggagagaaa actgcctgac ctttgggact 6781 aagctgcgga atcttcttac taagcttgaa gagtggagag gcgagaggtg agctactttg 6841 tgagccaaag cttatgtgac atggttgggg aaacagtcca aactgttctg agaaggtgaa 6901 ctgttacgac ccaggacaat tagaaaaatt cacccaccat gccgcacatt actgggtaaa 6961 agcagggcag cagggaacaa aactccagac tcttgggccg tccccatttg caacagcaca 7021 catagtttct ggtatatttg ttgggaaaga taaaactcta gcagttgttg aggggaggat 7081 gtataaaatg gtcatgggga tgaaaggatc tctgagacca cagaggctca gactcactgt 7141 taagaataga aaactgggta tgcgtttcat gtagccagca gaactgaagt gtgctgtgac 7201 aagccaatgt gaatttctac caaatagtag agcataccac ttgaagaagg aaagaaccga 7261 agagcaaaca aaagttctgc gtaatgagac tcaccttttc tcgctgaaag cactaagagg 7321 tgggaggagg cctgcacagg ctggaggagg gtttgggcag agcgaagacc cggccaggac 7381 cttggtgaga tggagtgccg cccacctcct gcggatactc ttggagagtt gttcccccag 7441 gggnctctgc cccacctgga gaaggaagct gcctggtgtg gagtgactca aatcagtata 7501 cctatctgct gcaccttcac tctccagggt acatgcttta aaaccgaccc gcaacaagta 7561 ttggaaaaat gtatccagtc tgaagatgtt tgtgtatctg tttacatcca gagttctgtg 7621 acacatgccc cccagattgc tgcaaagatc ccaaggcatt gattgcactt gattaagctt 7681 ttgtctgtag gtgaaagaac aagtttaggt cgaggactgg cccctaggct gctgctgtga 7741 cccttgtccc atgtggcttg tttgcctgtc cgggactctt cgatgtgccc aggggagcgt 7801 gttcctgtct cttccatgcc gtcctgcagt ccttatctgc tcgcctgagg gaagagtagc 7861 tgtagctaca agggaagcct gcctggaaga gccgagcacc tgtgcccatg gcttctggtc 7921 atgaaacgag ttaatgatgg cagaggagct tcctccccac ttcgcagcgc cacattatcc 7981 atcctctgag ataagtaggc tggtttaacc attggaatgg acctttcagt ggaaaccctg 8041 agagtctgag aacccccaga ccaacccttc cctccctttc cccacctctt acagtgtttg 8101 gacaggaggg tatggtgctg ctctgtgtag caagtacttt ggcttatgaa agaggcagcc 8161 acgcattttg cactaggaag aatcagtaat cacttttcag aagacttcta tggaccacaa 8221 atatattacg gaggaacaga ttttgctaag acataatcta gttttataac tcaatcatga 8281 atgaaccatg tgtggcaaac ttgcagttta aaggggtccc atcagtgaaa gaaactgatt 8341 ttttttaacg gactgctttt agttaaattg aagaaagtca gctcttgtca aaaggtctaa 8401 actttcccgc ctcaatccta aaagcatgtc aacaatccac atcagatgcc ataaatatga 8461 actgcaggat aaaatggtac aatcttagtg aatgggaatt ggaatcaaaa gagtttgctg 8521 tccttcttag aatgttctaa aatgtcaagg cagttgcttg tgtttaactg tgaacaaata 8581 aaaatttatt gttttgcact // LOCUS AF013249 1728 bp mRNA PRI 03-SEP-1997 DEFINITION Homo sapiens leukocyte-associated Ig-like receptor-1 (LAIR-1) mRNA, complete cds. ACCESSION AF013249 NID g2352940 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1728) AUTHORS Meyaard,L., Adema,G.J., Chang,C., Woollatt,E., Sutherland,G.R., Lanier,L.L. and Phillips,J.H. TITLE LAIR-1, a novel inhibitory receptor expressed on human mononuclear leukocytes JOURNAL Immunity (1997) In press REFERENCE 2 (bases 1 to 1728) AUTHORS Meyaard,L., Adema,G.J., Chang,C., Woollatt,E., Sutherland,G.R., Lanier,L.L. and Phillips,J.H. TITLE Direct Submission JOURNAL Submitted (09-JUL-1997) Immunobiology, DNAX Research Institute, 901 California Avenue, Palo Alto, CA 94304, USA FEATURES Location/Qualifiers source 1..1728 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="19" /map="19q13.4" /cell_line="IL-2 dependent polyclonal NK" gene 1..1728 /gene="LAIR-1" CDS 69..932 /gene="LAIR-1" /note="membrane glycoprotein" /codon_start=1 /product="leukocyte-associated Ig-like receptor-1" /db_xref="PID:g2352941" /translation="MSPHPTALLGLVLCLAQTIHTQEEDLPRPSISAEPGTVIPLGSH VTFVCRGPVGVQTFRLERESRSTYNDTEDVSQASPSESEARFRIDSVSEGNAGPYRCI YYKPPKWSEQSDYLELLVKETSGGPDSPDTEPGSSAGPTQRPSDNSHNEHAPASQGLK AEHLYILIGVSVVFLFCLLLLVLFCLHRQNQIKQGPPRSKDEEQKPQQRPDLAVDVLE RTADKATVNGLPEKDRETDTSALAAGSSQEVTYAQLDHWALTQRTARAVSPQSTKPMA ESITYAAVARH" BASE COUNT 437 a 496 c 428 g 367 t ORIGIN 1 aaaggctgca gagttctgtc cttgcattgg tgcgcctcag gccaggctgc actgctggga 61 cctgggccat gtctccccac cccaccgccc tcctgggcct agtgctctgc ctggcccaga 121 ccatccacac gcaggaggaa gatctgccca gaccctccat ctcggctgag ccaggcaccg 181 tgatccccct ggggagccat gtgactttcg tgtgccgggg cccggttggg gttcaaacat 241 tccgcctgga gagggagagt agatccacat acaatgatac tgaagatgtg tctcaagcta 301 gtccatctga gtcagaggcc agattccgca ttgactcagt aagtgaagga aatgccgggc 361 cttatcgctg catctattat aagcccccta aatggtctga gcagagtgac tacctggagc 421 tgctggtgaa agaaacctct ggaggcccgg actccccgga cacagagccc ggctcctcag 481 ctggacccac gcagaggccg tcggacaaca gtcacaatga gcatgcacct gcttcccaag 541 gcctgaaagc tgagcatctg tatattctca tcggggtctc agtggtcttc ctcttctgtc 601 tcctcctcct ggtcctcttc tgcctccatc gccagaatca gataaagcag gggcccccca 661 gaagcaagga cgaggagcag aagccacagc agaggcctga cctggctgtt gatgttctag 721 agaggacagc agacaaggcc acagtcaatg gacttcctga gaaggacaga gagacggaca 781 cctcggccct ggctgcaggg agttcccagg aggtgacgta tgctcagctg gaccactggg 841 ccctcacaca gaggacagcc cgggctgtgt ccccacagtc cacaaagccc atggccgagt 901 ccatcacgta tgcagccgtt gccagacact gaccccatac ccacctggcc tctgcacctg 961 agggtagaaa gtcactctag gaaaagcctg aagcagccat ttggaaggct tcctgttgga 1021 ttcctcttca tctagaaagc cagccaggca gctgtcctgg agacaagagc tggagactgg 1081 aggtttctaa ccagcatcca gaaggttcgt tagccaggtg gtcccttcta caatcgagca 1141 gctccttgga cagactgttt ctcagttatt tccagagacc cagctacagt tccctggctg 1201 tttctagaga cccagcttta ttcacctgac tgtttccaga gacccagcta aagtcacctg 1261 cctgttctaa aggcccagct acagccaatc agccgatttc ctgagcagtg atgccacctc 1321 caagcttgtc ctaggtgtct gctgtgaacc tccagtgacc ccagagactt tgctgtaatt 1381 atctgccctg ctgaccctaa agaccttcct agaagtcaag agctagcctt gagactgtgc 1441 tatacacaca cagctgagag ccaagcccag ttctctgggt tgtgctttac tccacgcatc 1501 aataaataat tttgaaggcc tcacatctgg cagccccagg cctggtcctg ggtgcatagg 1561 tctctcggac ccactctctg ccttcacagt tgttcaaagc tgagtgaggg aaacaggact 1621 tacgaaaacg tgtcagcgtt ttctttttaa aatttaattg atcaggattg tacgtaaaaa 1681 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaagg // LOCUS AF013263 7042 bp mRNA PRI 23-AUG-1997 DEFINITION Homo sapiens apoptotic protease activating factor 1 (Apaf-1) mRNA, complete cds. ACCESSION AF013263 NID g2330014 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7042) AUTHORS Zou,H., Henzel,W.J., Liu,X., Lutschg,A. and Wang,X. TITLE Apaf-1, a human protein homologous to C. elegans CED-4, participates in cytochrome c-dependent activation of caspase-3 JOURNAL Cell 90 (3), 405-413 (1997) MEDLINE 97410306 REFERENCE 2 (bases 1 to 7042) AUTHORS Zou,H., Henzel,H.J., Liu,X., Lutschg,A. and Wang,X. TITLE Direct Submission JOURNAL Submitted (09-JUL-1997) Biochemistry, University of Texas Southwestern Medical Center at Dallas, 5323 Harry Hines Blvd., Dallas, TX 75235, USA FEATURES Location/Qualifiers source 1..7042 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa S3" gene 1..7042 /gene="Apaf-1" CDS 578..4162 /gene="Apaf-1" /function="cytochrome c-dependent activation of caspase-3" /note="similar to C. elegans cell death gene ced-4" /codon_start=1 /product="apoptotic protease activating factor 1" /db_xref="PID:g2330015" /translation="MDAKARNCLLQHREALEKDIKTSYIMDHMISDGFLTISEEEKVR NEPTQQQRAAMLIKMILKKDNDSYVSFYNALLHEGYKDLAALLHDGIPVVSSSSVRTV LCEGGVPQRPVVFVTRKKLVNAIQQKLSKLKGEPGWVTIHGMAGCGKSVLAAEAVRDH SLLEGCFPGGVHWVSVGKQDKSGLLMKLQNLCTRLDQDESFSQRLPLNIEEAKDRLRI LMLRKHPRSLLILDDVWDSWVLKAFDSQCQILLTTRDKSVTDSVMGPKYVVPVESSLG KEKGLEILSLFVNMKKADLPEQAHSIIKECKGSPLVVSLIGALLRDFPNRWEYYLKQL QNKQFKRIRKSSSYDYEALDEAMSISVEMLREDIKDYYTDLSILQKDVKVPTKVLCIL WDMETEEVEDILQEFVNKSLLFCDRNGKSFRYYLHDLQVDFLTEKNCSQLQDLHKKII TQFQRYHQPHTLSPDQEDCMYWYNFLAYHMASAKMHKELCALMFSLDWIKAKTELVGP AHLIHEFVEYRHILDEKDCAVSENFQEFLSLNGHLLGRQPFPNIVQLGLCEPETSEVY QQAKLQAKQEVDNGMLYLEWINKKNITNLSRLVVRPHTDAVYHACFSEDGQRIASCGA DKTLQVFKAETGEKLLEIKAHEDEVLCCAFSTDDRFIATCSVDKKVKIWNSMTGELVH TYDEHSEQVNCCHFTNSSHHLLLATGSSDCFLKLWDLNQKECRNTMFGHTNSVNHCRF SPDDKLLASCSADGTLKLWDATSANERKSINVKQFFLNLEDPQEDMEVIVKCCSWSAD GARIMVAAKNKIFLWNTDSRSKVADCRGHLSWVHGVMFSPDGSSFLTSSDDQTIRLWE TKKVCKNSAVMLKQEVDVVFQENEVMVLAVDHIRRLQLINGRTGQIDYLTEAQVSCCC LSPHLQYIAFGDENGAIEILELVNNRIFQSRFQHKKTVWHIQFTADEKTLISSSDDAE IQVWNWQLDKCIFLRGHQETVKDFRLLKNSRLLSWSFDGTVKVWNIITGNKEKDFVCH QGTVLSCDISHDATKFSSTSADKTAKIWSFDLLLPLHELRGHNGCVRCSAFSVDSTLL ATGDDNGEIRIWNVSNGELLHLCAPLSEEGAATHGGWVTDLCFSPDGKMLISAGGYIK WWNVVTGESSQTFYTNGTNLKKIHVSPDFKTYVTVDNLGILYILQTLE" BASE COUNT 1985 a 1355 c 1580 g 2122 t ORIGIN 1 aagaagaggt agcgagtgga cgtgactgct ctatcccggg caaaagggat agaaccagag 61 gtggggagtc tgggcagtcg gcgacccgcg aagacttgag gtgccgcagc ggcatccgga 121 gtagcgccgg gctccctccg gggtgcagcc gccgtcgggg gaagggcgcc acaggccggg 181 aagacctcct ccctttgtgt ccagtagtgg ggtccaccgg agggcggccc gtgggccggg 241 cctcaccgcg gcgctccggg actgtggggt caggctgcgt tgggtggacg cccacctcgc 301 caaccttcgg aggtccctgg gggtcttcgt gcgccccggg gctgcagaga tccaggggag 361 gcgcctgtga ggcccggacc tgccccgggg cgaagggtat gtggcgagac agagccctgc 421 acccctaatt cccggtggaa aactcctgtt gccgtttccc tccaccggcc tggagtctcc 481 cagtcttgtc ccggcagtgc cgccctcccc actaagacct aggcgcaaag gcttggctca 541 tggttgacag ctcagagaga gaaagatctg agggaagatg gatgcaaaag ctcgaaattg 601 tttgcttcaa catagagaag ctctggaaaa ggacatcaag acatcctaca tcatggatca 661 catgattagt gatggatttt taacaatatc agaagaggaa aaagtaagaa atgagcccac 721 tcaacagcaa agagcagcta tgctgattaa aatgatactt aaaaaagata atgattccta 781 cgtatcattc tacaatgctc tactacatga aggatataaa gatcttgctg cccttctcca 841 tgatggcatt cctgttgtct cttcttccag tgtaaggaca gtcctgtgtg aaggtggagt 901 accacagagg ccagttgttt ttgtcacaag gaagaagctg gtgaatgcaa ttcagcagaa 961 gctctccaaa ttgaaaggtg aaccaggatg ggtcaccata catggaatgg caggctgtgg 1021 gaagtctgta ttagctgcag aagctgttag agatcattcc cttttagaag gttgtttccc 1081 agggggagtg cattgggttt cagttgggaa acaagacaaa tctgggcttc tgatgaaact 1141 gcagaatctt tgcacacggt tggatcagga tgagagtttt tcccagaggc ttccacttaa 1201 tattgaagag gctaaagacc gtctccgcat tctgatgctt cgcaaacacc caaggtctct 1261 cttgatcttg gatgatgttt gggactcttg ggtgttgaaa gcttttgaca gtcagtgtca 1321 gattcttctt acaaccagag acaagagtgt tacagattca gtaatgggtc ctaaatatgt 1381 agtccctgtg gagagttcct taggaaagga aaaaggactt gaaattttat ccctttttgt 1441 taatatgaag aaggcagatt tgccagaaca agctcatagt attataaaag aatgtaaagg 1501 ctctcccctt gtagtatctt taattggtgc acttttacgt gattttccca atcgctggga 1561 gtactacctc aaacagcttc agaataagca gtttaagaga ataaggaaat cttcgtctta 1621 tgattatgag gctctagatg aagccatgtc tataagtgtt gaaatgctca gagaagacat 1681 caaagattat tacacagatc tttccatcct tcagaaggac gttaaggtgc ctacaaaggt 1741 gttatgtatt ctctgggaca tggaaactga agaagttgaa gacatactgc aggagtttgt 1801 aaataagtct cttttattct gtgatcggaa tggaaagtcg tttcgttatt atttacatga 1861 tcttcaagta gattttctta cagagaagaa ttgcagccag cttcaggatc tacataagaa 1921 gataatcact cagtttcaga gatatcacca gccgcatact ctttcaccag atcaggaaga 1981 ctgtatgtat tggtacaact ttctggccta tcacatggcc agtgccaaga tgcacaagga 2041 actttgtgct ttaatgtttt ccctggattg gattaaagca aaaacagaac ttgtaggccc 2101 tgctcatctg attcatgaat ttgtggaata cagacatata ctagatgaaa aggattgtgc 2161 agtcagtgag aattttcagg agtttttatc tttaaatgga caccttcttg gacgacagcc 2221 atttcctaat attgtacaac tgggtctctg tgagccggaa acttcagaag tttatcagca 2281 agctaagctg caggccaagc aggaggtcga taatggaatg ctttacctgg aatggataaa 2341 caaaaaaaac atcacgaatc tttcccgctt agttgtccgc ccccacacag atgctgttta 2401 ccatgcctgc ttttctgagg atggtcagag aatagcttct tgtggagctg ataaaacctt 2461 acaggtgttc aaagctgaaa caggagagaa acttctagaa atcaaggctc atgaggatga 2521 agtgctttgt tgtgcattct ctacagatga cagatttata gcaacctgct cagtggataa 2581 aaaagtgaag atttggaatt ctatgactgg ggaactagta cacacctatg atgagcactc 2641 agagcaagtc aattgctgcc atttcaccaa cagtagtcat catcttctct tagccactgg 2701 gtcaagtgac tgcttcctca aactttggga tttgaatcaa aaagaatgtc gaaataccat 2761 gtttggtcat acaaattcag tcaatcactg cagattttca ccagatgata agcttttggc 2821 tagttgttca gctgatggaa ccttaaagct ttgggatgcg acatcagcaa atgagaggaa 2881 aagcattaat gtgaaacagt tcttcctaaa tttggaggac cctcaagagg atatggaagt 2941 gatagtgaag tgttgttcgt ggtctgctga tggtgcaagg ataatggtgg cagcaaaaaa 3001 taaaatcttt ttgtggaata cagactcacg ttcaaaggtg gctgattgca gaggacattt 3061 aagttgggtt catggtgtga tgttttctcc tgatggatca tcatttttga catcttctga 3121 tgaccagaca atcaggctct gggagacaaa gaaagtatgt aagaactctg ctgtaatgtt 3181 aaagcaagaa gtagatgttg tgtttcaaga aaatgaagtg atggtccttg cagttgacca 3241 tataagacgt ctgcaactca ttaatggaag aacaggtcag attgattatc tgactgaagc 3301 tcaagttagc tgctgttgct taagtccaca tcttcagtac attgcatttg gagatgaaaa 3361 tggagccatt gagattttag aacttgtaaa caatagaatc ttccagtcca ggtttcagca 3421 caagaaaact gtatggcaca tccagttcac agccgatgag aagactctta tttcaagttc 3481 tgatgatgct gaaattcagg tatggaattg gcaattggac aaatgtatct ttctacgagg 3541 ccatcaggaa acagtgaaag actttagact cttgaaaaat tcaagactgc tttcttggtc 3601 atttgatgga acagtgaagg tatggaatat tattactgga aataaagaaa aagactttgt 3661 ctgtcaccag ggtacagtac tttcttgtga catttctcac gatgctacca agttttcatc 3721 tacctctgct gacaagactg caaagatctg gagttttgat ctccttttgc cacttcatga 3781 attgaggggc cacaacggct gtgtgcgctg ctctgccttc tctgtggaca gtaccctgct 3841 ggcaacggga gatgacaatg gagaaatcag gatatggaat gtctcaaacg gtgagcttct 3901 tcatttgtgt gctccgcttt cagaagaagg agctgctacc catggaggct gggtgactga 3961 cctttgcttt tctccagatg gcaaaatgct tatctctgct ggaggatata ttaagtggtg 4021 gaacgttgtc actggggaat cctcacagac cttctacaca aatggaacca atcttaagaa 4081 aatacacgtg tcccctgact tcaaaacata tgtgactgtg gataatcttg gtattttata 4141 tattttacag actttagaat aaaatagtta agcattaatg tagttgaact ttttaaattt 4201 ttgaattgga aaaaaattct aatgaaaccc tgatatcaac tttttataaa gctcttaatt 4261 gttgtgcagt attgcattca ttacaaaagt gtttgtggtt ggatgaataa tattaatgta 4321 gctttttccc aaatgaacat acctttaatc ttgtttttca tgatcatcat taacagtttg 4381 tccttaggat gcaaatgaaa atgtgaatac ataccttgtt gtactgttgg taaaattctg 4441 tcttgatgca ttcaaaatgg ttgacataat taatgagaag aatttggaag aaattggtat 4501 tttaatactg tctgtattta ttactgttat gcaggctgtg cctcagggta gcagtggcct 4561 gctttttgaa ccacacttac cccaaggggg ttttgttctc ctaaatacaa tcttagaggt 4621 tttttgcact ctttaaattt gctttaaaaa tattgtgtct gtgtgcatag tctgcagcat 4681 ttcctttaat tgactcaata agtgagtctt ggatttagca ggccccccca cctttttttt 4741 ttgtttttgg agacagagtc ttgctttgtt gccaggctgg agtgcagtgg cgcgatctcg 4801 gctcaccaca atcgctgcct cctgggttca agcaattctc ctgcctcagc ctcccgagta 4861 gctgggacta caggtgtgcg cacatgccag gctaattttt gtatttttag tagagacggg 4921 gtttcaccat gttggccggg atggtctcga tctcttgacc tcatgatcta cccgccttgg 4981 cctcccaaag tgctgagatt acaggcgtga gccaccgtgc ctggccaggc cccttctctt 5041 ttaatggaga cagggtcttg cactatcacc caggctggag tgcagtggca taatcatacc 5101 tcattgcagc ctcagactcc tgggttcaag caatcctcct gcctcagcct cccaagtagc 5161 tgagactgca ggcacgagcc accacaccca gctaattttt aagttttctt gtagagacag 5221 ggtctcacta tgttgtctag gctggtcttg aactcttggc ctcaagtaat cctcctgcct 5281 cagcctccca aagtgttggg attgcagata tgagccactg gcctggcctt cagcagttct 5341 ttttgtgaag taaaacttgt atgttggaaa gagtagattt tattggtcta cccttttctc 5401 actgtagctg ctggcagccc tgtgccatat ctggactcta gttgtcagta tctgagttgg 5461 acactattcc tgctccctct tgtttcttac atatcagact tcttacttga atgaaacctg 5521 atctttccta atcctcactt ttttcttttt taaaaagcag tttctccact gctaaatgtt 5581 agtcattgag gtggggccaa ttttaatcat aagccttaat aagatttttc taagaaatgt 5641 gaaatagaac aattttcatc taattccatt tacttttaga tgaatggcat tgtgaatgcc 5701 attcttttaa tgaatttcaa gagaattctc tggttttctg tgtaattcca gatgagtcac 5761 tgtaactcta gaagattaac cttccagcca acctattttc ctttcccttg tctctctcat 5821 cctcttttcc ttccttcttt cctttctctt cttttatctc caaggttaat caggaaaaat 5881 agcttttgac aggggaaaaa actcaataac tagctatttt tgacctcctg atcaggaact 5941 ttagttgaag cgtaaatcta aagaaacatt ttctctgaaa tatattatta agggcaatgg 6001 agataaatta atagtagatg tggttcccag aaaatataat caaaattcaa agattttttt 6061 tgtttctgta actggaacta aatcaaatga ttactagtgt taatagtaga taacttgttt 6121 ttattgttgg tgcatattag tataactgtg gggtaggtcg gggagagggt aagggaatag 6181 atcactcaga tgtattttag ataagctatt tagcctttga tggaatcata aatacagtga 6241 atacaatcct ttgcattgtt aaggaggttt tttgttttta aatggtgggt caaggagcta 6301 gtttacaggc ttactgtgat ttaagcaaat gtgaaaagtg aaaccttaat tttatcaaaa 6361 gaaatttctg taaatggtat gtctccttag aatacccaaa tcataatttt atttgtacac 6421 actgttaggg gctcatctca tgtaggcaga gtataaagta ttaccttttg gaattaaaag 6481 ccactgactg ttataaagta taacaacaca catcaggttt taaaaagcct tgaatggccc 6541 ttgtcttaaa aagaaattag gagccaggtg cggtggcacg tgcctgtagt cccagctcct 6601 tgggaggctg agacaggagg attccttgag ccctggagtt tgagtccagc ctgggtgaca 6661 tagcaagacc ctgtcttaaa agaaaaatgg gaagaaagac aaggtaacat gaagaaagaa 6721 gagataccta gtatgatgga gctgcaaatt tcatggcagt tcatgcagtc ggtcaagagg 6781 aggattttgt tttgtagttt gcagatgagc atttctaaag cattttccct tgctgtattt 6841 ttttgtatta taaattacat tggacttcat atatataatt tttttttaca ttatatgtct 6901 cttgtatgtt ttgaaactct tgtatttatg atatagctta tatgattttt ttgccttggt 6961 atacatttta aaatatgaat ttaaaaaatt tttgtaaaaa taaaattcac aaaattgttt 7021 tgaaaaacaa aaaaaaaaaa aa // LOCUS AF013488 962 bp mRNA PRI 27-AUG-1997 DEFINITION Homo sapiens tubulin folding cofactor B mRNA, complete cds. ACCESSION AF013488 NID g2343184 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 962) AUTHORS Tian,G., Lewis,S.A., Feierbach,B., Stearns,T., Rommelaere,H., Ampe,C. and Cowan,N.J. TITLE Tubulin subunits exist in an activated conformational state generated and maintained by protein cofactors JOURNAL J. Cell Biol. 137 (1997) In press REFERENCE 2 (bases 1 to 962) AUTHORS Cowan,N.J. TITLE Direct Submission JOURNAL Submitted (10-JUL-1997) Biochemistry, NYU Medical Center, 550 First Avenue, New York, NY 10016, USA FEATURES Location/Qualifiers source 1..962 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 66..800 /function="chaperone protein; cofactor B binds to alpha-tubulin folding intermediates after their interaction with cytosolic chaperonin in the pathway leading from newly synthesised tubulin to properly folded heterodimer" /codon_start=1 /product="tubulin folding cofactor B" /db_xref="PID:g2343185" /translation="MEVTGVSAPTVTVFISSSLNTFRSEKRYSRSLTIAEFKCKLELL VGSPASCMELELYGVDDKFYSKLDQEDALLGSYPVDDGCRIHVIDHSGARLGEYEDVS RVEKYTISQEAYDQRQDTVRSFLKRRKLGRYNEEERAQQEAEAAQRLAEEKAQASSIP VGSRCEVRAAGQSPRRGTVMYVGLTDFKPGYWIGVRYDEPLGKNDGSVNGKRYFECQA KYGAFVKPAVVTVGDFPEEDYGLDEI" BASE COUNT 202 a 270 c 293 g 196 t 1 others ORIGIN 1 gcggcggctg cggagggntg gtgaggcggc tggaccggcc tgcaggcatc cgcaggcgcg 61 gcaagatgga ggtgacgggg gtgtcggcac ccacggtgac cgttttcatc agcagctccc 121 tcaacacctt ccgctccgag aagcgataca gccgcagcct caccatcgct gagttcaagt 181 gtaaactgga gttgctggtg ggcagccctg cttcctgcat ggaactggag ctgtatggag 241 ttgacgacaa gttctacagc aagctggatc aagaggatgc gctcctgggc tcctaccctg 301 tagatgacgg ctgccgcatc cacgtcattg accacagtgg cgcccgcctt ggtgagtatg 361 aggacgtgtc ccgggtggag aagtacacga tctcacaaga agcctacgac cagaggcaag 421 acacggtccg ctctttcctg aagcgcagaa agctcggccg gtacaacgag gaggagcggg 481 ctcagcagga ggccgaggcc gcccagcgcc tggccgagga gaaggcccag gccagctcca 541 tccccgtggg cagccgctgt gaggtgcggg cggcgggaca atcccctcgc cggggcaccg 601 tcatgtatgt aggtctcaca gatttcaagc ctggctactg gattggtgtc cgctatgatg 661 agccactggg gaaaaatgat ggcagtgtga atgggaaacg ctacttcgaa tgccaggcca 721 agtatggcgc ctttgtcaag ccagcagtcg tgacggtggg ggacttcccg gaggaggact 781 acgggttgga cgagatatga cacctaagga attcccctgc ttcagctcct agctcagcca 841 ctgactgccc ctcctgtgtg tgcccatggc ccttttctcc tgaccccatt ttaattttat 901 tcattttttc ctttgccatt gatttttgag actcatgcat taaattcact agaaacccaa 961 aa // LOCUS AF013591 2856 bp mRNA PRI 21-AUG-1997 DEFINITION Homo sapiens homolog of the Aspergillus nidulans sudD gene product mRNA, complete cds. ACCESSION AF013591 NID g2338557 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2856) AUTHORS May,G.S., Anaya,P. and Dai,C. TITLE sudD, an extragenic suppressor of the mitosis-defective bimD6 mutation of Aspergillus nidulans codes for a novel, evolutionarily conserved protein JOURNAL Unpublished REFERENCE 2 (bases 1 to 2856) AUTHORS May,G.S., Anaya,P. and Dai,C. TITLE Direct Submission JOURNAL Submitted (11-JUL-1997) Cell Biology, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..2856 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="18" CDS 94..1653 /codon_start=1 /product="homolog of the Aspergillus nidulans sudD gene product" /db_xref="PID:g2338558" /translation="MDLVGVASPEPGTAAAWGPSKCPWAIPQNTISCSLADVMSEQLA KELQLEEEAAVFPEVAVAEGPFITGENIDTSSDLMLAQMLQMEYDREYDAQLRREEKK FNGDSKVSISFENYRKVHPYEDSDSSEDEVDWQDTRDDPYRPAKPVPTPKKGFIGKGK DITTKHDEVVCGRKNTARMENFAPEFQVGDGIGMDLKLSNHVFNALKQHAYSEERRSA RLHEKKEHSTAEKAVDPKTRLLMYKMVNSGMLETITGCISTGKESVVFHAYGGSMEDE KEDSKVIPTECAIKVFKTTLNEFKNRDKYIKDDFRFKDRFSKLNPRKIHRMWAEKEMH NLARMQRAGIPCPTVVLLKKHILVMSFIGHDQVPAPKLKEVKLNSEEMKEAYYQTLHL MRQLYHECTLVHADLSEYNMLWHAGKVWLIDVSQSVEPTHPHGLEFLFRDCRNVSQFF QKGGVKEALSERELFNAVSGLNITADNEADFLAEIEALEKMNEDHVQKNGRKAASFLK DDGDPPLLYDE" BASE COUNT 898 a 525 c 638 g 795 t ORIGIN 1 gaattcgcgg ccgccccgcc tgtgtcctcg gcggagcctg ctgcccgtcc tgccacctct 61 ctgctctgtt cttgtctctg ccttcattcc cgaatggatc tggtaggagt ggcatcgcct 121 gagcccggga cggcagcggc ctggggaccc agcaagtgtc catgggctat tcctcaaaat 181 acaatatctt gttctttggc tgatgtaatg agtgaacagc tggccaaaga attgcagtta 241 gaagaagaag ctgccgtttt tcctgaagtt gctgttgctg aaggaccatt tattactgga 301 gaaaacattg atacttccag tgaccttatg ctggctcaga tgctacagat ggaatatgac 361 agagaatatg atgcacagct taggcgtgaa gaaaaaaaat tcaatggaga tagcaaagtt 421 tccatttcct ttgaaaatta tcgaaaagtg catccttatg aagacagcga tagctctgaa 481 gatgaggttg actggcagga tactcgtgat gatccctaca gaccagcaaa accggttccc 541 actcctaaaa agggctttat tggaaaagga aaagatatca ccaccaaaca tgatgaagta 601 gtatgtggga gaaagaacac agcaagaatg gaaaattttg cacctgagtt tcaggtagga 661 gatggaattg gaatggattt aaaactatca aaccatgttt tcaatgcttt aaaacaacat 721 gcctactcag aagaacgtcg aagtgcccgc ctacatgaga aaaaggagca ttctacagca 781 gaaaaagcag ttgatcctaa gacacgttta cttatgtata aaatggtcaa ctctggaatg 841 ttggagacaa tcactggctg tattagtaca ggaaaggagt ctgttgtctt tcatgcatat 901 ggagggagca tggaggatga aaaggaagat agtaaagtta tacctacaga atgtgccatc 961 aaggtattta aaacaaccct taatgaattt aagaatcgtg acaaatatat taaagatgat 1021 ttcaggttta aagatcgctt cagtaaacta aatccacgta agatccaccg catgtgggca 1081 gaaaaagaaa tgcacaatct cgcaagaatg cagagagctg gaattccttg tccaacagtt 1141 gtactactga agaaacacat tttagttatg tcttttattg gccatgatca agttccagcc 1201 cctaaattaa aagaagtaaa gctcaatagt gaagaaatga aagaagccta ctatcaaact 1261 cttcatttga tgcggcagtt atatcatgaa tgtacgcttg tccatgctga cctcagtgag 1321 tataacatgc tgtggcatgc tggaaaggtc tggttgatcg atgtcagtca gtcagtagaa 1381 cctacccacc ctcacggcct ggagttcttg ttccgggact gcaggaatgt ctcgcagttt 1441 ttccagaaag gaggagtcaa ggaagccctt agtgaacgag aactcttcaa tgctgtttca 1501 ggcttaaaca tcacagcaga taatgaagct gattttttag ctgagataga agctttggag 1561 aaaatgaatg aagatcacgt tcagaagaat ggaaggaaag ctgcttcatt tttgaaagat 1621 gatggagacc caccactact atatgatgaa tagcactaat acccactgct tcagtgttaa 1681 cacagcagtg attgtcagct gccaatagca aatgaagtta tgggtgactt gaaataccaa 1741 aacctgagga gtgggcaatg gtgcttctgt gcttttcccc cttgtaaccc atgtgccaga 1801 tgtgtggaat ttttagctca gcattgagag aataaaatgt cactacctct catcttatga 1861 acaggataat ataattcttt aacagctata ggttatctgg ctgaagtaga cctaatttta 1921 tgtgacttgt ggtgtaaaat gtcttgatga taatttttaa aacttgggta acacttccaa 1981 atatgggagg aaaggacaga tgtgtttaca agggaggatt ttacaacata cttgctttat 2041 tcacctccct gttttgtgtt gcgtctttcc ttgaatattt tattggccca gagttagcct 2101 ttctcaatta tgtttccaga ctgtggccgt gattctaaag gaaaatgtgt gctctttagt 2161 gggtagaaca aatggaaatt tggtttcaga atggctgaca gaaatcgaca taagtcatgt 2221 aatttttgtt gatatatcat gaaaatgaac agaattcttt ttccatactt atatctaaga 2281 aaaggcatca taggtttctg aaagagataa ctatataaca gctttttaac tatccagtca 2341 actttcagct tttctacatt taggtaaaat ggttaggata taactcatgg tgtggctaat 2401 ctacatttat caataaaatg taaattatct gaaaggacag aatataagat ttaaccatgt 2461 ttgacatatt ttaatttagt taatgaagca aaattcagtt tatatttcac tagaactgtg 2521 tacttgattg attttcagag aaatatcaca aattagaaat attaaatcta aggatgaaag 2581 gtatatataa aacaatttgg gggccaggca cgatggctca aacctgtaat cccagcactt 2641 tgggagacca aggcgggtgg atcacttgag gtcaggagtt caagaccagc ctgggcaaca 2701 tggcgaaacc ctgtctctac taaaaataca aaaattagcc gggtgtggtg gcacttctct 2761 gtaatctcag cttctcagga ggctgagaca ggagaatcgc ttgaacccgg gaggcagagg 2821 ttgcagtgag ctgagatcat gcgcggccgc gaattc // LOCUS AF013956 1867 bp mRNA PRI 09-OCT-1997 DEFINITION Homo sapiens Polycomb 2 homolog (hPc2) mRNA, complete cds. ACCESSION AF013956 NID g2317722 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1867) AUTHORS Satijn,D.P.E., Olson,D.J., van der Vlag,J., Hamer,K.M., Lambrechts,C., Masselink,H., Gunster,M.J., Sewalt,R.G.A.B., van Driel,R. and Otte,A.P. TITLE Interference with the expression of a novel human polycomb protein, hPc2, results in cellular transformation and apoptosis JOURNAL Mol. Cell. Biol. 17 (10), 6076-6086 (1997) MEDLINE 97459707 REFERENCE 2 (bases 1 to 1867) AUTHORS Satijn,D.P.E., Olson,D.J., Van der Vlag,J., Hamer,C.M., Lambrechts,A.C., Masselink,H., Gunster,M.J., Sewalt,R.G.A.B., Van Driel,R. and Otte,A.P. TITLE Direct Submission JOURNAL Submitted (14-JUL-1997) Biochemistry, E.C.Slater, Plantage Muidergracht 12, Amsterdam 1018 TV, The Netherlands FEATURES Location/Qualifiers source 1..1867 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..1867 /gene="hPc2" CDS 167..1843 /gene="hPc2" /function="repressor of gene activity" /note="chromatin associated protein" /codon_start=1 /product="Polycomb 2 homolog" /db_xref="PID:g2317723" /translation="MELPAVGEHVFAVESIEKKRIRKGRVEYLVKWRGWSPKYNTWEP EENILDPRLLIAFQNRERQEQLMGYRKRGPKPKPLVVQVPTFARRSNVLTGLQDSSTD NRAKLDLGAQGKGQGHQYELNSKKHHQYQPHSKEGKPRPPGKSGKYYYQLNSKKHHPY QPDPKMYDLQYQGGHKEAPSPTCPDLGAKSHPPDKWAQGAGAKGYLGAVKPLAGAAGA PGKGSEKGPPNGMMPAPKEAVTGNGIGGKMKIVKNKNKNGRIVIVMSKYMENGMQAVK IKSGEVAEGEARSPSHKKRAADERHPPADRTFKKAAGAEEKKVEAPPKRREEEVSGVS DPQPQDAGSRKLSPTKEAFGEQPLQLTTKPDLLAWDPARNTHPPSHHPHPHPHHHHHH HHHHHHAVGLNLSHVRKRCLSETHGEREPCKKRLTARSISTPTCLGGSPAAERPADLP PAAALRQPEVILLDSDLDEPIDLRSVKSRSEAGEPPSSLQVKPETPASAAVAVAAAAA PTTTAEKPPAEAQDEPAESLSEFKPFFGNIIITDVTANCLTVTFKEYVTV" BASE COUNT 400 a 626 c 635 g 206 t ORIGIN 1 gggcgagcgc gacgcgggag ccggggcggc gcgggcagcg cgggccggcc gggctgtgcg 61 gggcgagcgg cggcgcggcg ggggccttcg gccggggcgg cagctgggcg ccggcgggag 121 ctagcagcgt ctgcagccgc gcccggccag ccccctccgg ctcggcatgg agctgccagc 181 tgttggcgag cacgtcttcg cggtggagag catcgagaag aagcggatcc gcaagggcag 241 agtggagtat ctggtgaaat ggagaggctg gtcgcccaaa tataacacgt gggaaccgga 301 ggagaacatc ctggacccca ggctgctgat cgccttccag aacagggaac ggcaggagca 361 gctgatggga tatcggaaga gagggccgaa gcccaaaccg ctagtggtgc aggtgcctac 421 ctttgcccgt cgttccaatg tcctgaccgg cctccaggac tcctccactg acaaccgtgc 481 caagctggat ttgggcgcgc aggggaaggg ccaggggcat cagtacgagc tcaacagcaa 541 gaagcaccac cagtaccagc cgcacagcaa ggaggggaag ccccggccgc cgggcaagag 601 cggcaagtac tactaccagc tcaacagcaa gaagcaccac ccctaccagc ccgaccccaa 661 aatgtacgac ctgcagtacc agggcggcca caaggaggcg cccagcccca cctgcccgga 721 cctgggggcc aagagccacc cgcccgacaa gtgggcgcaa ggtgcggggg ccaaaggcta 781 cctgggggcg gtgaagcccc tggccggtgc ggcgggtgct ccaggcaaag gctccgagaa 841 gggccccccc aacggaatga tgccggcccc caaagaggct gtgacgggca acgggattgg 901 gggcaagatg aagatagtca agaacaagaa caagaacgga cgcatcgtga tcgtgatgag 961 caaatacatg gagaacggca tgcaggcggt gaagatcaag tccggcgagg tggcagaggg 1021 ggaggctcgc tcccccagcc acaagaagcg ggcagccgac gagcgccacc ctcctgccga 1081 caggactttt aaaaaggcgg cgggcgcaga ggagaagaag gtggaggcgc cgcccaagag 1141 gagggaggag gaggtgtccg gggttagcga tccgcagccc caggatgccg gctcccgcaa 1201 gctgtccccg accaaggagg cctttggaga gcagcccctg cagctcacca ccaagcccga 1261 cctgcttgcc tgggacccgg cccggaacac gcacccgccc tcacaccacc cgcacccgca 1321 cccccatcac caccaccacc accaccacca ccaccaccac gccgtcggcc tgaatctctc 1381 ccacgtgcgc aagcgctgcc tctccgagac ccacggcgag cgcgagccct gcaagaagcg 1441 gctgactgcg cgcagcatca gcacccccac ctgcctgggg ggcagcccag ccgctgagcg 1501 cccggccgac ctgccaccag ccgccgccct ccggcagccc gaggtcatcc tgctagactc 1561 agacctggat gaacccatag acttgcgctc ggtcaagagc cgcagcgagg ccggggagcc 1621 gcccagctcc ctccaggtga agcccgagac accggcgtcg gcggcggtgg cggtggcggc 1681 ggcagcggca cccaccacga cggcggagaa gcctccagcc gaggcccagg acgaacctgc 1741 agagtcgctg agcgagttca agcccttctt tgggaatata attatcaccg acgtcaccgc 1801 gaactgcctc accgttactt tcaaggagta cgtgacggtg tagccggagg gcgtcggaag 1861 gggcgcc // LOCUS AF013970 6406 bp mRNA PRI 22-JAN-1998 DEFINITION Homo sapiens MTG8-like protein (MTGR1) mRNA, complete cds. ACCESSION AF013970 NID g2801421 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6406) AUTHORS Kitabayashi,I., Ida,K., Morohoshi,F., Yokoyama,A., Mitsuhashi,N., Shimizu,K., Nomura,N., Hayashi,Y. and Ohki,M. TITLE The AML1-MTG8 leukemic fusion protein forms a complex with a novel member of the MTG8(ETO/CDR) family, MTGR1 JOURNAL Mol. Cell. Biol. 18 (2), 846-858 (1998) MEDLINE 98107670 REFERENCE 2 (bases 1 to 6406) AUTHORS Morohoshi,F., Kitabayashi,I., Mitsuhashi,N., Takahashi,E., Suzuki,M., Mitani,S. and Ohki,M. TITLE Cloning and mapping of human MTGR1 gene encoding a protein similar to MTG8 which is a fusion partner of AML1 in myeloid leukemia with t(8;21) JOURNAL Unpublished REFERENCE 3 (bases 1 to 6406) AUTHORS Morohoshi,F., Kitabayashi,I., Mitsuhashi,N., Takahashi,E., Suzuki,M., Mitani,S. and Ohki,M. TITLE Direct Submission JOURNAL Submitted (13-JUL-1997) Radiobiology, National Cancer Center Research Institute, 5-1-1, Tsukiji, Chuo-ku, Tokyo 104, Japan FEATURES Location/Qualifiers source 1..6406 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="20" /map="20q11.21-q11.23" gene 1..6406 /gene="MTGR1" CDS 11..1825 /gene="MTGR1" /codon_start=1 /product="MTG8-like protein" /db_xref="PID:g2801422" /translation="MAKESGISLKEIQVLARQWKVGPEKRVPAMPGSPVEVKIQSRSS PPTMPPLPPINPGGPRPVSFTPTALSNGINHSPPTLNGAPSPPQRFSNGPASSTSSAL TNQQLPATCGARQLSKLKRFLTTLQQFGNDISPEIGEKVRTLVLALVNSTVTIEEFHC KLQEATNFPLRPFVIPFLKANLPLLQRELLHCARAAKQTPSQYLAQHEHLLLNTSIAS PADSSELLMEVHGNGKRPSPERREENSFDRDTIAPEPPAKRVCTISPAPRHSPALTVP LMNPGGQFHPTPPPLQHYTLEDIATSHLYREPNKMLEHREVRDRHHSLGLNGGYQDEL VDHRLTEREWADEWKHLDHALNCIMEMVEKTRRSMAVLRRCQESDREELNYWKRRYNE NTELRKTGTELVSRQHSPGSADSLSNDSQREFNSRPGTGYVPVEFWKKTEEAVNKVKI QAMSEVQKAVAEAEQKAFEVIATERARMEQTIADVKRQAAEDAFLVINEQEESTENCW NCGRKASETCSGCNIARYCGSFCQHKDWERHHRLCGQNLHGQSPHGQGRPLLPVGRGS SARSADCSVPSPALDKTSATTSRSSTPASVTAIDTNGL" BASE COUNT 1582 a 1654 c 1574 g 1596 t ORIGIN 1 caccaggtga atggctaaag aatctggaat aagcttgaaa gaaatacagg tcctggcaag 61 gcaatggaaa gttggtcctg agaaaagggt gccagcgatg cctggatcgc ctgtggaagt 121 gaagatacag tccagatcct cacctcccac catgccaccc ctcccaccaa taaatcctgg 181 aggaccgagg ccagtgtcct tcactcctac tgcattaagc aatggcatca accattctcc 241 tcctaccctg aatggtgccc catcaccgcc acagagattc agcaatggtc ctgcctcctc 301 cacatcatct gcactcacaa atcagcaatt gccagccact tgtggtgctc gacaactcag 361 caagttgaaa cgctttctta ccactctgca acagtttggc aatgacatct cccctgagat 421 tggggagaag gtgcggactc ttgttcttgc actggtgaac tcaacagtga caattgagga 481 attccactgt aagctccaag aagccacaaa ctttcccctt cgtccttttg tgattccatt 541 tctcaaggcc aacctgcccc tgctgcagcg ggaactgctg cactgcgctc gggcggccaa 601 gcagacccca tcccagtacc tggctcagca cgaacacctt ctgctcaaca caagcattgc 661 atcgcctgct gactcgtcag agttgctcat ggaggtgcac ggaaatggga agaggcccag 721 tccagagagg agagaagaga atagttttga tagagacaca attgctcctg agcctcctgc 781 caagagagta tgtaccatca gccctgctcc tcggcacagt cctgctctca ctgtgcccct 841 catgaatccc gggggccaat tccatcctac ccctccacct cttcagcatt acaccttaga 901 ggatattgca acttctcacc tgtatcggga acccaacaag atgctagagc atcgagaagt 961 tcgtgataga caccacagtc ttggtctaaa tggaggctat caagatgagt tggtagatca 1021 tcgtttgaca gaaagggaat gggctgatga atggaaacat cttgaccatg cgctgaattg 1081 cattatggaa atggtagaga aaacaaggcg ctctatggca gttctgcggc gctgtcagga 1141 atcagatcgt gaagaactca actactggaa aagacggtac aatgaaaaca cagagctgag 1201 gaaaacgggg accgagttgg tctccaggca gcacagccct gggagtgcag attctctcag 1261 caatgattct cagagagagt tcaacagcag gccaggtaca ggatacgtac ctgtggagtt 1321 ttggaaaaaa acagaagaag ctgtgaataa ggtgaaaatt caggccatgt cagaagtaca 1381 gaaggccgtc gctgaggcag agcagaaagc ctttgaagtg attgcaacag agagagcacg 1441 aatggagcaa accatagcgg atgtcaagcg gcaggccgca gaggatgctt tcctcgtcat 1501 caatgagcaa gaggagtcca cggagaactg ctggaactgt ggccgcaaag ccagcgagac 1561 atgcagtggc tgcaatatcg cgcgatactg tggctctttc tgccagcaca aggactggga 1621 gcggcaccac cgcctctgtg gtcagaacct gcatggccag agcccccacg gccagggccg 1681 gccgctgctt cctgtaggca ggggctcctc tgccaggtcc gccgactgca gcgtgcccag 1741 cccagccctc gacaagacct cggcaaccac atcgcgttcc tcaacacctg cttctgtgac 1801 agctatcgac accaacggac tctgagcccc ggactctgct taccctgatg gctgctcagc 1861 accacagagt gcttgggctg agggactgac tgttggaacc cgtgcatgta gctgccgggt 1921 catcagcaag aaatgaattg gaggcaggaa gagtccaagc ctgaataata acaccccaca 1981 gcctctctgt gcacttgctg tctgcggagc cagtgtgcca ttctctgcac atgggcagcc 2041 agcctgagct gcctcctcca tggctttcct ggtttgttcc tctctccact gaagctgact 2101 tagccggccc cttttcagtg tagaccacca gctcccctcc ccatctcctt gagtcagcag 2161 actgtccaat gtgctcagcc aggctggagg cggcaggcgg caggcagcag gctgtggagg 2221 aggcccctcg gtcagggagc ggcctgcctc acccctgaat agctccttcg gccccatctc 2281 catcctcagc agatgacact gattggcctc acggggactt gggtaagcaa caggcggcat 2341 tcaggactct tctcaaccct gctgttcaga cttgataaga tctcagagtc cacaggaaag 2401 aagtcactgt tgcaataaaa gcacccgtag tagcaaaaac ataaacaaat aaaacttccc 2461 ccacatcaca gatgattttg gacaagattt tccaaccttg ctggctactt tagtttggga 2521 cctgtttttt ttctcatttg attttgcttg tgcagaaaat agtttccagc acatggattg 2581 atctgagaga gaatgaggct cagttgtgga tagtctgttt tctctgagca tgttggccaa 2641 ctagtatcgt caaattattg agtggatcat ctcttggaaa tgcagaactt ctgccaccac 2701 ttggctattt gcacagtcat cttgttctgt gtccttttat ctctcagacc acacacatct 2761 ggaacgctgt gggcatcttc tgcccatggg ctccatttgg cacctgctga gccacagttg 2821 tcctgctgga tgtgctgtgc aggttggtag gacttgcccc cactgtcaag gcctggtctc 2881 atctgaaaag ccctcctgga cctcaaagaa ttcttcagac ctcatagtta caggtcatta 2941 tatctactat gttgatttat catcaggcac acaacttctg tttccttctc ttgtgttatc 3001 tgatagcgtc cctccttgag ctcatcagaa aggttttatg aaatgtgaac cattttggga 3061 aaagctgatc aatttttctt cctagcttcc cattttcaaa tgggacatca ctcatatccc 3121 tttcagaatg ttaggaactg cctcccacat tcttccctgt ctttttgggt tttgtttttt 3181 gttgttgttg tggtttttta acacaaagcc tgggcaacag agcaaaactc tgtctcaaaa 3241 aaaaaaaaaa aaaggcaaaa ttaaagaatt tcaagtccct aatttactct aaccaagcat 3301 tttcagtctt acaaagaaat ctagtgcacg acacctttaa aatgcctaga tttccattgc 3361 ctaaagtaag gtgtgaccag tgaaacaaca gcagagaatc atgttccttt gctgtggaac 3421 acataagccc cacagttttc cagtcagcct aatacatggg tatcccccga ccccatctgc 3481 ctccttaagc cacagtcctt ggtggggaac tctaaggggg acggagtcag taccccggac 3541 agggccacat ttgcatgaga catggctgtc tcacaggtca cagagccatt acaacagctt 3601 actagttttt catggatttg ttggattaac accttaagca ggttttttat tgttattttg 3661 tttgtttgtt tgttttgaga cgggagtttc tctcttgtct cccaggctgg agtgcaatcg 3721 cgcaatctcg gctcacggta acctctgcct cccaggttcg ggcgagtctc ctgcctcagc 3781 cttctgagta gctgggattg caggcatgcg ccaccacgcc caactgattt tgtattttta 3841 gtggagacgg ggtttctcct tgttggtcag gctggtctca aaccccggac ctcaggtgat 3901 ccccccacct tggcctccca aagtgccagg attacaggca tgagccactg tgtccggtct 3961 tgagcaggtt tttaaaatct ccagtggcca taaaactagc cagggtagct catgctttta 4021 ctggttggaa taaggagcca aaaccaacaa aataatattt attagggttg gatgcccatt 4081 tctgtggtta aggcttaggt ttggtgtgca gagcctggcc tgctcttgtg ggctgtggtt 4141 gctgtgacag tttcgttttc tgaatctttg taatgctctt ccaccctgct cgtgctgccc 4201 agaggctgat ctgcaaacct gggcagtatt ccacactgaa gctcagtttg caaaccttct 4261 gctgtgttaa ttctggtcag tttcacacag aagtcacgag ggcctctgcc caggaccttc 4321 tattcacatt tcaagaatcc ctttctgaaa ttctccaact ctctggtcag gaggaggagc 4381 accattgggg cggggtaggc agctagaggc gcaggagcga atccaggttc cttgcgtggt 4441 tcccagcctc tgctgacagc acctcagcta ccaggtgggg agtggaaatt cagcctgctt 4501 agaggactga cctctacagt tcatatttgc agtgacataa aaggcaaaga ggagctggca 4561 tggtggtaca gaaggtatca gtcaagcgca ggctcttcag aaacctagaa actaaaaaga 4621 gatgcaacat gtgactggcc atcagccaga catgtctagg gtgagaccag tgctcagcac 4681 cacccgccaa ggaacactga tgagttgtgg gacattatgg tcggccaagt gaggaatgat 4741 tgcagggacc agagaagacc aaagatgcca gatgtcccca gagtttcttg agggctgggt 4801 ttcacgtctt tgcttgtcct ttttgtcccc acttgcctgc accgtgcctg actcatccag 4861 tggggaaggt gactcaagaa gttgtttaag cctaggccta ggcagtccac ttcgggaggt 4921 gggggcactc cacactgggg ggccctgcag ctccagggct ggtgctctag cagatccgct 4981 caggcacggt tgggtgaccg tggggtcacc gattgcgtag attatttcta gggcagtgct 5041 gggttagaaa tgtctgtctg gaaacaggac acatcttgcc accacaagtc cacacctggc 5101 cttggctgct gcccgctctc tgcagcgtgg atttcctttg cccatttggg ccagaaggtc 5161 accaggagtt catcactgct tagtgtcaac ttgacatctt cagcttccag acagaaaacc 5221 catgctttac tttgtacagc cctgtcccag agttagtaag cctcggcttg tcacctgctt 5281 gcaaagggta tagacatcat tctgggtagt tctagaggat atcgcaacaa gtgcgtttga 5341 aggttcagca cagctggaat tgtacggtag tgagctcttc aaaatgaaaa cagaatgacc 5401 ggatggccat ctgtcctgga aactgtagaa ttgaataaat cactgaaaat agaacacttc 5461 ttggaggctt tcttcagaac tgaactagaa acaaggtttc caggctccca ggtcagtaga 5521 ccaaaccaac ttccttagat tttttttttt aagctgtttg cagaaaactg ggtggtagcg 5581 tgtctgaacc aaacggaaga atagccagcg gagcatggac tgttccagca cgggccagat 5641 ggccagctgt cgagagcagc cctgcccaga ttgcggtggg ggctgtgcca gcggctccgc 5701 agggcccttc caactctgga gtgtctgtag ttcttgtgaa accaaaacgt tgactacctg 5761 cctctttcct cgggcatcga cgtgctcatt tccaaagatg atggtgcagg tgaccttttc 5821 catcgtgagc taagagaagg ttaggaggcc tgaggggcgg gccttgtgtc ccctcctccc 5881 cctccacagc ccctgaggtg gagcttccct cacagacccc atctggtcga gcctacggct 5941 gcccagtgta cactcagtga tgcttatacc aaatagggca tgtgcgctgg tgtctgttgg 6001 ggacagcccc atccctacct tggagtcggc agagaagcag caataagcat taggcgaatg 6061 attgaaaggg ctgctgcgcc aagaggcctt cagtcccctt aacgtggctt tccaacttcc 6121 tgacgaggct acaaaggctc cgctcacaaa agaaagccaa taatgtaaat aaatagaaaa 6181 cgaagcctcc acgcgtggtt tttaaagggg gatgatgaat gtgacaccac ccacagaatg 6241 cattagaatc tggtttttct gtgctttgtc attttactct gataggaatt tttgttacct 6301 aactctgtgc ataacttatt taatgtactg tataaatgaa ccaaaactgt taaatatgta 6361 tttagtttgt tctacttaaa gtagtcaata aaaaggctat attcct // LOCUS AF013988 1451 bp mRNA PRI 12-AUG-1997 DEFINITION Homo sapiens serine protease mRNA, complete cds. ACCESSION AF013988 NID g2318114 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1451) AUTHORS Little,S.P., Johnstone,E.M., Dixon,E.P., Norris,F., Buckley,W., Becker,G., Johnson,M.,., Dobbins,J.R., Wyrick,T., Miller,J.R., MacKellar,W., Hepburn,D., Corvalan,J., McClure,D., Liu,X., Stephenson,D. and Clemens,J. TITLE Zyme cDNA isolated from AD brain tissue JOURNAL J. Biol. Chem. (1997) In press REFERENCE 2 (bases 1 to 1451) AUTHORS Little,S.P., Johnstone,E.M. and Norris,F. TITLE Direct Submission JOURNAL Submitted (15-JUL-1997) CNS Division, Eli Lilly and Company, Lilly Corporate Center, Indianapolis, IN 46285, USA FEATURES Location/Qualifiers source 1..1451 /organism="Homo sapiens" /strain="human" /db_xref="taxon:9606" /chromosome="19" /map="19q13.3" /tissue_type="Alzheimer's disease brain tissue" CDS 147..881 /note="Zyme; protease bears homology to Kallikrein class and can be localized to microvessels and microglia; chymotrypsin-like" /codon_start=1 /product="serine protease" /db_xref="PID:g2318115" /translation="MKKLMVVLSLIAAAWAEEQNKLVHGGPCDKTSHPYQAALYTSGH LLCGGVLIHPLWVLTAAHCKKPNLQVFLGKHNLRQRESSQEQSSVVRAVIHPDYDAAS HDQDIMLLRLARPAKLSELIQPLPLERDCSANTTSCHILGWGKTADGDFPDTIQCAYI HLVSREECEHAYPGQITQNMLCAGDEKYGKDSCQGDSGGPLVCGDHLRGLVSWGNIPC GSKEKPGVYTNVCRYTNWIQKTIQAK" BASE COUNT 342 a 434 c 367 g 308 t ORIGIN 1 gtcgacccac gcgtccggct ggctggctcg ctctctcctg gggacacaga ggtcggcagg 61 cagcacacag agggacctac gggcagctgt tccttccccc gactcaagaa tccccggagg 121 cccggaggcc tgcagcagga gcggccatga agaagctgat ggtggtgctg agtctgattg 181 ctgcagcctg ggcagaggag cagaataagt tggtgcatgg cggaccctgc gacaagacat 241 ctcaccccta ccaagctgcc ctctacacct cgggccactt gctctgtggt ggggtcctta 301 tccatccact gtgggtcctc acagctgccc actgcaaaaa accgaatctt caggtcttcc 361 tggggaagca taaccttcgg caaagggaga gttcccagga gcagagttct gttgtccggg 421 ctgtgatcca ccctgactat gatgccgcca gccatgacca ggacatcatg ctgttgcgcc 481 tggcacgccc agccaaactc tctgaactca tccagcccct tcccctggag agggactgct 541 cagccaacac caccagctgc cacatcctgg gctggggcaa gacagcagat ggtgatttcc 601 ctgacaccat ccagtgtgca tacatccacc tggtgtcccg tgaggagtgt gagcatgcct 661 accctggcca gatcacccag aacatgttgt gtgctgggga tgagaagtac gggaaggatt 721 cctgccaggg tgattctggg ggtccgctgg tatgtggaga ccacctccga ggccttgtgt 781 catggggtaa catcccctgt ggatcaaagg agaagccagg agtctacacc aacgtctgca 841 gatacacgaa ctggatccaa aaaaccattc aggccaagtg accctgacat gtgacatcta 901 cctcccgacc taccacccca ctggctggtt ccagaacgtc tctcacctag accttgcctc 961 ccctcctctc ctgcccagct ctgaccctga tgcttaataa acgcagcgac gtgagggtcc 1021 tgattctccc tggttttacc ccagctccat ccttgcatca ctggggagga cgtgatgagt 1081 gaggacttgg gtcctcggtc ttacccccac cactaagaga atacaggaaa atcccttcta 1141 ggcatctcct ctccccaacc cttccacacg tttgatttct tcctgcagag gcccagccac 1201 gtgtctggaa tcccagctcc gctgcttact gtcggtgtcc ccttgggatg tacctttctt 1261 cactgcagat ttctcacctg taagatgaag ataaggatga tacagtctcc ataaggcagt 1321 ggctgttgga aagatttaag gtttcacacc tatgacatac atggaatagc acctgggcca 1381 ccatgcactc aataaagaat gaattttatt atgaaaaaaa aaaaaaaaaa aaaaaaaaaa 1441 agggcggccg c // LOCUS AF014118 1976 bp mRNA PRI 02-OCT-1997 DEFINITION Homo sapiens membrane-associated kinase (Myt1) mRNA, complete cds. ACCESSION AF014118 NID g2460022 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1976) AUTHORS Booher,R.N., Holman,P.H. and Fattaey,A. TITLE Human Myt1 is a cell cycle regulated kinase that inhibits Cdc2 but not Cdk2 activity JOURNAL J. Biol. Chem. 272 (1997) In press REFERENCE 2 (bases 1 to 1976) AUTHORS Booher,R.N. and Fattaey,A. TITLE Direct Submission JOURNAL Submitted (15-JUL-1997) Onyx Pharmaceuticals, 3031 Research Dr., Richmond, CA 94806, USA FEATURES Location/Qualifiers source 1..1976 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..1976 /gene="Myt1" CDS 327..1826 /gene="Myt1" /function="activates Cdc2 kinase by phosphorylating residues Thr14 and Tyr15" /codon_start=1 /product="membrane-associated kinase" /db_xref="PID:g2460023" /translation="MLERPPALAMPMPTEGTPPPLSGTPIPVPAYFRHAEPGFSLKRP RGLSRSLPPPPPAKGSIPISRLFPPRTPGWHQLQPRRVSFRGEASETLQSPGYDPSRP ESFFQQSFQRLSRLGHGSYGEVFKVRSKEDGRLYAVKRSMSPFRGPKDRARKLAEVGS HEKVGQHPCCVRLEQAWEEGGILYLQTELCGPSLQQHCEAWGASLPEAQVWGYLRDTL LALAHLHSQGLVHLDVKPANIFLGPRGRCKLGDFGLLVELGTAGAGEVQEGDPRYMAP ELLQGSYGTAADVFSLGLTILEVACNMELPHGGEGWQQLRQGYLPPEFTAGLSSELRS VLVMMLEPDPKLRATAEALLALPVLRQPRAWGVLWCMAAEALSRGWALWQALLALLCW LWHGLAHPASWLQPLGPPATPPDSPPCSLLLDSSFSSNWDDDSLGPSLSPEAVLARTV GSTSTPRSRCTPRDALDLSDINSEPPRGSFPSFEPRNLLSMFEDTLDPT" BASE COUNT 327 a 691 c 615 g 343 t ORIGIN 1 caggactccc gtgaggggga acggcccgtg aacgcgcgcg gagctgctcg cgccccgccc 61 agtcgcccca gggcttcccc acacccacgg agtgaagtca gccgcggccc tgcctgggag 121 gaacttaccg tctaccggga aaggtggcca gcagatgtgt cgggcctggt gagagggtga 181 ggcgagacgg cccgatcgcc cagggccccg gaagctgcgg aggtcacccc cgcctggcct 241 tagctcaggg acaccctgga ttcacgtggg agcccctgct cctgcctccc ccgtcccacc 301 actgaagctg ttgggccagg ccagtcatgc tagaacggcc tcctgcactg gccatgccca 361 tgcccacgga gggcaccccg ccacctctga gtggcacccc catcccagtc ccagcctact 421 tccgccacgc agaacctgga ttctccctca agaggcccag ggggctcagc cggagcctcc 481 cacctccgcc ccctgccaag ggcagcattc ccatcagccg cctcttccct cctcggaccc 541 caggctggca ccagctgcag ccccggcggg tgtcattccg gggcgaggcc tcagagactc 601 tgcagagccc tgggtatgac ccaagccggc cagagtcctt cttccagcag agcttccaga 661 ggctcagccg cctgggccat ggctcctacg gagaggtctt caaggtgcgc tccaaggagg 721 acggccggct ctatgcggta aagcgttcca tgtcaccatt ccggggcccc aaggaccggg 781 cccgcaagtt ggccgaggtg ggcagccacg agaaggtggg gcagcaccca tgctgcgtgc 841 ggctggagca ggcctgggag gagggcggca tcctgtacct gcagacggag ctgtgcgggc 901 ccagcctgca gcaacactgt gaagcctggg gtgccagcct gcctgaggcc caggtctggg 961 gctacctgcg ggacacgctg cttgccctgg cccatctgca cagccagggc ctggtgcacc 1021 ttgatgtcaa gcctgccaac atcttcctgg ggccccgggg ccgctgcaag ctgggtgact 1081 tcggactgct ggtggagctg ggtacagcag gagctggtga ggtccaggag ggagaccccc 1141 gctacatggc ccccgagctg ctgcagggct cctatgggac agcagcggat gtgttcagtc 1201 tgggcctcac catcctggaa gtggcatgca acatggagct gccccacggt ggggagggct 1261 ggcagcagct gcgccagggc tacctgcccc ctgagttcac tgccggtctg tcttccgagc 1321 tgcgttctgt ccttgtcatg atgctggagc cagaccccaa gctgcgggcc acggccgagg 1381 ccctgctggc actgcctgtg ttgaggcagc cgcgggcctg gggtgtgctg tggtgcatgg 1441 cagcggaggc cctgagccga gggtgggccc tgtggcaggc cctgcttgcc ctgctctgct 1501 ggctctggca tgggctggct caccctgcca gctggctaca gcccctgggc ccgccagcca 1561 ccccgcctga ctcaccaccc tgcagtttgc tcctggacag cagcttctcc agcaactggg 1621 atgacgacag cctagggcct tcactctccc ctgaggctgt cctggcccgg actgtgggga 1681 gcacctccac cccccggagc aggtgcacac ccagggatgc cctggaccta agtgacatca 1741 actcagagcc tcctcggggc tccttcccct cctttgagcc tcggaacctc ctcagcatgt 1801 ttgaggacac cctagaccca acctgagccc cagattctgc ctctgcactt ttaacctttt 1861 atcctgtgtc tctcccgtcg cccttgaaag ctggggcccc tcgggaactc ccatggtctt 1921 ctctgcctgg ccgtgtctaa taaaaagtat ttgaaccttg aaaaaaaaaa aagaag // LOCUS AF014398 1447 bp mRNA PRI 23-SEP-1997 DEFINITION Homo sapiens myo-inositol monophosphatase 2 mRNA, complete cds. ACCESSION AF014398 NID g2406665 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1447) AUTHORS Yoshikawa,T., Turner,G., Esterling,L.E. and Detera-Wadleigh,S.D. TITLE A novel human myo-inositol monophosphatase gene, IMP.18p, maps to a susceptibility region for bipolar disorder JOURNAL Mol. Psych. 2 (5), 393-397 (1997) REFERENCE 2 (bases 1 to 1447) AUTHORS Yoshikawa,T. TITLE Direct Submission JOURNAL Submitted (15-JUL-1997) Clinical Neurogenetics Branch, National Institute of Mental Health, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..1447 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="18" /map="18p11.2" CDS 142..1008 /note="phosphatase" /codon_start=1 /product="myo-inositol monophosphatase 2" /db_xref="PID:g2406666" /translation="MKPSGEDQAALAAGPWEECFQAAVQLALRAGQIIRKALTEEKRV STKTSAADLVTETDHLVEDLIISELRERFPSHRFIAEEAAASGAKCVLTHSPTWIIDP IDGTCNFVHRFPTVAVSIGFAVRQELEFGVIYHCTEERLYTGRRGRGAFCNGQRLRVS GETDLSKALVLTEIGPKRDPATLKLFLSNMERLLHAKAHGVRVIGSSTLALCHLASGA ADAYYQFGLHCWDLAAATVIIREAGGIVIDTSGGPLDLMACRVVAASTREMAMLIAQA LQTINYGRDDEK" BASE COUNT 317 a 380 c 447 g 303 t ORIGIN 1 gtgggacggg cggcggacta ggcacagagc tgcgggagca ggcacaggga gtgtggagcc 61 tggcggcggg acggcgggat ccggtgggag ccggagtccc gccgaggggg gctggaggtg 121 gaggggcccg gcgaggccgc gatgaagccg agcggcgagg accaggcggc gctggcggcc 181 ggcccctggg aggagtgctt ccaggcggcc gtgcagctgg cgctgcgggc aggacagatc 241 atcagaaaag cccttactga ggaaaaacgt gtctcaacaa aaacatcagc tgcagatctt 301 gtgacagaaa cagatcacct tgtggaagat ttaattattt ctgagttgcg agagaggttt 361 ccttcacaca ggttcattgc agaagaggcc gcggcttctg gggccaagtg tgtgctcacc 421 cacagcccga cgtggatcat cgaccccatc gacggcacct gcaattttgt gcacagattc 481 ccgactgtgg cggttagcat tggatttgct gttcgacaag agcttgaatt cggagtgatt 541 taccactgca cagaggagcg gctgtacacg ggccggcggg gtcggggcgc cttctgcaat 601 ggccagcggc tccgggtctc cggggagaca gatctctcaa aggccttggt tctgacagaa 661 attggcccca aacgtgaccc tgcgaccctg aagctgttcc tgagtaacat ggagcggctg 721 ctgcatgcca aggcgcatgg ggtccgagtg attggaagct ccacattggc actctgccac 781 ctggcctcag gggccgcgga tgcctattac cagtttggcc tgcactgctg ggatctggcg 841 gctgccacag tcatcatcag agaagcaggc ggcatcgtga tagacacttc gggtggaccc 901 ctcgacctca tggcttgcag agtggttgcg gccagcaccc gggagatggc gatgctcata 961 gctcaggcct tacagaccat taactatggg cgggatgatg agaagtgact gcggctgagg 1021 caaagctgct cccaaggcct ccctgggctg ctgtgggctc ctggggaggt ggccctcgtg 1081 gcccacgctc catgccagtg gctcacgctc tgctcctggc taccccagag ggagttgtca 1141 cgctacagtg agtggctggc cttttaaatc gacgtctctc tcaccaggat ttggtgttta 1201 gctgtttctc tctttaatct cacgtagcct ttttcaggtt agtacgtgtt cttctgtcag 1261 ggccaaaact caaatctcct gtgaaatacg tattgataat ccaatcttga tttttccccc 1321 cagaatataa atctcaggta ataaggcttt agaactgctg ataaagcgga tcgttctcag 1381 gccctccccc cggagtactt cagaatgcaa taaatcaaaa taatgggcaa aaaaaaaaaa 1441 aaaaaaa // LOCUS AF014404 1144 bp mRNA PRI 01-OCT-1997 DEFINITION Homo sapiens HIV-Nef associated acyl CoA thioesterase (hNAACTE) mRNA, complete cds. ACCESSION AF014404 NID g2318124 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1144) AUTHORS Watanabe,H., Shiratori,T., Shoji,H., Miyatake,S., Okazaki,Y., Ikuta,K., Sato,T. and Saito,T. TITLE A novel acyl-CoA thioesterase enhances its enzymatic activity by direct binding with HIV Nef JOURNAL Biochem. Biophys. Res. Commun. 238 (1), 234-239 (1997) MEDLINE 97445158 REFERENCE 2 (bases 1 to 1144) AUTHORS Watanabe,H. and Saito,T. TITLE Direct Submission JOURNAL Submitted (15-JUL-1997) Division of Molecular Genetics, Center for Biomedical Science, Chiba University School of Medicine, 1-8-1 Inohana, Chuo-ku, Chiba, Chiba 260, Japan FEATURES Location/Qualifiers source 1..1144 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Jurkat cells cDNA library (STRATAGENE #938201)" gene 1..1144 /gene="hNAACTE" CDS 31..990 /gene="hNAACTE" /function="thioester bond hydrolysis" /note="E. coli acyl-CoA thioesterase II homolog" /codon_start=1 /product="HIV-Nef associated acyl CoA thioesterase" /db_xref="PID:g2318125" /translation="MSSPQAPEDGQGCGDRGDPPGDLRSVLVTTVLNLEPLDEDLFRG RHYWVPAKRLFGGQIVGQALVAAAKSVSEDVHVHSLHCYFVRAGDPKLPVLYQVERTR TGSSFSVRSVKAVQHGKPIFICQASFQQAQPSPMQHQFSMPTVPPPEELLDCETLIDQ YLRDPNLQKRYPLALNRIAAQEVPIEIKPVNPSPLSQLQRMEPKQMFWVRARGYIGEG DMKMHCCVAAYISDYAFLGTALLPHQWQHKVHFMVSLDHSMWFHAPFRADHWMLYECE SPWAGGSRGLVHGRLWRQDGVLAVTCAQEGVIRVKPQVSESKL" BASE COUNT 261 a 337 c 325 g 221 t ORIGIN 1 gggtgtgcag ggcctgcagc attgaactag atgtcgtccc cgcaggcccc agaagatggg 61 cagggctgtg gcgaccgcgg cgatccccct ggggacctcc gtagcgtctt ggtcacgacc 121 gtgctcaacc tcgagccgct ggacgaggat ctcttcagag gaaggcatta ctgggtaccg 181 gccaagaggc tgtttggtgg tcagatcgtg ggccaggccc tggtggctgc agccaagtct 241 gtgagtgaag acgtccacgt gcactccctg cactgctact ttgttcgggc aggggacccg 301 aagctgccag tactgtacca agtggagcgg acacgaacag ggtcgagctt ctcggtgcgc 361 tctgtgaagg ccgtgcaaca tgggaagccc atcttcatct gccaggcctc cttccagcag 421 gcccagccca gccccatgca gcaccagttc tccatgccca ctgtgccacc accagaagag 481 ctgcttgact gtgagaccct cattgaccag tatttaaggg accctaacct ccaaaagagg 541 tacccattgg cgctcaaccg aattgctgct caggaggtcc ccattgagat caagccagta 601 aacccatccc ccctgagcca gctgcagaga atggagccca aacagatgtt ctgggtgcga 661 gcccggggct atattggcga gggcgacatg aagatgcact gctgcgtggc cgcctatatc 721 tccgactatg ccttcttggg cactgcactg ctgcctcacc agtggcagca caaggtgcac 781 ttcatggtct cactggacca ttccatgtgg ttccacgccc ccttccgagc tgaccactgg 841 atgctctatg aatgcgagag cccctgggcc ggtggctctc gggggctggt ccatgggcgg 901 ctgtggcgtc aggatggagt cctagctgtg acctgtgccc aggagggcgt gatccgagtg 961 aagccccagg tctcagagag caagctgtag ccagaggtac cagcttcgcc tggggcttca 1021 agaacctccc atttatcccc attcctgaga caggagttac agtccctttt ggccctcaca 1081 tccaataaag agactgatac cactgggaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1141 aaaa // LOCUS AF014459 2772 bp mRNA PRI 15-OCT-1997 DEFINITION Homo sapiens X-linked juvenile retinoschisis (XLRS1) mRNA, complete cds. ACCESSION AF014459 NID g2522418 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2772) AUTHORS Sauer,C.G., Gehrig,A., Warneke-Wittstock,R., Marquardt,A., Ewing,C.C., Gibson,A., Lorenz,B., Jurklies,B. and Weber,B.H. TITLE Positional cloning of the gene associated with X-linked juvenile retinoschisis JOURNAL Nature Genet. 17 (2), 164-170 (1997) MEDLINE 97467726 REFERENCE 2 (bases 1 to 2772) AUTHORS Sauer,C.G., Gehrig,A., Warneke-Wittstock,R., Marquardt,A., Ewing,C.C., Gibson,A., Lorenz,B., Jurklies,B. and Weber,B.H.F. TITLE Direct Submission JOURNAL Submitted (17-JUL-1997) Biozentrum, Institut fuer Humangenetik, Am Hubland, Wuerzburg 97074, Germany FEATURES Location/Qualifiers source 1..2772 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /map="Xp22.2" gene 1..2772 /gene="XLRS1" sig_peptide 16..84 /gene="XLRS1" CDS 16..690 /gene="XLRS1" /note="X-linked juvenile retinoschisis precursor protein; encodes cell adhesion motif" /codon_start=1 /db_xref="PID:g2522419" /translation="MSRKIEGFLLLLLFGYEATLGLSSTEDEGEDPWYQKACKCDCQG GPNALWSAGATSLDCIPECPYHKPLGFESGEVTPDQITCSNPEQYVGWYSSWTANKAR LNSQGFGCAWLSKFQDSSQWLQIDLKEIKVISGILTQGRCDIDEWMTKYSVQYRTDER LNWIYYKDQTGNNRVFYGNSDRTSTVQNLLRPPIISRFIRLIPLGWHVRIAIRMELLE CVSNCA" mat_peptide 85..687 /gene="XLRS1" /note="encodes cell adhesion motif; X-linked juvenile retinoschisis" 3'UTR 691..2772 /gene="XLRS1" BASE COUNT 740 a 680 c 691 g 656 t 5 others ORIGIN 1 gaggacgagg ggaagatgtc acgcaagata gaaggctttt tgttattact tctctttggc 61 tatgaagcca cattgggatt atcgtctacc gaggatgaag gcgaggaccc ctggtaccaa 121 aaagcatgca agtgcgattg ccaaggagga cccaatgctc tgtggtctgc aggtgccacc 181 tccttggact gtataccaga atgcccatat cacaagcctc tgggtttcga gtcaggggag 241 gtcacaccgg accagatcac ctgctctaac ccggagcagt atgtgggctg gtattcttcg 301 tggactgcaa acaaggcccg gctcaacagt caaggctttg ggtgtgcctg gctctccaag 361 ttccaggaca gtagccagtg gttacagata gatctgaagg agatcaaagt gatttcaggg 421 atcctcaccc aggggcgctg tgacatcgat gagtggatga ccaagtacag cgtgcagtac 481 aggaccgatg agcgcctgaa ctggatttac tacaaggacc agactggaaa caaccgggtc 541 ttctatggca actcggaccg cacctccacg gttcagaacc tgctgcggcc ccccatcatc 601 tcccgcttca tccgcctcat cccgctgggc tggcacgtcc gcattgccat ccggatggag 661 ctgctggagt gcgtcagcaa ctgtgcctga tgcctgcctc agctcggcgc ctgccagggg 721 gtgactggca cagagcgggc cgtagggacc ccctcacaca ccaccgagat ggacagggct 781 atatttcgca aagcaattgt aactgcagtg ctgggtagat aatttttttt tttttaagat 841 atagctttct gatttcaatg aaataaaaat gaacttattc cccactcagg gccagagaaa 901 gtcagaacaa agaaaatgtc cccgaaacga attttcttac aaaagcctaa gtagcagggg 961 taattttctg ctcatttttt gtctcagtga tactgtgaaa ggtgcagtct caggggaaca 1021 caaagcagcc ctgataattt gaaaattcat ttgctttacc acattcaaga tagaaacata 1081 cagtttccta aagcctggct ttgaatgcag aagggagcag ctcctcctag ttaagtttcc 1141 actaaatcat cgccaaagag gacttcacag ccctggggag gcanctgagg gtctcaaggg 1201 tgactgggtg gcacggatga atgcggtggg tgagaatccc ggtgccctga gaggctatac 1261 gtgacaaatg accaaaagcc caacgtaggg gagtttcctc tgctcacagt tcttaccttc 1321 aaggcggatc tgggcttcca ccctcatgaa cacagggatt ggggagggac cagagcgccc 1381 aatacacaca gctccattat gcaatccatt ccagcaaatt cccgtgtctg tggtcaccat 1441 ttaggtgatc atacaggaca ggctgcacat ctcagtatat gtagggaccc caaatgacca 1501 caacacagta caattgccct ttacctaggg ctaccatttc ctagcaaacc aaacatagtt 1561 cgagaacagc tggcccagga gctaccactg gctactcaga ggaggctcat tagctggcta 1621 catgcttcgc aggaagtggg aaggactcac atcataaaaa ggaccatgta gctttttccc 1681 tgaaagcttc tcaccctcca ccctctgcct tgcaatacgc aaactgcgcc tgctcctgaa 1741 aagctctctg ggaaggaatg ggcctggctt tccgttcctg gaggcggcgc ttagattggg 1801 aggcctcatt ggcacttaga gcgcacgctg agtttccagg ccccttcctg ggagaggctg 1861 ttaacacggg ggaggggcag gagagggata tggagagcag gtggtggaat cagaggacga 1921 ggctgctcta aagactgttc tggccccaga cacagggtag tctttgctag cagctcattt 1981 ccgagttact tttcattttc aaatgccaag gcaagtgact agactcgcgc taatacagtg 2041 ctggacaaca cattcacctt ttctgtgaac aggcagcctt ctaaaagccc caaacatcct 2101 tcttgatgct ttgggggctc aattatttta tatccaaccc cagcatnttt ntagtcccta 2161 tgctgtatgc ttgaacttcg gaaaatgctt tttccccgcc caatnttctc ttcaaatata 2221 aacacatcac aacagggtgt tgggggtggg gggggggtgg ggggacttat ccctggcctt 2281 aggacacagg acaaatctat tttggataga aatgcctgaa cagagaccct tattggaaag 2341 ggaattaact ttggtcacga catggactgt cagacaaaat ggcagtatcc taagagttaa 2401 ggcacatcaa acacaggagt cgagagaggc agttcaggga aaaaggagag gaggaaacna 2461 gtgaggcagg gagaaagctt tccaaataag agttcatgtt ggaaactttt gtcacggctt 2521 tattgagatt aagttcacat acaatttgta tccatttaaa gtgtacaatt tgatgacttt 2581 tggtatattc agagttgtgc aaccattatc actagatcaa ttttagaaag tttatcaccc 2641 caaagagaaa tcctgcaccc atcagccaac actccccaac ccatcggcca ccccaagccc 2701 tctgcaacca cgaatcgact gtctctgtag attggccttc tggacgttct acataaatga 2761 aatcatatag ta // LOCUS AF014643 3528 bp DNA PRI 02-JAN-1998 DEFINITION Homo sapiens connexin46.6 (Cx46.6) gene, complete cds. ACCESSION AF014643 NID g2738576 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3528) AUTHORS Bloemker,B.K., Swaroop,A. and Kimberling,W.J. TITLE Cloning and Molecular Characterization of Human Connexin46.6, a New Gap Junction Gene JOURNAL Unpublished REFERENCE 2 (bases 1 to 3528) AUTHORS Bloemker,B.K., Swaroop,A. and Kimberling,W.J. TITLE Direct Submission JOURNAL Submitted (02-JUL-1997) Genetics Dept., Boystown National Research Hospital, 555 N 30th St., Omaha, NE 68131, USA FEATURES Location/Qualifiers source 1..3528 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1q41-q42" repeat_region 764..834 /note="9-22-9-22-9 bp repeat" mRNA <1386..3472 /gene="Cx46.6" /product="connexin46.6" gene <1386..3472 /gene="Cx46.6" CDS 1414..2724 /gene="Cx46.6" /note="gap junction protein" /codon_start=1 /product="connexin46.6" /db_xref="PID:g2738577" /translation="MSWSFLTRLLEEIHNHSTFVGKVWLTVLVVFRIVLTAVGGEAIY SDEQAKFTCNTRQPGCDNVCYDAFAPLSHVRFWVFHIVVISTPSVMYLGYAVHRLARA SEQERRRALRRRPGPRRAPRAHLPPPHAGWPEPADLGEEEPMLGLGEEEEEEETGAAE GAGEEAEEAGAEEACTKAVGADGKAAGTPGPTGQHDGRRRIQREGLMRVYVAQLVARA AFEVAFLVGQYLLYGFEVRPFFPCSRQPCPQVVDCFVSRPTEKTVFLLVMYVVSCLCL LLNLCEMAHLGLGSAQDAVRGRRGPPASAPAPAPRPPPCAFPAAAAGLACPPDYSLVV RAAERARAHDQNLANLALQALRDGAAAGDRDRDSSPCVGLPAASRGPPRAGAPASRTG SATSAGTVGEQGRPGTHERPGAKPRAGSEKGSASSRDGKTTVWI" polyA_signal 3451..3456 /gene="Cx46.6" BASE COUNT 551 a 1082 c 1257 g 638 t ORIGIN 1 gcatgcacgt gtacatgtgt atgcatgtgt gtgcatgggt atctgcatat gcatgtgcgt 61 gttcatgcat gtgcacgtgt gtgcatgtgt acacatgtga atctgtgggt atgtatctga 121 gtgtgtgtgc acatgtgaat gtgtgtggct ctgaggggta ctgcacagat gtgaggaagc 181 agggccactt ccccaggaac cctgactgcc ccctcttcct gccccgggtg gggaggcctc 241 agctgcataa agaggccggg agcctcacca gcgactgctg ctggtcccac acctgccgct 301 gctgcctccc aggccaggta ccgggagggg gccagaggcg gagggagcta aggggtctcc 361 tgcctcagcg acccaggagc aggtactggc cctggggcaa ccgccagcag agggtgggca 421 ggggagctgc aggagctctc cttctttgga gcacaggccc tgctgcacag ccctttcctg 481 ggcacttgcc caccttgggc ttggctggtc tgcggcatag ctgtctctga gggtcgcagg 541 tgctgagtgt ggcctcacat cactgggtct ataacctcgc tggacaccgt ccctcctgga 601 cggacgactg gcttcatcct gaccccagct agagatggtc tgggttgaga ccatggagga 661 ccaggaacct agactgggcg ggcggagccc tgggaccctg ggcacctgag aagggcagcg 721 ggaccagccg ggggctggag ggaggatgga ggattttgtg gaggtggagg gaccccgacg 781 cccctgtcca gggtgtggag ggaccccgac gcccctgtcc agggtgtgga gggaggcaca 841 tcatggcctc tgggggccga ggggctatgg ggattgtgga ctggtggcac tttggggtct 901 gggacctgat ggtggtgcac gcacgggggg ctgtgaagga ctgatggctg atggggctgc 961 tcgggcttgg gggctaggga gctggagggg ccacgaggga ttgagtggct acgcaggctg 1021 gggagaccct ttggatgctg aaactctgtg ggaccctccc caacaacctg gatgcggctg 1081 gggctgggtt ggtggccatg ggtgcactgg acactgatac caccagtccc cacacacttg 1141 agtggtgctt gcctcagtgt ccccatctgc ctcatgaagg caactcaccc acctggggcc 1201 ctgcatctgc accaccatgg gccggatcac gtgggggctt ccacttccgt ttaaggcggt 1261 aagctccacg tcattgactg tgtaagcaga gaggggccag ctgccatgca agcctggagc 1321 cccggctctg agcgccgcgg gctcctaagt gcaggcccct ggctgacccc taccccgccc 1381 cacaggaccc gcccgcccgc ccctatgacc aacatgagct ggagcttcct gacgcggctg 1441 ctggaggaga tccacaacca ctccaccttc gtgggcaagg tgtggctcac ggtgctggtg 1501 gtcttccgca tcgtgctgac ggctgtgggc ggcgaggcca tctactcgga cgagcaggcc 1561 aagttcactt gcaacacgcg gcagccaggc tgcgacaacg tctgctatga cgccttcgcg 1621 cccctgtcgc acgtgcgctt ctgggtcttc catattgtgg tcatctccac tccctcggtc 1681 atgtacctgg gctacgccgt gcaccgcctg gcccgtgcgt ctgagcagga gcggcgccgc 1741 gccctccgcc gccgcccggg gccacgccgc gcgccccgag cgcacctgcc gcccccgcac 1801 gccggctggc ctgagcccgc cgacctgggc gaggaggagc ccatgctggg cctgggcgag 1861 gaggaggagg aggaggagac gggggcagcc gagggcgccg gcgaggaagc ggaggaggca 1921 ggcgcggagg aggcgtgcac taaggcggtc ggcgctgacg gcaaggcggc agggaccccg 1981 ggcccgaccg ggcaacacga tgggcggagg cgcatccagc gggagggcct gatgcgcgtg 2041 tacgtggccc agctggtggc cagggcagct ttcgaggtgg ccttcctggt gggccagtac 2101 ctgctgtacg gcttcgaggt gcgaccgttt tttccctgca gccgccagcc ctgcccgcaa 2161 gtggtggact gcttcgtgtc gcgccctact gaaaagacgg ttttcctgct ggttatgtac 2221 gtggtcagct gcctgtgcct gctgctcaac ctctgtgaga tggcccacct gggcttgggc 2281 agcgcgcagg acgcggtgcg cggccgccgc ggccccccgg cctccgcccc cgcccccgcg 2341 ccgcggcccc cgccctgcgc cttccctgcg gcggccgctg gcttggcctg cccgcccgac 2401 tacagcctgg tggtgcgggc ggccgagcgc gctcgggcgc atgaccagaa cctggcaaac 2461 ctggccctgc aggcgctgcg cgacggggca gcggctgggg accgcgaccg ggacagttcg 2521 ccgtgcgtcg gcctccctgc ggcctcccgg gggcccccca gagcaggcgc ccccgcgtcc 2581 cggacgggca gtgctacctc tgcgggcact gtcggggagc agggccggcc cggcacccac 2641 gagcggccag gagccaagcc cagggctggc tccgagaagg gcagtgccag cagcagggac 2701 gggaagacca ccgtgtggat ctgagggcgc tggcttgcga gctgggccag ggaagaggag 2761 ggttgggggg ctccggtgga aacctgcgac cccttctcct cagccttctc cttagccggt 2821 ggcctcaggc agactctgcc cagaggggca gccaggctgc tcagggaagg ggctgaaagc 2881 ggcagaggag tgccctggct tggtcaccac tggggccaag gtggggtgga gagaggccta 2941 cgagccagaa agggccctct gctgtggtct gaaccccagg gggagtgggg cattgactcc 3001 acccctgtcc tgagctggaa taggtcctct gggatgccag ctctcccctt tgtgcttccc 3061 tgcagcaacc catggagggc ccagggtgcc tggtatgggc atcagttggt gggggtgcgg 3121 gggtgcgtgt ccccattccc tgcaacagca aatggggctc cttcttcagc cctccccttc 3181 ccagccccaa actgagacag actgggagct gggagcctgg ggtggacagg accataccct 3241 ctttgagctt ctgcgatgcc ggccttccgt tcctctggga ggcttgaagt tctgcagaga 3301 tgttgatatg ccttgcagct tggacccaat gggtggtggt cagggcctgg gggcttggcc 3361 atgctggggg aatggggctc tgggttcctg cctgtggcct gtctgtcctc ctccctaatt 3421 cagacccagc ctcaagagga aagggagtaa aataaaacta acttgtttat aaccttgtgt 3481 gtgcatgtgt atgcatgtgc acgtgtggct atgtgtgtga ctgcatgc // LOCUS AF014807 1833 bp mRNA PRI 06-JAN-1998 DEFINITION Homo sapiens phosphatidylinositol synthase (PIS) mRNA, complete cds. ACCESSION AF014807 NID g2338731 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1833) AUTHORS Lykidis,A., Jackson,P.D., Rock,C.O. and Jackowski,S. TITLE The role of CDP-diacylglycerol synthetase and phosphatidylinositol synthase activity levels in the regulation of cellular phosphatidylinositol content JOURNAL J. Biol. Chem. 272 (52), 33402-33409 (1997) MEDLINE 98070552 REFERENCE 2 (bases 1 to 1833) AUTHORS Lykidis,A., Jackson,P.D., Rock,C.O. and Jackowski,S. TITLE Direct Submission JOURNAL Submitted (16-JUL-1997) Department of Biochemistry, St. Jude Children's Research Hospital, 332 North Lauderdale, Memphis, TN 38101-3018, USA FEATURES Location/Qualifiers source 1..1833 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..1833 /gene="PIS" CDS 348..989 /gene="PIS" /codon_start=1 /product="phosphatidylinositol synthase" /db_xref="PID:g2338732" /translation="MPDENIFLFVPNLIGYARIVFAIISFYFMPCCPLTASSFYLLSG LLDAFDGHAARALNQGTRFGAMLDMLTDRCSTMCLLVNLALLYPGATLFFQISMSLDV ASHWLHLHSSVVRGSESHKMIDLSGNPVLRIYYTSRPALFTLCAGNELFYCLLYLFHF SEGPLVGSVGLFRMGLWVTAPIALLKSLISVIHLITAARNMAALDAADRAKKK" BASE COUNT 311 a 567 c 544 g 411 t ORIGIN 1 gaattcggca cgaggcgcgg aggaggaggt tcccggaagc cacgcgcact gggagcagcg 61 gcgaccgcag ctggaggccc ggagcgcctg cggggctggc agaggcgagg gaggttgcgg 121 gtaggaaggg cgggactgcg cgcgccccct gcgtcccgcg cacctcgggg ccggtccatg 181 ctcccgacgg ctgcgggctt cagcatctgg ggccaggttg gggcggcggg gtccagggcg 241 cagtggtgcg gccgatgcgc cggggccgga gctgaaggcc gcgctgcggg gctgggacag 301 cactggcatc tccagagcag gcccggggca gcaagggagg cgccgcgatg ccagacgaaa 361 atatcttcct gttcgtgccc aacctcatcg gttatgcccg gattgtcttc gccatcattt 421 ctttctactt catgccctgc tgccccctca cggcctcctc cttctacctg ctcagcggcc 481 tgctggacgc tttcgatgga cacgctgctc gcgctcttaa tcaaggaacc cggtttgggg 541 ccatgctgga catgctgacg gaccgctgct ccaccatgtg cctgttggtc aacctggccc 601 tgctgtaccc tggagccacg ctgttcttcc aaatcagcat gagtttggat gtggccagtc 661 actggctgca cctccacagt tctgtggtcc gaggcagtga gagtcacaag atgatcgact 721 tgtccgggaa tccggtgctt cggatctact acacctcgag gcctgctctg ttcaccttgt 781 gtgctgggaa tgagctcttc tactgcctcc tctacctgtt ccatttctct gagggacctt 841 tagttggctc tgtgggactg ttccggatgg gcctctgggt cactgccccc atcgccttgc 901 tgaagtcgct catcagcgtc atccacctga tcacggccgc ccgcaacatg gctgccctgg 961 acgcagcaga ccgcgccaag aagaagtgac gctggagccc cgggtcctgg ctgcccacct 1021 gccctgggag tcttgctgtg ccacacagct ccccaccccc tgctaggagg tcccagtctc 1081 acgccttcct catgtgttgt tctacctgct gggatggggg tcagcctctc tttggtgacg 1141 tcacgttctc tgggatcctg aggacccggg cctcaaatca gggaggatac gcgggaggcc 1201 ccctccatcc aggcggtgct cctggggtgc cgggaccggg cagtgtcaca ccctgcctgc 1261 tcagtcctgg ggtccgagat gctagggacg cttgagtgag ggaggtggtg tgagggccag 1321 gtttcctgaa aggcgggagt cagacctccg cccccagcca gagcaagctt ggggcaccat 1381 gcccaggagg gaagaagcca tccacagcct tccctgtcac cggctcctct gtcctgcctg 1441 accctggtcc tggcgggact tcactatttg acttggtttc ctttcagata ttcttggctc 1501 agggcctggg ttgagggagc ttagggaagg acgtccgtct gggtgctttt cctccagttt 1561 gctggctggc ttctccgtct acccacagtg acctcacaga gaggccctcc tgccacccat 1621 gctcatgtgg tgtccccacc gcccacttgt ttgatgtcac tgactgtcta catgtattta 1681 tattcttgat attttctacc ctcactagaa tgtaaactcc atgaaggcac agacttttct 1741 tgttctcttc tctatcccta gagtaagacc aacttgaacc tggcatatag tagctgctta 1801 ataaatactc gtctgtcaaa aaaaaaaaaa aaa // LOCUS AF014837 1984 bp DNA PRI 02-OCT-1997 DEFINITION Homo sapiens m6A methyltransferase (MT-A70) gene, complete cds. ACCESSION AF014837 NID g2460036 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1984) AUTHORS Bokar,J.A., Shambaugh,M.E., Polayes,D., Matera,A.G. and Rottman,F.M. TITLE Purification and cDNA cloning of the AdoMet-binding subunit of the human mRNA (N6-adenosine-)-methyltransferase JOURNAL Unpublished REFERENCE 2 (bases 1 to 1984) AUTHORS Bokar,J.A., Shambaugh,M.E. and Rottman,F.M. TITLE Direct Submission JOURNAL Submitted (18-JUL-1997) Molecular Biology, Case Western Res. University, 10900 Euclid Ave., Cleveland, OH 44106, USA FEATURES Location/Qualifiers source 1..1984 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" gene 119..1858 /gene="MT-A70" CDS 119..1858 /gene="MT-A70" /note="AdoMet-binding subunit; (N6-adenosine)-methyltransferase" /codon_start=1 /product="m6A methyltransferase" /db_xref="PID:g2460037" /translation="MSDTWSSIQAHKKQLDSLRERLQRRRKQDSGHLDLRNPEAALSP TFRSDSPVPTAPTSGGPKPSTASAVPELATDPELEKKLLHHLSDLALTLPTDAVSICL AISTPDAPATQDGVESLLQKFAAQELIEVKRGLLQDDAHPTLVTYADHSKLSAMMGAV AEKKGPGEVAGTVTGQKRRAEQDSTTVAAFASSLVSGLNSSASEPAKEPAKKSRKHAA SDVDLEIESLLNQQSTKEQQSKKVSQEILELLITTTAKEQSIVEIRSRGRAQVQEFCD YGTKEECMKASDADRPCRKLHFRRIINKHTDESLGDCSFLNTCFHMDTCKYVHYEIDA CMDSEAPGSKDHTPSQELALTQSVGGDSSADRLFPPQWICCDIRYLVVSILGKFAVVM ADPPWDIHMELPYGTLTDDEMRRLNIPVLQDDGFLFLWVTGRAMELGRECLNLWGYER VDEIIWVKTNQLQRIIRTGRTGHWLNHGKEHCLVGVKGNPQGFNQGLDCDVIVAEVRS TSHKPDEIYGMIERLSPGTRKIELFGRPHNVQPNWITLGNQLDGIHLLDPDVVARFKQ RYPDGIISKPKNL" BASE COUNT 531 a 477 c 509 g 467 t ORIGIN 1 cattttccgg ttagccttcg gggtgtccgc gtgagaattg gctatatcct ggagcgagtg 61 ctgggaggtg ctagtccgcc gcgccttatt cgagaggtgt cagggctggg agactaggat 121 gtcggacacg tggagctcta tccaggccca caagaagcag ctggactctc tgcgggagag 181 gctgcagcgg aggcggaagc aggactcggg gcacttggat ctacggaatc cagaggcagc 241 attgtctcca accttccgta gtgacagccc agtgcctact gcacccacct ctggtggccc 301 taagcccagc acagcttcag cagttcctga attagctaca gatcctgagt tagagaagaa 361 gttgctacac cacctctctg atctggcctt aacattgccc actgatgctg tgtccatctg 421 tcttgccatc tccacgccag atgctcctgc cactcaagat ggggtagaaa gcctcctgca 481 gaagtttgca gctcaggagt tgattgaggt aaagcgaggt ctcctacaag atgatgcaca 541 tcctactctt gtaacctatg ctgaccattc caagctctct gccatgatgg gtgctgtggc 601 agaaaagaag ggccctgggg aggtagcagg gactgtcaca gggcagaagc ggcgtgcaga 661 acaggactcg actacagtag ctgcctttgc cagttcgtta gtctctggtc tgaactcttc 721 agcatcggaa ccagcaaagg agccagccaa gaaatcaagg aaacatgctg cctcagatgt 781 tgatctggag atagagagcc ttctgaacca acagtccact aaggaacaac agagcaagaa 841 ggtcagtcag gagatcctag agctattaat tactacaaca gccaaggaac aatccattgt 901 tgaaattcgc tctcgaggtc gggcccaagt gcaagaattc tgtgactatg gaaccaagga 961 ggagtgcatg aaagccagtg atgctgatcg accctgtcgc aagctgcact tcagacgaat 1021 tatcaataaa cacactgatg agtctttagg tgactgctct ttccttaata catgtttcca 1081 catggatacc tgcaagtatg ttcactatga aattgatgct tgcatggatt ctgaggcccc 1141 tggcagcaaa gaccacacgc caagccagga gcttgctctt acacagagtg tcggaggtga 1201 ttccagtgca gaccgactct tcccacctca gtggatctgt tgtgatatcc gctacctggt 1261 cgtcagtatc ttgggcaagt ttgcagttgt gatggctgac ccaccctggg atattcacat 1321 ggaactgccc tatgggaccc tgacagatga tgagatgcgc aggctcaaca tacccgtact 1381 acaggatgat ggctttctct tcctctgggt cacaggcagg gccatggagt tggggagaga 1441 atgtctaaac ctctgggggt atgaacgggt agatgaaatt atttgggtga agacaaatca 1501 actgcaacgc atcattcgga caggccgtac aggtcactgg ttgaaccatg ggaaggaaca 1561 ctgcttggtt ggtgtcaaag gaaatcccca aggcttcaac cagggtctgg attgtgatgt 1621 gatcgtagct gaggttcgtt ccaccagtca taaaccagat gaaatctatg gcatgattga 1681 aagactatct cctggcactc gcaagattga gttatttgga cgaccacaca atgtgcaacc 1741 caactggatc acccttggaa accaactgga tgggatccac ctactagacc cagatgtggt 1801 tgcacggttc aagcaaaggt acccagatgg tatcatctct aaacctaaga atttatagaa 1861 gcacttcctt acagagctaa gaatccatag ccatggctct gtaagctaaa cctgaagagt 1921 gatatttgta caatagcttt cttctttatt taaataaaca tttgtattgt aaaaaaaaaa 1981 aaaa // LOCUS AF014955 559 bp mRNA PRI 16-JAN-1998 DEFINITION Homo sapiens TFAR19 mRNA, complete cds. ACCESSION AF014955 NID g2772828 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 559) AUTHORS Liu,H.T., Wang,Y.G., Zhang,Y.M., Song,Q.S., Di,C.H., Yuan,Y. and Ma,D.L. TITLE A novel gene may play a role in the process of apoptosis of TF-1 cell line JOURNAL Unpublished REFERENCE 2 (bases 1 to 559) AUTHORS Liu,H.T., Wang,Y.G., Zhang,Y.M., Song,Q.S., Di,C.H., Yuan,Y. and Ma,D.L. TITLE Direct Submission JOURNAL Submitted (18-JUL-1997) Immunology, Beijing Medical University, No. 38, Xueyaun Road, Beijing 100083, Republic of China REFERENCE 3 (bases 1 to 559) AUTHORS Liu,H.T., Wang,Y.G., Zhang,Y.M., Song,Q.S., Di,C.H., Yuan,Y. and Ma,D.L. TITLE Direct Submission JOURNAL Submitted (09-DEC-1997) Immunology, Beijing Medical University, No. 38, Xueyaun Road, Beijing 100083, Republic of China REMARK Sequence update by submitter REFERENCE 4 (bases 1 to 559) AUTHORS Liu,H.T., Wang,Y.G., Zhang,Y.M., Song,Q.S., Di,C.H., Yuan,Y. and Ma,D.L. TITLE Direct Submission JOURNAL Submitted (15-JAN-1998) Immunology, Beijing Medical University, No. 38, Xueyaun Road, Beijing 100083, Republic of China REMARK Sequence update by submitter FEATURES Location/Qualifiers source 1..559 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="TF-1" gene 1..559 /gene="TFAR19" CDS 25..402 /gene="TFAR19" /function="protein may function in the process of apoptosis" /codon_start=1 /product="TFAR19" /db_xref="PID:g2407068" /translation="MADEELEALRRQRLAELQAKHGDPGDAAQQEAKHREAEMRNSIL AQVLDQSARARLSNLALVKPEKTKAVENYLIQMARYGQLSEKVSEQGLIEILKKVSQQ TEKTTTVKFNRRKVMDSDEDDDY" BASE COUNT 230 a 97 c 126 g 106 t ORIGIN 1 ctgctccagc gctgacgccg agccatggcg gacgaggagc ttgaggcgct gaggagacag 61 aggctggccg agctgcaggc caaacacggg gatcctggtg atgcggccca acaggaagca 121 aagcacaggg aagcagaaat gagaaacagt atcttagccc aagttctgga tcagtcggcc 181 cgggccaggt taagtaactt agcacttgta aagcctgaaa aaactaaagc agtagagaat 241 taccttatac agatggcaag atatggacaa ctaagtgaga aggtatcaga acaaggttta 301 atagaaatcc ttaaaaaagt aagccaacaa acagaaaaga caacaacagt gaaattcaac 361 agaagaaaag taatggactc tgatgaagat gacgattatt gaactacaag tgctcacaga 421 ctagaactta acggaacaag tctaggacag aagttaagat ctgattattt actttgttta 481 ttgtctatat gccttttaaa aaaataaact tgttatgcaa aaaaaaaaaa aaaaaaaaaa 541 aaaaaaaaaa aaaaaaaaa // LOCUS AF014958 1698 bp mRNA PRI 30-OCT-1997 DEFINITION Homo sapiens chemokine receptor X (CKRX) mRNA, complete cds. ACCESSION AF014958 NID g2305263 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1698) AUTHORS Ansari-Lari,M.A., Liu,X.-M., Gorrell,J.H. and Gibbs,R.A. TITLE Haplotype analysis of a gene cluster containing CCR5 and a new member of chemokine receptor gene family JOURNAL Unpublished REFERENCE 2 (bases 1 to 1698) AUTHORS Ansari-Lari,M.A., Liu,X.-M., Gorrell,J.H. and Gibbs,R.A. TITLE Direct Submission JOURNAL Submitted (18-JUL-1997) Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..1698 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="3" /map="3p21" /cell_type="white blood cell" gene 1..1698 /gene="CKRX" exon 1..244 /gene="CKRX" exon 245..1698 /gene="CKRX" CDS 257..1291 /gene="CKRX" /codon_start=1 /product="chemokine receptor X" /db_xref="PID:g2305264" /translation="MANYTLAPEDEYDVLIEGELESDEAEQCDKYDAQALSAQLVPSL CSAVFVIGVLDNLLVVLILVKYKGLKRVENIYLLNLAVSNLCFLLTLPFWAHAGGDPM CKILIGLYFVGLYSETFFNCLLTVQRYLVFLHKGNFFSARRRVPCGIITSVLAWVTAI LATLPEYVVYKPQMEDQKYKCAFSRTPFLPADETFWKHFLTLKMNISVLVLPLFIFTF LYVQMRKTLRFREQRYSLFKLVFAIMVVFLLMWAPYNIAFFLSTFKEHFSLSDCKSSY NLDKSVHITKLIATTHCCINPLLYAFLDGTFSKYLCRCFHLRSNTPLQPRGQSAQGTS REEPDHSTEV" unsure 550 /gene="CKRX" /note="either due to PCR error or sequence polymorphism" /replace="t" unsure 664 /gene="CKRX" /note="either due to PCR error or sequence polymorphism" /replace="a" unsure 756 /gene="CKRX" /note="either due to PCR error or sequence polymorphism" /replace="t" unsure 758 /gene="CKRX" /note="either due to PCR error or sequence polymorphism" /replace="a" polyA_signal 1677..1682 /gene="CKRX" polyA_site 1698 /gene="CKRX" BASE COUNT 434 a 413 c 392 g 459 t ORIGIN 1 agacgcttca gagatcctct ggaggcctgg gggagctttt gagtacttta tttcagttgg 61 tccctgagct cggtgagtgg ggcgggtaga gccaccaggg gaatcaacag tggtttctcg 121 tgcccctcag ggtcaggagc agtctgatca aaaggagggc atccactgtc cggggccatt 181 cccacagctc ccggatgctg ggtctggagg ctgcgccctt cccctgcagg agctcagccc 241 agtgggcagt ctgaagatgg ccaattacac gctggcacca gaggatgaat atgatgtcct 301 catagaaggt gaactggaga gcgatgaggc agagcaatgt gacaagtatg acgcccaggc 361 actctcagcc cagctggtgc catcactctg ctctgctgtg tttgtgatcg gtgtcctgga 421 caatctcctg gttgtgctta tcctggtaaa atataaagga ctcaaacgcg tggaaaatat 481 ctatcttcta aacttggcag tttctaactt gtgtttcttg cttaccctgc ccttctgggc 541 tcatgctggg ggcgatccca tgtgtaaaat tctcattgga ctgtacttcg tgggcctgta 601 cagtgagaca tttttcaatt gccttctgac tgtgcaaagg tacctagtgt ttttgcacaa 661 gggcaacttt ttctcagcca ggaggagggt gccctgtggc atcattacaa gtgtcctggc 721 atgggtaaca gccattctgg ccactttgcc tgaatacgtg gtttataaac ctcagatgga 781 agaccagaaa tacaagtgtg catttagcag aactcccttc ctgccagctg atgagacatt 841 ctggaagcat tttctgactt taaaaatgaa catttcggtt cttgtcctcc ccctatttat 901 ttttacattt ctctatgtgc aaatgagaaa aacactaagg ttcagggagc agaggtatag 961 ccttttcaag cttgtttttg ccataatggt agtcttcctt ctgatgtggg cgccctacaa 1021 tattgcattt ttcctgtcca ctttcaaaga acacttctcc ctgagtgact gcaagagcag 1081 ctacaatctg gacaaaagtg ttcacatcac taaactcatc gccaccaccc actgctgcat 1141 caaccctctc ctgtatgcgt ttcttgatgg gacatttagc aaatacctct gccgctgttt 1201 ccatctgcgt agtaacaccc cacttcaacc cagggggcag tctgcacaag gcacatcgag 1261 ggaagaacct gaccattcca ccgaagtgta aactagcatc caccaaatgc aagaagaata 1321 aacatggatt ttcatctttc tgcattattt catgtaaatt ttctacacat ttgtatacaa 1381 aatcggatac aggaagaaaa gggagaggtg agctaacatt tgctaagcac tgaatttgtc 1441 tcaggcaccg tgcaaggctc tttacaaacg tgagctcctt cgcctcctac cacttgtcca 1501 tagtgtggat aggactagtc tcatttctct gagaagaaaa ctaaggcgcg gaaatttgtc 1561 taagatcact taactaggaa gtggcagaac tgattctcca gccctggtag catttgctca 1621 gagcctacgc ttggtccaga acatcaaact ccaaaccctg gggacaaacg acatgaaata 1681 aatgtatttt aaaacatc // LOCUS AF015184 1912 bp mRNA PRI 24-DEC-1997 DEFINITION Homo sapiens clone ET16 ET putative translation product (ET) mRNA, alternatively spliced complete cds. ACCESSION AF015184 NID g2708500 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1912) AUTHORS Sureau,A., Soret,J., Guyon,C., Gaillard,C., Dumon,S., Keller,M., Crisanti,P. and Perbal,B. TITLE Characterization of multiple alternative RNAs resulting from antisense transcription of the PR264/SC35 splicing factor gene JOURNAL Nucleic Acids Res. 25 (22), 4513-4522 (1997) MEDLINE 98026839 REFERENCE 2 (bases 1 to 1912) AUTHORS Sureau,A., Soret,J., Guyon,C., Dumon,S., Gaillard,C., Keller,M., Crisanti,P. and Perbal,B. TITLE Direct Submission JOURNAL Submitted (21-JUL-1997) Oncologie Virale et Moleculaire, CNRS-UMR146, Institut Curie, Centre Universitaire, Bat 110, Orsay 91405, France FEATURES Location/Qualifiers source 1..1912 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17" /map="17q25" /cell_type="thymic cells" /clone="ET16" gene 1..1912 /note="located on the opposite strand of the PR264/SC35 gene" /gene="ET" exon 1..95 /gene="ET" exon 96..262 /gene="ET" CDS 167..1516 /gene="ET" /codon_start=1 /product="ET putative translation product" /db_xref="PID:g2708501" /translation="MSPESKKLFNIIILGVAFMFMFTAFQTCGNVAQTVIRSLNRTDF HGSGYTSMAIIYGVFSASNLITPSVVAIVGPQLSMFASGLFYSMYIAVFIQPFPWSFY TASVFIGIAAAVLWTAQGNCLTINSDEHSIGRNSGIFWALLQSSLFFGNLYIYFAWQG KTQISESDRRTVFIALTVISLVGTVLFFLIRKPDSENVLGEDESSDDQDMEVNESAQN NLTKAVDAFKKSFKLCVTKEMLLLSITTAYTGLELTFFSGVYGTCIGATNKFGAEEKS LIGLSGIFIGIGEILGGSLFGLLSKNNRFGRNPVVLLGILVHFIAFYLIFLNMPGDAP IAPVKGTDSSAYIKSSKEVAILCSFLLGLGDSCFNTQLLSILGFLYSEDSAPAFAIFK FVQSICAAVAFFYSNYLLLHWQLLVMVIFGFFGTIFFFTVEWEAAAFVARGSDYRSI" exon 263..318 /gene="ET" exon 319..506 /gene="ET" exon 507..662 /gene="ET" polyA_signal 1871..1875 /gene="ET" BASE COUNT 485 a 395 c 427 g 605 t ORIGIN 1 ggcccctcaa cctggagcgg ataaattctt ggcgcttctc cgggggttgt gctcttccgt 61 actcggatcg cttcttagga gtatcctaac tgccggattg tcagtggctt cgccccgagg 121 agagctgact gccctgggct gctgcctccg gcagagctga gccaaaatgt ccccggaatc 181 taaaaagctt ttcaacatca ttattttagg agttgccttt atgtttatgt tcactgcctt 241 tcaaacttgt ggaaatgtgg cgcaaactgt catcaggagc ttaaatagga cagattttca 301 cggcagtgga tataccagca tggctattat ctatggagtg ttctctgctt caaatttgat 361 tacaccgtca gtggttgcca ttgtaggacc tcaactctct atgtttgcca gtggtttatt 421 ttacagcatg tacattgccg ttttcatcca gcctttcccg tggtccttct acacagcctc 481 tgttttcatt ggaattgctg ctgctgtgct ttggacagca caaggaaact gcctgacaat 541 caattcggat gagcacagca ttgggagaaa cagtgggatt ttctgggcac ttctgcagtc 601 tagcttgttc tttggaaatc tctacatata ttttgcctgg caagggaaaa ctcagatatc 661 agagagtgac cgaagaacag tgtttattgc cctaacggtg attagccttg tggggacagt 721 tctattcttt ctcattcgga aaccagattc tgaaaatgtc ctaggagaag atgagtcttc 781 tgatgaccag gacatggaag tcaacgagtc tgcccagaac aatctgacaa aggcagtaga 841 tgcttttaaa aagtctttta agttatgtgt caccaaggag atgctccttc ttagtattac 901 aactgcttat acaggtctgg aattaacttt cttctctggt gtatatggaa cctgtattgg 961 tgctacaaat aaatttggag cagaagagaa aagccttatt ggactttctg gcattttcat 1021 cggcattgga gaaattttag gtggaagcct cttcggcctg ctgagcaaga acaatcgttt 1081 tggtagaaat ccagttgtgc tgttgggcat cctggtgcac ttcatagctt tttatctaat 1141 atttctcaac atgcctggag atgccccgat tgctcctgtt aaaggaactg acagcagtgc 1201 ttacatcaaa tccagcaaag aagttgccat tctctgcagt tttctgttgg gccttggaga 1261 cagctgcttt aatacccagc tgcttagtat cttgggcttt ctgtattctg aagacagcgc 1321 cccagcattt gccatcttca agtttgttca gtctatttgc gcagccgtgg catttttcta 1381 cagcaactac cttctccttc actggcaact cctggtcatg gtgatatttg ggttttttgg 1441 aacaattttt ttcttcactg tggaatggga agctgccgcc tttgtagccc gcggctctga 1501 ctaccgaagt atctgatctg gtgtccgtga gggacacgta tgacctcaga aacacagctg 1561 gacacagagc ttggtggaag aagtcgcctt tgatcttcac tatatattgg gtgatgttca 1621 gtatggaaaa tcaagggatt aagactgtta aatcagccag agttggtgtt caagtttaca 1681 gatatgagtt atttaaagca agtagaataa gggaaagctg ttctgtcaac tgtaattgtt 1741 caaagatgtt gtttttcatt tcatctatct caattcttat aatcatgtta tagaatgtaa 1801 atgttttctt ctctctcctg ctcttgttgg aagatcctgc cttgatttag aatactaggc 1861 catatgtcat ataaatattt tttctggaaa aaaaaaaaaa aaaaaaaaaa aa // LOCUS AF015257 2776 bp mRNA PRI 10-DEC-1997 DEFINITION Homo sapiens flow-induced endothelial G protein-coupled receptor (FEG-1) mRNA, complete cds. ACCESSION AF015257 NID g2353152 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2776) AUTHORS Takada,Y., Kato,C., Kondo,S., Korenaga,R. and Ando,J. TITLE Cloning of cDNAs encoding G protein-coupled receptor expressed in human endothelial cells exposed to fluid shear stress JOURNAL Biochem. Biophys. Res. Commun. 240 (3), 737-741 (1997) MEDLINE 98063308 REFERENCE 2 (bases 1 to 2776) AUTHORS Takada,Y., Kato,C. and Kondo,S. TITLE Direct Submission JOURNAL Submitted (21-JUL-1997) Institute for Life Science Research, Asahi Chemical Industry Co., Ltd., 2-1, Samejima, Fuji, Shizuoka 416, Japan FEATURES Location/Qualifiers source 1..2776 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..2776 /gene="FEG-1" CDS 692..1819 /gene="FEG-1" /codon_start=1 /product="flow-induced endothelial G protein-coupled receptor" /db_xref="PID:g2353153" /translation="MDVTSQARGVGLEMYPGTAQPAAPNTTSPELNLSHPLLGTALAN GTGELSEHQQYVIGLFLSCLYTIFLFPIGFVGNILILVVNISFREKMTIPDLYFINLA VADLILVADSLIEVFNLHERYYDIAVLCTFMSLFLQVNMYSSVFFLTWMSFDRYIALA RAMRCSLFRTKHHARLSCGLIWMASVSATLVPFTAVHLQHTDEACFCFADVREVQWLE VTLGFIVPFAIIGLCYSLIVRVLVRAHRHRGLRPRRQKALRMILAVVLVFFVCWLPEN VFISVHLLQRTQPGAAPCKQSFRHAHPLTGHIVNLAAFSNSCLNPLIYSFLGETFRDK LRLYIEQKTNLPALNRFCHAALKAVIPDSTEQSDVRFSSAV" BASE COUNT 540 a 896 c 806 g 534 t ORIGIN 1 ggaaaacgac acctagaagt aggagtgaga ttcgctgaag ttcccttctg aggaagaccc 61 acccctccgc ctggagagcc ggggctggcg gtgcctgagg accccttcgg cctggacagc 121 ccacgcgggc ttggggggcc tcgctctgcc ctcatggggc ggccatcggt tcccgaagcg 181 gcgagtgaaa attcaaatgg ccagtagggg gcgcactcgg aagtggccgc cccgcatgag 241 gcagttcagc ggccccgaga gtccggggag ggaggtttat tctccgcctg cacgagactg 301 tgaaatccgc aaccatgagc aggagaggcg gccctggtgg ggaagaggcc accaacatct 361 ggacggcagg tacccagaga gtgagcagct ccacgcggga ctgtgcacgg tggccgacac 421 ccgcagggac gcccgccgga cgagcacgcg gagggccctc gcctccacgg atgcaccatg 481 ccggtgtgag gagcatctgt tcttcccact ctctgcagtt aacaaaccca acccaaacca 541 ccacaggtgc tcctcctggg gagtttcctg tctgacaaat gccaggctca cttcaaggag 601 aatcacgctt ctttctaaag atggattcac catttaaaac agagctctgg gagcctttcg 661 gcaaatcttg aaagctgcac ggcgcagaga catggatgtg acttcccaag cccggggcgt 721 gggcctggag atgtacccag gcaccgcgca gcctgcggcc cccaacacca cctcccccga 781 gctcaacctg tcccacccgc tcctgggcac cgccctggcc aatgggacag gtgagctctc 841 ggagcaccag cagtacgtga tcggcctgtt cctctcgtgc ctctacacca tcttcctctt 901 ccccatcggc tttgtgggca acatcctgat cctggtggtg aacatcagct tccgcgagaa 961 gatgaccatc cccgacctgt acttcatcaa cctggcggtg gcggacctca tcctggtggc 1021 cgactccctc attgaggtgt tcaacctgca cgagcggtac tacgacatcg ccgtcctgtg 1081 caccttcatg tcgctcttcc tgcaggtcaa catgtacagc agcgtcttct tcctcacctg 1141 gatgagcttc gaccgctaca tcgccctggc cagggccatg cgctgcagcc tgttccgcac 1201 caagcaccac gcccggctga gctgtggcct catctggatg gcatccgtgt cagccacgct 1261 ggtgcccttc accgccgtgc acctgcagca caccgacgag gcctgcttct gtttcgcgga 1321 tgtccgggag gtgcagtggc tcgaggtcac gctgggcttc atcgtgccct tcgccatcat 1381 cggcctgtgc tactccctca ttgtccgggt gctggtcagg gcgcaccggc accgtgggct 1441 gcggccccgg cggcagaagg cgctccgcat gatcctcgcg gtggtgctgg tcttcttcgt 1501 ctgctggctg ccggagaacg tcttcatcag cgtgcacctc ctgcagcgga cgcagcctgg 1561 ggccgctccc tgcaagcagt ctttccgcca tgcccacccc ctcacgggcc acattgtcaa 1621 cctcgccgcc ttctccaaca gctgcctaaa ccccctcatc tacagctttc tcggggagac 1681 cttcagggac aagctgaggc tgtacattga gcagaaaaca aatttgccgg ccctgaaccg 1741 cttctgtcac gctgccctga aggccgtcat tccagacagc accgagcagt cggatgtgag 1801 gttcagcagt gccgtgtaga cagccttggc cgcataggcc cagccagggt gtgactcggg 1861 agctgcacac acctgggtgg acacaaggca cggccacgtc atgtctctaa actgcggtca 1921 gatgtggctt ctggctcctc ggggcctcgc gagggtcacg cttgcctggt caccctgggg 1981 ctgcttagga aacctcacga ctggtcacct tgcactcttc acacagaatt gctacaatcc 2041 caaagcgctc gccccgcagg gtccaaaggc cagcggtgac cagcctgtca cccagctcct 2101 ccccgccaac cctgcctgcc gctgcacctg cctgccgctg caggaaacat ttgacaccgt 2161 cgaccaggaa agccacacgg agaggccact gtgggtgaag cgcctcagtt acacaggaac 2221 cctaaagcaa atctgccacc gtgggggaac tgacgctgga gatgcaaggt gctggtgggt 2281 ctgagctgga cgtcgcggtg tgtcctctgt gcccacggtc tgagctagct agcgcaccgc 2341 cgagttaaag aggagaagga aaacatgctg ctctggtgca cgcctgagcg tcctccatct 2401 tccaggatgg cagcaatggc gctgtgcggc ctcaccaggc ccacgaggag cagcagcgct 2461 cggcccggag cagcaggaag gcccctctgt ggagcgcccg ccgtctgctc cggggtggtt 2521 cagtcactgc ttgttgacat caacatggca attgcactca tgtggactgg gaccgtgcga 2581 gctgccgtgt gggttagtcg ggtgccagga caatgaaata ctccagcacg tgtggctgac 2641 gaatttgttt ctacagaaat aacagctggg gacaactgcg gtgatgatgt aaaaaccttc 2701 ccataaaatg taagaaaagc tgatgaggct ggtgacgttc agcctttgtc aataaacctg 2761 tcatgtgcgg atcctt // LOCUS AF015308 1866 bp mRNA PRI 10-SEP-1997 DEFINITION Homo sapiens nucleolar protein (MSP58) mRNA, complete cds. ACCESSION AF015308 NID g2384716 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1866) AUTHORS Ren,Y. and Busch,H. TITLE Direct Submission JOURNAL Submitted (21-JUL-1997) Pharmacology, Baylor College of Medicine, 1 Baylor Plaza, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..1866 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..1866 /gene="MSP58" CDS 117..1505 /gene="MSP58" /codon_start=1 /product="nucleolar protein" /db_xref="PID:g2384717" /translation="MDKDSQGLLDSSLMASGTASRSEDEESLAGQKRASSQALGTIPK RRSSSRFIKRKKFDDELVESSLAKSSTRAKGASGVEPGRCSGSEPSSSEKKKVSKAPS TPVPPSPAPAPGLTKRVKKSKQPLQVTKDLGRWKPADDLLLINAVLQTNDLTSVHLGV KFSCRFTLREVQERWYALLYDPVISKLACQAMRQLHPEAIAAIQSKALFSKAEEQLLS KVGSTSQPTLETFQDLLHRHPDAFYLGRTAKALQAHWQLMKQYYLLEDQTVQPLPKGD QVLNFSDAEDLIDDSKLKDMRDEVLEHELMVADRRQKREIRQLEQELHKWQVLVDSIT GMSSPDFDNQTLAVLRGRMVRYLMRSREITLGRATKDNQIDVDLSLEGPAWKISRKQG VIKLKNNGDFFIANEGRRPIYIDGRPVLCGSKWRLSNNSVVEIASLRFVFLINQDLIA LIRAEAAKITPQ" BASE COUNT 439 a 535 c 522 g 369 t 1 others ORIGIN 1 ggaatgaatc tcctctcagc ctttaagctc acctggtcag aatccttgga tgagcctgtg 61 ggaccgttcc tcctagcccg gtggtttgga accagtggct ttnggactgt aagaggatgg 121 acaaagattc tcaggggctg ctagattcat ccctgatggc atcaggcact gccagccgct 181 cagaggatga ggagtcactg gcagggcaga agcgagcctc ctcccaggcc ttgggcacca 241 tccctaaacg gagaagctcc tccaggttca tcaagaggaa gaagttcgat gatgagctgg 301 tggagagcag cctggcaaaa tcttctaccc gggcaaaggg ggccagtggg gtggaaccag 361 ggcgctgttc ggggagtgaa ccctcctcca gtgagaagaa gaaggtatcc aaagccccca 421 gcactcctgt gccacccagc ccagccccag cccctggact caccaagcgt gtgaagaaga 481 gtaaacagcc acttcaggtg accaaggatc tgggccgctg gaagcctgca gatgacctcc 541 tgctcataaa tgctgtgttg cagaccaacg acctgacctc cgtccacctg ggcgtgaaat 601 tcagctgccg cttcaccctt cgggaggtcc aggagcgttg gtacgccctg ctctacgatc 661 ctgtcatctc caagttggcc tgtcaggcca tgaggcagct gcacccagag gctattgcag 721 ccatccagag caaggccctg tttagcaagg ctgaggagca gctgctgagc aaagtgggat 781 cgaccagcca gcccaccttg gagaccttcc aggacctgct gcacagacac cctgatgcct 841 tctacctggg ccgtaccgcg aaggccctgc aggcccactg gcagctcatg aagcagtatt 901 acctgctgga ggaccagaca gtgcagccgc tgcccaaagg ggaccaagtg ctgaacttct 961 ctgatgcaga ggacctgatt gatgacagta agctcaagga catgcgagat gaggtcctgg 1021 aacatgagct gatggtggct gaccggcgcc agaagcgaga gattcggcag ctggaacagg 1081 aactgcataa gtggcaggtg ctagtggaca gcatcacagg catgagctct ccggacttcg 1141 acaaccagac actggcagtg ctgcggggcc gcatggtgcg gtacctgatg cgctcgcgtg 1201 agatcaccct gggcagagca accaaggata accagattga tgtggacctg tctctggagg 1261 gtccggcctg gaagatatcc cggaaacaag gtgtcatcaa gctgaagaac aacggtgatt 1321 tcttcattgc caatgagggt cgacggccca tctacatcga tggacggccg gtgctctgtg 1381 gctccaaatg gcgcctcagc aacaactctg tggtggagat cgccagcctg cgattcgtct 1441 tccttatcaa ccaggacctc attgccctca tcagggctga ggctgccaag atcacaccac 1501 agtgaggagt ggtggcagga ctcgtgggcc ctctccggcc tgtttcccct gccactccag 1561 cccccttgag ctgggaactc aggctcctgg aaaaacctgg gcagtgggag gctcagctgc 1621 gggccattga tttgagcctt tgagggagga tagggctggc ctttgtgaag ccagcagagg 1681 ctgagaacct caggcttccc tagatccaga gcccctcccc atcttcctct ctctaaaaac 1741 aaccctaccc cccattgcca ccttcactcc tgtgtctcca gctgattagc ctcagactct 1801 tcttttattg tttttctttt gtaaataaaa agcacccagg ttccaaaaaa aaaaaaaaaa 1861 aaaaaa // LOCUS AF015553 4423 bp mRNA PRI 21-SEP-1997 DEFINITION Homo sapiens TFII-I protein (TFII-I) mRNA, complete cds. ACCESSION AF015553 NID g2415381 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4423) AUTHORS Roy,A.L., Du,H., Gregor,P.D., Novina,C.D., Martinez,E. and Roeder,R.G. TITLE Cloning of an Inr and E-box binding protein TFII-I that interacts physically and functionally with USF JOURNAL EMBO J. (1997) In press REFERENCE 2 (bases 1 to 4423) AUTHORS Martinez,E., Gregor,P.D., Roy,A.L. and Roeder,R.G. TITLE Direct Submission JOURNAL Submitted (22-JUL-1997) Biochemistry and Molecular Biology, The Rockefeller University, 1230 York Avenue, New York, NY 10021, USA FEATURES Location/Qualifiers source 1..4423 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..4423 /gene="TFII-I" CDS 371..3244 /gene="TFII-I" /function="transcription cofactor" /note="binds Inr and E-box elements; physically and functionally interacts with USF" /codon_start=1 /product="TFII-I protein" /db_xref="PID:g2415382" /translation="MAQVAMSTLPVEDEESSESRMVVTFLMSALESMCKELAKSKAEV ACIAVYETDVFVVGTERGRAFVNTRKDFQKDFVKYCVEEEEKAAEMHKMKSTTQANRM SVDAVEIETLRKTVEDYFCFCYGKALGKSTVVPVPYEKMLRDQSAVVVQGLPEGVAFK HPENYDLATLKWILENKAGISFIIKRPFLEPKKHVGGRVMVTDADRSILSPGGSCGPI KVKTEPTEDSGISLEMAAVTVKEESEDPDYYQYNIQGSHHSSEGNEGTEMEVPAEDDD YSPPSKRPKANELPQPPVPEPANAGKRKVREFNFEKWNARITDLRKQVEELFERKYAQ AIKAKGPVTIPYPLFQSHVEDLYVEGLPEGIPFRRPSTYGIPRLERILLAKERIRFVI KKHELLNSTREDLQLDKPASGVKEEWYARITKLRKMVDQLFCKKFAERLGSTEAKAVP YQKFEAHPNDLYVEGLPENIPFRSPSWYGIPRLEKIIQVGNRIKFVIKRPELLTHSTT EVTQPRTNTPVKEDWNVRITKLRKQVEEIFNLKFAQALGLTEAVKVPYPVFESNPEFL YVEGLPEGIPFRSPTWFGIPRLERIVRGSNKIKFVVKKPELVISYLPPGMASKINTKA LQSPKRPRSPGSNSKVPEIEVTVEGPNNNNPQTSAVRTPTQTNGSNVPFKPRGREFSF EAWNAKITDLKQKVENLFNEKCGEALGLKQAVKVPFALFESFPEDFYVEGLPEGVPFR RPSTFGIPRLEKILRNKAKIKFIIKKPEMFETAIKESTSSKSPPRKINSSPNVNTTAS GVEDLNIIQVTIPDDDNERLSKVEKARQLREQVNDLFSRKFGEAIGMGFPVKVPYRKI TINPGCVVVDGMPPGVSFKAPSYLEISSMRRILDSAEFIKFTVIRPFPGLVINNQLVD QSESEGPVIQESAEPSQLEVPATEEIKETDGSSQIKQEPDPTW" BASE COUNT 1293 a 996 c 1049 g 1085 t ORIGIN 1 aggaggagga gggtgagaga gaagctggga gagcagagaa aaggggccac cggtcgcccc 61 cccgcttccc cgcacgcgct ctccagccgc ggccgcccgc ctgccgcggt caccccggcc 121 tctgcctctg tcccccagtg atcggatcaa ggcgctgagc gaggccctgc ctgcggggcg 181 gccatgcggc ggtgacagga gcgcgaccga cacgcacggg cccctcgccc cctctcgcct 241 cccgtccgct cgccagctcc cctcagccga ggctgctccg cggcggccgc agcccgcgcg 301 cggcccacac tcgcctcccc tcggcacccc cggccccgga gctgcctgga ggcggccgca 361 ctcggggatc atggcccaag ttgcaatgtc caccctcccc gttgaagatg aggagtcctc 421 ggagagcagg atggtggtga cattcctcat gtcagctctc gagtccatgt gtaaagaact 481 ggccaagtcc aaagccgaag tggcctgcat tgcagtgtat gaaacagacg tgtttgtcgt 541 cggaactgaa agaggacgtg cttttgtcaa taccagaaag gattttcaaa aagattttgt 601 aaaatattgt gttgaagaag aagaaaaagc tgcagagatg cataaaatga aatctacaac 661 ccaggcaaat cggatgagtg tagatgctgt agaaattgaa acactcagaa aaacagttga 721 ggactatttc tgcttttgct atgggaaagc tttaggcaaa tccacagtgg tacctgtacc 781 atatgagaag atgctgcgag accagtcggc tgtggtagtg caggggcttc cggaaggtgt 841 tgcctttaaa caccccgaga actatgatct tgcaaccctg aaatggattt tggagaacaa 901 agcagggatt tcattcatca ttaagagacc ttttttagag ccaaagaagc atgtaggtgg 961 tcgtgtgatg gtaacagatg ctgacaggtc aatactatct ccaggtggaa gttgtggccc 1021 catcaaagtg aaaactgaac ccacagaaga ttctggcatt tccctggaaa tggcagctgt 1081 gacagtaaag gaagaatcag aagatcctga ttattatcaa tataacattc aaggaagcca 1141 ccattcttca gagggcaatg aaggcacaga aatggaagta ccagcagaag atgatgatta 1201 ttctccaccg tctaagagac caaaggccaa tgagctaccg cagccaccag tcccggaacc 1261 cgccaatgct gggaagcgga aagtgaggga gttcaacttc gagaaatgga atgctcgcat 1321 cactgatcta cgtaaacaag ttgaagaatt gtttgaaagg aaatatgctc aagccataaa 1381 agccaaaggt ccggtgacga tcccgtaccc tcttttccag tctcatgttg aagatcttta 1441 tgtagaagga cttcctgaag gaattccttt tagaaggcca tctacttacg gaattcctcg 1501 cctggagagg atattacttg caaaggaaag gattcgtttt gtgattaaga aacatgagct 1561 tctgaattca acacgtgaag atttacagct tgataaacca gcttcaggag taaaggaaga 1621 atggtatgcc agaatcacta aattaagaaa gatggtggat cagcttttct gcaaaaaatt 1681 tgcggaacgc ttggggagca ctgaagccaa ggctgtaccg taccaaaaat ttgaggcaca 1741 cccgaatgat ctgtacgtgg aaggactgcc agaaaacatt cctttccgaa gtccctcatg 1801 gtatggaatc ccaaggctgg aaaaaatcat tcaagtgggc aatcgaatta aatttgttat 1861 taaaagacca gaacttctga ctcacagtac cactgaagtt actcagccaa gaacgaatac 1921 accagtcaaa gaagattgga atgtcagaat taccaagcta cggaagcaag tggaagagat 1981 ttttaatttg aaatttgctc aagctcttgg actcaccgag gcagtaaaag taccatatcc 2041 tgtgtttgaa tcaaacccgg agttcttgta tgtggaaggc ttgccagagg ggattccctt 2101 ccgaagccct acctggtttg gaattccacg acttgaaagg atcgtccgcg ggagtaataa 2161 aatcaagttc gttgttaaaa aacctgaact agttatttcc tacttgcctc ctgggatggc 2221 tagtaaaata aacactaaag ctttgcagtc ccccaaaaga ccacgaagtc ctgggagtaa 2281 ttcaaaggtt cctgaaattg aggtcaccgt ggaaggccct aataacaaca atcctcaaac 2341 ctcagctgtt cgaaccccga cccagactaa cggttctaac gttcccttca agccacgagg 2401 gagagagttt tcctttgagg cctggaatgc caaaatcacg gacctaaaac agaaagttga 2461 aaatctcttc aatgagaaat gtggggaagc tcttggcctt aaacaagctg tgaaggtgcc 2521 gttcgcgtta tttgagtctt tcccggaaga cttttatgtg gaaggcttac ctgagggtgt 2581 gccattccga agaccatcga cttttggcat tccgagactg gagaagatac tcagaaacaa 2641 agccaaaatt aagttcatca ttaaaaagcc cgaaatgttt gagacggcga ttaaggagag 2701 cacctcctct aagagccctc ccagaaaaat aaattcatca cccaatgtta atactactgc 2761 atcaggtgtt gaagacctta acatcattca ggtgacaatt ccagatgatg ataatgaaag 2821 actctcgaaa gttgaaaaag ctagacagct aagagaacaa gtgaatgacc tctttagtcg 2881 gaaatttggt gaagctattg gtatgggttt tcctgtgaaa gttccctaca ggaaaatcac 2941 aattaaccct ggctgtgtgg tggttgatgg catgcccccg ggggtgtcct tcaaagcccc 3001 cagctacctg gaaatcagct ccatgagaag gatcttagac tctgccgagt ttatcaaatt 3061 cacggtcatt agaccatttc caggacttgt gattaataac cagctggttg atcagagtga 3121 gtcagaaggc cccgtgatac aagaatcagc tgaaccaagc cagttggaag ttccagccac 3181 agaagaaata aaagagactg atggaagctc tcagatcaag caagaaccag accccacgtg 3241 gtagacctct tccctcctag gcttaaagta tcagtggttg agaagagctt ttcggacctg 3301 ttactacccc aagctgtgta atataccttg gtataacaga aataccttct atacaaacct 3361 ttttttctac ttttagatag aaatgtctac tttttcagca gttctgtgaa ttaaagagca 3421 gagtgactgt gggtctggaa tggctggtgt acttgggaat gtactatcag gattttacag 3481 caatgctggg aaatgacagg gaaaatgaca ggaatgaatc tcaccagatt ttttatgtac 3541 tcagcagagc cttgagttac ggtgtttatt ttccaatcaa gtgaagatat ctcctacttc 3601 tcctactgga acatctcagc ttctgcagtg aagaaaaatt cctgtgatag ttcagttctt 3661 tagtttttct atttgaaaaa aaaaaaatca tttaaatgat cctttgttca cggctctcct 3721 taatgactga gtgaacagtt cctatctgta tatttgacta aaccttttcc taagctatct 3781 ctcatggttc ctatgttttt ttatcataat taaaagcaaa accatctgga tcacctaaca 3841 gtcagaggtc agtatctcag cgtgtgaatt atagaggaat acagagagaa cctctcccac 3901 tttttacttt tcgtccaaat aaaatgcatg gtgtaccaga agttgaagat cgggttgagg 3961 attggggcta gctcgatgac actaaggccc caacatcgcg ggacctgctg tggcgcggat 4021 tcttaggaac gctgttctag ccggccccct ctccaggggt cgccgtggcc ggcattattt 4081 cctagttctt cttgtaaccc tgaggtgcca gcgcggggag tgaggagggg tcagggggct 4141 aaggatgcaa cctctgacgt tctgcgcctt cctaggagag tcttacatgt gttgagattt 4201 cacaagcaat gcgagttgta aaataccagc tctacaagaa actaggctct gtgacggcat 4261 agttttcagt agctttatca caatattcac aatggagaat tatatgacat ggtagcagaa 4321 ataggccctt ttatgtgttg cttctatttt acctcaaatt gtagatatag ggtaatcaat 4381 aaaatccatc catgcctttc acacactaaa aaaaaaaaaa aag // LOCUS AF015913 1996 bp mRNA PRI 13-AUG-1997 DEFINITION Homo sapiens SKB1Hs mRNA, complete cds. ACCESSION AF015913 NID g2323409 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1996) AUTHORS Marcus,S. TITLE SKB1Hs, a human homolog of the fission yeast skb1 gene JOURNAL Unpublished REFERENCE 2 (bases 1 to 1996) AUTHORS Marcus,S. TITLE Direct Submission JOURNAL Submitted (24-JUL-1997) Molecular Genetics, University of Texas M.D. Anderson Cancer Center, 1515 Holcombe Blvd., Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..1996 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="HeLa" gene 1..1996 /gene="SKB1Hs" CDS 1..1914 /gene="SKB1Hs" /note="homolog of fission yeast Skb1" /codon_start=1 /product="Skb1Hs" /db_xref="PID:g2323410" /translation="MAAMAVGGAGGSRVSSGRDLNCVPEIADTLGAVAKQGFDFLCMP VFHPRFKREFIQEPAKNRPGPQTRSDLLLSGRDWNTLIVGKLSPWIRPDSKVEKIRRN SEAAMLQELNFGAYLGLPAFLLPLNQEDNTNLARVLTNHIHTGHHSSMFWMRVPLVAP EDLRDDIIENAPTTHTEEYSGEEKTWMWWHNFRTLCDYSKRIAVALEIGADLPSNHVI DRWLGEPIKAAILPTSIFLTNKKGFPVLFKMHQRLIFRLLKLEVQFIITGTNHHSEKE FCSYLQYLEYLSQNRPPPNAYELFAKGYEDYLQSPLQPLMDNLESQTYEVFEKDPIKY SQYQQAIYKCLLDRVPEEEKDTNVQVLMVLGAGRGPLVNASLRAAKQADRRIKLYAVE KNPNAVVTLENWQFEEWGSQVTVVSSDMREWVAPEKADIIVSELLGSFADNELSPECL DGAQHFLKDDGVSIPGEYTSFLAPISSSKLYNEVRACREKDRDPEAQFEMPYVVRLHN FHQLSAPQPCFTFSHPNRDPMIDNNRYCTLEFPVEVNTVLHGFAVYFETVLYQDITLS IRPETHSPGMFSWFPILFPIKQPITVREGQTICVRFWRCSNSKKVWYEWAVTAPVCSA IHNPTGRSYTIGL" BASE COUNT 487 a 520 c 519 g 470 t ORIGIN 1 atggcggcga tggcggtcgg gggtgctggt gggagccgcg tgtccagcgg gagggacctg 61 aattgcgtcc ccgaaatagc tgacacacta ggggctgtgg ccaagcaggg gtttgatttc 121 ctctgcatgc ctgtcttcca tccgcgtttc aagagggagt tcattcagga acctgctaag 181 aatcggcccg gtccccagac acgatcagac ctactgctgt caggaaggga ctggaatacg 241 ctaattgtgg gaaagctttc tccatggatt cgtccagact caaaagtgga gaaaattcgc 301 aggaactccg aggcggccat gttacaggag ctgaattttg gtgcatattt gggtcttcca 361 gctttcctgc tgccccttaa tcaggaagat aacaccaacc tggccagagt tttgaccaac 421 cacatccaca ctggccatca ctcttccatg ttctggatgc gggtaccctt ggtggcacca 481 gaggacctga gagatgatat aattgagaat gcaccaacta cacacacaga ggagtacagt 541 ggggaggaga aaacgtggat gtggtggcac aacttccgga ctttgtgtga ctatagtaag 601 aggattgcag tggctcttga aattggggct gacctcccat ctaatcatgt cattgatcgc 661 tggcttgggg agcccatcaa agcagccatt ctccccacta gcattttcct gaccaataag 721 aagggatttc ctgttctttt caagatgcac cagaggctca tcttccggct cctcaagttg 781 gaggtgcagt tcatcatcac aggcaccaac caccactcag agaaggagtt ctgctcctac 841 ctccaatacc tggaatactt aagccagaac cgtcctccac ctaatgccta tgaactcttt 901 gccaagggct atgaagacta tctgcagtcc ccgcttcagc cactgatgga caatctggaa 961 tctcagacat atgaagtgtt tgaaaaggac cccatcaaat actctcagta ccagcaggcc 1021 atctataaat gtctgctaga ccgagtacca gaagaggaga aggataccaa tgtccaggta 1081 ctgatggtgc tgggagcagg acggggaccc ctggtgaacg cttccctgcg ggcagccaag 1141 caggccgacc ggcggataaa gctgtatgct gtggagaaaa acccaaatgc cgtggtgacg 1201 ctagagaact ggcagtttga agaatgggga agccaagtga ccgtagtctc atcagacatg 1261 agggaatggg tggctccaga gaaagcagac atcattgtca gtgagcttct gggctcattt 1321 gctgacaatg aattgtcgcc tgagtgcctg gatggagccc agcacttcct aaaagatgat 1381 ggtgtgagca tccccgggga gtacacttcc tttctggctc ccatctcttc ctccaagctg 1441 tacaatgagg tccgagcctg tagggagaag gaccgtgacc ctgaggccca gtttgagatg 1501 ccttatgtgg tacggctgca caacttccac cagctctctg caccccagcc ctgtttcacc 1561 ttcagccatc ccaacagaga tcctatgatt gacaacaacc gctattgcac cttggaattt 1621 cctgtggagg tgaacacagt actacatggc tttgcggtct actttgagac tgtgctttat 1681 caggacatca ctctgagtat ccgtccagag actcactctc ctgggatgtt ctcatggttt 1741 cccatcctct tccctattaa gcagcccata acggtacgtg aaggccaaac catctgtgtg 1801 cgtttctggc gatgcagcaa ttccaagaag gtgtggtatg agtgggctgt gacagcacca 1861 gtctgttctg ctattcataa ccccacaggc cgctcatata ccattggcct ctagccctgc 1921 gtgccaagtg tccagagcct tggaagcagc ttcaggttct gctcctgtag tacagaaggt 1981 gcagtacatc tatggg // LOCUS AF015926 1997 bp mRNA PRI 17-OCT-1997 DEFINITION Homo sapiens ezrin-radixin-moesin binding phosphoprotein-50 mRNA, complete cds. ACCESSION AF015926 NID g2529738 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1997) AUTHORS Reczek,D., Berryman,M. and Bretscher,A. TITLE Identification of EBP50: A PDZ-containing phosphoprotein that associates with members of the ezrin-radixin-moesin family JOURNAL J. Cell Biol. 139 (1), 169-179 (1997) MEDLINE 97461574 REFERENCE 2 (bases 1 to 1997) AUTHORS Reczek,D., Berryman,M. and Bretscher,A. TITLE Direct Submission JOURNAL Submitted (24-JUL-1997) Biochemistry, Molecular and Cell Biology, Cornell University, Ithaca, NY 14853, USA FEATURES Location/Qualifiers source 1..1997 /organism="Homo sapiens" /db_xref="taxon:9606" /note="submitted sequence was obtained from WashU-Merck EST Project clone 33147, GenBank Accession Number: R43820" CDS 213..1286 /note="EBP50" /codon_start=1 /product="ezrin-radixin-moesin binding phosphoprotein-50" /db_xref="PID:g2529739" /translation="MSADAAAGAPLPRLCCLEKGPNGYGFHLHGEKGKLGQYIRLVEP GSPAEKAGLLAGDRLVEVNGENVEKETHQQVVSRIRAALNAVRLLVVDPETDEHCRSR RPGPRGAAARPGTPGQAEPPAAAEVQGAGNENEPREADKSHPEQRELRPRLCTMKKGP SGYGFNLHSDKSKPGQFIRSVDPDSPAEASGLRAQDRIVEVNGVCMEGKQHGDVVSAI RAGGDETKLLVVDRETDEFFKKCRVIPSQEHLNGPLPVPFTNGEIQKENSREALAEAA LESPRPALVRSASSDTSEELNSQDSPPKQDSTAPSSTSSSDPILDFNISLAMAKERAH QKRSSKRAPQMDWSKKNELFSNL" BASE COUNT 418 a 629 c 587 g 363 t ORIGIN 1 cttggcacga gggattggtc tgtgctcctc tctcggctcc tcgcggctcg cggcggccga 61 cggttcctgg gacacctgct tgcttggccc gtccggcggc tcagggcttc tctgctgcgc 121 tcccggttcg ctggacggga agaagggctg ggccgtcccg tcccgtcccc atcggaaccc 181 caagtcgcgc cgctgacccg tcgcagggcg agatgagcgc ggacgcagcg gccggggcgc 241 ccctgccccg gctctgctgc ctggagaagg gtccgaacgg ctacggcttc cacctgcacg 301 gggagaaggg caagttgggc cagtacatcc ggctggtgga gcccggctcg ccggccgaga 361 aggcggggct gctggcgggg gaccggctgg tggaggtgaa cggcgaaaac gtggagaagg 421 agacccacca gcaggtggtg agccgcatcc gcgccgcact caacgccgtg cgcctgctgg 481 tggtcgaccc cgaaacggac gagcactgca gaagccggcg tccaggtccg agaggagctg 541 ctgcgcgccc aggaacgccg gggcaggccg agccgccggc cgccgccgag gtgcaggggg 601 ctggcaacga aaatgagcct cgcgaggccg acaagagcca cccggagcag cgcgagcttc 661 ggcctcggct ctgtaccatg aagaagggcc ccagtggcta tggcttcaac ctgcacagcg 721 acaagtccaa gccaggccag ttcatccggt cagtggaccc agactccccg gctgaggctt 781 cagggctccg ggcccaggat cgcattgtgg aggtgaacgg ggtctgcatg gaggggaagc 841 agcatgggga cgtggtgtcc gccatcaggg ctggcgggga cgagaccaag ctgctggtgg 901 tggacaggga aactgacgag ttcttcaaga aatgcagagt gatcccatct caggagcacc 961 tgaatggtcc cctgcctgtg cccttcacca atggggagat acagaaggag aacagtcgtg 1021 aagccctggc agaggcagcc ttggagagcc ccaggccagc cctggtgaga tccgcctcca 1081 gtgacaccag cgaggagctg aattcccaag acagcccccc aaaacaggac tccacagcgc 1141 cctcgtctac ctcctcctcc gaccccatcc tagacttcaa catctccctg gccatggcca 1201 aagagagggc ccaccagaaa cgcagcagca aacgggcccc gcagatggac tggagcaaga 1261 aaaacgaact cttcagcaac ctctgagcgc cctgctgcca cccagtgact ggcagggccg 1321 agccagcatt ccaccccacc tttttccttc tccccaatta ctcccctgaa tcaatgtaca 1381 aatcagcacc cacatcccct ttcttgacaa atgatttttc tagagaacta tgttcttccc 1441 tgactttagg gaaggtgaat gtgttcccgt cctcccgcag tcagaaagga gactctgcct 1501 ccctcctcct cactgagtgc ctcatcctac cgggtgtccc tttgccaccc tgcctgggac 1561 atcgctggaa cctgcaccat gccaggatca tgggaccagg cgagagggca ccctcccttc 1621 ctcccccatg tgataaatgg gtccagggct gatcaaagaa ctctgactgc agaactgccg 1681 ctctcagtgg acagggcatc tgttatcctg aaccttggca gacacgtctt gttttcattt 1741 gattttgtta agagtgcagt attgcagagt ctagaggaat ttttgtttcc ttgattaaca 1801 tgattttcct ggttgttaat ccagggcatg gcagtggcct cagccttaaa cttttgttcc 1861 tactcccacc ctcagcgaac tgggcagcac ggggagggtt tggctacccc tgcccatccc 1921 tgagccaggt accaccattg taaggaaaca ctttcagaaa ttcagctggt tcctccaaac 1981 caaaaaaaaa aaaaaaa // LOCUS AF015950 4015 bp mRNA PRI 16-AUG-1997 DEFINITION Homo sapiens telomerase reverse transcriptase (hTRT) mRNA, complete cds. ACCESSION AF015950 NID g2330016 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4015) AUTHORS Nakamura,T.M., Morin,G.B., Chapman,K.B., Weinrich,S.L., Andrews,W.H., Lingner,J., Harley,C.B. and Cech,T.R. TITLE Telomerase catalytic subunit homologs from fission yeast and human JOURNAL Science 277 (5328), 955-959 (1997) MEDLINE 97400623 REFERENCE 2 (bases 1 to 4015) AUTHORS Morin,G.B. TITLE Direct Submission JOURNAL Submitted (24-JUL-1997) Geron Corporation, 230 Constitution Drive, Menlo Park, CA 94025, USA FEATURES Location/Qualifiers source 1..4015 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" /map="5p" /dev_stage="embryo" /chromosome="5" gene 1..4015 /gene="hTRT" CDS 56..3454 /gene="hTRT" /codon_start=1 /product="telomerase reverse transcriptase" /db_xref="PID:g2330017" /translation="MPRAPRCRAVRSLLRSHYREVLPLATFVRRLGPQGWRLVQRGDP AAFRALVAQCLVCVPWDARPPPAAPSFRQVSCLKELVARVLQRLCERGAKNVLAFGFA LLDGARGGPPEAFTTSVRSYLPNTVTDALRGSGAWGLLLRRVGDDVLVHLLARCALFV LVAPSCAYQVCGPPLYQLGAATQARPPPHASGPRRRLGCERAWNHSVREAGVPLGLPA PGARRRGGSASRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGRTRGPSDRGFCVVSPA RPAEEATSLEGALSGTRHSHPSVGRQHHAGPPSTSRPPRPWDTPCPPVYAETKHFLYS SGDKEQLRPSFLLSSLRPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQMRPL FLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAREKPQGSVAAPEEEDTDPRRLVQ LLRQHSSPWQVYGFVRACLRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQEL TWKMSVRDCAWLRRSPGVGCVPAAEHRLREEILAKFLHWLMSVYVVELLRSFFYVTET TFQKNRLFFYRKSVWSKLQSIGIRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFI PKPDGLRPIVNMDYVVGARTFRREKRAERLTSRVKALFSVLNYERARRPGLLGASVLG LDDIHRAWRTFVLRVRAQDPPPELYFVKVDVTGAYDTIPQDRLTEVIASIIKPQNTYC VRRYAVVQKAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHLQETSPLRDAVVIEQSSSL NEASSGLFDVFLRFMCHHAVRIRGKSYVQCQGIPQGSILSTLLCSLCYGDMENKLFAG IRRDGLLLRLVDDFLLVTPHLTHAKTFLRTLVRGVPEYGCVVNLRKTVVNFPVEDEAL GGTAFVQMPAHGLFPWCGLLLDTRTLEVQSDYSSYARTSIRASLTFNRGFKAGRNMRR KLFGVLRLKCHSLFLDLQVNSLQTVCTNIYKILLLQAYRFHACVLQLPFHQQVWKNPT FFLRVISDTASLCYSILKAKNAGMSLGAKGAAGPLPSEAVQWLCHQAFLLKLTRHRVT YVPLLGSLRTAQTQLSRKLPGTTLTALEAAANPALPSDFKTILD" BASE COUNT 663 a 1363 c 1275 g 714 t ORIGIN 1 gcagcgctgc gtcctgctgc gcacgtggga agccctggcc ccggccaccc ccgcgatgcc 61 gcgcgctccc cgctgccgag ccgtgcgctc cctgctgcgc agccactacc gcgaggtgct 121 gccgctggcc acgttcgtgc ggcgcctggg gccccagggc tggcggctgg tgcagcgcgg 181 ggacccggcg gctttccgcg cgctggtggc ccagtgcctg gtgtgcgtgc cctgggacgc 241 acggccgccc cccgccgccc cctccttccg ccaggtgtcc tgcctgaagg agctggtggc 301 ccgagtgctg cagaggctgt gcgagcgcgg cgcgaagaac gtgctggcct tcggcttcgc 361 gctgctggac ggggcccgcg ggggcccccc cgaggccttc accaccagcg tgcgcagcta 421 cctgcccaac acggtgaccg acgcactgcg ggggagcggg gcgtgggggc tgctgctgcg 481 ccgcgtgggc gacgacgtgc tggttcacct gctggcacgc tgcgcgctct ttgtgctggt 541 ggctcccagc tgcgcctacc aggtgtgcgg gccgccgctg taccagctcg gcgctgccac 601 tcaggcccgg cccccgccac acgctagtgg accccgaagg cgtctgggat gcgaacgggc 661 ctggaaccat agcgtcaggg aggccggggt ccccctgggc ctgccagccc cgggtgcgag 721 gaggcgcggg ggcagtgcca gccgaagtct gccgttgccc aagaggccca ggcgtggcgc 781 tgcccctgag ccggagcgga cgcccgttgg gcaggggtcc tgggcccacc cgggcaggac 841 gcgtggaccg agtgaccgtg gtttctgtgt ggtgtcacct gccagacccg ccgaagaagc 901 cacctctttg gagggtgcgc tctctggcac gcgccactcc cacccatccg tgggccgcca 961 gcaccacgcg ggccccccat ccacatcgcg gccaccacgt ccctgggaca cgccttgtcc 1021 cccggtgtac gccgagacca agcacttcct ctactcctca ggcgacaagg agcagctgcg 1081 gccctccttc ctactcagct ctctgaggcc cagcctgact ggcgctcgga ggctcgtgga 1141 gaccatcttt ctgggttcca ggccctggat gccagggact ccccgcaggt tgccccgcct 1201 gccccagcgc tactggcaaa tgcggcccct gtttctggag ctgcttggga accacgcgca 1261 gtgcccctac ggggtgctcc tcaagacgca ctgcccgctg cgagctgcgg tcaccccagc 1321 agccggtgtc tgtgcccggg agaagcccca gggctctgtg gcggcccccg aggaggagga 1381 cacagacccc cgtcgcctgg tgcagctgct ccgccagcac agcagcccct ggcaggtgta 1441 cggcttcgtg cgggcctgcc tgcgccggct ggtgccccca ggcctctggg gctccaggca 1501 caacgaacgc cgcttcctca ggaacaccaa gaagttcatc tccctgggga agcatgccaa 1561 gctctcgctg caggagctga cgtggaagat gagcgtgcgg gactgcgctt ggctgcgcag 1621 gagcccaggg gttggctgtg ttccggccgc agagcaccgt ctgcgtgagg agatcctggc 1681 caagttcctg cactggctga tgagtgtgta cgtcgtcgag ctgctcaggt ctttctttta 1741 tgtcacggag accacgtttc aaaagaacag gctctttttc taccggaaga gtgtctggag 1801 caagttgcaa agcattggaa tcagacagca cttgaagagg gtgcagctgc gggagctgtc 1861 ggaagcagag gtcaggcagc atcgggaagc caggcccgcc ctgctgacgt ccagactccg 1921 cttcatcccc aagcctgacg ggctgcggcc gattgtgaac atggactacg tcgtgggagc 1981 cagaacgttc cgcagagaaa agagggccga gcgtctcacc tcgagggtga aggcactgtt 2041 cagcgtgctc aactacgagc gggcgcggcg ccccggcctc ctgggcgcct ctgtgctggg 2101 cctggacgat atccacaggg cctggcgcac cttcgtgctg cgtgtgcggg cccaggaccc 2161 gccgcctgag ctgtactttg tcaaggtgga tgtgacgggc gcgtacgaca ccatccccca 2221 ggacaggctc acggaggtca tcgccagcat catcaaaccc cagaacacgt actgcgtgcg 2281 tcggtatgcc gtggtccaga aggccgccca tgggcacgtc cgcaaggcct tcaagagcca 2341 cgtctctacc ttgacagacc tccagccgta catgcgacag ttcgtggctc acctgcagga 2401 gaccagcccg ctgagggatg ccgtcgtcat cgagcagagc tcctccctga atgaggccag 2461 cagtggcctc ttcgacgtct tcctacgctt catgtgccac cacgccgtgc gcatcagggg 2521 caagtcctac gtccagtgcc aggggatccc gcagggctcc atcctctcca cgctgctctg 2581 cagcctgtgc tacggcgaca tggagaacaa gctgtttgcg gggattcggc gggacgggct 2641 gctcctgcgt ttggtggatg atttcttgtt ggtgacacct cacctcaccc acgcgaaaac 2701 cttcctcagg accctggtcc gaggtgtccc tgagtatggc tgcgtggtga acttgcggaa 2761 gacagtggtg aacttccctg tagaagacga ggccctgggt ggcacggctt ttgttcagat 2821 gccggcccac ggcctattcc cctggtgcgg cctgctgctg gatacccgga ccctggaggt 2881 gcagagcgac tactccagct atgcccggac ctccatcaga gccagtctca ccttcaaccg 2941 cggcttcaag gctgggagga acatgcgtcg caaactcttt ggggtcttgc ggctgaagtg 3001 tcacagcctg tttctggatt tgcaggtgaa cagcctccag acggtgtgca ccaacatcta 3061 caagatcctc ctgctgcagg cgtacaggtt tcacgcatgt gtgctgcagc tcccatttca 3121 tcagcaagtt tggaagaacc ccacattttt cctgcgcgtc atctctgaca cggcctccct 3181 ctgctactcc atcctgaaag ccaagaacgc agggatgtcg ctgggggcca agggcgccgc 3241 cggccctctg ccctccgagg ccgtgcagtg gctgtgccac caagcattcc tgctcaagct 3301 gactcgacac cgtgtcacct acgtgccact cctggggtca ctcaggacag cccagacgca 3361 gctgagtcgg aagctcccgg ggacgacgct gactgccctg gaggccgcag ccaacccggc 3421 actgccctca gacttcaaga ccatcctgga ctgatggcca cccgcccaca gccaggccga 3481 gagcagacac cagcagccct gtcacgccgg gctctacgtc ccagggaggg aggggcggcc 3541 cacacccagg cccgcaccgc tgggagtctg aggcctgagt gagtgtttgg ccgaggcctg 3601 catgtccggc tgaaggctga gtgtccggct gaggcctgag cgagtgtcca gccaagggct 3661 gagtgtccag cacacctgcc gtcttcactt ccccacaggc tggcgctcgg ctccacccca 3721 gggccagctt ttcctcacca ggagcccggc ttccactccc cacataggaa tagtccatcc 3781 ccagattcgc cattgttcac ccctcgccct gccctccttt gccttccacc cccaccatcc 3841 aggtggagac cctgagaagg accctgggag ctctgggaat ttggagtgac caaaggtgtg 3901 ccctgtacac aggcgaggac cctgcacctg gatgggggtc cctgtgggtc aaattggggg 3961 gaggtgctgt gggagtaaaa tactgaatat atgagttttt cagttttgaa aaaaa // LOCUS AF015956 2462 bp mRNA PRI 13-AUG-1997 DEFINITION Homo sapiens Fas-binding protein Daxx mRNA, complete cds. ACCESSION AF015956 NID g2323471 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2462) AUTHORS Kiriakidou,M., Driscoll,D.A., Lopez-Guisa,J.M. and Strauss,J.F. III. TITLE Cloning and expression of primate Daxx cDNAs and mapping of the human gene to chromosome 6p21.3 in the MHC region JOURNAL DNA Cell Biol. (1997) In press REFERENCE 2 (bases 1 to 2462) AUTHORS Kiriakidou,M., Driscoll,D.A., Lopez-Guisa,J.M. and Strauss,J.F. III. TITLE Direct Submission JOURNAL Submitted (24-JUL-1997) Center for Research on Women's Health and Reproduction, University of Pennsylvania, 415 Curie Boulevard, Philadelphia, PA 19104, USA FEATURES Location/Qualifiers source 1..2462 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6p21.3" /cell_line="HeLa" CDS 55..2277 /codon_start=1 /product="Fas-binding protein Daxx" /db_xref="PID:g2323472" /translation="MATANSIIVLDDDDEDEAAAQPGPSHPLPNAASPGAEAPSSSEP HGARRSSSSGGKKCYKLENEKLFEKFLELCKMQTADHPEVVPFLYNRQQRAHSLFLAS AEFCNILSRVLSRARSRPAKLYVYINELCTVLKAHSAKKKLNLAPAATTSNEPSGNNP PTHLSLDPTNAENTASQSPRTRGSRRQIQRLEQLLALYVAEIRRLQEKELDLSELDDP DSAYLQEARLKRKLIRLFGRLCELKDCSSLTGRVIEQRIPYRGTRYPEVNRRIERLIN KPGPDTFPDYGDVLRAVEKAAARHSLGLPRQQLQLMAQDAFRDVGIRLQERRHLDLIY NFGCHLTDDYRPGVDPALSYPVSARRLRENRILALSRLDQVISFYAMLQDGGEEGKKK KRRARLHGPSSHSANPPEPSLDSGEGPIGMASQGCPSASRAETDDEDDEESDEEEEEE EEEEEEEATDSEEEEDLEQMQEGQEDDEEEDEEEEAAAGKDGDKSPMSSLQISNEKKL EPGKQISRFSGEQQNKGRIVSPSLLSEEPLAPSSIDAESNGEQPEELTLEEESPVSQL FELEIEALPLDTPSSVETDISSSRKQSEEPFTTVLENGAGMVSSTSFNGGVSPHNWGD SGPPCKKSRKEKKQTGSGPLGNSYVERQRSVHEKNGKKICTLPSPPSPLASLAPVADS STRVDSPSHGLVTSSLCIPSPARLSQTPHSQPPRPGTCKTSVATQCDPEEIIVLSDSD " BASE COUNT 623 a 687 c 647 g 505 t ORIGIN 1 agcaggaatt ctgaaatccc caccacttcc tccctccggg ggatttgatc ccctatggcc 61 accgctaaca gcatcatcgt gctggatgat gatgacgaag atgaagcagc tgctcagcca 121 gggccctccc acccactccc caatgcggcc tcacctgggg cagaagcccc tagctcctct 181 gagcctcatg gggccagaag aagcagtagt tcgggcggca agaaatgcta caagctggag 241 aatgagaagc tgttcgaaaa gttccttgaa ctttgtaaga tgcagacagc agaccatcct 301 gaggtggtcc cattcctcta taaccggcag caacgtgccc actctctgtt tttggcctcg 361 gcggagttct gcaacatcct ctctagggtc ctgtctcggg cccggagccg gccagccaag 421 ctctatgtct acatcaatga gctctgcact gttctcaagg cccactcagc caaaaagaag 481 ctgaacttgg cccctgccgc caccacctcc aatgagccct ctgggaataa ccctcccaca 541 cacctctcct tggaccccac aaatgctgaa aacactgcct ctcagtctcc aaggacccgt 601 ggttcccggc ggcagatcca gcgtttggag cagctgctgg cgctctatgt ggcagagatc 661 cggcggctgc aggaaaagga gttggatctc tcagaattgg atgacccaga ctccgcatac 721 ctgcaggagg cacggttgaa gcgtaagctg atccgcctct ttgggcgact atgtgagctg 781 aaagactgct cttcactgac cggccgtgtc atagagcagc gcatccccta ccgtggcacc 841 cgctacccag aggttaacag gcgcattgag cggctcatca acaagccagg gcctgatacc 901 ttccctgact atggggatgt gcttcgggct gtagagaagg cagctgcccg acacagcctt 961 ggcctccccc gacagcagct ccagctcatg gctcaggatg ccttccgaga tgtgggcatc 1021 aggttacagg agcgacgtca cctcgatctc atttacaact ttggctgcca cctcacagat 1081 gactataggc caggcgttga ccctgcacta tcttatcctg tgtcggcccg gcgccttcgg 1141 gaaaaccgga ttttggcctt gagtcggctg gatcaggtca tctcctttta tgcaatgttg 1201 caagacgggg gtgaggaggg caaaaaaaaa aagagaagag ctcggctcca cggcccctct 1261 tcccactctg caaacccccc cgaaccctcc ttggattctg gtgagggccc tattggaatg 1321 gcatcccagg ggtgcccttc tgcctccaga gctgagacag atgacgaaga cgatgaggag 1381 agtgatgagg aagaggagga ggaggaggaa gaagaagagg aggaggccac agattctgaa 1441 gaggaggagg atttggaaca gatgcaggag ggtcaggagg atgatgaaga ggaggacgaa 1501 gaggaagaag cagcagcagg taaagatgga gacaagagcc ccatgtcctc actacagatt 1561 tccaatgaaa agaaactgga acctggcaaa cagatcagca gattttcagg ggagcagcaa 1621 aacaaaggac gcatagtgtc accatcgtta ctgtcagaag aacccctggc cccctccagc 1681 atagatgctg aaagcaatgg agaacagcct gaggagctga ccctggagga agaaagccct 1741 gtgtctcagc tctttgagct agagattgaa gctttgcccc tggatacccc ttcctctgtg 1801 gagacggaca tttcctcttc caggaagcaa tcagaggagc ccttcaccac tgttttagag 1861 aatggagcag gcatggtctc ttctacttcc ttcaatggag gcgtctctcc tcacaactgg 1921 ggagattctg gtcccccctg caaaaaatct cggaaggaga agaagcaaac aggatcaggg 1981 ccattaggaa acagctatgt ggaaaggcaa aggtcagtgc atgagaagaa tgggaaaaag 2041 atatgtaccc tgcccagccc accttccccc ttggcttcct tggccccagt tgctgattcc 2101 tccacgaggg tggactctcc cagccatggc ctggtgacca gctccctctg catcccttct 2161 ccagcccggc tgtcccaaac cccccattca cagcctcctc ggcctggtac ttgcaagaca 2221 agtgtggcca cacaatgcga tccagaagag atcatcgtgc tctcagactc tgattagctg 2281 cctccccttc tccctgcctc cagaatgttc tgggataaca tttggaggaa ggtgggaagc 2341 agatgactga ggaagggatg gactaagcta atcccctttt ggtggtgttt ctttaaaaaa 2401 aaaaaaaaag cttaagtttt acacagaaac attaataaac aataaagttc ttttcttact 2461 gt // LOCUS AF016028 756 bp mRNA PRI 01-JAN-1998 DEFINITION Homo sapiens sarcospan-2 (SPN2) mRNA, complete cds. ACCESSION AF016028 NID g2731763 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 756) AUTHORS Heighway,J., Betticher,D.C., Hoban,P.R., Altermatt,H.J. and Cowen,R. TITLE Coamplification in tumors of KRAS2, type 2 inositol 1,4,5 triphosphase receptor gene, and a novel human gene, KRAG JOURNAL Unpublished REFERENCE 2 (bases 1 to 756) AUTHORS Crosbie,R.H., Heighway,J., Venzke,D.P. and Campbell,K.P. TITLE Sarcospan: a novel multi-transmembrane component of the dystrophin-glycoprotein complex JOURNAL Unpublished REFERENCE 3 (bases 1 to 756) AUTHORS Crosbie,R.H., Heighway,J., Venzke,D.P. and Campbell,K.P. TITLE Direct Submission JOURNAL Submitted (24-JUL-1997) Physiology & Biophysics, University of Iowa, 400 Eckstein Medical Research Building, Iowa City, IA 52242, USA FEATURES Location/Qualifiers source 1..756 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="12" /map="12p11.2" /tissue_type="skeletal muscle" gene 1..756 /gene="SPN2" CDS 79..399 /gene="SPN2" /note="component of the dystrophin-glycoprotein complex; predicted to have four transmembrane spanning domains, similar to the tetraspans" /codon_start=1 /product="sarcospan-2" /db_xref="PID:g2731764" /translation="MEPKKGTGAPKECGEEEPRTCCGCRFPLLLALLQLALGIAVTVV GFLMASISSSLLVRDTPFWAGIIVCLVAYLGLFMLCVSYQVDERTCIQFSMKLLYFLL SALG" BASE COUNT 135 a 216 c 220 g 185 t ORIGIN 1 atgggcaaga acaagcagcc acgcggccag cagaggcagg ggggcccgcc ggccgcggac 61 gccgctgggc ccgacgacat ggagccgaag aagggcacgg gggcccccaa ggagtgcggg 121 gaggaggagc cccggacctg ctgcggctgc cggttcccgc tgctgctcgc cctgctgcag 181 ctggccctgg gcatcgccgt gaccgtggtg ggcttcctca tggcgagcat cagctcctcc 241 ctgctagtca gggacactcc attttgggct gggatcattg tctgcttagt ggcctatctt 301 ggcttgttta tgctttgtgt ctcatatcag gttgacgaac ggacatgtat tcaattttct 361 atgaaactgt tatactttct gctgagtgcc ctgggctgac ggtctgtgtg ctggccgtgg 421 cctttgccgc ccaccactat tcgcagctca cacagtttac ctgtgagacc acactcgact 481 cttgccagtg caaactgccc tcctcggagc cgctcagcag gacctttgtt taccgggatg 541 tgacggactg taccagcgtc actggcactt tcaaactgtt cttactcatc cagatgattc 601 ttaatttggt ctgcggcctt gtgtgcttgt tggcctgctt tgtgatgtgg aaacataggt 661 accaggtctt ctatgtgggt gtcaggatat gctccctcac ggcttccgaa ggcccccagc 721 aaaagatcta acattcttgc tcaaagttgc gagaga // LOCUS AF016045 1767 bp mRNA PRI 05-OCT-1997 DEFINITION Homo sapiens OVO-like 1 binding protein (OVOL1) mRNA, complete cds. ACCESSION AF016045 NID g2465206 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1767) AUTHORS Chidambaram,A., Allikmet,R., Chandrasekarappa,S.C., Guru,S.C., Modi,W., Gerrard,B. and Dean,M. TITLE Characterization of a human homolog of the Drosophila Ovo gene which maps to chromosome 11q13 JOURNAL Mamm. Genome (1997) In press REFERENCE 2 (bases 1 to 1767) AUTHORS Chidambaram,A., Gerrard,B. and Dean,M. TITLE Direct Submission JOURNAL Submitted (25-JUL-1997) Human Genetics Section, National Cancer Institute, Frederick, MD 21702, USA FEATURES Location/Qualifiers source 1..1767 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11q13" gene 1..1767 /gene="OVOL1" CDS 479..1024 /gene="OVOL1" /codon_start=1 /product="OVO-like 1 binding protein" /db_xref="PID:g2465207" /translation="MGHLTDPQSRDHGFLRTKMKVTLGDSPSGDLFTCRVCQKAFTYQ RMLNRHMKCHNDVKRHLCTGCGKGFNDTFDLKRHVRTHTGVRPYKCSLCDKAFTQRCS LESHLKKIHGVQQKYAYKERRAKLYVCEECGCTSESQEGHVLHLKEHHPDSPLLRKTS KKVAVALQNTVTSLLQGSPHL" BASE COUNT 375 a 538 c 500 g 354 t ORIGIN 1 gaattcagag ggaccagccc aggtggggca ggcagaaaac tccggaggga taattggcag 61 ggcagttcgg tcctcccagt ttccgctaag acacaaatta ctcagcgggc agtggcgctt 121 ggctggcatc acctgcaatt accttaatag ggcgggaagg cggtgcagca gtggtggtcg 181 gggcagcccc gctctgctgc atgcctgcga tgggagggac ggagtctcca ggttccagcc 241 gtctcatggc tagggaagtt tccagaaacc cgctggaccc tctcttctcc accaagcctc 301 tcacctgtct gttctctccc cagtcagcct gggcttctgc caccacagcc ctaccgggag 361 ccggaaccct ctgtggccga acccccttcc tgcccgctgg ctttgaacat gagccttcga 421 gactctagct acagcatggc ccccgggctg tgtggtggcc cagctgccct ctgaagacat 481 gggccacttg acagaccccc agagcagaga ccatggcttc ctgcgcacca agatgaaggt 541 gacccttggg gacagtccca gtggagacct gttcacctgc cgtgtctgcc agaaggcctt 601 cacctaccag cgcatgctga accgccacat gaagtgtcac aacgacgtca agaggcacct 661 ctgcacgggc tgcgggaagg gcttcaatga caccttcgac ctcaagagac acgtccgaac 721 tcacactggc gtgcggccct acaagtgcag cctgtgtgac aaggccttca cgcagcgctg 781 ctctctggag tctcacctca agaagatcca tggtgtgcag cagaagtacg cgtacaagga 841 gcggcgggcc aagctgtacg tgtgtgagga gtgcggctgc acatctgaga gccaggaggg 901 ccacgtcctg cacctgaagg agcaccaccc tgacagcccg ctgctgcgca agacctccaa 961 gaaggtggcc gtggcactac agaacactgt cacttccctg ctgcagggca gcccccacct 1021 gtgagtggct cgagccctgg gggtgctcct ggaagcccca agagcatcca ggattgcctc 1081 ccagctgcct ggccagccca ccctcctgca acctctcacc cgaacaccag tgatcaggac 1141 tggagccccc gtgccttggt ctcccccctg ggcacacgtg ctcactcagg cccagcaatg 1201 acctctgctc atttttgcat ttttgactta tgggccgagg ctgttctgag cctgggaaga 1261 tgtacctatg tcaagagaag ggatgaggca aggctgcctt caattagaag cagccgccca 1321 cagagacaca ctgtgtgcct ggcagcagga cttcctaccc agaggaggtt cgagctagga 1381 tcccactgcc cccgcctctc agcacagggc aggggctgca ggtccccagt ggacatcaga 1441 gtcaaaatca ctggcaaagg gtacccctgc aaacaactgt ggtgggggct ggcagcagac 1501 cccccacctg gcagggcttc taatgctcag ggttctggag ggctctgtcc ttccggcaag 1561 gagaggcaca catgtgctgc agccgtgtgt gtgcgtgtgc ttgtgtgtgt gcactgctgt 1621 gtgtgtgtgc acgcacagga agcctttcca catatcacct catttctaag aaataaacta 1681 caaggtgcca agaaggtttt atttcctttt attttttaaa gatgacaaat gtacagatgt 1741 taatatattt ttggtgccaa tgcgatg // LOCUS AF016270 3154 bp mRNA PRI 02-DEC-1997 DEFINITION Homo sapiens thyroid hormone receptor coactivating protein mRNA, complete cds. ACCESSION AF016270 NID g2655005 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3154) AUTHORS Monden,T., Wondisford,F.E. and Hollenberg,A.N. TITLE Isolation and characterization of a novel ligand-dependent thyroid hormone receptor-coactivating protein JOURNAL J. Biol. Chem. 272 (47), 29834-29841 (1997) MEDLINE 98037815 REFERENCE 2 (bases 1 to 3154) AUTHORS Monden,T., Wondisford,F.E. and Hollenberg,A.N. TITLE Direct Submission JOURNAL Submitted (28-JUL-1997) Endocrinology, Beth Israel Deaconess Medical Center, 330 Brookline Ave., Boston, MA 02159, USA FEATURES Location/Qualifiers source 1..3154 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" CDS 81..2843 /note="p120; similar to Homo sapiens skeletal muscle abundant protein encoded by the sequence presented in the file with GenBank Accession Number X87613" /codon_start=1 /product="thyroid hormone receptor coactivating protein" /db_xref="PID:g2655006" /translation="MRSGDQNWVSVSRAIKPFAEPGRPPDWFSQKHCASQYSELLETT ETPKRKRGEKGEVVETVEDVIVRKLTAERVEELKKVIKETQERYRRLKRDAELIQAGH MDSRLDELCNDIATKKKLEEEEAEVKRKATDAAYQARQAVKTPPRRLPTVMVRSPIDS ASPGGDYPLGDLTPTTMEEATSGVTPGTLPSTPVTSFPGIPDTLPPGSAPLEAPMTPV TDDSPQKKMLGQKATPPPSPLLSELLKKGSLLPTSPRLVNESEMAVASGHLNSTGVLL EVGGVLPMIHGGEIQQTPNTVAASPAASGAPTLSRLLEAGPTQFTTPLASFTTVASEP PVKLVPPPVESVSHATIVMMPALPAPSSAPAVSTTESVAPVSQPDNCVPMEAVGDPHT VTVSMDSSEISMIINSIKEECFRSGVAEAPVGSKAPSIDGKEELDLAEKMDIAVSYTG EELDFETVGDIIAIIEDKVDDHPEVLDVAAVEAALSFCEENDDPQSLPGPWEHPIQQE RDKPVPLPAPEMTVKQERLDFEETENKGIHELVDIREPSAEIKVEPAEPEPVISGAEI VAGVVPATSMEPPELRSQDLDEELGSTAAGEIVEADVAIGKGDETPLTNVKTEASPES MLSPSHGSNPIEDPLEAETQHKFEMSDSLKEESGTIFGSQIKDAPGEDEEEDGVSEAA SLEEPKEEDQGEGYLSEMDNEPPVSESDDGFSIHNATLQSHTLADSIPSSPASSQFSV CSEDQEAIQAQKIWKKAIMLVWRAAANHRYANVFLQPVTDDIAPGYHSIVQRPMDLST IKKNIENGLIRSTAEFQRDIMLMFQNAVMYNSSDHDVYHMAVEMQRDVLEQIQQFLAT QLIMQTSESGISAKSLRGRDSTRKQDASEKDSVPMGSPAFLLSLFDGGTRGRRCAIEA DMKMKK" BASE COUNT 901 a 734 c 811 g 708 t ORIGIN 1 cgggcaaaca caagctgcta agcactggcc ccacagagcc atggtccatc cgagagaagc 61 tatgtttagc atcttctgtc atgagaagtg gcgatcaaaa ttgggtatca gttagcagag 121 caatcaagcc ctttgcagaa cctggccgcc ctccagactg gttctctcaa aaacattgtg 181 cttcccagta ctcggagctt ttagagacca ctgagacacc aaaacggaaa cgaggtgaaa 241 agggagaagt ggtggaaact gttgaagatg ttattgttcg gaaattgact gctgagcgag 301 ttgaagaact aaagaaagtg ataaaggaaa cccaggagag atatagacgg ctaaagagag 361 atgcagaact aattcaagct ggacacatgg acagcagact ggatgagctt tgcaatgaca 421 ttgcaacgaa aaagaaattg gaagaagagg aggctgaagt aaagaggaag gctacagatg 481 ctgcatacca ggctcgtcaa gcagtaaaaa cacccccccg gaggttaccc actgtgatgg 541 ttcgctctcc tatagattct gcctccccag gaggtgatta tccacttggg gacttgactc 601 caaccactat ggaagaggct acctctgggg taacccccgg gactttgccg agtaccccag 661 tcacctcgtt tcctgggatt cctgacaccc ttcctccagg ctctgcaccc ttagaagccc 721 ccatgacccc agtaacagat gattcacccc agaaaaagat gcttggacag aaagcaactc 781 cacccccctc ccctctgctg tcagagctct tgaagaaggg cagcctcctg cctactagcc 841 ccagactggt caatgagagt gaaatggctg tggcttctgg ccacctgaac agtacaggtg 901 tcctcctgga ggtaggcggg gtccttccca tgatacatgg tggggagata cagcaaacac 961 ccaatactgt tgcagcctcc cctgctgcat caggtgctcc cactctttcc cggcttttag 1021 aagctggtcc tacacagttc accacacctc ttgcttcctt cactactgtt gccagtgagc 1081 ctccagttaa acttgtgcca ccccctgtag agtctgtgtc ccacgctacc attgtcatga 1141 tgcctgcgct gccagcacca tcctctgctc cggctgtctc cactactgaa agtgtagctc 1201 cagtgagtca acccgacaac tgtgttccca tggaggctgt gggggatcca catactgtga 1261 ctgtttccat ggacagcagt gaaatatcca tgatcatcaa ttctatcaaa gaagagtgtt 1321 ttcgatcagg ggtagcagag gctcctgttg gatcaaaggc tcccagcata gatgggaagg 1381 aagaattaga tctggctgag aagatggata ttgctgtgtc ttacacaggt gaagagctgg 1441 attttgagac tgttggagac atcattgcca tcattgagga caaggtagat gatcatcctg 1501 aagtgctgga tgtggcagca gtggaagcag cactgtcatt ttgtgaagaa aatgatgatc 1561 ctcagtccct gcctggcccc tgggagcatc ctatccagca ggagcgggac aagccagtac 1621 ctctccctgc accagaaatg acggtcaagc aagagagact ggactttgag gaaacggaaa 1681 acaagggaat acatgaactg gtggacatca gggagcccag tgcagagatc aaggtggaac 1741 ctgcagaacc agagccagtc atttcaggag ccgaaatagt agctggagtt gttccagcca 1801 caagtatgga gccaccagaa ctcaggagtc aggacttaga tgaggaactg ggaagtactg 1861 cagctggaga gattgttgaa gcagatgttg ccattgggaa aggcgatgag actccactta 1921 caaatgtgaa gacagaggca tcccctgaaa gcatgttgtc tccatcacat ggctcaaatc 1981 ccattgaaga tcctttagag gcagagactc agcacaagtt tgaaatgtca gactcattga 2041 aagaagaatc agggactatt tttggaagcc agataaagga tgccccaggt gaggatgagg 2101 aggaagatgg tgtcagtgaa gcggccagcc tagaggagcc taaggaagag gatcaaggag 2161 aaggctactt gtcagaaatg gataatgaac ctcctgtgag cgagagtgat gatggcttca 2221 gcatacacaa tgctacactg cagtcacaca cactggcaga ctccatcccc agcagccctg 2281 cttcttcaca gttctctgtc tgtagtgagg atcaggaagc tattcaggca cagaaaattt 2341 ggaagaaagc catcatgctt gtatggagag ctgcagctaa tcataggtat gccaatgtct 2401 tcctgcagcc tgttacagat gacatagcac ctggctacca cagcattgtg cagaggccta 2461 tggatttgtc aactattaag aaaaacatag aaaatggact gatccgaagc acagctgaat 2521 ttcagcgtga cattatgctg atgtttcaga atgctgtaat gtacaatagc tcagaccatg 2581 atgtctatca catggcagtg gagatgcagc gagatgtctt ggaacagatc cagcaattct 2641 tggccacgca gttgattatg caaacatccg agtctgggat cagtgctaaa agtcttcgag 2701 ggagagattc tacccgcaaa caggatgctt cagagaagga cagtgtccca atgggctctc 2761 ctgccttcct tctctctctc tttgatggag gaaccagggg acgccgctgt gccattgaag 2821 cagatatgaa gatgaaaaag tgaagcctca gagttaccct ctttgagccg aacctaaaat 2881 aaaagtaaac aagatagagc ttgggcttgc gggcccagtt ccagaggtgg aagttacaga 2941 agaggaggta cctgggccac acgacatgag ctggaaaatc tctcttagag agttggagta 3001 gcacaattgc ctgttttagg gcagaaacca tgggctatgt taatgtccta atgtgtagct 3061 agcagatcgt agctagtttg tattgtcttg tcaattgtac agacttttta aaaaaaacaa 3121 ccaccagtga aatgtgtgtg tatacaataa actg // LOCUS AF016295 1920 bp mRNA PRI 16-JAN-1998 DEFINITION Homo sapiens Ets transcription factor (ELF3) mRNA, complete cds. ACCESSION AF016295 NID g2384739 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1920) AUTHORS Tymms,M.J., Ng,A.Y., Thomas,R.S., Schutte,B.C., Zhou,J., Eyre,H.J., Sutherland,G.R., Seth,A., Rosenberg,M., Papas,T., Debouck,C. and Kola,I. TITLE A novel epithelial-expressed ETS gene, ELF3: human and murine cDNA sequences, murine genomic organization, human mapping to 1q32.2 and expression in tissues and cancer JOURNAL Oncogene 15 (20), 2449-2462 (1997) MEDLINE 98055619 REFERENCE 2 (bases 1 to 1920) AUTHORS Tymms,M.J., Ng,A.Y.N. and Kola,I. TITLE Direct Submission JOURNAL Submitted (29-JUL-1997) Molecular Genetics and Development Group, Institute of Reproduction and Development, Monash Medical Centre, 246 Clayton Road, Clayton, Victoria 3168, Australia FEATURES Location/Qualifiers source 1..1920 /organism="Homo sapiens" /db_xref="taxon:9606" /map="1q32.2" /chromosome="1" /dev_stage="fetal" /tissue_type="lung" gene 1..1920 /gene="ELF3" CDS 115..1230 /gene="ELF3" /codon_start=1 /product="Ets transcription factor" /db_xref="PID:g2384740" /translation="MAATCEISNIFSNYFSAMYSSEDSTLASVPPAATFGADDLVLTL SNPQMSLEGTEKASWLGEQPQFWSKTQVLDWISYQVEKNKYDASAIDFSRCDMDGATL CNCALEELRLVFGPLGDQLHAQLRDLTSSSSDELSWIIELLEKDGMAFQEALDPGPFD QGSPFAQELLDDGQQASPYHPGSCGAGAPSPGSSDVSTAGTGASRSSHSSDSGGSDVD LDPTDGKLFPSDGFRDCKKGDPKHGKRKRGRPRKLSKEYWDCLEGKKSKHAPRGTHLW EFIRDILIHPELNEGLMKWENRHEGVFKFLRSEAVAQLWGQKKKNSNMTYEKLSRAMR YYYKREILERVDGRRLVYKFGKNSSGWKEEEVLQSRN" polyA_signal 1885..1900 /gene="ELF3" BASE COUNT 422 a 561 c 550 g 387 t ORIGIN 1 gccgggtagg ggagcgcagc ggccagatac ctcagcgcta cctggcggaa ctggatttct 61 ctcccgcctg ccggcctgcc tgccacagcc ggactccgcc accccggtag cctcatggct 121 gcaacctgtg agattagcaa catttttagc aactacttca gtgcgatgta cagctcggag 181 gactccacct tggcctctgt tccccctgct gccacctttg gggccgatga cttggtactg 241 accctgagca acccccagat gtcattggag ggtacagaga aggctagctg gttgggggaa 301 cagccccagt tctggtcgaa gacgcaggtt ctggactgga tcagctacca agtggagaag 361 aacaagtacg acgcaagcgc cattgacttc tcacgatgtg acatggatgg ggccaccctc 421 tgcaattgtg cccttgagga gctgcgtctg gtctttgggc ctctggggga ccaactccat 481 gcccagctgc gagacctcac ttccagctct tctgatgagc tcagttggat cattgagctg 541 ctggagaagg atggcatggc cttccaggag gccctagacc cagggccctt tgaccagggc 601 agcccctttg cccaggagct gctggacgac ggtcagcaag ccagccccta ccaccccggc 661 agctgtggcg caggagcccc ctcccctggc agctctgacg tctccaccgc agggactggt 721 gcttctcgga gctcccactc ctcagactcc ggtggaagtg acgtggacct ggatcccact 781 gatggcaagc tcttccccag cgatggtttt cgtgactgca agaaggggga tcccaagcac 841 gggaagcgga aacgaggccg gccccgaaag ctgagcaaag agtactggga ctgtctcgag 901 ggcaagaaga gcaagcacgc gcccagaggc acccacctgt gggagttcat ccgggacatc 961 ctcatccacc cggagctcaa cgagggcctc atgaagtggg agaatcggca tgaaggcgtc 1021 ttcaagttcc tgcgctccga ggctgtggcc caactatggg ggcaaaagaa aaagaacagc 1081 aacatgacct acgagaagct gagccgggcc atgaggtact actacaaacg ggagatcctg 1141 gaacgggtgg atggccggcg actcgtctac aagtttggca aaaactcaag cggctggaag 1201 gaggaagagg ttctccagag tcggaactga gggttggaac tatacccggg accaaactca 1261 cggaccactc gaggcctgca aaccttcctg ggaggacagg caggccagat ggcccctcca 1321 ctggggaatg ctcccagctg tgctgtggag agaagctgat gttttggtgt attgtcagcc 1381 atcgtcctgg gactcggaga ctatggcctc gcctccccac cctcctcttg gaattacaag 1441 ccctggggtt tgaagctgac tttatagctg caagtgtatc tccttttatc tggtgcctcc 1501 tcaaacccag tctcagacac taaatgcaga caacaccttc ctcctgcaga cacctggact 1561 gagccaagga ggcctgggga gggcctaggg gagcaccgtg atggagagga cagagcaggg 1621 gctccagcac cttctttctg gactggcgtt cacctccctg ctcagtgctt gggctccacg 1681 ggcaggggtc agagcactcc ctaatttatg tgctatataa atatgtcaga tgtacataga 1741 gatctatttt ttctaaaaca ttcccctccc cactcctctc ccacagagtg ctggactgtt 1801 ccaggccctc cagtgggctg atgctgggac ccttaggatg gggctcccag ctcctttctc 1861 ctgtgaatgg aggcagagac ctccaataaa gtgccttctg ggctttttct aaaaaaaaaa // LOCUS AF016370 2344 bp mRNA PRI 23-DEC-1997 DEFINITION Homo sapiens U4/U6 small nuclear ribonucleoprotein hPrp3 mRNA, complete cds. ACCESSION AF016370 NID g2708306 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2344) AUTHORS Horowitz,D.S., Kobayashi,R. and Krainer,A.R. TITLE A new cyclophilin and the human homologues of yeast Prp3 and Prp4 form a complex associated with U4/U6 snRNPs JOURNAL RNA 3 (12), 1374-1387 (1997) MEDLINE 98067393 REFERENCE 2 (bases 1 to 2344) AUTHORS Horowitz,D.S., Kobayashi,R. and Krainer,A.R. TITLE Direct Submission JOURNAL Submitted (29-JUL-1997) Biochemistry and Molecular Biology, Uniformed Services University of the Health Sciences, 4301 Jones Bridge Road, Bethesda, MD 20814, USA FEATURES Location/Qualifiers source 1..2344 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Ntera-2/RA" /cell_type="neuroepithelial" gene 1..2344 /gene="hPrp3" CDS 73..2124 /gene="hPrp3" /note="similar to Saccharomyces cerevisiae Prp3 (Ydr473cp protein) encoded by the sequence presented in the file with GenBank Accession Number U33050" /codon_start=1 /product="U4/U6 small nuclear ribonucleoprotein hPrp3" /db_xref="PID:g2708307" /translation="MALSKRELDELKPWIEKTVKRVLGFSEPTVVTAALNCVGKGMDK KKAADHLKPFLDDSTLRFVDKLFEAVEEGRSSRHSKSSSDRSRKRELKEVFGDDSEIS KESSGVKKRRIPRFEEVEEEPEVIPGPPSESPGMLTKLQIKQMMEAATRQIEERKKQL SFISPPTPQPKTPSSSQPERLPIGNTIQPSQAATFMNDAIEKARKAAELQARIQAQLA LKPGLIGNANMVGLANLHAMGIAPPKVELKDQTKPTPLILDEQGRTVDATGKEIELTH RMPTLKANIRAVKREQFKQQLKEKPSEDMESNTFFDPRVSIAPSQRQRRTFKFHDKGK FEKIAQRLRTKAQLEKLQAEISQAARKTGIHTSTRLALIAPKKELKEGDIPEIEWWDS YIIPNGFDLTEENPKREDYFGITNLVEHPAQLNPPVDNDTPVTLGVYLTKKEQKKLRR QTRREAQKELQEKVRLGLMPPPEPKVRISNLMRVLGTEAVQDPTKVEAHVRAQMAKRQ KAHEEANAARKLTAEQRKVKKIKKLKEDISQGVHISVYRVRNLSNPAKKFKIEANAGQ LYLTGVVVLHKDVNVVVVEGGPKAQKKFKRLMLHRIKWDEQTSNTKGDDDEESDEEAV KKTNKCVLVWEGTAKDRSFGEMKFKQCPTENMAREHFKKHGAEHYWDLALSESVLEST D" BASE COUNT 727 a 530 c 585 g 502 t ORIGIN 1 gtctcagggg ctgaagtttg tgaggtgtag tattgagtcc tgtttgagct attgttctct 61 ttttcctgaa aaatggcact gtcaaagagg gagctggatg agctgaaacc atggatagag 121 aagacagtga agagggtcct gggtttctca gagcctacgg tggtcacagc agcattgaac 181 tgtgtgggga agggcatgga caagaagaag gcagccgatc atctgaaacc ttttcttgat 241 gattctactc tccgatttgt ggacaaactg tttgaggctg tggaggaagg ccgaagctct 301 aggcattcca agtctagcag tgacaggagc agaaaacgag agctaaagga ggtgtttggt 361 gatgactctg agatctctaa agaatcatca ggagtaaaga agcgacgaat accccgtttt 421 gaggaggtgg aagaagagcc agaggtgatc cctgggcctc catcagagag ccctggcatg 481 ctgactaagc tccagatcaa acagatgatg gaggcagcaa cacgacaaat cgaggagagg 541 aaaaaacagc tgagcttcat tagcccccct acacctcagc caaagactcc ttcttcctcc 601 caaccagaac gacttcctat tggcaacact attcagccct cccaggctgc cactttcatg 661 aatgatgcca ttgagaaggc aaggaaagca gctgaactgc aagctcgaat ccaagcccag 721 ctggcactga agccaggact catcggcaat gccaacatgg tgggcctggc taatctccat 781 gccatgggca ttgctccccc gaaggtggag ttaaaagacc aaacgaaacc tacaccactg 841 atcctggatg agcaagggcg cactgtagat gcaacaggca aggagattga gctgacacac 901 cgcatgccta ctctgaaagc caatattcgt gctgtgaaga gggaacaatt caagcaacaa 961 ctaaaggaaa agccatcaga agacatggaa tccaatacct tttttgaccc ccgagtctcc 1021 attgcccctt cccagcgcca gagacgcact tttaaattcc atgacaaggg caaatttgag 1081 aagattgctc agcgattacg gacaaaggct caactggaga agctacaggc agagatttca 1141 caagcagctc gaaaaacagg catccatact tcgactaggc ttgccctcat tgctcctaag 1201 aaggagctaa aggaaggaga tattcctgaa attgagtggt gggactctta cataatcccc 1261 aatggctttg atcttacaga ggaaaatccc aagagagaag attattttgg aatcacaaat 1321 cttgttgaac atccagccca gctcaatcct ccagttgaca atgacacacc agttactctg 1381 ggagtatatc ttaccaagaa ggaacagaaa aaacttcgga gacaaacaag gagggaagca 1441 cagaaggaac tacaagaaaa agtcaggctg ggcctgatgc ctcctccaga acccaaagtg 1501 agaatttcta atttgatgcg agtattagga acagaagctg ttcaagaccc cacgaaggta 1561 gaagcccacg tcagagctca gatggcaaaa agacagaaag cgcatgaaga ggccaacgct 1621 gcccgaaaac tcacagcaga acagagaaag gtcaagaaaa ttaaaaagct taaagaagac 1681 atttcacagg gggtacacat atctgtatat agagttcgaa atttgagcaa cccagccaag 1741 aagttcaaga ttgaagccaa tgctgggcaa ctgtacctga caggggtggt ggtactgcac 1801 aaggatgtca acgtggtagt agtggaaggg ggccccaagg cccagaagaa atttaagcgt 1861 cttatgctgc atcggataaa gtgggatgaa cagacatcta acacaaaggg agatgatgat 1921 gaggagtctg atgaggaagc tgtgaagaaa accaacaaat gtgtactagt ctgggagggt 1981 acagccaaag accggagctt tggagagatg aagtttaaac agtgtcctac agagaacatg 2041 gctcgtgagc atttcaaaaa gcatggggct gaacactact gggaccttgc gctgagtgaa 2101 tctgtgttag agtccactga ttgagactac tgcaagccct tgcctctcct cccttgcctt 2161 tgtctcttca gtcctctcac ttattctatt tcccaacccc ctcccacttg tttgtgtgat 2221 ctcagaactg tgccaagcag acactgggac aaagggagaa tatcttgctc ccctcctgag 2281 tcagcctggt gttgcccttt attcccctta tgtgcatatg attaaagagt tatttttaaa 2341 aaaa // LOCUS AF016371 772 bp mRNA PRI 23-DEC-1997 DEFINITION Homo sapiens U-snRNP-associated cyclophilin (USA-CyP) mRNA, complete cds. ACCESSION AF016371 NID g2708308 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 772) AUTHORS Horowitz,D.S., Kobayashi,R. and Krainer,A.R. TITLE A new cyclophilin and the human homologues of yeast Prp3 and Prp4 form a complex associated with U4/U6 snRNPs JOURNAL RNA 3 (12), 1374-1387 (1997) MEDLINE 98067393 REFERENCE 2 (bases 1 to 772) AUTHORS Horowitz,D.S., Kobayashi,R. and Krainer,A.R. TITLE Direct Submission JOURNAL Submitted (29-JUL-1997) Biochemistry and Molecular Biology, Uniformed Services University of the Health Sciences, 4301 Jones Bridge Road, Bethesda, MD 20814, USA FEATURES Location/Qualifiers source 1..772 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" gene 1..772 /gene="USA-CyP" CDS 38..571 /gene="USA-CyP" /note="similar to Caenorhabditis elegans cyclophilin isoform 11 encoded by the sequence presented in the file with GenBank Accession Number U34955" /codon_start=1 /product="U-snRNP-associated cyclophilin" /db_xref="PID:g2708309" /translation="MAVANSSPVNPVVFFDVSIGGQEVGRMKIELFADVVPKTAENFR QFCTGEFRKDGVPIGYKGSTFHRVIKDFMIQGGDFVNGDGTGVASIYRGPFADENFKL RHSAPGLLSMANSGPSTNGCQFFITCSKCDWLDGKHVVFGKIIDGLLVMRKIENVPTG PNNKPKLPVVISQCGEM" BASE COUNT 196 a 160 c 207 g 209 t ORIGIN 1 agctcgtgcc gaattcggca cgagccgggt cggagccatg gcggtggcaa attcaagtcc 61 tgttaacccc gtggtgttct ttgatgtcag tattggcggt caggaagttg gccgcatgaa 121 gatcgagctc tttgcagacg ttgtgcctaa gacggccgag aactttaggc agttctgcac 181 cggagaattc aggaaagatg gggttccaat aggatacaaa ggaagcacct tccacagggt 241 cataaaggat ttcatgattc agggtggaga ttttgttaat ggagatggta ctggagtcgc 301 cagtatttac cgggggccat ttgcagatga aaattttaaa cttagacact cagctccagg 361 cctgctttcc atggcgaaca gtggtccaag tacaaatggc tgtcagttct ttatcacctg 421 ctctaagtgc gattggctgg atgggaagca tgtggtgttt ggaaaaatca tcgatggact 481 tctagtgatg agaaagattg agaatgttcc cacaggcccc aacaataagc ccaagctacc 541 tgtggtgatc tcgcagtgtg gggagatgta gtccagacaa agactgaatc aggccttccc 601 ttcttcttgg tggtgttctt gagtaagata atctggactg gcccccgtct ttgcttccct 661 gcctgctgct gccccatttg atcaagagac catggaagtg tcagagattc agaatccaag 721 attgtcttta agttttcaac tgtaaataaa gtttttttgt atgcgtaaaa aa // LOCUS AF016411 1215 bp mRNA PRI 24-DEC-1997 DEFINITION Homo sapiens potassium channel subunit KCNA3.1B (KCNA3B) mRNA, complete cds. ACCESSION AF016411 NID g2708513 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1215) AUTHORS Leicher,T. and Pongs,O. TITLE Direct Submission JOURNAL Submitted (30-JUL-1997) ZMNH, Institut fuer Neurale Signalverarbeitung, Martinistrasse 85, Hamburg 20246, Germany FEATURES Location/Qualifiers source 1..1215 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17" /map="17q13" gene 1..1215 /gene="KCNA3B" CDS 1..1215 /gene="KCNA3B" /note="potassium channel subunit" /codon_start=1 /product="KCNA3.1B" /db_xref="PID:g2708514" /translation="MQVSIACTEQNLRSRSSEDRLCGPRPGPGGGNGGPAGGGHGNPP GGGGSGPKARAAVVPGPPAPGGAVRESTGRGTGMKYRNLGKCGVRVSCLGLGTWVTFG SQISDETAEDVLTVAYEHGVNLFDTAEVYAAGKAERTLGNILKSKGWRRSSYVITTKI FWGGQAETERGLSRKHIIEGLRGSLERLQLGYVDIVFANRSDPNCPMEEIVRAMTYVI NQGLALYWGTSRWGAAEIMEAYSMARQFNLIPPVCEQAEHHLFQREKVEMQLPELYHK IGVGSVTWYPLACGLITSKYDGRVPDTCRASIKGYQWLKDKVQSEDGKKQQAKVMDLL PVAHQLGCTVAQLAIAWCLRSEGVSSVLLGVSSAEQLIEHLGALQVLSQLTPQTVMEI DGLLGNKPHSKK" BASE COUNT 279 a 307 c 391 g 238 t ORIGIN 1 atgcaggtgt ctatcgcgtg taccgagcag aaccttcgca gccggagcag tgaggaccgt 61 ctgtgtggac cccggccggg ccccggaggc ggtaatggtg ggccggccgg gggggggcac 121 gggaatcctc cggggggtgg agggtctggc cccaaggccc gagctgcagt ggttcccgga 181 cccccagcgc ccggtggggc cgtccgagag agcaccggcc gaggcactgg catgaaatac 241 aggaacctag ggaagtgtgg tgttcgggta tcctgtcttg gcctaggtac ctgggtcaca 301 tttggttctc agatctcaga tgagacagca gaggatgtgc tgactgtagc ctatgagcat 361 ggtgtaaacc tgtttgacac cgccgaagtg tacgcagcag gaaaggctga aagaacccta 421 gggaacatcc tcaagagcaa aggttggagg agatcaagct atgtcatcac taccaagatt 481 ttttggggag gacaggcaga aaccgagcga ggtttaagcc gaaagcacat cattgagggc 541 ttgcgaggat ccctggaacg cctccagctg ggatacgtgg acattgtctt tgccaatcgc 601 tcagacccca actgtcctat ggaggagatt gtgcgagcca tgacctatgt catcaaccag 661 ggcctggccc tatactgggg gacatcccga tggggggctg cagaaatcat ggaggcctac 721 tccatggcca gacagttcaa tctgattcct ccagtgtgtg aacaagcgga gcaccatctg 781 tttcagaggg agaaggtgga gatgcagctg ccagagctct accacaagat tggagttgga 841 tcagtcactt ggtaccctct agcctgtggt ctcattacta gcaagtatga tgggcgagtc 901 ccagatactt gcagggcctc catcaagggc taccagtggc tcaaggacaa agtgcagagt 961 gaagatggca agaagcaaca agccaaagtc atggaccttc ttcctgtcgc tcaccagctg 1021 ggctgcaccg tggcccagct tgctattgcg tggtgtctcc gcagtgaggg tgtcagctct 1081 gtcttgctgg gggtgtcgag tgcggagcag ttgatagaac acctgggcgc gctacaggtg 1141 ctgagccagc tgaccccgca gacagtgatg gaaatagacg ggctcctggg aaacaagccg 1201 cattccaaga agtag // LOCUS AF016509 954 bp mRNA PRI 21-AUG-1997 DEFINITION Homo sapiens oxidoreductase mRNA, complete cds. ACCESSION AF016509 NID g2338747 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 954) AUTHORS Biswas,M.G. and Russell,D.W. TITLE Expression cloning and characterization of oxidative 17beta- and 3alpha-hydroxysteroid dehydrogenases from rat and human prostate JOURNAL J. Biol. Chem. 272 (25), 15959-15966 (1997) MEDLINE 97332686 REFERENCE 2 (bases 1 to 954) AUTHORS Kedishvili,N.Y. TITLE Human oxidoreductase similar to retinol dehydrogenases and active with steroid substrates JOURNAL Unpublished REFERENCE 3 (bases 1 to 954) AUTHORS Kedishvili,N.Y. TITLE Direct Submission JOURNAL Submitted (30-JUL-1997) Biochemistry and Molecular Biology, Indiana University School of Medicine, 635 Barnhill Drive, MS461, Indianapolis, IN 46202, USA FEATURES Location/Qualifiers source 1..954 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="T69665, W74219, R35197" CDS 1..954 /note="NAD+ -dependent 3 alpha-hydroxysteroid dehydrogenase; similar to retinol dehydrogenases" /codon_start=1 /product="oxidoreductase" /db_xref="PID:g2338748" /translation="MWLYLAAFVGLYYLLHWYRERQVVSHLQDKYVFITGCDSGFGNL LARQLDARGLRVLAACLTEKGAEQLRGQTSDRLETVTLDVTKMESIAAATQWVKEHVG DRGLWGLVNNAGILTPITLCEWLNTEDSMNMLKVNLIGVIQVTLSMLPLVRRARGRIV NVSSILGRVAFFVGGYCVSKYGVEAFSDILRREIQHFGVKISIVEPGYFRTGMTNMTQ SLERMKQSWKEAPKHIKETYGQQYFDALYNIMKEGLLNCSTNLNLVTDCMEHALTSVH PRTRYSAGWDAKFFFIPLSYLPTSLADYILTRSWPKPAQAV" BASE COUNT 241 a 209 c 270 g 234 t ORIGIN 1 atgtggctct acctggcagc cttcgtgggc ctgtactacc ttctgcactg gtaccgggag 61 aggcaggtgg tgagccacct ccaagacaag tatgtcttta tcacgggctg tgactcgggc 121 tttgggaacc tactggccag acagctggat gcacgaggct tgagagtgct ggctgcgtgt 181 ctgacggaga agggggccga gcagctgagg ggccagacgt ctgacaggct ggagacggtg 241 accctggatg ttaccaagat ggagagcatc gctgcagcta ctcagtgggt gaaggagcat 301 gtgggggaca gaggactctg gggactggtg aacaatgcag gcattcttac accaattacc 361 ttatgtgagt ggctgaacac tgaggactct atgaatatgc tcaaagtgaa cctcattggt 421 gtgatccagg tgaccttgag catgcttcct ttggtgagga gagcacgggg aagaattgtc 481 aatgtctcca gcattctggg aagagttgct ttctttgtag gaggctactg tgtctccaag 541 tatggagtgg aagccttttc agatattctg aggcgtgaga ttcaacattt tggggtgaaa 601 atcagcatag ttgaacctgg ctacttcaga acgggaatga caaacatgac acagtcctta 661 gagcgaatga agcaaagttg gaaagaagcc cccaagcata ttaaggagac ctatggacag 721 cagtattttg atgcccttta caatatcatg aaggaagggc tgttgaattg tagcacaaac 781 ctgaacctgg tcactgactg catggaacat gctctgacat cggtgcatcc gcgaactcga 841 tattcagctg gctgggatgc taaatttttc ttcatccctc tatcttattt acctacatca 901 ctggcagact acattttgac tagatcttgg cccaaaccag cccaggcagt ctaa // LOCUS AF016582 1821 bp mRNA PRI 09-SEP-1997 DEFINITION Homo sapiens checkpoint kinase Chk1 (CHK1) mRNA, complete cds. ACCESSION AF016582 NID g2367668 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1821) AUTHORS Sanchez,Y., Wong,C., Thoma,R.S., Richman,R., Wu,Z., Piwnica-Worms,H. and Elledge,S.J. TITLE Conservation of the Chk1 checkpoint pathway in mammals: linkage of DNA damage to Cdk regulation through Cdc25 JOURNAL Science 277 (5331), 1497-1501 (1997) MEDLINE 97426625 REFERENCE 2 (bases 1 to 1821) AUTHORS Sanchez,Y. and Elledge,S.J. TITLE Direct Submission JOURNAL Submitted (30-JUL-1997) Biochemistry, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..1821 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11q24" /cell_type="B cell" gene 1..1821 /gene="CHK1" CDS 35..1465 /gene="CHK1" /note="similar to S.pombe CHK1 protein kinase; checkpoint kinase" /codon_start=1 /product="Chk1" /db_xref="PID:g2367669" /translation="MAVPFVEDWDLVQTLGEGAYGEVQLAVNRVTEEAVAVKIVDMKR AVDCPENIKKEICINKMLNHENVVKFYGHRREGNIQYLFLEYCSGGELFDRIEPDIGM PEPDAQRFFHQLMAGVVYLHGIGITHRDIKPENLLLDERDNLKISDFGLATVFRYNNR ERLLNKMCGTLPYVAPELLKRREFHAEPVDVWSCGIVLTAMLAGELPWDQPSDSCQEY SDWKEKKTYLNPWKKIDSAPLALLHKILVENPSARITIPDIKKDRWYNKPLKKGAKRP RVTSGGVSESPSGFSKHIQSNLDFSPVNSASSEENVKYSSSQPEPRTGLSLWDTSPSY IDKLVQGISFSQPTCPDHMLLNSQLLGTPGSSQNPWQRLVKRMTRFFTKLDADKSYQC LKETCEKLGYQWKKSCMNQVTISTTDRRNNKLIFKVNLLEMDDKILVDFRLSKGDGLE FKRHFLKIKGKLIDIVSSQKVWLPAT" BASE COUNT 558 a 344 c 406 g 513 t ORIGIN 1 ggccggacag tccgccgagg tgctcggtgg agtcatggca gtgccctttg tggaagactg 61 ggacttggtg caaaccctgg gagaaggtgc ctatggagaa gttcaacttg ctgtgaatag 121 agtaactgaa gaagcagtcg cagtgaagat tgtagatatg aagcgtgccg tagactgtcc 181 agaaaatatt aagaaagaga tctgtatcaa taaaatgcta aatcatgaaa atgtagtaaa 241 attctatggt cacaggagag aaggcaatat ccaatattta tttctggagt actgtagtgg 301 aggagagctt tttgacagaa tagagccaga cataggcatg cctgaaccag atgctcagag 361 attcttccat caactcatgg caggggtggt ttatctgcat ggtattggaa taactcacag 421 ggatattaaa ccagaaaatc ttctgttgga tgaaagggat aacctcaaaa tctcagactt 481 tggcttggca acagtatttc ggtataataa tcgtgagcgt ttgttgaaca agatgtgtgg 541 tactttacca tatgttgctc cagaacttct gaagagaaga gaatttcatg cagaaccagt 601 tgatgtttgg tcctgtggaa tagtacttac tgcaatgctc gctggagaat tgccatggga 661 ccaacccagt gacagctgtc aggagtattc tgactggaaa gaaaaaaaaa catacctcaa 721 cccttggaaa aaaatcgatt ctgctcctct agctctgctg cataaaatct tagttgagaa 781 tccatcagca agaattacca ttccagacat caaaaaagat agatggtaca acaaacccct 841 caagaaaggg gcaaaaaggc cccgagtcac ttcaggtggt gtgtcagagt ctcccagtgg 901 attttctaag cacattcaat ccaatttgga cttctctcca gtaaacagtg cttctagtga 961 agaaaatgtg aagtactcca gttctcagcc agaaccccgc acaggtcttt ccttatggga 1021 taccagcccc tcatacattg ataaattggt acaagggatc agcttttccc agcccacatg 1081 tcctgatcat atgcttttga atagtcagtt acttggcacc ccaggatcct cacagaaccc 1141 ctggcagcgg ttggtcaaaa gaatgacacg attctttacc aaattggatg cagacaaatc 1201 ttatcaatgc ctgaaagaga cttgtgagaa gttgggctat caatggaaga aaagttgtat 1261 gaatcaggtt actatatcaa caactgatag gagaaacaat aaactcattt tcaaagtgaa 1321 tttgttagaa atggatgata aaatattggt tgacttccgg ctttctaagg gtgatggatt 1381 ggagttcaag agacacttcc tgaagattaa agggaagctg attgatattg tgagcagcca 1441 gaaggtttgg cttcctgcca catgatcgga ccatcggctc tggggaatcc tggtgaatat 1501 agtgctgcta tgttgacatt attcttccta gagaagatta tcctgtcctg caaactgcaa 1561 atagtagttc ctgaagtgtt cacttccctg tttatccaaa catcttccaa tttattttgt 1621 ttgttcggca tacaaataat acctatatct taattgtaag caaaactttg gggaaaggat 1681 gaatagaatt catttgatta tttcttcatg tgtgtttagt atctgaattt gaaactcatc 1741 tggtggaaac caagtttcag gggacatgag ttttccagct tttatacaca cgtatctcat 1801 ttttatcaaa acattttgtt t // LOCUS AF016833 6513 bp mRNA PRI 03-FEB-1998 DEFINITION Homo sapiens maltase-glucoamylase mRNA, complete cds. ACCESSION AF016833 NID g2826520 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6513) AUTHORS Nichols,B.L., Eldering,J., Avery,S., Hahn,D., Quaroni,A. and Sterchi,E. TITLE Human small intestinal maltase-glucoamylase cDNA cloning. Homology To sucrase-isomaltase JOURNAL J. Biol. Chem. 273 (5), 3076-3081 (1998) MEDLINE 98112863 REFERENCE 2 (bases 1 to 6513) AUTHORS Nichols,B.L., Eldering,J.A., Avery,S.E., Hahn,D., Quaroni,A. and Sterchi,E.E. TITLE Direct Submission JOURNAL Submitted (31-JUL-1997) Pediatrics, Baylor College of Medicine, 1100 Bates, CNRC 10066, Houston, TX 77030-2600, USA FEATURES Location/Qualifiers source 1..6513 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" /tissue_type="small intestine" /cell_type="enterocyte" CDS 55..5628 /EC_number="3.2.1.20" /EC_number="3.2.1.3" /note="brush border hydrolase; glucosyl hydrolase; 1,4-O-alpha-D-glucanohydrolase; has similarity to sucrase-isomaltase" /codon_start=1 /product="maltase-glucoamylase" /db_xref="PID:g2826521" /translation="MARKKLKKFTTLEIVLSVLLLVLFIISIVLIVLLAKESLKSTAP DPGTTGTPDPGTTGTPDPGTTGTTHARTTGPPDPGTTGTTPVSAECPVVNELERINCI PDQPPTKATCDQRGCCWNPQGAVSVPWCYYSKNHSYHVEGNLVNTNAGFTARLKNLPS SPVFGSNVDNVLLTAEYQTSNRFHFKLTDQTNNRFEVPHEHVQSFSGNAAASLTYQVE ISRQPFSIKVTRRSNNRVLFDSSIGPLLFADQFLQLSTRLPSTNVYGLGEHVHQQYRH DMNWKTWPIFNRDTTPNGNGTNLYGAQTFFLCLEDASGLSFGVFLMNSNAMEVVLQPA PAITYRTIGGILDFYVFLGNTPEQVVQEYLELIGRPALPSYWALGFHLSRYEYGTLDN MREVVERNRAAQLPYDVQHADIDYMDERRDFTYDSVDFKGFPEFVNELHNNGQKLVII VDPAISNNSSSSKPYGPYDRGSDMKIWVNSSDGVTPLIGEVWPGQTVFPDYTNPNCAV WWTKEFELFHNQVEFDGIWIDMNEVSNFVDGSVSGCSTNNLNNPPFTPRILDGYLFCK TLCMDAVQHWGKQYDIHNLYGYSMAVATAEAAKTVFPNKRSFILTRSTFAGSGKFAAH WLGDNTATWDDLRWSIPGVLEFNLFGIPMVGPDICGFALDTPEELCRRWMQLGAFYPF SRNHNGQGYKDQDPASFGADSLLLNSSRHYLNIRYTLLPYLYTLFFRAHSRGDTVARP LLHEFYEDNSTWDVHQQFLWGPGLLITPVLDEGAEKAMAYVPDAVWYDYETGSQVRWR KQKVEMELPGDKIGLHLRGGYIFPTQQPNTTTLASRKNPLGLIIALDENKEAKGELFW DDGETKDTVANKVYLLCEFSVTQNRLEVNISQSTYKDPNNLAFNEIKILGTEEPSNVT VKHNGVPSQTSPTVTYDSNLKVAIITDIDLLLGEAYTVEWSIKIRDEEKIDCYPDENG ASAENCTARGCIWEASNSSGVPFCYFVNDLYSVSDVQYNSHGATADISLKSSVYANAF PSTPVNPLRLDVTYHKNEMPQFKIYDPNKNRYEVPVPLNIPSMPSSTPEGQLYDVLIK KNPFGIEIRRKSIGTIIWDSQLLGFTFSDMFIRISTRLPSKYLYGFGETEHRSYRRDL EWHTWGMFSRDQPPGYKKNSYGVHPYYMGLEEDGSAHGVLLLNSNAMDVTFQPLPALT YRTTGGVLDFYVFLGPTPELVTQQYTELIGRPVMVPYWSLGFQLCRYGYQNDSEIASL YDEMVAAQIPYDVQYSDIDYMERQLDFTLSPKFAGFPALINRMKADGMRVILILDPAI SGNETQPYPAFTRGVEDDVFIKYPNDGDIVWGKVWPDFPDVVVNGSLDWDSQVELYRA YVAFPDFFRNSTAKWWKREIEELYNNPQNPERSLKFDGMWIDMNEPSSFVNGAVSPGC RDASLNHPPYMPHLESRDRGLSSKTLCMESQQILPDGSLVQHYNVHNLYGWSQTRPTY EAVQEVTGQRGVVITRSTFPSSGRWAGHWLGDNTAAWDQLKKSIIGMTEFSLFGISYT GADICGFFQDAEYEMCVRWMQLGAFYPFSRNHNTIGTRRQDPVSWDVAFVNISRTVLQ TRCTLLPYLYTLMHKAHTEGVTVVRPLLHEFVSDQVTWDIDSQFLLGPAFLVSPVLER NARNVTAYFPRARWYDYYTGVDINARGEWKTLPAPLDHINLHVRGGYILPWQEPALNT HLSRQKFMGFKIALDDEGTAGGWLFWDDGQSIDTYGKGLYYLASFSASQNTMQSHIIF NNYITGTNPLKLGYIEIWGVGSVPVTSASISVSGMVITPSFNNDPTTQVLSIDVTDRN ISLHNFTSLTWISTL" misc_feature 103..165 /note="encodes membrane anchor region" misc_feature 166..306 /note="encodes S-T rich stalk" misc_feature 322..444 /note="encodes trefoil domain 1" misc_feature 1633..1650 /note="encodes catalytic site 1" misc_feature 2950..3042 /note="encodes trefoil domain 2" misc_feature 4306..4323 /note="encodes catalytic site 2" BASE COUNT 1747 a 1508 c 1562 g 1696 t ORIGIN 1 attgctaagc catccttcag acagagaggg agcggctgca agaggtaatg agagatggca 61 agaaagaagc tgaaaaaatt tactactttg gagattgtgc tcagtgttct tctgcttgtg 121 ttgtttatca tcagtattgt tctaattgtg cttttagcca aagagtcact gaaatcaaca 181 gccccagatc ctgggacaac tggtaccccg gatcctggga caactggtac cccagatcct 241 ggaacaactg gtaccacaca tgctaggaca acgggtcccc cagatcctgg aacaactggt 301 accactcctg tttctgctga atgtccagtg gtaaatgaat tggaacgaat taattgcatc 361 cctgaccagc cgccaacaaa ggccacatgt gaccaacgtg gctgttgctg gaatccccag 421 ggagctgtaa gtgttccctg gtgctactat tccaagaatc atagctacca tgtagagggc 481 aaccttgtca acacaaatgc aggattcaca gcccggttga aaaatctgcc ttcctcacca 541 gtgtttggaa gcaatgttga caatgttctt ctcacagcag aatatcagac atctaatcgt 601 ttccacttta agttgactga ccaaaccaat aacaggtttg aagtgcccca cgaacacgtg 661 cagtccttca gtggaaatgc tgctgcttct ttgacctacc aagttgaaat ctccagacag 721 ccatttagca tcaaagtgac cagaagaagc aacaatcgtg ttttgtttga ctcgagcatt 781 gggcccctac tgtttgctga ccagttcttg cagctctcca ctcgactgcc tagcactaac 841 gtgtatggcc tgggagagca tgtgcaccag cagtatcggc atgatatgaa ttggaagacc 901 tggcccatat ttaacagaga cacaactccc aatggaaacg gaactaattt gtatggtgcg 961 cagacattct tcttgtgcct tgaagatgct agtggattgt cctttggggt gtttctgatg 1021 aacagcaatg ccatggaggt tgtccttcag cctgcgccag ccatcactta ccgcaccatt 1081 gggggcattc tcgacttcta tgtgttcttg ggaaacactc cagagcaagt tgttcaagaa 1141 tatctagagc tcattgggcg gccagccctt ccctcctact gggcgcttgg atttcacctc 1201 agtcgttacg aatatggaac cttagacaac atgagggaag tcgtggagag aaatcgcgca 1261 gcacagctcc cttatgatgt tcagcatgct gatattgatt atatggatga gagaagggac 1321 ttcacttatg attcagtgga ttttaaaggc ttccctgaat ttgtcaacga gttacacaat 1381 aatggacaga agcttgtcat cattgtggat ccagccatct ccaacaactc ttcctcaagt 1441 aaaccctatg gcccatatga caggggttcg gatatgaaga tatgggtgaa tagttcagat 1501 ggagtgactc cactcattgg ggaggtctgg cctggacaaa ctgtgtttcc tgattatacc 1561 aatcccaact gtgctgtttg gtggacaaag gaatttgagc tttttcacaa tcaagtagag 1621 tttgatggaa tctggattga tatgaatgaa gtctccaact ttgttgatgg ttcggtctca 1681 ggatgttcca caaacaacct aaataatccc ccattcactc ccagaatcct ggatgggtac 1741 ctgttctgca agactctctg tatggatgca gtgcagcact ggggcaagca gtatgacatt 1801 cacaatctgt atggctactc catggcggtc gccacagcag aagctgccaa gactgtgttc 1861 cctaataaga gaagcttcat tctgacccgt tctacctttg cgggctctgg caagtttgca 1921 gcacattggt taggagacaa cactgccacc tgggatgacc tgagatggtc catccctggc 1981 gtgcttgagt tcaacctttt tggcatccca atggtgggtc ctgacatatg tggctttgct 2041 ttggacaccc ctgaggagct ctgtaggcgg tggatgcagt tgggtgcatt ttatccgttt 2101 tctagaaatc acaatggcca aggctacaag gaccaggatc ctgcctcctt tggagctgac 2161 tccctgctgt tgaattcctc caggcactac cttaacatcc gctatactct attgccctac 2221 ctatacaccc ttttcttccg tgctcacagc cgaggggaca cggtggccag gccccttttg 2281 catgagttct acgaggacaa cagcacttgg gatgtgcacc aacagttctt atgggggccc 2341 ggcctcctca tcactccagt tctggatgaa ggtgcagaga aagcgatggc atatgtgcct 2401 gatgctgtct ggtatgacta cgagactggg agccaagtga gatggaggaa gcaaaaagtc 2461 gagatggaac ttcctggaga caaaattgga cttcaccttc gaggaggcta catcttcccc 2521 acacagcagc caaatacaac cactctggcc agtcgaaaga accctcttgg tcttatcatt 2581 gccctagatg agaacaaaga agcaaaagga gaacttttct gggatgatgg ggaaacgaag 2641 gatactgtgg ccaataaagt gtatctttta tgtgagtttt ctgtcactca aaaccgcttg 2701 gaggtgaata tttcacaatc aacctacaag gaccccaata atttagcatt taatgagatt 2761 aaaattcttg ggacggagga acctagcaat gttacagtga aacacaatgg tgtcccaagt 2821 cagacttctc ctacagtcac ttatgattct aacctgaagg ttgccattat cacagatatt 2881 gatcttctcc tgggagaagc atacacagtg gaatggagca taaagataag ggatgaagaa 2941 aaaatagact gttaccctga tgagaatggt gcttctgccg aaaactgcac tgcccgtggc 3001 tgtatctggg aggcatccaa ttcttctgga gtcccttttt gctattttgt caacgaccta 3061 tactctgtca gtgatgttca gtataattcc catggggcca cagctgacat ctccttaaag 3121 tcttccgttt atgccaatgc cttcccctcc acacccgtga acccccttcg cctggatgtc 3181 acttaccata agaatgaaat gccgcagttc aagatttatg atcccaacaa gaatcggtat 3241 gaagttccag tccctctgaa catacccagc atgccatcca gcacccctga gggtcaactc 3301 tatgatgtgc tcattaagaa gaatccattt gggattgaaa ttcgccggaa gagtataggc 3361 actataattt gggactctca gctccttggc tttaccttca gtgacatgtt tatccgcatc 3421 tccacccgcc ttccctccaa gtacctctat ggcttcgggg aaactgagca caggtcctat 3481 aggagagact tggagtggca cacttggggg atgttctccc gagaccagcc cccagggtac 3541 aagaagaatt cctatggtgt ccacccctac tacatggggc tggaggagga cggcagtgcc 3601 catggagtgc tcctgctgaa cagcaatgcc atggatgtga cgttccagcc cctgcctgcc 3661 ttgacatacc gcaccacagg gggagttctg gacttttatg tgttcttggg gccgactcca 3721 gagcttgtca cccagcagta cactgagttg attggccggc ctgtgatggt accttactgg 3781 tctttggggt tccagctgtg tcgctatggc taccagaatg actctgagat cgccagcttg 3841 tatgatgaga tggtggctgc ccagatccct tatgatgtgc agtactcaga catcgactac 3901 atggagcggc agctggactt caccctcagc cccaagtttg ctgggtttcc agctctgatc 3961 aatcgcatga aggctgatgg gatgcgggtc atcctcattc tggatccagc catttctggc 4021 aatgagacac agccttatcc tgccttcact cggggcgtgg aggatgacgt cttcatcaaa 4081 tacccaaatg atggagacat tgtctgggga aaggtctggc ctgattttcc tgatgttgtt 4141 gtgaatgggt ctctagactg ggacagccaa gtggagctat atcgagctta tgtggccttc 4201 ccagactttt tccgtaattc aactgccaag tggtggaaga gggaaataga agaactatac 4261 aacaatccac agaatccaga gaggagcttg aagtttgatg gcatgtggat tgatatgaat 4321 gaaccatcaa gcttcgtgaa tggggcagtt tctccaggct gcagggacgc ctctctgaac 4381 caccctccct acatgccaca tttggagtcc agggacaggg gcctgagcag caagaccctt 4441 tgtatggaga gtcagcagat cctcccagac ggctccctgg tgcagcacta caacgtgcac 4501 aacctgtatg ggtggtccca gaccagaccc acatacgaag ccgtgcagga ggtgacggga 4561 cagcgagggg tcgtcatcac ccgctccaca tttccctctt ctggccgctg ggcaggacat 4621 tggctgggag acaacacggc cgcatgggat cagctgaaga agtctatcat tggcatgacg 4681 gagttcagcc tcttcggcat atcctatacg ggagcagata tctgtgggtt ctttcaagat 4741 gctgaatatg agatgtgtgt tcgctggatg cagctggggg ccttttaccc cttctcaaga 4801 aaccacaaca ccattgggac caggagacaa gaccctgtgt cctgggatgt tgcttttgtg 4861 aatatttcca gaactgtcct gcagaccaga tgcaccctgt tgccatatct gtataccttg 4921 atgcataagg cccacacgga gggcgtcact gttgtgcggc ctctgctcca tgaatttgtg 4981 tcagaccagg tgacatggga catagacagt cagttcctgc tgggcccagc cttcctggtc 5041 agccctgtcc tggagcgtaa tgccagaaat gtcactgcat atttccctag agcccgctgg 5101 tatgattact acacgggtgt ggatattaat gcaagaggag agtggaagac cttgccagcc 5161 cctcttgacc acattaatct tcatgtccgt gggggctaca tcctgccctg gcaagagcct 5221 gcactgaaca cccacttaag ccgccagaaa ttcatgggct tcaaaattgc cttggatgat 5281 gaaggaactg ctgggggctg gctcttctgg gatgatgggc aaagcattga tacctatggg 5341 aaaggactct attacttggc cagcttttct gccagccaga atacgatgca aagccatata 5401 attttcaaca attacatcac tggtacaaat cctttgaaac tgggctacat tgaaatctgg 5461 ggagtgggca gtgtccccgt taccagtgcc agcatctctg tgagtggcat ggtcataaca 5521 ccctccttca acaatgaccc cacgacacag gtattaagca tcgatgtgac tgacagaaac 5581 atcagcctac ataattttac ttcattgacg tggataagca ctctgtgaat ttttacagca 5641 agattctaac taactatgaa tgactttgaa actacttata cttcatactc ataaaaatta 5701 ttgtgtgttg ctaatttgtt catacccact attggtgaaa tatttctgtt aattttgtta 5761 tatgtttttt gtgtgaaccc taaaggttaa accttagccc tgtgggatag gcagttaggg 5821 aggtgtggaa aatctatgca ttaccttaat gtctctgtgt ggttagtatg gtagtgactg 5881 ttcatcatat gacatttact gaagatgaac tgggtccatg atgaagtgtg tgtatgtcca 5941 cgtttgtaat catagaatgg accccattct tttgttaaat acacaagaga aagctttctg 6001 tgacagttcc aggtcttgaa gctaatcagc atctcaagaa agtatccaga aagaacatct 6061 gctagttggt tataggcggt gggaggaata atatacctaa ttggttatag gtggggggag 6121 catgataagc aaagaaaagg caaacacaag gaaagatcag atgaaacaga agatgatagt 6181 aaaagtgatc ctaagtaaga acataatgta aaattgtcag cagcctcatg gggaggaaaa 6241 aggaagagtc aactcacttg aagaagaggg tcttgagaaa tccttagcat aaagggctac 6301 tggtgagatt gagatctgag caggcaaagc tcaaaagaga gtttggaggt taaaaataat 6361 ttatttttgc agtagtgtgc tttgaaatgt gtaaatctta tttctaatgt atacaaccac 6421 atttcacata aaaatatgca atttatatgc cagataaaaa taaaacaagt gaatttgcaa 6481 gtgaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa // LOCUS AF016917 1359 bp mRNA PRI 11-SEP-1997 DEFINITION Homo sapiens GABA-A receptor delta subunit (GABRD) mRNA, complete cds. ACCESSION AF016917 NID g2388692 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1359) AUTHORS Day,T.M., Hartnett,C., Blankenbiller,K. and Ramabhadran,T.V. TITLE Direct Submission JOURNAL Submitted (01-AUG-1997) Molecular Biology, Neurogen Corporation, 35 NE Industrial Road, Branford, CT 06405, USA FEATURES Location/Qualifiers source 1..1359 /organism="Homo sapiens" /db_xref="taxon:9606" /note="includes WashU-Merck EST Project, EST 175536, GenBank Accession Number H41122" gene 1..1359 /gene="GABRD" CDS 1..1359 /gene="GABRD" /note="ion channel subunit" /codon_start=1 /product="GABA-A receptor delta subunit" /db_xref="PID:g2388693" /translation="MDAPARLLAPLLLLCAQQLRGTRAMNDIGDYVGSNLEISWLPNL DGLIAGYARNFRPGIGGPPVNVALALEVASIDHISEANMEYTMTVFLHQSWRDSRLSY NHTNETLGLDSRFVDKLWLPDTFIVNAKSAWFHDVTVENKLIRLQPDGVILYSIRITS TVACDMDLAKYPMDEQECMLDLESYGYSSEDIVYYWSESQEHIHGLDKLQLAQFTITS YHFTTELMNFKSAGQFPRLSLHFHLRRNRGVYIIQSYMPSVLLVAMSWVSFWISQAAV PARVSLGITTVLTMTTLMVSARSSLPRASAIKALDVYFWICYVFVFAALVEYAFAHFN ADYRKKQKAKVKVSRPRAEMDVRNAIVLFSLSAAGVTQELAISRRQRRVPGNLMGSYR SVGVETGETKKEGAARSGGQGGIRARLRPIDADTIDIYARAVFPAAFAAVNVIYWAAY AM" BASE COUNT 255 a 456 c 404 g 244 t ORIGIN 1 atggacgcgc ccgcccggct gctggccccg ctcctgctcc tctgcgcgca gcagctccgc 61 ggcaccagag cgatgaatga catcggcgac tacgtgggct ccaacctgga gatctcctgg 121 ctccccaacc tggacgggct gatagccggc tacgcccgca acttccggcc tggcatcgga 181 ggcccccccg tgaatgtggc ccttgccctg gaggtggcca gcatcgacca catctcagag 241 gccaacatgg agtacaccat gacggtgttc ctgcaccaga gctggcggga cagcaggctc 301 tcctacaacc acaccaacga gaccctgggc ctggacagcc gcttcgtgga caagctgtgg 361 ctgcccgaca ccttcatcgt gaacgccaag tcggcctggt tccacgacgt gacggtggag 421 aacaagctca tccggctgca gcccgacggc gtgatcctgt acagcatccg aatcacctcc 481 actgtggcct gcgacatgga cctggccaaa taccccatgg acgagcagga gtgcatgctg 541 gacctggaga gctacggtta ctcatcggag gacatcgtct actactggtc ggagagccag 601 gagcacatcc acgggctgga caagctgcag ctggcgcagt tcaccatcac cagctaccac 661 ttcaccacgg agctgatgaa cttcaagtcc gctggccagt tcccacggct cagcctgcac 721 ttccacctgc ggaggaaccg cggcgtgtac atcatccaat cctacatgcc ctccgtcctg 781 ctggtcgcca tgtcctgggt ctccttctgg atcagccagg cggcggtgcc cgccagggtg 841 tctctaggca tcaccacggt gctgacgatg accacgctca tggtcagtgc ccgctcctcc 901 ctgccacggg catcagccat caaggcactg gacgtctact tctggatctg ctatgtcttc 961 gtgtttgccg ccctggtgga gtacgccttt gctcatttca acgccgacta caggaagaag 1021 cagaaggcca aggtcaaggt ctccaggccg agggcagaga tggacgtgag gaacgccatt 1081 gtcctcttct ccctctctgc tgccggcgtc acgcaggagc tggccatctc ccgccggcag 1141 cgccgcgtcc cggggaacct gatgggctcc tacaggtcgg tgggggtgga gacaggggag 1201 acgaagaagg agggggcagc ccgctcagga ggccaggggg gcatccgtgc ccggctcagg 1261 cccatcgacg cagacaccat tgacatttac gcccgcgctg tgttccctgc ggcgtttgcg 1321 gccgtcaatg tcatctactg ggcggcatac gccatgtga // LOCUS AF017061 2461 bp mRNA PRI 16-SEP-1997 DEFINITION Homo sapiens vasopressin-activated calcium mobilizing putative receptor protein (VACM-1) mRNA, complete cds. ACCESSION AF017061 NID g2394273 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2461) AUTHORS Longo,K.A., North,W.G., Du,J. and Fay,M.J. TITLE Direct Submission JOURNAL Submitted (02-AUG-1997) Department of Physiology, Dartmouth Medical School, 1 Medical Center Drive, Lebanon, NH 03755, USA FEATURES Location/Qualifiers source 1..2461 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="NCI-H146" gene 1..2461 /gene="VACM-1" CDS 1..2346 /gene="VACM-1" /note="HSVACM1" /codon_start=1 /product="vasopressin-activated calcium mobilizing putative receptor protein" /db_xref="PID:g2394274" /translation="MATSNLLKDKGFLQFGDKWDFMRPIVLKLLRRDFVTKRQWFDLF SDVHAFCFWDDKGPAKIHQALKEDFILEFIKQAQARVLSHQDDTALLKAYIVEWRKFF TQCDILPKPFCQLEITLMGKQGSNKKSNVEDSIVRKLMLDTWNESIFSNIKNRLQDSA MKLVHAERLGEAFDSQLVIGVRESYVNLCSNPEDKLQIYRDNFEKAYLDSTERFYRTQ APSYLQQNGVQNYMKYADAKLKEEEKRALRYLETRRECNSVEALMECCVNALVTSFKE TILAECQGMIKRNETEKLHLMFSLMDKVPNGIEPMLKDLEEHIISAGLADMVAAAETI TTDSEKYVEQLLTLFNRFSKLVKEAFQDDPRFLTARDKAYKAVVNDATIFKLELPLKQ KGVGLKTQPESKCPELLANYCDMLLRKTPLSKKLTSEEIEAKLKEVLLVLKYVQNKDV FMRYHKAHLTRRLILDISADSEIEENMVEWLREVGMPADYVNKLARMFQDIKVSEDLN QAFKEMHKNNKLALPADSVNIKILNAGAWSRSSEKVFVSLPTELEDLIPEVEEFYKKN HSGRKLHWHHLMSNGIITFKNEVGQYDLEVTTFQLAVLFAWNQRPREKISFENLKLAT ELPDAELRRTLWSLVAFPKLKRQVLLYEPQVNSPKDFTEGTLFSVNQEFSLIKNAKVQ KRGKINLIGRLQLTTERMREEENEGIVQLRILRTQEAIIQIMKMRKKISNAQLQTELV EILKNMFLPQKKMIKEQIEWLIEHKYIRRDESDINTFIYMA" 3'UTR 2347..>2461 /gene="VACM-1" BASE COUNT 877 a 378 c 512 g 694 t ORIGIN 1 atggcgacgt ctaatctgtt aaaggataaa ggttttcttc agtttggaga caaatgggat 61 tttatgcgcc cgattgtttt gaagctttta cgccgggatt ttgttacaaa acggcagtgg 121 tttgatctgt tttcggatgt gcatgcattc tgtttttggg atgataaagg cccagcaaaa 181 attcatcagg ctttaaagga agattttatt cttgagttta ttaagcaagc acaggcacga 241 gtactgagcc atcaagatga tacggctttg ctaaaagcat atattgttga atggcgaaag 301 ttctttacac aatgtgatat tttaccaaaa cctttttgtc aactagagat tactttaatg 361 ggtaaacagg gcagcaataa aaaatcaaat gtggaagaca gtattgttcg aaagcttatg 421 cttgatacat ggaatgagtc aatcttttca aacataaaaa acagactcca agatagtgca 481 atgaagctgg tacatgctga gagattggga gaagcttttg attctcagct ggttattgga 541 gtaagagaat cctatgttaa cctttgttct aatcctgagg ataaacttca aatttatagg 601 gacaattttg agaaggcata cttggattca acagagagat tttatagaac acaagcaccc 661 tcgtatttac aacaaaatgg tgtacagaat tatatgaaat atgcagatgc taaattaaaa 721 gaagaagaaa aacgagcact acgttattta gaaacaagac gagaatgtaa ctccgttgaa 781 gcactcatgg aatgctgtgt aaatgccctg gtgacatcat ttaaagagac tatcttagct 841 gagtgccaag gcatgatcaa gagaaatgaa actgaaaaat tacatttaat gttttcattg 901 atggacaaag ttcctaatgg tatagagcca atgttgaaag acttggagga acatatcatt 961 agtgctggcc tggcagatat ggtagcagct gctgaaacta ttactactga ctctgagaaa 1021 tacgttgagc agttacttac actatttaat agatttagta aactcgtcaa agaagctttt 1081 caagatgatc cacgatttct tactgcaaga gataaggcgt ataaagcagt tgttaatgat 1141 gctaccatat ttaaacttga attacctttg aagcagaagg gggtgggatt aaaaactcag 1201 cctgaatcaa aatgccctga gctgcttgcc aattactgtg acatgttgct aagaaaaaca 1261 ccattaagca aaaaactaac ctctgaagag attgaagcaa agcttaaaga agtgctcttg 1321 gtacttaagt atgtacagaa caaagatgtt tttatgaggt atcataaagc tcatttgaca 1381 cgacgtctta tattagacat ctctgccgat agtgaaattg aagaaaacat ggtagagtgg 1441 ctaagagaag ttggtatgcc agcggattat gtaaacaagc ttgctagaat gtttcaggac 1501 ataaaagtat ctgaagattt gaaccaagct tttaaggaaa tgcacaaaaa taataaattg 1561 gcattaccag ctgattcagt taatataaaa attctgaatg ctggcgcctg gtcaagaagt 1621 tctgagaaag tctttgtctc acttcctact gaactggagg acttgatacc ggaagtagaa 1681 gaattctaca aaaaaaatca tagtggtaga aaattacatt ggcatcatct catgtcaaat 1741 ggaattataa catttaagaa tgaagttggt caatatgatt tggaggtaac cacgtttcag 1801 ctcgctgtat tgtttgcatg gaaccaaaga cccagagaga aaatcagctt tgaaaatctt 1861 aagcttgcaa ctgaactccc tgatgctgaa cttaggagga ctttatggtc tttagtagct 1921 ttcccaaaac tcaaacggca agttttgttg tatgaacctc aagtcaactc acccaaagac 1981 tttacagaag gtaccctctt ctcagtgaac caggagttca gtttaataaa aaatgcaaag 2041 gttcagaaaa ggggtaaaat caacttgatt ggacgtttgc agctcactac agaaaggatg 2101 agagaagaag agaatgaagg aatagttcaa ctacgaatac taagaaccca ggaagctatc 2161 atacaaataa tgaaaatgag aaagaaaatt agtaatgctc agctgcagac tgaattagta 2221 gaaattttga aaaacatgtt cttgccacaa aagaaaatga taaaagagca aatagagtgg 2281 ctaatagagc acaaatacat cagaagagat gaatctgata tcaacacttt catatatatg 2341 gcataatttt gaatatcatg gacaatattt agaacccaaa ttttggagtg cttgggcaga 2401 aagttgtaaa gtttgtgctg gagaaaggtt tatttggact ttgattacat aaatattaat 2461 a // LOCUS AF017262 1864 bp mRNA PRI 11-SEP-1997 DEFINITION Homo sapiens putative G protein-coupled receptor mRNA, complete cds. ACCESSION AF017262 NID g2388705 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1864) AUTHORS Donohue,P.J., Shapira,H., Mantey,S.A., Hampton,L.L., Jensen,R.T. and Battey,J.F. TITLE A human gene encodes a putative G protein-coupled receptor highly expressed in the central nervous system JOURNAL Unpublished REFERENCE 2 (bases 1 to 1864) AUTHORS Donohue,P.J., Shapira,H., Mantey,S.A., Hampton,L.L., Jensen,R.T. and Battey,J.F. TITLE Direct Submission JOURNAL Submitted (04-AUG-1997) Laboratory of Molecular Biology, NIDCD/NIH, 5 Research Court, Rockville, MD 20850, USA FEATURES Location/Qualifiers source 1..1864 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 18..1859 /codon_start=1 /product="putative G protein-coupled receptor" /db_xref="PID:g2388706" /translation="MRAPGALLARMSRLLLLLLLKVSASSALGVAPASRNETCLGESC APTVIQRRGRDAWGPGNSARDVLRARAPREEQGAAFLAGPSWDLPAAPGRDPAAGRGA EASAAGPPGPPTRPPGPWRWKGARGQEPSETLGRGNPTALQLFLQISEEEEKGPRGAG ISGRSQEQSVKTVPGASDLFYWPRRAGKLQGSHHKPLSKTANGLAGHEGWTIALPGRA LAQNGSLGEGIHEPGGPRRGNSTNRRVRLKNPFYPLTQESYGAYAVMCLSVVIFGTGI IGNLAVMSIVCHNYYMRSISNSLLANLAFWDFLIIFFCLPLVIFHELTKKWLLEDFSC KIVPYIEVASLGVTTFTLCALCIDRFRAATNVQMYYEMIENCSSTTAKLAVIWVGALL LALPEVVLRQLSKEDLGFSGRAPAERCIIKISPDLPDTIYVLALTYDSARLWWYFGCY FCLPTLFTITCSLVTARKIRKAEKACTRGNKRQIQLESQMNCTVVALTILYGFCIIPE NICNIVTAYMATGVSQQTMDLLNIISQFLLFFKSCVTPVLLFCLCKPFSRAFMECCCC CCEECIQKSSTVTSDDNDNEYTTELELSPFSTIRREMSTFASVGTHC" BASE COUNT 391 a 534 c 515 g 424 t ORIGIN 1 tgtgccctca ccaagccatg cgagccccgg gcgcgcttct cgcccgcatg tcgcggctac 61 tgcttctgct actgctcaag gtgtctgcct cttctgccct cggggtcgcc cctgcgtcca 121 gaaacgaaac ttgtctgggg gagagctgtg cacctacagt gatccagcgc cgcggcaggg 181 acgcctgggg accgggaaat tctgcaagag acgttctgcg agcccgagca cccagggagg 241 agcagggggc agcgtttctt gcgggaccct cctgggacct gccggcggcc ccgggccgtg 301 acccggctgc aggcagaggg gcggaggcgt cggcagccgg acccccggga cctccaacca 361 ggccacctgg cccctggagg tggaaaggtg ctcggggtca ggagccttct gaaactttgg 421 ggagagggaa ccccacggcc ctccagctct tccttcagat ctcagaggag gaagagaagg 481 gtcccagagg cgctggcatt tccgggcgta gccaggagca gagtgtgaag acagtccccg 541 gagccagcga tcttttttac tggccaagga gagccgggaa actccagggt tcccaccaca 601 agcccctgtc caagacggcc aatggactgg cggggcacga agggtggaca attgcactcc 661 cgggccgggc gctggcccag aatggatcct tgggtgaagg aatccatgag cctgggggtc 721 cccgccgggg aaacagcacg aaccggcgtg tgagactgaa gaaccccttc tacccgctga 781 cccaggagtc ctatggagcc tacgcggtca tgtgtctgtc cgtggtgatc ttcgggaccg 841 gcatcattgg caacctggcg gtgatgagca tcgtgtgcca caactactac atgcggagca 901 tctccaactc cctcttggcc aacctggcct tctgggactt tctcatcatc ttcttctgcc 961 ttccgctggt catcttccac gagctgacca agaagtggct gctggaggac ttctcctgca 1021 agatcgtgcc ctatatagag gtcgcttctc tgggagtcac cactttcacc ttatgtgctc 1081 tgtgcataga ccgcttccgt gctgccacca acgtacagat gtactacgaa atgatcgaaa 1141 actgttcctc aacaactgcc aaacttgctg ttatatgggt gggagctcta ttgttagcac 1201 ttccagaagt tgttctccgc cagctgagca aggaggattt ggggtttagt ggccgagctc 1261 cggcagaaag gtgcattatt aagatctctc ctgatttacc agacaccatc tatgttctag 1321 ccctcaccta cgacagtgcg agactgtggt ggtattttgg ctgttacttt tgtttgccca 1381 cgcttttcac catcacctgc tctctagtga ctgcgaggaa aatccgcaaa gcagagaaag 1441 cctgtacccg agggaataaa cggcagattc aactagagag tcagatgaac tgtacagtag 1501 tggcactgac cattttatat ggattttgca ttattcctga aaatatctgc aacattgtta 1561 ctgcctacat ggctacaggg gtttcacagc agacaatgga cctccttaat atcatcagcc 1621 agttcctttt gttctttaag tcctgtgtca ccccagtcct ccttttctgt ctctgcaaac 1681 ccttcagtcg ggccttcatg gagtgctgct gctgttgctg tgaggaatgc attcagaagt 1741 cttcaacggt gaccagtgat gacaatgaca acgagtacac cacggaactc gaactctcgc 1801 ctttcagtac catacgccgt gaaatgtcca cttttgcttc tgtcggaact cattgctgaa 1861 ggac // LOCUS AF017305 3057 bp mRNA PRI 28-JAN-1998 DEFINITION Homo sapiens deubiquitinating enzyme UnpEL (UNP) mRNA, complete cds. ACCESSION AF017305 NID g2656140 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3057) AUTHORS Gray,D.A., Inazawa,J., Gupta,K., Wong,A., Ueda,R. and Takahashi,T. TITLE Elevated expression of Unph, a proto-oncogene at 3p21.3, in human lung tumors JOURNAL Oncogene 10 (11), 2179-2183 (1995) MEDLINE 95303480 REFERENCE 2 (bases 1 to 3057) AUTHORS Frederick,A., Rolfe,M. and Chiu,M.I. TITLE The human UNP locus at 3p21.31 encodes two tissue-selective, cytoplasmic isoforms with deubiquitinating activity that have reduced expression in small cell lung carcinoma cell lines JOURNAL Oncogene 16 (2), 153-165 (1998) REFERENCE 3 (bases 1 to 3057) AUTHORS Frederick,A., Rolfe,M. and Chiu,M.I. TITLE Direct Submission JOURNAL Submitted (07-AUG-1997) Biology, Mitotix, Inc., One Kendall Square, Bldg 600, Cambridge, MA 02139, USA FEATURES Location/Qualifiers source 1..3057 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="3" /map="3p21.31" /cell_type="T cell" gene 1..3057 /gene="UNP" /allele="EL" CDS 4..2895 /gene="UNP" /note="deubiquitinating enzyme; Member of UBP family of ubiquitin-specific proteases; signature Cys and His boxes. Approximately 109 kDa protein found primarily in the cytoplasmic fraction. More than one isoform is recognized by anti-Unp antisera. Gst-UnpEL can cleave ubiquitin-beta-gal substrate in E. coli assay; GST-UnpEL mutant Cys311Ala cannot" /codon_start=1 /product="UnpEL" /db_xref="PID:g2656141" /translation="MAEGGGCRERPDAETQKSELGPLMRTTLQRGAQWYLIDSRWFKQ WKKYVGFDSWDMYNVGEHNLFPGPIDNSGLFSDPESQTLKEHLIDELDYVLVPTEAWN KLLNWYGCVEGQQPIVRKVVEHGLFVKHCKVEVYLLELKLCENSDPTNVLSCHFSKAD TIATIEKEMRKLFNIPAERETRLWNKYMSNTYEQLSKLDNTVQDAGLYQGQVLVIEPQ NEDGTWPRQTLQSKSSTAPSRNFTTSPKSSASPYSSVSASLIANGDSTSTCGMHSSGV SRGGSGFSASYNCQEPPSSHIQPGLCGLGNLGNTCFMNSALQCLSNTAPLTDYFLKDE YEAEINRDNPLGMKGEIAEAYAELIKQMWSGRDAHVAPRMFKTQVGRFAPQFSGYQQQ DSQELLAFLLDGLHEDLNRVKKKPYLELKDANGRPDAVVAKEAWENHRLRNDSVIVDT FHGLFKSTLVCPECAKVSVTFDPFCYLTLPLPLKKDRVMEVFLVPADPHCRPTQYRVT VPLMGAVSDLCEALSRLSGIAAENMVVADVYNHRFHKIFQMDEGLNHIMPRDDIFVYE VCSTSVDGSECVTLPVYFRERKSRPSSTSSASALYGQPLLLSVPKHKLTLESLYQAVC DRISRYVKQPLPDEFGSSPLEPGACNGSRNSCEGEDEEEMEHQEEGKEQLSETEGSGE DEPGNDPSETTQKKIKGQPCPKRLFTFSLVNSYGTADINSLAADGKLLKLNSRSTLAM DWDSETRRLYYDEQESEAYEKHVSMLQPQKKKKTTVALRDCIELFTTMETLGEHDPWY CPNCKKHQQATKKFDLWSLPKILVVHLKRFSYNRYWRDKLDTVVEFPIRGLNMSEFVC NLSARPYVYDLIAVSNHYGAMGVGHYTAYAKNKLNGKWYYFDDSNVSLASEDQIVTKA AYVLFYQRRDDEFYKTPSLSSSGSSDGGTRPSSSQQGFGDDEACSMDTN" BASE COUNT 834 a 727 c 777 g 719 t ORIGIN 1 gagatggcgg aaggtggagg ctgccgtgag cgaccggatg cggagactca gaagtccgag 61 cttggaccct taatgaggac cacactccaa cgcggggcgc agtggtatct tattgacagc 121 cggtggttca agcagtggaa gaagtatgtg ggctttgaca gctgggacat gtacaatgtg 181 ggtgaacata acctatttcc tggcccaata gacaactctg ggctattttc agatcctgag 241 agtcagacct tgaaagaaca cttaattgat gaattggact atgtattggt ccctaccgag 301 gcgtggaata aactactaaa ctggtacggc tgtgtagaag gccagcaacc catcgtcaga 361 aaagttgtgg agcatggcct gtttgtcaag cactgcaaag tcgaggtgta tttgctggaa 421 ctgaagctct gtgagaacag tgaccccacc aatgtgctga gttgccattt cagcaaggca 481 gacaccattg caaccatcga gaaagagatg cggaagctat tcaacatccc tgcggagcgt 541 gaaacacggc tctggaacaa atacatgagc aacacctacg agcagttgag caagctagac 601 aacactgtcc aggatgctgg gctataccag ggtcaggtgc tagtaattga gcctcaaaat 661 gaagatggca catggcccag gcagaccttg cagtcaaaat caagcactgc gcctagcaga 721 aattttacta cctctccaaa atcatcagca agtccctatt cctcagtgtc tgcctctctc 781 attgcaaatg gtgatagcac tagcacctgt gggatgcaca gttccggtgt cagcaggggt 841 ggatctggct tttctgcttc gtataattgt caggagccac catcctctca tatacaacct 901 gggctctgtg gacttggaaa cctgggaaac acctgcttca tgaactccgc tttgcagtgt 961 ttgagcaaca ctgcaccact gactgactac tttctcaaag atgagtatga agccgaaatc 1021 aacagagaca accctctggg gatgaaaggg gaaattgcag aagcctatgc tgaactcatt 1081 aagcagatgt ggtctggaag ggacgcccat gtggcacctc gcatgttcaa aactcaagta 1141 ggacgttttg ctcctcaatt ttctggctac cagcaacaag attctcagga gctgctggcc 1201 tttcttctag atggattgca tgaagatctg aaccgggtaa agaaaaagcc ctacttggag 1261 ctgaaggatg ccaatgggcg gccagatgcg gtggtggcaa aggaagcctg ggagaatcac 1321 aggttgagga atgattctgt gattgtggat actttccatg gcctcttcaa atctactttg 1381 gtttgcccag aatgtgctaa ggtttctgtg acctttgacc cattttgcta tctaacgctg 1441 ccactgccct tgaagaaaga tcgagttatg gaggttttcc tggttcctgc tgaccctcac 1501 tgcagaccta ctcagtaccg tgtgactgtg ccgctgatgg gggctgtgtc cgacctgtgc 1561 gaggctctct ccaggctgtc tggcattgct gcagaaaata tggtggtcgc agatgtgtat 1621 aatcaccgat tccacaaaat tttccaaatg gatgaaggtt taaaccacat catgcctcgg 1681 gatgacattt tcgtgtacga ggtctgcagc acttccgtgg atggctcgga atgtgtcacg 1741 cttccagtct acttcaggga gaggaagtcc aggccatcaa gcacttcctc cgcatcagcg 1801 ctatatgggc agccactatt gctttctgtc cccaagcaca agttaaccct tgagtctttg 1861 taccaggctg tttgtgatcg tatcagccgc tatgtgaaac agcctttacc tgatgagttt 1921 ggcagctcac ccttggagcc aggggcctgc aatggctcca ggaacagctg tgaaggagaa 1981 gatgaggaag aaatggagca tcaggaagaa ggcaaagagc agctttcaga aacagaaggc 2041 agtggggaag atgagccagg aaatgacccc agtgagacca cccaaaagaa gatcaaaggc 2101 cagccctgcc caaaaaggct ttttaccttc agtcttgtga actcctatgg aacagctgac 2161 ataaattcac ttgcagctga tggaaaacta cttaaactca actctcgatc tacactggcc 2221 atggattggg acagtgaaac tcggagactt tactatgatg agcaagaatc tgaggcctac 2281 gagaagcatg tgagcatgtt gcagcctcag aagaagaaga agaccacagt ggccctgaga 2341 gactgcatcg agctcttcac caccatggag acccttgggg agcatgaccc ctggtactgt 2401 cccaactgta agaagcatca acaggccaca aaaaagtttg acctatggtc cttgcccaag 2461 atcctggtgg tccacctcaa acgtttctcc tacaacagat actggaggga taagctcgac 2521 acagtcgtag aattcccaat cagagggctg aacatgtccg agtttgtctg taacctgtca 2581 gcaaggcctt atgtgtacga cctcattgcc gtgtccaatc attatggagc catgggggtt 2641 ggccactaca ctgcatatgc gaagaacaaa ctgaatggta aatggtatta ctttgatgat 2701 agcaacgtgt ccctggcctc tgaggatcag atagtgacta aagcagctta tgtgctattt 2761 taccaacgtc gagatgatga attttataag acaccttcac ttagcagttc tggttcctct 2821 gatggaggga cacgaccaag cagctctcag cagggctttg gggatgatga ggcttgcagc 2881 atggacacca actaatgctg actccacgat cctgccaccc tgtagcgcca gtgtaatccc 2941 ccaggagaac atctttgaca ctctgcagac tgctagtgtt ctgtctaaaa accagacaag 3001 gaaataccct tcttttatga gcagaaggaa accaaaaaaa aaaaggaggc cgtttac // LOCUS AF017445 3144 bp mRNA PRI 02-NOV-1997 DEFINITION Homo sapiens GDP-L-fucose pyrophosphorylase (GFPP) mRNA, complete cds. ACCESSION AF017445 NID g2582184 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3144) AUTHORS Pastuszak,I., Ketchum,C., Hermanson,G., Drake,R. and Elbein,A. TITLE GDP-L-Fucose Pyrophosphorylase: Purification, cDNA cloning, and properties of the enzyme JOURNAL Unpublished REFERENCE 2 (bases 1 to 3144) AUTHORS Hermanson,G., Ketchum,C., Pastuszak,I., Drake,R. and Elbein,A.D. TITLE Direct Submission JOURNAL Submitted (05-AUG-1997) Molecular Biology, Cytel Corporation, 3525 John Hopkins Court, San Diego, CA 92121, USA FEATURES Location/Qualifiers source 1..3144 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..3144 /gene="GFPP" CDS 38..1822 /gene="GFPP" /EC_number="2.7.7.30" /function="catalyzes the formation of GDP-L-fucose from GTP and L-fucose-1-phosphate" /codon_start=1 /product="GDP-L-fucose pyrophosphorylase" /db_xref="PID:g2582185" /translation="MAAARDPPEVSLREATQRKLRRFSELRGKLVARGEFWDIVAITA ADEKQELAYNQQLSEKLKRKELPLGVQYHVFVDPAGAKIGNGGSTLCALQCLEKLYGD KWNSFTILLIHSGGYSQRLPNASALGKIFTALPLGNPIYQMLELKLAMYIDFPLNMNP GILVTCADDIELYSIGEFEFIRFDKPGFTALAHPSSLTIGTTHGVFVLDPFDDLKHRD LEYRSCHRFLHKPSIEKMYQFNAVCRPGNFCQQDFAGGDIADLKLDSDYVYTDSLFYM DHKSAKMLLAFYEKIGTLSCEIDAYGDFLQALGPGATVEYTRNTSHVIKEESELVEMR QRIFHLLKGTSLNVVVLNNSKFYHIGTTEEYLFYFTSDNSLKSELGLQSITFSIFPDI PECSGKTSCIIQSILDSRCSVAPGSVVEYSRLGPDVSVGENCIISGSYILTKAALPAH SFVCSLSLKMNRCLKYATMAFGVQDNLKKSVKTLSDIKLLQFFGVCFLSCLDVWNLKV TEELFSGNKTCLSLWTARIFPVCSSLSDSVITSLKMLNAVKNKSAFSLNSYKLLSIEE MLIYKDVEDMITYREQIFLEISLKSSLM" BASE COUNT 999 a 509 c 601 g 1035 t ORIGIN 1 gcgtgctgtg cggcgcggtc tcagggaagg tggggctatg gcagctgcta gggaccctcc 61 ggaagtatcg ctgcgagaag ccacccagcg aaaattgcgg aggttttccg agctaagagg 121 caaacttgta gcacgtggag aattctggga catagttgca ataacagcgg ctgatgaaaa 181 acaggaactt gcttacaacc aacagctgtc agaaaagctg aaaagaaagg agttacccct 241 tggagttcaa tatcacgttt ttgtggatcc tgctggagcc aaaattggaa atggaggatc 301 aacactttgt gcccttcaat gtttggaaaa gctatatgga gataaatgga attcttttac 361 catcttatta attcactctg gtggctacag tcaacgactt ccaaatgcaa gtgctctggg 421 aaaaattttc actgctttac ctcttggtaa ccccatttat cagatgctag aattaaagct 481 agccatgtac attgatttcc ccttaaatat gaatcctgga attctggtta cctgtgcaga 541 tgatattgaa ctttatagta ttggagaatt tgagtttatt aggtttgaca aacctggctt 601 tactgcttta gctcatcctt ctagtttgac gataggtacc acacatggag tatttgtctt 661 agatcctttt gatgatttaa aacatagaga ccttgaatac aggtcttgcc atcgtttcct 721 tcataagccc agcatagaaa agatgtatca gtttaatgct gtgtgtagac ctggaaattt 781 ttgtcaacag gactttgctg ggggtgacat tgccgatctt aaattagact ctgactatgt 841 ctacacagat agcctatttt atatggatca taaatcagca aaaatgttac ttgcttttta 901 tgaaaaaata ggcacactga gctgtgaaat agatgcctat ggtgactttc tgcaggcttt 961 gggacctgga gcaactgtgg agtacaccag aaacacatca catgtcatta aagaagagtc 1021 agagttggta gaaatgaggc agagaatatt tcatcttctt aaaggaacat cactaaatgt 1081 tgttgttctt aataactcca aattttatca cattggaaca accgaagaat atttgtttta 1141 ctttacctca gataacagtt taaagtcaga gctcggctta cagtccataa cttttagtat 1201 ctttccagat ataccagaat gctctggcaa aacatcctgt atcattcaaa gcatactgga 1261 ttcaagatgt tctgtggcac ctggctcagt tgtggagtat tccagattgg ggcctgatgt 1321 ttcagttggg gaaaactgca ttattagtgg ttcttacatc ctaacaaaag ctgccctccc 1381 cgcacattct tttgtatgtt ccttaagctt aaagatgaat agatgcttaa agtatgcaac 1441 tatggcattt ggagtgcaag acaacttgaa aaagagtgtg aaaacattgt cagatataaa 1501 gttacttcaa ttctttggag tctgtttcct gtcatgctta gatgtttgga atcttaaagt 1561 tacagaggaa ctgttctctg gtaacaagac atgtctgagt ttgtggactg cacgcatttt 1621 cccagtttgt tcttctttga gtgactcagt tataacatcc ctaaagatgt taaatgctgt 1681 taagaacaag tcagcattca gcctgaatag ctataagttg ctgtccattg aagaaatgct 1741 tatctacaaa gatgtagaag atatgataac ttacagggaa caaatttttc tagaaatcag 1801 tttaaaaagc agtttgatgt agagatattt taaatattgt acactttgcc tttttgagta 1861 acattccaga gataggtatt tttggtaggc tgtttcactg aactcagtta atgaaaactg 1921 tattaacata attgttgtag cataatatta atagtgcaaa agtacatata agtcattttg 1981 atgaaaaata ttccaagact aagttgagaa aagagatact attttggatg tgtatcagta 2041 tttttgtttt taataatgat tgatttgtgg agcattgttt tttcacataa ttagttttaa 2101 aggtaatttt ctaagcatac ctttggaatt tttccatctt ttttgaggct tttggtccag 2161 tgaagttcta agtattcact ggcacttctc tcctcaactg taattctatt tttaataata 2221 aaaatggcat actgtagggt cttcagagta gtgtaggaat actgtagaaa tactttttca 2281 gaaacgaatc catagctgac aaattcactc agtgcccaat atattgtgat tattttcgtt 2341 gataaagaac tagatacaaa gacctctgaa attgatgata aaatttgtat ctcattaatt 2401 ttatcaaaat gaaactaaaa gtacatatgt attatacact tgaacatgtg tttgtatatc 2461 tttaaaattt tcctttatgt ccattccata ggaagacaca catatgcaca caaaactcag 2521 ttatctgtgg gggaagtggt agtaataatt gagattcatc aataatcata taatttcact 2581 atagacatta cttgcattat ctcctatcag tccttctcac aaaacttcaa ttaccagatg 2641 aactgacttt agttttcatc ttattttttg tccacatgct ggtgatgctg agaaacaata 2701 attggcccaa atgcagacat caagaagcag gcaggaaaat ggaaaaactg agataataca 2761 caggtgataa acagtttcga gaaaataaaa attggctaga atctcaagtt atgtgtttct 2821 atattggtat aagcatataa gtgaaaagtc ttataagtgt aatagagaaa agatttgtca 2881 gtgtgttttt ttaaatgaaa taactagtct gtgctacttt atgtcaatat aaaaattggt 2941 aaactagaag taacttgtcc acaaccctca gttatgatac ttatgtgcgt gttttttttt 3001 ccaaagtttt tcccaaggag aagatacaaa tatatggtag ctattgttta aaaatgattg 3061 atttacttgc agatttttca gaggatgtta tgtctttgtc attcattaaa tgttcaaaat 3121 tgtagtttta aaaaaaaaaa aaaa // LOCUS AF017456 3487 bp mRNA PRI 09-OCT-1997 DEFINITION Homo sapiens lysosomal pepstatin insensitive protease (CLN2) mRNA, complete cds. ACCESSION AF017456 NID g2408231 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3487) AUTHORS Sleat,D.E., Donnelly,R.J., Lackland,H., Liu,C.G., Sohar,I., Pullarkat,R.K. and Lobel,P. TITLE Association of mutations in a lysosomal protein with classical late-infantile neuronal ceroid lipofuscinosis JOURNAL Science 277 (5333), 1802-1805 (1997) MEDLINE 97442529 REFERENCE 2 (bases 1 to 3487) AUTHORS Sleat,D.E., Donnelly,R.J., Lackland,H., Liu,C.-G., Sohar,I., Pullarkat,R. and Lobel,P. TITLE Direct Submission JOURNAL Submitted (07-AUG-1997) CABM, UMDNJ, 679 Hoes Lane, Piscataway, NJ 08854, USA FEATURES Location/Qualifiers source 1..3487 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11p15" 5'UTR 1..14 /gene="CLN2" gene 1..3487 /gene="CLN2" CDS 15..1706 /gene="CLN2" /note="deficient in late infantile neuronal ceroid lipofuscinosis; similar to prokaryotic pepstatin insensitive carboxyl proteases" /codon_start=1 /product="lysosomal pepstatin insensitive protease" /db_xref="PID:g2408232" /translation="MGLQACLLGLFALILSGKCSYSPEPDQRRTLPPGWVSLGRADPE EELSLTFALRQQNVERLSELVQAVSDPSSPQYGKYLTLENVADLVRPSPLTLHTVQKW LLAAGAQKCHSVITQDFLTCWLSIRQAELLLPGAEFHHYVGGPTETHVVRSPHPYQLP QALAPHVDFVGGLHHFPPTSSLRQRPEPQVTGTVGLHLGVTPSVIRKRYNLTSQDVGS GTSNNSQACAQFLEQYFHDSDLAQFMRLFGGNFAHQASVARVVGQQGRGRAGIEASLD VQYLMSAGANISTWVYSSPGRHEGQEPFLQWLMLLSNESALPHVHTVSYGDDEDSLSS AYIQRVNTELMKAAARGLTLLFASGDSGAGCWSVSGRHQFRPTFPASSPYVTTVGGTS FQEPFLITNEIVDYISGGGFSNVFPRPSYQEEAVTKFLSSSPHLPPSSYFNASGRAYP DVAALSDGYWVVSNRVPIPWVSGTSASTPVFGGILSLINEHRILSGRPPLGFLNPRLY QQHGAGLFDVTRGCHESCLDEEVEGQGFCSGPGWDPVTGWGTPNFPALLKTLLNP" sig_peptide 15..62 /gene="CLN2" /product="lysosomal pepstatin insensitive protease" variation 538 /gene="CLN2" /note="A to G polymorphism (His to Arg)" /replace="g" mutation 636 /gene="CLN2" /note="C to T transition (Arg to stop)" /replace="t" misc_feature 642..644 /gene="CLN2" /note="glycosylation site" /evidence=not_experimental misc_feature 678..701 /gene="CLN2" /note="glycosylation site" /evidence=not_experimental misc_feature 870..872 /gene="CLN2" /note="glycosylation site" /evidence=not_experimental misc_feature 951..953 /gene="CLN2" /note="glycosylation site" /evidence=not_experimental variation 1058 /gene="CLN2" /note="T to C polymorphism (silent)" /replace="c" mutation 1107 /gene="CLN2" /note="T to C transition (Cys to Arg)" /replace="c" mutation 1108 /gene="CLN2" /note="G to A transition (Cys to Tyr)" /replace="a" misc_feature 1341..1343 /gene="CLN2" /note="glycosylation site" /evidence=not_experimental 3'UTR 1707..3487 /gene="CLN2" variation 2824 /gene="CLN2" /note="G to C polymorphism" /replace="c" BASE COUNT 787 a 1023 c 734 g 943 t ORIGIN 1 cgcggaaggg cagaatggga ctccaagcct gcctcctagg gctctttgcc ctcatcctct 61 ctggcaaatg cagttacagc ccggagcccg accagcggag gacgctgccc ccaggctggg 121 tgtccctggg ccgtgcggac cctgaggaag agctgagtct cacctttgcc ctgagacagc 181 agaatgtgga aagactctcg gagctggtgc aggctgtgtc ggatcccagc tctcctcaat 241 acggaaaata cctgacccta gagaatgtgg ctgatctggt gaggccatcc ccactgaccc 301 tccacacggt gcaaaaatgg ctcttggcag ccggagccca gaagtgccat tctgtgatca 361 cacaggactt tctgacttgc tggctgagca tccgacaagc agagctgctg ctccctgggg 421 ctgagtttca tcactatgtg ggaggaccta cggaaaccca tgttgtaagg tccccacatc 481 cctaccagct tccacaggcc ttggcccccc atgtggactt tgtgggggga ctgcaccatt 541 ttcccccaac atcatccctg aggcaacgtc ctgagccgca ggtgacaggg actgtaggcc 601 tgcatctggg ggtaaccccc tctgtgatcc gtaagcgata caacttgacc tcacaagacg 661 tgggctctgg caccagcaat aacagccaag cctgtgccca gttcctggag cagtatttcc 721 atgactcaga cctggctcag ttcatgcgcc tcttcggtgg caactttgca catcaggcat 781 cagtagcccg tgtggttgga caacagggcc ggggccgggc cgggattgag gccagtctag 841 atgtgcagta cctgatgagt gctggtgcca acatctccac ctgggtctac agtagccctg 901 gccggcatga gggacaggag cccttcctgc agtggctcat gctgctcagt aatgagtcag 961 ccctgccaca tgtgcatact gtgagctatg gagatgatga ggactccctc agcagcgcct 1021 acatccagcg ggtcaacact gagctcatga aggctgctgc tcggggtctc accctgctct 1081 tcgcctcagg tgacagtggg gccgggtgtt ggtctgtctc tggaagacac cagttccgcc 1141 ctaccttccc tgcctccagc ccctatgtca ccacagtggg aggcacatcc ttccaggaac 1201 ctttcctcat cacaaatgaa attgttgact atatcagtgg tggtggcttc agcaatgtgt 1261 tcccacggcc ttcataccag gaggaagctg taacgaagtt cctgagctct agcccccacc 1321 tgccaccatc cagttacttc aatgccagtg gccgtgccta cccagatgtg gctgcacttt 1381 ctgatggcta ctgggtggtc agcaacagag tgcccattcc atgggtgtcc ggaacctcgg 1441 cctctactcc agtgtttggg gggatcctat ccttgatcaa tgagcacagg atccttagtg 1501 gccgcccccc tcttggcttt ctcaacccaa ggctctacca gcagcatggg gcaggactct 1561 ttgatgtaac ccgtggctgc catgagtcct gtctggatga agaggtagag ggccagggtt 1621 tctgctctgg tcctggctgg gatcctgtaa caggctgggg aacacccaac ttcccagctt 1681 tgctgaagac tctactcaac ccctgaccct ttcctatcag gagagatggc ttgtcccctg 1741 ccctgaagct ggcagttcag tcccttattc tgccctgttg gaagccctgc tgaaccctca 1801 actattgact gctgcagaca gcttatctcc ctaaccctga aatgctgtga gcttgacttg 1861 actcccaacc ctaccatgct ccatcatact caggtctccc tactcctgcc ttagattcct 1921 caataagatg ctgtaactag cattttttga atgcctctcc ctccgcatct catctttctc 1981 ttttcaatca ggcttttcca aagggttgta tacagactct gtgcactatt tcacttgata 2041 ttcattcccc aattcactgc aaggagacct ctactgtcac cgtttactct ttcctaccct 2101 gacatccaga aacaatggcc tccagtgcat acttctcaat ctttgcttta tggcctttcc 2161 atcatagttg cccactccct ctccttactt agcttccagg tcttaacttc tctgactact 2221 cttgtcttcc tctctcatca atttctgctt cttcatggaa tgctgacctt cattgctcca 2281 tttgtagatt tttgctcttc tcagtttact cattgtcccc tggaacaaat cactgacatc 2341 tacaaccatt accatctcac taaataagac tttctatcca ataatgattg atacctcaaa 2401 tgtaagatgc gtgatactca acatttcatc gtccaccttc ccaaccccaa acaattccat 2461 ctcgtttctt cttggtaaat gatgctatgc tttttccaac caagccagaa acctgtgtca 2521 tcttttcacc ccaccttcaa tcaacaagtc ctcaatcaac aagtcctact gactgcacat 2581 cttaaatata tctttatcag tccacaagtc cttccaatta tatttcccaa gtatatctag 2641 aacttatcca cttatatccc cactgctact accttagttt agggctatat tctcttgaaa 2701 aaaagtgtcc ttacttcctg ccaatcccca agtcatcttc cagagtaaaa tgcaaatccc 2761 atcaggccac ttggatgaaa acccttcaag gattactgga tagaattcag gctttcccct 2821 ccagccccca atcatagctc acaaaccttc cttgctattt gttcttaagt aaaaaatcat 2881 ttttcctcct ccctccccaa accccaagga actctcactc ttgctcaagc tgttccgtcc 2941 ccttaccacc cctgatacaa ctgccaggtt aatttccaga attcttgcaa gactcagttc 3001 agaagtcacc ttctttcgtg aatgttttga ttccctgagg ctactttatt ttggtatggc 3061 tgaaaaatcc tagattttct aaacaaaacc tgtttgaatc ttggttctga tatggactag 3121 gagagagact gggtcaagta agcttatctc cctgaggctg tttcctcgtc tgttaagtgt 3181 gaatatcaat acctgccttt cataatcacc agggaataaa gtggaataat gttgataaca 3241 gtgcttggca cctggaagta ggtggcagat gttaacgccc ttcctccctt gcactgcgcc 3301 ccctgtgcct acctctagca ttgtaacgac cacatagtat tgaaatggcc agtttacttg 3361 tctgccttcc tttccaagac cgttggtgcc tagaggacta gaatcgtgtc ctatttaact 3421 ttgtgttccc aggtcctagc tcaggagttg gcaaataaga attaaatgtc tgctacaccg 3481 aaacaaa // LOCUS AF017635 1337 bp mRNA PRI 18-SEP-1997 DEFINITION Homo sapiens DCHT mRNA, complete cds. ACCESSION AF017635 NID g2407300 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1337) AUTHORS Baytel,D. and Don,J. TITLE Direct Submission JOURNAL Submitted (08-AUG-1997) Life Sciences, Bar-Ilan University, Ramat-Gan 52900, Israel FEATURES Location/Qualifiers source 1..1337 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="61m" /tissue_type="testis" /note="present in all tissues examined but most abundantly expressed in testis and brain; a testis-specific transcript also exists" gene 1..1337 /gene="DCHT" CDS 27..830 /gene="DCHT" /codon_start=1 /product="DCHT" /db_xref="PID:g2407301" /translation="MKVLMLTLQNDPPTLETGVEDKEMMKKYGKSFRKLLSLCLQKDP SKRPTAAELLKCKFFQKAKNREYLIEKLLTRTPDIAQRAKKVRRVPGSSGHLHKTEDG DWEWSDDEMDEKSEEGKAAFSQEKSRRVKEENPEIAVSASTIPEQIQSLSVHDSQGPP NANEDYREASSCAVNLVLRLRNSRKELNDIRFEFTPGRDTADGVSQELFSAGLVDGHD VVIVAANLQKIVDDPKALKTLTFKLASGCDGSEIPDEVKLIGFAQLSVS" BASE COUNT 421 a 284 c 269 g 363 t ORIGIN 1 gaattcggca cgagaaatat cctcccatga aagtgttaat gttgactttg caaaatgatc 61 cacccacttt ggaaacaggg gtagaggata aagaaatgat gaaaaagtac ggcaagtcct 121 ttagaaaatt actttcactg tgtcttcaga aagatccttc caaaaggccc acagcagcag 181 aacttttaaa atgcaaattc ttccagaaag ccaagaacag agagtacctg attgagaagc 241 tgcttacaag aacaccagac atagcccaaa gagccaaaaa ggtaagaaga gttcctgggt 301 caagtggtca ccttcataaa accgaagacg gggactggga gtggagtgac gacgagatgg 361 atgagaagag cgaagaaggg aaagcagctt tttctcagga aaagtcacga agagtaaaag 421 aagaaaatcc agagattgca gtgagtgcca gcaccatccc cgaacaaata cagtccctct 481 ctgtgcacga ctctcagggc ccacccaatg ctaatgaaga ctacagagaa gcttcttctt 541 gtgccgtgaa cctcgttttg agattaagaa actccagaaa ggaacttaat gacatacgat 601 ttgagtttac tccaggaaga gatacagcag atggtgtatc tcaggagctc ttctctgctg 661 gcttggtgga tggtcacgat gtagttatag tggctgctaa tttacagaag attgtagatg 721 atcccaaagc tttaaaaaca ttgacattta agttggcttc tggctgtgat gggtcggaga 781 ttcctgatga agtgaagctg attgggtttg ctcagttgag tgtcagctga tgtatgtccc 841 ttgatgtcac cctgatctgt catgccccac cgccacccct actcccttca accctccctc 901 tttctgccca tttcctccca ccccctcact cccatttcct agcaaaatca gaagattgtg 961 aagaggccgg cttcaacaaa atgggataaa aaaataattt tttaaaactt acaacactcc 1021 gagttctgct ttattctcta gcaatccaca gtacaagaac aagcaaatgc cacagctgca 1081 cgactgttgc tcatttttcc aaaagctatt taatattctt agcaatcaat ttggatatcc 1141 cttaagtgaa aagaatctga aatacactca ggtggtctta tttattggca acaaaaggaa 1201 ttttctatcc agaagcctat ttctcctttc attgttgtta tttctgttat aatactttaa 1261 ttgtacatct gacaatactg cctcttttat gttgtattta gaaattaata tacttataaa 1321 attaagattt attagcc // LOCUS AF017656 1471 bp mRNA PRI 30-OCT-1997 DEFINITION Homo sapiens G protein beta 5 subunit mRNA, complete cds. ACCESSION AF017656 NID g2570403 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1471) AUTHORS Jones,P.G., Lombardi,S.J. and Cocket,M.I. TITLE Cloning and Distribution of the human G protein beta 5 cDNA JOURNAL Unpublished REFERENCE 2 (bases 1 to 1471) AUTHORS Jones,P.G., Lombardi,S.J. and Cocket,M.I. TITLE Direct Submission JOURNAL Submitted (07-AUG-1997) CNS Disorders, Wyeth-Ayerst Research, CN 8000, Princeton, NJ 08543, USA FEATURES Location/Qualifiers source 1..1471 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 100..1161 /codon_start=1 /product="G protein beta 5 subunit" /db_xref="PID:g2570404" /translation="MATEGLHENETLASLKSEAESLKGKLEEERAKLHDVELHQVAER VEALGQFVMKTRRTLKGHGNKVLCMDWCKDKRRIVSSSQDGKVIVWDSFTTNKEHAVT MPCTWVMACAYAPSGCAIACGGLDNKCSVYPLTFDKNENMAAKKKSVAMHTNYLSACS FTNSDMQILTASGDGTCALWDVESGQLLQSFHGHGADVLCLDLAPSETGNTFVSGGCD KKAMVWDMRSGQCVQAFETHESDINSVRYYPSGDAFASGSDDATCRLYDLRADREVAI YSKESIIFGASSVDFSLSGRLLFAGYNDYTINVWDVLKGSRVSILFGHENRVSTLRVS PDGTAFCSGSWDHTLRVWA" BASE COUNT 345 a 373 c 407 g 346 t ORIGIN 1 tccaagctga attccgggga cggctgctgg agcggcgccc gccgcggctc agcgcattcc 61 cgctctccgc ttccctctcc gctgcgtccc cgcgcgaaga tggcaaccga ggggctgcac 121 gagaacgaga cgctggcgtc gctgaagagc gaggccgaga gcctcaaggg caagctggag 181 gaggagcgag ccaagctgca cgatgtggag ctgcaccagg tggcggagcg ggtggaggcc 241 ctggggcagt ttgtcatgaa gaccagaagg accctcaaag gccacgggaa caaagtcctg 301 tgcatggact ggtgcaaaga taagaggagg atcgtgagct cgtcacagga tgggaaggtg 361 atcgtgtggg attccttcac cacaaacaag gagcacgcgg tcaccatgcc ctgcacgtgg 421 gtgatggcat gtgcttatgc cccatcggga tgtgccattg cttgtggtgg tttggataat 481 aagtgttctg tgtacccctt gacgtttgac aaaaatgaaa acatggctgc caaaaagaag 541 tctgttgcta tgcacaccaa ctacctgtcg gcctgcagct tcaccaactc tgacatgcag 601 atcctgacag cgagcggcga tggcacatgt gccctgtggg acgtggagag cgggcagctg 661 ctgcagagct tccacggaca tggggctgac gtcctctgct tggacctggc cccctcagaa 721 actggaaaca ccttcgtgtc tgggggatgt gacaagaaag ccatggtgtg ggacatgcgc 781 tccggccagt gcgtgcaggc ctttgaaaca catgaatccg acatcaacag tgtccggtac 841 taccccagtg gagatgcctt tgcttcaggg tcagatgacg ctacgtgtcg cctctatgac 901 ctgcgggcag atagggaggt tgccatctat tccaaagaaa gcatcatatt tggagcatcc 961 agcgtggact tctccctcag tggtcgcctg ctgtttgctg gatacaatga ttacactatc 1021 aacgtctggg atgttctcaa agggtcccgg gtctccatcc tgtttggaca tgaaaaccgc 1081 gttagcactc tacgagtttc ccccgatggg actgctttct gctctggatc atgggatcat 1141 accctcagag tctgggccta atcatcttct gacagtgcac tcatgtatac ctgagaattt 1201 gaaatcttca catgtaaata gatattactt ctagaggagc ttagagttta ttgcagtgta 1261 gcttagggga gcaacccatg gctcacaggt cactaagcgt ctccaatatg actattaaaa 1321 ctgtcacctc tggaaataca ctagtgtgag ccttcagcac tgcgagaata ccttcaagta 1381 cagtattttt cttttggaac actttttaaa atgtatctgt ttttaaggtt attctaaatt 1441 atagtagcct caactcattc tgtcaccagt a // LOCUS AF017789 4193 bp mRNA PRI 09-OCT-1997 DEFINITION Homo sapiens putative transcription factor CA150 mRNA, complete cds. ACCESSION AF017789 NID g2460123 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4193) AUTHORS Sune,C., Hayashi,T., Liu,Y., Lane,W.S., Young,R.A. and Garcia-Blanco,M.A. TITLE CA150, a nuclear protein associated with the RNA polymerase II holoenzyme, is involved in Tat-activated human immunodeficiency virus type 1 transcription JOURNAL Mol. Cell. Biol. 17 (10), 6029-6039 (1997) MEDLINE 97459702 REFERENCE 2 (bases 1 to 4193) AUTHORS Sune,C. and Garcia-Blanco,M.A. TITLE Direct Submission JOURNAL Submitted (10-AUG-1997) Molecular Cancer Biology, Duke University Medical Center, Box 3686 Research Drive. LSRC Bldg Room C-115, Durham, NC 27710, USA FEATURES Location/Qualifiers source 1..4193 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" gene 1..4193 /gene="CA150" CDS 1..3297 /gene="CA150" /function="HIV-1 Tat transcriptional coactivator" /codon_start=1 /product="putative transcription factor CA150" /db_xref="PID:g2460124" /translation="MAERGGDGGESERFNPGELRMAQQQALRFRGPAPPPNAVMRGPP PLMRPPPPFGMMRGPPPPPRPPFGRPPFDPNMPPMPPPGGIPPPMGPPHLQRPPFMPP PMSSMPPPPGMMFPPGMPPVTAPGTPALPPTEEIWVENKTPDGKVYYYNARTRESAWT KPDGVKVIQQSELTPMLAAQAQVQAQAQAQAQAQAQAQAQAQAQAQAQAQAQAQAQAQ AQAQAQAQAQAQAQAQAQAQAQAQVQAQVQAQVQAQAVGASTPTTSSPAPAVSTSTSS STPSSTTSTTTTATSVAQTVSTPTTQDQTPSSAVSVATPTVSVSTPARTATPVQTVPQ PHPQTLPPAVPHSVPQPTTAIPAFPPVMVPPFRVPLPGMPIPLPGVAMMQIVSCPYVK TVATTKTGVLPGMAPPIVPMIHPQVAIAASPATLAGATAVSEWTEYKTADGKTYYYNN RTLESTWEKPQELKEKEKLEEKIKEPIKEPSEEPLPMETEEEDPKEEPIKEIKEEPKE EEMTEEEKAAQKAKPVATAPIPGTPWCVVWTGDERVFFYNPTTRLSMWDRPDDLIGRA DVDKIIQEPPHKKGMEELKKLRHPTPTMLSIQKWQFSMSAIKEEQELMEEINEDEPVK AKKRKRDDNKDIDSEKEAAMEAEIKAARERAIVPLEARMKQFKDMLLERGVSAFSTWE KELHKIVFDPRYLLLNPKERKQVFDQYVKTRAEEERREKKNKIMQAKEDFKKMMEEAK FNPRATFSEFAAKHAKDSRFKAIEKMKDREALFNEFVAAARKKEKEDSKTRGEKIKSD FFELLSNHHLDSQSRWSKVKDKVESDPRYKAVDSSSMREDLFKQYIEKIAKNLDSEKE KELERQARIEASLREREREVQKARSEQTKEIDREREQHKREEAIQNFKALLSDMVRSS DVSWSDTRRTLRKDHRWESGSLLEREEKEKLFNEHIEALTKKKREHFRQLLDETSAIT LTSTWKEVKKIIKEDPRCIKFSSSDRKKQREFEEYIRDKYITAKADFRTLLKETKFIT YRSKKLIQESDQHLKDVEKILQNDKRYLVLDCVPEERRKLIVAYVDDLDRRGPPPPPT ASEPTRRSTK" BASE COUNT 1352 a 938 c 914 g 989 t ORIGIN 1 atggcggagc gtggcgggga cgggggcgag agtgaacgat tcaacccggg ggagctcagg 61 atggcccaac agcaggcctt gaggttccga ggtccggctc ccccaccaaa tgcagtgatg 121 cgaggcccac cacctctgat gcgacctcct ccaccttttg gtatgatgcg aggccctcct 181 ccaccaccac ggccgccctt tggacgtcct ccttttgatc ctaatatgcc gccaatgcct 241 cctccaggag ggatacctcc acctatgggc cctccacacc tccagagacc acctttcatg 301 cctcctccca tgagttccat gcctcctcct ccgggtatga tgtttccacc aggaatgcct 361 cctgtgactg ctcctggtac tccagcacta cctcctacgg aggagatatg ggttgaaaat 421 aaaactccag atgggaaggt ttattattat aatgctcgga cacgtgaatc tgcatggacc 481 aagccagatg gagttaaggt tattcagcaa tcagaactga cacctatgct tgcagcccag 541 gcacaggttc aggctcaggc ccaggcgcag gctcaggccc aggcgcaggc tcaggcccag 601 gcacaagctc aggcccaggc tcaggctcag gcccaggccc aggcccaggc ccaggcccag 661 gcccaagccc aagcccaggc ccaggctcag gctcaggcac aagctcaggc ccaggcccag 721 gctcaggtcc aggcccaggt ccaggcacaa gtgcaagcac aagcagttgg agcttccacc 781 cctacgacca gtagcccagc acctgcagta tccacttcaa catcatcatc caccccttcc 841 tctaccactt ctaccacaac aactgctact tcagttgcgc agacagtatc aacacccaca 901 acacaagatc agaccccaag ttctgctgtt tcagttgcca cgcctacagt tagtgtttca 961 actcctgctc gtacagccac acctgtgcaa accgttcccc agccgcaccc tcagacgtta 1021 cctcctgctg ttcctcattc agtacctcag ccaacaacag caatacctgc ttttccacca 1081 gtaatggtac ctccgtttcg tgttcccctt cctggcatgc caattccact tccaggtgta 1141 gcaatgatgc aaatagtcag ctgcccgtat gtaaagacag tcgctaccac caagaccggt 1201 gtattgccag gaatggcccc tcctatcgta cccatgatac atccccaggt tgctattgca 1261 gcttcacctg ctaccttagc tggagcaaca gcagtttctg aatggactga atataaaaca 1321 gcagatggga agacatatta ttataataat agaacattag aatcaacctg ggaaaaaccc 1381 caagaactaa aggaaaaaga aaagttagaa gagaagatta aagagccaat taaagaaccc 1441 tctgaagagc ctctgccaat ggagacggag gaggaggatc ctaaagaaga gcctataaag 1501 gagataaagg aggagcccaa agaagaggag atgactgaag aagaaaaggc tgcccagaag 1561 gcaaagccag ttgctactgc tcctattcct ggtactccat ggtgtgtcgt ttggactggt 1621 gatgagcggg tcttctttta taatcccacc actcgtcttt ctatgtggga ccgacctgat 1681 gatctgattg gcagggcaga tgttgacaaa attattcagg agccccctca taaaaaagga 1741 atggaggaat tgaagaaact aaggcaccca actccgacaa tgctgtcgat ccaaaagtgg 1801 caattctcta tgagtgcaat taaagaggaa caagaattaa tggaagaaat taatgaagat 1861 gagcctgtta aagcaaaaaa acggaagaga gacgataata aagacattga ctcagagaaa 1921 gaagctgcca tggaagctga aattaaagct gcccgagaaa gggccattgt ccctctggag 1981 gctcgaatga agcagttcaa ggacatgctg ctagagagag gggtgtctgc tttttcaacg 2041 tgggagaagg agttgcacaa gatagttttt gatccccggt acttacttct caatcctaaa 2101 gagagaaaac aggtgtttga tcagtatgta aagaccaggg cagaggaaga acgcagggaa 2161 aagaaaaata aaataatgca agccaaggaa gatttcaaaa aaatgatgga agaagcaaaa 2221 tttaatccaa gagcaacttt tagtgaattt gcagccaagc atgctaaaga ttcaagattc 2281 aaagcaattg aaaagatgaa agaccgagaa gccttgttta atgagtttgt ggccgctgct 2341 aggaagaaag agaaagaaga ttcgaagacc agaggtgaga agattaaatc ggatttcttt 2401 gaactattat ctaatcatca cttggacagt cagtctcgat ggagcaaagt aaaagacaaa 2461 gtagaaagtg atccacgtta caaagcagta gatagttcat caatgagaga agaccttttc 2521 aaacagtaca ttgaaaaaat agccaagaat ttagactcag aaaaagaaaa ggagcttgaa 2581 aggcaagccc gcattgaggc aagccttcga gaacgagaaa gggaggttca aaaggcccgt 2641 tcagaacaaa caaaagaaat agatcgagag agagagcagc acaaacgaga agaagctatc 2701 cagaatttca aagctcttct gtctgacatg gtacgttctt cagatgtgtc atggtctgat 2761 actcgtagga ccctccgaaa agatcaccgc tgggaatctg gatccttatt ggaaagagag 2821 gagaaagaga agctttttaa tgaacacatt gaagcactta ccaaaaaaaa gagagagcac 2881 tttaggcaac ttctggatga aacttctgca attaccttaa catccacgtg gaaagaagta 2941 aaaaaaatca ttaaggaaga tcctcgatgt attaagttct cctccagtga caggaaaaaa 3001 caaagagaat ttgaagaata tatcagagac aaatatatca cagccaaagc tgacttcagg 3061 acgcttttga aagagaccaa atttataaca tatagatcca aaaaattaat ccaagaatcg 3121 gatcagcacc tgaaagatgt agaaaaaatt ttacagaatg acaaacggta tctagtactg 3181 gactgtgtgc cagaggagag gcgtaaactg attgtggcat atgttgatga cctggatcgc 3241 cggggtccac ccccacctcc cacagcatcg gagcccacga gacgatcaac aaaataattc 3301 taaatactct tccatagggg catctattca aaatgcttgc atgagccaat tttcaggttt 3361 ttacatatat gtgcattagt caacctattg cgaaaccatc tgacaaacag aaggagaagc 3421 atttgtgaac agtttctgaa cagaacactt tggaaatatt tatgcttttc tttgtgtggc 3481 atgactgaca tacatactca aatataggct gtctctagta aatcttaaaa tcttgaagct 3541 aaaattcatc cttttatgag gtgtggaagt cagtgacttg gtgacgttct tcctagcagt 3601 gttaatacat gcaagaagta agagcatttg tggcttgaac ttgccagatg caaataccac 3661 agactccaag aaaacccgag ttggggtttg ttttgttttg attttttttt tttaaagcgg 3721 gtaaaagaga aaacactgaa aattgaattc ttatcttcca gaggctacaa ttattataat 3781 ggacaatact tttacctttg tctctaaaga tcagattagt tttatttgtt cacttacgtg 3841 ctttgattat cccctctgaa ttatagaccg agtcttgttg tttagcctaa gagaagattt 3901 atgtagtaat ttcttctcag gtatggaacc acggtcataa ctaacatgtt ggccagaata 3961 gaaccactgg ttaaacatat tttattcacc attaagtgat ctttatcaat attctggatt 4021 agacaacaaa ttacctttct gggtgtttct tgtaaactat actcctgttt gaatgttaaa 4081 ctttgttgct aaagtttaat tttaagatgt ttgaatgttc agtttatgta tttgaactac 4141 aataaaccaa ccctttttat aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa // LOCUS AF017790 2150 bp mRNA PRI 09-OCT-1997 DEFINITION Homo sapiens retinoblastoma-associated protein HEC mRNA, complete cds. ACCESSION AF017790 NID g2501872 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2150) AUTHORS Chen,Y., Riley,D.J., Chen,P.L. and Lee,W.H. TITLE HEC, a novel nuclear protein rich in leucine heptad repeats specifically involved in mitosis JOURNAL Mol. Cell. Biol. 17 (10), 6049-6056 (1997) MEDLINE 97459704 REFERENCE 2 (bases 1 to 2150) AUTHORS Chen,Y. and Lee,W.-H. TITLE Direct Submission JOURNAL Submitted (10-AUG-1997) Molecular Medicine, UTHSCSA, Inst. Biotechnology, 15355 Lambda Drive, San Antonio, TX 78245, USA FEATURES Location/Qualifiers source 1..2150 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..2150 /gene="HEC" CDS 105..2033 /gene="HEC" /note="nuclear protein" /codon_start=1 /product="retinoblastoma-associated protein HEC" /db_xref="PID:g2501873" /translation="MKRSSVSSGGAGRLSMQELRSQDVNKQGLYTPQTKEKPTFGKLS INKPTSERKVSLFGKRTSGHGSRNSQLGIFSSSEKIKDPRPLNDKAFIQQCIRQLCEF LTENGYAHNVSMKSLQAPSVKDFLKIFTFLYGFLCPSYELPDTKFEEEVPRIFKDLGY PFALSKSSMYTVGAPHTWPHIVAALVWLIDCIKIHTAMKESSPLFDDGQPWGEETEDG IMHNKLFLDYTIKCYESFMSGADSFDEMNAELQSKLKDLFNVDAFKLESLEAKNRALN EQIARLEQEREKEPNRLESLRKLKASLQGDVQKYQAYMSNLESHSAILDQKLNGLNEE IARVELECETIKQENTRLQNIIDNQKYSVADIERINHERNELQQTINKLTKDLEAEQQ KLWNEELKYARGKEAIETQLAEYHKLARKLKLIPKGAENSKGYDFEIKFNPEAGANCL VKYRAQVYVPLKELLNETEEEINKALNKKMGLEDTLEQLNAMITESKRSVRTLKEEVQ KLDDLYQQKIKEAEEEDEKCASELESLEKHKHLLESTVNQGLSEAMNELDAVQREYQL VVQTTTEERRKVGNNLQRLLEMVATHVGSVEKHLEEQIAKVDREYEECMSEDLSENIK EIRDKYEKKATLIKSSEE" BASE COUNT 810 a 362 c 464 g 514 t ORIGIN 1 ctcgagccac gaaggccccg ctgtcctgtc tagcagatac ttgcacggtt tacagaaatt 61 cggtccctgg gtcgtgtcag gaaactggaa aaaaggtcat aagcatgaag cgcagttcag 121 tttccagcgg tggtgctggc cgcctctcca tgcaggagtt aagatcccag gatgtaaata 181 aacaaggcct ctatacccct caaaccaaag agaaaccaac ctttggaaag ttgagtataa 241 acaaaccgac atctgaaaga aaagtctcgc tatttggcaa aagaactagt ggacatggat 301 cccggaatag tcaacttggt atattttcca gttctgagaa aatcaaggac ccgagaccac 361 ttaatgacaa agcattcatt cagcagtgta ttcgacaact ctgtgagttt cttacagaaa 421 atggttatgc acataatgtg tccatgaaat ctctacaagc tccctctgtt aaagacttcc 481 tgaagatctt cacatttctt tatggcttcc tgtgcccctc atacgaactt cctgacacaa 541 agtttgaaga agaggttcca agaatcttta aagaccttgg gtatcctttt gcactatcca 601 aaagctccat gtacacagtg ggggctcctc atacatggcc tcacattgtg gcagccttag 661 tttggctaat agactgcatc aagatacata ctgccatgaa agaaagctca cctttatttg 721 atgatgggca gccttgggga gaagaaactg aagatggaat tatgcataat aagttgtttt 781 tggactacac cataaaatgc tatgagagtt ttatgagtgg tgccgacagc tttgatgaga 841 tgaatgcaga gctgcagtca aaactgaagg atttatttaa tgtggatgct tttaagctgg 901 aatcattaga agcaaaaaac agagcattga atgaacagat tgcaagattg gaacaagaaa 961 gagaaaaaga accgaatcgt ctagagtcgt tgagaaaact gaaggcttcc ttacaaggag 1021 atgttcaaaa gtatcaggca tacatgagca atttggagtc tcattcagcc attcttgacc 1081 agaaattaaa tggtctcaat gaggaaattg ctagagtaga actagaatgt gaaacaataa 1141 aacaggagaa cactcgacta cagaatatca ttgacaacca gaagtactca gttgcagaca 1201 ttgagcgaat aaatcatgaa agaaatgaat tgcagcagac tattaataaa ttaaccaagg 1261 acctggaagc tgaacaacag aagttgtgga atgaggagtt aaaatatgcc agaggcaaag 1321 aagcgattga aacacaatta gcagagtatc acaaattggc tagaaaatta aaacttattc 1381 ctaaaggtgc tgagaattcc aaaggttatg actttgaaat taagtttaat cccgaggctg 1441 gtgccaactg ccttgtcaaa tacagggctc aagtttatgt acctcttaag gaactcctga 1501 atgaaactga agaagaaatt aataaagccc taaataaaaa aatgggtttg gaggatactt 1561 tagaacaatt gaatgcaatg ataacagaaa gcaagagaag tgtgagaact ctgaaagaag 1621 aagttcaaaa gctggatgat ctttaccaac aaaaaattaa ggaagcagag gaagaggatg 1681 aaaaatgtgc cagtgagctt gagtccttgg agaaacacaa gcacctgcta gaaagtactg 1741 ttaaccaggg gctcagtgaa gctatgaatg aattagatgc tgttcagcgg gaataccaac 1801 tagttgtgca aaccacgact gaagaaagac gaaaagtggg aaataacttg caacgtctgt 1861 tagagatggt tgctacacat gttgggtctg tagagaaaca tcttgaggag cagattgcta 1921 aagttgatag agaatatgaa gaatgcatgt cagaagatct ctcggaaaat attaaagaga 1981 ttagagataa gtatgagaag aaagctactc taattaagtc ttctgaagaa tgaagataaa 2041 atgttgatca tgtatatata tccatagtga ataaaattgt ctcagtaaaa aaaaaaaaaa 2101 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa // LOCUS AF017988 1984 bp mRNA PRI 21-SEP-1997 DEFINITION Homo sapiens secreted apoptosis related protein 3 (SARP3) mRNA, complete cds. ACCESSION AF017988 NID g2415418 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1984) AUTHORS Melkonyan,H., Chang,W.C., Shapiro,J.P., Mahadevappa,M., Fitzpatric,P.A., Kiefer,M.C., Tomei,D.L. and Umansky,S.R. TITLE SARPs - a new family of proteins that regulate apoptosis JOURNAL Unpublished REFERENCE 2 (bases 1 to 1984) AUTHORS Melkonyan,H., Prochazka,V. and Umansky,S.R. TITLE Direct Submission JOURNAL Submitted (11-AUG-1997) LXR Biotechnology, Inc., 1401 Marina Way South, Richmond, CA 94804, USA FEATURES Location/Qualifiers source 1..1984 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="15" /tissue_type="pancreas" gene 1..1984 /gene="SARP3" CDS 216..1169 /gene="SARP3" /note="similar to frizzled-like receptor, extracellular protein" /codon_start=1 /product="secreted apoptosis related protein 3" /db_xref="PID:g2415419" /translation="MRAAAAAGGVRTAALALLLGALHWAPARCEEYDYYGWQAEPLHG RSYSKPPQCLDIPADLPLCHTVGYKRMRLPNLLEHESLAEVKQQASSWLPLLAKRCHS DTQVFLCSLFAPVCLDRPIYPCRSLCEAVRAGCAPLMEAYGFPWPEMLHCHKFPLDND LCIAVQFGHLPATAPPVTKICAQCEMEHSADGLMEQMCSSDFVVKMRIKEIKIENGDR KLIGAQKKKKLLKPGPLKRKDTKRLVLHMKNGAGCPCPQLDSLAGSFLVMGRKVDGQL LLMAVYRWDKKNKEMKFAVKFMFSYPCSLYYPFFYGAAEPH" BASE COUNT 353 a 634 c 627 g 370 t ORIGIN 1 aagcttgata tcgaattcgc ggccgcgtcg acgggaggcg ccaggatcag tcggggcacc 61 cgcagcgcag gctgccaccc acctgggcga cctccgcggc ggcggcggcg gcggctgggt 121 agagtcaggg ccgggggcgc acgccggaac acctgggccg ccgggcaccg agcgtcgggg 181 ggctgcgcgg cgcgaccctg gagagggcgc agccgatgcg ggcggcggcg gcggcggggg 241 gcgtgcggac ggccgcgctg gcgctgctgc tgggggcgct gcactgggcg ccggcgcgct 301 gcgaggagta cgactactat ggctggcagg ccgagccgct gcacggccgc tcctactcca 361 agccgccgca gtgccttgac atccctgccg acctgccgct ctgccacacg gtgggctaca 421 agcgcatgcg gctgcccaac ctgctggagc acgagagcct ggccgaagtg aagcagcagg 481 cgagcagctg gctgccgctg ctggccaagc gctgccactc ggatacgcag gtcttcctgt 541 gctcgctctt tgcgcccgtc tgtctcgacc ggcccatcta cccgtgccgc tcgctgtgcg 601 aggccgtgcg cgccggctgc gcgccgctca tggaggccta cggcttcccc tggcctgaga 661 tgctgcactg ccacaagttc cccctggaca acgacctctg catcgccgtg cagttcgggc 721 acctgcccgc caccgcgcct ccagtgacca agatctgcgc ccagtgtgag atggagcaca 781 gtgctgacgg cctcatggag cagatgtgct ccagtgactt tgtggtcaaa atgcgcatca 841 aggagatcaa gatagagaat ggggaccgga agctgattgg agcccagaaa aagaagaagc 901 tgctcaagcc gggccccctg aagcgcaagg acaccaagcg gctggtgctg cacatgaaga 961 atggcgcggg ctgcccctgc ccacagctgg acagcctggc gggcagcttc ctggtcatgg 1021 gccgcaaagt ggatggacag ctgctgctca tggccgtcta ccgctgggac aagaagaata 1081 aggagatgaa gtttgcagtc aaattcatgt tctcctaccc ctgctccctc tactaccctt 1141 tcttctacgg ggcggcagag ccccactgaa gggcactcct ccttgccctg ccagctgtgc 1201 cttgcttgcc ctctggcccc gccccaactt ccaggctgac ccggccctac tggagggtgt 1261 tttcacgaat gttgttactg gcacaaggcc taagggatgg gcacggagcc caggctgtcc 1321 tttttgaccc aggggtcctg gggtccctgg gatgttgggc ttcctctctc aggagcaggg 1381 cttcttcatc tgggtgaaga cctcagggtc tcagaaagta ggcaggggag gagagggtaa 1441 gggaaaggtg gaggggctca gggcaccctg aggcggaggt ttcagagtag aaggtgatgt 1501 cagctccagc tcccctctgt cggtggtggg gcctcacctt gaagagggaa gtctcaatat 1561 taggctaagc tatttgggaa agttctcccc accgcccctg tacgcgtcat cctagccccc 1621 cttaggaaag gagttagggt ctcagtgcct ccagccacac cccctgcctt ccccagcttg 1681 cccatttccc tgccccaagg cccagagctc cccccagact ggagagcaag cccagcccag 1741 cctcggcata gacccccttc tggtccgccc gtggctcgat tcccgggatt cattcctcag 1801 cctctgcttc tcccttttat cccaataagt tattgctact gctgtgaggc cataggtact 1861 agacaaccaa tacatgcagg gttgggtttt ctaatttttt taacttttta attaaatcaa 1921 aggtcgacgc gcggccgcgg aattcctgca gcccggggga tccccgggta ccgagctcga 1981 attc // LOCUS AF017995 1891 bp mRNA PRI 30-OCT-1997 DEFINITION Homo sapiens 3-phosphoinositide dependent protein kinase-1 (PDK1) mRNA, complete cds. ACCESSION AF017995 NID g2407612 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1891) AUTHORS Alessi,D.R., James,S.R., Downes,C.P., Holmes,A.B., Gaffney,P.R., Reese,C.B. and Cohen,P. TITLE Characterization of a 3-phosphoinositide-dependent protein kinase which phosphorylates and activates protein kinase B alpha JOURNAL Curr. Biol. 7 (4), 261-269 (1997) MEDLINE 97250749 REFERENCE 2 (bases 1 to 1891) AUTHORS Alessi,D.R., Deak,M., Casamayor,A., Caudwell,F.B., Morrice,N., Norman,D.G., Gaffney,P., MacDougall,C.N., Reese,C.B., Harbison,D., Ashworth,A. and Bownes,M. TITLE 3-phosphoinositide-dependent protein kinase-1 (PDK1): structural and functional homology with the Drosophila DSTPK61 kinase JOURNAL Curr. Biol. 7 (10), 776-789 (1997) MEDLINE 98035195 REFERENCE 3 (bases 1 to 1891) AUTHORS Deak,M., Casamayor,A., Ashworth,A. and Alessi,D.R. TITLE Direct Submission JOURNAL Submitted (09-AUG-1997) MRC Protein Phosphorylation Unit. Dept. Biochemistry, University of Dundee, Dundee DD1 4HN, Scotland (UK) FEATURES Location/Qualifiers source 1..1891 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" /map="16p13.3" /cell_line="MCF7" gene 1..1891 /gene="PDK1" CDS 81..1751 /gene="PDK1" /function="protein kinase" /note="Similar to the Drosophila protein kinase DSTPK61, encoded by GenBank Accession Number Y07908" /codon_start=1 /product="3-phosphoinositide dependent protein kinase-1" /db_xref="PID:g2407613" /translation="MARTTSQLYDAVPIQSSVVLCSCPSPSMVRTQTESSTPPGIPGG SRQGPAMDGTAAEPRPGAGSLQHAQPPPQPRKKRPEDFKFGKILGEGSFSTVVLAREL ATSREYAIKILEKRHIIKENKVPYVTRERDVMSRLDHPFFVKLYFTFQDDEKLYFGLS YAKNGELLKYIRKIGSFDETCTRFYTAEIVSALEYLHGKGIIHRDLKPENILLNEDMH IQITDFGTAKVLSPESKQARANSFVGTAQYVSPELLTEKSACKSSDLWALGCIIYQLV AGLPPFRAGNEYLIFQKIIKLEYDFPEKFFPKARDLVEKLLVLDATKRLGCEEMEGYG PLKAHPFFESVTWENLHQQTPPKLTAYLPAMSEDDEDCYGNYDNLLSQFGCMQVSSSS SSHSLSASDTGLPQRSGSNIEQYIHDLDSNSFELDLQFSEDEKRLLLEKQAGGNPWHQ FVENNLILKMGPVDKRKGLFARRRQLLLTEGPHLYYVDPVNKVLKGEIPWSQELRPEA KNFKTFFVHTPNRTYYLMDPSGNAHKWCRKIQEVWRQRYQSHPDAAVQ" BASE COUNT 475 a 509 c 502 g 405 t ORIGIN 1 ccgcttcggg gaggaggacg ctgaggaggc gccgagccgc gcagcgctgc gggggaggcg 61 cccgcgccga cgcggggccc atggccagga ccaccagcca gctgtatgac gccgtgccca 121 tccagtccag cgtggtgtta tgttcctgcc catccccatc aatggtgagg acccagactg 181 agtccagcac gccccctggc attcctggtg gcagcaggca gggccccgcc atggacggca 241 ctgcagccga gcctcggccc ggcgccggct ccctgcagca tgcccagcct ccgccgcagc 301 ctcggaagaa gcggcctgag gacttcaagt ttgggaaaat ccttggggaa ggctcttttt 361 ccacggttgt cctggctcga gaactggcaa cctccagaga atatgcgatt aaaattctgg 421 agaagcgaca tatcataaaa gagaacaagg tcccctatgt aaccagagag cgggatgtca 481 tgtcgcgcct ggatcacccc ttctttgtta agctttactt cacatttcag gacgacgaga 541 agctgtattt cggccttagt tatgccaaaa atggagaact acttaaatat attcgcaaaa 601 tcggttcatt cgatgagacc tgtacccgat tttacacggc tgagatcgtg tctgctttag 661 agtacttgca cggcaagggc atcattcaca gggaccttaa accggaaaac attttgttaa 721 atgaagatat gcacatccag atcacagatt ttggaacagc aaaagtctta tccccagaga 781 gcaaacaagc cagggccaac tcattcgtgg gaacagcgca gtacgtttct ccagagctgc 841 tcacggagaa gtccgcctgt aagagttcag acctttgggc tcttggatgc ataatatacc 901 agcttgtggc aggactccca ccattccgag ctggaaacga gtatcttata tttcagaaga 961 tcattaagtt ggaatatgac tttccagaaa aattcttccc taaggcaaga gacctcgtgg 1021 agaaactttt ggttttagat gccacaaagc ggttaggctg tgaggaaatg gaaggatacg 1081 gacctcttaa agcacacccg ttcttcgagt ccgtcacgtg ggagaacctg caccagcaga 1141 cgcctccgaa gctcaccgct tacctgccgg ctatgtcgga agacgacgag gactgctatg 1201 gcaattatga caatctcctg agccagtttg gctgcatgca ggtgtcttcg tcctcctcct 1261 cacactccct gtcagcctcc gacacgggcc tgccccagag gtcaggcagc aacatagagc 1321 agtacattca cgatctggac tcgaactcct ttgaactgga cttacagttt tccgaagatg 1381 agaagaggtt gttgttggag aagcaggctg gcggaaaccc ttggcaccag tttgtagaaa 1441 ataatttaat actaaagatg ggcccagtgg ataagcggaa gggtttattt gcaagacgac 1501 gacagctgtt gctcacagaa ggaccacatt tatattatgt ggatcctgtc aacaaagttc 1561 tgaaaggtga aattccttgg tcacaagaac ttcgaccaga ggccaagaat tttaaaactt 1621 tctttgtcca cacgcctaac aggacgtatt atctgatgga ccccagcggg aacgcacaca 1681 agtggtgcag gaagatccag gaggtttgga ggcagcgata ccagagccac ccggacgccg 1741 ctgtgcagtg acgtggcctg cggccgggct gcccttcgct gccaggacac ctgccccagc 1801 gcggcttggc cgccatccgg gacgcttcca gaccacctgc cagccatcac aaggggaacg 1861 cagaggcgga aaccttgcag catttttatt t // LOCUS AF018080 3511 bp mRNA PRI 18-SEP-1997 DEFINITION Homo sapiens PYRIN (MEFV) mRNA, complete cds. ACCESSION AF018080 NID g2407315 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3511) AUTHORS The International FMF Consortium. TITLE Ancient missense mutations in a new member of the RoRet gene family are likely to cause familial Mediterranean fever. The International FMF Consortium JOURNAL Cell 90 (4), 797-807 (1997) MEDLINE 97433089 REFERENCE 2 (bases 1 to 3511) AUTHORS Kastner,D.L. TITLE Direct Submission JOURNAL Submitted (08-AUG-1997) ARB/NIAMS, National Institutes of Health, 9000 Rockville Pike, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..3511 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" /map="16p13.3" gene 1..3511 /gene="MEFV" CDS 42..2387 /gene="MEFV" /note="mutations in this gene can cause familial mediterranean fever" /codon_start=1 /product="PYRIN" /db_xref="PID:g2407316" /translation="MAKTPSDHLLSTLEELVPYDFEKFKFKLQNTSVQKEHSRIPRSQ IQRARPVKMATLLVTYYGEEYAVQLTLQVLRAINQRLLAEELHRAAIQEYSTQENGTD DSAASSSLGENKPRSLKTPDHPEGNEGNGPRPYGGGAASLRCSQPEAGRGLSRKPLSK RREKASEGLDAQGKPRTRSPALPGGRSPGPCRALEGGQAEVRLRRNASSAGRLQGLAG GAPGQKECRPFEVYLPSGKMRPRSLEVTISTGEKAPANPEILLTLEEKTAANLDSATE PRARPTPDGGASADLKEGPGNPEHSVTGRPPDTAASPRCHAQEGDPVDGTCVRDSCSF PEAVSGHPQASGSRSPGCPRCQDSHERKSPGSLSPQPLPQCKRHLKQVQLLFCEDHDE PICLICSLSQEHQGHRVRPIEEVALEHKKKIQKQLEHLKKLRKSGEEQRSYGEEKAVS FLKQTEALKQRVQRKLEQVYYFLEQQEHFFVASLEDVGQMVGQIRKAYDTRVSQDIAL LDALIGELEAKECQSEWELLQDIGDILHRAKTVPVPEKWTTPQEIKQKIQLLHQKSEF VEKSTKYFSETLRSEMEMFNVPELIGAQAHAVNVILDAETAYPNLIFSDDLKSVRLGN KWERLPDGPQRFDSCIIVLGSPSFLSGRRYWEVEVGDKTAWILGACKTSISRKGNMTL SPENGYWVVIMMKENEYQASSVPPTRLLIKEPPKRVGIFVDYRVGSISFYNVTARSHI YTFASCSFSGPLQPIFSPGTRDGGKNTAPLTICPVGGQGPD" mutation 2081 /gene="MEFV" /standard_name="M680I" /note="not present in normal individuals" /phenotype="FMF" /replace="C" mutation 2121 /gene="MEFV" /standard_name="M694V" /note="not present in normal individuals" /phenotype="FMF" /replace="G" mutation 2218 /gene="MEFV" /standard_name="V726A" /note="not present in normal individuals" /phenotype="FMF" /replace="C" repeat_region 2845..3140 /rpt_type=dispersed /rpt_family="Alu" repeat_region 3161..3450 /rpt_type=dispersed /rpt_family="Alu" polyA_signal 3483..3488 /gene="MEFV" BASE COUNT 842 a 965 c 997 g 707 t ORIGIN 1 ggaagccaga cagctggctc gagcctctcc tgctcagcac catggctaag acccctagtg 61 accatctgct gtccaccctg gaggagctgg tgccctatga cttcgagaag ttcaagttca 121 agctgcagaa caccagtgtg cagaaggagc actccaggat cccccggagc cagatccaga 181 gagccaggcc ggtgaagatg gccactctgc tggtcaccta ctatggggaa gagtacgccg 241 tgcagctcac cctgcaggtc ctgcgggcca tcaaccagcg cctgctggcc gaggagctcc 301 acagggcagc cattcaggaa tattccacac aagaaaacgg cacagatgat tccgcagcgt 361 ccagctccct gggggagaac aagcccagga gcctgaagac tccagaccac cccgagggga 421 acgaggggaa cggccctcgg ccgtacgggg gcggagctgc cagcctgcgg tgcagccagc 481 ccgaggccgg gagggggctg tcgaggaagc ccctgagcaa acgcagagag aaggcctcgg 541 agggcctgga cgcgcagggc aagcctcgga cccggagccc ggccctgccg ggcgggagaa 601 gccccggccc ctgcagggcg ctagaggggg gccaggccga ggtccggctg cgcagaaacg 661 ccagctccgc ggggaggctg caggggctgg cggggggcgc cccggggcag aaggagtgca 721 ggcccttcga agtgtacctg ccctcgggaa agatgcgacc tagaagcctt gaggtcacca 781 tttctacagg ggagaaggcg cccgcaaatc cagaaattct cctgactcta gaggaaaaga 841 cagctgcgaa tctggactcg gcaacagaac cccgggcaag gcccactccg gatggagggg 901 catctgcgga cctgaaggaa ggccctggaa atccagaaca ttcggtcacc ggaaggccac 961 cagacacggc tgcgagtccc cgctgccacg cccaggaagg agacccagtt gacggtacct 1021 gtgtgcgtga ttcctgcagc ttccccgagg cagtttctgg gcacccccag gcctcaggca 1081 gccgctcacc tggctgcccc cggtgccagg actcccatga aaggaagagc ccgggaagcc 1141 taagccccca gcccctgcca cagtgtaagc gccacctgaa gcaggtccag ctgctcttct 1201 gtgaggatca cgatgagccc atctgcctca tctgcagtct gagtcaggag caccaaggcc 1261 accgggtgcg ccccattgag gaggtcgccc tggaacacaa gaagaaaatt cagaagcagc 1321 tggagcatct gaagaagctg agaaaatcag gggaggagca gcgatcctat ggggaggaga 1381 aggcagtgag ctttctgaaa caaactgaag cgctgaagca gcgggtgcag aggaagctgg 1441 agcaggtgta ctacttcctg gaacagcagg agcatttctt tgtggcctca ctggaggacg 1501 tgggccagat ggttgggcag atcaggaagg catatgacac ccgcgtatcc caggacatcg 1561 ccctgctcga tgcgctgatt ggggaactgg aggccaagga gtgccagtca gaatgggaac 1621 ttctgcagga cattggagac atcttgcaca gggctaagac agtgcctgtc cctgaaaagt 1681 ggaccactcc tcaagagata aaacaaaaga tccaactcct ccaccagaag tcagagtttg 1741 tggagaagag cacaaagtac ttctcagaaa ccctgcgttc agaaatggaa atgttcaatg 1801 ttccagagct gattggcgct caggcacatg ctgttaatgt gattctggat gcagaaaccg 1861 cttaccccaa cctcatcttc tctgatgatc tgaagagtgt tagacttgga aacaagtggg 1921 agaggctgcc tgatggcccg caaagatttg acagctgtat cattgttctg ggctctccga 1981 gtttcctctc tggccgccgt tactgggagg tggaggttgg agacaagaca gcatggatcc 2041 tgggagcctg caagacatcc ataagcagga aagggaacat gactctgtcg ccagagaatg 2101 gctactgggt ggtgataatg atgaaggaaa atgagtacca ggcgtccagc gttcccccga 2161 cccgcctgct aataaaggag cctcccaagc gtgtgggcat cttcgtggac tacagagttg 2221 gaagcatctc cttttacaat gtgacagcca gatcccacat ctatacattc gccagctgct 2281 ctttctctgg gccccttcaa cctatcttca gccctgggac acgtgatgga gggaagaaca 2341 cagctcctct gactatctgt ccagtgggtg gtcaggggcc tgactgaatg cccaacactg 2401 catctctctt cctgcttctg gccttgtatc ttgcattcac actcaatagt cacggaatgc 2461 cgactaggtg ctagctgcta tgggaaatgc aaaaataaca aaatagttac tgtgcccacg 2521 gagcctaccc gattatagca gaggtaagtt aggaacgaac atgttagtca atccgggtga 2581 agacatgtac tgatgacaca ccatggattt cagaggagga agtacggagt cgttgcataa 2641 tccgcccctg gtgggtggca ctctcaggtg ctcctgaaca gaagatttgg ccctcatttt 2701 ccctcagaac cccacggcaa ggatatatgt ccccttgttc tctctgcttc tgtcttgagg 2761 atatgggaag cctagagaaa cgcaagcaga ctggattggg atagaagtat ttgtgtacct 2821 ggattaatga actatgattt tttttttttt tttttgagac caaatcttgc tctgtggccc 2881 aggctggagt gcagtggcac gatctcagct cactgcaacc tccacctccc aggttcaagc 2941 gattctcctg cctcagcctc ctgagcagct gggattacag gtgcgtgcca ccacaccagg 3001 ctggttttct tgtattttta gtagagacgg gggtttcacc atgttagcca ggctggtctc 3061 gaactcctga cctcaggtga tccacccgcc tcagcctccc aaagtgctgg gattacaggc 3121 atgagccact gtgcccggcc tatgattctt tttttttttt ttttttgaga caaagttttg 3181 ctcttgtcac ccaggctgga gtgcagtggt gcaatcttgg ctcactgcaa cctccgcctc 3241 ccaggttcaa gagattctcc tgcctcagcc tccgaagtag ctgggattac aggcgcccgc 3301 caccatgccc ggctaatttt ttgcattttt agtagacatg aggtttcatc atgttggcca 3361 ggccggtctc aaactcctga cctcaggtga tgcacccacc tcagcctccc aaagtgcagg 3421 gattacaggc atgagccacc atgcctggcc atgattctta agagaattga ctgggcctca 3481 tgaataaaaa aattagaaaa tctaaaaaaa a // LOCUS AF018164 3732 bp mRNA PRI 16-OCT-1997 DEFINITION Homo sapiens kinesin-like protein 3C (KIF3C) mRNA, complete cds. ACCESSION AF018164 NID g2529574 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3732) AUTHORS Sardella,M., Navone,F., Rocchi,M., Rubartelli,A., Viggiano,L., Vignali,G., Consalez,G.G., Sitia,R. and Cabibbo,A. TITLE Kif3C, a Novel Member of the Kinesin Superfamily: Sequence, Expression and Mapping to Human Chromosome 2p23 JOURNAL Unpublished REFERENCE 2 (bases 1 to 3732) AUTHORS Sardella,M., Navone,F., Rocchi,M., Rubartelli,A., Viggiano,L., Vignali,G., Consalez,G.G., Sitia,R. and Cabibbo,A. TITLE Direct Submission JOURNAL Submitted (11-AUG-1997) Biological and Technological Research (DIBIT), San Raffaele Scientific Institute (HSR), Via Olgettina 58, Milan, MI I-20132, Italy FEATURES Location/Qualifiers source 1..3732 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="2" /map="2p23" /clone="KINC2" /cell_type="mature hNT neuron" /clone_lib="Stratagene Catalog number 937233" gene <1..3732 /gene="KIF3C" 5'UTR <1..154 /gene="KIF3C" CDS 155..2533 /gene="KIF3C" /codon_start=1 /product="kinesin-like protein 3C" /db_xref="PID:g2529575" /translation="MASKTKASEALKVVARCRPLSRKEEAAGHEQILTMDVKLGQVTL RNPRAAPGELPKTFTFDAVYDASSKQADLYDEREPLIDSVLQGFNGTVFAYGQTGTGK TYTMQGDLVEPELRGVIPNAFEHIFTHISRSQNQQYLVRASYLEIYQEEIRDLLSKEP GRRLELKENPETGVYIKDLSSFVTKNVKEIEHVMNLGNQTRAVGSTHMNEVSSRSHAI FIITVECSERGSDGQDHIRVGKLNLVDLAGSERQNKAGPNTAGGAATPSSGGGGGGGG SGGGAGGERPKEASKINLSLSALGNVIAALAGNRSTHIPYRDSKLTRLLQDSLGGNAK TIMVATLGPASHSYDESLSTLRFANRAKNIKNKPRVNEDPKDTLLREFQEEIARLKAQ LEKRGMLGKRPRRKSSRRKKAVSAPPGYPEGPVIEAWVAEEEDDNNNNHRPPQPILES ALEKNMENYLQEQKERLEEDKAAIQDDRSLVSEEKQKLLEEKEKMLEDLRREQQATEL LAAKYKAMESKLLIGGRNIMDHTNEQQKMLELKRQEIAEQKRREREMQQEMMLRDEET MELRGTYTSLQQEVEVKTKKLKKLYAKLQAVKAEIQDQHDEYIRVRQDLEEAQNEQTR ELKLGYLIIENFIPPEEKNKIMNRLFLDCEEEQWKFQPLVPAGVSSSQMKKRPTSAVG YKRPISQYARVAMAMGSHPRYRAENIMFLELDVSPPAVFEMEFSHDQEQDPRALHMER LMRLDSFLERPSTSKVRKSRSWCQSPQRPPPSTTHASLASASLRPATVADHE" misc_feature 440..463 /gene="KIF3C" /note="encodes ATP/GTP-binding site motif A (P-loop)" misc_feature 869..904 /gene="KIF3C" /note="encodes kinesin motor domain signature" BASE COUNT 871 a 1076 c 1062 g 722 t 1 others ORIGIN 1 cctcccaggc gtccccaccc taggaggctg catgcggatt gaagacgtgn gcctgggggc 61 tgggccggcc ccgctgatcc cgacctagcg agcaggatag caggaccgcc caggctgcgg 121 aggggctcgg gggcaggaag gtcagagcag caagatggcc agtaagacca aggccagcga 181 ggccctcaag gtggtggccc ggtgccgccc cctcagcagg aaggaggagg ctgctggtca 241 cgagcagatc ctgaccatgg acgtgaaact gggccaggtg accctgcgga acccccgcgc 301 cgccccgggg gagctgccca agaccttcac ctttgacgcc gtgtatgatg ccagctccaa 361 gcaggccgac ctgtatgacg aacgtgagcc cctgatagac tccgtgctcc agggtttcaa 421 tggcacggtg tttgcctatg gccagacggg cactggcaag acctatacca tgcaggggga 481 cctggtggag cccgagctgc gcggggtcat cccgaatgcc tttgagcaca tcttcaccca 541 catctcccgc tcccagaacc aacagtacct ggtccgggcc tcctatttgg agatctacca 601 ggaagagatt cgagacctgc tctccaagga gccgggcagg aggctagagc tgaaagagaa 661 ccccgagact ggcgtctaca tcaaggacct ctcctccttc gtcaccaaga atgtcaagga 721 gattgagcat gtgatgaacc tggggaacca gacccgggct gtgggcagca cccacatgaa 781 tgaggtcagc tcccgctccc atgccatctt catcatcact gtggagtgca gcgaacgtgg 841 ctctgatggc caggaccaca tccgagtggg caagctcaac ctcgtggacc tggctggcag 901 cgagaggcag aacaaggcag gccccaacac agcgggaggg gcagccacac catcctcggg 961 tggcggtggt ggcggtggag gcagtggtgg tggtgctggt ggagagaggc ctaaggaagc 1021 ctccaaaatc aacctctcat tatctgccct gggcaacgtg attgctgccc tggcgggcaa 1081 caggagcacc cacattccct accgggactc caagctgacc cggctgctcc aggactccct 1141 gggggggaat gccaagacca tcatggtagc cacactgggg ccagcttctc acagctacga 1201 tgagagcctc tccaccttgc gctttgccaa ccgagccaag aacatcaaga acaagccccg 1261 ggtgaacgag gaccccaagg acacactgct gcgggaattc caagaggaga ttgcccgcct 1321 gaaggcccag ctggagaaga gggggatgct ggggaagcgg ccccggagga agagcagccg 1381 caggaagaag gccgtgtccg ccccgcctgg gtaccctgag ggcccagtga ttgaggcctg 1441 ggtggcagaa gaggaggatg acaacaacaa caaccaccgc ccgccccagc ccatcctgga 1501 gtcagccttg gagaagaaca tggagaatta cctgcaggaa cagaaggagc ggctggagga 1561 ggataaggca gccatccagg atgaccgcag cctggtgagc gaggagaagc agaagctgct 1621 ggaggagaag gagaagatgc tggaggacct gcggcgggaa cagcaggcca cagagctgct 1681 tgcggccaag tacaaggcca tggagagcaa gctcctcatc gggggcagga acatcatgga 1741 tcacaccaac gaacagcaga agatgttgga actgaagagg caggagattg ccgagcagaa 1801 acgtcgtgag cgggagatgc agcaggagat gatgctccgg gacgaggaga ctatggagct 1861 ccggggcacc tacacatccc tgcagcagga ggtggaggtc aaaaccaaga aactcaagaa 1921 gctctacgcc aagctgcagg cggtgaaggc ggagatccag gaccagcatg atgagtatat 1981 ccgcgtgcgg caggacctgg aggaggcgca gaacgagcag acccgcgaac tcaagctcgg 2041 gtacctaatc atcgagaact tcatcccgcc ggaggagaag aacaagatca tgaaccggct 2101 tttcctggac tgtgaggagg agcagtggaa gttccagcca ctggtgccag ccggcgtcag 2161 tagcagccag atgaagaagc ggccaacatc tgcagtgggc tacaagaggc ctatcagcca 2221 gtatgctcgg gttgccatgg caatggggtc ccaccccagg tacagggctg aaaacataat 2281 gtttctggag ttggatgtgt cccctccagc tgtctttgag atggaattct ctcacgacca 2341 agaacaagac cctcgtgcgc tacacatgga gaggctcatg cgattggaca gctttctgga 2401 aagaccttcc acgtctaaag tccgaaagtc cagatcctgg tgccagagtc ctcagcggcc 2461 tccaccttcc accacacatg cctccctggc ctctgcttct ctgcgccctg caacagtggc 2521 ggaccatgag tgacaaccat cacgtcaggc tgcccatcca atagactcct gggatggggc 2581 agccaaccct ggctcatctc atctgccgct tggtgcgtgt gcgtgtgcgt gcatgtgcgt 2641 gtgcgtgtgt gcaggggtga gaatctggca gatggtgcct ctgcctgctc ttcttcgcct 2701 cctttattta attcatgtta tttattcgcg gagctcagtt cgtgttgggg agatgccctc 2761 gcctgagccg tctgggccta ccgtggtcac tgcgtagctc tttttcttct gacttgagag 2821 ctcccccagt cagatctcag gcttgtcccc ctgtcagctg cctccagaag ggaaggtagc 2881 cagtgcctga gaagacagtc ccttttctac ccaccgcact ccataacctc catcttctcc 2941 cacactgatg gcgagcagcc cctgagcact ttctgggact gggagactgc ttggtgttcc 3001 ctgaggacaa gagacatcct gacagtgttg ggcatctgct accccgtgga cacagcccca 3061 ctctccactt tctgagcctc agacaacctc attcagcctc ttgggctcct tttcaaggac 3121 attaataacc tcaccaacat agctcatgcc cttcagcttt gacaagaact cacggcttcc 3181 caaactctgc tttctgccca ccttggatgg gaactgtgga ccaagcaatt accatcgcct 3241 tggaacctgc aggaaatgga acagcaattg agacaacttg aacagtcatc aacggaagtc 3301 cctccactgg attcctttgt ttctgtcccc tccgaggagt cattttggtc gacaggctct 3361 caaggcaact ccccattttc aagaggctgc tcctgcctgc ttcgatcatt tctccctgca 3421 gctgcctaga ccccgttcac agtgggagga gtcaatgtca ttctacccct cgctaaacga 3481 agatattaac atctattgct ttttcccttc atctgtcaca ggaaacagaa gcccaggcac 3541 aatcttttcc agctttgcct gttacccctg tttctgattg catctttaag gtattatttt 3601 gttgacaata gatcctttat tcactagtta cgcaaattgg ttcctagggg gatactcctt 3661 accttccttt gtgatggccc aaaatgtctc taggtatctc aagtgataag taaatttcta 3721 caaaaaaaaa aa // LOCUS AF018253 3136 bp mRNA PRI 22-NOV-1997 DEFINITION Homo sapiens receptor activator of nuclear factor-kappa B (RANK) mRNA, complete cds. ACCESSION AF018253 NID g2612917 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3136) AUTHORS Anderson,D.M., Maraskovsky,E., Billingsley,W.L., Dougall,W.C., Tometsko,M.E., Roux,E.R., Teepe,M.C., DuBose,R.F., Cosman,D. and Galibert,L. TITLE A homologue of the TNF receptor and its ligand enhance T-cell growth and dendritic-cell function JOURNAL Nature 390 (6656), 175-179 (1997) MEDLINE 98032977 REFERENCE 2 (bases 1 to 3136) AUTHORS Anderson,D.M., Billingsley,W., Dougall,W., Maraskovsky,E., Cosman,D., DuBose,R. and Galibert,L. TITLE Direct Submission JOURNAL Submitted (11-AUG-1997) Molecular Biology, Immunex Corp., 51 University St., Seattle, WA 98101, USA FEATURES Location/Qualifiers source 1..3136 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="18" /map="18q22.1" gene 1..3136 /gene="RANK" CDS 39..1889 /gene="RANK" /codon_start=1 /product="receptor activator of nuclear factor-kappa B" /db_xref="PID:g2612918" /translation="MAPRARRRRPLFALLLLCALLARLQVALQIAPPCTSEKHYEHLG RCCNKCEPGKYMSSKCTTTSDSVCLPCGPDEYLDSWNEEDKCLLHKVCDTGKALVAVV AGNSTTPRRCACTAGYHWSQDCECCRRNTECAPGLGAQHPLQLNKDTVCKPCLAGYFS DAFSSTDKCRPWTNCTFLGKRVEHHGTEKSDAVCSSSLPARKPPNEPHVYLPGLIILL LFASVALVAAIIFGVCYRKKGKALTANLWHWINEACGRLSGDKESSGDSCVSTHTANF GQQGACEGVLLLTLEEKTFPEDMCYPDQGGVCQGTCVGGGPYAQGEDARMLSLVSKTE IEEDSFRQMPTEDEYMDRPSQPTDQLLFLTEPGSKSTPPFSEPLEVGENDSLSQCFTG TQSTVGSESCNCTEPLCRTDWTPMSSENYLQKEVDSGHCPHWAASPSPNWADVCTGCR NPPGEDCEPLVGSPKRGPLPQCAYGMGLPPEEEASRTEARDQPEDGADGRLPSSARAG AGSGSSPGGQSPASGNVTGNSNSTFISSGQVMNFKGDIIVVYVSQTSQEGAAAAAEPM GRPVQEETLARRDSFAGNGPRFPDPCGGPEGLREPEKASRPVQEQGGAKA" BASE COUNT 696 a 879 c 859 g 702 t ORIGIN 1 ccgctgaggc cgcggcgccc gccagcctgt cccgcgccat ggccccgcgc gcccggcggc 61 gccgcccgct gttcgcgctg ctgctgctct gcgcgctgct cgcccggctg caggtggctt 121 tgcagatcgc tcctccatgt accagtgaga agcattatga gcatctggga cggtgctgta 181 acaaatgtga accaggaaag tacatgtctt ctaaatgcac tactacctct gacagtgtat 241 gtctgccctg tggcccggat gaatacttgg atagctggaa tgaagaagat aaatgcttgc 301 tgcataaagt ttgtgataca ggcaaggccc tggtggccgt ggtcgccggc aacagcacga 361 ccccccggcg ctgcgcgtgc acggctgggt accactggag ccaggactgc gagtgctgcc 421 gccgcaacac cgagtgcgcg ccgggcctgg gcgcccagca cccgttgcag ctcaacaagg 481 acacagtgtg caaaccttgc cttgcaggct acttctctga tgccttttcc tccacggaca 541 aatgcagacc ctggaccaac tgtaccttcc ttggaaagag agtagaacat catgggacag 601 agaaatccga tgcggtttgc agttcttctc tgccagctag aaaaccacca aatgaacccc 661 atgtttactt gcccggttta ataattctgc ttctcttcgc gtctgtggcc ctggtggctg 721 ccatcatctt tggcgtttgc tataggaaaa aagggaaagc actcacagct aatttgtggc 781 actggatcaa tgaggcttgt ggccgcctaa gtggagataa ggagtcctca ggtgacagtt 841 gtgtcagtac acacacggca aactttggtc agcagggagc atgtgaaggt gtcttactgc 901 tgactctgga ggagaagaca tttccagaag atatgtgcta cccagatcaa ggtggtgtct 961 gtcagggcac gtgtgtagga ggtggtccct acgcacaagg cgaagatgcc aggatgctct 1021 cattggtcag caagaccgag atagaggaag acagcttcag acagatgccc acagaagatg 1081 aatacatgga caggccctcc cagcccacag accagttact gttcctcact gagcctggaa 1141 gcaaatccac acctcctttc tctgaacccc tggaggtggg ggagaatgac agtttaagcc 1201 agtgcttcac ggggacacag agcacagtgg gttcagaaag ctgcaactgc actgagcccc 1261 tgtgcaggac tgattggact cccatgtcct ctgaaaacta cttgcaaaaa gaggtggaca 1321 gtggccattg cccgcactgg gcagccagcc ccagccccaa ctgggcagat gtctgcacag 1381 gctgccggaa ccctcctggg gaggactgtg aacccctcgt gggttcccca aaacgtggac 1441 ccttgcccca gtgcgcctat ggcatgggcc ttccccctga agaagaagcc agcaggacgg 1501 aggccagaga ccagcccgag gatggggctg atgggaggct cccaagctca gcgagggcag 1561 gtgccgggtc tggaagctcc cctggtggcc agtcccctgc atctggaaat gtgactggaa 1621 acagtaactc cacgttcatc tccagcgggc aggtgatgaa cttcaagggc gacatcatcg 1681 tggtctacgt cagccagacc tcgcaggagg gcgcggcggc ggctgcggag cccatgggcc 1741 gcccggtgca ggaggagacc ctggcgcgcc gagactcctt cgcggggaac ggcccgcgct 1801 tcccggaccc gtgcggcggc cccgaggggc tgcgggagcc ggagaaggcc tcgaggccgg 1861 tgcaggagca aggcggggcc aaggcttgag cgccccccat ggctgggagc ccgaagctcg 1921 gagccagggc tcgcgagggc agcaccgcag cctctgcccc agccccggcc acccagggat 1981 cgatcggtac agtcgaggaa gaccacccgg cattctctgc ccactttgcc ttccaggaaa 2041 tgggcttttc aggaagtgaa ttgatgagga ctgtccccat gcccacggat gctcagcagc 2101 ccgccgcact ggggcagatg tctcccctgc cactcctcaa actcgcagca gtaatttgtg 2161 gcactatgac agctattttt atgactatcc tgttctgtgg ggggggggtc tatgttttcc 2221 ccccatattt gtattccttt tcataacttt tcttgatatc tttcctccct cttttttaat 2281 gtaaaggttt tctcaaaaat tctcctaaag gtgagggtct ctttcttttc tcttttcctt 2341 ttttttttct ttttttggca acctggctct ggcccaggct agagtgcagt ggtgcgatta 2401 tagcccggtg cagcctctaa ctcctgggct caagcaatcc aagtgatcct cccacctcaa 2461 ccttcggagt agctgggatc acagctgcag gccacgccca gcttcctccc cccgactccc 2521 cccccccaga gacacggtcc caccatgtta cccagcctgg tctcaaactc cccagctaaa 2581 gcagtcctcc agcctcggcc tcccaaagta ctgggattac aggcgtgagc ccccacgctg 2641 gcctgcttta cgtattttct tttgtgcccc tgctcacagt gttttagaga tggctttccc 2701 agtgtgtgtt cattgtaaac acttttggga aagggctaaa catgtgaggc ctggagatag 2761 ttgctaagtt gctaggaaca tgtggtggga ctttcatatt ctgaaaaatg ttctatattc 2821 tcatttttct aaaagaaaga aaaaaggaaa cccgatttat ttctcctgaa tctttttaag 2881 tttgtgtcgt tccttaagca gaactaagct cagtatgtga ccttacccgc taggtggtta 2941 atttatccat gctggcagag gcactcaggt acttggtaag caaatttcta aaactccaag 3001 ttgctgcagc ttggcattct tcttattcta gaggtctctc tggaaaagat ggagaaaatg 3061 aacaggacat ggggctcctg gaaagaaagg gcccgggaag ttcaaggaag aataaagttg 3121 aaattttaaa aaaaaa // LOCUS AF018956 2772 bp mRNA PRI 18-SEP-1997 DEFINITION Homo sapiens neuropilin mRNA, complete cds. ACCESSION AF018956 NID g2407640 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2772) AUTHORS He,Z. and Tessier-Lavigne,M. TITLE Neuropilin is a receptor for the axonal chemorepellent Semaphorin III JOURNAL Cell 90 (4), 739-751 (1997) MEDLINE 97433084 REFERENCE 2 (bases 1 to 2772) AUTHORS He,Z. and Tessier-Lavigne,M. TITLE Direct Submission JOURNAL Submitted (11-AUG-1997) Howard Hughes Medical Institute, University of California, San Francisco, 513 Parnassus Avenue, HSE-201, San Francisco, CA 94143, USA FEATURES Location/Qualifiers source 1..2772 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1..2772 /codon_start=1 /product="neuropilin" /db_xref="PID:g2407641" /translation="MERGLPLLCAVLALVLAPAGAFRNDECGDTIKIESPGYLTSPGY PHSYHPSEKCEWLIQAPDPYQRIMINFNPHFDLEDRDCKYDYVEVFDGENENGHFRGK FCGKIAPPPVVSSGPFLFIKFVSDYETHGAGFSIRYEIFKRGPECSQNYTTPSGVIKS PGFPEKYPNSLECTYIVFAPKMSEIILEFESFDLEPDSNPPGGMFCRYDRLEIWDGFP DVGPHIGRYCGQKTPGRIRSSSGILSMVFYTDSAIAKEGFSANYSVLQSSVSEDFKCM EALGMESGEIHSDQITASSQYSTNWSAERSRLNYPENGWTPGEDSYREWIQVDLGLLR FVTAVGTQGAISKETKKKYYVKTYKIDVSSNGEDWITIKEGNKPVLFQGNTNPTDVVV AVFPKPLITRFVRIKPATWETGISMRFEVYGCKITDYPCSGMLGMVSGLISDSQITSS NQGDRNWMPENIRLVTSRSGWALPPAPHSYINEWLQIDLGEEKIVRGIIIQGGKHREN KVFMRKFKIGYSNNGSDWKMIMDDSKRKAKSFEGNNNYDTPELRTFPALSTRFIRIYP ERATHGGLGLRMELLGCEVEAPTAGPTTPNGNLVDECDDDQANCHSGTGDDFQLTGGT TVLATEKPTVIDSTIQSEFPTYGFNCEFGWGSHKTFCHWEHDNHVQLKWSVLTSKTGP IQDHTGDGNFIYSQADENQKGKVARLVSPVVYSQNSAHCMTFWYHMSGSHVGTLRVKL RYQKPEEYDQLVWMAIGHQGDHWKEGRVLLHKSLKLYQVIFEGEIGKGNLGGIAVDDI SINNHISQEDCAKPADLDKKNPEIKIDETGSTPGYEGEGEGDKNISRKPGNVLKTLEP ILITIIAMSALGVLLGAVCGVVLYCACWHNGMSERNLSALENYNFELVDGVKLKKDKL NTQSTYSEA" BASE COUNT 772 a 664 c 702 g 634 t ORIGIN 1 atggagaggg ggctgccgct cctctgcgcc gtgctcgccc tcgtcctcgc cccggccggc 61 gcttttcgca acgatgaatg tggcgatact ataaaaattg aaagccccgg gtaccttaca 121 tctcctggtt atcctcattc ttatcaccca agtgaaaaat gcgaatggct gattcaggct 181 ccggacccat accagagaat tatgatcaac ttcaaccctc acttcgattt ggaggacaga 241 gactgcaagt atgactacgt ggaagtcttc gatggagaaa atgaaaatgg acattttagg 301 ggaaagttct gtggaaagat agcccctcct cctgttgtgt cttcagggcc atttcttttt 361 atcaaatttg tctctgacta cgaaacacat ggtgcaggat tttccatacg ttatgaaatt 421 ttcaagagag gtcctgaatg ttcccagaac tacacaacac ctagtggagt gataaagtcc 481 cccggattcc ctgaaaaata tcccaacagc cttgaatgca cttatattgt ctttgcgcca 541 aagatgtcag agattatcct ggaatttgaa agctttgacc tggagcctga ctcaaatcct 601 ccagggggga tgttctgtcg ctacgaccgg ctagaaatct gggatggatt ccctgatgtt 661 ggccctcaca ttgggcgtta ctgtggacag aaaacaccag gtcgaatccg atcctcatcg 721 ggcattctct ccatggtttt ttacaccgac agcgcgatag caaaagaagg tttctcagca 781 aactacagtg tcttgcagag cagtgtctca gaagatttca aatgtatgga agctctgggc 841 atggaatcag gagaaattca ttctgaccag atcacagctt cttcccagta tagcaccaac 901 tggtctgcag agcgctcccg cctgaactac cctgagaatg ggtggactcc cggagaggat 961 tcctaccgag agtggataca ggtagacttg ggccttctgc gctttgtcac ggctgtcggg 1021 acacagggcg ccatttcaaa agaaaccaag aagaaatatt atgtcaagac ttacaagatc 1081 gacgttagct ccaacgggga agactggatc accataaaag aaggaaacaa acctgttctc 1141 tttcagggaa acaccaaccc cacagatgtt gtggttgcag tattccccaa accactgata 1201 actcgatttg tccgaatcaa gcctgcaact tgggaaactg gcatatctat gagatttgaa 1261 gtatacggtt gcaagataac agattatcct tgctctggaa tgttgggtat ggtgtctgga 1321 cttatttctg actcccagat cacatcatcc aaccaaggag acagaaactg gatgcctgaa 1381 aacatccgcc tggtaaccag tcgctctggc tgggcacttc cacccgcacc tcattcctac 1441 atcaatgagt ggctccaaat agacctgggg gaggagaaga tcgtgagggg catcatcatt 1501 cagggtggga agcaccgaga gaacaaggtg ttcatgagga agttcaagat cgggtacagc 1561 aacaacggct cggactggaa gatgatcatg gatgacagca aacgcaaggc gaagtctttt 1621 gagggcaaca acaactatga tacacctgag ctgcggactt ttccagctct ctccacgcga 1681 ttcatcagga tctaccccga gagagccact catggcggac tggggctcag aatggagctg 1741 ctgggctgtg aagtggaagc ccctacagct ggaccgacca ctcccaacgg gaacttggtg 1801 gatgaatgtg atgacgacca ggccaactgc cacagtggaa caggtgatga cttccagctc 1861 acaggtggca ccactgtgct ggccacagaa aagcccacgg tcatagacag caccatacaa 1921 tcagagtttc caacatatgg ttttaactgt gaatttggct ggggctctca caagaccttc 1981 tgccactggg aacatgacaa tcacgtgcag ctcaagtgga gtgtgttgac cagcaagacg 2041 ggacccattc aggatcacac aggagatggc aacttcatct attcccaagc tgacgaaaat 2101 cagaagggca aagtggctcg cctggtgagc cctgtggttt attcccagaa ctctgcccac 2161 tgcatgacct tctggtatca catgtctggg tcccacgtcg gcacactcag ggtcaaactg 2221 cgctaccaga agccagagga gtacgatcag ctggtctgga tggccattgg acaccaaggt 2281 gaccactgga aggaagggcg tgtcttgctc cacaagtctc tgaaacttta tcaggtgatt 2341 ttcgagggcg aaatcggaaa aggaaacctt ggtgggattg ctgtggatga cattagtatt 2401 aataaccaca tttcacaaga agattgtgca aaaccagcag acctggataa aaagaaccca 2461 gaaattaaaa ttgatgaaac agggagcacg ccaggatacg aaggtgaagg agaaggtgac 2521 aagaacatct ccaggaagcc aggcaatgtg ttgaagacct tagaacccat cctcatcacc 2581 atcatagcca tgagcgccct gggggtcctc ctgggggctg tctgtggggt cgtgctgtac 2641 tgtgcctgtt ggcataatgg gatgtcagaa agaaacttgt ctgccctgga gaactataac 2701 tttgaacttg tggatggtgt gaagttgaaa aaagacaaac tgaatacaca gagtacttat 2761 tcggaggcat ga // LOCUS AF019047 2201 bp mRNA PRI 22-NOV-1997 DEFINITION Homo sapiens receptor activator of nuclear factor kappa B ligand (RANKL) mRNA, complete cds. ACCESSION AF019047 NID g2612921 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2201) AUTHORS Anderson,D.M., Maraskovsky,E., Billingsley,W.L., Dougall,W.C., Tometsko,M.E., Roux,E.R., Teepe,M.C., DuBose,R.F., Cosman,D. and Galibert,L. TITLE A homologue of the TNF receptor and its ligand enhance T-cell growth and dendritic-cell function JOURNAL Nature 390 (6656), 175-179 (1997) MEDLINE 98032977 REFERENCE 2 (bases 1 to 2201) AUTHORS Anderson,D.M., Billingsley,W., Dougall,W., Maraskovsky,E., Cosman,D., DuBose,R. and Galibert,L. TITLE Direct Submission JOURNAL Submitted (13-AUG-1997) Molecular Biology, Immunex Corp., 51 University St., Seattle, WA 98101, USA FEATURES Location/Qualifiers source 1..2201 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="13" /map="13q14" gene 1..2201 /gene="RANKL" CDS 129..1082 /gene="RANKL" /note="receptor activator of nuclear factor kappa B ligand" /codon_start=1 /product="RANKL" /db_xref="PID:g2612922" /translation="MRRASRDYTKYLRGSEEMGGGPGAPHEGPLHAPPPPAPHQPPAA SRSMFVALLGLGLGQVVCSVALFFYFRAQMDPNRISEDGTHCIYRILRLHENADFQDT TLESQDTKLIPDSCRRIKQAFQGAVQKELQHIVGSQHIRAEKAMVDGSWLDLAKRSKL EAQPFAHLTINATDIPSGSHKVSLSSWYHDRGWAKISNMTFSNGKLIVNQDGFYYLYA NICFRHHETSGDLATEYLQLMVYVTKTSIKIPSSHTLMKGGSTKYWSGNSEFHFYSIN VGGFFKLRSGEEISIEVSNPSLLDPDQDATYFGAFKVRDID" BASE COUNT 658 a 429 c 497 g 617 t ORIGIN 1 ggccaaagcc gggctccaag tcggcgcccc acgtcgaggc tccgccgcag cctccggagt 61 tggccgcaga caagaagggg agggagcggg agagggagga gagctccgaa gcgagagggc 121 cgagcgccat gcgccgcgcc agcagagact acaccaagta cctgcgtggc tcggaggaga 181 tgggcggcgg ccccggagcc ccgcacgagg gccccctgca cgccccgccg ccgcctgcgc 241 cgcaccagcc ccccgccgcc tcccgctcca tgttcgtggc cctcctgggg ctggggctgg 301 gccaggttgt ctgcagcgtc gccctgttct tctatttcag agcgcagatg gatcctaata 361 gaatatcaga agatggcact cactgcattt atagaatttt gagactccat gaaaatgcag 421 attttcaaga cacaactctg gagagtcaag atacaaaatt aatacctgat tcatgtagga 481 gaattaaaca ggcctttcaa ggagctgtgc aaaaggaatt acaacatatc gttggatcac 541 agcacatcag agcagagaaa gcgatggtgg atggctcatg gttagatctg gccaagagga 601 gcaagcttga agctcagcct tttgctcatc tcactattaa tgccaccgac atcccatctg 661 gttcccataa agtgagtctg tcctcttggt accatgatcg gggttgggcc aagatctcca 721 acatgacttt tagcaatgga aaactaatag ttaatcagga tggcttttat tacctgtatg 781 ccaacatttg ctttcgacat catgaaactt caggagacct agctacagag tatcttcaac 841 taatggtgta cgtcactaaa accagcatca aaatcccaag ttctcatacc ctgatgaaag 901 gaggaagcac caagtattgg tcagggaatt ctgaattcca tttttattcc ataaacgttg 961 gtggattttt taagttacgg tctggagagg aaatcagcat cgaggtctcc aacccctcct 1021 tactggatcc ggatcaggat gcaacatact ttggggcttt taaagttcga gatatagatt 1081 gagccccagt ttttggagtg ttatgtattt cctggatgtt tggaaacatt ttttaaaaca 1141 agccaagaaa gatgtatata ggtgtgtgag actactaaga ggcatggccc caacggtaca 1201 cgactcagta tccatgctct tgaccttgta gagaacacgc gtatttacct gccagtggga 1261 gatgttagac tcatggtgtg ttacacaatg gtttttaaat tttgtaatga attcctagaa 1321 ttaaaccaga ttggagcaat tacgggttga ccttatgaga aactgcatgt gggctatggg 1381 aggggttggt ccctggtcat gtgccccttc gcagctgaag tggagagggt gtcatctagc 1441 gcaattgaag gatcatctga aggggcaaat tcttttgaat tgttacatca tgctggaacc 1501 tgcaaaaaat actttttcta atgaggagag aaaatatatg tatttttata taatatctaa 1561 agttatattt cagatgtaat gttttctttg caaagtattg taaattatat ttgtgctata 1621 gtatttgatt caaaatattt aaaaatgtct tgctgttgac atatttaatg ttttaaatgt 1681 acagacatat ttaactggtg cactttgtaa attccctggg gaaaacttgc agctaaggag 1741 gggaaaaaaa tgttgtttcc taatatcaaa tgcagtatat ttcttcgttc tttttaagtt 1801 aatagatttt ttcagacttg tcaagcctgt gcaaaaaaat taaaatggat gccttgaata 1861 ataagcagga tgttggccac caggtgcctt tcaaatttag aaactaattg actttagaaa 1921 gctgacattg ccaaaaagga tacataatgg gccactgaaa tttgtcaaga gtagttatat 1981 aattgttgaa caggtgtttt tccacaagtg ccgcaaattg tacctttttt tttttttcaa 2041 aatagaaaag ttattagtgg tttatcagca aaaaagtcca attttaattt agtaaatgtt 2101 attttatact gtacaataaa aacattgcct ttgaatgtta attttttggt acaaaaataa 2161 atttatatga aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa a // LOCUS AF019225 1279 bp mRNA PRI 17-OCT-1997 DEFINITION Homo sapiens apolipoprotein L mRNA, complete cds. ACCESSION AF019225 NID g2425057 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1279) AUTHORS Duchateau,P.N., Pullinger,C.R., Orellana,R.E., Kunitake,S.T., Naya-Vigne,J., O'Connor,P.M., Malloy,M.J. and Kane,J.P. TITLE Apolipoprotein L, a new human high density lipoprotein apolipoprotein expressed by the pancreas. Identification, cloning, characterization, and plasma distribution of apolipoprotein l JOURNAL J. Biol. Chem. 272 (41), 25576-25582 (1997) MEDLINE 97467346 REFERENCE 2 (bases 1 to 1279) AUTHORS Duchateau,P.N., Pullinger,C.R., Orellana,R.E., Kunitake,S.T., Naya-Vigne,J., O'connor,P.M., Malloy,M.J. and Kane,J.P. TITLE Direct Submission JOURNAL Submitted (13-AUG-1997) Cardiovascular Research Institute, University of California, San Francisco, 505 Parnassus Avenue, San Francisco, CA 94143-0130, USA FEATURES Location/Qualifiers source 1..1279 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="22" /tissue_type="pancreas" CDS 1..1152 /codon_start=1 /product="apolipoprotein L" /db_xref="PID:g2425058" /translation="MSALFLGVRVRAEEAGARVQQNVPSGTDTGDPQSKPLGDWAAGT MDPESSIFIEDAIKYFKEKVSTQNLLLLLTDNEAWNGFVAAAELPRNEADELRKALDN LARQMIMKDKNWHDKGQQYRNWFLKEFPRLKSKLEDNTRRLRALADGVQKVHKGTTIA NVVSGSLSISSGILTLVGMGLAPFTEGGSLVLLEPGMELGITAALTGITSSTIDYGKK WWTQAQAHDLVIKSLDKLKEVKEFLGENISNFLSLAGNTYQLTRGIGKDIRALRRARA NLQSVPHASASRPRVTEPISAESGEQVERVNEPSILEMSRGVKLTDVAPVSFFLALDV VYLVYESKHLHEGAKSETAEELKKVAQELEEKLNILNNNYKILQADQEL" BASE COUNT 352 a 305 c 352 g 270 t ORIGIN 1 atgagtgcac ttttccttgg tgtgagagtg agggcagagg aagctggagc gagggtgcaa 61 caaaacgttc caagtgggac agatactgga gatcctcaaa gtaagcccct cggtgactgg 121 gctgctggca ccatggaccc agagagcagt atctttattg aggatgccat taagtatttc 181 aaggaaaaag tgagcacaca gaatctgcta ctcctgctga ctgataatga ggcctggaac 241 ggattcgtgg ctgctgctga actgcccagg aatgaggcag atgagctccg taaagctctg 301 gacaaccttg caagacaaat gatcatgaag gacaaaaact ggcacgataa aggccagcag 361 tacagaaact ggtttctgaa agagtttcct cggttgaaaa gtaagcttga ggataacaca 421 agaaggctcc gtgcccttgc ggatggggtt cagaaggtcc acaaaggcac caccatcgcc 481 aatgtggtgt ctggctctct cagcatttcc tctggcatcc tgaccctcgt cggcatgggt 541 ctggcaccct tcacagaggg aggcagcctt gtactcttgg aacctgggat ggagttggga 601 atcacagcag ctttgaccgg gattaccagc agtaccatag actacggaaa gaagtggtgg 661 acacaagccc aagcccacga cttggtcatc aaaagccttg acaaattgaa ggaggtgaag 721 gagtttttgg gtgagaacat atccaacttt ctttccttag ctggcaatac ttaccaactc 781 acacgaggca ttgggaagga catccgtgcc ctcagacgag ccagagccaa tcttcagtca 841 gtaccgcatg cctcagcctc acgcccccgg gtcactgagc caatctcagc tgaaagcggt 901 gaacaggtgg agagagttaa tgaacccagc atcctggaaa tgagcagagg agtcaagctc 961 acggatgtgg cccctgtaag cttctttctt gcgctggatg tagtctacct cgtgtacgaa 1021 tcaaagcact tacatgaggg ggcaaagtca gagacagctg aggagctgaa gaaggtggct 1081 caggagctgg aggagaagct aaacattctc aacaataatt ataagattct gcaggcggac 1141 caagaactgt gaccacaggg cagggcagcc accaggagag atatgcctgg caggggccag 1201 gacaaaatgc aaactttttt ttttttctga gacagagtct tgctctgtcg ccaagttgca 1261 ccctcgctcc agcttcctc // LOCUS AF019386 1305 bp mRNA PRI 15-NOV-1997 DEFINITION Homo sapiens heparan sulfate 3-O-sulfotransferase-1 precursor (3OST1) mRNA, complete cds. ACCESSION AF019386 NID g2618972 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1305) AUTHORS Shworak,N.W., Liu,J., Fritze,L.M.S., Schwartz,J.J., Zhang,L., Logeart,D. and Rosenberg,R.D. TITLE Molecular cloning and expression of mouse and human cDNAs encoding heparan sulfate D-glucosaminyl 3-O-sulfotransferase JOURNAL J. Biol. Chem. 272 (44), 28008-28019 (1997) MEDLINE 98010647 REFERENCE 2 (bases 1 to 1305) AUTHORS Shworak,N.W., Liu,J., Fritze,L.M.S., Schwartz,J.J., Zhang,L., Logeart,D. and Rosenberg,R.D. TITLE Direct Submission JOURNAL Submitted (14-AUG-1997) Biology, MIT, 31 Ames Street, Cambridge, MA 02139, USA FEATURES Location/Qualifiers source 1..1305 /organism="Homo sapiens" /db_xref="taxon:9606" /note="isolated from lambda TriplEx Brain cDNA library from ClonTech;" gene 1..1305 /gene="3OST1" 5'UTR 1..118 /gene="3OST1" sig_peptide 119..178 /gene="3OST1" CDS 119..1042 /gene="3OST1" /function="rate limiting enzyme for synthesis of anticoagulant heparan" /note="heparan sulfate sulfotransferase; 3-OST-1; interluminal Golgi resident protein (retension mechanism unknown)" /codon_start=1 /product="heparan sulfate 3-O-sulfotransferase-1 precursor" /db_xref="PID:g2618973" /translation="MAALLLGAVLLVAQPQLVPSRPAELGQQELLRKAGTLQDDVRDG VAPNGSAQQLPQTIIIGVRKGGTRALLEMLSLHPDVAAAENEVHFFDWEEHYSHGLGW YLSQMPFSWPHQLTVEKTPAYFTSPKVPERVYSMNPSIRLLLILRDPSERVLSDYTQV FYNHMQKHKPYPSIEEFLVRDGRLNVDYKALNRSLYHVHMQNWLRFFPLRHIHIVDGD RLIRDPFPEIQKVERFLKLSPQINASNFYFNKTKGFYCLRDSGRDRCLHESKGRAHPQ VDPKLLNKLHEYFHEPNKKFFELVGRTFDWH" mat_peptide 179..1039 /gene="3OST1" /product="heparan sulfate 3-O-sulfotransferase-1" misc_signal 260..268 /gene="3OST1" /note="encodes potential N-linked site" misc_feature 278..1039 /gene="3OST1" /note="encodes presumptive sulfotransferase catalytic domain" misc_signal 527..535 /gene="3OST1" /note="encodes potential N-linked site; this site probably not functional" misc_signal 692..700 /gene="3OST1" /note="encodes potential N-linked site" misc_signal 842..850 /gene="3OST1" /note="encodes potential N-linked site" misc_signal 863..871 /gene="3OST1" /note="encodes potential N-linked site" misc_structure 873..911 /gene="3OST1" /note="encodes cystine-bridged peptide loop" 3'UTR 1043..1305 /gene="3OST1" polyA_signal 1277..1283 /gene="3OST1" polyA_site 1305 /gene="3OST1" BASE COUNT 304 a 380 c 329 g 292 t ORIGIN 1 cgcggctcag taattgaagg cctgaaacgc ccatgtgcca ctgactagga ggcttccctg 61 ctgcggcact tcatgaccca gcggcgcgcg gcccagtgaa gccaccgtgg tgtccagcat 121 ggccgcgctg ctcctgggcg cggtgctgct ggtggcccag ccccagctag tgccttcccg 181 ccccgccgag ctaggccagc aggagcttct gcggaaagcg gggaccctcc aggatgacgt 241 ccgcgatggc gtggccccaa acggctctgc ccagcagttg ccgcagacca tcatcatcgg 301 cgtgcgcaag ggcggcacgc gcgcactgct ggagatgctc agcctgcacc ccgacgtggc 361 ggccgcggag aacgaggtcc acttcttcga ctgggaggag cattacagcc acggcttggg 421 ctggtacctc agccagatgc ccttctcctg gccacaccag ctcacagtgg agaagacccc 481 cgcgtatttc acgtcgccca aagtgcctga gcgagtctac agcatgaacc cgtccatccg 541 gctgctgctc atcctgcgag acccgtcgga gcgcgtgcta tctgactaca cccaagtgtt 601 ctacaaccac atgcagaagc acaagcccta cccgtccatc gaggagttcc tggtgcgcga 661 tggcaggctc aatgtggact acaaggccct caaccgcagc ctctaccacg tgcacatgca 721 gaactggctg cgctttttcc cgctgcgcca catccacatt gtggacggcg accgcctcat 781 cagggacccc ttccctgaga tccaaaaggt cgagaggttc ctaaagctgt cgccgcagat 841 caatgcttcg aacttctact ttaacaaaac caagggcttt tactgcctgc gggacagcgg 901 ccgggaccgc tgcttacatg agtccaaagg ccgggcgcac ccccaagtcg atcccaaact 961 actcaataaa ctgcacgaat attttcatga gccaaataag aagttcttcg agcttgttgg 1021 cagaacattt gactggcact gatttgcaat aagctaagct cagaaacttt cctactgtaa 1081 gttctggtgt acatctgagg ggaaaaagaa ttttaaaaaa gcatttaagg tataatttat 1141 ttgtaaaatc cataaagtac ttctgtacag tattagattc acaattgcca tatatactag 1201 ttatattttt ctacttgtta aatggagggc attttgtatt gtttttcatg gttgttaaca 1261 ttgtgtaata tgtctctata tgaaggaact aaactatttc actga // LOCUS AF019612 1759 bp mRNA PRI 09-JAN-1998 DEFINITION Homo sapiens S2P mRNA, complete cds. ACCESSION AF019612 NID g2745732 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1759) AUTHORS Rawson,R.B., Zelenski,N.G., Nijhawan,D., Ye,J., Sakai,J., Hasan,M.T., Chang,T.Y., Brown,M.S. and Goldstein,J.L. TITLE Complementation Cloning of S2P, a Gene Encoding a Putative Metalloprotease Required for Intramembrane Cleavage of SREBPs JOURNAL Mol. Cell 1 (1), 47-57 (1997) REFERENCE 2 (bases 1 to 1759) AUTHORS Rawson,R.B., Zelenski,N.G., Nijhawan,D., Ye,J., Sakai,J., Hasan,M.T., Chang,T.Y., Brown,M.S. and Goldstein,J.L. TITLE Direct Submission JOURNAL Submitted (14-AUG-1997) Molecular Genetics, UT Southwestern Medical Center, 5323 Harry Hines Blvd, Dallas, TX 75235, USA FEATURES Location/Qualifiers source 1..1759 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /cell_line="HeLa" CDS 100..1659 /codon_start=1 /product="S2P" /db_xref="PID:g2745733" /translation="MIPVSLVVVVVGGWTVVYLTDLVLKSSVYFKHSYEDWLENNGLS ISPFHIRWQTAVFNRAFYSWGRRKARMLYQWFNFGMVFGVIAMFSSFFLLGKTLMQTL AQMMADSPSSYSSSSSSSSSSSSSSSSSSSSSSSLHNEQVLQVVVPGINLPVNQLTYF FTAVLISGVVHEIGHGIAAIREQVRFNGFGIFLFIIYPGAFVDLFTTHLQLISPVQQL RIFCAGIWHNFVLALLGILALVLLPVILLPFYYTGVGVLITEVAEDSPAIGPRGLFVG DLVTHLQDCPVTNVQDWNECLDTIAYEPQIGYCISASTLQQLSFPVRAYKRLDGSTEC CNNHSLTDVCFSYRNNFNKRLHTCLPARKAVEATQVCRTNKDCKKSSSSSFCIIPSLE THTRLIKVKHPPQIDMLYVGHPLHLHYTVSITSFIPRFNFLSIDLPVVVETFVKYLIS LSGALAIVNAVPCFALDGQWILNSFLDATLTSVIGDNDVKDLIGFFILLGGSVLLAAN VTLGLWMVTAR" BASE COUNT 410 a 399 c 380 g 570 t ORIGIN 1 agcggatgct ggggctgtaa ggcgcgcgcg gtcagctgtt ggcggtgcag ggaggaggac 61 gccggggctc gccttccctc ctctgccgcc gctgccgcca tgattccggt gtcgctggtg 121 gtggtggtgg tgggtggctg gactgtcgtc tacctgaccg acttggtgct gaagtcatct 181 gtctatttta aacattctta tgaagactgg ctggaaaaca acggactgag catctcccct 241 ttccacataa gatggcaaac tgctgttttc aatcgtgcct tttacagttg gggacggcgg 301 aaagcaagga tgctttacca atggttcaat tttggaatgg tgtttggcgt aattgccatg 361 tttagctcat tttttctcct tggaaaaacg ctgatgcaga ctttggcaca aatgatggct 421 gactctccct cttcttattc ttcctcctct tcttcctctt cctcctcttc ttcctcttcc 481 tcttcttcat cttcttcctc ttcctcgctt cacaatgaac aggtgttaca agttgtggtt 541 cctggtataa atttacccgt caatcaactg acctatttct tcacggcagt tctcattagt 601 ggtgttgtac atgaaattgg acatgggata gcagctatta gggaacaagt tcgatttaat 661 ggctttggga tttttctctt cattatttat cctggagcat ttgttgatct gttcaccact 721 catttgcaac ttatatcgcc agtccagcag ctaaggatat tttgtgcagg tatctggcat 781 aattttgtcc ttgcactctt gggtatttta gctcttgttc tcctcccagt aattctcttg 841 ccattttact acactggagt tggggtgctc atcactgaag ttgctgagga ctctcctgcc 901 attggaccca gaggcctttt tgtgggagac cttgtcaccc atctacagga ttgtcctgtt 961 actaatgtgc aagattggaa tgaatgttta gataccatcg cctatgagcc ccaaattggt 1021 tactgtataa gtgcatcaac tttacagcag ttaagtttcc cagttagagc atacaaacga 1081 ctagatggtt caactgaatg ctgtaacaat cacagcctca cagatgtgtg cttttcctac 1141 agaaataatt ttaataagcg tttgcataca tgtcttcctg cccggaaagc agttgaagca 1201 actcaagttt gcagaaccaa taaagactgt aaaaaaagct caagttcaag tttctgtata 1261 ataccttctt tggaaactca cactcgctta ataaaagtaa aacacccacc tcagattgat 1321 atgttatacg taggacatcc tctgcatctt cactacacag tgagcatcac cagttttatc 1381 ccacgtttta actttctaag catagatctg ccagtggttg tggagacatt tgtcaagtac 1441 ctgatttccc tctcaggagc tctggctatt gttaatgcag taccctgctt tgctttggat 1501 ggacaatgga ttctaaactc tttcttggat gccaccctta cctcagtgat tggagacaat 1561 gatgtcaaag atctaatagg gtttttcatc ttgctgggtg gcagtgtact tttggctgcc 1621 aatgtgaccc tgggactctg gatggttaca gcacggtaat gtttgcactc atctgacaga 1681 atccctgagt tacagtatac agctatgtgg taatattcat tgccattgaa attcttactt 1741 ggtatgaaat ataaagtgt // LOCUS AF019627 1128 bp mRNA PRI 21-NOV-1997 DEFINITION Homo sapiens myostatin (MSTN) mRNA, complete cds. ACCESSION AF019627 NID g2623581 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1128) AUTHORS McPherron,A.C. and Lee,S.J. TITLE Double muscling in cattle due to mutations in the myostatin gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (23), 12457-12461 (1997) MEDLINE 98024153 REFERENCE 2 (bases 1 to 1128) AUTHORS McPherron,A.C. and Lee,S.J. TITLE Direct Submission JOURNAL Submitted (15-AUG-1997) Molecular Biology and Genetics, Johns Hopkins University School of Medicine, 725 N. Wolfe St., Baltimore, MD 21205, USA FEATURES Location/Qualifiers source 1..1128 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="skeletal muscle" gene 1..1128 /gene="MSTN" CDS 1..1128 /gene="MSTN" /note="transforming growth factor-beta superfamily member; growth/differentiation factor-8; GDF-8" /codon_start=1 /product="myostatin" /db_xref="PID:g2623582" /translation="MQKLQLCVYIYLFMLIVAGPVDLNENSEQKENVEKEGLCNACTW RQNTKSSRIEAIKIQILSKLRLETAPNISKDVIRQLLPKAPPLRELIDQYDVQRDDSS DGSLEDDDYHATTETIITMPTESDFLMQVDGKPKCCFFKFSSKIQYNKVVKAQLWIYL RPVETPTTVFVQILRLIKPMKDGTRYTGIRSLKLDMNPGTGIWQSIDVKTVLQNWLKQ PESNLGIEIKALDENGHDLAVTFPGPGEDGLNPFLEVKVTDTPKRSRRDFGLDCDEHS TESRCCRYPLTVDFEAFGWDWIIAPKRYKANYCSGECEFVFLQKYPHTHLVHQANPRG SAGPCCTPTKMSPINMLYFNGKEQIIYGKIPAMVVDRCGCS" BASE COUNT 373 a 220 c 238 g 297 t ORIGIN 1 atgcaaaaac tgcaactctg tgtttatatt tacctgttta tgctgattgt tgctggtcca 61 gtggatctaa atgagaacag tgagcaaaaa gaaaatgtgg aaaaagaggg gctgtgtaat 121 gcatgtactt ggagacaaaa cactaaatct tcaagaatag aagccattaa gatacaaatc 181 ctcagtaaac ttcgtctgga aacagctcct aacatcagca aagatgttat aagacaactt 241 ttacccaaag ctcctccact ccgggaactg attgatcagt atgatgtcca gagggatgac 301 agcagcgatg gctctttgga agatgacgat tatcacgcta caacggaaac aatcattacc 361 atgcctacag agtctgattt tctaatgcaa gtggatggaa aacccaaatg ttgcttcttt 421 aaatttagct ctaaaataca atacaataaa gtagtaaagg cccaactatg gatatatttg 481 agacccgtcg agactcctac aacagtgttt gtgcaaatcc tgagactcat caaacctatg 541 aaagacggta caaggtatac tggaatccga tctctgaaac ttgacatgaa cccaggcact 601 ggtatttggc agagcattga tgtgaagaca gtgttgcaaa attggctcaa acaacctgaa 661 tccaacttag gcattgaaat aaaagcttta gatgagaatg gtcatgatct tgctgtaacc 721 ttcccaggac caggagaaga tgggctgaat ccgtttttag aggtcaaggt aacagacaca 781 ccaaaaagat ccagaaggga ttttggtctt gactgtgatg agcactcaac agaatcacga 841 tgctgtcgtt accctctaac tgtggatttt gaagcttttg gatgggattg gattatcgct 901 cctaaaagat ataaggccaa ttactgctct ggagagtgtg aatttgtatt tttacaaaaa 961 tatcctcata ctcatctggt acaccaagca aaccccagag gttcagcagg cccttgctgt 1021 actcccacaa agatgtctcc aattaatatg ctatatttta atggcaaaga acaaataata 1081 tatgggaaaa ttccagcgat ggtagtagac cgctgtgggt gctcatga // LOCUS AF019952 1705 bp mRNA PRI 11-DEC-1997 DEFINITION Homo sapiens tumor-suppressing subchromosomal transferable fragment 1 (TSSC1) mRNA, complete cds. ACCESSION AF019952 NID g2655036 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1705) AUTHORS Hu,R.J., Lee,M.P., Connors,T.D., Johnson,L.A., Burn,T.C., Su,K., Landes,G.M. and Feinberg,A.P. TITLE A 2.5-Mb transcript map of a tumor-suppressing subchromosomal transferable fragment from 11p15.5, and isolation and sequence analysis of three novel genes JOURNAL Genomics 46 (1), 9-17 (1997) MEDLINE 98066757 REFERENCE 2 (bases 1 to 1705) AUTHORS Hu,R.J., Lee,M.P., Conners,T.D., Johnson,L.A., Burn,T.C., Su,K., Landes,G.M. and Feinberg,A.P. TITLE Direct Submission JOURNAL Submitted (18-AUG-1997) Medicine, Johns Hopkins University School of Medicine, 1064 ROSS, 720 Rutland Ave., Baltimore, MD 21205, USA FEATURES Location/Qualifiers source 1..1705 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11p15.5" gene 1..1705 /gene="TSSC1" CDS 152..1315 /gene="TSSC1" /codon_start=1 /product="tumor-suppressing subchromosomal transferable fragment 1" /db_xref="PID:g2655037" /translation="MEDDAPVIYGLEFQARALTPQTAETDAIRFLVGTQSLKYDNQIH IIDFDDENNIINKNVLLHQAGEIWHISASPADRGVLTTCYNRTSDSKVLTCAAVWRMP KELESGSHESPDDSSSTAQTLELLCHLDNTAHGNMACVVWEPMGDGKKIISLADNHIL LWDLQESSSQAVLASSASLEGKGQLKFTSGRWSPHHNCTQVATANDTTLRGWDTRSMS QIYCIENAHGQLVRDLDFNPNKQYYLASCGDDCKVKFWDTRNVTEPVKTLEEHSHWVW NVRYNHSHDQLVLTGSSDSRVILSNMVSISSEPFGHLVDDDDISDQEDHRSEEKSKEP LQDNVIATYEEHEDSVYAVDWSSADPWLFASLSYDGRLVINRVPRALKYHILL" BASE COUNT 435 a 441 c 468 g 361 t ORIGIN 1 aattcggcac gagaagactt ccagtttgga gtcgtttgct gcggggaggg aatgaatggg 61 cgctgggaac acgcccgcga ggtggggacg cgccggccgt agcgaggtcc ttagcgtgtg 121 agtggccggg gtcgggtcgc ttccccgcag catggaggac gatgcaccag tgatctacgg 181 gctggagttc caggcacgtg ccttaacacc tcaaactgca gaaacagatg ccattcggtt 241 tttggttggg acgcagtctc ttaaatatga taatcagatc catatcatag attttgacga 301 tgaaaacaac attataaata aaaatgtcct cctccatcaa gcgggtgaaa tctggcatat 361 tagcgctagc cctgcagaca gaggtgtgct gacgacctgc tacaacagaa cttcagacag 421 caaagtcctg acatgtgcag ccgtgtggag gatgccgaag gaattggaat caggcagcca 481 cgagtcccct gatgattcat ccagcactgc acagaccctg gagctgctct gtcaccttga 541 caacacagcc catggcaaca tggcctgtgt cgtgtgggag ccaatgggag atgggaagaa 601 aatcatttcc ttggctgata accatatcct gctgtgggat ttacaggaaa gctcgagcca 661 ggctgtgctg gccagctcag cgtccctgga agggaaggga caactgaagt tcacctcagg 721 acggtggagc ccacatcata actgcaccca ggtggccaca gcgaacgaca ccaccctccg 781 tggctgggac acccggagca tgagccagat ctactgcata gagaatgccc acggacagct 841 ggtgcgggac cttgacttta atcccaataa gcagtactac ttggccagct gcggagacga 901 ctgtaaggtg aagttctggg acacccgaaa tgtcaccgaa cccgtgaaga ccctggagga 961 gcactcccac tgggtgtgga acgtccgcta caaccactct catgaccagc tggtcctcac 1021 gggcagcagt gacagcagag tcatcctttc caacatggtg tccatctcgt cggagccctt 1081 cggccacttg gtagacgacg atgacatcag tgaccaggag gaccaccgtt ctgaagagaa 1141 gagcaaggag cccctgcagg acaacgtgat cgccacctac gaggagcacg aggacagcgt 1201 ctatgccgtg gactggtcct cggctgaccc gtggctgttt gcctccctga gctatgacgg 1261 gaggctcgtg atcaacaggg tgcccagggc cctgaagtac cacatcctgc tatgactccc 1321 gggcctgggt tatccaggtc ccattgagtg gttttcctct tggcagattc tcaaacagtc 1381 gcagctcttt ggaggtgact cgtgttccag gtggatccct ctctgggaga gccgctgttc 1441 ccttcctgta gcagcagcat ttatgaatgg ggtgaatggg gctattgtcg acggcacagc 1501 taatgcccga acccagcccc tgtcggcaga gacagagccc cacattatta tgtgaataac 1561 aatgttttct gttttaaggg tgtcaggagt ttcgcttttt aaaaaaatgt ctgttcctgc 1621 agtagtaact cttctttctc ttgagagtaa aaaatgaaat aaaataaatc cacgctgaca 1681 aaaaaaaaaa aaaaaaaaaa aaaaa // LOCUS AF019968 2733 bp mRNA PRI 21-DEC-1997 DEFINITION Homo sapiens Su(var)3-9 homolog (SUV39H) mRNA, complete cds. ACCESSION AF019968 NID g2707214 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2733) AUTHORS Laible,G., Lebersorger,A. and Jenuwein,T. TITLE Human homolog of Drosophila Su(var)3-9 JOURNAL Unpublished REFERENCE 2 (bases 1 to 2733) AUTHORS Laible,G., Lebersorger,A. and Jenuwein,T. TITLE Direct Submission JOURNAL Submitted (18-AUG-1997) IMP, Dr. Bohr-Gasse 7, Wien 1030, Austria FEATURES Location/Qualifiers source 1..2733 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="B-cell" gene 1..2733 /gene="SUV39H" CDS 46..1284 /gene="SUV39H" /note="pSUV39H" /codon_start=1 /product="Su(var)3-9 homolog" /db_xref="PID:g2707215" /translation="MAENLKGCSVCCKSSWNQLQDLCRLAKLSCPALGISKRNLYDFE VEYLCDYKKIREQEYYLVKWRGYPDSESTWEPRQNLKCVRILKQFHKDLERELLRRHH RSKTPRHLDPSLANYLVQKAKQRRALRRWEQELNAKRSHLGRITVENEVDLDGPPRAF VYINEYRVGEGITLNQVAVGCECQDCLWAPTGGCCPGASLHKFAYNDQGQVRLRAGLP IYECNSRCRCGYDCPNRVVQKGIRYDLCIFRTDDGRGWGVRTLEKIRKNSFVMEYVGE IITSEEAERRGQIYDRQGATYLFDLDYVEDVYTVDAAYYGNISHFVNHSCDPNLQVYN VFIDNLDERLPRIAFFATRTIRAGEELTFDYNMQVDPVDMESTRMDSNFGLAGLPGSP KKRVRIECKCGTESCRKYLF" BASE COUNT 595 a 783 c 789 g 566 t ORIGIN 1 ctcgcgaggc cggctaggcc cgaatgtcgt tagccgtggg gaaagatggc ggaaaattta 61 aaaggctgca gcgtgtgttg caagtcttct tggaatcagc tgcaggacct gtgccgcctg 121 gccaagctct cctgccctgc cctcggtatc tctaagagga acctctatga ctttgaagtc 181 gagtacctgt gcgattacaa gaagatccgc gaacaggaat attacctggt gaaatggcgt 241 ggatatccag actcagagag cacctgggag ccacggcaga atctcaagtg tgtgcgtatc 301 ctcaagcagt tccacaagga cttagaaagg gagctgctcc ggcggcacca ccggtcaaag 361 accccccggc acctggaccc aagcttggcc aactacctgg tgcagaaggc caagcagagg 421 cgggcgctcc gtcgctggga gcaggagctc aatgccaagc gcagccatct gggacgcatc 481 actgtagaga atgaggtgga cctggacggc cctccgcggg ccttcgtgta catcaatgag 541 taccgtgttg gtgagggcat caccctcaac caggtggctg tgggctgcga gtgccaggac 601 tgtctgtggg cacccactgg aggctgctgc ccgggggcgt cactgcacaa gtttgcctac 661 aatgaccagg gccaggtgcg gcttcgagcc gggctgccca tctacgagtg caactcccgc 721 tgccgctgcg gctatgactg cccaaatcgt gtggtacaga agggtatccg atatgacctc 781 tgcatcttcc ggacggatga tgggcgtggc tggggcgtcc gcaccctgga gaagattcgc 841 aagaacagct tcgtcatgga gtacgtggga gagatcatta cctcagagga ggcagagcgg 901 cggggccaga tctacgaccg tcagggcgcc acctacctct ttgacctgga ctacgtggag 961 gacgtgtaca ccgtggatgc cgcctactat ggcaacatct cccactttgt caaccacagt 1021 tgtgacccca acctgcaggt gtacaacgtc ttcatagaca accttgacga gcggctgccc 1081 cgcatcgctt tctttgccac aagaaccatc cgggcaggcg aggagctcac ctttgattac 1141 aacatgcaag tggaccccgt ggacatggag agcacccgca tggactccaa ctttggcctg 1201 gctgggctcc ctggctcccc taagaagcgg gtccgtattg aatgcaagtg tgggactgag 1261 tcctgccgca aatacctctt ctagccctta gaagtctgag gccagactga ctgagggggc 1321 ctgaagctac atgcacctcc cccactgctg ccctcctgtc gagaatgact gccagggcct 1381 cgcctgcctc cacctgcccc cacctgctcc tacctgctct acgttcaggg ctgtggccgt 1441 ggtgaggacc gactccagga gtcccctttc cctgtcccag ccccatctgt gggttgcact 1501 tacaaacccc cacccacctt cagaaatagt ttttcaacat caagactctc tgtcgttggg 1561 attcatggcc tattaaggag gtccaagggg tgagtcccaa cccagcccca gaatatattt 1621 gtttttgcac ctgcttctgc ctggagattg aggggtctgc tgcaggcctc ctccctgctg 1681 ccccaaaggt atggggaagc aaccccagag caggcagaca tcagaggcca gagtgcctag 1741 cccgacatga agctggttcc ccaaccacag aaactttgta ctagtgaaag aaaggggtcc 1801 ctggcctacg ggctgaggct ggtttctgct cgtgcttaca gtgctgggta gtgttggccc 1861 taagagctgt agggtctctt cttcagggct gcatatctga gaagtggatg cccacatgcc 1921 actggaaggg aagtgggtgt ccatgggcca ctgagcagtg agaggaaggc agtgcagagc 1981 tggccagccc tggaggtagg ctgggaccaa gctctgcctt cacagtgcag tgaaggtacc 2041 tagggctctt gggagctctg cggttgctag gggccctgac ctggggtgtc atgaccgctg 2101 acaccactca gagctggaac caagatctag atagtccgta gatagcactt aggacaagaa 2161 tgtgcattga tggggtggtg atgaggtgcc aggcactagg tagagcacct ggtccacgtg 2221 gattgtctca gggaagcctt gaaaaccacg gaggtggatg ccaggaaagg gcccatgtgg 2281 cagaaggcaa agtacaggcc aagaattggg ggtgggggag atggcttccc cactatggga 2341 tgacgaggcg agagggaagc ccttgctgcc tgccattccc agaccccagc cctttgtgct 2401 caccctggtt ccactggtct caaaagtcac ctgcctacaa atgtacaaaa ggcgaaggtt 2461 ctgatggctg ccttgctcct tgctccccca ccccctgtga ggacttctct aggaagtcct 2521 tcctgactac ctgtgcccag agtgccccta catgagactg tatgccctgc tatcagatgc 2581 cagatctatg tgtctgtctg tgtgtccatc ccgccggccc cccagactaa cctccaggca 2641 tggactgaat ctggttctcc tcttgtacac ccctcaaccc tatgcagcct ggagtgggca 2701 tcaataaaat gaactgtcga ctgaaaaaaa aaa // LOCUS AF020044 1407 bp mRNA PRI 03-FEB-1998 DEFINITION Homo sapiens lymphocyte secreted C-type lectin precursor, mRNA, complete cds. ACCESSION AF020044 NID g2828595 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1407) AUTHORS Bannwarth,S., Giordanengo,V., Lesimple,J. and Lefebvre,J.C. TITLE Molecular cloning of a new secreted sulfated mucin-like protein with a C-type lectin domain that is expressed in lymphoblastic cells JOURNAL J. Biol. Chem. 273 (4), 1911-1916 (1998) MEDLINE 98113146 REFERENCE 2 (bases 1 to 1407) AUTHORS Lefebvre,J.C. TITLE Direct Submission JOURNAL Submitted (19-AUG-1997) Virologie, Faculte de Medecine, Av. Valombrose, Nice cedex 2 06107, France FEATURES Location/Qualifiers source 1..1407 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 180..242 CDS 180..1151 /codon_start=1 /product="lymphocyte secreted C-type lectin precursor" /db_xref="PID:g2828596" /translation="MQAAWLLGALVVPQLLGFGHGARGAEREWEGGWGGAQEEERERE ALMLKHLQEALGLPAGRGDENPAGTVEGKEDWEMEEDQGEEEEEEATPTPSSGPSPSP TPEDIVTYILGRLAGLDAGLHQLHVRLHALDTRVVELTQGLRQLRNAAGDTRDAVQAL QEAQGRAEREHGRLEGCLKGLRLGHKCFLLSRDFEAQAAAQARCTARGGSLAQPADRQ QMEALTRYLRAALAPYNWPVWLGVHDRRAEGLYLFENGQRVSFFAWHRSPRPELGAQP SASPHPLSPDQPNGGTLENCVAQASDDGSWWDHDCQRRLYYVCEFPF" mat_peptide 243..1148 /product="lymphocyte secreted C-type lectin" misc_feature 243..449 /note="encodes glutamic acid rich domain" misc_feature 360..368 /note="encodes RGD triplet" misc_feature 450..491 /note="encodes Pro-Ser/Thr rich domain" misc_feature 501..626 /note="encodes leucine zipper domain" misc_feature 705..1139 /note="encodes C-type lectin domain (long form)" polyA_signal 1371..1376 BASE COUNT 250 a 465 c 481 g 211 t ORIGIN 1 cgaccaacgg accggacaga gacgaggaga ggaacaggaa gagagaagct gggagaatcg 61 ggaacctggg ggctagtgac ctgcacacag ggcaggggca ctcggcagtt cccagaggcc 121 acccctccca ccccagacat ccagacatct ggaactttgg gtgccaagag tccagcttaa 181 tgcaggcagc ctggcttttg ggggctttgg tggtccccca gctcttgggc tttggccatg 241 gggctcgggg agcagagagg gagtgggagg gaggctgggg aggtgcccag gaggaggagc 301 gggagaggga ggccctgatg ctgaagcatc tgcaggaagc cctaggactg cctgctggga 361 ggggggatga gaatcctgcc ggaactgttg agggaaaaga ggactgggag atggaggagg 421 accaggggga ggaagaggag gaggaagcaa cgccaacccc atcctccggc cccagcccct 481 ctcccacccc tgaggacatc gtcacttaca tcctgggccg cctggccggc ctggacgcag 541 gcctgcacca gctgcacgtc cgtctgcacg cgttggacac ccgcgtggtc gagctgaccc 601 aggggctgcg gcagctgcgg aacgcggcag gcgacacccg cgatgccgtg caagccctgc 661 aggaggcgca gggtcgcgcc gagcgcgagc acggccgctt ggagggctgc ctgaaggggc 721 tgcgcctggg ccacaagtgc ttcctgctct cgcgcgactt cgaagctcag gcggcggcgc 781 aggcgcggtg cacggcgcgg ggcgggagcc tggcgcagcc ggcagaccgc cagcagatgg 841 aggcgctcac tcggtacctg cgcgcggcgc tcgctcccta caactggccc gtgtggctgg 901 gcgtgcacga tcggcgcgcc gagggcctct acctcttcga aaacggccag cgcgtgtcct 961 tcttcgcctg gcatcgctca ccccgccccg agctcggcgc ccagcccagc gcctcgccgc 1021 atccgctcag cccggaccag cccaacggtg gcacgctcga gaactgcgtg gcgcaggcct 1081 ctgacgacgg ctcctggtgg gaccacgact gccagcggcg tctctactac gtctgcgagt 1141 tccccttcta gcggggccgg taccccgcct ccttgcccat cccaccaccc ggcctttccc 1201 tgcgccgtgc ccaccctcct ccggaatcgc ccttcccttc ctggccacga atggcagcgt 1261 cctccccgac ccccagtctg ggcgcttctg ggagggctct tgcggtgccg gcactcctcc 1321 ttgttagtgt ctttccttga aggggcgggc accaggctag gtccggtgcc aataaatcct 1381 tgtggaatct gaaaaaaaaa aaaaaaa // LOCUS AF020202 6330 bp mRNA PRI 24-SEP-1997 DEFINITION Homo sapiens Munc13 mRNA, complete cds. ACCESSION AF020202 NID g2431999 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6330) AUTHORS Song,Y., Ailenberg,M. and Silverman,M. TITLE Cloning of a novel gene homologous to Munc13s in the human kidney JOURNAL Unpublished REFERENCE 2 (bases 1 to 6330) AUTHORS Song,Y. and Silverman,M. TITLE Direct Submission JOURNAL Submitted (19-AUG-1997) Clinical Science Division, University of Toronto, 1 Kings College Circle, Medical Science Bldg. rm 7207, Toronto, Ontario M5S 1A8, Canada FEATURES Location/Qualifiers source 1..6330 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" /cell_type="mesangial cells (primary culture)" CDS 225..5000 /note="contains C2 domains; similar to R. norvegicus Munc13-2 encoded by GenBank Accession Number U24071" /codon_start=1 /product="Munc13" /db_xref="PID:g2432000" /translation="MSLLCVRVKRAKFQGSPDKFNTYVTLKVQNVKSTTVAVRGDQPS WEQDFMFEISRLDLGLSVEVWNKGLIWDTMVGTVWIALKTIRQSDEEGPGEWSTLEAE TLMKDDEICGTRNPTPHKILLDTRFELPFDIPEEEARYWTYKWEQINALGADNEYSSQ EESQRKPLPTAAAQCSFEDPDSAVDDRDSDYRSETSNSFPPPYHTASQPNASVHQFPV PVRSPQQLLLQGSSRDSCNDSMQSYDLDYPERRAISPTSSSRYGSSCNVSQGSSQLSE LDQYHEQDDDHRETDSIHSCHSSHSLSRDGQAGFGEQEKPLEVTGQAEKEAACEPKEM KEDATTHPPPDLVLQKDHFLGPQESFPEENASSPFTQARAHWIRAVTKVRLQLQEIPD DGDPSLPQWLPEGPAGGLYGIDSMPDLRRKKPLPLVSDLSLVQSRKAGITSAMATRTS LKDEELKSHVYKKTLQALIYPISCTTPHNFEVWTATTPTYCYECEGLLWGIARQGMRC SECGVKCHEKCQDLLNADCLQRAAEKSCKHGAEDRTQNIIMAMKDRMKIRERNKPEIF EVIRDVFTVNKAAHVQQMKTVKQSVLDGTSKWSAKITITVVCAQGLQAKDKTGSSDPY VTVQVSKTKKRTKTIFGNLNPVWEEKFHFECHNSSDRIKVRVWDEDDDIKSRVKQRLK RESDDFLGQTIIEVRTLSGEMDVWYNLEKRTDKSAVSGAIRLQISVEIKGEEKVAPYH VQYTCLHENLFHYLTDIQGSGGVRIPEARGDDAWKVYFDETAQEIVDEFAMRYGIESI YQAMTHFACLSSKYMCPGVPAVMSTLLANINAYYAHTTASTNVSASDRFAASNFGKER FVKLLDQLHNSLRIDLSTYRNNFPAGSPERLQDLKSTVDLLTSITFFRMKVQELQSPP RASQVVKDCVKACLNSTYEYIFNNCHDLYSRQYQLKQELPPEEQGPSIRNLDFWPKLI TLIVSIIEEDKNSYTPVLNQFPQELNVGKVSAEVMWHLFAQDMKYALEEHEKDHLCKS ADYMNLHFKVKWLHNEYVRDLPVLQGQVPEYPAWFEQFVLQWLDENEDVSLEFLRGAL ERDKKDGFQQTSEHALFSCSVVDVFTQLNQSFEIIRKLECPDPSILAHYMRRFAKTIG KVLMQYADILSKDFPAYCTKEKLPCILMNNVQQLRVQLEKMFEAMGGKELDLEAADSL KEPQVKLNTVLDELSMVFGNSFQVRIDECVRQMADILGQVRGTGNASPDARASAAQDA DSVLRPLMDFLDGNLTLFATVCEKTVLKRVLKELWRVVMNTMERMIVLPPLTDQTGTQ LIFTAAKELSHLSKLKDHMVREETRNLTPKQCAVLDLALDTIKQYFHAGGNGLKKTFL EKSPDLQSLRYALSLYTQTTDTLIKRFVRSQTTQGSGVDDPVGEVSIQVDLFTHPGTG EHKVTVKVVAANDLKWQTAGMFRPFVEVTMVGPHQSDKKRKFTTKSKSNNWAPKYNET FHLLLGNEEGPESYELQICVKDYCFAREDRVLGLAVMPLRDVTAKGSCACWCPLGRKI HMDETGLTILRILSQRSNDEVAREFVKLKSESRSTEEGS" BASE COUNT 1688 a 1543 c 1682 g 1417 t ORIGIN 1 agtcccagcc tgccggccgg tactcaccgc tacccggagt tcgctcagac ggtgagattt 61 ggggcgggtc cgaggcagcg gcgggacgta cctgcgaccg ggaccatgag gagctgccag 121 acccgtgggg ccggtaacga gagcagtcgc ggcacctgct gagaggaaag agggagcggt 181 cccgcgcggc tggggcgcgg cagaggcttg cccgatcctc ggccatgtca ctgctctgcg 241 tgcgcgttaa aagggccaaa ttccagggtt caccagataa atttaacaca tatgtgaccc 301 tgaaagtaca gaatgtgaag agcacaactg tagcagttcg tggtgatcag ccttcctggg 361 aacaggattt catgtttgag attagtcgcc tggacctggg tctaagtgtg gaggtatgga 421 acaaaggact gatctgggac accatggtgg ggactgtgtg gattgcgctg aagactattc 481 gtcagtcgga tgaggaaggg cctggggaat ggtccacatt agaggcagag acgttaatga 541 aagacgatga gatctgtgga actagaaacc caactcctca taaaattttg cttgatacaa 601 gatttgagtt gccttttgat atcccagagg aggaagccag atattggacc tacaaatggg 661 agcaaatcaa tgccttggga gctgacaatg agtattctag tcaagaagaa agccagagga 721 agccattgcc cactgctgcc gcccagtgtt cttttgaaga ccctgatagt gccgtcgatg 781 accgagatag tgactatcgc agtgagacca gcaacagctt cccacctcct taccatacag 841 cttcccagcc caacgcttct gtgcaccagt tccctgtgcc ggtgcgatcg ccacagcagc 901 tgctacttca aggcagttcc cgggactctt gtaatgactc tatgcaaagt tatgaccttg 961 attatccaga gcggcgggct atcagcccca ccagcagcag taggtatggc tcctcctgta 1021 atgtgagtca aggaagctct cagctaagtg aactagacca gtatcacgaa caagatgacg 1081 accatcggga gacggactcg attcattctt gccacagctc tcacagcctg tccagagatg 1141 gccaagcagg ttttggagaa caagagaaac ccttggaggt gacaggtcaa gcagagaagg 1201 aggcagcatg tgaacccaag gagatgaaag aagatgccac aacccaccct cccccagatc 1261 tggtgctgca aaaagaccac ttcctaggtc cccaggagag ttttcctgag gagaatgcat 1321 cttcaccatt tacccaagcc agagcacatt ggatccgagc agttaccaag gttcgactcc 1381 agctgcagga gattccagat gatggtgacc cctctctgcc tcagtggctc ccggaagggc 1441 cagccggagg gctctatggc attgacagca tgccagattt acgcagaaag aagccactgc 1501 cacttgtcag tgatctgtca ctggtccagt ctcggaaggc aggaatcact tctgcaatgg 1561 ctacacgcac ttctcttaag gacgaagagc tgaaatccca cgtgtataag aaaaccctgc 1621 aggccttaat ctaccccatt tcgtgcacca ctcctcataa ctttgaggtc tggacggcca 1681 ctaccccaac ctactgctat gagtgtgaag gcctgctctg gggcattgcc cggcagggca 1741 tgcgctgcag cgaatgtgga gtcaagtgcc atgagaagtg ccaggatctg ctcaatgctg 1801 actgcctgca gcgggctgca gaaaagagct gtaaacatgg agctgaggac cggacccaga 1861 acattatcat ggccatgaag gaccgcatga agatccgaga gcgaaataag ccagagatct 1921 ttgaagttat ccgggacgtc ttcacagtga acaaagctgc ccatgtgcag cagatgaaaa 1981 cagtgaagca gagtgtactg gatggcacct ccaagtggtc agccaagatc accattactg 2041 tggtgtgtgc ccagggccta caagccaagg acaaaacagg atccagtgac ccttacgtga 2101 ctgtgcaagt cagcaaaact aagaagcgta ccaagaccat ttttggaaac ttgaatcctg 2161 tttgggagga gaagttccat tttgagtgcc acaactcctc tgaccgcatt aaggtgcgtg 2221 tatgggatga ggatgatgac atcaagtcaa gagtaaagca acgcctaaag cgagagtctg 2281 atgatttcct tggccaaacc atcattgagg ttcggaccct aagtggcgag atggacgtct 2341 ggtacaactt ggagaagagg acagacaaat cagccgtctc aggggctatc cgactacaaa 2401 tcagtgtgga gatcaagggg gaggagaaag tagccccata ccacgtgcag tatacatgtc 2461 tccatgagaa tcttttccat tacctcacag acattcaggg cagtggagga gtccgcatcc 2521 ctgaagctcg aggagacgat gcctggaagg tgtactttga tgagacagcc caagaaattg 2581 tggatgaatt tgccatgcgt tatggcattg agtccatata tcaggccatg acgcactttg 2641 catgtttatc atccaagtac atgtgtcctg gtgtgccagc agtgatgagc accttactgg 2701 ccaacatcaa cgcctactat gcccacacaa ctgcctctac caatgtctct gcatctgatc 2761 gctttgcagc ctccaacttt gggaaagaga gatttgtaaa actgctggac cagctacaca 2821 actcactgag gatcgacctc tctacataca ggaataattt ccctgctggg agtcctgaac 2881 ggcttcagga cttaaaatcc acagtggatt tgctgaccag cattactttc ttcagaatga 2941 aggtacaaga actgcaaagc cctccaagag ccagccaggt ggtaaaggat tgtgtgaagg 3001 cctgtttgaa ctccacatat gaatatatct tcaacaactg ccacgactta tacagccgcc 3061 agtaccagct gaagcaggag ctacctccag aggaacaagg gcccagcatt cggaacctgg 3121 atttctggcc caagctcatc acactcatcg tgtcaatcat agaggaagat aagaattcct 3181 acacacctgt tctgaaccag tttcctcagg agttgaatgt gggaaaagtc agcgcagaag 3241 tgatgtggca tttgtttgcc caagacatga aatatgcatt ggaggagcat gagaaagacc 3301 acctgtgtaa aagtgctgac tacatgaacc tgcacttcaa ggtgaagtgg ctccacaatg 3361 aatacgtgcg ggatctgcct gtcctccagg ggcaggtgcc tgagtaccca gcgtggtttg 3421 agcagttcgt gctacaatgg ctggatgaga atgaggatgt atccctggaa ttcctgcgtg 3481 gggccctgga acgagataag aaggatggat tccagcagac atcagagcat gcactctttt 3541 cctgctctgt ggtggatgtc ttcacacaac tcaatcagag ctttgagatc atccggaagc 3601 tggaatgccc agaccccagc atccttgccc actacatgag gaggtttgct aagaccatcg 3661 ggaaggtgct gatgcagtat gcagacatct tgtcaaagga cttcccagcc tattgcacaa 3721 aggagaaact gccctgcatc ctgatgaaca acgtgcagca actgagggtc cagctggaga 3781 aaatgtttga ggccatggga ggcaaggagc tggaccttga agctgcagac agtctgaagg 3841 agccgcaggt gaaactgaat acggttctgg atgagctcag catggtgttt ggaaacagtt 3901 tccaggtacg gattgatgag tgtgttcgac aaatggccga catcctgggc caggttcggg 3961 gcacagggaa tgcatctcca gacgccaggg cctcagcggc tcaggatgca gatagcgtac 4021 tccggcctct catggacttc ctggatggca acctcaccct ctttgccact gtgtgtgaga 4081 agacggttct gaagcgtgta ctgaaggagc tctggcgcgt ggtgatgaac acaatggaga 4141 ggatgattgt tctgccccca ctcactgacc agacgggcac ccagctgatc ttcactgctg 4201 ccaaggagct gagccatctt tccaaactca aggatcacat ggtacgagag gaaacacgga 4261 atctcactcc aaagcagtgt gcagtccttg acctcgccct ggacaccatc aagcaatact 4321 tccatgcagg aggcaatggg ctgaagaaaa ccttcctgga gaagagccca gatctgcagt 4381 ctctacgcta tgccctgtct ctgtacacac agactactga cactctcatc aagaggtttg 4441 tgcgctcgca gaccacccaa gggtctggtg tggacgatcc tgtgggagaa gtctctattc 4501 aggtggactt gtttacacac cctggtactg gggagcacaa ggtcacagtg aaagtggtgg 4561 ctgccaatga cctcaagtgg cagacagcgg gtatgttccg gcctttcgtg gaggtgacta 4621 tggttggccc acaccaaagt gataagaaga ggaagttcac aaccaaatcc aaaagcaaca 4681 actgggcccc caagtacaat gagacattcc acttactcct gggaaatgag gaggggcccg 4741 agtcctatga gttgcagata tgcgtgaagg attactgctt tgcccgggaa gatcgcgtgc 4801 tagggctggc tgtgatgcct ctgagggatg tcacagccaa gggcagctgt gcctgctggt 4861 gccccttggg ccggaagatc catatggatg agacaggcct gaccattctc cggattttat 4921 ctcagaggag caatgacgag gtggcccgag aatttgtgaa actcaaatca gagtctcgtt 4981 ccacggagga ggggagctga agaggttcga ctcctgtgcc aatcaggcag cagcaatttc 5041 acaaatcagg gccagtggga gttagctgtg taaccggctt agggtctttg cagtcaagag 5101 gctgacccct tcagttaaag atatttaagg aaaaatttgg ggtggtgata atatggcttt 5161 tcacagaaag ggtcatgaag ccctggccca acaggactgt ggtactaggg gctgggatgt 5221 ggggttacca catggagaga ttttccatta agagagaagg acaaacattt ctgagagtgt 5281 cagccattct tggtagacac ctctccactc ctcatcccac ctctacccat ctccatgcca 5341 caccttatcc agttagacac atacatacca atcattagaa gaacaagttt agaaggtgtg 5401 gaacttgtgc ctggctggct gggtagtcag ctgagcctgt tgctgagccc ggtggtctgg 5461 attggagtat ggccagggca ggagtacaca gaatagaatt tagactgtcc cttgagtaga 5521 atccactgat tttctgtggc tccagtgaga acaaggcttt gaaactgaac aagataactt 5581 ctagaaatga actgtactaa tccctttccc cagattgtat catgagtaga atcaggttca 5641 cgtggtgctt caaagccctg agaagaatat ttctttggac cccaggcact aggggccacc 5701 tgcctgggag tctccctgcc tcactcctct aggcagggga gtgatgcttc aggacgtgac 5761 aggctgttct aacatgtgtc tacctgaggg ctagttgaag gatccaggag tattttcttc 5821 ttgggtgggc ctgaacaaaa gccaaaaatt gtagaaacca gtctagaaaa agtcctgctc 5881 atctgtggcc actgccttct agccgtcctc caccttgcag aaagaatcta gcctttggtc 5941 tctctctctc tcatcggggt catttgctat tcccctctga tattcaaccc tatagaagga 6001 gcctggactc tgatccctct gtacaggctg gatggaaggg gccctccaca cttcctggga 6061 ggtcagagac aaactgtttc agagagtcag atggacttcc caagacttgt tgagagatgt 6121 gacatggttc ttggatttcc tctgtagcag cctcctggac ttcctgagga ctcgacattg 6181 tccacagatg tactggccat tacatgaaac aagaaaccaa gcatctttgc tgttgttaat 6241 tattatatgt gccattgtta caggagatta taggctagtc tgtaataaat tatcttatag 6301 caaaaaaaaa aaaaaaaaaa aaaaaaaaaa // LOCUS AF020351 668 bp mRNA PRI 02-DEC-1997 DEFINITION Homo sapiens NADH:ubiquinone oxidoreductase 18 kDa IP subunit mRNA, nuclear gene encoding mitochondrial protein, complete cds. ACCESSION AF020351 NID g2655052 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 668) AUTHORS Van Den Heuvel,L., Ruitenbeek,W., Smeets,R., Gelman-Kohan,Z., Elpeleg,O., Loeffen,J., Trijbels,F., Mariman,E., De Bruijn,D. and Smeitink,J. TITLE Demonstration of the first pathogenic mutation in human complex I deficiency: A 5 base pair duplication in the nuclear gene encoding the 18 kDa (AQDQ) subunit JOURNAL Unpublished (1997) REMARK Submitted (07-08-1997) Am. J. Hum. Genet. REFERENCE 2 (bases 1 to 668) AUTHORS Van Den Heuvel,L., Ruitenbeek,W., Smeets,R., Gelman-Kohan,Z., Elpeleg,O., Loeffen,J., Trijbels,F., Mariman,E., De Bruijn,D. and Smeitink,J. TITLE Direct Submission JOURNAL Submitted (21-AUG-1997) Department of Pediatrics and Human Genetics, University Hospital Nijmegen, P.O. Box 9101, Nijmegen, Gelderland 6500 HB, The Netherlands FEATURES Location/Qualifiers source 1..668 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="5" mRNA <1..668 /product="NADH:ubiquinone oxidoreductase 18 kDa IP subunit" CDS 9..536 /note="subunit of Mitochondrial Complex I" /codon_start=1 /product="NADH:ubiquinone oxidoreductase 18 kDa IP subunit" /db_xref="PID:g2655053" /translation="MAAVSMSVVLRQTLWRRRAVAVAALSVSRVPTRSLRTSTWRLAQ DQTQDTQLITVDEKLDITTLTGVPEEHIKTRKVRIFVPARNNMQSGVNNTKKWKMEFD TRERWENPLMGWASTADPLSNMVLTFSTKEDAVSFAEKNGWSYDIEERKVPKPKSKSY GANFSWNKRTRVSTK" polyA_signal 630..635 polyA_site 653 BASE COUNT 225 a 128 c 151 g 164 t ORIGIN 1 gcagcaagat ggcggcggtc tcaatgtcag tggtactgag gcagacgttg tggcggagaa 61 gggcagtggc tgtagctgcc ctttccgttt ccagggttcc gaccaggtcg ttgaggactt 121 ccacatggag attggcacag gaccagactc aagacacaca actcataaca gttgatgaaa 181 aattggatat cactacttta actggcgttc cagaagagca tataaaaact agaaaagtca 241 ggatctttgt tcctgctcgc aataacatgc agtctggagt aaacaacaca aagaaatgga 301 agatggagtt tgataccagg gagcgatggg aaaatccttt gatgggttgg gcatcaacgg 361 ctgatccctt atccaacatg gttctaacct tcagtactaa agaagatgca gtttcctttg 421 cagaaaaaaa tggatggagc tatgacattg aagagaggaa ggttccaaaa cccaagtcca 481 agtcttatgg tgcaaacttt tcttggaaca aaagaacaag agtatccaca aaataggttg 541 gcactgacta tatctctgct tgactgtgaa taaagtcagc tatgcagtat ttatagtcca 601 tgtataataa atacatctct taatctccta ataaattgga cctttaaact acaaaaaaaa 661 aaaaaaaa // LOCUS AF020352 510 bp mRNA PRI 02-DEC-1997 DEFINITION Homo sapiens NADH:ubiquinone oxidoreductase 15 kDa IP subunit mRNA, nuclear gene encoding mitochondrial protein, complete cds. ACCESSION AF020352 NID g2655054 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 510) AUTHORS Loeffen,J., Smeets,R., Smeitink,J., Ruitenbeek,W., Sengers,R., Trijbels,F., Balemans,M. and Van Den Heuvel,L. TITLE cDNA sequence, chromosomal localisation, tissue distribution, and a mutation analysis study of the 15 kDa Iron-Sulphur subunit of complex I JOURNAL Unpublished REMARK Submitted (15-09-97) Genomics REFERENCE 2 (bases 1 to 510) AUTHORS Van Den Heuvel,L., Ruitenbeek,W., Smeets,R., Gelman-Kohan,Z., Elpeleg,O., Loeffen,J., Trijbels,F., Mariman,E., De Bruijn,D. and Smeitink,J. TITLE Direct Submission JOURNAL Submitted (21-AUG-1997) Department of Pediatrics and Human Genetics, University Hospital Nijmegen, P.O. Box 9101, Nijmegen, Gelderland 6500 HB, The Netherlands FEATURES Location/Qualifiers source 1..510 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..510 /product="NADH:ubiquinone oxidoreductase 15 kDa IP subunit" CDS 61..381 /note="subunit of Mitochondrial Complex I" /codon_start=1 /product="NADH:ubiquinone oxidoreductase 15 kDa IP subunit" /db_xref="PID:g2655055" /translation="MPFLDIQKRFGLNIDRWLTIQSGEQPYKMAGRCHAFEKEWIECA HGIGYTRAEKECKIEYDDFVECLLRQKTMRRAGTIRKQRDKLIKEGKYTPPPHHIGKG EPRP" polyA_signal 479..484 polyA_site 499 BASE COUNT 156 a 104 c 139 g 111 t ORIGIN 1 accaatctga agtgggagcg gcggccagag aagagtcaag ggcacgagca tcgggtagcc 61 atgcctttct tggacatcca gaaaaggttc ggccttaaca tagatcgatg gttgacaatc 121 cagagtggtg aacagcccta caagatggct ggtcgatgcc atgcttttga aaaagaatgg 181 atagaatgtg cacatggaat cggttatact cgggcagaga aagagtgcaa gatagaatat 241 gatgatttcg tagagtgttt gcttcggcag aaaacgatga gacgtgcagg taccatcagg 301 aagcagcggg ataagctgat aaaggaagga aagtacaccc ctccacctca ccacattggc 361 aagggggagc ctcggccctg aacagagcag ctgctgatgt ctggaggctg attttcctgt 421 tctctgttct ccactggaaa ggttgtttac gacaaacctc cttgtcaaag tgtgtaaaaa 481 taaaggattg ctccatccta aaaaaaaaaa // LOCUS AF020500 1478 bp mRNA PRI 10-JAN-1998 DEFINITION Homo sapiens myristoyl CoA:protein N-myristoyltransferase mRNA, complete cds. ACCESSION AF020500 NID g2760893 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1478) AUTHORS Glover,C.J., Hartman,K.D. and Felsted,R.L. TITLE Human N-myristoyltransferase amino-terminal domain involved in targeting the enzyme to the ribosomal subcellular fraction JOURNAL J. Biol. Chem. 272 (45), 28680-28689 (1997) MEDLINE 98019247 REFERENCE 2 (bases 1 to 1478) AUTHORS Glover,C.J., Hartman,K.D. and Felsted,R.L. TITLE Direct Submission JOURNAL Submitted (21-AUG-1997) LDDRD, NCI-FCRDC, Bldg. 1052 Rm. 121, Frederick, MD 21702, USA FEATURES Location/Qualifiers source 1..1478 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 42..1478 /codon_start=1 /product="myristoyl CoA:protein N-myristoyltransferase" /db_xref="PID:g2443814" /translation="MMEGNGNGHEHCSDCENEEDNSYNRGGLSPANDTGAKKKKKKQK KKKEKGSETDSAQDQPVKMNSLPAERIQEIQKAIELFSVGQGPAKTMEEASKRSYQFW DTQPVPKLGEVVNTHGPVEPDKDNIRQEPYTLPQGFTWDALDLGDRGVLKELYTLLNE NYVEDDDNMFRFDYSPEFLLWALRPPGWLPQWHCGVRVVSSRKLVGFISAIPANIHIY DTEKKMVEINFLCVHKKLRSKRVAPVLIREITRRVHLEGIFQAVYTAGVVLPKPVGTC RYWHRSLNPRKLIEVKFSHLSRNMTMQRTMKLYRLPETPKTAGLRPMETKDIPVVHQL LTRYLKQFHLTPVMSQEEVEHWFYPQENIIDTFVVENANGEVTDFLSFYTLPSTIMNH PTHKSLKAAYSFYNVHTQTPLLDLMSDALVLAKMKGFDVFNALDLMENKTFLEKLKFG IGDGNLQYYLYNWKCPSMGAEKVGLVLQ" BASE COUNT 405 a 384 c 395 g 294 t ORIGIN 1 gtgagacagc agtgaagccg ccggcacctc cgctgccgca gatgatggaa gggaacggga 61 acggccatga gcactgcagc gattgcgaga atgaggagga caacagctac aaccggggtg 121 gtttgagtcc agccaatgac actggagcca aaaagaagaa aaagaaacaa aaaaagaaga 181 aagaaaaagg cagtgagaca gattcagccc aggatcagcc tgtgaagatg aactctttgc 241 cagcagagag gatccaggaa atacagaagg ccattgagct gttctcagtg ggtcagggac 301 ctgccaaaac catggaggag gctagcaagc gaagctacca gttctgggat acgcagcccg 361 tccccaagct gggcgaagtg gtgaacaccc atggccccgt ggagcctgac aaggacaata 421 tccgccagga gccctacacc ctgccccagg gcttcacctg ggatgctttg gacttgggcg 481 atcgtggtgt gctaaaagaa ctgtacaccc tcctgaatga gaactatgtg gaagatgatg 541 acaacatgtt ccgatttgat tattccccgg agtttctttt gtgggctctc cggccacccg 601 gctggctccc ccagtggcac tgtggggttc gagtggtctc aagtcggaaa ttggttgggt 661 tcattagcgc catcccagca aacatccata tctatgacac agagaagaag atggtagaga 721 tcaacttcct gtgtgtccac aagaagctgc gttccaagag ggttgctcca gttctgatcc 781 gagagatcac caggcgggtt cacctggagg gcatcttcca agcagtttac actgccgggg 841 tggtactacc aaagcccgtt ggcacctgca ggtattggca tcggtcccta aacccacgga 901 agctgattga agtgaagttc tcccacctga gcagaaatat gaccatgcag cgcaccatga 961 agctctaccg actgccagag actcccaaga cagctgggct gcgaccaatg gaaacaaagg 1021 acattccagt agtgcaccag ctcctcacca ggtacttgaa gcaatttcac cttacgcccg 1081 tcatgagcca ggaggaggtg gagcactggt tctaccccca ggagaatatc atcgacactt 1141 tcgtggtgga gaacgcaaac ggagaggtga cagatttcct gagcttttat acgctgccct 1201 ccaccatcat gaaccatcca acccacaaga gtctcaaagc tgcttattct ttctacaacg 1261 ttcacaccca gacccctctt ctagacctca tgagcgacgc ccttgtcctc gccaaaatga 1321 aagggtttga tgtgttcaat gcactggatc tcatggagaa caaaaccttc ctggagaagc 1381 tcaagtttgg cataggggac ggcaacctgc agtattacct ttacaattgg aaatgcccca 1441 gcatgggggc agagaaggtt ggactggtgc tacaataa // LOCUS AF020543 1993 bp mRNA PRI 09-OCT-1997 DEFINITION Homo sapiens palmitoyl-protein thioesterase-2 (PPT2) mRNA, complete cds. ACCESSION AF020543 NID g2501960 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1993) AUTHORS Soyombo,A.A. and Hofmann,S.L. TITLE Molecular cloning and expression of PPT2, a homolog of lysosomal palmitoyl-protein thioesterase with a distinct substrate specificity JOURNAL Unpublished REFERENCE 2 (bases 1 to 1993) AUTHORS Soyombo,A.A. and Hofmann,S.L. TITLE Direct Submission JOURNAL Submitted (22-AUG-1997) Internal Medicine and the Hamon Center for Therapeutic Oncology Research, University of Texas Southwestern Medical Center, 5323 Harry Hines Boulevard, Dallas, Texas 75235-8593, USA FEATURES Location/Qualifiers source 1..1993 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..1993 /gene="PPT2" mRNA 360..1268 /gene="PPT2" CDS 360..1268 /gene="PPT2" /note="lysosomal; similar to PPT1" /codon_start=1 /product="palmitoyl-protein thioesterase-2" /db_xref="PID:g2501961" /translation="MLGLWGQRLPAAWVLLLLPFLPLLLLAAPAPHRASYKPVIVVHG LFDSSYSFRHLLEYINETHPGTVVTVLDLFDGRESLRPLWEQVQGFREAVVPIMAKAP QGVHLICYSQGGLVCRALLSVMDDHNVDSFISLSSPQMGQYGDTDYLKWLFPTSMRSN LYRICYSPWGQEFSICNYWHDPHHDDLYLNASSFLALINGERDHPNATVWRKNFLRVG HLVLIGGPDDGVITPWQSSFFGFYDANETVLEMEEQLVYLRDSFGLKTLLARGAIVRC PMAGISHTAWHSNRTLYETCIEPWLS" BASE COUNT 363 a 592 c 538 g 500 t ORIGIN 1 ggggaacgct cgcgcggttg ccagagaaag ccccggacgt gacggatttg cgcgacccca 61 agcagcccgc ccttccccct cccatccgtc attcccctgc gctctctttc ctcacccttc 121 cccccgccac cgtgggttcc agacttggga taagtaaaca gcgggtggag cgaggcctac 181 ggacccaggc caggtgggag tctgcactct tcaaggggcc tgggctgctg ctcacgggta 241 ttaaagaact ccgcgttgtt catggctgag gcgatgcatt aggaagatcc tggacctaga 301 gaacaagtcc cccgaacgct gagttggagg cgggacttcg ggtgcgcgtt ggcgggagca 361 tgctggggct ctgggggcag cggctccccg cggcgtgggt cctgcttctg ttgcctttcc 421 tgccgctgct gctgcttgca gcccccgcgc cccaccgcgc gtcctacaag ccggtcatcg 481 tggtgcatgg gctcttcgac agctcgtaca gcttccgcca cctgctggaa tacatcaatg 541 agacacaccc cgggactgtg gtgacagtgc tcgatctctt cgatgggaga gagagcttgc 601 gacccctgtg ggaacaggtg caagggttcc gagaggctgt ggtccccatc atggcaaagg 661 cccctcaagg ggtgcatctc atctgctact cgcagggggg ccttgtgtgc cgggctctgc 721 tttctgtcat ggatgatcac aacgtggatt ctttcatctc cctctcctct ccacagatgg 781 gacagtatgg agacacggac tacttgaagt ggctgttccc cacctccatg cggtctaacc 841 tctatcggat ctgctatagc ccctggggcc aggaattctc catctgcaac tactggcatg 901 atccccacca cgatgacttg tacctcaatg ccagcagctt cctggccctg atcaatgggg 961 aaagagacca tcccaatgcc acagtatggc ggaagaactt tctgcgtgtg ggccacctgg 1021 tgctgattgg gggccctgat gatggtgtta ttactccctg gcagtccagc ttctttggtt 1081 tctatgatgc aaatgagacc gtcctggaga tggaggagca actggtttat ctgcgggatt 1141 cttttgggtt gaagactcta ttggcccggg gggccatagt gaggtgtcca atggccggta 1201 tctcccacac agcctggcac tccaaccgta ccctttatga gacctgcatt gaaccttggc 1261 tctcctgagg atatattcag gggtccccag gaactcctcg gtccagagac caagtggtgg 1321 ccttggaaag cagatgtcag gctttggtgt gcctgtgacc acctcattgc tcccatatta 1381 tcccccattt ttagtagaga cggggtttta gtagagactt ggcctcccag aacccccttc 1441 ctctggctcc tccatgaatg acaattccag gcctccccta cctcatgtcc tctcatttgg 1501 gggattgctc cgtgctgtcc ctttctctca aggccgaagt tgggaagtga gaaaccatgt 1561 ttttaacttg tggctgcttt tgctgctgct gctcctccgt atctggctgt atgggtggag 1621 aacccacccc ctgcccacca caggggtctc cttccaggcc actcaggaca tttttagctt 1681 ctctcctccc catgttccct tttttctcta aagtcccctg acatcagccc tcccaactcc 1741 taagagggac tacccatgag agtggggttc tgaggctccc ctatggggac agttccgttc 1801 ttgaagtgtc agtgttgggg aatatctgtg gcctatgagg cccatctcag gtttggggat 1861 cccccagtcc ctatgatcag tgttggagta cccccctggg agagcctagt ttctttgagg 1921 ccccaggccc tcttttaact acctttgaat aggtgttatc cctgtattta tggaaataaa 1981 gttccatttc ctc // LOCUS AF020591 3743 bp mRNA PRI 07-FEB-1998 DEFINITION Homo sapiens zinc finger protein mRNA, complete cds. ACCESSION AF020591 NID g2843170 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3743) AUTHORS Hu,P., Yu,L. and Zhang,M. TITLE Cloning of a novel human gene coding a zinc finger protein JOURNAL Unpublished REFERENCE 2 (bases 1 to 3743) AUTHORS Zhang,Q. TITLE Direct Submission JOURNAL Submitted (25-AUG-1997) Institute of Genetics, Fudan University, Lab of Human Gene Research, No. 220. Handan Road, Shanghai, 200433, People's Republic of China FEATURES Location/Qualifiers source 1..3743 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 246..2393 /codon_start=1 /evidence=experimental /product="zinc finger protein" /db_xref="PID:g2843171" /translation="MEARSMLVPPQASVCFEDVAMAFTQEEWEQLDLAQRTLYREVTL ETWEHIVSLGLFLSKSDVISQLEQEEDLCRAEQEAPRDWKAPLEENGLNSEKDRAREE LSHHVEVYRSGPEEPPSLVLGKVQDQSNQLREHQENSLRFMVLTSERLFAQREHCELE LGGGYSLPSTLSLLPTTLPTSTGFPKPNSQVKELKQNSAFINHEKNGADGKHCESHQC ARAFCQSIYLSKLGNVETGKKNPYEYIVSGDSLNYGSSLCFHGRTFSVKKSDDCKDYG NLFSHSVSLNEQKPVHFGKSQYECDECRETCSESLCLVQTERSGPGETPFRCEERCAA FPMASSFSDCNIILTTEKPSVCNQCGKSFSCCKLIHQRTHTGEKPFECTQCGKSFSQS YDLVIHQRTHTGEKPYECDLCGKSFTQRSKLITHQRIHTGEKPYQCIECRKSFTWNSN LIVHQRIHTGEKPYECTHCGKSFSQSYELVTHKRTHTGEKPFKCTQCGKSFSQKYDLV VHQRTHTGEKPYECNLCGKSFSQSSKLITHQRIHTGEKPYQCIECGKSFRWNSNLVIH QRIHTGEKPYDCTHCGKSFSQSYRLVAHKRTHTGEKPYECNECGKAFNRSTQLIRHLQ IHTGEKPYKCNQCNKAFARSSYLVMHQRTHTGEKPFECSQCGKAFSGSSNLLSHHRIH SGEKPYECSDCGKSFRQRSQLVVHRRTHTGEKP" BASE COUNT 1107 a 837 c 939 g 860 t ORIGIN 1 ggcggtgctt gcaggtggca gccaagagcc tcctggcaca gcggcaccag gcaggtgacg 61 cctattggac cccagaggtc atcccagctc cagacgtgga cacattttct tccaatgatg 121 tctacggatg aggaaactga ggcctggaga ggttaaagag acccgttcag agtcctacag 181 tggtagactg gtcttctgag gacctctgcc ctctacacag cggcctcttc aggtgcaggg 241 aggaaatgga agcacgttct atgctggttc caccccaggc atctgtgtgc ttcgaggatg 301 tggctatggc attcacacag gaggagtggg aacagctgga cctggcccag aggacactgt 361 accgagaggt gacactggag acctgggagc atattgtctc cctggggctt ttcctttcca 421 aatctgatgt gatctctcag ctggagcaag aagaggacct gtgcagggca gagcaggagg 481 ccccccgaga ctggaaagct ccacttgagg agaatgggtt gaattctgaa aaagatcgag 541 ctagggaaga actatcccac cacgtggaag tgtacaggag tggaccggag gagccaccct 601 ctttggtatt aggaaaagtg caagatcaga gcaaccagtt aagggaacac caggagaact 661 ccttgaggtt catggtactc acctcagaga gactgtttgc tcaaagggaa cattgtgagc 721 ttgaacttgg gggaggttat tctctacctt ctactttaag ccttctacct acaacattac 781 ctacaagtac aggtttccct aagcccaact cacaagttaa agagttgaaa caaaattcag 841 ctttcattaa tcatgagaaa aatggagcag atgggaagca ctgtgagagt catcagtgtg 901 ctagagcttt ctgtcagagt atttacttga gtaaacttgg aaacgttgaa acaggaaaga 961 aaaaccctta tgaatatatt gtcagtggtg actctctcaa ctatggttcc tccctttgtt 1021 ttcatggtag aactttttca gtgaagaaaa gtgatgactg taaggattat ggaaacctct 1081 tcagtcacag tgtgtctctg aatgaacaga agccagtgca ttttgggaaa agtcagtatg 1141 agtgtgatga gtgcagggaa acctgttctg agagtctgtg ccttgtacaa acagaaagaa 1201 gtggccctgg agagaccccc ttcagatgtg aggaacgctg tgctgccttc cccatggcct 1261 catctttttc tgactgtaac atcatactga ctacagagaa gccatctgtg tgtaatcagt 1321 gtggaaaatc tttcagctgt tgtaagctca tacaccagag aacacacact ggagaaaagc 1381 ccttcgaatg tactcagtgt gggaaatctt ttagccagag ctatgacctt gtcatacatc 1441 agaggacaca cactggagag aagccctatg agtgtgacct gtgtgggaaa tccttcaccc 1501 agagatccaa acttattaca catcagcgaa ttcacactgg agaaaaaccg tatcagtgta 1561 ttgaatgcag aaaatccttc acgtggaact ctaacctcat tgtacatcag agaattcata 1621 ctggagagaa accgtatgag tgcactcact gtggaaagtc cttcagccaa agctatgagt 1681 tagttacaca taaaagaact cacactggag aaaagccctt caaatgtact cagtgtggga 1741 aatctttcag ccagaagtat gaccttgttg tacatcagag gacacacact ggagagaagc 1801 cctatgagtg caacctgtgt gggaaatcct tctcccagag ttccaaactt attacgcatc 1861 agcgaattca cactggagaa aaaccgtatc agtgtattga atgtgggaaa tccttcagat 1921 ggaactctaa cctcgtcata catcagagaa ttcatactgg agagaaaccg tacgattgca 1981 ctcactgtgg aaagtccttc agccaaagct atcggttagt tgcacataaa agaactcaca 2041 ctggagaaaa gccctatgaa tgtaacgagt gtggaaaagc cttcaatcga agcactcagc 2101 tcatcaggca tctgcaaatt cacactgggg agaagccgta caaatgcaat cagtgcaata 2161 aagcctttgc aaggagctcc taccttgtga tgcatcagag aactcacact ggtgagaaac 2221 cttttgagtg tagtcagtgt gggaaagcct tttcagggag ctctaacctt ctttcccatc 2281 acagaattca ttctggagag aaaccctatg aatgtagtga ctgtgggaaa tccttccggc 2341 agcgatctca actagtagtg catcggcgga cacatactgg agagaaacct taggagtgca 2401 gtcattgtgg gaaagctttc atccagaggt ctcccctcat catgcaccag aggacgcatg 2461 tcggtgggaa gagctatcag tgcgacgtgt attaagcagc ggttgtgact cattgaacat 2521 cagaggacat atcctggaga aaagccctac gaatgcattg attgtgggaa agccttcaat 2581 gatcgctcaa cccttagtaa acacgagagg acacacactg gaggcaaacc ctatgaatgt 2641 gaccattgcg agaaagcctt tagccaacgg tgtcaactta ctaggcagca gagaattcat 2701 actggagaga agccctgtga atgttaacaa atgtggaaaa gcttccagtt atgatacttt 2761 ccttattcaa catgagaaag ctcatgggca agaaactcta tgaataacca agatggagcc 2821 gggtgcagtt gccaacacct gtaatcccag caccttgcga ggccgaggca ggtggatcac 2881 ttgagcccag gagtttgaaa ccagcctggg taacatggca aaatcctgtc tacaaaaaat 2941 acaaaaatta gcctagcatg gtggcgcata cctgtagtcc cagctactca ggaggctgag 3001 gtgggaggat cacttggact tgggaggcgg ttgttgcagt gagctgtgat catgccatca 3061 caccgctgcc ttgtgagaga ttgagacact ctaaataaat aataaccaag atgggaaatt 3121 tccctgccag accttgttta ctagaaatca ggtggccaaa acatgactct cagagtgggg 3181 cttcatgacc atgtgcatca gaattgcctg gagtgtgcac tgaaactgtg tattaccaag 3241 ctcactctag ccaactaaat aaaaatctct ggcagtaaaa tccaggagtc tgcagtttct 3301 aaaatcacac aggtgagtgg gcatggtggc tcatgcctgt aatcccagca ctctggaagg 3361 ctgaggtggg tggatcacct gaagtcagaa gtttgagacc agcctggcaa acatggcaaa 3421 actgtctcta ttaaaaatac aggccagggg ctgggcacgg tacctcacac ctgtaatccc 3481 agcacttagg taggccaagg tgggtggatc acgaggtcag gagattgaga ccatcctggc 3541 taatatggtg aaaccctgtc tctactaaaa atacaaaaaa attagctagg catggtggcg 3601 ggctcctgta gtcccagcta ctcaggaggc tagagcagga gaatggcgcg aacatgggag 3661 gcgtagcttg cagtgagctg agatcacacc actgcactcc agcctgggca acagagcaag 3721 actgtctcaa aaaaaaaaaa aaa // LOCUS AF020761 1404 bp mRNA PRI 24-JAN-1998 DEFINITION Homo sapiens stimulator of Fe transport mRNA, complete cds. ACCESSION AF020761 NID g2738924 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1404) AUTHORS Gutierrez,J.A., Yu,J., Rivera,S. and Wessling-Resnick,M. TITLE Functional expression cloning and characterization of SFT, a stimulator of Fe transport JOURNAL J. Cell Biol. 139 (4), 895-905 (1997) MEDLINE 98031921 REFERENCE 2 (bases 1 to 1404) AUTHORS Gutierrez,J.A., Yu,J., Rivera,S. and Wessling-Resnick,M. TITLE Direct Submission JOURNAL Submitted (26-AUG-1997) Nutrition, Harvard School of Public Health, 665 Huntington Ave, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..1404 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="K562" CDS 86..1102 /note="iron transporter; SFT" /codon_start=1 /product="stimulator of Fe transport" /db_xref="PID:g2738925" /translation="MKEFNHWKVLNIYCLSSPRKRSSLVPIQTDLILKLLHLACTVGS FAICRLCSFRIGFCKFLLEAGFCIFDFCVFGYIFILCREIHELKNYFSLFYFCMNLSH IDPVIDICVIIAIKYKKVEYIVLLDRCFLKYYFVCFYYILFVYLQIQQTCKRMDSEIC NVKIKNIFIYNQSTVKSRFFFNISSKLSTVYLFHCTMKPFDFYHFKCVSSKTKQTSKN TLKTVMRAFIILYALRKTFIMVFKILGHLHLQLTRSTMQLKKQPNYFLLKTRCFLHEK ILYVCLRCQFYKSVFRFHPLLAHFIICIFFLYRTKYILFTCMSTHYFFPVNSIENPNR LIIK" BASE COUNT 440 a 208 c 213 g 543 t ORIGIN 1 gaattcggct gtcgcactta ctgttcaata gtatatactc tgtatttgaa aaatagatgt 61 atatattcta ggtgataaat taaaaatgaa agaatttaat cattggaaag tattaaatat 121 atattgctta tcttctccaa ggaagaggag ttctctcgta cccatccaaa ctgacctaat 181 tctcaagctg cttcatcttg cttgtactgt aggttcattt gcaatttgta gattatgctc 241 cttcaggatt ggcttttgta aatttctgtt agaagctggt ttctgcattt ttgatttttg 301 tgtatttgga tacattttca tattgtgcag agaaatccat gagttaaaaa attatttttc 361 cctgttttat ttctgcatga acctaagtca cattgaccca gtaattgata tatgtgtgat 421 tattgcaatt aagtataaga aggtagaata tatagtttta ttagacagat gcttcctgaa 481 atattatttt gtatgttttt actatatcct ttttgtgtat ctacagatac aacagacatg 541 caagagaatg gactcagaaa tatgcaatgt aaaaatcaaa aacattttca tatataacca 601 gagtactgta aaatctaggt tttttttcaa cattagcagt aaattgagca ctgtttacct 661 gtttcattgt accatgaaac catttgattt ttaccatttt aaatgtgtct caagcaagac 721 aaaacaaact tccaaaaata cccttaagac tgtgatgaga gcatttatca ttttgtatgc 781 attgagaaag acatttatta tggtttttaa gatacttgga catctgcatc ttcagcttac 841 aagatctaca atgcagctga aaaagcaacc aaattatttt ttgctgaaaa ctagatgttt 901 tttacatgag aaaatactgt atgtgtgtct aagatgtcag ttttataaat ctgtattcag 961 atttcatcct ttgttagctc actttataat ttgtattttt tttctgtata gaactaaata 1021 tattctattt acatgtatgt caactcatta cttttttcct gtgaacagta ttgaaaaccc 1081 caaccggctg ataattaagt gaattaactg tgtctccctt gtcttaggat attctgtaga 1141 ttgattgcag atttcttaaa tctgaaatga ctttacactg taattctcag catactgatt 1201 atggagaaca cttgttttga attttgttat acttgactta actttattgc aatgtgaatt 1261 aattgactgc taagtaggaa gatgtgtaac ttttatttgt tgctattcac atttgaattt 1321 tttcctgtat aggcaatatt atattgacac cttttacaga tcttactgta gcaaaaacca 1381 tataaataaa atgctttttc tgct // LOCUS AF020833 1103 bp mRNA PRI 02-OCT-1997 DEFINITION Homo sapiens eukaryotic translation initiation factor 3 subunit (p42) mRNA, complete cds. ACCESSION AF020833 NID g2460199 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1103) AUTHORS Bandyopadhyay,A., Chaudhuri,J., Si,K., Tempst,P. and Maitra,U. TITLE 42kD subunit of eukaryotic translation initiation factor 3 JOURNAL Unpublished REFERENCE 2 (bases 1 to 1103) AUTHORS Bandyopadhyay,A., Chaudhuri,J., Si,K., Tempst,P. and Maitra,U. TITLE Direct Submission JOURNAL Submitted (26-AUG-1997) Dev. & Molecular Biol., Albert Einstein College of Medicine of Yeshiva University, 1300, Morris Park Avenue, Bronx, NY 10461, USA FEATURES Location/Qualifiers source 1..1103 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" gene 1..1103 /gene="p42" CDS 24..986 /gene="p42" /note="Similar to WP:F22B5.2 CE02197 RNA binding protein" /codon_start=1 /product="eukaryotic translation initiation factor 3 subunit" /db_xref="PID:g2460200" /translation="MPTGDFDSKPSWADQVEEEGEDDKCVTSELLKGIPLATGDTSPE PELLPGAPLPPPKEVINGNIKTVTEYKIDEDGKKFKIVRTFRIETRKASKAVARRKNW KKFGNSEFDPPGPNVATTTVSDDVSMTFITSKEDLNCQEEEDPMNKLKGQKIVSCRIC KGDHWTTRCPYKDTLGPMQKELAEQLGLSTGEKEKLPGELEPVQATQNKTGKYVPPSL RDGASRRGESMQPNRRADDNATIRVTNLSEDTRETDLQELFRPFGSISRIYLAKDKTT GQSKGFAFISFHRREDAARAIAGVSGFGYDHLILNVEWAKPSTN" BASE COUNT 280 a 332 c 324 g 167 t ORIGIN 1 gcggccgcgt cgaccttttt gcgatgccta ctggagactt tgattcgaag cccagttggg 61 ccgaccaggt ggaggaggag ggggaggacg acaaatgtgt caccagcgag ctcctcaagg 121 ggatccctct ggccacaggt gacaccagcc cagagccaga gctactgccg ggagctccac 181 tgccgcctcc caaggaggtc atcaacggaa acataaagac agtgacagag tacaagatag 241 atgaggatgg caagaagttc aagattgtcc gcaccttcag gattgagacc cggaaggctt 301 caaaggctgt cgcaaggagg aagaactgga agaagttcgg gaactcagag tttgaccccc 361 ccggacccaa tgtggccacc accactgtca gtgacgatgt ctctatgacg ttcatcacca 421 gcaaagagga cctgaactgc caggaggagg aggaccctat gaacaaactc aagggccaga 481 agatcgtgtc ctgccgcatc tgcaagggcg accactggac cacccgctgc ccctacaagg 541 atacgctggg gcccatgcag aaggagctgg ccgagcagct gggcctgtct actggcgaga 601 aggagaagct gccgggagag ctagagccgg tgcaggccac gcagaacaag acagggaagt 661 atgtgccgcc gagcctgcgc gacggggcca gccgccgcgg ggagtccatg cagcccaacc 721 gcagagccga cgacaacgcc accatccgtg tcaccaactt gtcagaggac acgcgtgaga 781 ccgacctgca ggagctcttc cggcctttcg gctccatctc ccgcatctac ctggctaagg 841 acaagaccac tggccaatcc aagggctttg ccttcatcag cttccaccgc cgcgaggatg 901 ctgcgcgtgc cattgccggg gtgtccggct ttggctacga ccacctcatc ctcaacgtcg 961 agtgggccaa gccgtccacc aactaagcca gctgccactg tgtactcggt ccgggaccct 1021 tggcgacaga agacagcctc cgagagcgcg ggctccaagg gcaataaagc agctccactc 1081 tcaaaaaaaa aaaaaaaaaa aag // LOCUS AF020918 1255 bp mRNA PRI 02-JAN-1998 DEFINITION Homo sapiens glutathione transferase (GSTA4) mRNA, complete cds. ACCESSION AF020918 NID g2738932 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1255) AUTHORS Board,P.G. TITLE Identification of cDNAs encoding two human Alpha class glutathione transferases (GSTA3 and GSTA4) JOURNAL Unpublished REFERENCE 2 (bases 1 to 1255) AUTHORS Board,P.G. TITLE Direct Submission JOURNAL Submitted (27-AUG-1997) Molecular Genetics, John Curtin School of Medical Research, PO Box 334, Canberra, ACT 2601, Australia FEATURES Location/Qualifiers source 1..1255 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6p12" /note="complete sequence of EST with GenBank Accession Number H22595" gene 1..1255 /gene="GSTA4" CDS 71..739 /gene="GSTA4" /EC_number="2.5.1.18" /note="Alpha class" /codon_start=1 /product="glutathione transferase" /db_xref="PID:g2738933" /translation="MAARPKLHYPNGRGRMESVRWVLAAAGVEFDEEFLETKEQLYKL QDGNHLLFQQVPMVEIDGMKLVQTRSILHYIADKHNLFGKNLKERTLIDMYVEGTLDL LELLIMHPFLKPDDQQKEVVNMAQKAIIRYFPVFEKILRGHGQSFLVGNQLSLADVIL LQTILALEEKIPNILSAFPFLQEYTVKLSNIPTIKRFLEPGSKKKPPPDEIYVRTVYN IFRP" BASE COUNT 348 a 266 c 272 g 369 t ORIGIN 1 gggaccgctg acctggcgct ttgtgcggct ccaggcctcc gagtgactcc agaaagcctg 61 aaaagctatc atggcagcaa ggcccaagct ccactatccc aacggaagag gccggatgga 121 gtccgtgaga tgggttttag ctgccgccgg agtcgagttt gatgaagaat ttctggaaac 181 aaaagaacag ttgtacaagt tgcaggatgg taaccacctg ctgttccaac aagtgcccat 241 ggttgaaatt gacgggatga agttggtaca gacccgaagc attctccact acatagcaga 301 caagcacaat ctctttggca agaacctcaa ggagagaacc ctgattgaca tgtacgtgga 361 ggggacactg gatctgctgg aactgcttat catgcatcct ttcttaaaac cagatgatca 421 gcaaaaggaa gtggttaaca tggcccagaa ggctataatt agatactttc ctgtgtttga 481 aaagatttta aggggtcacg gacaaagctt tcttgttggt aatcagctga gccttgcaga 541 tgtgatttta ctccaaacca ttttagctct agaagagaaa attcctaata tcctgtctgc 601 atttcctttc ctccaggaat acacagtgaa actaagtaat atccctacaa ttaagagatt 661 ccttgaacct ggcagcaaga agaagcctcc ccctgatgaa atttatgtga gaaccgtcta 721 caacatcttt aggccataaa acaacacatc catgtgtgag tgacagtgtg ttcctagaga 781 tggtattgtc tacagtcatg tcttaatgga tcccagctct gtcatggtgc tatctatgta 841 ttaagttggg tcctaagttg ggtcttttgt gtcaaagaga tcatctcttc tagaaatatc 901 aacctttttt gtccagtaaa taattgttag gggatcttta ttggaaaact tttttggaga 961 ggctggtatt taagttagat ctgattgggc tactcatgtc ctgtagccag ttcatcctca 1021 taataagaat gggcaggatc tcttgttctc tcctgagtgt ctttctactc tcctgagcgt 1081 ctttctgctc tccttatcct gttctcttat ccttatcccc tccagtctct gcctaatttt 1141 tagtgtttaa taacaaccga atgtctagta aatgactctc ctctgagctg taataaataa 1201 aatggtagta atgaatgcaa tcagtgttag ccaaaataaa gatttatgag tcatt // LOCUS AF021336 1272 bp mRNA PRI 18-OCT-1997 DEFINITION Homo sapiens DNA damage-inducible RNA binding protein (A18hnRNP) mRNA, complete cds. ACCESSION AF021336 NID g2541972 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1272) AUTHORS Sheikh,M.S., Carrier,F., Papathanasiou,M.A., Hollander,M.C., Zhan,Q., Yu,K. and Fornace,A.J. Jr. TITLE Identification of several human homologs of hamster DNA damage-inducible transcripts. Cloning and characterization of a novel UV-inducible cDNA that codes for a putative RNA-binding protein JOURNAL J. Biol. Chem. 272 (42), 26720-26726 (1997) MEDLINE 97476281 REFERENCE 2 (bases 1 to 1272) AUTHORS Sheikh,M.S., Carrier,F., Papathanasiou,M.A., Hollander,M.C., Zhan,Q., Yu,K. and Fornace,A.J. Jr. TITLE Direct Submission JOURNAL Submitted (27-AUG-1997) National Cancer Institute, National Institutes of Health, Laboratory of Molecular Pharmacology, Bldg. 37, Rm 5D02, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..1272 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..1272 /gene="A18hnRNP" CDS 83..601 /gene="A18hnRNP" /note="CIRP; putative; A18 heterogenous ribonucleoprotein" /codon_start=1 /product="DNA damage-inducible RNA binding protein" /db_xref="PID:g2541973" /translation="MASDEGKLFVGGLSFDTNEQSLEQVFSKYGQISEVVVVKDRETQ RSRGFGFVTFENIDDAKDAMMAMNGKSVDGRQIRVDQAGKSSDNRSRGYRGGSAGGRG FFRGGRGRGRGFSRGGGDRGYGGNRFESRSGGYGGSRDYYSSRSQSGGYSDRSSGGSY RDSYDSYATHNE" BASE COUNT 252 a 283 c 383 g 353 t 1 others ORIGIN 1 actcgcgcgt taggaggctc gggtcgttgt ggtgcgctgt cttcccgntt gcgtcaggga 61 cctgcccgac tcagtggccg ccatggcatc agatgaaggc aaactttttg ttggagggct 121 gagttttgac accaatgagc agtcgctgga gcaggtcttc tcaaagtacg gacagatctc 181 tgaagtggtg gttgtgaaag acagggagac ccagagatct cggggatttg ggtttgtcac 241 ctttgagaac attgacgacg ctaaggatgc catgatggcc atgaatggga agtctgtaga 301 tggacggcag atccgagtag accaggcagg caagtcgtca gacaaccgat cccgtgggta 361 ccgtggtggc tctgccgggg gccggggctt cttccgtggg ggccgaggac ggggccgtgg 421 gttctctaga ggaggagggg accgaggcta tggggggaac cggttcgagt ccaggagtgg 481 gggctacgga ggctccagag actactatag cagccggagt cagagtggtg gctacagtga 541 ccggagctcg ggcgggtcct acagagacag ttacgacagt tacgctacac acaacgagta 601 aaaacccttc ctgctcaaga tcgtccttcc aatggctgtg tgtttaaaga ttgtgggagc 661 ttcgctgaac gttaatgtgt agtaaatgca cctccttgta ttcccacttt cgtagtcatt 721 tcggttctga tcttgtcaaa cccagcctga ccgcttctga cgccgggatg gcctcgttac 781 tagacttttc tttttaagga agtgctgttt ttttttgagg gttttcaaaa cattttgaaa 841 agcatttact tttttgacca cgagccatga gttttcaaaa aaatcggggg ttgtgtgggt 901 ttttggtttt tgttttagtt tttggttgcg ttgccttttt tttttagtgg ggttggcccc 961 atgaagtggg tgccccactc acttctctga gatcgaacgg actgtgaatc cgctctttgt 1021 cggaagctga gcaagctgtg gcttttttcc aactccgtgt gacgtttctg agtgtagtgt 1081 ggtaggaccc cggcgggtgt ggcagcaact gccctggagc cccagcccct gcgtccatct 1141 gtgctgtgcg ccccacagta gacgtgcaga cgtccctgag aggttcttga agatgtttat 1201 ttatattgtc cttttttact gaagacgtac gcatactcca tcgatgttgt atttgcagtg 1261 ctgaggattc tt // LOCUS AF021818 1014 bp mRNA PRI 06-OCT-1997 DEFINITION Homo sapiens putative neurotransmitter receptor mRNA, complete cds. ACCESSION AF021818 NID g2465431 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1014) AUTHORS Zeng,Z.Z., Fan,P., Rand,E., Kyaw,H., Su,K., Madike,V., Carter,C.C. and Li,Y. TITLE Cloning of a Putative Neurotransmitter Receptor Expressed in Skeletal Muscle and Brain JOURNAL Unpublished REFERENCE 2 (bases 1 to 1014) AUTHORS Zeng,Z.Z., Fan,P., Rand,E., Kyaw,H., Su,K., Madike,V., Carter,C.C. and Li,Y. TITLE Direct Submission JOURNAL Submitted (29-AUG-1997) Protein Therapeutics, Human Genome Sciences, Inc., 9410 Key West Avenue, Rockville, MD 20850, USA FEATURES Location/Qualifiers source 1..1014 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6q23" CDS 1..1014 /codon_start=1 /product="putative neurotransmitter receptor" /db_xref="PID:g2465432" /translation="MRAVFIQGAEEHPAAFCYQVNGSCPRTVHTLGIQLVIYLTCAAG MLIIVLGNVFVAFAVSYFKALHTPTNFLLLSLALADMFLGLLVLPLSTIRSVESCWFF GDFLCRLHTYLDTLFCLTSIFHLCFISIDRHCAICDPLLYPSKFTVRVALRYILAGWG VPAAYTSLFLYTDVVETRLSQWLEEMPCVGSCQLLLNKFWGWLNFPLFFVPCLIMISL YVKIFVVATRQAQQITTLSKSLAGAAKHERKAAKTLGIVVGIYLLCWLPFTIDTMVDS LLHFITPPLVFDIFIWFAYFNSACNPIIYVFSYQWFRKALKLTLSQKVFSPQTRTVDL YQE" BASE COUNT 199 a 296 c 238 g 281 t ORIGIN 1 atgagagctg tcttcatcca aggtgctgaa gagcaccctg cggcattctg ctaccaggtg 61 aatgggtctt gccccaggac agtacatact ctgggcatcc agttggtcat ctacctgacc 121 tgtgcagcag gcatgctgat tatcgtgcta gggaatgtat ttgtggcatt tgctgtgtcc 181 tacttcaaag cgcttcacac gcccaccaac ttcctgctgc tctccctggc cctggctgac 241 atgtttctgg gtctgctggt gctgcccctc agcaccattc gctcagtgga gagctgctgg 301 ttcttcgggg acttcctctg ccgcctgcac acctacctgg acaccctctt ctgcctcacc 361 tccatcttcc atctctgttt catttccatt gaccgccact gtgccatctg tgaccccctg 421 ctctatccct ccaagttcac agtgagggtg gctctcaggt acatcctggc aggatggggg 481 gtgcccgcag catacacttc gttattcctc tacacagatg tggtagagac aaggctcagc 541 cagtggctgg aagagatgcc ttgtgtgggc agttgccagc tgctgctcaa taaattttgg 601 ggctggttaa acttcccttt gttctttgtc ccctgcctca ttatgatcag cttgtatgtg 661 aagatctttg tggttgctac cagacaggct cagcagatta ccacattgag caaaagcctg 721 gctggggctg ccaagcatga gagaaaagct gccaagaccc tgggcattgt tgtgggcata 781 tacctcttgt gctggctgcc cttcaccata gacacgatgg tcgacagcct ccttcacttt 841 atcacacccc cactggtctt tgacatcttt atctggtttg cttacttcaa ctcagcctgc 901 aaccccatca tctatgtctt ttcctaccag tggtttcgga aggcactgaa actcacactg 961 agccagaagg tcttctcacc gcagacacgc actgttgatt tgtaccaaga atga // LOCUS AF021819 900 bp mRNA PRI 02-OCT-1997 DEFINITION Homo sapiens RNA-binding protein regulatory subunit mRNA, complete cds. ACCESSION AF021819 NID g2460317 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 900) AUTHORS Pentyala,S., Beaudoin,R., Bhargava,D., Whyard,T.C., El-Maghrabi,M.R. and Hod,Y. TITLE Identification and Characterization of a Novel Protein that Regulates RNA-Protein Interaction JOURNAL J. Biol. Chem. (1997) In press REFERENCE 2 (bases 1 to 900) AUTHORS Beaudoin,R. and Hod,Y. TITLE Direct Submission JOURNAL Submitted (28-AUG-1997) Urology, SUNY at Stony Brook, Health Science Center, Stony Brook, NY 11794-8093, USA FEATURES Location/Qualifiers source 1..900 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="lung" 5'UTR 1..71 CDS 72..641 /codon_start=1 /product="RNA-binding protein regulatory subunit" /db_xref="PID:g2460318" /translation="MASKRALVILAKGAEEMETVIPVDVMRRAGIKVTVAGLAGKDPV QCSRDVVICPDASLEDAKKEGPYDVVVLPGGNLGAQNLSESAAVKEILKEQENRKGLI AAICAGPTALLAHEIGFGSKVTTHPLAKDKMMNGGHYTYSENRVEKDGLILTSRGPGT SFEFALAIVEALNGKEVAAQVKAPLVLKD" 3'UTR 642..900 polyA_signal 844..850 BASE COUNT 273 a 178 c 237 g 212 t ORIGIN 1 ggcacgagcg tgcgtgctgg cgtgcgttca ctttcagcct ggtgtggggc ttgtaaacat 61 ataacataaa aatggcttcc aaaagagctc tggtcatcct ggctaaagga gcagaggaaa 121 tggagacggt catccctgta gatgtcatga ggcgagctgg gattaaggtc accgttgcag 181 gcctggctgg aaaagaccca gtacagtgta gccgtgatgt ggtcatttgt cctgatgcca 241 gccttgaaga tgcaaaaaaa gagggaccat atgatgtggt ggttctacca ggaggtaatc 301 tgggcgcaca gaatttatct gagtctgctg ctgtgaagga gatactgaag gagcaggaaa 361 accggaaggg cctgatagcc gccatctgtg caggtcctac tgctctgttg gctcatgaaa 421 taggttttgg aagtaaagtt acaacacacc ctcttgctaa agacaaaatg atgaatggag 481 gtcattacac ctactctgag aatcgtgtgg aaaaagacgg cctgattctt acaagccggg 541 ggcctgggac cagcttcgag tttgcgcttg caattgttga agccctgaat ggcaaggagg 601 tggcggctca agtgaaggct ccacttgttc ttaaagacta gagcagcgaa ctgcgacgat 661 cacttagaga aacaggccgt taggaatcca ttctcactgt gttcgctcta aacaaaacag 721 tggtaggtta atgtgttcag aagtcgctgt ccttactact tttgcggaag tatggaagtc 781 acaactacac agagatttct cagcctacaa attgtgtcta tacatttcta agccttgttt 841 gcagaataaa cagggcattt agcaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa // LOCUS AF022080 627 bp mRNA PRI 11-NOV-1997 DEFINITION Homo sapiens R-ras3 mRNA, complete cds. ACCESSION AF022080 NID g2599365 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 627) AUTHORS Kimmelman,A., Tolkacheva,T., Lorenzi,M.V., Osada,M. and Chan,A.M. TITLE Identification and characterization of R-ras3: a novel member of the RAS gene family with a non-ubiquitous pattern of tissue distribution JOURNAL Oncogene 15 (22), 2675-2685 (1997) MEDLINE 98062166 REFERENCE 2 (bases 1 to 627) AUTHORS Kimmelman,A. and Chan,A.M.L. TITLE Direct Submission JOURNAL Submitted (01-SEP-1997) Cancer Center, Mt. Sinai School of Medicine, One, Gustave Levy Place, Box#1130, New York, NY 10029, USA FEATURES Location/Qualifiers source 1..627 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="M426 embryonic lung fibroblast cDNA library" CDS 1..627 /note="GTP-binding protein" /codon_start=1 /product="R-ras3" /db_xref="PID:g2599366" /translation="MATSAVPSDNLPTYKLVVVGDGGVGKSALTIQFFQKIFVPDYDP TIEDSYLKHTEIDNQWAILDVLDTAGQEEFSAMREQYMRTGDGFLIVYSVTDKASFEH VDRFHQLILRVKDRESFPMILVANKVDLMHLRKITREQGKEMATKHNIPYIETSAKDP PLNVDEAFHDLVRVIRQQIPEKSQKKKKKTKWRGDRATATHKLQCVIL" misc_feature 124..150 /note="encodes effector loop" misc_feature 613..624 /note="encodes prenylation signal" BASE COUNT 178 a 167 c 165 g 117 t ORIGIN 1 atggcgacca gcgccgtccc cagtgacaac ctccccacat acaagctggt ggtggtgggg 61 gatgggggtg tgggcaaaag tgccctcacc atccagtttt tccagaagat ctttgtgcct 121 gactatgacc ccaccattga agactcctac ctgaaacata cggagattga caatcaatgg 181 gccatcttgg acgttctgga cacagctggg caggaggaat tcagcgccat gcgggagcaa 241 tacatgcgca cgggggatgg cttcctcatc gtctactccg tcactgacaa ggccagcttt 301 gagcacgtgg accgcttcca ccagcttatc ctgcgcgtca aagacaggga gtcattcccg 361 atgatcctcg tggccaacaa ggtcgatttg atgcacttga ggaagatcac cagggagcaa 421 ggaaaagaaa tggcgaccaa acacaatatt ccgtacatag aaaccagtgc caaggaccca 481 cctctcaatg tcgacgaagc cttccatgac ctcgttagag taattaggca acagattccg 541 gaaaaaagcc agaagaagaa gaagaaaacc aaatggcggg gagaccgggc cacagcgacc 601 cacaaactgc aatgtgtgat cttgtga // LOCUS AF022108 2239 bp mRNA PRI 07-FEB-1998 DEFINITION Homo sapiens putative replication initiator origin recognition complex subunit Orc4Lp (ORC4L) mRNA, complete cds. ACCESSION AF022108 NID g2736148 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2239) AUTHORS Quintana,D.G., Hou,Z.H., Thome,K.C., Hendricks,M., Saha,P. and Dutta,A. TITLE Identification of HsORC4, a member of the human origin of replication recognition complex JOURNAL J. Biol. Chem. 272 (45), 28247-28251 (1997) MEDLINE 98019187 REFERENCE 2 (bases 1 to 2239) AUTHORS Quintana,D.G., Hou,Z.H., Thome,K.C., Saha,P. and Dutta,A. TITLE Direct Submission JOURNAL Submitted (29-AUG-1997) Pathology, Brigham and Women's Hospital, 75 Francis Street, Boston, MA 02115, USA REFERENCE 3 (bases 1 to 2239) AUTHORS Quintana,D.G., Hou,Z.H., Thome,K.C., Saha,P. and Dutta,A. TITLE Direct Submission JOURNAL Submitted (31-DEC-1997) Pathology, Brigham and Women's Hospital, 75 Francis Street, Boston, MA 02115, USA REMARK Sequence update by submitter FEATURES Location/Qualifiers source 1..2239 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..2239 /gene="ORC4L" CDS 127..1437 /gene="ORC4L" /codon_start=1 /product="putative replication initiator origin recognition complex subunit Orc4Lp" /db_xref="PID:g2736149" /translation="MSSRKSKSNSLIHTECLSQVQRILRERFCRQSPHSNLFGVQVQY KHLSELLKRTALHGESNSVLIIGPRGSGKTMLISHALKELMEIEEVSENVLQVHLNGL LQINDKIALKEITRQLNLENVVGDKVFGSFAENLSFLLEALKKGDRTSSCPVIFILDE FDLFAHHKNQTLLYNLFDISQSAQTPIAVIGLTCRLDILELLEKRVKSRFSHRQIHLM NSFGFPQYVKIFKEQLSLPAEFPDKVFAEKWNENVQYLSEDRSVQEVLQKHFNISKNL RSLHMLLMLALNRVTASHPFMTAVDLMEASQLCSMDSKANIVHGLSVLEICLIIAMKH LNDIYEEEPFNFQMVYNEFQKFVQRKAHSVYNFEKPVVMKAFEHLQQLELIKPMERTS GNSQREYQLMKLLLDNTQIMNALQKYPNCPTDVRQWATSSLSWL" BASE COUNT 731 a 402 c 430 g 676 t ORIGIN 1 aagcttggca cgaggacggt ccgcagcggc aggtgaagcc tagcagagga cgcggccagg 61 cgattcggtg aagcgattcc tgcaggcgtt ggttcccctc tttgacctgg atttgaattt 121 gttgaaatga gcagtcgtaa atcaaagagt aacagcttaa ttcacacaga gtgcctttca 181 caggtacaaa gaattttacg tgaaagattt tgtcgtcaga gtccacatag taacctattt 241 ggagtgcaag tacaatacaa acacttaagt gagctgctga aaagaactgc tctccatgga 301 gagagtaact ctgtccttat tatcggaccc cgaggatcag gaaaaactat gttaataagt 361 catgctttga aagaactcat ggaaatagaa gaagtgagtg aaaatgtatt acaagttcac 421 ttaaatggac tgctgcagat caatgacaaa atcgccctaa aggaaatcac aaggcagtta 481 aatctggaaa atgtagttgg agataaagtt tttggaagct ttgctgaaaa cctttcattt 541 cttctggaag ctttaaaaaa aggtgaccga actagcagtt gcccagtgat cttcatatta 601 gatgaatttg atctttttgc tcatcataaa aaccaaacac ttctctataa tctttttgac 661 atttctcagt ctgcacagac cccaatagca gttattggtc ttacatgtag attggatatt 721 ttggaactct tagaaaaaag agtgaagtca agattttctc accggcagat acacttaatg 781 aattcatttg gttttccaca gtatgttaaa atatttaaag aacagttatc tctacctgca 841 gagtttccag acaaggtttt tgctgagaag tggaatgaaa atgttcagta tctctcagaa 901 gatagaagtg tgcaagaagt actacagaag catttcaata tcagcaaaaa cctgcggtca 961 ttacacatgc tattgatgct tgctttaaat cgagtaacag catcgcaccc atttatgact 1021 gccgtagatc taatggaagc aagccaactg tgtagcatgg actcgaaagc aaatattgta 1081 catggtctat cagtcttgga aatctgtctt ataatagcaa tgaaacattt aaatgacatc 1141 tatgaggaag agccatttaa ttttcaaatg gtctataatg agtttcagaa gtttgttcaa 1201 aggaaagcac attccgttta taattttgaa aaacctgttg tcatgaaggc ttttgaacac 1261 ctgcagcaat tagaattaat aaagcccatg gaaagaactt caggaaattc acagagagag 1321 taccagctga tgaaactgct tttggataat actcaaatta tgaatgctct gcagaaatat 1381 cccaactgtc ctacagatgt gaggcagtgg gcaacatcct cactaagctg gttatgaata 1441 taaccagtga cttcaacttt ggcatttcat tcatacttct gtagagaacg gaaaactatt 1501 gtccattaac atgatatgct aaacattcta taaacattct tgtatttatg tgagacttgc 1561 ccatctactg tcttggctgt gtcttgcctt ttaatcatga acagttacat gatttataat 1621 ttcactgatt gagattactt tgtaagtagc tgttcagaag aataaaatat gactgtttta 1681 gggactagac catgtgcttt tttaacactt atatatataa tggtctattt gaagagctca 1741 cttcaaccta acagctagat gtctttacaa accttaaacc aaaggagtaa aaaaaacaat 1801 ggtaagcact gaagtataat aagtaacctt tggtacagca ggtttgctgc agtgtttttt 1861 tctgtccaca tgcaaatttt ggattctatc ccagacccag gttttctagt tcagaagact 1921 aaccagctta gtcagaagat ggttccatgg aagaaaaagg ccaaggagtt tgaagatttt 1981 tcttctagat ctagacatct caaaatgtgg tactcatacc atccacatca gaatcccttg 2041 taggattttc taaaattaca gattgttgag cctaccatag gtcaaaagga ctggaatttt 2101 ccttcttaac aagtatagtc atggcaccac gtaacatttt ggtcaatgac attgtataaa 2161 gggtggtctc ataagattat accatatttt tactgtacct tttctatgtc taaatataca 2221 aatgttttac cattgaaaa // LOCUS AF022109 2021 bp mRNA PRI 06-OCT-1997 DEFINITION Homo sapiens HsCdc18p (HsCdc18) mRNA, complete cds. ACCESSION AF022109 NID g2465436 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2021) AUTHORS Saha,P., Chen,J., Hendricks,M., Thome,K.C., Hou,Z.H. and Dutta,A. TITLE Human CDC6/Cdc18 associates with Orc1, cyclin/cdk and MCM proteins JOURNAL Unpublished REFERENCE 2 (bases 1 to 2021) AUTHORS Saha,P., Chen,J., Hendricks,M., Thome,K.C., Hou,Z.H. and Dutta,A. TITLE Direct Submission JOURNAL Submitted (29-AUG-1997) Pathology, Brigham and Women's Hospital, 75 Francis Street, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..2021 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..2021 /gene="HsCdc18" CDS 87..1769 /gene="HsCdc18" /note="similar to S. pombe Cdc18 and to S. cervisiae CDC6" /codon_start=1 /product="HsCdc18p" /db_xref="PID:g2465437" /translation="MPQTRSQAQATISFPKRKLSRALNKAKNSSDAKLEPTNVQTVTC SPRVKALPLSPRKRLGDDNLCNTPHLPPCSPPKQGKKENGPPHSHTLKGRRLVFDNQL TIKSPSKRELAKVHQNKILSSVRKSQEITTNSEQRCPLKKESACVRLFKQEGTCYQQA KLVLNTAVPDRLPAREREMDVIRNFLREHICGKKAGSLYLSGAPGTGKTACLSRILQD LKKELKGFKTIMLNCMSLRTAQAVFPAIAQEICQEEVSRPAGKDMMRKLEKHMTAEKG PMIVLVLDEMDQLDSKGQDVLYTLFEWPWLSNSHLVLIGIANTLDLTDRILPRLQARE KCKPQLLNFPPYTRNQIVTILQDRLNQVSRDQVLDNAAVQFCARKVSAVSGDVRKALD VCRRAIEIVESDVKSQTILKPLSECKSPSEPLIPKRVGLIHISQVISEVDGNRMTLSQ EGAQDSFPLQQKILVCSLMLLIRQLKIKEVTLGKLYEAYSKVCRKQQVAAVDQSECLS LSGLLEARGILGLKRNKETRLTKVFFKIEEKEIEHALKDKALIGNILATGLP" BASE COUNT 599 a 436 c 464 g 516 t 6 others ORIGIN 1 gaattccggg gaggcctggg gtctgtgagg cagcggagct gggtgaaggc tgcgggttcc 61 ggcgaggcct gagctgtgct gtcgtcatgc ctcaaacccg atcccaggca caggctacaa 121 tcagttttcc aaaaaggaag ctgtctcggg cattgaacaa agctaaaaac tccagtgatg 181 ccaaactaga accaacaaat gtccaaaccg taacctgttc tcctcgtgta aaagccctgc 241 ctctcagccc caggaaacgt ctgggcgatg acaacctatg caacactccc catttacctc 301 cttgttctcc accaaagcaa ggcaagaaag agaatggtcc ccctcactca catacactta 361 agggacgaag attggtattt gacaatcagc tgacaattaa gtctcctagc aaaagagaac 421 tagccaaagt tcaccaaaac aaaatacttt cttcagttag aaaaagtcaa gagatcacaa 481 caaattctga gcagagatgt ccactgaaga aagaatctgc atgtgtgaga ctattcaagc 541 aagaaggcac ttgctaccag caagcaaagc tggtcctgaa cacagctgtc ccagatcggc 601 tgcctgccag ggaaagggag atggatgtca tcaggaattt cttgagggaa cacatctgtg 661 ggaaaaaagc tggaagcctt tacctttctg gtgctcctgg aactggaaaa actgcctgct 721 taagccggat tctgcaagac ctcaagaagg aactgaaagg ctttaaaact atcatgctga 781 attgcatgtc cttgaggact gcccaggctg tattcccagc tattgctcag gagatttgtc 841 aggaagaggt atccaggcca gctgggaagg acatgatgag gaaattggaa aaacatatga 901 ctgcagagaa gggccccatg attgtgttgg tattggacga gatggatcaa ctggacagca 961 aaggccagga tgtattgtac acgctatttg aatggccatg gctaagcaat tctcacttgg 1021 tgctgattgg tattgctaat accctggatc tcacagatag aattctacct aggcttcaag 1081 ctagagaaaa atgtaagcca cagctgttga acttcccacc ttataccaga aatcagatag 1141 tcactatttt gcaagatcga cttaatcagg tatctagaga tcaggttctg gacaatgctg 1201 cagttcaatt ctgtgcccgc aaagtctctg ctgtttcagg agatgttcgc aaagcactgg 1261 atgtttgcag gagagctatt gaaattgtag agtcagatgt caaaagccag actattctca 1321 aaccactgtc tgaatgtaaa tcaccttctg agcctctgat tcccaagagg gttggtctta 1381 ttcacatatc ccaagtcatc tcagaagttg atggtaacag gatgaccttg agccaagaag 1441 gagcacaaga ttccttccct cttcagcaga agatcttggt ttgctctttg atgctcttga 1501 tcaggcagtt gaaaatcaaa gaggtcactc tggggaagtt atatgaagcc tacagtaaag 1561 tctgtcgcaa acagcaggtg gcggctgtgg accagtcaga gtgtttgtca ctttcagggc 1621 tcttggaagc caggggcatt ttaggattaa agagaaacaa ggaaacccgt ttgacaaagg 1681 tgtttttcaa gattgaagag aaagaaatag aacatgctct gaaagataaa gctttaattg 1741 gaaatatctt agctactgga ttgccttaaa ttcttctctt acaccccacc cgaaagtatt 1801 cagctgggca tttagagagc tacagtcttc attttagtgc tttacacatt cgggcctgaa 1861 aacaaatatg acctttttta cttganggcc aatggnattt taatctatag gattctttta 1921 atattagnca cagnaattaa tatctttggg ggtctttact tatttttacc ccnttaaaaa 1981 gtgaccgggg taggacccct tttttaattn ccatttcact a // LOCUS AF022150 1284 bp mRNA PRI 02-DEC-1997 DEFINITION Homo sapiens intermediate conductance calcium-activated potassium channel hIK1 (IK1) mRNA, complete cds. ACCESSION AF022150 NID g2655058 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1284) AUTHORS Ishii,T.M., Silvia,C., Hirschberg,B., Bond,C.T., Adelman,J.P. and Maylie,J. TITLE A Human Intermediate Conductance Calcium-activated Potassium Channel JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1997) In press REFERENCE 2 (bases 1 to 1284) AUTHORS Adelman,J.P., Bond,C.T., Maylie,J. and Silvia,C. TITLE Direct Submission JOURNAL Submitted (02-SEP-1997) Vollum Institute, Oregon Health Sciences Univ., 3181 SW Sam Jackson Park Road, Portland, OR 97201-3098, USA FEATURES Location/Qualifiers source 1..1284 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..1284 /gene="IK1" CDS 1..1284 /gene="IK1" /note="intermediate conductance calcium-activated potassium channel" /codon_start=1 /product="hIK1" /db_xref="PID:g2655059" /translation="MGGDLVLGLGALRRRKRLLEQEKSLAGWALVLAGTGIGLMVLHA EMLWFGGCSWALYLFLVKCTISISTFLLLCLIVAFHAKEVQLFMTDNGLRDWRVALTG RQAAQIVLELVVCGLHPAPVRGPPCVQDLGAPLTSPQPWPGFLGQGEALLSLAMLLRL YLVPRAVLLRSGVLLNASYRSIGALNQVRFRHWFVAKLYMNTHPGRLLLGLTLGLWLT TAWVLSVAERQAVNATGHLSDTLWLIPITFLTIGYGDVVPGTMWGKIVCLCTGVMGVC CTALLVAVVARKLEFNKAEKHVHNFMMDIQYTKEMKESAARVLQEAWMFYKHTRRKES HAARRHQRKLLAAINAFRQVRLKHRKLREQVNSMVDISKMHMILYDLQQNLSSSHRAL EKQIDTLAGKLDALTELLSTALGPRQLPEPSQQSK" BASE COUNT 229 a 388 c 411 g 256 t ORIGIN 1 atgggcgggg atctggtgct tggcctgggg gccttgagac gccgaaagcg cttgctggag 61 caggagaagt ctctggccgg ctgggcactg gtgctggcag gaactggcat tggactcatg 121 gtgctgcatg cagagatgct gtggttcggg gggtgctcgt gggcgctcta cctgttcctg 181 gttaaatgca cgatcagcat ttccaccttc ttactcctct gcctcatcgt ggcctttcat 241 gccaaagagg tccagctgtt catgaccgac aacgggctgc gggactggcg cgtggcgctg 301 accgggcggc aggcggcgca gatcgtgctg gagctggtgg tgtgtgggct gcacccggcg 361 cccgtgcggg gcccgccgtg cgtgcaggat ttaggggcgc cgctgacctc cccgcagccc 421 tggccgggat tcctgggcca aggggaagcg ctgctgtccc tggccatgct gctgcgtctc 481 tacctggtgc cccgcgccgt gctcctgcgc agcggcgtcc tgctcaacgc ttcctaccgc 541 agcatcggcg ctctcaatca agtccgcttc cgccactggt tcgtggccaa gctttacatg 601 aacacgcacc ctggccgcct gctgctcggc ctcacgcttg gcctctggct gaccaccgcc 661 tgggtgctgt ccgtggccga gaggcaggct gttaatgcca ctgggcacct ttcagacaca 721 ctttggctga tccccatcac attcctgacc atcggctatg gtgacgtggt gccgggcacc 781 atgtggggca agatcgtctg cctgtgcact ggagtcatgg gtgtctgctg cacagccctg 841 ctggtggccg tggtggcccg gaagctggag tttaacaagg cagagaagca cgtgcacaac 901 ttcatgatgg atatccagta taccaaagag atgaaggagt ccgctgcccg agtgctacaa 961 gaagcctgga tgttctacaa acatactcgc aggaaggagt ctcatgctgc ccgcaggcat 1021 cagcgcaagc tgctggccgc catcaacgcg ttccgccagg tgcggctgaa acaccggaag 1081 ctccgggaac aagtgaactc catggtggac atctccaaga tgcacatgat cctgtatgac 1141 ctgcagcaga atctgagcag ctcacaccgg gccctggaga aacagattga cacgctggcg 1201 gggaagctgg atgccctgac tgagctgctt agcactgccc tggggccgag gcagcttcca 1261 gaacccagcc agcagtccaa gtag // LOCUS AF022229 1096 bp mRNA PRI 27-JAN-1998 DEFINITION Homo sapiens translation initiation factor 6 (eIF6) mRNA, complete cds. ACCESSION AF022229 NID g2809382 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1096) AUTHORS Si,K., Chaudhuri,J., Chevesich,J. and Maitra,U. TITLE Molecular cloning and functional expression of a human cDNA encoding translation initiation factor 6 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (26), 14285-14290 (1997) MEDLINE 98070743 REFERENCE 2 (bases 1 to 1096) AUTHORS Si,K. and Maitra,U. TITLE Direct Submission JOURNAL Submitted (29-AUG-1997) DMB, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY 10461, USA FEATURES Location/Qualifiers source 1..1096 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="skeletal muscle" gene 1..1096 /note="AAF" /gene="eIF6" CDS 86..823 /gene="eIF6" /note="ribosome antiassociation factor; eIF6" /codon_start=1 /product="translation initiation factor 6" /db_xref="PID:g2809383" /translation="MAVRASFENNCEIGCFAKLTNTYCLVAIGGSENFYSVFEGELSD TIPVVHASIAGCRIIGRMCVGNRHGLLVPNNTTDQELQHIRNSLPDTVQIRRVEERLS ALGNVTTCNDYVALVHPDLDRETEEILADVLKVEVFRQTVADQVLVGSYCVFSNQGGL VHPKTSIEDQDELSSLLQVPLVAGTVNRGSEVIAAGMVVNDWCAFCGLDTTSTELSVV ESVFKLNEAQPSTIATSMRDSLIDSLT" BASE COUNT 219 a 317 c 324 g 236 t ORIGIN 1 gcggccgcgt cgacggacgg aggccgggat acttgggaaa ggatccgccg gccttgaact 61 cccgcctccg ccgcccctag gcctcatggc ggtccgagct tcgttcgaga acaactgtga 121 gatcggctgc tttgccaagc tcaccaacac ctactgtctg gtagcgatcg gaggctcaga 181 gaacttctac agtgtgttcg agggcgagct ctccgatacc atccccgtgg tgcacgcgtc 241 tatcgccggc tgccgcatca tcgggcgcat gtgtgtgggg aacaggcacg gtctcctggt 301 acccaacaat accaccgacc aggagctgca acacattcgc aacagcctcc cagacacagt 361 gcagattagg cgggtggagg agcggctctc agccttgggc aatgtcacca cctgcaatga 421 ctacgtggcc ttggtccacc cagacttgga cagggagaca gaagaaattc tggcagatgt 481 gctcaaggtg gaagtcttca gacagacagt ggccgaccag gtgctagtag gaagctactg 541 tgtcttcagc aatcagggag ggctggtgca tcccaagact tcaattgaag accaggatga 601 gctgtcctct cttcttcaag tcccccttgt ggcggggact gtgaaccgag gcagtgaggt 661 gattgctgct gggatggtgg tgaatgactg gtgtgccttc tgtggcctgg acacaaccag 721 cacagagctg tcagtggtgg agagtgtctt caagctgaat gaagcccagc ctagcaccat 781 tgccaccagc atgcgggatt ccctcattga cagcctcacc tgagtcacct tccaagttgt 841 tccatgggct cctggctctg gactgtggcc aaccttctcc acttccgccc aatctgtacc 901 ggatgctggc agggaggtgg cagagagctc actgggactg aggggctggg cacccaaccc 961 ttttccacct gtgcttatcg cctggatcta tcattactgc aaaaacctgc tctgttgtgc 1021 tggctggcag gccctgtggc tgctggctga gggttctgct gtcctgtggc accccattaa 1081 agtgcagttc ctccgg // LOCUS AF022385 1218 bp mRNA PRI 07-OCT-1997 DEFINITION Homo sapiens apoptosis-related protein TFAR15 (TFAR15) mRNA, complete cds. ACCESSION AF022385 NID g2465728 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1218) AUTHORS Wang,Y.G., Liu,H.T., Ma,D.L. and Zhang,Y.M. TITLE An Apoptosis-Related Gene cDNA Sequence Found in GM-CSF Deprived TF-1 cell line JOURNAL Unpublished REFERENCE 2 (bases 1 to 1218) AUTHORS Wang,Y.G., Liu,H.T., Ma,D.L. and Zhang,Y.M. TITLE Direct Submission JOURNAL Submitted (02-SEP-1997) Immuology, Beijing Medical University, Beijing 100083, P.R.China FEATURES Location/Qualifiers source 1..1218 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="premyeloid cell line TF-1" gene 1..1218 /gene="TFAR15" CDS 154..792 /gene="TFAR15" /note="apoptosis-related protein" /codon_start=1 /product="TFAR15" /db_xref="PID:g2465729" /translation="MRMTMEEMKNEAETTSMVSMPLYAVMYPVFNELERVNLSAAQTL RAAFIKAEKENPGLTQDIIMKILEKKSVEVNFTESLLRMAADDVEEYMIERPEPEFQA LNEKARALKQILSKIPDEINDRVRFLQTIKDIASAIKELLDTVNNVFKKYQYQNRRAL EHQKKEFVKYSKSFSDTLKTYFKDGKAINVFVSANRLIHQTNLILQTFKTVA" BASE COUNT 423 a 211 c 228 g 356 t ORIGIN 1 tgcaaggtgg gaagtgaagt cagtgcctca gttgctgatc agtgtgtttt ttgtgtccaa 61 ttcttttatc accaaaaaag ggaagaaata ttgcagtgaa tgaagattcc tctgcatttt 121 agcactgctt tttcaactgt agttggcttt tgaatgagga tgacaatgga agagatgaag 181 aatgaagctg agaccacatc catggtttct atgcccctct atgcagtcat gtatcctgtg 241 tttaatgagc tagaacgagt aaatctgtct gcagcccaga cactgagagc cgctttcatc 301 aaggctgaaa aagaaaatcc aggtctcaca caagacatca ttatgaaaat tttagagaaa 361 aaaagcgtgg aagttaactt cacggagtcc cttcttcgta tggcagctga tgatgtagaa 421 gagtatatga ttgaacgacc agagccagaa ttccaagccc taaacgaaaa ggcacgagca 481 cttaaacaaa ttctcagtaa gatcccagat gagatcaatg acagagtgag gtttctgcag 541 acaatcaagg atatagctag tgcaataaaa gaacttcttg atacagtgaa taatgtcttc 601 aagaaatatc aataccagaa ccgcagggca cttgaacacc aaaagaaaga atttgtaaag 661 tactccaaaa gtttcagtga tactctgaaa acgtatttta aagatggcaa ggcaataaat 721 gtgttcgtaa gtgccaaccg actaattcat caaaccaact taatacttca gaccttcaaa 781 actgtggcct gaaagttgta tattgttaag agatgtactt ctcagtggca gtattgaact 841 gcctttatct gtaaatttta aagtttgact gtataaatta tcagtccctc ctgaagggat 901 ctaatccaga atgttgaatg ggattattgc catcttacac catatttttg taaaatgtag 961 cttaatcata atctcacact gaagattttg catcactttt gctattatca ttcttttaag 1021 aattataagc caaaagaatt tacgccttaa tgtgtcatta tataacattc cttaaaagaa 1081 ttgtaaatat tggtgtttgt ttctgacatt ttaacttgaa agcgatatgc tgcaagataa 1141 tgtatttaac aatatttggt ggcaaatatt caataaatag tttacatccg aaaaaaaaaa 1201 aaaaaaaaaa acctgccc // LOCUS AF022655 7814 bp mRNA PRI 05-FEB-1998 DEFINITION Homo sapiens cep250 centrosome associated protein mRNA, partial cds. ACCESSION AF022655 NID g2832236 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7814) AUTHORS Mack,G.J., Rees,J., Sandblom,O., Balczon,R., Fritzler,M.J. and Rattner,J.B. TITLE Autoantibodies to a group of centrosomal proteins in human autoimmune sera reactive with the centrosome JOURNAL Arthritis Rheum. (1997) In press REFERENCE 2 (bases 1 to 7814) AUTHORS Mack,G.J., Rees,J., Sandblom,O., Balczon,R., Fritzler,M.J. and Rattner,J.B. TITLE Direct Submission JOURNAL Submitted (03-SEP-1997) Med. Biochemistry, University of Calgary, 3330 Hospital Dr. NW., Calgary, AB T2N-4N1, Canada REFERENCE 3 (bases 1 to 7814) AUTHORS Mack,G.J., Rees,J., Sandblom,O., Balczon,R., Fritzler,M.J. and Rattner,J.B. TITLE Direct Submission JOURNAL Submitted (04-FEB-1998) Med. Biochemistry, University of Calgary, 3330 Hospital Dr. NW., Calgary, AB T2N-4N1, Canada REMARK Sequence update by submitter FEATURES Location/Qualifiers source 1..7814 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 222..7550 /codon_start=1 /product="cep250 centrosome associated protein" /db_xref="PID:g2832237" /translation="METRSPGLNNMKPQSLQLVLEEQVLALQQQMAENQAASWRKLKN SQEAQQRQATLVRKLQAKVLQYRSWCQELEKRLEATGGPIPQRWENVEEPNLDELLVR LEEEQQRCESLAEVNTQIRLHMEKADVVNKALRADVEKLTVDWSRARDELMRKESQWQ MEQEFFKGYLKGEHGRLLSLWREVVTFRRHFLEMKSATDRDLMELKAEHVRLSGSLLT CCLRLTVGAQSREPNGSGRMDGREPAQLLLLLAKTQELEKEAHERSQELIQLKSQGDL EKAELQDRVTELSALLTQSQKQNEDYEKMIKALRETVEILETNHTELMEHEASLSRNA QEEKLSLQQVIKDITQVMVEEGDNIAQGSGLENSLELESSIFSQFDYQDADKALTLVR SVLTRRRQAVQDLRQQLAGCQEAVNLLQQQHDQWEEEGKALRQRLQKLTGERDTLAGQ TVDLQGEVDSLSKERELLQKAREELRQQLEVLEQEAWRLRRVNVELQLQGDSAQGQKE EQQEELHLAVRERERLQEMLMGLEAKQSESLSELITLREALESIHLEGELLRQEQTEV TAALARAEQSIAELSSSENTLKTEVADLRAAAVKLSALNEALALDKVGLNQQLLQLEE ENQSVCSRMEAAEQARNALQVDLAEAEKRREALWEKNTHLEAQLQKAEEAGAELQADL RDIQEEKEEIQKKLSESRHQQEAATTQLEQLHQEAKRQEEVLARAVQEKEALVREKAA LEVRLQAVERDRQDLAAQLQGLSSAKELLESSLFEAQQQNSVIDEPQGQLEVQIQTVT QAKEVIQGEVRCLKLELDTERSQAEQERDAAARQLAQAEQEGKTALEQQKAAHEKEVN QLREKWEKERSWHQQELAKALESLEREKMELEMRLKEQQTEMEAIQAQREEERTQAES ALCQMQLETEKERVSLLETLLQTQKELADASQQLERLRQDMKVQKLKEQETTGILQTQ LQEAQRELKEAARQHRDDLAALQEESSSLLQDKMDLQKQVEDLKSQLVAQDDSQRLVE QEVQEKLRETQEYNRIQKELEREKASLTLSLMEKEQRLLVLQEADSIRQQELSALRQD MQEAQGEQKELSAQMELLRQEVKEKEADFLAQEAQLLEELEASHITEQQLRASLWAQE AKAAQLHLRLRSTESQLEALAAEQQPGNQAQAQAQLASLYSALQQALGSVCESRPELS GGGDSAPSVWGLEPDQNGARSLFKRGPLLTALSAEAVASALLKLHQDLWKTQQTRDVL RDQVQKLEERLTDTEAEKSQVHTELQDLQRQLSQNQEEKSKWEGKQNSLESELMELHE TMASLQSRLRRAELQRMEAQGERELLQAAKENLTAQVEHLQAAVVEARAQASAAGILE EDLRTARSALKLKNEEVESERERAQALQEQGELKVAQGKALQENLALLTQTLAEREEE VETLRGQIQELEKQREMQKAALELLSLDLKKRNQEVDLQQEQIQELEKCRSVLEHLPM AVQEREQKLTVQREQIREPEKDRETQRNVLEHQLLELEKKDQMIESQRGQVQDLKKQL VTLECLALELEENHHKMECQQKLIKELEGQRETQRVALTHLTLDLEERSQELQAQSSQ IHDLESHSTVLARELQERDQEVKSQREQIEELQRQKEHLTQDLERRDQELMLQKERIQ VLEDQRTRQTKILEEDLEQIKLSLRERGRELTTQRQLMQERAEEGKGPSKAQRGSLEH MKLILRDKEKEVECQQEHIHELQELKDQLEQQLQGLHRKVGETSLLLSQREQEIVVLQ QQLQEAREQGELKEQSLQSQLDEAQRALAQRDQELEALQQEQQQAQGQEERVKEKADA LQGALEQAHMTLKERHGELQDHKEQARRLEEELAVEGRRVQALEEVLGDLRAESREQE KALLALQQQCAEQAQEHEVETRALQDSWLQAQAVLKERDQELEALRAESQSSRHQEEA ARARAEALQEALGKAHAALQGKEQHLLEQAELSRSLEASTATLQASLDACQAHSRQLE EALRIQEGEIQDQDLRYQEDVQQLQQALAQRDEELRHQQEREQLLEKSLAQRVQENMI QEKQNLGLEREEEEIRGLHQSVRELQLTLAQKEQEILELRETQQRNNLEALPHSHKTS PMEEQSLKLDSLEPRLQRELERLQAALRQTEAREIEWREKAQDLALSLAQTKASVSSL QEVAMFLQASVLERDSEQQRLQDELELTRRALEKERLHSPGATSTAELGSRGEQGVQL GEVSGVEAEPSPDGMEKQSWRQRLEHLQQAVARLEIDRSRLQRHNVQLRSTLEQVERE RRKLKREAMRAAQAGSLEISKATASSPTQQDGRGQKNSNAKCVAELQKEVVLLQAQLT LERKQKQDYITRSAQTSRELAGLHHSLSHSLLAVAQAPEATVLEAETRRLDESLTQSL TSPGPVLLHPSPSTTQAASR" BASE COUNT 2188 a 1867 c 2581 g 1178 t ORIGIN 1 ggaattccgg gaaatcctgg gataagagaa tagtttcctg gaagatctgt gcctccaacc 61 agcagagagg gattgagctt cattgaactc aacagagcca acatttcata gcaccatgtt 121 caagaggagg ttgaagtggc atggcaatgg ttagagaccc tgctgggcgt gaacaccctc 181 tggctaccta gggacctgtg ggcctaccac ctggtgccct catggagaca agaagccctg 241 ggttgaacaa catgaagccc cagtcactgc agctggtact ggaagagcag gtgctggcac 301 tacagcagca gatggcagag aatcaggcag cctcctggcg gaagctgaag aactcccagg 361 aggcccagca gagacaagca acccttgtga ggaagctgca ggccaaggtg ctgcagtacc 421 gaagctggtg ccaagagctg gagaagcggc tagaagccac tggaggacca atcccccaga 481 ggtgggaaaa tgtggaggag ccaaacctgg atgagctgct ggtccgattg gaggaggagc 541 aacagaggtg tgagagtcta gcagaggtga acacccagat tcgactgcac atggaaaaag 601 ctgacgtggt gaataaagcc cttagggcag atgtggaaaa actgacagtg gactggagcc 661 gggcccggga tgagctaatg aggaaggaga gccagtggca gatggagcag gagttcttca 721 agggctacct gaaaggggag cacggtcgcc ttctcagtct atggcgggag gttgtgacat 781 tccgacgcca cttcctggaa atgaagtcag ctactgacag agatctgatg gagctaaaag 841 ctgagcatgt gaggctttca gggtctctgt tgacctgttg tctgcgcttg actgtgggag 901 cacagtctcg ggaacccaac ggatctggaa gaatggatgg gcgggagccg gcccagctgc 961 tgctgctact agccaagacc caggagctgg agaaggaagc ccatgaaagg agccaggagt 1021 taatacagct gaagagtcaa ggggatctgg agaaggctga acttcaggac cgggtgaccg 1081 agctctctgc tctgttgacc cagtctcaga agcaaaatga agattatgaa aagatgataa 1141 aggctctgag agagacagtg gagatcctgg agacaaatca cacagaatta atggaacatg 1201 aagcatctct tagtaggaat gcgcaagagg agaagttgtc tttacagcag gtgatcaagg 1261 atataaccca ggtcatggtg gaagaagggg acaatatagc ccaaggctct ggtcttgaga 1321 actctttgga attggagtct agtatcttct cccagtttga ttaccaagat gcagacaagg 1381 ctcttactct ggtgcgttca gtgctgactc ggagacgcca ggctgtgcag gacctaaggc 1441 agcagcttgc aggctgtcaa gaggctgtga acttgttgca acagcagcat gatcagtggg 1501 aggaagaggg caaagccttg agacagcggc tgcagaagct cactggggag cgggacactc 1561 tggcagggca gactgtggac ctccagggag aggtggactc tctcagcaag gagcgagagc 1621 tgctgcagaa ggccagggaa gagctgcggc agcagctgga ggtgctagag caggaggcat 1681 ggcgcctgcg aagggtaaat gtggagcttc agctgcaggg ggactctgcc cagggccaga 1741 aggaggaaca gcaggaggag ctgcacctgg ctgtccggga gagggagcgt cttcaggaga 1801 tgctgatggg cctggaagcc aaacagtcag aatcactcag tgaactgatc actcttcggg 1861 aagccctgga gtcaattcac ctggaagggg agttactgag gcaagagcaa acggaagtga 1921 ccgcagcgct ggctagggca gagcagtcaa ttgcagagct gtcgagttct gaaaacaccc 1981 tgaagacaga agtagctgat cttcgggctg cagctgtcaa gctcagtgcc ttaaatgagg 2041 ctttggcgtt agataaagtt gggctgaacc agcagcttct ccagttagag gaggagaacc 2101 agtctgtgtg cagcagaatg gaggccgcag agcaggcgag aaatgctttg caggtcgacc 2161 tggcggaggc agagaagagg agggaagccc tgtgggaaaa gaacactcac ctggaggctc 2221 agctgcagaa agctgaggag gctggggctg agctgcaggc agatctcagg gacatccaag 2281 aagagaagga agaaattcaa aagaaactaa gtgagtcacg tcaccagcag gaggcagcca 2341 cgactcagct ggagcagcta catcaggagg caaagcgaca ggaagaagtg cttgccaggg 2401 cagtccagga gaaggaggcc ctagtacgag agaaagcggc tctagaggtg cggctgcagg 2461 ccgtggagcg tgaccggcag gacctcgctg cacaactaca ggggctcagc tcagccaagg 2521 agctactgga gagcagtctg tttgaagccc aacaacaaaa ttctgtgata gacgagccgc 2581 aggggcagct ggaggtccag attcaaactg tcactcaagc caaggaagta atccaagggg 2641 aagtgaggtg cctgaagctg gaactggaca ctgaacggag tcaggcagag caggagcggg 2701 atgctgcagc cagacagctg gcccaggctg agcaagaagg gaagactgcc ttggagcagc 2761 agaaggcagc ccatgagaaa gaggtgaacc agctccggga gaaatgggag aaggagcgct 2821 cctggcacca gcaggagctg gcaaaggctc tggagagctt agaaagggaa aaaatggagc 2881 tggaaatgag gctaaaggag cagcagacag aaatggaggc catccaggcc cagagggaag 2941 aagaacggac ccaggcagag agtgccctat gccagatgca gctggaaaca gagaaggaga 3001 gagtatccct cctggagaca ctgctgcaga cgcagaagga gctagcagat gccagccaac 3061 aactggaacg actgaggcag gacatgaaag tccagaaatt aaaggagcag gagaccactg 3121 ggatactaca gacccagctc caggaggctc aacgggagct gaaggaggca gcccggcagc 3181 acagagatga ccttgctgcc ctccaagaag agagcagctc cctgctgcag gataagatgg 3241 acctgcagaa gcaggtggag gacttgaagt ctcagctggt ggcccaggat gactcccaga 3301 ggctggtgga gcaggaggtt caggagaagc tgagagagac ccaggagtat aaccgaattc 3361 agaaggagct ggagagagag aaagccagcc tgactctgtc actgatggaa aaggaacaga 3421 gactccttgt tttacaagaa gctgactcta ttcgacaaca agagctgagt gccctgcgcc 3481 aggacatgca ggaggcccag ggagaacaga aagagctcag tgctcagatg gaattactaa 3541 ggcaagaggt gaaggaaaag gaggctgact ttctggccca ggaagcacag ctgctggagg 3601 agctggaggc gtctcatatc acggagcagc agctgcgagc ctccttgtgg gcccaggaag 3661 ccaaggcagc ccaactacac ctgcgactgc gcagcacaga gagccagcta gaagcgctgg 3721 ccgcagagca gcagcccggg aaccaggccc aggcccaggc ccagctggcc agcctctact 3781 ctgccctgca gcaggccctg gggtctgttt gtgagagcag gcctgagctg agtggtgggg 3841 gagactctgc tccttccgtc tggggccttg agccagacca gaatggagct aggagcctct 3901 ttaagagagg gcccctgctg actgctctct ccgctgaggc agtagcatct gccctcctca 3961 agcttcatca agacctgtgg aagactcaac agacccggga tgttctgagg gatcaggtcc 4021 agaaactgga agagcgtcta actgatactg aggctgagaa gagccaggtc cacacagagt 4081 tgcaggatct gcagagacag ctctcccaga atcaggaaga gaaatccaag tgggaaggaa 4141 agcagaactc cctagaatct gagctgatgg aactacatga aactatggca tccttacaga 4201 gtcgcctgcg gagagcagag ctacagcgaa tggaagccca gggtgagcga gagttacttc 4261 aggcagccaa ggagaacctg acagcccagg tggaacacct gcaagcagct gtcgtagaag 4321 ccagggctca ggcaagtgct gctggcatcc tggaagaaga cctgagaacg gctcgctcag 4381 cactgaagct gaaaaatgag gaagtagaga gtgagcgtga gagagcccag gctctgcaag 4441 agcagggcga actgaaggtg gcccaaggga aggctctgca agagaatttg gccctcctga 4501 cccagaccct agctgaaaga gaagaggagg tggagactct gcggggacaa atccaggaac 4561 tggagaagca acgggaaatg cagaaggctg ctttggaatt gctgtctctg gacctgaaga 4621 agaggaacca agaggtagat ctgcagcaag aacagattca ggagctagag aagtgtaggt 4681 ctgttttaga gcatctgccc atggccgtcc aggagcgaga gcagaagctg actgtgcaga 4741 gggagcagat cagagagccc gagaaggatc gggagactca gaggaacgtc ttggagcatc 4801 agcttctaga acttgagaag aaagaccaaa tgattgagtc ccagagagga caggttcagg 4861 acctgaaaaa gcagttggtt actctggaat gcctggccct ggaactggag gaaaaccatc 4921 acaagatgga gtgccagcaa aaactgatca aggagctgga gggccagagg gaaacccaga 4981 gagtggcttt gacccacctt acgctggacc tagaagaaag gagccaggag ctgcaggcac 5041 aaagcagcca gatccatgac ctggagagcc acagcaccgt tctggcaaga gagctgcagg 5101 agagggacca ggaggtgaag tctcagcgag aacagatcga ggagctgcag aggcagaaag 5161 agcatctgac tcaggatctc gagaggagag accaggagct gatgctgcag aaggagagga 5221 ttcaggttct cgaggatcag aggacccggc agaccaagat cctggaggag gacctggaac 5281 agatcaagct gtccttgaga gagcgaggcc gggagctgac cactcagagg cagctgatgc 5341 aggaacgggc agaggaaggg aagggcccaa gtaaagcaca gcgcgggagc ctagagcaca 5401 tgaagctgat cctgcgtgat aaggagaagg aggtggaatg tcagcaggag catatccatg 5461 aactccagga gctcaaagac cagctggagc agcagctcca gggcctgcac aggaaggtag 5521 gtgagaccag cctcctcctg tcccagcgag agcaggaaat agtggtcctg cagcagcaac 5581 tgcaggaagc cagggaacaa ggggagctga aggagcagtc acttcagagt caactggatg 5641 aggcccagag agccctagcc cagagggacc aggaactgga ggctctgcag caagaacagc 5701 agcaggccca gggacaggag gagagggtga aggaaaaggc agacgccctc cagggagctc 5761 tggagcaagc ccatatgaca ctgaaggagc gtcatggaga gcttcaggac cacaaggaac 5821 aggcacgaag gctggaggaa gagctggcag tggagggacg gcgggtccaa gccctggagg 5881 aggtgctggg agacctaagg gctgagtctc gggaacagga gaaagctctg ttggccctcc 5941 agcagcagtg tgctgagcag gcacaggagc atgaggtgga gaccagggcc ctgcaggaca 6001 gctggctgca ggcccaggca gtgctcaagg aacgggacca ggagctggaa gctctgcggg 6061 cagaaagtca gtcctcccgg catcaggagg aggctgcccg ggcccgggct gaggctctgc 6121 aggaggccct tggcaaggct catgctgccc tgcaggggaa agagcagcat ctcctcgagc 6181 aggcagaatt gagccgcagt ctggaggcca gcactgcaac cctgcaagcc tccctggatg 6241 cctgccaggc acacagtcgg cagctggagg aggctctgag gatacaagaa ggtgagatcc 6301 aggaccagga tctccgatac caggaggatg tgcagcagct gcagcaggca cttgcccaga 6361 gggatgaaga gctgagacat cagcaggaac gggagcagct gctggagaag tctctggccc 6421 agagggtcca agagaatatg atccaagaga agcagaatct ggggctagag agagaagagg 6481 aggagataag gggccttcat cagagtgtaa gggagctaca gctgactcta gcccaaaagg 6541 aacaggagat tctggagctg agggagaccc agcaaaggaa caacctggaa gccttacccc 6601 acagccacaa aacctcccca atggaggaac aatctctaaa acttgattct ttagagccca 6661 ggctgcagcg ggagctggag cggctacagg cagccctgag acagacagaa gccagggaga 6721 ttgagtggag ggagaaggcc caggacttgg cactctccct agcgcagacc aaggccagtg 6781 tcagcagtct gcaggaggtt gccatgttcc tacaagcctc tgtcctggag cgggactcag 6841 aacagcaaag gctgcaggat gaactggagc tcaccagacg ggctctggag aaggagcggc 6901 tacacagccc aggtgcaacc agcacagcag aactggggtc cagaggggag cagggtgtgc 6961 agctgggaga ggtctcagga gtggaggctg agcctagtcc tgatggaatg gagaagcagt 7021 catggagaca aaggcttgaa cacctgcagc aagcagtggc ccggctggag attgacagga 7081 gcaggctgca gcgccacaat gtccagctgc ggagtacctt ggagcaggtg gagcgagaac 7141 ggaggaagct gaagagggag gccatgcgtg cggcccaggc agggtcccta gagatcagca 7201 aggccacggc ttcttcaccc acacagcagg atgggagagg acagaagaac tcaaatgcca 7261 agtgtgtggc tgaactgcag aaagaggtgg tcctgctgca agctcagctg actttggagc 7321 ggaagcagaa gcaggactac atcacccgct cagcacagac cagccgtgag ctagcaggcc 7381 tgcaccacag cctctcacac tcacttcttg ccgtggccca ggcccctgag gccactgtcc 7441 tggaggcaga gacccgcagg ctggatgagt ccctgactca aagtctgaca tccccagggc 7501 cagtcctgct acaccccagc cccagcacta cccaagccgc ctccaggtag cagccacagc 7561 caggagcaca cagacagaag actgtgtcat gggtcatggc ccctccgcac acctacaggt 7621 ttgccaaagg aaaagcctgg ctctgttagg cacccaggag ccccaggtcg gcgggtgttc 7681 ccaggaagag gaagtaaatc tgcaaccctg gggaggaccc caactcacct gggaatgagg 7741 caaattgcat ttgcttgctc cctatggaat cacccagagg ggtgccttgc cctggctgag 7801 ggacccggaa ttcc // LOCUS AF022799 2352 bp mRNA PRI 09-OCT-1997 DEFINITION Homo sapiens digestive tract-specific calpain (nCL-4) mRNA, complete cds. ACCESSION AF022799 NID g2502076 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Unclassified. REFERENCE 1 (bases 1 to 2352) AUTHORS Lee,H.J., Sorimachi,H., Jeong,S.Y., Ishiura,S. and Suzuki,K. TITLE Molecular cloning and characterization of a novel tissue-specific calpain predominantly expressed in the digestive tract JOURNAL Unpublished REFERENCE 2 (bases 1 to 2352) AUTHORS Sorimachi,H., Lee,H.J. and Suzuki,K. TITLE Direct Submission JOURNAL Submitted (04-SEP-1997) Molecular Biology, Inst. of Molecular and Cellular Biosciences, Univ. of Tokyo, 1-1-1 Yayoi, Bunkyo-ku, Tokyo 113, Japan FEATURES Location/Qualifiers source 1..2352 /organism="Homo sapiens" /chromosome="1" /map="1; close to SHGC-30183" /tissue_type="stomach" gene 1..2352 /gene="nCL-4" CDS 114..2186 /gene="nCL-4" /EC_number="3.4.22.17" /note="this molecule is predominantly expressed in the digestive tract, for instance in the stomach and intestine; calcium-dependent cysteine proteinase" /codon_start=1 /product="digestive tract-specific calpain" /db_xref="PID:g2502077" /translation="MPYLYRAPGPQAHPVPKDARITHSSGQSFEQMRQECLQRGTLFE DADFPASNSSLFYSERPQIPFVWKRPGEIVKNPEFILGGATRTDICQGELGDCWLLAA IASLTLNQKALARVIPQDQSFGPGYAGIFHFQFWQHSEWLDVVIDDRLPTFRDRLVFL HSADHNEFWSALLEKAYAKLNGSYEALKGGSAIEAMEDFTGGVAETFQTKEAPENFYE ILEKALKRGSLLGCFIDTRSAAESEARTPFGLIKGHAYSVTGIDQVSFRGQRIELIRI RNPWGQVEWNGSWSDSSPEWRSVGPAEQKRLCHTALDDGEFWMAFKDFKAHFDKVEIC NLTPDALEEDAIHKWEVTVHQGSWVRGSTAGGCRNFLDTFWTNPQIKLSLTEKDEGQE ECSFLVALMQKDRRKLKRFGANVLTIGYAIYECPDKDEHLNKDFFRYHASRARSKTFI NLREVSDRFKLPPGEYILIPSTFEPHQEADFCLRIFSEKKAITRDMDGNVDIDLPEPP KPTPPDQETEEEQRFRALFEQVAGEDMEVTAEELEYVLNAVLQKKKDIKFKKLSLISC KNIISLMDTSGNGKLEFDEFKVFWDKLKQWINLFLRFDADKSGTMSTYELRTALKAAG FQLSSHLLQLIVLRYADEELQLDFDDFLNCLVRLENASRVFQALSTKNKEFIHLNINE FIHLTMNI" BASE COUNT 577 a 618 c 627 g 530 t ORIGIN 1 actcagccca gtggccctct gagctgttcc ttcttgaccg gcacacacag ctcgcttctt 61 cactttcttt tccatccact gccggaccca agccagcctt ccagggagca gccatgcctt 121 acctctaccg ggccccaggg cctcaggcac acccggttcc caaggacgcc cggatcaccc 181 actcctcagg ccagagcttt gagcaaatga ggcaggagtg cctgcagaga ggcaccctgt 241 ttgaggatgc agacttccca gccagcaatt cctccctgtt ctacagtgag aggccgcaga 301 tcccctttgt gtggaaacga ccaggggaaa tcgtgaaaaa cccagaattc attcttggag 361 gggccaccag gactgatatc tgccagggag agctgggaga ctgctggcta ttagccgcca 421 tcgcctccct tacgcttaat caaaaagcac tggccagagt catcccccag gaccaaagct 481 ttggccctgg ttatgccggg atattccatt tccagttctg gcagcacagt gagtggctgg 541 acgtggtgat cgatgaccgc ctgcccacct tcagggaccg cttggttttc ctccactctg 601 ccgaccacaa tgagttctgg agcgccttgc tggaaaaagc ctacgccaag ctaaatggga 661 gctatgaagc tctgaaggga ggcagcgcca tcgaggccat ggaagacttc accgggggtg 721 tggcagagac cttccaaact aaagaggccc ccgagaactt ctatgagatt ctagagaagg 781 ctttgaagag aggctccctg ctgggctgct tcattgatac cagaagtgct gcagaatctg 841 aggcccggac gccgtttggt cttattaagg gtcatgccta cagtgtaacg ggaattgacc 901 aggtaagctt ccgaggccag agaatcgagc tcatccgaat ccggaaccct tggggccagg 961 ttgagtggaa cgggtcgtgg agcgacagtt ctccggagtg gcgttctgtt ggtccagctg 1021 agcagaagcg tctgtgtcac actgctctgg atgatgggga attctggatg gcatttaagg 1081 acttcaaggc ccactttgat aaagtggaga tctgcaacct cactcccgat gccctggagg 1141 aagacgcgat ccacaaatgg gaggtgacgg tccatcaggg aagctgggtt cgcggctcca 1201 cggctggggg ctgccgcaat ttcctggata ccttttggac caatccacaa ataaaattgt 1261 ctctgactga gaaagatgag gggcaggagg agtgtagttt ccttgtagcc ctgatgcaga 1321 aagatagaag gaaactcaag agatttggtg ccaatgtgct gacaatcggc tatgccattt 1381 atgagtgccc tgacaaagac gaacacctga acaaagactt cttcagatac cacgcttctc 1441 gggccagaag caagacgttc atcaacctga gagaagtctc cgaccggttc aagctgcccc 1501 ctggggagta catcctgatt cccagcactt ttgagcccca ccaggaagct gatttctgtc 1561 tgagaatctt ttcagagaaa aaagccatta cccgggatat ggatggaaat gtagacattg 1621 accttcctga gcctccaaag ccaactccac ctgaccagga gacagaggag gagcagcggt 1681 ttcgggctct gtttgaacaa gtcgctggtg aggacatgga ggtgacagca gaggaacttg 1741 agtatgtttt aaatgctgtg ctgcaaaaga aaaaggacat caaattcaag aagctaagcc 1801 tgatctcctg taaaaacatc atttccctga tggacaccag cggcaatggg aagctggagt 1861 ttgatgaatt caaagtgttc tgggacaagc tgaagcagtg gattaacctt ttccttcggt 1921 ttgatgctga caagtccggc accatgtcta cctatgaact acggactgca ctgaaagctg 1981 caggctttca gctgagcagc cacctcctgc agctgattgt gctcaggtat gcggatgagg 2041 agctccagct ggacttcgat gacttcctca actgcctggt ccggctggag aatgcgagcc 2101 gggtgttcca ggctctcagt acaaagaaca aggagttcat tcatctcaat ataaatgagt 2161 tcatccattt gacaatgaac atctgaggct gccttgtaga gatgcagcct gcccagctga 2221 atcttggctt ctggaccttg accttcagaa cttctcttgg tgtggaacca ttacgcccag 2281 ggttcactcc cctctcatcg tccggccttc tcccttcatc ttgatctggg aagaatgaaa 2341 tgaactcagc ta // LOCUS AF022813 1358 bp mRNA PRI 18-NOV-1997 DEFINITION Homo sapiens tetraspan (NAG-2) mRNA, complete cds. ACCESSION AF022813 NID g2586349 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1358) AUTHORS Tachibana,I., Bodorova,J., Berditchevski,F., Zutter,M.M. and Hemler,M.E. TITLE NAG-2, a novel transmembrane-4 superfamily (TM4SF) protein that complexes with integrins and other TM4SF proteins JOURNAL J. Biol. Chem. 272 (46), 29181-29189 (1997) MEDLINE 98030601 REFERENCE 2 (bases 1 to 1358) AUTHORS Tachibana,I., Bodorova,J., Berditchevski,F., Zutter,M.M. and Hemler,M.E. TITLE Direct Submission JOURNAL Submitted (04-SEP-1997) Tumor Virology, Dana-Farber Cancer Institute, 44 Binney Street, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..1358 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="breast cancer" /cell_line="MDA-MB-435" gene 1..1358 /gene="NAG-2" CDS 105..821 /gene="NAG-2" /codon_start=1 /product="tetraspan" /db_xref="PID:g2586350" /translation="MARACLQAVKYLMFAFNLLFWLGGCGVLGVGIWLAATQGSFATL SSSFPSLSAANLLIITGAFVMAIGFVGCLGAIKENKCLLLTFFLLLLLVFLLEATIAI LFFAYTDKIDRYAQQDLKKGLHLYGTQGNVGLTNAWSIIQTDFRCCGVSNYTDWFEVY NATRVPDSCCLEFSESCGLHAPGTWWKAPCYETVKVWLQENLLAVGIFGLCTALVQIL GLTFAMTMYCQVVKADTYCA" BASE COUNT 265 a 428 c 400 g 265 t ORIGIN 1 cttggtcgca cccaccacct gcctgcccac tggtcagcct tcagggaccc tgagcaccgc 61 ctggtctctt tcctgtggcc agcccagaac tgaagcgctg cggcatggcg cgcgcctgcc 121 tccaggccgt caagtacctc atgttcgcct tcaacctgct cttctggctg ggaggctgtg 181 gcgtgctggg tgtcggcatc tggctggccg ccacacaggg gagcttcgcc acgctgtcct 241 cttccttccc gtccctgtcg gctgccaacc tgctcatcat caccggcgcc tttgtcatgg 301 ccatcggctt cgtgggctgc ctgggtgcca tcaaggagaa caagtgcctc ctgctcactt 361 tcttcctgct gctgctgctg gtgttcctgc tggaggccac catcgccatc ctcttcttcg 421 cctacacgga caagattgac aggtatgccc agcaagacct gaagaaaggc ttgcacctgt 481 acggcacgca gggcaacgtg ggcctcacca acgcctggag catcatccag accgacttcc 541 gctgctgtgg cgtctccaac tacactgact ggttcgaggt gtacaacgcc acgcgggtac 601 ctgactcctg ctgcttggag ttcagtgaga gctgtgggct gcacgccccc ggcacctggt 661 ggaaggcgcc gtgctacgag acggtgaagg tgtggcttca ggagaacctg ctggctgtgg 721 gcatctttgg gctgtgcacg gcgctggtgc agatcctggg cctgaccttc gccatgacca 781 tgtactgcca agtggtcaag gcagacacct actgcgcgta ggccgcccac cgcccgcttc 841 tctgccaaaa ggacgcccac ggggagatgg ccgcacccac agctgccttt cccaccacca 901 gcctcggtgc tctgccccat gctgggagga gggagggagg gacaggtgcc tggagccccc 961 ggaaccctgt ttctggaagg ccctggctca ggtggcttca gggcctccgg accccccctg 1021 ggaggggtgg ccacgtgctg gctgcggaac ccagggcagg ggtgggaggg gcctccagca 1081 ctttttatat ttacgtattc tccaaagcag tgttcacacg ggagccagcc tgtggccccc 1141 agcctcctgg aaaacaggtt ggcgctggag gagccgggtc ttggcatcct ggaggtggcc 1201 ccactggtcc tggtgctcca ggcggggccg tggacccctc acctacattc catagtgggc 1261 ccgtggggct cctggtgcat cttaataaag tgtgagcagc aaaaaaaaaa aaaaaaaaaa 1321 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa // LOCUS AF022815 893 bp mRNA PRI 22-OCT-1997 DEFINITION Homo sapiens proteasome subunit XAPC7 mRNA, complete cds. ACCESSION AF022815 NID g2555135 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 893) AUTHORS Huang,J., Kwong,J., Sun,E.C. and Liang,T.J. TITLE Proteasome complex as a potential cellular target of hepatitis B virus X protein JOURNAL J. Virol. 70 (8), 5582-5591 (1996) MEDLINE 96357088 REFERENCE 2 (bases 1 to 893) AUTHORS Huang,J., Kwong,J., Sun,E.C. and Liang,T.J. TITLE Direct Submission JOURNAL Submitted (04-SEP-1997) NIDDK, NIH, 10 Center Dr., Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..893 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 25..771 /codon_start=1 /product="proteasome subunit XAPC7" /db_xref="PID:g2555136" /translation="MSYDRAITVFSPDGHLFQVEYAQEAVKKGSTAVGVRGRDIVVLG VEKKSVAKLQDERTVRKICALDDNVCMAFAGLTADARIVINRARVECQSHRLTVEDPV TVEYITRYIASLKQRYTQSNGRRPFGISALIVGFDFDGTPRLYQTDPSGTYHAWKANA IGRGAKSVREFLEKNYTDEAIETDDLTIKLVIKALLEVVQSGGKNIELAVMRRDQSLK ILNPEEIEKYVAEIEKEKEENEKKKQKKAS" BASE COUNT 242 a 211 c 239 g 201 t ORIGIN 1 ggcacgaggc ggccgcccgc cggcatgagc tacgaccgcg ccatcaccgt cttctcgccc 61 gacggccacc tcttccaagt ggagtacgcg caggaggccg tcaagaaggg ctcgaccgcg 121 gttggtgttc gaggaagaga cattgttgtt cttggtgtgg agaagaagtc agtggccaaa 181 ctgcaggatg aaagaacagt gcggaagatc tgtgctttgg atgacaacgt ctgcatggcc 241 tttgcaggcc tcaccgccga tgcaaggata gtcatcaaca gggcccgggt ggagtgccag 301 agccaccggc tgactgtaga ggacccggtc actgtggagt acatcacccg ctacatcgcc 361 agtctgaagc agcgttatac gcagagcaat gggcgcaggc cgtttggcat ctctgccctc 421 atcgtgggtt tcgactttga tggcactcct aggctctatc agactgaccc ctcgggcaca 481 taccatgcct ggaaggccaa tgccataggc cggggtgcca agtcagtgcg cgagttcctg 541 gagaagaact atactgacga agccattgaa acagatgatc tgaccattaa gctggtgatc 601 aaggcactcc tggaagtggt tcagtcaggt ggcaaaaaca ttgaacttgc tgtcatgagg 661 cgagatcaat ccctcaagat tttaaatcct gaagaaattg agaagtatgt tgctgaaatt 721 gaaaaagaaa aagaagaaaa cgaaaagaag aaacaaaaga aagcatcatg atgaataaaa 781 tgtctttgct tgtaattttt aaattcatat caatcatgga tgagtctcga tgtgtaggcc 841 tttccattcc atttattcac actgagtgtc ctacaataaa cttccgtatt ttt // LOCUS AF022859 2730 bp mRNA PRI 20-OCT-1997 DEFINITION Homo sapiens neuropilin-2(a0) mRNA, complete cds. ACCESSION AF022859 NID g2547129 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2730) AUTHORS Chen,H., Chedotal,A., He,Z., Goodman,C.S. and Tessier-Lavigne,M. TITLE Neuropilin-2, a novel member of the neuropilin family, is a high affinity receptor for the semaphorins Sema E and Sema IV but not Sema III JOURNAL Neuron 19 (3), 547-559 (1997) MEDLINE 97470888 REMARK Erratum:[[published erratum appears in Neuron 1997 Sep;19(3):559]] REFERENCE 2 (bases 1 to 2730) AUTHORS Chen,H., Chedotal,A., He,Z.-G., Goodman,C.S. and Tessier-Lavigne,M. TITLE Direct Submission JOURNAL Submitted (05-SEP-1997) Anatomy, University of California, San Francisco, 513 Parnassus Avenue, Box 0452, San Francisco, CA 94143-0452, USA FEATURES Location/Qualifiers source 1..2730 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1..2730 /codon_start=1 /product="neuropilin-2(a0)" /db_xref="PID:g2547130" /translation="MDMFPLTWVFLALYFSRHQVRGQPDPPCGGRLNSKDAGYITSPG YPQDYPSHQNCEWIVYAPEPNQKIVLNFNPHFEIEKHDCKYDFIEIRDGDSESADLLG KHCGNIAPPTIISSGSMLYIKFTSDYARQGAGFSLRYEIFKTGSEDCSKNFTSPNGTI ESPGFPEKYPHNLDCTFTILAKPKMEIILQFLIFDLEHDPLQVGEGDCKYDWLDIWDG IPHVGPLIGKYCGTKTPSELRSSTGILSLTFHTDMAVAKDGFSARYYLVHQEPLENFQ CNVPLGMESGRIANEQISASSTYSDGRWTPQQSRLHGDDNGWTPNLDSNKEYLQVDLR FLTMLTAIATQGAISRETQNGYYVKSYKLEVSTNGEDWMVYRHGKNHKVFQANNDATE VVLNKLHAPLLTRFVRIRPQTWHSGIALRLELFGCRVTDAPCSNMLGMLSGLIADSQI SASSTQEYLWSPSAARLVSSRSGWFPRIPQAQPGEEWLQVDLGTPKTVKGVIIQGARG GDSITAVEARAFVRKFKVSYSLNGKDWEYIQDPRTQQPKLFEGNMHYDTPDIRRFDPI PAQYVRVYPERWSPAGIGMRLEVLGCDWTDSKPTVKTLGPTVKSEETTTPYPTEEEAT ECGENCSFEDDKDLQLPSGFNCNFDFLEEPCGWMYDHAKWLRTTWASSSSPNDRTFPD DRNFLRLQSDSQREGQYARLISPPVHLPRSPVCMEFQYQATGGRGVALQVVREASQES KLLWVIREDQGGEWKHGRIILPSYDMEYQIVFEGVIGKGRSGEIAIDDIRISTDVPLE NCMEPISAFADEYEVDWSNSSSATSGSGAPSTDKEKSWLYTLDPILITIIAMSSLGVL LGATCAGLLLYCTCSYSGLSSRSCTTLENYNFELYDGLKHKVKMNHQKCCSEA" BASE COUNT 643 a 804 c 734 g 549 t ORIGIN 1 atggatatgt ttcctctcac ctgggttttc ttagccctct acttttcaag acaccaagtg 61 agaggccaac cagacccacc gtgcggaggt cgtttgaatt ccaaagatgc tggctatatc 121 acctctcccg gttaccccca ggactacccc tcccaccaga actgcgagtg gattgtttac 181 gcccccgaac ccaaccagaa gattgtcctc aacttcaacc ctcactttga aatcgagaag 241 cacgactgca agtatgactt tatcgagatt cgggatgggg acagtgaatc cgcagacctc 301 ctgggcaaac actgtgggaa catcgccccg cccaccatca tctcctcggg ctccatgctc 361 tacatcaagt tcacctccga ctacgcccgg cagggggcag gcttctctct gcgctacgag 421 atcttcaaga caggctctga agattgctca aaaaacttca caagccccaa cgggaccatc 481 gaatctcctg ggtttcctga gaagtatcca cacaacttgg actgcacctt taccatcctg 541 gccaaaccca agatggagat catcctgcag ttcctgatct ttgacctgga gcatgaccct 601 ttgcaggtgg gagaggggga ctgcaagtac gattggctgg acatctggga tggcattcca 661 catgttggcc ccctgattgg caagtactgt gggaccaaaa caccctctga acttcgttca 721 tcgacgggga tcctctccct gacctttcac acggacatgg cggtggccaa ggatggcttc 781 tctgcgcgtt actacctggt ccaccaagag ccactagaga actttcagtg caatgttcct 841 ctgggcatgg agtctggccg gattgctaat gaacagatca gtgcctcatc tacctactct 901 gatgggaggt ggacccctca acaaagccgg ctccatggtg atgacaatgg ctggaccccc 961 aacttggatt ccaacaagga gtatctccag gtggacctgc gctttttaac catgctcacg 1021 gccatcgcaa cacagggagc gatttccagg gaaacacaga atggctacta cgtcaaatcc 1081 tacaagctgg aagtcagcac taatggagag gactggatgg tgtaccggca tggcaaaaac 1141 cacaaggtat ttcaagccaa caacgatgca actgaggtgg ttctgaacaa gctccacgct 1201 ccactgctga caaggtttgt tagaatccgc cctcagacct ggcactcagg tatcgccctc 1261 cggctggagc tcttcggctg ccgggtcaca gatgctccct gctccaacat gctggggatg 1321 ctctcaggcc tcattgcaga ctcccagatc tccgcctctt ccacccagga atacctctgg 1381 agccccagtg cagcccgcct ggtcagcagc cgctcgggct ggttccctcg aatccctcag 1441 gcccagcccg gtgaggagtg gcttcaggta gatctgggaa cacccaagac agtgaaaggt 1501 gtcatcatcc agggagcccg cggaggagac agtatcactg ctgtggaagc cagagcattt 1561 gtgcgcaagt tcaaagtctc ctacagccta aacggcaagg actgggaata cattcaggac 1621 cccaggaccc agcagccaaa gctgttcgaa gggaacatgc actatgacac ccctgacatc 1681 cgaaggtttg accccattcc ggcacagtat gtgcgggtat acccggagag gtggtcgccg 1741 gcggggattg ggatgcggct ggaggtgctg ggctgtgact ggacagactc caagcccacg 1801 gtaaaaacgc tgggacccac tgtgaagagc gaagagacaa ccacccccta ccccaccgaa 1861 gaggaggcca cagagtgtgg ggagaactgc agctttgagg atgacaaaga tttgcagctc 1921 ccttcgggat tcaattgcaa cttcgatttc ctcgaggagc cctgtggttg gatgtatgac 1981 catgccaagt ggctccggac cacctgggcc agcagctcca gcccaaacga ccggacgttt 2041 ccagatgaca ggaatttctt gcggctgcag agtgacagcc agagagaggg ccagtatgcc 2101 cggctcatca gcccccctgt ccacctgccc cgaagcccgg tgtgcatgga gttccagtac 2161 caggccacgg gcggccgcgg ggtggcgctg caggtggtgc gggaagccag ccaggagagc 2221 aagttgctgt gggtcatccg tgaggaccag ggcggcgagt ggaagcacgg gcggatcatc 2281 ctgcccagct acgacatgga gtaccagatt gtgttcgagg gagtgatagg gaaaggacgt 2341 tccggagaga ttgccattga tgacattcgg ataagcactg atgtcccact ggagaactgc 2401 atggaaccca tctcggcttt tgcagatgaa tacgaggtgg actggagcaa ttcttcttct 2461 gcaacctcag ggtctggcgc cccctcgacc gacaaagaaa agagctggct gtacaccctg 2521 gatcccatcc tcatcaccat catcgccatg agctcactgg gcgtcctcct gggggccacc 2581 tgtgcaggcc tcctgctcta ctgcacctgt tcctactcgg gcctgagctc ccgaagctgc 2641 accacactgg agaactacaa cttcgagctc tacgatggcc ttaagcacaa ggtcaagatg 2701 aaccaccaaa agtgctgctc cgaggcatga // LOCUS AF022912 1131 bp mRNA PRI 02-DEC-1997 DEFINITION Homo sapiens cGMP phosphodiesterase delta subunit mRNA, complete cds. ACCESSION AF022912 NID g2655093 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1131) AUTHORS Erchova,G., Derre,J., Chatelin,S., Nancy,V., Berger,R., Kaplan,J., Munnich,A. and de Gunzburg,J. TITLE cDNA sequence, genomic organisation and chromosomal mapping of the human gene encoding the delta subunit of the cGMP phosphodiesterase from retinal rod cells JOURNAL Cytogenet. Cell Genet. (1997) In press REFERENCE 2 (bases 1 to 1131) AUTHORS Ershova,G., Nancy,V. and de Gunzburg,J. TITLE Direct Submission JOURNAL Submitted (05-SEP-1997) INSERM U-248, Institut Curie - Recherche, 26, rue d'Ulm, Paris 75005, France FEATURES Location/Qualifiers source 1..1131 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="2" /map="2q36" /cell_type="retinal rod cells" CDS 151..603 /codon_start=1 /product="cGMP phosphodiesterase delta subunit" /db_xref="PID:g2655094" /translation="MSAKDERAREILRGFKLNWMNLRDAETGKILWQGTEDLSVPGVE HEARVPKKILKCKAVSRELNFSSTEQMEKFRLEQKVYFKGQCLEEWFFEFGFVIPNST NTWQSLIEAAPESQMMPASVLTGNVIIETKFFDDDLLVSTSRVRLFYV" BASE COUNT 309 a 288 c 256 g 278 t ORIGIN 1 ggggagagag aggccggtcc tggtctggcc gctgcggctg ctagcccgag gtctccaagc 61 cgggctgcgg ctccatcctc ggctcctggg caccgtctgc gaggctccgc cgaccagagt 121 gagaaagccg cgggcggcga ccgccgcatc atgtcagcca aggacgagcg ggccagggag 181 atcctgaggg gcttcaaact aaattggatg aaccttcggg atgctgagac agggaagata 241 ctctggcaag gaacagaaga cctgtctgtc cctggtgtgg agcatgaagc ccgtgttccc 301 aagaaaatcc tcaagtgcaa ggcagtgtct cgagaactta atttttcttc gacagaacaa 361 atggaaaaat tccgcctgga acaaaaagtt tacttcaaag ggcaatgcct agaagaatgg 421 ttcttcgagt ttggctttgt gatccctaac tccacaaata cctggcagtc cttgatagag 481 gcagcacccg agtcccagat gatgccagca agcgtcttaa ctgggaacgt tatcatagaa 541 acaaagtttt ttgacgacga tcttcttgta agcacatcca gagtgagact tttctatgtt 601 tgaaagaaga atgtgtgtac atttcaagaa tttgggtttt ttggagggag gaggaaactg 661 tttacttttt tcctccacac gtttgatttt tgacacatac acccctaatt ccctcaacag 721 cagaacctac ctgcagccac caggggacca gctctgtgta ggtaaccaga tggctctttt 781 tcccaagccg ccatcttcca gctgaccaga ctaaactccc aaccccagac cagggcaggg 841 gacaggtctc aagtccttcc cagcatacac acagggaaac aaacacatac cacaaaccgg 901 taactgtacc tgtcaccctc cttgtctcct ccttgggccc tacaggctac acatctacct 961 ttggcccctg gttttggaaa aattccgtgt tcctgaccca tgtttagttt tttcctacca 1021 tttctatttc atacattctc atacatttaa cttgtaaaat agactgtgat attattacat 1081 aatgtaatta aaaatatgaa ttaaaatatt cctacagtca aaaaaaaaaa a // LOCUS AF022913 1890 bp mRNA PRI 24-OCT-1997 DEFINITION Homo sapiens GPI transamidase mRNA, complete cds. ACCESSION AF022913 NID g2558890 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1890) AUTHORS Yu,J., Nagarajan,S., Knez,J.J., Udenfriend,S., Chen,R. and Medof,M.E. TITLE Direct Submission JOURNAL Submitted (05-SEP-1997) Pathology, Pathology, 2085 AdelbertRoad, Cleveland, OH 44106, USA FEATURES Location/Qualifiers source 1..1890 /organism="Homo sapiens" /db_xref="taxon:9606" /db_xref="ATCC:77441" /chromosome="1" /cell_line="lymphoblastoid JY1 cells" /clone_lib="ATTC77441" CDS 18..1205 /note="GPI8; involved in the class K GPI surface protein defect; similar to H. sapiens GPI8 protein encoded by GenBank Accession Number Y07596" /codon_start=1 /product="GPI transamidase" /db_xref="PID:g2558891" /translation="MAVTDSLSRAATVLATVLLLSFGSVAASHIEDQAEQFFRSGHTN NWAVLVCTSRFWFNYRHVANTLSVYRSVKRLGIPDSHIVLMLADDMACNPRNPKPATV FSHKNMELNVYGDDVEVDYRSYEVTVENFLRVLTGRIPPSTPRSKRLLSDDRSNILIY MTGHGGNGFLKFQDSEEITNIELADAFEQMWQKRRYNELLFIIDTCQGASMYERFYSP NIMALASSQVGEDSLSHQPDPAIGVHLMDRYTFYVLEFLEEINPASQTNMNDLFQVCP KSLCVSTPGHRTDLFQRDPKNVLITDFFGSVRKVEITTETIKLQQDSEIMESSYKEDQ MDEKLMEPLKYAEQLPVAQIIHQKPKLKDWHPPGGFILGLWALIIMVFFKTYGIKHMK FIF" BASE COUNT 600 a 320 c 354 g 616 t ORIGIN 1 atctgaagcc agtaaacatg gccgtcaccg acagcctcag ccgggctgcg actgtcttgg 61 caactgtgtt gctcttgtcc ttcggcagcg tggccgctag tcatatcgag gatcaagcag 121 aacaattctt tagaagtggc catacaaaca actgggctgt tctggtgtgt acatcccgat 181 tctggtttaa ttatcgacat gttgcaaata ccctttctgt ttatagaagt gtcaagaggc 241 taggtattcc tgacagtcac attgtcctaa tgcttgcaga tgatatggcc tgtaatccta 301 gaaatcccaa accagctaca gtgtttagtc acaagaatat ggaactaaat gtgtatggag 361 atgatgtgga agtggattat agaagttatg aggtaactgt ggagaatttt ttacgggtat 421 taactgggag gatcccacct agtactcctc ggtcaaaacg tcttctttct gatgacagaa 481 gcaatattct aatttatatg acagggcatg gtggaaatgg tttcttaaaa tttcaagatt 541 ctgaagaaat taccaacata gaactcgcgg atgcttttga acaaatgtgg cagaaaagac 601 gctacaatga gctactgttt attattgata cttgccaagg agcatccatg tatgaacgat 661 tttattctcc taacataatg gctctagcta gtagtcaagt gggagaagat tcactctcgc 721 atcaacctga tcctgcaatt ggagtccatc ttatggatag atacacattt tatgtcttgg 781 aatttttgga agaaattaac ccagctagcc aaactaatat gaatgacctt tttcaggtat 841 gtcccaaaag tctgtgtgtg tctactcctg gacatcgcac tgatcttttt cagagggatc 901 ctaaaaatgt actgataact gatttctttg gaagtgtacg gaaagtggaa attacaacag 961 agactattaa attgcaacag gattcagaaa tcatggaaag cagctataag gaagaccaga 1021 tggatgagaa actaatggaa cctctgaaat atgctgaaca acttcctgta gctcagataa 1081 tacaccagaa accgaagctg aaagactggc atcctcctgg gggctttatt ctgggattat 1141 gggcacttat tatcatggtt ttcttcaaaa cttatggaat taagcatatg aagttcattt 1201 tttagacttg atgatgaatg aagaatgcat ggaggactgc aaacttggat aataatttat 1261 gtcattatat atttttaaaa atgtgtttct cttgtatgaa ttggaaataa gtataaggaa 1321 actaaatttg aatcaactat taattttata acttaaagaa aaataattgt taatgcaact 1381 gcttaatggc actaaatata ttccagtttt gtattttgtg tattataaaa gcgaatgaga 1441 cagagatcag aatacattga ctgtttttga aaatagtaat ttccccttat ccccttttca 1501 tttggaaaag aaacaattgt gaagacatta aattctcact aacagaagta actttggtta 1561 attatttttt gtatatcctc ccaatctttt gacttatgca catatttttt cccaatatgg 1621 agatcatatg gaatgtacta ttttgtaatg tcttttttca ttttacaatg tattatcaac 1681 cttttccctc tcaaaaatac attgtgaatg actgcatagt attcacttta tgaatattta 1741 attcatttca cagtcttcta ttgttggacc acttacattg taccaaatgt tttcctttgg 1801 tttattcttt aatgtattaa tattttactg ctggtcactc atggaatcct gcagctttaa 1861 ttaaaagcaa agatgaaaaa aaaaaaaaaa // LOCUS AF023158 2646 bp mRNA PRI 07-DEC-1997 DEFINITION Homo sapiens tyrosine phosphatase (cdc14B) mRNA, complete cds. ACCESSION AF023158 NID g2662462 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2646) AUTHORS Li,L., Ernsting,B.R., Wishart,M.J., Lohse,D.L. and Dixon,J.E. TITLE A family of putative tumor suppressors is structurally and functionally conserved in humans and yeast JOURNAL J. Biol. Chem. 272 (47), 29403-29406 (1997) MEDLINE 98037751 REFERENCE 2 (bases 1 to 2646) AUTHORS Li,L. and Dixon,J.E. TITLE Direct Submission JOURNAL Submitted (06-SEP-1997) Biological Chemistry, University of Michigan, 1301 Catherine street, Ann Arbor, MI 48109, USA FEATURES Location/Qualifiers source 1..2646 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..2646 /gene="cdc14B" CDS 1..1380 /gene="cdc14B" /note="dual specific protein" /codon_start=1 /product="tyrosine phosphatase" /db_xref="PID:g2662463" /translation="MKRKSERRSSWAAAPPCSRRCSSTSPGVKKIRSSTQQDPRRRDP QDDVYLDITDRLCFAILYSRPKSASNVHYFSIDNELEYENFYADFGPLNLAMVYRYCC KINKKLKSITMLRKKIVHFTGSDQRKQANAAFLVGCYMVIYLGRTPEEAYRILIFGET SYIPFRDAAYGSCNFYITLLDCFHAVKKAMQYGFLNFNSFNLDEYEHYEKAENGDLNW IIPDRFIAFCGPHSRARLESGYHQHSPETYIQYFKNHNVTTIIRLNKRMYDAKRFTDA GFDHHDLFFADGSTPTDAIVKEFLDICENAEGAIAVHCKAGLGRTGTLIACYIMKHYR MTAAETIAWVRICRPGSVIGPQQQFLVMKQTNLWLEGDYFRQKLKGQENGQHRAAFSK LLSGVDDISINGVENQDQQEPEPYSDDDEINGVTQGDRLRALKSRRQSKTNAIPLTLS ISRTKTVLR" BASE COUNT 766 a 566 c 597 g 716 t 1 others ORIGIN 1 atgaagcgga aaagcgagcg gcggtcgagc tgggccgccg cgcccccctg ctcgcggcgc 61 tgctcgtcga cctcgccggg tgtgaagaag atccgcagct ccacgcagca agacccgcgc 121 cgccgggacc cccaggacga cgtgtacctg gacatcaccg atcgcctttg ttttgccatt 181 ctctacagca gaccaaagag tgcatcaaat gtacattatt tcagcataga taatgaactt 241 gaatatgaga acttctacgc agattttgga ccactcaatc tggcaatggt ttacagatat 301 tgttgcaaga tcaataagaa attaaagtcc attacaatgt taaggaagaa aattgttcat 361 tttactggct ctgatcagag aaaacaagca aatgctgcct tccttgttgg atgctacatg 421 gttatatatt tggggagaac cccagaagaa gcatatagaa tattaatctt tggagagaca 481 tcctatattc ctttcagaga tgctgcctat ggaagttgca atttctacat tacacttctt 541 gactgttttc atgcagtaaa gaaggcaatg cagtatggct tccttaattt caactcattt 601 aaccttgatg aatatgaaca ctatgaaaaa gcagaaaatg gagatttaaa ttggataata 661 ccagaccgat ttattgcctt ctgtggacct cattcaagag ccagacttga aagtggttac 721 caccaacatt ctcctgagac ttatattcaa tattttaaga atcacaatgt tactaccatt 781 attcgtctga ataaaaggat gtatgatgcc aaacgcttta cggatgctgg cttcgatcac 841 catgatcttt tctttgcgga tggcagcacc cctactgatg ccattgtcaa agaattccta 901 gatatctgtg aaaatgctga gggtgccatt gcagtacatt gcaaagctgg ccttggtcgc 961 acgggcactc tgatagcctg ctacatcatg aagcattaca ggatgacagc agccgagacc 1021 attgcgtggg tcaggatctg cagacctggc tcggtgattg ggcctcagca gcagtttttg 1081 gtgatgaagc aaaccaacct ctggctggaa ggggactatt ttcgtcagaa gttaaagggg 1141 caggagaatg gacaacacag agcagccttc tccaaacttc tctctggcgt tgatgacatt 1201 tccataaatg gggtcgagaa tcaagatcag caagaacccg aaccgtacag tgatgatgac 1261 gaaatcaatg gagtgacaca aggtgataga cttcgggcct tgaaaagcag aagacaatcc 1321 aaaacaaacg ctattcctct cactctctcc atttcaagga ctaaaacagt cttgcgttaa 1381 gtaaaaacct gtgaccagag ctgaaggaag actctaggac tgaaaactgc aacagaaatt 1441 agcacaattt gaaaacaaaa caaaattgca aaagccttag ttgctttttc cacctaagaa 1501 gttgatcaat ggagaaaatg tccactggag tttgaataat gaactttgag tttgggtgca 1561 agcaaatgac tcagagaagg gtccagctct caagctgaat gacaaacatg ctgttgtaaa 1621 tttagtctca ggtgtaaata cccaagccct ctggtaccca gggagctggc tggtctgtgg 1681 tgcatgtgtg tccctgtgat ggcaatcatt gtagttgctg gccttcagaa gaattgagga 1741 tctgatggag gttttttatg tatttatttt ctgttcacct tgtgaccctg tgtcaaaatt 1801 tataaagata caaaaggcat tactgaaatg gtactttctg taatttgata ctatttggct 1861 taatcatctt cacttgacta tttgtaatac tgttgtaatg ttaactctgt taagtaccca 1921 agctgcttgt cttccaccaa agagtgcttt attaacaaga atctgtgaaa atcacattta 1981 aacactgttg catgttgtaa gaccaggtgg taccttagta acctaaaact tgcaagagaa 2041 tattaatggt agctttagaa gactcaggag gagaaactga cttcagagtt ggaagatgtt 2101 gcaagtcgtt cctttttctg tccttcaggg actgaagaac tgggaggctg cccattgttt 2161 ggttgccagt catacaaatt aaaatcatat ttccttccat gaatggaaga aacacactat 2221 tggtttttcc ccttggaaac agcaatccca aataatgtcg gcttacaaaa aaaaaagtta 2281 ccactttttt agagtccttn ccctgtaaca ttggattttt ttttccctta tgagatccac 2341 ctaaggccat tgacgtggcc tgcgatctca gtgacaatga tctgctttct ggatctcact 2401 gttgcctttg gttagggaac acagagtgct tctcccgcag ccctactgga acacagcaga 2461 gtctgtgcca tgaagcagtt acagaaacag aattgatgtg ctgctaaaaa aaaaaaaaaa 2521 aatggggccc gggggggcgt ccgccggccc tgcgggccgc cggtgaaata ccactactct 2581 gatcgttttt tcactgaccc ggtgaggcgg gggggcgagc cccgaggggc tctcgcttct 2641 ggcgcg // LOCUS AF023268 75270 bp DNA PRI 28-OCT-1997 DEFINITION Homo sapiens clk2 kinase (CLK2), propin1, cote1, glucocerebrosidase (GBA), and metaxin genes, complete cds; metaxin pseudogene and glucocerebrosidase pseudogene; and thrombospondin3 (THBS3) gene, partial cds. ACCESSION AF023268 NID g2564910 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 75270) AUTHORS Winfield,S.L., Tayebi,N., Martin,B.M., Ginns,E.I. and Sidransky,E. TITLE Identification of three additional genes contiguous to the glucocerebrosidase locus on chromosome 1q21: implications for Gaucher disease JOURNAL Genome Res. 7 (10), 1020-1026 (1997) MEDLINE 97474796 REFERENCE 2 (bases 1 to 75270) AUTHORS Winfield,S.L., Tayebi,N., Martin,B.M., Ginns,E.I. and Sidransky,E. TITLE Direct Submission JOURNAL Submitted (05-SEP-1997) Clinical Neuroscience Branch, NIMH, 9000 Rockville Pike Bldg. 49 Rm. B1EE16, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..75270 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1q21" mRNA join(1..129,2354..2523,3624..3852,4550..4637,4987..5053, 5220..5336,6458..6624,7401..7495,8580..8709,8804..8886, 9055..9134,9315..9405,9956..10482) /gene="CLK2" gene 1..10482 /gene="CLK2" CDS join(2354..2523,3624..3852,4550..4637,4987..5053, 5220..5336,6458..6624,7401..7495,8580..8709,8804..8886, 9055..9134,9315..9405,9956..10138) /gene="CLK2" /codon_start=1 /product="clk2 kinase" /db_xref="PID:g2564911" /translation="MPHPRRYHSSERGSRGSYREHYRSRKHKRRRSRSWSSSSDRTRR RRREDSYHVRSRSSYDDRSSDRRVYDRRYCGSYRRNDYSRDRGDAYYDTDYRHSYEYQ RENSSYRSQRSSRRKHRRRRRRSRTFSRSSSQHSSRRAKSVEDDAEGHLIYHVGDWLQ ERYEIVSTLGEGTFGRVVQCVDHRRGGARVALKIIKNVEKYKEAARLEINVLEKINEK DPDNKNLCVQMFDWFDYHGHMCISFELLGLSTFDFLKDNNYLPYPIHQVRHMAFQLCQ AVKFLHDNKLTHTDLKPENILFVNSDYELTYNLEKKRDERSVKSTAVRVVDFGSATFD HEHHSTIVSTRHYRAPEVILELGWSQPCDVWSIGCIIFEYYVGFTLFQTHDNREHLAM MERILGPIPSRMIRKTRKQKYFYRGRLDWDENTSAGRYVRENCKPLRRYLTSEAEEHH QLFDLIESMLEYEPAKRLTLGEALQHPFFARLRAEPPNKLWDSSRDISR" mRNA join(10950..11272,11624..11701,12699..12821,12908..13028, 14412..14540,15709..15868,15980..16081,16574..16691, 16947..17388) /product="propin1" CDS join(11207..11272,11624..11701,12699..12821,12908..13028, 14412..14540,15709..15868,15980..16081,16574..16691, 16947..17093) /codon_start=1 /product="propin1" /db_xref="PID:g2564915" /translation="MAQSRDGGNPFAEPSELDNPFQDPAVIQHRPSRQYATRDVYNPF ETREPPPAYEPPAPAPLPPPSAPSLQPSRKLSPTEPKNYGSYSTQASAAAATAELLKK QEELNRKAEELDRRERELQHAALGGTATRQNNWPPLPSFCPVQPCFFQDISMEIPQEF QKTVSTMYYLWMCSTLALLLNFLACLASFCVETNNGAGFGLSILWVLLFTPCSFVCWY RPMYKAFRSDSSFNFFAFFFNFFDQDVLFVLQAIGIPGWGFSGWISALVVPKGNTAVS VLMLLVALLFTGIAVLGIVMLKRIHSLYRRTGASFQKAQQEFAAGVFSNPAVRTRAAN AAAGAAENAFRAP" mRNA join(17913..18715,18913..18969,19174..19284,19390..19509, 19636..19743,21463..21638,21770..21869,22207..22276, 22550..23095,24910..24978,25080..25333,25428..26165) /product="cote1" CDS join(18491..18715,18913..18969,19174..19284,19390..19509, 19636..19743,21463..21638,21770..21869,22207..22276, 22550..23095,24910..24978,25080..25333,25428..25601) /codon_start=1 /product="cote1" /db_xref="PID:g2564916" /translation="MMPSPSDSSRSLTSRPSTRGLTHLRLHRPWLQALLTLGLVQVLL GILVVTFSMVASSVTTTESIKRSCPSWAGFSLAFSGVVGIVSWKRPFTLVISFFSLLS VLCVMLSMAGSVLSCKNAQLARDFQQCSLEGKVCVCCPSVPLLRPCPESGQELKVAPN STCDEARGALKNLLFSVCGLTICAAIICTLSAIVCCIQIFSLDLVHTQLAPERSVSGP LGPLGCTSPPPAPLLHTMLDLEEFVPPVPPPPYYPPEYTCSSETDAQSITYNGSMDSP VPLYPTDCPPSYEAVMGLRGDSQATLFDPQLHDGSCICERVASIVDVSMDSGSLVLSA IGDLPGGSSPSEDSCLLELQGSVRSVDYVLFRSIQRSRAGYCLSLDCGLRGPFEESPL PRRPPRAARSYSCSAPEAPPPLGAPTAARSCHRLEGWPPWVGPCFPELRRRVPRGGGR PAAAPPTRAPTRRFSDSSGSLTPPGHRPPHPASPPPLLLPRSHSDPGITTSSDTADFR DLYTKVLEEEAASVSSADTGLCSEACLFRLARCPSPKLLRARSAEKRRPVPTFQKVPL PSGPAPAHSLGDLKGSWPGRGLVTRFLQISRKAPDPSGTGAHGHKQVPRSLWGRPGRE SLHLRSCGDLSSSSSLRRLLSGRRLERGTRPHSLSLNGGSRETGL" mRNA join(32168..32299,32668..32755,33308..33499,33623..33769, 34735..34868,35079..35251,35806..36043,36915..37139, 37540..37703,38073..38189,38284..38931) /gene="GBA" /product="glucocerebrosidase" gene 32168..38931 /gene="GBA" CDS join(32273..32299,32668..32755,33308..33499,33623..33769, 34735..34868,35079..35251,35806..36043,36915..37139, 37540..37703,38073..38189,38284..38389) /gene="GBA" /codon_start=1 /product="glucocerebrosidase" /db_xref="PID:g2564914" /translation="MEFSSPSREECPKPLSRVSIMAGSLTGLLLLQAVSWASGARPCI PKSFGYSSVVCVCNATYCDSFDPPTFPALGTFSRYESTRSGRRMELSMGPIQANHTGT GLLLTLQPEQKFQKVKGFGGAMTDAAALNILALSPPAQNLLLKSYFSEEGIGYNIIRV PMASCDFSIRTYTYADTPDDFQLHNFSLPEEDTKLKIPLIHRALQLAQRPVSLLASPW TSPTWLKTNGAVNGKGSLKGQPGDIYHQTWARYFVKFLDAYAEHKLQFWAVTAENEPS AGLLSGYPFQCLGFTPEHQRDFIARDLGPTLANSTHHNVRLLMLDDQRLLLPHWAKVV LTDPEAAKYVHGIAVHWYLDFLAPAKATLGETHRLFPNTMLFASEACVGSKFWEQSVR LGSWDRGMQYSHSIITNLLYHVVGWTDWNLALNPEGGPNWVRNFVDSPIIVDITKDTF YKQPMFYHLGHFSKFIPEGSQRVGLVASQKNDLDAVALMHPDGSAVVVVLNRSSKDVP LTIKDPAVGFLETISPGYSIHTYLWRRQ" misc_feature complement(38948..42406) /note="metaxin pseudogene" misc_feature 54895..59543 /note="glucocerebrosidase pseudogene" mRNA complement(join(<59671..59885,59978..60132,60598..60674, 60801..60983,61149..61241,62741..62820,62953..63022, 64036..>64116)) /product="metaxin" CDS complement(join(59671..59885,59978..60132,60598..60674, 60801..60983,61149..61241,62741..62820,62953..63022, 64036..64116)) /codon_start=1 /product="metaxin" /db_xref="PID:g2564913" /translation="MAAPMELFCWSGGWGLPSVDLDSLAVLTYARFTGAPLKVHKISN PWQSPSGTLPALRTSHGEVISVPHKIITHLRKEKYNADYDLSARQGADTLAFMSLLEE KLLPVLVHTFWIDTKNYVEVTRKWYAEAMPFPLNFFLPGRMQRQYMERLQLLTGEHRP EDEEELEKELYREARECLTLLSQRLGSQKFFFGDAPASLDAFVFSYLALLLQAKLPSG KLQVHLRGLHNLCAYCTHILSLYFPWDGAEVPPQRQTPAGPETEEEPYRRRNQILSVL AGLAAMVGYALLSGIVSIQRATPARAPGTRTLGMAEEDEEE" mRNA join(<65492..65570,66961..67167,68050..68306,68409..68511, 69827..69853,70061..70143,70229..70280,70404..70552, 70963..71103,71330..71407,71795..71947,72161..72271, 72361..72468,72754..72913,73251..73369,73508..73560, 74757..74950,75138..>75270) /gene="THBS3" /product="thrombospondin3" gene 65492..>75270 /gene="THBS3" CDS join(65492..65570,66961..67167,68050..68306,68409..68511, 69827..69853,70061..70143,70229..70280,70404..70552, 70963..71103,71330..71407,71795..71947,72161..72271, 72361..72468,72754..72913,73251..73369,73508..73560, 74757..74950,75138..>75270) /gene="THBS3" /codon_start=1 /product="thrombospondin3" /db_xref="PID:g2564912" /translation="METQELRGALALLLLCFFTSASQDLQVIDLLTVGESRQMVAVAE KIRTALLTAGDIYLLSTFRLPPKQGGVLFGLYSRQDNTRWLEASVVGKINKVLVRYQR EDGKVHAVNLQQAGLADGRTHTVLLRLRGPSRPSPALHLYVDCKLGDQHAGLPALAPI PPAEVDGLEIRTGQKAYLRMQGFVESMKIILGGSMARVGALSECPFQGDESIHSAVTN ALHSILGEQTKALVTQLTLFNQILVELRDDIRDQVKEMSLIRNTIMECQVCGFHEQRS HCSPNPCFRGVDCMEVYEYPGYRCGPCPPGLQGNGTHCSDINECAHADPCFPGSSCIN TMPGFHCEACPRGYKGTQVSGVGIDYARASKQVCNDIDECNDGNNGGCDPNSICTNTV GSFKCGPCRLGFLGNQSQGCLPARTCHSPAHSPCHIHAHCLFERNGAVSCQCNVGWAG NGNVCGTDTDIDGYPDQALPCMDNNKHCKQDNCLLTPNSGQEDADNDGVGDQCDDDAD GDGIKNVEDNCRLFPNKDQQNSDTDSFGDACDNCPNVPNNDQKDTDGNGEGDACDNDV DGDGIPNGLDNCPKVPNPLQTDRDEDGVGDACDSCPEMSNPTQTDADSDLVGDVCDTN EDSDGDGHQDTKDNCPQLPNSSQLDSDNDGLGDECDGDDDNDGIPDYVPPGPDNCRLV PNPNQKDSDGNGVGDVCEDDFDNDAVVDPLDVCPESAEVTLTDFRAYQTVVLDP" BASE COUNT 17453 a 19704 c 19643 g 18470 t ORIGIN 1 tcccagggtc ccgggttggg ggggtggagc agcatttcgt cgccgcgggg gtgccgggac 61 tccggccgca gtgtcgccgc catcacggac ttcctgtggg acaagcgcac gggcctcgcc 121 gccagaacgg tgagcgcgcg ggttggctgg ccgcgcgaaa agatggcgac cgcggggcgg 181 gcggggttag gcttgggttg ggggcaaggg acgggggcga gatttggaga ggaggagagt 241 tgagacctac gaagcgacgg agtagggaga tgagggggaa ggaggtcggc ctgcgttaga 301 tgtaccaaga agacctgtag gaagcttggg cggaaccaga ctagaaggga tgatagagta 361 aggatgggag tggttccaga gatgtgaagg agtggaatat caaagttgga ctcgggtcca 421 atcaaagctg gaggggagga atacatcacc gtgtcacttg gaacatgcag gaagaatttg 481 cctgtaggag ttctggattt tctgagtaag aaaggaaaaa gcaaaagatt tcaaacttga 541 gaaaggtagg ctagtgatgt tttgagggga cagtgctgtg aggccgtttg atggtagtat 601 gtggtgagat ggcatttatg tggctatgct gattaggata actatgctgg gaaaatcttg 661 tgcttgcagc ttggtatgct accggcccca gttacaccaa gaatgcatca aataccaccc 721 tcctcaattt ggggaggttg ggaagaagtg tatacagaat ggtggtcaga catcggaatc 781 attaaacctt ttgcatacgc gattctcata cacccccacc aagacttatg tccctgggcc 841 tgtttcattt tattgtgcct tcctattctc atcttcctgg attgtgacag tgatttttct 901 cttttttcct tgaaggaaaa aaaggatttg tgaaccatcc cattccaatt ccattaactg 961 gaaggtggcc agctgtcagc taagtagggc tgatattaat taaaaaccac ttactggcca 1021 ggcgcggtgg ctcacatctg taatcccagc actttgggag gccaaggcgg gcggatcacc 1081 tgaggttggg agttcgagac cagcctgacc aacatggaga aaccctgtct ctactaaaaa 1141 tacaaaatta gctgggcgtg gcggcgcatg cctgtaatcc cagctactcg ggaggctgag 1201 gcaggagaat cgcttgaacc ctgcaggcgg aggttatggt gagccgagat tgcgccattg 1261 cactccagcc tgggcaacaa gagcaaaact ccgttttcaa aaaaaaaaaa aaaaacacaa 1321 gaaacagttg ttatctacgt cttaagcaca agagtggagg ggccagatgg gatgctgact 1381 gtattttgtg gcctactaca ttaaaacctc tcaagggaga gttggccaaa gatgtgagag 1441 atgattaagt taattcaagg ggcccgaccc agatccctgg agctgatagt atttggcgtt 1501 tacctggtat atgtagaaac ctgggagact ggattcctac cctcaggaca ctgatagtct 1561 acttgaggag atgagatgta ctcttaatga gtgtacaaca tttatcataa ctgtaaacca 1621 aagctttact ctattgactt gtgaggaact cagcatcaca cagatctata tcttagatcc 1681 tgaacgggga tacactgtgg cagtctccat cggtcgtcat acactttgct ctgtagtgaa 1741 ctgtgtccta gacctgaagg tcttcatgga gctgtgtttg aagaaaataa ggatagaaca 1801 cttgaactgg ctgggtgcgg tggctcacgc ctgtaatccc agcactttgg gaggccgagg 1861 caggaggatc acgagtcagg agtttgagac catcctggcc aacatggtga aaccccgtct 1921 ctactaaaaa tataaaaatt agccaggagt ggtggtgcgt gcctgtaatc ccagctacac 1981 aggaggctga ggcaggagag ttgcttgaac ctgggaagtg gaggttgcag tgagccgaga 2041 ttgtgccact gcactccagc ctggccacac agcgagactc tgtctcaaga ataaaataag 2101 aacacttgaa ctgaggggtg ggtaatgagg gatgggtttg ggggctttct agagtttcag 2161 ccagagttag taggtggctg tactccaata aggaggaatt gtaagtacag aggaagactg 2221 aggaagttgt gcctggtaaa aagaggcaga aataagaggg gaaagcagct tagcatcttt 2281 ggagcagcca gtccttggag gagatttctg aaaactcgaa actttcctat ttccgctttc 2341 ccttgccttc cagatgccgc atcctcgaag gtaccactcc tcagagcgag gcagccgggg 2401 gagttaccgt gaacactatc ggagccgaaa gcataagcga cgaagaagtc gctcctggtc 2461 aagtagtagt gaccggacac gacggcgtcg gcgagaggac agctaccatg tccgttctcg 2521 aaggtgaggc ctcaggaaga gatactcaaa cacagggtca ctcgctcatt cctttgttca 2581 acaaatactt attggttata tgctaaaacc attgccttaa gtagctggct gaaggtaatg 2641 tcgggataga taaccacaga agccatttgg gctgcagtgt taagtgttag aatacataat 2701 gaggaactga ggacgtaccc agtgcgtttg gttggagtgg ggcaggatgt taattcagat 2761 aaaacattgc agaaagcaac atctgtatat gtctggagga caagtagagt caggtggggg 2821 aagtgtcttt gggcagaggg aaagcatgtg caaaggactg ggagctacag gtaactaata 2881 agctaacaga taagacaaag ttagatcttt gtgtttttaa ttttttaaaa attttaaaat 2941 atcatttatt tatttttgag acagagtttc actctgttgc ccaggctgga gtgcagtggc 3001 aagatctcag ctcactgcaa actctatggc cccaggttca agttgattct tgtgcctcag 3061 cctcccaagt tacctgggac tacaggcatg cgccatcatg cctggctaat tttttttttt 3121 tttttttgta tttttagtag agacaggatt tcaccatgtt ggcctcaaac tcctgacctc 3181 aagtgctcca cctgccttgg cctcccaaag tgctgggatt acaggcgtga gccactgtgc 3241 ccggcccagt tttaatattt tagtggtatg ttcagtggaa agaatgggtt aaggatggtt 3301 tagagcaagg ggggatttca ttcaggaaag cttgagaaaa aaatggctct ttgtttcttt 3361 gtatcactgc agcgggaaaa aggccccaat agggagtggt tctgcagatg ccttttcaga 3421 ggttttggtg acatggcaga tgtgcttggt taacctcaac ttcttctggt cccttggtcc 3481 tctccagtga ctcaaacaag tgtctgagaa cagtttttct tagcattcac tctttgtcct 3541 cactaaggca ggaccttgtc acagggtatg gtagggaatg tgatctgaac agatacacct 3601 cattggtatt tgttatccaa cagcagttat gatgatcgtt cgtccgaccg gagggtgtat 3661 gaccggcgat actgtggcag ctacagacgc aacgattata gccgggatcg gggagatgcc 3721 tactatgaca cagactatcg gcattcctat gaatatcagc gggagaacag cagttaccgc 3781 agccagcgca gcagccggag gaagcacaga cggcggagga ggcgcagccg gacatttagc 3841 cgctcatctt cggtgagtgc cagcccaggc ccttcctctc cccactcttc tgcaggccct 3901 ctaggactct ggtaagtgag cagtatcctt gttctcagct gaacattggg gcatgaacac 3961 tgaggtgggc actgagtttg cctactttct tggaagctct ccgactcttg aagggccctg 4021 gatctgcttt ggagatggat gggcacggag catttgtgac ccccagtgct ctccctggca 4081 tgttgggctt attgtgttgg gagcagcttc tccgccccag cggcctccac tctttaatgg 4141 ggaccttgct tgttgaactg cctttttccc ccaagccctg ggctctgtag cccccttggc 4201 aggcgggctt gggtgggggc aaggggagat ctgtgtctgc ccggaagggc attgtgtaga 4261 gcatggggtt gtgggtgaca ttggcaacaa caccacttct ctgctctgcc catgcctctc 4321 cctgttcctg ttttgggttt ggggacttgg gattatgcgc cgctctctct tctcacaccg 4381 atccacctta ctgcatctga cgtgttccct tccactgtcc cccatcatcg tctgtccccc 4441 ctggctgggc gcctgtgacc ggtgacccct ctaccaccca ccccgcctcc ctccggcccc 4501 cggctccgaa cactgggctt ggttcggaac cccccctgcc ccccgacagc agcacagcag 4561 ccggagagcc aagagtgtag aggacgacgc tgagggccac ctcatctacc acgtcgggga 4621 ctggctacaa gagcgatgta caagccaaat cgtaacaatc ctatagcctg taatggtccc 4681 atagccatcc taacgtccca agccaacctg agtcacagcc tcttgccttt tgctcagtgg 4741 tgttgtttat ctgggtggtt tgagttgctg ttgagaattg acctctgttg tccactccca 4801 gctctcacgg cccctgggtt aaaggtggtg ggaatcacgc aggggttttc ttcccctgtg 4861 accatctgta tctgttcccc ttccttcatc tccaccccag gttgctgtcc ccttttttct 4921 tccaactcag ctcattcccc accttctctc cctccctctc cccgaccctg ctctctttca 4981 tttcagatga aatcgttagc accttaggag aggggacctt cggccgagtt gtacaatgtg 5041 ttgaccatcg caggtaactg tcagtccctc cctactatgt ggggctaaag agatggttgg 5101 ggttatatgg ggcttttttg ctaattaacc tgaggtagaa tttcttagtc cccctacagc 5161 cctgttcatt ttgagacatt cttgagaacc cagcaaaagc ctctcctgcc aacttacagg 5221 ggtggggctc gagttgccct gaagatcatt aagaatgtgg agaagtacaa ggaagcagct 5281 cgacttgaga tcaacgtgct agagaaaatc aatgagaaag accctgacaa caagaagtaa 5341 gcaagcaagg aagtgtgtag ggaggctgag agccccaacc cctacacggg agagattcct 5401 agacctggtt taggcagaca gggggagtcc tagaccttct catccattca tcttttgttc 5461 atcatcttag agtagtagaa catcagggta ggaaaggggg ggggcccatg acattcctta 5521 atccactccc cttgttttca gggaagttag attgctctgt cgtttgaccc cagcctagaa 5581 tggcttctgt agagatatac cctggacata gccctaggac tgagcatcaa cctcatcgaa 5641 caatgaatta agcttttata agtagaacta tgctgataag gccacagaca tttatgcagt 5701 aatattttgt tcacatacat tcacagtcca aaaatagaat aaggacttta agccaagata 5761 tgaggccagt aggtataaat aggagcccag tggtagctag tggtgaatga atatcaggag 5821 ttgggctggg gtttgggatt taggatggac agtttgatga ttccaggatc tatactgttt 5881 ggagcttggc actccacagc tcttccagca aatagttttt gaactattta aaatgacgca 5941 ggaagatatt ttgaaacttg ggctgggcat ggtggctcac gcctgtaatc ccagcacttt 6001 ggaggccgag gcgggtggat cacaatgtca ggagttcgag accagcctgg ccaatatggt 6061 gaaaccccgt ctgtactaaa aataaaaaaa ttagccgggc atggtggtgg acaactgtag 6121 tcccagctac ttggaaggct gaggcaggag aatcgcttga acaggaggca gaggttgcag 6181 tgagccaaga tcgcgccact gcactccagc ctgggtgaca gagtgagacg ccatctcaaa 6241 aacaaaaaag aaaaaaacat ggcagatagt accatttctt tcttgggtct ggtagaggct 6301 actccttagc tgaactgaat tttggtgtta gtactagctg gcatggtttt acacaagtta 6361 tgtggaatca acagctatga agtacctcct tgttggatgg ctgatgggca gatgggagct 6421 catcagacaa tcccccttcc cccatctctc ttctcagcct ctgtgtccag atgtttgact 6481 ggtttgacta ccatggccac atgtgtatct cctttgagct tctgggcctt agcaccttcg 6541 atttcctcaa agacaacaac tacctgccct accccatcca ccaagtgcgc cacatggcct 6601 tccagctgtg ccaggctgtc aagtgtgagt ggggtgggcc gaagtggact ctggggcagt 6661 ccctcccttc attggatctc ttctgtcggt tgtgcactgg tgaagcccct aaacagtcag 6721 ctgtctgtta tctgcagttc tttgatttac tgtcatcttg aaacgtcttc tgacttaact 6781 ccttgactga tgtctttatc gtcactgatt gctcttactc tacacctagc ctagcagcag 6841 ctagaagaga aagcctttgg aatcaaagca cttaattacc ctcccctttt cctttctccc 6901 tttcttggga aagcatcagt cagacagcaa acataaagag acaaaaatac actccttagg 6961 gtaaaggctt acatttgtct gggatgagat gttcattcac agcaaaggag atgggaacac 7021 agagtatgta gttcagctaa ggggcaaggt ggggaattca aagaaatata tcattcctct 7081 ggagtcatca aaataataca gtttcacaga attgagttaa cataatgcca gctagacaca 7141 tgtcaaagtg tgtacacact acaactcaaa gcaaactttt ttttttttta catcagttga 7201 gctacatatg tatcttacat tagaaagagc agagcttctt agaccaggca ttccatttag 7261 acgtagagcg gaaagcagcc ctagtgattc tggacctgtc tcctcactgg ctttgcccta 7321 ggtaaccagg cctggggtca gctgatacca gttagctctg gccacctgca ccaagcctga 7381 cctggccttc tcccctacag tcctccatga taacaagctg acacatacag acctcaagcc 7441 tgaaaatatt ctgtttgtga attcagacta tgagctcacc tacaacctag agaaggtaag 7501 atggataggg tctgcccttg gttactgggg gcaggcagct gcaccacttt gccttctgcc 7561 gagccctttg ttttctccct tttatttcgt cctcccacat tttctccctg actggctcca 7621 actgggtaaa actaaatagg ttgaaaggga gaaatctctt aagaagactt aaattgggag 7681 ataacttgta caggggactt cagataactt tccagtagag tgaaagtttt taaggttctt 7741 ggtttgggct ttatttattt atttatttat ttatttattt atttatttga gacagagtct 7801 cactctgttg cccagactga tgtgcagtgg cataatctcg gctcactgca acctctacct 7861 cccaggttta agtgattctc ctgcctcagc ctctggagta gctggggtta caggcacccg 7921 ccaccacgcc tggctagttt ttgtgttttt agtagagatg gggttttgcc atgttggcca 7981 ggctggtctc gaacccctaa cctcaggtga tctgcctgtc tcgggctccc agaatgctgg 8041 gattacgggt gtgagccaca gtgcctggcc cggacttctt ttttgagatg tgtatttctg 8101 tgaggtagga aagctcggct cctttgacta actggagata atgaagatct cagacctaaa 8161 gaagctctcc ctcctttgac ccctttgcta catgttacat atttttagag aaactccttt 8221 gctacatgtt acattttttc agaaaaaccc atttgctaca tgttacattt tttcagaaaa 8281 acccatttgc tacatgttac atgttttcag aataactata tatctggcac ttgagtgtag 8341 tcttcagatg cttacagatg tgcacgctgt tctataatca tttcttaatc cttttgctcc 8401 cctacttcta aaagtattat gctggatgtg gtggaagctg taaaacagga atttcccagg 8461 cttgtaaggc caggcaggag atcacagtac cttttttaag gctcagaagc aatgtagaga 8521 atactggtgg aatctcagtc tgggtgcaga ggttcatcct ctttctcctc tgctcctaga 8581 agcgagatga gcgcagtgtg aagagcacag ctgtgcgggt ggtagacttt ggcagtgcca 8641 cctttgacca tgagcaccat agcaccattg tctccactcg ccattaccga gcaccagaag 8701 tcatccttgg taagggaggg caaggctgtc caagtgtgtg agatgatgtg agggtggggc 8761 cacctaagcc tcataacacc ttttccctcc cattctcacc cagagttggg ctggtcacag 8821 ccttgtgatg tgtggagtat aggctgcatc atctttgaat actatgtggg attcaccctc 8881 ttccaggtaa gtgatgggat gtcttacttg actgcctggc attcttctac ctggttcctt 8941 tgttttctgc tgaggacctg cctactcagt tctccacatt gcctgccttc ctggcagctg 9001 atccttagca tgcccttttc agtcccatgc tcagtttctg tttttgtttc ccagacccat 9061 gacaacagag agcatctagc catgatggaa aggatcttgg gtcctatccc ttcccggatg 9121 atccgaaaga caaggtgaac cttgaggggg cactagttaa ctcttttcct tttctctcca 9181 cagaattggt ctatttcaca tcattttctt ttttctttga tacctcctct ccccccagtt 9241 actttcagat ggggaaataa gggaattgta acaagggtga ccttctgatt cctcaacctc 9301 cccttcccct ctagaaagca gaaatatttt taccggggtc gcctggattg ggatgagaac 9361 acatcagctg ggcgctatgt tcgtgagaac tgcaaaccgc tgcgggtgag ctgggctcgg 9421 gataaatagt gcccaccgtc cagaagtcac ttccttctta gggtggttgc cccctggaat 9481 gctcttcaac aagccagagg gttaggaaag gaggggagga aagctgaaag aagacatctt 9541 tggtcaacag aggaaacata agagggagtg gttttgcgga gggaaggagg ttagacagcc 9601 taaccttgag acaaccagag atcaaagcaa tgtcctggat tctttaggtc agacagaaaa 9661 gaataaacta cccttgaaga gcttacattt taatgaggaa ctaaagaaga ttcatgaagt 9721 tgacaaggat atacaagtag aaagaacttt caaagattat ggagtaattg tgctagaggg 9781 aaggtaggtt gagctataat atcagaaacg ttggtcctgg tgtggtcgtg gttaggtagc 9841 cttcaaattg gttgcaagca gagccttggt tctccaagaa tgaaaggtag ggtcttgaag 9901 aaggcagggt ttgtaaggca tcctgcctca cctttttcct gccctcctcc accagcggta 9961 tctgacctca gaggcagagg aacaccacca gctcttcgat ctgattgaaa gcatgctaga 10021 gtatgaacca gctaagcggc tgaccttggg tgaagccctt cagcatcctt tcttcgcccg 10081 ccttcgggct gagccgccca acaagttgtg ggactccagt cgggatatca gtcggtgacg 10141 atcaggccct gggcccccct gcatctttta tagcagtggg tgtccagtcc aggacactgg 10201 tgctttttta tacaagagaa cgagccagag ttcactcctt cctcctggct ctctatatac 10261 ctgtgaatat gtgaaatagt gtaaatatga aagaacttgt acctatcact tcaacccctg 10321 ccttgtacat aatactattc catccacaca gtttccaccc tcacctgccc cctcatacgg 10381 agttggatgg gggccgagtg aggtaaccag gtggcatcta ccccatgttt tataaggaat 10441 tttgtacagt ctttgtgaaa taaaataacg tgcttcattt gacccccatc cctggagttg 10501 gaggtttggg aatgctgggg tggagggatg aaactattgg caaactttct gagtttgggt 10561 atgaagggag tcctccttac cctccaaaat gaagcacagc caggctacca tttatttccc 10621 ctgtccacct tatcatatgg gagggtagtg atgggtgggg cagcatttct ttcagattaa 10681 aacagagaag tgttatgagg tggcacttct cggatgtgga attatgagag ttgggaagat 10741 ctgactccta gagtcattag gccgcggccc agtatagagc cagaaactca ggttgaaaac 10801 gtgctcaacc tggctcctag gggggattgc caggaccggt cagaaggctc ccgtcgtcca 10861 tctcgggaga ctaggaaggc cggattccta cgcgaggcct gctgggaagt gtagttcgtt 10921 agtggaagga agtcacatgg aagaggggcg gtagttggtt gtgggcactg ggttagaggt 10981 atcacgtggg ggcactttcg tcttagcttt tggacaagac gcaggcgcaa acccacggct 11041 gctgcggggg atccttgtgg ccctttccgg tcggtggaac caatccgtgc aacagagaag 11101 cggggcgaac tgaggcgagt gaagtggact ctgagggcta ccgctaccgc cactgctgcg 11161 gcaggggcgt ggagggcaga gggccgcgga ggccgcagtt gcaaacatgg ctcagagcag 11221 agacggcgga aacccgttcg ccgagcccag cgagcttgac aacccctttc aggtgacttg 11281 cgccagtcgg cctcttttgg gcggtcaggt tgattcttcc cggttctgta gggtcgggct 11341 aacttgtatc cccatttgtg acatttgatc ctgggaagag ccgccacgtg gggtgacagt 11401 gactccagac cagtgagcac tctggggggc gggccctgcc ccttaatggg ctggtgctgg 11461 cagtgttgga tgatggattt gaggtaaatg ttgtcccagt cctgggacaa cggcgtggcc 11521 ccgaaggtgc agtacttgga gccacaggca gtttgggaat ggtccctggg aattccccta 11581 ggtggcactg ggtgccagct gagacccggg tctctgccct caggacccag ctgtgatcca 11641 gcaccgaccc agccggcagt atgccacgcg tgacgtctac aacccttttg agacccggga 11701 ggtgagctac tggagagcat aggagttcaa ggaagggaag ggtttgccag tgagacaagt 11761 tagcttgggt acaggggact tttctgcatg cacaggaagg aagaggcact gtgatcttag 11821 ttccctgaga gaggctagtc agccatggga tgcctcatcc ttctagggcc accagggcca 11881 ccaacccaaa tggtttgagc tctaaaagtc agtcctggga agatctggag gctggcaagg 11941 ccttaaccaa tgttctgtat gggaaggctt tcttgaagaa aataaaggtt ggggctggtt 12001 ggagtttagg ttaggtgtgc agtttgaaag caaggaatga ggaggactac acctgtttgt 12061 cccactggat tagaagatgg tgagttgtaa gaatttaagg ggttttcatg gtacttaaaa 12121 cttaaaaaaa aaagccaagc atggtgtgca tgcctgtaat cccacttggg aggctgaggt 12181 gggaggattg cttgagccca ggaatttgat ggagaccatc ctgggcaaca gcgagacccc 12241 catctcttga gacgttttaa aaattggcta ggtgtagtgg cacatgcctg tagtccttgc 12301 tacttgggag gctgggatgg aaagactgct tgggcccagg agtttgaggc ttcgatgagt 12361 gatgattgca ccactgcact ccagcctggg tggcagagag acaaccccac actccatccc 12421 cagaagaaaa gagaaaaaaa aaacacagtt aagtggggag atatggtaaa tgagtgattt 12481 gagtcctcac taaggaattg gatatgaggg atgataaggc tgctacaggt catcatagag 12541 ttctctgaca gaaattcaac agaagaacta gcttcctaac aaatggggga aaggctgctt 12601 ccttacatca tgagttctgg gtctaaaggt attcaagcag actatagatt cctgtgttat 12661 gagggactag atgatcacta aggtcttttc ctcaacagcc accaccagcc tatgagcctc 12721 cagcccctgc cccattgcct ccaccctcag ctccctcctt gcagccctcg agaaagctca 12781 gccccacaga acctaagaac tatggctcat acagcactca ggtacaggag gtgtgcaggt 12841 gagggctggg gagaagggcc agtcctgggg cccagggctc acatctccgt ctgctcacac 12901 tgggcaggcc tcagctgcag cagccacagc tgagctgctg aagaaacagg aggagctcaa 12961 ccggaaggca gaggagttgg accgaaggga gcgagagctg cagcatgctg ccctgggggg 13021 cacagctagt aagtaataga gtggggaaga gcatctgaga gtttgggagg agcaaaaaca 13081 ggtcttaagg gcctgggctc ggctgggtgc ggtggctcac acctataatc ccagcacttt 13141 gggaggctga ggtgggcgga tcacgaggtc acgaggtcaa gaaatcgaga ccatcctagc 13201 caacatggtg aaaccctgtg tctactaaaa atacacacac acaaaaatta gctgggcatg 13261 gtggcgcgca cctgtagtcc cagctactcg ggaggctgag gcagggagaa tcgcttgaac 13321 ccgggaggtg gaggttgcag tgatccgaaa tacgcgccac tgcactccag cctggtgaca 13381 gaaatgagac tccatctcaa aaaaaaaaaa acacaaacct gggctctgga gtcagactgc 13441 tgagtttgaa ttttagagat actgcttact aaccatgagc ttttagggaa gttacttaat 13501 cttgaatgcc tcagtttccc tatttataaa atgaaaacca cgtttgcatc cgtttgacag 13561 ggttgttgtg aaggttaaac aaaatatatg tgaagtactt ggtacagtgc ttagatgttt 13621 taaataataa taaacagtat taatttttcc actctcactt gtctccttgg accccaaatt 13681 ggagataaaa gagaggatcc agggctgggt atggtgcctc acgcctgtaa ttccaacact 13741 ttgggaggcc gaggcaggca ggttgcttga gttcaggagt tagagaccaa cctggacaac 13801 atagtgagat cctgcctcca aaacattaat gaaaaaaatt agcagggcat gatgtacctg 13861 tctgtagtcc tagctcctca agaagctgag gtggacggcc aggcgccggg gctcatgcct 13921 gtaatcccag cactctggga ggccaaggcg ggtggatcac ctgaggtcag gagttcagga 13981 ccagccaaca tggtgaaacc ctgtctctac taaaaataca aaaattagct gggcgtggtg 14041 gcatgtgccc ataatcccag ctactcagga gactgaggca ggagaattac ttgaacccgg 14101 gaggcggaga ttgcactgag ccaagatcac accactgcac tccaacctgg gcaacaagag 14161 cgaaactcta tctcaggaga aaaaaaaaaa aaaaaaaaag ctgaggtggg agggttgctt 14221 gagcccagga ggttgaggct acagtgaacc atgatcatac cactaccttc cagcctgaac 14281 aacagagacc ctatctcaaa aaaaaaaaaa aaaaaaaaag agaggatcca gggatggaga 14341 gaagggggag tgattccttt gttggtctct gttttacttc tggggtccac ctgttttttg 14401 gcgccttaca gctcgacaga acaattggcc ccctctacct tctttttgtc cagttcagcc 14461 ctgctttttc caggacatct ccatggagat cccccaagaa tttcagaaga ctgtatccac 14521 catgtactac ctctggatgt gtgagtagtg agaagccttt tggaggaagt tacaggtaga 14581 tctcttaact gccctggggt ccgctcacca agaaccaaac acttcacctc tatttagaac 14641 tcaccagcct gctagcaaat gttcttgccc tctccccact ttttattgtc catatgcagg 14701 aatatatttt gaattcttta gatgtctttg ggctgggtgc agtggtatac gcctgtaatc 14761 ccagcacttt gggaagctga ggtgggtgga taacctgagg tcagaagttt gagactagcc 14821 tgatcaacat ggagaaaccc catctctact aaaaatacaa aattacctgg gcgtggtggc 14881 acatgcctgt aatcctagct actcaggagg ctgaagcagg agaatcactt gaacccggga 14941 agtggaggtt gcaatgagcc aagatcatgc cattgcactc cagcctgggc aacaagagca 15001 aaactccatc tcaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaggc cgggcacggt 15061 ggctcacacc tgtaatccca gcactttggg aggcccaggt gggcagaaca tgaggttagg 15121 agatcaacac catcctggct aacacggtga aaccccgtct ctactacaaa tacaaaaaat 15181 tagccgggcg tggtggcggg tgcctgtagt cccagctact taggaggctg aggtaggaga 15241 atgggcgtga acccaggagg cagagcttgc agtgagccga gatcctgcca ctgcactcca 15301 gcctgggcga cagagcgaga ttccatctca aaaaaaaaaa aagaaaaaaa aaaattagct 15361 gggcgtggtg gcgggcgcct gtagtcccag ctactctgga ggctgaggca ggagaatggc 15421 gtgaaccggg gaggcggagc ttgcagtaag ctgagattgc gccactgcac tccagcctgc 15481 gcgacagagc cagactccgt ctcaaaaaaa aaaatgtctt tgcaagggga tcacacatta 15541 ttgacatttg ctttctccat ctcccctgtt gaggattcca tgaaggcagc agctgccttc 15601 atttccttac tctgccatgt ttggtgaata ttataggatg agcatagatg ggaaggagcc 15661 ttcattgcag tccagaaggg ctcctcatcc tgtccctctg ccccttaggc agcacgctgg 15721 ctcttctcct gaacttcctc gcctgcctgg ccagcttctg tgtggaaacc aacaatggcg 15781 caggctttgg gctttctatc ctctgggtcc tccttttcac tccctgctcc tttgtctgct 15841 ggtaccgccc catgtataag gctttccggt aagtgtgtta gtggtgggag agtgatggag 15901 acctgggatg ggccccacgt ctgcccatcc ttcagctcta attcttctcc caccctcccc 15961 atttttttcc tctttgtagg agtgacagtt cattcaattt cttcgctttc ttcttcaatt 16021 tcttcgacca ggatgtgctc tttgtcctcc aggccattgg tatcccaggt tggggattca 16081 ggtttgtgag gctgttatcc accctcacct ttccctctag atccagccag cactgggtgc 16141 tggataggag ttgttcagaa aaaggaaatg tggttttaat ccttgggagg tactagttta 16201 atgagataca agacatactt ccaggataga gcgcagacag cactcttgac acatatagat 16261 tgaagggaag aatgctgcat ttggccaatc taaggtggct tcctggagga ggcatagaac 16321 catggttgaa aggaggaaga agaatcccca gatgagtgcc tggaagagct tggagagctc 16381 agggctaatg gttcagaaac tggaattaaa ctatgaagag attagataca gtttgggatt 16441 gtggtgcttg gaatgctgct cgattgacag ggagccactg ctgatgggaa gggtaggaac 16501 aggggatgct ggtgacataa ccagtggaga agctgaggag cccctcttca ctggtacatc 16561 cttcccttta cagtggctgg atctctgctc tggtggtgcc gaagggcaac acagcagtat 16621 ccgtgctcat gctgctggtc gccctgctct tcactggcat tgctgtgcta ggaattgtca 16681 tgctgaaacg ggtgagggct gtgtcgaagg tggggccggg atggtgagat catgggtccc 16741 caggggcgtg ggtggaacat tcaggagcaa ctggcacagg tcaggctgct gggttgttct 16801 cagctaatgg acctctgggg tgtgtgtttc tgtgtgtgag tgtgtgtgct gggcagcagg 16861 ctgctgagtg gtagtgatgc tgttaggctg ggggtgggga accagtggct ggaatgggcg 16921 gtaatgtctt tgtcctctac ttgcagatcc actccttata ccgccgcaca ggtgccagct 16981 ttcagaaggc ccagcaagaa tttgctgctg gtgtcttctc caaccctgcg gtgcgaaccc 17041 gagctgccaa tgcagccgct ggggctgctg aaaatgcctt ccgggccccg tgacccctga 17101 ctgggatgcc ctggccctgc tacttgaggg agctgactta gctcccgtcc ctaaggtctc 17161 tgggacttgg agagacatca ctaactgatg gctcctccgt agtgctccca atcctatggc 17221 catgactgct gaacctgaca ggcgtgtggg gagttcactg tgacctagtc cccccatcag 17281 gccacactgc tgccacctct cacacgcccc aacccagctt ccctctgctg tgccacggct 17341 gttgcttcgg ttatttaaat aaaaagaaag aggaactgga actgacatcc ccgtttcctg 17401 aatcttcatt gggaattagg ccttatgaaa gagaaaggag agtgtggagt gaggtgggaa 17461 ggtcggaccc ggctttagtg taaactggga gagagatgag gggcggggcg gtggagcccg 17521 agtggctggc gcaggaagga ggtgggaagt ccacggaaac gcgaaacccg gagacgccag 17581 ggagccctgc tccccacccc tctccattaa tgacggggaa gagccaccgc ctctgccggg 17641 aacgccaagg aatacgcggg cctggagcct gaaaagctgg atggggctca agtggaggcc 17701 caaaggatca ctagcagcct agccagggtc cagagcgagg cagggactgg aggagcgctt 17761 atccgactgc ctcgccctgc cgcgggatcc cccaaccccg acagggtctc agtcccgaac 17821 tacaactccc ggggtgcacc gcgccggccc tcgccgccat gcccctcctt cccgcaccat 17881 ccccattccc atcccccttc tctagtcccc gacctgcggc agccggagct cggggagcgg 17941 agcgtggtgg ggaggggagc gggacaggcg acacaggaga cagcggcgcc gcggcctctc 18001 cccaccaggc ggccccggat cctactggac gccctgaggg cacaccgacc gcgcctctag 18061 agtcacccca cgccgacccc tcccctcttc tctagactta tttccatcct tcccgctttt 18121 accctcccca cccgtccctg ggctccaggc cgccgccccc tcctcactcc tggaccggcc 18181 cttctcggtg cccctcttcc ctagggagat gcgatgagcc ggtgcccccg cgtcctcatc 18241 gtcgccccgg gcacggtgcc cgtccagtgc ccgtggtggg gagggagcac tccgcggtcc 18301 ctccgtgacg cccctcgctt ggcccccccc acagctggcg tccctcggcc atgccccagg 18361 ggacccagcc agggggtggg ctctagagcg agtggggtgg agaggagaaa ggacggggcc 18421 ttgggggcct ctgagatgct cccaagtgcc agggagggcc gagcgaggcg caggcaaccg 18481 ggcagcaggc atgatgccct cgcctagtga ctccagccgc tcgctgacca gccggcccag 18541 caccaggggc cttacccacc tccgcctcca ccgaccctgg ctgcaggccc tgcttacgct 18601 ggggctggtc caagtgctcc tgggcatcct ggtggtcacc ttcagcatgg tggcctcttc 18661 cgtcaccacc accgagagca tcaagaggtc ctgcccgtct tgggctgggt tctcggtgag 18721 ttgggtgcac agttgttggg tgggggaggc tcctcgggcc ccaccctcca caccagctgc 18781 aagtccacat gcttgcctct cctccctctc ctggtcctct gcctccttac agggctgggt 18841 gcatcctgtg gtgaggggct cactcaggag cctccccctg ccaggctgag cctgtgccca 18901 ctctgtcccc agctggcgtt ctccggggtg gttggcattg tgtcctggaa gcggccattc 18961 actctagtgg taggtgccag ggtccagtgc ccactgggag gcaggtgccc agcacgcaag 19021 gggaagccca tttatgacct caagaaggga actggctccc cagttggtgc cagccgggtg 19081 ggcacgaagt tctgtgaagg agggggactc tgtccgtggc agaggagtac tcatgagggt 19141 cccagccctg agtctgcacc ctgttttccc cagatctcct tcttctcctt gctttcggtg 19201 ctctgtgtca tgcttagcat ggctggctct gttctctcct gtaagaatgc tcaactggcc 19261 cgagacttcc aacagtgctc tctggtgaga tttgaggagg gagagctgga aagaactggc 19321 tgggggaggt gtgcaggaca cctcagtttg tgctgactca ggctgcctca ccctccctgc 19381 tccactcagg aaggaaaggt ctgtgtgtgc tgtccctctg ttcccctcct ccggccctgt 19441 ccagagtcgg ggcaggaact gaaagttgcc cctaactcca cctgtgatga agcccgaggg 19501 gccctcaagg tgagcttgca ccctgcaaac atcctcctgg ttctcactct tgcctcccct 19561 gctggggatc tcacaggccc ataatgtgtg gaacttagcc gtctcctgac tcctctcctc 19621 ctttcccttc cccagaacct gctcttcagc gtctgtgggc tcaccatttg tgccgctata 19681 atctgtacac tctctgctat tgtctgctgc atccaaatct tctccctgga cctcgtgcat 19741 acggtgagaa gggagcaggg gccagggcac gcaggtatgg tgggggcagg ggtgtgtgtg 19801 ctgagacttg cctgagggaa caatggctac agatctggct tttgagcacc aggggctatg 19861 ggttatctat tatcctcatc tattggagga atggatgttt agtggggagg atgagaaggg 19921 gagatggcag ggagtgagtt aagtatgtga tgctggtctg ggaccaggag agggtgcttt 19981 agagaatggg actggatggt ggggtagagt caagaaggtc ctatgctggg cacggtggct 20041 cattcctgta atcccagcac tttgagaggc ccaggcaggc ggatcacctg aggccaggag 20101 ttcaagaccc acctggccaa catggcgaaa ccctgtctct attaaaaata caaaaatcgg 20161 gccgggcgca gtggctcgtg cctataatcc cagcactttg ggaggctgag gagggcagat 20221 cacctgagat caggagatcg agactatcct ggctaacatg gtgaaacccc atctctacta 20281 aacatacaaa aaattagcca ggcgtggtgg cgggcgcctg tggtcccagc tactcgggag 20341 gctgaggcag gagaatggca tgaacctggg aggcagagct tgcagtgagc tgagatagtg 20401 ccactgcact ccagcctggg cgacagagca agactccatc tcaaaaaaca aacaaacgaa 20461 atacaaaaat tagccaagtg tggtggcggg tgcctgtaat cccagctact tgggagtctg 20521 aggcaggaaa attgcttgaa ccggggagtc agaggctgca gtgagctgag atggtgccac 20581 tgcattccag cctgggcgac aagagcacag actccgtctc aaaaaaaaaa aaaaaaaaaa 20641 aaaaaaaata tatatatata tatatatata tatatatata cacacaaaat tacaaaaatt 20701 aggctgggct ccgtggctca tgcctgtaat cccagcactt tgggaggcca aggcaggagg 20761 atcaccagat atcaggagtt tgacaccagc ctggccaaca tggcgaaacc ctatctctac 20821 taaaaataca aaaattatcc gggtgtggtg gcgggtgcct gtaatcccag ctactcggaa 20881 gactgaggct ggagaatcgc ttgaacctgg gaggcagagg ttgcagtgag ctgagatgta 20941 gccattgtac tccagcctgg gcgacaagag tgaaacttcg tctcgaaata ataataataa 21001 taataattag ctgggcatgg ttgcacaccc ttataattcc tgctactcag gaggatgagg 21061 catgggaatt gcttgatctt gggaggtggg ggttgcagtg agctgagatc gcgccactgc 21121 actccagcaa cagagtcaga ctctgtctca aaaaaaaaaa gaaggtccta gatggaggtg 21181 aggctaaagg gtggacattc ctgtgaagac acagaatgtg tggagcttct ggatggaggt 21241 ggggccataa agaggaattt gtgggtgggc agggccagag gctagaggta cagggtgacg 21301 gtggggttaa ggagagctga ttttgggtga gggaggggcc agaataagag cttctctcag 21361 gggatagggc cagaaaaaat aatgtagatg agagaggagt ttggggccca ggaggagctg 21421 tattcccaga atccagactt gctgacctgc tcgcttcccc agcagctggc ccctgagcgg 21481 tcagtctcag gcccactggg acctctgggc tgcacgtccc cgcccccagc ccctctccta 21541 cacaccatgc tggacctgga ggaatttgtc ccgcctgtgc ccccaccgcc ctactatccc 21601 ccagagtata cctgcagctc agaaacagat gcacagaggt aaggcctggt ggggtcttgc 21661 tgggggagct aaggaagggg actggggctg ggcctggatt gctggagaga gggatctttg 21721 tggaggggga attatttctc ctacattgac cctcttcctc atctgccagc atcacgtaca 21781 atggctccat ggacagccca gtgcccttgt accctaccga ttgcccccct tcttatgagg 21841 cagtcatggg actacgagga gacagccagg tgagagcaca gcacggcttg gggcgggctg 21901 gggagccggg ttgtagcctg gaaagctgaa caggctgtag cctctggata aagatagtaa 21961 cagtggtcaa ggtctttaca gcaccagcat gcccactgtt gcctttgatc ttccctagag 22021 cctggtgagg tgagtgggtt ggaagctatc atttctgcca tacagacgtg gaagctgagg 22081 ctgcacaatt aagtgacttg tacaaaggga caaggctggt ctggaatcta agtctccaga 22141 gctctgtcta gcacatgggg gtgttcagtg tgtagggctg aacccttgac cctgtgtctt 22201 ctgcaggcca ctctctttga ccctcagctt cacgatggct cgtgcatctg tgaacgagtg 22261 gcctccattg tagacggtga gcagggcgta atgaggggtg gacaagggcg gggctgccag 22321 ggatagctgg ggtgggtaga gacaatagaa ggggaaaaca aggcggagtt ggtggtttgg 22381 ggacatagga gggctgtggc aacttggaac cctggactta tttttctcct ctgagataaa 22441 gctggaggca cagtggcccc tagaggctgg gcttgggaga agaggaactg cctgggcagg 22501 gctaggccag gccaggctgg tgctacacag cgccccctgc cgcccacagt gtccatggac 22561 agcgggtctc tggtgctgtc agccattggt gacctccctg ggggctctag cccgtcggag 22621 gactcgtgcc tgctggagct gcagggctcc gtgcgctccg tggactacgt tctctttcgc 22681 tccatccagc gcagccgtgc cggctactgc ctcagcctgg actgtggcct gcggggcccc 22741 ttcgaggaaa gccccctgcc acggcgcccc ccacgggctg cccgctccta ttcctgctct 22801 gcccctgaag ctccaccccc actgggtgcc cccacagctg cccgcagctg ccaccggttg 22861 gagggctggc cgccctgggt gggaccctgc ttccccgagc tgaggcggcg ggtcccccgg 22921 ggagggggcc gcccagccgc agccccgccc acccgagccc cgactcgtcg cttcagcgat 22981 agctcaggtt ccctcacccc accggggcac cggcctcctc atccggcatc cccaccaccg 23041 ctgctgctgc cacggtccca cagcgaccca ggcatcacga cctccagtga cactggtgag 23101 ccccctcccc gactgcccag gctcaggaga gggtaggcac tgggagttag gtggccagtg 23161 atgcccacca ggattggggc acagttgagg tgggtgagga ggaagaaagg gtgagttcac 23221 tcctctggta gacattctca ggtactcacc tcctagggac agaacatcca tagcttagca 23281 gctaaaaaag gagaattttt tttttttttt gagacggagt ctctggctct gtcgcccagg 23341 ctggagtgca gtggcatgat cttggctcac tgcaacccgt ttcccgggtt caagtgactc 23401 tctcctgctt cagcctgccg agtagctggg actacaggtg tgcgccacca tgaccggcta 23461 atattttttt tttttttttt ttttgagacg gtgtctcgct ctgtcaccca ggctggagtg 23521 cagccgcgcg atctgggctc actgcaactc cgcctcccgg gttcacgcca ttctcccccg 23581 ggttcacgcc attctccttc ctcagcctcc cgagtagtga gtagctggga ctacagatgc 23641 ctgccaccac gcccggctat tttttgtatt tttagtagag atggggtttc accgtgttat 23701 ccaggatggt ctcaatctcc tgacctcgtg atctgcccgc ctcggcctcc caaagtgctg 23761 ggattacagg catgagtcac cgtgcccggg caatttttgt attttttagt agagacaggt 23821 ttcacccttt tggccaggct ggtcttgaac tcctgacctc aagtgatcca cccgcctcgg 23881 cctcccaaaa ggatttattt tttgaaacca gttccacagc tctcagcttg gtccacttat 23941 ctgtcctccc caagcttcag ctgtcacttg ttaacatgta taataatagt acttcacgcc 24001 gggcacggtg gcttgcacct gtaatcccag cactgtggga agctgaggtg ggtggatcac 24061 ctaggtcggg agttcgagac cagcctggct aacatggtga aaccctatct ctactaaaaa 24121 tacaaaaatt agccaggtgt ggtggagcgc gcctgtaatt ccagctacta acgagagagg 24181 ctaaggcagg agaatcgctt gaacctggaa agcaggggtt gccgtgagcc aagatcatgc 24241 cactgcactc cagcctgggt gacagagaca cactccatct caaaaacaca aacaaacaaa 24301 caaaaaacat gtataataac agtacttcag ccataggcat tgtactcgaa gatgctgaga 24361 aagaacagtg gcgggcaagg ctgttcacac ctataatccc agcactttgg aggccaaggc 24421 aggtggatca cctgaggtca ggagttcaag accagcctag ccaacatggt gaaaccccca 24481 tctctactaa aaattgaaaa attagctggg cctggtggtg gacgcctgta atcccagcta 24541 ctagggaggc tgaggcagga gaatcgcttg aacctgggag gcggaggttg cagtgagctg 24601 agatcgtacc actgcactcc ggcctgggca acacagtgag actccatctc aaaaaaataa 24661 gaaaggagat agtactgggg aacgctcagc actgtgcgcc aggtgctgaa caacaccact 24721 gcagtccttg ttgtggtgga ttgtaccatc tagttgctgg ctaatatgga cagagatgct 24781 ggccctttga ttggggatgg agcgtgggag ctgtgaaagc tcctctgggc ttgagttccc 24841 acaggagggt gggcgtgtcc acagaacact tccactcact ccctgtctcc ctttctctct 24901 tctccccagc tgacttcagg gacctttata ccaaagtgct tgaggaagaa gctgcttctg 24961 tttcctctgc agatacaggt caggcatgtg gtttgcgccc cagggatggg gattgggcat 25021 ggctgcccag ccccctctcc accctacaat accattctct tatctctgtc tctctgcagg 25081 gctctgctct gaagcctgcc tcttccgcct agcccgctgc ccttccccca agttgctacg 25141 tgcccggtca gccgagaaac ggcgccctgt gcccaccttc caaaaagttc ccctgccctc 25201 gggccctgca cctgcccact ccctggggga cctaaagggc agctggccag gtcggggcct 25261 ggtcactcgt ttcctccaga tatccaggaa agccccagac cccagtggga ctggagctca 25321 tggacataag caggtaggaa ttcggggagc caggaaagat gtttgggaaa gcgtggagct 25381 tcagattgag ccttattgat gatgcccttt cttgtgtccc tgtccaggtg ccccggagcc 25441 tgtggggccg gcctggccga gagagcctcc accttcgcag ctgcggagat ctgagctcta 25501 gctcttccct gcggcgtctc ctgtctggcc gcaggctgga gcgtggtacc cgcccccaca 25561 gcctcagcct caacgggggc agccgggaga ctgggctctg acctaggctt cttgtcacac 25621 tgaacacatc cagccacagg caccagctgg ttgggaccag cagcccccag catcctcttg 25681 cactggctgg cacaaaaaga aacctgctgt atacccccca aagtgtccct ttccctccta 25741 cctctggggt ctcttgctgc ttgcctctgc tgctctggtc tgggagagct tctgtcctgt 25801 gctgcatggg tatttagact gtgggggaga tgccccttct tatagcactg gaggaggaaa 25861 acaaattctt gtccccctca gaatgagagt ggctctttct gatttgcaag ggcactatgg 25921 tcagggcaaa ggcatggccc aggtgtttaa gtacagggtg acgtgtgcct atgcaatggg 25981 gtggtaaggc aggcacgaag agtccaaaaa atctaggtgg cctctcagct ctgccacctc 26041 tagctgcatg accttgggca agctatgtaa ccccaattgc ctgctccatt aaagactgtg 26101 aaggtagaat gtttgtaaag ctcttaacag tatgtaagcc ttcaataaat ttcagttttc 26161 cccttgtttt cttgatcatt ctctgtcacc agtgaaattt gttctagtgt ctctcatatt 26221 taagaaaact ctttcaggac tgggtatggt ggctcacacc tataatccta gcactttggg 26281 aggccgaagc aagaggatcg cctgagccta ggaattcaag accagcctgg gcaacatagt 26341 gagaccctgt ctctacaaaa aacaaaaaat tagccaggca tggtgggaca cgcctgtagt 26401 cccaactact caggtggcta aggtgagagg atcacttgag cttgggaagt ccaggctgca 26461 gtaagctgtg attgagccac tgcactacag cctgggcaac agagcaagac catgtctcaa 26521 aaaaaaaaaa aaaagaaaaa agaaactttc aagacactct ttccaaccac taattgtaac 26581 tctgctcctc cttttcacag caataggttt tctttttctt ccctccactg ttaaacatcc 26641 attctctcct cacccacccc catcagactc cttcccctat ctttccacag ccactgctct 26701 gaccaaactt tccagtgacc acagtggtgt cagacccagt gaccatttct ctgcctgcat 26761 ctcacttgac ctcgaggcag caattaatac ccataatcag catcttcttg aatttgtccc 26821 tttgaaaagg gaaatattgg ctcttctact ttgtcctgct gaactgctta acattggagg 26881 gccccagggc cctcacctaa gccctctttc ctacctccac tctttctata ggtggcccta 26941 ctactaaagt ccatggcttt aaataccatc tttctatgtg ttaatccatg actccagcct 27001 tgacctccca tgagcgccat ccaactcagc atgtctgctt ggatgtctaa tgggcatttc 27061 agattcaaca tggccacaac tgaactcttg attcccaccc cagcaccggt tatttttcca 27121 ctgttcccat ctcaatggca cctccattac ccatttgcac attccaaaag ctcaggaacc 27181 atggtgactt cttttcccat atccaacaca accaatccta tcctgaattc atccacatcc 27241 caccacctcc ccagctacct agctccagcc atcctctctc cacaacctct gaatcagtct 27301 ttcacttttc ccagcaatcc attctccact cagcaaaatg atgataaagc acgtcacatc 27361 aaggctctgc ctcaatttaa tggcttccca ttgtatttag aatcatctcc aaactcccag 27421 agactatggt cgagctacaa tctggcccac cttctgttcc agccaaattt cctcacagca 27481 caaggacgtt tgcacctgct gttttgccaa gcatgaaacc cttggcccct atatctggtg 27541 ctatcaccta atatcaggtt ttagctccat tctcaccatt tcagtgagca cccaatcccc 27601 atcgcagtca ttctatcaca tagccatgtt tttttttgtt tgtttgtttc attttgtctt 27661 tttttgagac agggtcttgc tttgttaccc aggctggagt gcagtggtgt gatttgggct 27721 cactgcaacc ttccacctcc tgggtcaagc aattctcctg cctcagcctc ccgagtagct 27781 gggattacag gcgcccgtcc ccatgcccgc ccagctaaat tttgtatttt tagaagagat 27841 agggtttcac catgttggcc aggcgggtct caaactcctg acctcaagta atccgcctgc 27901 ctcggtctcc caaagtgctg ggattacagg tgtgactcac cgcgcctggc cacataccca 27961 tggtttcagc atgtatcact atctaaaatt attatttttg tttatatatc tgtgtcgtcc 28021 catagaaggt taaggtccca agatcagaaa cttgctcatt gcagtgggtc taacactcag 28081 taggtcctca acaaacattc gttaagatac taaagtggca gggtggggcc ctgtaaacag 28141 cttcaggagc ttcaggaccc tgtgcttgta ggggcaacgt ggtgccctcc aaggaagaca 28201 gggaggtggg aggagcactg cccagagatg gcgtcaggct gcaagacttc ttgaataatt 28261 cagcatcata acaacccagc ctcaggaagg gatagggcac ggccaggacg aaacattagg 28321 aggcgatgga caatgggatt cccacggggc agcttctgcg cactggacgt tccctaacct 28381 gaggctctct aaagaggaag gttaggaatc ctctgagctt cggtgggctg gactcactgt 28441 gggaattcaa tcgcccccat ccaccaacag tgtgctggcg ggaaaacgcc gacacgcatg 28501 cgtagttctc gcgccggctc ctctctctct ctctctctct ctcgctcgct ctctcgctct 28561 ctcgctctct ctcgctcgct ctctcgctct cgctctctct ctctctccgg ctcgccagcg 28621 acacttgttc gttcaacttg accaatgaga cttgaggaag ggctctgagt cccgcctctg 28681 catgagtgac cgtctctttt ccaatccagg tccgcccgac tccccagggc tgcttttctc 28741 gcggctgcgg tgatcggtcg ggctgcatcc tgccttcaga gtcttactgc gcggggcccc 28801 agtctccagt cccgcccagg cgcctttgca ggctgcggtg ggatttcgtt ttgcctccgg 28861 ttggggctgc tgtttctctt cgccgacggt aggcgtaatg aatatttcga cctttggatc 28921 ttagctgtcc cctccctgcg ttcgcactta acctttttca ccattattat tattattgtt 28981 attattatta ttttttgagg gagtctcgcc ctgtcgccca ggctggagtg taatggcgcc 29041 ttcttggctc actgcaacct ccgcctcccg ggttcaggcg attctccgac ctcagcctcc 29101 caagtacgtg ggattacagg cacccgccac cacgcacggc taattttttg tatcttttag 29161 tagagacggg gtttcaccat gttggtcagg ctggtctcca attcctgacc tcgtgatccg 29221 cccgcctcgg cctgccaaac agctgtgatt ataggcgtga gccaccgcgc ccggccaacc 29281 atcattatta tttttaacgg taaggatggt cagattttac taatgaagaa gagattataa 29341 aatcttcaag tctttatatc cacttgcttt ttgaggggtg gagtgggaag aaggttatgt 29401 aattcatacg ttcttcagag atgtgacaaa cattcacgga gccggacgac gtcgggttgg 29461 attcgcactg gagctgcaga tgggtgccag gatggactgg tccctaccct ccgcttgaac 29521 ctaggaggcg gaggttgcag tgaaccgaga tcgtgccact gcactccagc ctgggtgaca 29581 gagatactcc gtctcaaaaa aaaaaacaaa acaaaaaaca agcggactgg gcgcagtgcc 29641 tcaccctgta atcccagcac tttgcaaagc caaggcggga ggatcctttg agtttaggag 29701 tttgagacca acctgcgcaa cacagtaaga ccccgtctct acaaaaaata cagaaattag 29761 ccaggtgtgg tggtgtgcgc ctatagtccc agctattctg gaggctgagg tgggaggatt 29821 gcttattctg gaggcagagg ttgcactgag ccgaaatcaa gctactacac tccatccagg 29881 gcaacatacg gagaccctgt ctcaaacaaa caaacaaaaa attgctcagt acctggccaa 29941 aaaagaagag gctcactatg cagaggggaa gtggaaggag atgtttggac ttctaaactc 30001 aatagagcag gagaggcaaa tgtagaatgt gctcaggaaa tatctgtgag atgaatgaac 30061 ttgagggaag taaggtacta gatattacct gccctaccca gaacaaatcc tgtgcaatgt 30121 ttccttgaaa agtgagaagt ctggaagggg tggctactga catagtgaag caactagttc 30181 aattctacaa cttgacagct accctgtgcc aggctatcta cgaggatact tagaatgcat 30241 aagacattcc ttcaaggaac tccaggaaca gaggcctgac atgttgcaat gtttagtgtc 30301 aagcagtgta ctagagacac attatcacac tcaaacctca caacaattct gtgaggtagg 30361 agttatcact ccccttttat agatgaaaca gaggcttaga gtgattgatt tattgaaagt 30421 caaacagcca gtaaatggtg tagccaggat tccaaacttg ctgtctcact gagactgtac 30481 ttaattactg gagggaccgg gtgtggtggc tcattgctat aatcccaaca ccttgggagg 30541 ctgaggctgg tggatcacct gaggtcaggg gttcgagacc agcctggcca acatggtgaa 30601 accccatctc tactaaaaat acaaaaatta gctgggcatg gtggtgggct cctgtaatcc 30661 cagctactca ggaggctgag gcagggcaat tgcttgagcc gagatcacac tgcactccag 30721 cctgggcaac agggcaagac tctgtctcaa aaccaaaaaa aaaaaaatta ctggaggaac 30781 ctagaagaag aaatgatcaa ttttgcttgg agtgtatcta gaaagacttc actgagatca 30841 tttaaagaac aaaaaggatg gctggggtcc aggcagtggc tcatgcctgt aatcccagca 30901 ctttcggata ccaaggcagc agatcacctg aggtccagag tttcagacca gcctggccaa 30961 catagtgaaa ccccatctct actaaaaata aaaaaattag ctgagcatgt tggagggcac 31021 ctgtaatccc agctacttgg gaggctgagg caggagaatc actcgaaccc aggaggtgga 31081 ggttgcagtg agccaagatc acgccactgc actccagcct gggcaacaga gtgagactct 31141 gtctcaaaaa acaacaacaa caaaaaatac aaacaagaga caagtagttc ccaggtgcct 31201 accaagtggt caggcactgc acttacctca ctgactgcag taaccaccct ttgaggttgt 31261 ggcattgctc cattttccag gcaaggaaat gggctgagag ctgggattag tcaggtcatg 31321 actgtgtgtg ccactcccgc taaatctcat ttgatgtggt tcatgaggcc acaccatgga 31381 cagcttcctc cttgtgtcca ctgaggatat ggctttgtac aacactttgg tttttgaacg 31441 actttacaaa cctccctgtc ttgtgaggaa ggaagaacag ttattaccat ctgcatctga 31501 tgatgaaaca agggacgctg cagaggagcc gcactgacca ctccctccct ccagtcctgt 31561 catcccactg ccagtgtccc accctcttgt gccctgcact tcactggcta ataacccccc 31621 tcactttttc ctctgtgaag ccatcctgga taattcccca cccacgaatg gtccctcctc 31681 atctcagaga gctctccatg cacacctgtt accgtttctg tctttatctg taaatatctg 31741 tgtgtctgac ttccatgcct cacacacctc tatagggcaa agactgtctt aaacatcttg 31801 gtagtgtcag tattttgcac agtgaagttt ttttttttaa attatatcag ctttatttgt 31861 acctttttga catttctatc aaaaaagaag tgtgcctgct gtggttccca tcctctggga 31921 tttaggagcc tctaccccat tctccatgca aatctgtgtt ctaggctctt cctaaagttg 31981 tcacccatac atgccctcca gagttttata gggcatataa tctgtaacag atgagaggaa 32041 gccaattgcc ctttagaaat atggctgtga ttgcctcact tcctgtgtca tgtgacgctc 32101 ctagtcatca catgacccat ccacatcggg aagccggaat tacttgcagg gctaacctag 32161 tgcctatagc taaggcaggt acctgcatcc ttgtttttgt ttagtggatc ctctatcctt 32221 cagagactct ggaacccctg tggtcttctc ttcatctaat gaccctgagg ggatggagtt 32281 ttcaagtcct tccagagagg taagagagag agctcccaat cagcattgtc acagtgcttc 32341 tggaatcctg gcactggaat ttaatgaatg acagactctc tttgaatcca gggccatcat 32401 ggctctttga gcaaggcaca gatggaggga ggggtcgaag ttgaaatggg tgggaagagt 32461 ggtggggagc atcctgattt ggggtgggca gagagttgtc atcagaaggg ttgcagggag 32521 agctgcaccc aggtttctgt gggccttgtc ctaatgaatg tgggagaccg ggccatgggc 32581 acccaaaggc agctaagccc tgcccaggag agtagttgag gggtggagag gggcttgctt 32641 ttcagtcatt cctcattctg tcctcaggaa tgtcccaagc ctttgagtag ggtaagcatc 32701 atggctggca gcctcacagg attgcttcta cttcaggcag tgtcgtgggc atcaggtgag 32761 tgagtcaagg cagtggggag gtagcacaga gcctcccttc tgcctcatag tcctttggta 32821 gccttccagt aagctggtgg tagactttta gtaggtgctc aataaatcct tttgagtgac 32881 tgagaccaac tttggggtga ggattttgtt ttttttcttt tgaaacagag tcttactctg 32941 ttgcctgggc tggagtgcag tggtgcaatt ttggctcatt ccaacctctg cctcccagat 33001 tcaagcgatt ctcttgcttc agcttcccag gtagctggga ttacaggcgg ccaccactac 33061 gcccagctaa tttttgtatt tttagtagag acggggtttc accatgctgg caaggcaggt 33121 ctcaaactcc tcacctcagg tgatccgccc acctcggcct cctaaagtgc taggattaca 33181 ggtgtgagcc cctgcgcccg gccaaggggt gaggaatttt gaaaccgtgt tcagtctctc 33241 ctagcagatg tgtccattct ccatgtcttc atcagacctc actctgcttg tactccctcc 33301 ctcccaggtg cccgcccctg catccctaaa agcttcggct acagctcggt ggtgtgtgtc 33361 tgcaatgcca catactgtga ctcctttgac cccccgacct ttcctgccct tggtaccttc 33421 agccgctatg agagtacacg cagtgggcga cggatggagc tgagtatggg gcccatccag 33481 gctaatcaca cgggcacagg taaccattac acccctcacc ccctgggcca ggctgggtcc 33541 tcctagaggt aaatggtgtc agtgatcacc atggagtttc ccgctgggta ctgataccct 33601 tattccctgt ggatgtcctc aggcctgcta ctgaccctgc agccagaaca gaagttccag 33661 aaagtgaagg gatttggagg ggccatgaca gatgctgctg ctctcaacat ccttgccctg 33721 tcaccccctg cccaaaattt gctacttaaa tcgtacttct ctgaagaagg tgaggaggaa 33781 ggggacaaga tgacatagag ccattgaaac ttttcgtttt tcttttcttt ttttaaaatt 33841 tttttgaggc agaatctcac tctgcccatt ctgtcggcga gacaggagtg cagtggtgtg 33901 atctcccctc acagcaacct ctgcctccca ggctatagtg attctcctgc ctcagcctcc 33961 tgagtagctg gaattatagg cgtgcgccac taccacctgg ctaatttttg tatttttagt 34021 agagacaggg tttcatcatg ttgaccaggc tagtcttaaa ctcctgacct caaatgatat 34081 acctgccttg gcctcccgaa gtgctggaat tacaagtgtg agccaccgag cccagcagac 34141 acttttcttt tttctttttt tttttttgag acagagtctc gcactgtcac ccaggctgga 34201 gtgcagtggc acaatctcag ctcactgcaa cctccacctc ccgggttcag gtgattctcc 34261 tgtctcagcc tctcgagtac ctgggattac aggtgcctgc caccacgccc ggctaatttt 34321 ttgtattttt agtagagaca gggtttcact atgttggcca ggatgattgc gaactcctga 34381 cctcgtgatc tgcccacatc ggcctcccaa agtgctggga ttacatgcgt gagccactga 34441 cacttttctt tgccctttct ttggaccctg acttctgccc atccctgaca tttggttcct 34501 gttttaatgc cctgtgaaat aagatttcgc cgcctatcat ctgctaactg ctacggactc 34561 aggctcagaa aggcctgcgc ttcacccagg tgccagcctc cacaggttcc aacccaggag 34621 cccaagttcc ctttggccct gactcagaca ctattaggac tggcaagtga taagcagagt 34681 cccatactct cctattgact cggactacca tatcttgatc atccttttct gtaggaatcg 34741 gatataacat catccgggta cccatggcca gctgtgactt ctccatccgc acctacacct 34801 atgcagacac ccctgatgat ttccagttgc acaacttcag cctcccagag gaagatacca 34861 agctcaaggt aggcattcta gctttttcag gccctgaggg ccctgatgtc tgggggttga 34921 gaaactgtag ggtaggtctg cttgtacaga cattttgtcc cctgctgttt tgtcctgggg 34981 gtgggagggt ggaggctaat ggctgaaccg gatgcactgg ttgggctagt atgtgttcca 35041 actctgggtg cttctctctt cactaccttt gtctctagat acccctgatt caccgagccc 35101 tgcagttggc ccagcgtccc gtttcactcc ttgccagccc ctggacatca cccacttggc 35161 tcaagaccaa tggagcggtg aatgggaagg ggtcactcaa gggacagccc ggagacatct 35221 accaccagac ctgggccaga tactttgtga agtaagggat cagcaaggat gtgggatcag 35281 gactggcctc ccatttagcc atgctgatct gtgtcccaac cctcaaccta gttccacttc 35341 cagatctgcc tgtcctcagc tcacctttct accttctggg cctttcagcc ttgggcctgt 35401 caatcttgcc cactccatca ggcttcctgt tctctcggtc tggcccactt tctttttatt 35461 tttcttcttt tttttttttt tgagaaggag tctctctctc tgtcacccag gctggagtgc 35521 tgtggcgcca tcttcactca ctgtaacctc tgcctcctga gttcaagcaa ttctcctgcc 35581 tcagccttcc aagtagctgg gattataggc gcctgccacc aggcccagct gatttttcta 35641 tttttagtag agacggggtt tcgccaggct gttctcgaac tcctgaactc aagtgatcca 35701 cctgcctcgg cttcccaaag tgctgggatt acagtgtgag ccaccacacc cagctggtct 35761 ggtccacttt cttggccgga tcattcatga cctttctctt gccaggttcc tggatgccta 35821 tgctgagcac aagttacagt tctgggcagt gacagctgaa aatgagcctt ctgctgggct 35881 gttgagtgga taccccttcc agtgcctggg cttcacccct gaacatcagc gagacttcat 35941 tgcccgtgac ctaggtccta ccctcgccaa cagtactcac cacaatgtcc gcctactcat 36001 gctggatgac caacgcttgc tgctgcccca ctgggcaaag gtggtaaggc ctggacctcc 36061 atggtgctcc agtgaccttc aaatccagca tccaaatgac tggctcccaa acttagagcg 36121 atttctctac ccaactatgg attcctagag caccattccc ctggacctcc agggtgccat 36181 ggatcccaca gttgtcgctt gaaacctttc taggggctgg gcgaggtggc tcactcatgc 36241 aaacccagca ctttgggaag ccgaggcggg tgatcacctg aggtcaggag tttaagacca 36301 ccctggccaa cgtgttgaaa ccctgtgtct actaaaatac aaaaaaaaaa aattatctgg 36361 gcatgatggt gggtgtctgt aatcccagct actcaggagg ctgagaaggg agaatcagtt 36421 gaacccggga gatggtggtt gcggtgagcc gagatcgcgc cactgcactc cagcctggga 36481 ggctgagcga gactccatct cgaaacaaaa caaaacaaaa ctatctaggc tgggggtggt 36541 ggttcatgta tgtatgtgta tatacatata tatgtgttta tatgtatata tatatacaca 36601 cacacacata catacacaca catacacaca caaattagct gggtgtggca cccgtgtagt 36661 cccagctact caggaggcta atgtgggagg atcagttgac cctaggaagt caaggctgca 36721 gtgagtcgtg attgcgccac tgtactccag cccgagtgac agagtgacat cctgtctcaa 36781 aaacaaaaaa aaatctcccc aaacctctct agttgcattc ttcccgtcac ccaactccag 36841 gattcctaca acaggaacta gaagttccag aagcctgtgt gcaaggtcca ggatcagttg 36901 ctcttccttt gcaggtactg acagacccag aagcagctaa atatgttcat ggcattgctg 36961 tacattggta cctggacttt ctggctccag ccaaagccac cctaggggag acacaccgcc 37021 tgttccccaa caccatgctc tttgcctcag aggcctgtgt gggctccaag ttctgggagc 37081 agagtgtgcg gctaggctcc tgggatcgag ggatgcagta cagccacagc atcatcacgg 37141 taagccaccc cagtctccct tcctgcaaag cagacctcag acctcttact agtttcacca 37201 aagactgaca gaagcccttc ctgtccagct ttccccagct agcctgccct tttgagcaac 37261 tctggggaac catgattccc tatcttccct ttccttcaca ggtctgcaca cctcattgcc 37321 ccttttgcaa ctactgaggc acttgcagct gcctcagact tctcagctcc ccttgagatg 37381 cctggatctt cacaccccca actccttagc tactaaggaa tgtgcccctc acagggctga 37441 cctacccaca gctgcctctc ccacatgtga cccttaccta cactctctgg ggacccccag 37501 tgttgagcct ttgtctcttt gcctttgtcc ttaccctaga acctcctgta ccatgtggtc 37561 ggctggaccg actggaacct tgccctgaac cccgaaggag gacccaattg ggtgcgtaac 37621 tttgtcgaca gtcccatcat tgtagacatc accaaggaca cgttttacaa acagcccatg 37681 ttctaccacc ttggccactt caggtgagtg gagggcgggc acccccattc cataccaggc 37741 ctatcatctc ctacatcgga tggcttacat cactctacac cacgagggag caggaaggtg 37801 ttcagggtgg aacctcggaa gaggcacacc catccccttt tgcaccatgg aggcaggaag 37861 tgactaggta gcaacagaaa accccaatgc ctgaggctgg actgcgatgc agaaaagcag 37921 ggtcagtgcc cagcagcatg gctccaggcc tagagagcca gggcagagcc tctgcaggag 37981 ttatggggtg ggtccgtggg tgggtgactt cttagatgag ggtttcatgg gaggtacccc 38041 gagggactct gaccatctgt tcccacattc agcaagttca ttcctgaggg ctcccagaga 38101 gtggggctgg ttgccagtca gaagaacgac ctggacgcag tggcactgat gcatcccgat 38161 ggctctgctg ttgtggtcgt gctaaaccgg tgagggcaat ggtgaggtct gggaagtggg 38221 ctgaagacag cgttgggggc cttggcagga tcacactctc agcttctcct ccctgctccc 38281 tagctcctct aaggatgtgc ctcttaccat caaggatcct gctgtgggct tcctggagac 38341 aatctcacct ggctactcca ttcacaccta cctgtggcgt cgccagtgat ggagcagata 38401 ctcaaggagg cactgggctc agcctgggca ttaaagggac agagtcagct cacacgctgt 38461 ctgtgactaa agagggcaca gcagggccag tgtgagctta cagcgacgta agcccagggg 38521 caatggtttg ggtgactcac tttcccctct aggtggtgcc aggggctgga ggcccctaga 38581 aaaagatcag taagccccag tgtcccccca gcccccatgc ttatgtgaac atgcgctgtg 38641 tgctgcttgc tttggaaact gggcctgggt ccaggcctag ggtgagctca ctgtccgtac 38701 aaacacaaga tcagggctga gggtaaggaa aagaagagac taggaaagct gggcccaaaa 38761 ctggagactg tttgtctttc ctggagatgc agaactgggc ccgtggagca gcagtgtcag 38821 catcagggcg gaagccttaa agcagcagcg ggtgtgccca ggcacccaga tgattcctat 38881 ggcaccagcc aggaaaaatg gcagctctta aaggagaaaa tgtttgagcc cagtcagtgt 38941 gagtggcttt attctgggtg gcagcacccc gtgtccggct gtaccaacaa cgaggaggca 39001 cgggggcctc tggaatgcat gagagtagaa aaaccagtct tgggagcgtg aggacaaatc 39061 attcctcttc atcctcctca gccatgccca gggtccgggt gcctggggcc cgagcaggcg 39121 ttgcccgctg gatggagaca atgccgctga gcaaggcgta gcccaccatg gctgccagtc 39181 ctgccagcac agataggatc tggttccggc gccggtatgg ctcctcctca gtctctgggc 39241 ctgctggtgt ctggcgttgc ggtggtacct cagctgaggg tcaaggaagg aaggtgtgtt 39301 aggagaacta gttcttggat ccctgcccac tctccccagg gctgcccctc ccatctgccc 39361 cttacctcca tcccagggga agtagagact gagaatgtgg gtacaatagg cacagaggtt 39421 gtgcagccca cgcaggtgga cctgcagctt cccactgggc agctttgcct gcagcagcag 39481 ggccaagtag ctgaagacga aggcgtccaa ggaggcaggg ctggagcaga gagagaaggg 39541 tgggatggag gagaaccact ggggtagaag gggtaaagat ggagctggag gaagagtcag 39601 ccttgggagg tgggctctgg gcagcaggcg gccaccaggg aaggacagga cacacagttc 39661 tagacctggt atggggagag atccccaggt ggcgccagct ggccctgaat agggctctat 39721 cccagggctg cataaagggc acactcagtg ccccacagct cttcaggccc ttcctgtgcc 39781 tggctgccct cccaccctac ccttttgtac ctctgagaag gctctggccc cacgcacagc 39841 cccactgtca ccagggccag tatctgtctc agggacctcc tatccagagc ctgagccagc 39901 cccagcccca gccccagctc cagctgctcc atctgaacct gtatcttctt ccaagccacc 39961 cattaccctc ttggagtcag actcacgcat ctccaaagaa gaacttttga gagcccaggc 40021 gctgagagag cagggtcaga cactcccgag cctctcggta cagctgtagg ggcgacacag 40081 gtaggcttgc agctgcggga acagtgccac ctccgcacct aagcactccc attcctggcc 40141 agcatccttg gggctcatct catacaatag cccccggtct cagagctacc tccttctcca 40201 gctcttcctc gtcctcaggc ctgtgctccc cagtcagcag ctgtagccgt tccatgtact 40261 gccgctgcat gcggccaggc aggaagaagt tgaggggaaa gggcatagcc tctgcatacc 40321 acttccgggt cacttctacg tagttcttgg tgtctatcca aaaagtatgt acctggattg 40381 ggtgggcagg aagaaacagg caggtctgag ccagtgcacc tgtctgattc aaggtgggct 40441 tctgacctcc atgctctcct gagtctctgt gtgggtctgt gtgttcccgt cccctccccg 40501 gctggccatg gatgctggga ggtctgggca cactcaccag caccgggatc aacttctcct 40561 ccaggagaga catgaaggcc agggtgtctg ccccttgctg agctgacaga tcataatcag 40621 cattgtactt ctgtggagga aatatccatg gcgtggacgc tggggagctg caagggcact 40681 tcaccaggga ggaaggagtc ctgtctggta cccccctcac tggcctctga gtgcagtgga 40741 ggtacagcaa ggaacttttc ctgccaaggc ccccttgcct gggcccagcc agtagcctgt 40801 tgctgttggc aaaaagcctg ggccttggag cccgctggcc gtcaaggtcc tgggcccatt 40861 gagaagaagg aagaaaggtt gggccgcaaa ctaggagcag ctcccagaat ttccatggaa 40921 agctggaaca atgcctgctg acagcaactt tctaacagta actttcccga cccagacacc 40981 acaaagctag cacaacggag ctcagatgca ggctaggact cggtccatgc ctcaggaacc 41041 agggaaagcc atcctcacac tccctggatc cagggaaccc acgcccaggg ccccccagct 41101 tgttccctca gtgcccagct cttggctatt tctttcactt cattccatcg cccagacacc 41161 attaccacat acacattcca tccatacccc caggtctcag cctgccctac cttcccaggc 41221 tccagtccct gttcctcagc atcccccacc acatcctgag taagctttgt ccccagataa 41281 cctcttcagc atgatcctta aatctcccta agcctcagtt tctcccctgt ggaatggggg 41341 taagaatctc tttctctgaa tgcccctgtg ttaggaaata atttagaata cttcggaaac 41401 aaaaagctct gttcacacct aagcaatcag ggcagtggcc tggccttgcc aggaacttag 41461 gcttttatct ggatcctctt tccaggcctc tcaattaatt ccccaggtcc ttaacctttg 41521 ggaaattaga aattaggaag agtgtcccac ttctgacact gtgttccctc ttggaacctg 41581 accgtcaatg ctagaagaac ccttggaaaa catgctggcc cagccctcta gttttacaaa 41641 taagggagtg cacagccctg agaggttaca tggcctgccc aagatcacgc agtcaatggc 41701 agagtaaaga gcatagccta ggcctcccca ctcctctagt aatgctcttt catcttctcc 41761 aacctggctc taagccttgt ccatcctgag ccccatatct agcccaacct agtccctgaa 41821 aacaagaagt ggcccttaga aatctctctc cagtcccact atcagaggcc aactgctgtc 41881 ttccagtctc cttcagcctg tgctcctctc cctccctgac tgacaggcag aaggtaccgt 41941 gcctctggat atccccacag tgccctgagc tgcatctctt gccgactgct ttaatacatc 42001 acagtgacat tgtgtgtgtc tctgccacca gactattgct ccttgatgct ctgggtcacc 42061 tgcatctagc atggcatata tctagtgctc aataaatgtg tattgtacgg aattgactga 42121 acttctctca ctggcagccc cctctatcca aatcacccac ctctttttga aggtgggtga 42181 tgatcttgtg tggtactgag gtgacctctc catgactggt ccaaagggca ggcagggttc 42241 ctgatactga gagaaaagta taccaaccag gaccttagct gcctatccat acgtagttgc 42301 aacacattcc tgcctatctc ctgcttctcc tcttgtacac acccttcctc acccccaagg 42361 gatactgggt acctgaaggg ctctgccagg ggttgctgat cttgtgctat atggtgcaca 42421 agaaactttt aagaaaaaag aaactttcaa gacacacttt ccaaccacga attctatctc 42481 tgctcctttt catagcaata ggttttcttt ttcttccctc cacacttaaa catccattct 42541 cttatcaccc acccccatca gactccttcc cctgtgtttc ttcagccact gctctgacca 42601 aaatttgagt gaccaaaagt ggtgtcagac ccagtgacca tttctctgcc tgcatttcac 42661 ttgaccttga agcagcaatt aatctccata atcagcatct tcttgaattt ttccctttga 42721 ggacattgct cttctacttt gtgctctggt tatcctttac aaacttttac acctctcctg 42781 aactggttaa cagtagaggg ccccaagggt ctcacgtaag ccttctttgt ttttgttttt 42841 tttcttttct tttatttttt atatattttt ttgagacgga gtcttgctct gttgcccggg 42901 ctggagtgca gttgcacgat ctcggctcac tgcaagctct gcctcccagg ttcatgctat 42961 tctcctgcct cagcctcccg agtagctggg actacaggtg cctgccacca tgcctggcta 43021 attttctgta tttttaacag agacagggat acaccatgtt agccaggatg gtctcgatct 43081 cctgacctcg tgatctgccc acctcagcct cccaaaatgc tgggattaca ggcgtgagcc 43141 actgctccca gccgtcattt ttattttatt ttatttattt ttttgagatg gagtcttgct 43201 cttttgccag gctggagtgc atggtgcgat ctcggctcat tgcaaccccc gcctcccagg 43261 ttcaagcgat tctcctgcct cagcctcctg agtggctggg actacaggtg cctgtcacca 43321 tgcctggcta attttctgta ttttagcttg agacacggtt ttcaccttgt tagccaggat 43381 ggtctcgatc tgtgagcctc gtgatctgcc tgcgtcggcc tcccaaagtg ctgggattag 43441 caggcagtga gccactgcac ccggccatca attttttttt tttttaaatg gagtcttgct 43501 ctgtcaccca ggttggagtg taacagtgca atcttggctc attgcaacct ccgcctcttg 43561 ggttcaagcg attctccagc ctcagcctcc tgagtagctg ggactacagg tgcatgccac 43621 cacacccggc tagtttttgt atttttagta gagacggggt ttcaccatgt tgtccaggat 43681 ggtctcaaac tcctgacctc aaatgatgtg cctgccttga ccttccaaag tgctgggatt 43741 agaggcgtga accacattgc ccaccgtaag ccctcttttc tacttccact ctttctgtag 43801 gtggccctac tactaacgtc catggcttta agtaccatct ttctatgtgt taatgcataa 43861 gtccagcctt gacctctctt gagcgccatc caactcagca tatctggttg gatgtctaat 43921 gagtatttca aattcaacat ggccacaact gaactaactg aactcttttt tttttttagg 43981 cagagttttg ctcttgttgc ccaggctgga gtgcaatggc gagatcttgg ctcactgcaa 44041 cctctgcctc cagggttcaa gcgattctcc tgcctcagcc tcccgagtag atgggattat 44101 aggccccgct acccggctaa tgtttttgta tttttagtag agacagggtg ttgccatatt 44161 gaccaggctg gtctgcaaat cctgacctca ggtgatcccc ctgcctcggc ctcccaaagt 44221 gctggaatta caggcgtgag tcactgcccc ggccacacaa ctgagctctt cattcccacc 44281 ccagcaccgg ttatttttcc actgttccca tctcaatggc acctccatta cccatttgca 44341 cattccaaaa gcccaggaac catggtgact tcttttccca tatccaacac aaccaatcct 44401 atcttgaatt catccacgtc ccaccacctc cccagctacc tagttccagc caccctctct 44461 ccacaacctc tgaatcaatc tttcactttt cccagcaatc cattctccac tcagcaaaat 44521 gatgataaag cacgtcacat caaggctctg cctcaattta atggcttccc attgtattta 44581 gaatcatctc caagctctca gagactatgg tcagctacaa tctggcccac cttctgttcc 44641 agccaaattt cctcacagca caaggacgtt tgcacctgct attttccaag cacgaaaccc 44701 tcgggcagat atatctggtg ctgtcaccta atttcaggtt ttaactccac tctcaccatt 44761 tcagtgagga cctaatcccc atcgcagtca ttctatcaca tagctttatt ttattttatt 44821 ttattttttt ttgagataca gtctggctct ctcacccagg ctggagtgca gtggtgtgat 44881 ctgggctcac tgcaaacttc catctcctgc gttcaagcga ttctcctgcc tcagcttccc 44941 gagtagctgg gattacaggt gtctgccatc acgcctgact aagttttgta ttttcagtag 45001 agacggggtt ttgccatgtt agccaggctg gtctcgaact ccttgacctc aagtgatcca 45061 cctgcctcag cctcccacaa tgttggattt acaggcgtga gccactgctc ccggccacat 45121 agccatgttt taagcatgta tcactatcta aaattatttt ttgtttatac gtttgtgtcg 45181 tcctgtagaa tgtaaggtca caagatcagg gacttgctca ttgcactggg tctaacacac 45241 agtgcttcaa caaacactcg ttaagatact aacgtggcag agtggggcct tgtaaacagt 45301 ttcaggaccc tgtgcttgta agagcaacgt ggtgccctcc caaggaagac agggaggatg 45361 caggagcact gcccagagat ggcgtcaggc tgcaagacat cttgaataat tcaccatcgt 45421 aacaacccag cctcaggaag agatggggca aggccagaac gaaacattag gtaagaggcg 45481 gtggacaatg ggattcccac agggcagctt ttgggcactg gacgttccct aacctgaggc 45541 tctctgaaga ggaaggttag gaatcctctc agcttcggtg ggctggactc actgtgggaa 45601 ttcaatcgcc cccccccacc cactaagggt gtgctggcgg gaaagcgctg agacgcatgc 45661 gtagttctcg cgtctggcac ccgctccctt tccaatacgc ttgcgccccg tctgtgctac 45721 ggatggtcag ggagagttgt ccgtcttcaa atggaccaat gagacttgtg gaggggctct 45781 gagtcccgcc tctggatgag tgaccgtctc ttttccaagt gaggcccgcc cgcctcccca 45841 gggctgcttt tctcgcggca cgggtggtgg ggctgcttct tgacttccgc gcctagtgcg 45901 cagggcccca ttctccagtc ccgcccacgc gcctttggag gctgcggtgg gatttccttt 45961 tgccttcggt tggggctgct gtttctcttc gccgacggta ggcattataa atatttcgcc 46021 ctttgaattt tagctctccc ctcccggcgt tcgcacttag cctttttcat cattatcatt 46081 atttttaatg gtaaggatgg tcaaatttta ctaatgaagc gattataaaa tcttcaagtc 46141 tttgtatcca cttgcttttt gagggctgga gtgggaagaa aggtatataa ttcattcatt 46201 cttcggacat gtgacaaacg ttcacggagc gcggcaacga gcgccggtgt cgcgatgcgc 46261 actggggctg cacatgggag ccaggatgga ctggactggt ccctgccctg cccgctgacg 46321 attggcaggc cactgccttt gatgagctgg gcgctataac tgccattaag ccatttgtac 46381 aattaatcac aacagtgata agagcgacaa aggagctatt cgggtggctg aggcggagga 46441 tcttttgagc ctaggagttc gagaccagcc tggggaatag agcgagatcc tgtctcaaaa 46501 aaagggggag aaaaagtgac aaagaagttg tgagggctat gagtttgcgt tagaggaggt 46561 gtgtgaggtg gggaggcggc agggggcgcg gttgttctta gagaaagtga catccgggct 46621 aattttgaaa gaataagtgt taactaggct aagcggtggg aaagagtagt gtgtgctgaa 46681 ggaagggaag aggaacctgg caagttcatg gcaggcctgt aattggatgc tggtctccct 46741 ccacacttga ctcctacagt ctgatttcta cacagcaccc aaagtgatct cttaaaaata 46801 cacatgtgat cttgtcactt cccagcacca aactctacag tgaatttcca tcttagaaca 46861 aaattcagac tccttaccat ggccaccaag accctacaca atctggcctc ccaatttcct 46921 tttccaaggt caccttttac cactatccat ctcactcaca ctgctccagc actctctatg 46981 cttttatttt tctttcttcc tctttctttc tttctttctt tttctttttc tttctttctt 47041 tccttccttc tctctctctc tctttctttt cttttctctt tctttttttt ttttttgaga 47101 tgcagtcttg ctctgtcacc caggtgtgtg atcttggctc actgtcaacc cagagcagtg 47161 ggggtgatct tggctcactg cagcctccgc cttccaggtt caagcaattc tgcctcagcc 47221 tcctgagtag cagggattag cgccatcagt cccagctaat ttttgtattt ttagtagaga 47281 tttcatcatg ttgtccaggc tggtctcgaa ctcctgactt caagtgatct gcccatctca 47341 gcctcctaaa gtgctgagat tacaggcagg agccaccaca cctggctcag tatttgtttt 47401 atttttgttg tgtttattta tttagagaca aggtcttgct ctgttgctca ggctggagtg 47461 ctgtggcagg aacacagctc actgcagcct ccactacctc aattcaggcg atcctcccac 47521 ctcagctgag actacaggtg tgcatcacca tgcctggtta attttttcgt tttaccatgt 47581 tggccaggtt tgtctcgaac tcttgggctc tgctgtcctc ctaccttagc ctcccaaagt 47641 gctaggattg taggcgtgag ccactgtgcc cagctggtgt tcagtatttg aatccatatt 47701 tcctgtagcc gcaaccaaag ttccactgtt aggtctcact ttgactttaa taattgtgtt 47761 caggctgggc acactggctc aagcctgtaa tctcagcagt ttgggaggcc gaggtgggtg 47821 gatcacatga ggtcaggtgt ttgagaccag cctgtccaac atggcgaaac cctgtctcta 47881 ctaaaaatac aaaacattct ccgatcgtag tggcgggcgc ctgtaatccc agctacttgg 47941 gaggctgaag gaggagaatt gcttgaacct gggaggcaga ggttgcagtg agcggagatc 48001 acgtcattgc actccagcct gggctacaga gcgagactct gtctcaataa gtaaataaat 48061 aaataaataa ataaataaat aaataaataa ttgtgttcgg agtcagcatc attcttcctg 48121 aagttcacca ctcctttgcc aagaacactt cctgaaacac tgacaagcag gaccttgaat 48181 aatggggtat ggttggtaac aactcactca ttcaacaaac attgagcacc tattttgtgc 48241 ttgcctctaa atgaagagct ggttgtatta tttatttttt taggtgacag ggctttccct 48301 atgttgccca ggttcgtctc aaactcctgg gctcaaaaga tcctcatctt ctcaagtggt 48361 tgaatataca cgctccagcg accatgcctg gctgaatgaa gagctttgag attttgaaga 48421 aacaggaacc atgaaatttg ctttgcaact gtttgcaacc tttaaggaag actgaaaagg 48481 cattcctgaa gcatgtgaga agcagtctgt gtgacctgat gactcagaac tgcttggaat 48541 ttagattagg acagatatga gcttaggctt cactctgcca catatttaac ttctctaagt 48601 cttagttttc ttttcttttt tttttttttt gagatggagt ctcgctctgt cacccaggct 48661 ggagtgcagt ggcacgatct tggctcactg caagctccgc ctcccgggtt cacgccatcc 48721 tcctgcctca gccacccgag tagcggggac tacagtcgca caccgccacg cctggctaaa 48781 tttttgtatt tttagtagag acgggtttca ccgtgttagc caggatggtc tcgatctcct 48841 aaccttgtga tccgcccgct tcggcctccc aaagtgttgg gattacaggt gtgagccatc 48901 gcgtctggcc tctaagtctt aattttctta tctgtaatgt agagttgtca ggctagtgca 48961 tgtaataagc actcatgaaa gactgactat tatcttgcga aaaattggaa gagatcatat 49021 gaggtacaag gaccttccat ctgcactggg ctgtacattg aatgttgtgg ttactgttga 49081 agcattggta aatcacatga atccaatggt aaaaccacac cctaaggtca gactcagtgg 49141 ctcacgcctg taattccagc actttgggaa accaaggcaa gaggattgct tgagctcagg 49201 agttcgatac cagtctaggc aacctgtttc tataaaaagt taaaaaatta gcttggtgtg 49261 gtggtgtgca cttctggtcc cagctactca ggaggctaag gtgggaggat cccttgagcc 49321 caggtggtcg aggctgcagt gagccaggat cacaccattg cattccagcc tggatgacag 49381 tgtgaggccc tgtcttaaaa aagacaaaaa caaaccaaaa aaacccacac cctagtgggt 49441 aaggggcagc agaggtccca cccaagagtg aaatcatttt tggtgtcagt aatcagggag 49501 agtaatatac cctcaggcca gaaactagag gtgagtgatg gctcacgcct ataatcccag 49561 caccttggga ggctgaggca ggtggattgc ttgacctcag gaattcgaga ccagcctggg 49621 caacatagga agaccccatc tctaaaaata aatttcaaaa attagttggg catggtggca 49681 tgtacctgta gtcccagcta ctatggaggc tgaggtggga ggatccaatt gagcctgaga 49741 ggtcaaagct gcagtgaatt gtgattgtgc caagaccctg tctctaaaat aaaataaaat 49801 tacaattaaa aactagagct gaaaggacag gttctaggtt aactggtagt agttgttcat 49861 tcatctgtat aaaaagtact tttttgagca cctactctgt gctgtgcatg gaatgaacaa 49921 atgtctttat ttgacttgaa cacagaccat agaagtaact aacagtggaa ctgggagtgg 49981 cactctccat gggacctaaa gacctaggag cctgtgatat gatacctgtc agtgtgaaag 50041 ggctggagtt tgctatctga gccagaccta cactccaaga caggtcttgc tgggatggga 50101 tcaaaattgc tgctgggtca gccgaactat ttttgtactt ctgcttttca cttatattta 50161 tgtcttccct aggtatatag ctgtcttttt ttgtttgctt gtttttctga gtcagagtct 50221 tgctctgtca ctcaggctgg agtgcagtgg cgcgtctcgg ctcactgcaa catccacctc 50281 gggttcaaac gattctcctg cctcagattc ccgagtagct aggattacag gtgcccacca 50341 ccatgcccgg ctaatttttt gtatttttaa tagagatggg gtttagtaga gacagggttc 50401 actgtgttat ccaggatggt ctccatctcc tgacctcgtg atctgcccgc ctcagcttcc 50461 caaagtgctg ggattacagg catgagccac tgcgcccggc cactggagac cccttttcta 50521 ccaaaaaaaa taaaaataaa taaataaata tatatatata tttgtaaaga cggagttctc 50581 tatgttgccc aggctggtct tgaactcctg gcctcaaggg atcctcttgt ctcagcttct 50641 taaaatgatg gggttacagg catgagccac cgtaccaggc cttagcaaac tctttttcaa 50701 gtgctataca aaggggacag aggaacgtga ttgtggccac ctagtatgcc cctattggcc 50761 actgcagtga ggcctaggtg tttgaagaga ggcacctagg gtagtggctc tcaggcacac 50821 cctaggggca ttttggaaat ttgtggggca tctttaatag agacaatgat taggaggcac 50881 tttgagaatt tagtggatgg agcccaggaa aggtagacat tctgcaacat gtagtacatt 50941 catacacaag gaagaattca ctcatttcac ctgacttaca aatataaaat caaatataaa 51001 tcagtaggac tttattataa aatattgcag aaaagtttca caagaatttt ttgtagaaaa 51061 agcccaatca gggccgggca tggtggctca tgcctgtaat ctcaggactt tgggaggccg 51121 aggcgggtga atcacctgag gtcaggagtt tgggacaagc ctggccaaca tggtaaaacc 51181 ccgtcttgac taaaaataca aaaatcagcc aggcatggtg gtgtgcgtct gtaatcccag 51241 gtactcggga ggctgaggca ggagaatcgc ttgaacccag gaggcggggt ttgcagtgag 51301 ccaagatcaa gccactgtgc tccagcctgg gtgaaagagc aagactccgt cgaaagaaag 51361 aaaggaaggg aggaagggag gaagggaggg agggagggag ggagggaaag aaagaaaaaa 51421 gagaagagag gggaggggag ggaagaggag aaaagaagga aggaaggcag gaaggaagga 51481 atccagtcga gatcgttaaa tttgttgtaa ttttgtcttt ggcacaataa acattagtcc 51541 ttattaaaac taactttact tagaaaataa aatagaaaat cctcaaggac aaatcacatt 51601 gtacaggaaa tatgaaatgt gaaataatat ccaccttgga aatttttttt ttttgaaaca 51661 gagtgttact ctgtctccca ggctggagtg cagtggcaca atcttgggtc tctgcaacct 51721 ctgactcctg ggttcaagtg attctcgtac ctcagcttcc caagtagctg ggattacagg 51781 cgtgcaccac cacacccagc taatttttgt attttcattt cagatgggat tttgccatgt 51841 tggccatgaa cacctgtcct caagctatcc actgcctcag cctcccaaag tgctgggatt 51901 ataggcatga gctaccgtgc ccagccaccc tggaaaaatt ttacaaaatg aagtataaaa 51961 agaagaaaag aggggcgggc atggtgcctt gtgcgtgtaa tcctagaact ctgcaaagcc 52021 gaggcaggag atcctttggg tttaggagtt tgagaccaac ctgcacaaca tagcaagacc 52081 ctatttctac aaaaaataca gacactaggc tgggcgtggt ggctcacgtg taatcccagc 52141 actttgggaa gctgaggcca gcagatcacg aggtcaggag atggagacca tcctggctaa 52201 catagtgaaa ccctgtctct actaaaaata caaaaaatta gccgggtgtg gtggcaggcg 52261 cctgtagtcc cagctactca ggaggctgag gcaggagaat ggcgtgaatc cgggaggcgg 52321 agcttgcagt gagccgagat cgtgccactg cactccagcc tgggagacag agcaagactc 52381 cgtctcaaaa aaaaaaaaat tttgctcagt acctggccaa aaaagaagca gctcactccc 52441 tgtacacaga ggggtaagag aaaggagatg gttgaacttc taaactcgct aaagcaggag 52501 aggcaaatgt ggaatgtgct caggaaatat ctgtgagatg aatgaatttg agggaagtaa 52561 ggtactagat aattacctgc cctacccaga acaaatcctg tgcaacgttt ccttgaagag 52621 caggaagtca ggccgggtgc tgtgctcacg cctgtaatcc cagccctttg ggaggccaaa 52681 gtgtgcgaat cacctgaggt caggagattg agaccagtct ggctaacatg gtgaaacccc 52741 atctctacta aaatacaaaa attagccggg cgtggtggtg cgtgcctgta gtcccaacta 52801 cttgggaggc tgaggcagga gaattgcttg aacctgggag gcagaggttg cggtgagctg 52861 agatcggcca ctgcactcca gactgggtga cagagtgaga caacatctca aaaaaacaaa 52921 aaaaaaagag aaaagcagga agtctggaag gggtggctac tgacatagtg aagcaactag 52981 ttcaattcta caacttgaca actacccctg tgccaggctg tctacaagga tatttagaat 53041 gtgtaagaca ttccttcaag gaactccagg aacagaggcc tgacatgttg caatgtttag 53101 tgtcaagcag tgtactagag acacattatc acactcaaac ctcacaacag ttctatgagg 53161 taggagttat cactcccctt ttatagatga aatagaggct tagagtgatt gatttactga 53221 aggtcaaaca gccagtaaat ggtgtagcaa ggattccaac cttgccgtct cactaaaact 53281 gtacaaaaaa agatacaaac aacagacaaa tagttcccag gcgcctccca agttgccagg 53341 cactgcattt acctcactga cccctttgag gttgtggcat tgcctccatt ttctaggtga 53401 ggaaataggc tgagagctgg ggttagtctg gtcatgactg tgtgtgccac tcccaccaaa 53461 tctcatttga tgtggttcat gaggcaaatg gcatggacaa cttccttcac atgtccacta 53521 agcatatggc cttttacaac actttctggt ttctgaacta ctttaaaacc tcactgtcct 53581 gtgaggaagg aagaacagtt attacaatct gcatctggaa gccaattgcc ctttagagat 53641 atggctgcaa ttgcctcact gcctgtgtca tgtgactctc cgaggccctt aatgagtaaa 53701 tgaggggtgc tgcaggggag ccaagctgac cactcccctc cctccagtcc tgccacccca 53761 ctgccagtgt cccaccctcc ttgcgcccta cacttcactg gctaataacc cccctcactt 53821 tttcctgtgt tgaaggcatc ctggataatt ccccacccac gaatggtccc tcctcatctc 53881 agagagctct ccatgcacac ctgttactgt ttctgttttt acctgtaaat atctgtgtct 53941 gacttccatg cttcatgcac ctctataggg caaagactgt gtcttaaaca tcacggtagc 54001 ctcagcatgt tgtgcaatga aggttttttt gtttttgttc tttgtttttt ttttggtatt 54061 agctttattt gtatcatttt gaaattttta tcaaaaaagc agcgtgcctg ctgtggttcc 54121 catcctctgg gatttaggaa tctttacccg attctccatc caagtctgtc tttcgtattc 54181 taggctcttc ctaaagttgt cattcacata taccctccag aattttatag ggtgtataat 54241 ctgtaacaac tcggaggaag ccaattgccc tttagaaata tggctgcaat tgcctcactt 54301 cctgtgtcat gtgactctcc tagtcatcac atgacccatc cacattggga agccagaatt 54361 acttgcagga gtaacctagt gcctatagct atggcaggta tcctgcatcc ttgttttttg 54421 tttagtggat cctctatcct tcagagactc tggaacccct gtgctcttct cctcatctag 54481 tgaccctgag gtgatggagt tttcaagtcc ttccagagag gtaagagaga gagctcccaa 54541 tcagcattgt cacagtgctt ctggaatcct ggcactggaa tttaatgaat gacagactct 54601 ctttgaatcc agggccatca tggctctttg agcaaggcac agatggaggg aggggtcgaa 54661 gttgaaatgg gtgggaagag tggtggggag catcctgatt tggggtgggc agagagttgt 54721 catcagaagg gttgcaggga gagctgcacc caggtgtctg tgggccttgt cctaatgaat 54781 gtgggagacc aggccatggg cacccaaagg cagctaagcc ctgcccggga gagtagttga 54841 ggggtggaga gggacttgct tttcagtcat tcctcattct gtcctcagga atgtcccaag 54901 ccttcgggta gggtaagcat catggctggc agcctcacag gattgcttct acttcaggca 54961 tgtcgtgggc agtcagatga gtgagtcaag gcagtgggga ggtagcacag agcctccctt 55021 ctgcctcata gtcctttggt agccttccag taagctggtg gtagactttt agtaggtgct 55081 caataaatcc ttttgagtga ctgagaccaa ctttggggtg aggatttttg aaaccgtctt 55141 cagtctctcc aaacagctgt gtccgttctc cacatccttg tcagacctca cctctgcttg 55201 tgctccctcc ctcccaggtg gtgcccctgc atccctaaaa gcttcagtac agctcggtgg 55261 tctgtgtctg caatgccaca tactgtgact cttgaccccc cgacctttcc tgccctaggt 55321 gccttcagcc gctacaagag cagaagcagt gggcattgga tggagctgag tacaggacca 55381 tacaggctaa ttgcaccggc acaggtaacc attacaccct tcaccccccg ggccaggctg 55441 ggtcctccta gaggtaaacg gtgtcagtga tcaccatgga gtttctccct gggcactgat 55501 aaccctgtgg atgtcctcag gcctgctact gatcctgcag ccagaagttc cagaaagtga 55561 agggatttgg aggggccgtg acagatgcag gtgccctcaa catccttgcc ctgtcacccc 55621 ctgcccagaa tttgctactt aaatggtact tctctgaaga agatgaggag gaaggggaca 55681 ggatgacata gagccactga cacttttctt tgccaattct tgtggaccct gacttctgcc 55741 catccctgac atttggttcc tgtcttaatg ccagtgaaat aagatttcgc cgcctatcat 55801 ctgctaactg ctacggactc aggctcagaa aggcctgcgc ttcacccagg tgccagcctc 55861 cacaggttcc aacccaggag cccaagttcc ttttggccct gactcagaca ctattaggac 55921 tggcaagtga taagcagagt cccatactct cctattgact cggactacca tatcttgatc 55981 atccttttct gtaggaatcg gatataacat catctgggta cccatggcca gctgtgactt 56041 ctccatccgc acctacacct atgcagacac ccctgatgat ttccagttgc acaacttcag 56101 cctcccagag gaagatacca agctcaaggt aggcattcta gctttttcag gccctgaggg 56161 ccctgatgtc tgggggttga gaaactgtag ggtaggtctg cttgtacaga cattttgtcc 56221 cctgctgttt tgtcctgggg gtgggagggt gggggctaat ggctgaaccg gatgcactgg 56281 ttgggctagt atgtgttcca actctgggtg cttctctctt cactaccttt gtctctagat 56341 acccctgatt caccgagccc tgcagttggc ccagcgtccc gtttcactcc ttgccagccc 56401 ctggacatca cccacttggc tcaagaccaa gggagcgggg aatgggaagg ggccactcaa 56461 gggacagccc agagacatct accaccagac ctgggccaga tacattgtga agtaagggat 56521 cagcaaggat gtgggatcag gactggcctc ccctttggcc atgctgatct gtgtcccaac 56581 cctcaacctg gttccacttc cagatctgcc tgtcctcagc tcacctttct accttctggg 56641 cctttcaaac ttggatctgt cagtcttgcc cactccatca ggcttcctgt tctctcggtc 56701 tggcccactt tcttggctgg atcactcatg acctttctct tgccaggttc ctggatgcct 56761 atgctgagca caagttacag ttctgggcag tgacagctga aaatgagcct tctgctgggc 56821 tgttgagtgg ataccccttc cagtgcctgg gcttcacccc tgaacatcag cgagacttca 56881 ttgcccgtga cctaggtcct acccttgcca acggtactca ccacaatgtc cgcctactca 56941 tgctggatga ccaacgcttg ctgctgcccc actgggcaaa ggtggtaagg cctggacctc 57001 catggtgctc cagtgacctt caaatccagc atccaaatga ttggctccca aacttagagg 57061 gatttttcta cccaactatg gatcctagag caccattccc cgggacctcc agggtgccat 57121 ggatcccaca gttgggactt gaaacctctc taggctgggg gtggtagctc atggctataa 57181 ttccagcact ttgggaaccc aaggtgggtg gatcacttga acctaaggag ttcaagatga 57241 gcctgggaaa catggtgaaa ccctaactct acaaaaaaaa aaatagaaaa gttagccggg 57301 tgtggtggtg gcacgcctat agtcccaagt attctggagg ctaaggcggg aggtttagtt 57361 gagcctagaa tttcaggctg cagtgagcta tgattgtgcc actgtactcc agcctgtgtg 57421 acagagggag accctgtctc aaaaacaaaa acaaaaaatc cctcccaaaa cctctgtagt 57481 tgcattcttc ccaccaccta attcaggatt cctacaagag gaactagaag ttccagaagc 57541 ctgtgggcag ggtccagggt gacttgttct tcctttgcag gtactgacag acccagaagc 57601 agctaagtat gttcatggta ttgctgtaca ttggtacctg gactttctgg ctccagccaa 57661 agccacccta agggagacac accacctgtt ccccaacacc atgctctttg cctcagaggc 57721 ctgtgtgggt tccaagttct gggagcagag tgtgcggcta ggctcctggg atcgagggat 57781 gcagtacagc cacagcatca tcacagtaag ccaccccagt ctcccttcct gcaaaggagg 57841 acctcagacc cattagtagt ctcaccaaag actgatagaa gcccttcctg tccagctttc 57901 cccaggtagc ctgccctttt gggcaactct ggggaaccat gattccctgt cttgcctttc 57961 cttcacaggt ctgcacacct cattgcccct tttgcaacta ctgaggcact tgcagctgcc 58021 tcagacttct cagctcccct tgagatgcct ggatcttcac acccccaact ccttagctac 58081 taaggaatgt gccctcacag ggctgaccta cccacagctg cctctcccac acgtgaccct 58141 tacctacact ctctggggac ccccagtgtt gcgcctttgt ctctttgcct ttgtccttac 58201 cctagaacct cctgtaccat gtggtcggct ggaccgactg gaacccatca ttgtagacat 58261 caccaagcac acgttttaca aacagcccat gttctaccac cttggccact tcaggtgagt 58321 ggagggcggg gcacccccat tccataccag gcctatcatc tcctacatcg gatggcttac 58381 atcactctac accacgaggg agcaggaagg tgttcagggt ggaacctcgg aagaggcaca 58441 cccatcccct tttgcaccat ggaggcagga agtgactagg tagcaacaga aaaccccaat 58501 gcctgaggct ggactgcgat gcagaaaagc agggtcagtg cccagcagca tggctccagg 58561 cctagagagc cagggcagag cctttgcagg agttatgggg tgggtccgtg ggtgggcgac 58621 ttcttagatg agggtttcat gggaggtacc ccgagggact ctgaccatct gttcccacat 58681 tcagcaagtt cattcctgag ggctcccaga gagtggggct ggttgccagt cagaagaacg 58741 acccggacgc agtggcactg atgcatcccg atggctctcc tgttgtggtc gtcctaaacc 58801 ggtgagggca atggtgaggt ctgggaagtg ggctgaagac agcgttgggg gccttggcag 58861 gatcacactc tcagcttctc ctccctgctc cctagctcct ctaaggatgt gcctcttacc 58921 atcaaggatc ctgctgtggg cttcctggag acaatctcac ctggctactc cattcacacc 58981 tacctgtggc gtcgccagtg atggagcaga tactcaagga ggcactgggc tcagcctggg 59041 cattaaaggg acagagtcag ctcacacgct gtctgtgact aaagagggca caacagggcc 59101 agcgtgagct tacagcgacg taagcccagg ggcaatggtt tgggtgactc actttcccct 59161 ctaggtggtg ccaggggctg gaggccccta gaaaaagatc agtaagcccc agtgtccccc 59221 cagcccccat gcttatgtga acatgcgctg tgtgctgctt gctttggaaa ctgggcctgg 59281 gtccaggcct agggtgagct cactgtccgt acaaacacaa gatcagggct gagggtaagg 59341 aaaagaagag actaggaaag ctgggcccaa aactggagac tgtttgtctt tcctggagat 59401 gcagaactgg gcccgtggag cagcagtgtc agcatcaggg cggaagcctt aaagcagcag 59461 cgggtgtgcc caggcaccca gatgattcct atggcaccag ccaggaaaaa tggcagctct 59521 taaaggagaa aatgtttgag cccagtcagt gtgagtggct ttattctggg tggcagcacc 59581 ccgtgtccgg ctgtaccaac aacgaggagg cacgggggcc tctggaatgc atgagagtag 59641 aaaaaccagt cttgggagcg tgaggacaaa tcattcctct tcatcctcct cagccatgcc 59701 cagggtccgg gtgcctgggg cccgagcagg cgttgcccgc tggatggaga caatgccgct 59761 gagcaaggcg tagcccacca tggctgccag tcctgccagc acagatagga tctggttccg 59821 gcgccggtat ggctcctcct cagtctctgg gcctgctggt gtctggcgtt gcggtggtac 59881 ctcagctgag ggtcaaggaa ggaaggtgtg ttaggagaac tagttcttgg atccctgccc 59941 actctcccca gggctgcccc tcccatctgc cccttacctc catcccaggg gaagtagaga 60001 ctgagaatgt gggtacaata ggcacagagg ttgtgcagcc cacgcaggtg gacctgcagc 60061 ttcccactgg gcagctttgc ctgcagcagc agggccaagt agctgaagac gaaggcgtcc 60121 aaggaggcag ggctggagca gagagagaag ggtgggatgg aggagaacca ctggggtaga 60181 aggggtaaag atggagctgg aggaagagtc agccttggga ggtgggctct gggcagcagg 60241 cggccaccag ggaaggacag gacacacagt tctagacctg gtatggggag agatccccag 60301 gtggcgccag ctggccctga atagggctct atcccagggc tgcataaagg gcacattcag 60361 tgccccacag ctcttcaggc ccctcctgtg cctggctgcc ctcccaccct acccttttgt 60421 acctctgaga aggctctggc ccccacacag tcacactgtc actagggcca gtttctatcc 60481 cagggacctc ctatccagag cctgagccag ccccagcccc agccccagct ccagctgctc 60541 catctgaacc tgtatcttct tccaagccac ccattaccct cttggagtca gactcacgca 60601 tctccaaaga agaacttttg agagcccagg cgctgagaga gcagggtcag acactcccga 60661 gcctctcggt acagctgtag gggcgacaca ggtaggcttg cagctgtggg aacagtgcca 60721 cctccccacc taagcactcc cattcctggc cagcatcctt ggggctcatc tcatacaata 60781 gcccccggtc tcagagctac ctccttctcc agctcttcct cgtcctcagg cctgtgctcc 60841 ccagtcagca gctgtagccg ttccatgtac tgccgctgca tgcggccagg caggaagaag 60901 ttgaggggaa agggcatagc ctctgcatac cacttccggg tcacttccac gtagttcttg 60961 gtgtctatcc aaaaagtatg tacctggatt gggtgggcag gaagaaacag gtaggtctga 61021 gccagtgcac ctgtctgatt caaggtgggc ttctgacccc catgctttcc tgagcctgtg 61081 tgtgggtctg tgtgttcccg aaccctcccc ggctggccat ggatgctggg aggtctgggc 61141 acactcacca gcaccgggag caacttctcc tccaggagag acatgaaggc cagggtgtct 61201 gccccttgcc gagctgacag atcataatca gcattgtact tctgtggagg aaatatccat 61261 ggcgtggaca ctagggagct gcaagggcac ttcaccaggg aggaaggagt cctgtctggt 61321 acccccctca ctggcctctg agtgcagtgg aggtacagca aggaactttt cctgccaagg 61381 cccccttgcc tgggcccagc cagtagcctg ttgctgttgg cgaaaagcct gggccttgga 61441 gcctcctggc cgtgaaggtc cagcgcccaa tgcagggaag gaaggaaggc tccgccgcaa 61501 actaggagca gctcccagaa tttccatgga aagctggaac aacgcccgct gacggcaact 61561 ttctaacagt aacttccccg acccagacac cacaaagcta gcacaacgga gctcagatgc 61621 aggctaggac tcggtccatg cctcaggaac cagagaaagc catcctcaca ctccctggat 61681 ccagggaacc cacgcccagg gccccccagc ttgttccctc agtgcccagc tcttggctat 61741 ttctttcact tcattccatc gctcagacac cattaccaca tacacattcc acccataccc 61801 ccaggtctca gcctgcccta ccttcccagg ctccagtccc tgttcctcag catccccctc 61861 cacatcctga gtaagctttg tccccagata acctcttcag catgatcctt aaatctccct 61921 aagcctcagt ttctcccctg tggaatgggg gtaagaatct ctttctctga atgcccctgt 61981 gttaggaaat aatttagaat actttggaaa ctggaaaagc tctgttcaca cctaagcaat 62041 cagggcagtg gcctcggctc tgccaggaac tttggctttt atctggatcc tctctttcca 62101 ggcctctcaa ttaattcccc aggtcctcaa cctttgggaa gttagaaatg aggaagagtg 62161 tcctacttct gacactgttc cctcttggaa cctgaccgtc aatgctagaa gaacccttgg 62221 aaaacatgct ggcccagccc tctagtttta caaataaggg agtgcacagc cctgagaggt 62281 tacatggcct gcccgagatc acatagtcaa tggcagagta aagagcatag cctaggcctc 62341 cccactcctc tagtaatgct ctttcatctt ctccaacctg gctctaagcc ttgtccatcc 62401 tgagccccat atctagccca acctagtccc tgaaaacagg aagtggccct tagaaatctc 62461 tctccagtcc caccatcaga ggccaactgc tgtcttccac tctccttcag cctgtgctcc 62521 tctccctccc tgcctcacag tgccctaagt ttcatctctt gccgactgct ttaatacatc 62581 acagtgacat tgtgtgtgtc tctgccacaa gactgttgct ccttgatgct ctgggtcacc 62641 tgcatctagc atggcatata tctggtgctc aataaatgtg tattgtatag aattgactga 62701 acttctctca ctggcagccc cctctatcca agtcacctac ctcttttcga aggtgggtga 62761 tgatcttgtg tggaactgag atgacctctc catgactggt ccgaagggca ggcagagttc 62821 ctgatattga gagggaagta taccaaccag acccttagct gcctagtcat acatagttgc 62881 aacacattcc tgcctatttc ttgcttctcc ccttgtacac acccttcctt acccccaagg 62941 gatactgggt acctgaaggg ctctgccagg ggttgctgat cttgtgtacc ttcagtggag 63001 caccagtaaa tctggcatag gtctgcaggg aaggaagcag aagtcagaga ggcagagcca 63061 tgtcccccac agtggtatca agaagagaaa taacatttat tgaatacctt atgtgccatt 63121 ccctatactt agtatcttag tctcgaaaaa aaagggggag tcactaccat ttctatttta 63181 caaatggaca acacagggct cagagagatt ccggacatgt ctgtgattac aggacagcca 63241 ggaaaatcct ttcctctatc tcctctcttg ccatctacct ggtgaacatc tatcctcaga 63301 agcttttcct aaccagtatc cctcttcttg gaagcactga taatgccctt ctctttggca 63361 gtatctgtgg tctgttccta ctttaactta tatgtgaata ttttattata tttatttgct 63421 tacctatctg cctttctgta aactcgaagg gaggaaccaa attttattca tctttgcctg 63481 gcaatgttta atatttggaa gaaatctaat agatcctctt taaagcaagg agtcaatgaa 63541 taaatgaagc aaaggaagtc tgattgctaa gctctttata caaagcctgg ctgtgagtcg 63601 tttggtctaa caaacaatag cctcagtaaa tgctgttaag tgaggaggaa gaggagaagg 63661 gggctgagga gaaggcagga attccaagct agagtaggtc cctgagacag tccagcctag 63721 taaacctccg aatggagaaa aaaaatgccc aaagagaaag gatctccgca aggtcacaca 63781 gccagtgaat ggatgagcaa ggactagaac ccatcacctg aactcccaac caggcgtctt 63841 ttcattgctg cattaagcct agaaaactac agcatagggc actgaggcca tatcggccac 63901 gtacgctctt tgactgcctc agtttccccc actgcgcccg tgctgagtag ccctggcaag 63961 gtttggatgc ttggctagtt cggccccctc cccagtcatc agggcacagc agagggcgcc 64021 ggcgccaccc ctcaccagca cggccaggct gtccaggtcc actgacggca gcccccagcc 64081 ccctgaccag cagaacagct ccatgggcgc cgccatcttg cccaccctct gtcccggaaa 64141 cacttccttg tgtgcttctg ccctcccctg cctgggcccg ccccccgcca ccgcccctcc 64201 gatcgttgcg gtcagggggc ctggggagat ccccggggag gcgaggcttc ttctggcccg 64261 actggcagct gaactgcggg ggactgggcc gcgggcctcg ggggagggcg gccgccggcc 64321 catccagagg tggcccacgt agcgggacag cgctgtcggc ccggcgcgcc tcggagtgtc 64381 acggcgcctc gtccaagtgg agccccgaac ccctgaaggc gcggcaggct ctggagagcg 64441 gggtcttgtg cgcctgggcc aggtctgggg gctcttgcca aactgcacgt ggcctgtact 64501 gctccagggc cccttggggc tcgtccccga gcggggactg cgggggggtc ccccgagcag 64561 catgttttcc acagcgcgtt atgtttggag cgggccctgc gccgcctgtc gccatggaaa 64621 caaaacaggg gcggtggcgg cggccggagc ggaggccggg ctggggcttg ggtgggggag 64681 gggaagagag gctcgcaggc tgtcgcttag gtgacgggaa ctcaggcgcc cctctgcttc 64741 atccgggtca cggcccgtcc gctagtaccc acagtgttcc acagtctggt ccttggctcc 64801 tcgcctgtac ccctggtctt ctgcgcctgt ccctggtgtc cctttcctct tttttggttt 64861 cttcactctg acctcactga ccactgcttt agattctccc ttcagttccc gtcagacgct 64921 tccagactcc caagctttcc tacgaatgag ggaaaatgga gaaacaggca cttgtcaggg 64981 gacccccacc ctaataaaga gcacttgctc cgccagaaca gcaaaattca tgccatgtgg 65041 gcatccctgg gcactatagc aagctagttg cggccactcc cttggcatcc tttcctgcca 65101 gctgtggaat aatgcccact gtctagcact gcccctgcca ggggttcttg ccttccacaa 65161 tcgtggcttc cagaaaacag tggcattcgg tagcgctgtg tgccgagacc cccaacaatg 65221 atgactgcgg agagggaggc acctgggggg aggatcatta gggagaggta gaaagcaggg 65281 aggcctccag gattctttcc cagtgcccct ggttcccaga gctgatgatg cctccagggt 65341 gattggcagc tctttgtttc agccccccct cacccgcctg ctggcccccc ctccaattct 65401 gtccctccgc cccccggctc ctgctctctc cgcctagcct tttcccctcc cagctgcctg 65461 cctgccaggg gtagtgagcc ggctgagagg catggagacg caggaacttc ggggggccct 65521 ggctcttctc ctcctttgct ttttcacatc tgccagtcag gatctgcagg gtaagcctgt 65581 ctccatcctc ttagaccgct ctctgcttct tccccatttg ccctcagccc aagtagcaga 65641 gaacatgtgg gcaaggggag aggggaagag tccagaaatt gagccagagg aaaacttaag 65701 actgcctaga gttggtgaaa ttatggcctg gtggagagga gtgggcaccg gagagtagtg 65761 ggggagctgg aacaggacag ggtcaggcat gaggccaggg cagaggactc aggaactgga 65821 tgctcaggct gcctggctgg ggtggttcct gagcatctgt aggcacccta gggtctggga 65881 gagcagcctg aaggtgggcg tactgtaggt ccctgggctt cgtccccctg ggctcctggc 65941 tgcccaagga cagggcgggg gtggggacca agaagcctgg gctctcccgg aggtctggtg 66001 gtgggatggg ccgatggatg tgggagaggt agctggagca gatgacacag gcatagattc 66061 tttactggct ggagggaaaa ccacagattg gccaggacac aggaggcaga ggagctgggt 66121 cctacattcc cgtgggagat agcccagatg gtgggagggt tagagtcctc agtgggcagc 66181 tgttcttatc cagagcgagt gtgctgcgcc cctaggatgg gaggagggga actagggtgt 66241 ggaacgagcc aaacccagga gtggaaagag aagctgcctt ccatttctac gttgtggaca 66301 ccaggtgcca ctcctgtggg ggatcagcac agcatctcct ttgcgccacc tggtgggggc 66361 atctcaaatt tgtggggtgc tgtttctgtg tttccaagtg agtctcaaac ctgcaggttc 66421 ctggaaggta gtaataggca ataatgggga aaggaaggca cagtgcctct gtcctctgag 66481 agcaacaagg gttgaggttt ttttttgttt tcgttttgtc tttttgagat ggagtttcac 66541 tcttgttgcc caggctggag tgcaatggcg cgatctcggc tcactgcaac ctccacctct 66601 caggttcaag cgagtctcct gcctcagcct cctgagtagc tgggattaca ggtgcccgcc 66661 accacgcccg gctaggtggt ttttttttct tttttgttgt tgtttttgta tttttagtag 66721 acacggggtt tcaccgtgtt ggccaggctg gtctcgaact cctgacctta ggcgatccac 66781 ctgcctcaac ctcccaaggt gttgggatta caggtgcgag ccaccatgcc cagccaaggg 66841 ttgaattttg aggtgactaa cgctgcctgg tggtgggggt agggggtgat gtgggctcgg 66901 ggagggcata ccttagcatt tgatcccatt ctctaacctt gataacccct gcctgaccag 66961 taattgacct gctgactgtg ggcgagtctc ggcagatggt agctgtggca gagaagatcc 67021 ggacagcctt gctcactgct ggggacatct acctcttatc caccttccgc ctgcccccca 67081 agcagggtgg tgtcctcttt ggcctctatt ctcgccaaga caacactcga tggctggagg 67141 cctctgttgt aggcaagatc aacaaaggtg actggtgggc atctcctttc ttgcaatggt 67201 gacctcctta agcactctcc ctcatcccta cttcccaagt gcttcctcat tctccttggc 67261 ctccattagg gaccaaggtc ccatcaggct gtgcccagac atcaggggca cctggtatgg 67321 ggtgcttata aggtgcttta ggagcccatg taccaaacaa tagggtgagg cttgagtcct 67381 gatatttata gggatgggag gttgtacttg gggatgcctg gaaggagggt gaggcatgtc 67441 tctgtgcctc agcggtgata cagaaaggtc tttcagatcc ctggaacagt cagtggccac 67501 aaaaccagct gtcaccctag gagaggatac agtcatctga taccaatgaa gactaagaaa 67561 gtatttgtct ctaaaccaat tattttttta aatggagtct tgctttgtca cctaggctgg 67621 agtgcagtgg tacaatctca gctcactgca acctctgcct cctgggttca agcaattctc 67681 ctatctcagc ctcctgagta gctgggacta caggtgccca ccaccatgcc tggctaactt 67741 ttgtattttt agtagaaaca gggttttgcc atgttgacca ggctgttctc gaacttctga 67801 cctcaagtga tctgcccacc ttggccttct aaagtgctgg gattacaggt gtgagccacc 67861 gtgcctggcc tctaaagcaa tttcaaaaac atttattgtg tacctattaa atacaaggtg 67921 ttgagagcca catagtagaa ttagggatag gggccctaga gatgtggtat ctagcagagt 67981 tgatccaagg ctcagctggg aggcaggggg tgggggatag cgcccatact gatctcccct 68041 cctgggcagt actggtgcga taccagcggg aggatggcaa agtccacgcc gtgaacctac 68101 agcaagcggg cctggctgat gggcgcacac acacagttct cctgcgactc cgaggtccct 68161 ccagacccag ccctgcccta catctctacg tggactgcaa actgggtgac caacatgcag 68221 gccttccagc actggccccc attcctccag cggaggtcga tgggctggag attaggactg 68281 gacagaaggc gtatttgagg atgcaggtga gccgggagga gcctctgagt tctgtggaaa 68341 tagagtttgc accagctggg gaaggggttg gtagaccttc ccaccttact ctgctgctat 68401 ccccccaggg ctttgtggaa tctatgaaaa ttattctggg tgggtccatg gcccgggtag 68461 gagccctgag tgagtgtcca ttccaagggg acgagtccat ccacagtgca ggtaacacag 68521 agacttgttt gctgacattg gaccacggat cctgtggcct ttggtgactc ctgtcttctt 68581 gatctccctc tcccttaaaa cccattcctt tggctcacct gttcctaggg tttctaaccc 68641 tgttattcca aatctttcac ctgactccac aatctccaat acacccatcc gagaaaaaaa 68701 gtgatctaag agaagaattg gttaattgct tggccttggc tgaccaagag atactggtct 68761 cgagtatttt tttttttttt ggtgatggag tcttgtcctg ttgcccaggc cggagtgcag 68821 tggtgcaatc tgggtcactg cagtctccac tgagttcaag tgattctcct gcctcagcct 68881 cccaagcagc tgggattaca ggcggccctc caccatgcct agctaatttt gcatttttag 68941 tagagatggg atttcaccac gttggccagg ctggtctcaa acacctaatc tcaagtgatg 69001 cacccacctc ggcctctcaa agtgctggga ttacaggcgt aagccaccgc gcccggcctg 69061 gtctcgagtc ttttatagtt ctcactggca gctgtcacca gcaatttctc tgagtgttgc 69121 ccatctgcct ggtctctctg acagggatgt tcagaagctc tcacctaaat aaaagaccca 69181 cctttcccag atatttgagg gagagctctt gaagaaggga atgggatggc gggtgtggtg 69241 gctcacacct gtaatcccag cactttggga ggctgaggtg ggctgatccc tcaaggtccg 69301 gagttcaaga ccagcctggc caacactgtg aaacctcctc tctactaaaa aatacaaaaa 69361 attagctggg catggtggtg ggcacctgta gtcccagcta cttgggaggc ggaggcagga 69421 gagttgcttg aacccaggag gtggaggttg tggtgagcag agatcacgcc actgcattcc 69481 agcctgggaa acagagcaag actccgactc aaaaaaaaaa aaaatgggga tatactgggg 69541 ccctggccct gctttgggtc catcccttct gccactacca tgcctaggaa ccaggaggat 69601 ttgggttcta acttcctgtg aagcaactcc cttagagggc cttttgcccc atagaaggag 69661 ctggcactgc ttgtctgcca gctctgccct cccagcatcc agcaccccat ctttattctg 69721 gggctccagc cctgtccctg tcctcacctt ccttcctcct tctcaccaac caggcctctc 69781 ttcacttcac ctcacccctc tgactatgtt ttcttctcct ctccagtgac caatgcactg 69841 cactccattc taggtgagta ggccacactg aagcggaagc ggggagcggg gaggaggccc 69901 caggctctgg cagctgcctg aaactaagtc ctcttcagtc aggaatgtag taggtttaaa 69961 ggcaggggtg ggcagccgtg gcaggtactg gcttattgcc catggagggc ccaggactgg 70021 tgctccagta ctgaacccct cacaccctgg gtcccgacag gggagcagac caaggcgctg 70081 gtcacccaac tcaccctctt caaccagatc ctggtggagc tgcgggatga tatacgagac 70141 caggtttggg tgggctggcg aagggtggca ctgattctgg ggtagggtgg cagatgtcaa 70201 gtgctgactc ctccccatcc ttctccaggt gaaggaaatg tccctgatcc gaaacaccat 70261 tatggagtgt caggtgtgcg gtgagtggga gagcagggga ggctccacat gaccgtgcca 70321 cgttcccacc attggctttg gctttccctt ctggtggctt aaatagtgac caccgggtag 70381 ctctgactgt gtccacccct caggcttcca tgagcagcgt tcccactgca gccccaatcc 70441 ctgcttccga ggtgtggact gcatggaagt gtacgagtac ccaggctacc gctgtgggcc 70501 ctgcccccct ggcctgcagg gcaacggcac ccactgcagt gacatcaatg aggtgaggga 70561 ggtcagagcc cagaagggta cagaaaactg gggtgaggat gtcaggaggc acccaagagg 70621 gtgggataaa tgctggtccg gaggagagga atctggagtt taggagaggt cagaggcaag 70681 agaaatgcaa gatgggagag acagaaggcc tggggcaaag actgaaggcc atacagggaa 70741 ggagctgggg aaactgcagg ggaggcttga gatggcgacc agtggcatgg ggagggagag 70801 agggaacccc gagggaagtg gggtggggac caggagcaca aggcagttgt gtggggagag 70861 ctgcacaaag gggagacctg gagcaatggt tcctggatca caggcaggga cctgagtttc 70921 ccagagggcg gcctgacctc tgccttctca tctggtcccc agtgtgctca cgctgacccc 70981 tgtttcccgg gctccagctg catcaacacc atgcccggct tccactgtga ggcctgtcct 71041 cgagggtaca agggcacaca ggtgtctggt gtgggcattg actatgcccg ggccagcaaa 71101 caggtcagac tgggtagtgt gtgtggacaa gggatcttgg ccttgtagag gccaggggct 71161 tctgggttgg acatgggacc cttgtgatga atgggacatg gatggctttg catcggcttt 71221 gggtctgggc ttaatgttga tgttcacaga taaggccata ccagtgtcct ctgctgtatc 71281 tgaggagacc caccataggt ctggttccac ccaaagtctg ccccttcagg tctgcaatga 71341 catcgatgaa tgcaacgatg gcaacaatgg tggctgtgac ccaaactcca tctgcaccaa 71401 cactgtggtg agctgaatat cctgagtgta ttctggggtg gtgggaatgg taaaacccca 71461 acccccgccc cttcttacct tcaaatttcc tgctgccttt cctcctcccc agggcactac 71521 acagattagg tgttcacatg gacttggttt ggaatactgg tttgcctctt gctagctctt 71581 tagcaggtta tttaactttt ctgagcctca gcttcctcat ctgaaaactg aggctgttat 71641 ccaccctgaa aggttgtcat gataacctaa aggagatact gatatgcctg gcataactga 71701 gtgcccagcc cacactccgc aggtggagca gtgctggtcc tgggtgcctg gtgggctcac 71761 cctccttcct aagatgctgc ctctccactc ttagggctct ttcaagtgtg gtccctgccg 71821 cctgggtttc ctgggcaacc agagccaggg ctgcctccca gcccggacct gccacagccc 71881 agcccacagc ccctgccaca tccatgctca ctgtctcttt gaacgcaatg gtgcagtgtc 71941 ctgccaggtg agctaggctt caggcgtgga aggaaaaggg agggtctggg gaaaggagta 72001 gggctatgtt tagggcctgg gttgggggtc ttcataggag agaaggtggg cctgggccca 72061 ggaactgttt ggtggggaga ataggacctg aagcagggaa aatataggga ggaggggagc 72121 cagaccaaac tgctagctcc taccctttgt gttgccctag tgtaacgtgg gctgggctgg 72181 gaatgggaac gtgtgtggga ctgacacaga catcgatggc tacccagacc aagcactgcc 72241 ctgcatggac aacaacaaac actgcaaaca ggtgcaggga gcaggcgggc aagggggcgt 72301 agtggggagc ccaagctggg tcaggccaga actcatccat cctcttcccc tgaactttag 72361 gacaactgcc ttttgacacc caactctggg caggaagatg ctgataatga tggtgtgggg 72421 gaccagtgtg atgatgatgc tgatggggat gggatcaaga atgttgaggt gactcccaga 72481 ctgccctgcc ccttgaagct cctctcccct cctcctgtcc tctctgtgcc tcacttacct 72541 catctggcag ctctctagta agggccaaat actccaaatc agggaggcaa aaacctcgtg 72601 cccaggacaa ggaggctggg tgggtgggac tgtactgagc agtttgtcca tcacaagggt 72661 gtgatcttag aaaaggatac agagacaagg taggtgcaag atgacaagtg atttagggga 72721 gacctgaccc ttcccctccc accctctgcc caggacaact gccggctgtt ccccaacaaa 72781 gaccagcaga actcagatac agattcattt ggtgatgcct gtgacaattg ccccaacgtt 72841 cccaacaatg accagaagga cacagatggc aatggggaag gagatgcctg tgacaacgac 72901 gtggatgggg atggtgcagg cctggggctg aaggggtggc tgggggacct gtgagaattt 72961 ggatcaggtg gggatgaagc agggaagcta ggaagtctct gtgaaatagg gaggcaggct 73021 tgtggacgtt ggcctgggtg aggagagatt acctgcagca gatgtcaata ggaatgtgag 73081 gtagggcgta gtgttaggca gagtgtggac tagagggtga gacaagaaac aggcagattt 73141 cctggccagt tgtcctctgg gtggggagac aaagttcggg actttcacca acctagaaga 73201 gagaatatgg catgttctag taacaacctt gtgctaccca tgtcttctag gcatccccaa 73261 tggattggac aattgcccta aagtccccaa cccactacag acagacaggg atgaggacgg 73321 ggtgggagat gcttgcgaca gctgccctga aatgagcaat cctacccagg tacagggaga 73381 tggtaaggac aggggaggga tgagggtact gatggatgaa gccccagccc tttggatgga 73441 aagtggtcag atcaccctct tcagagttat caagaggaga tggtgagaac aggtccctct 73501 ctctcagaca gatgcagaca gcgacctggt gggggatgtc tgtgatacta atgaagacag 73561 gtaaggtctt ggtcaagaga cgcaaggtct ttcttttttt tgtctttctg agacggcttg 73621 ctctgtcacc taggctggag tacagtggca cgatcttggc tcactgcaac ctccgtctcc 73681 cagggtcaag tgattctcat gcctcaacct ccctgagtgg ctaggattac aggcatgtgc 73741 taccaagccc agctaattct tgtattttca gtagagacag ggtttcacca tgttggccag 73801 gctggtctcg aactccttac ctcaggtgat ccgccagcct ctgcctccca aagtgctgtg 73861 attacaggtg tgagccactg tgccagcgaa atgcaaggtc ttatagggga tattttactt 73921 tcctctagta tatccttttt tttttttttt tgagacggag tcttgctctg tcgcccaggc 73981 tggagtacag tggcatgatc tcggctcact gcaagctcca cctcccgagt tcacgccatt 74041 cttctgcctc agcctcccaa gtagctggga ctacaggcgc ctgccaccac gcccggctaa 74101 ttttttgtgt ttttagtaga gacggggttt cactgtgtta accaggatgg tctcgatctc 74161 ctgatctcgt gatttgcctg cctcagcctc ccaaagtgct gggattacag gcgtgagcca 74221 ccgtgcccgg cctccttttt ttttttgaga tggagtcttg ctctgtcact caagctggag 74281 tgcagtgata tcggctcact gcgacctcca cctccagggt tcaagcgatt ttcctgcctc 74341 agcctcccag tagctggatt acaggcgcgt gctaccaagc tcagctaatt tttgtgtttt 74401 tagtagagac agggtttcat cgtgttggcc aggctggtct cgaactcctg acctcaggtg 74461 atctgcccgc cttggcctcc caaagtactg ggaatacagg catgagccac tgtgcccagg 74521 ccatagtata ttctaaattc cttctatgat ttagccttaa ttctctattg ctattcagca 74581 ggaatttatt cttacagtca tccctccatt cctactgcca gaccttcacc tcacctggct 74641 gactgccagg ggtctgattg tatggagagc aggctccaga gactcccagg cagaagcgaa 74701 gggaaggaca aggagtacct ttaaatcctt tatttggtac ctgttcttct gattagcgat 74761 ggggatgggc atcaggacac caaggacaac tgcccacagc tgccaaatag ctcccagctg 74821 gactctgata acgatggact tggagatgag tgtgatgggg atgatgacaa tgatggcatc 74881 ccagattatg tgcctcctgg tcccgataac tgccgcctgg tacccaatcc caatcagaag 74941 gactcagatg gtaagcctgc ggacccagag cacgttagac tggtgttgcc tttgcccagg 75001 tggaggcagc aagccctgtt gggaagtgag gaagggcaag gtgggaaaga tgtcaggaat 75061 gcaggcccaa cagatggtat tgttgcctaa tggcagtggc cagggccttc ctgagcaccc 75121 agcctcactc tgcccaggca atggcgttgg tgatgtgtgt gaggatgact ttgacaatga 75181 tgctgtggtc gaccccctgg atgtgtgtcc tgaaagtgca gaggtaacgc ttacggattt 75241 tcgggcctat cagaccgtcg tcctggatcc // LOCUS AF023455 2486 bp mRNA PRI 06-NOV-1997 DEFINITION Homo sapiens protein phosphatase with EF-hands-1 (PPEF-1) mRNA, complete cds. ACCESSION AF023455 NID g2586410 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2486) AUTHORS Sherman,P.M., Sun,H., Macke,J.P., Williams,J., Smallwood,P.M. and Nathans,J. TITLE Identification and characterization of a conserved family of protein serine/threonine phosphatases homologous to drosophila retinal degeneration C JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (21), 11639-11644 (1997) MEDLINE 97471020 REFERENCE 2 (bases 1 to 2486) AUTHORS Sherman,P.M., Sun,H., Macke,J.P., Williams,J., Smallwood,P.M. and Nathans,J. TITLE Direct Submission JOURNAL Submitted (09-SEP-1997) Mol. Bio. & Gen., Johns Hopkins School of Medicine/HHMI, 725 N. Wolfe St., Baltimore, MD 21205, USA FEATURES Location/Qualifiers source 1..2486 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="retina" gene 1..2486 /gene="PPEF-1" CDS 83..2044 /gene="PPEF-1" /codon_start=1 /product="protein phosphatase with EF-hands-1" /db_xref="PID:g2586411" /translation="MGCSSSSTKTRRSDTSLRAALIIQNWYRGYKARLKARQHYALTI FQSIEYADEQGQMQLSTFFSFMLENYTHIHKEELELRNQSLESEQDMRDRWDYVDSID VPDSYNGPRLQFPLTCTDIDLLLEAFKEQQILHAHYVLEVLFETKKVLKQMPNFTHIQ TSPSKEVTICGDLHGKLDDLFLIFYKNGLPSERNPYVFNGDFVDRGKNSIEILMILCV SFLVYPNDLHLNRGNHEDFMMNLRYGFTKEILHKYKLHGKRILQILEEFYAWLPIGTI VDNEILVIHGGISETTDLNLLHRVERNKMKSVLIPPTETNRDHDTDSKHNKVGVTFNA HGRIKTNGSPTEHLTEHEWEQIIDILWSDPRGKNGCFPNTCRGGGCYFGPDVTSKILN KYQLKMLIRSHECKPEGYEICHDGKVVTIFSASNYYEEGSNRGAYIKLCSGTTPRFFQ YQVTKATCFQPLRQRVDTMENSAIKILRERVISRKSDLTRAFQLQDHRKSGKLSVSQW AFCMENILGLNLPWRSLSSNLVNIDQNGNVEYMSSFQNIRIEKPVQEAHSTLVETLYR YRSDLEIIFNAIDTDHSGLISVEEFRAMWKLFSSHYNVHIDDSQVNKLANIMDLNKDG SIDFNEFLKAFYVVHRYEDLMKPDVTNLG" BASE COUNT 807 a 505 c 521 g 653 t ORIGIN 1 cagcttaaag ggaggcactt ttcacactct gtcttaaaat cagaagaaga attcatgaac 61 acatatgatt tagatagaag tcatgggatg cagcagttct tcaacgaaaa ccaggagatc 121 tgacacatca ctgagagctg cgttgatcat ccagaactgg taccgaggtt acaaagctcg 181 actgaaggcc agacaacact atgccctcac catcttccag tccatcgaat atgctgatga 241 acaaggccaa atgcagttat ccaccttctt ttccttcatg ttggaaaact acacacatat 301 acataaggaa gagctagaat taagaaatca gtctcttgaa agcgaacagg acatgaggga 361 tagatgggat tatgtggact cgatagatgt cccagactcc tataatggtc ctcggctaca 421 atttcctctc acttgtacgg atattgattt acttcttgag gccttcaagg aacaacagat 481 acttcatgcc cattatgtct tagaggtgct atttgaaacc aagaaagtcc tgaagcaaat 541 gccgaatttc actcacatac aaacttctcc ctccaaagag gtaacaatct gtggtgattt 601 gcatgggaaa ctggatgatc tttttttgat cttctacaag aatggtctcc cctcagagag 661 gaacccgtat gtttttaatg gtgactttgt agatcgagga aagaattcca tagagatcct 721 aatgatcctg tgtgtgagtt ttcttgtcta ccccaatgac ctgcacttga acagagggaa 781 ccacgaagat tttatgatga atctgaggta tggcttcacg aaagaaattt tgcataaata 841 taagctacat ggaaaaagaa tcttacaaat cttggaagaa ttctatgcct ggctcccaat 901 cggtacaatc gttgacaatg aaatcctggt catccatggt gggatatcag agaccacaga 961 cttgaattta ctccaccgtg tagagaggaa caagatgaaa tctgtgctga taccaccaac 1021 ggaaacaaac agagaccatg acactgactc gaagcacaat aaagtaggtg tgacttttaa 1081 tgcacatgga agaatcaaaa caaatggatc tcctactgaa cacttaacag agcatgaatg 1141 ggaacagatt attgatattc tgtggagtga tcccagaggc aaaaatggct gttttccaaa 1201 tacgtgccga ggagggggct gctattttgg accagatgtt acttccaaga ttcttaataa 1261 ataccagttg aagatgctca tcaggtctca tgaatgtaag cccgaagggt atgaaatctg 1321 tcatgatggg aaggtggtga ctatattttc tgcttctaat tattatgaag aaggcagcaa 1381 tcgaggagct tacatcaaac tatgttctgg tacaactcct cgatttttcc agtaccaagt 1441 aactaaagca acgtgctttc agcctcttcg ccaaagagtg gatactatgg aaaacagcgc 1501 catcaagata ttaagagaga gagtgatttc acgaaaaagt gaccttactc gtgctttcca 1561 acttcaagac cacagaaaat caggaaaact ttctgtgagc cagtgggctt tttgcatgga 1621 gaacattttg gggctgaact taccatggag atccctcagt tcgaatctgg taaacataga 1681 ccaaaatgga aacgttgaat acatgtccag cttccagaat atccgcattg aaaaacctgt 1741 acaagaggct cattctactc tagttgaaac tctgtacaga tacagatctg acctggaaat 1801 catatttaat gccattgaca ctgatcactc aggcctgatc tccgtggaag aatttcgtgc 1861 catgtggaaa ctttttagtt ctcactacaa tgttcacatt gatgattccc aagtcaataa 1921 gcttgccaac ataatggact tgaacaaaga tggaagcatt gactttaatg agtttttaaa 1981 ggctttctat gtagtgcata gatatgaaga cttgatgaaa cctgatgtca ccaaccttgg 2041 ctaaacacaa atgagagctt ccctcaggct ccctgaaaca gctaggccca aatcacaagt 2101 acagtccttt ccaacacccc tgaaattcat agtcagtagc agagaaaagc agatcccaat 2161 tcatcccaca aacagatgca tagtatgggt tttggaagtc cctagcaagc tgttattggt 2221 aagattaggt taaatgtcag taataggatt tggtttcagc attagtacct acatattgcc 2281 agtgagaaac tgggttggac ctagtggtgt tgtcgtgagt gccacctaac caggaggcca 2341 gagcggtttg aaaacatcct gaaaggaact catacagcac aagagaaaac tactaagctt 2401 gacatctgtg agtgactgag ggagacagga ggaataccag gttattcatg gaataaagtc 2461 tttccatctt taaaaaaaaa aaaaaa // LOCUS AF023456 3440 bp mRNA PRI 06-NOV-1997 DEFINITION Homo sapiens protein phosphatase with EF-hands-2 long form (PPEF-2) mRNA, complete cds. ACCESSION AF023456 NID g2586412 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3440) AUTHORS Sherman,P.M., Sun,H., Macke,J.P., Williams,J., Smallwood,P.M. and Nathans,J. TITLE Identification and characterization of a conserved family of protein serine/threonine phosphatases homologous to drosophila retinal degeneration C JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (21), 11639-11644 (1997) MEDLINE 97471020 REFERENCE 2 (bases 1 to 3440) AUTHORS Sherman,P.M., Sun,H., Macke,J.P., Williams,J., Smallwood,P.M. and Nathans,J. TITLE Direct Submission JOURNAL Submitted (09-SEP-1997) Mol. Bio. & Gen., Johns Hopkins School of Medicine/HHMI, 725 N. Wolfe St., Baltimore, MD 21205, USA FEATURES Location/Qualifiers source 1..3440 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="4" /tissue_type="retina" gene 1..3440 /gene="PPEF-2" CDS 368..2629 /gene="PPEF-2" /note="PPEF-2(L)" /codon_start=1 /product="protein phosphatase with EF-hands-2 long form" /db_xref="PID:g2586413" /translation="MGSGTSTQHHFAFQNAERAFKAAALIQRWYRRYVARLEMRRRCT WSIFQSIEYAGQQDQVKLHDFFSYLMDHFIPSSHNDRDFLTRIFTEDRFAQDSEMKKC SDYESIEVPDSYTGPSLSFPLLPDHATALVEAFRLKQQLHARYVLNLLYETKKHLVQL PNINRVSTCYSEEITVCGDLHGQLDDLIFIFYKNGLPSPERSYVFNGDFVDRGKDSVE ILMILFAFMLVYPKEFHLNRGNHEDHMVNLRYGFTKEVMNKYKVHGKEILRTLQDVFC WLPLATLIDEKVLILHGGVSDITDLELLDKIERSKIVSTMRCKTRQKSEKQMEEKRRA NQKSSAQGPIPWFLPESRSLPSSPLRLGSYKAQKTSRSSSIPCSGSLDGRELSRQVRS SVELELERCRQQAGLLVTGEKEEPSRSASEADSEAGELRKPTQEEWRQVVDILWSDPM AQEGCKANTIRGGGCYFGPDVTQQLLQKYNLQFLIRSHECKPEGYEFCHNRKVLTIFS ASNYYEVGSNRGAYVKLGPALTPHIVQYQANKVTHTLTMRQRISRVEESALRALREKL FAHSSDLLSEFKKHDADKVGLITLSDWAAAVESVLHLGLPWRMLRPQLVNSSADNMLE YKSWLKNLAKEQLSRENIQSSLLETLYRNRSNLETIFRIIDSDHSGFISLDEFRQTWK LFSSHMNIDITDDCICDLARSIDFNKDGHIDINEFLEAFRLVEKSCPEGDASECPQAT NAKDSGCSSPGAH" variation 725..727 /gene="PPEF-2" /note="Arg/Ser polymorphism" BASE COUNT 932 a 834 c 814 g 860 t ORIGIN 1 gaattccgga tttgaagggg gcctgatctg actgacgtca gggggtgcct actccttcga 61 ggcaagaagc ctgaactttc cttcctgtca ctgtgaatac ttggccaggg ctgtgaatgt 121 caccattgct tttcaggaag aagacatggc agactagggc tgcaccttct ttatggctct 181 gtaggtctga agaatcagat cagcctggga tcctcagcca actctgaaga caggatggga 241 tcacgcctgc atgttaaaga gatttctgcc ttgttgtgtc tcatgtaaat aaaaaaccca 301 gcaaagaagc aaacagctca ctgtcctctg gatctgctgc gtcctgcagg agcattgcgc 361 ttaaactatg ggaagcggca cctccaccca acatcatttt gctttccaga atgcagagag 421 agccttcaag gcagcagccc tgatccagag atggtaccgg cgctacgtgg cccgcctgga 481 gatgaggcgg cgttgcacct ggagcatctt ccagtctata gaatatgctg ggcagcaaga 541 ccaagtcaag ctccatgact tcttcagcta tctcatggat cacttcatcc ccagcagcca 601 caacgacagg gacttcctga cccgcatatt cactgaggac agattcgccc aggactccga 661 gatgaagaaa tgcagtgact atgaatccat agaggtaccc gacagttaca cggggccaag 721 cctctccttc ccactcctgc ctgaccatgc aactgccctg gtagaagcat tcagactgaa 781 acaacagctc catgctcgct acgtcttgaa ccttttgtat gaaaccaaga aacatctggt 841 acagctgcca aacatcaacc gggtctcaac ctgttacagc gaggagatca cagtgtgtgg 901 agacttacat ggccaattgg atgacttaat ctttatattt tataagaatg gcctcccatc 961 gccagaacgg tcatatgtgt tcaacggtga ctttgtggat cgaggcaagg attcagtaga 1021 gatcctgatg attctttttg ccttcatgct ggtttacccc aaagagttcc atcttaacag 1081 aggaaaccat gaggaccata tggtgaactt acgatatggc ttcaccaagg aagtgatgaa 1141 taaatacaag gtacacggga aggaaatact aagaaccctg caagatgttt tctgttggct 1201 tccactggcc actctgatag atgagaaagt tctaattctt catggtgggg tgtcagacat 1261 aactgatctg gagcttttgg acaaaataga gaggagcaag atagtttcca ccatgaggtg 1321 caaaacgaga cagaagagtg agaagcagat ggaggagaag agaagagcca accagaagag 1381 ctctgcacag ggacccatcc catggtttct ccccgaaagc cgctctcttc cctcttcgcc 1441 ccttcggctt ggctcctaca aggcccagaa aaccagcagg tcctccagca tcccctgcag 1501 cggttccctg gacgggcggg agctctcccg gcaggtgcgg agctccgtgg aactggagct 1561 agagcggtgc cggcagcaag caggcctcct ggtgaccgga gagaaagagg agccctcccg 1621 ctcagcctca gaagcagact ctgaagccgg agagctgcgg aagcccactc aggaggagtg 1681 gaggcaggtt gtagatatcc tgtggagtga tcccatggct caagagggct gcaaggccaa 1741 cactattcga ggaggaggct gttattttgg gcctgatgtg acacaacagt tgctacaaaa 1801 atacaacctg caattcctga tccgttcaca tgaatgcaaa cctgaaggct atgaattctg 1861 tcacaaccgc aaggtattaa caatcttttc tgcctccaac tactatgaag ttggcagcaa 1921 cagaggggcc tatgtcaaac tggggccagc cctgacccca catatcgtgc agtatcaagc 1981 taacaaggtg acccacacac tcaccatgag gcaaaggatt agcagagtgg aggagtcggc 2041 tctgagagct ctgagggaga agctgtttgc tcattcttca gatcttctca gtgaatttaa 2101 gaagcatgat gcagataaag tcggtttaat caccttgagt gactgggcag cagcggtgga 2161 gtctgtgttg cacctaggac tgccatggcg gatgctgagg ccacagctgg tgaacagctc 2221 agcagacaac atgctggagt acaagtcttg gctgaagaac ttggccaagg aacaactgag 2281 tcgcgagaac atacaatcaa gtttgctgga aacattgtat cgaaaccgat ccaacctaga 2341 gaccattttt aggatcatag acagtgatca ttcagggttc atctcactgg acgagttcag 2401 gcagacctgg aagctgttca gctctcacat gaatatcgac attacagatg actgcatctg 2461 tgaccttgct cggagcattg atttcaacaa agatggccac attgatatca atgagttcct 2521 ggaggccttc cgccttgtgg agaaatcctg cccagagggc gatgcctcag aatgcccaca 2581 agctacaaat gctaaagaca gtggctgcag cagtccaggt gcacactaag aacagcctgg 2641 tcttcatcac ccaaagtgcc tcataggcaa tgctcagctt ctcactagac tatctccctt 2701 attctccatg tgaaacttta tgctgaaaat ttacctatcc atatgcatca gaatcacctg 2761 tgtatttcag tgtggagggg tgggttgggg tgttgtgtat gtatgtgttt taagtatatg 2821 agtgccccaa ccccacctca caatcttcac aaagtagaac ttaggtatag tgttttcaaa 2881 ttctaaagtc cacttcagtt aagaaccact gacaatgtaa ctctctcatt gttttcattt 2941 tatacgtttt tttgagatgg agtttctctc ttgttgccca ggctggagtg cattggcgcg 3001 atctcggctc accgcaaagt ccgcctccca ggcgattctc ctgcctcagc ctcctgagta 3061 gctgggatta caggcatgca ccaccacacc aggataattt tgtattttta gtagacacag 3121 ggtttctcca tgttggtcag gctggtcttg aactcccaac ctcaggtgat ccaccctcct 3181 cagcctccca aagtgctggg attacaggca tgagccaccg cacccagcct attttatact 3241 ttttatttat tgtctttaac aatgtctatt ggtaaagcaa agttattttt aaaaattgta 3301 ttgtaattcc atgacccaag catatggatt ttcttcatta tttacttttt cttacttgtt 3361 actgtagtgt ttatataatt ttatgttcta cttttaaaaa aataaattaa tatctaattg 3421 taaaaaaaaa aaaaaaaaaa // LOCUS AF023462 1545 bp mRNA PRI 26-OCT-1997 DEFINITION Homo sapiens peroxisomal phytanoyl-CoA alpha-hydroxylase (PAHX) mRNA, complete cds. ACCESSION AF023462 NID g2564670 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1545) AUTHORS Mihalik,S.J., Morrell,J.C., Kim,D., Sachsteder,K.A., Watkins,P.A. and Gould,S.J. TITLE Identification of PAHX, a Refsum disease gene JOURNAL Nature Genet. (1997) In press REFERENCE 2 (bases 1 to 1545) AUTHORS Gould,S.J. TITLE Direct Submission JOURNAL Submitted (08-SEP-1997) Biological Chemistry, The Johns Hopkins University School of Medicine, 725 North Wolfe Street, Baltimore, MD 21205, USA FEATURES Location/Qualifiers source 1..1545 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="10" /map="between D10S226 and D10S223" gene 1..1545 /note="Refsum disease gene" /gene="PAHX" CDS 28..1044 /gene="PAHX" /note="PTS2-targeted peroxisomal matrix protein" /codon_start=1 /product="peroxisomal phytanoyl-CoA alpha-hydroxylase" /db_xref="PID:g2564671" /translation="MEQLRAAARLQIVLGHLGRPSAGAVVAHPTSGTISSASFHPQQF QYTLDNNVLTLEQRKFYEENGFLVIKNLVPDADIQRFRNEFEKICRKEVKPLGLTVMR DVTISKSEYAPSEKMITKVQDFQEDKELFRYCTLPEILKYVECFTGPNIMAMHTMLIN KPPDSGKKTSRHPLHQDLHYFPFRPSDLIVCAWTAMEHISRNNGCLVVLPGTHKGSLK PHDYPKWEGGVNKMFHGIQDYEENKARVHLVMEKGDTVFFHPLLIHGSGQNKTQGFRK AISCHFASADCHYIDVKGTSQENIEKEVVGIAHKFFGAENSVNLKDIWMFRARLVKGE RTNL" BASE COUNT 455 a 341 c 351 g 398 t ORIGIN 1 ggggtggggg ttccccgcgc cgcagccatg gagcagcttc gcgccgccgc ccgtctgcag 61 attgttctgg gccacctcgg ccgcccctcg gccggggctg tcgtagctca tcccacttca 121 gggactattt cctctgccag tttccatcct caacaattcc agtatactct ggataataat 181 gttctaaccc tggaacagag aaaattttat gaagaaaatg ggtttctagt aatcaaaaat 241 cttgtacctg atgccgatat tcaacgcttt cggaatgagt ttgaaaaaat ctgcagaaag 301 gaggtgaaac cattaggatt aacagtaatg agagatgtga ccatttcgaa atccgaatat 361 gctccaagtg agaagatgat cacgaaggtc caggatttcc aggaagataa ggagctcttc 421 agatactgca ctctccccga gattctgaaa tatgtggagt gcttcactgg acctaatatt 481 atggccatgc acacaatgtt gataaacaaa cctccagatt ctggcaagaa gacgtcccgt 541 caccccctgc accaggacct gcactatttc cccttcaggc ccagcgatct catcgtttgc 601 gcctggacgg cgatggagca catcagccgg aacaacggct gtctggttgt gctcccaggc 661 acgcacaagg gctccctgaa gccccacgat taccccaagt gggagggggg agttaacaaa 721 atgttccacg ggatccagga ctacgaggaa aacaaggccc gggtgcacct ggtgatggag 781 aagggcgaca ctgttttctt ccatcctttg ctcatccacg gatctggtca gaataaaacc 841 cagggattcc ggaaggcaat ttcctgccat ttcgccagtg ccgattgcca ctacattgac 901 gtgaagggca ccagtcaaga aaacatcgag aaggaagttg taggaatagc acataaattc 961 tttggagctg aaaatagcgt gaacttgaag gatatttgga tgtttcgagc tcgacttgtg 1021 aaaggagaaa gaaccaatct ttgaaatagc catctgctat aactctttca acagaaaacc 1081 aaaaccaaac gaaatgtcta aggaaaatgt tttcttaatg agatgatgta accttttcta 1141 tcacttgtta aaagcagaaa acatgtatca ggtacttaat tgcatagagt tagttttgca 1201 gcacaatggt gttgctttaa tggaaaaaaa aaacagtaaa agtgaaatat tactgtttta 1261 aggaaaacta atttagggtg gcagccaata aaggtggttg gtgtctaatt taagtgttaa 1321 atcaatttct ttcattcagt tagctcttta cccaagaaga agtgaatgat ttggagctta 1381 gggtatgttt tgtatcccct ttctgataaa cccattccct accaatttta tgtcataaga 1441 gatttttttc ccccaaatct agaacaatgt ataatacatt cacatctagt caagggcata 1501 ggaacggtgt catggagtcc aaataaagtg gatattcctg ctcgg // LOCUS AF023466 1084 bp mRNA PRI 16-OCT-1997 DEFINITION Homo sapiens putative glycine-N-acyltransferase mRNA, complete cds. ACCESSION AF023466 NID g2554940 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1084) AUTHORS van der Westhuizen,F.H., Chambliss,K.L., Hinson,D., Gibson,K.M., de Vries,W.N., Pretorius,P.J. and Erasmus,E. TITLE Direct Submission JOURNAL Submitted (09-SEP-1997) Biochemistry and Microbiology, Potchefstroom University for Christian Higher Education, Hoffman, Potchefstroom, North West 2520, South Africa FEATURES Location/Qualifiers source 1..1084 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="I.M.A.G.E. Consortium (LLNL) clone ID 124365" CDS 88..576 /EC_number="2.3.1.13" /function="conjugates glycine with acyl-CoA substrates in liver and kidney mitochondria" /note="aralkyl-CoA N-acyltransferase" /codon_start=1 /product="putative glycine-N-acyltransferase" /db_xref="PID:g2554941" /translation="MLPLQGAQMLQMLEKSLRKSLPASLKVYGTVFHINHGNPFNLKA VVDKWPDFNTVVVCPQEQDMTDDLDHYTNTYQIYSKDPQNCQEFLGSPELINWKQHLQ IQSSQPSLNEAIQNLAAIKSFKVKQTQRILYMAAETAKELTPFLLKSKILSPSGGKPK AM" BASE COUNT 321 a 279 c 177 g 307 t ORIGIN 1 gcacgagctc ccagaagggt gttgctcatc gtttcttccc ggaaacatct gcagagacta 61 gcttttcagg ctaaggtatc ctccatgatg ttaccattgc aaggtgccca gatgctgcag 121 atgctggaga aatccttgag gaagagcctc ccagcatcct taaaggttta tggaactgtc 181 tttcacataa accatggaaa tccattcaat ctgaaggctg tggtggacaa gtggcctgat 241 tttaatacag tggttgtctg ccctcaggag caggatatga cagatgacct tgatcactat 301 accaatactt accaaatcta ctccaaagat ccccaaaact gtcaggaatt ccttggatca 361 ccagaactca tcaactggaa acagcattta cagattcaaa gttcacagcc tagcctgaat 421 gaggctatac aaaatcttgc agccattaag tccttcaaag tcaaacaaac acaacgcatt 481 ctctatatgg cagctgaaac agccaaggaa ctgactcctt tcctgctgaa atcaaagatt 541 ttatctccca gtggtggcaa acccaaggcc atgtgagttt gataaaatcc agtctgtacc 601 actcacactt ctcaagtaac ctcccaactt ctctccctgc attcactctg gctttcctac 661 aatcaatgtt gtacaaagaa atcagaatta catttttaaa aaataactca gaacatgttt 721 ctctcctgct taaaagactc caccatctcc cttctcatct agaataaacc aggcctctgg 781 ctaccacttg gacctttatt gcccttcctg ctttcttcct ccaacagctc tgaccttcac 841 tctgttcctc tttcatcaca agctcatcta tattccagga cttcacattt gttatttctt 901 ctgtctactc tgtctctcaa gacaatagag gtcacagcaa atacaaacct ttatactaat 961 ggattaagtt ttcttcttgg accctccata gcaaccaaga gatgttaaaa ctctaatcca 1021 tggatgttac ccatgctcac ttggtgaata aattctggca ttttggtggt aaaaaaaaaa 1081 aaaa // LOCUS AF023476 5067 bp mRNA PRI 13-DEC-1997 DEFINITION Homo sapiens meltrin precursor (ADAM12) mRNA, complete cds. ACCESSION AF023476 NID g2677838 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5067) AUTHORS Gilpin,B.J., Loechel,F., Mattei,M.G., Engvall,E., Albrechtsen,R. and Wewer,U.M. TITLE A novel, secreted form of human ADAM-12 (meltrin-alpha) provokes myogenesis in vivo JOURNAL J. Biol. Chem. (1998) In press REFERENCE 2 (bases 1 to 5067) AUTHORS Gilpin,B.J., Loechel,F., Mattei,M.G., Engvall,E., Albrechtsen,R. and Wewer,U.M. TITLE Direct Submission JOURNAL Submitted (06-SEP-1997) Institute of Molecular Pathology, University of Copenhagen, Frederik V's Vej 11, Copenhagen 2100, Denmark FEATURES Location/Qualifiers source 1..5067 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="10" /map="10q26 between D10S216 and D10S575" /tissue_type="placenta" /clone_lib="Clontech #HL50416" gene 1..5067 /gene="ADAM12" 5'UTR 1..311 /gene="ADAM12" sig_peptide 312..395 /gene="ADAM12" CDS 312..3041 /gene="ADAM12" /note="ADAM 12-L" /codon_start=1 /product="meltrin precursor" /db_xref="PID:g2677839" /translation="MAARPLPVSPARALLLALAGALLAPCEARGVSLWNEGRADEVVS ASVRSGDLWIPVKSFDSKNHPEVLNIRLQRESKELIINLERNEGLIASSFTETHYLQD GTDVSLARNYTVILGHCYYHGHVRGYSDSAVSLSTCSGLRGLIVFENESYVLEPMKSA TNRYKLFPAKKLKSVRGSCGSHHNTPNLAAKNVFPPPSQTWARRHKRETLKATKYVEL VIVADNREFQRQGKDLEKVKQRLIEIANHVDKFYRPLNIRIVLVGVEVWNDMDKCSVS QDPFTSLHEFLDWRKMKLLPRKSHDNAQLVSGVYFQGTTIGMAPIMSMCTADQSGGIV MDHSDNPLGAAVTLAHELGHNFGMNHDTLDRGCSCQMAVEKGGCIMNASTGYPFPMVF SSCSRKDLETSLEKGMGVCLFNLPEVRESFGGQKCGNRFVEEGEECDCGEPEECMNRC CNATTCTLKPDAVCAHGLCCEDCQLKPAGTACRDSSNSCDLPEFCTGASPHCPANVYL HDGHSCQDVDGYCYNGICQTHEQQCVTLWGPGAKPAPGICFERVNSAGDPYGNCGKVS KSSFAKCEMRDAKCGKIQCQGGASRPVIGTNAVSIETNIPLQQGGRILCRGTHVYLGD DMPDPGLVLAGTKCADGKICLNRQCQNISVFGVHECAMQCHGRGVCNNRKNCHCEAHW APPFCDKFGFGGSTDSGPIRQADNQGLTIGILVTILCLLAAGFVVYLKRKTLIRLLFT NKKTTIEKLRCVRPSRPPRGFQPCQAHLGHLGKGLMRKPPDSYPPKDNPRRLLQCQNV DISRPLNGLNVPQPQSTQRVLPPLHRAPRAPSVPARPLPAKPALRQAQGTCKPNPPQK PLPADPLARTTRLTHALARTPGQWETGLRLAPLRPAPQYPHQVPRSTHTAYIK" mat_peptide 396..3038 /gene="ADAM12" /note="meltrin" misc_feature 396..929 /gene="ADAM12" /note="encodes pro-domain" misc_feature 930..1559 /gene="ADAM12" /note="encodes metalloprotease domain" misc_feature 1560..1847 /gene="ADAM12" /note="encodes disintegrin domain" misc_feature 1848..2431 /gene="ADAM12" /note="encodes cysteine rich domain" misc_feature 2332..2494 /gene="ADAM12" /note="encodes transmembrane domain" misc_feature 2495..3038 /gene="ADAM12" /note="encodes cytoplasmic tail" 3'UTR 3042..5067 /gene="ADAM12" polyA_signal 4934..4939 /gene="ADAM12" polyA_signal 5029..5034 /gene="ADAM12" polyA_site 5049 /gene="ADAM12" BASE COUNT 1341 a 1263 c 1253 g 1210 t ORIGIN 1 ctcttcacta acgctcttcc tagtccccgg gccaactcgg acagtttgct catttattgc 61 aacggtcaag gctggcttgt gccagaacgg cgcgcgcgcg acgcacgcac acacacgggg 121 ggaaactttt ttaaaaatga aaggctagaa gagctcagcg gcggcgcggg ccgtgcgcga 181 gggctccgga gctgactcgc cgaggcagga aatccctccg gtcgcgacgc ccggccccgc 241 tcggcgcccg cgtgggatgg tgcagcgctc gccgccgggc ccgagagctg ctgcactgaa 301 ggccggcgac gatggcagcg cgcccgctgc ccgtgtcccc cgcccgcgcc ctcctgctcg 361 ccctggccgg tgctctgctc gcgccctgcg aggcccgagg ggtgagctta tggaacgaag 421 gaagagctga tgaagttgtc agtgcctctg ttcggagtgg ggacctctgg atcccagtga 481 agagcttcga ctccaagaat catccagaag tgctgaatat tcgactacaa cgggaaagca 541 aagaactgat cataaatctg gaaagaaatg aaggtctcat tgccagcagt ttcacggaaa 601 cccactatct gcaagacggt actgatgtct ccctcgctcg aaattacacg gtaattctgg 661 gtcactgtta ctaccatgga catgtacggg gatattctga ttcagcagtc agtctcagca 721 cgtgttctgg tctcagggga cttattgtgt ttgaaaatga aagctatgtc ttagaaccaa 781 tgaaaagtgc aaccaacaga tacaaactct tcccagcgaa gaagctgaaa agcgtccggg 841 gatcatgtgg atcacatcac aacacaccaa acctcgctgc aaagaatgtg tttccaccac 901 cctctcagac atgggcaaga aggcataaaa gagagaccct caaggcaact aagtatgtgg 961 agctggtgat cgtggcagac aaccgagagt ttcagaggca aggaaaagat ctggaaaaag 1021 ttaagcagcg attaatagag attgctaatc acgttgacaa gttttacaga ccactgaaca 1081 ttcggatcgt gttggtaggc gtggaagtgt ggaatgacat ggacaaatgc tctgtaagtc 1141 aggacccatt caccagcctc catgaatttc tggactggag gaagatgaag cttctacctc 1201 gcaaatccca tgacaatgcg cagcttgtca gtggggttta tttccaaggg accaccatcg 1261 gcatggcccc aatcatgagc atgtgcacgg cagaccagtc tgggggaatt gtcatggacc 1321 attcagacaa tccccttggt gcagccgtga ccctggcaca tgagctgggc cacaatttcg 1381 ggatgaatca tgacacactg gacaggggct gtagctgtca aatggcggtt gagaaaggag 1441 gctgcatcat gaacgcttcc accgggtacc catttcccat ggtgttcagc agttgcagca 1501 ggaaggactt ggagaccagc ctggagaaag gaatgggggt gtgcctgttt aacctgccgg 1561 aagtcaggga gtctttcggg ggccagaagt gtgggaacag atttgtggaa gaaggagagg 1621 agtgtgactg tggggagcca gaggaatgta tgaatcgctg ctgcaatgcc accacctgta 1681 ccctgaagcc ggacgctgtg tgcgcacatg ggctgtgctg tgaagactgc cagctgaagc 1741 ctgcaggaac agcgtgcagg gactccagca actcctgtga cctcccagag ttctgcacag 1801 gggccagccc tcactgccca gccaacgtgt acctgcacga tgggcactca tgtcaggatg 1861 tggacggcta ctgctacaat ggcatctgcc agactcacga gcagcagtgt gtcacactct 1921 ggggaccagg tgctaaacct gcccctggga tctgctttga gagagtcaat tctgcaggtg 1981 atccttatgg caactgtggc aaagtctcga agagttcctt tgccaaatgc gagatgagag 2041 atgctaaatg tggaaaaatc cagtgtcaag gaggtgccag ccggccagtc attggtacca 2101 atgccgtttc catagaaaca aacatccccc tgcagcaagg aggccggatt ctgtgccggg 2161 ggacccacgt gtacttgggc gatgacatgc cggacccagg gcttgtgctt gcaggcacaa 2221 agtgtgcaga tggaaaaatc tgcctgaatc gtcaatgtca aaatattagt gtctttgggg 2281 ttcacgagtg tgcaatgcag tgccacggca gaggggtgtg caacaacagg aagaactgcc 2341 actgcgaggc ccactgggca cctcccttct gtgacaagtt tggctttgga ggaagcacag 2401 acagcggccc catccggcaa gcagataacc aaggtttaac cataggaatt ctggtgacca 2461 tcctgtgtct tcttgctgcc ggatttgtgg tttatctcaa aaggaagacc ttgatacgac 2521 tgctgtttac aaataagaag accaccattg aaaaactaag gtgtgtgcgc ccttcccggc 2581 caccccgtgg cttccaaccc tgtcaggctc acctcggcca ccttggaaaa ggcctgatga 2641 ggaagccgcc agattcctac ccaccgaagg acaatcccag gagattgctg cagtgtcaga 2701 atgttgacat cagcagaccc ctcaacggcc tgaatgtccc tcagccccag tcaactcagc 2761 gagtgcttcc tcccctccac cgggccccac gtgcacctag cgtccctgcc agacccctgc 2821 cagccaagcc tgcacttagg caggcccagg ggacctgtaa gccaaacccc cctcagaagc 2881 ctctgcctgc agatcctctg gccagaacaa ctcggctcac tcatgccttg gccaggaccc 2941 caggacaatg ggagactggg ctccgcctgg cacccctcag acctgctcca caatatccac 3001 accaagtgcc cagatccacc cacaccgcct atattaagtg agaagccgac accttttttc 3061 aacagtgaag acagaagttt gcactatctt tcagctccag ttggagtttt ttgtaccaac 3121 ttttaggatt ttttttaatg tttaaaacat cattactata agaactttga gctactgccg 3181 tcagtgctgt gctgtgctat ggtgctctgt ctacttgcac aggtacttgt aaattattaa 3241 tttatgcaga atgttgatta cagtgcagtg cgctgtagta ggcattttta ccatcactga 3301 gttttccatg gcaggaaggc ttgttgtgct tttagtattt tagtgaactt gaaatatcct 3361 gcttgatggg attctggaca ggatgtgttt gctttctgat caaggcctta ttggaaagca 3421 gtcccccaac tacccccagc tgtgcttatg gtaccagatg cagctcaaga gatcccaagt 3481 agaatctcag ttgattttct ggattcccca tctcaggcca gagccaaggg gcttcaggtc 3541 caggctgtgt ttggctttca gggaggccct gtgccccttg acaactggca ggcaggctcc 3601 cagggacacc tgggagaaat ctggcttctg gccaggaagc tttggtgaga acctgggttg 3661 cagacaggaa tcttaaggtg tagccacacc aggatagaga ctggaacact agacaagcca 3721 gaacttgacc ctgagctgac cagccgtgag catgtttgga aggggtctgt agtgtcactc 3781 aaggcggtgc ttgatagaaa tgccaagcac ttctttttct cgctgtcctt tctagagcac 3841 tgccaccagt aggttattta gcttgggaaa ggtggtgttt ctgtaagaaa cctactgccc 3901 aggcactgca aaccgccacc tccctatact gcttggagct gagcaaatca ccacaaactg 3961 taatacaatg atcctgtatt cagacagatg aggactttcc atgggaccac aactattttc 4021 agatgtgaac cattaaccag atctagtcaa tcaagtctgt ttactgcaag gttcaactta 4081 ttaacaatta ggcagactct ttatgcttgc aaaaactaca accaatggaa tgtgatgttc 4141 atgggtatag ttcatgtctg ctatcattat tcgtagatat tggacaaaga accttctcta 4201 tggggcatcc tctttttcca acttggctgc aggaatcttt aaaagatgct tttaacagag 4261 tctgaaccta tttcttaaac acttgcaacc tacctgttga gcatcacaga atgtgataag 4321 gaaatcaact tgcttatcaa cttcctaaat attatgagat gtggcttggg cagcatcccc 4381 ttgaactctt cactcttcaa atgcctgact agggagccat gtttcacaag gtctttaaag 4441 tgactaatgg catgagaaat acaaaaatac tcagataagg taaaatgcca tgatgcctct 4501 gtcttctgga ctggttttca cattagaaga caattgacaa cagttacata attcactctg 4561 agtgttttat gagaaagcct tcttttgggg tcaacagttt tcctatgctt tgaaacagaa 4621 aaatatgtac caagaatctt ggtttgcctt ccagaaaaca aaactgcatt tcactttccc 4681 ggtgttcccc actgtatcta ggcaacatag tattcatgac tatggataaa ctaaacacgt 4741 gacacaaaca cacacaaaag ggaacccagc tctaatacat tccaactcgt atagcatgca 4801 tctgtttatt ctatagttat taagttcttt aaaatgtaaa gccatgctgg aaaataatac 4861 tgctgagata catacagaat tactgtaact gattacactt ggtaattgta ctaaagccaa 4921 acatatatat actattaaaa aggtttacag aattttatgg tgcattacgt gggcattgtc 4981 tttttagatg cccaaatcct tagatctggc atgttagccc ttcctccaat tataagagga 5041 tatgaaccaa aaaaaaaaaa aaaaaaa // LOCUS AF023611 839 bp mRNA PRI 28-OCT-1997 DEFINITION Homo sapiens Dim1p homolog (hdim1+) mRNA, complete cds. ACCESSION AF023611 NID g2565274 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 839) AUTHORS Larin,D., Ross,B.M. and Gilliam,T.C. TITLE Human homologue of the S. pombe Dim1p gene JOURNAL Unpublished REFERENCE 2 (bases 1 to 839) AUTHORS Larin,D., Ross,B.M. and Gilliam,T.C. TITLE Direct Submission JOURNAL Submitted (09-SEP-1997) Psychiatry, Columbia University, 722 West 168th St., NYSPI Unit 58, New York, NY 10032, USA FEATURES Location/Qualifiers source 1..839 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Clontech Human Liver Matchmaker" /tissue_type="liver" gene 1..839 /gene="hdim1+" CDS 127..555 /gene="hdim1+" /note="similar to the S. pombe Dim1p protein encoded by GenBank Accession Number AF001214" /codon_start=1 /product="Dim1p homolog" /db_xref="PID:g2565275" /translation="MSYMLPHLHNGWQVDQAILSEEDRVVVIRFGHDWDPTCMKMDEV LYSIAEKVKNFAVIYLVDITEVPDFNKMYELYDPCTVMFFFRNKHIMIDLGTGNNNKI NWAMEDKQEMVDIIETVYRGARKGRGLVVSPKDYSTKYRY" BASE COUNT 201 a 206 c 245 g 187 t ORIGIN 1 gcgggaccgg atttcgtccg tgggcccggg ggcggcgggg gccggggagt gaggggccgg 61 ctgagcccac ctcgctgggc cctccctggc gccccgcctt gggcggcggc gagcgcgcgg 121 gccgccatgt cgtacatgct cccgcacctg cacaacggct ggcaggtgga ccaggccatc 181 ctctcggagg aggaccgcgt ggtcgtcatc cgcttcggcc acgactggga tcctacgtgc 241 atgaagatgg acgaggtcct gtacagcatc gccgagaagg ttaaaaattt tgcagttatt 301 tatcttgtgg atattacaga agtgcctgac ttcaacaaaa tgtatgagtt atacgatcca 361 tgtactgtca tgtttttctt caggaacaag cacatcatga ttgacttggg gactggcaac 421 aacaacaaga ttaactgggc catggaggac aagcaggaga tggtggacat catcgagacg 481 gtgtaccgcg gggcccgcaa aggccgcggc ctggtggtgt cccccaagga ctactccacc 541 aagtaccgct actgaggcgc cctcagtctg cgcggataaa tgtcgtggag ccctttttgt 601 atggaaacgt tttaagctat ttaaagcctt tggaaaatac aggaagctcc agggctggag 661 cacctctgag atggaattga taacatggtc ttaactcacc gaaataaaca agcacgtggt 721 gagaggagca ggcctacttg tttgttctca ggaaacttaa tgaatagatt actgattttc 781 ctagtcaaag ttaattctta cccttggagt aaaacgaagg tgtttatcct gtgaaaaaa // LOCUS AF023614 1377 bp mRNA PRI 16-OCT-1997 DEFINITION Homo sapiens transmembrane activator and CAML interactor (TACI) mRNA, complete cds. ACCESSION AF023614 NID g2554947 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1377) AUTHORS von Bulow,G.U. and Bram,R.J. TITLE NF-AT activation induced by a CAML-interacting member of the tumor necrosis factor receptor superfamily JOURNAL Science 278 (5335), 138-141 (1997) MEDLINE 97458245 REFERENCE 2 (bases 1 to 1377) AUTHORS von Bulow,G.-U. and Bram,R.J. TITLE Direct Submission JOURNAL Submitted (09-SEP-1997) Experimental Oncology, St Jude Children's Research Hospital, 332 North Lauderdale, Memphis, TN 38105, USA FEATURES Location/Qualifiers source 1..1377 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..1377 /gene="TACI" CDS 14..895 /gene="TACI" /codon_start=1 /product="transmembrane activator and CAML interactor" /db_xref="PID:g2554948" /translation="MSGLGRSRRGGRSRVDQEERFPQGLWTGVAMRSCPEEQYWDPLL GTCMSCKTICNHQSQRTCAAFCRSLSCRKEQGKFYDHLLRDCISCASICGQHPKQCAY FCENKLRSPVNLPPELRRQRSGEVENNSDNSGRYQGLEHRGSEASPALPGLKLSADQV ALVYSTLGLCLCAVLCCFLVAVACFLKKRGDPCSCQPRSRPRQSPAKSSQDHAMEAGS PVSTSPEPVETCSFCFPECRAPTQESAVTPGTPDPTCAGRWGCHTRTTVLQPCPHIPD SGLGIVCVPAQEGGPGA" BASE COUNT 364 a 342 c 473 g 198 t ORIGIN 1 agcatcctga gtaatgagtg gcctgggccg gagcaggcga ggtggccgga gccgtgtgga 61 ccaggaggag cgctttccac agggcctgtg gacgggggtg gctatgagat cctgccccga 121 agagcagtac tgggatcctc tgctgggtac ctgcatgtcc tgcaaaacca tttgcaacca 181 tcagagccag cgcacctgtg cagccttctg caggtcactc agctgccgca aggagcaagg 241 caagttctat gaccatctcc tgagggactg catcagctgt gcctccatct gtggacagca 301 ccctaagcaa tgtgcatact tctgtgagaa caagctcagg agcccagtga accttccacc 361 agagctcagg agacagcgga gtggagaagt tgaaaacaat tcagacaact cgggaaggta 421 ccaaggattg gagcacagag gctcagaagc aagtccagct ctcccggggc tgaagctgag 481 tgcagatcag gtggccctgg tctacagcac gctggggctc tgcctgtgtg ccgtcctctg 541 ctgcttcctg gtggcggtgg cctgcttcct caagaagagg ggggatccct gctcctgcca 601 gccccgctca aggccccgtc aaagtccggc caagtcttcc caggatcacg cgatggaagc 661 cggcagccct gtgagcacat cccccgagcc agtggagacc tgcagcttct gcttccctga 721 gtgcagggcg cccacgcagg agagcgcagt cacgcctggg acccccgacc ccacttgtgc 781 tggaaggtgg gggtgccaca ccaggaccac agtcctgcag ccttgcccac acatcccaga 841 cagtggcctt ggcattgtgt gtgtgcctgc ccaggagggg ggcccaggtg cataaatggg 901 ggtcagggag ggaaaggagg agggagagag atggagagga ggggagagag aaagagaggt 961 ggggagaggg gagagagata tgaggagaga gagacagagg aggcagaaag ggagagaaac 1021 agaggagaca gagagggaga gagagacaga gggagagaga gacagagggg aagagaggca 1081 gagagggaaa gaggcagaga aggaaagaga caggcagaga aggagagagg cagagaggga 1141 gagaggcaga gagggagaga ggcagagaga cagagaggga gagagggaca gagagagata 1201 gagcaggagg tcggggcact ctgagtccca gttcccagtg cagctgtagg tcgtcatcac 1261 ctaaccacac gtgcaataaa gtcctcgtgc ctgctgctca cagcccccga gagcccctcc 1321 tcctggagaa taaaaccttt ggcagctgcc cttcctcaaa aaaaaaaaaa aaaaaaa // LOCUS AF024579 800 bp mRNA PRI 02-JAN-1998 DEFINITION Homo sapiens protein phosphatase type-1 glycogen targeting subunit (PPP1R3) mRNA, complete cds. ACCESSION AF024579 NID g2739041 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 800) AUTHORS Xia,J., Scherer,S.W., Cohen,P.T.W., Majer,M., Xi,T., Norman,R.A., Knowler,W.C., Bogardus,C. and Prochazka,M. TITLE Genomic structure and analysis of PPP1R3 JOURNAL Unpublished REFERENCE 2 (bases 1 to 800) AUTHORS Prochazka,M. TITLE Direct Submission JOURNAL Submitted (11-SEP-1997) CDNS/PECRB, NIDDK/NIH, 4212 N. 16th Street, Phoenix, AZ 85016, USA FEATURES Location/Qualifiers source 1..800 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" /map="7q31.1-q31.2" /tissue_type="skeletal muscle" gene 1..800 /gene="PPP1R3" CDS 1..225 /gene="PPP1R3" /note="alternatively spliced transcript" /codon_start=1 /product="protein phosphatase type-1 glycogen targeting subunit" /db_xref="PID:g2739042" /translation="MEPSEVPSQISKDNFLEVPNLSDSLCEDEEVTFQPGFSPQPSRR GSDSSEDIYLDTPSSERTRAGACKTMERSS" BASE COUNT 306 a 141 c 156 g 197 t ORIGIN 1 atggagcctt ctgaagtacc tagtcagatt agcaaagata attttttaga agttcctaat 61 ttatctgact ctctttgtga agatgaagaa gttactttcc aacctggttt ctcccctcaa 121 ccaagtagac gaggttctga ttcttctgaa gacatatacc tggatacccc atcttcagaa 181 agaacaagag ccggagcctg taaaaccatg gaaagaagtt cctaacagac aaataaaagg 241 ctgcttaaag gtaaaatcaa gtaaagaaga atcatcagta acatcagaag aaaataactt 301 tgagaatcca aagaatacag atacctatat cccaacaatc atttgttctc atgaggacaa 361 ggaagatttg gaagccagta atcgaaatgt aaaagatgta aacagggaac atgatgaaca 421 taatgaaaaa gaattagagt tgatgataaa tcaacactta ataagaacca gaagtactgc 481 ttccagagat gaaaggaata cattttcaac agatccagtc aattttccaa ataaagcaga 541 ggggttagag aagaagcaaa tccatggtga aatatgtact gacttgttcc aaaggtctct 601 gtctccaagt tcatcagcag aaagctccgt aaagggagat ttttactgca atgaaaaata 661 ttcctcagga gatgactgta cacatcaacc ttcagaggaa actacttcaa atatgggaga 721 aatcaagcca tcattgggag atactagtag tgatgaacta gtgcaattac atactggcag 781 caaagaagtc ctggatgata // LOCUS AF024605 1454 bp mRNA PRI 24-OCT-1997 DEFINITION Homo sapiens serine protease-like protease (nes1) mRNA, complete cds. ACCESSION AF024605 S82666 NID g2558911 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1454) AUTHORS Liu,X.L., Wazer,D.E., Watanabe,K. and Band,V. TITLE Identification of a novel serine protease-like gene, the expression of which is down-regulated during breast cancer progression JOURNAL Cancer Res. 56 (14), 3371-3379 (1996) MEDLINE 96320486 REFERENCE 2 (bases 1 to 1454) AUTHORS Liu,X.-L., Wazer,D.E., Watanabe,K. and Band,V. TITLE Direct Submission JOURNAL Submitted (11-SEP-1997) Radiation Oncology, New England Medical Center Hospital, Tufts University School of Medicine, NEMC #824, 750 Washington St., Boston, MA 02111, USA FEATURES Location/Qualifiers source 1..1454 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="76N" /cell_type="mammary epithelial" gene 1..1454 /gene="nes1" CDS 82..912 /gene="nes1" /note="NES1" /codon_start=1 /product="serine protease-like protease" /db_xref="PID:g2558912" /translation="MRAPHLHLSAASGARALAKLLPLLMAQLWAAEAALLPQNDTRLD PEAYGAPCARGSQPWQVSLFNGLSFHCAGVLVDQSWVLTAAHCGNKPLWARVGDDHLL LLQGEQLRRTTRSVVHPKYHQGSGPILPRRTDEHDLMLLKLARPVVPGPRVRALQLPY RCAQPGDQCQVAGWGTTAARRVKYNKGLTCSSITILSPKECEVFYPGVVTNNMICAGL DRGQDPCQSDSGGPLVCDETLQGILSWGVYPCGSAQHPAVYTQICKYMSWINKVIRSN " BASE COUNT 289 a 481 c 377 g 307 t ORIGIN 1 accagcggca gaccacaggc agggcagagg cacgtctggg tcccctccct ccttcctatc 61 ggcgactccc agatcctggc catgagagct ccgcacctcc acctctccgc cgcctctggc 121 gcccgggctc tggcgaagct gctgccgctg ctgatggcgc aactctgggc cgcagaggcg 181 gcgctgctcc cccaaaacga cacgcgcttg gaccccgaag cctatggcgc cccgtgcgcg 241 cgcggctcgc agccctggca ggtctcgctc ttcaacggcc tctcgttcca ctgcgcgggt 301 gtcctggtgg accagagttg ggtgctgacg gccgcgcact gcggaaacaa gccactgtgg 361 gctcgagtag gggatgatca cctgctgctt cttcagggcg agcagctccg ccggacgact 421 cgctctgttg tccatcccaa gtaccaccag ggctcaggcc ccatcctgcc aaggcgaacg 481 gatgagcacg atctcatgtt gctaaagctg gccaggcccg tagtgccggg gccccgcgtc 541 cgggccctgc agcttcccta ccgctgtgct cagcccggag accagtgcca ggttgctggc 601 tggggcacca cggccgcccg gagagtgaag tacaacaagg gcctgacctg ctccagcatc 661 actatcctga gccctaaaga gtgtgaggtc ttctaccctg gcgtggtcac caacaacatg 721 atatgtgctg gactggaccg gggccaggac ccttgccaga gtgactctgg aggccccctg 781 gtctgtgacg agaccctcca aggcatcctc tcgtggggtg tttacccctg tggctctgcc 841 cagcatccag ctgtctacac ccagatctgc aaatacatgt cctggatcaa taaagtcata 901 cgctccaact gatccagatg ctacgctcca gctgatccag atgttatgct cctgctgatc 961 cagatgccca gaggctccat cgtccatcct cttcctcccc agtcggctga actctcccct 1021 tgtctgcact gttcaaacct ctgccgccct ccacacctct aaacatctcc cctctcacct 1081 cattccccca cctatcccca ttctctgcct gtactgaagc tgaaatgcag gaagtggtgg 1141 caaaggttta ttccagagaa gccaggaagc cggtcatcac ccagcctctg agagcagtta 1201 ctggggtcac ccaacctgac ttcctctgcc actccccgct gtgtgacttt gggcaagcca 1261 agtgccctct ctgaacctca gtttcctcat ctgcaaaatg ggaacaatga cgtgcctacc 1321 tcttagacat gttgtgagga gactatgata taacatgtgt atgtaaatct tcatgtgatt 1381 gtcatgtaag gcttaacaca gtgggtggtg agttctgact aaaggttacc tgttgtcgtg 1441 aaaaaaaaaa aaaa // LOCUS AF024636 1970 bp mRNA PRI 02-NOV-1997 DEFINITION Homo sapiens STE20-like kinase 3 (mst-3) mRNA, complete cds. ACCESSION AF024636 NID g2582412 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1970) AUTHORS Schinkmann,K.A. and Blenis,J. TITLE Cloning and characterization of a novel mammalian STE20-like kinase (mst-3) JOURNAL J. Biol. Chem. (1997) In press REFERENCE 2 (bases 1 to 1970) AUTHORS Schinkmann,K.A. and Blenis,J. TITLE Direct Submission JOURNAL Submitted (12-SEP-1997) Hematology/Oncology, Beth Israel Deaconess Medical Center, 4 Blackfan Circle, Boston, MA 02215, USA FEATURES Location/Qualifiers source 1..1970 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T cell" gene 1..1970 /gene="mst-3" CDS 79..1374 /gene="mst-3" /note="protein serine/threonine kinase; similiar to yeast STE20" /codon_start=1 /product="STE20-like kinase 3" /db_xref="PID:g2582413" /translation="MAHSPVQSGLPGMQNLKADPEELFTKLEKIGKGSFGEVFKGIDN RTQKVVAIKIIDLEEAEDEIEDIQQEITVLSQCDSPYVTKYYGSYLKDTKLWIIMEYL GGGSALDLLEPGPLDETQIATILREILKGLDYLHSEKKIHRDIKAANVLLSEHGEVKL ADFGVAGQLTDTQIKRNTFVGTPFWMAPEVIKQSAYDSKADIWSLGITAIELARGEPP HSELHPMKVLFLIPKNNPPTLEGNYSKPLKEFVEACLNKEPSFRPTAKELLKHKFILR NAKKTSYLTELIDRYKRWKAEQSHDDSSSEDSDAETDGQASGGSDSGDWIFTIREKDP KNLENGALQPSDLDRNKMKDIPKRPFSQCLSTIISPLFAELKEKSQACGGNLGSIEEL RGAIYLAEEVCPGISDTMVAQLVQRLQRYSLSGGGTSSH" BASE COUNT 539 a 469 c 491 g 468 t 3 others ORIGIN 1 ggcccgcggg cctcgccgcc ccgcgcggat cgtcgcggcc cggccgtccc gtcccaggaa 61 gtggccgtcc tgagcgccat ggctcactcc ccggtgcagt cgggcctgcc cggcatgcag 121 aacctaaagg cagacccaga agagcttttt acaaaactag agaaaattgg gaagggctcc 181 tttggagagg tgttcaaagg cattgacaat cggactcaga aagtggttgc cataaagatc 241 attgatctgg aagaagctga agatgagata gaggacattc aacaagaaat cacagtgctg 301 agtcagtgtg acagtccata tgtaaccaaa tattatggat cctatctgaa ggatacaaaa 361 ttatggataa taatggaata tcttggtgga ggctccgcac tagatctatt agaacctggc 421 ccattagatg aaacccagat cgctactata ttaagagaaa tactgaaagg actcgattat 481 ctccattcgg agaagaaaat ccacagagac attaaagcgg ccaacgtcct gctgtctgag 541 catggcgagg tgaagctggc ggactttggc gtggctggcc agctgacaga cacccagatc 601 aaaaggaaca ccttcgtggg caccccattc tggatggcac ccgaggtcat caaacagtcg 661 gcctatgact cgaaggcaga catctggtcc ctgggcataa cagctattga acttgcaaga 721 ggggaaccac ctcattccga gctgcacccc atgaaagttt tattcctcat tccaaagaac 781 aacccaccga cgttggaagg aaactacagt aaaccsctca aggagtttgt ggaggcctgt 841 ttgaataagg agccgagctt tagacccact gctaaggagt tattgaagca caagtttata 901 ctacgcaatg caaagaaaac ttcctacttg accgagctca tcgacaggta caagagatgg 961 aaggccgagc agagccatga cgactcgagc tccgaggatt ccgacgcgga aacagatggc 1021 caagcctcgg ggggcagtga ttctggggac tggatcttca caatccgaga aaaagatccc 1081 aagaatctcg agaatggagc tcttcagcca tcggacttgg acagaaataa gatgaaagac 1141 atcccaaaga ggcctttctc tcagtgttta tctacaatta tttctcctct gtttgcagag 1201 ttgaaggaga agagccaggc gtgcggaggg aacttgggtt ccattgaaga gctgcgaggg 1261 gccatctacc tagcggagga ggtgtgccct ggcatctccg acaccatggt ggcccagctc 1321 gtgcagcggc tccagagata ctctctaagt ggtggaggaa cttcatccca ctgaaattcc 1381 tttggcattt ggggttttgt ttttcctttt ttccttcttc atcctcctcc ttttttaaaa 1441 gtcaacgaga gccttcgctg actccaccga agaggtgcgc cactgggagc caccccagcs 1501 ccaggcgccc gtccagggac acacacagtc ttcgctgtgc tgcagccaga tgaagtctct 1561 cagatgggtg gggagggtca gctccttcca gcgatcattt tattttattt tattackttt 1621 gtttttaatt ttaaccatag cgcacatatt ccaggaaagt gtctttaaaa acaaaaacaa 1681 accctgaaat gtatatttgg gattatgata aggcaactaa agacatgaaa cctcaggtat 1741 cctgctttaa gttgataact ccctctggga gctggagaat cgctctggtg gatgggtgta 1801 cagatttgta tataatgtca tttttacgga aaccctttcg gcgtgcataa ggaatcactg 1861 tgtacaaact ggccaagtgc ttctgtagat aacgtcagtg gagtaaatat tcgacaggcc 1921 ataacttgag tctattgcct tgcctttatt acatgtacat tttgaattcc // LOCUS AF024687 923 bp DNA PRI 21-NOV-1997 DEFINITION Homo sapiens putative G protein-coupled receptor (GPR40) gene, complete cds. ACCESSION AF024687 NID g2612945 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 923) AUTHORS Sawzdargo,M., George,S.R., Nguyen,T., Xu,S., Kolakowski,L.F. and O'Dowd,B.F. TITLE A cluster of four novel human G protein-coupled receptor genes occurring in close proximity to CD22 gene on chromosome 19q13.1 JOURNAL Biochem. Biophys. Res. Commun. 239 (2), 543-547 (1997) MEDLINE 98008875 REFERENCE 2 (bases 1 to 923) AUTHORS O'Dowd,B.F. TITLE Direct Submission JOURNAL Submitted (15-SEP-1997) Department of Pharmacology, University of Toronto, 8 Taddle Creek Rd., Toronto, ON M5S 1A8, Canada FEATURES Location/Qualifiers source 1..923 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="19" /map="19q13.1" gene 11..913 /gene="GPR40" CDS 11..913 /gene="GPR40" /codon_start=1 /product="putative G protein-coupled receptor" /db_xref="PID:g2612946" /translation="MDLPPQLSFGLYVAAFALGFPLNVLAIRGATAHARLRLTPSLVY ALNLGCSDLLLTVSLPLKAVEALASGAWPLPASLCPVFAVAHFFPLYAGGGFLAALSA GRYLGAAFPLGYQAFRRPCYSWGVCAAIWALVLCHLGLVFGLEAPGGWLDHSNTSLGI NTPVNGSPVCLEAWDPASAGPARFSLSLLLFFLPLAITAFCYVGCLRALARSGLTHRR KLRAAWVAGGALLTLLLCVGPYNASNVASFLYPNLGGSWRKLGLITGAWSVVLNPLVT GYLGRGPGLKTVCAARTQGGKSQK" BASE COUNT 109 a 338 c 289 g 187 t ORIGIN 1 cggcggcccc atggacctgc ccccgcagct ctccttcggc ctctatgtgg ccgcctttgc 61 gctgggcttc ccgctcaacg tcctggccat ccgaggcgcg acggcccacg cccggctccg 121 tctcacccct agcctggtct acgccctgaa cctgggctgc tccgacctgc tgctgacagt 181 ctctctgccc ctgaaggcgg tggaggcgct agcctccggg gcctggcctc tgccggcctc 241 gctgtgcccc gtcttcgcgg tggcccactt cttcccactc tatgccggcg ggggcttcct 301 ggccgccctg agtgcaggcc gctacctggg agcagccttc cccttgggct accaagcctt 361 ccggaggccg tgctattcct ggggggtgtg cgcggccatc tgggccctcg tcctgtgtca 421 cctgggtctg gtctttgggt tggaggctcc aggaggctgg ctggaccaca gcaacacctc 481 cctgggcatc aacacaccgg tcaacggctc tccggtctgc ctggaggcct gggacccggc 541 ctctgccggc ccggcccgct tcagcctctc tctcctgctc ttttttctgc ccttggccat 601 cacagccttc tgctacgtgg gctgcctccg ggcactggcc cgctccggcc tgacgcacag 661 gcggaagctg cgggccgcct gggtggccgg cggggccctc ctcacgctgc tgctctgcgt 721 aggaccctac aacgcctcca acgtggccag cttcctgtac cccaatctag gaggctcctg 781 gcggaagctg gggctcatca cgggtgcctg gagtgtggtg cttaatccgc tggtgaccgg 841 ttacttggga aggggtcctg gcctgaagac agtgtgtgcg gcaagaacgc aagggggcaa 901 gtcccagaag taacgccact gct // LOCUS AF024711 1445 bp DNA PRI 09-DEC-1997 DEFINITION Homo sapiens cone rod homeobox protein (CRX) gene, complete cds. ACCESSION AF024711 NID g2665533 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1445) AUTHORS Freund,C.L., Gregory-Evans,C.Y., Furukawa,T., Papaioannou,M., Looser,J., Ploder,L., Bellingham,J., Ng,D., Herbrick,J.S., Duncan,A., Scherer,S.W., Tsui,L.-C., Loutradis-Anagnostou,A., Jacobson,S.G., Cepko,C.L., Bhattacharya,S.S. and McInnes,R.R. TITLE Cone-rod dystrophy due to mutations in a novel photoreceptor-specific homeobox gene (CRX) essential for maintenance of the photoreceptor JOURNAL Cell 91 (4), 543-553 (1997) MEDLINE 98050929 REFERENCE 2 (bases 1 to 1445) AUTHORS Freund,C.L., Looser,J., Ploder,L., Ng,D. and McInnes,R.R. TITLE Direct Submission JOURNAL Submitted (12-SEP-1997) Genetics, The Hospital for Sick Children, 555 University Ave., Toronto, ON M5G 1X8, Canada FEATURES Location/Qualifiers source 1..1445 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="19" /map="19q13.3" gene 36..935 /note="homeobox gene; responsible for autosomal dominant cone-rod dystrophy human disease" /gene="CRX" CDS 36..935 /gene="CRX" /note="homeodomain protein" /codon_start=1 /product="cone rod homeobox protein" /db_xref="PID:g2665534" /translation="MMAYMNPGPHYSVNALALSGPSVDLMHQAVPYPSAPRKQRRERT TFTRSQLEELEALFAKTQYPDVYAREEVALKINLPESRVQVWFKNRRAKCRQQRQQQK QQQQPPGGQAKARPAKRKAGTSPRPSTDVCPDPLGISDSYSPPLPGPSGSPTTAVATV SIWSPASESPLPEAQRAGLVASGPSLTSAPYAMTYAPASAFCSSPSAYGSPSSYFSGL DPYLSPMVPQLGGPALSPLSGPSVGPSLAQSPTSLSGQSYGAYSPVDSLEFKDPTGTW KFTYNPMDPLDYKDQSAWKFQIL" BASE COUNT 300 a 477 c 365 g 303 t ORIGIN 1 gccccctgac ttgggcctca gtgtccccga agatcatgat ggcgtatatg aacccggggc 61 cccactattc tgtcaacgcc ttggccctaa gtggccccag tgtggatctg atgcaccagg 121 ctgtgcccta cccaagcgcc cccaggaagc agcggcggga gcgcaccacc ttcacccgga 181 gccaactgga ggagctggag gcactgtttg ccaagaccca gtacccagac gtctatgccc 241 gtgaggaggt ggctctgaag atcaatctgc ctgagtccag ggttcaggtt tggttcaaga 301 accggagggc taaatgcagg cagcagcgac agcagcagaa acagcagcag cagcccccag 361 ggggccaggc caaggcccgg cctgccaaga ggaaggcggg cacgtcccca agaccctcca 421 cagatgtgtg tccagaccct ctgggcatct cagattccta cagtccccct ctgcccggcc 481 cctcaggctc cccaaccacg gcagtggcca ctgtgtccat ctggagccca gcctcagagt 541 cccctttgcc tgaggcgcag cgggctgggc tggtggcctc agggccgtct ctgacctccg 601 ccccctatgc catgacctac gccccggcct ccgctttctg ctcttccccc tccgcctatg 661 ggtctccgag ctcctatttc agcggcctag acccctacct ttctcccatg gtgccccagc 721 tagggggccc ggctcttagc cccctctctg gcccctccgt gggaccttcc ctggcccagt 781 cccccacctc cctatcaggc cagagctatg gcgcctacag ccccgtggat agcttggaat 841 tcaaggaccc cacgggcacc tggaaattca cctacaatcc catggaccct ctggactaca 901 aggatcagag tgcctggaag tttcagatct tgtagaggac gcagtctcca tctctctcca 961 tcgggcctcg ggaccctttc tcttctgaat ctgcttccct gcagtttaga tcccgggatg 1021 gcattcctga gaaagcaacc cgaaccagct gtccttctga cagctcggtg ttcagcttac 1081 agagaccacc cctttcctcc acagggagag gctcctccct ctcctgggac agctcacagg 1141 tcctagtgat tctctcaacc ctaacaccgt ctggcacgat tgtgaccgct gaagtacacc 1201 acgagctcca ggcttcagaa agtggtgctg agaacttgct ccaagaagaa gtcaaaccaa 1261 acttgcagtt gatttggggt catgtttagg tcagaatcac cgtgcccttg aacaagcagg 1321 taggggggct tgataactta actttccacg tggacagaat tttttttttt gttttgtttt 1381 tgttctgcac tccagcctgg gcaacaagag tgaaactctg tctcaaaaaa aaaaaaaaaa 1441 aaacg // LOCUS AF024714 1485 bp mRNA PRI 24-OCT-1997 DEFINITION Homo sapiens interferon-inducible protein (AIM2) mRNA, complete cds. ACCESSION AF024714 NID g2558941 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1485) AUTHORS DeYoung,K.L., Ray,M.E., Su,Y.A., Anzick,S.L., Johnstone,R.W., Trapani,J.A., Meltzer,P.S. and Trent,J.M. TITLE Cloning a novel member of the human interferon-inducible gene family associated with control of tumorigenicity in a model of human melanoma JOURNAL Oncogene 15 (4), 453-457 (1997) MEDLINE 97384871 REFERENCE 2 (bases 1 to 1485) AUTHORS DeYoung,K.L., Ray,M.E., Su,Y.A., Anzick,S.L., Meltzer,P.S. and Trent,J.M. TITLE Direct Submission JOURNAL Submitted (12-SEP-1997) Laboratory of Cancer Genetics, National Human Genome Research Institute, 49 Convent Dr., Bethesda, MD 20892-4470, USA FEATURES Location/Qualifiers source 1..1485 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1q22" gene 1..1485 /note="absent in melanoma 2" /gene="AIM2" CDS 246..1277 /gene="AIM2" /note="absent in melanoma 2; similar to interferon-inducible IFI16 and MNDA" /codon_start=1 /product="interferon-inducible protein" /db_xref="PID:g2558942" /translation="MESKYKEILLLTGLDNITDEELDRFKFFLSDEFNIATGKLHTAN RIQVATLMIQNAGAVSAVMKTIRIFQKLNYMLLAKRLQEEKEKVDKQYKSVTKPKPLS QAEMSPAASAAIRNDVAKQRAAPKVSPHVKPEQKQMVAQQESIREGFQKRCLPVMVLK AKKPFTFETQEGKQEMFHATVATEKEFFFVKVFNTLLKDKFIPKRIIIIARYYRHSGF LEVNSASRVLDAESDQKVNVPLNIIRKAGETPKINTLQTQPLGTIVNGLFVVQKVTEK KKNILFDLSDNTGKMEVLGVRNEDTMKCKEGDKVRLTFFTLSKNGEKLQLTSGVHSTI KVIKAKKKT" BASE COUNT 500 a 284 c 334 g 367 t ORIGIN 1 tcagccaatt agagctccag ttgtcactcc tacccacact gggcctgggg gtgaagggaa 61 gtgtttatta ggggtacatg tgaagccgtc cagaagtgtc agagtctttg tagctttgaa 121 agtcacctag gttatttggg catgctctcc tgagtcctct gctagttaag ctctctgaaa 181 agaaggtggc agacccggtt tgctgatcgc cccagggatc aggaggctga tcccaaagtt 241 gtcagatgga gagtaaatac aaggagatac tcttgctaac aggcctggat aacatcactg 301 atgaggaact ggataggttt aagttctttc tttcagacga gtttaatatt gccacaggca 361 aactacatac tgcaaacaga atacaagtag ctaccttgat gattcaaaat gctggggcgg 421 tgtctgcagt gatgaagacc attcgtattt ttcagaagtt gaattatatg cttttggcaa 481 aacgtcttca ggaggagaag gagaaagttg ataagcaata caaatcggta acaaaaccaa 541 agccactaag tcaagctgaa atgagtcctg ctgcatctgc agccatcaga aatgatgtcg 601 caaagcaacg tgctgcacca aaagtctctc ctcatgttaa gcctgaacag aaacagatgg 661 tggcccagca ggaatctatc agagaagggt ttcagaagcg ctgtttgcca gttatggtac 721 tgaaagcaaa gaagcccttc acgtttgaga cccaagaagg caagcaggag atgtttcatg 781 ctacagtggc tacagaaaag gaattcttct ttgtaaaagt ttttaataca ctgctgaaag 841 ataaattcat tccaaagaga ataattataa tagcaagata ttatcggcac agtggtttct 901 tagaggtaaa tagcgcctca cgtgtgttag atgctgaatc tgaccaaaag gttaatgtcc 961 cgctgaacat tatcagaaaa gctggtgaaa ccccgaagat caacacgctt caaactcagc 1021 cccttggaac aattgtgaat ggtttgtttg tagtccagaa ggtaacagaa aagaagaaaa 1081 acatattatt tgacctaagt gacaacactg ggaaaatgga agtactgggg gttagaaacg 1141 aggacacaat gaaatgtaag gaaggagata aggttcgact tacattcttc acactgtcaa 1201 aaaatggaga aaaactacag ctgacatctg gagttcatag caccataaag gttattaagg 1261 ccaaaaaaaa aacatagaga agtaaaaagg accaattcaa gccaactggt ctaagcagca 1321 tttaattgaa gaatatgtga tacagcctct tcaatcagat tgtaagttac ctgaaagctg 1381 cagttcacag gctcctctct ccaccaaatt aggatagaat aattgctgga taaacaaatt 1441 cagaatatca acagatgatc acaataaaca tctgtttctc attcc // LOCUS AF025409 1557 bp mRNA PRI 02-NOV-1997 DEFINITION Homo sapiens zinc transporter 4 (ZNT4) mRNA, complete cds. ACCESSION AF025409 NID g2582414 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1557) AUTHORS Huang,L. and Gitschier,J. TITLE Identification of a novel gene involved in zinc transport and its defect in the lethal milk mouse JOURNAL Nature Genet. (1997) In press REFERENCE 2 (bases 1 to 1557) AUTHORS Huang,L. and Gitschier,J. TITLE Direct Submission JOURNAL Submitted (13-SEP-1997) Howard Hughes Medical Institute/University of California San Francisco, 3rd and Parnassus Avenues, U280, San Francisco, CA 94143-0724, USA FEATURES Location/Qualifiers source 1..1557 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..1557 /gene="ZNT4" CDS 1..1290 /gene="ZNT4" /codon_start=1 /product="zinc transporter 4" /db_xref="PID:g2582415" /translation="MAGSGAWKRLKSMLRKDDAPLFLNDTSAFEFSDEAGDEGLSRFN KLRVVVADDGSEAPERPVNGAHPTLQADDDSLLDQDLPLTNSQLSLKVDSCDNCSKQR EILKQRKVKARLTIAAVLYLLFMIGELVGGYIANSLAIMTDALHMLTDLSAIILTLLA LWLSSKSPTKRFTFGFHRLEVLSAMISVLLVYILMGFLLYEAVQRTIHMNYEINGDIM LITAAVGVAVNVIMGFLLNQSGHRHSHSHSLPSNSPTRGSGCERNHGQDSLAVRAAFV HALGDLVQSVGVLIAAYIIRFKPEYKIADPICTYVFSLLVAFTTFRIIWDTVVIILEG VPSHLNVDYIKEALMKIEDVYSVEDLNIWSLTSGKSTAIVHIQLIPGSSSKWEEVQSK ANHLLLNTFGMYRCTIQLQSYRQEVDRTCANCQSSSP" BASE COUNT 441 a 327 c 349 g 440 t ORIGIN 1 atggccggct ctggcgcgtg gaagcgcctc aaatctatgc taaggaagga tgatgcgccg 61 ctgtttttaa atgacaccag cgcctttgag ttctcggatg aggcggggga cgaggggctt 121 tctcggttca acaaacttcg agttgtggtg gccgatgacg gttccgaagc cccggaaagg 181 cctgttaacg gggcgcaccc gaccctccag gccgacgatg attccttact ggaccaagac 241 ttacctttga ccaacagtca gctgagtttg aaggtggact cctgtgacaa ctgcagcaaa 301 cagagagaga tactgaagca gagaaaggtg aaagccaggt tgaccattgc tgccgttctg 361 tacttgcttt tcatgattgg agaacttgta ggtggataca ttgcaaatag cctagcaatc 421 atgacagatg cacttcatat gttaactgac ctaagcgcca tcatactcac cctgcttgct 481 ttgtggctat catcaaaatc accaaccaaa agattcacct ttggatttca tcgcttagag 541 gttttgtcag ctatgattag tgtgctgttg gtgtatatac ttatgggatt cctcttatat 601 gaagctgtgc aaagaactat ccatatgaac tatgaaataa atggagatat aatgctcatc 661 accgcagctg ttggagttgc agttaatgta ataatggggt ttctgttgaa ccagtctggt 721 caccgtcact cccattccca ctccctgcct tcaaattccc ctaccagagg ttctgggtgt 781 gaacgtaacc atgggcagga tagcctggca gtgagagctg catttgtaca tgctttggga 841 gatctggtac agagtgttgg tgtgctaata gctgcataca tcatacgatt caagccagaa 901 tacaagattg ctgaccccat ctgtacatac gtattttcat tacttgtggc ttttacaaca 961 tttcgaatca tatgggatac agtagttata atactagaag gtgtgccaag ccatttgaat 1021 gtagactata tcaaagaagc cttgatgaaa atagaagatg tatattcagt cgaagattta 1081 aatatctggt ctctcacttc aggaaaatct actgccatag ttcacataca gctaattcct 1141 ggaagttcat ctaaatggga ggaagtacag tccaaagcaa accatttatt attgaacaca 1201 tttggcatgt atagatgtac tattcagctt cagagttaca ggcaagaagt ggacagaact 1261 tgtgcaaatt gtcagagttc tagtccctaa ttttatgtat tgttttagca ttgctgaatt 1321 cactttattt atcctgcagt cacagacttg agagcaataa atgcaaacct aaatgagaaa 1381 atggaatccc tgacagctgt gtccgtatca agcatcagtc tctcaaacag ttgccccagc 1441 ctgacagtgc tagtctctgt ttaatggtaa aaggagactt tgccataatt ttcagatgaa 1501 gatgtttccc aaacactgtt tacagaatga gatgtgactc ctacagatac ctcatag // LOCUS AF025654 4546 bp mRNA PRI 19-DEC-1997 DEFINITION Homo sapiens mRNA capping enzyme (HCE) mRNA, complete cds. ACCESSION AF025654 NID g2697128 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4546) AUTHORS Yue,Z., Maldonado,E., Pillutla,R., Cho,H., Reinberg,D. and Shatkin,A.J. TITLE Mammalian capping enzyme complements mutant Saccharomyces cerevisiae lacking mRNA guanylyltransferase and selectively binds the elongating form of RNA polymerase II JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (24), 12898-12903 (1997) MEDLINE 98058741 REFERENCE 2 (bases 1 to 4546) AUTHORS Yue,Z., Maldonado,E., Pillutla,R.C., Cho,H., Reinberg,D. and Shatkin,A.J. TITLE Direct Submission JOURNAL Submitted (19-SEP-1997) Molecular Virology, Center for Advanced Biotechnology and Medicine, 679 Hoes Lane, Piscataway, NJ 08854, USA FEATURES Location/Qualifiers source 1..4546 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" gene 1..4546 /gene="HCE" CDS 289..2082 /gene="HCE" /function="RNA guanylyltransferase; RNA 5' triphosphatase" /codon_start=1 /product="mRNA capping enzyme" /db_xref="PID:g2697129" /translation="MAHNKIPPRWLNCPRRGQPVAGRFLPLKTILGPRYDSQVAEENR FHPSMLSNYLKSLKVKMGLLVDLTNTSRFYDRNDIEKEGIKYIKLQCKGHGECPTTEN TETFIRLCERFNERNPPELIGVHCTHGFNRTGFLICAFLVEKMDWSIEAAVATFAQAR PPGIYKGDYLKELFRRYGDIEEAPPPPLLPDWCFEDDEDEDEDEDGKKESEPGSSASF GKRRKERLKLGAIFLEGVTVKGVTQVTTQPKLGEVQQKCHQFCGWEGSGFPGAQPVSM DKQNIKLLDLKPYKVSWKADGTRYMMLIDGTNEVFMIDRDNSVFHVSNLEFPFRKDLR MHLSNTLLDGEMIIDRVNGQAVPRYLIYDIIKFNSQPVGDCDFNVRLQCIEREIISPR HEKMKTGLIDKTQEPFSVRNKPFFDICTSRKLLEGNFAKEVSHEMDGLIFQPTGKYKP GRCDDILKWKPPSLNSVDFRLKITRMGGEGLLPPNVGLLYVGGYERPFAQIKVTKELK QYDNKIIECKFENNSWVFMRQRTDKSFPNAYNTAMAVCNSISNPVTKEMLFEFIDRCT AASQGQKRKHHLDPDTELMPPPPPKRPRPLT" BASE COUNT 1402 a 823 c 957 g 1364 t ORIGIN 1 taccagtttg aatacatggc ccagcacctg gtgtttgcca actgcattcc tttgatccta 61 aagttcttca atcaaaacat catgtcctac atcactgcca agaacagcat ttctgtcctg 121 aatgaaagtg gcgcgccgcc cctgacgtta cccggatcgg agaggttgga attcagatta 181 cggctgcgat tcgggtgtct cggaccccgg tgtgcaccgg accacgggga ggcggctcca 241 aaggcgcggt gaacgttggt gagggagggc agctctgcgc cccaagagat ggctcacaac 301 aagatcccgc cgcggtggct gaactgtccc cggcgcggcc agccggtggc aggaagattc 361 ttacctctga agacaatatt aggaccaaga tatgatagtc aagttgctga agaaaatcgg 421 ttccatccca gcatgctctc aaattaccta aagagcctaa aggttaaaat gggcttgttg 481 gtggacctga caaatacttc aaggttctat gaccgaaatg acatagaaaa agaaggaatc 541 aaatatataa aacttcagtg taaaggacat ggtgagtgcc ctaccactga gaatactgag 601 acctttattc gtctgtgtga gcggtttaat gaaagaaatc cgcctgaact tataggtgtt 661 cattgtactc atggcttcaa tcgcactggt ttcctcatat gtgccttttt ggtggagaaa 721 atggattgga gtatcgaagc agcagttgct acttttgccc aagccagacc accaggaatc 781 tacaagggtg attatttgaa ggaacttttt cgtcggtatg gtgacataga ggaagcacca 841 cccccacctc tattgccaga ttggtgtttt gaggatgatg aagacgaaga tgaggatgag 901 gatggaaaga aggaatcaga acccgggtca agtgcttctt ttggcaaaag gagaaaagaa 961 cggttaaaac tgggcgctat tttcttggaa ggtgttactg ttaaaggtgt aactcaagta 1021 acaacacaac caaagttagg agaggtacag cagaagtgtc atcaattctg tggctgggaa 1081 gggtctggat tccctggagc acagcctgtt tccatggaca agcaaaatat taaactttta 1141 gacctgaagc catacaaagt aagctggaaa gcagatggta ctcggtacat gatgttgatt 1201 gatggcacaa atgaagtttt tatgattgat agagacaatt cagtatttca tgtttcaaat 1261 ctggaatttc catttcgtaa agatcttcgt atgcatttat caaatactct cttggatggc 1321 gagatgatta ttgacagagt aaatggacag gctgttccta gatatttgat atatgacata 1381 attaaattca attcacagcc cgttggagat tgtgatttta atgttcgtct gcagtgtata 1441 gaacgagaaa ttataagtcc tcgacacgaa aaaatgaaga ctgggctcat tgacaaaaca 1501 caggaaccat ttagcgtcag aaataagccg ttttttgaca tctgtacttc aagaaagcta 1561 cttgaaggaa attttgccaa agaagtgagc catgaaatgg atggacttat ttttcagcct 1621 actggaaaat acaaacctgg tcgatgtgat gatattttga aatggaagcc tcccagtctg 1681 aattctgtgg attttcgtct aaaaataacc agaatgggag gagaagggtt acttcctccg 1741 aatgttggcc tcctgtatgt tggaggttat gaaagaccct ttgcacaaat caaggtgaca 1801 aaagagctga aacagtatga caacaaaatt atagaatgca aatttgagaa caacagctgg 1861 gtcttcatga gacagagaac agacaaaagt tttcctaatg cctacaacac tgccatggct 1921 gtgtgtaaca gcatctcaaa ccctgtcacc aaggagatgc tgtttgagtt catcgacaga 1981 tgtactgcag cttctcaagg acagaagcga aaacatcatc tggaccctga cacggagctc 2041 atgccaccac cacctcccaa aagaccacgc cctttaacct aagacctgcc tgtgacttga 2101 gggttaagaa gaaagaggaa tgaggaaaaa acgctgttgc ccatttttgt cgccgagaaa 2161 ttgaaaaacg actgtggctt gatgttgata cacatttgaa atttttttta aaaaaaaaga 2221 attatcgtag ccagcctgta aatacttgat gcattcgttt cctcagtgct gcaatacatc 2281 atggacttaa acatcttatt tatctgataa actcagcctt accttgtgga atctgaagat 2341 tgaaatatcc aaaggtttaa acaagatgga aatcaatgaa aaagaggcct tgtttatata 2401 tgggtagctt cacatttaat ttataaagtt aacaatctgg aaactgtgag cccataaaac 2461 agtatttttt agaagttgtg ttacaactgt agcaaatttt ccttttaaaa aatccaggca 2521 atggcattaa acattcttgc tatcatttac catggatttc ctacatttct gagatccctg 2581 tattcaaaaa caaatgacag ggttgttaag ttgtactata ttaaacaact tggtctggtt 2641 tatatcattt taacaagttt tttggaaatg taaaagaaaa aagtaaagct ccatttttgt 2701 gaattgcatt gtttctaaag gaagtatttt gagtattttc atctgtttgt attgtttcga 2761 ctatgatatc agatatccag tgtgtccatt ctggcagtgg ttttgagtta cttcatggaa 2821 cttggacata caccaccagg tcctttatgt gttgttctcc tgggagactt ctttttttaa 2881 atgccaggct acttctgctt tacctcagtg gtatcagcac attgttaatt acgttaaaac 2941 acacaactca ctgtgatcta tgggagtgat atcaaataca aacgatgttg tgttccttct 3001 ctcgaacaaa acagcaaagg gaatgccagc taacttcttt gattagaagg taatgtcatg 3061 gaatagaatg ctgtcaccag aaaatgctct cccactgcta gataaatgca gaggcaaagt 3121 gtagacatct gtaggaacta aatatcaagg aaatcaagac atttattttc aaagcgagaa 3181 ttctatctac attctattct gcggatgtta tttctgttat acatctgaat tttaaaatgt 3241 ttactgatag ataatggatt taaagggcgg ctaattgatt gtctgccatt gttgataaac 3301 tgttctaatt ggtttgtgaa agaactctga gaattagcta aatggaggct agtgtattta 3361 tttttagagg tacatgtttt aagtaaatgt atttaaaaca aattttcatg gtctgaattg 3421 tatatgtaaa tgcatatctg tgtaagtata tgcaagtgta tattctttct actgatttca 3481 gtcagacacg aaatccaact gaattacccg gtagactgaa tatctgacgg gatttctata 3541 tggacatagc acagatgaac caaaaatgta tttatattca cctgagtttt aatgttttta 3601 aacaaagttt tctcctgcag tgcttttcta tatgaaatag ttgagtcaag cttgggcttt 3661 tttcttcgtt gttcccaagt catgtaatgt atagtgaatt tatccaaaac gccattgtaa 3721 cgttttttac tagagtgttt ttgcacattt gggcgctaaa gggattaaaa caactgatgc 3781 cacaatgact attgaaggag ggaaaattct caaggttcca tagtatcttt acaaaatgga 3841 ttaaaaaaaa atgcctatgc aacttcaaat agtctgatgc ctgcttcccc aggacgagtg 3901 actcttccac ggtgtgctgc tttccctccg agcctgtgga aaggcagagc ccttttgcag 3961 atacgaaagc aagcttttgc caatgatctt taatggagtt aaaatgttta tggtaattgt 4021 aacctttcaa atgagttacg tgaaaagagc atctcacttt taatacaccc agatattttc 4081 ttcaagtctt tgcgcatttt tagcaggact taattttcct aaatttattc tttcctttca 4141 attatagtga catagagtag ttggccttat aagtagatcc catttagccc ctgaacataa 4201 taccatctgc agtattataa aagtcattta gaaggtcagg gggataatct agaggcagga 4261 ttttggacat tgtgaaggaa atgtgctcct tctcagctca cttcaataac tattttctga 4321 gactgaagtt ttttagacaa gaataagaaa actttgcttt cttcagttat cacatgtgaa 4381 agcttttgct ttttgttagg aaaaggtgtg gatgactcta ctgcgtggat ggttacattt 4441 gctttccgtg aaatgctcag aaaatgtcag gacattcctt ttccaattgt gtgtcagtgc 4501 gtattgtaat aaaactactg atggaattcc ggaaaaaaaa aaaaaa // LOCUS AF025840 1807 bp mRNA PRI 19-DEC-1997 DEFINITION Homo sapiens DNA polymerase epsilon subunit B (DPE2) mRNA, complete cds. ACCESSION AF025840 NID g2697122 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1807) AUTHORS Li,Y., Asahara,H., Patel,V.S., Zhou,S. and Linn,S. TITLE Purification, cDNA cloning, and gene mapping of the small subunit of human DNA polymerase epsilon JOURNAL J. Biol. Chem. 272 (51), 32337-32344 (1997) MEDLINE 98070407 REFERENCE 2 (bases 1 to 1807) AUTHORS Li,Y., Patel,V.S. and Linn,S. TITLE Direct Submission JOURNAL Submitted (17-SEP-1997) Department of Molecular and Cell biology, Univ. of California, Berkeley, 401 Barker Hall, Berkeley, CA 94720-3202, USA FEATURES Location/Qualifiers source 1..1807 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="14" /map="14q13-q21" /cell_line="HeLa S3" gene 1..1807 /gene="DPE2" CDS 131..1711 /gene="DPE2" /codon_start=1 /product="DNA polymerase epsilon subunit B" /db_xref="PID:g2697123" /translation="MAPERLRSRAPSAFKLRGLLLRGEAIKYLTEALQSISELELEDK LEKIINAVEKQPLSSNMIERSVVEAAVQECSQSVDETIEHVFNIIGAFDIPRFVYNSE RKKFLPLLMTNHPAPNLFGTPRDKAEMFRERYTILHQRTHRHELFTPPVIGSHPDESG SKFQLKTIETLLGSTTKIGDAIVLGMITQLKEGKFFLEDPTGTVQLDLSKAQFHSGLY TEACFVLAEGWFEDQVFHVNAFGFPPTEPSSTTRAYYGNINFFGGPSNTSVKTSAKLK QLEEENKDAMFVFLSDVWLDQVEVLEKLRIMFAGYSPAPPTCFILCGNFSSAPYGKNQ VQALKDSLKTLADIICEYPDIHQSRFVFVPGPEDPGFGSILPRPPLAESITNEFRQRV PFSVFTTNPCRIQYCTQEITVFREDLVNKMCRNCVRFPSSNLAIPNHFVKTILSQGHL TPLPLYVCPVYWAYDYALRVYPVPDLLVIADKYDPFTTTNTECLCINPGSFPRSGFSF KVFYPSNKTVEDSKLQGF" BASE COUNT 511 a 381 c 347 g 568 t ORIGIN 1 ctttccgtcc ccttgcttcg tcttcgcttt tctttctact tattcttatc tgtgtctttc 61 gctttgtttg cctctccgtc tgttttccct cagggccccc ttctttcctc gaccttttca 121 aatcgcaaat atggcgccgg agcggctgcg gagccgggcg ccctccgcct tcaagttgcg 181 gggcttgctg ctccgtggtg aagctattaa gtacctcaca gaagctcttc agtctatcag 241 tgaattagag cttgaagata aactggaaaa gataattaat gcagttgaga agcaaccctt 301 gtcatcaaac atgattgaac gatctgtggt ggaagcagca gtccaggaat gcagtcagtc 361 tgttgatgaa actatagagc acgttttcaa tatcatagga gcatttgata ttccacgctt 421 tgtgtacaat tcagaaagaa aaaaatttct tcctctgtta atgaccaacc accctgcacc 481 aaatttattt ggaacaccaa gagataaagc agagatgttt cgtgagcgat ataccatttt 541 gcaccagagg acccacaggc atgaattatt tactcctccg gtgataggtt ctcaccctga 601 tgaaagcgga agcaaattcc agcttaaaac aatagaaacc ttattgggta gtacaaccaa 661 aatcggagat gcgattgttc ttggaatgat aacgcagtta aaagagggaa aattttttct 721 ggaagatcct actggaacag tccaactaga ccttagtaaa gctcagttcc atagtggttt 781 atacacagag gcatgctttg tcttagcaga aggttggttt gaagatcaag tgtttcatgt 841 caatgccttt ggatttccac ccactgagcc ctctagtact actagggcat actatggaaa 901 tattaatttt tttggaggtc cttctaatac atctgtgaag acttctgcaa aactaaaaca 961 gctagaagag gagaataaag atgctatgtt tgtgttttta tctgatgttt ggttggacca 1021 ggtggaagta ttggaaaaac ttcgcataat gtttgctggt tattcaccag cacctccaac 1081 ctgctttatt ctgtgtggta atttttcatc tgcaccatat ggaaaaaatc aagttcaagc 1141 tttgaaagat tccctaaaaa ctttggcaga tataatatgt gaatacccag atattcacca 1201 aagtcgtttt gtgtttgtac ctggtccaga ggatcctgga tttggttcca tcttaccaag 1261 gccaccactt gctgaaagca tcactaatga attcagacaa agggtaccat tttcagtttt 1321 tactactaat ccttgcagaa ttcagtactg tacacaggaa attactgtct tccgtgaaga 1381 cttagtaaat aaaatgtgca gaaactgcgt ccgttttcct agcagcaatt tggctattcc 1441 taatcacttt gtaaagacta tcttatccca aggacatctg actcccctac ctctttatgt 1501 ctgcccagtg tattgggcat atgactatgc tttgagagtg tatcctgtgc ccgatctact 1561 tgtcattgca gacaaatatg atcctttcac tacgacaaat accgaatgcc tctgcataaa 1621 ccctggctct tttccaagaa gtggattttc attcaaagtt ttttatcctt ctaataagac 1681 agtagaagat agcaaacttc aaggcttttg agattcttaa agatcatctg aagaaaattc 1741 atcagttttc tgcttaactc tatatcttat gtgattctga tattacaata aaattatggt 1801 aaacttt // LOCUS AF026004 2697 bp mRNA PRI 13-DEC-1997 DEFINITION Homo sapiens chloride channel protein (ClC-2) mRNA, complete cds. ACCESSION AF026004 NID g2570863 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2697) AUTHORS Rae,J.L. and Shepard,A.R. TITLE Direct Submission JOURNAL Submitted (22-SEP-1997) Physiology and Biophysics, Mayo Foundation, 200 1st Street SW, Rochester, MN 55905, USA FEATURES Location/Qualifiers source 1..2697 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="lens epithelium" gene 1..2697 /gene="ClC-2" CDS 1..2697 /gene="ClC-2" /codon_start=1 /product="chloride channel protein" /db_xref="PID:g2570864" /translation="MAAAAAEEGMEPRALQYEQTLMYGRYTQDLGAFAKEEAARIRLG GPEPWKGPPSSRAAPELLEYGRSRCARCRVCSVRCHKFLVSRVGEDWIFLVLLGLLMA LVSWVMDYAIAACLQAQQWMSRGLNTSILLQYLAWVTYPVVLITFSAGFTQILAPQAV GSGIPEMKTILRGVVLKEYLTLKTFIAKVIGLTCALGSGMPLGKEGPFVHIASMCAAL LSKFLSLFGGIYENESRNTEMLAAACAVGVGCCFAAPIGGVLFSIEVTSTFFAVRNYW RGFFAATFSAFIFRVLAVWNRDEETITALFKTRFRLDFPFDLQELPAFAVIGIASGFG GALFVYLNRKIVQVMRKQKTINRFLMRKRLLFPALVTLLISTLTFPPGFGQFMAGQLS QKETLVTLFDNRTWVRQGLVEELEPPSTSQAWNPPRANVFLTLVIFILMKFWMSALAT TIPVPCGAFMPVFVIGAAFGRLVGESMAAWFPDGIHTDSSTYRIVPGGYAVVGAAALA GAVTHTVSTAVIVFELTGQIAHILPVMIAVILANAVAQSLQPSLYDSIIRIKKLPYLP ELGWGRHQQYRVRVEDIMVRDVPHVALSCTFRDLRLALHRTKGRMLALVESPESMILL GSIERSQVVALLGAQLSPARRRQHMQERRATQTSPLSDQEGPPTPEASVCFQVNTEDS AFPAARGETHKPLKPALKRGPSVTRNLGESPTGSAESAGIALRSLFCGSPPPEAASEK LESCEKRKLKRVRISLASDADLEGEMSPEEILEWEEQQLDEPVNFSDCKIDPAPFQLV ERTSLHKTHTIFSLLGVDHAYVTSIGRLIGIVTLKELRKAIEGSVTAQGVKVRPPLAS FRDSATSSSDTETTEVHALWGPHSRHGLPREGSPSDSDDKCQ" BASE COUNT 514 a 828 c 777 g 578 t ORIGIN 1 atggcggccg cggcggcgga ggaagggatg gagccacggg cgctgcagta cgagcagacc 61 ctgatgtatg gccggtacac tcaggacctt ggggcctttg ccaaagagga agctgctcgg 121 attcgcctgg gagggcctga accctggaaa ggtccccctt cctctcgggc tgccccagag 181 ctcttggaat atggacggag ccgttgcgcc cgatgccgcg tctgttctgt ccgctgccac 241 aagttcctag tatccagggt tggtgaagat tggatcttcc tggtcctgct ggggcttctc 301 atggcattgg tcagctgggt catggactat gccattgctg cctgtctgca agcccagcag 361 tggatgtccc ggggcttgaa caccagcatc ttgctccagt acctggcctg ggtcacctac 421 cctgttgtcc tcatcacttt ctcagccgga ttcacacaga tcctggcccc tcaggctgtc 481 ggctctggca tccctgagat gaagaccatc ttgcggggag tggtgctgaa agaatacctc 541 acactcaaga cctttatagc taaggtcatt gggctgacct gcgccctagg cagcgggatg 601 ccgcttggca aagagggccc ttttgtgcat atcgcaagca tgtgtgctgc ccttctcagc 661 aagttcctct ccctctttgg gggtatctat gagaatgaat cccggaacac agagatgctg 721 gctgccgcct gtgccgtggg ggtgggctgc tgcttcgcgg cacctattgg aggcgtcctc 781 ttcagcatcg aggtcacctc caccttcttt gcagtgcgga actactggcg gggcttcttc 841 gctgccacct tcagtgcctt catcttccgg gtcttggcag tctggaaccg ggatgaagag 901 actattacag ccctcttcaa aacccgattc cggctcgact tcccctttga cctgcaggag 961 ctgccagcct ttgctgtcat tggtattgct agtggcttcg gtggagccct ctttgtctac 1021 ctgaaccgga agattgtcca ggtgatgcgg aagcagaaaa ccatcaatcg cttcctcatg 1081 aggaaacgcc tgctcttccc ggctctggtg accctgctca tctccacgct gaccttcccc 1141 cctggctttg gacagttcat ggctggacag ctctcacaga aagagacgct ggtcaccctg 1201 tttgacaatc ggacgtgggt ccgccagggc ctggtggagg agctagaacc acccagcacc 1261 tcacaggcct ggaacccacc acgtgccaac gtcttcctca ccctggtcat cttcattctc 1321 atgaagttct ggatgtctgc actggccacc accatcccag ttccctgtgg ggccttcatg 1381 cctgtctttg tcattggagc agcatttggg cgtctggtgg gtgaaagcat ggctgcctgg 1441 ttcccagatg gaattcatac ggacagcagc acctaccgga ttgtgcctgg gggctacgct 1501 gtggtcgggg cagctgcgct ggcaggagcg gtgacacaca cagtgtccac ggctgtgatc 1561 gtgttcgagc tcacaggcca gattgcccac atcctgcctg tcatgatcgc cgtcatcctg 1621 gccaacgctg tcgcccagag tctgcagccc tccctctatg acagcatcat ccgaatcaag 1681 aaactgccct acctgcctga gctcggctgg ggccgccacc agcagtaccg ggtgcgtgtg 1741 gaggacatca tggtgcggga tgttccccat gtggccctca gctgcacctt ccgggacctg 1801 cgtttggcac tgcacaggac caagggccga atgctggccc tagtggagtc ccctgagtcc 1861 atgattctgc tgggctccat cgagcgttca caggtggtgg cattgttggg ggcccagctg 1921 agcccagccc gccggcggca gcacatgcag gagcgcagag ccacccagac ctctccacta 1981 tctgatcagg agggtccccc tacccctgag gcttctgtct gcttccaggt gaacacagaa 2041 gactcagcct tcccagcagc ccggggggag acccacaagc ccctaaagcc tgcactcaag 2101 agggggccca gtgtcaccag gaacctcgga gagagtccca cagggagcgc agagtcggca 2161 ggcatcgccc tccggagcct cttctgtggc agtccacccc ctgaggctgc ttcggagaag 2221 ttggaatcct gtgagaagcg caagctgaag cgtgtccgaa tctccctggc aagtgacgcg 2281 gacctggaag gcgagatgag ccctgaagag attctggagt gggaggagca gcaactagat 2341 gaacctgtca acttcagtga ctgcaaaatt gatcctgctc ccttccagct ggtggagcgg 2401 acctctttgc acaagactca cactatcttc tcactgctgg gagtggacca tgcttatgtc 2461 accagtattg gcagactcat tggaatcgtt actctaaagg agctccggaa ggccatcgag 2521 ggctctgtca cagcacaggg tgtgaaagtc cggccgcccc tcgccagctt ccgagacagt 2581 gccaccagca gcagtgacac ggagaccact gaggtgcatg cactctgggg gccccactcc 2641 cgtcatggcc tcccccggga gggcagccct tccgacagcg acgacaaatg ccaatga // LOCUS AF026070 1669 bp mRNA PRI 28-JAN-1998 DEFINITION Homo sapiens death receptor 3 beta (DR3) mRNA, complete cds. ACCESSION AF026070 NID g2570830 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1669) AUTHORS Warzocha,K., Ribeiro,P., Charlot,C., Renard,N., Coiffier,B. and Salles,G. TITLE A new death receptor 3 isoform: expression in human lymphoid cell lines and non-Hodgkin's lymphomas JOURNAL Biochem. Biophys. Res. Commun. 242 (2), 376-379 (1998) MEDLINE 98113360 REFERENCE 2 (bases 1 to 1669) AUTHORS Warzocha,K., Ribeiro,P., Renard,N., Charlot,C., Coiffier,B. and Salles,G. TITLE Direct Submission JOURNAL Submitted (19-SEP-1997) Hematology, CTRE Hospitalier Lyon-Sud, Chemin du Grand Revoyet, Pierre Benite 69495, France FEATURES Location/Qualifiers source 1..1669 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Mieliki" /note="identified in human pre-B cell line Mieliki and in patients with non-Hodgkin's lymphoma" gene 1..1669 /note="Apo-3; TRAMP; LARD" /gene="DR3" CDS 69..1349 /gene="DR3" /note="DR3 beta; TNF receptor family member; alternatively spliced product; contains 28 amino-acid extension to the extracellular domain of the ordinary DR3 molecule" /codon_start=1 /product="death receptor 3 beta" /db_xref="PID:g2570831" /translation="MEQRPRGCAAVAAALLLVLLGARAQGGTRSPRCDCAGDFHKKIG LFCCRGCPAGHYLKAPCTEPCGNSTCLVCPQDTFLAWENHHNSECARCQACDEQASQV ALENCSAVADTRCGCKPGWFVECQVSQCVSSSPFYCQPCLDCGALHRHTRLLCSRRDT DCGTCLPGFYEHGDGCVSCPTPPPSLAGAPWGAVQSAVPLSVAGGRVGVFWVQVLLAG LVVPLLLGATLTYTYRHCWPHKPLVTADEAGMEALTPPPATHLSPLDSAHTLLAPPDS SEKICTVQLVGNSWTPGYPETQEALCPQVTWSWDQLPSRALGPARAPTLSPESPAGSP AMMLQPGPQLYDVMDAVPARRWKEFVRTLGLREAEIEAVEVEIGLFRDQQYEMLKHWR QQQPAGLGAVYAALERMGLDGCVEDLRSRLQRGP" BASE COUNT 329 a 532 c 513 g 295 t ORIGIN 1 ctgaaggcgg aaccacgacg ggcagagagc acggagccgg gaagcccctg ggcgcccgtc 61 ggagggctat ggagcagcgg ccgcggggct gcgcggcggt ggcggcggcg ctcctcctgg 121 tgctgctggg ggcccgggcc cagggcggca ctcgtagccc caggtgtgac tgtgccggtg 181 acttccacaa gaagattggt ctgttttgtt gcagaggctg cccagcgggg cactacctga 241 aggccccttg cacggagccc tgcggcaact ccacctgcct tgtgtgtccc caagacacct 301 tcttggcctg ggagaaccac cataattctg aatgtgcccg ctgccaggcc tgtgatgagc 361 aggcctccca ggtggcgctg gagaactgtt cagcagtggc cgacacccgc tgtggctgta 421 agccaggctg gtttgtggag tgccaggtca gccaatgtgt cagcagttca cccttctact 481 gccaaccatg cctagactgc ggggccctgc accgccacac acggctactc tgttcccgca 541 gagatactga ctgtgggacc tgcctgcctg gcttctatga acatggcgat ggctgcgtgt 601 cctgccccac gccacccccg tcccttgcag gagcaccctg gggagctgtc cagagcgctg 661 tgccgctgtc tgtggctgga ggcagagtag gtgtgttctg ggtccaggtg ctcctggctg 721 gccttgtggt ccccctcctg cttggggcca ccctgaccta cacataccgc cactgctggc 781 ctcacaagcc cctggttact gcagatgaag ctgggatgga ggctctgacc ccaccaccgg 841 ccacccatct gtcacccttg gacagcgccc acacccttct agcacctcct gacagcagtg 901 agaagatctg caccgtccag ttggtgggta acagctggac ccctggctac cccgagaccc 961 aggaggcgct ctgcccgcag gtgacatggt cctgggacca gttgcccagc agagctcttg 1021 gccccgctcg tgcgcccaca ctctcgccag agtccccagc cggctcgcca gccatgatgc 1081 tgcagccggg cccgcagctc tacgacgtga tggacgcggt cccagcgcgg cgctggaagg 1141 agttcgtgcg cacgctgggg ctgcgcgagg cagagatcga agccgtggag gtggagatcg 1201 gtctcttccg agaccagcag tacgagatgc tcaagcactg gcgccagcag cagcccgcgg 1261 gcctcggagc cgtttacgcg gccctggagc gcatggggct ggacggctgc gtggaagact 1321 tgcgcagccg cctgcagcgt ggcccgtgac acgcagccca cttgccacct aggcgctctg 1381 gtggcccttg cagaagccct aagtacggtt acttatgcgt gtagacattt tatgtcactt 1441 attaagccgc tggcacggcc ctgcgtaggc acaccagccg gccccacccc tgctcgcccc 1501 tatcgctcca gccaaggcga agaagcacga acgaatgtcg agagggggtg aagacatttc 1561 tcaacttctc ggccggagtt tggctgagat cgcggtatta aatctgtgaa agaaataaag 1621 aaaaaaacaa aacaaaacaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaa // LOCUS AF026086 4343 bp mRNA PRI 02-DEC-1997 DEFINITION Homo sapiens peroxisome biogenesis disorder protein 1 (PEX1) mRNA, complete cds. ACCESSION AF026086 NID g2655140 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4343) AUTHORS Reuber,B.E., Collins,C., Germain-Lee,E., Morrell,J.C., Ameritunga,R., Moser,H.W., Valle,D. and Gould,S.J. TITLE Mutations in PEX1 are the most common cause of Zellweger syndrome, neonatal adrenoleukodystrophy and infantile Refsum disease JOURNAL Nature Genet. (1997) In press REFERENCE 2 (bases 1 to 4343) AUTHORS Gould,S.J. TITLE Direct Submission JOURNAL Submitted (18-SEP-1997) Biological Chemistry, The Johns Hopkins University School of Medicine, 725 North Wolfe Street, Baltimore, MD 21205, USA FEATURES Location/Qualifiers source 1..4343 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" /map="7q21-7q22" gene 1..4343 /note="the peroxisome biogenesis disorder group 1 gene" /gene="PEX1" CDS 61..3912 /gene="PEX1" /function="required for stability of PEX5 and protein import into the peroxisome matrix" /note="PEX1 is a member of the AAA subfamily of ATPases" /codon_start=1 /product="peroxisome biogenesis disorder protein 1" /db_xref="PID:g2655141" /translation="MWGSDRLAGAGGGGAAVTVAFTNARDCFLHLPRRLVAQLHLLQN QAIEVVWSHQPAFLSWVEGRHFSDQGENVAEINRQVGQKLGLSNGGQVFLKPCSHVVS CQQVEVEPLSADDWEILELHAVSLEQHLLDQIRIVFPKAIFPVWVDQQTYIFIQIVAL IPAASYGRLETDTKLLIQPKTRRAKENTFSKADAEYKKLHSYGRDQKGMMKELQTKQL QSNTVGITESNENESEIPVDSSSVASLWTMIGSIFSFQSEKKQETSWGLTEINAFKNM QSKVVPLDNIFRVCKSQPPSIYNASATSVFHKHCAIHVFPWDQEYFDVEPSFTVTYGK LVKLLSPKQQQSKTKQNVLSPEKEKQMSEPLDQKKIRSDHNEEDEKACVLQVVWNGLE ELNNAIKYTKNVEVLHLGKVWIPDDLRKRLNIEMHAVVRITPVEVTPKIPRSLKLQPR ENLPKDISEEDIKTVFYSWLQQSTTTMLPLVISEEEFIKLETKDGLKEFSLSIVHSWE KEKDKNIFLLSPNLLQKTTIQVLLDPMVKEENSEEIDFILPFLKLSSLGGVNSLGVSS LEHITHSLLGRPLSRQLMSLVAGLRNGALLLTGGKGSGKSTLAKAICKEAFDKLDAHV ERVDCKALRGKRLENIQKTLEVAFSEAVWMQPSVVLLDDLDLIAGLPAVPEHEHSPDA VQSQRLAHALNDMIKEFISMGSLVALIATSQSQQSLHPLLVSAQGVHIFQCVQHIQPP NQEQRCEILCNVIKNKLDCDINKFTDLDLQHVAKETGGFVARDFTVLVDRAIHSRLSR QSISTREKLVLTTLDFQKALRGFLPASLRSVNLHKPRDLGWDKIGGLHEVRQILMDTI QLPAKYPELFANLPIRQRTGILLYGPPGTGKTLLAGVIARESRMNFISVKGPELLSKY IGASEQAVRDIFIRAQAAKPCILFFDEFESIAPRRGHDNTGVTDRVVNQLLTQLDGVE GLQGVYVLAATSRPDLIDPALLRPGRLDKCVYCPPPDQVSRLEILNVLSDSLPLADDV DLQHVASVTDSFTGADLKALLYNAQLEALHGMLLSSGLQDGSSSSDSDLSLSSMVFLN HSSGSDDSAGDGECGLDQSLVSLEMSEILPDESKFNMYRLYFGSSYESELGNGTSSDL SSQCLSAPSSMTQDLPGVPGKDQLFSQPPVLRTASQEGCQELTQEQRDQLRADISIIK GRYRSQSGEDESMNQPGPIKTRLAISQSHLMTALGHTRPSISEDDWKNFAELYESFQN PKRRKNQSGTMFRPGQKVTLA" BASE COUNT 1330 a 851 c 947 g 1215 t ORIGIN 1 ccgggtcccg ggtcctttgc ggcgctaggg tgggcgaacc cagagggacg ctccgggacg 61 atgtggggca gcgatcgcct ggcgggtgct gggggaggcg gggcggcagt gactgtggcc 121 ttcaccaacg ctcgcgactg cttcctccac ctgccgcggc gtctcgtggc ccagctgcat 181 ctgctgcaga atcaagctat agaagtggtc tggagtcacc agcctgcatt cttgagctgg 241 gtggaaggca ggcattttag tgatcaaggt gaaaatgtgg ctgaaattaa cagacaagtt 301 ggtcaaaaac ttggactctc aaatggggga caggtatttc tcaagccatg ttcccatgtg 361 gtatcttgtc aacaagttga ggtggaaccc ctctcagcag atgattggga gatactggag 421 ctgcatgctg tttcccttga acaacatctt ctagatcaaa ttcgaatagt ttttccaaaa 481 gccatttttc ctgtttgggt tgatcaacaa acgtacatat ttatccaaat tgttgcacta 541 ataccagctg cctcttatgg aaggctggaa actgacacca aactccttat tcagccaaag 601 acacgccgag ccaaagagaa tacattttca aaagctgatg ctgaatataa aaaacttcat 661 agttatggaa gagaccagaa aggaatgatg aaagaacttc aaaccaagca acttcagtca 721 aatactgtgg gaatcactga atctaatgaa aacgagtcag agattccagt tgactcatca 781 tcagtagcaa gtttatggac tatgatagga agcatttttt cctttcaatc tgagaagaaa 841 caagagacat cttggggttt aactgaaatc aatgcattca aaaatatgca gtcaaaggtt 901 gttcctctag acaatatttt cagagtatgc aaatctcaac ctcctagtat atataacgcg 961 tcagcaacct ctgtttttca taaacactgt gccattcatg tatttccatg ggaccaggaa 1021 tattttgatg tagagcccag ctttactgtg acatatggaa agctagttaa gctactttct 1081 ccaaagcaac agcaaagtaa aacaaaacaa aatgtgttat cacctgaaaa agagaagcag 1141 atgtcagagc cactagatca aaaaaaaatt aggtcagatc ataatgaaga agatgagaag 1201 gcctgtgtgc tacaagtagt ctggaatgga cttgaagaat tgaacaatgc catcaaatat 1261 accaaaaatg tagaagttct ccatcttggg aaagtctgga ttccagatga cctgaggaag 1321 agactaaata tagaaatgca tgccgtagtc aggataactc cagtggaagt tacccctaaa 1381 attccaagat ctctaaagtt acaacctaga gagaatttac ctaaagacat aagtgaagaa 1441 gacataaaaa ctgtatttta ttcatggcta cagcagtcta ctaccaccat gcttcctttg 1501 gtaatatcag aggaagaatt tattaagctg gaaactaaag atggactgaa ggaattttct 1561 ctgagtatag ttcattcttg ggaaaaagaa aaagataaaa atatttttct gttgagtccc 1621 aatttgctgc agaagactac aatacaagtc cttctagatc ctatggtaaa agaagaaaac 1681 agtgaggaaa ttgactttat tcttcctttt ttaaagctga gctctttggg aggagtgaat 1741 tccttaggcg tatcctcctt ggagcacatc actcacagcc tcctgggacg ccctttgtct 1801 cggcagctga tgtctcttgt tgcaggactt aggaatggag ctcttttact cacaggagga 1861 aagggaagtg gaaaatcaac tttagccaaa gcaatctgta aagaagcatt tgacaaactg 1921 gatgcccatg tggagagagt tgactgtaaa gctttacgag gaaaaaggct tgaaaacata 1981 caaaaaaccc tagaggtggc tttctcagag gcagtgtgga tgcagccatc tgttgtcctg 2041 ctggatgacc ttgacctcat tgctggactg cctgctgtcc cggaacatga gcacagtcct 2101 gatgcggtgc agagccagcg gcttgctcat gctttgaatg atatgataaa agagtttatc 2161 tccatgggaa gtttggttgc actgattgcc acaagtcagt ctcagcaatc tctacatcct 2221 ttacttgttt ctgctcaagg agttcacata tttcagtgcg tccaacacat tcagcctcct 2281 aatcaggaac aaagatgtga aattctgtgt aatgtaataa aaaataaatt ggactgtgat 2341 ataaacaagt tcaccgatct tgacctgcag catgtagcta aagaaactgg agggtttgtg 2401 gctagagatt ttacagtact tgtggatcga gccatacatt ctcgactctc tcgtcagagt 2461 atatccacca gagaaaaatt agttttaaca acattggact tccaaaaggc tctccgcgga 2521 tttcttcctg cgtctttgcg aagtgtcaac ctgcataaac ctagagacct gggttgggac 2581 aagattggtg ggttacatga agttaggcag atactcatgg atactatcca gttacctgcc 2641 aagtatccag aattatttgc aaacttgccc atacgacaaa gaacaggaat actgttgtat 2701 ggtccgcctg gaacaggaaa aaccttacta gctggggtaa ttgcacgaga gagtagaatg 2761 aattttataa gtgtcaaggg gccagagtta ctcagcaaat acattggagc aagtgaacaa 2821 gctgttcggg atatttttat tagagcacag gctgcaaagc cctgcattct tttctttgat 2881 gaatttgaat ccattgctcc tcggcggggt catgataata caggagttac agaccgagta 2941 gttaaccagt tgctgactca gttggatgga gtagaaggct tacagggtgt ttatgtattg 3001 gctgctacta gtcgccctga cttgattgac cctgccctgc ttaggcctgg tcgactagat 3061 aaatgtgtat actgtcctcc tcctgatcag gtgtcacgtc ttgaaatttt aaatgtcctc 3121 agtgactctc tacctctggc agatgatgtt gaccttcagc atgtagcatc agtaactgac 3181 tcctttactg gagctgatct gaaagcttta ctttacaatg cccaattgga ggccttacat 3241 ggaatgctgc tctcgagtgg actccaggat ggaagttcca gctctgatag tgacctaagt 3301 ctgtcttcaa tggtctttct taaccatagc agtggctctg acgattcagc tggagatgga 3361 gaatgtggct tagatcagtc ccttgtttct ttagagatgt ccgagatcct tccagatgaa 3421 tcaaaattca atatgtaccg gctctacttt ggaagctctt atgaatcaga acttggaaat 3481 ggaacctctt ctgatttgag ctcacaatgt ctctctgcac caagctccat gactcaggat 3541 ttgcctggag ttcctgggaa agaccagttg ttttcacagc ctccagtgtt aaggacagct 3601 tcacaagagg gttgccaaga acttacacaa gaacaaagag atcaactgag ggcagatatc 3661 agtattatca aaggcagata ccggagccaa agtggagagg acgaatccat gaaccaacca 3721 ggaccaatca aaaccagact ggctattagt cagtcacatt taatgactgc acttggtcac 3781 acaagaccat ccattagtga agatgactgg aagaattttg ctgagctata tgaaagcttt 3841 caaaatccaa agaggagaaa aaatcaaagt ggaacaatgt ttcgacctgg acagaaagta 3901 actttagcat aaaatatact tctttttgat ttggttctgt taagtttttt gatggctttt 3961 ccatatgttg taacaggaaa aaaatggtgt ctatgaattt cttcttaatt taacaaattt 4021 ggttaattta taaaatcaca gattggtaaa tgctataatt atgtaatgat caggattgag 4081 attaatactg tagtataaat tgggacatta taacagattc catattttat ttcctaaaat 4141 ctaaattcag tctttaatga aataatatta gccaaatggt ggaactaatt tatttctttt 4201 gaggaaaaga taataaagaa tgtaattaaa tttaaatttc ttggaattcc cagttgtata 4261 ttcatcacct ttgtagcatt tgacaaattt tatgcttagc agcttcttca ctgttttgaa 4321 ataaaatatc ctattaccta ctg // LOCUS AF026132 3246 bp mRNA PRI 28-JAN-1998 DEFINITION Homo sapiens retinal rod Na+/Ca+, K+ exchanger (NCKX) mRNA, complete cds. ACCESSION AF026132 NID g2811134 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3246) AUTHORS Tucker,J.E., Winkfein,R.J., Cooper,C.B. and Schnetkamp,P.P.M. TITLE cDNA cloning of the human retinal rod Na-Ca+K exchanger: comparison with a revised bovine sequence JOURNAL Invest. Ophthamol. Vis. Sci. 39, 435-440 (1998) REFERENCE 2 (bases 1 to 3246) AUTHORS Tucker,J.E., Winkfein,R.J. and Schnetkamp,P.P.M. TITLE Direct Submission JOURNAL Submitted (19-SEP-1997) Medical Biochemistry, University of Calgary, 3330 Hospital Drive, N.W., Calgary, AB T2N 4N1, Canada FEATURES Location/Qualifiers source 1..3246 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="15" /map="15q22" gene 1..3246 /gene="NCKX" CDS 1..3246 /gene="NCKX" /codon_start=1 /product="retinal rod Na+/Ca+, K+ exchanger" /db_xref="PID:g2811135" /translation="MGKLIRMGPQERWLLRTKRLHWSRLLFLLGMLIIGSTYQHLRRP RGLSSLWAAVSSHQPIKLASRDLSSEEMMMMSSSPSKPSSEMGGKMLVPQASVGSDEA TLSMTVENIPSMPKRTAKMIPTTTKNNYSPTAAGTERRKEDTPTSSRTLTYYTSTSSR QIVKKYTPTPRGEMKSYSPTQVREKVKYTPSPRGRRVGTYVPSTFMTMETSHAITPRT TVKDSDITATYKILETNSLKRIMEETTPTTLKGMFDSTPTFLTHEVEANVLTSPRSVM EKNNLFPPRRVESNSSAHPWGLVGKSNPKTPQGTVLLHTPATSEGQVTISTMTGSSPA ETKAFTAAWSLRNPSPRTSVSAIKTAPAIVWRLAKKPSTAPSTSTTPTVRAKLTMQVH HCVVVKPTPAMLTTPSPSLTTALLPEELSPSPSVLPPSLPDLHPKGEYPPDLFSVEER RQGWVVLHVFGMMYVFVALAIVCDEYFVPALGVITDKLQISEDVAGATFMAAGGSAPE LFTSLIGIFISHSNVGIGTIVGSAVFNILFVIGTCSLFSREILNLTWWPLFRDVSFYI LDLIMLILFFLDSLIAWWESLLLLLAYAFYVFTMKWNKHIEVWVKEQLSRRPVAKVMA LEDLSKLPSLLTRGSSSTSLHNSTIRSTIYQLMLHSLDPLREVRLAKEKEEESLNQGA RAQPQAKAESKPEEEEPAKLPAVTVTPAPVPDIKGDQKENPGGQEDVAEAESTGEMPG EEGETAGEGETEEKSGGETQPEGEGETETQGKGEECEDENEAEGKGDNEGEDEGEIHA EDGEMKGNEGETESQELSAENHGEAKNDEKGVEDGGGSDGGDSEEEEEEEEEQEEEEE EEEQEEEEEEEEEEEEKGNEEPLSLDWPETRQKQAIYLFLLPIVFPLWLTVPDVRRQE SRKFFVFTFLGSIMWIAMFSYLMVWWAHQVGETIGISEEIMGLTILAAGTSIPDLITS VIVARKGLGDMAVSSSVGSNIFDITVGLPVPWLLFSLINGLQPVPVSSNGLFCAIVLL FLMLLFVISSIASCKWRMNKILGFTMFLLYFVFLIISVMLEDRIISCPVSV" BASE COUNT 899 a 818 c 856 g 673 t ORIGIN 1 atggggaaat tgatcaggat ggggccgcaa gagaggtggt tactccggac aaagcggctt 61 cattggagtc gcctcctctt cttactggga atgttgatca tcggttctac ttatcagcac 121 cttaggagac cccggggcct ttcctcattg tgggcagcag tctcttctca tcagcctata 181 aaactggcca gtcgggacct ctccagtgaa gagatgatga tgatgagcag cagcccttca 241 aaacctagct ccgaaatggg gggtaagatg ctggtacccc aagcctcagt gggcagtgat 301 gaagcaacac tgagcatgac agtggagaat atccccagta tgcctaaaag aacagccaag 361 atgatcccaa caacaaccaa gaataattac agcccaacag cagcaggtac agaaagaagg 421 aaggaagaca ccccaacatc cagtagaaca ctgacttact acacctcaac ttcaagcaga 481 caaatagtaa aaaagtatac cccaacaccc aggggagaaa tgaagagcta cagcccaact 541 caagtgaggg aaaaggtgaa gtatactcct tccccacgtg gtagaagagt aggcacttac 601 gtgccgtcca cattcatgac aatggaaaca agccatgcga tcacccccag gacaacagtg 661 aaagacagtg acattacagc aacctataaa atactcgaaa ccaactctct taagagaata 721 atggaggaaa ccaccccaac cactctcaag ggaatgtttg atagcacccc aacttttctg 781 acacatgagg tagaagcaaa cgtcttgact tctccaagga gcgtcatgga aaaaaacaac 841 ctgtttcccc ccagaagagt ggaaagtaac agctcagccc atccctgggg gttagtggga 901 aagagcaacc cgaagactcc ccagggaaca gtcctgttgc ataccccagc cacctctgag 961 gggcaggtga caataagcac catgacaggc agcagcccag cagaaaccaa agccttcact 1021 gctgcctgga gtcttaggaa tccttcaccc aggaccagtg tatcagccat caaaacagcc 1081 ccagccatag tctggaggct ggcaaagaaa ccttccacag cacccagcac ctcaacaacc 1141 cctacggtca gggcaaagct gaccatgcag gtccatcact gtgtggttgt gaagccaacc 1201 ccagccatgc tcaccactcc ctccccaagc ctcacaacag ccctgctccc agaggagctc 1261 agtcctagtc cctcagtgct gcctcccagc ttgccagacc tccaccccaa gggagagtac 1321 cccccagatc tgttcagtgt ggaggagcgg cggcagggct gggtggtcct gcacgttttt 1381 ggcatgatgt atgtgtttgt ggccttggcc attgtttgcg acgagtactt cgttccagcc 1441 ctgggtgtca tcacagacaa gctgcagatc tccgaggatg tggcaggcgc cacattcatg 1501 gctgctggag gctctgctcc tgagctcttc acctccctca tcggtatctt catttcccac 1561 agcaacgtgg gcattggtac cattgtgggc tctgctgtgt tcaacattct ctttgtcatt 1621 ggcacttgtt ccctcttctc ccgagagatc ctcaacctca cctggtggcc cttattccgt 1681 gatgtctcct tctacatcct tgacctgata atgctcatcc tcttcttcct ggacagcctc 1741 attgcctggt gggagagcct gctgctgctg ctggcctatg ccttctatgt gttcaccatg 1801 aagtggaaca agcatatcga ggtctgggtg aaggagcagc tcagcaggag gccagtggcc 1861 aaggtcatgg ccttagaaga cctcagcaag ctcccgtcct tgctgacccg agggagcagc 1921 tcgacctctc tgcacaacag caccatccgc agcaccatct accagctcat gctccacagc 1981 ctggaccccc tgagggaagt tcgccttgcc aaggagaagg aggaggagag cttgaatcaa 2041 ggggccagag cccaacccca ggccaaagca gaaagcaaac cagaagagga ggagccagcc 2101 aagctccctg cggtcacggt cacaccagcc cctgttccag acatcaaggg agatcagaag 2161 gagaatccag gcggtcagga agatgtggct gaggccgaga gcacaggtga aatgccaggc 2221 gaagagggcg aaactgctgg tgaaggtgaa actgaagaga aaagtggagg tgaaactcaa 2281 ccagaaggtg aaggtgaaac tgaaacacaa ggaaaaggag aagaatgtga agatgaaaat 2341 gaagcagaag gaaaaggaga caatgaaggt gaagatgagg gtgaaatcca cgcagaagat 2401 ggtgaaatga aaggtaatga aggtgaaact gaaagccagg aactcagtgc tgaaaatcac 2461 ggtgaagcca aaaatgatga gaaaggtgta gaagatggag ggggaagtga tggaggggat 2521 agcgaagagg aggaagagga ggaggaagag caggaggaag aggaggagga ggaagagcag 2581 gaggaagagg aggaggagga ggaggaagag gaggagaagg gaaatgaaga gcctctgtcc 2641 ctggactggc ctgaaaccag gcagaagcag gccatttacc tcttccttct gcccatcgtg 2701 ttcccactgt ggctgacagt ccccgacgtc cgaaggcagg agtctaggaa gttttttgtt 2761 ttcaccttcc tgggatctat catgtggata gccatgttct catacctcat ggtgtggtgg 2821 gctcaccagg ttggtgaaac aatagggatt tctgaagaga tcatgggcct gacaatcctt 2881 gcagcaggca catcaattcc tgacctcatc accagtgtga ttgtcgctcg aaaaggcctg 2941 ggagacatgg ctgtgtcaag ctctgtgggc agtaacatat ttgatatcac tgtgggcttg 3001 cctgttcctt ggttgctttt ctctcttatc aatggattac agccagttcc agtcagcagc 3061 aatggcttgt tttgtgcaat tgttttgctt tttctcatgc ttctgtttgt gatctcttca 3121 attgcgtcat gtaaatggag aatgaacaag atcctgggct tcacaatgtt cctcctttac 3181 tttgtattcc tgataatcag tgtgatgtta gaagatcgaa tcatatcctg tcctgtatct 3241 gtctga // LOCUS AF026199 2478 bp mRNA PRI 11-NOV-1997 DEFINITION Homo sapiens transcription regulator protein (BACH1) mRNA, complete cds. ACCESSION AF026199 NID g2565399 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2478) AUTHORS Blouin,J.-L., Duriaux Sail,G., Guipponi,M., Rossier,C., Pappasavas,M.-P. and Antonarakis,S.E. TITLE Isolation of the human BACH1 transcription regulator gene, which maps to chromosome 21q22.1 JOURNAL Hum. Genet. (1997) In press REFERENCE 2 (bases 1 to 2478) AUTHORS Blouin,J.-L. and Antonarakis,S.E. TITLE Direct Submission JOURNAL Submitted (22-SEP-1997) Medical Genetics, University of Geneva Medical School- Centre Medical Universitaire, Rue Michel Servet 1, Geneva CH-1211, Switzerland FEATURES Location/Qualifiers source 1..2478 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="21" /map="21q22.1" gene 1..2478 /gene="BACH1" CDS 244..2454 /gene="BACH1" /codon_start=1 /product="transcription regulator protein" /db_xref="PID:g2565400" /translation="MSLSENSVFAYESSVHSTNVLLSLNDQRKKDVLCDVTIFVEGQR FRAHRSVLAACSSYFHSRIVGQADGELNITLPEEVTVKGFEPLIQFAYTAKLILSKEN VDEVCKCVEFLSVHNIEESCFQFLKFKFLDSTADQQECPRKKCFSSHCQKTDLKLTLL DQRDLETDEVGEFLENKNVQTPQCKLRRYQGNAKASPPLQDSASQTYESMCLEKDAAL ALPSLCPKYRKFQKAFGTDRVRTGESSVKDIHASVQPNERSENECLGGVPECRDLQVM LKCDESKLAMEPEETKKDPASQCPTEKSEVTPFPHNSSIDPHGLYSLSLLHTYDQYGD LNFAGMQNTTVLTEKPLSGTDVQEKTFGESQDLPLKSDLGTREDSSVASSDRSSVERE VAEHLAKGFWSDICSTDTPCQMQLSPAVAKDGSEQISQKRSECPWLGIRISESPEPGQ RTFTTLSSVNCPFISTLSTEGCSSNLEIGNDDYVSEPQQEPCPYACVISLGDDSETDT EGDSESCSAREQECEVKLPFNAQRIISLSRNDFQSLLKMHKLTPEQLDCIHDIRRRSK NRIAAQRCRKRKLDCIQNLESEIEKLQSEKESLLKERDHILSTLGETKQNLTGLCQKV CKEAALSQEQIQILAKYSAADCPLSFLISEKDKSTPDGELALPSIFSLSDRPPAVLPP CARGNSEPGYARGQESQQMSTATSEQAGPAEQCRQSGGISDFCQQMTDKCTTDE" misc_feature 291..609 /gene="BACH1" /note="encodes broad complex-tramtrack-bric-a-brac domain" misc_feature 1927..2115 /gene="BACH1" /note="encodes cap'n'collar type basic leucine zipper domain" BASE COUNT 739 a 515 c 593 g 631 t ORIGIN 1 gtctactcag cccggtggct gtcgcgcgtg gaatcgcgta agaaaagccg agtttgtggc 61 tggggagaga aggccaccgt gctgagctgg atttagcgaa gactggtttt ggggaccgga 121 gagcccagga ctccctttgt tggagttttg cccacgcgtt gtaattaagc ctcgcacaat 181 atggttgatg ataattagaa gcatgctttc cactgaactt cccgacaaca tttgttatgc 241 agaatgtctc tgagtgagaa ctcggttttt gcctatgaat cttctgtgca tagcaccaat 301 gttttactca gccttaatga ccagcggaag aaagatgtgc tgtgcgatgt caccatcttt 361 gtggaaggac agcggttccg cgctcaccgg tccgtgctgg cggcatgcag cagttacttc 421 cactcaagaa tcgtaggcca ggctgatgga gagctgaaca ttactcttcc agaagaggtg 481 acagttaaag gatttgaacc tttaattcag tttgcctaca ctgctaaact gattttaagt 541 aaagagaatg tggatgaagt gtgcaaatgt gtggagtttt taagtgtaca taatattgag 601 gaatcctgct ttcagtttct gaaatttaag tttttggact ccactgcaga ccagcaagaa 661 tgcccaagaa aaaaatgctt ttcatcacac tgtcagaaaa cagaccttaa acttacactt 721 ttggaccaga gggatctaga aactgatgaa gtgggggaat ttctggaaaa taaaaatgtt 781 cagactcctc agtgtaaact ccgcaggtat caaggaaatg caaaagcctc acctcctcta 841 caagacagtg ccagtcagac atatgagtcc atgtgcttag agaaggatgc tgctctggcc 901 ttgccttctt tatgccccaa atacagaaaa ttccaaaaag catttggaac tgacagagtc 961 cgtactgggg aatctagtgt caaagacatt catgcttctg ttcagccgaa tgaaaggtct 1021 gaaaatgaat gcctgggagg agtcccggag tgtagagatt tgcaggtgat gttaaaatgt 1081 gacgaaagta aattagcaat ggaacctgaa gaaacgaaga aagatcctgc ttctcagtgc 1141 ccaactgaaa aatcagaagt gactcctttc ccccacaatt cttccataga ccctcatgga 1201 ctttattctt tgtctctttt acacacatat gaccaatatg gtgacttgaa ttttgctggt 1261 atgcaaaaca caacagtgtt aacagaaaag cctttgtcag gtacagacgt ccaagaaaaa 1321 acatttggtg aaagtcagga tttacctttg aaatccgact tgggcaccag ggaagatagt 1381 agtgttgcat ctagtgatag gagtagtgtg gagcgagaag tggcagaaca cttagcaaaa 1441 gggttctgga gtgacatttg cagcacggac actccttgcc aaatgcagtt atcacctgct 1501 gtggccaaag atggctcaga acagatctca cagaaacggt ctgagtgtcc gtggttaggt 1561 atcaggatta gtgagagccc ggaaccaggt caaaggactt tcacaacatt aagttctgtc 1621 aactgccctt ttataagtac tctgagtact gaaggctgtt caagcaattt ggaaattgga 1681 aacgatgatt atgtttcaga accccagcaa gaaccttgcc catatgcttg tgtcattagc 1741 ttgggagacg actctgagac ggacaccgaa ggagacagtg aatcctgttc agccagagaa 1801 caagaatgtg aggtaaaact gccattcaat gcacaacgga taatttcact gtctcgaaat 1861 gattttcagt ccttgttgaa aatgcacaag cttactccag aacagctgga ttgtatccat 1921 gatattcgaa gaagaagtaa aaacagaatt gctgcacagc gctgtcgcaa gagaaaactt 1981 gactgtatac agaatcttga atcagaaatt gagaagctgc aaagtgaaaa ggagagcttg 2041 ttgaaggaaa gagatcacat tttgtcaact ctgggtgaga caaagcagaa cctaactgga 2101 ctttgccaga aagtttgtaa agaagcagct ctgagtcaag aacaaataca gatactcgcc 2161 aagtactcag ctgcagattg cccactttca tttttaattt ctgaaaaaga taaaagtact 2221 cctgatggtg aactggcgtt accatcaatt ttcagtttat ctgaccggcc tccagcagtg 2281 ctgcctccct gtgccagagg aaacagtgag cctggctacg cgcgagggca ggagtcccag 2341 cagatgtcca cagccacctc tgagcaagct gggcctgcgg aacagtgtcg tcagagtggt 2401 gggatctcag atttctgtca gcagatgact gataaatgta ctactgatga gtaaacttgc 2461 attcacttcc ttcaaacc // LOCUS AF026219 3766 bp mRNA PRI 24-OCT-1997 DEFINITION Homo sapiens HP protein (HP) mRNA, complete cds. ACCESSION AF026219 NID g2559001 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3766) AUTHORS Wei,M.-H., Pack,S., Ivanov,S. and Lerman,M.I. TITLE Cloning and Molecular Charaterization of the Human Ortholog of the Rat Dual Regulator p122RhoGAP JOURNAL Unpublished REFERENCE 2 (bases 1 to 3766) AUTHORS Wei,M.-H., Pack,S., Ivanov,S. and Lerman,M.I. TITLE Direct Submission JOURNAL Submitted (22-SEP-1997) Laboratory of Immunobiology, National Cancer Institute, NCI-Frederick Cancer Research and Development Center, Bldg 560, Rm. 12-71, P.O.Box B, Frederick, MD 21702, USA FEATURES Location/Qualifiers source 1..3766 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="8" /map="8p21-p22" /tissue_type="lung" gene 1..3766 /gene="HP" CDS 298..3549 /gene="HP" /note="p122RhoGAP ortholog" /codon_start=1 /product="HP protein" /db_xref="PID:g2559002" /translation="MILTQIEAKEACDWLRATGFPQYAQLYEDFLFPIDISLVKREHD FLDRDAIEALCRRLNTLNKCAVMKLEISPHRKRSDDSDEDEPCAISGKWTFQRDSKRW SRLEEFDVFSPKQDLVPGSPDDSHPKDGPSPGGTLMDLSERQEVSSVRSLSSTGSLPS HAPPSEDAATPRTNSVISVCSSSNLAGNDDSFGSLPSPKELSSFSFSMKGHEKTAKSK TRSLLKRMESLKLKSSHHSKHKAPSKLGLIISGPILQEGMDEEKLKQLNCVEISALNG NRINVPMVRKRSVSNSTQTSSSSSQSETSSAVSTPSPVTRTRSLSACNKRVGMYLEGF DPFNQSTFNNVMEQNFKNRESYPEDTVFYIPEDHKPGTFPKALTNGSFSPSGNNGSVN WRTGSFHGPGHISLRRENSSDSPKELKRRNSSSSMSSRLSIYDNVPGSILYSSSGDLA DLENEDIFPELDDILYHVKGMQRIVNQWSEKFSDEGDSDSALDSVSPCPSSPKQIHLD VDNDRTTPSDLDSTGNSLNEPEEPSEIPERRDSGVGASLTRSNRHRLRWHSFQSSHRP SLNSVSLQINCQSVAQMNLLQKYSLLKLTALLEKYTPSNKHGFSWAVPKFMKRIKVPD YKDRSVFGVPLTVNVQRTGQPLPQSIQQAMRYLRNHCLDQVGLFRKSGVKSRIQALRQ MNEGAIDCVNYEGQSAYDVADMLKQYFRDLPEPLMTNKLSETFLQIYQYVPKDQRLQA IKAAIMLLPDENREVLQTLLYFLSDVTAAVKENQMTPTNLAVCLAPSLFHLNTLKREN SSPRVMQRKQSLGKPDQKDLNENLAATQGLAHMIAECKKLFQVPEEMSRCRNSYTEQE LKPLTLEALGHLGNDDSADYQHFLQDCVDGLFKEVKEKFKGWVSYSTSEQAELSYKKV SEGPPLRLWRSVIEVPAVPEEILKRLLKEQHLWDVDLLDSKVIEILDSQTEIYQYVQN SMAPHPARDYVVLRTWRTNLPKGACALLLTSVDHDRAPVVGVRVNVLLSRYLIEPCGP GKSKLTYMCRVDLRGHMPEWYTKSFGHLCAAEVVKIRDSFSNQNTETKDTKSR" BASE COUNT 972 a 1048 c 966 g 780 t ORIGIN 1 cccagccagg acatggccgc acctctcctc atcaggagcg ccggctcacg gacttctcgc 61 ccaactccct gagcgctccc tcgtttcgat ctttagaaaa ccctgctttc tttctggggc 121 cgtgacgagg ggcagggagc ggcgagcaag gatgcgttga ggaccgcgag ggcgcgcgtc 181 tcgggtgccg ccgtgggtcc cgacgcggaa gccgagccgc ctccgcctgc ctcgacttcc 241 ccacagcgct tccgccgccg cctgccgtgc ttgatgtgca gaaagaagcc ggacaccatg 301 atcctaacac aaattgaagc caaggaagct tgtgattggc tacgggcaac tggtttcccc 361 cagtatgcac agctttatga agatttcctg ttccccatcg atatttcctt ggtcaagaga 421 gagcatgatt ttttggacag agatgccatt gaggctctat gcaggcgtct aaatacttta 481 aacaaatgtg cagtgatgaa gctagaaatt agtcctcatc ggaaacgaag tgacgattca 541 gacgaggatg agccttgtgc catcagtggc aaatggactt tccaaaggga cagcaagagg 601 tggtcccggc ttgaagagtt tgatgtcttt tctccaaaac aagacctggt ccctgggtcc 661 ccagacgact cccacccgaa ggacggcccc agccccggag gcacgctgat ggacctcagc 721 gagcgccagg aggtgtcttc cgtccgcagc ctcagcagca ctggcagcct ccccagccac 781 gcgcccccca gcgaggatgc tgccaccccc cggactaact ccgtcatcag cgtttgctcc 841 tccagcaact tggcaggcaa tgacgactct ttcggcagcc tgccctctcc caaggaactg 901 tccagcttca gcttcagcat gaaaggccac gaaaaaactg ccaagtccaa gacgcgcagt 961 ctgctgaaac ggatggagag cctgaagctc aagagctccc atcacagcaa gcacaaagcg 1021 ccctcaaagc tggggttgat catcagcggg cccatcttgc aagaggggat ggatgaggag 1081 aagctgaagc agctcaactg cgtggagatc tccgccctca atggcaaccg catcaacgtc 1141 cccatggtac gaaagaggag cgtttccaac tccacgcaga ccagcagcag cagcagccag 1201 tcggagacca gcagcgcggt cagcacgccc agccctgtta cgaggacccg gagcctcagt 1261 gcgtgcaaca agcgggtggg catgtactta gagggcttcg atcctttcaa tcagtcaaca 1321 tttaacaacg tgatggagca gaactttaag aaccgcgaga gctacccaga ggacacggtg 1381 ttctacatcc ctgaagatca caagcctggc actttcccca aagctctcac caatggcagt 1441 ttctccccct cggggaataa cggctctgtg aactggagga cgggaagctt ccacggccct 1501 ggccacatca gcctcaggag ggaaaacagt agcgacagcc ccaaggaact gaagagacgc 1561 aattcttcca gctccatgag cagccgcctg agcatctacg acaacgtgcc gggctccatc 1621 ctctactcca gttcagggga cctggcggat ctggagaacg aggacatctt ccccgagctg 1681 gacgacatcc tctaccacgt gaaggggatg cagcggatag tcaatcagtg gtcggagaag 1741 ttttctgatg agggagattc ggactcagcc ctggactcgg tctctccctg cccgtcctct 1801 ccaaaacaga tacacctgga tgtggacaac gaccgaacca cacccagcga cctggacagc 1861 acaggcaact ccctgaatga accggaagag ccctccgaga tcccggaaag aagggattct 1921 ggggttgggg cttccctaac caggtccaac aggcaccgac tgagatggca cagtttccag 1981 agctcacatc ggccaagcct caactctgta tcactacaga ttaactgcca gtctgtggcc 2041 cagatgaacc tgctgcagaa atactcactc ctaaagctaa cggccctgct ggagaaatac 2101 acaccttcta acaagcatgg ttttagctgg gccgtgccca agttcatgaa gaggatcaag 2161 gttccagact acaaggaccg gagtgtgttt ggggtcccac tgacggtcaa cgtgcagcgc 2221 acaggacaac cgttgcctca gagcatccag caggccatgc gatacctccg gaaccattgt 2281 ttggatcagg ttgggctctt cagaaaatcg ggggtcaagt cccggattca ggctctgcgc 2341 cagatgaatg aaggtgccat agactgtgtc aactacgaag gacagtctgc ttatgacgtg 2401 gcagacatgc tgaagcagta ttttcgagat cttcctgagc cactaatgac gaacaaactc 2461 tcggaaacct ttctacagat ctaccaatat gtgcccaagg accagcgcct gcaggccatc 2521 aaggctgcca tcatgctgct gcctgacgag aaccgggagg ttctgcagac cctgctttat 2581 ttcctgagcg atgtcacagc agccgtaaaa gaaaaccaga tgaccccaac caacctggcc 2641 gtgtgcttag cgccttccct cttccatctc aacaccctga agagagagaa ttcctctccc 2701 agggtaatgc aaagaaaaca aagtttgggc aaaccagatc agaaagattt gaatgaaaac 2761 ctagctgcca ctcaagggct ggcccatatg atcgccgagt gcaagaagct tttccaggtt 2821 cccgaggaaa tgagccgatg tcgtaattcc tataccgaac aagagctgaa gcccctcact 2881 ctggaagcac tcgggcacct gggtaatgat gactcagctg actaccaaca cttcctccag 2941 gactgtgtgg atggcctgtt taaagaagtc aaagagaagt ttaaaggctg ggtcagctac 3001 tccacttcgg agcaggctga gctgtcctat aagaaggtga gcgaaggacc ccctctgagg 3061 ctttggaggt cagtcattga agtccctgct gtgccagagg aaatcttaaa gcgcctactt 3121 aaagaacagc acctctggga tgtagacctg ttggattcaa aagtgatcga aattctggac 3181 agccaaactg aaatttacca gtatgtccaa aacagtatgg cacctcatcc tgctcgagac 3241 tacgttgttt taagaacctg gaggactaat ttacccaaag gagcctgtgc ccttttacta 3301 acctctgtgg atcacgatcg cgcacctgtg gtgggtgtga gggttaatgt gctcttgtcc 3361 aggtatttga ttgaaccctg tgggccagga aaatccaaac tcacctacat gtgcagagtt 3421 gacttaaggg gccacatgcc agaatggtac acaaaatctt ttggacattt gtgtgcagct 3481 gaagttgtaa agatccggga ttccttcagt aaccagaaca ctgaaaccaa agacaccaaa 3541 tctaggtgat cactgaagca acgcaaccgc ttccaccacc atggtgtttg tttctagaac 3601 ttttgccagt ccttgaagaa tgggttctgt gtctaatcct gaaacaaaga aaactacaag 3661 ctggagtgta ggaattgact atagcaattt gatacatttt taaagctgct tcctgtttgt 3721 tgagggtctg tattcataga ccttgactgg aatatgtaag actgtg // LOCUS AF026261 1464 bp mRNA PRI 07-JAN-1998 DEFINITION Homo sapiens histamine H1 receptor mRNA, complete cds. ACCESSION AF026261 NID g2605717 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1464) AUTHORS Rae,J.L. and Shepard,A.R. TITLE Direct Submission JOURNAL Submitted (22-SEP-1997) Physiology and Biophysics, Mayo Foundation, 200 1st Street SW, Rochester, MN 55905, USA FEATURES Location/Qualifiers source 1..1464 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="lens epithelium" CDS 1..1464 /codon_start=1 /product="histamine H1 receptor" /db_xref="PID:g2605718" /translation="MSLPNSSCLLEDKMCEGNKTTMASPQLMPLVVVLSTICLVTVGL NLLVLYAVRSERKLHTVGNLYIVSLSVADLIVGAVVMPMNILYLLMSKWSLGRPLCLF WLSMDYVASTASIFSVFILCIDRYRSVQQPLRYLKYRTKTRASATILGAWFLSFLWVI PILGWNHFMQQTSVRREDKCETDFYDVTWFKVMTAIINFYLPTLLMLWFYAKIYKAVR QHCQHRELINRSLPSFSEIKLRPENPKGDAKKPGKESPWEVLKRKPKDAGGGSVLKSP SQTPKEMKSPVVFSQEDDREVDKLYCFPLDIVHMQAAAEGSSRDYVAVNRSHGQLKTD EQGLNTHGASEISEDQMLGDSQSFSRTDSDTTTETAPGKGKLRSGSNTGLDYIKFTWK RLRSHSRQYVSGLHMNRERKAAKQLGFIMAAFILCWIPYFIFFMVIAFCKNCCNEHLH MFTIWLGYINSTLNPLIYPLCNENFKKTFKRILHIRS" BASE COUNT 352 a 406 c 359 g 347 t ORIGIN 1 atgagcctcc ccaattcctc ctgcctctta gaagacaaga tgtgtgaggg caacaagacc 61 actatggcca gcccccagct gatgcccctg gtggtggtcc tgagcactat ctgcttggtc 121 acagtagggc tcaacctgct ggtgctgtat gccgtacgga gtgagcggaa gctccacact 181 gtggggaacc tgtacatcgt cagcctctcg gtggcggact tgatcgtggg tgccgtcgtc 241 atgcctatga acatcctcta cctgctcatg tccaagtggt cactgggccg tcctctctgc 301 ctcttttggc tttccatgga ctatgtggcc agcacagcgt ccattttcag tgtcttcatc 361 ctgtgcattg atcgctaccg ctctgtccag cagcccctca ggtaccttaa gtatcgtacc 421 aagacccgag cctcggccac cattctgggg gcctggtttc tctcttttct gtgggttatt 481 cccattctag gctggaatca cttcatgcag cagacctcgg tgcgccgaga ggacaagtgt 541 gagacagact tctatgatgt cacctggttc aaggtcatga ctgccatcat caacttctac 601 ctgcccacct tgctcatgct ctggttctat gccaagatct acaaggccgt acgacaacac 661 tgccagcacc gggagctcat caataggtcc ctcccttcct tctcagaaat taagctgagg 721 ccagagaacc ccaaggggga tgccaagaaa ccagggaagg agtctccctg ggaggttctg 781 aaaaggaagc caaaagatgc tggtggtgga tctgtcttga agtcaccatc ccaaaccccc 841 aaggagatga aatccccagt tgtcttcagc caagaggatg atagagaagt agacaaactc 901 tactgctttc cacttgatat tgtgcacatg caggctgcgg cagaggggag tagcagggac 961 tatgtagccg tcaaccggag ccatggccag ctcaagacag atgagcaggg cctgaacaca 1021 catggggcca gcgagatatc agaggatcag atgttaggtg atagccaatc cttctctcga 1081 acggactcag ataccaccac agagacagca ccaggcaaag gcaaattgag gagtgggtct 1141 aacacaggcc tggattacat caagtttact tggaagaggc tccgctcgca ttcaagacag 1201 tatgtatctg ggttgcacat gaaccgcgaa aggaaggccg ccaaacagtt gggttttatc 1261 atggcagcct tcatcctctg ctggatccct tatttcatct tcttcatggt cattgccttc 1321 tgcaagaact gttgcaatga acatttgcac atgttcacca tctggctggg ctacatcaac 1381 tccacactga accccctcat ctaccccttg tgcaatgaga acttcaagaa gacattcaag 1441 agaattctgc atattcgctc ctaa // LOCUS AF026263 1599 bp mRNA PRI 07-JAN-1998 DEFINITION Homo sapiens muscarinic receptor (CHRM5) mRNA, complete cds. ACCESSION AF026263 NID g2605721 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1599) AUTHORS Rae,J.L. and Shepard,A.R. TITLE Direct Submission JOURNAL Submitted (22-SEP-1997) Physiology and Biophysics, Mayo Foundation, 200 1st Street SW, Rochester, MN 55905, USA FEATURES Location/Qualifiers source 1..1599 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="lens epithelium" gene 1..1599 /gene="CHRM5" CDS 1..1599 /gene="CHRM5" /codon_start=1 /product="muscarinic receptor" /db_xref="PID:g2605722" /translation="MEGDSYHNATTVNGTPVNHQPLERHRLWEVITIAAVTAVVSLIT IVGNVLVMISFKVNSQLKTVNNYYLLSLACADLIIGIFSMNLYTTYILMGRWALGSLA CDLWLALDYVASNASVMNLLVISFDRYFSITRPLTYRAKRTPKRAGIMIGLAWLISFI LWAPAILCWQYLVGKRTVPLDECQIQFLSEPTITFGTAIAAFYIPVSVMTILYCRIYR ETEKRTKDLADLQGSDSVTKAEKRKPAHRALFRSCLRCPRPTLAQRERNQASWSSSRR STSTTGKPSQATGPSANWAKAEQLTTCSSYPSSEDEDKPATDPVLQVVYKSQGKESPG EEFSAEETEETFVKAETEKSDYDTPNYLLSPAAAHRPKSQKCVAYKFRLVVKADGNQE TNNGCHKVKIMPCPFPVAKEPSTKGLNPNPSHQMTKRKRVVLVKERKAAQTLSAILLA FIITWTPYNIMVLVSTFCDKCVPVTLWHLGYWLCYVNSTVNPICYALCNRTFRKTFKM LLLCRWKKKKVEEKLYWQGNSKLP" BASE COUNT 407 a 459 c 376 g 357 t ORIGIN 1 atggaagggg attcttacca caatgcaacc accgtcaatg gcaccccagt aaatcaccag 61 cctttggaac gccacaggtt gtgggaagtc atcaccattg cagctgtgac tgctgtggta 121 agcctgatca ccattgtggg caatgtcttg gtcatgatct ccttcaaagt caacagccag 181 ctcaagacag ttaacaacta ttacctgctc agcttagcct gtgcagatct catcattgga 241 atcttctcca tgaacctcta caccacctac atcctcatgg gacgctgggc tctcgggagt 301 ctggcttgtg acctttggct tgcactggac tacgtggcca gcaacgcttc tgtcatgaac 361 cttctggtga tcagttttga ccgttacttt tccatcacaa gacccttgac atatcgggcc 421 aagcgtactc cgaaaagggc tggcatcatg attggcttgg cctggctgat ctccttcatc 481 ctctgggccc cagcaatcct ctgctggcag tacttggttg ggaagcggac agttccactg 541 gatgagtgcc agatccagtt tctctctgag cccaccatca cttttggcac tgccattgct 601 gccttctaca tccctgtttc tgtcatgacc atcctctact gtcgaatcta ccgggaaaca 661 gagaagcgaa ccaaggacct ggctgacctc cagggttctg actctgtgac caaagctgag 721 aagagaaagc cagctcatag ggctctgttc agatcctgct tgcgctgtcc tcgacccacc 781 ctggcccagc gggaaaggaa ccaggcctcc tggtcatcct cccgcaggag cacctccacc 841 actgggaagc catcccaagc cactggccca agcgccaatt gggccaaagc tgagcagctc 901 accacctgta gcagctaccc ttcctcagag gatgaggaca agcccgccac tgaccctgtc 961 ctccaagtgg tctacaagag tcagggtaag gaaagcccag gggaagaatt cagtgctgaa 1021 gagactgagg aaacttttgt gaaagctgaa actgaaaaaa gtgactatga caccccaaac 1081 taccttctgt ctccagcagc tgctcataga cccaagagtc agaaatgtgt ggcctataag 1141 ttccgattgg tggtaaaagc tgacgggaac caggagacca acaatggctg tcacaaggtg 1201 aaaatcatgc cctgcccctt cccagtggcc aaggaacctt caacgaaagg cctcaatccc 1261 aaccccagcc atcaaatgac caaacgaaag agagtggtcc tagtcaaaga gaggaaagca 1321 gcccagacac tgagtgccat tctcctggcc ttcatcatca catggacccc gtataacatc 1381 atggtcctgg tttctacctt ctgtgacaag tgtgtcccag tcaccctgtg gcacttgggc 1441 tattggttgt gctatgtcaa tagcactgtc aaccccatct gctatgccct ctgcaacaga 1501 accttcagga agacctttaa gatgctgctt ctctgccgat ggaaaaagaa aaaagtggaa 1561 gagaagttgt actggcaggg gaacagcaag ctaccctga // LOCUS AF026273 1782 bp mRNA PRI 01-DEC-1997 DEFINITION Homo sapiens interleukin-1 receptor-associated kinase-2 mRNA, complete cds. ACCESSION AF026273 NID g2653876 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1782) AUTHORS Muzio,M., Ni,J., Feng,P. and Dixit,V.M. TITLE A novel IRAK/Pelle family member and MyD88 are components of the IL-1R signaling complex JOURNAL Unpublished REFERENCE 2 (bases 1 to 1782) AUTHORS Muzio,M., Ni,J., Feng,P. and Dixit,V.M. TITLE Direct Submission JOURNAL Submitted (22-SEP-1997) Pathology, University of Michigan, 1150 Med. Ctr. Drive, MSRBI, Room 7210, Ann Arbor, MI 48109, USA FEATURES Location/Qualifiers source 1..1782 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 10..1782 /note="a novel IRAK/Pelle family member; IRAK-2" /codon_start=1 /product="interleukin-1 receptor-associated kinase-2" /db_xref="PID:g2653877" /translation="MACYIYQLPSWVLDDLCRNMDALSEWDWMEFASYVITDLTQLRK IKSMERVQGVSITRELLWWWGMRQATVQQLVDLLCRLELYRAAQIILNWKPAPEIRCP IPAFPDSVKPEKPLAASVRKAEDEQEEGQPVRMATFPGPGSSPARAHQPAFLQPPEED APHSLRSDLPTSSDSKDFSTSIPKQEKLLSLAGDSLFWSEADVVQATDDFNQNRKISQ GTFADVYRGHRHGKPFVFKKLRETACSSPGSIERFFQAELQICLRCCHPNVLPVLGFC AARQFHSFIYPYMANGSLQDRLQGQGGSDPLPWPQRVSICSGLLCAVEYLHGLEIIHS NVKSSNVLLDQNLTPKLAHPMAHLCPVNKRSKYTMMKTHLLRTSAAYLPEDFIRVGQL TKRVDIFSCGIVLAEVLTGIPAMDNNRSPVYLKDLLLSEIPSSTASLCSRKTGVENVM AKEICQKYLEKGAGRLPEDCAEALATAACLCLRRRNTSLQEVCGSVAAVEERLRGRET LLPWSGLSEGTGSSSNTPEETDDVDNSSLDASSSMSVAPWAGAATPLLPTENGEGRLR VIVGREADSSSEACVGLEPPQDVT" BASE COUNT 388 a 520 c 510 g 363 t 1 others ORIGIN 1 tagcgtgcca tggcctgcta catctaccag ctgccctcct gggtgctgga cgacctgtgc 61 cgcaacatgg acgcgctcag cgagtgggac tggatggagt tcgcctccta cgtgatcaca 121 gacctgaccc agctgcggaa gatcaagtcc atggagcggg tgcagggtgt gagcatcacg 181 cgggagctgc tgtggtggtg gggcatgcgg caggccaccg tccagcaact tgtggacctc 241 ctgtgccgcc tggagctsta ccgggctgcc cagatcatcc tgaactggaa accggctcct 301 gaaatcaggt gtcccattcc agccttccct gactctgtga agccagaaaa gcctttggca 361 gcttctgtaa gaaaggctga ggatgaacag gaagaggggc agcctgtgag gatggccacc 421 tttccaggcc cagggtcctc tccagccaga gcccaccagc cggcctttct ccagcctcct 481 gaagaagatg cccctcattc cttgagaagc gacctcccca cttcgtctga ttcaaaggac 541 ttcagcacct ccattcctaa gcaggaaaaa cttttgagct tggctggaga cagccttttc 601 tggagtgagg cagacgtggt ccaggcaacc gatgacttca atcaaaaccg caaaatcagc 661 caggggacct ttgctgacgt ctacagaggg cacaggcacg ggaagccatt cgtcttcaag 721 aagctcagag agacagcctg ttcaagtcca ggatcaatcg aaagattctt ccaggcagag 781 ttgcagattt gtcttagatg ctgccacccc aatgtcttac ctgtgctggg cttctgtgct 841 gcaagacagt ttcacagctt catctacccc tacatggcaa atggttccct acaggacaga 901 ctgcagggtc agggtggctc ggaccccctc ccctggcccc agcgtgtcag catctgctca 961 gggctgctct gtgccgtcga gtacctgcat ggtctggaga tcatccacag caacgtcaag 1021 agctctaatg tcttgctgga ccaaaatctc acccccaaac ttgctcaccc aatggctcat 1081 ctgtgtcctg tcaacaaaag gtcaaaatac accatgatga agactcacct gctccggacg 1141 tcagccgcgt atctgccaga ggatttcatc cgggtggggc agctgacaaa gcgagtggac 1201 atcttcagct gtggaatagt gttggccgag gtcctcacgg gcatccctgc aatggataac 1261 aaccgaagcc cggtttacct gaaggactta ctcctcagtg aaattccaag cagcaccgcc 1321 tcgctctgct ccaggaagac gggcgtggag aacgtgatgg caaaggagat ctgccagaag 1381 tacctggaga agggcgcagg gaggcttccg gaggactgcg ccgaggccct ggccacggct 1441 gcctgcctgt gcctgcggag gcgtaacacc agcctgcagg aggtgtgtgg ctctgtggct 1501 gctgtggaag agcggctccg aggtcgggag acgttgctcc cttggagtgg gctttctgag 1561 ggtacaggct cttcttccaa caccccagag gaaacagacg acgttgacaa ttccagcctt 1621 gatgcctcct cctccatgag tgtggcaccc tgggcagggg ctgccacccc acttctcccc 1681 acagagaatg gggaaggaag gctgcgggtc atcgtgggaa gggaggctga ctcctcctct 1741 gaggcctgtg ttggcctgga gcctccccag gatgttacat aa // LOCUS AF026292 1863 bp mRNA PRI 24-OCT-1997 DEFINITION Homo sapiens chaperonin containing t-complex polypeptide 1, eta subunit (Ccth) mRNA, complete cds. ACCESSION AF026292 NID g2559009 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1863) AUTHORS Won,K.-A. and Reed,S.I. TITLE CCT-eta JOURNAL Unpublished REFERENCE 2 (bases 1 to 1863) AUTHORS Won,K.-A. and Reed,S.I. TITLE Direct Submission JOURNAL Submitted (23-SEP-1997) Molecular Biology, MB7, Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, CA 92037, USA FEATURES Location/Qualifiers source 1..1863 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HepG2" gene 1..1863 /gene="Ccth" CDS 69..1700 /gene="Ccth" /note="CCT containing TCP-1-eta; CCT-eta" /codon_start=1 /product="chaperonin containing t-complex polypeptide 1, eta subunit" /db_xref="PID:g2559010" /translation="MMPTPVILLKEGTDSSQGIPQLVSNISACQVIAEAVRTTLGPRG MDKLIVDGRGKATISNDGATILKLLDVVHPAAKTLVDIAKSQDAEVGDGTTSVTLLAA EFLKQVKPYVEEGLHPQIIIRAFRTATQLAVNKIKEIAVTVKKADKVEQRKLLEKCAM TALSSKLISQQKAFFAKMVVDAVMMLDDLLQLKMIGIKKVQGGALEDSQLVAGVAFKK TFSYAGFEMQPKKYHNPKIALLNVELELKAEKDNAEIRVHTVEDYQAIVDAEWNILYD KLEKIHHSGAKVVLSKLPIGDVATQYFADRDMFCAGRVPEEDLKRTMMACGGSIQTSV NALSADVLGRCQVFEETQIGGERYNFFTGCPKAKTCTFILRGGAEQFMEETERSLHDA IMIVRRAIKNDSVVAGGGAIEMELSKYLRDYSRTIPGKQQLLIGAYAKALEIIPRQLC DNAGFDATNILNKLRARHAQGGTWYGVDINNEDIADNFEAFVWEPAMVRINALTAASE AACLIVSVDETIKNPRSTVDAPTAAGRGRGRGRPH" BASE COUNT 483 a 428 c 521 g 431 t ORIGIN 1 gttccaaggt ttgcggcccg gtctcggaga agaggggaga gtggagggcc gctgaataag 61 cttccaaaat gatgcccaca ccagttatcc tattgaaaga ggggactgat agctcccaag 121 gcatccccca gcttgtgagt aacatcagtg cctgccaggt gattgctgag gctgtaagaa 181 ctaccctggg tccccgtggc atggacaagc ttattgtaga tggcagaggc aaagcaacaa 241 tttctaatga tggggccaca attctgaaac ttcttgatgt tgtccatcct gcagcaaaga 301 ctttggtaga cattgccaaa tcccaagatg ctgaggtggg tgatggcacc acctcagtga 361 ccttgctggc tgcagagttt ctgaagcagg tgaaacccta tgtggaggaa ggtttacacc 421 cccagatcat cattcgagct ttccgcacag ccacccagct ggcagttaac aagatcaaag 481 agattgctgt gaccgtgaag aaggcagata aagtggagca gaggaagctg ctggaaaagt 541 gtgccatgac cgctctgagc tccaagctga tctcccagca gaaagctttc tttgctaaga 601 tggtggtgga tgcagtgatg atgctcgatg atttgctgca gcttaaaatg attggaatca 661 agaaggtaca gggtggagcc ctcgaggatt ctcagctggt agctggtgtt gcattcaaga 721 agactttctc ttacgctggg tttgaaatgc aacccaaaaa gtaccacaat cccaagattg 781 cccttttgaa tgtcgagctc gagttgaaag ctgagaaaga caatgctgag ataagagtcc 841 acacagttga ggattatcag gcaattgttg atgctgagtg gaacattctc tatgacaagt 901 tagagaagat ccatcattct ggagccaaag ttgtcttgtc caaactcccc attggggatg 961 tggccaccca gtactttgct gacagggaca tgttctgtgc tggccgagta cctgaggagg 1021 atctgaagag gacaatgatg gcctgtggag gctcaatcca gaccagtgtg aatgctctgt 1081 cagcagatgt gctgggtcga tgccaggtgt ttgaagagac ccagattgga ggcgagaggt 1141 acaatttttt tactggctgc cccaaggcca agacatgcac cttcattctc cgtggcggcg 1201 ccgagcagtt tatggaggag acagagcggt ccctgcatga tgccatcatg atcgtcagga 1261 gggccatcaa gaatgattca gtggtggctg gtggcggggc cattgagatg gaactctcca 1321 agtacctgcg ggattactca aggactattc caggaaaaca gcagctgttg attggggctt 1381 atgccaaggc cttggagatt atcccacgcc agctgtgtga caatgctggc tttgatgcca 1441 caaacattct caacaagctg cgggctcggc atgcccaggg gggtacatgg tatggagtag 1501 acatcaacaa cgaggacatt gctgacaact ttgaagcttt cgtgtgggag ccagctatgg 1561 tgcggatcaa tgcgctgaca gcagcctctg aggctgcgtg cctgatcgtg tctgtagatg 1621 aaaccatcaa gaacccccgc tcgactgtgg atgctcccac agcagcaggc cggggccgtg 1681 gtcgtggccg cccccactga gaggcacccc acccatcaca tggctggctg gctgctgggt 1741 gcacttaccc tccttggctt ggttacttca ttttacaagg aaggggtagt aattggccca 1801 ctctcttctt actggaggct atttaaataa aatgtaagac ttcaaaaaaa aaaaaaaaaa 1861 aaa // LOCUS AF026402 3237 bp mRNA PRI 02-DEC-1997 DEFINITION Homo sapiens U5 snRNP 100 kD protein mRNA, complete cds. ACCESSION AF026402 NID g2655201 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3237) AUTHORS Teigelkamp,S., Mundt,C., Achsel,T., Will,C.L. and Luehrmann,R. TITLE The human U5 snRNP-specific 100kD protein is an RS domain containing, putative RNA helicase with significant homology to the yeast splicing factor Prp28p JOURNAL RNA 3 (1997) In press REFERENCE 2 (bases 1 to 3237) AUTHORS Teigelkamp,S., Mundt,C., Achsel,T., Will,C.L. and Luehrmann,R. TITLE Direct Submission JOURNAL Submitted (23-SEP-1997) AG Luehrmann, IMT, Emil Mannkopff Str. 2, Marburg 35037, Germany FEATURES Location/Qualifiers source 1..3237 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 40..2502 /note="DEAD-box, RS domain; PRP28p homolog; putative RNA helicase" /codon_start=1 /product="U5 snRNP 100 kD protein" /db_xref="PID:g2655202" /translation="MAGELADKKDRDASPSKEERKRSRTPDRERDRDRDRKSSPSKDR KRHRSRDRRRGGSRSRSRSRSKSAERERRHKERERDKERDRNKKDRDRDKDGHRRDKD RKRSSLSPGRGKDFKSRKDRDSKKDEEDEHGDKKLKAQPLSLEELLAKKKAEEEAEAK PKFLSKAEREAEALKRRQQEVEERQRMLEEERKKRKQFQDLGRKMLEDPQERERRERR ERMERETNGNEDEEGRQKIREEKDKSKELHAIKERYLGGIKKRRRTRHLNDRKFVFEW DASEETSIDYNPLYKERHQVQLLGRGFIAGIDFKQQKREQSRFYGDLMEKRRTLEEKE QEEARLRKLRKKEAKQRWDDRHWSQKKLDEMTDRDWRIFREDYSITTKGGKIPNPIRS WKDSSLPPHILEVIDKCGYKEPTPIQRQAIPIGLQNRDIIGVAETGSGKTAAFLIPLL VWITTLPKIDRIEESDQGPYAIILAPTRELAQQIEEETIKFGKPLGIRTVAVIGGISR EDQGFRLRMGCEIVIATPGRLIDVLENRYLVLSRCTYVVLDEADRMIDMGFEPDVQKI LEHMPVSNQKPDTDEAEDPEKMLANFESGKHKYRQTVMFTATMPPAVERLARSYLRRP AVVYIGSAGKPHERVEQKVFLMSESEKRKKLLAILEQGFDPPIIIFVNQKKGCDVLAK SLEKMGYNACTLHGGKGQEQREFALSNLKAGAKDILVATDVAGRGIDIQDVSMVVNYD MAKNIEDYIHRIGRTGRAGKSGVAITFLTKEDSAVFYELKQAILESPVSSCPPELANH PDAQHKPGTILTKKRREETIFA" BASE COUNT 895 a 760 c 934 g 648 t ORIGIN 1 cgacgttgag gccgcgttgg gcggttcaga ctcagggtga tggcaggaga gctggctgac 61 aaaaaggacc gtgatgcatc accttccaag gaggaaagga agcgatcacg gactcctgac 121 agagagcggg atagagaccg ggaccggaag tcttccccat ctaaagatag aaagcggcat 181 cgttcaaggg atagacgtcg aggaggcagc cgttctcgct ctcgttcccg ttccaaatct 241 gcagaaagag aacgacggca caaagaacga gaacgagata aggagcggga tcggaataag 301 aaggaccgag atcgagacaa ggatgggcac agacgggaca aggaccgtaa acgatccagc 361 ttatctcctg gtcgaggaaa agactttaaa tctcggaagg acagagactc taagaaggat 421 gaagaggatg aacatggtga taagaagctt aaggcccagc cattatccct ggaggagctt 481 ctggccaaga aaaaggctga ggaagaagct gaggctaagc ccaagttcct ctctaaagca 541 gaacgagagg ctgaagctct aaagcgacgg cagcaggagg tggaagagcg gcagaggatg 601 cttgaagaag agaggaagaa aaggaaacag ttccaagact tgggcaggaa gatgttggaa 661 gatcctcagg aacgggaacg tcgggaacgc agggagagga tggaacggga gaccaatgga 721 aatgaggatg aggaagggcg gcagaagatc cgggaagaga aggataagag caaggaactg 781 catgccatta aggagcgtta cctgggtggc atcaaaaagc ggcgccgaac gagacatctc 841 aatgaccgga aatttgtttt tgagtgggat gcatctgagg agacatccat tgactacaac 901 cccctgtaca aagaacggca ccaggtgcag ttgttagggc gaggcttcat tgcaggcatt 961 gacttcaagc agcagaagcg agagcagtca cgtttctatg gagacctaat ggagaagagg 1021 cgaaccctgg aagaaaagga gcaggaggag gcaagactcc gcaaacttcg taagaaggaa 1081 gccaagcagc gctgggatga tcgtcattgg tctcagaaaa agttagatga gatgacggac 1141 agggactggc ggatcttccg tgaggactac agcatcacca ccaaaggtgg caagatcccc 1201 aatcccatcc gatcctggaa agactcttct ctgcccccac acatcttgga ggtcattgat 1261 aagtgtggct acaaggaacc aacacctatc cagcgtcagg caattcccat tgggctacag 1321 aatcgtgaca tcattggtgt ggctgagact ggcagtggca agacagcagc cttcctcatc 1381 cctctgctgg tctggatcac cacacttccc aaaattgaca ggatcgaaga gtcagaccaa 1441 ggcccttatg ccatcatcct ggctcccacc cgtgagttgg ctcaacagat tgaggaagag 1501 accatcaagt ttgggaaacc gctaggtatc cgcactgtgg ctgtcattgg tggcatctcc 1561 agagaagacc agggcttcag gctgcgcatg ggttgtgaga ttgtgattgc tacccctggg 1621 cgtttgattg atgtgctgga gaaccgctac ctggtgctga gccgctgtac ctatgtggtt 1681 ctggatgagg cagataggat gattgacatg ggctttgagc cagatgtcca gaagatcctg 1741 gagcacatgc ctgtcagcaa ccagaagcca gacacggatg aggctgagga ccctgagaag 1801 atgctggcca actttgagtc gggaaaacat aagtaccgcc aaacagtcat gttcacggcc 1861 accatgcccc cagcggtgga gcgtctggcc aggagctatc ttcggcgacc tgctgtggtg 1921 tacattggct ccgcaggcaa gccccatgag cgtgtggaac agaaggtctt cctcatgtca 1981 gagtcagaaa agaggaaaaa gctgctggca atcttggagc aaggctttga cccacccatc 2041 attatttttg tcaaccagaa gaagggctgc gacgtgttgg ccaaatccct ggagaagatg 2101 gggtacaatg cttgcacact gcacggtgga aaaggccagg agcagcgaga gtttgcgttg 2161 tccaacctca aggctggggc caaggatatt ttggtggcta cagatgtggc tggtcgtggt 2221 attgacatcc aagatgtgtc tatggttgtc aactatgata tggccaaaaa tattgaagat 2281 tacatccacc gcattggccg cacgggacga gcaggcaaga gtggggtggc catcaccttc 2341 ctcacaaaag aggactctgc tgtgttctac gagctgaagc aagctatcct ggaaagccca 2401 gtgtcttcct gtccccccga actagccaac cacccagatg cccagcataa gccaggcacc 2461 atcctcacca agaagcgccg ggaagagacc atctttgcct gacacagcac tcttcctgtg 2521 ggctgagggc atctccaaag ctggcctgat gcctgttttt cagaaccctc acatccctct 2581 ttccaggtcc tcactcttgg gatatggggg cttaggaaaa caatccaact ccctagccca 2641 gaccctcagg tcaggaggcc tgcgtgtggg gctgcaaaag gagaggacga cgctgtcgga 2701 ggcagggaga gcaaattacc acagcttctt ggcccagttc tgcccttctt tgctttggga 2761 ttgcactggg ccatcagctc atgccaggct atgggggcag ccagttggca ttgctcccca 2821 gactgaacag aaacctggcc gccggatggg acctcctttg gcacagactt gactgtgtaa 2881 ctgcataaac tgcagtagca tcattgccct agatgcccca ggagacctgg caccatgagg 2941 attacagaca gtggaatctt actgtcatct ggacagctgt tttcctgttt ggatggtaaa 3001 ggaagttgag agtctttaga cctgtgcaca gccccgcacc aaggggtgct gtatgctcta 3061 ggcatcccct cccccagggg attttttaag tagatggggg gacacggtga actggctgtg 3121 tccatctttg tcactgagtg aaatctctgt tttctatttt ctgagaagat aagtttgtat 3181 gttctgagaa taaatacatg aatattaaga ctgttaaaaa aaaaaaaaaa aaaaaaa // LOCUS AF026445 3997 bp mRNA PRI 02-JAN-1998 DEFINITION Homo sapiens cofactor of initiator function (CIF150) mRNA, complete cds. ACCESSION AF026445 NID g2739086 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3997) AUTHORS Kaufmann,J., Ahrens,K., Koop,R., Smale,S.T. and Mueller R. TITLE CIF150, a human cofactor for TFIID-dependent Initiator function JOURNAL Mol. Cell. Biol. (1997) In press REFERENCE 2 (bases 1 to 3997) AUTHORS Kaufmann,J., Ahrens,K., Koop,R., Smale,S.T. and Mueller R. TITLE Direct Submission JOURNAL Submitted (23-SEP-1997) Chiron Technologies, 4560 Horton Street, Emeryville, CA 94608, USA FEATURES Location/Qualifiers source 1..3997 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" gene 1..3997 /note="Cofactor of Initiator function" /gene="CIF150" CDS 266..3865 /gene="CIF150" /note="similar to Drosophila dTAF150" /codon_start=1 /product="cofactor of initiator function" /db_xref="PID:g2739087" /translation="MPLTGVEPARMNRKKGDKGFESPRPYKLTHQVVCINNINFHRKS VVGFVELTIFPTVANLNRIKLNSKQCRIYRVRINDLEAAFIYNDPTLEVCHSESKQRN LNYFSNAYAAAVSAVDPDAGNGELCIKVPSELWKHVDELKVLKIHINFSLDQPKGGLH FVVPSVEGSMAERGAHVFSCGYQNSTRFWFPCVDSYSELCTWKLEFTVDAAMVAVSNG DLVETVYTHDMRKKTFHYMLTIPTAASNISLAIGPFEILVDPYMHEVTHFCLPQLLPL LKHTTSYLHEVFEFYEEILTCRYPYSCFKTVFIDEAYVEVAAYASMSIFSTNLLHSAM IIDETPLTRRCLAQSLAQQFFGCFISRMSWSDEWVLKGISGYIYGLWMKKTFGVNEYR HWIKEELDKIVAYELKTGGVLLHPIFGGGKEKDNPASHLHFSIKHPHTLSWEYYTMFQ CKAHLVMRLIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSIS NVSGKDIQPLIKQWVDQSGVVKFYGSFAFNRKRNVLELEIKQDYTSPGTQKYVGPLKV TVQELDGSFNHTLQIEENSLKHDIPCHSKSRRNKKKKIPLMNGEEVDMDLSAMDADSP LLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQESILALEKFPTPASRLALT DILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLFTRMFCCKSCPNIVKTNN FMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYNDNRKNKFSDNYYRAEM IDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKLLPSYRHTITVSCLR AIRVLQKNGHVPSDPALFKSYAEYGHFVDIRIAALEAVVDYTKVDRSYEELQWLLNMI QNDPVPYVRHKILNMLTKNPPFTKNMESPLCNEALVDQLWKLMNSGTSHDWRLRCGAV DLYFTLFGLSRPSCLPLPELGLVLNLKEKKAVLNPTIIPESVAGNQEAANNPSSHPQL VGFQNPFSSSQDEEEIDMDTVHDSQAFISHHLNMLERPSTPGLSKYRPASSRSALIPQ HSAGCDSTPTTKPQWSLELARKGTGKEQAPLEMSMHPAASAPLSVFTKESTASKHSDH HHHHHHEHKKKKKKHKHKHKHKHKHDSKEKDKEPFTFSSPASGRSIRSPSLSD" BASE COUNT 1206 a 798 c 883 g 1110 t ORIGIN 1 caagatgtcg gcggatggta gcttcgagcc cttgcggaga ggagcatctc tgtgacagaa 61 gcttgtcgac ggcggcttct aggagctagt cgaaggagcg aggttgaggc gggcagcgac 121 ccgtcaggtc gctcacctgg gcaccggcca gctgcgagac gtgacttggg gaccgcaggg 181 gagtggagag tgtgaggtgc caaagactag taatgccccg tatcccccta ggaagccggg 241 aagccaagct ccgcgggacc gcttcatgcc gctgactggt gtagagcccg ccagaatgaa 301 caggaagaaa ggagacaagg gctttgaaag cccaaggcca tataaattaa cccatcaggt 361 cgtctgcatc aacaacataa atttccacag aaaatctgtt gtgggatttg tggaactgac 421 tatattcccc acagttgcaa acttgaatag aatcaagttg aacagcaaac agtgtagaat 481 ataccgagta aggatcaatg atttagaggc tgcttttatt tataatgacc caaccttgga 541 agtttgtcac agtgaatcaa aacagagaaa cctcaattat ttttccaatg cttatgcagc 601 tgcagttagt gctgtggacc ctgatgcagg aaatggagaa ctttgcatta aggttccatc 661 agagctatgg aaacacgttg atgagttaaa ggtcctgaag atacacatca atttttcttt 721 ggatcagccc aaaggaggtc ttcattttgt ggtacccagt gtagagggaa gtatggcaga 781 gagaggtgct catgttttct cttgtgggta tcaaaattct acaagatttt ggttcccttg 841 tgttgattca tactctgaat tgtgtacatg gaaattagaa tttacagtag atgctgcaat 901 ggttgctgtt tctaatggcg atttggtgga gacagtgtat actcatgata tgaggaagaa 961 aactttccat tatatgctta ccattcctac agcagcgtca aatatctcct tggccattgg 1021 accatttgaa atactggtag atccatacat gcatgaggtt actcattttt gtttgcccca 1081 acttcttcca ttgctgaaac ataccacatc ataccttcat gaagtctttg aattttatga 1141 agaaattctt acatgtcgtt acccatactc ctgttttaag actgtcttca ttgatgaggc 1201 ttatgttgaa gtggctgctt atgcttccat gagcattttt agcacaaatc ttttacacag 1261 tgccatgatt atagatgaga cacctttgac tagaaggtgt ttagcccaat ccttggccca 1321 gcagtttttt ggttgtttca tatctagaat gtcttggtct gatgaatggg tgctgaaggg 1381 aatttcaggc tatatctatg gactttggat gaaaaaaact tttggtgtta atgagtaccg 1441 ccattggatt aaagaggagc tagacaaaat agtggcatat gaactaaaaa ctggtggggt 1501 tttactacat cccatatttg gtggaggaaa agagaaggat aatccggctt cccatctaca 1561 cttttcaata aagcatccac atacactgtc ctgggaatac tacactatgt ttcagtgtaa 1621 agcccacctt gtgatgagat tgattgaaaa taggatcagt atggaattta tgctacaagt 1681 tttcaataaa ctgctaagtc tggctagtac tgcttcatct cagaagttcc agtcacatat 1741 gtggagtcag atgttggttt ccacatctgg gtttttgaaa tccatttcaa atgtctctgg 1801 caaagatatt cagccgttaa taaagcagtg ggtagatcag agtggagtgg taaaatttta 1861 tggaagtttt gcatttaata gaaaacgaaa tgtcttggaa ctggaaataa aacaggacta 1921 tacatctcct ggaactcaga aatacgtggg accacttaaa gtgacagtgc aggagttaga 1981 tggatccttc aatcatacac tgcaaattga agaaaacagc cttaaacatg atataccctg 2041 ccattccaaa agtagaagga ataaaaagaa aaaaatccca ctgatgaatg gagaagaagt 2101 tgatatggat ctttctgcaa tggatgctga ttcccctttg ctgtggataa ggatagaccc 2161 agatatgtca gtattgagga aggtagaatt tgagcaagct gattttatgt ggcagtatca 2221 gctccgctat gagagagatg ttgttgcaca gcaggaatcc attttggctt tggaaaaatt 2281 ccctactcca gcatctcggc ttgcactcac tgatatatta gaacaagagc agtgtttcta 2341 cagagtaaga atgtcagctt gtttctgtct tgcaaagatt gcaaattcaa tggtgagcac 2401 atggacagga ccaccagcca tgaagtcact cttcactagg atgttttgtt gtaaaagttg 2461 tccaaacatt gtgaaaacaa acaactttat gagctttcaa agctattttc tacagaagac 2521 tatgccagtt gcaatggctt tattaagaga tgttcataat ctttgtccta aagaagtctt 2581 aacatttatt ttagacttaa tcaagtacaa tgacaacagg aaaaataagt tttcagataa 2641 ctattatcgt gcagaaatga ttgatgccct ggccaactct gttacacctg cagtcagtgt 2701 gaataatgaa gttagaactt tggataactt aaatcctgat gtgcgactca ttcttgaaga 2761 aatcaccaga tttttgaata tggaaaaact tcttccgagt tacaggcata ccatcactgt 2821 cagttgtttg agagccatac gggtacttca gaagaacgga catgtgccaa gtgatccagc 2881 tctttttaaa tcttatgctg aatatggcca ctttgtggac attaggatag cagctttgga 2941 agcagttgtt gattatacta aagtggacag aagttatgaa gaactgcaat ggctacttaa 3001 tatgattcag aatgaccctg taccctatgt aaggcataag attctcaaca tgttgactaa 3061 gaacccccca tttactaaga acatggagtc tcccttatgc aatgaagccc tggtagatca 3121 actttggaaa cttatgaatt ctggtacttc acatgactgg aggttacggt gtggtgctgt 3181 ggacttgtac ttcacacttt ttggcctcag tagaccttcc tgtttaccct tgccagagct 3241 tgggttggtt cttaatctaa aggagaaaaa agctgtcttg aatcctacca taattccaga 3301 gtcagtagca ggcaaccaag aagctgcaaa taatccaagc agtcacccac agctagttgg 3361 atttcagaac cctttttcca gttctcaaga tgaggaggag attgatatgg atactgttca 3421 tgatagccag gccttcattt cccatcattt aaacatgctt gaaaggccgt caactccagg 3481 gctctcgaaa tatcggccag ctagctcccg atctgcttta ataccccagc actcagcagg 3541 ctgtgacagc acacccacca caaaacccca gtggagtttg gaacttgcac ggaagggaac 3601 aggtaaagaa caagcacctt tggagatgag tatgcatcca gcggcaagcg ctccactctc 3661 agtctttact aaggaatcta cagcctccaa acacagtgac caccatcacc accatcacca 3721 tgagcacaag aaaaagaaga agaagcataa acataagcac aaacacaagc ataagcatga 3781 cagtaaagaa aaggacaagg agcctttcac tttctccagc cctgccagtg gcaggtctat 3841 tcgttctcct tccctttcag actgagaagg ggacaaaaag acctttcctt tcatgtccag 3901 aagaatgtat gtaactaaag ctttgtcctc tgtgaagaat tataaatgga ggggggaaag 3961 gattcgcctc tcctacagaa attctgaatt catttaa // LOCUS AF026547 6310 bp mRNA PRI 02-JAN-1998 DEFINITION Homo sapiens neurocan (CSPG3) mRNA, complete cds. ACCESSION AF026547 NID g2739088 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6310) AUTHORS Prange,C.K., Pennacchio,L.A., Lieuallen,K., Fan,W. and Lennon,G.G. TITLE Characterization of the human neurocan gene, CSPG3 JOURNAL Unpublished REFERENCE 2 (bases 1 to 6310) AUTHORS Lennon,G.G., Prange,C.K., Pennacchio,L.A., Lieuallen,K. and Fan,W. TITLE Direct Submission JOURNAL Submitted (24-SEP-1997) Human Genome Center, Lawrence Livermore National Laboratory, 7000 East Avenue, Livermore, CA 94551, USA FEATURES Location/Qualifiers source 1..6310 /organism="Homo sapiens" /db_xref="taxon:9606" /map="19p12-13.1" /chromosome="19" gene 1..6310 /gene="CSPG3" CDS 2..3967 /gene="CSPG3" /note="aggrecan proteoglycan family" /codon_start=1 /product="neurocan" /db_xref="PID:g2739089" /translation="MGAPFVWALGLLMLQMLLFVAGEQGTQDITDASERGLHMQKLGS GSVQAALAELVALPCLFTLQPRPSAARDAPRIKWTKVRTASGQRQDLPILVAKDNVVR VAKSWQGRVSLPSYPRRRANATLLLGPLRASDSGLYRCQVVRGIEDEQDLVPLEVTGV VFHYRSARDRYALTFAEAQEACRLSSAIIAAPRHLQAAFEDGFDNCDAGWLSDRTVRY PITQSRPGCYGDRSSLPGVRSYGRRNPQELYDVYCFARELGGEVFYVGPARRLTLAGA RAQCRRQGAALASVGQLHLAWHEGLDQCDPGWLADGSVRYPIQTPRRRCGGPAPGVRT VYRFANRTGFPSPAERFDAYCFRAHHPTSQHGDLETPSSGDEGEILSAEGPPVRELEP TLEEEEVVTPDFQEPLVSSGEEETLILEEKQESQQTLSPTPGDPMLASWPTGEVWLST VAPSPSDMGAGTAASSHTEVAPTDPMPRRRGRFKGLNGRYFQQQEPEPGLQGGMEASA QPPTSEAAVNQMEPPLAMAVTEMLGSGQSRSPWADLTNEVDMPGAGSAGGKSSPEPWL WPPTMVPPSISGHSRAPVLELEKAEGPSARPATPDLFWSPLEATVSAPSPAPWEAFPV ATSPDLPMMAMLRGPKEWMLPHPTPISTEANRVEAHGEATATAPPSPAAETKVYSLPL SLTPTGQGGEAMPTTPESPRADFRETGETSPAQVNKAEHSSSSPWPSVNRNVAVGFVP TETATEPTGLRGIPGSESGVFDTAESPTSGLQATVDEVQDPWPSVYSKGLDASSPSAP LGSPGVFLVPKVTPNLEPWVATDEGPTVNPMDSTVTPAPSDASGIWEPGSQVFEEAES TTLSPQVALDTSIVTPLTTLEQGDKVGVPAMSTLGSSSSQPHPEPEDQVETQGTSGAS VPPHQSSPLGKPAVPPGTPTAASVGESASVSSGEPTVPWDPSSTLLPVTLGIEDFELE VLAGSPGVESFWEEVASGEEPALPGTPMNAGAEEVHSDPCENNPCLHGGTCNANGTMY GCSCDQGFAGENCEIDIDDCLCSPCENGGTCIDEVNGFVCLCLPSYGGSFCEKDTEGC DRGWHKFQGHCYRYFAHRRAWEDAEKDCRRRSGHLTSVHSPEEHSFINSFGHENTWIG LNDRIVERDFQWTDNTGLQFENWRENQPDNFFAGGEDCVVMVAHESGRWNDVPCNYNL PYVCKKGTVLCGPPPAVENASLIGARKAKNNVHATVRYQCNEGFAQHHVVTIRCRSNG KWDRPQIVCTKPRRSHRMRGHHHHHQHHHQHHHHKSRKERRKHKKHPTEDWEKDEGNF C" 3'UTR 3968..6286 /gene="CSPG3" polyA_signal 6267..6272 /gene="CSPG3" polyA_site 6286 /gene="CSPG3" BASE COUNT 1441 a 1764 c 1734 g 1371 t ORIGIN 1 gatgggggcc ccgtttgtct gggccttggg ccttttgatg ctgcagatgc tgctctttgt 61 ggctggggaa cagggcacac aggatatcac cgatgccagc gaaagggggc tccacatgca 121 gaagctgggg tctgggtcag tgcaggctgc gctggcggag ctggtggccc tgccctgtct 181 ctttaccctg cagccacggc caagcgcagc ccgagatgcc cctcggataa agtggaccaa 241 ggtgcggact gcgtcgggcc agcgacagga cttgcccatc ctggtggcca aggacaatgt 301 cgtgagggtg gccaaaagct ggcagggacg agtgtcactg ccttcctacc cccggcgccg 361 agccaacgcc acgctacttc tggggccact gagggccagt gactctgggc tgtaccgctg 421 ccaggtggtg aggggcatcg aggatgagca ggacctggtg cccttggagg tgacaggtgt 481 tgtgttccac taccgatcag cccgggaccg ctatgcactg accttcgctg aggcccagga 541 ggcctgccgt ctcagctcag ccatcattgc agcccctcgg catctacagg ctgcctttga 601 ggatggcttt gacaactgtg atgctggctg gctctctgac cgcactgttc ggtatcctat 661 cacccagtcc cgtcctggtt gctatggcga ccgtagcagc cttccagggg ttcggagcta 721 tgggaggcgc aacccacagg aactctacga tgtgtattgc tttgcccggg agctgggggg 781 cgaggtcttc tacgtgggcc cggcccgccg cctgacactg gccggcgcgc gtgcacagtg 841 ccgccgccag ggtgccgcgc tggcctcggt gggacagctg cacctggcct ggcatgaggg 901 cctggaccag tgcgacccgg gctggctggc cgacggcagc gtgcgctacc cgatccagac 961 gccgcgccgg cgctgcgggg gcccagcccc gggcgtgcgc accgtctacc gcttcgctaa 1021 ccggaccggc ttcccctcac ccgccgagcg cttcgacgcc tactgcttcc gagctcatca 1081 ccccacgtca caacatggag acctagagac cccatcctct ggggatgagg gggagattct 1141 gtcagcagag gggcccccag ttagagaact ggagcccacc ctggaggagg aagaggtggt 1201 cacccctgac ttccaggagc ctctggtgtc cagtggggaa gaagaaaccc tgattttgga 1261 ggagaagcag gagtctcaac agaccctcag ccctacccct ggggacccca tgctggcctc 1321 atggcccact ggggaagtgt ggctaagcac ggtggccccc agccctagcg acatgggggc 1381 aggcactgca gcaagttcac acacggaggt ggccccaact gaccctatgc ctaggagaag 1441 ggggcgcttc aaagggttga atgggcgcta cttccagcag caggaaccgg agccggggct 1501 gcaagggggg atggaggcca gcgcccagcc ccccacctca gaggctgcag tgaaccaaat 1561 ggagcctccg ttggccatgg cagtcacaga gatgttgggc agtggccaga gccggagccc 1621 ctgggctgat ctgaccaatg aggtggatat gcctggagct ggttctgctg gtggcaagag 1681 ctccccagag ccctggctgt ggccccctac catggtccca cccagcatct caggccacag 1741 cagggcccct gtcctggagc tagagaaagc cgagggcccc agtgccaggc cagccacccc 1801 agacctgttt tggtccccct tggaggccac tgtctcagct cccagccctg ccccctggga 1861 ggcattccct gtggccacct ccccagatct ccctatgatg gccatgctgc gtggtcccaa 1921 agagtggatg ctaccacacc ccacccccat ctccaccgag gccaatagag ttgaggcaca 1981 tggtgaggcc accgccacgg ctccaccctc ccctgctgca gagaccaagg tgtattccct 2041 gcctctctct ttgaccccaa caggacaggg tggagaggcc atgcccacaa cacctgagtc 2101 ccccagggca gacttcagag aaactgggga gaccagccct gctcaggtca acaaagctga 2161 gcactccagc tccagcccat ggccttctgt aaacaggaat gtggctgtag gttttgtccc 2221 cactgagact gccactgagc caacgggcct caggggtatc ccggggtctg agtctggggt 2281 cttcgacaca gcagaaagcc ccacttctgg cttgcaggcc actgtagatg aggtgcagga 2341 cccctggccc tcagtgtaca gcaaagggct ggatgcaagt tccccatctg cccccctggg 2401 gagccctgga gtcttcttgg tacccaaagt caccccaaat ttggagcctt gggttgctac 2461 agatgaagga cccactgtga atcccatgga ttccacagtc acgccggccc ccagtgatgc 2521 tagtggaatt tgggaacctg gatcccaggt gtttgaagaa gccgaaagca ccaccttgag 2581 ccctcaggtg gccctggata caagcattgt gacgcccctc acgaccctgg agcaggggga 2641 caaggttgga gttccagcca tgtctacact gggctcctca agctcccaac cccacccaga 2701 gccagaggat caggtggaga cccagggaac atcaggagct tcagtgcctc cgcatcagag 2761 cagtccccta gggaaaccgg ctgttcctcc tgggacaccg actgcagcca gtgtgggcga 2821 gtctgcctca gtttcctcag gggagcctac ggtaccgtgg gacccctcca gcaccctgct 2881 gcctgtcacc ctgggcatag aggacttcga actggaggtc ctggcaggga gcccgggtgt 2941 agagagcttc tgggaggagg tggcaagtgg agaggagcca gccctgccag ggacccctat 3001 gaatgcaggt gcggaggagg tgcactcaga tccctgtgag aacaaccctt gtcttcatgg 3061 agggacatgt aatgccaatg gcaccatgta tggctgtagc tgtgatcagg gcttcgccgg 3121 ggagaactgt gagattgaca ttgatgactg cctctgcagc ccctgtgaga atggaggcac 3181 ctgtattgat gaggtcaatg gctttgtctg cctttgcctc cccagctatg ggggcagctt 3241 ttgtgagaaa gacaccgagg gctgtgaccg cggctggcat aagttccagg gccactgtta 3301 ccgctatttt gcccaccgga gggcatggga agatgccgag aaggactgcc gccgccgctc 3361 cggccacctg accagcgtcc actcaccgga ggaacacagc ttcattaata gctttgggca 3421 tgaaaacacg tggatcggcc tgaacgacag gatcgtggag agagatttcc agtggacgga 3481 caacaccggg ctgcaatttg agaactggcg agagaaccag ccggacaatt tcttcgcggg 3541 tggcgaggac tgtgtggtga tggtggcgca tgaaagcggg cgctggaacg atgtcccctg 3601 caactacaac ctaccctatg tctgcaagaa gggcacagtg ctctgtggtc cccctccggc 3661 agtggagaat gcctcactca tcggtgcccg caaggccaag aacaatgtcc atgccactgt 3721 aaggtaccag tgcaatgaag gatttgccca gcaccatgtg gtcaccattc gatgccggag 3781 caatggcaag tgggacaggc cccaaattgt ctgcaccaaa cccagacgtt cacatcggat 3841 gcggggacac caccaccacc accaacacca ccaccagcat caccaccaca aatcccgcaa 3901 ggagcgcaga aaacacaaga aacacccaac ggaggactgg gagaaggacg aagggaattt 3961 ttgctgaaga accagaaaaa agaaagcaca acacctttcc catgcctcct ctggagcctt 4021 cgcctgggga gacagaaccc agagagaaac aagagagtcc agaagtccct gaaccccaaa 4081 ctgttctcgc aaaaaaaata ttcctttgaa caaaggtctt cttttccttt ttttacatac 4141 acaagatctt cttggcaggt ggagccaggt gtctgaaaag ttcattctcg tctggctgaa 4201 ctctgggagt gtgtcccagc tgagggaagc acaagtagca aagctcattg gtctggtctc 4261 ttgtttgcca ggctgattga agcaggcctt gatgagggtg catgagtgta tgtttgcatt 4321 cacatgaagg aattgctttt cacaccagaa attcagactt agtcaatgtt ggctgaattc 4381 ctaaatccag gaagaagcct ggacgtaggg tcattagctt tgggaataga aggctacaca 4441 gaagcacact gtttttgaac ttgacaacag ctctcccttt accctggact tcagcccaag 4501 ttccgtcttt ggtcttggtg gataaacaca cagtgtggag atcccacgta ctgcatttta 4561 gggatgtttt taggacaacc tccctccatg ccttcagagt taggagtgag aatgatcaaa 4621 gcaatatgta ggtgatggag ggagagtgta ttgctaaccc ttccaggtct agtccagcgc 4681 tgagatttgg tggttctgca tgtgtgatga atctctttca cacaaataga cgagaggata 4741 tttagggcta gatgagccca gatttcttcc ccctccatct ctcagggaga caaagaacct 4801 ccttcctgga ccaaggaggt gctgccaagt tttctagccc agtgcacata cccagtcctt 4861 aagcagacat tggtagtgcc cctgccctgg gtcccactcc tgccccaccc cacccttgtc 4921 cctggccatt gcctggtggt ctagaaacac ttaaaacttg aagtagtgac acctacctgc 4981 ggtcatattg tagagagatg ctcagtgtta aaactgaaac acacaaacac acacacacac 5041 acatttttct cttgtagatt ttaatttttt aagtgggaaa gaactcacct tgccttcctc 5101 ccccaaatgt gcaacctgta aaaggtctct ccacaccagg ggccaggatc cagttccctc 5161 atctctggca ggaaagatcc acagcttttc ctccatgtct gttactcact ttcagcagtc 5221 cgggtaaaat ctgtggatca gggttaaaaa agcaccgtgg agaatggccc tcttcaggaa 5281 agaaaaataa gcaaatgaat ggtccaccta ggggttcagt aaagaaagaa atgtgttaac 5341 tgagcctgaa tcccttctgg gaagtaataa tgaccattga caactaagaa gtagacacca 5401 tgctaaagac ttacatacaa tctccttgaa tcttctcaat agcccattga cttagaaact 5461 gttactttcc cattttacac acagtgaaac tgaggctcag atataaagga aaggtactgg 5521 cttgaagtca caaccacgac aggagtaagg atttggaata aggatttggt cctgttttct 5581 ggaccaaatc cttactctgg ctctgcttac actttctctc catcaccaaa tccttactcc 5641 aaatccagaa gtcagagcca actcccatct tggttctgac ccaaatcctg ctctggactc 5701 tggagaggag attgaaatat aattgcaccc tcatacacat ttaggaaatg gttaagaagt 5761 gtaaactgaa cccttatcct tgtcttcaat cttcctccct gtagacatct atcttattat 5821 ggttattatt cagaaaaccc agggatacag gtttgtcttc ttactttgat aactcttctt 5881 agtttaaaat aataataata acacatcttt ggtcatctat gtcacacaaa aattttcctt 5941 tgtttgcggg gggctgggga tgcagtgttt tttggggggt cttggtttat gctccctgcc 6001 cttgagcccc tcagccgttt gccctgcccc cacctcggct ccatggtggg agggggctct 6061 ggtcttttct aaagtgggcg gtttgtcttt tgatctttcc cttttggatg tgcgtgtgtg 6121 tctgcgtgtg ccatgtgcgt ggcacgcata tgagtgtgtg tgcgtgtgaa cggctttggg 6181 tcctgctggt tttgctgtga gctgcagtgt tctgtgggtc tgtggtatct gacactgtgg 6241 acattaatgt acttcttgga cattttaata aattttttaa cagttcaaaa aaaaaaaaaa 6301 aaaaaaaaaa // LOCUS AF026548 1890 bp mRNA PRI 04-NOV-1997 DEFINITION Homo sapiens branched chain alpha-ketoacid dehydrogenase kinase precursor, mRNA, nuclear gene encoding mitochondrial protein, complete cds. ACCESSION AF026548 NID g2583172 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1890) AUTHORS Chuang,J.C., Cox,R.P. and Chuang,D.T. TITLE Direct Submission JOURNAL Submitted (24-SEP-1997) Biochemistry, UT Southwestern Medical Center, 5323 Harry Hines Blvd, Dallas, Tx 75235, USA FEATURES Location/Qualifiers source 1..1890 /organism="Homo sapiens" /db_xref="taxon:9606" transit_peptide 274..363 /note="mitochondrial targeting sequence" CDS 274..1512 /note="BCKD kinase" /codon_start=1 /product="branched chain alpha-ketoacid dehydrogenase kinase precursor" /db_xref="PID:g2583173" /translation="MILASVLRSGPGGGLPLRPLLGPALALRARSTSATDTHHVEMAR ERSKTVTSFYNQSAIDAAAEKPSVRLTPTMMLYAGRSQDGSHLLKSARYLQQELPVRI AHRIKGFRCLPFIIGCNPTILHVHELYIRAFQKLTDFPPIKDQADEAQYCQLVRQLLD DHKDVVTLLAEGLRESRKHIEDEKLVRYFLDKTLTSRLGIRMLATHHLALHEDKPDFF GIICTRLSPKKIIEKWVDFARRLCEHKYGNAPRVRINGHVAARFPFIPMPLDYILPEL LKNAMRATMESHLDTPYNVPDVVITIANNDVDLIIRISDRGGGIAHKDLDRVMDYHFT TAEASTQDPRISPLFGHLDMHSGAQSGPMHGFGFGLPTSRAYAEYLGGSLQLQSLQGI GTDVYLRLRHIDGREESFRI" mat_peptide 364..1509 /product="branched chain alpha-ketoacid dehydrogenase kinase" polyA_site 1848 BASE COUNT 396 a 613 c 518 g 363 t ORIGIN 1 atctgtcgac tgctaccggg cgcaggcggc tgggacaatg gcggtggact gttcgagccc 61 ttccgctggg acccgggccc tggctccggc cccgcgatgg gagctgctct ccgcgggctg 121 agcctgtcag catcctcgac gcaccctggt ccctgaagtc ggagaagcgc ccctacccac 181 ccacaccccc ttgccccatt ttgggtcgcc tgggtcctca gtcctagcgg atcctcagtc 241 ctagcggcca ccgggtctga aaggagcaag acgatgatcc tggcgtcggt gctgaggagc 301 ggtcccgggg gcgggcttcc gctccggccc ctcctgggac ccgcactcgc gctccgggcc 361 cgctcgacgt cggccaccga cacacaccac gtggagatgg ctcgggagcg ctccaagacc 421 gtcacctcct tttacaacca gtcggccatc gacgcggcag cggagaagcc ctcagtccgc 481 ctaacgccca ccatgatgct ctacgctggc cgctctcagg acggcagcca ccttctgaaa 541 agtgctcggt acctgcagca agaacttcca gtgaggattg ctcaccgcat caagggcttc 601 cgctgccttc ctttcatcat tggctgcaac cccaccatac tgcacgtgca tgagctatat 661 atccgtgcct tccagaagct gacagacttc cctccgatca aggaccaggc ggacgaggcc 721 cagtactgcc agctggtgcg acagctgctg gatgaccaca aggatgtggt gaccctcttg 781 gcagagggcc tacgtgagag ccggaagcac atagaggatg aaaagctcgt ccgctacttc 841 ttggacaaga cgctgacttc gaggcttgga atccgcatgt tggccacgca tcacctggcg 901 ctgcatgagg acaagcctga ctttttcggc atcatctgta ctcgtctctc accaaagaag 961 attattgaga agtgggtgga ctttgccaga cgcctgtgtg agcacaagta tggcaatgcg 1021 ccccgtgtcc gcatcaatgg ccatgtggct gcccggttcc ccttcatccc tatgccactg 1081 gactacatcc tgccggagct gctcaagaat gccatgagag ccacaatgga gagtcaccta 1141 gacactccct acaatgtccc agatgtggtc atcaccatcg ccaacaatga tgtcgatctg 1201 atcatcagga tctcagaccg tggtggagga atcgctcaca aagatctgga ccgggtcatg 1261 gactaccact tcactactgc tgaggccagc acacaggacc cccggatcag ccccctcttt 1321 ggccatctgg acatgcatag tggcgcccag tcaggaccca tgcacggctt tggcttcggg 1381 ttgcccacgt cacgggccta cgcggagtac ctcggtgggt ctctgcagct gcagtccctg 1441 cagggcattg gcacggacgt ctacctgcgg ctccgccaca tcgatggccg ggaggaaagc 1501 ttccggatct gaccccacag cctttggcct gctcacccga ccagcctggg ccgcattccc 1561 tgcaggacct cccgggtcag gcagggcggc cccctgctac cacacactgc tgcatcttgg 1621 gtctcaggga cccagacaga tggacttaca tggagctggg cactgccctg cctcaacagg 1681 gtccattgcc tcttgcctcc agaacttgga gagcagggaa gtgggcaccc ttgaggcctc 1741 cagcaccagt tccgtcattc tcgttcctgg ggaaccccca ctctgacctg ttattaaagt 1801 tcacattttg aatgccctct cgggccccgt gtgtggggag ggcaggtgaa aaaaaaaaaa 1861 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa // LOCUS AF026564 1581 bp DNA PRI 07-OCT-1997 DEFINITION Homo sapiens RNA binding protein II (RBMII) gene, complete cds. ACCESSION AF026564 NID g2465929 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1581) AUTHORS Chai,N.N., Zhou,H., Hernandez,J., Najmabadi,H., Bhasin,S. and Yen,P.H. TITLE Structure and organization of the RBM genes on the human Y chromosome: Transposition and amplification of an ancestral autosomal hnRNPG gene JOURNAL Unpublished REFERENCE 2 (bases 1 to 1581) AUTHORS Chai,N.N., Zhou,H., Hernandez,J., Najmabadi,H., Bhasin,S. and Yen,P.H. TITLE Direct Submission JOURNAL Submitted (25-SEP-1997) Division of Medical Genetics, E-4, Harbor-UCLA Medical Center, 1124 W. Carson St., Torrance, CA 90502, USA FEATURES Location/Qualifiers source 1..1581 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="CEPH YAC library" /clone="786C10" /sub_clone="7S2" /chromosome="Y" /map="Yq11.23" gene 119..1075 /gene="RBMII" CDS 119..1075 /gene="RBMII" /note="candidate for the Azoospermia Factor; germ-cell specific expression" /codon_start=1 /product="RNA binding protein" /db_xref="PID:g2465930" /translation="MVEADCHGKLFIGGLNREANEKVLKEVFAKHGPLLEVLLIKGRT SKSRDFVVIIFENAADAKNAARDMNGKSLDGKEIKVEQAKKPSFPSGGRRRPPPSSRN RSPSGSLRSARGSSGGTRPWLPSHEGHLDDGGYALDLNTSSSRGAIPIKRGPSSRSGG PPPKTSAPSAMARSNSWMGGQGPISRGRENYGGPPCREPISSWRNDRMSPRDDGYAIK ERNHPLSRESRDYAPLSRDYAYHDYGHSSWDEHFSRGYSDCDGCGEVMLEIILNVQVE VLIEMHFRDREPLMVHHLQECLCCLMVEAATMIIAINEIDMA" BASE COUNT 479 a 285 c 380 g 437 t ORIGIN 1 caggccagct gcagcggtct ttcctgcagt tggccctgtg gtgtcccgaa gccggatgca 61 tacgacctga gtgacgggag accctgaggc tgtttgtcct cctgaaaagc acctcacaat 121 ggtagaagca gattgtcatg gcaagctttt cattggtggc ctcaatagag aagccaatga 181 aaaggtgctt aaagaagtat ttgcaaaaca tggtcccctt ttggaagttc ttttgataaa 241 aggtcgaacc agtaagtcca gagattttgt ggtcattatt tttgagaatg ctgcagatgc 301 taagaatgct gccagagata tgaatggaaa gtctttggat ggaaaagaaa taaaagtaga 361 acaagcaaag aaaccatctt ttccaagtgg tggtaggcgg agaccaccac cttcttcaag 421 aaacagaagc ccttcaggaa gtctgagatc tgcaagagga agtagtggag gaacaagacc 481 gtggctgccc tcacatgaag gacacttgga tgatggtgga tacgctcttg atctcaacac 541 gagttcttct aggggagcca ttccaattaa aagaggtcca tcttcacgaa gtggaggtcc 601 tcctcctaaa acatctgctc cttctgctat ggcaagaagc aatagttgga tgggaggcca 661 aggtcccata tcacgtggaa gagagaatta tggaggtcct ccatgcagag agccaatctc 721 ttcctggaga aatgaccgta tgtcaccaag agatgatggt tatgcaatta aggaaagaaa 781 tcatccactt tcccgagaat ctagggatta tgctccactg tctagagact atgcatacca 841 tgattatggt cattctagtt gggatgaaca tttctctaga ggatatagtg attgtgatgg 901 ctgtggtgag gtgatgttag agatcattct gaacgtccaa gtggaagttc ttatagagat 961 gcatttcaga gatagggaac ctctcatggt gcaccatctg caggagtgcc tctgttgtct 1021 tatggtggaa gcagccacca tgattatagc aataaatgag atagatatgg cataagtcgg 1081 gagagttact caaggagctg tggtgatttt tattcccgtg attgtgggca cgttgacaga 1141 aaagaccaaa gcaatctacc ttctctggat agggtacacc ctgctccttg tgaaacatgt 1201 ggtagctcaa gatatttgtc atctacagga gatggtgggg aaggtggatc tgacaaaaga 1261 ggctgaagca gatatgaaag caagtattca aataatagtt attgcatact aaaccttgtt 1321 tgcaaatcga aaattgacct gttatttctg cattgttacc tgcgtcttac taaaagaaac 1381 atgtatgttt tgtggagaga ggtagatact aacttcctcc atgaattttt tgaggtattc 1441 aaaggaattt tatttccaat aaataaaggg aattttattt ccaagtaatt tcatactagc 1501 taatgctatt tgaaaactat ctgtttagat gtaatatcta cattaaaatt ttcagaataa 1561 aattttacat gtaatgcaaa a // LOCUS AF026692 2841 bp mRNA PRI 31-OCT-1997 DEFINITION Homo sapiens frizzled related protein frpHE mRNA, complete cds. ACCESSION AF026692 NID g2576419 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2841) AUTHORS Abu-Jawdeh,G.M., Comella,N., Brown,L.F., Tognazzi,K. and Kocher,O. TITLE frizzled related protein frpHE (Homo Sapiens) JOURNAL Unpublished REFERENCE 2 (bases 1 to 2841) AUTHORS Abu-Jawdeh,G.M., Comella,N., Brown,L.F., Tognazzi,K. and Kocher,O. TITLE Direct Submission JOURNAL Submitted (25-SEP-1997) Pathology, BIDMC, East Campus, 330 Brookline Avenue, Boston, MA 02215, USA FEATURES Location/Qualifiers source 1..2841 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="endometrium" CDS 193..1299 /note="frizzled related protein" /codon_start=1 /product="frpHE" /db_xref="PID:g2576420" /translation="MRVAGRERRFLSAGVAAREGSAMFLSILVALCLWLHLALGVRGA PCEAVRIPMCRHMPWNITRMPNHLHHSTQENAILAIEQYEELVDVNCSAVLRFFFCAM YAPICTLEFLHDPIKPCKSVCQRARDDCEPLMKMYNHSWPESLACDELPVYDRGVCIS PEAIVTDLPEDVKWIDITPDMMVQERPLDVDCKRLSPDRCKCKKVKPTLATYLSKNYS YVIHAKIKAVQRSGCNEVTTVVDVKEIFKSSSPIPRTQVPLITNSSCQCPHILPHQDV LIMCYEWRSRMMLLENCLVEKWRDQLSKRSIQWEERLQEQRRTVQDKKKTAGRTSRSN PPKPKGKPPAPKPASPKKNIKTRSAQKRTNPKRV" BASE COUNT 808 a 625 c 649 g 759 t ORIGIN 1 cagcggccgc tgaattctag ggcgggttcg cgccccgaag gctgagagct ggcgctgctc 61 gtgccctgtg tgccagacgg cggagctccg cggccggacc ccgcggcccc gctttgctgc 121 cgactggagt ttgggggaag aaactctcct gcgccccaga agatttcttc ctcggcgaag 181 ggacagcgaa agatgagggt ggcaggaaga gaaaggcgct ttctgtctgc cggggtcgca 241 gcgcgagagg gcagtgccat gttcctctcc atcctagtgg cgctgtgcct gtggctgcac 301 ctggcgctgg gcgtgcgcgg cgcgccctgc gaggcggtgc gcatccctat gtgccggcac 361 atgccctgga acatcacgcg gatgcccaac cacctgcacc acagcacgca ggagaacgcc 421 atcctggcca tcgagcagta cgaggagctg gtggacgtga actgcagcgc cgtgctgcgc 481 ttcttcttct gtgccatgta cgcgcccatt tgcaccctgg agttcctgca cgaccctatc 541 aagccgtgca agtcggtgtg ccaacgcgcg cgcgacgact gcgagcccct catgaagatg 601 tacaaccaca gctggcccga aagcctggcc tgcgacgagc tgcctgtcta tgaccgtggc 661 gtgtgcattt cgcctgaagc catcgtcacg gacctcccgg aggatgttaa gtggatagac 721 atcacaccag acatgatggt acaggaaagg cctcttgatg ttgactgtaa acgcctaagc 781 cccgatcggt gcaagtgtaa aaaggtgaag ccaactttgg caacgtatct cagcaaaaac 841 tacagctatg ttattcatgc caaaataaaa gctgtgcaga ggagtggctg caatgaggtc 901 acaacggtgg tggatgtaaa agagatcttc aagtcctcat cacccatccc tcgaactcaa 961 gtcccgctca ttacaaattc ttcttgccag tgtccacaca tcctgcccca tcaagatgtt 1021 ctcatcatgt gttacgagtg gcgttcaagg atgatgcttc ttgaaaattg cttagttgaa 1081 aaatggagag atcagcttag taaaagatcc atacagtggg aagagaggct gcaggaacag 1141 cggagaacag ttcaggacaa gaagaaaaca gccgggcgca ccagtcgtag taatcccccc 1201 aaaccaaagg gaaagcctcc tgctcccaaa ccagccagtc ccaagaagaa cattaaaact 1261 aggagtgccc agaagagaac aaacccgaaa agagtgtgag ctaactagtt tccaaagcgg 1321 agacttccga cttccttaca ggatgaggct gggcattgcc tgggacagcc tatgtaaggc 1381 catgtgcccc ttgccctaac aactcactgc agtgctcttc atagacacat cttgcagcat 1441 ttttcttaag gctatgcttc agtttttctt tgtaagccat cacaagccat agtggtaggt 1501 ttgccctttg gtacagaagg tgagttaaag ctggtggaaa aggcttattg cattgcattc 1561 agagtaacct gtgtgcatac tctagaagag tagggaaaat aatgcttgtt acaattcgac 1621 ctaatatgtg cattgtaaaa taaatgccat atttcaaaca aaacacgtaa tttttttaca 1681 gtatgtttta ttaccttttg atatctgttg ttgcaatgtt agtgatgttt taaaatgtga 1741 tgaaaatata atgtttttaa gaaggaacag tagtggaatg aatgttaaaa gatctttatg 1801 tgtttatggt ctgcagaagg atttttgtga tgaaagggga ttttttgaaa aattagagaa 1861 gtagcatatg gaaaattata atgtgttttt ttaccaatga cttcagtttc tgtttttagc 1921 tagaaactta aaaacaaaaa taataataaa gaaaaataaa taaaaaggag aggcagacaa 1981 tgtctggatt cctgtttttt ggttacctga tttccatgat catgatgctt cttgtcaaca 2041 ccctcttaag cagcaccaga aacagtgagt ttgtctgtac cattaggagt taggtactaa 2101 ttagttggct aatgctcaag tattttatac ccacaagaga ggtatgtcac tcatcttact 2161 tcccaggaca tccaccctga gaataatttg acaagcttaa aaatggcctt catgtgagtg 2221 ccaaattttg tttttcttca tttaaatatt ttctttgcct aaatacatgt gagaggagtt 2281 aaatataaat gtacagagag gaaagttgag ttccacctct gaaatgagaa ttacttgaca 2341 gttgggatac tttaatcaga aaaaaagaac ttatttgcag cattttatca acaaatttca 2401 taattgtgga caattggagg catttatttt aaaaaacaat tttattggcc ttttgctaac 2461 acagtaagca tgtattttat aaggcattca ataaatgcac aacgcccaaa ggaaataaaa 2521 tcctatctaa tcctactctc cactacacag aggtaatcac tattagtatt ttggcatatt 2581 attctccagg tgtttgctta tgcacttata aaatgatttg aacaaataaa actaggaacc 2641 tgtatacatg tgtttcataa cctgcctcct ttgcttggcc ctttattgag ataagttttc 2701 ctgtcaagaa agcagaaacc atctcatttc taacagctgt gttatattcc atagtatgca 2761 ttactcaaca aactgttgtg ctattggata cttaggtggt ttcttcactg acaatactga 2821 ataaacatct caccggaatt c // LOCUS AF026939 2056 bp mRNA PRI 07-JAN-1998 DEFINITION Homo sapiens CIG49 (cig49) mRNA, complete cds. ACCESSION AF026939 NID g2612967 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2056) AUTHORS Zhu,H., Cong,J.P. and Shenk,T. TITLE Use of differential display analysis to assess the effect of human cytomegalovirus infection on the accumulation of cellular RNAs: induction of interferon-responsive RNAs JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (25), 13985-13990 (1997) MEDLINE 98054347 REFERENCE 2 (bases 1 to 2056) AUTHORS Zhu,H., Cong,J. and Shenk,T. TITLE Direct Submission JOURNAL Submitted (26-SEP-1997) Molecular Biology, Princeton University, Washington Road, Princeton, NJ 08544, USA FEATURES Location/Qualifiers source 1..2056 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="foreskin" /cell_type="primary fibroblasts" gene 1..2056 /note="human cytomegalovirus inducible gene 49" /gene="cig49" CDS 95..1567 /gene="cig49" /codon_start=1 /product="CIG49" /db_xref="PID:g2612968" /translation="MSEVTKNSLEKILPQLKCHFTWNLFKEDSVSRDLEDRVCNQIEF LNTEFKATMYNLLAYIKHLDGNNEAALECLRQAEELIQQEHADQAEIRSLVTWGNYAW VYYHLGRLSDAQIYVDKVKQTCKKFSNPYSIEYSELDCEEGWTQLKCGRNERAKVCFE KALEEKPNNPEFSSGLAIAMYHLDNHPEKQFSTDVLKQAIELSPDNQYVKVLLGLKLQ KMNKEAEGEQFVEEALEKSPCQTDVLRSAAKFYRRKGDLDKAIELFQRVLESTPNNGY LYHQIGCCYKAKVRQMQNTGESEASGNKEMIEALKQYAMDYSNKALEKGLNPLNAYSD LAEFLETECYQTPFNKEVPDAEKQQSHQRYCNLQKYNGKSEDTAVQHGLEGLSISKKS TDKEEIKDQPQNVSENLLPQNAPNYWYLQGLIHKQNGDLLQAAKCYEKELGRLLRDAP SGIGSIFLSASELEDGSEEMGQGAVSSSPRELLSNSEQLN" BASE COUNT 645 a 437 c 526 g 448 t ORIGIN 1 gtggaaacct cttcagcatt tgcttggaat cagtaagcta aaaacaaaat caaccgggac 61 cccagctttt cagaactgca gggaaacagc catcatgagt gaggtcacca agaattccct 121 ggagaaaatc ctcccacagc tgaaatgcca tttcacctgg aacttattca aggaagacag 181 tgtctcaagg gatctagaag atagagtgtg taaccagatt gaatttttaa acactgagtt 241 caaagctaca atgtacaact tgttggccta cataaaacac ctagatggta acaacgaggc 301 agccctggaa tgcttacggc aagctgaaga gttaatccag caagaacatg ctgaccaagc 361 agaaatcaga agtctagtca cttggggaaa ctacgcctgg gtctactatc acttgggcag 421 actctcagat gctcagattt atgtagataa ggtgaaacaa acctgcaaga aattttcaaa 481 tccatacagt attgagtatt ctgaacttga ctgtgaggaa gggtggacac aactgaagtg 541 tggaagaaat gaaagggcga aggtgtgttt tgagaaggct ctggaagaaa agcccaacaa 601 cccagaattc tcctctggac tggcaattgc gatgtaccat ctggataatc acccagagaa 661 acagttctct actgatgttt tgaagcaggc cattgagctg agtcctgata accaatacgt 721 caaggttctc ttgggcctga aactgcagaa gatgaataaa gaagctgaag gagagcagtt 781 tgttgaagaa gccttggaaa agtctccttg ccaaacagat gtcctccgca gtgcagccaa 841 attttacaga agaaaaggtg acctagacaa agctattgaa ctgtttcaac gggtgttgga 901 atccacacca aacaatggct acctctatca ccagattggg tgctgctaca aggcaaaagt 961 aagacaaatg cagaatacag gagaatctga agctagtgga aataaagaga tgattgaagc 1021 actaaagcaa tatgctatgg actattcgaa taaagctctt gagaagggac tgaatcctct 1081 gaatgcatac tccgatctcg ctgagttcct ggagacggaa tgttatcaga caccattcaa 1141 taaggaagtc cctgatgctg aaaagcaaca atcccatcag cgctactgca accttcagaa 1201 atataatggg aagtctgaag acactgctgt gcaacatggt ttagagggtt tgtccataag 1261 caaaaaatca actgacaagg aagagatcaa agaccaacca cagaatgtat ccgaaaatct 1321 gcttccacaa aatgcaccaa attattggta tcttcaagga ttaattcata agcagaatgg 1381 agatctgctg caagcagcca aatgttatga gaaggaactg ggccgcctgc taagggatgc 1441 cccttcaggc ataggcagta ttttcctgtc agcatctgag cttgaggatg gtagtgagga 1501 aatgggccag ggcgcagtca gctccagtcc cagagagctc ctctctaact cagagcaact 1561 gaactgagac agaggaggaa aacagagcat cagaagcctg cagtggtggt tgtgacgggt 1621 aggaggatag gaagacaggg ggccccaacc tgggattgct gagcagggaa gctttgcatg 1681 ttgctctaag gtacattttt aaagagttgt tttttggccg ggcgcagtgg ctcatgcctg 1741 taatcccagc actttgggag gccgaggtgg gcggatcacg aggtctggag tttgagacca 1801 tcctggctaa cacagtgaaa tcccgtctct actaaaaata caaaaaatta gccaggcgtg 1861 gtggctggca cctgtagtcc cagctacttg ggaggctgag gcaggagaat ggcgtgaacc 1921 tggaaggaag aggttgcagt gagccaagat tgcgcccctg cactccagcc tgggcaacag 1981 agcaagactc ggaattcctg cagcccgggg gatccactat tctagagcgc cgcaacggcc 2041 gtggagtcca gagatg // LOCUS AF026947 1331 bp mRNA PRI 01-JAN-1998 DEFINITION Homo sapiens aflatoxin aldehyde reductase AFAR mRNA, complete cds. ACCESSION AF026947 NID g2736255 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1331) AUTHORS Ireland,L.S., Hayes,J.D., Harrison,D.J. and Neal,G.E. TITLE Molecular cloning, heterologous expression and catalytic activity of a novel human AKR7 member of the aldo-keto reductase superfamily: Evidence that the major 2-carboxybenzaldehyde reductase from human liver is a homologue of rat aflatoxin aldehyde reductase, AFAR JOURNAL Unpublished REFERENCE 2 (bases 1 to 1331) AUTHORS Ireland,L.S. and Hayes,J.D. TITLE Direct Submission JOURNAL Submitted (26-SEP-1997) Biomedical Research Centre, University of Dundee, Level 5, Ninewells Hospital and Medical School, Dundee DD1 9SY, Scotland FEATURES Location/Qualifiers source 1..1331 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" CDS 78..1070 /note="2-carboxybenzaldehyde reductase; member of aldo-keto reductase AKR7 family" /codon_start=1 /product="aflatoxin aldehyde reductase AFAR" /db_xref="PID:g2736256" /translation="MSRPPPPRVASVLGTMEMGRRMDAPASAAAVRAFLERGHTELDT AFMYSDGQSETILGGLGLGLGGGDCRVKIATKANPWDGKSLKPDSVRSQLETSLKRLQ CPQVDLFYLHTPDHGTPVEETLHACQRLHQEGKFVELGLSNYASWEVAEICTLCKSNG WILPTVYQGMYNATTRQVETELFPCLRHFGLRFYAYNPLAGGLLTGKYKYEDKDGKQP VGRFFGNSWAETYRNRFWKEHHFEAIALVEKALQAAYGASAPSVTSAALRWMYHHSQL QGAHGDAVILGMSSLEQLEQNLAATEEGPLEPAVVDAFNQAWHLVAHECPNYFR" BASE COUNT 252 a 416 c 393 g 270 t ORIGIN 1 ccgcgtctcg cgtagtctcc cgcgccgccg tccactgcgc gcttcgctct ccgccgcccg 61 aggcccgcgc gctcgccatg tcccggccac cgccaccgcg ggtcgcctcg gtgctgggca 121 ccatggagat ggggcgccgc atggacgcgc ccgccagcgc cgcggccgtg cgcgcctttc 181 tggagcgcgg ccacaccgaa ctggacacgg ccttcatgta cagcgacggc cagtccgaga 241 ccatcctggg cggcctgggg ctcgggctgg gcggtggcga ctgcagagtg aaaattgcca 301 ccaaggccaa cccttgggat ggaaaatcac taaagcctga cagtgtccgg tcccagctgg 361 agacgtcatt gaagaggctg cagtgtcccc aagtggacct cttctaccta cacacacctg 421 accacggcac cccggtggaa gagacgctgc atgcctgcca gcggctgcac caggagggca 481 agttcgtgga gcttggcctc tccaactatg ctagctggga agtggccgag atctgtaccc 541 tctgcaagag caatggctgg atcctgccca ctgtgtacca gggcatgtac aacgccacca 601 cccggcaggt ggaaacggag ctcttcccct gcctcaggca ctttggactg aggttctatg 661 cctacaaccc tctggctggg ggcctgctga ctggcaagta caagtatgag gacaaggacg 721 ggaaacagcc tgtgggccgc ttctttggga atagctgggc tgagacctac aggaatcgct 781 tctggaagga gcaccacttc gaggccattg cgttggtgga gaaggccctg caggccgcat 841 atggcgccag cgcccccagt gtgacctcgg ctgccctccg gtggatgtac caccactcac 901 agctgcaggg tgcccacggg gacgcggtca tcctgggcat gtccagcctg gagcagctgg 961 agcagaactt ggcagcaaca gaggaagggc ccctggagcc ggctgtcgtg gatgccttta 1021 atcaagcctg gcatttggtt gctcacgaat gtcccaacta cttccgctag gcccatcatg 1081 gctcaggctg cccaaggctt ttctgtcacc tcttttgttc tctcacactg accagtcttg 1141 gccttaagct gacttagaag ggtttttctg aattgtctag atccatgcat tatttttcta 1201 gcttcctgcc ttgctcccta ttcactttac actgtgaaag gtggggggtg agtcccactt 1261 gagcgcttcc tgttgaataa agcaggcact tgacctggct gtagcctagg tcttgagtga 1321 accccaaaaa a // LOCUS AF026977 649 bp mRNA PRI 03-NOV-1997 DEFINITION Homo sapiens microsomal glutathione S-transferase 3 (MGST3) mRNA, complete cds. ACCESSION AF026977 NID g2583080 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 649) AUTHORS Jakobsson,P.J., Mancini,J.A., Riendeau,D. and Ford-Hutchinson,A.W. TITLE Identification and characterization of a novel microsomal enzyme with glutathione-dependent transferase and peroxidase activities JOURNAL J. Biol. Chem. 272 (36), 22934-22939 (1997) MEDLINE 97426444 REFERENCE 2 (bases 1 to 649) AUTHORS Jakobsson,P.-J., Mancini,J.A., Riendeau,D. and Ford-Hutchinson,A.W. TITLE Direct Submission JOURNAL Submitted (25-SEP-1997) Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Solnavagen 1, Stockholm 171 77, Sweden FEATURES Location/Qualifiers source 1..649 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1q23" /note="derived from EST clone ID: 258762 deposited in GenBank Accession Number N40831" gene 1..649 /gene="MGST3" CDS 50..508 /gene="MGST3" /note="GSH-transferase; GSH-peroxidase" /codon_start=1 /product="microsomal glutathione S-transferase 3" /db_xref="PID:g2583081" /translation="MAVLSKEYGFVLLTGAASFIMVAHLAINVSKARKKYKVEYPIMY STDPENGHIFNCIQRAHQNTLEVYPPFLFFLAVGGVYHPRIASGLGLAWIVGRVLYAY GYYTGEPSKRSRGALGSIALLGLVGTTVCSAFQHLGWVKSGLGSGPKCCH" BASE COUNT 164 a 152 c 149 g 184 t ORIGIN 1 gaattcggca cgaggtgctc cagctgttcg aaggtgatcc agacgcaaga tggctgtcct 61 ctctaaggaa tatggttttg tgcttctaac tggtgctgcc agctttataa tggtggccca 121 cctagccatc aatgtttcca aggcccgcaa gaagtacaaa gtggagtatc ctatcatgta 181 cagcacggac cctgaaaatg ggcacatctt caactgcatt cagcgagccc accagaacac 241 gttggaagtg tatcctccct tcttattttt tctagctgtt ggaggtgttt accacccgcg 301 tatagcttct ggcctgggct tggcctggat tgttggacga gttctttatg cttatggcta 361 ttacacggga gaacccagca agcgtagtcg aggagccctg gggtccatcg ccctcctggg 421 cttggtgggc acaactgtgt gctctgcttt ccagcatctt ggttgggtta aaagtggctt 481 gggcagtgga cccaaatgct gccattaaag aattataggg gtttaaaaac tctcattcat 541 tttaaatgac ttacctttat ttccatttac attttttttc taaatataat aaaaacttac 601 ctggcatcag cctcatacct aaaaaaaaaa aaaaaaaaat cgcggccgc // LOCUS AF027150 1285 bp mRNA PRI 30-OCT-1997 DEFINITION Homo sapiens survival of motor neuron protein interacting protein 1 (SIP1) mRNA, complete cds. ACCESSION AF027150 NID g2570924 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1285) AUTHORS Fischer,U., Liu,Q. and Dreyfuss,G. TITLE The SMN-SIP1 complex has an essential role in spliceosomal snRNP biogenesis JOURNAL Cell 90 (6), 1023-1029 (1997) MEDLINE 97462903 REFERENCE 2 (bases 1 to 1285) AUTHORS Liu,Q., Fischer,U., Wang,F. and Dreyfuss,G. TITLE The spinal muscular atrophy disease gene product, SMN, and its associated protein SIP1 are in a complex with spliceosomal snRNP proteins JOURNAL Cell 90 (6), 1013-1021 (1997) MEDLINE 97462902 REFERENCE 3 (bases 1 to 1285) AUTHORS Liu,Q. and Dreyfuss,G. TITLE Direct Submission JOURNAL Submitted (29-SEP-1997) Howard Hughes Medical Inst., University of Pennsylvania, 415 Curie Blvd., Philadelphia, PA 19104, USA FEATURES Location/Qualifiers source 1..1285 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" gene 1..1285 /gene="SIP1" CDS 84..926 /gene="SIP1" /note="SMN protein interacting protein 1" /codon_start=1 /product="survival of motor neuron protein interacting protein 1" /db_xref="PID:g2570925" /translation="MRRAELAGLKTMAWVPAESAVEELMPRLLPVEPCDLTEGFDPSV PPRTPQEYLRRVQIEAAQCPDVVVAQIDPKKLKRKQSVNISLSGCQPAPEGYSPTLQW QQQQVAQFSTVRQNVNKHRSHWKSQQLDSNVTMPKSEDEEGWKKFCLGEKLCADGAVG PATNESPGIDYVQIGFPPLLSIVSRMNQATVTSVLEYLSNWFGERDFTPELGRWLYAL LACLEKPLLPEAHSLIRQLARRCSEVRLLVDSKDDERVPALNLLICLVSRYFDQRDLA DEPS" BASE COUNT 360 a 239 c 306 g 379 t 1 others ORIGIN 1 taacgctccc taaactgcca cttgntcagc tccgcgccta aggtgtctat tagtgcgcct 61 gcgctgtgac ctagaatggg cgcatgcgcc gagcggaact ggctggtttg aaaaccatgg 121 cgtgggtacc agcggagtcc gcagtggaag agttgatgcc tcggctattg ccggtagagc 181 cttgcgactt gacggaaggt ttcgatccct cggtaccccc gaggacgcct caggaatacc 241 tgaggcgggt ccagatcgaa gcagctcaat gtccagatgt tgtggtagct caaattgacc 301 caaagaagtt gaaaaggaag caaagtgtga atatttctct ttcaggatgc caacccgccc 361 ctgaaggtta ttccccaaca cttcaatggc aacagcaaca agtggcacag ttttcaactg 421 ttcgacagaa tgtgaacaaa catagaagtc actggaaatc acaacagttg gatagtaatg 481 tgacaatgcc aaaatctgaa gatgaagaag gctggaagaa attttgtctg ggtgaaaagt 541 tatgtgctga cggggctgtt ggaccagcca caaatgaaag tcctggaata gattatgtac 601 aaattggttt tcctcccttg cttagtattg ttagcagaat gaatcaggca acagtaacta 661 gtgtcttgga atatctgagt aattggtttg gagaaagaga ctttactcca gaattgggaa 721 gatggcttta tgctttattg gcttgtcttg aaaagccttt gttacctgag gctcattcac 781 tgattcggca gcttgcaaga aggtgctctg aagtgaggct cttagtggat agcaaagatg 841 atgagagggt tcctgctttg aatttattaa tctgcttggt tagcaggtat tttgaccaac 901 gtgatttagc tgatgagcca tcttgatgta gctgatctct cagggataga agatatttct 961 catgaaggca gcctaactct gaggaaaaca atgccaattc aagtacagat ttcaacacat 1021 cttcaacact atgtgaaggg ttcacatctt aacctgtgca attcagattg atactcagaa 1081 tatgggttga tttgaatatc tgaaatatca atggaaaatc ccactcagtt tttgatgaac 1141 agtttgaaca gttttctgta atcaagcagc ttgcatagaa attgtatgat gaaattttac 1201 ataggttctt ggtgctgttt tgttcttttt ttgttttttg ttgttttgtt atttacttat 1261 atacatataa aattttattg aaaat // LOCUS AF027204 708 bp mRNA PRI 06-NOV-1997 DEFINITION Homo sapiens putative tetraspan transmembrane protein L6H (TM4SF5) mRNA, complete cds. ACCESSION AF027204 NID g2587053 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 708) AUTHORS Mueller-Pillasch,F., Wallrapp,C., Lacher,U., Adler,G. and Gress,T.M. TITLE Direct Submission JOURNAL Submitted (29-SEP-1997) Medizinische Klinik, Internal Medicine I, Robert-Koch-Str.8, Ulm 89081, Germany FEATURES Location/Qualifiers source 1..708 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17" /map="17p13.3" /cell_line="pancreatic cancer Patu 8988t" gene 1..708 /gene="TM4SF5" CDS 33..626 /gene="TM4SF5" /note="similar to tumor-associated antigen L6" /codon_start=1 /product="putative tetraspan transmembrane protein L6H" /db_xref="PID:g2587054" /translation="MCTGKCARCVGLSLITLCFVCIVANALLLVPNGETSWTNTNHLS LQVWLMGGFIGGGLMVLCPGIAAVRAGGKGCCGAGCCGNRCRMLRSVFSSAFGVLGAI YCLSVSGAGLRNGPRCLMNGEWGYHFEDTAGAYLLNRTLWDRCEAPPRVVPWNVTLFS LLVAASCLEIVLCGIQLVNATIGVFCGDCRKKQDTPH" BASE COUNT 115 a 215 c 209 g 169 t ORIGIN 1 attcaccgcc tgtctttcct gaacacctca ccatgtgtac gggaaaatgt gcccgctgtg 61 tggggctctc cctcattacc ctctgcttcg tctgcattgt ggccaacgcc ctcctgctgg 121 tacctaatgg ggagacctcc tggaccaaca ccaaccatct cagcttgcaa gtctggctca 181 tgggcggctt cattggcggg ggcctaatgg tactgtgtcc agggattgca gccgttcggg 241 cagggggcaa gggctgctgt ggtgctgggt gctgtggaaa ccgctgcagg atgctgcgct 301 cggtcttctc ctcggcgttc ggggtgcttg gtgccatcta ctgcctctcg gtgtctggag 361 ctgggctccg aaatggaccc agatgcttaa tgaacggcga gtggggctac cacttcgaag 421 acaccgcggg agcttacttg ctcaaccgca ctctatggga tcggtgcgag gcgccccctc 481 gcgtggtccc ctggaatgtg acgctcttct cgctgctggt ggccgcctcc tgcctggaga 541 tagtactgtg tgggatccag ctggtgaacg cgaccattgg tgtcttctgc ggcgattgca 601 ggaaaaaaca ggacacacct cactgaggct ccactgaccg ccgggttaca cctgctcctt 661 cctggacgct cactccctgc tcgctagaat aaactgcttt gcgctctc // LOCUS AF027205 1564 bp mRNA PRI 10-NOV-1997 DEFINITION Homo sapiens Kunitz-type protease inhibitor (kop) mRNA, complete cds. ACCESSION AF027205 NID g2598967 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1564) AUTHORS Mueller-Pillasch,F., Wallrapp,C., Bartels,K., Varga,G., Friess,H., Buechler,M., Adler,G. and Gress,T.M. TITLE Cloning of a new Kunitz-type protease inhibitor with a putative transmembrane domain overexpressed in pancreatic cancer JOURNAL Biochim. Biophys. Acta (1997) In press REFERENCE 2 (bases 1 to 1564) AUTHORS Mueller-Pillasch,F., Wallrapp,C., Bartels,K., Adler,G. and Gress,T.M. TITLE Direct Submission JOURNAL Submitted (29-SEP-1997) Medizinische Klinik, Internal Medicine I, Robert-Koch-Str.8, Ulm 89081, Germany FEATURES Location/Qualifiers source 1..1564 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="19" /map="19q13.1" /cell_line="Patu 8988t" /tissue_type="pancreatic cancer" gene 1..1564 /gene="kop" CDS 365..1123 /gene="kop" /note="KOP; contains putative transmembrane domain; similar to human placental bikunin; overexpressed in pancreatic cancer" /codon_start=1 /product="Kunitz-type protease inhibitor" /db_xref="PID:g2598968" /translation="MAHLCGLRRSRAFLALLGSLLLSGVLAADRERSIHDFCLVSKVV GRCRASMPKWWYNVTDGSCQLFVYGGCDGNSNNYLTKEECLKKCATVTENATGDLATS RNAADSSVPSAPRRQDSEDHSSDMFNYEEYCTANAVTGPCRASFPRWYFDVERNSCNN FIYGGCRGNKNSYRSEEACMLRCFRQQENPPLPLGSKVVVLAGLFVMVLILFLGASMV YLIRVARRNQERALRTVWSSGHDKEQLVKNTYVL" BASE COUNT 297 a 419 c 491 g 357 t ORIGIN 1 cggacgcgtg ggcggacgcg tgggcgaggg cgcgagtgag gagcagaccc aggcatcgcg 61 cgccgagaag gccggagcgt cggcacctga acgcgaggcg ctccattgcg cgtgcgcgtt 121 gaggggcttc ccgcacctga tcgcgagacc ccaacggctg gtggcgtcgc ctgcgcgggc 181 gtccccacac tgccggtccg gaaaggcgac ttccgggggc tttggcacct ggcggacgct 241 cccggagcgt cggcacctga acgcgaggcg ctccattgcg cgtgcgcgtt gaggggcttc 301 ccgcacctga tcgcgagacc ccaacggctg gtggcgtcgc ctgcgcgtct cggctgagct 361 ggccatggcg cacctgtgcg ggctgaggcg gagccgggcg tttctcgccc tgctgggatc 421 gctgctcctc tctggggtcc tggcggccga ccgagaacgc agcatccacg acttctgcct 481 ggtgtcgaag gtggtgggca gatgccgggc ctccatgcct aagtggtggt acaatgtcac 541 tgacggatcc tgccagctgt ttgtgtatgg gggctgtgac ggaaacagca ataattacct 601 gaccaaggag gagtgcctca agaaatgtgc cactgtcaca gagaatgcca cgggtgacct 661 ggccaccagc aggaatgcag cggattcctc tgtcccaagt gctcccagaa ggcaggattc 721 tgaagaccac tccagcgata tgttcaacta tgaagaatac tgcaccgcca acgcagtcac 781 tgggccttgc cgtgcatcct tcccacgctg gtactttgac gtggagagga actcctgcaa 841 taacttcatc tatggaggct gccggggcaa taagaacagc taccgctctg aggaggcctg 901 catgctccgc tgcttccgcc agcaggagaa tcctcccctg ccccttggct caaaggtggt 961 ggttctggcg gggctgttcg tgatggtgtt gatcctcttc ctgggagcct ccatggtcta 1021 cctgatccgg gtggcacgga ggaaccagga gcgtgccctg cgcaccgtct ggagctccgg 1081 acatgacaag gagcagctgg tgaagaacac atatgtcctg tgaccgccct gtcgccaaga 1141 ggactgggga agggagggga gactatgtgt gagctttttt taaatagcgg gattgactcg 1201 gatttgagtg atcattaggg ctgaggtgtg tttctctggg aggtaggacg gctgcttcct 1261 ggtctggcag ggatgggttt gctttggaaa tcctctagga ggctcctcct cgcatggcct 1321 gcagtctggc agcagccccg agttgtttcc tcgctgatcg atttctttcc tccaggtaga 1381 gttttctttg cttatgttga attccattgc ctcttttctc atcacagaag tgatgttgga 1441 atcgtttctt ttgtttgtct gatttatggt ttttttaagt ataaacaaaa gttttttatt 1501 aacatctgaa agaaggaaag taaaatgtac aagtttaata aaaaggggcc ttccccttta 1561 gaat // LOCUS AF027208 3794 bp mRNA PRI 24-DEC-1997 DEFINITION Homo sapiens AC133 antigen mRNA, complete cds. ACCESSION AF027208 NID g2688948 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3794) AUTHORS Miraglia,S., Godfrey,W., Yin,A.H., Atkins,K., Warnke,R., Holden,J.T., Bray,R.A., Waller,E.K. and Buck,D.W. TITLE A novel five-transmembrane hematopoietic stem cell antigen: isolation, characterization, and molecular cloning JOURNAL Blood 90 (12), 5013-5021 (1997) MEDLINE 98052559 REFERENCE 2 (bases 1 to 3794) AUTHORS Yin,A.H., Miraglia,S., Zanjani,E.D., Almeida-Porada,G., Ogawa,M., Leary,A.G., Olweus,J., Kearney,J. and Buck,D.W. TITLE AC133, a novel marker for human hematopoietic stem and progenitor cells JOURNAL Blood 90 (12), 5002-5012 (1997) MEDLINE 98052558 REFERENCE 3 (bases 1 to 3794) AUTHORS Miraglia,S.J., Godfrey,W.R., Yin,A.H. and Buck,D.W. TITLE Direct Submission JOURNAL Submitted (28-SEP-1997) AmCell Corporation, 1190 Bordeaux Dr., Sunnyvale, CA 94089, USA FEATURES Location/Qualifiers source 1..3794 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="CD34+ stem cells" /tissue_type="fetal liver; WERI RB-1 retinoblastoma" CDS 38..2635 /note="5-transmembrane cell surface receptor" /codon_start=1 /product="AC133 antigen" /db_xref="PID:g2688949" /translation="MALVLGSLLLLGLCGNSFSGGQPSSTDAPKAWNYELPATNYETQ DSHKAGPIGILFELVHIFLYVVQPRDFPEDTLRKFLQKAYESKIDYDKPETVILGLKI VYYEAGIILCCVLGLLFIILMPLVGYFFCMCRCCNKCGGEMHQRQKENGPFLRKCFAI SLLVICIIISIGIFYGFVANHQVRTRIKRSRKLADSNFKDLRTLLNETPEQIKYILAQ YNTTKDKAFTDLNSINSVLGGGILDRLRPNIIPVLDEIKSMATAIKETKEALENMNST LKSLHQQSTQLSSSLTSVKTSLRSSLNDPLCLVHPSSETCNSIRLSLSQLNSNPELRQ LPPVDAELDNVNNVLRTDLDGLVQQGYQSLNDIPDRVQRQTTTVVAGIKRVLNSIGSD IDNVTQRLPIQDILSAFSVYVNNTESYIHRNLPTLEEYDSYWWLGGLVICSLLTLIVI FYYLGLLCGVCGYDRHATPTTRGCVSNTGGVFLMVGVGLSFLFCWILMIIVVLTFVFG ANVEKLICEPYTSKELFRVLDTPYLLNEDWEYYLSGKLFNKSKMKLTFEQVYSDCKKN RGTYGTLHLQNSFNISEHLNINEHTGSISSELESLKVNLNIFLLGAAGRKNLQDFAAC GIDRMNYDSYLAQTGKSPAGVNLLSFAYDLEAKANSLPPGNLRNSLKRDAQTIKTIHQ QRVLPIEQSLSTLYQSVKILQRTGNGLLERVTRILASLDFAQNFITNNTSSVIIEETK KYGRTIIGYFEHYLQWIEFSISEKVASCKPVATALDTAVDVFLCSYIIDPLNLFWFGI GKATVFLLPALIFAVKLAKYYRRMDSEDVYDDVETIPMKNMENGNNGYHKDHVYGIHN PVMTSPSQH" BASE COUNT 1118 a 765 c 813 g 1098 t ORIGIN 1 ccaagttcta cctcatgttt ggaggatctt gctagctatg gccctcgtac tcggctccct 61 gttgctgctg gggctgtgcg ggaactcctt ttcaggaggg cagccttcat ccacagatgc 121 tcctaaggct tggaattatg aattgcctgc aacaaattat gagacccaag actcccataa 181 agctggaccc attggcattc tctttgaact agtgcatatc tttctctatg tggtacagcc 241 gcgtgatttc ccagaagata ctttgagaaa attcttacag aaggcatatg aatccaaaat 301 tgattatgac aagccagaaa ctgtaatctt aggtctaaag attgtctact atgaagcagg 361 gattattcta tgctgtgtcc tggggctgct gtttattatt ctgatgcctc tggtggggta 421 tttcttttgt atgtgtcgtt gctgtaacaa atgtggtgga gaaatgcacc agcgacagaa 481 ggaaaatggg cccttcctga ggaaatgctt tgcaatctcc ctgttggtga tttgtataat 541 aataagcatt ggcatcttct atggttttgt ggcaaatcac caggtaagaa cccggatcaa 601 aaggagtcgg aaactggcag atagcaattt caaggacttg cgaactctct tgaatgaaac 661 tccagagcaa atcaaatata tattggccca gtacaacact accaaggaca aggcgttcac 721 agatctgaac agtatcaatt cagtgctagg aggcggaatt cttgaccgac tgagacccaa 781 catcatccct gttcttgatg agattaagtc catggcaaca gcgatcaagg agaccaaaga 841 ggcgttggag aacatgaaca gcaccttgaa gagcttgcac caacaaagta cacagcttag 901 cagcagtctg accagcgtga aaactagcct gcggtcatct ctcaatgacc ctctgtgctt 961 ggtgcatcca tcaagtgaaa cctgcaacag catcagattg tctctaagcc agctgaatag 1021 caaccctgaa ctgaggcagc ttccacccgt ggatgcagaa cttgacaacg ttaataacgt 1081 tcttaggaca gatttggatg gcctggtcca acagggctat caatccctta atgatatacc 1141 tgacagagta caacgccaaa ccacgactgt cgtagcaggt atcaaaaggg tcttgaattc 1201 cattggttca gatatcgaca atgtaactca gcgtcttcct attcaggata tactctcagc 1261 attctctgtt tatgttaata acactgaaag ttacatccac agaaatttac ctacattgga 1321 agagtatgat tcatactggt ggctgggtgg cctggtcatc tgctctctgc tgaccctcat 1381 cgtgattttt tactacctgg gcttactgtg tggcgtgtgc ggctatgaca ggcatgccac 1441 cccgaccacc cgaggctgtg tctccaacac cggaggcgtc ttcctcatgg ttggagttgg 1501 attaagtttc ctcttttgct ggatattgat gatcattgtg gttcttacct ttgtctttgg 1561 tgcaaatgtg gaaaaactga tctgtgaacc ttacacgagc aaggaattat tccgggtttt 1621 ggatacaccc tacttactaa atgaagactg ggaatactat ctctctggga agctatttaa 1681 taaatcaaaa atgaagctca cttttgaaca agtttacagt gactgcaaaa aaaatagagg 1741 cacttacggc actcttcacc tgcagaacag cttcaatatc agtgaacatc tcaacattaa 1801 tgagcatact ggaagcataa gcagtgaatt ggaaagtctg aaggtaaatc ttaatatctt 1861 tctgttgggt gcagcaggaa gaaaaaacct tcaggatttt gctgcttgtg gaatagacag 1921 aatgaattat gacagctact tggctcagac tggtaaatcc cccgcaggag tgaatctttt 1981 atcatttgca tatgatctag aagcaaaagc aaacagtttg cccccaggaa atttgaggaa 2041 ctccctgaaa agagatgcac aaactattaa aacaattcac cagcaacgag tccttcctat 2101 agaacaatca ctgagcactc tataccaaag cgtcaagata cttcaacgca cagggaatgg 2161 attgttggag agagtaacta ggattctagc ttctctggat tttgctcaga acttcatcac 2221 aaacaatact tcctctgtta ttattgagga aactaagaag tatgggagaa caataatagg 2281 atattttgaa cattatctgc agtggatcga gttctctatc agtgagaaag tggcatcgtg 2341 caaacctgtg gccaccgctc tagatactgc tgttgatgtc tttctgtgta gctacattat 2401 cgaccccttg aatttgtttt ggtttggcat aggaaaagct actgtatttt tacttccggc 2461 tctaattttt gcggtaaaac tggctaagta ctatcgtcga atggattcgg aggacgtgta 2521 cgatgatgtt gaaactatac ccatgaaaaa tatggaaaat ggtaataatg gttatcataa 2581 agatcatgta tatggtattc acaatcctgt tatgacaagc ccatcacaac attgatagct 2641 gatgttgaaa ctgcttgagc atcaggatac tcaaagtgga aaggatcaca gatttttggt 2701 agtttctggg tctacaagga ctttccaaat ccaggagcaa cgccagtggc aacgtagtga 2761 ctcaggcggg caccaaggca acggcaccat tggtctctgg gtagtgcttt aagaatgaac 2821 acaatcacgt tatagtccat ggtccatcac tattcaagga tgactccctc ccttcctgtc 2881 tatttttgtt ttttactttt ttacactgag tttctattta gacactacaa catatggggt 2941 gtttgttccc attggatgca tttctatcaa aactctatca aatgtgatgg ctagattcta 3001 acatattgcc atgtgtggag tgtgctgaac acacaccagt ttacaggaaa gatgcatttt 3061 gtgtacagta aacggtgtat ataccttttg ttaccacaga gttttttaaa caaatgagta 3121 ttataggact ttcttctaaa tgagctaaat aagtcaccat tgacttcttg gtgctgttga 3181 aaataatcca ttttcactaa aagtgtgtga aacctacagc atattcttca cgcagagatt 3241 ttcatctatt atactttatc aaagattggc catgttccac ttggaaatgg catgcaaaag 3301 ccatcataga gaaacctgcg taactccatc tgacaaattc aaaagagaga gagagatctt 3361 gagagagaaa tgctgttcgt tcaaaagtgg agttgtttta acagatgcca attacggtgt 3421 acagtttaac agagttttct gttgcattag gataaacatt aattggagtg cagctaacat 3481 gagtatcatc agactagtat caagtgttct aaaatgaaat atgagaagat cctgtcacaa 3541 ttcttagatc tggtgtccag catggatgaa acctttgagt ttggtcccta aatttgcatg 3601 aaagcacaag gtaaatattc atttgcttca ggagtttcat gttggatctg tcattatcaa 3661 aagtgatcag caatgaagaa ctggtcggac aaaatttaac gttgatgtaa tggaattcca 3721 gatgtaggca ttccccccag gtcttttcat gtgcagattg cagttctgat tcatttgaat 3781 aaaaaggaac ttgg // LOCUS AF027302 3141 bp mRNA PRI 22-OCT-1997 DEFINITION Homo sapiens TNF-alpha stimulated ABC protein (TSAP) mRNA, complete cds. ACCESSION AF027302 NID g2522533 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3141) AUTHORS Richard,M. and Beaulieu,A.D. TITLE TSAP, a novel human ATP-binding cassette (ABC) protein found in TNF-alpha-stimulated synoviocytes JOURNAL Unpublished REFERENCE 2 (bases 1 to 3141) AUTHORS Richard,M. and Beaulieu,A.D. TITLE Direct Submission JOURNAL Submitted (29-SEP-1997) Dept of Medicine, Centre Hospitalier de l'Universite Laval, 2705 boul Laurier, Sainte-Foy, Quebec G1V 4G2, Canada FEATURES Location/Qualifiers source 1..3141 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..3141 /gene="TSAP" CDS 95..2518 /gene="TSAP" /codon_start=1 /evidence=not_experimental /product="TNF-alpha stimulated ABC protein" /db_xref="PID:g2522534" /translation="MPKAPKQQPPEPEWIGDGESTSPSDKVVKKGKKDKKIKKTFFEE LAVEDKQAGEEEKVLKEKEQQQQQQQQQQKKKRDTRKGRRKKDVDDDGEEKELMERLK KLSVPTSDEEDEVPAPKPRGGKKTKGGNVFAALIQDQSEEEEEEEKHPPKPAKPEKNR INKAVSEEQQPALKGKKGKEEKSKGKAKPQNKFAALDNEEEDKEEEIIKEKEPPKQGK EKAKKAEQMEYERQVASLKAANAAENDFSVSQAEMSSRQAMLENASDIKLEKFSISAH GKELFVNADLYIVAGRRYGLVGPNGKGKTTLLKHIANRALSIPPNIDVLLCEQEVVAD ETPAVQAVLRADTKRLKLLEEERRLQGQLEQGDDTAAERLEKVYEELRATGAAAAEAK ARRILAGLGFDPEMQNRPTQKFSGGWRMRVSLARALFMEPTLLMLDEPTNHLDLNAVI WLNNYLQGWRKTLLIVSHDQGFLDDVCTDIIHLDAQRLHYYRGNYMTFKKMYQQKQKE LLKQYEKQEKKLKELKAGGKSTKQAEKQTKEALTRKQQKCRRKNQDEESQEAPELLKR PKEYTVRFTFPDPPPLSPPVLGLHGVTFGYQGQKPLFKNLDFGIDMDSRICIVGPNGV GKSTLLLLLTGKLTPTHGEMRKNHRLKIGFFNQQYAEQLRMEETPTEYLQRGFNLPYQ DARKCLGRFGLESHAHTIQICKLSGGQKARVVFAELACREPDVLILDEPTNNLDIESI DALGEAINEYKGAVIVVSHDARLITETNCQLWVVEEQSVSQIDGDFEDYKREVLEALG EVMVSRPRE" BASE COUNT 867 a 779 c 902 g 593 t ORIGIN 1 gcgccgactt ggagagccag ccccatcggg ttccccgccg ccggaagcgg aaatagcacc 61 gggcgccgcc acagtagctg taactgccac cgcgatgccg aaggcgccca agcagcagcc 121 gccggagccc gagtggatcg gggacggaga gagcacgagc ccatcagaca aagtggtgaa 181 gaaagggaag aaggacaaga agatcaaaaa aacgttcttt gaagagctgg cagtagaaga 241 taaacaggct ggggaagaag agaaagtgct caaggagaag gagcagcagc agcagcaaca 301 gcaacagcag caaaaaaaaa agcgagatac ccgaaaaggc aggcggaaga aggatgtgga 361 tgatgatgga gaagagaaag agctcatgga gcgtcttaag aagctctcag tgccaaccag 421 tgatgaggag gatgaagtac ccgccccaaa accccgcgga gggaagaaaa ccaagggtgg 481 taatgttttt gcagccctga ttcaggatca gagtgaggaa gaggaggagg aagaaaaaca 541 tcctcctaag cctgccaagc cggagaagaa tcggatcaat aaggccgtat ctgaggaaca 601 gcagcctgca ctcaagggca aaaagggaaa ggaagagaag tcaaaaggga aggctaagcc 661 tcaaaataaa ttcgctgctc tggacaatga agaggaggat aaagaagaag aaattataaa 721 ggaaaaggag cctcccaaac aagggaagga gaaggccaag aaggcagagc agatggagta 781 tgagcgccaa gtggcttcat taaaagcagc caatgcagct gaaaatgact tctccgtgtc 841 ccaggcggag atgtcctccc gccaagccat gttagaaaat gcatctgaca tcaagctgga 901 gaagttcagc atctccgctc atggcaagga gctgttcgtc aatgcagacc tgtacattgt 961 agccggccgc cgctacgggc tggtaggacc caatggcaag ggcaagacca cactcctcaa 1021 gcacattgcc aaccgagccc tgagcatccc tcccaacatt gatgtgttgc tgtgtgagca 1081 ggaggtggta gcagatgaga caccagcagt ccaggctgtt cttcgagctg acaccaagcg 1141 attgaagctg ctggaagagg agcggcggct tcagggacag ctggaacaag gggatgacac 1201 agctgctgag aggctagaga aggtgtatga ggaattgcgg gccactgggg cggcagctgc 1261 agaggccaaa gcacggcgga tcctggctgg cctgggcttt gaccctgaaa tgcagaatcg 1321 acccacacag aagttctcag ggggctggcg catgcgtgtc tccctggcca gggcactgtt 1381 catggagccc acactgctga tgctggatga gcccaccaac cacctggacc tcaacgctgt 1441 catctggctt aataactacc tccagggctg gcggaagacc ttgctgatcg tctcccatga 1501 ccagggcttc ttggatgatg tctgcactga tatcatccac ctcgatgccc agcggctcca 1561 ctactatagg ggcaattaca tgaccttcaa aaagatgtac cagcagaagc agaaagaact 1621 gctgaaacag tatgagaagc aagagaaaaa gctgaaggag ctgaaggcag gcgggaagtc 1681 caccaagcag gcggaaaaac aaacgaagga agccctgact cggaagcagc agaaatgccg 1741 acggaaaaac caagatgagg aatcccagga ggcccctgag ctcctgaagc gccctaagga 1801 gtacactgtg cgcttcactt ttccagaccc cccaccactc agccctccag tgctgggtct 1861 gcatggtgtg acattcggct accagggaca gaaaccactc tttaagaact tggattttgg 1921 catcgacatg gattcaagga tttgcattgt gggccctaat ggtgtgggga agagtacgct 1981 actcctgctg ctgactggca agctgacacc gacccatggg gaaatgagaa agaaccaccg 2041 gctgaaaatt ggcttcttca accagcagta tgcagagcag ctgcgcatgg aggagacgcc 2101 cactgagtac ctgcagcggg gcttcaacct gccctaccag gatgcccgca agtgcctggg 2161 ccgcttcggc ctggagagtc acgcccacac catccagatc tgcaaactct ctggtggtca 2221 gaaggcgcga gttgtgtttg ctgagctggc ctgtcgggaa cctgatgtcc tcatcttgga 2281 cgagccaacc aataacctgg acatagagtc tattgatgct ctaggggagg ccatcaatga 2341 atacaagggt gctgtgatcg ttgtcagcca tgatgcccga ctcatcacag aaaccaattg 2401 ccagctgtgg gtggtggagg agcagagtgt tagccaaatc gatggtgact ttgaagacta 2461 caagcgggag gtgttggagg ccctgggtga agtcatggtc agccggcccc gagagtgaag 2521 ctttccttcc cagaagtctc ccgagagaca tatttgtgtg gcctagaagt cctctgtggt 2581 ctcccctcct ctgaagactg cctctggcct gcagctgacc tggcaaccat tcaggcacat 2641 gaaggtggag tgtgaccttg atgtgaccgg gatcccactc tgattgcatc catttctctg 2701 aaagacttgt ttgttctgct tctcttcata taactgagct ggccttatcc ttggcatccc 2761 cctaaacaaa caagaggtga ccaccttatt gtgaggttcc atccagccaa gtttatgtgg 2821 cctattgtct caggactctc atcactcaga agcctgcctc tgatttaccc tacagcttca 2881 ggcccagctg ccccccagtc tttgggtggt gctgttcttt tctggtggat ttaatgctga 2941 ctcactggta caaacagctg ttgaagctca gagctggagg tgagcttctg aggcctttgc 3001 cattatccag cccaagattt ggtgcctgca gcctcttgtc tggttgagga cttggggcag 3061 gaaaggaatg ctgctgaact tgaatttccc tttacaaggg gaagaaataa aggaaaggag 3121 ttgctgccga aaaaaaaaaa a // LOCUS AF027515 2233 bp mRNA PRI 16-JAN-1998 DEFINITION Homo sapiens trans-golgi network glycoprotein 48 (TGN) mRNA, complete cds. ACCESSION AF027515 NID g2772909 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2233) AUTHORS Kain,R., Angata,K., Kerjaschki,D. and Fukuda,M. TITLE Molecular cloning and expression of a novel human trans-golgi network glycoprotein, TGN51, that contains multiple tyrosine-containing motifs JOURNAL J. Biol. Chem. 273 (2), 981-988 (1998) MEDLINE 98086273 REFERENCE 2 (bases 1 to 2233) AUTHORS Kain,R., Angata,K., Kerjaschki,D. and Fukuda,M. TITLE Direct Submission JOURNAL Submitted (01-OCT-1997) Glycobiology Program, The Burnham Institute, 10901 N. Torrey Pines Rd, La Jolla, CA 92037, USA COMMENT hTGN48 cDNA and deduced amino acid sequences are identical with hTGN46 in extracellular, transmembranous and part of cytoplasmic tail. hTGN48 contains a longer cytoplasmic tail produced by alternate usage of 3'-splice site in intron III of genomic sequence. This alternate splicing results in frame-shift and leads to a novel larger translation product with one additional, weak tyrosine-containing motif. FEATURES Location/Qualifiers source 1..2233 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="2" /map="2p11.2" /tissue_type="liver" /dev_stage="fetal and adult" gene 1..2233 /gene="TGN" CDS 63..1424 /gene="TGN" /note="trans-golgi network glycoprotein 48" /codon_start=1 /product="hTGN48" /db_xref="PID:g2772910" /translation="MRFVVALVLLNVAAAGAVPLLATESVKQEEAGVRPSAGNVSTHP SLSQRPGGSTKSHPEPQTPKDSPSKSSAEAQTPEDTPNKSGGEAKTLKDSSNKSGAEA QTPKGSTSKSGSEAQTTKDSTSKSHPELQTPKDSTGKSGAEAQTPEDSPNRSGAEPKT QKDSPSKSGSEAQTTKDVPNKSGADGQTPKDGSSKSGAEDQTPKDVPNKSGAEKQTPK DGSNKSGAEEQGPIDGPSKSGAEEQTSKDSPNKVVPEQPSRKDHSKPISNPSDNKELP KADTNQLADKGKLSPHAFKTESGEETDLISPPQEEVKSSEPTEDVGPKEAEDDDTGPE EGSPPKEEKEKMSGSASSENREGTLSDSTGSEKDDLYPNGSGNGSAESSHFFAYLVTA AILVAVLYIAHHNKRKIIAFVLEGKRSKVTRRPKASDYQRLDQKIFSPPSPNRMVYSS GKR" sig_peptide 63..125 /gene="TGN" /product="hTGN48" polyA_signal 2196..2201 /gene="TGN" BASE COUNT 600 a 572 c 605 g 456 t ORIGIN 1 agaggggccc cgcgcgcgga tctcgcgaga gcattagagg gcggaagcgc tatccgagca 61 ggatgcggtt cgtggttgcc ttggtcctcc tgaacgtcgc agcggcggga gccgtgccgc 121 tcttggccac cgaaagcgtc aagcaagaag aagctggagt acggccttct gcaggaaacg 181 tctccaccca ccccagcttg agccaacggc ctggaggctc taccaagtcg catccggagc 241 cgcagactcc aaaagacagc cctagcaagt cgagtgcgga ggcgcagacc ccagaagaca 301 cccccaacaa gtcgggtggg gaggcaaaga ccctaaaaga cagctccaac aagtcgggtg 361 cggaggcaca gacccccaaa ggcagcacta gcaagtcggg ttcggaggcg cagaccacaa 421 aagacagcac tagtaagtcg catccggagc tgcagactcc aaaagacagc actggcaaat 481 cgggtgcgga ggcgcagacc ccagaagaca gccccaacag gtcgggtgcg gagccaaaga 541 cccaaaaaga cagccctagc aagtcaggtt cggaggcgca gaccacaaaa gatgtcccta 601 ataagtcggg tgcggacggc cagaccccaa aagacggctc cagcaagtcg ggtgcggagg 661 atcagacccc aaaagacgtc cctaacaagt cgggtgcgga gaagcagact ccaaaagacg 721 gctctaacaa gtccggtgca gaggagcagg gcccaataga cgggcccagc aagtcgggtg 781 cggaggagca gacctcaaaa gacagcccta acaaggtggt tccagagcag ccttcccgga 841 aagaccattc caagcccatc tccaaccctt ctgataacaa ggagctcccc aaggctgaca 901 caaaccagct tgctgacaaa gggaagcttt ctcctcatgc tttcaaaacc gaatctgggg 961 aggaaactga cctcatttct cccccgcagg aggaagttaa gtcttcagag cctactgagg 1021 atgtggggcc caaagaggct gaagatgatg atacaggacc cgaggagggc tcaccgccca 1081 aagaagagaa agaaaagatg tccggttctg cctccagtga gaaccgtgaa gggacacttt 1141 cggattccac gggtagcgag aaggatgacc tttatccgaa cggttctgga aatggcagcg 1201 cggagagcag ccacttcttt gcatatctgg tgactgcagc cattcttgtg gctgtcctct 1261 atatcgctca tcacaacaag cggaagatca ttgcttttgt cctggaagga aaaagatcta 1321 aagtcacccg gcggccaaag gccagtgact accaacgttt ggaccagaag atcttttctc 1381 ccccaagtcc taacagaatg gtatattcct ctggaaaaag atgaacgtca ccaatggatt 1441 gtgctgctct cgtttcagct ttgatttttt tgtccttgag aaccttgtcc tccctgctga 1501 tttgtttcta aatcaaaaga aatgaagaaa aaagtactgt gacctgagag acaccctcct 1561 ctagaattta gtggcgggtc tgggctggca gaggtagggg gctgctttgg gctttgcacc 1621 tgcactttgg tgacattgtt cttctgtgtt ccctttattt atgctggtgg cttccatccg 1681 ttctcctctg gggtgagtgg aggggtatat ggaaacacgg ctatgaccaa agggagatcc 1741 cagcctgggc agcctgcgct gctgaccacc ctccctgggg cccgggctct gtaggaaagt 1801 tggtccttga ctgtggcatt gcactctgca ctgtttctct ctgcagacct aggggaaaac 1861 tgcaggtgga agtgcttttc tactaaggcc tcttactttg ggggggatgt gccctacaga 1921 agacatagaa gatggggaaa tgccaatggg caaagagcta ctttgaatac ataattctct 1981 tcaaagactt cagcagcaaa cctaaacagc aggttaaaaa aaaagatgct tttttgggtg 2041 caagtctaac ctgtctagca tgagatcttc ttgattttct gattatttta tgtagcttga 2101 gacaaagtga atcaacttcc acttagttgt accgagcata aaacagaact tgggcttcct 2161 ggcagtgagg ccactgtccc atcacagatt tttaaaataa atatgatttg aagtagtgtg 2221 atctttcaca caa // LOCUS AF027957 1299 bp DNA PRI 02-JAN-1998 DEFINITION Homo sapiens G protein-coupled receptor (GPR35) gene, complete cds. ACCESSION AF027957 NID g2739108 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1299) AUTHORS O'Dowd,B.F., Nguyen,T., Marchese,A., Cheng,R., Lynch,K.R., Heng,H.H.Q., Kolakowski,J.L.F. Jr. and George,S.R. TITLE Discovery of three novel G protein-coupled receptor genes JOURNAL Genomics (1997) In press REFERENCE 2 (bases 1 to 1299) AUTHORS O'Dowd,B.F. TITLE Direct Submission JOURNAL Submitted (03-OCT-1997) Department of Pharmacology, University of Toronto, 8 Taddle Creek Rd., Toronto, ON M5S 1A8, Canada FEATURES Location/Qualifiers source 1..1299 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="2" /map="2q37.3" gene 214..1143 /gene="GPR35" CDS 214..1143 /gene="GPR35" /note="orphan G protein-coupled receptor" /codon_start=1 /product="G protein-coupled receptor" /db_xref="PID:g2739109" /translation="MNGTYNTCGSSDLTWPPAIKLGFYAYLGVLLVLGLLLNSLALWV FCCRMQQWTETRIYMTNLAVADLCLLCTLPFVLHSLRDTSDTPLCQLSQGIYLTNRYM SISLVTAIAVDRYVAVRHPLRARGLRSPRQAAAVCAVLWVLVIGSLVARWLLGIQEGG FCFRSTRHNFNSMRFPLLGFYLPLAVVVFCSLKVVTALAQRPPTDVGQAEATRKAARM VWANLLVFVVCFLPLHVGLTVRLAVGWNACALLETIRRALYITSKLSDANCCLDAICY YYMAKEFQEASALAVAPRAKAHKSQDSLCVTLA" BASE COUNT 189 a 429 c 412 g 269 t ORIGIN 1 tgggaagagg atctgtccag gggttagacc ttcaagggtg acttggagtt ctttacggca 61 cccatgcttt cttgaggagt tttgtgtttg tgggtgtggg gtcggggctc acctcctccc 121 acatcctgcc cagaggtggg cagagtgggg gcagtgcctt gctccccctg ctcgctctct 181 gctgactccg gctccctgtg ctgccccagg accatgaatg gcacctacaa cacctgtggc 241 tccagcgacc tcacctggcc cccagcgatc aagctgggct tctacgccta cttgggcgtc 301 ctgctggtgc taggcctgct gctcaacagc ctggcgctct gggtgttctg ctgccgcatg 361 cagcagtgga cggagacccg catctacatg accaacctgg cggtggccga cctctgcctg 421 ctgtgcacct tgcccttcgt gctgcactcc ctgcgagaca cctcagacac gccgctgtgc 481 cagctctccc agggcatcta cctgaccaac aggtacatga gcatcagcct ggtcacggcc 541 atcgccgtgg accgctatgt ggccgtgcgg cacccgctgc gtgcccgcgg gctgcggtcc 601 cccaggcagg ctgcggccgt gtgcgcggtc ctctgggtgc tggtcatcgg ctccctggtg 661 gctcgctggc tcctggggat tcaggagggc ggcttctgct tcaggagcac ccggcacaat 721 ttcaactcca tgcggttccc gctgctggga ttctacctgc ccctggccgt ggtggtcttc 781 tgctccctga aggtggtgac tgccctggcc cagaggccac ccaccgacgt ggggcaggca 841 gaggccaccc gcaaggctgc ccgcatggtc tgggccaacc tcctggtgtt cgtggtctgc 901 ttcctgcccc tgcacgtggg gctgacagtg cgcctcgcag tgggctggaa cgcctgtgcc 961 ctcctggaga cgatccgtcg cgccctgtac ataaccagca agctctcaga tgccaactgc 1021 tgcctggacg ccatctgcta ctactacatg gccaaggagt tccaggaggc gtctgcactg 1081 gccgtggctc cccgtgctaa ggcccacaaa agccaggact ctctgtgcgt gaccctcgcc 1141 taagaggcgt gctgtgggcg ctgtgggcca ggtctcgggg gctccgggag gtgctgcctg 1201 ccaggggaag ctggaaccag tagcaaggag cccgggatca gccctgaact cactgtgtat 1261 tctcttggag ccttgggtgg gcagggacgg ccaggtacc // LOCUS AF028008 1796 bp mRNA PRI 12-NOV-1997 DEFINITION Homo sapiens SP1-like zinc finger transcription factor SLP mRNA, complete cds. ACCESSION AF028008 NID g2605806 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1796) AUTHORS Cook,T., Mesa,K., Tuma,A. and Urrutia,R. TITLE Identification and functional characterization of SLP, a novel member of the SP1-like zinc finger transcription factor family JOURNAL Unpublished REFERENCE 2 (bases 1 to 1796) AUTHORS Cook,T., Mesa,K., Tuma,A. and Urrutia,R. TITLE Direct Submission JOURNAL Submitted (02-OCT-1997) GI Research Unit, Mayo Clinic, 200 1st Street SW, Rochester, MN 55905, USA FEATURES Location/Qualifiers source 1..1796 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="pancreas" gene 1..1796 /gene="SLP" CDS 18..1556 /gene="SLP" /codon_start=1 /product="SP1-like zinc finger transcription factor SLP" /db_xref="PID:g2605807" /translation="MHTPDFAGPDDARAVDIMDICESILERKRHDSERSTCSILEQTD MEAVEALVCMSSWGQRSQKGDLLRIRPLTPVSDSGDVTTTVHMDAATPELPKDFHSLS TLCITPPQSPDLVEPSTRTPVSPQVTRFQACTATDVLQSSAVVARALSGGAERGLLGL EPVPSSPCRAKGTSVIRHTGESPAACFPTIQTPDCRLSDSREGEEQLLGHFETLQDTH LTDSLLSTNLVSCQPCLHKSGGLLLTDKGQQAGWPGAVQTCSPKNYENDLPRKTTPLI SVSVPAPPVLCQMIPVTGQSSMLPAFLKPPPQLSVGTVRPILAQAAPAPQPVFVGPAV PQGAVMLVLPQGALPPPAPCAANVMAAGNTKLLPLAPAPVFITSSQNCVPQVDFSRRR NYVCSFPGCRKTYFKSSHLKAHLRTHTGEKPFNCSWDGCDKKFARSDELSRHRRTHTG EKKFVCPVCDRRFMRSDHLTKHARRHMTTKKIPGWQAEVGKLNRIASAESPGSPLVSM PASA" BASE COUNT 393 a 509 c 469 g 425 t ORIGIN 1 ttcggtcggc ctgcacgatg cacacgccgg acttcgcagg cccagacgac gcgcgcgcag 61 ttgacatcat ggacatatgt gagtccatcc tggagaggaa gcggcatgac agcgaaaggt 121 ctacttgcag catcttggag cagacagaca tggaagctgt cgaggctctt gtttgtatga 181 gctcctgggg tcaaagatcc cagaaaggtg acctgttgcg gataagaccc ctcacgcctg 241 tctctgactc tggggatgtc accaccactg tgcatatgga tgcagccaca cctgaactac 301 caaaagactt ccattcttta tcgactctgt gcataactcc tcctcagagc cctgatctcg 361 tggagccatc gacaaggaca cctgtttctc cccaagtaac aagattccaa gcatgtacag 421 ccacggatgt tctccagtcc tctgccgtag tggccagagc tctgagcggg ggcgcggaga 481 ggggcttgct gggtttggag ccagtgccca gctctccctg cagggccaag gggactagcg 541 tgatccgaca cactggggag agccctgctg cctgctttcc caccatccag actccagatt 601 gccggctttc tgacagcaga gaaggagaag agcagcttct gggacacttt gaaactttgc 661 aggacacaca cctcacggac agtttactca gcactaactt ggtgtcctgt cagccctgct 721 tgcacaagtc tggtggcctg ctgctcactg acaaaggcca gcaggcaggg tggcctggtg 781 cagttcagac ttgctcacca aagaattatg aaaatgacct gcccaggaaa accacccctc 841 tgatttctgt ctctgtccct gctccccctg tcctttgcca gatgatccct gtgactggac 901 aaagtagcat gttaccagct tttttgaagc cccctcccca gttgtctgtg gggactgtga 961 gacccatcct agctcaggct gctccagcgc ctcaacctgt gttcgtggga cctgctgtgc 1021 ctcagggagc tgtgatgttg gtcctgcccc agggagccct ccctccgcct gccccctgtg 1081 cagccaatgt catggctgcc gggaatacca agttgttgcc ccttgcccct gctccagtgt 1141 tcatcacctc tagccaaaac tgtgtccctc aggtagactt ttcccgaagg aggaactatg 1201 tttgcagctt cccaggttgc cggaagacct acttcaaaag ttcccacctt aaggcccatc 1261 ttcgcactca cacaggggag aagcctttca actgcagctg ggatggctgt gataaaaagt 1321 ttgctcgttc ggatgagctg tcacgccacc gcagaactca cacaggggag aagaagtttg 1381 tgtgcccggt gtgtgaccga cgtttcatgc gcagtgacca cctgacgaag catgcccggc 1441 gccacatgac gaccaagaag atcccaggct ggcaggcaga ggttggcaag ctgaacagaa 1501 tcgcctctgc agagagcccg gggagcccac tggtgagcat gccagcctct gcctgaaagg 1561 tccattagga catcactcat gggattttta aaaagcctct ttccaggaat ggaactgatg 1621 gattcctctc ccactgcctc acccaaaaaa aacggtcttg gcggcctagg ggaagatcgg 1681 ggaggctggt tttgatgaaa gtatgttaac ttttcttttc cacttgggga ccctgttcag 1741 tatcttttgt agtttcagaa gttttttttg ttttggtttt ttttttttaa agaaat // LOCUS AF028738 1520 bp mRNA PRI 04-NOV-1997 DEFINITION Homo sapiens imprinted multi-membrane spanning polyspecific transporter-related protein (IMPT1) mRNA, complete cds. ACCESSION AF028738 NID g2583222 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1520) AUTHORS Dao,D., Frank,D., Qian,N. and Tycko,B. TITLE IMPT1, An Imprinted Gene Similar to Polyspecific Transporter and Multi-Drug Resistance Genes JOURNAL Unpublished REFERENCE 2 (bases 1 to 1520) AUTHORS Dao,D., Frank,D., Qian,N. and Tycko,B. TITLE Direct Submission JOURNAL Submitted (06-OCT-1997) Pathology, Columbia University, 630 W 168th Street, New York, NY 10032, USA FEATURES Location/Qualifiers source 1..1520 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11p15.5" /tissue_type="placenta" gene 1..1520 /gene="IMPT1" CDS 243..1469 /gene="IMPT1" /function="predicted polyspecific transporter" /note="IMPT1 protein; there is an in-frame ATG preceding the probable true start codon which is in a poor initiation context" /codon_start=1 /product="imprinted multi-membrane spanning polyspecific transporter-related protein" /db_xref="PID:g2583223" /translation="MSALGRSSVILLTYVLAATELTCLFMQFSIVPYLSRKLGLDSIA FGYLQTTFGVLQLLGGPVFGRFADQRGARAALTLSFLAALALYLLLAAASSPALPGVY LLFASRLPGALMHTLPAAQMVITDLSAPEERPAALGRLGLCFGVGVILGSLLGGTLVS AYGIQCPAILAALATLLGAVLSFTCIPASTKGAKTDAQAPLPGGPRASVFELKGIASL LRLPDVPKIFLVKVASNCPTGLFMVMFSIISMDFFQLEAAKAGYLMSFFGLLQMVTQG LVIGQLSSHFSEEVLLGASVLVFIVVGLAMAWMSSVFHFCLLVPGLVFSLCTLNVVTD SMLIKAVSTSDTGTMLGLCASVQPLLRTLGPTVGGLLYRSFGVPVFGHVQVAINTLVL LVLWRKPMPQRKDKVR" BASE COUNT 229 a 558 c 430 g 303 t ORIGIN 1 aggtcaccct ggagcgctca ccccaccggc accagtgccc aagcccgccc ctgcaaaggc 61 aggcaaggcc aggcgggtgc tgcctgggac ccagtgactc agcacccctg cccggatcaa 121 ctggactttt gccccctgct ccgccagcct cctgcttgga tctctcctgg gtctccctgc 181 tgcgcctgtc caggatgcag ggagctcggg ctcccaggga ccagggccag tcccccggca 241 ggatgagcgc tctaggccgg tcctcggtca tcttgcttac ctacgtgctg gccgccacag 301 aacttacctg cctcttcatg cagttctcca tcgtgccata cctgtctcgg aaactgggcc 361 tggattccat tgccttcggc tacctgcaaa ccaccttcgg ggtgctgcag ctgctgggcg 421 ggccggtgtt tggcaggttc gcagaccagc gcggggcgcg ggcggcgctc acgctctcct 481 tcctggctgc cttggcgctc tacctgctcc tggcggccgc ctccagcccg gccctgcccg 541 gggtctacct gctcttcgcc tcgcgcctgc ccggagcgct catgcacacg ctgccagccg 601 cccagatggt catcacggac ctgtcggcac ccgaggagcg gcccgcggcc ctgggccggc 661 tgggcctctg cttcggcgtc ggagtcatcc tcggctccct gctgggcggg accctggtct 721 ccgcgtacgg gattcagtgc ccggccatcc tggctgccct ggccaccctc ctgggagctg 781 tcctcagctt cacctgcatc cccgccagca ccaaaggggc caaaactgac gcccaggctc 841 cactgccagg cggcccccgg gccagtgtgt tcgaactgaa aggcatcgcc tccctgctgc 901 ggctgccaga cgtcccgaag atcttcttgg tgaaagtggc ctccaactgc cccacagggc 961 tcttcatggt catgttctcc atcatctcca tggacttctt ccaactggag gccgccaaag 1021 ctggctacct catgtccttc ttcgggctcc tccagatggt gacccagggc ctggtcatcg 1081 ggcagctgag cagccacttc tcggaagagg tgctgctcgg ggcaagcgtg ctggtcttca 1141 tcgtggtggg cctggccatg gcctggatgt ccagcgtctt ccacttctgc ctcctggtgc 1201 ccggcctggt gttcagcctc tgcaccctca acgtggtcac cgacagcatg ctgatcaagg 1261 ctgtctccac ctcggacaca gggaccatgc tgggcctctg cgcctctgta caaccactgc 1321 tccgaactct gggacccacg gtcggcggcc tcctgtaccg cagctttggc gtccccgtct 1381 tcggccacgt gcaggttgct atcaataccc ttgtcctcct ggtcctctgg aggaaaccta 1441 tgccccagag gaaggacaaa gtccggtgac cgctgcccag acacagactg gcaataaact 1501 cctactaaat cccaaaaaaa // LOCUS AF028825 1019 bp mRNA PRI 13-NOV-1997 DEFINITION Homo sapiens Tax interaction protein 15 mRNA, complete cds. ACCESSION AF028825 NID g2613005 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1019) AUTHORS Rousset,R., Fabre,S., Desbois,C., Bantignies,F. and Jalinot,P. TITLE The C-terminus of the HTLV-1 Tax oncoprotein mediates interaction with the PDZ domain of cellular proteins JOURNAL Oncogene 15 (1997) In press REFERENCE 2 (bases 1 to 1019) AUTHORS Rousset,R., Fabre,S., Desbois,C., Bantignies,F. and Jalinot,P. TITLE Direct Submission JOURNAL Submitted (07-OCT-1997) Biologie, ENSL, 46 Allee d'Italie, Lyon 69364, France FEATURES Location/Qualifiers source 1..1019 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="peripheral lymphocytes, EBV-immortalized" /clone="15" CDS 42..932 /note="TIP-15; interacts with Tax of HTLV-1; has PDZ domain" /codon_start=1 /product="Tax interaction protein 15" /db_xref="PID:g2613006" /translation="MDCLCIVTTKKYRYQDEDTPPLEHSPAHLPNQANSPPVIVNTDT LEAPGYVNGTEGEMEYEEITLERGNSGLGFSIAGGTDNPHIGDDPSIFITKIIPGGAA AQDGRLRVNDSILFVNEVDVREVTHSAAVEALKEAGSIVRLYVMRRKPPAEKVMEIKL IKGPKGLGFSIAGGVGNQHIPGDNSIYVTKIIEGGAAHKDGRLQIGDKILAVNSVGLE DVMHEDAVAALKNTYDVVYLKVAKPSNAYLSDSYAPPDITTCESPSNTKVPHDPSRLT HDMQSHDIPVAVPLQIFPMK" BASE COUNT 286 a 287 c 260 g 186 t ORIGIN 1 gtcagagccc cttacccgcc gccgcggcca ggccccccaa catggactgt ctctgtatag 61 tgacaaccaa gaaataccgc taccaagatg aagacacgcc ccctctggag cacagcccgg 121 cccacctccc caaccaggcc aattctcccc cagtgattgt caacacagat accctagaag 181 ccccaggata tgtgaacggg accgaggggg agatggaata cgaggaaatc acattggaaa 241 ggggtaactc aggtctgggc ttcagcatcg caggtggcac tgacaaccca cacatcggtg 301 acgacccatc cattttcatc accaagatca ttcctggtgg ggctgcggcc caggatggcc 361 gcctcagggt caacgacagc atcctgtttg taaatgaagt ggacgtgcgc gaggtgaccc 421 actcagcggc ggtggaagcc ctcaaagagg caggctccat cgttcgcctc tatgtcatgc 481 gccggaagcc cccggctgag aaggtcatgg agatcaagct catcaagggg cctaaaggtc 541 ttggcttcag catcgcaggg ggcgtaggga accagcacat cccaggagat aatagcatct 601 atgtaacaaa gatcatcgaa gggggtgctg cccacaagga tgggaggttg cagattggag 661 acaagatcct ggcggtcaac agtgtggggc tagaggacgt catgcatgaa gatgctgtgg 721 cagccctgaa gaacacgtat gatgttgtct acctaaaggt ggccaagccc agcaatgcct 781 acctgagtga cagctatgct cccccagaca tcacaacctg tgagagccct tcaaacacca 841 aagttcccca tgacccatcc cgtttaaccc atgacatgca gtcccacgac atccctgtag 901 ctgtccctct tcagatattc cctatgaaat agccacaacc tctgatcctt gctgttgttg 961 gagaagagat ggaaaatcaa ttaccataaa agaactagaa aagaaaaaaa aaaaaaaaa // LOCUS AF028840 1852 bp mRNA PRI 18-NOV-1997 DEFINITION Homo sapiens Kruppel-associated box protein mRNA, complete cds. ACCESSION AF028840 NID g2623621 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1852) AUTHORS Wang,Y., Schreiber,M.C., Currier,M.A. and Pike,J.W. TITLE Direct Submission JOURNAL Submitted (06-OCT-1997) Molecular and Cellular Physiology, University of Cincinnati, 231 Bethesda Avenue, Cincinnati, OH 45267, USA FEATURES Location/Qualifiers source 1..1852 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="osteosarcoma" CDS 43..1386 /note="zinc finger protein" /codon_start=1 /product="Kruppel-associated box protein" /db_xref="PID:g2623622" /translation="MEEMLSFWDVAIDFSPEEKECLDPAQWDLYRDVMLENFSHLDFL GLAVAKPYLVTFLEQNQGSSGVKSQTSATIPPGTTGNEFNNHNEAIDHSSLSMQCQRI HTGEEPYYFEDCGKALSSQATLSVLQGHFIGDKPYKCRECHKTLSSRSSLLIHQKYHT DEKTYKCEKCGKGFFRSSDLQHHQKIHTGEKPYKCEECDKAFLHHSYLRKHQAVHTGE KPYKCEECGNSFYYPAMLKQHQRIHSGEKLDKCEECGKVFSSAFFLNQHKGIDSGEKR YKCQECGKSFCYRSYLREHYRMHSGEYPYKCEECGKGFSRSSKLQEHQTIHTGVKPYK CEECGKCFSSFTSLKRHQIIHSEDTPHECVECGKRFSSSSRLQEHQKIHTEEKPYKCE ECDKAFLYHSFLRRHKAVHTREKPYMCEKCRKCFSSFTSLKRHQIHSIDISHECV" BASE COUNT 604 a 368 c 359 g 521 t ORIGIN 1 ccagacctgg tccctgctgt gctctcccgt tggatctctg acatggagga aatgctgtcc 61 ttctgggatg tggccattga tttttctcca gaagagaaag aatgcctgga ccctgctcag 121 tgggatctgt acagggatgt gatgctggag aatttcagcc accttgattt cctgggtctt 181 gctgttgcta aaccatactt ggtaacattt ctggagcaaa accaagggtc ttcgggtgtg 241 aaaagccaaa catcagccac cattcctcca ggaacaacag gcaatgaatt taacaatcac 301 aatgaggcca ttgatcacag ctctctaagt atgcaatgcc agagaattca tacaggagag 361 gaaccctact actttgaaga ctgtggtaag gcccttagtt ctcaggcaac actttctgta 421 ctccagggac attttattgg agacaaaccc tacaagtgta gagaatgtca taaaacactt 481 agcagtcgct catcactttt aatacaccag aaatatcata ctgatgaaaa aacctacaag 541 tgtgaaaaat gtggcaaagg gtttttccgt tcctcagatc ttcagcatca tcaaaaaatt 601 catactggag agaaaccata caaatgtgaa gagtgtgaca aagcctttct tcatcactca 661 tatcttagga aacaccaggc agttcatact ggagagaaac cctacaagtg tgaagagtgt 721 ggaaattcat tttactatcc tgcaatgctg aagcaacatc aaagaattca ttctggagag 781 aaacttgaca agtgtgaaga atgtgggaaa gtattttcct ctgctttctt tcttaaccag 841 cataaaggaa ttgattccgg agagaaaagg tacaagtgtc aagaatgtgg caaatccttt 901 tgctatcgtt catatcttag ggaacattat agaatgcatt ctggagaata tccctacaag 961 tgcgaagaat gtggcaaagg gttttcccgt tcctcaaagc ttcaggaaca tcaaacaatt 1021 catactggag ttaaacctta caagtgtgaa gagtgtggga aatgtttttc ctcttttacg 1081 tcccttaaaa gacatcaaat catccattct gaagatactc cccatgagtg tgtagaatgc 1141 ggcaaaaggt tttctagttc ttcccgcctt caggaacatc agaaaattca cactgaagag 1201 aaaccataca aatgtgagga atgtgacaaa gcctttcttt atcactcgtt tcttaggaga 1261 cacaaagcag ttcatactag agagaaaccc tacatgtgtg aaaagtgtag gaaatgtttt 1321 tcttccttta catccctgaa aagacatcaa atccattcca tagacatttc ccatgagtgt 1381 gtataatgtg gcaaaagatt ttctagttct tcccacctac aggaacatct agaaattcat 1441 actggagaaa tcatacaaat atgaagactg tcacaaagcc tttctttatc attcatttct 1501 taggagactt gaggcagttc acagagagaa accatacaag tgtgaagagt atgggaaatg 1561 tctgttcttt tcttcaaccc ttaaaagaca tcaaataatt cattatgaag tcagtctcta 1621 caagtgtggg gaatgtttaa agctccttgt atgctgtcca ctcttaagaa acacaaattt 1681 gttcattttg gagaggaaca tacaagtgtg accaatgtca gaaagccttg tcttttgtaa 1741 tacttctcta tttgagcaca agctaagtca tactggagtg aaattttcct agtttgaaga 1801 atataccaaa atattttact ctaatttata actgaaaaaa aaaaaaaaaa aa // LOCUS AF029106 3753 bp mRNA PRI 20-NOV-1997 DEFINITION Homo sapiens neuronal munc18-1 binding protein (mint 1) mRNA, complete cds. ACCESSION AF029106 NID g2625024 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3753) AUTHORS Okamoto,M. and Sudhof,T.C. TITLE Mints: munc18 interacting proteins JOURNAL J. Biol. Chem. (1997) In press REFERENCE 2 (bases 1 to 3753) AUTHORS Okamoto,M. and Sudhof,T.C. TITLE Direct Submission JOURNAL Submitted (06-OCT-1997) Dept. of Molecular Genetics, UT Southwestern Medical Center, 5323 Harry Hines Blvd., Dallas, TX 75235, USA FEATURES Location/Qualifiers source 1..3753 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" gene 1..3753 /gene="mint 1" CDS 199..2712 /gene="mint 1" /function="putative function in synaptic vesicle exocytosis by binding to munc18-1, an essential component of the synaptic vesicle exocytotic machinery" /note="contains PDZ and PTB domains but no transmembrane region" /codon_start=1 /product="neuronal munc18-1 binding protein" /db_xref="PID:g2625025" /translation="MNHLEGSAEVEVTDEAAGGEVNESVEADLEHPEVEEEQQQPPQQ QHYVGRHQRGRALEDLRAQLGQEEEERGECLARSASTESGFHNHTDTAEGDVIAAARD GYDAERAQDPEDESAYAVQYRPEAEEYTEQAEAEHAEATHRRALPNHLHFHSLEHEEA MNAAYSGYVYTHRLFHRGEDEPYSEPYADYGGLQEHVYEEIGDAPELHARDGLRLYEQ ERDEAAAYRQEALGARLHHYDERSDGESDSPEKEAEFAPYPRMDSYEQEEDIDQIVAE VKQSMSSQSLDKAAEDMPEAEQDLERPPTPAGGRPDSPGLQAPAGQQRAVGPAGGGEA GQRYSKEKRDAISLAIKDIKEAIEEVKTRTIRSPYTPDEPKEPIWVMRQDISPTRDCD DQRPMDGDSPSPGSSSPLGAESSSTSLHPSDPVEVPINKESRKSLASFPTYVEVPGPC DPEDLIDGIIFAANYLGSTQLLSDKTPSKNVRMMQAQEAVSRIKMAQKLAKSRKKAPE GESQPMTEVDLFILTQRIKVLNADTQETMMDHPLRTISYIADIGNIVVLMARRRIPRS NSQENVEASHPSQDGKRQYKMICHVFESEDAQLIAQSIGQAFSVAYQEFLRANGINPE DLSQKEYSDLLNTQDMYNDDLIHFSKSENCKDVFIEKQKGEILGVVIVESGWGSILPT VIIANMMHGGPAEKSGKLNIGDQIMSINGTSLVGLPLSTCQSIIKGLENQSRVKLNIV RCPPVTTVLIRRPDLRYQLGFSVQNGIICSLMRGGIAERGGVRVGHRIIEINGQSVVA TPHEKIVHILSNAVGEIHMKTMPAAMYRLLTAQEQPVYI" BASE COUNT 853 a 1125 c 1082 g 693 t ORIGIN 1 gcggcgggag aagatggtgg cgctggaggc agcagcggct ggagcggagc tttattccca 61 ggcttggcac cagcgctgtc actagcgccg cctgctcccg cgggcccgcg gacccagctg 121 tcaggcaagc ccagtggagc aaaatgagag cgcagtgagc cgggcccagc ttctctccta 181 gccgttccga ctcccaccat gaaccacttg gaggggtctg cggaggtgga ggtgaccgac 241 gaggcggcag gtggggaggt gaacgagtcg gtggaggccg acctggagca ccccgaggtg 301 gaagaggaac agcagcagcc gccgcagcag cagcactatg tgggccgcca ccagcgcggg 361 cgagccctcg aggacctccg cgcccagctc ggccaggagg aagaggagcg cggggaatgc 421 ctggcgcgct cagccagcac ggagagcggc ttccacaacc acacggacac cgccgagggc 481 gacgtgatcg ccgcggcccg cgacggctac gatgcggagc gcgcgcagga ccccgaggac 541 gagagcgcct atgctgtgca gtaccggccc gaggccgagg agtacacgga gcaggcagag 601 gccgagcacg ccgaggccac gcaccgccgc gcgctgccca accacctgca cttccactcg 661 ctggagcacg aggaagccat gaatgcggcc tactcaggct acgtctacac gcaccggctc 721 ttccaccgcg gtgaggacga gccctactcc gagccctatg ccgactacgg cggcctccag 781 gagcacgtgt acgaggagat aggggacgcg cccgagctgc acgcacgcga cggtctgcgg 841 ctctacgagc aggagcgcga cgaggcggcc gcgtaccgcc aggaggccct gggcgcgcgg 901 ctgcaccatt acgacgagcg ctccgacggc gagtccgaca gccccgagaa ggaggccgag 961 ttcgcgccct acccgcgcat ggacagctac gagcaggagg aggacatcga ccagatagtg 1021 gccgaggtga agcagagcat gagctcgcag agcctcgaca aggcagccga ggacatgcct 1081 gaggccgagc aggacctgga gcgtccccct accccggccg ggggtcgccc cgacagcccc 1141 gggctgcagg cgccggcggg gcagcagcgg gcggtgggcc ccgcgggcgg cggcgaggcg 1201 gggcagcggt acagcaagga gaagcgcgat gccatctcgc tggccatcaa ggacatcaag 1261 gaggccatcg aggaggtgaa aaccaggacc atccgttcgc cttacacccc cgacgagccc 1321 aaagagccca tctgggtcat gcgccaggac attagcccca ccagggactg tgacgaccag 1381 aggccgatgg acggagattc tccgtctcct ggcagctcct cccccttggg tgcagagtca 1441 tcaagcacat ctcttcaccc cagtgaccct gtggaagtgc ccattaataa agagtcaaga 1501 aaaagcttgg cttcattccc aacctacgtt gaagttccgg gaccctgcga ccccgaagac 1561 ttgatcgatg gaatcatttt tgccgccaat taccttggct ccactcagct gctctcagac 1621 aaaactcctt ccaaaaacgt gcgcatgatg caggcccagg aagccgtaag caggatcaag 1681 atggcccaga aattagccaa aagcaggaag aaggctcctg aaggcgaatc tcagccaatg 1741 actgaagtgg acctcttcat tcttacccag agaatcaaag tgctgaacgc cgacacacag 1801 gagacaatga tggaccaccc tctgaggacc atttcctaca ttgcggacat tgggaacatc 1861 gttgtgctga tggcccgccg gcggatacct cgctccaact cccaggagaa cgtggaagcg 1921 tcccacccat cccaggatgg gaaaaggcag tacaagatga tctgccacgt cttcgagtct 1981 gaggatgctc agctgattgc acagtccatc ggacaggcat ttagcgtggc ataccaggaa 2041 ttcctcaggg ccaatgggat taaccccgaa gatctcagcc agaaggagta tagtgacctg 2101 ctcaataccc aggacatgta caacgatgac ctgatccact tctccaagtc ggaaaactgt 2161 aaagatgttt tcatagagaa gcagaaagga gaaatcctag gtgtggtgat tgtggagtct 2221 ggctggggat ccatcctccc caccgtgatc attgccaaca tgatgcatgg tggccctgcg 2281 gagaaatctg ggaagctgaa tatcggtgac cagatcatgt ccattaatgg caccagcctg 2341 gtgggcctgc ctctgtccac ctgccagagc attattaagg gcttagagaa tcagtcccga 2401 gtcaagctga atatcgtgag atgtcctccg gtgaccaccg tgttaatcag aagaccagac 2461 cttcgctacc agctcggttt cagcgtccag aatggaatta tctgcagcct catgcgaggg 2521 ggaatagctg agagaggagg cgtccgtgtg gggcaccgga tcattgaaat caatggacag 2581 agcgtcgtgg ccacccccca cgagaagatc gtccacattc tctccaatgc tgttggggag 2641 attcatatga agacaatgcc agccgcgatg tacaggctgc tgacggccca ggagcagcct 2701 gtttacatct gaccgcggcc acacgcggtg gcatgcatgg aggactctcc tcttcgtggt 2761 tgtgtttctc gtgctgcatc cctgtgtcca ctgagacttt cccctctcgc gcccagcatt 2821 tggttttaca caggaagaga agaatccaca aggacctctt tactctctcc gatttgcttt 2881 tttttttttt ttttttcaat accagggaag tttcgtatgc actcccttga ggatggagag 2941 cagccagcac ccacctggta ctgacccagg accatcctgg agggctttct gggtgtgtcc 3001 agggggtggg ctgtcactgc ttgagggaga atcctccctt cccaggaggt gcagacttct 3061 taaaaggagc tcgcggggca gcaaagcagc tgattcagca gtgcctaaaa cccagttgct 3121 gatccctgct ctctgagttt atctgtggga atgtggtagt acccagggcc agcccacgtc 3181 attaaggtta tgcactgcct gccatgtagt tggggcacca tacattattt cttcccagaa 3241 tctctgagcc aaccttaaac ctcttctatt gctagttcta atttcaacgt atgtgtgttt 3301 tctaatacga gccttctcac cccaggataa aaggggaaat aactgccttg ggcaaggaca 3361 ccatagtgtc acagagagct tggcaattca tggggcatct gaagctttac caggttgtcc 3421 tcaagttatc aaccattaga taaacacagg caggtactgt ctgcttctct ctctctcaca 3481 cacacacaca cacacacagt cccattcgga tatcaccctt ccctgctccc accaccagtg 3541 agacaagctg aagattaggc acacagatcc cctggaaacc cacctctctg gaaggtccct 3601 tccctggcag gtggtcaggc aggcgggggc tgttcagcct catcctgaga gacctgtcct 3661 ctctgtgctg aggtccagcc cctccagcac cactttggct tatcgtatcc tcctcctgga 3721 cttaccctct ctgccgaaca cacatcccag gcg // LOCUS AF029213 1740 bp mRNA PRI 10-NOV-1997 DEFINITION Homo sapiens IL-1 receptor accessory protein mRNA, complete cds. ACCESSION AF029213 NID g2599126 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1740) AUTHORS Huang,J., Gao,X., Li,S. and Cao,Z. TITLE Recruitment of IRAK to the IL-1 receptor complex by IL-1R accessory protein JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1997) In press REFERENCE 2 (bases 1 to 1740) AUTHORS Huang,J., Gao,X., Li,S. and Cao,Z. TITLE Direct Submission JOURNAL Submitted (07-OCT-1997) Biology, Tularik, Inc., 2 Corporate Dr., South San Francisco, CA 94080, USA FEATURES Location/Qualifiers source 1..1740 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" CDS 10..1722 /codon_start=1 /product="IL-1 receptor accessory protein" /db_xref="PID:g2599127" /translation="MTLLWCVVSLYFYGILQSDASERCDDWGLDTMRQIQVFEDEPAR IKCPLFEHFLKFNYSTAHSAGLTLIWYWTRQDRDLEEPINFRLPENRISKEKDVLWFR PTLLNDTGNYTCMLRNTTYCSKVAFPLEVVQKDSCFNSPMKLPVHKLYIEYGIQRITC PNVDGYFPSSVKPTITWYMGCYKIQNFNNVIPEGMNLSFLIALISNNGNYTCVVTYPE NGRTFHLTRTLTVKVVGSPKNAVPPVIHSPNDHVVYEKEPGEELLIPCTVYFSFLMDS RNEVWWTIDGKKPDDITIDVTINESISHSRTEDETRTQILSIKKVTSEDLKRSYVCHA RSAKGEVAKAAKVKQKVPAPRYTVELACGFGATVLLVVILIVVYHVYWLEMVLFYRAH FGTDETILDGKEYDIYVSYARNAEEEEFVLLTLRGVLENEFGYKLCIFDRDSLPGGIV TDETLSFIQKSRRLLVVLSPNYVLQGTQALLELKAGLENMASRGNINVILVQYKAVKE TKVKELKRAKTVLTVIKWKGEKSKYPQGRFWKQLQVAMPVKKSPRRSSSDEQGLSYSS LKNV" BASE COUNT 518 a 360 c 411 g 451 t ORIGIN 1 tctcaaagga tgacacttct gtggtgtgta gtgagtctct acttttatgg aatcctgcaa 61 agtgatgcct cagaacgctg cgatgactgg ggactagaca ccatgaggca aatccaagtg 121 tttgaagatg agccagctcg catcaagtgc ccactctttg aacacttctt gaaattcaac 181 tacagcacag cccattcagc tggccttact ctgatctggt attggactag gcaggaccgg 241 gaccttgagg agccaattaa cttccgcctc cccgagaacc gcattagtaa ggagaaagat 301 gtgctgtggt tccggcccac tctcctcaat gacactggca actatacctg catgttaagg 361 aacactacat attgcagcaa agttgcattt cccttggaag ttgttcaaaa agacagctgt 421 ttcaattccc ccatgaaact cccagtgcat aaactgtata tagaatatgg cattcagagg 481 atcacttgtc caaatgtaga tggatatttt ccttccagtg tcaaaccgac tatcacttgg 541 tatatgggct gttataaaat acagaatttt aataatgtaa tacccgaagg tatgaacttg 601 agtttcctca ttgccttaat ttcaaataat ggaaattaca catgtgttgt tacatatcca 661 gaaaatggac gtacgtttca tctcaccagg actctgactg taaaggtagt aggctctcca 721 aaaaatgcag tgccccctgt gatccattca cctaatgatc atgtggtcta tgagaaagaa 781 ccaggagagg agctactcat tccctgtacg gtctatttta gttttctgat ggattctcgc 841 aatgaggttt ggtggaccat tgatggaaaa aaacctgatg acatcactat tgatgtcacc 901 attaacgaaa gtataagtca tagtagaaca gaagatgaaa caagaactca gattttgagc 961 atcaagaaag ttacctctga ggatctcaag cgcagctatg tctgtcatgc tagaagtgcc 1021 aaaggcgaag ttgccaaagc agccaaggtg aagcagaaag tgccagctcc aagatacaca 1081 gtggaactgg cttgtggttt tggagccaca gtcctgctag tggtgattct cattgttgtt 1141 taccatgttt actggctaga gatggtccta ttttaccggg ctcattttgg aacagatgaa 1201 accattttag atggaaaaga gtatgatatt tatgtatcct atgcaaggaa tgcggaagaa 1261 gaagaatttg tattactgac cctccgtgga gttttggaga atgaatttgg atacaagctg 1321 tgcatctttg accgagacag tctgcctggg ggaattgtca cagatgagac tttgagcttc 1381 attcagaaaa gcagacgcct cctggttgtt ctaagcccca actacgtgct ccagggaacc 1441 caagccctcc tggagctcaa ggctggccta gaaaatatgg cctctcgggg caacatcaac 1501 gtcattttag tacagtacaa agctgtgaag gaaacgaagg tgaaagagct gaagagggct 1561 aagacggtgc tcacggtcat taaatggaaa ggggaaaaat ccaagtatcc acagggcagg 1621 ttctggaagc agctgcaggt ggccatgcca gtgaagaaaa gtcccaggcg gtctagcagt 1681 gatgagcagg gcctctcgta ttcatctttg aaaaatgtat gaaaggaata atgaaaagga // LOCUS AF029232 2455 bp mRNA PRI 20-NOV-1997 DEFINITION Homo sapiens calpamodulin (CalpM) mRNA, complete cds. ACCESSION AF029232 NID g2625039 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2455) AUTHORS Crotty,P.L. TITLE Direct Submission JOURNAL Submitted (07-OCT-1997) Pathology, Yale University, 310 Cedar Street, New Haven, CT 06510, USA FEATURES Location/Qualifiers source 1..2455 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" gene 1..2455 /gene="CalpM" CDS 92..2017 /gene="CalpM" /note="member of calpain family; lacks cysteine protease activity; active-site cysteine is replaced by lysine" /codon_start=1 /product="calpamodulin" /db_xref="PID:g2625040" /translation="MGPPLKLFKNQKYQELKQECIKDSRLFCDPTFLPENDSLFYNRL LPGKVVWKRPQDICDDPHLIVGNISNHQLTQGRLGHKPMVSAFSCLAVQESHWTKTIP NHKEQEWDPQKTEKYAGIFHFRFWHFGEWTEVVIDDLLPTINGDLVFSFSTSMNEFWN ALLEKAYAKLLGCYEALDGLTITDIIVDFTGTLAETVDMQKGRYTELVEEKYKLFGEL YKTFTKGGLICCSIESPNQEEQEVETDWGLLKGHTYTMTDIRKIRLGERLVEVFSAEK VYMVRLRNPLGRQEWSGPWSEISEEWQQLTASDRKNLGLVMSDDGEFWMSLEDFCRNF HKLNVCRNVNNPIFGRKELESVLGCWTVDDDPLMNRSGGCYNNRDTFLQNPQYIFTVP EDGHKVIMSLQQKDLRTYRRMGKTDNYIIGFELFKVEMNRKFRLHHLYIQERAGTSTY IDTRTVFLSKYLKKGNYVLVPTMFQHGRTSEFLLRIFSEVPVQLRELTLDMPKMSCWN LARGYPKVVTQITVHSAEDLEKKYANETVNPYLVIKCGKEEVRSPVQKNTVHAIFDTQ AIFYRRTTDIPIIVQVWNSRKFCDQFLGQVTLDADPSDCRDLKSLYLRKKGGPTAKVK QGHISFKVISSDDLTEL" BASE COUNT 665 a 577 c 600 g 613 t ORIGIN 1 taacagcagc agcggcaacg gcagcagcag cagcagcaag cgcagcagca gcagcagggc 61 tcctgggata actcaggcat agttcaacac tatgggtcct cctctgaagc tcttcaaaaa 121 ccagaaatac caggaactga agcaggaatg catcaaagac agcagacttt tctgtgatcc 181 aacatttctg cctgagaatg attctctttt ctacaaccga ctgcttcctg gaaaggtggt 241 gtggaaacgt ccccaggaca tctgtgatga cccccatctg attgtgggca acattagcaa 301 ccaccagctg acccaaggga gactggggca caagccaatg gtttctgcat tttcctgttt 361 ggctgttcag gagtctcatt ggacaaagac aattcccaac cataaggaac aggaatggga 421 ccctcaaaaa acagaaaaat acgctgggat atttcacttt cgtttctggc attttggaga 481 atggactgaa gtggtgattg atgacttgtt gcccaccatt aacggagatc tggtcttctc 541 tttctccact tccatgaatg agttttggaa tgctctgctg gaaaaagctt atgcaaagct 601 gctaggctgt tatgaggccc tggatggttt gaccatcact gatattattg tggacttcac 661 gggcacattg gctgaaactg ttgacatgca gaaaggaaga tacactgagc ttgttgagga 721 gaagtacaag ctattcggag aactgtacaa aacatttacc aaaggtggtc tgatctgctg 781 ttccattgag tctcccaatc aggaggagca agaagttgaa actgattggg gtctgctgaa 841 gggccatacc tataccatga ctgatattcg caaaattcgt cttggagaga gacttgtgga 901 agtcttcagt gctgagaagg tgtatatggt tcgcctgaga aaccccttgg gaagacagga 961 atggagtggc ccctggagtg aaatttctga agagtggcag caactgactg catcagatcg 1021 caagaacctg gggcttgtta tgtctgatga tggagagttt tggatgagct tggaggactt 1081 ttgccgcaac tttcacaaac tgaatgtctg ccgcaatgtg aacaacccta tttttggccg 1141 aaaggagctg gaatcggtgt tgggatgctg gactgtggat gatgatcccc tgatgaaccg 1201 ctcaggaggc tgctataaca accgtgatac cttcctgcag aatccccagt acatcttcac 1261 tgtgcctgag gatgggcaca aggtcattat gtcactgcag cagaaggacc tgcgcactta 1321 ccgccgaatg gggaagactg acaattacat cattggcttt gagctcttca aggtggagat 1381 gaaccgcaaa ttccgcctcc accacctcta catccaggag cgtgctggga cttccaccta 1441 tattgacacc cgcacagtgt ttctgagcaa gtacctgaag aagggcaact atgtgcttgt 1501 cccaaccatg ttccagcatg gtcgcaccag cgagtttctc ctgagaatct tctctgaagt 1561 gcctgtccag ctcagggaac tgactctgga catgcccaaa atgtcctgct ggaacctggc 1621 tcgtggctac ccgaaagtag ttactcagat cactgttcac agtgctgagg acctggagaa 1681 gaagtatgcc aatgaaactg taaacccata tttggtcatc aaatgtggaa aggaggaagt 1741 ccgttctcct gtccagaaga atacagttca tgccattttt gacacccagg ccattttcta 1801 cagaaggacc actgacattc ctattatagt acaggtctgg aacagccgaa aattctgtga 1861 tcagttcttg gggcaggtta ctctggatgc tgaccccagc gactgccgtg atctgaagtc 1921 tctgtacctg cgtaagaagg gtggtccaac tgccaaagtc aagcaaggcc acatcagctt 1981 caaggttatt tccagcgatg atctcactga gctctaaatc tgcaatccca gagaatcctg 2041 acaaagcgtg ccacctttta ttttccgtca ggtgccaggt cttagttaag attcacaatc 2101 tttagaaaga atgagattca caataattaa ctcttcctct cttctgataa attccccata 2161 cctcccaatc caagtagcat ctgtagctac ataacctata tacctccagc agctggacat 2221 ggggaggcga cagtcctatc tagacatcat acacatttgc caagaaagga tctctggggc 2281 ttccgggggt gagattcaag caggacaata acaagaggct ggacacccta cagatgtctt 2341 tgatgttttc agttgtttga tatatctccc ctgtagggca tgttgaggaa ggaggagggc 2401 tgatcaaggc caagctggtc tagcctgaca tcctagctcc tgactgaaca ctata // LOCUS AF029678 2011 bp mRNA PRI 04-DEC-1997 DEFINITION Homo sapiens PHD Finger 1 (PHF1) mRNA, complete cds. ACCESSION AF029678 NID g2660719 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2011) AUTHORS Coulson,M., Robert,S., Eyre,H.J. and Saint,R. TITLE The identification and localisation of a human gene with sequence similarity to Polycomblike of Drosophila melanogaster JOURNAL Unpublished REFERENCE 2 (bases 1 to 2011) AUTHORS Coulson,M., Robert,S. and Saint,R. TITLE Direct Submission JOURNAL Submitted (09-OCT-1997) Department of Genetics, University of Adelaide, Adelaide, SA 5005, Australia FEATURES Location/Qualifiers source 1..2011 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6p21.3" /tissue_type="placenta" /note="Stratagene placenta cDNA library" gene 1..2011 /note="PHD Finger 1" /gene="PHF1" CDS 57..1430 /gene="PHF1" /note="Contains two PHD fingers (C4-H-C3)" /codon_start=1 /product="PHF1" /db_xref="PID:g2660720" /translation="MAQPPRLSRSGASSLWDPASPAPTSGPRPRLWEGQDVLARWTDG LLYLGTIKKVDSAREVCLVQFEDDSQFLVLWKDISPAALPGEELLCCVCRSETVVPGN RLVSCEKCRHAYHQDCHVPRAPAPGEGEGTSWVCRQCVFAIATKRGGALKKGPYARAM LGMKLSLPYGLKGLDWDAGHLSNRQQSYCYCGGPGEWNLKMLQCRSCLQWFHEACTQC LSKPLLYGDRFYEFECCVCRGGPEKVRRLQLRWVDVAHLVLYHLSVCCKKKYFDFDRE ILPFTSENWDSLLLGELSDTPKGERSSKLLSALNSHKDRFISGREIKKRKCLFGLHAR MPPPVEPPTGDGALTRAGPWGRGLTSPGEAPEAGARAPEEEAEGESGGAGATLSSAQS ARAPGAEGAGSSAEGTAAAPSGCLLPSTLLPAPQGPLGTVDPQTGHPWNFTLVSPQTS LKVPPTR" BASE COUNT 402 a 571 c 582 g 455 t 1 others ORIGIN 1 ctggcccctg cccggctccc ggcggcccca gctgtcaccg gcccccccag gatgcaatgg 61 cgcagccccc ccggctgagc cgctctggtg cctcctcact ttgggaccca gcttctcctg 121 ctcccacctc tggccccagg cctcggcttt gggagggtca agatgtgctg gccagatgga 181 ctgatgggct gctatacttg ggtaccatca aaaaggtgga cagtgctagg gaggtgtgtc 241 tggtccagtt tgaggatgat tcgcagtttc tggttctatg gaaagacatt agccctgctg 301 ccctccctgg agaggaactc ctctgttgtg tctgtcgctc tgagactgtg gtccctggga 361 accggctggt cagctgtgag aagtgtcgcc atgcttatca ccaggactgc catgttccca 421 gggctccagc ccctggagag ggagagggca catcctgggt atgccgccag tgtgtctttg 481 cgatcgccac caagagggga ggtgccctga agaagggccc ctatgcccgg gccatgctgg 541 gtatgaagct ttctctgcca tatggactga aggggctgga ctgggatgct ggacatctga 601 gcaaccgaca gcagagttac tgttactgtg gtggccctgg ggagtggaac ctgaaaatgc 661 tgcagtgccg gagctgcctg cagtggttcc atgaggcctg cacccagtgt ctgagcaagc 721 ccctcctcta tggggacagg ttctatgaat ttgaatgctg tgtgtgtcgc gggggccctg 781 agaaagtccg gagactacag cttcgctggg tggatgtggc ccatcttgtc ctgtatcacc 841 tcagtgtttg ctgtaagaag aaatactttg attttgatcg tgagatcctc cccttcactt 901 ctgagaattg ggacagtttg ctcctggggg agctttcaga cacccccaaa ggagaacgtt 961 cttccaagct cctctctgct cttaacagcc acaaggaccg tttcatttca gggagagaga 1021 ttaagaagag gaaatgtttg tttggtctcc atgctcggat gcctccccct gtggagcccc 1081 ctactggaga tggagcactc accagggcag ggccctgggg gaggggtctc acgtcccctg 1141 gggaagcgcc ggaggccgga gccagagccc ctgaggagga ggcagaaggg gaaagtggag 1201 gagctggggc caccctcagc agtgcgcaat cagcccgagc cccaggagca gagggagcgg 1261 gctcatctgc agagggcact gcagcagccc catccggatg tttgcttcct tccacccttc 1321 tgccagcacc gcagggacct ctggggacag tggaccccca gacaggtcac ccctggaact 1381 tcacattggt ttccccacag acatccctaa aagtgccccc cactcgatga ctgcctcatc 1441 ttcctcagtt tcatccccat ccccaggtct tcctagacgc tcagcacccc cttctcccct 1501 gtgccgtagt ttgtctcctg ggactggggg aggagtccga ggtggggttg gttacctgtc 1561 ccgaggggac cctgtccggg tccttgctcg gagagtacgg cctgatggct ctgtgcagta 1621 cctggttgag tggggaggag ggggcatctt ctgaacagcc tgcctctgcc cagctcccca 1681 ttcacacaca ccggcacttt cataccctga cctctgacct cacctacagc tgggatgtac 1741 ctggagagat agsgggtagt tctccctact gcccaggctg gaatccaaga gtggggagtg 1801 gggaagaggc cctcttctct accctccttc atgattcctg acccctccca tccttcccat 1861 ttcctttgat gttattttgt tacagctttt taaatatttt ttaaaattat ttaacccctg 1921 ggggcagaga ctgaggaggg aggatgataa gggatcccgg actctgtatg attgaaataa 1981 agagaaataa acaaaaaaaa aaaaaaaaaa a // LOCUS AF029749 1104 bp mRNA PRI 11-NOV-1997 DEFINITION Homo sapiens potassium channel beta 2 subunit (HKvbeta2.1) mRNA, complete cds. ACCESSION AF029749 NID g2599567 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1104) AUTHORS Rae,J.L. and Shepard,A.R. TITLE Direct Submission JOURNAL Submitted (10-OCT-1997) Physiology and Biophysics, Mayo Foundation, 200 1st Street SW, Rochester, MN 55905, USA FEATURES Location/Qualifiers source 1..1104 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="lens epithelium" gene 1..1104 /gene="HKvbeta2.1" CDS 1..1104 /gene="HKvbeta2.1" /codon_start=1 /product="potassium channel beta 2 subunit" /db_xref="PID:g2599568" /translation="MYPESTTGSPARLSLRQTGSPGMIYSTRYGSPKRQLQFYRNLGK SGLRVSCLGLGTWVTFGGQITDEMAEQLMTLAYDNGINLFDTAEVYAAGKAEVVLGNI IKKKGWRRSSLVITTKIFWGGKAETERGLSRKHIIEGLKASLERLQLEYVDVVFANRP DPNTPMEETVRAMTHVINQGMAMYWGTSRWSSMEIMEAYSVARQFNLTPPICEQAEYH MFQREKVEVQLPELFHKIGVGAMTWSPLACGIVSGKYDSGIPPYSRASLKGYQWLKDK ILSEEGRRQQAKLKELQAIAERLGCTLPQLAIAWCLRNEGVSSVLLGASNADQLMENI GAIQVLPKLSSSIIHEIDSILGNKPYSKKDYRS" BASE COUNT 257 a 314 c 331 g 202 t ORIGIN 1 atgtatccag aatcaacgac gggctccccg gctcggctct cgctgcggca gacgggctcc 61 cccgggatga tctacagtac tcggtatggg agtcccaaaa gacagctcca gttttacagg 121 aacctgggca agtctggcct gcgggtctcc tgcctgggac ttggaacatg ggtgaccttc 181 ggaggccaga tcaccgatga gatggcagag cagctcatga ccttggccta tgataatggc 241 atcaacctct tcgatacagc agaagtctac gcagccggca aggctgaagt ggtactggga 301 aacatcatta agaagaaagg atggaggcgg tccagcctcg tcatcaccac caagatcttc 361 tggggcggaa aggcggagac ggagcggggc ctgtccagga agcacataat cgaaggtctg 421 aaagcttccc tggagcgact gcagctggag tacgtggatg tggtgtttgc caaccgcccg 481 gaccccaaca ccccgatgga agagaccgtc cgcgccatga cccacgtcat caaccagggg 541 atggccatgt actggggcac gtcacgctgg agctccatgg agatcatgga ggcctactcc 601 gtggcccggc agttcaacct gaccccgccc atctgcgagc aggctgagta ccacatgttc 661 cagcgtgaga aagtggaggt gcagctgccg gagctgttcc acaagatagg agtgggcgcc 721 atgacctggt cccctctggc ctgtggcatt gtttctggca agtacgacag tggcatccca 781 ccctactcaa gagcctcctt gaagggctac cagtggctga aggacaagat cctcagtgag 841 gagggccggc gccagcaagc caagctgaag gagctgcagg ccatcgccga gcgcctgggc 901 tgcaccctgc cccagctggc catagcctgg tgcctgagga atgagggagt cagctccgtg 961 ctcctggggg cctccaatgc ggaccagctc atggagaaca ttggggcaat acaggtcctt 1021 ccgaaactgt cgtcttccat tatccacgag attgatagta ttttgggcaa taaaccctac 1081 agcaaaaagg actacagatc ctaa // LOCUS AF029890 605 bp mRNA PRI 07-JAN-1998 DEFINITION Homo sapiens hepatitis B virus X interacting protein (XIP) mRNA, complete cds. ACCESSION AF029890 NID g2745882 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 605) AUTHORS Melegari,M., Scaglioni,P.P. and Wands,J.R. TITLE Cloning and characterization of a novel hepatitis B virus X binding protein that inhibits viral replication JOURNAL J. Virol. (1998) In press REFERENCE 2 (bases 1 to 605) AUTHORS Melegari,M., Scaglioni,P.P. and Wands,J.R. TITLE Direct Submission JOURNAL Submitted (14-OCT-1997) Molecular Hepatology Lab., Massachusetts General Hospital Cancer Center, MGH East Bldg 149 13th Street Room 7308, Charlestown, MA 02129, USA FEATURES Location/Qualifiers source 1..605 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /cell_line="HepG2" /cell_type="hepatoblastoma" gene 1..605 /gene="XIP" CDS 56..331 /gene="XIP" /function="binding to hepatitis B virus X protein and down-regulating HBV replication" /codon_start=1 /product="hepatitis B virus X interacting protein" /db_xref="PID:g2745883" /translation="MEATLEQHLEDTMKNPSIVGVLCTDSQGLNLGCRGTLSDEHAGV ISVLAQQAAKLTSDPTDIPVVCLESDNGNIMIQKHDGITVAVHKMAS" BASE COUNT 164 a 121 c 150 g 170 t ORIGIN 1 tggagaagga cgtgccgtgc cgctgggttc tgagccggag tggtcggtgg gtgggatgga 61 ggcgaccttg gagcagcact tggaagacac aatgaagaat ccctccattg ttggagtcct 121 gtgcacagat tcacaaggac ttaatctggg ttgccgcggg accctgtcag atgagcatgc 181 tggagtgata tctgttctag cccagcaagc agctaagcta acctctgacc ccactgatat 241 tcctgtggtg tgtctagaat cagataatgg gaacattatg atccagaaac acgatggcat 301 cacggtggca gtgcacaaaa tggcctcttg atgctcatat ctgttcttca gcagcctgtc 361 ataggaactg gatcctacct atgttaatta ccttatagaa ctactaaagt tccagtagtt 421 aggccattca tttaatgtgc attaggcact tttctgttta tttaagagtc aattgcttct 481 aatgctctat ggaccgctat caagatatta gtaagaaagg atcatgtttt gaagcagcag 541 gtccaggtca ctttgtaata tagaattttg ctgtattcaa taaatctgtt tggaggaaaa 601 aaaaa // LOCUS AF029893 2011 bp mRNA PRI 13-JAN-1998 DEFINITION Homo sapiens i-beta-1,3-N-acetylglucosaminyltransferase mRNA, complete cds. ACCESSION AF029893 NID g2745740 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2011) AUTHORS Sasaki,K., Kurata-Miura,K., Ujita,M., Angata,K., Nakagawa,S., Sekine,S., Nishi,T. and Fukuda,M. TITLE Expression cloning of cDNA encoding a human beta-1,3-N-acetylglucosaminyltransferase that is essential for poly-N-acetyllactosamine synthesis JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (26), 14294-14299 (1997) MEDLINE 98070745 REFERENCE 2 (bases 1 to 2011) AUTHORS Sasaki,K., Kurata-Miura,K., Ujita,M., Angata,K., Nakagawa,S., Sekine,S., Nishi,T. and Fukuda,M. TITLE Direct Submission JOURNAL Submitted (13-OCT-1997) Glycobiology Program, The Burnham Institute, 10901 North Torrey Pines Road, La Jolla, CA 92037, USA FEATURES Location/Qualifiers source 1..2011 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 80..1327 /note="glycosyltransferase; poly-N-acetyllactosamine extension enzyme i-antigen; iGnT" /codon_start=1 /product="i-beta-1,3-N-acetylglucosaminyltransferase" /db_xref="PID:g2745741" /translation="MQMSYAIRCAFYQLLLAALMLVAMLQLLYLSLLSGLHGQEEQDQ YFEFFPPSPRSVDQVKAQLRTALASGGVLDASGDYRVYRGLLKTTMDPNDVILATHAS VDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPDMRARVAMHLVCP SRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNNLLRNLARE GANYALVIDVDMVPSEGLWRGLREMLDQSNQWGGTALVVPAFEIRRARRMPMNKNELV QLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDPWEPFYVAGGK VPTFDERFRQYGFNRISQACELHVAGFDFEVLNEGFLVHKGFKEALKFHPQKEAENQH NKILYRQFKQELKAKYPNSPRRC" BASE COUNT 425 a 546 c 596 g 444 t ORIGIN 1 gcggtaaatc cgggcttgcg gccgctggcg tagtctgtgg ccgggtggtc gttgctgcgc 61 gccccgagcc ccgagagcca tgcagatgtc ctacgccatc cggtgcgcct tctaccagct 121 gctgctggcc gcgctcatgc tggtggcgat gctgcagctg ctctacctgt cgctgctgtc 181 cggactgcac gggcaggagg agcaagacca atattttgag ttctttcccc cgtccccacg 241 gtccgtggac caggtcaagg cgcagctccg caccgcgctg gcctctggag gcgtcctgga 301 cgctagcggc gattaccgcg tctacagggg cctgctgaag accaccatgg accccaacga 361 tgtgatcctg gccacgcacg ccagcgtgga caacctgctg cacctgtcgg gtctgctgga 421 gcgctgggag ggcccgctgt ccgtgtcggt gttcgcggcc accaaggagg aggcgcagct 481 ggccacggtg ctggcctacg cgctgagcag ccactgcccc gacatgcgcg ccagggtcgc 541 catgcacctc gtgtgcccct cgcgttacga ggcagccgtg cccgaccccc gggagccggg 601 ggagtttgcc ctgctgcggt cctgccagga ggtctttgac aagctagcca gggtggccca 661 gcccgggatt aattatgcgc tgggcaccaa tgtctcctac cccaataacc tgctgaggaa 721 tctggctcgt gagggggcca actatgccct ggtgatcgat gtggacatgg tgcccagcga 781 ggggctgtgg agaggcctgc gggaaatgct ggatcagagc aaccagtggg gaggcaccgc 841 gctggtggtg cctgccttcg aaatccgaag agcccgccgc atgcccatga acaaaaacga 901 gctggtgcag ctctaccagg ttggcgaggt gcggcccttc tattatgggt tgtgcacccc 961 ctgccaggca cccaccaact attcccgctg ggtcaacctg ccggaagaga gcttgctgcg 1021 gcccgcctac gtggtacctt ggcaggaccc ctgggagcca ttctacgtgg caggaggcaa 1081 ggtgcccacc ttcgacgagc gctttcggca gtacggcttc aaccgaatca gccaggcctg 1141 cgagctgcat gtggcggggt ttgattttga ggtcctgaac gaaggtttct tggttcataa 1201 gggcttcaaa gaagcgttga agttccatcc ccaaaaggag gctgaaaatc agcacaataa 1261 gatcctatat cgccagttca aacaggagtt gaaggccaag taccccaact ctccccgacg 1321 ctgctgagcc cttccctccc ctaatctgag aagtcagcct cttggctcct caggccacca 1381 tttaggcctg actggggtaa gaaatgtcgc tccactttac agaggtagct gtggtgttga 1441 aacactggac ttggatatgg ggtgctggga tcgattccta gctttaccac taactagctg 1501 tgtggccttg agtaaatccc gttacctctc tgagcctcgg ttaccctgtc tgtaaaaagg 1561 gaggtgagaa tacctacctc acggaactgt tgggaggctc agatgagatg ctatatgtga 1621 aaacattctg taagcttcgt acaaatgtga agtattaata ttatcgcagt attattgttg 1681 ttattattat tgttattatt aacaatcttg ggtgggtagt aggagagcaa aaagtatgaa 1741 tgggatggag ctaagaagtc tgaatactta atgaaatgga ctttttggaa agaaatcaga 1801 tgaaggcata aaatttagtt cttagctctt gaacagaagc ctaaaattcc tggttctctc 1861 agggcttcgc cttcaagggt tctggaggag ggaagggtct gcaggttcca tgggtgacag 1921 cctgagatct gtcccttcaa cgggctgggc tgggtatgtg cctaccgatg acaatgtgta 1981 aataaatgcg tgttcacacc cacaaaaaaa a // LOCUS AF029899 2442 bp mRNA PRI 02-JAN-1998 DEFINITION Homo sapiens ADAM 20 mRNA, complete cds. ACCESSION AF029899 NID g2739134 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2442) AUTHORS Hooft van Huijsduijnen,R. TITLE ADAM 20 and 21; two novel human testis-specific membrane metalloproteases with similarity to fertilin-alpha JOURNAL Gene (1998) In press REFERENCE 2 (bases 1 to 2442) AUTHORS Hooft van Huijsduijnen,R. TITLE Direct Submission JOURNAL Submitted (15-OCT-1997) Immunoregulation, Geneva Biomedical Research Institute, 14 chemin des Aulx, Plan-les-Ouates/Geneva 1205, Switzerland FEATURES Location/Qualifiers source 1..2442 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="14" /map="14q24.1" /note="linked to SHGC-36001" CDS 29..2209 /note="contains disintegrin and metalloprotease-like domains; testis-specific metalloprotease-like membrane protein" /codon_start=1 /product="ADAM 20" /db_xref="PID:g2739135" /translation="MAVGEPLVHIRVTLLLLWLGMFLSISGHSQARPSQYFTSPEVVI PLKVISRGRGAKAPGWLSYSLRFGGQRYIVHMRVNKLLFAAHLPVFTYTEQHALLQDQ PFIQDDWYYHGYVEGVPESLVALSTCSGGFLGMLQINDLVYEIKPISVSATFEHLVYK IDSDDTQFPPMRCGLTEEKIAHQMELQLSYNFTLKQSSFVGWWTHQRFVELVVVVDNI RYLFSQSNATTVQHEVFNVVNIVDSFYHPLEVDVILTGIDIWTASNPLPTSGDLDNVL EDFSIWKNYNLNNRLQHDVAHLFIKDTQGMKLGVAYVKGICQNPFNTGVDVFEDNRLV VFAITLGHELGHNLGMQHDTQWCVCELQWCIMHAYRKVTTKFSNCSYAQYWDSTISSG LCIQPPPYPGNIFRLKYCGNLVVEEGEECDCGTIRQCAKDPCCLLNCTLHPGAACAFG ICCKDCKFLPSGTLCRQQVGECDLPEWCNGTSHQCPDDVYVQDGISCNVNAFCYEKTC NNHDIQCKEIFGQDARSASQSCYQEINTQGNRFGHCGIVGTTYVKCWTPDIMCGRVQC ENVGVIPNLIEHSTVQQFHLNDTTCWGTDYHLGMAIPDIGEVKDGTVCGPEKICIRKK CASMVHLSQACQRKTCNMRGICNNKQHCHCNHEWAPPYCKDKGYGGSADSGPPPKNNM EGLNVMGKLRYLSLLCLLPLVAFLLFCLHVLFKKRTKSKEDEEG" BASE COUNT 702 a 465 c 569 g 706 t ORIGIN 1 cagatggctc cataatgaca gcttcataat ggcagtgggt gagcccctgg tgcacatcag 61 ggtcactctt ctgctgctct ggttggggat gtttttgtct atttctggcc actctcaggc 121 caggccctcc cagtatttca cttctccaga agtggtgatc cctttgaagg tgatcagcag 181 gggcagaggt gcaaaggctc ctggatggct ctcctatagc ctgcggtttg ggggacagag 241 atacattgtc cacatgaggg taaataagct gttgtttgct gcacaccttc ctgtgttcac 301 ctacacagag cagcatgccc tgctccagga tcagcccttc atccaggatg actggtacta 361 ccatggttat gtggaggggg tccctgagtc cttggttgcc cttagtacct gttctggggg 421 ctttcttgga atgctacaga taaatgacct tgtttatgaa atcaagccaa ttagtgtttc 481 tgccacattt gaacacctag tatataagat agacagtgat gatacacagt ttccacctat 541 gagatgtggg ttaacagaag agaaaatagc acaccagatg gagttgcaat tgtcatataa 601 tttcactctg aagcaaagtt cttttgtggg ctggtggacc catcagcggt ttgttgagct 661 ggtagtggtc gtggataata ttagatatct tttctctcaa agtaatgcaa caacagtgca 721 gcatgaagta tttaacgttg tcaatatagt ggattccttc tatcatcctt tggaggttga 781 tgtaattttg actggaattg atatatggac tgcatcaaat ccacttccta ccagtggaga 841 cctagataat gttttagagg acttttctat ttggaagaat tataacctta ataatcgact 901 acaacatgat gttgcacatc ttttcataaa agacacacaa ggcatgaagc ttggtgttgc 961 ctatgttaaa ggaatatgcc agaatccttt taatactgga gttgatgttt ttgaagacaa 1021 caggttggtc gtttttgcaa ttactttggg ccacgagctt ggtcataatt tgggtatgca 1081 acatgacacc cagtggtgtg tgtgcgagct acagtggtgc ataatgcatg cctatagaaa 1141 ggtgacaact aaatttagca actgcagtta tgcccaatat tgggacagta ctatcagtag 1201 tggattatgt attcaaccgc ctccatatcc agggaatata tttagactga agtactgtgg 1261 gaatctagtg gttgaagaag gggaggaatg tgactgtgga accatacggc agtgtgcaaa 1321 agatccctgt tgtctgttaa actgtactct acatcctggg gctgcttgtg cttttggaat 1381 atgttgcaaa gactgcaaat ttctgccatc aggaacttta tgtagacaac aagttggtga 1441 atgtgacctt ccagagtggt gcaatgggac atcccatcaa tgcccagatg atgtgtatgt 1501 gcaggacggg atctcctgta atgtgaatgc cttctgctat gaaaagacgt gtaataacca 1561 tgatatacaa tgtaaagaga tttttggcca agatgcaagg agtgcatctc agagttgcta 1621 ccaagaaatc aacacccaag gaaaccgttt cggtcactgt ggtattgtag gcacaacata 1681 tgtaaaatgt tggacccctg atatcatgtg tgggagggtt cagtgtgaaa atgtgggagt 1741 aattcccaat ctgatagagc attctacagt gcagcagttt cacctcaatg acaccacttg 1801 ctggggcact gattatcatt tagggatggc tatacctgat attggtgagg tgaaagatgg 1861 cacagtatgt ggtccagaaa agatctgcat ccgtaagaag tgtgccagta tggttcatct 1921 gtcacaagcc tgtcagcgta agacctgcaa catgagggga atctgcaaca acaaacaaca 1981 ctgtcactgc aaccatgaat gggcaccccc atactgcaag gacaaaggct atggaggtag 2041 tgctgatagt ggcccacctc ctaagaacaa catggaagga ttaaatgtga tgggaaagtt 2101 gcgttacctg tcactattgt gccttcttcc tttggttgct tttttattat tttgcttaca 2161 tgtgcttttt aagaaacgca caaaaagtaa agaagatgaa gaaggataag agaaatggga 2221 aaaagaagga gactaaactt tatacttcat ttttaatatc caatttttta atagaaaaat 2281 atgaagccat gtctcactgt ttaaataaaa cttcatggac atttcatgtc aggattgcaa 2341 gcattagcta tcacagcaaa ggattcctag cctattctta cttactttac agtgtcttaa 2401 gcaatattaa aggttccttt tcccaaaaaa aaaaaaaaaa aa // LOCUS AF029914 2239 bp mRNA PRI 12-NOV-1997 DEFINITION Homo sapiens oscillin (hLn) mRNA, complete cds. ACCESSION AF029914 NID g2605948 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2239) AUTHORS Hirata,S., Koh,T. and Hoshi,K. TITLE Direct Submission JOURNAL Submitted (14-OCT-1997) Obstetrics and Gynecology, Yamanashi Medical University, Shimokato 1110, Tamaho, Nakakoma, Yamanashi 409-38, Japan FEATURES Location/Qualifiers source 1..2239 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="testis" gene 1..2239 /gene="hLn" CDS 14..883 /gene="hLn" /note="sperm factor" /codon_start=1 /product="oscillin" /db_xref="PID:g2605949" /translation="MKLIILEHYSQASEWAAKYIRNRIIQFNPGPEKYFTLGLPTGST PLGCYKKLIEYYKNGDLSFKYVKTFNMDEYVGLPRDHPESYHSFMWNNFFKHIDIHPE NTHILDGNAVDLQAECDAFEEKIKAAGGIELFVGGIGPDGHIAFNEPGSSLVSRTRVK TLAMDTILANARFFDGELTKVPTMALTVGVGTVMDAREVMILITGAHKAFALYKAIEE GVNHMWTVSAFQQHPRTVFVCDEDATLELKVKTVKYFKGLMLVHNKLVDPLYSIKEKE TEKSQSSKKPYSD" BASE COUNT 567 a 517 c 546 g 609 t ORIGIN 1 cgtccgtgac aagatgaagc tcatcatcct ggagcactat tctcaggcga gcgagtgggc 61 ggctaaatac atcaggaacc gtatcatcca gtttaaccca gggccagaga agtacttcac 121 cctggggctc cccactggga gtaccccact tggctgctac aagaagctga ttgaatacta 181 taagaatggg gacctgtcct ttaaatatgt gaagaccttc aacatggatg agtacgtggg 241 ccttcctcga gaccacccgg agagttacca ctccttcatg tggaacaact tcttcaagca 301 cattgacatc cacccagaaa acacccacat tctggatggg aatgcagtcg acctacaggc 361 agaatgtgat gcctttgaag agaagatcaa ggctgcaggt gggatcgagc tatttgttgg 421 aggcatcggc cctgatggac acattgcctt caacgagcca ggctccagtc tggtgtccag 481 gacccgtgtg aagacgctgg ccatggatac catcctggcc aatgctaggt tcttcgatgg 541 agaactcacc aaggtgccca ccatggcctt gacggtgggg gtgggcactg tcatggatgc 601 tagagaggtg atgatcctta tcacaggtgc tcacaaggca tttgctctgt acaaggccat 661 cgaggaggga gtgaaccaca tgtggaccgt gtctgccttc cagcagcatc cccgcaccgt 721 gtttgtgtgt gacgaggatg ccaccttgga gctgaaagtg aagactgtca agtatttcaa 781 aggtttaatg cttgttcata acaagttggt ggaccccttg tacagtatca aagagaaaga 841 aactgagaaa agccaatctt cgaagaaacc atacagcgat tagcctgtgc tgggacctag 901 tgtcaagtac ccatagggaa aggcaggtct ttctggaaat tgtctttaga agaaagaatt 961 gtatttcttt aatctagtat ggttactcca gataagtggg tgaacttatt gttcttggcc 1021 atgaggctgg gagcctagtc acggagttta gctataggga gaatgtttgt aacttaatca 1081 gaaaaaaaat atctgcaaaa tgtactccat cattttgatg tctgccaaac ccaggttggg 1141 agttttaaac tttttgttct gcttcagcca tggttcacac atatgacaca ctcccgtcag 1201 gaatttctct ccttacacgc actgattttc aagtgggagg gaattagggg cttatgtata 1261 ttggatacca cctcttgaga gtccttcttg cacaggcctg ccccttggtt gagaaccatt 1321 gttccaagtg aaggcacaaa ctctcaatat ctaaaataag tgcaaggaag cagtctcttt 1381 ggtcagtaac aagtgcaatg gaaagaaaac gatcccttcc ttcttccact ttcacagctt 1441 ttctctgaac taggagaacc tgggggtgga tttgggtggg tggggccaaa gaggaggctt 1501 ctattgataa atccagagcc tcaaggggcc cagccacgtc aaacttctct ccctcaggga 1561 ctctccagca ccaaaaggca gaaggtggaa gccgtttttc ccccagagcc ctgtgttttt 1621 gtgaaaggcc tcactgtggc tcctctgttt tacatactca ttagtaagtg ggaggtccac 1681 tggggcaaca gacactgcca caatttcagt gttgtgttca gccaagggga cggtctggac 1741 aggcagctta agtgtgagtt tagtcacaac tcctgagtgt cccgctctcc tgcttaccta 1801 ggaggtgagt gccaggaaaa tacaccaaat gcttctagta ttgtttcccc acttaaaata 1861 gtcctgctta aattcacatg gtgtggtctg atgttctgag agcatcagga aatacaaccc 1921 ttttgcccat ttacccttct ccccggatcc caaggtggtc tgttgctctg gcttcctttc 1981 attgtcttag gccttcatgg agtggatgct gcctcctcct ggctgttttt gtgcctgttt 2041 gaagctactg ctgcctccat ttctgggaaa gacctttgag agcctagccc aggcctaagg 2101 gctatgtttg gtaccagtgt tttgtcttta gcttttctat gtgattgtgc cgtcattctg 2161 ttttaagctc atggatcaat ggatttgttt acaatgtgat attttctatt aaatccagta 2221 ttttcaaaaa aaaaaaaaa // LOCUS AF030099 1306 bp mRNA PRI 21-DEC-1997 DEFINITION Homo sapiens TWEAK mRNA, complete cds. ACCESSION AF030099 NID g2707218 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1306) AUTHORS Chicheportiche,Y., Bourdon,P.R., Xu,H., Hsu,Y.M., Scott,H., Hession,C., Garcia,I. and Browning,J.L. TITLE TWEAK, a new secreted ligand in the tumor necrosis factor family that weakly induces apoptosis JOURNAL J. Biol. Chem. 272 (51), 32401-32410 (1997) MEDLINE 98070415 REFERENCE 2 (bases 1 to 1306) AUTHORS Bourdon,P., Hession,C., Tizard,R. and Browning,J. TITLE Direct Submission JOURNAL Submitted (14-OCT-1997) Cell Biology, Biogen, 12 Cambridge Center, Cambridge, MA 02142, USA FEATURES Location/Qualifiers source 1..1306 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17" /map="17p13" /tissue_type="tonsil" CDS 18..767 /note="ligand in the TNF family; secreted protein; start codon not verified experimentally" /codon_start=1 /product="TWEAK" /db_xref="PID:g2707219" /translation="MAARRSQRRRGRRGEPGTALLVPLALGLGLALACLGLLLAVVSL GSRASLSAQEPAQEELVAEEDQDPSELNPQTEESQDPAPFLNRLVRPRRSAPKGRKTR ARRAIAAHYEVHPRPGQDGAQAGVDGTVSGWEEARINSSSPLRYNRQIGEFIVTRAGL YYLYCQVHFDEGKAVYLKLDLLVDGVLALRCLEEFSATAASSLGPQLRLCQVSGLLAL RPGSSLRIRTLPWAHLKAAPFLTYFGLFQVH" BASE COUNT 247 a 434 c 368 g 257 t ORIGIN 1 cacagccccc cgcccccatg gccgcccgtc ggagccagag gcggaggggg cgccgggggg 61 agccgggcac cgccctgctg gtcccgctcg cgctgggcct gggcctggcg ctggcctgcc 121 tcggcctcct gctggccgtg gtcagtttgg ggagccgggc atcgctgtcc gcccaggagc 181 ctgcccagga ggagctggtg gcagaggagg accaggaccc gtcggaactg aatccccaga 241 cagaagaaag ccaggatcct gcgcctttcc tgaaccgact agttcggcct cgcagaagtg 301 cacctaaagg ccggaaaaca cgggctcgaa gagcgatcgc agcccattat gaagttcatc 361 cacgacctgg acaggacgga gcgcaggcag gtgtggacgg gacagtgagt ggctgggagg 421 aagccagaat caacagctcc agccctctgc gctacaaccg ccagatcggg gagtttatag 481 tcacccgggc tgggctctac tacctgtact gtcaggtgca ctttgatgag gggaaggctg 541 tctacctgaa gctggacttg ctggtggatg gtgtgctggc cctgcgctgc ctggaggaat 601 tctcagccac tgcggccagt tccctcgggc cccagctccg cctctgccag gtgtctgggc 661 tgttggccct gcggccaggg tcctccctgc ggatccgcac cctcccctgg gcccatctca 721 aggctgcccc cttcctcacc tacttcggac tcttccaggt tcactgaggg gccctggtct 781 ccccacagtc gtcccaggct gccggctccc ctcgacagct ctctgggcac ccggtcccct 841 ctgccccacc ctcagccgct ctttgctcca gacctgcccc tccctctaga ggctgcctgg 901 gcctgttcac gtgttttcca tcccacataa atacagtatt cccactctta tcttacaact 961 cccccaccgc ccactctcca cctcactagc tccccaatcc ctgacccttt gaggccccca 1021 gtgatctcga ctcccccctg gccacagacc cccagggcat tgtgttcact gtactctgtg 1081 ggcaaggatg ggtccagaag accccacttc aggcactaag aggggctgga cctggcggca 1141 ggaagccaaa gagactgggc ctaggccagg agttcccaaa tgtgaggggc gagaaacaag 1201 acaagctcct cccttgagaa ttccctgtgg atttttaaaa cagatattat ttttattatt 1261 attgtgacaa aatgttgata aatggatatt aaatagaata agtcag // LOCUS AF030107 1568 bp mRNA PRI 09-NOV-1997 DEFINITION Homo sapiens regulator of G protein signaling (RGS13) mRNA, complete cds. ACCESSION AF030107 NID g2598184 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1568) AUTHORS Chatterjee,T.K. and Fisher,R.A. TITLE Molecular cloning of human RGS13 cDNA JOURNAL Unpublished REFERENCE 2 (bases 1 to 1568) AUTHORS Chatterjee,T.K. and Fisher,R.A. TITLE Direct Submission JOURNAL Submitted (01-OCT-1997) Pharmacology, University of Iowa, Iowa City, IA 52242, USA FEATURES Location/Qualifiers source 1..1568 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="lung" gene 1..1568 /gene="RGS13" CDS 304..783 /gene="RGS13" /note="regulator of G protein signaling" /codon_start=1 /product="RGS13" /db_xref="PID:g2598185" /translation="MSRRNCWICKMCRDESKRPPSNLTLEEVLQWAQSFENLMATKYG PVVYAAYLKMEHSDENIQFWMACETYKKIASRWSRISRAKKLYKIYIQPQSPREINID SSTRETIIRNIQEPTETCFEEAQKIVYMHMERDSYPRFLKSEMYQKLLKTMQSNNSF" BASE COUNT 580 a 244 c 264 g 479 t 1 others ORIGIN 1 ggcggccggg ccacgtcgta gcgaggtcag agtgccatct aaggtaatta tagagacagt 61 aaaattcttt tactctggga aaaataaaat gctgggtgtc tcacaaaatt tcagaacctg 121 atttcaaacg gatcataaca aagaggagat caaatttagc atggtggact gctcgacagg 181 atatatttgt caatggaatg tttccacata ttataccacc aacatgagaa aaaaatgatc 241 attgtttatt tgaagcttga tgatattcta acgctgcctt ttctcttctc attttagaga 301 aaaatgagca ggcggaattg ttggatttgt aagatgtgca gagatgaatc taagaggccc 361 ccttcaaacc ttactttgga ggaagtatta cagtgggccc agtcttttga aaatttaatg 421 gctacaaaat atggtccagt agtctatgca gcatatttaa aaatggagca cagtgacgag 481 aatattcaat tctggatggc atgtgaaacc tataagaaaa ttgcctcacg gtggagcaga 541 atttctaggg caaagaagct ttataagatt tacatccagc cacagtcccc tagagagatt 601 aacattgaca gttcgacaag agagactatc atcaggaaca ttcaggaacc cactgaaaca 661 tgttttgaag aagctcagaa aatagtctat atgcatatgg aaagggattc ctaccccaga 721 tttctaaagt cagaaatgta ccaaaaactt ttgaaaacta tgcagtccaa caacagtttc 781 tgactacaac tcaaaagttt aaatagaaaa cagtatattg aaagtggtgg gtttgatctt 841 tttatttaga aacccacaaa atcagaaaca cagtacaaat aaaacagaaa tcaaactata 901 agttgacttt tagttcctaa aaagaaacat atttcaaaag caatggaatc tagaattctt 961 ataacatgaa taacaaaatg tacagcaagc ctatgtagtt caattaatat ataaggaaaa 1021 ggaaggtctt tcttcatgat acaagcatta taaagttttt actgtagtag tcaattaatg 1081 gatatttcct tgttaataaa attttgtgtc ataatttaca aattagttct ttaaaaattg 1141 ttgttatatg aattgtgttt ctagcatgaa tgttctatag agtactctaa ataacttgaa 1201 tttatagaca aatgctactc acagtacaat caattgtatt ataccatgag aaaatcaaaa 1261 aggtgttctt cagagacatt ttatctataa aattttccta ctattatgtt cattaacaaa 1321 cttctttatc acatgtatct tctacgtgta aaacatttct gatgattttt taacaaaaaa 1381 tatatgaatt tcttcatttg ctcttgcatc tacattgcta taanggatat aaaatgtggt 1441 ttctatattt tgagatgttt tttccttaca atgtgaactc atcgtgatct tggaaatcaa 1501 taaagtcaaa tatcaactga aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aagcggccgc 1561 tgaattct // LOCUS AF030109 2906 bp mRNA PRI 12-NOV-1997 DEFINITION Homo sapiens regulator of G protein signaling 12 (RGS12) mRNA, complete cds. ACCESSION AF030109 NID g2605779 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2906) AUTHORS Chatterjee,T.K. and Fisher,R.A. TITLE Direct Submission JOURNAL Submitted (01-OCT-1997) Pharmacology, University of Iowa, Iowa City, IA 52242, USA FEATURES Location/Qualifiers source 1..2906 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="4" /map="4p16.3" /tissue_type="brain" gene 1..2906 /gene="RGS12" CDS 272..2671 /gene="RGS12" /codon_start=1 /product="regulator of G protein signaling 12" /db_xref="PID:g2605780" /translation="MNLGKELSNETHVSNDQQSATVSDGELTGADLKDCVSNNSLSSN ASLPSVQSCRRLRERRVASWAVSFERLLQDPVGVRYFSDFLRKEFSEENILFWQACEY FNHVPAHDKKELSYRAREIFSKFLCSKATTPVNIDSQAQLADDVLRAPHPDMFKEQQL QIFNLMKFDSYTRFLKSPLYQECILAEVEGRALPDSQQVPSSPASKHSLGSDHSSVST PKKLSGKSKSGRSLNEELGDEDSEKKRKGAFFSWSRTRSTGRSQKKREHGDHADDALH ANGGLCRRESQGSVSSAGSLDLSEACRTLAPEKDKATKHCCIHLPDGTSCVVAVKAGF SIKDILSGLCERHGINGAAADLFLVGGDKPLVLHQDSSILESRDLRLEKRTLFRLDLV PINRSVGLKAKPTKPVTEVLRPVVARYGLDLSGLLVRLSGEKEPLDLGAPISSLDGQR VVLEEKDPSRGKASADKQKGVPVKQNTAVNSSSRNHSATGEERTLGKSNSIKIKGENG KNARDPRLSKREESIAKIGKKKYQKINLDEAEEFFELISKAQSNRADDQRGLLRKEDL VLPEFLRLPPGSTELTLPTPAAVAKGFSKRSATGNGRESASQPGEQWEPVQESSDSPS TSPGSASSPPGPPGTTPPGQKSPSGPFCTPQSPVSLAQEGTAQIWKRQSQEVEAGGIQ TVEDEHVAELTLMGEGDISSPNSTLLPPPSTPQEVPGPSRPGSGTHGSRDLPVNRIID VDLVTGSAPGRDGGIAGAQAGPGRSQASGGPPTSDLPGLGPVPGEPAKPKTSAHHATF V" polyA_site 2891 /gene="RGS12" BASE COUNT 679 a 826 c 863 g 538 t ORIGIN 1 attcttctga tttggtagag gaaatagttt aaaaatgatc tcacagcgtg atttatgtaa 61 acggcgcttc tttctttgga tgagacaatt gagatagagt ctgaaaccct ggcaagccac 121 agcttcctgc agtgtcctgt tgccatgggt tacgaaggga gcgagaggga acttcatcgg 181 aaatgccttt aaacttttct cacacgcaca agctgcggtg ttgaatggtg tgtcttagac 241 ccgggtgcct agtgtggctc ggtgccttac aatgaatttg gggaaagagt tgtcaaacga 301 aacccatgtt tctaatgacc agcagtctgc aactgtgtct gatggcgagt tgacgggcgc 361 cgacctgaag gactgcgtca gcaacaacag cctgagcagc aatgccagcc tccccagcgt 421 gcagagctgc cggcgcctgc gtgagaggag ggtcgccagc tgggccgtgt cctttgagcg 481 cctgctgcag gaccccgtcg gtgtccgcta cttctctgat tttctaagga aagaattcag 541 tgaagaaaac attttattct ggcaggcctg tgaatatttt aatcatgttc ctgcacatga 601 caaaaaggag ctttcctaca gggcccggga gattttcagt aagtttctct gcagcaaagc 661 caccaccccg gtcaacatcg acagccaggc ccagctagca gacgacgtcc tccgcgcacc 721 tcacccagac atgttcaagg agcagcagct gcagatcttc aatctcatga agtttgatag 781 ctacactcgc tttctgaagt ccccgctgta ccaggaatgc atcctggcgg aagtggaggg 841 ccgtgcactc ccggactcgc agcaggtccc cagcagcccg gcttccaagc acagcctcgg 901 ttcagaccac tccagtgtgt ccacgccaaa aaagttaagt ggaaaatcaa aatccggccg 961 atccctgaat gaagagctgg gggatgagga cagcgagaag aagcggaaag gcgcgttttt 1021 ctcgtggtcg cggaccagga gcaccgggag gtcccagaaa aagagggagc acggggacca 1081 cgcagacgac gccctgcatg ccaatggagg cctgtgtcgc cgagagtcgc agggctctgt 1141 gtcctctgcg gggagcctgg acctgtcgga ggcctgcagg actttggcac ccgagaagga 1201 caaggccacc aagcactgct gcattcatct cccggatggg acatcctgcg tggtggctgt 1261 caaggcgggc ttctccatca aagacatcct gtccggactc tgtgagcggc atggcatcaa 1321 cggggcggcc gcggacctct tcctggtggg cggggacaag cctctggtgc tgcaccaaga 1381 cagtagcatc ttggagtcaa gggacctgcg cctagaaaag cgcaccttgt ttcggctgga 1441 tcttgttccg attaaccggt cagtgggact caaggccaag cccaccaagc ccgtcacgga 1501 ggtgctgcgg cccgtggtgg ccagatacgg cctggacctc agtggcctgc tggtgaggct 1561 gagtggagag aaggagcccc tggaccttgg cgcccctata tcgagtctgg acggacagcg 1621 ggttgtcttg gaggagaagg atccttccag aggaaaggca tccgcagata aacagaaagg 1681 tgtgccagtg aaacagaaca cagctgtaaa ttccagctcc agaaaccact cggctacggg 1741 agaggaaaga acactaggca agtctaattc tattaaaata aaaggagaaa atggaaaaaa 1801 tgctagggat ccccggcttt caaagagaga agaatctatt gcaaagattg ggaaaaaaaa 1861 atatcagaaa attaatttgg acgaagcaga ggagtttttt gagcttattt ccaaagctca 1921 gagcaacaga gcagatgacc aacgtgggct gctaaggaag gaagacctgg tgttgccaga 1981 gttcctccgt ttacctcctg gttccacaga actcaccctc cccactccag ctgctgtggc 2041 caagggcttt agcaagagaa gcgccacagg caacggccgg gagagcgcct cccagcctgg 2101 cgagcagtgg gagccagtcc aggagagcag cgacagtccg tccaccagcc cgggctcagc 2161 ctccagcccc cctggacctc ctgggacgac cccccccggg cagaagtctc ccagcgggcc 2221 cttctgcact ccccagtccc ccgtctccct cgcgcaggag ggcaccgccc agatctggaa 2281 gaggcagtct caggaagtgg aggccggggg catccagacg gtggaggatg agcacgtggc 2341 cgagctgacc ctgatggggg agggggacat cagcagcccc aacagcacct tgctgccgcc 2401 gccctccacc ccccaggaag tgccaggacc ttccagacca ggaagtggga cccatggcag 2461 ccgagacctc ccagtcaaca gaatcatcga tgtggatctt gtaactggct cggcgcccgg 2521 gcgggatggt ggcatagcgg gggcacaggc tggccctggg aggtcgcagg ccagtggtgg 2581 gcctcctaca tcagacctcc ctggcttggg ccccgtcccg ggtgagcctg ctaagcccaa 2641 gaccagcgct caccacgcca ccttcgtctg agctgccctg gcctggccaa ctctcctgtg 2701 gacatgtcgg ggtggggcag cccaggtgga ttctgtgggc ctcagggggg ccaccctggc 2761 caccacaccc tcaggagccc agccaggagg gcagggggtg acctcgctgg aggcactggc 2821 cccggacatt cgccatgctg gccatggggc tccctggccc tggcctcctg ctgcccaata 2881 aagcatttct gaaaaaaaaa aaaaaa // LOCUS AF030162 841 bp mRNA PRI 10-NOV-1997 DEFINITION Homo sapiens inner mitochondrial membrane translocase Tim23 (TIM23) mRNA, nuclear gene encoding mitochondrial protein, complete cds. ACCESSION AF030162 NID g2599128 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 841) AUTHORS Bauer,M.F., Hofmann,S., Milisav,I., Brunner,M., Neupert,W. and Gerbitz,K.D. TITLE Identification of the human mitochondrial inner membrane translocase, hTim23 JOURNAL Unpublished REFERENCE 2 (bases 1 to 841) AUTHORS Bauer,M.F., Hofmann,S., Milisav,I., Brunner,M., Neupert,W. and Gerbitz,K.D. TITLE Direct Submission JOURNAL Submitted (15-OCT-1997) Institut fuer Klinische Chemie, KH Muenchen-Schwabing, Koelner Platz 1, D-80804, Muenchen, Germany REMARK Institut fuer Physiologische Chemie, Universitaet Muenchen, Goethestr.33, D-80336, Muenchen, Germany FEATURES Location/Qualifiers source 1..841 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..841 /gene="TIM23" CDS 125..754 /gene="TIM23" /function="translocation of mitochondrial precursor proteins" /note="similar to S. cerevisiae TIM23, Swiss-Prot Accession Number P32897, and ESTs with GenBank Accession Numbers AA187506 and AA181883" /codon_start=1 /product="inner mitochondrial membrane translocase Tim23" /db_xref="PID:g2599129" /translation="MEGGGGSGNKTTGGLAGFFGAGGAGYSHADLAGVPLTGMNPLSP YLNVDPRYLVQDTDEFILPTGANKTRGRFELAFFTIGGCCMTGAAFGAMNGLRLGLKE TQNMAWSKPRNVQILNMVTRQGALWANTLGSLALLYSAFGVIIEKTRGAEDDLNTVAA GTMTGMLYKCTGGLRGIARGGLTGLTLTSLYALYNNWEHMKGSLLQQSL" BASE COUNT 212 a 180 c 239 g 208 t 2 others ORIGIN 1 gaaggtcagc gtgtgaagta ggcgctggca acgcggggtt accctgntnt attgaggagt 61 aacggcccag cggaccaccc aggcttgagg cagcggcggg aaccactcgg tttgctgcga 121 taccatggaa ggaggcgggg gaagcggcaa caaaaccaca gggggattgg ccggcttttt 181 cggagccggc ggagcaggtt actcgcacgc ggatttggct ggcgtcccgc taactggtat 241 gaaccctctg tctccttatt taaatgtgga tccacgatac ctcgtgcagg atacagatga 301 gtttatttta cctaccggag ctaataaaac ccggggcaga tttgagctgg ccttctttac 361 gattggagga tgttgcatga caggggctgc gtttggtgca atgaatggtc ttcggctagg 421 attgaaggaa acccagaaca tggcctggtc caaaccaaga aatgtacaga ttttgaatat 481 ggtgactagg caaggggcac tttgggctaa tactctaggt tctctggctt tgctctatag 541 tgcatttggt gtcatcattg agaaaacacg aggtgcagaa gatgacctta acacagtagc 601 agctggaacc atgacaggca tgttgtataa atgtacaggt ggtcttcgag ggatagcacg 661 aggtggtctg acaggactaa cacttaccag cctctatgca ctatataata actgggagca 721 catgaaaggc tccttgctcc aacagtcact ctgaagattt tgccaactca tgaatggagg 781 acacttcagt agttcatcta ggatcctttt attaaggaca gtttgggagt tatttctctc 841 t // LOCUS AF030165 1720 bp mRNA PRI 18-NOV-1997 DEFINITION Homo sapiens retinal fascin mRNA, complete cds. ACCESSION AF030165 NID g2623644 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1720) AUTHORS Tubb,B.E. and Bryan,J. TITLE Human Retinal Fascin cDNA Sequence JOURNAL Unpublished REFERENCE 2 (bases 1 to 1720) AUTHORS Tubb,B.E. and Bryan,J. TITLE Direct Submission JOURNAL Submitted (15-OCT-1997) Cell Biology, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..1720 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="retina" /dev_stage="adult" CDS 137..1615 /note="actin-bundling protein" /codon_start=1 /product="retinal fascin" /db_xref="PID:g2623645" /translation="MPTNGLHQVLKIQFGLVNDTDRYLTAESFGFKVNASAPSLKRKQ TWVLEPDPGQGTAVLLRSSHLGRYLSAEEDGRVACEAEQPGRDCRFLVLPQPDGRWVL RSEPHGRFFGGTEDQLSCFATAVSPAELWTVHLAIHPQAHLLSVSRRRYVHLCPREDE MAADGDKPWGVDALLTLIFRSRRYCLKSCDSRYLRSDGRLVWEPEPRACYTLEFKAGK LAFKDCDGHYLAPVGPAGTLKAGRNTRPGKDELFDLEESHPQVVLVAANHRYVSVRQG VNVSANQDDELDHETFLMQIDQETKKCTFYSSTGGYWTLVTHGGIHATATQVSANTMF EMEWRGRRVALKASNGRYVCMKKNGQLAAISDFVGKDEEFTLKLINRPILVLRGLDGF VCHHRGSNQLDTNRSVYDVFHLSFSDGAYRIRGRDGGFWYTGSHGSVCSDGERAEDFV FEFRERGRLAIRARSGKYLRGGASGLLRADADAPAGTALWEY" polyA_site 1664 BASE COUNT 352 a 562 c 547 g 259 t ORIGIN 1 cagggggttc gtgacgccgg ctgggtctgg gggctgtggg ccagccgagc cgacccgggc 61 ttctggggga ccgcgggggc cgtgagcact cagagggtgc atcccaggcc cctccgggga 121 cccggccagc ctgaagatgc cgacgaacgg cctgcaccag gtgctgaaga tccagtttgg 181 cctcgtcaac gacactgacc gctacctgac agctgagagc ttcggcttca aggtcaatgc 241 ctcggcaccc agcctcaaga ggaagcagac ctgggtgctg gaacccgacc caggacaagg 301 cacggctgtg ctgctccgca gcagccacct gggccgctac ctgtcggcag aagaggacgg 361 gcgcgtggcc tgtgaggcag agcagccggg ccgtgactgc cgcttcctgg tcctgccgca 421 gccagatggg cgctgggtgc tgcggtccga gccgcacggc cgcttcttcg gaggcaccga 481 ggaccagctg tcctgcttcg ccacagccgt ttccccggcc gagctgtgga ccgtgcacct 541 ggccatccac ccgcaggccc acctgctgag cgtgagccgg cggcgctacg tgcacctgtg 601 cccgcgggag gacgagatgg ccgcagacgg agacaagccc tggggcgtgg acgccctcct 661 caccctcatc ttccggagcc gacggtactg cctcaagtcc tgtgacagcc gctacctgcg 721 cagcgacggc cgtctggtct gggagcctga gccccgtgcc tgctacacgc tggagttcaa 781 ggcgggcaag ctggccttca aggactgcga cggccactac ctggcacccg tggggcccgc 841 aggcaccctc aaggccggcc gaaacacgcg acctggcaag gatgagctct ttgatctgga 901 ggagagtcac ccacaggtgg tgctggtggc tgccaaccac cgctacgtct ctgtgcggca 961 aggggtcaac gtctcagcca atcaggatga tgaactagac cacgagacct tcctgatgca 1021 aattgaccag gagacaaaga agtgcacctt ctattccagc actgggggct actggaccct 1081 ggtcacccat gggggcattc acgccacagc cacacaagtt tctgccaaca ccatgtttga 1141 gatggagtgg cgtggccggc gggtagcact caaagccagc aacgggcgct acgtgtgcat 1201 gaagaagaat gggcagctgg cggctatcag cgattttgtc ggcaaggacg aagagttcac 1261 cctcaagctc atcaaccggc ccatcctggt gctgcgcggc ctggacggct tcgtctgcca 1321 ccaccgcggc tccaaccagc tggacaccaa ccgctccgtc tacgacgtct tccacctgag 1381 cttcagcgac ggcgcctacc ggatccgagg ccgcgacgga gggttctggt acacgggcag 1441 ccacggcagc gtgtgcagcg acggcgaacg cgccgaggac ttcgtcttcg agttccgtga 1501 gcgcggccgc ctggccatcc gcgcccggag cggcaagtac ctgcgcggcg gcgcctcggg 1561 cctgctgcgg gccgatgccg acgccccggc cgggaccgcg ctttgggagt actgaggccg 1621 cgcccagacc agcctgtcgc gcattaaaac cgtgtctctc ccgcaaaaaa aaaaaaaaaa 1681 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa // LOCUS AF030177 2890 bp mRNA PRI 16-NOV-1997 DEFINITION Homo sapiens N-acetylglucosamyl transferase component Gpi1 (GPI1) mRNA, complete cds. ACCESSION AF030177 NID g2623157 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2890) AUTHORS Tiede,A., Schubert,J., Orlean,P. and Schmidt,R.E. TITLE Human and mouse Gpi1p homologs that restore GPI membrane anchor biosynthesis in yeast mutants JOURNAL Unpublished REFERENCE 2 (bases 1 to 2890) AUTHORS Tiede,A., Schubert,J., Orlean,P. and Schmidt,R.E. TITLE Direct Submission JOURNAL Submitted (17-OCT-1997) Clinical Immunology, Hannover Medical School, Carl Neuberg St. 1, Hannover D-30625, Germany FEATURES Location/Qualifiers source 1..2890 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" /cell_line="HL60" gene 1..2890 /gene="GPI1" CDS 72..1817 /gene="GPI1" /function="GlcNAc transferase for the first step in glycosyl phosphatidylinositol membrane anchor biosynthesis" /codon_start=1 /product="N-acetylglucosamyl transferase component Gpi1" /db_xref="PID:g2623158" /translation="MVLKAFFPTCCVSADSGLLVGRWVPEQSSAVVLAVLHFPFIPIQ VKQLLAQVRQASQVGVAVLGTWCHCRQEPEESLGRFLESLGAVFPHEPWLRLCRERGG TFWSCEATHRQAPTAPGAPGEDQVMLIFYDQRQVLLSQLHLPTVLPDRQAGATTASTG GLAAVFDTVARSEVLFRSDRFDEGPVRLSHWQSEGVEASILAELARRASGPICLLLAS LLSLVSAVSACRVFKLWPLSFLGSKLSTCEQLRHRLEHLTLIFSTRKAENPAQLMRKA NTVASVLLDVALGLMLLSWLHGRSRIGHLADALVPVADHVAEELQHLLQWLMGAPAGL KMNRALDQVLGRFFLYHIHLWISYIHLMSPFVEHILWHVGLSACLGLTVALSLLSDII ALLTFHIYCFYVYGARLYCLKIHGLSSLWRLFRGKKWNVLRQRVDSCSYDLDQLFIGT LLFTILLFLLPTTALYYLVFTLLRLLVVAVQGLIHLLVDLINSLPLYSLGLRLCRPYR LAAGVKFRVLRHEASRPLRLLMQINPLPYSRVVHTYRLPSCGCHPKHSWGALCRKLFL GELIYPWRQRGDKQD" BASE COUNT 435 a 1005 c 907 g 543 t ORIGIN 1 catcggggtc cccaacccca tccggacccc gccgcccgag cgcgcggccc cggaagcacc 61 cgcctcccgg catggtgctc aaggccttct tccccacgtg ctgcgtctcg gcggacagcg 121 ggctgctggt gggacggtgg gtgccggagc agagcagcgc cgtggtcctg gcggtcctgc 181 actttccctt catccccatc caggtcaagc agctcctggc ccaggtgcgg caggccagcc 241 aggtgggcgt ggccgtgctg ggcacctggt gccactgccg gcaggagccc gaggagagcc 301 tgggccgctt cctggagagc ctgggtgctg tcttccccca tgagccctgg ctgcggctgt 361 gccgggagag aggcggcacg ttctggagct gcgaggccac ccaccggcaa gcgcccactg 421 cccccggtgc ccctggtgag gaccaggtca tgctcatctt ctatgaccag cgccaggtgt 481 tgctgtcaca gctacacctg cccaccgtcc tgcccgaccg ccaggctgga gccaccactg 541 ccagcacggg gggcctggct gccgtcttcg acacggtagc acgcagtgag gtgctcttcc 601 gcagtgaccg ctttgatgag ggccccgtgc ggctgagcca ctggcagtcg gagggcgtgg 661 aggccagcat cctcgcggag ctggccaggc gagcctcggg acccatttgt ctgctgttgg 721 ccagcctgct gtcgctggtc tcagctgtca gtgcctgccg agtgttcaag ctctggcccc 781 tgtccttcct cgggagcaaa ctctccacgt gcgaacagct ccggcaccgg ctggagcacc 841 tcacgctaat cttcagtaca cggaaggcgg agaaccctgc ccagctgatg aggaaggcca 901 acacggtggc ctctgtgctg ctggacgtgg ccctgggcct catgctgctg tcctggctcc 961 acgggagaag ccgcatcggg catctggccg acgccctcgt tcctgtggct gaccacgtgg 1021 ccgaggagct ccagcatctg ctgcagtggc tgatgggtgc tcccgccggg ctcaagatga 1081 accgtgcact ggaccaggtg ctgggccgct tcttcctcta ccacatccac ctgtggatca 1141 gctacatcca cctcatgtcc cccttcgtgg agcacatcct ttggcacgtg ggcctctcgg 1201 cctgcctggg cctgacggtg gccctgtccc tcctctcgga cattatcgcc ctcctcacct 1261 tccacatcta ctgcttttac gtctatggag ccaggctgta ctgcctgaag atccatggcc 1321 tgtcctcact gtggcgtctg ttccggggga agaagtggaa cgttctgcgc cagcgcgtgg 1381 actcctgttc ctatgacctg gaccagctgt tcatcgggac tctgctcttc accatcctgc 1441 tcttcctcct gcctaccaca gccctgtact acctggtgtt caccctgctc cggctcctgg 1501 tggtcgccgt gcagggcctg atccatctgc tggtggacct catcaactcc ctgccgctgt 1561 actcactggg tcttcggctc tgccggccct acaggctggc ggctggcgtg aagttccgtg 1621 tcctccggca cgaggccagc aggcccctcc gcctcctgat gcagataaac ccactgccct 1681 acagccgcgt ggtgcacacc taccgcctcc ccagctgtgg ctgccacccc aagcactcct 1741 ggggcgccct gtgccgcaag ctgttccttg gggagctcat ctacccctgg aggcagagag 1801 gggacaagca ggactgaggg aactgctggc tcgcctggca ccaccacacg gccacagcca 1861 gccatctgct ctgccagggt ggcaccagct cagctggcgc atgtcccgtg ctttgtggac 1921 gctgctgtgt gctcctgaac acggcaggcc ctgctatcac accttgggct tggaggtcat 1981 tgggagtgag cagatgtggg ggtggccagc caggctggcc gcactccatc actggcactg 2041 cctgccttgg gacccgcttc ccacctgctg cggtcaccat ggtggcgagc acagcaaccc 2101 caggtgtcca gagcactgcc ccatgcccac cctgcatacc caggtccaga gggtccgtcc 2161 accacagcag ccccaggtgg agggctggtc tccctggggg ctccccagtg gctctgccct 2221 ggctgtgggg gtggagggac cttgccagga tgaaccctcc agtcccaggc accctctagc 2281 tccctcagcc gaacagcacc ctgcatctgg gggattgaag cagtcgctga cccccgtccc 2341 cagcgggccc gggccctcac tccctgaacc acacggggtt tatttgcgga tgttccctgg 2401 agaggtcgct ttgtgaagaa accatcagca ggctgtgagc atcgccaggc tgctgtgggg 2461 gcgggagcag cctcagtgtc aagggcctgc ccactgaccc agccgtacct attcgtccac 2521 ggtgccccgt agcagcaggt cctgcggcca aatctgtctc ccttcatggg cctcccaggg 2581 aaggaggaag ccctgctgtg cagacacctc tgtggccccc caggggtgtg agcggcctgg 2641 ggagggggcc gtggcactga ggccgaaagt gcctgccaga cggcacggtc tgggtgcggg 2701 tgttccctgt gagcccgagt ccgcttcagg aggggagcct gcaggtgccg gctggtgagg 2761 ggatgacgcg ctgtgggtgg gaggaggcag cgcccatctc agcagcacca ggactgcctg 2821 ggactccctg gcaacccagc accggggaag ccgtcagctg ctgtgacaat aaaacctgcc 2881 ccgtgtctgg // LOCUS AF030234 5298 bp mRNA PRI 30-JAN-1998 DEFINITION Homo sapiens splicing factor Sip1 mRNA, complete cds. ACCESSION AF030234 NID g2822459 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5298) AUTHORS Zhang,W.J. and Wu,J.Y. TITLE Sip1, a novel RS domain-containing protein essential for pre-mRNA splicing JOURNAL Mol. Cell. Biol. 18 (2), 676-684 (1998) MEDLINE 98107652 REFERENCE 2 (bases 1 to 5298) AUTHORS Wu,J.Y., Zhang,W. and Bookout,J.T. TITLE Direct Submission JOURNAL Submitted (16-OCT-1997) Pediatrics/Molecular Biology & Pharmacology, Washington University School of Medicine, One Children's Place, St. Louis, MO 63110, USA FEATURES Location/Qualifiers source 1..5298 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 1181..4627 /codon_start=1 /product="splicing factor Sip1" /db_xref="PID:g2822460" /translation="MTTPTRRSTRNTRAETASQSQRSPISDNSGCDAPGNSNPSLSVP SSAESEKQTRQAPKRKSVRRGRKPPLLKKKLRSSVAAPEKSSSNDSVDEETAESDTSP VLEKEHQPDVDSSNICTVQTHVENQSANCLKSCNEQIEESEKHTANYDTEERVGSSSS ESCAQDLPVLVGEEGEVKKLENTGIEANVLCLESEISENILEKGGDPLEKQDQISGLS QSEVKTDVCTVHLPNDFPTCLTSESKVYQPVSCPLSDLSENVESVVNEEKITESSLVE ITEHKDFTLKTEELIESPKLESSEGEIIQTVDRQSVKSPEVQLLGHVETEDVEIIATC DTFGNEDFNNIQDSENNLLKNNLLNTKLEKSLEEKNESLTEHPRSTELPKTHIEQIQK HFSEDNNEMIPMECDSFCSDQNESEVEPSVNADLKQMNENSVTHCSENNMPSSDLADE KVETVSQPSESPKDTIDKTKKPRTRRSRFHSPSTTWSPNKDTPQEKKRPQSPSPRRET GKESRKSQSPSPKNESARGRKKSRSQSPKKDIARERRQSQSRSPKRDTTRESRRSESL SPRRETSRENKRSQPRVKDSSPGEKSRSQSRERESDRDGQRRERERRTRKWSRSRSHS RSPSRCRTKSKSSSFGRIDRDSYSPRWKGRWANDGWRCPRGNDRYRKNDPEKQNENTR KEKNDIHLDADDPNSADKHRNDCPNWITEKINSGPDPRTRNPEKLKESHWEENRNENS GNSWNKNFGSGWVSNRGRGRGNRGRGTYRSSFAYKDQNENRWQNRKPLSGNSNSSGSE SFKFVEQQSYKRKSEQEFSFDTPADRSGWTSASSWAVRKTLPADVQNYYSRRGRNSSG PQSGWMKQEEETSGQDSSLKDQTNQQVDGSQLPINMMQPQMNVMQQQMNAQHQPMNIF PYPVGVHAPLMNIQRNPFNIHPQLPLHLHTGVPLMQVATPTSVSQGLPPPPPPPPPSQ QVNYIASQPDGKQLQGIPSSSHVSNNMSTPVLPAPTAAPGNTGMVQGPSSGNTSSSSH SKASNAAVKLAESKVSVAVEASADSSKTDKKLQIQEKAAQEVKLAIKPFYQNKDITKE EYKEIVRKAVDKVCHSKSGEVNSTKVANLVKAYVDKYKYSRKGSQKKTLEEPVSTEKN IG" BASE COUNT 1906 a 992 c 1110 g 1290 t ORIGIN 1 tggcttaagc cgcgcggagc agcgcaacct gggtcgctcc ctgcttcgcc gccgcctccg 61 gaccgagcca gcggagtcag tgtcctagag accctgtaac accacaaagc ggacgaagga 121 gtccatgttg gggaacttgg cagcggagtg actgggacct gggaacctac tgtggggccg 181 cggccggacc gagcgcctcg acctcggtct gaggaaaccc ttttccaaag agaaatgaag 241 aagaaaactg tatgtaccct aaatatggga gataagaagt atgaagacat ggaaggtgaa 301 gaaaacggag ataatactat ttccactggt ctgttgtaca gtgaggctga cagatgccca 361 atatgtctta attgtctatt agaaaaggaa gttggttttc cagaaagctg taatcatgtg 421 cttctgtatg acttgtattc ttaaatgggc agagacactg gcttcatgtc ctattgaccg 481 taaacctttt caggcagtgt ttaaattcag tgcattggaa ggttatgtta aggttcaagt 541 aaaaaaacag ctgagagaaa caaaagacaa gaaaaatgaa aactcatttg agaaacaggt 601 ctcctgtcat gaaaattcta aaagctgtat aagaagaaaa gccatcgtaa gagaagatct 661 attaagtgca aaagtttgtg acttgaagtg gatacataga aactctttat acagtgaaac 721 aggaggaaag aaaaatgcag caataaagat aaataagcct cagagatcaa attggagtac 781 aaatcagtgc ttcagaaatt ttttctccaa tatgttttct tctgttagcc actctggaga 841 atcttccttt acctatagag cttattgtac agaatttata gaagccagtg aaatcagtgc 901 attgattagg cagaagagac atgaactgga attgtcatgg tttcctgata cattacctgg 961 aattggaaga attggtttta taccctggaa tgttgaaaca gaagtccttc ctctcatttc 1021 ttctgtgttg ccaagaacta tttttccaac aagtaccata tctttcgaac attttggtac 1081 ttcttgcaag ggatatgcat tagcacatac tcaagaaggg gaagaaaaga agcaaacttc 1141 tggtacatca aataccagag gatcaagacg aaaacctgca atgacaactc ctacaaggag 1201 gtctacacgt aacacaagag ctgaaacagc cagtcagtct cagagatccc caatatcaga 1261 caattctggg tgtgatgccc caggtaacag taatccatct ttaagtgttc cctcttcagc 1321 tgagtcagaa aagcaaacaa gacaggctcc aaaacggaag tctgtaagaa gaggaagaaa 1381 accaccttta ctgaaaaaga aacttcggag ctctgtagct gcccctgaaa aatcatcttc 1441 caatgattca gtagatgaag aaacagcaga atctgacaca tcacctgtgt tagaaaaaga 1501 gcaccaacca gatgtagaca gtagtaacat ttgtactgtg cagactcatg tagaaaacca 1561 gtctgctaat tgcttgaaaa gttgcaatga gcaaatagaa gaaagtgaga agcatactgc 1621 aaattatgat acagaggaaa gagtaggatc ttcatcttct gagtcttgtg ctcaagatct 1681 tcctgtgcta gttggtgagg aaggggaagt taaaaaactc gagaatacag gtatagaggc 1741 taatgttttg tgtttggaaa gtgagatttc tgaaaatatt cttgaaaaag gaggtgatcc 1801 attggaaaag caagaccaga tatctggact ttcacaatca gaggtaaaga cagatgtatg 1861 tacagttcat cttccaaatg attttcctac atgtttaaca tctgaaagca aagtgtacca 1921 acctgtatct tgtcccctaa gtgacttatc tgagaatgta gagtcagtgg ttaatgaaga 1981 aaaaataaca gagagttccc tagtagaaat tactgaacat aaagatttta cactaaaaac 2041 agaggagctt atagagagcc ccaagttaga atcttctgag ggtgaaatta tacagacagt 2101 ggacagacaa tctgttaaga gcccagaggt tcaattgctt gggcatgttg aaactgaaga 2161 tgtagaaata attgcaacat gtgatacttt tgggaatgaa gatttcaata atattcaaga 2221 ctctgaaaat aacttactaa aaaataatct tctgaacacc aaattggaaa aatctttaga 2281 agaaaagaat gaatcgctga ccgaacatcc tagatctaca gagttgccta aaacacacat 2341 tgaacagatt cagaagcatt ttagtgagga caacaatgaa atgataccta tggagtgtga 2401 ttcattttgc agtgaccaaa atgaatctga agttgaacca tctgtaaatg ctgatcttaa 2461 acaaatgaat gaaaattctg tgacacactg ttctgaaaat aatatgccgt cttctgatct 2521 tgcggatgaa aaggttgaaa ctgtttctca accatctgaa agcccaaaag ataccataga 2581 taaaaccaaa aagcctcgta ctcgaagatc tagatttcat tctccatcta caacttggtc 2641 acccaacaaa gacactccac aagaaaagaa gcggccccag tctccatctc ccagaagaga 2701 aactgggaaa gaaagcagga agtctcaatc accatctcct aagaatgagt cagccagagg 2761 ccggaaaaaa tcccgttctc agtccccaaa aaaggatatt gcaagagaaa ggaggcaatc 2821 tcagtctcgg tctccaaaaa gggatactac tagggaaagc agaagatctg aatcactgtc 2881 cccaagaaga gaaacttcta gagagaacaa aagatctcag ccaagagtga aagattcttc 2941 cccaggagaa aaatccaggt cccagagcag agaacgagaa agtgatagag atgggcagag 3001 gagagagaga gaaaggagaa ccagaaagtg gtctaggtcc agatctcatt ctaggtcccc 3061 ctcaagatgt agaacaaaaa gtaagagttc atcatttggt agaattgaca gagatagtta 3121 ctctccccgg tggaagggaa gatgggcaaa tgatggttgg agatgtccac gaggaaatga 3181 tcggtacaga aagaatgacc cagagaaaca gaatgaaaat acaagaaaag aaaaaaatga 3241 catccatcta gatgctgatg atccaaattc tgctgacaaa catagaaatg actgtcccaa 3301 ttggataaca gaaaaaataa actctgggcc tgatccaaga accagaaatc cagaaaagtt 3361 gaaagagtct cattgggaag aaaatagaaa tgaaaattca ggaaattctt ggaataaaaa 3421 ctttggttct ggttgggtat ctaaccgtgg tagaggcaga ggcaaccgtg gcagaggcac 3481 ttacagaagt agttttgcct ataaagatca gaatgaaaat cggtggcaaa atcgaaaacc 3541 cctctcaggg aattcaaaca gttcagggag tgaatctttc aagtttgtgg aacagcaatc 3601 ctataagcga aaaagtgaac aggagttctc atttgataca ccagcagata gatctggatg 3661 gacatctgca tccagctggg ccgtgagaaa gactttgcca gcagatgtac aaaactacta 3721 ctcacgacga ggcagaaatt cttcaggtcc acagtctgga tggatgaaac aagaggagga 3781 aacatctgga caggattcta gcctaaaaga ccaaacaaac cagcaagttg atggttctca 3841 gctacctata aatatgatgc aaccgcaaat gaatgtaatg cagcaacaaa tgaatgcaca 3901 acaccagcct atgaatatct tcccatatcc agtgggtgtt catgctcctt tgatgaacat 3961 ccaacgcaat ccatttaaca ttcatcctca gctacccttg catctccaca caggagtgcc 4021 cctcatgcag gtagccactc ctaccagtgt atctcaggga ctaccaccac caccaccccc 4081 tcccccacca tcccaacaag tcaactacat tgcttcacaa ccagatggaa agcaattgca 4141 gggtattcct agttcttctc atgtaagtaa taacatgagt acaccagttt tgcctgctcc 4201 gacagcagcc ccaggaaata cgggaatggt tcagggacca agttctggta atacttcgtc 4261 atcaagtcac agcaaagcct ctaatgctgc tgtaaaattg gcagaaagca aagtaagtgt 4321 tgcagtggaa gccagcgcag atagctcgaa gacagacaag aaattgcaaa ttcaagaaaa 4381 agcagcacaa gaggtaaaat tggccatcaa gccattttac caaaataaag atatcaccaa 4441 ggaagaatat aaagaaattg tacggaaagc agtagataaa gtttgtcata gtaagagtgg 4501 agaagtaaat tctactaaag tggcaaatct ggttaaagcc tatgtagaca aatacaaata 4561 ttcacggaag gggagccaaa agaaaactct ggaagaacct gtgtctactg aaaaaaacat 4621 aggctgaaat ggggaacgct gtcaaggaca ttatcaggat atctgcaaag tgcaatttca 4681 acatgtacca ttaactgaaa atcatacata actgtgattg aaatttggtt ttgataaaat 4741 tattttttta acataggata tgatgttttg ttctaaataa atataggtct gcactgcaac 4801 ttctgtatcc ttccttcccc tccaccctcc cccacaaaat tcaagggaaa gtaaagggtt 4861 taaaggaatg tgcatcttta ctaggactgt gttatagtgt ggatactgga aaatgtatag 4921 ctttttgatt agggcaatgg agtgcataaa ttagaaactt tctaagtgca ctggttttca 4981 aagagatata tataatgcat ttattctgtc aggttaaaat ataaagtatg atctttatga 5041 ttttttccct ctaattatag aaagttaaat aatgtattac catgaaaaat gtttctaata 5101 ttaaatagaa catatcagtt gcaaagttcc taatgtgtat ttttaaagca catatctgaa 5161 taaattgcct agatagaaaa aaaattatca tcgagtaaaa tttagtgttc aaaacattga 5221 gacactcttc acctattgta tgaccaaata aaggttatgc tgcttgttaa aaaaaaaaaa 5281 aaaaaaaaaa aaaaaaaa // LOCUS AF030297 1936 bp mRNA PRI 01-JAN-1998 DEFINITION Homo sapiens G protein coupled receptor (RDC1) mRNA, complete cds. ACCESSION AF030297 NID g2736281 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1936) AUTHORS Bi,A., Yu,L., Zhang,Q., Tu,Q., Xing,Y. and Zheng,L. TITLE Human RDC1 gene JOURNAL Unpublished REFERENCE 2 (bases 1 to 1936) AUTHORS Bi,A. TITLE Direct Submission JOURNAL Submitted (16-OCT-1997) Institute of Genetics, Biology Science, 220# Handan Road, Shanghai, Shanghai, 200433, China FEATURES Location/Qualifiers source 1..1936 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..1936 /gene="RDC1" CDS 63..1151 /gene="RDC1" /codon_start=1 /evidence=experimental /product="G protein coupled receptor" /db_xref="PID:g2736282" /translation="MDLHLFDYSEPGNFSDISWPCNSSDCIVVDTVMCPNMPNKSVLL YTLSFIYIFIFVIGMIANSVVVWVNIQAKTTGYDTHCYILNLAIADLWVVLTIPVWVV SLVQHNQWPMGELTCKVTHLIFSINLFSSIFFLTCMSVDRYLSITYFTNTPSSRKKMV RRVVCILVWLLAFCVSLPDTYYLKTVTSASNNETYCRSFYPEHSIKEWLIGMELVSVV LGFAVPFSIIAVFYFLLARAISASSDQEKHSSRKIIFSYVVVFLVCWLPYHVAVLLDI FSILHYIPFTCRLEHALFTALHVTQCLSLVHCCVNPVLYSFINRNYRYELMKAFIFKY SAKTGLTKLIDASRVSETEYSALEQSTK" polyA_signal 1917..1922 /gene="RDC1" BASE COUNT 441 a 491 c 449 g 555 t ORIGIN 1 caaagtgctc agcataaggg agccagcgca cagacagcca ggaagggagc cgcctcagaa 61 cgatggatct gcatctcttc gactactcag agccagggaa cttctcggac atcagctggc 121 catgcaacag cagcgactgc atcgtggtgg acacggtgat gtgtcccaac atgcccaaca 181 aaagcgtcct gctctacacg ctctccttca tttacatttt catcttcgtc atcggcatga 241 ttgccaactc cgtggtggtc tgggtgaata tccaggccaa gaccacaggc tatgacacgc 301 actgctacat cttgaacctg gccattgccg acctgtgggt tgtcctcacc atcccagtct 361 gggtggtcag tctcgtgcag cacaaccagt ggcccatggg cgagctcacg tgcaaagtca 421 cacacctcat cttctccatc aacctcttca gcagcatttt cttcctcacg tgcatgagcg 481 tggaccgcta cctctccatc acctacttca ccaacacccc cagcagcagg aagaagatgg 541 tacgccgtgt cgtctgcatc ctggtgtggc tgctggcctt ctgcgtgtct ctgcctgaca 601 cctactacct gaagaccgtc acgtctgcgt ccaacaatga gacctactgc cggtccttct 661 accccgagca cagcatcaag gagtggctga tcggcatgga gctggtctcc gttgtcttgg 721 gctttgccgt tcccttctcc attatcgctg tcttctactt cctgctggcc agagccatct 781 cggcgtccag tgaccaggag aagcacagca gccggaagat catcttctcc tacgtggtgg 841 tcttccttgt ctgctggttg ccctaccacg tggcggtgct gctggacatc ttctccatcc 901 tgcactacat ccctttcacc tgccggctgg agcacgccct cttcacggcc ctgcatgtca 961 cacagtgcct gtcgctggtg cactgctgcg tcaaccctgt cctctacagc ttcatcaatc 1021 gcaactacag gtacgagctg atgaaggcct tcatcttcaa gtactcggcc aaaacagggc 1081 tcaccaagct catcgatgcc tccagagtct cagagacgga gtactctgcc ttggagcaga 1141 gcaccaaatg atctgccctg gagaggctct gggacgggtt tacttgtttt tgaacagggt 1201 gatgggccct atggttttct agagcaaagc aaagtagctt cgggtcttga tgcttgagta 1261 gagtgaagag gggagcacgt gccccctgca tccatttctc tttctcttga tgacgcagct 1321 gtcatttggc tgtgcgtgct gacagttttg caacaggcag agctgtgtcg cacagcagtg 1381 ctgtgcgtca gagccagctg aggacaggct tgcctggact tctgtaagat aggattttct 1441 gtgtttcctg aattttttat atggtgattt gtatttaaat tttaagactt tattttctca 1501 ctattggtgt accttataaa tgtatttgaa agttaaatat attttaaata ttgtttggga 1561 ggcatagtgc tgacatatat tcagagtgtt gtagttttaa ggttagcgtg actttcagtt 1621 ttgactaagg atgacactaa ttgttagctg ttttgaaatt atatatatat aaatatataa 1681 atatatgcca gtcttggctg aaatgtttta tttaccatag ttttatatct gtgtggtgtt 1741 ttgtaccggc acgggatatg gaacgaaaac tgctttgtaa tgcagtttgt gacattaata 1801 gtattgtaaa gttacatttt aaaataaaca aaaaactgtt ctggactgca aatctgcaca 1861 cacaacgaac agttgcattt cagagagttc tctcaatttg taagttattt ttttttaata 1921 aagatttttg tttcct // LOCUS AF030335 1116 bp mRNA PRI 11-DEC-1997 DEFINITION Homo sapiens purinergic P2Y11 receptor (P2Y11) mRNA, complete cds. ACCESSION AF030335 NID g2674119 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1116) AUTHORS Communi,D., Govaerts,C., Parmentier,M. and Boeynaems,J.M. TITLE Cloning of a human purinergic P2Y receptor coupled to phospholipase C and adenylyl cyclase JOURNAL J. Biol. Chem. (1997) In press REFERENCE 2 (bases 1 to 1116) AUTHORS Communi,D. TITLE Direct Submission JOURNAL Submitted (18-OCT-1997) I.R.I.B.H.N., U.L.B., Route de Lennik 808, Brussels 1070, Belgium FEATURES Location/Qualifiers source 1..1116 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="cDNA library kindly provided by Pr. P. Chambon (Strasbourg, France)" gene 1..1116 /gene="P2Y11" CDS 1..1116 /gene="P2Y11" /note="G protein-coupled receptor; purinergic P2Y receptor; provisionally called P2Y11" /codon_start=1 /product="purinergic P2Y11 receptor" /db_xref="PID:g2674120" /translation="MDRGAKSCPANFLAAADDKLSGFQGDFLWPILVVEFLVAVASNG LALYRFSIRKQRPWHPAVVFSVQLAVSDLLCALTLPPLAAYLYPPKHWRYGEAACRLE RFLFTCNLLGSVIFITCISLNRYLGIVHPFFARSHLRPKHAWAVSAAGWVLAALLAMP TLSFSHLKRPQQGAGNCSVARPEACIKCLGTADHGLAAYRAYSLVLAGLGCGLPLLLT LAAYGALGRAVLRSPGMTVAEKLRVAALVASGVALYASSYVPYHIMRVLNVDARRRWS TRCPSFADIAQATAALELGPYVGYQVMRGLMPLAFCVHPLLYMAAVPSLGCCCRHCPG YRDSWNPEDAKSTGQALPLNATAAPKPSEPQSRELSQ" BASE COUNT 171 a 404 c 344 g 197 t ORIGIN 1 atggatcgag gtgccaagtc ctgccctgcc aacttcttgg cagctgccga cgacaaactc 61 agtgggttcc agggggactt cctgtggccc atactggtgg ttgagttcct ggtggccgtg 121 gccagcaatg gcctggccct gtaccgcttc agcatccgga agcagcgccc atggcacccc 181 gccgtggtct tctctgtcca gctggcagtc agcgacctgc tctgcgctct gacgctgccc 241 ccgctggccg cctacctcta tccccccaag cactggcgct atggggaggc cgcgtgccgc 301 ctggagcgct tcctcttcac ctgcaacctg ctgggcagcg tcatcttcat cacctgcatc 361 agcctcaacc gctacctggg catcgtgcac cccttcttcg cccgaagcca cctgcgaccc 421 aagcacgcct gggccgtgag cgctgccggc tgggtcctgg ccgccctgct ggccatgccc 481 acactcagct tctcccacct gaagaggccg cagcaggggg cgggcaactg cagcgtggcc 541 aggcccgagg cctgcatcaa gtgtctgggg acagcagacc acgggctggc ggcctacaga 601 gcgtatagcc tggtgctggc ggggttgggc tgcggcctgc cgctgctgct cacgctggca 661 gcctacggcg ccctcgggcg ggccgtgcta cgcagcccag gcatgactgt ggccgagaag 721 ctgcgtgtgg cagcgttggt ggccagtggt gtggccctct acgccagctc ctatgtgccc 781 taccacatca tgcgggtgct caacgtggat gctcggcggc gctggagcac ccgctgcccg 841 agctttgcag acatagccca ggccacagca gccctggagc tggggcccta cgtgggctac 901 caggtgatgc ggggcctcat gcccctggcc ttctgtgtcc accctctact ctacatggcc 961 gcagtgccca gcctgggctg ctgctgccga cactgccccg gctacaggga cagctggaac 1021 ccagaggacg ccaagagcac tggccaagcc ctgcccctca atgccacagc cgcccctaaa 1081 ccgtcagagc cccagtcccg tgagctgagc caatga // LOCUS AF030424 1568 bp mRNA PRI 16-NOV-1997 DEFINITION Homo sapiens histone acetyltransferase 1 mRNA, complete cds. ACCESSION AF030424 NID g2623155 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1568) AUTHORS Verreault,A., Kaufman,P.D., Kobayashi,R. and Stillman,B. TITLE Nucleosomal DNA regulates the core histone-binding subunit of the human Hat1 acetyltransferase JOURNAL Unpublished REFERENCE 2 (bases 1 to 1568) AUTHORS Verreault,A., Kaufman,P.D., Kobayashi,R. and Stillman,B. TITLE Direct Submission JOURNAL Submitted (17-OCT-1997) Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, NY 11724-0100, USA FEATURES Location/Qualifiers source 1..1568 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="2" /map="2q31.2-q33.1" /clone_lib="teratocarcinoma cDNA library; Soares infant brain 1NIB cDNA library" CDS 37..1296 /function="acetylates newly synthesized histone H4 at lysines 5 and 12" /note="hat1" /codon_start=1 /product="histone acetyltransferase 1" /db_xref="PID:g2623156" /translation="MAGFGAMEKFLVEYKSAVEKKLAEYKCNTNTAIELKLVRFPEDL ENDIRTFFPEYTHQLFGDDETAFGYKGLKILLYYIAGSLSTMFRVEYASKVDENFDCV EADDVEGKIRQIIPPGFCTNTNDFLSLLEKEVDFKPFGTLLHTYSVLSPTGGENFTFQ IYKADMTCRGFREYHERLQTFLMWFIETASFIDVDDERWHYFLVFEKYNKDGATLFAT VGYMTVYNYYVYPDKTRPRVSQMLILTPFQGQGHGAQLLETVHRYYTEFPTVLDITAE DPSKSYVKLRDFVLVKLCQDLPCFSREKLMQGFNEDMAIEAQQKFKINKQHARRVYEI LRLLVTDMSDAEQYRSYRLDIKRRLISPYKKKQRDLAKMRKCLRPEELTNQMNQIEIS MQHEQLEESFQELVEDYRRVIERLAQE" BASE COUNT 512 a 263 c 326 g 467 t ORIGIN 1 cgtccttcct cagccgcggg tgatcgtagc tcggaaatgg cgggatttgg tgctatggag 61 aaatttttgg tagaatataa gagtgcagtg gagaagaaac tggcagagta caaatgtaac 121 accaacacag caattgaact aaaattagtt cgttttcctg aagatcttga aaatgacatt 181 agaactttct ttcctgagta tacccatcaa ctctttgggg atgatgaaac tgcttttggt 241 tacaagggtc taaagatcct gttatactat attgctggta gcctgtcaac aatgttccgt 301 gttgaatatg catctaaagt tgatgagaac tttgactgtg tagaggcaga tgatgttgag 361 ggcaaaatta gacaaatcat tccacctgga ttttgcacaa acacgaatga tttcctttct 421 ttactggaaa aggaagttga tttcaagcca ttcggaacct tacttcatac ctactcagtt 481 ctcagtccaa caggaggaga aaactttacc tttcagatat ataaggctga catgacatgt 541 agaggctttc gagaatatca tgaaaggctt cagacctttt tgatgtggtt tattgaaact 601 gctagcttta ttgacgtgga tgatgaaaga tggcactact ttctagtatt tgagaagtat 661 aataaggatg gagctacgct ctttgcgacc gtaggctaca tgacagtcta taattactat 721 gtgtacccag acaaaacccg gccacgtgta agtcagatgc tgattttgac tccatttcaa 781 ggtcaaggcc atggtgctca acttcttgaa acagttcata gatactacac tgaatttcct 841 acagttcttg atattacagc ggaagatcca tccaaaagct atgtgaaatt acgagacttt 901 gtgcttgtga agctttgtca agatttgccc tgtttttccc gggaaaaatt aatgcaagga 961 ttcaatgaag atatggcgat agaggcacaa cagaagttca aaataaataa gcaacacgct 1021 agaagggttt atgaaattct tcgactactg gtaactgaca tgagtgatgc cgaacaatac 1081 agaagctaca gactggatat taaaagaaga ctaattagcc catataagaa aaagcagaga 1141 gatcttgcta agatgagaaa atgtctcaga ccagaagaac tgacaaacca gatgaaccaa 1201 atagaaataa gcatgcaaca tgaacagctg gaagagagtt ttcaggaact agtggaagat 1261 taccggcgtg ttattgaacg acttgctcaa gagtaaagat tatactgctc tgtacaggaa 1321 gcttgcaaat tttctgtaca atgtgctgtg aaaaatctga tgactttaat tttaaaatct 1381 tgtgacattt tgcttatact aaaagttatc tatctttagt tgaatatttt cttttggaga 1441 gattgtatat tttaaaatac tgtttagagt ttatgagcat atattgcatt taaagaaaga 1501 taaagcttct gaaatactac tgcaattgct tcccttctta aacagtataa taaatgctta 1561 gttgtgat // LOCUS AF030880 4930 bp mRNA PRI 01-DEC-1997 DEFINITION Homo sapiens pendrin (PDS) mRNA, complete cds. ACCESSION AF030880 NID g2654004 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4930) AUTHORS Everett,L.A., Glaser,B., Beck,J.C., Idol,J.R., Buchs,A., Heyman,M., Adawi,F., Hazani,E., Nassir,E., Baxevanis,A.D., Sheffield,V.S. and Green,E.D. TITLE Pendred syndrome is caused by mutations in a putative sulphate transporter gene (PDS) JOURNAL Nature Genet. 17 (4), 411-422 (1997) MEDLINE 98061089 REFERENCE 2 (bases 1 to 4930) AUTHORS Everett,L.A., Glaser,B., Beck,J.C., Idol,J.R., Buchs,A., Heyman,M., Adawi,F., Hazani,E., Nassir,E., Baxevanis,A.D., Sheffield,V.S. and Green,E.D. TITLE Direct Submission JOURNAL Submitted (21-OCT-1997) Genome Technology Branch, National Human Genome Research Institute, National Institutes of Health, 49 Convent Drive, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..4930 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" /map="7q22-q31.1" gene 1..4930 /gene="PDS" CDS 225..2567 /gene="PDS" /function="putative sulfate transporter" /note="mutated in Pendred syndrome" /codon_start=1 /product="pendrin" /db_xref="PID:g2654005" /translation="MAAPGGRSEPPQLPEYSCSYMVSRPVYSELAFQQQHERRLQERK TLRESLAKCCSCSRKRAFGVLKTLVPILEWLPKYRVKEWLLSDVISGVSTGLVATLQG MAYALLAAVPVGYGLYSAFFPILTYFIFGTSRHISVGPFPVVSLMVGSVVLSMAPDEH FLVSSSNGTVLNTTMIDTAARDTARVLIASALTLLVGIIQLIFGGLQIGFIVRYLADP LVGGFTTAAAFQVLVSQLKIVLNVSTKNYNGVLSIIYTLVEIFQNIGDTNLADFTAGL LTIVVCMAVKELNDRFRHKIPVPIPIEVIVTIIATAISYGANLEKNYNAGIVKSIPRG FLPPELPPVSLFSEMLAASFSIAVVAYAIAVSVGKVYATKYDYTIDGNQEFIAFGISN IFSGFFSCFVATTALSRTAVQESTGGKTQVAGIISAAIVMIAILALGKLLEPLQKSVL AAVVIANLKGMFMQLCDIPRLWRQNKIDAVIWVFTCIVSIILGLDLGLLAGLIFGLLT VVLRVQFPSWNGLGSIPSTDIYKSTKNYKNIEEPQGVKILRFSSPIFYGNVDGFKKCI KSTVGFDAIRVYNKRLKALRKIQKLIKSGQLRATKNGIISDAVSTNNAFEPDEDIEDL EELDIPTKEIEIQVDWNSELPVKVNVPKVPIHSLVLDCGAISFLDVVGVRSLRVIVKE FQRIDVNVYFASLQDYVIEKLEQCGFFDDNIRKDTFFLTVHDAILYLQNQVKSQEGQG SILETITLIQDCKDTLELIETELTEEELDVQDEAMRTLAS" BASE COUNT 1454 a 937 c 1082 g 1457 t ORIGIN 1 ctcagccttc ccggttcggg aaaggggaag aatgcaggag gggtaggatt tctttcctga 61 taggatcggt tgggaaagac cgcagcctgt gtgtgtcttt cccttcgacc aaggtgtctg 121 ttgctccgta aataaaacgt cccactgcct tctgagagcg ctataaaggc agcggaaggg 181 tagtccgcgg ggcattccgg gcggggcgcg agcagagaca ggtcatggca gcgccaggcg 241 gcaggtcgga gccgccgcag ctccccgagt acagctgcag ctacatggtg tcgcggccgg 301 tctacagcga gctcgctttc cagcaacagc acgagcggcg cctgcaggag cgcaagacgc 361 tgcgggagag cctggccaag tgctgcagtt gttcaagaaa gagagccttt ggtgtgctaa 421 agactcttgt gcccatcttg gagtggctcc ccaaataccg agtcaaggaa tggctgctta 481 gtgacgtcat ttcgggagtt agtactgggc tagtggccac gctgcaaggg atggcatatg 541 ccctactagc tgcagttcct gtcggatatg gtctctactc tgcttttttc cctatcctga 601 catactttat ctttggaaca tcaagacata tctcagttgg accttttcca gtggtgagtt 661 taatggtggg atctgttgtt ctgagcatgg cccccgacga acactttctc gtatccagca 721 gcaatggaac tgtattaaat actactatga tagacactgc agctagagat acagctagag 781 tcctgattgc cagtgccctg actctgctgg ttggaattat acagttgata tttggtggct 841 tgcagattgg attcatagtg aggtacttgg cagatccttt ggttggtggc ttcacaacag 901 ctgctgcctt ccaagtgctg gtctcacagc taaagattgt cctcaatgtt tcaaccaaaa 961 actacaatgg agttctctct attatctata cgctggttga gatttttcaa aatattggtg 1021 ataccaatct tgctgatttc actgctggat tgctcaccat tgtcgtctgt atggcagtta 1081 aggaattaaa tgatcggttt agacacaaaa tcccagtccc tattcctata gaagtaattg 1141 tgacgataat tgctactgcc atttcatatg gagccaacct ggaaaaaaat tacaatgctg 1201 gcattgttaa atccatccca agggggtttt tgcctcctga acttccacct gtgagcttgt 1261 tctcggagat gctggctgca tcattttcca tcgctgtggt ggcttatgct attgcagtgt 1321 cagtaggaaa agtatatgcc accaagtatg attacaccat cgatgggaac caggaattca 1381 ttgcctttgg gatcagcaac atcttctcag gattcttctc ttgttttgtg gccaccactg 1441 ctctttcccg cacggccgtc caggagagca ctggaggaaa gacacaggtt gctggcatca 1501 tctctgctgc gattgtgatg atcgccattc ttgccctggg gaagcttctg gaacccttgc 1561 agaagtcggt cttggcagct gttgtaattg ccaacctgaa agggatgttt atgcagctgt 1621 gtgacattcc tcgtctgtgg agacagaata agattgatgc tgttatctgg gtgtttacgt 1681 gtatagtgtc catcattctg gggctggatc tcggtttact agctggcctt atatttggac 1741 tgttgactgt ggtcctgaga gttcagtttc cttcttggaa tggccttgga agcatcccta 1801 gcacagatat ctacaaaagt accaagaatt acaaaaacat tgaagaacct caaggagtga 1861 agattcttag attttccagt cctattttct atggcaatgt cgatggtttt aaaaaatgta 1921 tcaagtccac agttggattt gatgccatta gagtatataa taagaggctg aaagcgctga 1981 ggaaaataca gaaactaata aaaagtggac aattaagagc aacaaagaat ggcatcataa 2041 gtgatgctgt ttcaacaaat aatgcttttg agcctgatga ggatattgaa gatctggagg 2101 aacttgatat cccaaccaag gaaatagaga ttcaagtgga ttggaactct gagcttccag 2161 tcaaagtgaa cgttcccaaa gtgccaatcc atagccttgt gcttgactgt ggagctatat 2221 ctttcctgga cgttgttgga gtgagatcac tgcgggtgat tgtcaaagaa ttccaaagaa 2281 ttgatgtgaa tgtgtatttt gcatcacttc aagattatgt gatagaaaag ctggagcaat 2341 gcgggttctt tgacgacaac attagaaagg acacattctt tttgacggtc catgatgcta 2401 tactctatct acagaaccaa gtgaaatctc aagagggtca aggttccatt ttagaaacga 2461 tcactctcat tcaggattgt aaagataccc ttgaattaat agaaacagag ctgacggaag 2521 aagaacttga tgtccaggat gaggctatgc gtacacttgc atcctgaaag tgggttcggg 2581 aggtctctat gagcaaggaa tacaagacaa aacttcctca atgcattgac tatttcttca 2641 gactcaaaac actcattctt ttttctatta agccattgaa agagaagcac taagactgct 2701 tctaggcttt atttataaaa taaacacctt atccctaaca tgggcaaaat ggctagaatt 2761 attcagacga tttggcagcg tccagggtaa gctggtgtta taatacgctg ctgatctaca 2821 tcacagattt gctaataatg ttcacgtggg ccctggcata tctctgttca gttagagtga 2881 gtgctgaccc aacagcctct gtggtcaagc gagtcacgaa tgattaatca taaagaaaaa 2941 tcagtttttg actgacctgg atatccatga gctgcactga tcaccatgta aggtcacatt 3001 tagtaaatgc tgaaataaaa tgattaatgc atttatcaat aaaagccttt gaaaatactt 3061 tggataataa attggagttt taaaaatgca aatttgctta gtatctaata atgaagtgtt 3121 attacatata gccggaattg aggatctctt tgatcctgga aatggtttac ctaaaagcta 3181 cagaaccagg ccaatatatt ttgaaatatt gatgcagaca aatgaaataa taaagagatt 3241 ttcatggttt ataaaaatct tttttgatat gataataatc atgatcacaa ctgagatcaa 3301 aaaaatatat gacagattat tttgtttaaa aatgcagttt taattatctt agtctataga 3361 aatgatcatt gcatggaggc atgtataggt atgatctgtg taaaatctga cataaaaaca 3421 gtgctattct gagtgaaaat ttttttgatg tgcttacata accatggtga ttaaaatgag 3481 tttatatttt ttctcaaaaa ttttagcagt gtgtaaagta agtaatcttt aactgaactc 3541 tgaccactta aaaaaaaatc taaaaattga actacctata gtagtctgtg tttaaagtga 3601 atttttaaag acaaagcatt ctaaatgaac tcaatataaa aacattcatt tggaatgtac 3661 atactgaaaa atacaggttt ttttgaccaa aagtttttat atcttttctt tttatttatt 3721 tttttcctaa gtgccaacaa ttttctagat attatataca acacaggctt tgatcttggg 3781 gacttttccc atatatttca cactggagtg aatgaagttg tacttcattt ctagagaaaa 3841 gttataccca ggtccccaat tgagaatgtc ttgcttgatt gaaaacgaca tcatcccttg 3901 gtatactcca gggattggtt tcaggacccc tgcatttacc aaaatttgtg cacactcaag 3961 tcctgcagtc acccctgcct aaagatagaa tggcttctct gtttttcttc tgaaatacaa 4021 ccagaaacaa tgtgtctatt tctgaaagaa taggattaat gatcatacaa atgggttaat 4081 cctgaattct ggttgtaaat ctggttacag cataactagg attataatgc tgcctcattt 4141 tcacagcact acttgcttat attgacaaca aatcatctcg ctaaagagtg aatgtaggcc 4201 aggcgcggtg gctcatgcct gtaatcccag cactttggga ggccgaggcg ggtggatcac 4261 gaggtcagga gatcgagacc atcctggcta acatggtaaa accccgtctc tactaaaaat 4321 agaaaaaaag aaattagcct agcgtggtgg ctggcgggcg cctgtagtcc cagctatttg 4381 ggaggctaag gcaggagaat ggcgtgaacc cgggaggcgg agcttgcagt gagccgaggt 4441 cgtgccactg cactccagcc tgggcgacag agcaagactc cgtctcaaaa aaaaaaaaaa 4501 aaaaaaaaaa agagtgaatg taatagtctt gcagaaaatg aatgaatacc tttgttcaat 4561 aaaggaaata tgcactgctc acttttttga aggaaatgcc aaagttacgt tttacaacaa 4621 ggctagagtt tgtaaattct gggttcattt gtgatgacat aagtcagcaa actgcgggaa 4681 tactgtctct tctatgtatt ttgtgaatag taagcataat tttagttttg tattatcaat 4741 gaaaatttca cttgaaatta aagctgcctt ttgttatatt tttaacctat aggataagat 4801 tccagtattg tatatgagtt ttaacaaatt aaaaaatcaa atcatgtaca tttgaaaata 4861 tttgcacaca tttaaaaata aatgtaaagt tgtcttttaa actactcgga tgtgtccttt 4921 ctgaacaaaa // LOCUS AF031136 1116 bp mRNA PRI 19-NOV-1997 DEFINITION Homo sapiens 1C7 precursor, mRNA, alternatively spliced, complete cds. ACCESSION AF031136 NID g2623872 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1116) AUTHORS Nalabolu,S.R., Shukla,H., Nallur,G., Parimoo,S. and Weissman,S.M. TITLE Genes in a 220-kb region spanning the TNF cluster in human MHC JOURNAL Genomics 31 (2), 215-222 (1996) MEDLINE 96422187 REFERENCE 2 (bases 1 to 1116) AUTHORS Nalabolu,S.R., Raghunathan,A. and Weissman,S.M. TITLE Analyses of the transcription pattern of B144 and 1C7, two immune system related genes encoded near the TNF cluster JOURNAL Unpublished (1997) REFERENCE 3 (bases 1 to 1116) AUTHORS Nalabolu,S.R., Raghunathan,A., Sivakamasundari,R. and Weissman,S.M. TITLE Direct Submission JOURNAL Submitted (23-OCT-1997) Genetics, Yale School of Medicine, 333 Cedar street, New Haven, CT 06510, USA FEATURES Location/Qualifiers source 1..1116 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6p21.3; MHC complex" /tissue_type="spleen" CDS 265..798 /note="alternatively spliced; precise translation start site not known; initiating methionine could also be one of the next two methionines in the sequence" /codon_start=1 /product="1C7 precursor" /db_xref="PID:g2623873" /translation="MAWMLLLILIMVHPGSCALWVSQPPEIRTLEGSSAFLPCSFNAS QGRLAIGSVTWFRDEVVPGKEVRNGTPEFRGRLAPLASSRFLHDHQAELHIRDVRGHD ASIYVCRVEVLGLGVGTGNGTRLVVEKEHPQLGAGTVLLLRAGFYAVSFLSVAVGSTV YYQGKYAKSTLSGFPQL" sig_peptide 265..447 mat_peptide 448..795 /product="1C7" misc_feature 695..750 /note="encodes putative membrane anchor site" BASE COUNT 224 a 343 c 293 g 256 t ORIGIN 1 ccacaagctg gccccttggc ctcctagaga ccctgacatc tcctccagca gcatctgtcc 61 tctctcctca gggaggcaag catttgatgc tcgaggtccc tggcagttgt ggtccttggc 121 aagtgatgtg tgagtcccgt gtgtcatagg aagctcccca tccccatctg gtgaccaaag 181 gcctggctac aagtagtgag tccttcctcc tccacccaga cctcactgct cagatcccct 241 tcgccaactg ggacatcttc cgacatggcc tggatgctgt tgctcatctt gatcatggtc 301 catccaggat cctgtgctct ctgggtgtcc cagccccctg agattcgtac cctggaagga 361 tcctctgcct tcctgccctg ctccttcaat gccagccaag ggagactggc cattggctcc 421 gtcacgtggt tccgagatga ggtggttcca gggaaggagg tgaggaatgg aaccccagag 481 ttcaggggcc gcctggcccc acttgcttct tcccgtttcc tccatgacca ccaggctgag 541 ctgcacatcc gggacgtgcg aggccatgac gccagcatct acgtgtgcag agtggaggtg 601 ctgggccttg gtgtcgggac agggaatggg actcggctgg tggtggagaa agaacatcct 661 cagctagggg ctggtacagt cctcctcctt cgggctggat tctatgctgt cagctttctc 721 tctgtggccg tgggcagcac cgtctattac cagggcaaat atgccaaatc tactctctcc 781 ggattccccc aactctgaac tttcccttcc accaggtctg acctggaaag gtccaagaag 841 gcagctgccg gctgtggtcc cagcgcccct cccaccacca tgtgggagct cagcacatct 901 gcttccccca gtcccaggag gctgagcctg attgtcctga gaaatgggaa ggatcagata 961 tgactcctcc ttggcaactg ccctttcctg ccaggcccac acataccctc ttctggctgt 1021 taggggagct tgggtccctg aacactgtca ttcacccaat aaattactat ttgaccccag 1081 agtgggtgga agggtgaaaa aaaaaaaaaa aaaaaa // LOCUS AF031141 459 bp mRNA PRI 16-NOV-1997 DEFINITION Homo sapiens ubiquitin conjugating enzyme (UbcH8) mRNA, complete cds. ACCESSION AF031141 NID g2623259 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 459) AUTHORS Kumar,S., Kao,W.H. and Howley,P.M. TITLE Physical interaction between specific E2 and Hect E3 enzymes determines functional cooperativity JOURNAL J. Biol. Chem. 272 (21), 13548-13554 (1997) MEDLINE 97298053 REFERENCE 2 (bases 1 to 459) AUTHORS Kumar,S., Kao,W.H. and Howley,P.M. TITLE Direct Submission JOURNAL Submitted (22-OCT-1997) Pathology, Harvard Medical School, 200 Longwood Avenue, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..459 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..459 /gene="UbcH8" CDS 1..459 /gene="UbcH8" /codon_start=1 /product="ubiquitin conjugating enzyme" /db_xref="PID:g2623260" /translation="MASMRVVKELEDLQKKPPPYLRNLSSDDANVLVWHALLLPDQPP YHLKAFNLRISFPPEYPFKPPMIKFTTKIYHPNVDENGQICLPIISSENWKPCTKTCQ VLEALNVLVNRPNIREPLRMDLADLLTQNPELFRKNAEEFTLRFGVDRPS" BASE COUNT 114 a 146 c 113 g 86 t ORIGIN 1 atggcgagca tgcgagtggt gaaggagctg gaggatcttc agaagaagcc tcccccatac 61 ctgcggaacc tgtccagcga tgatgccaat gtcctggtgt ggcacgctct cctcctaccc 121 gaccaacctc cctaccacct gaaagccttc aacctgcgca tcagcttccc gccggagtat 181 ccgttcaagc ctcccatgat caaattcaca accaagatct accaccccaa cgtggacgag 241 aacggacaga tttgcctgcc catcatcagc agtgagaact ggaagccttg caccaagact 301 tgccaagtcc tggaggccct caatgtgctg gtgaatagac cgaatatcag ggagcccctg 361 cggatggacc tcgctgacct gctgacacag aatccggagc tgttcagaaa gaatgccgaa 421 gagttcaccc tccgattcgg agtggaccgg ccctcctaa // LOCUS AF031383 778 bp mRNA PRI 01-JAN-1998 DEFINITION Homo sapiens hMed7 (MED7) mRNA, complete cds. ACCESSION AF031383 NID g2736289 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 778) AUTHORS Myers,L.C., Gustafsson,C.M., Bushnell,D.A., Lui,M., Erdjument-Bromage,H., Tempst,P. and Kornberg,R.D. TITLE The Med proteins of yeast and their function through the RNA polymerase II C-terminal domain JOURNAL Genes Dev. (1997) In press REFERENCE 2 (bases 1 to 778) AUTHORS Myers,L.C., Gustafsson,C.M., Lui,M., Erdjument-Bromage,H., Tempst,P. and Kornberg,R.D. TITLE Direct Submission JOURNAL Submitted (20-OCT-1997) Structural Biology, Stanford University School of Medicine, Fairchild Center, Stanford, CA 94305, USA FEATURES Location/Qualifiers source 1..778 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="665201 and 139190" /note="These clones were previously the source of EST sequences" gene 1..778 /gene="MED7" CDS 67..768 /gene="MED7" /note="homolog of yeast Med7, a member of the RNA polymerase II mediator complex" /codon_start=1 /product="hMed7" /db_xref="PID:g2736290" /translation="MGEPQQVSALPPPPMQYIKEYTDENIQEGLAPKPPPPIKDSYMM FGNQFQCDDLIIRPLESQGIERLHPMQFDHKKELRKLNMSILINFLDLLDILIRSPGS IKREEKLEDLKLLFVHVHHLINEYRPHQARETLRVMMEVQKRQRLETAERFQKHLERV IEMIQNCLASLPDDLPHSEAGMRVKTEPMDADDSNNCTGQNEHQRENSGHRRDQIIEK DAALCVLIDEMNERP" BASE COUNT 262 a 148 c 175 g 193 t ORIGIN 1 aattcggcac gggggggaag gcggctacca gtgtaaagcc agagctgagg ttcttgatag 61 tccacaatgg gtgaaccaca gcaagtgagt gcacttccac cacctccaat gcaatatatc 121 aaggaatata cggatgaaaa tattcaagaa ggcttagctc ccaagcctcc ccctccaata 181 aaagacagtt acatgatgtt tggcaatcag ttccaatgtg atgatcttat catccgccct 241 ttggaaagtc agggcatcga acggcttcat cctatgcagt ttgatcacaa gaaagaactg 301 agaaaactta atatgtctat ccttattaat ttcttggacc ttttagatat tttaataagg 361 agccctggga gtataaaacg agaagagaaa ctagaagatc ttaagctgct ttttgtacac 421 gtgcatcatc ttataaatga ataccgaccc caccaagcaa gagagacctt gagagtcatg 481 atggaggtcc agaaacgtca acggcttgaa acagctgaga gatttcaaaa gcacctggaa 541 cgagtaattg aaatgattca gaattgcttg gcttctttgc ctgatgattt gcctcattca 601 gaagcaggaa tgagagtaaa aactgaacca atggatgctg atgatagcaa caattgtact 661 ggacagaatg aacatcaaag agaaaattca ggtcatagga gagatcagat tatagagaaa 721 gatgctgcct tgtgtgtcct aattgatgag atgaatgaaa gaccatgaaa gatgtttc // LOCUS AF031523 507 bp mRNA PRI 04-DEC-1997 DEFINITION Homo sapiens bcl-xL/bcl-2 associated death promoter (BAD) mRNA, complete cds. ACCESSION AF031523 NID g2660728 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 507) AUTHORS Ottilie,S., Diaz,J.L., Horne,W., Chang,J., Wang,Y., Wilson,G., Weeks,S., McConnell,M., Chang,S., Fritz,L.C. and Oltersdorf,T. TITLE Dimerization properties of human Bad: identification of a BH-3 domain and analysis of its binding to mutant bcl-2 and bcl-xL proteins JOURNAL J. Biol. Chem. (1997) In press REFERENCE 2 (bases 1 to 507) AUTHORS Ottilie,S., Diaz,J.-L., Horne,W., Chang,J., Wang,Y., Wilson,G., Weeks,S., McConnell,M., Chang,S., Fritz,L.C. and Oltersdorf,T. TITLE Direct Submission JOURNAL Submitted (27-OCT-1997) IDUN Pharmaceuticals Inc., 11085 N. Torrey Pines Road, La Jolla, CA 92037, USA FEATURES Location/Qualifiers source 1..507 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..507 /gene="BAD" CDS 1..507 /gene="BAD" /note="bcl-xL/bcl-2 binding protein." /codon_start=1 /product="bcl-xL/bcl-2 associated death promoter" /db_xref="PID:g2660729" /translation="MFQIPEFEPSEQEDSSSAERGLGPSPAGDGPSGSGKHHRQAPGL LWDASHQQEQPTSSSHHGGAGAVEIRSRHSSYPAGTEDDEGMGEEPSPFRGRSRSAPP NLWAAQRYGRELRRMSDEFVDSFKKGLPRPKSAGTATQMRQSSSWTRVFQSWWDRNLG RGSSAPSQ" BASE COUNT 99 a 165 c 175 g 68 t ORIGIN 1 atgttccaga tcccagagtt tgagccgagt gagcaggaag actccagctc tgcagagagg 61 ggcctgggcc ccagccccgc aggggacggg ccctcaggct ccggcaagca tcatcgccag 121 gccccaggcc tcctgtggga cgccagtcac cagcaggagc agccaaccag cagcagccat 181 catggaggcg ctggggctgt ggagatccgg agtcgccaca gctcctaccc cgcggggacg 241 gaggacgacg aagggatggg ggaggagccc agcccctttc ggggccgctc gcgctcggcg 301 ccccccaacc tctgggcagc acagcgctat ggccgcgagc tccggaggat gagtgacgag 361 tttgtggact cctttaagaa gggacttcct cgcccgaaga gcgcgggcac agcaacgcag 421 atgcggcaaa gctccagctg gacgcgagtc ttccagtcct ggtgggatcg gaacttgggc 481 aggggaagct ccgccccctc ccagtga // LOCUS AF031647 1556 bp mRNA PRI 16-DEC-1997 DEFINITION Homo sapiens signalosome subunit 3 (Sgn3) mRNA, complete cds. ACCESSION AF031647 NID g2688988 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1556) AUTHORS Seeger,M., Kraft,R., Ferrell,K., Bech-Otschir,D., Dumdey,R., Schade,R., Gordon,C., Naumann,M. and Dubiel,W. TITLE Direct Submission JOURNAL Submitted (29-OCT-1997) Biochemistry, Humboldt University Berlin (Charite), Monbijoustrasse 2, Berlin 10117, Germany FEATURES Location/Qualifiers source 1..1556 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="HeLa" gene 1..1556 /gene="Sgn3" CDS 83..1294 /gene="Sgn3" /function="phosphorylates cJun, Ikappa-Balpha and p105" /note="Sgn3 is a 45.7 kDa component of a 450 kDa complex with kinase activity; signalosome (SGN) complex is similar to the plant COP9 particle" /codon_start=1 /product="signalosome subunit 3" /db_xref="PID:g2688989" /translation="MTQLCELINKSGELLAKNLSHLDTVLGALDVQEHSLGVLAVLFV KFSMPSVPDFETLFSQVQLFISTCNGEHIRYATDTFAGLCHQLTNALVERKQPLRGIG ILKQAIDKMQMNTNQLTSIHADLCQLCLLAKCFKPALPYLDVDMMDICKENGAYDAKH FLCYYYYGGMIYTGLKNFERALYFYEQAITTPAMAVSHIMLESYKKYILVSLILLGKV QQLPKYTSQIVGRFIKPLSNAYHELAQVYSTNNPSELRNLVNKHSETFTRDNNMGLVK QCLSSLYKKNIQRLTKTFLTLSLQDMASRVQLSGPQEAEKYVLHMIEDGEIFASINQK DGMVSFHDNPEKYNNPAMLHNIDQEMLKCIELDERLKAMDQEITVNPQFVQKSMGSQE DDSGNKPSSYS" BASE COUNT 472 a 325 c 335 g 424 t ORIGIN 1 ggaattccgg cacgagggaa aacatggcgt ctgcctggag cagttcgtga acagtgtccg 61 acagctctca gctcaagggc aaatgacaca gctttgtgaa ctgatcaaca agagtgggga 121 actccttgcg aagaacttat cccatctgga cactgtgctc ggggctctgg atgtacaaga 181 acactccttg ggcgtccttg ctgttttgtt tgtgaagttt tctatgccca gtgttcctga 241 cttcgaaacg ctattctcac aggttcagct cttcatcagc acttgtaatg gggagcacat 301 tcgatatgca acagacactt ttgctgggct ttgccatcag ctaacaaatg cacttgtgga 361 aagaaaacag cccctgcgag gaattggcat ccttaagcaa gccatagaca agatgcagat 421 gaatacaaac cagctgacct caatacatgc tgatctctgc cagctttgtt tgctagcaaa 481 atgctttaag cctgcccttc catatcttga cgtggatatg atggatatct gtaaagagaa 541 tggagcctat gatgcaaaac actttttatg ttactattat tatggaggga tgatctatac 601 tgggctgaag aactttgaaa gagctctcta cttttatgaa caggctataa ctactcctgc 661 catggcggtc agtcatatca tgttggaatc atataaaaag tatattttag tgtctttgat 721 attacttggc aaagtacaac agctaccaaa atatacatct caaattgtgg gtagattcat 781 taagcctctt agcaatgcat accacgagtt agcacaagtg tattcaacca acaacccctc 841 agaactccga aacctggtga ataagcacag tgaaaccttc actcgcgata acaacatggg 901 gctggtgaag caatgcttgt catctcttta taagaagaat attcagaggc taacaaagac 961 ctttttaact ctatcattac aagatatggc aagtcgtgtg cagttgtctg gacctcagga 1021 ggcagagaaa tacgttctgc acatgataga agatggtgag atttttgcaa gtattaacca 1081 gaaggacggt atggtcagtt tccatgataa ccctgaaaaa tataataacc cagccatgct 1141 tcataacatt gatcaggaga tgctgaagtg cattgagctg gatgagcggc tgaaagccat 1201 ggaccaggag atcacagtga accctcagtt tgtacaaaag agtatgggct cacaagaaga 1261 tgattcagga aacaaaccat ccagttattc ttgaaactaa catccatcct gagctaaaca 1321 agagaaacta ccatcttggc cagtgacaag tgttcggagg gcagcagaga ggaccaagcc 1381 tgtgtcacct ggagactaaa aaattaagtt ttgttttgac atcttcagtc ctgtgtgctt 1441 tcagaaaacc attttctctg caaagaaagg aaacagattt gcaaacttta aagtctgtcg 1501 tggatttatt tatcctcaga ttattgttac tgcattaaat ctaccttttt gcttta // LOCUS AF031815 2521 bp mRNA PRI 05-FEB-1998 DEFINITION Homo sapiens calcium-activated potassium channel (SKCA3) mRNA, complete cds. ACCESSION AF031815 NID g2832248 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2521) AUTHORS Chandy,K.G., Fantino,E., Wittekindt,O., Kalman,K., Tong,L.-L., Ho,T.-H., Gutman,G.A., Crocq,M.-A., Ganguli,R., Nimgaonkar,V., Morris-Rosendahl,D.J. and Gargus,J. TITLE Isolation of a novel potassium channel gene, hSKCa3, on human chromosome 22q, containing a polymorphic CAG repeat: A candidate for schizophrenia and bipolar disorder? JOURNAL Mol. Psych. (1998) In press REFERENCE 2 (bases 1 to 2521) AUTHORS Chandy,K.G., Fantino,E., Wittekindt,O., Kalman,K., Tong,L.-L., Ho,T.-H., Gutman,G.A., Crocq,M.-A., Ganguli,R., Nimgaonkar,V., Morris-Rosendahl,D.J. and Gargus,J. TITLE Direct Submission JOURNAL Submitted (29-OCT-1997) Physiology and Biophysics, University of California at Irvine, Irvine, CA 92697, USA FEATURES Location/Qualifiers source 1..2521 /organism="Homo sapiens" /db_xref="taxon:9606" /map="22q11-q13.1" /chromosome="22" gene 1..2521 /gene="SKCA3" CDS 287..2482 /gene="SKCA3" /note="polymorphic poly-Gln repeats in N-terminal region" /codon_start=1 /product="calcium-activated potassium channel" /db_xref="PID:g2832249" /translation="MDTSGHFHDSGVGDLDEDPKCPCPSSGDEQQQQQQQQQQQQPPP PASPAAPQQPLGPSLQPQPPQLQQQQQQQQQQQQQQSPHPLSQLAQLQSQPVHPGLLH SSPTAFRAPPSSNSTAILHPSSRQGSQLNLNDHLLGHSPSSTATSGPGGGSRHRQASP LVHRRDSNPFTEIAMSSCKYSGGVMKPLSRFSASRRNLIEAETEGQPLQLFSPSNPPE IVISSREDNHAHQTLLHHPNATHNHQHAGTTASSTTFPKANKRKNQNIGYKLGHRRAL FEKRKRLSDYALIFGMFGIVVMVIETELSWGLYSKDSMFSLALKCRISLSTIILLGLI IAYHTRGVQLFVIDNDADDWRIAMTYERILYISLEMLVYTNHTIPGEYKFFWAARLAF SYTPSRAEADVDIILSIPMFLRLYLIARVMLLHSKLFTDASSRSIGALNKINFNTRFV MKTLMTICPGTVLLVFSISLWIIAAWTVRVCERYHDQQDVTSNFLGAMWLISITFLSI GYGDMVPHTYCGKGVCLLTGIMGAGCTALVVAVVARKLELTKAEKHVDNFMMDTQLTK RIKNAAANVLRETWLIYKHTKLLKKIDHAKVRKHQRKFLQAIHQLRSVKMEQRKLSDQ ANTLVDLSKMQNVMYDLITELNDRSEDLEKQIGSLESKLEHLTASFNSLPLLIADTLR QQQQQLLSAIIEARGVSVAVGTTHTPISDTPIGVSSTSFPTPYTSSSSC" repeat_region 374..409 /rpt_type=tandem /rpt_unit=cag repeat_region 485..526 /rpt_type=tandem /rpt_unit=cag polyA_signal 2482..2487 /gene="SKCA3" BASE COUNT 583 a 823 c 622 g 493 t ORIGIN 1 gcctcacacg ctcctagagg accacctcct gagagagttc tttcaccccc tcttctttct 61 ccaagctccc ctcctgctct ccctccctgc ccaatacaat gcattcttga gtggcagcgt 121 ctggactcca ggcagcccca gagaaccgaa gcaagccaaa gagaggactg gagccaagat 181 actggtgggg gagattggat gcctggcttt ctttgaggac atctttggag cgagggtggc 241 tttggggtgg gggcttgtgc tgcagggaat acagccaggc cccaagatgg acacttctgg 301 gcacttccat gactcggggg tgggggactt ggatgaagac cccaagtgcc cctgtccatc 361 ctctggggat gagcagcagc agcagcagca gcagcaacag cagcagcagc caccaccgcc 421 agcgtcacca gcagcccccc agcagcccct gggaccctcg ctgcagcctc agcctccgca 481 gcttcagcag cagcagcagc agcagcagca gcagcagcag cagcagtcac cgcatcccct 541 gtctcagctc gcccaactcc agagccagcc cgtccaccct ggcctgctgc actcctctcc 601 caccgctttc agggcccccc cttcgtccaa ctccaccgcc atcctccacc cttcctccag 661 gcaaggcagc cagctcaatc tcaatgacca cttgcttggc cactctccaa gttccacagc 721 tacaagtggg cctggcggag gtagccggca ccgacaggcc agccccctgg tgcaccggcg 781 ggacagcaac cccttcacgg agatcgccat gagctcctgc aagtatagcg gtggggtcat 841 gaagcccctc agccgcttca gcgcctcccg gaggaacctc atcgaggccg agactgaggg 901 ccaacccctc cagcttttca gccctagcaa ccccccggag atcgtcatct cctcccggga 961 ggacaaccat gcccaccaga ccctgctcca tcaccctaat gccacccaca accaccagca 1021 tgccggcacc accgccagca gcaccacctt ccccaaagcc aacaagcgga aaaaccaaaa 1081 cattggctat aagctgggac acaggagggc cctgtttgaa aagagaaagc gactgagtga 1141 ctatgctctg atttttggga tgtttggaat tgttgttatg gtgatagaga ccgagctctc 1201 ttggggtttg tactcaaagg actccatgtt ttcgttggcc ctgaaatgcc gtatcagtct 1261 gtccaccatc atccttttgg gcttgatcat cgcctaccac acacgtggag tccagctctt 1321 cgtgatcgac aacgacgcgg atgactggcg gatagccatg acctacgagc gcatcctcta 1381 cattagcctg gagatgctgg tgtacacaaa ccacaccatt cctggcgagt acaagttctt 1441 ctgggcggca cgcctggcct tctcctacac accctcccgg gcggaggccg atgtggacat 1501 catcctgtct atccccatgt tcctgcgcct gtacctgatc gcccgagtca tgctgctaca 1561 cagcaagctc ttcaccgatg cctcgtcccg cagcatcggg gccctcaaca agatcaactt 1621 caacacccgc tttgtcatga agacgctcat gaccatctgc cctggcactg tgctgctcgt 1681 gttcagcatc tctctgtgga tcattgctgc ctggaccgtc cgtgtctgtg aaaggtacca 1741 tgaccagcag gacgtaacta gtaactttct gggtgccatg tggctcatct ccatcacatt 1801 cctttccatt ggttatgggg acatggtgcc ccacacatac tgtgggaaag gtgtctgtct 1861 cctcactggc atcatgggtg caggctgcac tgcccttgtg gtggccgtgg tggcccgaaa 1921 gctggaactc accaaagcgg agaagcacgt cgataacttc atgatggaca ctcagctcac 1981 caagcggatc aagaatgctg cagcaaatgt ccttcgggaa acatggttaa tctataaaca 2041 cacaaagctg ctaaagaaga ttgaccatgc caaagtgagg aaacaccaga ggaagttcct 2101 ccaagctatc caccagttga ggagcgtcaa gatggaacag aggaagctga gtgaccaagc 2161 caacactctg gtggaccttt ccaagatgca gaatgtcatg tatgacttaa tcacagaact 2221 caatgaccgg agcgaagacc tggagaagca gattggcagc ctggagtcga agctggagca 2281 tctcaccgcc agcttcaact ccctgccgct gctcatcgcc gacaccctgc gccagcagca 2341 gcagcagctc ctgtctgcca tcatcgaggc ccggggtgtc agcgtggcag tgggcaccac 2401 ccacacccca atctccgata cgcccattgg ggtcagctcc acctccttcc cgaccccgta 2461 cacaagttca agcagttgct aaataaatct ccccactcca gaagcattaa aaaaaaaaaa 2521 a // LOCUS AF032119 3122 bp mRNA PRI 04-DEC-1997 DEFINITION Homo sapiens hCASK (CASK) mRNA, complete cds. ACCESSION AF032119 NID g2641548 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3122) AUTHORS Cohen,A.R., Woods,D.F. and Anderson,J.M. TITLE hCASK, the human homolog of the C. elegans protein Lin-2, Drosophila Camguk and rat CASK JOURNAL Unpublished REFERENCE 2 (bases 1 to 3122) AUTHORS Cohen,A.R., Woods,D.F. and Anderson,J.M. TITLE Direct Submission JOURNAL Submitted (31-OCT-1997) Cell Biology, Yale University, 333 Cedar Street, P.O. Box 208019, New Haven, CT 06520-8019, USA FEATURES Location/Qualifiers source 1..3122 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver, lung, and brain" gene 1..3122 /note="similar to C. elegans lin-2, Drosophila camguk and rat cask" /gene="CASK" CDS 28..2793 /gene="CASK" /note="membrane associated guanylate kinase; CASK/LIN-2; CAMGUK" /codon_start=1 /product="hCASK" /db_xref="PID:g2641549" /translation="MADDDVLFEDVYELCEVIGKGPFSVVRRCINRETGQQFAVKIVD VAKFTSSPGLSTEDLKREASICHMLKHPHIVELLETYSSDGMLYMVFEFMDGADLCFE IVKRADAGFVYSEAVASHYMRQILEALRYCHDNNIIHRDVKPHCVLLASKENSAPVKL GGFGVAIQLGESGLVAGGRVGTPHFMAPEVVKREPYGKPVDVWGCGVILFILLSGCLP FYGTKERLFEGIIKGKYKMNPRQWSHISESAKDLVRRMLMLDPAERITVYEALNHPWL KERDRYAYKIHLPETVEQLRKFNARRKLKGAVLAAVSSHKFNSFYGDPPEELPDFSED PTSSGLLAAERAVSQVLDSLEEIHALTDCSEKDLDFLHSVFQDQHLHTLLDLYDKINT KSSPQIRNPPSDAVQRAKEVLEEISCYPENNDAKELKRILTQPHFMALLQTHDVVAHE VYSDEALRVTPPPTSPYLNGDSPESANGGMDMENVTRVRLVQFQKNTDEPMGITLKMN ELNHCIVARIMHGGMIHRQGTLHVGDEIREINGISVANQTVEQLQKMLREMRGSITFK IVPSYRTQSSSCERDSPSTSRQSPANGHSSTNNSVSDLPSTTQPKGRQIYVRAQFEYD PAKDDLIPCKEAGIRFRVGDIIQIISKDDHNWWQGKLENSKNGTAGLIPSSELQEWRV ACIAMEKTKQEQQASCTWFGKKKKQYKDKYLAKHNADLVTYEEVVKLPAFKRKTLVLL GAHGVGRRHIKNTLITKHPDRFAYPIPHTTRPPKRDEENGKNYYFVSHDQMMQDISNN EYLEYGSHEDAMYGTKLETIRKIHEQGLIAILDVEPQALKVLRTAEFAPFVVFIAAPT ITPGLNEDESLQRLQKESDILQRTYAHYFDLTIINNEIDETIRHLEEAVELVCTAPQW VPVSWVY" BASE COUNT 960 a 657 c 717 g 788 t ORIGIN 1 cgctgcggcc gctatcccct ccggaccatg gccgacgacg acgtgctgtt cgaggatgtg 61 tacgagctgt gcgaggtgat cggaaagggt cccttcagtg ttgtacgacg atgtatcaac 121 agagaaactg ggcaacaatt tgctgtaaaa attgttgatg tagccaagtt cacatcaagt 181 ccagggttaa gtacagaaga tctaaagcgg gaagccagta tctgtcatat gctgaaacat 241 ccacacattg tagagttatt ggagacatat agctcagatg gaatgcttta catggttttc 301 gaatttatgg atggagcaga tctgtgtttt gaaatcgtaa agcgagctga cgctggtttt 361 gtgtacagtg aagctgtagc cagccattat atgagacaga tactggaagc tctacgctac 421 tgccatgata ataacataat tcacagggat gtgaagcccc actgtgttct ccttgcctca 481 aaagaaaact cggcacctgt taaacttgga ggctttgggg tagctattca attaggggag 541 tctggacttg tagctggagg acgtgttgga acacctcatt ttatggcacc agaagtggtc 601 aaaagagagc cttacggaaa gcctgtagac gtctgggggt gcggtgtgat cctttttatc 661 ctgctcagtg gttgtttgcc tttttacgga accaaggaaa gattgtttga aggcattatt 721 aaaggaaaat ataagatgaa tccaaggcag tggagccata tctctgaaag tgccaaagac 781 ctagtacgtc gcatgctgat gctggatcca gctgaaagga tcactgttta tgaagcactg 841 aatcacccat ggcttaagga gcgggatcgt tacgcctaca agattcatct tccagaaaca 901 gtagagcagc tgaggaaatt caatgcaagg aggaaactaa agggtgcagt actagccgct 961 gtgtcaagtc acaaattcaa ctcattctat ggggatcccc ctgaagagtt accagatttc 1021 tccgaagacc ctacctcctc agggcttcta gcagcagaaa gagcagtctc acaggtgctg 1081 gacagcctgg aagagattca tgcgcttaca gactgcagtg aaaaggacct agattttcta 1141 cacagtgttt tccaggatca gcatcttcac acactactag atctgtatga caaaattaac 1201 acaaagtctt caccacaaat caggaatcct ccaagcgatg cagtacagag agccaaagag 1261 gtattggaag aaatttcatg ttaccctgag aataacgacg caaaggaact aaagcgtatt 1321 ttaacacaac ctcatttcat ggccttactt cagactcacg acgtagtggc acatgaagtt 1381 tacagtgatg aagcattgag ggtcacacct cctcccacct ctccctattt aaacggcgat 1441 tctccagaaa gtgctaacgg aggcatggat atggagaatg tgaccagagt tcggctggta 1501 cagtttcaaa agaacacaga tgaaccaatg ggaatcactt taaaaatgaa tgaactaaat 1561 cattgtattg ttgcaagaat tatgcatggg ggcatgattc acaggcaagg tacacttcat 1621 gttggtgatg aaattcgaga aatcaatggc atcagtgtgg ctaaccaaac agtggaacaa 1681 ctgcaaaaaa tgcttaggga aatgcggggg agtattacct tcaagattgt gccaagttac 1741 cgcactcagt cttcgtcctg tgagagagat tccccttcca cttccagaca gtccccagct 1801 aatggtcata gcagcactaa caattctgtt tcggacttgc catcaactac ccaaccaaaa 1861 ggacgacaga tctatgtaag agcacaattt gaatatgatc cagccaagga tgacctcatc 1921 ccctgtaaag aagctggcat tcgattcaga gttggtgaca tcatccagat tattagtaag 1981 gatgatcata attggtggca gggtaaactg gaaaactcca aaaatggaac tgcaggtctc 2041 attccttctt ctgaacttca ggaatggcga gtagcttgca ttgccatgga gaagaccaaa 2101 caggagcagc aggccagctg tacttggttt ggcaagaaaa agaagcagta caaagataaa 2161 tatttggcaa agcacaatgc agatcttgtc acatatgaag aagtagtaaa actgccagca 2221 ttcaagagga aaacactagt cttattaggc gcacatggtg ttgggagaag acacataaaa 2281 aacactctca tcacaaagca cccagaccgg tttgcgtacc ctattccaca tacaaccaga 2341 cctccaaaga gagacgaaga aaatggaaag aattattact ttgtatctca tgaccaaatg 2401 atgcaagaca tctctaataa cgagtacttg gagtacggca gccacgagga tgcgatgtat 2461 gggacaaaac tggagaccat ccggaagatc cacgagcagg ggctgattgc aatactggac 2521 gtggagcctc aggcactgaa ggtcctgaga actgcagagt ttgctccttt tgttgttttc 2581 attgctgcac caactattac tccaggttta aatgaggatg aatctcttca gcgtctgcag 2641 aaggagtctg acatcttaca gagaacatat gcacactact tcgatctcac aattatcaac 2701 aatgaaattg atgagacaat cagacatctg gaggaagctg ttgagctcgt gtgcacagcc 2761 ccacagtggg tccctgtctc ctgggtctat taggcctctc cccagatatc tgagcataac 2821 tgggagcacc tcatttgtgg aaaagcctct ttgttatcgg ccttgtgtca gcaggtcatg 2881 gtccctagag actacctagt tgtagtgtga cctacattta taattattgt catgtccgaa 2941 tagataggag gagaaaaaca attacacact aatttaaaga gacagtatct tttttaatca 3001 gttctcctaa actttaataa aatgtatctt taaatgtatg tattattcaa tcctttggaa 3061 tgttatattt ttggaaatca tagcttttta tttccaaggc ccctaaaact gcacaaaata 3121 ga // LOCUS AF032387 5027 bp mRNA PRI 23-NOV-1997 DEFINITION Homo sapiens snRNA activating protein complex 190kD subunit (SNAP190) mRNA, complete cds. ACCESSION AF032387 NID g2641556 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5027) AUTHORS Wong,M.W., Henry,R.W., Ma,B., Kobayashi,R., Klages,N., Matthias,P., Strubin,M. and Hernandez,N. TITLE The large subunit of the basal transcription factor SNAPc is a Myb domain protein that interacts with Oct-1 JOURNAL Mol. Cell. Biol. (1997) In press REFERENCE 2 (bases 1 to 5027) AUTHORS Henry,R.W. TITLE Direct Submission JOURNAL Submitted (31-OCT-1997) Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, NY 11724, USA FEATURES Location/Qualifiers source 1..5027 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="fetal cell teratocarcinoma" gene 1..5027 /gene="SNAP190" CDS 376..4785 /gene="SNAP190" /function="transcription factor required for transcription of snRNA genes" /note="DNA-binding protein, basal transcription factor; snRNA activating protein complex 190kD subunit" /codon_start=1 /product="SNAP190" /db_xref="PID:g2641557" /translation="MDVDAEREKITQEIKELERILDPGSSGSHVEISESSLESDSEAD SLPSEDLDPADPPISEEERWGEASNDEDDPKDKTLPEDPETCLQLNMVYQEVIQEKLA EANLLLAQNREQQEELMRDLAGSKGTKVKDGKSLPPSTYMGHFMKPYFKDKVTGVGPP ANEDTREKAAQGIKAFEELLVTKWKNWEKALLRKSVVSDRLQRLLQPKLLKLEYLHQK QSKVSSELERQALEKQGREAEKEIQDINQLPEEALLGNRLDSHDWEKISNINFEGSRS AEEIRKFWQNSEHPSINKQEWSREEEERLQAIAAAHGHLEWQKIAEELGTSRSAFQCL QKFQQHNKALKRKEWTEEEDRMLTQLVQEMRVGSHIPYRRIVYYMEGRDSMQLIYRWT KSLDPGLKKGYWAPEEDAKLLQAVAKYGEQDWFKIREEVPGRSDAQCRDRYLRRLHFS LKKGRWNLKEEEQLIELIEKYGVGHWAKIASELPHRSGSQCLSKWKIMMGKKQGLRRR RRRARHSVRWSSTSSSGSSSGSSGGSSSSSSSSSEEDEPEQAQAGEGDRALLSPQYMV PDMDLWVPARQSTSQPWRGGAGAWLGGPAASLSPPKGSSASQGGSKEASTTAAAPGEE TSPVQVPARAHGPVPRSAQASHSADTRPAGAEKQALEGGRRLLTVPVETVLRVLRANT AARSCTQKEQLRQPPLPTSSPGVSSGDSVARSHVQWLRHRATQSGQRRWRHALHRRLL NRRLLLAVTPWVGDVVVPCTQASQRPAVVQTQADGLREQLQQARLASTPVFTLFTQLF HIDTAGCLEVVRERKALPPRLPQAGARDPPVHLLQASSSAQSTPGHLFPNVPAQEASK SASHKGSRRLASSRVERTLPQASLLASTGPRPKPKTVSELLQEKRLQEARAREATRGP VVLPSQLLVSSSVILQPPLPHTPHGRPAPGPTVLNVPLSGPGAPAAAKPGTSGSWQEA GTSAKDKRLSTMQALPLAPVFSEAEGTAPAASQAPALGPGQISVSCPESGLGQSQAPA ASRKQGLPEAPPFLPGAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAQGLLPV PVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAQGPRAPALSSSWQPPANMNREPEP SCRTDTPAPPTHALSQSPAEADGSVAFVPGEAQVAREIPEPRTSSHADPPEAEPPWSG RLPAFGGVIPATEPRGTPGSPSGTQEPRGPLGLEKLPLRQPGPEKGALDLEKPPLPQP GPEKGALDLGLLSQEGEAATQQWLGGQRGVRVPLLGSRLPYQPPALCSLRALSGLLLH KKALEHKATSLVVGGEAERPAGALQASLGLVRGQLQDNPAYLLLRARFLAAFTLPALL ATLAPQGVRTTLSVPSRVGSESEDEDLLSELELADRDGQPGCTTATCPIQGAPDSGKC SASSCLDTSNDPDDLDVLRTRHARHTRKRRRLV" BASE COUNT 1052 a 1563 c 1598 g 812 t 2 others ORIGIN 1 aaagcgacac cttctacgct gggtggccag tttccatgtg gcttgrtgtg ggagcagaga 61 cgggacagtg gggcctgtcc ttgttcatcc actccctgcc tgtgctggtt gcagcctcag 121 agcaggcggg agatgttctt ggggcttagg ctcctggcag agcacatagc aggagtttgt 181 ggggtctgag gttcctgtcc cagggttccc cgattctgtg cctggcctat taatctttct 241 cyggggagcc cagtgcccac tgccgagcag gctcctgcca tcccccatgg gcgggtgtgt 301 ctgaaggaga atgggatccc ggcacggtct tggtttctaa catctcgggg tgtgttttgg 361 aggcaggcgg gagtcatgga tgtagatgct gaaagagaga agataacaca ggagatcaag 421 gagctggaaa ggattttgga tcccggctcc tcgggctccc acgtggagat ctcagaatca 481 agtctcgagt cagattctga agcagattca ctgccttctg aggacttgga tcctgccgat 541 cccccgatct cggaagaaga aaggtggggc gaagccagca atgacgagga cgatcccaag 601 gataaaaccc tccctgaaga cccagaaacc tgcctgcagc tgaacatggt ctaccaggag 661 gtcatccagg agaagctggc tgaggccaac ctgctgctgg cccagaaccg ggagcagcag 721 gaggaactca tgagggatct ggctgggtcc aaaggcacca aggtgaaaga cggcaaaagc 781 ctgcccccaa gcacatacat ggggcacttc atgaagccgt atttcaagga caaggtcacg 841 ggcgtggggc cacctgccaa cgaggacaca cgagagaagg ctgcccaggg gatcaaggct 901 ttcgaggagc tccttgtgac caaatggaaa aactgggaaa aggccttgct ccgaaagtca 961 gtggtgagtg accgcctgca gcgattgctt cagcccaagt tactgaagct cgagtacttg 1021 caccagaagc agagcaaagt ctccagtgag ctggagaggc aagccctgga gaagcagggc 1081 agggaagccg agaaggagat ccaggacatc aaccagcttc cagaagaggc cttgctggga 1141 aacaggctgg acagccacga ctgggagaag atttccaata ttaactttga aggcagccgc 1201 agtgcagagg agatccggaa gttctggcag aactcggagc accccagcat caacaagcag 1261 gagtggagca gggaggagga ggagcggctg caggcgatcg cggctgcaca cggccacctg 1321 gagtggcaga agattgcaga ggagctgggg accagccgca gcgccttcca gtgcctgcag 1381 aaattccagc agcacaacaa agctctgaaa cgcaaggagt ggacagagga ggaggaccgc 1441 atgctcacgc agctggtgca ggagatgcgc gtcggcagcc acatccccta ccgcagaatt 1501 gtctactata tggaagggag agactccatg cagctgatct accgatggac caagagcttg 1561 gatcctggtc tgaagaaggg ttactgggcc ccggaggaag atgctaagtt gcttcaagct 1621 gttgccaaat acggggagca ggattggttt aaaatccggg aagaggtgcc aggtaggagc 1681 gatgcccagt gccgagatcg gtatctcagg agattacatt tcagcttgaa aaagggtcgg 1741 tggaatttaa aagaagagga acagttaatt gaattaatag aaaaatatgg tgtcggtcac 1801 tgggcaaaaa tagcttctga gctgccccat cggtctggct cccagtgtct gagcaagtgg 1861 aagatcatga tggggaagaa gcagggtctc cggaggcggc ggcggagggc ccgtcacagc 1921 gtccggtgga gctctaccag cagcagcggc agcagcagtg gcagcagtgg agggagcagc 1981 agcagcagca gcagcagcag cgaggaggac gagccagagc aggcgcaggc cggggagggt 2041 gacagagcgc tgctgtcccc acagtacatg gtcccggaca tggacctgtg ggttcctgcc 2101 aggcagagca ccagccagcc atggagagga ggggcagggg cctggctggg aggccccgct 2161 gcctccctca gccctcccaa ggggtccagt gccagccagg gcggcagcaa ggaagcttcc 2221 accacagccg cggctcctgg agaggagacg agtccggtgc aggtccctgc cagggcccac 2281 ggccctgtcc cgaggtctgc ccaggcctcc cactcagcag acactcgccc ggcgggcgca 2341 gagaagcagg ccctggaggg tgggaggcgt ctgctgacag tgcctgtgga gaccgtgctg 2401 agggtgctca gggccaacac ggctgctcgg agctgcacac agaaagagca gctgaggcag 2461 ccacccctgc ccacctcatc cccaggggtc agctctggtg acagcgtggc ccgatcccat 2521 gtgcagtggc tacggcacag agccacccag agtgggcagc ggcgctggag acacgctctg 2581 caccggaggc tcctgaaccg caggctgctg ctggctgtga ccccttgggt aggggacgtt 2641 gtcgtgccct gcacacaggc ttcccagaga cccgccgtag tgcagactca agcggatggc 2701 ctcagggagc agctgcagca ggcccgcctg gccagcaccc ctgtgtttac cctgtttacc 2761 cagctgttcc acatcgatac tgccggctgc ttggaggtcg tccgagagag gaaggccctg 2821 ccacccaggc tgccccaggc tggtgctcgg gacccaccag ttcatcttct gcaggcatcc 2881 tcaagtgccc agagcacccc aggccacctc ttcccaaacg tgccggctca agaagcctca 2941 aagagtgcca gccacaaagg gagccgaaga ctggcgtcca gccgggtgga gcgcacccta 3001 ccccaggcgt ccctgctggc ttcaaccggc ccccggccca agcccaagac tgtgtcggag 3061 ctgcttcagg agaagcggct tcaggaggcc cgtgccaggg aggccacccg gggcccggtg 3121 gtgctcccgt cccagctgct ggtctcctcg tctgtgatcc tccagccccc tctaccacac 3181 accccacacg gccgcccagc cccgggtccc accgtcttaa atgtaccgct ctctgggcct 3241 ggggcccccg cagcagccaa acctggcact tctggctcct ggcaggaggc tgggacttca 3301 gccaaggaca agagactctc caccatgcaa gccctgcccc tcgctcctgt cttctcagag 3361 gccgaaggca cagcccctgc tgcttcccaa gcccctgccc tgggccccgg ccagatctct 3421 gtgagctgcc ccgagagtgg tctcggacag tctcaggccc ccgctgcatc ccggaagcag 3481 ggcctgcctg aggcgccacc ctttctcccc ggagccccca gccccacccc actgcccgtc 3541 cagcccctca gcctgacgca cataggaggg ccacatgtgg cgaccagtgt ccccctgcct 3601 gtcacctggg tgctcacagc ccaggggctt ctccctgttc ctgtaccagc tgtggtgagc 3661 cttcccaggc cagcagggac ccctggcccc gcagggctgc tggccactct gctgcctccc 3721 ctgactgaga ctcgggcggc ccagggcccc agggccccag cgttgagcag ctcttggcag 3781 cccccagcca atatgaacag ggaaccggag ccttcctgca ggacagacac cccagctcct 3841 cccacacacg ccctctccca aagtcctgca gaagcggatg gcagtgtggc ctttgtccct 3901 ggagaggccc aggtggccag ggagatacct gagcccagga cgtcctccca cgctgaccct 3961 cctgaagcag aacccccttg gtccgggagg ctgccagcct tcggtggtgt catcccagca 4021 actgagccaa gggggacgcc ggggtccccc tcagggacac aggagcccag ggggcctctg 4081 ggcctggaga agctgcccct gcgccagcct gggcctgaga agggggccct ggacctggag 4141 aagccgcccc taccccagcc tgggcctgag aagggggccc tggacctggg cctgctgtcc 4201 caggagggcg aggcagccac acagcagtgg ctggggggcc agcggggggt gcgtgtgcct 4261 cttctgggca gcagactgcc ctatcagccc ccagccctgt gcagcctgcg agctctgtcc 4321 ggtctcctac tccacaagaa ggccctggag cacaaggcca cctccctggt ggtggggggc 4381 gaggctgagc ggccggccgg agcactgcaa gcctcactgg ggctggtgcg ggggcagctc 4441 caggacaacc cggcctacct cctgttgcgg gcgcggttcc tggcagcctt caccctccct 4501 gcgctcctgg ccaccctggc cccccaaggc gtccgcacca ccctctcagt accttcgagg 4561 gtgggctctg agagtgagga tgaagacctc ctgagtgagc tggaacttgc agacagggac 4621 gggcagccgg gctgcacgac agccacatgc cccattcagg gagccccaga ctctggtaaa 4681 tgctctgctt cctcctgcct ggatacttct aatgaccctg acgacctgga cgtgctcaga 4741 acccggcatg ccaggcacac ccggaagcgg aggcggctgg tgtgagcagc aggacaagca 4801 ctttctcaga agcccgtggc cctgctgacg agagacagcc cacccccagc ctgcaagaga 4861 ggcagccaga ggccaggtat ctgctggccg ctgactgacc aggtccgcag tgaacagagg 4921 cgggcctgcc gactgactgt gtggcatgga gcatggctgt tccccaagtg cacacctgaa 4981 cacttggagg aataaagttc tgtttttaat tgtggaaaaa aaaaaaa // LOCUS AF033382 1485 bp mRNA PRI 03-JAN-1998 DEFINITION Homo sapiens potassium channel mRNA, complete cds. ACCESSION AF033382 NID g2739500 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1485) AUTHORS Kui,S., Kyaw,H., Fan,P., Zeng,Z., Shell,B.K., Carter,K.C. and Li,Y. TITLE Isolation, characterization, and mapping of two human potassium channels JOURNAL Unpublished REFERENCE 2 (bases 1 to 1485) AUTHORS Kui,S., Kyaw,H., Fan,P., Zeng,Z., Shell,B.K., Carter,K.C. and Li,Y. TITLE Direct Submission JOURNAL Submitted (07-NOV-1997) Molecular Biology, Human Genome Sciences, Inc., 9410 Key West Ave., Rockville, MD 20850, USA FEATURES Location/Qualifiers source 1..1485 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="2" /map="2p25" CDS 1..1485 /codon_start=1 /product="potassium channel" /db_xref="PID:g2739501" /translation="MDGSGERSLPEPGSQSSAASDDIEIVVNVGGVRQVLYGDLLSQY PETRLAELINCLAGGYDTIFSLCDDYDPGKREFYFDRDPDAFKCVIEVYYFGEVHMKK GICPICFKNEMDFWKVDLKFLDDCCKSHLSEKREELEEIARRVQLILDDLGVDAAEGR WRRCQKCVWKFLEKPESSCPARVVAELSFLLILVSSVVMCMDTIPELQVLDAEGNRVE HPTLENVETACIGWFTLEYLLRLFSSPNKLHFALSFMNIVDVLAILPFYVSLTLTHLG ARMMELTNVQQAVQALRIMRIARIFKLARHSSGLQTLTYALKRSFKELGLLLMYLAVG IFVFSALGYTMEQSHPETLFKNIPQSFWWAIITMTTVGYGDIYPKTTLSKLNAAISFL CGVIAIALPIHPIINNFVRYYNKQRVLETAAKHELELMELNSSSGGEGKTGGSRSDLD NLPPEPAGKEAPSCSSRLKLSHSDTFIPLLTEEKHHRTRLQSCK" BASE COUNT 284 a 487 c 449 g 265 t ORIGIN 1 atggacgggt ccggggagcg cagcctcccg gagccgggca gccagagctc cgctgccagc 61 gacgacatag agatagtcgt caacgtgggg ggcgtgcggc aggtgctgta cggggacctc 121 ctcagtcagt accctgagac ccggctggcg gagctcatca actgcttggc tgggggctac 181 gacaccatct tctccctgtg cgacgactac gaccccggca agcgcgagtt ctactttgac 241 agggacccgg acgccttcaa gtgtgtcatc gaggtgtact atttcgggga ggtccacatg 301 aagaagggca tctgccccat ctgcttcaag aacgagatgg acttctggaa ggtggacctc 361 aagttcctgg acgactgttg caagagccac ctgagcgaga agcgcgagga gctggaggag 421 atcgcgcgcc gcgtgcagct catcctggac gacctgggcg tggacgcggc cgagggccgc 481 tggcgccgct gccagaagtg cgtctggaag ttcctggaga agcccgagtc gtcgtgcccg 541 gcgcgggtgg tggccgagct ctccttcctg ctcatcctcg tctcgtccgt ggtcatgtgc 601 atggacacca tccccgaact gcaggtgctg gacgccgagg gcaaccgcgt ggagcacccg 661 acgctggaga acgtggagac ggcgtgcatt ggctggttca ccctggagta cctgctgcgc 721 ctcttctcgt cacccaacaa gctgcacttc gcgctgtcct tcatgaacat tgtggacgtg 781 ctggccatcc tccccttcta cgtgagcctc acgctcacgc acctgggtgc ccgcatgatg 841 gagctgacca acgtgcagca ggccgtgcag gcgctgcgga tcatgcgcat cgcgcgcatc 901 ttcaagctgg cccgccactc ctcgggcctg cagaccctca cctatgccct caagcgcagc 961 ttcaaggaac tggggctgct gctcatgtac ctggcagtgg gtatcttcgt cttctctgcc 1021 ctgggctaca ccatggagca gagccatcca gagaccctgt ttaagaacat cccccagtcc 1081 ttctggtggg ccatcatcac catgaccacc gtcggctacg gcgacatcta ccccaagacc 1141 acgctgagca agctcaacgc ggccatcagc ttcttgtgtg gtgtcattgc catcgccctg 1201 cccatccacc ccatcatcaa caactttgtc aggtactaca acaagcagcg cgtcctggag 1261 accgcggcca agcacgagct ggagctgatg gaactcaact ccagcagcgg gggcgagggc 1321 aagaccgggg gctcccgcag tgacctggac aacctccctc cagagcctgc ggggaaggag 1381 gcgccgagct gcagcagccg gctgaagctc tcccacagcg acaccttcat ccccctcctg 1441 accgaggaga agcaccacag gacccggctc cagagttgca agtga // LOCUS AF033383 1542 bp mRNA PRI 16-JAN-1998 DEFINITION Homo sapiens potassium channel mRNA, complete cds. ACCESSION AF033383 NID g2739502 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1542) AUTHORS Su,K., Kyaw,H., Fan,P., Zeng,Z., Shell,B.K., Carter,K.C. and Li,Y. TITLE Isolation, characterization, and mapping of two human potassium channels JOURNAL Biochem. Biophys. Res. Commun. 241 (3), 675-681 (1997) MEDLINE 98096380 REFERENCE 2 (bases 1 to 1542) AUTHORS Kui,S., Kyaw,H., Fan,P., Zeng,Z., Shell,B.K., Carter,K.C. and Li,Y. TITLE Direct Submission JOURNAL Submitted (07-NOV-1997) Molecular Biology, Human Genome Sciences, Inc., 9410 Key West Avenue, Rockville, MD 20850, USA FEATURES Location/Qualifiers source 1..1542 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="20" /map="20q13" CDS 1..1542 /codon_start=1 /product="potassium channel" /db_xref="PID:g2739503" /translation="MTLLPGDNSDYDYSALSCTSDASFHPAFLPQRQAIKGAFYRRAQ RLRPQDEPRQGCQPEDRRRRIIINVGGIKYSLPWTTLDEFPLTRLGQLKACTNFDDIL NVCDDYDVTCNEFFFDRNPGAFGTILTFLRAGKLRLLREMCALSFQEELLYWGIAEDH LDGCCKRRYLQKIEEFAEMVEREEEDDALDSEGRDSEGPAEGEGRLGRCMRRLRDMVE RPHSGLPGKVFACLSVLFVTVTAVNLSVSTLPSLREEEEQGHCSQMCHNVFIVESVCV GWFSLEFLLRLIQAPSKFAFLRSPLTLIDLVAILPYYITLLVDGAAAGRRKPGAGNSY LDKVGLVLRVLRALRILYVMRLARHSLGLQTLGLTARRCTREFGLLLLFLCVAIALFA PLLYVIENEMADSPEFTSIPACYWWAVITMTTVDYGDMVPRSTPGQVVALSSILSGIL LMAFPVTSIFHTFSPSYLELKQEQERVMFRRAQFLIKTKSQLSVSQDSDILFGSASSD TRDNN" BASE COUNT 254 a 529 c 488 g 271 t ORIGIN 1 atgaccctct taccgggaga caattctgac tacgactaca gcgcgctgag ctgcacctcg 61 gacgcctcct tccacccggc cttcctcccg cagcgccagg ccatcaaggg cgcgttctac 121 cgccgggcgc agcggctgcg gccgcaggat gagccccgcc agggctgtca gcccgaggac 181 cgccgccgtc ggatcatcat caacgtaggc ggcatcaagt actcgctgcc ctggaccacg 241 ctggacgagt tcccgctgac gcgcctgggc cagctcaagg cctgcaccaa cttcgacgac 301 atcctcaacg tgtgcgatga ctacgacgtc acctgcaacg agttcttctt cgaccgcaac 361 ccgggggcct tcggcactat cctgaccttc ctgcgcgcgg gcaagctgcg gctgctgcgc 421 gagatgtgcg cgctgtcctt ccaggaggag ctgctgtact ggggcatcgc ggaggaccac 481 ctggacggct gctgcaagcg ccgctacctg cagaagattg aggagttcgc ggagatggtg 541 gagcgggagg aagaggacga cgcgctggac agcgagggcc gcgacagcga gggcccggcc 601 gagggcgagg gccgcctggg gcgctgcatg cggcgactgc gcgacatggt ggagaggccg 661 cactcggggc tgcctggcaa ggtgttcgcc tgcctgtcgg tgctcttcgt gaccgtcacc 721 gccgtcaacc tctccgtcag caccttgccc agcctgaggg aggaggagga gcagggccac 781 tgttcccaga tgtgccacaa cgtcttcatc gtggagtcgg tgtgcgtggg ctggttctcc 841 ctggagttcc tcctgcggct cattcaggcg cccagcaagt tcgccttcct gcggagcccg 901 ctgacgctga tcgacctggt ggccatcctg ccctactaca tcacgctgct ggtggacggc 961 gccgccgcag gccgtcgcaa gcccggcgcg ggcaacagct acctggacaa ggtggggctg 1021 gtgctgcgcg tgctgcgggc gctgcgcatc ctgtacgtga tgcgcctggc gcgccactcc 1081 ctggggctgc agacgctggg gctcacggcc cgccgctgca cccgcgagtt cgggctcctg 1141 ctgctcttcc tctgcgtggc catcgccctc ttcgcgcccc tgctctacgt catcgagaac 1201 gagatggccg acagccccga gttcaccagc atccctgcct gctactggtg ggctgtcatc 1261 accatgacga cggtggacta tggcgacatg gtccccagga gcaccccggg ccaggtagtg 1321 gccctgagca gcatcctgag cggcatcctg ctcatggcct tcccagtcac ctccatcttc 1381 cacaccttct ccccctccta cctggagctc aaacaggagc aagagagggt gatgttccgg 1441 agggcgcagt tcctcatcaa aaccaagtcg cagctgagcg tgtcccagga cagtgacatc 1501 ttgttcggaa gtgcctcctc ggacaccaga gacaataact ga // LOCUS AF033850 3388 bp mRNA PRI 27-NOV-1997 DEFINITION Homo sapiens phospholipase D2 (PLD2) mRNA, complete cds. ACCESSION AF033850 NID g2645857 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3388) AUTHORS Steed,P.M., Clark,K.L., Boyar,W.C. and Lasala,D.J. TITLE Phospholipase D: molecular cloning and characterization of human PLD2 and the analysis of PLD isoform splice variants JOURNAL Unpublished REFERENCE 2 (bases 1 to 3388) AUTHORS Steed,P.M. and Lasala,D.J. TITLE Direct Submission JOURNAL Submitted (10-NOV-1997) Research, Novartis Pharmaceuticals, 556 Morris Ave, Summit, NJ 07901, USA FEATURES Location/Qualifiers source 1..3388 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..3388 /gene="PLD2" CDS 162..2963 /gene="PLD2" /codon_start=1 /product="phospholipase D2" /db_xref="PID:g2645858" /translation="MTATPESLFPTGDELDSSQLQMESDEVDTLKEGEDPADRMHPFL AIYELQSLKVHPLVFAPGVPVTAQVVGTERYTSGSKVGTCTLYSVRLTHGDFSWTTKK KYRHFQELHRDLLRHKVLMSLLPLARFAVAYSPARDAGNREMPSLPRAGPEGSTRHAA SKQKYLENYLNCLLTMSFYRNYHAMTEFLEVSQLSFIPDLGRKGLEGMIRKRSGGHRV PGLTCCGRDQVCYRWSKRWLVVKDSFLLYMCLETGAISFVQLFDPGFEVQVGKRSTEA RHGVRIDTSHRSLILKCSSYRQARWWAQEITELAQGPGRDFLQLHRHDSYAPPRPGTL ARWFVNGAGYFAAVADAILRAQEEIFITDWWLSPEVYLKRPAHSDDWRLDIMLKRKAE EGVRVSILLFKEVELALGINSGYSKRALMLLHPNIKVMRHPDQVTLWAHHEKLLVVDQ VVAFLGGLDLAYGRWDDLHYRLTDLGDSSESAASQPPTPRPDSPATPDLSHNQFFWLG KDYSNLITKDWVQLDRPFEDFIDRETTPRMPWRDVGVVVHGLPARDLARHFIQRWNFT KTTKAKYKTPTYPYLLPKSTSTANQLPFTLPGGQCTTVQVLRSVDRWSAGTLENSILN AYLHTIRESQHFLYIENQFFISCSDGRTVLNKVGDEIVDRILKAHKQGWCYRVYVLLP LLPGFEGDISTGGGNSIQAILHFTYRTLCRGEYSILHRLKAAMGTAWRDYISICGLRT HGELGGHPVSELIYIHSKVLIADDRTVIIGSANINDRSLLGKRDSELAVLIEDTETEP SLMNGAEYQAGRFALSLRKHCFGVILGANTRPDLDLRDPICDDFFQLWQDMAESNANI YEQIFRCLPSNATRSLRTLREYVAVEPLATVSPPLARSELTQVQGHLVHFPLKFLEDE SLLPPLGSKEGMIPLEVWT" BASE COUNT 698 a 1029 c 961 g 700 t ORIGIN 1 ccatcctaat acgactcact atagggctcg agcggccgcc cgggcaggtc cggccccgct 61 tcggccggcc ccgcctcggc cggggcgtgg gctccggctg cagctccggt ctgctctctt 121 ggctcgggaa cccccgcggg cgctggctcc gtctgccagg gatgacggcg acccctgaga 181 gcctcttccc cactggggac gaactggact ccagccagct ccagatggag tccgatgagg 241 tggacaccct gaaggaggga gaggacccag ccgaccggat gcacccgttt ctggccatct 301 atgagcttca gtctctgaaa gtgcacccct tggtgttcgc acctggggtc cctgtcacag 361 cccaggtggt gggcaccgaa agatatacca gcggatccaa ggtgggaacc tgcactctgt 421 attctgtccg cttgactcac ggcgactttt cctggacaac caagaagaaa taccgtcatt 481 ttcaggagct gcatcgggac ctcctgagac acaaagtctt gatgagtctg ctccctctgg 541 ctcgatttgc cgttgcctat tctccagccc gagatgcagg caacagagag atgccctctc 601 taccccgggc aggtcctgag ggctccacca gacatgcagc cagcaaacag aaatacctgg 661 agaattacct caactgtctc ttgaccatgt ctttctatcg caactaccat gccatgacag 721 agttcctgga agtcagtcag ctgtccttta tcccggactt gggccgcaaa ggactggagg 781 ggatgatccg gaagcgctca ggtggccacc gtgttcctgg cctcacctgc tgtggccgag 841 accaagtttg ttatcgctgg tccaagaggt ggctggtggt gaaggactcc ttcctgctgt 901 acatgtgcct cgagacaggt gccatctcat ttgttcagct ctttgaccct ggctttgagg 961 tgcaagtggg gaaaaggagc acggaggcac ggcacggcgt gcggatcgat acctcccaca 1021 ggtccttgat tctcaagtgc agcagctacc ggcaggcacg gtggtgggcc caagagatca 1081 ctgagctggc acagggccca ggcagagact tcctacagct gcaccggcat gacagctacg 1141 ccccaccccg gcctgggacc ttggcccggt ggtttgtgaa tggggcaggt tactttgctg 1201 ctgtggcaga tgccatcctt cgagctcaag aggagatttt catcacagac tggtggttga 1261 gtcctgaggt ttacctgaag cgtccggccc attcagatga ctggagactg gacattatgc 1321 tcaagaggaa ggcggaggaa ggtgtccgtg tgtctattct gctgtttaaa gaagtggaat 1381 tggccttggg catcaacagt ggctatagca agagggcgct gatgctgctg caccccaaca 1441 taaaggtgat gcgtcaccca gaccaagtga cgttgtgggc ccatcatgag aagctcctgg 1501 tggtggacca agtggtagca ttcctggggg gactggacct tgcctatggc cgctgggatg 1561 acctgcacta ccgactgact gaccttggag actcctctga atcagctgcc tcccagcctc 1621 ccaccccgcg cccagactca ccagccaccc cagacctctc tcacaaccaa ttcttctggc 1681 tgggcaagga ctacagcaat cttatcacca aggactgggt gcagctggac cggcctttcg 1741 aagatttcat tgacagggag acgacccctc ggatgccatg gcgggacgtt ggggtggtcg 1801 tccatggcct accggcccgg gaccttgccc ggcacttcat ccagcgctgg aacttcacca 1861 agaccaccaa ggccaagtac aagactccca cataccccta cctgcttccc aagtctacca 1921 gcacggccaa tcagctcccc ttcacacttc caggagggca gtgcaccacc gtacaggtct 1981 tgcgatcagt ggaccgctgg tcagcaggga ctctggagaa ctccatcctc aatgcctacc 2041 tgcacaccat cagggagagc cagcacttcc tctacattga gaatcagttc ttcattagct 2101 gctcagatgg gcggacggtt ctgaacaagg tgggcgatga gattgtggac agaatcctga 2161 aggcccacaa acaggggtgg tgttaccgag tctacgtgct tttgccctta ctccctggct 2221 tcgagggtga catctccacg ggcggtggca actccatcca ggccattctg cactttactt 2281 acaggaccct gtgtcgtggg gagtattcaa tcctgcatcg ccttaaagca gccatgggga 2341 cagcatggcg ggactatatt tccatctgcg ggcttcgtac acacggagag ctgggcgggc 2401 accccgtctc ggagctcatc tacatccaca gcaaggtgct catcgcagat gaccggacag 2461 tcatcattgg ttctgcaaac atcaatgacc ggagcttgct ggggaagcgg gacagtgagc 2521 tggccgtgct gatcgaggac acagagacgg aaccatccct catgaatggg gcagagtatc 2581 aggcgggcag gtttgccttg agtctgcgga agcactgctt cggtgtgatt cttggagcaa 2641 atacccggcc agacttggat ctccgagacc ccatctgtga tgacttcttc cagttgtggc 2701 aagacatggc tgagagcaac gccaatatct atgagcagat cttccgctgc ctgccatcca 2761 atgccacgcg ttccctgcgg actctccggg agtacgtggc cgtggagccc ttggccacgg 2821 tcagtccccc cttggctcgg tctgagctca cccaggtcca gggccacctg gtccacttcc 2881 ccctcaagtt cctagaggat gagtctttgc tgcccccgct gggtagcaag gagggcatga 2941 tccccctaga agtgtggaca tagttgaggc ccccgtcagg gagaggtcac cagctgctgt 3001 gccccaccac gtctggctcc ctgcccctta accccaagga ctgagggcag tgccctttga 3061 gatctgggga ggcaggcatt cctgaaggga actagaggtg ttacagagga cccttacgtg 3121 agaaatagct gaaaagggca ctcccaaccc tgggctgggg aggaggagag agtcccagag 3181 ctcatccccc ctgctgccca gtgcaaacca cttctccatg ctgcaaagga gaagcacagc 3241 tcctgccagg gtgagcaggg tcaagcctct tattccagga gaaggggctc tgccccaggc 3301 cctactaccc attgttccct tcctcttcct gcccttgaac cccctccctg tcccagggcc 3361 ctcccagccc attgctgcca aggtggag // LOCUS AF034207 1799 bp mRNA PRI 29-DEC-1997 DEFINITION Homo sapiens RIG-like 14-1 mRNA, complete cds. ACCESSION AF034207 NID g2724103 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1799) AUTHORS Ligon,A.H., Pershouse,M.A., Jasser,S., Hong,Y.K., Yung,W.K.A. and Steck,P.A. TITLE Direct Submission JOURNAL Submitted (13-NOV-1997) Neuro-Oncology, University of Texas M.D. Anderson Cancer Center, 1515 Holcombe Blvd., Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..1799 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="12" /tissue_type="brain" /dev_stage="fetus" CDS 785..1042 /codon_start=1 /product="RIG-like 14-1" /db_xref="PID:g2724104" /translation="MCLNYEGFQDYRVCIKVVVGEEGEEDHCLTNIYSKNVRPKVKRV QEGDFRTSLIDSIDQTITHRVREVCLLGKELSQNKVGSQNY" BASE COUNT 550 a 426 c 401 g 417 t 5 others ORIGIN 1 cggtcacacg accttaaggc cgaacaaagt gaaaaccatt atgctgagtg atatcccact 61 taacccatgg cccggggggg agctccagct gccatagcta ttcgaactat agcttaagga 121 agtgtaacaa gagaaaaggg tatctctanc tttgcctaaa agtcccctga gtgtccgacc 181 tctcctaggt acctcgtccg gacgtttaac gggtcctagt ataagaaaac cccctagtaa 241 cagtaggaca gccctctcca gtagacatac caatcaaggg agtaagaact gaagacacca 301 agggtgtctt ctccacccac ctgatcgtcc tcctgaccgt catttcgaac acgtgggaca 361 tctacgcaac taaagtttta taagaaccaa cactaagtaa gccactttta ggtctcttct 421 gtgttaccta caagcgacaa atccgacccc agcttactga aaatgttata aaaactcccg 481 tcatagtccc ctacctccac ttgttagtcc acaacccgga caactcgnna aagataccaa 541 tgatcaagga cctcatcatt gacgatggag tctccgtaat agtcaagaac cccttccacg 601 ggaacgttaa agattcctcg ttccaaagaa acattgtcga cacaggtact ctagtgtgtc 661 tttcaagagt aaagacttcc tttacatagg tctcttngtc taccatgtta aaggtatcac 721 attaaagaga agaaacctat cctgaggacc ctttcgtacc ccttcggttc caaggatgta 781 ggtaatgtgc ctcaattacg aagggtttca ggactaccgg gtgtgcataa aagtagtagt 841 aggagaagaa ggtgaagagg accactgttt aactaacatc tactccaaga atgtgagacc 901 gaaggtaaaa agggttcaag aaggagactt tagaacgagt ttaatagatt cgatcgacca 961 gactataaca caccgtgttc gagaagtttg cttattagga aaagaactgt ctcaaaataa 1021 agtaggttct caaaactatt aagaggtcac tgccaaagtg aaaaccagaa aacccttcct 1081 ctgagttgtc ctctacttta cacacaaaga acacaacgta gaaggacatg tccggaagct 1141 cctgacatct agaagggtgc ggaagattta tgagaacatt ctcacaacgt tagacctctc 1201 taaagactct acctacaaga aagtcttaca ggttctgtca agatggattc gataggaaat 1261 acaaccgtag ttaaggacaa agattgtctt gagcatgttg taggtggaaa ggtttgtttc 1321 gacgaagtac ttcacgtgag gggaaaagac aaactgtcga gtgtaggtaa ggacggaggt 1381 cgtcgtggac gtgctgacga aacaccggta acgcgcgtcg ttccacttca ccncacacga 1441 acgctcacaa cgtcgagtaa ttcaatccta cacgtgacta ctagtaaaaa tggtgagatt 1501 cggcaggcat ctcacggcct ttataccaac atatacgcga cgcttgtgac ttgagatgcg 1561 gtgaggtgtt tactacaaaa gtccacagta cctgacaacg gtggtacata agtaggtctc 1621 aagctaagga cgtcgggccc cctaggtgat caagatctcg ccggcgtggc gccacctcga 1681 ggtcgaaaac aagggaaatc actcccaatt aacttcggct taagacgtct ataggtagtg 1741 tgaccgccgg cgagctcgta cgtagatctc ccgggttaag cgggatatca ctcagcata // LOCUS AF034374 3013 bp mRNA PRI 27-NOV-1997 DEFINITION Homo sapiens molybdenum cofactor biosynthesis protein A and molybdenum cofactor biosynthesis protein C mRNA, complete cds. ACCESSION AF034374 NID g2645878 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3013) AUTHORS Larin,D., Ross,B.M. and Gilliam,T.C. TITLE Direct Submission JOURNAL Submitted (11-NOV-1997) Psychiatry, Columbia University, 1150 St. Nicholas Ave., Room 536, New York, NY 10032, USA FEATURES Location/Qualifiers source 1..3013 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" CDS 18..1175 /codon_start=1 /product="molybdenum cofactor biosynthesis protein A" /db_xref="PID:g2645879" /translation="MAARPLSRMLRRLLRSSARSCSSGAPVTQPCPGESARAASEEVS RRRQFLREHAAPFSAFLTDSFGRQHSYLRISLTEKCNLRCQYCMPEEGVPLTPKANLL TTEEILTLARLFVKEGIDKIRLTGGEPLIRPDVVDIVAQLQRLEGLRTIGVTTNGINL ARLLPQLQKAGLSAINISLDTLVPAKFEFIVRRKGFHKVMEGIHKAIELGYNPVKVNC VVMRGLNEDELLDFAALTEGHPLDVRFIEYMPFDGNKWNFKKMVSYKEMLDTVRQQWP ELEKVPEEESSTAKAFKIPGFQGQISFITSMSEHFCGTCNRLRITADGNLKVCLFGNS EVSLRDHLRAGASEQELLRIIGAAVGRKKRQHAGMFSISQMKNRPMILIGG" CDS 1194..1943 /codon_start=1 /product="molybdenum cofactor biosynthesis protein C" /db_xref="PID:g2645880" /translation="MFPNSPPANPSIFSWDPLHVQGLRPRMSFSSQVATLWKGCRVPQ TPPLAQQRLGSGSFQRHYTSRADSDANSKCLSPGSWASAAPSGPQLTSEQLTHVDSEG RAAMVDVGRKPDTERVAVASAVVLLGPVAFKLVQQNQLKKGDALVVAQLAGVQAAKVT SQLIPLCHHVALSHIQVQLELDSTRHAVKIQASCRARGPTGVEMEALTSAAVAALTLY DMCKAVSRDIVLEEIKLISKTGGQRGDFHRA" BASE COUNT 694 a 876 c 822 g 621 t ORIGIN 1 cgctcgtatc aggcttcatg gcggcgcggc cactgtcccg gatgctgcgg cggcttctga 61 ggtccagcgc ccggagctgc agctcagggg ctccggtgac ccagccctgc cccggggagt 121 ccgcgcgagc tgcctcggag gaggtgtcca ggcggaggca gttcctgcgg gagcatgcgg 181 cccccttctc cgccttcctc acagacagct tcggccggca gcacagctac ctgcggatct 241 ccctcacaga gaagtgcaac ctcagatgtc agtactgcat gcccgaggag ggggtcccgc 301 tgacccccaa agccaacctg ctgaccacag aggagatcct gaccctcgcc cggctctttg 361 tgaaggaagg catcgacaag atccggctca caggtggaga gccgcttatc cggccggacg 421 tggtggacat tgtggcccag ctccagcggc tggaagggct gagaaccata ggtgttacca 481 ccaatggcat caacctggcc cggctactgc cccagcttca gaaggctggt ctcagtgcca 541 tcaacatcag cctggacacc ctggtgcctg ccaagtttga gttcattgtc cgcaggaaag 601 gcttccacaa ggtcatggag ggcatccaca aggccatcga gctgggctac aaccctgtga 661 aggtgaactg tgtggtgatg cgaggcctta acgaggatga actcctggac tttgcggcct 721 tgactgaggg ccaccccctg gatgtgcgct tcatagagta tatgcccttt gatggcaaca 781 agtggaactt caagaagatg gtcagctata aggagatgct agacactgtc cggcagcagt 841 ggccagagct ggagaaggtg ccagaggagg aatccagcac agccaaggcc tttaaaatcc 901 ctggcttcca aggccagatc agcttcatca catccatgtc tgagcatttc tgtgggacct 961 gcaaccgcct gcgaatcaca gctgatggga acctcaaggt ctgcctcttt ggaaactctg 1021 aggtatccct gcgggatcac ctgcgagctg gggcctctga gcaggagctg ctgagaatca 1081 ttggggctgc tgtgggcagg aagaagcggc agcatgcagg catgttcagt atttcccaga 1141 tgaagaaccg gcccatgatc ctcatcggtg ggtgacccat caagttattt ttgatgttcc 1201 ccaattcccc accagccaat ccaagcattt tctcctggga cccgctccat gttcagggtc 1261 taagacccag aatgagtttc tccagccagg tggccacttt atggaaagga tgcagggtcc 1321 cccagacccc tcctctagcc cagcagcggc tggggtctgg ctcctttcag agacactaca 1381 cttcccgtgc agactcagat gccaactcaa agtgccttag cccaggttcc tgggcttctg 1441 ctgccccctc aggaccccag ctaacctcag aacaactaac tcatgtggac tcggaaggac 1501 gggcagctat ggtagatgtg ggcaggaagc cagacacaga gcgggtggct gtggcttcag 1561 ccgtggtcct cctgggaccg gtagccttca agcttgtcca gcagaaccag ctcaagaaag 1621 gagatgccct agtggtggcc cagctggctg gagtccaggc agccaaggtg accagccagc 1681 tgatccctct gtgccaccac gtggccctga gccacatcca ggtgcagctg gagctggaca 1741 gcacacgcca tgccgtgaag atccaggcat cttgccgggc tcggggcccc accggggtgg 1801 agatggaggc cctgacctct gctgcagtgg ccgccctcac cctgtatgac atgtgcaagg 1861 ctgtcagcag ggacatcgtg ttggaggaga tcaagctcat tagcaagact ggtggtcagc 1921 ggggggactt ccatcgggct tagcacctgc ccttctcacc catggcccac ccaggcctgg 1981 agctgggatg caatgtaggc tgagggaaag acgtcaggtt cctttaatca cagtcactgt 2041 ttgtttacct tgagcagtaa acccgaagtc agcctgctct actactaaca aacaggcctg 2101 ctgctagatg atctctaatg accaatgggg cttcctttct atagggagga taccagcagg 2161 cccttaagcc ttccaggaca ctaaggtcgt gggagcggga ctgcaacaag caatgccaga 2221 taactgagaa atcatgttct ttgtggacta tttcagacaa ccaggttccg acagtccagc 2281 ccagaacttt tccttctcat tttgggtttt ctcttctcct gctttcctgg ggagagatta 2341 agcgctcatt aagcagagga gcccactttg aggagagcaa agcacaagct tgcttgaaga 2401 atggatccca acttctcccc ggcagctctg cctccctaag tctgtgaagc cgcagccctg 2461 ccctgtcctg tcctgtcctg acttcatctc tccttctgcc caagtctgtg tcccatcaga 2521 cttgcagcct ttcagcttaa cagttgcccg gtcctgctgg ccccttttcc tctggccccc 2581 ctcttctgaa acaggatgtg cacacatggg ccatagccct aaggactcct gccagaccac 2641 acagcccaca cctggccctg ttcacggctg ttccacccac ccctctttat tctggagcat 2701 atcagggaaa gaaaagttga tgatagattg ccttcaccct cacagcgcac aaataaagct 2761 acgatgccaa ctttgcagat gcaagaatga agacactgtg tgggtagggc actgagctgc 2821 tgcagtttca cagggaaggc tgcacctatc aatcaatcaa tcaatcctat cccaagacac 2881 agttccctga gggaagaaga ggagggacct ggaaaggcct aagggtgtac tctctgtata 2941 gccccgctat gggaaaataa agtggagtag ggggcataga aaaaaaaaaa aaaaaaaaaa 3001 aaaaaaaaaa aaa // LOCUS AF034633 1362 bp mRNA PRI 01-DEC-1997 DEFINITION Homo sapiens orphan G protein-coupled receptor (GPR39) mRNA, complete cds. ACCESSION AF034633 NID g2654160 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1362) AUTHORS McKee,K.K., Tan,C.P., Palyha,O.C., Liu,J., Feighner,S.D., Hreniuk,D.L., Smith,R.G., Van Der Ploeg,L.H.T. and Howard,A.D. TITLE Cloning and Characterization of Two Human G protein-coupled Receptor Genes Related to the Growth Hormone Secretagogue and Neurotensin Receptors JOURNAL Genomics (1997) In press REFERENCE 2 (bases 1 to 1362) AUTHORS McKee,K.K., Tan,C.P., Palyha,O.C., Liu,J., Feighner,S.D., Hreniuk,D.L., Smith,R.G., Van Der Ploeg,L.H.T. and Howard,A.D. TITLE Direct Submission JOURNAL Submitted (17-NOV-1997) Biochemistry and Physiology, Merck and Co., Inc., PO Box 2000, Rahway, NJ 07065, USA FEATURES Location/Qualifiers source 1..1362 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="2" /map="2q21-22 " /dev_stage="fetal" /tissue_type="brain" gene 1..1362 /gene="GPR39" CDS 1..1362 /gene="GPR39" /note="orphan G protein-coupled receptor related to growth hormone secretagogue and neurotensin receptors" /codon_start=1 /product="GPR39" /db_xref="PID:g2654161" /translation="MASPSLPGSDCSQIIDHSHVPEFEVATWIKITLILVYLIIFVMG LLGNSATIRVTQVLQKKGYLQKEVTDHMVSLACSDILVFLIGMPMEFYSIIWNPLTTS SYTLSCKLHTFLFEACSYATLLHVLTLSFERYIAICHPFRYKAVSGPCQVKLLIGFVW VTSALVALPLLFAMGTEYPLVNVPSHRGLTCNRSSTRHHEQPETSNMSICTNLSSRWT VFQSSIFGAFVVYLVVLLSVAFMCWNMMQVLMKSQKGSLAGGTRPPQLRKSESEESRT ARRQTIIFLRLIVVTLAVCWMPNQIRRIMAAAKPKHDWTRSYFRAYMILLPFSETFFY LSSVINPLLYTVSSQQFRRVFVQVLCCRLSLQHANHEKRLRVHAHSTTDSARFVQRPL LFASRRQSSARRTEKIFLSTFQSEAEPQSKSQSLSLESLEPNSGAKPANSAAENGFQE HEV" BASE COUNT 263 a 435 c 362 g 302 t ORIGIN 1 atggcttcac ccagcctccc gggcagtgac tgctcccaaa tcattgatca cagtcatgtc 61 cccgagtttg aggtggccac ctggatcaaa atcaccctta ttctggtgta cctgatcatc 121 ttcgtgatgg gccttctggg gaacagcgcc accattcggg tcacccaggt gctgcagaag 181 aaaggatact tgcagaagga ggtgacagac cacatggtga gtttggcttg ctcggacatc 241 ttggtgttcc tcatcggcat gcccatggag ttctacagca tcatctggaa tcccctgacc 301 acgtccagct acaccctgtc ctgcaagctg cacactttcc tcttcgaggc ctgcagctac 361 gctacgctgc tgcacgtgct gacactcagc tttgagcgct acatcgccat ctgtcacccc 421 ttcaggtaca aggctgtgtc gggaccttgc caggtgaagc tgctgattgg cttcgtctgg 481 gtcacctccg ccctggtggc actgcccttg ctgtttgcca tgggtactga gtaccccctg 541 gtgaacgtgc ccagccaccg gggtctcact tgcaaccgct ccagcacccg ccaccacgag 601 cagcccgaga cctccaatat gtccatctgt accaacctct ccagccgctg gaccgtgttc 661 cagtccagca tcttcggcgc cttcgtggtc tacctcgtgg tcctgctctc cgtagccttc 721 atgtgctgga acatgatgca ggtgctcatg aaaagccaga agggctcgct ggccgggggc 781 acgcggcctc cgcagctgag gaagtccgag agcgaagaga gcaggaccgc caggaggcag 841 accatcatct tcctgaggct gattgttgtg acattggccg tatgctggat gcccaaccag 901 attcggagga tcatggctgc ggccaaaccc aagcacgact ggacgaggtc ctacttccgg 961 gcgtacatga tcctcctccc cttctcggag acgtttttct acctcagctc ggtcatcaac 1021 ccgctcctgt acacggtgtc ctcgcagcag tttcggcggg tgttcgtgca ggtgctgtgc 1081 tgccgcctgt cgctgcagca cgccaaccac gagaagcgcc tgcgcgtaca tgcgcactcc 1141 accaccgaca gcgcccgctt tgtgcagcgc ccgttgctct tcgcgtcccg gcgccagtcc 1201 tctgcaagga gaactgagaa gattttctta agcacttttc agagcgaggc cgagccccag 1261 tctaagtccc agtcattgag tctcgagtca ctagagccca actcaggcgc gaaaccagcc 1321 aattctgctg cagagaatgg ttttcaggag catgaagttt ga // LOCUS AF034759 2663 bp mRNA PRI 30-NOV-1997 DEFINITION Homo sapiens MutS homolog (MSH5) mRNA, complete cds. ACCESSION AF034759 NID g2653648 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2663) AUTHORS Bocker,T., Rasio,D., Copeland,T., Kovatich,A. and Fishel,R.A. TITLE hMSH5, a human MutS homolog JOURNAL Unpublished REFERENCE 2 (bases 1 to 2663) AUTHORS Bocker,T., Rasio,D., Copeland,T., Kovatich,A. and Fishel,R.A. TITLE Direct Submission JOURNAL Submitted (15-NOV-1997) Kimmel Cancer Center, Thomas Jefferson University, 233 South 10th Street, RM 939, Philadelphia, PA 19107, USA FEATURES Location/Qualifiers source 1..2663 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6p21.3-p22.1" gene 1..2663 /gene="MSH5" CDS 61..2565 /gene="MSH5" /codon_start=1 /product="MutS homolog" /db_xref="PID:g2653649" /translation="MASLGANPRRTPQGPRPGAASSGFPSPAPVPGPREAEEEEVEEE EELAEIHLCVLWNSGYLGIAYYDTSDSTIHFMPDAPDHESLKLLQRVLDEINPQSVVT SAKQDENMTRFLGKLASQEHREPKRPEIIFLPSVDFGLEISKQRLLSGNYSFIPDAMT ATEKILFLSSIIPFDCLLTVRALGGLLKFLGRRRIGVELEDYNVSVPILGFKKFMLTH LVNIDQDTYSVLQIFKSESHPSVYKVASGLKEGLSLFGILNRCHCKWGEKLLRLWFTR PTHDLGELSSRLDVIQFFLLPQNLDMAQMLHRLLGHIKNVPLILKRMKLSHTKVSDWQ VLYKTVYSALGLRDACRSLPQSIQLFRDIAQEFSDDLHHIASLIGKVVDFEGSLAENR FTVLPNIDPEIDEKKRRLMGLPSFLTEVARKELENLDSRIPSCSVIYIPLIGFLLSIP RLPSMVEASDFEINGLDFMFLSEEKLHYRSARTKELDALLGDLHCEIRDQETLLMYQL QCQVLARAAVLTRVLDLASRLDVLLALASAARDYGYSRPRYSPQVLGVRIQNGRHPLM ELCARTFVPNSTECGGDKGRVKVITGPNSSGKSIYLKQVGLITFMALVGSFVPAEEAE IGAVDAIFTRIHSCESISLGLSTFMIDLNQVAKAVNNATAQSLVLIDEFGKGTNTVDG LALLAAVLRHWLARGPTCPHIFVATNFLSLVQLQLLPQGPLVQYLTMETCEDGNDLVF FYQVCEGVAKASHASHTAAQAGLPDKLVARGKEVSDLIRSGKPIKPVKDLLKKNQMEN CQTLVDKFMKLDLEDPNLDLNVFMSQEVLPAATSIL" BASE COUNT 603 a 736 c 694 g 630 t ORIGIN 1 gcgtggcggt cggtcagcgg ggcgttctcc cacctgtagc gactcagagc ctccaagctc 61 atggcctcct taggagcgaa cccaaggagg acaccgcagg gaccgagacc tggggcggcc 121 tcctccggct tccccagccc ggccccagtg ccgggcccca gggaggccga ggaggaggaa 181 gtcgaggagg aggaggagct ggccgagatc catctgtgtg tgctgtggaa ttcaggatac 241 ttgggcattg cctactatga tactagtgac tccactatcc acttcatgcc agatgcccca 301 gaccacgaga gcctcaagct tctccagaga gttctggatg agatcaatcc ccagtctgtt 361 gttacgagtg ccaaacagga tgagaatatg actcgatttc tgggaaagct tgcctcccag 421 gagcacagag agcctaaaag acctgaaatc atatttttgc caagtgtgga ttttggtctg 481 gagataagca aacaacgcct cctttctgga aactactcct tcatcccaga cgccatgact 541 gccactgaga aaatcctctt cctctcttcc attattccct ttgactgcct cctcacagtt 601 cgagcacttg gagggctgct gaagttcctg ggtcgaagaa gaatcggggt tgaactggaa 661 gactataatg tcagcgtccc catcctgggc tttaagaaat ttatgttgac tcatctggtg 721 aacatagatc aagacactta cagtgttcta cagattttta agagtgagtc tcacccctca 781 gtgtacaaag tggccagtgg actgaaggag gggctcagcc tctttggaat cctcaacaga 841 tgccactgta agtggggaga gaagctgctc aggctatggt tcacacgtcc gactcatgac 901 ctgggggagc tcagttctcg tctggacgtc attcagtttt ttctgctgcc ccagaatctg 961 gacatggctc agatgctgca tcggctcctg ggtcacatca agaacgtgcc tctgattctg 1021 aaacgcatga agttgtccca caccaaggtc agcgactggc aggttctcta caagactgtg 1081 tacagtgccc tgggcctgag ggatgcctgc cgctccctgc cgcagtccat ccagctcttt 1141 cgggacattg cccaagagtt ctctgatgac ctgcaccata tcgccagcct cattgggaaa 1201 gtagtggact ttgagggcag ccttgctgaa aatcgcttca cagtcctccc caacatagat 1261 cctgaaattg atgagaaaaa gcgaagactg atgggacttc ccagtttcct tactgaggtt 1321 gcccgcaagg agctggagaa tctggactcc cgtattcctt catgcagtgt catctacatc 1381 cctctgattg gcttccttct ttctattccc cgcctgcctt ccatggtaga ggccagtgac 1441 tttgagatta atggactgga cttcatgttt ctctcagagg agaagctgca ctatcgtagt 1501 gcccgaacca aggagctgga tgcattgctg ggggacctgc actgcgagat ccgggaccag 1561 gagacgctgc tgatgtacca gctacagtgc caggtgctgg cacgagcagc tgtcttaacc 1621 cgagtattgg accttgcctc ccgcctggac gtcctgctgg ctcttgccag tgctgcccgg 1681 gactatggct actcaaggcc gcgttactcc ccacaagtcc ttggggtacg aatccagaat 1741 ggcagacatc ctctgatgga actctgtgcc cgaacctttg tgcccaactc cacagaatgt 1801 ggtggggaca aagggagggt caaagtcatc actggaccca actcatcagg gaagagcata 1861 tacctcaaac aggtaggctt gatcacattc atggccctgg taggcagctt tgtgccagca 1921 gaggaggccg aaattggggc agtagacgcc atcttcacac gaattcatag ctgcgaatcc 1981 atctcccttg gcctctccac cttcatgatc gacctcaacc aggtggcgaa agcagtgaac 2041 aatgccactg cacagtcgct ggtccttatt gatgaatttg gaaagggaac caacacggtg 2101 gatgggctcg cgcttctggc cgctgtgctc cgacactggc tggcacgtgg acccacatgc 2161 ccccacatct ttgtggccac caactttctg agccttgttc agctacaact gctgccacaa 2221 gggcccctgg tgcagtattt gaccatggag acctgtgagg atggcaacga tcttgtcttc 2281 ttctatcagg tttgcgaagg tgttgcgaag gccagccatg cctcccacac agctgcccag 2341 gctgggcttc ctgacaagct tgtggctcgt ggcaaggagg tctcagactt gatccgcagt 2401 ggaaaaccca tcaagcctgt caaggatttg ctaaagaaga accaaatgga aaattgccag 2461 acattagtgg ataagtttat gaaactggat ttggaagatc ctaacctgga cttgaacgtt 2521 ttcatgagcc aggaagtgct gcctgctgcc accagcatcc tctgagagtc cttccagtgt 2581 cctccccagc ctcctgagac tccggtgggc tgccatgccc tctttgtttc cttatctccc 2641 tcagacgcag agtttttagt ttc // LOCUS AF034952 570 bp mRNA PRI 01-DEC-1997 DEFINITION Homo sapiens mast cell function-associated antigen (MAFA) mRNA, complete cds. ACCESSION AF034952 NID g2654176 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 570) AUTHORS Lamers,M.B.A.C., Lamont,A.G. and Williams,D.H. TITLE Direct Submission JOURNAL Submitted (18-NOV-1997) Biology, Peptide Therapeutics, 321 Cambridge Science Park, Milton Road, Cambridge, Cambs CB4 4WG, UK FEATURES Location/Qualifiers source 1..570 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="enzyme digested lung" /cell_type="mast" gene 1..570 /gene="MAFA" CDS 1..570 /gene="MAFA" /note="type II integral membrane glycoprotein; similar to C-type animal lectins" /codon_start=1 /product="mast cell function-associated antigen" /db_xref="PID:g2654177" /translation="MTDSVIYSMLELPTATQAQNDYGPQQKSSSSKPSCSCLVAITLG LLTAVLLSVLLYQWILCQGSNYSTCASCPSCPDRWMKYGNHCYYFSVEEKDWNSSLEF CLARDSHLLVITDNQEMSLLQVFLSEAFCWIGLRNNSGWRWEDGSPLNFSRISSNSFV QTCGAINKNGLQASSCEVPLHGVCKKVRL" BASE COUNT 139 a 135 c 134 g 162 t ORIGIN 1 atgactgaca gtgttattta ttccatgtta gagttgccta cggcaaccca agcccagaat 61 gactacggac cacagcaaaa atcttcctct tccaagcctt cttgttcttg ccttgtggca 121 ataactttgg ggcttctgac tgcagttctt ctgagtgtgc tgctatacca gtggatcctg 181 tgccagggct ccaactactc cacttgtgcc agctgtccta gctgcccaga ccgctggatg 241 aaatatggta accattgtta ttatttctca gtggaggaaa aggactggaa ttctagtctg 301 gaattctgcc tagccagaga ctcacacctc cttgtgataa cggacaatca ggaaatgagc 361 ctgctccaag ttttcctcag tgaggccttt tgctggattg gtctgaggaa caattctggc 421 tggaggtggg aagacggatc acctctaaac ttctcaagga tttcttctaa tagctttgtg 481 cagacatgcg gtgccatcaa caaaaatggt cttcaagcct caagctgtga agttccttta 541 cacggggtgt gtaagaaggt cagactttga // LOCUS AF035280 1522 bp mRNA PRI 04-DEC-1997 DEFINITION Homo sapiens clone 23689 mRNA, complete cds. ACCESSION AF035280 NID g2661030 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1522) AUTHORS Andersson,B., Wentland,M.A., Ricafrente,J.Y., Liu,W. and Gibbs,R.A. TITLE A 'double adaptor' method for improved shotgun library construction JOURNAL Anal. Biochem. 236 (1), 107-113 (1996) MEDLINE 96207227 REFERENCE 2 (bases 1 to 1522) AUTHORS Yu,W., Andersson,B., Worley,K.C., Muzny,D.M., Ding,Y., Liu,W., Ricafrente,J.Y., Wentland,M.A., Lennon,G. and Gibbs,R.A. TITLE Large-scale concatenation cDNA sequencing JOURNAL Genome Res. 7 (4), 353-358 (1997) MEDLINE 97264341 REFERENCE 3 (bases 1 to 1522) AUTHORS Yu,W., Sarginson,J. and Gibbs,R.A. TITLE Direct Submission JOURNAL Submitted (20-NOV-1997) Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza S930, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..1522 /organism="Homo sapiens" /db_xref="taxon:9606" /note="This clone is similar to Rattus norvegicus eIF-2B beta subunit mRNA encoded by GenBank Accession Number U31880. The I.M.A.G.E. Consortium clone ID number is 23689" /clone_lib="1NIB" /sex="female" /dev_stage="infant" /tissue_type="brain" /clone="23689" CDS 47..1102 /note="similar to translation initiation factor eIF-2B beta subunit" /codon_start=1 /db_xref="PID:g2661031" /translation="MPGSAAKGSELSERIESFVETLKRGGGPRSSEEMARETLGLLRQ IITDHRWSNAGELMELIRREGRRMTAAQPSETTVGNMVRRVLKIIREEYGRLHGRSDE SDQQESLHKLLTSGGLNEDFSFHYAQLQSNIIEAINELLVELEGTMENIAAQALEHIH SNEVIMTIGFSRTVEAFLKEAARKRKFHVIVAECAPFCQGHEMAVNLSKAGIETTVMT DAAIFAVMSRVNKVIIGTKTILANGALRAVTGTHTLALAAKHHSTPLIVCAPMFKLSP QFPNEEDSFHKFVAPEEVLPFTEGDILEKVSVHCPVFDYVPPELITLFISNIGGNAPS YIYRLMSELYHPDDHVL" BASE COUNT 400 a 372 c 397 g 353 t ORIGIN 1 caggtgtgga ttccgccggt gaaggctgaa ggcagctacc ttaaagatgc cgggatccgc 61 agcgaagggc tcggagttgt cagagaggat cgagagcttc gtggagaccc tgaagcgggg 121 tggtgggccg cgcagctccg aggaaatggc tcgggagacc ctagggttgc tgcgccagat 181 catcacggac caccgctgga gcaacgcggg ggagctgatg gagctgatcc gcagagaggg 241 caggaggatg acggccgctc agccctccga gaccaccgtg ggcaacatgg tgcggagagt 301 gctcaagatt atccgggagg agtatggcag actccatgga cgcagcgacg agagtgatca 361 gcaggagtcc ctgcacaaac tgttgacatc cggaggccta aacgaggatt tcagcttcca 421 ttatgcccaa ctccagtcca acatcattga ggcgattaat gagctgctag tggagctgga 481 agggacaatg gagaacattg cagcccaggc tctggagcac attcactcca atgaggtgat 541 catgaccatt ggcttctccc gaacagtaga ggccttcctc aaagaggctg cccgaaagag 601 gaaattccat gtcattgtag cagagtgtgc tcctttctgc cagggtcatg aaatggctgt 661 gaatttgtcc aaagcaggta ttgagacaac tgtcatgact gatgctgcca tttttgccgt 721 tatgtcaaga gtcaacaagg tgatcattgg cacgaagacc atcctggcca atggggccct 781 gagagctgtg acaggaactc acactctggc actggcagca aaacaccatt ccaccccact 841 catcgtctgt gcacctatgt tcaaactttc tccacagttc cccaatgaag aagactcatt 901 tcataagttt gtggctcctg aagaagtcct gccattcaca gaaggggaca ttctggagaa 961 ggtcagcgtg cattgccctg tgtttgacta cgttccccca gagctcatta ccctctttat 1021 ctccaacatt ggtgggaatg caccttccta catctaccgc ctgatgagtg aactctacca 1081 tcctgatgat catgttttat gaccgaccac acgtgtccta agcagattgc ttaggcagat 1141 acagaatgaa gaggagactt gagtgttgct gctgaagcac atccttgcaa tgtgggagtg 1201 cacaggagtc cacctaaaaa aaaaatcctt gatactgttg cctgcctttt tagtcacccc 1261 gtaacaaggg cacacatcca gcactgtgtc ttgcctttca gatcttaaca gagcagcagg 1321 gcttaacttg ttgattttgg agcctcttag tgacctggtt gcgtctgtgt caggaactta 1381 aactttctgg ttcagtagtg tgttaaacat aacactgaat accttactgg gatacagatt 1441 tttgctcaga aatggctatg acactttttc taggctctac caataaaagc cacttgaagg 1501 ttcaaaaaaa aaaaaaaaaa aa // LOCUS AF035302 2004 bp mRNA PRI 04-DEC-1997 DEFINITION Homo sapiens clone 23717 mRNA, complete cds. ACCESSION AF035302 NID g2661061 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2004) AUTHORS Andersson,B., Wentland,M.A., Ricafrente,J.Y., Liu,W. and Gibbs,R.A. TITLE A 'double adaptor' method for improved shotgun library construction JOURNAL Anal. Biochem. 236 (1), 107-113 (1996) MEDLINE 96207227 REFERENCE 2 (bases 1 to 2004) AUTHORS Yu,W., Andersson,B., Worley,K.C., Muzny,D.M., Ding,Y., Liu,W., Ricafrente,J.Y., Wentland,M.A., Lennon,G. and Gibbs,R.A. TITLE Large-scale concatenation cDNA sequencing JOURNAL Genome Res. 7 (4), 353-358 (1997) MEDLINE 97264341 REFERENCE 3 (bases 1 to 2004) AUTHORS Yu,W., Sarginson,J. and Gibbs,R.A. TITLE Direct Submission JOURNAL Submitted (20-NOV-1997) Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza S930, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..2004 /organism="Homo sapiens" /db_xref="taxon:9606" /note="This clone is similar to human GT24 mRNA encoded by GenBank Accession Number U72665. The I.M.A.G.E. Consortium clone ID number is 23717" /clone_lib="1NIB" /sex="female" /dev_stage="infant" /tissue_type="brain" /clone="23717" CDS 480..1445 /note="similar to delta-catenin" /codon_start=1 /db_xref="PID:g2661062" /translation="MTVWCARWSTALRNMALDVRNKELIGKYAMRDLVHRLPGGNNSN NTASKAMSDDTVTAVCCTLHEVITKNMENAKALRDAGGIEKLVGISKSKGDKHSPKVV KAASQVLNSMWQYRDLRSLYKKDGWSQYHFVASSSTIERDRQRPYSSSRTPSISPVRV SPNNRSASAPASPREMISLKERKTDYECTGSNATYHGAKGEHTSRKDAMTAQNTGIST LYRNSYGAPAEDIKHNQVSAQPVPQEPSRKDYETYQPFQNSTRNYDESFFEDQVHHRP PASEYTMHLGLKSTGNYVDFYSAARPYSELNYETSHYPASPDSWV" BASE COUNT 573 a 503 c 513 g 415 t ORIGIN 1 cgtgatccag tctgcgctgg ggagcagtga gatcgatagc aagaccgttg aaaactgtgt 61 gtgcatttta aggaacctct cgtaccggct ggcggcagaa acgtctcagg gacagcacat 121 gggcacggac gagctggacg ggctactctg tggcgaggcc aatggcaagg atgctgagag 181 ctctgggtgc tggggcaaga agaagaagaa aaagaaatcc caagatcagt gggatggagt 241 aggacctctt ccagactgtg ctgaaccacc aaaagggatc cagatgctgt ggcacccatc 301 aatagtcaaa ccctacctca cactgctctc tgagtgctca aatccagaca cgctggaagg 361 ggcggcaggc gccctgcaga acttggctgc agggagctgg aagtggtcag tatatatccg 421 agccgctgtc cgaaaagaga aaggcctgcc catcctcgtg gagctgctcc gaatagacaa 481 tgaccgtgtg gtgtgcgcgg tggtccactg cgctgcggaa catggccttg gacgtcagaa 541 ataaggagct catcggcaaa tacgccatgc gagacctagt ccacaggctt ccaggaggga 601 acaacagcaa caacactgca agcaaggcca tgtcggatga cacagtgaca gctgtctgct 661 gcacactgca cgaagtgatt accaagaaca tggagaacgc caaggcctta cgggatgccg 721 gtggcatcga gaagttggtc ggcatctcca aaagcaaagg agataaacac tctccaaaag 781 tggtcaaggc tgcatctcag gtcctcaaca gcatgtggca gtaccgagat ctgaggagtc 841 tctacaaaaa ggatggatgg tcacaatacc actttgtagc ctcgtcttca accatcgaga 901 gggaccggca aaggccctac tcctcctccc gcacgccctc catctcccct gtgcgcgtgt 961 ctcccaacaa ccgctcagca agtgccccag cttcacctcg ggaaatgatc agcctcaaag 1021 aaaggaaaac agactacgag tgcaccggca gcaacgccac ctaccacgga gctaaaggcg 1081 aacacacttc caggaaagat gccatgacag ctcaaaacac tggaatttca actttgtata 1141 ggaattctta tggtgcgccc gctgaagaca tcaaacacaa ccaggtttca gcacagccag 1201 tcccacagga gcccagcaga aaagattacg agacctacca gccatttcag aattccacaa 1261 gaaattacga tgagtccttc ttcgaggacc aggtccacca tcgccctccc gccagcgagt 1321 acaccatgca cctgggtctc aagtccaccg gcaactacgt tgacttctac tcagctgccc 1381 gtccctacag tgaactgaac tatgaaacga gccactaccc ggcctccccc gactcctggg 1441 tgtgaggagc agggcacagg cgctccggga acagtgcatg tgcatgcata ccacaagaca 1501 tttctttctg ttttgttttt ttctcctgca aatttagttt gttaaagcct gttccatagg 1561 aaggctgtga taaccagtaa ggaaatatta agagctattt tagaaagcta aatgaatcgc 1621 aagttaactt ggaaatcagt agaaagctaa agtgatccta aatatgacag tgggcagcac 1681 ctttctagcg tgagctgtag agtaacgaga agtgctttat actgaacgtg gttgatggga 1741 ggagagacga ggcattcggg ccggtggggc gtaagggtta tcgttaagca caagacacag 1801 aatagtttac acactgtgtg ggggacggct tctcacgctt tgtttactct cttcatccgt 1861 tgtgactcta ggcttcaggt tgcattgggg ttcctctgta cagcaagatg tttcttgcct 1921 tttgttaatg cattgttgta aagtatttga tgtacattac agattaaaga agaaaaaaaa 1981 aaaaaaaaaa aaaaaaaaaa aaaa // LOCUS AF035360 3334 bp mRNA PRI 02-FEB-1998 DEFINITION Homo sapiens ring finger protein (FXY) mRNA, complete cds. ACCESSION AF035360 NID g2827993 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3334) AUTHORS Perry,J., Feather,S., Smith,A., Palmer,S. and Ashworth,A. TITLE The human FXY maps to chromosome Xp22.3 : Implications for evolution of the human X chromosome JOURNAL Hum. Mol. Genet. 7 (1998) In press REFERENCE 2 (bases 1 to 3334) AUTHORS Perry,J. TITLE Direct Submission JOURNAL Submitted (20-NOV-1997) Gene Function and Regulation, The Institute of Cancer Research, Fulham Rd., London SW3 6JB, UK FEATURES Location/Qualifiers source 1..3334 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /map="Xp22.3" gene 1..3334 /gene="FXY" CDS 88..2091 /gene="FXY" /codon_start=1 /product="ring finger protein" /db_xref="PID:g2827994" /translation="METLESELTCPICLELFEDPLLLPCAHSLCFNCAHRILVSHCAT NESVESITAFQCPTCRHVITLSQRGLDGLKRNVTLQNIIDRFQKASVSGPNSPSETRR ERAFDANTMTSAEKVLCQFCDQDPAQDAVKTCVTCEVSYCDECLKATHPNKKPFTGHR LIEPIPDSHIRGLMCLEHEDEKVNMYCVTDDQLICALCKLVGRHRDHQVAALSERYDK LKQNLESNLTNLIKRNTELETLLAKLIQTCQHVEVNASRQEAKLTEECDLLIEIIQQR RQIIGTKIKEGKVMRLRKLAQQIANCKQCIERSASLISQAEHSLKENDHARFLQTAKN ITERVSMATASSQVLIPEINLNDTFDTFALDFSREKKLLECLDYLTAPNPPTIREELC TASYDTITVHWTSDDEFSVVSYELQYTIFTGQANVVSLCNSADSWMIVPNIKQNHYTV HGLQSGTKYIFMVKAINQAGSRSSEPGKLKTNSQPFKLDPKSAHRKLKVSHDNLTVER DESSSKKSHTPERFTSQGSYGVAGNVFIDSGRHYWEVVISGSTWYAIGLAYKSAPKHE WIGKNSASWALCRCNNNWVVRHNSKEIPIEPAPHLRRVGILLDYDNGSIAFYDALNSI HLYTFDVAFAQPVCPTFTVWNKCLTIITGLPIPDHLDCTEQLP" BASE COUNT 926 a 796 c 730 g 882 t ORIGIN 1 cttcgaaaaa gaactagtgt gcagtccatt gatagctgat cagcttcctt gggttttgct 61 gatgacacaa gagagctttg cctgaagatg gaaacactgg agtcagaact gacctgccct 121 atttgtctgg agctctttga ggaccctctt ctactgccct gcgcacacag cctctgcttc 181 aactgcgccc accgcatcct agtatcacac tgtgccacca acgagtctgt ggagtccatc 241 accgccttcc agtgccccac ctgccggcat gtcatcaccc tcagccagcg aggtctagac 301 gggctcaagc gcaacgtcac cctacagaac atcatcgaca ggttccagaa agcatcagtg 361 agcgggccca actctcccag cgagacccgt cgggagcggg cctttgacgc caacaccatg 421 acctccgccg agaaggtcct ctgccagttt tgtgaccagg atcctgccca ggacgctgtg 481 aagacctgtg tcacttgtga agtatcctac tgtgacgagt gcctgaaagc cactcacccg 541 aataagaagc cctttacagg ccatcgtctg attgagccaa ttccggactc tcacatccgg 601 gggctgatgt gcttggagca tgaggatgag aaggtgaata tgtactgtgt gaccgatgac 661 cagttaatct gtgccttgtg taaactggtt gggcggcacc gcgatcatca ggtggcagct 721 ttgagtgagc gctatgacaa attgaagcaa aacttagaga gtaacctcac caaccttatt 781 aagaggaaca cagaactgga gacccttttg gctaaactca tccaaacctg tcaacatgtt 841 gaagtcaatg catcacgtca agaagccaaa ttgacagagg agtgtgatct tctcattgag 901 atcattcagc aaagacgaca gattattgga accaagatca aagaagggaa ggtgatgagg 961 cttcgcaaac tggctcagca gattgcaaac tgcaaacagt gcattgagcg gtcagcatca 1021 ctcatctccc aagcggaaca ctctctgaag gagaatgatc atgcgcgttt cctacagact 1081 gctaagaata tcaccgagag agtctccatg gcaactgcat cctcccaggt tctaattcct 1141 gaaatcaacc tcaatgacac atttgacacc tttgccttag atttttcccg agagaagaaa 1201 ctgctagaat gtctggatta ccttacagct cccaaccctc ccacaattag agaagagctc 1261 tgcacagctt catatgacac catcactgtg cattggacct ccgatgatga gttcagcgtg 1321 gtctcctacg agctccagta caccatattc accggacaag ccaacgtcgt tagtctgtgt 1381 aattcggctg atagctggat gatagtaccc aacatcaagc agaaccacta cacggtgcac 1441 ggtctgcaga gcggcaccaa gtacatcttc atggtcaagg ccatcaacca ggcgggcagc 1501 cgcagcagtg agcctgggaa gttgaagaca aacagccaac catttaaact ggatcccaaa 1561 tctgctcatc gaaaactgaa ggtgtcccat gataacttga cagtagaacg tgatgagtca 1621 tcatccaaga agagtcacac acctgaacgc ttcaccagcc aggggagcta tggagtagct 1681 ggaaatgtgt ttattgatag tggccggcat tattgggaag tggtcataag tggaagcaca 1741 tggtatgcca ttggtcttgc ttacaaatca gccccgaagc atgaatggat tgggaagaac 1801 tctgcttcct gggcgctctg ccgctgcaac aataactggg tggtgagaca caatagcaag 1861 gaaatcccca ttgagcctgc cccccacctc cggcgcgtgg gcatcctgct ggactatgat 1921 aacggctcta tcgcctttta tgatgctttg aactccatcc acctctacac cttcgacgtc 1981 gcatttgcgc agcctgtttg ccccaccttc accgtgtgga acaagtgtct gacgattatc 2041 actgggctcc ctatcccaga ccatttggac tgcacagagc agctgccgtg agcgtctggc 2101 cacatggagc tgctttctgg ggaacagtaa ggttcaggcc actatttagg ggactgagaa 2161 agcacaggct tcatgagtgt aatgaaatct caccagaagt gtcccgaaat cggctcagat 2221 agggctcaaa acaagagatt cctctccttt tactgtgtct tgtattaagt acgggcttta 2281 ataatttctt taattttttt gtatttagag gaaaatctat agattattta taagagaaac 2341 ataatcagga ttacaacttt taggaattac ttggttttgc acattaagag gcccataagt 2401 ttatcagcta tttacaacct tcatttcatc acaatctgtg ggcttacaaa aaaacaaaaa 2461 cttttgtagt tttgtatgtt actcatcttc ttacctgata tcccatgatg atcccatggt 2521 aggtcttctc acctcgatgg tgcataacag gatgtgtttg aacctagtag gggaggaaac 2581 aggctttctt actctggttt aatttgaagt gttttaattg tgatgtcaaa aagttgtatc 2641 agatcaacta aaatggagag caagacagag aatgaaaaga gttgattttg gacctcggac 2701 cttgccgtgg ctaaatcttt accttctcat agctgatggg ataatgttgg aaagaaaggt 2761 tctgaatcct ttggccacat tttgccctgc ttctctcagg gttaagggtt ctggaagaac 2821 attaagaatg agattgcaat tgaaaatagt cattttgaat cctattgatt attcaaaaat 2881 tcaggctgat ttgtctttta tcagaggtag gattctgttt tatagtatag aatctacttt 2941 atccttcctt ttaatagtac ctttagacct gtgaaatttc ttcactacat ttaatagttc 3001 tcctatttcc cgctccccca tatcaatttt ccttttgtct ccggggctga gtaaataaac 3061 atgttctgtc acaaatagca gcaccacttt ggattgattt tgctctccag gacatcagca 3121 catggccctg atcagcacta ccacatccaa acataagtca ctgaaaaaca cttaatattt 3181 atgagttggt aatgacaagg gacattgtat aaagtactat ttgctagatt catgcctcaa 3241 aagttattat aaacagacct ttattaaaca catcttgaaa gatgtagaag tccctctata 3301 gtctagtata gtttacaata gagttgtaag accc // LOCUS AF035625 2158 bp mRNA PRI 08-JAN-1998 DEFINITION Homo sapiens serine threonine kinase 11 (STK11) mRNA, complete cds. ACCESSION AF035625 NID g2754826 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2158) AUTHORS Jenne,D.E., Reimann,H., Nezu,J., Friedel,W., Loff,S., Jeschke,R., Muller,O., Back,W. and Zimmer,M. TITLE Peutz-Jeghers syndrome is caused by mutations in a novel serine threonine kinase JOURNAL Nature Genet. 18 (1), 38-43 (1998) MEDLINE 98085866 REFERENCE 2 (bases 1 to 2158) AUTHORS Nezu,J. and Jenne,D.E. TITLE Direct Submission JOURNAL Submitted (24-NOV-1997) Neuroimmunologie, Max-Planck-Institute of Psychiatry, Am Klopferspitz 18A, Martinsried 82152, Germany FEATURES Location/Qualifiers source 1..2158 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="19" /map="19p13.3" gene 1..2158 /note="LKB1" /gene="STK11" 5'UTR 1..338 /gene="STK11" CDS 339..1640 /gene="STK11" /note="also LKB1 kinase, encoded by GenBank Accession Number U63333; mutated in patients suffering from the Peutz-Jeghers syndrome (PJS)" /codon_start=1 /product="serine threonine kinase 11" /db_xref="PID:g2754827" /translation="MEVVDPQQLGMFTEGELMSVGMDTFIHRIDSTEVIYQPRRKRAK LIGKYLMGDLLGEGSYGKVKEVLDSETLCRRAVKILKKKKLRRIPNGEANVKKEIQLL RRLRHKNVIQLVDVLYNEEKQKMYMVMEYCVCGMQEMLDSVPEKRFPVCQAHGYFCQL IDGLEYLHSQGIVHKDIKPGNLLLTTGGTLKISDLGVAEALHPFAADDTCRTSQGSPA FQPPEIANGLDTFSGFKVDIWSAGVTLYNITTGLYPFEGDNIYKLFENIGKGSYAIPG DCGPPLSDLLKGMLEYEPAKRFSIRQIRQHSWFRKKHPPAEAPVPIPPSPDTKDRWRS MTVVPYLEDLHGADEDEDLFDIEDDIIYTQDFTVPGQVPEEEASHNGQRRGLPKAVCM NGTEAAQLSTKSRAEGRAPNPARKACSASSKIRRLSACKQQ" 3'UTR 1641..2158 /gene="STK11" BASE COUNT 415 a 670 c 687 g 386 t ORIGIN 1 cccagggtcc ccgaggacga agttgaccct gaccgggccg tctcccagtt ctgaggcccg 61 ggtcccactg gaactcgcgt ctgagccacc gtcccggacc cccggtgccc gccggtccgc 121 agaccctgca ccgggcttgg actcgcagcc gggactgacg tgtagaacaa tcgtttctgt 181 tggaagaagg gtttttccct tccttttggg gtttttgttg cctttttttt ttcttttttc 241 tttgtaaaat tttggagaag ggaagtcgga acacaaggaa ggaccgctca cccgcggact 301 cagggctggc ggcgggactc caggaccctg ggtccagcat ggaggtggtg gacccgcagc 361 agctgggcat gttcacggag ggcgagctga tgtcggtggg tatggacacg ttcatccacc 421 gcatcgactc caccgaggtc atctaccagc cgcgccgcaa gcgggccaag ctcatcggca 481 agtacctgat gggggacctg ctgggggaag gctcttacgg caaggtgaag gaggtgctgg 541 actcggagac gctgtgcagg agggccgtca agatcctcaa gaagaagaag ttgcgaagga 601 tccccaacgg ggaggccaac gtgaagaagg aaattcaact actgaggagg ttacggcaca 661 aaaatgtcat ccagctggtg gatgtgttat acaacgaaga gaagcagaaa atgtatatgg 721 tgatggagta ctgcgtgtgt ggcatgcagg aaatgctgga cagcgtgccg gagaagcgtt 781 tcccagtgtg ccaggcccac gggtacttct gtcagctgat tgacggcctg gagtacctgc 841 atagccaggg cattgtgcac aaggacatca agccggggaa cctgctgctc accaccggtg 901 gcaccctcaa aatctccgac ctgggcgtgg ccgaggcact gcacccgttc gcggcggacg 961 acacctgccg gaccagccag ggctccccgg ctttccagcc gcccgagatt gccaacggcc 1021 tggacacctt ctccggcttc aaggtggaca tctggtcggc tggggtcacc ctctacaaca 1081 tcaccacggg tctgtacccc ttcgaagggg acaacatcta caagttgttt gagaacatcg 1141 ggaaggggag ctacgccatc ccgggcgact gtggcccccc gctctctgac ctgctgaaag 1201 ggatgcttga gtacgaaccg gccaagaggt tctccatccg gcagatccgg cagcacagct 1261 ggttccggaa gaaacatcct ccggctgaag caccagtgcc catcccaccg agcccagaca 1321 ccaaggaccg gtggcgcagc atgactgtgg tgccgtactt ggaggacctg cacggcgcgg 1381 acgaggacga ggacctcttc gacatcgagg atgacatcat ctacactcag gacttcacgg 1441 tgcccggaca ggtcccagaa gaggaggcca gtcacaatgg acagcgccgg ggcctcccca 1501 aggccgtgtg tatgaacggc acagaggcgg cgcagctgag caccaaatcc agggcggagg 1561 gccgggcccc caaccctgcc cgcaaggcct gctccgccag cagcaagatc cgccggctgt 1621 cggcctgcaa gcagcagtga ggctggccgc ctgcagcccg tgtccaggag ccccgccagg 1681 tgcccgcgcc aggccctcag tcttcctgcc ggttccgccc gccctcccgg agaggtggcc 1741 gccatgcttc tgtgccgacc acgccccagg acctccggag cgccctgcag ggccgggcag 1801 ggggacagca gggaccgggc gcagccctcc cccctcggcc gcccggcagt gcacgcggct 1861 tgttgacttc gcagccccgg gcggagcctt cccgggcggg cgtgggagga gggaggcggc 1921 ctccatgcac tttatgtgga gactactggc cccgcccgtg gcctcgtgct ccgcagggcg 1981 cccagcgccg tccggcggcc ccgccgcaga ccagctggcg ggtgtggaga ccaggctcct 2041 gaccccgcca tgcatgcagc gccacctgga agccgcgcgg ccgctttggt tttttgtttg 2101 gttggttcca ttttcttttt ttcttttttt ttttaagaaa aaataaaagg tggatttg // LOCUS AF035718 1254 bp mRNA PRI 05-JAN-1998 DEFINITION Homo sapiens mesoderm-specific basic-helix-loop-helix protein (POD1) mRNA, complete cds. ACCESSION AF035718 NID g2745886 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1254) AUTHORS Quaggin,S.E., Vanden Heuvel,G.B. and Igarashi,P. TITLE Pod-1, A Mesoderm-Specific Basic-Helix-Loop-Helix Protein Expressed in Mesenchymal and Glomerular Epithelial Cells in the Developing Kidney JOURNAL Mech. Dev. (1997) In press REFERENCE 2 (bases 1 to 1254) AUTHORS Quaggin,S.E., Vanden Heuvel,G.B. and Igarashi,P. TITLE Direct Submission JOURNAL Submitted (24-NOV-1997) Internal Medicine, Yale University, 333 Cedar Street, New Haven, CT 06520-8029, USA FEATURES Location/Qualifiers source 1..1254 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" gene 1..1254 /gene="POD1" CDS 261..800 /gene="POD1" /note="Pod-1" /codon_start=1 /product="mesoderm-specific basic-helix-loop-helix protein" /db_xref="PID:g2745887" /translation="MSTGSLSDVEDLQEVEMLECDGLKMDSNKEFVTSNESTEESSNC ENGSPQKGRGGLGKRRRAPTKKSPLSGVSQEGKQVQRNAANARERARMRVLSKAFSRL KTTLPWVPPDTKLSKLDTLRLASSYIAHLRQILANDKYENGYIHPVNLTWPFMVAGKP ESDLKEVVTASRLCGTTAS" misc_feature 492..653 /gene="POD1" /note="encodes basic-helix-loop-helix domain" BASE COUNT 298 a 356 c 317 g 283 t ORIGIN 1 ccacgactct gggagtgggg aaacagagag ccggttcctc tgctgcagaa gtcctcgggg 61 ttccttctca caactctgcg aaggggaaag ggttgtgaga cccaaccaga ccccaactcc 121 agctcccagc aggaggtggc tgcgccacac tcgggaggcc tcttggtttc agggtctctc 181 tgtctctctc tcaccctctt cctcgctttc tctgtctctc tgtctctctc tctctctctc 241 cctcgtccac tcccccaaac atgtccaccg gctccctcag cgatgtggag gaccttcaag 301 aggtggagat gttggaatgt gacgggttga aaatggattc gaacaaggaa tttgtgactt 361 ccaacgagag caccgaggag agctccaact gcgagaatgg gtctccccag aagggccgcg 421 gcggcctggg caagaggagg agggcgccca ccaagaagag ccccctgagc ggggtcagcc 481 aggaggggaa gcaggtccag cgcaacgccg ccaacgcgcg agagcgggcc cgcatgcgag 541 tgctgagcaa ggccttctcc agactcaaga ccaccctgcc ctgggtgccc cccgacacca 601 agctctccaa gctggacacg ctcaggctgg cgtccagcta catcgcccac ttgaggcaga 661 tcctggctaa cgacaaatac gagaacgggt acattcaccc ggtcaacctg acgtggccct 721 ttatggtggc cgggaaaccc gagagtgacc tgaaagaagt ggtgaccgcg agccgcttat 781 gtggaaccac cgcgtcctga ccttggaggt gcgagtctgg gaaaggcgcg ctcccggggg 841 gagcgggccc cgggaaggcg acccctgccc tcagtgctct ctgtctctgc ttccccctcg 901 caatgctcct ctctctgtcc caccccgcga gaacacttta caacgacgag gagattcgtt 961 tccaaaccag aggagatcaa ttgtacttac aaagattccc atctatttaa ctttattaac 1021 ttctaccgtg aatgactctg caagccttgc tggtccaagt gcaatatgta attataaata 1081 tataaataga taagagccta tcaatgtatc ttttgtacaa tatgttgtaa aatgtagatc 1141 ataggatagc tgactttgac agtcacattt ataaagtaat tcacttaaag atatatattt 1201 ttttcaaaca agttttgcta cttttgaaaa taaatctttc tttatattgc taaa // LOCUS AF035752 1321 bp mRNA PRI 09-DEC-1997 DEFINITION Homo sapiens caveolin-2 mRNA, complete cds. ACCESSION AF035752 U32114 NID g2665791 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1321) AUTHORS Scherer,P.E., Okamoto,T., Chun,M., Nishimoto,I., Lodish,H.F. and Lisanti,M.P. TITLE Identification, sequence, and expression of caveolin-2 defines a caveolin gene family JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (1), 131-135 (1996) MEDLINE 96133891 REFERENCE 2 (bases 1 to 1321) AUTHORS Scherer,P.E., Lewis,R.Y., Volonte,D., Engelman,J.A., Galbiati,F., Couet,J., Kohtz,D.S., van Donselaar,E., Peters,P. and Lisanti,M.P. TITLE Cell-type and tissue-specific expression of caveolin-2. Caveolins 1 and 2 co-localize and form a stable hetero-oligomeric complex in vivo JOURNAL J. Biol. Chem. 272 (46), 29337-29346 (1997) MEDLINE 98030620 REFERENCE 3 (bases 1 to 1321) AUTHORS Scherer,P.E. and Lisanti,M.P. TITLE Direct Submission JOURNAL Submitted (25-NOV-1997) Cell Biology, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY 10461, USA FEATURES Location/Qualifiers source 1..1321 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 21..509 /note="member of the caveolin gene family" /codon_start=1 /product="caveolin-2" /db_xref="PID:g2665792" /translation="MGLETEKADVQLFMDDDSYSHHSGLEYADPEKFADSDQDRDPHR LNSHLKLGFEDVIAEPVTTHSFDKVWICSHALFEISKYVMYKFLTVFLAIPLAFIAGI LFATLSCLHIWILMPFVKTCLMVLPSVQTIWKSVTDVIIAPLCTSVGRCFSSVSLQLS QD" BASE COUNT 397 a 260 c 251 g 413 t ORIGIN 1 ggccgcgcac caaggctgcg atggggctgg agacggagaa ggcggacgta cagctcttca 61 tggacgacga ctcctacagc caccacagcg gcctcgagta cgccgacccc gagaagttcg 121 cggactcgga ccaggaccgg gatccccacc ggctcaactc gcatctcaag ctgggcttcg 181 aggatgtgat cgcagagccg gtgactacgc actcctttga caaagtgtgg atctgcagcc 241 atgccctctt tgaaatcagc aaatacgtaa tgtacaagtt cctgacggtg ttcctggcca 301 ttcccctggc cttcattgcg ggaattctct ttgccaccct cagctgtctg cacatctgga 361 ttttaatgcc ttttgtaaag acctgcctaa tggttctgcc ttcagtgcag acaatatgga 421 agagtgtgac agatgttatc attgctccat tgtgtacgag cgtaggacga tgcttctctt 481 ctgtcagcct gcaactgagc caggattgaa tacttggacc ccaggtctgg agattgggat 541 actgtaatac ttctttgtta ttataacata aaagcaccac tgttctgttc atttcctagc 601 tgttctaatt aagaaaacta ttaagatgag caaccacatt tagaaatgtt tattgacagg 661 tcttttcaaa taatgctttt ctaattaata gccaaagatt tcatatctaa ctttgtaacc 721 agaattatac agtaagttga caccacttag atttaaaggc agacagtttt gctttagtac 781 aatagtatac attttataat gatgaactta taatgattaa gggacatttc tataaaaata 841 ctacaatagt tttatgcaca acttcccatt aaaaatgaga tttcttattt gtttgtctgt 901 ttttactctg ggagtaatac tttttaaatt acctttacat atatagtcac tggcatactg 961 agaatataca atgatcctgg aaattgcagt accaaaagca cacaacgatt atagtaacta 1021 taagatacaa taaaccaaat aaatgtgaaa gtagattcat gaaaatgtat tcctttaaaa 1081 tattgttttc ctacaggcct atttaacaag atgtttcatt ttctgtatat tttgtagtta 1141 atataaatgt tgctctaatc agattgctta aaagcatttt tattatattt atgttgttga 1201 actaatatat gaaataagta aatgtagctc ccacaaggta aacttcattg gtaagattgc 1261 actgttctga ttatgtaagc atttgtacat cttctttgga aataaaagat aaaagagcga 1321 t // LOCUS AF035811 1768 bp mRNA PRI 09-DEC-1997 DEFINITION Homo sapiens protein H5 (H5) mRNA, complete cds. ACCESSION AF035811 NID g2665833 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1768) AUTHORS Zha,D. and Hu,G. TITLE Direct Submission JOURNAL Submitted (26-NOV-1997) Max-Planck Junior Group No.2, Shanghai Institute of Cell Biology, 320 Yue-Yang Road, Shanghai, Shanghai 200031, People's Republic of China FEATURES Location/Qualifiers source 1..1768 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..1768 /gene="H5" CDS 130..1566 /gene="H5" /note="similar to M. musculus protein H5, Swiss-Prot Accession Number P28661" /codon_start=1 /product="protein H5" /db_xref="PID:g2665834" /translation="MDRSLGWQGNSVPEDRTEAGIKRFLEDTTDDGELSKFVKDFSGN ASCHPPEAKTWASRPQVPEPRPQAPDLYDDDLEFRPPSRPQSSDNQQYFCAPAPLSPS ARPRSPWGKLDPYDSSEDDKEYVGFATLPNQVHRKSVKKGFDFTLMVAGESGLGKSTL VNSLFLTDLYRDRKLLGAEERIMQTVEITKHAVDIEEKGVRLRLTIVDTPGFGDAVNN TECWKPVAEYIDQQFEQYFRDESGLNRKNIQDNRVHCCLYFISPFGHGLRPLDVEFMK ALHQRVNIVPILAKADTLTPPEVDHKKRKIREEIEHFGIKIYQFPDCDSDEDEDFKLQ DQALKESIPFAVIGSNTVVEARGRRVRGRLYPWGIVEVENPGHCDFVKLRTMLVRTHM QDLKDVTRETHYENYRAQCIQSMTRLVVKERNRNKLTRESGTDFPIPAVPPGTDPETE KLIREKDEELRRMQEMLHKIQKQMKENY" BASE COUNT 491 a 459 c 468 g 350 t ORIGIN 1 ttggcctcga ggccaagatt cggcacgagg aacagcatca aaacaaggct gtttctgtgt 61 gtgaggaact ttgcctggga gataaaatta gacctagagc tttctgacag ggagtctgaa 121 gcgtgggaca tggaccgttc actgggatgg caagggaatt ctgtccctga ggacaggact 181 gaagctggga tcaagcgttt cctggaggac accacggatg atggagaact gagcaagttc 241 gtgaaggatt tctcaggaaa tgcgagctgc cacccaccag aggctaagac ctgggcatcc 301 aggccccaag tcccggagcc aaggccccag gccccggacc tctatgatga tgacctggag 361 ttcagacccc cctcgcggcc ccagtcctct gacaaccagc agtacttctg tgccccagcc 421 cctctcagcc catctgccag gccccgcagc ccatggggca agcttgatcc ctatgattcc 481 tctgaggatg acaaggagta tgtgggcttt gcaaccctcc ccaaccaagt ccaccgaaag 541 tccgtgaaga aaggctttga ctttaccctc atggtggcag gagagtctgg cctgggcaaa 601 tccacacttg tcaatagcct cttcctcact gatctgtacc gggaccggaa acttcttggt 661 gctgaagaga ggatcatgca aactgtggag atcactaagc atgcagtgga catagaagag 721 aagggtgtga ggctgcggct caccattgtg gacacaccag gttttgggga tgcagtcaac 781 aacacagagt gctggaagcc tgtggcagaa tacattgatc agcagtttga gcagtatttc 841 cgagacgaga gtggcctgaa ccgaaagaac atccaagaca acagggtgca ctgctgcctg 901 tacttcatct cacccttcgg ccatgggctc cggccattgg atgttgaatt catgaaggcc 961 ctgcatcagc gggtcaacat cgtgcctatc ctggctaagg cagacacact gacacctccc 1021 gaagtggacc acaagaaacg caaaatccgg gaggagattg agcattttgg aatcaagatc 1081 tatcaattcc cagactgtga ctctgatgag gatgaggact tcaaattgca ggaccaagcc 1141 ctaaaggaaa gcatcccatt tgcagtaatt ggcagcaaca ctgtagtaga ggccagaggg 1201 cgacgagttc ggggtcgact ctacccctgg ggcatcgtgg aagtggaaaa cccagggcac 1261 tgcgactttg tgaagctgag gacaatgctg gtacgtaccc acatgcagga cctgaaggat 1321 gtgacacggg agacacatta tgagaactac cgggcacagt gcatccagag catgacccgc 1381 ctggtggtga aggaacggaa tcgcaacaaa ctgactcggg aaagtggtac cgacttcccc 1441 atccctgctg tcccaccagg gacagatcca gaaactgaga agcttatccg agagaaagat 1501 gaggagctgc ggcggatgca ggagatgcta cacaaaatac aaaaacagat gaaggagaac 1561 tattaactgg ctttcagccc tggatattta aatctcctcc tcttcttcct gtccatgccg 1621 gcccctccca gcaccagttt tgctcaggcc ccttcagcta ctgccacttc gccttacatc 1681 cctgctgact gcccagagac tcagaggaaa taaagtttaa taaatctgta ggtggctaaa 1741 aaaaaaaaaa aaaaaaaaaa aaaaaaaa // LOCUS AF035812 1622 bp mRNA PRI 09-DEC-1997 DEFINITION Homo sapiens dynein light intermediate chain 2 (LIC2) mRNA, complete cds. ACCESSION AF035812 NID g2665835 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1622) AUTHORS Zha,D. and Hu,G. TITLE Direct Submission JOURNAL Submitted (26-NOV-1997) Max-Planck Junior Group No.2, Shanghai Institute of Cell Biology, 320 Yue-Yang Road, Shanghai 200031, People's Republic of China FEATURES Location/Qualifiers source 1..1622 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..1622 /gene="LIC2" CDS 7..1485 /gene="LIC2" /note="similar to R. norvegicus and G. gallus dynein light intermediate chain 2, Swiss-Prot Accession Numbers Q62698 and Q90828, respectively" /codon_start=1 /product="dynein light intermediate chain 2" /db_xref="PID:g2665836" /translation="MAPVGVEKKLLLGPNGPAVAAAGDLTSEEEEGQSLWSSILSEVS TRARSKLPSGKNILVFGEDGSGKTTLMTKLQGAEHGKKGRGLEYLYLSVHDEDRDDHT RCNVWILDGDLYHKGLLKFAVSAESLPETLVIFVADMSRPWTVMESLQKWASVLREHI DKMKIPPEKMRELERKFVKDFQDYMEPEEGCQGSPQRRGPLTSGSDEENVALPLGDNV LTHNLGIPVLVVCTKCDAVSVLEKEHDYRDEHLDFIQSHLRRFCLQYGAALIYTSVKE EKNLDLLYKYIVHKTYGFHFTTPALVVEKDAVFIPAGWDNEKKIAILHENFTTVKPED AYEDFIVKPPVRKLVHDKELAAEDEQVFLMKQQSLLAKQPATPTRASESPARGPSGSP RTQGRGGPASVPSSSPGTSVKKPDPNIKNNAASEGVLASFFNSLLSKKTGSPGSPGAG GVQSTAKKSGQKTVLSNVQEELDRMTRKPDSMVTNSSTENEA" BASE COUNT 474 a 356 c 428 g 364 t ORIGIN 1 ggcaagatgg cgccggtggg ggtggagaag aagctgctgc taggtcccaa cgggcccgcg 61 gtggcggccg ccggcgacct gaccagtgag gaggaggaag gccagagcct atggtcctcc 121 attctgagcg aagtgtccac ccgcgccagg tccaagctgc cgtccggcaa gaacatcctg 181 gtcttcggtg aagatggttc tggtaaaaca accctcatga ctaaactaca aggagctgag 241 catggcaaaa aaggaagagg cctagaatat ctctacctca gtgtccatga tgaggaccga 301 gatgatcaca cgcgctgcaa cgtgtggatt ctggatggag acttgtacca caaaggcctg 361 ctgaaatttg cagtttctgc tgaatccttg ccagagaccc tcgtcatttt tgttgcagac 421 atgtctagac cttggactgt gatggaatct ctgcagaaat gggctagtgt tttacgtgag 481 cacattgata aaatgaaaat tccaccagaa aaaatgaggg agctggaacg gaagtttgtg 541 aaagattttc aagactatat ggaacctgaa gaaggttgtc aaggttcccc acagagaaga 601 ggccctctga cctcaggctc cgatgaagaa aatgttgccc tgcctctggg tgacaatgtg 661 ctgactcata acctggggat cccggtgttg gtggtgtgca caaagtgtga tgcggtgagt 721 gtcctggaga aggagcacga ttacagggat gagcatttgg actttatcca gtcacacctg 781 cggaggttct gccttcagta tggagctgcc ttgatttaca catcagtgaa agaagagaaa 841 aacctcgact tgttgtataa gtatattgtt cataaaacat acggtttcca cttcaccaca 901 cctgccttag ttgtggaaaa ggatgccgtt tttatacctg caggctggga caatgaaaag 961 aaaatagcta ttttacatga aaattttaca accgtgaagc cggaagatgc atatgaagac 1021 tttattgtga aacctcccgt gagaaagctg gtccacgaca aagagttggc agcagaagat 1081 gagcaggtgt tcctaatgaa gcaacagtca ctccttgcca agcaaccagc cactcccacg 1141 agagcttctg aatctcctgc aagaggaccc tctggctctc caaggaccca gggtcgggga 1201 gggccagcca gtgtgcctag ctcctcccca ggcacgtcag taaaaaagcc ggacccaaac 1261 atcaaaaata atgcagcaag tgaaggggtg ttggccagct tcttcaacag tctgttgagt 1321 aaaaagacag gctctcctgg aagtcctggt gctggtgggg tgcagagcac agccaagaag 1381 tcaggacaaa agactgtgtt gtcaaatgtt caggaagaac tggatagaat gactcgaaag 1441 ccagactcta tggtaacaaa ctcttcaaca gaaaatgaag cctgaacctc cttaaaaagt 1501 gcatatgtcg aatgaccaaa taactatgta tattgatctg ctaagaccag gatttttctg 1561 atatggcaca tgctatcagt tttttggggc aggggagatg aactttaaaa aaaaaaaaaa 1621 aa // LOCUS AF035824 935 bp mRNA PRI 03-FEB-1998 DEFINITION Homo sapiens vesicle soluble NSF attachment protein receptor (VTI1) mRNA, complete cds. ACCESSION AF035824 NID g2687399 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 935) AUTHORS von Mollard,G.F. and Stevens,T.H. TITLE A human homolog can functionally replace the yeast vesicle-associated SNARE vti1p in two vesicle transport pathways JOURNAL J. Biol. Chem. 273 (5), 2624-2630 (1998) MEDLINE 98112804 REFERENCE 2 (bases 1 to 935) AUTHORS Fischer von Mollard,G. and Stevens,T.H. TITLE Direct Submission JOURNAL Submitted (25-NOV-1997) Institute of Molecular Biology, University of Oregon, Eugene, OR 97403, USA FEATURES Location/Qualifiers source 1..935 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..935 /gene="VTI1" CDS 72..770 /gene="VTI1" /note="Vti1; v-SNARE" /codon_start=1 /product="vesicle soluble NSF attachment protein receptor" /db_xref="PID:g2687400" /translation="MASSAASSEHFEKLHEIFRGLHENLQGVPERLLGTAGTEEKKKL IRDFDEKQQEANETLAEMEEELRYAPLSFRNPMMSKLRNYRKDLAKLHREVRSTPLTA TPGGRGDMKYGIYAVENEHMNRLQSQRAMLLQGTESLNRATQSIERSHRIATETDQIG SEIIEELGEQRDQLERTKSRLVNTSENLSKSRKILRSMSRKVTTNKLLLSIIILLELA ILGGLVYYKFFRSH" BASE COUNT 267 a 218 c 260 g 190 t ORIGIN 1 attccgccgg gctaaggaaa gggcccaggg ccccgaatct cggtggccgc tgctccagcg 61 cggcctgcgc catggcctcc tccgccgcct cctcggagca tttcgagaag ctgcacgaga 121 tcttccgcgg cctccatgaa aacctacaag gggtgcccga acggctgctg gggacggcgg 181 ggaccgaaga aaagaagaaa ttgatcaggg attttgatga aaagcaacag gaagcaaatg 241 aaacgctggc agagatggag gaggagctac gttatgcacc cctgtctttc cgaaacccca 301 tgatgtctaa gcttcgaaac taccggaagg accttgctaa actccatcgg gaggtgagaa 361 gcacaccttt gacagccaca cctggaggcc gaggagacat gaaatatggc atatatgctg 421 tagagaatga gcatatgaat cggctacagt ctcaaagggc aatgcttctg cagggcactg 481 aaagcctgaa ccgggccacc caaagtattg aacgttctca tcggattgcc acagagactg 541 accagattgg ctcagaaatc atagaagagc tgggggaaca acgagaccag ttagaacgta 601 ccaagagtag actggtaaac acaagtgaaa acttgagcaa aagtcggaag attctccgtt 661 caatgtccag aaaagtgaca accaacaagc tgctgctttc cattatcatc ttactggagc 721 tcgccatcct gggaggcctg gtttactaca aattctttcg cagccattga acttctatta 781 gggaagggtt tgtggaccag aactttgacc ttgtgaatgc atgatgttag ggatgtggat 841 aggattagca tattgctgct gtgggctgac atttcaaggg tgcactgtat agccaggctg 901 tgggaggagg gaggaaagat gaaaaaccac ttaaa // LOCUS AF036109 1995 bp mRNA PRI 09-DEC-1997 DEFINITION Homo sapiens Na+/nucleoside cotransporter (hCNT2) mRNA, complete cds. ACCESSION AF036109 NID g2665907 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1995) AUTHORS Ritzel,M.W.L., Yao,S.Y.M., Cass,C.E. and Young,J.D. TITLE Direct Submission JOURNAL Submitted (27-NOV-1997) Physiology, University of Alberta, 7-25 Medical Sciences Building, Edmonton, AB T6G2H7, Canada FEATURES Location/Qualifiers source 1..1995 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="15" /map="15q15" /tissue_type="small intestine" gene 1..1995 /gene="hCNT2" CDS 15..1991 /gene="hCNT2" /codon_start=1 /product="Na+/nucleoside cotransporter" /db_xref="PID:g2665908" /translation="MEKASGRQSIALSTVETGTVNPGLELMEKEVEPEGSKRTDAQGH SLGDGLGPSTYQRRSRWPFSKARSFCKTHASLFKKILLGLLCLAYAAYLLAACILNFQ RALALFVITCLVIFVLVHSFLKKLLGKKLTRCLKPFENSRLRLWTKWVFAGVSLVGLI LWLALDTAQRPEQLIPFAGICMFILILFACSKHHSAVSWRTVFSGLGLQFVFGILVIR TDLGYTVFQWLGEQVQIFLNYTVAGSSFVFGDTLVKDVFAFQALPIIIFFGCVVSILY YLGLVQWVVQKVAWFLQITMGTTATETLAVAGNIFVGMTEAPLLIRPYLGDMTLSEIH AVMTGGFATISGTVLGAFIAFGVDASSLISASVMAAPCALASSKLAYPEVEESKFKSE EGVKLPRGKERNVLEAASNGAVDAIGLATNVAANLIAFLAVLAFINAALSWLGELVDI QGLTFQVICSYLLRPMVFMMGVEWTDCPMVAEMVGIKFFINEFVAYQQLSQYKNKRLS GMEEWIEGEKQWISVRAEIITTFSLCGFANLSSIGITLGGLTSIVPHRKSDLSKVVVR ALFTGACVSLISACMAGILYVPRGAEADCVSFPNTSFTNRTYETYMCCRGLFQSTSLN GTNPPSFSGPWEDKEFSAMALTNCCGFYNNTVCA" BASE COUNT 448 a 471 c 533 g 543 t ORIGIN 1 gaggagaaca ggagatggag aaagcaagtg gaagacagtc cattgctctg tccacagtgg 61 agactggcac agtgaacccg gggctggagc tcatggaaaa agaagtagag cctgagggaa 121 gcaagaggac tgacgcacaa ggacacagcc tgggggatgg actgggccct tccacttacc 181 agaggaggag tcggtggcct ttcagcaaag caagaagttt ctgcaaaaca cacgccagct 241 tgttcaagaa gatcctgttg ggcctgttgt gtttggccta tgctgcctat ctcctggcag 301 cttgcatctt gaatttccag agggcactgg ccttgtttgt catcacctgc ttggtgatct 361 ttgtcctggt tcactcgttt ttgaaaaagc tcctgggcaa aaaattaaca agatgtctga 421 agccctttga aaactcccgc ctgaggcttt ggacgaaatg ggtgtttgca ggagtctcct 481 tggttggcct tatactgtgg ttggctttag acacagccca aaggccagag cagctgatcc 541 cctttgcagg aatctgcatg ttcatcctta tcctctttgc ctgctccaaa caccacagcg 601 cagtgtcctg gaggacagtg ttttcgggcc taggtcttca atttgtcttt gggatcttgg 661 tcatcagaac tgatcttgga tatactgtat ttcagtggct gggagagcag gtccagattt 721 tcctgaacta cactgtggcc ggctccagtt ttgtctttgg ggatacactg gtcaaggatg 781 tctttgcttt tcaggcctta ccaatcatca ttttctttgg atgtgtggtg tccattctct 841 actacctggg ccttgtgcaa tgggtagttc agaaggtcgc ctggttttta caaatcacta 901 tgggcaccac tgctacagag accctggctg tggcaggaaa catctttgtg ggtatgacag 961 aggcacctct gctcatccgt ccctaccttg gggacatgac actctctgaa atccatgcgg 1021 tgatgactgg agggtttgcc accatttctg gcactgtgct gggagccttc atagcctttg 1081 gggttgatgc atcatccctg atttctgcct ctgtgatggc cgccccttgt gctctcgcct 1141 catcaaagct agcgtatccg gaagtggagg agtccaagtt caagagtgag gagggggtaa 1201 agctgccccg tgggaaggag aggaatgtcc tggaagctgc cagcaacgga gccgtagatg 1261 ccataggcct tgctactaat gtagcagcca acctgattgc ctttttggct gtgttggcct 1321 tcatcaatgc tgccctctcc tggctggggg aattggtgga catacagggg ctcactttcc 1381 aggtcatctg ctcctatctc ctaaggccca tggttttcat gatgggtgta gaatggacag 1441 actgtccaat ggtggctgag atggtgggaa tcaagttctt cataaatgaa tttgttgctt 1501 atcagcaact gtctcaatac aagaacaaac gtctctctgg aatggaggag tggattgagg 1561 gagagaaaca gtggatttct gtgagagctg aaatcattac aacattttca ctctgtggat 1621 ttgccaatct tagttccata ggaatcacac ttggaggctt gacatcaata gtacctcacc 1681 ggaagagtga cttgtccaag gttgtggtca gggccctctt cacaggggcc tgtgtatccc 1741 ttatcagtgc ctgtatggca ggaatcctct atgtccccag gggagctgaa gctgactgtg 1801 tctccttccc aaacacaagt ttcaccaata gaacctatga gacctacatg tgctgcagag 1861 ggctctttca gagtacttct ctgaatggca ccaaccctcc ttctttttct ggtccctggg 1921 aagataagga gttcagtgct atggccctta ctaactgctg tggattctac aacaataccg 1981 tctgtgcctg aggcg // LOCUS AF036581 1169 bp mRNA PRI 29-JAN-1998 DEFINITION Homo sapiens tumor necrosis factor superfamily member LIGHT mRNA, complete cds. ACCESSION AF036581 NID g2815623 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1169) AUTHORS Mauri,D.N., Ebner,R., Montgomery,R.I., Kochel,K.D., Cheung,T.C., Yu,G.-L., Ruben,S., Murphy,M., Eisenberg,R.J., Cohen,G.H., Spear,P.G. and Ware,C.F. TITLE LIGHT, a new member of the TNF superfamily, and lymphotoxin (LT)a are ligands for herpesvirus entry mediator (HVEM) JOURNAL Immunity 8, 21-30 (1998) REFERENCE 2 (bases 1 to 1169) AUTHORS Ebner,R., Kochel,K.D. and Ware,C.F. TITLE Direct Submission JOURNAL Submitted (02-DEC-1997) Division of Molecular Immunology, La Jolla Institute for Allergy and Immunology, 10355 Science Center Drive, San Diego, CA 92121, USA FEATURES Location/Qualifiers source 1..1169 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" /cell_type="peripheral blood mononuclear cells activated with phorbol ester and phytohemagglutinin for 12 hr" CDS 49..771 /function="ligand for herpesvirus entry mediator (HVEM) and lymphotoxin-beta receptor (LTbR)" /codon_start=1 /product="tumor necrosis factor superfamily member LIGHT" /db_xref="PID:g2815624" /translation="MEESVVRPSVFVVDGQTDIPFTRLGRSHRRQSCSVARVGLGLLL LLMGAGLAVQGWFLLQLHWRLGEMVTRLPDGPAGSWEQLIQERRSHEVNPAAHLTGAN SSLTGSGGPLLWETQLGLAFLRGLSYHDGALVVTKAGYYYIYSKVQLGGVGCPLGLAS TITHGLYKRTPRYPEELELLVSQQSPCGRATSSSRVWWDSSFLGGVVHLEAGEEVVVR VLDERLVRLRDGTRSYFGAFMV" repeat_region 855..1140 /rpt_family="Alu" /rpt_type=dispersed polyA_site 1140 BASE COUNT 254 a 314 c 386 g 215 t ORIGIN 1 gaggttgaag gacccaggcg tgtcagccct gctccagaga ccttgggcat ggaggagagt 61 gtcgtacggc cctcagtgtt tgtggtggat ggacagaccg acatcccatt cacgaggctg 121 ggacgaagcc accggagaca gtcgtgcagt gtggcccggg tgggtctggg tctcttgctg 181 ttgctgatgg gggctgggct ggccgtccaa ggctggttcc tcctgcagct gcactggcgt 241 ctaggagaga tggtcacccg cctgcctgac ggacctgcag gctcctggga gcagctgata 301 caagagcgaa ggtctcacga ggtcaaccca gcagcgcatc tcacaggggc caactccagc 361 ttgaccggca gcggggggcc gctgttatgg gagactcagc tgggcctggc cttcctgagg 421 ggcctcagct accacgatgg ggcccttgtg gtcaccaaag ctggctacta ctacatctac 481 tccaaggtgc agctgggcgg tgtgggctgc ccgctgggcc tggccagcac catcacccac 541 ggcctctaca agcgcacacc ccgctacccc gaggagctgg agctgttggt cagccagcag 601 tcaccctgcg gacgggccac cagcagctcc cgggtctggt gggacagcag cttcctgggt 661 ggtgtggtac acctggaggc tggggaggag gtggtcgtcc gtgtgctgga tgaacgcctg 721 gttcgactgc gtgatggtac ccggtcttac ttcggggctt tcatggtgtg aaggaaggag 781 cgtggtgcat tggacatggg tctgacacgt ggagaactca gagggtgcct caggggaaag 841 aaaactcacg aagcagaggc tgggcgtggt ggctctcgcc tgtaatccca gcactttggg 901 aggccaaggc aggcggatca cctgaggtca ggagttcgag accagcctgg ctaacatggc 961 aaaaccccat ctctactaaa aatacaaaaa ttagccggac gtggtggtgc ctgcctgtaa 1021 tccagctact caggaggctg aggcaggata attttgctta aacccgggag gcggaggttg 1081 cagtgagccg agatcacacc actgcactcc aacctgggaa acgcagtgag actgtgcctc 1141 aaaaaaaaaa aaaaaaaaaa aaaaaaaaa // LOCUS AF036717 1532 bp mRNA PRI 24-DEC-1997 DEFINITION Homo sapiens FGFR signalling adaptor SNT-1 mRNA, complete cds. ACCESSION AF036717 NID g2708627 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1532) AUTHORS Xu,H., Lee,K. and Goldfarb,M. TITLE Two human SNT proteins define a family of multifunctional signalling adaptor molecules activated by FGF receptors JOURNAL Unpublished REFERENCE 2 (bases 1 to 1532) AUTHORS Xu,H., Lee,K. and Goldfarb,M. TITLE Direct Submission JOURNAL Submitted (04-DEC-1997) Brookdale Center for Developmental and Molecular Biology, Mt. Sinai School of Medicine, 1 Gustave Levy Place, New York, NY 10029, USA FEATURES Location/Qualifiers source 1..1532 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="12" /map="12p15" /tissue_type="placenta" gene 1..1532 /gene="SNT-1" misc_feature 6..23 /gene="SNT-1" /note="encodes myristoylation motif" CDS 6..1532 /gene="SNT-1" /note="similar to murine FRS2" /codon_start=1 /product="FGFR signalling adaptor SNT-1" /db_xref="PID:g2708628" /translation="MGSCCSCPDKDTVPDNHRNKFKVINVDDDGNELGSGIMELTDTE LILYTRKRDSVKWHYLCLRRYGYDSNLFSFESGRRCQTGQGIFAFKCARAEELFNMLQ EIMQNNSINVVEEPVVERNNHQTELEVPRTPRTPTTPGFAAQNLPNGYPRYPSFGDAS SHPSSRHPSVGSARLPSVGEESTHPLLVAEEQVHTYVNTTGVQEERKNRTSVHVPLEA RVSNAESSTPKEEPSSIEDRDPQILLEPEGVKFVLGPTPVQKQLMEKEKLEQLGRDQV SGSGANNTEWDTGYDSDERRDAPSVNKLVYENINGLSIPSASGVRRGRLTSTSTSDTQ NINNSAQRRTALLNYENLPSLPPVWEARKLSRDEDDNLGPKTPSLNGYHNNLDPMHNY VNTENVTVPASAHKIEYSRRRDCTPTVFNFDIRRPSLEHRQLNYIQVDLEGGSDSDNP QTPKTPTTPLPQTPTRRTELYAVIDIERTAAMSNLQKALPRDDGTSRKTRHNSTDLPM " misc_feature 51..398 /gene="SNT-1" /note="encodes PTB domain" BASE COUNT 501 a 333 c 339 g 359 t ORIGIN 1 aagccatggg tagctgttgt agctgtccag ataaagacac tgtcccagat aaccatcgga 61 acaagtttaa ggtcattaat gtggatgatg atgggaatga gttaggttct ggcataatgg 121 aacttacaga cacagaactg attttataca cccgcaaacg tgactcagta aaatggcact 181 acctctgcct gcgacgctat ggctatgact cgaatctctt ttcttttgaa agtggtcgaa 241 ggtgtcaaac tggacaagga atctttgcct ttaagtgtgc ccgtgcagaa gaattattta 301 acatgttgca agagattatg caaaataata gtataaatgt ggtggaagag ccagttgtag 361 aaagaaataa tcatcagaca gaattggaag tccctagaac acctcgaaca cctacaactc 421 caggatttgc tgctcagaac ttacctaatg gatatccccg atatccctca tttggagatg 481 cttcatccca tccgtcaagc agacatcctt ctgtgggaag tgctcgcctg ccttcagtag 541 gggaagaatc tacacatcct ttgcttgtgg ctgaggaaca agtacatacc tatgtcaaca 601 ctacaggtgt gcaagaagag cggaaaaacc gcacaagtgt gcatgttcca ttggaggcaa 661 gggtttctaa cgctgaaagc agcacaccaa aagaagaacc aagtagtatt gaggacaggg 721 atcctcagat tcttcttgaa cctgaaggag tcaaatttgt tttagggcca acccctgttc 781 aaaagcagtt aatggaaaaa gagaaactgg agcaacttgg aagagatcaa gttagtggaa 841 gtggagcaaa taacacagaa tgggacactg gctatgacag tgatgaacga agagatgcac 901 cctctgttaa caaactggtg tatgaaaata taaatgggct atctatccct agtgcctcag 961 gggtcaggag aggtcgtctg acatccacca gtacctcaga tacccagaat atcaacaact 1021 cagctcagag aagaactgca ttattaaact atgaaaatct accatctttg cctcctgttt 1081 gggaagcccg caagctaagt agggatgaag atgacaattt aggaccaaag accccatctc 1141 taaatggcta ccataataat ctagatccaa tgcataacta tgtaaataca gagaatgtaa 1201 cagtgccagc aagtgctcac aaaatagaat attcaaggcg tcgggactgt acaccaacag 1261 tctttaactt tgatatcaga cgcccaagtt tagaacacag gcagcttaat tacatacagg 1321 ttgacttgga aggtggcagt gactctgaca accctcagac tccaaaaacg cctacaactc 1381 cccttccaca aacccctacc aggcgcacag agctgtatgc cgtgatagac atcgagagaa 1441 ctgctgctat gtcaaatttg cagaaagcac tgccacgaga tgatggtaca tctaggaaaa 1501 ctagacacaa tagtactgat ctgcccatgt ga // LOCUS AF036718 2077 bp mRNA PRI 24-DEC-1997 DEFINITION Homo sapiens FGFR signalling adaptor SNT-2 mRNA, complete cds. ACCESSION AF036718 NID g2708629 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2077) AUTHORS Xu,H., Lee,K. and Goldfarb,M. TITLE Two human SNT proteins define a family of multifunctional signalling adaptor molecules activated by FGF receptors JOURNAL Unpublished REFERENCE 2 (bases 1 to 2077) AUTHORS Xu,H., Lee,K. and Goldfarb,M. TITLE Direct Submission JOURNAL Submitted (04-DEC-1997) Brookdale Center for Developmental and Molecular Biology, Mt. Sinai School of Medicine, 1 Gustave Levy Place, New York, NY 10029, USA FEATURES Location/Qualifiers source 1..2077 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" gene 1..2077 /gene="SNT-2" misc_feature 137..154 /gene="SNT-2" /note="encodes myristoylation motif" CDS 137..1615 /gene="SNT-2" /codon_start=1 /product="FGFR signalling adaptor SNT-2" /db_xref="PID:g2708630" /translation="MGSCCSCLNRDSVPDNHPTKFKVTNVDDEGVELGSGVMELTQSE LVLHLHRREAVRWPYLCLRRYGYDSNLFSFESGRRCQTGQGIFAFKCSRAEEIFNLLQ DLMQCNSINVMEEPVIITRNSHPAELDLPRAPQPPNALGYTVSSFSNGCPGEGPRFSA PRRLSTSSLRHPSLGEESTHALIAPDEQSHTYVNTPASEDDHRRGRHCLQPLPEGQAP FLPQARGPDQRDPQVFLQPGQVKFVLGPTPARRHMVKCQGLCPSLHDPPHHNNNNEAP SECPAQPKCTYENVTGGLWRGAGWRLSPEEPGWNGLAHRRAALLHYENLPPLPPVWES QAQQLGGEAGDDGDSRDGLTPSSNGFPDGEEDETPLQKPTSTRAAIRSHGSFPVPLTR RRGSPRVFNFDFRRPGPEPPRQLNYIQVELKGWGGDRPKGPQNPSSPQAPMPTTHPAR SSDSYAVIDLKKTVAMSNLQRALPRDDGTARKTRHNSTDLPL" misc_feature 182..532 /gene="SNT-2" /note="encodes PTB domain" BASE COUNT 426 a 654 c 568 g 421 t 8 others ORIGIN 1 gggtgaattn tntgcagant gtttgcctga cagcccacca aggacggggg gaataaagtg 61 ggaacccttc cccatgcccc tcccacggtc agntccccga tggcntgggt gagggcaggc 121 tggntgntyt gacaccatgg ggagctgctg cagctgcctg aacagagaca gcgttccaga 181 caaccacccc accaagttca aggtgacaaa tgtggatgat gagggggtgg agctgggctc 241 tggggtgatg gagctgacgc agagtgagct ggtgctgcac ctgcatcggc gtgaggccgt 301 ccgctggcct tatctctgct tgcggcgcta tggctacgac tccaacctct tctcctttga 361 gagtggccgc cgatgtcaga caggccaggg aatatttgca tttaagtgtt cccgggctga 421 ggaaatcttc aacctccttc aggatctgat gcagtgcaac agcatcaatg tgatggaaga 481 gcctgtcatc atcacccgca atagccaccc cgctgagctt gacctccctc gagcccccca 541 gccacccaat gctctaggct acactgtctc cagcttttcc aatggctgcc ctggagaggg 601 cccacgattc tcagctcccc ggcggctctc gacaagcagc ctgcggcacc cctcgcttgg 661 ggaagagtcc acccatgccc tcattgctcc tgatgagcag tcccacacct atgtcaacac 721 accggccagt gaagatgacc accgcagggg ccgccactgc ctgcagcccc tgcctgaggg 781 tcaggcaccc ttcctcccgc aggcccgggg acctgaccaa cgggacccac aggtgttctt 841 gcagccaggc caggtgaagt ttgtgttggg cccgacccct gctcggcggc acatggtgaa 901 gtgccagggc ctctgtccca gcctgcatga ccccccacac cacaataata acaatgaggc 961 cccttctgag tgtccagccc agcccaagtg cacctacgag aacgtcaccg gggggctgtg 1021 gcgaggggct ggctggagac tgagcccaga ggagccgggc tggaatggcc ttgcccaccg 1081 ccgggccgcc ctgctgcact atgagaacct gcccccactg ccccctgtgt gggaaagcca 1141 agcccagcag ctgggagggg aggctgggga tgatggggac tcgagggatg ggctcacacc 1201 ctcttccaat ggcttccctg atggtgagga ggacgagacc ccactgcaga agcccaccag 1261 cacccgggcc gccatccgca gccacggcag ctttcctgtg ccactgaccc gccgccgcgg 1321 ctccccaagg gtcttcaact ttgatttccg ccggccgggg cccgagcccc caaggcagct 1381 taactacatc caggtggagc taaagggctg gggtggagac cgccctaagg ggccccagaa 1441 cccctcgagc ccccaagccc ccatgcccac cacccaccct gcccgaagct cagactccta 1501 cgccgtgatt gacctcaaaa agaccgtggc catgtccaac ctgcagagag ctctgccccg 1561 agacgatggc accgccagga aaacccggca caacagcacc gacctgcctc tgtagggact 1621 tcccggtcct caccaccctc tgccccacca tgatccagcc tctgcctcac actcctgtcc 1681 tctgaaccca ccctccccag ggttcagggt tgctttgcag aaggcatgga ggtgggacca 1741 gatgcttccc tgtgctggct ggagtcccca gagatatcag ccaccgagtc tcctggtctg 1801 tctccaggct ggggagagaa gggtgaccag gcgaggaggg agccgagacg tcaactgtga 1861 agggttcctg tgattgcgtt tagcgccctc ccgttctgtc catttgtctt gtctgccgac 1921 tgtccgtgtg tgtttttttt ggttggaatt ttgaaacatt ttgtacctgt tgattttatt 1981 tatcagttta tttttctatt tattgtttta aatgtaattt aacatattta ttattaatat 2041 aattattttt aaattctgaa aaaaaaaaaa aaaaaaa // LOCUS AF036906 1460 bp DNA PRI 02-FEB-1998 DEFINITION Homo sapiens linker for activation of T cells (LAT) mRNA, alternatively spliced form, complete cds. ACCESSION AF036906 NID g2828025 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1460) AUTHORS Zhang,W., Sloan-Lancaster,J., Kitchen,J., Trible,R.P. and Samelson,L.E. TITLE LAT: the ZAP-70 tyrosine kinase substrate that links T cell receptor to cellular activation JOURNAL Cell (1997) In press REFERENCE 2 (bases 1 to 1460) AUTHORS Zhang,W., Sloan-Lancaster,J., Kitchen,J., Trible,R.P. and Samelson,L.E. TITLE Direct Submission JOURNAL Submitted (05-DEC-1997) Cell Biology and Metabolism Branch, National Institute of Child Health and Development, National Institute of Health, 9000 Rockville Pike, Bethesda, MD 20892, USA COMMENT LAT is a highly tyrosine phosphorylated protein, previously described as p36-38, and it associates with many signaling molecules, such as Grb2, PLC-gamma1, PI-3 kinase, cbl, Vav, and SLP-76, either directly or indirectly upon T cell activation. It is a potential type III transmembrane protein. FEATURES Location/Qualifiers source 1..1460 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Jurkat T cells" gene 79..867 /note="linker for activation of T cell" /gene="LAT" CDS 79..867 /gene="LAT" /note="tyrosine kinase substrate; This a alternatively spliced form of LAT" /codon_start=1 /product="LAT" /db_xref="PID:g2828026" /translation="MEEAILVPCVLGLLLLPILAMLMALCVHCHRLPGSYDSTSSDSL YPRGIQFKRPHTVAPWPPAYPPVTSYPPLSQPDLLPIPRSPQPLGGSHRTPSSRRDSD GANSVASYENEGASGIRGAQAGWGVWGPSWTRLTPVSLPPEPACEDADEDEDDYHNPG YLVVLPDSTPATSTAAPSAPALSTPGIRDSAFSMESIDDYVNVPESGESAEASLDGSR EYVNVSQELHPGAAKTEPAALSSQEAEEVEEEGAPDYENLQELN" BASE COUNT 269 a 443 c 432 g 316 t ORIGIN 1 accccatctt catctggcct tgactctgcc cttgaggggc ctaggggtgc agccagcctg 61 ctccgagctc ccctgcagat ggaggaggcc atcctggtcc cctgcgtgct ggggctcctg 121 ctgctgccca tcctggccat gttgatggca ctgtgtgtgc actgccacag actgccaggc 181 tcctacgaca gcacatcctc agatagtttg tatccaaggg gcatccagtt caaacggcct 241 cacacggttg ccccctggcc acctgcctac ccacctgtca cctcctaccc acccctgagc 301 cagccagacc tgctccccat cccaagatcc ccgcagcccc ttgggggctc ccaccggacg 361 ccatcttccc ggcgggattc tgatggtgcc aacagtgtgg cgagctacga gaacgagggt 421 gcgtctggga tccgaggtgc ccaggctggg tggggagtct ggggtccgtc ctggactagg 481 ctgacccctg tgtcgttacc cccagaacca gcctgtgagg atgcagatga ggatgaggac 541 gactatcaca acccaggcta cctggtggtg cttcctgaca gcaccccggc cactagcact 601 gctgccccat cagctcctgc actcagcacc cctggcatcc gagacagtgc cttctccatg 661 gagtccattg atgattacgt gaacgttccg gagagcgggg agagcgcaga agcgtctctg 721 gatggcagcc gggagtatgt gaatgtgtcc caggaactgc atcctggagc ggctaagact 781 gagcctgccg ccctgagttc ccaggaggca gaggaagtgg aggaagaggg ggctccagat 841 tacgagaatc tgcaggagct gaactgaggg cctgtggagg ccgagtctgt cctggaacca 901 ggcttgcctg ggacggctga gctgggcagc tggaagtggc tctggggtcc tcacatggcg 961 tcctgccctt gctccagcct gacaacagcc tgagaaatcc ccccgtaact tattatcact 1021 ttggggttcg gcctgtgtcc cccgaacgct ctgcaccttc tgacgcagcc tgagaatgac 1081 ctgccctggc cccagcccta ctctgtgtaa tagaataaag gcctgcgtgt gtctgtgttg 1141 agcgtgcgtc tgtgtgtgcc tgtgtgcgag tctgagtcag agatttggag atgtctctgt 1201 gtgtttgtgt gtatctgtgg gtctccatcc tccatggggg ctcagccagg tgctgtgaca 1261 ccccccttct gaatgaagcc ttctgacctg ggctggcact gctgggggtg aggacacatt 1321 gccccatgag acagtcccag aacacggcag ctgctggctg tgacaatggt ttcaccatcc 1381 ttagaccaag ggatgggacc tgatgacctg ggaggactct tttagttctt acctcttgtg 1441 gttctcaata aaacagaacg // LOCUS AF037194 822 bp mRNA PRI 25-DEC-1997 DEFINITION Homo sapiens regulator of G protein signaling RGS14-variant, mRNA, complete cds. ACCESSION AF037194 NID g2708807 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 822) AUTHORS Chatterjee,T.K. and Fisher,R.A. TITLE Human RGS14 JOURNAL Unpublished REFERENCE 2 (bases 1 to 822) AUTHORS Chatterjee,T.K. and Fisher,R.A. TITLE Direct Submission JOURNAL Submitted (06-DEC-1997) Pharmacology, University of Iowa, Iowa City, IA 52242, USA FEATURES Location/Qualifiers source 1..822 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 74..679 /codon_start=1 /product="regulator of G protein signaling RGS14-variant" /db_xref="PID:g2708808" /translation="MFRAQQLQIFNLMKFDSYARFVKSPLYRECLLAEAEGRPLREPG SSRLGSPDATRKKPKLKPGKSLPLGVEELGQLPPVEGPGGRPLRKSFRRELGGTANAA LRRESQGSLNSSASLDLGFLAFVSSKSESHRKSLGSTEGESESRPGKYCCVYLPDGTA SLALARPGLTIRDMLAGICEKRGLSLPDIKVYLVGNEQVGT" BASE COUNT 161 a 273 c 234 g 154 t ORIGIN 1 tcgtgtgagc ccagtgaaca tcgaccgtca ggcctggctt ggcgaggagg tgctggccga 61 gccccggccg gacatgtttc gggcacagca gcttcagatc ttcaacttga tgaagttcga 121 cagctatgcg cgcttcgtca agtccccgct gtaccgcgag tgcctgctag ccgaagccga 181 gggacgccct ctgcgggaac ctggctcctc gcgcctcggc agccctgacg ccacgaggaa 241 gaagccgaag ctgaagcccg ggaagtcgct gccgctgggt gtggaggagt tggggcagct 301 gccacccgtt gagggtcctg ggggccgccc tctccgcaag tccttccgcc gggagctggg 361 cgggactgca aacgccgcct tgcgccgaga gtctcagggc tccctcaact cctccgccag 421 cctggacctt ggcttcctag ccttcgtcag cagcaaatct gagagccacc ggaagagcct 481 tgggagcacg gagggtgaaa gtgaaagccg gccagggaag tactgctgtg tgtacctgcc 541 cgatggcaca gcctccttgg ccctggccag acctggcctc accatccgag acatgctggc 601 agggatctgt gagaaacgag gcctctctct acctgacatc aaggtctacc tggtgggcaa 661 tgaacaggtg ggaacgtgac ctggctccaa ctctaacctc cttcctgatc ctgaccccaa 721 ctccatttct gtctctgctc agtcatggct ccaaccccaa ttcaagttct aaacccaact 781 ccaaagcacc ctcagcacca actttcactc actcagacat cc // LOCUS AF037204 2356 bp mRNA PRI 06-JAN-1998 DEFINITION Homo sapiens RING zinc finger protein (RZF) mRNA, complete cds. ACCESSION AF037204 NID g2746332 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2356) AUTHORS Lomax,M.I., Warner,S.J., Bersirli,C.G. and Gong,T.-W.L. TITLE The gene for a RING zinc finger protein is expressed in the inner ear JOURNAL Prim. Sens. Neuron (1998) In press REFERENCE 2 (bases 1 to 2356) AUTHORS Lomax,M.I., Warner,S.J., Bersirli,C.G. and Gong,T.-W.L. TITLE Direct Submission JOURNAL Submitted (08-DEC-1997) Kresge Hearing Research Institute, University of Michigan, 9301E MSRB III, 1150 West Medical Center Drive, Ann Arbor, MI 48109-0648, USA FEATURES Location/Qualifiers source 1..2356 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..2356 /gene="RZF" 5'UTR 1..168 /gene="RZF" CDS 169..1314 /gene="RZF" /codon_start=1 /product="RING zinc finger protein" /db_xref="PID:g2746333" /translation="MLLSIGMLMLSATQVYTILTVQLFAFLNLLPVEADILAYNFENA SQTFDDLPARFGYRLPAEGLKGFLINSKPENACEPIVPPPVKDNSSGTFIVLIRRLDC NFDIKVLNAQRAGYKAAIVHNVDSDDLISMGSNDIEVLKKIDIPSVFIGESSANSLKD EFTYEKGGHLILVPEFSLPLEYYLIPFLIIVGICLILIVIFMITKFVQDRHRARRNRL RKDQLKKLPVHKFKKGDEYDVCAICLDEYEDGDKLRILPCSHAYHCKCVDPWLTKTKK TCPVCKQKVVPSQGDSDSDTDSSQEENEVTEHTPLLRPLASVSAQSFGALSESRSHQN MTESSDYEEDDNEDTDSSDAENEINEHDVVVQLQPNGERDYNIANTV" misc_feature 886..1011 /gene="RZF" /note="encodes RING zinc finger domain" 3'UTR 1315..2356 /gene="RZF" polyA_signal 2316..2321 /gene="RZF" polyA_site 2337 /gene="RZF" BASE COUNT 750 a 442 c 447 g 717 t ORIGIN 1 atgattacgc caagcttggc acgaggcgtc cctgctagta ctccgggctg tgggggtcgg 61 tgcggatatt cagtcatgaa atcagggtag ggacttctcc cgcagcgacg cggctggcaa 121 gactgtttgt gttgcggggg ccggacttca aggtgatttt acaacgagat gctgctctcc 181 atagggatgc tcatgctgtc agccacacaa gtctacacca tcttgactgt ccagctcttt 241 gcattcttaa acctactgcc tgtagaagca gacattttag catataactt tgaaaatgca 301 tctcagacat ttgatgacct ccctgcaaga tttggttata gacttccagc tgaaggttta 361 aagggttttt tgattaactc aaaaccagag aatgcctgtg aacccatagt gcctccacca 421 gtaaaagaca attcatctgg cactttcatc gtgttaatta gaagacttga ttgtaatttt 481 gatataaagg ttttaaatgc acagagagca ggatacaagg cagccatagt tcacaatgtt 541 gattctgatg acctcattag catgggatcc aacgacattg aggtactaaa gaaaattgac 601 attccatctg tctttattgg tgaatcatca gctaattctc tgaaagatga attcacatat 661 gaaaaagggg gccaccttat cttagttcca gaatttagtc ttcctttgga atactaccta 721 attcccttcc ttatcatagt gggcatctgt ctcatcttga tagtcatttt catgatcaca 781 aaatttgtcc aggatagaca tagagctaga agaaacagac ttcgtaaaga tcaacttaag 841 aaacttcctg tacataaatt caagaaagga gatgagtatg atgtatgtgc catttgtttg 901 gatgagtatg aagatggaga caaactcaga atccttccct gttcccatgc ttatcattgc 961 aagtgtgtag acccttggct aactaaaacc aaaaaaacct gtccagtgtg caagcaaaaa 1021 gttgttcctt ctcaaggcga ttcagactct gacacagaca gtagtcaaga agaaaatgaa 1081 gtgacagaac ataccccttt actgagacct ttagcttctg tcagtgccca gtcatttggg 1141 gctttatcgg aatcccgctc acatcagaac atgacagaat cttcagacta tgaggaagac 1201 gacaatgaag atactgacag tagtgatgca gaaaatgaaa ttaatgaaca tgatgtcgtg 1261 gtccagttgc agcctaatgg tgaacgggat tacaacatag caaatactgt ttgactttca 1321 gaagatgatt ggtttatttc cctttaaaat gattaggtat atactgtaat ttgatttttt 1381 gctcccttca aagatttctg tagaaataac ttatttttta gtattctaca gtttaatcaa 1441 attactgaaa caggactttt gatctggtat ttatctgcca agaatatact tcattcacta 1501 ataatagact ggtgctgtaa ctcaagcatc aattcagctc ttcttttgga atgaaagtat 1561 agccaaaaca taaaaaaaaa aaaatcctca gtatagcttg caattaagac ctagatcaca 1621 gtatttaagt gttttgcgtt ttatacatga ggtcagtgct acagccacct agcatgaact 1681 aacccagctt ccacctccat aaagttacct agagttgttg agttggaata tgttctggca 1741 tttacctgac ctgccaatca ttagggagag gcaacaaggt aattcagcct ttcctcctat 1801 cagcacaaag aaactcaaag ctgttttttc cctttctgtt ccaaagcagt cttatcctga 1861 caggagcggt ctatactagt gcagatttca acactttttt ttaacgtttt aattactata 1921 gtgttatgta gagatttgat tgagcagcta atgtttctga actttactta ctaattttca 1981 gtgtccttaa gggttctgta gtgttatcaa agcaaaaaga aaatgctgca taaaaatacc 2041 aaacttcagc aactgttaat actcagatca tatacctctt aataaatagc atcttatgct 2101 aattagccct gctaaactat gtacagagga aactgttcaa gtattggatt tgaaagtaag 2161 tgacttatgt ttaacagaac taatgatgta ttgaaacact gtattatgaa aagctaaatt 2221 atacatcatt gtaactatgt agaaagtgta gactaatgta taatcaaaat gctaaggatt 2281 tttatatggc cttgtatgag gggagtttga atgttaataa acatgttttc cactttaaaa 2341 aaaaaaaaaa aaaaaa // LOCUS AF037335 2771 bp mRNA PRI 08-JAN-1998 DEFINITION Homo sapiens carbonic anhydrase precursor (CA 12) mRNA, complete cds. ACCESSION AF037335 NID g2708638 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2771) AUTHORS Ivanov,S.V., Kuzmin,I., Wei,M.H., Pack,S., Geil,L., Stanbridge,E. and Lerman,M.I. TITLE A new family member, CA 12, of alpha-carbonic anhydrases is downregulated by the VHL gene JOURNAL Unpublished REFERENCE 2 (bases 1 to 2771) AUTHORS Ivanov,S.V., Kuzmin,I., Wei,M.H., Pack,S., Geil,L., Stanbridge,E. and Lerman,M.I. TITLE Direct Submission JOURNAL Submitted (08-DEC-1997) IRSP, SAIC-Frederick, P.O.Box B, Frederick, MD 21702, USA FEATURES Location/Qualifiers source 1..2771 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="15" /map="15q22" 5'UTR 1..115 /gene="CA 12" gene 1..2771 /gene="CA 12" sig_peptide 116..187 /gene="CA 12" CDS 116..1180 /gene="CA 12" /note="transmembrane protein" /codon_start=1 /product="carbonic anhydrase precursor" /db_xref="PID:g2708639" /translation="MPRRSLHAAAVLLLVILKEQPSSPAPVNGSKWTYFGPDGENSWS KKYPSCGGLLQSPIDLHSDILQYDASLTPLEFQGYNLSANKQFLLTNNGHSVKLNLPS DMHIQGLQSRYSATQLHLHWGNPNDPHGSEHTVSGQHFAAELHIVHYNSDLYPDASTA SNKSEGLAVLAVLIEMGSFNPSYDKIFSHLQHVKYKGQEAFVPGFNIEELLPERTAEY YRYRGSLTTPPCNPTVLWTVFRNPVQISQEQLLALETALYCTHMDDPSPREMINNFRQ VQKFDERLVYTSFSQVQVCTAAGLSLGIILSLALAGILGICIVVVVSIWLFRRKSIKK GDNKGVIYKPATKMETEAHA" mat_peptide 188..1177 /gene="CA 12" /product="carbonic anhydrase" misc_feature 209..982 /gene="CA 12" /note="encodes carbonic anhydrase domain" misc_feature 1040..1087 /gene="CA 12" /note="encodes transmembrane domain" 3'UTR 1181..2771 /gene="CA 12" repeat_region 2681..2771 /rpt_family="Alu" BASE COUNT 699 a 739 c 675 g 655 t 3 others ORIGIN 1 gtactcgcca cggcacccag gctgcgcgca cgcggtcccg gtgtgcagct ggagagcgag 61 cggccaccgg gagcccccgg cacagcccgc gcccgccccg caggagcccg cgaagatgcc 121 ccggcgcagc ctgcacgcgg cggccgtgct cctgctggtg atcttaaagg aacagccttc 181 cagcccggcc ccagtgaacg gttccaagtg gacttatttt ggtcctgatg gggagaatag 241 ctggtccaag aagtacccgt cgtgtggggg cctgctgcag tcccccatag acctgcacag 301 tgacatcctc cagtatgacg ccagcctcac gcccctcgag ttccaaggct acaatctgtc 361 tgccaacaag cagtttctcc tgaccaacaa tggccattca gtgaagctga acctgccctc 421 ggacatgcac atccagggcc tccagtctcg ctacagtgcc acgcagctgc acctgcactg 481 ggggaacccg aatgacccgc acggctctga gcacaccgtc agcggacagc acttcgccgc 541 cgagctgcac attgtccatt ataactcaga cctttatcct gacgccagca ctgccagcaa 601 caagtcagaa ggcctcgctg tcctggctgt tctcattgag atgggctcct tcaatccgtc 661 ctatgacaag atcttcagtc accttcaaca tgtaaagtac aaaggccagg aagcattcgt 721 cccgggattc aacattgaag agctgcttcc ggagaggacc gctgaatatt accgctaccg 781 ggggtccctg accacacccc cttgcaaccc cactgtgctc tggacagttt tccgaaaccc 841 cgtgcaaatt tcccaggagc agctgctggc tttggagaca gccctgtact gcacacacat 901 ggacgaccct tcccccagag aaatgatcaa caacttccgg caggtccaga agttcgatga 961 gaggctggta tacacctcct tctcccaagt gcaagtctgt actgcggcag gactgagtct 1021 gggcatcatc ctctcactgg ccctggctgg cattcttggc atctgtattg tggtggtggt 1081 gtccatttgg cttttcagaa ggaagagtat caaaaaaggt gataacaagg gagtcattta 1141 caagccagcc accaagatgg agactgaggc ccacgcttga ggtccccgga gctcccgggc 1201 acatccagga aggaccttgc tttggaccct acacacttcg gctctctgga cacttgcgac 1261 acctcaaggt gttctctgta gctcaatctg caaacatgcc aggcctcagg gatcctctgc 1321 tgggtgcctc cttgccttgg gaccatggcc accccagagc catccgatcg atggatgggt 1381 gcatctcaga ccaagcagca ggaattcaaa gctgcttgct gtaactgtgt gagattgtga 1441 agtggtctga attctggaat cacaaaccaa gccatgctgg tgggccatta atggttggaa 1501 aacactttca tcggggcttt gccagagcgt gctttcaagt gtcctggaaa gtctgctgct 1561 tctccaagct ttcagacaag aatgtgcact ctctgcttag tttgcttggg aaactcaact 1621 tctttcctct ggagacgggg catctccctc tgatttcctt ctgctatgac aaaaccttta 1681 atctgcacct tacaactcgg ggacaaatgg gacaggaagg atcaagttgt agagagaaaa 1741 aagaaaacaa gagatataca ttgtgatata ttagggacac tttcacagtc ctgtcttctg 1801 gatcacagac actgcacaga ccttagggaa tggcaggttc aagttccact tcttggtggg 1861 gatgagaagg gagagagagc tagagggaca aagagaatga gaagacatgg atgatctggg 1921 agagtctcag tttggaatca gaattggaat cacattctgt ttatcaagcc ataatgtaag 1981 gacagaataa tacaatatta agtccaaatc caacctcctg tcagtggagc agttatgttt 2041 tatactctac agattttaca aataatgagg ctgttccttg aaaatgtgtt gttgctgtgt 2101 cctggaggag acatgagttc cgagatgacc caatctgcct ttgaatctgg aggaaatagg 2161 cagaaacaaa atgactgtag aacttattct ctgtaggcca aatttcattt cagccacttc 2221 tgcaggatcc ctactgccaa cctggaatgg agacttttat ctacttctct cttctctgaa 2281 gatgtcaaat cgtggtttag atcaaatata tttcaagcta taaaagcagg aggttatctg 2341 tgcagggggc tggcatcatg tatttagggg caagtaataa tggaatgcta ctaagatact 2401 ccatattctt ccccgaatca acacagacag tttctgacag gcgcaactcc tccattttcc 2461 tcccgcaggt gagaaccctg tggagatgag tcagtgccat gactgagaag gaaccgaccc 2521 ctagttgaga gcaccttgca gttccccgag aactttctga ttcacagtct cattttgaca 2581 gcatgaaatg tcctcttgaa gcatagcttt ttaaatatct ttttccttct actcctccct 2641 ctgactctaa gaattctntn ttctggaatc gcttgaaccc aggaggcgga ggttgcagta 2701 agccaaggtc atgccactgc actctagcct gggtgacaga gcgagactcc atntcaaaaa 2761 aaaaaaaaaa a // LOCUS AF037439 2363 bp mRNA PRI 21-DEC-1997 DEFINITION Homo sapiens protein kinase A anchoring protein mRNA, complete cds. ACCESSION AF037439 NID g2707343 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2363) AUTHORS Chatterjee,T.K. and Fisher,R.A. TITLE Homo sapiens protein kinase A anchoring protein with an RGS domain JOURNAL Unpublished REFERENCE 2 (bases 1 to 2363) AUTHORS Chatterjee,T.K. and Fisher,R.A. TITLE Direct Submission JOURNAL Submitted (09-DEC-1997) Pharmacology, University of Iowa, Iowa City, IA 52242, USA FEATURES Location/Qualifiers source 1..2363 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 138..2126 /note="RGS domain." /codon_start=1 /product="protein kinase A anchoring protein" /db_xref="PID:g2707344" /translation="MRGAGPSPRQSPRTLRPDPGPAMSFFRRKVKGKEQEKTSDVKSI KASISVHSPQKSTKNHALLEAAGPSHVAINAISANMDSFSSSRTATLKKQPSHMEAAH FGDLGRSCLDYQTQETKSSLSKTLEQVLHDTIVLPYFIQFMELRRMEHLVKFWLEAES FHSTTWSRIRAHSLNTMKQSSLAEPVSPSKKHETTASFLTDSLDKRLEDSGSAQLFMT HSEGIDLNNRTNSTQNHLLLSQECDSAHSLRLEMARAGTHQVSMETQESSSTLTVASR NSPASPLKELSGKLMKSIEQDAVNTFTKYISPDAAKPIPITEAMRNDIIARICGEDGQ VDPNCFVLAQSIVFSAMEQEHFSEFLRSHHFCKYQIEVLTSGTVYLADILFCESALFY FSEYMEKEDAVNILQFWLAADNFQSQLAAKKGQYDGQEAQNDAMILYDKYFSLQATHP LGFDDVVRLEIESNICREGGPLPNCFTTPLRQAWTTMEKVFLPGFLSSNLYYKYLNDL IHSVRGDEFLGGNVSPTAPGSVGPPDESHPGSSDSSASQSSVKKASIKILKNFDEAII VDAASLDPESLYQRTYAGKMTFGRVSDLGQFIRESEPEPDVRKSKGSMFSQAMKKWVQ GNTDEAQEELAWKIAKMIVSDIMQQAQYDQPLEKSTKL" BASE COUNT 681 a 535 c 545 g 602 t ORIGIN 1 gcggcttgtt gataatatgg cggctggagc tgcctgggca tcccgaggag gcggtggggc 61 ccactcccgg aagaagggtc ccttttcgcg ctagtgcagc ggcccctctg gacccggaag 121 tccgggccgg ttgctgaatg aggggagccg ggccctcccc gcgccagtcc ccccgcaccc 181 tccgtcccga cccgggcccc gccatgtcct tcttccggcg gaaagtgaaa ggcaaagaac 241 aagagaagac ctcagatgtg aagtccatta aagcttcaat atccgtacat tccccacaaa 301 aaagcactaa aaatcatgcc ttgctggagg ctgcaggacc aagtcatgtt gcaatcaatg 361 ccatttctgc caacatggac tccttttcaa gtagcaggac agccacactt aagaagcagc 421 caagccacat ggaggctgct cattttggtg acctgggcag atcttgtctg gactaccaga 481 ctcaagagac caaatcaagc ctttctaaga cccttgaaca agtcttgcac gacactattg 541 tcctccctta cttcattcaa ttcatggaac ttcggcgaat ggagcatttg gtgaaatttt 601 ggttagaggc tgaaagtttt cattcaacaa cttggtcgcg aataagagca cacagtctaa 661 acacaatgaa gcagagctca ctggctgagc ctgtctctcc atctaaaaag catgaaacta 721 cagcgtcttt tttaactgat tctcttgata agagattgga ggattctggc tcagcacagt 781 tgtttatgac tcattcagaa ggaattgacc tgaataatag aactaacagc actcagaatc 841 acttgctgct ttcccaggaa tgtgacagtg cccattctct ccgtcttgaa atggccagag 901 caggaactca ccaagtttcc atggaaaccc aagaatcttc ctctacactt acagtagcca 961 gtagaaatag tcccgcttct ccactaaaag aattgtcagg aaaactaatg aaaagtatag 1021 aacaagatgc agtgaatact tttaccaaat atatatctcc agatgctgct aaaccaatac 1081 caattacaga agcaatgaga aatgacatca tagcaaggat ttgtggagaa gatggacagg 1141 tggatcccaa ctgtttcgtt ttggcacagt ccatagtctt tagtgcaatg gagcaagagc 1201 actttagtga gtttctgcga agtcaccatt tctgtaaata ccagattgaa gtgctgacca 1261 gtggaactgt ttacctggct gacattctct tctgtgagtc agccctcttt tatttctctg 1321 agtacatgga aaaagaggat gcagtgaata tcttacaatt ctggttggca gcagataact 1381 tccagtctca gcttgctgcc aaaaaggggc aatatgatgg acaggaggca cagaatgatg 1441 ccatgatttt atatgacaag tacttctccc tccaagccac acatcctctt ggatttgatg 1501 atgttgtacg attagaaatt gaatccaata tctgcaggga aggtgggcca ctccccaact 1561 gtttcacaac tccattacgt caggcctgga caaccatgga gaaggtcttt ttgcctggct 1621 ttctgtccag caatctttat tataaatatt tgaatgatct catccattcg gttcgaggag 1681 atgaatttct gggcgggaac gtgtcgccga ctgctcctgg ctctgttggc cctcctgatg 1741 agtctcaccc agggagttct gacagctctg cgtctcagtc cagtgtgaaa aaagccagta 1801 ttaaaatact gaaaaatttt gatgaagcga taattgtgga tgcggcaagt ctggatccag 1861 aatctttata tcaacggaca tatgccggga agatgacatt tggaagagtg agtgacttgg 1921 ggcaattcat ccgggaatct gagcctgaac ctgatgtaag gaaatcaaaa ggatccatgt 1981 tctcacaagc tatgaagaaa tgggtgcaag gaaatactga tgaggcccag gaagagctag 2041 cttggaagat tgctaaaatg atagtcagtg acattatgca gcaggctcag tatgatcaac 2101 cgttagagaa atctacaaag ttatgactca aaacttgaga taaaggaaat ctgcttgtga 2161 aaaataagag aacttttttc ccttggttgg attcttcaac acagccaatg aaaacagcac 2221 tatatttctg atctgtcact gttgtttcca gggagagaat ggggagacaa tcctaggact 2281 tccaccctaa tgcagttacc tgtagggcat aattggatgg cacatgatgt ttcacacagt 2341 gaggagtctt taaaggttac caa // LOCUS AF038168 1209 bp mRNA PRI 22-JAN-1998 DEFINITION Homo sapiens clone 23564 putative prenylated protein mRNA, complete cds. ACCESSION AF038168 NID g2795884 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1209) AUTHORS Andersson,B., Wentland,M.A., Ricafrente,J.Y., Liu,W. and Gibbs,R.A. TITLE A 'double adaptor' method for improved shotgun library construction JOURNAL Anal. Biochem. 236 (1), 107-113 (1996) MEDLINE 96207227 REFERENCE 2 (bases 1 to 1209) AUTHORS Yu,W., Andersson,B., Worley,K.C., Muzny,D.M., Ding,Y., Liu,W., Ricafrente,J.Y., Wentland,M.A., Lennon,G. and Gibbs,R.A. TITLE Large-scale concatenation cDNA sequencing JOURNAL Genome Res. 7 (4), 353-358 (1997) MEDLINE 97264341 REFERENCE 3 (bases 1 to 1209) AUTHORS Yu,W., Sarginson,J. and Gibbs,R.A. TITLE Direct Submission JOURNAL Submitted (12-DEC-1997) Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza S930, Hosuton, TX 77030, USA FEATURES Location/Qualifiers source 1..1209 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="23564; 23776" /clone_lib="1NIB" /sex="female" /dev_stage="infant" /tissue_type="brain" /note="I.M.A.G.E. Consortium clones" CDS 335..964 /note="similar to human prenylated protein encoded by GenBank Accession Number Y13374" /codon_start=1 /product="putative prenylated protein" /db_xref="PID:g2795885" /translation="MGGGRGLLGRETLGPGGGCSGEGPLCYWPPPGSPPAPSLRASLP LEPPRCPLRSCSLPRSACLCSRNSAPGSCCRPWASLWSEPPPSPSSQPAPPMYIWTLS CAPAASWAPVTHWTDHPLPPLPSPLLPTRLPDDYIILPTDLRCHSHRHPSHPTDRLLL LVIWTHLGGIWAGHSPWTVIQTAGRPPRDLSPSARPISSPPPETSCVLA" BASE COUNT 239 a 412 c 299 g 259 t ORIGIN 1 cgaagcgagg agcagcgatg gacggtcggg tgcagctgat aaaggccctc ctggccttgc 61 cgatccggcc tgcgacgcgt cgctggagga acccgattcc ctttcccgag acgtttgacg 121 gcgataccga ccgactcccg gagttcatcg tgcagacggg ctcctacatg ttcgtggacg 181 agaacacgtt ctccagcgac gccctgaagg tgacgttcct catcacccgc ctcacagggc 241 ccgccctgca gtgggtgatc ccctacatca agaaggagag ccccctcctc aatgattacc 301 ggggctttct ggccgagatg aagcgagtct ttggatggga ggaggacgag gacttctagg 361 ccgggagacc ctcgggcctg ggggcgggtg ctctggggag ggtccgctgt gttactggcc 421 gccgccaggg tcgccaccgg cgccctccct ccgcgcctcc ctccccctcg agccgccgcg 481 atgtcccctg cgctcctgtt ccctcccgcg tagtgcttgc ctttgttcca ggaatagcgc 541 tccaggctcc tgctgccgcc cctgggcctc actctggagc gagccgccgc cctctccttc 601 cagccagcca gcccctccca tgtacatttg gacgctgtcc tgcgctccag ctgcaagctg 661 ggctcctgtt acacactgga cagaccaccc actgccgccg ctgccaagcc ctctcctccc 721 caccagactg ccagacgact acatcattct gcccacagac ctgcgctgcc acagccatcg 781 ccatccatcg catcccaccg acagactgct gctcctagtg atctggactc acctcggagg 841 tatctgggct ggccacagtc cctggacagt gatccagaca gctggccgcc ccccaaggga 901 tctgtcacct tcagcgagac ctatttcctc cccaccccca gaaacctctt gtgttcttgc 961 ctaggcccag gtgttcctgg cagccaaatc gagtctctca ttttctcttg tggaccagtt 1021 agttttgccc ataacgcagt attctgagtt tgcaactgtc tctctgatgt gtgccttttg 1081 ttcaacacag taacccctgc attctgctct gctctaatac actacctgga gaaagtcttt 1141 tccttatttt caataaatgt cagacattat tgaaaagaaa aaaaaaaaaa aaaaaaaaaa 1201 aaaaaaaaa // LOCUS AF038169 1234 bp mRNA PRI 22-JAN-1998 DEFINITION Homo sapiens clone 23790 unknown protein mRNA, complete cds. ACCESSION AF038169 NID g2795886 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1234) AUTHORS Andersson,B., Wentland,M.A., Ricafrente,J.Y., Liu,W. and Gibbs,R.A. TITLE A 'double adaptor' method for improved shotgun library construction JOURNAL Anal. Biochem. 236 (1), 107-113 (1996) MEDLINE 96207227 REFERENCE 2 (bases 1 to 1234) AUTHORS Yu,W., Andersson,B., Worley,K.C., Muzny,D.M., Ding,Y., Liu,W., Ricafrente,J.Y., Wentland,M.A., Lennon,G. and Gibbs,R.A. TITLE Large-scale concatenation cDNA sequencing JOURNAL Genome Res. 7 (4), 353-358 (1997) MEDLINE 97264341 REFERENCE 3 (bases 1 to 1234) AUTHORS Yu,W., Sarginson,J. and Gibbs,R.A. TITLE Direct Submission JOURNAL Submitted (12-DEC-1997) Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza S930, Hosuton, TX 77030, USA FEATURES Location/Qualifiers source 1..1234 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="1NIB" /sex="female" /tissue_type="brain" /dev_stage="infant" /clone="23790" /note="I.M.A.G.E. Consortium clone" CDS 406..1017 /codon_start=1 /product="unknown" /db_xref="PID:g2795887" /translation="MTVKWKQLSAPASGAEIQRFPVPAVEPVPAPGADSPPGTALELE EAPEPSCRCPGTAQDQPSEELPDFMAPPVEPPASALELKVWLELEVAERGGQHSSSQQ LPHCSQSWAQWKLWRQRPGFAIWAPLPHWRGTSLIQQSSSPAAEGPAATAAGAVCLPA GGAGEQEKEPVSRGSSRSSCSQRRPPPLGMEVCPQLGIWAICP" BASE COUNT 321 a 311 c 321 g 281 t ORIGIN 1 gcttttagag cactgtgatg taacatgtca agcagaaata gggagcatgt ttacagccat 61 tctatgaaaa agtgttcgga atgtacagac tagcacagaa gctggactaa ttgaacaagt 121 attgctgaaa atgagtgctg tagatgacat gatagcagag taccttcaag tttaaatctg 181 agagtgatat tcatttggca gaacatcata aacaggtttt gtatgatggg aaacttgcaa 241 gtagcattac ctttacatat actgctaagg ccactgatgc tcaactctgc ctggaatcat 301 caccaaaaga gaatgcatca atttttgtgc attcccaaca tgctctaatg cttcagattc 361 aagtgctttt tccactgttt ccccaattgg ataatcggca gctcaatgac agtcaagtgg 421 aaacaactgt ctgctcctgc ttcaggtgca gaaatacagc gatttccagt gccagctgtt 481 gagccagtgc cagcaccagg ggcagattcc cctccaggga cagcgctgga gctagaggaa 541 gctccagagc cctcctgccg ctgccctggg actgcccagg accagcccag tgaggagctg 601 cctgacttca tggcacctcc tgtagagcca ccggcctcag ccctggagct gaaagtgtgg 661 ctggagctag aggtggcaga gaggggtggc cagcacagct ccagccagca gctcccacac 721 tgctcccagt cctgggcaca gtggaagcta tggaggcaga gaccagggtt tgcaatctgg 781 gctcctctgc ctcactggag agggacttct ctcattcagc agagcagcag ccctgctgct 841 gaagggcctg ctgctactgc tgctggggct gtttgcctgc ctgcaggagg tgctggagag 901 caagaaaagg agcctgtgag caggggttcc agcaggtcct cctgctccca gaggcgacct 961 cctcctctag gcatggaggt ttgccctcag ctgggcatct gggccatttg cccctaacgt 1021 gctgcccagg atggcctcct cttgacaggc ggacaggggg tgagggggcc agggggcatc 1081 tccaaaggaa gcttttaaac tcagcagctg caccccagaa tctgtatgcc tgcacctgcc 1141 caaggattta ttcatagctt acctaagaat ttcaaatttc taccataaca ctgaataaag 1201 tttgactttt tgaaaaaaaa aaaaaaaaaa aaaa // LOCUS AF038195 1419 bp mRNA PRI 22-JAN-1998 DEFINITION Homo sapiens clone 23661 unknown protein mRNA, complete cds. ACCESSION AF038195 NID g2795915 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1419) AUTHORS Andersson,B., Wentland,M.A., Ricafrente,J.Y., Liu,W. and Gibbs,R.A. TITLE A 'double adaptor' method for improved shotgun library construction JOURNAL Anal. Biochem. 236 (1), 107-113 (1996) MEDLINE 96207227 REFERENCE 2 (bases 1 to 1419) AUTHORS Yu,W., Andersson,B., Worley,K.C., Muzny,D.M., Ding,Y., Liu,W., Ricafrente,J.Y., Wentland,M.A., Lennon,G. and Gibbs,R.A. TITLE Large-scale concatenation cDNA sequencing JOURNAL Genome Res. 7 (4), 353-358 (1997) MEDLINE 97264341 REFERENCE 3 (bases 1 to 1419) AUTHORS Yu,W., Sarginson,J. and Gibbs,R.A. TITLE Direct Submission JOURNAL Submitted (12-DEC-1997) Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza S930, Hosuton, TX 77030, USA FEATURES Location/Qualifiers source 1..1419 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="1NIB" /sex="female" /tissue_type="brain" /dev_stage="infant" /clone="23661; 23769" /note="I.M.A.G.E. Consortium clone" CDS 76..1335 /codon_start=1 /product="unknown" /db_xref="PID:g2795916" /translation="MPLSDFILALKDNPYFGAGFGLVGVGTALALARKGVQLGLVAFR RHYMITLEVPARDRSYAWLLSWLTRHSTRTQHLSVETSYLQHESGRISTKFEFVPSPG NHFIWYRGKWIRVERSREMQMIDLQTGTPWESVTFTALGTDRKVFFNILEEARELALQ QEEGKTVMYTAVGSEWRPFGYPRRRRPLNSVVLQQGLADRIVRDVQEFIDNPKWYTDR GIPYRRGYLLYGPPGCGKSSFITALAGELEHSICLLSLTDSSLSDDRLNHLLSVAPQQ SLVLLEDVDAAFLSRDLAVENPVKYQGLGRLTFSGLLNALDGVASTEARIVFMTTNHV DRLDPALIRPGRVDLKEYVGYCSHWQLTQMFQRFYPGQAPSLAENFAEHVLRATNQIS PAQVQGYFMLYKNDPVGAIHNAESLRR" BASE COUNT 326 a 386 c 389 g 318 t ORIGIN 1 agacggaggg ccagagagtc acggcggttt tcgtaacacc ccagggcctg taaggtttgg 61 tgtttccctt tcaagatgcc actttcagac tttattctgg ctctgaagga caatccctac 121 tttggggctg gatttgggct ggtgggtgtg ggcacagccc tggccctggc ccggaagggt 181 gtccaactgg gcctggtggc attccggcgc cattacatga tcacactgga agtccctgct 241 cgagacagga gctatgcctg gttgcttagc tggctcaccc gccacagtac ccgtactcag 301 cacctcagtg tcgagacttc gtaccttcag catgagagtg gccgcatttc cactaagttt 361 gaatttgtcc ccagccctgg aaaccatttt atctggtatc gggggaaatg gattcgggta 421 gaacgaagtc gagagatgca gatgatagac ttgcagacgg ggactccttg ggaatctgtc 481 accttcacgg ccctgggcac tgaccgaaag gttttcttca acatcctgga ggaagctcga 541 gagctagcct tgcagcagga ggaagggaag accgtgatgt acacagctgt gggctctgaa 601 tggcgtccct ttggctatcc acgccgccgg cgaccactga attctgtggt tctacaacag 661 ggtctggctg accgaattgt cagagacgtc caggaattca tcgataaccc caagtggtac 721 actgacagag gcattcctta cagacgtggc tacctgcttt atgggccccc tggttgcgga 781 aagagcagtt ttatcacagc cctggctggg gaactggagc acagcatctg cctgctgagc 841 ctcacggact ccagcctctc tgatgaccga ctcaaccacc tgctgagcgt ggccccgcag 901 cagagcctgg tactcctgga ggatgtggat gctgcttttc tcagtcgaga cttggctgtg 961 gagaacccag taaagtacca aggcctaggt cgcctcacct tcagtggact gctcaatgcc 1021 ttggatggtg tggcttccac cgaggcccgc atcgtgttca tgaccaccaa ccacgttgac 1081 aggctggacc ctgccctgat acgcccgggg cgagtggacc tgaaggagta cgtgggctac 1141 tgctcacact ggcagctgac ccagatgttc cagaggttct atccagggca ggcaccttcc 1201 ttagctgaga actttgcaga acatgtcctt cgagctacaa accagatcag tcctgcccag 1261 gtgcagggct acttcatgct gtataaaaat gaccctgtag gggcaattca caatgctgag 1321 tctctgagga ggtgatcagg ctgggctcag ctcagctctc ctcctctagc tcaataaaca 1381 tctgccacac taaaaaaaaa aaaaaaaaaa aaaaaaaaa // LOCUS AF038404 3258 bp mRNA PRI 22-DEC-1997 DEFINITION Homo sapiens homolog of Nedd5 (hNedd5) mRNA, complete cds. ACCESSION AF038404 NID g2707904 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3258) AUTHORS Hu,G. TITLE human homolog of mouse Nedd5 mRNA JOURNAL Unpublished REFERENCE 2 (bases 1 to 3258) AUTHORS Hu,G. TITLE Direct Submission JOURNAL Submitted (12-DEC-1997) Shanghai Institute of Cell Biology, 320 Yue-Yang Rd., Shanghai 200031, China FEATURES Location/Qualifiers source 1..3258 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..3258 /gene="hNedd5" CDS 79..1164 /gene="hNedd5" /note="similar to mouse Nedd5; similar to yeast CDC10" /codon_start=1 /product="homolog of Nedd5" /db_xref="PID:g2707905" /translation="MSKQQPTQFINPETPGYVGFANLPNQVHRKSVKKGFEFTLMVVG ESGLGKSTLINSLFLTDLYPERVIPGAAEKIERTVQIEASTVEIEERGVKLRLTVVDT PGYGDAINCRDCFKTIISYIDEQFERYLHDESGLNRRHIIDNRVHCCFYFISPFGHGL KPLDVAFMKAIHNKVNIVPVIAKADTLTLKERERLKKRILDEIEEHNIKIYHLPDAES DEDEDFKEQTRLLKASIPFSVVGSNQLIEAKGKKVRGRLYPWGVVEVENPEHNDFLKL RTMLITHMQDLQEVTQDLHYENFRSERLKRGGRKVENEDMNKDQILLEKEAELRRMQE MIARMQAQMQMQMQGGDGDGGALGHHV" BASE COUNT 942 a 631 c 706 g 977 t 2 others ORIGIN 1 cgcgacgntg ggcgggtccg cggcgcgntc ggtcggcgcc tattctcggg ctgtttggcg 61 gacgaagctt cacaaaagat gtctaagcaa cagccaactc agtttataaa tccagaaaca 121 cctggctatg ttggatttgc aaacctcccc aatcaagttc accgaaaatc agtgaaaaaa 181 ggttttgagt tcacactgat ggtggtcggt gaatcaggtc taggaaaatc gactctcata 241 aacagcctat tcctaactga tctgtaccca gaaagagtca tacctggagc agcagaaaaa 301 attgaaagaa ctgtccagat tgaggcttca actgttgaaa ttgaagagcg aggggtcaag 361 ctacgcctga cagtggtaga tacccctggc tatggtgacg ctatcaactg cagagattgt 421 tttaagacaa ttatctccta tattgatgag caatttgaga ggtacctgca tgacgagagc 481 ggcttgaaca ggcggcacat cattgataat agggtgcatt gttgctttta ctttatttca 541 ccttttggac atggacttaa gcccttagat gtggcgttta tgaaggcaat acacaacaag 601 gtgaatattg tgcctgtcat tgcaaaagct gacactctca ccctgaagga acgggagcgg 661 ctgaagaaaa ggattctgga tgaaattgaa gaacataaca tcaaaatcta tcacttacct 721 gatgcagaat cagatgaaga tgaagatttt aaagagcaga ctagacttct caaggctagc 781 atcccattct ctgtggttgg atccaatcag ttgattgaag ccaaaggaaa gaaggtcaga 841 ggccgcctct acccctgggg tgttgtggaa gtggagaacc cagagcacaa tgactttctg 901 aagctgagaa ccatgctcat cacccacatg caggatctcc aggaggtgac ccaggacctt 961 cattatgaaa acttccgttc tgagagactc aagagaggcg gcaggaaagt ggagaatgag 1021 gacatgaata aagaccagat cttgctggaa aaagaagctg agctccgccg catgcaagag 1081 atgattgcaa ggatgcaggc gcagatgcag atgcagatgc agggcgggga tggcgatggc 1141 ggggctctcg ggcaccacgt gtaaggtgat gtgcacatat caagaagtca gagaaaacac 1201 tttcctggat aaaaaagaaa acattccaga tgcatgatcc agctgtgtgt tttcaatcct 1261 tgggagggtg ccatccacat tttaacagta cctgtgcctg agaatttaat ttttaaaaga 1321 ctttgatgtg tttttgtatg aagtactttt aacgtatgta tttcattgct gtgtcacact 1381 ctgtgttttg tgaggtgaat gtcttccttt tctttctccc taaccactaa tgttagaatt 1441 gatttccaag aatcggcatg tatacttaat actgaatttc tttgatttaa ctgacttaac 1501 aactgactaa ccattgatga gcactcctga tttttatcta gaacattcag atttaccata 1561 atgttcctta gtggtagagg tgtgtgccta gtgatgtaga aagatacact gacttggtgc 1621 aaggccatct gcttaccaca tcacaccact tggagatctt tgcttccttg cttttatgtt 1681 tgtacacaac acctaaaacc agttttgctg ctataattct atactgttga ttcgtctgcg 1741 attttatctg ttaaccaaat aaaacataat agaatttcct aatgagatat atctttatac 1801 ttaaacagct tttttagagg tgagttttaa agaagtctct taattctgat gctaggttgt 1861 ttttaaaacc actatgcaaa ggaacaactc accacaagcc accttttgta gtgttctcca 1921 ctaatactgg ttatcctgtg ctacagagaa aatcaaagca gtcataagct ccagttttcg 1981 tattgcaaat aagactctta cctacaaaat gagattcagt gaactaattt ggtttttact 2041 caaccaaatt aaaaattttt ttaaggaaaa ttagcagttg gtctattcag aatcaaacct 2101 ttttatattt tatactgcac ttcagtgtat tttctgtcac tgtaggtata gaagatctgc 2161 ctcccctgtg gaaattgggg tctgttggtg ggcgtgccct gaagcctggc ttgggttgaa 2221 aagtgttccc gccctaaggc cttggtgccc tgaacctctg atgcctaccg ggttctcctg 2281 atttgagttt cctttaaata ctcccttttt gagtaatttt ctgatgggag gaaagtagca 2341 gtcatcatct ttttgtgtgc aggctgtctc atttattttt agccattgtc gtttcattca 2401 ttttgtgtaa tataaaccgt gtgtcatgtc aaagtgaaag acatttcaaa tctgtagcat 2461 aggctagtgg gcaggtccgc acagtcgaag ccacacctgg tctgttttct gtgcactgta 2521 gccttagtgt cacctttctt cttgtgtctc cttatggtac actccagcgg ttgccttttt 2581 tatcatttct actgaagttg ggaaattcaa ccccagaaat tgacagatga aaggagacaa 2641 tggttgtgta gggagatgga gaaaatgctt aatctgagga tgagacaggg ttttttcatt 2701 tttgtggggg ctagaaaaaa cataaaatga ggcagttaaa taataatagt taatgaaggt 2761 gtgctacaga aaataatctg gtgttcttgc taactttgcc cttcactgtt gcttaattgt 2821 gaacagccaa aagctatatg ttatggctta ttgtgtgaag gtaactaaga agtggtgttc 2881 catgacttca gagtacatcc atgcggagtc cattatttga gtttgacatt taataacttt 2941 gctggaaaat ctgtaaaaaa gaaaaacaag tttgctagtg actaagcccc gcatatgtga 3001 gtgaaagtac ttcaggcacg ctgcctcctg gtaacagcta tgcagggagg gaggacccac 3061 actgctacac ttctgatccc ctttggtttt actacccaaa tctaaataga tacttttgat 3121 aatagataac tgctctttta ctaagacata gtctctacct atagaaatgt attttgaaaa 3181 cacttatttt acacagcaat tttgtatcca tttaaactaa ccttttatca ataaagcact 3241 attgtttaga tattaaaa // LOCUS AF038965 1439 bp mRNA PRI 19-JAN-1998 DEFINITION Homo sapiens 26S proteasome ATPase subunit mRNA, complete cds. ACCESSION AF038965 NID g2791679 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1439) AUTHORS Zhang,Q., Mao,M., Huang,Q., Zhou,J., Fu,G., Wu,J., Wang,Y., Chen,S. and Chen,Z. TITLE Human 26S proteasome ATPase subunit gene expressed in hematopoietic progenitor CD34+ cell JOURNAL Unpublished REFERENCE 2 (bases 1 to 1439) AUTHORS Zhang,Q. TITLE Direct Submission JOURNAL Submitted (16-DEC-1997) Shanghai Institute of Hematology, Rui-Jin Hospital, Shanghai Second Medical University, 197 Rui-Jin Road II, Shanghai 200025, P. R. China FEATURES Location/Qualifiers source 1..1439 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="CD34+ hematopoietic progenitor" CDS 23..1279 /codon_start=1 /product="26S proteasome ATPase subunit" /db_xref="PID:g2791680" /translation="MEEIGILVEKAQDEIPALSVSRPQTGLSFLGPEPEDLEDLYSRY KKLQQELEFLEVQEEYIKDEQKNLKKEFLHAQEEVKRIQSIPLVIGQFLEAVDQNTAI VGSTTGSNYYVRILSTIDRELLKPNASVALHKHSNALVDVLPPEADSSIMMLTSDQKP DVMYADIGGMDIQKQEVREAVELPLTHFELYKQIGIDPPRGVLMYGPPGCGKTMLAKA VAHHTTAAFIRVVGSEFVQKYLGEGPRMVRDVFRLAKENAPAIIFIDEIDAIATKRFD AQTGADREVQRILLELLNQMDGFDQNVNVKVIMATNRADTLDPALLRPGRLDRKIEFP LPDRRQKRLIFSTITSKMNLSEEVDLEDYVARPDKISGADINSICQESGMLAVRENRY IVLAKDFEKAYKTVIKKDEQEHEFYK" BASE COUNT 382 a 385 c 386 g 286 t ORIGIN 1 gacagaggcc ggcttggtca ctatggagga gataggcatc ttggtggaga aggctcagga 61 tgagatccca gcactgtccg tgtcccggcc ccagaccggc ctgtccttcc tgggccctga 121 gcctgaggac ctggaggacc tgtacagccg ctacaagaag ctgcagcaag agctggagtt 181 cctggaggtg caggaggaat acatcaaaga tgagcaaaag aacctgaaaa aggaatttct 241 ccatgcccag gaggaggtga agcgaatcca aagcatcccg ctggtcatcg gacaatttct 301 ggaggctgtg gatcagaata cagccatcgt gggctctacc acaggctcca actattatgt 361 gcgcatcctg agcaccatcg atcgggagct gctcaagccc aacgcctcag tggccctcca 421 caagcacagc aatgcattgg tggacgtgct gccccccgaa gccgacagca gcatcatgat 481 gctcacctca gaccagaagc cagatgtgat gtacgcggac atcggaggca tggacatcca 541 gaagcaggag gtgcgggagg ccgtggagct cccgctcacg catttcgagc tctacaagca 601 gatcggcatc gatccccccc gaggcgtcct catgtatggc ccacctggct gtgggaagac 661 catgttggca aaggcggtgg cacatcacac aacagctgca ttcatccggg tcgtgggctc 721 ggagtttgta cagaagtatc tgggtgaggg cccccgcatg gtccgggatg tgttccgcct 781 ggccaaggag aatgcacctg ccatcatctt catagacgag attgatgcca tcgccaccaa 841 gagattcgat gctcagacag gggccgacag ggaggttcag aggatcctgc tggagctgct 901 gaatcagatg gatggatttg atcagaatgt caatgtcaag gtaatcatgg ccacaaacag 961 agcagacacc ctggatccgg ccctgctacg gccaggacgg ctggaccgta aaattgaatt 1021 tccacttcct gaccgccgcc agaagagatt gattttctcc actatcacta gcaagatgaa 1081 cctctctgag gaggttgact tggaagacta tgtggcccgg ccagataaga tttcaggagc 1141 tgatattaac tccatctgtc aggagagtgg aatgttggct gtccgtgaaa accgctacat 1201 tgtcctggcc aaggacttcg agaaagcata caagactgtc atcaagaagg acgagcagga 1261 gcatgagttt tacaagtgac ccttcccttc cctccaccac accactcagg ggctggggct 1321 tctctcgcac ccccagcacc tctgtcccaa aacctcattc cctttttttc tttacccagg 1381 attggtttct tcaataaata gataagatca aaaaaaaaaa aaaaaaaaaa aaaaaaaaa // LOCUS AF039018 1722 bp mRNA PRI 16-JAN-1998 DEFINITION Homo sapiens 39 kDa protein mRNA, complete cds. ACCESSION AF039018 NID g2773059 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1722) AUTHORS Pietu,G., Alibert,O., Guichard,V., Lamy,B., Bois,F., Leroy,E., Mariage-Sampson,R., Houlgatte,R., Soularue,P. and Auffray,C. TITLE Novel gene transcripts preferentially expressed in human muscles revealed by quantitative hybridization of a high density cDNA array JOURNAL Genome Res. 6 (6), 492-503 (1996) MEDLINE 96425696 REFERENCE 2 (bases 1 to 1722) AUTHORS Bouju,S., Pietu,G., Le Cunff,M., Cros,N., Reguique-Arnould,I., Pons,F., Leger,J.J., Auffray,C. and Dechesne,C.A. TITLE A novel human 39 kDa protein containing a PDZ- and LIM-motif as a striated muscle marker JOURNAL Unpublished REFERENCE 3 (bases 1 to 1722) AUTHORS Bouju,S., Pietu,G., Le Cunff,M., Cros,N., Reguique-Arnould,I., Pons,F., Leger,J.J., Auffray,C. and Dechesne,C.A. TITLE Direct Submission JOURNAL Submitted (17-DEC-1997) Unit 300, INSERM, 15 Avenue Charles Flahault, Montpellier 34060, France FEATURES Location/Qualifiers source 1..1722 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="4" /map="4q22-qtel" /clone="GENX-6904" CDS 71..1165 /note="novel; contains PDZ- and LIM-motif; striated muscle-specific" /codon_start=1 /product="39 kDa protein" /db_xref="PID:g2773060" /translation="MPQTVILPGPAPWGFRLSGGIDFNQPLVITRITPGSKAAAANLC PGDVILAIDGFGTESMTHADAQDRIKAAAHQLCLKIDRGETHLWSPQVSEDGKAHPFK INLESEPQDGNYFEHKHNIRPKPFVIPGRSSGCSTPSGIDCGSGRSTPSSVSTVSTIC PGDLKVAAKLAPNIPLEMELPGVKIVHAQFNTPMQLYSDDNIMETLQGQVSTALGETP LMNEPTASVPPESDVYRMLHDNRNEPTQPRQSGSFRVLQGMVDDGSDDRPAGTRSVRA PVTKVHGGSGGAQRMPLCDKCGSGIVGAVVKARDKYRHPECFVCADCNLNLKQKGYFF IEGELYCETHARARTKPPEGYDTVTLYPKA" BASE COUNT 472 a 416 c 429 g 405 t ORIGIN 1 gcggccggtc gaccggagtg gctgccctgc gcggggacac tcagagcccg gtgggcggga 61 ggaaggcggc atgccccaga cggtgatcct cccgggccct gcgccctggg gcttcaggct 121 ctcagggggc atagacttca accagccttt ggtcatcacc aggattacac caggaagcaa 181 ggcggcagct gccaacctgt gtcctggaga tgtcatcctg gctattgacg gctttgggac 241 agagtccatg actcatgctg atgcgcagga caggattaaa gcagcagctc accagctgtg 301 tctcaaaatt gacaggggag aaactcactt atggtctcca caagtatctg aagatgggaa 361 agcccatcct ttcaaaatca acttagaatc agaaccacag gacgggaact actttgaaca 421 caagcataat attcggccca aacctttcgt gatcccgggc cgaagcagtg gatgcagcac 481 tccctccggg attgactgtg gcagtggacg cagcacccct tcttctgtca gtactgttag 541 taccatttgc ccaggtgact tgaaagttgc ggctaagctg gcccctaaca ttcctttgga 601 aatggaactt cctggtgtga agattgtaca tgctcagttt aatacaccta tgcagttgta 661 ctcagatgac aatattatgg aaacactcca gggtcaggtt tcaacagccc taggggaaac 721 acctttgatg aacgagccca cagcctcggt gccccccgag tcggacgtgt accggatgct 781 ccacgacaat cggaatgagc ccacacagcc tcgccagtcg ggctccttca gagtgctcca 841 gggaatggtg gacgatggct ctgatgaccg tccggctgga acgcggagtg tgagagctcc 901 ggtgacgaaa gtccatggcg gttcaggcgg ggcacagagg atgccgctct gtgacaaatg 961 tgggagtggc atagttggtg ctgtggtgaa ggcgcgggat aagtaccggc accctgagtg 1021 cttcgtgtgt gccgactgca acctcaacct caagcaaaag ggctacttct tcatagaagg 1081 ggagctgtac tgcgaaaccc acgcaagagc ccgcacaaag cccccagagg gctatgacac 1141 ggtcactctg tatcccaaag cttaagtctc tgcaggcgtg gcacgcacgc acgcacccac 1201 ccacgcgcac ttacacgaga agacattcat ggctttgggc agaaggattg tgcagattgt 1261 caactccaaa tctaaagtca aggctttaga cctttatcct attgtttatt gaggaaaagg 1321 aatgggaggc aaatgcctgc tatgtgaaaa aaacatacac ttagctatgt tttgcaactc 1381 tttttggggc tagcaataat gatatttaaa gcaataattt tttgtatgtc atactccaca 1441 atttacatgt atattacagc catcaaacac ataaacatca agatatttga aggactctaa 1501 ttgtctttcc ttgacaagtt gattttgcaa ttgtggtaaa tagcaaataa caatcttgta 1561 ttctaacata atctgcagtt gtctgtatgt gttttaacta ttacagtgca tgttagggag 1621 aaattccctg aatttcttta gttttgtatt caaacaatta tgccactcga tgcaacaaac 1681 ataataaata cataaaggat ttaaaaaaaa aaaaaaaaaa aa // LOCUS AF039655 2180 bp mRNA PRI 06-JAN-1998 DEFINITION Homo sapiens filensin mRNA, complete cds. ACCESSION AF039655 NID g2746766 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2180) AUTHORS Hess,J.F., Casselman,J.T., Kong,A.P. and FitzGerald,P.G. TITLE Primary sequence, secondary structure, gene structure and assembly properties suggest that the lens specific intermediate filament protein filensin represents a novel class of intermediate filament protein JOURNAL Exp. Eye Res. (1998) In press REFERENCE 2 (bases 1 to 2180) AUTHORS Hess,J.F., Casselman,J.T., Kong,A.P. and FitzGerald,P.G. TITLE Direct Submission JOURNAL Submitted (22-DEC-1997) Cell Biology and Human Anatomy, University of California, 1 Shields Ave, Davis, CA 95616-8643, USA FEATURES Location/Qualifiers source 1..2180 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="20" CDS 1..1998 /note="lens intermediate filament protein; Lifl-H" /codon_start=1 /product="filensin" /db_xref="PID:g2746767" /translation="MYRRSYVFQTRKEQYEHADEASRAAEPERPADEGWAGATSLAAL QGLGERVAAHVQRARALEQRHAGLRRQLDAFQRLGELAGPEDALARQVESNRQRVRDL EAERARLERQGTEAQRALDEFRSKYENECECQLLLKEMLERLNKEADEALLHNLRLQL EAQFLQDDISAAKDRHKKNLLEVQTYISILQQIIHTTPPASIVTSGMREEKLLTEREV AALRSQLEEGREVLSHLQAQRVELQAQTTTLEQAIKSAHECYDDEIQLYNEQIETLRK EIEETERVLEKSSYDCRQLAVAQQTLKNELDRYHRIIEIEGNRLTSAFIETPIPLFTQ SHGVSLSTGSGGKDLTRALQDITAAKPRQKALPKNVPRRKEIITKDKTNGALEDAPLK GLEDTRLVQVVLKEESESKFESESKEVSPLTQEGAPEDVPDGGQISKGFGKLYRKVKE KVRSPKEPETPTELYTKERHVLVTGDANYVDPRFYVSSITAKGGVAVSVAEDSVLYDG QVEPSPESPKPPLENGQVGLQEKEDGQPIDQQPIDKEIEPDGAELEGPEEKREGEERD EGSRRPCAMVTPGAEEPSIPEPPKPAADQDGAEVLGTRSRSLPEKGPPKALAYKTVEV VESIEKISTESIQTYEETAVIVETMIGKTKSDKKKSGEKSS" BASE COUNT 606 a 536 c 654 g 384 t ORIGIN 1 atgtaccggc gcagctacgt gttccagacc cgcaaggagc agtacgagca cgccgacgag 61 gcttcgcgcg ccgccgagcc cgagcgcccg gccgacgagg gctgggctgg ggcaacgagc 121 ctggcggcgc tgcaggggct cggcgagcgc gtggccgccc acgtccagcg ggcccgcgcc 181 ctcgagcagc gccatgccgg gctccggagg cagctggatg ccttccagcg cctgggcgag 241 ctggccgggc ccgaggacgc cctcgcccgc caagtcgaga gcaaccgtca gcgcgtccgg 301 gacctggagg ccgagcgcgc ccggctggag cgccagggca ccgaggcgca gcgcgcgctc 361 gacgagttcc gaagcaagta tgaaaatgag tgcgaatgtc aactcctgct aaaagaaatg 421 cttgaacggc ttaacaagga agctgatgaa gccttgctgc ataacctacg ccttcagctg 481 gaagcccaat ttctgcaaga tgatatcagt gcggcaaagg acaggcacaa gaagaatctt 541 ctggaagttc agacctatat cagcatcctg cagcagatca tccacaccac tcctccagca 601 tccattgtga cgagtgggat gagggaggag aagctcctga cggagcggga ggtggccgcc 661 ctgcggagtc agctggagga gggccgggag gtgctctccc acctgcaggc gcagagagtg 721 gagctgcagg cacagacaac aactctggaa caagctatta aaagtgccca tgagtgttat 781 gacgatgaga ttcagcttta taacgagcag attgagacac tgcgcaagga gattgaggag 841 acagagcggg tcctggagaa gtcttcttac gactgccggc agctggcggt cgcccagcaa 901 accctgaaga atgagctgga ccggtatcat cgtatcatcg agattgaagg caacaggctg 961 acctctgcct tcattgaaac tcccattccc ctgttcaccc agagccatgg agtctctctc 1021 agcactggat ccggtgggaa agatcttacc agagctctgc aggatataac agcagcaaaa 1081 ccaagacaaa aagccctccc caagaatgtt ccaaggagaa aagagattat aacaaaagac 1141 aaaaccaacg gagctctgga agatgcacca ttaaaaggtt tggaagacac aaggctggta 1201 caggtggtac ttaaagagga aagtgaatct aagtttgaat cagaaagtaa agaagtaagt 1261 cccctgacac aagaaggggc tccagaggat gtgcctgatg gagggcagat aagcaaaggc 1321 tttgggaaac tatacaggaa ggtcaaggag aaagtgagaa gccccaaaga gcctgagacc 1381 cccactgagc tctacaccaa agagcggcac gtgctggtca caggggatgc caattacgtg 1441 gaccctagat tctatgtctc ctccatcaca gctaaaggtg gggtggctgt ttctgttgca 1501 gaagactctg tgctttatga cggccaggtg gagccctctc ctgagtcacc caagccccct 1561 ttagagaatg ggcaggtggg tctgcaggag aaagaagatg gacaaccaat tgaccagcag 1621 cctatagaca aggagattga gccagatggt gcagagctgg aaggccctga agagaaacgt 1681 gagggtgagg agcgggacga agggtccagg agaccctgtg ccatggtcac acccggtgca 1741 gaggaaccgt ctatacctga gcctccaaag cctgcggctg atcaggatgg agctgaggtg 1801 cttgggacta ggagcagaag cctgccagaa aaaggccctc ccaaggcttt ggcctataag 1861 acagtggaag tggtggaatc tatcgagaag atttccacgg agagcattca gacatatgaa 1921 gaaaccgctg tgatcgtgga gaccatgatt ggaaagacaa agtcagacaa gaagaaatca 1981 ggagagaaga gctcttaaaa tgcccaggct tgatgggata aaatgtattt ggggccactg 2041 taggggtaat gctttgatat tttagagcaa atgataaaag ggtgagggtt cctgtttgga 2101 ttagaccata gttgacccat ctggcattgc caacgaagcc ttcattaaaa tgttttcttt 2161 gcttgcaaaa aaaaaaaaaa // LOCUS AF039656 1486 bp mRNA PRI 16-JAN-1998 DEFINITION Homo sapiens neuronal tissue-enriched acidic protein (NAP-22) mRNA, complete cds. ACCESSION AF039656 NID g2773159 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1486) AUTHORS Park,S., Kim,B., Seong,C., Lee,S., Baek,K. and Yoon,J. TITLE The human cDNA encoding the NAP-22 homolog JOURNAL Unpublished REFERENCE 2 (bases 1 to 1486) AUTHORS Park,S., Kim,B., Seong,C., Lee,S., Baek,K. and Yoon,J. TITLE Direct Submission JOURNAL Submitted (22-DEC-1997) Genetic Engineering, Kyung Hee University, Kiheung-Up, Yongin-City, Kyungki-Do 449-701, Korea FEATURES Location/Qualifiers source 1..1486 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="Genome System Co. ID# 631975" gene 1..1486 /gene="NAP-22" CDS 53..736 /gene="NAP-22" /note="22 kDa" /codon_start=1 /product="neuronal tissue-enriched acidic protein" /db_xref="PID:g2773160" /translation="MGGKLSKKKKGYNVNDEKAKEKDKKAEGAATEEEGTPKESEPQA PAEPAEAKEGKEKPDQDAEGKAEEKEGEKDAAAAKEEAPKAEPEKTEGAAEAKAEPPK APEQEQAAPGPLRGGEAPKAAEAAAGPRPRAAPAAGEEPSKEEGEPKKTEAPAAPAAQ ETKSDGAPASDSKPGSSEAAPSSKETPAATEAPSSTPKAQGPAASAEEPKPVEAPAAN SDQTVTVKE" BASE COUNT 381 a 411 c 407 g 287 t ORIGIN 1 gaattcggca cgagctcagg ggctgcatag gcacccagag ccgaactcca agatgggagg 61 caagctcagc aagaagaaga agggctacaa tgtgaacgac gagaaagcca aggagaaaga 121 caagaaggcc gagggcgcgg cgacggaaga ggaggggacc ccgaaggaga gtgagcccca 181 ggcgcccgca gagcccgccg aggccaagga gggcaaggag aagcccgacc aggacgccga 241 gggcaaggcc gaggagaagg agggcgagaa ggacgcggcg gctgccaagg aggaggcccc 301 gaaggcggag cccgagaaga cggagggcgc ggcagaggcc aaggctgagc ccccgaaggc 361 gcccgagcag gagcaggcgg cccccggccc gctgcggggc ggcgaggccc ccaaagctgc 421 tgaggccgcc gccggccccc ggccgagagc ggcccctgcc gccggggagg agcccagcaa 481 ggaggaaggg gaacccaaaa agactgaggc gcccgcagct cctgccgccc aggagaccaa 541 aagtgacggg gccccagctt cagactcaaa acccggcagc tcggaggctg ccccctcttc 601 caaggagacc cccgcagcca cggaagcgcc tagttccaca cccaaggccc agggccccgc 661 agcctctgca gaagagccca agccggtgga ggccccggca gctaattccg accaaaccgt 721 aaccgtgaaa gagtgacaag gacagcctat aggaaaaaca ataccactta aaacaatctc 781 ctctctctct ctctctctct ctctatctct ctctctatct cctctctctc tctcctctcc 841 tatctctcct ctctctctct cctatactaa cttgtttcaa attggaagta atgatatgta 901 ttgcccaagg aaaaatacag gatgttgtcc catcaaggga gggagggggt gggagaatcc 961 aaatagtatt tttgtgggga aatatctaat ataccttcag tcaactttac caagaagtcc 1021 tggatttcca agatccgcgt ctgaaagtgc agtacatcgt ttgtacctga aactgccgcc 1081 acatgcactc ctccaccgct gagagttgaa tagcttttct tctgcaatgg gagttgggag 1141 tgatgcgttt gattctgccc acagggcctg tgccaaggca atcagatctt tatgagagca 1201 gtattttctg tgttttcttt ttaatttaca gcctttctta ttttgatatt tttttaatgt 1261 tgtggatgaa tgccagcttt cagacagagc ccacttagct tgtccacatg gatctcaatg 1321 ccaatcctcc attcttcctc tccagatatt tttgggagtg acaaacattc tctcatccta 1381 cttagcctac ctagatttct catgacgagt taatgcatgt ccgtggttgg gtgcacctgt 1441 agttctgttt attggtcagt ggaaatgaaa aaaaaaaaaa aaaaaa // LOCUS AF039843 2135 bp mRNA PRI 27-JAN-1998 DEFINITION Homo sapiens Sprouty 2 (h-Spry2) mRNA, complete cds. ACCESSION AF039843 NID g2809399 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2135) AUTHORS Hacohen,N., Kramer,S., Sutherland,D., Hiromi,Y. and Krasnow,M.A. TITLE sprouty encodes a novel antagonist of FGF signaling that patterns apical branching of the Drosophila airways JOURNAL Cell 92 (2), 253-263 (1998) MEDLINE 98117253 REFERENCE 2 (bases 1 to 2135) AUTHORS Hacohen,N., Kramer,S., Sutherland,D., Hiromi,Y. and Krasnow,M.A. TITLE Direct Submission JOURNAL Submitted (23-DEC-1997) Biochemistry, Stanford University, Beckman Center 461, Stanford, CA 94305, USA FEATURES Location/Qualifiers source 1..2135 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="3" /map="63D1-2" gene 1..2135 /note="sprouty 2; Human homolog of Drosophila melanogaster tracheal system branching inhibitor gene" /gene="h-Spry2" CDS 391..1338 /gene="h-Spry2" /note="membrane-associated protein" /codon_start=1 /product="Sprouty 2" /db_xref="PID:g2809400" /translation="MEARAQSGNGSQPLLQTPRDGGRQRGEPDPRDALTQQVHVLSLD QIRAIRNTNEYTEGPTVVPRPGLKPAPRPSTQHKHERLHGLPEHRQPPRLQHSQVHSS ARAPLSRSISTVSSGSRSSTRTSTSSSSSEQRLLGSSFSSGPVADGIIRVQPKSELKP GELKPLSKEDLGLHAYRCEDCGKCKCKECTYPRPLPSDWICDKQCLCSAQNVIDYGTC VCCVKGLFYHCSNDDEDNCADNPCSCSQSHCCTRWSAMGVMSLFLPCLWCYLPAKGCL KLCQGCYDRVNRPGCRCKNSNTVCCKVPTVPPRNFEKPT" BASE COUNT 583 a 474 c 488 g 590 t ORIGIN 1 ggcacgaggg taaggccgtt ttcttttccc attcgctcat ctgccaggaa aagggacttg 61 ccgttggcgc ttcggcctct tgttcattga gaaaaaagag gaaatactcc gcgtgcgctt 121 gtagaagggg agtcgtctcc agctccgaac cccggagtgt tcatcagcgg ggaatctggc 181 tccgaattct ctttttttct cccgccgatt gctcggaagt tggtctaaag cagaggttgg 241 aaagaaagga aaaaagtttg catcgagact ggatttattt gcacatcgca gaaagaagag 301 aatccaaggg agaggggttg gtgcaaagcc gcgatcacgg agttcagatg tgttctaagc 361 ctgctggagt gaccacactt ccaagacctg atggaggcca gagctcagag tggcaacggg 421 tcgcagccct tgctgcagac gccccgtgac ggtggcagac agcgtgggga gcccgacccc 481 agagacgccc tcacccagca ggtacatgtc ttgtctctgg atcagatcag agccatccga 541 aacaccaatg agtacacaga ggggcctact gtcgtcccaa gacctgggct caagcctgct 601 cctcgcccct ccactcagca caaacacgag agactccacg gtctgcctga gcaccgccag 661 cctcctaggc tccagcactc gcaggtccat tcttctgcac gagcccctct gtccagatcc 721 ataagcacgg tcagctcagg gtcgcggagc agtacgagga caagtaccag cagcagctcc 781 tctgaacaga gactgctagg atcatccttc tcctccgggc ctgttgctga tggcataatc 841 cgggtgcaac ccaaatctga gctcaagcca ggtgagctta agccactgag caaggaagat 901 ttgggcctgc acgcctacag gtgtgaggac tgtggcaagt gcaaatgtaa ggagtgcacc 961 tacccaaggc ctctgccatc agactggatc tgcgacaagc agtgcctttg ctcggcccag 1021 aacgtgattg actatgggac ttgtgtatgc tgtgtgaaag gtctcttcta tcactgttct 1081 aatgatgatg aggacaactg tgctgacaac ccatgttctt gcagccagtc tcactgttgt 1141 acacgatggt cagccatggg tgtcatgtcc ctctttttgc cttgtttatg gtgttacctt 1201 ccagccaagg gttgccttaa attgtgccag gggtgttatg accgggttaa caggcctggt 1261 tgccgctgta aaaactcaaa cacagtttgc tgcaaagttc ccactgtccc ccctaggaac 1321 tttgaaaaac caacatagca tcattaatca ggaatattac agtaatgagg attttttctt 1381 tcttttttta atacacatat gcaaccaact aaacagttat aatcttggca ctgttaatcg 1441 aaagttggga tagtctttgc tgtttgcggt gaaatgcttt ttgtccatgt gccgttttaa 1501 ctgatatgct tgttagaact cagctaatgg agctcaaagt atgagataca gaacttggtg 1561 acccatgtat tgcataagct aaagcaacac agacactcct aggcaaagtt tttgtttgtg 1621 aatagtactt gcaaaacttg taaattagca gatgactttt ttccattgtt ttctccagag 1681 agaatgtgct atatttttgt atatacaata atatttgcaa ctgtgaaaaa caagttgtgc 1741 catactacat ggcacagaca caaaatatta tactaatatg ttgtacattc ggaagaatgt 1801 gaatcaatca gtatgttttt agattgtatt ttgccttaca gaaagccttt attgtaagac 1861 tctgatttcc ctttggactt catgtatatt gtacagttac agtaaaattc aacctttatt 1921 ttctaatttt ttcaacatat tgtttagtgt aaagaatatt tatttgaagt tttattattt 1981 tataaaaaag aatatttatt ttaagaggca tcttacaaat tttgcccctt ttatgaggat 2041 gtgatagttg ctgcaaatga ggggttacag atgcatatgt ccaatataaa atagaaaata 2101 tattaacgtt tgaaattaaa aaaaaaaaaa aaaaa // LOCUS AF040105 644 bp mRNA PRI 16-JAN-1998 DEFINITION Homo sapiens RCL (Rcl) mRNA, complete cds. ACCESSION AF040105 NID g2773296 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 644) AUTHORS Lewis,B.C., Shim,H., Li,Q., Wu,C.S., Lee,L.A., Maity,A. and Dang,C.V. TITLE Identification of putative c-Myc-responsive genes: characterization of rcl, a novel growth-related gene JOURNAL Mol. Cell. Biol. 17 (9), 4967-4978 (1997) MEDLINE 97415576 REFERENCE 2 (bases 1 to 644) AUTHORS Lewis,B.C. and Dang,C.V. TITLE Human homolog of the c-Myc target rcl JOURNAL Unpublished REFERENCE 3 (bases 1 to 644) AUTHORS Lewis,B.C. and Dang,C.V. TITLE Direct Submission JOURNAL Submitted (29-DEC-1997) Medicine, Johns Hopkins School of Medicine, 720 Rutland Ave., Ross 1021, Baltimore, MD 21207, USA FEATURES Location/Qualifiers source 1..644 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" gene 1..644 /gene="Rcl" CDS 18..542 /gene="Rcl" /note="c-Myc target" /codon_start=1 /product="RCL" /db_xref="PID:g2773297" /translation="MAAAMVPGRSESWERGEPGRPALYFCGSIRGGREDRTLYERIVS RLRRFGTVLTEHVAAAELGARGEEAAGGDRLIHEQDLEWLQQADVVVAEVTQPSLGVG YELGRAVAFNKRILCLFRPQSGRVLSAMIRGAADGSRFQVWDYEEGEVEALLDRYFEA DPPGQVAASPDPTT" BASE COUNT 111 a 179 c 227 g 127 t ORIGIN 1 gcgcgggcgg ctggggaatg gctgctgcca tggtgccggg gcgcagcgag agctgggagc 61 gcggggagcc tggccgcccg gccctgtact tctgcgggag cattcgcggc ggacgcgagg 121 acaggacgct gtacgagcgg atcgtgtctc ggctgcggcg attcgggaca gtgctcaccg 181 agcacgtggc ggccgccgag ctgggcgcgc gcggggaaga ggctgctggg ggtgacaggc 241 tcatccatga gcaggacctg gagtggctgc agcaggcgga cgtggtcgtg gcagaagtga 301 cacagccatc cttgggtgta ggctatgagc tgggccgggc cgtggccttt aacaagcgga 361 tcctgtgcct gttccgcccg cagtctggcc gcgtgctttc ggccatgatc cggggagcag 421 cagatggctc tcggttccag gtgtgggact atgaggaggg agaggtggag gccctgctgg 481 atcgatactt cgaggctgat cctccagggc aggtggctgc ctcccctgac ccaaccactt 541 gacttaatct cactttctta aattcttcta ttctcagaca ctgctctagt accattcctt 601 cctcttagcc ccaggagcaa attaaaaggt acagttaaaa tcct // LOCUS AF040958 1894 bp mRNA PRI 16-JAN-1998 DEFINITION Homo sapiens lysosomal neuraminidase precursor, mRNA, complete cds. ACCESSION AF040958 NID g2773338 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1894) AUTHORS Bonten,E., van der Spoel,A., Fornerod,M., Grosveld,G. and d'Azzo,A. TITLE Characterization of human lysosomal neuraminidase defines the molecular basis of the metabolic storage disorder sialidosis JOURNAL Genes Dev. 10 (24), 3156-3169 (1996) MEDLINE 97138158 REFERENCE 2 (bases 1 to 1894) AUTHORS Bonten,E.J., van der Spoel,A.C., Fornerod,M., Grosveld,G. and d'Azzo,A. TITLE Direct Submission JOURNAL Submitted (02-JAN-1998) Genetics, St. Jude Children's Research Hospital, 332 North Lauderdale, Memphis, TN 38105, USA FEATURES Location/Qualifiers source 1..1894 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6p21" /dev_stage="infant" /tissue_type="brain" /clone="IMAGE clone 26525" sig_peptide 130..264 CDS 130..1377 /function="hydrolysis of oligosaccharides, gangliosides, glycolipids, and glycoproteins by removing their terminal sialic acid residues" /note="hydrolytic enzyme of the sialidase family; preference for alpha 2-3 and alpha 2-6 sialyl linkage" /codon_start=1 /product="lysosomal neuraminidase precursor" /db_xref="PID:g2773339" /translation="MTGERPSTALPDRRWGPRILGFWGGCRVWVFAAIFLLLSLAASW SKAENDFGLVQPLVTMEQLLWVSGRQIGSVDTFRIPLITATPRGTLLAFAEARKMSSS DEGAKFIALRRSMDQGSTWSPTAFIVNDGDVPDGLNLGAVVSDVETGVVFLFYSLCAH KAGCQVASTMLVWSKDDGVSWSTPRNLSLDIGTEVFAPGPGSGIQKQREPRKGRLIVC GHGTLERDGVFCLLSDDHGASWRYGSGVSGIPYGQPKQENDFNPDECQPYELPDGSVV INARNQNNYHCHCRIVLRSYDACDTLRPRDVTFDPELVDPVVAAGAVVTSSGIVFFSN PAHPEFRVNLTLRWSFSNGTSWRKETVQLWPGPSGYSSLATLEGSMDGEEQAPQLYVL YEKGRNHYTESISVAKISVYGTL" mat_peptide 265..1374 /product="lysosomal neuraminidase" BASE COUNT 399 a 516 c 552 g 427 t ORIGIN 1 cgggaagcgc ggcggggcct ccagaccggg gcgggcttaa gggtgacatc tgcgctttaa 61 agggtccggg tcagctgact cccgactctg tggagtctag ctgccagggt cgcggcagtg 121 cggggagaga tgactgggga gcgacccagc acggcgctcc cggacagacg ctgggggccg 181 cggattctgg gcttctgggg aggctgtagg gtttgggtgt ttgccgcgat cttcctgctg 241 ctgtctctgg cagcctcctg gtccaaggct gagaacgact tcggtctggt gcagccgctg 301 gtgaccatgg agcaactgct gtgggtgagc gggagacaga tcggctcagt ggacaccttc 361 cgcatcccgc tcatcacagc cactccgcgg ggcactcttc tcgcctttgc tgaggcgagg 421 aaaatgtcct catccgatga gggggccaag ttcatcgccc tgcggaggtc catggaccag 481 ggcagcacat ggtctcctac agcgttcatt gtcaatgatg gggatgtccc cgatgggctg 541 aaccttgggg cagtagtgag cgatgttgag acaggagtag tatttctttt ctactccctt 601 tgtgctcaca aggccggctg ccaggtggcc tctaccatgt tggtatggag caaggatgat 661 ggtgtttcct ggagcacacc ccggaatctc tccctggata ttggcactga agtgtttgcc 721 cctggaccgg gctctggtat tcagaaacag cgggagccac ggaagggccg cctcatcgtg 781 tgtggccatg ggacgctgga gcgggacgga gtcttctgtc tcctcagcga tgatcatggt 841 gcctcctggc gctacggaag tggggtcagc ggcatcccct acggtcagcc caagcaggaa 901 aatgatttca atcctgatga atgccagccc tatgagctcc cagatggctc agtcgtcatc 961 aatgcccgaa accagaacaa ctaccactgc cactgccgaa ttgtcctccg cagctatgat 1021 gcctgtgata cactaaggcc ccgtgatgtg accttcgacc ctgagctcgt ggaccctgtg 1081 gtagctgcag gagctgtagt caccagctcc ggcattgtct tcttctccaa cccagcacat 1141 ccagagttcc gagtgaacct gaccctgcga tggagcttca gcaatggtac ctcatggcgg 1201 aaagagacag tccagctatg gccaggcccc agtggctatt catccctggc aaccctggag 1261 ggcagcatgg atggagagga gcaggccccc cagctctacg tcctgtatga gaaaggccgg 1321 aaccactaca cagagagcat ctccgtggcc aaaatcagtg tctatgggac actctgagct 1381 gtgccactgc cacaggggta ttctgccttc aggactctgc cttcaggaac acgggtctgt 1441 agagggtctg ctggagacgc ctgaaagaca gttccatctt cctttagact ccagccttgg 1501 caaaatcacc ttccctttac cagggaaatc acttccttta ggactgaaag ctaggcgtcc 1561 tctcccacaa aaaagtcctg ccctcatctg agaatactgt ctttccatat ggctaagtgt 1621 ggccccacca ccctctctgc ctcccgggac attgattggt cctgtcttgg gcaggtctag 1681 tgagctgtag aattgaatca atgtgaactc agggaactgg ggaaggctga gcctcctctt 1741 tggtgttgcg gtaagataac cgacagggct ggtgaaagtc cccagatggc aggatatttg 1801 gtttcagagt aaggactagg tgcaccacca tgactgacta tcaatcaaaa tgtttgtaac 1861 ttaaaatttt taatgaagga taatgaatat ttta // LOCUS AF040963 879 bp mRNA PRI 20-JAN-1998 DEFINITION Homo sapiens Mad4 homolog (Mad4) mRNA, complete cds. ACCESSION AF040963 NID g2792361 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 879) AUTHORS Pribill,I., Barnes,G.T., Chen,J., Church,D., Buckler,A., Baxendale,S., Bates,G.P., Lehrach H., Gusella,M.J., Duyao,M.P., Ambrose,C.M., MacDonald,M.E. and Gusella,J.F. TITLE Comparison of Exon Trapping and Sequence-based Methods of Gene Finding JOURNAL Unpublished REFERENCE 2 (bases 1 to 879) AUTHORS Pribill,I., Barnes,G.T., Chen,J., Church,D., Buckler,A., Baxendale,S., Bates,G.P., Lehrach H., Gusella,M.J., Duyao,M.P., Ambrose,C.M., MacDonald,M.E. and Gusella,J.F. TITLE Direct Submission JOURNAL Submitted (05-JAN-1998) Molecular Neurogenetics Unit, Massachusetts General Hospital, 13th Street, Charlestown, MA 02129 FEATURES Location/Qualifiers source 1..879 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="4" /map="4p16.3" gene 1..879 /gene="Mad4" CDS 14..643 /gene="Mad4" /note="IT2" /codon_start=1 /product="Mad4 homolog" /db_xref="PID:g2792362" /translation="MELNSLLILLEAAEYLERRDREAEHGYASVLPFDGDFAREKTKA AGLVRKAPNNRSSHNELEKHRRAKLRLYLEQLKQLVPLGPDSTRHTTLSLLKRAKVHI KKLEEQDRRALSIKEQLQQEHRFLKRRLEQLSVQSVERVRTDSTGSAVSTDDSEQEVD IEGMEFGPGELDSVGSSSDADDHYSLQSGTGGDSGFGPHCRRLGRPALS" BASE COUNT 166 a 296 c 285 g 132 t ORIGIN 1 cgcgggcggg aggatggagc tgaactccct gctgatcctg ctggaggcgg ccgagtacct 61 ggagcgcagg gatcgagagg ccgagcacgg ctacgcctcg gtgctgccct tcgacggcga 121 cttcgccagg gagaaaacaa aggcggccgg cctggtgcgc aaggccccga acaacaggtc 181 ttcacacaac gagctagaaa agcacagacg agccaaactc aggctgtacc ttgagcagct 241 caagcaactg gtgcccctgg gccccgacag cacccgccac accacgctga gcctcctgaa 301 gcgggccaag gtgcacatca agaaactgga ggagcaggac cgccgggcac tgagcatcaa 361 ggagcagctg cagcaggagc atcgtttcct gaagcggcgc ctggagcagc tgtcggtgca 421 gagcgtggag cgcgtgcgca cagatagcac gggctctgct gtctccacgg acgactcaga 481 gcaagaagtg gacatagagg gcatggagtt tggccctggt gagctggaca gtgttggcag 541 cagcagtgac gcggacgacc actacagcct gcagagtggc accggcggcg acagtggctt 601 cgggccccac tgccggcggc tgggccgccc cgccctctcg taggcccgtg ccctctgctc 661 cttggcctgc ctgcccgcca gccacgcgtg tcagccctcc agttctcctt cagttgacgc 721 cagcctctcc acaggcccac tgctgtgcca ttctggaagc tccagctgct gctgggctgc 781 ctggcactgc ccgcttgccg gtcagggcct gccgagctgc ctgccccttc cagctgggca 841 gagtcccctg caaggaggca gggcccagct tccacatcc // LOCUS AF040990 4956 bp mRNA PRI 06-FEB-1998 DEFINITION Homo sapiens roundabout 1 (robo1) mRNA, complete cds. ACCESSION AF040990 NID g2804783 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4956) AUTHORS Kidd,T., Brose,K., Mitchell,K.J., Fetter,R.D., Tessier-Lavigne,M., Goodman,C.S. and Tear,G. TITLE Roundabout controls axon crossing of the CNS midline and defines a novel subfamily of evolutionarily conserved guidance receptors JOURNAL Cell 92 (2), 205-215 (1998) MEDLINE 98117249 REFERENCE 2 (bases 1 to 4956) AUTHORS Kidd,T., Brose,K., Mitchell,K.J., Fetter,R.D., Tessier-Lavigne,M.T., Goodman,C.S. and Tear,G. TITLE Direct Submission JOURNAL Submitted (05-JAN-1998) MCB, UC Berkeley, Berkeley, CA 94720, USA FEATURES Location/Qualifiers source 1..4956 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..4956 /gene="robo1" CDS 1..4956 /gene="robo1" /note="H-Robo1; axon guidance receptor homolog" /codon_start=1 /product="roundabout 1" /db_xref="PID:g2804784" /translation="MKWKHVPFLVMISLLSLSPNHLFLAQLIPDPEDVERGNDHGTPI PTSDNDDNSLGYTGSRLRQEDFPPRIVEHPSDLIVSKGEPATLNCKAEGRPTPTIEWY KGGERVETDKDDPRSHRMLLPSGSLFFLRIVHGRKSRPDEGVYVCVARNYLGEAVSHN ASLEVAILRDDFRQNPSDVMVAVGEPAVMECQPPRGHPEPTISWKKDGSPLDDKDERI TIRGGKLMITYTRKSDAGKYVCVGTNMVGERESEVAELTVLERPSFVKRPSNLAVTVD DSAEFKCEARGDPVPTVRWRKDDGELPKSRYEIRDDHTLKIRKVTAGDMGSYTCVAEN MVGKAEASATLTVQEPPHFVVKPRDQVVALGRTVTFQCEATGNPQPAIFWRREGSQNL LFSYQPPQSSSRFSVSQTGDLTITNVQRSDVGYYICQTLNVAGSIITKAYLEVTDVIA DRPPPVIRQGPVNQTVAVDGTFVLSCVATGSPVPTILWRKDGVLVSTQDSRIKQLENG VLQIRYAKLGDTGRYTCIASTPSGEATWSAYIEVQEFGVPVQPPRPTDPNLIPSAPSK PEVTDVSRNTVTLSWQPNLNSGATPTSYIIEAFSHASGSSWQTVAENVKTETSAIKGL KPNAIYLFLVRAANAYGISDPSQISDPVKTQDVLPTSQGVDHKQVQRELGNAVLHLHN PTVLSSSSIEVHWTVDQQSQYIQGYKILYRPSGANHGESDWLVFEVRTPAKNSVVIPD LRKGVNYEIKARPFFNEFQGADSEIKFAKTLEEAPSAPPQGVTVSKNDGNGTAILVSW QPPPEDTQNGMVQEYKVWCLGNETRYHINKTVDGSTFSVVIPFLVPGIRYSVEVAAST GAGSGVKSEPQFIQLDAHGNPVSPEDQVSLAQQISDVVKQPAFIAGIGAACWIILMVF SIWLYRHRKKRNGLTSTYAGIRKVPSFTFTPTVTYQRGGEAVSSGGRPGLLNISEPAA QPWLADTWPNTGNNHNDCSISCCTAGNGNSDSNLTTYSRPADCIANYNNQLDNKQTNL MLPESTVYGDVDLSNKINEMKTFNSPNLKDGRFVNPSGQPTPYATTQLIQSNLSNNMN NGSGDSGEKHWKPLGQQKQEVAPVQYNIVEQNKLNKDYRANDTVPPTIPYNQSYDQNT GGSYNSSDRGSSTSGSQGHKKGARTPKVPKQGGMNWADLLPPPPAHPPPHSNSEEYNI SVDESYDQEMPCPVPPARMYLQQDELEEEEDERGPTPPVRGAASSPAAVSYSHQSTAT LTPSPQEELQPMLQDCPEETGHMQHQPDRRRQPVSPPPPPRPISPPHTYGYISGPLVS DMDTDAPEEEEDEADMEVAKMQTRRLLLRGLEQTPASSVGDLESSVTGSMINGWGSAS EEDNISSGRSSVSSSDGSFFTDADFAQAVAAAAEYAGLKVARRQMQDAAGRRHFHASQ CPRPTSPVSTDSNMSAAVMQKTRPAKKLKHQPGHLRRETYTDDLPPPPVPPPAIKSPT AQSKTQLEVRPVVVPKLPSMDARTDRSSDRKGSSYKGREVLDGRQVVDMRTNPGDPRE AQEQQNDGKGRGNKAAKRDLPPAKTHLIQEDILPYCRPTFPTSNNPRDPSSSSSMSSR GSGSRQREQANVGRRNIAEMQVLGGYERGEDNNEELEETES" BASE COUNT 1480 a 1230 c 1186 g 1060 t ORIGIN 1 atgaaatgga aacatgttcc ttttttggtc atgatatcac tcctcagctt atccccaaat 61 cacctgtttc tggcccagct tattccagac cctgaagatg tagagagggg gaacgaccac 121 gggacgccaa tccccacctc tgataacgat gacaattcgc tgggctatac aggctcccgt 181 cttcgtcagg aagattttcc acctcgcatt gttgaacacc cttcagacct gattgtctca 241 aaaggagaac ctgcaacttt gaactgcaaa gctgaaggcc gccccacacc cactattgaa 301 tggtacaaag ggggagagag agtggagaca gacaaagatg accctcgctc acaccgaatg 361 ttgctgccga gtggatcttt atttttctta cgtatagtac atggacggaa aagtagacct 421 gatgaaggag tctatgtctg tgtagcaagg aattaccttg gagaggctgt gagccacaat 481 gcatcgctgg aagtagccat acttcgggat gacttcagac aaaacccttc ggatgtcatg 541 gttgcagtag gagagcctgc agtaatggaa tgccaacctc cacgaggcca tcctgagccc 601 accatttcat ggaagaaaga tggctctcca ctggatgata aagatgaaag aataactata 661 cgaggaggaa agctcatgat cacttacacc cgtaaaagtg acgctggcaa atatgtttgt 721 gttggtacca atatggttgg ggaacgtgag agtgaagtag ccgagctgac tgtcttagag 781 agaccatcat ttgtgaagag acccagtaac ttggcagtaa ctgtggatga cagtgcagaa 841 tttaaatgtg aggcccgagg tgaccctgta cctacagtac gatggaggaa agatgatgga 901 gagctgccca aatccagata tgaaatccga gatgatcata ccttgaaaat taggaaggtg 961 acagctggtg acatgggttc atacacttgt gttgcagaaa atatggtggg caaagctgaa 1021 gcatctgcta ctctgactgt tcaagaacct ccacattttg ttgtgaaacc ccgtgaccag 1081 gttgttgctt tgggacggac tgtaactttt cagtgtgaag caaccggaaa tcctcaacca 1141 gctattttct ggaggagaga agggagtcag aatctacttt tctcatatca accaccacag 1201 tcatccagcc gattttcagt ctcccagact ggcgacctca caattactaa tgtccagcga 1261 tctgatgttg gttattacat ctgccagact ttaaatgttg ctggaagcat catcacaaag 1321 gcatatttgg aagttacaga tgtgattgca gatcggcctc ccccagttat tcgacaaggt 1381 cctgtgaatc agactgtagc cgtggatggc actttcgtcc tcagctgtgt ggccacaggc 1441 agtccagtgc ccaccattct gtggagaaag gatggagtcc tcgtttcaac ccaagactct 1501 cgaatcaaac agttggagaa tggagtactg cagatccgat atgctaagct gggtgatact 1561 ggtcggtaca cctgcattgc atcaaccccc agtggtgaag caacatggag tgcttacatt 1621 gaagttcaag aatttggagt tccagttcag cctccaagac ctactgaccc aaatttaatc 1681 cctagtgccc catcaaaacc tgaagtgaca gatgtcagca gaaatacagt cacattatcg 1741 tggcaaccaa atttgaattc aggagcaact ccaacatctt atattataga agccttcagc 1801 catgcatctg gtagcagctg gcagaccgta gcagagaatg tgaaaacaga aacatctgcc 1861 attaaaggac tcaaacctaa tgcaatttac cttttccttg tgagggcagc taatgcatat 1921 ggaattagtg atccaagcca aatatcagat ccagtgaaaa cacaagatgt cctaccaaca 1981 agtcaggggg tggaccacaa gcaggtccag agagagctgg gaaatgctgt tctgcacctc 2041 cacaacccca ccgtcctttc ttcctcttcc atcgaagtgc actggacagt agatcaacag 2101 tctcagtata tacaaggata taaaattctc tatcggccat ctggagccaa ccacggagaa 2161 tcagactggt tagtttttga agtgaggacg ccagccaaaa acagtgtggt aatccctgat 2221 ctcagaaagg gagtcaacta tgaaattaag gctcgccctt tttttaatga atttcaagga 2281 gcagatagtg aaatcaagtt tgccaaaacc ctggaagaag cacccagtgc cccaccccaa 2341 ggtgtaactg tatccaagaa tgatggaaac ggaactgcaa ttctagttag ttggcagcca 2401 cctccagaag acactcaaaa tggaatggtc caagagtata aggtttggtg tctgggcaat 2461 gaaactcgat accacatcaa caaaacagtg gatggttcca ccttttccgt ggtcattccc 2521 tttcttgttc ctggaatccg atacagtgtg gaagtggcag ccagcactgg ggctgggtct 2581 ggggtaaaga gtgagcctca gttcatccag ctggatgccc atggaaaccc tgtgtcacct 2641 gaggaccaag tcagcctcgc tcagcagatt tcagatgtgg tgaagcagcc ggccttcata 2701 gcaggtattg gagcagcctg ttggatcatc ctcatggtct tcagcatctg gctttatcga 2761 caccgcaaga agagaaacgg acttactagt acctacgcgg gtatcagaaa agtcccgtct 2821 tttaccttca caccaacagt aacttaccag agaggaggcg aagctgtcag cagtggaggg 2881 aggcctggac ttctcaacat cagtgaacct gccgcgcagc catggctggc agacacgtgg 2941 cctaatactg gcaacaacca caatgactgc tccatcagct gctgcacggc aggcaatgga 3001 aacagcgaca gcaacctcac tacctacagt cgcccagctg attgtatagc aaattataac 3061 aaccaactgg ataacaaaca aacaaatctg atgctccctg agtcaactgt ttatggtgat 3121 gtggacctta gtaacaaaat caatgagatg aaaaccttca atagcccaaa tctgaaggat 3181 gggcgttttg tcaatccatc agggcagcct actccttacg ccaccactca gctcatccag 3241 tcaaacctca gcaacaacat gaacaatggc agcggggact ctggcgagaa gcactggaaa 3301 ccactgggac agcagaaaca agaagtggca ccagttcagt acaacatcgt ggagcaaaac 3361 aagctgaaca aagattatcg agcaaatgac acagttcctc caactatccc atacaaccaa 3421 tcatacgacc agaacacagg aggatcctac aacagctcag accggggcag tagtacatct 3481 gggagtcagg ggcacaagaa aggggcaaga acacccaagg taccaaaaca gggtggcatg 3541 aactgggcag acctgcttcc tcctccccca gcacatcctc ctccacacag caatagcgaa 3601 gagtacaaca tttctgtaga tgaaagctat gaccaagaaa tgccatgtcc cgtgccacca 3661 gcaaggatgt atttgcaaca agatgaatta gaagaggagg aagatgaacg aggccccact 3721 ccccctgttc ggggagcagc ttcttctcca gctgccgtgt cctatagcca tcagtccact 3781 gccactctga ctccctcccc acaggaagaa ctccagccca tgttacagga ttgtccagag 3841 gagactggcc acatgcagca ccagcccgac aggagacggc agcctgtgag tcctcctcca 3901 ccaccacggc cgatctcccc tccacatacc tatggctaca tttcaggacc cctggtctca 3961 gatatggata cggatgcgcc agaagaggaa gaagacgaag ccgacatgga ggtagccaag 4021 atgcaaacca gaaggctttt gttacgtggg cttgagcaga cacctgcctc cagtgttggg 4081 gacctggaga gctctgtcac ggggtccatg atcaacggct ggggctcagc ctcagaggag 4141 gacaacattt ccagcggacg ctccagtgtt agttcttcgg acggctcctt tttcactgat 4201 gctgactttg cccaggcagt cgcagcagcg gcagagtatg ctggtctgaa agtagcacga 4261 cggcaaatgc aggatgctgc tggccgtcga cattttcatg cgtctcagtg ccctaggccc 4321 acaagtcccg tgtctacaga cagcaacatg agtgccgccg taatgcagaa aaccagacca 4381 gccaagaaac tgaaacacca gccaggacat ctgcgcagag aaacctacac agatgatctt 4441 ccaccacctc ctgtgccgcc acctgctata aagtcaccta ctgcccaatc caagacacag 4501 ctggaagtac gacctgtagt ggtgccaaaa ctcccttcta tggatgcaag aacagacaga 4561 tcatcagaca gaaaaggaag cagttacaag gggagagaag tgttggatgg aagacaggtt 4621 gttgacatgc gaacaaatcc aggtgatccc agagaagcac aggaacagca aaatgacggg 4681 aaaggacgtg gaaacaaggc agcaaaacga gaccttccac cagcaaagac tcatctcatc 4741 caagaggata ttctacctta ttgtagacct acttttccaa catcaaataa tcccagagat 4801 cccagttcct caagctcaat gtcatcaaga ggatcaggaa gcagacaaag agaacaagca 4861 aatgtaggtc gaagaaatat tgcagaaatg caggtacttg gaggatatga aagaggagaa 4921 gataataatg aagaattaga ggaaactgaa agctga // LOCUS AF041254 1469 bp mRNA PRI 27-JAN-1998 DEFINITION Homo sapiens translocase of inner mitochondrial membrane Tim44 precursor mRNA, nuclear gene encoding mitochondrial protein, partial cds. ACCESSION AF041254 NID g2809419 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1469) AUTHORS Hofman,S., Gempel,K., Gerbitz,K.-D., Neupert,W., Brunner,M. and Bauer,M.F. TITLE Homo sapiens translocase of the inner mitochondrial membrane (hTim44) cDNA JOURNAL Unpublished REFERENCE 2 (bases 1 to 1469) AUTHORS Hofman,S., Gempel,K., Gerbitz,K.-D., Neupert,W., Brunner,M. and Bauer,M.F. TITLE Direct Submission JOURNAL Submitted (07-JAN-1998) Clinical Chemistry, KH Muenchen-Schwabing, Koelner Platz 1, Muenchen 80804, Germany FEATURES Location/Qualifiers source 1..1469 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="skeletal muscle" /sex="female" /dev_stage="adult" /note="ends derived by 5' and 3' RACE-RT-PCR amplification" CDS 31..1389 /note="similar to yeast Tim44" /codon_start=1 /product="translocase of inner mitochondrial membrane Tim44 precursor" /db_xref="PID:g2809420" /translation="MAAAALRSGWCRCPRRCLGSGIQFLSSHNLPHGSTYQMRRPGGE LPLSKSYSSGNRKGFLSGLLDNVKQELAKNKEMKESIKKFRDEARRLEESDVLQEARR KYKTIESETVRTSEVLRKKLGELTGTVKESLHEVSKSDLGRKIKEGVEEAAKTAKQSA ESVSKGGEKLGRTAAFRALSQGVESVKKAIDDSVLGQTGPYRRPQRLRKRTEFAGDKF KEEKVFEANEEALGVVLHKDSKWYQQWKDFKENNVVFNRFFEMKMKYDESDNAFIRAS RALTDKVTDLLGGLFSKTEMSEVLTEILRVDPAFDKDRFLKQCENDIIPNVLEAMISG ELDILKDWCYEATYSQLAHPIQQAKALGLQFHSRILDIDNVDLAMGKMMEQGPVLIIT FQAQLVMVVRNPKGEVVEGDPDKVLRMLYVWALCRDQDELNPYAAWRLLDISASSTEQ IL" BASE COUNT 376 a 385 c 461 g 245 t 2 others ORIGIN 1 cgccgcgaga aggtcacacg attctccaac atggcggcgg cggccctgcg gagtggctgg 61 tgccgctgtc cacggagatg cctcggcagt ggaatccaat ttctttccag ccacaaccta 121 ccccatgggt cgacctatca gatgcgccgg ccgggcggag agctgccact gtccaaatca 181 tattcttctg gaaacagaaa aggctttctg tccggcttgc tagataatgt caaacaagaa 241 ttagccaaaa acaaagaaat gaaagaaagt ataaaaaaat tccgtgacga ggccagaagg 301 ctagaagaat cagacgtgct ccaggaggcc agaaggaaat acaaaaccat cgagtcagaa 361 accgtgcgga cgagcgaggt gctacggaag aagcttgggg agctgacggg caccgtgaag 421 gagagccttc acgaagtcag taaaagtgat ctcggccgga aaatcaagga gggcgtggag 481 gaagcagcca agacggccaa gcagtcggcc gagtcggtat ccaaaggcgg ggagaagctg 541 ggcaggacag cggccttcag agccctctcc cagggggtgg agtccgtgaa gaaggcaatt 601 gacgacagcg tyctgggaca gaccgggccc taccggaggc cccagcgact ccggaagaga 661 acggagtttg cgggagataa gttcaaggag gagaaagtgt ttgaggccaa cgaggaggcc 721 ctgggggtcg tgctgcacaa ggactccaag tggtaccagc agtggaagga cttcaaggag 781 aacaacgtgg tgtttaaccg gttcttcgag atgaagatga agtatgacga aagcgacaac 841 gcgttcatcc gggcatcccg ggcccttacg gacaaggtca ccgacttgct ggggggcctg 901 ttctccaaga cagagatgtc ggaggtgctc acggagatcc tccgggtgga cccggccttt 961 gacaaggacc ggtttctgaa acagtgcgag aacgacatca tccccaatgt cctggaggcc 1021 atgatttctg gagagcttga cattctcaaa gactggtgct atgaagctac ttacagccag 1081 ctggcccacc ccatccagca ggccaaggca ctgggtctcc agttccattc tcgcatccta 1141 gacattgaca acgtcgacct ggccatggga aagatgatgg agcaggggcc ggtgctgatc 1201 atcaccttcc aggcacagct ggtgatggtg gtcaggaacc ccaaaggcga ggtggtggag 1261 ggtgacccgg acaaggtgct gcggatgctg tacgtgtggg cgctctgccg agaccaggac 1321 gagctcaacc cctacgcggc ctggcggctc ctggacatct cggcctcaag caccgagcag 1381 attctctgaa gtgtggtgcc gganccaggt agccccggcc tgggtcaatc aaggcacaga 1441 ggcaccgcaa caccacctgc ggcaacttc // LOCUS AF041432 805 bp mRNA PRI 19-JAN-1998 DEFINITION Homo sapiens bet3 (BET3) mRNA, complete cds. ACCESSION AF041432 NID g2791803 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 805) AUTHORS Eva,L., Subramaniam,V.N. and Hong,W. TITLE Direct Submission JOURNAL Submitted (05-JAN-1998) Membrane Biology Laboratory, Institute of Molecular and Cell Biology, 30 Medical Drive, Singapore 117609, Singapore FEATURES Location/Qualifiers source 1..805 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..805 /gene="BET3" CDS 92..634 /gene="BET3" /note="similar to yeast Bet3p" /codon_start=1 /product="bet3" /db_xref="PID:g2791804" /translation="MSRQANRGTESKKMSSELFTLTYGALVTQLCKDYENDEDVNKQL DKMGFNIGVRLIEDFLARSNVGRCHDFRETADVIAKVAFKMYLGITPSITNWSPAGDE FSLILENNPLVDFVELPDNHSSLIYSNLLCGVLRGALEMVQMAVEAKFVQDTLKGDGV TEIRMRFIRRIEDNLPAGEE" BASE COUNT 195 a 185 c 226 g 199 t ORIGIN 1 tgggctgagg ggcagcggct taggctccgg cgtctgcagg ggtcgccgag ctaacccgtg 61 gctaggcgag tggggcgggg cggccggcac catgtcgagg caggcgaacc gtggcaccga 121 gagcaagaaa atgagctctg agctcttcac cctgacctat ggtgccctgg tcacccagct 181 atgtaaggac tatgaaaatg atgaagatgt gaataaacag ctggacaaaa tgggctttaa 241 cattggagtc cggctgattg aagatttctt ggctcggtca aatgttggga ggtgccatga 301 ctttcgggaa actgcggatg tcattgccaa ggtggcgttc aagatgtact tgggcatcac 361 tccaagcatt actaattgga gcccagctgg tgatgaattc tccctcattt tggaaaataa 421 ccccttggtg gactttgtgg aacttcctga taaccactca tcccttattt attccaatct 481 cttgtgtggg gtgttgcggg gagctttgga gatggtccag atggctgtgg aggccaagtt 541 tgtccaggac accctgaaag gagacggtgt gacagaaatc cggatgagat tcatcaggcg 601 gattgaggac aatcttccag ctggagagga ataaccatcc ctacaactcg aggatagcca 661 tcaggagcac tgttggaatc agcaggcctc tgtgctccct ctgccctcca gaactcagtg 721 actcttgaac atggatgtta tatattctta taacctgttt ccattctcca ttcaaataaa 781 gagcagactg cgatatagtc cattt // LOCUS AF042080 1619 bp mRNA PRI 22-JAN-1998 DEFINITION Homo sapiens glial cell line-derived neurotrophic factor receptor alpha (GFRA1) mRNA, complete cds. ACCESSION AF042080 NID g2801556 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1619) AUTHORS Shefelbine,S.E., Khorana,S., Schultz,P.N., Huang,E., Thobe,N., Hu,Z.J., Fox,G.M., Jing,S., Cote,G.J. and Gagel,R.F. TITLE Mutational analysis of the GDNF/RET-GDNFRa signaling complex in a kindred with vesicoureteral reflux JOURNAL Hum. Genet. (1998) In press REFERENCE 2 (bases 1 to 1619) AUTHORS Shefelbine,S.E., Khorana,S., Schultz,P.N., Huang,E., Thobe,N., Hu,Z.J., Fox,G.M., Jing,S., Cote,G.J. and Gagel,R.F. TITLE Direct Submission JOURNAL Submitted (08-JAN-1998) Endocrinology-Box 15, M.D. Anderson Cancer Center, 1515 Holcombe Blvd, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..1619 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="10" /map="10q25-q26" /tissue_type="medullary thyroid carcinoma tumor" gene 1..1619 /note="GDNFRa" /gene="GFRA1" CDS 111..1508 /gene="GFRA1" /note="GPI-linked receptor" /codon_start=1 /product="glial cell line-derived neurotrophic factor receptor alpha" /db_xref="PID:g2801557" /translation="MFLATLYFALPLLDLLLSAEVSGGDRLDCVKASDQCLKEQSCST KYRTLRQCVAGKETNFSLASGLEAKDECRSAMEALKQKSLYNCRCKRGMKKEKNCLRI YWSMYQSLQGNDLLEDSPYEPVNSRLSDIFRVVPFISDVFQQVEHIPKGNNCLDAAKA CNLDDICKKYRSAYITPCTTSVSNDVCNRRKCHKALRQFFDKVPAKHSYGMLFCSCRD IACTERRRQTIVPVCSYEEREKPNCLNLQDSCKTNYICRSRLADFFTNCQPESRSVSS CLKENYADCLLAYSGLIGTVMTPNYIDSSSLSVAPWCDCSNSGNDLEECLKFLNFFKD NTCLKNAIQAFGNGSDVTVWQPAFPVQTTTATTTTALRVKNKPLGPAGSENEIPTHVL PPCANLQAQKLKSNVSGNTHLCISNGNYEKEGLGASSHITTKSMAAPPSCGLSPLLVL VVTALSTLLSLTETS" misc_feature 529..543 /gene="GFRA1" /note="found to be alternative exon after partial analysis of genomic sequence" BASE COUNT 409 a 448 c 419 g 343 t ORIGIN 1 cagagcagca cagctgtccg gggatcgctg cacgctgagc tccctcggca agacccagcg 61 gcggctcggg atttttttgg gggggcgggg accagccccg cgccggcacc atgttcctgg 121 cgaccctgta cttcgcgctg ccgctcttgg acttgctcct gtcggccgaa gtgagcggcg 181 gagaccgcct ggattgcgtg aaagccagtg atcagtgcct gaaggagcag agctgcagca 241 ccaagtaccg cacgctaagg cagtgcgtgg cgggcaagga gaccaacttc agcctggcat 301 ccggcctgga ggccaaggat gagtgccgca gcgccatgga ggccctgaag cagaagtcgc 361 tctacaactg ccgctgcaag cggggtatga agaaggagaa gaactgcctg cgcatttact 421 ggagcatgta ccagagcctg cagggaaatg atctgctgga ggattcccca tatgaaccag 481 ttaacagcag attgtcagat atattccggg tggtcccatt catatcagat gtttttcagc 541 aagtggagca cattcccaaa gggaacaact gcctggatgc agcgaaggcc tgcaacctcg 601 acgacatttg caagaagtac aggtcggcgt acatcacccc gtgcaccacc agcgtgtcca 661 acgatgtctg caaccgccgc aagtgccaca aggccctccg gcagttcttt gacaaggtcc 721 cggccaagca cagctacgga atgctcttct gctcctgccg ggacatcgcc tgcacagagc 781 ggaggcgaca gaccatcgtg cctgtgtgct cctatgaaga gagggagaag cccaactgtt 841 tgaatttgca ggactcctgc aagacgaatt acatctgcag atctcgcctt gcggattttt 901 ttaccaactg ccagccagag tcaaggtctg tcagcagctg tctaaaggaa aactacgctg 961 actgcctcct cgcctactcg gggcttattg gcacagtcat gacccccaac tacatagact 1021 ccagtagcct cagtgtggcc ccatggtgtg actgcagcaa cagtgggaac gacctagaag 1081 agtgcttgaa atttttgaat ttcttcaagg acaatacatg tcttaaaaat gcaattcaag 1141 cctttggcaa tggctccgat gtgaccgtgt ggcagccagc cttcccagta cagaccacca 1201 ctgccactac caccactgcc ctccgggtta agaacaagcc cctggggcca gcagggtctg 1261 agaatgaaat tcccactcat gttttgccac cgtgtgcaaa tttacaggca cagaagctga 1321 aatccaatgt gtcgggcaat acacacctct gtatttccaa tggtaattat gaaaaagaag 1381 gtctcggtgc ttccagccac ataaccacaa aatcaatggc tgctcctcca agctgtggtc 1441 tgagcccact gctggtcctg gtggtaaccg ctctgtccac cctattatct ttaacagaaa 1501 catcatagct gcattaaaaa aatacaatat ggacatgtaa aaagacaaaa accaagttat 1561 ctgtttcctg ttctcttgta tagctgaaat tccagtttag gagctcagtt gagaaacag // LOCUS AF042169 2361 bp mRNA PRI 22-JAN-1998 DEFINITION Homo sapiens putative ATP-dependent mitochondrial RNA helicase (SUV3) mRNA, nuclear gene encoding mitochondrial protein, complete cds. ACCESSION AF042169 NID g2801554 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2361) AUTHORS Dmochowska,A., Krawczyk,M., Kalita,K.B., Bartnik,E. and Stepien,P.P. TITLE The human SUV3 gene JOURNAL Unpublished REFERENCE 2 (bases 1 to 2361) AUTHORS Dmochowska,A., Krawczyk,M., Kalita,K.B., Bartnik,E. and Stepien,P.P. TITLE Direct Submission JOURNAL Submitted (08-JAN-1998) Department of Genetics, University of Warsaw, Pawinskiego 5A, Warsaw 02-106, Poland FEATURES Location/Qualifiers source 1..2361 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="D98-H2 HeLa" gene 1..2361 /gene="SUV3" CDS 1..2361 /gene="SUV3" /note="hsuv3p; similar to yeast SUV3" /codon_start=1 /product="putative ATP-dependent mitochondrial RNA helicase" /db_xref="PID:g2801555" /translation="MSFSRALLWARLPAGRQAGHRAAICSALRPHFGPFPGVLGQVSV LATASSSASGGSKIPNTSLFVPLTVKPQGPSADGDVGAELTRPLDKNEVKKVLDKFYK RKEIQKLGADYGLDARLFHQAFISFRNYIMQSHSLDVDIHIVLNDICFGAAHADDLFP FFLRHAKQIFPVLDCKDDLRKISDLRIPPNWYPDARAMQRKIIFHSGPTNSGKTYHAI QKYFSAKSGVYCGPLTSLAHEIFEKSNAAGVPCDLETGEERVTVQPNGKQASHVSCTV EMCSVTTPYEVAVIDEIQMIRDPARGWAWTRALLGLCAEEVHLCGEPAAIDLVMELMY TTGEEVEVRDYKRLTPISVLDHALESLDNLRPGDCIVCFSKNDIYSVSRQIEIRGLES AVIYGSLPPGTKLAQAKKFNDPNDPCKILVATDAIGMGLNLSIRRIIFYSLIKPSINE KGERELEPITTSQALQIAGRAGRFSSRFKEGEVTTMNHEDLSLLKEILKRPVDPIRAA GLHPTAEQIEMFAYHLPDATLSNLIDIFVDFSQVDGQYFVCNMDDFKFSAELIQHIPL SLRVRYVFCTAPINKKQPFVCSSLLQFARQYSRNEPLTFAWLRRYIKWPLLPPKNIKD LMDLEAVHDVLDLYLWLSYRFMDMFPDASLIRDLQKELDGIIQDGVHNITKLIKMSET HKLLNLEGFPSGSQSRLSGTLKSQARRTRGTKALGSKATEPPSPDAGELSLASRLVQQ GLLTPDMLKQLEKEWMTQQTEHNKEKTESGTHPKGTRRKKKEPDSD" BASE COUNT 656 a 526 c 559 g 620 t ORIGIN 1 atgtccttct cccgtgccct attgtgggct cggctcccgg cggggcgcca ggctggccac 61 cgggcagcca tctgctctgc ccttcgtccc cactttgggc cctttcccgg ggttctgggg 121 caagtttctg tccttgccac cgcctcctcc tctgcctccg gtggctccaa aataccaaac 181 acgtccttgt tcgtgcccct gactgtgaaa cctcagggcc ccagcgccga cggcgacgtc 241 ggggccgagc taacccggcc tctggacaag aatgaagtaa agaaggtctt agacaaattt 301 tacaagagga aagaaattca gaaactgggt gctgattatg gacttgatgc tcgtctcttc 361 caccaagctt tcataagctt tagaaattat attatgcagt ctcattccct ggatgtggac 421 attcacattg ttttgaatga tatttgcttc ggtgcagctc atgcggatga tttattccca 481 tttttcttga gacatgccaa acaaatattt cctgtgttgg actgtaagga tgatctacgt 541 aaaatcagcg acttaagaat accacctaac tggtacccag atgctagagc catgcagcgg 601 aagataatat ttcattcagg ccccacaaac agtggaaaga cttatcacgc aatccagaaa 661 tacttctcag caaagtctgg agtgtattgt ggccctctaa catcactggc acatgagatc 721 ttcgaaaaga gtaatgctgc tggcgtgcca tgtgacttgg agacaggtga agagcgtgtg 781 acagttcagc caaatgggaa acaggcttca catgtttctt gtacagttga gatgtgcagt 841 gttacaactc cttatgaagt ggctgtaatt gatgaaattc aaatgattag agatccagcc 901 agaggatggg cctggaccag agcacttcta ggactgtgtg ctgaagaggt tcatttgtgt 961 ggagaacctg ctgctattga cctggtgatg gagcttatgt acacaacggg ggaggaagtg 1021 gaggttcgag actataagag gcttaccccc atttctgtgc tggaccatgc actagaatct 1081 ttagataacc ttcggcctgg ggactgcatt gtctgtttta gcaagaatga tatttattct 1141 gtgagtcggc agattgaaat tcggggatta gaatcagctg ttatatatgg cagtctccca 1201 cctgggacca aacttgctca agcaaaaaag tttaatgatc ccaatgaccc atgcaaaatc 1261 ttggttgcta cagatgcaat tggcatggga cttaatttga gcataaggag aattattttt 1321 tactccctta taaagcccag tatcaatgaa aagggagaga gagaactaga accaatcaca 1381 acctctcaag ccctgcagat tgctggcaga gctggcagat tcagctcacg gtttaaagaa 1441 ggagaggtta caacaatgaa tcatgaagat ctcagtttat taaaggaaat tttgaagagg 1501 cctgtggatc ctataagggc agctggtctt catccaactg ctgagcagat tgaaatgttt 1561 gcctaccatc tccctgatgc aacactgtcc aatctcattg atatttttgt agacttttca 1621 caagttgatg ggcagtattt tgtctgcaat atggatgatt ttaaattttc tgcagagttg 1681 atccagcata ttccactaag tctgcgagtg aggtatgttt tctgcacagc tcctatcaac 1741 aagaagcagc cttttgtgtg ttcttcactg ttacagtttg ccaggcagta tagcaggaat 1801 gagcccctga cctttgcatg gttacgccga tacatcaaat ggcctttact tccacctaag 1861 aatattaaag acctcatgga tcttgaagct gtccacgatg tcttggatct ttacttgtgg 1921 ctaagctacc gatttatgga tatgtttcca gatgccagcc ttattcgaga tctccagaaa 1981 gaactagatg gtattatcca agatggtgtg cacaatatca ctaaattgat taaaatgtct 2041 gagacgcata agctgttgaa tttggagggc tttccatcag ggagccagtc acgattgtca 2101 ggaaccttaa agagccaagc tagaaggaca cgcggcacca aagctctagg gagtaaagct 2161 actgagccac ccagccccga tgcaggagag ctgtcccttg cttccagatt ggtgcagcaa 2221 ggactcctca ctccagacat gctgaaacag ctagaaaaag agtggatgac acaacaaact 2281 gaacacaaca aagaaaaaac agagtctggg actcatccaa aagggacgag aagaaagaag 2341 aaggaacctg attcggacta g // LOCUS AF042378 3795 bp mRNA PRI 22-JAN-1998 DEFINITION Homo sapiens spindle pole body protein spc98 homolog mRNA, complete cds. ACCESSION AF042378 NID g2801698 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3795) AUTHORS Murphy,S.M., Urbani,L. and Stearns,T. TITLE The mammalian gamma-tubulin complex contains homologs of the yeast spindle pole body proteins spc98 and spc97 JOURNAL Unpublished REFERENCE 2 (bases 1 to 3795) AUTHORS Murphy,S.M. and Stearns,T. TITLE Direct Submission JOURNAL Submitted (09-JAN-1998) Biological Sciences, Stanford University, Gilbert Bldg, Rm 208, Stanford, CA 94305-5020, USA FEATURES Location/Qualifiers source 1..3795 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 85..2808 /note="component of cytoplasmic gamma-tubulin complex; localized to the centrosome" /codon_start=1 /product="spindle pole body protein spc98 homolog" /db_xref="PID:g2801699" /translation="MATPDQKSPNVLLQNLCCRILGRSEADVAQQFQYAVRVIGSNFA PTVERDEFLVAEKIKKELIRQRREADAALFSELHRKLHSQGVLKNKWSILYLLLSLSE DPRRQPSKVSSYATLFAQALPRDAHSTPYYYARPQTLPLSYQDRSAQSAQSSGSVGSS GISSIGLCALSGPAPAPQSLLPGQSNQAPGVGDCLRQQLGSRLAWTLTANQPSSQATT SKGVPSAVSRNMTRSRREGDTGGTMEITEAALVRDILYVFQGIDGKNIKMNNTENCYK VEGKANLSRSLRDTAVRLSELGWLHNKIRRYTDQRSLDRSFGLVGQSFCAALHQELRE YYRLLSVLHSQLQLEDDQGVNLGLESSLTLRRLLVWTYDPKIRLKTLAALVDHCQGRK GGELASAVHAYTKTGDPYMRSLVQHILSLVSHPVLSFLYRWIYDGELEDTYHEFFVAS DPTVKTDRLWHDKYTLRKSMIPSFMTMDQSRKVLLIGKSINFLHQVCHDQTPTTKMIA VTKSAESPQDAADLFTDLENAFQGKIDAAYFETSKYLLDVLNKKYSLLDHMQAMRRYL LLGQGDFIRHLMDLLKPELVRPATTLYQHNLTGILETAVRATNAQFDSPEILRRLDVR LLEVSPGDTGWDVFSLDYHVDGPIATVFTRECMSHYLRVFNFLWRAKRMEYILTDIRK GHMCNAKLLRNMPEFSGVLHQCHILASEMVHFIHQMQYYITFEVLECSWDELWNKVQQ AQDLDHIIAAHEVFLDTIISRCLLDSDSRALLNQLRAVFDQIIELQNAQDAIYRAALE ELQRRLQFEEKKKQREIEGQWGVTAAEEEEENKRIGEFKESIPKMCSQLRILTHFYQG IVQQFLVLLTTSSDESLRFLSFRLDFNEHYKAREPRLRVSLGTRGRRSSHT" BASE COUNT 1059 a 832 c 903 g 1001 t ORIGIN 1 caggaagggc gcgggccgcg gtccctgcgc gtgcggcggc agtggcggct ctgcccggac 61 caccgtgcac ggctccgggc gaggatggcg accccggacc agaagtcgcc gaacgttctg 121 ctgcagaacc tgtgctgcag gatcctgggc aggagcgaag ctgatgtagc ccagcagttc 181 cagtatgctg tgcgggtgat tggcagcaac ttcgccccaa ctgttgaaag agatgaattt 241 ttagtagctg aaaaaatcaa gaaagagctt attcgacaac gaagagaagc agatgctgca 301 ttattttcag aactccacag aaaacttcat tcacagggag ttttgaaaaa taaatggtca 361 atactctacc tcttgctgag cctcagtgag gacccacgca ggcagccaag caaggtttct 421 agctatgcta cgttatttgc tcaggcctta ccaagagatg cccactcaac cccttactac 481 tatgccaggc ctcagaccct tcccctgagc taccaagatc ggagtgccca gtcagcccag 541 agctccggca gcgtgggcag cagtggcatc agcagcattg gcctgtgtgc cctcagtggc 601 cccgcgcctg cgccacaatc tctcctccca ggacagtcta atcaagctcc aggagtagga 661 gattgccttc gacagcagtt ggggtcacga ctcgcatgga ctttaactgc aaatcagcct 721 tcttcacaag ccactacctc aaaaggtgtc cccagtgctg tgtctcgcaa catgacaagg 781 tccaggagag aaggggatac gggtggtact atggaaatta cagaagcagc tctggtaagg 841 gacattttgt acgtctttca gggcatagat ggcaaaaaca tcaaaatgaa caacactgaa 901 aattgttaca aagtagaagg aaaggcaaat ctaagtaggt ctttgagaga cacagcagtc 961 aggctttctg agttgggatg gttgcataat aaaatcagaa gatacacgga ccagaggagc 1021 ctggaccgct cattcggact cgtcgggcag agcttttgtg ctgccttgca ccaggaactc 1081 agagaatact atcgattgct ctctgtttta cattctcagc tacaactaga ggatgaccag 1141 ggtgtgaatt tgggacttga gagtagttta acacttcggc gcctcctggt ttggacctat 1201 gatcccaaaa tacgactgaa gacccttgcg gccctagtgg accactgcca aggaaggaaa 1261 ggaggtgagc tggcctcagc tgtccacgcc tacacaaaaa caggagaccc gtacatgcgg 1321 tctctggtgc agcacatcct cagcctcgtg tctcatcctg ttttgagctt cctgtaccgc 1381 tggatatatg atggggagct tgaggacact taccacgaat tttttgtagc atcagatcca 1441 acagttaaaa cagatcgact gtggcacgac aagtatactt tgaggaaatc gatgattcct 1501 tcgtttatga cgatggatca gtctaggaag gtccttttga taggaaaatc aataaatttc 1561 ttgcaccaag tttgtcatga tcagactccc actacaaaga tgatagctgt gaccaagtct 1621 gcagagtcac cccaggacgc tgcagaccta ttcacagact tggaaaatgc atttcagggg 1681 aagattgatg ctgcttattt tgagaccagc aaatacctgt tggatgttct caataaaaag 1741 tacagcttgc tggaccacat gcaggcaatg aggcggtacc tgcttcttgg tcaaggagac 1801 tttataaggc acttaatgga cttgctaaaa ccagaacttg tccgtccagc tacgactttg 1861 tatcagcata acttgactgg aattctagaa accgctgtca gagccaccaa cgcacagttt 1921 gacagtcctg agatcctgcg aaggctggac gtgcggctgc tggaggtctc tccaggtgac 1981 actggatggg atgtcttcag cctcgattat catgttgacg gaccaattgc aactgtgttt 2041 actcgagaat gtatgagcca ctacctaaga gtatttaact tcctctggag ggcgaagcgg 2101 atggaataca tcctcactga catacggaag ggacacatgt gcaatgcaaa gctcctgaga 2161 aacatgccag agttctccgg ggtgctgcac cagtgtcaca ttttggcctc tgagatggtc 2221 catttcattc atcagatgca gtattacatc acatttgagg tgcttgaatg ttcttgggat 2281 gagctttgga acaaagtcca gcaggcccag gatttggatc acatcattgc tgcacacgag 2341 gtgttcttag acaccatcat ctcccgctgc ctgctggaca gtgactccag ggcactttta 2401 aatcaactta gagctgtgtt tgatcaaatt attgaacttc agaatgctca agatgcaata 2461 tacagagctg ctctggaaga attgcagaga cgattacagt ttgaagagaa aaagaaacag 2521 cgtgaaattg agggccagtg gggagtgacg gcagcagagg aagaggagga aaataagagg 2581 attggagaat ttaaagaatc tataccaaaa atgtgctcac agttgcgaat attgacccat 2641 ttctaccagg gtatcgtgca gcagtttttg gtgttactga cgaccagctc tgacgagagt 2701 cttcggtttc ttagcttcag gctggacttc aacgagcatt acaaagccag ggagcccagg 2761 ctccgtgtgt ctctgggtac cagggggcgg cgcagctccc acacgtgaag ctcgcggtcc 2821 tcccagggag ctgcgggtga tgttcgttgc actgctagac acgaaattcc cattgacgtc 2881 ctgcaggaac tgcatgctgc aggtgtcctg cccttccgcc cacgagtgcg ccatgtttca 2941 gcggagcggc gtgtgggaga agccacgtcg tgtttcacat gtcggagtcg aatgcatttg 3001 taaatcccta agtcaagtag gctggctgca ctgttcacat ttgtctctaa aagtcttcat 3061 cgctaaaaga taccataatt tgctgaggct tcttaagctt tctatgttat aatttatatt 3121 tgtcacttta aaaaatccat ttcttttaga aaaaattagg gtgataggat attcattagt 3181 taagatggta acgtcattgc tattttttta acatcctctt tagaggtaat ttttgttaac 3241 ataaccaaaa attaaattga aacaaaatgt cccaactaag aaaatatata gagcatttta 3301 ttttttttta gtgttgtaaa atattaacct ctgtgagatc ctttgtatct taatgcatta 3361 cctttacaca tatttattct tattttctct cctttcagag tttacatttt tatatttaat 3421 ttactatttc agatttttaa aatagtatag aaaaaagtag gagtgataga gaacaaaaat 3481 actcttatac agtgcaaccc aaataccgcg aatgcatcag ctaaagcagc gtgtaaatag 3541 gagtgatgag aaagttaatg gagtatttta ttttcaaagt tcctgataag cattggaaag 3601 aaatcgacat ggataatgaa gatttccttt ttccttgcct attttttcat tgtaaatatt 3661 tatatactac tgaccaagat gttggggtgg gggggattgt tttttgtaaa aatgtcatta 3721 tcaggtcaca taaatctgcc tttatgttgc ataagtgaaa atttagaaaa ttaaaagcaa 3781 ttatctttca aaaaa // LOCUS AF042379 2846 bp mRNA PRI 22-JAN-1998 DEFINITION Homo sapiens spindle pole body protein spc97 homolog mRNA, complete cds. ACCESSION AF042379 NID g2801700 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2846) AUTHORS Murphy,S.M., Urbani,L. and Stearns,T. TITLE The mammalian gamma-tubulin complex contains homologs of the yeast spindle pole body proteins spc98 and spc97 JOURNAL Unpublished REFERENCE 2 (bases 1 to 2846) AUTHORS Murphy,S.M. and Stearns,T. TITLE Direct Submission JOURNAL Submitted (09-JAN-1998) Biological Sciences, Stanford University, Gilbert Bldg, Rm 208, Stanford, CA 94305-5020, USA FEATURES Location/Qualifiers source 1..2846 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 64..2772 /note="component of cytoplasmic gamma-tubulin complex; localized to the centrosome" /codon_start=1 /product="spindle pole body protein spc97 homolog" /db_xref="PID:g2801701" /translation="MSEFRIHHDVNELLSLLRVHGGDGAEVYIDLLQKNRTPYVTTTV SAHSAKVKIAEFSRTPEDFLKKYDELKSKNTRNLDPLVYLLSKLTEDKETLQYLQQNA KERAELAAAAVGSSTTSINVPAAASKISMQELEELRKQLGSVATGSTLQQSLELKRKM LRDKQNKKNSGQHLPIFPAWVYERPALIGDFLIGAGISTDTALPIGTLPLASQESAVV EDLLYVLVGVDGRYVSAQPLAGRQSRTFLVDPNLDLSIRELVHRILPVAASYSAVTRF IEEKSSFEYGQVNHALAAAMRTLVKEHLILVSQLEQLHRQGLLSLQKLWFYIQPAMRT MDILASLATSVDKGECLGGSTLSLLHDRSFSYTGDSQAQELCLYLTKAASAPYFEVLE KWIYRGIIHDPYSEFMVEEHELRKERIQEDYNDKYWDQRYTIVQQQIPSFLQKMADKI LSTGKYLNVVRECGHDVTCPVAKEIIYTLKERAYVEQIEKAFNYASKVLLDFLMEEKE LVAHLRSIKRYFLMDQGDFFVHFMDLAEEELRKPVEDITPPRLEALLELALRMSTANT DPFKDDLKIDLMPHDLITQLLRVLAIETKQEKAMAHADPTELALSGLEAFSFDYIVKW PLSLIINRKALTRYQMLFRHMFYCKHVERQLCSVWISNKTAKQHSLHSAQWFAGAFTL RQRMLNFVQNIQYYMMFEVMEPTWHILEKNLKSASNIDDVLGHHTGFLDTCLKDCMLT NPELLKVFSKLMSVCVMFTNCMQKFTQSMKLDGELGGQTLEHSTVLGLPAGAEERARK ELARKHLAEHADTVQLVSGFEATINKFDKNFSAHLLDLLARLSIYSTSDCEHGMASVI SRLDFNGFYTERLERLSAERSQKATPQVPVLRGPPAPAPRVAVTAQ" BASE COUNT 664 a 829 c 812 g 541 t ORIGIN 1 ggcggaagtg gctccgggac tgcggagaac atattgtgat gttcgtgcct cagagctaaa 61 actatgagtg aatttcggat tcaccatgac gtcaatgaac tgcttagcct gctgcgtgtc 121 cacggaggag atggggctga ggtctacatt gacctgcttc aaaagaacag gaccccgtac 181 gtcactacca ctgtctctgc tcacagtgcc aaggttaaaa ttgcagagtt ttctcgtact 241 ccagaagact ttctaaagaa atatgatgaa ctgaaatcta aaaatacaag gaaccttgac 301 ccgctggtgt acctgttgtc aaagctcacg gaagacaaag agactctgca gtacttacaa 361 cagaatgcaa aagaaagagc tgagcttgca gccgctgctg tgggcagcag taccaccagc 421 atcaacgtcc ctgccgcggc ctccaagatc tccatgcagg agcttgagga actgaggaag 481 cagcttggca gcgtggccac aggctccacg ctgcagcagt ctctggaact taaaagaaag 541 atgcttcgag acaagcagaa caaaaaaaat tcaggccagc acctccccat cttcccagca 601 tgggtgtatg agagacctgc cctgatcggg gatttcctga ttggtgctgg catcagcaca 661 gacaccgctt tgccgatagg cacgttgccc ctggcctcgc aggagtcggc cgtggtggag 721 gacctgctgt acgtgctggt gggcgtggac gggaggtacg tcagtgctca gcccctggct 781 gggaggcaga gccggacctt cctcgtggac cccaacctgg acctgtccat cagggagctg 841 gtgcacagga tcctcccagt ggccgccagc tactccgctg tgaccaggtt cattgaagag 901 aagtcttcct tcgagtacgg gcaggtgaac cacgccctgg cggccgccat gcgcaccctg 961 gtgaaggagc acctgattct ggtgtcacag ctggagcagc tgcacaggca gggcctcctt 1021 tcgctgcaga agctctggtt ctacatccag ccagccatgc gcaccatgga catcctggcc 1081 tccctcgcca cctcggtgga caaaggcgaa tgtcttgggg ggtccacgct gagcctgctc 1141 cacgacagga gcttcagcta cacaggggac agccaggcgc aggagctatg cctgtaccta 1201 accaaggcgg ccagtgctcc ctacttcgag gttctggaga agtggatcta caggggcatc 1261 atccacgacc catacagtga gtttatggtc gaggagcacg agctgcggaa ggagaggatc 1321 caggaggatt acaacgacaa gtactgggac cagcggtaca ccatcgtcca gcagcagatc 1381 ccgtccttcc tgcagaaaat ggcggacaag atcctcagca caggaaaata tctaaatgtg 1441 gtcagagagt gtggccatga cgtcacctgc ccggtggcta aagagatcat ctacacgtta 1501 aaagagcggg cgtatgtgga gcagatcgag aaggcgttta actacgccag caaggtgctg 1561 ctggacttcc tgatggagga gaaggagctg gtggctcacc tcaggtccat caagcgctac 1621 ttcctcatgg accagggcga cttcttcgtg cacttcatgg acctcgcgga ggaggagctc 1681 cggaagccgg tggaggacat cacgccccct cgcctggaag cgctcctgga gctggcgctg 1741 cgcatgagca cggccaacac tgaccccttc aaggacgacc tcaagatcga cctgatgccc 1801 catgacctca tcactcagct cttgcgcgtc ctggccatcg agaccaagca ggagaaggcg 1861 atggcgcacg ccgaccccac ggagctggcg ctgagcggcc tggaggcctt ctctttcgac 1921 tacatcgtca agtggcccct ttcgctcatc atcaacagga aagccctcac tcgctaccag 1981 atgctcttca ggcacatgtt ctattgcaag cacgtggagc ggcagctctg cagcgtctgg 2041 atcagcaaca aaaccgccaa gcagcactcg ctgcactccg cccagtggtt tgctggggct 2101 ttcactctgc ggcagcgaat gctcaacttc gtccagaata ttcaatacta catgatgttc 2161 gaagtgatgg aaccgacctg gcacatcctg gagaaaaacc tgaaatccgc ctccaacatt 2221 gacgacgtcc ttggccacca cacaggcttc ctggacacct gcctgaagga ctgcatgctc 2281 accaaccccg agctgctgaa ggtcttctcc aagctcatgt ctgtgtgcgt catgttcacc 2341 aactgcatgc agaaatttac acagagcatg aaattagatg gcgagctggg cgggcagacg 2401 ctggagcaca gcaccgtcct ggggctgccc gcaggggccg aggagcgggc ccggaaggag 2461 ctcgccagga agcacctggc tgagcacgca gacactgtgc agctggtgtc cggcttcgag 2521 gccaccatca acaagtttga caagaacttc tcagcccacc tgctggacct cctggcccgg 2581 ctgagcatct atagcaccag tgactgtgag cacggcatgg ccagcgtcat ctccaggctt 2641 gacttcaatg gtttctacac ggagcgcctg gagcgcctgt ctgcagagag gagccagaag 2701 gccacccccc aagtgcctgt cctgcggggg cccccggctc ctgcacccag ggtcgcagtc 2761 accgcacagt gagccctggc tgtgacagga aggaagggtg tggggtcagc agggactggt 2821 gcaaatgggt ccagaatttt caaatc // LOCUS AF042384 903 bp mRNA PRI 02-FEB-1998 DEFINITION Homo sapiens BC-2 protein mRNA, complete cds. ACCESSION AF042384 NID g2828146 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 903) AUTHORS Slater,C., Thill,G. and Obar,R. TITLE Direct Submission JOURNAL Submitted (13-JAN-1998) Protein Chemistry, Matritech, Inc., 330 Nevada Street, Newton, MA 02160, USA FEATURES Location/Qualifiers source 1..903 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 130..798 /note="p32; putative breast adenocarcinoma marker" /codon_start=1 /product="BC-2 protein" /db_xref="PID:g2828147" /translation="MDLLFGRRKTPEELLRQNQRALNRAMRELDRERQKLETQEKKII ADIKKMAKQGQMDAVRIMAKDLVRTRRYVRKFVLMRANIQAVSLKIQTLKSNNSMAQA MKGVTKAMGTMNRQLKLPQIQKIMMEFERQAEIMDMKEEMMNDAIDDAMGDEEDEEES DAVVSQVLDELGLSLTDELSNLPSTGGSLSVAAGGKKAEAAASALADADADLEERLKN LRRD" BASE COUNT 255 a 214 c 277 g 157 t ORIGIN 1 cggcggcggc gacaggaccg aggggcctta gttggtgggc aagtcgggga tcccagaaag 61 agaagcgtga cccggaagcg gaaacgggtg tccgtcccag ctccggcctg ccagtgagct 121 tctaccatca tggacctatt gttcgggcgc cggaagacgc cagaggagct actgcggcag 181 aaccagaggg ccctgaaccg tgccatgcgg gagctggacc gcgagcgaca gaaactagag 241 acccaggaga agaaaatcat tgcagacatt aagaagatgg ccaagcaagg ccagatggat 301 gctgttcgca tcatggcaaa agacttggtg cgcacccggc gttatgtgcg caagtttgta 361 ttgatgcggg ccaacatcca ggctgtgtcc ctcaagatcc agacactcaa gtccaacaac 421 tcgatggcac aagccatgaa gggtgtcacc aaggccatgg gcaccatgaa cagacagctg 481 aagttgcccc agatccagaa gatcatgatg gagtttgagc ggcaggcaga gatcatggat 541 atgaaggagg agatgatgaa tgatgccatt gatgatgcca tgggtgatga ggaagatgaa 601 gaggagagtg atgctgtggt gtcccaggtt ctggatgagc tgggacttag cctaacagat 661 gagctgtcga acctcccctc aactgggggc tcgcttagtg tggctgctgg tgggaaaaaa 721 gcagaggccg cagcctcagc cctagctgat gctgatgcag acctggagga acggcttaag 781 aacctgcgga gggactgagt gcccctgcca ctccgagata accagtggat gcccaggatc 841 ttttaccaca acccctctgt aataaaagag atttgacact aaaaaaaaaa aaaaaaaaaa 901 aaa // LOCUS AF042385 1324 bp mRNA PRI 02-FEB-1998 DEFINITION Homo sapiens cyclophilin-33A (CYP-33) mRNA, complete cds. ACCESSION AF042385 NID g2828148 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1324) AUTHORS Slater,C., Thill,G. and Obar,R. TITLE Direct Submission JOURNAL Submitted (13-JAN-1998) Protein Chemistry, Matritech, Inc., 330 Nevada Street, Newton, MA 02160, USA FEATURES Location/Qualifiers source 1..1324 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..1324 /gene="CYP-33" CDS 61..966 /gene="CYP-33" /note="putative breast cancer marker BC-8A; p33A; RNA-binding nuclear cyclophilin isoform A" /codon_start=1 /product="cyclophilin-33A" /db_xref="PID:g2828149" /translation="MATTKRVLYVGGLAEEVDDKVLHAAFIPFGDITDIQIPLDYETE KHRGFAFVEFELAEDAAAAIDNMNESELFGRTIRVNLAKPMRIKEGSSRPVWSDDDWL KKFSGKTLEENKEEEGSEPPKAETQEGEPIAKKARSNPQVYMDIKIGNKPAGRIQMLL RSDVVPMTAENFRCLCTHEKGFGFKGSSFHRIIPQFMCQGGDFTNHNGTGGKSIYGKK FDDENFILKHTGPGLLSMANSGPNTNGSQFFLTCDKTDWLDGKHVVFGEVTEGLDVLR QIEAQGSKDGNPKQKVIIADCGEYV" BASE COUNT 343 a 323 c 364 g 294 t ORIGIN 1 ctactactac taggccacgc gtcgactagt acgggggggg ggggaaagcg cgcgagcaag 61 atggccacca ccaagcgcgt cttgtacgtg ggtggactgg cagaggaagt ggacgacaaa 121 gttcttcatg ctgcgttcat tccttttgga gacatcacag atattcagat tcctctggat 181 tatgaaacag aaaagcaccg aggatttgct tttgttgaat ttgagttggc agaggatgct 241 gcagcagcta tcgacaacat gaatgaatct gagctttttg gacgtacaat tcgtgtcaat 301 ttggccaaac caatgagaat taaggaaggc tcttccaggc cagtttggtc agatgatgac 361 tggttgaaga agttttctgg gaagacgctt gaagagaata aagaggaaga agggtcagag 421 cctcccaaag cagagaccca ggagggagag cccattgcta aaaaggcccg ctcaaatcct 481 caggtgtaca tggacatcaa gattgggaac aagccggctg gccgcatcca gatgctcctg 541 cgttctgatg tcgtgcccat gacagcagag aatttccgct gcctgtgcac tcatgaaaag 601 ggctttggct ttaagggaag cagcttccac cgcatcatcc cccagttcat gtgccagggc 661 ggtgatttca caaaccacaa tggcactggg ggcaagtcca tctatgggaa gaagttcgat 721 gatgaaaact ttatcctcaa gcatacggga ccaggtctac tatccatggc caactctggc 781 ccaaacacca atggctctca gttcttcctg acatgtgaca agacagactg gctggatggc 841 aagcatgtgg tgtttggaga ggtcaccgaa ggcctagatg tcttgcggca aattgaggcc 901 cagggcagca aggacgggaa tccaaagcag aaggtgatca tcgccgactg tggggagtac 961 gtgtgaggcg gcactctcta tgattccccc tccgctcttg accctgcata tccaggaagg 1021 aactgccagc ctcagaggag gcacaccgag ggtgcctgtt tgaagcaagc agcatttggg 1081 atatgtgccc ttcctcaggg tctgcttgga gcagctcctc tgcagcacag cctggactat 1141 tcccaggcac agctgtgggc ccaggagcca gctcaggtgc tcccctccac catgggcagg 1201 ctgtgcaaaa agccactggt ttttctcagc atttgctgct gggcctctcc tgggactacc 1261 agtgtggctc ttacgtgttt tctttgctaa aataaaccct agtcttaaaa aaaaaaaaaa 1321 aaaa // LOCUS AF042792 5463 bp mRNA PRI 17-JAN-1998 DEFINITION Homo sapiens alpha 2 delta calcium channel subunit isoform I mRNA, complete cds. ACCESSION AF042792 NID g2781438 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5463) AUTHORS Wei,M.-H., Latif,F., Duh,F.-M., Adreazzoli-Angeloni,D., Kashuba,V., Zabarovsky,E., Johnson,B. and Lerman,M.I. TITLE A new alpha 2 delta subunit of the L-type voltage gated calcium channel resides in the lung cancer critical region on 3p21.3 JOURNAL Unpublished REFERENCE 2 (bases 1 to 5463) AUTHORS Wei,M.-H., Latif,F., Duh,F.-M., Adreazzoli-Angeloni,D., Kashuba,V., Zabarovsky,E., Johnson,B. and Lerman,M.I. TITLE Direct Submission JOURNAL Submitted (12-JAN-1998) Laboratory of Immunobiology, National Cancer Institute, NCI-Frederick Cancer Research and Development Center, Bldg 560, Rm. 12-71, P.O.Box B, Frederick, MD 21702, USA FEATURES Location/Qualifiers source 1..5463 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="3" /map="3p21.3" CDS 162..3599 /codon_start=1 /product="alpha 2 delta calcium channel subunit isoform I" /db_xref="PID:g2781439" /translation="MAVPARTCGASRPGPARTARPWPGCGPHPGPGTRRPTSGPPRPL WLLLPLLPLLAAPGASAYSFPQQHTMQHWARRLEQEVDGVMRIFGGVQQLREIYKDNR NLFEVQENEPQKLVEKVAGDIESLLDRKVQALKRLADAAENFQKAHRWQDNIKEEDIV YYDAKADAELDDPESEDVERGSKASTLRLDFIEDPNFKNKVNYSYAAVQIPTDIYKGS TVILNELNWTEALENVFMENRRQDPTLLWQVFGSATGVTRYYPATPWRAPKKIDLYDV RRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSVCEMLDTLSDDDYVNVASFNE KAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRANC NKMIMMFTDGGEDRVQDVFEKYNWPNRTVRVFTFSVGQHNYDVTPLQWMACANKGYYF EIPSIGAIRINTQEYLDVLGRPMVLAGKEAKQVQWTNVYEDALGLGLVVTGTLPVFNL TQDGPGEKKNQLILGVMGIDVALNDIKRLTPNYTLGANGYVFAIDLNGYVLLHPNLKP QTTNFREPVTLDFLDAELEDENKEEIRRSMIDGNKGHKQIRTLVKSLDERYIDEVTRN YTWVPIRSTNYSLGLVLPPYSTFYLQANLSDQILQVKYFEFLLPSSFESEGHVFIAPR EYCKDLNASDNNTEFLKNFIELMEKVTPDSKQCNNFLLHNLILDTGITQQLVERVWRD QDLNTYSLLAVFAATDGGITRVFPNKAAEDWTENPEPFNASFYRRSLDNHGYVFKPPH QDALLRPLELENDTVGILVSTAVELSLGRRTLRPAVVGVKLDLEAWAEKFKVLASNRT HQDQPQKCGPNSHCEMDCEVNNEDLLCVLIDDGGFLVLSNQNHQWDQVGRFFSEVDAN LMLALYNNSFYTRKESYDYQAACAPQPPGNLGAAPRGVFVPTVADFLNLAWWTSAAAW SLFQQLLYGLIYHSWFQADPAEAEGSPETRESSCVMKQTQYYFGSVNASYNAIIDCGN CSRLFHAQRLTNTNLLFVVAEKPLCSQCEAGRLLQKETHCPADGPEQCELVQRPRYRR GPHICFDYNATEDTSDCGRGASFPPSLGVLVSLQLLLLLGLPPRPQPQVLVHASRRL" BASE COUNT 1211 a 1664 c 1501 g 1087 t ORIGIN 1 gccagcgctg cagggagata gcagcgcgca gcccgcagag gcgctgcggc ccgtgcagcc 61 ccggaggccc ctcgcggaga aggcggcggc ggaggagagg ccgagttacc gcccgccgcc 121 cgcgcccccc ctccccgcgg cgccgcatct tgaatggaaa catggcggtg ccggctcgga 181 cctgcggcgc ctctcggccc ggcccagcgc ggactgcgcg cccctggccc ggctgcggcc 241 cccaccctgg ccccggcacc cggcgcccga cgtccgggcc cccgcgcccg ctgtggctgc 301 tgctgccgct tctaccgctg ctcgccgccc ccggcgcctc tgcctacagc ttcccccagc 361 agcacacgat gcagcactgg gcccggcgtc tggagcagga ggtcgacggc gtgatgcgga 421 tttttggagg cgtccagcag ctccgtgaga tttacaagga caaccggaac ctgttcgagg 481 tacaggagaa tgagcctcag aagttggtgg agaaggtggc aggggacatt gagagccttc 541 tggacaggaa ggtgcaggcc ctgaagagac tggctgatgc tgcagagaac ttccagaaag 601 cacaccgctg gcaggacaac atcaaggagg aagacatcgt gtactatgac gccaaggctg 661 acgctgagct ggacgaccct gagagtgagg atgtggaaag ggggtctaag gccagcaccc 721 taaggctgga cttcatcgag gacccaaact tcaagaacaa ggtcaactat tcatacgcgg 781 ctgtacagat ccctacggac atctacaaag gctccactgt catcctcaat gagctcaact 841 ggacagaggc cctggagaat gtgttcatgg aaaaccgcag acaagacccc acactgctgt 901 ggcaggtctt cggcagcgcc acaggagtca ctcgctacta cccggccacc ccgtggcgag 961 cccccaagaa gatcgacctg tacgatgtcc gaaggagacc ctggtatatc cagggggcct 1021 cgtcacccaa agacatggtc atcatcgtgg atgtgagtgg cagtgtgagc ggcctgaccc 1081 tgaagctgat gaagacatct gtctgcgaga tgctggacac gctgtctgat gatgactatg 1141 tgaatgtggc ctcgttcaac gagaaggcac agcctgtgtc atgcttcaca cacctggtgc 1201 aggccaatgt gcgcaacaag aaggtgttca aggaagctgt gcagggcatg gtggccaagg 1261 gcaccacagg ctacaaggcc ggctttgagt atgcctttga ccagctgcag aactccaaca 1321 tcactcgggc caactgcaac aagatgatca tgatgttcac ggatggtggt gaggaccgcg 1381 tgcaggacgt ctttgagaag tacaattggc caaaccggac ggtgcgcgtg tttactttct 1441 ccgtggggca gcataactat gacgtcacac cgctgcagtg gatggcctgt gccaacaaag 1501 gctactattt tgagatccct tccatcggag ccatccgcat caacacacag gaatatctag 1561 atgtgttggg caggcccatg gtgctggcag gcaaggaggc caagcaggtt cagtggacca 1621 acgtgtatga ggatgcactg ggactggggt tggtggtaac agggaccctc cctgttttca 1681 acctgacaca ggatggccct ggggaaaaga agaaccagct gatcctgggc gtgatgggca 1741 ttgacgtggc tctgaatgac atcaagaggc tgacccccaa ctacacgctt ggagccaacg 1801 gctatgtgtt tgccattgac ctgaacggct acgtgttgct gcaccccaat ctcaagcccc 1861 agaccaccaa cttccgggag cctgtgactc tggacttcct ggatgcggag ctagaggatg 1921 agaacaagga agagatccgt cggagcatga ttgatggcaa caagggccac aagcagatca 1981 gaacgttggt caagtccctg gatgagaggt acatagatga ggtgacacgg aactacacct 2041 gggtgcctat aaggagcact aactacagcc tggggctggt gctcccaccc tacagcacct 2101 tctacctcca agccaatctc agtgaccaga tcctgcaggt caagtatttt gagttcctgc 2161 tccccagcag ctttgagtct gaaggacacg ttttcattgc tcccagagag tactgcaagg 2221 acctgaatgc ctcagacaac aacaccgagt tcctgaaaaa ctttattgag ctcatggaga 2281 aagtgactcc agactccaag cagtgcaaca acttccttct gcacaacctg atcttggaca 2341 cgggcatcac gcagcagctg gtagagcgtg tgtggaggga ccaggatctc aacacgtaca 2401 gcctactggc cgtgttcgct gccacagacg gtggcatcac ccgagtcttc cccaacaagg 2461 cagctgagga ctggacagag aaccctgagc ccttcaatgc cagcttctac cgccgcagcc 2521 tggataacca cggttatgtc ttcaagcccc cacaccagga tgccctgtta aggccgctgg 2581 agctggagaa tgacactgtg ggcatcctcg tcagcacagc tgtggagctc agcctaggca 2641 ggcgcacact gaggccagca gtggtgggcg tcaagctgga cctagaggct tgggctgaga 2701 agttcaaggt gctagccagc aaccgtaccc accaagacca gcctcagaag tgcggcccca 2761 acagccactg tgagatggac tgcgaggtta acaatgagga cttactctgt gtcctcattg 2821 atgatggagg attcctggtg ctgtcaaacc agaaccatca gtgggaccag gtgggcaggt 2881 tcttcagtga ggtggatgcc aacctgatgc tggcactcta caataactcc ttctacaccc 2941 gcaaggagtc ctatgactat caggcagcct gtgcccctca gccccctggc aacctgggtg 3001 ctgcaccccg gggtgtcttt gtgcccaccg ttgcagattt ccttaacctg gcctggtgga 3061 cctctgctgc cgcctggtcc ctgttccagc agcttctcta cggcctcatc taccacagct 3121 ggttccaagc agaccccgcg gaggccgagg ggagccccga gacgcgcgag agcagctgcg 3181 tcatgaaaca gacccagtac tacttcggct cggtaaacgc ctcctacaac gccatcatcg 3241 actgcggaaa ctgctccagg ctgttccacg cgcagagact gaccaacacc aatcttctct 3301 ttgtggtggc cgagaagccg ctgtgcagcc agtgcgaggc tggccggctg ctgcagaagg 3361 agacgcactg cccagcggac ggcccggagc agtgtgagct agtgcagaga ccgcgatacc 3421 ggagaggccc gcacatctgc ttcgactaca acgcgacaga agatacctca gactgtggcc 3481 gcggggcctc cttcccgccg tcgctgggcg tcctggtctc cctgcaactg ctgctcctcc 3541 tgggcctgcc gccccggccg cagcctcaag tcctcgtcca cgcctctcgc cgcctctgag 3601 caccctgccc caccccacct ccactcccac ctcacccggc ctcttcgcct ttcccaccct 3661 cctgccccac actccccgcc ttagagcctc gtccctccct cactgaagga cctgagctgg 3721 ccaggccctg agagtctggt ctgcgccttg ggatggggag tcccaaagcg ggacgccgca 3781 ggtgtttggc acccaaatca catctcacct ccgaactgtt caagtgtccc cagacccttc 3841 ttgcctgctg ggctcccccc agtgggatgg gacagggagg ccacacgcac tggtgccaaa 3901 accaggcctc tgctgccgcc cttcctggag gctgcctatg ttggggggga ccctgcctca 3961 gctgacccgg cctctctgcc ccacccaagc ccaaacttgg tttctgtgag aatagtggag 4021 gaaggtgaga tggccagttt gaagcctgtg cctcccagct taaatcctag caggagagag 4081 gctctggggc agcccccatg ggctcctgcc cctttcaggc ctacagccac atccccaagc 4141 ccaccaggtg tcaggatagt cacagtgata ccagttcaga cactacccca tatacacctg 4201 gaacattgag gatggaaact ggactcacat tcgacatacc ccactgggca cacgcacaaa 4261 cacacacact atggggtggg gtgggtgtag gggcttacaa agccttacac agggcgaggg 4321 gttggtggga gggttggcac ctgcacactc catctcctgc tcaccacctg cctctaatct 4381 gagctgcagc ctggctggtc ctcccatttc taaagctgaa tgtcaaacag tgccaaatgc 4441 tggggcaggg ggtgaagaac cctctgtccc acccctagcc accagtgtcc tccaagtgcc 4501 ccctcacctc tccaggtgct cattgtaacc atttctcact agtgtcaggc ccccagtggg 4561 accacatgcc actgcctgca cctttcggca gaggaacccc caccagacat caccctttgc 4621 cttagcaggg gtgactttgt ctctcctggc tgggccatcc ttccgccaat ctggccctta 4681 cacactcagg cctgtgccca ctccctatct ccttcccacc cctacacaca cactccctgc 4741 ttgcaggagg ccaaactgtc cctcccttgc tgaacacaca cacacacaca cacacacagg 4801 tggggactgg gcacagctct tcacaccatt cattctggtc atttccccca aaggcatccc 4861 agcctggggg ccagtgggga actgagggca aggggatata gtgatggggc tcagatggac 4921 tgggaggagg gggagggtga tgcattaatt aatggcttcg ttaattaatg tcatgttgct 4981 tgtcgctttc tcagtgtgtg tgtgtggtcc atgcccactg ctggtgccag ggtgggtgtc 5041 catgtgcacc cggcctggat gccagctgtg tccttcgggg gcgtgcgtgt aactgtagtg 5101 tagtcaggtg ctcaatggag aatataaaca tatacagaaa aatatatatt ttaagtttaa 5161 aaaacagaaa aacagacaaa acaatcccca tcaggtagct gtctaacccc cagctgggtc 5221 taatccttct cattacccac ccgacctggc tgcccctcac cttgggctgg gggactgggg 5281 ggccatttcc ttttctctgc cctttttttg ttgttctatt ttgtacagac aagttggaaa 5341 aacaacagcg acaaaaaagt caagaaactt tgtaaaatat cgtgtgtgtg attccttgta 5401 aaatattttc aaatggttta ttacagaaga tcagttatta aataatgttc atattttcac 5461 ttc // LOCUS AF042832 4258 bp mRNA PRI 04-FEB-1998 DEFINITION Homo sapiens forkhead-related transcription factor FREAC-9 (FKHL17) mRNA, complete cds. ACCESSION AF042832 NID g2829130 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4258) AUTHORS Ernstsson,S., Betz,R., Lagercrantz,S., Larsson,C., Ericksson,S., Cederberg,A., Carlsson,P. and Enerback,S. TITLE Cloning and characterization of freac-9 (FKHL17), a novel kidney-expressed human forkhead gene that maps to chromosome 1p32-p34 JOURNAL Genomics 46 (1), 78-85 (1997) MEDLINE 98066765 REFERENCE 2 (bases 1 to 4258) AUTHORS Enerback,S. TITLE Direct Submission JOURNAL Submitted (13-JAN-1998) Molecular Biology, Lundberg Laboratory/Goteborg University, Medicinareg. 9C, Goteborg S-413 90, Sweden FEATURES Location/Qualifiers source 1..4258 /organism="Homo sapiens" /db_xref="taxon:9606" /map="1p32-p34" /chromosome="1" gene 1..4258 /note="freac-9" /gene="FKHL17" CDS 2128..3246 /gene="FKHL17" /codon_start=1 /product="forkhead-related transcription factor FREAC-9" /db_xref="PID:g2829131" /translation="MTLGSCCCEIMSSESSPAALSEADADIDVVGGGSGGGELPARSG PRAPRDVLPHGHEPPAEEAEADLAEDEEESGGCSDGEPRALASRGAAAAAGSPGPGAA AARGAAGPGPGPPSGGAATRSPLVKPPYSYIALITMAILQSPKKRLTLSEICEFISGR FPYYREKFPAWQNSIRHNLSLNDCFVKIPREPGNPGKGNYWTLDPESADMFDNGSFLR RRKRFKRQPLPPPHPHPHPHPELLLRGGAAAAGDPGAFLPGFAAYGAYGYGYGLALPA YGAPPPGPAPHPHPHPHPHAFAFAAAAAAAPCQLSVPPGRAPRLHPDLRRPRCSQARD RPQLLRLPQARARARAPQACPPSWARSWAAPKPSTRRP" BASE COUNT 720 a 1429 c 1401 g 708 t ORIGIN 1 gaattccggc ccttggtcag aggtcccaga gggggcccag gcgacttcct ctcaaaggac 61 atgagtagtc agacctgcct tccatttttt tctgccctta ccttgggggg atgaagaact 121 ctattttgag gattctgtga aaagggttgc aacccaactc tgttctattt ctttgcagga 181 gcctgggtga gttctttctc cctatcctgg gcctctgtag ctccatctgt ggaatgggag 241 agttggatta acaccaggct atcccgtaga ttggaggctg ctgttagatg aaactgcaaa 301 gtgttctcag taccctttcc taaatggcct agaatatggc tacacttata catcagttct 361 gaggtaggtg ttactagcct catttttaca aaggcagaaa caaagcgcaa agaaccttgc 421 ccaaggtgac agatagggta agtagtgaag gcaggattgg aaccccagga gcctgactta 481 aggtcgctgc tcttaactac tatgcagacc aggaaactga ggctccgaag cggggaagtc 541 acctaatgag ggtcacacag ccttgctaca gcggaggcag ctgggattcc ggcctcgcac 601 gcccgagctt tagccactgt gcggcgctgt cggggcagca gcccaggcat ccgctcagtt 661 cctgctcacg tccagcgccc atctcccctt ccgtgcccca gaggctgaaa tgggcccaat 721 ctgtgttgaa gctgcttgtt tctgggtcct ggagggaggc cgggggagaa agaggtctaa 781 taggatgggc aggaacccgg ctagcgagaa acaaggcaca gggccaacct tagttgcaca 841 agacttttga gagatgaagg caacacttgg tggaggtcag ggttcactga gggggtgtcg 901 ggcagggggc ggaggaggtc ctgccgcctg tcggtggggc ccagaaccct gggagccagg 961 aaggtggggg gtgtccagtc ctcagggcaa ctggggccag ggagagggga gggagaaatt 1021 aagcggattc ccagacccat gaacagacat ccagacaaat gccgtttcca cagagactca 1081 cggagagtga gatagtcaca gagactgaga caaacggatg ccgagatcca gagggtcgga 1141 gaacaggcaa actcggagca gtagaccggc ccgcgccccg ggaacgcggc cacagcggcc 1201 ggcgctttgc ccggcctctc cgcgcggaga ggggtggcgg gcccgacagc ggagggagga 1261 gagggcgccg ccgcaatctc acctctccca ggcccaaggc tgcgggcgcc aagttcgctc 1321 ctggtgatca ctcgaatgcg tctcccctcg ggctggaagc gtgcgggctc gggtgggcgc 1381 cggcgggccg cgctggggcg gcgtgggtcc cggcgtctca gcctcgcgct cccactgcgc 1441 ttcggcccgg tggcccgggc gccgccttgg ggagaggacg ggtggggacc gccgccactt 1501 ccctggctgc ggctggcgcc cgagtgagcc ttaacatcca ggggctgagc gttctgaagg 1561 cggcggcttc agggagcaca gggtgcagga gcggcggcga agacaagggc ccgcctccgg 1621 ccactcgagc ccagctcccg ccgcggcggc ggtttgttcc cgccgggtcc ctcagcggag 1681 gcgctacgcc cgcccctgtc gcctcgcccc accccgccca gggagctccg ccctagcccg 1741 cagctcttcc gccttaggca gccgctaggc gggagggaca atcccccacc accaaccact 1801 gccaccccga ggggactagg ggctgaggcc cgcccaggta agggaaagcc tcagctcctt 1861 ccgttgcgcc ccagcggcgg gtcccagctc ggattcccgg ggtagtggcg ggggccgccg 1921 gcgggtcgtg ccctggaagg tgagcgcggc cgagctgggc cgccaggggg cgctgcggag 1981 ccgggggaca cccctccctg cctgcctcag tcccccgccc cctccccgcc cgcgcgcaaa 2041 acgcactcgc cccagaggca gcgcggccga gcccgagccg ctgccggagc ggagccggag 2101 agtggcggcg gcggcggcag cggcaccatg accctgggca gctgctgctg cgagatcatg 2161 tcctccgaga gctccccggc cgcgctgtcc gaggccgacg cagacataga cgtggtgggc 2221 ggcggcagcg gcggggggga gctcccagct cgctccgggc cccgcgcccc ccgggacgtg 2281 ctcccccacg gccacgagcc tcccgcggag gaagccgagg cagacttagc cgaggacgag 2341 gaggagtctg gtggctgctc ggacggcgag ccccgcgctc tggcgtcccg gggggcggcg 2401 gccgcagcgg ggagcccggg gccaggcgcc gcggcggccc gcggcgcagc ggggcccggg 2461 ccgggaccgc cgtcgggggg cgcggcgacg cggagcccgc tggtgaagcc gccctactcg 2521 tacatcgcgc tcatcaccat ggccatcctg cagagcccca agaagcggct gacgttgagc 2581 gagatctgcg agttcatcag cggccgcttc ccctactacc gggagaagtt ccccgcctgg 2641 cagaacagca tccgccacaa cctctctctc aacgactgct tcgtcaagat cccccgcgag 2701 ccgggcaacc cgggcaaggg caactactgg acgctggacc cggagtcggc cgacatgttc 2761 gacaacggca gcttcctgcg gcgtcgcaag cgcttcaagc ggcagcccct gccgccgccg 2821 cacccacacc cgcaccctca cccggagctg ctgctgcgtg gcggggccgc ggcggcgggg 2881 gatcccggcg ctttcctgcc cggcttcgct gcctacggcg cctacggcta cggctacggg 2941 ctggctctcc cggcctacgg cgcacccccg ccggggccgg ccccgcatcc gcacccgcac 3001 ccgcacccgc acgccttcgc tttcgccgcg gcagccgccg ccgctccttg ccagctgtcg 3061 gtacccccag gccgcgcgcc gcgcctccac ccggacctcc gacggcctcg gtgttcgcag 3121 gcgcgggatc ggccccagct cctgcgcctg cctcaggctc gggcccgggc ccgggccccg 3181 caggcctgcc cgccttcctg ggcgcggagc tgggctgcgc caaagccttc tacccggcgt 3241 ccctgagtcc tcccgcagcc ggcaccgcgg cgggtctgcc caccgcactt ctgcgccagg 3301 gcctcaagac ggacgcgggc ggtggtgcag gcggcggggg cgccggggca gggcagaggc 3361 cttccttctc tatagaccac atcatgggcc acggtggcgg cggggcagca cccccgggcg 3421 ccggcgaggg ctctccggga ccgccattcg cggcagccgc gggtcctggg ggccaagccc 3481 aggtcttggc catgctgact gctccggccc tggctcccgt tgctggccac attcgcctct 3541 cgcatcccgg ggacgcgctg ctgtcctcag ggtcccggtt tgccagcaaa gtcgccggcc 3601 ttagtggctg ccacttctga ccgcagcagg cccagggccg gttaggtccg cactcctcag 3661 cctctcccgg gagttcctgc ggtcccagcg gaactcaggg agtctattta tgaagtctcc 3721 agaccttggg ccggcacgcg tgacacggca cttcaggctc cacgcacaga atctcgcaga 3781 tagttgggac taagcgggct ctatcgctca gggcgacagg cccggggcta cgcgaagaag 3841 tcgcaggcca agattcttta cagtttgaga aataaaagca ggggggtggg ggcttcgttt 3901 ttttccctgc ctctgcgcct ctcggggaac acattccggg agagatgcct ggccaggctc 3961 cacggatccc gccagaaaca ccaacagagg gtctcccttt ctgcctttcc cctctcactt 4021 cttccccaac gttgagaccc tgcttgtcca atattataat ttaaagacat ctattatctg 4081 ctttgtgctt aaaagaaaaa ttcaaccttt tttttttttt tttttttgct gttctccaag 4141 gaagttcgtt tcctctgaag cctaaaccag tgtctacgca ggcggagctg aacggagagg 4201 tgaagcaggg ggtctttata ttccctgcag aaaccctgga tcccactccc cggaattc // LOCUS AF043294 3332 bp mRNA PRI 28-JAN-1998 DEFINITION Homo sapiens putative serine/threonine-protein kinase mRNA, complete cds. ACCESSION AF043294 NID g2811265 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3332) AUTHORS Ouyang,B. and Dai,W. TITLE Human putative mitotic checkpoint kinase (HuBub1) mRNA, complete cds JOURNAL Unpublished REFERENCE 2 (bases 1 to 3332) AUTHORS Ouyang,B. and Dai,W. TITLE Direct Submission JOURNAL Submitted (15-JAN-1998) Internal Medicine, University of Cincinnati College of Medicine, 231 Bethesda Avenue, Cincinnati, OH 45267, USA FEATURES Location/Qualifiers source 1..3332 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="K562" /cell_type="erythroleukemia cells" CDS 33..3290 /note="similar to Saccharomyces cerevisiae mitotic checkpoint kinase: Swiss-Prot Accession Number P41695" /codon_start=1 /product="putative serine/threonine-protein kinase" /db_xref="PID:g2811266" /translation="MDTPENVLQMLEAHMQEYKGNDLLGEWERYIQWVEENFPENKEY LITLLEHLMKEFLDKKKYHNDPRFISYCLKFAEYNSDLHQFFEFLYNHGIGTLSSPLY IAWAGHLEAQGELQHASAVLQRGIQNQAEPREFLQQQYRLFQTRLTETHLPAQARTSE PLHNVQVLNQMITSKSNPGNNMACISKNQGSELSGVISSACDKESNMERRVITISKSE YSVHSSLASKVDVEQVVMYCKEKLIRGESEFSFEELRAQKYNQRRKHEQWVNVDRHYM KRKEANAFEEQLLKQKMDELHKKLHQVVETSHEDLPASQERSEVNPARMGPSVGSQQE LRAPCLPVTYQRTPVNMEKNPREAPPVVPPLANAISAALVSPATSQSTAPPVPLKAQT VTDSMYAVASKDAGCVNKSTHEFKPQSGAEIKEGCETHKVANTSSFHTTPNTSLGMVQ STPSKVQPSPTVHTKEALGFIMNMFQAPTLPDISDDKDEWQSLDQNEDAFEAQFQKNV RSSGAWGVNKIISSLSSAFHVFEDGNKENYGLPQPKNKPTGARTFGERSVSRLPSKPK EEVPHAEEFLDDSTVWGIRCNKTLAPSPKSPGDFTSAAQLASTPFHKLPVESVHILED KENVVAKQCTQATLDSCEENMVVLSRDGKFSPIQEKSPKQALSSHMYSASLLRLSQPA AGGVLTCEAELGVEACRLTDTDAAIAEDPPDAIAGLQAEWMQMSSLGTVDAPNFIVGN PWDDKLIFKLLSGLSKPVSSYPNTFEWQCKLPAIKPKTEFQLGSKLVYVHHLLGEGAF AQVYEATQGDLNDAKNKQKFVLKVQKPANPWEFYIGTQLMERLKPSMQHMFMKFYSAH LFQNGSVLVGELYSYGTLLNAINLYKNTPEKVMPQGLVISFAMRMLYMIEQVHDCEII HGDIKPDNFILGNGFLEQDDEDDLSAGLALIDLGQSIDMKLFPKGTIFTAKCETSGFQ CVEMLSNKPWNYQIDYFGVAATVYCMLFGTYMKVKNEGGECKPEGLFRRLPHLDMWNE FFHVMLNIPDCHHLPSLDLLRQKLKKVFQQHYTNKIRALRNRLIVLLLECKRSRK" BASE COUNT 1044 a 684 c 735 g 869 t ORIGIN 1 ggtttgccgc tgccgcccag cgtcttttgg ccatggacac cccggaaaat gtccttcaga 61 tgcttgaagc ccacatgcaa gagtacaagg gcaatgacct tcttggtgaa tgggaaagat 121 acatacagtg ggtagaagag aattttcctg agaataaaga atacttgata actttactag 181 aacatttaat gaaggaattt ttagataaga agaaatacca caatgaccca agattcatca 241 gttattgttt aaaatttgct gagtacaaca gtgacctcca tcaatttttt gagtttctgt 301 acaaccatgg gattggaacc ctgtcatccc ctctgtacat tgcctgggcg gggcatctgg 361 aagcccaagg agagctgcag catgccagtg ctgtccttca gagaggaatt caaaaccagg 421 ctgaacccag agagttcctg caacaacaat acaggttatt tcagacacgc ctcactgaaa 481 cccatttgcc agctcaagct agaacctcag aacctctgca taatgttcag gttttaaatc 541 aaatgataac atcaaaatca aatccaggaa ataacatggc ctgcatttct aagaatcagg 601 gttcagagct ttctggagtg atatcttcag cttgtgataa agagtcaaat atggaacgaa 661 gagtgatcac gatttctaaa tcagaatatt ctgtgcactc atctttggca tccaaagttg 721 atgttgagca ggttgttatg tattgcaagg agaagcttat tcgtggggaa tcagaatttt 781 cctttgaaga attgagagcc cagaaataca atcaacggag aaagcatgag caatgggtaa 841 atgtagacag acattatatg aaaaggaaag aagcaaatgc ttttgaagaa cagctattaa 901 aacagaaaat ggatgaactt cataagaagt tgcatcaggt ggtggagaca tcccatgagg 961 atctgcccgc ttcccaggaa aggtccgagg ttaatccagc acgtatgggg ccaagtgtag 1021 gctcccagca ggaactgaga gcgccatgtc ttccagtaac ctatcagcgg acaccagtga 1081 acatggaaaa gaacccaaga gaggcacctc ctgttgttcc tcctttggca aatgctattt 1141 ctgcagcttt ggtgtcccca gccaccagcc agagcactgc tcctcctgtt cctttgaaag 1201 cccagacagt aacagactcc atgtatgcag tggccagcaa agatgctgga tgtgtgaata 1261 agagtactca tgaattcaag ccacagagtg gagcagagat caaagaaggg tgtgaaacac 1321 ataaggttgc caacacaagt tcttttcaca caactccaaa cacatcactg ggaatggttc 1381 agtcaacgcc atccaaagtg cagccatcac ccaccgtgca cacaaaagaa gcattaggtt 1441 tcatcatgaa tatgtttcag gctcctacac ttcctgatat ttctgatgac aaagatgaat 1501 ggcaatctct agatcaaaat gaagatgcat ttgaagccca gtttcaaaaa aatgtaaggt 1561 catctggggc ttggggagtc aataagatca tctcttcttt gtcatctgct tttcatgtgt 1621 ttgaagatgg aaacaaagaa aattatggat taccacagcc taaaaataaa cccacaggag 1681 ccaggacctt tggagaacgc tctgtcagca gacttccttc aaaaccaaag gaggaagtgc 1741 ctcatgctga agagtttttg gatgactcaa ctgtatgggg tattcgctgc aacaaaaccc 1801 tggcacccag tcctaagagc ccaggagact tcacatctgc tgcacaactt gcgtctacac 1861 cattccacaa gcttccagtg gagtcagtgc acattttaga agataaagaa aatgtggtag 1921 caaaacagtg tacccaggcg actttggatt cttgtgagga aaacatggtg gtgctttcaa 1981 gggatggaaa attcagtcca attcaagaga aaagcccaaa acaggccttg tcgtctcaca 2041 tgtattcagc atccttactt cgtctgagcc agcctgctgc aggtggggta cttacctgtg 2101 aggcagagtt gggcgttgag gcttgcagac tcacagacac tgacgctgcc attgcagaag 2161 atccaccaga tgctattgct gggctccaag cagaatggat gcagatgagt tcacttggga 2221 ctgttgatgc tccaaacttc attgttggga acccatggga tgataagctg attttcaaac 2281 ttttatctgg gctttctaaa ccagtgagtt cctatccaaa tacttttgaa tggcaatgta 2341 aacttccagc catcaagccc aagactgaat ttcaattggg ttctaagctg gtctatgtcc 2401 atcaccttct tggagaagga gcctttgccc aggtgtacga agctacccag ggagatctga 2461 atgatgctaa aaataaacag aaatttgttt taaaggtcca aaagcctgcc aacccctggg 2521 aattctacat tgggacccag ttgatggaaa gactaaagcc atctatgcag cacatgttta 2581 tgaagttcta ttctgcccac ttattccaga atggcagtgt attagtagga gagctgtaca 2641 gctatggaac attattaaat gccattaacc tttataaaaa tacccctgaa aaagtgatgc 2701 ctcaaggtct tgtcatctct ttcgctatga gaatgcttta catgattgag caagtgcatg 2761 actgtgaaat cattcatgga gacattaagc cagataactt catacttgga aacggatttt 2821 tggaacagga tgatgaagat gatttatctg ctggcttggc actgattgac ctgggtcaga 2881 gtatagatat gaaacttttt ccaaaaggaa ctatattcac agcaaagtgt gaaacatctg 2941 gttttcagtg tgttgagatg ctcagcaaca aaccatggaa ctaccagatc gattactttg 3001 gggttgctgc aacagtatat tgcatgctct ttggcactta catgaaagtg aaaaatgaag 3061 gaggagagtg taagcctgaa ggtcttttta gaaggcttcc tcatttggat atgtggaatg 3121 aattttttca tgttatgttg aatattccag attgtcatca tcttccatct ttggatttgt 3181 taaggcaaaa gctgaagaaa gtatttcaac aacactatac taacaagatt agggccctac 3241 gtaataggct aattgtactg ctcttagaat gtaagcgttc acgaaaataa aatttggata 3301 tagacagtcc ttaaaaaaaa aaaaaaaaaa aa // LOCUS AF043453 1598 bp mRNA PRI 01-FEB-1998 DEFINITION Homo sapiens sorting nexin 2 (SNX2) mRNA, complete cds. ACCESSION AF043453 NID g2827433 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1598) AUTHORS Kurten,R.C., Leychkis,Y., Wiley,H.S. and Gill,G.N. TITLE Interaction and Colocalization of Sorting Nexin 2 (SNX2) with SNX1 JOURNAL Unpublished REFERENCE 2 (bases 1 to 1598) AUTHORS Kurten,R.C. and Gill,G.N. TITLE Direct Submission JOURNAL Submitted (18-JAN-1998) Physiology & Biophysics, University of Arkansas for Medical Sciences, 4301 West Markham Slot 750, Little Rock, AR 72205, USA FEATURES Location/Qualifiers source 1..1598 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="5" /map="5q23" /tissue_type="placenta" gene 1..1598 /gene="SNX2" CDS 18..1577 /gene="SNX2" /codon_start=1 /product="sorting nexin 2" /db_xref="PID:g2827434" /translation="MITAERELLLWGDGKPTDFEDLEDGEDLFTSYCLHPRVKTHHLQ NQPSLPAEDISANSNGPKPTEVVLDDDREDLFAEATEEVSLDGLKGNLSYPRNLPLQS HLSLLLHSAPRIESKSMSAPVIFDRSREEIEEEANGDIFDIEIGVSDPEKVGDGMNAY MAYRVTTKTSLSMFSKSEFSVKRFTDFLGLHTTLPTTYLHVVIFVATSSRKSIVGMTK VKVGKEDSSSTEFVEKRRAALERYLQRTVKHPTLLQDPDLRQFLESSELPRAVNTQAL SGAGILRMVNKAADAVNKMTIKMNESDAWFEEKQQQFENLDQQLRKLHVSVEALVCHR KELSANTAAFAKSAAMLGNSEDHTALSRALSQLAEVEEKIDQLHQEQAFADFYMFSEL LSDYIRLIAAVKGVFDHRMKCWQKWEDAQITLLKKREAEAKMMVANKPDKIQQAKNEI REWEAKVQQGERDFEQISKTIRKEVGRFEKERVKDFKTVIIKYLESLVQTQQQLIKYW EAFLPEAKAIA" BASE COUNT 525 a 315 c 365 g 393 t ORIGIN 1 ctttccgggg cagtcccatg ataacggccg agagggaact cctcctctgg ggggacggga 61 agcccaccga ctttgaggat ctggaggacg gagaggacct gttcaccagc tactgtctcc 121 accctagagt caagacccat catctccaga accagcctag tcttcctgca gaagatatta 181 gtgcaaactc caatggccca aaacccacag aagttgtatt agatgatgac agagaagatc 241 tttttgcaga agccacagaa gaagtatctt tggacggcct gaaagggaac ctctcctatc 301 ctcggaacct tcccctgcag tcacacctgt cactcctact acactctgct cctagaattg 361 aatcaaagag tatgtctgct cccgtgatct ttgatagatc cagggaagag attgaagaag 421 aagcaaatgg agacattttt gacatagaaa ttggtgtatc agatccagaa aaagttggtg 481 atggcatgaa tgcctatatg gcatatagag taacaacaaa gacatctctt tccatgttca 541 gtaagagtga attttcagtg aaaagattca ccgactttct tggtttgcac accacattac 601 caaccacata tttacatgtt gttatatttg ttgccaccag ctccagaaag agtatagtag 661 ggatgaccaa ggtcaaagtg ggtaaagaag actcatcatc cactgagttt gtagaaaaac 721 ggagagcagc tcttgaaagg tatcttcaaa gaacagtaaa acatccaact ttactacagg 781 atcctgattt aaggcagttc ttggaaagtt cagagctgcc tagagcagtt aatacacagg 841 ctctgagtgg agctggaata ttgaggatgg tgaacaaggc tgccgacgct gtcaacaaaa 901 tgacaatcaa gatgaatgaa tcggatgcat ggtttgaaga aaagcagcag caatttgaga 961 atctggatca gcaacttagg aaacttcatg tcagtgttga agccttggtc tgtcatagaa 1021 aagaactttc agccaacaca gctgcctttg ctaaaagtgc tgccatgtta ggtaattctg 1081 aggatcatac tgctttatct agagctttgt ctcagcttgc agaggttgag gagaagatag 1141 accagttaca tcaagaacaa gcttttgctg acttttatat gttctcagaa ctacttagtg 1201 actacattcg tcttattgct gcagtgaaag gtgtgtttga ccatcgaatg aagtgctggc 1261 agaaatggga agatgctcaa attactttgc tcaaaaaacg tgaagctgaa gcaaaaatga 1321 tggttgctaa caaaccagat aaaatacagc aagctaaaaa tgaaataaga gagtgggagg 1381 cgaaagtgca acaaggggaa agagattttg aacagatatc taaaacgatt cgaaaagaag 1441 tgggaagatt tgagaaagaa cgagtgaagg attttaaaac cgttatcatc aagtacttag 1501 aatcactagt tcaaacacaa caacagctga taaaatactg ggaagcattc ctacctgaag 1561 ccaaagccat tgcctagcaa taagattgtt ggccgttt // LOCUS AF043473 4603 bp mRNA PRI 31-JAN-1998 DEFINITION Homo sapiens delayed-rectifier K+ channel alpha subunit (Kv9.1) mRNA, complete cds. ACCESSION AF043473 NID g2815900 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4603) AUTHORS Rae,J.L. and Shepard,A.R. TITLE Direct Submission JOURNAL Submitted (16-JAN-1998) Physiology and Biophysics, Mayo Foundation, 200 1st Street, SW, Rochester, MN 55905, USA FEATURES Location/Qualifiers source 1..4603 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="20" /map="20q12" /tissue_type="lens epithelium" gene 1..4603 /gene="Kv9.1" exon 105..170 /gene="Kv9.1" /note="alternatively spliced; electrically silent" exon 239..440 /gene="Kv9.1" /note="alternatively spliced; electrically silent" CDS 441..2021 /gene="Kv9.1" /note="alternatively spliced" /codon_start=1 /product="delayed-rectifier K+ channel alpha subunit" /db_xref="PID:g2815901" /translation="MLMLLVRGTHYENLRSKVVLPTPLGGRSTETFVSEFPGPDTGIR WRRSDEALRVNVGGVRRQLSARALARFPGTRLGRLQAAASEEQARRLCDDYDEAAREF YFDRHPGFFLSLLHFYRTGHLHVLDELCVFAFGQEADYWGLGENALAACCRARYLEKR LTQPHAWDEDSDTPSSVDPCPDEISDVQRELARYGAARCGRLRRRLWLTMENPGYSLP SKLFSCVSISVVLASIAAMCIHSLPEYQAREAAAAVAAVAAGRSPEGVRDDPVLRRLE YFCIAWFSFEVSSRLLLAPSTRNFFCHPLNLIDIVSVLPFYLTLLAGVALGDQGGKEF GHLGKVVQVFRLMRIFRVLKLARHSTGLRSLGATLKHSYREVGILLLYLAVGVSVFSG VAYTAEKEEDVGFNTIPACWWWGTVSMTTVGYGDVVPVTVAGKLAASGCILGGILVVA LPITIIFNKFSHFYRRQKALEAAVRNSNHQEFEDLLSSVDGVSEASLETSRETSQEGQ SADLESQAPSEPPHPQMY" BASE COUNT 1057 a 1286 c 1287 g 973 t ORIGIN 1 attacaggtg tgaccgcgcc tgtaatcccg gcacggacgc gtgggcagcg atcgatgcgg 61 ccttcagccg cgggggacta ggcaaggaag aaggaaggag agggcagcgg ggccgcctgc 121 agcaggaggc tcggcggggc cggcggggca gaggcgagac cgagtatagc cagcggctcc 181 cagcctcggt ggctttggga cagacacgcg aggcgcccgg gaggtgccac cacaggtgga 241 gatccaagtg gaggtgggag cggagctaca tcacacgttt attgtgcatt gattatgtgc 301 cagggaccat gctgccgata aacaccttgc atcatttaat tctctgagcc tctcctggaa 361 agtatgatgc tacccggttc cacagacagg gaaacaagcc cagagagggc gacgaatgca 421 ctcaaggaca gcagctagtg atgctgatgc tgctggtccg gggaacacac tatgagaacc 481 tccggtctaa agtggtgctg ccaacacccc taggagggag gagcactgaa acctttgtga 541 gcgagttccc gggccccgac accgggatcc gctggcggcg aagcgacgag gcgctgcgcg 601 tgaacgtggg tggcgtgcgg cggcagctga gcgcgcgcgc cctggcgcgc ttcccgggca 661 cgcggctggg ccgcctgcag gccgcggcgt cggaggagca agcgcggcgc ctgtgcgacg 721 actacgacga ggcggcgcgc gaattctact tcgaccggca cccgggcttc ttcctgagcc 781 tgctgcactt ctaccgcact ggccacctgc acgtgctcga cgagctgtgc gtcttcgcct 841 ttggccagga ggccgactac tggggcctag gcgagaacgc gcttgccgcg tgctgccgcg 901 cgcgctacct ggagaagcgg ctgacccagc cgcacgcctg ggacgaggac agcgacacgc 961 cgagcagcgt ggacccgtgc cccgacgaga tctccgacgt gcagcgagaa ctggcgcgct 1021 atggcgcggc gcgctgtggc cgcctgcgcc gccgcctctg gctgaccatg gagaacccgg 1081 gctactcgct gccgagcaag ctcttcagct gcgtctccat cagcgtggtg ctcgcctcca 1141 tcgccgccat gtgcatccac agcctgcccg agtaccaggc ccgcgaggcg gcggccgccg 1201 tggctgcggt ggccgcgggc cgcagcccgg aaggcgtgcg cgacgacccg gtgctgcgac 1261 gcctcgagta cttctgcatc gcctggttca gcttcgaggt gtcgtcgcgc ctcctgctgg 1321 cgcccagtac gcgcaacttc ttctgccacc cgctcaacct catcgacatt gtgtctgtgc 1381 tgcccttcta tctcacgctg ctggctggtg tggcactggg cgaccagggc ggcaaggagt 1441 tcggccacct gggcaaggtg gtgcaggtgt tccgcctcat gcgcatcttc cgcgtactca 1501 agttggcgcg ccattccacc gggctgcgct cgctgggagc cacgctcaag cacagctacc 1561 gtgaggtggg catcttgctg ctgtacctgg ctgtgggtgt gtcagtgttc tctggtgtgg 1621 cctacacagc tgaaaaggag gaggacgtgg gctttaacac catcccagcc tgctggtggt 1681 ggggcacagt gagcatgacc accgtgggct atggggatgt ggtgccagtg acggtggctg 1741 gcaagctggc agcctcaggc tgcatcctag ggggcatcct ggtggtagca ctccccatca 1801 ccatcatctt caacaagttc tcccacttct accggcgcca gaaggctctg gaggcagccg 1861 tgcgcaacag caaccaccaa gagtttgagg acttgctgag cagcgttgat ggggtgtcgg 1921 aggcatctct ggagacatcc cgagaaacct ctcaggaggg acagtctgca gatctagaga 1981 gccaggcccc cagtgagcct ccacaccctc agatgtatta aaaccaggga tccgtgaccc 2041 cctgccatgc ccctacagta gagattcctc ccctgctaaa gttcttcatg gtagtgagcc 2101 tcccaggatc actcatgctg tcctgagaaa tgagattcaa aagctgcatt ccttcctcca 2161 aaggcccagg acaggacagt gctaccctag attctgtgag attctgcaag caactcagaa 2221 agctttataa acctgcctcc aagccattca tgaagcaggc atgaagccag aggggaggaa 2281 gttgaaagga tactaatgat tcctgagcac ctcctgtaca tgaaccaggt catgtttggc 2341 cttgtgttgg agaaagagaa ttatccccac ttcacagatg aggaaaccaa ggctaagaga 2401 tgagcgatgc ctctcccaga tgtcacaatt agaagtggca gagtcaggtt tggaatctgg 2461 gcttgttggt gtgcctggag caatgctcca tcaactccat ctttcttctg ccctcatgaa 2521 gtttgcgtgt gttttttggt gattcatgga tccccagagt tgatccacaa gataagaaat 2581 ccctttcaga gccaggggaa gggtccagac aggagatttg gatataggag ctctggatct 2641 cagctgtggt tcctttctag gagcatccaa caggagtcag caatcaaaga tgagtcctgg 2701 agggaaggag gatgagtcct gggaggaagt accatagcat tgtacttgcc tctttagctg 2761 gtatgcaaga tttgctaagc aataatcttc cgaccttcct cccagggctg atgtgagaga 2821 aagaaaatct gaagggcccc tggcctgaga tctagacaac ctagctgcgc caagacctgc 2881 tgtgtaaagt ggtccaatcc ctgcccccaa gtccagattc cagcccctct gtcaaatggt 2941 cagaatctgt gacccagaag gcaatgaaag ccaaggagag cagaggggtc tctaacttaa 3001 agccagctca gctctgaaat gttcagaggg ctgacatata tgaatactgg cttctaagac 3061 cctcccccaa cagtctgacc ttggcagttc ctttctgccc atcacacaga cgtccagccc 3121 tacattgtcc agccctacct agagtaccca tccaggccca tgtgtgagag tcctctccct 3181 tctggccagc tccagccccc tcacccttgc tgggggcata tctctcatac ccacctgctg 3241 aatcttctct tcacttgctg agaaagtcag aggaggggat ccagtagccc tttccacata 3301 ctgtggctac agcacccagc actgtgctgt tgttgtctga ctgtggctat ctctgtccca 3361 tcaccctgag cctcaagatg gaagacattg tctgcttgaa ctctgggctt ggtgcagagt 3421 aggtatttgg taaatgttga atagatcaac caattcctac agcccactgg gtgccaagct 3481 cagaggggga aaaaaagata gccagagaga gacctggacc tcaaaaaggt gtgaacaaga 3541 ctgaaaagga ggtccgttca gctccgcctg ccctgtccct gcttggttct tgaagttttt 3601 acaatatatt ccttgtaggt tcttcactca ctcactcaac aaatacccgt tggaagagtc 3661 ttaaactggg gcttcatgtg tggcctgcag tcatgtttcg gcttagtcca acagtgtttt 3721 tattatattt taagaatctg aatgcctttg cgtgaggcat gcactctcag gttccccata 3781 gtgtctcccc cgctgccacc cacaaacaga cacacatgcc gttgtgttat gcccagccac 3841 tctgatatac tggcttctga cttatgggct gcttactagt ctcaagagat acagtggtgg 3901 aaaaatatat ccctatactc atgaagctta ccgactagta gggaagatag actctaatgc 3961 aataatcaca taaatatagc gagctctact aagtacaggt gtgtataatg tatcagggac 4021 catggcccga gaaagtgaga aagctttcct gagcaagtga caatctaacc tgccatccaa 4081 atgataaatt gctaattaag caaatgaggt tccagggggg agtactccaa gcagagagaa 4141 tggcatgtgt gaaggtgtgt gtcatacaca gggcctagta catagtgggc cccctacata 4201 cgcatggatt ctgggtctcc agattggaac agaggatgga gctaagcagc aaagggcctc 4261 ccttgaattc accacacaca taccagccct tctgcacact ttgctcctta catctaggcc 4321 cagctgaacc caggttaggg gcagatcaga agggctgcgt gtagcaccct tatggtcttg 4381 cagaacacac caagaaaccc gacggcttcc atattttctg accctacaaa caggtcccta 4441 atactgatgt ggagaaagct caagagggaa tcatctggcc tgagcctacc atgaggctgt 4501 ttgtggttcc tggcagaaaa gcaacaactc gctccccttc ccatgtatca gtgaaaacca 4561 ttatgcaaat aaagagctgg cccccgaaaa aaaaaaaaaa aaa // LOCUS AF043724 1440 bp mRNA PRI 01-FEB-1998 DEFINITION Homo sapiens hepatitis A virus cellular receptor 1 (hHAVcr-1) mRNA, complete cds. ACCESSION AF043724 NID g2827453 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1440) AUTHORS Feigelstock,D., Thompson,P., Mattoo,P. and Kaplan,G.G. TITLE The human homolog of HAVcr-1 is a cellular receptor for hepatitis A virus JOURNAL Unpublished REFERENCE 2 (bases 1 to 1440) AUTHORS Feigelstock,D., Thompson,P., Mattoo,P. and Kaplan,G.G. TITLE Direct Submission JOURNAL Submitted (20-JAN-1998) Division of Viral Products, CBER - U.S. Food and Drug Administration, Bldg. 29A-NIH, Rm. 1D10, 8800 Rockville Pike, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..1440 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" gene 1..1440 /gene="hHAVcr-1" CDS 52..1131 /gene="hHAVcr-1" /function="cellular receptor for hepatitis A virus" /note="mucin-like class I integral membrane glycoprotein" /codon_start=1 /product="hepatitis A virus cellular receptor 1" /db_xref="PID:g2827454" /translation="MHPQVVILSLILHLADSVAGSVKVGGEAGPSVTLPCHYSGAVTS MCWNRGSCSLFTCQNGIVWTNGTHVTYRKDTRYKLLGDLSRRDVSLTIENTAVSDSGV YCCRVEHRGWFNDMKITVSLEIVPPKVTTTPIVTTVPTVTTVRTSTTVPTTTTVPTTT VPTTMSIPTTTTVPTTMTVSTTTSVPTTTSIPTTTSVPVTTTVSTFVPPMPLPRQNHE PVATSPSSPQPAETHPTTLQGAIRREPTSSPLYSYTTDGNDTVTESSDGLWNNNQTQL FLEHSLLTANTTKGIYAGVCISVLVLLALLGVIIAKKYFFKKEVQQLSVSFSSLQIKA LQNAVEKEVQAEDNIYIENSLYATD" BASE COUNT 414 a 361 c 285 g 380 t ORIGIN 1 gttacccagc attgtgagtg acagagcctg gatctgaacg ctgatcccat aatgcatcct 61 caagtggtca tcttaagcct catcctacat ctggcagatt ctgtagctgg ttctgtaaag 121 gttggtggag aggcaggtcc atctgtcaca ctaccctgcc actacagtgg agctgtcaca 181 tcaatgtgct ggaatagagg ctcatgttct ctattcacat gccaaaatgg cattgtctgg 241 accaatggaa cccacgtcac ctatcggaag gacacacgct ataagctatt gggggacctt 301 tcaagaaggg atgtctcttt gaccatagaa aatacagctg tgtctgacag tggcgtatat 361 tgttgccgtg ttgagcaccg tgggtggttc aatgacatga aaatcaccgt atcattggag 421 attgtgccac ccaaggtcac gactactcca attgtcacaa ctgttccaac cgtcacgact 481 gttcgaacga gcaccactgt tccaacgaca acgactgttc caacgacaac tgttccaaca 541 acaatgagca ttccaacgac aacgactgtt ccgacgacaa tgactgtttc aacgacaacg 601 agcgttccaa cgacaacgag cattccaaca acaacaagtg ttccagtgac aacaacggtc 661 tctacctttg ttcctccaat gcctttgccc aggcagaacc atgaaccagt agccacttca 721 ccatcttcac ctcagccagc agaaacccac cctacgacac tgcagggagc aataaggaga 781 gaacccacca gctcaccatt gtactcttac acaacagatg ggaatgacac cgtgacagag 841 tcttcagatg gcctttggaa taacaatcaa actcaactgt tcctagaaca tagtctactg 901 acggccaata ccactaaagg aatctatgct ggagtctgta tttctgtctt ggtgcttctt 961 gctcttttgg gtgtcatcat tgccaaaaag tatttcttca aaaaggaggt tcaacaacta 1021 agtgtttcat ttagcagcct tcaaattaaa gctttgcaaa atgcagttga aaaggaagtc 1081 caagcagaag acaatatcta cattgagaat agtctttatg ccacggacta agacccagtg 1141 gtgctctttg agagtttacg cccatgactg cagaagactg aacaggtatc agcacatcag 1201 atgtctttta gactccaaga caatttttct gtttcagttt catctggcat tccaacatgt 1261 cagtgatact gggtagagta actctcccac tccaaactgt gtatagtcaa cctcatcatt 1321 aatgtagtcc taatttgttt tgctaaaact ggctcaatcc ttctgatcat tgcagagttt 1381 tctctcaaac atgaacactt tagaattgta tgttctcttt agaccccata aatcctgtat // LOCUS AF043906 2036 bp mRNA PRI 05-FEB-1998 DEFINITION Homo sapiens T245 protein (T245) mRNA, complete cds. ACCESSION AF043906 NID g2832292 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2036) AUTHORS Maeda,K., Matsuhasi,S., Hori,K., Xin,Z., Mukai,T., Tabuchi,K., Egashira,M. and Nishikawa,N. TITLE Cloning and characterization of a novel human cDNA (T245) encoding a protein belong to transmembrane 4 superfamily JOURNAL Unpublished REFERENCE 2 (bases 1 to 2036) AUTHORS Maeda,K. and Matsuhasi,S. TITLE Direct Submission JOURNAL Submitted (21-JAN-1998) Biochemistry, Saga Medical School, Nabeshima 5-1-1, Saga City, Saga 849-8501, Japan FEATURES Location/Qualifiers source 1..2036 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /map="Xq22" gene 1..2036 /gene="T245" CDS 68..805 /gene="T245" /note="member of transmembrane 4 superfamily" /codon_start=1 /product="T245 protein" /db_xref="PID:g2829196" /translation="MASPSRRLQTKPVITCFKSVLLIYTFIFWITGVILLAVGIWGKV SLENYFSLLNEKATNVPFVLIATGTVIILLGTFGCFATCRASAWMLKLYAMFLTLVFL VELVAAIVGFVFRHEIKNSFKNNYEKALKQYNSTGDYRSHAVDKIQNTLHCCGVTDYR DWTDTNYYSEKGFPKSCCKLEDCTPQRDADKVNNEGCFIKVMTIIESEMGVVAGISFG VACFQLIGIFLAYCLSRAITNNQYEIV" BASE COUNT 548 a 391 c 423 g 674 t ORIGIN 1 gggactccgc gtctcgctct ctgtgttcca atcgcccggt gcggtggtgc agggtctcgg 61 gctagtcatg gcgtccccgt ctcggagact gcagactaaa ccagtcatta cttgtttcaa 121 gagcgttctg ctaatctaca cttttatttt ctggatcact ggcgttatcc ttcttgcagt 181 tggcatttgg ggcaaggtga gcctggagaa ttacttttct cttttaaatg agaaggccac 241 caatgtcccc ttcgtgctaa ttgctactgg taccgtcatt attcttttgg gcacctttgg 301 ttgttttgct acctgccgag cttctgcatg gatgctaaaa ctgtatgcaa tgtttctgac 361 tctcgttttt ttggtcgaac tggtcgctgc catcgtagga tttgttttca gacatgagat 421 taagaacagc tttaagaata attatgagaa ggctttgaag cagtataact ctacaggaga 481 ttatagaagc catgcagtag acaagatcca aaatacgttg cattgttgtg gtgtcaccga 541 ttatagagat tggacagata ctaattatta ctcagaaaaa ggatttccta agagttgctg 601 taaacttgaa gattgtactc cacagagaga tgcagacaaa gtaaacaatg aaggttgttt 661 tataaaggtg atgaccatta tagagtcaga aatgggagtc gttgcaggaa tttcctttgg 721 agttgcttgc ttccaactga ttggaatctt tctcgcctac tgcctctctc gtgccataac 781 aaataaccag tatgagatag tgtaacccaa tgtatctgtg ggcctattcc tctctacctt 841 taaggacatt taggtccccc ctgtgaatta gaaagttgct tggctggaga actgacagca 901 ctacttactg atagaccaaa aaactacacc agtaggttga ttcaatcaag atgtatgtag 961 acctaaaact acaccaatag gctgattcaa tcaagatccg tgctcgcagt gggctgattc 1021 aatcaagatg tatgtttgct atgttctaag tccaccttct atcccattca tgttagatcg 1081 ttgaaacctg gtctccctct gaaacactgg aagagctagt aaattgtaaa tgaagtaata 1141 ctgtgttcct cttgactgtt atttttctta gtagggggcc tttggaaggc actgtgaatt 1201 tgctattttg atgtagtgtt accaagatgg aaaattgatt cctctgactt tgctattgat 1261 gtagtgtgat agaaaattca cccctctgaa ctggctcctt cccagtcaag gttatctggt 1321 ttgattgtat aatttgcacc aagaagttaa aatgttttat gactctctgt tctgctgaca 1381 ggcagagagt cacattgtgt aatttaattt cagtcagtca atagatggca tccctcatca 1441 gggttgccag atggtgataa cagtgtaagg ccttgggtct aaggcatcca cgactggaag 1501 ggactactga tgttctgtga tacatcaggt ttcagcacac aacttacatt tctttgcctc 1561 caaattgagg catttattat gatgttcata ctttccctct tgtttgaaag tttctaatta 1621 ttaaatggtg tcggaattgt tgtattttcc ttaggaattc agtggaactt atcttcatta 1681 aatttagctg gtaccaggtt gatatgactt gtcaatatta tggtcaactt taagtcttag 1741 ttttcgtttg tgcctttgat taataagtat aactcttata caataaatac tgctttcctc 1801 taaaaagatc gtgtttaaat taacttgtag aaaatctgct ggaatggttg ttgttttcca 1861 ctgagaaagc taagccctac atttctattc agagtactgt ttttagatgt gaaatataag 1921 cctgcggcct taactctgta ttaaaaaaaa tgtttttgtt taaaaaaaac tgttcccata 1981 ggtgcagcaa accaccatgg cacatgtata cctatgtaac aaacctgcac attttg // LOCUS AF044333 1548 bp mRNA PRI 05-FEB-1998 DEFINITION Homo sapiens hPRL1 mRNA, complete cds. ACCESSION AF044333 NID g2832295 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1548) AUTHORS Okresz,L. TITLE Direct Submission JOURNAL Submitted (23-JAN-1998) Biological Research Center, POB 521, Szeged H-6701, Hungary FEATURES Location/Qualifiers source 1..1548 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="4" /map="4q31-q32" gene 1..1548 /gene="hPRL1" CDS 1..1545 /gene="hPRL1" /note="similar to Arabidopsis thaliana PRL1 and PRL2 WD-40 repeat proteins" /codon_start=1 /db_xref="PID:g2832296" /translation="MVEEVQKHSVHTLVFRSLKRTHDMFVADNGKPVPLDEESHKRKM AIKLRNEYGPVLHMPTSKENLKEKGPQNATDSYVHKQYPANQGQEVEYFVAGTHPYPP GPGVALTADTKIQRMPSESAAQSLAVALPLQTKADANRTAPSGSEYRHPGASDRPQPT AMNSIVMETGNTKNSALMAKKAPTMPKPQWHPPWKLYRVISGHLGWVRCIAVEPGNQW FVTGSADRTIKIWDLASGKLKLSLTGHISTVRGVIVSTRSPYLFSCGEDKQVKCWDLE YNKVIRHYHGHLSAVYGLDLHPTIDVLVTCSRDSTARIWDVRTKASVHTLSGHTNAVA TVRCQAAEPQIITGSHDTTIRLWDLVAGKTRVTLTNHKKSVRAVVLHPRHYTFASGSP DNIKQWKFPDGSFIQNLSGHNAIINTLTVNSDGVLVSGADNGTMHLWDWRTGYNFQRV HAAVQPGSLDSESGIFACAFDQSESRLLTAEADKTIKVYREDDTATEETHPVSWKPEI IKRKRF" BASE COUNT 482 a 306 c 366 g 394 t ORIGIN 1 atggtcgagg aggtacagaa acattctgta cacacccttg tgttcaggtc gttgaagagg 61 acccatgaca tgtttgtagc tgataatgga aaacctgtgc ctttagatga agagagtcac 121 aaacgaaaaa tggcaatcaa gcttcgtaat gagtatggtc ctgtgttgca tatgcctact 181 tcaaaagaaa atcttaaaga gaagggtcct cagaatgcaa cggattcata tgttcataaa 241 cagtaccctg ccaatcaagg acaagaagtt gaatactttg tggcaggtac acatccatac 301 ccaccaggac ctggggttgc tttgacagca gatactaaga tccagagaat gccaagtgaa 361 tcagctgcac agtccttagc ggtggcatta cctttgcaga ccaaggctga tgcaaatcgt 421 actgccccta gtggaagtga ataccgacat cctggggctt ctgaccgtcc acagcctaca 481 gcgatgaatt caattgtcat ggagactggc aataccaaga actctgcact gatggctaaa 541 aaagccccta caatgccaaa accccagtgg cacccaccgt ggaaactcta cagggttatc 601 agtgggcatc ttggctgggt tcgatgtatt gctgtggaac ctggaaatca gtggtttgtt 661 actggatctg ctgacagaac tataaagatc tgggacttgg ctagtggcaa attaaaactg 721 tcattgactg ggcatattag tactgtgcgg ggcgtgatag taagcacaag gagcccatat 781 ctgttctctt gtggagaaga caaacaagtg aaatgctggg atctcgaata caataaggtt 841 atacggcatt atcatggaca tttaagtgca gtgtatggtt tggatttgca cccgacaatc 901 gatgtgttgg taacctgtag tcgagattca actgcacgga tttgggatgt gagaactaaa 961 gccagtgtac acacattatc tggacataca aatgcagttg ctacagtgag atgtcaggct 1021 gcagaaccac aaattattac aggaagccat gatactacaa ttcgattatg ggatctggtg 1081 gctggaaaaa caagagtgac attaacaaat cacaaaaaat cagttagggc tgtggtttta 1141 catccaagac attacacatt tgcatctggt tctccagata acataaagca gtggaaattc 1201 cctgatggaa gtttcattca aaatctttcc ggtcataatg ctattattaa cacattgacg 1261 gtaaattctg atggagtgct tgtatctgga gctgacaatg gcaccatgca tctttgggac 1321 tggagaactg gctacaattt tcagagagtt cacgcagctg tgcaacctgg gtctttggac 1381 agtgaatcag gaatatttgc ttgtgctttt gatcagtctg aaagtcgatt actaacagct 1441 gaagctgata aaaccattaa agtatacaga gaggatgaca cagccacaga agaaactcat 1501 ccagtcagct ggaaaccaga aattatcaag agaaagagat tttaatga // LOCUS AF044414 2839 bp mRNA PRI 03-FEB-1998 DEFINITION Homo sapiens alpha mannosidase 6A8B (6a8b) mRNA, complete cds. ACCESSION AF044414 NID g2828701 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2839) AUTHORS Zhu,L.-P., Li,P. and Ma,F.-R. TITLE Direct Submission JOURNAL Submitted (19-JAN-1998) Department of Immunology, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences & Peking Union Medical College, 5 Dong Dan San Tiao, Beijng 100005, P.R. China FEATURES Location/Qualifiers source 1..2839 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..2839 /gene="6a8b" CDS 745..2784 /gene="6a8b" /note="similar to rat ER alpha mannosidase" /codon_start=1 /product="alpha mannosidase 6A8B" /db_xref="PID:g2828702" /translation="MHGCGIRRFLTQKLTWNLGESLPTPYFFWEGLDGSRVLVHFPPG DSYGMQGSVEEVLKTVANNRDKGRANHSAFLFGFGDGGGGPTQTMLDRLKRLSNTDGL PRVQLSSPRQLFSALESDSEQLCTWVGELFLELHNGTYTTHAQIKKGNRECERILHDV ELLSSLALARSAQFLYPAAQLQHLWRLLLLNQFHDVVTGSCIQMVAEEAMCHYEDIRS HGNTLLSAAAAALCAGEPGPEGLLIVNTLPWKRIEVMALPKPGGAHSLALVTVPSMGY APVPPPTSLQPLLPQQPVFVVQETDGSVTLDNGIIRVKLDPTGRLTSLVLVASGREAI AEGAVGNQFVLFDDVPLYWDAWDVMDYHLETRKPVLGQAGTLAVGTEGGLRGSAWFLL QISPNSRLSQEVVLDVGCPYVRFHTEVHWHEAHKFLKVEFPARVRSSQATYEIQFGHL QRPTHYNTSWDWARFEVWAHRWMDLSEHGFGLALLNDCKYGASVRGSILSLSLLRAPK APDATADTGRHEFTYALMPHKGSFQDAGVIQAAYSLNFPLLALPAPSPAPATSWSAFS VSSPAVVLETVKQAESSPQRRSLVLRLYEAHGSHVDCWLHLSLPVQEAILCDLLERPD PAGHLTSGQPPEAHLFSLPSAVPVARASASATLSPWGWGFVCRRLWGLLISASPA" BASE COUNT 534 a 863 c 848 g 594 t ORIGIN 1 agagaccccg aagcctcact ctctatgtgg aagtagcctg caatgggctc ctgggggccg 61 ggaagggaag catgattgca gcccctgacc ctgagaagat attccagctg agccgggctg 121 agctagctgt gttccaccgg gatgtccaca tgctcctggt ggatctggag ctgctgctgg 181 gcatagccca gggcctcggg aaggacaacc agcgcagctt ccaggccctg tacacagccc 241 atcagatagt gaacgtgtgt gaccctgccc agcccgagac cttcccagtg gcccaggccc 301 tggcctccag gttctttggc caacatgggg gtttaaaagc caacacacca ttcatgccac 361 aagggcactg ccacattgat acagcctggc tttggccctt caaagagact gtgaggaaat 421 gtgcccggag ctgggtgacc gccctgcagc tcatggagcg gaaccctgtg ttcatctttg 481 cctgctccca ggcgcagcag ctggaatggg tgaagagccg ctaccctggc ctgtactccc 541 gcatggagga gtttgcgtgc cgtgggcagt ttgtgcctgt ggggggcacc tgggtggaaa 601 tggatgggaa cctgcccagt ggagaagcca tggtgaggca gtttttgcag ggccagaact 661 tctttctgca agagtttggg aagatgtgct ctgaattctg gctgccggac acctttggct 721 actccagcac agctccccca gatcatgcac ggctgtggca tcaggcgctt tctcacccaa 781 aaattgacct ggaatttggg cgaatccctt cccacaccat actttttctg ggagggcctg 841 gatggctccc gtgtactggt ccacttccca cctggcgact cctatgggat gcagggcagc 901 gtggaggagg tgctgaagac cgtggccaac aaccgggaca aggggcgggc caaccacagt 961 gccttcctct ttggctttgg ggatgggggt ggtggcccca cccagaccat gctggaccgc 1021 ctgaagcgcc tgagcaatac ggatgggctg cccagggtgc agctatcttc tccaagacag 1081 ctcttctcag cactggagag tgactcagag cagctgtgca cgtgggttgg ggagctcttc 1141 ttggagctgc acaatggcac atacaccacc catgcccaga tcaagaaggg gaaccgggaa 1201 tgtgagcgga tcctgcacga cgtggagctg ctcagtagcc tggccctggc ccgcagtgcc 1261 cagttcctat acccagcagc ccagctgcag cacctctgga ggctccttct tctgaaccag 1321 ttccatgatg tggtgactgg aagctgcatc cagatggtgg cagaggaagc catgtgccat 1381 tatgaagaca tccgttccca tggcaataca ctgctcagcg ctgcagccgc agccctgtgt 1441 gctggggagc caggtcctga gggcctcctc atcgtcaaca cactgccctg gaagcggatc 1501 gaagtgatgg ccctgcccaa accgggcggg gcccacagcc tagccctggt gacagtgccc 1561 agcatgggct atgctcctgt tcctcccccc acctcactgc agcccctgct gccccagcag 1621 cctgtgttcg tagtgcaaga gactgatggc tccgtgactc tggacaatgg catcatccga 1681 gtgaagctgg acccaactgg tcgcctgacg tccttggtcc tggtggcctc tggcagggag 1741 gccattgctg agggcgccgt ggggaaccag tttgtgctat ttgatgatgt ccccttgtac 1801 tgggatgcat gggacgtcat ggactaccac ctggagacac ggaagcctgt gctgggccag 1861 gcagggaccc tggcagtggg caccgagggc ggcctgcggg gcagcgcctg gttcttgcta 1921 cagatcagcc ccaacagtcg gcttagccag gaggttgtgc tggacgttgg ctgcccctat 1981 gtccgcttcc acaccgaggt acactggcat gaggcccaca agttcctgaa ggtggagttc 2041 cctgctcgcg tgcggagttc ccaggccacc tatgagatcc agtttgggca cctgcagcga 2101 cctacccact acaatacctc ttgggactgg gctcgatttg aggtgtgggc ccatcgctgg 2161 atggatctgt cagaacacgg ctttgggctg gccctgctca acgactgcaa gtatggcgcg 2221 tcagtgcgag gcagcatcct cagcctctcg ctcttgcggg cgcctaaagc cccggacgct 2281 actgctgaca cggggcgcca cgagttcacc tatgcactga tgccgcacaa gggctctttc 2341 caggatgctg gcgttatcca agctgcctac agcctaaact tccccctgtt ggctctgcca 2401 gcccccagcc cagcgcccgc cacctcctgg agtgcgtttt ccgtgtcttc acccgcggtc 2461 gtattggaga ccgtcaagca ggcggagagc agcccccagc gccgctcgct ggtcctgagg 2521 ctgtatgagg cccacggcag ccacgtggac tgctggctgc acttgtcgct gccggttcag 2581 gaggccatcc tctgcgatct cttggagcga ccagaccctg ctggccactt gacttcggga 2641 caaccgcctg aagctcacct tttctccctt ccaagtgctg tccctgttgc tcgtgcttca 2701 gcctccgcca cactgagtcc ctggggctgg ggttttgttt gtagaaggct ctggggactc 2761 ctaatttctg cttccccagc ctaaagcagg gatcagtctt ttcttgtgga ataaatcctt 2821 ggatcgggaa aaaaaaaaa // LOCUS CH19HHR23 110096 bp DNA PRI 01-APR-1997 DEFINITION Homo sapiens DNA from chromosome 19p13.2 cosmids R31240, R30272 and R28549 containing the EKLF, GCDH, CRTC, and RAD23A genes, genomic sequence. ACCESSION AD000092 NID g1905905 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 110096) AUTHORS Lamerdin,J., McCready,P., Stilwagen,S., Ramirez,M. and Carrano,A. TITLE Characterization by genomic sequence analysis of a gene-rich 111 kb region of 19p13.2 containing the human DNA repair gene, RAD23A JOURNAL Unpublished REFERENCE 2 (bases 1 to 110096) AUTHORS Lamerdin,J.E. TITLE Direct Submission JOURNAL Submitted (08-NOV-1996) J.E. Lamerdin, Human Genome Center, Lawrence Livermore National Laboratory, 7000 East Ave, Livermore, CA, USA, 94551 jane@acgt.llnl.gov ow@tornak.llnl.gov COMMENT GSDB:S:985657. map=19p13.2. FEATURES Location/Qualifiers source 1..110096 /organism="Homo sapiens" /note="constructed at LLNL from flow-sorted chromosomes from hybrid 5HL2-B, which carries chromosome 19 as its only human chromosome" /db_xref="taxon:9606" /chromosome="19" /cell_line="5HL2-B" /cell_type="fibroblast" /clone_lib="LL19NC03 R chromosome 19-specific cosmid library" /map="19p13.2" repeat_region 8..86 /note="repeat match = HSAL09846; putative" /rpt_family="Alu" repeat_region 24..407 /note="repeat match = HSAL02504; putative" /rpt_family="Alu" repeat_region 132..407 /note="repeat match = HSAL06234; putative" /rpt_family="Alu" repeat_region 289..437 /note="repeat match = HSAL05358; putative" /rpt_family="Alu" misc_feature complement(474..648) /note="predicted exon, grail2exons_human_1.3; frame=0, reverse strand, quality=excellent; putative" misc_feature 499..537 /note="ss region (gap in top strand); putative" repeat_region complement(518..651) /note="repeat match = L1MB7; putative" /rpt_family="LINE" repeat_region complement(648..949) /note="repeat match = HSAL02217; putative" /rpt_family="Alu" repeat_region complement(953..1105) /note="repeat match = L1MB7; putative" /rpt_family="LINE" misc_feature 1068..1179 /note="ss region (gap in bottom strand); putative" repeat_region 1465..1576 /note="repeat match = HSAL00829; putative" /rpt_family="Alu" repeat_region 1489..1769 /note="repeat match = ALU; putative" /rpt_family="Alu" repeat_region 1634..1806 /note="repeat match = HSAL00499; putative" /rpt_family="Alu" repeat_region 1843..1995 /note="repeat match = HSAL02158; putative" /rpt_family="Alu" repeat_region 1879..2001 /note="repeat match = HSAL11628; putative" /rpt_family="Alu" misc_feature complement(2081..2169) /note="predicted exon, grail2exons_human_1.3; frame=2, reverse strand, quality=good; putative" repeat_region complement(2833..3132) /note="repeat match = HSAL03055; putative" /rpt_family="Alu" repeat_region complement(3617..4230) /note="repeat match = HSAL02794; putative" /rpt_family="Alu" repeat_region complement(4200..4239) /note="repeat match = HSAL01548; putative" /rpt_family="Alu" repeat_region 4274..4557 /note="repeat match = HSAL06523; putative" /rpt_family="Alu" repeat_region 4282..4489 /note="repeat match = HSAL02768; putative" /rpt_family="Alu" misc_feature complement(4492..4666) /note="predicted exon, grail2exons_human_1.3; frame=0, reverse strand, quality=excellent; putative" misc_feature 4702..4969 /note="predicted exon, grail2exons_human_1.3; frame=2, forward strand, quality=excellent; putative" misc_feature 4887..4970 /note="similarity: pir||A54602; microtubule-associated serine/threonine protein kinase MAST205; putative" CDS join(<4890..4970,5060..5267,11345..11483,11582..11714, 11852..12017,12253..12354,12516..12638,13189..13298, 14009..14187,14265..14512,15235..15368,15489..15830, 17359..17481,17571..17707,19859..20099,20158..21405) /note="coding region constructed from xgrail 1.3 predictions, EST and SWISSPROT matches; comment for location 17571-17707; BLASTN similarity to H14890: ym25g07.r1 Homo sapiens cDNA (bases 198- 325); (5.9e-57); user-supplied translation (frame +1): GRSSKAKKPPGENDFDTIKLISNGAYGAVYLVRHRDTRQRFAMKKINKQNLILRNQIQ QAFVERDILTFA ENPFVVGMFCSFETRRHLCMVMEYVEGGDCATLLKNIGALPVEMARMYFAETVLALEY LHNYGIVHRDLK PDNLLITSMGHIKLTDFGLSKMGLMSLTTNLYEGHIEKDAREFLDKQVCGTPEYIAPE VILRQGYGKPVD WWAMGIILYEFLVGCVPFFGDTPEELFGQVISDDILWPEGDEALPTEAQLLISSLLQT NPLVRLGAGGAF EVKQHSFFRDLDWTGLLRQKAEFIPHLESEDDTSYFDTRSDRYHHVNSYDEDDTTEEE PVEIRQFSSCSP RFSKVYSSMEQLSQHEPKTPVAAAGSSKREPSTKGPEEKVAGKREGLGGLTLREKTWR GGSPEIKRFSAS EASFLEGEASPPLGARRRFSALLEPSRFSAPQEDEDEARLRRPPRPSSDPAGSLDARA PKEETQGEGTSS AGDSEASPRATNDLVLRRARHQQMSGDVAVEKRPSRTGGKVIKSASATALSAGPMRGC SATALGGGRSRY REMSIFLVLISDHGCVSVVDPHGSSPLASPMSPRSLSSNPSSRDSSPSRDYSPAVSGL RSPITIQRSGKK YGFTLRAIRVYMGDTDVYSVHHIVWHVEEGGPAQEAGLCAGDLITHVNGEPVHGMVHP EVVELILKSGNK VAVTTTPFENTSIRIGPARRSSYKAKMARRNKRPSAKEGQESKKRSSLFRKITKQSNL LHTSRSLSSLNR SLSSSDSLPGSPTHGLPARSPTHSYRSTPDSAYLGITSCTCAGTEQRGVAWLSSSPAS STPNSPASSASH HIRPSTLHGLSPKLHRQYRSARCKSAGNIPLSPLAHTPSPTQASPPPLPGHTVGSSHT TQSFPAKLHSSP PVVRPRPKSAEPPRSPLLKRVQSAEKLGASLSADKKGALRKHSLEVGHPDFRKDFHGE LALHSLAESDGE TPPVEGLGAPRQVAVRRLGRQESPLSLGADPLLPEGASRPPVSSKEKESPGGAEACTP PRATTPGGRTLE RDVGCTRHQSVQTEDGTGGMARAVAKAALSPVQEHETGRRSSSGEAGTPLVPIVVEPA RPGAKAVVPQPL GADSKGLQEPAPLAPSVPEAPRGRERWVLEVVEERTTLSGPRSKPASPKLSPEPQTPS LAPAKCSAPSSA VTPVPPASLLGSGTKPQVGLTSRCPAEAVPPAGLTKKGVSSPAPPGP; comment for location 5060-5267; BLASTN similarity to U02313:Mus MAST205 protein kinase mRNA (bases 1510- 1720); (4.0e-50); comment for location 20158-21405; BLASTN similarity to D61343: Human fetal brain cDNA 5'-end (bases 1- 188); (1.1e-32); comment for location 12516-12638; BLASTN similarity to U02313:Mus MAST205 protein kinase mRNA (bases 2253- 2382); (1.0e-18); comment for location 13189-13298; BLASTN similarity to U02313:Mus MAST205 protein kinase mRNA (bases 2388- 2492); (4.0e-12); comment for location 11582-11714; BLASTN similarity to T99935: ye72d02.r1 Homo sapiens cDNA (bases 192- 294) (1.5e-42); comment for location 11345-11483; BLASTN similarity to T99935: ye72d02.r1 Homo sapiens cDNA (bases 53- 195); (1.5e-30) and U02313:Mus MAST205 protein kinase mRNA (bases 1716- 1858); (1.2e-28); comment for location 11852-12017; BLASTN similarity to U02313:Mus MAST205 protein kinase mRNA (bases 1989- 2157); (5.2e-37); comment for location 17359-17481; BLASTN similarity to U02313:Mus MAST205 protein kinase mRNA (bases 3344- 3467); (1.8e-19) and H14890: ym25g07.r1 Homo sapiens cDNA (bases 75- 197); (2.6e-42); comment for location 15489-15830; BLASTN similarity to U02313:Mus MAST205 protein kinase mRNA (bases 3117- 3344); (4.2e-40) and H14890: ym25g07.r1 Homo sapiens cDNA (bases 2-74); (4.2e-20); putative" /codon_start=1 /product="hypothetical human serine-threonine protein kinase R31240_1" /db_xref="PID:g1905906" /translation="GRSSKAKKPPGENDFDTIKLISNGAYGAVYLVRHRDTRQRFAMK KINKQNLILRNQIQQAFVERDILTFAENPFVVGMFCSFETRRHLCMVMEYVEGGDCAT LLKNIGALPVEMARMYFAETVLALEYLHNYGIVHRDLKPDNLLITSMGHIKLTDFGLS KMGLMSLTTNLYEGHIEKDAREFLDKQVCGTPEYIAPEVILRQGYGKPVDWWAMGIIL YEFLVGCVPFFGDTPEELFGQVISDDILWPEGDEALPTEAQLLISSLLQTNPLVRLGA GGAFEVKQHSFFRDLDWTGLLRQKAEFIPHLESEDDTSYFDTRSDRYHHVNSYDEDDT TEEEPVEIRQFSSCSPRFSKVYSSMEQLSQHEPKTPVAAAGSSKREPSTKGPEEKVAG KREGLGGLTLREKTWRGGSPEIKRFSASEASFLEGEASPPLGARRRFSALLEPSRFSA PQEDEDEARLRRPPRPSSDPAGSLDARAPKEETQGEGTSSAGDSEASPRATNDLVLRR ARHQQMSGDVAVEKRPSRTGGKVIKSASATALSAGPMRGCSATALGGGRSRYREMSIF LVLISDHGCVSVVDPHGSSPLASPMSPRSLSSNPSSRDSSPSRDYSPAVSGLRSPITI QRSGKKYGFTLRAIRVYMGDTDVYSVHHIVWHVEEGGPAQEAGLCAGDLITHVNGEPV HGMVHPEVVELILKSGNKVAVTTTPFENTSIRIGPARRSSYKAKMARRNKRPSAKEGQ ESKKRSSLFRKITKQSNLLHTSRSLSSLNRSLSSSDSLPGSPTHGLPARSPTHSYRST PDSAYLGITSCTCAGTEQRGVAWLSSSPASSTPNSPASSASHHIRPSTLHGLSPKLHR QYRSARCKSAGNIPLSPLAHTPSPTQASPPPLPGHTVGSSHTTQSFPAKLHSSPPVVR PRPKSAEPPRSPLLKRVQSAEKLGASLSADKKGALRKHSLEVGHPDFRKDFHGELALH SLAESDGETPPVEGLGAPRQVAVRRLGRQESPLSLGADPLLPEGASRPPVSSKEKESP GGAEACTPPRATTPGGRTLERDVGCTRHQSVQTEDGTGGMARAVAKAALSPVQEHETG RRSSSGEAGTPLVPIVVEPARPGAKAVVPQPLGADSKGLQEPAPLAPSVPEAPRGRER WVLEVVEERTTLSGPRSKPASPKLSPEPQTPSLAPAKCSAPSSAVTPVPPASLLGSGT KPQVGLTSRCPAEAVPPAGLTKKGVSSPAPPGP" misc_feature 5059..5328 /note="predicted exon, grail2exons_human_1.3; frame=1, forward strand, quality=excellent; putative" misc_feature 5060..5269 /note="similarity: pir||A54602; microtubule-associated serine/threonine protein kinase MAST205; putative" repeat_region complement(5578..5645) /note="repeat match = MER420060; putative" /rpt_family="MER" repeat_region complement(5593..6087) /note="repeat match = HSAL05507; putative" /rpt_family="Alu" repeat_region 6084..6274 /note="repeat match = HSAL04930; putative" /rpt_family="Alu" repeat_region 6112..6368 /note="repeat match = HSAL01601; putative" /rpt_family="Alu" repeat_region complement(6389..6518) /note="repeat match = HSAL15162; putative" /rpt_family="Alu" repeat_region complement(6422..6519) /note="repeat match = HSAL06315; putative" /rpt_family="Alu" repeat_region complement(6474..6531) /note="repeat match = HSAL00370; putative" /rpt_family="Alu" repeat_region complement(6643..7262) /note="repeat match = HSAL03071; putative" /rpt_family="Alu" misc_feature 6811..6882 /note="ss region (gap in bottom strand); putative" repeat_region complement(7168..7287) /note="repeat match = HSAL10157; putative" /rpt_family="Alu" misc_feature complement(7373..7483) /note="predicted exon, grail2exons_human_1.3; frame=0, reverse strand, quality=excellent; putative" misc_feature 7415..7544 /note="predicted exon, grail2exons_human_1.3; frame=1, forward strand, quality=excellent; putative" repeat_region complement(7592..7692) /note="repeat match = HSAL05709; putative" /rpt_family="Alu" repeat_region complement(7614..7891) /note="repeat match = ALU; putative" /rpt_family="Alu" repeat_region complement(8196..8434) /note="repeat match = HSAL04223; putative" /rpt_family="Alu" repeat_region complement(8247..8835) /note="repeat match = UCAL00006; putative" /rpt_family="Alu" repeat_region complement(8253..8996) /note="repeat match = HSAL03102; putative" /rpt_family="Alu" repeat_region 9078..9172 /note="repeat match = HSAL09671; putative" /rpt_family="Alu" repeat_region complement(9195..9491) /note="repeat match = HSAL06257; putative" /rpt_family="Alu" repeat_region complement(9450..9502) /note="repeat match = HSAL15158; putative" /rpt_family="Alu" repeat_region 9505..9732 /note="repeat match = HSAL06165; putative" /rpt_family="Alu" repeat_region 10366..10459 /note="repeat match = HSAL02629; putative" /rpt_family="Alu" repeat_region 10366..10434 /note="repeat match = HSAL09527; putative" /rpt_family="Alu" repeat_region complement(10461..10508) /note="repeat match = HSAL04998; putative" /rpt_family="Alu" repeat_region complement(10471..10762) /note="repeat match = ALU; putative" /rpt_family="Alu" misc_feature 10483..10643 /note="ss region (gap in bottom strand); putative" repeat_region 10769..10833 /note="repeat match = HSAL08976; putative" /rpt_family="Alu" repeat_region 10784..10947 /note="repeat match = HSAL01737; putative" /rpt_family="Alu" misc_feature 11317..11481 /note="similarity: pir||A54602; microtubule-associated serine/threonine protein kinase MAST205; putative" misc_feature 11329..11481 /note="similarity: pdb|1APM|E; c-AMP-Dependent Protein Kinase (E.C.2.7.1.37) (cAPK); similarity: pdb|2CPK|E; c-AMP-Dependent Protein Kinase (E.C.2.7.1.37) (cAPK); putative" misc_feature 11345..11483 /note="predicted exon, grail2exons_human_1.3; frame=0, forward strand, quality=excellent; putative" misc_feature 11527..11637 /note="similarity: pir||A54602; microtubule-associated serine/threonine protein kinase MAST205; putative" misc_feature 11532..11714 /note="predicted exon, grail2exons_human_1.3; frame=2, forward strand, quality=excellent; putative" misc_feature 11580..11720 /note="similarity: pir||A54602; microtubule-associated serine/threonine protein kinase MAST205; putative" misc_feature 11580..11636 /note="similarity: pdb|2CPK|E; c-AMP-Dependent Protein Kinase (E.C.2.7.1.37) (cAPK); similarity: pdb|1APM|E; c-AMP-Dependent Protein Kinase (E.C.2.7.1.37) (cAPK); putative" misc_feature 11849..12016 /note="similarity: pir||A54602; microtubule-associated serine/threonine protein kinase MAST205; putative" misc_feature 11852..12019 /note="similarity: pdb|2CPK|E; c-AMP-Dependent Protein Kinase (E.C.2.7.1.37) (cAPK); similarity: pdb|1APM|E; c-AMP-Dependent Protein Kinase (E.C.2.7.1.37) (cAPK); putative" misc_feature 11868..12017 /note="predicted exon, grail2exons_human_1.3; frame=1, forward strand, quality=excellent; putative" repeat_region 12111..12197 /note="repeat match = HSAL04801; putative" /rpt_family="Alu" repeat_region 12111..12222 /note="repeat match = HSAL00634; putative" /rpt_family="Alu" misc_feature 12249..12362 /note="similarity: pir||A54602; microtubule-associated serine/threonine protein kinase MAST205; putative" misc_feature 12253..12354 /note="predicted exon, grail2exons_human_1.3; frame=2, forward strand, quality=excellent; putative" misc_feature 12516..12638 /note="predicted exon, grail2exons_human_1.3; frame=1, forward strand, quality=excellent; putative" misc_feature 12521..12640 /note="similarity: pir||A54602; microtubule-associated serine/threonine protein kinase MAST205; putative" misc_feature 13189..13313 /note="predicted exon, grail2exons_human_1.3; frame=2, forward strand, quality=excellent; putative" misc_feature 13191..13301 /note="similarity: pir||A54602; microtubule-associated serine/threonine protein kinase MAST205; putative" misc_feature 14006..14050 /note="similarity: pir||A54602; microtubule-associated serine/threonine protein kinase MAST205; putative" misc_feature 14009..14187 /note="predicted exon, grail2exons_human_1.3; frame=1, forward strand, quality=excellent; putative" misc_feature 14242..14349 /note="similarity: pir||A54602; microtubule-associated serine/threonine protein kinase MAST205; putative" misc_feature 14265..14512 /note="predicted exon, grail2exons_human_1.3; frame=0, forward strand, quality=good; putative" misc_feature 14531..14587 /note="similarity: pir||A54602; microtubule-associated serine/threonine protein kinase MAST205; putative" repeat_region complement(14639..14704) /note="repeat match = HSAL14836; putative" /rpt_family="Alu" repeat_region complement(14654..14943) /note="repeat match = ALU; putative" /rpt_family="Alu" misc_feature 15178..15384 /note="predicted exon, grail2exons_human_1.3; frame=2, forward strand, quality=good; putative" misc_feature 15240..15296 /note="similarity: pir||A54602; microtubule-associated serine/threonine protein kinase MAST205; similarity: pir||A54602; microtubule-associated serine/threonine protein kinase MAST205; putative" misc_feature 15300..15383 /note="similarity: pir||A54602; microtubule-associated serine/threonine protein kinase MAST205; similarity: pir||A54602; microtubule-associated serine/threonine protein kinase MAST205; putative" misc_feature 15609..15830 /note="similarity: pir||A54602; microtubule-associated serine/threonine protein kinase MAST205; similarity: pir||A54602; microtubule-associated serine/threonine protein kinase MAST205; putative" misc_feature 15689..15830 /note="predicted exon, grail2exons_human_1.3; frame=2, forward strand, quality=excellent; putative" repeat_region complement(15959..16111) /note="repeat match = HSAL04826; putative" /rpt_family="Alu" repeat_region complement(15992..16259) /note="repeat match = HSAL06556; putative" /rpt_family="Alu" repeat_region complement(16218..16270) /note="repeat match = HSAL07093; putative" /rpt_family="Alu" repeat_region 16306..17211 /note="repeat match = HSAL00546; putative" /rpt_family="Alu" misc_feature 16447..16751 /note="ss region (gap in bottom strand); putative" misc_feature 17359..17481 /note="similarity: pir||A54602; microtubule-associated serine/threonine protein kinase MAST205; similarity: pir||A54602; microtubule-associated serine/threonine protein kinase MAST205; putative" misc_feature 17359..17515 /note="predicted exon, grail2exons_human_1.3; frame=0, forward strand, quality=excellent; putative" misc_feature 17568..17705 /note="similarity: pir||A54602; microtubule-associated serine/threonine protein kinase MAST205; similarity: pir||A54602; microtubule-associated serine/threonine protein kinase MAST205; putative" misc_feature 17571..17707 /note="predicted exon, grail2exons_human_1.3; frame=2, forward strand, quality=excellent; putative" repeat_region 17862..18460 /note="repeat match = HSAL02794; putative" /rpt_family="Alu" misc_feature complement(18235..18350) /note="predicted exon, grail2exons_human_1.3; frame=1, reverse strand, quality=excellent; putative" misc_feature complement(18437..18563) /note="predicted exon, grail2exons_human_1.3; frame=2, reverse strand, quality=good; putative" repeat_region 19146..19448 /note="repeat match = ALU; putative" /rpt_family="Alu" misc_feature 19857..19985 /note="similarity: pir||A54602; microtubule-associated serine/threonine protein kinase MAST205; putative" misc_feature 19859..20099 /note="predicted exon, grail2exons_human_1.3; frame=2, forward strand, quality=excellent; putative" misc_feature 20144..21405 /note="predicted exon, grail2exons_human_1.3; frame=0, forward strand, quality=good; putative" misc_feature 20152..20433 /note="similarity: pir||A54602; microtubule-associated serine/threonine protein kinase MAST205; putative" misc_feature 20437..20529 /note="similarity: pir||A54602; microtubule-associated serine/threonine protein kinase MAST205; putative" misc_feature 20539..20583 /note="similarity: pir||A54602; microtubule-associated serine/threonine protein kinase MAST205; putative" repeat_region complement(21918..22051) /note="repeat match = HSAL10627; putative" /rpt_family="Alu" repeat_region complement(21945..22214) /note="repeat match = HSAL13255; putative" /rpt_family="Alu" repeat_region complement(22105..22460) /note="repeat match = HSAL15572; putative" /rpt_family="Alu" repeat_region complement(22342..22517) /note="repeat match = HSAL02466; putative" /rpt_family="Alu" CDS complement(join(22525..22898,24917..25114,25241..25369, 27336..27414,27506..27686,27824..27909)) /note="coding region constructed from Xgrail 1.3 predictions, EST and SWISSPROT matches; predicted protein most similar to hypothetical proteins of unknown function in C.elegans: (Z46266)Co7B5.5 [caenorhabditis elegans] (+1,5.0e-38);(P34387) YLS2_CAEEL HYPOTHETICAL 27.0 KD PROTEIN Fo9G8.2(+1,8.7e-20); comment for location 22525-22898; BLASTX similarity to: (Z46266) C07B5.5 [Caenorhabditis elegans](aa 267- 365); (1.7e-12); comment for location 25241-25369; BLASTX similarity to: sp|P34387|YLS2_CAEEL HYPOTHETICAL 27.0 KD PROTEIN F09G8.2 IN CHROMOSOME III(aa 113- 148); (1.2e-4) and (Z46266) C07B5.5 [Caenorhabditis elegans] (aa 150- 187); (0.0038); user supplied translation (frame +1): MIPLLLAALLCVPAGALTCYGDSGQPVDWFVVYKLPALRGSGEAAQRGLQYKYLDESS GGWRDGRALINS PEGAVGRSLQPLYRSNTSQLAFLLYNDQPPQPSKAQDSSMRGHTKGVLLLDHDGGFWL VHSVPNFPPPAS SAAYSWPHSACTYGQTLLCKQLTYTYPWVYNYQLEGIFAQEFPDLENVVKGHHVSQEP WNSSITLTSQAG AVFQSFAKFSKFGDDLYSGWLAAALGTNLQVQFWHKTVGILPSNCSDIWQVLNVNQIA FPGPAGPSFNST EDHSKWCVSPKGPWTCVGDMNRNQGEEQRGGGTLCAQLPALWKAFQPLVKNYQPCNGM ARKPSRAYKI; comment for location 27506-27686; BLASTX similarity to: sp|P34387|YLS2_CAEEL HYPOTHETICAL 27.0 KD PROTEIN F09G8.2 IN CHROMOSOME III(aa 33- 87); (Pval= 0.0031); putative" /codon_start=1 /product="hypothetical human protein R31240_2" /db_xref="PID:g1905907" /translation="MIPLLLAALLCVPAGALTCYGDSGQPVDWFVVYKLPALRGSGEA AQRGLQYKYLDESSGGWRDGRALINSPEGAVGRSLQPLYRSNTSQLAFLLYNDQPPQP SKAQDSSMRGHTKGVLLLDHDGGFWLVHSVPNFPPPASSAAYSWPHSACTYGQTLLCK QLTYTYPWVYNYQLEGIFAQEFPDLENVVKGHHVSQEPWNSSITLTSQAGAVFQSFAK FSKFGDDLYSGWLAAALGTNLQVQFWHKTVGILPSNCSDIWQVLNVNQIAFPGPAGPS FNSTEDHSKWCVSPKGPWTCVGDMNRNQGEEQRGGGTLCAQLPALWKAFQPLVKNYQP CNGMARKPSRAYKI" misc_feature complement(22781..22912) /note="predicted exon, grail2exons_human_1.3; frame=2, reverse strand, quality=excellent; putative" repeat_region complement(23002..23589) /note="repeat match = HSAL00718; putative" /rpt_family="Alu" repeat_region complement(23371..23589) /note="repeat match = ALU; putative" /rpt_family="Alu" repeat_region complement(23610..24367) /note="repeat match = HSAL04809; putative" /rpt_family="Alu" repeat_region complement(23781..24408) /note="repeat match = HSAL01365; putative" /rpt_family="Alu" repeat_region complement(24029..24508) /note="repeat match = HSAL05507; putative" /rpt_family="Alu" misc_feature complement(24184..24298) /note="predicted exon, grail2exons_human_1.3; frame=1, reverse strand, quality=excellent; putative" misc_feature complement(24476..24559) /note="predicted exon, grail2exons_human_1.3; frame=1, reverse strand, quality=excellent; putative" repeat_region complement(24513..24640) /note="repeat match = HSAL05343; putative" /rpt_family="Alu" misc_feature complement(24672..24819) /note="predicted exon, grail2exons_human_1.3; frame=0, reverse strand, quality=excellent; putative" misc_feature complement(24937..25036) /note="predicted exon, grail2exons_human_1.3; frame=0, reverse strand, quality=excellent; putative" misc_feature complement(25123..25207) /note="predicted exon, grail2exons_human_1.3; frame=1, reverse strand, quality=excellent; putative" repeat_region 25395..25446 /note="repeat match = HSAL01321; putative" /rpt_family="Alu" repeat_region 25407..26034 /note="repeat match = HSAL06123; putative" /rpt_family="Alu" repeat_region 26232..26617 /note="repeat match = HSAL02985; putative" /rpt_family="Alu" repeat_region 26233..26759 /note="repeat match = HSAL02430; putative" /rpt_family="Alu" repeat_region 26644..26792 /note="repeat match = HSAL09244; putative" /rpt_family="Alu" repeat_region 26760..26812 /note="repeat match = HSAL02043; putative" /rpt_family="Alu" repeat_region 26908..27212 /note="repeat match = HSAL01476; putative" /rpt_family="Alu" repeat_region complement(28318..28398) /note="repeat match = HSAL01159; putative" /rpt_family="Alu" repeat_region complement(28335..28984) /note="repeat match = HSAL00432; putative" /rpt_family="Alu" repeat_region complement(28341..28625) /note="repeat match = ALU; putative" /rpt_family="Alu" misc_feature complement(28376..28491) /note="predicted exon, grail2exons_human_1.3; frame=1, reverse strand, quality=excellent; putative" misc_feature complement(28569..28697) /note="predicted exon, grail2exons_human_1.3; frame=2, reverse strand, quality=excellent; putative" misc_feature complement(28789..28881) /note="predicted exon, grail2exons_human_1.3; frame=1, reverse strand, quality=excellent; putative" repeat_region complement(29065..29157) /note="repeat match = L1001607; putative" /rpt_family="LINE" repeat_region complement(29159..29273) /note="repeat match = HSAL08636; putative" /rpt_family="Alu" repeat_region complement(29212..29291) /note="repeat match = HSAL09753; putative" /rpt_family="Alu" repeat_region complement(29470..29785) /note="repeat match = HSAL00667; putative" /rpt_family="Alu" repeat_region complement(29470..29667) /note="repeat match = HSAL01580; putative" /rpt_family="Alu" repeat_region 29963..30247 /note="repeat match = ALU; putative" /rpt_family="Alu" repeat_region 30205..30258 /note="repeat match = HSAL16995; putative" /rpt_family="Alu" misc_feature complement(30257..30375) /note="predicted exon, grail2exons_human_1.3; frame=0, reverse strand, quality=excellent; putative" misc_feature complement(30463..30575) /note="predicted exon, grail2exons_human_1.3; frame=0, reverse strand, quality=excellent; putative" misc_feature complement(30632..30783) /note="predicted exon, grail2exons_human_1.3; frame=0, reverse strand, quality=excellent; putative" repeat_region complement(31334..31371) /note="repeat match = LTR7; putative" /rpt_family="LTR" gene complement(31420..33675) /gene="EKLF" CDS complement(join(31420..31595,31852..32677,33589..33675)) /gene="EKLF" /function="erythroid cell-specific transcription factor of the Kruppel zinc finger family" /note="user supplied translation (frame +1): MATAETALPSISTLTALGPFPDTQDDFLKWWRSEEAQDMGPGPPDPTEPPLHVKSEDQ PGEEEDDERGAD ATWDLDLLLTNFSGPEPGGAPQTCALAPSEASGAQYPPPPETLGAYAGGPGLVAGLLG SEDHSGWVRPAL RARAPDAFVGPALAPAPAPEPKALALQPVYPGPGAGSSGGYFPRTGLSVPAASGAPYG LLSGYPAMYPAP QYQGHFQLFRGLQGPAPGPATSPSFLSCLGPGTVGTGLGGTAEDPGVIAETAPSKRGR RSWARKRQAAHT CAHPGCGKSYTKSSHLKAHLRTHTGEKPYACTWEGCGWRFARSDELTRHYRKHTGQRP FRCQLCPRAFSR SDHLALHMKRHL; comment for location 31852-32677; BLASTX (U37106) erythroid Kruppel-like factor EKLF [Homo sapiens]; 100% identity to residues 31- 292, (Pval= 1.1e-104); comment for location 31420-31595; BLASTX (U37106) erythroid Kruppel-like factor EKLF [Homo sapiens]; 100% identity to residues 306- 362, (Pval- 14.e-39); comment for location 33589-33675; BLASTX (U37106) erythroid Kruppel-like factor EKLF [Homo sapiens]; 100% identity to residues 1-30, (Pval= 3.0e-12); putative" /codon_start=1 /product="erythroid Kruppel-like factor" /db_xref="PID:g1905908" /translation="MATAETALPSISTLTALGPFPDTQDDFLKWWRSEEAQDMGPGPP DPTEPPLHVKSEDQPGEEEDDERGADATWDLDLLLTNFSGPEPGGAPQTCALAPSEAS GAQYPPPPETLGAYAGGPGLVAGLLGSEDHSGWVRPALRARAPDAFVGPALAPAPAPE PKALALQPVYPGPGAGSSGGYFPRTGLSVPAASGAPYGLLSGYPAMYPAPQYQGHFQL FRGLQGPAPGPATSPSFLSCLGPGTVGTGLGGTAEDPGVIAETAPSKRGRRSWARKRQ AAHTCAHPGCGKSYTKSSHLKAHLRTHTGEKPYACTWEGCGWRFARSDELTRHYRKHT GQRPFRCQLCPRAFSRSDHLALHMKRHL" misc_feature complement(31423..31596) /gene="EKLF" /note="similarity: gi|1049020; (U25096) Kruppel-like factor LKLF [Mus musculus]; similarity: gi|1389692; (U37106) erythroid Kruppel-like factor EKLF [Homo sapiens]; similarity: sp|P46099|EKLF_MOUSE; ERYTHROID KRUEPPEL-LIKE TRANSCRIPTION FACTOR (EKLF); putative" misc_feature complement(31426..31596) /gene="EKLF" /note="similarity: gi|912488; (U20344) gut-enriched Kruppel-like factor [Mus musculus]; putative" misc_feature complement(31847..32680) /gene="EKLF" /note="similarity: gi|1389692; (U37106) erythroid Kruppel-like factor EKLF [Homo sapiens]; putative" misc_feature complement(31847..32026) /gene="EKLF" /note="similarity: sp|P46099|EKLF_MOUSE; ERYTHROID KRUEPPEL-LIKE TRANSCRIPTION FACTOR (EKLF); putative" misc_feature complement(31847..31993) /gene="EKLF" /note="similarity: gi|912488; (U20344) gut-enriched Kruppel-like factor [Mus musculus]; putative" misc_feature complement(31847..31981) /gene="EKLF" /note="similarity: gi|1049020; (U25096) Kruppel-like factor LKLF [Mus musculus]; putative" misc_feature complement(32009..32266) /gene="EKLF" /note="similarity: sp|P46099|EKLF_MOUSE; ERYTHROID KRUEPPEL-LIKE TRANSCRIPTION FACTOR (EKLF); putative" misc_feature complement(32285..32440) /gene="EKLF" /note="similarity: sp|P46099|EKLF_MOUSE; ERYTHROID KRUEPPEL-LIKE TRANSCRIPTION FACTOR (EKLF); putative" misc_feature complement(32438..32680) /gene="EKLF" /note="similarity: sp|P46099|EKLF_MOUSE; ERYTHROID KRUEPPEL-LIKE TRANSCRIPTION FACTOR (EKLF); putative" repeat_region complement(33141..33210) /note="repeat match = HSAL00669; putative" /rpt_family="Alu" repeat_region complement(33171..33358) /note="repeat match = HSAL09244; putative" /rpt_family="Alu" misc_feature complement(33584..33834) /note="predicted exon, grail2exons_human_1.3; frame=2, reverse strand, quality=excellent; putative" repeat_region 34680..34888 /note="repeat match = HSAL05507; putative" /rpt_family="Alu" repeat_region 34773..34927 /note="repeat match = HSAL01611; putative" /rpt_family="Alu" repeat_region 34775..34895 /note="repeat match = HSAL05899; putative" /rpt_family="Alu" repeat_region 34969..35040 /note="repeat match = HSAL05437; putative" /rpt_family="Alu" repeat_region 34985..35178 /note="repeat match = HSAL03408; putative" /rpt_family="Alu" repeat_region 35105..35198 /note="repeat match = HSAL02430; putative" /rpt_family="Alu" repeat_region 35202..35488 /note="repeat match = ALU; putative" /rpt_family="Alu" repeat_region 35207..35595 /note="repeat match = HSAL06148; putative" /rpt_family="Alu" misc_feature 35376..35724 /note="ss region (gap in top strand); putative" repeat_region 35520..35636 /note="repeat match = HSAL17011; putative" /rpt_family="Alu" repeat_region complement(35954..36035) /note="repeat match = HSAL04697; putative" /rpt_family="Alu" repeat_region complement(35973..36396) /note="repeat match = HSAL00518; putative" /rpt_family="Alu" repeat_region complement(36363..36407) /note="repeat match = HSAL00080; putative" /rpt_family="Alu" repeat_region 36631..37076 /note="repeat match = HSAL00518; putative" /rpt_family="Alu" repeat_region 36937..37113 /note="repeat match = HSAL01630; putative" /rpt_family="Alu" repeat_region 37028..37136 /note="repeat match = HSAL03507; putative" /rpt_family="Alu" gene 37840..46076 /gene="GCDH" misc_feature 37840..37941 /gene="GCDH" /note="similarity: gi|260169; glutaryl-CoA dehydrogenase, GCDH [human, liver, Peptide, 438 aa]; putative" misc_feature 37840..37930 /gene="GCDH" /note="predicted exon, grail2exons_human_1.3; frame=0, forward strand, quality=excellent; putative" misc_feature 37840..37902 /gene="GCDH" /note="similarity: gi|1439521; (U18992) glutaryl-CoA dehydrogenase precursor [Mus musculus]; putative" CDS join(37840..37930,38022..38057,38366..38509,38651..38713, 40018..40188,42527..42656,42740..42956,43445..43548, 43838..43963,44238..44398,46003..46076) /gene="GCDH" /function="mitochondrial matrix -associated, FAD-containing dehydrogenase; catalyzes oxidative decarboxylation of glutaryl-CoA to crotonyl-CoA + CO2" /note="comment for location 42527-42656; gi|260169 glutaryl-CoA dehydrogenase, GCDH [human, liver, Peptide, 438 aa]; 100% identity to residues 169- 210 (Pval= 4.2e-23); comment for location 40018-40188; gi|260169 glutaryl-CoA dehydrogenase, GCDH [human, liver, Peptide, 438 aa]; 100% identity to residues 112- 167, (Pval= 1.5e-32); comment for location 43445-43548; gi|260169 glutaryl-CoA dehydrogenase, GCDH [human, liver, Peptide, 438 aa]; 100% identity to residues 285- 317 (Pval= 1.6e-18); comment for location 42740-42956; gi|260169 glutaryl-CoA dehydrogenase, GCDH [human, liver, Peptide, 438 aa]; 100% identity to residues 213- 283 (Pval= 1.3e-43); comment for location 38366-38509; gi|260169 glutaryl-CoA dehydrogenase, GCDH [human, liver, Peptide, 438 aa]; 100% identity to residues 44- 90, (Pval= 3.9e-45); comment for location 38022-38057; gi|260169 glutaryl-CoA dehydrogenase, GCDH [human, liver, Peptide, 438 aa]; 100% identity to residues 32- 42, (Pval= 0.99); comment for location 38651-38713; gi|260169 glutaryl-CoA dehydrogenase, GCDH [human, liver, Peptide, 438 aa]; 100% identity to residues 92- 111, (Pval= 3.2e-06); comment for location 44238-44398; gi|260169 glutaryl-CoA dehydrogenase, GCDH [human, liver, Peptide, 438 aa]; 98% identity to residues 361- 413, (Pval= 2.3e-30); comment for location 43838-43963; gi|260169 glutaryl-CoA dehydrogenase, GCDH [human, liver, Peptide, 438 aa]; 100% identity to residues 319- 359, (Pval= 1.8e-21); comment for location 46003-46076; gi|260169 glutaryl-CoA dehydrogenase, GCDH [human, liver, Peptide, 438 aa]; 100% identity to residues 415- 437 (Pval= 8.8e-09); user-supplied translation (frame +1): MALRGVSVRLLSRGPGLHVLRTWVSSAAQTEKGGRTQSQLAKSSRPEFDWQDPLVLEE QLTTDEILIRDT FRTYCQERLMPRILLANRNEVFHREIISEMGELGVLGPTIKGYGCAGVSSVAYGLLAR ELERVDSGYRSA MSVQSSLVMHPIYAYGSEEQRQKYLPQLAKGELLGCFGLTEPNSGSDPSSMETRAHYN SSNKSYTLNGTK TWITNSPMADLFVVWARCEDGCIRGFLLEKGMRGLSAPRIQGKFSLRASATGMIIMDG VEVPEENVLPGA SSLGGPFGCLNNARYGIAWGVLGASEFCLHTARQYALDRMQFGVPLARNQLIQKKLAD MLTEITLGLHAC LQLGRLKDQDKAAPEMVSLLKRNNCGKALDIARQARDMLGGNGISDEYHVIRHAMNLE AVNTYEGTHDIH ALILGRAITGIQAFTASK; comment for location 37840-37930; gi|260169 glutaryl-CoA dehydrogenase, GCDH [human, liver, Peptide, 438 aa]; 100% identity to residues 1-30, (Pval= 3.9e-45); putative" /codon_start=1 /product="glutaryl Co-A dehydrogenase" /db_xref="PID:g1905909" /translation="MALRGVSVRLLSRGPGLHVLRTWVSSAAQTEKGGRTQSQLAKSS RPEFDWQDPLVLEEQLTTDEILIRDTFRTYCQERLMPRILLANRNEVFHREIISEMGE LGVLGPTIKGYGCAGVSSVAYGLLARELERVDSGYRSAMSVQSSLVMHPIYAYGSEEQ RQKYLPQLAKGELLGCFGLTEPNSGSDPSSMETRAHYNSSNKSYTLNGTKTWITNSPM ADLFVVWARCEDGCIRGFLLEKGMRGLSAPRIQGKFSLRASATGMIIMDGVEVPEENV LPGASSLGGPFGCLNNARYGIAWGVLGASEFCLHTARQYALDRMQFGVPLARNQLIQK KLADMLTEITLGLHACLQLGRLKDQDKAAPEMVSLLKRNNCGKALDIARQARDMLGGN GISDEYHVIRHAMNLEAVNTYEGTHDIHALILGRAITGIQAFTASK" misc_feature 37900..37965 /gene="GCDH" /note="similarity: gi|1439521; (U18992) glutaryl-CoA dehydrogenase precursor [Mus musculus]; putative" misc_feature 38000..38056 /gene="GCDH" /note="similarity: gi|260169; glutaryl-CoA dehydrogenase, GCDH [human, liver, Peptide, 438 aa]; putative" misc_feature 38024..38056 /gene="GCDH" /note="similarity: gi|260168; glutaryl-CoA dehydrogenase, GCDH [swine, liver, Peptide Partial, 408...; putative" misc_feature 38356..38508 /gene="GCDH" /note="similarity: gi|1439521; (U18992) glutaryl-CoA dehydrogenase precursor [Mus musculus]; putative" misc_feature 38365..38508 /gene="GCDH" /note="similarity: gi|260169; glutaryl-CoA dehydrogenase, GCDH [human, liver, Peptide, 438 aa]; similarity: gi|260168; glutaryl-CoA dehydrogenase, GCDH [swine, liver, Peptide Partial, 408...; putative" misc_feature 38392..38521 /gene="GCDH" /note="predicted exon, grail2exons_human_1.3; frame=0, forward strand, quality=excellent; putative" misc_feature 38615..38713 /gene="GCDH" /note="predicted exon, grail2exons_human_1.3; frame=0, forward strand, quality=good; putative" misc_feature 38650..38715 /gene="GCDH" /note="similarity: gi|260169; glutaryl-CoA dehydrogenase, GCDH [human, liver, Peptide, 438 aa]; similarity: gi|1439521; (U18992) glutaryl-CoA dehydrogenase precursor [Mus musculus]; similarity: gi|260168; glutaryl-CoA dehydrogenase, GCDH [swine, liver, Peptide Partial, 408...; putative" repeat_region complement(38792..39061) /note="repeat match = HSAL04474; putative" /rpt_family="Alu" repeat_region complement(38851..39376) /note="repeat match = HSAL02812; putative" /rpt_family="Alu" repeat_region complement(38853..39716) /note="repeat match = HSAL00363; putative" /rpt_family="Alu" repeat_region complement(39508..39770) /note="repeat match = HSAL02071; putative" /rpt_family="Alu" repeat_region complement(39733..39792) /note="repeat match = HSAL05934; putative" /rpt_family="Alu" misc_feature 40017..40208 /gene="GCDH" /note="similarity: gi|1041334; (Z66513) F54D5.7 [Caenorhabditis elegans]; putative" misc_feature 40017..40193 /gene="GCDH" /note="similarity: gi|260169; glutaryl-CoA dehydrogenase, GCDH [human, liver, Peptide, 438 aa]; similarity: gi|1439521; (U18992) glutaryl-CoA dehydrogenase precursor [Mus musculus]; putative" misc_feature 40017..40172 /gene="GCDH" /note="similarity: gi|260168; glutaryl-CoA dehydrogenase, GCDH [swine, liver, Peptide Partial, 408 ...; putative" misc_feature 40018..40188 /gene="GCDH" /note="predicted exon, grail2exons_human_1.3; frame=2, forward strand, quality=excellent; putative" misc_feature 40155..40193 /gene="GCDH" /note="similarity: gi|260168; glutaryl-CoA dehydrogenase, GCDH [swine, liver, Peptide Partial, 408 ...; putative" repeat_region 40454..40499 /note="repeat match = HSAL01587; putative" /rpt_family="Alu" repeat_region 40465..40587 /note="repeat match = HSAL10737; putative" /rpt_family="Alu" misc_feature 40668..41015 /gene="GCDH" /note="similarity: sp|P10660|RS6_HUMAN; 40S RIBOSOMAL PROTEIN S6 (PHOSPHOPROTEIN NP33).; putative" misc_feature 40668..40899 /gene="GCDH" /note="predicted exon, grail2exons_human_1.3; frame=2, forward strand, quality=good; putative" misc_feature 41011..41412 /gene="GCDH" /note="similarity: sp|P10660|RS6_HUMAN; 40S RIBOSOMAL PROTEIN S6 (PHOSPHOPROTEIN NP33).; putative" misc_feature 41050..41164 /gene="GCDH" /note="predicted exon, grail2exons_human_1.3; frame=0, forward strand, quality=good; putative" misc_feature 41227..41415 /gene="GCDH" /note="predicted exon, grail2exons_human_1.3; frame=0, forward strand, quality=good; putative" misc_feature 41230..41412 /gene="GCDH" /note="similarity: sp|P10660|RS6_HUMAN; 40S RIBOSOMAL PROTEIN S6 (PHOSPHOPROTEIN NP33).; putative" repeat_region complement(41795..42454) /note="repeat match = HSAL00919; putative" /rpt_family="Alu" misc_feature 42211..42312 /gene="GCDH" /note="ss region (gap in bottom strand); putative" misc_feature 42526..42657 /gene="GCDH" /note="similarity: gi|1439521; (U18992) glutaryl-CoA dehydrogenase precursor [Mus musculus]; similarity: gi|260169; glutaryl-CoA dehydrogenase, GCDH [human, liver, Peptide, 438 aa]; similarity: gi|1041334; (Z66513) F54D5.7 [Caenorhabditis elegans]; similarity: gi|260168; glutaryl-CoA dehydrogenase, GCDH [swine, liver, Peptide Partial, 408; putative" misc_feature 42563..42656 /gene="GCDH" /note="predicted exon, grail2exons_human_1.3; frame=0, forward strand, quality=excellent; putative" misc_feature 42740..42956 /gene="GCDH" /note="predicted exon, grail2exons_human_1.3; frame=2, forward strand, quality=excellent; putative" misc_feature 42741..42959 /gene="GCDH" /note="similarity: gi|260168; glutaryl-CoA dehydrogenase, GCDH [swine, liver, Peptide Partial, 408; putative" misc_feature 42741..42956 /gene="GCDH" /note="similarity: gi|260169; glutaryl-CoA dehydrogenase, GCDH [human, liver, Peptide, 438 aa]; putative" misc_feature 42741..42953 /gene="GCDH" /note="similarity: gi|1439521; (U18992) glutaryl-CoA dehydrogenase precursor [Mus musculus]; putative" misc_feature 42741..42785 /gene="GCDH" /note="similarity: gi|1041334; (Z66513) F54D5.7 [Caenorhabditis elegans]; putative" misc_feature 42801..42953 /gene="GCDH" /note="similarity: gi|1041334; (Z66513) F54D5.7 [Caenorhabditis elegans]; putative" misc_feature 43430..43549 /gene="GCDH" /note="similarity: gi|1439521; (U18992) glutaryl-CoA dehydrogenase precursor [Mus musculus]; putative" misc_feature 43433..43549 /gene="GCDH" /note="similarity: gi|260169; glutaryl-CoA dehydrogenase, GCDH [human, liver, Peptide, 438 aa]; similarity: gi|260168; glutaryl-CoA dehydrogenase, GCDH [swine, liver, Peptide Partial, 408; putative" misc_feature 43439..43549 /gene="GCDH" /note="similarity: gi|1041334; (Z66513) F54D5.7 [Caenorhabditis elegans]; putative" misc_feature 43445..43552 /gene="GCDH" /note="predicted exon, grail2exons_human_1.3; frame=1, forward strand, quality=excellent; putative" misc_feature 43836..43964 /gene="GCDH" /note="similarity: gi|1439521; (U18992) glutaryl-CoA dehydrogenase precursor [Mus musculus]; similarity: gi|1041334; (Z66513) F54D5.7 [Caenorhabditis elegans]; similarity: gi|260169; glutaryl-CoA dehydrogenase, GCDH [human, liver, Peptide, 438 aa]; similarity: gi|260168; glutaryl-CoA dehydrogenase, GCDH [swine, liver, Peptide Partial, 408; putative" misc_feature 43838..43963 /gene="GCDH" /note="predicted exon, grail2exons_human_1.3; frame=2, forward strand, quality=excellent; putative" misc_feature 44236..44400 /gene="GCDH" /note="similarity: gi|1439521; (U18992) glutaryl-CoA dehydrogenase precursor [Mus musculus]; similarity: gi|260169; glutaryl-CoA dehydrogenase, GCDH [human, liver, Peptide, 438 aa]; similarity: gi|260168; glutaryl-CoA dehydrogenase, GCDH [swine, liver, Peptide Partial, 408; putative" misc_feature 44238..44398 /gene="GCDH" /note="predicted exon, grail2exons_human_1.3; frame=0, forward strand, quality=excellent; putative" misc_feature 44248..44400 /gene="GCDH" /note="similarity: gi|1041334; (Z66513) F54D5.7 [Caenorhabditis elegans]; putative" repeat_region 45164..45225 /note="repeat match = HLAL00010; putative" /rpt_family="Alu" repeat_region 45178..45328 /note="repeat match = HSAL01694; putative" /rpt_family="Alu" misc_feature 46003..46072 /gene="GCDH" /note="predicted exon, grail2exons_human_1.3; frame=2, forward strand, quality=excellent; putative" repeat_region 47283..47357 /note="repeat match = HSAL07754; putative" /rpt_family="Alu" repeat_region 47300..47915 /note="repeat match = HSAL00630; putative" /rpt_family="Alu" repeat_region 47785..48050 /note="repeat match = HSAL03283; putative" /rpt_family="Alu" repeat_region 47954..48080 /note="repeat match = HSAL06104; putative" /rpt_family="Alu" repeat_region complement(48129..48272) /note="repeat match = HSAL04964; putative" /rpt_family="Alu" repeat_region complement(48161..48450) /note="repeat match = ALU; putative" /rpt_family="Alu" repeat_region complement(48404..48464) /note="repeat match = HSAL13971; putative" /rpt_family="Alu" repeat_region complement(48483..48761) /note="repeat match = MER7A; putative" /rpt_family="MER" repeat_region 48953..49156 /note="repeat match = HSAL16443; putative" /rpt_family="Alu" repeat_region 49017..49333 /note="repeat match = HSAL03102; putative" /rpt_family="Alu" repeat_region 49219..49367 /note="repeat match = HSAL02663; putative" /rpt_family="Alu" repeat_region complement(49455..49506) /note="repeat match = HSAL02430; putative" /rpt_family="Alu" repeat_region 49513..49595 /note="repeat match = HSAL00416; putative" /rpt_family="Alu" repeat_region 49546..49608 /note="repeat match = HSAL06165; putative" /rpt_family="Alu" repeat_region complement(49623..50209) /note="repeat match = HSAL06169; putative" /rpt_family="Alu" repeat_region complement(49966..50277) /note="repeat match = HSAL06431; putative" /rpt_family="Alu" repeat_region complement(50280..50344) /note="repeat match = HSAL07965; putative" /rpt_family="Alu" repeat_region complement(50294..50421) /note="repeat match = HSAL02075; putative" /rpt_family="Alu" repeat_region complement(50381..50432) /note="repeat match = HSAL04383; putative" /rpt_family="Alu" repeat_region 51255..51321 /note="repeat match = HSAL01428; putative" /rpt_family="Alu" repeat_region 51269..51550 /note="repeat match = HSAL01581; putative" /rpt_family="Alu" repeat_region complement(51867..52260) /note="repeat match = HSAL03071; putative" /rpt_family="Alu" repeat_region complement(51868..52399) /note="repeat match = HSAL02158; putative" /rpt_family="Alu" repeat_region complement(52269..52447) /note="repeat match = HSAL02394; putative" /rpt_family="Alu" repeat_region complement(52469..52685) /note="repeat match = HSAL05071; putative" /rpt_family="Alu" repeat_region complement(52629..52704) /note="repeat match = HSAL01352; putative" /rpt_family="Alu" repeat_region 53162..53438 /note="repeat match = HSAL04932; putative" /rpt_family="Alu" repeat_region 53162..53365 /note="repeat match = HSAL06189; putative" /rpt_family="Alu" repeat_region 53370..53459 /note="repeat match = HSAL06696; putative" /rpt_family="Alu" repeat_region 54031..54112 /note="repeat match = HSAL13218; putative" /rpt_family="Alu" repeat_region 54055..54613 /note="repeat match = HSAL04809; putative" /rpt_family="Alu" misc_feature complement(54273..54549) /note="predicted exon, grail2exons_human_1.3; frame=1, reverse strand, quality=excellent; putative" repeat_region 54624..54677 /note="repeat match = HSAL04644; putative" /rpt_family="Alu" repeat_region 54635..54697 /note="repeat match = HSAL06589; putative" /rpt_family="Alu" misc_feature complement(54856..55044) /note="predicted exon, grail2exons_human_1.3; frame=0, reverse strand, quality=marginal; putative" repeat_region complement(54859..55157) /note="repeat match = HSAL05045; putative" /rpt_family="Alu" misc_feature complement(55201..55280) /note="predicted exon, grail2exons_human_1.3; frame=0, reverse strand, quality=good; putative" repeat_region complement(56080..56202) /note="repeat match = HSAL03290; putative" /rpt_family="Alu" repeat_region complement(56109..56698) /note="repeat match = HSAL02794; putative" /rpt_family="Alu" misc_feature 56647..56930 /note="ss region (gap in top strand); putative" repeat_region complement(56875..56935) /note="repeat match = HSAL03454; putative" /rpt_family="Alu" misc_feature complement(56886..57235) /note="predicted exon, grail2exons_human_1.3; frame=2, reverse strand, quality=good; putative" repeat_region complement(56890..56935) /note="repeat match = HSAL04833; putative" /rpt_family="Alu" repeat_region complement(56947..57049) /note="repeat match = HSAL11836; putative" /rpt_family="Alu" repeat_region complement(56971..57530) /note="repeat match = HSAL03104; putative" /rpt_family="Alu" repeat_region complement(57216..57795) /note="repeat match = HSAL05507; putative" /rpt_family="Alu" misc_feature 57951..58218 /note="ss region (gap in bottom strand); putative" repeat_region complement(57975..58761) /note="repeat match = HSAL00546; putative" /rpt_family="Alu" repeat_region complement(58154..58767) /note="repeat match = HSAL01995; putative" /rpt_family="Alu" misc_feature complement(58498..58566) /note="predicted exon, grail2exons_human_1.3; frame=1, reverse strand, quality=good; putative" misc_feature 58933..59069 /note="predicted exon, grail2exons_human_1.3; frame=1, forward strand, quality=good; putative" misc_feature 59583..59737 /note="predicted exon, grail2exons_human_1.3; frame=0, forward strand, quality=marginal; putative" repeat_region complement(61142..61269) /note="repeat match = HSAL06395; putative" /rpt_family="Alu" repeat_region complement(61172..61433) /note="repeat match = HSAL02217; putative" /rpt_family="Alu" repeat_region complement(61401..61445) /note="repeat match = HSAL04238; putative" /rpt_family="Alu" repeat_region 61576..61685 /note="repeat match = L1MB7; putative" /rpt_family="LINE" repeat_region 61597..61660 /note="repeat match = L1000234B; putative" /rpt_family="LINE" repeat_region 61693..61749 /note="repeat match = HSAL06148; putative" /rpt_family="Alu" repeat_region 61703..61747 /note="repeat match = HSAL11165; putative" /rpt_family="Alu" repeat_region 61857..62156 /note="repeat match = HSAL05084; putative" /rpt_family="Alu" repeat_region complement(62604..62904) /note="repeat match = HSAL02762; putative" /rpt_family="Alu" repeat_region complement(63145..63278) /note="repeat match = HSAL05974; putative" /rpt_family="Alu" repeat_region complement(63183..63586) /note="repeat match = HSAL06237; putative" /rpt_family="Alu" repeat_region complement(63353..63586) /note="repeat match = HSAL02985; putative" /rpt_family="Alu" repeat_region complement(63421..64495) /note="repeat match = HSAL00363; putative" /rpt_family="Alu" repeat_region complement(64415..64516) /note="repeat match = HSAL13207; putative" /rpt_family="Alu" repeat_region 64514..64634 /note="repeat match = HSAL15953; putative" /rpt_family="Alu" repeat_region 64544..64634 /note="repeat match = HSAL11771; putative" /rpt_family="Alu" repeat_region 64564..64676 /note="repeat match = HSAL04253; putative" /rpt_family="Alu" repeat_region 65220..65506 /note="repeat match = ALU; putative" /rpt_family="Alu" repeat_region 65434..65529 /note="repeat match = HSAL05171; putative" /rpt_family="Alu" repeat_region complement(66165..66293) /note="repeat match = HSAL06455; putative" /rpt_family="Alu" repeat_region complement(66198..66321) /note="repeat match = HSAL10394; putative" /rpt_family="Alu" repeat_region 66321..66383 /note="repeat match = HSAL01287; putative" /rpt_family="Alu" repeat_region 66336..66623 /note="repeat match = ALU; putative" /rpt_family="Alu" repeat_region 66567..66662 /note="repeat match = HSAL03331; putative" /rpt_family="Alu" repeat_region 66646..66696 /note="repeat match = HSAL01126; putative" /rpt_family="Alu" repeat_region complement(66710..66799) /note="repeat match = HSAL13814; putative" /rpt_family="Alu" repeat_region complement(66940..67002) /note="repeat match = MER390001; putative" /rpt_family="MER" repeat_region complement(67015..67292) /note="repeat match = HSAL02162; putative" /rpt_family="Alu" repeat_region complement(67062..67280) /note="repeat match = HSAL01752; putative" /rpt_family="Alu" repeat_region 67369..67407 /note="repeat match = HSAL01523; putative" /rpt_family="Alu" repeat_region 67377..67424 /note="repeat match = HSAL10276; putative" /rpt_family="Alu" repeat_region 67393..67484 /note="repeat match = HSAL10778; putative" /rpt_family="Alu" repeat_region 67409..67504 /note="repeat match = HSAL05070; putative" /rpt_family="Alu" repeat_region complement(67734..67798) /note="repeat match = HSAL05499; putative" /rpt_family="Alu" repeat_region complement(67749..68314) /note="repeat match = HSAL02504; putative" /rpt_family="Alu" repeat_region complement(67919..68805) /note="repeat match = HSAL00546; putative" /rpt_family="Alu" repeat_region complement(68374..68961) /note="repeat match = HSAL00299; putative" /rpt_family="Alu" gene complement(69316..80172) /gene="FARS" CDS complement(join(69316..69454,70719..70833,71017..71094, 71207..71375,71472..71571,71658..71742,74911..75026, 75104..75232,75324..75416,76792..76910,77040..77096, 77181..77318,80119..80172)) /gene="FARS" /note="coding region predicted on basis of Xgrail 1.3c predictions, ESTand SWISSPROT matches; comment for location 69316-69454; BLASTX similarity to P15625 SYFB_YEAST PHENYLALANYL-TRNA SYNTHETASE BETA CHAIN, residues 466- 500, (Pval= 5.3e-08 (57% identity); comment for location 70719-70833; BLASTX similarity to P15625 SYFB_YEAST PHENYLALANYL-TRNA SYNTHETASE BETA CHAIN, residues 427- 464, Pval= 4.6e-14 (71% identity); comment for location 71017-71094; BLASTX similarity to P15625 SYFB_YEAST PHENYLALANYL-TRNA SYNTHETASE BETA CHAIN, residues 401- 426, Pval= 1.0e-09 (76% identity); user-supplied translation (frame +1) MEHQAVVGAVKSLQALGEVIEAELRSTKHWELTAEGEEIAREGSHEARVFRSIPPEGL AQSELMRLPSGK VGFSKAMSNKWIRVDSMEDEVQRRLQLVRGGQAEKLGEKERSELRKRKLLAEVTLKTY WVSKGSAFSTSI SKQETELSPEMISSGSWRDRPFKPYNFLAHGVLPDSGHLHPLLKVRSQFRQIFLEMGF TEMPTDNFIESS FWNFDALFQPQQHPARDQHDTFFLRDPAEALQLPMDYVQRVKRTHSQGGYGSQGYKYN WKLDEARKNLLR THTTSASARALYRLAQKKPFTPVKYFSIDRVFRNETLDATHLAEFHQIEGVVADHGLT LGHLMGVLREFF TKLGITQLRFKPAYNPYTEPSMEVFSYHQGLKKWVEVGNSGVFRPEMLLPMGLPENVS VIAWGLSLERPT MIKYGINNIRELVGHKVNLQMVYDSPLCRLDAEPRPPPTQEAA; comment for location 71472-71571; BLASTX similarity to P15625 SYFB_YEAST PHENYLALANYL-TRNA SYNTHETASE BETA CHAIN, residues 314- 345, Pval= 2.4e-06 (56% identity); comment for location 71207-71375; BLASTX similarity to P15625 SYFB_YEAST PHENYLALANYL-TRNA SYNTHETASE BETA CHAIN, residues 349- 400, Pval= 1.3e-18 (63% identity); comment for location 75104-75232; BLASTX similarity to P15625 SYFB_YEAST PHENYLALANYL-TRNA SYNTHETASE BETA CHAIN, residues 204- 244, Pval= 7.6e-10 (58% identity); comment for location 75324-75416; BLASTX similarity to P15625 SYFB_YEAST PHENYLALANYL-TRNA SYNTHETASE BETA CHAIN, residues 178- 201, Pval= 0.18 (50% identity); comment for location 74911-75026; BLASTX similarity to P15625 SYFB_YEAST PHENYLALANYL-TRNA SYNTHETASE BETA CHAIN, residues 246- 283, Pval= 2.6e-14 (63% identity); putative" /codon_start=1 /product="putative human phenylalanine tRNA synthetase" /db_xref="PID:g1905910" /translation="MEHQAVVGAVKSLQALGEVIEAELRSTKHWELTAEGEEIAREGS HEARVFRSIPPEGLAQSELMRLPSGKVGFSKAMSNKWIRVDSMEDEVQRRLQLVRGGQ AEKLGEKERSELRKRKLLAEVTLKTYWVSKGSAFSTSISKQETELSPEMISSGSWRDR PFKPYNFLAHGVLPDSGHLHPLLKVRSQFRQIFLEMGFTEMPTDNFIESSFWNFDALF QPQQHPARDQHDTFFLRDPAEALQLPMDYVQRVKRTHSQGGYGSQGYKYNWKLDEARK NLLRTHTTSASARALYRLAQKKPFTPVKYFSIDRVFRNETLDATHLAEFHQIEGVVAD HGLTLGHLMGVLREFFTKLGITQLRFKPAYNPYTEPSMEVFSYHQGLKKWVEVGNSGV FRPEMLLPMGLPENVSVIAWGLSLERPTMIKYGINNIRELVGHKVNLQMVYDSPLCRL DAEPRPPPTQEAA" repeat_region complement(69871..69965) /note="repeat match = HSAL02754; putative" /rpt_family="Alu" repeat_region complement(69891..70465) /note="repeat match = HSAL00518; putative" /rpt_family="Alu" repeat_region complement(70097..70479) /note="repeat match = HSAL02132; putative" /rpt_family="Alu" misc_feature complement(70718..70834) /gene="FARS" /note="similarity: pir||YFBYAC; phenylalanine--tRNA ligase (EC 6.1.1.20) alpha chain, cytosolic; similarity: pir||YFBYAC; phenylalanine--tRNA ligase (EC 6.1.1.20) alpha chain, cytosolic; similarity: sp|P15625|SYFB_YEAST; PHENYLALANYL-TRNA SYNTHETASE BETA CHAIN CYTOPLASMIC; similarity: sp|P15625|SYFB_YEAST; PHENYLALANYL-TRNA SYNTHETASE BETA CHAIN CYTOPLASMIC; putative" misc_feature complement(71015..71095) /gene="FARS" /note="similarity: pir||YFBYAC; phenylalanine--tRNA ligase (EC 6.1.1.20) alpha chain, cytosolic; similarity: pir||YFBYAC; phenylalanine--tRNA ligase (EC 6.1.1.20) alpha chain, cytosolic; similarity: sp|P15625|SYFB_YEAST; PHENYLALANYL-TRNA SYNTHETASE BETA CHAIN CYTOPLASMIC; similarity: sp|P15625|SYFB_YEAST; PHENYLALANYL-TRNA SYNTHETASE BETA CHAIN CYTOPLASMIC; putative" misc_feature complement(71181..71363) /gene="FARS" /note="similarity: pir||YFBYAC; phenylalanine--tRNA ligase (EC 6.1.1.20) alpha chain, cytosolic; similarity: sp|P15625|SYFB_YEAST; PHENYLALANYL-TRNA SYNTHETASE BETA CHAIN CYTOPLASMIC; similarity: pir||YFBYAC; phenylalanine--tRNA ligase (EC 6.1.1.20) alpha chain, cytosolic; similarity: sp|P15625|SYFB_YEAST; PHENYLALANYL-TRNA SYNTHETASE BETA CHAIN CYTOPLASMIC; putative" misc_feature complement(71475..71570) /gene="FARS" /note="similarity: sp|P15625|SYFB_YEAST; PHENYLALANYL-TRNA SYNTHETASE BETA CHAIN CYTOPLASMIC; similarity: pir||YFBYAC; phenylalanine--tRNA ligase (EC 6.1.1.20) alpha chain, cytosolic; similarity: pir||YFBYAC; phenylalanine--tRNA ligase (EC 6.1.1.20) alpha chain, cytosolic; similarity: sp|P15625|SYFB_YEAST; PHENYLALANYL-TRNA SYNTHETASE BETA CHAIN CYTOPLASMIC; putative" misc_feature complement(71657..71710) /gene="FARS" /note="similarity: pir||YFBYAC; phenylalanine--tRNA ligase (EC 6.1.1.20) alpha chain, cytosolic; similarity: pir||YFBYAC; phenylalanine--tRNA ligase (EC 6.1.1.20) alpha chain, cytosolic; similarity: sp|P15625|SYFB_YEAST; PHENYLALANYL-TRNA SYNTHETASE BETA CHAIN CYTOPLASMIC; putative" repeat_region complement(71782..71893) /note="repeat match = HSAL06165; putative" /rpt_family="Alu" repeat_region complement(71805..72212) /note="repeat match = HSAL02666; putative" /rpt_family="Alu" repeat_region complement(71941..72513) /note="repeat match = HSAL00641; putative" /rpt_family="Alu" repeat_region complement(72256..73140) /note="repeat match = HSAL04809; putative" /rpt_family="Alu" repeat_region complement(72845..73302) /note="repeat match = HSAL06443; putative" /rpt_family="Alu" repeat_region complement(73169..73381) /note="repeat match = HSAL08068; putative" /rpt_family="Alu" repeat_region complement(73336..73394) /note="repeat match = HSAL00453; putative" /rpt_family="Alu" repeat_region 73886..73938 /note="repeat match = HSAL00686; putative" /rpt_family="Alu" repeat_region 73901..74320 /note="repeat match = HSAL02807; putative" /rpt_family="Alu" repeat_region 74021..74611 /note="repeat match = HSAL06169; putative" /rpt_family="Alu" repeat_region complement(74685..74776) /note="repeat match = HSAL03160; putative" /rpt_family="Alu" repeat_region complement(74715..74810) /note="repeat match = HSAL07425; putative" /rpt_family="Alu" repeat_region complement(74735..74807) /note="repeat match = HSAL00758; putative" /rpt_family="Alu" repeat_region complement(75765..75868) /note="repeat match = HLAL00001; putative" /rpt_family="Alu" repeat_region complement(75787..76175) /note="repeat match = HSAL01443; putative" /rpt_family="Alu" repeat_region complement(75787..76075) /note="repeat match = ALU; putative" /rpt_family="Alu" repeat_region complement(76095..76196) /note="repeat match = HSAL07283; putative" /rpt_family="Alu" repeat_region complement(77671..77832) /note="repeat match = HSAL04801; putative" /rpt_family="Alu" repeat_region complement(77709..78260) /note="repeat match = HSAL02430; putative" /rpt_family="Alu" repeat_region complement(78606..78898) /note="repeat match = HSAL03071; putative" /rpt_family="Alu" repeat_region complement(81071..81133) /note="repeat match = HSAL13630; putative" /rpt_family="Alu" repeat_region complement(81086..81369) /note="repeat match = HSAL06592; putative" /rpt_family="Alu" repeat_region 81425..81718 /note="repeat match = HSAL06577; putative" /rpt_family="Alu" repeat_region 81867..81959 /note="repeat match = HSAL11907; putative" /rpt_family="Alu" repeat_region 81887..82170 /note="repeat match = HSAL06455; putative" /rpt_family="Alu" repeat_region complement(82478..82664) /note="repeat match = HSAL02144; putative" /rpt_family="Alu" repeat_region complement(82517..82664) /note="repeat match = HSAL14373; putative" /rpt_family="Alu" repeat_region complement(82765..82826) /note="repeat match = HSAL12784; putative" /rpt_family="Alu" repeat_region complement(82780..83189) /note="repeat match = HSAL05507; putative" /rpt_family="Alu" repeat_region complement(82893..83188) /note="repeat match = HSAL02005; putative" /rpt_family="Alu" misc_feature 83214..83602 /note="ss region (gap in bottom strand); putative" repeat_region complement(83513..83673) /note="repeat match = HSAL14693; putative" /rpt_family="Alu" repeat_region complement(83547..83814) /note="repeat match = HSAL00837; putative" /rpt_family="Alu" repeat_region complement(83752..83832) /note="repeat match = HSAL06411; putative" /rpt_family="Alu" gene 85249..91043 /gene="CRTC" CDS join(85249..85339,85703..85804,85997..86200,86622..86716, 86812..87021,87110..87223,87313..87456,90098..90190, 90274..90474) /gene="CRTC" /function="calcium binding protein" /note="comment for location 85997-86200; BLASTX sp|P27797|CRTC_HUMAN CALRETICULIN PRECURSOR from residues 66- 132, Pval= 2.3e-42, (100% identity); comment for location 87313-87456; BLASTX sp|P27797|CRTC_HUMAN CALRETICULIN PRECURSOR from residues 273- 320, Pval= 3.4e-32 (100% identity); comment for location 87110-87223; BLASTX sp|P27797|CRTC_HUMAN CALRETICULIN PRECURSOR from residues 235- 272, Pval= 3.0e-24 (100% identity); comment for location 86812-87021; BLASTX sp|P27797|CRTC_HUMAN CALRETICULIN PRECURSOR from residues 165- 234, Pval= 6.9e-45 (100% identity); comment for location 90274-90474; BLASTX sp|P27797|CRTC_HUMAN CALRETICULIN PRECURSOR from residues 352- 417, Pval= 3.0e-40 (100% identity); comment for location 86622-86716; BLASTX sp|P27797|CRTC_HUMAN CALRETICULIN PRECURSOR from residues 134- 164, Pval= 7.8e-17 (100% identity); comment for location 90098-90190; BLASTX sp|P27797|CRTC_HUMAN CALRETICULIN PRECURSOR from residues 322- 351, Pval= 6.2e-15 (100% identity); comment for location 85249-85339; BLASTX sp|P27797|CRTC_HUMAN CALRETICULIN PRECURSOR from residues 1- 30, Pval= 4.6e-13, (100% identity); user-supplied translation (frame MLLSVPLLLGLLGLAVAEPAVYFKEQFLDGDGWTSRWIESKHKSDFGKFVLSSGKFYG DEEKDKGLQTSQ DARFYALSASFEPFSNKGQTLVVQFTVKHEQNIDCGGGYVKLFPNSLDQTDMHGDSEY NIMFGPDICGPG TKKVHVIFNYKGKNVLINKDIRCKDDEFTHLYTLIVRPDNTYEVKIDNSQVESGSLED DWDFLPPKKIKD PDASKPEDWDERAKIDDPTDSKPEDWDKPEHIPDPDAKKPEDWDEEMDGEWEPPVIQN PEYKGEWKPRQI DNPDYKGTWIHPEIDNPEYSPDPSIYAYDNFGVLGLDLWQVKSGTIFDNFLITNDEAY AEEFGNETWGVT KAAEKQMKDKQDEEQRLKEEEEDKKRKEEEEAEDKEDDEDKDEDEEDEEDKEEDEEED VPGQAKDEL; comment for location 85703-85804; BLASTX sp|P27797|CRTC_HUMAN CALRETICULIN PRECURSOR from residues 31- 64, Pval= 3.8e-18, (100% identity); putative" /codon_start=1 /product="calreticulin" /db_xref="PID:g1905911" /translation="MLLSVPLLLGLLGLAVAEPAVYFKEQFLDGDGWTSRWIESKHKS DFGKFVLSSGKFYGDEEKDKGLQTSQDARFYALSASFEPFSNKGQTLVVQFTVKHEQN IDCGGGYVKLFPNSLDQTDMHGDSEYNIMFGPDICGPGTKKVHVIFNYKGKNVLINKD IRCKDDEFTHLYTLIVRPDNTYEVKIDNSQVESGSLEDDWDFLPPKKIKDPDASKPED WDERAKIDDPTDSKPEDWDKPEHIPDPDAKKPEDWDEEMDGEWEPPVIQNPEYKGEWK PRQIDNPDYKGTWIHPEIDNPEYSPDPSIYAYDNFGVLGLDLWQVKSGTIFDNFLITN DEAYAEEFGNETWGVTKAAEKQMKDKQDEEQRLKEEEEDKKRKEEEEAEDKEDDEDKD EDEEDEEDKEEDEEEDVPGQAKDEL" misc_feature 85249..85356 /gene="CRTC" /note="similarity: sp|P27797|CRTC_HUMAN; CALRETICULIN PRECURSOR (CRP55) (CALREGULIN) (HACBP); similarity: sp|P27797|CRTC_HUMAN; CALRETICULIN PRECURSOR (CRP55) (CALREGULIN) (HACBP); putative" misc_feature 85249..85339 /gene="CRTC" /note="predicted exon, grail2exons_human_1.3; frame=2, forward strand, quality=excellent; putative" misc_feature 85702..85806 /gene="CRTC" /note="similarity: sp|P27797|CRTC_HUMAN; CALRETICULIN PRECURSOR (CRP55) (CALREGULIN) (HACBP); similarity: sp|P27797|CRTC_HUMAN; CALRETICULIN PRECURSOR (CRP55) (CALREGULIN) (HACBP); putative" misc_feature 85703..85904 /gene="CRTC" /note="predicted exon, grail2exons_human_1.3; frame=2, forward strand, quality=excellent; putative" misc_feature 85996..86202 /gene="CRTC" /note="similarity: sp|P27797|CRTC_HUMAN; CALRETICULIN PRECURSOR (CRP55) (CALREGULIN) (HACBP); similarity: sp|P27797|CRTC_HUMAN; CALRETICULIN PRECURSOR (CRP55) (CALREGULIN) (HACBP); putative" misc_feature 85997..86200 /gene="CRTC" /note="predicted exon, grail2exons_human_1.3; frame=2, forward strand, quality=excellent; putative" misc_feature complement(86130..86459) /gene="CRTC" /note="predicted exon, grail2exons_human_1.3; frame=1, reverse strand, quality=good; putative" misc_feature 86621..86716 /gene="CRTC" /note="similarity: sp|P27797|CRTC_HUMAN; CALRETICULIN PRECURSOR (CRP55) (CALREGULIN) (HACBP); similarity: sp|P27797|CRTC_HUMAN; CALRETICULIN PRECURSOR (CRP55) (CALREGULIN) (HACBP); putative" misc_feature 86622..86763 /gene="CRTC" /note="predicted exon, grail2exons_human_1.3; frame=0, forward strand, quality=excellent; putative" misc_feature 86809..87021 /gene="CRTC" /note="similarity: sp|P27797|CRTC_HUMAN; CALRETICULIN PRECURSOR (CRP55) (CALREGULIN) (HACBP); similarity: sp|P27797|CRTC_HUMAN; CALRETICULIN PRECURSOR (CRP55) (CALREGULIN) (HACBP); putative" misc_feature 86812..87027 /gene="CRTC" /note="predicted exon, grail2exons_human_1.3; frame=2, forward strand, quality=excellent; putative" misc_feature 87092..87223 /gene="CRTC" /note="similarity: sp|P27797|CRTC_HUMAN; CALRETICULIN PRECURSOR (CRP55) (CALREGULIN) (HACBP); similarity: sp|P27797|CRTC_HUMAN; CALRETICULIN PRECURSOR (CRP55) (CALREGULIN) (HACBP); putative" misc_feature 87110..87223 /gene="CRTC" /note="predicted exon, grail2exons_human_1.3; frame=0, forward strand, quality=excellent; putative" misc_feature 87310..87468 /gene="CRTC" /note="similarity: sp|P27797|CRTC_HUMAN; CALRETICULIN PRECURSOR (CRP55) (CALREGULIN) (HACBP); similarity: sp|P27797|CRTC_HUMAN; CALRETICULIN PRECURSOR (CRP55) (CALREGULIN) (HACBP); putative" misc_feature 87313..87456 /gene="CRTC" /note="predicted exon, grail2exons_human_1.3; frame=2, forward strand, quality=excellent; putative" repeat_region 87736..87813 /note="repeat match = HSAL00116; putative" /rpt_family="Alu" repeat_region complement(87875..88455) /note="repeat match = HSAL02956; putative" /rpt_family="Alu" repeat_region complement(88375..88481) /note="repeat match = UCAL00006; putative" /rpt_family="Alu" misc_feature complement(88478..88675) /gene="CRTC" /note="predicted exon, grail2exons_human_1.3; frame=2, reverse strand, quality=excellent; putative" repeat_region 88502..88620 /note="repeat match = HSAL17242; putative" /rpt_family="Alu" repeat_region 88527..88620 /note="repeat match = HSAL05033; putative" /rpt_family="Alu" repeat_region 88559..88938 /note="repeat match = HSAL03118; putative" /rpt_family="Alu" misc_feature complement(88802..88930) /gene="CRTC" /note="predicted exon, grail2exons_human_1.3; frame=2, reverse strand, quality=good; putative" misc_feature 88979..89339 /gene="CRTC" /note="ss region (gap in bottom strand); putative" repeat_region complement(89157..89601) /note="repeat match = HSAL02441; putative" /rpt_family="Alu" repeat_region complement(89320..89902) /note="repeat match = HSAL02794; putative" /rpt_family="Alu" repeat_region complement(89794..89941) /note="repeat match = HSAL01295; putative" /rpt_family="Alu" misc_feature 90095..90190 /gene="CRTC" /note="similarity: sp|P27797|CRTC_HUMAN; CALRETICULIN PRECURSOR (CRP55) (CALREGULIN) (HACBP); similarity: gb|I11262|; Sequence 3 from Patent WO 8909273; putative" misc_feature 90098..90190 /gene="CRTC" /note="predicted exon, grail2exons_human_1.3; frame=0, forward strand, quality=excellent; putative" misc_feature 90271..90471 /gene="CRTC" /note="similarity: sp|P27797|CRTC_HUMAN; CALRETICULIN PRECURSOR (CRP55) (CALREGULIN) (HACBP); similarity: gb|I11262|; Sequence 3 from Patent WO 8909273; putative" misc_feature 90274..90470 /gene="CRTC" /note="predicted exon, grail2exons_human_1.3; frame=2, forward strand, quality=excellent; putative" 3'UTR 90475..91043 /gene="CRTC" /note="blast results for location 90475-91043; BLASTN gb|M84739|HUMCALRET Human autoantigen calreticulin mRNA, from 1443- 1931, Pval= 0.0, (99% identity); 3'UTR; BLASTN similarity: 1931, (calreticulin, mRNA,, from:1443-); putative" misc_feature complement(90897..90975) /gene="CRTC" /note="predicted exon, grail2exons_human_1.3; frame=1, reverse strand, quality=excellent; putative" misc_feature complement(91067..91247) /note="predicted exon, grail2exons_human_1.3; frame=0, reverse strand, quality=excellent; putative" misc_feature 91177..91602 /note="ss region (gap in top strand); putative" repeat_region complement(91306..91371) /note="repeat match = HSAL06220; putative" /rpt_family="Alu" misc_feature complement(91385..91470) /note="predicted exon, grail2exons_human_1.3; frame=1, reverse strand, quality=excellent; putative" repeat_region 91795..92051 /note="repeat match = ALU; putative" /rpt_family="Alu" repeat_region 92004..92069 /note="repeat match = HSAL02729; putative" /rpt_family="Alu" gene 92509..99607 /gene="RAD23A" CDS join(92509..92580,94406..94567,94735..94916,95055..95110, 95244..95371,95639..95717,95833..95966,99247..99411, 99494..99607) /gene="RAD23A" /function="human DNA repair protein" /note="comment for location 95244-95371; BLASTX gi|498146 (D21235)HHR23A protein [Homo sapiens], from residues 159- 200, Pval= 9.0e-22, 100% identity; comment for location 94406-94567; BLASTX gi|498146 (D21235)HHR23A protein [Homo sapiens], from residues 25- 78, Pval= 5.4e-30, 100% identity; comment for location 92509-92580; BLASTX gi|498146 (D21235)HHR23A protein [Homo sapiens], from residues 1- 24, Pval= 4.0e-09, 100% identity; comment for location 95055-95110; BLASTX gi|498146 (D21235)HHR23A protein [Homo sapiens], from residues 140- 157, Pval= 0.00092, 100% identity; comment for location 94735-94916; BLASTX gi|498146 (D21235)HHR23A protein [Homo sapiens], from residues 79- 138, Pval= 5.3e-34, 100% identity; user-supplied tramslation (frame MAVTITLKTLQQQTFKIRMEPDETVKVLKEKIEAEKGRDAFPVAGQKLIYAGKILSDD VPIRDYRIDEKN FVVVMVTKTKAGQGTSAPPEASPTAAPESSTSFPPAPTSGMSHPPPAAREDKSPSEES APTTSPESVSGS VPSSGSSGREEDAASTLVTGSEYETMLTEIMSMGYERERVVAALRASYNNPHRAVEYL LTGIPGSPEPEH GSVQESQVSEQPATEAAGENPLEFLRDQPQFQNMRQVIQQNPALLPALLQQLGQENPQ LLQQISRHQEQF IQMLNEPPGELADISDVEGEVGAIGEEAPQMNYIQVTPQEKEAIERLKALGFPESLVI QAYFACEKNENL AANFLLSQNFDDE; comment for location 99494-99607; BLASTX gi|498146 (D21235)HHR23A protein [Homo sapiens], from residues 327- 363, Pval= 4.0e-19, 100% identity; comment for location 95639-95717; BLASTX gi|498146 (D21235)HHR23A protein [Homo sapiens], from residues 201- 226, Pval= 1.2e-10, 100% identity; comment for location 99247-99411; BLASTX gi|498146 (D21235)HHR23A protein [Homo sapiens], from residues 272- 326, Pval= 5.5e-31, 100% identity; comment for location 95833-95966; BLASTX gi|498146 (D21235)HHR23A protein [Homo sapiens], from residues 228- 271, Pval= 2.7e-24, 100% identity; putative" /codon_start=1 /product="human RAD23A homolog" /db_xref="PID:g1905912" /translation="MAVTITLKTLQQQTFKIRMEPDETVKVLKEKIEAEKGRDAFPVA GQKLIYAGKILSDDVPIRDYRIDEKNFVVVMVTKTKAGQGTSAPPEASPTAAPESSTS FPPAPTSGMSHPPPAAREDKSPSEESAPTTSPESVSGSVPSSGSSGREEDAASTLVTG SEYETMLTEIMSMGYERERVVAALRASYNNPHRAVEYLLTGIPGSPEPEHGSVQESQV SEQPATEAAGENPLEFLRDQPQFQNMRQVIQQNPALLPALLQQLGQENPQLLQQISRH QEQFIQMLNEPPGELADISDVEGEVGAIGEEAPQMNYIQVTPQEKEAIERLKALGFPE SLVIQAYFACEKNENLAANFLLSQNFDDE" misc_feature 92509..92627 /gene="RAD23A" /note="predicted exon, grail2exons_human_1.3; frame=2, forward strand, quality=excellent; putative" repeat_region 93500..93604 /note="repeat match = HSAL11403; putative" /rpt_family="Alu" repeat_region 93701..93981 /note="repeat match = HSAL03157; putative" /rpt_family="Alu" repeat_region 93701..93876 /note="repeat match = HSAL02407; putative" /rpt_family="Alu" misc_feature 94126..95135 /gene="RAD23A" /note="ss region (gap in bottom strand); putative" misc_feature 94406..94630 /gene="RAD23A" /note="predicted exon, grail2exons_human_1.3; frame=0, forward strand, quality=excellent; putative" misc_feature 94406..94567 /gene="RAD23A" /note="similarity: pir||S44443; RAD23 protein homolog2 - human gi|498146 (D21235) HHR23A protein; putative" misc_feature complement(94639..94890) /gene="RAD23A" /note="predicted exon, grail2exons_human_1.3; frame=1, reverse strand, quality=good; putative" misc_feature 94723..94917 /gene="RAD23A" /note="similarity: pir||S44443; RAD23 protein homolog2 - human gi|498146 (D21235) HHR23A protein; putative" misc_feature 95047..95109 /gene="RAD23A" /note="similarity: pir||S44443; RAD23 protein homolog2 - human gi|498146 (D21235) HHR23A protein; putative" misc_feature 95072..95110 /gene="RAD23A" /note="predicted exon, grail2exons_human_1.3; frame=2, forward strand, quality=good; putative" misc_feature 95244..95376 /gene="RAD23A" /note="predicted exon, grail2exons_human_1.3; frame=0, forward strand, quality=excellent; putative" misc_feature complement(95413..95710) /gene="RAD23A" /note="predicted exon, grail2exons_human_1.3; frame=0, reverse strand, quality=marginal; putative" misc_feature 95639..95722 /gene="RAD23A" /note="similarity: pir||S44443; RAD23 protein homolog2 - human gi|498146 (D21235) HHR23A protein; putative" misc_feature 95832..95999 /gene="RAD23A" /note="similarity: pir||S44443; RAD23 protein homolog2 - human gi|498146 (D21235) HHR23A protein; putative" misc_feature 95833..95966 /gene="RAD23A" /note="predicted exon, grail2exons_human_1.3; frame=1, forward strand, quality=excellent; putative" misc_feature complement(96064..96238) /gene="RAD23A" /note="predicted exon, grail2exons_human_1.3; frame=0, reverse strand, quality=good; putative" repeat_region complement(96965..97246) /note="repeat match = HSAL06264; putative" /rpt_family="Alu" misc_feature complement(97136..97236) /gene="RAD23A" /note="predicted exon, grail2exons_human_1.3; frame=1, reverse strand, quality=excellent; putative" repeat_region complement(97204..97257) /note="repeat match = HSAL04311; putative" /rpt_family="Alu" repeat_region 97339..97393 /note="repeat match = HSAL01010; putative" /rpt_family="Alu" repeat_region 97353..97638 /note="repeat match = HSAL00657; putative" /rpt_family="Alu" repeat_region complement(97736..97834) /note="repeat match = HSAL02408; putative" /rpt_family="Alu" repeat_region complement(97984..98072) /note="repeat match = HSAL01568; putative" /rpt_family="Alu" repeat_region complement(98009..98283) /note="repeat match = HSAL03283; putative" /rpt_family="Alu" repeat_region complement(98205..98303) /note="repeat match = HSAL07684; putative" /rpt_family="Alu" repeat_region complement(98521..99036) /note="repeat match = HSAL00335; putative" /rpt_family="Alu" repeat_region complement(98824..99045) /note="repeat match = HSAL00518; putative" /rpt_family="Alu" misc_feature 99238..99423 /gene="RAD23A" /note="similarity: pir||S44443; RAD23 protein homolog2 - human gi|498146 (D21235) HHR23A protein; putative" misc_feature 99247..99431 /gene="RAD23A" /note="predicted exon, grail2exons_human_1.3; frame=2, forward strand, quality=excellent; putative" misc_feature 99491..99604 /gene="RAD23A" /note="similarity: pir||S44443; RAD23 protein homolog2 - human gi|498146 (D21235) HHR23A protein; putative" misc_feature 99494..99603 /gene="RAD23A" /note="predicted exon, grail2exons_human_1.3; frame=0, forward strand, quality=excellent; putative" repeat_region complement(101521..101847) /note="repeat match = L1000415; putative" /rpt_family="LINE" repeat_region 102136..102176 /note="repeat match = HSAL01110; putative" /rpt_family="Alu" repeat_region 102145..102415 /note="repeat match = HSAL04985; putative" /rpt_family="Alu" repeat_region 102335..102438 /note="repeat match = HSAL03290; putative" /rpt_family="Alu" repeat_region complement(102823..102878) /note="repeat match = HSAL02475; putative" /rpt_family="Alu" repeat_region 102930..103219 /note="repeat match = ALU; putative" /rpt_family="Alu" repeat_region 103162..103240 /note="repeat match = HSAL05344; putative" /rpt_family="Alu" repeat_region 104014..104120 /note="repeat match = HSAL11194; putative" /rpt_family="Alu" repeat_region 104037..104121 /note="repeat match = ALU; putative" /rpt_family="Alu" repeat_region complement(104124..104220) /note="repeat match = HSAL05038; putative" /rpt_family="Alu" repeat_region 104227..104426 /note="repeat match = HSAL03283; putative" /rpt_family="Alu" repeat_region 104305..104457 /note="repeat match = HSAL05418; putative" /rpt_family="Alu" repeat_region complement(104882..105222) /note="repeat match = L1001315; putative" /rpt_family="LINE" repeat_region 105275..105322 /note="repeat match = HSAL05507; putative" /rpt_family="Alu" repeat_region 105285..105919 /note="repeat match = HSAL00338; putative" /rpt_family="Alu" repeat_region 105919..106102 /note="repeat match = L1001153; putative" /rpt_family="LINE" repeat_region 105937..106083 /note="repeat match = L1MB3; putative" /rpt_family="LINE" repeat_region complement(106163..106206) /note="repeat match = HSAL01142; putative" /rpt_family="Alu" repeat_region complement(106175..106774) /note="repeat match = HSAL01365; putative" /rpt_family="Alu" repeat_region complement(106707..106793) /note="repeat match = HSAL09382; putative" /rpt_family="Alu" repeat_region complement(107081..107309) /note="repeat match = HSAL02871; putative" /rpt_family="Alu" repeat_region complement(107336..107596) /note="repeat match = HSAL07957; putative" /rpt_family="Alu" repeat_region complement(107559..107617) /note="repeat match = HSAL13239; putative" /rpt_family="Alu" repeat_region complement(107693..107996) /note="repeat match = HSAL05084; putative" /rpt_family="Alu" repeat_region complement(107995..108097) /note="repeat match = L1MB3; putative" /rpt_family="LINE" repeat_region complement(108095..108557) /note="repeat match = HSAL06223; putative" /rpt_family="Alu" repeat_region complement(108256..108702) /note="repeat match = HSAL06169; putative" /rpt_family="Alu" repeat_region complement(108632..108721) /note="repeat match = HSAL02784; putative" /rpt_family="Alu" repeat_region complement(108828..109225) /note="repeat match = HSAL02663; putative" /rpt_family="Alu" repeat_region complement(108907..109168) /note="repeat match = HSAL04399; putative" /rpt_family="Alu" repeat_region complement(108962..109216) /note="repeat match = HSAL06565; putative" /rpt_family="Alu" repeat_region complement(109266..109314) /note="repeat match = HSAL05838; putative" /rpt_family="Alu" repeat_region complement(109280..109411) /note="repeat match = HSAL01671; putative" /rpt_family="Alu" repeat_region 109434..109634 /note="repeat match = HSAL06178; putative" /rpt_family="Alu" repeat_region complement(109643..109700) /note="repeat match = L1MA10; putative" /rpt_family="LINE" repeat_region 109746..110031 /note="repeat match = HSAL02084; putative" /rpt_family="Alu" repeat_region 109834..110082 /note="repeat match = HSAL13218; putative" /rpt_family="Alu" repeat_region 110035..110096 /note="repeat match = HSAL00396; putative" /rpt_family="Alu" BASE COUNT 25112 a 29361 c 28188 g 27435 t ORIGIN 1 gatcctatga gcccaggaag tcaaggcttc agtaagccat gatcacacca ctgcactcca 61 gcctgggcaa cagggtgaga ccctgtcttg aaaaatgaaa ttaaaaataa taaagaatag 121 gccaggtgca atggcttatg catgtaatcc tagtactttg ggaggccaag atgggtggat 181 cacttgaggt caggaattcg aaaccagcct ggccaacatg gtgaaacccc atttccacta 241 aaaatacaaa aaaattagtt gggcgtggtg gcacatgcct gtaatcccag ctacttggaa 301 ggctgaggca ggagaattgc ttgaacccgg gaggtggagg ttggtgtgag ccaagatccc 361 accactgcac tccagcctgg gcaacagagc gagactctgt ctcaaaataa taataataat 421 aataataata ataataatga ttaatagtaa taaataaaca attctgcagc atttagtaat 481 tcaccatgct gtacagccat tagatctatg tagttgcaga acttttttat cacccccaaa 541 gggaaacctc acacccatta cacactccct cctcattcct ccctccctgc agtccctggc 601 aaccactaat ctattttctc tctctgtgga attgcttttt ctggatattt cttttttttt 661 ttttttttga gacagaattt cactcttgtt gcccaggctg gagtgcaatg gcgtgatctc 721 ggctcactgc aacctccgcc tcccaggttc aagcaattct cctccctcag cctcctgagt 781 agctgggatt ataggcatgc gccaccacac ctggctaatt ttgtattttt agtaaagaca 841 gggtttcacc atgttggtca ggctggtctc gaactcctga cctcaggtga tctgcctgcc 901 ttggcctccc aaagtgctgc gattacaggc gtgagccaca tgcctggcct ttttctggat 961 atttcctata aatgggatcg tatgctctgt ggcctcttat gactggctta tttcacttaa 1021 cataacgttt ctgaggttta tccatgttgt agcttgtgtc actgcttcat tcatttttat 1081 agccgagtaa tattccattg aatggtagat gcattcatcc actggtgaac attgggttgt 1141 ccatctttca tctgttgtaa atagtgctct tatgaatgtg ggtgtacaaa tatttggctg 1201 gacaactgct ttgagttatt taggatatac gtctagaggt ggaatttctt ggtcatatgg 1261 tagctctatg tttaacttat agaggaatcc tgcatgggct ttcgagcaag ctaactggct 1321 gtatgtgtta atggcagctc agcatgggga aaattgccaa ttctattccc aggccagcca 1381 cactttggca gatgttccag atgtctagtg gagagtccta ccccaatggc cacacatgtg 1441 ttttctctga actgcctagt atttttcaaa aatttgggct gggcctgatg gctcacggct 1501 gtaatcccag cactttgaga gactgcggcg ggtggatcac ttgaggtcag gaagtcgaga 1561 ccagcctggc caacatggtg aaaccctgtc tctactaaaa aaatacataa aattagccag 1621 gtgtggtggt gcccacccgt aatcccagct acttgggagg ctgaggcagg agaactgctg 1681 gaaccaggga gacaaagatt gcagtgagcc gatatcaaac cactgcactc cagcctgggt 1741 gagagagtga gactccatct tgaaaaaaat aaaaatagat aaataaaaga agcaaaatat 1801 aaaaattgaa agatttttgc ttcaaaatct gggtttgggc cttctcttga aaaatcagaa 1861 gttctagggc tgggtgtagt ggctcatgcc tgtagtccca gcactttggg aagccaaggt 1921 ggcaggattg cttgagccca ggagtttgaa accagtctgg acaacatagc aagaccctgt 1981 ctctacttaa aaaaaaatag aagaagaaga agttctaata atactgtggt ggaaccagct 2041 gagtgggggt ggtcctgtga tattgggtac ctgccccttc gtcatccatg gcctccaccc 2101 tggtctactg gcccctgaag acgcttaagt ttgtaatctt tatttcaaat tgtgtctata 2161 cagtcacaca gatatataca catacctttc ctttatgtat atatttaaaa ggaatctcac 2221 aatatatgcc atttgtttaa ttttttgaat aggtaatgga atgacatcac tcaaaatcta 2281 aaactctaaa aaaaggtatt tatctctcat tgtttgtagg attctgttat tccttttaat 2341 caaacttgtt aattgagtga ttcaaatctt ccatctccct attcatttat cttgcctgat 2401 taatctaata gtttctgaga gaaatgtgaa accttatttt aagcaaattc tccttgtaat 2461 tctgtcaatt tttttttttt ttttgccatt ttggttttta tttgattata ggtaagagtt 2521 ccatacaata gtgtgcgaag tattgcaaac aataaaggaa aaatacagca taagatacct 2581 cttacactct gcccaattct gtcaattttt atgtcagata tttttaggct acattctttg 2641 gagcatataa gttcaatatt tttatatatc atagatcttt ttatgtaata tctatgtaat 2701 aaatctcttt gtccctaaga aaaatcaaaa taattttcaa gaaagatata cagtgaaaaa 2761 aatcttactc ctaccttttt tccccattgc cctatagccc tgaaatatga ttacatagtt 2821 ttattttgcc actttttttt ttttttttag acaaagttcc actcttgttg cccaggctgg 2881 agtgcaacag tgtgatctcg gctcactgca acctctgctt cctgggttca agtggttctc 2941 ctgcttcagc cttccaagca gctgggatta taggcaacca ccaccacgcc tggctaagtt 3001 tttgtatttt taatagagac agggtttcac catgttggcc aggctggtct cgaactcctg 3061 accttcaggt gatccacttg ccttggcctc ccaaagtgct gggattgcag gtgtgagcca 3121 ccacacctgg cctttgccac atttttttgc acataaacgt acaatacata ttgcctattg 3181 catttttaca cttaataaaa tagaggactt tctaaatcca tagacagaga gtttcctcat 3241 tatttctaac agtggcacaa tgtttcttta tctaactacc ccacatgtgt ttgtttggct 3301 aatttaacgt atggattatt atttccaatc tcacaatgag caagatagta gatataccat 3361 ttttcacatg tgcaagtata tctgtagcat aaattcaaaa gtaaaattgc tgagtcaaag 3421 ggaacatgta tttgtaattc tgatctgtat caaacagtaa tttacactcc caacagcaaa 3481 gtatatatgt tcacctacat atttgcccat agaggatatc aggttttgga gttttttctc 3541 ccaatctgct aggtgaaaaa tggtattttg tggtagttat aatttgcctt tctgttattg 3601 taagtgagat tgagaatttt ttttttttga gatggagttt tgctcttgtt gcccaggctg 3661 gagtgcaatg ccacgatctc ggctcaccac aacctctgcc tcctgggttc aagcgattct 3721 cctgcctcag cctcctgagt agctgggatt acaggcatgt gccatcatgc ccagctgatt 3781 ttgtattttt agtatagatg gggtttctcc atgttggcca ggcttgtctc gagctcctga 3841 cctcaggtga tccaccagcc tcagcctccc aggtgatccg cccgccttgg cctcccaaag 3901 tgctggggtt acaagcgtaa gccactgcac ctggccaaga atcttttttt tttttttgag 3961 acagagtctc gctctgtcac ccaggctgaa gtgcagtggc atgatctctg ctcactgcaa 4021 gctctgcctc ccgggttcac gccattctcc tgcctcagcc tcccaagtag ctgggactac 4081 atgtgcccgc caccatgcct ggctaattgt ttcgtatttt cagtagagat ggggtttcac 4141 agtgttagcc aggatggtct cgatctcctg accgcatgat ccgcccacct tggccttcca 4201 aagtgctgtg attacaggcg tgagccaccg ctccaggccc gaatctttta gtatgtttaa 4261 gagctgtaac tctggctggg tacagtggct catgcctgta gtccctactc tttgggaggc 4321 caaggcaggc agattgcttc agcctaggag ttcaagagca gcctgggcaa gatggcaaaa 4381 ccaaaatact aaaattagcc tggcatggtg gcatgtgcct gtggtcccag ctacctggga 4441 ggctgaggca ggagaatcac ttgagcccta gaggtggagg ctgcagtgag caccactaca 4501 ctccagtctg aatatcagag tgagaaccta tctcaaaaaa aaaaaaaaaa aaaaaaaaaa 4561 aaaaagctct aactgtaatt tacttacctc acccactttt ttctcttcaa ttgttaggct 4621 tttgttattg atttttggga actctttata tatttatata ttaggatgac tattttgttg 4681 gtttcctgct atcttttcta gtggaagaag atgagaatat gtcttcagag tcagtcttgg 4741 gttcaagtcc taactttctc tttcattatg gcacttggat gaaggactct ttcattctgt 4801 gcctcagttt ccccatgtgt aaaatgaaat taaaagcatc ctgctctcat gatgatgatg 4861 gtggtgtggt ctccatcttt ttcctgaagg gccgcagcag caaggccaag aaaccgccgg 4921 gggagaatga cttcgatacc atcaagctca taagcaacgg tgcctacggg tgagccaccc 4981 ggggctctgg cggggggagg gtggcggagg ccgggtgtct cggaggtgac ggccggtcct 5041 cgctctctcc ccctgcagcg ctgtctacct ggtgcggcac cgcgacacgc ggcagcgctt 5101 tgccatgaaa aagatcaaca agcagaactt gatcctccgc aaccagatcc agcaggcctt 5161 tgtggagcgc gatatcctca ccttcgccga gaacccgttt gtggtcggca tgttctgctc 5221 ctttgagact cggcgccacc tctgcatggt catggaatat gtggaaggtg tggctgcctg 5281 cggggctgca gggaagatgg ggccgtgctc actggtcagg gctgcggggt ggcctgcctg 5341 agccgcagtc tccatattga ttagtcggcc tgtatgttta tccatctatt tatttctcca 5401 gtcattgatt tgtccatcca ttctacctat atatttattc agtcatctat tggtttattt 5461 gtttttcatt tcattcagca taaaaacatt cagtctgacc cattatccat ccatctatcc 5521 ttctatctac tgatgtgcta atccatctgt ttttccatcc atacatccat gaacctgttt 5581 ttatttattt atttatttat tcatttattc atttatttat tttttgagac agagtcttgc 5641 tctgtcaccc aggctgcagt gcagtggtgc aatctcggtt cactgcaacc tccgactccc 5701 aggctcaagc gattcttatg cctcagcctc ccgagtagct gggattatgg gcctgtgcca 5761 ccattcatgg gtaatttttc tatttttagc agagacagga tttcgccatg tttgccaggc 5821 tagtcttgaa ctcctgacct taagtgatct gcctgcctgg gcctcccaaa gtgctgggat 5881 tacaggcatg agccactgtg ccaggcctat tttattttat ttatttattt ttgagacaag 5941 atctcactct gttgctcagg ctggaatgta gtggtgcgat cacagctcac tgcagcctca 6001 aatgcctggg ctcaatcaat tctcccacct cagcctccca aatacctggg attataggtg 6061 cggaccacca cactcggcta atttttggga ggccaaggtg gggagatcac aaggtcaggg 6121 gttcaagacc agcctggcca acatggtgaa accctgactc taccaaaaat acaaaaatta 6181 gccagatggc acactcctat aatcccagct actcacgtgg ctgaagtggg aggattgttt 6241 aaacacagga agcagaggtt gcagtaagcc gagatcgtgc tgctgcactc caggctaggc 6301 tgagatcatg ccattgcact ccagccctgg caactgagca agaccctgtc ccgcaaaaaa 6361 aaaaaaaaaa aagttttttg ttttttgttt ttttttgttc gtttgttttg ttttttatgc 6421 agtctccctg tattgcccaa gttagtctcc aactcctggg ctcaagcaac cgtcctgcct 6481 cagccttcca aagtcctggg attacaggca cgagccaccg agcccatccc atatacctgt 6541 ttattaatcc acctttccat ccatccctca catcccgcaa tttgtctttg tggttgtcca 6601 tctgtttatc catccgtatg tctgtttacc attatacacc tatttttctt ttttttgggg 6661 gggacggcgt cttgctctgt ctcccaggct ggagtgcagt ggcgcaatct cagctcactg 6721 caagctccac ctcctaggtt cacaccattc tcctgcctca gcctcctgag tagctgggat 6781 tacaggtgcc cgccaccatg cccgactaat tttttttgta tttttagtag agacggggtt 6841 tcaccgtgtt ggccaggatg gtctcgatct cttgacctca tgatccgccc acctcggcct 6901 cccaaagtgc tgtgattaca ggcgtgaacc accacacccg gcttatacac ctatttttcc 6961 tttttctttc tttctttttt tttttttttt tgagacagag tcttgctctg tcaccaggct 7021 cgaatgcagt ggcgctatct cggctcactg taacctccac cttccaggtt caagtgattc 7081 tcctgcctga gcctcccgag tagctgggat tacaggtgtg tgccaccaca cctggctaat 7141 tttttgtatt tttagtagaa atagggtttt caccatgttg gccaggctgg tctcgaactc 7201 ctgacctcag gtgatcctcc cacctcagcc tcccaaagtg ctgggattac aggcgtgagc 7261 catagcttgc ggcccatata ttgtttttaa tccatccgtg catgtattca cccaaatgtt 7321 aatccatcta ttgtccaaaa ccacacatgc tgtggctatt atgacacata catgatccct 7381 ggggcctcag ctgtgcctcc tccaagcctt ccagtcaacc atccagtcac tcaacgaaca 7441 cctcctcatc atctccctcg tgtcacatgc tgtgctaggt gccagtgaca cagaggagac 7501 ccaggcacag acatctcccc gccatggaga ttccattctg gtctgtatag acagacaata 7561 aacaagcaga gcaagccaaa gagataagga catttattta tttatttatt tatggagtct 7621 cgctctttca cccaggctgg agtgccgtgg cgcaatcttg gtgcacagca acctccatct 7681 cccaggctca aggattctca tgcctcagcc tcctgagtag ctgggactaa aggcgcgcct 7741 ccaccatgcc cagctgattt ttgtattttt agtagagaca ggggtttcac caggttggcc 7801 aggctggtct tgaactcctg acctcaggtg atcagcccac cttggcctcc caaagtgctg 7861 ggattacagg catgagccac catgcccagc cagggcattt catagtgagg acatgaggtc 7921 catgaggctg gaggaggtga tggggaggca ggcagggagc ttctcagagg cctggattat 7981 gatggaataa gtgacctgct agtacaaagt ctcttaggtg acaatttaga tttggaggag 8041 tggcaagcag gcccatggag ccgaaggcaa gtgagcaagg gaggggagat gaggatgaag 8101 gggcgctgat gaggagggta ttccagagac tggaatggca ttggatttca ttctgaatgc 8161 aacagaaggt cactggagca ttgttgtttt tttctttttc tttttctctc tttctttctt 8221 tctttctttc tttctttctt tctttctttc ttttttgaga tagagtctcg ttctgttgcc 8281 caggctggag tgcagtggcg catcctcggc tcactgcaac ctccgcctcc cgggttcaag 8341 cgattctcct gcctcaacct cctgagtagc tgggattaca ggtgcccgcc accacgccca 8401 cctaatttct gtatttttag tagagatggg gtttcatcat gttggtcagg ctggtctcaa 8461 actccacacc tcatgatctg tctgcctcgg cctcccaaag tgctgggatt acaggagtga 8521 gccaccgtgc ctgccttttt tttttttttt ttttgaaatg gagtctcact cacgctgtca 8581 cccaggctag agtgcaatgg ggtgatctcg gattactgca gcctccgtct ccaaggttca 8641 agtgattctc ctgcctcagt ctcccgagta gctaggatta caggcgtgtg ccaccacatc 8701 tggcaaattt ttgtactttt agtagagacg gattttcacc atgttggcca ggctggtctc 8761 gaattcctag ccttaaatga tccatcagtc tcggcctccc gaagtatggg gattacaggc 8821 gtgaatcact gcgcccagcc ttgttttgtt ttatttttag tgcattgtcc aggctggagt 8881 gcagtggtgt gatcacagct tactgcagcc ttgaactcct gggctcaagc aatcctccca 8941 cctcagcctc caaaagtagc tgaaactaca ggcaggcacc accatgtcca gctaatggag 9001 ggttttaaga agatttgata tgatcagatt tatgttttaa gaaaatccat ttggaggccc 9061 attatagtag cttacgactg taatcccagc acttacagtg ggagaccaag gcaggcagaa 9121 cacttgtggt caagagtcca agatcaacct acccagcatg gtgaaaccct gttgttgttg 9181 tttttttctt ttcctttttt tttttttttt tgagacagag tttcgctctg gctaaactcg 9241 ttgcccaggc tagagtgcaa tgacgtgatc ttggctcacc gcaacctcct cctcccgggt 9301 tcaagcgatt ctcctgcctc agcttcttga gtagctggaa ttataggtat gcaccaccac 9361 acctggctat ttttgtattt ttagtagaga tggggtttct ccatgtcagc caggctggtc 9421 tcgaactccc aacctcaggt gatgtgcctg cctcggcttc ccaaagtgtt gggattacag 9481 gcgtgagcca ctgtgcccgg ccaatgaaac cctgtttcta ctaaaattac aaaaattagc 9541 caggagtggc ggtgcacgcc tatcatccca gctactcggg aggctgagga acaagaatct 9601 cttgaacccg ggggcagtga gtgggcagag gttgcagtga gccgagatgg agccactgca 9661 ctctagcctg ggtgacagag ccagactctg actcaaaaaa aaaaaaaaaa aaaaaaagag 9721 aaaaagaaaa aaattaagaa aagataatcc atttggctct tgtggagaga gcagaatcag 9781 gagactagga ggtagatgac tgctctagtc caggttcatt gcagaggagg aggagagaag 9841 ggggccaaat cagtatctct atggtgggtc tgtgtgcaca atagatgtgc tcaaatgcag 9901 attcctgggc cccaccctca gagagtttct gaaaccagca aaatctcgca gaaaagagaa 9961 cagaaaaaag aaagaaatca ttagctttct tggtcattgg tgattgaaaa accaatatta 10021 ccttcaggaa tggctggatg taggtgctca aaccattcca tcaggaaact ctccatctct 10081 tggctctgta ctcccacatg tgggcttcct tctagacaag ccctcatgta gcaaggttgc 10141 catctagcag atctcagctt acattctatg agcttagcaa ttcttacagg aaaagagtga 10201 aaattccaag ggctaactca ctggagcaac ttgggttaca tactgccccg gttgctgcac 10261 atggaaggaa atctagcctt tgtaggataa aaaggaaata gatattgaga aactgaaata 10321 ccaatgccaa ttacagtaaa tctggcagag gggcagaggc agtggccaac actttgggaa 10381 gctgaggcag gaggatcctt tgaacccaag agttggagac cagcttgggc aacacaatga 10441 aactcgtttc tacaaaaaac tttttttttt ttttttttga gacggagtct tgctctgtca 10501 cccaggctgg actgcagtgg tgcgatcttg gatctctgca acctctgcct cccaggttca 10561 agccattctt ctgccttagc ctcccgagta gctgggatta cagccgtctg ccaacaagcc 10621 aggctaactt tagtattctt ttttcagaga tggggtttca ccatgttgac caggctggtc 10681 tcgaactcct gacctcaagt gatccacccg ccttggcttc ccaaagtgct gggatttcag 10741 gcgtgagcta ccatgcccag ccttgaacaa ttttttaaaa atcagctggg catggtggct 10801 cacatctgta gtcccagcta ctcaggaggc tgaagcagga ggattgcttg agctgggagg 10861 ttgaggatgc agtgagctgt gatcttgcca ccgcactcta gactgggcaa tggagtaaga 10921 ccctgtctca aaaaaaaaaa aaaaaaatct agtgaagacc aagaatctgc atctaaggct 10981 tccagctcca aggttcagac accaggagat agagggctgt gcttccagat gcagttgata 11041 agtgggtagc ctttgttccc caactaaacc tcacctctac caacaactgc ctgcatgacc 11101 catcctttct ttgagtctcc atccctttgt gtataagatg agctgatgac accaactttg 11161 gaggatggct aagaggacaa gaagaaccca gtagaggacc tggagggagg gaggcccctc 11221 tcagagcctc agtttccctt atctataaaa cgggtacaac ataacatgag aagttggtgc 11281 tatggtgcat gtccagagag aggaatgcct ccctttttat gcgggcccat ttcctggcct 11341 gcaggcggcg actgtgccac cctgctgaag aatattggag cgctgcccgt agagatggcc 11401 cgcatgtact ttgctgagac ggtgctagcc ctggagtatt tgcacaacta tggcatcgtg 11461 caccgcgacc tcaagcctga caagtgagct ttgatctttc ccatcactcc ctctgtccct 11521 cgggaggcca ggaggcagaa tggacgggcc tcatccctga gatccccacc tgtgcctaca 11581 gcctccttat cacctccatg ggtcacatca agctcacaga tttcggcctc tccaagatgg 11641 ggctcatgag cctcaccacc aacttatatg aaggccacat cgagaaggac gcccgagagt 11701 tcctggacaa acaggtgtgt gtgcgggcat gggggtcgct gagggtggag tgaccctgca 11761 ggacctcggg aacccagggc ctggtggggg gcacagctct cccttgaggg ccctcctctg 11821 gctggggcgt gggctgacag cctgccccca ggtgtgtggg accccagagt acatcgcgcc 11881 cgaggtcatc ctgcgtcaag gctacggcaa gccagtggac tggtgggcta tggggatcat 11941 cctctacgag ttcctggtgg gctgtgtgcc cttcttcgga gacacaccag aggagctatt 12001 tggacaggtc atcagtggta cgtggcttgg cagtgtacag gggcagagtg tggtgtgcac 12061 ggagagatgg acaggctcag ggttccaggg atttcaaaag cgacccccca gaggatcgct 12121 tgcactcagg aggtcaaggc tgcagtgagc catgatcgtg ccactgcact ccagctgggt 12181 gacacagtga gatcctgtgt ccaaacaaca acaacaacaa aaaccgcccc taagttccgt 12241 tttgttttgc agatgacatc ctgtggcccg agggggatga ggccctacct acggaggccc 12301 aactcctcat atccagcctc ctgcagacca accctctggt caggcttggg gcaggtaagt 12361 ccgccatgca gaggagctga gggcccactg atagagagca ggcctccaaa accccaggcc 12421 cagcctgtgc tgtggccccg gggcggaaga catggggggc ggggctgggc tgctgggttg 12481 gccatcagct gtggctggaa tcccttccgt cccaggcggc gcttttgagg tgaagcagca 12541 cagtttcttt cgagacctgg actggacagg gctgctgagg cagaaggccg agttcatccc 12601 ccacctagag tcggaagatg acactagcta ctttgacagt gagctgggac accaggcacg 12661 acctgggtcg aggggtggga tccgggaaga gggaccctgg ggtaagtaaa gcctgggata 12721 gggcctggct aagtaccaag atgtgatgcc taggtgcgaa ggtggtattt tggtgggggc 12781 ggggccaagt ggggcggggc tgacatacag gcggggctca ggctgaaatg taggtaagaa 12841 cctgagattg ggctaggtgt ggggcctggc ctgggtgagt tgtgagtgtg ggcctggaac 12901 tgaactagag agaggtacta gcgctcagaa tggcctgggg tgatgccagg gtgcagagca 12961 ctgtaggtgg agcgtggcct ctttggaggc ggggctagac ggtgccactg agaatggggc 13021 aaggaccttg cctgggagcg gagcctaaat taaatacact aatgaggcag gggcgtggcc 13081 tgacctgaag acttgtaaac ccgtcttgaa ccaggactgg gctcctgtgg ggatgtgata 13141 tgaggaggaa ccccgtaccc tcagtcacag cccatacccg ctccccagcc cgctcagaca 13201 ggtatcacca cgtgaactcc tatgacgagg atgacacgac ggaggaggag cccgtggaaa 13261 tccgccagtt ctcttcctgc tctccgcgct tcagcaaggt gggccaagtc tgggtgtggg 13321 acagggcgag accccaggag ggatggggct tggagagaca gtgagaaaca ggttccctgg 13381 tgcccaaggt ctcaggagcg ggaagttatt gatggggcgg gagtctggaa ggtggtaagg 13441 ccacgcaaga ggcaggttcg ggagtctatg ggacgggcct ttggcactga gtggaatcta 13501 acaggaatca ggacagttgt gcagattgag gccatggtgg ggcggggcta ggtgtgggtg 13561 gggcggggtc aggacgtggg aatgaggcca gcagcgaagg cggagtaaag cccagcgaag 13621 tcttgctttt atttatttat ttttattttt tctaataacg ggcagagaag ccataggcct 13681 tgctttgaga agcaacttgt agaggcgtgc agggcctagc ctgggctaag gagacataaa 13741 ttaagggggt tgctgggaag atggggctgt gtagagagca gattggtcag ggcatcgtcg 13801 gggcagagcc tagaataaaa cgcagtgcct aactttgtgg tgtggggcct gaactgaaga 13861 attgtaggtt gggggatcat gcacaattgg gcggagcagg cctcggaggg tggagtgcgt 13921 tttgcggggg atccactgcc aggaagctga ttagctccgg agtgaaagtg gaacccgggc 13981 tggacttgcc tcccaccacc acctacaggt gtatagcagc atggagcagc tgtcgcagca 14041 cgagcccaag accccagtag cagctgcagg gagcagcaag cgggagccga gcaccaaggg 14101 ccccgaggag aaggtggccg gcaagcggga ggggctgggc ggcctgaccc tgcgtgagaa 14161 gacctggaga gggggctctc cggagatgtg agcaggggaa tggcggagtt tgggggcggg 14221 gtcgaagggg gcgtgtcttc cataaccacg ccccctccat gcagcaagcg attctccgcg 14281 tccgaggcca gtttcctgga gggagaggcc agtccccctt tgggcgcccg ccgccgtttc 14341 tcggcgctgc tggagcccag ccgcttcagc gccccccaag aggacgagga tgaggcccgg 14401 ctgcgcaggc ctccccggcc cagctccgac cccgcgggat ccctggatgc acgggccccc 14461 aaagaggaga ctcaagggga aggcacctcc agcgccgggg actccgaggc cagtgagtgc 14521 cctatcgtgc tgccttcccc aatcttcccc aatgtcctac tggtcatata gtgagcatcc 14581 cacgagcctg gtgctgttta tgaaagatct caggtcctat tcacattgca atttgggatt 14641 tttttttttt tttttttttt tgagacagag tctcactcca tcacccagga tggagtgcag 14701 tggcgtgatc tcggctcact gcaacctcca cctcccaagt tcaagtggtt ctcgcacctc 14761 agcctcccga gtagctggga ttacaggcgc gcgccactac ccccggctaa tttttgtatt 14821 tttagtagag acagggtttc ttcatgttgg caaggctggt ctcgaactcc tgacctcaag 14881 tgatctgcct tccttggcct accaaagtgc tggggttaca gtcgtgagcc accacaccca 14941 gcccaatttg ggattcttaa cattagaaac accccaatac catggtcagt cagtgataca 15001 gaccgataca gactatgtct ctatatgagg agcagtgagg catgcaggtt acctgctcag 15061 atgaaactca ccatccatca agggtaaaaa gtggaaagag cattccaggc agggaaacag 15121 tatgtgggaa atccctggcc ttggactaat tcctaacaca cttgctttct gttgcagctg 15181 accgtccacg cccaggtgac ctctgcccac cctcgaagga tggggatgca tcaggcccaa 15241 gggctaccaa tgacttggtt ctgcgccggg cgcggcacca gcagatgtca ggggatgtgg 15301 cagtagagaa gaggccttct cgaactgggg gcaaagtcat caaatcagcc tcagccactg 15361 ccttatctgt catgattcct gcaggtaatg ctgggcccca cctggcaggg gaggggctgc 15421 cccccatttg aggcaggaca gaccaatgaa aatgctgcat tttccctgtc caccgtgatt 15481 ggctccaggc tggaccaatg agaggctgct ctgccactgc cttgggagga gggaggagca 15541 gatacaggga gatgtctatc ttcttggttc tgatttctga ccatggttgt gtgtctgtag 15601 tggacccaca tggaagttca ccccttgcta gtcccatgtc tccacgatct ctgtcctcca 15661 acccatcctc acgggactcc tcacccagcc gggactactc accagctgtc agtgggctcc 15721 gctcccccat caccatccag cgctcgggca agaagtatgg cttcacactg cgtgccatcc 15781 gtgtctacat gggtgacacg gatgtctata gtgtccacca cattgtctgg gtgagtactc 15841 atgggtggag tctccatcac agagtggagg gtggtgggga aaaggcccct ccagctcaaa 15901 ccagttagcc tgggtgagac acctctgtga ccctctatgc ctcttccctc tctaaacctg 15961 tttctttttt cttttacttt tttattttag gcggagactc actgtgtcgc caggccggag 16021 tgcagtggca ggatctcggc tcactgcaac ctctgcctcc tgagttcaag ccattctcct 16081 gcctcaacct tctgagtagc tgggactaca gacaggcgcg cgccaccacg cccagctaat 16141 ttttgtattt ttagtagaga cggggtttca ccatgttggc taggatggtc tcaatctctt 16201 gacctcgtga ttcacccacc tcggcctctc aaactgctgg gattacaggc gtgagccacg 16261 gtgcctggcc gagactgttt cttcatctgc aaaatgggga gaataggcca ggcgcagtgg 16321 ctcaggcctg taatcccagc actttgggag gctgaggtgg gcagatcact ggaggtcagg 16381 agtttgagac tagcctggcc aatatggtga aaccctgtct ctattaaaaa tacaaaaatt 16441 gggccgagcg cggtggctgt aatcccggca ctttgggagg ccgaggaggg tggatcatga 16501 ggtcaggaga tcgagaccat cctggctaac atggtgaaac cccctctcta ctaaaaatac 16561 aaaaaattag ctgggcatgg tggtgggcgc ctgtagtccc agctacttgg gaggctgagg 16621 caggagaatc gcttgaaccc gggaggcgga gcttgcagtg agccgagatt gcgccactgc 16681 actttagcct gggcgacaga gcaagacacc atctcaaaaa aaaaaaaaaa aaaaaaaaaa 16741 atggccaggc gcggtggctc acgcctgtaa tcccggcact ttgggaggcc gaggtgggtg 16801 gatcacgagg ttaggagttt gagatcagcc tgaccaacat ggtgaaaccc catctctact 16861 aaaaatacaa aaaaaattag ccaggcgtgg tggcagatgc ctgtaatccc agctactcag 16921 gaggctgaga caggataatc gcgtgaaccc gggaggcgga gcttgcagtg agccgacatc 16981 gcgccactgc actccagcct gggtgacaga gcgagagact ccgtctcaaa aaaaaaaaaa 17041 aaaaaaatta gccggaggta gtggcgcatg cctataatcc cagctacttg ggaggctgag 17101 ggaggagaat cgcttgaacc tgggaggcgg gtgttgttgc agtgagccag gattgcgcca 17161 cagcactcca gcctgggtga cagagtgaga ctccgtccaa aaaaaaaaaa aaaatgtggg 17221 gggaggaata ataactattt cacaaggtgt tatgaggatt ctaatgcatg cagagctaag 17281 tgtcctgcat ataagtttag gagctttcct tgttaccaat cccactaacc ctgtccctat 17341 ggggtgctct tttcccagca tgtggaggaa ggaggcccag cccaggaggc aggactctgt 17401 gctggggacc tcatcaccca cgtgaatggg gagcctgtgc atggcatggt gcatcctgag 17461 gtcgtggagc tgatccttaa ggtgagtgca gggaaggagg caccctgggc ggagggtggg 17521 ggaggcctga gcagccccta gcagagcatt ttcccgcatt cttcccccag agtggcaaca 17581 aggtagcagt gaccacaacg cccttcgaaa atacctctat ccgcattggt cccgcaaggc 17641 gcagcagcta caaggctaaa atggctcgga ggaacaagcg accctccgcc aaggagggcc 17701 aggagaggtg ggcacagccg taaacagcct ggtctttgag cagtgggtgg aacttaggcg 17761 ggaggggcac agatgaggat ggagaaggga agagcacggg aaaggtggcg ggaacaaatg 17821 accaacaagc aaagggaaga ggacaattaa gaggggctgc gggctgggcg cggtggctca 17881 cgcctgtaac cccagcactt tgggaggccg aggcgggcag atcacgaggt caggagtttg 17941 agaccagcct ggccaatatg gtgaaacccc atctctacta aaaatacaaa aattagcctg 18001 gcatggtggc gctcgcctgc agtcccagct acttaggaag ctgaggcaga agaatccctt 18061 gaacccggga ggtggatgtt gcagtgagcc aagattgtgc cactgcactc caacctgggc 18121 aacagagtga gactccatct caaacaaaca aacaaacaaa caaacaaaaa aagaggggct 18181 gggatgagtc acacctgtaa tcacagcact ttgggaggct gaggcagaag gatcacttga 18241 gcccaggagt caagattagg ggcaacatag agagacccca tctctacaaa aaacttaaaa 18301 aactagttgg gtttggtggc acacacctgt ggtctcagct actcggaggc tgaagtggga 18361 ggattgcttg agcctgggag gtcgaggctg cagtgagcta tgatctcacc actgcagtcc 18421 agctgtgaca acagagctag actctttcaa aaaaaaaaaa aaaaaaaaaa aagaggggct 18481 gtaaaagtgg aaggagccaa gaatggaagc tgtgccaaat ttctaggatc agagaaaagc 18541 attgtaggag ggtcttgcac atataggcag gcccaggaga tgtgaactgg aaagggagat 18601 gtccagagac ggagggggtc caggagatgg tcctatcctg ttagatgtgt ttagacagaa 18661 ctgggaaagg ctagaataag aaatgtctat ctggaatgag catgccaggc aaggttcagc 18721 caactgaaat cctattttcc cttatagata ggaggagcta ggtgggaggt gggtggggcc 18781 agaagggtat tctgtttgtt tgtttgtttt tagtattctg ttttgatcag tgagtttggc 18841 ccagacaagt acagggattt gggtgagagg agccagacgg ggttgagata agtacatttc 18901 ttcccactct tataggtggg ttgtagtgga agaggcggga ctagaggcag gtgggaccaa 18961 gttggttttg tttggatgtg tttctagcct ggactgggag tggccaaaag gttgggcagg 19021 acctgaagtg ggaggaggga ggagccaagc agcactgagt taagggaagt tctgtcttga 19081 taggtgggtt gtagtgaaag acggagggac tagaggggtg ggcaggacta gaagaggttg 19141 tttttggccg ggcgaggtgg ttcacacctg taatcccagc actttgggag gccgaggtgg 19201 gtggatcacc tgaggtcagg agtttgagac cagcctgacc aacatggtga aaccccatct 19261 ctactaaaaa tacaaaaatt agctgggagt ggtggctcac tcctgtagtc ccacctactc 19321 aggaggctac tcacgaagct gaagcacggt aatcgcttga acccgggagg tagaggttgc 19381 agtgagccga gatcgtgcca ttgcactcca gcctgggcga caagagcgaa acttcgtctc 19441 aaaaaaaaaa aaaaaaatgg ttttgtttgg atgagtgttt gtagccagaa ttgggagtgg 19501 cctaaagttg ggcggggcct gaagtaggag gagctaaagg aagttctatc ttgataggtg 19561 gattgtagtg gaagagggag gagctagaga ggtgggcggg gcagaacagg ttggttttgt 19621 ttggatgctt gtttctagct tggactagga gtggccaaag ggggtgggtg gggactgaag 19681 tgggaggagc caagcagcac tgagctaaag gaagttcttt atcttgatag gtgggttgta 19741 gctggagagg gaagggccag aaggggtggg tggttaccgt tgtgaggccg tgaaatggga 19801 ggagccctga gctctggcgt ccaggtcaag gacgcttggc cccctccctg tcccgcagca 19861 agaagcgcag ctccctcttc cggaagatca cgaagcagtc gaacctgctg catactagcc 19921 gctcgctgtc gtcgctgaac cgctcgctgt catccagcga tagtctcccg ggctcgccta 19981 cgcacgggct gccggcgcgc tcgcccacgc acagctaccg ctccacgcct gactccgcct 20041 acctaggtat tacctcctgc acctgcgcgg ggaccgagca gcgcggggtg gcctggctgg 20101 tgcttgggct gtactcactc gcttcacctc ctgtctcccg caggcgcctc atcccagagc 20161 agctccccag cctcgagcac gcccaactcg cctgcgtcgt cggcgtcgca ccacattcgg 20221 cccagcacgc tgcacggact gtcgccaaag ctccatcgcc agtaccgctc tgcgcgatgc 20281 aagtcggccg gcaacatccc tctatcgccg ctggcacaca cgccgtcccc cacgcaggcg 20341 tcaccgccgc cactgccggg ccacacggtg ggcagctcgc acactactca gagcttcccg 20401 gccaaactgc actcatcgcc tcccgtcgtg cgcccgcgcc ccaagagtgc cgagccccct 20461 cgctcgccgc tcctcaagcg cgtgcagtcg gccgagaagc tgggagcctc tttgagtgcg 20521 gacaagaagg gcgcgctgcg caaacacagc ctcgaggtgg gccacccgga tttccgcaag 20581 gacttccatg gcgagctggc gctgcatagc cttgccgagt ccgacggtga gacgccccca 20641 gtcgagggcc ttggcgcgcc ccggcaggtc gccgtccgcc gcctgggccg acaggagtca 20701 cctttgagcc tgggcgcgga cccgttgctg cccgagggtg cctccaggcc accagtgtcg 20761 agcaaggaga aggaatcccc ggggggcgcc gaggcgtgca ccccaccccg cgcgacgacc 20821 cccggtggcc ggaccctgga gcgggacgtc ggctgcacgc ggcatcagag cgtgcagacg 20881 gaggatggca ctggcgggat ggccagggct gtggccaagg cggcgctgag cccggtgcag 20941 gaacacgaga caggccggcg cagcagctct ggcgaggcgg gcacacccct ggtacccatt 21001 gtcgtagagc ctgcgcggcc cggggctaag gctgtggtgc ctcagcctct gggcgcggac 21061 tccaaggggt tgcaggaacc cgcacccctg gcgccttccg tgcccgaggc cccccggggc 21121 cgggagcgct gggtgttgga ggtggtggag gagcgcacca cgctgagcgg tcctcgctcc 21181 aagcccgcct ccccaaagct ctccccggag ccccagacac cctccctagc cccagcgaag 21241 tgcagtgcac ccagcagtgc agtgacccca gtcccacccg catccctctt gggctcaggc 21301 accaagcctc aagtggggct gacctcccgg tgccctgctg aagctgtgcc cccagcaggc 21361 ctgaccaaaa aaggagtgtc cagtcccgca cccccgggac catagccaag ggggtcatcg 21421 gccccgcgct gtacagcctc cgtatacata tgtacacata taaataaagt gcgtccgtgc 21481 tgcgtgagtt ttctggggct cactcctctc caggcaaggc gagacatcac acgaccccac 21541 ccccatgccc aggtgctttt tgggaggtgg gactccagtt ctggttacca tggagagtgg 21601 agggaatttt ggatagacac ctcctgtggt cccacttctg gtctcacctc tgcaccaact 21661 gcccccaaac ctttaggggg agaattgaag ctgcgatgct cttgtgtccc agcgcccacc 21721 tgaagagaag gttaacagcc tcgtccacca atttccattt atttactcct caacaatcct 21781 gatggcaggt attatgtttc ccattttgca gacaggtaga ctgtgtcaca gagcggttaa 21841 ggcacacaat caaggtcata cagctaggaa ggggatgaac tgggattcaa aatcaagtcc 21901 aaactggttc ctgagccctt agatttttta ttttttattt ttttgacaca gagtctcatt 21961 ctgtcaccca ggctgaagtg ccgtggtgtt agctcagctc atgacaacct ctgcctcccg 22021 agttcaagcg attctcctgc ctcagcctcc cgaggagcta ggaccacaga cgcgtgccac 22081 catgtctggc taatttttaa atatttttaa tggatatggg gtttcaccat gttggccagg 22141 ctggtctcga actcctgacc tcaagtgatc tgcccatctc agcctcccaa agtgctgggg 22201 ttacagatgt gagccactgc acccacccgt cccccgcccc cccttttttt ttgaaacaga 22261 gtcttgctat gtgacccagg ttggcgtaat catagttcac tgtgaccatg atctccctgg 22321 ttcaatcgat cctctggctt cagtggctgg gactacaggc atttatcacc gtgcctggct 22381 aacttttttt aagttctagt agagatgtgg tctcactatg tagcccaggc tggtctcgaa 22441 ttcctgagtt caagtgatcc tccctccttg gcttcccaaa gtgctgggat tacatacgtg 22501 agccactgca cctggccata agggttagat cttataagct ctgctgggct tcctggccat 22561 gccattacag ggctggtagt tcttcaccag cggctggaag gctttccaga gggctggcag 22621 ctgggcacac agtgtgcccc caccccgttg ctcctctccc tggttccgat tcatgtcacc 22681 cacgcaggtc cagggccctt ttggggacac gcaccatttg gagtggtcct ctgtgctgtt 22741 gaagcttggg ccggctggtc cagggaaagc tatctggttc acattcagaa cctgccagat 22801 atccgagcag ttagagggca ggatgcctac agttttgtgc cagaactgga cctgcaggtt 22861 ggtaccaagg gctgctgcca accagccgga gtacaggtct gcaaaggatg gagagagggc 22921 acaggtaggg tcagggccac tgcgagggag tccctagtcc acttcccctc tctcagctca 22981 gtttccatat ttcctttttt tttttttttt ttgacatgga gtcttgctct gtcgcccagg 23041 ctagagtgca ttggtgcaat ctaggctcat tgcaacctct gcctccgagg ttcaagtgat 23101 tctcctgcct cagccttctg agtagctggg actacaggca tgcacaacca tacccggcta 23161 attttttaat aaagaggggt ttcaccatat tgaccaggct ggtctcaaac tcctgatctc 23221 aagcaatccg cccacctcgg tcttccaaaa tgctgggact acaggtgtgt gtcaccgcgc 23281 cctgccacag tttccatatt tctgagctca gttgtcaatt cccagtcctg ggaatgatga 23341 cattaatatt aattgccctt cccaaatttg tttttgagac agagtctcac tctgtcgccc 23401 aggctggagt gcagtggtgt aatctcggca cactgcaacc tctgcctcct ggtttcaagc 23461 gattctcatg cctcagcctc ctgagtagct gggattacag gtgcccgcca ccacacccgg 23521 ctaatttttg tacttttagt agagatgggg ttccaccatg ttggccaggc tgttctcaaa 23581 ctcctgaccg tgtaatcagc cagcgccctc caccatgccc agataatttt tgtactttta 23641 gtagagatgg agtttcacca tgttggccag actggtcttg aactcctggg ctcaagtgac 23701 ccacccgcct tggtctccca aaatgtgggg attacaggtg agccaccgcg tccggccctg 23761 agtgttttta attaattaac ttatttattt tgagactggg tctcattttg acaccccagc 23821 tggagtgcag tggcaggatc atgactcact tgcagcctcg acctcctggg ctcaagctgt 23881 tctcccacct cagcctcctg agtagtgggg actacacgtg tgtgccacta tgccccacta 23941 attaaaattt tttttgtaga gatggggtcc cgctgtgttg cccaggctgg tctcaaactc 24001 ctgggctcaa gcaatcctcc tgcctcgact gggattacag gtgtgagcca ctgtgcctgg 24061 cctatttatt tgtatttatt tatttattta tttattttga gacacagtct cactctgtca 24121 cccaggctgg agtgcagtgg cactatcttg actcactgca agctccacct tccgggttca 24181 cgccattctc ctgcctcagc ctcccgagta actgggacta caggtgcccg ccaccatgcc 24241 tggctaattt ttgcactttt agtagagaca gggtttcacc gtgttagcca ggatggtctc 24301 gatctcctga gctcgtgatc tgcccacctc agcctcccaa agtgctggga ttacaggcat 24361 gagccactgt gcccagccta tttatttatt tttaaataaa gacagggtct tgctctgtca 24421 cccaggctgg agtgcagtgg agtcattata gctcactgca gtctgaaact ctgggctcaa 24481 tttatcctcc tgcctcagcc tcccaagttg cttggctaat ttttaatttt gtagagatga 24541 ggtcttgcta tgtttcccag gcttgtcttg gatgcctgtc ttcaaatgat cctcctgcct 24601 cagcctccta agtaactggg attacaggag tgaggcacca cctctggcac ccaaatgact 24661 tacaggtacc acttcattca caatttttta agtagatgtc agtatcccca tttttcagat 24721 gagtagacca aggctcaggt gaagttatac atcacacagc aaaacccagg gtttgaaccc 24781 aggtctgttc actgatcatg tgattatagc tcccaaacag acctggtctc taccgctgac 24841 ttgaactcag ccatggccac accagtcacc ctgtctgcag agaaggagct ctccaaccct 24901 caaccttgag actcaccatc tccaaatttg ctgaacttgg caaagctctg gaaaacagcc 24961 ccggcctggg atgtgagtgt gatgctgctg ttccagggtt cttggctaac gtggtggccc 25021 ttgaccacat tctccaagtc ggggaattcc tgggcaaaga tcccttccag ctggtagtta 25081 tagacccagg ggtaggtgta ggtcagctgc ttgcctggag gacaagggga ggaggaaatg 25141 caagatcgtt ccaggttctg gtcagtaaac ccaggactca tcctgtgtcc ctgactcgac 25201 ttacccatct tcgagaactg agcgaaggga aaagacacac agagcagggt ctgcccgtag 25261 gtacaggcgc tatgaggcca gctgtatgca gcagaggagg ccggtggagg gaagttaggt 25321 acactgtgga ccagccagaa gcccccatcg tggtcaagga gcaggacacc tggaccaaaa 25381 gaaggcatta ggggaggtcg ggcgcagtgg ctcacgcctg taatcccagc actttgggag 25441 accgagacac acagatcacc tgaggtcagg agttagaaat cagcctggcc aacatggcga 25501 aaccccatct ctactaaaaa tacaaaaatt agctgagcgt ggtggtggac gcctgtaatc 25561 ccagctactc aggaggctga ggcaggagaa tcgcttgaac ccaggaggtg gaggttgcag 25621 tgagctgaga tccaccactg cactccagcc tgggccacag agcaaaattt catctcaaac 25681 ataagtaagt aagtaagtaa ataaataaat aaataaataa ataaataaaa atttaaaaaa 25741 agaaggcagg ccccacgcag tgtaatccca gcactttggg aagccgagga gggcggatca 25801 cctgaggtcg ggagttcaag accagcctgg ccaacatgga gaaacctcat cactactaaa 25861 aatacaacat tagccgggca tggtggtgca tgcctgtaat cccagctact cgggaggctg 25921 atgcaggaga atcacttgaa cccaggaggc ggaggttgca gtgagccgag attgcaccat 25981 tgcactccag cctgggtagc aagagcaaaa ctccttctca aaaaaaaaaa aaaaaaaaag 26041 gcaattagag ggtattgctc caggcctccc agcaagctca tctgcaatca ggaacaccct 26101 cagtttgcag cggggcttag agagggtgcc agcagtgtta gaaccacagc tgagctcagt 26161 ttttgcaggc ataaattctt atagtccaga ataacgccaa gagtaatgta aggtagggaa 26221 acaggccagg tgtggcggct catgccggta atcgcagcac attaagaggc caaggtgggt 26281 ggattgcttg aaccaggact tagagaccag cctaggcaac atggcaaaag cccactctac 26341 aaaaatacaa aaagtaggcc aggtgtggtg gctcacgcct gtaatcccag cactttggga 26401 ggccaaggca ggtggatcac ctgaggtcag gagttcaata tcagcctgtc caacctggta 26461 aaaccccatc tctactaaaa catacaacat ttagctgggc atggtggcgc acacctgtaa 26521 tcccagctac tccagaggct gaggcagaag aatcgcttga actggggaga tggagattgc 26581 agtgagctga gattgcacca ccgaactcca gcctgggtga tggagagaga gtgccgcaaa 26641 aaaaaaaaaa aaaaaaaaag ccaggcgtgg tggggggtac cggcaggccc agtacttggg 26701 aggctgaggt aggaggatca cctgagccct gggaggtcaa ggctgcagtg agctgtgata 26761 ctgcactcca gcctaggtga ctgggtgaga ccttgtctaa aagaaaataa aataggggaa 26821 atagatgccc aaggtctgac aggggagatc tcccacccca ctatggaggt caggggttag 26881 cttggaaaaa tgcataggaa ctcggctggg cagggtggct cattcctgta atcccagcac 26941 tttgggaggc caaggcgggt agatcacttg aggtagaaac caggagttcg agaccagcct 27001 ggccaacatg gtgaaacctc gtctgtacta aaaatatgaa aattagcctg gtgtgatggt 27061 gcgcctttaa taccagctac ttgggaggct gagacaggag aattgcttca accccagagg 27121 cagagattgc agtgagccaa gattgtgcca ctgcactcca gcttgggtga cagagcgaac 27181 tcccaccgtc tcaaaaacaa agaaaaaaaa aagttggggg gaatagatgc ccagggtctg 27241 acagtgcaga tctccccgac caccccatgc acgtcagggg ttaccttgga aaaatgggac 27301 ccgagtctct ccccagcccc caatccaggc ctcacccttc gtgtgcccac gcatggaaga 27361 gtcctgagcc ttgctgggtt gaggcggttg gtcattgtag agcaggaagg cgagctgcgg 27421 gcggataaac ggaggcaacg tgaaattcct cccaggcagg gcagccctag agtttcgccc 27481 ctagttggcc cggggccccc ttcacctggc tggtgttgct ccggtacagc ggctgcaggc 27541 ttcggcccac ggccccctcc gggctgttga tgagtgccct gccgtcccgc cagcctccgg 27601 agctctcgtc cagatacttg tactgcagcc ctctctgcgc cgcctccccg gaccctctaa 27661 gagctggcag cttgtagacc acgaacctgg aggtcggaga atgcacagaa ataggagaga 27721 gggaagagaa ggcaccccag ggttccccga ggaggtagcc atcccggttg agtgaccgca 27781 gccggacccc ggggcgatgg taggcccccc gctgcgcact caccagtcta caggctgccc 27841 ggagtccccg tagcaggtca gggccccggc ggggacgcac agcagcgctg ccagcagcag 27901 cgggatcata gctgctatgg ggctgagatc caggaatctg tgtcgggact gcggggcgct 27961 gggttacatc agaggccagg actggcacct ggcgcctttc acttccctaa acttgcctgg 28021 gaaccggggc ggggacatca cgagggtaca gactcctccc ccgagacgcg atgctgcgtt 28081 ttggagaggc aaaaggcaac gctgagtctg cccagaccac gcccacgacg ggcccggcgc 28141 tccagcgtct cctggagcct ggccaccgtt ccttttcgtg gtagctaatc ccagcccacg 28201 ctcttctgtc tgacccaact cccgcccggg tttctggatc aggcacaagt ttgtatttta 28261 ttttttattc cagccctaca acctgggtcc aactcttggg acctttgttc ttcctctttt 28321 tttttttttt tttttttttt ttttttttga gacggagttt cactcttgtc gcccaggctg 28381 gagtgcaatg gcacgatctc ggttcactgc aacctccgcc ttcctgatta aagcgattat 28441 cctgcctcag cctcccgagt agctgggatt acaggcgccc gacaccacgc ccggctaatt 28501 tttgtagttt tagtagagac ggggtttttc catgttgact aggctggtct cgaactcctg 28561 acctcgtgat ccaccagcct cggcccccga aagtgccaag attgcagaag tgagccactg 28621 cgccccttaa gaagcatttg tgttccagtg tatcttcccc tttttctttc tttttttttt 28681 tccttccttt tttttttttt tttttcccgg acaatatctc gctctgttgc ccaggccgga 28741 gtgcagtggc gcaatcttgg ctcactgcaa cctctgcctc cccagctcaa gtgatcctct 28801 cacctcagcc tcccgagtag ctgggatcac gggcacatgc cactatgccc agctaatttt 28861 tgtagttttt tttttgaaac acggtttcac catgtggctc aggctgaact cctgagctca 28921 agggatccac ctgtcttgcc tcccaaagtg ctgggattac aggcatgtaa tccacgccac 28981 ccagactctc ctttttcatt ctgcacgttt tctccaggat cccctgagct agcaaagagg 29041 tgactgtatg tttgtttttt gtcgttttga gacaggataa tggtttctca cccaggctag 29101 aatgcagtga ctccatcaca gttcactgca gcctcaacct actgggctca agagatcttc 29161 ccaggtagct ggtactacag gcctgtgcca ccacacccag ataattgttc attgcagaga 29221 tgagggtctc actatgttgc ccaggctgct ctggaactcc ggggttcaag tgattctccc 29281 acctcagcct caaaggatca tggcagggca aagggctatg ggaatctgag tactggcttg 29341 aaactgcctt tgcaaaaatt atgacagaaa attatgtcag tgaaagagct ctgacctaac 29401 caactccatc ttgcctttaa tctccaaact gcccttgctt ggtcattcct ggagtggggt 29461 caagctaact tttttttttt ttttgacacg ttgtctcact ctgtcgccca ggctggagtc 29521 cagtggtgtg atctcagctc actgcaacat ccgcctccta ggttcaagca atcctcctgc 29581 ctcagcttcc tgagtagctg ggattacagg catgcgccac atgcccagct aatttttgta 29641 tttttggtag agactgggtt tcaccatttt tggtagagat ggggtttcac cttggggctc 29701 aggctggtgt cgaactcctg ggctcaagtg atctgcccgc ctcagcctgc caaattgctg 29761 ggattaaaga gggagccacc atgcctggcc ccaagctaat tttgggagga atttagttta 29821 tagcttaaat gataatagcc cttccccaaa ctaaactgcc tttgtagaaa taatgaaagg 29881 gcactaggtt aggaggatga gaggagcctg aattctgcta aactgtaggt gtagtagtta 29941 aattatgacc agccattttt ccggccgggt gcggtgcctc acgcctgtaa tcccagcact 30001 ttgagaggcc gaggctggca gattacctga ggtcaggatt tcgaaaccag cctggccagc 30061 atttcgaaac cccatctctt ctaaaaatac aaaaattagc ccggcatggt ggcatgtgcc 30121 tgtaatccta gctacctggt aggctgacac aggagaatca cttgaaccca ggaggcagag 30181 gttgcagtga gccaagatgg aacaactgca ctccagcctg ggcaagagca agactccatc 30241 tcaaaaacaa aaacaaagat ggccagccat tattctggag gtcacaaggt ttgcaacttc 30301 cccagttatt tctgcaaata acatgactat tgcaaaacct aacgctggcc tttttagatg 30361 tctttttagg ctttttgcat atctgacaac tggatgactc cacccagact agcgactcta 30421 tggtccccac ccagaagctg actcagcaac tgttttccac agctccagga ttgcagcaag 30481 gcacccattc cctagccctc ctgcccacca aactgttctt gaataaccct agcctctgaa 30541 ttttcaggga ggctgattca agtaataaaa ctcccatctc ccatttagct ggatgtatgt 30601 ggattaaact ctctctattg caatccctct gtctcgataa attggctcta tctggacagt 30661 gggcaagatg aaccccctgg gcagtcatag gctgtgaact cttaaatcca gggagatgtt 30721 tgctgtcaat gtccgcaggg ttcaaggcag aagacaagga cagtgtagcc tgaactgggt 30781 gggaaaagag aggacagcaa ggcctccagt gcccaagggg gcttgttgga gttgaggggt 30841 gtcactgtag tttaggagat gagggtgtgt aaggccctct aatatgacct ctaggtttac 30901 agcctcctgc catcttccag cctccaaaac ccaaaaattc aggccaacgc tgaagcttta 30961 tctgggaggg gcatttttat aggacccata accattgaca gttaatatca gccacaataa 31021 gggaccttca ggagccgctt tctagaccca gggtctctgg gaagcctcat cccacaccct 31081 ctcctgctgt ccagggccat ctgggagctg gtctgagtgt ccactgagtc cgtttatttg 31141 gcggtctgtc tcactgggtt tgcacgacag tttggacatc tctgtgtggc tccctgtggc 31201 tgaaggcttg tcggattttc cgtaagaggc tcccccaggg ctgtctatgg gtccgtgttt 31261 gatatttggg tggatctttg ggaacgcgag tccaggagag ggtccattcg tgggaaaacc 31321 acccagcatt gtgtcacgcg cgtccgtgtg aagagaccac caaacaggct tcttgtccca 31381 tccccagtca ctaggagagt ccaagtgcca gggcagggct caaaggtggc gcttcatgtg 31441 caaggccagg tggtcagagc gcgaaaaagc acgtgggcag agctggcagc ggaaggggcg 31501 ctgccccgtg tgtttccggt agtggcgggt cagctcgtcc gagcgcgcga atctccagcc 31561 gcagccttcc cacgtgcagg cgtatggctt ctcccctagg ggacaaggaa gccataagcg 31621 ccactgtctg cccagtcatg tccccgggtc ccctgcatct ggccacaccc ctttactcag 31681 cctgggctgg gactaggatg aacaaagtga ggcccctagg gcacaaaatt taaggaggca 31741 ctcactctca gaggccagcc aagtccaagt cccgccctct gcaacccttc ttcccctgta 31801 actacagcgg gcgccgcgcc ctttctcatg tccggggccc cgccccctca cctgtgtgcg 31861 tgcgcagatg cgccttcagg tgggagctct tggtgtagct cttgccgcaa cccgggtgcg 31921 cgcacgtgtg cgctgcctgc ctcttgcgcg cccacgaacg tcggcctcgc ttggatggcg 31981 cggtctcggc tatcacacct ggatcctctg cagtcccccc gagtccagtg cccaccgtcc 32041 cgggtcccaa acaactcagg aaggaggggg acgtggcggg accgggcgcg ggtccctgga 32101 gcccgcggaa gagctggaag tgcccttggt actgaggcgc cgggtacatc gcggggtacc 32161 cggacagtag cccgtagggg gcgcccgacg ccgcaggcac tgaaagcccg gtccgcggga 32221 agtagccacc cgaggagccg gcgccgggcc ccgggtacac cggttgcagc gccagcgcct 32281 tgggctcggg ggccggggct ggagccaggg ctgggcccac gaaggcgtcg ggagcccggg 32341 ctcgcagggc agggcgcacc caacccgagt gatcctccga acccaaaagc ccagccacca 32401 gccccgggcc gccagcatat gcgcccagag tctcgggcgg cggcggatat tgcgccccgg 32461 aggcctcgct gggcgccaga gcgcaggtct ggggcgcgcc accgggctcc gggcccgaga 32521 agttggtgag gaggagatcc aggtcccagg tggcgtccgc gcccctctca tcgtcctctt 32581 cctccccggg ctggtcctca gacttcacgt ggaggggcgg ctccgtgggg tcaggaggac 32641 ccgggcccat gtcctgcgcc tcttcggagc gccaccactg cgggagggag caggcagctc 32701 gaggttcggt ggacactggg gtgccctgcc cagggacatc gcgggctgga cactctgacg 32761 cagaggcttt ggaaaggggt cttgtttgcc tgtctgtctg tcccacttcc ccaacactgc 32821 ccctctaacc gcccctctcc ccgctccccc cccagctcca gggacagaaa ccgatcaggt 32881 ctggctggag acaggagata agacttccac tatctggaac cacactcggg gatgggggaa 32941 ggggtagcag agctgctgga gatggaaaac tggcagggga cgggggagag gatgccagcc 33001 ttgacttagt tttcccccag aacatccctc tccttccctg tctcccaaaa caagattaat 33061 tccgaaattt tggatgtccc ccagacacac tcatcatttc ccgctgatat ctggaagatt 33121 gtctgtgagc ctagatctcg ttcctttttt tttttttttt tcttgagata gggtcttact 33181 cttttgccca ggctggagtg cagtggtgca gctcactgta tccttaatct cctgggctct 33241 aatgatcctc ctgcatcagc ctacggagta cctgggacta caggcacacg cccccatgcc 33301 tggcctaatg tttttggtat tttttgtaga gatagggttt caccatgttg cccaggctac 33361 cttcgttttc tattaccgaa atagatcaca cttagaacct caaaccccta gaccaccctc 33421 ctcaccccct gccagactaa gctgagatct cctctcctgg actgagcgta cctcagtcct 33481 ggttaagtct cttgatttca ggtcaagatg caggtctgga ccccaagatc tgtgactgtg 33541 gccctggatt ccagccagcc cacctagacc ccaccttcta ggccccacct tgaggaagtc 33601 atcctgtgtg tccgggaagg ggcccagggc ggtcagtgtg ctgatggagg gcaaggcggt 33661 ctcggctgtg gccatggctg gctggtgccc accctgggcc tcaagcctcc tcttcctcgg 33721 ctgcctcgtg aactctgagg ctgtgatagc cccttcgagg gctcctctct gtccttagct 33781 gattggctgc agcctctgat aaggcaaagc aaggcaaggc ggcggggggg cactgtttct 33841 ggggcacaaa cttcacgttg gcctgtctgg ggctggggtt aaagactaac cctgtgtcca 33901 aagccaagtc aaatatcaag ggttgggggg ttcagggttt gagggtccag gtgctgggta 33961 aaaacagaca ttgggtctcc aaagaaggag agacttgggg gtcttccctc agttaacttt 34021 cctggaaaag gcagaaaggg tattctggga agaagctgtg ggaccccctg tccccctgat 34081 tgaggctcca cagccctccc cctccccagt aaacagcaac aaccgtgctg atggcgggac 34141 ttggcacgag ctccccgcca agcattatca gacaccccag acgttggagg ctgctatcag 34201 gtgggggccc aggccagcta gagctctgtc ctccggtgaa ggggagggct gggagttggg 34261 tcttcaaatt agcctggcgt tcaatttgcc tgggttggga cctcccaggg tctcagccct 34321 gcagcaggag gaagcccctg acaactggcc caccattgtc tactggggac ccttgggtgg 34381 cactgcatgg gactgcatgg aggctggttc agtgccagct gctttagtgc gatgggggca 34441 gccaagggaa aagtgaccct ctctcctcct gggtgggata ggaccgttgg atcaagccac 34501 aggcagtacc tgacctgcag gagaccagtg ccctggctct gggcctggct gctccttgat 34561 tctgcccaac atgaggagag gatctgagtt tctacaacag gaacctccac tctggtgccc 34621 tgggagcctg ggaaggggac atggatacgt attacagaca cataactcat agtcactcat 34681 tggaagactg aggcaggagg atcacttgac accaggattt caagaccagc ctgggcaaca 34741 taaaacctat ctctccaaca aaaagctgtg agtggtggtg catgcccata gtcccagcta 34801 ctcaggaggc tgaggtggga ggattgcctg agcccaggag gtcgggactg cactgaacta 34861 taatcacacc actgcagtcc agcctggggc acagaaagca agactctgtc gctggaaaaa 34921 aaaaaaagtt gggtggccat ctttattcct ggcagatcaa cctgggtgac acaaacaaaa 34981 acaaggccag gtgcggtggt tcacacctgt aaccccagca ctttgggaag ccaaggtggg 35041 gagatcactt gaggtcagga gttcgagacc agactggcca acaggatgaa gccctctctc 35101 tgctaaaaat acaaaaatta gctgggcatt gtggcacatg cttgtaatcc cagctatttg 35161 ggaggctgag ctggagaact ggttgaaact tggaggcatg gggccaggcg cagtagctca 35221 cgctagtaat cccagcactt tgggaggccg aggcaggcag atcacctgag gccaggagtt 35281 cgagacaaac ctggccaaca cagtgaaacc ctgtctacta aaaatataaa aataggccgg 35341 ggatggtggc atacacctct aatcccaggt acccgggagg ctgaggcaca agaatcactt 35401 gagcctggga ggcagaggtt gcagtgagcc aagatcatgc cactgcactc cagcctgggc 35461 tacagagtga gattctgtct ctaaaaaaca aaaacaaaac aggcttggag cagtgagtag 35521 tggctcatgc ctgcagtccc agcactttgg gaggccaagg taggaggaat gctcgagccc 35581 aggagttcaa gaccagcttg ggcaacatag cgagatccca tctctaaaaa caaaaacaaa 35641 aacacagtta ctcacaggac catgcaatta cataatgaca tgagttcctg ttccctcagt 35701 cacactgatc atataatccg tgcacatata attgtgtgtc tgtctatgta tatggactat 35761 gcacagacat acaattccca ggcactaaaa cacaacccta tgctatattg gttacacaat 35821 gtcatagcca aacaatctct aaaacacaga attcataatc cacaggcagg cacacagtta 35881 tacaatccta ctgacacaat ttagccgtaa ttgagcagac acacatgggg agtcagatac 35941 attgtcacag aacttttttt ttttagatag tgtctctctc tgtcacccag gctggagtgc 36001 agtggcatga ttttggctca ctgcaacctc cacctcctag gctcaagtga ttcctctgcc 36061 tcagcctccc aagtagctgg gattacaggt gcctgctacc acgccacgct aatttttttt 36121 tttttttttt ttttgagaca gagtttcact cttgttgccc aggctggagt gcaatggcat 36181 gatctcagct cactgcaacc tctgcctcct gggttcaagt gattctcctg cctcagcctc 36241 ccgagttgct gggattacag gcacccacaa gcaggcctgg ctaatttttg tatttttagt 36301 agagatgggg tttccccatg ttggccaggc tggtcctgac ctcaggtgac ccaccctcct 36361 tggcctccca aagtgctggg atgacaggca tgagccgccc cagccagtca aaacctacac 36421 tttatacagt cacaccaccg gtcactttta caatatgtaa agtaattatt tagtcacagt 36481 tgcatagcta ccagtgccca accgtagggg atgcacccag ttaaacacag acaaacgcaa 36541 ggacatggca tcacaggtcc tgagaatcaa gacacacaca tttctcaaca gatacacaat 36601 cagaatgggt gcaccacaaa tgcactacac aaaaagacaa aacaggcggg gcgcggtggc 36661 tcacgcctgt aatcccagca ctttgggagg ccgaggtggg tggatcactt gaggtcagga 36721 gtttgaggcc agcctggcca acatggtgaa acccgtctct attaaatata agaaaaaagg 36781 ccaggcatgg tggctcacac ctgtaatccc agctactcag gaggctgagg caggagaatt 36841 gcttgaatcc agtaggcaga ggttgcagtg agccaagatt gcgctactgc actccagcct 36901 gggcaacaga gtgagaatcc gtctcaaaaa aaaaaaaaaa aaaaaaaaat tagccaggca 36961 tgatggcatg cttcagtggt cccagctact tggaggctga ggtgggagga tcgcccagga 37021 ggtggaagct gcagtgagct atgatcgcgc cactgcactc cagcctgagc gacagaacga 37081 gaccctgtct aaaaagaaaa aacaaaaaca aaacaaaaca accacataga ctcacagaat 37141 cacacaagtt cacacacaca cacacataca cagacacaga cacacacaca cacacacaca 37201 cacacacaca gagtcacaag tatctacatg tgcttcctgg gacagactgg cagaaaggtt 37261 tgctaagatc gcccacttgg agctcgtctt ccccacaacc agcacccaat tagagaactc 37321 gagtctggcc tcctgggttc aggaggacgc ctagtgctcg cgccagaatt tctttgttta 37381 attcttgact ccctccgcac acaccccctg caactgacca accaggagtg ggccgagccc 37441 tttcctacgg ccaataagag gagaaaccca gctggccaat cggtttcccg cactcgtctt 37501 ccgcccctac cccgtccgct tcttaaaggg gctagcctat ctctgtctgg gccccccgat 37561 ttccacaggc aaggctacct tagccctttt aaaaggcaga gctgcggagg gggccggatt 37621 ctaggaggaa ccaatgaaaa gcctcactcg gcctccgctc ctcccacttc ttgctgaggt 37681 caaaggcctg cgtcagttgc actgtagcct cggcagtgaa ccgggaggta ctaccaggta 37741 aggaaggtgc ggtagcccca gccgtgggtg agaggagctc cgctctgaca cccccgctcc 37801 tgtaggtcgc cgtcgttgct ccgctcgctc tgagagagca tggccctgag aggcgtctcc 37861 gtgcggctgc tgagccgcgg acccggcctg cacgtccttc gcacgtgggt ctcgtcggcg 37921 gcgcagaccg gtcagtgtgg ggtcgggagt gtggagggaa ggagggagga actgggggtt 37981 tagggacttt ccggggtgac tttcccgttc tgtgcttgca gagaaaggcg ggagaacaca 38041 gagccaactg gctaagtgta aggacctctg gtcgcaccgt gtgtctgctg cccctgttca 38101 gctgtctgtc tgccgcaggt ggactctgtc ccagaatccg agagctgccc gagcggggtg 38161 gcagggtcgt ggccagggtc agaggcacta aggcagtgag tgcgctgtgc ctgcggggcc 38221 ggagaaaagt cacctgatca gtctcgcttg cagctcgcac tagccggggg gcgacatggg 38281 tgttgggggg tagggctgat gagggtccga gaagggaggg cacagtgatc ttgcggactg 38341 gaccgaggcg aattcccctt cccagcctcg cgtcccgagt ttgactggca ggacccgctg 38401 gtgctggagg agcagctgac cacagatgag atcctcatca gggacacctt ccgcacctac 38461 tgccaggaga gactcatgcc tcgcatcctg ttggccaatc gcaacgaagg tgggcgggct 38521 ggtgggtgcc ctgagactgc tcctccgcct ggagccatag ccaccccacc tcaaggcccc 38581 tctgtccttg gggctggggc ttcctgtggc ctaggcctgg gcctgaattt gggcactggt 38641 ccctttgcag tttttcatcg ggagatcatt tcggagatgg gggagttggg tgtgctgggc 38701 cccaccatca aaggtaggaa caagtatctc tccacacact gcagaaccct ctgtattctg 38761 aaagcctctt cctccttccc tccctccctt tcttccttcc ttctctttct tttcttttcc 38821 ttttctttcc tttcttcttc ccccccaaca gagtctggct ctgttgccca ggctggagtg 38881 cagtggcacg atcttggctc actgcaaatt ctgcctccca ggctcaagcg attctcctgc 38941 ctccacccct ctagtagctg ggattacagg tatgtgccac catgcctggc taatttttgt 39001 atttttagta gagacagggt ttcactgtgt tggccaggct ggtctcaaac tcctgacctc 39061 aggtgatccg cccacctcag cctcctaaag tgctgggatt acaggcatga gccaccacgt 39121 tcagccttct ttttgagatg gagtttcgct cttgttgtcc aggctggagt gcagtgatgc 39181 aatcttggct cactgcagcc tccacctccc gggtttaagt gattatcctg cctcaggctc 39241 ccgagtagct gggattacag gcgtccgcca ccacgcctgg ctaatttttg tatttttagt 39301 agaggtgggg tttcaccgtg ttggccaggc tagtctcgaa ctcctgacct caggtgatcc 39361 acccgcctca gcctcctgat tacaggtgtg agccaccgtt gcccggccct tttctttttt 39421 tttttttttt gagatggagt ttcgctctgt cacccagcct ggattactgg attacagtgg 39481 tgcgatcctg gctcactgca gtttcctcct cctaggttca agcaattctg ccacctcagc 39541 cttctgagta gctgggatta caggggtgca ccaccacgcc cagctaattt ttgtattttt 39601 tagtagaaat gggatttcac catgttggcc aggctggtct tgaactcctg acctcaggtg 39661 atccacccac ctcggcctcc caaagtgctg ggattataaa cgtgagccac cgtgcctggc 39721 ccctttgttt ctttttttag agacagggtc tcactgtgtt gcccaggctg ttctcaagtg 39781 atcctcttgc cttactgaaa gcccccttct ttccctaagc cacaatttcc cagtctgtaa 39841 agtggggctg ttgtcccacc ctctgaaagt ggctgtggag atgaaatgaa taaacctcag 39901 ctagggccag cttggtgcct gcctccttgt gtgtccttat tcagccctgt ctcttgggtc 39961 ttagctgggc agggccctgt tctctattgt cctgctttcc cctcctacta ccaccaggat 40021 atggctgtgc tggggtttcg tctgtggcct atgggctcct ggcccgagag ctggagcggg 40081 tggacagtgg ctacaggtcg gcgatgagtg tccagtcctc cctcgtcatg caccctatct 40141 atgcctatgg cagcgaggaa cagcggcaga agtacctgcc ccagctgggt gagtggctgc 40201 ccatggggcc tggtggaagg aagacagtct ctgaggtctg gaactcaagg gtggggctgt 40261 cccctgagcc tattctgtcc ctatctcaaa gatagcataa gtggccacct ggacccccgc 40321 cagaccctgg gcttcacctg gagatctgat ccctggccag cctgactgtc cccctctgtg 40381 accaccgtca tctccctatg ctttctgtgt tccccagtcc agcccaaagt ttaaagtcca 40441 ccaggttcct gctggccatt tgcagtggct cacacctata atcccagcac tttgggaggg 40501 tgaagtgaga agatcccttg agcccaagag ttcgaaacca gcctgggcaa cgtaaggaga 40561 ccccatgtct attagaaaaa caaaaaaagg aaagagccta tgtgacctgc gctaagtgga 40621 cgttggccct cttccgtggt gtctcggagg tgttcagctg cttcaagatg aagctgaaca 40681 tctccttccc agccactggc tgccagaaac tcattgaagt ggacgatgaa cgcaaacttt 40741 gtacttttta tgagaagcgt atggccacag aagttgctgt tgacgctctg ggtgaagaat 40801 ggaagggtta cgtggtcgga atcagtggtg ggaacaataa acaaggtttc cccttgaaac 40861 agggtgtctt gacccatggc cgtgtccact tgctactgag taaggggcat tcctattaca 40921 gaccaaggag aactggagaa agaaagagaa gatcagttca tggttgcatc gtggatgcca 40981 atctgagtgt tctcaacttg gttattgtaa aaaaaaggag agaagggtat tcctggactg 41041 actgagacta tgatgcctcg tcacctgggg cccagtacag ctagcagaat ccgtaaactt 41101 ttcagtctct ctaaagaaaa tgatgtctgc cagtatgttg taaaaaagcc cttaaacaaa 41161 gaaggtaaga aacctaggac caaagcaccc aagattcagc gccttgtcac tccacatgtc 41221 ctgcagcaca aacagcggcg tattgctctg aagcagccgc atattaagaa aaataaagaa 41281 gaggctgcag aatatgctaa acttttggcc aagagaatga aggaggctaa gaagcaccag 41341 ggacaaatcg tgaagagacg cagactgtcc tctctgcgag cttccacttc taagtctgaa 41401 tccagtcaga aataagattt tttgagtaac aaataagatc agattcgcaa aaaaaaaaaa 41461 aaacccacaa ggccctgtgt gaccaggcct ctgctgacct ttccacctca tctctggcca 41521 ctcatgtgat cctcccagcc agacttagct acttgaaatt ctccagaagt accaccagtt 41581 ctttacacat ctgttccctc tcccagcact gctcttcccc acagcctcct cctgcttaac 41641 tcctcacacc cttccagttt ggccaactcc ttcctgattc cctgggtccc cctccccatc 41701 ttggcattgt gatcatgact ttggggtaac ctgtctcttg aaagcagggc cagatctaac 41761 ttagttatga ggtctgactc aggggcgagg gtaatttttt ttttcttttt ttgagacaaa 41821 gtcccgctcc gtcccccagg ctgaagtgca gtggtacaat cttggctcac tgcaacctgc 41881 ggttccccag ttcaagcaat tcctgtgcct cggcctccca attagctggg attgcaggtg 41941 cttgctaccc gtgcctggct aatttttgta tttttagtag agatggtgtt tcactatgtt 42001 ggccaggctg gtctcgaact ccagacctca agtgatccat ctgccttggc ctcccaaagt 42061 gctgagatta catgtgtgag ccacctcgac tggcctaatt ttttgtattt ttagtagaga 42121 tggggtttca ccatattggt caggctggat tttttttctt ttttgagatg gagtcttgct 42181 ctgttgccca ggctgtagtg cagtggcatg atcttggctc actgcaacct ctcctgcccg 42241 ggttcaagcg attctcctgc ctcagtctcc tgagtagctg ggattacagg cacccaccac 42301 cacatctggc tatttttttt tttttttttt tttaagtaga gatggggttt caccatgttg 42361 gccaggctgg tctggaactc ctgacctcaa ttgatccacc tgccttggcc tcctaaagtg 42421 ctgggatgac aggcgtgagc cactgcaccc cgccacgagg ataatttttg agtaagggga 42481 tgtatcaggg accaggcagc cttgtgactt tgtcttgtgc ctgcagccaa gggggagctc 42541 ctgggctgct tcgggctcac agagcccaac agcggaagtg accccagcag catggagacc 42601 agagcccact acaactcatc caacaagagc tacaccctca atgggaccaa gacctggtaa 42661 gggttctggg tggtgggcag gtggtgaaca ggggcaaagg ggcactggtc agacccctca 42721 ccgactgttc catccccagg atcacgaact cgcctatggc cgatctgttt gtagtgtggg 42781 ctcggtgtga agatggctgc attcggggct tcctgctgga gaaggggatg cggggtctct 42841 cggcccccag gatccagggc aagttctcgc tgcgggcctc agccacaggc atgatcatca 42901 tggacggtgt ggaggtgcca gaggagaatg tgctccctgg tgcatccagc ctgggggtaa 42961 gtggcagcca ctttgggaat gggtgttggg tcacctgcgg atgcggcttt gtcaggcagg 43021 ctccgtgctg gggacgcggc tccctgtgcc tgtggagccc acacagtggt gattcttact 43081 cagccggact cgctgacgtg ctgaaaactg cccccatttg gtgaccgtct cgctcatccc 43141 ggctctgccc gggacacatg ggcctgaacc agctcagtca tttgactcac agtgcatctt 43201 ctggcatccg tcagcctcct ggctctgagc atcgaaccca gatgccaggc tgggtgggac 43261 tgtgtgcaaa ccgagtgagc aggcaccgag cttcagtgcc agggccatct gtgatgtgaa 43321 ccacaacctg agtccccctg cgtggggtgg ctggggagga ggctttccct gcttcagagt 43381 tggttctgca taggccctct tggtgtctct tgggtgggcc tgaggcgcca tctcaaccct 43441 acagggtccc ttcggctgcc tgaacaacgc ccggtacggc atcgcgtggg gcgtgcttgg 43501 agcttcggag ttctgcttgc acacagcccg gcagtacgcc ctcgacaggt gtgtgagggc 43561 tgcagtgaga ttctctgggg gtgtggggca gcttgggttt cactctctat accatgggtg 43621 actccccagc ccccacccac caggcctgag ttccttgctc tggaatgacc agtgacgtcc 43681 ttctgagcag ctgtgggctg agtcaacggc agggccaggg caagcttggg ggcactgagg 43741 cagcctggga aggcgtcctg gagcaggggg ccccaggaca gggacggggt gggagagtgg 43801 gcctcccctc gctcttaccc tgccattgcc catgtaggat gcagtttggt gtcccactgg 43861 ccaggaacca gctgattcag aagaagctgg cagacatgct cactgagatt accctgggcc 43921 ttcacgcctg cctgcagctc ggccgcttga aggaccagga caagtagggg ctgtgtggtg 43981 ggggcggggg gatggcagcg gtggctggag gaccttgtgt ccttcctgga gagaaaggtc 44041 cttcctgcct ggtggccctg gggacctgaa ccttctgctg tccctcttgt ccttgatggg 44101 ctgggctgag gacagcccca ctggtccctc attgggagct tggctgcatc agggaatccc 44161 caccccgggc taggtttgct tggagcatcg ggatgccagg atccccagtc cttgttaccc 44221 tcatgtgcca ctcccagggc tgcccccgag atggtttctc tgctgaagag gaataactgt 44281 gggaaagccc tggacatcgc ccgccaggcc cgagacatgc tgggggggaa tgggatttct 44341 gacgagtatc acgtgatccg gcacgccatg aacctggagg ccgtgaacac ctacgaaggt 44401 aggagctgga cctcagaggg ctcactgagg cctcagtgtc tggggagggg gtacagggag 44461 gtgggacggg gacaggtctg agtccaactc cacctatcac taagtgacag tgtgacctgg 44521 ggcgagcctt acccctctgt ccctcatctg gagttactca catggggact gtggaagtga 44581 agtgcttcat gcagcacaca gaggccccat caggccttgc gaggggctcc cagctctttc 44641 tcccacatgc tcgggaggag ggatgttcct gagacaggta agctccgaag gcagcccagg 44701 ggtgaggccc gactcctagc aggctggtgg acgcaaggga gtgagcgaag ctacacgcag 44761 gaatcaacgc tccattttgt taagagacaa aagtcatctt cagatgacgg tgtttgcacc 44821 ctgtaactga tggtgattta catgggggtc cagaacccct tgcttggtgg gaagacagat 44881 gtggaaaaag atacttaaaa aaaaaaaaaa agtgcaaacc cagcatgcgc tatgtagaaa 44941 tagagggagg tatgaggacc ccccgggcag agatgcttcc agacagagag aacagcaaat 45001 gcaggagggg tgccttccct gcgcaccttg caaccccaga tcttgtctct atctggcatc 45061 tggggggtgg ggggatccta caaatgtgag aatgaccctg aggacaagca gcgggcgcca 45121 tgagatggtt ttatgcagag aagggatgag tcagatttgg tgctttaggg gctcggcgca 45181 gtggctcaca cctgtaatcc cagcgctttg ggaggctaag gcaggaggat tgcttgagcc 45241 caggagttag aagttatgat gagctatgat caccccactg cactccagcc tgggtgacag 45301 agtgagaccc tatctatatt taaaaaaaat aggtttgtac tttaggaagt aaacagaatt 45361 ggctctaaaa aggaggctac aatctgcctt ctgcttcaag gagctgggga ggagcaaagg 45421 atgagacacc agggcatggg gtgtgggatt ctctgggaga ggcggcttca gatgggcaga 45481 ggtcagctga ggcaagtgac tgctcttccg atgtgctgtc cctcctatcc ctcccgaggg 45541 taagaaaggg cttgctgtgt ttccttggag ctcaggggtg ctttcagcct ctgtggcaag 45601 gaagcgtggc caggttgaaa atcaaacatt taataaactg tggggctggg ggtggggaga 45661 acttgccaag ggtggggagc acgaagcctg ggcctatggc aggggctgga cttgctacat 45721 tttgggagtt cagcacaagg agctttgggt ttttgttttt ttctgccaag atgcattata 45781 gaaccatgaa aaacaatttc tcagtgtggc ccaaatggtg gcctcacagt cttctcctgg 45841 agactgtcac taaggcgtga gtctcccggc agctgtcagc attcaccatc tctgttggtc 45901 tgtacttctg aagcagtggc ctggggatac atgaaaattg attttaaagg gaagttgtga 45961 gctatgaaaa ctccaaaccg actctgtatt aatcttgtcc aggtacacat gacattcacg 46021 ccctgatcct tgggagagct atcacgggaa tccaggcgtt cacggccagc aagtgagccg 46081 ctccatcagg ggcccgaaac tctcaagccc ctttctggag agatgcctgg ctggaccgta 46141 ggagcgctgt gctctgagct tagaaaggga ggtggcggat ggagtgggaa gtgagagaca 46201 ctgattttta aatatcaaaa tttcccttct gaagtcgttc agatgtgttc cttaaaaaga 46261 agatggaatt ctctgtagag cgtctcaatc cacttttaac catggatgag agcagactcc 46321 atttaccctg aaatagcagc ttctcttgag aggagagtga catggaagca actccgtctg 46381 ctgcagctga ccccctcaca ctgagttcac agtgcgccct ccctccctcc catctggggg 46441 tagtgcctta tgctgggtgt tggagcagag tgagggagag gaaaataaag acctgcacat 46501 ctgaccccaa ggtgtcaggc cggtttactg gtaaccacct gagaagtagt ttcagccaca 46561 gaagaaacga acacgtctgg gggctgtgag ttgccgggac gtggtgggga ctttccccta 46621 gagtggtctg gcccccatct gagtcttagg ctctgctgta aaaaagaaac ccaggttagc 46681 agtgccggtc tgtgccaggg ctgtgggcag aggggatgat ttctgagtgg caggggtgca 46741 agggactttg tcctttctgt caccacagag ctgctcatac aacagtggga catgacaacg 46801 gtttggcttc tcagctcaga cacaccatgt tataggggat cacaaacaag atggtgagtg 46861 caccaagggc tttgcaggga ctgtggcctg gcggggtcag ggctgggtgc ctggtttctg 46921 ttcacattta ctcttggctg ggccatccct gtcctcttcc tcaggcagcg ccccccctgt 46981 gagtttactc atacctcagg ctggagacac aggtctttgt acacagtctc cacgctgtgg 47041 cagacttgtt tgagctctgt ctccaaatgg ctgatctttg ccattttctg ggtgaactct 47101 tggagctttt cctggatgat cttgttgtgg tcattataaa tctgatagat cctctcctct 47161 aatttctctg tcagatccga aacctgttaa acaataagat gtgttctcgg aaaatcaaac 47221 aagaggtatc acaagacatc tgtgttaagt gggtttctta gatttctgca atttatctta 47281 tgaaaatgac aaaataggct gggcacggtg gctcatgcct gtaaccccag cactttggta 47341 ggccaaggtg ggtggattac ttgaggccag gagttcaaga ccaacctggc caacatggtg 47401 aaaacccatc tctattaaaa atataaaagc taggacgggt gtggtggctc acgcctgtaa 47461 tcctaacact ttgggaggcc gaggcgggcg gatcacgagg tcaggagatc gagaccactg 47521 tgaaaccccg tctctactaa aaatacaaaa aaatttagcc gggcgaggtg gcgggcgcct 47581 gtagtcccag ctacttggga ggctgaggca ggagaatggc gtgaacccgg gcggcggagc 47641 ttgcagtgag ctgagatcgc gccactgcac tccagcctgg gctacagagc aagactccat 47701 ctcaaaataa ataaataaat aaataaataa ataaataaat aaatacacaa atacataaaa 47761 actaactggg catggtggtg tgcacctgta atcccagcta ctcgggaggc tgagacagga 47821 gaattgcttg aacctgggag gtggaggttg cagtgaacca agatgatgcc attgcactct 47881 aacctgggca acaagagcaa aactccgtct caaaaaaaaa aaaattagca aggcataggt 47941 gacgcgcact tgcagtccca gctactcgga aggctgaggc aggaggatca cttgagccca 48001 ggagttggag gctacagtga gtcatgatca cactactgca ctccagcctg agagacccta 48061 cctcaaaaga agaaaagaaa aacaccaaaa actaaaaccc agtgcagtga ctgggggtca 48121 gattgcctat attttaaatt tttttgcttt ttttgttttg ttttcttgcg gcggagtttc 48181 gctctcattg cccaagctgg agttcaatgg catgatcttg gctcactgca acctccgcct 48241 cctgggttca agcgattctc ctgcctcagc ctcccgagta gctgggatta caggcgagcg 48301 ccaccacacc cagctaactt tttgtatttt tagtagaaac ggggtttcac catgttagcc 48361 aggctggtct ccaactcctg acctcaggag atccccacct tcagcctccc aaagtgctgg 48421 gattacaggc atgagccact gcacctggcc tgttaaattt ttttaagagg caaaagtata 48481 tacagtcgtg tgctgcataa tgacattttg gtcaatgtca gactgtatat acaatgatgg 48541 cccataagag tataatgcca tatttttact gtaccttttc tatgtttata tatatttaga 48601 tacacaaata cttaccatca tgttataagt gcctacagta ttcagtatgg taacatgctg 48661 tgtgcaggtt tgtagcttag gagcaatagg ctcttccacg tagcctaggt gtgtagtaga 48721 ctataccatc taggtttgtg taaattcaat ctatgatgtt cagatgatga cagtgtcacc 48781 tagggatgca tgtcacaaaa ctgtacagta caagctcagc caggttagga agaaagtaca 48841 cagaatagaa gaggcatcca acaaagtggc ctctgggtgc tgtatcaaaa taaataaata 48901 aataaataaa tgtttttgta tcttttctac tagaggccaa ggcagcaggg tgaggagttc 48961 aagaccagcc tgggtaacat agcaagatcc catctctaca aaaaaaatgt ttaaagaaaa 49021 ttagctgggc atggtggtgc acacctgtag tctcagctac ttaagaggca gaggtgggag 49081 gatcatttga gcccaggagt ttgaggctcc agtgagctgt gatcgtgcca ctgctctgta 49141 agcctgggca acagagtgag acgccacctc taaaaatata aataaataaa taaaacaagt 49201 gagccgggca tgatgttgca tgcctatagt ctcaagctac tcgagaggct gaggtgagaa 49261 gattgcttga gcccaggagt tcaagattgc agtgacctat gattccacca ctgcactgca 49321 gtctgggcaa cagcaaggct tgtctcaaaa aaaaagaaaa aggaaaataa agtcaatctg 49381 tcatcaagac acctagaacc agcaagctgt cccactcatg tgacccatta tgtattttgt 49441 ttgtttgttt gtttgctgga attataggca cgtgctacca tgcctggcta attttttttt 49501 ttttttcgag acagccaaga tcgtgccact ctactccagc ctgggctggg caacagagcg 49561 agactctgtc tcaaaaaaaa aaaaaaaaga aaaagaaaaa accagaaaga ggcagtcttc 49621 tttttcttct tttttttttt ttttgagatg aagtcttgct ctgtcaccca ggctggaatg 49681 cagtggcggg atctctgctc actgcaagct ccgcctcccg ggttcaggcc attctcctgc 49741 ctcagcctcc tgagtagctg ggactacagg cgcccgccac cacgcccagc taattttttg 49801 tatttttagt agagacaggg tttcatcatg ttaaccagga tggtcttgat ctcctgacct 49861 cgtgatctgc ccgcctcggc ctcccaaagt gctgggatta caggcctgag ccaccacgcc 49921 cggccctctt cttttttttt ttttgagacg gagtctcacc tgtcaccagg ctggagtgca 49981 gtggtgcaat ctcggctcac tgcaacctcc gcctcccagg ttcaagcaat tctgtctcat 50041 cctcccgagt agctgggact acaggcgcgc accaccatgc ccagctaatt tttgtatttt 50101 tcatagagac ggggtttcac catgttggcc aggatggtct ccatctcctg accttgtgat 50161 ccccctgcct cagcctccca aagtgctggg attataggcg tgagccacca cacctggccc 50221 ctatttattt atttttgaga cgaaatcttg ctctctagcc caggctgcag tgccgtgcag 50281 agccactgca ctgagccaat ttttttattt ttagtagaga cgtggtttca ccatgttggc 50341 caggctggtc tcgaactcct ggcctcaagt gatccacctg cctcggcctt ccaaagtgct 50401 gggattacag gcgtgagcca ctgtgcctgg tccccccatt atatattctg aagaggcatc 50461 aggtgcaacc attcatgtag gacctgaaca tcacaccagc ttttgcaaga tgtaagccct 50521 gggcatgtgg aagaagctcc actcccccgt cacagaccat tcgatggctt ctgcaggaca 50581 tatgggtgtg acaatgcaag ttgtgcctca accccctact cccgtggctg ctcgaccttg 50641 cgtgaagtaa gagggctgcg tgttggaggc aggagggcat ccaacgcccc gctccctcag 50701 cctggctctc tcgtcaggca aatcattcat ttccaagcct tatctatatc atgggtccaa 50761 tctcttctct gtaggactat tctaagggta aattgagata cagatggtcc caacttacta 50821 cagttcaact tactacattt ttgacttaca gcattttcaa ctcacaatgg gtttattggg 50881 acgtagcccc atcataagtg gaggagctcc tgtgctgtgt gtgaatggcc tagcaccgtg 50941 cctggtgtac aacagacacc atcagtggtt cattccctcc ccttttgcat aaggaatccc 51001 ccctctcgct gggtggagtc tgtcaccttg gtcttcaggc tgttcctgaa gttggtcatg 51061 agtgcatggt ccttttgccg gctcttgttg atgttttcga tcagctcctg ggctctcttc 51121 tgcaggatgt caatgcttga gtccagagag gagaagtaga gtcccgactt cccttccagg 51181 accgtcagct ggcaactggc actggaggtg gcgacacaag ggcaagaaac ctgacttctc 51241 agagtacagg aagcggccag gtgtagtcgc tcatgcctgt aatctcagca ctttgtaggc 51301 cgaggtgggc ggatcacaag gtcaggagtt cgagaccagc ctggcaacat ggtgaaacct 51361 cgtctctagt aaaaacaaaa aaacaaaaaa ttagccaggc acggtggcac gtgcctgtag 51421 tcccagctac ttgggaggct gaggcaggag aattgcttga acccagggga tgaagttgca 51481 gtgagccgag atcacgccac tgtactccag cctgggtgac agagcgagac tccgtctaaa 51541 aaaaaaaaaa ggacaggatg ctgagatcag caccagactc ttccaagagc ggctgatctg 51601 agagagatca ggtgggaggg gacccgcgtt gttgccttcc atgggtgaca cgaggcatgc 51661 ttcctaggtt attccattca cacacacgcc ctgaatgatg agaatggcac ccccatttta 51721 tagctgagaa aacagaccaa gaggcacgac taaatatgcc caaggttcca gaccaaggag 51781 gtctgtctgt ccccaaagcc cactgttatt ttatgacacc aacttgcttc tttggaattt 51841 tggaatacca ataatagtgt caatgctttc tttctttttt ttttgagacg gagtctcact 51901 ctgtcgccca ggctggagtg cagtggcgcg atctcggctc actgcaagct ccacctcccg 51961 ggttcatgcc attctcctgg ctcagcctcc tgagtagctg ggactacagg cgcccgccac 52021 cacgtccagc taattttttg tatttttagt agagacgggg tttcaccgtg ttagccagga 52081 tggtctccat ctcctgacct cgtgatccgc ccgcctcagc ctcccaaact gctgggatta 52141 caggcgtgag ccaccgcgcc cagcccaatg ctttctttct tattttcttt ttttttatag 52201 agacaggtct ttctctgttg cccaggctgg agtgcagtgg cacaatcata gctcactgca 52261 gtgtcagctt caagtgatcc tcccacctca gcctcctgaa taggtgggac tacaagcatg 52321 tgctaccatg cacagctaat ttttaaaatt tttaaataga gactgggctt cactatgtta 52381 ttcaggctgc tctcaaactt ctggcctccc aaaatgttgg gattacaggc atgagccact 52441 gcgcctgact gatgcaatca gcaatctcct gcaacctctg tctcctaggt tcaagccatt 52501 cttctgcctc agcctcccga gtagctggaa ttacaggtgc acaccactac acccaggtaa 52561 tttttgtgtt tttagtagag acagggtttc accatattgg ccaggctggt ctcgaactcc 52621 tgacaagtga tcctcctgcc tcggcctcca aaagtgctgg gattacaggc gtgagccact 52681 gcgcccagcc tgatgctttc ttaagtgacc ttcctgtcat taacctgctg catccctttc 52741 ccaacctact gggtgatgtt gaccgcctgg ggctacagtg tacctgcacc tccttgctcc 52801 caccaggtga ggtgaatacc tgttgaatta tatttaaaga gcatttaaag aaatgcgcac 52861 acacaaaaag ttgttggtcc catgttgtac atgtaactgt aattatgtaa ttcaaatatt 52921 gtgttaacaa gtgaaatagt ggcagaagag tgacagctat ctaaaaagct aaacagaatg 52981 aaaaattttg agtaatatgt aatgttccgt aacttcataa tattcaatgt aataaaatgg 53041 aacttcagtt gggtatggga gagagaacta aaattgaagc aaagatcata aaaatctaga 53101 aaaattctgc atatagattg ctctgccaag gccttgttct ctgctttaaa aacagaactg 53161 gggccgggcg cggtggctca agcctgtaat cccagtactt tgggaggccg aggtgggtgg 53221 atcacaaggg caggagttcg agaccagcct ggccaacatg gtgaaacccc atctctacta 53281 aaaacacaaa aaattagcca ggcgcagtgg caggcgcctg taatcccagc tactcaggag 53341 gctgaggtac aagaatcgct tgaactcggg agccgagatc acaccactgc actctagccc 53401 gggcaacaga gtaagacttc gtctcaaaaa taaataaatt aattaaatta aattaaaata 53461 aattaaaaca actgaaaatt gtgggtaatg cattgtgaat gaagtttatg gaaaaaatgt 53521 gatctgataa ggcaacccat tggtatttta aattaagata tattttttct tcttctgatg 53581 ctctggttta tttgaccggt ttgacgtctg atcagttagc ggttctggcc aggtagagct 53641 gcccctgtga gtcacagcta acttttcctc agccatgttc ccggcactga gctaagtcac 53701 atgccttgtc ttggtgcatc ctcacgacca cactcgggtc atctgctgtt ctctttacga 53761 catgggctta gagtggttac accaccttct caaggtggag gacagtctgt ggcagagctg 53821 ggctttgtac ccagaaggcc tgacttccca gctggtatcc tgcttcctac taaggcactc 53881 aggcgccact cccctcctgg atcttacatt tcctcgtcgc cctctgaggt cctccagctc 53941 tgaaaggcca agccttgtag gtgtggacag agcagttcct gaatgcagca ggtcacactt 54001 gtgtccacaa gcaggttact gattacttac aaaaataaat cttggccagg cacagtggct 54061 cacgcctgta atcccagcat tttgggaggc cgaggcgggt agatcaccag agatcaggag 54121 ttcgagatca gcctggccaa catggtgaaa ccccgtctct actaaaaata caaaaattac 54181 ccaggcatgg tggcatgtgt ctgtaatccc agctactagg cggactgagg caggaggatc 54241 gcttgaacct gggaggcaga ggttgcagtg agctgagatc gtgccactgc actccagcct 54301 gggcaacaga gcgagactcc atctcaaaaa tataaaataa aataaaaata aatctcagct 54361 atctcaacca ggcacagtgg gctcacgcct gtaataccag cacttcgggg ggctaaggca 54421 ggtgaatcgt gagctcaggg ttttgagacc agcctgggca acatggcaaa accccatctc 54481 tacaaaaaac acaaaaatta gccgagtgtg gtggtgtgta cttgtagtcc cagctactta 54541 ggaggctgag gtgagaggat cacttgagcc cgggaggtca aggctgcagt aagcttgatc 54601 atgccactgc actggagcct agatgatatc acgccactgt actccagcct gggcgacaga 54661 gtgagaccct gtttcaaaca aacaaacaaa acaaccacaa cacacatata cacccctccc 54721 ctctccaccc ggtctccccc agccggccac ctccctcttt cctccccatc aaggccaagg 54781 ttctagaagg agctgcttaa attttctggg cttttttttt tttttttttt taagagagag 54841 agagagtctc accctgggct tcttttcttt tcttttgaga cagagtctcg ctctgtcacc 54901 gaggctggag tgcactggca tgatctcagc tcactgcaag ctctgcctcc cgggttcacg 54961 ccattctctt tcctcagcct cccgagtagc tgggactaca ggcgcccgcc accacgcccg 55021 gctaattttt tgtattttta gtagagacag ggtttcaccg tgttagccag gatggtttcc 55081 atctcgtgac ctcatgatct gcccacctca gcctctcaaa gtgctgggat tacaggcgtg 55141 agccaccgcg cccggccatc ctgggcttat tttcttaccc ccattcactt ttcagtccat 55201 tgtaatcagg ttcctccccc ggttctcaaa tgaaacagct cctaccacag tcaccatgac 55261 cttgtggtta gaccgataga catctcagag tgacatgctg acctcttggc agcagccagc 55321 actgctgacc acccttccct gctactcccg acatgtggta tcccagaggt ctgtgcttgg 55381 gcctctgctg tcttcttgcc agacaccgtc aatcacacac ttgttctgca gtgaggcctc 55441 catgcctcct ccctgctggc ccctgcagct aatatcccag ccaccattcc acaatgcaca 55501 ggctattatt ctccaaaggc attcattccc gtagcatggg atccctgaac agtccccttt 55561 cccacctacg tcctgtctcc tccagaaagt gttccttggc tggattaggt gcacacagca 55621 cctacggttt cactgtttac accttggcct cccactgtag tctgctgaag ggccagggcc 55681 atccttcatc tttgtatctt cagtacttac ccctcatgag agaagcaata aatgtgctgc 55741 taataagtaa atgacttgcg agtactatcc attaccatcc acactttgga ttctatcatc 55801 taaattctat catcccagcc catgcctctc tcctgaggac ccaagaatat atgaaacctc 55861 ccagcctccc cacatggctg tctcaaaggc atctgaaact ccacatttgc acaaccaaaa 55921 gcgtcatctc ccaggcaacc cgatttcatc ttccattccc ttagttaatg acataaaaag 55981 ccacctggtt cccaagctca aaacttagca gcaagcctag agtccagctt ctctctgtcc 56041 ctctgcatcc tgtctcaaaa tcccctcatt tctaccttct ttgttttttt atttttattt 56101 ttaattaatt tttttttcag atggagtttc actcttattg cccaggctgg agtgcaatgg 56161 cgcgatctcg gctcactgca acctccgcct cccgggttca agcgattctc ttgcctcagc 56221 ctcccgaata gctgggatta caggcatgta ccaccacact cggcgaattt tgtattttta 56281 gtagagacgg agtttctcca tgttggtcag gctggtctca aactcccgac ctcaggtgat 56341 ccgccctcct cggcctctca aagtgctggg attacaggca tgaaccactg tgcccagcct 56401 ttttatttat ttgtttttga gacagagtct tgctttgttg cccaagctag agtgcagtgg 56461 agcgatctca ggtcactgca acctccaact cccggtttca agcgattctc ttgcctcggc 56521 ctcccaagta gctgggatta caggcacctg ccaccacacc cagctaattt ttgtattttt 56581 agtagagacg gggtttcaac atcttggcca ggctggtctc caactcctga cctcgtgatc 56641 tacctgcctc agcctcccaa agtgctggga ttacaggcgt gagccaccat gcccggccct 56701 accttcttaa tgtctctaaa tctaccatca ttagcagctc tataactttc acttctcaac 56761 taggccatag ggacctttat tttatttact tatttatttt tgtaaaaaat tttgtaataa 56821 attttatgtt aaaaaatttt tttttctttt gtaagaaatt ttcttttgta aaaaattttg 56881 ttattattat tttgtttttg agacagactc tcactcggtc gctcaggctg gagtgacagg 56941 ctggagtgtc gctcaggctg gagtggtgca atctcagctc actgccacct ccacctcctg 57001 ggctcgagtc attctcgtgc cccagcctcc tgagtagctg ggactacagg cacacaccac 57061 cacgcccggc taatttttat atttttagta gagacagggt ctcaccatgt tggccaggct 57121 ggtctcaaac tcctgacctc aagtgatccg cctgcttcgg cctcccaaag tgctgggatt 57181 ataggcatga gccaccacgc ctgacctttt atatatatat attttttttg agatggagtc 57241 tcgctctgtc ccccaggctg gagtgcagtg gcacaatctc ggctcactgc aagctctgcc 57301 tcccgggttc atgccattct cctgcctcag cctcccgagt agctgggact acaggcgccc 57361 gccaccacac ctggctaatt ttttgtattt ttagtagagg cggggtttca ccatgttagc 57421 caggatggtc tcaatctgct gacctcatga tccgcccgtc tcggcctccc aaagtgctgg 57481 gattacaggc gtgagccacc gcacccggcc tgaccttttt ttttcttttg aaggcagggt 57541 ttgctacatt gccaagctgg agtacatcag tacaattact gcaactttga actcccccag 57601 gctcaagtga tcctctcgtc tctgcctcct gaatatctag gactacaggc gggtaccact 57661 actcctggct aaattttgtt taatttttct gtatagatgg aggtcttgct atgttaccca 57721 agctggtctc aaactcctgg cctcaaaaaa tcctgcctcg gcccccaaag tgctgagatt 57781 acaaacgtga gccacagtgt cagcccctag tgacattcta cactgtaatc tgatttccta 57841 ctcctgtttg ttagcctaaa accatcccca ccagccacaa gatcaagtct aaattccttc 57901 acacagcata caaggccttc cagggccctg gcctccaccc tctcttccag ccttaatggg 57961 tttttttctt tttctttctt ttttcttttt tgagaagggg tctcactctg tcgcccaggc 58021 tggagtgcag tggcatgatc ttggctcact gcagcctcca cctcctgggt tcaagcaatt 58081 ctcccacctc agcctcccct gtagctggga ttacaggtgc ctaccaacca tgcctggata 58141 atttttgcct tttttttttt tttttttttt tttagacgag gtctcgctca gctgcccagg 58201 atggagtgca gtgccacgat ctcggctcac tgcaaccact atctcccagg ttcaagtgat 58261 tttcccatct cagcctcctg agtagctggg attacaggca cctgccatca tgcccagcta 58321 atttttgtat ttttagtaga gatagggttt caccatgttg gccaggctgg tcttgaactc 58381 ctgaccttag gtgatccacc tgccttggcc tcccaaattg ctgggattac aggcgtgagc 58441 caccgtgccc agtgcagcct tactgttttc tttttgtttg tttgtttttt tcaagacaga 58501 gtcttgctct gtcacctagg ctggagtgca gtggcacgat ctcggctcac tgcaacctct 58561 gcctcccggg ttcaagcaat tatcctgcct cagcttccca agtagctagg attacaggtg 58621 cccaccactg cacccagcta acttttgtat ttttagtaga gacagggttt caccatgtta 58681 gccaggctgg tctcgaactc ctgacctcgt gatctgcccg ccttggcctc ccaaagtgct 58741 gggattacag gcgtgagcca ccaagcccga tctagcctta ctgttttcta cagcccctcc 58801 atctgcagca attggggacc taccacaatt tccccaaaca tgtcagcctt tgcctggaat 58861 gactttccct tcctgcctcc ttcataatgg aacatttgga acttttgctt ctctaccccc 58921 cgctatacct aggccttgct ggtcacaaag ctgccaaagt ttatgaacat cagtcccctc 58981 cagcgcacga caccttgagg gtgcagacta caccctactc atttgttttg cagcacccca 59041 aaggtttcat cagtttgctg aatatctaaa tgaatgcagc actgacattg gggctcccac 59101 acctcctcgt acccagttca ccttgttgtc tcagcctctc ctggatgctc tcaacagttt 59161 cagaccctaa aattaaaatc gatgagagca ccaggtgttt ctttttgtgt gattgttcac 59221 aacctgcttg agctgctaat ggttcagagc tgctcctttt taaggctcaa gggatgaggc 59281 tggaaccggt tttcatggca gctaagcaaa cagtaaaaaa aaaaaaaaat cacttgaatt 59341 cagctaattc gagtttcatg gtaggacaaa ggcctgcatc agatgtctgc tgagagcttc 59401 tttatttttc caacctctcc tttggggatt tatccctttt gatttcattt ttcttaatac 59461 acagcagctg catattaatc aggatgctaa tgaatcatag gcttccctgg gcctgcattc 59521 accaattaac tttaacaaac aaaatgcgtg gtgtcctctc tgctggcttt cccggttcct 59581 aggctggctt ctgttgccga accctggacc gaggccagag aatgcaagtc agcccaacac 59641 agcacgtcgg gtgtgcatgg cctgcgcccg gattgggcag cgggctgggg gctgtgaaca 59701 cagcgctgcg ttcatctatt atggtctgtc cttctaagaa gtccctaatt tggctgcgag 59761 gattaggaca tagttcccaa gcagggccca cgctggcagg gggcccagga gctgcggagg 59821 aaatgggtct aagcgaagag accatctgtt cgctccaggt aaagctgtga ttggcaggcg 59881 actgagccag aactgcccgc tggtggggcc ctccctccgt ggcctctaac agatatacaa 59941 gtaaacaatc cgccagccac ttatgaccac cgcgggtggc agtcccggga gaggaggcaa 60001 gctaatcaaa cattcaaggg gggaacacaa cagaatctct ggcgcttggg caaaacaatg 60061 ttttgctgaa gtacagccag gaagggggaa ggtttgaatg gaagctgatg agccagttcc 60121 tcaacctctc cttttctctc tggaaacgtg ttaacatttt aatcatgttg ctgagtatct 60181 gctgctggat ctattaactg gcccctgcac tggttttaat ctatctgttt tttggcccag 60241 ggttcctgtg gagctccttg aagtctctaa atgatttgca gcggtctgtg tctcttcact 60301 acacagatcc tggtctaaac atgctattcc gcagtaaggt attagcaaac aaacgacttc 60361 aaccattgct tccctttttt tttggtctcc aattgacagc acacacctat aaaccatttc 60421 acgcgtggcc tgctttccag cagaggaatg tgcgaggaga gggaagctgg cttgctttct 60481 gcctagagtc ctcagggccc tgtcctgacc ctcctctcac tccttctcca ggtgagctca 60541 ttcattccac agctgctgct gaaacaatga ctcccacaga accatctcca gaccctcacc 60601 ccaccccgag ttccgggtca gaaagttgaa ctgtatcctg gacatgtgga tgtccctggc 60661 caccgaaagt ggctcagata aacaaggaca acacagaact catgttcttg tctctgccag 60721 agaagccacc acctaccgga aaccaccaca agctggaaag cagacagcag tgatgtgctt 60781 gtaagcattt agcaactggc tgtgtgatgg gggggaagcc ctgattcgta gtgtttgcca 60841 atttccgtgg tgcagagcct cccaccatta gccaatttca ggctaccaac atgaagacgc 60901 taaatgccaa gttgggaaga gatgtgtaga attggctctt tcaagccggt aggagctggc 60961 cccagtacat catggccggg aaacatcctc cctcatcccc ttcctgtgtc ttatggcaca 61021 aggcacttcc tccatctgcc ctgctaaagg ccaccccccc cacacccccc ataaaagcct 61081 cccactagac ctccctacac ccacacattc tccctcccaa ccattctcta caaagcagcc 61141 atattttttt attaatattt tttgagacag ggttttgttc ttgttgccta ggctggagtg 61201 caatggtgca atcttggctc agtgcaacct ctgcctccca ggttcaagcg attctcctgc 61261 ctaaccctcc caagtagctg ggattacagg catgcaccac cacccctggc taattttgta 61321 tttttagtag agacggggtt ttgccatgtt ggccaggctg gtcttgaact cctgacctca 61381 ggtgatccat ctacctcgaa ctcccaaagt gctgggatta caggcatgag ccatcgtgcc 61441 cggcctgcag ccgtatcttt ttataacggg aacctgagca tgtcacttct gtgacccagc 61501 agttcccctc ctaggtatga gtatgcccag aagaaatgaa atcatctgtc cacacacaaa 61561 cttttctaaa tggatttcat agcagtgctg ctcataatag ccaaaaagtg aaaaccaccc 61621 gaatgactat caacagatga gtggataaac aaaatgtggt agactcatgt ggaatattat 61681 tcagccattg aaaagatgaa gtccagatgc agtggctcac gcctgtaatc ccagcatttt 61741 gggaggccac tgtatctggc ctggagtgat gaagctgttt taaagttgac tatgagaatg 61801 gttgtccagc tctgaatata ccaaaagcaa tcaaatcata cccttcaagg aggtgagccg 61861 ggcgcggtgg ctcacgccta taatctcagc actttgggag gccgaggcgg gcggatcatg 61921 aggtcaggag attgagacca tcctggctaa cacggtgaaa ccccgtctct accaaaaata 61981 tataaaaaaa agccaggcct ggtggcacat gcttgtagtc ccagctactc gggaggctaa 62041 ggcaggagaa ttgcttgaat ctgggagggg gaggttgcag tgagccaaga tcatgccccc 62101 tcactccagc ctgggcaaca gagcgagact ccatctcaaa aaaaaaaaaa aaaaaaggtg 62161 aaccgtatgc tatgggaatt atatctcaaa gctgttactg aaaaaacctc taggataaag 62221 tctgtagccc acaaaatgcc tcataacctg gccccttctg cagccctccc cttgcactca 62281 cccctgtacc ccactcgctc gcctagccac actggccttc ctttagctcc cccgaatgcc 62341 acgagcaggg acttcgcacg agctcttctc tcaacccctc cttccccacc ctgcccacac 62401 tttctgcttc aacatctctt ctacaaggac accttccctg gctccccatt ccaggctggc 62461 tgcccacagg ggaaaggtct cccagcacca tgatctcttc cctgttggtg tttggtcaga 62521 tgcaaacagc aggagggtat gagcatgtct gtctggagga caggagtgtc cctgctgcca 62581 atcacaatac tgccatatca gccttttttt tttttttttt cctcgagaca gggtctcact 62641 ctgtcaccca ggatggagtg caatggtgct atcttggctc actgcaacct ctgcctccca 62701 ggttcaagca attctcctgc ctcaggctcc ggagtagctg ggattaaagg catgcgccac 62761 cacgcctggc taattttttg tatttttagt agaaacaggg tttcactatg ttggccaggc 62821 tggtctcgaa ctcctgacct cgtgatccgc ctgcctcggc ctcccaaagt cctaggatta 62881 caggttgagc cactgcgccc ggccaacatg agcctccttc ttgctctttt gagaacacat 62941 caagcttatt cctacctgag ggacttcata cttgcggccc cctctgcgtg gaaggtttgg 63001 gcccagaatg cctcctggct ggcctcttct catgtgacct cttcagagtt ctccaactgc 63061 tctaaggtcc ctgtcactct acttgctcct tactatattt tgtttcttca ttccacttac 63121 ttgccaccag ttgaaatcac attattcact tatttatttt tgagacaggg tcttgctctt 63181 ccagtgcagt gaagcaatca cgtctcactg cagcctcgac ctcccaggct caagtggtcc 63241 tcccacctca gccttccaag tagctagggc cacaggtgtg tgcaaccgca ccacaccacg 63301 ctcatttctt tctttctttc tttctttttt ttcagagaga tatgtctccc tctgttgccc 63361 aggctggtct caaactcctg ggctcaaggg atcctcctgc cttggcctcc tgaatagcca 63421 ggactacatg catgcaccac cacgcctggc tgatttttgt gtttttagta gagacggagt 63481 ttcaccatgt gggccaggct ggtctcgaaa tcctgacctc aagttatctg ccagcgccag 63541 cctcccaaag tgctgggatt acaggcgtga gccactgcgc ccagcctcta ctatatatat 63601 actttttttt tttttttttg agacagagtc ttgctctgtc accaaggctg gaatgcaatg 63661 gcacaatctt ggcttactgc aatctctgcc tcccaggtta aagcaattct cctgcctcag 63721 cctcctgagt agctgggatt acaagtgtct gtcaccatgc ctggctaatt tttgtagaca 63781 cagggtttca ccatattggc caggctagtc tcaaactcct gacctcatga tctgcctgcc 63841 ttggcctccc aaagtgctgg gattacaggc gtaagccacc gtgcccggcc tactaaattt 63901 ttcttttttc ttttttttga gacggagtct cactctgtcg ccaggctggc atgcagtggc 63961 atgatctcgg ctcactacaa cctctgcctc tggggttcaa gcgattctcc tgccttagcc 64021 tcccgagtag ctgggactac aggcgtgcac cacaacaccc agctaacttc tgtattttta 64081 gtagagacgg ggtttcatca tgttagccag gatggtctcc atctcttgac cttttgatcc 64141 acccccctcg gcctcccaaa gtgctaggat tacaggcata agccaccatt cctggctttt 64201 tttttttttt tttttttttt ttttgagaca gcgtttcggt ctttttgccc aggctggagt 64261 gcaatggcgc catctcagct cactgcaacc tccacctcct gggttcaagc gattctcctg 64321 cctcagcctc ctgagtagct gggattacag gcatgtgcca ctatgcccag ctaattttgt 64381 atttttagta gagacggggt ttctccatgt ttgtcaggct ggtctcaaac tcccgacctt 64441 aggtgatcca cccgccttgg cctcccaaag tgttgggatt acaggtgtga accactgcac 64501 ctggccaata ttttttaata aattggcccg acattctggc ccgcctctag tcccagccat 64561 ttcggaggct gaggtgggag gatcacttga gtctgggagg tggaggttgc agtgagccgt 64621 gatcatgcca ctgcgtggtg acaaagcaag acgctgtgtt aggagggaaa aaaaaagcag 64681 cagacactgg catagccttg tggaagaaaa gggtgaatga gagggaccca ggggcctgtg 64741 tagacccata gggctgggat cttcctcacc tagctggccc tccaccagct tcctcctcgc 64801 agttctcttc ccaccgcgga tgctccttgc tctcccccaa gggctgcggt tcctggtctt 64861 tgcatttcac atggggcacg tccacctgca agcacagtca ggacggaggc caaggaggga 64921 gaatgaggag tgaacagatc gccctcctgc cgacccttca accctggtca cctcgatgtg 64981 ctgctgcggg acggtgggga cccgcaggaa agacgggcag ggctggggca ggtgccatga 65041 gggcaggggc atggggtgaa aggacactct gtccctaggg gacaccagga caccagacct 65101 agaggggccg ggtgagggca gggctgtggg aatgtaactg gaggacctgg gctcctaact 65161 ggtatgtgtg ttggctggag tccttcgaga aaggagaaga ggtaggagaa aagagatgag 65221 gccgggcgcg gtggctcacg cctgtaatcc cagcactttg ggaggcggag gcgggcggat 65281 cacgaggtca ggagtgcgag aacagcctga gcaatatggt gaaaccccgt ctctactaaa 65341 aatacaaaaa ttagctgacg tggtggtgcg tgcctgtaat cccagctact cgggaggctg 65401 aggaaggaga atcacttgaa cccgggaggc agaggttgca gtgagccgag atcacgccac 65461 tgcactccag actgggcgac agagtgatac ttcgtctcaa aaaaaaaaaa agaaaagaaa 65521 tgaaagaaac gggtgcggtg cgtgggtgtt tgtgcctttc tctactccgt tccggccacg 65581 cgccatgtgt ggaaatcaga cccgtcagtg cgtcagtcag ggccgggttc agtcagtcag 65641 gaaatttgag gccaggcctg atgagaggga gccccaatgg caaaggacaa gcggccgggc 65701 tcgggtccgc tggagatggc tgggatcgca gccgttccct gcctatctgt ccgcccgccc 65761 cacgcgcgag aaggaaacaa gcgccgcgta ctccctgtcg ctccattccg tattttcccg 65821 ccttcaagct cgcaccctct gcgcatgcgc cgaccccgcc cctggccagc tgcgctcccg 65881 cggacgagtg tgttgtgacg cgtgctcccg gccccgccct ctttgagaac ttgcgcggcc 65941 aactgggcgg ggccgaccgt taagcagcag tttcgcggtc cgcgggctgc gcgcgcagtc 66001 ggcgcccctt gggaacagga cggcgcgctc tgggtgcgct tgtgtgcccc tgtgaggctc 66061 ctgggttcca cggggcgccc aggttatacg gatctcagag tcctgtattt attcggtgct 66121 tctaacttca tccactctgc cttggaaaat atactccata acaatttttt tttttttttg 66181 agacaagatc ttggtctgtc gcccaggctg gagtgcagtg gtgcgatcac agctcactgt 66241 agcctcgacc tcccaggttc aagcgatcct cctgcctcgg cctcctgagt agcagtgtca 66301 ttcaccaggc ccttctaatt taaaaatatt tttctggccg ggcgcggtgg ctcacgcctg 66361 taatcccagc actttgggag gccgaggcgg gtggatcacc tgaggtcatg agttcgagac 66421 cagcctagcc aacatggtga aactgtctct actaaaaata caaaaaatag ccgggcgtgg 66481 tggcgggcgc ctatagtccc agctactcgg gaggctgggg caggagaatt gcttaaaccc 66541 aggacgcaga ggttgcagtg agctgagatc atgccactgc actccagccg gggtgacaga 66601 gtaaagctcc gtctcaaaaa aaaaaaaaag attgtatgta tatatacaca cacacacaca 66661 cacacacaca cacacacaca cacacacaca aattatattt ttttttcctg tagaaatgag 66721 atctcactgt gctgagccca ggctgatccc caactcctgg gctcaagccg tcctcccgtc 66781 ttggtctccc aaagtgctgt ctcaaaaaaa gaagaaatta tctgatctac cctattgact 66841 gtaggtcata agacccccgt ttcaaagaag tttctgcccc acacaaggcc tatctatcta 66901 gattcttctt ggcctctctg agcatgcatt cctgagactc caagaagaat ctagacagac 66961 aggccttgct gggtttcccc actcagccta ttagtattag acgacccccc cggctttttt 67021 ttttttttct ttttttgtct tgctctgttg tccaggcaac actggagtgc agtggcaaga 67081 tctcaactcg ctgcaacctt ggcctcccag gttcaagcga ttctcagcca accaagtagc 67141 tgggattaca gatgcaccat gcccagctaa tttttgtatt tttagtagag atggggtttt 67201 tccatgttgc tcaggctggt ctcaaactcc tggcctcaag tgatccaccc gcctcagcct 67261 cccaaagtgt caggattaca agtatgaacc acgacaaacc ctttttgtcc aatcaaattt 67321 ctacctgggt gttcaaactt tgctgaacct aagcataaga cacttttcat taatcaggca 67381 tggtggcggg cacctgtaat cccagctgct tgggaagctg aggctgctgt gagccaagat 67441 cgcccctaca ctccagccca gggaacagag ccagactccg tctcaaaaca aaacaaaaca 67501 aaaacacagt ttcccctgta tctctggatc ttcattctga aggcttgtca ggtaacacta 67561 tgatcaaata cattcgtatt ccttttctct tattaatctg ccttttctca gtaattttta 67621 gtgaaacttc agagggcaat aaggaagctt tctcttcacc cctacacagc caaagttcag 67681 atttgatgga aatggcaaga ttccagggta cttgtgttta tttagacaca taaatatgta 67741 tttaacgaca gggtctcact ctgttgccca ggctggagtg cagtggtgta atcacggctc 67801 attgcagcct cgacctccca ggctcaagtg atcaacctca gtctctggag tacctgggat 67861 tacaggcatg agccactatg ctgagctaat ttttgttttc tgtttttggg ggacaggtct 67921 cactctgtgg cccagactgg agtgcagtgg tgtgatctca gctccctaca acctccacct 67981 cctgggttca agtgattctc ccacctcagc ctcccagata gctgggacta cactacaggt 68041 gcctgccatc atgcctggct aatttttgtt ttttgttttt ttttttggaa cagagtcttc 68101 ttctgtcacc caggctgtag tgcagtggca caatcttggc tcactgcaac ctccacctcc 68161 tggttcaaac gattctcgtg cctcagcctc ccgggtagct gggactacag gtgctcacca 68221 ccacgcccag cttttttttt gtatttttag tagagacggg ttttcaccac gttggccagg 68281 ctgctctcga actcctgacc tcaagtgatc tgcctgcctc agcttcccaa agtgttggaa 68341 ttacaggcgt gagccaccac gcccagccta attttttttt tttttttttg agacggagtc 68401 ttgctctatt gcccaggctg gagtgcagtg gtgcgatctc cgctcactgc aagctccgcc 68461 tcccaggttc acaccattct cctgcctcag cctcctgagt agctgggact acaggcaccc 68521 gccaccacgc ccggctaatt ttttgtattt ttagtagaca gggtttcacc atgttggcca 68581 ggatggtctc aatctcttga cctcgtgatc cacacgcctc ggcctcccaa agtgctggga 68641 ttacaggcat gagccaccgc acccagccct aattttttta tttttagtag agacggggtt 68701 tcaccacgtt ggccaggctg ctcctgaatt cctgacctca agtgacccac ccaccttggc 68761 ctcccaaaag ttctgggatt acagaagtga gccactgtgc ctggctcagt acccagactt 68821 tttttttttt tttttgtatt tttagtagag acggggtttc accctgttag ccaggatggt 68881 ctcgatcgcc tggcctcgtg acccgcccgc ctcagcctcc caaagtgctg cgattacaga 68941 cgcgggccac ctcgcccagc ccagtaccca gaattttgac catcaagtgg cacactgagc 69001 tagggggacc caacaagacc cctcacccct aaggaaagcc accagatggg ggcaactgcc 69061 cactttatta gacaataggt ggcccacagg tctcctcagg gcccaccctc acagtagaca 69121 caccacacag gacaacagaa ggaacctgct acccagtcct ctgtccctgg gattctggtc 69181 ctgggacagg tgggaaagag gaaggtgggg gctggcctca cagaggcctc ataaatacaa 69241 ggtcactggc cagggatgca aaggagcgca gcagcaggga ctcggggagg atgacctgtc 69301 ctagagtggc ccatgtcacg cagcctcctg tgtgggaggg ggcctcggct cggcatccag 69361 gcggcacagg ggactgtcat acaccatctg caggttcacc ttgtggccca ccagctcccg 69421 gatattgttg atgccatatt tgatcatcgt tgggcttgtg ggggagggaa cagagtttat 69481 catgaggtca gcacgtgcac ccttctgctc agatttccat ggctcccatc ttcttcagag 69541 gaaaagttca agttctcccc atgacccaca gggccctgcg ccaactgctc tatcacctct 69601 caccctccgc cttcagccac accagcctcc tcactcttct cctaatacaa aaagcacatt 69661 cccacctctg ggcctttgaa ctgactgtga ccactacctg ggacaccctt ccccagatac 69721 tctcatggtt tacctcttta tctacctcta gattttgctt aggtggcact tcttcctcca 69781 tgatgccttc actgaccacg tagttgcctt agccctccct ataccactta ctgccctggt 69841 acacacttct gtcttctttt gctgagtgca tttgtttttt tgaaacagag tctcactctg 69901 taacctaggg tggagtgcag tggcgtaatc tcagctcact gcaacctccg cctcccaggt 69961 tcaagtgatt ctcatacctc agcctcctga gtacctagga ctaggactac agacgcacgc 70021 aacagtgcct ggctaatttt tgtagtttta gtagagacgg gagacagagt cacactctgt 70081 tgtctagact agaatagagt gcagtggcat gatcttggct cactgcaacc tctgcctcct 70141 gggttcaaac gattctcctg cctcagcctc ctaagtagct gggattacag gcacctgcta 70201 ccacgccagg ctaattttta tatttttagt agagatgagg tttcgccatg ttggccaggc 70261 tggcctcgaa ctcctgtcct caggtgatgc acctgcctca gccacccaca gtgcttggat 70321 tacaggcatg tgccaccgcg cctggcctgt ttttattctt tgtagaggcg gagacttgct 70381 ctattgccca ggctgatctc aaactcctgg ctcaggcgat cctcctgcct tggcctccca 70441 cagcactggg atgacaggca tgagctacca cgcctggcct gatgtacttg ttggtcttgc 70501 tagttatcat ccttctcgct ccactgggac cccagattca gagggaggga cataagctgt 70561 catgttcaca gcacggggct gagcattcag tttgtactca cgatgaaagt tttgtccatt 70621 ggatgaatga atgggtggtg ctgaaaccac gcaggcccct atacctggag agttatttga 70681 ggcagctgga caccccgagt ttccacttgg gcacttaccg ctccagggag aggccccagg 70741 caatgaccga cacgttctcg ggaagcccca tgggcagcag catctctgga cggaagaccc 70801 ccgagtttcc gacctccacc cacttcttca ggcctgcaga ggcaggacag aaaagacggg 70861 cagtgcgtgt tgagtttctg ggcactgctg gtacaggttc tgggtctagg tggtgagctt 70921 gggaggtggg gtacaggcat ggcaatgggg ggacccaggc caaaccatag tagcctgaga 70981 cctccctccc accccagtac cagggcctac cctgaccttg gtggtagctg aacacctcca 71041 tgctgggctc tgtgtatggg ttgtaggctg gcttgaagcg gagttgcgtg atacctgcag 71101 gaagtggggg gcgggcagga gagcaggggt ttggaggata atgctggtga tcaacacacc 71161 tgcccgctgc ctcaccacca tggcccaccc ctgcccccct gctcacccag cttggtgaag 71221 aactcccgca gaacgcccat gaggtggccc aaggtgagac catgatccgc caccacgccc 71281 tcgatctggt ggaactcagc caggtgcgtg gcgtccaggg tctcattccg gaatacgcgg 71341 tcgatggaga agtacttgac cggagtgaag ggcttctagg ggtgacaacc gagccaggcc 71401 caggtatggg tcagaaggtc cctttgacag caccctctcc ccactggggc ccccgcctgg 71461 gccaaccgca ccttctgggc aaggcggtag agcgcacggg cgctggctga tgtggtgtgg 71521 gttcgcagta ggtttttccg ggcctcgtcc agcttccagt tatacttgta ccttcaggag 71581 ggaaggtggg aagtccatgc aatggcccag gggtccccag cctccttccc ttccatacca 71641 ggtaagtcct gcctcacccc tgtgagccgt agccgccctg agagtgggtc cgcttgaccc 71701 gctggacata gtccattggg agctgcaggg cctccgctgg atctgggcag gacagagcaa 71761 catcaggtca gtcaacgagc atttcccacc cctttttttt tttcagacag ggtctcactg 71821 ttgtccaggc tggagtgcgg tggtgcaatc tgggcttact gcaacctctg cctcccaggt 71881 tcaagtgatt ctcgtgcctc agttttctca tttgtgtgtg gcttttttgg gttctttttt 71941 tttgagacag agtcttgttc agttggccca gctgaagtac aatggctcaa tcttggctca 72001 ctgcaacctc cacctcccaa gttcaagcaa ttctcctgat tcagcctccc aagtagctgg 72061 gattacaggt gcctgccacc acacccggga aatttttgta tttttagtag agatggggtt 72121 tcgccatgtt ggtcagggtg gtctcgaact cctgatctca ggtgatccac ccacctcggc 72181 ctcccagtgt tgggattaca ggcatgagcc actgcacctg ggctttattt atgtatgtac 72241 gtatgtatgt atgtattttg agatggagtc ttgctctgtt gcccaggctg gagtgcagtg 72301 gcatgatctt ggctcactgc aacctcctcc tcccaggttc aagcaattct cctgcctcag 72361 cctcccaagt agctgggatt acagaagtgc accaccacac acagctaatt ttttatattt 72421 ggtagacgtg aggtttcacc atgttggcca ggctggtctc gaactcctga cttcaagtga 72481 tccgcctgcc tgggcctccc aaagtgctgg gatcatgcca ccgcgcccga cctttttatt 72541 ttttcttttt ttttgagacg gagtctcgct ctgtcgccca ggctggagtg cagtggcgcg 72601 atctcggctc actgcaacct ctgcctccca ggttcgagtg attctcctgc ctcagcctcc 72661 tgaatagatg ggactacagg cacacgccac catgcccagc taatttttgt atttttagta 72721 gagatgggat ttcaccatgt tggccaggat ggtctcgatc cacccaccgt ggcctcccaa 72781 agtgctggaa ttacaggcgt gagccaccgc gcccggccta tttttttttt ttaaagggac 72841 aaaatttttt ttttttttag acggagtctc gctcagtcac ccaggctgga gtgcagtggc 72901 atgatctcgg ctaactgcca gctccgcctc cagggttcac accattctcc tgcctcagcc 72961 tcccgagtag ctgggactac aggcgcccgc cactatgcct ggctaattgt ttttgtattt 73021 ttggtagaga cggggtttca ctgtgttatc caggatggtc ttgatctcct gacctcgcga 73081 tccgcccgtc tcagcctccc aaagtgctgg gattacaggc gtgagccacc gcgcccggcc 73141 taaagagaca aagtcttgct ctgttgcttg gcctgcagtg cagtgatgca atcatagctc 73201 actgcagcct caaactccca ggctcaagca atccttcacc tcagcctccc gagtagctgc 73261 aactacaggc gtgcactact atgcccagct aattttattt gtacagatgg gtctttctat 73321 gttgcttagg ctgttctcaa actcctgggc tcaagcgatc ctcctccctt gccctcccaa 73381 acagtgggat tatacccact gaggctggcc aagtttcctt ctttgtaaaa aggggtaaca 73441 gtactgcttc cagtagttgg caggaagatt agaatagtgg ctagtatgtg atgagtgctt 73501 agtaagtttt gggtgctatg acaataatga caagaatgat gttgctcctc tggccaggta 73561 agccggcacc caatgacctc aactgccctc ttggtcagga aggccacccc attgtgggaa 73621 gtcattccat ttcagacagt gctaagggca atgaaagaaa gaaatgggac agaagaatga 73681 acactagggg agaagggctt ctgtagaact cggggtgagc ccgggagtgt taggctggcc 73741 tgtctctggt gacacgtgag cagagatttg aggctgcagc ctgccttgaa gatctgggag 73801 aaaacagctc ttggctaaag acggaacata agcaaaggcc ctgaggtgtg cacggcctgg 73861 aatgcccaaa gaagacatgg gggcaggcag ggtgtggggg ctcatgcctg taatctcagc 73921 actttgggag gctgaggcaa aactgtttga gatcaggagt ttgagaccag cctgggcaac 73981 atagtgagaa caaaacaaaa caaaaattca ttaattttgg gctaggcgtg gtgctcacgc 74041 ctgtaatcct agcactttgg gaggctgagg cgggagtatc acttgaggtc aggagttcca 74101 gaccagcctg gccaacatgg caaaccccat ctctattaaa aacacaaaaa ttagccgagc 74161 gtggtggcag gtgcctgtaa tcccagctac tcaggaggct gaggcaggag aatcgcttga 74221 acccaggagg tggaaattgc agtgagctga gatcgtgcca ctgcactcta gcttgggcga 74281 cagaatgaga ctgtctcaaa aaaaaaaaaa aaaaaaaaaa agccaggtgc ggtggctcac 74341 gcctgtaatc ccagcacttt gggaggccga ggcagactgg agtttgagac caccctggcc 74401 aacatggcaa aactccatct ctactaaaaa tacaaaaatt agctgggcat ggtggtatgc 74461 acctgtaatc ccagctactc gggaggctga ggcaggagaa tcgcttgaac ctgggaggca 74521 aagtttgcag cgagccaaga tcgtgccact gcacactcca gcttgggcga cagagtgaga 74581 ccctgtctca aaaaaaaaaa aaaaaaaaaa aaaaaagact gcaggtggca taatgagaac 74641 agagcttgac acatccaaca aggatggact ttaatttaac tcaattaatt tttttagagg 74701 tagattcttc cctatgttgc ctgagctgga tttgaactcc tgggctcatg caatcctcct 74761 gccttggcct cccaaatagc cggactacag gtgcacacca ccatgcccag ttctagacct 74821 aatttgatca aatgggatgc ccatcctgta aaggctccgc cgtggcctgg tgtctctcat 74881 gtctcaggga ggccctaggg gctgacccac ctcgaaggaa gaaggtgtcg tgctggtcac 74941 gggctgggtg ctgctggggc tggaagaggg cgtcaaagtt ccagaaggag ctctcaatga 75001 agttatcagt cggcatctcg gtgaacctgg tgggagacgc agcctgactg ccctgcctgt 75061 acccagcaga agcaccaggc accgccccca gccctgtgct caccccatct ccaggaagat 75121 ctgtcggaac tgggagcgga ccttgagcag cgggtgaagg tggccgctgt cggggaggac 75181 accgtgggcc aagaagttgt agggcttgaa gggccggtcc cgccaagagc cactggggga 75241 ggatgcaagg gcctggtaag ggctgcccgc ccctgccccc tgccccagct cccacctgcc 75301 ctgaccctgg ggttgcctgc tacctggaga tcatctctgg gctcagctct gtctcttgct 75361 tggagatgct ggtactaaag gcactgcctt tgctcaccca gtaggtcttc agagtcctgt 75421 ggccagggga aggaagggga cgccactgcc atctcctcct agaggacttc cttgatctcc 75481 cgtctgatga ggatccccat gctacctacc aagtcactct gctttatttc catcctgaca 75541 tttatcacta cctgaaatcc tcctatgtga ccatgctgta tctccctgta ttcactcact 75601 gattcattca acacttactg agaacctaat gtgtgccagg gtctgttcaa ggtgctaagt 75661 acacgactga gtgatgatac tagggaaagt ctagcagatt tattctgtgc caggctctgt 75721 tctgcaaaag tctccccaca agccccggaa agagggcagt tttttttatt tgtttgtttg 75781 ttttcctttt ttgagacaga gttttgctct tgttgcctag gctagagtgc aatggcacgg 75841 tctcagctca ctgcatcctc cacctcccag attcaagcga ttctcctgcc tcagcctcca 75901 gagtagctgg gattacaggc acccgccacc acacccggct aagttttgta ttttcagtaa 75961 agacgaggtt tcaccatgtt ggccaggctg gtctcgaact cctgacctca ggtgatttgc 76021 ccgcctcggc ctcccaaagt gctgggatta caggcatgag ccatcgtgcc tggccccttt 76081 ttctttgaga tgggggtctc tctatgttgc ccaggctggc cttgaattca tggattcaag 76141 tgatcctaac acttcagcct cctgagtagc tgggattaca cacgtgagcc acacacttgg 76201 cccaggctat tttatagata aggaaacgga gaccagcaaa ggccaggtga cttggccaag 76261 gtcataaaac tagttaagtg acagagcccg aacttcagtt aggcatccca gctctagtct 76321 cttctctctg ccactctgtg tctgctcaaa ggggacattt gtcctgtgag aggacggccc 76381 ctagcccatc taggttccag aagcctgtct ggtgtgtgtt cattaaactt tggaccacac 76441 taaggacagt gaaagaaatg aacaagcacc aaagatgaag actaggggcg atgggcttca 76501 tggaactcaa ggggagagga gggctgtcag ggtggcctct ttaaggggac acaggagaga 76561 gacttgatga ggagcctatc ttgaagatca gagggaaaac agttctcagc agaagggtcc 76621 tcatgggcaa agaccccaag gcacgtaggc ttgacaacta tggcattaag cgagctggga 76681 ggaaactggc cacacctgtg cttggtgtgg gcaagatgtc tccctcccac catgcctcca 76741 ccatgcttcg caggtggtgg gggcctgagc ctgcaggggg cacccactca cacttcagcc 76801 aacagcttcc tcttcctcag ctcgctcctc tccttctccc ccagcttctc agcctgtccc 76861 ccccggacca gctggagccg ccgctgcacc tcatcctcca tgctgtccac ctgccaggat 76921 aaggagtgtg aggggtatga gggccagggg cccatgtgcc cgtccacctg ccccctgacc 76981 agcccgcagg aacgcaccac tcggaacacc cggggcccgt cagccgcact cttgtccacc 77041 cgaatccact tgttggacat ggccttgctg aagcccactt tgccactggg cagtcgctgg 77101 aaaagagagg ctgcagtgag tggggcacga gggccacctt catcccatca ctcactcccc 77161 attcaccccg cagctcctac cataagctcg ctctgggcca ggccctctgg gggaatgctt 77221 cgaaacacac gggcctcatg gctgccctcc cgggcaatct cctcgccctc cgcagtaagc 77281 tcccagtgct tggtggaccg aagttcagcc tcgatgacct agagggaaat gtggggaggg 77341 tgtccaggca ggggtgaggc tcagagccag gcaccccctt tctgcagcgc tgggtcccag 77401 gttgaaagca cagcccttaa agctaggttc acatcccagc tatacaactt tacagccaag 77461 tggtctcaag ggaaagactt ccccttgctg tgtctccatt tcccttatgt aacacagggg 77521 tagtaataat gatagtttac acctcccagg gtaaagtaca ccactaaaga tgttacctgg 77581 cagaagacaa attctgtcgg ttttgaacct gtgacagatt ttaatgtgaa aaattaggtg 77641 aactgatatg tctccacatt tttttgtttg tttttgtttt ctttgaaagg gggtctcacc 77701 ctgtcactag gctgggtgca gtggtgtgat catagctcac tgcagcctag acctcctggg 77761 ctcaagcgat ccttccgcca gtcccgagta gctgggacta caggcatgtg ccgccacacc 77821 tggttaattt ttaaattact tttttttttt gagacggagt cttgcactgt cgctcaggct 77881 agagtgcaat ggtgcgatct cggttcactg caacctctgc ctcccgggtt caaatgattc 77941 tcctgcctca gcctcccgag tagctcggat tacaggtgcc caccaccacc cccagctaat 78001 ttttgtattt ttagtagaga tggggtttca ccatgttggc caggctggtc tcaaactcct 78061 gacctcgtaa tcggccgact ccgcctccca aagtgctagg attacaggcg tgagccgccg 78121 tgcccggccc tttaaattac tttttaaaga cagggtgttg ttatgttgtc caggctggtc 78181 tcaaactcct ggcctcaagt gattctcctt cctcagcctc ccaaagtgct ggcataatgg 78241 gcatgagcca ccatacccag cctgtctcca tgtttaaatg ttcacatgtc agtgcccaga 78301 atgtcagaga cgggccctgg aggtccacat tacaaacaaa aactgaggtt ccaggtccag 78361 agtcatgaag acaaggacaa caagggtaag tagagtgcac agggcctggc tcccacttca 78421 ttctaatccc aactctgccc ttcacttact gtgtgacctc aggaaagttg cttaacctct 78481 cagtgactct tctcatccat aagagaaagg tgtgatgggt tctacctcag atgattttaa 78541 ggagtagcta aattaatact cagaaagaat atatagtgcc taccacgtga caaacactgc 78601 ataaattttt tttttttttt ttttttgaga cggagtctcg ctctgtcgcc caggctggag 78661 tgcgatctcg gctcactgca acctccacct cccgattcaa aaaattctct gcctcagcct 78721 cccaagtagc tgggattacg ggtgcccacc aacacgcctg gctaattttt ttttgtattt 78781 ttagtagaga cagggtttca tcatcttggc caggctggtc ttgaactcct gacctcgtga 78841 tccacccgcc ttggtctccc aaagtgctgg gattacaggt gtgaccactg cacccggcat 78901 aaattctaat tactataatt atttttatgg ctattactgt cattagattg aacctcacct 78961 ccttatatta tttaaccctc tgtgacgtat acaaatgtca gatcatattg gtcaaggcac 79021 tgtcacccca cacaacacag ctgccatctc tcctctgcat tgaggactat gggctctgca 79081 gcctgagtgc ctaatttcaa atctgggctc atcggcctct tctattcaca ctgattatgt 79141 tactgaccat cagttgtcac atctgcaaaa tgggaataac atggccatca gccaggatta 79201 tggaatggat tggacaaaca agctgagacc aaagtggggc acagaacatg agctctacgg 79261 ataaacagcc cagatcctgg ccccatggat ctagccttct gggtggccca cgggcccctc 79321 agcatcctac gcctgtgact ctgtcactgt ccacttgccc aagtcagaag gctggacttc 79381 atctgtgcgc cccctgcctt gtcctgaatc ccacgccttc cccacctccc ccaagcttcc 79441 ttcccatagc cagacatcat gccctccagc ttggggaagt gccagcttct tctttcatcc 79501 cttcacctcc agtcctgtcc atgccaattc ttcccaaagg tgatgtggat agcacactca 79561 tctgaccggc tctagctcct gctatcgctt cccagttccc tcatgaggta aagacctaag 79621 aggctttgca attacagcta tgctggtctc ccttgccgcc tgccatcccc caagatcccg 79681 gccactgatc tctaaagtgc tatcactttc ctcaacagag aacagactca gccgtcacct 79741 gcaccaggcc gctctccctg acccccaagg ctgggtcagc tgtcacagct gggtcccctg 79801 ttttccccca tcactctggt tcttgaagca ccttcctatt acgtcgcaac ctgcgtgagg 79861 gcagcaaccg cgcttcctcg tccatcatcg taccctctac ccacgctgac cgtgcctcaa 79921 taaacgttta ttgcatgagg aacatccgtg cgtctgggcc ctcactcgtc ccacttcccg 79981 cccgcacatc ccggagtgga cacatataca gctaccccaa actaggcctg agggaatttg 80041 gcttggacgt agagcgcccg tggcaagggg actgtaggtg caagggcaag gcggtccggc 80101 atcacgggcc cggctcacct cgcccagcgc ctgaaggctc ttcacggcgc ccaccaccgc 80161 ctggtgctcc atgcccagct cagccgccaa ctcggcgctg tccaggccgc catcagacgc 80221 ctccagccgc cggagcagca gttccgccac ctgaccatcc gccatgactc cttccagtgt 80281 gctcagcgtg tccgggccag ggtgggcggg gggagctgag ccaccggaac cggaaccgga 80341 gtgtgtaccg ccatcttgaa tgtgtctaac tgtagaacag ggctcaatgt gtcgtcatct 80401 tgagtgtggc gacggagccg caaaaggtgc atgggactca gttcccacct ccaccctggc 80461 tccatcaatg gcgtctggtt gaccagaggg gtaaaattga ttatctcaag gtcagaagca 80521 ccgccctgtt ttgagtttta tctagcctca cttcgagcgg gctgtgtgac cttggacttc 80581 cccgtctctg ggttacagaa ccctctgccc ttctggtatt ctctcttgcc ctaggagagg 80641 cggtgggtgg tgggctccag agatggtcat tttggcgatg tcagctactt ttaaaatttc 80701 aaaggaaggc tgggtaggta ccgtcccttt ttatagaggg aaaaggaatt cacctagtaa 80761 gtcaaggtgg cacctaacct gtgtggacta cctcgtatgt ggtagtactt agcccagaac 80821 tgagtacttc acgtgtagta tttcatttat ccccaaaact ttatgatgta gatactgata 80881 taaccgcatt acacatgtga gaaaactgag gctcagaggg tggtatgact tgcccaaggt 80941 tctacagaga gttggtggaa atgaagtcat aaattataaa ccttccagat cctcccaccc 81001 ctgtgcgctg agagtaacgt cagtgcaaat actgcatggc ctagattagc actgtcctat 81061 agaaacacag tgcaaaacat ctttcttttt tgagacaggg tcttgctctg tcacccatgc 81121 tggagtgcag tggtgcaatt atggcttact gcagccttga cttcccaggc tctaaggatc 81181 ctccaacctt aggctcctga gtagctggaa ctgcaggtgc gcgccaccac acccagcttt 81241 tttttttttt ttaagtatgg gttttggcca tgttgcccag gctggtctca aactcctagg 81301 ctcaggttat cctcctgcat cggtctccca aaatgctggc attagaggca tgacccacct 81361 cgcccagcca gccagaccct gactcttaca aaaaaaaaga tgactaataa aatattttgc 81421 actcggccgg gcacagtggc tcacctctgt aatcccagca ctttgggagg ccgaggcggg 81481 tggatcacct gaggtcggga gtttgagacc agcctgacca acatggagaa accccgtctc 81541 tactaaaaat acaaaattag cagggcgtgg tggcgcatgc ctgtaatctc agctacttgg 81601 gaggctgagg caggagagtc gcttgaacct gggaggcaga ggttgtggtg agccgagatc 81661 gtgtcattgc actccagcct gggtgacaag gcgaaactcc gtctcaaaaa aaaaaaaatt 81721 gcactttttt tcatactaaa tcttcaaaat ctgatgcgta ttttacactt acaacacatc 81781 tcagttcgga ccaaccacat ttcaaatggt cagtaatcac atgtgggcag tggtgactta 81841 cgtcagcaca ggtagattag aatcctagat ctgctggcca ggctcagtaa tcccagcact 81901 ttgggaggct gaggcgggtg aatcacctga ggtcaggagt tcaagaccag cttggccaag 81961 atggcgaaac cccatctcct ctaaaaacac aaaaattagc tgggtgcagt gggggtgcct 82021 gtaatcccag ctacacggga agctgaggca ggagaatcac ttgagcctgg taggtggagc 82081 ttgcagtgag cagagatcgc accactgcac tccagcctcg gtgacagagc aaggctctgt 82141 ctcaaaaaaa aaaaaaaaaa aaaaaaagaa tcctagatct gttatttatg tcattcaacc 82201 agaaggtgta gtctttctgt gcctcagttt cttgctcaca tggaggtgat tggagcccag 82261 gttactgtga gaattaaatg cccacacata ttcacactgc ttactaaatt gagtgtggcc 82321 catgttcgaa cactcttata agcagaacac atttattcct tttacatcac aattattatt 82381 tgagtctgta aaatgggatt actcatatcc ccaaatcctg accctatatg aggttctgta 82441 ttaaggatac atttctcaaa gtcccttctc tcctcccatt ttatgttgat tatttattta 82501 tttatttagg gagtgggtct cactctattg cccaggctgg agtgcagtgg catgatcttg 82561 gctgactgca gcctccacct cctgggctca agcaatcctc tcacctcagc ctcccgagta 82621 gttgggacta caggtgtgta ccagcaggcc aggctaattt gtgttatgta tattatatat 82681 tatatgctat atataatata catactattc atatatttat gcattatatg tcatatataa 82741 tatattacat atgtgtacat atatatattt tgtaagaatg aggttccacc atgttgccca 82801 ggctggtttt gaactgccgg gctcaaacaa tctgcctgcc tcaggttccc aaggtgctag 82861 gattataagt gtgaactacc atgtccggct tatttttatt tatttttgag acagggtctt 82921 gctctgtaac ccaggctaga gtgtgcagtg gcaacaacac agctcacggc agcgtcaacc 82981 tcctggtctc aagtgatcct cctgcctcag ccttctgagt agctgggaac acaggcaggc 83041 gccaccacgc ctggcaatta aaaaaaatgt ttttgtaaaa atggcctcct gctatgttgc 83101 ccaggctggt cttgaactcc tggccttaag cattcctccc atcttggcct gccaaagtgt 83161 tgggattaca ggcgtgagcc actgtgcccc gtgtgttttt agttaatttc cacaagagtc 83221 cttccttcct ctcttctatc atgcagcaag tactttttaa gcctgttctg tgccaggtgc 83281 tgcaggtgac actggaggat gcgaagtgaa caaaacaggc agtgtctaac ctcaagtagg 83341 aagcgccaag cccacaggtg cctgacacag gaacaggagg aagggtcagc aagaggcctg 83401 ggatggtccc cgggatcctc atggggggga ccagactagg caatactgaa tactacctag 83461 aggaagtggt agaaccccaa cttcccagtt cattctcccc cgtctttttt tcgagatgga 83521 gtcttgctct gtcacctggg ctggtgggct ggagtggagt ggcgtgatct tggctcactg 83581 caatctctgg ctcccaggtt caagcgattc tcccacctca gcctcccgag tactgggatt 83641 agaggcaccc gccaccacgc ctggctaatt tttatttatt tatttatttt tatttttagt 83701 ggagatgggt tttcaccttg ttggccaggc tggtctcgaa ctgacctaaa atgatctgcc 83761 cgcctcggcc tccccaagtg ctgggtttac aggcgtgagc cactgcgccc ggccaccatt 83821 tctttttttt ttttttaacc agagaccctt gccaagtcat tccccccact ccactttatt 83881 ttccttttca ttttttcctt cctctctttt ctgagtcaca accattagcc aggaagcacc 83941 ccctccccac cctctttcct ggatccccct cattccctcc ttccatagtc cactcccgcg 84001 ctcccaggtc cagggcttat ttgcccagag tttggaaaac ccccagctct ccttcctcct 84061 ttctacagcg tgggggcagg gtactggtgc cagtcacgtg cctctggctt ctgaagaaga 84121 ctctagactg gggtcggggg gtgggtcctg cccatctccc tagcatctta tcgtccctac 84181 catctgtgtc ttttttccct ccccaaacgg aaccccctgc cctctcgcct gcctatagcc 84241 gtttaattgc aaaagccagg ccgtttgtgg gagaccacag acagcgaccc ccttcattta 84301 ccggttgaga ggagggtaaa ggggcggctg caatctgggt aataacccta tccccactcc 84361 aggagtcaca gtcacatcgt taagccttcc tcccctcttg tcccaggaca gctttaaaaa 84421 cgttaaaagc atttctgctg ggtagcatct ggccagggtc gccccctctg tctgctcagg 84481 aacgtctgtc acttcagaga gcttaagtga cttgccccgg tcacacagca gcagtccgat 84541 aggctgccag ggctctaggg gcagaaggag gagagggctg gcattcttcc caccggcccg 84601 cgtgactgta gcaccggggt gcagcgaagc cccaagggcc ccaatccgtg agctctctcc 84661 catcccaggc aggggtgggg gagcagcagt ggggtgctgg ttctcaaatg caagataaga 84721 gctggctaag aaagccttgc ccagcccctc cacctagagg gaatgggagg gagagaagct 84781 gagggcaggg tcccggtccc gcgtggagac agctgcgctc ccgcggtttc tttaaacgcc 84841 cagatgggca acgacgcgcg cggacgaggg cggggttggg ttcaggtctg gtcacatgac 84901 ctggcctgag gtgctcgcgg cccccacccc accagtgggc gtccccccca cgcgtggtcg 84961 accatcattg gtcggtggtg aggccaatag aaatcggcca tctgggaacc cagcgttccg 85021 aggcgcagcc taacatagtg aaccgacgaa ggtccaatgg aaaaagacgg ccatgggcat 85081 agaccaatga caaagtggca ggggcgggcc caagggctgg gtcaggttgg tttgagaggc 85141 gggtgggtat aaaagtgcaa ggcgggcggc ggcgtccgtc cgtactgcag agccgctgcc 85201 ggagggtcgt tttaaagggc ccgcgcgttg ccgccccctc ggcccgccat gctgctatcc 85261 gtgccgctgc tgctcggcct cctcggcctg gccgtcgccg agcctgccgt ctacttcaag 85321 gagcagtttc tggacggagg taacgcctgg tcccgcctcg aggccgcccc gacgacgcgg 85381 ccggcccccg atcctggatc tgcgttgtcg cccgtaatta ccgtttagag gtccaacacg 85441 gtggcctccc gggactagag ccgcgggcga tttctcttct gcgtccctgg ggagcgcgga 85501 gggcgtagcg gcctcccgcg gcgggagtta gggttagccc gaggatctct gaaggcaccc 85561 gacgtgtcaa actagaggtt ggaatgggga gtgtcgggga tctcctttcc tgtccccagc 85621 agcttgtggc tctcggcaga tgtttggtgt ggggggggat tagcacagcc gctctgacct 85681 acccctctaa tcccccactt agacgggtgg acttcccgct ggatcgaatc caaacacaag 85741 tcagattttg gcaaattcgt tctcagttcc ggcaagttct acggtgacga ggagaaagat 85801 aaaggtaaga gcctaggagt gggtgctcag atccgggagg acttcctggc agaagtcctt 85861 gtctgtacac acacagccgg gacagtcccc ttggaggagg acaggtggag gaagtggggg 85921 agtcttctct attctctaag tcgagggtcc tcgcgagtca aggcccaacg gtgacctcac 85981 taccgtcccg tctcaggttt gcagacaagc caggatgcac gcttttatgc tctgtcggcc 86041 agtttcgagc ctttcagcaa caaaggccag acgctggtgg tgcagttcac ggtgaaacat 86101 gagcagaaca tcgactgtgg gggcggctat gtgaagctgt ttcctaatag tttggaccag 86161 acagacatgc acggagactc agaatacaac atcatgtttg gtgagggcct gcttcctggt 86221 gctgatctct gtcccattag ttagagggag acccagaccc cattgacttt cttaataatg 86281 attttttttg gaaggggagc taaaagaata agtcccagca acaatttatt gcattatgat 86341 cgcagatcta ggctgttaat ttaatttgcg tgtttgtata tagttatttc ccaatcttac 86401 taatgaggat tttgagttct agagcactga tttttttttt ttctccttta aacttaaggc 86461 tccacccaca gcccattcag gacagaatca gggtctgagt ttctcttctc agccttgaca 86521 gacccgagtt gaagaaccag gtcttccttt tataaagagg ggtgagagcc tcgagatgat 86581 gggtagtctc tgactcttaa ctggatctgc ttcacaccta ggtcccgaca tctgtggccc 86641 tggcaccaag aaggttcatg tcatcttcaa ctacaagggc aagaacgtgc tgatcaacaa 86701 ggacatccgt tgcaaggtgt gcctgggggt ggtggcaaat ggctgtcatg gggagattca 86761 gaggtcagcc tcattggggg gtggcccccg ctcaccttct tccttcttca ggatgatgag 86821 tttacacacc tgtacacact gattgtgcgg ccagacaaca cctatgaggt gaagattgac 86881 aacagccagg tggagtccgg ctccttggaa gacgattggg acttcctgcc acccaagaag 86941 ataaaggatc ctgatgcttc aaaaccggaa gactgggatg agcgggccaa gatcgatgat 87001 cccacagact ccaagcctga ggttggtgtt tgggcagggg ctctgctctc cacattggag 87061 ggtgtggaag acatctgggc caactctgat ctcttcatct accccccagg actgggacaa 87121 gcccgagcat atccctgacc ctgatgctaa gaagcccgag gactgggatg aagagatgga 87181 cggagagtgg gaacccccag tgattcagaa ccctgagtac aaggtgagtt tggggctctg 87241 agcagggctg gggctcacag tggggagtgc accaacctta ctcacccttc ggtttccttc 87301 tcccttctgc agggtgagtg gaagccccgg cagatcgaca acccagatta caagggcact 87361 tggatccacc cagaaattga caaccccgag tattctcccg atcccagtat ctatgcctat 87421 gataactttg gcgtgctggg cctggacctc tggcaggtga gacttggagg aaaaaggagg 87481 atccctgggg tacctcaagt gcataagatc acccaagagg aaagggacag ggtaggcacc 87541 ccaggtgagt ctgactcaaa aatggtactt cttgtaaaca gtacttcctg gtctgtccct 87601 gtgaagtcct cacagcaacc cctttaaggt tatacttgct gtgcaccaag tacttcccca 87661 agtactttta tgcaaatcaa cttctttacc cccaaagacc tagaaggtgg tcaggtaacc 87721 cagttagtta gctggggctg ggcacagtgg ctcaccctta caatcacggt actttgggag 87781 gctgagacag aggattgctt gaggccagga gttacacaac tcaacctagc ttggcaacac 87841 agcgaggaga ccctatctct acaaaaaaaa tttttttttt tgagacagag tttcactctt 87901 gttgctgagg ctggagtgca atggcacgat ctcagctcac tgcgccctcc gtctcctggt 87961 ttcaagcgat tctcctgcct cagcctccgg agtagctggg attacaggca tgtgctacta 88021 tggatgccag gctaattttt tttttttttt ttgagaccgt gccttgctct gtcgcccagg 88081 ctggagtgca gtggtgtgat ctctgctcac tgcaagctcc gcacgacccc ccaggttcac 88141 tccattcttc tgcctcaggg tcccgagtaa ctgggactac aggcaccccc caccatgcct 88201 ggctaatttt tttgtatttt tttttttagt acagacatgg tttcaccgtg ttagccagga 88261 tggtctccat ctcctgacct catgaaccac ccaccttggc ctcccaaagt gctgggatta 88321 caggcgtgag ccacctcacc cagccttttt gtagagacag ggcttcatgt tgcccaggtt 88381 ggtctcgaac tcctggcctc aggtcatctg cccgcctcgg cctcccaaag tgctgggatt 88441 acaagggtta gccaccatgc ctagcctcta caaaaacttt aaaaattggc gagatgtcat 88501 gcatacctgt agtcccaact accaaggaag aaggatgatc acttgagcct ggggcatcga 88561 ggctgcagtg agccatgatt atgtcactgc actccagcct cggtgacaga gtgagaccct 88621 ctcaaaaaaa gttgggactt ggccggacac agtggctcac acctgtaatc ccagcacttt 88681 gggaggccaa ggcgggtgga tcacaaggtc aggagatgga gaccatcctg gctaacatgg 88741 tgaatgaaac cccatctcta gtaaaaatac aaaaaatttg cccggtgtgg tggtgggcgc 88801 ctgtagtccc agctactcgg gaggctgagg caaaaggatg acgtgaaccc gggaggcgga 88861 gcttgcagtg agccgagatc atgccattgc actccagcct gggtgatagc gagactctgt 88921 cccaaaaaaa aaaaaaaatg ctgggactga atttttgtct gttttggtca ctgaaatacc 88981 ttctgtgccc aagacagttc tggcatgtag taggtacctg aaaaatacct gaataagaga 89041 gtgagaaaca agaaacaggt gcagagaact gaagtcagtg gcccaaggtc atgggggtag 89101 gaaccacaaa gctggggttt gaacctgggc agtacagcac ctgagtctct ccatcttttt 89161 tttttttttt tttaagacag agtcttgctc tgtcacccag gttggagtgc agtggcttga 89221 tctcggctca ctgcagcctc tgccttccag gttcaagtga ttctcatgcc tcatcctctc 89281 gagcagctgg aattacaggc atgcgccacg acgctgggct tttttttttt tgagatggaa 89341 tttcactctt gttgcccagg ctggagtgca atgatgcaat ctcggcggct caccacaacc 89401 tctgcatccc agattcaagc gattctcctg cctcggcctc ctgagtagct gggattacag 89461 ggatgcgcca tcacagaccc cgggctaatt ttttttagta gagacagagt ttcactatgt 89521 tgcccaggtt ggtctcgaac tcctggcctc aagtgatccg ttcgccatga cctcccaaag 89581 tgctgggatt acaggcatga gcccgtcccg tccctggctg tctctccatc tttccatctt 89641 tttttttttt tttttttttt ttggagatgg agtctcactc tgtcacccag gctggagtgc 89701 agtggcacga tcttggctca ctgcaagctc cgcctcctgg gttcacatca ttctcctgtc 89761 tcagcctccc aaatagctgg gactacaggc acttgccacc acgcctggct gattttttgt 89821 atttttagta gagacggggt ttcaccgtgt tagccagggt ggtctcgatc tcctgacctc 89881 gtgatccgcc caccttggcc tctgggcgag gattacaggc gtgatccacc tcacctggcc 89941 tctccatctt tttaactgca gtgtcagcgg tgttccttgt cttctctgca gatgcaggca 90001 gcagaatata gtggttatag gaacacaggt ggaaaccctg tccaaagcaa gggctatcgg 90061 gtatcacctc tgaccatcct tcccattcat cctccaggtc aagtctggca ccatctttga 90121 caacttcctc atcaccaacg atgaggcata cgctgaggag tttggcaacg agacgtgggg 90181 cgtaacaaag gtgaggcctg gtcctggtcc tgatgtcggg ggcgggcagg gctggcaggg 90241 ggcaaggccc tgaggtgtgt gctctgcctg caggcagcag agaaacaaat gaaggacaaa 90301 caggacgagg agcagaggct taaggaggag gaagaagaca agaaacgcaa agaggaggag 90361 gaggcagagg acaaggagga tgatgaggac aaagatgagg atgaggagga tgaggaggac 90421 aaggaggaag atgaggagga agatgtcccc ggccaggcca aggacgagct gtagagaggc 90481 ctgcctccag ggctggactg aggcctgagc gctcctgccg cagagcttgc cgcgccaaat 90541 aatgtctctg tgagactcga gaactttcat ttttttccag gctggttcgg atttggggtg 90601 gattttggtt ttgttcccct cctccactct cccccacccc ctccccgccc tttttttttt 90661 tttttttaaa ctggtatttt atctttgatt ctccttcagc cctcacccct ggttctcatc 90721 tttcttgatc aacatctttt cttgcctctg tccccttctc tcatctctta gctcccctcc 90781 aacctggggg gcagtggtgt ggagaagcca caggcctgag atttcatctg ctctccttcc 90841 tggagcccag aggagggcag cagaaggggg tggtgtctcc aaccccccag cactgaggaa 90901 gaacggggct cttctcattt cacccctccc tttctcccct gcccccagga ctgggccact 90961 tctgggtggg gcagtgggtc ccagattggc tcacactgag aatgtaagaa ctacaaacaa 91021 aatttctatt aaattaaatt ttgtgtctcc cccctgtgtc tccttctggg gaaagacaga 91081 cttaaggaaa cccagcagtg gtctttttgg gggggggggg ggtttccagt atatctcctt 91141 tttcagctat tgctagagag gttgctgagt gttccacaag attccaggga cccttattta 91201 ccccataacc ctcaaaacca acgggggaat ggctgttgct gctgtaaata ctccacatac 91261 taacttactg aatccttgaa cctaactggt aagttttggt tctgcttatt tatttatttt 91321 taagatagac tctcgctctg ttgtccaggc tggcgtacag tggctcaatc tggttcaccc 91381 tgccccttaa actggccttt ttcatgcttc tattcctttg gggtcctgtg gctcagccca 91441 acccctctgg gatgcctcca ggactggtag atccaaggga gaattctgta tttaatgtag 91501 tgaggctttg aaagttaaca tctttaaata cttctggggt cagattaggt atagtacaac 91561 aaagtactcc tgcccggaga cccaagaatg aaaattgtag gttcattcca aaactccaga 91621 tccctaaaga atgaatgagg ctacaaaatg gcaccttgcc cccgcttcca acttaaggtt 91681 ttttttctgt gcttggtggc ctttcaggta tttctagttt tgagtaactt tcactttgac 91741 tgctctacaa gggttcctgc caggataccc agaccagaat aaaactcatt tgggggccgg 91801 gtacggtggt tcacgcctgt aatcccagca ctttgggagg ccgaggcggg tggatcacct 91861 gaggtcagga gttttgagac cagcctggcc aacacggtga aacccaatac aaaaaattat 91921 ccgggcgtgg tggggggtgc ctgtaatccc agttactcgg gaggctgagg caggagaatt 91981 gcatgaaccc ggaaggcgga agctgcagtg agccgagatc gcgccactgc actccagcct 92041 gggtgacaga gactgtttga aagaaaaaat tctcatttgg aagaggcaaa atgaggtttt 92101 tacagaagaa atacaaatgt ctgtaaccag tacaggtaca taaaaacttt gttaaaaatt 92161 catttgagcg tgagagtggg gacaccagag tcactttgca gccccgtgac gtcaccgata 92221 acgggcatgg cgtcactcag gagaccacgt gtgcgggccg agcaagaagc cccgcccaca 92281 gcgcggagtt tagtctgcgc gtacctcgct cgagaacgcg ctcgtgcgca tgcccacaaa 92341 ggccaaggag ggagtgcgca ggtcacgtgc gccggtggtc agcgcgcgca ttgcctgccc 92401 cggaagtggt cggcgcgcgg cgcggcgcgc ctgggcgcta agatggcggc ggcgtgagtt 92461 gcatgttgtg tgaggatccc ggggccgccg cgtcgctcgg gccccgccat ggccgtcacc 92521 atcacgctca aaacgctgca gcagcagacc ttcaagatcc gcatggagcc tgacgagacg 92581 gtgcgggccg ggccggagcc cgggggcggg agcgacgggt ttcgggggtg gggtgggggc 92641 ggggaggcta gaatcccaac gggaggggca gggaggacgg cgcgggtcgg ccctgcccag 92701 acccccgacc tgcccgactt tcctggaccc cccgatggtc tttggcccag cccccagccg 92761 atcgggcggc gctcctgcgc cggtctccgg gcgaggcccc acccccgggg cgctggccag 92821 gccccggctc caatgtcagc gctctcgcgg ggcgcgggag tcacaagctc ggattcctgg 92881 gcaggccaag ctctccaaga ctgggctcca cgttcccagc tttgcaggcg tctccttggg 92941 acactggtgg tcgaatctag gagtaatgac caggagatac tgagtagtga cgacaacaac 93001 gatgttaatg ataataaacg ggtctcagtc tttataattt tggtggtccg tgtttgttgg 93061 ccgtttactt tttatcagtc accgagtgtt gtagtgttgt tttaccgctc tgcaaaacgg 93121 gcttgtcatt ggggacctct gatcatttta cagatgtgga aacctaggga ttgaggaact 93181 ttgccacagt cacacacaag taatggcaga gctgggattc aaattcggtt ctgcctcgtg 93241 tgagcgtcca cgatataaat attatgctgc ctttttagtc aaaagtagaa ttaggaccta 93301 attcaaacat actgaatgtc tattgggtac ttggcgcttt ttgcatagtg agggagacat 93361 tgaagtccac tttttgtaga aaaggaaaca cacatttttt gatccgctca agtgtttggc 93421 tttgggcaag ttaacctgcc tacctgtctg tgcctccgtt ttctcaaata tcaaatgaac 93481 cggtcgcctg ctggcttccg cctgtaatcc cagcacttag ggaggcagag ggaagaggag 93541 cccttgagcc taggaattcg agatcatcct ggcaacacag tgagaccccg ttctctacaa 93601 aaaaaagaaa caaaaatcaa taaataaaat gagaccaatt aaacgtattt cataggaaac 93661 ttaaaaggta tgacacatat acatttacag tagcactatt ggccaggcac ggtggctcat 93721 gcctctaatc ctgatgattt gggaggcaga ggcaggagga ttgcttgagg ccaggagttt 93781 gagaccagcc agtctaacat agggagactt atctacaaaa aaaaattatt tttaattagt 93841 ctggagtggt ggtgtgtacc tttagtccta gctactccca aggctggaca agagtaccag 93901 ttgagcccag gagtttgagg ctgcagtcag cgatgatcgt gccacatcaa tccagcatgg 93961 gcggcagcga gatcctgcct cttaaaatta aaaaatatca ctattaacta ttagtcactt 94021 attgtccaag gtagagtgta gagtggggga ctgtcccctt tattctgtta attatacttt 94081 atactacccg aaattacatt ttcttcacat tagtgttctc tcattgagag cagggactta 94141 cttgtgttga gtttgttctg tgctgtggcc tctgtgtcta aaacagcacc tggcacataa 94201 tgggtgttta gtataagtaa atactgtgtt gaatggcatg atgattgaat gaattcaaat 94261 ctagcatgct ttagatgctg agaaagctta aaaaacccag cgtgcttgcc tttctgccat 94321 cagggctggc agtttctctg tcctaaacta gaagggaaaa gagaatcttg gggtctccag 94381 tgactgtctg taccactccc tctaggtgaa ggtgctaaag gagaagatag aagctgagaa 94441 gggtcgtgat gccttccccg tggctggaca gaaactcatc tatgccggca agatcttgag 94501 tgacgatgtc cctatcaggg actatcgcat cgatgagaag aactttgtgg tcgtcatggt 94561 gaccaaggtg ggtgacgtgt gctggctggg agggtgggtg gacgagctgg ggagctggca 94621 aagagcctgt gtgcccagga gagattagct gtgaacaggg cggggccaca gcggagcggg 94681 ttgttgggtc tgatagggtt gctgatgcca gctccctttt tcttgctgtt gcagaccaaa 94741 gccggccagg gtacctcagc acccccagag gcctcaccca cagctgcccc agagtcctct 94801 acatccttcc cgcctgcccc cacctcaggc atgtcccatc ccccacctgc cgccagagag 94861 gacaagagcc catcagagga atccgccccc acgacgtccc cagagtctgt gtcagggtaa 94921 ggcgggggca gcagtcccag cttgggccct gtcctcctag cacattccag cgtccacata 94981 agtggtccca cacacctgga gggagggcaa gccgccagaa gccagggtcc gatttctctc 95041 tcttgaattt gcagctctgt tccctcttca ggtagcagcg ggcgagagga agacgcggcc 95101 tccacgctag gtgggtgggt ggtccccagg gcagaggtga ctgggtgccc cagccatcag 95161 ctgggccttg tctgggtgcg ggagggcctg ggagctgccc tttcctcttc ctggtgacct 95221 aggctttgct gcttcctcca cagtgacggg ctctgagtat gagacgatgc tgacggagat 95281 catgtccatg ggctatgagc gagagcgggt cgtggccgcc ctgagagcca gctacaacaa 95341 cccccaccga gccgtggagt atctgctcac ggtgaggtgg ggcttccgcc tcccggggag 95401 gccttgaggg agtacccggg cgtcactgcc ctgatgggcg gttgggaagg caaaacctgc 95461 cctgaaaagc ctttgggtag tgattctagc cactaaaggc ttcccacagg aggctggatg 95521 tgagtgatgg gtgggcctct ggagggcagg gccgaggcct catctgtgtc ctgccagggc 95581 atggaggagg gtggcagcag gaggtctgtg cattagaact aaacaggacc cctgacaggg 95641 aattcctggg agccccgagc cggaacacgg ttctgtccag gagagccagg tatcggagca 95701 gccggccacg gaagcaggtg ggtgtgcaca tgccgcatct gccctccagg tacctgactc 95761 acattacact ccaccccgca gtgctcctag gagcccggcg tggtgtctga ctgcacccct 95821 tcctactacc agcaggagag aaccccctgg agttcctgcg ggaccagccc cagttccaga 95881 acatgcggca ggtgattcag cagaaccctg cgctgctgcc cgccctgctc cagcagctgg 95941 gccaggagaa ccctcagctt ttacaggtgt ggtcccaagg gcagagggag ctagggcagc 96001 caccatttcc cttccctgtg ggcaccagag tccataacac gtaggaatcg ttctaggtcc 96061 ggaaagcagg actaagcaca tgcttccccc acgcccctgt gcttcctgtg acctggtgac 96121 ccccctggtt tctcagtctt cccaaccacc ttgtaaggtg tgggtgctgt caacatcacc 96181 tcccacagaa gaagacaccg gaaacttaga gcagcctgtc attcccaggg tcacgcagct 96241 ggtagtgtgt gtgtctgcct cctgcctcgg ggtggggtcc tgggctgggg ctctggcttc 96301 acatacagat ggtgctgcgt aaatgtctgt aaggtggcat gacgtcaccc aggacagctg 96361 tgccccagtt ggctctggga caactctgct cagtgaggct ccatcttgcc cttgaaggtt 96421 cacaggaaga gtgggggagt ggccccctgg gtgcaagtag gttcctctgg gttctcggtg 96481 aagtgcctac cggtcttggc tcaggaccct gctgtctgac ccttctcctc tcacttctca 96541 gtcccttgcc agctcctcct ggtcggtcgt gttcttcatc tgcactcagc cctccctcat 96601 agtgtcctgg ggccttcagt ctgtatggga acccccaaat ctatgtagcc agcccaggcc 96661 tctctgaatg ccacactcca tggctggctt aattaactct ccatcaaagt gctccctaga 96721 ctagacgtct aagacttaga acagacctcc ccagtcctcc ccgatctctg caaggattag 96781 cttggggttg cttcagcccc aggccctggg gatctgctgg ggtctacccc ttgtcctgtc 96841 tgctcttccc actcagcccc tccctgcact ccagctctct ggtagcctct tcctgggcct 96901 ctgccactgc atagtcccac agtccacttt tcaaccatga ggcagatgac taaaaagcac 96961 tttcttttct tttttttttt gagaggaaat ctcgctctgt cgcccaggct ggagtgcaat 97021 ggcgcaatcc cggctcactg caacctccgc ctcccgggtt caagcgattc ttttgccccc 97081 gcccgagtag ctgggactac aggcgcccgc caccacgcct gactaatttt ttgtattttt 97141 agtagagatg ggatttcacc ctgttagcca ggatggtctc aatctcctga cctgatgatc 97201 tgcccgcctt ggcctcccaa agtgctgcaa ttacaggcat gagccatcac acctggctga 97261 ctgaaaagca ctttcttctt taatgtgaaa atgatcagat gcatacaatg gtagagttat 97321 atcagagctt ccagataccc aggtagggca cagtggctca tgcctgtaat cccagcactt 97381 tgggaggcta aggcaggcag accacttggt caggagtttt agaccagcct ggccaacatg 97441 gcaaaaactc gtctctacta taaatacaaa aagtagctga gcatggcggc aggccactgt 97501 agtcccagct actcaggagg tggaggcagg agaatcgctt gaacccagga ggcagaaatt 97561 acagtgaacc gagattgccc cactgcactc cagcctgggt gacagagtga gagtgtctca 97621 aaaagaacag aaacaaaaat gaacaaatga acttccagat acctaaatcc aacacccaga 97681 tgagttcact ttttttggtt tgggggctgg taggtgggaa tagagatagg gtctatgttg 97741 cccaggctgg ttttgaactc ttgggcccaa gcagtccctc ctgccttgcc tcccaaattg 97801 ttgaggttac aggcgtgagc caccacaccc ggccaagttc acctttctaa aagatccaca 97861 gaccacgtct tttcccatgt aactgtcacc accgtggctt ccccctgctc tagaaaatgg 97921 gtctgacttg ggccttcctc tctggtgtca ttgtctgcct cctccccctt cccttgttcg 97981 ctgttttttg ttttttttgt tgttgttgtt tgtttgtttt tgagatggag tttcgctctt 98041 gttgcccagg ctggagtgca gtggcgtgat ctcagctcac cgtaacctcc gcctcctggg 98101 ttcaagagat tctcctgcct cagcccccag agtagctggg attacaggca tgtgccacca 98161 cgccctgcta attttatatt tttagtagag atgggatttc tccatgttgg tcaggctgct 98221 ctcaaactcc tgacctcaag tgatctgcct gcctcagcct cccaaagtac tgggattgca 98281 ggcatgaggc accgtgccca gccccattca ctctgttcct gcttctgttt ctgagacttc 98341 ccagatgcat ccaccctggg cctttgcact tgctgtttgc tttacctgga ggaaaccctc 98401 tgcatctgcc atactgcagg cctctccacc agccctcatc tgcagttcct tcctagccag 98461 acacttgcag ttgtctggtt catttattgg ctcactcact tattttttga gagtctcttg 98521 cccaggctgg agtgcagtgg cacagtctcg gctccctgca acctctgcct cccgtgttca 98581 agtgattctc ctaccttagc cttcttgagt agatgggatt acaggcacgt gccatcacac 98641 ccagctaatt tttgtatttt agtagagacg gagtttcacc atattggcca ggctggtctc 98701 gaactccagg ctggtctaga atcacctgaa gtgattatag gtgtgaacca ctgcagccag 98761 cctatttatg tatttattta ttcaagacag ggtcttgctc tgttgcccag gctggagttc 98821 agtggcttac tgcagcctcc gcctcctggg ttcagtctta tgcctcagcc tgccaagtag 98881 ctgggattac aggcgtctgc caccacacct ggctactttt tgtattctta gtagagatgg 98941 gggttcacca tgttggccag gctggactcg aactcctgac ctcaagtgat ccacctgcct 99001 ctgcctccca aagtgctggg attacaggtg tgagccaccg tgcccggctt atttcttggc 99061 tcacttaaaa gcacatcatc agctcttcca gggtagggct tgtgtcgact gctatgttgt 99121 gtttggccag aacaggctcc tctggatatt tgcctgactg aatgaaaaag caagggctgt 99181 gatgacctgg ggagaggagg ggaccagggc tgtgaattac cttcccttcc ccaccctctc 99241 ctgcagcaaa tcagccggca ccaggagcag ttcatccaga tgctgaacga gccccctggg 99301 gagctggcgg acatctcaga tgtggagggg gaggtgggcg ccataggaga ggaggccccg 99361 cagatgaact acatccaggt gacgccgcag gagaaagaag ctatagagag ggtaagaggc 99421 ctggctgagg ggtgactgca ggtgggcagg acccctaccc tctcctgctc acacttaacc 99481 tatcttccca cagttgaagg ccctgggctt cccagagagc ctggtcatcc aggcctattt 99541 cgcgtgtgaa aaaaatgaga acttggctgc caacttcctc ctgagtcaga actttgatga 99601 cgagtgatgc caggaagcca ggccaccgaa gcccccaccc tacccttatt ccatgaaagt 99661 tttataaaag aaaaaatata tatatattca tgtttattta agaaatggaa aaaaaaatca 99721 aaaatcttaa aaaaacaagc aaacagtcca gcttcctgtc ctcctaaagt ggcccctgtt 99781 cccatctccc gggccagaca gctgtccccc cgtcctcctc cccagcccag cctgctcaga 99841 gaagctggca ggactgggag gcgacagatg ggcccctctt ggcctctgtc ccagctctct 99901 gcagccagac ggaaaggcgg ctgcttgcct ctccatcctc cgaaaaaccc ctgaggaccc 99961 ccccccatcc tcttctagga tgaggggaag ctggagcccc aactttgatc ctccattgga 100021 gtggcccaaa tctttccatc tagggcaagt cctgaaaggc ccaaggcccc ctccccagtc 100081 tagccttggc ctccagcctg gagaagggct aacatcagct cattgtcaag gccaccccca 100141 ccccagaaca gaaccgtgtc tctgataaag gttttgaagt gaataaagtt ttaaaaacta 100201 gccctatggt ctgtgcctgc tggggctccc cgcgcccacc tgtctgggtc ttggggggct 100261 ggctgggcac aggcaggcat ggtgacgggt gcctggaagg gggagaggag tctgtgagtc 100321 cctgcagatt gaggcttccg gtgtctcctc ccccagtgtg tcatttccag gcagtggacc 100381 ccagcccaac gttacaagga ctgcttctcc ccggccccca ccatcatcac cacagtctgt 100441 tttggctata cttccccccc caccagcctt gtaattggct aattactgga gatgaatgtt 100501 ggtaaacaga agccttcttc tcttctgcca tctgcttctc cattatttgc ccattgtaca 100561 cccccctttt cctcccactg aagccccagt tagtcagcag ggcctagagt ccctgtcccc 100621 ttttgagtac agccaagtgg gggattcccc agctcttaag tattggggga ggaaccaggg 100681 gacccagaga ggcacacctt gagaggacgc agatctcttc aggggtactg ccaggtagca 100741 ggctttattg ggaagggaca aagcctcagg agctgggtgc cccagaggct gctgggtctt 100801 gagccacagc tgcagccaat gcagcagctc gcgcctcctt cttccgtttc tgtttttcct 100861 ccttgaggcg cttgcgctcc ttcttctcta ggtcctggag cagctcctgg aagcgggcac 100921 tccttgggtc cacctggtag cccaggagct cctgggcctc agcctgcagt cgggccctcc 100981 tctccttgtc agcctgggcc ttctcccagt tctcccgctg ctgctgctgc cagttcacaa 101041 tcatctgtgg catcttggcc atgcactctg cgatgtgctg ctccctgcag gggagggaga 101101 gtgggctgtg acactggcac tcagccaggg cagagcccac ccacccatat ctcacccata 101161 ggcctttgcc cacgagcctg cctcctcaac tcccccagta gattaactgt caggaaaagc 101221 ccttgacaag tgaagaaaca ggaagagatg atgagaaagt tacctgtgca tgaatttgat 101281 cacaagggca tgtcaagccc cagtaaaccc caggtgccaa agccaaatca gcatttcact 101341 gagctgccta ctgagcaagc caagtgccac tcccatggat aatttcacac ctttatttca 101401 tagacattgc tgagcacttc ctgtacagag gccaggccca attctaggca caggagtgaa 101461 agcaagaaag acatttccca ccaatccttg agtttgtagc taccagtcac agccccattt 101521 ttttctttta agttcagggg tacaagtgca ggtttgttac gtaggtaaac ttgtgtcatg 101581 gggttttttt gtacagattt atttcatcac tcaggtatta aacctagtac ctagaggtca 101641 tttttcctga tcctctccct ccaccctcca aaaagcccta gtgtgtgttg ctcccctgtg 101701 tgtccatgtg ttctcatcat ttagctccca cttacaagtg agaacacgtt gtatttggtt 101761 ttctgttcct gtgttagttt gctagggatg atagcctcca gctcccgtcg tgtccctgca 101821 aaggacatga tctcgttctt ttttatgaca gcctcattta aggattagaa aagtaaaaga 101881 attaagccct ataatggtca tctggctgag gaggggcaga aacaggagtc cctgctccat 101941 gattccctgc ctgggtgatc agacgcatga cttcacttca ccaagcctca gtttcctcat 102001 ctctaaaatt agggcaatcc cttcttcctc ttaaggctat taggaaaaca agaagattaa 102061 acaccaggaa cacacttgac ctaccccctt cacgctccag gatgtggcat tatctataag 102121 agatgcccct ggtctggccc gacgcggtgg ctcacgcctg taatcctagc actttgggag 102181 gccgaggcag atggatcacc tgaggtcagg agtttgagac cagcctggcc aatatggcga 102241 aaccccatct ctactaaaaa tacaaaaaat gagccgggtg tggtggtggg cgcttgtaat 102301 cccagctact cgggaggctg aagcaggaga atctcttgat cctgggaggc agaggttgca 102361 gtgagctgag attgcactcc agcctgggca acaagagcaa aactctgcct caaaacaaca 102421 acaacaacaa caacaaaaag atgcccctgg tccttcagac ctgcctctgt atctgacctc 102481 tgccctcttg gttccaggac ccatcacctt cctgccctca ccttttcccc agccacactg 102541 gcttcctggc tgttctttaa ggcatatcgt atctcagggc atttgcccca gctgttccct 102601 ttaccagaaa ctttctaccc aagagatcgc cccatagcta agtctggtta ttcaggtctt 102661 tgcttaaaca tcacctcctc agagaggcct gaggcttccc aaaccaaagc tgccctcagg 102721 cacacactct tcactctttt accttcctac tagtcggctg tcaccattgg tattatcttg 102781 cctgtcagct gtcgccatct gatattaagg gcccttttgg ggcctgcctc ggcctcccaa 102841 ataactggga ttacaggcac gagccaccgc ccccggcctg ggacagggtc ttgcacacgg 102901 tatgcgctca gtaaagactt ggtggtgatg gccggacgtg atggctcaca cctgtaatcc 102961 cggcactttg ggaggctgag gcaggtggat cacttgaagt aaggagtttg agaccagcct 103021 ggccaacatg gtggaacccc gtctctacta aaactacaaa aatttgccgg gcgtggtggt 103081 gcacgcctgt aatcccagct acaccggagg ctaaggcggg agaatcgctt gaaccgggga 103141 ggcggaggtt gcagtgagcc aagatcttgc gactgcactc cagcctgggc aacagagcga 103201 gactcttgtc tccaaaaaac aaacaaacaa acaaacaaaa cttgctgatg aatgactaaa 103261 tggacgttat tctgagtttg ttttgcggcc aggtgagcgt atacataaat ctacaacatg 103321 cagatgctgc gacgtgaccc cgaaggggta ggcacggttt ggggcccagt tcgcccacgg 103381 ggcacagagg gcagccccgc gtctgcctgc acgcacgcac ctctcccgac gcttctgctc 103441 ttcggccagc tgcttcaccc gcagcgactc ctgcatggtc gccaggctcg ggtaccattc 103501 gcgttcttcg gcctccagct cccgcagctg ctccggcgac ggccataacg aaccggggac 103561 caccccggag gcggcgccgt aacgcgcgaa ctgcttagcc gcgtagcgcg gtcccagctg 103621 ccaccgcggg gtcaggaggt cctcggggtc tggccaccgg ggtcccggcc tgcggcgcgg 103681 gggcggccgc gcccggtagc cacgggaacc cggggccagg gtcgccgcca cacctagtag 103741 gctgcgtgcc tgtcgcacgg acgccgccat cttggctgtg cggggtcctc acaggcccgc 103801 cgggctgtcc atgcccggtg cctgagcgcg aggcccggtg tgggccaggg cagggcgagg 103861 ggtgtccctg ctgcctcagt tcgggggagg gcggcgaagg ggatagtatg ggttttatac 103921 gtttccaagt ataactttaa attttagtta agcagttact gattaaagtc atttttaatt 103981 ttcacttttt ttcttctcct ttctgcacac tgacacttaa aaatacatat ttttgtcggg 104041 cgcggtggct cacgcctgta atctcagcac tttgggaggc cgaggcgggc ggatcacctg 104101 aggtcgcgag ttcgagacca gtcttttttt tttttttttt tcgagacaga gtctcgctct 104161 gtcgcccagg ctggagggta acggcgtgat ctcgggtcac tgtaacctcc gccttccggg 104221 aggagaggag aaaccccgtc tctactaaaa atacaaaatt agccgggcat ggtggcgcat 104281 gcctgtagtc ccagctccgc gggaggctga ggcaggagaa tcgcttgaac ccgggaggcg 104341 gaggttgggg tgggccgaga tcgtgccatt gcattccagc ctgggtaaca agagcgaaac 104401 tccgtctaaa aaattaataa ataaaaataa atacatacat gcatacacac acacacacat 104461 atacatatat atatacgctc acctggctac aaatatatat ttgtgtattt ttttctaagt 104521 gttcagtgtt ctttgggtaa aaattgtatt tttgtttttt gcatatcaga aaccatactg 104581 ttttattcat ccatgtgtct tctgtgcaca cagtagccct ccaatgaata atctgaaatc 104641 atgttgaatg aacacattga aaaatcaaaa ataagaggct gtggcagggc cactgatatg 104701 gtttggctct gtgtccccaa ccaaatctca tcctgaatta taatccccat gtgtccaggg 104761 agggacctgt aatccccaca tgtcggggaa gggaggtggt tggatcacgg gggtggtttc 104821 ccccatgcta ttcttttttt tttttttttg ccttcaaaaa gtttatttta ttttatttta 104881 ttattattat actttaagtt ttagggtact tgtgcacaac gtgcaggttt gttacatatg 104941 tatacatgtg ccatgttggt gtgctgcacc cattaactcg tcatttagca ttaggtatat 105001 ctcctaatgc tatccctccc ccctcccccc accccacaac agtccccggt gtgtgatgtt 105061 ccctttcctg tgtccatgtg gtctcattgt tcacttccca cctatgagtg agaacatgcg 105121 gtgtttggtt ttttgtcctt gagatagttt gctgagaatg atggtttcca gtttcatcca 105181 tgtccctaca aaggacacga actcatcatt ttttatggct gcgtaaaaat tatattttta 105241 actttacttt tatacattta aactagcatt tattaggctg ggcttggtgg ctcacgcctg 105301 taatcccagt actttgggag gctgaagcgg gcggatcact tgaggtcagg agttggagac 105361 cagcctgacc aacatggcaa aaccttgtct ctactaaaaa tacaaaaatt acctgggcgt 105421 ggtggtgggc gcctgtaatc ccaactactc aggaggctga ggcaggagaa tcgcttgaac 105481 ctgggaggtg gaggttgcaa tgagctgaga tcgtgccatt gcactctacc cagggtgaca 105541 gagtgagact ctgtctgaac aaacaaacag acaaaaagcc cactagcatt tattattttt 105601 ttaatgtaaa tgtggctgaa cgcggtggtt cacacctgta atcccagcac tttgggaggc 105661 tgaggcaggt atcactgagg tcaggagttc gagaccagcc tggccaacat ggtgaaaccc 105721 cgtctttact aaaaatacaa aaattggctg ggtgcggtgg tgcacacctg taatcccaac 105781 tactcgagaa gctaaggcag gagaatcgct tgaacccggg aggcggaggt tgcagtgagc 105841 tgagattaca ccactgctct ccagcctggg caacagatcg agactctgtc tcaaaataaa 105901 taaataaata aataaataat tctgtcacat gctgtgacat ggatgaacct tgaggacatt 105961 aggctaagtg aaagaagcca gtcacaaaag gacaaatatt gtatgattcc acttatatga 106021 ggtaactaga gtagtcaaat tcatagagac agaaagtaga ttaggagttc caggggttgg 106081 gggtggagga gtgggagtta ttgcttataa tgtataatat attttatatt gtaaaatata 106141 caaaataaat aaaacgaatt tgtgttttgt tttgttttga gtcagagtct cgctctgtct 106201 cccaggtgag agtacagtga tgcaatcttg gctcactgca acctctgcct cccagattca 106261 agggattttc gagcctctgc ctcctaagta gctgggacca caggcgcaag ccaccaagcc 106321 tggctgattt tcatatttta ctagagacgg ggtttcacca cgttggtaag gctggtctca 106381 aactcctgac cttaagtgat ctgcccactt cagcctccca aagtgctagg attacaggtg 106441 tgagccacca tgactggcca cgtttttttt gttttttttt tttgtttgtt tttttttgag 106501 atggagttta gctctgttgc ccaagctgga gtgcagtggt gcgatctccg ctcactgcaa 106561 gctccgcctc cggggttcac gccattctcc tgcctcagcc tcccgagtag ctgggactac 106621 aggcgcccgc caccacgccc ggctaatttt tttgcatttt tagtagagag ggggtttcac 106681 cgtgttagcc aggatggtct cgatcgcctg accttgtgat ccacccgcct cggcctccca 106741 aagtgctggg attacaggca tgagccaccg cgcctggcct gttgtttttg tttttaatca 106801 cacacgcaca ccttctctac ccccacaaaa aataactgca gttttctcaa gacatgaatg 106861 ttcatagcaa caccacatat atatgtcatt accaagctgg aaatagctca cacccaatgt 106921 ccatctgaag tagaatggac acctaaatta tgctgtttca gccaatggaa tactacacag 106981 gaatgaaaga gcaaatatta catcttctca cagaagatgg atgatgctta taagcataat 107041 gtgcaacaaa ggaagaggac ctatatgatt gctttttaaa ttttattttt ttgagacagg 107101 gtattgctct gttgcccagg ctagagtgcc ctggctcggt catagctcac tgcagccttg 107161 aattcctggg ctcaagcaat cctcccgcct cagtctcttc agtagcaaga ctataggcat 107221 gaaccgctac tcccacctac attttaaatt tttttgtaga gacggggttt ctctatgtta 107281 cccaggttgg tcttgaactc ctgggctcac ctcccctccc ctcccctcct ttcttgacag 107341 ggtctcactg ttgcccaggc tggagtacac tggcatatca ctgcagcctc aatctcctgg 107401 tctcaagtgg tcctcccacc tcgggctcct aagtagctgg aacttcaggc actcaccatc 107461 aagcctggct aatttttgta ttatttgtag agatagagtc tcaatatgtt gcccagggtg 107521 gtcctgaatt ctgggaccca agttatcctc ccaccttggc ctcccaaaat gttgggatta 107581 caggtatgag ctaccatgcc tgacctcaga attttttcat cttgcaaacc tgtaactcta 107641 tacactcact ccccagtttc cctttcgccc agtccctggc agtcactatt cctttttttt 107701 tttttttttt tgagacggat ggagtctcgc tccgttgccc aggctggagt gcagtggcgc 107761 gatctcggct cactgcaagc tccgcctccc gggttcacgc cattctcctg cctcagcctc 107821 ccgagtagct gggactccag gcacccgcca ccacacccgg ctaatttgtt tgtatttttg 107881 gtagagacag ggtttcaccg tgttagccag gatggtctcg atcccctgac ctcgcgatct 107941 gcccgtctcg gcctcccaaa gtgctgggat tacaggcgtg agcaccgcgc ccggcctggc 108001 agtcactatt ctaccttgtt tctatgaatt tgactactct aggtacctca tataagtcga 108061 atcatacggt acttgtcctt ttgtaactgg cttatttttt attttgagac agagtcttgc 108121 tctgttgtcc aggctgaagt gcagtggcac aattattata actggctgca gaatcgacct 108181 cctggactca agtgatactc ccacctcagc ctctcaagta gctgaaacta caggtgctca 108241 cctggctaat tttttttttt tttttttttt tttttttgaa acagggtttc gctctgtcgc 108301 ccaagctaga gtgcaggggc gccatctcgg ctcactacaa gctccgccgc ccgggttcac 108361 gccattcttc tgcctcagcc tcccgagtag ctgggactac aggcgcccgc caccacgccc 108421 ggctaatttt ttgtattttt agtagagacg gggtttcacc atgttagcca ggttggtctc 108481 gatctcctga cctcgtgatc ctcctgcctc ggcctcccaa agtgctggga ttacaggcgt 108541 gagccaccac acctggcctt tttttttttt tttttgagac agagtactta ctctgttgcc 108601 caggctggag cgcagtggca tgatctcggc tcactgcaac ctctgcctcc caggctcaag 108661 caattctcat gcctcagcca cctgagtagc tgggactaca ggtgtaaacc actgcacctg 108721 gggctggctg attttaatta gtataatatc ctaaaatttc tttctttctt tctttctttc 108781 tttctttctt tctttctttc tttctgtctg tctgtctgtc tttctttctt tctgtctgtc 108841 tttctttctt tctttctgtc tttctttctt tctttctctt tctttctttc cttctcttct 108901 cttctctcct tccttccttc ctttctttct ttccttcctt cctttctctt tctttcgaca 108961 ttctctctct atcgcccagg ctggagtgca gtggcactgt cttggatcac tgcaaccccc 109021 acctccccgg ttcaagcaat tcttgtgcct cagcctcccg agtagctggg attacaggtg 109081 cccgccacca cacctggcta atttttgtat ttttcgaggt ttcaccatgt tggccaggct 109141 gatctcaaac tcctgacctc aagtgatctg cccacctcag cctcccaaag tgctgggatt 109201 acaggtgtga gccaccgtgc ttggcatatc ctaaagtttc atccatgttg tagcacgtgt 109261 ccgaatttct ttttttcttt tttttgtatt tttagtagag acggggtttc cccgcattag 109321 ccaggatggt ctcaatctcc tgacctcatg atccgcccgc ctcagcctcc caaagtgctg 109381 ggattacagg cttgagccac cgcacccggc caacgtgtca gtatttcttt cctagcctgg 109441 gcaacactgt gaaaccctgt ctctccaaaa aatacaaaaa ttagctggct atggtggtgc 109501 atccctgtag tccctcctac ttgggaggct gaggtgggag gtcaattgag cctgggcggt 109561 gaagtctgca gtgagctgtg agtatgtcag tgcactccag actgggtgac agagcgagac 109621 tgtctcaaaa aaaatttttt ttttcctttt taaggctgaa tgctattcca ttgtatgaat 109681 ggaccacatt ttttttttcc caattgcaga tgtcaggctg aaagaatgga ccacatttta 109741 ggctgggcac gatggctcat gcctgtaatc ccagcacttt gggaggccga ggcaggcgga 109801 ttacaaggtc aggagttcga gaacagcctg tggtgaaacg ctgtctctac taaaaagaca 109861 aaaattagcc gagcgtggtg gtgcgtgcct gtaatcccag ctactcagga ggctgaggca 109921 ggagaatcgc ctgaacccag gaggtggagg ttgcagtgag ccaagatagc gccactgcac 109981 tctagcctgt gtgacagagt gagacttcgt ctcgaaaaaa aagaaaaaaa aagcggccag 110041 gcacggtggc tcacgcctgt aatcccagca ctttgggagg ccgagacagg cagatc // LOCUS D13641 3259 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0016 gene, complete cds. ACCESSION D13641 NID g285986 KEYWORDS KIAA0019; KIAA0016. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3259) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (11-NOV-1992) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 3259) AUTHORS Nomura,N., Miyajima,N., Kawarabayasi,Y. and Tabata,S. TITLE Prediction of new human genes by entire sequencing of randomly sampled cDNA clones JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 JOURNAL DNA Res. 1 (1), 27-35 (1994) MEDLINE 96051387 REFERENCE 4 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 (supplement) JOURNAL DNA Res. 1 (1), 47-56 (1994) MEDLINE 96051389 REFERENCE 5 (bases 1 to 3259) AUTHORS Seki,N., Moczko,M., Nagase,T., Zufall,N., Ehmann,B., Dietmeier,K., Schafer,E., Nomura,N. and Pfanner,N. TITLE A human homolog of the mitochondrial protein import receptor Mom19 can assemble with the yeast mitochondrial receptor complex JOURNAL FEBS Lett. 375 (3), 307-310 (1995) MEDLINE 96085231 FEATURES Location/Qualifiers source 1..3259 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myoblast" /chromosome="1" /map="1q42" /sex="male" 5'UTR <1..101 gene 102..539 /gene="KIAA0016" CDS 102..539 /gene="KIAA0016" /note="similar to fungal mitochondrial import receptor Mom19" /codon_start=1 /product="mitochondrial outer membrane protein 19" /db_xref="PID:d1003309" /db_xref="PID:g285987" /translation="MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKK QKLAKERAGLSKLPDLKDAEAVQKFFLEEIQLGEELLAQGEYEKGVDHLTNAIAVCGQ PQQLLQVLQQTLPPPVFQMLLTKLPTISQRIVSAQSLAEDDVE" 3'UTR 540..>3259 BASE COUNT 912 a 588 c 729 g 1030 t ORIGIN 1 ggccgtcggg tgtgagctgc gccgaccgct ctgagggttc gtggcccacc gctccttcgc 61 ggtccctgcc gccaccgtcc acgctcagcg ttgtagagaa gatggtgggt cggaacagcg 121 ccatcgccgc cggtgtatgc ggggcccttt tcattgggta ctgcatctac ttcgaccgca 181 aaagacgaag tgaccccaac ttcaagaaca ggcttcgaga acgaagaaag aaacagaagc 241 ttgccaagga gagagctggg ctttccaagt tacctgacct taaagatgct gaagctgttc 301 agaagttctt ccttgaagaa atacagcttg gtgaagagtt actagctcaa ggtgaatatg 361 agaagggcgt agaccatctg acaaatgcaa ttgctgtgtg tggacagcca cagcagttac 421 tgcaggtctt acagcaaact cttccaccac cagtgttcca gatgcttctg actaagctcc 481 caacaattag tcagagaatt gtaagtgctc agagcttggc tgaagatgat gtggaatgag 541 aaacaaatgt caacataata aaatctcagt taaaaatatt ttaaaaattc ttggtagttg 601 agcagctctg ggggaataag ggcaaatatg cttgttatga actacactga aatctaccaa 661 agttaatgtt tactttgtgt agatccattt gtctatttta tttatttttc ccagtgaaaa 721 gtgtattttg atagagaact tttcattcta taaatacact atgagttact aaaatatcat 781 ggattttgtt tattcctgaa acatagttac atagttaaac tgtacatatg acatggctta 841 tgttaaaaat acccagtgct cagttttgaa agataggcaa aaaaaaaaaa agtataggag 901 aaactgaaga atgtacactt ttttagaggg cacattttgc tgtaaatctg gaaatttgat 961 agacttgact gtgtttgtga aaactgagca ttaaaggttt tgattgatcc tttctttcca 1021 tttaatctct gagacgtaaa tatgtgaggt gtgctgctgt gctgggttaa cagcttcctt 1081 ccctttctgt gtagcagtct tgaaatgttc tgtttaaatc agtaggctta atgtgttctg 1141 ggtatttatc tccttgtatt ttaaatatat gtagttgcaa atagcaccag gaattagatt 1201 tctgtacacc cctaatctag ccttgtgagc ttcgctagtt aatgtgtgct cactttccct 1261 ccatttgtta cgtgagagaa tgcgtctgct gatcactgaa gtgtcccttt tagcttctga 1321 ttcattgggt tctgttgggc atctttaaat ccaccttaac ctgaggaatg tatgtgggca 1381 accaggccct gcattttttt atattctgaa ttttgcatgc ttgcctgact tagtatttct 1441 gaattgatgt tttttttaat ggtataacta tcttgatttt cactgaaatt atatggttct 1501 gtcactactc tgtaaattaa tccgaaactt ttaaggtaac tgggatgatc tgcttgtaaa 1561 aatgcttgtt gccttttgct ttatcttcag tgtacctcct taatcctgct tcaacttgat 1621 tatcttgtga aacgatgaga gtaagttgca accttgtgac tgaaaacttg aaaagagtgg 1681 agcaggtggg acctcttatt ctcaaatagt gacatattct ccgtagtcac agtttcagaa 1741 ctgagtaagg atccttggta cttggtggca tctgttgaac tgaggagcat ttctcattgt 1801 aaagattgcc tttgttctgt ctaaaagtct ggagaaatcc caaagacttt tcctatgtac 1861 taggcatttt attttgattg acttacaaac tcttcttaat cattatcaat ctcggttttt 1921 ttgtggtgca gtggaaggag aaataggtct agtttctgcc tctgattagc cgcacagcct 1981 tgaacaaatc acatttcatc tttgaactta cctctactgt tagactaggc gactcacatt 2041 tgaggacttt tctcgggtat cttgagggtt tgtgatcctg aacccttaaa cagtgctttt 2101 ttgttacaca ggagggcttt tttgggggga tgaccagtac agacatgcca gttagtttta 2161 ctagtgggat cccaaatcca aagcagtgta gtggtgattg gtcagtgact aaccaggcag 2221 ctaagaagtc ttaggcagca gcccagacat gtatagaggg gcagttagag ggagaacagg 2281 ggtgggaaag ggagcaaggg gcagatagct cagcaaggaa agaatgggct cagaaaaagg 2341 agggctggct ggaggagtga ggggcagctt aagtttgggg agggtagaaa cgccgtttcc 2401 ttgggaactg gagtgcagta tgagctgggt gtcacttggc tctgaacata ctggctttgc 2461 tgtaatgctt gaaaaggcgt tggtatcttc attttacagt tcattaaccc aagtacgttt 2521 tcttatttaa atgacaactt tggtgcttta aaatgaggta ccacttttta aagctagctg 2581 tgtcgagtta aagaaaaaat cagcagtttt ttctcccaga aatgtaattg ccaaacactt 2641 ttcatcccca tcttaagttt tacaaggtga tgtaatcagc ttgttgtagt gatgctggcc 2701 aaatggtgct cagcaggtga gaacaaaaaa accccagatt tcagtgaact aatacacagc 2761 ttgagcgttt ccatgtgcta atgttgcaca cttactaaaa aactttggaa atggaaaata 2821 atgtattagt gcaacagttg atgtgcttct ttgggcaaag atatagtttt gttccacaat 2881 ttgtacttaa aagcgaaaga acattgaaaa catagactta ctggctgtag caatgctggc 2941 ctgttaactg ataactagaa cttaggttca cgtttatgta aagtgtgtaa aacctagtag 3001 agcttgcata gtcggcactc agtaaatgtt tggttccttt tgccccttgg taagtttatt 3061 ttaccatcct cccacctgcc attctgactt tattaaatca acatgtggac cagagtgtta 3121 atgagatgtt attgcagaag agattgagaa aattggtata tcatgcagat aacatacaaa 3181 atctttttgt aacgtaaaaa atgcagtttt attattgctt gtgcctcaac tgtttaagtg 3241 aatattaaag ggcttggag // LOCUS D13644 4602 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0019 gene, complete cds. ACCESSION D13644 NID g1531551 KEYWORDS KIAA0019. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4602) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (11-NOV-1992) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 4602) AUTHORS Nomura,N., Miyajima,N., Kawarabayasi,Y. and Tabata,S. TITLE Prediction of new human genes by entire sequencing of randomly sampled cDNA clones JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 JOURNAL DNA Res. 1 (1), 27-35 (1994) MEDLINE 96051387 REFERENCE 4 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 (supplement) JOURNAL DNA Res. 1 (1), 47-56 (1994) MEDLINE 96051389 REFERENCE 5 (bases 1 to 4602) AUTHORS Matoskova,B., Wong,W.T., Seki,N., Nagase,T., Nomura,N., Robbins,K.C. and Di Fiore,P.P. TITLE RN-tre identifies a family of tre-related proteins displaying a novel potential protein binding domain JOURNAL Oncogene 12 (12), 2563-2571 (1996) MEDLINE 96293402 FEATURES Location/Qualifiers source 1..4602 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myoblast" /chromosome="10" /map="10p13" /sex="male" 5'UTR <1..279 gene 280..2766 /gene="KIAA0019" CDS 280..2766 /gene="KIAA0019" /codon_start=1 /product="protein related N-ternimus of tre oncogene" /db_xref="PID:d1003312" /db_xref="PID:g2104571" /translation="MNSDQDVALKLAQERAEIVAKYDRGREGAEIEPWEDADYLVYKV TDRFGFLHEEELPDHNVAVERQKHLEIERTTKWLKMLKGWEKYKNTEKFHRRIYKGIP LQLRGEVWALLLEIPKMKEETRDLYSKLKHRARGCSPDIRQIDLDVNRTFRDHIMFRD RYGVKQQSLFHVLAAYSIYNTEVGYCQGMSQITALLLMYMNEEDAFWALVKLFSGPKH AMHGFFVQGFPKLLRFQEHHEKILNKFLSKLKQHLDSQEIYTSFYTMKWFFQCFLDRT PFTLNLRIWDIYIFEGERVLTAMSYTILKLHKKHLMKLSMEELVEFFQETLAKDFFFE DDFVIEQLQISMTELKRAKLDLPEPGKEDEYPKKPLGQLPPELQSWGVHHLSNGQRSV GRPSPLASGRRESGAPHRRHEHSPHPQSRTGTPERAQPPRRKSVEEESKKLKDEADFQ RKLPSGPQDSSRQYNHAAANQNSNATSNIRKEFVPKWNKPSDVSATERTAKYTMEGKG RAAHPALAVTVPGPAEVRVSNVRPKMKALDAEDGKRGSTASQYDNVPGPELDSGASVE EALERAYSQSPRHALYPPSPRKHAEPSSSPSKVSNKFTFKVQPPSHARYPSQLDGEAR GLAHPPSYSNPPVYHGNSPKHFPTANSSFASPQFSPGTQLNPSRRPHGSTLSVSASPE KSYSRPSPLVLPSSRIEVLPVDTGAGGYSGNSGSPKNGKLIIPPVDYLPDNRTWSEVS YTYRPETQGQSWTRDASRGNLPKYSAFQLAPFQDHGLPAVSVDSPVRYKASPAAEDAS PSGYPYSGPPPPAYHYRNRDGLSIQESVLL" 3'UTR 2767..4602 BASE COUNT 1340 a 1041 c 999 g 1222 t ORIGIN 1 ggaggtgtcg cgccggggaa cttcctggtt cccgggctcg gcttcgccgg gatcctcttg 61 gaagggaaac aatggggcgg agggcactgc ggtagccgcc gccgccgccg cgccgcgccg 121 cgccggatct gctcggccgc ccgggaccgc cagctctgtc cgctgcccac agcctagcag 181 tcgggaccgt actgaggaca tgtattcctc tgagaaacct tggacagcag attttggttt 241 aatatctgat tgggacaaca ttcagaccca tttccagtca tgaattcaga ccaggatgta 301 gcactcaaac ttgcccagga gcgagctgaa atagttgcta aatatgacag aggacgagaa 361 ggtgcagaga ttgaaccttg ggaagatgct gattaccttg tttacaaagt cacagataga 421 tttggctttt tacatgagga ggagctccca gatcataatg tggctgtgga acggcaaaag 481 cacctggaaa ttgaaagaac taccaaatgg ctgaaaatgc tgaaaggatg ggaaaaatac 541 aagaacactg aaaagtttca taggcgaatt tacaaaggaa taccactcca gctcagaggt 601 gaagtctggg ccctccttct tgagatccct aaaatgaaag aagaaacaag ggacctgtat 661 agtaaattaa aacacagagc acggggctgt tcacctgaca tcagacaaat agacctggat 721 gtcaaccgca catttcggga ccacattatg tttagagaca gatatggtgt taagcaacaa 781 tccttattcc atgtgcttgc tgcctattct atttataaca cggaagtcgg gtattgtcag 841 gggatgagcc agatcacagc tttactcctc atgtatatga acgaggaaga tgccttctgg 901 gccctggtca aactcttctc aggccctaaa catgccatgc atggcttttt tgtccaaggt 961 tttcctaaac tcttgaggtt tcaagaacat catgaaaaaa tactgaacaa atttctgtcc 1021 aagcttaagc aacacttgga ttctcaagaa atctacacaa gtttttacac aatgaaatgg 1081 ttttttcagt gtttccttga tcgtactccc tttacactaa acctcagaat atgggatatc 1141 tacatctttg aaggagaacg agttcttact gctatgtctt acaccatctt aaaattacac 1201 aaaaaacatc taatgaaatt gtccatggaa gaacttgtag aattttttca ggagaccctg 1261 gcaaaggatt ttttctttga agatgatttt gtgatagagc aacttcagat ttctatgaca 1321 gaactaaagc gggcaaagtt agaccttcca gaacctggta aagaggatga atatccaaag 1381 aagcccttgg ggcagcttcc acctgaactt cagtcttggg gcgtccatca cttgagcaac 1441 ggacagagga gcgtgggccg gccgagcccg ctggccagcg gcaggaggga gagcggggcg 1501 ccccacagga ggcacgagca ctccccgcac ccccagagca ggaccgggac gcccgagaga 1561 gcacagccgc caagacggaa atcggtggag gaggagagca aaaagcttaa agatgaggca 1621 gattttcaaa gaaaactccc atcgggtcca caggacagtt ccaggcaata taatcacgca 1681 gctgccaacc aaaatagcaa cgccacttca aatatcagga aggagtttgt gcccaaatgg 1741 aataaaccgt cagacgtctc agctacagag agaactgcca aatacaccat ggaaggcaaa 1801 ggtcgagcag cgcaccccgc gctcgcagtt accgtcccag gtcctgccga ggtgcgggtg 1861 tcaaacgtgc ggccaaagat gaaggccctg gatgctgagg acgggaagcg gggctccact 1921 gcatcgcagt acgacaacgt gccaggcccg gagctggaca gcggcgcttc cgtggaggag 1981 gcgctggaaa gggcttactc ccagagcccc cggcatgccc tttaccctcc cagcccgaga 2041 aagcacgctg agccaagttc tagtccatca aaagtatcca acaagtttac ttttaaagta 2101 cagcctccaa gtcatgcacg atatccgtcc cagctagatg gggaagcccg agggctagct 2161 catcccccct cctacagcaa tccccccgtt taccacggaa actctcccaa acacttccct 2221 actgccaaca gcagctttgc ttctccacag tttagccctg ggactcaact gaatccttcc 2281 aggagacctc atggttctac tctttccgtc agtgcttctc cggagaaatc ttacagccgc 2341 ccaagccccc ttgtactgcc gtctagtcga atagaagtcc tccctgttga cactggtgct 2401 gggggatatt cgggcaattc agggtcacca aagaatggaa aattgatcat tccaccagtg 2461 gattacttgc cagataacag aacatggtca gaagttagtt atacatacag acctgagacg 2521 cagggacaat catggacccg agatgctagc cgtggcaatt taccaaaata ctcagccttt 2581 caactcgcac cctttcagga ccatggcctc cctgcagttt ctgtagatag tcccgtgaga 2641 tataaagctt caccggccgc agaagatgcc agtccatctg gatatccata ttcagggccc 2701 ccgcctccag cctaccacta caggaatcgg gacgggcttt ccatccaaga gtcagtgttg 2761 ctgtgagatt tgacgtgtac ttgctaaaga cgagagagaa accacgtgaa acctacattg 2821 ctatgttcat aattgccaaa gcagtattta tactattgta aacaactcgc acatctctcc 2881 tgtctgttag taataagaat acagggaaat gtcttcagcc ccacgtagat gctgcttaag 2941 atgagcgttt caattgcatc atcactgacc tgtagaattt aacccggaaa catcgcaata 3001 gcatttaggg atgcacgcac agtggtaatt tattactcag ttccggttat gtttttctcc 3061 aaaacctgaa catttttact ctgttggctc attctatttt gactaatgtt acaaagtact 3121 gaaaaaactg tagaatagat atttttaagg ctatatgtag tgtatgtgtc ttttaataga 3181 tataggcatc tgttcatcca catttgaaca cagagctaaa gaaacgtttt ttaaaaaact 3241 acttttaatc cagatccatg gaataatatg taattcttgt tttccttgtc ttttagtcca 3301 gtttttcttt ttccctctca tatatttgta acctgggtcc taaactgact gaagagtttg 3361 tcctgacttt attaattgca gtctcacaga atgctcatca ttctccttca gctgttgcca 3421 ttttctttca atctgaaaat taaagacatt ggagagaaag gctaaccact taatggtgct 3481 taaaaatcgg ggcacttaaa ataagtttac ggccaaaggt gcagcagtgc ttatcacatc 3541 cacatactaa gagattggtc ttagtgcgtt tgtgcccttt gttctggtct gactcttgcg 3601 aatctcagag caggagaagt tgcagcttac tgatccctca gtgctcacag tactcatgtg 3661 gggcagccac actaactgca ttagcagtgt ggttacttct ttcttctgcc tcgactcaca 3721 tggcttctcc ccacgaaggt cccagggtgg ccatttatac aatccatttc tgtcctcttt 3781 atggaatgta gtcgacggat attgctgcac ctcatcaggc cattacattg tccatgttaa 3841 aattataaaa atgatcctag tttttcatag gagtctagga ttttgatctt ttaaaaaata 3901 cagtgtgatt tcatgctttt gaacatcttt ttttttttca ttgaaacaaa ttcaagttct 3961 tggcttaaaa aaaaattagt gatgtcttta aaaaatattc tgttacacta cagataccaa 4021 aacactgaca aagatttagg cactccaaga gagtatgact tcgtgtttaa ttggggagct 4081 tgttttcaac attgcttata tttgaatagt agaaaacccg acattgatgt ttcttcctgt 4141 tgcaaggtgg ggctttaaac tttcaaggta caaaagtatg tcttttaaaa tggaagtaac 4201 gtttgtttag ctagtgaatg tgacaatata ttttatgtat ataatgagtc taatgtaatt 4261 ttgatcaact tggtaatgat gttattaaat agcaaaacaa gtgcagtgta ttaaagctaa 4321 acactacatt gaaacgcaaa caaaattctg gtgaaataac tgggctgttg atgcaggtca 4381 ggtggaagct gagaaataaa tgtatataaa tatgtagctg aggatattaa ttcccattat 4441 ataaccataa ttctgtgact tgtaataaac caagctgtac aatttagttt ataatagcag 4501 tatctgagct gctccaaaat atatatatca acagtggtaa atactgtaga tttttatata 4561 tatatatata tatatgcaaa aaatatatat atacacacaa gc // LOCUS D16815 2147 bp mRNA PRI 22-MAY-1997 DEFINITION Human mRNA for EAR-1r, complete cds. ACCESSION D16815 NID g2116671 KEYWORDS EAR-1r. SOURCE Homo sapiens osteoblast cell_line:HOS TE85 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2147) AUTHORS Kamizono,A. TITLE Direct Submission JOURNAL Submitted (17-JUL-1993) to the DDBJ/EMBL/GenBank databases. Akihito Kamizono, Mitsubishi Kasei Corporation Research Center, Pharmaceuticals Laboratory II; 1000, Kamoshida-cho, Midori-ku, Yokohama, Kanagawa 227, Japan (Tel:045-963-3398, Fax:045-963-3890) REFERENCE 2 (bases 1 to 2147) AUTHORS Kamizono,A., Shuichiro,K. and Kohei,U. TITLE Isolation of a novel member of the thyroid/steroid hormone receptor superfamily from human osteoblastic osteosarcoma HOS cell JOURNAL Unpublished (1994) FEATURES Location/Qualifiers source 1..2147 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HOS TE85" /cell_type="osteoblast" CDS 306..2045 /function="tyroid/steroid hormone receptor" /codon_start=1 /product="EAR-1r" /db_xref="PID:d1020903" /db_xref="PID:g2116672" /translation="MEVNAGGVIAYISSSSSASSPASCHSEGSENSFQSSSSSVPSSP NSSNSDTNGNPKNGDLANIEGILKNDRIDCSMKTSKSSAPGMTKSHSGVTKFSGMVLL CKVCGDVASGFHYGVHACEGCKGFFRRSIQQNIQYKKCLKNENCSIMRMNRNRCQQCR FKKCLSVGMSRDAVRFGRIPKREKQRMLIEMQSAMKTMMNSQFSGHLQNDTLVEHHEQ TALPAQEQLRPKPQLEQENIKSSSPPSSDFAKEEVIGMVTRAHKDTFMYNQEQQENSA ESMQPQRGERIPKNMEQYNLNHDHCGNGLSSHFPCSESQQHLNGQFKGRNIMHYPNGH AICIANGHCMNFSNAYTQRVCDRVPIDGFSQNENKNSYLCNTGGRMHLVCPMSKSPYV DPHKSGHEIWEEFSMSFTPAVKEVVEFAKRIPGFRDLSQHDQVNLLKAGTFEVLMVRF ASLFDAKERTVTFLSGKKYSVDDLHSMGAGDLLNSMFEFSEKLNALQLSDEEMSLFTA VVLVSADRSGIENVNSVEALQETLIRALRTLIMKNHPNEASIFTKLLLKLPDLRSLNN MHSEELLAFKVHP" BASE COUNT 589 a 483 c 533 g 542 t ORIGIN 1 gctgccctcc ccgtcagccg ccctcgccgc cgcggtgcgc tggctgcagg aagccgccgc 61 gccgccgctt ttgttgtcag ggacccagcg aggagcgccg ctcgccggcc gccgccaccc 121 tctctcgctg cagcctgctg tgcgctgcac ggcctggggc ccgggcgccc ccgcgtctgc 181 ccatgagggg gccccgcgac caccgctgct tccagcccgg ggcggcgcgg cgctgaggcg 241 gcggcggcgg cggcctgccc cctctgcggg aagcgggcgg ccccggccgc ctccgcgagg 301 gcaccatgga ggtgaatgca ggaggtgtga ttgcctatat cagttcttcc agctcagcct 361 caagccctgc ctcttgtcac agtgagggtt ctgagaatag tttccagtcc tcctcctctt 421 ctgttccatc ttctccaaat agctctaatt ctgataccaa tggtaatccc aagaatggtg 481 atctcgccaa tattgaaggc atcttgaaga atgatcgaat agattgttct atgaaaacaa 541 gcaaatcgag tgcacctggg atgacaaaaa gtcatagtgg tgtgacaaaa tttagtggca 601 tggttctact gtgtaaagtc tgtggggatg tggcgtcagg attccactat ggagttcatg 661 cttgcgaagg ctgtaagggt ttctttcgga gaagtattca acaaaacatc cagtacaaga 721 agtgcctgaa gaatgaaaac tgttctataa tgagaatgaa taggaacaga tgtcagcaat 781 gtcgcttcaa aaagtgtctg tctgttggaa tgtcaagaga tgctgttcgg tttggtcgta 841 ttcctaagcg tgaaaaacag aggatgctaa ttgaaatgca aagtgcaatg aagaccatga 901 tgaacagcca gttcagtggt cacttgcaaa atgacacatt agtagaacat catgaacaga 961 cagccttgcc agcccaggaa cagctgcgac ccaagcccca actggagcaa gaaaacatca 1021 aaagctcttc tcctccatct tctgattttg caaaggaaga agtgattggc atggtgacca 1081 gagctcacaa ggataccttt atgtataatc aagagcagca agaaaactca gctgagagca 1141 tgcagcccca gagaggagaa cggattccca agaacatgga gcaatataat ttaaatcatg 1201 atcattgcgg caatgggctt agcagccatt ttccctgtag tgagagccag cagcatctca 1261 atggacagtt caaagggagg aatataatgc attacccaaa tggtcatgcc atttgtattg 1321 caaatggaca ttgtatgaac ttctccaatg cttatactca aagagtatgt gatagagttc 1381 cgatagatgg attttctcag aatgagaaca agaatagtta cctgtgcaac actggaggaa 1441 gaatgcatct ggtttgtcca atgagtaagt ctccatatgt ggatcctcat aaatcaggac 1501 atgaaatctg ggaagaattt tcgatgagct tcactccagc agtgaaagaa gtggtggaat 1561 ttgcaaagcg tattcctggg ttcagagatc tctctcagca tgaccaggtc aaccttttaa 1621 aggctgggac ttttgaggtt ttaatggtac ggttcgcatc attatttgat gcaaaggaac 1681 gtactgtcac ctttttaagt ggaaagaaat atagtgtgga tgatttacac tcaatgggag 1741 caggggatct gctaaactct atgtttgaat ttagtgagaa gctaaatgcc ctccaactta 1801 gtgatgaaga gatgagtttg tttacagctg ttgtcctggt atctgcagat cgatctggaa 1861 tagaaaacgt caactctgtg gaggctttgc aggaaactct cattcgtgca ctaaggacct 1921 taataatgaa aaaccatcca aatgaggcct ctatttttac aaaactgctt ctaaagttgc 1981 cagatcttcg atctttaaac aacatgcact ctgaggagct cttggccttt aaagttcacc 2041 cttaaggcct ttgtttattt aaacatgaac tgatggtaac tgtacatttt gtgctaaaat 2101 gcatatttat atgtgcatac catatgtgga gatagaaaag accttta // LOCUS D17408 1517 bp mRNA PRI 16-JAN-1997 DEFINITION Human mRNA for calponin, complete cds. ACCESSION D17408 NID g1783204 KEYWORDS calponin. SOURCE Homo sapiens adult male aorta smooth muscle cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1517) AUTHORS Takahashi,K. TITLE not determined JOURNAL Unpublished (1993) REFERENCE 2 (bases 1 to 1517) AUTHORS Takahashi,K. TITLE Direct Submission JOURNAL Submitted (13-AUG-1993) to the DDBJ/EMBL/GenBank databases. Katsuhito Takahashi, Osaka Medical Center for Cancer and Cardiovascular Medicine, Medicine; 1-3-3 Nakamichi, Higashinari-ku, Osaka, Osaka 537, Japan (Tel:06-972-1181(ex.2365), Fax:06-972-7749) FEATURES Location/Qualifiers source 1..1517 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="smooth muscle" /chromosome="19" /dev_stage="adult" /map="19p13.2" /sex="male" /tissue_type="aorta" CDS 93..986 /note="calculated MW 33161; pI 9.83; net charge +12; an actin-, tropomyosin- and calmodulin-binding protein (Takahashi,K. et al. (1988) Hypertension 11, 620-626) in smooth muscle" /codon_start=1 /product="calponin" /db_xref="PID:d1004750" /db_xref="PID:g1783205" /translation="MSSAHFNRGPAYGLSAEVKNKLAQKYDHQREQELREWIEGVTGR RIGNNFMDGLKDGIILCEFINKLQPGSVKKINESTQNWHQLENIGNFIKAITKYGVKP HDIFEANDLFENTNHTQVQSTLLALASMAKTKGNKVNVGVKYAEKQERKFEPGKLREG RNIIGLQMGTNKFASQQGMTAYGTRRHLYDPKLGTDQPLDQATISLQMGTNKGASQAG MTAPGTKRQIFEPGLGMEHCDTLNVSLQMGSNKGASQRGMTVYGLPRQVYDPKYCLTP EYPELGEPAHNHHAHNYYNSA" polyA_signal 1495..1500 polyA_site 1517 /note="22 a nucleotides" BASE COUNT 367 a 472 c 430 g 248 t ORIGIN 1 aggagggaag agtgtgcaga cggaacttca gccgctgcct ctgttctcag cgtcagtgcc 61 gccactgccc ccgccagagc ccaccggcca gcatgtcctc tgctcacttc aaccgaggcc 121 ctgcctacgg gctgtcagcc gaggttaaga acaagctggc ccagaagtat gaccaccagc 181 gggagcagga gctgagagag tggatcgagg gggtgacagg ccgtcgcatc ggcaacaact 241 tcatggacgg cctcaaagat ggcatcattc tttgcgaatt catcaataag ctgcagccag 301 gctccgtgaa gaagatcaat gagtcaaccc aaaattggca ccagctggag aacatcggca 361 acttcatcaa ggccatcacc aagtatgggg tgaagcccca cgacattttt gaggccaacg 421 acctgtttga gaacaccaac catacacagg tgcagtccac cctcctggct ttggccagca 481 tggcgaagac gaaaggaaac aaggtgaacg tgggagtgaa gtacgcagag aagcaggagc 541 ggaaattcga gccggggaag ctaagagaag ggcggaacat cattgggctg cagatgggca 601 ccaacaagtt tgccagccag cagggcatga cggcctatgg cacccggcgc cacctctacg 661 accccaagct gggcacagac cagcctctgg accaggcgac catcagcctg cagatgggca 721 ccaacaaagg agccagccag gctggcatga ctgcgccagg gaccaagcgg cagatcttcg 781 agccggggct gggcatggag cactgcgaca cgctcaatgt cagcctgcag atgggcagca 841 acaagggcgc ctcgcagcgg ggcatgacgg tgtatgggct gccacgccag gtctacgacc 901 ccaagtactg tctgactccc gagtacccag agctgggtga gcccgcccac aaccaccacg 961 cacacaacta ctacaattcc gcctagggcc acaaggcctt ccctgttttc cccccaaggg 1021 aggctgctgc tgctcttggc tggacccagc cagggcccaa gccgaccccc ctctccctgc 1081 atggcatcct ccagcccctg tagaactcaa cctctacagg gttagagttt ggagagagca 1141 gactggcggg gggcccattg gggggaaggg gaccctccgc tctgtagtgc tacagggtcc 1201 aacatagagc cgggtgtccc caacagcgcc caaaggacgc actgagcaac gctattccag 1261 ctgtcccccc actccctcac aagtgggtac ccccaggacc agaagctccc ccagcaaagc 1321 ccccagagcc caggctcggc ctgcccccac cccattcccg cagtgggagc aaactgcatg 1381 cccagagacc cagcggacac acgcggtttg gtttgcagcg actggcatac tatgtggatg 1441 tgacagtggc gtttgtaatg agagcacttt cttttttttc tatttcactg gagcacaata 1501 aatggctgta aaatctc // LOCUS D17525 4489 bp mRNA PRI 25-NOV-1996 DEFINITION Human mRNA for precursor of P100 serine protease of Ra-reactive factor, complete cds. ACCESSION D17525 NID g439712 KEYWORDS P100 serine protease of Ra-reactive factor; CRARF; 29-kDa chain of P100; 70-kDa chain of P100; precursor of P100 serine protease of Ra-reactive factor. SOURCE Homo sapiens liver cDNA to mRNA, clone_lib:lambda gt10 phage. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4489) AUTHORS Takada,F., Takayama,Y., Hatsuse,H. and Kawakami,M. TITLE A new member of the C1s family of complement proteins found in a bactericidal factor, Ra-reactive factor, in human serum JOURNAL Biochem. Biophys. Res. Commun. 196 (2), 1003-1009 (1993) MEDLINE 94059062 REFERENCE 2 (bases 1 to 4489) AUTHORS Takada,F. TITLE Direct Submission JOURNAL Submitted (01-SEP-1993) to the DDBJ/EMBL/GenBank databases. Fumio Takada, Kitasato University School of Medicine, Pediatrics and Molecular Biology; 1-15-1 Kitasato, Sagamihara, Kanagawa 228, Japan (E-mail:ftakada@kitasato-u.ac.jp, Tel:0427-78-9115, Fax:0427-78-8441) COMMENT Submitted (01-Sep-1993) to DDBJ by: Fumio Takada Dept. of Pediatrics and Molecular Biology Kitasato University School of Medicine 1-15-1 Kitasato, Sagamihara Kanagawa 228 Japan Phone: 0427-78-9115 Fax: 0427-78-8441 E-mail: ftakada@kitasato-u.ac.jp. FEATURES Location/Qualifiers source 1..4489 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda gt10 phage" /tissue_type="liver" mRNA <1..>4489 /gene="CRARF" /standard_name="mRNA of P100 protein of Ra-reactive factor (RaRF)" /note="human liver" /evidence=experimental 5'UTR <1..462 /gene="CRARF" gene 1..4489 /gene="CRARF" sig_peptide 463..519 /gene="CRARF" CDS 463..2562 /gene="CRARF" /codon_start=1 /evidence=experimental /product="precursor of P100 serine protease of Ra-reactive factor" /db_xref="PID:d1005002" /db_xref="PID:g439713" /translation="MRWLLLYYALCFSLSKASAHTVELNNMFGQIQSPGYPDSYPSDS EVTWNITVPDGFRIKLYFMHFNLESSYLCEYDYVKVETEDQVLATFCGRETTDTEQTP GQEVVLSPGSFMSITFRSDFSNEERFTGFDAHYMAVDVDECKEREDEELSCDHYCHNY IGGYYCSCRFGYILHTDNRTCRVECSDNLFTQRTGVITSPDFPNPYPKSSECLYTIEL EEGFMVNLQFEDIFDIEDHPEVPCPYDYIKIKVGPKVLGPFCGEKAPEPISTQSHSVL ILFHSDNSGENRGWRLSYRAAGNECPELQPPVHGKIEPSQAKYFFKDQVLVSCDTGYK VLKDNVEMDTFQIECLKDGTWSNKIPTCKIVDCRAPGELEHGLITFSTRNNLTTYKSE IKYSCQEPYYKMLNNNTGIYTCSAQGVWMNKVLGRSLPTCLPVCGLPKFSRKLMARIF NGRPAQKGTTPWIAMLSHLNGQPFCGGSLLGSSWIVTAAHCLHQSLDPKDPTLRDSDL LSPSDFKIILGKHWRLRSDENEQHLGVKHTTLHPKYDPNTFENDVALVELLESPVLNA FVMPICLPEGPQQEGAMVIVSGWGKQFLQRFPETLMEIEIPIVDHSTCQKAYAPLKKK VTRDMICAGEKEGGKDACSGDSGGPMVTLNRERGQWYLVGTVSWGDDCGKKDRYGVYS YIHHNKDWIQRVTGVRN" mat_peptide 520..2559 /gene="CRARF" /standard_name="P100 protein of Ra-reactive factor (RaRF)" /note="forms RaRF complex with mannose-binding protein; serine protease;sequence homologous to mouse P100; module homologous to C1r/C1s of complement" /function="activation of C4 and C2 components of complement; bactericidal" /product="P100 serine protease of Ra-reactive factor" mat_peptide 520..1806 /gene="CRARF" /note="internal repeat 1; epidermal growth factor-like module; internal repeat 2; Sushi module 1; Sushi module 2" /product="70-kDa chain of P100" mat_peptide 1807..2559 /gene="CRARF" /note="protease domain" /product="29-kDa chain of P100" 3'UTR 2563..>4489 /gene="CRARF" polyA_signal 2948..2953 /gene="CRARF" polyA_signal 3397..3402 /gene="CRARF" BASE COUNT 1157 a 1210 c 1098 g 1024 t ORIGIN Chromosome 3q27-q28. 1 tggcgataca ttcacacagg aacagctatg ccatgtttac gaattccggt tttgaaaaaa 61 ctttcgttga cagttacaca aagggtcact tcctccccag cgacacatgg gcctctcaaa 121 ggagaggagg gagtaagtcc cacggtaggg ccagtggttg ctccctgggt tttggaatca 181 tttctgcgga gctttcaagg ccagaccctg ggcttagggt cgagacttta tagcagtgac 241 agccagaccc agcaagatgg ctgcgaccgt gaaaccctgg gcggcgatcc gggtgcgcat 301 catgagctga gagcgctggc tgttgccccg gtggaaggag tagaggccgt aggtgagggc 361 ggccgccgtg gccaggcaac ctatgggtac caccgggttc tcgcgggcaa gtcaagctgg 421 gaggaccaag gccgggcagc cgggagcacc caaggcagga aaatgaggtg gctgcttctc 481 tattatgctc tgtgcttctc cctgtcaaag gcttcagccc acaccgtgga gctaaacaat 541 atgtttggcc agatccagtc gcctggttat ccagactcct atcccagtga ttcagaggtg 601 acttggaata tcactgtccc agatgggttt cggatcaagc tttacttcat gcacttcaac 661 ttggaatcct cctacctttg tgaatatgac tatgtgaagg tagaaactga ggaccaggtg 721 ctggcaacct tctgtggcag ggagaccaca gacacagagc agactcccgg ccaggaggtg 781 gtcctctccc ctggctcctt catgtccatc actttccggt cagatttctc caatgaggag 841 cgtttcacag gctttgatgc ccactacatg gctgtggatg tggacgagtg caaggagagg 901 gaggacgagg agctgtcctg tgaccactac tgccacaact acattggcgg ctactactgc 961 tcctgccgct tcggctacat cctccacaca gacaacagga cctgccgagt ggagtgcagt 1021 gacaacctct tcactcaaag gactggggtg atcaccagcc ctgacttccc aaacccttac 1081 cccaagagct ctgaatgcct gtataccatc gagctggagg agggtttcat ggtcaacctg 1141 cagtttgagg acatatttga cattgaggac catcctgagg tgccctgccc ctatgactac 1201 atcaagatca aagttggtcc aaaagttttg gggcctttct gtggagagaa agccccagaa 1261 cccatcagca cccagagcca cagtgtcctg atcctgttcc atagtgacaa ctcgggagag 1321 aaccggggct ggaggctctc atacagggct gcaggaaatg agtgcccaga gctacagcct 1381 cctgtccatg ggaaaatcga gccctcccaa gccaagtatt tcttcaaaga ccaagtgctc 1441 gtcagctgtg acacaggcta caaagtgctg aaggataatg tggagatgga cacattccag 1501 attgagtgtc tgaaggatgg gacgtggagt aacaagattc ccacctgtaa aattgtagac 1561 tgtagagccc caggagagct ggaacacggg ctgatcacct tctctacaag gaacaacctc 1621 accacataca agtctgagat caaatactcc tgtcaggagc cctattacaa gatgctcaac 1681 aataacacag gtatatatac ctgttctgcc caaggagtct ggatgaataa agtattgggg 1741 agaagcctac ccacctgcct tccagtgtgt gggctcccca agttctcccg gaagctgatg 1801 gccaggatct tcaatggacg cccagcccag aaaggcacca ctccctggat tgccatgctg 1861 tcacacctga atgggcagcc cttctgcgga ggctcccttc taggctccag ctggatcgtg 1921 accgccgcac actgcctcca ccagtcactc gatccgaaag atccgaccct acgtgattca 1981 gacttgctca gcccttctga cttcaaaatc atcctgggca agcattggag gctccggtca 2041 gatgaaaatg aacagcatct cggcgtaaaa cacaccactc tccaccccaa gtatgatccc 2101 aacacattcg agaatgacgt ggctctggtg gagctgttgg agagcccagt gctgaatgcc 2161 ttcgtgatgc ccatctgtct gcctgaggga ccccagcagg aaggagccat ggtcatcgtc 2221 agcggctggg gaaagcagtt cttgcaaagg ttcccagaga ccctgatgga gattgaaatc 2281 ccgattgttg accacagcac ctgccagaag gcttatgccc cgctgaagaa gaaagtgacc 2341 agggacatga tctgtgctgg ggagaaggaa gggggaaagg acgcctgttc gggtgactct 2401 ggaggcccca tggtgaccct gaatagagaa agaggccagt ggtacctggt gggcactgtg 2461 tcctggggtg atgactgtgg gaagaaggac cgctacggag tatactctta catccaccac 2521 aacaaggact ggatccagag ggtcaccgga gtgaggaact gaatttggct cctcagcccc 2581 agcaccacca gctgtgggca gtcagtagca gaggacgatc ctccgatgaa agcagccatt 2641 tctcctttcc ttcctcccat cccccctcct tcggcctatc cattactggg caatagagca 2701 ggtatcttca cccccttttc actctcttta aagagatgga gcaagagagt ggtcagaaca 2761 caggccgaat ccaggctcta tcacttacta gttttcagtt ctgggcaggt gacttcatct 2821 cttcgaactt cagtttcttc ataagatgga aatgctatac cttacctacc tcgtaaaagt 2881 ctgatgagga aaagattaac taatagatgc atagcactta acagagtgca tagcatacac 2941 tgttttcaat aaatgcacct tagcagaagg tcgatgtgtc taccaggcag acgaagctct 3001 cttacaaacc cctgcctggg tcttagcatt gatcagtgac acacctctcc cctcaacctt 3061 gaccatctcc atctgccctt aaatgctgta tgcttttttg ccaccgtgca acttgcccaa 3121 catcaatctt caccctcatc cctaaaaaag taaaacagac aaggttctga gtcctgtggt 3181 atgtccccta gcaaatgtag ctaggaacat gcactagatg ccagattgcg ggagggcctg 3241 agagaagcag ggacaggagg gagcctgggg attgtggttt gggaaggcag acacctggtt 3301 ctagaactag ctctgccctt agccccctgt atgaccctat gcaagtcctc ctccctcatc 3361 tcaaagggtc ctccaagctc tgacgatcta agatacaata aagccatttt ccccctgata 3421 agatgaggta aagccaatgt aaccaaaagg caaaaattac aatcggttca aaggaacttt 3481 gatgcagaca aaatgctgct gctgctgctc ctgaaatacc cacccctttc cactacgggt 3541 gggttcccaa gaacatggaa caggcaaagt gtgagccaaa ggatccttcc ttattcctaa 3601 gcagagcatc tgctctgggc cctggcctcc ttcccttctt gggaaactgg gctgcatgaa 3661 ggtgggccct ggtagtttgt accccaaggc ccctatactc ttccttccta tgtccacagc 3721 tgaccccaag cagccgttcc ccgactcctc acccctgagc ctcaccctga actccctcat 3781 cttgcaaggc cataagtgtt ttccaagcaa aatgcctctc ccatcctctc tcaggaagct 3841 tctagagact ttatgccctc cagagctcca agatataagc cctccaaggg atcagaagct 3901 ccaagttcct gtcttctgtt ttatagaaat tgatcttccc tgggggactt taactcttga 3961 cctgtatgca gctgttggag taattccagg tctcttgaaa aaaaagagga agataatgga 4021 gaatgagaac atatatatat atatattaag ccccaggctg aatacccagg gacagcaatt 4081 cacagcctgc ctctggttct ataaacaagt cattctacct ctttgtgccc tgctgtttat 4141 tctgtaaggg gaaggtggca atgggaccca gctccatcag acacttgtca agctagcaga 4201 aactccattt tcaatgccaa agaagaactg taatgctgtt ttggaatcat cccaaggcat 4261 cccaaaacac catatcttcc catttcaagc actgcctggg cacaccccaa catcccaggc 4321 tgtggtggct cctgtgggaa ctacctagat gaagagagta tcatttatac cttctaggag 4381 ctcctattgg gagacatgaa acatatgtaa ttgactacca tgtaatagaa caaaccctgc 4441 caagtgctgc tttggaaagt catggaggta aaagaaagac ccgaaattc // LOCUS D21878 1411 bp mRNA PRI 20-JUN-1997 DEFINITION Human mRNA for BST-1, complete cds. ACCESSION D21878 NID g506334 KEYWORDS BST-1; pre-B cell growth; CD157. SOURCE Homo sapiens bone marrow stromal cell cDNA to mRNA, clone BST-1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1411) AUTHORS Hirano,T. TITLE Direct Submission JOURNAL Submitted (01-NOV-1993) to the DDBJ/EMBL/GenBank databases. Toshio Hirano, Osaka Univ. Med. Sch., Division of Molecular Oncology; 2-2, Yamadaoka, Suita, Osaka 565, Japan (Tel:06-879-3880, Fax:06-879-3889) REFERENCE 2 (bases 1 to 1411) AUTHORS Kaisho,T., Ishikawa,J., Oritani,K., Inazawa,J., Tomizawa,H., Muraoka,O., Ochi,T. and Hirano,T. TITLE BST-1, a surface molecule of bone marrow stromal cell lines that facilitates pre-B-cell growth JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91 (12), 5325-5329 (1994) MEDLINE 94261578 FEATURES Location/Qualifiers source 1..1411 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="bone marrow stromal cell" /chromosome="4p15" sig_peptide 128..211 CDS 128..1084 /codon_start=1 /product="BST-1 precursor" /db_xref="PID:d1005423" /db_xref="PID:g999429" /translation="MAAQGCAASRLLQLLLQLLLLLLLLAAGGARARWRAEGTSAHLR DIFLGRCAEYRALLSPEQRNKNCTAIWEAFKVALDKDPCSVLPSDYDLFINLSRHSIP RDKSLFWENSHLLVNSFADNTRRFMPLSDVLYGRVADFLSWCRQKNDSGLDYQSCPTS EDCENNPVDSFWKRASIQYSKDSSGVIHVMLNGSEPTGAYPIKGFFADYEIPNLQKEK ITRIEIWVMHEIGGPNVESCGEGSMKVLEKRLKDMGFQYSCINDYRPVKLLQCVDHST HPDCALKSAAAATQRKAPSLYTEQRAGLIIPLFLVLASRTQL" mat_peptide 212..1081 /function="facilitate pre-B-cell growth" /product="BST-1" BASE COUNT 363 a 339 c 362 g 347 t ORIGIN 1 cgggaaacgg caaacagcga gatatccgag cgagagtccc gccctgcatc agtttgcgga 61 accgccttgg tagaaggaga gaaggggagt ggaggaagca cgggactgga gggaccaaag 121 ttccccgatg gcggcccagg ggtgcgcggc atcgcggctg ctccagctgc tgctgcagct 181 tctgcttcta ctgttgctgc tggcggcggg cggggcgcgc gcgcggtggc gcgcggaggg 241 caccagcgca cacttgcggg acatcttcct gggccgctgc gccgagtacc gcgcactgct 301 gagtcccgag cagcggaaca agaactgcac agccatctgg gaagccttta aagtggcgct 361 ggacaaggat ccctgctccg tgctgccctc agactatgac ctttttatta acttgtccag 421 gcactctatt cccagagata agtccctgtt ctgggaaaat agccacctcc ttgttaacag 481 ctttgcagac aacacccgtc gttttatgcc cctgagcgat gttctgtatg gcagggttgc 541 agatttcttg agctggtgtc gacagaaaaa tgactctgga ctcgattacc aatcctgccc 601 tacatcagaa gactgtgaaa ataatcctgt ggattccttt tggaaaaggg catccatcca 661 gtattccaag gatagttctg gggtgatcca cgtcatgctg aatggttcag agccaacagg 721 agcctatccc atcaaaggtt tttttgcaga ttatgaaatt ccaaacctcc agaaggaaaa 781 aattacacga atcgagatct gggttatgca tgaaattggg ggacccaatg tggaatcctg 841 cggggaaggc agcatgaaag tcctggaaaa gaggctgaag gacatggggt tccagtacag 901 ctgtattaat gattaccgac cagtgaagct cttacagtgc gtggaccaca gcacccatcc 961 tgactgtgcc ttaaagtcgg cagcagccgc tactcaaaga aaagccccaa gtctttatac 1021 agaacaaagg gcgggtctta tcattcccct ctttctggtg ctggcttccc ggactcaact 1081 gtaactggaa actgtgttgc tctaaccctc ctccagccct gcagcctccc cttgcagtca 1141 tcattcgtgt tctgtgtata ccaaatgatt ctgttatcta aagaagcttt ttgctgggaa 1201 aacgatgtcc tgaaaatggt atttcaatga ggcatatgtt caggatttca gaaacaagaa 1261 gttagttcta tttagcaggt taaaaaatgc tgcattagaa ttaaagcaag ttattttctt 1321 atttgtataa tgacacaaag cattgggagt cagactgctt gtatattatc aaacatttta 1381 agagaattct aataaagctg tattttacat c // LOCUS D26579 3236 bp mRNA PRI 27-MAR-1997 DEFINITION Human mRNA for transmembrane protein, complete cds. ACCESSION D26579 NID g1864004 KEYWORDS transmembrane protein; CD156; ADAM8; MS2. SOURCE Homo sapiens adult blood macrophage, neutrophil leukocyte cell_line:THP-1 cDNA to mRNA, clone_lib:THP-1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Yoshiyama,K., Higuchi,Y., Kataoka,M., Matsuura,K. and Yamamoto,S. TITLE CD156 (Human ADAM8): expression, primary amino acid sequence, and gene location JOURNAL Genomics (1997) In press REFERENCE 2 (bases 1 to 3236) AUTHORS Yoshiyama,K., Setoguchi,M., Kaei,N., Matsuura,K., Higuchi,Y. and Yamamoto,S. TITLE Molecular cloning of human MS2, a myelomonocytic cell surface protein which is a member of hemorrhagic snake venom family, and its function JOURNAL Unpublished (1994) REFERENCE 3 (bases 1 to 3236) AUTHORS Yamamoto,S. TITLE Direct Submission JOURNAL Submitted (20-JAN-1994) to the DDBJ/EMBL/GenBank databases. Shunsuke Yamamoto, Oita Medical University, Department of Pathology; Hasama-machi, Oita 879-55, Japan (Tel:0975-49-4411(ex.2690), Fax:0975-86-5699) COMMENT Submitted (20-Jan-1994) to DDBJ by: Shunsuke Yamamoto Department of Pathology Oita Medical University 1-1 Idaigaoka, Hasama-machi Oita-gun, Oita 879-55 Japan Phone: 0975-49-4411 x2690 Fax: 0975-49-4217. FEATURES Location/Qualifiers source 1..3236 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="THP-1" /cell_type="macrophage, neutrophil leukocyte" /clone_lib="THP-1" /dev_stage="adult" /tissue_type="blood" 5'UTR 1..9 gene 10..2484 /gene="ms2" CDS 10..2484 /gene="ms2" /function="metalloproteinase and platelet aggregation inhibitor" /note="CD156; ADAM8; MS2" /codon_start=1 /product="transmembrane protein" /db_xref="PID:d1006171" /db_xref="PID:g1864005" /translation="MRGLGLWLLGAMMLPAIAPSRPWALMEQYEVVLPRRLPGPRVRR ALPSHLGLHPERVSYVLGATGHNFTLHLRKNRDLLGSGYTETYTAANGSEVTEQPRGQ DHCLYQGHVEGYPDSAASLSTCAGLRGFFQVGSDLHLIEPLDEGGEGGRHAVYQAEHL LQTAGTCGVSDDSLGSLLGPRTAAVFRPRPGDSLPSRETRYVELYVVVDNAEFQMLGS EAAVRHRVLEVVNHVDKLYQKLNFRVVLVGLEIWNSQDRFHVSPDPSVTLENLLTWQA RQRTRRHLHDNVQLITGVDFTGTTVGFARVSAMCSHSSGAVNQDHSKNPVGVACTMAH EMGHNLGMDHDENVQGCRCQERFEAGRCIMAGSIGSSFPRMFSDCSQAYLESFLERPQ SVCLANAPDLSHLVGGPVCGNLFVERGEQCDCGPPEDCRNRCCNSTTCQLAEGAQCAH GTCCQECKVKPAGELCRPKKDMCDLEEFCDGRHPECPEDAFQENGTPCSGGYCYNGAC PTLAQQCQAFWGPGGQAAEESCFSYDILPGCKASRYRADMCGVLQCKGGQQPLGRAIC IVDVCHALTTEDGTAYEPVPEGTRCGPEKVCWKGRCQDLHVYRSSNCSAQCHNHGVCN HKQECHCHAGWAPPHCAKLLTEVHAASGSLPVLVVVVLVLLAVVLVTLAGIIVYRKAR SRILSRNVAPKTTMGRSNPLFHQAASRVPAKGGAPAPSRGPQELVPTTHPGQPARHPA SSVALKRPPPAPPVTVSSPPFPVPVYTRQAPKQVIKPTFAPPVPPVKPGAGAANPGPA EGAVGPKVALKPPIQRKQGAGAPTAP" sig_peptide 10..57 /gene="ms2" mat_peptide 58..2481 /gene="ms2" 3'UTR 2482..3236 polyA_signal 3199..3204 BASE COUNT 580 a 1048 c 1010 g 598 t ORIGIN 1 gacccggcca tgcgcggcct cgggctctgg ctgctgggcg cgatgatgct gcctgcgatt 61 gcccccagcc ggccctgggc cctcatggag cagtatgagg tcgtgttgcc gcggcgtctg 121 ccaggccccc gagtccgccg agctctgccc tcccacttgg gcctgcaccc agagagggtg 181 agctacgtcc ttggggccac agggcacaac ttcaccctcc acctgcggaa gaacagggac 241 ctgctgggtt ccggctacac agagacctat acggctgcca atggctccga ggtgacggag 301 cagcctcgcg ggcaggacca ctgcttatac cagggccacg tagaggggta cccggactca 361 gccgccagcc tcagcacctg tgccggcctc aggggtttct tccaggtggg gtcagacctg 421 cacctgatcg agcccctgga tgaaggtggc gagggcggac ggcacgccgt gtaccaggct 481 gagcacctgc tgcagacggc cgggacctgc ggggtcagcg acgacagcct gggcagcctc 541 ctgggacccc ggacggcagc cgtcttcagg cctcggcccg gggactctct gccatcccga 601 gagacccgct acgtggagct gtatgtggtc gtggacaatg cagagttcca gatgctgggg 661 agcgaagcag ccgtgcgtca tcgggtgctg gaggtggtga atcacgtgga caagctatat 721 cagaaactca acttccgtgt ggtcctggtg ggcctggaga tttggaatag tcaggacagg 781 ttccacgtca gccccgaccc cagtgtcaca ctggagaacc tcctgacctg gcaggcacgg 841 caacggacac ggcggcacct gcatgacaac gtacagctca tcacgggtgt cgacttcacc 901 gggactactg tggggtttgc cagggtgtcc gccatgtgct cccacagctc aggggctgtg 961 aaccaggacc acagcaagaa ccccgtgggc gtggcctgca ccatggccca tgagatgggc 1021 cacaacctgg gcatggacca tgatgagaac gtccagggct gccgctgcca ggaacgcttc 1081 gaggccggcc gctgcatcat ggcaggcagc attggctcca gtttccccag gatgttcagt 1141 gactgcagcc aggcctacct ggagagcttt ttggagcggc cgcagtcggt gtgcctcgcc 1201 aacgcccctg acctcagcca cctggtgggc ggccccgtgt gtgggaacct gtttgtggag 1261 cgtggggagc agtgcgactg cggccccccc gaggactgcc ggaaccgctg ctgcaactct 1321 accacctgcc agctggctga gggggcccag tgtgcgcacg gtacctgctg ccaggagtgc 1381 aaggtgaagc cggctggtga gctgtgccgt cccaagaagg acatgtgtga cctcgaggag 1441 ttctgtgacg gccggcaccc tgagtgcccg gaagacgcct tccaggagaa cggcacgccc 1501 tgctccgggg gctactgcta caacggggcc tgtcccacac tggcccagca gtgccaggcc 1561 ttctgggggc caggtgggca ggctgccgag gagtcctgct tctcctatga catcctacca 1621 ggctgcaagg ccagccggta cagggctgac atgtgtggcg ttctgcagtg caagggtggg 1681 cagcagcccc tggggcgtgc catctgcatc gtggatgtgt gccacgcgct caccacagag 1741 gatggcactg cgtatgaacc agtgcccgag ggcacccggt gtggaccaga gaaggtttgc 1801 tggaaaggac gttgccagga cttacacgtt tacagatcca gcaactgctc tgcccagtgc 1861 cacaaccatg gggtgtgcaa ccacaagcag gagtgccact gccacgcggg ctgggccccg 1921 ccccactgcg cgaagctgct gactgaggtg cacgcagcgt ccgggagcct ccccgtcctc 1981 gtggtggtgg ttctggtgct cctggcagtt gtgctggtca ccctggcagg catcatcgtc 2041 taccgcaaag cccggagccg catcctgagc aggaacgtgg ctcccaagac cacaatgggg 2101 cgctccaacc ccctgttcca ccaggctgcc agccgcgtgc cggccaaggg cggggctcca 2161 gccccatcca ggggccccca agagctggtc cccaccaccc acccgggcca gcccgcccga 2221 cacccggcct cctcggtggc tctgaagagg ccgccccctg ctcctccggt cactgtgtcc 2281 agcccaccct tcccagttcc tgtctacacc cggcaggcac caaagcaggt catcaagcca 2341 acgttcgcac ccccagtgcc cccagtcaaa cccggggctg gtgcggccaa ccctggtcca 2401 gctgagggtg ctgttggccc aaaggttgcc ctgaagcccc ccatccagag gaagcaagga 2461 gccggagctc ccacagcacc ctaggggggc acctgcgcct gtgtggaaat ttggagaagt 2521 tgcggcagag aagccatgcg ttccagcctt ccacggtcca gctagtgccg ctcagcccta 2581 gaccctgact ttgcaggctc agctgctgtt ctaacctcag taatgcatct acctgagagg 2641 ctcctgctgt ccacgccctc agccaattcc ttctccccgc cttggccacg tgtagcccca 2701 gctgtctgca ggcaccaggc tgggatgagc tgtgtgcttg cgggtgcgtg tgtgtgtacg 2761 tgtctccagg tggccgctgg tctcccgctg tgttcaggag gccacatata cagcccctcc 2821 cagccacacc tgcccctgct ctggggcctg ctgagccggc tgccctgggc acccggttcc 2881 aggcagcaca gacgtggggc atccccagaa agactccatc ccaggaccag gttcccctcc 2941 gtgctcttcg agagggtgtc agtgagcaga ctgcacccca agctcccgac tccaggtccc 3001 ctgatcttgg gcctgtttcc catgggattc aagagggaca gccccagctt tgtgtgtgtt 3061 taagcttagg aatgcccttt atggaaaggg ctatgtggga gagtcagcta tcttgtctgg 3121 ttttcttgag acctcagatg tgtgttcagc agggctgaaa gcttttattc tttaataatg 3181 agaaatgtat attttactaa taaattattg accgagttct gtagattctt gttaga // LOCUS D30783 4627 bp mRNA PRI 05-SEP-1997 DEFINITION Homo sapiens mRNA for epiregulin, complete cds. ACCESSION D30783 NID g2381480 KEYWORDS growth regulator; epiregulin. SOURCE Homo sapiens colorectal adenocarcinoma epithelial cell_line:HCT-15 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Toyoda,H., Komurasaki,T., Uchida,D. and Morimoto,S. TITLE Distribution of mRNA for human epiregulin, a differentially expressed member of the epidermal growth factor family JOURNAL Biochem. J. 326 (Pt 1), 69-75 (1997) MEDLINE 97479200 REFERENCE 2 (bases 1 to 4627) AUTHORS Toyoda,H. TITLE Direct Submission JOURNAL Submitted (24-MAY-1994) to the DDBJ/EMBL/GenBank databases. Hitoshi Toyoda, Research Center, Taisho Pharmaceutical Co., Ltd.; No.403 Yoshino-cho, 1-chome, Ohmiya, Saitama 330, Japan (Tel:048-663-1111(ex.3611), Fax:048-652-7254) FEATURES Location/Qualifiers source 1..4627 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HCT-15" /cell_type="epithelial" /tissue_type="colorectal adenocarcinoma" CDS 167..676 /note="EGF-related peptide" /codon_start=1 /product="epiregulin" /db_xref="PID:d1023005" /db_xref="PID:g2381481" /translation="MTAGRRMEMLCAGRVPALLLCLGFHLLQAVLSTTVIPSCIPGES SDNCTALVQTEDNPRVAQVSITKCSSDMNGYCLHGQCIYLVDMSQNYCRCEVGYTGVR CEHFFLTVHQPLSKEYVALTVILIILFLITVVGSTYYFCRWYRNRKSKEPKKEYERVT SGDPELPQV" mat_peptide 353..490 /product="epiregulin" misc_feature 521..574 /note="transmembrane domain" polyA_signal 4580..4585 /note="early mRNA polyadenylation signal" polyA_signal 4584..4589 /note="early mRNA polyadenylation signal" BASE COUNT 1426 a 836 c 836 g 1529 t ORIGIN 1 tcacttgcct gatatttcca gtgtcagagg gacacagcca acgtggggtc ccttctaggc 61 tgacagccgc tctccagcca ctgccgcgag cccgtctgct cccgccctgc ccgtgcactc 121 tccgcagccg ccctccgcca agccccagcg cccgctccca tcgccgatga ccgcggggag 181 gaggatggag atgctctgtg ccggcagggt ccctgcgctg ctgctctgcc tgggtttcca 241 tcttctacag gcagtcctca gtacaactgt gattccatca tgtatcccag gagagtccag 301 tgataactgc acagctttag ttcagacaga agacaatcca cgtgtggctc aagtgtcaat 361 aacaaagtgt agctctgaca tgaatggcta ttgtttgcat ggacagtgca tctatctggt 421 ggacatgagt caaaactact gcaggtgtga agtgggttat actggtgtcc gatgtgaaca 481 cttcttttta accgtccacc aacctttaag caaagagtat gtggctttga ccgtgattct 541 tattattttg tttcttatca cagtcgtcgg ttccacatat tatttctgca gatggtacag 601 aaatcgaaaa agtaaagaac caaagaagga atatgagaga gttacctcag gggatccaga 661 gttgccgcaa gtctgaatgg cgccatcaaa cttatgggca gggataacag tgtgcctggt 721 taatattaat attccatttt attaataata tttatgttgg gtcaagtgtt aggtcaataa 781 cactgtattt taatgtactt gaaaaatgtt tttatttttg ttttattttt gacagactat 841 ttgctaatgt ataatgtgca gaaaatattt aatatcaaaa gaaaattgat atttttatac 901 aagtaatttc ctgagctaaa tgcttcattg aaagcttcaa agtttatatg cctggtgcac 961 agtgcttaga agtaagcaat tcccaggtca tagctcaaga attgttagca aatgacagat 1021 ttctgtaagc ctatatatat agtcaaatcg atttagtaag tatgtttttt atgttcctca 1081 aatcagtgat aattggtttg actgtaccat ggtttgatat gtagttggca ccatggtatc 1141 atatattaaa acaataatgc aattagaatt tgggagaagc aaatataggt cctgtgttaa 1201 acactacaca tttgaaacaa gctaaccctg gggagtctat ggtctcttca ctcaggtctc 1261 agctataatt ctgttatatg aggggcagtg gacagttccc tatgccaact cacgactcct 1321 acaggtacta gtcactcatc taccagattc tgcctatgta aaatgaattg aaaaacaatt 1381 ttctgtaatc ttttatttaa gtagtgggca tttcatagct tcacaatgtt ccttttttgt 1441 atattacaac atttatgtga ggtaattatt gctcaacaga caattagaaa aaagtccaca 1501 cttgaagcct aaatttgtgc tttttaagaa tatttttaga ctatttcttt ttataggggc 1561 tttgctgaat tctaacatta aatcacagcc caaaatttga tggactaatt attattttaa 1621 aatatatgaa gacaataatt ctacatgttg tcttaagatg gaaatacagt tatttcatct 1681 tttattcaag gaagttttaa ctttaataca gctcagtaaa tggcttcttc tagaatgtaa 1741 agttatgtat ttaaagttgt atcttgacac aggaaatggg aaaaaactta aaaattaata 1801 tggtgtattt ttccaaatga aaaatctcaa ttgaaagctt ttaaaatgta gaaacttaaa 1861 cacaccttcc tgtggaggct gagatgaaaa ctagggctca ttttcctgac atttgtttat 1921 tttttggaag agacaaagat ttcttctgca ctctgagccc ataggtctca gagagttaat 1981 aggagtattt ttgggctatt gcataaggag ccactgctgc caccactttt ggattttatg 2041 ggaggctcct tcatcgaatg ctaaaccttt gagtagagtc tccctggatc acataccagg 2101 tcagggagga tctgttcttc ctctacgttt atcctggcat gtgctagggt aaacgaaggc 2161 ataataagcc atggctgacc tctggagcac caggtgccag gacttgtctc catgtgtatc 2221 catgcattat ataccctggt gcaatcacac gactgtcatc taaagtcctg gccctggccc 2281 ttactattag gaaaataaac agacaaaaac aagtaaatat atatggtcct atacatattg 2341 tatatatatt catatacaaa catgtatgta tacatgacct taatggatca tagaattgca 2401 gtcatttggt gctctgctaa ccatttatat aaaacttaaa aacaagagaa aagaaaaatc 2461 aattagatct aaacagttat ttctgtttcc tatttaatat agctgaagtc aaaatatgta 2521 agaacacatt ttaaatactc tacttacagt tggccctctg tggttagttc cacatctgtg 2581 gattcaacca accaaggacg gaaaatgctt aaaaaataat acaacaacaa caaaaaatac 2641 attataacaa ctatttactt tttttttttt ctttttgaga tggagtctcg ctctgttgcc 2701 caggttggag tgcagtggca cgatctcggc tcactgcaac ctcacctccc gggttcaaga 2761 gatcctcctg cctcagcctc ctgagcagct gggactacag gcgcatgcca ccatgcccag 2821 ctaatttttg tatttttagt agaggcgggg tttcaccatg ttggccagga tggtctcaat 2881 ctcctaacct tgagatccac cctccacagc ctcccaaact gctgggatta caggcgtgag 2941 ccaccgcacg tagcatttac attaggtatt acaagtaatg taaagatgat ttaagtatac 3001 aggaggatgt gaataggtta tatgcaagca ctatgccctt ttatataagt gacttgaaca 3061 tctgtgcccg attttagtat gtgcaggggg gcgatctggg aatcagtccc ctgtggatac 3121 caaggtacaa ctgtatttat taacgcttac tagatgtgag gagagtctga atattttcag 3181 tgatcttggc tgtttcaaaa aaatctattg acttttcaat aaatcagctg caatccattt 3241 atttcattta caaaagattt attgtaagcc tctcaatctt ggtttttcag ttgatcttaa 3301 gcatgtcaat tcataaaaac aagtcatttt tgtatttttc atctttaaga atgcttaaaa 3361 aagctaatcc ctaaaatagt tagatctttg taaatgcata ttaaataata aagtatgacc 3421 cacattactt tttatgggtg aaaataagac aaaaataata gttttagtga ggatggtgct 3481 gagtaaacat aaaaactgat ttgctctcag ctgatgtgtc ctgtacacag tgggaagatt 3541 ttagttcaca cttagtctaa ctcccccatt ttacagattt ctcactatat atatttctag 3601 aaggggctat gcatattcaa tgtattgaga accaaagcaa ccacaaatgc ataaatgcat 3661 aatttatggt cttcaaccaa ggccacataa taacccagtt aacttactct ttaaccagga 3721 atattaagtt ctataactag tactcaaggt ttaaccttaa aattaagatt tccttaacct 3781 taaccttaaa attgatatta tattaaacat acataataca atgtaactcc actgttctcc 3841 tgaatatttt ttgctctaat ctctctgccg aaagtcaaag tgatgggaga attggtatac 3901 tggtatgact acgtcttaag tcagattttt atttatgagt ctttgagact aaattcaatc 3961 accaccaggt atcaaatcaa cttttatgca gcaaatatat gattctagtg tctgactttt 4021 gttaaattca gtaatgcagt ttttaaaaac ctgtatctga cccactttgt aatttttgct 4081 ccaatatcca ttctgtagac ttttgaaaaa aaagttttta atttgatgcc caatatattc 4141 tgaccgttaa aaaattcttg ttcatatggg agaaggggga gtaatgactt gtacaaacag 4201 tatttctggt gtatatttta atgtttttaa aaagagtaat ttcatttaaa tatctgttat 4261 tcaaatttga tgatgttaaa tgtaatataa tgtattttct ttttattttg cactctgtaa 4321 ttgcactttt taagtttgaa gagccatttt ggtaaacggt ttttattaaa gatgctatgg 4381 aacataaagt tgtattgcat gcaatttaaa gtaacttatt tgactatgaa tattatcgga 4441 ttactgaatt gtatcaattt gtttgtgttc aatatcagct ttgataattg tgtaccttaa 4501 gatattgaag gagaaaatag ataatttaca agatattatt aatttttatt tatttttctt 4561 gggaattgaa aaaaattgaa ataaataaaa atgcattgaa catcttgcat tcaaaatctt 4621 cactgac // LOCUS D32053 1997 bp mRNA PRI 03-SEP-1997 DEFINITION Homo sapiens mRNA for Lysyl tRNA Synthetase, complete cds. ACCESSION D32053 NID g2366751 KEYWORDS . SOURCE Homo sapiens Fetus Brain cDNA to mRNA, clone:533. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Shiba,K., Stello,T., Motegi,H., Noda,T., Musier-Forsyth,K. and Schimmel,P. TITLE Human lysyl-tRNA synthetase accepts N73 variants and rescues E.coli double-defective mutant JOURNAL J. Biol. Chem. (1997) In press REFERENCE 2 (bases 1 to 1997) AUTHORS Motegi,H. TITLE Direct Submission JOURNAL Submitted (04-JUL-1994) to the DDBJ/EMBL/GenBank databases. Hiromi Motegi, Cancer Institute, Department of Cell Biology; 1-37-1 Kami-Ikebukuro, Toshima-ku, Tokyo 170, Japan (E-mail:hmotegi@ganvx1.jfcr.or.jp, Tel:03-3918-0111(ex.4101), Fax:03-3917-7564) COMMENT Sequence updated (21-Oct-1994) by: Hiromi Motegi. FEATURES Location/Qualifiers source 1..1997 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="533" /dev_stage="Fetus" /tissue_type="Brain" CDS 41..1834 /codon_start=1 /product="Lysyl tRNA Synthetase" /db_xref="PID:d1022943" /db_xref="PID:g2366752" /translation="MAAVQAAEVKVDGSEPKLSKNELKRRLKAEKKVAEKEAKQKELS EKQLSQATAAATNHTTDNGVGPEEESVDPNQYYKIRSQAIHQLKVNGEDPYPHKFHVD ISLTDFIQKYSHLQPGDHLTDITLKVAGRIHAKRASGGKLIFYDLRGEGVKLQVMANS RNYKSEEEFIHINNKLRRGDIIGVQGNPGKTKKGELSIIPYEITLLSPCLHMLPHLHF GLKDKETRYRQRYLDLILNDFVRQKFIIRSKIITYIRSFLDELGFLEIETPMMNIIPG GAVAKPFITYHNELDMNLYMRIAPELYHKMLVVGGIDRVYEIGRQFRNEGIDLTHNPE FTTCEFYMAYADYHDLMEITEKMVSGMVKHITGSYKVTYHPDGPEGQAYDVDFTPPFR RINMVEELEKALGMKLPETNLFETEETRKILDDICVAKAVECPPPRTTARLLDKLVGE FLEVTCINPTFICDHPQIMSPLAKWHRSKEGLTERFELFVMKKEICNAYTELNDPMRQ RQLFEEQAKAKAAGDDEAMFIDENFCTALEYGLPPTAGWGMGIDRVAMFLTDSNNIKE VLLFPAMKPEDKKENVATTDTLESTTVGTSV" BASE COUNT 586 a 452 c 493 g 466 t ORIGIN 1 gtactatcct ccttactttt gggtcgggcc ctccgggaag atggcggccg tgcaggcggc 61 cgaggtgaaa gtggatggca gcgagccgaa actgagcaag aatgagctga agagacgcct 121 gaaagctgag aagaaagtag cagagaagga ggccaaacag aaagagctca gtgagaaaca 181 gctaagccaa gccactgctg ctgccaccaa ccacaccact gataatggtg tgggtcctga 241 ggaagagagc gtggacccaa atcaatacta caaaatccgc agtcaagcaa ttcatcagct 301 gaaggtcaat ggggaagacc catacccaca caagttccat gtagacatct cactcactga 361 cttcatccaa aaatatagtc acctgcagcc tggggatcac ctgactgaca tcaccttaaa 421 ggtggcaggt aggatccatg ccaaaagagc ttctggggga aagctcatct tctatgatct 481 tcgaggagag ggggtgaagt tgcaagtcat ggccaattcc agaaattata aatcagaaga 541 agaatttatt catattaata acaaactgcg tcggggagac ataattggag ttcaggggaa 601 tcctggtaaa accaagaagg gtgagctgag catcattccg tatgagatca cactgctgtc 661 tccctgtttg catatgttac ctcatcttca ctttgggctc aaagacaagg aaacaaggta 721 tcgccagaga tacttggact tgatcctgaa tgactttgtg aggcagaaat ttatcatccg 781 ctctaagatc atcacatata taagaagttt cttagatgag ctgggattcc tagagattga 841 aactcccatg atgaacatca tcccaggggg agccgtggcc aagcctttca tcacttatca 901 caacgagctg gacatgaact tatatatgag aattgctcca gaactctatc ataagatgct 961 tgtggttggt ggcatcgacc gggtttatga aattggacgc cagttccgga atgaggggat 1021 tgatttgacg cacaatcctg agttcaccac ctgtgagttc tacatggcct atgcagacta 1081 tcacgatctc atggaaatca cggagaagat ggtttcaggg atggtgaagc atattacagg 1141 cagttacaag gtcacctacc acccagatgg cccagagggc caagcctacg atgttgactt 1201 caccccaccc ttccggcgaa tcaacatggt agaagagctt gagaaagccc tggggatgaa 1261 gctgccagaa acgaacctct ttgaaactga agaaactcgc aaaattcttg atgatatctg 1321 tgtggcaaaa gctgttgaat gccctccacc tcggaccaca gccaggctcc ttgacaagct 1381 tgttggggag ttcctggaag tgacttgcat caatcctaca ttcatctgtg atcacccaca 1441 gataatgagc cctttggcta aatggcaccg ctctaaagag ggtctgactg agcgctttga 1501 gctgtttgtc atgaagaaag agatatgcaa tgcgtatact gagctgaatg atcccatgcg 1561 gcagcggcag ctttttgaag aacaggccaa ggccaaggct gcaggtgatg atgaggccat 1621 gttcatagat gaaaacttct gtactgccct ggaatatggg ctgcccccca cagctggctg 1681 gggcatgggc attgatcgag tcgccatgtt tctcacggac tccaacaaca tcaaggaagt 1741 acttctgttt cctgccatga aacccgaaga caagaaggag aatgtagcaa ccactgatac 1801 actggaaagc acaacagttg gcacttctgt ctagaaaata ataattgcaa gttgtataac 1861 tcaggcgtct ttgcatttct gcgaaagatc aaggtctgca agggaattct tgtgtgctgc 1921 tttccatttg acaccgcagt tctgttcagc catcagaaga gagacaagga attaaaaatt 1981 tctttttaat cctgtta // LOCUS D38048 980 bp mRNA PRI 27-JAN-1997 DEFINITION Human mRNA for proteasome subunit z, complete cds. ACCESSION D38048 NID g1531532 KEYWORDS proteasome subunit z; proteasome. SOURCE Homo sapiens cell_line:HepG2 cDNA to mRNA, clone:533. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 980) AUTHORS Hisamatsu,H., Shimbara,N., Saito,Y., Kristensen,P., Hendil,K.B., Fujiwara,T., Takahashi,E., Tanahashi,N., Tamura,T., Ichihara,A. and Tanaka,K. TITLE Newly identified pair of proteasomal subunits regulated reciprocally by interferon gamma JOURNAL J. Exp. Med. 183 (4), 1807-1816 (1996) MEDLINE 96261680 REFERENCE 2 (bases 1 to 980) AUTHORS Tanaka,K. JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 980) AUTHORS Tanaka,K. TITLE Direct Submission JOURNAL Submitted (25-AUG-1994) to the DDBJ/EMBL/GenBank databases. Keiji Tanaka, The Univ. of Tokushima, Inst. for Enz. Res.; 3-18-15 Kuramoto-cho, Tokushima, Tokushima 770, Japan (E-mail:ketanaka@ddbj.nig.ac.jp, Tel:0886-31-3111(ex.2563), Fax:0886-33-4223) FEATURES Location/Qualifiers source 1..980 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HepG2" /clone="533" CDS 15..848 /codon_start=1 /product="proteasome subunit z" /db_xref="PID:d1007816" /db_xref="PID:g1531533" /translation="MAAVSVYAPPVGGFSFDNCRRNAVLEADFAKRGYKLPKVRKTGT TIAGVVYKDGIVLGADTRATEGMVVADKNCSKIHFISPNIYCCGAGTAADTDMTTQLI SSNLELHSLSTGRLPRVVTANRMLKQMLFRYQGYIGAALVLGGVDVTGPHLYSIYPHG STDKLPYVTMGSGSLAAMAVFEDKFRPDMEEEEAKNLVSEAIAAGIFNDLGSGSNIDL CVISKNKLDFLRPYTVPNKKGTRLGRYRCEKGTTAVLTEKITPLEIEVLEETVQTMDT S" polyA_signal 950..955 BASE COUNT 258 a 226 c 263 g 233 t ORIGIN 1 gctttcttgg gaagatggcg gctgtgtcgg tgtatgctcc accagttgga ggcttctctt 61 ttgataactg ccgcaggaat gccgtcttgg aagccgattt tgcaaagagg ggatacaagc 121 ttccaaaggt ccggaaaact ggcacgacca tcgctggggt ggtctataag gatggcatag 181 ttcttggagc agatacaaga gcaactgaag ggatggttgt tgctgacaag aactgttcaa 241 aaatacactt catatctcct aatatttatt gttgtggtgc tgggacagct gcagacacag 301 acatgacaac ccagctcatt tcttccaacc tggagctcca ctccctctcc actggccgtc 361 ttcccagagt tgtgacagcc aatcggatgc tgaagcagat gcttttcagg tatcaaggtt 421 acattggtgc agccctagtt ttagggggag tagatgttac tggacctcac ctctacagca 481 tctatcctca tggatcaact gataagttgc cttatgtcac catgggttct ggctccttgg 541 cagcaatggc tgtatttgaa gataagttta ggccagacat ggaggaggag gaagccaaga 601 atctggtgag cgaagccatc gcagctggca tcttcaacga cctgggctcc ggaagcaaca 661 ttgacctctg cgtcatcagc aagaacaagc tggattttct ccgcccatac acagtgccca 721 acaagaaggg gaccaggctt ggccggtaca ggtgtgagaa agggactact gcagtcctca 781 ctgagaaaat cactcctctg gagattgagg tgctggaaga aacagtccaa acaatggaca 841 cttcctgaat ggcatcagtg ggtggctggc cgcggttctg gaaggtggtg agcattgagg 901 cccagtaaga cactcatgtg gctagtgttt gccgaatgaa actcaactca ataaaaaaca 961 aaaaccaaat tgggcagctg // LOCUS D38255 2020 bp mRNA PRI 01-OCT-1997 DEFINITION Homo sapiens mRNA for CAB1, complete cds. ACCESSION D38255 NID g2463543 KEYWORDS . SOURCE Homo sapiens squamous cell carcinoma of esophagus cell_line:TE 6 cDNA to mRNA, clone:CAB1. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Akiyama,N., Sasaki,H., Kishi,T., Kato,M., Hirai,H., Yazaki,Y., Sugimura,T. and Terada,M. TITLE Isolation of multiple expressed genes from the c-ERBB-2 amplicon by a newly developed cDNA enrichment method; the HiLIP-ABC method JOURNAL Cancer Res. (1997) In press REFERENCE 2 (bases 1 to 2020) AUTHORS Akiyama,N. TITLE Direct Submission JOURNAL Submitted (17-SEP-1994) to the DDBJ/EMBL/GenBank databases. Nobu Akiyama, University of Tokyo, Faculty of Medicine, Third Department of Internal Medicine; 7-3-1 Hongo, Bunkyo-ku, Tokyo 113, Japan (Tel:81-3-3815-5411(ex.3116), Fax:81-3-5684-3987) FEATURES Location/Qualifiers source 1..2020 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="TE 6" /chromosome="17" /clone="CAB1" /map="17q11-12" /tissue_type="squamous cell carcinoma of esophagus" CDS 122..1459 /citation=[1] /codon_start=1 /product="CAB1" /db_xref="PID:d1023393" /db_xref="PID:g2463544" /translation="MSKLPRELTRDLERSLPAVASLGSSLSHSQSLSSHLLPPPEKRR AISDVRRTFCLFVTFDLLFISLLWIIELNTNTGIRKNLEQEIIQYNFKTSFFDIFVLA FFRFSGLLLGYAVLQLRHWWVIAVTTLVSSAFLIVKVILSELLSKGAFGYLLPIVSFV LAWLETWFLDFKVLPQEAEEERWYLAAQVAVARGPLLFSGALSEGQFYSPPESFAGSD NESDEEVAGKKSFSAQEREYIRQGKEATAVVDQILAQEENWKFEKNNEYGDTVYTIEV PFHGKTFILKTFLPCPAELVYQEVILQPERMVLWNKTVTACQILQRVEDNTLISYDVS AGAAGGVVSPRDFVNVRRIERRRDRYLSSGIATSHSAKPPTHKYVRGENGPGGFIVLK SASNPRVCTFVWILNTDLKGRLPRYLIHQSLAATMFEFAFHLRQRISELGARA" polyA_site 2020 BASE COUNT 365 a 612 c 602 g 441 t ORIGIN 1 gctactgagg ccgcggagcc ggactgcggt tggggcggga agagccgggg ccgtggctga 61 catggagcag ccctgctgct gaggccgcgc cctccccgcc ctgaggtggg ggcccaccag 121 gatgagcaag ctgcccaggg agctgacccg agacttggag cgcagcctgc ctgccgtggc 181 ctccctgggc tcctcactgt cccacagcca gagcctctcc tcgcacctcc ttccgccgcc 241 tgagaagcga agggccatct ctgatgtccg ccgcaccttc tgtctcttcg tcaccttcga 301 cctgctcttc atctccctgc tctggatcat cgaactgaat accaacacag gcatccgtaa 361 gaacttggag caggagatca tccagtacaa ctttaaaact tccttcttcg acatctttgt 421 cctggccttc ttccgcttct ctggactgct cctaggctat gccgtgctgc agctccggca 481 ctggtgggtg attgcggtca cgacgctggt gtccagtgca ttcctcattg tcaaggtcat 541 cctctctgag ctgctcagca aaggggcatt tggctacctg ctccccatcg tctcttttgt 601 cctcgcctgg ttggagacct ggttccttga cttcaaagtc ctaccccagg aagctgaaga 661 ggagcgatgg tatcttgccg cccaggttgc tgttgcccgt ggacccctgc tgttctccgg 721 tgctctgtcc gagggacagt tctattcacc cccagaatcc tttgcagggt ctgacaatga 781 atcagatgaa gaagttgctg ggaagaaaag tttctctgct caggagcggg agtacatccg 841 ccaggggaag gaggccacgg cagtggtgga ccagatcttg gcccaggaag agaactggaa 901 gtttgagaag aataatgaat atggggacac cgtgtacacc attgaagttc cctttcacgg 961 caagacgttt atcctgaaga ccttcctgcc ctgtcctgcg gagctcgtgt accaggaggt 1021 gatcctgcag cccgagagga tggtgctgtg gaacaagaca gtgactgcct gccagatcct 1081 gcagcgagtg gaagacaaca ccctcatctc ctatgacgtg tctgcagggg ctgcgggcgg 1141 cgtggtctcc ccaagggact tcgtgaatgt ccggcgcatt gagcggcgca gggaccgata 1201 cttgtcatca gggatcgcca cctcacacag tgccaagccc ccgacgcaca aatatgtccg 1261 gggagagaat ggccctgggg gcttcatcgt gctcaagtcg gccagtaacc cccgtgtttg 1321 cacctttgtc tggattctta atacagatct caagggccgc ctgccccggt acctcatcca 1381 ccagagcctc gcggccacca tgtttgaatt tgcctttcac ctgcgacagc gcatcagcga 1441 gctgggggcc cgggcgtgac tgtgccccct cccaccctgc gggccagggt cctgtcgcca 1501 ccacttccag agccagaaag ggtgccagtt gggctcgcac tgcccacatg ggacctggcc 1561 ccaggctgtc accctccacc gagccacgca gtgcctggag ttgactgact gagcaggctg 1621 tggggtggag cactggactc cggggcccca ctggctggag gaagtggggt ctggcctgtt 1681 gatgtttaca tggcgccctg cctcctggag gaccagattg ctctgcccca ccttgccagg 1741 gcagggtctg ggctgggcac ctgacttggc tggggaggac cagggccctg ggcagggcag 1801 ggcagcctgt cacccgtgtg aagatgaagg ggctcttcat ctgcctgcgc tctcgtcggt 1861 ttttttagga ttattgaaag agtctgggac ccttgttggg gagtgggtgg caggtggggg 1921 tgggctgctg gccatgaatc tctgcctctc ccaggctgtc cccctcctcc cagggcctcc 1981 tgggggacct ttgtattaag ccaattaaaa acatgaattt // LOCUS D38585 435 bp mRNA PRI 05-MAR-1997 DEFINITION Human mRNA for TSC-22, complete cds. ACCESSION D38585 NID g1871129 KEYWORDS TSC-22. SOURCE Homo sapiens fetal kidney cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ohta,S., Shimekake,Y. and Nagata,K. TITLE Molecular cloning and characterization of a transcription factor for the C-type natriuretic peptide gene promoter JOURNAL Eur. J. Biochem. 242 (3), 460-466 (1996) MEDLINE 97175009 REFERENCE 2 (bases 1 to 435) AUTHORS Ohta,S., Shimekake,Y. and Nagata,K. TITLE Molecular cloning of transcription factor of human C-type nat riuretic peptide JOURNAL Unpublished (1994) REFERENCE 3 (bases 1 to 435) AUTHORS Ohta,S. TITLE Direct Submission JOURNAL Submitted (25-OCT-1994) to the DDBJ/EMBL/GenBank databases. Shigeki Ohta, Shionogi & Co., Ltd., Shionogi Research Laboratories; 12-4 Sagisu, 5-chome, Fukushima-ku, Osaka, Osaka 553, Japan (E-mail:sohta@fl.lab.shionogi.co.jp, Tel:06-458-5861(ex.936), Fax:06-458-0987) FEATURES Location/Qualifiers source 1..435 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="kidney" CDS 1..435 /codon_start=1 /product="TSC-22" /db_xref="PID:d1008179" /db_xref="PID:g1871130" /translation="MKSQWCRPVAMDLGVYQLRHFSISFLSSLLGTENASVRLDNSSS GASVVAIDNKIEQAMDLVKSHLMYAVREEVEVLKEQIKELIEKNSQLEQENNLLKTLA SPEQLAQFQAQLQTGSPPATTQPQGTTQPPAQPASQGSGPTA" BASE COUNT 124 a 115 c 106 g 90 t ORIGIN 1 atgaaatccc aatggtgtag accagtggcg atggatctag gagtttacca actgagacat 61 ttttcaattt ctttcttgtc atccttgctg gggactgaaa acgcttctgt gagacttgat 121 aatagctcct ctggtgcaag tgtggtagct attgacaaca aaatcgagca agctatggat 181 ctagtgaaaa gccatttgat gtatgcggtc agagaagaag tggaggtcct caaagagcaa 241 atcaaagaac taatagagaa aaattcccag ctggagcagg agaacaatct gctgaagaca 301 ctggccagtc ctgagcagct tgcccagttt caggcccagc tgcagactgg ctccccccct 361 gccaccaccc agccacaggg caccacacag ccccccgccc agccagcatc gcagggctca 421 ggaccaaccg catag // LOCUS D38595 2963 bp mRNA PRI 21-OCT-1996 DEFINITION Human mRNA for inter-alpha-trypsin inhibitor family heavy chain-related protein (IHRP), complete cds. ACCESSION D38595 NID g664887 KEYWORDS inter-alpha-trypsin inhibitor family heavy chain-related protein; IHRP. SOURCE Homo sapiens liver cDNA to mRNA, clone_lib:lambda gt11 and lambda DR2 clone:GP120-5'-15. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2963) AUTHORS Saguchi,K., Tobe,T., Hashimoto,K., Sano,Y., Nakano,Y., Miura,N.H. and Tomita,M. TITLE Cloning and characterization of cDNA for inter-alpha-trypsin inhibitor family heavy chain-related protein (IHRP), a novel human plasma glycoprotein JOURNAL J. Biochem. 117 (1), 14-18 (1995) MEDLINE 95293915 REFERENCE 2 (bases 1 to 2963) AUTHORS Tobe,T. TITLE Direct Submission JOURNAL Submitted (25-OCT-1994) to the DDBJ/EMBL/GenBank databases. Takashi Tobe, Showa University, School of Pharmaceutical Sciences, Department of Physiological Chemistry; 1-5-8, Hatanodai, Shinagawa-ku, Tokyo 142, Japan (Tel:03-3784-8215, Fax:03-3784-8216) FEATURES Location/Qualifiers source 1..2963 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="GP120-5'-15" /clone_lib="lambda gt11 and lambda DR2" /tissue_type="liver" sig_peptide 34..117 CDS 34..2826 /codon_start=1 /product="inter-alpha-trypsin inhibitor family heavy chain-related protein (IHRP)" /db_xref="PID:d1008183" /db_xref="PID:g1483187" /translation="MKPPRPVRTCSKVLVLLSLLAIHQTTTAEKNGIDIYSLTVDSRV SSRFAHTVVTSRVVNRANTVQEATFQMELPKKAFITNFSMNIDGMTYPGIIKEKAEAQ AQYSAAVAKGKNAGLVKATGRNMEQFQVSVSVAPNAKITFELVYEELLKRRLGVYELL LKVRPQQLVKHLQMDIHIFEPQGISFLETESTFMTNQLVDALTTWQNKTKAHIRFKPT LSQQQKSPEQQETVLDGNLIIRYDVDRAISGGSIQIENGYFVHYFAPEGLTTMPKNVV FVIDKSGSMSGRKIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLVPASAENV NKARSFAAGIQALGGTNINDAMLMAVQLLDSSNQEERLPEGSVSLIILLTDGDPTVGE TNPRSIQNNVREAVSGRYSLFCLGFGFDVSYAFLEKLALDNGGLARRIHEDSDSALQL QDFYQEVANPLLTAVTFEYPSNAVEEVTQNNFRLLFKGSEMVVAGKLQDRGPDVLTAT VSGKLPTQNITFQTESSVAEQEAEFQSPKYIFHNFMERLWAYLTIQQLLEQTVSASDA DQQALRNQALNLSLAYSFVTPLTSMVVTKPDDQEQSQVAEKPMEGESRNRNVHSGSTF FKYYLQGAKIPKPEASFSPRRGWNRQAGAAGSRMNFRPGVLSSRQLGLPGPPDVPDHA AYHPFRRLAILPASAPPATSNPDPAVSRVMNMKIEETTMTTQTPAPIQAPSAILPLPG QSVERLCVDPRHRQGPVNLLSDPEQGVEVTGQYEREKAGFSWIEVTFKNPLVWVHASP EHVVVTRNRRSSAYKWKETLFSVMPGLKMTMDKTGLLLLSDPDKVTIGLLFWDGRGEG LRLLLRDTDRFSSHVGGTLGQFYQEVLWGSPAASDDGRRTLRVQGNDHSATRERRLDY QEGPPGVEISCWSVEL" mat_peptide 118..2823 polyA_signal 2937..2942 polyA_site 2963 BASE COUNT 709 a 874 c 815 g 565 t ORIGIN 1 gtgagaagcc tcctggcaga cactggagcc acgatgaagc ccccaaggcc tgtccgtacc 61 tgcagcaaag ttctcgtcct gctttcactg ctggccatcc accagaccac tactgccgaa 121 aagaatggca tcgacatcta cagcctcacc gtggactcca gggtctcatc ccgatttgcc 181 cacacggtcg tcaccagccg agtggtcaat agggccaata cggtacagga ggccaccttc 241 cagatggagc tgcccaagaa agccttcatc accaacttct ccatgaacat cgatggcatg 301 acctacccag ggatcatcaa ggagaaggct gaagcccagg cacagtacag cgcagcagtg 361 gccaagggaa agaacgctgg cctcgtcaag gccaccggga gaaacatgga gcagttccag 421 gtgtcggtca gtgtggctcc caatgccaag atcacctttg agctggtcta tgaggagctg 481 ctcaagcggc gtttgggggt gtacgagctg ctgctgaaag tgcggcccca gcagctggtc 541 aagcacctgc agatggacat tcacatcttc gagccccagg gcatcagctt tctggagaca 601 gagagcacct tcatgaccaa ccagctggta gacgccctca ccacctggca gaataagacc 661 aaggctcaca tccggttcaa gccaacactt tcccagcagc aaaagtcccc agagcagcaa 721 gaaacagtcc tggacggcaa cctcattatc cgctatgatg tggaccgggc catctccggg 781 ggctccattc agatcgagaa cggctacttt gtacactact ttgcccccga gggcctaacc 841 acaatgccca agaatgtggt ctttgtcatt gacaagagcg gctccatgag tggcaggaaa 901 atccagcaga cccgggaagc cctaatcaag atcctggatg acctcagccc cagagaccag 961 ttcaacctca tcgtcttcag tacagaagca actcagtgga ggccatcact ggtgccagcc 1021 tcagccgaga acgtgaacaa ggccaggagc tttgctgcgg gcatccaggc cctgggaggg 1081 accaacatca atgatgcaat gctgatggct gtgcagttgc tggacagcag caaccaggag 1141 gagcggctgc ccgaagggag tgtctcactc atcatcctgc tcaccgatgg cgaccccact 1201 gtgggggaga ctaaccccag gagcatccag aataacgtgc gggaagctgt aagtggccgg 1261 tacagcctct tctgcctggg cttcggtttc gacgtcagct atgccttcct ggagaagctg 1321 gcactggaca atggcggcct ggcccggcgc atccatgagg actcagactc tgccctgcag 1381 ctccaggact tctaccagga agtggccaac ccactgctga cagcagtgac cttcgagtac 1441 ccaagcaatg ccgtggagga ggtcactcag aacaacttcc ggctcctctt caagggctca 1501 gagatggtgg tggctgggaa gctccaggac cgggggcctg atgtgctcac agccacagtc 1561 agtgggaagc tgcctacaca gaacatcact ttccaaacgg agtccagtgt ggcagagcag 1621 gaggcggagt tccagagccc caagtatatc ttccacaact tcatggagag gctctgggca 1681 tacctgacta tccagcagct gctggagcaa actgtctccg catccgacgc tgatcagcag 1741 gccctccgga accaagcgct gaatttatca cttgcctaca gctttgtcac gcctctcaca 1801 tctatggtag tcaccaaacc cgatgaccaa gagcagtctc aagttgctga gaagcccatg 1861 gaaggcgaaa gtagaaacag gaatgtccac tcaggttcca ctttcttcaa atattatctc 1921 cagggagcaa aaataccaaa accagaggct tccttttctc caagaagagg atggaataga 1981 caagctggag ctgctggctc ccggatgaat ttcagacctg gggttctcag ctccaggcaa 2041 cttggactcc caggacctcc tgatgttcct gaccatgctg cttaccaccc cttccgccgt 2101 ctggccatct tgcctgcttc agcaccacca gccacctcaa atcctgatcc agctgtgtct 2161 cgtgtcatga atatgaaaat cgaagaaaca accatgacaa cccaaacccc agcccccata 2221 caggctccct ctgccatcct gccactgcct gggcagagtg tggagcggct ctgtgtggac 2281 cccagacacc gccaggggcc agtgaacctg ctctcagacc ctgagcaagg ggttgaggtg 2341 actggccagt atgagaggga gaaggctggg ttctcatgga tcgaagtgac cttcaagaac 2401 cccctggtat gggttcacgc atcccctgaa cacgtggtgg tgactcggaa ccgaagaagc 2461 tctgcgtaca agtggaagga gacgctattc tcagtgatgc ccggcctgaa gatgaccatg 2521 gacaagacgg gtctcctgct gctcagtgac ccagacaaag tgaccatcgg cctgttgttc 2581 tgggatggcc gtggggaggg gctccggctc cttctgcgtg acactgaccg cttctccagc 2641 cacgttggag ggacccttgg ccagttttac caggaggtgc tctggggatc tccagcagca 2701 tcagatgacg gcagacgcac gctgagggtt cagggcaatg accactctgc caccagagag 2761 cgcaggctgg attaccagga ggggcccccg ggagtggaga tttcctgctg gtctgtggag 2821 ctgtagttct gatggaagga gctgtgccca ccctgtacac ttggcttccc cctgcaactg 2881 cagggccgct tctggggcct ggaccaccat ggggaggaag agtcccactc attacaaata 2941 aagaaaggtg gtgtgagcct ggg // LOCUS D38752 1020 bp DNA PRI 09-OCT-1997 DEFINITION Homo sapiens gene for fibroblast growth factor-8, complete cds. ACCESSION D38752 NID g2463547 KEYWORDS fibroblast growth factor-8. SOURCE Homo sapiens placenta DNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Tanaka,A., Miyamoto,K., Matsuo,H., Matsumoto,K. and Yoshida,H. TITLE Human androgen-induced growth factor in prostate and breast cancer cells: its molecular cloning and growth properties JOURNAL FEBS Lett. 363 (3), 226-230 (1995) MEDLINE 95255551 REFERENCE 2 (bases 1 to 1020) AUTHORS Tanaka,A. TITLE Direct Submission JOURNAL Submitted (31-OCT-1994) to the DDBJ/EMBL/GenBank databases. Akira Tanaka, Jichi Medical School, Department of Pathology; 3311-1 Yakushiji, Minamikawachi-machi, Kawachi-gun, Tochigi 329-04, Japan (E-mail:atanaka@jichi.ac.jp, Tel:0285-44-2111(ex.3316), Fax:0285-44-8467) FEATURES Location/Qualifiers source 1..1020 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" CDS 171..818 /note="FGF-8" /codon_start=1 /product="fibroblast growth factor-8" /db_xref="PID:d1023395" /db_xref="PID:g2463548" /translation="MGSPRSALSCLLLHLLVLCLQAQVTVQSSPNFTQHVREQSLVTD QLSRRLIRTYQLYSRTSGKHVQVLANKRINAMAEDGDPFAKLIVETDTFGSRVRVRGA ETGLYICMNKKGKLIAKSNGKGKDCVFTEIVLENNYTALQNAKYEGWYMAFTRKGRPR KGSKTRQHQREVHFMKRLPRGHHTTEQSLRFEFLNYPPFTRSLRGSQRTWAPEPR" BASE COUNT 193 a 341 c 307 g 179 t ORIGIN 1 gcggcgcggc gagcacgacg ttccacggga cccgcggagc cgcgtcgtga tcgccgccgg 61 cctcccgcac ccgcaccctc tccgctcgcg ccctgctcag cgcgtcctcc cgcggcggcc 121 cgcgggacgg cgtgacccgc cgggctctcg gtgccccggg gccgcgcgcc atgggcagcc 181 cccgctccgc gctgagctgc ctgctgttgc acttgctggt cctctgcctc caagcccagg 241 taactgttca gtcctcacct aattttacac agcatgtgag ggagcagagc ctggtgacgg 301 atcagctcag ccgccgcctc atccggacct accaactcta cagccgcacc agcgggaagc 361 acgtgcaggt cctggccaac aagcgcatca acgccatggc agaggacggc gaccccttcg 421 caaagctcat cgtggagacg gacacctttg gaagcagagt tcgagtccga ggagccgaga 481 cgggcctcta catctgcatg aacaagaagg ggaagctgat cgccaagagc aacggcaaag 541 gcaaggactg cgtcttcacg gagattgtgc tggagaacaa ctacacagcg ctgcagaatg 601 ccaagtacga gggctggtac atggccttca cccgcaaggg ccggccccgc aagggctcca 661 agacgcggca gcaccagcgt gaggtccact tcatgaagcg gctgccccgg ggccaccaca 721 ccaccgagca gagcctgcgc ttcgagttcc tcaactaccc gcccttcacg cgcagcctgc 781 gcggcagcca gaggacttgg gcccccgagc cccgataggt gctgcctggc cctccccaca 841 atgccagacc gcagagaggc tcatcctgta gggcacccaa aactcaagca agatgagctg 901 tgcgctgctc tgcaggctgg ggaggtgctg ggggagccct gggttccggt tgttgatatt 961 gtttgctgtt gggtttttgc tgtttttttt tttttttttt tttttaaaac aaaagaggct // LOCUS D42138 1924 bp mRNA PRI 19-SEP-1996 DEFINITION Human mRNA for PIG-B, complete cds. ACCESSION D42138 NID g1552168 KEYWORDS PIG-B; phosphatidylinositol glycan of complementation class B. SOURCE Homo sapiens cell_line:P39 cDNA to mRNA, clone_lib:human P39 library in pCEV4. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1924) AUTHORS Takahashi,M., Inoue,N., Ohishi,K., Maeda,Y., Nakamura,N., Endo,Y., Fujita,T., Takeda,J. and Kinoshita,T. TITLE PIG-B, a membrane protein of the endoplasmic reticulum with a large lumenal domain, is involved in transferring the third mannose of the GPI anchor JOURNAL EMBO J. 15 (16), 4254-4261 (1996) MEDLINE 97015126 REFERENCE 2 (bases 1 to 1924) AUTHORS Takahashi,M. TITLE Direct Submission JOURNAL Submitted (17-NOV-1994) to the DDBJ/EMBL/GenBank databases. Minoru Takahashi, Research Institute for Microbial Diseases, Osaka University, Department of Immunoregulation; 3-1 Yamadaoka, Suita, Osaka 565, Japan (E-mail:takahash@biken.osaka-u.ac.jp, Tel:06-875-5233, Fax:06-875-5233) FEATURES Location/Qualifiers source 1..1924 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="P39" /clone_lib="human P39 library in pCEV4" CDS 45..1709 /note="involvement of GPI-anchor biosynthesis" /codon_start=1 /product="PIG-B" /db_xref="PID:d1008294" /db_xref="PID:g1552169" /translation="MRRPLSKCGMEPGGGDASLTLHGLQNRSHGKIKLRKRKSTLYFN TQEKSARRRGDLLGENIYLLLFTIALRILNCFLVQTSFVPDEYWQSLEVSHHMVFNYG YLTWEWTERLRSYTYPLIFASIYKILHLLGKDSVQLLIWIPRLAQALLSAVADVRLYS LMKQLENQEVARWVFFCQLCSWFTWYCCTRTLTNTMETVLTIIALFYYPLEGSKSMNS VKYSSLVALAFIIRPTAVILWTPLLFRHFCQEPRKLDLILHHFLPVGFVTLSLSLMID RIFFGQWTLVQFNFLKFNVLQNWGTFYGSHPWHWYFSQGFPVILGTHLPFFIHGCYLA PKRYRILLVTVLWTLLVYSMLSHKEFRFIYPVLPFCMVFCGYSLTHLKTWKKPALSFL FLSNLFLALYTGLVHQRGTLDVMSHIQKVCYNNPNKSSASIFIMMPCHSTPYYSHVHC PLPMRFLQCPPDLTGKSHYLDEADVFYLNPLNWLHREFHDDASLPTHLITFSILEEEI SAFLISSNYKRTAVFFHTHLPEGRIGSHIYVYERKLKGKFNMKMKF" polyA_signal 1902..1907 polyA_site 1924 BASE COUNT 532 a 397 c 376 g 619 t ORIGIN 1 ggctactgca gctttcttcc gccttaggaa ggtggcggcc agggatgagg aggcccctaa 61 gcaagtgcgg aatggagccg gggggcggag atgccagcct cactttgcat ggtctccaga 121 accgctccca cggcaagata aagctgcgaa agagaaagtc taccttgtac ttcaacaccc 181 aggagaagag cgccaggcgc cgcggggatc ttcttggaga aaatatttat ctgctcttgt 241 ttaccatagc tttacgaata ttaaactgct ttttagtgca gacaagtttt gttccagatg 301 aatactggca gtctcttgaa gtttcacatc acatggtttt caattatggt tatttgactt 361 gggaatggac agagagactg aggagttaca cttatccctt aatctttgca agcatttaca 421 agattcttca tcttttaggg aaagatagtg ttcagttgct gatttggatt cctagacttg 481 cccaagcact tctgtctgct gtagcagatg tgagacttta ctcattaatg aagcaactag 541 aaaatcagga agtggcaaga tgggtgtttt tttgccagtt gtgctcctgg ttcacatggt 601 attgctgtac cagaaccctt acaaacacca tggaaactgt tctcactata attgctcttt 661 tctactatcc tttggaaggt tcaaagtcta tgaacagtgt caaatactca tccctggtgg 721 cacttgcctt cataattcgt cccacagctg tcattctgtg gacacctttg ctcttcagac 781 atttctgtca agaaccaaga aagcttgatc ttattctaca tcatttttta cctgtaggct 841 ttgttacttt gagtttgtct ctgatgattg atcgtatttt ttttggccaa tggactctgg 901 ttcaatttaa ttttttgaaa tttaacgtgc tgcagaactg gggaacattt tatggttctc 961 atccatggca ctggtacttc agtcaaggat ttccagttat cttgggtact cacttaccct 1021 tctttattca tggctgctat ctagcaccaa agagataccg gatacttttg gtgactgtgc 1081 tgtggacact gcttgtttat agcatgttga gccacaaaga attcaggttt atttatccag 1141 ttttaccatt ctgtatggtg ttctgtggat actcattaac ccacctgaaa acatggaaga 1201 aaccagctct aagtttcctg tttttatcaa atttgttcct cgccctttat actggtttag 1261 ttcatcaacg aggtactctt gatgtcatga gtcatattca aaaagtttgt tacaacaatc 1321 ccaataaatc ttcagcttca atatttataa tgatgccttg ccactctact ccttattaca 1381 gccatgttca ctgcccactt cccatgagat ttctccagtg cccgccagac ctgactggaa 1441 aaagtcatta tcttgatgaa gcagatgtat tttacctaaa tcccttaaac tggttacata 1501 gagagtttca tgatgatgca tcattgccta ctcacttgat caccttcagc attttggaag 1561 aggaaataag cgctttccta atttcaagca attataaaag aactgctgtt ttcttccaca 1621 ctcacttgcc agagggtcga attggaagtc acatatatgt ctatgaacgg aagttaaaag 1681 ggaaattcaa catgaagatg aaattctgaa ctttcctaga taaattaaca ttgctgggtg 1741 gaaatattca gatgctgctt aaatacttcg gtaaacactg ggtaagattc atggaactta 1801 gaaaaaagct gtatgaactg ctttaccaaa tatcactact gaggaaatgt ataaaatacc 1861 acatagtata aaattacatg ttaatacaat gccagatttt aaataaagac ctttagtttt 1921 cctc // LOCUS D43945 1805 bp mRNA PRI 14-OCT-1997 DEFINITION Homo sapiens mRNA for TFEC isoform (or TFECL), complete cds. ACCESSION D43945 NID g2347002 KEYWORDS TFEC. SOURCE Homo sapiens monocytic leukemia cell_line:THP-1 cDNA to mRNA, clone:pMIR-4. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Yasumoto,K. and Shibahara,S. TITLE Molecular cloning of cDNA encoding a human TFEC isoform, a newly identified transcriptional regulator JOURNAL Biochim. Biophys. Acta 1353 (1), 23-31 (1997) MEDLINE 97398136 REFERENCE 2 (bases 1 to 1805) AUTHORS Shibahara,S. TITLE Direct Submission JOURNAL Submitted (22-DEC-1994) to the DDBJ/EMBL/GenBank databases. Shigeki Shibahara, Tohoku University School of Medicine, Dept. of Mol. Biol. and Applied Physiol.; 2-1 Seiryomachi, Aoba-ku, Sendai, Miyagi 980, Japan (Tel:022-717-8117, Fax:022-717-8118) FEATURES Location/Qualifiers source 1..1805 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="THP-1" /clone="pMIR-4" /tissue_type="monocytic leukemia" gene 183..1226 /gene="Tfec" CDS 183..1226 /gene="Tfec" /codon_start=1 /product="TFEC isoform (or TFECL)" /db_xref="PID:d1022767" /db_xref="PID:g2347003" /translation="MTLDHQIINPTLKWSQPAVPSGGPLVQHAHTTLDSDAGLTENPL TKLLAIGKEDDNAQWHMEDVIEDIIGMESSFKEEGADSPLLMQRTLSGSILDVYSGEQ GISPINMGLTSASCPSSLPMKREITETDTRALAKERQKKDNHNLIERRRRYNINYRIK ELGTLIPKSNDPDMRWNKGTILKASVEYIKWLQKEQQRARELEHRQKKLEQANRRLLL RIQELEIQARTHGLPTLASLGTVDLGAHVTKQQSHPEQNSVDYCQQLTVSQGPSPELC DQAIAFSDPLSYFTDLSFSAALKEEQRLDGMLLDDTISPFGTDPLLSATSPAVSKESS RRSSFSSDDGDEL" polyA_site 1805 /note="15 A nucleotides" BASE COUNT 587 a 351 c 373 g 494 t ORIGIN 1 tgtttacttt ggttgtccct tctggcatgg tgcatatgtt atgggaagag ggattataat 61 ttggtgctgt ttgtagagat gacaacactg ataaaatcca ctcattgctg gtcccagcac 121 acctggaaag ttctgcaagg cctcagctac agaaagccca gagacagaaa gtaaactctt 181 tcatgaccct tgatcatcag atcatcaatc caactcttaa atggtcacaa cctgcagtgc 241 caagtggtgg gcctcttgtg cagcatgcac acacaactct ggacagtgat gctggcctca 301 cagaaaaccc actcaccaag ttactagcta ttgggaaaga agatgacaat gcacaatggc 361 atatggagga cgttattgag gatataatcg gtatggaatc aagttttaaa gaggaaggag 421 cagactctcc tctgctaatg caaagaacat tatctggaag tattttggat gtgtatagcg 481 gtgaacaagg aatttcacca attaacatgg ggcttacaag tgcttcttgt ccaagtagtc 541 taccaatgaa aagagaaatt acagaaactg acactagagc tttagcaaaa gagagacaaa 601 aaaaggacaa ccacaacctc attgaaagaa gaagaaggta taatattaat taccgaatca 661 aggagcttgg cactcttatt ccaaagtcta atgatcctga tatgcgctgg aacaaaggaa 721 ccattctaaa agcatcagtg gagtacatca agtggctaca aaaagaacaa cagagagccc 781 gagaattgga acacagacag aagaaattag agcaggctaa caggcgactt ctacttcgga 841 ttcaggaact agaaattcag gctcgtactc atggtctgcc aaccctggct tcacttggca 901 cggttgattt aggtgctcat gtcaccaaac agcagagcca tcctgagcag aattcagtag 961 actattgcca acaactgact gtgtctcagg ggccaagccc tgagctctgt gatcaagcta 1021 tagccttttc tgatcctttg tcatacttca cagatttatc atttagtgct gcattgaaag 1081 aggaacaaag attggatggc atgctattgg atgacacaat ctctccattt ggaacagatc 1141 ctctgctatc tgccacttcc cctgcagttt ccaaagaaag cagtaggaga agtagcttta 1201 gctcagatga tggtgatgaa ttataagaaa taaacagacc caattcatca actggaaagc 1261 aattctatgc tggtgctatg caattatgct ctgtgtttca tatgttgctt tggcttattt 1321 tttttcttaa aggaatgtgt tgttcatgaa aaactgatag aagcaacaga agaattcgca 1381 ggaagaaaaa tcatagtgtt aatgaattat tgagggcgaa aaaaaggtgt tttcttcttt 1441 gactacggag tccaaatcca cttaaattct gttttcctga aaagaggtac agcataagaa 1501 atagctcttt attgatgttt taaaagcagc aacttggtgg tgtactactg gaactaatga 1561 ctgcaaagtg ttaaacgact gaaatataca aacagtctct tagttactca tttccatctt 1621 ctcttcaact ttcacatcag tcttccggaa tcaagatcaa catatcaggt ggtcattgcc 1681 tttctccatt gtctagtaga catgtctaaa gttcaaactt tataggataa ataaatgtat 1741 aatagattat ctgtcacttg tggttgaaag gcaaatctac aataaatgtg agaattttcc 1801 acaat // LOCUS D44466 3176 bp mRNA PRI 29-JAN-1997 DEFINITION Human mRNA for proteasome subunit p112, complete cds. ACCESSION D44466 NID g1808577 KEYWORDS . SOURCE Homo sapiens cell_line:HepG2 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3176) AUTHORS Yokota,K., Kagawa,S., Shimizu,Y., Akioka,H., Tsurumi,C., Noda,C., Fujimuro,M., Yokosawa,H., Fujiwara,T., Takahashi,E., Ohba,M., Yamasaki,M., DeMartino,G.N., Slaughter,C.A., Toh-e,A. and Tanaka,K. TITLE CDNA cloning of p112, the largest regulatory subunit of the human 26s proteasome, and functional analysis of its yeast homologue, sen3p JOURNAL Mol. Biol. Cell 7 (6), 853-870 (1996) MEDLINE 96413887 REFERENCE 2 (bases 1 to 3176) AUTHORS Yokota,K., Akioka,H., Shimizu,Y., Tanahashi,N., Tsurumi,C., Noda,C., Toh-E,A., Fujimuro,M., Yokosawa,H., DeMartino,G., Slaughter,C. and Tanaka,K. TITLE cDNA cloning of a regulatory subunit p112 of the human 26S proteasome and functional analysis of its homologous gene SEN3 of Saccharomyces cerevisiae JOURNAL Unpublished (1995) REFERENCE 3 (bases 1 to 3176) AUTHORS Tanaka,K. TITLE Direct Submission JOURNAL Submitted (24-DEC-1994) to the DDBJ/EMBL/GenBank databases. Keiji Tanaka, The Univ. of Tokushima, Inst. for Enz. Res.; 3-18-15 Kuramoto-cho, Tokushima, Tokushima 770, Japan (E-mail:ketanaka@ddbj.nig.ac.jp, Tel:0886-33-7430, Fax:0886-33-7431) COMMENT Submitted (24-Dec-1994) to DDBJ by: Keiji Tanaka Inst. for Enz. Res. The Univ. of Tokushima 3-18-15 Kuramoto-cho Tokushima, Tokushima 770 Japan Phone: 0886-31-3111 x2563 Fax: 0886-33-4223 Email: ketanaka@ddbj.nig.ac.jp. FEATURES Location/Qualifiers source 1..3176 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HepG2" CDS 81..2942 /codon_start=1 /product="proteasome subunit p112" /db_xref="PID:d1008506" /db_xref="PID:g1808578" /translation="MITSAAGIISLLDEDEPQLKEFALHKLNAVVNDFWAEISESVDK IEVLYEDEGFRSRQFAALVASKVFYHLGAFEESLNYALGARDLFNVNDNSEYVETIIA KCIDHYTKQCVENADLPEGEKKPIDQRLEGIVNKMFQRCLDDHKYKQAIGIALETRRL DVFEKTILESNDVPGMLAYSLKLCMSLMQNKQFRNKVLRVLVKIYMNLEKPDFINVCQ CLIFLDDPQAVSDILEKLVKEDNLLMAYQICFDLYESASQQFLSSVIQNLRTVGTPIA SVPGSTNTGTVPGSEKDSDSMETEEKTSSAFVGKTPEASPEPKDQTLKMIKILSGEMA IELHLQFLIRNNNTDLMILKNTKDAVRNSVCHTATVIANSFMHCGTTSDQFLRDNLEW LARATNWAKFTATASLGVIHKGHEKEALQLMATYLPKDTSPGSAYQEGGGLYALGLIH ANHGGDIIDYLLNQLKNASNDIVRHGGSLGLGLAAMGTARQDVYDLLKTNLYQDDAVT GEAAGLALGLVMLGSKNAQAIEDMVGYAQETQHEKILRGLAVGIALVMYGRMEEADAL IESLCRDKDPILRRSGMYTVAMAYCGSGNNKAIRRLLHVAVSDVNDDVRSAAVESLGF ILFRTPEQCPSVVSLLSESYNPHVRYGAAMALGICCAGTGNKEAINLLEPMTNDPVNY VRQGALIASALIMIQQTEITCPKVNQFRQLYSKVINDKHDDVMAKFGAILAQGILDAG GHNVTISLQSRTGHTHMPSVVGVLVFTQFWFWFPLSHFLSLAYTPTCVIGLNKDLKMP KVQYKSNCKPSTFAYPAPLEVPKEKEKEKVSTAVLSITAKAKKKEKEKEKKEEEKMEV DEAEKKEEKEKKKEPEPNFQLLDNPARVMPAQLKVLTMPETCRYQPFKPLSIGGIIIL KDTSEDIEELVEPVAAHGPKIEEEEQEPEPPEPFEYIDD" polyA_signal 3148..3153 BASE COUNT 961 a 626 c 757 g 832 t ORIGIN 1 tgaactgagc ggcccctgag ctgacagata cactgcgcag tggaacggcg agcgagccga 61 cgggcgagtg aggggcgcac atgatcacct cggccgctgg aattatttct cttctggatg 121 aagatgaacc acagcttaag gaatttgcac tacacaaatt gaatgcagtt gttaatgact 181 tctgggcaga aatttccgag tccgtagaca aaatagaggt tttatacgaa gatgaaggtt 241 tccggagtcg gcagtttgca gccttagtgg catctaaagt attttatcac ctgggggctt 301 ttgaggagtc tctgaattat gctcttggag caagggacct cttcaatgtc aatgataact 361 ctgaatatgt ggaaactatt atagcaaaat gcattgatca ctacaccaaa caatgtgtgg 421 aaaatgcaga tttgcctgaa ggagaaaaaa aaccaattga ccagagattg gaaggcatcg 481 taaataaaat gttccagcga tgtctagatg atcacaagta taaacaggct attggcattg 541 ctctggagac acgaagactg gacgtctttg aaaagaccat actggagtcg aatgatgtcc 601 caggaatgtt agcttatagc cttaagctct gcatgtcttt aatgcagaat aaacagtttc 661 ggaataaagt actaagagtt ctagttaaaa tctacatgaa cttggagaaa cctgatttca 721 tcaatgtttg tcagtgctta attttcttag atgatcctca ggctgtgagt gatatcttag 781 agaaactggt aaaggaagac aacctcctga tggcatatca gatttgtttt gatttgtatg 841 aaagtgctag ccagcagttt ttgtcatctg taatccagaa tcttcgaact gttggcaccc 901 ctattgcttc tgtgcctgga tccactaata cgggtactgt tccgggatca gagaaagaca 961 gtgactcgat ggaaacagaa gaaaagacaa gcagtgcatt tgtaggaaag acaccagaag 1021 ccagtccaga gcctaaggac cagactttga aaatgattaa aattttaagt ggtgaaatgg 1081 ctattgagtt acatctgcag ttcttaatac gaaacaataa tacagacctc atgattctaa 1141 aaaacacaaa ggatgcagta cggaattctg tatgtcatac tgcaaccgtt atagcaaact 1201 cttttatgca ctgtgggaca accagtgacc agtttcttag agataatttg gaatggttag 1261 ccagagccac taactgggca aaatttactg ctacagccag tttgggtgta attcataagg 1321 gtcatgaaaa agaagcatta cagttaatgg caacatacct tcccaaggat acttctccag 1381 gatcagccta tcaggaaggt ggaggtctct atgcactagg tcttattcat gccaatcatg 1441 gtggtgatat aattgactat ctgcttaatc agcttaagaa cgccagcaat gatatcgtta 1501 gacacggtgg cagtctgggc cttggtttgg cagccatggg aactgcacgt caagatgttt 1561 atgatttgct aaaaacaaac ctttatcagg atgatgcagt aacaggggaa gcagctggcc 1621 tggccctagg tttggttatg ttgggctcta aaaatgctca ggctattgag gacatggttg 1681 gttatgcaca agaaactcaa catgagaaga ttctgcgtgg tcttgcagtt ggcatagctt 1741 tagtaatgta tgggaggatg gaagaggctg atgctctcat tgaatctctc tgtcgtgaca 1801 aggacccaat tcttcgaagg tctggaatgt atactgtagc catggcttat tgtggctctg 1861 gtaacaacaa agcaattcga cgcctgctac atgtggctgt aagtgatgtg aatgatgatg 1921 tcaggagtgc agcagtagaa tcacttgggt tcattctatt cagaacccct gaacagtgcc 1981 caagtgttgt ctctttgttg tcagagagtt acaaccctca tgtgcgctac ggagctgcaa 2041 tggccttggg gatatgctgt gctggtacag gaaacaagga agccattaat ttgctagaac 2101 caatgacaaa cgaccccgtg aactacgtga ggcaaggggc actcatagct tcagctctca 2161 tcatgatcca gcagactgaa atcacttgtc caaaggtgaa tcagttcaga cagctgtatt 2221 ccaaagtcat caatgataag catgatgatg tcatggccaa gtttggcgct attctggccc 2281 agggcatact ggatgcaggt ggtcataatg tcacaatctc cttgcagtcc aggactgggc 2341 atactcatat gccttctgtg gttggcgtcc ttgtatttac ccagttttgg ttctggtttc 2401 ctctttcaca cttcctgtca ttggcttata cccctacctg tgtcattggc cttaacaagg 2461 acttaaagat gccgaaagtt cagtataaat cgaactgtaa accatccaca tttgcatatc 2521 ctgcccctct ggaagtacca aaagaaaaag aaaaggaaaa ggtttctact gctgtattat 2581 ctataactgc caaggctaaa aagaaggaaa aagaaaagga aaaaaaggag gaggagaaaa 2641 tggaagtgga tgaggcagag aaaaaggagg aaaaagagaa gaaaaaagaa cctgagccaa 2701 acttccagtt attggataac ccagcccgag ttatgcctgc ccagcttaag gtcctaacca 2761 tgccggagac ctgtagatac cagcctttca aaccactctc tattggaggc atcatcattc 2821 tgaaggatac cagtgaagac attgaggagc tggtggaacc tgtggcagca catggcccaa 2881 aaatcgagga ggaggaacaa gagccagaac ccccagaacc atttgagtat attgatgatt 2941 aaggaccaga ggatctcact tgcttatctg aagaagattg tccaggctca tattgggaat 3001 gcttatgagg aaattcatgc cgagacctgc tattcaatgc atgtatcgtt gcctctgcac 3061 tgacctgaag aaccctgtct ccaagtcttt ggttgaagag aagatatatg actgttgagt 3121 gtgctctttc acagaacttg gttttcaaat aaatataaga tctccagatg gacaag // LOCUS D45213 555 bp mRNA PRI 05-JUN-1997 DEFINITION Human mRNA for zinc finger protein, complete cds. ACCESSION D45213 NID g2190183 KEYWORDS zinc finger protein. SOURCE Homo sapiens lymphoma T-cell cell_line:KUT-2 cDNA to mRNA, clone_lib:K.Shigesada clone:hT86. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 555) AUTHORS Terunuma,A. TITLE Direct Submission JOURNAL Submitted (20-JAN-1995) to the DDBJ/EMBL/GenBank databases. Atsushi Terunuma, Cancer Institute Japanese Foundation For Cancer Research, Cell Biology; Kami-Ikebukuro 1-37-1, Toshima-ku, Tokyo 170, Japan (E-mail:aterunum@ddbj.nig.ac.jp, Tel:03-3918-0111(ex.4101), Fax:03-3917-7564) REFERENCE 2 (bases 1 to 555) AUTHORS Terunuma,A., Shiba,K. and Noda,T. TITLE A novel genetic screening system for isolation of cDNA clones that trans-dominantly inhibit the DNA-binding activity of eukaryotic transcription factors JOURNAL Unpublished (1995) REFERENCE 3 (sites) AUTHORS Terunuma,A., Shiba,K. and Noda,T. TITLE A novel genetic system to isolate a dominant negative effector on DNA-binding activity of oct-2 JOURNAL Nucleic Acids Res. 25 (10), 1984-1990 (1997) MEDLINE 97277179 FEATURES Location/Qualifiers source 1..555 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KUT-2" /cell_type="T-cell" /clone="hT86" /clone_lib="K.Shigesada" /tissue_type="lymphoma" CDS 51..401 /codon_start=1 /product="zinc finger protein" /db_xref="PID:d1021201" /db_xref="PID:g2190184" /translation="MKAKRRRPDLDEIHRELRPQGSARPQPDPNAEFDPDLPGGGLHR CLACARYFIDSTNLKTHFRSKDHKKRLKQLSVEPYSQEEAERAAGMGSYVPPRRLAVP TEVSTEVPEMDTST" polyA_signal 535..540 polyA_site 555 BASE COUNT 120 a 173 c 179 g 83 t ORIGIN 1 gtcgctcccg ccggacaggc gcgcaccgag cgcactctct agcccggcag atgaaggcga 61 agcggcggcg gccggacttg gatgagattc accgcgagct gcggcctcag ggatccgcac 121 gaccccagcc cgacccaaac gccgagttcg accccgacct gccagggggc ggcctgcacc 181 gctgtctggc ctgcgcgagg tacttcatcg attccaccaa cctgaagacc cacttccgat 241 ccaaagacca caagaaaagg ctgaagcagc tgagcgtcga gccctacagt caggaagagg 301 cggagagggc agcgggtatg ggatcctatg tgccccccag gcggctggca gtgcccacgg 361 aagtgtccac tgaggtccct gagatggata cctctacctg acatggcctg aagatgcagg 421 gcagaggaat tgcccatgga cagtgacgca aggactaggc tgggagggag cgtgccaacc 481 ccttttgcct ctgggtttgg ggagcggagg gcctcttctt ggtgccctgc ccccaataaa 541 ggaactggac aaaga // LOCUS D49737 1293 bp mRNA PRI 04-NOV-1997 DEFINITION Homo sapiens mRNA for cytochrome b large subunit of complex II, complete cds. ACCESSION D49737 NID g2588778 KEYWORDS cytochrome b large subunit of complex II. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Hirawake,H., Taniwaki,M., Tamura,A., Kojima,S. and Kita,K. TITLE Cytochrome b in human complex II (succinate-ubiquinone oxidoreductase): cDNA cloning of the components in liver mitochondria and chromosomal assignment of the genes for the large and small subunits to 1q21 and 11q23 JOURNAL Cytogenet. Cell Genet. (1997) In press REFERENCE 2 (bases 1 to 1293) AUTHORS Kita,K. TITLE Direct Submission JOURNAL Submitted (17-MAR-1995) to the DDBJ/EMBL/GenBank databases. Kiyoshi Kita, The Institute of Medical Science, The University of Tokyo, Parasitology; 4-6-1 Shirokanedai, Minato-Ku, Tokyo 108, Japan (E-mail:kitak@ims.u-tokyo.ac.jp, Tel:03-5449-5370, Fax:03-5449-5410) FEATURES Location/Qualifiers source 1..1293 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 125..523 /note="cytochrome b large subunit of complex II(succinate-ubiquinone oxidoreductase); mitochondrial protein" /codon_start=1 /product="cytochrome b large subunit of complex II" /db_xref="PID:g2588779" /translation="MERFWNKNIGSNRPLSPHITIYSWSLPMAMSICHRGTGIALSAG VSLFGMSALLLPGNFESYLELVKSLCLGPALIHTAKFALVFPLMYHTWNGIRHLMWDL GKGLKIPQLYQSGVVVLVLTVLSSMGLAAM" polyA_site 1293 /note="8 A nucleotides" BASE COUNT 280 a 291 c 316 g 406 t ORIGIN 1 cccggaaccc aagatggctg cgctgttgct gagacacgtt ggtcgtcatt gcctccgagc 61 ccactttagc cctcagctct gtatcagaaa tgctgttcct ttgggaacca cggccaaaga 121 agagatggag cggttctgga ataagaatat aggttcaaac cgtcctctgt ctccccacat 181 tactatctac agttggtctc ttcccatggc gatgtccatc tgccaccgtg gcactggtat 241 tgctttgagt gcaggggtct ctctttttgg catgtcggcc ctgttactcc ctgggaactt 301 tgagtcttat ttggaacttg tgaagtccct gtgtctgggg ccagcactga tccacacagc 361 taagtttgca cttgtcttcc ctctcatgta tcatacctgg aatgggatcc gacacttgat 421 gtgggaccta ggaaaaggcc tgaagattcc ccagctatac cagtctggag tggttgtcct 481 ggttcttact gtgttgtcct ctatggggct ggcagccatg tgaagaaagg aggctcccag 541 catcatcttc ctacacatta ttacattcac ccatctttct gtttgtcatt cttatctcca 601 gcctgggaaa agttctcctt atttgtttag atccttttgt attttcagat ctccttggag 661 cagtagagta cctggtagac cataatagtg gaaaagggtc tagttttccc cttgtttcta 721 aagatgaggt ggctgcaaaa actccccttt tttgcccaca gcttgcctac tctcggccta 781 gaagcagtta ttctctctcc atattgggct ttgatttgtg ctgagggtca gcttttggct 841 ccttcttcct gagacagtgg aaacaatgcc agctctgtgg cttctgccct ggggatgggc 901 cgggttgggg ggtgggttgg tgaggctttg ggtgccactg cctgtgggtt gctggcttaa 961 aggacaattc tcttcattgg tgagagccca ggccattaac acctacacag tgttattgaa 1021 agaagagagg tgggggtgga ggggaattag tctgtcccag ctagagggag ataaagaggg 1081 ctagttagtt cttggagcag ctgcttttga ggagaaaata tatagctttg gacacgagga 1141 agatctagaa aattatcatt gaacatatta atggttattt ctttttcttg gatttccaga 1201 aaagcctctt aattttatgc tttctcatcg aagtaatgta cccttttttt ctgaaactga 1261 attaaatact cattttatct ttgactctcc ttg // LOCUS D49817 1756 bp mRNA PRI 29-AUG-1997 DEFINITION Homo sapiens mRNA for 6-phosphofructo-2-kinase/fructose-2, 6-bisphosphatase, complete cds. ACCESSION D49817 NID g1468914 KEYWORDS 6-phosphofructo-2-kinase/fructose-2,6-bisphosphatase. SOURCE Homo sapiens first trimester placenta chorionic villi cDNA to mRNA, clone_lib:lambda gt10 clone:HP (combined with 2K-3 and AP-4). ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1756) AUTHORS Sakakibara,R. TITLE Direct Submission JOURNAL Submitted (22-MAR-1995) to the DDBJ/EMBL/GenBank databases. Ryozo Sakakibara, Nagasaki University, School of Pharmaceutical Sciences, Department of Biochemistry; 1-14 Bunkyo-machi, Nagasaki, Nagasaki 852, Japan (E-mail:rssakaki@net.nagasaki-u.ac.jp, Tel:0958-47-1111(ex.2536), Fax:0958-44-6774) REFERENCE 2 (bases 1 to 1756) AUTHORS Sakai,A., Kato,M., Fukasawa,M., Ishiguro,M., Furuya,E. and Sakakibara,R. TITLE Cloning of cDNA encoding for a novel isozyme of fructose 6-phosphate, 2-kinase/fructose 2,6-bisphosphatase from human placenta JOURNAL J. Biochem. 119 (3), 506-511 (1996) MEDLINE 96271013 REFERENCE 3 (sites) AUTHORS Sakakibara,R., Kato,M., Okamura,N., Nakagawa,T., Komada,Y., Tominaga,N., Shimojo,M. and Fukasawa,M. TITLE Characterization of a human placental fructose-6-phosphate, 2-kinase/fructose-2,6-bisphosphatase JOURNAL J. Biochem. 122 (1), 122-128 (1997) MEDLINE 97420695 FEATURES Location/Qualifiers source 1..1756 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="chorionic villi" /clone="HP (combined with 2K-3 and AP-4)" /clone_lib="lambda gt10" /dev_stage="first trimester" /tissue_type="placenta" CDS 19..1581 /function="bifunctional enzyme" /note="EC 2.7.1.105 / EC 3.1.3.46" /codon_start=1 /product="6-phosphofructo-2-kinase/fructose-2, 6-bisphosphatase" /db_xref="PID:d1009235" /db_xref="PID:g1468916" /translation="MPLELTQSRVQKIWVPVDHRPSLPRSCGPKLTNSPTVIVMVGLP ARGKTYISKKLTRYLNWIGVPTKVFNVGEYRREAVKQYSSYNFFRPDNEEAMKVRKQC ALAALRDVKSYLAKEGGQIAVFDATNTTRERRHMILHFAKENDFKAFFIESVCDDPTV VASNIMEVKISSPDYKDCNSAEAMDDFMKRISCYEASYQPLDPDKCDRDLSLIKVIDV GRRFLVNRVQDHIQSRIVYYLMNIHVQPRTIYLCRHGENEHNLQGRIGGDSGLSSRGK KFASALSKFVEEQNLKDLRVWTSQLKSTIQTAEALRLPYEQWKALNEIDAGVCEELTY EEIRDTYPEEYALREQDKYYYRYPTGESYQDLVQRLEPVIMELERQENVLVICHQAVL RCLLAYFLDKSAEEMPYLKCPLHTVLKLTPVAYGCRVESIYLNVESVCTHRERSEDAK KGPNPLMRRNSVTPLASPEPTKKPRINSFEEHVASTSAALPSCLPPEVPTQLPGQNMK GSRSSADSSRKH" BASE COUNT 405 a 524 c 508 g 319 t ORIGIN 1 tcgggcgcag ccgcgaagat gccgttggaa ctgacgcaga gccgagtgca gaagatctgg 61 gtgcccgtgg accacaggcc ctcgttgccc agatcctgtg ggccaaagct gaccaactcc 121 cccaccgtca tcgtcatggt gggcctcccc gcccggggca agacctacat ctccaagaag 181 ctgactcgct acctcaactg gattggcgtc cccacaaaag tgttcaacgt cggggagtat 241 cgccgggagg ctgtgaagca gtacagctcc tacaacttct tccgccccga caatgaggaa 301 gccatgaaag tccggaagca atgtgcctta gctgccttga gagatgtcaa aagctacctg 361 gcgaaagaag ggggacaaat tgcggttttc gatgccacca atactactag agagaggaga 421 cacatgatcc ttcattttgc caaagaaaat gactttaaag cgtttttcat cgagtcggtg 481 tgcgacgacc ctacagttgt ggcctccaat atcatggaag ttaaaatctc cagcccggat 541 tacaaagact gcaactcggc agaagccatg gacgacttca tgaagaggat cagttgctat 601 gaagccagct accagcccct cgaccccgac aaatgcgaca gggacttgtc gctgatcaag 661 gtgattgacg tgggccggag gttcctggtg aaccgggtgc aggaccacat ccagagccgc 721 atcgtgtact acctgatgaa catccacgtg cagccgcgta ccatctacct gtgccggcac 781 ggcgagaacg agcacaacct ccagggccgc atcgggggcg actcaggcct gtccagccgg 841 ggcaagaagt ttgccagtgc tctgagcaag ttcgtggagg agcagaacct gaaggacctg 901 cgcgtgtgga ccagccagct gaagagcacc atccagacgg ccgaggcgct gcggctgccc 961 tacgagcagt ggaaggcgct caatgagatc gacgcgggcg tctgtgagga gctgacctac 1021 gaggagatca gggacaccta ccctgaggag tatgcgctgc gggagcagga caagtactat 1081 taccgctacc ccaccgggga gtcctaccag gacctggtcc agcgcttgga gccagtgatc 1141 atggagctgg agcggcagga gaatgtgctg gtcatctgcc accaggccgt cctgcgctgc 1201 ctgcttgcct acttcctgga taagagtgca gaggagatgc cctacctgaa atgccctctt 1261 cacaccgtcc tgaaactgac gcctgtcgct tatggctgcc gtgtggaatc catctacctg 1321 aacgtggagt ccgtctgcac acaccgggag aggtcagagg atgcaaagaa gggacctaac 1381 ccgctcatga gacgcaatag tgtcaccccg ctagccagcc ccgaacccac caaaaagcct 1441 cgcatcaaca gctttgagga gcatgtggcc tccacctcgg ccgccctgcc cagctgcctg 1501 cccccggagg tgcccacgca gctgcctgga caaaacatga aaggctcccg gagcagcgct 1561 gactcctcca ggaaacactg aggcagacgt gtcggttcca ttccatttcc atttctgcag 1621 cttagcttgt gtcctgccct ccgcccgagg caaaacgtat cctgaggact tcttccggag 1681 agggtggggt ggagcagcgg gggagccttg gccgaagaga accatgcttg gcaccgtctg 1741 tgtcccctcg gccgct // LOCUS D49818 1980 bp mRNA PRI 04-SEP-1997 DEFINITION Homo sapiens mRNA for 6-phosphofructo-2-kinase/fructose-2, 6-bisphosphatase, complete cds. ACCESSION D49818 NID g1905760 KEYWORDS 6-phosphofructo-2-kinase/fructose-2,6-bisphosphatase. SOURCE Homo sapiens first trimester placenta chorionic villi cDNA to mRNA, clone_lib:lambda gt10 clone:2K-1. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Sakai,A., Kato,M., Fukasawa,M., Ishiguro,M., Furuya,E. and Sakakibara,R. TITLE Cloning of cDNA encoding for a novel isozyme of fructose 6-phosphate, 2-kinase/fructose 2,6-bisphosphatase from human placenta JOURNAL J. Biochem. 119 (3), 506-511 (1996) MEDLINE 96271013 REFERENCE 2 (bases 1 to 1980) AUTHORS Sakakibara,R. JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 1980) AUTHORS Sakakibara,R. TITLE Direct Submission JOURNAL Submitted (22-MAR-1995) to the DDBJ/EMBL/GenBank databases. Ryozo Sakakibara, Nagasaki University, School of Pharmaceutical Sciences, Department of Biochemistry; 1-14 Bunkyo-machi, Nagasaki, Nagasaki 852, Japan (E-mail:rssakaki@net.nagasaki-u.ac.jp, Tel:0958-47-1111(ex.2536), Fax:0958-44-6774) COMMENT Sequence updated (20-Jan-1997) by: Ryozo Sakakibara Sequence updated (19-Mar-1997) by: Ryozo Sakakibara. FEATURES Location/Qualifiers source 1..1980 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="chorionic villi" /clone="2K-1" /clone_lib="lambda gt10" /dev_stage="first trimester" /tissue_type="placenta" CDS 27..1436 /function="bifunctional enzyme" /note="EC 2.7.1.105 / EC 3.1.3.46" /codon_start=1 /product="6-phosphofructo-2-kinase/fructose-2, 6-bisphosphatase" /db_xref="PID:d1019662" /db_xref="PID:g1905761" /translation="MASPRELTQNPLKKIWMPYSNGRPALHACQRGVCMTNCPTLIVM VGLPARGKTYISKKLTRYLNWIGVPTREFNVGQYRRDVVKTYKSFEFFLPDNEEGLKI RKQCALAALRDVRRFLSEEGGHVAVFDATNTTRERRATIFNFGEQNGYKTFFVESICV DPEVIAANIVQVKLGSPDYVNRDSDEATEDFMRRIECYENSYESLDEDLDRDLSYIKI MDVGQSYVVNRVADHIQSRIVYYLMNIHVTPRSIYLCRHGESELNLKGRIGGDPGLSP RGREFAKSLAQFISDQNIKDLKVWTSQMKRTIQTAEALGVPYEQWKVLNEIDAGVCEE MTYEEIQDNYPLEFALRDQDKYRYRYPKGESYEDLVQRLEPVIMELERQENVLVICHQ AVMRCLLAYFLDKAAEQLPYLKCPLHTVLKLTPVAYGCKVESIFLNVAAVNTHRDRPQ NVDISRPPEEALVTVPAHQ" BASE COUNT 443 a 561 c 561 g 415 t ORIGIN 1 gcagtccgac tcatcccggc cccgggatgg cgtccccacg ggaattgaca cagaaccccc 61 tgaagaagat ctggatgcca tacagcaatg ggcggcccgc tctgcacgct tgccagcgcg 121 gtgtgtgcat gaccaactgc ccaactctca ttgtcatggt gggcctgccc gccaggggca 181 agacctacat ctccaagaag ctgactcgat acctgaactg gattggtgtg cccactcggg 241 agttcaatgt tggccagtac cgccgggacg tggtcaagac ctacaaatct tttgaatttt 301 ttctccccga caatgaagag ggcctgaaaa tcaggaagca gtgtgccctg gcagccctcc 361 gtgacgtccg gcggttcctt agtgaggagg ggggacatgt ggcggttttt gatgccacaa 421 acaccacccg agaacggaga gcgaccatct ttaattttgg agaacagaat ggctacaaga 481 ccttttttgt cgagtccatc tgtgtggatc ctgaggtcat agctgccaac atcgtgcaag 541 tgaaactggg cagccctgac tatgtcaacc gcgacagtga tgaggctacg gaggacttca 601 tgaggcgcat tgagtgctat gagaactcct acgagtcgct agatgaggac ctggataggg 661 acctgtccta tatcaagatc atggatgtgg gccagagcta cgtggtgaac cgtgtggctg 721 accacatcca gagccgcatc gtatattacc tcatgaacat ccacgtgacc ccccgctcca 781 tctacctctg ccggcacggg gagagcgagc tcaacctcaa gggccggatt ggcggggacc 841 caggactgtc ccctcggggc agggagtttg ccaagagtct agcccagttc atcagtgacc 901 aaaatatcaa ggatctgaag gtctggacaa gccagatgaa gaggacaatc cagacggctg 961 aggcactggg tgtgccctat gaacagtgga aggtcctcaa cgagatcgat gcgggcgtct 1021 gtgaggaaat gacctacgag gaaattcagg ataattatcc actggagttc gccctgcggg 1081 accaggacaa gtaccggtac cggtacccta aaggggagtc ctacgaggac ctggtccaga 1141 gactggagcc tgtcatcatg gagctggaga ggcaagagaa tgtgctggtc atctgccacc 1201 aggctgtgat gcgctgcctg ctggcctact tcctcgacaa ggcagcagaa cagctgccct 1261 acctcaagtg tccgctgcac acagtcctga agctgactcc tgtggcatat ggttgtaaag 1321 tggagtccat attcctgaac gtggctgctg tgaacacgca ccgggacagg cctcagaacg 1381 tggacatctc aagacctcca gaggaagccc ttgtcacggt gcctgctcac cagtgaccat 1441 gttcatccac tgtgaccact aggcaggcac tgctctctgc agagggggtc attccaggcc 1501 ctccagtgtg tgtgatagtc accatgccat gcagggatat tcttaaagcc acacatggct 1561 ggcggaaccc agagccccca ccccagccac ctggctcttt gttgacagtc ggcgacaagg 1621 ttgtgcgtgg ctcctgacct gctgctaaga gtcacttgac cagactgcat ctgcatgggc 1681 tgcgcggagg ttgcccagcc ccagtttctt ccggcgcagc tcttaggtgt tcactctcgc 1741 cagctcagtt ggctttgtga agtgtgaaac cctacaatgt gaaaggaaag tgcttgctgt 1801 gatgttccta ctgtggccca gctgcccagc atggacctgg tgactctcca cagggcctct 1861 accatcctct ctgtggccac ttcctgagcc agaggccagg tcttcatggg gccctgagct 1921 tctgctgcct ctggtgagag ggagagccct tcccatcctt acccaccagg aactagagcc // LOCUS D49835 5817 bp mRNA PRI 04-NOV-1997 DEFINITION Homo sapiens mRNA for DNA-binding protein, complete cds. ACCESSION D49835 NID g2588782 KEYWORDS DNA-binding protein. SOURCE Homo sapiens cell_line:MDA-MB453 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Fujimoto-Nishiyama,A., Ishii,S., Matsuda,S., Inoue,J. and Yamamoto,T. TITLE A novel zinc finger protein, Finb, is a transcriptional activator and localized in nuclear bodies JOURNAL Gene 195 (2), 267-275 (1997) MEDLINE 97449303 REFERENCE 2 (bases 1 to 5817) AUTHORS Fujimoto-Nishiyama,A. TITLE Direct Submission JOURNAL Submitted (23-MAR-1995) to the DDBJ/EMBL/GenBank databases. Akiko Fujimoto-Nishiyama, Institute of Medical Science, University of Tokyo, Oncology; 4-6-1 Shirokanedai, Minato-ku, Tokyo 108, Japan (Tel:03-5449-5301, Fax:03-5449-5413) FEATURES Location/Qualifiers source 1..5817 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="MDA-MB453" CDS 513..5483 /codon_start=1 /product="DNA-binding protein" /db_xref="PID:g2588783" /translation="MMSAVMSVGKVTENGGSPQGIKSPSKPPGPNRIGRRNQETKEEK SSYNCPLCEKICTTQHQLTMHIRQHNTDTGGADHSCSICGKSLSSASSLDRHMLVHSG ERPYKCTVCGQSFTTNGNMHRHMKIHEKDPTSATATAPGRGDFASAKSSKRKLSHDAE SEREDPAPAKKMVEDGQSGDLEKKADEVFHCPVCFKEFVCKYGLETHMETHSDNPLRC DICCVTFRTHRGLLRHNALVHKQLPRDAMGRPFIQNNPSIPAGFHDLGFTDFSCRKFP RISQAWCETNLRRCISEQHRFVCDTCDKAFPMLCSVALHKQTHVAADQGQEKPQATPL PGDALDQKGFLALLGLQHTKDVRPAPAEEPLPDDNQAIQLQTLKCQLPQDPGCTNLLS LSPFEAASLGGSLTVLPATKDSIKHLSLQPFQKGFIIQPDSSIVVKPISGESAIELAD IQQILKMAASAPPQISLPPFSKAPAAPLQAIFKHMPPLKAKPLVTPRTVVGHLHAPAS HQRQQASRLYQPQPAATAPEAPQRLTGGGLQRPPAAVQVRDPAPRGHAALPAAQPRAE LPGQPEMKTQLEQDSIIEALLPLSMEAKIKQEITEGELKAFMTAPGGKKTPAMRKVLY PCRFCNQVFAFSGVLRAHVRSHLGISPYQCNICDYIAADKAALIRHLRTHSGERPYIC KICHYPFTVKANCERHLRKKHLKATRKDIEKNIEYVSSSAAELVDAFCAPDTVCRLCG EDLKHYRALRIHMPTHCGRGLGGGHKGRKPFECKECSAAFAAKRNCIHHILKQHLHVP EQDIESYVLAADGLGPAEAPAAEASGRGEDSGCAALGDCKPLTAFLEPQNGFLHRGPT QPPPPHVSIKLEPASSFAVDFNEPLDFSQKGLALVQVKQENISFLSPSSLVPYDCSME PIDLSIPKNFRKGDKDLATPSEAKKPEEEAGSSEQPSPCPAPGPSLPVTLGPSGILES PMAPAPAATPEPPAQPLQGPVQLAVPIYSSALVSSPPLVGSSALLSGTALLRPLRPKP PLLLPKPPVTEELPPLASIAQIISSVSSAPTLLKTKVADPGPASTGSNTTASDSLGGS VPKAATTATPAATTSPKESSEPPAPATGPEAASPTEQGPAGTSKKRGRKRGMRSRPRA NSGGVDLYSSGEFASIEKMLATTDTNKFSPFLQTAEDNTQDEVAGAPADHHGPSDEEQ GSPPEDKLLRAKRNSYTNCLQKITCPHCPRVFPWASSLQRHMLTHTDSQSDAGDCSRR GRSGYDLTSRDREQPSEGATELRQVAGDAPVEQATAETASRCTGKSTGVGRAMSRRRS MALRRALGTPTRKRTRRATRAWTWTSPPSSWTSSWRRATRRQAGGAASQEQKLACDAC GKSFKFLGTLSRHRKAHGRPGGPRTRREMAPALQRRGPSPPLNRRRSPPRPRQRWWSR PGSGRPRPEKLAEETEGPSDGESAAEKRSSEKSDDDKKPKTDSPKSVASKADKRKKVC SVCNKRFWSLQDLSRHMRSHTGERPYKCQTCERTFTLKHSLVRHQRIHQKARHAKHHG KDSDKEERGEEDSENESTHSGNNAVSENEAELAPNASNHMAVTRSRKEGLASATKDCS HREEKVTQGGRLSLARVTLTQRARRPWGRTCWSRAARGLPTQSWHS" BASE COUNT 1324 a 1901 c 1690 g 902 t ORIGIN 1 cgggatggca actgcggtca ccctgctaaa gtcggggcgg ggcggcggtc ctccccctca 61 ccccccccag tccgagcgcc gccgccgccg ccgccgccgc cgccggccgc ttcagtaaca 121 cgtccccagg agactcgcag gagcaacacg tgatgtgtct acttatcagg gttgctccga 181 gtgtgtgttc caggagtggt ggctctgagg tgtgaccctg cccacgtttg ggcccagcca 241 ccttcagccc cccaataagg tgggcagctt gataacacaa agaaaacagt gtcaacgagt 301 actaccaaga gaagaagaca agaagtcaac acagactcat tttctactcc gtgtgaatga 361 tagctacagc aggggaaagt ttcatagtct atcagtgggt cagaaaatgg agttttatag 421 cagaggcttc ttagaagctt aaacccctgt cccaatgacg tcaagttcgt cccgctggct 481 tggaaggttc agacctatct tccatcaaca ccatgatgtc ggcggtcatg agtgtaggga 541 aggtcacaga gaatggcggg agcccccagg ggatcaagtc cccctcgaag cctccaggac 601 caaatcggat tggcagaagg aaccaggaaa cgaaagagga gaagtcttcc tataactgcc 661 ccctgtgtga gaagatttgc actacccagc accagctgac catgcacatt cgccagcaca 721 acacagacac tggaggagcc gaccactcat gcagcatctg cggaaagtca ctgagctcgg 781 ccagctccct cgatcgccac atgctggtgc actctggcga gaggccttac aagtgcactg 841 tgtgtggcca gtcatttacc accaatggga acatgcacag acatatgaag atccatgaga 901 aggacccaac tagtgccaca gccacagccc caggtagagg agactttgca tccgctaagt 961 cctccaagag gaaactgagt cacgatgccg agtcagagag agaagaccca gcaccagcta 1021 aaaagatggt agaagacggg cagtcaggtg acttggagaa gaaagctgat gaagtctttc 1081 actgcccagt atgtttcaag gagtttgttt gcaagtatgg actggagacc cacatggaga 1141 cccattcaga taacccacta agatgtgaca tttgttgtgt cacctttcga acacatcgag 1201 gactgctgcg tcacaacgcg cttgtccaca aacaacttcc cagggatgca atgggcagac 1261 ctttcataca gaacaaccct tcaattcctg ctggcttcca cgacttagga ttcacggact 1321 tctcctgtag gaagtttcct cgcatttctc aggcctggtg cgaaacaaac ctgcggaggt 1381 gcatcagcga gcaacaccgt tttgtctgcg acacctgtga caaggcgttc cccatgctct 1441 gctcagtggc tctgcacaag cagacccacg tggcggcaga ccagggtcaa gaaaagccgc 1501 aggccacgcc cctgcctggt gacgccctgg accagaaggg cttcctggcc ttgcttggcc 1561 tgcagcacac caaagacgtc aggcctgccc ccgccgagga gcccctgccg gatgacaacc 1621 aggcaattca gctccagaca ctcaagtgtc agctacctca ggaccccggc tgcaccaacc 1681 tgctgagcct gtcacctttc gaagctgctt ccctaggcgg ttctctcaca gttctccccg 1741 caaccaagga cagcataaag cacctgtccc tgcagccctt ccagaagggc ttcatcatcc 1801 agcctgacag cagcattgtg gtcaagccca tctctggcga gtcggccatc gagctggcag 1861 acatccagca aattctgaag atggcagcct cggctccccc tcagatcagt cttccgccct 1921 tctccaaggc ccctgccgcc ccactgcagg cgatcttcaa gcacatgccc cctctgaagg 1981 caaagcccct ggtcacacca cggacggtgg tgggccacct ccacgccccc gcctctcatc 2041 aacgccagca ggcttcccgg ctgtatcagc cccagcctgc cgccaccgcc cctgaagctc 2101 ctcaaaggct cactggaggc ggcctccaac gcccacctgc tgcagtccaa gtccgggacc 2161 cagccccacg cggccacgcg gctctccctg cagcgcagcc gcgggcggag ctgccgggcc 2221 agcctgagat gaagacgcag ctggagcagg acagcatcat cgaggccctg ctgccgctga 2281 gcatggaggc caagatcaag caggagatca cagaggggga actcaaggcc ttcatgacag 2341 cgcccggcgg caagaagacg cccgccatgc gcaaggtgct ctacccttgc cgcttctgca 2401 accaggtgtt tgccttctcg ggggtcttgc gtgcccacgt gcgctcccac ctgggcatct 2461 cgccatacca gtgcaacatc tgcgactaca tcgccgccga caaggccgcg ctcatccgcc 2521 acctgcgcac gcacagtggg gagcggccct acatttgcaa gatctgccac taccccttca 2581 ctgtcaaagc caactgcgag cggcacctgc gcaagaagca cctcaaggcc acccgcaagg 2641 atatcgagaa gaacatcgag tatgtgagta gcagcgcggc cgagctggtg gacgccttct 2701 gcgccccgga caccgtgtgc cggctgtgcg gcgaggacct caagcactat cgtgccctgc 2761 gcatccacat gccgacgcac tgcggccgcg gcctgggcgg gggccacaag ggccgcaagc 2821 ccttcgagtg caaggagtgc agcgccgcgt tcgcggccaa gcgcaactgc atccaccaca 2881 tcctcaagca gcacctgcac gtgcccgagc aggacatcga gagctacgtg ctggccgccg 2941 acggcctggg ccccgcagag gcgccggccg ctgaggcgtc ggggcgcggg gaggacagtg 3001 gctgcgctgc ccttggtgac tgcaagcccc tcactgcctt cctggaaccc cagaacggct 3061 ttcttcacag gggccccacc cagcctccac ctccccatgt ctcgatcaag ttggagcccg 3121 ccagtagctt tgcggtggac ttcaatgagc ccctggactt ctcgcagaag ggcctggccc 3181 tggtccaagt gaagcaggaa aacatctcct ttctgagccc ttcttccctg gtcccctatg 3241 actgctccat ggagcccatc gacctgtcca tccccaagaa cttcaggaaa ggggacaagg 3301 atttggccac tcccagcgaa gccaagaagc ctgaggagga ggcggggagc agcgagcagc 3361 cctctccctg cccagcaccc ggcccttctc ttcctgtaac tttggggccc agcggaatcc 3421 tggaaagccc catggcccct gctccggcgg ccaccccgga acccccagca cagcccctgc 3481 agggccctgt tcagctggcg gtcccaatct actcctcagc cctggtcagc agccctccac 3541 tcgtgggcag ctcagccctc ctgagtggca cagccttgct gcgtccactg cggcccaagc 3601 ccccgctgct tttgccaaag ccccccgtga cagaagagct gcccccgctg gcctccattg 3661 cccagatcat ctcatctgta tcctcggccc ccaccctgct gaaaaccaag gtggcggacc 3721 cagggcccgc aagcactggc agtaacacca cggcttcaga cagcttagga ggttctgtcc 3781 ccaaagccgc caccaccgcc acccccgctg ccaccaccag cccaaaagag tctagtgagc 3841 ctcccgctcc agccacaggc ccagaggctg cctctcccac cgagcagggc ccagcgggca 3901 cgtcgaagaa gaggggccgg aaaaggggga tgaggagccg accccgcgcc aacagcggcg 3961 gggtggacct gtactccagc ggggagtttg ccagcatcga gaagatgctg gccactacag 4021 acaccaacaa gttcagtccg tttctgcaga cagcggagga caacactcag gatgaggtgg 4081 ccggagcccc tgccgaccac catgggccca gtgatgaaga gcagggcagt cccccagaag 4141 acaagctgct gagggccaag cggaactcgt acaccaactg cctgcagaag atcacctgtc 4201 cccactgtcc ccgggttttc ccttgggcca gctccctaca gaggcacatg ctcacacaca 4261 ctgacagtca gtcggatgcg ggagactgca gccgccgcgg gcgaagtggc tatgacctca 4321 cctcacggga cagagagcag ccgtcggagg gcgccactga gctccgccag gtcgcagggg 4381 atgcgcctgt ggagcaggcc acggcggaaa cggcctcgcg gtgcaccggg aagagcacgg 4441 gcgtggggag agccatgagc cggaggagga gcatggcact gaggagagca ctggggacgc 4501 cgacgcggaa gaggacgcgt cgagcaacca gagcctggac ctggacttcg ccaccaagct 4561 catggacttc aagctggcgg agggcgacgc ggaggcaggc cgggggcgcg gcctcgcagg 4621 agcagaagct cgcctgcgac gcctgtggga agagcttcaa gttcctgggc accctgagcc 4681 gccaccggaa ggcgcacggc cgcccaggag gcccaaggac gagaagggag atggcgccag 4741 cactgcagag gaggggcccc agcccgcccc tgaacaggag gagaagcccc ccgagacccc 4801 ggcagaggtg gtggagtcgg cccgggtcgg ggaggccccg gccggaaaag ctcgcggagg 4861 agacggaggg cccctccgac ggggagagcg cggccgagaa aaggtcctca gagaagagcg 4921 acgatgacaa gaaaccaaag acagactccc ccaaaagcgt ggccagcaag gcagacaaga 4981 ggaagaaggt ctgcagcgtg tgcaacaagc ggttctggtc gctgcaggac ctgagccggc 5041 acatgcgctc ccacacaggg gaaaggccat acaaatgtca gacctgcgag cgaaccttca 5101 ccttgaagca cagcctggtt cgccaccagc ggatccacca gaaagccagg catgccaaac 5161 accacgggaa ggacagcgac aaggaagagc ggggtgagga ggacagcgag aatgagtcca 5221 cccacagcgg caacaacgcc gtctcagaga acgaggctga gctggctccc aatgccagca 5281 accacatggc tgtcacccgg agccggaagg agggcttggc cagtgccacc aaggactgca 5341 gccacaggga ggagaaggtc acgcagggtg gccgtctgag cctggccagg gtgaccttaa 5401 cccagagagc ccggcggccc tggggcagga cctgctggag ccgcgcagca agaggcctgc 5461 ccacccaatc ctggcacagc tgatggcgcc tcccagctcg tcgtagggga tggagtgaca 5521 gcctcagccc cctcagcaca gacaaaagcc agcagagcaa agcgtctata cttcatgggg 5581 tttcctcagt gccctttggc tgttgaggag tgagagagag agagagagag agagagagag 5641 agagagacaa gcaggagcgt ggctgctcgc tcagtgccat agccttaccg cagcctgcgc 5701 gggaggccac agcccgtgcc gattccagtg ccttaactac ttaccggatc cctccatatt 5761 atcatgggtg ttgtattttt ccaaaatgac ttcttaaaca aaacaaatat tataatg // LOCUS D49919 1426 bp mRNA PRI 14-NOV-1997 DEFINITION Homo sapiens mRNA for C-C chemokine receptor type 2, complete cds. ACCESSION D49919 NID g2626807 KEYWORDS C-C chemokine receptor type 2. SOURCE Homo sapiens cord blood Eosinophils cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1426) AUTHORS Nakajima,T., Yoshida,R. and Harada,S. TITLE Molecular cloning of a human novel C-C chemokine receptor JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 1426) AUTHORS Nakajima,T. TITLE Direct Submission JOURNAL Submitted (27-MAR-1995) to the DDBJ/EMBL/GenBank databases. Toshihiro Nakajima, Shionogi Institute for Medical Science; 2-5-1 Mishima, Settsu, Osaka 566, Japan (Tel:06-382-2612(ex.495), Fax:06-382-2598) FEATURES Location/Qualifiers source 1..1426 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="Eosinophils" /tissue_type="cord blood" CDS 72..1139 /codon_start=1 /evidence=not_experimental /product="C-C chemokine receptor type 2" /db_xref="PID:g2626808" /translation="MDYTLDLSVTTVTDYYYPDIFSSPCDAELIQTNGKLLLAVFYCL LFVFSLLGNSLVILVLVVCKKLRSITDVYLLNLALSDLLFVFSFPFQTYYLLDQWVFG TVMCKVVSGFYYIGFYSSMFFITLMSVDRYLAVVHAVYALKVRTIRMGTTLCLAVWLT AIMATIPLLVFYQVASEDGVLQCYSFYNQQTLKWKIFTNFKMNILGLLIPFTIFMFCY IKILHQLKRCQNHNKTKAIRLVLIVVIASLLFWVPFNVVLFLTSLHSMHILDGCSISQ QLTYATHVTEIISFTHCCVNPVIYAFVGEKFKKHLSEIFQKSCSQIFNYLGRQMPRES CEKSSSCQQHSSRSSSVDYIL" BASE COUNT 366 a 324 c 317 g 419 t ORIGIN 1 ctttgtgaag aaggaattgg caacactgaa acctccagaa caaaggctgt cactaaggtc 61 ccgctgcctt gatggattat acacttgacc tcagtgtgac aacagtgacc gactactact 121 accctgatat cttctcaagc ccctgtgatg cggaacttat tcagacaaat ggcaagttgc 181 tccttgctgt cttttattgc ctcctgtttg tattcagtct tctgggaaac agcctggtca 241 tcctggtcct tgtggtctgc aagaagctga ggagcatcac agatgtatac ctcttgaacc 301 tggccctgtc tgacctgctt tttgtcttct ccttcccctt tcagacctac tatctgctgg 361 accagtgggt gtttgggact gtaatgtgca aagtggtgtc tggcttttat tacattggct 421 tctacagcag catgtttttc atcaccctca tgagtgtgga caggtacctg gctgttgtcc 481 atgccgtgta tgccctaaag gtgaggacga tcaggatggg cacaacgctg tgcctggcag 541 tatggctaac cgccattatg gctaccatcc cattgctagt gttttaccaa gtggcctctg 601 aagatggtgt tctacagtgt tattcatttt acaatcaaca gactttgaag tggaagatct 661 tcaccaactt caaaatgaac attttaggct tgttgatccc attcaccatc tttatgttct 721 gctacattaa aatcctgcac cagctgaaga ggtgtcaaaa ccacaacaag accaaggcca 781 tcaggttggt gctcattgtg gtcattgcat ctttactttt ctgggtccca ttcaacgtgg 841 ttcttttcct cacttccttg cacagtatgc acatcttgga tggatgtagc ataagccaac 901 agctgactta tgccacccat gtcacagaaa tcatttcctt tactcactgc tgtgtgaacc 961 ctgttatcta tgcttttgtt ggggagaagt tcaagaaaca cctctcagaa atatttcaga 1021 aaagttgcag ccaaatcttc aactacctag gaagacaaat gcctagggag agctgtgaaa 1081 agtcatcatc ctgccagcag cactcctccc gttcctccag cgtagactac attttgtgag 1141 gatcaatgaa gactaaatat aaaaaacatt ttcttgaatg gcatgctagt agcagtgagc 1201 aaaggtgtgg gtgtgaaagg tttccaaaaa aagttcagca tgaaggatgc cgtgtgtgtt 1261 gttgccaaca cttggaacac aatgactgga gacatagttg tgcatgcctg gcacaacatc 1321 aagcctgtga ttgtgtttat tgatgatgtt gaacaagtgg tggctttgag ggattctgta 1381 tgccaagtgg aaaaaaaaga tgtctccgga attcgacagg ttatca // LOCUS D49950 1102 bp mRNA PRI 05-JUL-1996 DEFINITION Human Liver mRNA for interferon-gamma inducing factor(IGIF), complete cds. ACCESSION D49950 NID g1405318 KEYWORDS interferon-gamma inducing factor (IGIF). SOURCE Homo sapiens Liver cDNA to mRNA, clone:pHuGFR-50-1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1102) AUTHORS Ushio,S., Namba,M., Okura,T., Hattori,K., Nukada,Y., Akita,K., Tanabe,F., Konishi,K., Micallef,M., Fujii,M., Torigoe,K., Tanimoto,T., Fukuda,S., Ikeda,M., Okamura,H. and Kurimoto,M. TITLE Cloning of the cDNA for human IFN-gamma-inducing factor, expression in Escherichia coli, and studies on the biologic activities of the protein JOURNAL J. Immunol. 156 (11), 4274-4279 (1996) MEDLINE 96247646 REFERENCE 2 (bases 1 to 1102) AUTHORS Ushio,S. TITLE Direct Submission JOURNAL Submitted (29-MAR-1995) to the DDBJ/EMBL/GenBank databases. Shimpei Ushio, Hayashibara Biochemical Laboratories Inc., Fujisaki Institute; 675-1 Fujisaki, Okayama, Okayama 702, Japan (Tel:086-276-3141, Fax:086-276-6885) FEATURES Location/Qualifiers source 1..1102 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pHuGFR-50-1" /tissue_type="Liver" CDS 178..759 /note="precursor protein" /codon_start=1 /product="interferon-gamma inducing factor(IGIF)" /db_xref="PID:d1009319" /db_xref="PID:g1405319" /translation="MAAEPVEDNCINFVAMKFIDNTLYFIAEDDENLESDYFGKLESK LSVIRNLNDQVLFIDQGNRPLFEDMTDSDCRDNAPRTIFIISMYKDSQPRGMAVTISV KCEKISTLSCENKIISFKEMNPPDNIKDTKSDIIFFQRSVPGHDNKMQFESSSYEGYF LACEKERDLFKLILKKEDELGDRSIMFTVQNED" polyA_signal 1062..1080 polyA_site 1102 BASE COUNT 360 a 228 c 231 g 283 t ORIGIN 1 gcctggacag tcagcaagga attgtctccc agtgcatttt gccctcctgg ctgccaactc 61 tggctgctaa agcggctgcc acctgctgca gtctacacag cttcgggaag aggaaaggaa 121 cctcagacct tccagatcgc ttcctctcgc aacaaactat ttgtcgcagg aataaagatg 181 gctgctgaac cagtagaaga caattgcatc aactttgtgg caatgaaatt tattgacaat 241 acgctttact ttatagctga agatgatgaa aacctggaat cagattactt tggcaagctt 301 gaatctaaat tatcagtcat aagaaatttg aatgaccaag ttctcttcat tgaccaagga 361 aatcggcctc tatttgaaga tatgactgat tctgactgta gagataatgc accccggacc 421 atatttatta taagtatgta taaagatagc cagcctagag gtatggctgt aactatctct 481 gtgaagtgtg agaaaatttc aactctctcc tgtgagaaca aaattatttc ctttaaggaa 541 atgaatcctc ctgataacat caaggataca aaaagtgaca tcatattctt tcagagaagt 601 gtcccaggac atgataataa gatgcaattt gaatcttcat catacgaagg atactttcta 661 gcttgtgaaa aagagagaga cctttttaaa ctcattttga aaaaagagga tgaattgggg 721 gatagatcta taatgttcac tgttcaaaac gaagactagc tattaaaatt tcatgccggg 781 cgcagtggct cacgcctgta atcccagccc tttgggaggc tgaggcgggc agatcaccag 841 aggtcaggtg ttcaagacca gcctgaccaa catggtgaaa cctcatctct actaaaaata 901 ctaaaaatta gctgagtgta gtgacgcatg ccctcaatcc cagctactca agaggctgag 961 gcaggagaat cacttgcact ccggaggtag aggttgtggt gagccgagat tgcaccattg 1021 cgctctagcc tgggcaacaa cagcaaaact ccatctcaaa aaataaaata aataaataaa 1081 caaataaaaa attcataatg tg // LOCUS D49958 2383 bp mRNA PRI 06-NOV-1996 DEFINITION Human fetus brain mRNA for membrane glycoprotein M6, complete cds. ACCESSION D49958 NID g1663516 KEYWORDS membrane glycoprotein M6. SOURCE Homo sapiens fetus brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 930) AUTHORS Shimizu,F., Watanabe,T.K., Fujiwara,T., Takahashi,E., Nakamura,Y. and Maekawa,H. TITLE Isolation and mapping of the human glycoprotein M6 gene (GPM6A) to 4q33-->q34 JOURNAL Cytogenet. Cell Genet. 74 (1-2), 138-139 (1996) MEDLINE 97049091 REFERENCE 2 (bases 931 to 2383) AUTHORS Shimizu,F. JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 2383) AUTHORS Shimizu,F. TITLE Direct Submission JOURNAL Submitted (30-MAR-1995) to the DDBJ/EMBL/GenBank databases. Fumio Shimizu, Otsuka Pharmaceutical Co. Ltd., Otska GEN Research; Kawauchi-cho 463-10, Tokushima, Tokushima 771-01, Japan (E-mail:shimizu@otsuka.genome.ad.jp, Tel:0886-65-2888, Fax:0886-37-1035) FEATURES Location/Qualifiers source 1..2383 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="4q33 to 4q34" /dev_stage="fetus" /tissue_type="brain" 5'UTR 1..93 /note="A-stretch/1-20" CDS 94..930 /note="major CNS myelin protein PLP/DM20 homolog" /codon_start=1 /product="membrane glycoprotein M6" /db_xref="PID:d1009326" /db_xref="PID:g1663517" /translation="MEENMEEGQTQKGCFECCIKCLGGIPYASLIATILLYAGVALFC GCGHEALSGTVNILQTYFEMARTAGDTLDVFTMIDIFKYVIYGIAAAFFVYGILLMVE GFFTTGAIKDLYGDFKITTCGRCVSAWFIMLTYLFMLAWLGVTAFTSLPVYMYFNLWT ICRNTTLVEGANLCLDLRQFGIVTIGEEKKICTVSENFLRMCESTELNMTFHLFIVAL AGAGAAVIAMVHYLMVLSANWAYVKDACRMQKYEDIKSKEEQELHDIHSTRSKERLNA YT" 3'UTR 931..2383 polyA_signal 2365..2370 BASE COUNT 690 a 419 c 459 g 815 t ORIGIN 1 aaaaaaaaaa aaaaaaaaaa caccagtttt tccaacatct aattgagctt ttgattaatt 61 ccgtgtacca gattctactg aagaaaggta gccatggaag agaatatgga agagggacag 121 acacaaaaag ggtgttttga atgctgtatc aaatgcctgg ggggcattcc ctatgcctct 181 ctgattgcca ccatcctgct ctatgcgggt gttgccctgt tctgtggctg cggtcatgaa 241 gcgctttctg gaactgtcaa cattctgcaa acctactttg agatggcaag aactgctgga 301 gacacactgg atgtttttac catgattgac atctttaagt atgtgatcta cggcatcgca 361 gctgcgttct ttgtgtatgg cattttgctg atggtggaag gtttcttcac aactggggcc 421 atcaaagatc tctatgggga tttcaaaatc accacttgtg gcagatgtgt gagcgcttgg 481 ttcattatgc tgacatatct tttcatgttg gcctggctgg gagtcacggc tttcacctca 541 ctgccagttt acatgtactt caatctgtgg accatctgcc ggaacaccac attagtggag 601 ggagcaaatc tctgcttgga ccttcgtcag tttggaattg tgacaattgg agaggaaaag 661 aaaatttgta ctgtctctga gaatttcttg aggatgtgcg aatctactga gctgaacatg 721 accttccact tgtttattgt ggcacttgct ggagctgggg cagcagtcat tgctatggtt 781 cactacctta tggttctgtc tgccaactgg gcctatgtga aagacgcctg ccggatgcag 841 aagtatgaag acatcaagtc gaaggaagag caagagcttc atgacatcca ctctactcgc 901 tccaaagagc ggctcaatgc atacacataa atgcatcttc ctgttctttc taccatttga 961 atgcattggt gtttaactaa gggccatcca accatccaac ctttaaaaaa caaaacgaaa 1021 gtgcttctca tcaatgatat gtaaggtgac ttatgaatca cctgagtaca attctttgtt 1081 gtttagcact taaatttccc aatttattaa attgatgtaa atcagatctt ttctacaagc 1141 tcctatccag cctttttttt gaaatttctc aaactcattt actagttctg taaaatcaaa 1201 gatactaaca ttgtcaaatg caaagatttg tttgattttt aaccacttcc catgtgttat 1261 acataacacc ttttgcatta tgtcttatgt tttgaaaaga aaatagcctt ttatactttt 1321 tagttttgat ttcggtaact agtttaacta caggtaacct tcaaaggacc attgtacatt 1381 atgaacaata gatagagatt acatcttgat gactcttgaa atatggaaat tttgtctgaa 1441 gatcagtggc catattactg taggccctgg ttcatgtttt catcaatcta aggtgcaatt 1501 tctaaatttg taagagtagg tttaaaaaaa aaagtgcttc ttatctttgt taacattgta 1561 cttttccttg atgttcttaa aaggtatttc cctcagatta ctcatgttta tgttgtgagc 1621 atgtagaaac agtaatgcta atgcatggct agttgccttt ttaagattgt gacaccaggc 1681 ttacctttta aagtttagta tatagagaca attttaatgg aaataactac tgtagactat 1741 tgaagaatga tctctttgtg atttaagaag tggctggatt ggaactttta atatgctaat 1801 gtggaaaatt aattaccttt atgaaggtgg tttattacaa ataagcacac taacccctcg 1861 gaagttgttt tacctacttt aaaagtttta atggattgca cctctgtaaa ctattcctaa 1921 aatgtgtatg atatatttga aaaggcttcc attaatataa tagctttgct tgcagccttc 1981 caatctatgt tggtttacct gtagtgtttt ataaagtgtg gtcagagggc cctatagaat 2041 gtattgtttg aaagtgtagt gatatatttg tgtttttatt tcaagtaagt cattttaacc 2101 gaatgttcat tcatattcat ttataaaaag tacctgtatc aaaggaattt taacaaagag 2161 caatcagtat tattggacca aatttggtgt ttgttttcac cttgacgctc ttcttttcat 2221 tatttctaat gctacaagaa tgctgtaaag tgtcttctaa aatgatgtag cctgacaaga 2281 catttttttc agtgtataaa actaggtagt attgtgcact gatttgacca ttgtgaaatc 2341 ctttctcagt gtaactgcat ttctaataaa aatttattga gtg // LOCUS D50369 428 bp mRNA PRI 07-NOV-1997 DEFINITION Homo sapiens mRNA for low molecular mass ubiquinone-binding protein, complete cds. ACCESSION D50369 NID g2605589 KEYWORDS . SOURCE Homo sapiens fetus brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 428) AUTHORS Fujiwara,T., Kawai,A., Shimizu,F., Shinomiya,K., Hirano,H., Okuno,S., Ozaki,K., Katagiri,T., Takeda,S., Kuga,Y., Shimada,Y., Nagata,M., Takaichi,A., Watanabe,T., Horie,M., Nakamura,Y., Takahashi,E. and Hirai,Y. TITLE Molecular cloning of a human homologue of bovine low molecular mass ubiquinone-binding protein gene JOURNAL Unpublished (1995) REFERENCE 2 (bases 1 to 428) AUTHORS Watanabe,T. TITLE Direct Submission JOURNAL Submitted (21-APR-1995) to the DDBJ/EMBL/GenBank databases. Takeshi Watanabe, Otsuka Pharmaceutical Co.,Ltd, Otsuka GEN Research Institute; 463-10 Kagasuno Kawauchi-cho, Tokushima, Tokushima 771-01, Japan (Tel:0886-65-2888, Fax:0886-37-1035) FEATURES Location/Qualifiers source 1..428 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="brain" CDS 78..359 /codon_start=1 /product="low molecular mass ubiquinone-binding protein" /db_xref="PID:g2605590" /translation="MGREFGNLTRMRHVISYSLSPFEQRAYPHVFTKGIPNVLRRIRE SFFRVVPQFVVFYLIYTWGTEEFERSKRRIQLPMKMTNEQRIRMTVPCL" polyA_signal 409..414 BASE COUNT 102 a 101 c 115 g 110 t ORIGIN 1 aaaaataagt aggaagtgct caattttaat ttggttcagt tttccggagg gcgagctgag 61 ccctggccgc cgccacaatg ggccgcgagt ttgggaatct gacgcggatg cggcatgtga 121 tcagctacag cttgtcaccg ttcgagcagc gcgcctatcc gcacgtcttc actaaaggaa 181 tccccaatgt tctgcgccgc attcgggagt ctttctttcg cgtggtgccg cagtttgtag 241 tgttttatct tatctacaca tgggggactg aagagttcga gagatccaag aggaggatcc 301 agctgcctat gaaaatgaca aatgagcaac gcatccggat gacggttccc tgtctctgaa 361 agacctttct ctggaagagg agtctgcatt gtagtgtctc aaagacacaa taaacttcct 421 atggtctg // LOCUS D50370 2636 bp mRNA PRI 08-JAN-1997 DEFINITION Human mRNA for nucleosome assembly protein, complete cds. ACCESSION D50370 NID g1769809 KEYWORDS nucleosome assembly protein. SOURCE Homo sapiens fetus brain cDNA to mRNA, clone:NAP1L3. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Watanabe,T.K., Fujiwara,T., Nakamura,Y., Hirai,Y., Maekawa,H. and Takahashi,E. TITLE Cloning, expression pattern and mapping to Xq of NAP1L3, a gene encoding a peptide homologous to human and yeast nucleosome assembly proteins JOURNAL Cytogenet. Cell Genet. 74 (4), 281-285 (1996) MEDLINE 97130622 REFERENCE 2 (bases 1 to 2636) AUTHORS Watanabe,T., Shimizu,F., Nakamura,Y., Kawai,A., Okuno,S., Takaichi,A., Fujiwara,T. and Hirai,Y. TITLE Cloning, expression, and mapping of a novel human NAP (nucleosome assembly protein)-like gene JOURNAL Unpublished (1995) REFERENCE 3 (bases 1 to 2636) AUTHORS Watanabe,T. TITLE Direct Submission JOURNAL Submitted (21-APR-1995) to the DDBJ/EMBL/GenBank databases. Takeshi Watanabe, Otsuka GEN Research Institute,Otsuka Pharmaceutical Co.,Ltd; 463-10 Kagasuno Kawauchi-cho, Tokushima, Tokushima 771-01, Japan (Tel:0886-65-2888, Fax:0886-37-1035) FEATURES Location/Qualifiers source 1..2636 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /clone="NAP1L3" /dev_stage="fetus" /tissue_type="brain" gene 266..1786 /gene="BNAP" CDS 266..1786 /gene="BNAP" /note="putative" /codon_start=1 /product="nucleosome assembly protein" /db_xref="PID:d1009538" /db_xref="PID:g1769810" /translation="MAEADFKMVSEPVAHGVAEEEMASSTSDSGEESDSSSSSSSTSD SSSSSSTSGSSSGSGSSSSSSGSTSSRSRLYRKKRVPEPSRRARRAPLGTNFVDRLPQ AVRNRVQALRNIQDECDKVDTLFLKAIHDLERKYAELNKPLYDRRFQIINAEYEPTEE ECEWNSEDEEFSSDEEVQDNTPSEMPPLEGEEEENPKENPEVKAEEKEVPKEIPEVKD EEKEVAKEIPEVKAEEKADSKDCMEATPEVKEDPKEVPQVKADDKEQPKATEAKARAA VRETHKRVPEERLRDSVDLKRARKGKPKREDPKGIPDYWLIVLKNVDKLGPMIQKYDE PILKFLSDVSLKFSKPGQPVSYTFEFHFLPNPYFRNEVLVKTYIIKAKPDHNDPFFSW GWEIEDCKGCKIDRRRGKDVTVTTTQSRTTATGEIEIQPRVVPNASFFNFFSPPEIPM IGKLEPREDAILDEDFEIGQILHDNVILKSIYYYTGEVNGTYYQFGKHYGNKKYRK" polyA_signal 2606..2611 polyA_signal 2610..2615 BASE COUNT 814 a 521 c 599 g 702 t ORIGIN 1 gattcggctg cggtacatct cggcactcta gctgcagccg ggagaggcct tgccgccacc 61 gctgtcgccc aagcctccac tgccgctgcc acctcagcgc cggcctctgc atccccagct 121 ccagctccgc tctgcgccgc tgctgccatc gccgctgcca cctccgcagc ccgggcctcc 181 gccgccgcca cccaagcatc cgtgagtcat tttctgccca tctctggtcg cgcggtctcc 241 ctggtagagt ttgtaggctt gcaagatggc agaagcagat tttaaaatgg tctcggaacc 301 tgtcgcccat ggggttgccg aagaggagat ggctagctcg actagtgatt ctggggaaga 361 atctgacagc agtagctcta gcagcagcac tagtgacagc agcagcagca gcagcactag 421 tggcagcagc agcggcagcg gcagcagcag cagcagcagc ggcagcacta gcagccgcag 481 ccgcttgtat agaaagaaga gggtacctga gccttccaga agggcgcggc gggccccgtt 541 gggaacaaat ttcgtggata ggctgcctca ggcagttaga aatcgtgtgc aagcgcttag 601 aaacattcaa gatgaatgtg acaaggtaga taccctgttc ttaaaagcaa ttcatgatct 661 tgaaagaaaa tatgctgaac tcaacaagcc tctgtatgat aggcggtttc aaatcatcaa 721 tgcagaatac gagcctacag aagaagaatg tgaatggaat tcagaggatg aggagttcag 781 cagtgatgag gaggtgcagg ataacacccc tagtgaaatg cctcccttag agggtgagga 841 agaagaaaac cctaaagaaa acccagaggt gaaagctgaa gagaaggaag ttcctaaaga 901 aattcctgag gtgaaggatg aagaaaagga agttgctaaa gaaattcctg aggtaaaggc 961 tgaagaaaaa gcagattcta aagactgtat ggaggcaacc cctgaagtaa aagaagatcc 1021 taaagaagtc ccccaggtaa aggcagatga taaagaacag cctaaagcaa cagaggctaa 1081 ggcaagggct gcagtaagag agactcataa aagagttcct gaggaaaggc ttcgggacag 1141 tgtagatctt aaaagagcta ggaagggaaa gcctaaaaga gaagacccta aaggcattcc 1201 tgactattgg ctgattgttt taaagaatgt tgacaagctc gggcctatga ttcagaagta 1261 tgatgagccc attctgaagt tcttgtcgga tgttagcctg aagttctcaa aacctggcca 1321 gcctgtaagt tacacctttg aatttcattt tctacccaac ccatacttca gaaatgaggt 1381 gctggtgaag acatatataa taaaggcaaa accagatcac aatgatccct tcttttcttg 1441 gggatgggaa attgaagatt gcaaaggctg caagatagac cggagaagag gaaaagatgt 1501 tactgtgaca actacccaga gtcgcacaac tgctactgga gaaattgaaa tccagccaag 1561 agtggttcct aatgcatcat tcttcaactt ctttagtcct cctgagattc ctatgattgg 1621 gaagctggaa ccacgagaag atgctatcct ggatgaggac tttgaaattg ggcagatttt 1681 acatgataat gtcatcctga aatcaatcta ttactatact ggagaagtca atggtaccta 1741 ctatcaattt ggcaaacatt atggaaacaa gaaatacaga aaataagtca atctgaaaga 1801 tttttcaaga atcttaaaat ctcaagaagt gaagcagatt catacagcct tgaaaaaagt 1861 aaaaccctga cctgtaacct gaacactatt attccttata gtcaagtttt tgtggtttct 1921 tggtagtcta tattttaaaa atagtcctaa aaagtgtcta agtgccagtt tattctatct 1981 aggctgttgt agtataatat tcttcaaaat atgtaagctg ttgtcaatta tctaaagcat 2041 gttagtttgg tgctacacag tgttgatttt tgtgatgtcc tttggtcatg tttctgttag 2101 actgtagctg tgaaactgtc agaattgtta actgaaacaa atatttgctt gaaaaaaaaa 2161 gttcatgaag taccaatgca agtgttttat tttttttctt ttttccagcc cataagacta 2221 agggtttaaa tctgcttgca ctagctgtgc cttcattagt ttgctataga aatccagtac 2281 ttatagtaaa taaaacagtg tattttgaag tttgactgct tgaaaaagat tagcatacat 2341 ctaatgtgaa aagaccacat ttgattcaac tgagaccttg tgtatgtgac atatagtggc 2401 ctataaattt aatcataatg atgttattgt ttaccactga ggtgttaata taacatagta 2461 tttttgaaaa agtttcttca tcttatattg tgtaattgta aactaaagat accgtgtttt 2521 ctttgtattg tgttctacct tccctttcac tgaaaatgat cacttcattt gatactgttt 2581 ttcatgttct tgtattgcaa cctaaaataa ataaatatta aagtgtgtta tactat // LOCUS D50371 336 bp mRNA PRI 07-NOV-1997 DEFINITION Homo sapiens mRNA for ATP synthase subunit e, complete cds. ACCESSION D50371 NID g2605591 KEYWORDS . SOURCE Homo sapiens fetus brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 336) AUTHORS Fujiwara,T., Kawai,A., Shimizu,F., Shinomiya,K., Hirano,H., Okuno,S., Ozaki,K., Katagiri,T., Takeda,S., Kuga,Y., Shimada,Y., Nagata,M., Takaichi,A., Watanabe,T., Horie,M., Nakamura,Y., Takahashi,E. and Hirai,Y. TITLE Molecular cloning of a human homolog of rat ATP synthase subunit e from human fetal brain JOURNAL Unpublished (1995) REFERENCE 2 (bases 1 to 336) AUTHORS Watanabe,T. TITLE Direct Submission JOURNAL Submitted (21-APR-1995) to the DDBJ/EMBL/GenBank databases. Takeshi Watanabe, Otsuka Pharmaceutical Co.,Ltd, Otsuka GEN Research Institute; 463-10 Kagasuno Kawauchi-cho, Tokushima, Tokushima 771-01, Japan (Tel:0886-65-2888, Fax:0886-37-1035) FEATURES Location/Qualifiers source 1..336 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="brain" CDS 64..273 /codon_start=1 /product="ATP synthase subunit e" /db_xref="PID:g2605592" /translation="MVPPVQVSPLIKLGRYSALFLGVAYGATRYNYLKPRAEEERRIA AEEKKKQDELKRIARELAEDDSILK" polyA_signal 312..317 BASE COUNT 83 a 83 c 102 g 68 t ORIGIN 1 gggcttgtgc ggcatcctgc tccgtctgca ggttgtgctt ccggtgcgga ggtcatggac 61 aaaatggtgc caccggtgca ggtctctccg ctcatcaagc tcggccgcta ctccgccctg 121 ttcctcggtg tggcctacgg agccacgcgc tacaattacc taaaacctcg ggcagaagag 181 gagaggagga tagcagcaga agagaagaag aagcaggatg aactgaaacg gattgccaga 241 gaattggcag aagatgacag catattaaag tgagtgaccc tgcgacccac tctttggacc 301 agcagcggat gaataaagct tcctgtgttg tgtgat // LOCUS D50373 755 bp mRNA PRI 07-NOV-1997 DEFINITION Homo sapiens mRNA for fatty acid binding protein, complete cds. ACCESSION D50373 NID g2605595 KEYWORDS . SOURCE Homo sapiens fetus brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 755) AUTHORS Fujiwara,T., Kawai,A., Shimizu,F., Shinomiya,K., Hirano,H., Okuno,S., Ozaki,K., Katagiri,T., Takeda,S., Kuga,Y., Shimada,Y., Nagata,M., Takaichi,A., Watanabe,T., Horie,M., Nakamura,Y., Takahashi,E. and Hirai,Y. TITLE Molecular cloning of a novel fatty acid binding protein from human fetal brain JOURNAL Unpublished (1995) REFERENCE 2 (bases 1 to 755) AUTHORS Watanabe,T. TITLE Direct Submission JOURNAL Submitted (21-APR-1995) to the DDBJ/EMBL/GenBank databases. Takeshi Watanabe, Otsuka Pharmaceutical Co.,Ltd, Otsuka GEN Research Institute; 463-10 Kagasuno Kawauchi-cho, Tokushima, Tokushima 771-01, Japan (Tel:0886-65-2888, Fax:0886-37-1035) FEATURES Location/Qualifiers source 1..755 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="brain" CDS 53..451 /codon_start=1 /product="fatty acid binding protein" /db_xref="PID:g2605596" /translation="MVEAFCATWKLTNSQNFDEYMKALGVGFATRQVGNVTKPTVIIS QEGDKVVIRTLSTFKDTEISFQLGEEFDETTADDRNCKSVVSLDGDKLVHIQKWDGKE TNFVREIKDGKMVMTLTFGDVVAVRHYEKA" polyA_signal 653..658 /note="suboptimal polyA_signals" polyA_signal 738..742 /note="suboptimal polyA_signals" BASE COUNT 228 a 131 c 174 g 222 t ORIGIN 1 attagaccag aagatccccc gctcctgtct ctaaagaggg gaaagggcaa ggatggtgga 61 ggctttctgt gctacctgga agctgaccaa cagtcagaac tttgatgagt acatgaaggc 121 tctaggcgtg ggctttgcca ctaggcaggt gggaaatgtg accaaaccaa cggtaattat 181 cagtcaagaa ggagacaaag tggtcatcag gactctcagc acattcaagg acacggagat 241 tagtttccag ctgggagaag agtttgatga aaccactgca gatgatagaa actgtaagtc 301 tgttgttagc ctggatggag acaaacttgt tcacatacag aaatgggatg gcaaagaaac 361 aaattttgta agagaaatta aggatggcaa aatggttatg acccttactt ttggtgatgt 421 ggttgctgtt cgccactatg agaaggcata aaaatgttcc tggtcggggc ttggaagagc 481 tcttcagttt ttctgtttcc tcaagtctca gtgctatcct attacaacat ggctgatcat 541 taattagaag gttatccttg gtgtggaggt ggaaaatggt gatttaaaaa cttgttactc 601 caagcaactt gcccaatttt aatctgaaaa tttatcatgt tttataattt gaattaaagt 661 tttgtccccc cccccctttt ttttataaac aagtgaatac attttataat ttcttttgga 721 atgtaaatca aatttgaatt aaaatcttac acgtg // LOCUS D50375 772 bp mRNA PRI 07-NOV-1997 DEFINITION Homo sapiens mRNA for silencer element, complete cds. ACCESSION D50375 NID g2605599 KEYWORDS silencer element. SOURCE Homo sapiens fetus brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 772) AUTHORS Fujiwara,T., Kawai,A., Shimizu,F., Shinomiya,K., Hirano,H., Okuno,S., Ozaki,K., Katagiri,T., Takeda,S., Kuga,Y., Shimada,Y., Nagata,M., Takaichi,A., Watanabe,T., Horie,M., Nakamura,Y., Takahashi,E. and Hirai,Y. TITLE Molecular cloning of a human homologue of chicken silencer element (SCG10) gene JOURNAL Unpublished (1995) REFERENCE 2 (bases 1 to 772) AUTHORS Watanabe,T. TITLE Direct Submission JOURNAL Submitted (21-APR-1995) to the DDBJ/EMBL/GenBank databases. Takeshi Watanabe, Otsuka Pharmaceutical Co.,Ltd, Otsuka GEN Research Institute; 463-10 Kagasuno Kawauchi-cho, Tokushima, Tokushima 771-01, Japan (Tel:0886-65-2888, Fax:0886-37-1035) FEATURES Location/Qualifiers source 1..772 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="brain" CDS 118..657 /note="SCG10" /codon_start=1 /product="silencer element" /db_xref="PID:g2605600" /translation="MAKTAMAYKEKMKELSMLSLICSCFYPEPRNINIYTYDDMEVKQ INKRASGQAFELILKPPSPISEAPRTLASPKKKDLSLEEIQKKLEAAEERRKSQEAQV LKQLAEKREHEREVLQKALGENNNFSKMAEEKLILKMEQIKENREANLAAIIERLQEK ERHAAEVRRNKELQVELSG" BASE COUNT 230 a 186 c 192 g 164 t ORIGIN 1 ctagcacggt cccactctgc agactcagtg ccttattcag tcttctctct cgctctctcc 61 gctgctgtag ccggaccctt tgccttcgcc actgctcagc gtctgcacat ccctacaatg 121 gctaaaacag caatggccta caaggaaaaa atgaaggagc tgtccatgct gtcactgatc 181 tgctcttgct tttacccgga acctcgcaac atcaacatct atacttacga tgatatggaa 241 gtgaagcaaa tcaacaaacg tgcctctggc caggcttttg agctgatctt gaagccacca 301 tctcctatct cagaagcccc acgaacttta gcttctccaa agaagaaaga cctgtccctg 361 gaggagatcc agaagaaact ggaggctgca gaggaaagaa gaaagtctca ggaggcccag 421 gtgctgaaac aattggcaga gaagagggaa cacgagcgag aagtccttca gaaggctttg 481 ggggagaaca acaacttcag caagatggcg gaggaaaagc tgatcctgaa aatggaacaa 541 attaaggaaa accgtgaggc taatctagct gctattattg aacgtctgca ggaaaaggag 601 aggcatgctg cggaggtgcg caggaacaag gaactccagg ttgaactgtc tggctgaagc 661 aagggagggt ctggcacgcc ccaccaatag taaatccccc tgcctatatt ataatggatc 721 atgcgatatc aggatgggga atgtatgaca tggtttaaaa agaactcatt at // LOCUS D50419 3734 bp mRNA PRI 12-NOV-1997 DEFINITION Homo sapiens mRNA for OTK18, complete cds. ACCESSION D50419 NID g2618575 KEYWORDS . SOURCE Homo sapiens embryo brain cDNA to mRNA, clone:OTK18. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Saito,H., Fujiwara,T., Takahashi,E.I., Shin,S., Okui,K. and Nakamura,Y. TITLE Isolation and mapping of a novel human gene encoding a protein containing zinc-finger structures JOURNAL Genomics 31 (3), 376-379 (1996) MEDLINE 96435435 REFERENCE 2 (bases 1 to 3734) AUTHORS Nakamura,Y. TITLE Direct Submission JOURNAL Submitted (27-APR-1995) to the DDBJ/EMBL/GenBank databases. Yusuke Nakamura, Institute of Medical Science, The University of Tokyo, Laboratory of Molecular Medicine, Human Genome Center; 4-6-1 Shirokanedai, Minato-ku, Tokyo 108, Japan (E-mail:y-daigo@ims.u-tokyo.ac.jp, Tel:03-5449-5372, Fax:03-5449-5433) FEATURES Location/Qualifiers source 1..3734 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="OTK18" /dev_stage="embryo" /map="19q13.4" /tissue_type="brain" CDS 346..2481 /note="zinc finger protein" /codon_start=1 /product="OTK18" /db_xref="PID:g2618576" /translation="MPADVNLSQKPQVLGPEKQDGSCEASVSFEDVTVDFSREEWQQL DPAQRCLYRDVMLELYSHLFAVGYHIPNPEVIFRMLKEKEPRVEEAEVSHQRCQEREF GLEIPQKEISKKASFQKDMVGEFTRDGSWCSILEELRLDADRTKKDEQNQIQPMSHSA FFNKKTLNTESNCEYKDPGKMIRTRPHLASSQKQPQKCCLFTESLKLNLEVNGQNESN DTEQLDDVVGSGQLFSHSSSDACSKNIHTGETFCKGNQCRKVCGHKQSLKQHQIHTQK KPDGCSECGGSFTQKSHLFAQQRIHSVGNLHECGKCGKAFMPQLKLSVYLTDHTGDIP CICKECGKVFIQRSELLTHQKTHTRKKPYKCHDCGKAFFQMLSLFRHQRTHSREKLYE CSECGKGFSQNSTLIIHQKIHTGERQYACSECGKAFTQKSTLSLHQRIHSGQKSYVCI ECGQAFIQKAHLIVHQRSHTGEKPYQCHNCGKSFISKSQLDIHHRIHTGEKPYECSDC GKTFTQKSHLNIHQKIHTGERHHVCSECGKAFNQKSILSMHQRIHTGEKPYKCSECGK AFTSKSQFKEHQRIHTGEKPYVCTECGKAFNGRSNFHKHQITHTRERPFVCYKCGKAF VQKSELITHQRTHMGEKPYECLDCGKSFSKKPQLKVHQRIHTGERPYVCSECGKAFNN RSNFNKHQTTHTRDKSYKCSYSVKGFTKQ" polyA_signal 3714..3719 polyA_site 3734 BASE COUNT 1179 a 734 c 823 g 998 t ORIGIN 1 gctaagccta tgtcgcttac tggacgctga agtgattggg aatattagca gtgggggttc 61 tgtagggtca ggaaggggcg gctggctttg ggggagtgat gaggggcttg ttgggggtgg 121 gggtgcgtga taaagggatt tctcggctga agacgaggct gtgaggcttc tgcagaaccc 181 ccaggtcagg ccacatcatt gaggctgcag gatctctctt catagcccag tacgactctc 241 cgccgtgtcc ctggttggaa aatccaaaca cctatccagc ttctggctcc tgggaaaagt 301 ggagttgtca gcaagagaga ccgagagtag aagcccagag tggagatgcc tgctgatgtg 361 aatttatccc agaagcctca ggtcctgggt ccagagaagc aggatggatc ttgcgaggca 421 tcagtgtcat ttgaggacgt gaccgtggac ttcagcaggg aggagtggca gcaactggac 481 cctgcccaga gatgcctgta ccgggatgtg atgctggagc tctatagcca tctcttcgca 541 gtggggtatc acattcccaa cccagaggtc atcttcagaa tgctaaaaga aaaggagccg 601 cgtgtggagg aggctgaagt ctcacatcag aggtgtcaag aaagggagtt tgggcttgaa 661 atcccacaaa aggagatttc taagaaagct tcatttcaaa aggatatggt aggtgagttc 721 acaagagatg gttcatggtg ttccatttta gaagaactga ggctggatgc tgaccgcaca 781 aagaaagatg agcaaaatca aattcaaccc atgagtcaca gtgctttctt caacaagaaa 841 acattgaaca cagaaagcaa ttgtgaatat aaggaccctg ggaaaatgat tcgcacgagg 901 ccccaccttg cttcttcaca gaaacaacct cagaaatgtt gcttatttac agaaagtttg 961 aagctgaacc tagaagtgaa cggtcagaat gaaagcaatg acacagaaca gcttgatgac 1021 gttgttgggt ctggtcagct attcagccat agctcttctg atgcctgcag caagaatatt 1081 catacaggag agacattttg caaaggtaac cagtgtagaa aagtctgtgg ccataaacag 1141 tcactcaagc aacatcaaat tcatactcag aagaaaccag atggatgttc tgaatgtggg 1201 gggagcttca cccagaagtc acacctcttt gcccaacaga gaattcatag tgtaggaaac 1261 ctccatgaat gtggcaaatg tggaaaagcc ttcatgccac aactaaaact cagtgtatat 1321 ctgacagatc atacaggtga tataccctgt atatgcaagg aatgtgggaa ggtctttatt 1381 cagagatcag aattgcttac gcaccagaaa acacacacta gaaagaagcc ctataaatgc 1441 catgactgtg gaaaagcctt tttccagatg ttatctctct tcagacatca gagaactcac 1501 agtagagaaa aactctatga atgcagtgaa tgtggcaaag gcttctccca aaactcaacc 1561 ctcattatac atcagaaaat tcatactggt gagagacagt atgcatgcag tgaatgtggg 1621 aaagccttta cccagaagtc aacactcagc ttgcaccaga gaatccactc agggcagaag 1681 tcctatgtgt gtatcgaatg cgggcaggcc ttcatccaga aggcacacct gattgtccat 1741 caaagaagcc acacaggaga aaaaccttat cagtgccaca actgtgggaa atccttcatt 1801 tccaagtcac agcttgatat acatcatcga attcatacag gggagaaacc ttatgaatgc 1861 agtgactgtg gaaaaacctt cacccaaaag tcacacctga atatacacca gaaaattcat 1921 actggagaaa gacaccatgt atgcagtgaa tgcgggaaag ccttcaacca gaagtcaata 1981 ctcagcatgc atcagagaat tcacaccgga gagaagcctt acaaatgcag tgaatgtggg 2041 aaagccttca cttctaagtc tcaattcaaa gagcatcagc gaattcacac gggtgagaaa 2101 ccctatgtgt gcactgaatg tgggaaggcc ttcaacggca ggtcaaattt ccataaacat 2161 caaataactc acactagaga gaggcctttt gtctgttaca aatgtgggaa ggcttttgtc 2221 cagaaatcag agttgattac ccatcaaaga actcacatgg gagagaaacc ctatgaatgc 2281 cttgactgtg ggaaatcgtt cagtaagaaa ccacaactca aggtgcatca gcgaattcac 2341 acgggagaaa gaccttatgt gtgttctgaa tgtggaaagg ccttcaacaa caggtcaaac 2401 ttcaataaac accaaacaac tcataccaga gacaaatctt acaaatgcag ttattctgtg 2461 aaaggcttta ccaagcaatg aattcctagt gcatcagcat attcataaat gaaatatact 2521 ccgagtttct tgaagaagag aacatcttct cagaatcagg tctaattata tgttattgaa 2581 ttcatgcttc agaaaaactc tagggatgca ctgcatgtgt gaacacatga taaaaaagtc 2641 atgctttatt ttagtgaggg caattacaga gaaaagagta agcagaaatg tccttctgag 2701 tactggcctc attaaggatt ataaattttc tccccgggaa gaaaccctga ctaacgcatt 2761 gagaaaagcc tttctgtaaa gaatggtaca agacaggttg ttactcgatt atttatagta 2821 aaatatgtgg gaaattatat caatgataac cctgtttatt gtgggatatc aatattttta 2881 aagtgccaac acagtcatga taggacaata ttttatgtgt gtgtgtgcgc cttatgtata 2941 taagcatata tataatatat aagcatatta ttatatacag gttgagtatc ccttctccaa 3001 aatgcctggg atcagaagca ttttggattt cagatactta cagattttgg aatatttgca 3061 ttatatttat tggttgagca tccctaatct gaaaatccaa gattaaatgc tccaattagc 3121 atttcctttg agcgtcatgt tagagttcaa aaagtttcag attttgggtt ttcagattag 3181 gaatacccaa cctgtatgta cgtatatttc tgtatctatg tatgtatata tatgcatatg 3241 cagacatatg tatatggtct ggtcagcata tgtgtatgta tgcgtatgta tgtatgtatg 3301 tatgccctca gtgcagtggg gtttgctgca gaattcactg catagcagga gatgtaagca 3361 gatgagttat tttttaagag aatctaatct aattgttttt ataaaaatta ttccctattg 3421 aatatttata taatgaggtt gtatcaacaa tgattaactc ctttattata catacacatg 3481 aatgtgcatt tttggtaaat gcataaatga gattctataa tgtttactga tctttatatt 3541 acagattttc tcttctttta ggattagctc agcttgcccc ccctttccat ctccaccatc 3601 tatagtgagc ctctccataa ttagtgccaa ccattagtct cgttcatatt tttacaccag 3661 gagtcaacaa actgtgccat tggccaaata tggcctccca actgtttttt taaaataaag 3721 ttttattgga acac // LOCUS D50420 1475 bp mRNA PRI 12-NOV-1997 DEFINITION Homo sapiens mRNA for OTK27, complete cds. ACCESSION D50420 NID g2618577 KEYWORDS . SOURCE Homo sapiens embryo brain cDNA to mRNA, clone:39H11. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Saito,H., Fujiwara,T., Shin,S., Okui,K. and Nakamura,Y. TITLE Cloning and mapping of a human novel cDNA (NHP2L1) that encodes a protein highly homologous to yeast nuclear protein NHP2 JOURNAL Cytogenet. Cell Genet. 72 (2-3), 191-193 (1996) MEDLINE 97133376 REFERENCE 2 (bases 1 to 1475) AUTHORS Nakamura,Y. TITLE Direct Submission JOURNAL Submitted (27-APR-1995) to the DDBJ/EMBL/GenBank databases. Yusuke Nakamura, Institute of Medical Science, The University of Tokyo, Laboratory of Molecular Medicine, Human Genome Center; 4-6-1 Shirokanedai, Minato-ku, Tokyo 108, Japan (E-mail:y-daigo@ims.u-tokyo.ac.jp, Tel:03-5449-5372, Fax:03-5449-5433) FEATURES Location/Qualifiers source 1..1475 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="39H11" /dev_stage="embryo" /tissue_type="brain" CDS 95..481 /codon_start=1 /product="OTK27" /db_xref="PID:g2618578" /translation="MTEADVNPKAYPLADAHLTKKLLDLVQQSCNYKQLRKGANEATK TLNRGISEFIVMAADAEPLEIILHLPLLCEDKNVPYVFVRSKQALGRACGVSRPVIAC SVTIKEGSQLKQQIQSIQQSIERLLV" polyA_signal 583..588 polyA_signal 1456..1461 polyA_site 1475 /note="20 a nucleotides" BASE COUNT 321 a 363 c 394 g 397 t ORIGIN 1 atccgtgtcc ttgcggtgct gggcagcaga ccgtccaaac cgacacgcgt ggtatcctcg 61 cggtgtccgg caagagacta ccaagacaga cgctatgact gaggctgatg tgaatccaaa 121 ggcctatccc cttgccgatg cccacctcac caagaagcta ctggacctcg ttcagcagtc 181 atgtaactat aagcagcttc ggaaaggagc caatgaggcc accaaaaccc tcaacagggg 241 catctctgag ttcatcgtga tggctgcaga cgccgagcca ctggagatca ttctgcacct 301 gccgctgctg tgtgaagaca agaatgtgcc ctacgtgttt gtgcgctcca agcaggccct 361 ggggagagcc tgtggggtct ccaggcctgt catcgcctgt tctgtcacca tcaaagaagg 421 ctcgcagctg aaacagcaga tccaatccat tcagcagtcc attgaaaggc tcttagtcta 481 aacctgtggc ctctgccacg tgctccctgc cagcttcccc cctgaggttg tgtatcatat 541 tatctgtgtt agcatgtagt attttcagct actctctatt gttataaaat gtagtactaa 601 atctggtttc tggatttttg tgttgttttt gttctgtttt acagggttgc tatccccctt 661 cctttcctcc ctccctctgc catccttcat ccttttatcc tccctttttg gaacaagtgt 721 tcagagcaga cagaagcagg gtggtggcac cgttgaaagg cagaaagagc caggagaaag 781 ctgatggagc caggacagag atctggttcc agctttcagc cactagcttc ctgttgtgtg 841 cggggtgtgg tggaattaaa cagcattcat tgtgtgtccc tgtgcctggc acacagaatc 901 attcatacgt gttcaagtga tcaaggggtt tcatttgctc ttgggggatt aggtatcatt 961 tggggaggaa gcatgtgttc tgtgaggttg ttcggctatg tccaagtgtc gtttactaat 1021 gtacccctgc tgtttgcttt tggtaatgtg atgttgatgt tctcccccta cccacaacca 1081 tgcccttgag ggtagcaggg cagcagcata ccaaagagat gtgctgcagg actccggagg 1141 cagcctgggt gggtgagcca tggggcagtt gacctgggtc ttgaaagagt cgggagtgac 1201 aagctcagag agcatgaact gatgctggca tgaaggattc caggaagatc atggagacct 1261 ggctggtagc tgtaacagag atggtggagt ccaaggaaac agcctgtctc tggtgaatgg 1321 gactttcttt ggtggacact tggcaccagc tctgagagcc cttcccctgt gtcctgccac 1381 catgtgggtc agatgtactc tctgtcacat gaggagagtg ctagttcatg tgttctccat 1441 tcttgtgagc atcctaataa atctgttcca ttttg // LOCUS D50579 1846 bp mRNA PRI 22-NOV-1997 DEFINITION Homo sapiens mRNA for carboxylesterase, complete cds. ACCESSION D50579 NID g2641989 KEYWORDS carboxylesterase precursor; carboxylesterase. SOURCE Homo sapiens liver cDNA to mRNA, clone_lib:lambda ZAP clone:HuCE21. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Sone,T. and Wang,C.Y. TITLE Microsomal amidases and carboxylesterases JOURNAL Comprehensive Toxicology 3, 265-281 (1997) REFERENCE 2 (sites) AUTHORS Sone,T., Ishida,Y., Takabatake,E., Wang,C., Pohl,L. and Isobe,M. TITLE Molecular cloning and expression of a human liver cDNA encoding a novel carboxylesterase JOURNAL Unpublished (1995) REFERENCE 3 (bases 1 to 1846) AUTHORS Sone,T. TITLE Direct Submission JOURNAL Submitted (15-MAY-1995) to the DDBJ/EMBL/GenBank databases. Tomomichi Sone, Setsunan University, Faculty of Pharmaceutical Sciences; 45-1 Nagaotoge-cho, Hirakata, Osaka 573-01, Japan (E-mail:sone@pharm.setsunan.ac.jp, Tel:0720-66-3107, Fax:0720-66-3105) FEATURES Location/Qualifiers source 1..1846 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HuCE21" /clone_lib="lambda ZAP" /tissue_type="liver" CDS 68..1747 /standard_name="carboxylesterase" /EC_number="3.1.1.1" /codon_start=1 /product="carboxylesterase precursor" /db_xref="PID:d1024485" /db_xref="PID:g2641990" /translation="MRLHRLRARLSAVACGLLLLLVRGQGQDSASPIRTTHTGQVLGS LVHVKGANAGVQTFLGIPFAKPPLGPLRFAPPEPPESWSGVRDGTTHPAMCLQDLTAV ESEFLSQFNMTFPSDSMSEDCLYLSIYTPAHSHEGSNLPVMVWIHGGALVFGMASLYD GSMLAALENVVVVIIQYRLGVLGFFSTGDKHATGNWGYLDQVAALRWVQQNIAHFGGN PDRVTIFGESAGGTSVSSLVVSPISQGLFHGAIMESGVALLPGLIASSADVISTVVAN LSACDQVDSEALVGCLRGKSKEEILAINKPFKMIPGVVDGVFLPRHPQELLASADFQP VPSIVGVNNNEFGWLIPKVMRIYDTQKEMDREASQAALQKMLTLLMLPPTFGDLLREE YIGDNGDPQTLQAQFQEMMADSMFVIPALQVAHFQCSRAPVYFYEFQHQPSWLKNIRP PHMKADHGDELPFVFRSFFGGNYIKFTEEEEQLSRKMMKYWANFARNGNPNGEGLPHW PLFDQEEQYLQLNLQPAVGRALKAHRLQFWKKALPQKIQELEEPEERHTEL" sig_peptide 68..148 mat_peptide 149..1744 /standard_name="carboxylesterase" /EC_number="3.1.1.1" /product="carboxylesterase" polyA_site 1846 BASE COUNT 378 a 526 c 547 g 395 t ORIGIN 1 cccggggcag cctctgggtg aacagcagcg tgtccgccgg cagcgaaccg agaccagcga 61 gccgaccatg cggctgcaca gacttcgtgc gcggctgagc gcggtggcct gtgggcttct 121 gctgcttctt gtccggggcc agggccagga ctcagccagt cccatccgga ccacacacac 181 ggggcaggtg ctggggagtc ttgtccatgt gaagggcgcc aatgccgggg tccaaacctt 241 cctgggaatt ccatttgcca agccacctct aggtccgctg cgatttgcac cccctgagcc 301 ccctgaatct tggagtggtg tgagggatgg aaccacccat ccggccatgt gtctacagga 361 cctcaccgca gtggagtcag agtttcttag ccagttcaac atgaccttcc cttccgactc 421 catgtctgag gactgcctgt acctcagcat ctacacgccg gcccatagcc atgaaggctc 481 taacctgccg gtgatggtgt ggatccacgg tggtgcgctt gtttttggca tggcttcctt 541 gtatgatggt tccatgctgg ctgccttgga gaacgtggtg gtggtcatca tccagtaccg 601 cctgggtgtc ctgggcttct tcagcactgg agacaagcac gcaaccggca actggggcta 661 cctggaccaa gtggctgcac tacgctgggt ccagcagaat atcgcccact ttggaggcaa 721 ccctgaccgt gtcaccattt ttggcgagtc tgcgggtggc acgagtgtgt cttcgcttgt 781 tgtgtccccc atatcccaag gactcttcca cggagccatc atggagagtg gcgtggccct 841 cctgcccggc ctcattgcca gctcagctga tgtcatctcc acggtggtgg ccaacctgtc 901 tgcctgtgac caagttgact ctgaggccct ggtgggctgc ctgcggggca agagtaaaga 961 ggagattctt gcaattaaca agcctttcaa gatgatcccc ggagtggtgg atggggtctt 1021 cctgcccagg cacccccagg agctgctggc ctctgccgac tttcagcctg tccctagcat 1081 tgttggtgtc aacaacaatg aattcggctg gctcatcccc aaggtcatga ggatctatga 1141 tacccagaag gaaatggaca gagaggcctc ccaggctgct ctgcagaaaa tgttaacgct 1201 gctgatgttg cctcctacat ttggtgacct gctgagggag gagtacattg gggacaatgg 1261 ggatccccag accctccaag cgcagttcca ggagatgatg gcggactcca tgtttgtgat 1321 ccctgcactc caagtagcac attttcagtg ttcccgggcc cctgtgtact tctacgagtt 1381 ccagcatcag cccagctggc tcaagaacat caggccaccg cacatgaagg cagaccatgg 1441 tgatgagctt ccttttgttt tcagaagttt ctttgggggc aactacatta aattcactga 1501 ggaagaggag cagctaagca ggaagatgat gaagtactgg gccaactttg cgagaaatgg 1561 gaacccgaat ggcgagggtc tgccacactg gccgctgttc gaccaggagg agcaatacct 1621 gcagctgaac ctacagcctg cggtgggccg ggctctgaag gcccacaggc tccagttctg 1681 gaagaaggcg ctgccccaaa agatccagga gctcgaggag cctgaagaga gacacacaga 1741 gctgtagctc cctgtgccgg ggaggagggg gtgggttcgc tgacaggcga gggtcagcct 1801 gctgtgccca cacacaccca ctaaggagaa agaagttgat tccttc // LOCUS D50645 1085 bp mRNA PRI 24-DEC-1996 DEFINITION Human mRNA for SDF2, complete cds. ACCESSION D50645 NID g1741867 KEYWORDS SDF2. SOURCE Homo sapiens cell_line:human glioblastoma cell line, T98G cDNA to mRNA, clone_lib:phage (1gt22A) library, T98G cDNA library 1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Hamada,T., Tashiro,K., Tada,H., Inazawa,J., Shirozu,M., Shibahara,K., Nakamura,T., Martina,N., Nakano,T. and Honjo,T. TITLE Isolation and characterization of a novel secretory protein, stromal cell-derived factor-2 (SDF-2) using the signal sequence trap method JOURNAL Gene 176 (1-2), 211-214 (1996) MEDLINE 97075932 REFERENCE 2 (bases 1 to 1085) AUTHORS Hamada,T., Nakano,T., Inazawa,J., Tashiro,K., Shirozu,M., Tada,H., Nakamura,T. and Honjo,T. TITLE Isolation and characterization of a novel soluble factor gene: A contributional trial for resolution of intercellular signal transduction JOURNAL Unpublished (1995) REFERENCE 3 (bases 1 to 1085) AUTHORS Hamada,T. TITLE Direct Submission JOURNAL Submitted (23-MAY-1995) to the DDBJ/EMBL/GenBank databases. Tsuneyoshi Hamada, Kyoto University, Faculty of Medicine, Department of Medical Chemistry; Yoshida, Sakyo-ku, Kyoto, Kyoto 606, Japan (E-mail:kondo@virus1.virus.kyoto-u.ac.jp, Tel:81-75-753-4377, Fax:81-75-753-4388) FEATURES Location/Qualifiers source 1..1085 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="human glioblastoma cell line, T98G" /chromosome="17" /clone_lib="phage (1gt22A) library, T98G cDNA library 1" /map="17q11.2" CDS 40..675 /note="stroma cell-derived factor-2" /codon_start=1 /product="SDF2" /db_xref="PID:d1009953" /db_xref="PID:g1741868" /translation="MAVVPLLLLGGLWSAVGASSLGVVTCGSVVKLLNTRHNVRLHSH DVRYGSSSGQQSVTGVTSVDDSNSYWRIRRKSATVCERGTPIKCGQPIRLTHVNTGRN LHSHHFTSPLSGNQEVTAFGEEGEGDYLDDWTVLCNGPYWVRDGEVRFKHSSTEVLLS VTGEQYGRPISGQKEVHGMAQPSQNNYWKAMEGIFMKPSELLKAEAHHAEL" BASE COUNT 266 a 249 c 295 g 275 t ORIGIN 1 gttttcttcg aagatttggg gctccgcgat acagttagga tggctgtagt acctctgctg 61 ttgttggggg gtttgtggag cgctgtggga gcgtccagcc tgggtgtcgt tacttgcggc 121 tccgtggtga agctactcaa tacgcgccac aacgtccgac tgcactcaca cgacgtgcgc 181 tatgggtcaa gtagtgggca gcagtcagtg acaggtgtaa cctctgtgga tgacagcaac 241 agttactgga ggatacggcg gaagagtgcc acagtgtgtg agaggggaac ccccatcaag 301 tgtggccagc ccatccggct gacacatgtc aacactggcc gaaacctcca tagtcaccac 361 ttcacttcac ctctttctgg aaaccaggaa gtgactgctt ttggtgaaga aggtgaaggt 421 gattatctgg atgactggac agtgctctgt aatggaccct actgggtgag agatggtgag 481 gtgcggttca aacactcttc cactgaggta ctgctgtctg tcacaggaga acaatatggt 541 cgacctatca gtgggcaaaa agaggtgcat ggcatggccc agccaagtca gaacaactac 601 tggaaagcca tggaaggcat cttcatgaag cccagtgagt tgttgaaggc agaagcccac 661 catgcagagc tgtgaatctt gaggctctga ggcactgtta acgcacaatg ttcacagaca 721 tctgttgctg cctcaccttg ggatccctgc cacaagttcc ttgggcagtg gccatgtcac 781 cattgagatg aagatataca acagagaaat agtggctgtg tttgggaagc ttcagccctg 841 cacattttga actagtcact ctcccagact tggcggtggg tcagttcttt cctgagtaga 901 ggacttgctg gtaaaagggg cagatgcttt ttattagtac tgattaaacc acactgaggg 961 aaacatccct cttagctggg aaactgttta ctcttcagga gcttggcatc atggactgtt 1021 aatgtatgtg attttccccc tattttctct cccccacaat gataaaaaca ataattttat 1081 tatga // LOCUS D50663 704 bp mRNA PRI 10-JAN-1997 DEFINITION Human mRNA for TCTEL1 gene, complete cds. ACCESSION D50663 NID g1747307 KEYWORDS TCTEL1. SOURCE Homo sapiens Fetus Brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Watanabe,T.K., Fujiwara,T., Shimizu,F., Okuno,S., Suzuki,M., Takahashi,E., Nakamura,Y. and Hirai,Y. TITLE Cloning, expression, and mapping of TCTEL1, a putative human homologue of murine Tcte1, to 6q JOURNAL Cytogenet. Cell Genet. 73 (1-2), 153-156 (1996) MEDLINE 96244612 REFERENCE 2 (bases 1 to 704) AUTHORS Watanabe,T., Fujiwara,T., Shimizu,F., Okuno,S., Takahashi,E., Nakamura,Y. and Hirai,Y. TITLE Cloning, expression, and mapping of a putative human homologue of murine tctex-1 gene JOURNAL Unpublished (1995) REFERENCE 3 (bases 1 to 704) AUTHORS Watanabe,T. TITLE Direct Submission JOURNAL Submitted (24-MAY-1995) to the DDBJ/EMBL/GenBank databases. Takeshi Watanabe, Otsuka GEN Research Institute,Otsuka Pharmaceutical Co.,Ltd; 463-10 Kagasuno Kawauchi-cho, Tokushima, Tokushima 771-01, Japan (Tel:0886-65-2888, Fax:0886-37-1035) FEATURES Location/Qualifiers source 1..704 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="Fetus" /tissue_type="Brain" gene 13..354 /gene="TCTEL1" CDS 13..354 /gene="TCTEL1" /note="similar to murine Tcte1 gene product" /codon_start=1 /db_xref="PID:d1009959" /db_xref="PID:g1747308" /translation="MEDYQAAEETAFVVDEVSNIVKEAIESAIGGNAYQHSKVNQWTT NVVEQTLSQLTKLGKPFKYIVTCVIMQKNGAGLHTASSCFWDSSTDGSCTVRWENKTM YCIVSAFGLSI" polyA_signal 687..692 BASE COUNT 204 a 152 c 148 g 200 t ORIGIN 1 gccggaggaa agatggaaga ctaccaggct gcggaggaga ctgcttttgt tgttgatgaa 61 gtgagcaaca ttgtaaaaga ggctatagaa agcgcaattg gtggtaacgc ttatcaacac 121 agcaaagtga accagtggac cacaaatgta gtagaacaaa ctttaagcca actcaccaag 181 ctgggaaaac catttaaata catcgtgacc tgtgtaatta tgcagaagaa tggagctgga 241 ttacacacag caagttcctg cttctgggac agctctactg acgggagctg cactgtgcga 301 tgggagaata agaccatgta ctgcatcgtc agtgccttcg gactgtctat ttgacctgca 361 gtccagccta tggcctttct ccttttgtct ctagttcatc ctctaaccac cagccatgaa 421 ttcagtgaac tcttttctca ttctctttgt tttgtggcac tttcacaatg tagaggaaaa 481 aaccaaatga ccgcactgtg atgtgaatgg caccgaagtc agatgagtat ccctgtaggt 541 cacctgcagc ctgcgttgcc acttgtctta actctgaata tttcatttca aaggtgctaa 601 aatctgaaat ctgctagtgt gaaacttgct ctactctctg aaatgattca aatacactaa 661 ttttccatac tttatacttt tgttagaata aattattcaa atct // LOCUS D50911 3827 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0121 gene, complete cds. ACCESSION D50911 NID g1469164 KEYWORDS KIAA0121. SOURCE Homo sapiens male myeloblast cell_line:KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3827) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (06-JUN-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 3827) AUTHORS Nomura,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. IV. The coding sequences of 40 new genes (KIAA0121-KIAA0160) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (4), 167-174 (1995) MEDLINE 96127530 FEATURES Location/Qualifiers source 1..3827 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..410 gene 411..1043 /gene="KIAA0121" CDS 411..1043 /gene="KIAA0121" /note="The KIAA0121 gene product is novel." /citation=[3] /codon_start=1 /db_xref="PID:d1010112" /db_xref="PID:g1469165" /translation="MLFMKMDLLNYQYLDKMNNNIGILCYEGEAALRGEPRIQTLPVA SALSSHRTGPPPISPSKRKFSMEPGDEDLDCDNDHVSKMSRIFNPHLNKTANGDCRRD PRERSRSPIERAVAPTVSLHGSHLYTSLPSLGLEQPLALTKNSLDASRPAGLSPTLTP GERQQNRPSVITCASAGARNCNLSHCPIAHSGCAAPGPASYRRPPSATCV" 3'UTR 1044..3827 BASE COUNT 870 a 958 c 963 g 1036 t ORIGIN 1 gtcatgccaa gcatagccct catgtcagcg ctgccggctt gcagcgggct gtgagagggg 61 ccggcgccgc tttgtcctag gaaacgggct gcgcgtttct ctttttcact cttttccatt 121 tccaggaagg acttgtaagg acttctgaaa cgctgttttc atactcgatc ggggatacag 181 tacatacacc gtctaccagt aagcccttga agggtttcgt gtgagctcga tttttttgtg 241 cctgattttt ttttttttaa acttttgcat actttgtttt gatagtctga ggctgggcct 301 ctgcctttgt gaagttgaag agccaggagc tactcagcaa caattgattt ttgaaactta 361 actcttttgg ggcaaaagca aagagctggt tttctttgct agcccaataa atgctattta 421 tgaagatgga cctgttgaac tatcagtact tggacaagat gaacaacaat atcggcattc 481 tgtgctacga aggcgaagct gctctcaggg gagaacccag aatacagacc ctgccggtgg 541 cctctgccct cagcagtcac cgcaccggcc ctcccccaat cagccccagc aagaggaagt 601 tcagcatgga gccaggtgac gaggacctag actgtgacaa cgaccacgtc tccaaaatga 661 gtcgcatctt caacccccat ctgaacaaga ctgccaatgg agactgccgc agagaccccc 721 gggagcggag ccgcagcccc atcgagcgcg ctgtggcccc caccgtgagc ctgcacggca 781 gccacctgta cacctccctc cccagccttg gcctggagca gcccctcgca ctgaccaaga 841 acagcctgga cgccagcagg ccagccggcc tctcgcccac actgaccccg ggggagcggc 901 agcagaaccg gccctccgtg atcacctgtg cctcggctgg cgcccgcaac tgcaacctct 961 cgcactgccc catcgcgcac agcggctgtg ccgcgcccgg gcctgccagc taccggaggc 1021 caccgagcgc cacctgtgtc tgactgcctg ccttcctgct tgcctgcagc tgccaccacc 1081 tgtgaccccg tggtggagga gcatttccgc aggagcctgg gcaagaatta caaggagccc 1141 gagccggcac ccaactccgt gtccatcacg ggctccgtgg acgaccactt tgccaaagct 1201 ctgggtgaca cgtggctcca gatcaaagcg gccaaggacg gagcatccag cagccctgag 1261 tccgcctctc gcaggggcca gcccgccagc ccctctgccc acatggtcag ccacagtcac 1321 tccccctctg tggtctcctg aagggagcgc ctcctccaac aacacgtgga tctgcatggt 1381 ttgcctgagc tttgaacagt cagtacttaa aaaaaaaaaa atcatggggg tggggtgggg 1441 ggaagggaag ggatggttta tttgcaaaaa ccatgttgtt gggatttgtg ttctgttttt 1501 gtacttgctt ggtatccgta caagggggcc ctcaaacatg atagcaggaa ctacgcgtgg 1561 aacatctgtc taatgtagca tccttacttc ctgcctcagt taccaaagaa acctctgatg 1621 caggtctgct gccccgacgg ggccaggact ccacagcgct ttctcagtca caagccatga 1681 tgaattggtg actcagacgc tttgtgcttt ttcctttgct tcttgagacc ggggtgtgtg 1741 tggctcagct tccacggcgt gtttggttcg gtccatgtgt gtgcgtgtgt atacttgaag 1801 agaactgtcg tgtctgattt gcactattgg aggaggacta aagttgcgtg acaactttat 1861 gtgttatgcc agaactctga gggcaaactg ctgaaaaaca aagggtttaa ggatgacatt 1921 tctgaccatt tgtgtgtttg ttgttgttac tgtttttgtt ttttttaatg tagacaatac 1981 agctttggaa ggggaagtct catacaggtt ataggtcttt ctctctctag atttcaggtg 2041 cttgcaactg gactgcagac tctaccaatc acgggcattt tatcttctct gaacactgca 2101 gtttgttaga ctagagctga ggttggagga ttccatagtg ctttaaacgt gatgcatgtt 2161 ttaatggaga aaaaatagct ggtttctatt aattatatag acagtaaaca aaaaccttaa 2221 tacttactat cttcttttca gaattagttt atttttgtca gttacagtcc tagatatact 2281 tactgctggt acagttgtac tctaagattg gtatttgata ttcactttac tcacaagtag 2341 tgcgggaggc cagctcctgg caggccctcg cgatgagcag tgggtcagct gcggtgtggg 2401 atgctggagt ttggctgcag gctgacatca tttatttttg catccctgtc tgctttgtta 2461 caagctccca ggggaggtgg ggtttgtgtc ttccaacttc cctacatgca gaaactgctc 2521 cctttgaact ctcttggctg aacagcagat tactgacaga caatctgtga tatggtgttt 2581 tatacgcttc ctcgtacgct ggggccaagg cagtatacat tcctctgact ttatactgtt 2641 attactgcat ttattatttg ctatattaat agctactaac tagaaattag atgaagcaag 2701 catgacagac acagctgtgg aggtcacagc tgctcctttt tggtcaatga gcgtttctat 2761 cccctccccc tggggtgtgc tgtgtcccac ctggcccacc agaggctcac gacgatggca 2821 cctgaccagg tgacgtgggc gtggtcacct cacctgcaag gctttgtgga ctctgcacac 2881 cgtatgaccc ccggttttac agtttttagc tgttgaattt tggaaattgg cactgggtga 2941 aaaggtcgga ggactggctc ttgtagtcac agagtggctg caggcctttg aaaagtggag 3001 gaaagaaaag cccttctcct tgccccgcac acatttcact cccactgtac tgggcttcca 3061 agctttggca ttcaggcccc tatattttct gtaggaaaaa tcgttgagaa cacttttcta 3121 tatgggtgat tttgagacca tcgttacgct gtgcgtaaag aatgtacaga gaaatttgta 3181 ggtatttttt gaagaacatt aatttgttaa tgatatgtag ctatttaatt tttccctttc 3241 ctattgtaat cattcatttt ttttgttgtt cggaaaaaaa aagttgatct ttttttttgt 3301 cgtagatttg tctgtaaaag tgcaggaaca gttattctat gagaacactg catctgcatt 3361 catagccacg agtttgttat tgctacaggc tactgagcgt cgtaacagga aaaccaccca 3421 cagctgaccg gctcggtgga ggacactcct gggacaggtc tctttgtcag tgaacaaggg 3481 cgtcactctg ggaggggtcg gcggtgctgg cggccgggtc cctggtgcac tgacctatct 3541 gggataggca gtaccctgga ggggggcctg gggcagagga ggcagcagaa aaccaaacat 3601 ttcactgaga aagccccctc cctgctctaa gaaggggctc cgtgaagttc ttcccagagc 3661 cgcgctgcct gcagtgcgct ctgaccttct cttcatgtgt gtaaatctgt aatataccat 3721 tctctgtggc ctgtttttcc tggaagaaga aaaaaaaagg tttggcaggc catctttttt 3781 tgtacttaaa agtagcctta agaacaataa taaagtgctc ttaaacc // LOCUS D50915 7883 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0125 gene, complete cds. ACCESSION D50915 NID g1469172 KEYWORDS KIAA0125. SOURCE Homo sapiens male myeloblast cell_line:KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7883) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (06-JUN-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 7883) AUTHORS Nomura,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. IV. The coding sequences of 40 new genes (KIAA0121-KIAA0160) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (4), 167-174 (1995) MEDLINE 96127530 FEATURES Location/Qualifiers source 1..7883 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..1289 gene 1563..1796 /gene="KIAA0125" CDS 1563..1796 /gene="KIAA0125" /note="The KIAA0125 gene product is novel." /citation=[3] /codon_start=1 /db_xref="PID:d1010116" /db_xref="PID:g1469173" /translation="MGLSHAVAGDLLLPATPCWTGAWASPGLCLGFVTQGLCRRAQSI SPWEAQGCLLELCGPALAGELVVWVFRLYPSTS" 3'UTR 1797..7883 BASE COUNT 1647 a 2142 c 2276 g 1818 t ORIGIN 1 ttcaagtatg gcagacaaag gatgttctgc gtggggaaat gtggtgacac ccatttcaca 61 aggacagctc acatagattg agtgctcagg aaggaccagc accataccca gtgcctgatg 121 tgtatcatct caattagtcc ttgcctcaga tgcaaaagga aaccatcgcc atcatcatca 181 ccaccatcat catcttcctc ctgtgcagat ggaaaggctg aggcatagag aggtgacgga 241 gtctgcccag gactgcaagc ctgctggtgg cagagccagg ttccaatgga atgaaggctg 301 tcatcctcag atggcagggt aggcaggtgg ctagagctca cttgggagaa ggggaaagga 361 cactgacttt ggctagggat ggagcagagc ttgggctggc tttccatgca cgggcagggg 421 gcgtggctca tggctacgct ccagccccgg gtgtggacat ttaatcttcc aggtctaccc 481 taggctatgg gtctggacag cactgtgatg gaaagaagac actctatgtc ctgcattctg 541 tgaccaatga tgtgactgtg ggaatggcgc tggcatctgg ctgccactct gggacgggtg 601 gccagctgcc atcaggcccc acccaggatg ggaccaccat gcgacttctt ccctcgctcc 661 tcctggtcat gtccagagcc ccaggaggac cagcaaagcc tctcgagccg atggcagctc 721 acgttctgcc ttgtcagcta ctcctctcct gggcaatatt ggctgcttgc tgtggctctc 781 cccggggtat gtgactgcct ctgtgctggg cacctggcct gggctttcct tctgggcctg 841 ggcagctggg ctcagcttgg acccaggcag cagccacaga ggggcccatg gaggtgacag 901 agttgcttct atgatggtga acgggcagct gtgacacgga ggaggcgacc actcctgagt 961 ttccaagtgc tgcggtcagg gccggggcca gcaaagtccc tcccatattc aaagagcggg 1021 tttgggtttg tcccaggagg acatagtcag gagcccatgc tgggacatgc ctcctccaaa 1081 gttcagcctg gatccccagc ctctgccaac ggccccgctc cttagctaac ccagcttgct 1141 cctgggttcc acggcggagt cagatgtttc tgggcagttt cacctttgtg ccttaaatgc 1201 atgttgagga ctttaaggaa ttgtggagaa atagggctgt ggcaaaggca agtgacaact 1261 gggaacaatg atcccgcaga ggctgctgag gcctgggccc caggggcgtg ggttcatcct 1321 tctgcctggg ctttggtggg aggggcagac tctgtggtct gagacacaaa aaaacccaaa 1381 acatatgtgt gtacagacac acagcagagc cacacacaca cttgtgccca tgcacacact 1441 cacaggaggc ccgtggactc cgcacaggga agaaactcct ccggtcgaca gtggacggcg 1501 ctgcagcagg gactcacccc caagccctgc ctgcctccca ttgcccacct ggccctggct 1561 tgatgggctt atctcatgct gtggccgggg acctcttgct tcctgcaacc ccttgctgga 1621 ctggggcctg ggcctctcct gggctgtgcc tagggtttgt aacccagggc ctgtgccggc 1681 gtgcacagag catctctccc tgggaggctc agggctgcct cctcgagctc tgtgggcctg 1741 cactggccgg tgagcttgtg gtgtgggttt tcaggctgta tccttctacc tcctgagccc 1801 aggggtccca ggcgccctgc agctgtctcc tcggccatcc tgtggggccc cgaggccttg 1861 ccctcacttc agtgcctggg tgctcaggct ttgcccaggt gccaggagaa ggtgtgagca 1921 tgagcctatt ggacacacct ggcgacgtat accaggtgtc ccacccctgc caccatgggg 1981 cctcccgata cggcaaccac cacggacctg tggggaccaa tgaggaaaga gagaggcagg 2041 tctgggccag gctcacaggg actccggcat agcagaccct gccccagcag gcccccttgt 2101 ccttcctggg tcctggtcct tcatgaggaa ctagcccatc cctggtgggg ctcccacccc 2161 gcttctcagt gggctctatg cttgcctcgt cggagtcacc cctcaggcag tcctgggatc 2221 ctctccttta gacccactgt gccttcccgg cctcccgggc ttctgctggg ggcagaagaa 2281 atgcctcccc aggtctgtct ctggaggctc tgagggagat gggcttgggg gctgtaggag 2341 gaggcaggga ttccagggtg tcaggaaggc aggggtgcca ggtcccacct agtgaagtaa 2401 taaaccgtgg gtggtgatag tgacccagtg ccctcactgc ccagccccgc ctgtcctcag 2461 ccagcactgc agggatccca ggcccagact ctggaggcct tcactgatcc cagccacccc 2521 agaaaagctg cagcctgcag gcaccagccg ggccatatgc ccagtgccag ctagggccca 2581 ccgcccatcc tgcacacggg gccgctgggc aggtgcccct cacaccccca ggatgtcagt 2641 gctcacctcg agcaaagcgc cccagctcgg ccttgggagg tggtcatgtc cagggggatg 2701 atggagagct gtccaaccaa gagagcggga gggagggaag gagggaggga gagagataga 2761 gagagagaga gagagagagg aagtgtgggc cctaaggctg ccttagtgga ggtgcgcgtg 2821 gcctgcacct caccaagcct agccactctc gcggctctga gtggctcaca ggcttgtgag 2881 ggccccgtcg ctgcctgctg ggtccccacc agggctccct ctaggaatgc gccatggctg 2941 ctatgacaat ttgcacagcc cagtggctta aacaccattt ataccacagg tccagatgaa 3001 tcctgcaggg ccagggtctg ggggtgctgg aggccatgct ccctccaggc ttgcggggag 3061 aacttccctg cctcctccag tctctccatc cctgagctct cggctcctcc tccgtcttca 3121 gggccagggc gtagcgtctg ctctctcggc ctctgcctcc gcttcccacc tcacctggct 3181 tctgtctatg tcagtctccc tctgccaacc tcctagaagg acacttgtga ttacattagg 3241 gctcacccct ttaatccagg ggagcctctc cacttcatga ttttcagcta acttgcttct 3301 gcacagaccc cctttcccta taagggcaca cattcactgg tcccggggct aaggaccttg 3361 ctccaagtcc ctccacccat gatgctgtgc cttccagaaa cctgtcctct gcagctcggt 3421 cttgacccca agcctgctgg tgacctgaac ttcacagggt tatccccttg gactgtgtgc 3481 agcacgatgc aatttctggg cctgaatgtc atgctccctg gggcaggacc ttgagcctgc 3541 agcacacact aggccacctg cagtctcaca ggccatgccc tgggtagaca gggaggtgct 3601 caaccccagc tcgggtcctc tagtctgcct ggctaccatg cttctcactc tcctgcatct 3661 gcagaccctg cgttgccatg tgaggcaggg gtggggtggg gctgagggcg tggctttggt 3721 ccctggctgt ccggatgaag taccagagtg acgccacagc ccatcccggt gacatgctca 3781 cccccaaccc ccgtgtccgg gaccccggtc ttgtgtggtc cctgatgtgg agtcctcagt 3841 ccttaagata catccagaaa gtcctggcca tgaattggag gtgcagagtc ctgcagagcc 3901 tctgggctgg gctggtgccc ccaggagatg gagggcctgg tggatgccct cctccctcag 3961 agctggggca gctgcctccc aggggtggga ctctgggctc agagagaggc ccttgagctg 4021 cagctcaggg ggatgcgagg cttcgtggac tgtgtcctgg tccatgtggt gcacgtgtct 4081 ccacctccaa ggagaggctc ctcagtgtgc acctccccca catccgtcct ctctgccggc 4141 cccgggcgtc tgagcagtca ttccatgcca gcacctctgc agcctgctgg gcctcaggtt 4201 ctctgtgagg gacctccccg gccttcggcg gaggtggagt aagctccgtc aaggcaggtg 4261 gcttcgtccc ttcctgtgag tgacaccagt gatgaaatgg acccctccac acaggcatcc 4321 tcagggcaca gggccctggg ggcaccttcc tcctttcgta tttgttgaga aaaaaagtgg 4381 cattgcgctc acaccaggat gctggagcag agctgacatg ctcgggaaag ggcagaggtc 4441 actgggggtg ggaaggtcat ccagtccaga ctcagcacct cgtgggctgg taaactgagg 4501 ctcaaagtgc tggtgccagg cctgaggcct cgcggtgacc cctctctctg gttcccagca 4561 cctgcctgag acctgcccca ggcacccata acctggaatt ccctgtttcc ttgtccaggg 4621 cctgaggaaa tggctcccca ggtctgtctc tggatgctct gaggcagatg ggcttggggg 4681 ctctaggaag aggcagggac tccagggtgt caggaaggca ggggtgccgg gtcccaccca 4741 gtggagtaac aaactgtggg tggcgtttgg gcctccccgc cttccccact gggtgtgctg 4801 gtgctggcgc tgctgggtca gggctgcccg tgaccccaga caccactgtc catcctgtga 4861 ggctcccgtc tgggcatgtc ctgggtggat tcctcctttc tgttaagtag ctacatgagg 4921 caggggctcc tggatccaaa gcaaatgaca ggaattccag agccaggtgc atccactcag 4981 ggcagccagt gttggtggag ctgcctctag cacatggagg agagtgaaag tcagcctgcc 5041 cctctcacga gaaaagaacc tggggatacc tctcagcctc cagcgttgca agtgcaaggc 5101 cagtggagtt aatctgcaac gtgcacgagg gcgtgtgtca gtggctgtgt gcaggagtgt 5161 gagtgagcaa gagcaagagc gcatggctcc tgctgtacct caaggtgtgg gctcctggtg 5221 gctgctcagt gttcccaggg gtgagaggcc tcatgtatcc taggctgcct gagatttctg 5281 tgtgctgatc gcatcctcag tttcttgtcc accgcttcac tggcaagagt cccaggctcc 5341 aaggacaccc tccctgcaca tgattgggtg ttaatggtgg cctgggttgt gtcttcccct 5401 ggggatgagg gttgggtgtc catggtgccc tgggctgtgt cctcccctag ggatgagggt 5461 cgggcctcca cgatgccctg ggctgtgtgc tcttatggga atgagggttg ggtgtccaag 5521 atgccctggg ctgtgtcctt ccctggggat gagggttgga tgtccaagat gccctgggct 5581 gtgtactccc ctaggaatga gggctgggtg tccaagatac cctgggctgt gtcctcccct 5641 ggggatgagg gttgggtgtc catggtgccc tgggctgtgt cctcccctgg ggatgacggt 5701 tgggtgtcca tggtgccctg ggctgtgttt ccttggggat gagggttggg tgctatggca 5761 tcctgggcag gtgcttcctt tctgcacaag ggttgggtga ccatgatgtc ctggcaatgg 5821 cttccctggg ttgcctcttt tctgccatgt gggaagagca ggggaggttt agttggtctc 5881 agcacatcat tctctcagga taagtagaag agtgtctgag ctgtgaggcc agtgctccag 5941 ctttggaatt gtcttcccca ccctcacctc catcccatca aagcccgaca tgtcgtgtgg 6001 cagcagcgag gtgggtgttg gctgttctct tgggctgggg gttagtcgtg gacggggaaa 6061 ggagagatgc tggtcaaagg gcatgaagtt tctgctgatg ggaggagtca gttcttttga 6121 tctgttgcac agcatggtga ctatagttaa caataatgac tatttcaaaa ttgctaaaag 6181 atgagatttt aaatgttctc accacaaaat gataagtgtg tgaggtgatg gatatgccac 6241 ttaccttgtt ttaatcatcc cacaatatag acaggcattg tcactttgca ttgtacccca 6301 ggaatcttca catttgcttt tttgtcaatt aaaaatagag acacaaaagg agagagggga 6361 gagcaataga ctcttcacgg aaccgtgggc ttctgcctcc gggtaaaata aactgcaaaa 6421 aggattccca ggaaaccgtt ccctctttca gcccttggtt acaggaagcc ggatttggga 6481 aatctgcctg gatgacattc acatgaacgg gcacatacag gaaaacacgg taatgtaatt 6541 agaatagtca gagaaaagta gccagaaatg acattcacat gaacgggcac atacaggaga 6601 aaacacggta acgtaattag aatagtcaga gaaaagtagc cagaaatgac attcacatga 6661 acgggcacat ataggagaaa ccatggtaac gtaattagaa tagtcagaga aaagtagcca 6721 gaaatgacat tcacatgaac gggcacatac aggaaaacac ggtaatgtaa ttagaatagt 6781 cagagaaaag tagccagaaa tgacattcac atgaacgggc acatacagga gaaaacacgg 6841 taacgtaatt agaatagtca gagaaaagta gccagaaatg acattcacat gaacgggcac 6901 atacaggaga aaacacggta acgtaattag aatagtcaga gaaaagtagc cagaagaatt 6961 tgcaacgtgc ccttgtaaca ccaaatttga tcagtttttt aaaaaatgat cgttatgtag 7021 gtgattgaga agtaaatgta ttctttttta aggtaaaaat ttggaccctt atcatgcata 7081 cccccctctg tgctcttcaa atcaacatca ttattaatat ctgtacattt ttgctcatct 7141 gagccagcac aggctgaggc tgtcagaatg gacacctttt ggttgttggg tttctgtcag 7201 tttctggggt gaagctgcgt gattgagaac gtagctcttg gctgccatct cggggattat 7261 taaggactgt gaactctatc cacaagccat ggcaatatct gtcccaccga atgctccctc 7321 taacacactc ttactcccgt gatgtgtgtt aagggctccg atgatgctga aaacagcaca 7381 ggatgtgaaa aggcaggaac agttctgaag tcaaaggctg atgtcctgtt tctctttccc 7441 tctgtgaccg actcccttcc cagtggtaac aagtacccac agcttggttt gaatttctgc 7501 acgctgttgt ctgtgcactc gctcacactt acgcacacag caggcatgtg ggcgatgctg 7561 ggtattttgt gtatgagtgg gatgcacata cacacatcta catccatatc atgcccatgc 7621 atctgtaact tgcttttccc gtgtaagaac acttcttaga gtttgttcaa tgcatgtgtc 7681 tgtgtgaatg attgaaggca tttctaaccc attttaaaga tggctactta ggaccatatg 7741 gatgttgtac tgatgtcatt tgaccacgtc cattgtttcc atcttttggg ctgttcttgt 7801 gtattttact ttccatgtaa cactgtgaca ttgagaattg gtacctacaa cagtctattt 7861 gctttacatt aaatttgtag gct // LOCUS D50916 6060 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0126 gene, complete cds. ACCESSION D50916 NID g1469174 KEYWORDS KIAA0126. SOURCE Homo sapiens male myeloblast cell_line:KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6060) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (06-JUN-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 6060) AUTHORS Nomura,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. IV. The coding sequences of 40 new genes (KIAA0121-KIAA0160) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (4), 167-174 (1995) MEDLINE 96127530 FEATURES Location/Qualifiers source 1..6060 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..72 gene 73..3294 /gene="KIAA0126" CDS 73..3294 /gene="KIAA0126" /note="The KIAA0126 gene is partially related to a yeast gene." /citation=[3] /codon_start=1 /db_xref="PID:d1010117" /db_xref="PID:g1469175" /translation="MTDQENNNNISSNPFAALFGSLADAKQFAAIQKEQLKQQSDELP ASPDDSDNSVSESLDEFDYSVAEISRSFRSQQEICEQLNINHMIQRIFLITLDNSDPS LKSGNGIPSRCVYLEEMAVELEDQDWLDMSNVEQALFARLLLQDPGNHLINMTSSTTL NLSADRDAGERHIFCYLYSCFQRAKEEITKVPENLLPFAVQCRNLTVSNTRTVLLTLE IYVDQNIHEQLVDLMLEAIQGAREYMNKIYFEDVTEFLEEVIEALILDEEVRTFPEVM IPVFDILLGRIKDLELCQILLYAYLDILLYFTRQKDMAKVFVEYIQPKDPTNGQMYQK TLLGVILSISCLLKTPGVVENHGYFLNPSRSSPQEIKVQEANIHQFMAQFHEKIYQML KNLLQLSPETKHCILSWLGNCLHANAGRTKIWANQMPEIFFQMYASDAFFLNLGAALL KLCQPFCKPRSSRLLTFNPTYCALKELNDEERKIKNVHMRGLDKETCLIPAVQEPKFP QNYNLVTENLALTEYTLYLGFHRLHDQMVKINQNLHRLQVAWRDAQQSSSPAADNLRE QFERLMTIYLSTKTAMTEPQMLQNCLNLQVSMAVLLVQLAIGNEGSQPIELTFPLPDG YSSLAYVPEFFADNLGDFLIFLRRFADDILETSADSLEHVLHFITIFTGSIERMKNPH LRAKLAEVLEAVMPHLDQTPNPLVSSVFHRKRVFCNFQYAPQLAEALIKVFVDIEFTG DPHQFEQKFNYRRPMYPILRYMWGTDTYRESIKDLADYASKNLEAMNPPLFLRFLNLL MNDAIFLLDEAIQYLSKIKIQQIEKDRGEWDSLTPEARREKEAGLQMFGQLARFHNIM SNETIGTLAFLTSEIKSLFVHPFLAERIISMLNYFLQHLVGPKMGALKVKDFSEFDFK PQQLVSDICTIYLNLGDEENFCATVPKDGRSYSPTLFAQTVRVLKKINKPGNMIMAFS NLAERIKSLADLQQQEEETYADACDEFLDPIMSTLMCDPVVLPSSRVTVDRSTIARHL LSDQTDPFNRSPLTMDQIRPNTELKEKIQRWLAERKQQKEQLE" 3'UTR 3295..6060 BASE COUNT 1747 a 1295 c 1198 g 1820 t ORIGIN 1 cgtgtcgtcg ctgctggcac ttcaggctct gcctctccca ctaggtctgg atggaggata 61 ccttaaagtg aaatgacaga ccaggagaat aacaacaaca tctcaagtaa cccctttgct 121 gctctttttg gctccctggc tgatgccaaa cagtttgcgg caatccaaaa agagcagctg 181 aagcaacaat ctgatgaact cccagctagc ccagatgact cggataatag cgtgtcagag 241 agcctggatg aattcgatta ctctgtggct gagattagcc gctcattccg atcacagcag 301 gaaatatgtg agcaactcaa catcaatcac atgatccaaa ggatcttcct tattactctg 361 gacaacagtg atcccagctt gaaaagcggg aatggcatcc ctagccgttg tgtgtatttg 421 gaagaaatgg cagtagagct agaagatcaa gactggcttg atatgagcaa tgttgagcag 481 gccctcttcg ctcgcttatt acttcaagat ccaggcaacc acttaattaa catgacttct 541 tctacaacgc taaatctctc tgctgatcga gatgcaggag agaggcacat tttttgttac 601 ctttactcct gcttccagag agccaaggaa gagattacca aagttccaga gaacctgcta 661 ccctttgcag tgcagtgcag aaacctcact gtgtccaata cccgaacagt tcttctcacc 721 ctagagatct atgttgacca aaacatccat gagcaactgg tagatttgat gttagaagcc 781 atccagggag cccgtgagta catgaacaag atctattttg aagatgtaac tgagtttctg 841 gaagaggtca ttgaagcctt gatattggat gaggaagtta gaacatttcc agaagtcatg 901 attccagtgt ttgatatttt attgggccga ataaaagatc tagagctctg tcagatcctt 961 ttgtatgcat atctggatat tcttctctat ttcactaggc aaaaagatat ggcaaaggtt 1021 tttgtagaat acattcagcc caaggaccct accaatgggc aaatgtacca gaagaccttg 1081 ctgggagtaa ttctgagtat ctcctgctta ttaaagactc cgggtgttgt agaaaatcat 1141 ggctactttt tgaatccatc tcgttccagc ccccaggaga tcaaagtaca ggaggccaac 1201 atccatcagt tcatggctca gttccacgaa aagatctacc agatgctgaa gaacttactc 1261 cagctctctc cagaaaccaa acactgtatc ttgtcctggc ttggaaactg tttgcatgca 1321 aatgcaggcc gcaccaagat ttgggccaat cagatgccag aaatcttttt ccaaatgtat 1381 gcctcagatg ctttctttct gaatctgggt gctgctctcc tgaagctatg ccagccattt 1441 tgcaaaccca gatcctctcg gctcctcacc tttaatccca catactgtgc cctcaaggag 1501 ttgaatgatg aagaacgaaa aattaaaaat gtacacatga gaggtttgga caaagaaacc 1561 tgtttgatcc cagctgtgca ggagccgaag tttccacaga actacaacct tgtaacagag 1621 aaccttgctc tgacagagta caccttgtac ttgggatttc acaggttgca tgatcagatg 1681 gtaaaaatca accaaaatct gcatcggctg caggttgcct ggcgggatgc tcagcaaagt 1741 tctagccctg ctgctgacaa tcttcgtgag cagtttgaac gactgatgac catctatctt 1801 tctaccaaga ctgccatgac agagccacaa atgctacaaa actgcctaaa cttgcaggtg 1861 tccatggctg ttctactggt tcaactggcc ataggcaatg agggctcaca gccaatagag 1921 ctaacctttc ctttgccaga tggctacagc tctttggctt atgtgccaga attttttgca 1981 gataacctgg gtgattttct catttttctc cgccgctttg ccgatgacat tttggagaca 2041 tcagcagatt ccctggagca tgtccttcac tttatcacca ttttcactgg aagcatagaa 2101 agaatgaaga atccccacct gagggccaaa ctagcagagg tgttggaagc agtgatgccc 2161 cacctggatc agaccccaaa tcccttggta tccagtgtgt tccaccggaa acgtgtgttc 2221 tgcaactttc agtatgcacc ccaacttgca gaggctctaa tcaaggtttt tgtggacatc 2281 gaatttacag gagaccccca tcaatttgaa cagaagttta attaccgccg tcccatgtat 2341 cctatcctaa gatacatgtg ggggacagat acctatcggg agagcattaa ggatttggct 2401 gactatgcct ctaagaattt agaagccatg aatcccccac ttttcctccg ctttcttaac 2461 ctgctaatga atgatgccat cttccttttg gatgaagcca tacagtattt gagcaagata 2521 aagattcagc aaattgagaa ggatcgaggt gaatgggata gtctgactcc agaagcccgc 2581 cgagaaaagg aggctggcct acagatgttt ggacagctgg cacgtttcca taacatcatg 2641 tccaatgaaa caatcggtac ccttgccttt ctcacatcag agatcaagtc actctttgtg 2701 catcccttcc tggctgagcg catcatctcc atgttgaact acttcctgca acacctggtt 2761 ggccccaaga tgggtgcctt aaaagtcaag gacttcagcg aatttgactt caaaccccag 2821 cagcttgtat cagatatctg cactatctac ttaaatcttg gggatgagga gaatttctgt 2881 gccactgtgc ccaaggatgg acgttcctat tccccaactc tctttgcaca gacagttcga 2941 gtcttgaaga aaataaataa gcctgggaat atgattatgg ctttcagcaa cttggcagag 3001 agaatcaagt ctcttgcaga cctccaacaa caggaagagg aaacctatgc agatgcctgt 3061 gatgagttcc tggatcccat tatgagcaca ctgatgtgtg accctgtggt gctgccatct 3121 tccagagtca ctgtggatag atccaccatt gcaagacatt tgctcagtga ccaaacagat 3181 ccctttaacc gtagtcccct caccatggac cagatccggc caaacacaga actaaaagaa 3241 aaaatccaac ggtggcttgc agagaggaaa caacaaaagg agcaacttga atagatactg 3301 tgaactaacc aaaccaaaac caaccccaga gtgcagataa acaattgttt gtggtttctc 3361 tctttctggt tctgttcctt ttctttcttc ttttcttttt cttttttttt ttttttttac 3421 taaattagag aactgctctt gctgaaatta tgatagttaa gattcctaag aacttgacaa 3481 tgctcccctg gcttgcagga aattatactt attatagcat caccaagttt agaaatcatc 3541 actaattttc accctctgtt gtctcttcat attcttcttc cttttcctaa atgaaattat 3601 attatttcaa ctactaaaac ttcccgccac cctgctattt ctcttccaat tactgttcaa 3661 acatttttgg tgagtgctgc ttttataaat atgtgatgtc acatatttca gtgacagctg 3721 atgttatcag aaaaatcctg ttatcctgtg tatttactag tcctcatgat ctagacttaa 3781 ccccttttct gtgttacaga gtaacttcac ataaaagcag atacctttaa gtttgtgtag 3841 acttctgaat aagatgatag acgagaattt ttttaaaaaa aggaatttaa aggaatcttg 3901 tgcaacttct gttgatagct cttaattttg ttatacagtc tttttaatga ggattggtag 3961 ggcaatccta ccaattgatg ggggtgagga gactgttggg cccttattct cttatgaaaa 4021 tctgtttcta caaggactag gcctaagtag atttaggcaa cgggtatcat atccatgaaa 4081 aatgtcattt taagacacta atatcaacag tatatctcag tgtgtgagat atataaatac 4141 acatacaccc catgtgcttt tttgttggtt tagtgtaaac tgagtaccac tttatatttt 4201 ctcttccata gtgagatgta tgcagtatgt ggccatgttt actaacatac tcattcttca 4261 caaaattctg agattataaa atgtatatag ttaatatttg tttgagcttg taacctgact 4321 tctaagagct acttctaaca cagccttagg tcttcaatta acaaagtagg aagttgagca 4381 caaccttgtc cacagccatt ttaaatatcc acttagccct gtgtagttgg taggtggaca 4441 ccactttcat aagtgaactt gagctcacta ctcaattgtg cagcttaata tgctgactag 4501 ggacattccc ctatggatta tctttatatc actgtctttc taagcccaga gatgtcataa 4561 gtgacaagaa aagtgatagg atcaagaaat gggtgtcact gcttcatttg gaacatgtta 4621 ttaactatgg tctctttcaa ataaatagta gttttatatt tcaacacaag ttgctataag 4681 cagtccttga tgggtttttg attgcatcag ctggaaagcc tcaaatctaa gagggcttcc 4741 catcctagat ataaaatagg tgttgcctat tgctgtgctt ataaaatgaa aaaggaaatt 4801 gaggacactt ttgcaaatgc cagaatgtaa gattcattca gtgtgctccc tgggccttta 4861 tggcatgggt tgacaggatt tgtttatttt ctaaaattag cttcattcaa tatttatcat 4921 cctcctttcc ctctctgaga atgaactatg tataaaataa gcttctgcct atttgcattt 4981 atcttccaaa cccaatctag taggatgttc tcattttaaa aacgagggga aaagaccaga 5041 gtttttcagg agaaaactgg aggaaaatgg gcacaaaaac tcagaaggca gctattccca 5101 gcagcttcct agttaacaac ccccatgctg cctccagtct ttgtctgtat tcttctgtat 5161 ttaaccttca gattgtaagc cttttctggc aagcttttct tcttttttta aactcttttc 5221 ctgaaacttt ttatgaatgg ctatggcacc attaatgctg ctgaatatct ttaaactctg 5281 cacaagcaag tgtgtagctt aaggccacta ctggtaagga aaccaagtgt cctctgtgcc 5341 ttttttcttt ctgtgaagta atttaagaat atccaaaaaa attagacttt aaaaagttat 5401 cttggtacaa caccgtgtgt atatacactt ggaagcttaa aaaggtgttt tgtctggaac 5461 ttagaagcag ctctaaatct agtagagcag actttctaac atacctagtt ttgtgtattg 5521 gctttgctgg agtatgatag caaaatgaag actcttttac tcagctctgg tattgctcat 5581 aacttaccaa gaggctaata ctaaacttgg aaaattgttt aagtatgttt tatcaagcag 5641 tctgggtttt gttttttaat atacttttta atggatatgt gaaaactgaa ggaaatgtta 5701 aaggtttttt aatggtgcaa gtgaaggtgc cagttgctat ttgatatcac actctacaaa 5761 agcttcatta ctttatttga tggtggttgc taagcagcca ttgcacagag cataagtcta 5821 ctgggtgcct ttacatgcca gaggctgatg ctgcactgtt gatgtcatgt gaggaaataa 5881 tgcacatgct ctaactgctc aacaggaaat gaacctagaa acagaaaatg aaaaggttga 5941 ttgaaataaa acttgatcaa cgcgactgta ttttgaaaca ttccaggaag gttacttctt 6001 gtcaaacttg cctggcagtg tttgttcaaa acttgtattt aataaatgaa catctgactt // LOCUS D50917 5544 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0127 gene, complete cds. ACCESSION D50917 NID g1469176 KEYWORDS KIAA0127. SOURCE Homo sapiens male myeloblast cell_line:KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5544) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (06-JUN-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 5544) AUTHORS Nomura,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. IV. The coding sequences of 40 new genes (KIAA0121-KIAA0160) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (4), 167-174 (1995) MEDLINE 96127530 FEATURES Location/Qualifiers source 1..5544 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..297 gene 298..1242 /gene="KIAA0127" CDS 298..1242 /gene="KIAA0127" /note="The KIAA0127 gene product is novel." /citation=[3] /codon_start=1 /db_xref="PID:d1010118" /db_xref="PID:g1469177" /translation="MLGKGGKRKFDEHEDGLEGKIVSPCDGPSKVSYTLQRQTIFNIS LMKLYNHRPLTEPSLQKTVLINNMLRRIQEELKQEGSLRPMFTPSSQPTTEPSDSYRE APPAFSHLASPSSHPCDLGSTTPLEACLTPASLLEDDDDTFCTSQAMQPTAPTKLSPP ALLPEKDSFSSALDEIEELCPTSTSTEAATAATDSVKGTSSEAGTQKLDGPQESRADD SKLMDSLPGNFEITTSTGFLTDLTLDDILFADIDTSMYDFDPCTSSSGTASKMAPVSA DDLLKTLAPYSSQPVTPSQPFKMDLTELDHIMEVLVGS" 3'UTR 1243..5544 BASE COUNT 1504 a 1281 c 1157 g 1602 t ORIGIN 1 ctcctgcacg gcgagtgctg gagcacgagc taccgctcgc tcggtcaggg cgccccctcc 61 gcccgcctcc tgcttcctcc tccgctgcct gccgccgccg cctccaccat tgtataatgc 121 tcggggcgcg caggcagaga acggcggagt cttagcttca gcctcgcctg ctgcccgctc 181 cccggcgcca ccctcgggcc cctggagcgg ggcactccgc atggagcggg agtagctgag 241 gagtgggcgg aaacccctcc tgatgcgtta gttcccaggt ggagctgcat gtgatatatg 301 ttgggtaaag gaggaaaacg gaagtttgat gagcatgaag atgggctgga aggcaaaatc 361 gtgtctccct gtgacggtcc atccaaggtg tcttacacct tacagcgcca gactatcttc 421 aacatttccc ttatgaaact ctataaccac aggcccctga cagagcccag cttgcaaaag 481 accgttttaa ttaacaacat gttgaggcgg atccaggagg aactcaaaca ggaaggcagc 541 ctgaggccca tgttcacccc ctcctcccag cccaccaccg agcccagcga cagctaccga 601 gaggccccgc cggccttcag ccacctggcg tccccgtcct cccacccctg cgacctcgga 661 agcactacgc ccctggaggc ctgcctcacc ccggcctcac tgctcgagga cgacgatgac 721 acgttttgca cctcccaggc catgcagccc acggctccca ccaaactgtc acctccagcc 781 ctcttgccag aaaaggacag tttctcctct gccttggacg agatcgagga gctctgtccc 841 acatctacct ccacagaggc ggccacggct gcgactgaca gtgtgaaagg gacctccagc 901 gaggctggca cccagaaact cgacggtcct caagagagcc gcgcagatga ctcaaaactg 961 atggactctc tgcctgggaa ttttgaaata acgacgtcca cgggtttcct gacagacttg 1021 accctggatg acatcctgtt tgctgacatt gatacgtcca tgtatgattt tgacccctgc 1081 acttcctcat cagggacagc ctcaaaaatg gcccctgtgt ctgccgacga cctcctcaaa 1141 actctggctc cttacagcag tcagcctgtc accccaagtc agcctttcaa aatggacctc 1201 acagagctgg accacatcat ggaggtgctt gttgggtcct aagacccagg gacccagcga 1261 ctatgcccac ccagacccca gagcgttccc ataaccctga cagttctcca cactgtgcat 1321 gcacccttgc ttgccttttt cagagaaaaa gaaaatttta caacaggatc acactagttt 1381 ttgctttgag cagagttgga gtgccttcat ccaagtatga ccacttttaa tacacttttt 1441 tgagtggttc ctcagagacc tactaccctg gtataggaaa gaatccattt gaagacaatg 1501 ttgcaatgtt gaatgacaaa aataaacagt tcaagtgaag cacaaggatt aagttggaaa 1561 agctgtaaat tgcatgtgca tatttgtcta ttttttctat aagttttatt gcaagaggta 1621 aagaagaaaa ctatatatat atatcttatt tagataatct cagtaccttt tctggcattt 1681 ttgccctgta taggttgact tggcaattcg gcctttttag aggcattaac tactcctcgt 1741 aagtgttgca tttacatggc tgtttagaaa actgctgccc aaatttattt tatatttttg 1801 tacagattct gcagtttatg atattgtttt tctaaaaaca aatgctgttt atacatatga 1861 gatagctatt ttgataggat ttgctcacat agttcctgca aacttcagat gtacaagttg 1921 cacttgtact tttatagagt tgtaatgttt tatatgtgta tggtgcaaga gaaaattgga 1981 tcaaatcaat ctgcagttga tgtccccaaa tgcaaacaca ggcacacaca tgcacacacc 2041 cataaacaca cacacagtgc tttaagaaag ggccaggtga tatcacaccc aaatttcaca 2101 agcactgacc ccctggcacc aacacccgcc agtactgtga cttccaaagc cagagccaca 2161 tgtgctcatc aaacttgcat taagcagttg gcgggagatg gctgtggagc tgggggttta 2221 agtgatggtt ctcttttgct ccctcttttg agggtaaagc tactgtcttt cttaagagtg 2281 tatttatgcc aagtttgcgc ttttaattgt ttttattttg ttttttaatg aaaacccaga 2341 tctttccttt ttggcataat ttttatgatg acctgaaatt ttacatccga acaaaatttt 2401 acatccgaaa agcaaccaac ttcttcatgg aactcagccc tgttgcaatg cttagggccc 2461 ttaaagaaga aaatctcccc agaaggcatc catcatgttg cttaattgtc ttctgcagct 2521 tcctttccct agagctttcc ctgtgttgct aagagctgaa aatggcatct tcgtgatcac 2581 cacagtgagc ttggctcgcc tcggccggcc cgggatgcac tcttacaaca tgtgtgactc 2641 ttgaacctgg agttcatcac attacgtcac agcttcccat ctggttgctt tcctgagtca 2701 gctacttcac acttgtcaag gctgttttac cccaaaactc agacaggact ttctatgcat 2761 gttttccctc ctccccccaa ttcccccccc atcaccttat ctcccaggac acacttgaga 2821 agtagctttt tattcctagt ggtgtacatt taattttaaa aaggttgcaa tgtatcatgc 2881 ttgttgccga aactgtttat ggccttcttg tttcagtttt ttcttttctt ccaatggtac 2941 tttagctgtt gagtgcaggt tacaacctat attgttatgc agatggcttc tttaggaata 3001 acttttatat ttatttaaaa atttttaaat tatgggatgt tttgttgttg ttgttgtctt 3061 tgttgttggt catttgtcaa tattcagtca ccaattctgc tcacttcttg ccatggataa 3121 aattgggtct ttctggctaa ttaaaaaaga caactttata aaatggcact ttaagcaagc 3181 catagttagt tttatttttg taatgcacat ggcaaagcaa agacgtttgt gatgaaggaa 3241 ctgctcatct aagcaaaaga tttgagtatg atatgataaa ggctttctac attctaattt 3301 actttttccc cccacttgaa tgtgttttaa aggctaatta tcagctcagt agagcagtga 3361 gaaactgatc aaattgcact tgttctccta caagcaacct ccacgcagac acctcgtact 3421 gctacaggtg tgtcatttcc tttaatagga ccagggacca tgtaactgag gtgagggttg 3481 tagtagatgc ttccagtgtc agtatgcctg ttaattttaa gagcttccct ttcttgcaga 3541 gaacaagtct gcccagattc catgctttct ataactggag gacctggcaa acctgccgca 3601 tgctgcacac atctacctac gtacacatat acaatagtat tgatgattct gaacaataac 3661 agggtaaaac agttggtttg ccattgttaa aaactgattt acagtaactt acaacaactg 3721 tacttttgtt ggattagcaa atcatgtgtt taaacaaatc ccatatgttg ggcaacagtt 3781 caaataagca cggagaagtg ttgcccaaac ttggttctct gactcttatg tatttgtaag 3841 gctgggcttc aaaatcaaaa caaaaacccc aaaaacagca ggcaaatgct ttttaactct 3901 gacaccgttg ccataaatcc ctgatactca aagtctaaca agaaagacat ggaaaattag 3961 cagcccattt tcagaaagat caaaatgatc tagggttcta attgcttttg catcctattc 4021 ttacaaagtg atgtcccaac agggaacagt aggagctgga gtgggatctc caagtcccag 4081 tttgagtgtg ggatgtgctt ccagcagtgc cttcccttta tgaaagacat cacatggcat 4141 ccagggccag gcaggcagct tgaggtgcct ttacgagaaa accgagctgg ggctgggaga 4201 ggacagttat tgacactgat gtgcaatgaa gtgacaagat gagagcagaa tcgtaagagc 4261 tttgaatttg aagtgagttt ttttcccccc ataagttatt tattcctttt ttctgtgtaa 4321 atatatttat tttactgtgg agcgctaaca tctggatcgt aacatgtgca gaatgtatgg 4381 taggaatgta ttctcttgta ggaatgtaaa tctgtattaa aagggggtcc aagccaggcc 4441 cccaggtctt ctcattgtat gcacagtccg cattcatttt tactcttctc taatatgggt 4501 ctatttgaaa tatgcaaaag gtatgaggaa tgttttaata cctccaaatt tttaagaaaa 4561 gcatcaaagg gttgatattt tttaaagttt ttttagtagc actttctctg gatgacagaa 4621 ggggcaacca catgggcacc cttgttcata ccaaagggtg agcagtggcc agagcctcct 4681 ctgcacctct cgagtgtctt taccaattga gctttttatc gccatagccc cttggagtgc 4741 cccagctgcc ctgaggtcaa tcaaggaaaa tttcttaatg aaataagctc caaagagcca 4801 aagtatcaac ttacagatcg tttttaaagc ttaaatttat gaaccacctt tgtggtaaac 4861 aatgaattat gaataccgca gggcagcctt cttaaatgac aaatgtaaaa aaaaaaaaaa 4921 aagactctac ttcgtgcagc aattgctact ctatacgaat tgtcttaatt tgaaaacctt 4981 gctgttacaa attggacctt tatacatttt ctgaaaacaa tgaaaagagt atatttaacc 5041 ttttctggct gtaaatggtt accttcctgt aactgccccg cacctggagg catggagttg 5101 tgtgcatcct gcttatgtac aattgttttc agtgtttcta agaatgagtc tgaatggttc 5161 ttgaaaatta gccaggatca aatgctattg cagacaaagc caataaaaag ttggacttct 5221 tttggggata acaagttttg gaagagaaat gcaggccata tgtgcgcatg accgagattt 5281 tgaaaaaaga tgtacatagt gacatgtttg gtgcatggtt tttgaggagg gcttttgtca 5341 aaaaggaggt ataacctttc ccccacagac ctgagagctg tgccttttct atgcaatatt 5401 acagacgtta catcggaacc cagatggctg tattcacatg taggtttggg ctgtaatcta 5461 aacaattgga cagattaaat gtacatggaa atgagcagtc ttacttttgt agttttatat 5521 tatacaataa acagttaaaa gatg // LOCUS D50919 4453 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0129 gene, complete cds. ACCESSION D50919 NID g1469180 KEYWORDS KIAA0129. SOURCE Homo sapiens male myeloblast cell_line:KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4453) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (06-JUN-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 4453) AUTHORS Nomura,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. IV. The coding sequences of 40 new genes (KIAA0121-KIAA0160) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (4), 167-174 (1995) MEDLINE 96127530 FEATURES Location/Qualifiers source 1..4453 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..10 gene 11..1231 /gene="KIAA0129" CDS 11..1231 /gene="KIAA0129" /note="The KIAA0129 gene product is novel." /citation=[3] /codon_start=1 /db_xref="PID:d1010120" /db_xref="PID:g1469181" /translation="MAGAATGSRTPGRSELVEGCGWRCPEHGDRVAELFCRRCRRCVC ALCPVLGAHRGHPVGLALEAAVHVQKLSQECLKQLAIKKQQHIDNITQIEDATEKLKA NAESSKTWLKGKFTELRLLLDEEEALAKKFIDKNTQLTLQVYREQADSCREQLDIMND LSNRVWSISQEPDPVQRLQAYTATEQEMQQQMSLGELCHPVPLSFEPVKSFFKGLVEA VESTLQTPLDIRLKESINCQLSDPSSTKPGTLLKTSPSPERSLLLKYARTPTLDPDTM HARLRLSADRLTVRCGLLGSLGPVPVLRFDALWQVLARDCFATGRHYWEVDVQEAGAG WWVGAAYASLRRRGASAAARLGCNRQSWCLKRYDLEYWAFHDGQRSACGPATTSTGSA SSWTTRPASSPSTT" 3'UTR 1232..4453 BASE COUNT 1068 a 1251 c 1111 g 1023 t ORIGIN 1 gtggagatga atggcgggcg cggcgaccgg gagccggacc cctgggaggt cggagcttgt 61 cgagggatgc ggctggcgct gcccggagca tggcgaccgc gtggctgagc tcttctgtcg 121 ccgctgccgc cgctgcgtgt gcgcgctttg cccggtgctg ggcgcgcacc gtggccaccc 181 tgtgggcctg gcgctggagg cagcggtgca cgtgcagaaa ctcagccaag aatgtttaaa 241 gcagctggca atcaagaagc agcagcacat tgacaacata acccagatag aagatgccac 301 cgagaagctc aaggctaatg cagagtcaag taaaacctgg ctgaagggga aattcactga 361 actcagatta ctacttgacg aagaggaagc gctggccaag aaattcattg ataaaaacac 421 gcagcttacc ctccaggtgt acagggaaca agctgactct tgcagagagc aacttgacat 481 catgaatgat ctctccaaca gggtctggag tatcagccag gagcccgatc ctgtccagag 541 gcttcaggca tacacggcca ccgagcagga gatgcagcag cagatgagcc tcggggagct 601 gtgccatccc gtgcccctct cttttgagcc cgtcaagagc ttctttaagg gcctcgtgga 661 agccgtggag agtacattac agacgccatt ggacattcgc cttaaggaaa gcataaactg 721 ccagctctca gacccttcca gcaccaagcc aggtaccttg ttgaaaacca gcccctcacc 781 agagcgatcg ctattgctga aatacgcgcg cacgcccacg ctggatcctg acacgatgca 841 cgcgcgcctg cgcctgtccg ccgatcgcct gacggtgcgc tgcggcctgc tgggcagcct 901 ggggcccgtg cccgtgctgc ggttcgacgc gctctggcaa gtgctggctc gtgactgctt 961 cgccaccggc cgccactact gggaggttga cgtgcaggag gcgggcgccg gctggtgggt 1021 gggcgcggcc tacgcctccc ttcggcgccg cggggcctcg gccgccgccc gcctgggctg 1081 caaccgccag tcctggtgcc tcaagcgcta cgaccttgag tactgggcct tccacgacgg 1141 ccagcgcagc gcctgcggcc ccgcgacgac ctcgaccggc tcggcgtctt cctggactac 1201 gaggccggcg tcctcgcctt ctacgacgtg acgggcggca tgagccacct gcataccttc 1261 cgcgccacgt tccaggagcc gctctacccg gccctgcggc tctgggaggg ggccatcagc 1321 atcccccggc tgccctaggg gccaggaccg gcgtgacagc ctccaggtac gccgcagctg 1381 cccagtctcg cctaatctac ctagatcagc gtggctggtc cccttactgc ctgcttctta 1441 gggccctctc cctgccccag ctttccccga ccaatcacgc ctacagtgct ttgaaggttt 1501 cctctcctag gctagtttca aacaggccct aaacaagtct gctgctgccc tctcatcaga 1561 cctccgcacc ctcaccccac catcacttaa actactttaa tccagttcct tcaaagtgat 1621 acccccacag gtaagccctc agcatcctga atacatcatc cgcagcctgg gaaccttctc 1681 cctcgtacag cacaggaacc tgacacatag taggcacaca gtaaacgttt gtgaatgaat 1741 gggagtcatc cagtcctgac tcttctgtct cttgaggtcc cttgaatctt ccgcttcctc 1801 cccaccgatt tcagcgtgtc cacatcacag ctccctccag aagctgcaag agcttcttag 1861 cagttcctgg tctgaaccct ctcccagtcc tcatcttcca ccctaaaact agagtgatct 1921 tcctaaaact tcacttaacc cctcagctat gaaaaggctt ccaggagttt ccatgaaata 1981 acaaaaaaaa atacaagcgc ctcaccttag cattcaaggc ttgtctagtc tgcccaaaat 2041 tacttatcct cacctagctc ctaccactct tcttagagac tctccagtca gaaatgtgtc 2101 gcatagttcc acctccacac ctctctgctg ccagcacatt catgcagaaa agtcttttca 2161 cctgtctcag tcttccgcag gcttacctgc gccaggaagt ctaaccaagg aacaagaatc 2221 tcactatcag agccacaaat ctgggacctg tctttccaac taaattggag gctttggaag 2281 ggcagctttg tcctatactc tctccaccct gaaaagttcc cagaaagccc cttccctccc 2341 aagcagtgaa ttaataacca gcaggtgcct atcactgagt aacagaagag ctgagttagg 2401 cgggcctcac aggtcaccca gccagatctc atctggggag tctgaggtcc tgaaagaaag 2461 cagagctgcg taaagttacc ccggggtgat ataggggggc cagacgtgtg ccccgttcac 2521 ccacccccag gcaagcatct gacctgtccc ctggcccagc ccttaggccc agctttcaac 2581 ctgcttactc atttctcagg ggattttggg aaggaatcag caggtgacag ttgctaagca 2641 accaaagggc gtggtgtttc ccaactgtct tgggacaaaa agggtaagag cacccttaga 2701 tccagatgtt gccaaggaaa cccagagtgc ccagctgtct ggaatgaagt gacagaggta 2761 gaaaacagta ggccctcaca ggaggccacc ttcgctagca gggagtggga ggcttctttc 2821 cagggatttg tgtctccgtt ctgaaggttc tgcgtcctgt tttgtcaatt ccctagacgg 2881 ttttgaagtt atattctgtt aaagcatctt cataggtgct tggtgggagg ccaaggtcgc 2941 cgaatcctcg tggtttaaat agactttcag tgcattagtt tgaaccatat aaaactgaca 3001 attttcaata gtttttgagt taaaaatggc acttttgata tgagacaatg tagcagaata 3061 ccaggcagac agaaccttgc aaacacctta acttctaacc aaagacttta aaactctggc 3121 tggacagagt tttaagcact atgctgcagg aatcctgaga aaaaggggaa attaattcta 3181 ttaggaatgg cccaaactga attgtgacag gcagagggtg ttcctgacag agggaagatg 3241 aatacacttg accccaacat ttctgcccat ccctgatgac caagacctct tcccagaccc 3301 acagctgcag gggccaagta ataacagctc ttaatagttt ataatgcact gttttaatgc 3361 tttacgacta aactcatctg atcctttcat cagccctaga gggtagaaag ttttcccaat 3421 tctacacatg gcaaaatgga gacccagagt cacttgccca aggtcgcaca gctagtggtg 3481 gagctggagt ctgggcccag gctgtgagtt ccaggtctgt gctcatggcc accaggccat 3541 actgctcagg gtgaatgcag ctggtctctg gccagtgcct ggtgctctgg cccctctcgt 3601 ggagctactg cccatgatgc tttcctagtg cctggttact ctgcgagact tgagtctacc 3661 tttggactgc cttccttggg ggtctgagat gaggccttat ggcccagagg ggaacttgat 3721 tcaaaaattt gggattcatg tagcagagac agagctagac tgtaaaggtc acaaactagc 3781 tgtgtcaagg cccgtgatag ccagaaagcg gcagtttcag tccatatcaa ttgtgtgacc 3841 agggctagtc actttttact tctcagtgcc atctataaaa tggggataat agcactacct 3901 accaagtgct gtgaggctca aatgagccaa aggttataaa cttgccttaa aactatagtc 3961 ctatacaaaa attagctggg cgtggtggtg catgcctgta atcccagcta ctagggaggc 4021 tgaggcaaga gagttgcttg aacccaggag gcggagattg cagtgagctg agattgcacc 4081 actgcactcc agcctgggga cagagcaaga ctcttgtctc aaaaaaaaca aaaaaaacat 4141 atatactgct gtatcatgcc aggatttatc agcattccca agggagcttg cacggtactg 4201 accgagtgct gagactactg gtattcccag ctgccatgtg gcagcagcag gagctactag 4261 aatattctca gcacaggaat gaggcttcct tggtttccat gtctgtaagg gttactgatc 4321 acttaccttc ttctctttca gacttgaatc tgtagacatt tctttattga tatggcaaat 4381 tgcttgcaga tatttttaaa tgacagcaat tttctaatat ttggtttaat aaaatgtgaa 4441 taatgtccct ttt // LOCUS D50920 3468 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0130 gene, complete cds. ACCESSION D50920 NID g1469182 KEYWORDS KIAA0130. SOURCE Homo sapiens male myeloblast cell_line:KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3468) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (06-JUN-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 3468) AUTHORS Nomura,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. IV. The coding sequences of 40 new genes (KIAA0121-KIAA0160) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (4), 167-174 (1995) MEDLINE 96127530 FEATURES Location/Qualifiers source 1..3468 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..73 gene 74..3043 /gene="KIAA0130" CDS 74..3043 /gene="KIAA0130" /note="The KIAA0130 gene is related to mouse genetic suppressor element (911GSE)." /citation=[3] /codon_start=1 /db_xref="PID:d1010121" /db_xref="PID:g1469183" /translation="MKVVNLKQAILQAWKERWSYYQWAINMKKFFPKGATWDILNLAD ALLEQAMIGPSPNPLILSYLKYAISSQMVSYSSVLTAISKFDDFSRDLCVQALLDIMD MFCDRLSCHGKAEECIGLCRALLSALHWLLRCTAASAERLREGLEAGTPAAGEKQLAM CLQRLEKTLSSTKNRALLHIAKLEEASSWTAIEHSLLKLGEILTNLSNPQLRSQAEQC GTLIRSIPTMLSVHAEQMHKTGFPTVHAVILLEGTMNLTGETQSLVEQLTMVKRMQHI PTPLFVLEIWKACFVGLIESPEGTEELKWTAFTFLKIPQVLVKLKKYSHGDKDFTEDV NCAFEFLLKLTPLLDKADQRCNCDCTNFLLQECGKQGLLSEASVNNLMAKRKADREHA PQQKSGENANIQPNIQLILRAEPTVTNILKTMDADHSKSPEGLLGVLGHMLSGKSLDL LLAAAAATGKLKSFARKFINLNEFTTYGSEESTKPASVRALLFDISFLMLCHVAQTYG SEVILSESRTGAEVPFFETWMQTCMPEEGKILNPDHPCFRPDSTKVESLVALLNNSSE MKLVQMKWHEACLSISAAILEILNAWENGVLAFESIQKITDNIKGKVCSLAVCAVAWL VAHVRMLGLDEREKSLQMIRQLAGPLFSENTLQFYNERVVIMNSILERMCADVLQQTA TQIKFPSTGVDTMPYWNLLPPKRPIKEVLTDIFAKVLEKGWVDSRSIHIFDTLLHMGG VYWFCNNLIKELLKETRKEHTLRAVELLYSIFCLDMQQVTLVLLGHILPGLLTDSSKW HSLMDPPGTALAKLAVWCALSSYSSHKGQASTRQKKRHREDIEDYISLFPLDDVQPSK LMRLLSSNEDDANILSSPTDRSMSSSLSASQLHTVNMRDPLNRVLANLFLLISSILGS RTAGPHTQFVQWFMEECVDCLEQGGRGSVLQFMPFTTVSELVKVSAMSSPKVVLAITD LSLPLGRQVAAKAIAAL" 3'UTR 3044..3468 BASE COUNT 735 a 1032 c 956 g 745 t ORIGIN 1 aatggcgatg cctaccacct agaactggat tgtgcgctgg ccgccaccgc tgccacctgc 61 tcagagtgaa ataatgaagg tggtcaacct gaagcaagcc attttgcaag cctggaagga 121 gcgctggagt tactaccaat gggcaatcaa catgaagaaa ttctttccta aaggagccac 181 ctgggatatt ctcaacctgg cagatgcgtt actagagcag gccatgattg gaccatcccc 241 caatcctctc atcttgtcct acctgaagta tgccattagt tcccagatgg tgtcctactc 301 ttctgtcctc acagccatca gtaagtttga tgacttttct cgggacctgt gtgtccaggc 361 attgctggac atcatggaca tgttttgtga ccgtctgagc tgtcacggca aagcagagga 421 atgcatcgga ctgtgccgag cccttcttag cgccctccac tggctgctgc gctgcacggc 481 agcctctgca gagcggctgc gggaggggct ggaggccggc actccagccg ctggggagaa 541 gcagcttgcc atgtgccttc agcgcctgga gaaaaccctc agcagcacca agaaccgggc 601 cctgctgcac atcgccaaac tagaggaggc ctcttcttgg actgccatcg agcattctct 661 cttgaaactt ggagagatcc tgaccaatct cagcaacccg cagctccgga gtcaggccga 721 gcagtgtggc accctcatta ggagcatccc cacgatgctg tctgtgcatg cggagcagat 781 gcacaagacc ggcttcccca ctgtccacgc cgtgatcctg ctcgagggca ccatgaacct 841 gacaggcgag acgcagtccc tggtggagca gctgacgatg gtgaagcgca tgcagcatat 901 ccccacccca ctttttgtcc tggagatctg gaaagcttgc ttcgtggggc tcattgagtc 961 tcccgagggt acggaggagc tcaagtggac agctttcact ttcctcaaga ttccacaggt 1021 tttggtgaag ttgaagaagt actctcatgg agacaaggac ttcactgagg atgtcaactg 1081 tgcttttgag ttcctgctga agctcacccc cttgttggac aaagctgacc agcgctgcaa 1141 ctgtgactgt acaaacttcc tgctccaaga atgtggcaag caggggcttc tgtctgaggc 1201 cagcgtcaac aaccttatgg ctaagcgcaa agcggaccga gagcacgcac cccagcagaa 1261 atcgggagag aatgccaaca tccagcccaa catccagctg atcctccggg cggagcccac 1321 tgtcacaaac atcctcaaga cgatggatgc agaccactct aagtcaccgg agggactgct 1381 gggagtcctg ggccacatgc tgtccgggaa gagtctggac ttgctgctgg ctgccgccgc 1441 cgccactgga aagctgaaat ccttcgcccg gaaattcatc aatttgaatg aattcacaac 1501 ctatggcagc gaagaaagca ccaaaccggc ctccgtccgg gccctgctgt ttgacatctc 1561 cttcctcatg ctgtgccatg tggcccagac ctatggttca gaggtgattc tgtccgagtc 1621 gcgcacagga gctgaggtgc ccttcttcga gacctggatg cagacctgca tgcctgagga 1681 gggcaagatc ctgaaccctg accacccctg cttccgcccc gactccacca aagtggagtc 1741 cctggtggcc ctgctcaaca actcctcgga gatgaagcta gtgcagatga agtggcatga 1801 ggcctgtctc agcatctcag ccgccatctt ggaaatcctc aatgcctggg agaatggggt 1861 cctggccttc gagtccatcc agaaaatcac tgataacatc aaagggaagg tatgcagtct 1921 ggcggtgtgt gctgtggctt ggcttgtggc ccacgtccgg atgctggggc tggatgagcg 1981 tgagaagtcg ctgcagatga tccgccagct ggcagggcca ctgtttagtg agaacaccct 2041 gcagttctac aatgagaggg tggtgatcat gaactcgatc ctggagcgca tgtgtgccga 2101 cgtgctgcag cagacagcca cgcagatcaa gtttccctcc accggggtgg acacaatgcc 2161 ctactggaac ctgctgcccc ccaagcggcc catcaaagag gtgctgacgg acatttttgc 2221 caaggtgctg gagaagggct gggtggacag ccgctccatc cacatctttg acaccctgct 2281 gcacatgggc ggcgtctact ggttctgcaa caacctgatt aaggagctgc tgaaggagac 2341 gcggaaggag cacacgctgc gggcagtgga gctgctctac tccatcttct gcctggacat 2401 gcagcaagtg accctggtcc tgctgggcca catcctacct ggcctgctca ctgactcctc 2461 caagtggcac agcctcatgg accccccggg cactgctctt gccaagctgg ccgtgtggtg 2521 tgccctcagt tcctactcct cccacaaggg acaggcgtcc acccgccaga agaagagaca 2581 ccgcgaagac attgaggatt atatcagcct cttccccctg gacgatgtgc agccttcgaa 2641 gttgatgcga ctgctgagct ctaatgagga cgatgccaac atcctttcga gccccacaga 2701 ccgatccatg agcagctccc tctcagcctc tcagctccac acggtcaaca tgcgggaccc 2761 tctgaaccga gtcctggcca acctgttcct gctcatctcc tccatcctgg ggtctcgcac 2821 cgctggcccc cacacccagt tcgtgcagtg gttcatggag gagtgtgtgg actgcctgga 2881 gcagggtggc cgtggcagcg tcctgcagtt catgcccttc accaccgtgt cggaactggt 2941 gaaggtgtca gccatgtcca gccccaaggt ggttctggcc atcacggacc tcagcctgcc 3001 cctgggccgc caggtggctg ctaaagccat tgctgcactc tgaggggctt ggcatggccg 3061 cagtgggggc tggggactgg cgcagcccca ggcgcctcca agggaagcag tgaggaaaga 3121 tgaggcatcg tgcctcacat ccgctccaca tggtgcaaga gcctctagcg gcttccagtt 3181 ccccgctcct gactcctgac ctccaggatg tctcccggtt tcttctttca aaatttcctc 3241 tccatctgct ggcacctgag gagagtgagc agcctggacc acaagcccag tggtcacccc 3301 tgtgtgcgcc cgccccagcc caggagtagt cttacctctg aggaactttc tagatgcaaa 3361 gtgtgtatgt gtgtgtgtgt gtgtgtgtgt gtgtttgtgt gtattttgta atatgtgagg 3421 gaaatctacc ttcgttcatg tataaataaa gctcctcgtg gctccctt // LOCUS D50922 2513 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0132 gene, complete cds. ACCESSION D50922 NID g1469186 KEYWORDS KIAA0132. SOURCE Homo sapiens male myeloblast cell_line:KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2513) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (06-JUN-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 2513) AUTHORS Nomura,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. IV. The coding sequences of 40 new genes (KIAA0121-KIAA0160) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (4), 167-174 (1995) MEDLINE 96127530 FEATURES Location/Qualifiers source 1..2513 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..112 gene 113..1987 /gene="KIAA0132" CDS 113..1987 /gene="KIAA0132" /note="The KIAA0132 gene product is related to Drosophila melanogaster ring canel protein." /citation=[3] /codon_start=1 /db_xref="PID:d1010123" /db_xref="PID:g1469187" /translation="MQPDPRPSGAGACCRFLPLQSQCPEGAGDAVMYASTECKAEVTP SQHGNRTFSYTLEDHTKQAFGIMNELRLSQQLCDVTLQVKYQDAPAAQFMAHKVVLAS SSPVFKAMFTNGLREQGMEVVSIEGIHPKVMERLIEFAYTASISMGEKCVLHVMNGAV MYQIDSVVRACSDFLVQQLDPSNAIGIANFAEQIGCVELHQRAREYIYMHFGEVAKQE EFFNLSHCQLVTLISRDDLNVRCESEVFHACINWVKYDCEQRRFYVQALLRAVRCHSL TPNFLQMQLQKCEILQSDSRCKDYLVKIFEELTLHKPTQVMPCRAPKVGRLIYTAGGY FRQSLSYLEAYNPSNGTWLRLADLQVPRSGLAGCVVGGLLYAVGGRNNSPDGNTDSSA LDCYNPMTNQWSPCAPMSVPRNRIGVGVIDGHIYAVGGSHGCIHHNSVERYEPERDEW HLVAPMLTRRIGVGVAVLNRLLYAVGGFDGTNRLNSAECYYPERNEWRMITAMNTIRS GAGVCVLHNCIYAAGGYDGQDQLNSVERYDVETETWTFVAPMKHRRSALGITVHQGRI YVLGGYDGHTFLDSVECYDPDTDTWSEVTRMTSGRSGVGVAVTMEPCRKQIDQQNCTC " 3'UTR 1988..2513 BASE COUNT 528 a 721 c 790 g 474 t ORIGIN 1 cgcgcagcga tggaggcgcc ggggctcggg cggtggaggc ggagccggag cgcggccatg 61 gcggggtccc tgagtgccag aggtggtggt gttgcttatc ttctggaacc ccatgcagcc 121 agatcccagg cctagcgggg ctggggcctg ctgccgattc ctgcccctgc agtcacagtg 181 ccctgagggg gcaggggacg cggtgatgta cgcctccact gagtgcaagg cggaggtgac 241 gccctcccag catggcaacc gcaccttcag ctacaccctg gaggatcata ccaagcaggc 301 ctttggcatc atgaacgagc tgcggctcag ccagcagctg tgtgacgtca cactgcaggt 361 caagtaccag gatgcaccgg ccgcccagtt catggcccac aaggtggtgc tggcctcatc 421 cagccctgtt ttcaaggcca tgttcaccaa cgggctgcgg gagcagggca tggaggtggt 481 gtccattgag ggtatccacc ccaaggtcat ggagcgcctc attgaattcg cctacacggc 541 ctccatctcc atgggcgaga agtgtgtcct ccacgtcatg aacggcgctg tcatgtacca 601 gatcgacagc gttgtccgtg cctgcagtga cttcctggtg cagcagctgg accccagcaa 661 tgccatcggc atcgccaact tcgctgagca gattggctgt gtggagttgc accagcgtgc 721 ccgggagtac atctacatgc attttgggga ggtggccaag caagaggagt tcttcaacct 781 gtcccactgc caactggtga ccctcatcag ccgggacgac ctgaacgtgc gctgcgagtc 841 cgaggtcttc cacgcctgca tcaactgggt caagtacgac tgcgaacagc gacggttcta 901 cgtccaggcg ctgctgcggg ccgtgcgctg ccactcgttg acgccgaact tcctgcagat 961 gcagctgcag aagtgcgaga tcctgcagtc cgactcccgc tgcaaggact acctggtcaa 1021 gatcttcgag gagctcaccc tgcacaagcc cacgcaggtg atgccctgcc gggcgcccaa 1081 ggtgggccgc ctgatctaca ccgcgggcgg ctacttccga cagtcgctca gctacctgga 1141 ggcttacaac cccagtaacg gcacctggct ccggttggcg gacctgcagg tgccgcggag 1201 cggcctggcc ggctgcgtgg tgggcgggct gttgtacgcc gtgggcggca ggaacaactc 1261 gcccgacggc aacaccgact ccagcgccct ggactgttac aaccccatga ccaatcagtg 1321 gtcgccctgc gcccccatga gcgtgccccg taaccgcatc ggggtggggg tcatcgatgg 1381 ccacatctat gccgtcggcg gctcccacgg ctgcatccac cacaacagtg tggagaggta 1441 tgagccagag cgggatgagt ggcacttggt ggccccaatg ctgacacgaa ggatcggggt 1501 gggcgtggct gtcctcaatc gtctgcttta tgccgtgggg ggctttgacg ggacaaaccg 1561 ccttaattca gctgagtgtt actacccaga gaggaacgag tggcgaatga tcacagcaat 1621 gaacaccatc cgaagcgggg caggcgtctg cgtcctgcac aactgtatct atgctgctgg 1681 gggctatgat ggtcaggacc agctgaacag cgtggagcgc tacgatgtgg aaacagagac 1741 gtggactttc gtagccccca tgaagcaccg gcgaagtgcc ctggggatca ctgtccacca 1801 ggggagaatc tacgtccttg gaggctatga tggtcacacg ttcctggaca gtgtggagtg 1861 ttacgaccca gatacagaca cctggagcga ggtgacccga atgacatcgg gccggagtgg 1921 ggtgggcgtg gctgtcacca tggagccctg ccggaagcag attgaccagc agaactgtac 1981 ctgttgaggc acttttgttt cttgggcaaa aatacagtcc aatggggagt atcattgttt 2041 ttgtacaaaa accgggacta aaagaaaaga cagcactgca aataacccat cttccgggaa 2101 gggaggccag gatgcctcag tgttaaaatg acatctcaaa agaagtccaa agcgggaatc 2161 atgtgcccct cagcggagcc ccgggagtgt ccaagacagc ctggctggga aagggggtgt 2221 ggaaagagca ggcttccagg agagaggccc ccaaaccctc tggccgggta ataggcctgg 2281 gtcccactca cccatgccgg cagctgtcac catgtgattt attcttggat acctgggagg 2341 gggccaatgg gggcctcagg gggaggcccc ctctggaaat gtggttccca gggatgggcc 2401 tgtacataga agccaccgga tggcacttcc ccaccggatg gacagttatt ttgttgataa 2461 gtaaccctgt aattttccaa ggaaaataaa gaacagacta actagtgtct ttc // LOCUS D50923 5613 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0133 gene, complete cds. ACCESSION D50923 NID g1469188 KEYWORDS KIAA0133. SOURCE Homo sapiens male myeloblast cell_line:KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5613) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (06-JUN-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 5613) AUTHORS Nomura,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. IV. The coding sequences of 40 new genes (KIAA0121-KIAA0160) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (4), 167-174 (1995) MEDLINE 96127530 FEATURES Location/Qualifiers source 1..5613 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..136 gene 137..4711 /gene="KIAA0133" CDS 137..4711 /gene="KIAA0133" /note="The KIAA0133 gene product is novel." /citation=[3] /codon_start=1 /db_xref="PID:d1010124" /db_xref="PID:g1469189" /translation="MAAVYSGISLKLKSKTTSWEDKLKLAHFAWISHQCFLPNKEQVL LDWARQSLVAFYKKKLELKEDIVERLWIYIDNILHSRKLQNLLKNGKTINLQISLVKI INERVAEFSLSGSQRNICAVLRCCQGILSTPALAVIYTAKQELMVALLSQLCWSACRQ PEGAVVAQLFEVIHLALGHYLLILQQQVNPRRAFGDVTAHLLQPCLVLRHLLSGGTWT QAGQGQLRQVLSRDIRSQIEAMFRGGIFQPELLSSYKEGLLDQQQGDVKTGAMKNLLA PMDTVLNRLVDAGYCAASLHTSVVANSVALLYKLFLDSYFKEGNQLLCFQVLPRLFGC LKISHLQEEQSKALSTSDWTTELLVVEQLLNSVANNNIYNIAADRIRHEEAQFRFYRH VAELLINHAQAPIPAWFRCLKTLISLNHLILEPDLDDLLASAWIDAEVTEFRTKKAQE ALIRTVFQTYAKLRQVPRLFEEVLGVICRPAAEALRQPVLASGPSTVLSACLLELPPS QILDTWSLVLEKFQSLVLPYLQSDADMALKLLSLSLLLHCIMFNMRSLDSSTPLPIVR RTQCMMERMMRELVQPLLALLPDTPGPEPELWLQKVSDSVLLLSYTWAQVDAMFSLNC SQYHSMSGPLIGVALEISNLPSLLPGVKTQHWKKIEKFTAQFSSLGTYCLEQLYLQKM KRTLMQTSFRSEGAIQSLRCDAAFIIGSGRKSLNQRTTASWDGQVGMVSGLTYPVAHW HLIVSNLTILISYLCPDDVGYLASVLLRTLPMGKAQEVSIDEEAYITLEKISKAFLHS PLFPEMQSLHSAFLTCVTTSCSSILCSGAQRDSGLVSQQLPWLFEKDHMVVGHWENRF AKAGPEGIEPRGEIAQNLLSLVKSDFPIQLEGEQLESILGLLEVISALQLDSLLPPYH VHYFLVLLSMAVTKLGCSCSSSLALKFLTTCYQLLGYLQKGKSARSVFKIMYGSDIFE VVLTSLFRASSRFLIEMDDPAWLEFLQVIGTFLEELMQMLIQMKLSLVLNFRKITAFL SSSKPYTEAASSKQLENQNPQGRQLLLVSLTRLCHVLGPFLKEQKLGQEAPAALSELL QQVVLQTGAVLQLCSVPGARGWRLPSVLISSVSTLLEADLGQHCRDGGADISQGSDRT LLSHVALYQGVYSQILLELPALAGHDQSFQAALQFLTLFFLAPELHPKKDSVFTSMFH SVRRVLADPEIPVQVTQDIEPHLGALFTQMLEVGTTEDLRLVMQCILQGLDVSNMWKA DVQAVVSAVTLLRLLLNCPLSGEKASLLWRACPQIVTALTLLNREASQEQPVSLTVVG PVLDVLAALLRQGEEAIGNPHHVSLAFSILLTVPLDHLKPLEYGSVFPRLHNVLFSIL QCHPKVMLKAIPSFLNSFNRLVFSVMREGRQKDKGSIDDLPTVLKCARLVERMYSHIA ARAEEFAVFSPFMVAQYVLEVQKVTLYPAVKSLLQEGIYLILDLCIEPDVQFLRASLQ PGMRDIFKELYNDYLKYHKAKHEGEKRYTA" 3'UTR 4712..5613 BASE COUNT 1322 a 1364 c 1409 g 1518 t ORIGIN 1 ggacgcggga cccgtacagc ggcctccgcc gcaccgggac agcagccgcc gccgctgccg 61 ccgtcctccc ctgtctaccc ggagctgtct cgagctgagc cccctaccgg gccggatccc 121 gagataaagc ctagccatgg ctgctgttta ttctggcatt tcccttaagc ttaaaagcaa 181 gacaacttcc tgggaagata aactaaaact agctcacttt gcttggattt ctcaccagtg 241 ctttcttcca aataaagaac aagtgttact tgattgggca agacaatcat tggttgcatt 301 ttataagaaa aagcttgaac tgaaggaaga tattgttgaa aggctttgga tctatataga 361 taacatttta catagcagaa aattgcagaa tctcctcaag aatggaaaga ccattaatct 421 tcagatttcc ctagtcaaga tcatcaatga gagagtagct gagttctctc tttcgggatc 481 ccaaagaaac atctgtgctg tccttcgatg ttgccagggc atcctgtcga cacctgccct 541 ggctgtcatc tacacggcca aacaggagct gatggtggcc ttgctgagcc agctttgctg 601 gtcggcctgc aggcagcccg aaggagctgt ggtagcccag ttgtttgagg tcattcacct 661 ggcccttggc cattatctct tgatcctgca gcagcaggtc aacccaagac gtgcctttgg 721 ggatgtgact gctcacctgc tccagccgtg cctggtcctg aggcacttac tctctggggg 781 cacatggacg caggctggcc agggccagct gaggcaggtg ctgagccggg acatcaggag 841 tcagattgag gccatgttcc gaggagggat ttttcagcct gagctactgt catcctacaa 901 ggaggggctc ttggaccagc agcaagggga tgtgaagacg ggagccatga agaaccttct 961 ggctcccatg gacaccgtgc ttaacaggct ggttgatgct ggctactgtg cagcatccct 1021 tcatacctct gttgtggcca actcagtggc cttgctgtat aagctctttc tagattctta 1081 ctttaaggag ggaaaccagc ttctctgctt ccaggttctc cccaggttgt ttggctgctt 1141 gaagatttca cacctgcagg aggagcagag caaagccctg tccacatcag attggaccac 1201 agagcttttg gttgtggaac agctactaaa ctcagtggcc aacaacaata tctacaacat 1261 cgctgccgac agaattcggc acgaagaggc tcagttccgc ttttaccgcc acgtggctga 1321 gctgctgata aaccatgcac aagcacccat accggcctgg ttccgctgtc tgaagacttt 1381 gatatctctg aatcatttga ttttggagcc agacctggat gacctgctgg cttcagcgtg 1441 gatcgatgcc gaggtaacag agtttcgaac caaaaaagcc caggaggcgc ttattcgtac 1501 tgtcttccag acttatgcca aactccgaca agtgccacgg ttgtttgaag aggttttggg 1561 ggtgatctgt cgtccagctg ctgaggcact gaggcagcct gtgctggcct cgggcccctc 1621 cacggtactc tctgcatgcc tcctggagct gcctccaagt cagatcctgg acacgtggtc 1681 ccttgtgctg gagaagttcc agtctttagt cttgccctat ttgcagagtg atgccgacat 1741 ggccctgaaa ttactgtcac tgagcttgct gctgcactgc atcatgttca acatgaggag 1801 cctggacagc agcacgcctc tgcccattgt cagacggaca cagtgcatga tggagaggat 1861 gatgagggag ctcgtgcagc ccctgctggc ccttctcccg gacaccccag gcccagagcc 1921 agagctgtgg ctgcagaagg tcagtgactc tgtgctcctg ctctcttaca cttgggccca 1981 ggtggacgct atgttcagtt tgaactgtag ccagtatcac tctatgtctg ggccccttat 2041 aggtgttgct ctggagatct cgaacctccc ttcgttgctc ccaggtgtaa aaacacagca 2101 ttggaagaag atagagaagt ttacagctca gttcagctct cttggtacat attgcttaga 2161 acagctgtac ctgcagaaaa tgaaaaggac tttaatgcaa actagtttcc ggtctgaagg 2221 agccatccaa agtttgaggt gcgatgctgc ctttattatt ggttccggca gaaaaagctt 2281 gaatcagaga acgacggctt cctgggatgg ccaagttggg atggtgagtg gactcacata 2341 ccctgtagca cactggcact tgattgtgtc aaatctcaca attttaatat cctatctgtg 2401 tccagatgat gtgggatacc tggccagtgt cctgctgaga actttaccca tgggcaaagc 2461 ccaggaagtc tcaatagatg aagaggcata catcacactg gaaaaaatat ccaaagcctt 2521 ccttcatagc cctctctttc cagagatgca gtcccttcat tctgctttct taacgtgcgt 2581 aaccacaagt tgctccagca ttctgtgttc tggtgcccag cgtgactcag gtcttgtcag 2641 tcagcagctt ccctggcttt ttgaaaagga ccacatggtt gtgggtcatt gggaaaacag 2701 atttgcaaaa gctggacccg aaggtataga acctagagga gaaattgccc agaacttact 2761 gtccctggtc aagagtgact tccctatcca gctggaggga gagcagttgg aaagcatcct 2821 ggggcttttg gaagtgattt ctgccttaca gctggacagc ctcttgccac cctatcatgt 2881 gcattatttt cttgtgttac tgtccatggc cgtcaccaaa ctaggatgct cttgctcctc 2941 ctcactggct ctcaagttct tgacgacttg ctaccaactt cttggttact tgcaaaaggg 3001 gaaaagtgct cgctctgtgt tcaagatcat gtatggtagt gatatttttg aggttgtact 3061 gacctcattg ttcagagcta gtagtaggtt ccttattgag atggatgatc ccgcttggct 3121 ggaattcctc caagtgatag ggacgttctt agaggagcta atgcagatgc tcatccaaat 3181 gaagctgagc ttggtgctca attttagaaa aatcaccgca ttcctctcta gttccaaacc 3241 atacacggag gcagcttcaa gcaaacaatt agaaaatcag aacccccagg gcaggcagct 3301 ccttctggtg tctttaacca ggttgtgcca tgtcctggga cctttcctca aagagcagaa 3361 gctgggccaa gaggccccag cagcactgtc tgagctgctg cagcaggttg tgctgcagac 3421 aggagctgtg ctgcagctct gctcagtgcc gggggcccgg ggctggcgcc ttccctcggt 3481 cctcatctca tccgtcagca cgctcttgga agccgacctg ggtcagcact gcagggatgg 3541 aggggccgac atttcccaag gaagcgacag gacgctgctc tcccatgttg ccctctacca 3601 gggtgtttac tctcagatac tgttggagtt gccagctctc gcgggacatg atcagtcttt 3661 tcaggcagcc ttgcagtttt tgactctgtt ctttttggcc ccagaactgc atcccaaaaa 3721 ggactccgtg tttacctcca tgtttcattc tgtgagaaga gttcttgcag atcctgaaat 3781 tcctgttcag gtcactcagg atattgagcc tcatttggga gccttgttca cccaaatgtt 3841 agaggttggg acgacagagg acttgaggct ggtgatgcag tgtattctcc agggactgga 3901 tgtcagtaac atgtggaaag cagatgtgca ggctgttgtg tcagctgtta cactgctgag 3961 gctgctactg aactgcccac tcagtggaga gaaagcaagt ctgttgtggc gtgcgtgtcc 4021 ccagatagtc acagctttaa cactcttaaa ccgagaagct tctcaggagc agcctgtgtc 4081 cctcacagtg gtcgggcctg tcttagatgt cctggctgca ctgctgcggc agggggagga 4141 ggccatcggc aacccccacc acgtcagcct ggccttcagc atccttctca ctgtcccttt 4201 ggaccatctg aagccgctgg agtatggaag cgtcttcccg aggctgcaca acgtgctctt 4261 ctcaatcctg cagtgtcacc ctaaggtaat gctgaaagcc atcccttctt tcttgaactc 4321 tttcaataga ttggtgtttt cagttatgcg ggaagggcgg cagaaggaca aaggaagcat 4381 agatgacctg cctacggtcc taaagtgtgc acgcctggtt gaaagaatgt acagccacat 4441 cgccgcacga gctgaggagt ttgctgtgtt ttccccattt atggtggccc agtacgtgtt 4501 ggaggtacag aaggtgacct tatatccagc tgtgaaaagt ctgctgcagg agggcattta 4561 cctcatcctg gacctctgca tcgagcctga cgtccagttc ctgcgggcct cgctgcagcc 4621 gggaatgaga gacatcttta aggagctcta taatgactat ctcaagtacc acaaggccaa 4681 acatgaagga gagaaaagat atacggccta aggctatggg acagaagtgc cgccagtgac 4741 actgtccaga ggctttggct gcatggtctg aaagagctgg agaatgaaag acttaagatg 4801 ttctaattcg tagtattggt atacatagaa aatcctttgg ggtttatgta gtatattttg 4861 atgtatttta catcgtgttt ttcttactat tttttaatac atagttttat gcagtaagta 4921 ttgcaataga atcctgaaaa ttgaccctgg gatgagatta attcaataga aaaattgctg 4981 actcttggga cctttctgtg tttggttctc gtcttggctc agtggtttgg tgttccctcg 5041 tctgcactgg aaactacata aaacttggct ttttactttg ggtacatggg cgtataattc 5101 agccctgttt aaatatactt gcctttcaaa ttcttcaagt aacatgggaa gtattcttga 5161 aatgtcacat tttctgcctt ccctctaagt atgctttctg aagaagtcag ggaaagttag 5221 agtctgtggc ctgaggtgtc tgctctgggt ggcgatagtg ggcacctcag gcaggtcggt 5281 gacgtttagc acaggtgcca gggctcctgc ctgctcctcc tgtgttagct ctgtgaagtt 5341 catttaggaa tttttttttc ctatgcagtt taagaaataa tcctaattgt tttttcttat 5401 tacctaagca atatattttt attatagcaa cctcagaaaa gaaaaataaa aggataattt 5461 aaaaaactca ttcatagtct cagttaccca gataacctcg gttgtcacct tggagtatct 5521 tgttgtagtc cctttactat gtgtatgtat atagatgtgc atataaatat atatagtagc 5581 taaattggat cataaatgca tttttttaaa gtt // LOCUS D50924 4345 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0134 gene, complete cds. ACCESSION D50924 NID g1469190 KEYWORDS KIAA0134. SOURCE Homo sapiens male myeloblast cell_line:KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4345) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (06-JUN-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 4345) AUTHORS Nomura,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. IV. The coding sequences of 40 new genes (KIAA0121-KIAA0160) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (4), 167-174 (1995) MEDLINE 96127530 FEATURES Location/Qualifiers source 1..4345 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..286 gene 287..2017 /gene="KIAA0134" CDS 287..2017 /gene="KIAA0134" /note="The KIAA0134 gene product is related to human RNA helicase A." /citation=[3] /codon_start=1 /db_xref="PID:d1010125" /db_xref="PID:g1469191" /translation="MPPPRTREGRDRRDHHRAPSEEEALEKWDWNCPETRRLLEDAFF REEDYIRQGSEECQKFWTFFERLQRFQNLKTSRKEEKDPGQPKHSIPALADLPRTYDP RYRINLSVLGPATRGSQGLGRHLPAERVAEFRRALLHYLDFGQKQAFGRLAKLQRERA ALPIAQYGNRILQTLKEHQVVVVAGDTGCGKSTQVPQYLLAAGFSHVACTQPRRIACI SLAKRVGFESLSQYGSQVGYQIRFESTRSAATKIVFLTVGLLLRQIQREPSLPQYEVL IVDEVHERHLHNDFLLGVLQRLLPTRPDLKVILMSATINISLFSSYFSNAPVVQVPGR LFPITVFDVAPPGVRKCILSTNIAETSVTIDGIRFVVDSGKVKEMSYDPQAKLQRLQE FWISQASAEQRKGRAGRTGPGVCFRLYAESDYDAFAPYPVPEIRRVALDSLVLQMKSM SVGDPRTFPFIEPPPPASLETAILYLRDQGALDSSEALTPIGSLLAQLPVDVVIGKML ILGSMFSLVEPVLTIAAALSVQSPFTRSAQSSPECCTPPASSLAAPRCCTHRSWRPAT ATEAETTRTR" 3'UTR 2018..4345 BASE COUNT 929 a 1314 c 1223 g 879 t ORIGIN 1 cggcgtgtga atccagggcc tgaaaaccca gaatgaactt gtgtccatcc cagagatcac 61 tgcagatgtc atgaggtacc ctttgtgtca ccagctcaag caggcctctg gccactttat 121 catctgtggt ggtcctgtcc cgtgaccagg aggaaaaatt agctctttga agagaaagta 181 gttctctatt gcaggcactg gcctcttaaa ttgttgcagg tggggaatgg atgagaatat 241 ttgttttggg tgatcagaac tgagactcct attgtggatt agtaacatgc ctcctcctag 301 aacaagggag ggcagggatc gccgagacca ccaccgggct cccagcgagg aagaggcctt 361 ggagaaatgg gactggaatt gtccagagac gcgtcgcctc ttggaagatg ccttcttccg 421 tgaagaggat tacatccgtc agggttctga ggaatgtcag aagttttgga ccttctttga 481 acgcctgcag agattccaga atctcaagac ctccaggaag gaggagaaag accctggaca 541 gcccaagcac agcatcccag cgctggccga cctacctcgc acttacgacc cacgttaccg 601 catcaacctc tctgttcttg gccctgccac gcggggctct cagggactgg gcaggcactt 661 gcccgcggag agagtggctg agttccgccg agccctgttg cactacctgg actttggcca 721 gaagcaggca tttgggcgtc tggccaagct gcagcgtgag cgggcagccc tccccatcgc 781 ccagtatggg aaccgcatcc tgcagacgct gaaggagcac caggtggtgg tagtggccgg 841 tgacaccggc tgtggcaagt ccactcaggt gccccagtac ctgctggctg ctggcttcag 901 tcatgtggcg tgcacccagc cccggcggat cgcctgcatc tcactggcca agcgtgtggg 961 ctttgagagc ctcagtcagt atggctcaca ggtcggctac cagatccgct ttgagagcac 1021 acgttcggcg gccaccaaga ttgtattcct gacagtgggg ctgctcctgc gacaaatcca 1081 gcgggaaccc agcctgcccc agtatgaggt cctgattgtg gatgaagtcc atgagcggca 1141 tctccacaac gatttcctcc tgggcgtcct ccagcgcctg ttgcccacgc ggcctgacct 1201 caaggtcatc ctcatgtcgg ccaccatcaa catctcgctc ttctccagct atttcagcaa 1261 tgcccctgtg gtacaggtgc ctgggaggct gttccccatc acggtatttg atgtggcacc 1321 ccctggagtc cggaaatgca tcctctccac caacattgct gagacctcag tcaccattga 1381 cgggatccgc ttcgtagtag attccggaaa ggtgaaggag atgagctacg atccgcaggc 1441 caagctgcaa cggctgcagg agttctggat tagtcaggcc agcgcagagc agcggaaggg 1501 ccgggcgggc cgcacgggcc ccggagtctg cttccgcctc tatgccgaat cggactatga 1561 tgccttcgcc ccctaccccg tcccagaaat tcggagggtg gccctggact cgttggtgct 1621 gcagatgaag agcatgagtg tgggggaccc ccgaaccttc cccttcatcg agcccccacc 1681 accagccagc ctggaaaccg ccatcctcta cctccgggac cagggggccc tggacagctc 1741 agaggccctc acacccattg ggtccctgct agcccagctg cctgtggacg ttgtgattgg 1801 gaagatgctg atcctgggct ccatgttcag cctggtggag cctgtgctca ccatcgcagc 1861 cgcacttagc gtccagtcgc ccttcacccg cagcgcccag agcagcccag agtgctgcac 1921 cccacctgcg tcttcgctgg cagccccgag gtgctgcacg cacaggagct ggaggccagc 1981 aactgcgacg gaagccgaga cgacaaggac aagatgagca gcaaacacca gctcctcagc 2041 ttcgtgtccc tgctggagac caacaagccg tacctggtga actgcgtccg catccctgcc 2101 ctccagtccc tcctgctttt tagccggtct ttggacacca atggtgactg ctcccgcctg 2161 gtggccgatg gctggctgga gctgcagcta gcagacagtg aaagtgccat ccgactcctg 2221 gcggcttccc tgcggctccg tgcccgctgg gaaagtgccc tggaccggca gctggcgcac 2281 caggcccagc agcagctgga ggaggaggag gaggatacgc cagtcagccc caaggaggtg 2341 gccaccctga gcaaggaact cctgcaattc acggcatcca agattcctta cagcctccgg 2401 cggctcacag ggctagaagt ccagaacatg tatgtgggac cccagaccat cccagccacc 2461 ccccatcttc ctggcctctt tggcagctcc accctgtccc cccaccccac aaaggggggc 2521 tacgcagtca ctgacttcct cacctacaac tgcctcacga atgacacaga cctgtacagc 2581 gactgtctcc gaaccttctg gacctgcccc cactgtggcc tgcatgcgcc cctcacgccc 2641 ctggagcgca tcgcccatga gaacacctgc ccccaggccc cacaggatgg gcccccaggg 2701 gctgaggaag ctgccctcga aaccctccag aagacatctg tcctgcagag gccctaccac 2761 tgcgaggcct gcgggaagga cttcctcttt acacccacag aggtgctgcg ccaccggaag 2821 cagcacgtgt gagctgggcc aggagccctg cccacctccg tgcagctgac ctgccctcca 2881 gcccaggact aggggcagga ctcttgcctg aacccccagc ctgggcttag ccctgtggtc 2941 ctgtcccagt gcagagggcc tggagcacgg attgtgaata aagcctcaca tgctgataca 3001 cactgttagg cctgcacctg cccatccaga aagcagcagc tgccttgtta gtcctcccca 3061 gggtctagct ttccttcttc ctgctgcagg gtgctgcctg aggggtcctg ggtaggaggg 3121 gcgttagagc cagcagggac ctcccatgtc tccagattcc aggtgcaggt tcttagcacc 3181 tccgcagccg ctctctcttg agtccatcct cagtctctcc taccccttga agtaggggga 3241 ccctgaattt gcccatccac ctgggtcact ttgagagttg tgcagggggg ctgggagcac 3301 tggtgttcac gtgggaccac aggctgcacc ataagaccca ctcacaataa aaaaataaaa 3361 ggccgagccg ggcatggttg ctcactcccg taataaaaaa ataaaagaac atgctgggat 3421 tacaggcgtg agccactgca cccggccgtg accaacattc ttaatctgtc ctctagtgct 3481 ggccactcta aaacttcaca agaatgtgct gtctgtgtct agcttctttt acccaacatg 3541 aggttgcatg ggcttggttc attcattctg ttccatgaat tcatcacaat atatccattc 3601 ttttgtccag gggcatttgg cttccagaaa cctaccccta acaccagccc ttccctccct 3661 ctgcagaaca aaggccccat cagaggggac gtgctgggtg tcagcgtcag cttctgggag 3721 gaggaaggtc agagctaagg cctgaaggat gatgaggtat caggtgggtg gaggggagac 3781 gctcaggagg tgggaaaggc ggaggtgcac aggcttggtt ccaggtacaa ataacgttac 3841 taggagcaca cagtacctgg attttgatga atacattgta catttctgtc ctgtatgtat 3901 ccagggttat aggacgtgat tataggacac gcatatatgt ttggttttag tggactctta 3961 aaaattgttt tccaggggcc gggcacagtg gcttacgcct gtaatcccag cactttggga 4021 ggcctaggcg ggcagatcac ctgaggtcag gagttcaaga tcagcctgac caacatggag 4081 aaaccccgtc tctattggga ggccgaggcg ggaggatcac aaagtcagga gatcgagacc 4141 atcctggcta acacggtgaa accccgtctc tactaaaaat gcaaaaaaat tggctgggcg 4201 tggtggcggg cgcctgtggt cccagctact cgggaggctg aggcaggaga atggagtgag 4261 cccgggaggc agagcttgca gtgagccaag atcgtgccac tgcactccag cctggttgac 4321 agagcaagac tccgtctcaa aaaag // LOCUS D50928 3233 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0138 gene, complete cds. ACCESSION D50928 NID g1469198 KEYWORDS KIAA0138. SOURCE Homo sapiens male myeloblast cell_line:KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3233) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (06-JUN-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 3233) AUTHORS Nomura,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. IV. The coding sequences of 40 new genes (KIAA0121-KIAA0160) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (4), 167-174 (1995) MEDLINE 96127530 FEATURES Location/Qualifiers source 1..3233 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..36 gene 37..2898 /gene="KIAA0138" CDS 37..2898 /gene="KIAA0138" /note="The KIAA0138 gene product is novel." /citation=[3] /codon_start=1 /db_xref="PID:d1010129" /db_xref="PID:g1469199" /translation="MAETLPGSGDSGPGTASLGPGVAETGTRRLSELRVIDLRAELKK RNLDTGGNKSVLMERLKKAVKEEGQDPDEIGIELEATSKKSAKRCVKGLKMEEEGTED NGLEDDSRDGQEDMEASLENLQNMGMMDMSVLDETEVANSSAPDFGEDGTDGLLDSFC DSKEYVAAQLRQLPAQPPEHAVDGEGFKNTLETSSLNFKVTPDIEESLLEPENEKILD ILGETCKSEPVKEESSELEQPFAQDTSSVGPDRKLAEEEDLFDSAHPEEGDLDLASES TAHAQSSKADSLLAVVKREPAEQPGDGERTDCEPVGLEPAVEQSSAASELAEASSEEL AEAPTEAPSPEARDSKEDGRKFDFDACNEVPPAPKESSTSEGADQKMSSFKEEKDIKP IIKDEKGRVGSGSGRNLWVSGLSSTTRATDLKNLFSKYGKVVGAKVVTNARSPGARCY GFVTMSTSDEATKCISHLHRTELHGRMISVEKAKNEPAGKKLSDRKECEVKKEKLSSV DRHHSVEIKIEKTVIKKEEKIEKKEEKKPEDIKKEEKDQDELKPGPTNRSRVTKSGSR GMERTVVMDKSKGEPVISVKTTSRSKERSSKSQDRKSESKEKRDILSFDKIKEQRERE RQRQREREIRETERRREREQREREQRLEAFHERKEKARLQRERLQLECQRQRLERERM ERERLERERMRVERERRKEQERIHREREELRRQQEQLRYEQERRPGRRPYDLDRRDDA YWPEGKRVAMEDRYRADFPRPDHRFHDFDHRDRGQYQDHAIDRREGSRPMMGDHRDGQ HYGDDRHGHGGPPERHGRDSRDGWGGYGSDKRLSEGRGLPPPPRGGRDWGEHNQRLEE HQARAWQGAMDAGAASREHARWQGGERGLSGPSGPGHMASRGGVAGRGGFAQGGHSQG HVVPGGGLEGGGVASQDRGSRVPHPHPHPPPYPHFTRRY" 3'UTR 2899..3233 BASE COUNT 874 a 763 c 1051 g 545 t ORIGIN 1 tgcgactgag tcggtggcga agacgggaac gcgacgatgg cggagactct gcccgggtcg 61 ggcgactcgg gccctggcac ggcttctctc ggcccgggcg ttgcggagac tgggacgagg 121 cggctcagcg agctgcgggt gatcgatctg cgggcggagc tgaagaagcg gaacctggac 181 acgggcggca acaagagcgt cctgatggag cggctcaaga aggcggttaa agaagagggg 241 caagatcctg atgaaattgg catcgagtta gaagccacca gcaagaagtc agccaagaga 301 tgtgttaaag gactgaagat ggaggaggaa ggcacagaag ataatggcct ggaagacgat 361 tccagagacg ggcaggagga catggaagca agtctggaga acctgcagaa tatgggcatg 421 atggacatga gtgtgctaga cgaaactgaa gtggcgaata gcagtgctcc agattttggg 481 gaggatggca cggacggcct tctcgattcc ttttgtgata gtaaagaata cgtggctgca 541 cagctgagac agctcccggc tcagccccca gagcatgctg tggatgggga aggatttaag 601 aacactttgg aaacttcatc gttgaacttc aaagtaactc cggacattga agaatccctt 661 ttggagccag aaaatgagaa aatactcgac attttggggg aaacttgtaa atctgagcca 721 gtaaaagaag aaagttccga gctggagcag ccatttgcac aggacacaag tagcgtgggg 781 ccagacagaa agcttgcgga ggaagaggac ctatttgaca gcgcccatcc ggaagagggt 841 gatttagatt tggccagcga gtcaacagca cacgctcagt cgagcaaggc agacagcctg 901 ttagcggtag tgaaaaggga gcccgcggag cagccaggcg atggcgagag gacggactgt 961 gagcctgtag ggctagagcc ggcagttgag cagagtagtg cggcctccga gctcgcggag 1021 gcctctagcg aggagctcgc agaagcaccc acggaagccc caagcccaga agccagagat 1081 agcaaagaag acgggaggaa gtttgatttt gacgcttgta atgaagtccc tccggctcct 1141 aaagagtcct caaccagtga gggcgctgat cagaaaatga gctcttttaa ggaagaaaaa 1201 gatataaagc caatcattaa agatgaaaaa ggtcgggtcg gcagcggttc tggtcggaac 1261 ctgtgggtca gcgggctgtc ctccacaaca cgcgctacgg atctcaagaa ccttttcagc 1321 aagtatggga aggttgtcgg ggccaaagtg gtaacgaacg cccgcagccc gggggctcga 1381 tgctatggat tcgtcaccat gtcgacatct gacgaggcga ccaagtgcat cagccatctc 1441 cacagaactg agctgcatgg acgaatgatc tccgtagaga aggccaaaaa tgagcctgct 1501 gggaaaaagc tttccgacag aaaagagtgc gaagtgaaga aggaaaaatt atcgagtgtc 1561 gacagacatc attctgtgga gatcaaaatt gaaaaaactg taattaagaa ggaagagaag 1621 attgagaaga aggaggaaaa aaagcctgaa gacattaaga aggaagaaaa agaccaggat 1681 gagctgaaac ccggacctac aaatcggtct agagtcacca aatcaggaag cagaggaatg 1741 gagcggacgg tcgtgatgga taaatcgaaa ggagagcccg tcattagcgt gaaaaccaca 1801 agcaggtcca aagagagaag ctccaagagt caggatcgca agtcagaaag caaagaaaag 1861 agagacatct tgtcgtttga taaaatcaaa gaacaaaggg agagagagcg ccagaggcag 1921 cgggaacggg agatccgcga aacggagagg cggcgggagc gcgagcagcg ggagcgggag 1981 caacgcctcg aggccttcca tgagcggaag gagaaggccc ggctacagcg ggaacgcctg 2041 cagctcgagt gccagcgcca gcggctggag cgggagcgca tggagcggga gcggctggag 2101 cgcgagcgca tgcgcgtgga gcgtgagcgc aggaaggagc aggagcgcat ccaccgcgag 2161 cgcgaggagc tgcggcgcca gcaggagcag ctgcgttacg agcaggagcg gcggcccggg 2221 cggaggccct acgacctgga ccgacgagat gatgcctatt ggccagaagg aaagcgtgtg 2281 gcaatggagg accgatatcg tgcagacttt ccccggccag accaccgctt tcacgacttc 2341 gatcatcgag accggggcca gtaccaggac cacgccatcg acaggcggga gggttcgagg 2401 ccaatgatgg gagaccaccg ggatgggcag cactatggag atgaccgcca tggccacgga 2461 ggacccccag agcgccacgg ccgggactcc cgtgatggct gggggggcta cggctccgac 2521 aagaggctga gtgaaggccg ggggctgccc cctcccccca ggggtggccg tgactgggga 2581 gagcacaacc agcggctaga ggagcaccag gcacgcgcct ggcagggtgc catggacgca 2641 ggcgcggcta gccgggagca cgccaggtgg caaggtggcg agaggggcct gtctgggccc 2701 tcggggccgg ggcacatggc aagccgcggt ggagtggcgg ggcgaggcgg ctttgcacaa 2761 ggtggacatt cccagggcca cgtggtgcca ggtggcggac tggaaggtgg cggagtggcc 2821 agccaggacc ggggcagcag agtccctcac ccacaccctc atcccccccc gtacccccac 2881 ttcacccgcc gctactaagt cccactcgct gtgagttttc gggtgggcag acgcactgtt 2941 gaatctggta gccagggttc cctcgaactt gggggatctt tttaaaagca aagtaaatcc 3001 tgccaccatg ttgtagctca atacaatgtg aactcacttt tttttttttt tttaataaat 3061 gtgttcttgt tctgccattt ttaaatcaag gtttctgtta acgaggcatt ccattttcca 3121 ttaataaagt ttaccattcg caaaaaaaaa atgtgttctt gttctgccat ttttaaatca 3181 aggtttctgt taacgaggca ttccattttc cattaataaa gtttaccatt cgc // LOCUS D50929 5276 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0139 gene, complete cds. ACCESSION D50929 NID g1469200 KEYWORDS KIAA0139. SOURCE Homo sapiens male myeloblast cell_line:KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5276) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (06-JUN-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 5276) AUTHORS Nomura,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. IV. The coding sequences of 40 new genes (KIAA0121-KIAA0160) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (4), 167-174 (1995) MEDLINE 96127530 FEATURES Location/Qualifiers source 1..5276 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..128 gene 129..4277 /gene="KIAA0139" CDS 129..4277 /gene="KIAA0139" /note="The KIAA0139 gene product is related to mouse centrosomin B." /citation=[3] /codon_start=1 /db_xref="PID:d1010130" /db_xref="PID:g1469201" /translation="MPAYFQRPENALKRANEFLEVGKKQPALDVLYDVMKSKKHRTWQ KIHEPIMLKYLELCVDLRKSHLAKEGLYQYKNICQQVNIKSLEDVVRAYLKMAEEKTE AAKEESQQMVLDIEDLDNIQTPESVLLSAVSGEDTQDRTDRLLLTPWVKFLWESYRQC LDLLRNNSRVERLYHDIAQQAFKFCLQYTRKAEFRKLCDNLRMHLSQIQRHHNQSTAI NLNNPESQSMHLETRLVQLDSAISMELWQEAFKAVEDIHGLFSLSKKPPKPQLMANYY NKVSTVFWKSGNALFHASTLHRLYHLSREMRKNLTQDEMQRMSTRVLLATLSIPITPE RTDIARLLDMDGIIVEKQRRLATLLGLQAPPTRIGLINDMVRFNVLQYVVPEVKDLYN WLEVEFNPLKLCERVTKVLNWVREQPEKEPELQQYVPQLQNNTILRLLQQVSQIYQSI EFSRLTSLVPFVDAFQLERAIVDAARHCDLQVRIDHTSRTLSFGSDLNYATREDAPIG PHLQSMPSEQIRNQLTAMSSVLAKALEVIKPAHILQEKEEQHQLAVTAYLKNSRKEHQ RILARRQTIEERKERLESLNIQREKEELEQREAELQKVRKAEEERLRQEAKEREKERI LQEHEQIKKKTVRERLEQIKKTELGAKAFKDIDIEDLEELDPDFIMAKQVEQLEKEKK ELQERLKNQEKKIDYFERAKRLEEIPLIKSAYEEQRIKDMDLWEQQEEERITTMQLER EKALEHKNRMSRMLEDRDLFVMRLKAARQSVYEEKLKQFEERLAEERHNRLEERKRQR KEERRITYYREKEEEEQRRAEEQMLKEREERERAERAKREEELREYQERVKKLEEVER KKRQRELEIEERERRREEERRLGDSSLSRKDSRWGDRDSEGTWRKGPEADSEWRRGPP EKEWRRGEGRDEDRSHRRDEERPRRLGDDEDREPSLRPDDDRVPRRGMDDDRGPRRGP EEDRFSRRGADDDRPSWRNTDDDRPPRRIADEDRGNWRHADDDRPPRRGLDEDRGSWR TADEDRGPRRGMDDDRGPRRGGADDERSSWRNADDDRGPRRGLDDDRGPRRGMDDDRG PRRGMDDDRGPRRGMDDDRGPRRGLDDDRGPWRNADDDRIPRRGAEDDRGPWRNMDDD RLSRRADDDRFPRRGDDSRPGPWRPLVKPGGWREKEKAREESWGPPRESRPSEEREWD REKERDRDNQDREENDKDPERERDRERDVDREDRFRRPRDEGGWRRGPAEESSSWRDS SRRDDRDRDDRRRERDDRRDLRERRDLRDDRDRRGPPLRSEREEVSSWRRADDRKDDR VEERDPPRRVPPPALSRDRERDRDREREGEKEKASWRAEKDRESLRRTKNETDEDGWT TVRR" 3'UTR 4278..5276 BASE COUNT 1690 a 969 c 1385 g 1232 t ORIGIN 1 ggccggctgg gcgcgggcga ctgctggcga ggcgcgtggg accttacgct ggttcccctt 61 cgtctcctct cccggcccgg gccactagag agttcgctga cgccgggtga gctgagcctg 121 ccgccgagat gccggcctat tttcagaggc cggaaaatgc cctcaaacgc gccaacgaat 181 ttcttgaggt tggcaaaaag cagcctgctc tggatgttct ttatgatgtt atgaaaagta 241 aaaaacatag aacatggcaa aagatacacg aaccaattat gttgaaatac ttggaacttt 301 gcgtggatct tcgcaagagc cacttggcaa aggaggggtt ataccagtat aagaacattt 361 gtcaacaggt gaacataaaa tctctggagg atgttgttag ggcatatttg aaaatggcag 421 aggaaaaaac tgaagctgct aaagaagaat ctcagcagat ggtcttagat atagaggatc 481 tagataatat tcaaactcct gagagtgttc tcctaagtgc tgtaagtggt gaagacactc 541 aggatcgtac tgacagatta cttttaactc catgggttaa attcctgtgg gagtcttaca 601 ggcagtgttt ggaccttctt agaaacaatt ctagagtaga gcgcctgtac catgatattg 661 cccagcaagc tttcaaattc tgcctccaat acacgcgtaa ggctgaattc cgtaaactgt 721 gtgacaattt gagaatgcac ttatcgcaga ttcagcgcca ccataaccaa agtacggcaa 781 tcaatcttaa taatccagag agccagtcca tgcatttgga aaccagactt gttcagctgg 841 acagtgctat cagcatggaa ttgtggcagg aagcattcaa agctgtggaa gatattcacg 901 ggctattctc cttgtctaaa aaaccaccta aacctcagtt gatggcaaat tactataaca 961 aagtctcaac tgtgttttgg aaatctggaa atgctctttt tcatgcatct acactccatc 1021 gtctttacca tctctctaga gaaatgagaa agaatctcac acaagatgag atgcaaagaa 1081 tgtctactag agtcctttta gccactcttt ccatccctat tactcctgag cgtacggata 1141 ttgctcgact tctggatatg gatggcatta tagttgaaaa acagcgtcgc cttgcaacac 1201 tactaggtct tcaagcccca ccgacacgaa ttggccttat taatgatatg gtcagattta 1261 atgtactaca atatgttgtc ccagaagtga aagaccttta caattggctt gaagtagaat 1321 ttaacccatt aaaactctgt gagcgagtca caaaggttct aaattgggtt agggaacaac 1381 ctgaaaagga accggaattg cagcagtatg tgccacaact gcaaaacaac accatcctcc 1441 gccttctgca gcaggtgtca cagatttatc agagcattga gttttctcgt ttgacttctt 1501 tggttccttt tgttgatgct ttccaactgg aacgggccat agtagatgca gccaggcatt 1561 gcgacttgca ggttcgtatt gatcacactt ctcggaccct gagttttgga tctgatttga 1621 attatgctac tcgagaagat gctccgattg gtcctcattt gcaaagcatg ccttcagagc 1681 agataagaaa ccagctgaca gccatgtcct cagtacttgc aaaagcactt gaagtcatta 1741 aaccagctca tatactgcaa gagaaagaag aacagcatca gttggctgtc actgcatacc 1801 ttaaaaattc acgaaaagag caccagcgga tcctggctcg ccgccagaca attgaggaga 1861 gaaaagagcg ccttgagagt ctgaatattc agcgtgagaa agaagaattg gaacagaggg 1921 aagctgaact ccagaaagtg cggaaggctg aggaagagag gctgcgccag gaagcaaagg 1981 agagagagaa ggagcgtatc ttacaggaac atgaacaaat caaaaagaaa actgtccgag 2041 agcgtttgga gcagatcaag aaaacagaac tgggtgccaa agcattcaaa gatattgata 2101 ttgaagacct tgaggaattg gatccagatt ttatcatggc taaacaggtt gaacaactgg 2161 agaaagaaaa gaaagaactt caagaacgcc taaagaatca agaaaagaag attgactatt 2221 ttgaaagagc caaacgtttg gaagaaattc ctttgataaa gagcgcttac gaggaacaga 2281 gaattaaaga catggatctg tgggagcaac aagaggaaga aagaattact acaatgcagc 2341 tagaacgtga aaaggctctt gaacataaga atcgaatgtc acgaatgctt gaagacagag 2401 atttattcgt aatgcgactc aaagctgcac ggcagtctgt ttatgaggaa aaacttaaac 2461 agtttgaaga gcgattagca gaagaaaggc ataatcgatt ggaagaacgg aaaaggcagc 2521 gtaaagaaga acgcaggata acatactata gagaaaaaga agaggaggag cagagaaggg 2581 cagaagaaca aatgctaaaa gagcgggaag agagagagcg cgccgaacga gcaaaacgcg 2641 aggaagagct acgagagtat caggagcggg tgaagaaatt agaagaagtg gaaaggaaaa 2701 aacgccaaag ggagttggaa attgaagaac gagaacggcg tagagaggaa gagagaagac 2761 ttggcgatag ttccctttct agaaaggact ctcgttgggg agatagagat tcagaaggca 2821 cctggagaaa aggacctgaa gcagattctg agtggagaag aggcccgcca gagaaggagt 2881 ggagacgtgg agaagggcga gatgaggaca ggtctcatag aagagatgaa gagcggcccc 2941 ggcgtctggg ggatgatgaa gatagagagc cctctcttag accagacgat gatcgggttc 3001 cccggcgtgg catggatgat gacagaggcc ctagacgtgg tcctgaggaa gataggttct 3061 ctcgtcgtgg ggcagacgat gaccggcctt cctggcgtaa cacagatgat gacaggcctc 3121 ccagacgaat tgccgatgaa gacaggggaa actggcgtca tgcggatgat gacagaccac 3181 ctagacgagg actggatgag gacagaggaa gctggcgaac agctgatgag gacagaggac 3241 caagacgtgg gatggatgat gaccgggggc cgaggcgagg aggcgctgat gatgagcgat 3301 catcctggcg taatgctgat gatgaccggg gtcccaggcg agggttggat gatgatcggg 3361 gtcccaggcg aggcatggat gatgaccggg gtcccaggcg aggcatggat gatgaccggg 3421 gtcccaggcg aggcatggat gatgaccggg gtcccaggcg agggttggat gatgatcgag 3481 gaccttggag gaacgccgat gatgacagaa ttcccaggcg tggtgcagag gatgacaggg 3541 gcccttggag aaacatggat gatgatcgcc tttcaagacg tgctgatgat gatcggtttc 3601 ccagacgggg tgatgactca agacctggtc cttggagacc attagtcaag ccaggtggat 3661 ggagagagaa agaaaaagcc agagaggaga gctggggtcc acctcgagaa tcaaggccat 3721 cagaagaacg tgaatgggac agagaaaaag aaagggacag agataatcaa gatcgggagg 3781 agaatgacaa ggaccctgag agagaaaggg acagagagag agatgtggat cgagaggatc 3841 gcttcagaag acctagggat gaaggtggct ggagaagagg accagctgag gaatcttcaa 3901 gctggagaga ctcaagtcgc cgggacgata gggataggga tgaccgtcgc cgtgagaggg 3961 atgaccggcg tgatctaaga gaaagacgag atctaagaga cgacagggac cgaagaggac 4021 ctccactcag atcagaacgt gaagaagtaa gttcttggag acgtgctgat gacaggaaag 4081 atgaccgggt ggaagagcgg gaccctcctc gtcgagttcc tcccccagct ctttcaagag 4141 accgagaaag agaccgagac cgagaaagag aaggtgaaaa agagaaggcc tcatggagag 4201 ctgagaaaga tagggaatct ctccgtcgta ctaaaaatga gactgatgaa gatggatgga 4261 ccacagtacg acgttaagtc tcaagataat ggatttaaac tggtgtctta aataggtttg 4321 atcacattca aggattatta tacttgtgct tcaaccaatc taaattggat tctttaatgt 4381 tgtttcacca taacacaaaa agcatgaact tgtattaatc ctatataata gattgatcat 4441 gcaccatatc cacaggaggt tggaaaaacc catgccattt tctggaattt aagggtgttg 4501 cattatttca tcaatcattt gttgacaaaa aagaaaaact aaaaaataaa tttaaaatgt 4561 gaaccttcag gtattgagta acacctttat cttggtatag aactgatact ttttttttga 4621 ttttgaaata tctgataata atttggaatg aagtaaggtt ctgttaaaat atatttgaag 4681 accctttaaa gcagtgaatc tgaaacaatt ttcacaccct taagtggttg atacgtacct 4741 attttaggta ttttgaggta tttaccataa actaaattta gaaatttttt agattcactt 4801 gaagtaaaca ttacaaacat tggatacggt ggggttttct ttagatttta cttgagagaa 4861 ggtgagtaca aagcaatttg cagttgttgt aatgacaaga ttactgcgca agtgtgaatc 4921 caaacagtat agcttttaaa ttttaaagca tttggtaaat tatcgctgag tttttttctg 4981 ttgccaatag caaactgctt ttccattaat ggagaattca tgcctttcaa gcattttaaa 5041 tatgacaata tttataaatg tatggtttgg aggaatcgtt taaattctct ttcctaattt 5101 tctttctttt gaagatagat tctttcaaca agtaatttgt agtaatgact gtgttgactt 5161 caattttgga gcgcagtagc tatgttaaag atgaactatt tggtctcatt gaagccaaca 5221 cagaacttgc tgctgtgttt tttcttcagt gataaataaa atacttacag aatttg // LOCUS D50930 5429 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0140 gene, complete cds. ACCESSION D50930 NID g1469202 KEYWORDS KIAA0140. SOURCE Homo sapiens male myeloblast cell_line:KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5429) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (06-JUN-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 5429) AUTHORS Nomura,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. IV. The coding sequences of 40 new genes (KIAA0121-KIAA0160) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (4), 167-174 (1995) MEDLINE 96127530 FEATURES Location/Qualifiers source 1..5429 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..212 gene 213..1481 /gene="KIAA0140" CDS 213..1481 /gene="KIAA0140" /note="The KIAA0140 gene product is novel." /citation=[3] /codon_start=1 /db_xref="PID:d1010131" /db_xref="PID:g1469203" /translation="MVMVLSESLSTRGADSIACGTFSRELHTPKKMSQGPTLFSCGIM ENDRWRDLDRKCPLQIDQPSTSIWECLPEKDSSLWHREAVTACAVTSLIKDLSISDHN GNPSAPPSKRQCRSLSFSDEMSSCRTSWRPLGSKVWTPVEKRRCYSGGSVQRYSNGFS TMQRSSSFSLPSRANVLSSPCDQAGLHHRFGGQPCQGVPGSAPCGQAGDTWSPDLHPV GGGRLDLQRSLSCSHEQFSFVEYCPPSANSTPASTPELARRSSGLSRSRSQPCVLNDK KVGVKRRRPEEVQEQRPSLDLAKMAQNCQTFSSLSCLSAGTEDCGPQSPFARHVSNTR AWTALLSASGPGGRTPAGTPVPEPLPPSFDDHLVCQEDLSCEESDSCALDEDCGRRAE PAAAWRDRGAPGNSLCSLDGELDIEQIEKN" 3'UTR 1482..5429 BASE COUNT 1054 a 1606 c 1623 g 1146 t ORIGIN 1 ggcgcggggt ccgcgcccgg agcgccccgc cgcgcacagg agttgaccac atttggccat 61 ttcccagaag ggccccaccc caagggtgag tggccaatgg ggagctgttt ctgctgacat 121 caattcccca ggaggtactc accccaagtc tgcccaagtg aagatggctg atacccaccc 181 tgggatggag cccagcgcct gaggccctta tcatggtgat ggtcctaagt gaaagcctca 241 gcacccgggg agctgactcc attgcatgtg ggaccttcag ccgtgaactg cacacgccaa 301 agaagatgag tcaaggacct acacttttct cttgtggaat tatggaaaat gacagatggc 361 gagacctgga caggaaatgc cctcttcaga ttgaccaacc gagcaccagc atctgggaat 421 gcctgcctga aaaggacagc tcactatggc accgggaggc agtgaccgcc tgcgctgtga 481 ccagtctgat caaagacctc agcatcagcg accacaacgg gaacccctca gcacccccta 541 gcaagcgcca gtgccgctca ctgtccttct ccgatgagat gtccagttgc cggacatcat 601 ggaggccctt gggctccaaa gtctggactc ccgtggaaaa gagacgctgc tacagcgggg 661 gcagcgtcca gcgctattcc aacggcttca gcaccatgca gaggagttcc agcttcagcc 721 tcccttcccg ggccaacgtg ctctcctcac cctgcgacca ggcaggactc caccaccgat 781 ttggagggca gccctgccaa ggggtgccag gctcagcccc gtgtggacag gcaggtgaca 841 cctggagccc tgacctgcac cccgtgggag gaggccggct ggacctgcag cggtccctct 901 cttgctcaca tgagcagttt tcctttgtgg aatactgtcc tccctcagcc aacagcacac 961 ctgcctcaac accagagctg gcgagacgct ccagcggcct ttcccgcagc cgctcccagc 1021 cgtgtgtcct taacgacaag aaggtcggtg ttaaaaggcg gcgccctgaa gaagtgcaag 1081 agcagaggcc ttctctagac cttgccaaga tggcacagaa ctgtcagacc ttcagcagcc 1141 tcagctgcct gagcgcaggg acagaggact gcggtcccca gagccccttc gcccgccacg 1201 tcagcaacac cagggcctgg accgccctgc tctcagcctc cggcccaggg ggcaggaccc 1261 ccgctgggac cccggtccct gagcctcttc ccccttcctt cgacgaccac ctcgtctgcc 1321 aggaggacct gtcctgtgag gagtcagaca gctgcgccct ggacgaggat tgtggcagga 1381 gagcggagcc ggctgcagcc tggcgggacc gcggggcccc tgggaacagc ctctgctccc 1441 tggacggcga gttggacatt gagcagatag agaagaactg agggggtgtg ggcccaggca 1501 gggctggggt gtgctggcat cgacagcccc cactctgggc actaggtggg cccttgaagg 1561 ggagcccaac tcgtgggcct gatgaaagct tcctgagtgg tgtcgggtcc cagagaggga 1621 gcccacctgc tgcctggggg agagcctggc ctggccgcgt catacagcgg gtgtgtcagc 1681 ctctcaccgg ctccccgagc gtggcagcca ccaggtccac agaactactg cagcccagag 1741 gacagctttg aagtttgcgt cttttctgcc tctttccctg tgggatgttg ggcagtctct 1801 gttgtccccg gcagagctgg gcaccgctct gtatccccct ggtggtgggg gctgtcaggg 1861 agggcctggg gtgggggcca ggggccatct gctatgtcag ggcccttctt ggcctcactc 1921 aggttcactt ctggggagtc ggccccgcag cttctttcac tcagttttac tccgtgcctt 1981 ctctcccagg tctccctgct tcaggcttgg gaaggttcgg gagatgcttc cttctgtaac 2041 accagaacca tttggcctta attccaatgt gagagacaga atccctgggg tgctggactg 2101 gccctccaga gggtaagcca tgtccggagt ctcgggccca aggaacgatt tggagggtgc 2161 ttgttagggc ctcccgtgtt gggtagaaat ttggtggatc tgttggctga aaagatggac 2221 ttgcttgcct ctcctacagc atggagaggc tgaccccatg gctctgccac cgttggggca 2281 gggttagcag atggcagccc ttctctgtgg ctgacaggtc actgagtgat aagcatggtt 2341 ggttccggtg agtgtaggga tggcacgata ccagggcagc ctcttgaaaa cggcctcggg 2401 agacgggagc tgcgagcagg tgggcagatg agggccctat gcgcactcag gggtgaaggg 2461 cgtccgctgg ccactctgca ggggcccctg caggattcca ggcacctccc gtttgtcctt 2521 gaggactgct ggctgtaacc agggcacatc acccacctca agacaagccc acgcccttgt 2581 cagcttaggg ggagcccagt cctgagggct gcatctctgt tgtaggccca gccaccggca 2641 caaagctgga ttcatgctcc ctgcccctac cccaccctgg ctcctcaccc tggggcatcc 2701 gaggagccta gccccctgag ggtttgctct cctctcaagg tttgtagctc ctctccggct 2761 gccttgcaga caccaccaca tgggctctgc tctatgggaa tctggctttt agcgaatgtg 2821 gcgtcttctg caaacaatag caattgggct ggcttaggag caagtggctc attttcccat 2881 aaggctaaaa ataactggtg cgctcccttg tgttggctga cacgcgcgtt caaagcactt 2941 ttgtagtcac tttgcttttg ctcgtcttca tggacgagtg aacgcctcgc ttctgcaggt 3001 tgagtccaga tgcttctcac cttctttctc ctcaagaaag atgctttttg ggaaacgttg 3061 tttaaatctt atttttttac tacatcaaaa ggatggtggt tcaagttccc aatatgtggg 3121 tggcacttct taaaaatcag ctttaaggag ctggcagaaa gcccccagcc ccacagccct 3181 gagagatggt gttgctagct caggtggctg acacatgggg tatgccgggc actgggcagg 3241 tcccagagcc ggggaaccag ctcacctctg gttgctgtag ctcctgccgg aggcatgtct 3301 acttgtgatc ccggacagcc gaacccaaga gctggtggct ctgagcagac agagacatct 3361 tggcctgtcc ctgcctgggg gtcatggaga ccatgtcttc ttagagcaaa tgtggaggcg 3421 gccagggcag ttgttgggtg aatgtggaga gcacatggcc atgtcttgcc cccggagtac 3481 cactgggcgt ggggggtcct ggcaccacat gcccggtgtg gccgagggca cacagcctct 3541 atagcaggcc ttcctgtgga aggcagaggc agtgagggag gtggacggtg ccagctgagg 3601 ctgaggcatg cagcagcccc cagctacctt tgcttagggc tggggtggga ggcacatggt 3661 gacaggtata tgtcgtggga ctggggtgtg ggtgacctgc cctcaaacct tgcctgccac 3721 ctccccattc aggcctggtg gcaggaaggg acaagctgtg gagctggctg agtcacagcc 3781 acctccccac ctccccgcaa gctggtccca tcgaccagca agcccagccc cagggcgctt 3841 agggagaaat gacccagcct cctcagaccc cgcctgcctg tcctgtgccc accacgcagc 3901 agtcagggga gaaaatggtg gctatccctt ctgcttagag aaagaaatgg cctttagctg 3961 gtttcatgtt tgtgttttga ctggagggag tagaccctat ctataaggtg ccaccccatc 4021 atccaagctg ccacactgcc cggagcagcc tgttcctgca ctccaccctg ctggccccag 4081 gacttctgat ctcagtcctc tgggagggag gttcgcctag gaggtgcccc ccacattggt 4141 gtccccatgg gcagcaggca gacagctcac ccccaccagc atgatggccc cagctggggg 4201 cagtggcagg agccttactt ttgtcacagc cttgcccaca aaccctgcct ctgaggggag 4261 actgaggaag ggcagagcca gaagcaagcc gtgccaggcc atctgcctgc tcatggggtc 4321 ctaaagcgcg ggctaagcct gcaggaaagc cggggcggtg gggggggctt agtgccacat 4381 gcaccccact cattccaaag ccaccaaact gccaggggct gccgtccacc cgtggggccc 4441 aggggctggg gccacagcct tgccattttc gttgccatac cctcttgcct tactcgcggt 4501 ggaggccgga tttgcacggg cagacgtgca cctgggcccg tggggagctt gttctgacca 4561 gacgtacaga ttttcattct cagaaagcct tacttttcaa ccaaattttt gtagccagtt 4621 ttgtgaattt gtacactgaa agaaaattta aataaagggg aagtccacat taaaaagaaa 4681 acaaaacaaa ccctaactaa cttccaaatg ggtctcctgg tgcgggggcg tgagtggccg 4741 tgccctgggt gtgctgcctg tctgagcaag cttccctagc tgtggaaccc cgggccccct 4801 gctgcgggct ctgccttggt gtcatgcctg ctgcaccccc gtttccactg acgtgccgtc 4861 tgtggctatg ggggtggtca ctggaatgac ggtcactcca gacgtcagcc ggcagggatg 4921 cagcaggctg gccgcgcacc ggggctcggg caccctctgg ccccacactg gcaatgatgc 4981 cacaccttgc catgtccacg ctgttggtca aacccctctg tcatgcctct ttaaagagaa 5041 aagaagagaa agattttttt tttttttaat ggcagaccga agtggagatc ttgtagccta 5101 gataggatag tctgaccttc tagcatagtc tttttggcaa atgatttgtg ttttcagtgt 5161 gtggggaagc tgtcctgggg gctggggcga cagatagcac ataggctgtt tctggggctg 5221 caggggcttc cctgagctgg atgttgtggg tgttgccgtg cttcaggaag tgtggcgacc 5281 agaaagcgta gacccggggc ccagggtctg cccgcccctg cagcctggcc tccccgcaca 5341 ggctgtggct tgcactccag ccgctctagt ctctcaggaa tttgcttgtt acttgtactg 5401 tgtaaataaa gcttcctggt tcaataccc // LOCUS D50931 3020 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0141 gene, complete cds. ACCESSION D50931 NID g1469204 KEYWORDS KIAA0141. SOURCE Homo sapiens male myeloblast cell_line:KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3020) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (06-JUN-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 3020) AUTHORS Nomura,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. IV. The coding sequences of 40 new genes (KIAA0121-KIAA0160) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (4), 167-174 (1995) MEDLINE 96127530 FEATURES Location/Qualifiers source 1..3020 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..80 gene 81..1628 /gene="KIAA0141" CDS 81..1628 /gene="KIAA0141" /note="The KIAA0141 gene product is novel." /citation=[3] /codon_start=1 /db_xref="PID:d1010132" /db_xref="PID:g1469205" /translation="MWRLPGLLGRALPRTLGPSLWRVTPKSTSPDGPQTTSSTLLVPV PNLDRSGPHGPGTSGGPRSHGWKDAFQWMSSRVSPNTLWDAISWGTLAVLALQLARQI HFQASLPAGPQRVEHCSWHSPLDRFFSSPLWHPCSSLRQHILPSPDGPAPRHTGLREP RLGQEEASAQPRNFSHNSLRGARPQDPSEEGPGDFGFLHASSSIESEAKPAQPQPTGE KEQDKSKTLSLEEAVTSIQQLFQLSVSIAFNFLGTENMKSGDHTAAFSYFQKAAARGY SKAQYNAGLCHEHGRGTPRDISKAVLYYQLAASQGHSLAQYRYARCLLRDPASSWNPE RQRAVSLLKQAADSGLREAQAFLGVLFTKEPYLDEQRAVKYLWLAANNGDSQSRYHLG ICYEKGLGVQRNLGEALRCYQQSAALGNEAAQERLRALFSMGAAAPGPSDLTVTGLKS FSSPSLCSLNTLLAGTSRLPHASSTGNLGLLCRSGHLGASLEASSRAIPPHPYPLERS VVRLGFG" 3'UTR 1629..3020 BASE COUNT 717 a 826 c 755 g 722 t ORIGIN 1 cgcggcggcc tttctagccg ctgtcccaag ggttggtctc gcgctttcgg ctgcgagctc 61 tctgtggtgc tggcagcgac atgtggcgcc tcccgggact cctgggccga gctcttcccc 121 gtacactggg acctagcctc tggagggtga ctcctaagtc caccagccca gatgggcctc 181 agactacctc ctccactttg ctggttcctg tgcctaacct cgacaggtca ggtccccatg 241 gcccaggcac gagcgggggt ccaaggtccc atggatggaa ggatgccttc caatggatgt 301 cttcccgtgt ctccccgaac accctatggg atgccatatc ttggggcact ctggccgtgc 361 tggccctgca gctggcaagg cagatccact tccaggcatc cctgccagca ggacctcagc 421 gggtagaaca ctgctcctgg cacagtcccc tggaccgttt cttctcatct cccttgtggc 481 acccatgctc ctcactgcga caacacatcc tccccagccc cgatggccca gctcccaggc 541 acactggcct cagggaaccc aggcttggcc aggaagaagc ctcagctcag ccccggaact 601 tctcacacaa ctctttgaga ggagctcgtc ctcaggaccc ctctgaggaa ggtcccggtg 661 attttggctt cctgcatgcc agtagtagca tcgagtccga ggcaaaacca gcccagcctc 721 agcccactgg tgaaaaggaa caagataaat caaaaactct ttcccttgag gaggctgtga 781 cttccattca gcagctcttc cagctcagtg tttccatcgc tttcaacttc ctgggaacag 841 agaacatgaa gagtggcgac cacacggcag ccttttctta cttccagaaa gctgcagccc 901 gcggctacag caaagcgcag tacaatgcgg gcttgtgtca tgagcatggc agaggcaccc 961 ccagggacat tagcaaggcg gtcctttatt atcagttggc tgccagccag ggccacagcc 1021 tggctcagta ccgctatgcc aggtgcctac tacgagaccc agcctcttcg tggaaccctg 1081 agcggcagag ggcagtgtcc ttgctgaagc aggctgcaga ctcaggcttg agagaggccc 1141 aagctttcct cggggtgctt ttcaccaagg agccctacct ggatgagcag agagctgtga 1201 aatatctttg gcttgcagcc aacaatgggg actcacagag caggtaccac cttggaattt 1261 gctatgagaa aggccttggt gtgcagagga atctgggaga ggccttgaga tgttaccagc 1321 agtcagccgc tctgggaaat gaggccgccc aggagaggct gcgagccctc ttttccatgg 1381 gggctgcagc cccggggccc agcgacctga cagttacagg actgaagtct ttctccagcc 1441 cctccctctg cagcttgaac accctgctag caggaacctc acgcctacca catgcctcga 1501 gcacaggcaa ccttggcctc ctctgcagaa gtgggcatct cggagccagc ctggaagcct 1561 ccagcagggc tattccccca cacccctacc cactggaaag gagtgttgta agactaggtt 1621 ttggctaagg tgagataaaa catagtccct ggtgcctctt aggggccaga gcgggcagga 1681 ggttggataa caaaaataga gcatcagcaa ccctttccag gtagaaattc cagcgggagt 1741 tcaggttccc aagcaatttc acgtacatgg ctggtaagtg actgatcttt ccccccgctt 1801 ggtagcctca cagatgagtc ttggatgcat tcacagtcat ttctggtctg tgcaccaaag 1861 gatgcattca gtgacctatg aaaaacccta ctgaagggtc cagagaccct ggtgctcacc 1921 ttagcctttg tctttgagca aataacttac cttcttcctt cttatgcctg ggttttctca 1981 cacttaaatc tgtactactg tttgccaatg tctgatgtgt gtatccctgg ttcacaagaa 2041 gattttagat ggtattcaaa ttaatatttt ctatttagtt atatatttaa tgtataatat 2101 aaaaatatga ttagcatatt gacctgtagt ttgatagatg ttctgtctag gataaagcta 2161 actgtaaaaa aaaatagtga atcagtttaa agaaaaacat aacgtaaaag tgggcacaga 2221 tccagagagg tagcaaaaat cacaggggtg gttccatgaa tggctgacac ttaggaaact 2281 ctgaattagg ccatcctcga gactagccca ccattcacct ctgttcatcc cccgtgggct 2341 cataatcgtt ttcatttcac ctttgatttg gaaggaagaa gtttcttgcc caaatgcctg 2401 gatgtgtctg cttgactttc agaacttctc acctcagccc taaagaggga gcctgtgggt 2461 tctcagagag atatcacaat ttgagtccca aagaagaggc cagataccca cccaccttcc 2521 cccaaatctt aagcacctgc gccagtacag tcaagaagag gaaagtgtgt gaagacccag 2581 gtctggctct gccacttgcc tggccatgtc accttgaagc tgtgacctga ctccctatat 2641 tgtttcctca gttgtagacc aaaggcaatg gtgtctgccc tcctacctta gaagacaaat 2701 gcaagggcat ttcaccacag agaggacctt tgtgctcact ttggcccagg aggcagtgat 2761 gctcatggtt gcatgacttt atgagtcgct gggccagggt gaggacctgg gcctcctgac 2821 tcctggccca gagttcttgt ccatcagttc atactgcaat tttatgtgaa agcattatga 2881 ctgtcctacc catgggagag taaatgtaga ttgaatgcta ggagtcttaa agctggagag 2941 tatagatttt gaggtcccca tttgggaaac atgtgccaga aatgtctagg tgtttaataa 3001 aacagatatt ggattatctc // LOCUS D55636 404 bp mRNA PRI 13-AUG-1997 DEFINITION Homo sapiens mRNA for smallest subunit of ubiquinol-cytochrome c reductase, complete cds. ACCESSION D55636 NID g2317645 KEYWORDS smallest subunit of ubiquinol-cytochrome c reductase. SOURCE Homo sapiens fibroblasts cDNA to mRNA, clone_lib:Okayama-Berg library sub_clone:H-9-1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 404) AUTHORS Islam,M.M. TITLE Direct Submission JOURNAL Submitted (10-JUN-1995) to the DDBJ/EMBL/GenBank databases. Mohammed M Islam, Austin and Repatriation Medical Centre, Centre for Molecular Biology and Medicine; Banksia St., West Heidelberg Vic-3081, Australia (E-mail:mislam@cfmbml.repat.unimelb.edu.au, Tel:61-3-9496 4110, Fax:61-3-9496 4112) REFERENCE 2 (sites) AUTHORS Islam,M.M., Suzuki,H., Yoneda,M. and Tanaka,M. TITLE Primary structure of the smallest (6.4-kDa) subunit of human and bovine ubiquinol-cytochrome c reductase deduced from cDNA sequences JOURNAL Biochem. Mol. Biol. Int. 41 (6), 1109-1116 (1997) MEDLINE 97305447 COMMENT Sequence updated (04-Aug-1997). FEATURES Location/Qualifiers source 1..404 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Okayama-Berg library" /sub_clone="H-9-1" /tissue_type="fibroblasts" CDS 38..208 /codon_start=1 /product="smallest subunit of ubiquinol-cytochrome c reductase" /db_xref="PID:d1022603" /db_xref="PID:g2317646" /translation="MVTRFLGPRYRELVKNWVPTAYTWGAVGAVGLVWATDWRLILDW VPYINGKFKKDN" polyA_signal 384..399 polyA_site 404 /note="7 A nucleotides" BASE COUNT 87 a 112 c 115 g 90 t ORIGIN 1 gtggacaggg tcatcctgag ggtgcgactc cgccgcgatg gtgacccggt tcctgggccc 61 acgctaccgg gagctggtca agaactgggt cccgacggcc tacacatggg gcgctgtggg 121 cgccgtgggg ctggtgtggg ccaccgattg gcggctgatc ctggactggg taccttacat 181 caatggcaag tttaagaagg ataattaatt acacaaaccc ttcacagact gctctggtgc 241 ctggtggtgc tagctcctcc cacctcagca cctgctgcat ctggagcagc ccaagctctc 301 aggatggaca agaggaaacc cacagctcag cttcaggctt cttatgtttc tgaaaacagc 361 ttggatattt taatgcacgt tgcattaaac ctcactgaaa cctg // LOCUS D55655 2528 bp mRNA PRI 18-NOV-1997 DEFINITION Homo sapiens mRNA for cardiac calsequestrin, complete cds. ACCESSION D55655 NID g2627061 KEYWORDS . SOURCE Homo sapiens adult heart cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2528) AUTHORS Tanaka,T., Inazawa,J. and Nakamura,Y. TITLE Molecular cloning of a human cDNA for cardiac calsequestrin and its chromosomal assignment to 1p13.3 by fluorescence in situ hybridization JOURNAL Unpublished (1995) REFERENCE 2 (bases 1 to 2528) AUTHORS Nakamura,Y. TITLE Direct Submission JOURNAL Submitted (13-JUN-1995) to the DDBJ/EMBL/GenBank databases. Yusuke Nakamura, Institute of Medical Science, The University of Tokyo, Laboratory of Molecular Medicine, Human Genome Center; 4-6-1 Shirokanedai, Minato-ku, Tokyo 108, Japan (E-mail:yusuke@ims.u-tokyo.ac.jp, Tel:03-5449-5372, Fax:03-5449-5433) FEATURES Location/Qualifiers source 1..2528 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="heart" CDS 110..1309 /codon_start=1 /product="cardiac calsequestrin" /db_xref="PID:g2627062" /translation="MKRTHLFIVGIYFLSSCRAEEGLNFPTYDGKDRVVSLSEKNFKQ VLKKYDLLCLYYHEPVSSDKVTPKQFQLKEIVLELVAQVLEHKAIGFVMVDAKKEAKL AKKLGFDEEGSLYILKGDRTIEFDGEFAADVLVEFLLDLIEDPVEIISSKLEVQAFER IEDYIKLIGFFKSEDSEYYKAFEEAAEHFQPYIKFFATFDKGVAKKLSLKMNEVDFYE PFMDEPIAIPNKPYTEEELVEFVKEHQRPTLRRLRPEEMFETWEDDLNGIHIVAFAEK SDPDGYEFLEILKQVARDNTDNPDLSILWIDPDDFPLLVAYWEKTFKIDLFRPQIGVV NVTDADSVWMEIPDDDDLPTAEELEDWIEDVLSGKINTEDDDEDDDDDDNSDEEDNDD SDDDDDE" BASE COUNT 730 a 534 c 564 g 700 t ORIGIN 1 aaaaatcagc ctgtctgctc tctccttgct caacaaggcc tctaacagtc ttctgtcctc 61 tattctgcac acggcatatt tgggaacgag aaacccaaag ttttcccaaa tgaagagaac 121 tcacttgttt attgtgggga tttattttct gtcctcttgc agggcagaag aggggcttaa 181 tttccccaca tatgatggga aggaccgagt ggtaagtctt tccgagaaga acttcaagca 241 ggttttaaag aaatatgact tgctttgcct ctactaccat gagccggtgt cttcagataa 301 ggtcacgcca aaacagttcc aactgaaaga aatcgtgctt gagcttgtgg cccaggtcct 361 tgaacataaa gctataggct ttgtgatggt ggatgccaag aaagaagcca agcttgccaa 421 gaaactgggt tttgatgaag aaggaagcct gtatattctt aagggtgatc gcacaataga 481 gtttgatggc gagtttgcag ctgatgtctt ggtggagttc ctcttggatc taattgaaga 541 cccagtggag atcatcagca gcaaactgga agtccaagcc ttcgaacgca ttgaagacta 601 catcaaactc attggctttt tcaagagtga ggactcagaa tactacaagg cttttgaaga 661 agcagctgaa cacttccagc cttacatcaa attctttgcc acctttgaca aaggggttgc 721 aaagaaatta tctttgaaga tgaatgaggt tgacttctat gagccattta tggatgagcc 781 cattgccatc cccaacaaac cttacacaga agaggagctg gtggagtttg tgaaggaaca 841 ccaaagaccc actctacgtc gcctgcgccc agaagaaatg tttgaaacat gggaagatga 901 tttgaatggg atccacattg tggcctttgc agagaagagt gatccagatg gctacgaatt 961 cctggagatc ctgaaacagg ttgcccggga caatactgac aaccccgatc tgagcatcct 1021 gtggatcgac ccggacgact ttcctctgct cgttgcctac tgggagaaga ctttcaagat 1081 tgacctattc aggccacaga ttggggtggt gaatgtcaca gatgctgaca gtgtctggat 1141 ggagattcca gatgatgacg atcttccaac tgctgaggag ctggaggact ggattgagga 1201 tgtgctttct ggaaagataa acactgaaga tgatgatgaa gatgatgatg atgatgataa 1261 ttctgatgaa gaggataatg atgacagtga tgacgatgat gatgaatagc ccaactccaa 1321 acaattctga tgaaaacaaa atcacagcac ccactaccat acagacagca caaggtggca 1381 gcaagcaatt ctgccccaca cccagccagc tcctttccct tttccatcat ctcttttccc 1441 actccctttg cgtcaggagc agcatcattc agcaaatgcc ttttcaaatg cagcaatccc 1501 acttagcagg gacaggagaa aaattattcc catgttgact gtcttgactg tcacggaaca 1561 gatcttgttc tttgctggac catcaagggt catggcagtg cctgaacatg gcagtctagg 1621 gtgaacaatc ccctaacaca agtttacttg tctttgatta tgacagtaac aaaattgaca 1681 gctttctaac tcacaggcat agagtgacct tttaatcaga gcccagggaa gacacatgat 1741 taatgattta gctccctcca tacctcgaac atcagttggg atccctcctc cagccaagat 1801 gatccttctt agagaaggct cagccttgga agcaaactta taaatcatat tctcatggct 1861 ttgttaaact tatttcaagt gatggtcatt catatcacta tgaacttgga tattcaagcc 1921 tttggatggc tatggagagg gcttgaaatg tgtacaggtg tcaccatcat ttctagtata 1981 ttaggaaact gggatgggag gttgatttgc tctctaaact tccctctagt tggcaagtct 2041 cacatattca tcagcaggag tggagggtgg gggaaaacta gaaagatgaa aacttttaca 2101 tttttctgat gggttcatgt ctctgattgg gtcagctggc ttcctagcct aagctgggat 2161 ctgaataccc cttctctgta gctgctagtg agccttccca tttagattaa agattgcttt 2221 atccagcagt caattaactc tccagttatc agtactccca caattggcca gggcaacaat 2281 aattggagtt catactgatg ccctgaggca ctgaaaaaaa aaaaaatccc aaagtgcctt 2341 ctgagctgtc taaaagttac attgtgcttg gtagatttag tgttaagtgt gcagtataat 2401 tttctaattt attttctcaa tcttttagca catgtgtaag acactgtgca aatttttgga 2461 aaatagagca atactttttg tggaatacta gctaactaat tctgtcatta aactcatatt 2521 ttggaaat // LOCUS D55696 1850 bp mRNA PRI 14-MAR-1997 DEFINITION Human mRNA for cysteine protease, complete cds. ACCESSION D55696 NID g1890049 KEYWORDS cysteine protease. SOURCE Homo sapiens Adult Heart cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Tanaka,T., Inazawa,J. and Nakamura,Y. TITLE Molecular cloning of a human cDNA encoding putative cysteine protease (PRSC1) and its chromosome assignment to 14q32.1 JOURNAL Cytogenet. Cell Genet. 74 (1-2), 120-123 (1996) MEDLINE 97049087 REFERENCE 2 (bases 1 to 1850) AUTHORS Tanaka,T., Inazawa,J. and Nakamura,Y. TITLE Molecular cloning of a novel human cDNA encoding putative cysteine protease and its chromosomal assignment to 14q32.1 by fluorescence in situ hybridization JOURNAL Unpublished (1995) REFERENCE 3 (bases 1 to 1850) AUTHORS Nakamura,Y. TITLE Direct Submission JOURNAL Submitted (18-JUN-1995) to the DDBJ/EMBL/GenBank databases. Yusuke Nakamura, Institute of Medical Science,The University of Tokyo, Laboratory of Molecular Medicine; 4-6-1 Sirokanedai, Minato-ku, Tokyo 108, Japan (E-mail:yusuke@ims.u-tokyo.ac.jp, Tel:03-5449-5372, Fax:03-5449-5433) FEATURES Location/Qualifiers source 1..1850 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="14" /dev_stage="Adult" /map="14q32.1" /tissue_type="Heart" CDS 46..1347 /EC_number="3.4.22" /codon_start=1 /product="cysteine protease" /db_xref="PID:d1010173" /db_xref="PID:g1890050" /translation="MVWKVAVFLSVALGIGAVPIDDPEDGGKHWAVIVAGSNGWYNYR HQADACHAYQIIHRNGIPDEQIVVMMYDDIAYSEDNPTPGIVINRPNGTDVYQGVPKD YTGEDVTPQNFLAVLRGDAEAVKGIGSGKVLKSGPQDHVFIYFTDHGSTGILVFPNED LHVKDLNETIHYMYKHKMYRKMVFYIEACESGSMMNHLPDNINVYATTAANPRESSYA CYYDEKRSTYLGDWYSVNWMEDSDVEDLTKETLHKQYHLVKSHTNTSHVMQYGNKTIS TMKVMQFQGMKRKASSPVPLPPVTHLDLTPSPDVPLTIMKRKLMNTNDLEESRQLTEE IQRHLDARHLIEKSVRKIVSLLAASEAEVEQLLSERAPLTGHSCYPEALLHFRTHCFN WHSPTYEYALRHLYVLVNLCEKPYPLHRIKLSMDHVCLGHY" polyA_signal 1833..1838 BASE COUNT 482 a 456 c 464 g 448 t ORIGIN 1 gaatcgcctg ccacaggtgt ctgcaattga actccaaggt gcagaatggt ttggaaagta 61 gctgtattcc tcagtgtggc cctgggcatt ggtgccgttc ctatagatga tcctgaagat 121 ggaggcaagc actgggcagt gatcgtggca ggttcaaatg gctggtataa ttataggcac 181 caggcagacg cgtgccatgc ctaccagatc attcaccgca atgggattcc tgacgaacag 241 atcgttgtga tgatgtacga tgacattgct tactctgaag acaatcccac tccaggaatt 301 gtgatcaaca ggcccaatgg cacagatgtc tatcagggag tcccgaagga ctacactgga 361 gaggatgtta ccccacaaaa tttccttgct gtgttgagag gcgatgcaga agcagtgaag 421 ggtataggat ccggcaaagt cctgaagagt ggcccccagg atcacgtgtt catttacttc 481 actgaccatg gatctactgg aatactggtt tttcccaatg aagatcttca tgtaaaggac 541 ctgaatgaga ccatccatta catgtacaaa cacaaaatgt accgaaagat ggtgttctac 601 attgaagcct gtgagtctgg gtccatgatg aaccacctgc cggataacat caatgtttat 661 gcaactactg ctgccaaccc cagagagtcg tcctacgcct gttactatga tgagaagagg 721 tccacgtacc tgggggactg gtacagcgtc aactggatgg aagactcgga cgtggaagat 781 ctgactaaag agaccctgca caagcagtac cacctggtaa aatcgcacac caacaccagc 841 cacgtcatgc agtatggaaa caaaacaatc tccaccatga aagtgatgca gtttcagggt 901 atgaaacgca aagccagttc tcccgtcccc ctacctccag tcacacacct tgacctcacc 961 cccagccctg atgtgcctct caccatcatg aaaaggaaac tgatgaacac caatgatctg 1021 gaggagtcca ggcagctcac ggaggagatc cagcggcatc tggatgccag gcacctcatt 1081 gagaagtcag tgcgtaagat cgtctccttg ctggcagcgt ccgaggctga ggtggagcag 1141 ctcctgtccg agagagcccc gctcacgggg cacagctgct acccagaggc cctgctgcac 1201 ttccggaccc actgcttcaa ctggcactcc cccacgtacg agtatgcgtt gagacatttg 1261 tacgtgctgg tcaacctttg tgagaagccg tatccgcttc acaggataaa attgtccatg 1321 gaccacgtgt gccttggtca ctactgaaga gctgcctcct ggaagctttt ccaagtgtga 1381 gcgccccacc gacatgtgtg ctgatcagag actggagagg tggagtgaga agtctccgct 1441 gctcgggccc tcctggggag cccccgctcc agggctcgct ccaggacctt cttcacaaga 1501 tgacttgctc gctgttacct gcttccccag tcttttctga aaaactacaa attagggtgg 1561 gaaaagctct gtattgagaa gggtcatatt tgctttctag gaggtttgtt gttttccctg 1621 ttagttttga ggagcaggaa gctcatgggg gcttctgtag cccctctcaa aaggagtctt 1681 tattctgaga atttgaagct gaaacctctt taaatcttca gaatgatttt attgaagagg 1741 ggcgcaagcc ccaaatggaa aactgttttt agaaaatatg atgatttttg attgcttttg 1801 tatttaattc tgcaggtgtt caagtcttaa aaaataaaga tttataacag // LOCUS D63390 1195 bp mRNA PRI 12-MAY-1997 DEFINITION Human mRNA for acetylhydrolase IB beta-subunit, complete cds. ACCESSION D63390 NID g2081613 KEYWORDS platelet activating factor; acetylhydrolase IB beta-subunit. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1195) AUTHORS Adachi,H. TITLE Direct Submission JOURNAL Submitted (11-JUL-1995) to the DDBJ/EMBL/GenBank databases. Hideki Adachi, Suntory Institute for Biomedical Research; 1-1-1 Wakayamadai, Shimamoto-cho, Mishima-gun, Osaka 618, Japan (E-mail:adachi_h@minase.suntory.co.jp, Tel:075-962-9283, Fax:075-962-6448) REFERENCE 2 (bases 1 to 1195) AUTHORS Adachi,H., Tsujimoto,M., Hattori,M., Arai,H. and Inoue,K. TITLE cDNA cloning of human cytosolic platelet-activating factor acetylhydrolase beta-subunit and its mRNA expression in human tissues JOURNAL Unpublished (1995) REFERENCE 3 (sites) AUTHORS Adachi,H., Tsujimoto,M., Hattori,M., Arai,H. and Inoue,K. TITLE Differential tissue distribution of the beta- and gamma-subunits of human cytosolic platelet-activating factor acetylhydrolase (isoform I) JOURNAL Biochem. Biophys. Res. Commun. 233 (1), 10-13 (1997) MEDLINE 97289481 FEATURES Location/Qualifiers source 1..1195 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 103..792 /function="platelet activating factor" /codon_start=1 /product="acetylhydrolase IB beta-subunit" /db_xref="PID:d1020706" /db_xref="PID:g2081614" /translation="MSQGDSNPAAIPHAAEDIQGDDRWMSQHNRFVLDCKDKEPDVLF VGDSMVQLMQQYEIWRELFSPLHALNFGIGGDTTRHVLWRLKNGELENIKPKVIVVWV GTNNHENTAEEVAGGIEAIVQLINTRQPQAKIIVLGLLPRGEKPNPLRQKNAKVNQLL KVSLPKLANVQLLDTDGGFVHSDGAISCHDMFDFLHLTGGGYAKICKPLHELIMQLLE ETPEEKQTTIA" BASE COUNT 337 a 255 c 291 g 312 t ORIGIN 1 cgcgccggag cgggaccgac gggaccgagc gagcgaccga cgcgccaccc gccgacgcct 61 cagccgcttg gggcccgcac ggaccctcta cttcagtgta gaatgagcca aggagactca 121 aacccagcag ctattccgca tgcagcagaa gatattcaag gagatgaccg atggatgtct 181 cagcacaaca gatttgtttt ggactgtaaa gacaaagagc ctgatgtact gttcgtggga 241 gactccatgg tgcagttaat gcagcaatat gagatatggc gagagctttt ttccccactt 301 catgcactga attttggaat tgggggagat acaacaagac atgttttgtg gagactaaag 361 aatggagaac tggagaatat taagcctaag gtcattgttg tctgggtagg aacaaataac 421 cacgaaaata cagcagaaga agtagcaggt gggatcgagg ccattgtaca acttatcaac 481 acaaggcagc cacaggccaa aatcattgta ttgggtttgt tacctcgagg tgagaaaccc 541 aatcctttga ggcaaaagaa cgccaaggtg aaccaactcc tcaaggtttc gctgccgaag 601 cttgccaacg tgcagctcct ggataccgac gggggttttg tgcactcgga cggtgccatc 661 tcctgccacg acatgtttga ttttctgcat ctgacaggag ggggctatgc aaagatctgc 721 aaacccctgc atgaactgat catgcagttg ttggaggaaa cacctgagga gaaacaaacc 781 accattgcct gactggctct tatcagtgtt aatagcatct cagcttcctc agatcagttc 841 tatcactggc actacagaat ccttctcttt cttaaggcac tttgcattgt agaatgttcc 901 tggatgttca tatctagtgt ttgaagggga ggagggattt aaactggtcc tgtacataga 961 aggtttgttt gacagaggag aaaaattagc caaggaagat tgttgtttaa attcatttga 1021 aaccagaagg ggacttttta gttgtatgtg taacacattc attgaattat tatcactgtt 1081 ttcttgggac aacatcaagc ctaaatactg aacaatatga agattctttt cttggccttt 1141 ctgtggatta tgtcatatat aataattatc agaatcattc tacttggctt tttcc // LOCUS D63476 5032 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0142 gene, complete cds. ACCESSION D63476 NID g1469865 KEYWORDS KIAA0142. SOURCE Homo sapiens male myeloblast cell_line:KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5032) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (13-JUL-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 5032) AUTHORS Nomura,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. IV. The coding sequences of 40 new genes (KIAA0121-KIAA0160) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (4), 167-174 (1995) MEDLINE 96127530 FEATURES Location/Qualifiers source 1..5032 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..473 gene 474..2414 /gene="KIAA0142" CDS 474..2414 /gene="KIAA0142" /note="The KIAA0142 gene is related to human KIAA0006 gene." /citation=[3] /codon_start=1 /db_xref="PID:d1010409" /db_xref="PID:g1469866" /translation="MTDNSNNQLVVRAKFNFQQTNEDELSFSKGDVIHVTRVEEGGWW EGTLNGRTGWFPSNYVREVKASEKPVSPKSGTLKSPPKGFDTTAINKSYYNVVLQNIL ETENEYSKELQTVLSTYLRPLQTSEKLSSANISYLMGNLEEICSFQQMLVQSLEECTK LPEAQQRVGGCFLNLMPQMKTLYLTYCANHPSAVNVLTEHSEELGEFMETKGASSPGI LVLTTGLSKPFMRLDKYPTLLKELERHMEDYHTDRQDIQKSMAAFKNLSAQCQEVRKR KELELQILTEAIRNWEGDDIKTLGNVTYMSQVLIQCAGSEEKNERYLLLFPNVLLMLS ASPRMSGFIYQGKLPTTGMTITKLEDSENHRNAFEISGSMIERILVSCNNQQDLQEWV EHLQKQTKVTSVGNPTIKPHSVPSHTLPSHPVTPSSKHADSKPAPLTPAYHTLPHPSH HGTPHTTINWGPLEPPKTPKPWSLSCLRPAPPLRPSAALCYKEDLSKSPKTMKKLLPK RKPERKPSDEEFASRKSTAALEEDAQILKVIEAYCTSAKTRQTLNSSSRKESAPQVLL PEEEKIIVEETKSNGQTVIEEKSLVDTVYALKDEVQELRQDNKKMKKSLEEEQRARKD LEKLVRKVLKNMNDPAWDETNL" 3'UTR 2415..5032 BASE COUNT 1279 a 1273 c 1232 g 1248 t ORIGIN 1 gctgagcggg ttggcatctg gggcagcggg ctcgctccag gccgtcgggg gccgctcgcc 61 agcgtcgccc gctgtgttgg gagcgcgggc cgtgggcgtc gctcggcctt gtccgcggcg 121 tccccgctgc cggccacggc gctcagcgct tgtgctctgt gttgcaggtc taccccgagc 181 cccggagcga gagcgagtgc ctgagcaaca tccgcgagtt cttgcgcggc tgcggggctt 241 ccctgcggct ggagacgttt gatgcaaatg atttgtatca ggggcagaat tttaacaagg 301 tcctcagttc cttagtgact ctaaataaag taacagcaga catcgggctg gggagtgact 361 ccgtgtgtgc ccggccctcg tctcaccgca taaagtcttt tgactccctt ggatcacagt 421 ctttgcacac tcggacttca aaactgttcc agggccagta tcggagtttg gacatgaccg 481 acaatagcaa caatcaactg gtagtaagag caaagtttaa cttccagcag accaatgagg 541 acgagctttc cttctcaaaa ggagacgtca tccatgtcac ccgtgtggaa gagggaggct 601 ggtgggaggg cacactcaac ggccggaccg gctggttccc cagcaactac gtgcgcgagg 661 tcaaggccag cgagaagcct gtgtctccca aatcaggaac actgaagagc cctcccaaag 721 gatttgatac gactgccata aacaaaagct attacaatgt ggtgctacag aatattttag 781 aaacagaaaa tgaatattct aaagaacttc agactgtgct ttcaacgtac ctacggccat 841 tgcagaccag tgagaagtta agttcagcaa acatttcata tttaatggga aatctagaag 901 aaatatgttc tttccagcaa atgctcgtac agtctttaga agaatgcacc aagttgcccg 961 aagctcagca gagagtcgga ggctgctttt taaacctgat gccacagatg aaaaccctgt 1021 acctcacgta ttgtgccaat cacccttctg cagtgaatgt cctcacggaa cacagtgagg 1081 agttggggga gttcatggag accaaaggtg ccagcagccc tgggattctc gtgctgacca 1141 cgggcctgag caaacccttc atgcgcctgg ataaataccc tacgctgctc aaagagctcg 1201 agagacacat ggaggattat catacagata gacaagatat tcaaaaatcc atggctgcct 1261 tcaaaaacct ttcagcccaa tgtcaagaag tccggaagag gaaagagctt gagctgcaga 1321 tcctgacgga agccatccgg aactgggagg gcgatgacat taaaactctg ggcaacgtca 1381 cttacatgtc ccaggtcctg attcagtgtg ccggaagtga ggaaaagaat gaaagatatc 1441 ttctactctt cccaaatgtt ttgctaatgt tgtctgccag tcctaggatg agtggcttta 1501 tctatcaggg aaagcttcca acgacaggaa tgacaatcac aaagcttgag gacagtgaaa 1561 atcatagaaa tgcatttgaa atatcaggga gcatgattga gcggatatta gtgtcgtgca 1621 acaaccagca ggatctgcag gaatgggtgg agcacctaca gaagcaaacg aaggtcacgt 1681 ctgtgggaaa ccccaccata aagcctcatt cagtgccatc tcataccctc ccctcccacc 1741 cggtcactcc gtccagcaag cacgcagaca gcaagcccgc gccgctgacg cccgcctacc 1801 acacgctgcc ccacccctcc caccacggca ccccgcacac caccatcaac tggggacccc 1861 tggagcctcc gaaaacaccc aagccctgga gcctgagctg cctgcggccc gcgcctcccc 1921 tccggccctc agctgctctc tgctacaagg aggatcttag taagagccct aagaccatga 1981 aaaagctgct gcccaagcgc aaacctgaac ggaagccttc agatgaggag ttcgcgtccc 2041 ggaaaagcac agctgctttg gaagaagatg ctcagattct gaaagtcatt gaagcttact 2101 gcaccagcgc caaaacaagg caaacactca attcaagttc acgcaaagaa tctgctccac 2161 aagttttgct tccagaagaa gagaaaatta tagtggaaga aactaaaagt aatggtcaga 2221 cagtgataga agaaaagagt cttgtggata ccgtatatgc attaaaggat gaagttcaag 2281 aattaagaca ggacaacaaa aagatgaaga aatctctaga ggaagaacag agagcccgca 2341 aagacctgga gaagctggtg aggaaagtcc tgaagaacat gaatgatcct gcctgggatg 2401 agaccaatct ataagggacg tcctcagttc tttctgttga agaccagttc tgaggtgaag 2461 ctgggcaccc ctgacccaag tcggggtgca ctcaggacca cagggcaggg ctgggtgggg 2521 cgccaccttg ctctctgtat atagaaaagc tggagcttat tctgcgaatg gagacgatca 2581 aaccatgact gatgaatcca gacaggaggg attgactctg aggacctgag ctacatcaat 2641 ccactctgtg aacatctcag ttacctcatt ctgcaataag ttcagtgact gactaaaagt 2701 cttgtttttc cagactttga attgaatata taaatattat atatacatgt ttcttgtaaa 2761 tatcccattt tgaatgcata cctgtggtgg ttctgtccgg gctaatcccc atgctagaat 2821 gtcctttcca gctacgtgaa taagaagtcc catgcccgca cccaccggaa gcagaagcct 2881 ggtggatgcc tggttcgttc cgcagcacca gggcctccac cgtgctgtgg cagcaccccc 2941 catgtcggta tttctaaata accttattta tacctgcaga gatacacttc agtcccattc 3001 agaagtcttc tcttaaagca gcattacagt cccagacctg cgggtttctg agggcagctt 3061 gctggctgac agactcagtc ttgacctcaa ggaaggccca tacggcactg ccgcatccac 3121 ctagaggtgt ttgctcttgt ccgctgtctg agtactgtga ttctcagatg agtttgctgc 3181 gttttgggag gacacagacg gttctgtata ggctagttca gtaacaacaa aatacactgt 3241 tttgtcttcc ctcaaagaga gatcttacta gaacctgtaa atagaatgta ttatttatta 3301 taagtcactg cagctgatga aaacagatgg aggccatgct gcaggctgat actgatgggt 3361 ggagttttgt catcaggcca gcctcatccc gaggtctcct ccaccattgg ccgtagccag 3421 caggcttcag tgctcaccga aagtaaaatc ccctccttca gcaagaataa agcaatatac 3481 accttaggtt ccactaagta acataggcat aagcagggaa cgtttccccc actgtgttcc 3541 agtgcagagg agacgaagcc tgtcctcacc gcggctcgct gggcccaggc tggctctgga 3601 aagcctgtgc ggtcctgggc aggaagcccg gcccgtggag caggttttcg ttctgcttca 3661 gcaataaata agggtgacca cagggacttt gcttttggtt tcctttcctg tgaaaaggtt 3721 ggttttaaag tgagatacac ttttccgtag aacaagtgtt ctatctttaa aaacccaaat 3781 tgcagcaccg tggattactg gtctcagaac aactcattgc gcatcagatt tgactctctg 3841 attttctgtc tattggccaa attgcccttt aactgcacct gaatcctttg tgtactgatg 3901 cctttgagct gggcaccttg ggagagtgtt gtgttgctgt ttacggttct tccttgccct 3961 tgctaattac agtctctggt gcccagcaag cccctttggc ttccttccgt gactggtcac 4021 gttgtctgcc tgggctcagc gtggacctgc cccatgctgc agaacctggc ctcacctgga 4081 cttttcacta gaattgccag cttcctcaac ttagcagatc attcactcat gcgggcacaa 4141 gcaaagatca acactttctt ttttggtaag cttgagtttt acaagttatt ttttggtgat 4201 gcgtaagaca ttgcagtggg aaaccattca acttgagttt attggagttt gctgttgtag 4261 caggttttaa ctcaggaaca actcttgtct gatctctccg cccctctgcc gggaggcgac 4321 attaactgtc ctctcggagc cggtagcgtt gctgtccgag tccccaggac ggatctcctg 4381 cagacctgcc ttaatgctca gatcgaagta tttcacaaga atacttgtgt ttttaacagc 4441 ccttcccctg gacggtgcgg ccatgagggc ctcatgttac ggcattgcct tttctttctg 4501 tggatccagt atcttcctcg gctttttagg gagcaggaaa aatgcgtctg agagcaactc 4561 tttttaaaaa cctgccctgt tgtatataac tgtgtctgtt tcaccgtgtg acctcccaag 4621 ggggtgggaa cttgatataa acgtttaaag gggccacgat ttgcccgagg gttactcctt 4681 tgctctcacc ttgtatggat gaggagatga agccatttct tatcctgtag atgtgaagca 4741 ctttcagttt tcagcgatgt tggaatgtag catcagaagc tcgttccttc acactcagtg 4801 gcgtctgtgc ttgtccacat gcactgggcg tctgggacct tgaatgcctg ccctggttgt 4861 gtggactcct taatgccaat catttcttca cttctctggg acacccaggg cgcctgttga 4921 caagtgtgga gaaactccta atttaaatgt cacagacaat gtcctagtgt tgactactac 4981 aatgttgatg ctacactgtt gtaattatta aactgattat ttttcttatg tc // LOCUS D63478 3411 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0144 gene, complete cds. ACCESSION D63478 NID g1469869 KEYWORDS KIAA0144. SOURCE Homo sapiens male myeloblast cell_line:KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3411) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (13-JUL-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 3411) AUTHORS Nomura,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. IV. The coding sequences of 40 new genes (KIAA0121-KIAA0160) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (4), 167-174 (1995) MEDLINE 96127530 FEATURES Location/Qualifiers source 1..3411 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..106 gene 107..3058 /gene="KIAA0144" CDS 107..3058 /gene="KIAA0144" /note="The KIAA0144 gene product is novel." /citation=[3] /codon_start=1 /db_xref="PID:d1010411" /db_xref="PID:g1469870" /translation="MMTSVGTNRARGNWEQPQNQNQTQHKQRPQATAEQIRLAQMISD HNDADFEEKVKQLIDITGKNQDECVIALHDCNGDVNRAINVLLEGNPDTHSWEMVGKK KGVSGQKDGGQTESNEEGKENRDRDRDYSRRRGGPPRRGRGASRGREFRGQENGLDGT KSGGPSGRGTERGRRGRGRGRGGSGRRGGRFSAQGMGTFNPADYAEPANTDDNYGNSS GNTWNNTGHFEPDDGTSAWRTATEEWGTEDWNEDLSETKIFTASNVSSVPLPAENVTI TAGQRIDLAVLLGKTPSTMENDSSNLDPSQAPSLAQPLVFSNSKQTAISQPASGNTFS HHSMVSMLGKGFGDVGEAKGGSTTGSQFLEQFKTAQALAQLAAQHSQSGSTTTSSWDM GSTTQSPSLVQYDLKNPSDSAVHSPFTKRQAFTPSSTMMEVFLQEKSPAVATSTAAPP PPSSPLPSKSTSAPQMSPGSSDNQSSSPQPAQQKLKQQKKKASLTSKIPALAVEMPGS ADISGLNLQFGALQFGSEPVLSDYESTPTTSASSSQAPSSLYTSTASESSSTISSNQS QESGYQSGPIQSTTYTSQNNAQGPLYEQRSTQTRRYPSSISSSPQKDLTQAKNGFSSV QATQLQTTQSVEGATGSAVKSDSPSTSSIPPLNETVSAASLLTTTNQHSSSLGGLSHS EEIPNTTTTQHSSTLSTQQNTLSSSTSSGRTSTSTLLHTSVESEANLHSSSSTFSTTS STVSAPPPVVSVSSSLNSGSSLGLSLGSNSTVTASTRSSVATTSGKAPPNLPPGVPPL LPNPYIMAPGLLHAYPPQVYGYDDLQMLQTRFPLDYYSIPFPTPTTPLTGRDGSLASN PYSGDLTKFGRGDASSPAPATTLAQPQQNQTQTHHTTQQTFLNPALPPGYSYTSLPYY TGVPGLPSTFQYGPAVFPVAPTSSKQHGVNVSVNASATPFQQPSGYGSHGYNTGRKYP PPYKHFWTAES" 3'UTR 3059..3411 BASE COUNT 871 a 941 c 817 g 782 t ORIGIN 1 cccgactaag tgacttaaac tcccacctac tcctggaata aggagtcaaa gcccggatag 61 gcgcagtatt ctaccttgta aatactgtta tttgtatata ctgtaaatga tgacatcggt 121 gggcactaac cgagcccggg gaaactggga acaacctcaa aaccaaaacc agacacagca 181 caagcagcgg ccacaggcca ctgcagaaca aattagactt gcacagatga tttcggacca 241 taatgatgct gactttgagg agaaggtgaa acaattgatt gatattacag gcaagaacca 301 ggatgaatgt gtgattgctt tgcatgactg caatggagat gtcaacagag ctatcaatgt 361 tcttctggaa ggaaacccag acacgcattc ctgggagatg gtcgggaaga agaagggagt 421 ctcaggccag aaggatggtg gccagacgga atccaatgag gaaggcaaag aaaatcgaga 481 ccgggacaga gactatagtc ggcgacgtgg tgggccacca agacggggga gaggtgccag 541 ccgtggacga gagtttcgag gtcaggaaaa tggattggat ggcaccaaga gtggagggcc 601 ttctggaaga ggaacagaaa gaggcagaag gggccgtggc cgaggcagag gtggctctgg 661 taggcgagga ggaaggtttt ctgctcaagg aatgggaacc tttaacccag ctgattatgc 721 agagccagcc aatactgatg ataactatgg caatagcagc ggcaatacgt ggaacaacac 781 tggccacttt gaaccagatg atgggacgag tgcatggagg actgcaacag aggagtgggg 841 gactgaagat tggaatgaag atctttctga gaccaagatc ttcactgcct ctaatgtgtc 901 ttcagtgcct ctgcctgcgg agaatgtgac aatcactgct ggtcagagaa ttgaccttgc 961 tgttctgctg gggaagacac catctacaat ggagaatgat tcatctaatc tggatccgtc 1021 tcaggctcct tctctggccc agcctctggt gttcagtaat tcgaagcaga ctgccatatc 1081 acagcctgct tcagggaaca cattttctca tcacagtatg gtgagcatgt tagggaaagg 1141 atttggtgat gtcggtgaag ctaaaggcgg cagtactaca ggctcccagt tcttggagca 1201 attcaagact gcccaagccc tggctcagtt ggcagctcag cattctcagt ctggaagcac 1261 caccacctcc tcttgggaca tgggctcgac gacacaatcc ccatcactgg tgcagtatga 1321 tttgaagaac ccaagtgatt cagcagtgca cagccccttt acaaagcgcc aggcttttac 1381 cccatcttca accatgatgg aggtgttcct tcaggagaag tcacctgcag tggctacctc 1441 cacagctgca cctccacctc cgtcttctcc tctgccaagc aaatccacat cggctccaca 1501 gatgtcgcct ggatcttcag acaaccagtc ctctagccct cagccggctc agcagaaact 1561 gaaacagcag aagaaaaaag cctccttgac ttctaagatt cctgctctgg ctgtggagat 1621 gcctggctca gcagatatct cagggctaaa cctgcagttt ggggcattgc agtttgggtc 1681 agagcctgtc ctttctgatt atgagtccac ccccaccacg agcgcctctt caagccaggc 1741 tccaagtagc ctgtatacca gcacggccag tgaatcatcc tctacaattt catctaacca 1801 gagtcaggag tctggttatc agagcggccc aattcagtcg acaacctata cctcccaaaa 1861 taatgctcag ggccctcttt atgaacagag atccacacag actcggcggt accccagctc 1921 catctcttca tcaccccaaa aggacctgac tcaggcaaag aatggcttca gttctgtgca 1981 ggccacgcag ttacagacca cacaatctgt tgaaggtgct acaggctctg cagtgaaatc 2041 tgattcacct tccacttcta gcatcccccc tctcaatgaa acggtatctg cagcttcctt 2101 actgacgaca accaatcagc attcatcctc cttgggtggc ttgagccaca gtgaggagat 2161 tccaaatact accaccacac aacacagcag cacgttatct acgcagcaga ataccctttc 2221 atcatcaaca tcttctgggc gcacttcgac atccactctt ttgcacacaa gtgtggagag 2281 tgaggcgaat ctccattctt cctccagcac tttttccacc acatccagca cagtctctgc 2341 acctccccca gtggtcagtg tctcctccag tctcaatagt ggcagtagcc tgggcctcag 2401 cctaggcagc aactccactg tcacagcctc gactcgaagc tcagttgcta cgacttcagg 2461 aaaagctcct cccaacctcc ctcctggggt cccgccgttg ttgcctaatc cgtatattat 2521 ggctccaggg ctgttacatg cctacccgcc acaagtatat ggttatgatg acttgcagat 2581 gcttcagaca agatttccat tggattacta cagcatccca tttcccacac ccactactcc 2641 gctgactggg agggatggta gcctggccag caacccttat tctggtgacc tcacaaagtt 2701 cggccgtggg gatgcctcct ccccagcccc ggccacaacc ttggcccaac cccaacagaa 2761 ccagacgcag actcaccata ccacgcagca gacattcctg aacccggcgc tgcctcctgg 2821 ctacagttac accagcctgc catactatac aggggtcccg ggcctcccca gcaccttcca 2881 gtatgggcct gctgtgttcc ctgtggctcc tacctcttcc aagcagcatg gtgtgaatgt 2941 cagtgtgaat gcatcggcca cccctttcca acagccgagt ggatatgggt ctcatggata 3001 caacactgga agaaaatatc caccccctta caagcatttc tggacggctg agagctaatt 3061 tggcccaagg ctgggggctg tgttttgtgt gtgtgtataa atttgcactg aagtcttgtt 3121 tcagaaacca gaccactgag gagagcctgc tgagctgagg ccatggcctg cgtggcttgg 3181 ggaaatgagt tggtggatac cttctgggct tttgaacttg cccctccccc atttccctct 3241 cccccatgtg tctgaccctg tcttacccat ttcaagttca agcggtgcag caccttcgaa 3301 gcatcaatgc acacacctgc tgttgctttt gatttctgga aggcatgtag tttcaacttg 3361 taacaaaaat atttgtagtc ttcaataaac tgtggtattt ctttagctaa c // LOCUS D63479 6141 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0145 gene, complete cds. ACCESSION D63479 NID g1469871 KEYWORDS KIAA0145. SOURCE Homo sapiens male myeloblast cell_line:KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6141) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (13-JUL-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 6141) AUTHORS Nomura,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. IV. The coding sequences of 40 new genes (KIAA0121-KIAA0160) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (4), 167-174 (1995) MEDLINE 96127530 FEATURES Location/Qualifiers source 1..6141 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..1462 gene 1463..3505 /gene="KIAA0145" CDS 1463..3505 /gene="KIAA0145" /note="The KIAA0145 gene product is related to diacylglycerol kinase." /citation=[3] /codon_start=1 /db_xref="PID:d1010412" /db_xref="PID:g1469872" /translation="MAKKCSVLKEKLDSLLKTLDDESQASSSLPNPPPTIAEEAEDGD GSGSICGSTGDRLVASACPARPQIFRPREQLMLRANSLKKAIRQIIEHTEKAVDEQNA QTQEQEGFVLGLSESEEKMDHRVCPPLSHSESFGVPKGRSQRKVSKSPCEKLISKGSL SLGSSASLPPQPGSRDGLPALNTKILYPNVRAGMSGSLPGGSVISRLLINADPFNSEP ETLEYYTEKCVMNNYFGIGLDAKISLDFNNKRDEHPEKCRSRTKNMMWYGVLGTKELL HRTYKNLEQKVLLECDGRPIPLPSLQGIAVLNIPSYAGGTNFWGGTKEDDTFAAPSFD DKILEVVAVFGSMQMAVSRVIRLQHHRIAQCRTVKISILGDEGVPVQVDGEAWVQPPG YIRIVHKNRAQTLTRDRAFESTLKSWEDKQKCELPRPPSCSLHPEMLSEEEATQMDQF GQAAGVLIHSIREIAQSHRDMEQELAHAVNASSKSMDRVYGKPRTTEGLNCSFVLEMV NNFRALRSETELLLSGKMALQLDPPQKEQLGSALAEMDRQLRRLADTPWLCQSAEPGD EESVMLDLAKRSRSGKFRLVTKFKKEKNNKNKEAHSSLGAPVHLWGTEEVAAWLEHLS LCEYKDIFTRHDIRGSELLHLERRDLKDLGVTKVGHMKRILCGIKELSRSAPAVEA" 3'UTR 3506..6141 BASE COUNT 1309 a 1712 c 1789 g 1331 t ORIGIN 1 cgccgcccga ggagtcgtcc gacagcgagc ccgaggcgga gcccggctcc ccacagaagc 61 tcatccgcaa ggtgtccacg tcgggtcaga tccgacagaa gaccatcatc aaagagggga 121 tgctgaccaa acagaacaat tcattccagc gatcaaaaag gagatacttt aagcttcgag 181 ggcgaacgct ttactatgcc aaaacggcaa agtcaatcat atttgatgag gtggatctga 241 cagatgccag cgtagctgaa tccagtacca aaaacgtcaa caacagtttt acggtcataa 301 ctccatgcag gaagctcatc ttgtgtgctg ataacagaaa agaaatggaa gattggattg 361 cagcattaaa gactgtgcag aacagggagc actttgagcc cacccagtac agcatggacc 421 acttctcagg gatgcacaat tggtacgcct gttcccacgc gaggccgacc tactgcaatg 481 tgtgccgtga ggctctgtct ggggtcacgt cgcacgggct gtcctgcgag gtgtgcaaat 541 ttaaggccca caagcgctgt gctgtgcgtg caaccaataa ctgcaagtgg accacactgg 601 cctcgatcgg gaaggacatc attgaagatg cagatgggat tgcaatgccc caccagtggt 661 tggaaggaaa cctacctgtg agcgccaagt gcactgtgtg cgacaagacc tgtggcagtg 721 tgctgcgcct gcaggactgg cgctgcctct ggtgcaaggc catggttcac acatcgtgta 781 aagaatcctt gctgaccaag tgcccacttg gcctgtgcaa agtgtcagtc atcccaccca 841 cggctctcaa cagcatcgac tccgatgggt tctggaaggc cagctgtcct ccttcttgca 901 caagcccact gttggtcttc gtcaattcaa aaagtgggga caaccagggt gtgaagttcc 961 tcagaagatt caaacagcta ctaaaccccg cccaggtctt cgacctcatg aacggaggcc 1021 cacacctcgg cttacggtta ttccagaagt ttgacacatt ccggattctg gtttgtggcg 1081 gggatggaag tgttggctgg gtcctctccg aaatcgacag cctcaacctt cataaacagt 1141 gtcagctggg agtgctgccg ctcggcacag ggaacgactt ggcccgagta ctgggctggg 1201 gctcagcctg cgatgacgac acccagctcc cccagatctt ggagaagttg gagagagcca 1261 gcaccaagat gctggacagg tacagcagat tctcttctat gaagactcgg ttgcagccca 1321 cctttctaaa atcctcacct cggaccagca ctcggtggtc atctcctcgg ccaaagtgct 1381 ctgtgagacg gtgaaggact tcgtggcacg ggtggggaag gcctatgaga agacgaccga 1441 gagctcggag gagtcagagg tcatggccaa gaagtgctct gtcctgaaag agaagctgga 1501 ttcccttctc aagaccttgg acgatgagtc ccaggcctcg tcctctctgc ccaacccgcc 1561 ccccaccatt gccgaggagg ctgaagatgg agatgggtcg ggcagcatct gcggttccac 1621 cggagaccgc ttggtggcat cagcttgccc ggcccggccg cagatattcc ggcctcgaga 1681 acagctcatg ctgagagcca acagcctgaa gaaagcaatt cgtcagatca tagaacacac 1741 agaaaaagct gtcgatgagc agaatgccca gacccaggag caggagggct tcgtcctggg 1801 cctctctgag tcagaggaga agatggacca cagagtgtgc ccaccactgt cccacagcga 1861 gagcttcggg gtccccaagg ggaggagcca gcgcaaagtg tcgaaatctc cgtgtgaaaa 1921 gctgatcagc aaagggagtc tgtccctagg cagttctgct tcccttccgc cccagccggg 1981 aagccgggac ggcttgcctg cgctcaacac caagatcctg tacccaaatg tccgggctgg 2041 aatgtctggt tccttacccg gtggctcagt catcagtcgc ctgttaatta atgctgatcc 2101 cttcaactct gaaccagaaa ccctagagta ttacacggag aaatgtgtca tgaacaacta 2161 ttttggcatt ggcctggatg cgaagatatc cctggacttt aacaacaagc gcgatgagca 2221 cccagagaag tgcaggagcc gaaccaagaa catgatgtgg tatggagttc ttggaaccaa 2281 agagttgctg cacagaacct acaagaacct ggagcaaaag gtcttgctgg agtgtgacgg 2341 gcgacccatc ccactcccca gtcttcaggg aattgctgtc cttaacattc ccagctatgc 2401 cggaggaacc aacttctggg ggggtaccaa ggaagatgat actttcgcag ctccatcatt 2461 cgatgacaag attctggagg tggtcgccgt gttcggcagc atgcagatgg ccgtctctcg 2521 agtcatcagg ctacagcatc atcggatcgc ccagtgtcgc acggtgaaga tctccatcct 2581 tggggatgag ggcgtgcctg tgcaggtgga cggagaggcc tgggtccagc cgccagggta 2641 cattcggatt gtccacaaga accgggcaca gacactgacc agagacaggg catttgagag 2701 caccctgaag tcctgggaag acaagcagaa gtgcgagctg ccccgccctc catcctgttc 2761 cctgcacccg gagatgctgt ccgaggagga ggccacccag atggaccagt ttgggcaggc 2821 agcaggggtc ctcattcaca gtatccgaga aatagctcag tctcaccggg acatggagca 2881 ggaactggcc cacgccgtca atgccagctc caagtccatg gaccgtgtgt atggcaagcc 2941 cagaaccaca gaggggctca actgcagctt cgtcctggaa atggtgaata acttcagagc 3001 tctgcgcagt gagacggagc tgctgctgtc tgggaagatg gccctgcagc tggatccgcc 3061 tcagaaggag cagctgggga gtgctcttgc cgagatggac cgacagctca ggaggctggc 3121 agacaccccg tggctctgcc agtccgcaga gcccggcgac gaagagagtg tgatgctgga 3181 tcttgccaag cgcagtcgca gtggtaaatt ccgcctcgtg accaagttta aaaaggagaa 3241 aaacaacaag aacaaagaag ctcacagtag cctgggagcc ccggttcacc tctgggggac 3301 agaggaggtt gctgcctggc tggagcacct cagtctctgt gagtataagg acatcttcac 3361 acggcacgac atccggggct ctgagctcct gcacctggag cggagggacc tcaaggacct 3421 gggcgtgacc aaggtgggcc acatgaagag gatcctgtgt ggcatcaagg agctgagccg 3481 cagcgccccc gccgtcgagg cctagcctct gtcctctcag cctgtggcct ccacatcccc 3541 gccgccgagg cctagcctcc gccctctcag cctgtggcct ctgcgcctcc tgccactgag 3601 gccctgggca gatgctgcag cccgccccct tctcatggtg ctacttcctc tgtcagctac 3661 agaaagcctc cgtgacaccg tccaccagag ctctggggtc tcgaacataa caacacagct 3721 acctttgaaa caacactttc tccagctcag agtcacctgg ggcacatgtg tcacggccac 3781 tcagctctcg cccgcctgtg ctgtgggcca gggaatccag cggcgtctgg cctcctgggc 3841 actgcttgcc tggcctcgtg cttggattgt cccgggggct cctctccgtg tgtccttctg 3901 tggccgcacc gtgtggctcc gcctcctggc ccccagccag ttctcagaaa cgtggctggg 3961 gcccagcaca gcagcctgca agggcccctg tttgttgatg cagcttttgt tgaacaaaaa 4021 tcgtgctctt tcctggtttg aaagtagcat ggatgtttcc agtcttgttg attgtaattt 4081 gacgtgaaga gaaaaaaaaa ttcctcctgc gtgagccaag gcagcgggtg ctgtttccca 4141 ggcggggagc ccctccctgg gtgtcacagg gcctgtgctc ctccctcctc catcctctct 4201 cctcccgctc ctccctcccc ccactgtggg ctggggacgc ctgcccttct gtctccggac 4261 gctctaggcg agttcagctt ggggtgtgag tgagacagct cgccagctgc atccctgcag 4321 acagaggatg tgtgtccaca tgagtgtttc tgtgtgggaa atgcttcctg gctctgggaa 4381 actttttctg cccattctgt ggttcccagg gagcgtggcc ctggtgggcc aggggtggtt 4441 tgacctcttc agcccgtccg gtggcctgga ggccggaggc tctcctgagt gtctgcccct 4501 gcagtggctt cttgtcgcct gctgctgggc gtgatgtcgc tggaggtgct ggcagggact 4561 ctgatttggt ggtccgcgct gcccctgccc tgcctctgtc ctggctctga actagtagat 4621 gatggtgcca gagggcaggg agctcgcctg gggagagggc tgtgccccgt agggacagtg 4681 cccaggtgaa ggatgcccct ggtcctccag ggcactgact ttgccctttt ttcccgttga 4741 tagtcatggc tcagaggtgc ttgtaaatgt cttgggaaga ggtttctgta acccctgccc 4801 tggtgtgagg aggaaatggc tctggcctgg ctgcctggcc gtggcttctc tttggctccc 4861 aaagagaagg acagtgttgg gagtatctgc cgtggcttct ctttggctcc caaagagaag 4921 gacagtgttg ggagtatctg ccggcgctgt ccaggtcctt tagtcagcgt cactccatct 4981 gatgtgcaga agctgggctg cacctgcggg ggtgggcata gaccgggctg ggtctgcagc 5041 agcccctggt cctgagcagg cggcagtgaa cagcactggc ccacctccca ctcacagccc 5101 ctctgtcccc tctgcagtgc acccaggtgg gcccctctgc gtgcctttgg gtgctcccct 5161 ctcgtggtcg ttctggcccg aggcccttag agtatggagg ctgagccagg ccttgggttt 5221 ccccagcaca gcctcctgtc gctgcatgcg acgtgttggg atttttggat gaaagactct 5281 cccacgctct gttggtggac ttagctgcct cactggaagt gatgtgggtg gaaggtggtt 5341 gtatgttacc ttttccacct ctcattgttt tccccagaac attgtagatg ggggttggca 5401 gagggagaaa taagccagcc acggcagtcg cttggtttcc caggtggaat gggctaacac 5461 aggagatgat gggaacctgt cccgcagtcc ctgcatgacc attggccctg ctggcctggc 5521 gatgtgggca tcctggggtt cttagggtcc cagaacaagc cccaggcaag ctggaacttg 5581 ggtggggagg ggacatgagg aggataaaca gctgactgtg gcttcaagga catcagggcc 5641 accccaagtc ctcagtgtcc tactcctggc aaggagttgg gtttggatca aaagtgttta 5701 aaattaatat gttgtcagtg attagaacaa cactgtttac ataaaaacca tttttctaat 5761 tctaacaagt tagaatgtga ggaaggaatg aacatgagtg tttaggaacc tgccctttgg 5821 tgctgggctg gcgtcccgca ctggggtgtc ctcgctgtct gggggctgct ctgctgcccg 5881 gcccaggtcc ccttgtggtg ttgccagacg ggcctcatgg tctgctgtgc agagagaggc 5941 aggaaggatc cctgaagagt cttggagaaa aggttctgtg ccctcaggtg gggcttaccc 6001 cctcgtattt ataatcttaa tttatatagt gaccaccgtg gaaacaaacg cctcttgtat 6061 tgtcatgtac atagtccata cctgagtgct gtacataagt tgttctgtgt ataaataaaa 6121 caagcctgtt tttgatcttc c // LOCUS D63482 2287 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0148 gene, complete cds. ACCESSION D63482 NID g1469877 KEYWORDS KIAA0148. SOURCE Homo sapiens male myeloblast cell_line:KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2287) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (13-JUL-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 2287) AUTHORS Nomura,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. IV. The coding sequences of 40 new genes (KIAA0121-KIAA0160) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (4), 167-174 (1995) MEDLINE 96127530 FEATURES Location/Qualifiers source 1..2287 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..112 gene 113..1528 /gene="KIAA0148" CDS 113..1528 /gene="KIAA0148" /note="The KIAA0148 gene product is related to KIAA0041 and KIAA0050 proteins." /citation=[3] /codon_start=1 /db_xref="PID:d1010415" /db_xref="PID:g1469878" /translation="MSKRLRSSEVCADCSGPDPSWASVNRGTFLCDECCSVHRSLGRH ISQVRHLKHTPWPPTLLQMVETLYNNGANSIWEHSLLDPASIMSGRRKANPQDKVHPN KAEFIRAKYQMLAFVHRLPCRDDDSVTAKDLSKQLHSSVRTGNLETCLRLLSLGAQAN FFHPEKGNTPLHVASKAGQILQAELLAVYGADPGTQDSSGKTPVDYARQGGHHELAER LVEIQYELTDRLAFYLCGRKPDHKNGQHFIIPQMADSSLDLSELAKAAKKKLQSLSNH LFEELAMDVYDEVDRRETDAVWLATQNHSALVTETTVVPFLPVNPEYSSTRNQGRQKL ARFNAHEFATLVIDILSDAKRRQQGSSLSGSKDNVELILKTINNQHSVESQDNDQPDY DSVASDEDTDLETTASKTNRQKSLDSDLSDGPVTVQEFMEVKNALVASEAKIQQLMKV NNNLSDELRIMQKKLLGKDAN" 3'UTR 1529..2287 BASE COUNT 665 a 518 c 570 g 534 t ORIGIN 1 cgccgccgtc agcgccgccg cagctgggac ccgttagagc ggaagcgccg ccgccaccgc 61 cgcctttgct gtcccccggc ctctagttcc ccgcaggtgg gaggtgggag ccatgtcgaa 121 acggctccgg agcagcgagg tgtgcgctga ctgcagcggg ccggatcctt cctgggcatc 181 agtaaatagg ggaacgtttt tatgtgatga gtgctgcagt gtccatcgga gtctagggcg 241 ccatatctcc caagtgaggc atctgaaaca cacaccgtgg cctccaacac tgcttcagat 301 ggttgagacc ttgtataata acggtgctaa ctctatatgg gagcattctt tgctggaccc 361 tgcgtctatt atgagtggaa gacgtaaagc taatccacag gataaagtac atcccaataa 421 agcggaattc atcagagcca agtatcagat gttagcgttc gtccatcgct tgccctgccg 481 ggatgacgat agtgtgactg ccaaagatct tagcaagcaa ctccattcga gcgtgagaac 541 agggaatctt gaaacctgtt tgagactgtt atctttagga gcacaagcca acttctttca 601 tcctgaaaaa ggaaacaccc cactccatgt tgcctccaaa gcagggcaga ttttacaggc 661 tgaattattg gcagtatatg gagcagaccc aggcacacag gattctagtg ggaaaactcc 721 cgttgattat gcaaggcaag gagggcacca tgagctggca gagcgcctcg tggaaataca 781 gtatgagcta acggacagac tagccttcta tctctgtggc aggaaaccag atcacaaaaa 841 tggacagcac tttataatac ctcaaatggc agacagcagc ctggatttgt ctgaattggc 901 aaaagctgct aagaagaaac ttcaatctct aagtaatcat ttgtttgaag aacttgccat 961 ggatgtgtac gatgaagttg acaggcgaga gacggatgca gtctggcttg ccacgcaaaa 1021 ccacagcgcc ctggtaaccg agacaacggt cgtccccttt cttccggtca atcctgagta 1081 ctcatcaaca cgaaatcagg gcagacagaa gttagctcgg ttcaacgccc atgagtttgc 1141 cacgctggtc attgacattc tcagtgacgc caagaggaga cagcagggca gttctctctc 1201 gggttcaaaa gacaatgtgg agctcatact gaaaaccatc aataaccagc acagcgttga 1261 gagtcaagac aacgatcagc ccgactatga cagcgtggca tcagacgaag acacagattt 1321 ggaaaccact gcaagcaaaa caaaccggca gaagagccta gattcagatt tatcagatgg 1381 accagtcact gtacaggaat ttatggaggt caaaaacgct ctagtggctt ctgaggccaa 1441 gatacagcag ctaatgaagg tgaataacaa cttgagtgac gagctgagaa ttatgcagaa 1501 aaagttgctt ggaaaagatg ctaattaatg aagaggagca actactattg gtgtattttt 1561 cacagattgg tgctttctaa ataaaaattg aaagtaactc ctaacattga atgggtttgc 1621 tactgaaaaa gtaatgatct tctggtgcaa acagttgctt gtggacttaa accttggcac 1681 tggtggggaa tttggtcaga ttttacaatc tctgtcaaag agtagacagc tgaactcaca 1741 ccacacccag cttatagaat gtccatggaa gatgaaggcg caccagaagg gaaggaccct 1801 gcgcagaatg gacgtggtga atggtgttta aaatgccaga tgccaaagag taacacgatt 1861 ccctgctgac cccttaactc taatccatcc agcaccatgg agcagcctgc atgtgaggaa 1921 tggaaggagc attcagggcc tccaagtgac agtctctaaa atggggtggt gccaggcaga 1981 ttagcatgtt caaagctgac accactgagg tcgtgttttt tgggtgacaa agccaaagga 2041 gagaaaggcc aaatattcca gccctggccg aaagatgatc actcaccaga cggaagcaag 2101 cgtgctgcgt ggagcatcca tgcgaagatg tcaattccat agatcaatag gtttccagtt 2161 ttcttcgtga tatgttaata tagcaaactt accatgatcc ggttttctct tttttctctt 2221 tttttttaca aagtgctgaa ttgtttggaa tatcaagagt attatgtaat aaaaacttgt 2281 tgatcat // LOCUS D63483 3387 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0149 gene, complete cds. ACCESSION D63483 NID g1469879 KEYWORDS KIAA0149. SOURCE Homo sapiens male myeloblast cell_line:KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3387) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (13-JUL-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 3387) AUTHORS Nomura,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. IV. The coding sequences of 40 new genes (KIAA0121-KIAA0160) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (4), 167-174 (1995) MEDLINE 96127530 FEATURES Location/Qualifiers source 1..3387 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" gene 3..2495 /gene="KIAA0149" CDS 3..2495 /gene="KIAA0149" /note="The KIAA0149 gene product is related to Notch3." /citation=[3] /codon_start=1 /db_xref="PID:d1010416" /db_xref="PID:g1469880" /translation="MGLGLLLPLLLLWTRGTQGSELDPKGQHVCVASSPSAELQCCAG WRQKDQECTIPICEGPDACQKDEVCVKPGLCRCKPGFFGAHCSSRCPGQYWGPDCRES CPCHPHGQCEPATGACQCQADRWGARCEFPCACGPHGRCDPATGVCHCEPGWWSSTCR RPCQCNTAAARCEQATGACVCKPGWWGRRCSFRCNCHGSPCEQDSGRCACRPGWWGPE CQQQCECVRGRCSAASGECTCPPGFRGARCELPCPAGSHGVQCAHSCGRCKHNEPCSP DTGSCESCEPGWNGTQCQQPCLPGTFGESCEQQCPHCRHGEACEPDTGHCQRCDPGWL GPRCEDPCPTGTFGEDCGSTCPTCVQGSCDTVTGDCVCSAGYWGPSCNASCPAGFHGN NCSVPCECPEGLCHPVSGSCQPGSGSRDTALIVGSLVPLLLLFLGLACCACCCWAPRS DLKDRPARDGATVSRMKLQVWGTLTSLGSTLPCRSLSSHKLPWVTVSHHDPEVPFNHS FIEPPSAGWATDDSFSSDPESGEADEVPAYCVPPQEGMVPVAQAGSSEASLAAGAFPP PEDASTPFAIPRTSSLARAKRPSVSFAEGTKFAPQSRRSSGELSSPLRKPKRLSRGAQ SGPEGREAEESTGPDEAEAPESFPAAASPGDSATGHRWPPLGSRTVAEHVEAIEGSVQ ESSGPVTTIYMLAGKPRGSEGPVRSVFRHFGSFQKGQAEAKVKRAIPKPPRQALNRKK GSPGLASGSVGQSPNSAPKAGLPGATGPMAVRPEEAVRGLGAGTESSRRAQEPVSGCG SPEQDPQKQAEEERQEEPEYENVVPISRPPEP" 3'UTR 2496..3387 BASE COUNT 635 a 1054 c 1039 g 659 t ORIGIN 1 ccatggggct ggggctgctg ctcccgctgc tgctgctctg gactcggggg actcaggggt 61 ccgagctgga ccccaaaggg cagcacgtct gtgtggccag cagcccctct gctgagctgc 121 agtgctgcgc aggctggagg cagaaggatc aagaatgcac catccccatc tgtgaggggc 181 cggacgcctg ccagaaagac gaggtgtgtg tgaagccggg cctctgtcga tgcaagcctg 241 gattctttgg ggcccactgc agctcccgct gcccgggcca gtactggggc cccgactgcc 301 gtgagagctg cccctgccac ccgcacggcc agtgcgagcc agccacgggc gcgtgccagt 361 gccaggccga ccgctgggga gcccgctgcg agttcccgtg cgcctgcggc ccccacgggc 421 gctgcgaccc cgcgaccggc gtgtgccact gcgaacccgg ctggtggtcg tccacgtgcc 481 gccgcccgtg ccagtgcaac accgcggcgg cgcgctgcga gcaggccacg ggcgcctgcg 541 tgtgcaagcc gggctggtgg gggcgccgct gcagcttccg ctgcaactgc cacggctccc 601 cgtgcgagca ggactccggc cgctgcgcct gccggccggg ctggtggggt cccgaatgcc 661 agcagcagtg cgagtgtgtg cggggccgct gcagcgccgc ctccggcgag tgcacctgcc 721 cgcccggctt ccgcggagcg cgctgcgagc tgccctgccc ggcaggcagc cacggggtgc 781 agtgcgcaca cagctgtggc cgctgcaaac acaatgagcc gtgctctcca gacacaggca 841 gctgtgagtc ctgcgagccg ggctggaacg ggacccagtg ccagcagccc tgcctgcctg 901 gcacctttgg cgagagctgc gaacagcagt gccctcactg ccgacatggg gaggcctgtg 961 agccagatac tggccactgt cagcgctgtg accctggctg gctggggccc aggtgtgaag 1021 acccctgccc cactggtacc tttggggaag actgtggctc tacctgcccc acctgtgttc 1081 aggggtcctg tgatactgtg acaggggact gtgtctgcag tgccggctac tgggggccca 1141 gctgcaacgc ctcctgccca gccggtttcc atggaaacaa ctgctcagtt ccttgtgaat 1201 gcccagaggg actctgccac cctgtctctg ggtcctgcca gccaggctct ggcagtcggg 1261 acactgccct catcgtgggc agccttgtgc ctctgctgct gctcttcctg ggccttgcct 1321 gctgtgcctg ctgctgctgg gccccccgat cagacctcaa ggacaggcca gcgagagatg 1381 gagctaccgt gtccaggatg aagctgcagg tctgggggac actgaccagc ttgggctcca 1441 cgctgccctg ccgttccctc agctcccaca agctaccctg ggtgacagtc tcacatcacg 1501 acccggaggt ccccttcaac cacagcttca tcgagccgcc ctctgccggc tgggccactg 1561 atgactcctt ctcatccgat cctgagtctg gagaggcaga tgaggttcct gcctactgtg 1621 tgccacccca agaagggatg gtccctgtgg cccaggcagg gtcgtcagag gccagcctgg 1681 ctgcaggtgc tttcccgccc cctgaggacg cctccacgcc attcgccatc ccgcgcacct 1741 ccagcctagc tcgggccaag cggccatcgg tctccttcgc ggaaggtacc aagtttgcac 1801 cacagagtcg ccgaagctca ggggagctct ccagcccgct ccgaaagccc aagaggctct 1861 cccggggggc gcagtcgggt cctgagggcc gggaagccga agagtccaca ggcccagacg 1921 aagcagaagc ccccgagtcc tttccggcgg ctgccagtcc cggggattca gccactggcc 1981 accggtggcc cccacttggt agccggacag tggctgagca cgtggaagcc attgagggca 2041 gcgtccagga gagctcgggc cctgtgacca cgatctacat gctggcaggg aagccccgcg 2101 gatccgaagg ccctgtccgc tctgtcttcc gccattttgg tagcttccag aaaggccagg 2161 cggaagccaa ggtcaagagg gccatcccta agcctccgcg ccaggccctg aatcggaaaa 2221 agggcagccc tggccttgcc tctggctctg tcggccagag ccccaactca gccccaaaag 2281 ctgggcttcc tggggccaca gggcctatgg cagtcagacc agaggaagcg gtccgggggc 2341 tgggggctgg caccgagagt tcaaggagag cccaggagcc agtctctggc tgtggctccc 2401 cagaacagga tccccagaag caggctgaag aggaaaggca ggaggaacct gagtatgaga 2461 atgttgtacc catctccagg ccaccagaac cctgatgacc ttgaatttgg ggagtgggga 2521 gagtggatgg actagactgt gctgtgtgct ggaaaatgat cccggggcca ggacagacaa 2581 accagagcct ctgcgcctcc acagggaaaa ggcaaggctt ccaggccagt tggcccaggc 2641 ccctggcagt gctcccggag gggcccagga aggcctgggc agagaccctg taggatgggg 2701 tcaggaaggg ttgcctgcag ggacttttgc tctgctgtcc tggaccctgt gtgcctcata 2761 agggctattc tttctttcac gtgcaaaaca tttttctgaa atagcaaaca acctacatgt 2821 ttgctgataa aagattggct aaacaaaatt tttttttttt tttgagacag aatctccctc 2881 tgtcccccag gctggagtgc agtggtgcga tctcggctca ctgcaagctc tgcctcccgg 2941 gttcacgccc ttctcctgcc tcagcctccc gagtagctgg gactacaggt gccctccacc 3001 atgcttggct aatttttttg tatatttaat agagacaggg tttcaccatg ttagccagga 3061 tggtctggat ctcctgacct cgtgatccac ctgcctcggc ctcccaaagt gctgggatga 3121 caggcatgag ccaccacgcc tggtctatga actttttaaa aaggatgtat gtgtataaaa 3181 acagattcaa gggaaaggca ctaaatggtt ttttcctctg gaagatgaga ttgtaggtga 3241 tatttatttt cttctgaaac ttttgtatag tttgcaaatt ttctacagtg aacattcttt 3301 tttacttttg ttactagatt gaatttgata aagtataata aaaagcaatg atctttgtta 3361 aaaaaataaa aagtactaac attacag // LOCUS D63485 3221 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0151 gene, complete cds. ACCESSION D63485 NID g1469883 KEYWORDS KIAA0151. SOURCE Homo sapiens male myeloblast cell_line:KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3221) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (13-JUL-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 3221) AUTHORS Nomura,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. IV. The coding sequences of 40 new genes (KIAA0121-KIAA0160) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (4), 167-174 (1995) MEDLINE 96127530 FEATURES Location/Qualifiers source 1..3221 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..326 gene 327..2477 /gene="KIAA0151" CDS 327..2477 /gene="KIAA0151" /note="The KIAA0151 gene product is classified into serine/threonine kinase." /citation=[3] /codon_start=1 /db_xref="PID:d1010418" /db_xref="PID:g1469884" /translation="MQSTANYLWHTDDLLGQGATASVYKARNKKSGELVAVKVFNTTS YLRPREVQVREFEVLRKLNHQNIVKLFAVEETGGSRQKVLVMEYCSSGSLLSVLESPE NAFGLPEDEFLVVLRCVVAGMNHLRENGIVHRDIKPGNIMRLVGEEGQSIYKLTDFGA ARELDDDEKFVSVYGTEEYLHPDMYERAVLRKPQQKAFGVTVDLWSIGVTLYHAATGS LPFIPFGGPRRNKEIMYRITTEKPAGAIAGAQRRENGPLEWSYTLPITCQLSLGLQSQ LVPILANILEVEQAKCWGFDQFFAETSDILQRVVVHVFSLSQAVLHHIYIHAHNTIAI FQEAVHKQTSVAPRHQEYLFEGHLCVLEPSVSAQHIAHTTASSPLTLFSTAIPKGLAF RDPALDVPKFVPKVDLQADYNTAKGVLGAGYQALRLARALLDGQELMFRGLHWVMEVL QATCRRTLEVARTSLLYLSSSLGTERFSSVAGTPEIQELKAAAELRSRLRTLAEVLSR CSQNITETQESLSSLNRELVKSRDQVHEDRSIQQIQCCLDKMNFIYKQFKKSRMRPGL GYNEEQIHKLDKVNFSHLAKRLLQVFQEECVQKYQASLVTHGKRMRVVHETRNHLRLV GCSVAACNTEAQGVQESLSKLLEELSHQLLQDRAKGAQASPPPIAPYPSPTRKDLLLH MQELCEGMKLLASDLLDNNRIIERLNRVPAPPDV" 3'UTR 2478..3221 BASE COUNT 710 a 941 c 949 g 621 t ORIGIN 1 caccgccaca aggaggcagg gaagaaaccc actagtccca gctcctgggg tggcacagac 61 attgcaactg gccctgcctg tgggtcctag gggcccttgg ctaccaggag gctaagaaca 121 ctgctcatga atgacagtga gccctgaaag ctctgggggt gtcacccagt cccacaagcc 181 tgcatcccct gcagtggaga tgggctcagc tcctggacgt gccacagaca gaaagcataa 241 catacactcg ccaggaagag cctttgcctg actcagggca gctcagagtg tggggcagaa 301 ggtgaccagc cagctcaggg caggagatgc agagcacagc caattacctg tggcacacag 361 atgacctgct ggggcagggg gccactgcca gtgtgtacaa ggcccgcaac aagaaatccg 421 gagagctggt tgctgtgaag gtcttcaaca ctaccagcta cctgcggccc cgcgaggtgc 481 aggtgaggga gtttgaggtc ctgcggaagc tgaaccacca gaacatcgtc aagctctttg 541 cggtggagga gacgggcgga agccggcaga aggtactggt gatggagtac tgctccagtg 601 ggagcctgct gagtgtgctg gagagccctg agaatgcctt tgggctgcct gaggatgagt 661 tcctggtggt gctgcgctgt gtggtggccg gcatgaacca cctgcgggag aacggcattg 721 tgcatcgcga catcaagccg gggaacatca tgcgcctcgt aggggaggag gggcagagca 781 tctacaagct gacagacttc ggcgctgccc gggagctgga tgatgatgag aagttcgtct 841 cggtctatgg gactgaggag tacctgcatc ccgacatgta tgagcgggcg gtgcttcgaa 901 agccccagca aaaagcgttc ggggtgactg tggatctctg gagcattgga gtgaccttgt 961 accatgcagc cactggcagc ctgcccttca tcccctttgg tgggccacgg cggaacaagg 1021 agatcatgta ccggatcacc acggagaagc cggctggggc cattgcaggt gcccagaggc 1081 gggagaacgg gcccctggag tggagctaca ccctccccat cacctgccag ctgtcactgg 1141 ggctgcagag ccagctggtg cccatcctgg ccaacatcct ggaggtggag caggccaagt 1201 gctggggctt cgaccagttc tttgcggaga ccagtgacat cctgcagcga gttgtcgtcc 1261 atgtcttctc cctgtcccag gcagtcctgc accacatcta tatccatgcc cacaacacga 1321 tagccatttt ccaggaggcc gtgcacaagc agaccagtgt ggccccccga caccaggagt 1381 acctctttga gggtcacctc tgtgtcctcg agcccagcgt ctcagcacag cacatcgccc 1441 acacgacggc aagcagcccc ctgaccctct tcagcacagc catccctaag gggctggcct 1501 tcagggaccc tgctctggac gtccccaagt tcgtccccaa agtggacctg caggcggatt 1561 acaacactgc caagggcgtg ttgggcgccg gctaccaggc cctgcggctg gcacgggccc 1621 tgctggatgg gcaggagcta atgtttcggg ggctgcactg ggtcatggag gtgctccagg 1681 ccacatgcag acggactctg gaagtggcaa ggacatccct cctctacctc agcagcagcc 1741 tgggaactga gaggttcagc agcgtggctg gaacgcctga gatccaggaa ctgaaggcgg 1801 ctgcagaact gaggtccagg ctgcggactc tagcggaggt cctctccaga tgctcccaaa 1861 atatcacgga gacccaggag agcctgagca gcctgaaccg ggagctggtg aagagccggg 1921 atcaggtaca tgaggacaga agcatccagc agattcagtg ctgtttggac aagatgaact 1981 tcatctacaa acagttcaag aagtctagga tgaggccagg gcttggctac aacgaggagc 2041 agattcacaa gctggataag gtgaatttca gtcatttagc caaaagactc ctgcaggtgt 2101 tccaggagga gtgcgtgcag aagtatcaag cgtccttagt cacacacggc aagaggatga 2161 gggtggtgca cgagaccagg aaccacctgc gcctggttgg ctgttctgtg gctgcctgta 2221 acacagaagc ccagggggtc caggagagtc tcagcaagct cctggaagag ctatctcacc 2281 agctccttca ggaccgagca aagggggctc aggcctcgcc gcctcccata gctccttacc 2341 ccagccctac acgaaaggac ctgcttctcc acatgcaaga gctctgcgag gggatgaagc 2401 tgctggcatc tgacctcctg gacaacaacc gcatcatcga acggctaaat agagtcccag 2461 cacctcctga tgtctgagct ccatggggca catgaggcat cctgaagcat tagaatgatt 2521 ccaacactgc tcttctgcac catgagacca acccagggca agatcccatc ccatcacatc 2581 agcctacctc cctcctggct gctggccagg atgtcgccag cattaccttc cactgccttt 2641 ctccctggga agcagcacag ctgagactgg gcaccaggcc acctctgttg ggacccacag 2701 gaaagagtgt ggcagcaact gcctggctga cctttctatc ttctctaggc tcaggtactg 2761 ctcctccatg cccatggctg ggccgtgggg agaagaagct ctcatacgcc ttcccactcc 2821 ctctggttta taggacttca ctccctagcc aacaggagag gaggcctcct ggggtttccc 2881 cagggcagta ggtcaaacga cctcatcaca gtcttccttc ctcttcaagc gtttcatgtt 2941 gaacacagct ctctccactc ccttgtgatt tctgagggtc accactgcca gcctcaggca 3001 acatagagag cctcctgttc tttctatgct tggtctgact gagcctaaag ttgagaaaat 3061 gggtggccaa ggccagtgcc agtgtcttgg ggcccctttg gctctccctc actctctgag 3121 gctccagctg gtcctgggac atgcagccag gactgtgagt ctgggcacgt ccaaggcctg 3181 caccttcaag aagtggaata aatgtggcct ttgcttctgt t // LOCUS D63486 6322 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0152 gene, complete cds. ACCESSION D63486 NID g1469885 KEYWORDS KIAA0152. SOURCE Homo sapiens male myeloblast cell_line:KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6322) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (13-JUL-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 6322) AUTHORS Nomura,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. IV. The coding sequences of 40 new genes (KIAA0121-KIAA0160) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (4), 167-174 (1995) MEDLINE 96127530 FEATURES Location/Qualifiers source 1..6322 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..128 gene 129..1007 /gene="KIAA0152" CDS 129..1007 /gene="KIAA0152" /note="The KIAA0152 gene product is related to a putative C.elegans gene encoded in cosmid F44E2." /citation=[3] /codon_start=1 /db_xref="PID:d1010419" /db_xref="PID:g1469886" /translation="MLGAWAVEGTAVALLRLLLLLLPPAIRGPGLGVAGVAGAAGAGL PESVIWAVNAGGEAHVDVHGIHFRKDPLEGRVGRASDYGMKLPILRSNPEDQILYQTE RYNEETFGYEVPIKEEGDYVLVLKFAEVYFAQSQQKVFDVRLNGHVVVKDLDIFDRVG HSTAHDEIIPMSIRKGKLSVQGEVSTFTGKLYIEFVKGYYDNPKVCALYIMAGTVDDV PKLQPHPGLEKKEEEEEEEEYDEGSNLKKQTNKNRVQSGPRTPNPYASDNSSLMFPIL VAFGVFIPTLFCLCRL" 3'UTR 1008..6322 BASE COUNT 1449 a 1477 c 1643 g 1753 t ORIGIN 1 ctgagagcga catgtccccg gcggctcagg cggagcggcc cgtggcgctg tttttctgag 61 tccggggtgg cctggcagcc ggccgaggac gagggtcggc gggggctgcc cccgtggtgg 121 tggccgccat gctgggagcc tgggcggttg agggaaccgc tgtggcgctc ctgcgactgc 181 tgctgctgct gctgccgccg gcgatccggg gacccgggct cggcgtggcc ggcgtggccg 241 gcgcggcggg ggccgggctg cccgagagcg tcatttgggc ggtcaacgcg ggtggagagg 301 cgcatgtgga cgtgcacggg atccacttcc gcaaggaccc tttggaaggc cgggtgggcc 361 gagcctcaga ctatggcatg aaactgccaa tcctgcgttc caaccctgag gaccagatcc 421 tgtatcaaac tgagcggtac aatgaggaga cctttggcta cgaagtgccc atcaaagagg 481 agggggacta cgtgctggtc ttgaaatttg cagaggtcta ctttgcacag tcccagcaaa 541 aggtatttga tgtacgattg aatggccacg tcgtggtgaa ggacttggat atctttgatc 601 gtgttgggca tagcacagct cacgatgaaa ttatacctat gagcatcaga aaggggaagc 661 tgagtgtcca gggggaggtg tccaccttca cagggaaact ctacattgag tttgtcaagg 721 ggtactatga caatcccaag gtctgtgcac tctacatcat ggctgggaca gtggatgatg 781 taccaaagct tcagcctcat ccgggattgg agaagaaaga agaggaagaa gaagaagaag 841 aatatgatga agggtctaat ctcaaaaaac agaccaataa gaaccgggtg cagtcaggcc 901 cccgcacacc caacccctat gcctcggaca acagcagcct catgtttccc atcctggtgg 961 ccttcggagt cttcattcca accctcttct gcctctgccg gttgtgagaa caaatgacta 1021 tcctgaacag ggtggagggg tgtgggaaag aaaccagcca tattggtttt ggtttctgta 1081 tttttcacaa tgattaatga acaaaaacaa agagaaaaaa aacacacatc aattaaagga 1141 gacaaaaaga ggcagagcga gtagagagca gccctcattc accacctggt cccagacgtg 1201 cttcagtcct cgtcctctct ttgtggctgg ctcccagcct tctctttcct cttgaggata 1261 cttagggtaa actggatcct tcctgctcaa ggatcctcat ttgtatacct agtggaaagg 1321 actctgaact cagaggagtc actgttcctt tttttaggtt agaaattaac agcagggaaa 1381 tgccatctta ttacctgaga cgaccagcac tgggagttag gtacggtctg aagttatgtc 1441 tagataagac ttcagacgtc ctgggattga aagaatgtgt gtgaaggggt agaatttgtg 1501 cggtaaagac ttaaaaaaaa aagtagggag attaaaaaaa aagaaagaaa atgcttcctt 1561 atctggaagc ctttctggat taatccagtg atggtcccac ctttagtgtt tgagctttgt 1621 cattgcttgt ctccctggca tgtgccagtt atagactgtc cagcatccaa gacgtttcgg 1681 ttatgtcggg tcctcagatc gcctctgact tgttaccaca acaaatcatt ttgatttcag 1741 tgcctgttgg ggacttgatt tcttctcagg ttttgtttgt ttgtttgttt ccttaatctg 1801 gctcatttga aatttcttct ccctctcaac catcccacta agttatagcc aagaagggaa 1861 ggagacacgg ggatttgggg ttctctgctt gaatgtcttc tcctttacca cctcaccttg 1921 ttggtacctc cctccctgga tctctgagcc agcagccagg aggacctgac ccagcagttc 1981 tttactggcc cctttgtagg gccttgctgc cagggggcag ggatgctttc cagcctgcag 2041 caacagaaca cttgacctta aaagtctctt ctggtctttg gattagaaaa ggcttatgtt 2101 agcatagctt aagagcaacc tcagagactt gagccctact aagtgactga ccactgttta 2161 gagtgtctgg tatctgatgt tcatttattc ccatgttctt gtgtgtcaca gttcagccag 2221 ttttggttta tgcctagagc tacttcaagg aactagacta attagctata taggcccagc 2281 gatgcttctt attgatctta atagtatgcc cttccttccc ctgtcctttc atttctctat 2341 ccaagtagca gtcaggttct tggtgtgatg ggactgaaag aattccagtc agccagagcc 2401 ttggcagctc tgaagctaac cttagcatct aagtgtcgat cttgaattcc ctgaaaaaat 2461 ttctatagga aatgaagctt ccctggtccc ctcctttctg gccattgtca tccatttccc 2521 agttagggca acaatgaagg aggacccagc caagctagaa ggaattttgt ggatgggaga 2581 cagcaggatt agcttcagct tgggctggag cagtcaatat aggatctcag gccaggcccg 2641 cttttctaga atgtgtttaa ttttgagttt gctttattag atatgttttt taagagctct 2701 gtatatttga actgctcctt atgtgacaaa ataggtagct cttgggctca tgtcctgggt 2761 tttggctctt taatgattac tccaggccag catttagtcg tttgagaatt gtagcctgtt 2821 gttttcgctg tgacttgggt ctcagtgcta gggtattgag tcaggcagct ggagggttgt 2881 ggcccgaggc tgcagtcaga ggtatacttc ccatagtgct tcacacagct cccctgcttc 2941 taaaggataa ggtactgtag ccttggtcct ggggaccacc tgcctggggc agtggacatc 3001 ctaactaaac aggcttctgg cagtagcttt ggttcctatc ccatcgaaat tccccaaagc 3061 cctgggccac tgccattggg ttagtcaaga tgaaggagga ggactggctg cctccatttt 3121 gccttgtttg ttagtttgcc tgggtctgtc tgaggaagga gggggtcccg ccttccacct 3181 caacacatcc cttcagtgac tcagagtctc agaaggaaac cctgactcct ggggccattt 3241 cctaatggta ctgtaagcca agcagctttg cttctgcctc tgtttccaag cccacccttt 3301 tcccctgagc tcagggttag ggatgggcgc tttcctctct ggttgtgaac gaaaggaagg 3361 aacatctttc tatggctaac aaaaactaaa ggggaagtga ggaaacagga agaagtatgg 3421 tgggggctgg ggtagactcc cctggagcca agcctatcca gctaacaaga gctccctggg 3481 gctggtcaca gctggctcat gatgctgaac ttgaaagttt ttttgttttt gtttttgttt 3541 tgtggctcct ccaagatata ggtacatgaa gtttaggtta aaggggtggg attctttatt 3601 tttatttttg tattgtatgt gtcaagaatt actctgttgt tcaccttttg ctttttgcac 3661 tgtttgttct cttatctgta ttttgagctt agtgctagga ctgagaggct gcaccatagg 3721 gaatgtatgg gagatggtga ggggtgccag tgaggggtgc gtggaggaga ggcctgggct 3781 cctctactgg atctacactc tgtcccaggt ttttagatcc cactgagccc agctgactga 3841 aaacaaggac agtcagggtg aaacttcttt tgccagaagt gtggcctgag ttgaatttct 3901 gggaggatga cgcagatgtc tgctgcagag ctgggctgag agttctgcag tctagctctg 3961 acttaggtca ggggcctgtt ggtctctcat tggacgtttt tgggtctcac tcatgcttac 4021 tgaaacattg tgccaagaaa ctctgtggga tttgtgtccc ttaaaccaga ctcacttttc 4081 tgaaaaatct ccattgttga ggagaggctg ctcaatcgac accccgagtt ctcatgactg 4141 ggaagatagt tttcttcagg tgtcaatggc gttagactcc caggaagact agccctgccc 4201 acagggccac ctgttggttt gagagcgtgt tcgtgttctc ttgccctccc tgcctaagag 4261 ctactgggat cacgttagcg ggcatttagg ctttgatgag agggcacagt ttgagttagg 4321 tttacctccc cctttctgtg cctgggaact gtttggtcca gctttagaac tgtggttttg 4381 acttccttat ctcttgggag aagcttctgt tttaaggaat ttctcttcct tcttctcctg 4441 cctctagcct ctcctggaaa ggcctggata tggtttctaa aatctcagct gagaacttca 4501 gaaaacagca gcagtatttt ccttttccta gtgctaaaat ccctttccct agaaattggc 4561 tcaccttggg aaacccaggg aaagaatcag caggttctct gccctcccta ggggttgggg 4621 aaggacccac cccggtcagc acagtgcctt ttcctctcct gctctgagcc agggtggggc 4681 attccctcta gattcaggtt tgggcagggg tcctatagtc cctgccatgg ggctgcttcc 4741 ctgtcccttc cctccccttt gctggcctac tctggcataa ttcaagtgtc ttcttgcctt 4801 ggggatcctt agtggcatca aatggcaaca tggaatattg tcctccatgc ccctccagaa 4861 ggacctagga gagtaggtga gctttccaaa gtgagagacg aatctttctt tctttttttt 4921 tttaaagggc aggatgggta tgctttgggc tttctccttc tgtggccccg gaggaaggag 4981 agactgaggc aaggcaaagt gatagtacac tgaagcagaa ccggaaacac ccaggaactg 5041 ttcagaaatc tcagaagaaa tctgcttctc ttcgatggaa agatataatt aacgatcaaa 5101 gagctctaag aaaattgcaa agaagcctta atgttcaagc tttagaaaga tcagagcaat 5161 ttttctcttt cagtccaaac taagactctc tgtatttaaa tctctctggg gcaagagggc 5221 tagatttcct cattttgtta tgagactaga ttggtaccag tagatcagct gcctagcgag 5281 ggcaggtttc ttctttgcat ctgtgtggct tgcttccagt ctggcctgtc ctttccagct 5341 gccttttgtc tagcctgcta tggggggcca gattatcttg ataagagcag gtgatttggg 5401 gactagctgg gttggtagga aaagagcagg atggatctct tgggacaggt tcccccagga 5461 gtataaacac aaggagccag gattgtcctg gcagccaagg aaacagtagt gcctgtttga 5521 gttggcagag agggccttgg cacctcttgc atccaggcag tcttgtgaga tgggggcaca 5581 tagcactggg gaaagcagaa ctccattctc acctctattt tgagcttcag tgctttattt 5641 cagtatgagg aaaaacaaca acaaactgaa gtgcgctttc cgtcctttca aaggacaact 5701 gtcgggaagg gagagccgag ttgcgaggta ggaggggagc actggcaggg agagacattc 5761 ttgactcctc tcttccctgg tgtgttgtga tccagggaat gaaaagaaat ttgaccctgg 5821 attggttctc tccttggact taaggaatct taccttttcc ttccacaaag ttctcccagg 5881 caaggaccag ctgcccattc tgagcccagg gcagcctctt caaccattat tggtctaacc 5941 tggcttgtca ggaaaccaag cccacccttc cacattgggc ctggctgctc tattctgtac 6001 caagtactgg agaaaaagca tcaagttctt agcccttgta gcttctaccc tagtttccca 6061 tcctctctct gtggaggcca aaccaactct ttgccagcag ccacaacatg cattgacagc 6121 ggcacagtga gatataactg atgggctttg aacctggttg gccggggaag ctgtaggggt 6181 ggatagagct ggctttcctt ctgggctgtc tccatctgac cctacccctt ccatgtccca 6241 ccccactccc accaaaaagt acaaaatcag gatgtttttc actgtccatt gctttgtgtt 6301 ttaataaaca atttgcagtg ac // LOCUS D63506 2508 bp mRNA PRI 10-MAR-1997 DEFINITION Human mRNA for unc-18homologue, complete cds. ACCESSION D63506 NID g1944337 KEYWORDS unc-18homologue; Munc-18-3. SOURCE Homo sapiens fetus female brain cDNA to mRNA, clone_lib:lambda ZAPII clone:2-19. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2508) AUTHORS Gengyo-Ando,K. TITLE Direct Submission JOURNAL Submitted (13-JUL-1995) to the DDBJ/EMBL/GenBank databases. Keiko Gengyo-Ando, Tokyo Women's Medical College; 8-1 Kawada-cho, Shinjuku-ku, Tokyo 162, Japan (E-mail:andok@research.twmc.ac.jp, Tel:03-3353-8111(ex.22413), Fax:03-5269-7362) REFERENCE 2 (bases 1 to 2508) AUTHORS Gengyo-Ando,K., Kitayama,H., Mukaida,M. and Ikawa,Y. JOURNAL Unpublished (1995) REFERENCE 3 (sites) AUTHORS Gengyo-Ando,K., Kitayama,H., Mukaida,M. and Ikawa,Y. TITLE A murine neural-specific homolog corrects cholinergic defects in Caenorhabditis elegans unc-18 mutants JOURNAL J. Neurosci. 16 (21), 6695-6702 (1996) MEDLINE 96421662 COMMENT Sequence updated (07-Mar-1997) by:Keiko Gengyo-Ando. FEATURES Location/Qualifiers source 1..2508 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="2-19" /clone_lib="lambda ZAPII" /dev_stage="fetus" /sex="female" /tissue_type="brain" gene 52..1830 /gene="Munc-18-3" CDS 52..1830 /gene="Munc-18-3" /codon_start=1 /product="unc-18homologue" /db_xref="PID:d1020240" /db_xref="PID:g1944338" /translation="MAPPVAERGLKSVVWQKIKATVFDDCKKEGEWKIMLLDEFTTKL LASCCKMTDLLEEGITVVENIYKNREPVRQMKALYFITPTSKSVDCFLHDFASKSENK YKAAYIYFTDFCPDNLFNKIKASCSKSIRRCKEINISFIPHESQVYTLDVPDAFYYCY SPDPGNAKGKDAIMETMADQIVTVCATLDENPGVRYKSKPLDNASKLAQLVEKKLEDY YKIDEKSLIKGKTHSQLLIIDRGFDPVSTVLHELTFQAMAYDLLPIENDTYKYKTDGK GKEAILEEEDDLWVRIRHRHIAVVLEEIPKLMKEISSTKKATEGKTSLSALTQLMKKM PHFRKQITKQVVHLNLAEDCMNKFKLNIEKLCKTEQDLALGTDAEGQKVKDSMRVLLP VLLNKNHDNCDKIRAILLYIFSINGTTEENLDRLIQNVKIGNESDMIRNWSYLGVPIV PQSQQGKPLRKDRSAEETFQLSRWTPFIKDIMEDAIDNRLDSKEWPYCSQCPAVWNGS GAVSARQKPRANYLEDRKNGSKLIVFVIGGITYSEVRGAYEVSQAHKSCEVIIGSTHV LTPKKLLDDIKMLNKPKDKVSLIKDE" BASE COUNT 888 a 381 c 503 g 736 t ORIGIN 1 aaagtaggtt gggagtggaa ggtggtggct cctgctccgc atgtggggaa gatggcgccg 61 ccggtggcag agagggggct aaagagcgtc gtgtggcaga agataaaagc aacagtgttt 121 gatgactgca agaaagaagg cgaatggaag ataatgcttt tagatgaatt taccactaag 181 cttttggcat cgtgttgcaa aatgacagat cttctagaag aaggtattac tgttgtagag 241 aatatttata agaaccgtga acctgtcaga caaatgaaag ctctttattt catcactccg 301 acatcaaagt ctgtagattg tttcttacat gattttgcaa gtaaatcgga gaacaagtat 361 aaagcagcat atatttactt cactgacttt tgccctgata atctctttaa caaaattaag 421 gcttcttgct ccaagtcaat aagaagatgt aaagaaataa atatttcctt cattccacat 481 gaatctcagg tgtatactct tgatgtacca gatgcattct attactgtta tagtccagac 541 cctggtaatg caaagggaaa agatgccatt atggaaacaa tggctgacca gatagttaca 601 gtgtgtgcca ccttggatga aaatcccgga gtaagatata aaagtaaacc tctagataat 661 gccagtaagc ttgcacagct tgttgaaaaa aagcttgaag actactacaa gattgatgaa 721 aagagcctaa taaagggtaa aactcattca cagctcttaa taattgatcg tggctttgat 781 cctgtgtcca ctgtcctgca tgaactgacc tttcaggcaa tggcatatga tctactacca 841 attgagaatg atacatacaa atataaaaca gatgggaaag gaaaggaggc catccttgaa 901 gaagaagatg acctctgggt tagaattcga catcgacata ttgcggttgt gttagaggaa 961 attcccaagc ttatgaaaga aatttcatca acaaagaaag caacagaagg aaagacatca 1021 cttagtgctc ttacccagct gatgaaaaag atgccccatt tccgaaaaca gattactaag 1081 caagttgtcc atcttaactt agcagaagat tgcatgaata agttcaagct taatatagaa 1141 aagctctgca aaactgaaca ggacctggca cttggaactg atgcagaagg acagaaggtg 1201 aaagattcca tgcgagtact ccttccagtt ctactcaaca aaaatcatga taattgtgat 1261 aaaataagag caattctact ttatatcttc agtattaatg gaactacgga agaaaatttg 1321 gacaggttga tccagaatgt aaagatagga aatgagagtg acatgattcg taactggagt 1381 taccttggtg ttcccattgt tccccaatct caacaaggca aaccgttaag aaaggatcgg 1441 tctgcagaag aaacttttca gctctctcgg tggacacctt ttatcaaaga tattatggag 1501 gatgctattg ataatagatt agattcaaaa gaatggccat attgttccca gtgtccagca 1561 gtatggaatg gttcaggagc tgtaagtgct cgccagaaac ccagagctaa ttatttagaa 1621 gaccgaaaaa atgggtcaaa gctgattgtt tttgtaattg gagggatcac atactctgaa 1681 gtgcgtggtg cttatgaagt ttctcaggca cataaatcct gtgaagttat tattggttct 1741 acacatgttt taacacccaa aaagctgttg gatgatataa agatgctgaa taaacccaag 1801 gataaagtct ccttaattaa agatgaatag catttctttt tggagggttt agagattctt 1861 actaatatgt tgaactaaaa tagaaagaaa atgttgctgt catgtaattt aaacaatgta 1921 aatattttat ggaataatgg cttttcaaat acatttctta aggaactgtt tatgattatt 1981 actggatttg tcatttttga taatttaaat attgctgctg ctttgtagat gatgagaaga 2041 aatgttaaag tgctttctaa aaggaaattt tttcaccttt ggaggagaat atattagagt 2101 tgtgggtaat ttttcacagc cacctatgta catactaatt acccattgga tacttatatc 2161 taaaagtctc atgctgaagt atagtttttg ggaaagaatg attttaaata aagagattgt 2221 aaaagtaaaa aactgtaaat gtatatgtat gatagaattg tttcctctaa gtgtagtttt 2281 tctttcaact aaaattcagt ttatgtgtaa aataattcag tcattaatag aaatggagtg 2341 atttcacagt gtgtactgtt ttgccacata cttctaaaga acacaatttt atataatttt 2401 gaaatcatgt atgtttaaat tagaaaacca aaaatcatga acattctaag aggaaataaa 2461 tatagaattt aaaaaattaa aaaaaaaaaa aaaaaaaaaa aaggaatt // LOCUS D63643 1213 bp mRNA PRI 14-NOV-1996 DEFINITION Human mRNA for clathrin coat assembly protein-like, complete cds. ACCESSION D63643 NID g1669532 KEYWORDS clathrin coat assembly protein-like. SOURCE Homo sapiens fetus brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Watanabe,T.K., Shimizu,F., Nagata,M., Takaichi,A., Fujiwara,T., Nakamura,Y., Takahashi,E. and Hirai,Y. TITLE Cloning, expression pattern and mapping to 12p 13.2 --> p13.1 of CLAPS3, a gene encoding a novel clathrin-adaptor small chain JOURNAL Cytogenet. Cell Genet. 73 (3), 214-217 (1996) MEDLINE 96302337 REFERENCE 2 (bases 1 to 1213) AUTHORS Watanabe,T. JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 1213) AUTHORS Watanabe,T. TITLE Direct Submission JOURNAL Submitted (21-JUL-1995) to the DDBJ/EMBL/GenBank databases. Takeshi Watanabe, Otsuka GEN Research Institute,Otsuka Pharmaceutical Co.,Ltd; 463-10 Kagasuno Kawauchi-cho, Tokushima, Tokushima 771-01, Japan (Tel:0886-65-2888, Fax:0886-37-1035) FEATURES Location/Qualifiers source 1..1213 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="brain" CDS 43..624 /codon_start=1 /product="clathrin coat assembly protein-like" /db_xref="PID:d1010444" /db_xref="PID:g1669533" /translation="MIKAILIFNNHGKPRLSKFYQPYSEDTQQQIIRETFHLVSKRDE NVCNFLEGGLLIGGSDNKLIYRHYATLYFVFCVDSSESELGILDLIQVFVETLDKCFE NVCELDLIFHVDKVHNILAEMVMGGMVLETNMNEIVTQIDAQNKLEKSEAGLAGAPAR AVSAVKNMNLPEIPRNINIGDISIKVPNLPSFK" BASE COUNT 371 a 220 c 235 g 387 t ORIGIN 1 ccccgcctgg ccccagtgcc acccggtcgg cccggcacag ccatgatcaa ggcgatccta 61 atcttcaaca accacgggaa gccgcggctc tccaagttct accagcccta cagtgaagat 121 acacaacagc aaatcatcag ggagactttc catttggtat ctaagagaga tgaaaatgtt 181 tgtaatttcc tagaaggagg attattaatt ggaggatctg acaacaaact gatttataga 241 cattatgcaa cgttatattt tgtcttctgt gtggattctt cagaaagtga acttggcatt 301 ttagatctaa ttcaagtatt tgtggaaaca ttagacaaat gttttgaaaa tgtctgtgag 361 ctggatttga ttttccatgt agacaaggtt cacaatattc ttgcagaaat ggtgatgggg 421 ggaatggtat tggagacaaa tatgaatgag attgttacac aaattgatgc acaaaataag 481 ctggaaaaat ctgaggctgg cttagcagga gctccagccc gtgctgtatc agctgtaaag 541 aatatgaatc ttcctgagat cccaagaaat attaacattg gtgacatcag tataaaagtg 601 ccaaacctgc cctcttttaa ataaaaatgt aaaaaggcca ctcccaggta aaatccaggg 661 ggaagagtca tctaagttta ccatgcagtt gtttaccaaa aatagaggag gagagtctta 721 acttttgctc ttggatttaa gtcaaggtac tgtatagaag ttgtgtaaaa tcagtatgaa 781 agttcaatgt tgctgttctt gctcagtgat tttaaagaaa ttgagtagtt cctatgtgat 841 tttttttttt cttttctaaa ctgcattcct gtgcccacct acggcatgcc tctatgtatt 901 ggctactaca gtgttttaaa aagtgtttca gatatttctc taattatgta caacctaaaa 961 tgttggtgtt ttgtatggat cacaagtgca gcattcctta attccttctg ctatatgtca 1021 cacagttgtt atttggagaa ccaagtatgt attgcatgaa aacattatga cttttttctc 1081 ttagtttaaa taaactccaa ggtaactgga cttctaaagc acctttctgt ttgcctgata 1141 tctactttag caataatttt ttttacaacc ctctgactca acaaagtaaa taaaagtata 1201 ttttatcact att // LOCUS D63780 2035 bp mRNA PRI 19-JUN-1997 DEFINITION Human mRNA for YSK1, complete cds. ACCESSION D63780 NID g2196444 KEYWORDS YSK1. SOURCE Homo sapiens cell_line:HeLa cDNA to mRNA, clone:403-13. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2035) AUTHORS Osada,S.-I. TITLE Direct Submission JOURNAL Submitted (03-JUL-1995) to the DDBJ/EMBL/GenBank databases. Shin-Ichi Osada, Yokohama City University School of Medicine, Molecular Biology; 3-9, Fukuura, Kanazawa-ku, Yokohama, Kanagawa 236, Japan (E-mail:osada@med.yokohama-cu.ac.jp, Tel:045-787-2597, Fax:045-785-4140) REFERENCE 2 (sites) AUTHORS Osada,S., Izawa,M., Saito,R., Mizuno,K., Suzuki,A., Hirai,S. and Ohno,S. TITLE YSK1, a novel mammalian protein kinase structurally related to Ste20 and SPS1, but is not involved in the known MAPK pathways JOURNAL Oncogene 14 (17), 2047-2057 (1997) MEDLINE 97304522 FEATURES Location/Qualifiers source 1..2035 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /clone="403-13" CDS 115..1395 /note="protein kinase" /codon_start=1 /product="YSK1" /db_xref="PID:d1021253" /db_xref="PID:g2196445" /translation="MAHLRGFANQHSRVDPEELFTKLDRIGKGSFGEVYKGIDNHTKE VVAIKIIDLEEAEDEIEDIQQEITVLSQCDSPYITRYFGSYLKSTKLWIIMEYLGGGS ALDLLKPGPLEETYIATILREILKGLDYLHSERKIHRDIKAANVLLSEQGDVKLADFG VAGQLTDTQIKRNTFVGTPFWMAPEVIKQSAYDFKADIWSLGITAIELAKGEPPNSDL HPMRVLFLIPKNSPPTLEGQHSKPFKEFVEACLNKDPRFRPTAKELLKHKFITRYTKK TSFLTELIDRYKRWKSEGHGEESSSEDSDIDGEAEDGEQGPIWTFPPTIRPSPHSKLH KGTALHSSQKPAEPVKRQPRSQCLSTLVRPVFGELKEKHKQSGGSVGALEELENAFSL AEESCPGISDKLMVHLVERVQRFSHNRNHLTSTR" BASE COUNT 440 a 609 c 591 g 395 t ORIGIN 1 aacagagact gcggtggact gacgccgcag gggcgagcta gccggctccg cgcctctccg 61 cgggatccag acgcctcctg gggctgctgg cggagggtct gaggcggcgc ggccatggct 121 cacctccggg gatttgccaa ccagcactct cgagtggacc ctgaggagct cttcaccaag 181 ctcgaccgca ttggcaaggg ctcgtttggg gaggtctaca agggcatcga taaccacaca 241 aaggaggtgg tggccatcaa gatcatcgac ctggaggagg ccgaggatga gatcgaggac 301 atccagcagg agatcactgt cctcagtcag tgcgacagcc cctacatcac ccgctacttt 361 ggctcctacc taaagagcac caagctatgg atcatcatgg agtacctggg cggcggctca 421 gcactggact tgcttaaacc aggtcccctg gaggagacat acattgccac gatcctgcgg 481 gagattctga agggcctgga ttatctgcac tccgaacgca agatccaccg agacatcaaa 541 gctgccaacg tgctactctc ggagcagggt gacgtgaagc tggcggactt tggggtagca 601 gggcagctca cagacacgca gattaagagg aacacattcg tgggcacccc cttctggatg 661 gcacctgagg tcatcaagca gtcggcctac gacttcaagg ctgacatctg gtccctgggg 721 atcacagcca tcgagctggc caagggggag cctccaaact ctgacctcca ccccatgcgc 781 gtcctgttcc tgattcccaa gaacagccca cccacactgg agggccagca cagcaagccc 841 ttcaaggagt tcgtggaggc ctgcctcaac aaagaccccc gattccggcc cacggccaag 901 gagctcctga agcacaagtt catcacacgc tacaccaaga agacctcctt cctcacggag 961 ctcatcgacc gctataagcg ctggaagtca gaggggcatg gcgaggagtc cagctctgag 1021 gactctgaca ttgatggcga ggcggaggac ggggagcagg gccccatctg gacgttcccc 1081 cctaccatcc ggccgagtcc acacagcaag cttcacaagg ggacggccct gcacagttca 1141 cagaagcctg cggagcccgt caagaggcag ccgaggtccc agtgcctgtc cacgctggtc 1201 cggcccgtct tcggagagct caaagagaag cacaagcaga gcggcgggag cgtgggtgcg 1261 ctggaggagc tggagaacgc cttcagcctg gccgaggagt cctgccccgg catctcagac 1321 aagctgatgg tgcacctggt ggagcgagtg cagaggtttt cacacaacag aaaccacctg 1381 acatccaccc gctgaagcgc actgctgttc agatagggga cggaaggtcg tttgtttttg 1441 ttctgagctc cataagaact gtgctgactt ggaaggtgcc ctgtgctatg tcgtgcctgc 1501 agggacacgt cggatcccgt gggcctcaca tgccaggtca ccaggtcacc gtctccttcc 1561 acccctgcag tgtgctgttg tgcacgtcag ggacgctgtt ctctatgccc actgccctcc 1621 tccctctcct ggcccagcag tattgctcac gggggctcca gcagccggcg tggccctcat 1681 gagctacgcc tgggtcttct gcagactcat gcagccctat ggccgctcag accaaggcgc 1741 agagcaacta tcagggcagc tctgcctcct cctcccatga ggtggggaga ggcaacaggg 1801 cagcccccag aggagtgtcc tggccgctgt cctcccgggg cccatgatgg ccatagattt 1861 gccttgtggt gttggatcag gtactgtgtc tgctcataag tacttgtgtc atccagaatg 1921 ttttgttttt taagaaaatt gaattacttg tttcctgaaa tattctgagg ttaatatgtt 1981 agttttcata gaacattgag aggccccttc cactttcaat aaagacctga cttgg // LOCUS D63875 4243 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0155 gene, complete cds. ACCESSION D63875 NID g961441 KEYWORDS KIAA0155. SOURCE Homo sapiens male myeloblast cell_line:KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4243) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (11-AUG-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 4243) AUTHORS Nomura,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. IV. The coding sequences of 40 new genes (KIAA0121-KIAA0160) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (4), 167-174 (1995) MEDLINE 96127530 FEATURES Location/Qualifiers source 1..4243 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" gene 87..3608 /gene="KIAA0155" CDS 87..3608 /gene="KIAA0155" /note="KIAA0155 gene product is related to C.elegans B0464.2 protein." /codon_start=1 /db_xref="PID:d1010573" /db_xref="PID:g961442" /translation="MSRGSIEIPLRDTDEVIELDFDQLPEGDEVISILKQEHTQLHIW IALALEYYKQGKTEEFVKLLEAARIDGNLDYRDHEKDQMTCLDTLAAYYVQQARKEKN KDNKKDLITQATLLYTMADKIIMYDQNHLLGRACFCLLEGDKMDQADAQFHFVLNQSP NNIPALLGKACISFNKKDYRGALAYYKKALRTNPGCPAEVRLGMGHCFVKLNKLEKAR LAFSRALELNSKCVGALVGLAVLELNNKEADSIKNGVQLLSRAYTIDPSNPMVLNHLA NHFFFKKDYSKVQHLALHAFHNTEVEAMQAESCYQLARSFHVQEDYDQAFQYYYQATQ FASSSFVLPFFGLGQMYIYRGDKENASQCFEKVLKAYPNNYETMKILGSLYAASEDQE KRDIAKGHLKKVTEQYPDDVEAWIELAQILEQTDIQGALSAYGTATRILQEKVQADVP PEILNNVGALHFRLGNLGEAKKYFLASLDRAKAEAEHDEHYYNAISVTTSYNLARLYE AMCEFHEAEKLYKNILREHPNYVDCYLRLGAMARDKGNFYEASDWFKEALQINQDHPD AWSLIGNLHLAKQEWGPGQKKFERILKQPSTQSDTYSMLALGNVWLQTLHQPTRDREK EKRHQDRALAIYKQVLRNDAKNLYAANGIGAVLAHKGYFREARDVFAQVREATADISD VWLNLAHIYVEQKQYISAVQMYENCLRKFYKHQNTEVVLYLARALFKCGKLQECKQTL LKARHVAPSDTVLMFNVALVLQRLATSVLKDEKSNLKEVLNAVKELELAHRYFSYLSK VGDKMRFDLALAATEARQCSDLLSQAQYHVARARKQDEEERELRAKQEQEKELLRQKL LKEQEEKRLREKEEQKKLLEQRAQYVEKTKNILMFTGETEATKEKKRGGGGGRRSKKG GEFDEFVNDDTDDDLPISKKKKRRKGSGSEQEGEDEEGGERKKKKRRRHPKGEEGSDD DETENGPKPKKRRPPKAEKKKAPKPERLPPSMKGKIKSKAIISSSDDSSDEDKLKIAD EGHPRNSNSNSDSDEDEQRKKCASSESDSDENQNKSGSEAGSPRRPRRQRSDQDSDSD QPSRKRRPSGSEQSDNESVQSGRSHSGVSENDSRPASPSAESDHESERGSDNEGSGQG SGNESEPEGSNNEASDRGSEHGSDDSD" BASE COUNT 1367 a 821 c 989 g 1066 t ORIGIN 1 gtcactcacc tctggattag cctgaagcgg agactaccgg ctgcggagcg gcggggcgag 61 acacttgctc gccttttgac cccatcatgt cgcggggctc catcgagatt cccctccggg 121 acactgacga ggtcattgaa cttgacttcg atcagttacc ggagggagat gaagttatca 181 gtattctgaa acaggaacac acacaactgc acatatggat tgctttggcg ctggaatact 241 acaagcaagg aaaaacagaa gagtttgtaa aattgttgga agcagcacgt atagatggca 301 atttggacta tagagaccat gaaaaagacc agatgacttg cttggataca ttggcagcgt 361 attatgtaca acaggctcgg aaagaaaaga ataaggacaa taaaaaggat cttattacac 421 aggccacctt gttgtataca atggccgata aaattattat gtatgatcag aaccatttgt 481 tgggaagagc ctgcttctgc ctacttgagg gtgacaaaat ggatcaagct gatgcacagt 541 ttcattttgt actcaatcag tctccaaata atattccagc ccttcttggt aaagcttgca 601 tttccttcaa caagaaggat tacagaggag ctcttgctta ctataagaaa gcattgcgta 661 ctaacccagg atgtccagcg gaagttcgtt taggaatggg tcattgcttt gtgaaactta 721 acaaactgga aaaagctcgt ctggcattca gcagagccct ggaactcaat tccaaatgcg 781 tgggagcatt ggttggactg gctgttctag aactcaacaa taaagaggct gattccatta 841 aaaatggtgt ccagcttctt tccagagcct atactattga tcctagcaac cctatggtat 901 tgaaccattt ggcaaatcac tttttcttca aaaaggatta tagtaaagtc cagcatctgg 961 ccctccatgc attccataat acagaagtgg aagctatgca agcagagagc tgctatcagc 1021 tagctagatc attccatgtt caggaagatt atgaccaagc ttttcagtac tattatcaag 1081 ccacacagtt tgcctcatcc tcttttgtgc tcccattttt tggtttggga caaatgtata 1141 tttatcgagg tgacaaagaa aatgcatctc agtgctttga gaaggttttg aaagcttatc 1201 ctaataatta cgaaactatg aaaattctcg gctctctcta tgctgcctca gaagatcaag 1261 aaaaacgaga tattgccaag ggccatttga agaaggtcac agaacagtat cccgatgatg 1321 ttgaagcttg gattgaattg gcacaaatct tagaacagac tgatatacag ggtgcccttt 1381 cagcctatgg aacagcaaca cgaatccttc aggagaaagt gcaggccgat gttcctccag 1441 agattctcaa taatgtgggt gccctccatt ttagacttgg aaacctaggg gaggctaaga 1501 aatatttttt ggcgtcattg gaccgtgcaa aagcagaagc ggaacacgat gagcattact 1561 ataacgccat ttccgttacc acgtcatata atctcgccag gctatatgag gcgatgtgtg 1621 aattccatga agcagaaaaa ctgtataaaa acatcttacg cgaacatcct aattatgttg 1681 actgctattt gcgcctagga gccatggcta gagataaggg aaacttttat gaggcttcag 1741 attggtttaa ggaagctctt cagattaatc aggatcatcc agatgcttgg tctttgattg 1801 gcaatcttca tttggcaaaa caagaatggg gtcctgggca gaagaagttt gagaggatat 1861 taaaacagcc atccacacag agtgatacct attctatgct agcccttggc aacgtgtggc 1921 tccaaacttt acatcagccc acccgagatc gagaaaagga aaagcgtcat caagatcgtg 1981 ctctggccat ctacaaacaa gtactcagaa atgatgcaaa gaatctgtat gctgccaatg 2041 gcataggagc tgttttggcc cacaaaggat attttcgtga agctcgtgat gtatttgccc 2101 aagtaagaga agcaacagca gatattagtg atgtgtggct gaacttagca cacatctatg 2161 tggagcaaaa gcagtacatc agcgccgttc agatgtatga aaactgcctc cgaaagttct 2221 ataagcacca aaacactgaa gttgtactct atttggcccg ggccctcttc aagtgtggca 2281 agttacagga atgcaaacag actttgctga aggctagaca tgtggcaccc agtgatacag 2341 ttcttatgtt taatgtggcc ttggtcctgc aaagattagc tacctctgtc ctgaaagatg 2401 aaaaaagtaa tctgaaggaa gtacttaatg ctgtgaaaga actggagctt gcacatagat 2461 acttcagtta tttgagtaaa gtgggagata aaatgagatt tgatttggcc cttgctgcta 2521 cagaagccag gcagtgttct gacttactga gccaggccca gtaccatgtg gcccgggcac 2581 gcaaacaaga tgaagaagag cgggagctgc gggccaagca agagcaagaa aaggagctgt 2641 taaggcagaa acttcttaaa gaacaggaag agaaacgtct cagagaaaag gaagagcaaa 2701 agaaactttt ggaacagcgg gcccagtatg tggagaagac caaaaatatt cttatgttta 2761 ctggtgagac tgaagcaaca aaagagaaga aaagaggtgg tggtggtgga cggcgttcta 2821 agaagggagg agagtttgat gaatttgtca atgatgacac tgatgatgac ctacctatat 2881 ccaaaaagaa gaagagaaga aagggtagtg gcagtgaaca agaaggtgaa gatgaggagg 2941 gtggtgagag aaagaagaaa aagaggagaa gacatccaaa gggagaagaa ggatctgatg 3001 atgatgaaac agaaaatggc cccaaaccaa aaaaacgacg tccaccaaaa gcagagaaga 3061 aaaaggctcc caagccagaa cgtctgcctc catcaatgaa gggaaaaata aaatccaaag 3121 ccataatttc atcaagtgat gactcttcgg atgaggataa acttaaaatt gctgatgaag 3181 gacatcccag gaacagcaac agcaacagtg actcagacga ggacgaacaa cgaaagaaat 3241 gtgcctcatc agagagtgat tccgatgaga accagaacaa gtctggcagc gaggccggca 3301 gtccccggag gccacgaaga cagcggtcag atcaggactc agacagtgac cagccatcca 3361 gaaagagaag gccctccggt tctgagcagt ctgacaatga atctgtgcag tcagggagaa 3421 gccactcagg agtttctgag aacgactctc gcccagcttc tccaagtgcc gaatcagatc 3481 acgaatcgga gagaggatct gataatgagg gttctggcca aggctctgga aatgaatcgg 3541 aaccagaggg atccaacaat gaggcctcag atagaggctc agaacatggg tcagatgata 3601 gtgactaggt tttatttcat caataagctt catctctgga ggaaactttt ttaatatatg 3661 aaagctgtga taaaaatgtt tcagatgttt agtcaattgt gaaatttttc ttaaggcaat 3721 tttcttttct atcagtttgt atattactaa gccccaagag acatttcctg tgctagagtc 3781 caatatttga gtctctcgtg caaatgagac tattctttgt ggtacaattc cacctatcat 3841 atgtgaaaac tgcagtaaaa ataaacccag atgctaaatc attcctacaa aggtttgact 3901 gaaactgtgg cagatgtctc atcttcttta tatgttaagc agcatactct tctgattttt 3961 attgcaatct tttaccaagt ggtgcacaaa cttggtattg atgtctttat tccattttga 4021 gtttagattg agaatatttt tattttctga aggcagagat atctactgta taattgcacc 4081 aaagtacatt tgaaaggaag gttttcaata gtgtaatact gcagcgatgt agataaaatc 4141 acaaatgtat aatgtgttag gttgaataag gtgtggaaaa tgcttttctg ttagtagaat 4201 gcaaaaacct acctaagcca cataataata aaattctttt acc // LOCUS D63879 3660 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0156 gene, complete cds. ACCESSION D63879 NID g961449 KEYWORDS KIAA0156. SOURCE Homo sapiens male myeloblast cell_line:KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3660) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (11-AUG-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 3660) AUTHORS Nomura,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. IV. The coding sequences of 40 new genes (KIAA0121-KIAA0160) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (4), 167-174 (1995) MEDLINE 96127530 FEATURES Location/Qualifiers source 1..3660 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" gene 1..2892 /gene="KIAA0156" CDS 1..2892 /gene="KIAA0156" /note="KIAA0156 gene product is related to Xenopus nucleolin." /codon_start=1 /db_xref="PID:d1010577" /db_xref="PID:g961450" /translation="MATAAETSASEPEAESKAGPKADGEEDEVKAARTRRKVLSRAVA AATYKTMGPAWDQQEEGVSESDGDEYAMASSAESSPGEYEWEYDEEEEKNQLEIERLE EQLSINVYDYNCHVDLIRLLRLEGELTKVRMARQKMSEIFPLTEELWLEWLHDEISMA QDGLDREHVYDLFEKAVKDYICPNIWLEYGQYSVGGIGQKGGLEKVRSVFERALSSVG LHMTKGLALWEAYREFESAIVEAARLEKVHSLFRRQLAIPLYDMEATFAEYEEWSEDP IPESVIQNYNKALQQLEKYKPYEEALLQAEAPRLAEYQAYIDFEMKIGDPARIQLIFE RALVENCLVPDLWIRYSQYLDRQLKVKDLVLSVHNRAIRNCPWTVALWSRYLLAMERH GVDHQVISVTFEKALNAGFIQATDYVEIWQAYLDYLRRRVDFKQDSSKELEELRAAFT RALEYLKQEVEERFNESGDPSCVIMQNWARIEARLCNNMQKARELWDSIMTRGNAKYA NMWLEYYNLERAHGDTQHCRKALHRAVQCTSDYPEHVCEVLLTMERTEGSLEDWDIAV QKTETRLARVNEQRMKAAEKEAALVQQEEEKAEQRKRARAEKKALKKKKKIRGPEKRG ADEDDEKEWGDDEEEQPSKRRRVENSIPAAGETQNVEVAAGPAGKCAAVDVEPPSKQK EKAASLKRDMPKVLHDSSKDSITVFVSNLPYSMQEPDTKLRPLFEACGEVVQIRPIFS NRGDFRGYCYVEFKEEKSALQALEMDRKSVEGRPMFVSPCVDKSKNPDFKVFRYSTSL EKHKLFISGLPFSCTKEELEEICKAHGTVKDLRLVTNRAGKPKGLAYVEYENESQASQ AVMKMDGMTIKENIIKVAISNPPQRKVPEKPETRKAPGGPMLLPQTYGARGKGRTQLS LLPRALQRPSAAAPQAENGPAAAPAVAAPAATEAPKMSNADFAKLFLRK" BASE COUNT 986 a 824 c 1049 g 801 t ORIGIN 1 atggcgactg cggccgaaac ctcggcttca gaacccgagg ctgagtccaa ggctgggccc 61 aaggctgacg gagaggagga tgaggttaag gcggctagga caaggaggaa ggtgttatcg 121 cgggctgtgg ccgctgcgac atacaagacc atggggccag cgtgggatca gcaggaggaa 181 ggcgtgagcg agagcgatgg ggatgagtac gccatggctt cctccgcgga gagctccccc 241 ggggagtacg agtgggaata tgacgaagag gaggagaaaa accagctgga gattgagaga 301 ctggaggagc agttgtctat caacgtctat gactacaact gccatgtgga cttgatcaga 361 ctgctcaggc tggaagggga gcttaccaag gtgaggatgg cccgccagaa gatgagtgaa 421 atctttccct tgactgaaga gctctggctg gagtggctgc atgacgagat cagcatggcc 481 caggatggcc tggacagaga gcacgtgtat gacctctttg agaaagccgt gaaggattac 541 atttgtccta acatttggct agagtatggc cagtactcag ttggtgggat tggtcagaaa 601 ggtggccttg agaaagttcg ctccgtgttt gaaagggctc tctcgtctgt tggtttacat 661 atgaccaaag gactcgccct ctgggaggct taccgagagt ttgaaagtgc gattgtggaa 721 gctgctcggc ttgagaaagt ccacagtctt ttccggcgac agttggcgat cccactctat 781 gatatggagg ccacatttgc agagtatgaa gaatggtcag aagacccaat accagagtca 841 gtaattcaga actataacaa agcactacag cagctggaga aatataaacc ctatgaagaa 901 gcactgttgc aggcagaggc accaaggctg gcagaatatc aagcatatat cgattttgag 961 atgaaaattg gcgatcctgc tcgcattcag ttgatctttg agcgcgccct ggtcgagaac 1021 tgccttgtcc cagacttatg gatccgttac agtcagtacc tagatcgaca actgaaagta 1081 aaggatttgg ttttatctgt acataaccgc gctattagaa actgcccctg gacagttgcc 1141 ttatggagtc ggtacctctt ggccatggag agacatggag ttgatcatca agtaatttct 1201 gtaaccttcg agaaagcttt gaatgccggc ttcatccagg ccactgatta tgtggagatt 1261 tggcaggcat accttgatta cctgaggaga agggttgatt tcaaacaaga ctccagtaaa 1321 gagctggagg agttgagggc cgcctttact cgtgccttgg agtatctgaa gcaggaggtg 1381 gaagagcgtt tcaatgagag tggtgatcca agctgcgtga ttatgcagaa ctgggctagg 1441 attgaggctc gactgtgcaa taacatgcag aaagctcggg aactctggga tagcatcatg 1501 accagaggaa atgccaagta cgccaacatg tggctagagt attacaacct ggaaagagct 1561 catggtgaca cccagcactg ccggaaggct ctgcaccggg ccgtccagtg caccagtgac 1621 tacccagagc acgtctgcga agtgttactc accatggaga ggacagaagg ttctttagaa 1681 gattgggata tagctgttca gaaaactgaa acccgattag ctcgtgtcaa tgagcagaga 1741 atgaaggctg cagagaagga agcagccctt gtgcagcaag aagaagaaaa ggctgaacaa 1801 cggaaaagag ctcgggctga gaagaaagcg ttaaaaaaga agaaaaagat cagaggccca 1861 gagaagcgcg gagcagatga ggacgatgag aaagagtggg gcgatgatga agaagagcag 1921 ccttccaaac gcagaagggt cgagaacagc atccctgcag ctggagaaac acaaaatgta 1981 gaagtagcag cagggcccgc tgggaaatgt gctgccgtag atgtggagcc cccttcgaag 2041 cagaaggaga aggcagcctc cctgaagagg gacatgccca aggtgctgca cgacagcagc 2101 aaggacagca tcaccgtctt tgtcagcaac ctgccctaca gcatgcagga gccggacacg 2161 aagctcaggc cactcttcga ggcctgtggg gaggtggtcc agatccgacc catcttcagc 2221 aaccgtgggg atttccgagg ttactgctac gtggagttta aagaagagaa atcagccctt 2281 caggcactgg agatggaccg gaaaagtgta gaagggaggc caatgtttgt ttccccctgt 2341 gtggataaga gcaaaaaccc cgattttaag gtgttcaggt acagcacttc cctagagaaa 2401 cacaagctgt tcatctcagg cctgcctttc tcctgtacta aagaggaact agaagaaatc 2461 tgtaaggctc atggcaccgt gaaggacctc aggctggtca ccaaccgggc tggcaaacca 2521 aagggcctgg cctacgtgga gtatgaaaat gaatcccagg cgtcgcaggc tgtgatgaag 2581 atggacggca tgactatcaa agagaacatc atcaaagtgg caatcagcaa ccctcctcag 2641 aggaaagttc cagagaagcc agagaccagg aaggcaccag gtggccccat gcttttgccg 2701 cagacatacg gagcgagggg gaagggaagg acgcagctgt ctctactgcc tcgtgccctg 2761 cagcgcccaa gtgctgcagc tcctcaggct gagaacggcc ctgccgcggc tcctgcagtt 2821 gccgccccag cagccaccga ggcacccaag atgtccaatg ccgattttgc caagctgttt 2881 ctgagaaagt gaacgggacg ctgggagaca ggaaatgcct tacttcactc tggcccggcg 2941 gacctcccac cacccagcag tgcactgggg atggacaggc ctggtgtgct gcgtgctcgc 3001 aaccacagat ggctcctcgg ctttagacag aaaggggaag gggttctaag tcaagagcct 3061 ttcagtgctc cctcatattg agggcagtgg cagaaaagtg accactctgc aggctgggcc 3121 caggatgtgg tgtcctgaga tagttttgta tcttaaagac tgaggcacag aagcgaaacg 3181 agaacacact gtttttgaga cacagttgtc caaatgtttc tggccagctc cggccccttt 3241 ttgtatgaca cttctcttcc accctgcaca gcacatgtgc ccgtcattct tttaatttta 3301 aaagatgaaa tggcagatgc tagtaattca cagaatggcc tcttgtgggg gtgggtctga 3361 gggaagtcag ctataaaaca tttgctggag ttttgttcaa tggggctgtg catttttata 3421 ttatgtgttt gtaaatgaca tgtcagccat tgtttcatgt ttcctaaaag cagaatattt 3481 gcaacatttg ttttgtatag gaattatttg tgccacctgc tgtggactgt tttctttgcc 3541 tagtgactag tgacctgtgt tgtctaaaca tgagtttcag ccctttggtt ttgtttaata 3601 ccatgtcaaa tgcaaacttc aattctcccc atttagcttt attaaactga cgttctcttc // LOCUS D63880 5547 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0159 gene, complete cds. ACCESSION D63880 NID g961451 KEYWORDS KIAA0159. SOURCE Homo sapiens male myeloblast cell_line:KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5547) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (11-AUG-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 5547) AUTHORS Nomura,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. IV. The coding sequences of 40 new genes (KIAA0121-KIAA0160) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (4), 167-174 (1995) MEDLINE 96127530 FEATURES Location/Qualifiers source 1..5547 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" gene 800..5005 /gene="KIAA0159" CDS 800..5005 /gene="KIAA0159" /note="KIAA0159 gene product is related to yeast protein L8479.14." /codon_start=1 /db_xref="PID:d1010578" /db_xref="PID:g961452" /translation="MAPQMYEFHLPLSPEELLKSGGVNQYVVQEVLSIKHLPPQLRAF QAAFRAQGPLAMLQHFDTIYSILHHFRSIDPGLKEDTLEFLIKVVSRHSQELPAILDD TTLSGSDRNAHLNALKMNCYALIRLLESFETMASQTNLVDLDLGGKGKKARTKAAHGF DWEEERQPILQLLTQLLQLDIRHLWNHSIIEEEFVSLVTGCCYRLLENPTINHQKNRP TREAITHLLGVALTRYNHMLSATVKIIQMLQHFEHLAPVLVAAVSLWATDYGMKSIVG EIVREIGQKCPQELSRDPSGTKGFAAFLTELAERVPAILMSSMCILLDHLDGENYMMR NAVLAAMAEMVLQVLSGDQLEAAARDTRDQFLDTLQAHGHDVNSFVRSRVLQLFTRIV QQKALPLTRFQAVVALAVGRLADKSVLVCKNAIQLLASFLANNPFSCKLSDADLAGPL QKETQKLQEMRAQRRTAAASAVLDPEEEWEAMLPELKSTLQQLLQLPQGEEEIPEQIA NTETTEDVKGRIYQLLAKASYKKAIILTREATGHFQESEPFSHIDPEESEETRLLNIL GLIFKGPAASTQEKNPRESTGNMVTGQTVCKNKPNMSDPEESRGNDELVKQEMLVQYL QDAYSFSRKITEAIGIISKMMYENTTTVVQEVIEFFVMVFQFGVPQALFGVRRMLPLI WSKEPGVREAVLNAYRQLYLNPKGDSARAKAQALIQNLSLLLVDASVGTIQCLEEILC EFVQKDELKPAVTQLLWERATEKVACCPLERCSSVMLLGMMARGKPEIVGSNLDTLVS IGLDEKFPQDYRLAQQVCHAIANISDRRKPSLGKRHPPFRLPQEHRLFERLRETVTKG FVHPDPLWIPFKEVAVTLIYQLAEGPEVICAQILQGCAKQALEKLEEKRTSQEDPKES PAMLPTFLLMNLLSLAGDVALQQLVHLEQAVSGELCRRRVLREEQEHKTKDPKEKNTS SETTMEEELGLVGATADDTEAELIRGICEMELLDGKQTLAAFVPLLLKVCNNPGLYSN PDLSAAASLALGKFCMISATFCDSQLRLLFTILEKSPLPIVRSNLMVATGDLAIRFPN LVDPWTPHLYARLRDPAQQVRKTAGLVMTHLILKDMVKVKGQVSEMAVLLIDPEPQIA ALAKNFFNELSHKGNAIYNLLPDIISRLSDPELGVEEEPFHTIMKQLLSYITKDKQTE SLVEKLCQRFRTSLTERQQRDLAYCVSQLPLTERGLRKMLDNFDCFGDKLSDESIFSA FLSVVGKLRRGAKPEGKAIIDEFEQKLRACHTRGLDGIKELEIGQAGSQRAPSAKKPS TGSRYQPLASTASDNDFVTPEPRRTTRRHPNTQQRASKKKPKVVFSSDESSEEDLSAE MTEDETPKKTTPILRASARRHRS" BASE COUNT 1377 a 1453 c 1461 g 1256 t ORIGIN 1 aagttttgga gcgccggaca agctgaggtc cgcgactcgt cgctaagatt ccccaaaact 61 gagatttcaa gaaaattcac cctttccgct ttctttggcg cccttcaacc gtgaaggaaa 121 tgaaggttga gaacctggaa cccgcttcca aagaccaagc cctttctctg tgcctcacaa 181 cagtcttgcg ctgtagctgt ttttattacc cctattttac tactggggag gagatttgag 241 gtcctgaaag tgaagtgact tgctggacgt cacttagcag aggagatggc gaaacctgga 301 ttagaggcga tgccttctaa ctcccagcgt ggacttgcct cctttcctgg gggtgactga 361 atgcccagcc agggacgcga cgtctctggc cagcagaaat acggcctcct ccccgccgac 421 tgggcaaagg gggaccttgc ggccaaggag ggattcgcag gcgggccggg ggtgggagcg 481 ggggccggag ccggagcctt ggcccgcccc cgggcggcgc tgtgattggc cggccgctcc 541 ggcgggcgtc gcgattggcc gtcctgcagc cgttgagatt tgaactcggt atttgtggct 601 ttgcccgcgc gttgccagac tcagaggcgg ccctttgcct ctgcctgccg gggattggcc 661 ggattctccg ccgacttgaa aactgccttg ctaattggtg cgtgttgtgc acgcgtgttt 721 tttccttttc atttcagcct gactgccgga atcagagccg cgggtgagat ccccagccct 781 gtgagcctgt aggagtagaa tggctcccca aatgtatgag ttccatctgc cattatcccc 841 agaggagttg ttgaaaagtg gaggggtgaa tcagtatgtt gtgcaagagg tactgtccat 901 caaacatctt ccaccacagc ttagagcttt tcaggctgcc tttcgagctc aggggcccct 961 ggctatgctg cagcactttg atactatcta cagcattttg catcactttc gaagtataga 1021 tcctggcctc aaagaagata ctctggaatt cctgataaaa gtggtatccc gccactccca 1081 ggagcttcca gctatcctgg atgatacaac tttgagtgga tcagatagaa acgcccatct 1141 aaatgccctc aaaatgaact gttatgctct gatacgtctc ctggaatcct ttgagaccat 1201 ggccagccag acaaaccttg tggacctgga ccttggtggg aagggtaaga aagctcggac 1261 caaggcagcc catggctttg actgggaaga agagaggcaa ccaattcttc agcttttaac 1321 acagctactt cagttggaca tccgtcacct gtggaaccac tcaataattg aagaagaatt 1381 tgtcagtttg gttactggct gttgctaccg ccttctggag aatcccacca ttaatcacca 1441 gaagaaccgc cccactcggg aagccataac acacctgctt ggtgtagcct tgacccgtta 1501 taaccatatg ctcagtgcta cagtgaagat catccagatg ctgcagcact ttgaacacct 1561 ggcacctgta ctggttgcag ccgtgagtct atgggcaact gactatggaa tgaagagcat 1621 agtgggagag attgtaagag agattggaca aaagtgtccc caagagctga gtcgagaccc 1681 ttcagggaca aagggctttg cagcattcct gacagaacta gcagaacgtg tcccagctat 1741 cctgatgtcc agcatgtgca ttttgctaga tcacctggat ggagaaaatt acatgatgcg 1801 taatgctgtg ctggcagcca tggcggagat ggtgctgcag gttctcagtg gcgatcaact 1861 ggaagcagca gcccgagaca ccagagacca gttcttggat actttacaag cccatggcca 1921 tgatgtcaac tcctttgtgc ggagccgtgt tttgcagctc ttcacccgaa ttgtccagca 1981 gaaggctctc cccctgacac gtttccaggc agtggtggct ttagctgtgg gacgtctggc 2041 agacaagtca gtgctagtat gtaaaaatgc catccagctg ctggccagtt ttctagccaa 2101 taatcctttc tcctgcaagc ttagtgatgc tgaccttgcc ggaccactgc agaaggagac 2161 ccagaaatta caagagatga gggcccagag gcgaactgca gcagcttctg cagtgctgga 2221 cccagaggag gagtgggaag ccatgctgcc agagttgaag tctaccctgc agcagcttct 2281 acagcttccc cagggagagg aggagattcc tgagcaaatt gccaatacag agacaactga 2341 agatgtgaaa ggacgcatct atcaactgct tgccaaagct agttacaaaa aggccatcat 2401 tctcactcga gaagccacag gccacttcca ggagtccgaa cccttcagtc atatagaccc 2461 agaggagtca gaggagacca ggctcttgaa tatattagga cttatcttca aaggcccagc 2521 agcttccaca caagaaaaga atccccggga gtctacagga aacatggtca caggacagac 2581 tgtctgtaaa aataaaccca atatgtcgga tcctgaggaa tccaggggaa atgatgaact 2641 agtgaagcag gagatgctgg tacagtatct gcaggatgcc tacagcttct cccggaagat 2701 tacagaggcc attggcatca tcagcaagat gatgtatgaa aacacaacta cagtggtgca 2761 ggaggtgatt gaattctttg tgatggtctt ccaatttggg gtaccccagg ccctgtttgg 2821 ggtgcgccgt atgctgcctc tcatctggtc taaggagcct ggtgtccggg aagccgtgct 2881 taatgcctac cgccaactct acctcaaccc caaaggggac tctgccagag ccaaggccca 2941 ggctttgatt cagaatctct ctctgctgct agtggatgcc tcggttggga ccattcagtg 3001 tcttgaggaa attctctgtg agtttgtgca gaaggatgag ttgaaaccag cagtgaccca 3061 gctgctgtgg gagcgggcca ccgagaaggt cgcctgctgt cctctggagc gctgttcctc 3121 tgtcatgctt cttggcatga tggcacgagg aaagccagaa attgtgggaa gcaatttaga 3181 cacactggtg agcatagggc tggatgagaa gtttccacag gactacaggc tggcccagca 3241 ggtgtgccat gccattgcca acatctcgga caggagaaag ccttctctgg gcaaacgtca 3301 cccccccttc cggctgcctc aggaacacag gttgtttgag cgactgcggg agacagtcac 3361 aaaaggcttt gtccacccag acccactctg gatcccattc aaagaggtgg cagtgaccct 3421 catttaccaa ctggcagagg gccccgaagt gatctgtgcc cagatattgc agggctgtgc 3481 aaaacaggcc ctggagaagc tagaagagaa gagaaccagt caggaggacc cgaaggagtc 3541 ccccgcaatg ctccccactt tcctgttgat gaacctgctg tccctggctg gggatgtggc 3601 tctgcagcag ctggtccact tggagcaggc agtgagtgga gagctctgcc ggcgccgagt 3661 tctccgggaa gaacaggagc acaagaccaa agatcccaag gagaagaata cgagctctga 3721 gaccaccatg gaggaggagc tggggctggt tggggcaaca gcagatgaca cagaggcaga 3781 actaatccgt ggcatctgcg agatggaact gttggatggc aaacagacac tggctgcctt 3841 tgttccactc ttgcttaaag tctgtaacaa cccaggcctc tatagcaacc cagacctctc 3901 tgcagctgct tcacttgccc ttggcaagtt ctgcatgatc agtgccactt tctgcgactc 3961 ccagcttcgt cttctgttca ccatcctgga aaagtctcca cttcccattg tccggtctaa 4021 cctcatggtt gccactgggg atctggccat ccgctttccc aatctggtgg acccctggac 4081 tcctcatctg tatgctcgcc tccgggaccc tgctcagcaa gtgcggaaaa cagcggggct 4141 ggtgatgacc cacctgatcc tcaaggacat ggtgaaggtg aaggggcagg tcagcgagat 4201 ggcggtgctg ctcatcgacc ccgagcctca gattgctgcc ctggccaaga acttcttcaa 4261 tgagctctcc cacaagggca acgcaatcta taatctcctt ccagatatca tcagccgcct 4321 gtcagacccc gagctggggg tggaggaaga gcctttccac accatcatga aacagctcct 4381 ctcctacatc accaaggaca agcagacaga gagcctggtg gaaaagctgt gtcagcggtt 4441 ccgcacatcc ctaactgagc ggcagcagcg agacctggcc tactgtgtgt cacagctgcc 4501 cctcacagag cgaggcctcc gtaagatgct tgacaatttt gactgttttg gagacaaact 4561 gtcagatgag tccatcttca gtgctttttt gtcagttgta ggcaagctgc gacgtggggc 4621 caagcctgag ggcaaggcta taatagatga atttgagcag aagcttcggg cctgtcatac 4681 cagaggtttg gatggaatca aggagcttga gattggccaa gcaggtagcc agagagcgcc 4741 atcagccaag aaaccatcca ctggttctag gtaccagcct ctggcttcta cagcctcaga 4801 caatgacttt gtcacaccag agccccgccg tactacccgt cggcatccaa acacccagca 4861 gcgagcttcc aaaaagaaac ccaaagttgt cttctcaagt gatgagtcca gtgaggaaga 4921 tctttcagca gagatgacag aagacgagac acccaagaaa acaactccca ttctcagagc 4981 atcggctcgc aggcacagat cctaggaagt ctgttcctgt cctccctgtg cagggtatcc 5041 tgtagggtga cctggaattc gaattctgtt tcccttgtaa aatatttgtc tgtctctttt 5101 ttttaaaaaa aaaaaaggcc gggcactgtg gctcacgcct gtaatcccag cactttgcga 5161 taccaaggcg ggtggataac ctgaggtagg gagttcgaga ccagcctgac caacatggag 5221 aaaccccatc tctactaaaa ataaaaaatt agccgggcgt attggcgtgc gcctgtaatc 5281 ccagctactc aagaggctga ggcaggagaa tcgcctgaac ccagaggcgg aggttgtagt 5341 gagccgaaat cacaccattg cactccagct tgggcaacaa tagcgaacct ccatctcaaa 5401 ttaaaaaaaa aatgcctaca cgctctttaa aatgcaaggc tttctcttaa attagcctaa 5461 ctgaactgcg ttgagctgct tcaactttgg aatatatgtt tgccaatctc cttgttttct 5521 aatgaataaa tgtttttata tactttt // LOCUS D63997 6640 bp mRNA PRI 05-DEC-1997 DEFINITION Homo sapiens mRNA for GCP170, complete cds. ACCESSION D63997 NID g2662348 KEYWORDS . SOURCE Homo sapiens human pancreatic tumore cells cell_line:QGP-1 cDNA to mRNA, clone_lib:lambdaZAPII clone:pFQSY1024. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Misumi,Y., Sohda,M., Yano,A., Fujiwara,T. and Ikehara,Y. TITLE Molecular characterization of GCP170, a 170-kDa protein associated with the cytoplasmic face of the Golgi membrane JOURNAL J. Biol. Chem. 272 (38), 23851-23858 (1997) MEDLINE 97442456 REFERENCE 2 (bases 1 to 6640) AUTHORS Misumi,Y. TITLE Direct Submission JOURNAL Submitted (30-AUG-1995) to the DDBJ/EMBL/GenBank databases. Yoshio Misumi, School of Medicine, Fukuoka University, Department of Biochemistry; Jonan-ku Nanakuma 7-45-1, Fukuoka, Fukuoka 814-80, Japan (E-mail:mm034023@cc.fukuoka-u.ac.jp, Tel:092-801-1011(ex.3251), Fax:092-864-3865) FEATURES Location/Qualifiers source 1..6640 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="QGP-1" /cell_type="human pancreatic tumore cells" /clone="pFQSY1024" /clone_lib="lambdaZAPII" CDS 270..4862 /codon_start=1 /product="GCP170" /db_xref="PID:d1024542" /db_xref="PID:g2662349" /translation="MDGASAEQDGLQEDRSHSGPSSLPEAPLKPPGPLVPPDQQDKVQ CAEVNRASTEGESPDGPGQGGLCQNGPTPPFPDPPSSLDPTTSPVGPDASPGVAGFHD NLRKSQGTSAEGSVRKEALQSLRLSLPMQETQLCSTDSPLPLEKEEQVRLQARKWLEE QLKQYRVKRQQERSSQPATKTRLFSTLDPELMLNPENLPRASTLAMTKEYSFLRTSVP RGPKVGSLGLPAHPREKKTSKSSKIRSLADYRTEDSNAGNSGGNVPAPDSTKGSLKQN RSSAASVVSEISLSPDTDDRLENTSLAGDSVSEVDGNDSDSSSYSSASTRGTYGILSK TVGTQDTPYMVNGQEIPADTLGQFPSIKDVLQAAAAEHQDQGQEVNGEVRSRRDSICS SVSLESSAAETQEEMLQVLKEKMRLEGQLEALSLEASQALKEKAELQAQLAALSTKLQ AQVECSHSSQQRQDSLSSEVDTLKQSCWDLERAMTDLQNMLEAKNASLASSNNDLQVA EEQYQRLMAKVEDMQRSMLSKDNTVHDLRQQMTALQSQLQQVQLERTTLTSKLKASQA EISSLQSVRQWYQQQLALAQEARVRLQGEMAHIQVGQMTQAGILEHLKLENVSLSQQL TETQHRSMKEKGRIAAQLQGIEADMLDQEAAFMQIQEAKTMVEEDLQRRLEEFEGERE RLQRMADSAASLEQQLEQVKLTLLQRDQQLEALQQEHLDLMKQLTLTQEALQSREQSL DALQTHYDELQARLGELQGEAASREDTICLLQNEKIILEAALQAAKSGKEELDRGARR LEEGTEETSETLEKLREELAIKSGQVEHLQQETAALKKQMQKIKEQFLQQKVMVEAYR RDATSKDQLISELKATRKRLDSELKELRQELMQVHGEKRTAEAELSRLHREVAQVRQH MADLEGHLQSAQKERDEMETHLQSLQFDKEQMVAVTEANEALKKQIEELQQEARKAIT EQKQKMRRLGSDLTSAQKEMKTKHKAYENAVGILSRRLQEALAAKEAADAELGQLRAQ GGSSDSSLALHERIQALEAELQAVSHSKTLLEKELQEVIALTSQELEESREKVLELED ELQESRGFRKKIKRLEESNKKLALELEHEKGKLTGLGQSNAALREHNSILETALAKRE ADLVQLNLQVQAVLQRKEEEDRQMKHLVQALQASLEKEKEKVNSLKEQVAAAKVEAGH NRRHFKAASLELSEVKKELQAKEHLVQKLQAEADDLQIREGKHSQEIAQFQAELAEAR AQLQLLQKQLDEQLSKQPVGNQEMENLKWEVDQKEREIQSLKQQLDLTEQQGRKELEG LQQLLQNVKSELEMAQEDLSMTQKDKFMLQAKVSELKNNMKTLLQQNQQLKLDLRRGQ DEKGAESAGQLFQPCHAHQDPGLPSSRLAAGGAAETTARREQGAPQEPEQLPPAAQAG DGQPAAPDGGARPDGARVSVLVDAAGASHCQPCAPGGSRRPTRRPTETQSEQGFQRRA GRVTAVDSPPCAAAPEGSYQCYLFDCVVDVFLRHEI" polyA_signal 6617..6622 BASE COUNT 1655 a 1703 c 1986 g 1296 t ORIGIN 1 cgtgtacttc cttgtttgtc tttgtcgctg gccgacttgt cttcattcca ggtggccaga 61 gcgagtgggg ccgggcgttg tcacgggtat catgatatta gctggtttga catcaagtca 121 tttgtgagtc atcagatctt ctcctgaaaa tgggagacac agtagggccc ctcccaggag 181 ctcttggctg ttgctgatgg cagaagccaa gcttgtccaa ggttcacttg tagcccctca 241 gcgtcagctc agctggtgtc gtcctgacca tggacggcgc gtcggccgag caagatggcc 301 tccaggagga cagatcccac agtggcccct cgtctctccc cgaggcccca ctgaagcccc 361 cgggcccact ggtgccacct gaccagcagg acaaagtcca gtgtgccgag gtaaacagag 421 catccacgga aggggaaagc ccggatggac ctggccaggg aggcctctgt cagaacgggc 481 caacgccacc cttcccagac cctccgtcgt ctctcgatcc caccacaagc ccagtgggcc 541 ctgatgcctc tccaggtgtg gctggtttcc atgacaacct aaggaagtct cagggaacta 601 gtgctgaggg cagtgttaga aaagaagctt tgcagtctct cagactcagt cttcctatgc 661 aagaaacgca actgtgctct acagattctc ccctgcccct ggagaaggag gagcaggtcc 721 gacttcaggc tcggaagtgg ctggaagagc agctcaaaca gtacagggtg aagcgccagc 781 aggagaggtc cagtcaacct gcaaccaaaa cgagactttt tagcacgctt gatcctgagc 841 tcatgttaaa cccagaaaac ttaccaaggg ccagtaccct ggctatgaca aaagaatatt 901 ccttcctgcg caccagtgtc cctcgggggc ctaaggtggg cagcctgggg cttccggcac 961 atcctaggga gaaaaaaact tccaaatcaa gcaaaatccg gtctctggcc gattacagaa 1021 ctgaagattc aaatgcgggg aattctgggg gaaatgtccc ggctcccgat tctaccaagg 1081 gttccctgaa gcagaacaga agcagtgcgg cgtccgttgt gtctgagatc agcctgtccc 1141 ccgacactga cgaccgtctg gagaacacct ccctggctgg agacagcgtg tctgaggtgg 1201 atggaaatga cagcgacagc tcatcgtaca gcagcgcctc cacccgaggg acctatggca 1261 ttctgtcgaa gacagtgggc acgcaggaca ccccctatat ggtcaacggc caggagattc 1321 ctgcggatac cctgggccag ttcccctcca ttaaggacgt cctccaggcc gcagccgctg 1381 agcaccaaga ccaggggcag gaggtcaacg gggaggtgcg gagtcggaga gacagcatct 1441 gcagcagcgt gtccttggag agctctgcag cagaaacaca ggaggagatg ctgcaggtgc 1501 tcaaagagaa aatgcgactc gaaggacagc tggaagcctt gtcactggag gcgagtcagg 1561 cacttaaaga gaaggctgag ctgcaggccc agctggccgc cctcagcacg aagctgcagg 1621 cgcaggtgga gtgcagccac agcagccagc agcggcagga ttcgctgagc tcggaggtgg 1681 acaccctgaa gcagtcgtgc tgggacctgg agcgagccat gactgacctg cagaacatgc 1741 tggaggcaaa aaatgccagc ctggcgtcgt ccaacaacga cttgcaggtg gccgaggagc 1801 agtaccagag gcttatggcc aaggtagagg acatgcagag gagcatgctc agcaaggaca 1861 acacagtgca cgacctgcga cagcagatga cagccttgca gagccagctt cagcaggtgc 1921 agctggagcg gacgacgctg accagcaagc tgaaggcgtc gcaggcggag atctcgtccc 1981 tacagagtgt ccggcagtgg taccagcagc agctcgccct ggcacaggag gcccgcgtca 2041 ggctgcaggg tgagatggcc cacatccagg ttggacagat gacccaggct ggtatcctgg 2101 agcacctgaa actcgagaat gtgtccctgt cccagcagct gacggaaact cagcacaggt 2161 ccatgaagga gaaggggcgc atcgcggcac agctgcaggg cattgaggct gacatgttgg 2221 atcaggaagc agccttcatg cagattcagg aggcaaagac gatggtggag gaggaccttc 2281 agaggaggct ggaagagttt gaaggtgaga gggagcggct gcagaggatg gcggactcgg 2341 cggcatccct ggagcagcag ctggagcagg tgaagttgac tttactccag cgagaccagc 2401 agcttgaggc tttgcagcag gagcacctgg acctgatgaa acagctcacc ttgactcagg 2461 aggctctgca gagcagggag cagtccctcg atgccctgca gacacactac gatgagctgc 2521 aggccaggct gggggagctg cagggcgagg ccgcctccag ggaggacacg atctgcctcc 2581 tgcagaacga gaagatcatc ttggaggcgg ctttgcaggc ggccaagagt ggcaaggagg 2641 agcttgacag aggagcaaga cgcttggaag aaggtaccga ggaaacgtcg gaaactttag 2701 agaagttaag agaagaatta gctatcaaat ccggccaggt ggaacacctg cagcaggaga 2761 ctgctgctct gaaaaagcaa atgcaaaaaa taaaggaaca gtttctccaa caaaaggtga 2821 tggtggaggc ctaccggcgc gacgccacct ccaaagacca gctcatcagt gagctgaaag 2881 ccaccaggaa gaggctggac tcggagctga aggagctgcg gcaggagctg atgcaagtgc 2941 acggggagaa gcggactgcc gaggcggagc tctcgcgcct gcacagagag gtggcccagg 3001 tccgtcagca catggcggac cttgaagggc atctccagtc ggcgcagaag gagcgagacg 3061 agatggaaac acacttgcag tcgttgcagt tcgataagga gcagatggtc gcggtcacag 3121 aggccaatga ggcgctgaag aaacaaatcg aagagttgca gcaagaggcc cggaaggcca 3181 tcacggaaca gaagcagaag atgaggcggc tgggctcaga cttgaccagc gcccagaagg 3241 agatgaagac caaacataag gcctacgaga acgccgtggg catcctcagc cgccgcctgc 3301 aggaggccct cgcggccaag gaggctgcgg acgcggagct gggccagctc cgagcccagg 3361 gtggcagcag tgacagcagc ctggctctac atgaaaggat ccaggccctg gaggcggagc 3421 tgcaggctgt cagtcatagc aagacgctgc tggaaaagga actgcaggag gtcatagcgc 3481 tgaccagcca ggagctggag gagtcccggg agaaggtgct ggagctggag gacgagcttc 3541 aagaatccag aggctttagg aagaagataa aacgccttga ggagtcaaac aagaagttgg 3601 ctcttgaatt agagcacgag aaagggaagc ttacgggcct cggtcagtcc aacgcagctc 3661 tgcgggaaca caacagcatc ctagaaacag ctttggccaa gagggaggca gacctagtcc 3721 agttgaacct tcaggtgcag gcagttttgc agcgcaaaga agaggaggat cgccagatga 3781 agcatcttgt ccaggccctg caggcctcac tagagaagga gaaggagaag gtgaacagcc 3841 tcaaggagca ggtggctgct gccaaggtgg aagccgggca taaccgccgc cacttcaagg 3901 cggcctcctt ggagctgagt gaggtgaaga aggagctgca ggccaaggaa cacctggtgc 3961 agaagctgca ggccgaggcc gacgaccttc agattcggga ggggaaacat tcccaggaga 4021 tagcacagtt ccaagcagag ctggccgagg cccgggcaca gctccagctc ctgcagaagc 4081 agctggacga gcagctcagc aaacagcccg tgggaaacca agagatggaa aatctcaaat 4141 gggaggtgga tcagaaagaa agagaaatcc agtccttgaa gcagcagctg gacttgacgg 4201 agcagcaggg caggaaggaa ctggaagggc tacagcagct gctgcagaac gtcaagtctg 4261 agttggagat ggcccaggaa gacctgtcca tgacccagaa ggataaattt atgctccagg 4321 caaaagtgtc ggagctgaag aacaacatga agaccctgct ccagcagaac cagcagctca 4381 agctggacct acgccgcggc caagacgaga aaggagccga aagcgcaggc cagctcttcc 4441 aaccctgcca cgcccatcaa gatcccggac tgcccagttc ccgcctcgct gctggaggag 4501 ctgctgagac caccgcccgc cgtgagcaag gagcccctca agaacctgaa cagctgcctc 4561 cagcagctca agcaggagat ggacagcctg cagcgccaga tggaggagca cgccctgacg 4621 gtgcacgagt ctctgtcctc gtggacgccg ctggagccag ccactgccag ccctgtgccc 4681 ccggggggtc acgccggccc acgcggcgac ccacagagac acagtcagag cagggcttcc 4741 aaagaagggc cgggagagtg actgctgtgg actcgcctcc gtgcgccgct gccccagaag 4801 gctcttatca atgttattta tttgattgtg tggtcgatgt ttttctaaga catgaaattt 4861 aagttttgtt ttgcctttaa caagaagtaa aatatatagc agaatgagag ccaaggacta 4921 gaaaaacatt cgaagatcac aattagcttt tcacatggaa tgaccaactc ttaaaagcct 4981 gataggctct cggcgaggag ctttgaacgt gtctgaaggg ttacttgtag gtcgtggctt 5041 ctgagcggcc accgatgctg ctctctgcgg gtgacaggga gaggctgcgt aactgggagc 5101 agctgtgtga cagggtctgc ggcacccgcc tggccaggcc ggctgcagtt tctcacttcc 5161 ctgttccatt cagtaagagc tttacttttc cgcagaaatg aaattttatc tgtacctttg 5221 gctttttact tgtttttttt gatagccatc ccaccatagg atgtgtacat agatactgaa 5281 tatcataatc caatctttgt tttttttttt ttttttttga gacagagtct cgctttgttg 5341 cccaggctgg agtgcagtgg cacactctcc gctcactgca agctccgcct cccaggttca 5401 tgcgattctc ctgcctcagc ctctcgagta gctgggatta caggcgtgcg ccactatgcc 5461 aggctaatgt ttgtattttt agtagcaatg gggtttcacc atgttggcca ggatggtctc 5521 gatctcctga cctcaagtga tctgcccatc tcagcctccc aaagtgctgg aattataggc 5581 gtgaaccacc gcccccggct gcagttggat ttttaaattg ctttttttta ttgttgaggt 5641 ttttttatct ccaagggact ctcccggcac ttctaccttc cagagttact tcagtgcata 5701 aagtttgaat tattttgttc ttgtgggcag aagtgggaat gatggaatat cctcacggaa 5761 aaggcagtga agttgggagt actgcttaca aaacagggtc accagtgcat tatgtggcgt 5821 gttcatcccc acgccgtgtg tcacgggcta gggcggcgtg ttcatcccca caccgtgtgt 5881 cacaacaggc tagggcactt cacgatgtca ctacttgttt ttctgacgtt ccaaaaacaa 5941 cgtaacttgg ttttcatgtg tttttccatg gtatatgtga gattgatgct acgggtctta 6001 cggactcaca cccgttccca ctctctgcaa tatggatcag gcagtgtttc tgataggatg 6061 tgaaatggac tctcctcggg tgggtccagc aggggccctg cccaccagaa cacagtccgt 6121 gctgtgctgc gctaaggagc tggccctcaa ctctccttgg tgcagggttc ccacaaccga 6181 gttctagttc cctgaggtct ttaaaaacaa aaacagaatg ttgtacgtga agattctagg 6241 aggggaggga ccagcaaatc tgagagaacc gtcctggggc ctcccttcga ggagccctct 6301 gatgtgagga gggacttgag ttgagtgacg ctgtggtgtg aggtgttctg agctcactga 6361 ccggaaggtc caggtgaatc tcgtcataag tgatctcagg ctctcacagg atccggaggg 6421 aaatgtgtta gagggtctgg aaaattcagt gcttttgagt tacttgtttt tattaaaaat 6481 ttcctcacaa aagagagtcc tcaagttgtg gctgttcttg ggaaaggggt caccgtgtct 6541 gacaaagtgt aactttaaaa agcacgttca ttttttacaa atgtaagtgt gcttgggaat 6601 tccttaaatt ttgtgcaata aactattttt tggtaaagat // LOCUS D64007 567 bp mRNA PRI 23-DEC-1996 DEFINITION Human mRNA for RT14, complete cds. ACCESSION D64007 NID g1752637 KEYWORDS RT14. SOURCE Homo sapiens fetus brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Watanabe,T.K., Fujiwara,T., Shinomiya,H., Kuga,Y., Hishigaki,H., Nakamura,Y. and Hirai,Y. TITLE Molecular cloning of a novel human cDNA, RT14, containing a putative ORF highly conserved between human, fruit fly, and nematode JOURNAL DNA Res. 2 (5), 235-237 (1995) MEDLINE 96366390 REFERENCE 2 (bases 1 to 567) AUTHORS Watanabe,T., Shinomiya,H., Fujiwara,T., Takahashi,E., Nakamura,Y. and Hirai,Y. TITLE Molecular cloning of a novel gene, RT14, containing an ORF homologus to a reverse-complementary sequence of Tra-2 gene JOURNAL Unpublished (1995) REFERENCE 3 (bases 1 to 567) AUTHORS Watanabe,T. TITLE Direct Submission JOURNAL Submitted (31-AUG-1995) to the DDBJ/EMBL/GenBank databases. Takeshi Watanabe, Otsuka GEN Research Institute,Otsuka Pharmaceutical Co.,Ltd; 463-10 Kagasuno Kawauchi-cho, Tokushima, Tokushima 771-01, Japan (Tel:0886-65-2888, Fax:0886-37-1035) FEATURES Location/Qualifiers source 1..567 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="brain" gene 99..476 /gene="RT14" CDS 99..476 /gene="RT14" /note="similar to D. melanogaster reverse-complementary sequence of Tra-2: GenBank Accession Number M30939" /codon_start=1 /product="RT14" /db_xref="PID:d1011538" /db_xref="PID:g1752638" /translation="MLSRLLKEHQAKQNERKELQEKRRREAITAATCLTEALVDHLNV GVAQAYMNQRKLDHEVKTLQVQAAQFAKQTGQWIGMVENFNQALKEIGDVENWARSIE LDMRTIATALEYVYKGQLQSAPS" BASE COUNT 143 a 170 c 169 g 85 t ORIGIN 1 cagcgggcac gtgacatggc cccggggagc cgaggtgagc gtccagcttc cggagccgga 61 gggggcccgg cgtacccagc ccccagcccg acgtgaccat gctgtcccgc ctcctaaaag 121 aacaccaggc caagcagaat gaacgcaagg agctgcagga aaagaggagg cgagaggcta 181 tcactgcagc gacctgcctg acagaagctt tggtggatca cctcaatgtg ggtgtggccc 241 aggcctacat gaaccagaga aagctggacc atgaggtgaa gaccctacag gtccaggctg 301 cccaatttgc caagcagaca ggccagtgga tcggaatggt ggagaacttc aaccaggcac 361 tcaaggaaat tggggatgtg gagaactggg ctcggagcat cgagctggac atgcgcacca 421 ttgccactgc actggaatat gtctacaaag ggcagctgca gtctgcccct tcctagcccc 481 tgttccctcc ccaaacccta tccctcctac ctcacccgca gggggaagga gggaggctga 541 caagccttga ataaaacaca agcctcc // LOCUS D64109 1204 bp mRNA PRI 29-JUL-1996 DEFINITION Human mRNA for tob family, complete cds. ACCESSION D64109 NID g1469154 KEYWORDS tob family. SOURCE Homo sapiens cDNA to mRNA, clone_lib:Dauji clone:tob4. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1204) AUTHORS Matsuda,S., Ikematsu,N. and Yamamoto,T. TITLE Cioning of tob4 JOURNAL Unpublished (1995) REFERENCE 2 (bases 1 to 1204) AUTHORS Ikematsu,N. TITLE Direct Submission JOURNAL Submitted (09-SEP-1995) to the DDBJ/EMBL/GenBank databases. Naoko Ikematsu, The University of Tokyo, Oncology; 4-6-1 Shirokanedai, Minato-ku, Tokyo 108, Japan (E-mail:michino@ims.u-tokyo.ac.jp, Tel:03-5449-5303, Fax:03-5449-5413) COMMENT Submitted (9-Sep-1995) to DDBJ by: Naoko Ikematsu Dept. of Oncology The University of Tokyo 4-6-1 Shirokanedai Minato-ku, Tokyo 108 Japan Phone: 03-5449-5303 Email: michino@ims.u-tokyo.ac.jp Fax: 03-5449-5413. FEATURES Location/Qualifiers source 1..1204 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="tob4" /clone_lib="Dauji" CDS 94..1128 /codon_start=1 /product="tob family" /db_xref="PID:d1011627" /db_xref="PID:g1469155" /translation="MQLEIKVALNFIISYLYNKLPRRRADLFGEELEQALKKKYEGHW YPEKPLKGSGFRCVHIGEMVDPVVELAAKRSGLAVEDVRANVPEELSVWIDPFEVSYQ IGEKGAVKVLYLDDSEGCGAPELDKEIKSSFNPDAQVFVPIGSQDSSLSNSPSPSFGQ SPSPTFIPRSAQPITFTTASFAATKFGSTKMKKGGGAASGGGVASSGAGGQQPHQQPR MARSPTNSLLKHKSLSLSMHSLNFITANPAPQSQLSPNAKEFVYNGGGSPSLFFDAAD GRAAAPQARLEAVGLAPATAAALTWPRYLEVVPTASSWRRHPFVEGLSYNLNTMQYPS QQFQPVVLAN" BASE COUNT 270 a 366 c 340 g 228 t ORIGIN 1 atattttctc acggtgcctc tcatttccca gagccgcctg gagcccaagg ctgtacacgt 61 gccctgtgct gattctctgc ctaggaaagg accatgcagc tagagatcaa agtggccctg 121 aacttcatca tctcctactt gtacaacaag ctgccccggc gccgggcaga cctgtttggg 181 gaggagctag agcaggcttt gaaaaagaaa tatgaaggcc actggtaccc tgagaagcca 241 ctgaaaggct ctggcttccg ctgtgttcac attggggaga tggtggaccc cgtggtggag 301 ctggccgcca agcggagtgg cctggcagtg gaagatgtgc gggccaatgt gcctgaggag 361 ctgagtgtct ggattgatcc ctttgaggtg tcctaccaga ttggtgagaa gggagctgtg 421 aaagtgctgt acctggatga cagtgagggt tgcggtgccc cagagctgga caaggagatc 481 aagagcagct tcaaccctga cgcccaggtg ttcgtgccca ttggcagcca ggacagctcc 541 ctgtccaact ccccatcgcc atcctttggc cagtcaccca gccctacctt cattccccgc 601 tccgctcagc ccatcacctt caccaccgcc tccttcgctg ccaccaaatt tggctccact 661 aagatgaaga aggggggcgg ggcagcaagt ggtgggggtg tagccagcag tggggcgggt 721 ggccagcagc cacaccagca gcctcgcatg gcccgctcac ccaccaacag cctgctgaag 781 cacaagagcc tctctctgtc tatgcattca ctgaacttca tcacggccaa cccggcccct 841 cagtcccagc tctcacccaa tgccaaggag ttcgtgtaca acggtggtgg ctcacccagc 901 ctcttctttg atgcggccga tggcagggca gcggcacccc aggcccgttt ggaggcagtg 961 gggctggcac ctgcaacagc agcagctttg acatggccca ggtatttgga ggtggtgcca 1021 acagcctctt cctggagaag acaccccttt gtggaaggcc tcagctacaa cctgaacacc 1081 atgcagtatc ccagccagca gttccagccc gtggtgctgg ccaactgacc atctacctgc 1141 ccgtggggcc aggagcaccc aagaccacag aaaagagaaa ggaaaggcca aaaaaaaaaa 1201 aacc // LOCUS D64142 1232 bp mRNA PRI 15-OCT-1996 DEFINITION Human mRNA for histone H1x, complete cds. ACCESSION D64142 NID g1620002 KEYWORDS histone H1x. SOURCE Homo sapiens lymphocyte cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Yamamoto,T. and Horikoshi,M. TITLE Cloning of the cDNA encoding a novel subtype of histone H1 JOURNAL Gene 173 (2), 281-285 (1996) MEDLINE 97082983 REFERENCE 2 (bases 1 to 1232) AUTHORS Horikoshi,M. JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 1232) AUTHORS Horikoshi,M. TITLE Direct Submission JOURNAL Submitted (16-SEP-1995) to the DDBJ/EMBL/GenBank databases. Masami Horikoshi, Inst. Mol. Cell. Biosci., The University of Tokyo, Laboratory of Developmental Biology; 1-1-1 Yayoi, Bunkyo-ku, Tokyo 113, Japan (E-mail:horikosh@imcbns.iam.u-tokyo.ac.jp, Tel:03-5802-3388, Fax:03-5684-8341) FEATURES Location/Qualifiers source 1..1232 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lymphocyte" mRNA 1..1232 CDS 102..743 /codon_start=1 /product="histone H1x" /db_xref="PID:d1011677" /db_xref="PID:g1620003" /translation="MSVELEEALPVTTAEGMAKKVTKAGGSAALSPSKKRKNSKKKNQ PGKYSQLVVETIRRLGERNGSSLAKIYTEAKKVPWFDQQNGRTYLKYSIKALVQNDTL LQVKGTGANGSFKLNRKKLEGGGERRGAPAAATAPAPTAHKAKKAAPGAAGSRRADKK PARGQKPEQRSHKKGAGAKKDKGGKAKKTAAAGGKKVKKAAKPSVPKVPKGRK" polyA_signal 1212..1217 BASE COUNT 220 a 418 c 402 g 192 t ORIGIN 1 gcgcccccag cccccctgca ccccctcggc ccctcgcctt cctcttcccg gcgcggcccc 61 ccggcttccg cgcgccgccc gccaccaatc ctcttgctac catgtccgtg gagctcgagg 121 aggccctgcc agtgacgacc gccgagggaa tggccaagaa ggtgaccaag gctggcggct 181 cggcggcgtt gtccccatct aagaagagga agaatagcaa gaagaagaac cagccgggca 241 agtacagcca gctggtggtg gagaccatcc gtaggctggg cgagcgcaac ggctcgtcgc 301 tggccaagat ctacaccgag gccaagaagg ttccgtggtt cgaccagcag aatgggcgca 361 cctacctcaa gtactcgatc aaggcgctgg tgcagaacga cacgcttctg caggtgaagg 421 gcaccggcgc caacggttcc ttcaagctca accgcaagaa gctggagggc ggcggggagc 481 ggcgcggagc cccggcggcc gccaccgccc cggcccccac cgcgcacaaa gcgaagaagg 541 cagccccggg cgcggccggc tcccggcgcg cggacaagaa gcccgccagg ggccagaagc 601 cggagcagcg ctcgcacaag aagggcgctg gcgccaagaa ggacaaaggc ggcaaggcca 661 agaagacggc ggccgccggg ggcaagaagg tgaagaaggc ggccaagccc agcgtcccca 721 aagtgcccaa gggccgcaag tgagcgtgtc ggccggtcag agcggccggc gtggactttt 781 cggtgttttt gtttttctac cccaagtgac gtagattttg tacggctcac gccggccggg 841 gccgcgaggc ctggtctgag cctcagggag gggccccggg tcctctcagt ctttcccctc 901 ccccaacgat gtagcgtttt tcgttgtttg ctttaggttt ttgaaacagc cccggcgacg 961 cctctattgg ctctcggcct tggcaacggc cgtcgtcatg gttactggcc cctaggcgcc 1021 gatggccgag gccgcgcctg cccaccgggc ggggtcgctg gttggccggg cccaggcgcg 1081 cggggacgcg gaggccgcgc atcctttccc agctccccac cctccttgcc tttgggtgcg 1141 cgacaaacaa tcgctccggg ctcagggctg cgcggctctt cccttcattc catgggcctt 1201 tttttgggca caataaagcg tttaaacctt tc // LOCUS D67025 2177 bp mRNA PRI 02-DEC-1997 DEFINITION Homo sapiens mRNA for proteasome subunit p58, complete cds. ACCESSION D67025 NID g2656091 KEYWORDS . SOURCE Homo sapiens cell_line:HepG2 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Kominami,K., Okura,N., Kawamura,M., DeMartino,G.N., Slaughter,C.A., Shimbara,N., Chung,C.H., Fujimuro,M., Yokosawa,H., Shimizu,Y., Tanahashi,N., Tanaka,K. and Toh-e,A. TITLE Yeast counterparts of subunits S5a and p58 (S3) of the human 26S proteasome are encoded by two multicopy suppressors of nin1-1 JOURNAL Mol. Biol. Cell 8 (1), 171-187 (1997) MEDLINE 97170075 REFERENCE 2 (bases 1 to 2177) AUTHORS Tanaka,K. TITLE Direct Submission JOURNAL Submitted (22-SEP-1995) to the DDBJ/EMBL/GenBank databases. Keiji Tanaka, The University of Tokushima, Inst. for Enz. Res.; 3-18-15 Kuramoto-cho, Tokushima, Tokushima 770, Japan (E-mail:keiji@ier.tokushima-u.ac.jp, Tel:0886-31-3111(ex.2563), Fax:0886-33-7431) FEATURES Location/Qualifiers source 1..2177 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HepG2" CDS 158..1762 /codon_start=1 /product="proteasome subunit p58" /db_xref="PID:d1024532" /db_xref="PID:g2656092" /translation="MKQEGSARRRGADKAKPPPGGGEQEPPPPPAPQDVEMKEEAATG GGSTGEADGKTAAAAVEHSQRELDTVTLEDIKEHVKQLEKAVSGKEPRFVLRALRMLP STSRRLNHYVLYKAVQGFFTSNNATRDFLLPFLEEPMDTEADLQFRPRTGKAASTPLL PEVEAYLQLLVVIFMMNSKRYKEAQKISDDLMQKISTQNRRALDLVAAKCYYYHARVY EFLDKLDVVRSFLHARLRTATLRHDADGQATLLNLLLRNYLHYSLYDQAEKLVSKSVF PEQANNNEWARYLYYTGRIKAIQLEYSEARRTMTNALRKAPQHTAVGFKQTVHKLLIV VELLLGEIPDRLQFRQPSLKRSLMPYFLLTQAVRTGNLAKFNQVLDQFGEKFQADGTY TLIIRLRHNVIKTGVRMISLSYSRISLADIAQKLQLDSPEDAEFIVAKAIRDGVIEAS INHEKGYVQSKEMIDIYSTREPQLAFHQRISFCLDIHNMSVKAMRFPPKSYNKDLESA EERREREQQDLEFAKEMAEDDDDSFP" BASE COUNT 500 a 623 c 634 g 420 t ORIGIN 1 gaattcgcgg ccgctggttt gcagctgctc cgtcatcgtg cggcccgacg ctatctcgcg 61 ctcgtgtgca ggcccggctc ggctcctggt ccccggtgcg agggttaacg cgaggccccg 121 gcctcggtcc ccggactagg ccgtgacccc gggtgccatg aagcaggagg gctcggcgcg 181 gcgccgcggc gcggacaagg cgaaaccgcc gcccggcgga ggagaacaag aacccccacc 241 gccgccggcc ccccaggatg tggagatgaa agaggaggca gcgacgggtg gcgggtcaac 301 gggggaggca gacggcaaga cggcggcggc agcggttgag cactcccagc gagagctgga 361 cacagtcacc ttggaggaca tcaaggagca cgtgaaacag ctagagaaag cggtttcagg 421 caaggagccg agattcgtgc tgcgggccct gcggatgctg ccttccacat cacgccgcct 481 caaccactat gttctgtata aggctgtgca gggcttcttc acttcaaata atgccactcg 541 agactttttg ctccccttcc tggaagagcc catggacaca gaggctgatt tacagttccg 601 tccccgcacg ggaaaagctg cgtcgacacc cctcctgcct gaagtggaag cctatctcca 661 actcctcgtg gtcatcttca tgatgaacag caagcgctac aaagaggcac agaagatctc 721 tgatgatctg atgcagaaga tcagtactca gaaccgccgg gccctagacc ttgtagccgc 781 aaagtgttac tattatcacg cccgggtcta tgagttcctg gacaagctgg atgtggtgcg 841 cagcttcttg catgctcggc tccggacagc tacgcttcgg catgacgcag acgggcaggc 901 caccctgttg aacctcctgc tgcggaatta cctacactac agcttgtacg accaggctga 961 gaagctggtg tccaagtctg tgttcccaga gcaggccaac aacaatgagt gggccaggta 1021 cctctactac acagggcgaa tcaaagccat ccagctggag tactcagagg cccggagaac 1081 gatgaccaac gcccttcgca aggcccctca gcacacagct gtcggcttca aacagacggt 1141 gcacaagctt ctcatcgtgg tggagctgtt gctgggggag atccctgacc ggctgcagtt 1201 ccgccagccc tccctcaagc gctcactcat gccctatttc cttctgactc aagctgtcag 1261 gacaggaaac ctagccaagt tcaaccaggt cctggatcag tttggggaga agtttcaagc 1321 agatgggacc tacaccctaa ttatccggct gcggcacaac gtgattaaga caggtgtacg 1381 catgatcagc ctctcctatt cccgaatctc cttggctgac atcgcccaga agctgcagtt 1441 ggatagcccc gaagatgcag agttcattgt tgccaaggcc atccgggatg gtgtcattga 1501 ggccagcatc aaccacgaga agggctatgt ccaatccaag gagatgattg acatctattc 1561 cacccgagag ccccagctag ccttccacca gcgcatctcc ttctgcctag atatccacaa 1621 catgtctgtc aaggccatga ggtttcctcc caaatcgtac aacaaggact tggagtctgc 1681 agaggaacgg cgtgagcgag aacagcagga cttggagttt gccaaggaga tggcagaaga 1741 tgatgatgac agcttccctt gagctggggg gctggggagg ggtaggggga atggggacag 1801 gctctttccc ccttgggggt cccctgccca gggcactgtc cccattttcc cacacacagc 1861 tcatatgctg cattcgtgca gggggtgggg gtgctgggag ccagccaccc tgacctcccc 1921 cagggctcct ccccagccgg tgacttactg tacagcaggc aggagggtgg gcaggcaacc 1981 tccccgggca gggtcctggc cagcagtgtg ggagcaggag gggaaggata gttctgtgta 2041 ctcctttagg gagtggggga ctagaactgg gatgtcttgg cttgtatgtt ttttgaagct 2101 tcgattatga tttttaaaca ataaaaagtt ctcccaaaaa aaaaaaaaaa aaaaaaaaaa 2161 aaagcggccg cgaattc // LOCUS D67029 5434 bp mRNA PRI 19-NOV-1996 DEFINITION Human SEC14L mRNA, complete cds. ACCESSION D67029 NID g1669536 KEYWORDS SEC14L. SOURCE Homo sapiens adult lung cDNA to mRNA, clone:HY3338. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Chinen,K., Takahashi,E. and Nakamura,Y. TITLE Isolation and mapping of a human gene (SEC14L), partially homologous to yeast SEC14, that contains a variable number of tandem repeats (VNTR) site in its 3' untranslated region JOURNAL Cytogenet. Cell Genet. 73 (3), 218-223 (1996) MEDLINE 96302338 REFERENCE 2 (bases 1 to 5434) AUTHORS Nakamura,Y. JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 5434) AUTHORS Nakamura,Y. TITLE Direct Submission JOURNAL Submitted (23-SEP-1995) to the DDBJ/EMBL/GenBank databases. Yusuke Nakamura, Institute of Medical Science,The University of Tokyo, Laboratory of Molecular Medicine; 4-6-1 Sirokanedai, Minato-ku, Tokyo 108, Japan (E-mail:yusuke@ims.u-tokyo.ac.jp, Tel:03-5449-5372, Fax:03-5449-5433) FEATURES Location/Qualifiers source 1..5434 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HY3338" /dev_stage="adult" /tissue_type="lung" gene 304..2451 /gene="SEC14L" CDS 304..2451 /gene="SEC14L" /note="deduced amino acid sequence is highly homologous to hypothetical proteins of C.elegans(T23g5.4 and T23G5.2)." /codon_start=1 /db_xref="PID:d1011708" /db_xref="PID:g1669537" /translation="MVQKYQSPVRVYKYPFELIMAAYERRFPTCPLIPMFVGSDTVSE FKSEDGAIHVIERRCKLDVDAPRLLKKIAGVDYVYFVQKNSLNSRERTLHIEAYNETF SNRVIINEHCCYTVHPENEDWTCFEQSASLDIKSFFGFESTVEKIAMKQYTSNIKKGK EIIEYYLRQLEEEGITFVPRWSPPSITPSSETSSSSSKKQAASMAVVIPEAALKEGLS GDALSSPSAPEPVVGTPDDKLDADHIKRYLGDLTPLQESCLIRLRQWLQETHKGKIPK DEHILRFLRARDFNIDKAREIMCQSLTWRKQHQVDYILETWTPPQVLQDYYAGGWHHH DKDGRPLYVLRLGQMDTKGLVRALGEEALLRYVLSVNEERLRRCEENTKVFGRPISSW TCLVDLEGLNMRHLWRPGVKALLRIIEVVEANYPETLGRLLILRAPRVFPVLWTLVSP FIDDNTRRKFLIYAGNDYQGPGGLLDYIDKEIIPDFLSGECMCEVPEGGLVPKSLYRT AEELENEDLKLWTETIYQSASVFKGAPHEILIQIVDASSVITWDFDVCKGDIVFNIYH SKRSPQPPKKDSLGAHSITSPGGNNVQLIDKVWQLGRDYSMVESPLICKEGESVQGSH VTRWPGFYILQWKFHSMPACAASSLPRVDDVLASLQVSSHKCKVMYYTEVIGSEDFRG SMTSLESSHSGFSQLSAATTSSSQSHSSSMISR" repeat_region 4138..4788 /note="Tandem repetition is polymorphic." /rpt_type=TANDEM repeat_unit 4139..4151 polyA_signal 5413..5418 BASE COUNT 1229 a 1282 c 1518 g 1405 t ORIGIN 1 caagtgccgt cgccgcgccc cttccccctc ccgcctcccc ggccccctcc ccggaaccgg 61 cggtcgagct acggtcgcgg acgagtggaa ccgagactgc cccgcggagc cgccggtatg 121 agcgcccctc gccaccccgt gtcccaggcc cggcctttct gacaagagct agacttcggg 181 ctccttgagg atattcagtt ttgtatgttt gaatatcctc tcaccatgtt cagcataaag 241 taccattctt aatgattatc ctcaacaaga caggtgtgag agggttgctg ttgcattgca 301 atcatggtgc aaaaatacca gtccccagtg agagtgtaca aatacccctt tgaattaatt 361 atggctgcct atgaaaggag gttccctaca tgtcctttga ttccgatgtt cgtgggcagt 421 gacactgtga gtgaattcaa gagcgaagat ggggctattc atgtcattga aaggcgctgc 481 aagctggatg tagatgcacc cagactgctg aagaagattg caggagttga ttatgtttat 541 tttgtccaga aaaactcact gaattctcgg gaacgtactt tgcacattga ggcttataat 601 gaaacgtttt ccaatcgggt catcattaat gagcattgct gctacaccgt tcaccctgaa 661 aatgaagatt ggacctgttt tgaacagtct gcaagtttag atattaaatc tttctttggt 721 tttgaaagta cagtggaaaa aattgcaatg aaacaatata ccagcaacat taaaaaagga 781 aaggaaatca tcgaatacta ccttcgccaa ttagaagaag aaggcataac ctttgtgccc 841 cgttggagtc cgccttccat cacgccctct tcagagacat cttcatcatc ctccaagaaa 901 caagcagcgt ccatggccgt cgtcatccca gaagctgccc tcaaggaggg gctgagtggt 961 gatgccctca gcagccccag tgcacctgag cccgtggtgg gcacccctga cgacaaacta 1021 gatgccgacc acatcaagag atacctgggc gatttgactc cgctgcagga gagctgcctc 1081 attagacttc gccagtggct ccaggagacc cacaagggca aaattccaaa agatgagcat 1141 attcttcggt tcctccgtgc acgggatttt aatattgaca aagccagaga gatcatgtgt 1201 cagtctttga cgtggagaaa gcagcatcag gtagactaca ttcttgaaac ctggacccct 1261 cctcaggtcc ttcaggatta ctacgcggga ggctggcatc atcacgacaa agatgggcgg 1321 cccctctacg tgctcaggct ggggcagatg gacaccaaag gcttggtgag agcgctcggg 1381 gaggaagccc tgctgagata cgttctctcc gtaaatgaag aacggctaag gcgatgcgaa 1441 gagaatacaa aagtctttgg tcggcctatc agctcatgga cctgcctggt ggacttggaa 1501 gggctgaaca tgcgccactt gtggagacct ggtgtgaaag cgctgctgcg gatcatcgag 1561 gtggtggagg ccaactaccc tgagacactg ggccgccttc tcatcctgcg ggcgcccagg 1621 gtatttcctg tgctctggac gctggttagt ccgttcattg atgacaacac cagaaggaag 1681 ttcctcattt atgcaggaaa tgactaccag ggtcctggag gcctgctgga ttacatcgac 1741 aaagagatta ttccagattt cctgagtggg gagtgcatgt gcgaagtgcc agagggtgga 1801 ctggtcccca aatctctgta ccggactgca gaggagctgg agaacgaaga cctgaagctc 1861 tggactgaga ccatctacca gtctgcaagc gtcttcaaag gagccccaca tgagattctc 1921 attcagattg tggatgcctc gtcagtcatc acttgggatt tcgacgtgtg caaaggggac 1981 attgtgttta acatctatca ctccaagagg tcgccacaac cacccaaaaa ggactccctg 2041 ggagcccaca gcatcacctc tccgggtggg aacaatgtgc agctcataga caaagtctgg 2101 cagctgggcc gcgactacag catggtggag tcgcctctga tctgcaaaga aggagaaagc 2161 gtgcagggtt cccatgtgac caggtggccg ggcttctaca tcctgcagtg gaaattccac 2221 agcatgcctg cgtgcgccgc cagcagcctt ccccgggtgg acgacgtgct tgcgtccctg 2281 caggtctctt cgcacaagtg taaagtgatg tactacaccg aggtgatcgg ctcggaggat 2341 ttcagaggtt ccatgacgag cctggagtcc agccacagcg gcttctccca gctgagtgcc 2401 gccaccacct cctccagcca gtcccactcc agctccatga tctccaggta gtgccgcgct 2461 gcctgcacct agtgtgcaga ggggacggcc gcccctcctc ggacagcagc tgcacccgcc 2521 cacccagcgg cgacattgta cagactcctc tcacctctag atagcaaata gctctcagat 2581 ggtaaacgta gtcgtttgat cccaaaacta ccttggcagg tagttttaac tctgatccta 2641 acttaactca atagccatag attttgtata cgttgtgcac aaaatccaac cagagcgcaa 2701 gggctctctt gaaagaaaag tagtttctgt accaattaaa ggattgacgt ggtctcagat 2761 attgatgcaa aaaatttttc caacgaactc cgcattgtcc attagtgaat gaattcctgt 2821 gacatcctcc agagatggcc cctcctcacc tgggacggaa gctgccagct cgcttccccc 2881 aagctgcctc atggcccgca cgccgcctca cggcccccat gcttcccgcc agtcaagatg 2941 gtctgtggac ttagggccag cccttgaggt ccttatcctc tgaggattca gaggttgcct 3001 gcggagtacc ttgtcccagg gccagacaca cccacaccac ccactgtctg cagtggggcc 3061 gggggctcag gaggggctct cagggactcc tggtgactcc aggaaaatgc tgccatcgtt 3121 aaacattact ttctctttcc tccttttcaa atctttttga tactttttag agcaggattt 3181 ttctgtatgt gaacttgggt gggggggttc ttcccgtttc cttccgtgcg tcgcccctct 3241 cacctgcagt cagctcccag cccagtgtag gccatctcct ctgtgccctc tggaggctca 3301 ttgtctcaga gcccagacag ttccagccac taggaggccg tcttggaacc agcaagtcgc 3361 atttgccact tgacactgtc catggggttt tattagtagc taagcagcag ctctcgcatc 3421 cacttcaggg tggcgtgtgg catgtaggag tcctgcttct ttgtacatgg gaattgtgga 3481 ctcatgcgtg tgtgtgtgtg catgtgctgt gtgtgtgcat gtgtgcatga cggtgggggt 3541 gctgggggga cggggtgagt ggaaacttag tttgagtaat gaaggaatct tcacagaagc 3601 aaatcagaat atgggatttg tttgcctttt acattttgtt taattcctga ttttaaagcc 3661 tgctctatct ggtacaggcc cttatttttt cagcttttta tgggaaaagc aggttatttg 3721 agaatctgtc cagaagttgc ataggggatg gcctccacga taaggacatg caacacgtgt 3781 ttctgtgtgc agcagaggcc gtgtttttca tgccaaaccc cacgcggctg tcaactgtgt 3841 gcgtggtagg catggagatc ctggttgtgc cgtctcagct ccgctctgaa ggcactgtgt 3901 gggtgctgcg tgactggaga gctgtgtgga ggccatgtgt gccccgtgca gggatcagga 3961 gggcggggga gggaccgagc agccctcttg cccggtcggg tcagccctag tggctgcctg 4021 cacactgtag acgtcccagg gcctgtgctg tgatcacctg cctttggacc acatttgtgt 4081 ttgctcttag agatcgagct cctcagtggt acctgaagcc tttgcttccg gaaagcgcgg 4141 tagggttcgt aggtagggct agtaggtagg gttagtaggt agggctagta ggtagggcta 4201 gtaggtaggg ttagtaggta gggttcgtag gtagggctgg taggtagggt tagtaggtag 4261 ggctagtagg tagggttcgt aggtagggct agtaggtagg gttagtaggt agggctagta 4321 ggtagggcta gtaggtaggg ttagtaggta gggttcgtag gtagggctgg taggtagggt 4381 tagtaggtag ggctagtagg tagggttcgt aggtagggct agtaggtagg gttagtaggt 4441 agggctagta ggtagggcta gtaggtaggg ttagtaggta gggttcgtag gtagggctgg 4501 taggtagggt tagtaggtag ggctagtagg tagggctagt aggtagggct agtaggtagg 4561 gttagtaggt agggctagta ggtagggcta gtaggtaggg ttagtaggta gggttcgtag 4621 gtagggctgg taggtagggt tagtaggtag ggctagtagg tagggctagt aggtagggct 4681 agtaggtagg gctagtaggt agggctagta ggtagggcta gtaggtaggg ctagtaggta 4741 gggttcgtag gtagggttcg taggtagggt tcgtaggtag ggttagtagc gcgtctgtgc 4801 tgcttccacc tggtgcttcc tgttcccaaa tcacaagggc ctgaaggtgg tccctgcttt 4861 ctctttctct ttctctgtgt ctcagatggc gattttgctg acagctgcca agaaaatgct 4921 tcactcaaca gtcctcatgt gcccagagat gtttatagaa ctgtttgaat tgcagccatc 4981 ccctgccccc tcccaggctg aagatctgtt ctttttaagt tgattcggga gtggcattct 5041 tttataccca aagactgtag tgcatcttga agagctcaaa gcacatgacc gcacaaatgc 5101 ttacagggtt tcctcccgag taatccaatc tcactcccct tgtaagggaa ttctggggca 5161 gctatggttt gagtatgcag tttgcatcgt gtttctacct ttagtacctt gccactcttt 5221 taaaacgctg ctgtcatttc ccatttctta gtactaatga ttctttgatt ctccctctat 5281 tatgtcttaa ttcactttcc ttcctaaatt tgttatttgc atatcaaatt ctgtaaatgt 5341 tttgtaaaca tattacctca cttggtaata caatactgat agtctttaaa agattttttt 5401 attgttatca ataataaatg tgaactattt aaag // LOCUS D67031 2920 bp mRNA PRI 10-DEC-1997 DEFINITION Homo sapiens ADDL mRNA for adducin-like protein, complete cds. ACCESSION D67031 NID g2696053 KEYWORDS adducin-like protein; ADDL. SOURCE Homo sapiens cDNA to mRNA, clone:GEN-028F07. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Katagiri,T., Ozaki,K., Fujiwara,T., Shimizu,F., Kawai,A., Okuno,S., Suzuki,M., Nakamura,Y., Takahashi,E. and Hirai,Y. TITLE Cloning, expression and chromosome mapping of adducin-like 70 (ADDL), a human cDNA highly homologous to human erythrocyte adducin JOURNAL Cytogenet. Cell Genet. 74 (1-2), 90-95 (1996) MEDLINE 97049079 REFERENCE 2 (bases 1 to 2920) AUTHORS Katagiri,T. TITLE Direct Submission JOURNAL Submitted (22-SEP-1995) to the DDBJ/EMBL/GenBank databases. Toyomasa Katagiri, Japanese Foundation for Cancer Research, the Cancer Chemotherapy Center, Department of Human Genome Analysis; 1-37-1 Ikebukuro, Toshima-ku, Tokyo 170, Japan (E-mail:tkatagi@hgc.ims.u-tokyo.ac.jp, Tel:03-5394-3926, Fax:03-5394-3926) FEATURES Location/Qualifiers source 1..2920 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="10" /clone="GEN-028F07" /map="10q24.2-3" gene 184..2208 /gene="ADDL" CDS 184..2208 /gene="ADDL" /codon_start=1 /product="adducin-like protein" /db_xref="PID:d1024688" /db_xref="PID:g2696054" /translation="MSSDASQGVITTPPPPSMPHKERYFDRINENDPEYIRERNMSPD LRQDSSMMEQRKRVTRILQSPAFREDLECLIQEQMKKGHNPTGLLALQQIADYIMANS FSGFSSPPLSLGMVTPINDLPGADTSSYVKGEKLTRCKLASLYRLVDLFGWAHLANTY ISVRISKEQDHIIIIPRGLSFSEATASNLVKVNIIGEVVDQGSTNLKIDHTGFSPHAA IYSTRPDVKCVIHIHTLATAAVSSMKCGILPISQESLLLGDVAYYDYQGSLEEQEERI QLQKVLGPSCKVLVLRNHGVVALGETLEEAFHYIFNVQLACEIQVQALAGAGGVDNLH VLDFQKYKAFTYTVAASGGGGVNMGSHQKWKVGEIEFEGLMRTLDNLGYRTGYAYRHP LIREKPRHKSDVEIPATVTAFSFEDDTVLLSPLKYMAQRQQREKTRWLNSPNTYMKVN VPEESRNGETSPRTKITWMKAEDSSKVSGGTPIKIEDPNQFVPLNTNPNEVLEKRNKI REQNRYDLKTAGPQSQLLAGIVVDKPPSTMQFEDDDHGPPAPPNPFSHLTEGELEEYK RTIERKQQGLEENHELFSKSFISMEVPVMVVNGKDDMHDVEDELAKRVSRLSTSTTIE NIEITIKSPEKIEEVLSPEGSPSKSPSKKKKKFRTPSFLKKNKKKEKVEA" BASE COUNT 943 a 576 c 597 g 804 t ORIGIN 1 tctggttcgg cccacctctg aaggttccag aatcgatagt gaattcgtgg agtaggtttc 61 tgtgcagcat tgcagaatcc acacctagag aacagaagac acagacacgt acgtctacta 121 cccttgttag aaggaagctt tggatcttcg gtggataaca agagtaatcc acagacttaa 181 aacatgagct cagatgccag ccaaggcgtg attaccactc ctcctcctcc cagcatgcct 241 cacaaagaga gatattttga ccgcatcaat gaaaatgacc cagaatacat tagggagagg 301 aacatgtctc ctgatctacg acaagactcc agcatgatgg agcagaggaa acgagttact 361 cggatcctgc aaagtcctgc ctttcgggaa gacttggaat gccttattca agaacagatg 421 aagaaaggcc acaacccaac tggattacta gcattacagc agattgcaga ttacatcatg 481 gccaattctt tctcgggttt ttcttcacct cctctcagtc ttggcatggt cacacctatc 541 aatgaccttc ctggtgcaga tacatcctca tatgtgaagg gagaaaaact tactcgctgt 601 aaacttgcca gcctgtacag acttgtagac ttgtttggat gggcacacct ggcaaatacc 661 tatatctcag taagaataag taaggagcaa gaccacatta taataattcc cagaggccta 721 tctttttctg aagctacagc ctccaatttg gtgaaagtca atataatagg agaagtggtt 781 gaccagggaa gtaccaattt gaaaattgac catacaggat tcagtcccca tgctgcaatc 841 tattcaacac gtcctgatgt taagtgtgtg atacacatcc atacccttgc aacagcagct 901 gtatcctcca tgaaatgtgg gatccttcca atttctcaag agtctcttct tctgggagat 961 gttgcctatt atgactacca agggtcactt gaagaacagg aggagagaat tcaactgcag 1021 aaggttctgg gaccaagttg taaagtgctg gtactcagga atcatggtgt ggttgcactt 1081 ggagaaacat tagaggaggc ttttcattat atttttaatg tgcaactagc ctgtgagatt 1141 caggtgcagg ccctagcagg tgcaggtgga gtagacaatc tccatgtact ggactttcag 1201 aagtataaag ctttcactta cactgtagca gcgtctggtg gaggaggtgt gaatatgggt 1261 tcccatcaaa aatggaaggt tggcgaaatt gagtttgaag ggcttatgag gactctggac 1321 aacttggggt atagaacagg ctatgcttac aggcatcctc tcattcgaga gaagcctagg 1381 cacaagagtg atgtggaaat cccagcaact gtgactgctt tttcctttga agacgataca 1441 gtgctactct ctcctctcaa atacatggca cagaggcaac agcgtgaaaa aacaagatgg 1501 ctgaactcac caaatactta catgaaagtg aatgtgcctg aggagtctcg gaacggagaa 1561 accagtcccc gaaccaaaat cacgtggatg aaagcagaag actcatctaa agttagtggt 1621 ggaacaccta tcaaaattga agatccaaat cagtttgttc ctttaaacac aaacccgaat 1681 gaggtactag aaaagagaaa taagattcgg gaacaaaatc gatatgactt gaaaacagca 1741 ggaccacaat ctcagttgct tgctggaatt gttgtggata agccaccttc tactatgcaa 1801 tttgaagatg atgatcatgg cccaccagct cctcctaacc catttagtca tctcacagaa 1861 ggagaacttg aagagtataa gaggacaatc gaacgtaaac aacaaggcct agaagaaaac 1921 catgagctgt tttccaagag cttcatctcc atggaagtgc ctgtcatggt agtaaatggc 1981 aaggatgata tgcatgatgt tgaagatgag cttgctaagc gagtgagtag gttaagcaca 2041 agtacaacca tagaaaacat cgagattact attaagtctc cagagaaaat cgaagaagtc 2101 ctgtcacctg aaggctcccc ttcaaaatcg ccatccaaga aaaagaagaa attccgcact 2161 ccttcttttc tgaaaaagaa caaaaaaaag gagaaagttg aggcctaaat aaagtctttt 2221 tataattatt attataacaa tgtgacattg cacatctaaa taccacattt aagttgatca 2281 ttaatatgca atggtagatc agattggggg atgtagcaaa ctggacttta agaactggaa 2341 agaggtttta caaaagaaaa actttcagat tcatctctca ttttatatgt ccagaaatgg 2401 ctttgaattt taagcaatta ctagttttaa ttagctctgc cctcatgaag tattattata 2461 attcaccata aacagctatc tgtctgaatt acttcaggcc ttctccataa tatctgttag 2521 aaagaaattg ccagtgagca agtgagaatt tttatttctc aatacctgct tcacttgata 2581 atcatattat aattttttat catgattatt gactatattt ttggagtccc attgtttcag 2641 tgggcattaa cagaatgctt taaaaacttc taagacaaga atctatagca ttagtataca 2701 ctggcacata attttttaaa aagttttaag aaaagattca tttggaattt tattcacagt 2761 ataaaatttc ctcacctgaa gtaactttgt ttgccaaaaa agttgtttta ataaactata 2821 atttttgaaa acttcctttt ttattagttt agaaagcccc ttatttttca acaaagggga 2881 ttttgtacac ataacatggg ttatttagtt taactctggc // LOCUS D67035 3446 bp mRNA PRI 06-OCT-1997 DEFINITION Homo sapiens mRNA for SCP-1, complete cds. ACCESSION D67035 NID g2467368 KEYWORDS SCP-1. SOURCE Homo sapiens testis cDNA to mRNA, clone_lib:human 5' streched testis cDNA library. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Kondoh,N., Nishina,Y., Tsuchida,J., Koga,M., Tanaka,H., Uchida,K., Inazawa,J., Taketo,M., Nozaki,M., Nojima,H., Matsumiya,K., Namiki,M., Okuyama,A. and Nishimune,Y. TITLE Assignment of synaptonemal complex protein (SCP1) to human chromosome 1p13 by fluorescence in situ hybridization and its expression in the testis JOURNAL Cytogenet. Cell Genet. (1997) In press REFERENCE 2 (bases 1 to 3446) AUTHORS Nishina,Y. TITLE Direct Submission JOURNAL Submitted (25-SEP-1995) to the DDBJ/EMBL/GenBank databases. Yukio Nishina, Research for Microbial Diseases, Osaka University, Sci. for Lob. of Animal Experimentation; 3-1 Yamadaoka, Suita, Osaka 565, Japan (Tel:06-879-8338, Fax:06-879-8339) FEATURES Location/Qualifiers source 1..3446 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /clone_lib="human 5' streched testis cDNA library" /map="1q13" /tissue_type="testis" gene 177..3098 /gene="Scp-1" CDS 177..3098 /gene="Scp-1" /note="synaptonemal complex protein" /codon_start=1 /product="SCP-1" /db_xref="PID:d1023454" /db_xref="PID:g2467369" /translation="MEKQKPFALFVPPRSSSSQVSAVKPQTLGGDSTFFKSFNKCTED DFEFPFAKTNLSKNGENIDSDPALQKVNFLPVLEQVGNSDCHYQEGLKDSDLENSEGL SRVYSKLYKEAEKIKKWKVSTEAELRQKESKLQENRKIIEAQRKAIQELQCGNEKVSL TLEEGIQDNKDLIKENNATRHLCNLLKETCARSAEKTKKYEYEREETRQVYMDLNSNI EKMITAFEELRVQAENSRLEMHFKLKEDYEKIQHLEQEYKKEINDKEKQVSLLLIQIT EKENKMKDLTFLLEESRDKVNQLEEKTKLQSENLKQSIEKQHHLTKELEDIKVSLQRS VSTQKALEEDLQIATNTICQLTEEKDTQMEESNKARAAHSFVVTEFETTVCSLEELLR TEQQRLENYEDQLIILTMELQKTSSELEEMTKLTNNKEVELEELKKVLGEKETLLYDN KQFEKIAEELKGTEQELIGLLQAREKEVHDLEYSYCHYHKWTVLPKRGQRPKLSSKRE LKNTEYFTLQQASPPPNELTQETSDMTLELKNQQEDIINNKKQEERMLTQIENLQETE TQLRNELEYVREELKQKRDEVKCKLDKSEENCNNLRKQVENKNKYIEELQQENKALKK KGTAESKQLNVYEIKVNKLELELESAKQKFGEITDTYQKEIEDKKISEENLLEEVEKA KVIADEAVKLQKEIDKRCQHKIAEMVALMEKHKHQYDKIIEERDSELGLYKSKEQEQS SLRASLEIELSNLKAELLSVKKQLEIEREEKEKLKREAKENTATLKEKKDKKTQTFLL ETPDIYWKLDSKAVPSQTVSRNFTSVDHGISKDKRDYLWTSAKNTLSTPLPKAYTVKT PTKPKLQQRENLNIPIEESKKKRKMAFEFDINSDSSETTDLLSMVSEEETLKTLYRNN NPPASHLCVKTPKKAPSSLTTPGSTLKFGAIRKMREDRWAVIAKMDRKKKLKEAEKLF V" polyA_site 3446 /note="14 A nucleotides" BASE COUNT 1420 a 514 c 684 g 828 t ORIGIN 1 gcgcaggaac ttaagacagt tcctcctggc gatgtgatgg aatttaatgg gacaggagaa 61 gggaacgggc tttcttttca ggccagcgtg gcagcgggcg gtagggcgaa agggagaagg 121 aaacgagggt ttattccgtt gcccactccg cgatatttac aaccgtaaca gagaaaatgg 181 aaaagcaaaa gccctttgca ttgttcgtac caccgagatc aagcagcagt caggtgtctg 241 cggtgaaacc tcagaccctg ggaggcgatt ccactttctt caagagtttc aacaaatgta 301 ctgaagatga ttttgagttt ccatttgcaa agactaatct ctccaaaaat ggggaaaaca 361 ttgattcaga tcctgcttta caaaaagtta atttcttgcc cgtgcttgag caggttggta 421 attctgactg tcactatcag gaaggactaa aagactctga tttggagaat tcagagggat 481 tgagcagagt gtattcaaaa ctgtataagg aggctgaaaa gataaaaaaa tggaaagtaa 541 gtacagaagc tgaactgaga cagaaagaaa gtaagttgca agaaaacaga aagataattg 601 aagcacagcg aaaagccatt caggaactgc aatgtggaaa tgaaaaagta agtttgacat 661 tagaagaagg aatacaagac aataaagatt taataaaaga gaataatgcc acaaggcatt 721 tatgtaatct actcaaagaa acctgtgcta gatctgcaga aaagacaaag aaatatgaat 781 atgaacggga agaaaccagg caagtttata tggatctaaa tagtaacatt gagaaaatga 841 taacagcttt tgaggaactt cgtgtgcaag ctgagaattc cagactggaa atgcatttta 901 agttaaagga agattatgaa aaaatccaac accttgaaca agaatacaag aaggaaataa 961 atgacaagga aaagcaggta tcactactat tgatccaaat cactgagaaa gaaaataaaa 1021 tgaaagattt aacatttctg ctagaggaat ccagagataa agttaatcaa ttagaggaaa 1081 agacaaaatt acagagtgaa aacttaaaac aatcaattga gaaacagcat catttgacta 1141 aagaactaga agatattaaa gtgtcattac aaagaagtgt gagtactcaa aaggctttag 1201 aggaagattt acagatagca acaaacacaa tttgtcagct aactgaagaa aaagacactc 1261 aaatggaaga atctaataaa gctagagctg ctcattcgtt tgtggttact gaatttgaaa 1321 ctactgtctg cagcttggaa gaattattga gaacagaaca gcaaagattg gaaaattatg 1381 aagatcaatt gataatactt accatggagc ttcaaaagac atcaagtgag ctggaagaga 1441 tgactaagct tacaaataac aaagaagtag aacttgaaga attgaaaaaa gtcttgggag 1501 aaaaggaaac acttttatat gacaataaac aatttgagaa gattgctgaa gaattaaaag 1561 gaacagaaca agaactaatt ggtcttctcc aagccagaga gaaagaagta catgatttgg 1621 aatacagtta ctgccattac cacaagtgga cagtattacc caaaagaggt caaagaccaa 1681 aactgagctc gaaacgagaa ctcaagaata ctgaatactt cacactgcaa caagcttcac 1741 ccccccccaa cgagctcaca caggaaacaa gtgatatgac cctagaactc aagaatcagc 1801 aagaagatat aattaataac aaaaagcaag aagaaaggat gttgacacaa atagaaaatc 1861 ttcaagaaac agaaacccaa ttaagaaatg aactagaata tgtgagagaa gagctaaaac 1921 agaaaagaga tgaagttaaa tgtaaattgg acaagagtga agaaaattgt aacaatttaa 1981 ggaaacaagt tgaaaataaa aacaagtata ttgaagaact tcagcaggag aataaggcct 2041 tgaaaaaaaa aggtacagca gaaagcaagc aactgaatgt ttatgagata aaggtcaata 2101 aattagagtt agaactagaa agtgccaaac agaaatttgg agaaatcaca gacacctatc 2161 agaaagaaat tgaggacaaa aagatatcag aagaaaatct tttggaagag gttgagaaag 2221 caaaagtaat agctgatgaa gcagtaaaat tacagaaaga aattgataag cgatgtcaac 2281 ataaaatagc tgaaatggta gcacttatgg aaaaacataa gcaccaatat gataagatca 2341 ttgaagaaag agactcagaa ttaggacttt ataagagcaa agaacaagaa cagtcatcac 2401 tgagagcatc tttggagatt gaactatcca atctcaaagc tgaacttttg tctgttaaga 2461 agcaacttga aatagaaaga gaagagaagg aaaaactcaa aagagaggca aaagaaaaca 2521 cagctactct taaagaaaaa aaagacaaga aaacacaaac atttttattg gaaacacctg 2581 acatttattg gaaattggat tctaaagcag ttccttcaca aactgtatct cgaaatttca 2641 catcagttga tcatggcata tccaaagata aaagagacta tctgtggaca tctgccaaaa 2701 atactttatc tacaccattg ccaaaggcat atacagtgaa gacaccaaca aaaccaaaac 2761 tacagcaaag agaaaacttg aatataccca ttgaagaaag taaaaaaaag agaaaaatgg 2821 cctttgaatt tgatattaat tcagatagtt cagaaactac tgatcttttg agcatggttt 2881 cagaagaaga gacattgaaa acactgtata ggaacaataa tccaccagct tctcatcttt 2941 gtgtcaaaac accaaaaaag gccccttcat ctctaacaac ccctggatct acactgaagt 3001 ttggagctat aagaaaaatg cgggaggacc gttgggctgt aattgctaaa atggatagaa 3061 aaaaaaaact aaaagaagct gaaaagttat ttgtttaatt tcagagaatc agtgtagtta 3121 aggagcctaa taacgtgaaa cttatagtta atattttgtt cttatttgcc agagccaaat 3181 tttatctgga agttgagact taaaaaatac ttgcatgaat gatttgtgtt tctttatatt 3241 tttagcctaa atgttaacta catattgtct ggaaacctgt cattgtattc agataattag 3301 atgattatat attgttgtta ctttttcttg tattcatgaa aactgttttt actaagtttt 3361 caaatttgta aagttagcct ttgaatgcta agaatgcatt attgagggtc attctttatt 3421 ctttactatt aaaatatttt ggatgc // LOCUS D76444 3423 bp mRNA PRI 22-APR-1997 DEFINITION Human hkf-1 mRNA, complete cds. ACCESSION D76444 NID g1945614 KEYWORDS hkf-1. SOURCE Homo sapiens Adult brain cDNA to mRNA, clone_lib:lambda gt10 clone:H361. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3423) AUTHORS Yasojima,K. TITLE Direct Submission JOURNAL Submitted (18-OCT-1995) to the DDBJ/EMBL/GenBank databases. Koji Yasojima, Kyoto Pref. Univ. of Medicine, Biochem. & Mol. Genet.; Kawaramachi-Hirokoji, Kamikyo-ku, Kyoto, Kyoto 602, Japan (E-mail:yaso@koto.kpu-m.ac.jp, Tel:075-251-5850, Fax:075-251-5799) REFERENCE 2 (bases 1 to 3423) AUTHORS Yasojima,K.K. TITLE Analysis of gene expression specific to brain internal tissues JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Yasojima,K., Tsujimura,A., Mizuno,T., Shigeyoshi,Y., Inazawa,J., Kikuno,R., Kuma,K., Ohkubo,K., Hosokawa,Y., Ibata,Y., Abe,T., Miyata,T., Matsubara,K., Nakajima,K. and Hashimoto-Gotoh,T. TITLE Cloning of human and mouse cDNAs encoding novel zinc finger proteins expressed in cerebellum and hippocampus JOURNAL Biochem. Biophys. Res. Commun. 231 (2), 481-487 (1997) MEDLINE 97223484 COMMENT Sequence updated (22-Apr-1997). FEATURES Location/Qualifiers source 1..3423 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="H361" /clone_lib="lambda gt10" /dev_stage="Adult" /tissue_type="brain" mRNA 1..3423 gene 923..2980 /gene="hkf-1" CDS 923..2980 /gene="hkf-1" /codon_start=1 /db_xref="PID:d1020519" /db_xref="PID:g1945615" /translation="MWLKLFFLLLYFLVLFVLARFFEAIVWYETGIFATQLVDPVALS FKKLKTILECRGLGYSGLPEKKDVRELVEKSGDLMEGELYSALKEEEASESVSSTNFS GEMHFYELVEDTKDGIWLVQVIANDRSPLVGKIHWEKMVKKVSRFGIRTGTFNCSSDP RYCRRRGWVRSTLIMSVPQTSTSKGKVMLKEYSGRKIEVEHIFKWITAHAASRIKTIY NAEHLKEEWNKSDQYWLKIYLFANLDQPPAFFSALSIKFTGRVEFIFVNVENWDNKSY MTDIGIYNMPSYILRTPEGIYRYGNHTGEFISLQAMDSFLRSLQPEVNDLFVLSLVLV NLMAWMDLFITQGATIKRFVVLISTLGTYNSLLIISWLPVLGFLQLPYLDSFYEYSLK LLRYSNTTTLASWVRADWMFYSSHPALFLSTYLGHGLLIDYFEKKRRRNNNNDEVNAN NLEWLSSLWDWYTSYLFHPIASFQNFPVESDWDEDPDLFLERLAFPDLWLHPLIPTDY IKNLPMWRFKCLGVQSEEEMSEGSQDTENDSESENTDTLSSEKEVFEDKQSVLHNSPG TASHCDAEACSCANKYCQTSPCERKGRSYGSYNTNEDMEPDWLTWPADMLHCTECVVC LENFENGCLLMGLPCGHVFHQNCIVMWLAGGRHCCPVCRWPSYKKKQPYAQHQPLSND VPS" BASE COUNT 811 a 808 c 867 g 937 t ORIGIN 1 acggccgcgg tctccggagg tggcgggggt gttggggacg ggtgctgcga ccggcactgc 61 ccatccgagc gggacgggcg ctgagtggcc gggaggagcc gggtagccgc ctggaggagc 121 agtctcgggg cctattattg gttttttccc tccgagaccg attccatctg cagagaccgc 181 gacgccccat cctgggccgg gccgcagtgc cgcccgcctg agaggcgccg cccgccagcc 241 ggcccgagcg aacctggagc cgccgccgtg cccgccgctc ttctcgcgga gcctgggcgg 301 tcggcggggc ctggggcctg ggcttcgggc gcggcgttgc ggcggccgct cctccccccg 361 cagaacacgc tgggccccgg gcctggcccg gccgagcgcc gcgccctcct gacccgcggc 421 cgcggagtcc ggcccccacg gcccctcggg ccccggcctg ccgcccggat ccccgcctcc 481 tgggcggatc tgagttattt tttggtctcc ccctccccct tgagatcgcg gcaccggagg 541 gccgaccccg ccacctgggt cagtgcccgc cccggggaag cgtccctcgt tttgttcttc 601 cccgcgagct ctcccgcgcg cccctctcct ttctgtgctt aatggatggg gttttgagtt 661 tttccccttt atttttgtcg gctcttgact gggaggcccg gcgcgaggct ctgcgtctct 721 gcgtccctcg tcgccgcctc gcgcccgcgc ggatcccgtc acctgcttcc cgccggggat 781 ggccggccag tgacggcgcc gggtggcccg cgcgtggaca cggggccctc gcccaagctg 841 ccaccgagcc gcggcccccg ccctcgaccc ttctctctcc cgtattccga gctctctgga 901 aagagaggag actccaggga agatgtggct gaagcttttt ttcttgctcc tctatttcct 961 ggtcctgttc gtcctggcca ggttttttga ggccattgtg tggtatgaaa ctggcatctt 1021 tgccacccag ctggtggatc cggtggcgct gagcttcaag aagctgaaga ccattttgga 1081 gtgccggggg ttgggctact cagggttgcc cgagaagaag gatgtccggg agctggtgga 1141 aaagtcaggt gacttgatgg agggtgagct ctattctgct ctcaaggaag aagaagcatc 1201 cgaatcggtt tctagtacca atttcagtgg tgaaatgcac ttctatgagc ttgtggaaga 1261 cacaaaagat ggcatctggc tggttcaggt catagcaaat gacagaagtc ccttggtggg 1321 caaaattcac tgggagaaaa tggttaaaaa ggtgtcaaga tttggaatac gtacaggcac 1381 atttaactgt tccagtgatc ccagatattg caggagaaga ggctgggtcc gatccacact 1441 cattatgtct gttccacaaa caagtacttc aaaagggaaa gtcatgctta aagaatacag 1501 tggacgcaag attgaagtag agcacatttt taaatggata actgctcatg cagcttctcg 1561 gatcaaaacc atttataatg ctgaacactt gaaagaagaa tggaataaaa gtgatcagta 1621 ttggttaaaa atatacctat ttgcaaacct tgaccagccc ccagctttct tctctgcact 1681 aagtataaag tttactggaa gagttgagtt tatttttgtt aatgtagaaa attgggacaa 1741 caagagttat atgacagata ttggcatata taatatgcca tcatacatac ttagaactcc 1801 tgaaggaatt tacaggtatg gaaaccacac aggcgaattt atatcccttc aggccatgga 1861 ttcatttttg cgctcattac aacccgaggt aaatgatctg tttgttttga gcttggttct 1921 agttaatctt atggcttgga tggacttatt tattacacaa ggagctacca taaagcgatt 1981 tgtggttctc ataagcactt tagggacata taattctcta ttaattattt cctggctacc 2041 tgtgttgggc tttttacagc taccttactt agatagcttt tatgaatata gcttaaaatt 2101 gttgagatat tccaatacaa ccacactggc ttcatgggta agggcagact ggatgtttta 2161 ctcttcacac ccagccctgt ttctcagtac ataccttggt catggtttac taattgatta 2221 ctttgagaag aagagaaggc gcaacaacaa caatgatgaa gtcaatgcca ataacttaga 2281 atggttatca agtctgtggg actggtacac cagctacctc ttccacccga ttgcttcttt 2341 tcagaacttt cctgtagaat ctgattggga cgaagaccct gacttattct tggagcgctt 2401 agctttccct gacctttggc ttcaccctct gataccaact gattatatta aaaacttacc 2461 aatgtggcga tttaaatgtc ttggagtcca gtctgaagag gaaatgtcgg aggggtctca 2521 agatactgaa aatgactcgg aaagtgagaa cacagacact ttgagtagtg agaaggaagt 2581 atttgaagat aagcaaagcg tacttcacaa ttctccagga acagcaagtc actgtgatgc 2641 tgaggcttgt tcatgtgcca ataaatattg tcagaccagc ccatgtgaaa ggaaggggag 2701 gtcatatgga tcatataaca ctaatgaaga tatggaacct gattggttaa cttggcctgc 2761 tgatatgctg cactgtactg aatgtgttgt ttgcctagag aattttgaaa atggatgttt 2821 gctaatgggg ttgccttgtg gtcatgtgtt tcatcagaat tgcattgtga tgtggttggc 2881 tgggggccga cattgttgcc ctgtttgccg gtggccttct tataaaaaaa agcagccata 2941 tgcacaacac cagcccttgt caaatgatgt cccatcttaa ccatgtgcaa tttgtccttt 3001 ataagctttg agtatcttac agcttgcctt tttaatgtta gtcacaatgt ttttgtggtt 3061 tgaagtttag tttaatgtta gtgcagtgac gggaaataca cattatgcta atgttgatga 3121 cagaatttat ttggttgcct tgtgtgttaa ttgaatgcat acctaattgt aaaatttttt 3181 tatttacaac attggaaatt cagaagttaa tgtttttttg taagcacaaa agaagtatta 3241 tagaaattta tcctagcaag actttacaag ataggatcaa attctaatgg aattgagccg 3301 gtttcttatc ctaaatgttt cctccctttt tacaatctct gtccagcacc tcttggttaa 3361 ataatgtatg ctgtgagaca tgaaattaaa acagacctat gaaataaatt attttaaaac 3421 cag // LOCUS D78011 2123 bp mRNA PRI 20-AUG-1997 DEFINITION Homo sapiens mRNA for dihydropyrimidinase, complete cds. ACCESSION D78011 NID g2339965 KEYWORDS dihydropyrimidinase; unc-33. SOURCE Homo sapiens adult liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2123) AUTHORS Hamajima,N., Matsuda,K., Sakata,S., Tamaki,N., Sasaki,M. and Nonaka,M. TITLE Direct Submission JOURNAL Submitted (20-OCT-1995) to the DDBJ/EMBL/GenBank databases. Naoki Hamajima, Nagoya City University Medical School, Department of Pediatrics; 1 Kawasumi, Mizuho-cho, Mizuho-ku, Nagoya 467, Japan (E-mail:hamajima@med.nagoya-cu.ac.jp, Tel:+81-52-853-8246, Fax:+81-52-842-3449) REFERENCE 2 (sites) AUTHORS Hamajima,N., Matsuda,K., Sakata,S., Tamaki,N., Sasaki,M. and Nonaka,M. TITLE A novel gene family defined by human dihydropyrimidinase and three related proteins with differential tissue distribution JOURNAL Gene 180 (1-2), 157-163 (1996) MEDLINE 97128821 COMMENT Sequence updated (07-Jun-1997) Sequence updated (19-Aug-1997). FEATURES Location/Qualifiers source 1..2123 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="liver" CDS 130..1689 /codon_start=1 /product="dihydropyrimidinase" /db_xref="PID:d1011851" /db_xref="PID:g1330236" /translation="MAAPSRLLIRGGRVVNDDFSEVADVLVEDGVVRALGHDLLPPGG APAGLRVLDAAGKLVLPGGIDTHTHMQFPFMGSRSIDDFHQGTKAALSGGTTMIIDFA IPQKGGSLIEAFETWRSWADPKVCCDYSLHVAVTWWSDQVKEEMKILVQDKGVNSFKM FMAYKDLYMVTDLELYEAFSRCKEIGAIAQVHAENGDLIAEGAKKMLALGITGPEGHE LCRPEAVEAEATLRAITIASAVNCPLYIVHVMSKSAAKVIADARRDGKVVYGEPIAAS LGTDGTHYWNKEWHHAAHHVMGPPLRPDPSTPDFLMNLLANDDLTTTGTDNCTFNTCQ KALGKDDFTKIPNGVNGVEDRMSVIWEKGVHSGKMDENRFVAVTSTNAAKIFNLYPRK GRIAVGSDADIVIWDPKGTRTISAKTHHQAVNFNIFEGMVCHGVPLVTISRGKVVYEA GVFSVTAGDGKFIPRKPFAEYIYKRIKQRDRTCTPTPVERAPYKGEVATLKSRVTKED ATAGTRKQAHP" BASE COUNT 558 a 506 c 552 g 507 t ORIGIN 1 gcagcctgag gcagagctcg ggggctgtcg gtggggacct tgcaggaggg caccccaagc 61 ccgcccggcc cgcccaaccc agcccctgcg cgcagcccgg gccgagtagg accccgcgcg 121 cccctcgcta tggcggcgcc ctcgcggctc ctgatccgcg ggggtcgcgt ggtcaacgat 181 gacttctcgg aggtggccga cgtgctggtg gaggacggcg tggtgcgggc actcgggcac 241 gacctgctgc ctcccggggg cgctcctgcg gggctgcggg tcctcgacgc cgccggcaag 301 ctcgtcctgc ccggaggcat cgacacacac acgcacatgc agttcccctt catgggctcg 361 cggtccatcg acgacttcca ccagggcacc aaggctgctc tctcaggagg caccaccatg 421 attattgatt tcgccattcc tcagaaaggt ggctccctca ttgaggcctt cgagacctgg 481 cgaagctggg ctgatcccaa agtttgctgc gactacagcc ttcatgtggc agtgacgtgg 541 tggagtgacc aggttaaaga agaaatgaaa atccttgtgc aagataaagg tgttaactct 601 ttcaagatgt ttatggccta taaagatctg tacatggtga cagacctgga gctgtacgaa 661 gccttctctc ggtgcaagga aattggagca attgcccagg tccatgcgga aaatggagac 721 ttaattgcag agggagcaaa gaagatgttg gctctgggga taacaggccc tgagggccac 781 gagctgtgcc gcccagaggc agtggaggca gaggccacgc tgagagccat caccatagcc 841 agcgctgtga actgtcctct ctacattgtg catgtgatga gcaagtctgc agctaaggtg 901 atagcggatg caaggagaga tgggaaggtg gtctatggtg aacccatagc agccagtctt 961 ggcacagatg gcactcacta ctggaataaa gaatggcacc atgcagccca ccatgtcatg 1021 ggtccacctt tgcgaccaga cccctcaaca cccgacttcc tcatgaatct gttggctaat 1081 gatgatctaa ccacaacagg gactgataac tgcactttca acacctgcca gaaagctctt 1141 gggaaggatg attttaccaa gatccccaat ggggtgaatg gtgttgaaga tcggatgtcc 1201 gtaatatggg aaaaaggcgt gcatagtggt aaaatggatg aaaacagatt tgtggcagtt 1261 accagcacaa atgcagccaa aatttttaat ctctatccaa gaaaaggaag aatagctgta 1321 ggatcagatg ctgacattgt tatttgggac ccaaaaggca caaggactat ctcagcaaaa 1381 actcatcatc aggctgttaa cttcaacatt ttcgagggca tggtttgcca cggggtgccc 1441 cttgtgacta tttcaagagg caaagtggta tatgaagccg gagtgttcag tgtcacggca 1501 ggagatggga agtttattcc tcgaaaacca tttgctgaat atatttacaa acgaataaag 1561 cagcgagacc ggacttgcac acctacccct gtggagcgtg caccctataa gggagaagtc 1621 gccacactga aatccagagt gacaaaagaa gatgccacag cagggaccag gaaacaggcc 1681 cacccctgaa gtgtgtgcca tcggtaaaaa aaatcagagg aaaggaggct gccattccct 1741 tcacagccaa acattgtcaa cccatggaga agcaggcctt attcaactcc ctaggatcct 1801 ttagaaaaaa ttcaccacta taggcttctt tgattttctt tcagaagcaa ttgctgtctt 1861 tctcactgtg tttttgttgc tgcataagat tgaaggtata aatttatatt attgtgatgg 1921 aaaggctgtg tggaaattca ttgatgatac tttaaaatgt catctttgct tgtactagat 1981 ttcttactta gaatttttaa aaatcatttt cttgtttaaa tagtttcttt ttttaaaaaa 2041 atggttacat tagttttaaa atagctctgt gattttactt tttattgtaa ttaataaaca 2101 ttgagatctt cattttatac ctt // LOCUS D78014 5047 bp mRNA PRI 09-JUN-1997 DEFINITION Human mRNA for dihydropyrimidinase related protein-3, complete cds. ACCESSION D78014 NID g1330241 KEYWORDS dihydropyrimidinase related protein-3; unc-33. SOURCE Homo sapiens fetus brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5047) AUTHORS Hamajima,N., Matsuda,K., Sakata,S., Tamaki,N., Sasaki,M. and Nonaka,M. TITLE Direct Submission JOURNAL Submitted (20-OCT-1995) to the DDBJ/EMBL/GenBank databases. Naoki Hamajima, Nagoya City University Medical School, Pediatrics; 1 Kawasumi, Mizuho-cho, Mizuho-ku, Nagoya, Aichi 467, Japan (E-mail:hamajima@med.nagoya-cu.ac.jp, Tel:+81-52-853-8246, Fax:+81-52-842-3449) REFERENCE 2 (sites) AUTHORS Hamajima,N., Matsuda,K., Sakata,S., Tamaki,N., Sasaki,M. and Nonaka,M. TITLE A novel gene family defined by human dihydropyrimidinase and three related proteins with differential tissue distribution JOURNAL Gene 180 (1-2), 157-163 (1996) MEDLINE 97128821 FEATURES Location/Qualifiers source 1..5047 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="brain" CDS 111..1823 /codon_start=1 /product="dihydropyrimidinase related protein-3" /db_xref="PID:d1011854" /db_xref="PID:g1330242" /translation="MSYQGKKNIPRITSDRLLIKGGRIVNDDQSFYADIYMEDGLIKQ IGDNLIVPGGVKTIEANGKMVIPGGIDVHTHFQMPYKGMTTVDDFFQGTKAALAGGTT MIIDHVVPEPESSLTEAYEKWREWADGKSCCDYALHVDITHWNDSVKQEVQNLIKDKG VNSFMVYMAYKDLYQVSNTELYEIFTCLGELGAIAQVHAENGDIIAQEQTRMLEMGIT GPEGHVLSRPEELEAEAVFRAITIASQTNCPLYVTKVMSKSAADLISQARKKGNVVFG EPITASLGIDGTHYWSKNWAKAAAFVTSPPLSPDPTTPDYINSLLASGDLQLSGSAHC TFSTAQKAIGKDNFTAIPEGTNGVEERMSVIWDKAVATGKMDENQFVAVTSTNAAKIF NLYPRKGRISVGSDSDLVIWDPDAVKIVSAKNHQSAAEYNIFEGMELRGAPLVVICQG KIMLEDGNLHVTQGAGRFIPCSPFSDYVYKRIKARRKMADLHAVPRGMYDGPVFDLTT TPKGGTPAGSARGSPTRPNPPVRNLHQSGFSLSGTQVDEGVRSASKRIVAPPGGRSNI TSLS" BASE COUNT 1288 a 1215 c 1269 g 1275 t ORIGIN 1 ccgttgctgt cgccgttgct gtcgggggcg ctgtgcgctg aggaaggcgc gggcgagccg 61 gagcagaaga aggagggagg gagccagccg ctgcagccac caccgccacc atgtcctacc 121 aaggcaagaa gaacatcccg cggatcacga gtgaccgtct ccttatcaag ggaggcagaa 181 tcgtcaatga tgatcagtcc ttttatgctg atatttacat ggaagatggc ttaataaaac 241 aaattggaga caatctgatt gttcctggag gagtgaagac cattgaagcc aatgggaaga 301 tggtgatccc tggaggcatc gatgtccata ctcacttcca gatgccatat aagggaatga 361 ccacagtaga tgacttcttc caagggacaa aggcggcctt agcaggtggc accaccatga 421 tcattgacca tgtggtgcct gagcctgagt ccagcctgac tgaggcctat gagaaatgga 481 gagagtgggc tgatgggaag agttgctgtg actatgccct gcatgtggac atcacccact 541 ggaatgacag cgtcaagcag gaagtgcaga acctcatcaa ggacaaaggg gttaactcct 601 tcatggttta tatggcttat aaggatttgt atcaagtatc taacacagag ctctatgaga 661 tcttcacctg cctgggagag ctgggggcca ttgctcaagt tcatgctgag aatggggata 721 tcattgccca ggagcaaacc cgcatgttgg aaatggggat aactggccca gaaggccatg 781 tactgagcag gccagaagag ctggaagctg aggctgtgtt ccgtgccatc accattgcca 841 gccaaaccaa ttgccctctc tacgtcacaa aggtcatgag caagagtgca gctgacctca 901 tctcacaagc caggaaaaaa ggaaatgtag tctttggtga gcccatcact gccagcctcg 961 gcatagatgg aacccattat tggagcaaga actgggccaa ggcggctgca tttgtgacat 1021 ccccacccct gagccctgac ccaactactc cggactacat caactccttg ctggccagcg 1081 gggatctgca gctatctggg agtgcccact gcaccttcag cactgcccag aaagcaattg 1141 ggaaggacaa cttcacagcc attcctgagg gcaccaatgg tgtggaggag cggatgtctg 1201 tcatctggga caaggctgtg gccacaggga aaatggacga aaaccagttc gtggctgtga 1261 caagcacaaa cgctgccaag atcttcaacc tgtatccccg caagggaaga atatctgtgg 1321 gttctgacag cgacctcgtc atctgggatc cagatgctgt gaagatcgtc tctgccaaga 1381 accaccagtc tgcggcagag tacaacatct ttgaagggat ggagctgcgc ggggctcctc 1441 tggttgtcat ctgccagggc aagatcatgc tggaagatgg caacctgcac gtgacccagg 1501 gggctggccg cttcataccc tgcagcccgt tctccgacta tgtctacaag cgcattaaag 1561 cacggaggaa gatggcagac ctgcatgccg tcccaagggg catgtacgat gggcctgtgt 1621 ttgacctgac caccaccccc aaaggtggca cccccgcagg ctctgctcgg ggctctccta 1681 ctcggccgaa cccacctgtg aggaatcttc atcagtcggg atttagcctg tcaggcaccc 1741 aagtggatga gggggttcgc tcagccagca agcgcatcgt ggccccccca ggcggccgtt 1801 ctaatatcac atctctgagt taagcaagcc ttcctcaaag agaggggcag aagcaagaag 1861 agattgtttt gaagccaaaa tggtacaccg atatttaaga aggaaagcga atccaaacgg 1921 ttgtgatcta aagaatcaat aagcctcaag ccttatgttt ctccaatgtt acgctcgctt 1981 gcctagcttt acgaatattg ctttgttttc tgtttatgca tagccttgat ttgtttgact 2041 cccctccccc catttacatg catgcaatca gacaggccac taaggtaaaa gagtctgctc 2101 tatcatagtg ttgagagcgt gtgtagtgct gcatcttatg acaaggggac agacaagctg 2161 ggacgtcagg gaaatgaaca aaagggacgc aggttatttg gggtgagtgg gtggtgggag 2221 cctggagcaa ggtggagggt gcagaggggc tggggtaggg catgtaggag ggaggtgggt 2281 gggtcaggtg agtggaaggg gtgttgtata ttgtgttgat gacgtacgtt atttccatgg 2341 aagatagccg ctgtggcagc tgtcacatca ccacagctcc ctagggtctg ccgagaaggc 2401 aggcagtctt tgggttctgt tctttgtcac gtcccctaca agtaaatttt gtttctttga 2461 acgtttatta aaatgccaag acccaaccat ttcttccacc tgcttgattg tgccagtgtt 2521 tgctcaggcc tctttcttag tgttgctttc aaatccttct ctttcctggg ttgggaaggc 2581 caggcaggga cagagcaaat gacacttctc ttcctcttgc cctccctgcc tctttggtgc 2641 tcttaaaagc cagcagctga gaacatagca caggcccacg tggtgagggc acccacagct 2701 taaagacgct tccttctaaa cacggcgagg tcacctctca ctcttctgtc tttgcaaacc 2761 gagaagagtg gcatgcttct ggcatcccaa gtcaggattt tagctcagat gaggcagaat 2821 gaagggcctc tcttacaggc agtttgtgtt tgattctctc gatcctggca catccatgat 2881 aaataggagt ttttgaaagt tggttttatt aggtgttccc taatttttac cgtaataggt 2941 catctcagct tatatgaaag tcaagtgggg aactgggaaa gccaaagtca gtcttgagca 3001 gagggagcac attttgtgga cctggttcca cctttccatt ccaaaccacc tgtttcccct 3061 tccattagca gaaactctgg gggaactttg tgtctcagtc ctagaatctc cccaagtgag 3121 tggaagtgac atgatgcagt cttcctcatg gggcacctga aagaaattag tgtgggtgct 3181 tcgatctacc ttgtctgtca gagttgaata tctctttccc tatcatgctg cttctgaaaa 3241 ttcagttttg gagcaagtcc tgtgagcaag ataagaatct atagaaccaa gatgctcatt 3301 ttcagaagaa atatgttcaa cctgggatca gacttccatg ctctggggaa tccaagtggt 3361 agcacctgta accctgtgta ctaagtgctt tgaagagaag agcaggcctc agacaccttt 3421 taattgctta ggagaaacca ttgtctctga ctgcaggttt gaataagttg aagaccagag 3481 aaaagtacac actgggctac aaaggaattt ggagatagcc aaggaacagg atttccccta 3541 gcaagctacc ttctgttcaa atcatgaaaa aagactattt ccccttagaa tagggaagct 3601 tgctatttta aagctcttgt agtgcttttc ttttaaggga gatgtagtaa aagggaaaat 3661 gtagctctta gtttacactt caaagatgtg ggggtctttc agagaactaa gaataacagt 3721 tttatgtgca gagagagttt gccagatctg aagcatatac ctcattgact aggctgttac 3781 tttgggatag gttgcagtac cagccacagc cagcagatag aggaaaagac acacataaac 3841 tcgcttctga gcgtccactt ctgcactctc tgctctgctg ttactcagcc cctgagtctg 3901 actcatctct gcacaacctc tctgtgccat gaagataagt cttccatggc caaatcggtc 3961 atccgcactg cccttgggac ttccgaagtg aaccattcca ccagaacctt tgattctgca 4021 caagatttcc ttgctctggg aacaaccccc aaatgccctt gggaggaaca acatgagctc 4081 aggaagcctc tctttcttca cttaccatta ctaactctcc aagcatagaa atccctggga 4141 attgcgagaa taactcccac tattttaaaa tttatattca gatttgtttc gtttcataag 4201 acacatcaaa caggcctata caaaaggttt aggaaaagaa aacaatggtg agtcccggcc 4261 ctcttcgaat tcactggcac ctcatgcaag tgtaggaagg cacgctggat cgtctatctg 4321 attccaaagc tgtcctttgc catctcatcc cttggcctgc cccccaaccc tgaggatgcc 4381 cctgccatcc ccccaacctc ctcatattgc ctctgaaccc agatggcaat ccatcccggt 4441 tctctctgag ggccacgggc ttgggtagtg gaaagggtgt ttgggaaatt gttaaatcag 4501 ttacccgtag tagagctatt tcttgtactt ctaagttttc tagaagtgga aggattgtag 4561 tcatcctgaa aatgggttta cttcaaaatc cctcagcctt gttcttcacg actgtctata 4621 ctgagagtgt catgtttcca caaagggctg acacctgagc ctggattttc actcatccct 4681 gagaagccct ttccagtagg gtgggcaatt cccaacttcc ttgccacaag cttcccaggc 4741 tttctcccct ggaaaactcc agcttgagtc ccagatacac tcatgggctg ccctgggcag 4801 ccagcattca ttgtaagttc cctctttgaa aactggtgtg tgggtgttca gttctgtgtc 4861 tggtgggtat ggacagacag taatctcctg tgatctgtgc tagctgtgag gcagctctgg 4921 aacgtgaaga gctgtttggt ttgaaccgtg aacaaaactg tgttttgagt ttagctgaca 4981 ttaaagaaaa aagttcatca cgtgactgtt aatgtaaacc tggttattaa aataactatg 5041 aaattac // LOCUS D78130 2326 bp mRNA PRI 06-NOV-1997 DEFINITION Homo sapiens mRNA for squalene epoxidase, complete cds. ACCESSION D78130 NID g2443315 KEYWORDS squalene epoxidase. SOURCE Homo sapiens (strain:HeLa) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Nagai,M., Sakakibara,J., Wakui,K., Fukushima,Y., Igarashi,S., Tsuji,S., Arakawa,M. and Ono,T. TITLE Localization of the squalene epoxidase gene (SQLE) to human chromosome region 8q24.1 JOURNAL Genomics 44 (1), 141-143 (1997) MEDLINE 97432831 REFERENCE 2 (bases 1 to 2326) AUTHORS Sakakibara,J., Kizawa,Y., Kanai,Y., Nagai,M. and Ono,T. TITLE Direct Submission JOURNAL Submitted (12-OCT-1995) to the DDBJ/EMBL/GenBank databases. Jun Sakakibara, Niigata University School of Medicine, Department of Biochemistry; 757, Asahimachi-dori 1, Niigata, Niigata 951, Japan (E-mail:juns@med.niigata-u.ac.jp, Tel:025-223-6161(ex.2262), Fax:025-222-4599) FEATURES Location/Qualifiers source 1..2326 /organism="Homo sapiens" /strain="HeLa" /db_xref="taxon:9606" CDS 264..1988 /EC_number="1.14.99.7" /codon_start=1 /product="squalene epoxidase" /db_xref="PID:g2443316" /translation="MWTFLGIATFTYFYKKFGDFITLANREVLLCVLVFLSLGLVLSY RCRHRNGGLLGRQRSGSQFALFSDILSGLPFIGFFWAKSPPESENKEQLGARRRRKGT NISETSLIGTAACTSTSSQNDPEVIIVGAGVLGSALVAVLSRDGRKVTVIERDLKEPD RIVGEFLQPGGYHVLKDLGLGDTVEGLDAQVVNGYMIHDQESKSEVQIPYPLSENNQV QSGRAFHHGRFIMSLRKAAMAEPNAKFIEGVVLQLLEEDDVVMGVQYKDKETGDIKEL HAPLTVVADGLFSKFRKSLVSNKVSVSSHFVGFLMKNAPQFKANHAELILANPSPVLI YRISSSETRVLVDIRGEMPRNLREYMVEKIYPQIPDHLKEPFLEATDNSHLRSMLASF LPPSSVKKRGVLLLGDAYNMRHPLTGGGMTVAFKDIKLWRKLLKGIPDLYDDAAIFEA NKSFYWARKTSHSFVVNILAQALYELFSATDDSLHQLRKACFLYFKLGGECVAGPVGL LSVLSPNPLALIGHFFAVAIYAVYFCFKSEPWITKPRALLSSSAVLYKACSVIFPLIY SEMKYMVH" BASE COUNT 638 a 476 c 518 g 694 t ORIGIN 1 ccatcctaat acgactcact atagggctcg agcggccgcc cgggcaggta cgccctatac 61 aacttggctt cacatacttt tacactaact ttatatgatt tttaaaaact ggtctgatcg 121 gacttctcgt cctgggacac tgtttactgg agtctggccg gctctccgtg ctcctcttgg 181 tacctcattt tggggagaac cttaaaccca ctcgagcaga taatctccgc cttgaccggt 241 gccaccaaag aagggttgga accatgtgga cttttctggg cattgccact ttcacctatt 301 tttataagaa gttcggggac ttcatcactt tggccaacag ggaggtcctg ttgtgcgtgc 361 tggtgttcct ctcgctgggc ctggtgctct cctaccgctg tcgccaccga aacgggggtc 421 tcctcgggcg ccagcggagc ggctcccagt tcgccctctt ctcggatatt ctctcaggcc 481 tgcctttcat tggcttcttc tgggccaaat ccccccctga atcagaaaat aaggagcagc 541 tcggggccag gaggcgcaga aaaggaacca atatttcaga aacaagctta ataggaacag 601 ctgcctgtac atcaacatct tctcagaatg acccagaagt tatcatcgtg ggagctggcg 661 tgcttggctc tgctttggta gctgtgcttt ccagagatgg aagaaaggtg acagtcattg 721 agagagactt aaaagagcct gacagaatag ttggagaatt cctgcagccg ggtggttatc 781 atgttctcaa agaccttggt cttggagata cagtggaagg tcttgatgcc caggttgtaa 841 atggttacat gattcatgat caggaaagca aatcagaggt tcagattcct taccctctgt 901 cagaaaacaa tcaagtgcag agtggaagag ctttccatca cggaagattc atcatgagtc 961 tccggaaagc agctatggca gagcccaatg caaagtttat tgaaggtgtt gtgttacagt 1021 tattagagga agatgatgtt gtgatgggag ttcagtacaa ggataaagag actggagata 1081 tcaaggaact ccatgctcca ctgactgttg ttgcagatgg gcttttctcc aagttcagga 1141 aaagcctggt ctccaataaa gtttctgtat catctcattt tgttggcttt cttatgaaga 1201 atgcaccaca gtttaaagca aatcatgctg aacttatttt agctaacccg agtccagttc 1261 tcatctaccg gatttcatcc agtgaaactc gagtacttgt tgacattaga ggagaaatgc 1321 caaggaattt aagagaatac atggttgaaa aaatttaccc acaaatacct gatcacctga 1381 aagaaccatt cttagaagcc actgacaatt ctcatctgag gtccatgcta gcaagcttcc 1441 ttcctccttc atcagtgaag aaacgaggtg ttcttctttt gggagacgca tataatatga 1501 ggcatccact tactggtgga ggaatgactg ttgcttttaa agatataaaa ctatggagaa 1561 aactgctaaa gggtatccct gacctttatg atgatgcagc tattttcgag gccaacaaat 1621 cattttactg ggcaagaaaa acatctcatt cctttgtcgt gaatatcctt gctcaggctc 1681 tttatgaatt attttctgcc acagatgatt ccctgcatca actaagaaaa gcctgttttc 1741 tttatttcaa acttggtggc gaatgtgttg cgggtcctgt tgggctgctt tctgtattgt 1801 ctcctaaccc tctagcttta attggacact tctttgctgt tgcaatctat gccgtgtatt 1861 tttgctttaa gtcagaacct tggattacaa aacctcgagc ccttctcagt agtagtgctg 1921 tattgtacaa agcgtgttct gtaatatttc ctctaattta ctcagaaatg aagtatatgg 1981 ttcattaagc ttaaagggga accatttgtg aatgaatatt tggaacttac caagtcctaa 2041 gagacttttg gaagaggata tatatagcat agtaccatac cacttataaa gtggaaactc 2101 ttggaccaag atttggatta atttgttttt gaagtttttt gtatataaat atgtaaatac 2161 atgctttaat ttgcaattta aaatgaaggg gttaaataag ttagacattt gaaagaaatg 2221 attgttacca taaattagtg ctaatgctga ggagaactac agtttttctt ttgaatttag 2281 tatttgagat gagttgttgg gacatgcaaa taaaatgaag aatgac // LOCUS D78334 1067 bp mRNA PRI 05-NOV-1996 DEFINITION Human mRNA for ankyrin motif, complete cds. ACCESSION D78334 NID g1655417 KEYWORDS ankyrin motif. SOURCE Homo sapiens adult testis cDNA to mRNA, clone:TSA806. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ozaki,K., Kuroki,T., Hayashi,S. and Nakamura,Y. TITLE Isolation of three testis-specific genes (TSA303, TSA806, TSA903) by a differential mRNA display method JOURNAL Genomics 36 (2), 316-319 (1996) MEDLINE 96411689 REFERENCE 2 (bases 1 to 1067) AUTHORS Ozaki,K., Kuroki,T., Hayashi,S. and Nakamura,Y. JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 1067) AUTHORS Nakamura,Y. TITLE Direct Submission JOURNAL Submitted (16-NOV-1995) to the DDBJ/EMBL/GenBank databases. Yusuke Nakamura, Institute of Medical Science,The University of Tokyo, Laboratory of Molecular Medicine; 4-6-1 Sirokanedai, Minato-ku, Tokyo 108, Japan (E-mail:yusuke@ims.u-tokyo.ac.jp, Tel:03-5449-5372, Fax:03-5449-5433) FEATURES Location/Qualifiers source 1..1067 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="TSA806" /dev_stage="adult" /tissue_type="testis" CDS 239..694 /codon_start=1 /product="ankyrin motif" /db_xref="PID:d1012011" /db_xref="PID:g1655418" /translation="MRIVPTILLNFGADPDLRDIRYNTVLHYAVCGQSLSLVEKLLEY EADLEAKNKDGYTPLLVAVINNNPKMVKFLLEKGADVNASDNYQRTALILAVSGEPPC LVKLLLQQGVELCYEGIVDSQLRNMFISMVLLHRYPQFTASHGKKKHAK" polyA_signal 1047..1052 BASE COUNT 371 a 160 c 208 g 328 t ORIGIN 1 attttaaaga aacttcacag agctgcttca gtcggggatt tgaagaagct gaaggaatac 61 cttcagatca agaaatatga tgtaaatatg caggacaaaa aatacagaac acctttgcac 121 ctagcctgtg ctaatggaca tacagatgtt gtacttttcc taattgagca acaatgcaaa 181 ataaatgtcc gggatagtga aaacaaatcc ccattgatta aggcagtaca gtgtcaaaat 241 gaggattgtg cctactattc ttctaaactt tggtgcagac ccagatctga gggatattcg 301 ttataatact gttcttcact atgctgtttg tggtcaaagt ttgtcattag ttgaaaaact 361 gcttgaatac gaagctgatc ttgaagcgaa aaataaggat gggtatactc cactattagt 421 tgccgttatt aacaataatc caaaaatggt aaaatttctt ctggagaaag gggctgatgt 481 gaatgcttca gataattatc aaagaacagc ccttattctt gctgtcagtg gtgaaccacc 541 atgtttagta aagcttcttc ttcagcaagg tgtggaatta tgttacgaag gtattgtgga 601 ttcacagctg aggaatatgt ttatttccat ggttttactg catagatacc cacaattcac 661 tgcgagccat ggaaagaaga aacatgctaa atagacacct tattcttggc actacatgtg 721 actaaaggaa gatatggaac ccatttctac aatttctttg ccgcttcctt gaattggaaa 781 aatgtacttt gaaagaaccg gttaagtgaa ctatgataat atttttgctg actacccagt 841 tgaagaaaaa gtttcgttaa ttggatggga tttttttttt tcacgttaga agaatgaatg 901 aagaaatttt aaaagataaa cattatattg tgaaccatca gctgaaaaga taaatttgtg 961 ttcaatatat aggagaaaaa atttgtgtca aaatgttgaa tggaataata atgagaaact 1021 gtgttaggca tgtattaaaa catttaaata aaataaaaat acatttc // LOCUS D78335 831 bp mRNA PRI 05-NOV-1996 DEFINITION Human mRNA for 5'-terminal region of UMK, complete cds. ACCESSION D78335 NID g1655419 KEYWORDS 5'-terminal region of UMK. SOURCE Homo sapiens adult testis cDNA to mRNA, clone:TSA903. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ozaki,K., Kuroki,T., Hayashi,S. and Nakamura,Y. TITLE Isolation of three testis-specific genes (TSA303, TSA806, TSA903) by a differential mRNA display method JOURNAL Genomics 36 (2), 316-319 (1996) MEDLINE 96411689 REFERENCE 2 (bases 1 to 831) AUTHORS Ozaki,K., Kuroki,T., Hayashi,S. and Nakamura,Y. JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 831) AUTHORS Nakamura,Y. TITLE Direct Submission JOURNAL Submitted (16-NOV-1995) to the DDBJ/EMBL/GenBank databases. Yusuke Nakamura, Institute of Medical Science,The University of Tokyo, Laboratory of Molecular Medicine; 4-6-1 Sirokanedai, Minato-ku, Tokyo 108, Japan (E-mail:yusuke@ims.u-tokyo.ac.jp, Tel:03-5449-5372, Fax:03-5449-5433) FEATURES Location/Qualifiers source 1..831 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="TSA903" /dev_stage="adult" /tissue_type="testis" CDS 219..554 /codon_start=1 /product="5'-terminal region of UMK" /db_xref="PID:d1012012" /db_xref="PID:g1655420" /translation="MKLFVDTDADTRLSRRVLRDISERGRDLEQILSQYITFVKPAFE EFCLPTKQYADVIIPRGADNLVAINLIEQHIQDILNGGPSKRQTNGCLNGYTPSRKRQ ASESSSRPH" polyA_signal 813..818 BASE COUNT 244 a 197 c 200 g 190 t ORIGIN 1 gttgacaaga gacattccag cccaccactt cccaagtaaa gaattaaaat gcagcatgat 61 ggctaaggca agggcctgca gaagaatgta aaggagggag gaagagcagg ggattcagag 121 caggaaggag gagacagtac tgtctatccc gcagacgtgg tgctctttga agggatcctg 181 gccttctact cccaggaaag gtacgagacc tgttccagat gaagcttttt gtggatacag 241 atgcggacac ccggctctca cgcagagtat taagggacat cagcgagaga ggcagggatc 301 ttgagcagat tttatctcag tacattacgt tcgtcaagcc tgcctttgag gaattctgct 361 tgccaacaaa gcagtatgct gatgtgatca tccctagagg tgcagataat ctggtggcca 421 tcaacctcat cgagcagcac atccaggaca tcctgaatgg agggccctcc aaacggcaga 481 ccaatggctg tctcaacggc tacacccctt cacgcaagag gcaggcatcg gagtccagca 541 gcaggccgca ttgacccgtc tccatcggac cccagcccct atctccaaga gacagaggag 601 gcgtcaggag gcactgctca tctgtacata ctgtttccta tgacattact gtatttaaga 661 aaacaccatg gagatgaaat gcctttgatt ttttttttct ttttgtactt tggaacgaca 721 aaatgaaaca gaacttgacc ctgagcttaa ataacaaaac tgtgccaact actactggtg 781 atgcctaatt atgaatccaa cgtgtaacca gtaataaata catatatata t // LOCUS D78514 617 bp DNA PRI 18-DEC-1996 DEFINITION Human mRNA for ubiquitin-conjugating enzyme, complete cds. ACCESSION D78514 NID g1741956 KEYWORDS UBE2G; ubiquitin-conjugating enzyme. SOURCE Homo sapiens fetus brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Watanabe,T.K., Kawai,A., Fujiwara,T., Maekawa,H., Hirai,Y., Nakamura,Y. and Takahashi,E. TITLE Molecular cloning of UBE2G, encoding a human skeletal muscle-specific ubiquitin-conjugating enzyme homologous to UBC7 of C. elegans JOURNAL Cytogenet. Cell Genet. 74 (1-2), 146-148 (1996) MEDLINE 97049093 REFERENCE 2 (sites) AUTHORS Watanabe,T., Okuno,S., Fujiwara,T., Takahashi,E., Nakamura,Y., Hirai,Y. and Maekawa,H. TITLE Molecular cloning of a novel ubiquitin-conjugating enzyme, UBE2G, homologus to UBC7 of C.elegans JOURNAL Unpublished (1995) REFERENCE 3 (bases 1 to 617) AUTHORS Watanabe,T. TITLE Direct Submission JOURNAL Submitted (28-NOV-1995) to the DDBJ/EMBL/GenBank databases. Takeshi Watanabe, Otsuka GEN Research Institute,Otsuka Pharmaceutical Co.,Ltd; 463-10 Kagasuno Kawauchi-cho, Tokushima, Tokushima 771-01, Japan (Tel:0886-65-2888, Fax:0886-37-1035) FEATURES Location/Qualifiers source 1..617 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="brain" gene 19..531 /gene="UBE2G" CDS 19..531 /gene="UBE2G" /codon_start=1 /product="ubiquitin-conjugating enzyme" /db_xref="PID:d1012075" /db_xref="PID:g1741957" /translation="MTELQSALLLRRQLAELNKNPVEGFSAGLIDDNDLYRWEVLIIG PPDTLYEGGVFKAHLTFPKDYPLRPPKMKFITEIWHPNVDKNGDVCISILHEPGEDKY GYEKPEERWLPIHTVETIMISVISMLADPNGDSPANVDAAKEWREDRNGEFKRKVARC VRKSQETAFE" 3'UTR 532..>617 BASE COUNT 181 a 129 c 142 g 165 t ORIGIN 1 gggccctcgg cagggaggat gacggagctg cagtcggcac tgctactgcg aagacagctg 61 gcagaactca acaaaaatcc agtggaaggc ttttctgcag gtttaataga tgacaatgat 121 ctctaccgat gggaagtcct tattattggc cctccagata cactttatga aggtggtgtt 181 tttaaggctc atcttacttt cccaaaagat tatcccctcc gacctcctaa aatgaaattc 241 attacagaaa tctggcaccc aaatgttgat aaaaatggtg atgtgtgcat ttctattctt 301 catgagcctg gggaagataa gtatggttat gaaaagccag aggaacgctg gctccctatc 361 cacactgtgg aaaccatcat gattagtgtc atttctatgc tggcagaccc taatggagac 421 tcacctgcta atgttgatgc tgcgaaagaa tggagggaag atagaaatgg agaatttaaa 481 agaaaagttg cccgctgtgt aagaaaaagc caagagactg cttttgagtg acatttattt 541 agcagctagt aacttcactt atttcagggt ctccaattga gaaacatggc actgtttttc 601 ctgcactcta cccaccg // LOCUS D78611 2476 bp mRNA PRI 05-NOV-1996 DEFINITION Human MEST mRNA, complete cds. ACCESSION D78611 NID g1655421 KEYWORDS MEST. SOURCE Homo sapiens cDNA to mRNA, clone_lib:human fetus cDNA clone:pB312. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Sado,T., Nakajima,N., Tada,M. and Takagi,N. TITLE A novel mesoderm-specific cDNA isolated from a mouse embrional carcinoma cell line JOURNAL Dev. Growth Differ. 35, 551-560 (1993) REFERENCE 2 (sites) AUTHORS Nishita,Y., Yoshida,I., Sado,T. and Takagi,N. TITLE Genomic imprinting and chromosomal localization of the human MEST gene JOURNAL Genomics 36 (3), 539-542 (1996) MEDLINE 97038699 REFERENCE 3 (bases 1 to 2476) AUTHORS Nishita,Y. JOURNAL Unpublished (1996) REFERENCE 4 (bases 1 to 2476) AUTHORS Nishita,Y. TITLE Direct Submission JOURNAL Submitted (11-DEC-1995) to the DDBJ/EMBL/GenBank databases. Yoshinori Nishita, Hokkaido University, Graduate School of Enviro. Earth Sci.; kita 10 nishi 5, kita-ku, Sapporo, hokkaido 060, Japan (E-mail:nishita@noah.eesbio.hokudai.ac.jp, Tel:011-706-3588, Fax:011-737-0536) FEATURES Location/Qualifiers source 1..2476 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" /clone="pB312" /clone_lib="human fetus cDNA" /map="7q32" gene 224..1231 /gene="MEST" CDS 224..1231 /gene="MEST" /codon_start=1 /db_xref="PID:d1012098" /db_xref="PID:g1655422" /translation="MVRRDRLRRMREWWVQVGLLAVPLLAAYLHIPPPQLSPALHSWK SSGKFFTYKGLRIFYQDSVGVVGSPEIVVLLHGFPTSSYDWYKIWKGLTLRFHRVIAL DFLGFGFSDKPRPHHYSIFEQASIVEALLRHLGLQNRRINLLSHDYGDIVAQELLYRY KQNRSGRHTIKSLCLSNGGIFPETHRPLLLQKLLKDGGVLSPILTRLMNFFVFSRGLT PVFGPYTRPSESELWDMWAGIRNNDGNLVIDSLLQYINQRKKFRRRWVGALASVTIPI HFIYGPLDPVNPYPEFLELYRKTLPRSTVSILDDHISHYPQLEDPMGFLNAYMGFINS F" BASE COUNT 632 a 583 c 556 g 705 t ORIGIN 1 cggccagcac accccggcac ctcctctgcg gcagctgcgc ctcgcaagcg cagtgccgca 61 gcgcacgccg gagtggctgt agctgcctcg gcgcggctgc cgccctgcgc gggctgtggg 121 ctgcgggctg cgcccccgct gctggccagc tctgcacggc tgcgggctct gcggcgcccg 181 gtgctctgca acgctgcggc gggcggcatg ggataacgcg gccatggtgc gccgagatcg 241 cctccgcagg atgagggagt ggtgggtcca ggtggggctg ctggccgtgc ccctgcttgc 301 tgcgtacctg cacatcccac cccctcagct ctcccctgcc cttcactcat ggaagtcttc 361 aggcaagttt ttcacttaca agggactgcg tatcttctac caagactctg tgggtgtggt 421 tggaagtcca gagatagttg tgcttttaca cggttttcca acatccagct acgactggta 481 caagatttgg aagggtctga ccttgaggtt tcatcgggtg attgcccttg atttcttagg 541 ctttggcttc agtgacaaac cgagaccaca tcactattcc atatttgagc aggccagcat 601 cgtggaagcg cttttgcggc atctggggct ccagaaccgc agaatcaacc ttctttctca 661 tgactatgga gatattgttg ctcaggagct tctctacagg tacaagcaga atcgatctgg 721 tcggcatacc ataaagagtc tctgtctgtc aaatggaggt atctttcctg agactcaccg 781 tccactcctt ctccaaaagc tactcaaaga tggaggtgtg ctgtcaccca tcctcacacg 841 actgatgaac ttctttgtat tctctcgagg tctcacccca gtctttgggc cgtatactcg 901 gccctctgag agtgagctgt gggacatgtg ggcagggatc cgcaacaatg acgggaactt 961 agtcattgac agtctcttac agtacatcaa tcagaggaag aagttcagaa ggcgctgggt 1021 gggagctctt gcctctgtaa ctatccccat tcattttatc tatgggccat tggatcctgt 1081 aaatccctat ccagagtttt tggagctgta caggaaaacg ctgccgcggt ccacagtgtc 1141 gattctggat gaccacatta gccactatcc acagctagag gatcccatgg gcttcttgaa 1201 tgcatatatg ggcttcatca actccttctg agctggaaag agtagcttcc ctgtattacc 1261 tcccctactc ccttatgtgt tgtgtattcc acttaggaag aaatgcccaa aagaggtcct 1321 ggccatcaaa cataattctc tcacaaagtc cactttactc aaattggtga acagtgtata 1381 ggaagaagcc agcaggagct ctgactaagg ttgacataat agtccacctc ccattacttt 1441 gatatctgat caaatgtata gacttggctt tgttttttgt gctattagga aattctgatg 1501 agcattacta ttcactgatg cagaaagacg ttcttttgca taaaagactt ttttttaaca 1561 ctttggactt ctctgaaata tttagaagtg ctaatttctg gcccaccccc aacaggaatt 1621 ctatagtaag gaggaggaga aggggggctc cttccctctc ctcgaatgac gttatgggca 1681 catgcctttt aaaagttctt taagcaacac agagctgagt cctctttgtc atacctttgg 1741 atttagtgtt tcatcagctg tttttagtta taaacatttt gttaaaatag atattggttt 1801 aaatgataca gtattttagg tatgatttaa gactatgatt tacctataca ttatatatat 1861 tttataaaga tactaaacca gcataccctt actctgccag agtagtgaag ctaattaaac 1921 acgtttggtt tctgaataaa ttgaactaaa tccaaactat ttcctaaaat cacaggacat 1981 taaggaccaa tagcatctgt gccagagatg tactgttatt agctgggaag accaattcta 2041 acagcaaata acagtctgag actcctcata cctcagtggt tagaagcatg tctctcttga 2101 gctacagtag aggggaaggg attgttgtgt agtcaagtca ccatgctgaa tgtacactga 2161 ttcctttatg atgactgctt aactccccac tgcctgtccc agagaggctt tccaatgtag 2221 ctcagtaatt cctgttactt tacagacagg aaagttccag aaactttaag aacaaactct 2281 gaaagaccta tgagcaaatg gtgctgaata cttttttttt aaagccacat ttcattgtct 2341 tagtcaaagc aggattatta agtgattatt taaaattcgt ttttttaaat tagcaacttc 2401 aagtataaca actttgaaac tggaataagt gtttattttc tattaataaa aatgaattgt 2461 gacaaaaaaa aaaccg // LOCUS D79205 371 bp mRNA PRI 26-DEC-1996 DEFINITION Human mRNA for ribosomal protein L39, complete cds. ACCESSION D79205 NID g1754620 KEYWORDS ribosomal protein L39. SOURCE Homo sapiens colon tumor epithelial cell cell_line:COLO 205 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Otsuka,S., Tanaka,M., Saito,S., Yoshimoto,K. and Itakura,M. TITLE Molecular cloning of a cDNA encoding human ribosomal protein L39 JOURNAL Biochim. Biophys. Acta 1308 (2), 119-121 (1996) MEDLINE 96350464 REFERENCE 2 (bases 1 to 371) AUTHORS Otsuka,S., Tanaka,M., Iwami,M., Saito,S., Yoshimoto,K. and Itakura,M. TITLE Molecular cloning and sequence analysis of human ribosomal protein L39 JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 371) AUTHORS Otsuka,S. TITLE Direct Submission JOURNAL Submitted (09-DEC-1995) to the DDBJ/EMBL/GenBank databases. Satoshi Otsuka, School of Medicine, The University of Tokushima, Department of Clinical and Molecular Nutrition; Kuramoto 3-18-15, Tokushima, Tokushima 770, Japan (E-mail:itakura@nutr.med.tokushima-u.ac.jp, Tel:0886-31-3111(ex.2288), Fax:0886-31-9476) FEATURES Location/Qualifiers source 1..371 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="COLO 205" /cell_type="epithelial cell" /tissue_type="colon tumor" CDS 38..193 /codon_start=1 /product="ribosomal protein L39" /db_xref="PID:d1012131" /db_xref="PID:g1754621" /translation="MSSHKTFRIKRFLAKKQKQNRPIPQWIRMKTGNKIRYNSKRRHW RRTKLGL" polyA_site 346..351 BASE COUNT 109 a 79 c 79 g 104 t ORIGIN 1 cagccatcgt ggtgtgttct tgactccgct gctcgccatg tcttctcaca agactttcag 61 gattaagcga ttcctggcca agaaacaaaa gcaaaatcgt cccattcccc agtggattcg 121 gatgaaaact ggaaataaaa tcaggtacaa ctccaaaagg agacattgga gaagaaccaa 181 gctgggtcta taaggaattg cacatgagat ggcacacata tttatgctgt ctgaaggtca 241 cgatcatgtt accatatcaa gctgaaaatg tcaccactat ctggagattt cgacgtgttt 301 tcctctctga atctgttatg aacacgttgg ttggctggat tcagtaataa atatgtaagg 361 cctttctttt t // LOCUS D79887 6662 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0165 gene, complete cds. ACCESSION D79987 NID g1136391 KEYWORDS KIAA0165. SOURCE Homo sapiens male myeloblast cell-line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6662) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (12-DEC-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 6662) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA 0161 - KIAA 0200) deduced by analysis of cDNA clones from human cell line KG1 JOURNAL DNA Research (1995) In press REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Tanaka,A. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA0161-KIAA0200) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 3 (1), 17-24 (1996) MEDLINE 96281124 FEATURES Location/Qualifiers source 1..6662 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /chromosome="8" /sex="male" 5'UTR 1..1113 gene 1114..6501 /gene="KIAA0165" CDS 1114..6501 /gene="KIAA0165" /note="similarto Schizosaccharomyces pombe cut1+ protein which regulates spindle pole body duplication." /citation=[3] /codon_start=1 /db_xref="PID:d1012148" /db_xref="PID:g1136392" /translation="MEAPSPPLRALYESCQFFLSGLERGTKRRYRLDAILSLFAFLGG YCSLLQQLRDDGVYGGSSKQQQSFLQMYFQGLHLYTVVVYDFAQGCQIVDLADLTQLV DSCKSTVVWMLEALEGLSGQELTDHMGMTASYTSNLAYSFYSHKLYAEACAISEPLCQ HLGLVKPGTYPEVPPEKLHRCFRLQVESLKKLGKQAQGCKMVILWLAALQPCSPEHMA EPVTFWVRVKMDAARAGDKELQLKTLRDSLSGWDPETLALLLREELQAYKAVRADTGQ ERFNIICDLLELSPEETPAGAWARATHLVELAQVLCYHDFTQQTNCSALDAIREALQL LDSVRPEAQARDQLLDDKAQALLWLYICTLEAKIQEGIERDRRAQAPGNLEEFEVNDL NYEDKLQEDRFLYSNIAFNLAADAAQSKCLDQALALWKELLTKGQAPAVRCLQQTAAS LQILAALYQLVAKPMQALEVLLLLRIVSERLKDHSKAAGSSCHITQLLLTLGCPSYAQ LHLEEAASSLKHLDQTTDTYLLLSLTCDLLRSQLYWTHQKVTKGVSLLLSVLRDPALQ KSSKAWYLLRVQVLQLVAAYLSLPSNNLSHSLWEQLCAQGWQTPEIALIDSHKLLRSI ILLLMGSDILSTQKAAVETSFLDYGENLVQKWQVLSEVLSCSEKLVCHLGRLGSVSEA KAFCLEALKLTTKLQIPRQCALFLVLKGELELARNDIDLCQSDLQQVLFLLESCTEFG GVTQHLDSVKKVHLQKGKQQAQVPCPPQLPEEELFLRGPALELVATVAKEPGPIAPST NSSPVLKTKPQPIPNFLSHSPTCDCSLCASPVLTAVCLRWVLVTAGVRLAMGHQAQGL DLLQVVLKGCPEAAERLTQALQASLNHKTPPSLVPSLLDEILAQAYTLLALEGLNQPS NESLQKVLQSGLKFVAARIPHLEPWRASLLLIWALTKLGGLSCCTTQLFASSWGWQPP LIKSVPGSEPSKTQGQKRSGRGRQKLASAPLSLNNTSQKGLEGRGLPCTPKPPDRIRQ AGPHVPFTVFEEVCPTESKPEVPQAPRVQQRVQTRLKVNFSDDSDLEDPVSAEAWLAE EPKRRGTASRGRGRARKGLSLKTDAVVAPGSAPGNPGLNGRSRRAKKVASRHCEERRP QRASDQARPGPEIMRTIPEEELTDNWRKMSFEILRGSDGEDSASGGKTPAPGPEAASG EWELLRLDSSKKKLPSPCPDKESDKDLGPRLQLPSAPVATGLSTLDSICDSLSVAFRG ISHCPPSGLYAHLCRFLALCLGHRDPYATAFLVTESVSITCRHQLLTHLHRQLSKAQK HRGSLEIADQLQGLSLQEMPGDVPLARIQRLFSFRALESGHFPQPEKESFQERLALIP SGVTVCVLALATLQPGTVGNTLLLTRLEKDSPPVSVQIPTGQNKLHLRSVLNEFDAIQ KAQKENSSCTDKREWWTGRLALDHRMEVLIASLEKSVLGCWKGLLLPSSEEPGPAQEA SRLQELLQDCGWKYPDRTLLKIMLSGAGALTPQDIQALAYGLCPTQPERAQELLNEAV GRLQGLTVPSNSHLVLVLDKDLQKLPWESMPSLQALPVTRLPSFRFLLSYSIIKEYGA SPVLSQGVDPRSTFYVLNPHNNLSSTEEQFRANFSSEAGWRGVVGEVPRPEQVQEALT KHDLYIYAGHGAGARFLDGQAVLRLSCRAVALLFGCSSAALAVHGNLEGAGIVLKYIM AGCPLFLGNLWDVTDRDIDRYTEALLQGWLGAGPGAPLLYYVNQARQAPRLKYLIGAA PIAYGLPVSLR" 3'UTR 6502..6662 BASE COUNT 1412 a 1931 c 1863 g 1456 t ORIGIN 1 ggcggttaag tcctgtacct aggaaagagg gcgagctctg gggcgctctc cggtgtcatg 61 aggagcttca aaagagtcaa ctttgggact ctgctaagca gccagaagga ggctgaagag 121 ttgctgcccg acttgaaggt gggggtgctg cctggctcgg gatacacctg gctttccaaa 181 ctgagctgtt ttgtgtttgc cttttgaaga gatggataag agttcctgtc caaccctcca 241 gctggttttc ccagcagccg atctgatgct gagaggagac aagcttgtga tgccatcctg 301 agggcttgca accagcagct gactgctaag ctagcttgcc ctaggcatct ggggagcctg 361 ctggagctgg cagagctggc ctgtgatggc tacttagtgt ctaccccaca gcgtcctccc 421 ctctacctgg aacgaattct ctttgtctta ctgcggaatg ctgctgcaca aggaagccca 481 gaggccacac tccgccttgc tcagcccctc catgcctgct tggtgcagtg ctctcgcgag 541 gctgctcccc aggactatga ggccgtggct cggggcagct tttctctgct ttggaagggg 601 gcagaagccc tgttggaacg gcgagctgca tttgcagctc ggctgaaggc cttgagcttc 661 ctagtactct tggaggatga aagtacccct tgtgaggttc ctcactttgc ttctccaaca 721 gcctgtcgag cggtagctgc ccatcagcta tttgatgcca gtggccatgg tctaaatgaa 781 gcagatgctg atttcctaga tgacctgctc tccaggcacg tgatcagagc cttggtgggt 841 gagagaggga gctcttctgg gcttctttct ccccagaggg ccctctgcct cttggagctc 901 accttggaac actgccgtcg cttttgctgg agccgccacc atgacaaagc catcagcgca 961 gtggagaagg ctcacagtta cctaaggaac accaatctag cccctagcct tcagctatgt 1021 cagctggggg ttaagctgct gcaggtcggg gaggaaggac ctcaggcagt ggccaagctt 1081 ctgatcaagg catcagctgt cctgagcaag agtatggagg caccatcacc cccacttcgg 1141 gcattgtatg agagctgcca gttcttcctt tcaggcctgg aacgaggcac caagaggcgc 1201 tatagacttg atgccattct gagcctcttt gcttttcttg gagggtactg ctctcttctg 1261 cagcagctgc gggatgatgg tgtgtatggg ggctcctcca agcaacagca gtcttttctt 1321 cagatgtact ttcagggact tcacctctac actgtggtgg tttatgactt tgcccaaggc 1381 tgtcagatag ttgatttggc tgacctgacc caactagtgg acagttgtaa atctaccgtt 1441 gtctggatgc tggaggcctt agagggcctg tcgggccaag agctgacgga ccacatgggg 1501 atgaccgctt cttacaccag taatttggcc tacagcttct atagtcacaa gctctatgcc 1561 gaggcctgtg ccatctctga gccgctctgt cagcacctgg gtttggtgaa gccaggcact 1621 tatcccgagg tgcctcctga gaagttgcac aggtgcttcc ggctacaagt agagagtttg 1681 aagaaactgg gtaaacaggc ccagggctgc aagatggtga ttttgtggct ggcagccctg 1741 caaccctgta gccctgaaca catggctgag ccagtcactt tctgggttcg ggtcaagatg 1801 gatgcggcca gggctggaga caaggagcta cagctaaaga ctctgcgaga cagcctcagt 1861 ggctgggacc cggagaccct ggccctcctg ctgagggagg agctgcaggc ctacaaggcg 1921 gtgcgggccg acactggaca ggaacgcttc aacatcatct gtgacctcct ggagctgagc 1981 cccgaggaga caccagccgg ggcctgggca cgagccaccc acctggtaga actggctcag 2041 gtgctctgct accacgactt tacgcagcag accaactgct ctgctctgga tgctatccgg 2101 gaagccctgc agcttctgga ctctgtgagg cctgaggccc aggccagaga tcagcttctg 2161 gacgataaag cacaggcctt gctgtggctt tacatctgta ctctggaagc caaaatacag 2221 gaaggtatcg agcgggatcg gagagcccag gcccctggta acttggagga atttgaagtc 2281 aatgacctga actatgaaga taaactccag gaagatcgtt tcctatacag taacattgcc 2341 ttcaacctgg ctgcagatgc tgctcagtcc aaatgcctgg accaagccct ggccctgtgg 2401 aaggagctgc ttacaaaggg gcaggcccca gctgtacggt gtctccagca gacagcagcc 2461 tcactgcaga tcctagcagc cctctaccag ctggtggcaa agcccatgca ggctctggag 2521 gtcctcctgc tgctacggat tgtctctgag agactgaagg accactcgaa ggcagctggc 2581 tcctcctgcc acatcaccca gctcctcctg accctcggct gtcccagcta tgcccagtta 2641 cacctggaag aggcagcatc gagcctgaag catctcgatc agactactga cacatacctg 2701 ctcctttccc tgacctgtga tctgcttcga agtcaactct actggactca ccagaaggtg 2761 accaagggtg tctctctgct gctgtctgtg cttcgggatc ctgccctcca gaagtcctcc 2821 aaggcttggt acttgctgcg tgtccaggtc ctgcagctgg tggcagctta ccttagcctc 2881 ccgtcaaaca acctctcaca ctccctgtgg gagcagctct gtgcccaagg ctggcagaca 2941 cctgagatag ctctcataga ctcccataag ctcctccgaa gcatcatcct cctgctgatg 3001 ggcagtgaca ttctctcaac tcagaaagca gctgtggaga catcgttttt ggactatggt 3061 gaaaatctgg tacaaaaatg gcaggttctt tcagaggtgc tgagctgctc agagaagctg 3121 gtctgccacc tgggccgcct gggtagtgtg agtgaagcca aggccttttg cttggaggcc 3181 ctaaaactta caacaaagct gcagatacca cgccagtgtg ccctgttcct ggtgctgaag 3241 ggcgagctgg agctggcccg caatgacatt gatctctgtc agtcggacct gcagcaggtt 3301 ctgttcttgc ttgagtcttg cacagagttt ggtggggtga ctcagcacct ggactctgtg 3361 aagaaggtcc acctgcagaa ggggaagcag caggcccagg tcccctgtcc tccacagctc 3421 ccagaggagg agctcttcct aagaggccct gctctagagc tggtggccac tgtggccaag 3481 gagcctggcc ccatagcacc ttctacaaac tcctccccag tcttgaaaac caagccccag 3541 cccataccca acttcctgtc ccattcaccc acctgtgact gctcgctctg cgccagccct 3601 gtcctcacag cagtctgtct gcgctgggta ttggtcacgg caggggtgag gctggccatg 3661 ggccaccaag cccagggtct ggatctgctg caggtcgtgc tgaagggctg tcctgaagcc 3721 gctgagcgcc tcacccaagc tctccaagct tccctgaatc ataaaacacc cccctccttg 3781 gttccaagcc tcttggatga gatcttggct caagcataca cactgttggc actggagggc 3841 ctgaaccagc catcaaacga gagcctgcag aaggttctac agtcagggct gaagtttgta 3901 gcagcacgga taccccacct agagccctgg cgagccagcc tgctcttgat ttgggccctc 3961 acaaaactag gtggcctcag ctgctgtact acccaacttt ttgcaagctc ctggggctgg 4021 cagccaccat taataaaaag tgtccctggc tcagagccct ctaagactca gggccaaaaa 4081 cgttctggac gagggcgcca aaagttagcc tctgctcccc tgagcctcaa taatacctct 4141 cagaaaggtc tggaaggtag aggactgccc tgcacaccta aacccccaga ccggatcagg 4201 caagctggcc ctcatgtccc cttcacggtg tttgaggaag tctgccctac agagagcaag 4261 cctgaagtac cccaggcccc cagggtacaa cagagagtcc agacgcgcct caaggtgaac 4321 ttcagtgatg acagtgactt ggaagaccct gtctcagctg aggcctggct ggcagaggag 4381 cctaagagac ggggcactgc ttcccggggc cgggggcgag caaggaaggg cctgagccta 4441 aagacggatg ccgtggttgc cccaggtagt gcccctggga accctggcct gaatggcagg 4501 agccggaggg ccaagaaggt ggcatcaaga cattgtgagg agcggcgtcc ccagagggcc 4561 agtgaccagg ccaggcctgg ccctgagatc atgaggacca tccctgagga agaactgact 4621 gacaactgga gaaaaatgag ctttgagatc ctcaggggct ctgacgggga agactcagcc 4681 tcaggtggga agactccagc tccgggccct gaggcagctt ctggagaatg ggagctgctg 4741 aggctggatt ccagcaagaa gaagctgccc agcccatgcc cagacaagga gagtgacaag 4801 gaccttggtc ctcggctcca gctcccctca gcccccgtag ccactggtct ttctaccctg 4861 gactccatct gtgactccct gagtgttgct ttccggggca ttagtcactg tcctcctagt 4921 gggctctatg cccacctctg ccgcttcctg gccttgtgcc tgggccaccg ggatccttat 4981 gccactgctt tccttgtcac cgagtctgtc tccatcacct gtcgccacca gctgctcacc 5041 cacctccaca gacagctcag caaggcccag aagcaccgag gatcacttga aatagcagac 5101 cagctgcagg ggctgagcct tcaggagatg cctggagatg tccccctggc ccgcatccag 5161 cgcctctttt ccttcagggc tttggaatct ggccacttcc cccagcctga aaaggagagt 5221 ttccaggagc gcctggctct gatccccagt ggggtgactg tgtgtgtgtt ggccctggcc 5281 accctccagc ccggaaccgt gggcaacacc ctcctgctga cccggctgga aaaggacagt 5341 cccccagtca gtgtgcagat tcccactggc cagaacaagc ttcatctgcg ttcagtcctg 5401 aatgagtttg atgccatcca gaaggcacag aaagagaaca gcagctgtac tgacaagcga 5461 gaatggtgga cagggcggct ggcactggac cacaggatgg aggttctcat cgcttcccta 5521 gagaagtctg tgctgggctg ctggaagggg ctgctgctgc cgtccagtga ggagcccggc 5581 cctgcccagg aggcctcccg cctacaggag ctgctacagg actgtggctg gaaatatcct 5641 gaccgcactc tgctgaaaat catgctcagt ggtgccggtg ccctcacccc tcaggacatt 5701 caggccctgg cctacgggct gtgcccaacc cagccagagc gagcccagga gctcctgaat 5761 gaggcagtag gacgtctaca gggcctgaca gtaccaagca atagccacct tgtcttggtc 5821 ctagacaagg acttgcagaa gctgccgtgg gaaagcatgc ccagcctcca agcactgcct 5881 gtcacccggc tgccctcctt ccgcttccta ctcagctact ccatcatcaa agagtatggg 5941 gcctcgccag tgctgagtca aggggtggat ccacgaagta ccttctatgt cctgaaccct 6001 cacaataacc tgtcaagcac agaggagcaa tttcgagcca atttcagcag tgaagctggc 6061 tggagaggag tggttgggga ggtgccaaga cctgaacagg tgcaggaagc cctgacaaag 6121 catgatttgt atatctatgc agggcatggg gctggtgccc gcttccttga tgggcaggct 6181 gtcctgcggc tgagctgtcg ggcagtggcc ctgctgtttg gctgtagcag tgcggccctg 6241 gctgtgcatg gaaacctgga gggggctggc atcgtgctca agtacatcat ggctggttgc 6301 cccttgtttc tgggtaatct ctgggatgtg actgaccgcg acattgaccg ctacacggaa 6361 gctctgctgc aaggctggct tggagcaggc ccaggggccc cccttctcta ctatgtaaac 6421 caggcccgcc aagctccccg actcaagtat cttattgggg ctgcacctat agcctatggc 6481 ttgcctgtct ctctgcggta accccatgga gctgtcttat tgatgctaga agcctcataa 6541 ctgttctacc tccaaggtta gatttaatcc ttaggataac tcttttaaag tgattttccc 6601 cagtgtttta tatgaaacat ttccttttga tttaacctca gtataataaa gatacatcat 6661 tt // LOCUS D79983 5559 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0161 gene, complete cds. ACCESSION D79983 NID g1136383 KEYWORDS KIAA0161. SOURCE Homo sapiens male myeloblast cell-line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5559) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (12-DEC-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 5559) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA 0161 - KIAA 0200) deduced by analysis of cDNA clones from human cell line KG1 JOURNAL DNA Research (1995) In press REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Tanaka,A. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA0161-KIAA0200) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 3 (1), 17-24 (1996) MEDLINE 96281124 FEATURES Location/Qualifiers source 1..5559 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /chromosome="2" /sex="male" 5'UTR 1..348 gene 349..1227 /gene="KIAA0161" CDS 349..1227 /gene="KIAA0161" /note="There is a C3HC4 zinc-finger in the C-terminal region." /citation=[3] /codon_start=1 /db_xref="PID:d1012144" /db_xref="PID:g1136384" /translation="MTTARYRPTWDLALDPLVSCKLCLGEYPVEQMTTIAQCQCIFCT LCLKQYVELLIKEGLETAISCPDAACPKQGHLQENEIECMVAAEIMQRYKKLQFEREV LFDPCRTWCPASTCQAVCQLQDVGLQTPQPVQCKACRMEFCSTCKASWHPGQGCPETM PITFLPGETSAAFKMEEDDAPIKRCPKCKVYIERDEGCAQMMCKNCKHAFCWYCLESL DDDFLLIHYDKGPCRNKLGHSRASVIWHRTQVVGIFAGFGLLLLVASPFLLLATPFVL CCKCKCSKGDDDPLPT" 3'UTR 1228..5559 BASE COUNT 1375 a 1339 c 1288 g 1557 t ORIGIN 1 ggctggcatt gcagtgcggg ccgtgcgggc tgcgcgggcg cggggaggcg cgggcggcaa 61 actgcgggca cccggcaccc cgcagccagt accgggcgga ggcgtcagag ccgcgcaccg 121 cggacgagca ggcccaggca tccagtaccc tgctgactct gacatgcccc cagcctcccc 181 cagctctgca gggagccctg gtgccagaga ccttctccca tttctcccta tgggtgcctg 241 acctgagggc agagacatct gagtgtgtat aagcacagca ggacaccagg acctggcctg 301 aagacatttc ctctcctccc ctctggattt ctcagagact gttctgcgat gaccacagca 361 aggtaccggc ccacctggga cctggccctc gacccgctgg tgtcttgcaa gctctgtctt 421 ggggagtacc cagtggagca gatgacaacc atagcccagt gccaatgcat cttctgtact 481 ctgtgcctga aacagtatgt tgagctcttg atcaaagaag gattagaaac cgcaattagc 541 tgcccagatg ctgcctgccc taaacagggc cacctacagg agaacgagat tgagtgcatg 601 gttgcagctg aaattatgca aagatataaa aagctacaat ttgaaagaga ggtgctgttt 661 gatccctgtc ggacttggtg cccggcgtcc acctgccaag ctgtgtgtca gctccaggac 721 gtggggctgc agacccccca gccagtgcag tgcaaagcct gccgtatgga attctgctcc 781 acctgcaaag ccagctggca ccctggccag ggctgcccgg agaccatgcc gatcaccttc 841 ctccccgggg agaccagtgc tgctttcaaa atggaagaag atgacgcgcc catcaagcgc 901 tgccccaagt gcaaagtcta catcgagcga gacgaaggct gcgcgcagat gatgtgcaag 961 aactgcaagc acgccttctg ctggtactgc ctggagtctc tggacgatga tttccttctg 1021 atacactacg ataagggacc ctgccggaac aagctgggcc actcccgggc atctgtgatc 1081 tggcatcgga cacaggttgt gggcattttt gcaggatttg ggctgctgct cttggtggcc 1141 tcacctttcc tactcctggc cactcccttt gtactttgct gcaagtgcaa gtgcagtaaa 1201 ggtgacgacg acccgttacc cacctagagg aagcgcgatg ctggaacaca tccctgcctc 1261 cgggaagtgt ggctctcccc caaccctccc caccgtcccc ccttcactaa acatctttct 1321 tgccttatgt gccccattga gcttcacagt gtcaggctgg acgccgtgat ttcagggacc 1381 tatgtcacaa tgttcgctga ggccccaggt gtggtgggga ggggaggcag gtgtgggtag 1441 cgcacatccc cacagatcaa tctctgcaga tgacagggag gtgctgtgag aagtgcacca 1501 ggcagctttc tctctgtggt agactaggca tgtctgggga tggcctaaga gactttctgc 1561 tccttggctt ctagatggca ccatgttgtc agagaagtct tttaaggact gccactctct 1621 tcagacagaa tctgattatt ccagcttgag agaagcactc tgtttatgac aactgttttt 1681 attactaatg gcatttagta aaatcctttt tagaaggtat tttttttcca actgagtagt 1741 gaaaatcaac aaggtgcata taaaccatcc tatgcctttc ttgaaatgtg cattttaaga 1801 agttatagtt aaacgacttt tcaggagatt tagaaagcct tatgcactct ttgtgttttt 1861 cttgaaactt gctgtagtaa ctttttgaac gctgtaagct gtgtactgtt ataagtgtgc 1921 ttcctatatt gttgcatttc cttgataata ttgacgtgtt taaaggaaca tctcatctcc 1981 attagatttg ccttttgttg ttttctctct ttggtgattc agctcagctc atgggcctca 2041 tcccttctct cccaggtagc agaaaacgct ttttattgca ttcaatccca ctgctttgct 2101 cggcaatggt tctcctccga attgctgccg tctggcctct ggcctcagtc ttcagataga 2161 cagtaagaag aaagcagcct cattgatccg cagatgtagg ggcctcttgg cagaggctgg 2221 aggactctgg gggctaggga agagcctgcc agattttcac atttttaaaa tgttctagtc 2281 attttgagaa atcatatatc ttacacagtt ccaagtcctg ttgctaaccg ttttgctctt 2341 gttggggaaa agaacctccc atttcacttc gttttaacgt ggggatttta ccacttgatc 2401 ttttacaggg ctgtctgtga ccatttccat ggcagcagga tgcagggatt aataaggaca 2461 catacacatt attccaccaa ttgactttga aaagtggaga aagtgtattc tttaaagaga 2521 atttacttct aaaagccaca ggtgctttta caaagcaaac tgcattgaat ttaaaacttc 2581 taaaaataac aggcaaatat tgtagctata ttacagtggg attttaaaaa tcttgttaaa 2641 cattctataa agaagtaaaa caaaatcctt ttatctcagt tatttgccca tcacaagcac 2701 atttttaaaa tgttgctttg tgtgtgtttg acttctgcat ttgagtatca gtctcaggtc 2761 ctagtttact ttccctggtt ggaaatttat ttcttatttc ctaacattga attcgttaga 2821 aaaaacagcg tcacctcact cttcacctgt catgttgatt ttccttatga acccgaagcc 2881 atttagaaaa tccctgtgtg tcaaaattac attcaaaaag ctctccttgt aattgcaagt 2941 ttagtaactc agtaagaaca tgcctgcgac tccctttctg gatggaacct gggctgtggc 3001 tctctgtcgt ttgtggtgct gtgatggtgt ctactgttag aatagctttt ctggaggtgg 3061 gtggcaactc cacgcgggag tcattggctg ggcttgagcc ctcagcctgt gatatgtgga 3121 tgcagctgtc cagccactgc cctttaacat gcccagcaca catgggaggc ctgtggccct 3181 gtgcccagtg gcccacagga cacgcctcca ccatatgctc atccttcctg cctgaaattg 3241 ctgctgccca tcgcatgagc ccacacagta gcacccccgt ccttaggatg agagtcagag 3301 cctttggaaa ggctgcatcc ccagggctag agctcagatg accttatttc tagagggaca 3361 ggctgttctg ttggaagagt ctctggccca gtcatgcaca tctgtagccc cagccatcct 3421 tgtgccttca cctctgatgt gtttcacaag cacgtagttc aacccagata gggaccacag 3481 agattctggg gccagccagg ggcagtcaaa ttagcaccta ctggctctga ctttttgtat 3541 gaagcatgcc tcagtttcct cattttgatc tagatataaa attaaaattg tccatttcct 3601 atataaatac cagcaagatc actctgagat acataaactc gcatttcctt ctgttttagt 3661 aaggagtgcc agacttatct ttgatgggaa tacagtatga accctgcttg atgtaaaatg 3721 gaaatagcac acaggcagat ggccctgggt ttggactttg atcttgccat cattccccag 3781 taaaacctgg ggagttctgc gttgagtgga cggacttatt cctgtgaagc ggcataattt 3841 gtctccattg aaaaatggca ttcactctta cagatggtgt tcactgcaag ccccagaagc 3901 atatggcatg tgttcactaa gaggccttta atcctgggga gtaaggggcg aaggccctta 3961 gacaaccatg gctgctgtac cgccgcccag ggtgggtggc cagtgaggac tggccttagc 4021 ccagtggacc tgtggcttct ctgaggccct tgagtaactg accacatttg gaggttttgc 4081 tggaaatgcc tgacctctca gtctggctct gctgtgtagt ccatagccca gccagatgag 4141 cttgcagcct cataggaggt cagcaccttg caaagatgca gtcaccatag atgtccacgt 4201 agcagagact gacttaggat ctgagataaa gcatcggatt gcaggaataa ctgtccaaat 4261 tagttcttcc tcctcaactc ataggagtag ctgtggacag aggaaccaac atctgccacc 4321 tctggcattt tctttctttt ttttcttttt gagacggagt ttcgcttttg tcccccaagc 4381 tggagtacaa tgacaggatc tcggctcact gcaagctctg cctcccgggt tcaagcgatt 4441 ctcctgcctc agcctcccaa gtagctggga ttacaggcac ccaccacctc acccagctaa 4501 tttttgtatt ttcagtagag acggggtttc accatgttgg ccaggctact ctcaaactcc 4561 tgacctcagg tgatctgccc acctcggctt cccaaagtgc tggaattaca ggcctgagcc 4621 accccacccg gcctattttc ttaaagggga acaaatgatc ttgacaacat attattccat 4681 aaaaccagtt tagggcacag gccagttcct gattagaaca caggacctgt gggagggact 4741 atcagagatg caaaaattac ttcaagatga gtttattgtt ttcatttgta ttgcaaaagt 4801 tagaagtcat tttacaaatt aaaaaaacat ttttttcttg gtagtcttta aaaattaggg 4861 gattgaaagg atccaggatg ggctttgtgt gtgtgtctca gattctcatt tattagtgag 4921 cacacctgtg tatatatata aatcacaagg agatcatcaa gggaaaacat tttgcatgtg 4981 taaagcttca tgaagttctc tttaaaaaat accaaagctt gtttatttct gataattaac 5041 ctaagccctt atgaaaataa acaaaatgaa gggattatga caggtattac caaaaacacc 5101 aaaaggaaca aaggggcctg cgttaaaacc taattgctaa tgcttcacaa ctaggagagc 5161 atgccgtctt gatgtttaaa aaacccaggg tctccaccct tcctttgatt tgtgcaattc 5221 tgtcttccac agttccggag ccttcagtga ggggtagcta catgccccat gcctgccctt 5281 tctttccttc tttgctcact ttactatggg tgtattttaa tcttgtataa aaatatgcat 5341 gaatgagtca tgcacatgta tacgttatgt atttgacaag tggtggtgaa acaaaatcaa 5401 aacagatttg atttgtgttt ttgaaatgtc agtacatttt gtgccactaa cactgtgatg 5461 tataaaagag ctgtttgaat gccttttaat gttgtgtttt gtactctgga atcatatgga 5521 aaaagtttga tttgtaattt caatacatat tttaaatgt // LOCUS D79984 5876 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0162 gene, complete cds. ACCESSION D79984 NID g1136385 KEYWORDS KIAA0162. SOURCE Homo sapiens male myeloblast cell-line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5876) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (12-DEC-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 5876) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA 0161 - KIAA 0200) deduced by analysis of cDNA clones from human cell line KG1 JOURNAL DNA Research (1995) In press REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Tanaka,A. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA0161-KIAA0200) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 3 (1), 17-24 (1996) MEDLINE 96281124 REFERENCE 4 (sites) AUTHORS Chiang,P.W., Wang,S., Smithivas,P., Song,W.J., Ramamoorthy,S., Hillman,J., Puett,S., Van Keuren,M.L., Crombez,E., Kumar,A., Glover,T.W., Miller,D.E., Tsai,C.H., Blackburn,C.C., Chen,X.N., Sun,Z., Cheng,J.F., Korenberg,J.R. and Kurnit,D.M. TITLE Identification and analysis of the human and murine putative chromatin structure regulator SUPT6H and Supt6h JOURNAL Genomics 34 (3), 328-333 (1996) MEDLINE 96374824 FEATURES Location/Qualifiers source 1..5876 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /chromosome="7" /sex="male" 5'UTR 1..90 gene 91..5271 /gene="KIAA0162" CDS 91..5271 /gene="KIAA0162" /note="similar to emb-5 protein of C.elegans." /citation=[3] /codon_start=1 /db_xref="PID:d1012145" /db_xref="PID:g1136386" /translation="MSDFVESEAEESEEEYNDEGEVVPRVTKKFVEEEDDDEEEEEEN LDDQDEQGNLKGFINDDDDEDEGEEDEGSDSGDSEDDVGHKKRKRTSFDDRLEDDDFD LIEENLGVKVKRGQKYRRVKKMSDDEDDDEEEYGKEEHEKEAIAEEIFQDGEGEEGQE AMEAPMAPPEEEEEDDEESDIDDFIVDDDGQPLKKPKWRKKLPGYTDAALQEAQEIFG VDFDYDEFEKYNEYDEELEEEYEYEDDEAEGEIRVRPKKTTKKRVSRRSIFEMYEPSE LESSHLTDQDNEIRATDLPERFQLRSIPVKGAEDDELEEEADWIYRNAFATPTISLQE SCDYLDRGQPASSFSRKGPSTIQKIKEALGFMRNQHFEVPFIAFYRKEYVEPELHIND LWRVWQWDEKWTQLRIRKENLTRLFEKMQAYQYEQISADPDKPLADGIRALDTTDMER LKDVQSMDELKDVYNHFLLYYGRDIPKMQNAAKASRKKLKRVREEGDEEGEGDEAEDE EQRGPELKQASRRDMYTICQSAGLDGLAKKFGLTPEQFGENLRDSYQRHETEQFPAEP LELAKDYVCSQFPTPEAVLEGARYMVALQIAREPLVRQVLRQTFQERAKLNITPTKKG RKDVDEAHYAYSFKYLKNKPVKELRDDQFLKICLAEDEGLLTTDISIDLKGVEGYGND QTYFEEIKQFYYRDEFSHQVQEWNRQRTMAIERALQQFLYVQMAKELKNKLLAEAKEY VIKACSRKLYNWLRVAPYRPDQQVEEDDDFMDENQGKGIRVLGIAFSSARDHPVFCAL VNGEGEVTDFLRLPHFTKRRTAWREEEREKKAQDIETLKKFLLNKKPHVVTVAGENRD AQMLIEDVKRIVHELDQGQQLSSIGVELVDNELAILYMNSKKSEAEFRDYPPVLRQAV SLARRIQDPLIEFAQVCSSDEDILCLKFHPLQEHVVKEELLNALYCEFINRVNEVGVD VNRAIAHPYSQALIQYVCGLGPRKGTHLLKILKQNNTRLESRTQLVTMCHMGPKVFMN CAGFLKIDTASLGDSTDSYIEVLDGSRVHPETYEWARKMAVDALEYDESAEDANPAGA LEEILENPERLKDLDLDAFAEELERQGYGDKHITLYDIRAELSCRYKDLRTAYRSPNT EEIFNMLTKETPETFYIGKLIICNVTGIAHRRPQGESYDQAIRNDETGLWQCPFCQQD NFPELSEVWNHFDSGSCPGQAIGVKTRLDNGVTGFIPTKFLSDKVVKRPEERVKVGMT VHCRIMKIDIEKFSADLTCRTSDLMDRNNEWKLPKDTYYDFDAEAADHKQEEDMKRKQ QRTTYIKRVIAHPSFHNINFKQAEKMMETMDQGDVIIRPSSKGENHLTVTWKVSDGIY QHVDVREEGKENAFSLGATLWINSEEFEDLDEIVARYVQPMASFARDLLNHKYYQDCS GGDRKKLEELLIKTKKEKPTFIPYFICACKELPGKFLLGYQPRGKPRIEYVTVTPEGF RYRGQIFPTVNGLFRWFKDHYQDPVPGITPSSSSRTRTPASINATPANINLADLTRAV NALPQNMTSQMFSAIAAVTGQGQNPNATPAQWASSQYGYGGSGGGSSAYHVFPTPAQQ PVATPLMTPSYSYTTPSQPITTPQYHQLQASTTPQSAQAQPQPSSSSRQRQQQPKSNS HAAIDWGKMAEQWLQEKEAERRKQKQRLTPRPSPSPMIESTPMSIAGDATPLLDEMDR " 3'UTR 5272..5876 BASE COUNT 1601 a 1468 c 1655 g 1152 t ORIGIN 1 cgactccatt ttcctcggtg ggggcttagc gcactggagg agcggcgggc ttcagacagt 61 tatcttcagg cagagtgaga agctgcagca atgtctgatt ttgtggaaag cgaggctgag 121 gagtcagagg aagaatacaa tgatgaaggc gaggtggtac cccgagtcac caagaaattt 181 gtggaagagg aggatgatga tgaggaggag gaggaggaga acctagatga tcaggatgag 241 caaggcaact tgaaaggctt tatcaatgac gatgatgatg aagatgaagg ggaggaggat 301 gagggcagtg actctggtga ttcagaagat gatgttggcc acaagaagag aaaacgcacc 361 tcttttgatg accgcctgga ggatgatgat tttgacctca ttgaggagaa tttgggtgtc 421 aaagtcaaaa gaggacaaaa gtaccggcgt gtcaaaaaaa tgtcagatga cgaggacgat 481 gacgaggagg aatatggcaa ggaggaacat gaaaaagaag ctattgcgga agaaatcttc 541 caggatgggg aaggggaaga agggcaggag gccatggagg cccccatggc tcctccagag 601 gaggaggaag aagatgatga ggagtcagat attgacgact tcattgtgga tgatgatgga 661 cagcctctga aaaaacctaa gtggcggaaa aagcttcctg gatacacaga cgcggccctg 721 caagaagccc aggaaatctt cggtgtggac tttgactatg atgaatttga gaaatacaat 781 gagtatgatg aagaactgga ggaagagtat gagtatgagg atgatgaggc tgagggtgaa 841 atccgagtgc gccccaagaa gaccaccaag aagcgtgtga gccgtaggag catctttgaa 901 atgtatgagc ccagtgagct agaaagcagc cacctcacag atcaggacaa tgaaatccga 961 gccactgacc tgcctgagag gttccagctc cgctccatcc cagtcaaggg ggctgaagat 1021 gatgaactag aagaagaagc tgactggatc tacaggaatg cttttgccac accaaccatt 1081 tctctccagg aaagctgtga ttacctagac cgagggcagc cagccagcag cttcagtcgg 1141 aaagggccca gcacaattca gaagatcaaa gaggccctgg gcttcatgcg aaatcagcat 1201 tttgaggtgc cttttattgc cttctatcga aaggagtatg tggagcctga gttgcacatc 1261 aatgacctat ggagagtctg gcagtgggat gaaaagtgga cccagctgcg gatccgtaaa 1321 gagaacctaa cacggctgtt tgagaagatg caggcttatc agtatgaaca gatctctgct 1381 gaccctgaca aacctcttgc tgatggcatc cgggctctgg acaccactga catggagagg 1441 ctcaaggatg tccaatcaat ggatgagctg aaagatgtct acaaccattt tcttctttat 1501 tatggccgag acatccctaa gatgcagaac gccgccaaag ctagccgcaa gaagctgaag 1561 cgtgtcaggg aagagggaga tgaagaaggt gaaggtgacg aggcagaaga tgaggagcag 1621 agggggcctg agctcaagca agcctctcgc cgagacatgt acaccatctg ccagagtgct 1681 gggctagatg gcctggccaa aaagtttggg cttactcccg agcagtttgg ggagaacctg 1741 cgggatagct accagcggca cgagacagag cagtttcccg cggagccctt ggagctggcc 1801 aaggattacg tttgcagcca gttccctact ccagaagctg tgctagaagg cgcccgctac 1861 atggtagccc tgcagattgc ccgtgagccc cttgtccggc aggtgctgag gcaaaccttc 1921 caagagagag ccaagttaaa tataaccccc accaagaaag gtagaaagga tgtggatgag 1981 gcccactatg cctattcctt caagtattta aagaacaagc ctgttaagga actgagagat 2041 gaccagtttc tcaagatatg cctggctgaa gacgaagggc tcctcaccac tgacatcagc 2101 atagatttga agggagtgga aggctatggc aacgaccaga catattttga ggagataaaa 2161 cagttttact accgagatga gttcagccac caggtgcagg agtggaaccg gcagcgcacc 2221 atggccatcg aacgggcttt acagcagttc ctctatgtgc agatggccaa agaactcaag 2281 aacaagctgc tggctgaagc caaggaatat gtcataaagg cctgtagtcg aaagctctac 2341 aattggttga gagtggcacc ctaccgacca gatcagcagg tggaagaaga tgacgacttt 2401 atggacgaga accaagggaa gggcattcga gtcctcggca ttgctttctc ctctgccaga 2461 gatcaccctg tgttctgcgc cctggtcaat ggtgaaggag aagtgacaga cttccttcga 2521 ctgccccatt ttaccaaacg gcgaactgca tggagagagg aagagcggga aaagaaggct 2581 caagacattg aaacgctaaa gaaatttctc ctgaataaga agcctcatgt agtgacagtt 2641 gcaggagaga acagggacgc ccagatgttg attgaagatg tgaagcgcat tgtacatgag 2701 ctggaccagg gccagcagct gtcatctatt ggggtagagc tggttgacaa cgagttggcc 2761 attctctata tgaacagcaa gaagtcagag gcagagttcc gggattatcc tccagtgctg 2821 agacaggccg tctccctggc ccggcgcatc caggaccctc tgattgaatt tgcccaggtg 2881 tgcagttccg atgaagacat cctgtgtctc aagtttcacc ccttgcagga gcatgtggtg 2941 aaagaggagc tgctcaacgc cttgtactgt gaatttatca accgagtcaa tgaggtcggg 3001 gtcgatgtca accgtgccat tgcccaccct tacagccagg ccttgatcca gtatgtttgt 3061 ggcctgggac ctcggaaagg gacccacctc ctgaagatcc tgaagcagaa caacacccgg 3121 ctcgagagcc ggacccagct ggtcaccatg tgccacatgg gtcccaaagt cttcatgaat 3181 tgtgctggct tcctcaagat cgacacggcc tccctggggg acagcactga ctcatatatt 3241 gaagtccttg atggttcccg tgtccaccct gagacttatg agtgggctag gaagatggca 3301 gtggatgccc tggaatacga tgaatcagcc gaggatgcca atcctgcagg agcccttgaa 3361 gaaatcttgg aaaacccaga gcgactgaaa gacctggacc ttgatgcctt tgcagaagag 3421 ctggagaggc agggctatgg tgacaaacac atcacactct atgacatccg ggcagagctg 3481 agctgtcgat ataaggacct ccggacagcc taccgctctc ccaacacaga ggagatcttc 3541 aatatgttaa ccaaagaaac accagagacc ttctacattg gaaagctcat catctgcaat 3601 gtcactggca ttgcccacag gcgtccccag ggtgagagct atgaccaggc gatccgcaat 3661 gatgagacag ggctgtggca gtgccccttc tgtcagcagg acaatttccc tgaactaagc 3721 gaggtgtgga accactttga cagcggttcg tgcccaggcc aggccatcgg tgtcaaaaca 3781 cggctagaca atggtgtcac cggcttcatc cccaccaaat tcctcagtga caaagtggta 3841 aagcggccag aagaacgagt gaaggtggga atgactgttc actgccgcat catgaagatt 3901 gacattgaga agttcagtgc agacctgacc tgccgcacct cagacctcat ggacaggaac 3961 aatgagtgga agctgcccaa agacacctac tatgactttg atgctgaagc tgcagaccac 4021 aagcaggagg aggacatgaa gcggaagcag cagcggacca catacatcaa gagagtgatc 4081 gcacacccat ccttccataa tatcaatttc aagcaagcag aaaagatgat ggagaccatg 4141 gaccagggtg atgtgattat ccgaccaagc agcaagggcg agaaccacct gacagtgacc 4201 tggaaagtca gtgatggcat ctaccagcat gtggatgtgc gggaggaggg caaggaaaat 4261 gccttcagcc tgggagccac tctgtggatc aacagtgagg aattcgaaga tttggatgag 4321 attgttgctc gctatgtcca gcccatggca tcctttgccc gggaccttct gaatcacaag 4381 tattatcagg actgcagcgg tggggaccgc aagaaattag aggagctgct catcaaaact 4441 aagaaggaga agcccacctt catcccttat ttcatctgtg cctgcaagga actgcccggc 4501 aagttcctac tgggatacca gccccggggt aaacccagga tagaatatgt aacggtgact 4561 ccagagggat tccggtaccg gggccagatc ttcccaaccg tgaatggact gtttagatgg 4621 tttaaggatc actaccagga tcctgtacca ggcatcaccc ctagcagcag cagcaggacc 4681 cggacacctg cctctatcaa tgctacccca gccaacatca accttgcaga tctgacacgg 4741 gctgtgaatg ccctgcctca gaacatgact tcacagatgt tcagtgccat tgctgcggtg 4801 acaggccaag gacagaaccc taatgccacc ccagcccagt gggcctccag ccagtacggc 4861 tatggcggca gtggaggcgg cagcagtgct taccacgtat tcccaacgcc agcccagcag 4921 ccagtggcca caccactaat gacccctagc tactcctaca cgaccccaag ccagcccatc 4981 accacccctc agtaccacca gctccaggcc agcaccaccc cacagtcggc ccaggcccag 5041 ccccagccct cttccagctc ccggcaacgg cagcagcagc caaagtccaa cagccatgca 5101 gccatcgact ggggaaaaat ggcggagcag tggctgcagg aaaaggaggc agaacggcgg 5161 aaacagaagc agcggctgac acctcggccc tcccccagcc ccatgatcga aagcaccccc 5221 atgtccattg ctggcgatgc caccccactc ctggacgaga tggatcggta gggggcctgc 5281 tcctcggact ctggttacct ctgaggctgg gaaaggcctg gctgcccact gcctccctcc 5341 ctgcccctcc ttttatgtcc ataaagtggc gtgaagtgag acgttctctt tggtggtcaa 5401 cccggatggg tgacaggctg gatggccttg tgaacttgag ctcagtgtat gctaggcaac 5461 aattctcccg ctccagaccc tcaccgacca cctgtcctgg gaccaggctg ggaggggagt 5521 gtggcaggga ggaggaagag gaaggtgaga atgagtagaa cagttttgta ttctactccc 5581 tacaagccat tttgaacttc tgccctcacc ggactctggg ctgtgactgg ggcaccaaac 5641 tcagcacatg agtctcccct agctctcgtg gggagaggga tgctatttat tcagtttggg 5701 gcaggaggga gaggagggaa agtatttcta accctgatgc caacagccgg gtggctgtcc 5761 aagcaggatt gcaggggaca cagggaagca ctgcccagcc cctgcctggc tgccctttcc 5821 cccctgctgc tgccaccgct tcctgcctgt catttgaata aacagtgttt ctattg // LOCUS D79986 5538 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0164 gene, complete cds. ACCESSION D79986 NID g1136389 KEYWORDS KIAA0164. SOURCE Homo sapiens male myeloblast cell-line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5538) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (12-DEC-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 5538) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA 0161 - KIAA 0200) deduced by analysis of cDNA clones from human cell line KG1 JOURNAL DNA Research (1995) In press REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Tanaka,A. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA0161-KIAA0200) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 3 (1), 17-24 (1996) MEDLINE 96281124 FEATURES Location/Qualifiers source 1..5538 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /chromosome="6" /sex="male" 5'UTR 1..253 gene 254..3016 /gene="KIAA0164" CDS 254..3016 /gene="KIAA0164" /note="similar to human DNA-binding protein 5." /citation=[3] /codon_start=1 /db_xref="PID:d1012147" /db_xref="PID:g1136390" /translation="MGRSNSRSHSSRSKSRSQSSSRSRSRSHSRKKRYSSRSRSRTYS RSRSRDRMYSRDYRRDYRNNRGMRRPYGYRGRGRGYYQGGGGRYHRGGYRPVWNRRHS RSPRRGRSRSRSPKRRSVSSQRSRSRSRRSYRSSRSPRSSSSRSSSPYSKSPVSKRRG SQEKQTKKAEGEPQEESPLKSKSQEEPKDTFEHDPSESIDEFNKSSATSGDIWPGLSA YDNSPRSPHSPSPIATPPSQSSSCSDAPMLSTVHSAKNTPSQHSHSIQHSPERSGSGS VGNGSSRYSPSQNSPIHHIPSRRSPAKTIAPQNAPRDESRGRSSFYPDGGDQETAKTG KFLKRFTDEESRVFLLDRGNTRDKEASKEKGSEKGRAEGEWEDQEALDYFSDKESGKQ KFNDSEGDDTEETEDYRQFRKSVLADQGKSFATASHRNTEEEGLKYKSKVSLKGNRES DGFREEKNYKLKETGYVVERPSTTKDKHKEEDKNSERITVKKETQSPEQVKSEKLKDL FDYSPPLHKNLDAREKSTFREESPLRIKMIASDSHRPEVKLKMAPVPLDDSNRPASLT KDRLLASTLVHSVKKEQEFRSIFDHIKLPQASKSTSESFIQHIVSLVHHVKEQYFKSA AMTLNERFTSYQKATEEHSTRQKSPEIHRRIDISPSTLRKHTRLAGEERVFKEENQKG DKKLRCDSADLRHDIDRRRKERSKERGDSKGSRESSGSRKQEKTPKDYKEYKSYKDDS KHKREQDHSRSSSSSASPSSPSSREEKESKKEREEEFKTHHEMKEYSGFAGVSRPRGT FFRIRGRGRARGVFAGTNTGPNNSNTTFQKRPKEEEWDPEYTPKSKKYFLHDDRDDGV DYWAKRGRGRGTFQRGRGRFNFKKSGSSPKWTHDKYQGDGIVEDEEETMENNEEKKDR RKEEKE" 3'UTR 3017..5538 BASE COUNT 1861 a 965 c 1138 g 1574 t ORIGIN 1 gccccagatc ggaagtgacg gagacgtgct agcgcgtcga aggtagctct atggttttcc 61 tcgcgttctt gagtcgggaa atggccgctg tgtggttgca acggagataa attcccggaa 121 ccgcgattcg gcgtgtcagg aattcgaatt tagagtttaa tttctcagag cattctctcc 181 aggaagaatt tttacagtat ctcaaagact tcacttgact tcttgatcct gcataaaacc 241 aaggagaaaa gaaatgggtc gctccaattc tagatcacat tcttcaaggt caaagtctag 301 atcacagtct agttctcgat caagatcaag atctcattct agaaagaagc gatacagttc 361 taggtctcgt tccagaacat attcaaggtc tcgtagtaga gatcgtatgt attctagaga 421 ttatcgtcgc gattacagaa ataatagagg aatgagacga ccttatgggt acagaggaag 481 gggtagaggg tattatcaag gaggaggagg tagatatcat cgaggtggtt atagacctgt 541 ctggaataga aggcactcta ggagtcctag acgaggtcgt tcacgttcca ggagtccaaa 601 aagaagatcc gtttcttctc aaagatccag aagcagatct cgccggtcat atagatcttc 661 taggtctcca agatcatcct cttctcgttc ttcatcccca tatagcaaat ctcctgtttc 721 taaaagacga gggtctcagg aaaaacaaac caaaaaagct gaaggggaac cccaagaaga 781 gagtccgttg aaaagtaaat cacaggagga accgaaagat acatttgaac atgacccatc 841 tgagtctatc gatgaattta ataagtcatc agccacatcc ggtgatattt ggcctggcct 901 ttcagcttat gataatagtc ctagatcacc ccatagtcct tcacctattg ctacaccacc 961 tagtcagagt tcatcttgct ctgatgctcc catgctcagt acagttcact ctgcaaaaaa 1021 tactccttct cagcattcac attccattca gcatagtcct gaaaggtctg ggtctggttc 1081 tgttggaaat ggatctagtc gatacagtcc ttctcagaat agtccaattc atcacatccc 1141 ttcacgaaga agtcctgcaa agacaatcgc accacagaat gctccaagag atgagtctag 1201 gggccgttcc tcgttttatc ctgatggtgg agatcaggaa actgcaaaga ctgggaagtt 1261 cttaaaaagg ttcacagatg aagagtctag agtattcctg cttgataggg gtaataccag 1321 ggataaagag gcttcaaaag agaaaggatc agagaaaggg agggcagagg gagaatggga 1381 agatcaggaa gctctagatt acttcagtga taaagagtct ggaaaacaaa agtttaatga 1441 ttcagaaggg gatgacacag aggagacaga ggattataga cagttcagga agtcagtcct 1501 cgcagatcag ggtaaaagtt ttgctactgc atctcaccgg aatactgagg aggaaggact 1561 caagtacaag tccaaagttt cactgaaagg caatagagaa agtgatggat ttagagaaga 1621 aaaaaattat aaacttaaag agactggata tgtagtggaa aggcctagca ctacaaaaga 1681 taagcacaaa gaagaagaca aaaattctga aagaataaca gtaaagaaag aaactcagtc 1741 acctgagcag gtaaagtctg aaaagctcaa agacctcttt gattacagtc cccctctaca 1801 caagaatctg gatgcacgag aaaagtctac cttcagagag gaaagcccac ttaggatcaa 1861 aatgatagcg agtgattctc accgtcctga agtcaaactc aaaatggcac ctgttcctct 1921 tgatgattct aacagacctg cttccttgac taaagacagg ctgcttgcta gtacacttgt 1981 ccattctgtc aagaaggagc aagaattccg atccatcttt gaccacatta agttgccaca 2041 ggccagcaaa agcacttcag agtcatttat tcaacacatt gtgtccttgg ttcatcatgt 2101 taaagagcaa tacttcaagt cagctgcaat gaccctaaac gagcggttca cttcgtatca 2161 gaaagccact gaagaacata gtactcggca aaagagccct gaaatacaca ggagaattga 2221 catctcacca agtaccctga ggaagcatac ccgtttagca ggggaagaga gagtttttaa 2281 agaagaaaat caaaagggag ataaaaaatt aaggtgtgac tctgctgacc ttcggcatga 2341 cattgatcgc cgtagaaaag aaagaagtaa agaacgggga gattccaagg gctccaggga 2401 atccagtgga tcaagaaagc aggaaaaaac tccaaaagat tacaaggaat acaaatctta 2461 caaagatgac agtaaacata aaagagagca agatcattct cgatcttcat cctcttcagc 2521 atcaccttct tctcccagtt ctcgagaaga aaaggagagt aagaaggaaa gagaagaaga 2581 atttaaaact caccatgaaa tgaaagaata ctcaggcttt gcaggagtta gccgaccacg 2641 aggaaccttt tttcgaatta gaggcagagg aagagccaga ggagtttttg ctgggacaaa 2701 tactggtcca aacaactcaa atactacttt tcaaaagaga ccgaaggaag aggaatggga 2761 tccagaatat accccaaaga gcaagaagta cttcttgcat gacgacagag atgatggtgt 2821 ggattattgg gccaaaagag gaagaggtcg tggtactttt caacgtggca gagggcgctt 2881 taacttcaaa aaatcaggta gcagtcctaa atggactcat gacaaatacc aaggggatgg 2941 gattgttgaa gatgaagaag agaccatgga aaataatgaa gaaaagaagg acagacgcaa 3001 ggaagaaaag gaataataaa tatgaagtaa gattacaaca gagcagaact tgcacccacc 3061 atttttttta cctgattttt gttttcaaat aagaatgtaa gcattttact taaattttac 3121 tgtttgcaag tagtctatag aaattttgtt ttaagtcttc aaatatcttg agaaatagta 3181 gactgtatgt tgaaaattgt actgaaataa agtagaaaat tgttacgtac catatttgta 3241 actatcaact tttaaaactt ttaacgtttt tgttacatgc attgtaattc tgctttgtct 3301 ataagatatg gtcaagtaca gctctgtgaa agttctgatt ctcttccttc cctgtttgtc 3361 aatgttttat tctgaagtaa acgttagctc tacatataaa tcctggaaca gaaattgttt 3421 atagagacta cactaattat tttaactgta tacatctgtt taatttgaac acactacatc 3481 gtagggtgac tgatttttga agtataccac agacaaaaag ttgttactat ggtaaactaa 3541 gctagtttaa cacttgagca aatgcttaag aaggaattaa aaaaaaaaag ctttgccaat 3601 agctaaaaag tacaagctat taaaaatcag attgaaaagt tttgagaaaa tgttattttt 3661 actgaaagca agcagtggcc tataaagaac attcttagga gccttttcta tttgcgttca 3721 aaactgtgtg ttctctttct attcctattt gatagtttga gtcatggtct tagatattag 3781 ctatttgtga gaggaaactg gtttgtaaca atactgcaaa tagaaacccc atttctactg 3841 aacatcctag ttttaaacag aagaaaaact gtaatcctgg ggttggtatg taggaggtct 3901 atcctgcaga ataagttgat acattagtac ctgatttcat atcttacata tttatttgag 3961 ctgaacatta gtttgtagtg taactattag taaaaataga gaaacacagc atactgttca 4021 ttaatagtat tttaaaaaaa ttgtttttca aatgtcacca ataaaagttt tggcaggaag 4081 cttgttgcgg cattgatcta acctttttcc cccccatttc agttgcagtt tttgtagaat 4141 ggctttttct ttttcctctt aagagttcta ttcttcaggt agataatttt tcaaatgtga 4201 attatctttt gtgtctatat tgatagctct taaaggagtg aaaatctaaa atagtaaatt 4261 tcaatgttaa gtgtctgctt tatgggcata tataaaagta gacacatttc atttgttaat 4321 ttagttgtgt gtgtgtgtta aaaggagcta atgcttattc tgttaatgta aacttttgaa 4381 gatcttaagt gtattgctct ttcatcttaa acactttcga ggatttgcag tgcgtctagc 4441 acctagatta cagccaggaa cattggttaa gaactgttgg aaacaaaact aaaagcaaac 4501 tcaacatatg tgatgtttat ggccctcaga tccttagtat tgtgtgattt tcccccgtta 4561 acatgtcttt ctaaaattgt ctattaaagc agaggaaata cctgccaaag gaagtatgta 4621 ttgcattaat cagggcataa ctaatattct cctgttcaga ataatactta tttacgtgtg 4681 aaagcaacat ggatgtgatt cccaacacag aattttcatg acccttttat tgtatacaaa 4741 taaataccat aacagttact tggttagaca tcaaatctgt gtgcatgact atgtgcttat 4801 ccacttaaga caataggtaa aaggggatct gagaaattat gtaataggga gtgggaataa 4861 aactacttaa ttcctgtggg caggttatat tttaagttca aatgcattgc tttaaccttt 4921 ggttactttt attctgttaa acagaattga agaaagagta ttataccaga gtgtagtagg 4981 ctagggtgat tgtaagaact ctgtaataga atgtcattgt ggatgttacc tttttcagat 5041 ccaagcatat aaaaagcctg tatatttttt aaaaacacat cttaactcca cgctttacga 5101 tattataaaa gttgaatggt tcctcttggt aaggatattt gcttacaagt gctaggaaat 5161 aactcactga tacctgcgtt aacatacttt gttttgccta gagaggggca ataaaaatga 5221 accaaaggat atttccagaa aggattaaga aagctgttta agaaggccat gactctttag 5281 gtgtgtatgt gtacctttca gcatcctagg aatttttata ctaaaagcaa aatgtttttt 5341 ccagttagtc ttcttcaagg aattactatt gttccttttg tcacaggtaa aatcagtgtt 5401 gggaattata atttgagaaa aatattaccc agtaacattg aatgtagatg gctaaacgat 5461 tcttactcag tgtgatgtat aatgatgcaa cagggaccct tgtaaattgt catacgccaa 5521 taaaatgtca caagtaat // LOCUS D79988 6942 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0166 gene, complete cds. ACCESSION D79988 NID g1136393 KEYWORDS KIAA0166. SOURCE Homo sapiens male myeloblast cell-line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6942) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (12-DEC-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 6942) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA 0161 - KIAA 0200) deduced by analysis of cDNA clones from human cell line KG1 JOURNAL DNA Research (1995) In press REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Tanaka,A. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA0161-KIAA0200) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 3 (1), 17-24 (1996) MEDLINE 96281124 FEATURES Location/Qualifiers source 1..6942 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /chromosome="17" /sex="male" 5'UTR 1..163 gene 164..6793 /gene="KIAA0166" CDS 164..6793 /gene="KIAA0166" /note="There are three putative hydrophobic domains in the central region." /citation=[3] /codon_start=1 /db_xref="PID:d1012149" /db_xref="PID:g1136394" /translation="MWNDIELLTNDDTGSGYLSVGSRKEHGTALYQVDLLVKISSEKA SLNPKIQACSLSDGFIIVADQSVILLDSICRSLQLHLVFDTEVDVVGLCQEGKFLLVG ERSGNLHLIHVTSKQTLLTNAFVQKANDENRRTYQNLVIEKDGSNEGTYYMLLLTYSG FFCITNLQLLKIQQAIENVDFSTAKKLQGQIKSSFISTENYHTLGCLSLVAGDLASEV PVIIGGTGNCAFSKWEPDSSKKGMTVKNLIDAEIIKGAKKFQLIDNLLFVLDTDNVLS LWDIYTLTPVWNWPSLHVEEFLLTTEADSPSSVTWQGITNLKLIALTASANKKMKNLM VYSLPTMEILYSLEVSSVSSLVQTGISTDTIYLLEGVCKNDPKLSEDSVSVLVLRCLT EALPENRLSRLLHKHRFAEAESFAIQFGLDVELVYKVKSNHILEKLALSSVDASEQTE WQQLVDDAKENLHKIQDDEFVVNYCLKAQWITYETTQEMLNYAKTRLLKKEDKTALIY SDGLKEVLRAHAKLTTFYGAFGPEKFSGSSWIEFLNNEDDLKDIFLQLKEGNLVCAQY LWLRHRANFESRFDVKMLESLLNSMSASVSLQKLCPWFKNDVIPFVRRTVPEGQIILA KWLEQAARNLELTDKANWPENGLQLAEIFFTAEKTDELGLASSWHWISLKDYQNTEEV CQLRTLVNNLRELITLHRKYNCKLALSDFEKENTTTIVFRMFDKVLAPELIPSILEKF IRVYMREHDLQEEELLLLYIEDLLNRCSSKSTSLFETAWEAKAMAVIACLSDTDLIFD AVLKIMYAAVVPWSAAVEQLVKQHLEMDHPKVKLLQESYKLMEMKKLLRGYGIREVNL LNKEIMRVVRYILKQDVPSSLEDALKVAQAFMLSDDEIYSLRIIDLIDREQGEDCLLL LKSLPPAEAEKTAERVIIWARLALQEEPDHSKEGKAWRMSVAKTSVDILKILCDIQKD NLQKKDECEEMLKLFKEVASLQENFEVFLSFEDYSNSSLVADLREQHIKAHEVAQAKH KPGSTPEPIAAEVRSPSMESKLHRQALALQMSKQELEAELTLRALKDGNIKTALKKCS DLFKYHCNADTGKLLFLTCQKLCQMLADNVPVTVPVGLNLPSMIHDLASQAATICSPD FLLDALELCKHTLMAVELSRQCQMDDCGILMKASFGTHKDPYEEWSYSDFFSEDGIVL ESQMVLPVIYELISSLVPLAESKRYPLESTSLPYCSLNEGDGLVLPVINSISALLQNL QESSQWELALRFVVGSFGTCLQHSVSNFMNATLSEKLFGETTLVKSRHVVMELKEKAV IFIRENATTLLHKVFNCRLVDLDLALGYCTLLPQKDVFENLWKLIDKAWQNYDKILAI SLVGSELASLYQEIEMGLKFRELSTDAQWGIRLGKLGISFQPVFRQHFLTKKDLIKAL VENIDMDTSLILEYCSTFQLDCDAVLQLFIETLLHNTNAGQGQGDASMDSAKRRHPKL LAKALEMVPLLTSTKDLVISLSGILHKLDPYDYEMIEVVLKVIERADEKITNININQA LSILKHLKSYRRISPPVDLEYQYMLEHVITLPSAAQTRLPFHLIFFGTAQNFWKILST ELSEESFPTLLLISKLMKFSLDTLYVSTAKHVFEKKLKPKLLKLTQAKSSTLINKEIT KITQTIESCLLSIVNPEWAVAIAISLAQDIPEGSFKISALKFCLYLAERWLQNIPSQD EKREKAEALLKKLHIQYRRSGTEAVLIAHKLNTEEYLRVIGKPAHLIVSLYEHPSINQ RIQNSSGTDYPDIHAAAKEIAEVNEINLEKVWDMLLEKWLCPSTKPGEKPSELFELQE DEALRRVQYLLLSRPIDYSSRMLFVFATSTTTTLGMHQLTFAHRTRALQCLFYLADKE TIESLFKKPIEEVKSYLRCITFLASFETLNIPITYELFCSSPKEGMIKGLWKNHSHES MAVRLVTELCLEYKIYDLQLWNGLLQKLLGFNMIPYLRKVLKAISSIHSLWQVPYFSK AWQRVIQIPLLSASCPLSPDQLSDCSESLIAVLECPVSGDLDLIGVARQYIQLELPAF ALACLMLMPHSEKRHQQIKNFLGSCDPQVILKQLEEHMNTGQLAGFSHQIRSLILNNI INKKEFGILAKTKYFQMLKMHAMNTNNITELVNYLANDLSLDEASVLITEYSKHCGKP VPPDTAPCEILKMFLSGLS" 3'UTR 6794..6942 BASE COUNT 2161 a 1323 c 1474 g 1984 t ORIGIN 1 ggcatggaac ctaaagacta gaggcggttg tgtgagtcag gaagaggggc cagatatctg 61 agtgttcctc tttagtttct tcaattgcag ataatatggt gtctaatttt atgttgttca 121 ggaaagacag tggttcctga ctcaggaaga cagtctcaga aacatgtgga atgatattga 181 gctgctaaca aatgatgata ccggaagtgg gtacctgagt gtcggttcaa gaaaagaaca 241 tggaactgct ttatatcaag tagatttgct agtgaagatc tcttctgaaa aggcctcatt 301 aaatccaaag atacaggcat gcagcttaag tgatgggttt attattgtag ccgaccaatc 361 agtgatattg cttgacagta tttgtagatc acttcaattg catcttgtct ttgatactga 421 agtggatgta gttggccttt gtcaagaagg aaagtttctt ttggttggcg agagaagtgg 481 caacctacat cttattcatg taacatcaaa acaaacacta ctcactaatg catttgttca 541 gaaagctaac gatgaaaatc ggcggactta ccagaatctt gtcattgaga aggatggttc 601 aaatgaaggt acctattata tgctacttct tacatacagt ggattttttt gtattacaaa 661 ccttcagctt ttaaaaattc aacaagcaat tgagaatgta gacttcagta cagcaaaaaa 721 gttacaagga caaatcaagt ccagttttat ttctactgaa aattatcata ctcttggttg 781 tctcagtctt gtggctggag atttagcaag tgaagttcct gtgataattg ggggaaccgg 841 taattgtgca ttctcaaaat gggaaccaga ttcttccaag aaaggaatga cagttaagaa 901 ccttattgat gcagagatta ttaaaggtgc aaagaagttc cagctgatag acaatctact 961 ttttgttctt gatactgata acgtgctgag tttatgggat atttacactc taactcctgt 1021 atggaactgg ccctctcttc acgtagaaga gtttcttctt actacagaag cagactctcc 1081 ttcatcagtc acgtggcaag gaattacaaa tctcaaatta atagctctga cagcttcagc 1141 taataagaag atgaaaaacc tcatggttta ttcattacct acaatggaaa tactatattc 1201 tttggaagta tctagtgttt cttctctggt ccaaacagga attagcacag ataccatata 1261 ccttttagaa ggagtttgca aaaatgatcc aaaattgtct gaagactcag tctctgtgtt 1321 agtactcaga tgtcttacgg aagctttacc agaaaacaga ttgagtcggt tacttcacaa 1381 acacagattt gctgaagctg agagttttgc cattcagttt ggactagatg ttgagcttgt 1441 ttacaaggtc aagtcaaatc atatattgga gaaactggca ttgagttctg tggatgccag 1501 tgaacagacc gaatggcaac aacttgtaga cgacgctaag gaaaatctac ataagatcca 1561 ggatgatgaa tttgtggtga attactgcct gaaagctcag tggataacct atgaaaccac 1621 tcaagagatg ctgaattatg ccaaaaccag gcttttgaag aaagaagata aaactgctct 1681 catttattct gatggcttga aagaggtgct aagagctcat gcaaaattga ctacttttta 1741 tggagcattt ggaccagaaa aattcagtgg cagttcttgg attgaatttc taaataatga 1801 agatgatctt aaagatattt ttttacagct aaaagaagga aaccttgttt gtgcacagta 1861 tctttggctt cgacatcggg caaactttga aagcagattt gatgtgaaaa tgctggagag 1921 cttgctcaac tcaatgtctg catcagtctc tttgcaaaag ctgtgtccat ggtttaaaaa 1981 tgatgtgatt ccatttgtaa gaaggactgt gcctgaagga cagataattc ttgcaaaatg 2041 gttggaacaa gcagccagga accttgaatt aactgataag gcaaattggc cagaaaatgg 2101 acttcaattg gcagagatat tttttacagc agaaaaaaca gacgagttgg gattggcatc 2161 ttcctggcat tggatttcct tgaaagatta tcagaacaca gaggaagtat gtcagctaag 2221 gactttggta aataacttgc gagagttgat cacgttgcat aggaagtaca actgcaaatt 2281 agccctctct gattttgaga aggaaaatac aaccaccata gtgttccgaa tgtttgataa 2341 agtgctggcc ccagagctta ttccctccat cttagagaag tttataagag tttacatgag 2401 agaacatgac ttgcaagagg aggaacttct cttgctgtac atagaggatt tactgaatag 2461 atgcagctca aagtccacat cactctttga aacagcatgg gaagcaaagg ccatggcagt 2521 aatagcgtgt ttatctgaca cggacctcat atttgatgcc gtgctcaaga tcatgtatgc 2581 ggcagtggtt ccttggagtg cagctgtgga gcaactggtg aaacagcacc tggaaatgga 2641 ccatcccaaa gtcaagttat tacaggaaag ttacaaacta atggagatga aaaaactttt 2701 acgaggctat ggaataagag aggtaaatct cttaaacaag gaaataatga gagtggttag 2761 atacattctc aaacaagatg tcccatcttc tttagaagat gctttaaagg tagcccaagc 2821 gtttatgtta tctgatgatg agatctacag tctaagaatt attgacctga ttgatagaga 2881 acagggtgaa gactgtctcc ttctgttgaa gtctttgcct cctgctgaag ctgagaaaac 2941 tgcagaaaga gtcatcatat gggcacgact ggcattacaa gaagagccag atcattctaa 3001 agagggcaag gcctggagaa tgtctgtagc gaagacatcc gtggacattc ttaagatact 3061 atgtgacatt cagaaagaca atctgcagaa gaaggacgaa tgtgaagaaa tgttgaaact 3121 atttaaagag gttgctagct tacaggagaa ctttgaggtc tttctttcat ttgaagatta 3181 tagcaatagt tccctggtag cagatctccg tgagcagcac attaaagctc acgaagttgc 3241 acaggcgaaa cacaaacctg ggagcacccc agagcccata gctgctgagg tgaggagccc 3301 aagcatggaa tcaaagctgc acagacaggc actggccctg cagatgtcca aacaagagct 3361 ggaggcagag ctgaccttga gagccttaaa agatgggaac atcaaaacag cactgaaaaa 3421 atgcagcgac ttgtttaagt atcactgcaa tgctgacact gggaaattgc tatttctgac 3481 atgtcagaag ctttgtcaga tgttggctga taatgtccca gtgacagtgc ctgtgggact 3541 gaatcttcct tccatgatac atgatctagc aagccaagct gccaccattt gcagtccaga 3601 ttttttacta gatgctttag aactatgtaa acatacttta atggctgtag agctttccag 3661 acaatgccaa atggatgact gtggaatcct catgaaagct tcttttggga cacataaaga 3721 tccatatgaa gagtggtctt acagtgactt cttcagtgaa gatggaattg ttcttgagtc 3781 acagatggtg cttccagtga tttatgaact gatttcatct cttgtgcctc tagctgaaag 3841 caagagatat cccttggagt ctaccagttt gccatactgc tcccttaatg aaggagatgg 3901 ccttgtttta cctgttataa attccatctc tgccctgctt cagaatcttc aggaatctag 3961 ccagtgggag ctagccctaa gatttgtggt tggttcattt ggtacctgtc ttcagcactc 4021 tgtgtcaaac ttcatgaatg ccactttgag tgaaaagtta tttggagaga ctacattagt 4081 taaatcaagg catgttgtta tggaattgaa agaaaaagct gttatattta tcagggaaaa 4141 tgctacaaca ctactgcaca aagtatttaa ttgtcgcttg gtagatcttg acctggcgtt 4201 gggttactgc actctcttac ctcaaaaaga tgtgtttgaa aatctctgga agctcataga 4261 taaagcatgg cagaattacg acaaaatctt ggcaatatct ctggtgggct ctgagctggc 4321 aagtctctat caggaaatag aaatggggct taagttccgt gaactcagta ctgatgccca 4381 gtggggcatt cgtcttggta aacttggtat ttcttttcaa ccagttttca ggcaacattt 4441 tctcaccaag aaagacctca ttaaagctct tgtggagaat atagatatgg acacaagcct 4501 cattttggaa tattgcagca catttcagtt ggactgcgat gcagttcttc agctcttcat 4561 tgaaacgctg ctccacaaca caaatgccgg ccaaggccag ggagatgcaa gcatggactc 4621 tgcaaagcgg cggcatccca aactcctggc caaagccctt gagatggttc ctttactgac 4681 gagcacaaaa gatttggtca tcagtcttag tggaatacta cataagctgg atccttatga 4741 ctatgaaatg attgaagttg tcttgaaagt tatagaacga gctgatgaaa agataaccaa 4801 tattaatatt aatcaggcat tgagtattct gaaacatttg aagtcataca gaagaatttc 4861 tcctcccgtg gatctagaat atcagtatat gttggaacat gtcataactt tgccatcagc 4921 tgcccaaact agactgcctt ttcacctgat attctttggc acagcacaga acttctggaa 4981 aattctctct acagaactca gtgaagaatc tttcccaaca ttgctcttaa tttcgaaatt 5041 aatgaagttc tctctggaca ctctgtacgt gtctacagca aaacacgttt tcgaaaaaaa 5101 actgaagcca aagctcctga agttaacaca agctaaatcc tcaacactga ttaacaagga 5161 aataactaag atcacgcaga ccatcgaatc ctgcttactc tctatagtca acccagagtg 5221 ggctgtagct attgccatca gccttgccca ggatatccct gaaggttcct tcaagatatc 5281 tgctttgaaa ttctgccttt atttagctga gagatggcta cagaatatcc catcgcagga 5341 cgaaaaacgt gaaaaagccg aggctttgtt gaagaagctt catatccagt accggcgatc 5401 gggcacagaa gctgtgctca tagcccacaa gctgaacact gaggaatatt taagagtgat 5461 cggaaagcca gcacatctta ttgtcagtct ctacgaacat cctagcatca atcaaagaat 5521 tcagaattca tctggcacag attatcctga tattcatgca gcagctaaag aaatagccga 5581 agtcaatgaa attaatttgg aaaaagtctg ggacatgttg ttggaaaaat ggctatgccc 5641 ttcaacaaaa cctggtgaaa aaccatcaga attatttgaa cttcaagaag atgaagccct 5701 acgaagagtg cagtatctcc tcctgtctcg tccaattgat tatagttcaa gaatgctgtt 5761 tgtatttgca acatcaacta caaccacatt aggtatgcat cagttaactt ttgcccatag 5821 aactcgagct cttcagtgtc tcttctattt ggctgacaag gaaactatag aatctctctt 5881 taaaaaaccc attgaagaag tgaaatctta tttgagatgt ataacttttc tggcatcatt 5941 tgagactttg aatatcccca tcacatatga attattttgc agcagtccta aagaaggaat 6001 gattaagggt ctgtggaaaa accacagcca cgagtccatg gcagtaagat tggtgactga 6061 gctgtgttta gaatacaaaa tctatgacct gcagctttgg aatggactct tgcaaaagct 6121 tctgggcttc aatatgattc cttatctaag gaaagtttta aaagccatct ccagtatcca 6181 ttctttatgg caggttccct acttcagcaa agcgtggcag cgtgtgatac agataccact 6241 gctttcagcc tcttgtcctt taagtcctga tcagctgtca gattgttctg agagtctcat 6301 cgctgtcctc gaatgtccag tctcaggtga tcttgacctg atcggagtcg ccaggcagta 6361 tatccagtta gaacttccgg cttttgcatt agcttgtctg atgctcatgc cccactcaga 6421 gaaaagacac cagcaaatta agaattttct gggttcctgt gaccctcagg ttattttaaa 6481 gcaattggaa gagcatatga acacgggcca gctagcagga ttttcacatc aaattagaag 6541 tctgattttg aataatatca tcaataagaa ggagtttggg attttggcaa agaccaaata 6601 ctttcaaatg ttgaagatgc atgcgatgaa taccaacaat atcactgagc tagtgaacta 6661 tttggcaaat gacttaagtt tagatgaagc ttcagtcttg ataactgaat attcaaagca 6721 ctgcgggaaa cctgtgcctc cagacactgc tccctgtgaa attctgaaga tgtttcttag 6781 tggattatcg taaatcactg aacctttttt tcaagaagga caagaatttt ggagtctgct 6841 attaatggac catatttatt acagttttta aattgtacaa tctctgtatt atagctattt 6901 gtctaacatt accccacatg taataaataa aacaatatga gc // LOCUS D79989 3950 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0167 gene, complete cds. ACCESSION D79989 NID g1531538 KEYWORDS KIAA0167. SOURCE Homo sapiens male myeloblast cell_line:KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3950) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (12-DEC-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 3950) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA 0161 - KIAA 0200) deduced by analysis of cDNA clones from human cell line KG1 JOURNAL DNA Research (1995) In press REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Tanaka,A. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA0161-KIAA0200) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 3 (1), 17-24 (1996) MEDLINE 96281124 FEATURES Location/Qualifiers source 1..3950 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" misc_feature 1..3950 /note="the KIAA0167 gene is located in human chromosome 12." 5'UTR 1..88 gene 89..2599 /gene="KIAA0167" CDS 89..2599 /gene="KIAA0167" /note="the KIAA0167 protein has similarity to KIAA0041 and KIAA0050 protein." /citation=[3] /codon_start=1 /product="KIAA0167 protein" /db_xref="PID:d1012150" /db_xref="PID:g1531539" /translation="MHAQRQFVVAAVRAEVRRHEVAKQALNRLRKLAERVDDPELQDS IQASLDSIREAVINSQEWTLSRSIPELRLGVLGDARSGKSSLIHRFLTGSYQVLEKTE SEQYKKEMLVDGQTHLVLIREEAGAPDAKFSGWADAVIFVFSLEDENSFQAVSRLHGQ LSSLRGEGRGGLALALVGTQDRISASSPRVVGDARARALCADMKRCSYYETCATYGLN VDRVFQEVAQKVVTLRKQQQLLAACKSLPSSPSHSAASTPVAGQASNGGHTSDYSSSL PSSPNVGHRELRAEAAAVAGLSTPGSLHRAAKRRTSLFANRRGSDSEKRSLDSRGETT GSGRAIPIKQSFLLKRSGNSLNKEWKKKYVTLSSNGFLLYHPSINDYIHSTHGKEMDL LRTTVKVPGKRPPRAISAFGPSASINGLVKDMSTVQMGEGLEATTPMPSPSPSPSSLQ PPPDQTSKHLLKPDRNLARALSTDCTPSGDLSPLSREPPPSPMVKKQRRKKLTTPSKT EGSAGQAEEENFEFLIVSSTGQTWHFEAASFEERDAWVQAIESQILASLQCCESSKVK LRTDSQSEAVAIQAIRNAKGNSICVDCGAPNPTWASLNLGALICIECSGIHRNLGTHL SRVRSLDLDDWPRELTLVLTAIGNDTANRVWESDTRGRAKPSRDSSREERESWIRAKY EQLLFLAPLSTSEEPLGRQLWAAVQAQDVATVLLLLAHARHGPLDTSVEDPQLRSPLH LAAELAHVVITQLLLWYGADVAARDAQGRTALFYARQAGSQLCADILLQHGCPGEGGS AATTPSAATTPSITATPSPRRRSSAASVGRADAPVALV" 3'UTR 2600..3950 BASE COUNT 798 a 1216 c 1238 g 698 t ORIGIN 1 aaggggcctt ctgaggtttg ggggctgtag ggccatgggc ctcagggcca gaggtggttg 61 ttagcctggc aagacaggtc tgggcaacat gcatgcccag aggcagttcg ttgtagctgc 121 agtgagagca gaagtcagac gacatgaggt ggccaagcag gctctaaacc gcctcaggaa 181 gctggcagag agggtggacg accccgaact ccaggacagc atccaggcct cattggacag 241 cattcgagag gctgtgatca atagccagga atggactttg agccgctcca ttcctgaact 301 gcgcctgggt gtgctgggcg atgccaggag tgggaagtca tcgctcatcc accgattcct 361 gactggctca taccaggtgc tggagaagac agagagtgag cagtacaaga aagaaatgtt 421 ggtggatgga cagacacatc tggtgctaat ccgagaggaa gctggggcac ctgatgccaa 481 gttctcaggc tgggcagatg ctgtgatctt cgtcttcagc ctggaggatg agaacagttt 541 ccaggctgtg agccgtctcc atgggcagct gagttccctt cgcggggagg gacgaggagg 601 cctggccttg gcactggtgg ggacacaaga caggatcagt gcttcctccc ctcgggtggt 661 gggagatgct cgtgccagag ctctgtgcgc ggacatgaaa cgctgcagct actatgagac 721 ttgtgcaacc tatgggctca atgtggatcg ggtcttccag gaggtggccc agaaggtggt 781 gaccttgcgc aagcagcaac agcttctggc tgcctgcaag tccctgccca gctccccaag 841 ccactcagct gcatccactc cggtagctgg ccaggctagt aacgggggcc acactagcga 901 ctactcttct tccctcccgt cctcaccgaa tgttggtcac cgggagctcc gagccgaggc 961 agctgcagtg gctggattga gcaccccagg gtccctgcac cgggcagcca agcgcaggac 1021 cagccttttt gcgaatcgtc ggggtagtga ctccgagaaa cgaagcttgg atagtcgggg 1081 agagacaaca gggagtgggc gagccatccc catcaaacag agcttcctac taaaacgaag 1141 tggcaattcc ttgaacaaag aatggaagaa gaaatatgta accctgtcca gtaatggctt 1201 tctactctac caccccagta ttaacgatta catccacagt acccacggca aggagatgga 1261 cttgctgcga acaacagtca aagtcccggg caagcggccc ccgagggcca tctctgcctt 1321 tggcccctca gccagcatta acgggctcgt caaggacatg agcactgtcc agatgggtga 1381 aggcctggaa gccactactc ccatgccaag ccctagcccc agccccagtt ccctgcagcc 1441 accaccagat cagacatcca aacacctgct gaagccagac cggaatttgg cccgagccct 1501 cagcacggac tgtaccccat ctggagacct gagccccctg agtcgggaac cccctccttc 1561 tcccatggtg aagaagcaga ggaggaaaaa attgacaaca ccatccaaga ctgaaggctc 1621 ggctgggcag gctgaagagg aaaactttga gttcctgatc gtgtccagca cgggtcagac 1681 gtggcacttt gaggcagcca gttttgagga gcgggatgcc tgggtccagg ccatcgagag 1741 tcagatccta gccagtctgc aatgctgtga gagcagcaag gtcaagctgc gcacagacag 1801 ccaaagcgag gccgtggcca tccaggcgat ccggaacgcc aaggggaatt caatctgcgt 1861 ggactgcggg gcccccaacc ccacgtgggc cagcttgaac ctgggcgccc tcatctgcat 1921 cgagtgttct ggcatccacc gcaacctggg cacacacctg tcccgcgttc gctcgctgga 1981 cttggacgac tggccacggg agctgaccct ggtgctgacg gctattggca acgacacggc 2041 caaccgcgtg tgggaaagcg acacgcgagg ccgtgccaag ccctcgcggg actcttcgcg 2101 ggaggagcgc gagtcgtgga ttcgcgccaa gtacgagcag ctactgttcc tggcgccgct 2161 gagcacctcg gaggagccgc tgggccgcca gctgtgggcc gccgtgcagg cccaggacgt 2221 ggctaccgtt ctcctgcttt tggcccatgc gcgacacggg ccgctcgaca ccagcgtaga 2281 ggacccacag ctgcgctccc cactccacct ggcggccgag ctcgcccacg tcgtcatcac 2341 gcaactgctg ctgtggtacg gcgcggacgt ggcggcccgt gacgcccagg gccgcacggc 2401 gctgttctac gcccgccagg ctggaagcca gctgtgcgcc gacatccttc tccagcacgg 2461 ctgcccgggt gagggcggca gcgcggccac cacgcccagc gcggccacca cgcccagcat 2521 caccgccacg cccagccccc gccgccggag cagcgccgct agcgtgggcc gcgccgacgc 2581 cccggttgcg ctggtatagt tgcccagcgg gagagacacc cccatcccca cgcgggccgg 2641 gcacgaccac accgggcgga ccgctggaca gacgcaccca ctcacctctc cgatccgcac 2701 cccgccccac gggagcactt cctaccccca cgagggcaca gcccccacgc cttccagaag 2761 acacaaactc acctccctct ccccaacgag gcatggagac cccagctcaa cacgccccct 2821 ctccattgtt tccacactcg gattgctctg ggtctacccg ctcggggttc gcggaagggg 2881 aggtccccag cggtacgggc tcctaggcgc ttggacatgc ccgcttgtgc cccctagggg 2941 cggaaagaag acccggtcct gcccgagctc aacaatccca gggcgggagc ccatacccag 3001 gctggctcct ggggcgacgc cacgccagag ggagggttta gtacatggga gggctagccc 3061 cgggacttgg ggccaatacg gaaaccctac cccttccatg ctcatctcac tgcccagtga 3121 agggggcaag ggcagcgagg ggcaggaccc ctgctgctca tgggggtggg ggttcccaac 3181 cctttccgtg aaagggaccg tgagtagtgg aaaccaggac gtcccctaca cctacccatg 3241 ggtggaagct aaagagtccg gggggaacca agagcctggg tccctcccac tactatcttg 3301 ggagggagaa gggatggagc aggagcgggc taagagggct gcgggggcct tgtaatttat 3361 tgctttgttc cccaacatgg ggatgggggc gggcaggggc gtggggaggg cgttttgggg 3421 taggggaagc ccagggtggg cagggggtgg gttggggtgg gggtgctctc gggttgtgtc 3481 tgtgaccgtg actgcgtacc tgtgactgtg cagcgcccgt ggtgatctgg ggtagggggc 3541 acccctacag tgggacccct cccccattat tctttctgtc cagcccctcc ctcccactgg 3601 agcagctcca gagccattcc tcacccccgt gacctctccc agccagggct gagaggattc 3661 gaggtggctg ggagggtttc caggccccct gccccttccc cacaaactgg gtgatagggc 3721 ggacttacct ggcccagccc tgcctccctc agccaaccca gcccttgggc tgtctgtgtc 3781 agtggtttcc ggtctttttt tttttttttt tttctggcta aaatagtttg caaaggacca 3841 ggtaattggg gaggggagag aggtgggggc aagggggaaa tgccccccca tctccttgga 3901 ggcagtggtg tgaatcttct tcaacagcaa ttaaagagga agtgattttg // LOCUS D79990 5426 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0168 gene, complete cds. ACCESSION D79990 NID g1136395 KEYWORDS KIAA0168. SOURCE Homo sapiens male myeloblast cell-line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5426) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (12-DEC-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 5426) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA 0161 - KIAA 0200) deduced by analysis of cDNA clones from human cell line KG1 JOURNAL DNA Research (1995) In press REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Tanaka,A. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA0161-KIAA0200) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 3 (1), 17-24 (1996) MEDLINE 96281124 FEATURES Location/Qualifiers source 1..5426 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /chromosome="20" /sex="male" 5'UTR 1..196 gene 197..1177 /gene="KIAA0168" CDS 197..1177 /gene="KIAA0168" /citation=[3] /codon_start=1 /db_xref="PID:d1012151" /db_xref="PID:g1136396" /translation="MDYSHQTSLVPCGQDKYISKNELLLHLKTYNLYYEGQNLQLRHR EEEDEFIVEGLLNISWGLRRPIRLQMQDDNERIRPPPSSSSWHSGCNLGAQGTTLKPL TVPKVQISEVDAPPEGDQMPSSTDSRGLKPLQEDTPQLMRTRSDVGVRRRGNVRTPSD QRRIRRHRFSINGHFYNHKTSVFTPAYGSVTNVRINSTMTTPQVLKLLLNKFKIENSA EEFALYVVHTSGEKQKLKATDYPLIARILQGPCEQISKVFLMEKDQVEEVTYDVAQYI KFEMPVLKSFIQKLQEEEDREVKKLMRKYTVLRLMIRQRLEEIAETPATI" 3'UTR 1178..5426 BASE COUNT 1435 a 1259 c 1383 g 1349 t ORIGIN 1 ggggaggaag aaaggcgaag gcaaggcgaa ggggtggaga gtgatatgaa gagcgagaga 61 aaagagagga cagcggacga gcagatccgg tatctggaat cccggcgcct agaacgtgtt 121 tttcgggaga gcaaaggctg tgtctacggc aggctgggga tatagcctct ccttccgatg 181 aaaagagaaa ggaagaatgg actacagcca ccaaacgtcc ctagtcccat gtggacaaga 241 taaatacatt tccaaaaatg aacttctctt gcatctgaag acctacaact tgtactatga 301 aggccagaat ttacagctcc ggcaccggga ggaagaagac gagttcattg tggaggggct 361 cctgaacatc tcctggggcc tgcgccggcc cattcgcctg cagatgcagg atgacaacga 421 acgcattcga ccccctccat cctcctcctc ctggcactct ggctgtaacc tgggggctca 481 gggaaccact ctgaagcccc tgactgtgcc caaagttcag atctcagagg tggatgcccc 541 gccggagggt gaccagatgc caagctccac agactccagg ggcctgaagc ccctgcagga 601 ggacacccca cagctgatgc gcacacgcag tgatgttggg gtgcgtcgcc gtggcaatgt 661 gaggacgcct agtgaccagc ggcgaatcag acgccaccgc ttctccatca acggccattt 721 ctacaaccat aagacatccg tgttcacacc agcctatggc tctgtcacca acgtccgcat 781 caacagcacc atgaccaccc cacaggtcct gaagctgctg ctcaacaaat ttaagattga 841 gaattcagca gaggagtttg ccttgtacgt ggtccatacg agtggtgaga aacagaagct 901 gaaggccacc gattacccgc tgattgcccg aatcctccag ggcccatgtg agcagatctc 961 caaagtgttc ctaatggaga aggaccaggt ggaggaagtc acctacgacg tggcccagta 1021 tataaagttc gagatgccgg tacttaaaag cttcattcag aagctccagg aggaagaaga 1081 tcgggaagta aagaagctga tgcgcaagta caccgtgctc cggctaatga ttcgacagag 1141 gctggaggag atagccgaga ccccagcaac aatctgagcc atgagaacga ggggatctgg 1201 gcaccccagg aaccgccatt gcccataaga cccccaggaa gctaggcact ttctttccat 1261 ggaaacattt agacacaaac ctccccagct ccggccaagc catcatttgc tacctggagc 1321 tggatgtaga agtcagcaga cagctcccta tccctggacc cctgccctcc ttttttctgc 1381 tcacaaggac ttttgatttt agttataagg aggacccaaa atgtgtgtgt gtacatgtgt 1441 gtgcacacat ggtacgtgtc catgtgccta cctgatactt tcacatgtaa ttaaattcca 1501 ggcaaccagc acaagagccg tgagcttggc acatgtgctg ctcgtgagca ggaaaatcag 1561 aggagccact gatctgagtg gtatttaggt tgaaggaaag atttctcctc tcaagtgcca 1621 gggagcagcc acacgtctgt ctgtgtttag agagggaaga gggttctcca ggttcaccat 1681 ttgggttgtt tatatgttgg tagaaattct ccctgtatgc ctagaaggat cagtgaatgt 1741 aagagccttg gaaattaaca aaataacagc cacataacct tgcggcaagt ctgatggaaa 1801 gaaaaagata aaccatccgt ggggtagatg caataagccc acgtattttt acactggaaa 1861 cgttgattgt tttaaatgac aaagacatat gtgatgttct atgtggaaac ctgtgaagag 1921 tggattctgc ctccatctct gcctccatgg ctacctttag gagacagaga agatcctgtg 1981 tgtttctctg tacccagctg acagcctgtc tctatggcgc ttccttgagt ggaaggaaat 2041 gtctcaagaa acaaagatct cgctggtgcg tacacagtgc tgaccagcta gtgtggccag 2101 ggcctggtgg cctggtggcc aggaagtttc aggttgaagg gaaatgtcga ggctacctgc 2161 agatatgaca ggtgccttga acgcagccca tcttcatgtc atcaaaggtc ttcctgcact 2221 tgaagctggg gcgatgtttg cagtcaagac cattctttcc aacctctggg ttcttgcaag 2281 ttgccctcac cttgtgtgtg gagatgcatt ccaagaatga agcctcatct tgctactgag 2341 tgtggggttc agggaagctc tttaggccac ctggtgaagg tgcatgggga ggatggagct 2401 tctcctcagc tcctctgagc agccacctat gtgatcttta aatccaaccc caatgggaga 2461 aaagggcaag aacagtctgt gccctgggac tcctatcagg aagcttgaca ggcagctggg 2521 catcagtgca gctgatatcg tttgaggagg gagacagatg cttggacctg ggtgcctggc 2581 tatggagatt gaccaagcaa gatcaggagc tcctgatagc aggcgtcttt gagcctagct 2641 ggggtagagg cactgcccat ctcttctcca ccttctctcc acagaatgtt tgcagagctg 2701 ggcagttgag gaaaggacag cccctggttg gtgcctccaa aggaaggtgg acttttttgg 2761 tggagacgtt tctgccctgg gcaccctcct gcccccgatt catacctatg gcttcttgag 2821 aaggctcaca gctgtggtct taacgtagac tgcagaaaga tggcatgcgg cccctggcat 2881 ttcgccaagg gttttatagc aagtctcctt cctccatagg gacagcagca ccagccctgt 2941 ggggcatgga gtggaagccc agaagggctt ctgcaagctg cacagaactg gggtaagaag 3001 acaaagagta gccaccggga gaggcttcct ttgttacagc tgggaaagaa cagttctgtg 3061 aatgcaaaca cctcctgagt tttgcaattg agaaaatgat ttggagaact tctcttctgg 3121 taatttttat tttgaatgtt cagggcctta gttggcccca gtaattctcc ttggaggact 3181 tgggagaaga atttccacaa agcaaactac taaccactag ctcttactgg acagcgattt 3241 ctggcttata agagttctct ttgatttgca ctagcactac gatagtgtta gatggggaaa 3301 tactgcaaca tgtccagttg gccagatcac tttccaaggg agcgatacta aggcagactc 3361 agctttttaa agatgggagg tcaggaggtg gaagtgagag gagatcccat ctcacacaac 3421 acacttccac gtaatgcaga ccacactttt ccattttgtc ctgccctctt gagaggtcat 3481 ttctcacgtc ctaagaacct gatcagaaat tttggaaggg ttctttgaaa tagcagcagt 3541 tgaaacagag acactttgcc acagtgtgga gcagattttc tcactggtat cacatggtct 3601 tgcagttttg aactcttcga ccgatttgtg ggagtttatg taattgcgtg caatgaacct 3661 gaaattgtgt aaaggacaaa agaccagttt atagggttgg gttttttttc caacttgtga 3721 aaagcagttt agctgcatct gtctccccac cacccccacc ccgggagggg cttatgttac 3781 aaggtgatca agtgaaggaa aaacctgagc ctatctggct gggatggtgg aattaagcac 3841 aaggtcacat tctctgtgat cacatgagag ggaaggtgat gacttaaatg gcagggggtg 3901 gggattatct tggggagagg ctgaaaagca caaaagatag tcttccctgt acgtattggt 3961 gaagaacgtg cacaaggctg gatggacttc aacttggagt tgagttgagg caagaggatt 4021 tctggatatt agtcacccat ctgcaagaaa aatgctgagg cctcgggtca agattttgat 4081 ctgagacatg ctgatgcttc aaggagaaat attttcacaa tcctctcttc cctcaccaga 4141 agagaacagt actctctcct agaaacctct aggtaaacac attttatcct aatatcggta 4201 gcatataatg ccccccccaa aatatctgtt ttccatgcaa aaaagtctca acaagaagtc 4261 tgtggagttg agtggttact tcaaagtgtc aggagagtga agaaattggc cacagaagag 4321 caagaagctc tcttaagaaa agggaattct ctttaaagaa accaccacca acaacaaaac 4381 aaccaaaaac catgttttat gtcaaagctc tgtagcacag agaatgtggt gtcacagata 4441 catcgccgag agaggtttct ttctttcttt tttttttttt tgagacagag tctggttctg 4501 tttcccaggc tggagtgcag tggtgggatc tcagctcact gcaacatccg cctctggggt 4561 tcaagtgatt ctcctgtctc agcctcccaa gtagctggaa ttacagggac ccgccaccac 4621 gcccggctaa tttttttgtg tggttttagt agaggtgggg tttcaccatc ttggccaggc 4681 tggtcttgaa ctcctgacct cgtgatccac ccgcctaggc ctcccaaagt gttgggatta 4741 caggcgtgag ccactgtgcc cagccaaaag agaaatttct acatgaacaa ggcaatttca 4801 gtgtcttaca gcggccaaac catgacgtga agaatgagat aggagacagg agatcaccat 4861 aagcgtccct gatatagcag cacacatttt cacgtttcca cttaaatcgt tttgcacaaa 4921 gtcttgcttc gctcagatga gatgagatat gatttcctag agatgtaaaa ataagaatga 4981 atgtggcgcc cccttcttcc agatgtaata gaaagctctg ccctatcaca aggggggtgt 5041 tgaagcgccc cttgtgtttt aactgtattt aactgagcac aagatgcaca agctgtggtg 5101 ggaaaccctc agtttacctt tggagtcttc cctgcagatc gcagacctgt ttccaggctg 5161 atgtttctgg tgtgtaattg ctagcgtttc tgaagggttt tcccaattgt tttagccttg 5221 tgaagtattc ttaattataa cttgcctttc agcgatggta catgacttga ttcaacgttt 5281 ggttctgaac ttacacactg atgcgtttac tcatctaaca taatctgaca gggcctcagc 5341 aagggagcca tacatttttg taacattttg atatgtttta atgcatctga cttagatctt 5401 actgaaataa agcacttttc aaagag // LOCUS D79992 6940 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0170 gene, complete cds. ACCESSION D79992 NID g1136399 KEYWORDS KIAA0170. SOURCE Homo sapiens male myeloblast cell-line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6940) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (12-DEC-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 6940) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA 0161 - KIAA 0200) deduced by analysis of cDNA clones from human cell line KG1 JOURNAL DNA Research (1995) In press REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Tanaka,A. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA0161-KIAA0200) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 3 (1), 17-24 (1996) MEDLINE 96281124 FEATURES Location/Qualifiers source 1..6940 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /chromosome="6" /sex="male" 5'UTR 1..13 gene 14..6283 /gene="KIAA0170" CDS 14..6283 /gene="KIAA0170" /note="similar to Drosophila photoreceptor cell-specific protein, calphotin." /citation=[3] /codon_start=1 /db_xref="PID:d1012153" /db_xref="PID:g1136400" /translation="MEDTQAIDWDVEEEEETEQSSESLRCNVEPVGRLHIFSGAHGPE KDFPLHLGKNVVGRMPDCSVALPFPSISKQHAEIEILAWDKAPILRDCGSLNGTQILR PPKVLSPGVSHRLRDQELILFADLLCQYHRLDVSLPFVSRGPLTVEETPRVQGETQPQ RLLLAEDSEEEVDFLSERRMVKKSRTTSSSVIVPESDEEGHSPVLGGLGPPFAFNLNS DTDVEEGQQPATEEASSAARRGATVEAKQSEAEVVTEIQLEKDQPLVKERDNDTKVKR GAGNGVVPAGVILERSQPPGEDSDTDVDDDSRPPGRPAEVHLERAQPFGFIDSDTDAE EERIPATPVVIPMKKRKIFHGVGTRGPGAPGLAHLQESQAGSDTDVEEGKAPQAVPLE KSQASMVINSDTDDEEEVSAALTLAHLKESQPAIWNRDAEEDMPQRVVLLQRSQTTTE RDSDTDVEEEELPVENREAVLKDHTKIRALVRAHSEKDQPPFGDSDDSVEADKSSPGI HLERSQASTTVDINTQVEKEVPPGSAIMHIKKHQVSVEGTNQTDVKAVGGPAKLLVVS LEEAWPLHGDCETDAEEGTSLTASVVADVRKSQLPAEGDAGAEWAAAVLKQERAHEVG AQGGPPVAQVEQDLPISRENLTDLVVDTDTLGESTQPQREGAQVPTGREREQHVGGTK DSEDNYGDSEDLDLQATQCFLENQGLEAVQSMEDEPTQAFMLTPPQELGPSHCSFQTT GTLDEPWEVLATQPFCLRESEDSETQPFDTHLEAYGPCLSPPRAIPGDQHPESPVHTE PMGIQGRGRQTVDKVMGIPKETAERVGPERGPLERETEKLLPERQTDVTGEEELTKGK QDREQKQLLARDTQRQESDKNGESASPERDRESLKVEIETSEEIQEKQVQKQTLPSKA FEREVERPVANRECDPAELEEKVPKVILERDTQRGEPEGGSQDQKGQASSPTPEPGVG AGDLPGPTSAPVPSGSQSGGRGSPVSPRRHQKGLLNCKMPPAEKASRIRAAEKVSRGD QESPDACLPPAVPEAPAPPQKPLNSQSQKHLAPPPLLSPLLPSIKPTVRKTRQDGSQE APEAPLSSELEPFHPKPKIRTRKSSRMTPFPATSAAPEPHPSTSTAQPVTPKPTSQAT RSRTNRSSVKTPEPVVPTAPELQPSTSTDQPVTSEPTSQVTRGRKSRSSVKTPETVVP TALELQPSTSTDRPVTSEPTSQATRGRKNRSSVKTPEPVVPTAPELQPSTSTDQPVTS EPTYQATRGRKNRSSVKTPEPVVPTAPELRPSTSTDRPVTPKPTSRTTRSRTNMSSVK TPETVVPTAPELQISTSTDQPVTPKPTSRTTRSRTNMSSVKNPESTVPIAPELPPSTS TEQPVTPEPTSRATRGRKNRSSGKTPETLVPTAPKLEPSTSTDQPVTPEPTSQATRGR TNRSSVKTPETVVPTAPELQPSTSTDQPVTPEPTSQATRGRTDRSSVKTPETVVPTAP ELQASASTDQPVTSEPTSRTTRGRKNRSSVKTPETVVPAAPELQPPTSTDRPVTPEPT SRATRGRTNRSSVKTPESIVPIAPELQPSTSRNQLVTPEPTSRATRCRTNRSSVKTPE PVVPTAPEPHPTTSTDQPVTPKLTSRATRRKTNRSSVKTPKPVEPAASDLEPFTPTDQ SVTPEAIAQGGQSKTLRSSTVRAMPVPTTPEFQSPVTTDQPISPEPITQPSCIKRQRA AGNPGSLAAPIDHKPCSAPLEPKSQASRNQRWGAVRAAESLTAIPEPASPQLLETPIH ASQIQKVEPAGRSRFTPELQPKASQSRKRSLATMDSPPHQKQPQRGEVSQKTVIIKEE EEDTAEKPGKEEDVVTPKPGKRKRDQAEEEPNRIPSRSLRRTKLNQESTAPKVLFTGV VDARGERAVLALGGSLAGSAAEASHLVTDRIRRTVKFLCALGRGIPILSLDWLHQSRK AGFFLPPDEYVVTDPEQEKNFGFSLQDALSRARERRLLEGYEIYVTPGVQPPPPQMGE IISCCGGTYLPSMPRSYKPQRVVITCPQDFPHCSIPLRVGLPLLSPEFLLTGVLKQEA KPEAFVLSPLEMSST" 3'UTR 6284..6940 BASE COUNT 1866 a 1938 c 1750 g 1386 t ORIGIN 1 cttaaagtag atcatggagg acacccaggc tattgactgg gatgttgaag aagaggagga 61 gacagagcaa tccagtgaat ccttgaggtg taacgtggag ccagtagggc ggctacatat 121 ctttagtggt gcccatggac cagaaaaaga tttcccacta cacctcggga agaatgtggt 181 aggccgaatg cctgactgct ctgtggccct gccctttcca tctatctcca aacaacatgc 241 agagattgaa atcttagcct gggacaaggc acctatcctc cgagactgtg ggagccttaa 301 tggtactcaa atcctgagac ctcctaaggt tttgagccct ggggtgagtc accgtctgag 361 ggaccaggaa ttgattctct ttgctgactt gctctgccag taccatcgcc tggatgtctc 421 tctgcccttt gtctcccggg gccctctgac agtagaagag acacccagag tacagggaga 481 aactcaaccc cagaggcttc tgttggctga ggactcggag gaggaagtag attttctttc 541 tgaaaggcgt atggtaaaaa aatcaaggac cacatcttcc tctgtgatag ttccagagag 601 tgatgaagag gggcattccc cggtcctggg cggccttggg ccgccttttg ccttcaattt 661 gaacagtgac acagatgtgg aagaaggtca gcaaccagcc acagaggagg cctcctcagc 721 tgccagaaga ggtgccactg tagaggcaaa gcagtctgaa gctgaagttg taactgaaat 781 ccagcttgaa aaggatcagc ctttagtgaa ggagagggac aatgatacaa aagtcaagag 841 gggtgcaggg aatggggtgg ttccagctgg ggtgattctg gagaggagcc aacctcctgg 901 agaggacagt gacacagatg tggatgatga cagcaggcct cctggaaggc cagctgaggt 961 ccatttggaa agggctcagc cttttggctt catcgacagc gacactgatg cggaagaaga 1021 gaggatccca gcaaccccag ttgtcattcc tatgaagaag aggaagatct tccatggagt 1081 aggtacaagg ggtcctggag caccaggcct ggcccatctg caggagagcc aggctggtag 1141 tgatacagat gtggaagaag gcaaggcccc acaggctgtc cctctggaga aaagccaagc 1201 ttccatggtt atcaacagcg atacagatga cgaggaagaa gtctcagcag cgctgacttt 1261 ggcacatctg aaagagagcc agcctgctat atggaacaga gatgcagaag aggacatgcc 1321 ccaacgtgtg gtccttctgc agcgaagcca aaccaccact gagagagaca gtgacacaga 1381 cgtggaggag gaagagctcc cagtggaaaa tagagaagct gtcctcaagg atcacacaaa 1441 gattagagcc cttgttagag cacattcaga aaaggaccaa cctccttttg gggacagtga 1501 tgacagtgtg gaagcagata agagctcacc tgggatccac ctggagagaa gccaagcctc 1561 caccacagtg gacatcaaca cacaagtgga gaaggaagtc ccgccagggt cagccattat 1621 gcatataaag aagcatcagg tgtctgtgga ggggacaaat caaacagatg tgaaagcagt 1681 tgggggacca gcaaagctgc ttgtggtatc tctagaggaa gcctggcctc tgcatgggga 1741 ctgtgaaaca gatgcagagg agggcacctc cctaacagcc tcagtagttg cagatgtaag 1801 aaagagccag cttccagcag aaggggatgc tggggcagag tgggctgcag ctgttcttaa 1861 gcaggagaga gctcatgagg tgggggccca gggtgggcca cctgtggcac aagtggagca 1921 ggacctccct atctcaagag agaacctcac agatctggtg gtggacacag acactctagg 1981 ggaatccacc cagccacaga gagagggagc ccaggtcccc acaggaaggg agagagaaca 2041 acatgtgggt gggaccaagg actctgaaga caactatggt gattctgaag atctggacct 2101 acaagctacc cagtgctttc tggagaatca gggcctggaa gcagtccaga gcatggagga 2161 tgaacctacc caggccttca tgttgactcc accccaagag cttggccctt cccattgcag 2221 cttccagaca acaggtaccc tagatgaacc atgggaggtc ctggctacac agccattctg 2281 tctgagagag tctgaggact ctgagaccca gccttttgac acgcaccttg aggcctatgg 2341 accttgcctg tctccaccta gggcaatacc aggagaccaa catccagaga gcccagttca 2401 cacagagcca atggggattc aaggcagagg gaggcagact gtggataaag tcatgggtat 2461 accaaaagaa acagcagaga gggtgggccc tgagagaggg ccattggaga gagaaactga 2521 gaaactgcta ccagaaagac agacagatgt gacaggagag gaagaattaa ccaaggggaa 2581 acaggacaga gaacaaaaac agttgttagc tagagacacc cagagacaag aatctgacaa 2641 aaatggggaa agtgcaagtc ctgaaagaga tagggagagt ttgaaggtag aaattgagac 2701 atctgaggaa atacaagaga aacaagtaca gaagcagacc cttccaagca aagcatttga 2761 gagagaagta gagagaccag tagcaaacag agagtgcgat ccagccgagt tagaagagaa 2821 ggtgcccaaa gtgatcctgg agagagatac acagagaggg gagccagagg gagggagcca 2881 ggaccagaaa gggcaggcct ccagcccaac accagagcct ggggtggggg cgggggacct 2941 tccgggacct acctcagccc ccgtaccttc tgggagccag tcaggtggaa ggggatcccc 3001 agtgagcccc aggaggcatc agaaaggcct cctgaattgc aagatgccac ctgctgagaa 3061 ggcttccagg atcagagctg ctgagaaggt ttccaggggc gatcaggaat ctccagatgc 3121 ttgtctgcct cctgcagtac ctgaagcccc agccccaccc caaaagcccc ttaactctca 3181 gagccagaaa catcttgcac ctccgcccct tctttctccc cttttacctt ctatcaagcc 3241 aaccgttcgt aagaccaggc aagatgggag tcaggaagct ccagaggctc ccttgtcctc 3301 agagctggag cctttccacc caaagcctaa aattagaact cggaagtcct ccagaatgac 3361 accctttcca gctacctctg ctgcccctga gccccaccct tccacctcca cagcccagcc 3421 agtcactccc aagcccacat ctcaggccac taggagcagg acaaataggt cctctgtcaa 3481 gacccctgaa ccagttgtcc ccacagcccc tgagctccag ccttccacct ccacagacca 3541 gcctgtcacc tctgagccca catctcaggt tactagggga agaaaaagta gatcctctgt 3601 caagacccct gaaacagttg tgcccacagc ccttgagctc cagccttcca cctccaccga 3661 ccgacctgtc acctctgaac ccacctctca ggctactagg ggaagaaaaa atagatcctc 3721 tgtcaagacc cctgaaccag ttgtccccac agcccctgag ctccagcctt ccacctccac 3781 agaccagcct gtcacttctg agcccacata tcaggctact aggggaagaa aaaatagatc 3841 ctctgtcaag acccctgaac cagttgtgcc cacagcccct gagctccggc cttccacctc 3901 cacagaccga cctgtcaccc ccaagcccac atctcggacc actaggagca ggacaaatat 3961 gtcctctgtc aagacccctg aaacagttgt ccccacagcc cctgagctcc agatttccac 4021 ctccacagac caacctgtca cccctaagcc cacatctcgg accactagga gcaggacaaa 4081 tatgtcctct gtgaagaacc ctgaatcaac tgtccctata gcccctgagc tcccaccttc 4141 cacctccaca gagcagcctg tcacccctga gcccacatct cgggctacta ggggaagaaa 4201 aaatagatcc tctggcaaga cccctgaaac acttgtcccc acagccccta agctcgagcc 4261 ttccacttcc acagaccaac ctgtcactcc tgagcccaca tctcaggcca ccaggggcag 4321 gacaaatagg tcctctgtga agacccctga aacagttgtc cccacagccc ctgagctcca 4381 gccttccacc tccacagacc agcctgttac ccctgagcct acgtctcagg ctactagggg 4441 aagaacagat agatcctctg tcaagactcc tgaaacagtt gtccccacag cccctgagct 4501 acaggcttcc gcctccacag accagcctgt cacctctgag cccacatctc ggaccactag 4561 gggaagaaaa aatcggtcct ctgtcaagac ccctgaaaca gttgtgcccg cagcccctga 4621 gctccagcct cccacctcca cagaccgacc tgtcacccct gagcccacat ctcgggccac 4681 taggggcagg acaaataggt cctctgtcaa gacccctgaa tcaattgtcc ctatagcccc 4741 tgagcttcag ccttccacct ccagaaacca gcttgtcacc cctgagccca catctcgggc 4801 cactaggtgc aggacaaata ggtcctctgt caagacccct gagccagttg tccccacagc 4861 ccctgagccc catcctacca cctccacaga ccagcctgtc acccccaagc tcacatctag 4921 ggccactagg agaaagacaa ataggtcctc tgtcaagact cccaaaccag ttgaaccagc 4981 agcctctgat cttgagcctt ttacccccac agaccagtcc gtcacccctg aggccatagc 5041 tcagggtggt cagagcaaaa cactgaggtc ttccacagta agagctatgc cggttcctac 5101 cacccctgaa ttccaatctc ctgtcaccac agaccagcct atttcccctg agcctattac 5161 tcaacccagt tgcatcaaga ggcagagagc cgctgggaac cctggctccc tcgcagctcc 5221 cattgaccat aagccttgct ctgcaccctt ggaacctaaa tcccaggcct caaggaacca 5281 aagatgggga gcagtgagag cagctgaatc ccttacagcc attcctgagc ctgcctctcc 5341 ccagcttctt gagacaccaa ttcatgcctc ccagatccaa aaggtggaac cagcaggtag 5401 atctaggttc accccggagc tccagcctaa ggcctctcaa agccgcaaga ggtctttagc 5461 taccatggat tcaccaccac atcaaaaaca gccccaaaga ggggaagtct cccagaagac 5521 agtgattatc aaggaagagg aagaagatac tgcagagaag ccagggaagg aagaggatgt 5581 cgtgactcca aaaccaggca agagaaagag agaccaggca gaggaggagc ccaacagaat 5641 accaagccgc agcctccgac ggaccaaact taaccaagaa tcaacagccc ccaaagtgct 5701 cttcacagga gtggtggatg ctcggggaga gcgggctgtg ctggcactgg ggggaagtct 5761 ggctggttca gcggcagagg cttcccacct ggtcactgat cgcatccgcc ggacagtcaa 5821 gttcctgtgt gccctggggc ggggaatccc cattctgtcc ctggactggc tgcatcagtc 5881 ccgcaaggct ggtttcttct tacccccgga tgaatatgtg gtgaccgacc ctgagcaaga 5941 gaagaacttt ggctttagcc ttcaagacgc actgagcagg gctcgggagc gaaggctgct 6001 agagggctat gagatctatg tgacccctgg agtccagcca ccaccacctc agatgggaga 6061 gattattagc tgctgtggag gcacatacct acccagcatg cctcggtcct ataagcctca 6121 gagagttgtg atcacatgcc ctcaggactt ccctcattgc tccattccac tacgggttgg 6181 gctgcccctc ctctcgcctg agttcctgct gactggagtg ctgaagcagg aagccaagcc 6241 agaggccttt gtcctctccc ctttggagat gtcatccacc tgagaactcc actacccttt 6301 tccctcccag accacgaatt agaagatatg tggaagaaag aactcagggc gttagaaagg 6361 attggggtat attgatacaa cttgtcctgg aacatgggtg ggaccagaaa tctttatgaa 6421 taaatgaaaa gataagggat ttggaagcca caggttgttt tttgtttgtt tgtttgtttt 6481 tttaatggcc attttatttt atttgtattt atagtttttt atttgtatag atttagggga 6541 tacaagattt cttacatgca tgtattaaat ggccatttta aaattagcta gtttcatgct 6601 cagatgtcat aagtggcagc tatctttagc cagactgttg cagttattgc tcgatgccac 6661 tcatggtgtc ctacctccta tttggaaacc atctctattt ttttcttact gagattctta 6721 ctttggggtc aggaacttga agggatgctt ggagtgagta gatttgaggg tccagttatg 6781 gagtgctact aaaacatttt cttctctcct ggcctctgga agcatcttta gctttgactt 6841 tgggcaagtc tctgtacttt tctggccagc ttttccagga tttataaaat tagagcttcg 6901 gcttgacctc tgtgataaat aaatattcac tctgtgcctt // LOCUS D79993 3336 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0171 gene, complete cds. ACCESSION D79993 NID g1136401 KEYWORDS KIAA0171. SOURCE Homo sapiens male myeloblast cell-line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3336) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (12-DEC-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 3336) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA 0161 - KIAA 0200) deduced by analysis of cDNA clones from human cell line KG1 JOURNAL DNA Research (1995) In press REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Tanaka,A. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA0161-KIAA0200) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 3 (1), 17-24 (1996) MEDLINE 96281124 FEATURES Location/Qualifiers source 1..3336 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /chromosome="5" /sex="male" 5'UTR 1..101 gene 102..1979 /gene="KIAA0171" CDS 102..1979 /gene="KIAA0171" /note="similar to hypothetical protein L8167.6 of Saccharomyces cerevisiae." /citation=[3] /codon_start=1 /db_xref="PID:d1012154" /db_xref="PID:g1136402" /translation="MLNMWKVRELVDKATNVVMNYSEIESKVREATNDDPWGPSGQLM GEIAKATFMYEQFPELMNMLWSRMLKDNKKNWRRVYKSLLLLAYLIRNGSERVVTSAR EHIYDLRSLENYHFVDEHGKDQGINIRQKVKELVEFAQDDDRLREERKKAKKNKDKYV GVSSDSVGGFRYSERYDPEPKSKWDEEWDKNKSAFPFSDKLGELSDKIGSTIDDTISK FRRKDREDSPERCSDSDEEKKARRGRSPKGEFKDEEETVTTKHIHITQATETTTTRHK RTANPSKTIDLGAAAHYTGDKASPDQNASTHTPQSSVKTSVPSSKSSGDLVDLFDGTS QSTGGSADLFGGFADFGSAAASGSFPSQVTATSGNGDFGDWSAFNQAPSGPVASSGEF FGSASQPAVELVSGSQSALGPPPAASNSSDLFDLMGSSQATMTSSQSMNFSMMSTNTV GLGLPMSRSQNTDMVQKSVSKTLPSTWSDPSVNISLDNLLPGMQPSKPQQPSLNTMIQ QQNMQQPMNVMTQSFGAVNLSSPSNMLPVRPQTNALIGGPMPMSMPNVMTGTMGMAPL GNTPMMNQSMMGMNMNIGMSAAGMGLTGTMGMGMPNIAMTSGTVQPKQDAFANFANFS K" 3'UTR 1980..3336 BASE COUNT 1035 a 635 c 727 g 939 t ORIGIN 1 ggcggcggtg accccggcct ggaactgccc cggtacggaa gtgttccggg gtccgtgggg 61 agcaggagag ggaggcggcg gaccgtcccg cgcggggcac gatgttgaac atgtggaagg 121 tgcgcgagct ggtggacaaa gccaccaatg ttgttatgaa ttattcagag atcgagtcta 181 aggttcgaga ggcaacgaac gatgatcctt ggggaccttc tgggcaactc atgggagaga 241 ttgccaaggc tacatttatg tatgaacaat ttccagaact tatgaacatg ctttggtcac 301 gaatgttaaa agacaacaaa aagaattgga gaagagttta taagtcgttg ctgctcctag 361 cttacctcat aaggaatgga tcagagcgtg ttgttacaag tgccagagaa cacatttatg 421 atttacgatc cctggaaaat taccactttg tagatgagca tggtaaggat caaggtataa 481 atattcgaca gaaggtgaag gaattggttg aatttgccca ggatgacgac aggcttcgtg 541 aagagcgaaa gaaagcaaag aagaacaaag acaagtatgt tggggtttcc tcagacagtg 601 ttggaggatt cagatacagt gaaagatatg atcctgagcc caaatcaaaa tgggatgagg 661 agtgggataa aaacaagagt gcttttccat tcagtgataa attaggtgag ctgagtgata 721 aaattggaag cacaattgat gacaccatca gcaagttccg gaggaaagat agagaagact 781 ctccagaaag atgcagcgac agcgatgagg aaaagaaagc gagaagaggc agatctccca 841 aaggtgaatt caaagatgaa gaggagactg tgacgacaaa gcatattcat atcacacagg 901 ccacagagac caccacaacc agacacaagc gcacagcaaa tccttccaaa accattgatc 961 ttggagcagc agcacattac acaggggaca aagcaagtcc agatcagaat gcttcaaccc 1021 acacacctca gtcttcagtt aagacttcag tgcctagcag caagtcatct ggtgaccttg 1081 ttgatctgtt tgatggcacc agccagtcaa caggaggatc agctgattta ttcggaggat 1141 ttgctgactt tggctcagct gctgcatcag gcagtttccc ttcccaagta acagcaacaa 1201 gtgggaatgg agactttggt gactggagtg ccttcaacca agccccatca ggccctgttg 1261 cttccagtgg cgagttcttt ggcagtgcct cacagccagc ggtagaactt gttagtggct 1321 cacaatcagc tctaggccca cctcctgctg cctcaaattc ttcagacctg tttgatctta 1381 tgggctcgtc ccaggcaacc atgacatctt cccagagtat gaatttctct atgatgagca 1441 ctaacactgt gggacttggt ttgcctatgt caagatcaca gaatacagat atggtccaga 1501 aatcagtcag caaaaccttg ccctctactt ggtctgaccc cagtgtaaac atcagcctag 1561 acaacttact acctggtatg cagccttcca aaccccagca gccatcactg aatacaatga 1621 ttcagcaaca gaatatgcag cagcctatga atgtgatgac tcaaagtttt ggagctgtga 1681 acctcagttc tccatcgaac atgcttcctg tccggcccca aactaatgct ttgatagggg 1741 gacccatgcc tatgagcatg cccaatgtga tgactggcac catgggaatg gcccctcttg 1801 gaaatactcc gatgatgaac cagagcatga tgggcatgaa catgaacata gggatgtccg 1861 ctgctgggat gggcttgaca ggcacaatgg gaatgggcat gcccaacata gccatgactt 1921 ctggaactgt gcaacccaag caagatgcct ttgcaaattt cgccaatttt agcaaataag 1981 agattgtaaa agaagcagat tgaatgaaga atttttagct gtgcagatag gtgatgttgg 2041 gatggaaaat gctaatcaac taccctttct tttatcaagt aattaaaata aatctacata 2101 aagaaccaaa aaggctgttt tataaaagtg aaatatccag tatttcagag ggccaggcaa 2161 gagcacttca gatgaggcag tcaaaatcat ttttttccag tgaggataga ccacaagtgg 2221 gtggtgagac cattgaaagc ctttatcaac tgaagagtcc atttaacagc ataatttgtg 2281 ggaagactgg aatagggctg aataaatgtg tttgaatctc taattttata ctttcttttc 2341 ctgaggaact tgatttttct gtccctggat cgccttgtca taattgggtc tgttcctttt 2401 actaccactc ttgagtccat atatgaaatc attaaagttg gatgatcagt tttttataaa 2461 aatatatatt tttgtccaag aaaaaaaaaa gcatacatat gtgattatgg ctaaatcaaa 2521 ggtaactgga atgtatatac ttttgctaat gttccagcaa cactgctatt atactatcca 2581 aatttttatt gtaacaaaac ctctttaagc aattggtgat tgccatggga cttttcccat 2641 gtcttctgct gtaattatcc tgtgcagaac taggaagaaa tttttttcag gactgctcta 2701 tggtttcctt taaaagaaaa aaacttctgt ttgtttttag cagtcattat ttacaatttg 2761 cagtgattaa cttggcaagg cttccttccg tgtttatccc tgtagccatc atttaagtca 2821 ggaacagtca gaaaaatatt tattttattt tttttttggg tgtctgcaaa ggtaaaaatc 2881 cattaaaacc ttaagttaaa tataaatgtt acaactcaat gtttgctttt agattttata 2941 cagtatttgt tttgttttgg ttttgagtgt atataatgca gcattagcaa tatggttcca 3001 atagaggagt taaatatata ttgttaaagg agacctgtag cagtcaaaga ttttattgat 3061 ttaatgacaa aggaaattaa tgaaaatgtt tttgtttttc tgctgtaatt ctgcattaag 3121 ctcacatgaa aatcatgatt ctagagtttg gaatgcaaaa ttaattgttt taccctcaag 3181 ctgggaatat ttttcaaaat aaatactata atatagatat caaattatta cctccccatg 3241 ttatgttgaa aattttttta ttaaattgat aaaactttat ttccattata ttcataatgt 3301 tctgttatac ataacattaa aatgttcatt aaaatc // LOCUS D79995 4831 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0173 gene, complete cds. ACCESSION D79995 NID g1136405 KEYWORDS KIAA0173. SOURCE Homo sapiens male myeloblast cell-line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4831) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (12-DEC-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 4831) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA 0161 - KIAA 0200) deduced by analysis of cDNA clones from human cell line KG1 JOURNAL DNA Research (1995) In press REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Tanaka,A. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA0161-KIAA0200) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 3 (1), 17-24 (1996) MEDLINE 96281124 FEATURES Location/Qualifiers source 1..4831 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /chromosome="2" /sex="male" 5'UTR 1..207 gene 208..3807 /gene="KIAA0173" CDS 208..3807 /gene="KIAA0173" /note="similar to pig tubulin-tyrosine ligase." /citation=[3] /codon_start=1 /db_xref="PID:d1012156" /db_xref="PID:g1136406" /translation="MASAGTQHYSIGLRQKNSFKQSGPSGTVPATPPQKPSEGRVWPQ AHQQVKPIWKLEKKQVETLSAGLGPGLLGVPPQPAYFFCPSTLCSSGTTAVIAGHSSS CYLHSLPDLFNSTLLYRRSSYRQKPYQQLESFCLRSSPSEKSPFSLPQKSLPVSLTAN KATSSMVFSMAQPMASSSTEPYLCLAAAGENPSGKSLASAISGKIPSPLSSSYKPMLN NNSFMWPNSTPVPLLQTTQGLKPVSPPKIQPVSWHHSGGTGDCAPQPVDHKVPKSIGT VPADASAHIALSTASSHDTSTTSVASSWYNRNNLAMRAEPLSCALDDSSDSQDPTKEI RFTEAVRKLTARGFEKMPRQGCQLEQSSFLNPSFQWNVLNRSRRWKPPAVNQQFPQED AGSVRRVLPGASDTLGLDNTVFCTKRISIHLLASHASGLNHNPACESVIDSSAFGEGK APGPPFPQTLGIANVATRLSSIQLGQSEKERPEEARELDSSDRDISSATDLQPDQAET EDTEEELVDGLEDCCSRDENEEEEGDSECSSLSAVSPSESVAMISRSCMEILTKPLSN HEKVVRPALIYSLFPNVPPTIYFGTRDERVEKLPWEQRKLLRWKMSTVTPNIVKQTIG RSHFKISKRNDDWLGCWGHHMKSPSFRSIREHQKLNHFPGSFQIGRKDRLWRNLSRMQ SRFGKKEFSFFPQSFILPQDAKLLRKAWESSSRQKWIVKPPASARGIGIQVIHKWSQL PKRRPLLVQRYLHKPYLISGSKFDLRIYVYVTSYDPLRIYLFSDGLVRFASCKYSPSM KSLGNKFMHLTNYSVNKKNAEYQANADEMACQGHKWALKALWNYLSQKGVNSDAIWEK IKDVVVKTIISSEPYVTSLLKMYVRRPYSCHELFGFDIMLDENLKPWVLEVNISPSLH SSSPLDISIKGQMIRDLLNLAGFVLPNAEDIISSPSSCSSSTTSLPTSPGDKCRMAPE HVTAQKMKKAYYLTQKIPDQDFYASVLDVLTPDDVRILVEMEDEFSRRGQFERIFPSH ISSRYLRFFEQPRYFNILTTQWEQKYHGNKLKGVDLLRSWCYKGFHMGVVSDSAPVWS LPTSLLTISKDDVILNAFSKSETSKLGKQSSCEVSLLLSEDGTTPKSKKTQAGLSPYP QKPSSSKDSEDTSKEPSLSTQTLPVIKCSGQTSRLSASSTFQSISDSLLAVSP" 3'UTR 3808..4831 BASE COUNT 1157 a 1363 c 1156 g 1155 t ORIGIN 1 gccgagccgg ggcgaagctg gatcccctag atagactgtc ttcaagctca ctgatatttt 61 cctctgcttg atccattgtg ctgttgagag cctctagtaa atttttcaga ctgacagact 121 tcaaggatgc agctgctact accggaggtg tgtggcacct tacctcagca aggccatgag 181 accgtgtggc catgatgtgg gcccctcatg gcctcagcag gaacacagca ctatagtatt 241 ggcctccgcc agaaaaacag cttcaagcag agtggtccct caggcacagt acctgccacg 301 ccacctcaga aaccctcgga gggcagagtc tggcctcagg cccatcagca agtgaagcca 361 atctggaagc tggaaaagaa gcaagtggag acactgtcag cagggttggg cccaggcctc 421 ttgggcgtcc caccccagcc agcatatttc ttttgcccca gcactttatg tagctctggg 481 accacggctg tcattgcagg ccacagcagt tcctgttacc tacactctct cccggacttg 541 ttcaacagca ccctgctata ccgccgctcc agctataggc aaaaaccgta ccagcaactg 601 gagtctttct gcttgcgttc gagcccatca gaaaaaagcc ctttttctct ccctcaaaag 661 agcctccctg tcagtctcac tgccaacaag gccacttctt ccatggtctt ctccatggcc 721 cagcccatgg cctcctcatc cacagaacca tacctctgct tggcagcggc tggggaaaac 781 ccttcaggga agagcctggc ctctgccatc tcagggaaga tcccatctcc actctcttcc 841 tcctataagc ccatgctgaa taataattcc ttcatgtggc caaatagcac gccagtgcct 901 ttattgcaga ccacacaggg cctgaagcca gtatcgccac ccaagatcca gcctgtctcc 961 tggcatcatt cagggggtac tggagactgt gcaccgcagc ctgttgacca taaggtgccc 1021 aaaagcattg gcactgtccc agctgatgcc agtgcccata tcgccttgtc taccgctagc 1081 tcccacgaca catccaccac cagtgttgcc tcttcctggt ataaccggaa taacttagcc 1141 atgagggcag agccactttc ctgtgctctg gatgacagct ctgattccca ggatccaact 1201 aaggagattc ggttcactga ggccgtgagg aaattgaccg caagaggctt tgagaagatg 1261 ccgaggcaag gctgccagct tgaacagtct agtttcctga accccagctt ccagtggaat 1321 gtcctcaaca ggagcaggcg gtggaaacct cctgcggtaa atcagcagtt tcctcaggag 1381 gatgctggat cggtcaggcg ggtcctccct ggtgcctcag ataccttggg gttggacaat 1441 acagtcttct gtaccaagcg tatcagcatt cacctccttg cctcacatgc cagtgggctc 1501 aatcacaacc ctgcctgtga atctgtaatt gactcctcag catttggaga aggcaaagct 1561 ccaggtcccc cttttcctca aactcttggc atagccaacg tggccacccg cctctcttcc 1621 atccagctgg gccagtctga gaaggagaga cctgaggagg ccagggagct ggactcatct 1681 gatagggata ttagttcagc tactgacctc cagccagatc aggctgagac tgaagataca 1741 gaagaagaac tagtagatgg tttggaagac tgttgtagcc gtgatgagaa tgaagaggag 1801 gagggagact cagagtgctc ctcattaagt gctgtctccc ccagcgaatc ggtggccatg 1861 atctctagaa gctgtatgga aattctgacc aaaccccttt ccaatcatga gaaagttgtc 1921 cgaccagccc tcatctacag tctctttccc aacgttcccc ctaccatcta ttttggcact 1981 cgggatgaga gagtggagaa acttccctgg gagcagagga agttgctccg atggaagatg 2041 agcacagtga cccccaacat tgtcaagcag accattggac ggtcccactt caaaatcagc 2101 aaaagaaacg atgactggct gggctgctgg ggtcaccaca tgaagtctcc tagtttccga 2161 tccattcgag agcatcagaa gctaaaccat ttcccaggct cattccagat tgggaggaag 2221 gaccggctat ggcggaacct gtcacgtatg cagagccgct ttggcaagaa ggagttcagt 2281 ttcttccccc agtcctttat cctgccccag gacgccaagc tcctgcgcaa agcgtgggag 2341 agcagcagcc gccaaaagtg gattgtgaag ccaccagcat cagctcgagg cattggcatc 2401 caggttattc acaagtggag tcagctcccc aagcgaaggc ccctcctggt acagaggtat 2461 ctacacaaac cctacctcat cagcggcagc aagtttgacc tgcggatcta tgtttatgtc 2521 acttcctacg atcctctgcg gatttacctc ttttcagatg gactggtccg ctttgccagt 2581 tgcaagtatt cgccttccat gaagagcctt ggcaataagt tcatgcacct gaccaactac 2641 agtgtcaata aaaagaatgc cgagtaccag gccaatgcag atgaaatggc ttgccagggc 2701 cacaaatggg cactgaaggc tttgtggaac tacctgagcc agaagggagt caatagcgac 2761 gccatctggg agaagataaa ggatgttgtt gtcaaaacta tcatctcgtc agagccctat 2821 gtgaccagcc tgctcaagat gtatgtgcga cggccctata gctgccatga actctttggt 2881 tttgacatca tgctagacga aaacctcaag ccctgggtcc tggaagtcaa catttcccca 2941 agcctccact ccagctctcc actggatatc agcatcaaag gccagatgat tcgtgacctt 3001 ctgaatctgg caggttttgt cctgcccaat gcagaggata tcatttccag ccccagcagc 3061 tgcagcagct ccaccaccag cctgcccacc tcccctgggg acaaatgtcg aatggctcca 3121 gagcatgtca ctgcacagaa gatgaagaaa gcctattatc tgacccagaa aattcctgat 3181 caggacttct atgcatctgt gctggatgtc ctgacaccag atgatgttcg gattctggtt 3241 gagatggaag atgagttttc tcgccgtggt cagtttgaac gaatttttcc ttctcatatc 3301 tcctctcgct atctccgctt ttttgagcag ccacgatatt tcaacattct caccacccaa 3361 tgggaacaga aataccatgg caacaagctt aaaggagtag atctgctccg gagttggtgc 3421 tacaaagggt tccacatggg agttgtctct gattctgctc cagtgtggtc tctcccgaca 3481 tcacttctga ctatctcaaa ggatgacgtg atactcaatg ccttcagcaa atcagagact 3541 agcaagctgg gaaaacaaag ctcctgtgag gttagcctac tactctctga agacgggacc 3601 acgcccaaat ccaagaagac tcaagctggc ctttcccctt atccccagaa acccagttcc 3661 tcaaaggaca gtgaggacac cagcaaagag cccagccttt ctacccagac gttacctgtg 3721 atcaagtgct ctgggcagac ttcaagactt tctgcttcct ccactttcca gtcaatcagt 3781 gactccctcc tggctgtgag cccataactg gcctctctcc aaaagcctct gcccaggagc 3841 atgggcatca gctacctcac gggaaccagc ctgctgttca gaccagtctg accccctacc 3901 cctttcaccc tgtccctcct cagagtattt tttgaagtgg ttgcattata gagatgggta 3961 tttgtagggc cggagggatg gtagtgatgg ggagaaggtg aggaagggtc accctctgtc 4021 acctgtctgc ctggctggca cctcatatct cagcagagaa gccagtggtg gccacgcagc 4081 cttataaagc aggttttggt ttctacctta agtgagccat gtgtggtttg tctgggggcc 4141 ctggtgtggt tgctgagttg tagctcaaga ggagaaaaca tacagaacat atttggaccg 4201 gaaatccttt gttctgaatt tgagggggtc ttctgaggtc cttatttcct taggtctttc 4261 ctcacccctc tcccaccgct gtcctgagga gaaacccttg aacttcctca gtagacaggc 4321 ggagaggcca caacatgccg aacccatttc ctgtcatcct agtcttgggt cttcaccgcc 4381 tccttccaaa tacccaccct gccagcagcc ctaggtcttc ctgttctgac cccccatcac 4441 tgctcgttca gccttctaga cgtctctctc gtggacatct gttctttagc tgttggcttt 4501 ctctgaggtg tgagagggtc tatgaacttt gtgaatttcc catggcccca gtgaaggagc 4561 ccagataatc ccagtagctg ttacctgtct ccatgtatca aaggacacag tccaggggga 4621 gggtggaagg agatgtggtt tctctatagt gcaacaaaca tggtttctca atgttctgct 4681 gtgcagcaag cagggtctgg cggcttggta ggtgggtttc aggagcagtc actattgtag 4741 gatgggcttc caatcaaacc tcagactaaa ctcttgtact gaactgattc tacctccctc 4801 ctctagactc agtaaacagt gactattcaa t // LOCUS D79996 2348 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0174 gene, complete cds. ACCESSION D79996 NID g1136407 KEYWORDS KIAA0174. SOURCE Homo sapiens male myeloblast cell-line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2348) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (12-DEC-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 2348) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA 0161 - KIAA 0200) deduced by analysis of cDNA clones from human cell line KG1 JOURNAL DNA Research (1995) In press REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Tanaka,A. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA0161-KIAA0200) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 3 (1), 17-24 (1996) MEDLINE 96281124 FEATURES Location/Qualifiers source 1..2348 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..63 gene 64..1158 /gene="KIAA0174" CDS 64..1158 /gene="KIAA0174" /citation=[3] /codon_start=1 /db_xref="PID:d1012157" /db_xref="PID:g1136408" /translation="MLGSGFKAERLRVNLRLVINRLKLLEKKKTELAQKARKEIADYL AAGKDERARIRVEHIIREDYLVEAMEILELYCDLLLARFGLIQSMKELDSGLAESVST LIWAAPRLQSEVAELKIVADQLCAKYSKEYGKLCRTNQIGTVNDRLMHKLSVEAPPKI LVERYLIEIAKNYNVPYEPDSVVMAEAPPGVETDLIDVGFTDDVKKGGPGRGGSGGFT APVGGPDGTVPMPMPMPMPSANTPFSYPLPKGPSDFNGLPMGTYQAFPNIHPPQIPAT PPSYESVDDINADKNISSAQIVGPGPKPEASAKLPSRPADNYDNFVLPELPSVPDTLP TASAGASTSASEDIDFDDLSRRFEELKKKT" 3'UTR 1159..2348 BASE COUNT 577 a 542 c 598 g 631 t ORIGIN 1 tgaaccctga agtcggtgtc tgctgcgttc acggcaggat tcggttagga ggaacagcac 61 agcatgctgg gctctggatt taaagctgag cgcttaagag tgaatttgag attagtcata 121 aatcgcctta aactattgga gaaaaagaaa acggaactgg cccagaaagc aaggaaggag 181 attgctgact atctggctgc tgggaaagat gaacgagctc ggatccgtgt ggagcacatt 241 atccgggaag actacctcgt ggaggccatg gagatcctgg agctgtactg tgacctgctg 301 ctggctcggt ttggccttat ccagtctatg aaggaactag attctggtct ggctgaatct 361 gtgtctacat tgatctgggc tgctcctcga ctccagtcag aagtggctga gttgaaaata 421 gttgctgatc agctctgtgc caagtatagc aaggaatatg gcaagctatg taggaccaac 481 cagattggaa ctgtgaatga caggctaatg cacaagctga gtgtggaagc cccacccaaa 541 atcctggtgg agagatacct gattgaaatt gcaaagaatt acaacgtacc ctatgaacct 601 gactctgtgg tcatggcaga agctcctcct ggggtagaga cagatcttat tgatgttgga 661 ttcacagatg atgtgaagaa aggaggccct ggaagaggag ggagtggtgg cttcacagca 721 ccagttggtg gacctgatgg aacggtgcca atgcccatgc ccatgcccat gccatctgca 781 aatacgcctt tctcatatcc actgccaaag ggaccatcag atttcaatgg actgccaatg 841 gggacttatc aggcctttcc caatattcat ccacctcaga taccagcaac tcccccatcg 901 tatgaatctg tagatgacat taatgctgat aagaatatct cttctgcaca gattgttggt 961 cctggaccca agccagaagc ctctgcaaag cttccttcca gacctgcaga taactatgac 1021 aactttgtcc taccagagtt gccatctgtg ccagacacac taccaactgc atctgctggt 1081 gccagcacct cagcatctga agacattgac tttgatgatc tttcccggag gtttgaagag 1141 ctgaaaaaga aaacataggt ctcttaaacc aggcaacttt cacgttttgg gagttgagac 1201 tgagcaattt ctccttgtaa caaagaatct ccatgaaatt ctgtttcatc tgttaaccgt 1261 cactcagcac aacactccct ctgggctctc ttcctgctcc tccagattct gctgctttcc 1321 agttctctgt tgatcctgag actaacaatt ggagactgag gccagagcaa ctggctcctg 1381 gcagctgtgc ttgtccgctt cctgtcagag tgatcccagg tttcctcctg gcccgtccca 1441 tggtccctcc acaggagtgt gagaggatgg gggaagcact gtgggaagac caccaaagat 1501 ggctggacag tgggagagag cacgttgtga agcatcccag cctcgtgttg aggttccaga 1561 cttagaaaca gacccctctg tacaggggga ttgtggtgag tgagaatcaa ggccaccttg 1621 tgtgttttct cactctcgaa tgcaagtggg agagggaaaa tgactcggga cgccattgta 1681 acggttcctg gaagctgggc cctctcattg gcatatacag tactcctcgc tgcagggcac 1741 tgtcccaccg ggatccagtt gcaaagtttg tcttgacagt tgaaggcctc gcttagttgt 1801 actggattct cagggagccc tctgtggcct tttgctttgc gtgctgtttc ccttgtacca 1861 gagggcggca ccgtggaaat tctgttttcc ctgtagcata ttgtgttgga ttgcattact 1921 ggcagagaaa ggacaaggtg ccattcaagt cctagggtgg gcttccagct gccttaatag 1981 aagtactcaa gtcttttggg tagtgagctg gaaagcctac aggaaaagag gggtacctgt 2041 tttcatttga aaactttgat tcatggaacc tttaaaacta atctcagaaa aatttttggt 2101 gcccatgcag ctgtagttgt tcactgcttt cctggatgga tgggactctt atgtcataac 2161 ttctgttact cctttggccc atagctaagg tcatccttcc ccacaggggt ggctttggga 2221 ttggatgata cagcttttgc ttctgtgtag tatacctgta catacttgtt tcaggcagcc 2281 tttctttaat gttttcagtt ggtttgtatt ttgtagctca gtagctgcta ataaagttaa 2341 agatcctg // LOCUS D79997 2470 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0175 gene, complete cds. ACCESSION D79997 NID g1136409 KEYWORDS KIAA0175. SOURCE Homo sapiens male myeloblast cell-line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2470) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (12-DEC-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 2470) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA 0161 - KIAA 0200) deduced by analysis of cDNA clones from human cell line KG1 JOURNAL DNA Research (1995) In press REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Tanaka,A. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA0161-KIAA0200) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 3 (1), 17-24 (1996) MEDLINE 96281124 FEATURES Location/Qualifiers source 1..2470 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /chromosome="9" /sex="male" 5'UTR 1..170 gene 171..2126 /gene="KIAA0175" CDS 171..2126 /gene="KIAA0175" /note="similar to protein kinase of X.laevis, has putative transmembrane domain incentral region" /citation=[3] /codon_start=1 /db_xref="PID:d1012158" /db_xref="PID:g1136410" /translation="MKDYDELLKYYELHETIGTGGFAKVKLACHILTGEMVAIKIMDK NTLGSDLPRIKTEIEALKNLRHQHICQLYHVLETANKIFMVLEYCPGGELFDYIISQD RLSEEETRVVFRQIVSAVAYVHSQGYAHRDLKPENLLFDEYHKLKLIDFGLCAKPKGN KDYHLQTCCGSLAYAAPELIQGKSYLGSEADVWSMGILLYVLMCGFLPFDDDNVMALY KKIMRGKYDVPKWLSPSSILLLQQMLQVDPKKRISMKNLLNHPWIMQDYNYPVEWQSK NPFIHLDDDCVTELSVHHRNNRQTMEDLISLWQYDHLTATYLLLLAKKARGKPVRLRL SSFSCGQASATPFTDIKSNNWSLEDVTASDKNYVAGLIDYDWCEDDLSTGAATPRTSQ FTKYWTESNGVESKSLTPALCRTPANKLKNKENVYTPKSAVKNEEYFMFPEPKTPVNK NQHKREILTTPNRYTTPSKARNQCLKETPIKIPVNSTGTDKLMTGVISPERRCRSVEL DLNQAHMEETPKRKGAKVFGSLERGLDKVITVLTRSKRKGSARDGPRRLKLHYNVTTT RLVNPDQLLNEIMSILPKKHVDFVQKGYTLKCQTQSDFGKVTMQFELEVCQLQKPDVV GIRRQRLKGDAWVYKRLVEDILSSCKV" 3'UTR 2127..2470 BASE COUNT 767 a 488 c 536 g 679 t ORIGIN 1 ttggcgggcg gaagcggcca caacccggcg atcgaaaaga ttcttaggaa cgccgtacca 61 gccgcgtctc tcaggacagc aggcccctgt ccttctgtcg ggcgccgctc agccgtgccc 121 tccgcccctc aggttctttt tctaattcca aataaacttg caagaggact atgaaagatt 181 atgatgaact tctcaaatat tatgaattac atgaaactat tgggacaggt ggctttgcaa 241 aggtcaaact tgcctgccat atccttactg gagagatggt agctataaaa atcatggata 301 aaaacacact agggagtgat ttgccccgga tcaaaacgga gattgaggcc ttgaagaacc 361 tgagacatca gcatatatgt caactctacc atgtgctaga gacagccaac aaaatattca 421 tggttcttga gtactgccct ggaggagagc tgtttgacta tataatttcc caggatcgcc 481 tgtcagaaga ggagacccgg gttgtcttcc gtcagatagt atctgctgtt gcttatgtgc 541 acagccaggg ctatgctcac agggacctca agccagaaaa tttgctgttt gatgaatatc 601 ataaattaaa gctgattgac tttggtctct gtgcaaaacc caagggtaac aaggattacc 661 atctacagac atgctgtggg agtctggctt atgcagcacc tgagttaata caaggcaaat 721 catatcttgg atcagaggca gatgtttgga gcatgggcat actgttatat gttcttatgt 781 gtggatttct accatttgat gatgataatg taatggcttt atacaagaag attatgagag 841 gaaaatatga tgttcccaag tggctctctc ccagtagcat tctgcttctt caacaaatgc 901 tgcaggtgga cccaaagaaa cggatttcta tgaaaaatct attgaaccat ccctggatca 961 tgcaagatta caactatcct gttgagtggc aaagcaagaa tccttttatt cacctcgatg 1021 atgattgcgt aacagaactt tctgtacatc acagaaacaa caggcaaaca atggaggatt 1081 taatttcact gtggcagtat gatcacctca cggctaccta tcttctgctt ctagccaaga 1141 aggctcgggg aaaaccagtt cgtttaaggc tttcttcttt ctcctgtgga caagccagtg 1201 ctaccccatt cacagacatc aagtcaaata attggagtct ggaagatgtg accgcaagtg 1261 ataaaaatta tgtggcggga ttaatagact atgattggtg tgaagatgat ttatcaacag 1321 gtgctgctac tccccgaaca tcacagttta ccaagtactg gacagaatca aatggggtgg 1381 aatctaaatc attaactcca gccttatgca gaacacctgc aaataaatta aagaacaaag 1441 aaaatgtata tactcctaag tctgctgtaa agaatgaaga gtactttatg tttcctgagc 1501 caaagactcc agttaataag aaccagcata agagagaaat actcactacg ccaaatcgtt 1561 acactacacc ctcaaaagct agaaaccagt gcctgaaaga aactccaatt aaaataccag 1621 taaattcaac aggaacagac aagttaatga caggtgtcat tagccctgag aggcggtgcc 1681 gctcagtgga attggatctc aaccaagcac atatggagga gactccaaaa agaaagggag 1741 ccaaagtgtt tgggagcctt gaaagggggt tggataaggt tatcactgtg ctcaccagga 1801 gcaaaaggaa gggttctgcc agagacgggc ccagaagact aaagcttcac tataatgtga 1861 ctacaactag attagtgaat ccagatcaac tgttgaatga aataatgtct attcttccaa 1921 agaagcatgt tgactttgta caaaagggtt atacactgaa gtgtcaaaca cagtcagatt 1981 ttgggaaagt gacaatgcaa tttgaattag aagtgtgcca gcttcaaaaa cccgatgtgg 2041 tgggtatcag gaggcagcgg cttaagggcg atgcctgggt ttacaaaaga ttagtggaag 2101 acatcctatc tagctgcaag gtataattga tggattcttc catcctgccg gatgagtgtg 2161 ggtgtgatac agcctacata aagactgtta tgatcgcttt gattttaaag ttcattggaa 2221 ctaccaactt gtttctaaag agctatctta agaccaatat ctctttgttt ttaaacaaaa 2281 gatattattt tgtgtatgaa tctaaatcaa gcccatctgt cattatgtta ctgtcttttt 2341 taatcatgtg gttttgtata ttaataattg ttgactttct tagattcact tccatatgtg 2401 aatgtaagct cttaactatg tctctttgta atgtgtaatt tctttctgaa ataaaaccat 2461 ttgtgaatat // LOCUS D80008 3248 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0186 gene, complete cds. ACCESSION D80008 NID g1136431 KEYWORDS KIAA0186. SOURCE Homo sapiens male myeloblast cell-line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3248) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (12-DEC-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 3248) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA 0161 - KIAA 0200) deduced by analysis of cDNA clones from human cell line KG1 JOURNAL DNA Research (1995) In press REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Tanaka,A. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA0161-KIAA0200) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 3 (1), 17-24 (1996) MEDLINE 96281124 FEATURES Location/Qualifiers source 1..3248 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..94 gene 95..685 /gene="KIAA0186" CDS 95..685 /gene="KIAA0186" /citation=[3] /codon_start=1 /db_xref="PID:d1012169" /db_xref="PID:g1136432" /translation="MFCEKAMELIRELHRAPEGQLPAFNEDGLRQVLEEMKALYEQNQ SDVNEAKSGGRSDLIPTIKFRHCSLLRNRRCTVAYLYDRLLRIRALRWEYGSVLPNAL RFHMAAEEMEWFNNYKRSLATYMRSLGGDEGLDITQDMKPPKSLYIEVRCLKDYGEFE VDDGTSVLLKKNSQHFLPRWKCEQLIRQGVLEHILS" 3'UTR 686..3248 BASE COUNT 832 a 638 c 710 g 1068 t ORIGIN 1 ctagaacgaa aggagtgagg cgccgagagc ccagatacca ttttggcgtg agagctggtg 61 gttggcaagg ccgcgggagt gggaagcgtc cgccatgttc tgcgaaaaag ccatggaact 121 gatccgcgag ctgcatcgcg cgcccgaagg gcaactgcct gccttcaacg aggatggact 181 cagacaagtt ctggaggaga tgaaagcttt gtatgaacaa aaccagtctg atgtgaatga 241 agcaaagtca ggtggacgaa gtgatttgat accaactatc aaatttcgac actgttctct 301 gttaagaaat cgacgctgca ctgtagcata cctgtatgac cgcttgcttc ggatcagagc 361 actcagatgg gaatatggta gcgtcttgcc aaatgcatta cgatttcaca tggctgctga 421 agaaatggag tggtttaata attataaaag atctcttgct acttatatga ggtcactggg 481 aggagatgaa ggtttggaca ttacacagga tatgaaacca ccaaaaagcc tatatattga 541 agtccggtgt ctaaaagact atggagaatt tgaagttgat gatggcactt cagtcctatt 601 aaaaaaaaat agccagcact ttttacctcg atggaaatgt gagcagctga tcagacaagg 661 agtcctggag cacatcctgt catgaccatg cgccgaggca cttccaggct tcactcaact 721 catggactcc tctgtactca ctctctccac cctcccttca cctccctctt tgattttaga 781 agctatagac attgtttaag ataactaaga atacttggct aagaagtata atttgctaac 841 tattaaggac tttctttttt taatgttgta cactattctt cctactcttt tttggttttg 901 gttttgtttt gtagagactg tctcactatg ttgcccaagc tggtctcaaa ctcctggcct 961 caagcagtcc tcccacctta gcttctcaaa gtgttgagat cacaggcgtg agccactgca 1021 cccgacccct actccttttt ctaataagct gtatctgtaa tcacagcatt cctacagttg 1081 ttacagtgtg ttttttaaat gaaagtaaac atggttacat ttgaatctct taaataatca 1141 gtcacttggc tggacaggaa gaaggtagat cctgtgtgtc ttgttttctg gtcatgtgta 1201 ttgtacaagc tagagagctg aatttctgag atacacattt tcaaatcaca tgcaagtgaa 1261 gatgatggtc tgtagaaatt ttcagtatat ataatgttta atgacatact aatttatcat 1321 ctggctattt gggaaggaag gacacacatg gattttgcac atttccacca tggtggctgg 1381 tgtggcttgt ggctatgggg tgatcaccag tatcaccact ttggaagggg acagtgaaat 1441 tggggctaga gaaggaactt tgtacagttt tccctgagat tcagattgac tgaaaagtca 1501 catgaagagt tgattgtctt ttaatggtat gttttaaaca gctgacattt taaattttga 1561 tgaaatccag tttattcgtt tgttctttta tgctttgggt gttgcatccg agaaatcttt 1621 tcccatccca agatcacaat tttttttcct ttttacttct agaagtgtta taattttaag 1681 ctttatactt tggtctatga cccgtttttt tttttgtttt gttttgtttt ttcgtttgtt 1741 tctttgtttt gagatggagt cttgttctgt cacccaggct ggggtgcagt ggcgtgatct 1801 tggctcactg caatctctat cccctgggtt caagtgattc tcttgtctca gcctcccaag 1861 tagctgggat tacaggcaca ggccgccacg cccggctaat ttttgtattt ttagtagaga 1921 cagagtttta ccatgttggc caggctggtt tcaaactcct gacctcaagt gacccacctt 1981 ggcctcccaa agttttggga ttacaagtgt gggccaccgc ggccagccta tgatccattt 2041 tgaatgaatt ttttatatgg tgcaaggtgt caatccacct tcactttttc ttgggaatat 2101 agatatccag ctgtttcact accatttttt gaaaggactg ccctttgctc tatcaccttt 2161 gcatttttgt taaaaagtag ttgtcaatgt atatgtgggt ttatttcagg actctgtttt 2221 gttccattga cctgtttttc tctcctgaat gccaatacca tatttgtatg tagtgtatgt 2281 aattttctaa taattcttga aacagatagt attaatgcgt catatttttg ctgttgtttg 2341 tattttttgt ggagatgggg tttcaccatg ttggccaggc tgtgttgaac tcctgagcta 2401 aagcaataca cttgcctcgt cctccccatg tgctgggatt acaggcgtga gccttggtgc 2461 tggcccagtg taccacattt ctttttgaga tttgttttgg ctatgttaag tcctttgctt 2521 ttgatgtgaa atttgggaac aggcagggtg tggtggctta tgcctgtaat cctagaactt 2581 tgggaggcct agatgggtgg atcacttgag ctcaggagtt ccagaccagc ccgggcctat 2641 ggcgaaactc cgtctctaca aaaaatagaa aaaattagcc aggtgtggtg gtgcatgcct 2701 gtagtcacag ttacacggca ggctgaggtg ggaggatcac ttgaacccca gaggtcaaga 2761 ctgcagtgag ctgagatcac accactgtac tccagcctgg gtgacaaagt gagactctat 2821 ctcaaaaaga aattaggatc aacttgtcaa tttctacaac aacaacaaca aaaacccctg 2881 ttgggcacct tgattgagat tgcattgaat ttatataaaa ctgttgggag aattgacatc 2941 ttaataatat tgagtcttct ggcctataaa caaggtctgt cttcctaggt attaatgttt 3001 tgtcttctat ttctcttaat aatcttttgt agttttcagt gtacaggtct accatgtcag 3061 catttcatag ttttgatgct aaatggtatt ttaaaatttc aaattctaac cacttgttgc 3121 tagtaaatag aaatacaatt gatgttgaac ttgtatcctt cagccttgct aaactgtgag 3181 ttctcatggt gtttttgtaa attacatcaa cagtcatgtg ttctatgaat aaagagtttt 3241 actccttc // LOCUS D80009 4181 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0187 gene, complete cds. ACCESSION D80009 NID g1136433 KEYWORDS KIAA0187. SOURCE Homo sapiens male myeloblast cell-line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4181) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (12-DEC-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 4181) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA 0161 - KIAA 0200) deduced by analysis of cDNA clones from human cell line KG1 JOURNAL DNA Research (1995) In press REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Tanaka,A. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA0161-KIAA0200) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 3 (1), 17-24 (1996) MEDLINE 96281124 FEATURES Location/Qualifiers source 1..4181 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..27 gene 28..3876 /gene="KIAA0187" CDS 28..3876 /gene="KIAA0187" /citation=[3] /codon_start=1 /db_xref="PID:d1012170" /db_xref="PID:g1136434" /translation="MEAKDQKKHRKKNSGPKAAKKKKRLLQDLQLGDEEDARKRNPKA FAVQSAVRMARSFHRTQDLKTKKHHIPVVDRTPLEPPPIVVVVMGPPKVGKSTLIQCL IRNFTRQKLTEIRGPVTIVSGKKRRLTIIECGCDINMMIDLAKVADLVLMLIDASFGF EMETFEFLNICQVHGFPKIMGVLTHLDSFKHNKQLKKTKKRLKHRFWTEVYPGAKLFY LSGMVHGEYQNQEIHNLGRFITVMKFRPLTWQTSHPYILADRMEDLTNPEDIRTNIKC DRKVSLYGYLRGAHLKNKSQIHMPGVGDFAVSDISFLPDPCALPEQQKKRCLNEKEKL VYAPLSGVGGVLYDKDAVYVDLGGSHVFQDEVGPTHELVQSLISTHSTIDAKMASSRV TLFSDSKPLGSEDIDNQGLMMPKEEKQMDLNTGRMRRKAIFGDEDESGDSDDEEDDEM SEDDGLENGSSDEEAEEEENAEMTDQYMAVKGIKRRKLELEEDSEMDLPAFADSDDDL ERSSAEEGEAEEADESSEEEDCTAGEKGISGSKAAGEGSKAGLSPANCQSDRVNLEKS LLMKKAALPTFDSGHCTAEEVFASEDESEESSSLSAEEEDSENEEAIRKKLSKPSQVS SGQKLGPQNFIDETSDIENLLKEEEDYKEENNDSKETSGALKWKEDLSRKAAEAFLRQ QQAAPNLRKLIYGTVTEDNEEEDDDTLEELGGLFRVNQPDRECKHKADSLDCSRFLVE APHDWDLEEVMNSIRDCFVTGKWEDDKDAAKVLAEDEELYGDFEDLETGDVHKGKSGP NTQNEDIEKEVKEEIDPDEEESAKKKHLDKKRKLKEMFDAEYDEGESTYFDDLKGEMQ KQAQLNRAEFEDQDDEARVQYEGFRPGMYVRIEIENVPCEFVQNFDPHYPIILGGLGN SEGNVGYVQMRLKKHRWYKKILKSRDPIIFSVGWRRFQTIPLYYIEDHNGRQRLLKYT PQHMHCGAAFWGPITPQGTGFLAIQSVSGIMPDFRIAATGVVLDLDKSIKIVKKLKLT GFPYKIFKNTSFIKGMFNSALEVAKFEGAVIRTVSGIRGQIKKALRAPEGAFRASFED KLLMSDIVFMRTWYPVSIPAFYNPVTSLLKPVGEKDTWSGMRTTGQLRLAHGVRLKAN KDSLYKPILRQKKHFNSLHIPKALQKALPFKNKPKTQAKAGKVPKDRRRPAVIREPHE RKILALLDALSTVHSQKMKKAKEQRHLHNKEHFRAKQKEEEEKLKRQKDLRKKLFRIQ GQKERRNQKSSLKGAEGQLQ" 3'UTR 3877..4181 BASE COUNT 1308 a 806 c 1111 g 956 t ORIGIN 1 gttacttgtt attggtaaat agccactatg gaggctaagg accagaagaa acacagaaag 61 aaaaacagtg gacccaaagc tgcaaagaaa aagaagcggc ttctgcagga tctccagcta 121 ggagacgaag aagatgcccg gaagagaaat cccaaagctt ttgcagttca gtctgctgtg 181 cggatggctc gatcctttca caggactcag gatttgaaga caaaaaagca tcatattcca 241 gtggttgatc gaactccact agagccccca ccaatagtgg tagtggtgat gggacctcca 301 aaagttggaa agagcacttt gatacaatgc ctcattcgga actttacccg gcagaagttg 361 accgagatca gaggccctgt gacgattgtg tcaggtaaaa agcgcagact caccattatt 421 gaatgtgggt gtgacattaa catgatgatt gatctggcta aagtagcaga tctggtactg 481 atgcttatag atgccagctt tgggtttgaa atggaaacgt ttgagtttct aaacatctgt 541 caagtacatg gctttcctaa aattatggga gttctcaccc acctcgactc cttcaagcat 601 aataagcaac tgaagaagac aaagaagcga ttaaaacaca ggttctggac ggaagtttac 661 ccgggtgcca agctgttcta cctttctgga atggtgcatg gagaatatca aaaccaagaa 721 atccacaatc tgggccgttt tattacagtt atgaagttta ggcctctcac atggcaaact 781 tctcaccctt atatcctggc agacaggatg gaagatttga caaacccaga ggatatccga 841 acaaacatca aatgtgaccg gaaggtgtca ctttatggtt atttaagagg agcacacttg 901 aaaaataaaa gccaaattca catgccaggg gtaggagatt ttgccgtgag tgacatcagt 961 ttcctcccag acccttgcgc tcttcctgaa caacaaaaga agcgctgttt aaatgagaag 1021 gagaagctgg tttatgcgcc tctttctgga gttgggggtg tgctgtatga caaagacgct 1081 gtctatgttg accttggtgg cagccacgtt tttcaggatg aagtggggcc cacccatgag 1141 ctggtccaga gtctcatctc tacccactcc accattgatg ccaagatggc ttcaagtcga 1201 gtgacgctgt tttctgattc caaaccactt gggtcagagg atatagataa tcaagggcta 1261 atgatgccaa aggaggaaaa acaaatggac ttgaacactg gtcgaatgcg tcggaaagcc 1321 attttcggag atgaagatga atctggagat agtgatgatg aagaagatga tgaaatgtct 1381 gaagatgacg ggttggaaaa cggctctagt gatgaggaag cagaagagga ggaaaatgct 1441 gagatgactg atcagtatat ggctgttaag ggcatcaaac gacggaaact tgagttggaa 1501 gaagacagtg aaatggattt gccagcattt gctgacagtg acgatgacct tgagaggagc 1561 tcagcggaag aaggggaagc ggaggaagct gatgaaagca gtgaagaaga ggactgcact 1621 gcaggagaga agggcatttc aggatcaaag gctgctggag aaggtagtaa agcagggctg 1681 tcaccagcta attgccagag tgaccgtgtg aatctggaga agtctttgct gatgaagaaa 1741 gcagctctcc ccactttcga ttctgggcat tgcacagctg aagaggtgtt tgcatctgaa 1801 gatgaatctg aagaaagctc ctcactcagt gcagaggaag aagactcaga aaatgaagag 1861 gctattagaa aaaagctttc aaagccttct caagtgagca gtggtcagaa actggggcca 1921 cagaacttca ttgatgagac cagtgatata gaaaatttac tcaaagagga agaagattac 1981 aaggaagaaa ataatgattc caaagaaacg tcaggtgccc tcaagtggaa ggaagacctt 2041 tccagaaagg cagctgaggc ctttctgagg cagcagcaag cagctccaaa cctccgaaag 2101 cttatttatg ggacagtgac agaagataat gaagaagaag atgatgatac tctagaagag 2161 cttggagggt tgtttcgtgt caaccagcct gacagagagt gtaagcacaa ggctgactct 2221 ttggactgct ccagatttct tgtggaggcc ccccatgact gggatttaga ggaggttatg 2281 aacagtatca gagattgctt cgtgactgga aagtgggaag atgataaaga tgcagccaag 2341 gtcttagcag aagatgagga gctctacggt gactttgaag acttggaaac aggggacgtg 2401 cacaagggaa aatcaggccc caatactcag aatgaagata tagagaaaga agttaaggaa 2461 gaaattgacc ccgacgaaga agaaagtgcc aagaaaaagc atttggataa gaagagaaaa 2521 ttgaaggaga tgtttgatgc agaatatgat gaaggagaaa gcacatattt tgatgatctt 2581 aaaggagaaa tgcagaaaca agcacagctg aatcgcgcag aatttgaaga tcaagatgat 2641 gaagccagag ttcagtatga gggttttcga cctgggatgt acgtccgcat tgagattgaa 2701 aatgttccct gtgaatttgt gcagaacttt gacccccatt accccattat cctgggtggc 2761 ttgggcaaca gtgagggaaa tgttggctac gtgcagatgc gtctgaagaa acatcgctgg 2821 tataagaaaa tcctcaagtc ccgagatcca atcatatttt ctgtagggtg gaggaggttt 2881 cagaccatcc cactgtatta tatcgaagac cacaatggaa gacaaaggct tctaaagtat 2941 accccacagc acatgcattg cggagcagcc ttttggggcc ctatcactcc acagggaact 3001 ggtttcttgg caatacagtc tgtcagtggc ataatgcctg attttcggat agctgctaca 3061 ggagttgtcc ttgatctgga taaatccata aaaattgtga agaaattaaa gctaactggt 3121 tttccatata aaattttcaa gaacacttca tttattaagg gaatgtttaa ttctgccttg 3181 gaagtggcca aatttgaagg tgctgtgatt cgaacagtca gtgggataag ggggcagatc 3241 aagaaagcac tccgagctcc agaaggagct ttcagggcca gctttgagga taagctgctg 3301 atgagcgata ttgtcttcat gcgaacttgg tatcctgttt ccatcccagc gttctataac 3361 ccagtaacat ctttgttgaa accagtgggt gagaaagaca cctggtcagg aatgcggacc 3421 acgggccaac tcaggctcgc ccatggcgtc agactaaagg cgaacaagga ctctctgtat 3481 aagccaatcc tgaggcaaaa gaaacatttt aattcactgc acattccaaa agccttgcag 3541 aaggccctgc catttaagaa caagcccaag acccaagcaa aggcaggcaa ggtgccaaag 3601 gacaggcgga gaccggccgt catacgcgag cctcatgaaa gaaagatcct tgcactgctg 3661 gatgctctga gtacggtgca tagtcagaag atgaagaagg ccaaggagca gcggcacctg 3721 cacaataaag agcacttcag agccaagcag aaggaggagg aggagaagct gaagcggcag 3781 aaggacctca ggaagaagct cttcagaatt caggggcaga aggaaagaag aaaccagaag 3841 tccagtttga agggggctga gggccaattg cagtgagcct ttggactgga gggactgtcc 3901 ctggatctgc ggaggtagac agtttcaaac atcacagttt gaatgcctgt gaatgacacg 3961 tcagtgggaa agagctcaag agatgtctct actcaaactg tgcctgcagg aggaggaaca 4021 gagaagcctg ggctgctggg actgggttca ttctcatgac ttggggctgt cgagatttaa 4081 agtgatgtaa gctgtggtta tgtggattct cttactttcc tctgcctgcc tcagtttaat 4141 tattttggcc tacagaaata tcattaaaat atttttttgt t // LOCUS D80011 4824 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0189 gene, complete cds. ACCESSION D80011 NID g1663691 KEYWORDS KIAA0189. SOURCE Homo sapiens male myeloblast cell_line:KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4824) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (12-DEC-1995) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 4824) AUTHORS Nagase,T., Seki,N., Tanaka,A., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA 0161 - KIAA 0200) deduced by analysis of cDNA clones from human cell line KG1 JOURNAL DNA Research (1995) In press REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Tanaka,A. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA0161-KIAA0200) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 3 (1), 17-24 (1996) MEDLINE 96281124 FEATURES Location/Qualifiers source 1..4824 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..337 gene 338..3409 /gene="KIAA0189" CDS 338..3409 /gene="KIAA0189" /note="similar to rat rhoGAP." /citation=[3] /codon_start=1 /db_xref="PID:d1012172" /db_xref="PID:g1663692" /translation="MTLNNCASMKLEVHFQSKQNEDSEEEEQCTISSHWAFQQESKCW SPMGSSDLLAPPSPGLPATSSCESVLTELSATSLPVITVSLPPEPADLPLPGRAPSSS DRPLLSPTQGQEGPQDKAKKRHRNRSFLKHLESLRRKEKSGSQQAEPKHSPATSEKVS KASSFRSCRGFLSAGFYRAKNWAATSAGGSGANTRKAWEAWPVASFRHPQWTHRGDCL VHVPGDHKPGTFPRSLSIESLCPEDGHRLADWQPGRRWGCEGRRGSCGSTGSHASTYD NLPELYPAEPVMVGAEAEDEDDEESGGSYAHLDDILQHVWGLQQRVELWSRAMYPDLG PGDEEEEEATSSVEIATVEVKCQAEALSQMEVPAHGESPAWAQAEVQPAVLAPAQAPA EAEPVAQEEAEAPAPAPAPAPAQDSEQEAHSGGEPTFASSLSVEEGHSISDTVASSSE LDSSGNSMNEAEAAGSLAGLQASMPRERRDSGVGASLTRPCRKLRWHSFQNSHRPSLN SESLEINRQFAGQINLLHKGSLLRLTAFMEKYTVPHKQGWVWSMPKFMRRNKTPDYRG QHVFGVPPLIHVQRTGQPLPQSIQQAMRYLRSQCLDQVGIFRKSGVKSRIQNLRQMNE TSPDNVCYEGQSAYDVADLLKQYFRDLPEPIFTSKLTTTFLQIYQLLPKDQWLAAAQA ATLLLPDENREVLQTLLYFLSDIASAEENQMTAGNLAVCLAPSIFHLNVSKKDSPSPR IKSKRSLIGRPGPRDLSDNMAATQGLSHMISDCKKLFQVPQDMVLQLCSSYSAAELSP PGPALAELRQAQAAGVSLSLYMEENIQDLLRDAAERFKGWMSVPGPQHTELACRKAPD GHPLRLWKASTEVAAPPAVVLHRVLRERALWDEDLLRAQVLEALMPGVELYHYVTDSM APHPCRDFVVLRMWRSDLPRGGCLLVSQSLDPEQPVPESGVRALMLTSQYLMEPCGLG RSRLTHICRADLRGRSPDWYNKVFGHLCAMEVAKIRDSFPTLQAAGPETKL" 3'UTR 3410..4824 BASE COUNT 1063 a 1412 c 1368 g 981 t ORIGIN 1 ctcgggttga caacggctaa gcagccgcgg cagctgctct tctggctggc accgtcaccc 61 ctgcccgacc ccaggccccg cgtgtgtccc ggacggagca gtccctcctg ggctcgcctc 121 ctgcccgctc gctctgtgtc agcgcccgaa cccaaagggg tggacgcggc ctccagaagc 181 cgaggccaaa agagcatgtg agtggcttca agcaacagga ttccctcagt atgtgcagct 241 ttttgaagaa ggttcgtttc ccctggatat tggctctgtg aagaaaaacc acggttttct 301 ggacgaggac tctttggggg ccctgtgtag gaggctgatg accttgaata attgtgcctc 361 gatgaaactg gaggttcatt ttcaaagcaa gcagaatgaa gactcagaag aggaagagca 421 gtgtaccatc agtagccact gggccttcca gcaggaaagt aagtgctggt ctcctatggg 481 gtcctctgat ctgttggccc caccgagccc tggcctgcca gcgacctcaa gctgtgagag 541 cgtcctcacc gagcttagtg ccacctctct gccagtcatc accgtgagcc taccacccga 601 gccagcagac ttgcccttgc caggccgtgc ccccagctcg agtgaccggc ccctcctcag 661 ccccacccag ggccaggagg gtccccagga caaagccaag aagcgccatc gtaaccgtag 721 cttcctcaag caccttgaat ctctgaggcg gaaggaaaag agtggcagcc agcaagcaga 781 gcccaagcat agtccagcca cctcagagaa ggtctccaaa gcctcatctt tccgcagttg 841 tcgtggcttc ctctcagctg gattttacag ggccaagaac tgggccgcca cctcagccgg 901 tggcagtggt gccaatactc ggaaggcctg ggaggcctgg cctgtggcct cgttccggca 961 tcctcagtgg acacaccggg gtgattgcct ggtgcacgtt cctggggacc acaaaccagg 1021 cacattccct cgctccctgt ccattgagag cctgtgtcct gaggatggac accgcctggc 1081 agactggcag ccaggtaggc ggtggggctg tgaggggcgc cggggctcct gtggctcaac 1141 gggcagccat gccagcacgt atgacaactt gcctgagctg tacccagctg agcctgtaat 1201 ggttggggct gaggctgaag atgaagatga tgaggagagt gggggcagct atgctcacct 1261 agacgacatc ctccagcacg tgtgggggct acagcaacga gtagagctgt ggtctcgggc 1321 catgtaccca gacctggggc ctggagatga ggaagaggag gaggccactt catcagtaga 1381 aatagccaca gttgaggtca aatgccaagc tgaggctctc agccagatgg aggttccggc 1441 ccatggagag tccccagcct gggcccaggc tgaagtccag ccagcagtcc tggctccggc 1501 tcaggctcca gctgaggctg aaccagtggc acaggaagag gctgaggccc cggccccagc 1561 cccggccccg gccccagccc aggacagtga gcaggaggca cattcaggcg gggaacccac 1621 ctttgcctct agcctgtctg tggaagaagg acactccatt tctgacactg tggcctcctc 1681 cagcgaactt gacagtagtg ggaactccat gaatgaggct gaggctgcgg ggtccctggc 1741 tggactccag gcatcaatgc cccgtgaacg gcgcgattca ggtgttgggg cctcacttac 1801 cagaccctgc aggaagctcc gttggcatag cttccagaac tcccatcgtc ccagcctcaa 1861 ctcagagtcg ctggagatca accggcagtt tgcaggccag atcaacctcc tgcacaaggg 1921 ctcactgctg cggcttaccg cgttcatgga gaagtacact gtgccccaca agcagggctg 1981 ggtctggtca atgcccaagt tcatgaggag gaacaagacc ccagattacc ggggacagca 2041 cgtatttggg gtgccacccc tcatccacgt gcagcgcacg ggccagccac tgccacagag 2101 cattcagcaa gccatgcgct acttgcgcag ccagtgcctg gaccaagtag gcatcttccg 2161 caagtctggg gtcaagtcca ggatccagaa cctgcgtcaa atgaatgaga cctcgcctga 2221 caatgtctgc tacgagggcc agtcagccta cgacgtggct gacctgctaa agcagtattt 2281 ccgggacctg cctgagccca tcttcaccag caagctcacc accactttcc tccagatcta 2341 ccagctcctc cccaaggatc agtggttggc agcagcacaa gccgccacct tgctgctccc 2401 cgatgagaac cgagaggtgc tacagaccct gctctacttc ttaagtgaca ttgcctctgc 2461 cgaggaaaac cagatgacag caggcaacct ggcagtgtgc ctggcgccct ccatcttcca 2521 cctcaatgtc tctaagaagg atagcccctc tcccaggatc aagagcaaac gcagcctcat 2581 tggcaggcca ggccctaggg acctgagtga caacatggca gccacccagg gcctgtcgca 2641 catgatcagt gactgcaaga aacttttcca ggtgccccag gacatggtgc tgcagctgtg 2701 cagctcctac agcgcagctg agctcagccc tcccggccca gccctggctg agctgcgtca 2761 ggcccaagct gcaggggtaa gcctgagcct ctacatggaa gagaatatcc aggacctgct 2821 gcgtgatgct gctgagcgct tcaagggctg gatgagcgtg ccagggcccc agcacacgga 2881 gctggcttgc aggaaggcac cggatgggca ccccctgcgg ctatggaagg catccacaga 2941 ggtggcagcc cccccagctg tggtgctgca tcgtgttctc cgggagcggg ccctctggga 3001 tgaggatctg ctgcgggccc aggtgctgga agccctgatg ccgggtgtgg agctgtacca 3061 ctatgtcacc gacagcatgg caccccatcc ctgccgcgac tttgtggtgc ttcggatgtg 3121 gcgctctgac ctgcctcgtg ggggttgcct gcttgtctcc cagtccctgg atccggaaca 3181 acctgtgcca gagtcgggtg tgcgagccct catgctcaca tcccagtacc tcatggagcc 3241 ttgcggcttg ggccgctctc ggctcacaca catctgccgg gctgacctca ggggccgttc 3301 tcctgactgg tacaacaaag tctttggaca cctgtgtgcc atggaagtgg caaagatccg 3361 ggactccttc cccaccctgc aggcagcggg ccctgagaca aagctgtgag ccttgggctg 3421 gtcccagggt ggcaccaccc aggccccctg ggcaccaagg gagcgagggg gaataagagc 3481 agggcagccc cctgggtgcc gctgtcagga gcagagccag gcccaggtgg ctccagctgc 3541 ctgtcctgtc ccctttccta aagctcctct gcacatagag gggagaaaaa gagaatttag 3601 gcaactccac tcccccttca cccccaaccc tgtattctac tctcccgaaa agagaagaga 3661 atcgcatgag tagcaagact gctgccacca gccacctgct tgtgaggccg ccacttggca 3721 tgaagcctcc acagctcccc gcctgcaggg gcaaagaggt cgacagcaat gtgtgatccc 3781 agctctctgc cagactgaga gggcaagccg tcttgtttgc tgcaaggatg cttttgaggt 3841 tggacaggag gttctggtcc tgcctttggg gccaacgctg gctctgaagt gtctttttca 3901 gaggaattgg actggagtga atgggcacag gggtggagcg cagggcagcc ccagtcacca 3961 cgagctgttt catttgtgta aatacgatgc tgaattttat gaggctgagt taagagtggg 4021 cactgacggg cccctaatat gtgacatgac gatttggcat agatagggat gtgagagtgg 4081 agtaccttcc tttctcaagt ctcgagatgc cagtgaaacc agtattccct gacttgggtt 4141 tcacacttta tcgaccaccc catggggtcc tgtgtagcct ttggccacgc actgacactg 4201 cccaggccaa gcaaaagcag agtgtggtta agagacaagg gtctatgtca tcatcccttt 4261 tgcttatggt agtgggctaa cataacagcc cctttctcaa gcagagacct ggcccctggg 4321 ccagccagat ggaagggcct atcttagcca gctggagctt aagccaaacc tgttctgtcc 4381 cactggccag ccatctttgc tcacatggaa attcaaatgc cattcaaagg ccactggtgc 4441 tttatttttc tatctgctga gtaaactgaa atgagatggg tcccaaccac cactgtaatt 4501 ttcaagccta ttttatttgt acctgtaaat actgtacagc taatatatat atatatatat 4561 atatatatat gtgtgtgtgt gtgtgtgtgt atgtgtgttt atagagatac acacacatat 4621 atatgtgtgt atatatatac acatacatat atatacacac acgcatttgc acagacacac 4681 acatatatca attctcatga gtgtattata atctctggtg ggggcaagtg tctggaaggc 4741 ctgaggggca cttcagatga gaatggagag gtagggagcc aggtgcagca ggatccctca 4801 aatcaataaa gcattaccag agat // LOCUS D82060 2330 bp mRNA PRI 16-OCT-1996 DEFINITION Human kidney mRNA for putative membrane protein with histidine rich charge clusters, complete cds. ACCESSION D82060 NID g1616917 KEYWORDS putative membrane protein with histidine rich charge clusters. SOURCE Homo sapiens kidney cDNA to mRNA, clone_lib:kidney cDNA library clone:pKE610, pKE606, pKE604. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ando,A., Kikuti,Y.Y., Shigenari,A., Kawata,H., Okamoto,N., Shiina,T., Chen,L., Ikemura,T., Abe,K., Kimura,M. and Inoko,H. TITLE cDNA cloning of the human homologues of the mouse Ke4 and Ke6 genes at the centromeric end of the human MHC region JOURNAL Genomics 35 (3), 600-602 (1996) MEDLINE 97001166 REFERENCE 2 (bases 1 to 2330) AUTHORS Ando,A. JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 2330) AUTHORS Ando,A. TITLE Direct Submission JOURNAL Submitted (18-DEC-1995) to the DDBJ/EMBL/GenBank databases. Asako Ando, Tokai University School of Medicine, Department of Molecular Life Science; Bohseidai, Isehara, Kanagawa 259-11, Japan (E-mail:aando@is.icc.u-tokai.ac.jp, Tel:81-463-93-1121(ex.2563), Fax:81-463-94-8884) FEATURES Location/Qualifiers source 1..2330 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6q21.3" /clone="pKE610, pKE606, pKE604" /clone_lib="kidney cDNA library" /tissue_type="kidney" CDS 298..1587 /codon_start=1 /product="putative membrane protein with histidine rich charge clusters" /db_xref="PID:d1012195" /db_xref="PID:g1616918" /translation="MARGLGGPHWVAVGLLTWATLGLLVAGLGGHDDLHDDLQEDFHG HSHRHSHEDFHHGHSHAHGHGHTHESIWHGHTHDHDHGHSHEDLHHGHSHGYSHESLY HRGHGHDHEHSHGGYGESGAPGIKQDLDAVTLWAYALGATVLISAAPFFVLFLIPVES NSPRHRSLLQILLSFASGGLLGDAFLHLIPHALEPHSHHTLEQPGHGHSHSGQGPILS VGLWVLSGIVAFLVVEKFVRHVKGGHGHSHGHGHAHSHTRGSHGHGRQERSTKEKQSS EEEGKETRGVQKRRGGSTVPKDGPVRPQNAEEEKRGLDLRVSGYLNLAADLAHNFTDG LAIGASFRGGRGLGILTTMTVLLHEVPHEVGDFAILVQSGCTKKQAMRLQLLTAVGAL AGTAVPFSLKEEQWTVKLQVVQVLAGSCHLLQVALST" polyA_signal 2304..2309 /citation=[2] BASE COUNT 525 a 563 c 717 g 525 t ORIGIN 1 tctgtttttt ctctaccatc ctttccaggc cttttcctca cctaatgagt cgtagagacg 61 agggcccaga gagtctgtaa agtggctggt gaaagattag tgtcccaggg ccctacatcc 121 gggaggtggt tcgggataaa gagaactagt cttgggaaca atgtaggtgg gaacttaagg 181 gaatgggaga gcggcccata gaggtggacg gagggcgcga ttggagtaaa gcggaccctg 241 tgtaggtata gagttgagtc aagtggagtc actgcctctg tccctctggt cagcgtgatg 301 gccagaggcc tggggggccc ccactgggtg gccgtgggac tgctgacctg ggcgaccttg 361 gggcttctgg tggctggact cgggggtcat gacgacctgc acgacgatct gcaagaggac 421 ttccatggcc acagccacag gcactcacat gaagatttcc accatggtca cagccatgcc 481 catggtcatg gccacactca cgagagcatc tggcatggac atacccacga tcacgaccat 541 ggacattcac atgaggattt acaccatggc catagccatg gctactccca tgagagcctc 601 taccacagag gacatggaca tgaccatgag catagccatg gaggctatgg ggagtctggg 661 gctccaggca tcaagcagga cctggatgct gtcactctct gggcttatgc actgggggcc 721 acagtgctga tctcagcagc tccatttttt gtcctcttcc ttatccccgt ggagtcgaac 781 tctccccggc atcgctctct acttcagatc ttgctcagtt ttgcttccgg tgggctcctg 841 ggagatgctt tcctgcacct cattcctcat gctcttgaac ctcattctca ccacactctg 901 gagcaacccg gacatggaca ctcccacagt ggccagggcc ccattctgtc tgtgggcctg 961 tgggttctca gtggaattgt tgcctttctt gtcgtggaga aatttgtgag acatgtgaaa 1021 ggaggacatg gtcacagtca tggacatgga cacgctcaca gtcatacacg tggaagtcat 1081 ggacatggaa gacaagagcg ttctaccaag gagaagcaga gctcagagga agaaggaaag 1141 gaaacaagag gggttcagaa gaggcgagga gggagcacag tacccaaaga tgggccagtg 1201 agacctcaga acgctgaaga agaaaaaaga ggcttagacc tgcgtgtgtc ggggtacctg 1261 aatctggctg ctgacttggc acacaacttc actgatggtc tggccattgg ggcttccttt 1321 cgagggggcc ggggactagg gatcctgacc acaatgactg tcctgctaca tgaagtgccc 1381 cacgaggtcg gggactttgc catcttggtc cagtctggct gcaccaaaaa gcaggcgatg 1441 cgtctgcaac tactgacagc agtaggggca ctggcaggca cagctgtgcc cttctcactg 1501 aaggaggagc agtggacagt gaaattgcag gtggtgcagg tcctggctgg gtcctgccat 1561 ttactgcagg tggctttatc tacgtagcaa cagtgtctgt gttgcccgag ctgctgaggg 1621 aggcatcacc attgcaatca cttctggagg tgctggggct gctgggggga gttatcatga 1681 tggtgctgat tgcccacctt gagtgagggg tggataaact accctgcccc aaacctctac 1741 ccctaactcc aggtcagggg tgcgtagagg ttgggggccc tggccaggga catctgccaa 1801 aggaaggaac tgtagcctgg gagcaatggt tactttggca ttagggcctt caagggctgg 1861 cagtcttaca gaggctggag cggtgagaat gagaggccag agggaccata gtgttgggca 1921 ctgtctgacc atgttgcatt tggaaggcta aatggggcca tgaagaaggc tggaagggac 1981 agggggtgat ggcagcctac ctggtgtccc ctaccccacc tgttctcgga gaaccaagtt 2041 gctacacagg aagttctcca aggtccagtt tcctttctcc caccagttgg tggaggcttc 2101 agggaagacc agagtcctgg acagagaggg taacaggagg agtcggggat aaacatcaaa 2161 catcaatcgt gtgtcctgat ttgggagtga ttggggggat ggggtgggag agggttaatt 2221 ggtattctca tggcctgatt ttttttgttt ctattccttt tatatcactg tgtttgaatc 2281 gagggggagg ggtggtaacc ggaaataaag acctccgatc ttccgccccc // LOCUS D82070 868 bp mRNA PRI 19-FEB-1997 DEFINITION Human aC1 mRNA, complete cds. ACCESSION D82070 NID g1845550 KEYWORDS aC1. SOURCE Homo sapiens neuroblastoma cell_line:SH-SY5Y cDNA to mRNA, clone:aC1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Kito,K., Ito,T. and Sakaki,Y. TITLE Fluorescent differential display analysis of gene expression in differentiating neuroblastoma cells JOURNAL Gene 184 (1), 73-81 (1997) MEDLINE 97169148 REFERENCE 2 (bases 1 to 868) AUTHORS Kito,K., Ito,T. and Sakaki,Y. TITLE Fluorescent differential display analysis of gene expression in differentiating neuroblastoma cells JOURNAL Unpublished (1995) REFERENCE 3 (bases 1 to 868) AUTHORS Ito,T. TITLE Direct Submission JOURNAL Submitted (19-DEC-1995) to the DDBJ/EMBL/GenBank databases. Takashi Ito, Institute of Medical Science, University of Tokyo, Human Genome Center; 4-6-1 Shirokanedai, Minato-ku, Tokyo 108, Japan (E-mail:tito@hgc.ims.u-tokyo.ac.jp, Tel:03-5449-5623, Fax:03-5449-5445) FEATURES Location/Qualifiers source 1..868 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="SH-SY5Y" /cell_type="neuroblastoma" /clone="aC1" gene 176..457 /gene="aC1" CDS 176..457 /gene="aC1" /codon_start=1 /db_xref="PID:d1012201" /db_xref="PID:g1845551" /translation="MDTQKQIHKTHNSKNQFFTIFFFLSVEFGKEGTRKNFYLLLSIG HYGRKSRRADLGTADTADKTEPECFAASWTFDPNPSVTVSGAHSTAVHQ" polyA_signal 830..835 BASE COUNT 274 a 211 c 180 g 203 t ORIGIN 1 ctgtttcaac catatccttt caaaccagat cagtgaggtc atgaccagaa aacaagccct 61 gccagcctcc tacctcaaat ctaattaatt ataattttct tccttatgac aacccacaca 121 aaagacagag ataagaaaaa caaggacttc ctgggaggct gtggatcaat taccaatgga 181 cacccagaag caaattcaca agactcacaa ttcaaagaac caatttttta caattttttt 241 tttcctgtca gttgaatttg ggaaggaagg aacacgcaaa aatttttacc ttcttctttc 301 aattggacac tatggacgga aatccaggag agctgacctt ggaactgcag acactgcaga 361 taaaacagag ccagaatgct ttgctgccag ctggaccttt gacccaaacc ccagtgtgac 421 tgtgtccggt gctcactcaa ctgcagtgca tcaatgaagg aaaaaagaac tgagcattgg 481 caaaaagctg aggatgacaa gcttagggga tgaaaggctg ccttttcccc cccttctgag 541 cgtttctgac agctcccagc tgggagaaac caacatgttg aaaagacaaa gaatactgga 601 gaagagagag ggtgggcaga gcacaacctc atcctcccag tggttcctct tgagtttgat 661 ttgacaagat gccttcccac ccaggtaacc ccgtggaacg tgccagtacc tcaccgcacg 721 acctcactga gtcctcacaa caaatccagg ctgcagattt ttttccccac ttggagatga 781 caaattgaaa tgcaggaagg ttaaagagtt tgcctgaggt tgcttagata ataaaagaat 841 ctggattaga acccagacct acccagct // LOCUS D82343 1009 bp mRNA PRI 29-NOV-1997 DEFINITION Homo sapiens mRNA for AMY, complete cds. ACCESSION D82343 NID g1841335 KEYWORDS AMY. SOURCE Homo sapiens Neuroblastoma cell_line:IMR-32 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Yokoyama,M., Nishi,Y., Yoshii,J., Okubo,K. and Matsubara,K. TITLE Identification and cloning of neuroblastoma-specific and nerve tissue-specific genes through compiled expression profiles JOURNAL DNA Res. 3 (5), 311-320 (1996) MEDLINE 97191543 REFERENCE 2 (bases 1 to 1009) AUTHORS Yokoyama,M. TITLE Direct Submission JOURNAL Submitted (21-DEC-1995) to the DDBJ/EMBL/GenBank databases. Masahiro Yokoyama, Japan Tobacco, Inc., Pharmaceutical Frontier Research Laboratories; 13-2 Fukuura 1-chome, Kanazawa-ku, Yokohama, Kanagawa 236, Japan (E-mail:yokoyama@ikrl.jti.co.jp, Tel:045-786-7694, Fax:045-786-7692) FEATURES Location/Qualifiers source 1..1009 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="IMR-32" /cell_type="Neuroblastoma" CDS 286..693 /codon_start=1 /product="AMY" /db_xref="PID:d1012221" /db_xref="PID:g1841336" /translation="MPGRWRWQRDMHPARKLLSLLFLILMGTELTQVLPTNPEESWQV YSSAQDSEGRCICTVVAPQQTMCSRDARTKQLRQLLEKVQNMSQSIEVLDRRTQRDLQ YVEKMENQMKGLESKFKQVEESHKQHLARQFKG" polyA_site 1009 BASE COUNT 261 a 282 c 286 g 180 t ORIGIN 1 gcgcggggga gccattagga ggcgaggaga gaggagggcg cagctcccgc ccagcccagc 61 cctgcccagc cctgcccgga ggcagacgcg ccggaaccgg gacgcgataa atatgcagag 121 cggaggcttc gcgcagcaga gcccgcgcgc cgcccgctcc gggtgctgaa tccaggcgtg 181 gggacacgag ccaggcgccg ccgccggagc cagcggagcc ggggccagag ccggagcgcg 241 tccgcgtcca cgcagccgcc ggccggccag cacccagggc cctgcatgcc aggtcgttgg 301 aggtggcagc gagacatgca cccggcccgg aagctcctca gcctcctctt cctcatcctg 361 atgggcactg aactcactca agtgctgccc accaaccctg aggagagctg gcaggtgtac 421 agctctgccc aggacagcga gggcaggtgt atctgcacag tggtcgcccc acagcagacc 481 atgtgttcac gggatgcccg cacaaaacag ctgaggcagc tactggagaa ggtgcagaac 541 atgtctcaat ccatagaggt cttggacagg cggacccaga gagacttgca gtacgtggag 601 aagatggaga accaaatgaa aggactggag tccaagttca aacaggtgga ggagagtcat 661 aagcaacacc tggccaggca gtttaagggc taacttaaaa gagttttttc aatgctgcag 721 tgactgaaga agcagtccac tcccatgtaa ccatgaaaga gagccagaga gctttttgca 781 ccatgcattt ttactattat tttccaatac ttagcaccat ttcactaagg aaccttgaat 841 acaaccagga tcctcctttg catgcgactg tagctgcatt tcatgaatag tttgaaccct 901 tgtcaatgca ttttttgaaa aagaaagaaa aaaaaaactt cgtgtatgtg actcaaagca 961 tgtaacctta agatgttgca ttctaaactg acaataaaga cctttcccc // LOCUS D82344 3029 bp mRNA PRI 29-NOV-1997 DEFINITION Homo sapiens mRNA for NBPhox, complete cds. ACCESSION D82344 NID g1841337 KEYWORDS NBPhox. SOURCE Homo sapiens Neuroblastoma cell_line:IMR-32 cDNA to mRNA, clone:NBPhox. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Yokoyama,M., Nishi,Y., Yoshii,J., Okubo,K. and Matsubara,K. TITLE Identification and cloning of neuroblastoma-specific and nerve tissue-specific genes through compiled expression profiles JOURNAL DNA Res. 3 (5), 311-320 (1996) MEDLINE 97191543 REFERENCE 2 (bases 1 to 3029) AUTHORS Yokoyama,M. TITLE Direct Submission JOURNAL Submitted (21-DEC-1995) to the DDBJ/EMBL/GenBank databases. Masahiro Yokoyama, Japan Tobacco, Inc., Pharmaceutical Frontier Research Laboratories; 13-2 Fukuura 1-chome, Kanazawa-ku, Yokohama, Kanagawa 236, Japan (E-mail:yokoyama@ikrl.jti.co.jp, Tel:045-786-7694, Fax:045-786-7692) FEATURES Location/Qualifiers source 1..3029 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="IMR-32" /cell_type="Neuroblastoma" /clone="NBPhox" CDS 361..1305 /codon_start=1 /product="NBPhox" /db_xref="PID:d1012222" /db_xref="PID:g1841338" /translation="MYKMEYSYLNSSAYESCMAGMDTSSLASAYADFSSCSQASGFQY NPIRTTFGATSGCPSLTPGSCSLGTLRDHQSSPYAAVPYKLFTDHGGLNEKRKQRRIR TTFTSAQLKELERVFAETHYPDIYTREELALKIDLTEARVQVWFQNRRAKFRKQERAA AAAAAAAKNGSSGKKSDSSRDDESKEAKSTDPDSTGGPGPNPNPTPSCGANGGGGGGP SPAGAPGAAGPGGPGGEPGKGGAAAAAAAAAAAAAAAAAAAAGGLAAAGGPGQGWAPG PGPITSIPDSLGGPFGSVLSSLQRPNGAKAALVKSSMF" BASE COUNT 692 a 816 c 800 g 721 t ORIGIN 1 ttaaattcta attagagatg caggaatcaa tgatagggag gttggacagc tcagttcccc 61 agtgccagcc caatagacgg atgagttatt gtcatgtaaa aagcgccagc aataagacca 121 accgctttgc tattgtccaa gtggaaagag ccaagtttat tatgaggact atatgctcta 181 gagacctcag acaaggcatc tcataggagg ctttttcata aaactaggct ctgctggtag 241 taaggaggcc agtttggagg caggcgttga gctgtgcaca tctccccact ccagccacct 301 tctccatatc catcttttat ttcatttttc cacttggctg agccatccag aaccttttca 361 atgtataaaa tggaatattc ttacctcaat tcctctgcct acgagtcctg tatggctggg 421 atggacacct cgagcctggc ttcagcctat gctgacttca gttcctgcag ccaggccagt 481 ggcttccagt ataacccgat aaggaccact tttggggcca cgtccggctg cccttccctc 541 acgccgggat cctgcagcct gggcaccctc agggaccacc agagcagtcc gtacgccgca 601 gttccttaca aactcttcac ggaccacggc ggcctcaacg agaagcgcaa gcagcggcgc 661 atccgcacca ctttcaccag tgcccagctc aaagagctgg aaagggtctt cgcggagact 721 cactaccccg acatctacac tcgggaggag ctggccctga agatcgacct cacagaggcg 781 cgagtccagg tgtggttcca gaaccgccgc gccaagtttc gcaagcagga gcgcgcagcg 841 gcagccgcag cggccgcggc caagaacggc tcctcgggca aaaagtctga ctcttccagg 901 gacgacgaga gcaaagaggc caagagcact gacccggaca gcactggggg cccaggtccc 961 aatcccaacc ccacccccag ctgcggggcg aatggaggcg gcggcggcgg gcccagcccg 1021 gctggagctc cgggggcggc ggggcccggg ggcccgggag gcgaacccgg caagggcggc 1081 gcagcagcag cggcggcggc cgcggcagcg gcggcggcgg cagcggcagc ggcggcagct 1141 ggaggcctgg ctgcggctgg gggccctgga caaggctggg ctcccggccc cggccccatc 1201 acctccatcc cggattcgct tgggggtccc ttcggcagcg tcctatcttc gctccaaaga 1261 cccaacggtg ccaaagccgc cttagtgaag agcagtatgt tctgatctgg aatcctgcgg 1321 cggcggcggc ggcggcgaca gcgggcgagc cagggcccgg gcgggcgagt gggcgagcgg 1381 gtaggcccaa ggctattgtc gtcgctgctg ccatggcttt ttcattgagg gcctaaagta 1441 atcgcgctaa gaataaaggg aaaacggcgt cgccctcatt tcaaccccac tcctaccccc 1501 ttcctcaacc cccaaacaaa acaaacaaac ttccctggct tcgcacctgc ctggggcctc 1561 gcagcggggc cagggctccg cctgctgatc gggggttgtg agcagcgcgg cctggacgcg 1621 gggcactctc agggggctgt gtctgcgtgt cagtttgtgt ctgtctcggg gaatgtgtgt 1681 ctgtggccca agcaggtgac aggaagagat ggggggcctc aaccaactta gtgacttgtt 1741 tagaaaaaaa agacaaaaaa gtaaaaataa aaacaaaaaa gttggaaggc agaaaccatt 1801 aaaaaacaaa aagccaacaa cccagaaagg tttaaaaaac ataaggaaaa aaaagacaaa 1861 ttaaaggagg ggctagggga gaagctgcag ctggagctga aggctcgatc ttgtgaaccc 1921 ctaaatccgc tccctcctaa cagcacggat tctcttgggg ctcttcttca gggaagagta 1981 gggacgccgt tccagccccc cttcctatcg tgtccttggg ttcgggtcac tgcggcgacg 2041 acttgctcag actgtcccgg cggccggagt gactttctcg cacccccttg cctgtcccac 2101 ctcgctgaac accatcccgc cattagcgca tcggaacccc acacagttgc aactcccaac 2161 cccgaatctt tgcagccgtt cggccctgaa agatgcccta tccatgagat gccttttcat 2221 ctgcaaactc tgcaaaatgt gtctcatgtt tcgcaactct ttttttcccc ctcgctcccg 2281 cctaccccgt cggcattttc ttcttccacc agcttttact gaactttttg gcactgcttt 2341 ggattggggt caattgcagt ccacgtaact ggctgcagag aaatctaccg agcaaggaaa 2401 aggcacacac acacgtttgc aggggtgtct cggtttgcat ttctgttgga atgatccgaa 2461 ctggactcac atcctgtatg gtggatggac tgtatattga gggttccatt cttcgcgcag 2521 tttagacatc tctgttttga ttctttgttg ttgtttttat tttaaaaggc acaaactcta 2581 gatattagtt gaatgttgag gctttaactt tttcggtgtc tttctacaac tgtgttctgt 2641 gactcaattg tatcgtgtta atatcagtgc agactgtctc ctctacgtga ccgtataatg 2701 tttttctcgt cttgtagtct ctatggcgtg tctttatggt gtaataaggt tctcacgggg 2761 tcaatctttt gtgtttagag aggccacggt tcagacaatg gtatatattt ttgttatcag 2821 gtgcatgtct gtctgatttc tttttttttc ctgttggact atgtttgtga acataattgt 2881 cataagttat gtttcagatt tttgaattta tttatatgtg ttataatgaa tgcttctatt 2941 taaaagggaa atatttctac atgtgcttat agttttccaa gagtgtacca ttaacttgat 3001 tgttgataat aaaaaccaaa agcaagtct // LOCUS D82345 639 bp mRNA PRI 29-NOV-1997 DEFINITION Homo sapiens mRNA for NB thymosin beta, complete cds. ACCESSION D82345 NID g1841339 KEYWORDS NB thymosin beta. SOURCE Homo sapiens Neuroblastoma cell_line:IMR-32 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Yokoyama,M., Nishi,Y., Yoshii,J., Okubo,K. and Matsubara,K. TITLE Identification and cloning of neuroblastoma-specific and nerve tissue-specific genes through compiled expression profiles JOURNAL DNA Res. 3 (5), 311-320 (1996) MEDLINE 97191543 REFERENCE 2 (bases 1 to 639) AUTHORS Yokoyama,M. TITLE Direct Submission JOURNAL Submitted (21-DEC-1995) to the DDBJ/EMBL/GenBank databases. Masahiro Yokoyama, Japan Tobacco, Inc., Pharmaceutical Frontier Research Laboratories; 13-2 Fukuura 1-chome, Kanazawa-ku, Yokohama, Kanagawa 236, Japan (E-mail:yokoyama@ikrl.jti.co.jp, Tel:045-786-7694, Fax:045-786-7692) FEATURES Location/Qualifiers source 1..639 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="IMR-32" /cell_type="Neuroblastoma" CDS 98..235 /codon_start=1 /product="NB thymosin beta" /db_xref="PID:d1012223" /db_xref="PID:g1841340" /translation="MSDKPDLSEVEKFDRSKLKKTNTEEKNTLPSKETIQQEKECVQT S" BASE COUNT 178 a 130 c 130 g 201 t ORIGIN 1 cgcgggaacg ctaacctggt ccggagcgag tctgggtctc agccccgcga acagcctttc 61 acgagtcttc aagctttcag gctatcttct agtcaagatg agtgataagc cagacttgtc 121 ggaagtggag aagtttgaca ggtcaaaact gaagaaaact aatactgaag aaaaaaatac 181 tcttccctca aaggaaacta tccagcaaga gaaagagtgt gttcaaacat cataaaatgg 241 ggatcgcctc ccaacagcag atttcgacat tacctgagag tcttgatttt aggcttgttt 301 tttgtaaacc catgtgtttg tagagatttt aggcgtcttc ggatatcttc tcacctatgt 361 tccctggcta agaagtcaga ggtagccaat gtttccttaa attcattttt aaacttacca 421 ttggtgcata tgttccagat ggcagatgct gtcaataatc tcaccattga tgacctttgt 481 gtatgtagtt cttgcatcct atactggata agcctgtttt aacctgctat gatgggtgct 541 tccattgctt cataatcttc atgaagttgc atgcttttgc agcttttcac agtttatttg 601 catttctaat gtagtaataa agtaaccaat ataatcatt // LOCUS D82346 1425 bp mRNA PRI 29-NOV-1997 DEFINITION Homo sapiens mRNA for HNSPC, complete cds. ACCESSION D82346 NID g1841341 KEYWORDS HNSPC. SOURCE Homo sapiens Neuroblastoma cell_line:IMR-32 cDNA to mRNA, clone:HNSPC. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Yokoyama,M., Nishi,Y., Yoshii,J., Okubo,K. and Matsubara,K. TITLE Identification and cloning of neuroblastoma-specific and nerve tissue-specific genes through compiled expression profiles JOURNAL DNA Res. 3 (5), 311-320 (1996) MEDLINE 97191543 REFERENCE 2 (bases 1 to 1425) AUTHORS Yokoyama,M. TITLE Direct Submission JOURNAL Submitted (21-DEC-1995) to the DDBJ/EMBL/GenBank databases. Masahiro Yokoyama, Japan Tobacco, Inc., Pharmaceutical Frontier Research Laboratories; 13-2 Fukuura 1-chome, Kanazawa-ku, Yokohama, Kanagawa 236, Japan (E-mail:yokoyama@ikrl.jti.co.jp, Tel:045-786-7694, Fax:045-786-7692) FEATURES Location/Qualifiers source 1..1425 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="IMR-32" /cell_type="Neuroblastoma" /clone="HNSPC" CDS 178..1359 /codon_start=1 /product="HNSPC" /db_xref="PID:d1012224" /db_xref="PID:g1841342" /translation="MVQKSRNGGVYPGPSGEKKLKVGFVGLDPGAPDSTRDGALLIAG SEAPKRGSILSKPRAGGAGAGKPPKRNAFYRKLQNFLYNVLERPRGWAFIYHAYVFLL VFSCLVLSVFSTIKEYEKSSEGALYILEIVTIVVFGVEYFVRIWAAGCCCRYRGWRGR LKFARKPFCVIDIMVLIASIAVLAAGSQGNVFATSALRSLRFLQILRMIRMDRRGGTW KLLGSVVYAHSKELVTAWYIGFLCLILASFLVYLAEKGENDHFDTYADALWWGLITLT TIGYGDKYPQTWNGRLLAATFTLIGVSFFALPAGILGSGFALKVQEQHRQKHFEKRRN PAAGLIQSAWRFYATNLSRTDLHSTWQYYERTVTVPMYRYRRRAPATKQLFHFLFSIC S" BASE COUNT 231 a 462 c 439 g 293 t ORIGIN 1 cgcggagcga ggtggccgca gcgtctccgc gcgcggccca agcccggcag gagtgcggaa 61 ccgccgcctc ggccatgcgg ctcccggccg gggggcctgg gctggggccc gcgccgcccc 121 ccgcgctccg cccccgctga gcctgagccc gacccggggc gcctcccgcc aggcaccatg 181 gtgcagaagt cgcgcaacgg cggcgtatac cccggcccga gcggggagaa gaagctgaag 241 gtgggcttcg tggggctgga ccccggcgcg cccgactcca cccgggacgg ggcgctgctg 301 atcgccggct ccgaggcccc caagcgcggc agcatcctca gcaaacctcg cgcgggcggc 361 gcgggcgccg ggaagccccc caagcgcaac gccttctacc gcaagctgca gaatttcctc 421 tacaacgtgc tggagcggcc gcgcggctgg gcgttcatct accacgccta cgtgttcctc 481 ctggttttct cctgcctcgt gctgtctgtg ttttccacca tcaaggagta tgagaagagc 541 tcggaggggg ccctctacat cctggaaatc gtgactatcg tggtgtttgg cgtggagtac 601 ttcgtgcgga tctgggccgc aggctgctgc tgccggtacc gtggctggag ggggcggctc 661 aagtttgccc ggaaaccgtt ctgtgtgatt gacatcatgg tgctcatcgc ctccattgcg 721 gtgctggccg ccggctccca gggcaacgtc tttgccacat ctgcgctccg gagcctgcgc 781 ttcctgcaga ttctgcggat gatccgcatg gaccggcggg gaggcacctg gaagctgctg 841 ggctctgtgg tctatgccca cagcaaggag ctggtcactg cctggtacat cggcttcctt 901 tgtctcatcc tggcctcgtt cctggtgtac ttggcagaga agggggagaa cgaccacttt 961 gacacctacg cggatgcact ctggtggggc ctgatcacgc tgaccaccat tggctacggg 1021 gacaagtacc cccagacctg gaacggcagg ctccttgcgg caaccttcac cctcatcggt 1081 gtctccttct tcgcgctgcc tgcaggcatc ttggggtctg ggtttgccct gaaggttcag 1141 gagcagcaca ggcagaagca ctttgagaag aggcggaacc cggcagcagg cctgatccag 1201 tcggcctgga gattctacgc caccaacctc tcgcgcacag acctgcactc cacgtggcag 1261 tactacgagc gaacggtcac cgtgcccatg tacaggtacc gccgccgggc acctgccacc 1321 aagcaactgt ttcatttttt attttccatt tgttcttaaa ccccactttt tgttgttcat 1381 tattttgatt gatttttttt ctttaaaatg tatttttcac aaagg // LOCUS D82880 2562 bp mRNA PRI 02-DEC-1996 DEFINITION Human brain mRNA for human ras GTPase-activating protein,Gap1m, complete cds. ACCESSION D82880 NID g1513025 KEYWORDS Ras GTPase-activating protein; Gap1m; RASA2. SOURCE Homo sapiens male brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Li,S., Satoh,H., Watanabe,T., Nakamura,S. and Hattori,S. TITLE cDNA cloning and chromosomal mapping of a novel human GAP (GAP1M), a GTPase-activating protein of Ras JOURNAL Genomics 35 (3), 625-627 (1996) MEDLINE 97001173 REFERENCE 2 (bases 1 to 2562) AUTHORS Li,S., Satoh,H., Watanabe,T., Nakamura,S. and Hattori,S. TITLE cDNA cloning and chromosomal mapping of human Gap1m, a novel GTPase-activating protein toward Ras JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 2562) AUTHORS HATTORI,S. TITLE Direct Submission JOURNAL Submitted (26-DEC-1995) to the DDBJ/EMBL/GenBank databases. SEISUKE HATTORI, National Institute of Neuroscience, Division of Biochemistry and Cellular Biology; 4-1-1 Ogawahigashi, Kodaira, Tokyo 187, Japan (E-mail:hattori@ncnaxp.ncnp.go.jp, Tel:81-423-46-1722, Fax:81-423-46-1752) FEATURES Location/Qualifiers source 1..2562 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /tissue_type="brain" CDS 1..2562 /codon_start=1 /product="human ras GTPase-activating protein, Gap1m" /db_xref="PID:d1012288" /db_xref="PID:g1513026" /translation="MAAAAPAAAAASSEAPAASATAEPEAGDQDSREVRVLQSLRGKI CEAKNLLPYLGPHKMRDCFCTINLDQEEVYRTQVVEKSLSPFFSEEFYFEIPRTFQYL SFYVYDKNVLQRDLRIGKVAIKKEDLCNHSGKETWFSLQPVDSNSEVQGKVHLELKLN ELITENGTVCQQLVVHIKACHGLPLINGQSCDPYATVSLVGPSRNDQKKTKVKKKTSN PQFNEIFYFEVTRSSSYTRKSQFQVEEEDIEKLEIRIDLWNNGNLVQDVFLGEIKVPV NVLRTDSSHQAWYLLQPRDNGNKSSKTDDLGSLRLNICYTEDYVLPSEYYGPLKTLLL KSPDVQPISASAAYILSEICRDKNDAVLPLVRLLLHHDKLVPFATAVAELDLKDTQDA NTIFRGNSLATRCLDEMMKIVGGHYLKVTLKPILDEICDSSKSCEIDPIKLKEGDNVE NNKENLRYYVDKLFNTIVKSSMSCPTVMCDIFYSLRQMATQRFPNDPHVQYSAVSSFV FLRFFAVAVVSPHTFHLRPHHPDAQTIRTLTLISKTIQTLGSWGSLSKSKSSFKETFM CEFFKMFQEEGYIIAVKKFLDEISSTETKESSGTSEPVHLKEGEMYKRAQGRTRIGKK NFKKRWFCLTSRELTYHKQPEFIERKDAIYTIPVKNILAVEKLEESSFNKKNMFQVIH TEKPLYVQANNCVEANEWIDVLCRVSRCNQNRLSFYHPSVYLNGNWLCCQETGENTLG CKPCTAGVPADIQIDIDEDRETERIYSLFTLSLLKLQKMEEACGTIAVYQGPQKEPDD YSNFVIEDSVTTFKTIQQIKSIIEKLDEPHEKYRKKRSSSAKYGSKENPIVGKAS" BASE COUNT 841 a 487 c 555 g 679 t ORIGIN 1 atggcggcgg cggcgcctgc tgctgcggcg gcttcttccg aggcgccagc ggcgagtgcg 61 actgcagagc ccgaggccgg ggaccaggac agtcgcgagg ttcgagtgtt gcagagcctg 121 cggggcaaga tctgtgaagc aaaaaattta ttgccatatc ttggacccca caaaatgaga 181 gattgtttct gtaccataaa tttggaccag gaagaagttt atcgtaccca agttgtggaa 241 aaatctttaa gcccattttt cagtgaagaa ttttactttg agattccaag aactttccag 301 tatttgtctt tctatgttta tgataagaat gttttacaaa gagatctccg tataggaaaa 361 gtagccatca aaaaagaaga cttgtgtaat cacagtggca aagaaacttg gttttcatta 421 cagcctgttg actccaattc agaggttcag ggtaaagttc accttgaatt aaaactgaat 481 gaactgataa cggagaatgg aactgtatgc cagcagcttg ttgtacacat caaggcatgc 541 catgggttgc ctctcataaa tggccaaagc tgtgaccctt atgcaacagt ttctctagtg 601 ggcccttcta ggaatgacca aaagaagaca aaagtaaaga agaaaacaag caatccgcag 661 tttaatgaaa tcttttattt tgaggtaacc agatccagta gttacaccag aaagtcccag 721 ttccaggtag aagaggagga cattgaaaag ctagaaatca ggatcgactt gtggaacaat 781 ggaaacctag tccaagatgt tttcctaggt gagattaagg ttcctgtgaa cgtattaaga 841 actgattcct ctcatcaagc ctggtacttg ctacagccaa gagacaatgg aaacaagtca 901 tccaaaactg atgacctggg gtctcttcga ttaaatatat gttatacaga agactacgtg 961 cttccttcag agtactatgg tcctttgaaa actttgctgc taaaatcacc agatgttcaa 1021 ccaatatctg cctcagctgc ttacattttg agtgaaatat gtcgagataa aaatgatgct 1081 gttttgcccc ttgtacgact gctgctgcac catgataaac ttgttccttt tgccactgct 1141 gtggctgaat tagacttgaa ggatacacaa gatgcaaaca caatttttag aggaaattcc 1201 ctggctaccc gatgtctgga tgagatgatg aaaatagtgg gagggcacta cctgaaagta 1261 acattaaaac ctattcttga tgagatatgt gactcctcaa aatcctgtga aatcgatcct 1321 attaaattga aagagggaga taatgtagaa aataataagg agaatctgcg ctactatgta 1381 gacaagttat tcaatacaat tgtaaaatca agtatgagct gccccactgt aatgtgtgat 1441 atcttttatt ctctaaggca gatggctact cagagatttc ctaatgaccc tcatgttcag 1501 tattctgcag tgagcagctt tgtatttctc cgtttctttg ctgtagccgt agtatcacct 1561 catacttttc atttgcgacc tcatcatcca gatgcacaga caattagaac attaactctc 1621 atctcaaaaa ctatacaaac tttgggaagc tgggggagtc tgtccaaaag caagtcaagt 1681 ttcaaagaga cattcatgtg tgaatttttc aaaatgtttc aagaagaagg atatattata 1741 gcagttaaaa agttcttgga tgaaatttca tctactgaaa ctaaagagtc cagtggtacg 1801 agtgagcctg tgcacctgaa agaaggtgag atgtataaaa gagctcaagg aagaactcgg 1861 attggaaaaa agaattttaa gaaacgatgg ttctgcttaa caagcagaga gctcacctac 1921 cacaaacagc cagagttcat tgaacgcaaa gatgcaatct acacaatccc agtaaaaaac 1981 attcttgctg tggaaaaact ggaagagagc tctttcaaca agaaaaatat gttccaagta 2041 atacatacgg agaaaccact ctatgtccag gcaaataact gtgtagaagc taatgaatgg 2101 atagacgtac tctgcagggt gagccgatgc aatcaaaaca ggctcagttt ttatcatccc 2161 tctgtgtatc tgaacggaaa ttggctctgc tgtcaggaga ctggtgaaaa cactctcggc 2221 tgcaagccat gtactgcagg tgtccctgca gacatccaaa tagatattga tgaagacaga 2281 gaaacagaaa gaatttattc cctttttacc ctcagtttac ttaagctgca gaagatggaa 2341 gaggcttgtg gaactattgc agtctatcaa ggaccacaga aagagcctga tgattattct 2401 aactttgtaa tcgaggattc tgtaacaacc tttaagacaa ttcagcaaat aaaaagtata 2461 attgagaagc tggatgaacc tcatgaaaaa tataggaaga aaagatccag tagtgcaaaa 2521 tatgggagca aggaaaatcc aattgttggg aaagcatctt ag // LOCUS D83004 1203 bp mRNA PRI 19-FEB-1997 DEFINITION Human epidermoid carcinoma mRNA for ubiquitin-conjugating enzyme E2 similar to Drosophila bendless gene product, complete cds. ACCESSION D83004 NID g1181557 KEYWORDS ubiquitin-conjugating enzyme E2 similar. SOURCE Homo sapiens epidermoid carcinoma cell_line:KB cDNA to mRNA, clone_lib:KB/pKA1 clone:HP00686. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Yamaguchi,T., Kim,N.S., Sekine,S., Seino,H., Osaka,F., Yamao,F. and Kato,S. TITLE Cloning and expression of cDNA encoding a human ubiquitin-conjugating enzyme similar to the Drosophila bendless gene product JOURNAL J. Biochem. 120 (3), 494-497 (1996) MEDLINE 97058291 REFERENCE 2 (bases 1 to 1203) AUTHORS Kato,S. TITLE Human ubiquitin-conjugating enzyme E2 similar to Drosophila bendless gene product JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 1203) AUTHORS Kato,S. TITLE Direct Submission JOURNAL Submitted (10-JAN-1996) to the DDBJ/EMBL/GenBank databases. Seishi Kato, Sagami Chemical Research Center, Genetic Engineering Section; 4-4-1 Nishi-Ohnuma, Sagamihara, Kanagawa 229, Japan (Tel:0427-42-4791, Fax:0427-42-5091) FEATURES Location/Qualifiers source 1..1203 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KB" /cell_type="epidermoid carcinoma" /clone="HP00686" /clone_lib="KB/pKA1" 5'UTR 1..63 CDS 64..522 /codon_start=1 /product="ubiquitin-conjugating enzyme E2 UbcH-ben" /db_xref="PID:d1012342" /db_xref="PID:g1181558" /translation="MAGLPRRIIKETQRLLAEPVPGIKAEPDESNARYFHVVIAGPQD SPFEGGTFKLELFLPEEYPMAAPKVRFMTKIYHPNVDKLGRICLDILKDKWSPALQIR TVLLSIQALLSAPNPDDPLANDVAEQWKTNEAQAIETARAWTRLYAMNNI" 3'UTR 523..1203 BASE COUNT 335 a 251 c 259 g 358 t ORIGIN 1 actcgtgcgt gaggcgagag gagccggaga cgagaccaga ggccgaactc gggttctgac 61 aagatggccg ggctgccccg caggatcatc aaggaaaccc agcgtttgct ggcagaacca 121 gttcctggca tcaaagccga accagatgag agcaacgccc gttattttca tgtggtcatt 181 gctggccctc aggattcccc ctttgaggga gggactttta aacttgaact attccttcca 241 gaagaatacc caatggcagc ccctaaagta cgtttcatga ccaaaattta tcatcctaat 301 gtagacaagt tgggaagaat atgtttagat attttgaaag ataagtggtc cccagcactg 361 cagatccgca cagttctgct atcgatccag gccttgttaa gtgctcccaa tccagatgat 421 ccattagcaa atgatgtagc ggagcagtgg aagaccaacg aagcccaagc catagaaaca 481 gctagagcat ggactaggct atatgccatg aataatattt aaattgatac gatcatcaag 541 tgtgcatcac ttctcctgtt ctgccaagac ttcctcctct ttgtttgcat ttaatggaca 601 cagtcttaga aacattacag aataaaaaag cccagacatc ttcagtcctt tggtgattaa 661 atgcacatta gcaaatctat gtcttgtcct gattcactgt cataaagcat gagcagaggc 721 tagaagtatc atctggattg ttgtgaaacg tttaaaagca gtggcccctc cctgctttta 781 ttcatttccc ccatcctggt ttaagtataa agcactgtga atgaaggtag ttgtcaggtt 841 agctgcaggg gtgtgggtgt ttttatttta ttttatttta ttttattttt gaggggggag 901 gtagtttaat tttatgggct cctttccccc ttttttggtg atctaattgc attggttaaa 961 agcagctaac caggtcttta gaatatgctc tagccaagtc taactttatt tagacgctgt 1021 agatggacaa gcttgattgt tggaaccaaa atgggaacat taaacaaaca tcacagccct 1081 cactaataac attgctgtca agtgtagatt ccccccttca aaaaaagctt gtgaccattt 1141 tgtatggctt gtctggaaac ttctgtaaat cttatgtttt agtaaaatat tttttgttat 1201 tct // LOCUS D83017 2977 bp mRNA PRI 06-FEB-1997 DEFINITION Human mRNA for nel-related protein, complete cds. ACCESSION D83017 NID g1827482 KEYWORDS nel-related protein. SOURCE Homo sapiens fetus brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Watanabe,T.K., Katagiri,T., Suzuki,M., Shimizu,F., Fujiwara,T., Kanemoto,N., Nakamura,Y., Hirai,Y., Maekawa,H. and Takahashi,Ei. TITLE Cloning and characterization of two novel human cDNAs (NELL1 and NELL2) encoding proteins with six EGF-like repeats JOURNAL Genomics 38 (3), 273-276 (1996) MEDLINE 97131504 REFERENCE 2 (bases 1 to 2977) AUTHORS Watanabe,T., Katagiri,T., Suzuki,M., Fujiwara,T., Kanemoto,N., Maekawa,H., Nakamura,Y. and Takahashi,E. TITLE Cloning, expression, and mapping of a humannel-related protein 1 (NRP1) JOURNAL Unpublished (1995) REFERENCE 3 (bases 1 to 2977) AUTHORS Watanabe,T. TITLE Direct Submission JOURNAL Submitted (11-JAN-1996) to the DDBJ/EMBL/GenBank databases. Takeshi Watanabe, Otsuka GEN Research Institute,Otsuka Pharmaceutical Co.,Ltd; 463-10 Kagasuno Kawauchi-cho, Tokushima, Tokushima 771-01, Japan (Tel:0886-65-2888, Fax:0886-37-1035) FEATURES Location/Qualifiers source 1..2977 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="brain" CDS 103..2535 /note="NRP1" /codon_start=1 /product="nel-related protein" /db_xref="PID:d1012347" /db_xref="PID:g1827483" /translation="MPMDLILVVWFCVCTARTVVGFGMDPDLQMDIVTELDLVNTTLG VAQVSGMHNASKAFLFQDIEREIHAAPHVSEKLIQLFQNKSEFTILATVQQKPSTSGV ILSIRELEHSYFELESSGLRDEIRYHYIHNGKPRTEALPYRMADGQWHKVALSVSASH LLLHVDCNRIYERVIDPPDTNLPPGINLWLGQRNQKHGLFKGIIQDGKIIFMPNGYIT QCPNLNHTCPTCSDFLSLVQGIMDLQELLAKMTAKLNYAETRLSQLENCHCEKTCQVS GLLYRDQDSWVDGDHCRNCTCKSGAVECRRMSCPPLNCSPDSLPVHIAGQCCKVCRPK CIYGGKVLAEGQRILTKSCRECRGGVLVKITEMCPPLNCSEKDHILPENQCCRVCRGH NFCAEGPKCGENSECKNWNTKATCECKSGYISVQGDSAYCEDIDECAAKMHYCHANTV CVNLPGLYRCDCVPGYIRVDDFSCTEHDECGSGQHNCDENAICTNTVQGHSCTCKPGY VGNGTICRAFCEEGCRYGGTCVAPNKCVCPSGFTGSHCEKDIDECSEGIIECHNHSRC VNLPGWYHCECRSGFHDDGTYSLSGESCIDIDECALRTHTCWNDSACINLAGGFDCLC PSGPSCSGDCPHEGGLKHNGQVWTLKEDRCSVCSCKDGKIFCRRTACDCQNPSADLFC CPECDTRVTSQCLDQNGHKLYRSGDNWTHSCQQCRCLEGEVDCWPLTCPNLSCEYTAI LEGECCPRCVSDPCLADNITYDIRKTCLDSYGVSRLSGSVWTMAGSPCTTCKCKNGRV CCSVDFECLQNN" BASE COUNT 774 a 649 c 750 g 804 t ORIGIN 1 tagcaagttt ggcggctcca agccaggcgc gcctcaggat ccaggctcat ttgcttccac 61 ctagcttcgg tgccccctgc taggcgggga ccctcgagag cgatgccgat ggatttgatt 121 ttagttgtgt ggttctgtgt gtgcactgcc aggacagtgg tgggctttgg gatggaccct 181 gaccttcaga tggatatcgt caccgagctt gaccttgtga acaccaccct tggagttgct 241 caggtgtctg gaatgcacaa tgccagcaaa gcatttttat ttcaagacat agaaagagag 301 atccatgcag ctcctcatgt gagtgagaaa ttaattcagc tgttccagaa caagagtgaa 361 ttcaccattt tggccactgt acagcagaag ccatccactt caggagtgat actgtccatt 421 cgagaactgg agcacagcta ttttgaactg gagagcagtg gcctgaggga tgagattcgg 481 tatcactaca tacacaatgg gaagccaagg acagaggcac ttccttaccg catggcagat 541 ggacaatggc acaaggttgc actgtcagtt agcgcctctc atctcctgct ccatgtcgac 601 tgtaacagga tttatgagcg tgtgatagac cctccagata ccaaccttcc cccaggaatc 661 aatttatggc ttggccagcg caaccaaaag catggcttat tcaaagggat catccaagat 721 gggaagatca tctttatgcc gaatggatat ataacacagt gtccaaatct aaatcacact 781 tgcccaacct gcagtgattt cttaagcctg gtgcaaggaa taatggattt acaagagctt 841 ttggccaaga tgactgcaaa actaaattat gcagagacaa gacttagtca attggaaaac 901 tgtcattgtg agaagacttg tcaagtgagt ggactgctct atcgagatca agactcttgg 961 gtagatggtg accattgcag gaactgcact tgcaaaagtg gtgccgtgga atgccgaagg 1021 atgtcctgtc cccctctcaa ttgctcccca gactccctcc cagtacacat tgctggccag 1081 tgctgtaagg tctgccgacc aaaatgtatc tatggaggaa aagttcttgc agaaggccag 1141 cggattttaa ccaagagctg tcgggaatgc cgaggtggag ttttagtaaa aattacagaa 1201 atgtgtcctc ctttgaactg ctcagaaaag gatcacattc ttcctgagaa tcagtgctgc 1261 cgtgtctgta gaggtcataa cttttgtgca gaaggaccta aatgtggtga aaactcagag 1321 tgcaaaaact ggaatacaaa agctacttgt gagtgcaaga gtggttacat ctctgtccag 1381 ggagactctg cctactgtga agatattgat gagtgtgcag ctaagatgca ttactgtcat 1441 gccaatactg tgtgtgtcaa ccttcctggg ttatatcgct gtgactgtgt cccaggatac 1501 attcgtgtgg atgacttctc ttgtacagaa cacgatgaat gtggcagcgg ccagcacaac 1561 tgtgatgaga atgccatctg caccaacact gtccagggac acagctgcac ctgcaaaccg 1621 ggctacgtgg ggaacgggac catctgcaga gctttctgtg aagagggctg cagatacggt 1681 ggaacgtgtg tggctcccaa caaatgtgtc tgtccatctg gattcacagg aagccactgc 1741 gagaaagata ttgatgaatg ttcagaggga atcattgagt gccacaacca ttcccgctgc 1801 gttaacctgc cagggtggta ccactgtgag tgcagaagcg gtttccatga cgatgggacc 1861 tattcactgt ccggggagtc ctgtattgac attgatgaat gtgccttaag aactcacacc 1921 tgttggaacg attctgcctg catcaacctg gcagggggtt ttgactgtct ctgcccctct 1981 gggccctcct gctctggtga ctgtcctcat gaaggggggc tgaagcacaa tggccaggtg 2041 tggaccttga aagaagacag gtgttctgtc tgctcctgca aggatggcaa gatattctgc 2101 cgacggacag cttgtgattg ccagaatcca agtgctgacc tattctgttg cccagaatgt 2161 gacaccagag tcacaagtca atgtttagac caaaatggtc acaagctgta tcgaagtgga 2221 gacaattgga cccatagctg tcagcagtgt cggtgtctgg aaggagaggt agattgctgg 2281 ccactcactt gccccaactt gagctgtgag tatacagcta tcttagaagg ggaatgttgt 2341 ccccgctgtg tcagtgaccc ctgcctagct gataacatca cctatgacat cagaaaaact 2401 tgcctggaca gctatggtgt ttcacggctt agtggctcag tgtggacgat ggctggatct 2461 ccctgcacaa cctgtaaatg caagaatgga agagtctgtt gttctgtgga ttttgagtgt 2521 cttcaaaata attgaagtat ttacagtgga ctcaacgcag aagaatggac gaaatgacca 2581 tccaacgtga ttaaggatag gaatcggtag tttggttttt ttgtttgttt tgttttttta 2641 accacagata attgccaaag tttccacctg aggacggtgt ttcggaggtt gccttttgga 2701 cctaccactt tgctcattct tgctaaccta gtctaggtga cctacagtgc cgtgcattta 2761 agtcaatggt tgttaaaaga agtttcccgt gttgtaaatc atgtttccct tatcagatca 2821 tttgcaaata catttaaatg atctcatggt aaatggttga tgtatttttt gggtttattt 2881 tgtgtactaa ccataataga gagagactca gctcctttta tttattttgt tgatttatgg 2941 atcaaattct aaaataaagt tgcctgttgt gactttt // LOCUS D83018 3198 bp mRNA PRI 06-FEB-1997 DEFINITION Human mRNA for nel-related protein 2, complete cds. ACCESSION D83018 NID g1827484 KEYWORDS nel-related protein 2; nel-related protein 2 (NRP2). SOURCE Homo sapiens fetus brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Watanabe,T.K., Katagiri,T., Suzuki,M., Shimizu,F., Fujiwara,T., Kanemoto,N., Nakamura,Y., Hirai,Y., Maekawa,H. and Takahashi,Ei. TITLE Cloning and characterization of two novel human cDNAs (NELL1 and NELL2) encoding proteins with six EGF-like repeats JOURNAL Genomics 38 (3), 273-276 (1996) MEDLINE 97131504 REFERENCE 2 (bases 1 to 3198) AUTHORS Katagiri,T., Watanabe,T., Suzuki,M., Fujiwara,T., Kanemoto,N., Maekawa,H., Nakamura,Y. and Takahashi,E. TITLE Cloning, expression, and mapping of a humannel-related protein 2 (NRP1) JOURNAL Unpublished (1995) REFERENCE 3 (bases 1 to 3198) AUTHORS Watanabe,T. TITLE Direct Submission JOURNAL Submitted (11-JAN-1996) to the DDBJ/EMBL/GenBank databases. Takeshi Watanabe, Otsuka GEN Research Institute,Otsuka Pharmaceutical Co.,Ltd; 463-10 Kagasuno Kawauchi-cho, Tokushima, Tokushima 771-01, Japan (Tel:0886-65-2888, Fax:0886-37-1035) FEATURES Location/Qualifiers source 1..3198 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="brain" CDS 97..2547 /note="NRP2" /codon_start=1 /product="nel-related protein 2" /db_xref="PID:d1012348" /db_xref="PID:g1827485" /translation="MESRVLLRTFCLIFGLGAVWGLGVDPSLQIDVLTELELGESTTG VRQVPGLHNGTKAFLFQDTPRSIKASTATAEQFFQKLRNKHEFTILVTLKQTHLNSGV ILSIHHLDHRYLELESSGHRNEVRLHYRSGSHRPHTEVFPYILADDKWHKLSLAISAS HLILHIDCNKIYERVVEKPSTDLPLGTTFWLGQRNNAHGYFKGIMQDVQLLVMPQGFI AQCPDLNRTCPTCNDFHGLVQKIMELQDILAKTSAKLSRAEQRMNRLDQCYCERTCTM KGTTYREFESWIDGCKNCTCLNGTIQCETLICPNPDCPLKSALAYVDGKCCKECKSIC QFQGRTYFEGERNTVYSSSGVCVLYECKDQTMKLVESSGCPALDCPESHQITLSHSCC KVCKGYDFCSERHNCMENSICRNLNDRAVCSCRDGFRALREDNAYCEDIDECAEGRHY CRENTMCVNTPGSFMCICKTGYIRIDDYSCTEHDECITNQHNCDENALCFNTVGGHNC VCKPGYTGNGTTCKAFCKDGCRNGGACIAANVCACPQGFTGPSCETDIDECSDGFVQC DSRANCINLPGWYHCECRDGYHDNGMFSPSGESCEDIDECGTGRHSCANDTICFNLDG GYDCRCPHGKNCTGDCIHDGKVKHNGQIWVLENDRCSVCSCQNGFVMCRRMVCDCENP TVDLFCCPECDPRLSSQCLHQNGETLYNSGDTWVQNCQQCRCLQGEVDCWPLPCPDVE CEFSILPENECCPRCVTDPCQADTIRNDITKTCLDEMNVVRFTGSSWIKHGTECTLCQ CKNGHICCSVDPQCLQEL" BASE COUNT 900 a 657 c 757 g 884 t ORIGIN 1 ttgggaggag cagtctctcc gctcgtctcc cggagctttc tccattgtct ctgcctttac 61 aacagaggga gacgatggac tgagctgatc cgcaccatgg agtctcgggt cttactgaga 121 acattctgtt tgatcttcgg tctcggagca gtttgggggc ttggtgtgga cccttcccta 181 cagattgacg tcttaacaga gttagaactt ggggagtcca cgaccggagt gcgtcaggtc 241 ccggggctgc ataatgggac gaaagccttt ctctttcaag atactcccag aagcataaaa 301 gcatccactg ctacagctga acagtttttt cagaagctga gaaataaaca tgaatttact 361 attttggtga ccctaaaaca gacccactta aattcaggag ttattctctc aattcaccac 421 ttggatcaca ggtacctgga actggaaagt agtggccatc ggaatgaagt cagactgcat 481 taccgctcag gcagtcaccg ccctcacaca gaagtgtttc cttacatttt ggctgatgac 541 aagtggcaca agctctcctt agccatcagt gcttcccatt tgattttaca cattgactgc 601 aataaaattt atgaaagggt agtagaaaag ccctccacag acttgcctct aggcacaaca 661 ttttggctag gacagagaaa taatgcgcat ggatatttta agggtataat gcaagatgtc 721 caattacttg tcatgcccca gggatttatt gctcagtgcc cagatcttaa tcgcacctgt 781 ccaacttgca atgacttcca tggacttgtg cagaaaatca tggagctaca ggatatttta 841 gccaaaacat cagccaagct gtctcgagct gaacagcgaa tgaatagatt ggatcagtgc 901 tattgtgaaa ggacttgcac catgaaggga accacctacc gagaatttga gtcctggata 961 gacggctgta agaactgcac atgcctgaat ggaaccatcc agtgtgaaac tctaatctgc 1021 ccaaatcctg actgcccact taagtcggct cttgcgtatg tggatggcaa atgctgtaag 1081 gaatgcaaat cgatatgcca atttcaagga cgaacctact ttgaaggaga aagaaataca 1141 gtctattcct cttctggagt atgtgttctc tatgagtgca aggaccagac catgaaactt 1201 gttgagagtt caggctgtcc agctttggat tgtccagagt ctcatcagat aaccttgtct 1261 cacagctgtt gcaaagtttg taaaggttat gacttttgtt ctgaaaggca taactgcatg 1321 gagaattcca tctgcagaaa tctgaatgac agggctgttt gtagctgtcg agatggtttt 1381 agggctcttc gagaggataa tgcctactgt gaagacatcg atgagtgtgc tgaagggcgc 1441 cattactgtc gtgaaaatac aatgtgtgtc aacaccccgg gttcttttat gtgcatctgc 1501 aaaactggat acatcagaat tgatgattat tcatgtacag aacatgatga gtgtatcaca 1561 aatcagcaca actgtgatga aaatgcttta tgcttcaaca ctgttggagg acacaactgt 1621 gtttgcaagc cgggctatac agggaatgga acgacatgca aagcattttg caaagatggc 1681 tgtaggaatg gaggagcctg tattgccgct aatgtgtgtg cctgcccaca aggcttcact 1741 ggacccagct gtgaaacgga cattgatgaa tgctctgatg gttttgttca atgtgacagt 1801 cgtgctaatt gcattaacct gcctggatgg taccactgtg agtgcagaga tggctaccat 1861 gacaatggga tgttttcacc aagtggagaa tcgtgtgaag atattgatga gtgtgggacc 1921 gggaggcaca gctgtgccaa tgataccatt tgcttcaatt tggatggcgg atatgattgt 1981 cgatgtcctc atggaaagaa ttgcacaggg gactgcatcc atgatggaaa agttaagcac 2041 aatggtcaga tttgggtgtt ggaaaatgac aggtgctctg tgtgctcatg tcagaatgga 2101 ttcgttatgt gtcgacggat ggtctgtgac tgtgagaatc ccacagttga tcttttttgc 2161 tgccctgaat gtgacccaag gcttagtagt cagtgcctcc atcaaaatgg ggaaactttg 2221 tataacagtg gtgacacctg ggtccagaat tgtcaacagt gccgctgctt gcaaggggaa 2281 gttgattgtt ggcccctgcc ttgcccagat gtggagtgtg aattcagcat tctcccagag 2341 aatgagtgct gcccgcgctg tgtcacagac ccttgccagg ctgacaccat ccgcaatgac 2401 atcaccaaga cttgcctgga cgaaatgaat gtggttcgct tcaccgggtc ctcttggatc 2461 aaacatggca ctgagtgtac tctctgccag tgcaagaatg gccacatctg ttgctcagtg 2521 gatccacagt gccttcagga actgtgaagt taactgtctc atgggagatt tctgttaaaa 2581 gaatgttctt tcattaaaag accaaaaaga agttaaaact taaattgggt gatttgtggg 2641 cagctaaatg cagctttgtt aatagctgag tgaactttca attatgaaat ttgtggagct 2701 tgacaaaatc acaaaaggaa aattactggg gcaaaattag acctcaagtc tgcctctact 2761 gtgtctcaca tcaccatgta gaagaatggg cgtacagtat ataccgtgac atcctgaacc 2821 ctggatagaa agcctgagcc cattggatct gtgaaagcct ctagcttcac tggtgcagaa 2881 aattttcctc tagatcagaa tcttcagaat cagttaggtt cctcactgca agaaataaaa 2941 tgtcaggcag tgaatgaatt atattttcag aagtaaagca aagaagctat aacatgttat 3001 gtacagtaca ctctgaaaag aaatctgaaa caagttattg taatgataaa aataatgcac 3061 aggcatggtt acttaatatt ttctaacagg aaaagtcatc cctatttcct tgttttactg 3121 cacttaatat tatttggttg aatttgttca gtataagctc gttcttgtgc aaaattaaat 3181 aaatatttct cttacctt // LOCUS D83260 1216 bp mRNA PRI 17-FEB-1997 DEFINITION Human HXC-26 mRNA, complete cds. ACCESSION D83260 NID g1842157 KEYWORDS HXC-26. SOURCE Homo sapiens skeletal muscle cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Toyoda,A., Sakai,T., Sugiyama,Y., Kusuda,J., Hashimoto,K. and Maeda,H. TITLE Isolation and analysis of a novel gene, HXC-26, adjacent to the rab GDP dissociation inhibitor gene located at human chromosome Xq28 region JOURNAL DNA Res. 3 (5), 337-340 (1996) MEDLINE 97191546 REFERENCE 2 (bases 1 to 1216) AUTHORS Toyoda,A., Sakai,T., Sugiyama,Y., Kusuda,J., Hashimoto,K. and Maeda,H. TITLE Isolation and analysis of a novel gene, HXC-26, adjacent to the rab GDP dissociation inhibitor gene located at Xq28 region JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 1216) AUTHORS Toyoda,A. TITLE Direct Submission JOURNAL Submitted (29-JAN-1996) to the DDBJ/EMBL/GenBank databases. Atsushi Toyoda, Soka University, Faculty of Engineering; 1-236 Tangi, Hachioji, Tokyo 192, Japan (E-mail:atoyoda@t.soka.ac.jp, Tel:0426-91-9489, Fax:0426-91-9312) FEATURES Location/Qualifiers source 1..1216 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /map="Xq28" /tissue_type="skeletal muscle" gene 23..1000 /gene="HXC-26" CDS 23..1000 /gene="HXC-26" /codon_start=1 /db_xref="PID:d1012538" /db_xref="PID:g1842158" /translation="MHLMKKREKQREQMEQMKQRIAEENIMKSNIDKKFSAHYDAVEA ELKSSTVGLVTLNDMKAKQEALVKEREKQLAKKEQSKELQMKLEKLREKERKKEAKRK ISSLSFTLEEEEEGGEEEEEAAMYEEEMEREEITTKKRKLGKNPDVDTSFLPDRDREE EENRLREELRQEWEAKQEKIKSEEIEITFSYWDGSGHRRTVKMRKGNTMQQFLQKALE ILRKDFSELRSAGVEQLMYIKEDLIIPHHHSFYDFIVTKARGKSGPLFNFDVHDDVRL LSDATVEKDESHAGKVVLRSWYEKNKHIFPASRWEPYDPEKKWDKYTIR" BASE COUNT 324 a 301 c 402 g 189 t ORIGIN 1 cgcgagcgag gccggccgcg ccatgcacct gatgaagaag cgggagaagc agcgcgagca 61 gatggagcag atgaagcagc gcatcgcgga ggagaacatc atgaaatcca acattgacaa 121 gaagttctct gcgcactacg acgcggtgga ggcagagctc aagtccagca ccgtgggtct 181 cgtgaccctg aatgacatga aggccaagca ggaggctctg gtgaaggagc gggagaagca 241 gctggccaag aaggagcagt ccaaggagct gcagatgaag ctggagaagc ttcgagagaa 301 ggagcgtaag aaggaagcca agcggaagat ctccagcctg tccttcaccc tggaggagga 361 agaagaggga ggcgaggagg aagaggaggc ggccatgtat gaggaggaga tggaaaggga 421 agagatcacc acgaagaaga gaaaactggg gaagaaccca gacgttgaca caagcttctt 481 gcctgatcga gaccgtgagg aggaggagaa tcggcttcgg gaagagctgc ggcaggagtg 541 ggaagccaag caggagaaga tcaagagtga ggagatcgag atcaccttca gctactggga 601 tggctctggg caccggcgga cagtcaagat gagaaagggc aacaccatgc agcagttcct 661 gcagaaggcg ctcgagatcc ttcggaaaga cttcagtgag ctgaggtccg caggggtgga 721 gcagctcatg tacatcaagg aggacttgat catccctcac catcacagct tctacgactt 781 catcgtcacc aaggcacggg ggaagagtgg accactcttc aactttgatg ttcatgacga 841 tgtgcggttg ctcagtgacg ccactgtgga gaaggatgag tcccatgcag gcaaggtggt 901 gctgaggagc tggtacgaga agaacaagca catctttccc gccagccgct gggaacccta 961 cgaccctgaa aagaagtggg acaagtacac gatccgctga gcatccagga ggctgcgcgg 1021 ccccggctcc tcagctccct cagtgtgccc cgtggtgtca ccgggactcc aggcacccgc 1081 tcccctgcga ccatgccagg cacgctggga ggaggacggc agctgctcgt gtcctgcccc 1141 tgccacatca gtgactgctt tattcttttc caataaagaa gtgcacgtgt cagagctgga 1201 gcgcctgcat tgtgag // LOCUS D83597 2697 bp mRNA PRI 18-FEB-1997 DEFINITION Human mRNA for RP105, complete cds. ACCESSION D83597 NID g1843410 KEYWORDS RP105. SOURCE Homo sapiens B cell line cell_line:Daudi cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Miura,Y., Miyake,K., Yamashita,Y., Shimazu,R., Copeland,N.G., Gilbert,D.J., Jenkins,N.A., Inazawa,J., Abe,T. and Kimoto,M. TITLE Molecular cloning of a human RP105 homologue and chromosomal localization of the mouse and human RP105 genes (Ly64 and LY64) JOURNAL Genomics 38 (3), 299-304 (1996) MEDLINE 97131508 REFERENCE 2 (sites) AUTHORS Miura,Y., Miyake,K., Shimazu,R., Yamashita,Y., Copeland,N.G., Gilbert,D.J., Jenkins,N.A., Inazawa,J., Abe,T. and Masao,K. TITLE Molecular cloning of a human RP105 homologue and chromosomal localizati on of the mouse and human RP105 genes JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 2697) AUTHORS Miyake,K. TITLE Direct Submission JOURNAL Submitted (22-FEB-1996) to the DDBJ/EMBL/GenBank databases. Kensuke Miyake, Saga Medical School, Department of Immunology; 5-1-1, Nabeshima, Saga City, Saga 849, Japan (E-mail:miyake@smsnet.saga-med.ac.jp, Tel:+81-952-31-6511, Fax:+81-952-33-2518) FEATURES Location/Qualifiers source 1..2697 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Daudi" /map="5q12" /tissue_type="B cell line" CDS 143..2128 /codon_start=1 /product="RP105" /db_xref="PID:d1012688" /db_xref="PID:g1843411" /translation="MAFDVSCFFWVVLFSAGCKVITSWDQMCIEKEANKTYNCENLGL SEIPDTLPNTTEFLEFSFNFLPTIHNRTFSRLMNLTFLDLTRCQINWIHEDTFQSHHQ LSTLVLTGNPLIFMAETSLNGPKSLKHLFLIQTGISNLEFIPVHNLENLESLYLGSNH ISSIKFPKDFPARNLKVLDFQNNAIHYISREDMRSLEQAINLSLNFNGNNVKGIELGA FDSTVFQSLNFGGTPNLSVIFNGLQNSTTQSLWLGTFEDIDDEDISSAMLKGLCEMSV ESLNLQEHRFSDISSTTFQCFTQLQELDLTATHLKGLPSGMKGLNLLKKLVLSVNHFD QLCQISAANFPSLTHLYIRGNVKKLHLGVGCLEKLGNLQTLDLSHNDIEASDCCSLQL KNLSHLQTLNLSHNEPLGLQSQAFKECPQLELLDLAFTRLHINAPQSPFQNLHFLQVL NLTYCFLDTSNQHLLAGLPVLRHLNLKGNHFQDGTITKTNLLQTVGSLEVLILSSCGL LSIDQQAFHSLGKMSHVDLSHNSLTCDSIDSLSHLKGIYLNLAANSINIISPRLLPIL SQQSTINLSHNPLDCTCSNIHFLTWYKENLHKLEGSEETTCANPPSLRGVKLSDVKLS CGITAIGIFFLIVFLLLLAILLFFAVKYLLRWKYQHI" sig_peptide 143..202 mat_peptide 203..2125 polyA_signal 2678..2683 BASE COUNT 757 a 680 c 533 g 727 t ORIGIN 1 gctgagcagt caacagcatt tcttgttcca agatcaccct tctgagtacc tctctggctg 61 ccaaattgcc agggccttca cagtttgatt ccattctcag ctccaagcat taggtaaacc 121 caccaagcaa tcctagcctg tgatggcgtt tgacgtcagc tgcttctttt gggtggtgct 181 gttttctgcc ggctgtaaag tcatcacctc ctgggatcag atgtgcattg agaaagaagc 241 caacaaaaca tataactgtg aaaatttagg tctcagtgaa atccctgaca ctctaccaaa 301 cacaacagaa tttttggaat tcagctttaa ttttttgcct acaattcaca atagaacctt 361 cagcagactc atgaatctta cctttttgga tttaactagg tgccagatta actggataca 421 tgaagacact tttcaaagcc atcatcaatt aagcacactt gtgttaactg gaaatcccct 481 gatattcatg gcagaaacat cgcttaatgg gcccaagtca ctgaagcatc ttttcttaat 541 ccaaacggga atatccaatc tcgagtttat tccagtgcac aatctggaaa acttggaaag 601 cttgtatctt ggaagcaacc atatttcctc cattaagttc cccaaagact tcccagcacg 661 gaatctgaaa gtactggatt ttcagaataa tgctatacac tacatctcta gagaagacat 721 gaggtctctg gagcaggcca tcaacctaag cctgaacttc aatggcaata atgttaaagg 781 tattgagctt ggggcttttg attcaacggt cttccaaagt ttgaactttg gaggaactcc 841 aaatttgtct gttatattca atggtctgca gaactctact actcagtctc tctggctggg 901 aacatttgag gacattgatg acgaagatat tagttcagcc atgctcaagg gactctgtga 961 aatgtctgtt gagagcctca acctgcagga acaccgcttc tctgacatct catccaccac 1021 atttcagtgc ttcacccaac tccaagaatt ggatctgaca gcaactcact tgaaagggtt 1081 accctctggg atgaagggtc tgaacttgct caagaaatta gttctcagtg taaatcattt 1141 cgatcaattg tgtcaaatca gtgctgccaa tttcccctcc cttacacacc tctacatcag 1201 aggcaacgtg aagaaacttc accttggtgt tggctgcttg gagaaactag gaaaccttca 1261 gacacttgat ttaagccata atgacataga ggcttctgac tgctgcagtc tgcaactcaa 1321 aaacctgtcc cacttgcaaa ccttaaacct gagccacaat gagcctcttg gtctccagag 1381 tcaggcattc aaagaatgtc ctcagctaga actcctcgat ttggcattta cccgcttaca 1441 cattaatgct ccacaaagtc ccttccaaaa cctccatttc cttcaggttc tgaatctcac 1501 ttactgcttc cttgatacca gcaatcagca tcttctagca ggcctaccag ttctccggca 1561 tctcaactta aaagggaatc actttcaaga tgggactatc acgaagacca acctacttca 1621 gactgtgggc agcttggagg ttctgatttt gtcctcttgt ggtctcctct ctatagacca 1681 gcaagcattc cacagcttgg gaaaaatgag ccatgtagac ttaagccaca acagcctgac 1741 atgcgacagc attgattctc ttagccatct taagggaatc tacctcaatc tggctgccaa 1801 cagcattaac atcatctcac cccgtctcct ccctatcttg tcccagcaga gcaccattaa 1861 tttaagtcat aaccccctgg actgcacttg ctcgaatatt catttcttaa catggtacaa 1921 agaaaacctg cacaaacttg aaggctcgga ggagaccacg tgtgcaaacc cgccatctct 1981 aaggggagtt aagctatctg atgtcaagct ttcctgtggg attacagcca taggcatttt 2041 ctttctcata gtatttctat tattgttggc tattctgcta ttttttgcag ttaaatacct 2101 tctcaggtgg aaataccaac acatttagtg ctgaaggttt ccagagaaag caaataagtg 2161 tgcttagcaa aattgctcta agtgaaggaa ctgtcatctg ctggtgacca gaccagactt 2221 ttcagattgc ttcctggaac tgggcaggga ctcactgtgc ttttctgagc ttcttactcc 2281 tgtgagtccc agagctaaag aaccttctag gcaagtacac cgaatgactc agtccagagg 2341 gtcagatgct gctgtgagag gcacagagcc ctttccgcat gtggaagagt gggaggaagc 2401 agagggaggg actgggcagg gactgccggc cccggagtct cccacaggga ggccattccc 2461 cttctactac cgacatccct cccagcacca cacaccccgc ccctgaaagg agatcatcag 2521 cccccacaat ttgtcagagc tgaagccagc ccactaccca cccccactac agcattgtgc 2581 ttgggtctgg gttctcagta aatgtagcca tttgagaaac ttacttgggg acaaagtctc 2641 aatccttatt ttaaatgaaa aaagaaaaga aaagcataat aaatttaaaa gaaaagg // LOCUS D83703 3194 bp mRNA PRI 03-JUN-1997 DEFINITION Human mRNA for peroxisome assembly factor-2, complete cds. ACCESSION D83703 NID g1747315 KEYWORDS PAF-2; peroxisome assembly factor-2. SOURCE Homo sapiens cDNA to mRNA, clone_lib:lambda gt 11. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3194) AUTHORS Fukuda,S. TITLE Direct Submission JOURNAL Submitted (25-FEB-1996) to the DDBJ/EMBL/GenBank databases. Seiji Fukuda, Gifu University, school of Medicine, Department of Pediatrics; Tsukasa-machi 40, Gifu, Gifu 500, Japan (E-mail:sfuk@cc.gifu-u.ac.jp, Tel:058-265-1241(ex.2817), Fax:058-265-9011) REFERENCE 2 (bases 1 to 3194) AUTHORS Fukuda,S., Shimozawa,N., Suzuki,Y. and Tomatsu,S. TITLE Peroxisome Assembly Factor-2 (PAF-2) : The Human Gene Responsible for Group C Peroxisome Biogenesis Disorder JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Fukuda,S., Shimozawa,N., Suzuki,Y., Zhang,Z., Tomatsu,S., Tsukamoto,T., Hashiguchi,N., Osumi,T., Masuno,M., Imaizumi,K., Kuroki,Y., Fujiki,Y., Orii,T. and Kondo,N. TITLE Human peroxisome assembly factor-2 (PAF-2): a gene responsible for group C peroxisome biogenesis disorder in humans JOURNAL Am. J. Hum. Genet. 59 (6), 1210-1220 (1996) MEDLINE 97094178 COMMENT Sequence updated (01-Aug-1996) by:Seiji Fukuda. FEATURES Location/Qualifiers source 1..3194 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /clone_lib="lambda gt 11" /map="6p21.1" gene 71..3013 /gene="PAF-2" CDS 71..3013 /gene="PAF-2" /codon_start=1 /product="peroxisome assembly factor-2" /db_xref="PID:d1012741" /db_xref="PID:g1747316" /translation="MALAVLRVLEPFPTETPPLAVLLPPGGPWPAAELGLVLALRPAG ESPAGPALLVAALEGPDAGTEEQGPGPPQLLVSRALLRLLALGSGAWVRARAVRRPPA LGWALLGTSLGPGLGPRVGPLLVRRGETLPVPGPRVLETRPALQGLLGPGTRLAVTEL RGRARLCPESGDSSRPPPPPVVSSFAVSGTVRRLQGVLGGTGDSLGVSRSCLRGLGLF QGEWVWVAQARESSNTSQPHLARVQVLEPRWDLSDRLGPGSGPLGEPLADGLALVPAT LAFNLGCDPLEMGELRIQRYLEGSIAPEDKGSCSLLPGPPFARELHIEIVSSPHYSTN GNYDGVLYRHFQIPRVVQEGDVLCVPTIGQVEILEGSPEKLPRWREMFFKVKKTVGEA PDGPASAYLADTTHTSLYMVGSTLSPVPWLPSEESTLWSSLSPPGLEALVSELCAVLK PRLQPGGALLTGTSSVLLRGPPGCGKTTVVAAACSHLGLHLLKVPCSSLCAESSGAVE TKLQAIFSRARRCRPAVLLLTAVDLLGRDRDGLGEDARVMAVLRHLLLNEDPLNSCPP LMVVATTSRAQDLPADVQTAFPHELEVPALSEGQRLSILRALTAHLPLGQEVNLAQLA RRCAGFVVGDLYALLTHSSRAACTRIKNSGLAGGLTEEDEGELCAAGFPLLAEDFGQA LEQLQTAHSQAVGAPKIPSVSWHDVGGLQEVKKEILETIQLPLEHPELLSLGLRRSGL LLHGPPGTGKTLLAKAVATECSLTFLSVKGPELINMYVGQSEENVREVFARARAAAPC IIFFDELDSLAPSRGRSGDSGGVMDRVVSQLLAELDGLHSTQDVFVIGATNRPDLLDP ALLRPGRFDKLVFVGANEDRASQLRVLSAITRKFKLEPSVSLVNVLDCCPPQLTGADL YSLCSDAMTAALKRRVHDLEEGLEPGSSALMLTMEDLLQAAARLQPSVSEQELLRYKR IQRKFAAC" BASE COUNT 562 a 950 c 1029 g 653 t ORIGIN 1 acactagtcg tctggctctc tggctccgga agctgtgctc cttcaccctc ctcgttggtg 61 tcctgtcacc atggcgctgg ctgtcttgcg ggtcctggag ccctttccga ccgagacacc 121 cccgttggca gtgctgctgc cacccggggg cccgtggccg gcggcggagc tgggcctggt 181 gctggccctg aggcctgcag gggagagccc ggcagggccg gcgctgctgg tggcagccct 241 ggaggggccg gacgcgggca ccgaagagca gggtcccggg ccgccgcagc tactggttag 301 ccgcgcgctg ctgcggctcc tggcactggg ctccggggcc tgggtgcggg cgcgggcggt 361 gcggcggccc ccggcgctag gttgggcact gcttggcacc tcgctggggc ctgggctcgg 421 accgcgagtc gggccgctgc tggtgaggcg cggagagacc ctcccagtgc ccggaccgcg 481 ggtgctggag acacggccgg cgttgcaagg gctgctgggc ccagggactc ggctggctgt 541 gactgagctc cgcgggcggg ccagactgtg tccagagtct ggggacagca gtcggccccc 601 acccccgccc gtggtgtcct cctttgcggt ttctggcaca gtgcggcgac tccagggagt 661 tctgggaggg actggagatt cactaggggt gagccggagc tgtctccgtg gccttggcct 721 cttccagggc gaatgggtgt gggtggccca ggccagagag tcatcgaaca cttcacagcc 781 gcacttggct agggtgcagg tcctagaacc tcgctgggac ctctctgata gactgggacc 841 cggctctgga ccgctgggag agcccctcgc tgacggactg gcgcttgtcc ctgccacttt 901 ggcttttaat cttggctgtg accccctgga aatgggagag ctcagaattc agaggtactt 961 ggaaggctcc atcgcccctg aagacaaagg aagctgctca ttgctgcctg ggcctccatt 1021 tgccagagag ttacacatcg aaattgtgtc ttctccccac tacagtacta atggaaatta 1081 tgacggtgtt ctttaccggc actttcagat acccagggta gtccaggaag gggatgttct 1141 atgtgtgcca acaattgggc aagtagagat cctggaagga agtccagaga aactgcccag 1201 gtggcgggaa atgtttttta aagtgaagaa aacagttggg gaagctccag atggaccagc 1261 cagtgcctac ttggccgaca ccacccatac ctccttgtac atggtgggtt ctaccctgag 1321 ccctgttcca tggctccctt cagaggaatc cactctctgg agcagtttgt ctcctccagg 1381 cctggaggcc ttggtgtctg aactctgtgc tgtcctgaag cctcgcctcc agccaggggg 1441 tgccctgctg acaggaacta gcagtgtcct tctacggggc cccccaggct gtgggaagac 1501 cacagtagtt gctgctgcct gtagtcacct tgggctccac ttactgaagg tgccctgctc 1561 cagcctctgt gcagaaagta gtggggctgt ggagacaaaa ctgcaggcca tcttctcccg 1621 ggcccgccgt tgccggcctg cagtcctgtt gctcacagct gtggaccttc tgggccggga 1681 ccgtgatggg ctgggtgagg atgcccgtgt gatggctgtg ctgcgtcacc tcctcctcaa 1741 tgaggacccc ctcaacagct gccctcccct catggttgtg gccaccacaa gccgggccca 1801 ggacctgcct gctgatgtgc agacagcatt tcctcatgag ctcgaggtgc ctgctctgtc 1861 agaggggcag cggctcagca tcctgcgggc cctcactgcc caccttcccc tgggccagga 1921 ggtgaacttg gcacagctag cacggcggtg tgcaggcttt gtggtagggg atctctatgc 1981 ccttctgacc cacagcagcc gggcagcctg caccaggatc aagaactcag gtttggcagg 2041 tggcttgact gaggaggatg agggggagct gtgtgctgcc ggctttcctc tcctggctga 2101 ggactttggg caggcactgg agcaactgca gacagctcac tcccaggccg ttggagcccc 2161 caagatcccc tcagtgtcct ggcatgatgt gggtgggctg caggaggtga agaaggagat 2221 cctggagacc attcagctcc ccctggagca ccctgagcta ctgagcctgg gcctgagacg 2281 ctcaggcctt ctgctccatg ggccccctgg caccggcaag acccttctgg ccaaggcagt 2341 agccactgag tgcagcctta ccttcctcag cgtgaagggg ccagagctca ttaacatgta 2401 tgtgggccaa agtgaggaga atgtgcggga agtgtttgcc agggccaggg ctgcagctcc 2461 atgcattatc ttctttgatg aactggactc tttggcccca agccgggggc gaagtggaga 2521 ttctggagga gtgatggaca gggtggtgtc tcagctcctt gccgagctag atgggctgca 2581 cagcactcag gatgtgtttg tgattggagc caccaacaga ccagatctcc tggaccctgc 2641 ccttctgcgg cctggcagat ttgacaagct ggtgtttgtg ggggcaaatg aggaccgggc 2701 ctcccagcta cgcgttctaa gtgccatcac acgcaaattc aagctagagc catctgtgag 2761 cctggtaaac gtgctagatt gctgccctcc ccagctgacg ggcgcggacc tctactctct 2821 ctgctctgat gctatgacag ctgccctcaa acgcagggtt catgacctgg aggaagggct 2881 ggagccaggt agctcagcac tgatgctcac catggaggac ttgctgcagg ctgccgcccg 2941 gctgcaaccc tcagtcagtg agcaggagct gctccggtac aagcgcatcc agcgcaagtt 3001 tgctgcctgc taggagcccc ccagggtctg ggacccgctc agcatggctg caggtacctt 3061 gatagcccac agagagatct gggaaggaag ggctcctcct caggctgctg caaccactgg 3121 aggcactcta gagatccagg tgcaagtgga ttgagacagc agcaacagct caagagatat 3181 ctctgctact tgcc // LOCUS D83735 2122 bp mRNA PRI 05-SEP-1996 DEFINITION Human adult heart mRNA for neutral calponin, complete cds. ACCESSION D83735 NID g1526431 KEYWORDS neutral calponin. SOURCE Homo sapiens adult heart cDNA to mRNA, clone_lib:CLONTECH cDNA library CAT.# HL3026b. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2122) AUTHORS Masuda,H., Tanaka,K., Takagi,M., Ohgami,K., Sakamaki,T., Shibata,N. and Takahashi,K. TITLE Molecular cloning and characterization of human non-smooth muscle calponin JOURNAL J. Biochem. 120 (2), 415-424 (1996) MEDLINE 97044758 REFERENCE 2 (bases 1 to 2122) AUTHORS Masuda,H. TITLE Direct Submission JOURNAL Submitted (05-MAR-1996) to the DDBJ/EMBL/GenBank databases. Hiroaki Masuda, Osaka Medical Center for Cancer and Cardiovascular Diseases; 1-3-3, Nakamichi, Higashinari-ku, Osaka, Osaka 537, Japan (E-mail:masudah@sb.gunma-u.ac.jp, Tel:06-972-1181, Fax:06-972-7749) FEATURES Location/Qualifiers source 1..2122 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="CLONTECH cDNA library CAT.# HL3026b" /dev_stage="adult" /tissue_type="heart" CDS 28..957 /codon_start=1 /product="neutral calponin" /db_xref="PID:d1012763" /db_xref="PID:g1526432" /translation="MSSTQFNKGPSYGLSAEVKNRLLSKYDPQKEAELRTWIEGLTGL SIGPDFQKGLKDGTILCTLMNKLQPGSVPKINRSMQNWHQLENLSNFIKAMVSYGMNP VDLFEANDLFESGNMTQVQVSLLALAGKAKTKGLQSGVDIGVKYSEKQERNFDDATMK AGQCVIGLQMGTNKCASQSGMTAYGTRRHLYDPKNHILPPMDHSTISLQMGTNKCASQ VGMTAPGTRRHIYDTKLGTDKCDNSSMSLQMGYTQGANQSGQVFGLGRQIYDPKYCPQ GTVADGAPSGTGDCPDPGEVPEYPPYYQEEAGY" 3'UTR 958..2122 misc_feature complement(1649..1929) /note="Alu sequence in antisense orientation" polyA_signal 2099..2104 BASE COUNT 438 a 651 c 591 g 442 t ORIGIN 1 gcccgtcccg ccgcccgccc gccagccatg agctccacgc agttcaacaa gggcccctcg 61 tacgggctgt cggccgaggt caagaaccgg ctcctgtcca aatatgaccc ccagaaggag 121 gcagagctcc gcacctggat cgagggactc accggcctct ccatcggccc cgacttccag 181 aagggcctga aggatggaac tatcttatgc acactcatga acaagctaca gccgggctcc 241 gtccccaaga tcaaccgctc catgcagaac tggcaccagc tagaaaacct gtccaacttc 301 atcaaggcca tggtcagcta cggcatgaac cctgtggacc tgttcgaggc caacgacctg 361 tttgagagtg ggaacatgac gcaggtgcag gtgtctcttc tcgccctggc ggggaaggcc 421 aagactaagg ggctgcagag cggggtggac attggcgtca agtactcgga gaagcaggag 481 cggaatttcg acgatgccac catgaaggct ggccagtgcg tcatcgggct gcagatgggc 541 accaacaaat gcgccagcca gtcgggcatg actgcctacg gcacgagaag gcatctctat 601 gaccccaaga accatatcct gccccccatg gaccactcga ccatcagcct ccagatgggc 661 acgaacaagt gcgccagcca ggtgggcatg acggctcccg ggacccggcg gcacatctat 721 gataccaagc tgggaaccga caagtgtgac aactcctcca tgtccctgca gatgggctac 781 acgcagggcg ccaaccagag cggccaggtc ttcggcctgg gccggcagat atatgacccc 841 aagtactgcc cgcaaggcac agtggccgat ggggctccct cgggcaccgg cgactgcccg 901 gacccggggg aggtccctga atatccccct tactaccagg aggaggccgg ctactgaggc 961 tcccagcacg ctctctcccc acatcgtctt cccatctggg tttttgggtt tttctgtgtt 1021 ttcatctttt tttttttttt tcttgacccg ttcagtgctg ccagtcaacc aagggtctgt 1081 gagtgtcagc gtgggatcag gcagcagagc ttttttcccc tttgccttga tccttcgcaa 1141 ggctgagcca ctgggctgtg ggggaagggg tcaaggccat atcccaatac gtgtagggcg 1201 agggtccctg ctggcacatt caggctgtgc tgggaagaag agacctgggc ttggaaggaa 1261 ccggtccccg acggtttctg gttgcctcgc ctcttccccc ttttgtcagc tgagcagttt 1321 gtggtttcta tgcccgcaag tttcaggaag tattcacaaa agaaaaatac attttttccc 1381 ccaggggtgg ggcaaggaca gtggagagag tgctaggaaa tgagtcccct gggaaagggg 1441 accgggccgt gatgttaaat atctccggct cccaagtgac tggatttgcc taggaccttc 1501 agatcaacag acttcagacc ctcagacctg ccccggggcc aggtggagaa agtgagggcc 1561 gtacaaggaa gtgaaattct gagttgttgg ggctaagcct gaccccctct ccatgctccc 1621 cgccccaact cactctggcc tcagtagatt tttttttcag ttgtggttgt tgcccaggct 1681 ggagtgcagt ggcgccatct tggctcactg cacctccacc ttccgggctc aagcgattct 1741 ccagcctcag cctcctgagt agctaggact gcaggtgctc caccacgccc ggctaatttt 1801 tgtattttta gtagagatgg ggtttcccca tgttggccag gctggtctcg aactcctggc 1861 ctcaggtgtg atccgcccgc ctccgcctcc ccaagcgctg agattacagg tgtgagccac 1921 cgtgcccagg ccctcagtag gttttaagga gtccccagcc ctcctccctt ctgggcccga 1981 ccagcttata ctgctccatc ttccccggcc acatgccccg ccaagtactg cacagggacc 2041 ccccacccag gggccctgct ccgtgagata atgtgaaata cgactgtgga ccaaacgcaa 2101 taaaaccttt gtttgtagga ag // LOCUS D83760 1491 bp mRNA PRI 08-JUL-1997 DEFINITION Homo sapiens mRNA for mother against dpp (Mad) related protein, complete cds. ACCESSION D83760 NID g2251103 KEYWORDS mother against dpp (Mad) related protein. SOURCE Homo sapiens placenta cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1491) AUTHORS Watanabe,T. TITLE Direct Submission JOURNAL Submitted (04-MAR-1996) to the DDBJ/EMBL/GenBank databases. Takeshi Watanabe, Otsuka Pharmaceutical Co.,Ltd, Otsuka GEN Research Institute; 463-10 Kagasuno Kawauchi-cho, Tokushima, Tokushima 771-01, Japan (Tel:0886-65-2888, Fax:0886-37-1035) REFERENCE 2 (bases 1 to 1491) AUTHORS Watanabe,T., Kawai,A., Shimizu,F., Shinomiya,H., Nishino,N., Taniguchi,Y., Hirano,H., Fujiwara,T., Kanemoto,N., Okuno,S., Kyushiki,H., Kuga,Y., Shimada,Y., Nagata,M., Takaichi,A., Horie,M., Saito,A., Maekawa,H. and Takahashi,E. TITLE Mother against dpp gene (Mad gene), long type JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Watanabe,T.K., Suzuki,M., Omori,Y., Hishigaki,H., Horie,M., Kanemoto,N., Fujiwara,T., Nakamura,Y. and Takahashi,E. TITLE Cloning and characterization of a novel member of the human Mad gene family (MADH6) JOURNAL Genomics 42 (3), 446-451 (1997) MEDLINE 97349112 COMMENT Sequence updated (08-Oct-1996) by: Watanabe Takeshi. FEATURES Location/Qualifiers source 1..1491 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" CDS 56..1459 /codon_start=1 /product="mother against dpp (Mad) related protein" /db_xref="PID:d1021974" /db_xref="PID:g2251104" /translation="MHSTTPISSLFSFTSPAVKRLLGWKQGDEEEKWAEKAVDSLVKK LKKKKGAMDELERALSCPGQPSKCVTIPRSLDGRLQVSHRKGLPHVIYCRVWRWPDLQ SHHELKPLECCEFPFGSKQKEVCINPYHYRRVETPVLPPVLVPRHSEYNPQLSLLAKF RSASLHSEPLMPHNATYPDSFQQPPCSALPPSPSHAFSQSPCTASYPHSPGSPSEPES PYQHSVDTPPLPYHATEASETQSGQPVDATADRHVVLSIPNGDFRPVCYEEPQHWCSV AYYELNNRVGETFQASSRSVLIDGFTDPSNNRNRFCLGLLSNVNRNSTIENTRRHIGK GVHLYYVGGEVYAECVSDSSIFVQSRNCNYQHGFHPATVCKIPSGCSLKVFNNQLFAQ LLAQSVHHGFEVVYELTKMCTIRMSFVKGWGAEYHRQDVTSTPCWIEIHLHGPLQWLD KVLTQMGSPHNPISSVS" BASE COUNT 343 a 452 c 376 g 320 t ORIGIN 1 ttgccgtgaa gggctgtgcg gttcccgtgc gcgccggagc ctgctgtggc ctcttatgca 61 ctccaccacc cccatcagct ccctcttctc cttcaccagc cccgcagtga agagactgct 121 aggctggaag caaggagatg aagaggaaaa gtgggcagag aaggcagtgg actctctagt 181 gaagaagtta aagaagaaga agggagccat ggacgagctg gagagggctc tcagctgccc 241 ggggcagccc agcaaatgcg tcacgattcc ccgctccctg gacgggcggc tgcaggtgtc 301 ccaccgcaag ggcctgcccc atgtgattta ctgtcgcgtg tggcgctggc cggatctgca 361 gtcccaccac gagctgaagc cgctggagtg ctgtgagttc ccatttggct ccaagcagaa 421 agaagtgtgc attaaccctt accactaccg ccgggtggag actccagtac tgcctcctgt 481 gctcgtgcca agacacagtg aatataaccc ccagctcagc ctcctggcca agttccgcag 541 cgcctccctg cacagtgagc cactcatgcc acacaacgcc acctatcctg actctttcca 601 gcagcctccg tgctctgcac tccctccctc acccagccac gcgttctccc agtccccgtg 661 cacggccagc taccctcact ccccaggaag tccttctgag ccagagagtc cctatcaaca 721 ctcagttgac acaccacccc tgccttatca tgccacagaa gcctctgaga cccagagtgg 781 ccaacctgta gatgccacag ctgatagaca tgtagtgcta tcgataccaa atggagactt 841 tcgaccagtt tgttacgagg agccccagca ctggtgctcg gtcgcctact atgaactgaa 901 caaccgagtt ggggagacat tccaggcttc ctcccgaagt gtgctcatag atgggttcac 961 cgacccttca aataacagga acagattctg tcttggactt ctttctaatg taaacagaaa 1021 ctcaacgata gaaaatacca ggagacatat aggaaagggt gtgcacttgt actacgtcgg 1081 gggagaggtg tatgccgagt gcgtgagtga cagcagcatc tttgtgcaga gccggaactg 1141 caactatcaa cacggcttcc acccagctac cgtctgcaag atccccagcg gctgcagcct 1201 caaggtcttc aacaaccagc tcttcgctca gctcctggcc cagtcagttc accacggctt 1261 tgaagtcgtg tatgaactga ccaagatgtg tactatccgg atgagttttg ttaagggttg 1321 gggtgctgag tatcatcgcc aggatgtcac cagcaccccc tgctggattg agattcatct 1381 tcatgggcca ctgcagtggc tggacaaagt tctgactcag atgggctctc cacataaccc 1441 catttcttca gtgtcttaac agtcatgtct taagctgcat ttccatagga t // LOCUS D83767 1482 bp mRNA PRI 28-MAR-1997 DEFINITION Human clone N9 Rep-8 mRNA, complete cds. ACCESSION D83767 NID g1913784 KEYWORDS Rep-8. SOURCE Homo sapiens (lab_host:Homo sapiens) embryonal carcinoma cell cell_line:NEC14 cDNA to mRNA, clone_lib:lambda Zap II cDNA library clone:N9. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Yamabe,Y., Ichikawa,K., Sugawara,K., Imamura,O., Shimamoto,A., Suzuki,N., Tokutake,Y., Goto,M., Sugawara,M. and Furuichi,Y. TITLE Cloning and characterization of Rep-8 (D8S2298E) in the human chromosome 8p11.2-p12 JOURNAL Genomics 39 (2), 198-204 (1997) MEDLINE 97179221 REFERENCE 2 (bases 1 to 1482) AUTHORS Yamabe,Y. TITLE Cloning and Characterization of a Novel Gene, WS-2,Within Werner Syndrome Region (8p11.2-12) Obtained by TAIL-PCR JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 1482) AUTHORS Yamabe,Y. TITLE Direct Submission JOURNAL Submitted (04-MAR-1996) to the DDBJ/EMBL/GenBank databases. Yukako Yamabe, AGENE Research Institute; 200 Kajiwara, Kamakura, Kanagawa 247, Japan (E-mail:yukakoya@po.iijnet.or.jp) FEATURES Location/Qualifiers source 1..1482 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="NEC14" /cell_type="embryonal carcinoma cell" /chromosome="8" /clone="N9" /clone_lib="lambda Zap II cDNA library" /lab_host="Homo sapiens" gene 16..828 /gene="Rep-8" CDS 16..828 /gene="Rep-8" /codon_start=1 /db_xref="PID:d1019699" /db_xref="PID:g1913785" /translation="MASRGVVGIFFLSAVPLVCLELRRGIPDIGIKDFLLLCGRILLL LALLTLIISVTTSWLNSFKSPQVYLKEEEEKNEKRQKLVRKKQQEAQGEKASRYIENV LKPHQEMKLRKLEERFYQMTGEAWKLSSGHKLGGDEGTSQTSFETSNREAAKSQNLPK PLTEFPSPAEQPTCKEIPDLPEEPSQTAEEVVTVALRCPSGNVLRRRFLKSYSSQVLF DWMTRIGYHISLYSLSTSFPRRPLAVEGGQSLEDIGITVDTVLILEEKEQTN" polyA_signal 1400..1405 /note="polyA signal" BASE COUNT 468 a 279 c 315 g 420 t ORIGIN 1 gcacgagccg ccaccatggc ttcacgtggg gttgttggca ttttcttcct ctctgctgtc 61 ccccttgtgt gtctggagct ccggcgtggg atcccggata taggaatcaa ggattttctt 121 ttgctttgtg gccggatttt gctactgctt gctcttctta ctttaattat ttctgtgact 181 acctcatggc ttaactcatt taaatctccc caagtttatc tgaaggaaga agaagaaaag 241 aatgagaaaa gacaaaaact tgtgagaaaa aaacaacaag aagcacaagg agagaaggcc 301 agcagataca tagagaatgt tttaaaacct caccaggaaa tgaaattgag aaaactggag 361 gagcgctttt atcaaatgac gggtgaagcc tggaaattaa gcagtggtca caaacttggg 421 ggtgatgaag gtacaagtca gacatctttt gaaacatcaa acagagaagc agcaaagagc 481 cagaacttgc ctaaaccttt aactgaattt ccgtctcctg ctgaacagcc cacatgcaag 541 gagattcctg atttacctga agaaccttct caaacagcag aagaagtagt tactgttgct 601 ctccgatgtc ccagtgggaa tgtcctgagg agaaggtttt tgaagtccta cagctcacag 661 gtcttatttg actggatgac gagaattggg taccacatat ctctatacag cctttctact 721 tcctttccca gacggcctct ggcagtggag ggaggccagt cgctggagga cataggaata 781 actgtggaca ctgtactcat cctggaggag aaggagcaga ccaactagga aagaagggag 841 agctccctgt ttgcatgaag tcagttatgc tatgaccttc tggcacaata aaggcttcac 901 tttcgaatca caccatacct tgattgagct catggcagta aactttgaac attgatatcc 961 atgggaatag gattagaaaa ggattgcttt ctatatataa taatctgtgg actgtgccat 1021 tttacagtgt acccaaatga gaatgaggtt gaaatgtatg cagtaaggta ctcagtaatt 1081 aattggtatt ttttcccagc tgacatgatt tcctcagtgt tagaaaacaa acccttagaa 1141 ctttcctttc tgcctcttca atccatctta ccacacaata tttcatgatt caaattcttc 1201 aaagtcttat acgcaggaat gtttattctg ctgtatttct gtgaaattaa aaacttggaa 1261 gaagcttcaa agctcttgga ggctttaaag ttctttctgt tgggtgtgca ttacagttta 1321 cttaactgat gtttgcgatt tatataattt tgccttgtat taaatgttac aaagttccaa 1381 atgaatcagt attttaaaaa ataaaactat gaaagcatta aaatataggt gaatttttaa 1441 aaaaaaaaaa aaaaaaaaaa aaaaaaaacc aaaggggggg gg // LOCUS D83777 5076 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0193 gene, complete cds. ACCESSION D83777 NID g1228036 KEYWORDS KIAA0193. SOURCE Homo sapiens male myeloblast cell_line:KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5076) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (04-MAR-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 5076) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Tanaka,A. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA 0161 - KIAA 0200) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. (1995) In press REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Tanaka,A. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA0161-KIAA0200) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 3 (1), 17-24 (1996) MEDLINE 96281124 FEATURES Location/Qualifiers source 1..5076 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /chromosome="7" /sex="male" 5'UTR 1..352 gene 353..1393 /gene="KIAA0193" CDS 353..1393 /gene="KIAA0193" /note="expressed ubiquitously with strong expression in brain" /codon_start=1 /db_xref="PID:d1012780" /db_xref="PID:g1228037" /translation="MISRPAWLWGAEMGANEHGVCIANEAINTREPAAEIEALLGMDL VRLGLERGETAKEALDVIVSLLEEHGQGGNYFEDANSCHSFQSAYLIVDRDEAWVLET IGKYWAAEKVTEGVRCICSQLSLTTKMDAEHPELRSYAQSQGWWTGEGEFNFSEVFSP VEDHLDCGAGKDSLEKQEESITVQTMMNTLRDKASGVCIDSEFFLTTASGVSVLPQNR SSPCIHYFTGTPDPSRSIFKPFIFVDDVKLVPKTQSPCFGDDDPAKKEPRFQEKPDRR HELYKAHEWARAIIESDQEQGRKLRSTMLELEKQGLEAMEEILTSSEPLDPAEVGDLF YDCVDTEIKFFK" 3'UTR 1394..5076 BASE COUNT 1358 a 1121 c 1270 g 1327 t ORIGIN 1 gcggccgcag cctcagcacc gcagagcgga gagcggagcc cggagcccgc cgccccagga 61 tggctgcagc tcctccaagt tactgttttg ttgccttccc tccacgtgct aaggatggtc 121 tggtggtatt tgggaaaaat tcagcccggc ccagagatga agtgcaagag gttgtgtatt 181 tctcggctgc tgatcacgaa ccggagagca aggttgagat ttaagttctc aaaggtggct 241 gcaccaaatc ccatcccaag tgcccttctt ccagtgtgaa gggcactcct tccacgaaga 301 gggaaagtgc acttacattt caatcgacca agttccaagg acctatgcca taatgataag 361 cagacccgcc tggctctggg gagcagaaat gggagccaat gaacatggag tgtgcatagc 421 caatgaagcc atcaacacca gagagccagc tgccgagata gaagccttgc tggggatgga 481 tctggtcagg cttggtttag aaagagggga aacagctaaa gaagccttag atgtcattgt 541 ctccttgttg gaagaacatg gacaaggtgg gaattacttt gaagatgcaa actcctgcca 601 cagcttccaa agtgcatatc tgattgtgga tcgtgatgaa gcctgggtgc tcgagaccat 661 agggaagtac tgggctgccg agaaagtcac agagggagtg aggtgcattt gcagtcagct 721 ttcgctcacc actaagatgg atgcagagca tccggaactc aggagttacg ctcagagcca 781 aggttggtgg acgggagagg gcgagttcaa tttttccgaa gtcttttctc cagttgagga 841 tcatctagac tgcggtgctg gcaaagacag cttagaaaaa caagaagaaa gcatcacagt 901 gcagactatg atgaacacct tacgggacaa agccagcgga gtgtgcatag actctgagtt 961 tttcctcacc acagccagtg gagtgtctgt cctgccgcag aatagaagct ctccgtgcat 1021 tcactacttc actggaaccc ctgatccttc caggtccata ttcaagcctt tcatctttgt 1081 tgatgacgta aaacttgtcc ccaaaacaca gtctccctgt tttggggatg acgaccctgc 1141 caaaaaggag cctcggttcc aggagaaacc agaccgccgg catgagctgt acaaagccca 1201 cgagtgggca cgtgccatca tcgaaagtga ccaggagcaa ggtcgcaagc tgaggagcac 1261 catgctggag ctggagaagc aaggcctgga agccatggaa gaaatcctga ccagctccga 1321 gccactggac cctgcggaag tgggggacct tttctatgac tgtgttgaca cggagattaa 1381 gttctttaag tgaagtaagc gttccctttc cccttcttat ttaagacttc ccaccttact 1441 aaattaccag caaaacaaac cactctcctg tttgagtaaa atgagaaagt taatatgtgg 1501 cctccttttc tgaagccaga tcaaactgtt accttgtgtt ccaccttgaa tctcacagcg 1561 tccccttctg caatgtaggt ctccttcctg tgcagtgtaa catgtatccc gttgcctgtt 1621 gttcggttgt gtgactaatt gtggatttta agctgctatt attgtatttc agtggcaatg 1681 gacacattag ccttttacaa gaggactaga gttcatcaag ccttgaaagg caggcttcac 1741 agtgccgagt tggcgggaaa agcaaattct tttgaagtct tagtctttcc ctcagtagcg 1801 gtttctttca ggttaacaag aggcatttgt gcacacacac agggctcttg tgtgtgttgt 1861 caaggggacc ctccgtggcc tcccgtgagt gcatgcctgt agtgcacagt gtctctacag 1921 gtgtcttctg gggggcagaa ccaattggaa ggaagaaagg gacccctctc cagtcctggc 1981 tccttcctac atcctgggct cctgaagaag ctgtcttccc attttccatg cgctgtgctt 2041 atgtgtggtg gactgcagag ctgcttccac ttacaggaga gctgataatt tgttagctgg 2101 aacctattca cttccgagat tcagacatag ccatgctggt ggccttctga atcactgcat 2161 ggatgtccca ggaggcagct ctccccacac agcagcacag ccatcacagg attccttgtg 2221 tagaaatgat tcccagtcta gttaccaaca gctagtctag gagtaattga atggccctat 2281 ggcacagttc cacccacaga gtagtgaatc tctcagccaa ggagggaaag aaaaggaaga 2341 actcttgact atttagattc tagttaaata tctggaatcc tagcagtcac tacattatct 2401 cagcagagag actttaatta aactgatttg tttccaatgt cgggttcact taaaggattt 2461 gacttaccac cagagcatag aaaagcatgc aaggaagacc agatgggctt agcattggga 2521 agacagaggg caaggaggtg atagatggat atagaagcat ttctctgcag gataccagtt 2581 caggccccac cattcctgcc aaggccatta catcccacaa acccaaatac aaagcagctg 2641 acttccctgg atcttccccc cactcctcac acctcacatg tcccaggagc tgccttcatt 2701 caggcgggta gctgcactgg gcatggggtg gtggtgggag cttaccgcca cctattcaag 2761 ctctcagcta ctcctgaaac gggcagagat gatgaacaga agtgtatgta aatacagcag 2821 ctagtgggag agcaccagtt gggcctaatc ctgcctcatc attcttggca ggaatctgca 2881 aatggaaaca ttgtgagtat cagcaatctg ggaagtgaca gggttaataa ctccttccca 2941 gaagctgtat catgagattt tgaggggacc gagccctgtt acatggatgt gaacagtgag 3001 gatcagaggt tttatcagaa cacattcttt ttttctacca actctccaga gcgtgagtat 3061 aggagtgcca tgagcttttt agtcagcagt tttgtaaact ctgtatataa aatcattaac 3121 cacacattgt gggtgatggg aagacgattt cagctgacag agttaatggc aaccaataat 3181 ggtggcctgt agctgctaag agcttcacgc aggtttggcc tgggctttca ctgttggtga 3241 atttagagtg tccttttagg tggggcggct attctaaaag tgtctttcta tcactgttaa 3301 ggggggggga aagtgaggtt cgaggatgac gtaggtaact ctcccctccc aagtccatgt 3361 tccaagtggc tatgtaaagc aagatgatac agaaagctgc tctaaaatct cactgagtga 3421 tttcaccttc gcctactatg aaatgtctca tcagacctga catgtctgag ataaccaagg 3481 tgattcagga tttgatcaaa agaagtctag taagaattaa ttacacagaa gcctcctttc 3541 atttctatgg gccaaacaaa ggccatggat aaccctaccc gctttatgtc attacccatt 3601 gggaaacaca atggctactt ctgttagggt acattgacct tggtcaagca tcttaaagaa 3661 ggcaacccta attgagagct gtcttggcta atactctgca ccacaattgt gatgtcctag 3721 tcctaccact agagggcatg gtacagcctg gcaaaagtta aaaggggtgt ggcagctccc 3781 atcaggtctg gaggtggtct ataagcacag ttgacagttg tgcattggga tgggtggaga 3841 aagacgacaa gagagcagag aatctgctga tgtggctgcg cttactttta gtgactttat 3901 gtacttatat taacagctgg aaataggttg ttgggttttg agcaggctgt tatagtgagg 3961 aatgttcatt tttaaatgtt cctaacagat tttgcttttg aaaaatgctt gttacatgaa 4021 taatttgtgg accagggatt gcttttctga aggcagtata gggaacatga atattcaaga 4081 tgaaatacaa aaattatgtt taagggtcat agtgtataag tagcttccta ggaaaccctt 4141 tgtgtatctt ttcagactgg ggtgggggct gagcatgctt gtgcagaaag aagccatagc 4201 cagaaaggac agaatctctc ccccactccc ttgccccata accaaacata agctagctag 4261 tcttgtctaa tagatgggat ttactatagg tgaagatagc cctcatattc aaggacagaa 4321 gctctggcag gagtaaatta gcaaagcaga aatagtaccc tttcattctt ggaggtgctt 4381 tgaaatttta ggtagaatat aatcgaaatt atggaggttc cttagtgctc aataatataa 4441 gacctggtgt tattagaacg agtctttctt ataaactaac agagcaggta tatgcctgtt 4501 agaccttagc tgtggggttc ctttactatt gggtgaatca ttaggtataa aaaataatca 4561 tcaaccaggc aaattacttt gcttcctagc tgatgtcatc ccacattggt acaggtgtta 4621 ttcagtactg ggtggttcag cagggaagcc gggtgggacc agtgtgtctg tcatgaaacc 4681 actaactgca ttcctgactg aagagccatc tgtcatttat tggggaaggt cttcagttga 4741 gctctcagcc ttaggaagga agcacgtgga ggagggacgg aggaggttcc cttgctgggc 4801 atgcttcgta gagggccagg agcagcaggt catgtgcaca tgccgttgca gcacaagctt 4861 atgcttcccg tagccgtggc ttttcattct gcacagtccc aggtcccagc tcccctctta 4921 tggtttctgt cataatgtgc tttatctgat tgactccaaa catcccgaaa tgtcacctgc 4981 agatttctcg tgggaaccaa tatgtacatg tttgcaatta tgctgtgaga atttaaatgt 5041 gttagatgga aaatgctatt ggcagggaat aataat // LOCUS D83779 5022 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0195 gene, complete cds. ACCESSION D83779 NID g1228040 KEYWORDS KIAA0195. SOURCE Homo sapiens male myeloblast cell_line:KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5022) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (04-MAR-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 5022) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Tanaka,A. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA 0161 - KIAA 0200) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. (1995) In press REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Tanaka,A. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA0161-KIAA0200) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 3 (1), 17-24 (1996) MEDLINE 96281124 FEATURES Location/Qualifiers source 1..5022 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /chromosome="17" /sex="male" 5'UTR 1..203 gene 204..4274 /gene="KIAA0195" CDS 204..4274 /gene="KIAA0195" /note="the KIAA0195 gene is expressed ubiquitously.; the KIAA0195 protein retains 9 hydrophobic domains." /codon_start=1 /db_xref="PID:d1012782" /db_xref="PID:g1228041" /translation="MDLKEKHLGEPPSALGLSTRKALSVLKEQLEAVLEGHLRERKKC LTWKEVWRSSFLHHSNRCSCFHWPGASLMLLAVLLLLGCCGGQPAGSRGVGLVNASAL FLLLLLNLVLIGRQDRLKRREVERRLRGIIDQIQDALRDGREIQWPSAMYPDLHMPFA PSWSLHWAYRDGHLVNLPVSLLVEGDIIALRPGQESFASLRGIKDDEHIVLEPGDLFP PFSPPPSPRGEVERGPQSPQQHRLFRVLETPVIDNIRWCLDMALSRPVTALDNERFTV QSVMLHYAVPVVLAGFLITNALRFIFSAPGVTSWQYTLLQLQVNGVLPILPLLFPVLW VLATACGEARVLAQMSKASPSSLLAKFSEDTLSSYTEAVSSQEMLRCIWGHFLRVLGG TSPTLSHSSSLLHSLGSVTVLCCVDKQGILSWPNPSPETVLFFSGKVEPPHSSHEDLT DGLSTRSFCHPEPHERDALLAGSLNNTLHLSNEQERGDWPGEAPKPPEPYSHHKAHGR SKHPSGSNVSFSRDTEGGEEEPSKTQPGMESDPYEAEDFVCDYHLEMLSLSQDQQNPS CIQFDDSNWQLHLTSLKPLGLNVLLNLCDASVTERLCRFSDHLCNIALQESHSAVLPV HVPWGLCELARLIGFTPGAKELFKQENHLALYRLPSAETMKETSLGRLSCVTKRRPPL SHMISLFIKDTTTSTEQMLSHGTADVVLEACTDFWDGADIYPLSGSDRKKVLDFYQRA CLSGYCSAFAYKPMNCALSSQLNGKCIELVQVPGQSSIFTMCELPSTIPIKQNARRSS WSSDEGIGEVLEKEDCMQALSGQIFMGMVSSQYQARLDIVRLIDGLVNACIRFVYFSL EDELKSKVFAEKMGLETGWNCHISLTPNGDMPGSEIPPSSPSHAGSLHDDLNQVSRDD AEGLLLMEEEGHSDLISFQPTDSDIPSFLEDSNRAKLPRGIHQVRPHLQNIDNVPLLV PLFTDCTPETMCEMIKIMQEYGEVTCCLGSSANLRNSCLFLQSDISIALDPLYPSRCS WETFGYATSISMAQASDGLSPLQLSGQLNSLPCSLTFRQEETISIIRLIEQARHATYG IRKCFLFLLQCQLTLVVIQFLSCLVQLPPLLSTTDILWLSCFCYPLLSISLLGKPPHS SIMSMATGKNLQSIPKKTQHYFLLCFLLKFSLTISSCLICFGFTLQSFCDSSRDRNLT NCSSVMLPSNDDRAPAWFEDFANGLLSAQKLTAALIVLHTVFISITHVHRTKPLWRKS PLTNLWWAVTVPVVLLGQVVQTAVDLQLWTHRDSHVHFGLEDVPLLTWLLGCLSLVLV VVTNEIVKLHEIRVRVRYQKRQKLQFETKLGMNSPF" 3'UTR 4275..5022 BASE COUNT 942 a 1635 c 1404 g 1041 t ORIGIN 1 cggacatggc tgcggccccc ggaggagggg acgtgaagtg aggagggggt tgggagggga 61 gaggacgcgg gcgaggaaga ccagccccgg ggccccgatg ttgtgactgt gacagactca 121 ctggggtttg tacatgctgg ggaggagcct tcctttcagg ggtgaccaca ttcatctggg 181 catgcctgca gtactcttgg cccatggacc tgaaggagaa gcacctgggc gagcctccct 241 cagccctggg cctgtccacg cggaaggccc tcagcgtcct gaaggagcag ctggaggcag 301 tgctggaagg acatctcagg gagcggaaga agtgtctgac gtggaaggag gtgtggagaa 361 gcagcttcct ccaccacagt aaccgctgct cctgcttcca ctggccgggg gcctcactca 421 tgctactggc cgtgctgctg ctgctgggct gctgcggggg acagccagcc gggagccgtg 481 gggtggggct ggtgaatgcc tcggccttgt tcctgttact gcttctcaac cttgtgctca 541 tcgggcggca agaccggctg aagcgtcggg aggtagagcg gaggctgcga gggatcattg 601 accaaatcca agatgccctc agggatggca gggagatcca gtggcccagt gccatgtatc 661 cagacctcca catgcctttt gcgccatcct ggtccttgca ctgggcctac agagacggac 721 acctggtcaa cctgccagtc agcctgctgg ttgaaggaga catcatagct ttgaggcctg 781 gccaggaatc gtttgcttct ctgaggggga tcaaggatga cgagcacatc gtcctggagc 841 cgggagacct cttccccccc ttctcccctc caccctcacc ccggggagaa gtggagagag 901 ggccacagag cccccagcag caccggcttt tccgtgtcct tgagacccct gtgattgaca 961 acatcagatg gtgcctggac atggccctgt cccgaccagt cactgccctg gacaatgagc 1021 ggttcacagt gcagtcggtg atgctacact atgctgtgcc cgtggtcctg gccggcttcc 1081 tcatcaccaa tgccctgcgc ttcatcttca gtgccccggg ggtcacttcc tggcagtaca 1141 ccctcctcca gctccaggtg aatggcgtcc tgcccatcct ccccctgctc tttccagtcc 1201 tctgggttct ggcaactgcc tgtggagagg cccgtgtcct ggcccagatg agcaaggcct 1261 cacccagctc cctgctggct aagttctcag aggatactct cagcagctat acggaggctg 1321 tctcctctca ggaaatgctg cgctgcattt ggggccactt cctgagggtg ctcgggggga 1381 catcgccaac gctgagccac agttccagcc tgctgcacag cctgggctct gtcacggtcc 1441 tgtgctgtgt ggacaaacag gggatcctgt catggccaaa tcccagccca gagactgtac 1501 tgttcttcag cgggaaggtg gagccccctc acagcagcca tgaggacctc accgatggcc 1561 tatccacccg ctccttctgc catcccgagc cccatgaacg agacgccctc ctggctggct 1621 ccctgaacaa caccctgcac ctttccaatg agcaggagcg tggcgactgg cctggcgagg 1681 ctcccaagcc ccccgagccc tattcacacc acaaagcgca tggccgcagc aaacacccat 1741 ctggctccaa cgtgagcttc agcagggaca ccgagggtgg tgaagaagag cccagcaaga 1801 cccagcctgg gatggagagc gacccctacg aagcagagga ctttgtgtgt gactaccacc 1861 tggagatgct gagcctgtcc caggaccagc agaacccctc ctgcatccag tttgatgact 1921 ccaactggca gctgcacctc acctccctca aacccctggg cctcaatgtg ctgctgaacc 1981 tgtgtgatgc cagcgtcacc gagcgcctgt gccgattctc cgaccacctg tgcaacattg 2041 ccctgcaaga gagccacagc gccgtgctgc ccgtccatgt gccctggggc ctctgcgagc 2101 ttgcccgcct cattggcttc actcctgggg ccaaggagct tttcaagcag gagaaccatc 2161 tggcgctgta ccgcctcccc agtgccgaga caatgaagga gacatcgctg gggcggctct 2221 cctgtgtcac caagcggcgg cctcccctca gccacatgat cagcctcttc attaaagaca 2281 ccaccaccag cacagagcag atgctgtccc atggcaccgc tgatgtggtc ttagaggcct 2341 gcacagactt ctgggacgga gctgacatct accctctctc gggatctgac agaaagaaag 2401 tgctggactt ctaccagcga gcctgcctgt ctgggtattg ctctgccttc gcctacaagc 2461 ccatgaactg cgccctgtcc tctcagctca atggcaagtg catcgagctg gtacaggtgc 2521 ccggccaaag cagcatcttc accatgtgcg agctgcccag caccatcccc atcaagcaga 2581 acgcccgccg cagcagctgg agctctgacg aagggatcgg ggaggtgctg gagaaggaag 2641 actgcatgca ggccctgagc ggccagatct tcatgggcat ggtgtcctcc cagtaccagg 2701 cccggctgga catcgtgcgc ctcattgatg ggcttgtcaa cgcctgcatc cgctttgtct 2761 acttctcttt ggaggatgag ctcaaaagca aggtgtttgc agaaaaaatg ggcctggaga 2821 caggctggaa ctgccacatc tccctcacac ccaatggtga catgcctggc tccgagatcc 2881 ccccctccag ccccagccac gcaggctccc tgcatgatga cctgaatcag gtgtcccgag 2941 atgatgcaga agggctcctc ctcatggagg aggagggcca ctcggacctc atcagcttcc 3001 agcctacgga cagcgacatc cccagcttcc tggaggactc caaccgggcc aagctgcccc 3061 ggggtatcca ccaagtgcgg ccccacctgc agaacattga caacgtgccc ctgctagtgc 3121 cccttttcac cgactgcacc ccagagacca tgtgtgagat gataaagatc atgcaagagt 3181 acggggaggt gacctgctgc ctgggcagct ctgccaacct gcggaacagc tgcctcttcc 3241 tccagagcga catcagcatt gccctggatc ccctgtaccc atcccgttgc tcctgggaga 3301 cctttggcta cgccaccagc atcagcatgg cccaggcctc ggatggcctt tctcccctgc 3361 agctgtcagg gcagctcaac agcctgccct gttccctgac ctttcgccag gaggagacca 3421 tcagcatcat ccggcttatc gaacaggctc ggcatgccac ctatggcatc cgtaagtgct 3481 tcctcttcct gctgcagtgc cagctgactc ttgtggtcat ccagttcctt tcttgcctgg 3541 tccagctgcc gccactcctg agtaccaccg acatcctgtg gctgtcctgc ttttgctacc 3601 ctctgctcag catctctctg ctggggaagc ccccccatag ctccatcatg tctatggcaa 3661 cggggaaaaa cctccagtcc attcccaaga agacccagca ctacttcctg ctctgcttcc 3721 tgctcaagtt cagcctcacc atcagctcct gcctcatctg ctttggcttc acactgcaga 3781 gcttctgtga cagctcccgg gaccgcaacc tcaccaactg ctcctccgtc atgctgccca 3841 gcaacgacga cagggctcca gcctggtttg aggactttgc caatggactg ctgtcggctc 3901 agaagctcac ggccgccctg attgtcctgc acactgtctt catttccatc acccatgtgc 3961 atcgcaccaa gcccctgtgg agaaagagcc ccttgaccaa cctctggtgg gccgtgacag 4021 tgcctgtggt gctgctgggt caggtggtcc agacggctgt ggacctgcag ctgtggacac 4081 acagggacag ccacgtccac tttggcctgg aggacgtgcc cctgctgaca tggctcctgg 4141 gctgcctgtc cctggtcctt gtggtggtga ccaatgagat cgtgaagcta catgagattc 4201 gggtccgagt ccgctaccag aagcgacaga agctgcagtt tgaaactaag ctgggcatga 4261 actctccctt ctgagccact ggctgtggtg gctgtagttg cccccgtccc tggggctaaa 4321 gccagaccca tttctgaaca ggggagtttg tatcatgaat gtttccaggt ttgctcctgc 4381 acccgtggca ctggaaaccc agctccccgt gtcagacccc gctgtcttcc tgagccctgg 4441 ggctcactgt ggaggagctg acggcctggg cccttggcca gtcctggctc ttccctgggc 4501 ctcaccaggg acactcttga atgtatggcc tcaggcgctc cctagagggg ccctaaaccc 4561 cctcacctgt gagctacccc ctttagggat cccttgcccc cttggagatc ccttgccccc 4621 cagtgcctct gctcgtgggt ccctggacac ggccttgaag ccaaccttct ttggaggagc 4681 aacagcagca gccttggccg acgcgtccaa ctcccaaggc tgccgtggag ggcagggggg 4741 tggtgcttgc ctggatgtgg ccccgagtgc ctcccctccc tccctctgtg ggggagtctc 4801 ccgcctgaac ctgaagatgg agcagggccc ccgcttcgcc ctggagcctc ttcctgtgcc 4861 tggctcaagc tggctgcctg tcagtcttgg ggaatctggc ccaggtctcc tcagcctctg 4921 ccccagttct gggagaagtt tctactggtg tatatttttt actggaaatg agccttttag 4981 gaatgaatgt agactggttt gtattaaaat gtgtcaattg ct // LOCUS D83780 4103 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0196 gene, complete cds. ACCESSION D83780 NID g1228042 KEYWORDS KIAA0196. SOURCE Homo sapiens male myeloblast cell_line:KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4103) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (04-MAR-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 4103) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Tanaka,A. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA 0161 - KIAA 0200) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. (1995) In press REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Tanaka,A. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA0161-KIAA0200) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 3 (1), 17-24 (1996) MEDLINE 96281124 FEATURES Location/Qualifiers source 1..4103 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /chromosome="8" /sex="male" 5'UTR 1..273 gene 274..3753 /gene="KIAA0196" CDS 274..3753 /gene="KIAA0196" /note="the KIAA0196 gene is expressed ubiquitously." /codon_start=1 /db_xref="PID:d1012783" /db_xref="PID:g1228043" /translation="MLDFLAENNLCGQAILRIVSCGNAIIAELLRLSEFIPAVFRLKD RADQQKYGDIIFDFSYFKGPELWESKLDAKPELQDLDEEFRENNIEIVTRFYLAFQSV HKYIVDLNRYLDDLNEGVYIQQTLETVLLNEDGKQLLCEALYLYGVMLLVIDQKIEGE VRERMLVSYYRYSAARSSADSNMDDICKLLRSTGYSSQPGAKRPSNYPESYFQRVPIN ESFISMVIGRLRSDDIYNQVSAYPLPEHRSTALANQAAMLYVILYFEPSILHTHQAKM REIVDKYFPDNWVISIYMGITVNLVDAWEPYKAAKTALNNTLDLSNVREQASRYATVS ERVHAQVQQFLKEGYLREEMVLDNIPKLLNCLRDCNVAIRWLMLHTADSACDPNNKRL RQIKDQILTDSRYNPRILFQLLLDTAQFEFILKEMFKQMLSEKQTKWEHYKKEGSERM TELADVFSGVKPLTRVEKNENLQAWFREISKQILSLNYDDSTAAGRKTVQLIQALEEV QEFHQLESNLQVCQFLADTRKFLHQMIRTINIKEEVLITMQIVGDLSFAWQLIDSFTS IMQESIRVNPSMVTKLRATFLKLASALDLPLLRINQANSPDLLSVSQYYSGELVSYVR KVLQIIPESMFTSLLKIIKLQTHDIIEVPTRLDKDKLRDYAQLGPRYEVAKLTHAISI FTEGILMMKTTLVGIIKVDPKQLLEDGIRKELVKRVAFALHRGLIFNPRAKPSELMPK LKELGATMDGFHRSFEYIQDYVNIYGLKIWQEEVSRIINYNVEQECNNFLRTKIQDWQ SMYQSTHIPIPKFTPVDESVTFIGRLCREILRITDPKMTCHIDQLNTWYDMKTHQEVT SSRLFSEIQTTLGTFGLNGLDRLLCFMIVKELQNFLSMFQKIILRDRTVQDTLKTLMN AVSPLKSIVANSNKIYFSAIAKTQKIWTAYLEAIMKVGQMQILRQQIANELNYSCRFD SKHLAAALENLNKALLADIEAHYQDPSLPYPKEDNTLLYEITAYLEAAGIHNPLNKIY ITTKRLPYFPIVNFLFLIAQLPKLQYNKNLGMVCRKPTDPVDWPPLVLGLLTLLKQFH SRYTEQFLALIGQFICSTVEQCTSQKIPEIPADVVGALLFLEDYVRYTKLPRRVAEAH VPNFIFDEFRTVL" 3'UTR 3754..4103 BASE COUNT 1222 a 887 c 874 g 1120 t ORIGIN 1 aggggcggaa gtcggggtct gacccgctcc aggtccggga ctgcggatag aagaggaccg 61 ccgccttgag ggaggggtgg aaactgggtg ccggctccgc gcgcgacctc cggccctgcg 121 cgtgcgccgt ggcgcggccc ggctgacagg ttctttaatg gaggagccaa tctctctgca 181 cacctggttt catctaataa tatacagaca ccagctctga ggccagttaa tcatccccag 241 tgtccaggca cagagtagtc ggtccgcctc acaatgttgg actttctagc cgagaacaac 301 ctctgtggcc aagcaatcct aaggattgtt tcctgtggta atgccatcat tgctgaactt 361 ttgagactct ctgagtttat tcctgctgtg ttcaggttaa aagacagagc tgatcaacag 421 aaatatggag atatcatatt tgatttcagc tattttaagg gtccagaatt atgggaaagc 481 aaactggatg ctaagccaga gctacaggat ttagatgaag aatttcgtga aaacaacata 541 gaaattgtga ccagatttta tttagcattt caaagtgtac ataaatatat tgtagactta 601 aacagatatc tagatgatct caatgaaggg gtttatattc agcaaacctt agaaactgtg 661 cttctcaatg aagatggaaa acaacttcta tgtgaagcac tgtacttata tggagttatg 721 ctactggtca ttgaccaaaa gattgaagga gaagtcagag agaggatgct ggtttcttac 781 taccgataca gtgctgctcg atcttctgct gattcaaata tggacgatat ttgtaagctg 841 cttcgaagta caggttattc tagccaacca ggtgccaaaa gaccatccaa ctatcccgag 901 agctatttcc agagagtgcc tatcaacgaa tccttcatca gtatggtcat tggtcgactg 961 agatctgatg atatttacaa ccaggtctca gcgtatcctt tgccggagca tcgcagcaca 1021 gccctggcaa accaagctgc catgctgtac gtgattctct actttgagcc ttccatcctt 1081 cacacccatc aagcaaaaat gagagagata gtggataaat actttccaga taattgggta 1141 attagtattt acatggggat cacagttaat ctagtagatg cttgggaacc ttacaaagct 1201 gcaaaaactg ctttaaataa taccctggac ctttcaaatg tcagagaaca ggcaagcaga 1261 tatgctactg tcagtgaaag agtgcatgct caagtgcagc aatttctaaa agaaggttat 1321 ttaagggagg agatggttct ggacaatatc ccaaagcttc tgaactgcct gagagactgc 1381 aatgttgcca tccgatggct gatgcttcat acagcagact cagcctgtga cccaaacaac 1441 aaacgccttc gtcaaatcaa ggaccagatt ctaacagact ctcggtacaa tcccaggatc 1501 ctcttccagc tgctgttgga tactgcacaa tttgagttta tactcaaaga gatgttcaag 1561 caaatgcttt cagaaaagca aaccaaatgg gagcattaca agaaagaggg ttcggagcgg 1621 atgactgagc ttgctgatgt cttttcagga gtgaaacccc taaccagagt ggagaaaaat 1681 gaaaaccttc aagcttggtt cagagagatc tcaaaacaaa tattgtcttt aaattatgat 1741 gattctactg ctgcgggcag aaaaactgta caactgatac aagctttgga agaggttcaa 1801 gaattccacc agttggaatc caatctgcaa gtatgtcagt ttcttgccga tactcgaaag 1861 tttcttcatc aaatgatcag aaccattaac attaaagagg aggttctgat cacaatgcag 1921 atcgttgggg acctttcttt cgcttggcag ttgattgaca gtttcacatc catcatgcaa 1981 gaaagcataa gggtaaatcc atccatggtt actaaactca gagctacctt cctaaagctt 2041 gcctctgccc tcgatctgcc ccttcttcgt attaatcagg caaatagccc cgacctgctc 2101 agcgtgtcac agtactattc tggagagttg gtatcctatg tgagaaaagt tttgcagatc 2161 atcccagaaa gcatgtttac atctcttcta aagatcataa agcttcagac ccacgacatt 2221 attgaagtgc ctacccgcct ggacaaagac aagctgaggg actatgctca gctaggccca 2281 cgatacgagg ttgccaagct tactcatgct atttccattt ttactgaagg catcttaatg 2341 atgaaaacga ctttggttgg catcatcaag gtggatccaa agcagttgct ggaagatgga 2401 ataaggaaag agcttgtgaa gcgcgttgcc tttgccctgc ataggggact gatattcaac 2461 cctcgagcca agccaagtga attgatgccc aagctgaaag agttgggagc gaccatggat 2521 ggattccatc gttcttttga atacatacag gactatgtca acatttatgg tctgaagatt 2581 tggcaggaag aagtatctcg tatcataaat tacaacgtgg agcaagagtg taataacttt 2641 ctaagaacga agattcaaga ttggcaaagc atgtaccagt ccactcatat tccaataccc 2701 aagtttaccc ctgtggatga gtctgtaacg tttattggtc gactctgcag agaaatcctg 2761 cggatcacag acccaaaaat gacatgtcac atagaccagc tgaacacttg gtatgatatg 2821 aaaactcatc aggaagtgac cagcagccgc ctcttctcag aaatccagac caccttggga 2881 acctttggtc taaatggctt agacaggctt ctgtgcttta tgattgtaaa agagttacag 2941 aatttcctca gtatgtttca gaaaattatc ctgagagaca gaactgttca ggacacttta 3001 aaaaccctca tgaatgctgt cagtccccta aaaagtattg tcgcaaattc aaataaaatt 3061 tatttttccg ccattgccaa aacacagaag atttggactg cgtatctcga ggctataatg 3121 aaggttgggc agatgcagat tctgagacaa cagattgcca atgaattaaa ttattcttgt 3181 cggtttgatt ctaaacatct ggcagctgct ctggagaatc tcaataaggc tctcctagca 3241 gacattgaag cccactatca ggacccttca cttccttacc ccaaagaaga taacacactt 3301 ttatatgaaa tcacagccta tctggaggca gctggcattc acaacccact gaataagata 3361 tacataacaa caaagcgctt accctatttt ccaattgtaa actttctatt tttgatcgct 3421 cagttgccaa aacttcaata caacaaaaat ctgggaatgg tctgccgaaa accgaccgac 3481 ccggttgatt ggccaccgct tgtcctggga ctgctcactc tgctgaagca gttccattcc 3541 cggtacaccg agcagttcct ggcgctgatt ggccagttta tctgctccac ggtggagcag 3601 tgtacaagcc agaagatacc tgaaattcct gcagatgttg tgggtgccct tctgttcctg 3661 gaggattatg ttcggtacac aaagctaccc aggagggttg ctgaagcaca tgtgcctaat 3721 ttcatttttg atgagttcag aacagtgctg taactgtttt tcctacttct tcaatggaag 3781 gattgtcctt agatcttccc accatcacaa atgaatttga agatgaaaag aaactcagtt 3841 gctcatacaa ctgcattttt tctgtctatt atgggaaaca tcagacgttc tgagtaagat 3901 atatctcatg gcattagtta atataactga tattgtttaa atcatggtat tacatgcaat 3961 ttatatcaga taaaagcaga acacattttt gtactgcctc tcttaaatgc tgaatgtaac 4021 tgttatgtat aaatccattt agttttatgt tctaaataac tatttgtgca actccagatt 4081 ttcagtaaaa tagtattact agt // LOCUS D83785 5713 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0200 gene, complete cds. ACCESSION D83785 NID g1663697 KEYWORDS KIAA0200. SOURCE Homo sapiens male myeloblast cell_line:KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5713) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (04-MAR-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 5713) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Tanaka,A. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA 0161 - KIAA 0200) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. (1995) In press REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Tanaka,A. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA0161-KIAA0200) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 3 (1), 17-24 (1996) MEDLINE 96281124 FEATURES Location/Qualifiers source 1..5713 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /chromosome="5" /sex="male" 5'UTR 1..263 gene 264..3314 /gene="KIAA0200" CDS 264..3314 /gene="KIAA0200" /note="expressed ubiquitously; product similar to D.melanogaster mam protein." /citation=[3] /codon_start=1 /db_xref="PID:d1012788" /db_xref="PID:g1663698" /translation="MVLPTCPMAEFALPRHSAVMERLRRRIELCRRHHSTCEARYEAV SPERLELERQHTFALHQRCIQAKAKRAGKHRQPPAATAPAPAAPAPRLDAADGPEHGR PATHLHDTVKRNLDSATSPQNGDQQNGYGDLFPGHKKTRREAPLGVAISSNGLPPASP LGQSDKPSGADALQSSGKHSLGLDSLNKKRLADSSLHLNGGSNPSESFPLSLNKELKQ EPVEDLPCMITGTVGSISQSNLMPDLNLNEQEWKELIEELNRSVPDEDMKDLFNEDFE EKKDPESSGSATQTPLAQDINIKTEFSPAAFEQEQLGSPQVRAGSAGQTFLGPSSAPV STDSPSLGGSQTLFHTSGQPRADNPSPNLMPASAQAQNAQRALAGVVLPSQGPGGASE LSSAHQLQQIAAKQKREQMLQNPQQATPAPAPGQMSTWQQTGPSHSSLDVPYPMEKPA SPSSYKQDFTNSKLLMMPSVNKSSPRPGGPYLQPSHVNLLSHQPPSNLNQNSANNQGS VLDYGNTKPLSHYKADCGQGSPGSGQSKPALMAYLPQQLSHISHEQNSLFLMKPKPGN MPFRSLVPPGQEQNPSSVPVQAQATSVGTQPPAVSVASSHNSSPYLSSQQQAAVMKQH QLLLDQQKQREQQQKHLQQQQFLQRQQHLLAEQEKQQFQRHLTRPPPQYQDPTQGSFP QQVGQFTGSSAAVPGMNTLGPSNSSCPRVFPQAGNLMPMGPGHASVSSLPTNSGQQDR GVAQFPGSQNMPQSSLYGMASGITQIVAQPPPQATNGHAHIPRQTNVGQNTSVSAAYG QNSLGSSGLSQQHNKGTLNPGLTKPPVPRVSPAMGGQNSSWQHQGMPNLSGQTPGNSN VSPFTAASSFHMQQQAHLKMSSPQFSQAVPNRPMAPMSSAAAVGSLLPPVSAQQRTSA PAPAPPPTAPQQGLPGLSPAGPELGAFSQSPASQMGGRAGLHCTQAYPVRTAGQELPF AYSGQPGGSGLSSVAGHTDLIDSLLKNRTSEEWMSDLDDLLGSQ" 3'UTR 3315..5713 BASE COUNT 1317 a 1672 c 1502 g 1222 t ORIGIN 1 cggccgcggc ggtagcgcgg aaaacaatgg ggccggggcg gtggggagag gccgaggctt 61 gaggtaggca gcaagcgccg gctgggggtc gggccgagcg gggcaggagg aaaacccgcc 121 gccgcgcgcg agcccgctcc gctgccctcg ggggcatggc gcggccgtga ggcggagagg 181 ggtagccgcg gggagcgaag cccgcagtgc cagccggccc cgagaggccc ggccccgggc 241 ccggcccgtg cagcccgcgg cccatggtgc tgcccacctg ccccatggcg gagttcgcgc 301 tgccgcggca cagcgcggtc atggagcgcc ttcgccggcg catcgagctg tgccggcgcc 361 accacagcac ctgcgaggcc cgctacgagg ccgtgtcgcc cgagcgcctg gagctggagc 421 gccaacacac cttcgccctg caccagcgct gcatccaggc caaggccaag cgcgccggga 481 agcacaggca gccgcccgcc gccacggccc cggcgcccgc cgccccggcc ccgcgcctgg 541 acgccgctga cggccccgag cacggccgcc cggccacgca tcttcatgat acagttaaga 601 ggaatcttga cagcgccact tcccctcaga atggcgatca acagaatggc tacggggacc 661 tctttcctgg gcataagaag actcgccggg aggcccctct gggagttgcc atctcttcca 721 atggactgcc tccagcctcc cccctcggtc agtctgacaa gccttctgga gccgacgccc 781 tgcagtccag tgggaagcac tctctggggc tagactctct caacaaaaag cgtctggctg 841 actccagcct tcacttgaat ggaggcagta accccagtga gtcatttcct ctgagcctga 901 ataaagaact gaagcaggag cctgtcgaag acctgccttg catgatcact gggactgtcg 961 gctccatatc gcaaagcaac ctcatgccag acctcaacct taacgagcag gagtggaagg 1021 agctcatcga ggagctgaac aggtcggtgc ccgatgaaga catgaaggac ctgtttaatg 1081 aggacttcga ggagaagaag gacccagagt cttctggctc tgccacacaa acccccttgg 1141 cacaggacat taatattaag acggaattct ctccagcagc ctttgagcaa gaacagttag 1201 gctctccaca agtgagggcc gggtctgcag ggcagacctt tctggggcct tcctctgccc 1261 ctgtgagtac agattccccc agcctagggg gctcccaaac cttattccac acctctggtc 1321 agccccgggc ggacaatccc agtccaaacc tgatgccggc atcagcccag gcccagaacg 1381 cacaaagagc ccttgcaggt gtggtattgc ccagtcaggg cccaggaggg gcctcagagc 1441 tgtcctctgc ccaccagctc cagcagatcg ctgccaagca gaagcgcgag cagatgctcc 1501 agaacccaca gcaggccacc ccggcaccag ccccgggcca gatgtccaca tggcagcaga 1561 cggggccctc ccacagttcc ttagatgtcc cttaccccat ggagaagcct gccagccctt 1621 ccagctacaa gcaagacttc actaactcca aactgctcat gatgcctagt gtgaataaga 1681 gttcccctcg gcccggaggc ccctacctcc agcccagcca tgtgaacctg ctgagtcacc 1741 agccaccgag taacttgaat cagaactccg cgaataacca ggggtctgtg ctggactacg 1801 gcaatacaaa acccctttct cattacaaag cggactgtgg gcaaggcagc ccggggtctg 1861 gccagagcaa gccagccctg atggcttatc ttccccagca gctgtcccat ataagtcacg 1921 agcagaactc cctgtttctg atgaagccaa agccaggaaa tatgcctttc cgatcactgg 1981 ttccacctgg ccaggagcag aacccttcca gtgtccctgt gcaagcccag gctaccagtg 2041 ttgggaccca gccgcctgcc gtgtccgtgg ccagctccca caacagctcc ccctatctca 2101 gcagccagca acaggccgct gtaatgaagc agcatcagtt gcttttggac caacagaaac 2161 aaagggagca gcagcaaaag catttacagc aacagcagtt ccttcagagg caacagcacc 2221 ttctcgcgga acaggagaag caacagtttc agcgccatct gacccgccca ccaccccagt 2281 accaagaccc gacacaaggc agcttcccac agcaggttgg acagttcaca gggtcctctg 2341 ctgccgtgcc cggcatgaac accttgggtc catccaactc cagctgtcct cgagtgttcc 2401 ctcaggctgg gaatctgatg ccaatgggcc ctggacatgc ttcagtttcc tctctcccca 2461 caaactcagg ccaacaggac cggggtgtgg ctcagttccc tggctcccaa aacatgcctc 2521 agagcagcct ctatggcatg gcttctggca taacccagat agttgcccag cccccgccac 2581 aggccaccaa tggacatgcc cacattccac ggcagaccaa cgtgggccag aacacctccg 2641 tctcagctgc ctatgggcag aactctctgg gaagctctgg cctctcccag cagcacaata 2701 aggggaccct gaaccctggt ttaacaaagc caccggtccc aagggtgtca ccagccatgg 2761 gaggccagaa ttcctcctgg cagcatcagg gaatgccgaa cctcagtggc cagaccccag 2821 ggaacagcaa cgtgagtccc ttcactgcag cctccagttt ccacatgcag cagcaggccc 2881 acctgaaaat gtctagcccg caattctccc aggcagtgcc caacaggccc atggctccca 2941 tgagctcagc agctgccgtg gggtccttgc tacccccagt gagtgcacag cagaggacca 3001 gcgcccctgc cccagcacca cccccaacag cccctcagca gggcttgcct ggcctgagcc 3061 cagcagggcc tgagctgggg gccttcagcc agagccctgc ctcacagatg ggcggtcggg 3121 cggggctgca ctgcacccag gcctaccctg tgcggaccgc gggccaggag ctgccttttg 3181 cctatagcgg gcagccaggt ggcagtgggc tctctagtgt ggctggacac accgatctga 3241 tcgactccct gctgaagaac aggacttcag aggagtggat gagtgatttg gacgacctgt 3301 tagggtctca gtaatggaag gatttgtagt gtttttagtg ttcattcatc ctatattttt 3361 attctcagat tcaaagaaag agcaactact ttggaccaaa agcccatggc ctggggagct 3421 gggcaggtag agcccaagct ccaggtgagg cctggccctg ggcagggtct gtggctgcgc 3481 ccctcaggcc agcagttgag gtccatcggg ctggccccag cccatctgct ggcatcagta 3541 cctggtgttg ggacagcagg atagggttct aaaggtggtt ttctatccaa acgaccaaaa 3601 aaccaacagt aacaccagtg aaaccccaca ctgtcgggct tataaaaatc tgtgccatca 3661 tggtgatttt atccaagact gctccactta ccccagtgct ggggacaagt ttctgttgaa 3721 actttagata gcagaattat ttgcaatttg tagcatagaa aagattttta aattttttta 3781 caaaaggttt ttaaacagat tagggtaggt gatggtttaa atcaattaag tggcattgga 3841 aacctagggt ttccttttga ttaagagcct tttttgtttc tgctctttgt cagctttcag 3901 gggagaagga ggccactgga aaattatttc cctaagtgca ggctgttgac tgcgtatgcc 3961 aaaaagggac aggaggcatg ggatagcagg tctggtgaca cagctagggt cttcctagca 4021 gctcctcctc ctccctccca aggcccccag gaatcccttc ctcccatgtc ctggcagcag 4081 gaccccaggc tacatatgga aggtagagat gtgggggtcc tgtatcctgg agtattatgt 4141 ctccccacct tctgcagttt tctctgaaca tgtatgttgc ccatggtggg agcgtggtca 4201 ctgtgcagtt gtgcacagat gtctttcctt taccgttggc ctttctgtct gcctctcctt 4261 cctctctgca gcccaaatgg aaaacaatta tttactccat tggagggaaa ggaagagtct 4321 tagaattcct aagggaacct tagcataaag gttttgggga aggaggccgt aggccggccc 4381 ggaggaagca attccacttg gtttgacaac ttctgccact cccatgtcag atgacttgca 4441 cttcttaaag agattgcttt ataacactaa gacatccttt ctaaagattc aagtggactt 4501 gactaagctg agggtccacg aaatagaata tgacatgtga gctgtttttg gaaaacgaag 4561 atggagagag cacttccccg taacgaaagc aaagtggtaa gcacagggtg agaccctttt 4621 acacagaatg gtggagagaa aagagaatgc tgaaaagtgg ctcagatgca gagtgttctg 4681 tggagaaact gcagccccac ttctgtttcc ctggagtctc ccaatggatc attcaggagt 4741 gtcctatgtg agaattgagc caaggaaaat actcatgcaa ccagcctgag tcgcggtgag 4801 gggatgagag gttgtacaca cattggtagt tattttgcac cagcagtgcc tttctcactg 4861 ggggtacttg gaccctcaga tcttcttttc taatagccat ttgccacccc aagtggtatg 4921 tcggccattt ctccttaaaa caccttccct acctttccca tgtactcagt ttagctctca 4981 aagaaggggt gaatcataaa gccagtgaaa atttcaccct ctgagggagt tccccaatct 5041 gaaggggaag agggtgacct cagcggcttt tctcccaaaa atcggctgaa ggctggttgt 5101 ggatccttgt tcctctcctg accccatctg gctgctgccc cgtctcccac ccctgtcccc 5161 ggggctcgct ggccctgcac tccgccttag tcctggggcc ggcgacacag tgggggctcc 5221 tcacttgctg cagtgtcata gcaataaaat gtgattcttg gggtcccccc agggagctgc 5281 ccatggcttt atttatgaac ctggttttcg ggagtcaggg gaggagatga ctttgcttct 5341 gtgcacagcc ccgtcttcca ggagccacaa ctcagaagaa aagggtgctc agacttttgt 5401 tatacacatt tgctttgtgt aaataaatgt ttacaatttt atatgaaaga tggaataagc 5461 gctagagctt ccaactgtat attttttact tttatagatt ttaaaactat gatcctttat 5521 atgtgtgttt tgggggagct atgataagtt ttatggcaaa cggttggtat tgttaacttt 5581 ttattgtcat caaaagttca taaaagtcct attaatcccc atattcttct actgccctta 5641 actctggtat acaccaaaaa gaaatcttta ctttccttgt tttatcatta taaaaataaa 5701 gtattttgct agt // LOCUS D83920 1194 bp mRNA PRI 11-JUN-1997 DEFINITION Human uterus mRNA for human ficolin-1, complete cds. ACCESSION D83920 NID g1510126 KEYWORDS ficolin. SOURCE Homo sapiens uterus cDNA to mRNA, clone_lib:cDNA library of human uterus clone:clones 15S7 and 24-3-6. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1194) AUTHORS Harumiya,S. TITLE Direct Submission JOURNAL Submitted (09-MAR-1996) to the DDBJ/EMBL/GenBank databases. Satoru Harumiya, The Cancer Institute, Biochemistry; Kami-Ikebukuro 1-37-1, Toshima-ku, Tokyo 170, Japan (Tel:03-3918-0111, Fax:03-3918-0342) REFERENCE 2 (sites) AUTHORS Harumiya,S., Takeda,K., Sugiura,T., Fukumoto,Y., Tachikawa,H., Miyazono,K., Fujimoto,D. and Ichijo,H. TITLE Characterization of ficolins as novel elastin-binding proteins and molecular cloning of human ficolin-1 JOURNAL J. Biochem. 120 (4), 745-751 (1996) MEDLINE 97103465 FEATURES Location/Qualifiers source 1..1194 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="clones 15S7 and 24-3-6" /clone_lib="cDNA library of human uterus" /tissue_type="uterus" CDS 16..975 /codon_start=1 /evidence=experimental /product="ficolin" /db_xref="PID:d1012794" /db_xref="PID:g1510127" /translation="MARGLAVLLVLFLHIKNLPAQAADTCPEVKVVGLEGSDKLTILR GCPGLPGAPGPKGEAGVIGERGERGLPGAPGKAGPVGPKGDRGEKGMRGEKGDAGQSQ SCATGPRNCKDLLDRGYFLSGWHTIYLPDCRPLTVLCDMDTDGGGWTVFQRRMDGSVD FYRDWAAYKQGFGSQLGEFWLGNDNIHALTAQGSSELRVDLVDFEGNHQFAKYKSFKV ADEAEKYKLVLGAFVGGSAGNSLTGHNNNFFSTKDQDNDVSSSNCAEKFQGAWWYADC HASNLNGLYLMGPHESYANGINWSAAKGYKYSYKVSEMKVRPA" BASE COUNT 278 a 330 c 370 g 216 t ORIGIN 1 ctgagtggag ccaccatggc ccgggggctc gctgtcctgc tagtcttgtt cctgcatatc 61 aagaacctgc ctgcccaggc tgcggacaca tgtccagagg tgaaggtggt gggcctggag 121 ggctctgaca agctcaccat tctccgaggc tgcccggggc tgcccggggc cccagggcca 181 aagggagagg caggtgtcat tggagagaga ggagaacgcg gtctccctgg agcccctgga 241 aaggcaggac cagtggggcc caaaggagac cgaggagaga aggggatgcg tggagagaaa 301 ggagacgctg ggcagtctca gtcgtgtgcg acaggcccac gcaactgcaa ggacctgcta 361 gaccgggggt atttcctgag cggctggcac accatctacc tgcccgactg ccggcccctg 421 actgtgctct gtgacatgga cacggacgga gggggctgga ccgttttcca gcggaggatg 481 gatggctctg tggacttcta tcgggactgg gccgcataca agcagggctt cggcagtcag 541 ctgggggagt tctggctggg gaacgacaac atccacgccc tgactgccca gggaagcagc 601 gagctccgtg tagacctggt ggactttgag ggcaaccacc agtttgctaa gtacaaatca 661 ttcaaggtgg ctgacgaggc agagaagtac aagctggtac tgggagcctt tgtcgggggc 721 agtgcgggta attctctaac gggccacaac aacaacttct tctccaccaa agaccaagac 781 aatgatgtga gttcttcgaa ttgtgctgag aagttccagg gagcctggtg gtacgccgac 841 tgtcatgctt caaacctcaa tggtctctac ctcatgggac cccatgagag ctatgccaat 901 ggtatcaact ggagtgcggc gaaggggtac aaatatagct acaaggtgtc agagatgaag 961 gtgcggcccg cctagacggg ccaggacccc tccacatgca cctgctagtg gggaggccac 1021 acccacaagc gctgcgtcgt ggaagtcacc ccatttcccc agccagacac actcccatga 1081 cgcccacagc tgcccctttg cccccagctc agtcaagccg ccacatgccc acaacctcac 1141 cagagggaga attatgtttc taaatatgtt tactttggga cagaaaaaaa aaaa // LOCUS D84064 2896 bp mRNA PRI 12-NOV-1997 DEFINITION Homo sapiens mRNA for Hrs, complete cds. ACCESSION D84064 NID g2618587 KEYWORDS Hrs; 115-kDa tyrosine kinase substrate. SOURCE Homo sapiens placenta cDNA to mRNA, clone:phHrs. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2896) AUTHORS Lu,L., Komada,M., Yoshida,N. and Kitamura,N. TITLE Cloning of the cDNA and chromosome mapping of the gene for human Hrs, a 115-kDa tyrosine kinase substrate JOURNAL Unpublished (1996) REFERENCE 2 (bases 1 to 2896) AUTHORS Komada,M. TITLE Direct Submission JOURNAL Submitted (13-MAR-1996) to the DDBJ/EMBL/GenBank databases. Masayuki Komada, Kansai Medical University, Institute for Liver Research; Fumizono-cho 10-15, Moriguchi, Osaka 570, Japan (Tel:06-992-1001(ex.2535), Fax:06-994-6099) FEATURES Location/Qualifiers source 1..2896 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="phHrs" /tissue_type="placenta" CDS 61..2394 /function="tyrosine phosphorylated protein in growth factor-stimulated cells" /note="zinc finger protein" /codon_start=1 /product="Hrs" /db_xref="PID:g2618588" /translation="MGRGSGTFERLLDKATSQLLLETDWESILQICDLIRQGDTQAKY AVNSIKKKVNDKNPHVALYALEVMESVVKNCGQTVHDEVANKQTMEELKDLLKRQVEV NVRNKILYLIQAWAHAFRNEPKYKVVQDTYQIMKVEGHVFPEFKESDAMFAAERAPDW VDAEECHRCRVQFGVMTRKHHCRACGQIFCGKCSSKYSTIPKFGIEKEVRVCEPCYEQ LNRKAEGKATSTTELPPEYLTSPLSQQSQLPPKRDETALQEEEELQLALALSQSEAEE KERLRQKSTYTSYPKAEPMPSASSAPPASSLYSSPVNSSAPLAEDIDPELARYLNRNY WEKKQEEARKSPTPSAPVPLTEPAAQPGEGHAAPTNVVENPLPETDSQPIPPSGGPFS EPQFHNGESEESHEQFLKALQNAVTTFVNRMKSNHMRGRSITNDSAVLSLFQSINGMH PQLLELLNQLDERRLYYEGLQDKLAQIRDARGALSALREEHREKLRRAAEEAERQRQI QLAQKLEIMRQKKQEYLEVQRQLAIQRLQEQEKERQMRLEQQKQTVQMRAQMPAFPLP YAQLQAMPAAGGVLYQPSGPASFPSTFSPAGSVEGSPMHGVYMSQPAPAAGPYPSMPS TAADPSMVSAYMYPAGATGAQAAPQAQAGPTASPAYSSYQPTPTAGYQNVASQAPQSL PAISQPPQSSTMGYMGSQSVSMGYQPYNMQNLMTTLPSQDASLPPQQPYIAGQQPMYQ QMAPSGGPPQQQPPVAQQPQAQGPPAQGSEAQLISFD" BASE COUNT 602 a 960 c 869 g 465 t ORIGIN 1 gggcgcgcca gctcgtagca ggggagcgcc cgcggcgtcg ggtttgggct ggaggtcgcc 61 atggggcgag gcagcggcac cttcgagcgt ctcctagaca aggcgaccag ccagctcctg 121 ttggagacag attgggagtc cattttgcag atctgcgacc tgatccgcca aggggacaca 181 caagcaaaat atgctgtgaa ttccatcaag aagaaagtca acgacaagaa cccacacgtc 241 gccttgtatg ccctggaggt catggaatct gtggtaaaga actgtggcca gacagttcat 301 gatgaggtgg ccaacaagca gaccatggag gagctgaagg acctgctgaa gagacaagtg 361 gaggtaaacg tccgtaacaa gatcctgtac ctgatccagg cctgggcgca tgccttccgg 421 aacgagccca agtacaaggt ggtccaggac acctaccaga tcatgaaggt ggaggggcac 481 gtctttccag aattcaaaga gagcgatgcc atgtttgctg ccgagagagc cccagactgg 541 gtggacgctg aggaatgcca ccgctgcagg gtgcagttcg gggtgatgac ccgtaagcac 601 cactgccggg cgtgtgggca gatattctgt ggaaagtgtt cttccaagta ctccaccatc 661 cccaagtttg gcatcgagaa ggaggtgcgc gtgtgtgagc cctgctacga gcagctgaac 721 aggaaagcgg agggaaaggc cacttccacc actgagctgc cccccgagta cctgaccagc 781 cccctgtctc agcagtccca gctgcccccc aagagggacg agacggccct gcaggaggag 841 gaggagctgc agctggccct ggcgctgtca cagtcagagg cggaggagaa ggagaggctg 901 agacagaagt ccacgtacac ttcgtacccc aaggcggagc ccatgccctc ggcctcctca 961 gcgccccccg ccagcagcct gtactcttca cctgtgaact cgtcggcgcc tctggctgag 1021 gacatcgacc ctgagctcgc acggtatctc aaccggaact actgggagaa gaagcaggag 1081 gaggctcgca agagccccac gccatctgcg cccgtgcccc tgacggagcc ggctgcacag 1141 cctggggaag ggcacgcagc ccccaccaac gtggtggaga accccctccc ggagacagac 1201 tctcagccca ttcctccctc tggtggcccc tttagtgagc cacagttcca caatggcgag 1261 tctgaggaga gccacgagca gttcctgaag gcgctgcaga acgccgtcac caccttcgtg 1321 aaccgcatga agagtaacca catgcggggc cgcagcatca ccaatgactc ggccgtgctc 1381 tcactcttcc agtccatcaa cggcatgcac ccgcagctgc tggagctgct caaccagctg 1441 gacgagcgca ggctgtacta tgaggggctg caggacaagc tggcacagat ccgcgatgcc 1501 cggggggcgc tgagtgccct gcgcgaagag caccgggaga agcttcgccg ggcagccgag 1561 gaggcagagc gccagcgcca gatccagctg gcccagaagc tggagataat gcggcagaag 1621 aagcaggagt acctggaggt gcagaggcag ctggccatcc agcgcctgca ggagcaggag 1681 aaggagcggc agatgcggct ggagcagcag aagcagacgg tccagatgcg cgcgcagatg 1741 cccgccttcc ccctgcccta cgcccagctc caggccatgc ccgcagccgg aggtgtgctc 1801 taccagccct cgggaccagc cagcttcccc agcaccttca gccctgccgg ctcggtggag 1861 ggctccccaa tgcacggcgt gtacatgagc cagccggccc ctgccgctgg cccctacccc 1921 agcatgccca gcactgcggc tgatcccagc atggtgagtg cctacatgta cccagcaggg 1981 gccactgggg cgcaggcggc cccccaggcc caggccggac ccaccgccag ccccgcttac 2041 tcatcctacc agcctactcc cacagcgggc taccagaacg tggcctccca ggccccacag 2101 agcctcccgg ccatctctca gcctccgcag tccagcacca tgggctacat ggggagccag 2161 tcagtctcca tgggctacca gccttacaac atgcagaatc tcatgaccac cctcccaagc 2221 caggatgcgt ctctgccacc ccagcagccc tacatcgcgg ggcagcagcc catgtaccag 2281 cagatggcac cctctggcgg tcccccccag cagcagcccc ccgtggccca gcaaccgcag 2341 gcacaggggc cgccggcaca gggcagcgag gcccagctca tttcattcga ctgacccagg 2401 ccatgctcac gtccggagta acactacata cagttcacct gaaacgcctc gtctctaact 2461 gccgtcgtcc tgcctccctg tcctctactg ccggtagtgt cccttctctg cgagtgaggg 2521 ggggccttca ccccaagccc acctcccttg tcctcagcct actgcagtcc ctgagttagt 2581 ctctgctttc tttccccagg gctgggccat ggggagggaa ggactttctc ccaggggaag 2641 cccccagccc tgtgggtcat ggtctgtgag aggtggcagg aatggggacc ctcacccccc 2701 aagcagcctg tgccctctgg ccgcactgtg agctggctgt ggtgtctggg tgtggcctgg 2761 ggctccctct gcaggggcct ctctcggcag ccacagccaa gggtggaggc ttcaggtctc 2821 cagcttctct gcttctcagc tgccatctcc agtgccccag aatggtacag cgataataaa 2881 atgtatttca gaaagg // LOCUS D84103 4263 bp mRNA PRI 25-OCT-1996 DEFINITION Human fetus brain mRNA for mitochondrial DNA polymerase gamma, complete cds. ACCESSION D84103 NID g1644238 KEYWORDS mitochondrial DNA polymerase gamma. SOURCE Homo sapiens fetus brain cDNA to mRNA, clone:GEN-404D04. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4263) AUTHORS Watanabe,T.K., Shimizu,F., Nishino,N., Fujiwara,T., Kanemoto,N., Suzuki,M., Nakamura,Y., Hirai,Y., Maekawa,H. and Takahashi,E. TITLE Molecular cloning of a human DNA polymerase gamma JOURNAL Unpublished (1996) REFERENCE 2 (bases 1 to 4263) AUTHORS Watanabe,T.K. TITLE Direct Submission JOURNAL Submitted (17-MAR-1996) to the DDBJ/EMBL/GenBank databases. Takeshi K Watanabe, Otsuka GEN Research Institute, Structural Analysis; 463-10, Kagasuno Kawauchi-cho, Tokushima, Tokushima 771-01, Japan (E-mail:kuga@po.iijnet.or.jp, Tel:+81-886-65-2888, Fax:+81-886-37-1035) FEATURES Location/Qualifiers source 1..4263 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="15" /clone="GEN-404D04" /dev_stage="fetus" /tissue_type="brain" CDS 101..3826 /codon_start=1 /product="mitochondrial DNA polymerase gamma" /db_xref="PID:d1012898" /db_xref="PID:g1644239" /translation="MSRLLWRKVAGATVGPGPVPAPGRWVSSSVPASDPSDGQRRRQQ QQQQQQQQQQQQQPQQPQVLSSEGGQLRHNPLDIQMLSRGLHEQIFGQGGEMPGEAAV RRSVEHLQKHGLWGQPAVPLPDVELRLPPLYGDNLDQHFRLLAQKQSLPYLEAANLLL QAQLPPKPPAWAWAEGWTRYGPEGEAVPVAIPEERALVFDVEVCLAEGTCPTLAVAIS PSAWYSWCSQRLVEERYSWTSQLSPADLIPLEVPTGASSPTQRDWQEQLVVGHNVSFD RAHIREQYLIQGSRMRFLDTMSMHMAISGLSSFQRSLWIAAKQGKHKVQPPTKQGQKS QRKARRGPAISSWDWLDISSVNSLAEVHRLYVGGPPLEKEPRELFVKGTMKDIRENFQ DLMQYCAQDVWATHEVFQQQLPLFLERCPHPVTLAGMLEMGVSYLPVNQNWERYLAEA QGTYEELQREMKKSLMDLANDACQLLSGERYKEDPWLWDLEWDLQEFKQKKAKKVKKE PATASKLPIEGAGAPGDPMDQEDLGPCSEEEEFQQDVMARACLQKLKGTTELLPKRPQ HLPGHPGWYRKLCPRLDDPAWTPGPSLLSLQMRVTPKLMALTWDGFPLHYSERHGWGY LVPGRRDNLAKLPTGTTLESAGVVCPYRAIESLYRKHCLEQGKQQLMPQEAGLAEEFL LTDNSAIWQTVEELDYLEVEAEAKMENLRAAVPGQPLALTARGGPKDTQPSYHHGNGP YNDVDIPGCWFFKLPHKDGNSCNVGSPFAKDFLPKMEDGTLQAGPGGASGPRALEINK MISFWRNAHKRISSQMVVWLPRSALPRAVIRHPDYDEEGLYGAILPQVVTAGTITRRA VEPTWLTASNARPDRVGSELKAMVQAPPGYTLVGADVDSQELWIAAVLGDAHFAGMHG CTAFGWMTLQGRKSRGTDLHSKTATTVGISREHAKIFNYGRIYGAGQPFAERLLMQFN HRLTQQEAAEKAQQMYAATKGLRWYRLSDEGEWLVRELNLPVDRTEGGWISLQDLRKV QRETARKSQWKKWEVVAERAWKGGTESEMFNKLESIATSDIPRTPVLGCCISRALEPS AVQEEFMTSRVNWVVQSSAVDYLHLMLVAMKWLFEEFAIDGRFCISIHDEVRYLVREE DRYRAALALQITNLLTRCMFAYKLGLNDLPQSVAFFSAVDIDRCLRKEVTMDCKTPSN PTGMERRYGIPQGEALDIYQIIELTKGSLEKRSQPGP" BASE COUNT 938 a 1213 c 1302 g 810 t ORIGIN 1 ggacgtgtct ctctccacgt cttccagcca gtaaaagaag ccaagctgga gcccaaagcc 61 aggtgttctg actcccagcg tgggggtccc tgcaccaacc atgagccgcc tgctctggag 121 gaaggtggcc ggcgccaccg tcgggccagg gccggttcca gctccggggc gctgggtctc 181 cagctccgtc cccgcgtccg accccagcga cgggcagcgg cggcggcagc agcagcagca 241 gcagcagcag cagcagcagc agcaacagca gcctcagcag ccgcaagtgc tatcctcgga 301 gggcgggcag ctgcggcaca acccattgga catccagatg ctctcgagag ggctgcacga 361 gcaaatcttc gggcaaggag gggagatgcc tggcgaggcc gcggtgcgcc gcagcgtcga 421 gcacctgcag aagcacgggc tctgggggca gccagccgtg cccttgcccg acgtggagct 481 gcgcctgccg cccctctacg gggacaacct ggaccagcac ttccgcctcc tggcccagaa 541 gcagagcctg ccctacctgg aggcggccaa cttgctgttg caggcccagc tgcccccgaa 601 gcccccggct tgggcctggg cggagggctg gacccggtac ggccccgagg gggaggccgt 661 acccgtggcc atccccgagg agcgggccct ggtgttcgac gtggaggtct gcttggcaga 721 gggaacttgc cccacattgg cggtggccat atccccctcg gcctggtatt cctggtgcag 781 ccagcggctg gtggaagagc gttactcttg gaccagccag ctgtcgccgg ctgacctcat 841 ccccctggag gtccctactg gtgccagcag ccccacccag agagactggc aggagcagtt 901 agtggtgggg cacaatgttt cctttgaccg agctcatatc agggagcagt acctgatcca 961 gggttcccgc atgcgtttcc tggacaccat gagcatgcac atggccatct cagggctaag 1021 cagcttccag cgcagtctgt ggatagcagc caagcagggc aaacacaagg tccagccccc 1081 cacaaaacaa ggccagaagt cccagaggaa agccagaaga ggcccagcga tctcatcctg 1141 ggactggctg gacatcagca gtgtcaacag tctggcagag gtgcacagac tttatgtagg 1201 ggggcctccc ttagagaagg agcctcgaga actgtttgtg aagggcacca tgaaggacat 1261 tcgtgagaac ttccaggacc tgatgcagta ctgtgcccag gacgtgtggg ccacccatga 1321 ggttttccag cagcagctac cgctcttctt ggagaggtgt ccccacccag tgactctggc 1381 cggcatgctg gagatgggtg tctcctacct gcctgtcaac cagaactggg agcgttacct 1441 ggcagaggca cagggcactt atgaggagct ccagcgggag atgaagaagt cgttgatgga 1501 tctggccaat gatgcctgcc agctgctctc aggagagagg tacaaagaag acccctggct 1561 ctgggacctg gagtgggacc tgcaagaatt taagcagaag aaagctaaga aggtgaagaa 1621 ggaaccagcc acagccagca agttgcccat cgagggggct ggggcccctg gtgatcccat 1681 ggatcaggaa gacctcggcc cctgcagtga ggaggaggag tttcaacaag atgtcatggc 1741 ccgcgcctgc ttgcagaagc tgaaggggac cacagagctc ctgcccaagc ggccccagca 1801 ccttcctgga caccctggat ggtaccggaa gctctgcccc cggctagacg accctgcatg 1861 gaccccgggc cccagcctcc tcagcctgca gatgcgggtc acacctaaac tcatggcact 1921 tacctgggat ggcttccctc tgcactactc agagcgtcat ggctggggct acttggtgcc 1981 tgggcggcgg gacaacctgg ccaagctgcc gacaggtacc accctggagt cagctggggt 2041 ggtctgcccc tacagagcca tcgagtccct gtacaggaag cactgtctcg aacaggggaa 2101 gcagcagctg atgccccagg aggccggcct ggcggaggag ttcctgctca ctgacaatag 2161 tgccatatgg caaacggtag aagaactgga ttacttagaa gtggaggctg aggccaagat 2221 ggagaacttg cgagctgcag tgccaggtca acccctagct ctgactgccc gtggtggccc 2281 caaggacacc cagcccagct atcaccatgg caatggacct tacaacgacg tggacatccc 2341 tggctgctgg tttttcaagc tgcctcacaa ggatggtaat agctgtaatg tgggaagccc 2401 ctttgccaag gacttcctgc ccaagatgga ggatggcacc ctgcaggctg gcccaggagg 2461 tgccagtggg ccccgtgctc tggaaatcaa caaaatgatt tctttctgga ggaacgccca 2521 taaacgtatc agctcccaga tggtggtgtg gctgcccagg tcagctctgc cccgtgctgt 2581 gatcaggcac cccgactatg atgaggaagg cctctatggg gccatcctgc cccaagtggt 2641 gactgccggc accatcactc gccgggctgt ggagcccaca tggctcaccg ccagcaatgc 2701 ccggcctgac cgagtaggca gtgagttgaa agccatggtg caggccccac ctggctacac 2761 ccttgtgggt gctgatgtgg actcccaaga gctgtggatt gcagctgtgc ttggagacgc 2821 ccactttgcc ggcatgcatg gctgcacagc ctttgggtgg atgacactgc agggcaggaa 2881 gagcaggggc actgatctac acagtaagac agccactact gtgggcatca gccgtgagca 2941 tgccaaaatc ttcaactacg gccgcatcta tggtgctggg cagccctttg ctgagcgctt 3001 actaatgcag tttaaccacc ggctcacaca gcaggaggca gctgagaagg cccagcagat 3061 gtacgctgcc accaagggcc tccgctggta tcggctgtcg gatgagggcg agtggctggt 3121 gagggagttg aacctcccag tggacaggac tgagggtggc tggatttccc tgcaggatct 3181 gcgcaaggtc cagagagaaa ctgcaaggaa gtcacagtgg aagaagtggg aggtggttgc 3241 tgaacgggca tggaaggggg gcacagagtc agaaatgttc aataagcttg agagcattgc 3301 tacgtctgac ataccacgta ccccggtgct gggctgctgc atcagccgag ccctggagcc 3361 ctcggctgtc caggaagagt ttatgaccag ccgtgtgaat tgggtggtac agagctctgc 3421 tgttgactac ttacacctca tgcttgtggc catgaagtgg ctgtttgaag agtttgccat 3481 agatgggcgc ttctgcatca gcatccatga cgaggttcgc tacctggtgc gggaggagga 3541 ccgctaccgc gctgccctgg ccttgcagat caccaacctc ttgaccaggt gcatgtttgc 3601 ctacaagctg ggtctgaatg acttgcccca gtcagtcgcc tttttcagtg cagtcgatat 3661 tgaccggtgc ctcaggaagg aagtgaccat ggattgtaaa accccttcca acccaactgg 3721 gatggaaagg agatacggga ttccccaggg tgaagcgctg gatatttacc agataattga 3781 actcaccaaa ggctccttgg aaaaacgaag ccagcctgga ccatagcact gcctggaggc 3841 tctgtatttg ctcccgtgga gcttcatcgg ggtgggtgca ggctcccaaa ctcaggcttt 3901 cagctgtgct ttttgcaaaa cggcttgcct aaggccagcc atttttcagt agcaggacct 3961 gccaagaaga ttccttctaa ctgaaggtgc agttgaattc agtgggttca gaaccaagat 4021 gccaacatcg gtgtggacta caggacaagg ggcattgttg cttgttgggt aaaaatgaag 4081 cagaagcccc aaagttcaca ttaactcagg catttcattt attttttcct tttcttcttg 4141 gctggttctt tgttctgtcc cccatgctct gatgcagtgc cctagaaggg gaaagaatta 4201 atgctctaac gtgataaacc tgctccaagg cagtggaaat aaaaagaagg aaaaaaaaga 4261 ctc // LOCUS D84107 1388 bp mRNA PRI 12-JUN-1997 DEFINITION Human mRNA for RBP-MS/type 1, complete cds. ACCESSION D84107 NID g1669546 KEYWORDS RBP-MS; RBP-MS/type 1; alternative splicing; Werner syndrome. SOURCE Homo sapiens cell_line:embryonic carcinoma, NEC14 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1388) AUTHORS Shimamoto,A. TITLE Direct Submission JOURNAL Submitted (21-MAR-1996) to the DDBJ/EMBL/GenBank databases. Akira Shimamoto, AGENE Research Institute Co., Ltd.; 200 Kajiwara, Kamakura, Kanagawa 247, Japan (E-mail:akirashi@sh0.po.iijnet.or.jp, Tel:0467-46-4971, Fax:0467-48-6595) REFERENCE 2 (bases 1 to 1388) AUTHORS Shimamoto,A., Kitao,S., Ichikawa,K., Suzuki,N., Yamabe,Y., Imamura,O., Tokutake,Y., Satoh,M., Matsumoto,T., Kuromitsu,J., Kataoka,H., Sugawara,K., Sugawara,M., Sugimoto,M., Goto,M. and Furuichi,Y. TITLE A unique human gene that spans over 230 kb in the human chromosome 8p11-12 and codes multiple family proteins sharing RNA-binding motifs JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (20), 10913-10917 (1996) MEDLINE 97008106 FEATURES Location/Qualifiers source 1..1388 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="embryonic carcinoma, NEC14" /chromosome="8" /map="8q11.2-12.1" 5'UTR 1..566 exon 1..647 /number=1 gene 567..1157 /gene="RBP-MS" CDS 567..1157 /gene="RBP-MS" /standard_name="RNA-binding protein gene with multiple splicing" /note="alternative splicing (see also D84107-D84111)" /codon_start=1 /product="RBP-MS/type 1" /db_xref="PID:d1012900" /db_xref="PID:g1669547" /translation="MNNGGKAEKENTPSEANLQEEEVRTLFVSGLPLDIKPRELYLLF RPFKGYEGSLIKLTSKQPVGFVSFDSRSEAEAAKNALNGIRFDPEIPQTLRLEFAKAN TKMAKNKLVGTPNPSTPLPNTVPQFIAREPYELTVPALYPSSPEVWAPYPLYPAELAP ALPPPAFTYPASLHAQMRWLPPSEATSQGWKSRQFC" exon 648..706 /gene="RBP-MS" /number=2 exon 707..756 /gene="RBP-MS" /number=3 exon 757..812 /gene="RBP-MS" /number=4 exon 813..963 /gene="RBP-MS" /number=5 exon 964..1094 /gene="RBP-MS" /number=7 exon 1095..1164 /number=12 exon 1165..1265 /number=14 exon 1266..1388 /number=15 BASE COUNT 252 a 510 c 320 g 306 t ORIGIN 1 atcccggact tcccagagcc tgcctggagc gcgtactcag cggctctcgg gtcccagcgt 61 cccagccgcg gcccgcgctc ctccgccccg ctcctcctcc tcctcttcct cctcctcctc 121 ctctctaggc acccccgtcc cctccttcca gcggctgcag cccccagccc caactctccg 181 cgcttactcc tgggacgcgc gtcctcgccc catcctttgc ttccttcctt ccttccttct 241 tccttcctcc cctggctccc gccctccctc tccaggtcgc cctcccgggg cccgattgtc 301 tcggttcccc gctgccggcc cgcgccctgc cccgtctctc ccttgcactt cctgagtcgc 361 ccgccgccgc cgtcgcagac tcgccgcggg agccccagcc caacccgagc ccgacagcca 421 ctgccccggc tccagctcca gccccacagc ccgcggcgcc cgcccgaggg agccccggcg 481 cccggggaag gctccagtgg gctagcgcgc cctcgcccag ccccgcgccc cagccctgcc 541 cggcccggcg aggaaggacc gggaagatga acaacggcgg caaagccgag aaggagaaca 601 ccccgagcga ggccaacctt caggaggagg aggtccggac cctatttgtc agtggccttc 661 ctctggatat caaacctcgg gagctctatc tgcttttcag accatttaag ggctatgagg 721 gttctcttat aaagctcaca tctaaacagc ctgtaggttt tgtcagtttt gacagtcgct 781 cagaagcaga ggctgcaaag aatgctttga atggcatccg cttcgatcct gaaattccgc 841 aaacactacg actagagttt gctaaggcaa acacgaagat ggccaagaac aaactcgtag 901 ggactccaaa ccccagtact cctctgccca acactgtacc tcagttcatt gccagagagc 961 catatgagct cacagtgcct gcactttacc ccagtagccc tgaagtgtgg gccccgtacc 1021 ctctgtaccc agcggagtta gcgcctgctc tacctcctcc tgctttcacc tatcccgctt 1081 cactgcatgc ccagatgcgc tggctccctc cctccgaggc tacttctcag ggctggaagt 1141 cccgtcagtt ctgctgaata ctatgtccca ggtgtgtgat ggcggctgca atctgtcttg 1201 tgggtattaa tgcaatcttc agtggtggct actgttctct agctgttcta caaaactgga 1261 gcatgctggc ttgaaaaacc cttgcccagt ttggatccct tcaagacttt gtcacagcct 1321 ctatcacaca tctgtttttc tcgaagaaaa aaatataatt aataaaaatg ttttactctt 1381 ttacactg // LOCUS D84145 1023 bp mRNA PRI 19-MAY-1997 DEFINITION Human WS-3 mRNA, complete cds. ACCESSION D84145 NID g2114143 KEYWORDS WS-3. SOURCE Homo sapiens skin Fibroblast cell_line:primary cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1023) AUTHORS Ichikawa,K. TITLE Direct Submission JOURNAL Submitted (19-MAR-1996) to the DDBJ/EMBL/GenBank databases. Koji Ichikawa, AGENE Research Institute Co., Ltd.; 200 kajiwara, Kamakura, Kanagawa 247, Japan (E-mail:ichikawk@po.iijnet.or.jp, Tel:0467-46-4815(ex.3869), Fax:0467-48-6595) REFERENCE 2 (sites) AUTHORS Ichikawa,K. TITLE Isolation and Characterization of a Novel Gene, WS-3, obtained from Werner Syndrome Region (8p11.2-p12) JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Ichikawa,K., Yamabe,Y., Imamura,O., Kuromitsu,J., Sugawara,K., Suzuki,N., Shimamoto,A., Matsumoto,T., Tokutake,Y., Kitao,S., Kataoka,H., Satoh,M., Sugimoto,M., Goto,M., Sugawara,M. and Furuichi,Y. TITLE Cloning and characterization of a novel gene, WS-3, in human chromosome 8p11-p12 JOURNAL Gene 189 (2), 277-287 (1997) MEDLINE 97311421 FEATURES Location/Qualifiers source 1..1023 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="primary" /cell_type="Fibroblast" /chromosome="8" /map="8p11.2-12" /tissue_type="skin" 5'UTR 1..87 gene 88..660 /gene="WS-3" CDS 88..660 /gene="WS-3" /codon_start=1 /db_xref="PID:d1020778" /db_xref="PID:g2114144" /translation="MAEKTQKSVKIAPGAVVCVESEIRGDVTIGPRTVIHPKARIIAE AGPIVIGEGNLIEEQALIINAYPDNITPDTEDPEPKPMIIGTNNVFEVGCYSQAMKMG DNNVIESKAYVGRNVILTSGCIIGACCNLNTFEVIPENTVIYGADCLRRVQTERPQPQ TLQLDFLMKILPNYHHLKKTMKGSSTPVKN" 3'UTR 661..1023 polyA_signal 1004..1009 BASE COUNT 326 a 195 c 211 g 291 t ORIGIN 1 caaccctgcc aggctctcca atagcatgtg gaattatcgc tctacccagg cggtggtgtc 61 gatttacgtt ccaattgggg ccgtaccatg gcggagaaga ctcaaaagag tgtgaagatt 121 gctcctggag cagttgtatg tgtagaaagt gaaatcagag gagatgtaac tatcggacct 181 cggacagtga tccaccctaa agcaagaatt attgcggaag ccgggccaat agtgattggc 241 gaagggaacc taatagaaga acaggccctt atcataaatg cttacccaga taatatcact 301 cctgacactg aagatccaga accaaaacct atgatcattg gcaccaataa tgtgtttgaa 361 gttggctgtt attcccaagc catgaagatg ggagataata atgtcattga atcaaaagca 421 tatgtaggca gaaatgtaat attgacaagt ggctgcatca ttggggcttg ttgcaaccta 481 aatacatttg aagtcatccc tgagaatacg gtgatctatg gtgcagactg ccttcgtcgg 541 gtgcagactg agcgaccgca gccccagaca ctacagctgg atttcttgat gaaaatcttg 601 ccaaattacc accacctaaa gaagactatg aaaggaagct caactccagt aaagaactaa 661 gaacagtgta taacatgaag ataacatttt gtctttgacc actgtctttt gaatgggccc 721 acagtgttta tgtactctta acaactcaca gaataataca tgttcacttt attttgtaaa 781 attgggttga gaggaaacta atggagtttc attgtaactg tcctttgtaa tttatataaa 841 tgtattattt tcctatatcc ttggttcttt tctgataatt tacagattta gcttttcttt 901 tgttatataa actgctagcc acaaatttta gttatgtaaa aggctaccct tgacaagaaa 961 agacatactg tcatgtattt atattctagc atagactaaa ctgaataaaa atgctgataa 1021 cag // LOCUS D84212 2033 bp mRNA PRI 21-NOV-1997 DEFINITION Homo sapiens mRNA for aurora/IPL1-related kinase, complete cds. ACCESSION D84212 NID g2641947 KEYWORDS aurora/IPL1-related kinase. SOURCE Homo sapiens blood B-cell cDNA to mRNA, clone_lib:lgt11. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Kimura,M., Kotani,S., Hattori,T., Sumi,N., Yoshioka,T., Todokoro,K. and Okano,Y. TITLE Cell cycle-dependent expression and spindle pole localization of a novel human protein kinase, Aik, related to Aurora of Drosophila and yeast Ipl1 JOURNAL J. Biol. Chem. 272 (21), 13766-13771 (1997) MEDLINE 97298083 REFERENCE 2 (bases 1 to 2033) AUTHORS Kimura,M. TITLE Direct Submission JOURNAL Submitted (25-MAR-1996) to the DDBJ/EMBL/GenBank databases. Masashi Kimura, Gifu University, Dpt. of Molecular Pathobiochemistry; Tsukasamachi-40, Gifu 500, Japan (E-mail:bunbyo@cc.gifu-u.ac.jp, Tel:058-267-2368, Fax:058-267-2950) FEATURES Location/Qualifiers source 1..2033 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="B-cell" /clone_lib="lgt11" /tissue_type="blood" CDS 170..1378 /note="aik" /codon_start=1 /product="aurora/IPL1-related kinase" /db_xref="PID:d1024471" /db_xref="PID:g2641948" /translation="MDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQA QRVLCPSNSSQRVPLQAQKLVSSHKPVQNQKQKQLQATSVPHPVSRPLNNTQKSKQPL PSHLKIILRRNWHQNRKMKNQKEAVALEDFEIGRPLGKGKFGNVYLAREKQSKFILAL KVLFKAQLEKAGVEHQLRREVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVY RELQKLSKFDEQRTANLYNRIANALSYCHSKRVIHRDIKPENLLLGSAGELKIADFGW SVHAPSSRRTTLCGTLDYLPPEMIEGRMHDEKVDLWSLGVLCYEFLVGKPPFEANTYQ ETYKRISRVEFTFPDFVTEGARDLISRLLKHNPSQRPMLREVLEHPWITANSSKPSNC QNKESASKQS" BASE COUNT 586 a 448 c 470 g 529 t ORIGIN 1 gaattccggg actgagctct tgaagacttg ggtccttggt cgcaggtgga gcgacgggtc 61 tcactccatt gcccaggcca gagtgcggga tatttgataa gaaacttcag tgaaggccgg 121 gcgcggtgct catgcccgta atcccagcat tttcggaggc cgaggcatca tggaccgatc 181 taaagaaaac tgcatttcag gacctgttaa ggctacagct ccagttggag gtccaaaacg 241 tgttctcgtg actcagcaat ttccttgtca gaatccatta cctgtaaata gtggccaggc 301 tcagcgggtc ttgtgtcctt caaattcttc ccagcgcgtt cctttgcaag cacaaaagct 361 tgtctccagt cacaagccgg ttcagaatca gaagcagaag caattgcagg caaccagtgt 421 acctcatcct gtctccaggc cactgaataa cacccaaaag agcaagcagc ccctgccatc 481 gcacctgaaa ataatcctga ggaggaactg gcatcaaaac agaaaaatga agaatcaaaa 541 agaggcagtg gctttggaag actttgaaat tggtcgccct ctgggtaaag gaaagtttgg 601 taatgtttat ttggcaagag aaaagcaaag caagtttatt ctggctctta aagtgttatt 661 taaagctcag ctggagaaag ccggagtgga gcatcagctc agaagagaag tagaaataca 721 gtcccacctt cggcatccta atattcttag actgtatggt tatttccatg atgctaccag 781 agtctaccta attctggaat atgcaccact tggaacagtt tatagagaac ttcagaaact 841 ttcaaagttt gatgagcaga gaactgctaa cttatataac agaattgcaa atgccctgtc 901 ttactgtcat tcgaagagag ttattcatag agacattaag ccagagaact tacttcttgg 961 atcagctgga gagcttaaaa ttgcagattt tgggtggtca gtacatgctc catcttccag 1021 gaggaccact ctctgtggca ccctggacta cctgccccct gaaatgattg aaggtcggat 1081 gcatgatgag aaggtggatc tctggagcct tggagttctt tgctatgaat ttttagttgg 1141 gaagcctcct tttgaggcaa acacatacca agagacctac aaaagaatat cacgggttga 1201 attcacattc cctgactttg taacagaggg agccagggac ctcatttcaa gactgttgaa 1261 gcataatccc agccagaggc caatgctcag agaagtactt gaacacccct ggatcacagc 1321 aaattcatca aaaccatcaa attgccaaaa caaagaatca gctagcaaac agtcttagga 1381 atcgtgcagg gggagaaatc cttgagccag ggctgccata taacctgaca ggaacatgct 1441 actgaagttt attttaccat tgactgctgc cctcaatcta gaacgctaca caagaaatat 1501 tttgttttta ctcagcaggt gtgccttaac ctccctattc agaaagctcc acatcaataa 1561 acatgacact ctgaagtgaa agtagccacg agaattgtgc tacttatact ggaacataat 1621 ctggaggcaa ggttcgactg cagtcgaacc ttgcctccag attatgaacc agtataagta 1681 gcacaattct cgtggctact ttcacttcag agtgtcatgt ttattgatgt ggagctttct 1741 gaatagggag gttaaggcac acctgctgag taaaacaaat atttcttgtg tagcgttctt 1801 aggaatctgg tgtctgtccg gccccggtag gcctgttggg tttctagtcc tccttaccat 1861 catctccata tgagagtgtg aaaataggaa cacgtgctct acctccattt agggatttgc 1921 ttgggataca gaagaggcca tgtgtctcag agctgttaag ggcttatttt tttaaaacat 1981 tggagtcata gcatgtgtgt aaactttaaa tatgcaggcc ttcgtggctc gag // LOCUS D84239 16382 bp mRNA PRI 10-APR-1997 DEFINITION Human mRNA for IgG Fc binding protein, complete cds. ACCESSION D84239 NID g1944351 KEYWORDS IgG Fc binding protein. SOURCE Homo sapiens colon epithelial cell.. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 16382) AUTHORS Morikawa,M. TITLE Direct Submission JOURNAL Submitted (28-MAR-1996) to the DDBJ/EMBL/GenBank databases. Minoru Morikawa, Chugai pharmaceutical co., ltd., Central research labs; 41-8, Takada 3-Chome, Toshima-ku, Tokyo 171, Japan (Tel:03-3987-7111, Fax:03-3980-3578) REFERENCE 2 (bases 1 to 16382) AUTHORS Harada,N., Iijima,S., Kobayashi,K., Yoshida,T., Brown,W., Hibi,T., Oshima,A. and Morikawa,M. TITLE Molecular cloning and expression of IgG Fc binding protein from colonic epithelial cells JOURNAL Unpublished (1997) REFERENCE 3 (sites) AUTHORS Harada,N., Iijima,S., Kobayashi,K., Yoshida,T., Brown,W., Hibi,T., Oshima,A. and Morikawa,M. TITLE Human IgGFc binding protein (Fc-gamma BP) in colonic epithelial cells exhibits mucin-like structure JOURNAL J. Biol. Chem. (1997) In press FEATURES Location/Qualifiers source 1..16382 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="epithelial cell" /tissue_type="colon" 5'UTR 1..8 CDS 9..16226 /codon_start=1 /product="IgG Fc binding protein" /db_xref="PID:d1020288" /db_xref="PID:g1944352" /translation="MGALWSWWILWAGATLLWGLTQEASVDLKNTGREEFLTAFLQNY QLAYSKAYPRLLISSLSESPASVSILSQADNTSKKVTVRPGESVMVNISAKAEMIGSK IFQHAVVIHSDYAISVQALNAKPDTAELTLLRPIQALGTEYFVLTPPGTSARNVKEFA VVAGAAGASVSVTLKGSVTFNGKFYPAGDVLRVTLQPYNVAQLQSSVDLSGSKVTASS PVAVLSGHSCAQKHTTCNHVVEQLLPTSAWGTHYVVPTLASQSRYDLAFVVASQATKL TYNHGGITGSRGLQAGDVVEFEVRPSWPLYLSANVGIQVLLFGTGAIRNEVTYDPYLV LIPDVAAYCPAYVVKSVPGCEGVALVVAQTKAISGLTIDGHAVGAKLTWEAVPGSEFS YAEVELGTADMIHTAEATTNLGLLTFGLAKAIGYATAADCGRTVLSPVEPSCEGMQCA AGQRCQVVGGKAGCVAESTAVCRAQGDPHYTTFDGRRYDMMGTCSYTMVELCSEDDTL PAFSVEAKNEHRGSRRVSYVGLVTVRAYSHSVSLTRGEVGFVLVDNQRSRLPVSLSEG RLRVYQSGPRAVVELVFGLVVTYDWDCQLALSLPARFQDQVCGLCGNYNGDPADDFLT PDGALAPDAVEFASSWKLDDGDYLCEDGCQNNCPACTPGQAQHYEGDRLCGMLTKLDG PFAVCHDTLDPRPFLEQCVYDLCVVGGERLSLCRGLSAYAQACLELGISVGDWRSPAN CPLSCPANSRYELCGPACPTSCNGAAAPSNCSGRPCVEGCVCLPGFVASGGACVPASS CGCTFQGLQLAPGQEVWADELCQRRCTCNGATHQVTCRDKQSCPAGERCSVQNGLLGC YPDRFGTCQGSGDPHYVSFDGRRFDFMGTCTYLLVGSCGQNAALPAFRVLVENEHRGS QTVSYTRAVRVEARGVKVAVRREYPGQVLVDDVLQYLPFQAADGQVQVFRQGRDAVVR TDFGLTVTYDWNARVTAKVPSSYAEALCGLCGNFNGDPADDLALRGGGQAANALAFGN SWQEETRPGCGATEPGDCPKLDSLVAQQLQSKNECGILADPKGPFRECHSKLDPQGAV RDCVYDRCLLPGQSGPLCDALATYAAACQAAGATVHPWRSEELCPLSCPPHSHYEACS YGCPLSCGDLPVPGGCGSECHEGCVCDEGFALSGESCLPLASCGCVHQGTYHPPGQTF YPGPGCDSLCHCQEGGLVSCESSSCGPHEACQPSGGSLGCVAVGSSTCQASGDPHYTT FDGRRFDFMGTCVYVLAQTCGTRPGLHRFAVLQENVAWGNGRVSVTRVITVQVANFTL RLEQRQWKVTVNGVDMKLPVVLANGQIRASQHGSDVVIETDFGLRVAYDLVYYVRVTV PGNYYQQMCGLCGNYNGDPKDDFQKPNGSQAGNANEFGNSWEEVVPDSPCLPPTPCPP GSEDCIPSHKCPPELEKKYQKEEFCGLLSSPTGPLSSCHKLVDPQGPLKDCIFDLCLG GGNLSILCSNIHAYVSACQAAGGHVEPWRTETFCPMECPPNSHYELCADTCSLGCSAL SAPPQCQDGCAEGCQCDSGFLYNGQACVPIQQCGCYHNGVYYEPEQTVLIDNCRQQCT CHAGKGMVCQEHSCKPGQVCQPSGGILSCVTKDPCHGVTCRPQETCKEQGGQGVCLPN YEATCWLWGDPHYHSFDGRKFDFQGTCNYVLATTGCPGVSTQGLTPFTVTTKNQNRGN PAVSYVRVVTVAALGTNISIHKDEIGKVRVNGVLTALPVSVADGRISVTQGASKALLV ADFGLQVSYDWNWRVDVTLPSSYHGAVCGLCGNMDRNPNNDQVFPNGTLAPSIPIWGG SWRAPGWDPLCWDECRGSCPTCPEDRLEQYEGPGFCGPLAPGTGGPFTTCHAHVPPES FFKGCVLDVCMGGGDRDILCKALASYVAACQAAGVVIEDWRAQVGCEITCPENSHYEV CGPPCPASCPSPAPLTTPAVCEGPCVEGCQCDAGFVLSADRCVPLNNGCGCWANGTYH EAGSEFWADGTCSQWCRCGPGGGSLVCTPASCGLGEVCGLLPSGQHGCQPVSTAECQA WGDPHYVTLDGHRFNFQGTCEYLLSAPCHGPPLGAENFTVTVANEHRGSQAVSYTRSV TLQIYNHSLTLSARWPRKLQVDGVFVTLPFQLDSLLHAHLSGADVVVTTTSGLSLAFD GDSFVRLRVPAAYAGSLCGLCGNYNQDPADDLKAVGGKPAGWQVGGAQGCGECVSKPC PSPCTPEQQESFGGPDACGVISATDGPLAPCHGLVPPAQYFQGCLLDACQVQGHPGGL CPAVATYVAACQAAGAQLREWRRPDFCPFQCPAHSHYELCGDSCPGSCPSLSAPEGCE SACREGCVCDAGFVLSGDTCVPVGQCGCLHDDRYYPLGQTFYPGPGCDSLCRCREGGE VSCEPSSCGPHETCRPSGGSLGCVAVGSTTCQASGDPHYTTFDGRRFDFMGTCVYVLA QTCGTRPGLHRFAVLQENVAWGNGRVSVTRVITVQVANFTLRLEQRQWKVTVNGVDMK LPVVLANGQIRASQHGSDVVIETDFGLRVAYDLVYYVRVTVPGNYYQLMCGLCGNYNG DPKDDFQKPNGSQAGNANEFGNSWEEVVPDSPCLPPPTCPPGSEGCIPSEECPPELEK KYQKEEFCGLLSSPTGPLSSCHKLVDPQGPLKDCIFDLCLGGGNLSILCSNIHAYVSA CQAAGGHVEPWRNETFCPMECPQNSHYELCADTCSLGCSALSAPLQCPDGCAEGCQCD SGFLYNGQACVPIQQCGCYHNGAYYEPEQTVLIDNCRQQCTCHAGKVVVCQEHSCKPG QVCQPSGGILSCVTKDPCHGVTCRPQETCKEQGGQGVCLPNYEATCWLWGDPHYHSFD GRKFDFQGTCNYVLATTGCPGVSTQGLTPFTVTTKNQNRGNPAVSYVRVVTVAALGTN ISIHKDEIGKVRVNGVLTALPVSVADGRISVAQGASKALLVADFGLQVSYDWNWRVDV TLPSSYHGAVCGLCGNMDRNPNNDQVFPNGTLAPSIPIWGGSWRAPGWDPLCWDECRG SCPTCPEDRLEQYEGPGFCGPLAPGTGGPFTTCHAHVPPESFFKGCVLDVCMGGGDHD ILCKALASYVAACQAAGVVIEDWRAQVGCEITCPENSHYEVCGPPCPASCPSPAPLTT PAVCEGPCVEGCQCDAGFVLSADRCVPLNNGCGCWANGTYHEAGSEFWADGTCSQWCR CGPGGGSLVCTPASCGLGEVCGLLPSGQHGCQPVSTAECQAWGDPHYVTLDGHRFDFQ GTCEYLLSAPCHGPPLGAENFTVTVANEHRGSQAVSYTRSVTLQIYNHSLTLSARWPR KLQVDGVFVTLPFQLDSLLHAHLSGADVVVTTTSGLSLAFDGDSFVRLRVPAAYAGSL CGLCGNYNQDPADDLKAVGGKPAGWQVGGAQGCGECVSKPCPSPCTPEQQESFGGPDA CGVISATDGPLAPCHGLVPPAQYFQGCLLDACQVQGHPGGLCPAVATYVAACQAAGAQ LREWRRPDFCPFQCPAHSHYELCGDSCPGSCPSLSAPEGCESACREGCVCDAGFVLSG DTCVPVGQCGCLHDDRYYPLGQTFYPGPGCDSLCRCREGGEVSCEPSSCGPHETCRPS GGSLGCVAVGSTTCQASGDPHYTTFDGRRFDFMGTCVYVLAQTCGTRPGLHRFAVLQE NVAWGNGRVSVTRVITVQVANFTLRLEQRQWKVTVNGVDMKLPVVLANGQIRASQHGS DVVIETDFGLRVAYDLVYYVRVTVPGNYYQLMCGLCGNYNGDPKDDFQKPNGSQAGNA NEFGNSWEEVVPDSPCLPPPTCPPGSEGCIPSEECPPELEKKYQKEEFCGLLSSPTGP LSSCHKLVDPQGPLKDCIFDLCLGGGNLSILCSNIHAYVSACQAAGGHVEPWRNETFC PMECPQNSHYELCADTCSLGCSALSAPLQCPDGCAEGCQCDSGFLYNGQACVPIQQCG CYHNGVYYEPEQTVLIDNCRQQCTCHVGKVVVCQEHSCKPGQVCQPSGGILSCVNKDP CHGVTCRPQETCKEQGGQGVCLPNYEATCWLWGDPHYHSFDGRKFDFQGTCNYVLATT GCPGVSTQGLTPFTVTTKNQNRGNPAVSYVRVVTVAALGTNISIHKDEIGKVRVNGVL TALPVSVADGRISVAQGASKALLVADFGLQVSYDWNWRVDVTLPSSYHGAVCGLCGNM DRNPNNDQVFPNGTLAPSIPIWGGSWRAPGWDPLCWDECRGSCPTCPEDRLEQYEGPG FCGPLASGTGGPFTTCHAHVPPESFFKGCVLDVCMGGGDHDILCKALASYVAACQAAG VVIEDWRAQVGCEITCPENSHYEVCGPPCPASCPSPAPLTTPAVCEGPCVEGCQCDAG FVLSADRCVPLNNGCGCWANGTYHEAGSEFWADGTCSQWCRCGPGGGSLVCTPASCGL GEVCGLLPSGQHSCQPVSTAECQAWGDPHYVTLDGHRFDFQGTCEYLLSAPCHGPPLG AENFTVTVANEHRGSQAVSYTRSVTLQIYNHSLTLSARWPRKLQVDGVFVALPFQLDS LLHAHLSGADVVVTTTSGLSLAFDGDSFVRLRVPAAYAASLCGLCGNYNQDPADDLKA VGGKPAGWQVGGAQGCGECVSKPCPSPCTPEQQESFGGPDACGVISATDGPLAPCHGL VPPAQYFQGCLLDACQVQGHPGGLCPAVATYVAACQAAGAQLGEWRRPDFCPLQCPAH SHYELCGDSCPVSCPSLSAPEGCESACREGCVCDAGFVLSGDTCVPVGQCGCLHDGRY YPLGEVFYPGPECERRCECGPGGHVTCQEGAACGPHEECRLEDGVQACHATGCGRCLA NGGIHYITLDGRVYDLHGSCSYVLAQVCHPKPGDEDFSIVLEKNAAGHLQRLLVTVAG QVVSLAQGQQVTVDGEAVALPVAVGRVRVTAEGRNMVLQTTKGLRLLFDGDAHLLMSI PSPFRGRLCGLCGNFNGNWSDDFVLPNGSAASSVETFGAAWRVPGSSKGCGEGCGPQG CPVCLAEETAPYESNEACGQLRNPQGPFATCQAVLSPSEYFRQCVYDLCAQKGDKAFL CRSLAAYTAACQAAGVAVKPWRTDSFCPLHCPAHSHYSICTRTCQGSCAALSGLTGCT TRCFEGCECDDRFLLSQGVCIPVQDCGCTHNGRYLPVNSSLLTSDCSERCSCSSSSGL TCQAAGCPPGRVCEVKAEARNCWATRGLCVLSVGANLTTFDGARGATTSPGVYELSSR CPGLQNTIPWYRVVAEVQICHGKTEAVGQVHIFFQDGMVTLTPNKGVWVNGLRVDLPA EKLASVSVSRTPDGSLLVRQKAGVQVWLGANGKVAVIVSNDHAGKLCGACGNFDGDQT NDWHDSQEKPAMEKWRAQDFSPCYG" 3'UTR 16227..16382 polyA_signal 16366..16371 BASE COUNT 2803 a 5193 c 5206 g 3180 t ORIGIN 1 ctgcagccat gggtgcccta tggagctggt ggatactctg ggctggagca accctcctgt 61 ggggattgac ccaggaggct tcagtggacc tcaagaacac tggcagagag gaattcctca 121 cagccttcct gcagaactat cagctggcct acagcaaggc ctacccccgc ctccttatct 181 ccagtctgtc agagagcccc gcttcagtct ccatcctcag ccaggcagac aacacctcaa 241 agaaggtcac agtgaggccc ggggagtcgg tcatggtcaa catcagtgcc aaggctgaga 301 tgataggcag caagatcttc cagcatgcgg tggtgatcca ttctgactat gccatctctg 361 tgcaggcact aaatgccaag cctgacacag cggagctgac actgctgcgg cccatccagg 421 ccctaggcac cgagtatttt gtgctcacac cccccggcac ctcagccagg aatgtcaagg 481 agtttgccgt ggtggccggt gccgcaggtg cctcggtcag tgtcacgctg aaggggtcag 541 tgacattcaa tggcaagttc tatccagcag gcgatgtcct aagagtgact ctacagccct 601 acaatgtggc ccagctacag agctcagtgg atctctcggg gtcaaaggtc acagctagta 661 gccccgtggc tgtcctctct ggccacagct gtgcgcagaa acatacgacc tgcaaccatg 721 tggttgagca gctgctaccc acgtctgcct ggggcaccca ctatgtagta cccacgctgg 781 cctcccaatc tcgctatgat ttggccttcg ttgtggccag ccaggccaca aagctgacct 841 acaaccatgg gggtatcact ggctcccgtg ggctccaggc aggtgatgtg gtagagtttg 901 aggtccggcc atcctggcca ctctacctgt ctgcaaatgt gggcatccag gtcctgttgt 961 ttggcacagg tgccataagg aatgaagtga cttatgaccc ctacctggtc ctgatcccag 1021 atgtggcggc ctactgccca gcctatgtgg tcaagagtgt accaggctgt gagggcgtgg 1081 ccctggtagt ggcacagacg aaggctatca gcgggctgac catagatggg catgcagtgg 1141 gggccaagct cacctgggag gctgtgccag gcagtgagtt ctcgtatgct gaagtggagc 1201 tcggcacagc tgacatgatc cacacggccg aggccaccac caacttggga ctgctcacct 1261 tcgggctggc caaggctata ggctacgcaa cagctgctga ttgcggccgg actgtactgt 1321 ccccagtgga gccctcctgc gaaggcatgc agtgcgcagc cgggcagcgc tgccaggtgg 1381 taggcgggaa ggccgggtgt gtggcggagt ccaccgctgt ctgccgcgcc cagggcgacc 1441 cccattacac caccttcgac ggccgtcgct acgacatgat gggcacctgt tcgtacacga 1501 tggtggagct gtgcagcgag gacgacaccc tgcccgcctt cagcgtggag gccaagaacg 1561 agcaccgggg cagccgccgc gtctcctacg tgggcctcgt cactgtgcgc gcctacagcc 1621 actctgtgtc gctgacccgc ggtgaagttg gcttcgtcct ggttgacaac cagcgctcgc 1681 gcctgccagt ctccctgagt gagggtcgcc tgcgtgtgta ccagagcgga ccacgggccg 1741 tggtggagct ggtctttggg ctggtggtca cttatgactg ggactgccag ctggcactca 1801 gcctgcctgc acgcttccaa gaccaggtgt gcgggctgtg tggcaactat aatggtgacc 1861 cagcagacga cttcctcacg cctgacgggg ctctggctcc tgacgctgtg gagttcgcaa 1921 gtagctggaa gctggatgat ggggactacc tgtgtgagga tggctgccag aacaactgtc 1981 ccgcctgcac cccaggccag gcccaacact atgagggcga ccgactctgt ggcatgctga 2041 ccaagctcga tggccccttc gctgtctgcc atgacaccct ggaccccagg cccttcctgg 2101 agcagtgtgt atatgacctg tgtgtggtcg gtggggagcg gctcagcctg tgccgtggcc 2161 tcagcgccta tgcccaggcc tgtctggagc ttggcatctc ggttggggac tggagatcac 2221 cagccaactg ccccctgtcc tgccctgcca acagccgcta tgagctctgc ggccctgctt 2281 gcccgacctc ctgcaacggg gctgcggcgc cgtccaactg ctccgggcgc ccctgcgtgg 2341 agggctgcgt gtgcctccca ggcttcgtgg ccagcggcgg cgcctgcgtg ccggcctcgt 2401 cgtgtggctg caccttccag ggtctccagc tcgctccggg ccaggaagtg tgggcggacg 2461 agttgtgcca aaggcgctgc acctgcaacg gcgccaccca tcaggtcacc tgccgcgaca 2521 agcagagctg cccggcgggt gagcgctgca gcgtccagaa cggcctcctg ggctgctacc 2581 ccgatcgctt cgggacctgc caggggtccg gggacccaca ctatgtgagc ttcgacggcc 2641 ggcgcttcga cttcatgggc acctgcacgt acctgctggt cggctcatgc ggccagaacg 2701 cagcgctgcc tgccttccgg gtgctggtgg aaaacgagca tcggggcagc cagactgtga 2761 gctacacgcg cgccgtgcgg gtggaggccc gcggggtgaa ggtggccgtg cgccgggagt 2821 accccgggca agtgctggtg gatgacgtcc ttcagtatct gcccttccaa gcagcagatg 2881 ggcaggtgca ggtgttccga cagggcaggg atgccgtcgt gcgcacggac tttggcctga 2941 ctgtcactta tgactggaat gcacgagtga ctgccaaggt gcccagcagc tatgctgagg 3001 ccctgtgtgg actctgtggg aacttcaacg gggacccagc tgatgacctg gctctgcggg 3061 gtgggggtca agctgccaat gcactggcct ttgggaacag ctggcaagaa gagacgaggc 3121 ccggctgtgg agcaactgaa ccgggtgact gtcccaagct ggactccctg gtggcccagc 3181 agctgcagag caagaatgag tgtggaatcc ttgccgaccc caaggggccc ttccgggagt 3241 gccatagcaa gctggacccc cagggtgccg tgcgcgactg tgtctatgac cgctgcctgc 3301 tgccaggcca gtctgggcca ctgtgtgacg cactggccac ctatgctgct gcatgccagg 3361 ctgctggagc cacagtgcac ccctggagga gtgaagaact ttgcccactg agctgcccac 3421 cccacagcca ctatgaggcg tgttcctacg gctgcccgct gtcctgtgga gacctcccag 3481 tgcccggggg ctgtggctca gaatgccatg agggctgcgt gtgcgatgag ggctttgcgc 3541 tcagtggtga gtcctgcctg cccctggcct cctgtggctg cgtacaccag ggcacctacc 3601 acccaccagg ccagaccttc taccctggcc ccggatgtga ttccctttgc cactgccagg 3661 agggcggcct ggtgtcctgt gagtcctcca gctgcggacc gcacgaggcc tgccagccat 3721 ccggtggcag cttgggctgt gtggccgtgg gctctagcac ctgccaggcg tcaggagacc 3781 cccactacac caccttcgat ggccgccgct tcgacttcat gggcacctgc gtgtatgtgc 3841 tggctcagac ctgcggcacc cggcctggcc tgcatcggtt tgccgtcctg caggagaacg 3901 tggcctgggg taatgggcga gtcagtgtga ccagggtgat cacggtccag gtggcaaact 3961 tcaccctgcg gctggagcag agacagtgga aggtcacggt gaacggtgtg gacatgaagc 4021 tgcccgtggt gctggccaac ggccagatcc gtgcctccca gcatggttca gatgttgtga 4081 ttgagaccga cttcggcctg cgtgtggcct acgaccttgt gtactatgtg cgggtcaccg 4141 tccccggaaa ctactaccag cagatgtgtg gcctgtgtgg gaactacaac ggcgacccca 4201 aggatgactt ccagaagccc aatggctcac aggcaggcaa cgccaatgag ttcggcaact 4261 cctgggagga ggtggtgccc gactctccct gcctgccgcc caccccttgc ccgccgggga 4321 gcgaggactg tatccccagc cacaagtgtc ctcccgagct ggagaagaag tatcagaagg 4381 aggagttctg tgggctcctc tccagcccca cagggccact gtcctcctgc cacaagctgg 4441 tggatcccca gggtcccttg aaagattgca tctttgatct ctgcctgggt ggtgggaacc 4501 tgagcattct ctgcagcaac atccatgcct acgtgagtgc ttgccaggcg gctggaggcc 4561 acgtggagcc ctggaggact gaaactttct gtcccatgga gtgccctccg aacagtcact 4621 acgagctctg tgcggacacc tgctccctgg gctgctcagc tctcagtgcc cctccacagt 4681 gccaggatgg gtgtgctgag ggctgccagt gtgactccgg cttcctctac aatggccaag 4741 cctgcgtgcc catccagcaa tgcggctgct accacaatgg tgtctactat gagccggagc 4801 agacagtcct cattgacaac tgtcggcagc agtgcacgtg ccatgcgggt aaaggcatgg 4861 tgtgccagga acacagctgc aagccggggc aggtgtgcca gccctccgga ggcatcctga 4921 gctgcgtcac caaagacccg tgccacggcg tgacatgccg gccacaggag acatgcaagg 4981 agcagggtgg ccagggcgtg tgcctgccca actatgaggc cacgtgctgg ctgtggggcg 5041 acccacacta ccactccttc gatggccgga agtttgactt ccagggcacc tgtaactatg 5101 tgctggcaac aactggctgc ccgggggtca gcacccaggg cctgacaccc ttcaccgtca 5161 ccaccaagaa ccagaaccgg ggcaaccctg ctgtgtccta cgtgagagtc gtcaccgtgg 5221 ctgccctcgg caccaacatc tccatccaca aggacgagat cggcaaagtc cgggtgaacg 5281 gtgtgctcac agccttgcct gtctctgtgg ccgacgggcg gatttcagtg acccagggtg 5341 catcgaaggc actgctggtg gctgactttg gactgcaagt cagctatgac tggaactggc 5401 gggtagacgt gacgctgccc agcagctatc atggcgcagt gtgcgggctc tgcggtaaca 5461 tggaccgcaa ccccaacaat gaccaggtct tccctaatgg cacactggct ccctccatac 5521 ccatctgggg cggcagctgg cgagccccag gctgggaccc actgtgttgg gacgaatgtc 5581 gggggtcctg cccaacgtgc cctgaggacc ggttggagca gtacgagggc cctggcttct 5641 gcggacccct ggcccccggc acagggggcc ctttcaccac ctgccatgct catgtgccac 5701 ctgagagctt cttcaagggc tgtgttctgg acgtctgcat gggtggtggg gaccgtgaca 5761 ttctttgcaa ggctctggct tcctatgtgg ccgcctgcca ggctgctggg gttgtcatcg 5821 aagactggcg ggcacaggtt ggctgtgaga tcacctgccc agaaaacagc cactatgagg 5881 tctgtggccc accctgcccg gccagctgtc cgtcccctgc accccttacg acgccagccg 5941 tatgtgaggg cccctgtgtg gagggctgcc agtgcgacgc gggtttcgtg ttaagtgctg 6001 accgctgtgt tcccctcaac aacggctgcg gctgctgggc caatggcacc taccacgagg 6061 cgggcagtga gttttgggct gatggcacct gctcccagtg gtgtcgctgc gggcctgggg 6121 gtggctcgct ggtctgcaca cctgccagct gtgggctggg tgaagtgtgt ggcctcctgc 6181 catccggcca gcacggctgc cagcccgtca gcacagctga gtgccaggcg tggggtgacc 6241 cccattacgt cactctggat gggcaccgat tcaatttcca aggcacctgc gagtacctgc 6301 tgagtgcacc ctgccacgga ccacccttgg gggctgagaa cttcactgtc actgtagcca 6361 atgagcaccg gggcagccag gctgtcagct acacccgcag tgtcaccctg caaatctaca 6421 accacagcct gacactgagt gcccgctggc cccggaagct acaggtggac ggcgtgttcg 6481 tcactctgcc cttccagctg gactcgctcc tgcacgcaca cctgagcggc gccgacgtgg 6541 tggtgaccac aacctcaggg ctctcgctgg ctttcgacgg ggacagcttc gtgcgcctgc 6601 gcgtgccggc ggcgtacgcg ggctctctct gtggcttatg cgggaactac aaccaggacc 6661 ccgcagacga cctgaaggcg gtgggcggga agcccgccgg atggcaggtg ggcggcgccc 6721 agggctgcgg ggaatgtgtg tccaagccat gcccgtcgcc gtgcacccca gagcagcaag 6781 agtccttcgg cggcccggac gcctgcggcg tgatctccgc caccgacggc ccgctggcgc 6841 cctgccacgg ccttgtgccg cccgcgcagt acttccaggg ctgcttgctg gacgcctgcc 6901 aagttcaggg ccatcctgga ggcctctgtc ctgcagtggc cacctacgtg gcagcctgtc 6961 aggccgctgg ggcccagctc cgcgagtgga ggcggccgga cttctgtccc ttccagtgcc 7021 ctgcccacag ccactacgag ctctgcggtg actcctgtcc tgggagctgc ccgagcctgt 7081 cggcacccga gggctgtgag tcggcctgcc gtgaaggctg tgtctgcgat gctggcttcg 7141 tgctcagtgg tgacacgtgt gtacctgtgg gccagtgtgg ctgcctccac gatgaccgct 7201 actacccact gggccagacc ttctaccctg gccctgggtg tgattccctt tgccgctgcc 7261 gggagggcgg tgaggtgtcc tgtgagccct ccagctgcgg cccgcatgag acctgccggc 7321 catccggtgg cagcttgggc tgcgtggccg tgggctctac cacctgccag gcgtcgggag 7381 atccccacta caccaccttc gatggccgcc gcttcgactt catgggcacc tgcgtgtatg 7441 tgctggctca gacctgcggc acccggcctg gcctacatcg gtttgccgtc ctgcaggaga 7501 acgtggcctg gggtaatggg cgagtcagtg tgaccagggt gatcacggtc caggtggcaa 7561 acttcaccct gcggctggag cagagacagt ggaaggtcac ggtgaacggt gtggacatga 7621 agctgcccgt ggtgctggcc aacggccaga tccgtgcctc ccagcatggt tcagatgttg 7681 tgattgagac cgacttcggc ctgcgtgtgg cctacgacct tgtgtactat gtgcgggtca 7741 ccgtccctgg aaactactac cagctgatgt gtggcctgtg tgggaactac aacggcgacc 7801 ccaaggatga cttccagaag cccaatggct cgcaggcagg caacgccaat gagttcggca 7861 actcctggga ggaggtggtg cccgactctc cctgcctgcc gccgcccacc tgcccgccgg 7921 ggagcgaggg ctgtatcccc agcgaggagt gtcctcccga gctggagaag aagtatcaga 7981 aggaggagtt ctgtgggctc ctctccagcc ccacagggcc actgtcctcc tgccacaagc 8041 tggtggatcc ccagggtccc ttgaaagatt gcatctttga tctctgcctg ggtggtggga 8101 acctgagcat tctctgcagc aacatccatg cctacgtgag tgcttgccag gcggctggag 8161 gccacgtgga gccctggagg aatgaaactt tctgtcccat ggaatgccct cagaacagtc 8221 actacgagct ctgtgcggac acctgctccc tgggctgctc ggctctcagt gcccctctgc 8281 agtgcccaga tgggtgtgct gagggctgcc agtgtgactc cggcttcctc tacaacggcc 8341 aagcctgcgt gcccatccag caatgtggct gctaccacaa tggtgcctac tatgagccgg 8401 agcagacagt cctcattgac aactgtcggc agcagtgcac gtgccatgcg ggtaaagtcg 8461 tggtgtgcca ggaacacagc tgcaagccgg ggcaggtgtg ccagccctcc ggaggcatcc 8521 tgagctgcgt caccaaagac ccgtgccacg gcgtgacatg ccggccacag gagacatgca 8581 aggagcaggg tggccagggt gtgtgcctgc ccaactatga ggccacgtgc tggctgtggg 8641 gcgacccaca ctaccactcc ttcgatggcc ggaagtttga cttccagggc acctgtaact 8701 atgtgctggc aacaactggc tgcccggggg tcagcaccca gggcctgaca cccttcaccg 8761 tcaccaccaa gaaccagaac cggggcaacc ctgctgtatc ctacgtgaga gtcgtcaccg 8821 tggctgccct cggcaccaac atctccatcc acaaggacga gatcggcaaa gtccgggtga 8881 acggtgtgct cacagccttg cctgtctccg tggccgacgg gcggatttca gtggcccagg 8941 gtgcatcgaa ggcactgctg gtggctgact ttggactgca agtcagctat gactggaact 9001 ggcgggtaga cgtgacgctc cccagcagct atcatggcgc agtgtgcggg ctctgcggta 9061 acatggaccg caaccccaac aatgaccagg tcttccctaa tggcacactg gctccctcca 9121 tacccatctg gggcggcagc tggcgagccc caggctggga cccactgtgt tgggacgaat 9181 gtcgggggtc ctgcccaacg tgccctgagg accggttgga gcagtacgag ggccctggct 9241 tctgcggacc cctggccccc ggcacagggg gccctttcac cacctgccat gctcatgtgc 9301 cacctgagag cttcttcaag ggctgtgttc tggacgtctg catgggtggt ggggaccatg 9361 acattctttg caaggctctg gcttcctacg tggccgcctg ccaggccgct ggggttgtca 9421 tcgaagactg gcgggcacag gttggctgtg agatcacctg cccagaaaac agccactatg 9481 aggtctgtgg cccaccctgc ccggccagct gtccgtcccc tgcacccctt acgacgccag 9541 ccgtatgtga gggcccctgt gtggagggct gccagtgcga cgcgggtttc gtgttaagtg 9601 ctgaccgctg tgttcccctc aacaacggct gcggctgctg ggccaatggc acctaccacg 9661 aggcgggcag tgagttttgg gctgatggca cctgctccca gtggtgtcgc tgcgggcctg 9721 ggggtggctc gctggtctgc acacctgcca gctgtgggct gggtgaagtg tgtggcctcc 9781 tgccatccgg ccagcacggc tgccagcccg tcagcacagc tgagtgccag gcgtggggtg 9841 acccccatta cgtcactctg gatgggcacc gattcgattt ccaaggcacc tgcgagtacc 9901 tgctgagtgc accctgccac ggaccaccct tgggggctga gaacttcact gtcactgtag 9961 ccaatgagca ccggggcagc caggctgtca gctacacccg cagtgtcacc ctgcaaatct 10021 acaaccacag cctgacactg agtgcccgct ggccccggaa gctacaggtg gacggcgtgt 10081 tcgtcactct gcccttccag ctggactcgc tcctgcacgc acacctgagc ggcgccgacg 10141 tggtggtgac cacaacctca gggctctcgc tggctttcga cggggacagc ttcgtgcgcc 10201 tgcgcgtgcc ggcggcgtac gcgggctctc tctgtggctt atgcgggaac tacaaccagg 10261 accccgcaga cgacctgaag gcggtgggcg ggaagcccgc cggatggcag gtgggcggcg 10321 cccagggctg cggggaatgt gtgtccaagc catgcccgtc gccgtgcacc ccagagcagc 10381 aagagtcctt cggcggcccg gacgcctgcg gcgtgatctc cgccaccgac ggcccgctgg 10441 cgccctgcca cggccttgtg ccgcccgcgc agtacttcca gggctgcttg ctggacgcct 10501 gccaagttca gggccatcct ggaggcctct gtcctgcagt ggccacctac gtggcagcct 10561 gtcaggccgc tggggcccag ctccgcgagt ggaggcggcc ggacttctgt cccttccagt 10621 gccctgccca cagccactac gagctctgcg gtgactcctg tcctgggagc tgcccgagcc 10681 tgtcggcacc cgagggctgt gagtcggcct gccgtgaagg ctgtgtctgc gatgctggct 10741 tcgtgctcag tggtgacacg tgtgtacctg tgggccagtg tggctgcctc cacgatgacc 10801 gctactaccc actgggccag accttctacc ctggccctgg gtgtgattcc ctttgccgct 10861 gccgggaggg cggtgaggtg tcctgtgagc cctccagctg cggcccgcat gagacctgcc 10921 ggccatccgg tggcagcttg ggctgcgtgg ccgtgggctc taccacctgc caggcgtcgg 10981 gagatcccca ctacaccacc ttcgatggcc gccgcttcga cttcatgggc acctgcgtgt 11041 atgtgctggc tcagacctgc ggcacccggc ctggcctaca tcggtttgcc gtcctgcagg 11101 agaacgtggc ctggggtaat gggcgagtca gtgtgaccag ggtgatcacg gtccaggtgg 11161 caaacttcac cctgcggctg gagcagagac agtggaaggt cacggtgaac ggtgtggaca 11221 tgaagctgcc cgtggtgctg gccaacggcc agatccgtgc ctcccagcat ggttcagatg 11281 ttgtgattga gaccgacttc ggcctgcgtg tggcctacga ccttgtgtac tatgtgcggg 11341 tcaccgtccc tggaaactac taccagctga tgtgtggcct gtgtgggaac tacaacggcg 11401 accccaagga tgacttccag aagcccaatg gctcgcaggc aggcaacgcc aatgagttcg 11461 gcaactcctg ggaggaggtg gtgcccgact ctccctgcct gccgccgccc acctgcccgc 11521 cggggagcga gggctgtatc cccagcgagg agtgtcctcc cgagctggag aagaagtatc 11581 agaaggagga gttctgtggg ctcctctcca gccccacagg gccactgtcc tcctgccaca 11641 agctggtgga tccccagggt cccttgaaag attgcatctt tgatctctgc ctgggtggtg 11701 ggaacctgag cattctctgc agcaacatcc atgcctacgt gagtgcttgc caggcggctg 11761 gaggccacgt ggagccctgg aggaatgaaa ctttctgtcc catggaatgc cctcagaaca 11821 gtcactacga gctctgtgcg gacacctgct ccctgggctg ctcggctctc agtgcccctc 11881 tgcagtgccc agatgggtgt gctgagggct gccagtgtga ctccggcttc ctctacaacg 11941 gccaagcctg cgtgcccatc cagcaatgtg gctgctacca caatggtgtc tactatgagc 12001 cggagcagac agtcctcatt gacaactgtc ggcagcagtg cacgtgccat gtgggtaaag 12061 tcgtggtgtg ccaggaacac agctgcaagc cggggcaggt gtgccagccc tccggaggca 12121 tcctgagctg cgtcaacaaa gacccgtgcc acggcgtgac atgccggcca caggagacat 12181 gcaaggagca gggtggccag ggtgtgtgcc tgcccaacta tgaggccacg tgctggctgt 12241 ggggcgaccc acactaccac tccttcgatg gccggaagtt tgacttccag ggcacctgta 12301 actatgtgct ggcaacaact ggctgcccgg gggtcagcac ccagggcctg acacccttca 12361 ccgtcaccac caagaaccag aaccggggca accctgctgt atcctacgtg agagtcgtca 12421 ccgtggctgc cctcggcacc aacatctcca tccacaagga cgagatcggc aaagtccggg 12481 tgaacggtgt gctcacagcc ttgcctgtct ccgtggccga cgggcggatt tcagtggccc 12541 agggtgcatc gaaggcactg ctggtggctg actttggact gcaagtcagc tatgactgga 12601 actggcgggt agacgtgacg ctccccagca gctatcatgg cgcagtgtgc gggctctgcg 12661 gtaacatgga ccgcaacccc aacaatgacc aggtcttccc taatggcaca ctggctccct 12721 ccatacccat ctggggcggc agctggcgag ccccaggctg ggacccactg tgttgggacg 12781 aatgtcgggg gtcctgccca acgtgccctg aggaccggtt ggagcagtac gaggggcctg 12841 gcttctgcgg acccctggca tctggcacag ggggcccctt caccacctgc catgctcatg 12901 tgccacctga gagcttcttc aagggctgtg ttctggacgt ctgcatgggt ggtggggacc 12961 atgacattct ttgcaaggct ctggcttcct acgtggccgc ctgccaggcc gctggggttg 13021 tcatcgaaga ctggcgggca caggttggct gtgagatcac ctgcccagaa aacagccact 13081 atgaggtctg tggcccaccc tgcccggcca gctgtccgtc ccctgcaccc cttacgacgc 13141 cagccgtatg tgagggcccc tgtgtggagg gctgccagtg cgacgcgggt ttcgtgttaa 13201 gtgctgaccg ctgtgttccc ctcaacaacg gctgcggctg ctgggccaat ggcacctacc 13261 acgaggcggg cagtgagttt tgggctgatg gcacctgctc ccagtggtgt cgctgcgggc 13321 ctgggggtgg ctcgctggtc tgcacacctg ccagctgtgg gctgggtgaa gtgtgtggcc 13381 tcctgccatc cggccagcac agctgccagc ccgtcagcac agctgagtgc caggcgtggg 13441 gtgaccccca ttacgtcact ctggatgggc accgattcga tttccaaggc acctgcgagt 13501 acctgctgag tgcaccctgc cacggaccac ccttgggggc tgagaacttc actgtcactg 13561 tagccaatga gcaccggggc agccaggctg tcagctacac ccgcagtgtc accctgcaaa 13621 tctacaacca cagcctgaca ctgagtgccc gctggccccg gaagctacag gtcgacggcg 13681 tgttcgtggc tctgcctttc cagctggact cgctcctgca cgcacacctg agcggcgccg 13741 acgtggtggt gaccacaacc tcagggctct cgctggcttt cgatggggac agcttcgtgc 13801 gcctgcgcgt gccggcggcg tacgcggcct ctctctgtgg cttatgcggg aactacaacc 13861 aggaccccgc agacgacctg aaggctgtgg gcgggaagcc cgctggatgg caggtgggcg 13921 gggcccaggg ctgcggggaa tgtgtgtcca agccatgccc gtcgccgtgc accccagagc 13981 agcaggagtc cttcggcggc ccggacgcct gcggcgtgat ctccgccacc gacggcccgc 14041 tggcaccctg ccacggcctt gtgccgcccg cgcagtactt ccagggctgc ttgctggacg 14101 cctgccaagt tcagggccat cctggaggcc tctgtcctgc agtggctacc tacgtggcag 14161 cctgtcaggc cgctggggcc cagctcggcg agtggaggcg gccggacttc tgtcccttgc 14221 agtgccctgc ccacagccac tatgagctct gcggtgactc ctgccctgtg agctgcccga 14281 gcctctcagc acccgagggc tgtgagtcgg cctgccgtga aggctgtgtc tgcgatgctg 14341 gcttcgtact cagtggtgac acctgcgtac ccgtgggcca gtgtggctgc ctccatgatg 14401 gccgctacta cccactgggc gaggtcttct acccgggccc tgagtgtgag cgacgctgtg 14461 agtgtgggcc aggtggccat gtcacctgcc aggagggcgc agcctgtggg ccccatgagg 14521 agtgccggtt agaggatggt gtccaggcct gtcatgccac aggctgtggc cgctgcctgg 14581 ccaacggggg catccactac atcacccttg atggccgtgt ctacgacctg catggctcct 14641 gctcctatgt cttggcccaa gtctgccacc caaagcctgg ggacgaggac ttttccatcg 14701 tgcttgagaa gaatgcagct ggacatctcc aacgcctcct ggttactgtg gctggccagg 14761 ttgtgagcct agctcagggg cagcaggtca ccgtggacgg cgaggctgtg gccctgcctg 14821 tggctgtggg ccgcgtgcgg gtgaccgccg agggccgaaa catggttctg cagacgacca 14881 aggggctgcg gcttctcttt gatggcgatg cccacctcct catgtccatc cccagcccct 14941 tccgtggacg gctctgtggc ctctgtggga acttcaatgg caactggagt gacgactttg 15001 tcctgcccaa tggctcagca gcgtccagtg tggagacctt cggggctgca tggcgggtgc 15061 ccggctcctc caagggctgt ggcgagggct gcgggcccca aggctgccca gtgtgcttgg 15121 cagaggagac tgcaccctat gagagcaacg aggcctgcgg gcagctccgg aacccccagg 15181 gccccttcgc gacctgccag gcggtgctga gtccctctga gtacttccgc caatgcgtat 15241 acgacctgtg cgcgcaaaag ggtgacaaag ccttcctgtg ccgcagcctg gcagcctaca 15301 cggcggcctg tcaggcagct ggcgtggccg tgaagccctg gaggacagac agcttctgcc 15361 cgctccattg ccccgcccac agccactact ccatctgcac tcgcacctgc cagggatcct 15421 gtgcggctct ctccggcctc acgggctgca ccacccgctg ttttgagggc tgtgagtgcg 15481 acgaccgctt cctgctttcc cagggtgtct gcatccctgt ccaagattgt ggctgcaccc 15541 ataatggccg atacttgccg gtaaactcct ccctgctgac ctcagactgc agcgagcgct 15601 gttcctgttc ctcaagctct ggcctgacat gccaggccgc tggctgccca ccaggccgtg 15661 tatgtgaggt caaggctgaa gcccggaact gctgggccac ccgtggtctc tgtgtcctgt 15721 ctgtgggtgc caacctcacc acctttgatg gggcccgtgg tgccaccacc tctcctggtg 15781 tctatgagct ctcttcccgc tgcccaggac tacagaatac catcccctgg taccgtgtag 15841 ttgccgaagt ccagatctgc catggcaaaa cggaggctgt gggccaggtc cacatcttct 15901 tccaggatgg gatggtgacg ttgactccaa acaagggtgt gtgggtgaat ggtctccgag 15961 tggatctccc agctgagaag ttagcatctg tgtccgtgag tcgtacacct gatggctccc 16021 tgctagtccg ccagaaggca ggggtccagg tgtggcttgg agccaatggg aaggtggctg 16081 tgattgtcag caatgaccat gctgggaaac tgtgtggggc ctgtggaaac tttgacgggg 16141 accagaccaa tgattggcat gactcccagg agaagccagc gatggagaaa tggagagcgc 16201 aggacttctc cccatgttat ggctgatcag tcatccacca ggaacgaaga tttcctgaag 16261 aagacctggt ccctctggag gttgcggtgg ctgaaggatg catcatgtgc tcctaccctg 16321 ctctaccgct tttctgggtc acagaggcca aatgtgagag cattgaataa atatcttaag 16381 ct // LOCUS D84294 9078 bp mRNA PRI 26-NOV-1996 DEFINITION Human mRNA for TPRDI, complete cds. ACCESSION D84294 NID g1632761 KEYWORDS TPRDI. SOURCE Homo sapiens fetal brain cDNA to mRNA, clone:TPRDI. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 9078) AUTHORS Tsukahara,F., Hattori,M., Muraki,T. and Sakaki,Y. TITLE Identification and cloning of a novel cDNA belonging to tetratricopeptide repeat gene family from Down syndrome-critical region 21q22.2 JOURNAL J. Biochem. 120 (4), 820-827 (1996) MEDLINE 97103476 REFERENCE 2 (bases 1 to 9078) AUTHORS Tsukahara,F. JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 9078) AUTHORS Tsukahara,F. TITLE Direct Submission JOURNAL Submitted (03-APR-1996) to the DDBJ/EMBL/GenBank databases. Fujiko Tsukahara, Tokyo Women's Medical College, Department of Pharmacology; Kawada-cho 8-1, Shinjuku-ku, Tokyo 162, Japan (E-mail:fuji@research.twmc.ac.jp, Tel:81-3-3353-8111(ex.22513), Fax:81-3-5269-7417) COMMENT Sequence updated (24-Sep-1996) by: Fujiko Tsukahara. FEATURES Location/Qualifiers source 1..9078 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="21" /clone="TPRDI" /map="21q22.2" /tissue_type="fetal brain" mRNA 1..9078 5'UTR 1..1469 CDS 1470..7547 /codon_start=1 /product="TPRDI" /db_xref="PID:d1012977" /db_xref="PID:g1632762" /translation="MDNFAEGDFTVADYALLEDCPHVDDCVFAAEFMSNDYVRVTQLY CDGVGVQYKDYIQSERNLEFDICSIWCSKPISVLQDYCDAIKINIFWPLLFQHQNSSV ISRLHPCVDANNSRASEINLKKLQHLELMEDIVDLAKKVANDSFLIGGLLRIGCKIEN KILAMEEALNWIKYAGDVTILTKLGSIDNCWPMLSIFFTEYKYHITKIVMEDCNLLEE LKTQSCMDCIEEGELMKMKGNEEFSKERFDIAIIYYTRAIEYRPENYLLYGNRALCFL RTGQFRNALGDGKRATILKNTWPKGHYRYCDALSMLGEYDWALQANIKAQKLCKNDPE GIKDLIQQHVKLQKQIEDLQGRTANKDPIKAFYENRAYTPRSLSAPIFTTSLNFVEKE RDFRKINHEMANGGNQNLKVADEALKVDDCDCHPEFSPPSSQPPKHKGKQKSRNNESE KFSSSSPLTLPADLKNILEKQFSKSSRAAHQDFANIMKMLRSLIQDGYMALLEQRCRS AAQAFTELLNGLDPQKIKQLNLAMINYVLVVYGLAISLLGIGQPEELSEAENQFKRII EHYPSEGLDCLAYCGIGKVYLKKNRFLEALNHFEKARTLIYRLPGVLTWPTSNVIIEE SQPQKIKMLLEKFVEECKFPPVPDAICCYQKCHGYSKIQIYITDPDFKGFIRISCCQY CKIEFHMNCWKKLKTTTFNDKIDKDFLQGICLTPDCEGVISKIIIFSSGGEVKCEFEH KVIKEKVPPRPILKQKCSSLEKLRLKEDKKLKRKIQKKEAKKLAQERMEEDLRESNPP KNEEQKETVDNVQRCQFLDDRILQCIKQYADKIKSGIQNTAMLLKELLSWKVLSTEDY TTCFSSRNFLNEAVDYVIRHLIQENNRVKTRIFLHVLSELKEVEPKLAAWIQKLNSFG LDATGTFFSRYGASLKLLDFSIMTFLWNEKYGHKLDSIEGKQLDYFSEPASLKEARCL IWLLEEHRDKFPALHSALDEFFDIMDSRCTVLRKQDSGEAPFSSTKVKNKSKKKKPKD SKPMLVGSGTTSVTSNNEIITSSEDHSNRNSDSAGPFAVPDHLRQDVEEFEALYDQHS NEYVVRNKKLWDMNPKQKCSTLYDYFSQFLEEHGPLDMSNKMFSAEYEFFPEETRQIL EKAGGLKPFLLGCPRFVVIDNCIALKKVASRLKKKRKKKNIKTKVEEISKAGEYVRVK LQLNPAAREFKPDVKSKPVSDSSSAPAFENVKPKPVSANSPKPACEDVKAKPVSDNSS RQVSEDGQPKGVSSNSPKPGSEDANYKRVSCNSPKPVLEDVKPTYWAQSHLVTGYCTY LPFQRFDITQTPPAYINVLPGLPQYTSIYTPLASLSPEYQLPRSVPVVPSFVANDRAD KNAAAYFEGHHLNAENVAGHQIASETQILEGSLGISVKSHCSTGDAHTVLSESNRNDE HCGNSNNKCEVIPESTSAVTNIPHVQMVAIQVSWNIIHQEVNTEPYNPFEERQGEISR IEKEHQVLQDQLQEVYENYEQIKLKGLEETRDLEEKLKRHLEENKISKTELDWFLQDL EREIKKWQQEKKEIQERLKSLKKKIKKVSNASEMYTQKNDGKEKEHELHLDQSLEISN TLTNEKMKIEEYIKKGKEDYEESHQRAVAAEVSVLENWKESEVYKLQIMESQAEAFLK KLGLISRDPAAYPDMESDIRSWELFLSNVTKEIEKAKSQFEEQIKAIKNGSRLSELSK VQISELSFPACNTVHPELLPESSGHDGQGLVTSASDVTGNHAALHRDPSVFSAGDSPG EAPSALLPGPPPGQPEATQLTGPKRAGQAALSERSPVADRKQPVPPGRAARSSQSPKK PFNSIIEHLSVVFPCYNSTELAGFIKKVRSKNKNSLSGLSIDEIVQRVTEHILDEQKK KKPNPGKDKRTYEPSSATPVTRSSQGSPSVVVAPSPKTKGQKAEDVPVRIALGASSCE ICHEVFKSKNVRVLKCGHKYHKGCFKQWLKGQSACPACQGRDLLTEESPSGRGWPSQN QELPSCSSR" 3'UTR 7548..9078 polyA_signal 9056..9061 BASE COUNT 2933 a 1681 c 1965 g 2499 t ORIGIN 1 ctgaactagt tgccagtgat cttgaaacgt gacagtaacc aagagataaa taggtgacaa 61 tgacaggaaa attagatgta gtaaaagaga gtgtttgaga gcagaagcta tggcaactaa 121 agactggatt tgaatccttc ctagcttggt gacatgagca aattacttga tttaagtgag 181 cattttccca tctgtacagt ggagataacg ataattgtgc ctgctaagaa gaattgctgt 241 gaagattagt gaaataatgc atgtaaaaca tttggtacag tatgtgacac atagtacaaa 301 tagtttgcta ggaagattgt tattattctt cacttgtgat attgtgaagt tttcatacag 361 caaattggac atcatgagat ggattgatta aataaataga tttgaacttc aaggactggt 421 agtgttcttg ctttggaaag aagaaacttg gtttatccta ataatagtag gataataatg 481 gtgaagtgat aggtacaagt aatagtgttt atgatgcgct ggtgatgata ggaaaagaaa 541 gccattatat gggcaagagc tagaagtaat aaaatggtgc atttttcagt gatgtttggc 601 ctatgtagct attctctgat aactataaaa atccttatta ttgaagattc ttcaggaaaa 661 aaaaaccctt agtctgaaac tttagcacca atcccccttg ccccccattg aaatacgtat 721 ttttaaaaca tggcttttga taatgtgagg gttttttcct ttttgcgatt tagcagtgct 781 gattgtgtat tgcagtagtt gtgagagcat tagaagcagc agtcgatagg aggatggaag 841 gtctggatgc cgccttgggg agttaggaga ttggcagact taccctgtac cactctagcc 901 ctactccttt gcccaagaca gaaacacact gagatggata ggagaatatg agcagttgat 961 aggaaagttc tcagtggagt caggatttag gttaggccag gagattgaga atataacagt 1021 ttgtgtatga tgaaatggca tatttcacag aatgcagtaa aagcaggtag ggtacaagtg 1081 cagcaacagg aagatgtctt ttcttcattc agcaaacact tatttgagag cttaccatgt 1141 gctaggcaca tacaaagata aataagatgc ccttgatgat cctctattta aaggagacat 1201 gtaaacaggt taacttagag tagagatggt gaatatgtga acctgaggaa aggaagaaat 1261 agattaaatt atctggagag agaggaaaag tcagcagaat ggggacgaga atctttcgga 1321 gctcagtgtt ctgataggag ttatttcctt gggcataggt tccaagtatt tttctaatat 1381 accatagaag ccaggaaaac tttcttctgt tatctcaaat gatttaatta ctgacttgag 1441 tttgtgttgt ctccttagac ttgtgcacca tggacaattt tgctgaggga gatttcactg 1501 tggcggatta tgccttgtta gaagattgcc ctcacgtgga tgattgtgtc tttgctgctg 1561 aatttatgag caatgattat gttcgtgtga ctcagcttta ctgtgatggg gtgggtgtgc 1621 aatataaaga ttatatccaa agtgagagga atttggaatt tgacatctgc agtatatggt 1681 gtagtaaacc aatttctgtc ctgcaagatt attgcgatgc cattaaaata aacatcttct 1741 ggccacttct gtttcaacat caaaacagtt ccgtaatatc acgattgcat ccctgtgtgg 1801 acgccaacaa ttcacgtgct tctgagataa atttgaagaa actacaacat cttgagttga 1861 tggaagatat tgtggatttg gcaaagaaag ttgctaatga ttcattcctt attggaggct 1921 tattgagaat tggttgtaaa atagaaaata aaatcttggc aatggaagaa gctctgaatt 1981 ggataaaata tgcaggcgat gtaacaattc taactaaatt aggatcaatt gacaattgtt 2041 ggcctatgtt aagtattttc tttactgaat acaagtacca cataactaaa attgtaatgg 2101 aagactgcaa tttgcttgaa gaacttaaaa ctcaaagttg tatggattgt atagaggaag 2161 gagaactaat gaaaatgaaa ggaaatgaag agttttccaa agaaagattt gatatagcta 2221 ttatctatta caccagagcc attgaatata gacctgaaaa ctaccttctt tatggtaacc 2281 gagctctttg ttttcttcgt actggacagt ttagaaatgc actcggtgat ggaaagagag 2341 ccactattct gaagaacact tggccaaagg gtcattatcg ttattgtgat gctctttcta 2401 tgctggggga atatgactgg gccctgcaag caaacataaa agctcaaaaa ctctgtaaaa 2461 atgaccctga gggaatcaag gatctaattc agcagcatgt aaagttacaa aaacaaatag 2521 aagacctaca aggtcgaaca gcaaataagg atccaattaa agccttttat gaaaacaggg 2581 cctacacacc taggagttta tcagcaccta tatttactac ttcacttaac tttgtggaga 2641 aggaaagaga tttcagaaaa attaatcacg aaatggccaa cggtggtaat cagaatctaa 2701 aggtggcgga tgaggcgttg aaggtagatg attgtgactg tcatcctgaa ttttcaccac 2761 catcaagtca gcctccaaaa cataaaggaa aacaaaaatc tcgaaacaat gaatcagaaa 2821 agttcagttc tagttcacca ttgactttac cagcagattt gaagaacatc ttggagaaac 2881 agttttctaa atcttccaga gctgcacacc aggattttgc taatataatg aaaatgctga 2941 gaagcttaat tcaagatggc tatatggcct tattggagca gcgttgccgc agcgctgcac 3001 aggcctttac agagttgctg aacggtttag atcctcaaaa aataaagcaa ttgaacctgg 3061 ccatgattaa ctatgttttg gtcgtctatg gacttgccat ttctctcctt ggaataggac 3121 agcctgagga attatctgaa gccgaaaacc agtttaagag gattattgaa cactacccca 3181 gtgagggcct tgattgcttg gcctactgtg gaattggaaa agtgtatttg aaaaaaaaca 3241 gatttctaga agctctcaat cactttgaga aagcaagaac cttgatttat cgtcttcctg 3301 gagtgttaac ttggcccacg agtaatgtga ttattgaaga gtctcagcca caaaaaataa 3361 agatgctgtt agagaaattt gttgaagaat gcaagttccc tccagtgcca gatgccattt 3421 gttgctatca gaagtgccat ggatattcta agatccagat atacataact gatccagact 3481 ttaagggttt tatacgcatc agctgttgcc agtactgtaa aatagaattt cacatgaatt 3541 gctggaagaa gttaaaaact acaaccttta atgataaaat tgacaaggat tttctacaag 3601 gaatatgtct tacccctgac tgtgaaggtg tcatttctaa gattatcatc ttcagcagtg 3661 gtggtgaagt taaatgtgaa tttgaacaca aggtcataaa agaaaaggtt cctccaagac 3721 ctattctgaa acagaaatgt tctagcctag agaaactaag actgaaagaa gacaaaaaat 3781 tgaagagaaa gatccaaaaa aaagaagcaa aaaagttagc acaagaaaga atggaggagg 3841 acttaagaga aagtaatcca cccaaaaatg aagagcagaa agaaactgta gacaatgttc 3901 agcgttgtca gttccttgat gacagaattc tacagtgtat aaagcagtat gctgacaaga 3961 ttaaatccgg catacagaat acagccatgc ttctcaaaga attgctttct tggaaagttt 4021 tgagcacaga agactataca acctgttttt ctagcagaaa ttttctaaat gaagcagtgg 4081 actatgttat tcgccacttg attcaagaaa ataacagagt aaagacaaga atatttctgc 4141 atgttttgag tgagcttaaa gaagtggagc ccaaattagc cgcctggatc caaaaactta 4201 atagctttgg cttagatgcc acaggaactt tcttttctcg ttatggagca tctcttaaac 4261 tgcttgattt tagtatcatg actttcctct ggaatgagaa atatggtcac aaactagact 4321 ctatagaagg aaagcaactt gattatttct ctgagccagc atcattgaag gaagcccgtt 4381 gtttaatatg gctgctagaa gaacacagag acaagttccc agcattgcat agtgctttag 4441 atgaattctt tgatataatg gacagccgct gtactgtgtt aaggaaacaa gatagtggtg 4501 aagcaccgtt tagttcaacc aaggtgaaaa acaaaagcaa gaaaaagaag ccaaaggatt 4561 caaagcctat gttagttggg tctggaacaa cttcagtaac ttcaaataat gagatcatca 4621 cttcaagtga agaccatagc aatcgaaatt cagattctgc aggcccattt gcagtgcctg 4681 accatcttcg gcaagatgta gaagaattcg aagctctcta tgaccaacac agtaacgaat 4741 atgttgtccg caataagaag ctatgggaca tgaacccaaa acaaaaatgt tcaactctat 4801 atgattactt ctctcagttt ttggaggaac atggtccctt ggacatgagt aacaagatgt 4861 tctctgcaga atatgagttt ttcccagaag aaactcgaca gatactagaa aaagcaggag 4921 gtttaaaacc ttttctcttg ggatgccctc gttttgttgt gattgacaac tgtattgcac 4981 tgaagaaggt tgcatcacgg ctcaagaaaa aaaggaagaa gaaaaacatt aaaacaaaag 5041 tagaagaaat ttcaaaagca ggggagtatg tacgagttaa actacaactg aatccagctg 5101 ctagggaatt taaaccagat gtaaagtcta aaccagtgtc agattcatct tcagcaccag 5161 cttttgaaaa tgtgaaaccc aaacctgtgt ctgcaaattc tcccaagcca gcttgtgaag 5221 atgtgaaggc caaaccagta tccgacaatt cttctagaca agtttctgag gatgggcaac 5281 ccaaaggggt ctcttctaat tctcctaaac caggctctga ggatgcaaat tacaagcgag 5341 tctcctgtaa ttcccccaaa ccggttcttg aggatgtgaa accaacttat tgggctcaat 5401 cccatttggt cacaggatac tgtacgtatc ttcctttcca gagatttgat atcacccaga 5461 caccgccagc atacataaac gtgttaccag gtttgcccca gtacaccagc atatatacac 5521 ccttggccag cctttctcct gaatatcagc taccaagatc agtaccagtg gtgccgtctt 5581 ttgtagccaa tgacagagca gataaaaatg ctgctgccta ttttgagggt catcatttga 5641 atgctgagaa tgttgctggt caccagattg cctctgaaac acagatcctt gagggctctt 5701 tgggaatatc tgtaaagtca cactgcagca caggtgatgc tcatacagtc ctgagtgagt 5761 ctaacagaaa tgatgagcac tgtggaaatt ctaacaacaa atgtgaagta attccagaaa 5821 gcaccagtgc agtaacaaac attccacacg tgcagatggt tgccatacag gtatcttgga 5881 acataataca ccaagaagtc aatactgagc catataatcc ttttgaggaa cgacaagggg 5941 aaatttcacg gattgaaaag gagcaccaag tattacaaga ccaacttcaa gaagtgtatg 6001 aaaattatga gcagataaaa cttaagggct tagaagagac cagggacctg gaagagaagt 6061 tgaaaaggca cttagaagaa aacaagatct caaagacgga attagattgg ttccttcaag 6121 atttggaaag agaaattaaa aaatggcaac aggaaaaaaa agaaatccaa gaaagactaa 6181 aatcactgaa gaagaaaatt aaaaaggttt caaatgccag tgaaatgtat acccagaaaa 6241 atgatggaaa ggaaaaggaa catgaattac atctggatca gtcccttgaa atcagcaaca 6301 cacttacaaa tgagaaaatg aaaatagaag agtatataaa gaaagggaaa gaggattatg 6361 aagagagtca tcagagagct gtggctgcag aggtatccgt acttgaaaac tggaaggaga 6421 gtgaagtgta taagctacag atcatggagt cacaagcaga agcctttctg aagaagctgg 6481 ggctgattag ccgtgatcct gcagcatatc ctgacatgga gtctgatata cgttcatggg 6541 aattgtttct ttctaatgtt acaaaagaaa ttgagaaagc aaagtctcag tttgaagaac 6601 aaattaaggc aattaaaaat ggttctcggc tcagtgaact ttctaaagtg cagatttctg 6661 agctttcatt tcctgcctgt aacacggttc atcccgagtt actccctgag tcttcaggcc 6721 acgatggcca agggcttgtg acttctgcaa gcgacgtgac tggaaaccac gcagcacttc 6781 acagggatcc tagtgtgttc tctgctggtg attccccagg ggaggctcct tctgcgctgt 6841 tgccagggcc accccctggt cagcctgaag ccactcagct gacagggcca aaacgggctg 6901 gccaggcagc tctgtcagaa cgaagccctg tggctgatcg gaagcagcct gttcctccag 6961 gacgtgctgc gcgttcaagc cagtctccaa aaaagccgtt caatagtatt attgagcacc 7021 tgtcagtggt attcccatgt tacaacagca ctgagcttgc tggttttatt aaaaaagtgc 7081 gaagcaaaaa caagaactca ctctcaggat tgagtattga tgaaattgtc caaagagtga 7141 cagaacacat tctagatgaa cagaaaaaga aaaagccaaa cccaggaaag gacaagagga 7201 cttatgagcc cagctctgcc acccccgtga ccaggtcctc ccagggctca ccctcggtgg 7261 ttgttgcacc atcacccaaa accaaggggc agaaagcaga agatgtccct gtgaggattg 7321 cactgggtgc aagttcctgt gaaatatgcc acgaggtgtt caaatcaaaa aacgtgcgtg 7381 tgctcaaatg tgggcacaag tatcacaaag ggtgctttaa gcagtggctt aaagggcaga 7441 gcgcttgccc ggcctgccag ggtcgtgatc tcctgacaga agagtcacct tctggaagag 7501 gctggcccag tcagaatcag gagctgcctt cctgctcttc taggtagtca cacttcacta 7561 aagtgtcatc caccagtgtg ttgaatccga agaatgacaa ttttctacca ctggtgtaaa 7621 aaacaaacat ttgaagaccc ttgtgcattg tgtgtcacaa agctaaatac atggaaatcg 7681 ttaatatcgc tgatattaag taatttcccc actctgagtg aatactttga tgattgccaa 7741 cagtggctaa taaaatgacg gctaccacac tcatgggtca ctggggctgc gcagggctct 7801 ttgaggtggg tggcttcttt tggaaagtac tatgaacgtc tcgaagcagt attctagtga 7861 taagaattct taacatagcc aagcgcccca cgtttgttcc ccacgtttgt tccccttttc 7921 tgtttgaaaa acctgttctg gtagctccac aagagagatg atactgactt tttaaatttt 7981 ttacaagagt ctgtattcct gatatgccta tatttttcct caaagattct gcattttaag 8041 gatgggcata agcaaactat attttaataa tttatagtta atgttaaaat attggctgat 8101 ttagaccaaa agattcaaat ctcctctttg tgaaatccca tctgcatttg attttttatt 8161 attttatgtt cccccgttag attgttttaa gtgtttgctt ttcatctttt atagatgtaa 8221 tctgattttc aaaaatcatt aacacttttt aattagtatc gactaagact ttttccccct 8281 ggaatcgagg ctgtgtgtcc gtcatcccag cccccggttg gagcctgctc tttgaactcc 8341 gctgccttcc ttagcagctt ctgtcctctt ctgtgagtca gtcagcgagt gcttgggatc 8401 cgcatccagc cgtgctgagc acacaacagg ctgtgtgtgg aaatggccac caccattctc 8461 cttccccacc ccaccacaaa aagagaagct gtgtctttag acaaccctga ggtatctgtg 8521 ttacaatcgt tctgtgtttg atatttgtgt aaagtatgca tgcagtcttg tactgtgacc 8581 taagaacaaa actgtaactg cattagaaac catgaaaaaa ttagatattg ttttgtgact 8641 tttagacagt ggtaaatata gaaccatgaa ttctggtcac attccatttc tctccaacat 8701 gaaggatcaa aaaatgtttt tcaatgtgtt ctttgttcca ctggaaactt agagtcatga 8761 gtttatgagc tgatttggtc accttcctct gcctttgttc actgtgagtt ctgatgtctt 8821 agtgacttag ttcttagaag ctcacgcctt agtttgaaac agattctcca cggtggtccc 8881 caaaacactg tctgcatatc cataagaatt gagcgctatg ggtgttaacg tgcatgagga 8941 tcagtttgca gcagcaagta caaaaggaga agaggaacat ccgttgaatg agtgtgtttt 9001 gtacataact tcagatactt gtgaacatgc cttatatttg tccaacaact gtcagaataa 9061 agaacattct aaaatgag // LOCUS D84307 1856 bp mRNA PRI 16-JUN-1997 DEFINITION Human mRNA for phosphoethanolamine cytidylyltransferase, complete cds. ACCESSION D84307 NID g1817547 KEYWORDS phosphoethanolamine cytidylyltransferase; CTP. SOURCE Homo sapiens glioblastoma cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1856) AUTHORS Nakashima,A., Hosaka,K. and Nikawa,J. TITLE Direct Submission JOURNAL Submitted (07-APR-1996) to the DDBJ/EMBL/GenBank databases. Jun-Ichi Nikawa, Kyushu Institute of Technology, Biochemical Engineering and Science; 680-4 Kawazu, Iizuka, Fukuoka 820, Japan (E-mail:nikawa@bse.kyutech.ac.jp, Tel:+81-948-29-7822, Fax:+81-948-29-7801) REFERENCE 2 (bases 1 to 1856) AUTHORS Nakashima,A., Hosaka,K. and Nikawa,J. TITLE Cloning of a human cDNA for CTP-phosphoethanolamine cytidylyltransferase by complementation in vivo of a yeast mutant JOURNAL J. Biol. Chem. 272 (14), 9567-9572 (1997) MEDLINE 97238903 FEATURES Location/Qualifiers source 1..1856 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="glioblastoma" CDS 67..1236 /note="CTP" /codon_start=1 /product="phosphoethanolamine cytidylyltransferase" /db_xref="PID:d1012987" /db_xref="PID:g1817548" /translation="MIRNGRGAAGGAEQPGPGGRRAVRVWCDGCYDMVHYGHSNQLRQ ARAMGDYLIVGVHTDEEIAKHKGPPVFTQEERYKMVQAIKWVDEVVPAAPYVTTLETL DKYNCDFCVHGNDITLTVDGRDTYEEVKQAGRYRECKRTQGVSTTDLVGRMLLVTKAH HSSQEMSSEYREYADSFGKCPGGRNPWTGVSQFLQTSQKIIQFASGKEPQPGETVIYV AGAFDLFHIGHVDFLEKVHRLAERPYIIAGLHFDQEVNHYKGKNYPIMNLHERTLSVL ACRYVSEVVIGAPYAVTAELLSHFKVDLVCHGKTEIIPDRDGSDPYQEPKRRGIFRQI DSGSNLTTDLIVQRIITNRLEYEARNQKKEAKELAFLEAARQQAAQPLGERDGDF" polyA_signal 1815..1820 BASE COUNT 386 a 557 c 580 g 333 t ORIGIN 1 attgcgggcg gcggcgttcg gagtcgccgg gagctgccag gctgtccgcg ccgccgctgc 61 ggggccatga tccggaacgg gcgcggggct gcaggcggcg cagagcagcc gggcccgggg 121 ggcaggcgcg ccgtgagggt gtggtgcgat ggctgctatg acatggtgca ttacggccac 181 tccaaccagc tgcgccaggc acgggccatg ggtgactacc tcatcgtagg cgtgcacacc 241 gatgaggaga tcgccaagca caaggggccc ccggtgttca ctcaggagga gagatacaag 301 atggtgcagg ccatcaaatg ggtggacgag gtggtgccag cggctcccta cgtcactaca 361 ctagagaccc tggacaaata caactgtgac ttctgtgttc acggcaatga catcaccctg 421 actgtagatg gccgggacac ctatgaggaa gtaaagcagg ctgggaggta cagagaatgc 481 aagcgcacgc aaggggtgtc caccacagac ctcgtgggcc gcatgctgct ggtaaccaaa 541 gcccatcaca gcagccagga gatgtcctct gagtaccggg agtatgcaga cagttttggc 601 aagtgccctg gtgggcggaa cccctggacc ggggtatccc agttcctgca gacatctcag 661 aagatcatcc agtttgcttc tgggaaggag ccccagccag gggagacagt catctatgtg 721 gctggtgcct tcgacctgtt ccacatcggg catgtggact tcctggagaa ggtgcacagg 781 ctggcagaga ggccctacat catcgcgggc ttacactttg accaggaggt caatcactac 841 aaggggaaga actaccccat catgaatctg catgaacgga ctctgagcgt gctggcctgc 901 cggtacgtgt cagaagtggt gattggagcc ccgtacgcgg tcacagcaga gctcctaagt 961 cacttcaagg tggacctggt gtgtcacggc aagacagaaa ttatccctga cagggatggc 1021 tccgacccat accaggagcc caagagaagg ggcatcttcc gtcagattga cagtggcagc 1081 aacctcacca cagacctcat cgtccagcgg atcatcacca acaggttgga gtatgaggcg 1141 cgaaaccaga agaaggaagc caaggagctg gccttcctgg aggctgccag gcagcaggcg 1201 gcacagcccc tgggggagcg cgatggtgac ttctaacctg gcagaggccc tggccggccc 1261 tccccctgct ctgcttctgc gccttctgcg tttggacata ggactctgca gggccgccct 1321 ctctaactgg cctggctctg gaagggctgg tgaggactct gcctccttgc ctgcctacaa 1381 ggtgcctggt ttgcagcagg ctctccgctc tttccagcaa agctgctcag agagggtgtc 1441 cagcacagtg gagaggccgg aagtgagacg ggcagacggc acctgcagcc tgaaacgcac 1501 cgctcctgcg tgcgccccca cctggtcccc ggatgccccc accacctgga cagaggccac 1561 actgactgcc cacccagctg tggcgggagg tgcagagcag ggggctttag ggagcagtga 1621 ctgcggtcac ccctttagtt ctctgggtgt agaccacacc acctcccact gggcaccccc 1681 caacacggtg tcctgccacc cagcgcctgg ctccaggaaa acacgcttgc cttccttccc 1741 ggcagcttcg ccactctcct tatggactct gttctgtttg tacatggctg acggaaatct 1801 ctttggtaca accgaataaa gcctggtggc agtgctgcgc ggggctccca gccaat // LOCUS D84454 2620 bp mRNA PRI 12-NOV-1996 DEFINITION Human mRNA for UDP-galactose translocator, complete cds. ACCESSION D84454 NID g1526437 KEYWORDS UDP-galactose translocator; UGT. SOURCE Homo sapiens normal fibroblast cell_line:TIG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Hara,T., Yamauchi,M., Takahashi,E., Hoshino,M., Aoki,K., Ayusawa,D. and Kawakita,M. TITLE The UDP-galactose translocator gene is mapped to band Xp11.23-p11.22 containing the Wiskott-Aldrich syndrome locus JOURNAL Somat. Cell Mol. Genet. 19 (6), 571-575 (1993) MEDLINE 94174379 REFERENCE 2 (bases 1 to 2620) AUTHORS Miura,N., Ishida,N., Hoshino,M., Yamauchi,M., Hara,T., Ayusawa,D. and Kawakita,M. TITLE Human UDP-galactose translocator: molecular cloning of a complementary DNA that complements the genetic defect of a mutant cell line deficient in UDP-galactose translocator JOURNAL J. Biochem. 120 (2), 236-241 (1996) MEDLINE 97044734 REFERENCE 3 (sites) AUTHORS Miura,N., Ishida,N., Mamauchi,M., Hara,T., Ayusawa,D., Hoshino,M. and Kawakita,M. TITLE Molecular cloning and expression of human UDP-galactose translocator JOURNAL Unpublished (1996) REFERENCE 4 (bases 1 to 2620) AUTHORS Ishida,N. TITLE Direct Submission JOURNAL Submitted (18-APR-1996) to the DDBJ/EMBL/GenBank databases. Nobuhiro Ishida, The Tokyo Metropolitan Institute of Medical Science, The Physiological Chemistry; 18-22, Honkomagome 3-chome, Bunkyo-ku, Tokyo 113, Japan (E-mail:ishidan@rinshoken.or.jp, Tel:03-3823-2101, Fax:03-3823-2965) FEATURES Location/Qualifiers source 1..2620 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="TIG-1" /cell_type="normal fibroblast" /chromosome="X" /map="Xp11.22-p11.23" CDS 324..1505 /codon_start=1 /product="UDP-galactose translocator" /db_xref="PID:d1013353" /db_xref="PID:g1526438" /translation="MAAVGAGGSTAAPGPGAVSAGALEPGTASAAHRRLKYISLAVLV VQNASLILSIRYARTLPGDRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLH EAVLVQYVDTLKLAVPSLIYTLQNNLQYVAISNLPAATFQVTYQLKILTTALFSVLML NRSLSRLQWASLLLLFTGVAIVQAQQAGGGGPRPLDQNPGAGLAAVVASCLSSGFAGV YFEKILKGSSGSVWLRNLQLGLFGTALGLVGLWWAEGTAVATRGFFFGYTPAVWGVVL NQAFGGLLVAVVVKYADNILKGFATSLSIVLSTVASIRLFGFHVDPLFALGAGLVIGA VYLYSLPRGAAKAIASASASASGPCVHQQPPGQPPPPQLSSHRGDLITEPFLPKSVLV K" BASE COUNT 466 a 727 c 761 g 666 t ORIGIN 1 catcggggga tgatctggaa agcgcgatca gtgaagcgga cgaacggcag gataaggcgg 61 gtctagtgac aggaatgggc cgatgaagct ctgtagggat ggtgggtagg ccgatcgggc 121 cgtgtccccg gcctcccgat ccgacggaat ttggaaatcc cggggctatt catacattga 181 gcttttagga gcggaggaga aaagccacca ccctgacgat cccggctctc gctccacctt 241 cactcaggtg gcccggcagc ggaagtgacg aacgcggaag tggtttttct gttgccgagg 301 ggacgggccg ggcagatgcc aacatggcag cggttggggc tggtggttcc accgcggcgc 361 ccgggccagg ggcggtttcc gcgggtgcat tggagccggg gaccgccagt gcggctcaca 421 ggcgcctgaa gtacatatcc ctagctgtgc tggtggtcca gaatgcctcc ctcatcctca 481 gcatccgcta cgcccgcacg ttgccagggg accgcttctt tgccaccact gctgtggtca 541 tggcggaagt gctcaaaggt ctcacctgcc tgctgctgct cttcgcacag aagaggggta 601 acgtgaagca cctggttctc ttcctccatg aggctgtcct ggtgcagtat gtggacacgc 661 tcaagctcgc agtgccctct ctcatctaca ccttgcagaa taacctccag tatgttgcca 721 tctctaacct accagctgcc actttccagg tgacatacca gctgaagatc ctgaccacag 781 cgctgttctc cgtgctcatg ctgaatcgca gcctttcccg gctgcagtgg gcctccctgc 841 tgctcctctt cactggcgtc gccattgtcc aggcacagca agccggtggg ggaggcccac 901 ggccactgga tcagaaccct ggggcaggcc tggcagccgt cgtggcctcc tgtctctcct 961 ccggcttcgc aggtgtctac tttgagaaga tcctcaaagg cagctcaggc tccgtgtggc 1021 tgcgcaacct gcaactgggc ctcttcggca cagcactggg cctggtgggg ctctggtggg 1081 ctgagggtac cgccgtggcc acccgtggtt tcttttttgg gtacacacct gctgtctggg 1141 gcgtggtgct caaccaggcc ttcggcgggc tactggtggc tgtggttgtc aagtacgctg 1201 acaatatcct caagggcttt gccacctccc tgtccattgt gctgtccact gttgcctcca 1261 ttcgcctctt tggcttccac gtggacccat tatttgccct tggcgctgga ctcgtcattg 1321 gtgctgtcta cctctacagc cttccccgag gtgcagccaa agccatagcc tctgcctctg 1381 cctccgcctc cgggccctgc gttcaccagc agcctcccgg gcagccacca ccaccgcagc 1441 tgtcttccca ccgtggagac ctcatcacgg agccctttct gccaaagtca gtgctggtga 1501 agtgagggct ggcagcaatg gggggacaca agggaggggg actggggtgg agggtgttgg 1561 gcatctgcag gacccaagtc gccaccctcc ggggcctggc tcctctgggt ttgggagatg 1621 gtcttttctc ccaggtcact gagacttctg gaggggtgtg ggactagagc tgggtgtcac 1681 gtgaaccctt cctggtaggg tgaccccctt cccctggagg gggttttaga gctgccgcct 1741 ctgctccctc taacctcttt ggaggcaggg ttgggggtat tgtcattcaa ggcctttttt 1801 ttgtctgctc cctccccgac cctgtgccct cttctggagg tttctcgtct gggagagtcc 1861 ctcccagcag tccctccacc tccataagga cacactggac aaaactcccg cagctcttca 1921 ggaatgaccg atgcctacct gtggggttca gttgcccata gtttgaggcc ttctctcctc 1981 ccttaccacc gctctggatc atgttactag ttccgtcttt tgtgtggcct tgggccagct 2041 tccttgatac cttgaagatg ggcttcttgt gagtccccag ggagaaaggg acaagagcta 2101 agatttttgc atcagccctt ctggcagaag gtgtggtagg ggccatttgt tttttttagt 2161 ggacttggga tttgtggtgt aatcatatca ttaatgatcc agggtgtggg aaaaatggag 2221 gtccttgaag tggctgaatc tcattgtatt taagacactg tcagttgcca gatgtaggct 2281 tatttttgga gatgtctagg agaggaaaaa gctaccaatc atactcttga tatccgtctg 2341 gctgtgtgag gcacccctac ctcatggggg tgtcttggga ttgatgaact gtggaacctg 2401 cctcctgcgc tccccaaagc ttattaaccc cttaactgta tcggggcggg gtgtgtgtgt 2461 gcatggaaga tgcctgggct gtctttgcta tatgtaaata gagccattgg atctttattt 2521 ttgattaatt tgttctgatt ttttggtttg ttttttaagg aactgtaatg aacaaatgtc 2581 aggatatcca atgccaaata aagatgttgt atttatttag // LOCUS D84476 4525 bp mRNA PRI 27-JAN-1997 DEFINITION Human mRNA for ASK1, complete cds. ACCESSION D84476 NID g1805499 KEYWORDS ASK1. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ichijo,H., Nishida,E., Irie,K., ten Dijke,P., Saitoh,M., Moriguchi,T., Takagi,M., Matsumoto,K., Miyazono,K. and Gotoh,Y. TITLE Induction of apoptosis by ASK1, a mammalian MAPKKK that activates SAPK/JNK and p38 signaling pathways JOURNAL Science 275 (5296), 90-94 (1997) MEDLINE 97130104 REFERENCE 2 (bases 1 to 4525) AUTHORS Ichijo,H., Nishida,E., Irie,K., ten Dijke,P., Saitoh,M., Moriguchi,T., Takagi,M., Matsumoto,K., Miyazono,K. and Gotoh,Y. TITLE Induction of apoptosis by a novel mammalian MAPKKK that activates SAPK/JNK and p38 signaling pathway JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 4525) AUTHORS Ichijo,H. TITLE Direct Submission JOURNAL Submitted (19-APR-1996) to the DDBJ/EMBL/GenBank databases. Hidenori Ichijo, The Cancer Institute, Department of Biochemistry; 1-37-1 Kami-Ikebukuro, Toshima-ku, Tokyo 170, Japan (Tel:03-3918-0111, Fax:03-3918-0342) FEATURES Location/Qualifiers source 1..4525 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 268..4395 /codon_start=1 /product="ASK1" /db_xref="PID:d1013364" /db_xref="PID:g1805500" /translation="MSTEADEGITFSVPPFAPSGFCTIPEGGICRRGGAAAVGEGEEH QLPPPPPGSFWNVESAAAPGIGCPAATSSSSATRGRGSSVGGGSRRTTVAYVINEASQ GQLVVAESEALQSLREACETVGATLETLHFGKLDFGETTVLDRFYNADIAVVEMSDAF RQPSLFYHLGVRESFSMANNIILYCDTNSDSLQSLKEIICQKNTMCTGNYTFVPYMIT PHNKVYCCDSSFMKGLTELMQPNFELLLGPICLPLVDRFIQLLKVAQASSSQYFRESI LNDIRKARNLYTGKELAAELARIRQRVDNIEVLTADIVINLLLSYRDIQDYDSIVKLV ETLEKLPTFDLASHHHVKFHYAFALNRRNLPGDRAKALDIMIPMVQSEGQVASDMYCL VGRIYKDMFLDSNFTDTESRDHGASWFKKAFESEPTLQSGINYAVLLLAAGHQFESSF ELRKVGVKLSSLLGKKGNLEKLQSYWEVGFFLGASVLANDHMRVIQASEKLFKLKTPA WYLKSIVETILIYKHFVKLTTEQPVAKQELVDFWMDFLVEATKTDVTVVRFPVLILEP TKIYQPSYLSINNEVEEKTISIWHVLPDDKKGIHEWNFSASSVRGVSISKFEERCCFL YVLHNSDDFQIYFCTELHCKKFFEMVNTITEEKGRSTEEGDCESDLLEYDYEYDENGD RVVLGKGTYGIVYAGRDLSNQVRIAIKEIPERDSRYSQPLHEEIALHKHLKHKNIVQY LGSFSENGFIKIFMEQVPGGSLYALLRSKWGPLKDNEQTIGFYTKQILEGLKYLHDNQ IVHRDIKGDNVLINTYSGVLKISDFGTSKRLAGINPCTETFTGTLQYMAPEIIDKGPR GYGKAADIWSLGCTIIEMATGKPPFYELGEPQAAMFKVGMFKVHPEIPESMSAEAKAF ILKCFEPDPDKRACANDLLVDEFLKVSSKKKKTQPKLSALSAGSNAEYLRSISLPVPV LVEDTSSSSEYGSVSPDTELKVDPFSFKTRAKSCGERDVKGIRTLFLGIPDENFEDHS APPSPEEKDSGFFMLRKDSERRATLHRILTEDQDKIVRNLMESLAQGAEEPKLKWEHI TTLIASLREFVRSTDRKIIATTLSKLKLELDFDSHGISQVQVVLFGFQDAVNKVLRNH NIKPHWMFALDSIIRKAVQTAITILVPELRPHFSLASESDTADQEDLDVEDDHEEQPS NQTVRRPQAVIEDAVATSGVSTLSSTVSHDSQSAHRSLNVQLGRMKIETNRLLEELVR KEKELQALLHRAIEEKDQEIKHLKLKSQPIEIPELPVFHLNSSGTNIEDSELTDWLRV NGADEDTISRFLAEDYTLLDVLYYVTRDDLKCLRLRGGMLCTLWKAIIDFRNKQT" polyA_site 4525 /note="8 A nucleotides" BASE COUNT 1280 a 1001 c 1152 g 1092 t ORIGIN 1 acccggcttc cccacccctt gtactctaaa ctctgcagag ggcgagcgtg cggccacgga 61 ggcgccgagg aggagcgagc gccgccgggc agcggcgtgc cctcggggga gagggcgccg 121 gagaggaggc ggcggcgcgg cggcgagggc gcggcgcgcg atggcagctg cttagcccgg 181 cgggcgcgga gcagccccga gctgtggctg gccaggcggt gcggctgggc gggggacgcc 241 gccgccgttg ctgcccggcc cggagagatg agcacggagg cggacgaggg catcactttc 301 tctgtgccac ccttcgcccc ctcgggcttc tgcaccatcc ccgagggcgg catctgcagg 361 aggggaggag cggcggcggt gggcgagggc gaggagcacc agctgccacc gccgccgccg 421 ggcagtttct ggaacgtgga gagcgccgct gcccctggca tcggttgtcc ggcggccacc 481 tcctcgagca gtgccacccg aggccggggc agctctgttg gcgggggcag ccgacggacc 541 acggtggcat atgtgatcaa cgaagcgagc caagggcaac tggtggtggc cgagagcgag 601 gccctgcaga gcttgcggga ggcgtgcgag acagtgggcg ccaccctgga aaccctgcat 661 tttgggaaac tcgactttgg agaaaccacc gtgctggacc gcttttacaa tgcagatatt 721 gcggtggtgg agatgagcga tgccttccgg cagccgtcct tgttttacca ccttggggtg 781 agagaaagtt tcagcatggc caacaacatc atcctctact gcgatactaa ctcggactct 841 ctgcagtcac tgaaggaaat catttgccag aagaatacta tgtgcactgg gaactacacc 901 tttgttcctt acatgataac tccacataac aaagtctact gctgtgacag cagcttcatg 961 aaggggttga cagagctcat gcaaccgaac ttcgagctgc ttcttggacc catctgctta 1021 cctcttgtgg atcgttttat tcaacttttg aaggtggcac aagcaagttc tagccagtac 1081 ttccgggaat ctatactcaa tgacatcagg aaagctcgta atttatacac tggtaaagaa 1141 ttggcagctg agttggcaag aattcggcag cgagtagata atatcgaagt cttgacagca 1201 gatattgtca taaatctgtt actttcctac agagatatcc aggactatga ttctattgtg 1261 aagctggtag agactttaga aaaactgcca acctttgatt tggcctccca tcaccatgtg 1321 aagtttcatt atgcatttgc actgaatagg agaaatctcc ctggtgacag agcaaaagct 1381 cttgatatta tgattcccat ggtgcaaagc gaaggacaag ttgcttcaga tatgtattgc 1441 ctagttggtc gaatctacaa agatatgttt ttggactcta atttcacgga cactgaaagc 1501 agagaccatg gagcttcttg gttcaaaaag gcatttgaat ctgagccaac actacagtca 1561 ggaattaatt atgcggtcct cctcctggca gctggacacc agtttgaatc ttcctttgag 1621 ctccggaaag ttggggtgaa gctaagtagt cttcttggta aaaagggaaa cttggaaaaa 1681 ctccagagct actgggaagt tggatttttt ctgggggcca gcgtcctagc caatgaccac 1741 atgagagtca ttcaagcatc tgaaaagctt tttaaactga agacaccagc atggtacctc 1801 aagtctattg tagagacaat tttgatatat aagcattttg tgaaactgac cacagaacag 1861 cctgtggcca agcaagaact tgtggacttt tggatggatt tcctggtcga ggccacaaag 1921 acagatgtta ctgtggttag gtttccagta ttaatattag aaccaaccaa aatctatcaa 1981 ccttcttatt tgtctatcaa caatgaagtt gaggaaaaga caatctctat ttggcacgtg 2041 cttcctgatg acaagaaagg tatacatgag tggaatttta gtgcctcttc tgtcagggga 2101 gtgagtattt ctaaatttga agaaagatgc tgctttcttt atgtgcttca caattctgat 2161 gatttccaaa tctatttctg tacagaactt cattgtaaaa agttttttga gatggtgaac 2221 accattaccg aagagaaggg gagaagcaca gaggaaggag actgtgaaag tgacttgctg 2281 gagtatgact atgaatatga tgaaaatggt gacagagtcg ttttaggaaa aggcacttat 2341 gggatagtct acgcaggtcg ggacttgagc aaccaagtca gaattgctat taaggaaatc 2401 ccagagagag acagcagata ctctcagccc ctgcatgaag aaatagcatt gcataaacac 2461 ctgaagcaca aaaatattgt ccagtatctg ggctctttca gtgagaatgg tttcattaaa 2521 atcttcatgg agcaggtccc tggaggaagt ctttatgctc tccttcgttc caaatggggt 2581 ccattaaaag acaatgagca aacaattggc ttttatacaa agcaaatact ggaaggatta 2641 aaatatctcc atgacaatca gatagttcac cgggacataa agggtgacaa tgtgttgatt 2701 aatacctaca gtggtgttct caagatctct gacttcggaa catcaaagag gcttgctggc 2761 ataaacccct gtactgaaac ttttactggt accctccagt atatggcacc agaaataata 2821 gataaaggac caagaggcta cggaaaagca gcagacatct ggtctctggg ctgtacaatc 2881 attgaaatgg ccacaggaaa acccccattt tatgaactgg gagaaccaca agcagctatg 2941 ttcaaggtgg gaatgtttaa agtccaccct gagatcccag agtccatgtc tgcagaggcc 3001 aaggcattca tactgaaatg ttttgaacca gatcctgaca agagagcctg tgctaacgac 3061 ttgcttgttg atgagttttt aaaagtttca agcaaaaaga aaaagacaca acctaagctt 3121 tcagctcttt cagctggatc aaatgcagaa tatctcagga gtatatcctt gccggtacct 3181 gtgctggtgg aggacaccag cagcagcagt gagtacggct cagtttcacc cgacacggag 3241 ttgaaagtgg accccttctc tttcaaaaca agagccaagt cctgcggaga aagagatgtc 3301 aagggaattc ggacactctt tttgggcatt ccagatgaga attttgaaga tcacagtgct 3361 cctccttccc ctgaagaaaa agattctgga ttcttcatgc tgaggaagga cagtgagagg 3421 cgagctaccc ttcacaggat cctgacggaa gaccaagaca aaattgtgag aaacctaatg 3481 gaatctttag ctcagggggc tgaagaaccg aaactaaaat gggaacacat cacaaccctc 3541 attgcaagcc tcagagaatt tgtgagatcc actgaccgaa aaatcatagc caccacactg 3601 tcaaagctga aactggagct ggacttcgac agccatggca ttagccaagt ccaggtggta 3661 ctctttggtt ttcaagatgc tgtcaataaa gttcttcgga atcataacat caagccgcac 3721 tggatgtttg ccttagacag tatcattcgg aaggcggtac agacagccat taccatcctg 3781 gttccagaac taaggccaca tttcagcctt gcatctgaga gtgatactgc tgatcaagaa 3841 gacttggatg tagaagatga ccatgaggaa cagccttcaa atcaaactgt ccgaagacct 3901 caggctgtca ttgaagatgc tgtggctacc tcaggcgtga gcacgctcag ttctactgtg 3961 tctcatgatt cccagagtgc tcaccggtca ctgaatgtac agcttggaag gatgaaaata 4021 gaaaccaata gattactgga agaattggtt cggaaagaga aagaattaca agcactcctt 4081 catcgagcta ttgaagaaaa agaccaagaa attaaacacc tgaagcttaa gtcccaaccc 4141 atagaaattc ctgaattgcc tgtatttcat ctaaattctt ctggcacaaa tattgaagat 4201 tctgaactta ccgactggct gagagtgaat ggagctgatg aagacactat aagccggttt 4261 ttggctgaag attatacact attggatgtt ctctactatg ttacacgtga tgacttaaaa 4321 tgcttgagac taaggggagg gatgctgtgc acactgtgga aggctatcat tgactttcga 4381 aacaaacaga cttgactgtt gctcaatcta atcttcgatg gaaattctaa aaattaatac 4441 agagctgatc ttcttggggg tgggaaaatc gaagggagag gagaaaggcg ctgcacttta 4501 aatccagtat ttgtttactc atgtt // LOCUS D84488 2443 bp mRNA PRI 09-SEP-1997 DEFINITION Homo sapiens mRNA for small GTP-binding protein, complete cds. ACCESSION D84488 NID g2388543 KEYWORDS small GTP-binding protein; RAB7L1. SOURCE Homo sapiens placenta cDNA to mRNA, clone:502C07. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Shimizu,F., Katagiri,T., Suzuki,M., Watanabe,T.K., Okuno,S., Kuga,Y., Nagata,M., Fujiwara,T., Nakamura,Y. and Takahashi,E. TITLE Cloning and chromosome assignment to 1q32 of a human cDNA (RAB7L1) encoding a small GTP-binding protein, a member of the RAS superfamily JOURNAL Cytogenet. Cell Genet. 77 (3-4), 261-263 (1997) MEDLINE 97430832 REFERENCE 2 (bases 1 to 2443) AUTHORS Shimizu,F. TITLE Direct Submission JOURNAL Submitted (22-APR-1996) to the DDBJ/EMBL/GenBank databases. Fumio Shimizu, Otsuka Pharmaceutical Co. Ltd., Otska GEN Research; Kawauchi-cho 463-10, Tokushima, Tokushima 771-01, Japan (E-mail:shimizu@otsuka.genome.ad.jp, Tel:0886-65-2888, Fax:0886-37-1035) FEATURES Location/Qualifiers source 1..2443 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /clone="502C07" /map="1q32" /tissue_type="placenta" gene 41..652 /gene="RAB7L1" CDS 41..652 /gene="RAB7L1" /codon_start=1 /product="small GTP-binding protein" /db_xref="PID:d1023020" /db_xref="PID:g2388544" /translation="MGSRDHLFKVLVVGDAAVGKTSLVQRYSQDSFSKHYKSTVGVDF ALKVLQWSDYEIVRLQLWDIAGQERFTSMTRLYYRDASACVIMFDVTNATTFSNSQRW KQDLDSKLTLPNGEPVPCLLLANKCDLSPWAVSRDQIDRFSKENGFTGWTETSVKENK NINEAMRVLIEKMMRNSTEDIMSLSTQGDYINLQTKSSSWSCC" BASE COUNT 657 a 538 c 588 g 660 t ORIGIN 1 ccacacttcc cgcctcccta aaacgcacac cccgctagcc atgggcagcc gcgaccacct 61 gttcaaagtg ctggtggtgg gggacgccgc agtgggcaag acgtcgctgg tgcagcgata 121 ttcccaggac agcttcagca aacactacaa gtccacggtg ggagtggatt ttgctctgaa 181 ggttctccag tggtctgact acgagatagt gcggcttcag ctgtgggata ttgcagggca 241 ggagcgcttc acctctatga cacgattgta ttatcgggat gcctctgcct gtgttattat 301 gtttgacgtt accaatgcca ctaccttcag caacagccag aggtggaaac aggacctaga 361 cagcaagctc acactaccca atggagagcc ggtgccctgc ctgctcttgg ccaacaagtg 421 tgatctgtcc ccttgggcag tgagccggga ccagattgac cggttcagta aagagaacgg 481 tttcacaggt tggacagaaa catcagtcaa ggagaacaaa aatattaatg aggctatgag 541 agtcctcatt gaaaagatga tgagaaattc cacagaagat atcatgtctt tgtccaccca 601 aggggactac atcaatctac aaaccaagtc ctccagctgg tcctgctgct agtagtgttt 661 ggcttatttt ccatcccagt tctgggaggt cttttaagtc tcttcccttt ggttgcccac 721 ctgaccattt tattaagtac atttgaattg tctcctgact actgtccagt aaggaggccc 781 attgtcactt agaaaagaca cctggaaccc atgtgcattt ctgcatctcc tggattagcc 841 tttcacatgt tgctgactca cattagtgcc agttagtgcc ttcggtgtaa gatcttctca 901 tcagccctca atttgtgatc cggaattttg tgagaaggat tagaaatcag cacctgcgtt 961 ttagagatca taattctcac ctacttctga gcttattttt ccatttgata ttcattgata 1021 tcatgacttc caattgagag gaaaatgaga tcaaatgtca tttcccaaat ttcttgtagg 1081 ccgttgtttc agattctttc tgtcttggaa tgtaaacatc tgattctgga atgcagaagg 1141 aggggtctgg gcatctgtgg atttttggct actagaagtg tcccagaagt cactgtattt 1201 ttgaaacttc taacgtcata attaagtttc tcttgtcttg gcatcaagaa tagtcaagtt 1261 ttttggccgg gcatggtggc tcatgcctgt aatcccagca cttggggagg ccaaggcagg 1321 cggatcacat gaggccagga attcgagacc aacctggtca gcatggcaaa accccgtctc 1381 tactaaaagt acaaaaatta gccaggcgtg atggcacgtg tctgtaatcc cagctactct 1441 ggagactgag gtgggagaat cgcttgagac tgggaggcag aggttgcagt gaaccgagat 1501 catgccaccg cacttcagcc tgggtgacag agaaggactc cgtctcaaaa aaaaaagaaa 1561 aaagaatagt catttttaaa ctacctatct catgcaatga aagcattttc ttccacaaag 1621 agcttaatcc tcatgatagg attgcctagt gtctcccatt tgcaggtttc tgggttgatg 1681 tcttaatgca taatactgca agtgacatca gctggctgtg atgcttcgaa ataggtctgc 1741 tcctcacagc tttgggaatc tgaatggaag aagaaaagag agaagttaac aacctccact 1801 ggggcaactt tgtgaacatg taggcactta gtcataggaa acatattatg tgcaggtcct 1861 agcctggggt aggaaagtag atagacagaa aatcattagg taatttaagt actaaattgg 1921 gcagggcttt ttagtatcaa atcactacta gaccgtttaa tttgttaaat tatctctagg 1981 atggtgattt ataacctacc caaagttatc gatattctta ctaaactctg aggcctgaag 2041 ttctgtgata gaccttaaat aagtgtccta agtcagtggt tcccaaatct ggctggtcgg 2101 gaatacctgg gaagtttgtt aaaatttttt aaaaatgttt taagattttt gggtcctgag 2161 ccaggcgtgg tggctcacac ctgtaatccc agcactttgg gaggctgagg caggtggatc 2221 gcctgaggtc aggagttcaa gatcaacctg gccaacatac tgaaaccccg tctctactaa 2281 aaataagaaa aattagctgg gcgtggtggc gggcacctgt aatcccagct acttgggagg 2341 ctgaggcagg agaatcactt gaacctggga gttagaggtt gcagtgagct gagatcacac 2401 cattgcgctt cagcctgggc aacaagagtg aaactccatc tcc // LOCUS D85131 1738 bp mRNA PRI 09-DEC-1996 DEFINITION Human mRNA for Myc-associated zinc-finger protein of human islet, complete cds. ACCESSION D85131 NID g1752741 KEYWORDS MAZi; Myc-associated zinc-finger protein of human islet. SOURCE Homo sapiens human pancreatic islets cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1738) AUTHORS Tsutsui,H., Sakatsume,O., Itakura,K. and Yokoyama,K.K. TITLE Members of the MAZ family: a novel cDNA clone for MAZ from human pancreatic islet cells JOURNAL Biochem. Biophys. Res. Commun. 226 (3), 801-809 (1996) MEDLINE 96428591 REFERENCE 2 (bases 1 to 1738) AUTHORS Tsutsui,H. TITLE Direct Submission JOURNAL Submitted (09-MAY-1996) to the DDBJ/EMBL/GenBank databases. Hatsumi Tsutsui, RIKEN(The Institute of Physical and Chemical Research), Tsukuba Life Science Center; 3-1-1, Koyadai, Tsukuba, Ibaraki 305, Japan (E-mail:tsutsui@rtc.riken.go.jp, Tel:0298-36-3612, Fax:0298-36-9120) FEATURES Location/Qualifiers source 1..1738 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="human pancreatic islets" gene 92..1585 /gene="MAZi" CDS 92..1585 /gene="MAZi" /codon_start=1 /product="Myc-associated zinc-finger protein of human islet" /db_xref="PID:d1013410" /db_xref="PID:g1752742" /translation="MRSRRGPARPAAPRPRWSLGPRCAEAMFPVFPCTLLAPPFPVLG LDSRGVGGLMNSFPPPQGHAQNPLQVGAELQSRFFASQGCAQSPFQAAPAPPPTPQAP AAEPLQVDLLPVLAAAQESAAAAAAAAAAAAAVAAAPPAPAAASTVDTAALKQPPAPP PPPPPVSAPAAEAAPPASAATIAAAAATAVVAPTSTVAVAPVASALEKKTKSKGPYIC ALCAKEFKNGYNLRRHEAIHTGAKAGRVPSGAMKMPTMVPLSLLSVPQLSGAGGGGGE AGAGGGAAVAAGGVVTTTASGKRIRKNHACEMCGKAFRDVYHLNRHKLSHSDEKPYQC PVCQQRFKRKDRMSYHVRSHDGAVHKPYNCSHCGKSFSRPDHLNSHVRQVHSTERPFK CEKCEAAFATKDRLRAHTVRHEEKVPCHVCGKMLSSAYISDHMKVHSQGPHHVCELCN KGTGEVCPMAAAAAAAAAVAAPPTAVGSLSGAEGVPVSSQPLPSQPW" BASE COUNT 279 a 641 c 551 g 267 t ORIGIN 1 gaattccggg ggttccggcg ctccgcggcc ccaagcgccc tcctttcctc cctccgccgg 61 ccggggttgc gggcgcgggg cgccgcgggc catgcgatct cggcgcggcc cagcccggcc 121 ggcggcgccc cgcccccgct ggagcctggg gccccgctgc gccgaggcca tgttcccggt 181 gtttccttgc acgctgctgg cccccccctt ccccgtgctg ggcctggact cccggggggt 241 tggcggcctc atgaactcct tcccgccacc tcagggtcac gcccagaacc ccctgcaggt 301 cggggctgag ctccagtccc gcttctttgc ctcccagggc tgcgcccaga gtccattcca 361 ggccgcgccg gcgcccccgc ccacgcccca ggccccggcg gccgagcccc tccaggtgga 421 cttgctcccg gtgctcgccg ccgcccagga gtccgccgcg gctgctgcgg ccgctgccgc 481 cgctgctgcc gccgtcgctg ccgcgccccc ggcccctgcc gccgcctcta cggtggacac 541 agcggccctg aagcagcctc cggcgccccc tccgccaccc ccgccagtgt cggcgcccgc 601 ggccgaggcc gcgccccccg cctccgccgc cactatcgcc gcggcggcgg ccaccgccgt 661 cgtagcccca acctcgacgg tcgccgtggc cccggtcgcg tctgccttgg agaagaagac 721 aaagagcaag gggccctaca tctgcgctct gtgcgccaag gagttcaaga acggctacaa 781 tctccggagg cacgaagcca tccacacggg agccaaggcc ggccgggtcc cctcgggtgc 841 tatgaagatg ccgaccatgg tgcccctgag cctcctgagc gtgccccagc tgagcggagc 901 cggcggggga gggggagagg cgggtgccgg cggcggcgct gcagtggccg ccggtggcgt 961 ggtgaccacg accgcctcgg ggaagcgcat ccggaagaac catgcctgcg agatgtgtgg 1021 caaggccttc cgcgacgtct accacctgaa ccgacacaag ctgtcgcact cggacgagaa 1081 gccctaccag tgcccggtgt gccagcagcg cttcaagcgc aaggaccgca tgagctacca 1141 cgtgcgctca catgacggcg ctgtgcacaa gccctacaac tgctcccact gtggcaagag 1201 cttctcccgg ccggatcacc tcaacagtca cgtcagacaa gtgcactcaa cagaacggcc 1261 cttcaaatgt gagaaatgtg aggcagcttt cgccacgaag gatcggctgc gggcgcacac 1321 agtacgacac gaggagaaag tgccatgtca cgtgtgtggc aagatgctga gctcggctta 1381 tatttcggac cacatgaagg tgcacagcca gggtcctcac catgtctgtg agctctgcaa 1441 caaaggtact ggtgaggttt gtccaatggc ggcggcagcg gcagcagcgg cagcagtagc 1501 agcccctccc acagctgtgg gctccctctc gggggcggag ggggtgcctg tgagctctca 1561 gccacttccc tcccaaccct ggtgagctcc aagttggttg cgggggagag gggagaatgg 1621 agtagagtcc cttggtacaa gctcctctcc cccctctttt cccaccaact cctatttccc 1681 taccaaccaa ggagcctcca gaaggaaagg aggaagaaat gttttcttag gggaattc // LOCUS D85181 2067 bp mRNA PRI 25-MAR-1997 DEFINITION Human mRNA for fungal sterol-C5-desaturase homolog, complete cds. ACCESSION D85181 NID g1906795 KEYWORDS fungal sterol-C5-desaturase homolog. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Matsushima,M., Inazawa,J., Takahashi,E., Suzumori,K. and Nakamura,Y. TITLE Molecular cloning and mapping of a human cDNA (SC5DL) encoding a protein homologous to fungal sterol-C5-desaturase JOURNAL Cytogenet. Cell Genet. 74 (4), 252-254 (1996) MEDLINE 97130614 REFERENCE 2 (bases 1 to 2067) AUTHORS Matsushima,M., Inazawa,J., Takahashi,E., Suzumori,K. and Nakamura,Y. TITLE Molecular Cloning and Mapping of a Human cDNA Encoding a Protein Homologous to Fungal Sterol-C5-Desaturase JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 2067) AUTHORS Nakamura,Y. TITLE Direct Submission JOURNAL Submitted (10-MAY-1996) to the DDBJ/EMBL/GenBank databases. Yusuke Nakamura, Laboratory of Molecular Medicine, Institute of Medical Science, University of Tokyo; 4-6-1 Shirokanedai,, Minato-ku, Tokyo 108, Japan (E-mail:yusuke@ims.u-tokyo.ac.jp, Tel:81-3-5449-5372, Fax:81-3-5449-5433) FEATURES Location/Qualifiers source 1..2067 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11q23.3" CDS 261..971 /codon_start=1 /product="fungal sterol-C5-desaturase homolog" /db_xref="PID:d1019713" /db_xref="PID:g1906796" /translation="MDLVLRVADYYFFTPYVYPATWPEDDIFRQAISLLIVTNVGAYI LYFFCATLSYYFVFDHALMKHPQFLKNQVRREIKFTVQALPWISILTVALFLLEIRGY SKLHDDLGEFPYGLFELVVSIISFLFFTDMFIYWIHRGLHHRLVYKRLHKPHHIWKIP TPFASHAFHPIDGFLQSLPYHIYPFIFPLHKVVYLSLYILVNIWTISIHDGDFRVRMK NYSMESLQRLNRLLPSYS" BASE COUNT 641 a 367 c 365 g 694 t ORIGIN 1 cgcaaagtta aacagctaaa agaagtaaaa taagaaggca atgcttgtgg aatgtacagt 61 gcatattggc ggcgcacgcc tcattacgat tcgcctgctt gcttctcctg ttcaatcgtt 121 tctttggaag gcagtggatt tttctcttgc gtctctgtct tcttcagttt cgacttatcg 181 aatttctcga tctcagccat atcgagtttg cagagcagtg gcgtgcggag cggcggcgga 241 ccacctccag gggctaagtg atggatcttg tactccgtgt tgcagattac tattttttta 301 caccatacgt gtatccagcc acatggccag aagatgacat cttccgacaa gctattagtc 361 ttctgattgt aacaaatgtt ggtgcttaca tcctttattt cttctgtgca acactgagct 421 attattttgt cttcgatcat gcattaatga aacatccaca atttttaaag aatcaagtcc 481 gtcgagagat taagtttact gtccaggcat tgccatggat aagtattctt actgttgcac 541 tgttcttgct ggagataaga ggttacagca aattacatga tgacctagga gagtttccat 601 atggattgtt tgaacttgtc gttagtataa tatctttcct ctttttcact gacatgttca 661 tctactggat tcacagaggc cttcatcata gactggtata taagcgccta cataaacctc 721 accatatttg gaagattcct actccatttg caagtcatgc ttttcaccct attgatggct 781 ttcttcagag tctaccttac catatatacc cttttatctt tccattacac aaggtggttt 841 atttaagtct gtacatcttg gttaatatct ggacaatttc cattcatgac ggtgattttc 901 gtgtaagaat gaaaaattat tcaatggaga gtttacaaag actgaataga ttattgccca 961 gttattctta agtaaggaca aagaaggaaa tatcatcgta tttctttttt ttaataagga 1021 aaaaataata tccatacagt caagatacat agtaaatggt atcatttgga aatcagcatc 1081 gtgggcactg ctgaggaatg atcctagtgg taggtcagaa gaagatgctg tgaacaccag 1141 gactttaatc ttatgcttaa aatgccagat gttgttcggg ggacaacttg tatctttcta 1201 gcagcagatc tgtagtttgt atagcctcaa caacaatttt aaataagatg gagaataaat 1261 tattgagggg actaggctat atgcatttgc cttcatccac ccatgtttat taagaatcat 1321 tgtgcttaat aataccaaga ctaagcacca taaccaagaa atactaatgt aaagattgtt 1381 tcttgtttca ggaatggtta attcttcaac gttggtatga taatgataac ttgttttgac 1441 ttgaataaag tactacatca gtgtggaaaa aaattctgat acattagcag ctatgtaaat 1501 gacctaattg atagcaggtg taataagact atcgtcttcc tacacatagg aggctcattc 1561 tctggacaca ctatcaccta ttacatttta ctgattaaca aataaattgg aatttaaaaa 1621 tatcgatatc accatgattt aatccagatc tgggattatg tagctaaaca ttgtgatgat 1681 tattatttaa aaccattatt taataagagt aaaaatatgt gaatctggat atatttaaaa 1741 aaagaaattt gatgcccaga taatatatta ggcactactg attttttagt taaattgatg 1801 cactacactt ttgatgtttg aagttacaaa cctgtaattt ttttgtaaag gaaataattg 1861 ccaaatacct aggcccattg ctgacgatta gttctaaaat cttattcctc ctcttctccc 1921 ctcacttttc cctacttcct ctgcaaaaag atttaacaaa tacattcata aggaaatgtg 1981 tgttgtaaca aatatattgc aaaaacatag tttgtaaagg cattctataa gctatttatg 2041 taaaatcaat aaaagttgat cataatt // LOCUS D85425 1393 bp mRNA PRI 18-FEB-1997 DEFINITION Human mRNA for transactivator HSM-1, complete cds. ACCESSION D85425 NID g1339911 KEYWORDS transactivator HSM-1; CBF-C; CAATT box binding factor subunit C. SOURCE Homo sapiens male whole brain cDNA to mRNA, clone_lib:pACT2 library of CLONTECH. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Taira,T., Iguchi-Ariga,S. and Ariga,H. TITLE Novel Transactivator HSM-1 regulates human hsp70 gene expression JOURNAL Unpublished (1996) REFERENCE 2 (bases 1 to 1393) AUTHORS Ariga,H. TITLE Direct Submission JOURNAL Submitted (11-MAY-1996) to the DDBJ/EMBL/GenBank databases. Hiroyoshi Ariga, Faculty of Pharmaceutical Sciences, Hokkaido University, Molecular Biology; Kita 12, Nishi 6, Kita-ku, Sapporo, Hokkaio 060, Japan (E-mail:Hiro@ph.hines.hokudai.ac.jp, Tel:011-706-3745, Fax:011-706-4988) FEATURES Location/Qualifiers source 1..1393 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="pACT2 library of CLONTECH" /sex="male" /tissue_type="whole brain" CDS 84..1091 /note="CAATT box binding factor subuit C (CBF-C)" /codon_start=1 /product="transactivator HSM-1" /db_xref="PID:d1013502" /db_xref="PID:g1843423" /translation="MSTEGGFGGTSSSDAQQSLQSFWPRVMEEIRNLTVKDFRVQELP LARIKKIMKLDEDVKMISAEAPVLFAKAAQIFITELTLRAWIHTEDNKRRTLQRNDIA MAITKFDQFDFLIDIVPRDELKPPKRQEEVRQSVTPAEPVQYYFTLAQQPTAVQVQGQ QQGQQTTSSTTTIQPGQIIIAQPQQGQTTPVTMQVGESQQVKIVQAQPQGQAQQAQSG TGQTMQVMQQIITNTGEIQQIPVQLNAGQLQYIRLAQPVSGTQVVQGQIQTLATNAQQ ITQTEVQQGQQQFSQFTDGQQLYQIQQVTMPAGQDLAQPMFIQSANQPSDGQAPQVTG D" BASE COUNT 390 a 390 c 340 g 273 t ORIGIN 1 gaattcgcgg ccgcgtcgac cgactccgta ggagcgcggg ggcggctcct gctcttcctg 61 gactcctgag cagagttgtc gagatgtcca cagaaggagg atttggtggt actagcagca 121 gtgatgccca gcaaagccta cagtcgttct ggcctcgggt catggaagaa atccggaatt 181 taacagtgaa agacttccga gtgcaggaac tcccactggc tcgtattaag aagattatga 241 aactggatga agatgtgaag atgatcagtg cagaagcgcc tgtactcttt gccaaggcag 301 cccagatttt tatcacagag ttgactcttc gagcctggat tcacacagaa gataacaagc 361 gccggactct acagagaaat gatatcgcca tggcaattac aaaatttgat cagtttgatt 421 ttctcatcga tattgttcca agagatgaac tgaaacctcc aaagcgtcag gaggaggtgc 481 gccagtctgt aactcctgcc gagccagtcc agtactattt cacgctggct cagcaaccca 541 ccgctgtcca agtccagggc cagcagcaag gccagcagac caccagctcc acgaccacca 601 tccagcctgg gcagatcatc atcgcacagc ctcagcaggg ccagaccaca cctgtgacaa 661 tgcaggttgg agaaagtcaa caagtgaaaa ttgtccaggc tcaaccacag ggtcaagccc 721 aacaggccca gagtggcact ggacagacca tgcaggtgat gcagcagatc atcactaaca 781 caggagagat ccagcagatc ccggtgcagc tgaatgccgg ccagctgcag tatatccgct 841 tagcccagcc tgtatcaggc actcaagttg tgcagggaca gatccagaca cttgccacca 901 atgctcaaca gattacacag acagaggtcc agcaaggaca gcagcagttc agccagttca 961 cagatggaca gcagctatac cagatccagc aagtcaccat gcctgcgggc caggacctcg 1021 cccagcccat gttcatccag tcagccaacc agccctccga cgggcaggcc ccccaggtga 1081 ccggcgactg agggcctgag ctggcaaggc caaagacacc caacacaatt tttgccatac 1141 agccccaggc aatgggcaca gccttcctcc ccagaggacc cggccgacct cagcgcctcc 1201 tgcaggctag gacactggtg cactacaccc catgcctggg ggccgagatt ctccagcaga 1261 aagatgcaat attttttgtt tccttttttt ccattttttt ctctaaggaa tcaatatttc 1321 aatatgttga gtgtgtgtcc aatgctatga aattaaaata ttaaataaca aaaaaaaaaa 1381 aaaaaaactc gag // LOCUS D85759 2765 bp mRNA PRI 27-DEC-1997 DEFINITION Homo sapiens mRNA for MNB protein kinase, complete cds. ACCESSION D85759 NID g1526445 KEYWORDS MNB; minibrain; MNB protein kinase; DYRK. SOURCE Homo sapiens (isolate:caucasian) fetuses, 20-26 weeks brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2765) AUTHORS Shindoh,N., Kudoh,J., Maeda,H., Yamaki,A., Minoshima,S., Shimizu,Y. and Shimizu,N. TITLE Cloning of a human homolog of the Drosophila minibrain/rat Dyrk gene from 'the Down syndrome critical region' of chromosome 21 JOURNAL Biochem. Biophys. Res. Commun. 225 (1), 92-99 (1996) MEDLINE 96332410 REFERENCE 2 (bases 1 to 2765) AUTHORS Shimizu,N. TITLE Direct Submission JOURNAL Submitted (01-JUN-1996) to the DDBJ/EMBL/GenBank databases. Nobuyoshi Shimizu, Keio University School of Medicine, Department of Molecular Biology; 35 Shinanomachi, Shinjuku-ku, Tokyo 160, Japan (E-mail:shimizu@dmb.med.keio.ac.jp, Tel:03-3351-2370, Fax:03-3351-2370) FEATURES Location/Qualifiers source 1..2765 /organism="Homo sapiens" /isolate="caucasian" /db_xref="taxon:9606" /chromosome="21" /dev_stage="fetuses, 20-26 weeks" /map="21q22.2" /tissue_type="brain" gene 69..2333 /gene="MNB" CDS 69..2333 /gene="MNB" /codon_start=1 /product="MNB protein kinase" /db_xref="PID:d1013551" /db_xref="PID:g1526446" /translation="MHTGGETSACKPSSVRLAPSFSFHAAGLQMAGQMPHSHQYSDRR QPNISDQQVSALSYSDQIQQPLTNQRRMPQTFRDPATAPLRKLSVDLIKTYKHINEVY YAKKKRRHQQGQGDDSSHKKERKVYNDGYDDDNYDYIVKNGEKWMDRYEIDSLIGKGS FGQVVKAYDRVEQEWVAIKIIKNKKAFLNQAQIEVRLLELMNKHDTEMKYYIVHLKRH FMFRNHLCLVFEMLSYNLYDLLRNTNFRGVSLNLTRKFAQQMCTALLFLATPELSIIH CDLKPENILLCNPKRSAIKIVDFGSSCQLGQRIYQYIQSRFYRSPEVLLGMPYDLAID MWSLGCILVEMHTGEPLFSGANEVDQMNKIVEVLGIPPAHILDQAPKARKFFEKLPDG TWNLKKTKDGKREYKPPGTRKLHNILGVETGGPGGRRAGESGHTVADYLKFKDLILRM LDYDPKTRIQPYYALQHSFFKKTADEGTNTSNSVSTSPAMEQSQSSGTTSSTSSSSGG SSGTSNSGRARSDPTHQHRHSGGHFTAAVQAMDCETHSPQVRQQFPAPLGWSGTEAPT QVTVETHPVQETTFHVAPQQNALHHHHGNSSHHHHHHHHHHHHHGQQALGNRTRPRVY NSPTNSSSTQDSMEVGHSHHSMTSLSSSTTSSSTSSSSTGNQGNQAYQNRPVAANTLD FGQNGAMDVNLTVYSNPRQETGIAGHPTYQFSANTGPAHYMTEGHLTMRQGADREESP MTGVCVQQSPVASS" BASE COUNT 788 a 663 c 591 g 723 t ORIGIN 1 ttttgccgct ggactcttcc ctcccttccc ccaccccatc aggatgatat gagacttgaa 61 agaagacgat gcatacagga ggagagactt cagcatgcaa accttcatct gttcggcttg 121 caccgtcatt ttcattccat gctgctggcc ttcagatggc tggacagatg ccccattcac 181 atcagtacag tgaccgtcgc cagccaaaca taagtgacca acaggtttct gccttatcat 241 attctgacca gattcagcaa cctctaacta accagaggcg gatgccccaa accttccgtg 301 acccagcaac tgctcccctg agaaaacttt ctgttgactt gatcaaaaca tacaagcata 361 ttaatgaggt ttactatgca aaaaagaagc gaagacacca acagggccag ggagacgatt 421 ctagtcataa gaaggaacgg aaggtttaca atgatggtta tgatgatgat aactatgatt 481 atattgtaaa aaacggagaa aagtggatgg atcgttacga aattgactcc ttgataggca 541 aaggttcctt tggacaggtt gtaaaggcat atgatcgtgt ggagcaagaa tgggttgcca 601 ttaaaataat aaagaacaag aaggcttttc tgaatcaagc acagatagaa gtgcgacttc 661 ttgagctcat gaacaaacat gacactgaaa tgaaatacta catagtgcat ttgaaacgcc 721 actttatgtt tcgaaaccat ctctgtttag tttttgaaat gctgtcctac aacctctatg 781 acttgctgag aaacaccaat ttccgagggg tctctttgaa cctaacacga aagtttgcgc 841 aacagatgtg cactgcactg cttttccttg cgactccaga acttagtatc attcactgtg 901 atctaaaacc tgaaaatatc cttctttgta accccaaacg cagtgcaatc aagatagttg 961 actttggcag ttcttgtcag ttggggcaga ggatatacca gtatattcag agtcgctttt 1021 atcggtctcc agaggtgcta ctgggaatgc cttatgacct tgccattgat atgtggtccc 1081 tcgggtgtat tttggttgaa atgcacactg gagaacctct gttcagtggt gccaatgagg 1141 tagatcagat gaataaaata gtggaagttc tgggtattcc acctgctcat attcttgacc 1201 aagcaccaaa agcaagaaag ttctttgaga agttgccaga tggcacttgg aacttaaaga 1261 agaccaaaga tggaaaacgg gagtacaaac caccaggaac ccgtaaactt cataacattc 1321 ttggagtgga aacaggagga cctggtgggc gacgtgctgg ggagtcaggt catacggtcg 1381 ctgactactt gaagttcaaa gacctcattt taaggatgct tgattatgac cccaaaactc 1441 gaattcaacc ttattatgct ctgcagcaca gtttcttcaa gaaaacagct gatgaaggta 1501 caaatacaag taatagtgta tctacaagcc ccgccatgga gcagtctcag tcttcgggca 1561 ccacctccag tacatcgtca agctcaggtg gctcatcggg gacaagcaac agtgggagag 1621 cccggtcgga tccgacgcac cagcatcggc acagtggtgg gcacttcaca gctgccgtgc 1681 aggccatgga ctgcgagaca cacagtcccc aggtgcgtca gcaatttcct gctcctcttg 1741 gttggtcagg cactgaagct cctacacagg tcactgttga aactcatcct gttcaagaaa 1801 caacctttca tgtagcccct caacagaatg cattgcatca tcaccatggt aacagttccc 1861 atcaccatca ccaccaccac caccatcacc accaccatgg acaacaagcc ttgggtaacc 1921 ggaccaggcc aagggtctac aattctccaa cgaatagctc ctctacccaa gattctatgg 1981 aggttggcca cagtcaccac tccatgacat ccctgtcttc ctcaacgact tcttcctcga 2041 catcttcctc ctctactggt aaccaaggca atcaggccta ccagaatcgc ccagtggctg 2101 ctaatacctt ggactttgga cagaatggag ctatggacgt taatttgacc gtctactcca 2161 atccccgcca agagactggc atagctggac atccaacata ccaattttct gctaatacag 2221 gtcctgcaca ttacatgact gaaggacatc tgacaatgag gcaaggggct gatagagaag 2281 agtcccccat gacaggagtt tgtgtgcaac agagtcctgt agctagctcg tgactacatt 2341 gaaacttgag tttgtttctt gtgtgttttt atagaagtgg tgtttttttt ccaaaaacaa 2401 agtgcaaagc tgcttgaatc aggaggagat taacacactg aaccgctaca agagggcaaa 2461 gctgattttt tttttaactt gaaaagattg caaagggaca ttgaagtgtt taaaagagcc 2521 atgtccaaac ccatcttcat ggatagctca gaggtatcct ctttttgctc ccccatttta 2581 acttgccaca tcccagtcac agtggggttt ttttgtcttt ctattcagca aaagttaata 2641 ttcagatgtt ggtcttggtc atttgccaac taattttaaa gtaaaaggca ctgcacataa 2701 tttgcataaa gggccccatg agggtgtttt tttttctttt tgtccccccc atcccccttt 2761 ttttt // LOCUS D85815 1086 bp DNA PRI 15-APR-1997 DEFINITION Human DNA for rhoHP1, complete cds. ACCESSION D85815 NID g1944384 KEYWORDS rhoHP1. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1086) AUTHORS Shimizu,F. TITLE Direct Submission JOURNAL Submitted (05-JUN-1996) to the DDBJ/EMBL/GenBank databases. Fumio Shimizu, Otsuka Pharmaceutical Co. Ltd., Otska GEN Research; Kawauchi-cho 463-10, Tokushima, Tokushima 771-01, Japan (E-mail:shimizu@otsuka.genome.ad.jp, Tel:0886-65-2888, Fax:0886-37-1035) REFERENCE 2 (sites) AUTHORS Shimizu,F., Watanabe,T.K., Okuno,S., Fujiwara,T. and Nakamura,Y. TITLE A novel human cDNA homologous to rho genes JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 1086) AUTHORS Shimizu,F., Watanabe,T.K., Okuno,S., Omori,Y., Fujiwara,T., Takahashi,E. and Nakamura,Y. TITLE Isolation of a novel human cDNA (rhoHP1) homologous to rho genes JOURNAL Biochim. Biophys. Acta 1351 (1-2), 13-16 (1997) MEDLINE 97236425 FEATURES Location/Qualifiers source 1..1086 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 40..672 /note="Rho-related protein HP1" /codon_start=1 /product="rhoHP1" /db_xref="PID:d1020429" /db_xref="PID:g1944385" /translation="MTAAQAAGEEAPPGVRSVKVVLVGDGGCGKTSLLMVFADGAFPE SYTPTVFERYMVNLQVKGKPVHLHIWDTAGQDDYDRLRPLFYPDASVLLLCFDVTSPN SFDNIFNRWYPEVNHFCKKVPIIVVGCKTDLRKDKSLVNKLRRNGLEPVTYHRGQEMA RSVGAVAYLECSARLHDNVHAVFQEAAEVALSSRGRNFWRRITQGFCVVT" BASE COUNT 199 a 361 c 338 g 188 t ORIGIN 1 cgcagccgcc cgcccgcccg ctcagcgccc ggccccggga tgacggcggc ccaggccgcg 61 ggtgaggagg cgccaccagg cgtgcggtcc gtcaaggtgg tcctggtggg cgacggcggc 121 tgcgggaaga cgtcgctgct gatggtcttc gccgatgggg ccttccccga gagctacacc 181 cccacggtgt ttgagcggta catggtcaac ctgcaagtga aaggcaaacc tgtgcacctc 241 cacatctggg acacagcagg gcaagatgac tatgaccgcc tgcggcccct gttctaccct 301 gacgccagcg tcctgctgct ttgcttcgat gtcaccagcc cgaacagctt tgacaacatc 361 tttaaccggt ggtacccaga agtgaatcat ttctgcaaga aggtacccat catcgtcgtg 421 ggctgcaaga ctgacctgcg caaggacaaa tcactggtga acaagctccg aagaaacgga 481 ttggagcctg tgacctacca caggggccag gagatggcga ggtccgtggg cgcggtggcc 541 tacctcgagt gctcggctcg gctccatgac aacgtccacg ccgtcttcca ggaggccgcc 601 gaggtggccc tcagcagccg cggtcgcaac ttctggcggc ggattaccca gggcttttgc 661 gtggtgacct gagcggctcg gggcgtccca gcgacgcggg aaggggcagg gcgctgacct 721 gctgctgagc tggctgggct ggacccggtc cctaggctgt gaccgccgaa ctccactgca 781 acagacgggc gccaccaaag ccaggccctg aggcctggga gtcctggact gagaaagggg 841 gttcctgggc ccacctgctc tgtgtagggc tcgtcctgcg gtgcccgaga atcactcgct 901 aacccctatg cccggtcccg gaccgacatc ctggagccgc ctgtgcagcc tgatgccccc 961 tcgtggctgc tcccagggct gcacctgcca ggacctaatg ttcttaggtc cctctggcca 1021 gaacccacac ccggcccctt cccacctgtc atactggtaa ctgtaacaag aaaaacgaca 1081 tcactt // LOCUS D85939 1167 bp mRNA PRI 20-MAY-1997 DEFINITION Human mRNA for p97 homologous protein, complete cds. ACCESSION D85939 NID g2114175 KEYWORDS p97. SOURCE Homo sapiens placenta cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1167) AUTHORS Nobukuni,T. TITLE Direct Submission JOURNAL Submitted (13-JUN-1996) to the DDBJ/EMBL/GenBank databases. Takahiro Nobukuni, Mitsubishi Kasei Institute of Lige Sciences; Machida, Tokyo 194, Japan (E-mail:tnobukuh@libra.ls.m-kagaku.co.jp, Tel:0427-24-6265, Fax:0427-24-6316) REFERENCE 2 (bases 1 to 1167) AUTHORS Nobukuni,T., Kobayashi,M., Omori,A., Yoshida,S., Iwanaga,T., Hashimoto,K., Hattori,S., Kaibuchi,K., Masui,T. and Iwashita,S. TITLE Cloning of a Novel Protein Harbouring Alu family repeat sequence like region in ORF JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nobukuni,T., Kobayashi,M., Omori,A., Ichinose,S., Iwanaga,T., Takahashi,I., Hashimoto,K., Hattori,S., Kaibuchi,K., Miyata,Y., Masui,T. and Iwashita,S. TITLE An Alu-linked repetitive sequence corresponding to 280 amino acids is expressed in a novel bovine protein, but not in its human homologue JOURNAL J. Biol. Chem. 272 (5), 2801-2807 (1997) MEDLINE 97160586 FEATURES Location/Qualifiers source 1..1167 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" CDS 110..763 /codon_start=1 /product="p97 homologous protein" /db_xref="PID:d1020879" /db_xref="PID:g2114176" /translation="MEEFDSEDFSTSEEDEDYVPSGGEYSEDDVNELVKEDEVDGEEQ TQKTQGKKRKAQSIPARKRRQGGLSLEEEEEEDANSESEGSSSEEEDDAAEQEKGIGS EDARKKKEDELWASFLNDVGPKSKVPPSTQVKKGEETEETSSSKLLVKAEELEKPKET EKVKITKVFDFAGEEVRVTKEVDATSKEAKSFFKQNEKEKPQANVPSALPSLPAGSG" BASE COUNT 359 a 233 c 333 g 242 t ORIGIN 1 cctagctgcc gtcgccgccg ccggggctct atggtctctc cctagagctt tgccgttgga 61 ggcggctgct gcggtcttgt gagtttgacc agcgtcgagc ggcagcaaca tggaggaatt 121 cgactccgaa gacttctcta cgtcggagga ggacgaggac tacgtgccgt cgggtggaga 181 gtatagtgaa gatgatgtaa atgaattagt gaaggaagat gaagtggatg gtgaagagca 241 gacacagaaa acccaaggga aaaaaagaaa ggcccagagc attccagcca ggaagagaag 301 acaaggtggc ctctcattag aagaagagga agaggaggat gccaattcag aatctgaggg 361 aagcagtagt gaggaggaag atgacgctgc agagcaggaa aaaggcattg gatcagagga 421 tgccaggaaa aagaaggagg acgaactctg ggccagcttc ctcaatgatg tgggaccaaa 481 atcaaaagtg cccccaagta cacaagttaa gaaaggagag gagactgaag agacaagttc 541 aagtaaattg ttggtaaaag cagaagagct agagaaacct aaagaaacag aaaaagttaa 601 aatcaccaag gtgtttgatt ttgctggtga agaagtaagg gtaactaagg aagtggatgc 661 tacatctaaa gaggccaaat ccttcttcaa gcagaatgag aaagaaaaac cacaggctaa 721 tgttccttca gctctgccat cactccctgc cgggtcaggg tgagtatttg agtctcactc 781 cctgatgtac ttggcacgtt ggagaagctt tcagtgcaaa tacctacttt cgctgcagat 841 cagactcctt gcctgagaga gagaattatg atgctgcttt gcctttgctg caatatttac 901 cagaatggaa gatgcaaagg aaaattagcc ataactttct caggaggatt gcttgaaccc 961 aggagtttgt gaccagcctg ggcaatacag taagaccctg tctcttaaaa aaaaaaaaat 1021 tagctgggca tggtagcttg cacctctgtc ctagctactg tggaggctga ggcaggagga 1081 tcccttgagc ccagaaggtt gaggctgcag tgagccgtga ttgtgccact gcactccagc 1141 ctgggtgaca gagagatcct gtctctt // LOCUS D86043 1977 bp mRNA PRI 03-MAR-1997 DEFINITION Human mRNA for SHPS-1, complete cds. ACCESSION D86043 NID g1864010 KEYWORDS SHPS-1. SOURCE Homo sapiens brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Yamao,T., Matozaki,T., Amano,K., Matsuda,Y., Takahashi,N., Ochi,F., Fujioka,Y. and Kasuga,M. TITLE Mouse and human SHPS-1: molecular cloning of cDNAs and chromosomal localization of genes JOURNAL Biochem. Biophys. Res. Commun. 231 (1), 61-67 (1997) MEDLINE 97223399 REFERENCE 2 (bases 1 to 1977) AUTHORS Matozaki,T., Takahashi,N. and Yamao,T. TITLE Cloning of human SHPS-1 cDNA JOURNAL Unpublished (1998) REFERENCE 3 (bases 1 to 1977) AUTHORS Matozaki,T. TITLE Direct Submission JOURNAL Submitted (15-JUN-1996) to the DDBJ/EMBL/GenBank databases. Takashi Matozaki, Kobe University School of Medicine, Second Department of Internal Medicine; Kusunoki-cho, Chuo-ku, Kobe, Hyogo 650, Japan (E-mail:matozaki@med.kobe-u.ac.jp, Tel:078-341-7451, Fax:078-382-2080) FEATURES Location/Qualifiers source 1..1977 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" CDS 3..1514 /codon_start=1 /product="SHPS-1" /db_xref="PID:d1013659" /db_xref="PID:g1864011" /translation="MEPAGPAPGRLGPLLCLLLAASCAWSGVAGEEELQVIQPDKSVS VAAGESAILHCTVTSLIPVGPIQWFRGAGPARELIYNQKEGHFPRVTTVSESTKRENM DFSISISNITPADAGTYYCVKFRKGSPDTEFKSGAGTELSVRAKPSAPVVSGPAARAT PQHTVSFTCESHGFSPRDITLKWFKNGNELSDFQTNVDPVGESVSYSIHSTAKVVLTR EDVHSQVICEVAHVTLQGDPLRGTANLSETIRVPPTLEVTQQPVRAENQVNVTCQVRK FYPQRLQLTWLENGNVSRTETASTVTENKDGTYNWMSWLLVNVSAHRDDVKLTCQVEH DGQPAVSKSHDLKVSAHPKEQGSNTAAENTGSNERNIYIVVGVVCTLLVALLMAALYL VRIRQKKAQGSTSSTRLHEPEKNAREITQDTNDITYADLNLPKGKKPAPQAAEPNNHT EYASIQTSPQPASEDTLTYADLDMVHLNRTPKQPAPKPEPSFSEYASVQVPRK" BASE COUNT 453 a 648 c 530 g 346 t ORIGIN 1 ccatggagcc cgccggcccg gcccccggcc gcctcgggcc gctgctctgc ctgctgctcg 61 ccgcgtcctg cgcctggtca ggagtggcgg gtgaggagga gctgcaggtg attcagcctg 121 acaagtccgt atcagttgca gctggagagt cggccattct gcactgcact gtgacctccc 181 tgatccctgt ggggcccatc cagtggttca gaggagctgg accagcccgg gaattaatct 241 acaatcaaaa agaaggccac ttcccccggg taacaactgt ttcagagtcc acaaagagag 301 aaaacatgga cttttccatc agcatcagta acatcacccc agcagatgcc ggcacctact 361 actgtgtgaa gttccggaaa gggagccctg acacggagtt taagtctgga gcaggcactg 421 agctgtctgt gcgtgccaaa ccctctgccc ccgtggtatc gggccctgcg gcgagggcca 481 cacctcagca cacagtgagc ttcacctgcg agtcccacgg cttctcaccc agagacatca 541 ccctgaaatg gttcaaaaat gggaatgagc tctcagactt ccagaccaac gtggaccccg 601 taggagagag cgtgtcctac agcatccaca gcacagccaa ggtggtgctg acccgcgagg 661 acgttcactc tcaagtcatc tgcgaggtgg cccacgtcac cttgcagggg gaccctcttc 721 gtgggactgc caacttgtct gagaccatcc gagttccacc caccttggag gttactcaac 781 agcccgtgag ggcagagaac caggtgaatg tcacctgcca ggtgaggaag ttctaccccc 841 agagactaca gctgacctgg ttggagaatg gaaacgtgtc ccggacagaa acggcctcaa 901 ccgttacaga gaacaaggat ggtacctaca actggatgag ctggctcctg gtgaatgtat 961 ctgcccacag ggatgatgtg aagctcacct gccaggtgga gcatgacggg cagccagcgg 1021 tcagcaaaag ccatgacctg aaggtctcag cccacccgaa ggagcagggc tcaaataccg 1081 ccgctgagaa cactggatct aatgaacgga acatctatat tgtggtgggt gtggtgtgca 1141 ccttgctggt ggccctactg atggcggccc tctacctcgt ccgaatcaga cagaagaaag 1201 cccagggctc cacttcttct acaaggttgc atgagcccga gaagaatgcc agagaaataa 1261 cacaggacac aaatgatatc acatatgcag acctgaacct gcccaagggg aagaagcctg 1321 ctccccaggc tgcggagccc aacaaccaca cggagtatgc cagcattcag accagcccgc 1381 agcccgcgtc ggaggacacc ctcacctatg ctgacctgga catggtccac ctcaaccgga 1441 cccccaagca gccggccccc aagcctgagc cgtccttctc agagtacgcc agcgtccagg 1501 tcccgaggaa gtgaatggga ccgtggtttg ctctagcacc catctctacg cgctttcttg 1561 tcccacaggg agccgccgtg atgagcacag ccaacccagt tcccggaggg ctggggcggt 1621 gcaggctctg ggacccaggg gccagggtgg ctcttctctc cccacccctc cttggctctc 1681 cagcacttcc tgggcagcca cggccccctc cccccacatt gccacatacc tggaggctga 1741 cgttgccaaa ccagccaggg aaccaacctg ggaagtggcc agaactgcct ggggtccaag 1801 aactcttgtg cctccgtcca tcaccatgtg ggttttgaag accctcgact gcctccccga 1861 tgctccgaag cctgatcttc cagggtgggg aggagaaaat cccacctccc ctgacctcca 1921 ccacctccac caccaccacc accaccacca ccaccactac caccaccacc caactgg // LOCUS D86061 1583 bp mRNA PRI 17-SEP-1996 DEFINITION Human mRNA for KNP-Ia, complete cds. ACCESSION D86061 NID g1545812 KEYWORDS KNP-I; KNP-Ia; alternative splicing. SOURCE Homo sapiens (isolate:7 Caucasian fetuses, 20-26 weeks) brain cDNA to mRNA, clone_lib:fetal brain 5'-strech plus cDNA library (CLONTECH). ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1583) AUTHORS Nagamine,K., Kudoh,J., Minoshima,S., Kawasaki,K., Asakawa,S., Ito,F. and Shimizu,N. TITLE Isolation of cDNA for a novel human protein KNP-I that is homologous to the E. coli SCRP-27A protein from the autoimmune polyglandular disease type I (APECED) region of chromosome 21q22.3 JOURNAL Biochem. Biophys. Res. Commun. 225 (2), 608-616 (1996) MEDLINE 96354831 REFERENCE 2 (bases 1 to 1583) AUTHORS Shimizu,N. TITLE Direct Submission JOURNAL Submitted (17-JUN-1996) to the DDBJ/EMBL/GenBank databases. Nobuyoshi Shimizu, Keio University, School of Medicine, Department of Molecular Biology; 35 Shinanomachi, Shinjuku-ku, Tokyo 160, Japan (E-mail:shimizu@dmb.med.keio.ac.jp, Tel:03-3351-2370, Fax:03-3351-2370) FEATURES Location/Qualifiers source 1..1583 /organism="Homo sapiens" /isolate="7 Caucasian fetuses, 20-26 weeks" /db_xref="taxon:9606" /chromosome="21" /clone_lib="fetal brain 5'-strech plus cDNA library (CLONTECH)" /map="21q22.3" /tissue_type="brain" gene 18..824 /gene="KNP-I" CDS 18..824 /gene="KNP-I" /note="alternative splicing: see also acc# D86062" /codon_start=1 /product="KNP-Ia" /db_xref="PID:d1013671" /db_xref="PID:g1545813" /translation="MAAVRALVASRLAAASAFTSLSPGGRTPSQRAALHLSVPRPAAR VALVLSGCGVYDGTEIHEASAILVHLSRGGAEVQIFAPDVPQMHVIDHTKGQPSEGES RNVLTESARIARGKITDLANLSAANHDAAIFPGGFGAAKNLSTFAVDGKDCKVNKEVE RVLKEFHQAGKPIGLCCIAPVLAAKVLRGVEVTVGHEQEEGGKWPYAGTAEAIKALGA KHCVKEVVEAHVDQKNKVVTTPAFMCETALHYIHDGIGAMVRKVLELTGK" exon 446..538 /gene="KNP-I" /note="alternatively spliced exon HC21EXc145" polyA_signal 1565..1570 BASE COUNT 341 a 414 c 480 g 348 t ORIGIN 1 ccgctgtcct caccgcaatg gcggctgtga gggccctggt ggcctcgagg ctcgctgcgg 61 catctgcatt cacgtccctg tcccccggcg gtcggacgcc ttcccagcgc gcagcccttc 121 acctctccgt gccgcgcccc gcggccaggg tcgcgctggt gctgtctgga tgcggagtct 181 acgatgggac cgagatccac gaggcctcgg cgatcctggt gcacctgagc cgtggagggg 241 ctgaagtcca gatctttgct cctgacgtcc ctcagatgca cgtgattgac cacaccaagg 301 ggcagccgtc cgaaggcgag agcaggaatg ttttgaccga gtctgcgagg atcgcccgtg 361 gcaaaatcac agacctggcc aacctcagtg cagccaacca tgatgctgcc atctttccag 421 gaggctttgg agcggctaaa aacctgagca cgtttgccgt ggacgggaaa gattgcaagg 481 tgaataaaga agtggagcgt gtcctgaagg agttccacca ggccgggaag cccatcggct 541 tgtgctgcat tgcacctgtc ctcgcggcca aggtgctcag aggcgtcgag gtgactgtgg 601 gccacgagca ggaggaaggt ggcaagtggc cttatgccgg gaccgcagag gccatcaagg 661 ccctgggtgc caagcactgc gtgaaggaag tggtcgaagc tcacgtggac cagaaaaaca 721 aggtggtcac gaccccagcc ttcatgtgcg agacggcact ccactacatc catgatggga 781 tcggagccat ggtgaggaag gtgctggaac tcactggaaa gtgacgcgca tggacggggc 841 ccagctaggc gccaggactt ggcctcaccc tctggctgag gagctgtcgg ctgctttcca 901 tccagctggg agtctggcag gccctttttt ttttttcttt gccgaaacct gcaggcgttc 961 tctctctaag gaggatgtgc tgcagtgcat gggggatgtt tcttcctggg tgtggctggg 1021 ctgctctcac atacagaggc cgaggggcca attcgttctc tgccacaggg acttgcctca 1081 ctgtgtccca aaaacaaatc gcagccagct tttccagaaa tagaaaattc tgccgtctga 1141 ggttttatac ttcaggttag ttagtttttg gaaggaagaa catttttagg tttgcaagcc 1201 tcctgatcag gaaaccagaa ataccacatt tatggaccat gaaaggttgg ttcttgactc 1261 tgaagggact tttgagttaa tcagcgtaag gggatttcta aagcaggcaa tccctgtagc 1321 cgcagagaat aaacgccttc ccaaaatggc aacttcccac agccacattt caaacctgct 1381 gagactgctg agtgaggaat ggcagtgagg tttcttcaat tagtctcagt tctcttaatt 1441 ttcaggaaga aagggaaatt gcagctcctc agcccccagg attgacctct ggggagtgat 1501 ggtagcgttg gtgccaggcc gtgggttcag gtgtggcaga agcttgcaga tgcgtccgaa 1561 gggaaataaa gtgtgttggc gtt // LOCUS D86322 2710 bp mRNA PRI 20-JAN-1998 DEFINITION Homo sapiens mRNA for calmegin, complete cds. ACCESSION D86322 NID g2467376 KEYWORDS calmegin. SOURCE Homo sapiens male testis cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Tanaka,H., Ikawa,M., Tsuchida,J., Nozaki,M., Suzuki,M., Fujiwara,T., Okabe,M. and Nishimune,Y. TITLE Cloning and characterization of the human Calmegin gene encoding putative testis-specific chaperone JOURNAL Gene 204 (1-2), 159-163 (1997) MEDLINE 98094268 REFERENCE 2 (bases 1 to 2710) AUTHORS Tanaka,H. TITLE Direct Submission JOURNAL Submitted (28-JUN-1996) to the DDBJ/EMBL/GenBank databases. Hiromitsu Tanaka, Osaka University, Research Institute for Microbial Diseases; 3-1 Yamadaoka, Suita, Osaka 565, Japan (E-mail:tanaka@biken.osaka-u.ac.jp, Tel:06-879-8338, Fax:06-879-8339) FEATURES Location/Qualifiers source 1..2710 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="4" /map="4q28.3-q31.1" /sex="male" /tissue_type="testis" CDS 102..1934 /codon_start=1 /product="calmegin" /db_xref="PID:d1023458" /db_xref="PID:g2467377" /translation="MHFQAFWLCLGLLFISINAEFMDDDVETEDFEENSEEIDVNESE LSSEIKYKTPQPIGEVYFAETFDSGRLAGWVLSKAKKDDMDEEISIYDGRWEIEELKE NQVPGDRGLVLKSRAKHHAISAVLAKPFIFADKPLIVQYEVNFQDGIDCGGAYIKLLA DTDDLILENFYDKTSYIIMFGPDKCGEDYKLHFIFRHKHPKTGVFEEKHAKPPDVDLK KFFTDRKTHLYTLVMNPDDTFEVLVDQTVVNKGSLLEDVVPPIKPPKEIEDPNDKKPE EWDERAKIPDPSAVKPEDWDESEPAQIEDSSVVKPAGWLDDEPKFIPDPNAEKPDDWN EDTDGEWEAPQILNPACRIGCGEWKPPMIDNPKYKGVWRPPLVDNPNYQGIWSPRKIP NPDYFEDDHPFLLTSFSALGLELWSMTSDIYFDNFIICSEKEVADHWAADGWRWKIMI ANANKPGVLKQLMAAAEGHPWLWLIYLVTAGVPIALITSFCWPRKVKKKHKDTEYKKT DICIPQTKGVLEQEEKEEKAALEKPMDLEEEKKQNDGEMLEKEEESEPEEKSEEEIEI IEGQEESNQSNKSGSEDEMKEADESTGSGDGPIKSVRKRRVRKD" polyA_signal 2676..2680 BASE COUNT 957 a 421 c 567 g 765 t ORIGIN 1 cgccggcggg actggtctga agagacgcgg ggacaaagtg gcaacgactt ggacatctga 61 gctgtcactg ccgaaaacag gccgcaagag agataatcaa tatgcatttc caagcctttt 121 ggctatgttt gggtcttctg ttcatctcaa ttaatgcaga atttatggat gatgatgttg 181 agacggaaga ctttgaagaa aattcagaag aaattgatgt taatgaaagt gaactttcct 241 cagagattaa atataagaca cctcaaccta taggagaagt atattttgca gaaacttttg 301 atagtggaag gttggctgga tgggtcttat caaaagcaaa gaaagatgac atggatgagg 361 aaatttcaat atacgatgga agatgggaaa ttgaagagtt gaaagaaaac caggtacctg 421 gtgacagagg actggtatta aaatctagag caaagcatca tgcaatatct gctgtattag 481 caaaaccatt catttttgct gataaaccct tgatagttca atatgaagta aattttcaag 541 atggtattga ttgtggaggt gcatacatta aactcctagc agacactgat gatttgattc 601 tggaaaactt ttatgataaa acatcctata tcattatgtt tggaccagat aaatgtggag 661 aagattataa acttcatttt atcttcagac ataaacatcc caaaactgga gttttcgaag 721 agaaacatgc caaacctcca gatgtagacc ttaaaaagtt ctttacagac aggaagactc 781 atctttatac ccttgtgatg aatccagatg acacatttga ggtgttagtt gatcaaacag 841 ttgtaaacaa aggaagcctc ctagaggatg tggttcctcc tatcaaacct cccaaagaaa 901 ttgaagatcc caatgataaa aaacctgagg aatgggatga aagagcaaaa attcctgatc 961 cttctgccgt caaaccagaa gactgggatg aaagtgaacc tgcccaaata gaagattcaa 1021 gtgttgttaa acctgctggc tggcttgatg atgaaccaaa atttatccct gatcctaatg 1081 ctgaaaaacc tgatgactgg aatgaagaca cggatggaga atgggaggca cctcagattc 1141 ttaatccagc atgtcggatt gggtgtggtg agtggaaacc tcccatgata gataacccaa 1201 aatacaaagg agtatggaga cctccactgg tcgataatcc taactatcag ggaatctgga 1261 gtcctcgaaa aattcctaat ccagattatt tcgaagatga tcatccattt cttctgactt 1321 ctttcagtgc tcttggttta gagctttggt ctatgacctc tgatatctac tttgataatt 1381 ttattatctg ttcggaaaag gaagtagcag atcactgggc tgcagatggt tggagatgga 1441 aaataatgat agcaaatgct aataagcctg gtgtattaaa acagttaatg gcagctgctg 1501 aagggcaccc atggctttgg ttgatttatc ttgtgacagc aggagtgcca atagcattaa 1561 ttacttcatt ttgttggcca agaaaagtaa agaaaaaaca taaagataca gagtataaaa 1621 aaaccgacat atgtatacca caaacaaaag gagtactaga gcaagaagaa aaggaagaga 1681 aagcagccct ggaaaaacca atggacctgg aagaggaaaa aaagcaaaat gatggtgaaa 1741 tgcttgaaaa agaagaggaa agtgaacctg aggaaaagag tgaagaagaa attgaaatca 1801 tagaagggca agaagaaagt aatcaatcaa ataagtctgg gtcagaggat gagatgaaag 1861 aagcagatga gagcacagga tctggagatg ggccgataaa gtcagtacgc aaaagaagag 1921 tacgaaagga ctaaactaga ttgaaatatt tttaattccc gagaggatgt ttggcattgt 1981 aaaaatcagc atgccagacc tgaactttaa tcagtctgca catcctgttt ctaatatcta 2041 gcaacattat attctttcag acatttattt tagtccttca tttccgagga aaaagaagca 2101 actttgaagt tacctcatct ttgaatttag aataaaagtg gcacattaca tatcggatct 2161 aagagattaa taccattaga agttacacag ttttagttgt ttggagatag ttttggtttg 2221 tacagaacaa aataatatgt agcagcttca ttgctattgg aaaaatcagt tattggaatt 2281 tccacttaaa tggctataca acaatataac tggtagttct ataataaaaa tgagcatatg 2341 ttctgttgtg aagagctaaa tgcaataaag tttctgtatg gttgtttgat tctatcaaca 2401 attgaaagtg ttgtatatga cccacattta cctagtttgt gtcaaattat agttacagtg 2461 agttgtttgc ttaaattata gattccttta aggacatgcc ttgttcataa aatcactgga 2521 ttatattgca gcatatttta catttgaata caaggataat gggttttatc aaaacaaaat 2581 gatgtacaga ttttttttca agtttttata gttgctttat gccagagtgg tttaccccat 2641 tcacaaaatt tcttatgcat acattgctat tgaaaataaa atttaaatat tttttcatcc 2701 tgaaaaaaaa // LOCUS D86425 4829 bp mRNA PRI 25-JUL-1996 DEFINITION Human osteoblast mRNA for osteonidogen, complete cds. ACCESSION D86425 NID g1449166 KEYWORDS osteonidogen. SOURCE Homo sapiens osteoblast cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4829) AUTHORS Ohno,I., Hashimoto,J., Takaoka,K., Ochi,T., Okubo,K. and Matsubara,K. TITLE The cloning and characterization of a cDNA for the novel bone matrix protein : osteonidogen JOURNAL Unpublished (1996) REFERENCE 2 (bases 1 to 4829) AUTHORS Ohno,I. TITLE Direct Submission JOURNAL Submitted (09-JUL-1996) to the DDBJ/EMBL/GenBank databases. Ikko Ohno, Institute for Molecular and Cellular Biology, Osaka University, Molecular Genetics Matsubara Laboratory; 1-3 Yamada-oka, Suita, Osaka 565, Japan (E-mail:ikko@imcb.osaka-u.ac.jp, Tel:81-6-879-7992, Fax:81-6-877-1922) FEATURES Location/Qualifiers source 1..4829 /organism="Homo sapiens" /note="primary cultured" /db_xref="taxon:9606" /cell_type="osteoblast" /tissue_type="cancellous bone" CDS 1..4131 /codon_start=1 /product="osteonidogen" /db_xref="PID:d1013774" /db_xref="PID:g1449167" /translation="MEGDRVAGRPVLSSLPVLLLLQLLMLRAAALHPDELFPHGESWW DQLLQEGDDVKLSRGEAGESPALLTKPDSATSTWAPTASSPLRTSPGKRSMWTMISPP TSRPSPLFWRTSTRATAEAESCTERTPPPQCWAWPPAMCALASRALRAFYPHPRLPGH LGAGRRLRGGQTRALPSGELNTFQAVLASDGSDSYALFLYPANGLQFLGTRPKESYNV QLQLPARVGFCRGEADDLKSEGPYFSLTSTEQSVKNLYQLSNLGIPGVWAFHIGSTSP LDNVRPAAVGDLSAAHSSVPLGRSFSHATALESDYNEDNLDYYDVNEEEAEYLPGEPE EALNGHSSIDVSFQSKVDTKPLEESSTLDPHTKEGTSLGEVGGPDLKGQVEPWDERET RSPAPPEVDRDSLAPSWETPPPYPENGSIQPYPDGGPVPSEMDVPPAHPEEEIVLRSY PASGHTTPLSRGTYEVGLEDNIGSNTEVFTYNAANKETCEHNHRQCSRHAFCTDYATG FCCHCQSKFYGNGKHCLPEGAPHRVNGKVSGHLHVGHTPVHFTDVDLHAYIVGNDGRA YTAISHIPQPAAQALLPLTPIGGLFGWLFALEKPGSENGFSLAGAAFTHDMEVTFYPG EETVRITQTAEGLDPENYLSIKTNIQGQVPYVPANFTAHISPYKELYHYSDSTVTSTS SRDYSLTFGAINQTWSYRIHQNITYQVCRHAPRHPSFPTTQQLNVDRVFALYNDEERV LRFAVTNQIGPVKEDSDPTPVNPCYDGSHMCDTTARCHPGTGVDYTCECASGYQGDGR NCVDENECATGFHRCGPNSVCINLPGSYRCECRSGYEFADDRHTCILITPPANPCEDG SHTCAPAGQARCVHHGGSTFSCACLPGYAGDGHQCTDVDECSENRCHPAATCYNTPGS FSCRCQPGYYGDGFQCIPDSTSSLTPCEQQQRHAQAQYAYPGARFHIPQCDEQGNFLP LQCHGSTGFCWCVDPDGHEVPGTQTPPGSTPPHCGPSPEPTQRPPTICERWRENLLEH YGGTPRDDQYVPQCDDLGHFIPLQCHGKSDFCWCVDKDGREVQGTRSQPGTTPACIPT VAPPMVRPTPRPDVTPPSVGTFLLYTQGQQIGYLPLNGTRLQKDAAKTLLSLHGSIIV GIDYDCRERMVYWTDVAGRTISRAGLELGAEPETIVNSGLISPEGLAIDHIRRTMYWT DSVLDKIESALLDGSERKVLFYTDLVNPRAIAVDPIRGNLYWTDWNREAPKIETSSLD GENRRILINTDIGLPNGLTFDPFSKLLCWADAGTKKLECTLPDGTGRRVIQNNLKYPF SIVSYADHFYHTDWRRDGVVSVNKHSGQFTDEYLPEQRSHLYGITAVYPYCPTGRK" BASE COUNT 1247 a 1321 c 1175 g 1086 t ORIGIN 1 atggaggggg accgggtggc cgggcggccg gtgctgtcgt cgttaccagt gctactgctg 61 ctgcagttgc taatgttgcg ggccgcggcg ctgcacccag acgagctctt cccacacggg 121 gagtcgtggt gggaccagct cctgcaggaa ggcgacgacg taaagctcag ccgtggtgaa 181 gctggcgaat cccctgcact tcttacgaag cccgattcag caacctctac gtgggcacca 241 acggcatcat ctccactcag gacttcccca gggaaacgca gtatgtggac tatgatttcc 301 ccaccgactt cccggccatc gccccttttc tggcggacat cgacacgagc cacggcagag 361 gccgagtcct gtaccgagag gacacctccc ccgcagtgct gggcctggcc gcccgctatg 421 tgcgcgctgg cttcccgcgc tctgcgcgct ttttaccccc acccacgcct tcctggccac 481 ctgggagcag gtaggcgctt acgaggaggt caaacgcggg cgctgccctc gggagagctg 541 aacactttcc aggcagtttt ggcatctgat gggtctgata gctacgccct ctttctttat 601 cctgccaacg gcctgcagtt ccttggaacc cgccccaaag agtcttacaa tgtccagctt 661 cagcttccag ctcgggtggg cttctgccga ggggaggctg atgatctgaa gtcagaagga 721 ccatatttca gcttgactag cactgaacag tctgtgaaaa atctctatca actaagcaac 781 ctggggatcc ctggagtgtg ggctttccat atcggcagca cttccccgtt ggacaatgtc 841 aggccagctg cagttggaga cctttccgct gcccactctt ctgttcccct gggacgttcc 901 ttcagccatg ctacagccct ggaaagtgac tataatgagg acaatttgga ttactacgat 961 gtgaatgagg aggaagctga ataccttccg ggtgaaccag aggaggcatt gaatggccac 1021 agcagcattg atgtttcctt ccaatccaaa gtggatacaa agcctttaga ggaatcttcc 1081 accttggatc ctcacaccaa agaaggaaca tctctgggag aggtaggggg cccagattta 1141 aaaggccaag ttgagccctg ggatgagaga gagaccagaa gcccagctcc accagaggta 1201 gacagagatt cactggctcc ttcctgggaa accccaccac cgtaccccga aaacggaagc 1261 atccagccct acccagatgg agggccagtg ccttcggaaa tggatgttcc cccagctcat 1321 cctgaagaag aaattgttct tcgaagttac cctgcttcag gtcacactac acccttaagt 1381 cgagggacgt atgaggtggg actggaagac aacataggtt ccaacaccga ggtcttcacg 1441 tataatgctg ccaacaagga aacctgtgaa cacaaccaca gacaatgctc ccggcatgcc 1501 ttctgcacgg actatgccac tggcttctgc tgccactgcc aatccaagtt ttatggaaat 1561 gggaagcact gtctgcctga gggggcacct caccgagtga atgggaaagt gagtggccac 1621 ctccacgtgg gccatacacc cgtgcacttc actgatgtgg acctgcatgc gtatatcgtg 1681 ggcaatgatg gcagagccta cacggccatc agccacatcc cacagccagc agcccaggcc 1741 ctcctccccc tcacaccaat tggaggcctg tttggctggc tctttgcttt agaaaaacct 1801 ggctctgaga acggcttcag cctcgcaggt gctgccttta cccatgacat ggaagttaca 1861 ttctacccgg gagaggagac ggttcgtatc actcaaactg ctgagggact tgacccagag 1921 aactacctga gcattaagac caacattcaa ggccaggtgc cttacgtccc agcaaatttc 1981 acagcccaca tctctcccta caaggagctg taccactact ccgactccac tgtgacctct 2041 acaagttcca gagactactc tctgactttt ggtgcaatca accaaacatg gtcctaccgc 2101 atccaccaga acatcactta ccaggtgtgc aggcacgccc ccagacaccc gtccttcccc 2161 accacccagc agctgaacgt ggaccgggtc tttgccttgt ataatgatga agaaagagtg 2221 cttagatttg ctgtgaccaa tcaaattggc ccggtcaaag aagattcaga ccccactccg 2281 gtgaatcctt gctatgatgg gagccacatg tgtgacacaa cagcacggtg ccatccaggg 2341 acaggtgtag attacacctg tgagtgcgca tctgggtacc agggagatgg acggaactgt 2401 gtggatgaaa atgaatgtgc aactggcttt catcgctgtg gccccaactc tgtatgtatc 2461 aacttgcctg gaagctacag gtgtgagtgc cggagtggtt atgagtttgc agatgaccgg 2521 catacttgca tcttgatcac cccacctgcc aacccctgtg aggatggcag tcatacctgt 2581 gctcctgctg ggcaggcccg gtgtgttcac catggaggca gcacgttcag ctgtgcctgc 2641 ctgcctggtt atgccggcga tgggcaccag tgcactgatg tagatgaatg ctcagaaaac 2701 agatgtcacc ctgcagctac ctgctacaat actcctggtt ccttctcctg ccgttgtcaa 2761 cccggatatt atggggatgg atttcagtgc atacctgact ccacctcaag cctgacaccc 2821 tgtgaacaac agcagcgcca tgcccaggcc cagtatgcct accctggggc ccggttccac 2881 atcccccaat gcgacgagca gggcaacttc ctgcccctac agtgtcatgg cagcactggt 2941 ttctgctggt gcgtggaccc tgatggtcat gaagttcctg gtacccagac tccacctggc 3001 tccaccccgc ctcactgtgg accatcacca gagcccaccc agaggccccc gaccatctgt 3061 gagcgctgga gggaaaacct gctggagcac tacggtggca ccccccgaga tgaccagtac 3121 gtgccccagt gcgatgacct gggccacttc atccccctgc agtgccacgg aaagagcgac 3181 ttctgctggt gtgtggacaa agatggcaga gaggtgcagg gcacccgctc ccagccaggc 3241 accacccctg cgtgtatacc caccgtcgct ccacccatgg tccggcccac gccccggcca 3301 gatgtgaccc ctccatctgt gggcaccttc ctgctctata ctcagggcca gcagattggc 3361 tacttacccc tcaatggcac caggcttcag aaggatgcag ctaagaccct gctgtctctg 3421 catggctcca taatcgtggg aattgattac gactgccggg agaggatggt gtactggaca 3481 gatgttgctg gacggacaat cagccgtgcc ggtctggaac tgggagcaga gcctgagacg 3541 atcgtgaatt caggtctgat aagccctgaa ggacttgcca tagaccacat ccgcagaaca 3601 atgtactgga cggacagtgt cctggataag atagagagcg ccctgctgga tggctctgag 3661 cgcaaggtcc tcttctacac agatctggtg aatccccgtg ccatcgctgt ggatccaatc 3721 cgaggcaact tgtactggac agactggaat agagaagctc ctaaaattga aacgtcatct 3781 ttagatggag aaaacagaag aattctgatc aatacagaca ttggattgcc caatggctta 3841 acctttgacc ctttctctaa actgctctgc tgggcagatg caggaaccaa aaaactggag 3901 tgtacactac ctgatggaac tggacggcgt gtcattcaaa acaacctcaa gtaccccttc 3961 agcatcgtaa gctatgcaga tcacttctac cacacagact ggaggaggga tggtgttgta 4021 tcagtaaata aacatagtgg ccagtttact gatgagtatc tcccagaaca acgatctcac 4081 ctctacggga taactgcagt ctacccctac tgcccaacag gaagaaagta agtacagtaa 4141 tgtaaaggaa gacttggagt ttacaatcag aacctggacc ctaaagaaca gtgactgcaa 4201 aggcaaagaa agtaaaaaag gaattggcca ttagacgttc ctgagcatcc aagatgaaca 4261 ttttgtagtg caaaaagact tttgtgaaaa gctgatacct caatctttac tactgtattt 4321 ttaaaaatga aggttgttat tgcaagttta aaaaggtaac agaattttaa ctgttgctta 4381 ttaaagcaac ttcttgtaaa catttatcat taatatttaa aagatcaaat tcattcaact 4441 aagaattaga gtttaagact ctaaacctga tttttgccat ggattccttc tggccaagaa 4501 attaaagcac atgtgatcaa tataacaata taatcctaaa ccttgacagt tggagaagcc 4561 aatgcagaac tgatgggaaa ggaccaatta tttatagttt cccaacaaaa gttctaagat 4621 tttttacctc tgcatcagtg catttctatt tatatcaaaa ggtgctaaaa tgattcaatt 4681 tgcattttct gatcctgtag tgcctctata gaagtaccca cagaaagtaa agtatcacat 4741 ttataaatac caaagatgta acaattttaa aattttctag attactccaa taaagtgttt 4801 taagtttaaa aaaaaaaaaa aaaaaaaaa // LOCUS D86479 2839 bp mRNA PRI 26-DEC-1996 DEFINITION Human mRNA for AEBP1 gene, complete cds. ACCESSION D86479 NID g1468942 KEYWORDS AEBP1. SOURCE Homo sapiens osteoblast cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ohno,I., Hashimoto,J., Shimizu,K., Takaoka,K., Ochi,T., Matsubara,K. and Okubo,K. TITLE A cDNA cloning of human AEBP1 from primary cultured osteoblasts and its expression in a differentiating osteoblastic cell line JOURNAL Biochem. Biophys. Res. Commun. 228 (2), 411-414 (1996) MEDLINE 97079196 REFERENCE 2 (bases 1 to 2839) AUTHORS Ohno,I., Hashimoto,J., Kouta,S., Takaoka,K., Ochi,T., Okubo,K. and Matsubara,K. TITLE The cloning of a cDNA for human AEBP1 regulating of the differentiation of osteoblasts JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 2839) AUTHORS Ohno,I. TITLE Direct Submission JOURNAL Submitted (11-JUL-1996) to the DDBJ/EMBL/GenBank databases. Ikko Ohno, Institute for Molecular and Cellular Biology, Osaka University, Molecular Genetics Matsubara Laboratory; 1-3 Yamada-oka, Suita, Osaka 565, Japan (E-mail:ikko@imcb.osaka-u.ac.jp, Tel:81-6-879-7992, Fax:81-6-877-1922) FEATURES Location/Qualifiers source 1..2839 /organism="Homo sapiens" /note="primary cultured" /db_xref="taxon:9606" /cell_type="osteoblast" /tissue_type="cancellous bone" CDS 1..2538 /codon_start=1 /product="AEBP1" /db_xref="PID:d1013781" /db_xref="PID:g1468943" /translation="MDYYFGPPPPQKPDAERQTDEEKEELKKPKKEDSSPKEETDKWA VEKGKDHKEPRKGEELEEEWTPTEKVKCPPIGMESHRIEDNQIRASSMLRHGLGAQRG RLNMQTGATEDDYYDGAWCAEDDARTQWIEVDTRRTTRFTGVITQGRDSSIHDDFVTT FFVGFSNDSQTWVMYTNGYEEMTFHGNVDKDTPVLSELPEPVVARFIRIYPLTWNGSL CMRLEVLGCSVAPVYSYYAQNEVVATDDLDFRHHSYKDMRQLMKVVNEECPTITRTYS LGKSSRGLKIYAMEISDNPGEHELGEPEFRYTAGIHGNEVLGRELLLLLMQYLCREYR DGNPRVRSLVQDTRIHLVPSLNPDGYEVAAQMGSEFGNWALGLWTEEGFDIFEDFPDL NSVLWGAEERKWVPYRVPNNNLPIPERYLSPDATVSTEVRAIIAWMEKNPFVLGANLN GGERLVSYPYDMARTPTQEQLLAAAMAAARGEDEDEVSEAQETPDHAIFRWLAISFAS AHLTLTEPYRGGCQAQDYTGGMGIVNGAKWNPRTGTINDFSYLHTNCLELSFYLGCDK FPHESELPREWENNKEALLTFMEQVHRGIKGVVTDEQGIPIANATISVSGINHGVKTA SGGDYWRILNPGEYRVTAHAEGYTPSAKTCNVDYDIGATQCNFILARSNWKRIREIMA MNGNRPIPHIDPSRPMTPQQRRLQQRRLQHRLRLRAQMRLRRLNATTTLGPHTVPPTL PPAPATTLSTTIEPWGLIPPTTAGWEESETETYTEVVTEFGTEVEPEFGTKVEPEFET QLEPEFETQLEPEFEEEEEEEKEEEIATGQAFPFTTVETYTVNFGDF" BASE COUNT 659 a 849 c 860 g 471 t ORIGIN 1 atggactatt actttgggcc tcctccgccc cagaagcccg atgctgagcg ccagacggac 61 gaagagaagg aggagctgaa gaaacccaaa aaggaggaca gcagccccaa ggaggagacc 121 gacaagtggg cagtggagaa gggcaaggac cacaaagagc cccgaaaggg cgaggagttg 181 gaggaggagt ggacgcctac ggagaaagtc aagtgtcccc ccattgggat ggagtcacac 241 cgtattgagg acaaccagat ccgagcctcc tccatgctgc gccacggcct gggggcacag 301 cgcggccggc tcaacatgca gaccggtgcc actgaggacg actactatga tggtgcgtgg 361 tgtgccgagg acgatgccag gacccagtgg atagaggtgg acaccaggag gactacccgg 421 ttcacaggcg tcatcaccca gggcagagac tccagcatcc atgacgattt tgtgaccacc 481 ttcttcgtgg gcttcagcaa tgacagccag acatgggtga tgtacaccaa cggctatgag 541 gaaatgacct ttcatgggaa cgtggacaag gacacacccg tgctgagtga gctcccagag 601 ccggtggtgg ctcgtttcat ccgcatctac ccactcacct ggaatggcag cctgtgcatg 661 cgcctggagg tgctggggtg ctctgtggcc cctgtctaca gctactacgc acagaatgag 721 gtggtggcca ccgatgacct ggatttccgg caccacagct acaaggacat gcgccagctc 781 atgaaggtgg tgaacgagga gtgccccacc atcacccgca cttacagcct gggcaagagc 841 tcacgaggcc tcaagatcta tgccatggag atctcagaca accctgggga gcatgaactg 901 ggggagcccg agttccgcta cactgctggg atccatggca acgaggtgct gggccgagag 961 ctgttgctgc tgctcatgca gtacctgtgc cgagagtacc gcgatgggaa cccacgtgtg 1021 cgcagcctgg tgcaggacac acgcatccac ctggtgccct cactgaaccc tgatggctac 1081 gaggtggcag cgcagatggg ctcagagttt gggaactggg cgctgggact gtggactgag 1141 gagggctttg acatctttga agatttcccg gatctcaact ctgtgctctg gggagctgag 1201 gagaggaaat gggtccccta ccgggtcccc aacaataact tgcccatccc tgaacgctac 1261 ctttcgccag atgccacggt atccacggag gtccgggcca tcattgcctg gatggagaag 1321 aaccccttcg tgctgggagc aaatctgaac ggcggcgagc ggctagtatc ctacccctac 1381 gatatggccc gcacgcctac ccaggagcag ctgctggccg cagccatggc agcagcccgg 1441 ggggaggatg aggacgaggt ctccgaggcc caggagactc cagaccacgc catcttccgg 1501 tggcttgcca tctccttcgc ctccgcacac ctcaccttga ccgagcccta ccgcggaggc 1561 tgccaagccc aggactacac cggcggcatg ggcatcgtca acggggccaa gtggaacccc 1621 cggaccggga ctatcaatga cttcagttac ctgcatacca actgcctgga gctctccttc 1681 tacctgggct gtgacaagtt ccctcatgag agtgagctgc cccgcgagtg ggagaacaac 1741 aaggaggcgc tgctcacctt catggagcag gtgcaccgcg gcattaaggg ggtggtgacg 1801 gacgagcaag gcatccccat tgccaacgcc accatctctg tgagtggcat taatcacggc 1861 gtgaagacag ccagtggtgg tgattactgg cgaatcttga acccgggtga gtaccgcgtg 1921 acagcccacg cggagggcta caccccgagc gccaagacct gcaatgttga ctatgacatc 1981 ggggccactc agtgcaactt catcctggct cgctccaact ggaagcgcat ccgggagatc 2041 atggccatga acgggaaccg gcctatccca cacatagacc catcgcgccc tatgaccccc 2101 caacagcgac gcctgcagca gcgacgccta caacaccgcc tgcggcttcg ggcacagatg 2161 cggctgcggc gcctcaacgc caccaccacc ctaggccccc acactgtgcc tcccacgctg 2221 ccccctgccc ctgccaccac cctgagcact accatagagc cctggggcct cataccgcca 2281 accaccgctg gctgggagga gtcggagact gagacctaca cagaggtggt gacagagttt 2341 gggaccgagg tggagcccga gtttgggacc aaggtggagc ccgagtttga gacccagttg 2401 gagcctgagt ttgagaccca gctggaaccc gagtttgagg aagaggagga ggaggagaaa 2461 gaggaggaga tagccactgg ccaggcattc cccttcacaa cagtagagac ctacacagtg 2521 aactttgggg acttctgaga tcagcgtcct accaagaccc cagcccaact caagctacag 2581 cagcagcact tcccaagcct gctgaccaca gtcacatcac ccatcagcac atggaaggcc 2641 cctggtatgg acactgaaag gaagggctgg tcctgcccct ttgagggggt gcaaacatga 2701 ctgggaccta agagccagag gctgtgtaga ggctcctgct ccacctgcca gtctcgtaag 2761 agatggggtt gctgcagtgt tggagtaggg gcagagggag ggagccaagg tcactccaat 2821 aaaacaagct catggcaaa // LOCUS D86519 1957 bp mRNA PRI 24-DEC-1996 DEFINITION Human mRNA for neuropeptide y/peptide YY Y6 receptor, complete cds. ACCESSION D86519 NID g1731789 KEYWORDS Y6; Y6 encoding protein; neuropeptide Y/peptide YY receptor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Matsumoto,M., Nomura,T., Momose,K., Ikeda,Y., Kondou,Y., Akiho,H., Togami,J., Kimura,Y., Okada,M. and Yamaguchi,T. TITLE Inactivation of a novel neuropeptide Y/peptide YY receptor gene in primate species JOURNAL J. Biol. Chem. 271 (44), 27217-27220 (1996) MEDLINE 97066888 REFERENCE 2 (bases 1 to 1957) AUTHORS Matsumoto,M. JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 1957) AUTHORS Matsumoto,M. TITLE Direct Submission JOURNAL Submitted (12-JUL-1996) to the DDBJ/EMBL/GenBank databases. Mitsuyuki Matsumoto, Yamanouchi Pharmaceutical Co.,Ltd., Neuroscience & Gastrointestinal Research Lab.; 21, Miyukigaoka, Tsukuba, Ibaraki 305, Japan (E-mail:matsum_m@yamanouchi.co.jp, Tel:+81-298-52-5111, Fax:+81-298-56-2515) FEATURES Location/Qualifiers source 1..1957 /organism="Homo sapiens" /db_xref="taxon:9606" gene 439..1311 /gene="Y6" CDS 439..1311 /gene="Y6" /codon_start=1 /product="Y6 encoding protein" /db_xref="PID:d1013790" /db_xref="PID:g1731790" /translation="MEVSLNHPASNTTSTKNNNSAFFYFESCQPPSPALLLLCIAYTV VLIVGLFGNLSLIIIIFKKQRKAQNFTSILIANLSLSDTLVCVMCIHFTIIYTLMDHW IFGDTMCRLTSYVQSVSISVSIFSLVFTAVERYQLIVNPRGWKPSVTHAYWGITLIWL FSLLLSIPFFLSYHLTDEPFRNLSLPTDLYTHQVACVENWPSKKDRLLFTTSLFLLQY FVPLGFILICYLKIVICLRRRNAKVDKKKENEGRLNENKRINTMLISIVVTFGACWLP RISSMSSLTGIMRC" BASE COUNT 554 a 485 c 368 g 550 t ORIGIN 1 ttgataggga tagaaacaca tttggctgct tctatagtta acaagatgct gttacattcc 61 ttgcctcact agctctgaag actatactag cgggacaaag aaagcacctg agatgagctg 121 agaggagggt aaaggtacac agagatcccc tggatatttg ttctatgtcc tctcaggggc 181 tttgctacca ctagagaatt atccatatta agaacttgca ttgatattct gggttctgtt 241 tcatttttta gggtctcaag agcacgctca agtcattcac atgtttccat caaatacaga 301 cacagatcag ggaagattaa accctactaa tttctcgtcg gatgcctcac aacaaggtgc 361 cttccaagaa ctaatggcca aaatatccac ccacaacaca aataagctta gaaaatctct 421 tcttacaatc ctgacacaat ggaagtttcc ctaaaccacc cagcatctaa tacaaccagc 481 acaaagaaca acaactcggc atttttttac tttgagtcct gtcaacctcc ttctccagct 541 ttactcctat tatgcatagc ctatactgtg gtcttaattg tgggcctttt tggaaacctc 601 tctctcatca tcatcatctt taagaagcag agaaaagctc agaatttcac cagcatactg 661 attgccaatc tctccctctc tgataccttg gtgtgtgtca tgtgcatcca ttttactatc 721 atctacactc tgatggacca ctggatattt ggggatacca tgtgcagact cacatcctat 781 gtgcagagtg tctcaatctc tgtgtccata ttctcacttg tattcactgc tgtcgaaaga 841 tatcagctaa ttgtgaaccc ccgtggctgg aagcccagtg tgactcatgc ctactggggc 901 atcacactga tttggctgtt ttcccttctg ctgtctattc ccttcttcct gtcctaccac 961 ctcactgatg agcccttccg caacctctct ctccccactg acctctacac ccaccaggtg 1021 gcctgtgtgg agaactggcc ctccaaaaag gaccggctgc tcttcaccac ctcccttttt 1081 ctgctgcagt attttgttcc tctaggcttc atcctcatct gctacttgaa gattgttatc 1141 tgcctccgca ggagaaatgc aaaggtagat aagaagaagg aaaatgaggg ccggctcaat 1201 gagaacaaga ggatcaacac aatgttgatt tccatcgtgg tgacctttgg agcctgctgg 1261 ctgccccgaa tatcttcaat gtcatctttg actggtatca tgaggtgctg atgagctgcc 1321 accacgacct ggtatttgta gtttgccact tggttgctat ggtttccaca tgtataaacc 1381 ctctctttta tggctttctc aacaaaaatt tccaaaagga cctggtagtg cttattcacc 1441 actgctggtg cttcacacct caggaaagat gtgaaaatat tgccatctcc actatgcaca 1501 cagactccaa gaggtcttta agattggctc gtataacaac aggtatatga aaattgataa 1561 tgctgaagct cttcttgaat gggagctgga caggtaatgg tgggaatagg gcaagatgca 1621 gaaagaagaa accagaacca aaaatagcaa ctttataccc acttttcctt taggctaaga 1681 ctgcctgtct catatgtcta tccaacacac cctccaacat acacgaacac acataccacc 1741 ccttttctct taagaaaata actctaataa ttcaaacaac ctgcccgcca tcatttgtgg 1801 caaagaatga gaatgagaaa gcagagagag aggcaaacag cagtgatggc tggggaacaa 1861 tgttcacaga tacttttatt caatggaata tctacaaaag ttatgactaa tgatatgcct 1921 agtaaaaaca ctgctatacc tccttagcac tgagaat // LOCUS D86640 2963 bp mRNA PRI 24-JAN-1997 DEFINITION Human mRNA for stac, complete cds. ACCESSION D86640 NID g1799567 KEYWORDS stac. SOURCE Homo sapiens fetal brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Suzuki,H., Kawai,J., Taga,C., Yaoi,T., Hara,A., Hirose,K., Hayashizaki,Y. and Watanabe,S. TITLE Stac, a novel neuron-specific protein with cysteine-rich and SH3 domains JOURNAL Biochem. Biophys. Res. Commun. 229 (3), 902-909 (1996) MEDLINE 97115677 REFERENCE 2 (bases 1 to 2963) AUTHORS Suzuki,H., Kawai,J., Taga,C., Yaoi,T., Hara,A., Hirose,K., Hayashizaki,Y. and Watanabe,S. TITLE Stac, a Novel Neuron-specific Protein with Cysteine-rich and SH3 Domains JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 2963) AUTHORS Suzuki,H. TITLE Direct Submission JOURNAL Submitted (29-JUL-1996) to the DDBJ/EMBL/GenBank databases. Harukazu Suzuki, Shionogi Co., Ltd., Shionogi Institute for Medical Science; Mishima 2-5-1, Settsu, Osaka 566, Japan (Tel:06-382-2612, Fax:06-382-2598) FEATURES Location/Qualifiers source 1..2963 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="brain" gene 40..1248 /gene="Stac" CDS 40..1248 /gene="Stac" /codon_start=1 /product="stac" /db_xref="PID:d1013840" /db_xref="PID:g1799568" /translation="MIPPSSPREDGVDGLPKEAVGAEQPPSPASTSSQESKLQKLKRS LSFKTKSLRSKSADNFFQRTNSEDMKLQAHMVAEISPSSSPLPAPGSLTSTPARAGLH PGGKAHAFQEYIFKKPTFCDVCNHMIVGTNAKHGLRCKACKMSIHHKCTDGLAPQRCM GKLPKGFRRYYSSPLLIHEQFGCIKEVMPIACGNKVDPVYETLRFGTSLAQRTKKGSS GSGSDSPHRTSTSDLVEVPEEANGPGGGYDLRKRSNSVFTYPENGTDDFRDPAKNINH QGSLSKDPLQMNTYVALYKFVPQENEDLEMRPGDIITLLEDSNEDWWKGKIQDRIGFF PANFVQRLQQNEKIFRCVRTFIGCKEQGQITLKENQICVSSEEEQDGFIRVLSGKKKG LIPLDVLENI" polyA_signal 2573..2578 polyA_signal 2928..2933 BASE COUNT 819 a 737 c 689 g 718 t ORIGIN 1 gttcctccgg gagcccaaca ccgttcccgc gcggccacga tgatccctcc gagcagcccc 61 cgcgaggacg gcgtggacgg gctgcccaag gaggcggtgg gcgccgagca accgccctct 121 cctgcatcca ccagcagcca ggaatccaag ctccagaaac taaaacgatc actttctttc 181 aagaccaaga gtttacggag caaaagtgct gacaacttct tccagcgaac caacagcgaa 241 gacatgaaac tgcaagcaca catggtggct gagatcagcc ccagctccag cccactccct 301 gctccaggaa gcctgacgtc cacacccgcc agggctggtc tgcatccagg tggcaaggct 361 catgcctttc aggaatacat cttcaagaag cccactttct gtgatgtctg caaccacatg 421 atagtgggaa caaatgctaa gcatggactg cgctgcaaag cctgtaagat gagcatccac 481 cacaagtgca cagatggcct ggcaccccag cggtgcatgg gcaagctgcc aaaggggttt 541 cggcgttact acagctcccc cttgctcatt catgaacagt ttggctgcat taaagaagtt 601 atgcccattg cctgtggcaa taaggtggac cctgtctacg agaccctccg cttcggcacc 661 tccctggccc agaggacaaa gaagggcagc tccggcagtg gctctgactc acctcacaga 721 acctctactt cagatcttgt ggaggttcct gaggaagcca atgggccagg aggcgggtat 781 gacctaagga aacgcagcaa cagcgtgttt acatatccag aaaatggcac tgatgatttc 841 agagatccag cgaagaacat aaaccaccag ggatctcttt ccaaagaccc attacagatg 901 aacacctatg ttgccttgta caaatttgta ccacaggaga atgaagattt ggaaatgagg 961 ccaggagaca taattactct tttagaggat tccaatgaag actggtggaa agggaaaatt 1021 caagacagaa ttggcttctt tccagccaac tttgttcaga gactacaaca aaatgagaag 1081 atttttagat gtgttagaac cttcattggg tgtaaggaac aggggcagat aacactgaaa 1141 gagaatcaga tctgcgtgag ttctgaagaa gaacaagatg gttttatcag agtcctcagt 1201 ggaaaaaaga aaggcctcat cccccttgat gtactagaaa acatctgatt gctggctcct 1261 cctccgtttg cagtaggcaa gctctgctgc gatgcctctg cctcatctca cactgcgtca 1321 acccaaagga gctgccgcac tgacccagcc ccccaggaaa cagtgagaca agaatcaagt 1381 atctgagact gtggagtaat agccacaaaa cagagggccc actgcacagc atatccaggc 1441 tgccacaggt ggggacgagg ctgagagagt cagcaggcag agccagatgc catgcttggc 1501 agcagcagta ggactataaa ccacagctgt cccccaggat cccactcctt tcctgtctgt 1561 gtggtgtaag ttaacacact ggagtgtgct ccagtttgca gggtagccca gtgcaaggtt 1621 cagatccatg tagctaagta ttatcctgct tccagaccta tgtcaccagt accaatcagt 1681 cagtgtcatc acatttcagg ccccaagcaa tctctgtgca aagcatcaga aagacctgct 1741 tcccagcccc cagcattcca gtgctctcca ggcttcctct ctttgtgatt gtgctgtcca 1801 gagtgtccag cttgttcttt ctttctcttc agtcctctga gtacatctgg tggtgtgcat 1861 tagatgtgag ggctatgttg acatggcatc acctccaaag acctgacctg cctaaagact 1921 gatgacaggc catccttcct gctgttctag gtactggcct gggtgacaga gcaggacatg 1981 agacatagat acagtgggga ggagaagtgg ggaaaggtgg agcagagagt tcttacttat 2041 tgaagattat acagcccttt cggttatgaa gtccctgctt gaaggcaatg gacctgggga 2101 agagactatc acaaaaagtc tccattttca ttttacatcc tctctattgg aggcagcact 2161 tttccctcat gctgtcctat aggactccac tttgaaggtt gtgcctacgt tgcagggaac 2221 taggaacatg gaggggaacc aacaacagca tcttagaaga aatgtagcca aattggagtc 2281 cattcttctt tagggcagta tatgaaatcc tagcagatgt aaaatggaaa agaatcctaa 2341 tgcttcttcc ttcagaaagt agaggaacta ggggcccaat tagcatcatc taggggaatc 2401 tctattactc tgtacttata ctaatgttta caagaatgca atatactgtg atgccttcct 2461 actcaagcct cctagcattc aaacttccat cctattagtc attaacgtgg ttaaacttca 2521 attcacaatc accttggaat caatgtcagt ttgatttatt ttgttacaga gcaataaaat 2581 cattagaaca atggttttta aaagacttaa gtggatgcat cctatgcatg taagcattat 2641 ttcccaaacc aaagcatcat tcatctccat ttatttcttt tgttcccgct cagtgtgaag 2701 ttgggaactg agaggggatg gctgctggtt tcacagaagt ttggatgcct tactcttatc 2761 ttaaagccag catccagaat ttcctccctc tctgatgcca catgcaaaac caggtgtacg 2821 atgtcaagat ggattctttg agagccaggg tagatcatat gactgccttt ctgtaaaatt 2881 ggatgcctag tacaaatgct ctttgcctct aattcagtat tccatttaat aaaaacaata 2941 cagtggctgg aaaggagcat cag // LOCUS D86724 1354 bp mRNA PRI 01-JUL-1997 DEFINITION Homo sapiens mRNA for nonhepatic arginase, complete cds. ACCESSION D86724 NID g1694632 KEYWORDS nonhepatic arginase. SOURCE Homo sapiens cell_line:HepG2 liver tumor cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1354) AUTHORS Gotoh,T. TITLE Direct Submission JOURNAL Submitted (29-JUL-1996) to the DDBJ/EMBL/GenBank databases. Tomomi Gotoh, Kumamoto University School of Medicine, Department of Molecular Genetics; Kuhonji 4-24-1, Kumamoto, Kumamoto 862, Japan (E-mail:tomomi@gpo.kumamoto-u.ac.jp, Tel:096-373-5143, Fax:096-373-5145) REFERENCE 2 (bases 1 to 1354) AUTHORS Gotoh,T. TITLE Molecular cloning of cDNA for nonhepatic arginase (arginase II) JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Gotoh,T., Sonoki,T., Nagasaki,A., Terada,K., Takiguchi,M. and Mori,M. TITLE Molecular cloning of cDNA for nonhepatic mitochondrial arginase (arginase II) and comparison of its induction with nitric oxide synthase in a murine macrophage-like cell line JOURNAL FEBS Lett. 395 (2-3), 119-122 (1996) MEDLINE 97053663 REFERENCE 4 (sites) AUTHORS Gotoh,T., Araki,M. and Mori,M. TITLE Chromosomal localization of the human arginase II gene and tissue distribution of its mRNA JOURNAL Biochem. Biophys. Res. Commun. 233 (2), 487-491 (1997) MEDLINE 97289658 FEATURES Location/Qualifiers source 1..1354 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HepG2 liver tumor" CDS 21..1085 /codon_start=1 /product="nonhepatic arginase" /db_xref="PID:d1013846" /db_xref="PID:g1694633" /translation="MSLRGSLSRLLQTRVHSILKKSVHSVAVIGAPFSQGQKRKGVEH GPAAIREAGLMKRLSSLGCHLKDFGDLSFTPVPKDDLYNNLIVNPRSVGLANQELAEV VSRAVSDGYSCVTLGGDHSLAIGTISGHARHCPDLCVVWVDAHADINTPLTTSSGNLH GQPVSFLLRELQDKVPQLPGFSWIKPCISSASIVYIGLRDVDPPEHFILKNYDIQYFS MRDIDRLGIQKVMERTFDLLIGKRQRPIHLSFDIDAFDPTLAPATGTPVVGGLTYREG MYIAEEIHNTGLLSALDLVEVNPQLATSEEEAKTTANLAVDVIASSFGQTREGGHIVY DQLPTPSSPDESENQARVRI" polyA_site 1354 /note="51 A nucleotides" BASE COUNT 376 a 304 c 309 g 365 t ORIGIN 1 gattctcagt gctgcggatc atgtccctaa ggggcagcct ctcgcgtctc ctccagacgc 61 gagtgcattc catcctgaag aaatccgtcc actccgtggc tgtgatagga gccccgttct 121 cacaagggca gaaaagaaaa ggagtggagc atggtcccgc tgccataaga gaagctggct 181 tgatgaaaag gctctccagt ttgggctgcc acctaaaaga ctttggagat ttgagtttta 241 ctccagtccc caaagatgat ctctacaaca acctgatagt gaatccacgc tcagtgggtc 301 ttgccaacca ggaactggct gaggtggtta gcagagctgt gtcagatggc tacagctgtg 361 tcacactggg aggagaccac agcctggcaa tcggtaccat tagtggccat gcccgacact 421 gcccagacct ttgtgttgtc tgggttgatg cccatgctga catcaacaca ccccttacca 481 cttcatcagg aaatctccat ggacagccag tttcatttct cctcagagaa ctacaggata 541 aggtaccaca actcccagga ttttcctgga tcaaaccttg tatctcttct gcaagtattg 601 tgtatattgg tctgagagac gtggaccctc ctgaacattt tattttaaag aactatgata 661 tccagtattt ttccatgaga gatattgatc gacttggtat ccagaaggtc atggaacgaa 721 catttgatct gctgattggc aagagacaaa gaccaatcca tttgagtttt gatattgatg 781 catttgaccc tacactggct ccagccacag gaactcctgt tgtcggggga ctaacctatc 841 gagaaggcat gtatattgct gaggaaatac acaatacagg gttgctatca gcactggatc 901 ttgttgaagt caatcctcag ttggccacct cagaggaaga ggcgaagact acagctaacc 961 tggcagtaga tgtgattgct tcaagctttg gtcagacaag agaaggaggg catattgtct 1021 atgaccaact tcctactccc agttcaccag atgaatcaga aaatcaagca cgtgtgagaa 1081 tttaggagac actgtgcact gacatgtttc acaacaggca ttccagaatt atgaggcatt 1141 gaggggatag atgaatacta aatggttgtc tgggtcaata ctgccttaat gagaacattt 1201 acacattctc acaattgtaa agtttcccct ctattttggt gaccaatact actgtaaatg 1261 tatttggttt tttgcagttc acagggtatt aatatgctat agtactatgt aaatttaaag 1321 aagtcataaa cagcatttat taccttggta tatc // LOCUS D86955 799 bp mRNA PRI 06-MAR-1997 DEFINITION Human mRNA for CC chemokine LARC precursor, complete cds. ACCESSION D86955 NID g1871138 KEYWORDS CC chemokine LARC precursor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Hieshima,K., Imai,T., Opdenakker,G., Van Damme,J., Kusuda,J., Tei,H., Sakaki,Y., Takatsuki,K., Miura,R., Yoshie,O. and Nomiyama,H. TITLE Molecular cloning of a novel human CC chemokine liver and activation-regulated chemokine (LARC) expressed in liver. Chemotactic activity for lymphocytes and gene localization on chromosome 2 JOURNAL J. Biol. Chem. 272 (9), 5846-5853 (1997) MEDLINE 97190319 REFERENCE 2 (bases 1 to 799) AUTHORS Hieshima,K., Imai,T., Opdenakker,G., Van Damme,J., Kusuda,J., Tei,H., Sakaki,Y., Takatsuki,K., Miura,R., Yoshie,O. and Nomiyama,H. JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 799) AUTHORS Nomiyama,H. TITLE Direct Submission JOURNAL Submitted (08-AUG-1996) to the DDBJ/EMBL/GenBank databases. Hisayuki Nomiyama, Kumamoto University Medical School, Department of Biochemistry; Honjo 2-2-1, Kumamoto, Kumamoto 860, Japan (E-mail:nomiyama@gpo.kumamoto-u.ac.jp, Tel:+81-96-373-5063) FEATURES Location/Qualifiers source 1..799 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="2" /map="q33-37" sig_peptide 59..136 /gene="LARC" CDS 59..349 /gene="LARC" /codon_start=1 /product="CC chemokine LARC precursor" /db_xref="PID:d1013880" /db_xref="PID:g1871139" /translation="MCCTKSLLLAALMSVLLLHLCGESEAASNFDCCLGYTDRILHPK FIVGFTRQLANEGCDINAIIFHTKKKLSVCANPKQTWVKYIVRLLSKKVKNM" gene 59..349 /gene="LARC" mat_peptide 137..346 /gene="LARC" /product="CC chemokine LARC" BASE COUNT 240 a 138 c 153 g 268 t ORIGIN 1 cactcccaaa gaactgggta ctcaacactg agcagatctg ttctttgagc taaaaaccat 61 gtgctgtacc aagagtttgc tcctggctgc tttgatgtca gtgctgctac tccacctctg 121 cggcgaatca gaagcagcaa gcaactttga ctgctgtctt ggatacacag accgtattct 181 tcatcctaaa tttattgtgg gcttcacacg gcagctggcc aatgaaggct gtgacatcaa 241 tgctatcatc tttcacacaa agaaaaagtt gtctgtgtgc gcaaatccaa aacagacttg 301 ggtgaaatat attgtgcgtc tcctcagtaa aaaagtcaag aacatgtaaa aactgtggct 361 tttctggaat ggaattggac atagcccaag aacagaaaga accttgctgg ggttggaggt 421 ttcacttgca catcatggag ggtttagtgc ttatctaatt tgtgcctcac tggacttgtc 481 caattaatga agttgattca tattgcatca tagtttgctt tgtttaagca tcacattaaa 541 gttaaactgt attttatgtt atttatagct gtaggttttc tgtgtttagc tatttaatac 601 taattttcca taagctattt tggtttagtg caaagtataa aattatattt gggggggaat 661 aagattatat ggactttctt gcaagcaaca agctattttt taaaaaaact atttaacatt 721 cttttgttta tattgttttg tctcctaaat tgttgtaatt gcattataaa ataagaaaaa 781 cattaataag acaaatatt // LOCUS D86956 3614 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0201 gene, complete cds. ACCESSION D86956 NID g1503985 KEYWORDS KIAA0201. SOURCE Homo sapiens male myeloblast cell_line:KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3614) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (02-AUG-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 3614) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohara,O. and Nomura,N. TITLE Prediction of the coding sequences of unidentified huma genes. VI. The coding sequences of 80 new genes (KIAA 0201-KIAA 0280) deduced by analysis of cDNA clones from human cell line KG-1 and brain JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohira,M., Kawarabayasi,Y., Ohara,O., Tanaka,A., Kotani,H., Miyajima,N. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain JOURNAL DNA Res. 3 (5), 321-329 (1996) MEDLINE 97191544 FEATURES Location/Qualifiers source 1..3614 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /chromosome="13" /sex="male" gene 348..2924 /gene="KIAA0201" CDS 348..2924 /gene="KIAA0201" /note="similar to mouse heat shock protein 105 kDa beta" /citation=[3] /codon_start=1 /db_xref="PID:d1013881" /db_xref="PID:g1503986" /translation="MSVVGLDVGSQSCYIAVARAGGIETIANEFSDRCTPSVISFGSK NRTIGVAAKNQQITHANNTVSNFKRFHGRAFNDPFIQKEKENLSYDLVPLKNGGVGIK VMYMGEEHLFSVEQITAMLLTKLKETAENSLKKPVTDCVISVPSFFTDAERRSVLDAA QIVGLNCLRLMNDMTAVALNYGIYKQDLPSLDEKPRIVVFVDMGHSAFQVSACAFNKG KLKVLGTAFDPFLGGKNFDEKLVEHFCAEFKTKYKLDAKSKIRALLRLYQECEKLKKL MSSNSTDLPLNIECFMNDKDVSGKMNRSQFEELCAELLQKIEVPLYSLLEQTHLKVED VSAVEIVGGATRIPAVKERIAKFFGKDISTTLNADEAVARGCALQCAILSPAFKVREF SVTDAVPFPISLIWNHDSEDTEGVHEVFSRNHAAPFSKVLTFLRRGPFELEAFYSDPQ GVPYPEAKIGRFVVQNVSAQKDGEKSRVKVKVRVNTHGIFTISTASMVEKVPTEENEM SSEADMECLNQRPPENPDTDKNVQQDNSEAGTQPQVQTDAQQTSQSPPSPELTSEENK IPDADKANEKKVDQPPEAKKPKIKVVNVELPIEANLVWQLGKDLLNMYIETEGKMIMQ DKLEKERNDAKNAVEEYVYEFRDKLCGPYEKFICEQDHQNFLRLLTETEDWLYEEGED QAKQAYVDKLEELMKIGTPVKVRFQEAEERPKMFEELGQRLQHYAKIAADFRNKDEKY NHIDESEMKKVEKSVNEVMEWMNNVMNAQAKKSLDQDPVVRAQEIKTKIKELNNTCEP VVTQPKPKIESPKLERTPNGPNIDKKEEDLEDKNNFGAEPPHQNGECYPNEKNSVNMD LD" 3'UTR 2925..3614 BASE COUNT 1169 a 679 c 842 g 924 t ORIGIN 1 ctgaggaagt gggacctccc cttttgggtc ggtagttcag cgccggcgcc ggtgtgcgag 61 ccgcggcaga gtgaggcagg caacccgagg tgcggagcga cctgcggagg ctgagccccg 121 ctttctccca gggtttctta tcagccagcc gccgctgtcc ccgggggagt aggaggctcc 181 tgacaggccg cggctgtctg tgtgtccttc tgagtgtcag aggaacggcc agaccccgcg 241 ggccggagca gaacgcggcc agggcagaaa gcggcggcag gagaagcagg cagggggccg 301 gaggacgcag accgagaccc gaggcggagg cggaccgcga gccggccatg tcggtggtgg 361 ggttggacgt gggctcgcag agctgctaca tcgcggtagc ccgggccggg ggcatcgaga 421 ccatcgccaa tgagttcagc gaccggtgca ccccgtcagt catatcattt ggatcaaaaa 481 atagaacaat cggagttgca gccaaaaatc agcaaatcac tcatgcaaac aatacggtgt 541 ctaacttcaa aagatttcat ggccgagcat tcaatgaccc cttcattcaa aaggagaagg 601 aaaacttgag ttacgatttg gttccattga aaaatggtgg agttggaata aaggtaatgt 661 acatgggtga agaacatcta tttagtgtgg agcagataac agccatgttg ttgactaagc 721 tgaaggaaac tgctgaaaac agcctcaaga aaccagtaac agattgtgtt atttcagtcc 781 cctccttctt tacagatgct gagaggcgat ctgtgttaga tgctgcacag attgttggcc 841 taaactgttt aagacttatg aatgacatga cagctgttgc tttgaattac ggaatttata 901 agcaggatct cccaagcctg gatgagaaac ctcggatagt ggtttttgtt gatatgggac 961 attcagcttt tcaagtgtct gcttgtgctt ttaacaaggg aaaattgaag gtactgggaa 1021 cagcttttga tcctttctta ggaggaaaaa acttcgatga aaagttagtg gaacattttt 1081 gtgcagaatt taaaactaag tacaagttgg atgcaaaatc caaaatacga gcactcctac 1141 gtctgtatca ggaatgtgaa aaactgaaaa agctaatgag ctctaacagc acagaccttc 1201 cactgaatat cgaatgcttt atgaatgata aagatgtttc cggaaagatg aacaggtcac 1261 aatttgaaga actctgtgct gaacttctgc aaaagataga agtacccctt tattcactgt 1321 tggaacaaac tcatctcaaa gtagaagatg tgagtgcagt tgagattgtt ggaggcgcta 1381 cacgaattcc agctgtgaag gaaagaattg ccaaattctt tggaaaagat attagcacaa 1441 cactcaatgc agatgaagca gtagccagag gatgtgcatt acagtgtgca atactttccc 1501 cggcatttaa agttagagaa ttttccgtca cagatgcagt tccttttcca atatctctga 1561 tctggaacca tgattcagaa gatactgaag gtgttcatga agtctttagt cgaaaccatg 1621 ctgctccttt ctccaaagtt ctcacctttc tgagaagggg gccttttgag ctagaagctt 1681 tctattctga tccccaagga gttccatatc cagaagcaaa aataggccgc tttgtagttc 1741 agaatgtttc tgcacagaaa gatggagaaa aatctagagt aaaagtcaaa gtgcgagtca 1801 acacccatgg cattttcacc atctctacgg catctatggt ggagaaagtc ccaactgagg 1861 agaatgaaat gtcttctgaa gctgacatgg agtgtctgaa tcagagacca ccagaaaacc 1921 cagacactga taaaaatgtc cagcaagaca acagtgaagc tggaacacag ccccaggtac 1981 aaactgatgc tcaacaaacc tcacagtctc ccccttcacc tgaacttacc tcagaagaaa 2041 acaaaatccc agatgctgac aaagcaaatg aaaaaaaagt tgaccagcct ccagaagcta 2101 aaaagcccaa aataaaggtg gtgaatgttg agctgcctat tgaagccaac ttggtctggc 2161 agttagggaa agaccttctt aacatgtata ttgagacaga gggtaagatg ataatgcaag 2221 ataaattgga aaaagaaagg aatgatgcta aaaatgcagt tgaggaatat gtgtatgagt 2281 tcagagacaa gctgtgtgga ccatatgaaa aatttatatg tgagcaggat catcaaaatt 2341 ttttgagact cctcacagaa actgaagact ggctgtatga agaaggagag gaccaagcta 2401 aacaagcata tgttgacaag ttggaagaat taatgaaaat tggcactcca gttaaagttc 2461 ggtttcagga agctgaagaa cggccaaaaa tgtttgaaga actaggacag aggctgcagc 2521 attatgccaa gatagcagct gacttcagaa ataaggatga gaaatacaac catattgatg 2581 agtctgaaat gaaaaaagtg gagaagtctg ttaatgaagt gatggaatgg atgaataatg 2641 tcatgaatgc tcaggctaaa aagagtcttg atcaggatcc agttgtacgt gctcaggaaa 2701 ttaaaacaaa aatcaaggaa ttgaacaaca catgtgaacc cgttgtaaca caaccgaaac 2761 caaaaattga atcacccaaa ctggaaagaa ctccaaatgg cccaaatatt gataaaaagg 2821 aagaagattt agaagacaaa aacaattttg gtgctgaacc tccacatcag aatggtgaat 2881 gttaccctaa tgagaaaaat tctgttaata tggacttgga ctagataacc ttaaattggc 2941 ctattccttc aattaataaa atatttttgc catagtatgt gactctacat aacatactga 3001 aactatttat attttctttt ttaaggatat ttagaaattt tgtgtattat atggaaaaag 3061 aaaaaaagct taagtctgta gtctttatga tcctaaaagg gaaaattgcc ttggtaactt 3121 tcagattcct gtggaattgt gaattcatac taagctttct gtgcagtctc accatttgca 3181 tcactgagga tgaaactgac ttttgtcttt tggagaaaaa aaactgtact gcttgttcaa 3241 gagggctgtg attaaaatct ttaagcattt gttcctgcca aggtagtttt cttgcatttt 3301 gctctccatt cagcatgtgt gtgggtgtgg atgtttataa acaagactaa gtctgacttc 3361 ataagggctt tctaaaacca tttctgtcca agagaaaatg actttttgct ttgatattaa 3421 aaattcaatg agtaaaacaa aagctagtca aatgtgttag cagcatgcag aacaaaaact 3481 ttaaactttc tctctcacta tacagtatat tgtcatgtga aagtgtggaa tggaagaaat 3541 gtcgatcctg ttgtaactga ttgtgaacac ttttatgagc tttaaaataa agttcatctt 3601 atggtgtcat ttct // LOCUS D86958 6614 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0203 gene, complete cds. ACCESSION D86958 NID g1503989 KEYWORDS KIAA0203. SOURCE Homo sapiens male myeloblast cell_line:KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6614) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (02-AUG-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 6614) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohara,O. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA 0201 - KIAA 0280) deduced by analysis of cDNA clones from human cell line KG-1 and brain JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohira,M., Kawarabayasi,Y., Ohara,O., Tanaka,A., Kotani,H., Miyajima,N. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain JOURNAL DNA Res. 3 (5), 321-329 (1996) MEDLINE 97191544 FEATURES Location/Qualifiers source 1..6614 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /chromosome="8" /sex="male" 5'UTR 1..515 gene 516..5291 /gene="KIAA0203" CDS 516..5291 /gene="KIAA0203" /note="similar to mouse CC1." /citation=[3] /codon_start=1 /db_xref="PID:d1013883" /db_xref="PID:g1503990" /translation="MKLYVFLVNTGTTLTFDTELTVQTVADLKHAIQSKYKIAIQHQV LVVNGGECMAADRRVCTYSAGTDTNPIFLFNKEMILCDRPPAIPKTTFSTENDMEIKV EESLMMPAVFHTVASRTQLALEMYEVAKKLCSFCEGLVHDEHLQHQGWAAIMANLEDC SNSYQKLLFKFESIYSNYLQSIEDIKLKLTHLGTAVSVMAKIPLLECLTRHSYRECLG RLDSLPEHEDSEKAETKRSTELVLSPDMPRTTNESLLTSFPKSVEHVSPDTADAESGK EIRESCQSTVHQQDETTIDTKDGDLPFFNVSLLDWINVQDRPNDVESLVRKCFDSMSR LDPRIIRPFIAECRQTIAKLDNQNMKAIKGLEDRLYALDQMIASCGRLVNEQKELAQG FLANQKRAENLKDASVLPDLCLSHANQLMIMLQNHRKLLDIKQKCTTAKQELANNLHV RLKWCCFVMLHADQDGEKLQALLRLVIELLERVKIVEALSTVPQMYCLAVVEVVRRKM FIKHYREWAGALVKDGKRLYEAEKSKRESFGKLFRKSFLRNRLFRGLDSWPPSFCTQK PRKFDCELPDISLKDLQFLQSFCPSEVQPFLRVPLLCDFEPLHQHVLALHNLVKAAQS LDEMSQTITDLLSEQKASVSQTSPQSASSPRMESTAGITTTTSPRTPPPLTVQDPLCP AVCPLEELSPDSIDAHTFDFETIPHPNIEQTIHQVSLDLDSLAESPESDFMSAVNEFV IEENLSSPNPISDPQSPEMMVESLYSSVINAIDSRRMQDTNVCGKEDFGDHTSLNVQL ERCRVVAQDSHFSIQTIKEDLCHFRTFVQKEQCDFSNSLKCTAVEIRNIIEKVKCSLE ITLKEKHQKELLSLKNEYEGKLDGLIKETEENENKIKKLKGELVCLEEVLQNKDNEFA LVKHEKEAVICLQNEKDQKLLEMENIMHSQNCEIKELKQSREIVLEDLKKLHVENDEK LQLLRAELQSLEQSHLKELEDTLQVRHIQEFEKVMTDHRVSLEELKKENQQIINQIQE SHAEIIQEKEKQLQELKLKVSDLSDTRCKLEVELALKEAETDEIKILLEESRAQQKET LKSLLEQETENLRTEISKLNQKIQDNNENYQVGLAELRTLMTIEKDQCISELISRHEE ESNILKAELNKVTSLHNQAFEIEKNLKEQIIELQSKLDSELSALERQKDEKITQQEEK YEAIIQNLEKDRQKLVSSQEQDREQLIQKLNCEKDEAIQTALKEFKLEREVVEKELLE KVKHLENQIAKSPAIDSTRGDSSSLVAELQEKLQEEKAKFLEQLEEQEKRKNEEMQNV RTSLIAEQQTNFNTVLTREKMRKENIINDLSDKLKSTMQQQERDKDLIESLSEDRARL LEEKKKLEEEVSKLRSSSFVPSPYVATAPELYGACAPELPGESDRSAVETADEGRVDS AMETSMMSVQENIHMLSEEKQRIMLLERTLQLKEEENKRLNQRLMSQSMSSVSSRHSE KIAIRDFQVGDLVLIILDERHDNYVLFTVSPTLYFLHSESLPALDLKPASGASRRPWV LGKVMEKEYCQAKKAQNRFKVPLGTKFYRVKAVSWNKKV" 3'UTR 5292..6614 BASE COUNT 2286 a 1134 c 1350 g 1844 t ORIGIN 1 aacaaaccaa gccgcggcgg tgtccgcggc cctgccgagc cctcggcgtt gcctcagaat 61 cccccagtcg cctgggcccc tcggctctga caggccgcgg ccttctgtcc cccggcccca 121 gacccagagc cgaggggcct gctcgcgtcc ttgtccgccc ggacccctcc ctgcctccta 181 gagttcgggg ccgcggcggg cgggcgcccg ggacgccggc ggttgtgtcg gcttagcggt 241 gccgaatggg cggttggtaa ccgctgccga ggactaggcg gcggcggaag atggtgccgg 301 gggtcgctgg ctctgctgct gccgccggcg aaggaggagg cgttgccggt tttctgagtt 361 taaccagtaa tgccattcag ttgccaatct caagcaaagc aaacataagc cagttttaat 421 ctacttttta agaaaagtgg tagtcctttt cacagtgcct gacgtaactg tatcagaggg 481 tgaggtataa gctcacagaa ttcagataaa tcatcatgaa gttatatgta tttctggtta 541 acactggaac tactctaaca tttgacactg aacttacagt gcaaactgtg gcagacctta 601 agcatgccat tcaaagcaaa tacaagattg ctattcaaca ccaggtgctg gtggtcaatg 661 gaggagaatg catggctgca gatcgaagag tgtgtaccta cagtgctggg acggatacaa 721 atccaatttt tctttttaac aaagaaatga tcttatgtga tcgtccacct gctattccta 781 aaactacctt ttcgacagaa aatgacatgg aaataaaagt tgaagaatct cttatgatgc 841 ctgcagtttt tcatactgtt gcttcaagga cacagcttgc attggaaatg tatgaagttg 901 ccaagaaact ttgttctttt tgtgaaggtc ttgtacatga tgaacatctt caacaccaag 961 gctgggctgc aatcatggcc aacctggagg actgttcaaa ttcataccaa aagctacttt 1021 tcaagtttga aagtatttat tcaaattatc tgcagtccat agaagacatc aagttaaaac 1081 ttactcattt aggaactgca gtttcagtaa tggccaagat tccactgttg gagtgcctaa 1141 ccagacatag ttacagagaa tgtttgggaa gactggattc tttacctgaa catgaagact 1201 cagaaaaagc tgagacgaaa agatccactg aactggtgct ctctcctgat atgcctagaa 1261 caactaacga atctttgtta acctcatttc ccaagtcagt ggaacatgtg tccccagata 1321 ccgcagatgc tgaaagtggc aaagaaatta gggaatcttg tcaaagtact gttcatcagc 1381 aagatgaaac tacgattgac actaaagatg gtgatctgcc cttttttaat gtctctttgt 1441 tagactggat aaatgttcaa gatagaccta atgatgtgga atctttggtc aggaagtgct 1501 ttgattctat gagcaggctt gatccaagga ttattcgacc atttatagca gaatgccgtc 1561 aaactattgc caaacttgat aatcagaata tgaaagccat taaaggactt gaagatcggc 1621 tctacgccct ggaccagatg attgctagct gtggccgact ggtgaatgaa cagaaagagc 1681 ttgctcaggg atttttagct aatcagaaga gagctgaaaa cttaaaggat gcatctgtat 1741 tacctgattt atgcctgagt cacgcaaatc agttgatgat tatgttgcaa aatcatagaa 1801 aactgttaga tattaagcag aagtgtacca ctgccaaaca agaactagca aataacctac 1861 atgtcagact gaagtggtgt tgctttgtaa tgcttcatgc tgatcaagat ggagagaagt 1921 tacaagcttt gctccgcctc gtaatagagc tgttagaaag agtcaaaatt gttgaagctc 1981 ttagtacagt tcctcagatg tactgcttag ctgttgttga ggttgtaaga agaaaaatgt 2041 tcataaaaca ctacagggag tgggctggtg ctttagtcaa agatggaaag agattatatg 2101 aagcagaaaa atcaaaaagg gaatcctttg ggaaattatt taggaagtct tttttaagaa 2161 atcgtctgtt taggggactg gactcctggc ccccttcctt ttgtactcaa aagcctcgaa 2221 agtttgactg tgaacttcca gatatttcat taaaagattt acagtttctg caatcatttt 2281 gtccttcgga agttcagcca ttcctcaggg ttcccttact ttgtgacttt gaacctctac 2341 accagcatgt acttgctcta cataatttgg taaaagcagc acaaagtttg gatgaaatgt 2401 cacagaccat tacagatcta ctgagtgaac aaaaggcatc tgtgagtcag acatccccac 2461 agtctgcttc ttcaccaagg atggaaagta cagcaggaat tacaactact acctcaccga 2521 gaactcctcc accactgact gttcaggatc ccttatgtcc tgcagtttgt cccttagaag 2581 aattatctcc agatagtatt gatgcacata cgtttgattt tgaaactatt ccccatccaa 2641 acatagaaca gactattcac caagtttctt tagacttgga ttcattagca gaaagtcctg 2701 aatcagattt tatgtctgct gtgaatgagt ttgtaataga agaaaatttg tcgtctccta 2761 atcctataag tgatccacaa agcccagaaa tgatggtgga atcactttat tcatcagtta 2821 tcaatgcgat agacagtaga cgaatgcagg atacaaatgt atgtggtaag gaggattttg 2881 gagatcatac ttctctgaat gtccagttgg aaagatgtag agttgttgcc caagactctc 2941 acttcagtat acaaaccatt aaggaagacc tttgccactt tagaacattt gtacaaaaag 3001 aacagtgtga cttctcaaat tcattaaaat gtacagcagt agaaataaga aacattattg 3061 aaaaagtaaa atgttctctg gaaataacac taaaagaaaa acatcaaaaa gaactactgt 3121 ctttaaaaaa tgaatatgaa ggtaaacttg acggactaat aaaggaaact gaagagaatg 3181 aaaacaaaat taaaaaattg aagggagagt tagtatgcct tgaggaggtt ttacaaaata 3241 aagataatga atttgctttg gttaaacatg aaaaagaagc tgtaatctgc ctgcagaatg 3301 aaaaggatca gaagttgtta gagatggaaa atataatgca ctctcaaaat tgtgaaatta 3361 aagaactgaa gcagtcacga gaaatagtgt tagaagactt aaaaaagctc catgttgaaa 3421 atgatgagaa gttacagtta ttgagggcag aacttcagtc cttggagcaa agtcatctaa 3481 aggaattaga ggacacactt caggttaggc acatacaaga gtttgagaag gttatgacag 3541 accacagagt ttctttggag gaattaaaaa aggaaaacca acaaataatt aatcaaatac 3601 aagaatctca tgctgaaatt atccaggaaa aagaaaaaca gttacaggaa ttaaaactca 3661 aggtttctga tttgtcagac acgagatgca agttagaggt tgaacttgcg ttgaaggaag 3721 cagaaactga tgaaataaaa attttgctgg aagaaagcag agcccagcag aaggagacct 3781 tgaaatctct tcttgaacaa gagacagaaa atttgagaac agaaattagt aaactcaacc 3841 aaaagattca ggataataat gaaaattatc aggtgggctt agcagagcta agaactttaa 3901 tgacaattga aaaagatcag tgtatttccg agttaattag tagacatgaa gaagaatcta 3961 atatacttaa agctgaatta aacaaagtaa catctttgca taaccaagca tttgaaatag 4021 aaaaaaacct aaaagaacaa ataattgaac tgcagagtaa attggattca gaattgagtg 4081 ctcttgaaag acaaaaagat gaaaaaatta cccaacaaga agagaaatac gaagctatta 4141 tccagaacct tgagaaagac agacaaaaat tggtcagcag ccaggagcaa gacagagaac 4201 agttaattca gaagcttaat tgtgaaaaag atgaagctat tcagactgcc ctaaaagaat 4261 ttaaattgga gagagaagtt gttgagaaag agttattaga aaaagttaaa catcttgaga 4321 atcaaatagc aaaaagtcct gccattgact ctaccagagg agattcttca agcttagttg 4381 ctgaacttca agaaaagctt caggaagaaa aagctaagtt tctagaacaa cttgaagagc 4441 aagaaaaaag aaagaatgaa gaaatgcaaa atgttcgaac atctttgatt gcggaacaac 4501 agaccaattt taacactgtt ttaacaagag agaaaatgag aaaagaaaac ataataaatg 4561 atcttagtga taagttgaaa agtacaatgc agcaacaaga acgggataaa gatttgatag 4621 agtcactttc tgaagatcga gctcgtttgc ttgaggaaaa gaaaaagctt gaagaagaag 4681 tcagtaagtt gcgcagtagc agttttgttc cttcaccata tgtagctaca gccccagaac 4741 tttatggagc ttgtgcacct gaactcccag gtgaatcaga tagatccgct gtggaaacag 4801 cagatgaagg aagagtggat tcagcaatgg agacaagcat gatgtctgta caagaaaata 4861 ttcatatgtt gtctgaagaa aaacagcgga taatgctgtt agaacgaaca ttgcaattga 4921 aagaagaaga aaataaacgg ttaaatcaaa gactgatgtc tcagagcatg tcttcagtat 4981 cttcaaggca ttctgaaaag atagctatta gagattttca ggtgggagat ttggtactca 5041 tcatcctaga cgaacgccat gacaattatg tgttatttac tgttagtcct actttatatt 5101 ttctacattc agagtctcta cctgccctgg atctcaaacc agcttcaggt gcatctagaa 5161 gaccctgggt actcggaaaa gtaatggaaa aagaatactg tcaagccaaa aaggcacaaa 5221 acagatttaa agttcctttg gggacaaagt tttacagagt gaaagccgta tcatggaata 5281 agaaagtata acttatggac aaaattaata cattctatga catttttttc tgatttgtcc 5341 tgcagtgctc attcatcact ccaaaaacag caggccatct ttttatgcaa aagtcagcgt 5401 gacaatatac ttcactggtg tacatcgttt actttttaac tggcttcatt ttaggaataa 5461 taaattcatc agaatccttg gctgaattaa aatggttttt gttttttggt tttttttttt 5521 acccagacaa ctctagaaat gcggaccaaa ctacttcatt ttctcaaagg gcataccttg 5581 tgcattgtgg cttatgatga gccatattaa ttgcctgtta aatatacact agcttgaact 5641 tagatgttaa atgttattat taccagcatt tgtccttttg tgaaatcagt atcagaatac 5701 ttgcactctt taacacattc tttataaaat gtataaatta ttcagaacta tttaaaataa 5761 agaggagtgt tattgcatgc tgataatcat tttgagtttg cctcagtaga tactaaagca 5821 aattgtttca gtttttttaa atgccctttg atgtttcaaa aaaaaaaagg aactgtaatt 5881 tgattgactg attttaagat cagccataag taatcagcaa tcttcaaaag cactttcagt 5941 ggattggtca tctgggttct aaagggaaga gtctgtgcta ctaaccattt caaatgcaga 6001 ctcaaacctt cccaacatct ttatgactct agaataatca tattgatgaa atcgtaattc 6061 atggttgagt ttcagaacaa aagatattca ttgcacatta accatttaga ggtcatttaa 6121 ataacaaaat attgtattgt aaaagaactg tacaatttta aaacaataaa gatttgaacc 6181 tgtaaatgtg tgtgcctttt aaagaaggat acatttttaa tatatttgag tgattgctgg 6241 gaagtgtgaa aatattgtta tgtatcatat caaagagaaa catgtttatt acaaaaatgt 6301 tctttaacta tatactatgt aacagggtaa acagtgttat gtagaataga attgtgtaaa 6361 ctagatcttt agagaagttg ccattgagca aagttattta aatgagttag ttgagttgga 6421 tgagaattgt ttgaggtttg ttgctagaga acaataataa aataattctt tttcagaaaa 6481 tatttaattt cttcataaaa ataagttaaa tattttttta aatatgtata tctaatagta 6541 caaaatggaa taaacatcat agtgtataga aaactgaatt tgacaagtta atgaataaat 6601 gaacaaatga tttc // LOCUS D86960 6253 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0205 gene, complete cds. ACCESSION D86960 NID g1503993 KEYWORDS KIAA0205. SOURCE Homo sapiens male myeloblast cell_line:KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6253) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (02-AUG-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 6253) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohara,O. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA 0201 - KIAA 0280) deduced by analysis of cDNA clones from human cell line KG-1 and brain JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohira,M., Kawarabayasi,Y., Ohara,O., Tanaka,A., Kotani,H., Miyajima,N. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain JOURNAL DNA Res. 3 (5), 321-329 (1996) MEDLINE 97191544 FEATURES Location/Qualifiers source 1..6253 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /chromosome="1" /sex="male" 5'UTR 1..227 gene 228..1340 /gene="KIAA0205" CDS 228..1340 /gene="KIAA0205" /note="similar to putative product coded in C.elegans cosmid C01C10." /citation=[3] /codon_start=1 /db_xref="PID:d1013885" /db_xref="PID:g1503994" /translation="MAITLEEAPWLGWLLVKALMRFAFMVVNNLVAIPSYICYVIILQ PLRVLDSKRFWYIEGIMYKWLLGMVASWGWYAGYTVMEWGEDIKAVSKDEAVMLVNHQ ATGDVCTLMMCLQDKGLVVAQMMWLMDHIFKYTNFGIVSLVHGDFFIRQGRSYRDQQL LLLKKHLENNYRSRDRKWIVLFPEGGFLRKRRETSQAFAKKNNLPFLTNVTLPRSGAT KIILNALVAQQKNGSPAGGDAKELDSKSKGLQWIIDTTIAYPKAEPIDIQTWILGYRK PTVTHVHYRIFPIKDVPLETDDLTTWLYQRFVEKEDLLSHFYETGAFPPSKGHKEAVS REMTLSNLWIFLIQSFAFLSGYMWYNIIQYFYHCLF" 3'UTR 1341..6253 BASE COUNT 1829 a 1149 c 1332 g 1943 t ORIGIN 1 ctccccgcca gtccgggcga acgcggccgg gcccttgggg accgagtctc ggcgccgccg 61 gggaacgggc gcgggccgcg ccacagccgg agcggggcag ggccccgcca ccgccttctt 121 ccgccggccc cgccgccggc catgcattct tccggctcct ctgactccca aagccccggc 181 tggggcccgg ccagcccgag aaagacagga cggagtccag tgtgagaatg gctataactt 241 tggaagaagc tccgtggctg ggctggctct tggtgaaagc actgatgagg tttgccttca 301 tggtcgtcaa caacctggtt gctattccat cctacatctg ctatgtaatt atacttcagc 361 cccttcgagt gctggacagt aagcggttct ggtatatcga aggaatcatg tataaatggc 421 ttttaggaat ggtagcttcc tggggatggt atgctggata tacagtgatg gaatggggag 481 aagatattaa agcagtttca aaagatgaag cagtgatgtt ggtgaatcat caggcaacag 541 gagatgtgtg cacactgatg atgtgcctcc aggacaaagg actggttgtt gctcagatga 601 tgtggttgat ggatcatatt tttaagtaca caaactttgg aattgtttct ctagttcatg 661 gagacttctt tataagacag ggaagatctt atcgtgacca acagctgctg cttctcaaga 721 agcacttaga aaataattac aggagcagag atcgaaaatg gattgttttg tttccagaag 781 ggggcttcct caggaagagg cgagaaacaa gtcaggcatt tgccaagaaa aataacttgc 841 catttcttac aaatgttact ctgccaaggt ctggggcaac aaaaattatt ttgaatgcac 901 ttgtagcaca acagaaaaat ggaagtccag caggaggaga tgctaaagaa ttagacagca 961 aatcaaaagg cctccagtgg ataatagata caacgatagc ttatcccaaa gctgaaccta 1021 tagatattca aacctggatc cttggataca ggaaaccaac agtcacacat gtacattaca 1081 ggatctttcc aattaaagat gtacccctgg agactgatga ccttaccact tggctctatc 1141 agcggtttgt tgaaaaagaa gacctcttat cacattttta tgaaacagga gcttttccac 1201 cttccaaggg ccataaggaa gctgtttcca gggagatgac cctcagcaac ttgtggatat 1261 ttctcataca gtcttttgca tttttgtcag gctatatgtg gtacaacatc attcagtatt 1321 tttaccattg cctgttttag gaattgacgt ggacttgtca aggtcaccgt aggagttcag 1381 actttctttc ataagtgtgc attattttac atgtgcaaat cagaatatat ttaaaaaaaa 1441 gcaaaagaaa ttcaatggat ggattaatat ttatccccct ttgggatatt ttaaaatcta 1501 ctaaaatgag gattagtaat ataatgacct gctaatatat tttaagaaca tgttttaaaa 1561 agatacactc tatcacatat ttaagcaaac atcacatttg gagaagagga aatcataaaa 1621 tcatcctaga agactatctg agagaaattc tgttgccacc agttatactt gacaatttag 1681 ttgaagctca gaaagttata ttcatccctc ttagctgtag tctatattag ggcagttctc 1741 tagaacgccc acatttccac caactcagta aacttgaggg gagctggttg gcacctcgtg 1801 aagaactctt tccctgtcgt ttgcagtaac aaactccagt ctgttgcagt aacaaactcc 1861 agtctgttgc agtaacaaac tctagaatat tgacattctc tgtgggggaa aagcagtgtc 1921 cactggaccc cttctggtac tggatgtgtt ctttacaaag gctagctcag tccaacattg 1981 tgtttacata cactcgtgct tttccttatc tgacttctca ttttgtatca gaggcatatc 2041 ataaattgat aattttgcaa aatgcacttt tttgagatgc agatatagca aaggattagt 2101 aatatagcct gaaaacaaat gggagcatag cagtgtgtga ggttctcgag aactgtcttg 2161 tctctgtgtg tttatttgcc tgccagtgct ctccagcgcc atcctgccct ggacaccacc 2221 ctgacgtgat gcctctattg cagctcagag gctttatttt ttccattttg acattggcac 2281 taaatgcatt tggggatggt taaaacaaat tactatagaa catttaaatg atcagtttaa 2341 ggggaaatag gctagtttat agaaaaataa gagctagtgg cttataatgg tgacaggttc 2401 tcatgtggca cccctaggac ctgtgcagac agtagtctgt tgaatcatta catcaaggag 2461 ctgcccctgt cagggtgagt gtaattagga acgataccag cacataaggc tccccccaat 2521 ctcttccagt tgctttttct tttttctttt tttttttaga caggtcttac tctgtcaccc 2581 aggctgtagt gcagtggcat aatctcagct cactgcagtc tctgcctccc gggttcgagc 2641 gattctcctg cctcagcctc ccgagtagct gggactacag gcacccacca tcatgcttgg 2701 ctaatttttg tatttttagt agagatgggg tttcaccatg ttggccaggc tggcctcgaa 2761 ctcctgacct caggtgatct gcccacctcg gcttcctaaa gtgctgggat tgcaggcgtg 2821 agccaccacg cccggcctcc agttgctttt gaagagggta aagtcaagtt tctatttcta 2881 gaaaacattt tttagaaatt tgttgcgatg tttgtacaat ttacctcata agagaaccac 2941 acccttctcc aaaagtgctg gtactgcatc agtgagatga gtagggtttt tgtttgggat 3001 tgtaagacta ttaagatcct agtaagtcag tggtagttta ctgtgatgtg agcattgtag 3061 attcccccgt tgtcactaat cacacaacaa ttttgagaag taggctataa aacaaaaaaa 3121 ggttgctgtt ttctattttt aaataaacca aaaaaaaaca gaaaagatgt gaattttccc 3181 cagttatgtg ggaaaggtaa agcaacacca aataaaagcc cacagcagct tcatctttac 3241 gataactcag tgcatatgtg aaagagaatg atgcattaac tgaaatacct cattgaatat 3301 tatactacta ctggtaaaat gcagaagaca gtgttaatgt gtttggtttg ggtactggtt 3361 gttataaaat gcaatttttt tttaaatcta agcatttcat tatgtgttct acagtgtcgg 3421 tgaataaatg aaaccaatct caatttagag gtatggatga tgacagaaag cccaatagaa 3481 gcttaatgat gcttctgttt gaggcccagc aagcaccact aaattactgg atgaaatgaa 3541 attgttcact tgagggatta gtcaaccatc tgggggagaa gttgctcact gtcaaataca 3601 gcatgcacgg tcctagctga tagacctttt cctcattgct acagcaagcc acagggtaga 3661 gtgaccagtt ctcctatcca aaaataaatg cgaacatgca tacataaatg tggctgaggg 3721 ccacttttgc catcactgtg ctccaaagga acatgatatt gttaaatata ctgtacacat 3781 taaactattc tataagatgc tttacttttc aagtacggtg tcattttcac agtctccagg 3841 gtgacaaatg ccaatagata tgtgagttaa tttttttaaa tatctccgtg agttgtaaat 3901 agatgtgtat ttgatctgcc ctatccttct ttgattcttt aacatgttct ctctcttttg 3961 tttgcaaaat atgtttgaaa gcattggtag tgctttcctt atcagtaact ggagttctcc 4021 tgcttgcact agagaaggta aagagagaat cagtattctt atatggcaat ctggggaagc 4081 agcaatatgc cactgtacaa aactgaagaa aagttcctaa tttgtacttt gtgaagggag 4141 atgaaaggac gtttaaagta tatatatttt gtcaagagga aagaagataa aactatgcca 4201 gttttatatc aatagcttgt agaagctcag ctcttcttgg tcttggctag actgcctaga 4261 ttcccacagc agacaaggtt gagaatccat tgctggaatc ttggtattga tgagttacag 4321 tgatggaaca tgtgcttggc cacaggcagg tccagtcact gcaaaagtga ccaagccagc 4381 aggtcaccct taacttcaga aacaattatt ggtggtgaac tgtacttaaa ttgcagagaa 4441 acctgtaagt aatggaaggt aaagaaaaat tacagaatgg aaaataatat tttgggcaag 4501 caaacaaatt cactgagaat tccaaaagta tattaaaaaa gaagatagct atgagttcag 4561 atctatctta ttggtcttta atattacaac caatccttaa ctttccacta taaaggaagg 4621 attactagat tgattacttt ctgggtagat aatctggtaa taaatgatag gtaaatcaaa 4681 aattactttt atttaggagt ttgaattctt actctcatca gacatttttt ttctagggac 4741 gcttactaat taaatgattt aagttgtttc ttaggggttt tttgcctata tatttatgac 4801 tgtgttaatg agtagtgaaa tgatgcggaa agacagctat caggaagagg aaatacagaa 4861 gcctgaataa tctatgggtt agaaaagcat ccctgaataa tcaaaaattg gcagtattgg 4921 cattgttctc aagccttttt atgaaaatga aatctgaaat caccaaatgt aaacctggga 4981 acattattct agtgttgctg tcttggattc atgttaagaa gcgtcttcat tctttgctca 5041 tgttgcccac ttcttgtgga tttgtctgag tgttttttga caatcacttc cttaaagact 5101 cttctgaact agttggacct ggttaatcat agagagtagc ctttaatcat ggatagtctt 5161 cttggattat ttttatattt gaaaagaaaa tgttttattt gcactactga gtaggaagag 5221 ttaattgttt tctttgttct ttttttgaag tcattacaca ggacttcact ccagagttac 5281 cattatgagt gtgttcagct ctggtccaca gaggatggat aaaaatggtt tgttatgttt 5341 ttttgctctg cagtgctatg agccttatat ctgttaatat gaaggacaaa gtcaaaagca 5401 gcagtggata gcaggaaggg tagagactaa tatgtttggg accaaaacca tctaagttag 5461 agatttccag atcacagagg ggctgggcat tctctggagc agtcattggt tggtgcttta 5521 ttgtaatcat tttgcgccaa tccccaacaa ttaggaactg gaccctggga ataagctgag 5581 ggtgctgaac tgttggggaa gggtgactgt agccacatgg aagataaaat atgggttttt 5641 ctgcaaaatt tccatctgag ggtttttaca tttaatattt ttttaagaca gtttaaagag 5701 caaacgtttt ttaagtgtat tctagttgca aagtatgcac acatatcttg aatggcttta 5761 tttttattgt gtaaaactgt tgaacacatg actgtgatgc acaaattctt tacgtgtaag 5821 gagtctatgc attttacagt aacttatttt atgatcgggt gatgagacag ttatactttc 5881 aactgccatt atttttatta agtgctttca ttttctttac agttattata aaattgtatt 5941 tattttatac agatgggttt tcattttcct gatgctgtaa tgtttacttc agcttgttga 6001 cctttctttg tgttatctgc atgttgtaac gtgtgataag aatgaatgta aaggctgtgg 6061 caactgtaat taatttttgt aaagggctgg tcacacgtgg atctggttta tgaatgcatt 6121 tgggatgatt ttggtaacca gatcaccttt tcagaaattt agatgtgaac accaaaagaa 6181 gcattttctc aacaaaaatt aatagctggt tctatttttt ttaaacctag aaaaaataaa 6241 gttgattttt ttc // LOCUS D86965 6611 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0210 gene, complete cds. ACCESSION D86965 NID g1504003 KEYWORDS KIAA0210. SOURCE Homo sapiens male myeloblast cell_line:KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6611) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (02-AUG-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 6611) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohara,O. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA 0201 - KIAA 0280) deduced by analysis of cDNA clones from human cell line KG-1 and brain JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohira,M., Kawarabayasi,Y., Ohara,O., Tanaka,A., Kotani,H., Miyajima,N. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain JOURNAL DNA Res. 3 (5), 321-329 (1996) MEDLINE 97191544 FEATURES Location/Qualifiers source 1..6611 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /chromosome="chromosome 3" /sex="male" 5'UTR 1..1664 gene 1665..4052 /gene="KIAA0210" CDS 1665..4052 /gene="KIAA0210" /note="similar to a putative protein coded in Caenorhabditis elegans cosmid B0393." /citation=[3] /codon_start=1 /db_xref="PID:d1013890" /db_xref="PID:g1504004" /translation="MYHSLSETRHPLQPEEQEVGIDPLSSYSNKSGGDSNKNGRRTSS TLDSEGTFNSYRKEWEELFVNNNYLATIRQKGINGQLRSSRFRSICWKLFLCVLPQDK SQWISRIEELRAWYSNIKEIHITNPRKVVGQQDLMINNPLSQDEGSLWNKFFQDKELR SMIEQDVKRTFPEMQFFQQENVRKILTDVLFCYARENEQLLYKQGMHELLAPIVFVLH CDHQAFLHASESAQPSEEMKTVLNPEYLEHDAYAVFSQLMETAEPWFSTFEHDGQKGK ETLMTPIPFARPQDLGPTIAIVTKVNQIQDHLLKKHDIELYMHLNRLEIAPQIYGLRW VRLLFGREFPLQDLLVVWDALFADGLSLGLVDYIFVAMLLYIRDALISSNYQTCLGLL MHYPFIGDVHSLILKALFLRDPKRNPRPVTYQFHPNLDYYKARGADLMNKSRTNAKGA PLNINKVSNSLINFGRKLISPAMAPGSAGGPVPGGNSSSSSSVVIPTRTSAEAPSHHL QQQQQQQRLMKSESMPVQLNKGLSSKNISSSPSVESLPGGREFTGSPPSSATKKDSFF SNISRSRSHSKTMGRKESEEELEAQISFLQGQLNDLDAMCKYCAKVMDTHLVNIQDVI LQENLEKEDQILVSLAGLKQIKDILKGSLRFNQSQLEAEENEQITIADNHYCSSGQGQ GRGQGQSVQMSGAIKQASSETPGCTDRGNSDDFILISKDDDGSSARGSFSGQAQPLRT LRSTSGKSQAPVCSPLVFSDPLMGPASASSSNPSSSPDDDSSKDSGFTIVSPLDI" 3'UTR 4053..6611 BASE COUNT 1797 a 1554 c 1440 g 1820 t ORIGIN 1 tgactgcatc acctggtctg tgaattttcc attagaagct tggtgtgctg ttaggtgaaa 61 gacttgctca gctatgcgtc attgggtttt atcaacatat aggcgaaaaa aatcctggtc 121 tctgagtgta cagctgagat gaaaatttct tttattggag gaagtattga gtgtgtgctc 181 tcaaatgcgg cctcagttga gtagtgcatt cctgagtttt ggaagcaaat ttgcaaacaa 241 ttgagagtcg tacagtgggt gttctaactg gattcaggtt ttttctaatg taattttttc 301 acacgtaaat taaaaagttt agaaatgtca cacataactt cataacactt tatggagaaa 361 tggttgtact tttaattttt ttctttttat ttatactcca actgactgag cagaggttgt 421 acttctaaat aactttgtgg aagtttttag taccataatt tttataattt tcattccagt 481 cctttgatat ttatgacagt acttctgaag cgcttactga gtgccggaca ctgttgtaag 541 tgctttacgg aacttgactt tttttttttt ttgagacgga ctctcgctct gtcgcccagg 601 ctggagtgca gtggtgcagt ggctcgatct cggctcactg ccacctctcc ctcatggttt 661 caaacacttc tcctgcctca gcctcccagg tagccaggat tatagccgcc cgccaccact 721 cccgactaat tttattttgt atgttctttt ttagtagaga cggaggagtt tcaccatgtt 781 ggccaggctg gtatcgacct cctgacctca agtgatgtgt ccatctcggc ctcccaaggt 841 gctggaatta caggtgtgag ccactgtgct cggcctacct tttttttttg ttttttgttt 901 ttttgaaaag gagtttcgct cttgtccagg ctggagtata atggtgcgat ctcagctcac 961 cgcaatctcc gcctcccaga ttcaagcgat tctcctgcct cagcctcctc aggagctggg 1021 attacaggcg cccaccgcca tgcccggcta atttttgtat ttttagtaga gacggggttt 1081 cactatattg gccaggctgg tctcgaactg ctgacctcaa gtaatccgcc tgcctcagcc 1141 tcccaaagtg ctgggattac agacgtgatc caccaggatc acaccaggcc gcgcctggcc 1201 tgctttcatt ttaaaagtca aatttgtcat ccgcctcagt gcttgtaatc ttttctgagt 1261 gagatactga aatttgcagt ttcgttttgc ttgcacttgt tcactggacc agtagtcact 1321 gttaaatgta aaagtatcta cttcctctga aagtttttta ttcctttatt tcctgcctgg 1381 gcttgtcctc caccctacat gtatgcgtag tagatttagt gtttgttatc ctaaccttta 1441 ggtttaggga ttgactgggt ttctgacttt ttatttggcc aatgaggacg atacagaaaa 1501 tgaagcattg gtcattatca cattttaacg ctgaaaaagt aagaaggaca accccggaat 1561 aaaatgatat cagtatcaag ataaaagttt ggaatgggag aaaaattctc aaagcctgaa 1621 agaaaatctg tagttacttt tggtgacgct gtccagttcc cacaatgtat cattccttat 1681 ctgaaactag acatcctctg cagccagaag aacaagaagt aggcattgac cccttgtcca 1741 gttactctaa caagtctgga ggagattcaa ataaaaatgg aagaagaaca agttctactt 1801 tagactctga agggactttt aattcctata ggaaagaatg ggaagaacta tttgtaaaca 1861 acaattactt ggcaacaata aggcagaagg ggattaatgg gcagctgaga agcagcaggt 1921 tccgcagcat ttgctggaag ctatttcttt gtgttcttcc tcaagacaaa agtcaatgga 1981 taagtagaat tgaagaatta agagcatggt atagcaacat taaagaaata catattacca 2041 acccgaggaa ggttgttggc caacaagatt tgatgatcaa taatcctctt tcacaggatg 2101 aagggagtct ttggaacaaa ttcttccaag ataaagaact tcgatcaatg attgaacaag 2161 atgtcaaaag aacgtttcct gaaatgcagt ttttccagca agaaaatgtg agaaaaattc 2221 ttacagatgt tcttttctgt tatgccagag aaaacgagca gttgctttat aaacagggca 2281 tgcacgaact gttagcacct atagtctttg tccttcactg tgaccaccaa gcttttctac 2341 atgccagtga gtctgcacag cccagtgagg aaatgaaaac tgtcttgaac cctgagtatc 2401 tggaacatga tgcctatgca gtgttctcac aacttatgga aactgctgaa ccttggtttt 2461 caacttttga gcatgatggt cagaagggga aagaaacact gatgactccc attccctttg 2521 ctagaccaca agatttaggg ccaacaattg ctattgttac taaagtcaac cagatccagg 2581 atcatctact gaagaagcat gatattgagc tttacatgca cttgaacaga ctagaaattg 2641 caccacagat atatgggtta aggtgggtgc ggctgctatt tggacgagag ttccccctgc 2701 aggaccttct ggtggtctgg gatgccttgt ttgcagacgg cctcagcctg ggtttagtag 2761 attatatctt cgtagccatg ttactttaca tccgagatgc tttgatctct agtaactacc 2821 agacctgtct cggccttctg atgcattacc cattcatcgg ggatgtacac tcactgattc 2881 ttaaggctct gttccttaga gatccaaaga gaaatccaag accagtgact tatcaattcc 2941 atccaaattt agattattac aaagcacgag gagcagacct catgaataaa agccggacca 3001 atgccaaagg tgctcccctg aatataaata aggtctctaa tagcctgatt aattttggaa 3061 gaaagttgat ttccccagca atggctccag gcagtgcagg tggccctgta cctggaggca 3121 acagcagtag ctcctcctct gttgtaattc ctaccaggac ctcagcagag gccccaagcc 3181 atcacttgca acagcaacag cagcagcaga ggctgatgaa atcagaaagc atgcctgtgc 3241 aattgaacaa agggctaagt tctaaaaaca tcagttcatc tccaagcgtt gagagtttgc 3301 ctggaggaag agaattcact ggctctccac cttcatctgc tactaaaaaa gattcctttt 3361 ttagcaacat ctcacgttct cgctcacaca gcaaaactat gggcagaaaa gaatctgaag 3421 aagaattaga agcccaaatt tccttccttc aagggcagtt gaatgacctg gatgccatgt 3481 gcaaatactg tgcaaaggtg atggacactc atcttgtaaa tattcaagat gtgatattac 3541 aagaaaattt ggaaaaagaa gatcaaattc tggtttccct ggcaggatta aaacagatca 3601 aagacattct aaaaggttcc ctgcgtttta accagagcca gctagaggcc gaagagaacg 3661 aacagatcac cattgcggac aaccactact gctccagcgg ccagggccag ggccgaggcc 3721 aaggccagag cgttcaaatg tcaggggcca ttaaacaggc ctcttcagaa acgccagggt 3781 gcactgatag agggaattcc gatgacttca tcctgatttc caaagatgat gatgggagca 3841 gtgccagggg ctccttctcc ggccaggccc agcctcttcg caccctcaga agcacctctg 3901 ggaaaagcca ggccccagtc tgctccccac tggtgttctc agatccactg atgggcccag 3961 cctcagcttc ctccagcaac cccagctcca gtcctgatga cgacagcagc aaggactctg 4021 gcttcaccat tgtgagtccc ctggacatct gaccacagtg cccagtcctg ccccacaggg 4081 atctagccac ccttcagtgg ccccaaggcc agactgaggc tcatccagtg gagaaccttc 4141 ttaaaccact gcttccttcc cggcatgcat ttggcattgg tccagccctt tgaaacccct 4201 tagagagaag catatatggc cacaaagcac agaggcttag gtttgccaca tgcagacagg 4261 gctttctggg cccttaccta atccccaccc gactcttgct ctgagttaga gctgagttac 4321 gtacccagta tcacactcac agttagaaaa gaccgaatca caatttagaa tcacttttcc 4381 tctgtcccct tctccccagc taagaatgtg tggcacctcc atcagttata cttagaagga 4441 gcagaaatag ttattttcgt atcttctatc cctcaaagca tcagacatgg gaaaattggt 4501 ttataccaag aaagcttcct ctgtggaaat ctgtctcagc ctactttatt cctgcattgg 4561 gaagccatat cgcagagcta aatgcaatag aatgaaccag aactagtgga ttccagggct 4621 gggggaaaaa aaaaaaagaa aaaacctcat tactgacctc tcaaagttat aaggatctct 4681 gcaaacagga tctaagctta ggaataatat ttaggtgtga tatagtgtta gatttttttg 4741 atgtattaaa gaatgcatct ccaatcctta ggccatatca actttggcca tcaatatctc 4801 tccttaaaca attatatttc accttttaga atctttcata gccagaaaac aagattactg 4861 taagccagtt ttagctgcac tgatttcaaa agatataaga atattactat ccttcaaatg 4921 gaaaatgcga ccttgacttt atgggataaa catctttcag acagtcagtt ttctagtcag 4981 gtttctctgg tttcagagct gtatatacct gtcaactgag gaataaaggg aaaaacccaa 5041 gttcattccc acccaaagtc agaatccctc attggcctta aggtagcagt cataagacag 5101 agaattggac ctagagtccc ttctgtgggg aataaggata cctagagaac attccacatg 5161 ccaagaggat gcaggatttc tacacaaccc cttcccttct tggaagtcaa gtgtaggtac 5221 tgcagggcct gtgctcagct gtgaaccccg tatcctgggc cccactgccg ggaccgggtc 5281 tgacatgcca gtgccttcct gggctgagca cagattagag actctccccc ttgtcagtca 5341 gcaccttagg aaaccatgat gggcacagag catcacatga gctgtttctc tccttaaaga 5401 agatccctgg aaaggatgct tttcctctcc tttgcctgcg caggaattct aacaggagtg 5461 ggtgaggatg gcagagggac acagtgcctg tctcgcctcc atcagggaga gcagccatgc 5521 cagggatgac tagctctttg agcctgtcct cagaggatgg cgaggcagcc gggcagtgga 5581 ggccttcatg gtaacaaatg aaagctcagt atagaggaac agacactgtt tacgtccctc 5641 ccactgctaa ccttatatat ctctatagac aaatgtgata atgacatgat ttcccacctg 5701 ccctccaaga aaatggtgac tcactctcaa gtcagctact gtagagaggg ttctaattgg 5761 ttctgcaatt tgctcttaaa ctctagcagg gaactctcct cttaccacat cagcatgtaa 5821 ggtgaataat aactctggtt ttgccagaca gcaggttgtc tgaccttcaa ccactgggca 5881 attgcctggc agatgcacac agtagctccc tggcttctgg ctctgagtgt tcctctcagc 5941 acctctgagt aagctgctgc caagcacata tccctatgac aacactttgt aaaagccgcg 6001 gggcccccat acagcgagtg accttgcaac tgtgcagggt tgccattggt cactttctca 6061 ccttgggaag gtgtcagtgt tttcagttct aaggtaagag gtgtagagct gttcccacca 6121 gggctctggg acagactgga aaggaccaca gacctggcca tccctgggca gcagggccag 6181 tgtcacctgc tgacctctag tatttccttt gccctagagc tagagtcatg atagctgagg 6241 gtcactcgcc ctgcaagagt cactaggcac ccaccatgcc aataaggctc tccgctggct 6301 ccctgcagtt ggctgggtgt ttaatagtca ctgaaaactc ccagccctgc tgcacactag 6361 aggcaggtcc tctcggtcct ctccatcctg tgcttctgtg gcccccagca agctcaccgc 6421 ctccttggag gagagagaca tacaaggaca gtgggtcatg ggtagtacca gcctcaaatt 6481 cccacaggct catactcaga caattgtatt actgccttat gttttttaag tgttttttta 6541 aattcttcat agttgagtat tatttgcaat tttattagtt acagtgctat taaagaatat 6601 gtgctccttt t // LOCUS D86966 5086 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0211 gene, complete cds. ACCESSION D86966 NID g1504005 KEYWORDS KIAA0211. SOURCE Homo sapiens male myeloblast cell_line:KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5086) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (02-AUG-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 5086) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohara,O. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA 0201 - KIAA 0280) deduced by analysis of cDNA clones from human cell line KG-1 and brain JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohira,M., Kawarabayasi,Y., Ohara,O., Tanaka,A., Kotani,H., Miyajima,N. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain JOURNAL DNA Res. 3 (5), 321-329 (1996) MEDLINE 97191544 FEATURES Location/Qualifiers source 1..5086 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..570 gene 571..4374 /gene="KIAA0211" CDS 571..4374 /gene="KIAA0211" /note="similarto human ZFY protein." /citation=[3] /codon_start=1 /db_xref="PID:d1013891" /db_xref="PID:g1504006" /translation="MGDMKTPDFDDLLAAFDIPDPTSLDAKEAIQTPSEENESPLKPP GICMDESVSLSHSGSAPDVPAVSVIVKNTSRQESFEAEKDHITPSLLHNGFRGSDLPP DPHNCGKFDSTFMNGDSARSFPGKLEPPKSEPLPTFNQFSPISSPEPEDPIKDNGFGI KPKHSDSYFPPPLGCGAVGGPVLEALAKFPVPELHMFDHFCKKEPKPEPLPLGSQQEH EQSGQNTVEPHKDPDATRFFGEALEFNSHPSNSIGESKGLARELGTCSSVPPRQRLKP AHSKLSSCVAALVALQAKRVASVTKEDQPGHTKDLSGPTKESSKGSPKMPKSPKSPRS PLEATRKSIKPSDSPRSICSDSSSKGSPSVAASSPPAIPKVRIKTIKTSSGEIKRTVT RILPDPDDPSKSPVGSPLGSAIAEAPSEMPGDEVPVEEHFPEAGTNSGSPQGARKGDE SMTKASDSSSPSCSSGPRVPKGAAPGSQTGKKQQSTALQASTLAPANLLPKAVHLANL NLVPHSVAASVTAKSSVQRRSQPQLTQMSVPLVHQVKKAAPLIVEVFNKVLHSSNPVP LYAPNLSPPADSRIHVPASGYCCLECGDAFALEKSLSQHYGRRSVHIEVLCTLCSKTL LFFNKCSLLRHARDHKSKGLVMQCSQLLVKPISADQMFVSAPVNSTAPAAPAPSSSPK HGLTSGSASPPPPALPLYPDPVRLIRYSIKCLECHKQMRDYMVLAAHFQRTTEETEGL TCQVCQMLLPNQCSFCAHQRIHAHKSPYCCPECGVLCRSAYFQTHVKENCLHYARKVG YRCIHCGVVHLTLALLKSHIQERHCQVFHKCAFCPMAFKTASSTADHSATQHPTQPHR PSQLIYKCSCEMVFNKKRHIQQHFYQNVSKTQVGVFKCPECPLLFVQKPELMQHVKST HGVPRNVDELSNLQSSADTSSSRPGSRVPTEPPATSVAARSSSLPSGRWGRPEAHRRV EARPRLRNTGWTCQECQEWVPDRESYVSHMKKSHGRTLKRYPCRQCEQSFHTPNSLRK HIRNNHDTVKKFYTCGYCTEDSPSFPRPSLLESHISLMHGIRNPDLSQTSKVKPPGGH SPQVNHLKRPVSGVGDAPGTSNGATVSSTKRHKSLFQCAKCSFATDSGLEFQSHIPQH QVDSSTAQCLLCGLCYTSASSLSRHLFIVHKVRDQEEEEEEEAAAAEMAVEVAEPEEG SGEEVPMETRENGLEECAGEPLSADPEARRLLGPAPEDDGGHNDHSQPQASQDQDSHT LSPQV" 3'UTR 4375..5086 BASE COUNT 1131 a 1608 c 1362 g 985 t ORIGIN 1 gcctgcgccg cgcagcccgc ctcgggcggg aggcgggagg cgggaggccg ggcccaggcc 61 ggggcagccc ctccaccgca ccgtcctggg ccggtgccca ggtccgagtc gccttccgcc 121 tgcccccccc gccaatcccc cgccgcggca gccccagcca ggtcccgccg ccaggccggc 181 tcccgccccc gctccgcccc cggagccgca gccccgcccg ccatcgccgt cgccatgttg 241 tggctcccgc agccggcgct ggggacgcgc gcggccgaga ctctggcctg cagtcgccgc 301 cgccgccgcc aggtggtgtt tggactctag accatgtgcc taggtagaag tttttccttt 361 ctccgcagct ctgctcccct agcaacgctc gccacaccct tgttttgaga tcctctctaa 421 ggagcggaga gtttaatagg caagaaggaa gggagaagac agaaggaaga cgctcccccg 481 tacggagaca gagggagggg gggctccaaa gccgaaagag gaggtcccta cctgccacgg 541 ataccagtca gcccttgcca gcatccagcc atgggggata tgaaaacccc agattttgat 601 gaccttctgg ctgcctttga catcccagac cccaccagcc ttgatgccaa ggaggccatc 661 cagacaccca gtgaggagaa tgagagtccc ctcaaacctc caggcatatg tatggatgaa 721 agtgtgtcct tgtctcactc aggatcagcc cccgatgtgc cggccgtgag tgtcattgtc 781 aagaacacca gccgccagga gtcatttgaa gcggagaaag accacattac tcccagtctc 841 ctacacaatg gattccgggg ctcagatctg cctccagatc cccacaactg tgggaaattt 901 gattctactt ttatgaatgg agacagtgcc aggagtttcc ctggcaaact ggagcctccc 961 aagtcagagc cattacccac cttcaaccag ttcagtccaa tctccagccc agaacctgag 1021 gatcccatca aagataacgg atttgggata aagcccaaac actctgacag ttatttccca 1081 ccccctcttg ggtgcggggc tgtgggaggc ccagtcctgg aggctctggc taagtttccg 1141 gttccagagc tgcatatgtt tgatcatttt tgtaagaaag aacccaagcc agaacccctg 1201 cccttgggga gccagcagga acacgagcaa agtgggcaga acacagtgga acctcacaag 1261 gatccggatg ccactcgatt cttcggggaa gctttggagt tcaacagcca tcctagcaac 1321 agtattggag agtccaaggg gcttgcccgg gagcttggta cctgctcatc agtcccccct 1381 aggcagcgtc taaagccagc tcattccaag ctgtcctctt gtgtggcagc cttggtggcc 1441 ttgcaggcca aaagagtggc tagtgtcact aaggaggatc agcctggcca cacaaaggat 1501 ctctcagggc ccactaaaga gagttctaaa ggtagcccca aaatgcccaa gtcaccaaag 1561 agtccccgga gccctctgga ggccactaga aaaagtatca agccatcgga cagccctcgt 1621 agcatctgca gtgacagcag cagcaaaggc tcaccgtctg tggctgccag ctccccacca 1681 gcaattccca aagtgagaat caaaaccatt aagacatcat caggggaaat caaacggact 1741 gtcacaagga tcctgccaga tcctgatgat ccaagtaagt cccctgttgg gtcacctcta 1801 gggagcgcca ttgcagaggc ccccagcgag atgccagggg atgaggtgcc tgtggaagag 1861 cactttcctg aggcaggcac aaattcaggg agcccccagg gggccaggaa aggggacgag 1921 agcatgacaa aggccagtga ctcgtcatct cccagctgca gttctgggcc ccgggtccca 1981 aagggggctg ccccaggctc acagacaggc aagaagcaac agagcacagc actgcaggca 2041 tccaccctgg cccctgccaa cctcctgccc aaagccgtgc acttggccaa cctgaacctc 2101 gtcccccaca gtgttgctgc atcagtgaca gccaagtctt cagtgcaaag acggagccag 2161 ccacagctta cacaaatgtc ggtgcccctg gtccaccagg tgaaaaaggc tgccccactg 2221 attgtagagg tcttcaacaa ggtccttcac agctccaacc ccgtgcccct ctatgcgcca 2281 aatctcagcc cgcctgcgga cagcaggatc cacgtgccgg ccagtgggta ctgctgcctg 2341 gagtgtggag acgcatttgc cttagagaag agcctgagcc agcactatgg ccggcggagc 2401 gtccacattg aggtactgtg cacactgtgc tccaagacgc tgctcttctt caacaagtgc 2461 agcctgctcc ggcacgcccg tgaccacaag agcaaggggc tcgtcatgca gtgttcccag 2521 ctgctggtga agcctatctc tgcggaccaa atgttcgtgt cggcccctgt gaactccacg 2581 gcaccagcag ccccagcccc ttcatcctct cccaaacatg gcctcacttc gggcagtgcc 2641 agtccccctc ctccagcctt gccactctac ccagaccctg tgaggctcat ccggtactca 2701 atcaagtgtc ttgaatgtca caagcagatg cgggactaca tggtcctggc tgcacatttc 2761 cagaggacaa cagaggagac agaggggctg acctgccagg tatgccagat gctgctgccc 2821 aaccagtgca gtttctgtgc ccaccagcgg attcatgcac acaagtcccc ctactgctgc 2881 ccggagtgtg gggtcctctg ccgctctgcc tacttccaga cccatgtaaa ggagaattgc 2941 ctgcactatg cccgcaaggt gggctacagg tgcatccact gtggtgtcgt ccacctgacc 3001 ttggccttgc tgaaaagcca catccaggag cgacactgcc aggttttcca caaatgtgca 3061 ttctgcccca tggccttcaa gactgccagc agcactgcag accacagtgc cacccagcac 3121 cccacccagc cccacagacc ctcccagctc atttataagt gctcctgtga aatggtcttc 3181 aacaagaaga ggcacattca gcagcatttt taccagaatg tcagcaagac gcaggtgggc 3241 gtcttcaagt gccctgagtg cccactcttg ttcgtgcaga agccggagtt gatgcaacac 3301 gtcaagagca cccacggtgt tccccgaaat gtggacgagc tgtcaaacct ccagtcttca 3361 gcggacacat cctcaagccg ccctggctct cgagttccca ctgagccacc agccactagt 3421 gtggctgctc ggagcagctc cctgccttct ggccgctggg gtaggcctga agcccaccgc 3481 agggtggaag ccaggccgcg gctgaggaac actggctgga cctgccagga gtgccaggag 3541 tgggttccag atcgggagag ctacgtgtcc cacatgaaaa agagccacgg tcggacattg 3601 aagcggtacc catgccggca gtgtgaacag tccttccaca cccccaacag cctgcgcaaa 3661 cacatccgca acaaccatga cacagtaaag aagttctaca cctgcgggta ctgcacagag 3721 gacagcccca gctttcctcg gccctccctt ctggagagcc acatcagcct tatgcatggc 3781 atcagaaacc ctgatttgag ccagacgtcc aaagtgaaac ctccgggtgg acattcccct 3841 caggtgaacc atctgaaaag accagtcagt ggagtggggg acgctccagg caccagcaat 3901 ggcgcaactg tctcttccac caaaaggcac aagtcccttt ttcagtgcgc gaaatgtagt 3961 tttgccacag actcggggct cgagtttcag agccacatac ctcagcacca ggtggacagc 4021 tccacagccc aatgtctcct ctgtggtttg tgctacacct ctgccagctc cctcagccgc 4081 cacctcttca ttgtccacaa ggtgagagac caggaggagg aggaggaaga ggaggcggcg 4141 gcagcggaga tggcagtgga ggtggcagag ccagaggagg gctccgggga ggaggtgccc 4201 atggagacta gagagaatgg actggaagaa tgtgccggtg agcctttgtc agctgaccca 4261 gaggcgagga gattgctggg cccggcccct gaggacgatg gtggccacaa tgatcacagt 4321 caaccacagg cctctcagga ccaggacagc cacacactgt cccctcaggt gtgaccggag 4381 actttgcagt gtgcatggtc aggggtggtg ccgaagtgtc ttccacctgc cctgcggacc 4441 gtggaaaata aaaggctctg cccccagtgt gagtgtgacc ggttgtaccc tggagtagtg 4501 tctgccctga gctgccagtg ctgggtatcc cccagcccca ggaaatgtgg ggtcggccag 4561 gaccctcaca gctctgaatt tgcttctgtt atttatggct tttcgctgct tcttggtgcc 4621 ccatctcttg tctgtgtcct tccaacccca agctgcttat gtggcccaac cccactgctg 4681 tcaactaggc ttgaacccca cagcggctgt gctcttctgg gaggttcccg cttgctgcct 4741 tcagccaggg cgctcctcag agctctattt tcctgcagac accagctctc cttcctgcct 4801 ttagatcctg agaaggaggg aaatgagggg tgctgacaca gtccctctgg gagagctctg 4861 cctagtctgg tttggcgagg gcccttgatc accttgcccc tcctccctgt cttctctgat 4921 tcttttccct caaaatagtc ctgagaacta attgtcacag acattggaat atttgtactg 4981 ctctcgtgcc atttgagagg ctgctgcccc aggcaggcca gcccctactc ctcttggcta 5041 cactcatgtt gctcagacta tatttcaaat aaaaaatctt ctcacc // LOCUS D86967 6072 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0212 gene, complete cds. ACCESSION D86967 NID g1504007 KEYWORDS KIAA0212. SOURCE Homo sapiens male bone marrow myeloblast cell_line:KG-1 cDNA to mRNA, clone:HA2602. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6072) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (02-AUG-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 6072) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohara,O. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA 0201 - KIAA 0280) deduced by analysis of cDNA clones from human cell line KG-1 and brain JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohira,M., Kawarabayasi,Y., Ohara,O., Tanaka,A., Kotani,H., Miyajima,N. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain JOURNAL DNA Res. 3 (5), 321-329 (1996) MEDLINE 97191544 FEATURES Location/Qualifiers source 1..6072 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /chromosome="chromosome 3" /clone="HA2602" /sex="male" /tissue_type="bone marrow" 5'UTR 1..58 gene 59..2032 /gene="KIAA0212" CDS 59..2032 /gene="KIAA0212" /note="Containing ATP/GTP-binding site motif A(P-loop): Similar to C.elegans protein(P1:CEC47E128);Similar to Mouse alpha-mannosidase(P1:B54407)" /citation=[2] /codon_start=1 /db_xref="PID:d1013892" /db_xref="PID:g1504008" /translation="MQWRALVLGLVLLRLGLHGVLWLVFGLGPSMGFYQRFPLSFGFQ RLRSPDGPASPTSGPVGRPGGVSGPSWLQPPGTGAAQSPRKAPRRPGPGMCGPANWGY VLGGRGRGPDEYEKRYSGAFPPQLRAQMRDLARGMFVFGYDNYMAHAFPQDELNPIHC RGRGPDRGDPSNLNINDVLGNYSLTLVDALDTLAIMGNSSEFQKAVKLVINTVSFDKD STVQVFEATIRVLGSLLSAHRIITDSKQPFGDMTIKDYDNELLYMAHDLAVRLLPAFE NTKTGIPYPRVNLKTGVPPDTNNETCTAGAGSLLVEFGILSRLLGDSTFEWVARRAVK ALWNLRSNDTGLLGNVVNIQTGHWVGKQSGLGAGLDSFYEYLLKSYILFGEKEDLEMF NAAYQSIQNYLRRGREACNEGEGDPPLYVNVNMFSGQLMNTWIDSLQAFFPGLQVLIG DVEDAICLHAFYYAIWKRYGALPERYNWQLQAPDVLFYPLRPELVESTYLLYQATKNP FYLHVGMDILQSLEKYTKVKCGYATLHHVIDKSTEDRMESFFLSETCKYLYLLFDEDN PVHKSGTRYMFTTEGHIVSVDEHLRELPWKEFFSEEGGQDQGGKSVHRPKPHELKVIN SSSNCNRVPDERRYSLPLKSIYMRQIDQMVGLI" 3'UTR 2033..6072 BASE COUNT 1518 a 1317 c 1410 g 1827 t ORIGIN 1 ggtggtcggc ggggaggccc ccgcgcttta aaataatgcc cgcggcgccc gcgcgaccat 61 gcaatggcga gcgctcgtcc tggggctggt gctcctccgg cttggcctcc atggagtatt 121 gtggctcgtc ttcgggctgg ggcccagcat gggcttctac cagcgctttc cgctcagctt 181 cggcttccag cgtctgagga gccccgacgg ccccgcgtcg cccacctcgg ggcccgtggg 241 ccggcctggg ggggtatccg ggccgtcgtg gctgcagccg ccggggaccg gggcagcgca 301 gagcccgcgc aaggctccgc ggcgtcctgg gccggggatg tgcggcccag ccaactgggg 361 ctacgtgctg ggcggccggg gccgcggccc ggacgagtac gagaagcgct acagcggcgc 421 cttccctccg cagctgcgtg cccagatgcg cgacctggca cggggcatgt tcgtctttgg 481 ctacgacaac tacatggctc acgccttccc ccaggacgag ctcaacccca tccactgccg 541 cggccgtggg cccgaccgcg gggacccttc aaatctgaac atcaatgatg tactagggaa 601 ctactcattg actcttgttg atgcattgga tacacttgca ataatgggaa attcatccga 661 gttccagaaa gcagtcaagt tagtgatcaa cacagtttca tttgacaaag attccaccgt 721 ccaagtcttt gaggccacga taagggtcct gggaagcctc ctttctgctc acagaataat 781 aactgactcc aagcagccct ttggtgacat gacaattaag gactatgata atgagttgtt 841 atacatggcc catgacctgg cggtgcggct cctccctgct tttgaaaaca ccaagacagg 901 gattccatat cctcgggtga atctaaagac aggagttcct cctgacacca ataatgagac 961 atgcacagcg ggagccggtt ccctcctggt ggaatttggg attctgagtc gactcctggg 1021 ggactccaca tttgagtggg tggccagacg agcagtgaaa gccctttgga acctccggag 1081 caatgataca ggattactag gcaatgtcgt gaacattcag acgggccact gggttggaaa 1141 gcagagtggc ctgggtgccg ggctggactc cttctatgaa tacctcttga aatcttacat 1201 tctctttgga gaaaaagaag acctagaaat gtttaatgct gcatatcaga gtattcagaa 1261 ctacttaaga agagggcggg aagcctgcaa tgaaggagaa ggagaccctc cactctatgt 1321 caacgtgaac atgttcagtg ggcagctgat gaacacctgg attgactctc tgcaggcctt 1381 tttccctgga ctgcaggtgc tgataggaga tgtggaagat gccatctgcc ttcatgcctt 1441 ctactatgcc atatggaaac gatatggtgc cctccctgag agatataact ggcagctgca 1501 ggcccctgac gttctcttct acccactgag accagagtta gtggaatcca catatctcct 1561 ctaccaggca accaagaatc ccttctacct ccatgtagga atggatattc tgcagagtct 1621 ggaaaagtac acaaaagtca agtgtgggta cgccacgctg catcacgtca ttgacaagtc 1681 cacagaagac cggatggaga gcttctttct cagtgagacc tgtaaatatt tgtatctgct 1741 gtttgatgaa gacaatccag tacacaagtc tggaaccaga tacatgttca caacagaggg 1801 acacattgta tctgtggatg agcatcttcg ggaattgcca tggaaggaat tcttctctga 1861 agagggaggg caggaccaag ggggaaagtc tgtgcacagg ccgaaacctc atgagttaaa 1921 agtcatcaac tccagctcca actgcaatcg tgtacctgat gagaggaggt actccctgcc 1981 cttaaagagc atctacatgc gacagattga ccagatggtt ggtttgattt gatctgctct 2041 ctgtgaggcc tcatcttgaa ccagacctta acgaccaaac ccagaccatg ccaaagtcca 2101 gtctgaaatg aaaggggaca gaagtcttgc tgtccatggt ggtgtaggaa tttctgtgca 2161 acacctcacc acgtctggtt aatccttgca cacttcagtg tttctctcct gttcaataaa 2221 atgccctgtt aaggatataa tttgaagtga gaagatacat ggaaattgcc ctcttatgac 2281 atgttgatgt tataagcaca atagatgggg catctttgga ttgatgttca cagctttata 2341 cttcagaacc taagtctctt cactttgctg gcacctgcta tactggagta ttgctatgtc 2401 tttaaaaaat ttttttttat tatattttat ttttttgaga cagggtcttg atattttttt 2461 gggacagggt tacctgggct caagtgatcc ttctgcctca gcctcccgag tagctgggat 2521 tacaggtgag caccactgta cctggctagc tacttctttg ttagaggatt gagaatgaaa 2581 tttctgcaaa agggcccatg gttcatttgg tatccctatt taattgcatt gaaaatgtca 2641 tcctttctgt tgttagataa ttggggtctt cccctgatat ccaaccgtga ttttggatca 2701 catgggagaa aaagtcatcc agtttttcat gtttgcctca agtaatcttt acagtgttac 2761 aaattatttg cttaagaaga atggtcttaa ccagaattct taacagatag tctcttaggt 2821 tattatgtta tggtctaaga ggttaactga catcttttgg atggtatttt gcattttgaa 2881 tatgaactta cctgaggaac tcccatagtt ccagaatcag gtgcctttta gggagagaac 2941 aatacctaag attgtctgag cttccatctt tctcatattt cctaagcaag gattctcact 3001 tatgaccata tttgggttag agttctgttt tgtttctgtt ttctgtgtct agtgccaatt 3061 agctaaatca gggagaaaga aatgatcaca tgacttttag catccttgag ccatttctct 3121 gtgtaataca ggctttagat tagtgcctta tattggtttt ggtttggggc actggatgtc 3181 gcagctactg ctatggtttc aggaggcctg tttagccaca tggtgagacc gtggtgaaag 3241 ggggatggaa attgcttggc cagtctttgc ctttcatcct gtaaaagtaa gcatgtagaa 3301 ggaggaagtt gtgctaaaat gcctttgttt ttttgttatt attttcttag ccagaacatc 3361 tctctttgaa ctcacactga tacacacctg ctactcttac acagtgcagc agggctgact 3421 cttagtctgg cttccatgaa gcgtcatggg tggaaacgca ttctagtaaa aaaggtagga 3481 aatccctaaa acttccagcc tcacatagca cggttctcac ctgtcactgt tttcccacct 3541 ctaaggattt catgtacatc ttttcaaagc tagaaataag cactgtctaa gtttatgttg 3601 catttttagt caaaagggag aaatcttatt ccttcttgaa aattttaagt gttatggttt 3661 tatatagttc agttctttga gatttttgaa aagagtattt tcagtaataa acgtgccatc 3721 tctatctctt aaacatttat tacaacaatt gttttaaaat agaaaaaata aaatgcttct 3781 attttacctt ttttcatttc agaagcatta ttctgtttat taacagtgtc ccatctactg 3841 aatagaaaac tttgagaata atatatatat atattttaaa tgttttcact gactcattga 3901 aaatgttaat tacacacaca tgcatgcatg cacacacgag catacttgta cctttgtctc 3961 tgggcaaaca ggtgggactg ttagtgaccc atttgggaaa atagagcatc tcagagaagg 4021 aggtgagttc ttcctgcctg tgatttctct tggcgctccc ctcctctccc gctctggctt 4081 ctgtggcggc agtggtgggt aagcactcca gtgttctctt aatgaggcac tttgcctgtc 4141 actcgagcaa gcctgggtgt tccttcctcc tcatgctcct ggaataggga atagggatct 4201 catgcttgca aactacacaa tgctgcaggt gcttcccagg ggccacaggc tgtcaggaaa 4261 cgtgttttat gttaagtcac aaacccactt gacttctggg tactggaatt aataccagtg 4321 ggtgagactg agggtgagtg agttagtaca tattaatcct ggttgttgag cttccagact 4381 accccgtcca aagtttgatg ctatgtagtc agtggtttgt ggggctggat gccagaaggt 4441 tctttgagcc agtttcaaag gttacttgtt tttttttttt tttttttaag tcagaatgtt 4501 aacagctgtg atatatcctg cagggctttt gcagtttctt ctgttctgtg ttctgaaatc 4561 ctgggtagag aatggctgag gaggagatta ccagagaagt tgctttgctc agtgctttgc 4621 cccaggattg cctcaaatct gagtggactt catcctttgc ggcggctctg agcctggccc 4681 atcttcctat tcccacgtgt agctagtgtc tagtgtcagc tttgctcaat gtggtggaaa 4741 cattttgcag aactgttgta gaaagctgcc ttatagttgg cttgacaaag cataattctc 4801 tcataacaaa ctttcaaatc attacagtag cttagctact ttagttgatg tgaccgagga 4861 atcccttcta gaatcatagg tggcaaggga gggtttgcta gctctccatt tgcactggcc 4921 attgtgaaaa accagcttct gtattcaaat ctttccttca tttttttaaa tttttttttt 4981 ggcagcgctt gtgctggaac ttactcattg taactgaatc ctcagggctt ttcttgtttt 5041 agatcatgga ctgtgcacgt gacacttaaa taattttcta tgtatttaaa gaaaaatgca 5101 ccaggatggt gtctgtgcac gtgactatta gaggagcgtc tgtagaagta cctggtttgg 5161 tcagtgcagt tgtgcaatct gagggccttg tttcctcctc ccctttcccc ttctccccac 5221 caaaggaaaa tatccctctt aatgatttcg tagttcagtt tactgaatga ttaccacctg 5281 taattcctct ttggattgtg tagactcaac atgagacatt cctttctgct ttctggaggg 5341 caccaggggc ctttctcttt gataaatttt ttttgtctgt tgacaaaaac aaaaatcttt 5401 tttcaaatgt agtgctggtg aaaaggtagg gctgagtgat taccttagcc acagggtggc 5461 tgagcaggaa ctttagaaga aaatcctgag ctttcctgtc cattcccagc atccagctcc 5521 tattctagtg cctcttccct gcagggcagg gaccccttgg gaaatcgagg aggtgggacg 5581 ggctgggccc tgtgtcccag gtttcacagg gctcagggtt atgctcccgc ttgaatctgg 5641 acgtgaatct ggtaaaaata tcaagtacct gtggaactcc ctgattctat accctcttcc 5701 ttctttctgc aaggcagagg aataatattt ttaaaggtta ttttgtttta gttttaaata 5761 gcaaaacaca agctgcattt ttatttattt tgcataagaa aggtaaatct ttttacaaaa 5821 aaaagtatag agttggaaac tctgggaaaa cttacggaaa tacacaaatg cttctctgta 5881 atgtgcaata tgctttgcaa ctgtagatga tattttatgt ttaatctgta aataagaaat 5941 gtatttaaat taaaagggat ctttttgtaa aaggaccaaa tgttctttta taaatgtaat 6001 aaggaatatc ttgctcttta aaatttatta ggatttttat gagtaatttt tattaaaaga 6061 tttctttttt tg // LOCUS D86969 4935 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0215 gene, complete cds. ACCESSION D86969 NID g1504011 KEYWORDS KIAA0215. SOURCE Homo sapiens male bone marrow myeloblast cell_line:KG-1 cDNA to mRNA, clone:HA2776. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4935) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (02-AUG-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 4935) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohara,O. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA 0201 - KIAA 0280) deduced by analysis of cDNA clones from human cell line KG-1 and brain JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohira,M., Kawarabayasi,Y., Ohara,O., Tanaka,A., Kotani,H., Miyajima,N. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain JOURNAL DNA Res. 3 (5), 321-329 (1996) MEDLINE 97191544 FEATURES Location/Qualifiers source 1..4935 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /clone="HA2776" /sex="male" /tissue_type="bone marrow" 5'UTR 1..298 gene 299..2770 /gene="KIAA0215" CDS 299..2770 /gene="KIAA0215" /note="similar to Human zinc-finger protein, BR140(P1:JC2069)" /citation=[3] /codon_start=1 /db_xref="PID:d1013894" /db_xref="PID:g1504012" /translation="MKRHRPVSSSDSSDESPSTSFTSGSMYRIKSKIPNEHKKPAEVF RKDLISAMKLPDSHHINPDSYYLFADTWKEEWEKGVQVPASPDTVPQPSLRIIAEKVK DVLFIRPRKYIHCSSPDTTEPGYINIMELAASVCRYDLDDMDIFWLQELNEDLAEMGC GPVDENLMEKTVEVLERHCHENMNHAIETEEGLGIEYDEDVICDVCRSPDSEEGNDMV FCDKCNVCVHQACYGILKVPEGSWLCRSCVLGIYPQCVLCPKKGGALKTTKTGTKWAH VSCALWIPEVSIACPERMEPITKISHIPPSRWALVCNLCKLKTGACIQCSIKSCITAF HVTCAFEHGLEMKTILDEGDEVKFKSYCLKHSQNRQKLGEAEYPHHRAKEQSQAKSEK TSLRAQKLRELEEEFYSLVRVEDVAAELGMPTLAVDFIYNYWKLKRKSNFNKPLFPPK EDEENGLVQPKEESIHTRMRMFMHLRQDLERVRNLCYMISRREKLKLSHNKIQEQIFG LQVQLLNQEIDAGLPLTNALENSLFYPPPRITLKLKMPKSTPEDHRNSSTETDQQPHS PDSSSSVHSIRNMQVPQESLEMRTKSYPRYPLESKNNRLLASLSHSRSEAKESSPAWR TPSSECYHGQSLGKPLVLQAALHGQSSIGNGKSQPNSKFAKSNGLEGSWSGNVTQKDS SSEMFCDQEPVFSPHLVSQGSFRKSTVEHFSRSFKETTNRWVKNTEDLQCYVKPTKNM SPKEQFWGRQVLRRSAGRAPYQENDGYCPDLELSDSEAESDGNKEKVRVRKDSSDREN PPHDSRRDCHGKSKTHPLSHSSMQR" 3'UTR 2771..4935 BASE COUNT 1388 a 1051 c 1173 g 1323 t ORIGIN 1 atacaatagt gctccgcgcc gcctcagccg ccgccgccgc ccaaccgcct gcccagcgct 61 gaggcctgac gggccgggcg gacgagggcc gagggcggga gctgaggcgc ggggggcggc 121 cccggcgggg ggcgggggcg aggaggggat taaggggcag gtgcgaggga gggaagagaa 181 gaaagcgagc ggttaggggg gcggttacca ctccgaccgg actcacccgg cacattgccg 241 ggccgcggcg tggagccggg caggagccgc gagccagctg cgcgaaggat gctccaggat 301 gaaacgccat aggcctgtca gcagcagtga cagttcagac gaaagtcctt ccacttcctt 361 tacttctggc tcaatgtata ggatcaagtc aaaaattcca aatgaacaca agaaacctgc 421 tgaggtattc cggaaggacc tcatcagtgc catgaaactt ccagattctc accacattaa 481 tcctgatagc tattacctct ttgctgatac atggaaggaa gaatgggaaa agggagtcca 541 ggtaccagcc agtccagaca ccgttccaca gccttctctc aggattatag ctgagaaggt 601 aaaggacgtt ctgtttatcc gaccccggaa gtatattcac tgctccagcc cagacaccac 661 agagcctggc tacatcaaca tcatggagtt ggcagcatct gtttgccgct atgacctaga 721 tgacatggac atcttctggc ttcaggaact caatgaagac cttgcagaaa tgggttgtgg 781 gccagttgat gagaatctta tggaaaagac agtagaagtc ctggaacgcc attgccatga 841 aaatatgaac catgctattg agacagagga agggctaggc atagagtatg atgaagatgt 901 gatctgtgat gtgtgccggt ctccagacag tgaagaaggg aatgatatgg tgttctgtga 961 taagtgtaac gtctgtgtgc atcaggcctg ctatggcatc ctcaaggtcc cagaaggcag 1021 ctggctgtgt cgctcctgtg tcctgggcat ttatccgcaa tgtgtgttat gtccaaagaa 1081 aggtggagcc ctgaagacca ccaagacagg gactaaatgg gctcatgtca gctgtgccct 1141 gtggatccca gaggtcagca ttgcttgtcc tgagaggatg gaaccgatca cgaagatctc 1201 ccacatccca cccagtcggt gggccttagt ctgcaacttg tgcaagttga agacgggggc 1261 ttgtattcag tgctctataa aaagctgcat cactgccttc cacgtcacct gtgcctttga 1321 gcacggccta gagatgaaga ccatcctaga tgagggagac gaagtgaagt tcaagtcata 1381 ttgcctcaag catagccaaa acaggcagaa acttggagaa gctgagtacc cccaccacag 1441 ggctaaagag cagagccagg ccaaaagtga gaaaaccagc ctgcgggcac agaagcttcg 1501 ggagctggag gaggagttct attccttggt acgagtggaa gatgtggccg cagagctggg 1561 tatgcccacg ctagctgtgg actttatcta taactactgg aaactgaagc ggaaaagtaa 1621 cttcaataag ccattatttc ctccaaagga ggatgaagaa aatgggctgg tgcagccaaa 1681 agaggaaagc attcacactc gaatgagaat gtttatgcat ctacgccagg acctggagag 1741 ggtccgaaat ctgtgctata tgataagcag acgagagaag ctgaagctgt cacacaacaa 1801 aatacaggaa cagatcttcg gtttgcaagt ccagcttctt aaccaagaaa ttgatgcagg 1861 gcttcctttg acaaatgcac ttgaaaactc actgttttac ccaccaccaa gaattacctt 1921 gaagttaaaa atgcccaaat caaccccaga agaccacaga aacagctcca cagaaaccga 1981 tcagcagccc cactctcctg acagcagctc atctgttcac agtataagga acatgcaggt 2041 gcctcaggag tcactagaaa tgagaacaaa atcgtatccg agatacccac tagagagcaa 2101 gaataaccgt ttgctggcca gtctcagcca ttctaggagt gaagcaaagg agtccagtcc 2161 tgcttggaga accccgtcct cggagtgcta tcatgggcag tcactgggaa agcctctggt 2221 ccttcaggct gccctccatg gacagtcttc cattgggaat gggaaaagtc agcctaactc 2281 caagtttgcc aaatccaatg gcctggaggg cagctggtct gggaatgtca cccaaaaaga 2341 cagctcgagt gagatgttct gtgaccagga gcctgtgttc agcccccact tggtcagtca 2401 gggcagcttt agaaaatcca ctgtagaaca ctttagtagg tcctttaaag agaccaccaa 2461 taggtgggtg aagaacacag aggacctcca gtgctatgtg aagccaacca agaatatgag 2521 ccccaaggag cagttctggg gtagacaggt tctcaggcgg tctgcaggga gagctccata 2581 tcaggaaaat gatggctatt gcccagattt ggagctgagt gattcagagg cagaaagtga 2641 tgggaataaa gaaaaagtca gggtaaggaa agatagctca gacagggaaa atcctcccca 2701 tgactctaga cgggattgcc atggtaaaag caagacacat cccctttccc acagttcaat 2761 gcaaaggtga ttagaaactt ccaaggatga cccaaccttt gcctttgccc catatattgg 2821 ggaaaaccca tacaccaaaa ggattttagc atatgttaag aggaattgca gtgaaaagga 2881 taacattttt ccatagtaaa ttgtcttgca gtttttgaaa atgtttcaag tctagttttt 2941 acaagcacat tacagtaatt gcaggttgtc cagaggttgg tttgtcagag gctattggga 3001 acagctgggc ccagggtatt tgctcagtaa atattttggg gcagctttcc ggttatataa 3061 cacattgaca agtatatgta ttaagagtcc ttgcattgtt aactgatttg caagagacca 3121 tgattatcag acactaaaac tacttatctt ttgaagctac agcatgtgac tccccagagc 3181 tcctgtctgt aggaagtttc taattccact ggtatctata aacccttttc aggggagaga 3241 ccaggagacc accatcttag atactgtcaa actcactact gtcttccttt gttctgcaca 3301 aggttgagtt tctttcacag tatcttaact tactgagtca atactcctca tgcttatcag 3361 tgtccagtgc ctgtttctaa acttgcttgt tggggacaca ctcataacta tttcacctga 3421 ctgacaagca taaaaggcta acttgtggat agtgtttaca aaagtacaaa agtactactt 3481 ccaaggaaaa tgctctgatt ctctgtttag gcatttgtga aggattccag agcctttccc 3541 aaagtgagac catttctggg gtatttgtac caggtcaggg tttttgtgtt gttttcaaat 3601 aagtttgact aaaataagtt tggctgacag tttttgtata cctgctttaa tgtttataaa 3661 atttttattg aggtataatt aaaatatagt aaaatgcaca gaggttaagt attcttacct 3721 gctttaattt tgaaacagat ttttattgtc aaacagaatt tgaaagcatg tgtttaacac 3781 atggatataa tttagctttg tgtcaccttt gaatctgagg cagtttccca aacaaaaagg 3841 ctggagccat ttttcagaag ttgcttatac cctgttccca aactatcagc cttgattaat 3901 ttctagtagg aagagaataa ttacatttgc gggggggggg ggtggataaa aacatgtctg 3961 cttctcattt aaataagaga gaaatgatgc cgttttttaa atgtgaagca gactataatt 4021 ctcagctctc ttttcttctt agccttaaat taatattctc tttcttctag ttttggaaag 4081 tgtagtggga atattcagac aaaagaggcc attttccatt tttaaagctt cttactggtg 4141 aaacagccca gttgtagtag gtgccagtca gtcaaggcag gggccctctc tccgtcaata 4201 tggaaaactc agcagttttc ctctccccca gttgtgttct tgtaacgttg ttaatgggtt 4261 cctttgcttt ttgctttctc cttttctgaa aatgtatgtg ttttgcctct cttttggcta 4321 catcttcaaa atatttcttt tgtgcctatg tacatgtgta aacatgccat agcatgtgtg 4381 gtaggtgtcc tgtattttgt ttgggaaaaa aactatcaaa atgaggaaga gaatttcccc 4441 tatttatgca ctaggtttct gtgctttttc tttgagttct ctggagtaga tattaatttg 4501 ataccttcat ggtaatgaaa ttatgatgga gctgtgttat aaattcctta tgtcagaggc 4561 cagtgcggta gcctttgtcc cttcatgcct ttcaattctg agtgggagga aaagcaaaca 4621 tcaaaacagt gcttcagcca aattccatat gtaatgccat tgggagagta ttgactaaaa 4681 tatcattcgt cagggaaata tagttgtaat atttttacag gatattccta ggtaaatgaa 4741 ggagccttca gttgtaaatt tcaattaccc caaaatgtat ttgctacatt ttgttgtttg 4801 aagtattacc tcttaacctt ctttgttaat ttttttcatt ttgtcttata tagtccagtt 4861 ttccaagata agctcagtcc tttttcaaat gtcacctttt taccaatact ttttcattaa 4921 attatgaaaa ctgct // LOCUS D86972 4689 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0218 gene, complete cds. ACCESSION D86972 NID g1504017 KEYWORDS KIAA0218. SOURCE Homo sapiens male bone marrow myeloblast cell_line:KG-1 cDNA to mRNA, clone:HA2987. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4689) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (02-AUG-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 4689) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohara,O. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA 0201 - KIAA 0280) deduced by analysis of cDNA clones from human cell line KG-1 and brain JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohira,M., Kawarabayasi,Y., Ohara,O., Tanaka,A., Kotani,H., Miyajima,N. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain JOURNAL DNA Res. 3 (5), 321-329 (1996) MEDLINE 97191544 FEATURES Location/Qualifiers source 1..4689 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /chromosome="chromosome 3" /clone="HA2987" /sex="male" /tissue_type="bone marrow" 5'UTR 1..398 gene 399..2684 /gene="KIAA0218" CDS 399..2684 /gene="KIAA0218" /note="similar to E.coli hypothetical 29.6 KD protein(P1:YIGW_ECOLI)" /citation=[3] /codon_start=1 /db_xref="PID:d1013897" /db_xref="PID:g1504018" /translation="MASERGKVKHNWSSTSEGCPRKRSCLREPCDVAPSSRPAQRSAS RSGGPSSPKRLKAQKEDDVACSRRLSWGSSRRRNNSSSSFSPHFLGPGVGGAASKGCL IRNTRGFLSSGGSPLRPANASLEEMASLEEEACSLKVDSKDSSHNSTNSEFAAEAEGQ NDTIEEPNKVQKRKRDRLRDQGSTMIYLKAIQGILGKSMPKRKGEAATRAKPSAAEHP SHGEGPARSEGPAKTAEGAARSVTVTAAQKEKDATPEVSMEEDKTVPERSSFYDRRVV IDPQEKPSEEPLGDRRTVIDKCSPPLEFLDDSDSHLEIQKHKDREVVMEHPSSGSDWS DVEEISTVRFSQEEPVSLKPSAVLEPSSFTTDYVMYPPHLYSSPWCDYASYWTSSPKP SSYPSTGSSSNDAAQVGKSSRSRMSDYSPNSTGSVQNTSRDMEASEEGWSQNSRSFRF SRSSEEREVKEKRTFQEEMPPRPCGGHASSSLPKSHLEPSLEEGFIDTHCHLDMLYSK LSFQGTFTKFRKIYSSSFPKEFQGCISDFCDPRTLTDCLWEELLKEDLVWGAFGCHPH FARYYSESQERNLLQALRHPKAVAFGEMGLDYSYKCTTPVPEQHKVFERQLQLAVSLK KPLVIHCREADEDLLEIMKKFVPPDYKIHRHCFTGSYPVIEPLLKYFPNMSVGFTAVL TYSSAWEAREALRQIPLERIIVETDAPYFLPRQVPKSLCQYAHPGLALHTVREIARVK DQPLSLTLAALRENTSRLYSL" 3'UTR 2685..4689 BASE COUNT 1086 a 1196 c 1273 g 1134 t ORIGIN 1 ggcggaggcc gggccaggcg gggctgtagg gctggggcag gcggcgcccg ggtctggccc 61 tggcctacgg gctaagcgcc cggccccggg gccgcggatc cggcagccat gtgtggatct 121 tgaagcgcca tgtggtcttc tgaagcgctt tgacagttct aaagggcttt atattgcaaa 181 ctgatcaaag cgcttcgtag ccccggaagc gcttaggcat ctccgaagta gcgctgggca 241 aagtgaaggc ttcctgatct cagaagcacg ttgtgggctt ggaaacctac ggcagccttt 301 agtcaatttt ggatcgcttt gacttctcca acacggtttg gcatctctga aaccttgaga 361 actgtgatgg gcagtggaaa gaagagggaa aggtgcccat ggcgtccgag cggggcaagg 421 tcaagcacaa ctggagcagc acgtcggaag ggtgtccccg caagcgcagc tgcctccggg 481 agccctgtga tgtggccccc tccagccggc cagctcagag gtctgcgtcg cgttctggag 541 ggcccagcag ccccaagcgc ctgaaagccc agaaggagga cgatgtggct tgctcgcgga 601 ggttatcctg gggctcatcc cgccgcagaa ataactcctc ctcctccttc tccccacatt 661 tcttgggccc tggtgtgggc ggggccgcct ccaaaggctg cctgattcgg aacactcggg 721 ggttcctgtc ttcaggggga tcccctctgc gtcctgccaa cgcctctttg gaagaaatgg 781 cttctctaga ggaggaagcc tgcagcctta aggttgattc caaagatagt tctcataact 841 ccacaaactc tgaatttgca gctgaagctg agggtcagaa tgatacaatt gaggaaccca 901 acaaggtcca gaaaaggaag agggatagac ttcgagacca gggctccaca atgatctacc 961 tgaaggctat ccagggcatc ctggggaaat cgatgccaaa aaggaaggga gaggctgcca 1021 ctcgggcaaa accaagcgca gcagagcatc ccagccatgg agaaggacca gccaggagtg 1081 aaggaccagc caagactgca gaaggagcag ccaggagtgt cacagtcact gctgctcaga 1141 aggagaaaga cgcaacccca gaggtcagca tggaggagga taagacagtg ccagagagga 1201 gcagcttcta tgacaggaga gtagttatag accctcaaga gaaacccagt gaggagcccc 1261 ttggggaccg aaggactgtc attgacaaat gctctccacc cctagagttc ttggatgact 1321 ctgactctca tttagaaatc caaaagcata aagataggga ggtggtgatg gagcacccct 1381 cttctggaag tgactggtct gatgttgagg agatctccac agtcagattc tctcaggagg 1441 aacctgtctc cctgaaacct tcagccgttc tggagccttc ttccttcacc accgactatg 1501 tcatgtaccc tcctcatttg tacagtagtc cttggtgtga ctacgccagc tattggacca 1561 gcagccccaa gccttctagc tacccctcca caggcagcag cagcaacgat gcagcccagg 1621 ttgggaagag cagccggagc cgcatgagtg attattcccc caactctaca gggagtgtcc 1681 aaaacacctc cagagacatg gaggcctcag aggaaggctg gtcccagaat tctcgttcat 1741 ttcgcttctc cagaagctca gaagaaagag aggtgaagga gaaaagaaca ttccaagagg 1801 agatgcctcc gcgtccttgt ggaggacacg catccagctc cctgccaaag agccacctgg 1861 agccaagcct agaggagggc ttcattgaca ctcattgtca cctggacatg ctctattcca 1921 agctatcttt ccaagggacc tttacaaagt tcagaaaaat ttacagcagc tccttcccta 1981 aggaatttca gggctgcatc tctgacttct gtgatccccg caccctgaca gattgcctat 2041 gggaggagct gttgaaagag gatctggtct ggggggcctt tggctgtcac cctcattttg 2101 cacgttacta cagtgagagt caagaaagaa atcttttgca agccttaagg caccctaagg 2161 ctgtggcatt tggagaaatg ggcttggatt actcttacaa gtgcaccacg cctgtcccag 2221 aacagcacaa ggtatttgag agacagctgc agctggctgt gtctctaaag aagcccttgg 2281 tgatccactg ccgagaagct gatgaagatc tgctagaaat catgaaaaag tttgtgcccc 2341 ctgactacaa gatccatagg cattgcttca ccggcagcta cccggtcatt gagcccctgc 2401 tgaagtactt tcccaacatg tctgtgggct tcacggcagt gctgacatac tcctctgcct 2461 gggaggcccg ggaagccttg aggcagatcc cactggagag aatcatcgtg gaaacggatg 2521 ctccctattt cctccctcgc caggttccca aaagcctttg ccagtatgcc cacccgggcc 2581 tggccttgca tacggtccga gagattgcca gagtcaaaga tcagccactc tccctcacct 2641 tggctgcctt gcgtgagaac accagtcgcc tctacagtct ttaagcagag aagggcgacc 2701 agcagcctga cagaacacag gctggcttga agtctgtgtc tcaggtcgag gatgtgttta 2761 gagagctgat tggaacacag aaaaccagga caggatgttt tcctccaagc gggtcataag 2821 gcttcagctt ctgggtggtg ggtggggtgg ggtgggcatg gaatgagggg tatggggact 2881 gcctgtaata gcactgggat tttgcctccc agccaaaatg ctccagggta ggcagcaaag 2941 aagaaagagt gcgatatgag ggtacagtga gtttggcagt ccaaattctg ttctctgcag 3001 cctgtttttg agtagttggt gaataaagtc acccgcctac ttgtctaagc acatgtgggt 3061 gtgtaactca gttcctggtt ctcggttcta aaaacagact tccaactggg aaactttttg 3121 ggggaaatta actggacacc tatctcggag gtttattttc ttgcaaccag tgaagtcgtc 3181 ctcctccctt ccctggataa ctcttcagtt tgactgtcac tgttctggtg tcaactccag 3241 cgtcggcaca ggcagaagga cttcagctgc tggtctcatt ggttccactg ccattgatat 3301 gggggggtgc agaggagcac tcattgtcca tgatggagat ccaggacaga ctgggggact 3361 ccgggaagag gctttcttgg ggaagaggac cccaaaggag gtacttcctc ctcactgatg 3421 ccctcagggc tgctgtgtgg gtgtgttaag ctgataaggt tcagtttgcg gcggaaacta 3481 cctgctagtc ttgtgtcagg ggctgccagc ctccttttgt tttacttttg cccccttctt 3541 gggaacattt gaccaaaaat aatcctcatt tcatcttgtg tacagtcctt tgtgtacccc 3601 gcccaggttg agaggtgaat cagcgtaatt ctccgaagct ctgctgggca gctcagccat 3661 gttacattgt ctatgcagaa taagcaggcc tggtgtctgc aaatgtacca gtccttctcc 3721 cgcatctcct gggtgccccc ttggctttgc ctctcctgtg tcctgtcttt ctgcgtttta 3781 gaagtgagag cctctccttc accttttgag caactgagtc atctcaagtc ctgagctctg 3841 ctctgccgcc tgctggtgcc ttgtaaaggt atactcgtta caggccctag aggttctaat 3901 ggtccagggt taagtgagag gagactgtac tttgtttcaa aggatccttc accctgatct 3961 gcagtgaggt ggatagatca cctggagtcc ctcgtctgtg gtcttggagg cttaaattgt 4021 aaaatacatc ccttatggaa tcctaaattc ctctaggtgt ttttggaagg cgcatttgag 4081 ccttgtgagc taaaatggaa tggatttaat atttcctatc tggcatttcc atcttgcccc 4141 tggtacacaa gtcactggcc tggaactcag ccttgattca ctgtccgtct tcacggatta 4201 gctgtgctgt tatgttgtct gtgctgcaga ttggcccatg tgggaagtcg ggggggacct 4261 gatttcctgc ttggaagact tgggggactg ccgagcatat caaagtgttt atagtcacca 4321 agtgaactgc agcacaacca tctcctctcc agcaagccct gaagtcagta gtgcctgcag 4381 gtgaaaccaa ccagccctgt gttagaggag gaaaagcgga gatgacatgg aagtctccaa 4441 gcctgtgcca tccacctgcc aaggaaaagc acaaggtgct atctactttt ctctctagga 4501 tttagattat catttatgtg ctgttgcaca gtgaaacctc acctgtgtgg gcgtgaaagc 4561 tgattggcat tgtttttgat tcagcttttt ggatggctaa ttgttttcac tgtgctgtgg 4621 gaatgcctct gtattttttc ccctctttgg ccatcttttt ctgaaaataa agtgatggat 4681 cctctagcc // LOCUS D86975 6033 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0222 gene, complete cds. ACCESSION D86975 NID g1504023 KEYWORDS KIAA0222. SOURCE Homo sapiens male bone marrow myeloblast cell_line:KG-1 cDNA to mRNA, clone:HA2586. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6033) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (02-AUG-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 6033) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohara,O. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA 0201 - KIAA 0280) deduced by analysis of cDNA clones from human cell line KG-1 and brain JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohira,M., Kawarabayasi,Y., Ohara,O., Tanaka,A., Kotani,H., Miyajima,N. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain JOURNAL DNA Res. 3 (5), 321-329 (1996) MEDLINE 97191544 FEATURES Location/Qualifiers source 1..6033 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /chromosome="chromosome 18. (RH_ID :RH25428)" /clone="HA2586" /sex="male" /tissue_type="bone marrow" 5'UTR 1..318 gene 319..3810 /gene="KIAA0222" CDS 319..3810 /gene="KIAA0222" /note="similar to Mouse finger protein(clone mkr3)(S03677):" /citation=[3] /codon_start=1 /db_xref="PID:d1013900" /db_xref="PID:g1504024" /translation="MDRNREAEMELRRGPSPTRAGRGHEVDGDKATCHTCCICGKSFP FQSSLSQHMRKHTGEKPYKCPYCDHRASQKGNLKIHIRSHRTGTLIQGHEPEAGEAPL GEMRASEGLDACASPTKSASACNRLLNGASQADGARVLNGASQADSGRVLLRSSKKGA EGSACAPGEAKAAVQCSFCKSQFERKKDLELHVHQAHKPFKCRLCSYATLREESLLSH IERDHITAQGPGSGEACVENGKPELSPGEFPCEVCGQAFSQTWFLKAHMKKHRGSFDH GCHICGRRFKEPWFLKNHMKAHGPKTGSKNRPKSELDPIATINNVVQEEVIVAGLSLY EVCAKCGNLFTNLDSLNAHNAIHRRVEASRTRAPAEEGAEGPSDTKQFFLQCLNLRPS AAGDSCPGTQAGRRVAELDPVNSYQAWQLATRGKVAEPAEYLKYGAWDEALAGDVAFD KDRREYVLVSQEKRKREQDAPAAQGPPRKRASGPGDPAPAGHLDPRSAARPNRRAAAT TGQGKSSECFECGKIFRTYHQMVLHSRVHRRARRERDSDGDRAARARCGSLSEGDSAS QPSSPGSACAAADSPGSGLADEAAEDSGEEGAPEPAPGGQPRRCCFSEEVTSTELSSG DQSHKMGDNASERDTGESKAGIAASVSILENSSRETSRRQEQHRFSMDLKMPAFHPKQ EVPVPGDGVEFPSSTGAEGQTGHPAEKLSDLHNKEHSGGGKRALAPDLMPLDLSARST RDDPSNKETASSLQAALVVHPCPYCSHKTYYPEVLWMHKRIWHRVSCNSVAPPWIQPN GYKSIRSNLVFLSRSGRTGPPPALGGKECQPLLLARFTRTQVPGGMPGSKSGSSPLGV VTKAASMPKNKESHSGGPCALWAPGPDGYRQTKPCHGQEPHGAATQGPLAKPRQEASS KPVPAPGGGGFSRSATPTPTVIARAGAQPSANSKPVEKFGVPPAGAGFAPTNKHSAPD SLKAKFSAQPQGPPPAKGEGGAPPLPPREPPSKAAQELRTLATCAAGSRGDAALQAQP GVAGAPPVLHSIKQEPVAEGHEKRLDILNIFKTYIPKDFATLYQGWGVSGPGLEHRGT LRTQARPGEFVCIECGKSFHQPGHLRAHMRAHSVVFESDGPRGSEVHTTSADAPKQGR DHSNTGTVQTVPLRKGT" 3'UTR 3811..6033 BASE COUNT 1351 a 1700 c 1726 g 1256 t ORIGIN 1 cggggcgtgc gcgtcctcct ccccaggccc gccgcctccc tgccaagaat ctgagagagg 61 ccgagtggag ttcggtcctt ctctgaacag ttttagctga gagtaccagc atccaactgg 121 gagcgttgtc attgcatttc cacattccca ggaaagccca ggtgctggct gccagctgct 181 gcgcccccca tgtagaaggt gcacctcctg ggagcaggca cgtcttttgg ctcttctgac 241 catggagaga taggacggtc cctgcagccc gcgcgacaga aagctgtgcc gccaccaccg 301 gccgcgtccg tccttcggat ggatcgcaac agagaggccg agatggagct gaggcgaggc 361 cccagcccca ccagggccgg ccggggccac gaggtggatg gggacaaggc tacctgccac 421 acctgctgca tctgcggcaa gagcttcccc ttccagagct cgctttcgca gcacatgcgc 481 aagcacacgg gcgagaagcc ctacaagtgt ccctactgcg accaccgggc ttcccagaag 541 ggcaacctga agattcacat ccggagccac cgcacgggga ctctgattca gggacacgag 601 ccggaggcgg gcgaggcgcc gctgggtgag atgcgcgcct ccgagggcct ggacgcctgc 661 gccagcccca ccaagagcgc ctctgcctgc aaccggctgc tgaacggggc ctcgcaggcc 721 gacggcgcca gggtcctgaa cggggcctcg caggccgaca gcggcagagt cctgctgcgg 781 agcagcaaga agggggcaga ggggtccgca tgcgccccgg gggaggccaa ggcagcggtc 841 cagtgctcct tctgcaagag ccagttcgag cgtaagaagg acctggagct gcacgtgcac 901 caggcgcaca agccgttcaa gtgcaggctg tgcagctacg cgacgctgcg ggaggagtcg 961 ctgctgagcc acatcgagag ggaccacatc accgcgcagg ggcccggcag cggcgaggcc 1021 tgcgtggaga acggcaagcc cgagctgagc cccggggagt tcccgtgcga ggtgtgtggc 1081 caggccttca gccagacctg gttcctgaag gcgcacatga agaagcaccg gggctccttc 1141 gaccacggct gccacatctg cggccgtagg ttcaaggagc cctggttcct caagaaccac 1201 atgaaggcgc acggccccaa gacgggcagc aagaacaggc ccaagagtga gctggacccc 1261 atcgccacca tcaacaacgt ggtccaggag gaggtgatcg tcgccggcct gagcctctac 1321 gaggtctgcg ccaagtgcgg gaacctgttt acaaacctgg acagcttgaa cgcccacaat 1381 gccatccacc gcagagtcga ggccagccgc acgcgcgccc cggccgagga gggggcggag 1441 gggccctcgg acaccaagca gttctttctc cagtgcctga acctgaggcc gtcggcggcc 1501 ggcgactcgt gccctggcac gcaggccgga cggcgggtgg ctgagctgga cccggtcaac 1561 agctaccagg cctggcagct ggccacgcgg ggtaaggtgg ccgagccggc cgagtacctc 1621 aagtacgggg cctgggacga ggcgctggcc ggggacgtgg ccttcgacaa ggacaggcgc 1681 gagtacgtcc tggtgagcca ggagaagcgc aagcgtgagc aggatgcacc agccgcgcag 1741 gggcccccgc ggaagcgcgc gagcgggcct ggggaccccg cgcccgccgg ccacctcgat 1801 ccccgctcgg ccgcgcgccc caaccgcagg gccgcagcca ccaccggcca gggcaagtcc 1861 tccgagtgct tcgagtgcgg caagatcttc cgcacctatc atcagatggt gctgcactca 1921 cgcgtgcatc gccgcgcgcg ccgcgagagg gacagtgacg gggacagggc ggcgcgggcc 1981 cgctgcggat cactcagtga gggtgactcg gcctcccagc ccagcagccc tggctccgcc 2041 tgtgccgctg ctgactcccc gggctctggc ctggccgacg aggctgccga agacagtggt 2101 gaggagggcg cccctgaacc tgcaccaggg ggacagccgc gccgctgctg cttttccgaa 2161 gaggtgactt cgaccgagct ctccagtgga gaccagagtc acaagatggg agataacgcc 2221 tcggaaagag acaccggcga gtccaaggca gggatcgcag cttctgtgtc catacttgaa 2281 aacagtagca gagagacttc tagaaggcaa gagcagcaca gattttctat ggacttaaag 2341 atgccagcat ttcaccccaa gcaggaggtg cccgtccctg gtgatggtgt ggagttccct 2401 tccagtacgg gagcggaggg ccagacgggt caccctgcag aaaagctgtc cgatttgcac 2461 aacaaggaac actctggggg agggaagcgg gcgctggccc cagacctcat gccgctagat 2521 ttaagtgcga ggtcgacgcg ggatgacccc agcaataagg agacggcctc ctccctgcag 2581 gcggctttag tcgttcaccc gtgtccttac tgcagccaca agacctacta ccccgaggtc 2641 ctgtggatgc acaaacgcat ctggcaccgt gtcagctgca actccgtggc tcccccgtgg 2701 attcagccca atggttacaa aagcatcaga agcaatttgg ttttcctttc ccggagcgga 2761 cgcacgggcc ccccgcctgc cctcggtggc aaagaatgcc agcctttgct ccttgctcgg 2821 ttcacccgca ctcaggtgcc aggggggatg ccggggtcca aaagtggctc ttctcccctg 2881 ggagtggtca caaaagccgc tagcatgcct aagaataagg agagccattc cggaggtccc 2941 tgcgctctgt gggcgcccgg ccctgacggg tatcgacaga ccaaaccttg tcacggccag 3001 gagccacatg gcgcggccac acaggggccc ctggccaagc ccaggcagga ggctagctcc 3061 aaaccggtgc ctgccccggg tggcgggggc ttcagcagga gcgccacccc tacgcccacc 3121 gtcatcgccc gggctggcgc gcagccctcg gccaatagca agcctgtgga gaagtttggg 3181 gtccccccag cgggggctgg ctttgccccc acaaataagc acagtgcccc ggactccctg 3241 aaagccaaat tcagtgctca gcctcagggt ccacctcctg caaagggcga agggggcgct 3301 cctcctctac ctccccgcga gcccccctcg aaggcagccc aggagctgag gactctggcc 3361 acctgtgctg cggggtccag gggcgacgcg gccttgcagg cccagcccgg cgtggctggg 3421 gcgccccccg tcctacactc catcaaacag gagccagtgg ccgaggggca tgagaagcgc 3481 ctggacatcc tcaacatctt taagacgtac attccaaagg actttgcgac cctctaccag 3541 ggatggggtg tcagcggccc tgggttggag cacagaggga cactccggac gcaggcccgg 3601 ccaggagagt tcgtctgcat cgagtgcgga aagagcttcc accagcccgg ccacctcagg 3661 gcccacatgc gggcacactc agtggtgttt gagtccgatg ggcctcgggg ttctgaagtt 3721 cataccacct ccgcagacgc ccccaaacaa gggagagacc attctaacac aggtaccgtc 3781 cagacagtgc ctctgagaaa gggaacctaa aggcgtgttt ccgacgcacc ccaggtcccc 3841 gtaacggcca ttagcagtac cctcacgatg tcccagcagc ctcccacctg tgacctggcc 3901 gctccatgga agaacagccg gggaactcct gagcagacac ctcacatccc gagccgctgc 3961 gctggagtgg aaactgaagg cagatgcctc tccttgttaa acgttcagaa ataaatgaag 4021 atgctatatt ctagaaatac atgtagatac tatatacgca tttacgtgct catcgtccat 4081 agtcccatat tttcttataa taaacagtag tactggcagg cacagtaggg gcacaaggca 4141 tctgtcttat tcaagacaag tttgagacac tggaaaaaaa gatacttgtt gtgtgtgttg 4201 gacagagtgg cgaggctgag cactgtcaca ggggcctccc atgttaagag ggactgtggg 4261 gatgatgtca gaacaagacg tggtggattt gaggttgatc gagtattaat actactgcct 4321 ctccttgtct tagtgggtat ttaaaatagt aaataagaga gaggaaggag gtgacgttca 4381 ggtgctgtgg gaagcaggct tggcggaggg gtatgatgat gagaccctca ttgttcactg 4441 gctccatcgc actcctccct ggggccgtgt gcctgttcca ttcttcccac cattcgaact 4501 gagcgaatct ggcaaaggag acacgtctgt gggaatgcgt agattccgcc tcggaagaga 4561 gctagcgcaa cactaagaaa agcaggcttc ttgtttattc tcaggacctt tttgtaacag 4621 ggctacattc tgcaaactgc ttacaaagga agactatacg tcttaacaaa ttatttagcc 4681 actgagtcct cccgattcgg acctgtttta gtaatggcag aagaatccct gagcaggttc 4741 aggtgcccta gatgactagg gtgctgagct ctggcgcctt ctgtccccac tctttgcctc 4801 cccgcccctt ccctgagcca ccccagcaag tgggtgtctt ttctccctgg gcctggtgac 4861 ctccacagga tgagtgactt tgttcataaa gggtggggat caccagcccc ttgggtgggg 4921 gacggcttca tatacctctt cctcagtaat gcaaatgcga gtttttgtgg tgggggttaa 4981 ggcccataac aaaggatctt aaaccatgca gtgtacgcaa ttgaaatggt attccacaga 5041 tataaatatt ttcttttccc attgccgtga cactatgtgt gatggtaata tttctgagag 5101 tttcagattt ttgcacatat gattttatgc attatcaaaa gttactgctg ccttgaatga 5161 aaatgttctg tgaaattttt tgcaaaagct ttactaggtt tttttttaat tgtgaaattt 5221 tgtaaaggca ggaaatggat taaaacgagc atgctaaata tatttttcaa aaaagcaata 5281 attttacatg tacagaaatt atcctaacct ttaatactgg cgagagcaac agtttactta 5341 atacggtaat ggactagtgc agtttttgta gacagtgggc ttctgataca aagtcttgtt 5401 taaacacaga cacacacaca cacaaacaca cacacacacc ctaaagtgtg ggtttcctgt 5461 tctaatgatt tgttgaatat tattatatta ttattattat tattattatt attgttattg 5521 ttattagtaa tgtttggttc tggattctac ttgttactga gtttaaatta cttgacggtt 5581 caggttactt tgcaacactt tcaaacgatg caatgtaact ggctagctta tatatatata 5641 tatatatata tatatatatt tttttttttt tttttactta tttttttctg atattcttac 5701 accagatatg tacgaaaatg atctgtcctg ttggtgtaat taggaatgtc catgcagata 5761 cagttaaaca actgtaattg actgttctgt aaagttattt tgggcaaagt tgcggagaca 5821 cattcctctg tccacctaag aaatcagaag actcttctgt tgatttatgt ttaatcattt 5881 cagtagtttc cccacagtga tcatttctgc attttctggc ttttgttttc ttggctgaaa 5941 gtgaatggtg actgttagga atgtcaggga ctagtgaccc agtcctgttt ctctgtgttt 6001 tagttattaa aaagaaattc tgtacccaaa gtg // LOCUS D86977 4226 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0224 gene, complete cds. ACCESSION D86977 NID g1504027 KEYWORDS KIAA0224. SOURCE Homo sapiens male bone marrow myeloblast cell_line:KG-1 cDNA to mRNA, clone:HA4657. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4226) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (02-AUG-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 4226) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohara,O. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA 0201 - KIAA 0280) deduced by analysis of cDNA clones from human cell line KG-1 and brain JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohira,M., Kawarabayasi,Y., Ohara,O., Tanaka,A., Kotani,H., Miyajima,N. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain JOURNAL DNA Res. 3 (5), 321-329 (1996) MEDLINE 97191544 FEATURES Location/Qualifiers source 1..4226 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /chromosome="chromosome 16. (RH_ID :RH25430)" /clone="HA4657" /sex="male" /tissue_type="bone marrow" 5'UTR 1..136 gene 137..3820 /gene="KIAA0224" CDS 137..3820 /gene="KIAA0224" /note="similar to putative ATP-dependent RNA helicase K03H1.2 of C.elegans(S41025)" /citation=[3] /codon_start=1 /db_xref="PID:d1013902" /db_xref="PID:g1504028" /translation="MGDTSEDASIHRLEGTDLDCQVGGLICKSKSAASEQHVFKAPAP RPSLLGLDLLASLKRREREEKDDGEDKKKSKVSSYKDWEESKDDQKDAEEEGGDQAGQ NIRKDRHYRSARVETPSHPGGVSEEFWERSRQRERERREHGVYASSKEEKDWKKEKSR DRDYDRKRDRDERDRSRHSSRSERDGGSERSSRRNEPESPRHRPKDAATPSRSTWEEE DSGYGSSRRSQWESPSPTPSYRDSERSHRLSTRDRDRSVRGKYSDDTPLPTPSYKYNE WADDRRHLGSTPRLSRGRGRREEGEEGISFDTEEERQQWEDDQRQADRDWYMMDEGYD EFHNPLAYSSEDYVRRREQHLHKQKQKRISAQRRQINEDNERWETNRMLTSGVVHRLE VDEDFEEDNAAKVHLMVHNLVPPFLDGRIVFTKQPEPVIPVKDATSDLAIIARKGSQT VRKHREQKERKKAQHKHWELAGTKLGDIMGVKKEEEPDKAVTEDGKVDYRTEQKFADH MKRKSEASSEFAKKKSILEQRQYLPIFAVQQELLTIIRDNSIVIVVGETGSGKTTQLT QYLHEDGYTDYGMIGCTQPRRVAAMSVAKRVSEEMGGNLGEEVGYAIRFEDCTSENTL IKYMTDGILLRESLREADLDHYSAIIMDEAHERSLNTDVLFGLLREVVARRSDLKLIV TSATMDAEKFAAFFGNVPIFHIPGRTFPVDILFSKTPQEDYVEAAVKQSLQVHLSGAP GDILIFMPGQEDIEVTSDQIVEHLEELENAPALAVLPIYSQLPSDLQAKIFQKAPDGV RKCIVATNIAETSLTVDGIMFVIDSGYCKLKVFNPRIGMDALQIYPISQANANQRSGR AGRTGPGQCFRLYTQSAYKNELLTTTVPEIQRTNLANVVLLLKSLGVQDLLQFHFMDP PPEDNMLNSMYQLWILGALDNTGGLTSTGRLMVEFPLDPALSKMLIVSCDMGCSSEIL LIVSMLSVPAIFYRPKGREEESDQIREKFAVPESDHLTYLNVYLQWKNNNYSTIWCND HFIHAKAMRKVREVRAQLKDIMVQQRMSLASCGTDWDIVRKCICAAYFHQAAKLKGIG EYVNIRTGMPCHLHPTSSLFGMGYTPDYIVYHELVMTTKEYMQCVTAVDGEWLAELGP MFYSVKQAGKSRQENRRRAKEEASAMEEEMALAEEQLRARRQEQEKRSPLGSVRSTKI YTPGRKEQGEPMAPRRTPARFGL" 3'UTR 3821..4226 BASE COUNT 1021 a 1077 c 1313 g 815 t ORIGIN 1 ataatggccg ctttcaaggt gtggattttg gctccttgag cctgtttgag cgaggggtgg 61 gagcgccggc gccccagaat ccgggacaga agggtcccaa gagtcgcgct tggtgagaga 121 aatcccagat cctgtgatgg gggacaccag tgaggatgcc tcgatccatc gattggaagg 181 cactgatctg gactgtcagg ttggtggtct tatttgcaag tccaaaagtg cggccagcga 241 gcagcatgtc ttcaaggctc ctgctccccg cccttcatta ctgggactgg acttgctggc 301 ttccctgaaa cggagagagc gagaggagaa ggacgatggg gaggacaaga agaagtccaa 361 agtctcctcc tacaaggact gggaagagag caaggatgac cagaaggatg ctgaggaaga 421 gggcggtgac caggctggcc aaaatatccg gaaagacaga cattatcggt ctgctcgggt 481 agagactcca tcccatccgg gtggtgtgag cgaagagttt tgggaacgca gtcggcagag 541 agagcgggag cggagggaac atggtgtcta tgcctcgtcc aaagaagaaa aggattggaa 601 gaaggagaaa tcgcgggatc gagactatga ccgcaagagg gacagagatg agcgggatag 661 aagtaggcac agcagcagat cagagcgaga tggagggtca gagcgtagca gcagaagaaa 721 tgaacccgag agcccacgac atcgacctaa agatgcagcc accccttcaa ggtctacctg 781 ggaggaagag gacagtggct atggctcctc aaggcgctca cagtgggaat cgccctcccc 841 gacgccttcc tatcgggatt ctgagcggag ccatcggctg tccactcgag atcgagacag 901 gtctgtgagg ggcaagtact cggatgacac gcctctgcca actccctcct acaaatataa 961 cgagtgggcc gatgacagaa gacacttggg gtccaccccg cgtctgtcca ggggccgagg 1021 aagacgtgag gagggcgaag aaggaatttc atttgacacg gaggaggagc ggcagcagtg 1081 ggaagatgac cagaggcaag ccgatcggga ttggtacatg atggacgagg gctatgacga 1141 gttccacaac ccgctggcct actcctccga ggactacgtg aggaggcggg agcagcacct 1201 gcataaacag aagcagaagc gcatttcagc tcagcggaga cagatcaatg aggataacga 1261 gcgctgggag acaaaccgca tgctcaccag tggggtggtc catcggctgg aggtggatga 1321 ggactttgaa gaggacaacg cggccaaggt gcatctgatg gtgcacaatc tggtgcctcc 1381 ctttctggat gggcgcattg tcttcaccaa gcagccggag ccagtgattc cagtgaagga 1441 tgccacttct gacctggcca tcattgctcg gaaaggcagc cagacagtgc ggaagcacag 1501 ggagcagaag gagcgcaaga aggctcagca caaacactgg gaactggcgg ggaccaaact 1561 gggagatata atgggcgtca agaaggagga agagccagat aaagctgtga cggaggatgg 1621 gaaggtggac tacaggacag agcagaagtt tgcagatcac atgaagagaa agagcgaagc 1681 cagcagtgaa tttgcaaaga agaagtccat cctggagcag aggcagtacc tgcccatctt 1741 tgcagtgcag caggagctgc tcactattat cagagacaac agcatcgtga tcgtggttgg 1801 ggagacgggg agtggtaaga ccactcagct gacgcagtac ctgcatgaag atggttacac 1861 ggactatggg atgattgggt gtacccagcc ccggcgtgta gctgccatgt cagtggccaa 1921 gagagtcagt gaagagatgg ggggaaacct tggcgaggag gtgggctatg ccatccgctt 1981 tgaagactgc acttcagaga acaccttgat caaatacatg actgacggga tcctgctccg 2041 agagtccctc cgggaagccg acctggatca ctacagtgcc atcatcatgg acgaggccca 2101 cgagcgctcc ctcaacactg acgtgctctt tgggctgctc cgggaggtag tggctcggcg 2161 ctcagacctg aagctcatcg tcacatcagc cacgatggat gcggagaagt ttgctgcctt 2221 ttttgggaat gtccccatct tccacatccc tggccgtacc ttccctgttg acatcctctt 2281 cagcaagacc ccacaggagg attacgtgga ggctgcagtg aagcagtcct tgcaggtgca 2341 cctgtcgggg gcccctggag acatccttat cttcatgcct ggccaagagg acattgaggt 2401 gacctcagac cagattgtgg agcatctgga ggaactggag aacgcgcctg ccctggctgt 2461 gctgcccatc tactctcagc tgccttctga cctccaggcc aaaatcttcc agaaggctcc 2521 agatggcgtt cggaagtgca tcgttgccac caatattgcc gagacgtctc tcactgttga 2581 cggcatcatg tttgttatcg attctggtta ttgcaaatta aaggtcttca accccaggat 2641 tggcatggat gctctgcaga tctatcccat tagccaggcc aatgccaacc agcggtcagg 2701 gcgagccggc aggacgggcc caggtcagtg tttcaggctc tacacccaga gcgcctacaa 2761 gaatgagctc ctgaccacca cagtgcccga gatccagagg actaacctgg ccaacgtggt 2821 gctgctgctc aagtccctcg gggtgcagga cctgctgcag ttccacttca tggacccgcc 2881 cccggaggac aacatgctca actctatgta tcagctctgg atcctcgggg ccctggacaa 2941 cacaggtggt ctgacctcta ccgggcggct gatggtggag ttcccgctgg accctgccct 3001 gtccaagatg ctcatcgtgt cctgtgacat gggctgcagc tccgagatcc tgctcatcgt 3061 ttccatgctc tcggtcccag ccatcttcta caggcccaag ggtcgagagg aggagagtga 3121 tcaaatccgg gagaagttcg ctgttcctga gagcgatcat ttgacctacc tgaatgttta 3181 cctgcagtgg aagaacaata attactccac catctggtgt aacgatcatt tcatccatgc 3241 taaggccatg cggaaggtcc gggaggtgcg agctcaactc aaggacatca tggtgcagca 3301 gcggatgagc ctggcctcgt gtggcactga ctgggacatc gtcaggaagt gcatctgtgc 3361 tgcctatttc caccaagcag ccaagctcaa gggaatcggg gagtacgtga acatccgcac 3421 agggatgccc tgccacttgc accccaccag ctcccttttt ggaatgggct acaccccaga 3481 ttacatagtg tatcacgagt tggtcatgac caccaaggag tatatgcagt gtgtgaccgc 3541 tgtggacggg gagtggctgg cggagctggg ccccatgttc tatagcgtga aacaggcggg 3601 caagtcacgg caggagaacc gtcgtcgggc caaagaggaa gcctctgcca tggaggagga 3661 gatggcgctg gccgaggagc agctgcgagc ccggcggcag gagcaggaga agcgcagccc 3721 cctgggcagt gtcaggtcta cgaagatcta cactccaggc cggaaagagc aaggggagcc 3781 catggcccct cgccgcacgc cagcccgctt tggtctgtga gctgaggctg tccccagaga 3841 ggatggcagc aggtattggg tcctcagcct tctggcggga gccctgaggc tgcggacaaa 3901 gccttttcat ctgaggactt tcatctgtgc atatcacggc cccccagggc agttcctgct 3961 ggaccagact ctctggcaga ggaggtggag ttcttccatg caggagcacg gcatggcggg 4021 agcggggctg cagagtatcc gaggtgctgc cggggcagcg ggaggtggct ggacccatcg 4081 catctaaaac tggcccagga cacttggtgt atgcgtgact tggctgtggc tgtctttttt 4141 aatccttgtg taaagcagca aaaaagacct aaagggaatt gtaatttggt tataattcag 4201 gatttggaaa taaatttatt atttgt // LOCUS D86979 5891 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0226 gene, complete cds. ACCESSION D86979 NID g1504031 KEYWORDS KIAA0226. SOURCE Homo sapiens male bone marrow myeloblast cell_line:KG-1 cDNA to mRNA, clone:HA4633. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5891) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (02-AUG-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 5891) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohara,O. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA 0201 - KIAA 0280) deduced by analysis of cDNA clones from human cell line KG-1 and brain JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohira,M., Kawarabayasi,Y., Ohara,O., Tanaka,A., Kotani,H., Miyajima,N. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain JOURNAL DNA Res. 3 (5), 321-329 (1996) MEDLINE 97191544 FEATURES Location/Qualifiers source 1..5891 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /chromosome="chromosome 3. (RH_ID :RH25432)" /clone="HA4633" /sex="male" /tissue_type="bone marrow" 5'UTR 1..622 gene 623..2878 /gene="KIAA0226" CDS 623..2878 /gene="KIAA0226" /citation=[3] /codon_start=1 /db_xref="PID:d1013904" /db_xref="PID:g1504032" /translation="MLSIPTLGPSLASTNPCPTMAQRRSTSFPLSGPPRKPQESRGHV SPAEDQTIQAPPVSVSALARDSPLTPNEMSSSTLTSPIEASWVSSQNDSPGDASEGPE YLAIGNLDPRGRTASCQSHSSNAESSSSNLFSSSSSQKPDSAASSLGDQEGGGESQLS SVLRRSSFSEGQTLTVTSGAKKSHIRSHSDTSIASRGAPESCNDKAKLRGPLPYSGQS SEVSTPSSLYMEYEGGRYLCSGEGMFRRPSEGQSLISYLSEQDFGSCADLEKENAHFS ISESLIAAIELMKCNMMSQCLEEEEVEEEDSDREIQELKQKIRLRRQQIRTKNLLPMY QEAEHGSFRVTSSSSQFSSRDSAQLSDSGSADEVDEFEIQDADIRRNTASSSKSFVSS QSFSHCFLHSTSAEAVAMGLLKQFEGMQLPAASELEWLVPEHDAPQKLLPIPDSLPIS PDDGQHADIYKLRIRVRGNLEWAPPRPQIIFNVHPAPTRKIAVAKQNYRCAGCGIRTD PDYIKRLRYCEYLGKYFCQCCHENAQMAIPSRVLRKWDFSKYYVSNFSKDLLIKIWND PLFNVQDINSALYRKVKLLNQVRLLRVQLCHMKNMFKTCRLAKELLDSFDTVPGHLTE DLHLYSLNDLTATRKGELGPRLAELTRAGATHVERCMLCQAKGFICEFCQNEDDIIFP FELHKCRTCEECKACYHKACFKSGSCPRCERLQARREALARQSLESYLSDYEEEPAEA LALEAAVLEAT" 3'UTR 2879..5891 BASE COUNT 1330 a 1584 c 1579 g 1398 t ORIGIN 1 cccggagccc accggccgca ggtgcctcct ccggccccag ggggccccgg gagccctgaa 61 gggcgaagcg gcagggacgc ctctcttggg cgaagaggcg gcctcaccgc cccggatgcg 121 gccggagggc gcgggaatgg agctcggagg cggcgaggag cgcctgcctg aggagagcag 181 gcgtgccgcc gccagacgga ttactggcag ttcgtgaaag acatccggtg gctcagtccc 241 cactcagccc ttcacgtgga gaagttcatc agcgtgcacg agaacgacca gagcagtgct 301 gatggtgcca gtgaacgtgc tgttgccgag ctgtggctgc agcacagcct gcagtaccac 361 tgcctctcag cccagctccg gcccctgctc ggggatagac agtatatcag aaaattctac 421 acagatgctg ccttcctgct aagtgacgct catgtcacgg ccatgctgca gtgcctggaa 481 gcagtggaac agaacaaccc ccgcctcctg gctcagatcg atgcgtccat gtttgccaga 541 aagcacgaga gcccgctcct ggtgacaaag agccagagcc tgacagccct gcccagttcc 601 acatacaccc ctccaaacag ctatgctcag cattcctact ttgggtcctt ctctagcctc 661 caccaatccg tgcccaacaa tggctcagag aagatctact tcctttccac tctctggccc 721 tccccggaaa cctcaagaaa gcagagggca cgtctcacca gcagaggatc aaaccatcca 781 agccccccca gtttcagtct ctgcactagc cagggattcc cctttgaccc caaatgaaat 841 gagctccagt actctgacca gccccataga ggcatcctgg gtcagcagcc agaatgattc 901 cccaggtgat gccagtgagg ggcctgagta cctggccatt ggcaacttgg acccccgagg 961 ccggactgcc agctgtcaga gtcacagcag caatgccgag agcagcagtt ccaatttgtt 1021 ctcctccagc agctcccaga agccagattc tgctgcctct tccttagggg accaggaagg 1081 aggtggggag agccagctgt ccagtgtcct ccgcaggtcc agcttctcag aggggcagac 1141 actcactgtc accagtgggg caaagaaaag ccacattcgc tcccattcgg ataccagcat 1201 tgcctccagg ggagctccag aatcctgcaa tgataaggcg aagttgagag gccctttgcc 1261 ctactctggt caaagcagtg aagtcagcac acccagctct ctgtacatgg aatatgaagg 1321 tggtcggtac ctgtgctcag gggaaggcat gttccgaaga ccatcagaag gacagtccct 1381 catcagctac ctctctgagc aagacttcgg cagctgtgcc gacctggaaa aggagaatgc 1441 ccacttcagc atctcagagt ccttaattgc tgccatcgag ctaatgaagt gcaacatgat 1501 gagccagtgc ctagaggagg aggaagtgga agaggaagac agtgatagag agatccagga 1561 gctgaagcag aagatccgcc ttcggcgcca gcaaatccgc accaagaacc tgctccccat 1621 gtaccaggag gctgagcacg gaagctttcg ggtcacctcc agcagctccc agttcagctc 1681 acgtgattcg gcacagctct ctgactctgg ctctgctgat gaggttgatg aatttgaaat 1741 ccaagatgct gacatcagaa ggaacacagc ctcaagcagc aaatccttcg tttcctccca 1801 gtccttctcc cactgcttcc tgcactccac gtctgctgag gcggtggcca tggggctcct 1861 gaagcagttt gaggggatgc agcttccagc cgcctcggag ctggagtggc ttgtcccgga 1921 gcatgatgcc cctcagaagc tcctgcccat tcctgactca ctgcccatct caccggatga 1981 cgggcagcac gctgacatct acaagctgcg gattcgtgtt cgtggcaact tggagtgggc 2041 cccgccccgg cctcagataa tttttaatgt tcatccagcc ccaacgagga aaattgccgt 2101 ggccaagcag aattaccgct gtgcaggatg tggcatccgg actgaccctg attacatcaa 2161 gcgactgcgg tactgtgagt acctgggcaa gtacttctgc cagtgctgcc acgagaatgc 2221 ccagatggcc atccccagcc gggttctgcg caagtgggac ttcagcaagt actacgtcag 2281 caacttctcc aaggacctgc tcattaagat ctggaatgat cctctcttca acgtgcagga 2341 cataaacagt gccctctata ggaaggtcaa gctgctcaat caagtccggc tgctgcgggt 2401 ccagctgtgt cacatgaaga acatgttcaa gacttgccga ctggccaagg agcttctgga 2461 ttcctttgac acagtcccag gccacctgac agaggacctc cacctgtact cactgaatga 2521 cctgactgcg accaggaagg gggagctggg gccccggctt gctgagctca ccagggcagg 2581 ggctacccat gtggagagat gcatgctctg ccaagccaaa ggcttcatct gtgagttctg 2641 tcagaatgag gatgacatca tctttccctt tgagctccat aagtgccgga cctgtgaaga 2701 gtgtaaagcg tgttaccata aagcctgctt caagtctgga agctgtccgc gctgcgagcg 2761 gctgcaggcc cggcgggagg cactggccag gcagagcctg gagtcttacc tgtcagacta 2821 cgaggaggag cccgcggaag cgctggccct ggaagccgcc gtcctggagg ccacctgaag 2881 aaagcacgtg cagccctccc tccgggccgg gtcacacctg ttgcagaact gagccactct 2941 ttgaaggact cgccccacct ggggcttctt tttttttttt ttttttaatt atcatcatct 3001 tttttttttt tttactgact tgtctgacgt ctgtgtgcag tcagccgtcg gcaggttgat 3061 gggtccagag tctgtggtga cagataattt gtaaacacca ggtgtttcca tcagaactga 3121 catgcgggtc cttcagtgaa gcttctagtg cctctgtcag tggaagagac agcaagacca 3181 agttcttcca gcgtctgtgg ccttctcctc taggtttcac ctgcatgtca ggtatcattt 3241 ccaattttcc tttgtttcag ttctggagct tctgagccag gcctttctca accacctctc 3301 ctgctgctga aacggggatg gcgttttccc tctccctgtc ctggactggg gtcagactgt 3361 gccccgagga gaagcagcag agaataggac tacgtcatgg gcatttcgtc cacttatttg 3421 ggtattttgg gggccacaga acaatcctga ctatcctaga ctcctcagag acctcagagg 3481 cagctgtgaa tgtccctatg ttgccgggag ttcctgtttg aaatatttga agcatagagg 3541 atgccacaag ctgactttct tcatctacct tggtgatctt gaagcaaaga acagaactga 3601 tgctcaggcc aggctcacct gtagccttac gccgcaagca tacgtgaggc gccagctctg 3661 tcgctgaagg agcgcttact cagaggagcg gtcggccccc tcttggtgtt aaggtctctt 3721 agttaacctg gctttttggt gcaggtgtga tctttgaagc tcaggcaggt ccctgatgcc 3781 atcctaaggt gaggacagga acctcaccca ccatcttctt agcgtgtccc tgatgactct 3841 gtcctctgtt agatggtcgt tgtgcttctg agtaaaagta caacccgact ccgttctctc 3901 cccttcctgc agcagagctg ggtccttccc tggtggccga gtctctcttg ccttagcttc 3961 tttggtcaaa gttggagaaa agcttcctgc tattagtgct gttacagaac ttgacggttt 4021 gtggatgtga gtgtgaatgt ccctgtgttc ttgggataac aagagccttt atgccaatta 4081 tgcacttaac tctgtgtagc ctggtaatgt ttatctgttc atttgataat gctgatttta 4141 gtgtgctgcc cccctccccc cgttaatgtg tgttgatggt gaagtccttt tgataatgct 4201 gattttggtg tgctgcctcc cccttccccc ccgttaatgt gtgtgttgac agtgaagtcc 4261 ttgggtgggg ccatgtgtgt gtttgtgatg ttccttaagt tgatgcagct tctaacctct 4321 gtgaaaacac tggtcagagt ggcttctcca agagctggca gctctgtgaa ctaaagcctg 4381 catcattttt gttctgggat tgaattctgc ccatgggcat gtcttctcat agttgcttgc 4441 tggtaggaaa gaaatgggcg tgggtgctgc cctggaagct gagcggaaag ttgcctgtgg 4501 ttggtggaag ctgatgagag cttgagctgg cggtaagaag gagtctccca gggaagtggg 4561 agaggcatta aggtgatggc cagggctgag gctccaccag cgtgagaggg aacatgtggg 4621 aactggcccc tgcccttgat tcctctgcct caaagttggg atctgaaagc catgtagggc 4681 tagaagaccc tgaggctgtt ctcccttctg ttcatagtga gactcaaaaa gccaagtccc 4741 agaagttctg aagggctgtg actagaagtg cccaggtcct tcagggagct ttaagaatga 4801 ccccacagaa ctcaagttta actaggggtt aggtcccaga ttcagaccca ggagtttata 4861 aaaatgagct ctacttccag ttttggttta aattacacat ccaggccagg cacagtggct 4921 cacacctgta atcccagcac tttgggaggc cagtgcgggc ggatcatgag gtcaggagtt 4981 tgagaccagc ctggccaatg tggtgaaacc ctgtctcttc caaaaataca aaaattagct 5041 gggcgtggtg gcacacgcct gtaatcccag ctacttggga ggctgaggca ggagaatcgc 5101 ttgaacctgg gaggcagagg ttgcagtgag ccgagattgc gccaccgcac tccagcctgg 5161 gtgacagagt gagactccgt ctcaaaaaac aaaaaggtga cacatccagc tctttctcca 5221 ggtcactgcg ctggaggaca gatgtgccgt cttgtcctgc ctgtttcaca tcagcatagg 5281 atcaaaggat gacaatgctg acagcttctg aagccgaact caacagtctc ataggctcct 5341 cacttgtcac ttatttttcc ctagctccct caaccgcacc ccatcccttt agatcgtgcg 5401 tctgttttag tgactctgac acgatgccgt cctcaccttc caaataccca gttatttatt 5461 caagaggggg gaagtgggta gaggatggga tgttttggaa gcactttgca agttaccact 5521 atctgaaaat cccctgctgt tgcggggaga agctttgaat gcactgaaga gaattccttc 5581 taaatgaagg caggtgatag tgttctttct gtaagtaaag ggaaagaaaa aaaacatagt 5641 ttgcttacca ggtggagaca agattcaaga catagcagaa gagtggaaga caaatatttt 5701 ccacttaaat gaggctgttt ttgacgttct ctgccaagga tttagagctt tcgttgaact 5761 aacataaaag gagtgcgagt cttagtagag atgttccgtg tgtgccgccc gtgctctgaa 5821 ctgcgtttcc acctgctgtg gtgcttgtgc agcctggcag ttcattgtca tctttaataa 5881 actaaggaaa t // LOCUS D86985 6025 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0232 gene, complete cds. ACCESSION D86985 NID g1504043 KEYWORDS KIAA0232. SOURCE Homo sapiens male bone marrow myeloblast cell_line:KG-1 cDNA to mRNA, clone:HA2598. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6025) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (02-AUG-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 6025) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohara,O. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA 0201 - KIAA 0280) deduced by analysis of cDNA clones from human cell line KG-1 and brain JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohira,M., Kawarabayasi,Y., Ohara,O., Tanaka,A., Kotani,H., Miyajima,N. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain JOURNAL DNA Res. 3 (5), 321-329 (1996) MEDLINE 97191544 FEATURES Location/Qualifiers source 1..6025 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /chromosome="chromosome 4. (RH_ID :RH25438)" /clone="HA2598" /sex="male" /tissue_type="bone marrow" 5'UTR 1..596 gene 597..3623 /gene="KIAA0232" CDS 597..3623 /gene="KIAA0232" /citation=[3] /codon_start=1 /db_xref="PID:d1013910" /db_xref="PID:g1504044" /translation="MENRKDTEYKEEPLWYTEPIAEYFVPLSRKSKLETTYRNRQDTS DLTSEAVEELSESVHGLCISNNNLHKTYLAAGTFIDGHFVEMPAVINEDIDLTGTSLC SLPEDNKYLDDIHLSELTHFYEVDIDQSMLDPGASETMQGESRILNMIRQKSKENTDF EAECCIVLDGMELQGERAIWTDSTSSVGAEGLFLQDLGNLAQFWECCSSSSGDADGES FGGDSPVRLSPILDSTVLNSHLLAGNQELFSDINEGSGINSCFSVFEVQCSNSVLPFS FETLNLGNENTDSSANMLGKTQSRLLIWTKNSAFEENEHCSNLSTRTCSPWSHSEETR SDNETLNIQFEESTQFNAEDINYVVPRVSSNYVDEELLDFLQDETCQQNSRTLGEIPT LVFKKTSKLESVCGIQLEQKTENKNFETTQVCNESPHGDGYSSGVIKDIWTKMADTNS VATVEIERTDAELFSADVNNYCCCLDAEAELETLQEPDKAVRRSEYHLWEGQKESLEK RAFASSELSNVDGGDYTTPSKPWDVAQDKENTFILGGVYGELKTFNSDGEWAVVPPSH TKGSLLQCAASDVVTIAGTDVFMTPGNSFAPGHRQLWKPFVSFEQNDQPKSGENGLNK GFSFIFHEDLLGACGNFQVEDPGLEYSFSSFDLSNPFSQVLHVECSFEPEGIASFSPS FKPKSILCSDSDSEVFHPRICGVDRTQYRAIRISPRTHFRPISASELSPGGGSESEFE SEKDEANIPIPSQVDIFEDPQADLKPLEEDAEKEGHYYGKSELESGKFLPRLKKSGME KSAQTSLDSQEESTGILSVGKQNQCLECSMNESLEIDLESSEANCKIMAQCEEEINNF CGCKAGCQFPAYEDNPVSSGQLEEFPVLNTDIQGMNRSQEKQTWWEKALYSPLFPASE CEECYTNAKGESGLEEYPDAKETPSNEERLLDFNRVSSVYEARCTGERDSGAKSDGFR GKMCSSASSTSEETGSEGGGEWVGPSEEELFSRTHL" 3'UTR 3624..6025 BASE COUNT 1789 a 1146 c 1391 g 1699 t ORIGIN 1 cgatctgctt ctgatgaaag ctctggtatc gagactttag tggaggagct ctgctccaga 61 ctgaaagacc ttcagagtaa gcaagaagag aagattcaca aaaagttaga ggggtctccc 121 tctccagagg cagaattatc ccctccagca aaggatcaag tggaaatgta ctatgaagca 181 tttccaccac tttctgagaa accagtttgc ctgcaagaaa tcatgactgt gtggaacaag 241 tctaaagtct gttcttactc tagctcttct tcatcatcca cagccccacc agctagcaca 301 gatacttcct ctcctaagga ctgcaacagt gaaagtgaag tcaccaagga aagaagcagt 361 gaagtaccca ccactgtgca tgagaaaacc cagagcaaaa gcaaaaacga gaaggaaaac 421 aaatttagta atggctttct ttgaagcacg gtgaaaaggc tgaaaggaac attcatactg 481 gaagtagtag cagtagcagc agtggttctg tcaaacagct gtgcaagcgg ggtaagagac 541 ctttaaaaga aatagggaga aaagatcctg ggagcactga aggaaaagac ctgtacatgg 601 agaatagaaa ggacacagag tataaagagg agcccttgtg gtacaccgag ccaattgctg 661 aatattttgt tcctctgagc agaaaaagta aactagagac cacataccga aacagacagg 721 atacaagtga tctgacatca gaggcagtgg aagaattgtc tgaatcagtg catggtcttt 781 gtatcagcaa caataatctt cataaaacat acctcgcagc aggtactttc attgatggtc 841 attttgtaga aatgcctgca gttataaatg aggatattga cctcactggg acctcattat 901 gttctctacc agaggacaat aaatacctgg atgatattca tctatcagaa ttaacgcact 961 tctatgaagt ggatattgat caatccatgt tggatcctgg tgcctcagaa acaatgcaag 1021 gagaaagtcg gattttgaat atgattcgac agaaaagcaa agagaacaca gattttgagg 1081 cagaatgttg catagtgtta gatggtatgg agttgcaagg ggaacgtgca atatggacag 1141 attctaccag ctccgtaggt gctgagggct tattcctgca ggaccttggc aatctggctc 1201 agttttggga gtgctgttca tccagctccg gtgatgctga tggggagagt tttggaggag 1261 actctccagt tagactctct cccatcttag acagcacagt gctcaattca cacctgcttg 1321 ctggcaatca agagctcttt tcagatatta atgaaggatc tggtataaac tcttgttttt 1381 cagtgtttga agtgcaatgc agtaattctg ttttaccatt ttcttttgaa acactcaact 1441 tgggaaatga aaatacagat tctagtgcta atatgcttgg gaaaacacag tctagattgc 1501 taatatggac caaaaatagt gcctttgaag aaaatgaaca ctgttctaat ctttcaacaa 1561 gaacttgtag tccatggtcc cattcagaag aaacacgttc agacaatgaa acattaaata 1621 ttcagtttga agaatccaca cagtttaatg ccgaagatat taattatgta gttcctagag 1681 tctcgtcaaa ttatgtagat gaagaacttc tagatttttt gcaagatgaa acttgccagc 1741 aaaacagtag aactttaggt gagattccta cattagtttt caaaaaaaca tctaaactag 1801 aatccgtctg tggtattcag ctagaacaaa aaacagaaaa caaaaatttt gaaactacac 1861 aagtatgtaa tgaaagtcca catggagatg gctacagctc aggggttatt aaagacattt 1921 ggacaaagat ggcagacaca aattctgtgg ctacagtaga aatagaaaga actgatgctg 1981 agttgttttc ggcagatgta aataactact gctgctgtct agatgctgaa gctgaactgg 2041 agacccttca ggagcctgat aaggctgtgc ggaggtcaga gtaccatctg tgggagggac 2101 agaaagagag cctggagaaa agagcatttg cttctagtga gctatcaaac gtggatggtg 2161 gtgattatac aacaccctct aaaccctggg atgtagccca agataaagaa aacacattca 2221 ttcttggagg agtttatgga gaactcaaaa ccttcaatag tgatggggag tgggcagtcg 2281 taccacctag tcacacaaaa ggaagtctgt tacagtgtgc agcttctgat gttgtgacga 2341 tagctggtac agatgtcttt atgaccccag gaaacagttt tgctcctggg cacaggcagt 2401 tatggaaacc cttcgtgtca tttgaacaga atgatcagcc gaagagtggg gaaaatgggt 2461 taaataaggg attttctttt atcttccatg aagacttact aggagcttgt ggcaactttc 2521 aagtcgaaga tcctggactt gaatactcat tttcttcctt tgacttaagc aatccatttt 2581 cacaagttct tcatgtagaa tgctcatttg aacctgaagg gattgcatct ttcagcccca 2641 gttttaaacc gaaatcaatc ctctgttctg attcagacag tgaagtgttt caccccagga 2701 tatgtggtgt tgacagaaca caatacaggg ctattcggat ctctcctcgg actcactttc 2761 gcccaatttc tgcatccgaa ctgtccccag gaggaggaag cgagtcagaa tttgaatctg 2821 agaaagatga agcaaatatt cccattcctt ctcaagttga tatatttgaa gatccgcagg 2881 cagatctcaa acctttggaa gaagatgcag agaaagaagg ccattactat ggaaaatcag 2941 agcttgagtc tggaaaattc cttcccaggt taaaaaaatc tgggatggaa aagagtgctc 3001 agacatcact ggattcccag gaggaatcaa ctgggattct ttcagtagga aagcaaaatc 3061 agtgtttgga atgtagcatg aatgaatccc tggaaataga tttagaaagc tcagaagcaa 3121 attgtaaaat aatggcacaa tgcgaggaag aaattaataa tttttgtggt tgcaaagcag 3181 gttgtcagtt tcctgcttat gaagataatc cagtttcttc gggacagctg gaagagttcc 3241 ctgtattgaa cactgatata caaggaatga atagaagtca agaaaaacag acctggtggg 3301 aaaaagcctt gtactctcct ctttttcctg catcagagtg tgaagaatgt tacacaaatg 3361 ccaagggaga gagtggttta gaagaatatc cagatgctaa agagacaccc agtaatgaag 3421 agcgcctgtt agattttaat agggtgtctt ctgtttatga agcaagatgt acaggagaga 3481 gagattctgg agcaaagtca gatggcttcc gcggaaagat gtgctccagc gccagctcca 3541 cctcggaaga gacaggctca gaaggcggag gcgagtgggt gggccctagt gaagaggagc 3601 tcttttctcg aactcatctc taaacctgca aaatagtaca aattattgtt taaaaatgat 3661 atgtgatgga aaattactct tcagtgagac ctgttaatct aaaacaacaa cttaggtttc 3721 ctcttcaatt aactgattca gattggtaat aattatcttt ctcttcttgc ttattttaga 3781 gttgaggaca gctatcctgt taaagatttt ttttcccagc tgttaaattc ttggctattt 3841 gaaatagact agattgtgtt gtcaaatcaa gaatgggtgt gcatgtgctt gtcttagaag 3901 tatcactgct ttttgcatct taactgcagt taattttcct tccgactgcg gttatatcac 3961 tatgacctta ctagcattgc agtgtcaaca accacttctg ctcttcagag acttcagctt 4021 tggagcattt aggctttgtt ctccaagaac tgggatatcc attcttaccc tacagtggct 4081 tgatgccttt ctgaaggcga gagggaagcc tgggtgactc agcggtggtc tccattcagc 4141 aaaatctcat gtacatttcc agtaggaacc gcagaggtgt gcttttcaag actcaccaaa 4201 tactgtgttt tctctcttag gatttctttt cccctaaagt atcacggaag atactatggt 4261 tcgtgacttt cttgctaact gaagaagcca aggatttggg gtgtggggtc gtatgcgaga 4321 cacagtgggg taagggtgca taccccaccc cttacctgct ctcatactgc agttacattt 4381 acaccaaaac cccatgcagg gttctttgtg gtgagtgttc catacgtgct aaggacctta 4441 gttgcagatt gttactttct ggtgacctat gttgaattga aacccccaaa acttgaaatt 4501 gtgaacattt gacatgcagt aaaggccacc tcatcaccca gagaaatctt tggctgctgc 4561 agctagccgc ttcttggctg tgatgtagta tagcttcgat ctcattttgt gtttgagaga 4621 atgttctggg caagttctgt gtgtggtggg ttggggcggg tagagtcatg agttttccac 4681 atccctgtgt ggtggttttg ctgactgtcg ctccgtggga ctggctcccg tttctccttg 4741 gtgagcccgg ggagccggcg catcttgtga gtcgcgtctg tgcatggcga tccgctcctc 4801 cggctctcat ggcattgtgc cacaggcaga ggccaggagg agcagtatgt gcacagccga 4861 aacattttac attttttaca ttgtttttct tttttaacca actcattgtt taaaaaacaa 4921 aaacaaaaaa aacctaatct gtgaaatcag cgtagcatgc ctggagcatc aggaatggca 4981 gaaaagtctg atgcgctcta gacagcttca ccactcattt gggcaggcag taaacacaca 5041 tataatttat tagctgggag ctgaactggc tgtgaaatct atgatttgct ttgaacattt 5101 gggttttgtt gcctttttct taattgataa cacagaaaag aaagtaccat caaagactgt 5161 ggagtcattg agggtctgtg tgtcctcacc gagagggacc tggtgtgccc gccgggtcga 5221 tcttcccacg tgttagggtt tatttttata caacacatct tttgacactt taaggtgggt 5281 gggtgtgtgt gtgtgtgtgt gtgtgcgcgc gtgcgcgcgc gcatgtgtaa ggttttatgt 5341 tgctgttatt tatttacgaa cttcagatac gtttttatgt atttttcatt cttctggagc 5401 ttcctaaaaa ttgataagca tctgcactga aatataattt aacagcaaaa gtaaaaaagg 5461 attgaaagtt gtaaattcct catatcacta cagtgacgat tattctagaa atcgttgctt 5521 gtgtagcaaa gaccaaataa atagatttca gacacaacct tgagcacagt tgattttgga 5581 cagctgctgt ttattaggaa agggctccag gtggcaaagg tgcacacttc ctcagacaca 5641 ggtgagaaga tgcagcacct tccacaggtg aatgggacgg attcgaagtg agcaaaggga 5701 ttcacaaatt atgtatttat ttgttttcat agttaagtag ctgaagctca gaggctttca 5761 gcaacagaga tgaaagtgtg gctttttagt tttgtgaatg gatgatcaca aagaaaaagc 5821 atttttaaaa agttggcaaa cgctgaaacg cactgtggta tgaagcgcat tgcatttcca 5881 tagcactgaa gtaccagttt ccattcctgg gctgagattg tttttcccgt ggttgtattg 5941 ttctgatttc acgtacacca gagtaactga tttttttttg tttgttttct tgtggagtta 6001 acaccaaata aaaattgtaa aaaac // LOCUS D87071 6368 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0233 gene, complete cds. ACCESSION D87071 NID g1510142 KEYWORDS KIAA0233. SOURCE Homo sapiens male bone marrow myeloblast cell_line:KG-1 cDNA to mRNA, clone:HA4602. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6368) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (02-AUG-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 6368) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohara,O. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA 0201 - KIAA 0280) deduced by analysis of cDNA clones from human cell line KG-1 and brain JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohira,M., Kawarabayasi,Y., Ohara,O., Tanaka,A., Kotani,H., Miyajima,N. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain JOURNAL DNA Res. 3 (5), 321-329 (1996) MEDLINE 97191544 FEATURES Location/Qualifiers source 1..6368 /organism="Homo sapiens" /note="RH_ID:RH25439" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /chromosome="16" /clone="HA4602" /sex="male" /tissue_type="bone marrow" gene 3..6110 /gene="KIAA0233" CDS 3..6110 /gene="KIAA0233" /note="similar to C.elegans protein encoded in cosmid T20D3 (Z68220)." /citation=[3] /codon_start=1 /db_xref="PID:d1013930" /db_xref="PID:g1510143" /translation="MDLRPELPTTLGPVSLRQLGLEHTRYPCLDLGAMLLYTLTFWLL LRQFVKEKLLKWAESPAALTEVTVADTEPTRTQTLLQSLGELVKGVYAKYWIYVCAGM FIVVSFAGRLVVYKIVYMFLFLLCLTLFQVYYSLWRKLLKAFWWLVVAYTMLVLIAVY TFQFQDFPAYWRNLTGFTDEQLGDLGLEQFSVSELFSSILVPGFFLLACILQLHYFHR PFMQLTDMEHVSLPGTRLPRWAHRQDAVSGTPLLREEQQEHQQQQQEEEEEEDSRDEG LGVATPHQATQVPEGAAKWGLVAERLLELAAGFSDVLSRVQVFLRRLLELHVFKLVAL YTVWVALKEVSVMNLLLVVLWAFALPYPRFRPMASCLSTVWTCVIIVCKMLYQLKVVN PQEYSSNCTEPFPNSTNLLPTEISQSLLYRGPVDPANWFGVRKGFPNLGYIQNHLQVL LLLVFEAIVYRRQEHYRRQHQLAPLPAQAVFASGTRQQLDQDLLGCLKYFINFFFYKF GLEICFLMAVNVIGQRMNFLVTLHGCWLVAILTRRHRQAIARLWPNYCLFLALFLLYQ YLLCLGMPPALCIDYPWRWSRAVPMNSALIKWLYLPDFFRAPNSTNLISDFLLLLCAS QQWQVFSAERTEEWQRMAGVNTDRLEPLRGEPNPVPNFIHCRSYLDMLKVAVFRYLFW LVLVVVFVTGATRISIFGLGYLLACFYLLLFGTALLQRDTRARLVLWDCLILYNVTVI ISKNMLSLLACVFVEQMQTGFCWVIQLFSLVCTVKGYYDPKEMMDRDQDCLLPVEEAG IIWDSVCFFFLLLQRRVFLSHYYLHVRADLQATALLASRGFALYNAANLKSIDFHRRI EEKSLAQLKRQMERIRAKQEKHRQGRVDRSRPQDTLGPKDPGLEPGPDSPGGSSPPRR QWWRPWLDHATVIHSGDYFLFESDSEEEEEAVPEDPRPSAQSAFQLAYQAWVTNAQAV LRRRQQEQEQARQEQAGQLPTGGGPSQEVEPAEGPEEAAAGRSHVVQRVLSTAQFLWM LGQALVDELTRWLQEFTRHHGTMSDVLRAERYLLTQELLQGGEVHRGVLDQLYTSQAE ATLPGPTEAPNAPSTVSSGLGAEEPLSSMTDDMGSPLSTGYHTRSGSEEAVTDPGERE AGASLYQGLMRTASELLLDRRLRIPELEEAELFAEGQGRALRLLRAVYQCVAAHSELL CYFIIILNHMVTASAGSLVLPVLVFLWAMLSIPRPSKRFWMTAIVFTEIAVVVKYLFQ FGFFPWNSHVVLRRYENKPYFPPRILGLEKTDGYIKYDLVQLMALFFHRSQLLCYGLW DHEEDSPSKEHDKSGEEEQGAEEGPGVPAATTEDHIQVEARVGPTDGTPEPQVELRPR DTRRISLRFRRRKKEGPARKGAAAIEAEDREEEEGEEEKEAPTGREKRPSRSGGRVRA AGRRLQGFCLSLAQGTYRPLRRFFHDILHTKYRAATDVYALMFLADVVDFIIIIFGFW AFGKHSAATDITSSLSDDQVPEAFLVMLLIQFSTMVVDRALYLRKTVLGKLAFQVALV LAIHLWMFFILPAVTERMFNQNVVAQLWYFVKCIYFALSAYQIRCGYPTRILGNFLTK KYNHLNLFLFQGFRLVPFLVELRAVMDWVWTDTTLSLSSWMCVEDIYANIFIIKCSRE TEKKYPQPKGQKKKKIVKYGMGGLIILFLIAIIWFPLLFMSLVRSVVGVVNQPIDVTV TLKLGGYEPLFTMSAQQPSIIPFTAQAYEELSRQFDPQPLAMQFISQYSPEDIVTAQI EGSSGALWRISPPSRAQMKRELYNGTADITLRFTWNFQRDLAKGGTVEYANEKHMLAL APNSTARRQLASLLEGTSDQSVVIPNLFPKYIRAPNGPEANPVKQLQPNEEADYLGVR IQLRREQGAGATGFLEWWVIELQECRTDCNLLPMVIFSDKVSPPSLGFLAGYGIMGLY VSIVLVIGKFVRGFFSEISHSIMFEELPCVDRILKLCQDIFLVRETRELELEEELYAK LIFLYRSPETMIKWTREKE" 3'UTR 6111..6368 BASE COUNT 1147 a 2060 c 1962 g 1199 t ORIGIN 1 ccatggacct gcgccctgag ctgcccacca ccctgggccc cgtcagcctg cgccagctgg 61 ggctggagca cacccgctac ccctgtctgg accttggtgc catgttgctc tacaccctga 121 ccttctggct cctgctgcgc cagtttgtga aagagaagct gctgaagtgg gcagagtctc 181 cagctgcgct gacggaggtc accgtggcag acacagagcc tacgcggacg cagacgctgt 241 tgcagagcct gggggagctg gtgaagggcg tgtacgccaa gtactggatc tatgtgtgtg 301 ctggcatgtt catcgtggtc agcttcgccg gccgcctcgt ggtctacaag attgtctaca 361 tgttcctctt cctgctctgc ctcaccctct tccaggtcta ctacagcctg tggcggaagc 421 tgctcaaggc cttctggtgg ctcgtggtgg cctacaccat gctggtcctc atcgccgtct 481 acaccttcca gttccaggac ttccctgcct actggcgcaa cctcactggc ttcaccgacg 541 agcagctggg ggacctgggc ctggagcagt tcagcgtatc cgagctcttc tccagcatcc 601 tggtgcccgg cttcttcctc ctggcctgca tcctgcagct gcactacttc cacaggccct 661 tcatgcagct caccgacatg gagcacgtgt ccctgcctgg cacgcgcctc ccgcgctggg 721 ctcacaggca ggatgcagtg agtgggaccc cactgctgcg ggaggagcag caggagcatc 781 agcagcagca gcaggaggag gaggaggagg aggactccag ggacgagggg ctgggcgtgg 841 ccactcccca ccaggccacg caggtgcctg aaggggcagc caagtggggc ctggtggctg 901 agcgcctgct ggagctggca gccggcttct cggacgtcct ctcacgcgtg caggtgttcc 961 tgcggcggct gctggagctt cacgttttca agctggtggc cctgtacacc gtctgggtgg 1021 ccctgaagga ggtgtcggtg atgaacctgc tgctggtggt gctgtgggcc ttcgccctgc 1081 cctacccacg cttccggccc atggcctcct gcctgtccac cgtgtggacc tgcgtcatca 1141 tcgtgtgtaa gatgctgtac cagctcaagg ttgtcaaccc ccaggagtat tccagcaact 1201 gcaccgagcc cttccccaac agcaccaact tgctgcccac ggagatcagc cagtccctgc 1261 tgtaccgggg gcccgtggac cctgccaact ggtttggggt gcggaaaggg ttccccaacc 1321 tgggctacat ccagaaccac ctgcaagtgc tgctgctgct ggtattcgag gccatcgtgt 1381 accggcgcca ggagcactac cgccggcagc accagctggc cccgctgcct gcccaggccg 1441 tgtttgccag cggcacccgc cagcagctgg accaggatct gctcggctgc ctcaagtact 1501 tcatcaactt cttcttctac aaattcgggc tggagatctg cttcctgatg gccgtgaacg 1561 tgatcgggca gcgcatgaac tttctggtga ccctgcacgg ttgctggctg gtggccatcc 1621 tcacccgcag gcaccgccag gccattgccc gcctctggcc caactactgc ctcttcctgg 1681 cgctgttcct gctgtaccag tacctgctgt gcctggggat gcccccggcc ctgtgcattg 1741 attatccctg gcgctggagc cgggccgtcc ccatgaactc cgcactcatc aagtggctgt 1801 acctgcctga tttcttccgg gcccccaact ccaccaacct catcagcgac tttctcctgc 1861 tgctgtgcgc ctcccagcag tggcaggtgt tctcagctga gcgcacagag gagtggcagc 1921 gcatggctgg cgtcaacacc gaccgcctgg agccgctgcg gggggagccc aaccccgtgc 1981 ccaactttat ccactgcagg tcctaccttg acatgctgaa ggtggccgtc ttccgatacc 2041 tgttctggct ggtgctggtg gtggtgtttg tcacgggggc cacccgcatc agcatcttcg 2101 ggctgggcta cctgctggcc tgcttctacc tgctgctctt cggcacggcc ctgctgcaga 2161 gggacacacg ggcccgcctc gtgctgtggg actgcctcat tctgtacaac gtcaccgtca 2221 tcatctccaa gaacatgctg tcgctcctgg cctgcgtctt cgtggagcag atgcagaccg 2281 gcttctgctg ggtcatccag ctcttcagcc ttgtatgcac cgtcaagggc tactatgacc 2341 ccaaggagat gatggacaga gaccaggact gcctgctgcc tgtggaggag gctggcatca 2401 tctgggacag cgtctgcttc ttcttcctgc tgctgcagcg ccgcgtcttc cttagccatt 2461 actacctgca cgtcagggcc gacctccagg ccaccgccct gctagcctcc aggggcttcg 2521 ccctctacaa cgctgccaac ctcaagagca ttgacttcca ccgcaggata gaggagaagt 2581 ccctggccca gctgaaaaga cagatggagc gtatccgtgc caagcaggag aagcacaggc 2641 agggccgggt ggaccgcagt cgcccccagg acaccctggg ccccaaggac cccggcctgg 2701 agccagggcc cgacagtcca gggggctcct ccccgccacg gaggcagtgg tggcggccct 2761 ggctggacca cgccacagtc atccactccg gggactactt cctgtttgag tccgacagtg 2821 aggaagagga ggaggctgtt cctgaagacc cgaggccgtc ggcacagagt gccttccagc 2881 tggcgtacca ggcatgggtg accaacgccc aggcggtgct gaggcggcgg cagcaggagc 2941 aggagcaggc aaggcaggaa caggcaggac agctacccac aggaggtggt cccagccagg 3001 aggtggagcc agcagagggc cccgaggagg cagcggcagg ccggagccat gtggtgcaga 3061 gggtgctgag cacggcgcag ttcctgtgga tgctggggca ggcgctagtg gatgagctga 3121 cacgctggct gcaggagttc acccggcacc acggcaccat gagcgacgtg ctgcgggcag 3181 agcgctacct cctcacacag gagctcctgc agggcggcga agtgcacagg ggcgtgctgg 3241 atcagctgta cacaagccag gccgaggcca cgctgccagg ccccaccgag gcccccaatg 3301 ccccaagcac cgtgtccagt gggctgggcg cggaggagcc actcagcagc atgacagacg 3361 acatgggcag ccccctgagc accggctacc acacgcgcag tggcagtgag gaggcagtca 3421 ccgaccccgg ggagcgtgag gctggtgcct ctctgtacca gggactgatg cggacggcca 3481 gcgagctgct cctggacagg cgcctgcgca tcccagagct ggaggaggca gagctgtttg 3541 cggaggggca gggccgggcg ctgcggctgc tgcgggccgt gtaccagtgt gtggccgccc 3601 actcggagct gctctgctac ttcatcatca tcctcaacca catggtcacg gcctccgccg 3661 gctccctggt gctgcccgtg ctcgtcttcc tgtgggccat gctgtcgatc ccgaggccca 3721 gcaagcgctt ctggatgacg gccatcgtct tcaccgagat cgcggtggtc gtcaagtacc 3781 tgttccagtt tgggttcttc ccctggaaca gccacgtggt gctgcggcgc tacgagaaca 3841 agccctactt cccgccccgc atcctgggcc tggagaagac tgacggctac atcaagtacg 3901 acctggtgca gctcatggcc cttttcttcc accgctccca gctgctgtgc tatggcctct 3961 gggaccatga ggaggactca ccatccaagg agcatgacaa gagcggcgag gaggagcagg 4021 gagccgagga ggggccaggg gtgcctgcgg ccaccaccga agaccacatt caggtggaag 4081 cgagggtcgg acccacggac gggaccccag aaccccaagt ggagctcagg ccccgtgata 4141 cgaggcgcat cagtctacgt tttagaagaa ggaagaagga gggcccagca cggaaaggag 4201 cggcagccat cgaagctgag gacagggagg aagaagaggg ggaggaagag aaagaggccc 4261 ccacggggag agagaagagg ccaagccgct ctggaggaag agtaagggcg gccgggcggc 4321 ggctgcaggg cttctgcctg tccctggccc agggcacata tcggccgcta cggcgcttct 4381 tccacgacat cctgcacacc aagtaccgcg cagccaccga cgtctatgcc ctcatgttcc 4441 tggctgatgt tgtcgacttc atcatcatca tttttggctt ctgggccttt gggaagcact 4501 cggcggccac agacatcacg tcctccctat cagacgacca ggtacccgag gctttcctgg 4561 tcatgctgct gatccagttc agtaccatgg tggttgaccg cgccctctac ctgcgcaaga 4621 ccgtgctggg caagctggcc ttccaggtgg cgctggtgct ggccatccac ctatggatgt 4681 tcttcatcct gcccgccgtc actgagagga tgttcaacca gaatgtggtg gcccagctct 4741 ggtacttcgt gaagtgcatc tacttcgccc tgtccgccta ccagatccgc tgcggctacc 4801 ccacccgcat cctcggcaac ttcctcacca agaagtacaa tcatctcaac ctcttcctct 4861 tccaggggtt ccggctggtg ccgttcctgg tggagctgcg ggcagtgatg gactgggtgt 4921 ggacggacac cacgctgtcc ctgtccagct ggatgtgtgt ggaggacatc tatgccaaca 4981 tcttcatcat caaatgcagc cgagagacag agaagaaata cccgcagccc aaagggcaga 5041 agaagaagaa gatcgtcaag tacggcatgg gtggcctcat catcctcttc ctcatcgcca 5101 tcatctggtt cccgctgctc ttcatgtcgc tggtgcgctc cgtggttggg gttgtcaacc 5161 agcccatcga tgtcaccgtc accctcaagc tgggcggcta tgagccgctg ttcaccatga 5221 gcgcccagca gccgtccatc atccccttca cggcccaggc ctatgaggag ctgtcccggc 5281 agtttgaccc ccagccgctg gccatgcagt tcatcagcca gtacagccct gaggacatcg 5341 tcacggcgca gattgagggc agctccgggg cgctgtggcg catcagtccc cccagccgtg 5401 cccagatgaa gcgggagctc tacaacggca cggccgacat caccctgcgc ttcacctgga 5461 acttccagag ggacctggcg aagggaggca ctgtggagta tgccaacgag aagcacatgc 5521 tggccctggc ccccaacagc actgcacggc ggcagctggc cagcctgctc gagggcacct 5581 cggaccagtc tgtggtcatc cccaatctct tccccaagta catccgtgcc cccaacgggc 5641 ccgaagccaa ccctgtgaag cagctgcagc ccaatgagga ggccgactac ctcggcgtgc 5701 gtatccagct gcggagggag cagggtgcgg gggccaccgg cttcctcgaa tggtgggtca 5761 tcgagctgca ggagtgccgg accgactgca acctgctgcc catggtcatt ttcagtgaca 5821 aggtcagccc accgagcctc ggcttcctgg ctggctacgg catcatgggg ctgtacgtgt 5881 ccatcgtgct ggtcatcggc aagttcgtgc gcggattctt cagcgagatc tcgcactcca 5941 ttatgttcga ggagctgccg tgcgtggacc gcatcctcaa gctctgccag gacatcttcc 6001 tggtgcggga gactcgggag ctggagctgg aggaggagtt gtacgccaag ctcatcttcc 6061 tctaccgctc accggagacc atgatcaagt ggactcgtga gaaggagtag gagctgctgc 6121 tggcgcccga gagggaagga gccggcctgc tgggcagcgt ggccacaagg ggcggcactc 6181 ctcaggccgg gggagccact gccccgtcca aggccgccag ctgtgatgca tcctcccggc 6241 ctgcctgagc cctgatgctg ctgtcagaga aggacactgc gtccccacgg cctgcgtggc 6301 gctgccgtcc cccacgtgta ctgtagagtt ttttttttaa ttaaaaaatg ttttatttat 6361 acaaatgg // LOCUS D87073 5878 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0236 gene, complete cds. ACCESSION D87073 NID g1510146 KEYWORDS KIAA0236. SOURCE Homo sapiens male bone marrow myeloblast cell_line:KG-1 cDNA to mRNA, clone:HA4654. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5878) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (02-AUG-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 5878) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohara,O. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA 0201 - KIAA 0280) deduced by analysis of cDNA clones from human cell line KG-1 and brain JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohira,M., Kawarabayasi,Y., Ohara,O., Tanaka,A., Kotani,H., Miyajima,N. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain JOURNAL DNA Res. 3 (5), 321-329 (1996) MEDLINE 97191544 FEATURES Location/Qualifiers source 1..5878 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /clone="HA4654" /sex="male" /tissue_type="bone marrow" 5'UTR 1..436 gene 437..5500 /gene="KIAA0236" CDS 437..5500 /gene="KIAA0236" /note="similar to Human zinc finger protein(ZNF142)" /citation=[3] /codon_start=1 /db_xref="PID:d1013932" /db_xref="PID:g1510147" /translation="MTDPLLDSQPASSTGEMDGLCPELLLIPPPLSNRGILGPVQSPC PSRDPAPIPTEPGCLLVEATATEEGPGNMEIIVETVAGTLTPGAPGETPAPKLPPGER EPSQEAGTPLPGQETAEEENVEKEEKSDTQKDSQKAVDKGQGAQRLEGDVVSGTESLF KTHMCPECKRCFKKRTHLVEHLHLHFPDPSLQCPNCQKFFTSKSKLKTHLLRELGEKA HHCPLCHYSAVERNALNRHMASMHEDISNFYSDTYACPVCREEFRLSQALKEHLKSHT AAAAAEPLPLRCFQEGCSYAAPDRKAFIKHLKETHGVRAVECRHHSCPMLFATAEAME AHHKSHYAFHCPHCDFACSNKHLFRKHKKQGHPGSEELRCTFCPFATFNPVAYQDHVG KMHAHEKIHQCPECNFATAHKRVLIRHMLLHTGEKPHKCELCDFTCRDVSYLSKHMLT HSNTKDYMCTECGYVTKWKHYLRVHMRKHAGDLRYQCNQCSYRCHRADQLSSHKLRHQ GKSLMCEVCAFACKRKYELQKHMASQHHPGTPSPLYPCHYCSYQSRHKQAVLSHENCK HTRLREFHCALCDYRTFSNTTLLFHKRKAHGYVPGDQAWQLRYASQEPEGAMQGPTPP PDSEPSNQLSARPEGPGHEPGTVVDPSLDQALPEMSEEVNTGRQEGSEAPHGGDLGGS PSPAEVEEGSCTLHLEALGVELESVTEPPLEEVTETAPMEFRPLGLEGPDGLEGPELS SFEGIGTSDLGAEENPLLEKPVSEPSTNPPSLEEAPNNWVGTFKTTPPAETAPLPPLP ESESLLKALRRQDKEQAEALVLEGRVQMVVIQGEGRAFRCPHCPFITRREKALNLHSR TGCQGRREPLLCPECGASFKQQRGLSTHLLKKCPVLLRKNKGLPRPDSPIPLQPVLPG TQASEDTESGKPPPASQEAELLLPKDAPLELPREPEETEEPLATVSGSPVPPAGNSLP TEAPKKHCFDPVPPAGNSSPTEAPKKHHLDPVPPAGNSSPTEALKKHRFEQGKFHCNS CPFLCSRLSSITSHVAEGCRGGRGGGGKRGTPQTQPDVSPLSNGDSAPPKNGSTESSS GDGDTVLVQKQKGARFSCPTCPFSCQQERALRTHQIRGCPLEESGELHCSLCPFTAPA ATALRLHQKRRHPTAAPARGPRPHLQCGDCGFTCKQSRCMQQHRRLKHEGVKPHQCPF CDFSTTRRYRLEAHQSRHTGIGRIPCSSCPQTFGTNSKLRLHRLRVHDKTPTHFCPLC DYSGYLRHDITRHVNSCHQGTPAFACSQCEAQFSSETALKQHALRRHPEPAQPAPGSP AETTEGPLHCSRCGLLCPSPASLRGHTRKQHPRLECGACQEAFPSRLALDEHRRQQHF SHRCQLCDFAARERVGLVKHYLEQHEETSAAVAASDGDGDAGQPPLHCPFCDFTCRHQ LVLDHHVKGHGGTRLYKCTDCAYSTKNRQKITWHSRIHTGEKPYHCHLCPYACADPSR LKYHMRIHKEERKYLCPECGYKCKWVNQLKYHMTKHTGLKPYQCPECEYCTNRADALR VHQETRHREARAFMCEQCGKAFKTRFLLRTHLRKHSEAKPYVCNVCHRAFRWAAGLRH HALTHTDRHPFFCRLCNYKAKQKFQVVKHVRRHHPDQADPNQGVGKDPTTPTVHLHDV QLEDPSPPAPAAPHTGPEG" 3'UTR 5501..5878 BASE COUNT 1363 a 1857 c 1542 g 1116 t ORIGIN 1 caaaacatag agtaccccgg cagccggcaa gaggaagaga gagtggcttc cacatcccca 61 atatcctaga ggcggctgag ccggaggcgg tcgcacaaag cgggccccgg gggccgttcc 121 agccgcggcc gaccatagag atgcggctcc cgccggctct gggtctggag ataggaaagc 181 tgaggcccag agaagcgaag cgactgtgtc tgtccaagac cacgcgccct cctgcccgga 241 agataagcgt atttcttctc tggtgcccac ctgtctccta cctcaccctg ccctcccgca 301 ggtgaaggtt cttaatcttg acggctcagc gtcctccttg gctccccccg gaggccatgt 361 atggtcaagc ttgaagattc cccagaacaa cgctaatatt cacatttaag aagccaaaac 421 acacaagtcg gtggtgatga cagaccccct tttggactca cagccagcca gtagcaccgg 481 ggagatggat ggactgtgcc ctgagctatt gctgatcccc ccgcctctct ctaaccgtgg 541 aatcctgggg cctgtccaga gcccctgtcc ttcccgggac cctgcaccta tacctactga 601 gccaggctgc ctgctggtag aggccacagc aactgaagag ggaccaggga acatggagat 661 cattgtggag acagtagctg gaaccctgac cccaggtgct cctggagaga ccccagctcc 721 caaactgcct ccaggagaga gagaaccttc acaggaagca ggtacaccct tgcctgggca 781 ggagacagct gaagaggaga atgtagagaa agaagagaag agtgacaccc agaaggactc 841 ccaaaaggct gtggataaag gccaaggggc tcagcggctg gaaggggatg tggtctctgg 901 caccgagtcc ctcttcaaga cccatatgtg tccagagtgt aagcgctgct ttaagaagcg 961 gactcatctg gtggagcacc tgcatctcca cttcccagac cccagcctcc agtgccctaa 1021 ctgccagaag ttcttcacca gtaagagcaa gctcaagacc catctgctgc gggagctggg 1081 tgaaaaggcc caccactgcc cactgtgcca ctacagtgcg gtggagagga atgcactcaa 1141 ccgccacatg gccagcatgc atgaagatat ttccaacttc tactcagaca cctatgcctg 1201 tcctgtctgc cgtgaggaat tccgcctcag ccaggcccta aaggagcacc tcaagagcca 1261 cacggcagca gccgcagcag agccattacc ccttcgctgc tttcaggagg gctgcagcta 1321 tgcagcaccc gaccgcaagg ccttcattaa gcacctgaag gagacccatg gggtgcgggc 1381 tgtggagtgc cgccatcact catgtcccat gctctttgcc acagccgaag ccatggaggc 1441 ccaccacaag agtcactacg ccttccactg cccccactgt gattttgctt gttccaataa 1501 gcacctattc cgtaaacaca agaagcaggg ccaccctggc agtgaagagc tgcgctgcac 1561 cttctgcccc tttgccacct tcaacccagt ggcttaccag gatcatgtag gcaagatgca 1621 tgctcatgaa aagatccacc agtgtcctga gtgcaacttt gccactgccc acaagagggt 1681 gctcatccga cacatgcttc tacatacggg tgagaagccc cacaagtgtg agctgtgtga 1741 cttcacatgc cgagacgtga gctacctatc caagcacatg ctgacccact ccaacaccaa 1801 ggattacatg tgcactgaat gtggctatgt caccaagtgg aagcactacc tccgtgtgca 1861 catgcgaaaa catgcagggg acctcaggta tcagtgcaac cagtgctcct atcgctgtca 1921 ccgggctgat cagctgagca gccacaagct gcggcatcag ggcaagtctc tgatgtgtga 1981 ggtgtgtgcc ttcgcctgca agcggaagta tgagctgcag aagcacatgg cttcccagca 2041 ccaccctggc acaccgtccc cactctaccc ttgccactac tgcagttacc agagccgcca 2101 caagcaggct gtgctgagcc atgagaactg caagcatacc cgcctccgtg agttccactg 2161 tgccctctgt gactaccgca ccttcagcaa caccacactc ttgttccata aacgcaaggc 2221 ccatggctat gtacctggag accaggcctg gcagctccgc tatgcaagcc aggagccaga 2281 aggggccatg cagggcccaa cacccccacc agattcagag ccctcaaacc agctgtcagc 2341 ccgacctgag gggccaggtc acgaacctgg gactgtggtg gaccccagct tggaccaggc 2401 cctgccagag atgagtgagg aggtcaacac tggaagacag gagggcagtg aggctcccca 2461 tgggggtgac ctgggtggca gtcccagccc agcagaggtg gaggagggca gctgcacact 2521 acacctagag gccctgggag tagagctgga gtctgtgact gagccacccc ttgaggaggt 2581 cactgaaaca gcccctatgg agttcaggcc cctgggactg gaagggccag atggactgga 2641 aggaccagag ctatctagct ttgaaggtat tgggacttct gacttgggtg ctgaagaaaa 2701 tccccttctg gaaaagccag tgtctgagcc ctccacaaat cctccatcct tagaggaggc 2761 tcctaacaac tgggtaggaa ccttcaagac aactccacct gctgagacag cacccttgcc 2821 cccattacct gagtcagagt cattactcaa ggccctaagg agacaggaca aagaacaagc 2881 agaggcattg gtgctagagg ggcgggtgca gatggtagtg atccagggag aggggcgagc 2941 cttccgctgc ccacactgcc cttttatcac tcgccgggag aaggccctga atctgcactc 3001 caggactggg tgccaaggcc gccgagagcc cctgctgtgc cccgagtgtg gggctagctt 3061 caagcaacaa cgcggcctca gcacccacct gctgaagaag tgccctgttc tactcagaaa 3121 gaacaagggc ttgcccagac cagattcacc catccctctg caacctgtgc tcccaggtac 3181 ccaggcctca gaggacacag aaagtgggaa gcccccacct gcatcacaag aagcagagct 3241 actgcttcca aaagatgctc ctttggagct tcccagggag ccagaagaaa cagaagagcc 3301 tcttgccaca gtctctggtt ccccagtccc tcctgcagga aactccttgc ccacagaggc 3361 ccctaagaag cactgctttg acccagtccc tcctgcagga aactcctcac ccacggaggc 3421 ccctaagaag caccaccttg acccagtccc tcctgcagga aactcctcac ccacagaggc 3481 cctgaagaag caccgctttg agcagggcaa gtttcactgc aactcctgcc cattcctttg 3541 ttcccggctc tcctctatta cctctcacgt ggctgaaggc tgcagggggg gacgtggcgg 3601 gggaggaaaa cgagggaccc cccagaccca gcctgatgtg tccccgttga gcaatgggga 3661 ctctgctccc ccgaagaatg ggagtacaga gtccagctct ggtgatgggg atacagttct 3721 ggttcaaaag cagaaggggg ctcgcttctc ctgccctaca tgtcccttta gctgccagca 3781 ggaacgggct ctgaggactc accagatccg gggctgcccc ctcgaggagt ctggagagct 3841 gcactgcagc ctctgcccat tcactgctcc tgctgccact gccttaaggc tccaccagaa 3901 gcggaggcac cccactgcag ccccagcccg tgggccccgg ccccatctac agtgtgggga 3961 ctgtggcttc acctgtaaac agagccgttg catgcagcag caccggcggc tcaagcacga 4021 gggggtgaag ccccatcagt gccccttctg tgacttttcg accaccagac ggtaccggtt 4081 agaggctcac cagtcccgac acacaggcat tggccgcatc ccctgcagct cttgccccca 4141 gacgtttggt accaactcga aactgcgctt gcaccggtta agggtacatg acaaaacacc 4201 tacccacttc tgtccacttt gtgactatag tggctacctt cgccatgaca tcactcgtca 4261 tgtcaacagc tgccaccaag gcaccccagc ctttgcctgc tcccagtgtg aagcccagtt 4321 cagctcagag acagcactta agcagcatgc tctgcgccga caccccgagc ctgcacagcc 4381 tgcccctggc tctcctgcag agaccactga gggccccctg cactgttccc gctgtgggtt 4441 gctgtgcccc agccctgcca gcttacgagg acacacccgt aaacagcacc cacggcttga 4501 gtgtggggcc tgccaggagg ccttccctag ccgactggct ctggatgagc accggaggca 4561 gcagcatttc agccaccgct gtcagctctg tgactttgct gcccgggagc gggtgggcct 4621 ggtaaagcac tacctggaac agcatgagga gacttcagca gccgtggcag cctcagatgg 4681 ggatggggat gctggccagc ccccgctaca ctgccccttt tgtgacttca catgccgcca 4741 tcagctggta ctagatcacc atgtgaaagg gcatgggggc actcgtctct acaagtgcac 4801 cgattgtgct tacagcacca agaaccgaca gaagatcacc tggcacagcc gcatccacac 4861 tggggaaaag ccttaccact gtcacctctg cccctatgcc tgtgctgatc cctctcgtct 4921 caagtaccac atgcggatcc acaaggagga acggaagtac ctgtgccctg agtgtggcta 4981 caagtgcaag tgggtcaacc agctgaaata ccacatgacc aagcatacag gactgaagcc 5041 ataccagtgt cccgagtgtg agtactgcac caaccgggct gatgcactgc gtgtgcacca 5101 ggagacccgg catcgagaag cacgggcttt catgtgtgag cagtgtggca aggccttcaa 5161 gacgcgcttc ctgctgcgca cccaccttcg caagcacagt gaggccaaac cctatgtgtg 5221 caatgtgtgc caccgtgctt tccgctgggc tgctggcctg cgccatcatg ccctcaccca 5281 caccgaccgc caccccttct tttgccgcct ctgcaactac aaggccaagc aaaagttcca 5341 ggtggtcaag cacgtacgca ggcaccaccc tgaccaagcc gacccaaacc agggtgtggg 5401 caaagacccc accaccccca cagtgcacct gcatgatgtg cagctggagg atcccagccc 5461 tcctgctcct gccgctcccc acactggacc tgagggctga aagcctgccc cacctcctgt 5521 ataggaagag ggtatggtct gagatgtgca gactgggacc agcgctagcc tgaggagctc 5581 agagcctaag gaaagactgg cttttggggt acaagggtga ctagaacctt cctgggactc 5641 tggctatagt actttgaaat tatcacccat ataaaagagg gacatggact ataacgttga 5701 tttcttattg ctgtacattg cgtttttaac ctgcaagttc tcagtttctt caccatcact 5761 ccatcaaagt ccctggctat aagatctgga ttttacccac tccatcttct ctttccttct 5821 tactgtgtca attcctattt tctttcagaa tcttctaaaa acagttgtat ctaaccgc // LOCUS D87074 7239 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0237 gene, complete cds. ACCESSION D87074 NID g1510148 KEYWORDS KIAA0237. SOURCE Homo sapiens male bone marrow myeloblast cell_line:KG-1 cDNA to mRNA, clone:HA6286. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7239) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (02-AUG-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 7239) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohara,O. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA 0201 - KIAA 0280) deduced by analysis of cDNA clones from human cell line KG-1 and brain JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohira,M., Kawarabayasi,Y., Ohara,O., Tanaka,A., Kotani,H., Miyajima,N. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain JOURNAL DNA Res. 3 (5), 321-329 (1996) MEDLINE 97191544 FEATURES Location/Qualifiers source 1..7239 /organism="Homo sapiens" /note="RH_ID :RH25441" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /chromosome="1" /clone="HA6286" /sex="male" /tissue_type="bone marrow" 5'UTR 1..475 gene 476..1402 /gene="KIAA0237" CDS 476..1402 /gene="KIAA0237" /note="similar to a C.elegans protein encoded in cosmid T10A3(U41035)" /citation=[3] /codon_start=1 /db_xref="PID:d1013933" /db_xref="PID:g1510149" /translation="MFNGEPGPASSGASRNVVRSSSIGGEICGSQQAGGGAGTTTAKK RRSSLGAKMVAIVGLTQWSKSTLQLPQPEGATKKLRSNIRRSTETGIAVEMRSRVTRQ GSRESTDGSTNSNSSDGTFIFPTTRLGAESQFSDFLDGLGPAQIVGRQTLATPPMGDV HIAIMDRSGQLEVEVIEARGLTPKPGSKSLPATYIKVYLLENGACLAKKKTKMTKKTC DPLYQQALLFDEGPQGKVLQVIVWGDYGRMDHKCFMGMAQIMLDELDLSAAVTGWYKL FPTSSVADSTLGSLTRRLSQSSLESATSPSCS" 3'UTR 1403..7239 BASE COUNT 1503 a 1973 c 2009 g 1754 t ORIGIN 1 cggacgcgtg ggctgctgcg gagccggcag cgcaggcggg cggagcggag ctgcccccgt 61 ggtggcgggc gatgcccccg tgagcctccc tcgccgcccc tccccgcccc gcgtgcctat 121 ccactcggag tccgcgccag cctggggccg ggccgcgcct actgccgggt tcgcggggcg 181 gggtcccggg gcagcacctg ccccgccttg cggagccgcc tcggcctgtg gaggccccct 241 ccctgtctgg accccggccc cacctccgga cccttttatc acatcgcctc ctctgggagc 301 ctgccctgat tgccttcacc tcatttcttg aaaattggtg ttttggcaga agtcaattga 361 agccttgtgc aaatgcccta ggggtgtgct gttgggaggc agccccctgt gatgcggaac 421 accaggctca gattcatcag ttcgagctgc ctgaggccct gccaccccgg ggaccatgtt 481 taacggggag ccaggtcctg cctcatctgg ggcctccagg aatgtggtgc ggagctccag 541 cattggcggt gaaatctgcg gatcccagca agccgggggc ggggctggga ccaccaccgc 601 caagaagcgg cggagcagcc tgggtgccaa gatggtggcc atcgtgggcc tgactcagtg 661 gagcaagagc acactccagc ttccgcagcc tgaaggggcc accaagaagc tgcgcagcaa 721 catccgccgg agcacggaga caggcatcgc ggtggagatg cggagccggg tcacacgcca 781 gggcagccgg gagtccaccg atgggagcac caacagcaac agctccgacg gcacgttcat 841 cttccccact acccggctag gggctgaaag ccagttcagc gatttcctgg atgggctggg 901 accagctcag attgtggggc gacagacact ggcaacacca cccatgggag atgtgcacat 961 tgccatcatg gaccggagtg gccagctgga ggtggaagtg attgaagctc ggggcctgac 1021 ccccaaacca ggctccaaat ccctcccagc cacctatatc aaggtttacc tgctggagaa 1081 tggggcctgc ttggccaaga agaagacaaa gatgaccaag aagacctgtg atcccctgta 1141 ccagcaggct ctgctctttg acgagggacc ccagggcaag gtgctgcagg tgatcgtctg 1201 gggagactat ggccgcatgg accacaagtg cttcatgggc atggcccaga tcatgctgga 1261 cgagctggac ctcagcgccg cggtcaccgg ctggtacaaa ctcttcccca cctcctcagt 1321 ggcagactcc acactcggat ccctcaccag gcgcctgtcc cagtcttccc tggagagtgc 1381 caccagcccc tcatgctctt aaggatgtca ggaagaggcc aggatggtgg tgtggggagg 1441 ggtgcctgct ggccccatgt cctcccctgt acatagtctt cgtgtctttc tggacccctt 1501 gtcctgctgc atgcctgttg gctactgggc tcatcccagc tggcagtgga gactgtagtg 1561 tgtgcgtgtg tgcgtgcgtg tgtgtgtgtg tgtacgtgac cacgttctat ctgttcattt 1621 gtctgggtat agtcactcct ggtgatgata tgggctgaaa tgtctccacg tctctttgtg 1681 tcttgttgaa aagaaaccca aaggagtgtt gtgtggacat gactcaccct gaggagtctc 1741 cagggatgga ggtggggcat gcggccactg gtcggtgctg cctggtcctg gcctggggca 1801 gagcctgtgt gttcttcacc tgtgcctgct cctggtggtc tctgctttgt tttctgtctt 1861 gtcttttgtt ccactcttga ctctcccggc tctgccactg ttttctgaga aatgtagcat 1921 ccgctgcagc tggccacact gagggccctc tgggaacccc accccactgg agccgctccg 1981 gcagctcttc ctgccactga atgcgttctg cagcatgtag catgcccacc tagctccctg 2041 gccagggccc tggggaggca gagggtaccc aggggactga gggcttagaa atgactttct 2101 ctatgaggct ggaacctcct ccttcttcca gtgaagcacg aggtcttggt tccagggttc 2161 ctggccaggt gcccccttag catttgttct tcatctcctg tctcttcaag cctccccact 2221 ccaccgtgcc gaaggagctc tgccagtggg cctgggcagg cagccacaga gggcatgtta 2281 tctgctaaag caaacagtcc tcctgaggcc ctgagggtgg ccctgacccc ctcagggctc 2341 attcctggtg ggcatgactc ggtaaggagg ttctgtgggt agggctgttt gctggaataa 2401 ggagggctga ggctgagttg ttccccgcca ctgttgagga tcccatctga catttgggga 2461 ttcactgcat gaagttgttc atttggggct ccagtttttg tgcatttcca gaccagggtc 2521 cctgtctggg agctcagact tagctgggct ccagcagcct ggctccggac tctcgtctgc 2581 caccatcacc agctctgtct aagcaatact tacccctccg ggcttccctg ctttcctgtg 2641 ccccactgct gccctctgca gagccatctg ctccaggcac ccacctgcct ctgcttgctt 2701 tctcacccat gtctggttgg agcttctcac agccactcaa accctgactt gatttgacaa 2761 ctgggccctg ttggtggaga ccagtgcctg aactggaccc tgtgaaatct gtcccgtggg 2821 aatcctgaag tctgactcag gaatgcccag acctgtccct gtccctcctc agctctaggg 2881 tgaaaggcga gaggttgcac aggacatgga cagagcagcc cttgggtttg tatctggtag 2941 gggagaaagc agcaggacag tggaggtgtg tggaaggcac tctttccacc tctctccagg 3001 cattatcccc aggtgctgaa atggctgtgg ccccagcacc tgaggcaggg gtacgctacc 3061 tggggtagca gtagaaaggt cttggggtct gaatgacttg gggcctccat tcataaccaa 3121 agttgtaggg gcatggaggc agtggcgcct tatggatagg tcatctcagc ccaaagggcc 3181 tccttggctg gcatggttct gtgttacgtt gaagccagag tcttataaca cctcccaaga 3241 aagatagggg aagaagccag aaccccacct ggcctgccca caggacaaga agtgggtaag 3301 ggcaggaagg aaatgaacgg agttagctcc caggctccta tctctgcctg agcctctatt 3361 cttatattat cagagaaagg gaccattggg tgcatagaga tggggaggag gccactgggc 3421 tgaattttct ctttaggcgg aaatgctctc cccaggccca ttgcgttcgt caagttcttg 3481 gaaatggaca aagggctctg tcctcctcga ccctagtggg ggatcaagaa ggaaactccg 3541 ttgcaaaagg gtattttaat ctcctgttta tgatatattc acctctagag cagtcactgt 3601 cagggtgttg cgaaaatact ctacaccttt gggatgatag ggttgttagt gacccacagg 3661 acagtataga tgtttgtgga tgtagcactg agtggtgata cccagaccag cagtcacccc 3721 aggaagtggg ggctacccat accctcattc ctctggtggg gagctgcctg cagagaggcc 3781 tgccactggg ggttccaggc cgtggtgacc cttgcctctg gggaagggtt agcacagggg 3841 aggttctagc tggaggaggg gtctgatgtg ctggtaactt gggctgacct acctacagtc 3901 cctggctgcc aaaactgctc agcagttggg ccactccact attccacctc tctaaaagaa 3961 ggtaatttcc tccccaaatt gaccttgggg aattattctt ttaacccttt gcaccaaata 4021 agttactcat ccccacctgg attttacccc atgagggtaa agttgtgtga ggctgacacg 4081 tctgtgtgat gcagtgtggc tgcacttaag ggtctgtttc tcagcatcat ggatgcaggg 4141 gcttgtctga aaagccactc tggacctagc ctgtcccaga agaggagaat gcacaagtgt 4201 aactcctggt tgtttgctgg ggtgggggag catctgctgt ttgaggacgg ggggtgggga 4261 aggaaggaac atgatccctc cagaagtctc ccaccctggg gccaactcac tgccatgttc 4321 agtgtcccgg ctccaaatgc ccccttgccc agatgaaacc ctgcagtggt tacaggaatg 4381 gagctctttg tcattccacc tcctctggtc aggcgaggtt cactgtgcat atggcagaga 4441 caggagtggc cctgcagtga tgttgggttg tgggcaggga cagtgatgga tgactcagag 4501 cgtctgtctt tgtatttctg ctctgttcat tctgtcccac tttcttcata gactcctttt 4561 ccctgcaatg ggtttttggt atgaaaaagg ctccagtaaa tggagccaag tcttggtttg 4621 acaaggggaa cttgatcctt cagggaagaa tcatctccaa aatgactccc catctggttt 4681 cttctaccac ccaattctac taggaaagga gcacttagga ggctcttggg atgcagcaag 4741 tcccaggcaa tgccagcatt tcatgggggc tgaggcagaa cccaggagct ccaagaaagg 4801 ccacccatag agctcatcct ctgtggaatc actgccggtt agaacactga ggtcaagcca 4861 gtcctcccat ggtcatgcca cccaccaggg aagcatgcct gatctcttct ttactgccag 4921 ccttgaggag agcagaagcc tccattttta aagacaataa agacctccaa gggtacttct 4981 ttggaaatga aatgtagcac aatcttagcc tacgttacca ggagccctga catgcaacca 5041 gggtccctct tccatgccct gctgcccagg agttgctgag ctcctcttcc cctggggttc 5101 cagccctcct aatacctcat attccccatc ttcctcagcc cagaaagcaa tggggcttta 5161 gtgatgctcc tcttttgtgt ctctctggtt gcctctagca ctgtgcaaac tctgcaagaa 5221 attgctgcct ttgcttgatg ttgtagatga gttgccctcc acctggctgg agagatggca 5281 catcattcag ggccagaagg ttgtccagca gtgtttctgc agtggctgca gggagatgga 5341 aaagaagagc cctgcttccc tgcctcctcc tacctttctc ttccactcct cagagttttc 5401 ttctccagta tcctgacatg taaagagatt ctttaaaatg ctgcttcttc tactggactg 5461 tctttcactg agtagtcagg gaggagatga aaatgtggac acatcccccc gccttctccc 5521 tcaacctttt cactcacgtc agaagaggtt ggggcaaaac aaaacaagca tgattgaaga 5581 gcaggggaag cccacacaat caggtgagcc ctgatggggg ctggactggc ctggtctggc 5641 tgggaaagga ttcttttaaa agcgtgcctg agcatctcag agcaatgtca gtgatgccaa 5701 agagagcaac ttggcctcct tgggcaccaa ctctgatggc tgagttgcac aaacgtggct 5761 cagatgttgg catgcccgat ggtgggacct tgttctctga taaagggctt aatctttcca 5821 tgtagcaaag caactttccc tccctcccct ccccccgaac cctgagcttg ggcttgtgtg 5881 ggcccagcat ggctgtgcct gtgagggaag ccacatcagt gagagaaagt tgcattctct 5941 caggggccta gattggcttg gggcaggtgc ttgttgaaaa gcatgagtgt ttctgctttg 6001 ggaaaccctt ccagggctct ggttggaaac ttggccagta aggacagggc cttgggcctc 6061 ccaaggagtt cacaatgatg ccaggaaggg tccctgagcc tgggttccag tcagcttggc 6121 taacagaatg gggcttggga attccagggg caactgagca tccaccccat tagccagtga 6181 tctggacagg gacagctggc tacagggaat caaaggctgt tctgtacagt tttaccagaa 6241 actgtgacct tgggtaaggc tcctcacttt tctgggcctc aacttcctca tctgcaaaat 6301 cagagcatta gctgctctcc ccatgattcc ctcccagttt taattaaaag tctctgatct 6361 ggagcacatg agggttgcga gcctggtgca cctggttcct gtctccgtgg gctggtttcc 6421 ttcagcccct gatgcccagg ctgggtgcag gctctcatct ggtccatgcc cagatgctgc 6481 ctcagacaag aactctggga atcctaacag actctgcttt cctctcttcg tggtttgtcc 6541 tccctctttt attcttgtgt aatgatgaga cataattaag ggtcatctac aatggacaat 6601 tttcaaggtg ctgatgtgat gtcacaacct ccgctacatc ctgagtaagt cagtgtcccc 6661 aacagcagat gaggctgggt ctttcctcat ttctgcttct gggatgacat cagtgggaga 6721 gttgaagatc tgagaattct aaaggaaatg ttctctgatg ggaggagaga gggaacttat 6781 ttcccttcaa agtagtttgc ttcttaagac cactcctccc tccagaatgt ctttaaccag 6841 acatcatttc agaaggtggg gcagctgctt ccttaggaag agtgggcctg atagctcaac 6901 caactccttt agcatgtaaa ctcacagaag gcaagaaccc ctttttttag actttccaaa 6961 tgcatcctgc aaagagagag atagctgata gggactgaca agcacactgt ttagataaga 7021 agcaattacc cttttatatc tgtgctctat acattttcta tgtgcagcat ctttcctaac 7081 ttgttgtggt tcccgggtgg caggtgcaca gctgggaggg actgccggct gtttcgtaca 7141 acattcttgc caattccctt gaggagaaaa ttcttcacat ggcttctgca tgtacagtat 7201 ttgggcagca aaacatgatt aaagtcagtt tgaaaatgg // LOCUS D87119 4221 bp mRNA PRI 30-AUG-1996 DEFINITION Human cancellous bone osteoblast mRNA for GS3955, complete cds. ACCESSION D87119 NID g1507671 KEYWORDS GS3955. SOURCE Homo sapiens cancellous bone tissue_lib:3 end-directed library osteoblast cell_line:primary-cultured cDNA to mRNA, clone_lib:lambda ZAPII clone:GS3955. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4221) AUTHORS Ohno,I., Hashimoto,J., Takaoka,K., Ochi,T., Okubo,K. and Matsubara,K. TITLE The cloning of a cDNA for putative serine/threonine kinase expressed in human osteoblast JOURNAL Unpublished (1996) REFERENCE 2 (bases 1 to 4221) AUTHORS Ohno,I. TITLE Direct Submission JOURNAL Submitted (14-AUG-1996) to the DDBJ/EMBL/GenBank databases. Ikko Ohno, Institute for Molecular and Cellular Biology, Osaka University, Molecular Genetics Matsubara Laboratory; 1-3 Yamada-oka, Suita, Osaka 565, Japan (E-mail:ikko@imcb.osaka-u.ac.jp, Tel:81-6-879-7992, Fax:81-6-877-1922) FEATURES Location/Qualifiers source 1..4221 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="primary-cultured" /cell_type="osteoblast" /clone="GS3955" /clone_lib="lambda ZAPII" /tissue_lib="3 end-directed library" /tissue_type="cancellous bone" CDS 1226..2257 /codon_start=1 /product="GS3955" /db_xref="PID:d1013940" /db_xref="PID:g1507672" /translation="MNIHRSTPITIARYGRSRNKTQDFEELSSIRSAEPSQSFSPNLG SPSPPETPNLSHCVSCIGKYLLLEPLEGDHVFRAVHLHSGEELVCKVFDISCYQESLA PCFCLSAHSNINQITEIILGETKAYVFFERSYGDMHSFVRTCKKLREEEAARLFYQIA SAVAHCHDGGLVLRDLKLRKFIFKDEERTRVKLESLEDAYILRGDDDSLSDKHGCPAY VSPEILNTSGSYSGKAADVWSLGVMLYTMLVGRYPFHDIEPSSLFSKIRRGQFNIPET LSPKAKCLIRSILRREPSERLTSQEILDHPWFSTDFSVSNSAYGAKEVSDQLVPDVNM EENLDPFFN" BASE COUNT 1002 a 1043 c 1060 g 1116 t ORIGIN 1 gtcggccgtc ccctttaatt tttaaataca cggtcccctc ttttctctgg ggggggcaag 61 caagaaatca aagaaggagg agacaagccg tcaattttct ccaaaacaaa ccccaccggg 121 caatttggtc tcggggtagg gggagacggg gtgattgcaa attattccag gacgagatcc 181 agttctccag cgggaaaggg gcaaaggaac gccgcgcgtt ggaagggcca gggtacgcag 241 ctccccttgc agcgcccgca ggacccccgc aagctcgtgc cggcgaaatc ggagaccgcc 301 gatctgtcct cgttctctcc tgcacgtctg gctgcattcg gaggaagacc tggggcgcga 361 gcgagcggcg acagcatgag cctgtgctga cctccgcgcg gcgggccgag cccagggctt 421 tgtcgcggta cctgcgccca gcccgcgccg caactctgtg cccagctttt gcaatctttt 481 gttgcagcgc tgaccgcacc aagttaaatg ctcccttgca atttttcttt tttttgtttg 541 tttgtttaat ttttggagag ctcgcgatct tggaaaagcc tcagacgcca tctacagtta 601 aaacgtaggt aactgccctc tcccgcaccc cccccttaca cgccccccac cctttccacc 661 aaaaaaaggg ggtgcagcgc ggattctggc tgccgtgcgt cgccagccgg tagacccgtg 721 cttgtttcct ttctcttttt gtttggcttc taacgcgttg ggactgagtc gccgccgtga 781 gctccccgaa gactgcacaa actaccgcgg gctcctccgc cccgtctgcg attcggaagc 841 cggcctgggg gtcgcgtcgg gagccctgcg ctgcagctcc gcaccttagc agcccgggta 901 ctcatccaga tccacgccgg ggacacacac acagagtaac taaaagtgcg gcgattctgc 961 acatcgccga ctgctttggg gtaacaaaaa gacccgagtt gcctgccgac cgaggacccc 1021 cgggagccgg gctcggagca gacgaggtat ccggcggcgc ccatttgggg gcttctaact 1081 ctttctccac gcagcccctc ttctgtcccc tcccctctcg ctccctttta aaatcagtgg 1141 caccgaggcg cctgcagccg cactcgccag cgactcatct ctccagcggg tttttttttg 1201 tttgtcgtgt gcgatcctca cactcatgaa catacacagg tctaccccca tcacaatagc 1261 gagatatggg agatcgcgga acaaaaccca ggatttcgaa gagttgtcgt ctataaggtc 1321 cgcggagccc agccagagtt tcagcccgaa cctcggctcc ccgagcccgc ccgagactcc 1381 gaacttgtcg cattgcgttt cttgtatcgg gaaatactta ttgttggaac ctctggaggg 1441 agaccacgtt tttcgtgccg tgcatctgca cagcggagag gagctggtgt gcaaggtgtt 1501 tgatatcagc tgctaccagg aatccctggc accgtgcttt tgcctgtctg ctcatagtaa 1561 catcaaccaa atcactgaaa ttatcctggg tgagaccaaa gcctatgtgt tctttgagcg 1621 aagctatggg gacatgcatt ccttcgtccg cacctgcaag aagctgagag aggaggaggc 1681 agccagactg ttctaccaga ttgcctcggc agtggcccac tgccatgacg gggggctggt 1741 gctgcgggac ctcaagctgc ggaaattcat ctttaaggac gaagagagga ctcgggtcaa 1801 gctggaaagc ctggaagacg cctacattct gcggggagat gatgattccc tctccgacaa 1861 gcatggctgc ccggcttacg taagcccaga gatcttgaac accagtggca gctactcggg 1921 caaagcagcc gacgtgtgga gcctgggggt gatgctgtac accatgttgg tggggcggta 1981 ccctttccat gacattgaac ccagctccct cttcagcaag atccggcgtg gccagttcaa 2041 cattccagag actctgtcgc ccaaggccaa gtgcctcatc cgaagcattc tgcgtcggga 2101 gccctcagag cggctgacct cgcaggaaat tctggaccat ccttggtttt ctacagattt 2161 tagcgtctcg aattcagcat atggtgctaa ggaagtgtct gaccagctgg tgccggacgt 2221 caacatggaa gagaacttgg accctttctt taactgagct catgccccac ggagacttag 2281 caggttccag gagtgagcga gggcagcgga aaggagttct tccgggggac acgaattgcc 2341 tggctgagta gcaagaaaga cacactctta agtttcttgg ttcagagcag gaaaaccttc 2401 aaggagctga ctgaccacgt agcatggggg caagaggcgt gggatgggga ttggggtgag 2461 atggatggga gcccgctgga gcttgtcttc cctaacatag cctgggagac caccccttgc 2521 cacttgggcc acttccgcct accccacttt tcattttgtt ccaaaatagt tgcagatcct 2581 gacagaatca aaactctctg cctcaaacac acatcctggc atcgcactgt tagcatttaa 2641 cttcttgtta ggattcaggg aaggaacagt tggccaagaa ttttttttct tttaaacaag 2701 ccaaccacct agctggtaat taatgaggtt cacttaaaaa aaaaattcgg tgcacacaga 2761 ctgacatgaa acctgggtgc tacagtaaaa gaaaacaaaa gtccagtttg tgtctcttaa 2821 tcgctcactt caactcattt cttctaaata aactatttaa tatcctggtc aggaaatgac 2881 atgttaatgc tttgctccct gaagggggaa aaaatctgtc ctttaacaag ctattctgtt 2941 ttgtgtcaat tgggtccgtg gcaaggaagc tattaggaag tcaaacggtc caggatgcat 3001 tacctgctaa tccttaggtt taaaggggga aagaaaaggg aagaagaaag gaaaagagaa 3061 atccaactcc tttttcatgt tttgcttttg aacaatgagg gtttgtgtga caggcattcc 3121 tctttgctga gatgatagca atggcctgag attttagcaa gctcctggag tctgatgctt 3181 ttgcagtact ctgatcgcaa ctaaacattt gtctttgttt tattagaaac tagtgaaaca 3241 aagcaggttg tcccacatgt ataaaataca gggcagctat ttagttttct ttacagagaa 3301 tgatcctttt aaggcttgta aggccctctg gtttggacaa aaaccctcag tagagacaag 3361 cgggaaggat aattagctga aagctatgat gatataaata aaaacagctc tctatcccaa 3421 tacgcacctt tgtattttca agaactcttc tatttattaa ggaaaatgtc acattgtgat 3481 gtattaagcc agtacttcaa ttacgggttg acttgggatg acatattaca tgctgtagtt 3541 aacatttata attctttttc cttgtttgag tatttctgtc tctgaaataa ccttttactt 3601 ggcttttcta gatagcttta tttgatttcg agtggcaaaa tgttttttat tacggctttt 3661 ctattgctgt atgatacaga actcttttgg cataaatatt tgtgttccca gtacctcact 3721 tgttcggatt tgactgcctg tatatgtttt gtgaaatggt cctgtttttg ggtaggtgac 3781 acgtggactc tagtatgtaa atgttacttg aatctgtgct tcataatagt gtgtggcatg 3841 tatgtgcaga ctcttggatg ctttatgcct gcgcaccagg agccctgtcc tcacgttccc 3901 aggagggcgg cttcaccctt cgtaaccagg agacaaggcg gccatggatt tgcccttgat 3961 tctattttgc taatggaaga tagaaaggag agaaggtttt tttttttttt taacattctg 4021 aagatggtgc tgtgtcaaga aggacctttt ttttcccctc tcccctattt tttaagtacc 4081 ttggaggagg agaggttggt gacatgcatg gtggggatct atggcctctg gtgctttgtc 4141 ctgtatttgg tttaatgttt ttgtcctaat ctcttcaatc aataaaattg tgcgtattta 4201 actaaaaaaa aaaaaaaaaa a // LOCUS D87120 2475 bp mRNA PRI 30-AUG-1996 DEFINITION Human cancellous bone osteoblast mRNA for GS3786, complete cds. ACCESSION D87120 NID g1507673 KEYWORDS GS3786. SOURCE Homo sapiens cancellous bone tissue_lib:3 end-directed library osteoblast cell_line:primary-cultured cDNA to mRNA, clone_lib:lambda ZAP clone:GS3786. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2475) AUTHORS Ohno,I., Hashimoto,J., Takaoka,K., Ochi,T., Okubo,K. and Matsubara,K. TITLE The cloning of a cDNA for novel genes expressed in human osteoblast JOURNAL Unpublished (1996) REFERENCE 2 (bases 1 to 2475) AUTHORS Ohno,I. TITLE Direct Submission JOURNAL Submitted (14-AUG-1996) to the DDBJ/EMBL/GenBank databases. Ikko Ohno, Institute for Molecular and Cellular Biology, Osaka University, Molecular Genetics Matsubara Laboratory; 1-3 Yamada-oka, Suita, Osaka 565, Japan (E-mail:ikko@imcb.osaka-u.ac.jp, Tel:81-6-879-7992, Fax:81-6-877-1922) FEATURES Location/Qualifiers source 1..2475 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="primary-cultured" /cell_type="osteoblast" /clone="GS3786" /clone_lib="lambda ZAP" /tissue_lib="3 end-directed library" /tissue_type="cancellous bone" CDS 168..851 /codon_start=1 /product="GS3786" /db_xref="PID:d1013941" /db_xref="PID:g1507674" /translation="MRVAGAAKLVVAVAVFLLTFYVISQVFEIKMDASLGNLFARSAL DTAARSTKPPRYKCGISKACPEKHFAFKMASGAANVVGPKICLEDNVLMSGVKNNVGR GINVALANGKTGEVLDTKYFDMWGGDVAPFIEFLKAIQDGTIVLMGTYDDGATKLNDE ARRLIADLGSTSITNLGFRDNWVFCGGKGIKTKSPFEQHIKNNKDTNKYEGWPEVVEM EGCIPQKQD" BASE COUNT 811 a 380 c 527 g 757 t ORIGIN 1 ccggaggagt ccgagaggaa gcggaggcgc gagctggagg cggcggctcc cgtcggcctc 61 cggcaggact gagcgctggg aggccggaag gcgggcgcgc acggcggaga ggcgggcggg 121 aggccggagc atattaatga aaagtgccat aaactgaaaa accaaacatg agggtagcag 181 gtgctgcaaa gttggtggta gctgtggcag tgtttttact gacattttat gttatttctc 241 aagtatttga aataaaaatg gatgcaagtt taggaaatct atttgcaaga tcagcattgg 301 acacagctgc acgttctaca aagcctccca gatataagtg tgggatctca aaagcttgcc 361 ctgagaagca ttttgctttt aaaatggcaa gtggagcagc caacgtggtg ggacccaaaa 421 tctgcctgga agataatgtt ttaatgagtg gtgttaagaa taatgttgga agagggatca 481 atgttgcctt ggcaaatgga aaaacaggag aagtattaga cactaaatat tttgacatgt 541 ggggaggaga tgtggcacca tttattgagt ttctgaaggc catacaagat ggaacaatag 601 ttttaatggg aacatacgat gatggagcaa ccaaactcaa tgatgaggca cggcggctca 661 ttgctgattt ggggagcaca tctattacta atcttggttt tagagacaac tgggtcttct 721 gtggtgggaa gggcattaag acaaaaagcc cttttgaaca gcacataaag aacaataagg 781 atacaaacaa atatgaagga tggcctgaag ttgtagaaat ggaaggatgc atcccccaga 841 agcaagacta atggaaatgt ggagagaatt gaagaaagcg cactttcact cttaatggga 901 gagctataaa tggcagagct atgtgtaaat attttaagag catgcagcca tcttggtgtg 961 tgcatgagta ttgtctcttt tgatatcagg attatttatt gctaacgtaa atagatagca 1021 ttgtaaataa tcatcacaat gatcaaatca ctgaaccatg tctccgcaca tttccctaaa 1081 agtacaatgt ttagactgct atggtaatac atattttaaa ttctaaaagc atacacaatg 1141 tgtaactgaa tggtttgtga aaaatatatt gatatatata ctagttgcta tgaaaatatc 1201 atggaataat agggatttta gggtggatac tttattttct tttatgtttc tatatgttgc 1261 gttgtgatga cattatcttt taaattaaaa agagatttgg ctagttgtgt gtgtaatgtt 1321 actttacagt ccgactctcc tgatgtacct cttttcatga tctttttctt tccttcccaa 1381 gaaactgagg aatgtttaat atgaaaacat acatcggata tgtgaaaagc acaacaaaat 1441 tcttaatgta cacagtaaaa aagtaaatat ataaatgtag atggcattta ggaccacagc 1501 ttgctggatt tgtgttagct atgggaataa cttgattttg tataagctat ttagagtgag 1561 gctggaggtg gcagcttcac agaactggag aaccaggcca agtcccctcc ccaacctaat 1621 taggtcattc aggacagcta agtcagtata tttagagcaa tactagcata cgtttttctt 1681 aattgttatc agcattgacc aagtggtttg gaaggaggca tgctttaata tcacaataat 1741 tttgatttgt aaaccaagaa attaatcctg tgtttatcta acttcataat agcaattatt 1801 gcccgaagct atagtggcat atttacaaaa gttcttatta ctgggcggac tgataacatt 1861 taaaaaataa ttgtgtttga ccccaaatga ctttataccc aattctacat aaaaatatag 1921 aagatctatc tttttttgtt accttcagat gttcactaaa taactcagtt tttaagcaga 1981 agttttcagg gcattaaata tatgttgtgt atgaagtatc tcaaactgga acataaattt 2041 agtgatcaaa ctgccattca cagtgtaagg cagcacttaa atttcgaacc taaagtttag 2101 atgcattgta taaaaaaacc taaaagcagt atctgttatt tagctgtaaa ccaagttgga 2161 agctattcgg ataatttctt aaatattgat gaactttgga gtactgtttc ttccttcaaa 2221 ctgaatgtaa ttaattcatg aataaatgca ccttatatgt ttaaacaatc tttgtatact 2281 tttgggattt ttggtgctta tatgctaaat cacattcagc atgtgtattt tgacatttaa 2341 aatacttccc tcaattctgt aaattaaaag aatagttatt ttacagttcc agggattgtg 2401 aaataaatgt tgcagttttt taaaataatg aaaataaata ctcttggttt tgctttgtga 2461 aaaaaaaaaa aaaaa // LOCUS D87127 2491 bp mRNA PRI 15-APR-1997 DEFINITION Human mRNA for translocation protein-1, complete cds. ACCESSION D87127 NID g1817551 KEYWORDS HTP-1; translocation protein-1. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2491) AUTHORS Daimon,M. TITLE Direct Submission JOURNAL Submitted (14-AUG-1996) to the DDBJ/EMBL/GenBank databases. Makoto Daimon, Yamagata University School of Medicine, The Third Department of Internal Medicine; 2-2-2 Iida-Nishi, Yamagata, Yamagata 990-23, Japan (E-mail:mdaimon@med.id.yamagata-u.ac.jp, Tel:+81-236-28-5316, Fax:+81-236-28-5318) REFERENCE 2 (bases 1 to 2491) AUTHORS Daimon,M., Susa,S., Suzuki,K., Kato,T., Hayashi,T., Yamatani,K. and Sasaki,H. TITLE Identification of a human cDNA homologue to the Drosophila translocational protein-1 (Dtrp-1) JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Daimon,M., Susa,S., Suzuki,K., Kato,T., Yamatani,K. and Sasaki,H. TITLE Identification of a human cDNA homologue to the Drosophila translocation protein 1 (Dtrp1) JOURNAL Biochem. Biophys. Res. Commun. 230 (1), 100-104 (1997) MEDLINE 97148580 FEATURES Location/Qualifiers source 1..2491 /organism="Homo sapiens" /db_xref="taxon:9606" gene 13..1212 /gene="HTP-1" CDS 13..1212 /gene="HTP-1" /codon_start=1 /product="translocation protein-1" /db_xref="PID:d1013944" /db_xref="PID:g1817552" /translation="MAERRRHKKRIQEVGEPSKEEKAVAKYLRFNCPTKSTNMMGHRV DYFIASKAVDCLLDSKWAKAKKGEEALFTTRESVVDYCNRLLKKQFFHRALKVMKMKY DKDIKKEKDKGKAESGKEEDKKSKKENIKDEKTKKEKEKKKDGEKEESKKEETPGTPK KKETKKKFKLEPHDDQVFLDGNEVYVWIYDPVHFKTFVMGLILVIAVIAATLFPLWPA EMRVGVYYLSVGAGCFVASILLLAVARCILFLIIWLITGGRHHFWFLPNLTADVGFID SFRPLYTHEYKGPKADLKKDEKSETKKQQKSDSEEKSDSEKKEDEEGKVGPGNHGTEG SGGERHSDTDSDRREDDRSQHSSGNGNDFEMITKEELEQQTDGDCEEDEEEENDGETP KSSHEKS" BASE COUNT 869 a 387 c 509 g 726 t ORIGIN 1 ggagcggcca acatggcgga acgcaggaga cacaagaagc ggatccagga agttggtgaa 61 ccatctaaag aagagaaggc tgtggccaag tatcttcgat tcaactgtcc aacaaagtcc 121 accaatatga tgggtcaccg ggttgattat tttattgctt caaaagcagt ggactgtctt 181 ttggattcaa agtgggcaaa ggccaagaaa ggagaggaag ctttatttac aaccagggag 241 tctgtggttg actactgcaa caggctttta aagaagcagt tttttcaccg agccctaaaa 301 gtaatgaaaa tgaaatatga taaagacata aagaaagaaa aagataaagg aaaagctgaa 361 agtggaaaag aagaagataa aaagagcaag aaagaaaata taaaggatga gaagacaaaa 421 aaagaaaaag agaaaaaaaa agatggtgaa aaggaagaat ccaaaaagga ggaaactcca 481 ggaactccta aaaagaagga aactaagaaa aaattcaaac ttgagccaca tgatgatcag 541 gtttttctgg atggaaatga ggtgtatgta tggatctatg acccagttca ctttaaaaca 601 tttgtcatgg gattaattct tgtgattgca gtaatagcgg ccaccctctt ccccctttgg 661 ccagcagaaa tgagagtagg tgtttattac ctcagtgtgg gtgcaggctg ttttgtagcc 721 agtattcttc tccttgctgt tgctcgatgc attctatttc tcatcatttg gctcataact 781 ggaggaaggc accacttttg gttcttgcca aatctgactg ctgatgtggg cttcattgac 841 tccttcaggc ctctgtacac acatgaatac aaaggaccaa aagcagactt aaagaaagat 901 gagaagtctg aaaccaaaaa gcaacagaag tccgacagtg aggaaaagtc agacagtgag 961 aaaaaggaag atgaggaggg gaaagtagga ccaggaaatc atggaacaga aggctcgggg 1021 ggagaacggc attcagacac ggacagtgac aggagggaag atgatcgatc ccagcacagt 1081 agtggaaatg gaaatgattt tgaaatgata acaaaagagg aactggaaca gcaaacagat 1141 ggggattgtg aagaggatga ggaagaggaa aatgatggag aaacacctaa atcttcacat 1201 gaaaaatcat aatctgacta attttgggac tgaatgaata agtacaagag gttggatttt 1261 ctatgttggc tgattaccat attgaacaca tggcatttgt agcattcttt aaatctatct 1321 actgagatgt atttgacatt caagcagtta tattcggtcc ttcattttat agaatattgg 1381 cactattatt ggtacagttt aaagccatta atatgtttta tccatttgat aattttacag 1441 taagtaggtc tcattcattt tgacagttat caaagatgta ctttccacag ttaaatttac 1501 attaatggca atttttgata gttttatggc tttttactgt tagactaatc aaaaataact 1561 ttaaaaggaa caaagaaact ccaacatttc acattatgca tagttatgta gccatttcac 1621 agtttcttta agatgtgtaa actcattgtc cttgatagtt tttatttttc attataaaat 1681 tataccagga gatttctttt aagattctga gttagcagag ttcaaaacta ttttgtggaa 1741 acaagccaac tagtaacaat gcagcaacac ttctggttta gctaaattat ttttccaatg 1801 taggaaatcc acactgattt gtacgtctga ctgagagaaa gatggtcgtc tccagcagag 1861 aaagtgaaca gcatttgttg gaaggtgatg gctctccctc ctccctcccc atttcattgg 1921 cgtaacgtaa agtgtattct gtacataatt tacaaataaa acattttatt ttaattgtta 1981 cttattattt agatatttct caacacttaa attcataaaa ttaagaccat gtaagggtat 2041 gtttttagag aaatggaagt ttgagtaacc cacagaacat ctgtgatctt tctacagcag 2101 cttcagtttt gtgccaacat tccatgtatt ttgaatatga gcaaaaactg atcttaagag 2161 cagacttaaa gtagctttgt acgccttaat gttcattttg atttatttta aatctttaca 2221 ttcagaaatg agatactgta ttatcagacc aggaggcatt gctgtgaaag ataatttcct 2281 attctaaaat atcaaattta aaataaagat aatgaaagaa aacataagag aactattgga 2341 ctctaattgc tttgaactag ttttgcagtt tgactttacg tcttatacat cttagttaca 2401 agtctttgga aactcttgtt tacttatgag cataatcatt ccttggagtt atactaacta 2461 ataatgaata ttaaatgatt tttcttcagt c // LOCUS D87291 1489 bp mRNA PRI 25-FEB-1997 DEFINITION Human mRNA for inward rectifier potassium channel, complete cds. ACCESSION D87291 NID g1772441 KEYWORDS inward rectifier potassium channel. SOURCE Homo sapiens kidney cDNA to mRNA, clone_lib:human kidney Marathon-ready cDNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1489) AUTHORS Ohira,M., Seki,N., Nagase,T., Suzuki,E., Nomura,N., Ohara,O., Hattori,M., Sakaki,Y., Eki,T., Murakami,Y., Saito,T., Ichikawa,H. and Ohki,M. TITLE Gene identification in the 1.6 -Mb of the down syndrome region on chromosome 21 JOURNAL Genome Res. (1997) In press REFERENCE 2 (sites) AUTHORS Ohira,M., Seki,N., Nagase,T., Ichikawa,H., Suzuki,E., Nomura,N. and Ohki,M. TITLE Gene Identification in a 1.6-Mb Region of the Down Syndrome Region on Chromosome 21 JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 1489) AUTHORS Ohira,M. TITLE Direct Submission JOURNAL Submitted (02-AUG-1996) to the DDBJ/EMBL/GenBank databases. Miki Ohira, Kazusa DNA Research Institute, Laboratory of Gene Structure 1; 1532-3 Yanauchino, Kisarazu, Chiba 292, Japan (E-mail:oohira@kazusa.or.jp, Tel:+81-438-52-3932, Fax:+81-438-52-3931) FEATURES Location/Qualifiers source 1..1489 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="21" /clone_lib="human kidney Marathon-ready cDNA" /tissue_type="kidney" CDS 355..1482 /note="human homolog of rat inward rectifier potassium channel 10" /codon_start=1 /product="inward rectifier potassium channel" /db_xref="PID:d1014016" /db_xref="PID:g1772442" /translation="MDAIHIGMSSTPLVKHTAGAGLKANRPRVMSKSGHSNVRIDKVD GIYLLYLQDLWTTVIDMKWRYKLTLFAATFVMTWFLFGVIYYAIAFIHGDLEPDEPIS NHTPCIMKVDSLTGAFLFSLESQTTIGYGVRSITEECPHAIFLLVAQLVITTLIEIFI TGTFLAKIARPKKRAETIKFSHCAVITKQNGKLCLVIQVANMRKSLLIQCQLSGKLLQ THVTKEGERILLNQATVKFHVDSSSEGPFLILPMTFYHVLDETSPLRDLTPQNLKEKE FELVVLLNATVESTSAVCQSRTSYIPEEIYWGFEFVPVVSLSKNGKYVADFSQFEQIR KSPDCTFYCADSEKQQLEEKYRQEDQRERELRTLLLQQSNV" BASE COUNT 372 a 401 c 346 g 370 t ORIGIN 1 ctcctctgcc caatgtctcc caatctcttt cctttctctc ttcagttcct ccaggtaatt 61 cttactcaaa cttgtaccaa cttgtttttg actgacagtg aacagtgaga gagttttctt 121 cattttgagg aaccctaaac acctatcttt cccaaggcaa cctgtctgga ctgagcattt 181 ctctgacttg acataacttc ccatccagcc aggagtctgc actcttcagt ctttgcaggc 241 agtagcagaa tcccatggta gccaggtggg tgaaggggag cgaggacgtt ctacctgcct 301 tgaagaagac acctgacctg cggagtgagt gaccagtgtt tccagagcct ggcaatggat 361 gccattcaca tcggcatgtc cagcaccccc ctggtgaagc acactgctgg ggctgggctc 421 aaggccaaca gaccccgcgt catgtccaag agtgggcaca gcaacgtgag aattgacaaa 481 gtggatggca tatacctact ctacctgcaa gacctgtgga ccacagttat cgacatgaag 541 tggagataca aactcaccct gttcgctgcc acttttgtga tgacctggtt cctttttgga 601 gtcatctact atgccatcgc gtttattcat ggggacttag aacccgatga gcccatttca 661 aatcataccc cctgcatcat gaaagtggac tctctcactg gggcgtttct cttttccctg 721 gaatcccaga caaccattgg ctatggagtc cgttccatca cagaggaatg tcctcatgcc 781 atcttcctgt tggttgctca gttggtcatc acgaccttga ttgagatctt catcaccgga 841 accttcctgg ccaaaatcgc cagacccaaa aagcgggctg agaccatcaa gttcagccac 901 tgtgcagtca tcaccaagca gaatgggaag ctgtgcttgg tgattcaggt agccaatatg 961 aggaagagcc tcttgattca gtgccagctc tctggcaagc tcctgcagac ccacgtcacc 1021 aaggaggggg agcggattct cctcaaccaa gccactgtca aattccacgt ggactcctcc 1081 tctgagggcc ccttcctcat tctgcccatg acattctacc atgtgctgga tgagacgagc 1141 cccctgagag acctcacacc ccaaaaccta aaggagaagg agtttgagct tgtggtcctc 1201 ctcaatgcca ctgtggaatc caccagcgct gtctgccaga gccgaacatc ttatatccca 1261 gaggaaatct actggggttt tgagtttgtg cctgtggtat ctctctccaa aaatggaaaa 1321 tatgtggctg atttcagtca gtttgaacag attcggaaaa gcccagattg cacattttac 1381 tgtgcagatt ctgagaaaca gcaactcgag gagaagtaca ggcaggagga tcagagggaa 1441 agagaactga ggacactttt attacaacag agcaatgtct gatcacagg // LOCUS D87292 1137 bp mRNA PRI 10-MAR-1997 DEFINITION Human mRNA for rhodanese, complete cds. ACCESSION D87292 NID g1877030 KEYWORDS rhodanese. SOURCE Homo sapiens fetal liver cDNA to mRNA, clone_lib:lambda gt11 fetal liver clone:Rho1.1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Aita,N., Ishii,K., Akamatsu,Y., Ogasawara,Y. and Tanabe,S. TITLE Cloning and expression of human liver rhodanese cDNA JOURNAL Biochem. Biophys. Res. Commun. 231 (1), 56-60 (1997) MEDLINE 97223398 REFERENCE 2 (bases 1 to 1137) AUTHORS Aita,N., Ishii,K., Akamatsu,Y., Ogasawara,Y. and Tanabe,S. TITLE Cloning and expression of cloned human liver rhodanese JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 1137) AUTHORS Tanabe,S. TITLE Direct Submission JOURNAL Submitted (22-AUG-1996) to the DDBJ/EMBL/GenBank databases. Shinzou Tanabe, Meiji College of Pharmacy, Department of Hygienic Chemistry; Yatocho 1-22-1, Tanashi, Tokyo 188, Japan (Tel:0424-21-0479, Fax:0424-21-1489) FEATURES Location/Qualifiers source 1..1137 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="Rho1.1" /clone_lib="lambda gt11 fetal liver" /dev_stage="fetal" /tissue_type="liver" CDS 49..942 /EC_number="2.8.1.1" /note="thiosulfate:cyanide sulfurtransferase" /codon_start=1 /product="rhodanese" /db_xref="PID:d1014017" /db_xref="PID:g1877031" /translation="MVHQVLYRALVSTKWLAESIRTGKLGPGLRVLDASWYSPGTREA RKEYLERHVPGASFFDIEECRDTASPYEMMLPSEAGFAEYVGRLGISNHTHVVVYDGE HLGSFYAPRVWWMFRVFGHRTVSVLNGGFRNWLKEGHPVTSEPSRPEPAVFKATLDRS LLKTYEQVLENLESKRFQLVDSRSQGRFLGTEPEPDAVGLDSGHIRGAVNMPFMDFLT EDGFEKGPEELRALFQTKKVDLSQPLIATCRKGVTACHVALAAYLCGKPDVAVYDGSW SEWFRRAPPESRVSQGKSEKA" polyA_signal 1085..1090 BASE COUNT 228 a 332 c 343 g 234 t ORIGIN 1 gaattccggg cgcggcgtcc ggggcgagtg acacgcagag ctgaagccat ggttcatcag 61 gtgctctacc gggcgctggt ctccaccaag tggctggcgg agtccatcag gactggcaag 121 ctggggcccg gcctgcgggt gctggacgcg tcctggtact caccaggcac ccgagaggcc 181 cgcaaggagt acctcgagcg ccacgtaccc ggcgcctctt tctttgacat agaagagtgc 241 cgggacacgg cgtcgcccta cgagatgatg ctgcccagcg aggctggctt cgccgagtat 301 gtgggccgcc tgggcatcag caaccacacg cacgtggtgg tgtatgatgg tgaacacctg 361 ggcagcttct atgctccccg ggtctggtgg atgttccgtg tgtttggcca ccgcaccgta 421 tcagtgctca atggtggctt ccggaactgg ctgaaggagg gccacccggt gacatccgag 481 ccctcacgcc cagaaccggc cgtcttcaaa gccacactgg accgctccct gctcaagacc 541 tacgagcagg tgctggagaa ccttgaatct aagaggttcc agctggtgga ttcaaggtct 601 caagggcggt tcctgggcac cgagccggag ccggatgcag taggactgga ctcgggccat 661 atccgtggtg ccgtcaacat gcctttcatg gacttcctga ctgaggatgg cttcgagaag 721 ggcccagaag agctccgtgc tctgttccag accaagaagg tggatctctc gcagcctctc 781 attgccacgt gccgcaaggg agtcaccgcc tgccacgtgg ccttggctgc ctacctctgc 841 ggcaagcctg atgtggccgt gtacgatggc tcctggtccg agtggtttcg ccgggccccc 901 ccagagagcc gtgtgtccca gggaaagtct gagaaggcct gagccgtgac ctcttctgct 961 tactgtaact gcggccggtt tagtgacccc atgacttaca gccggttctt acctcttagg 1021 tgaaggagat gacatgtttt ttagaattgc tgtgcaaggc tcaccctctc tctgtcaaca 1081 ctggaataaa ctttgccttt tctgaaaaaa aaaaaaaaaa aaaaaaaacc ggaattc // LOCUS D87328 6465 bp mRNA PRI 25-FEB-1997 DEFINITION Human mRNA for HCS, complete cds. ACCESSION D87328 NID g1813423 KEYWORDS HCS. SOURCE Homo sapiens immature myeloblastoid cell cell_line:KG-1 cDNA to mRNA, clone:kg-24. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ohira,M., Seki,N., Nagase,T., Suzuki,E., Nomura,N., Ohara,O., Hattori,M., Sakaki,Y., Eki,T., Murakami,Y., Saito,T., Ichikawa,H. and Ohki,M. TITLE Gene identification in the 1.6 -Mb of the down syndrome region on chromosome 21 JOURNAL Genome Res. (1997) In press REFERENCE 2 (sites) AUTHORS Ohira,M., Seki,N., Nagase,T., Suzuki,E., Nomura,N., Ohara,O., Saito,T., Ichikawa,H. and Ohki,M. TITLE Gene Identification in the 1.6 Mb of the Down Syndrome Region on Chromosome 21 JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 6465) AUTHORS Ohira,M. TITLE Direct Submission JOURNAL Submitted (23-AUG-1996) to the DDBJ/EMBL/GenBank databases. Miki Ohira, Kazusa DNA Research Institute, Laboratory of Gene Structure 1; 1532-3 Yanauchino, Kisarazu, Chiba 292, Japan (E-mail:oohira@kazusa.or.jp, Tel:+81-438-52-3932, Fax:+81-438-52-3931) FEATURES Location/Qualifiers source 1..6465 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="immature myeloblastoid cell" /chromosome="21" /clone="kg-24" /map="21q22.2" CDS 1232..3412 /codon_start=1 /product="HCS" /db_xref="PID:d1014022" /db_xref="PID:g1813424" /translation="MEDRLHMDNGLVPQKIVSVHLQDSTLKEVKDQVSNKQAQILEPK PEPSLEIKPEQDGMEHVGRDDPKALGEEPKQRRGSASGSEPAGDSDRGGGPVEHYHLH LSSCHECLELENSTIESVKFASAENIPDLPYDYSSSLESVADETSPEREGRRVNLTGK APNILLYVGSDSQEALGRFHEVRSVLADCVDIDSYILYHLLEDSALRDPWTDNCLLLV IATRESIPEDLYQKFMAYLSQGGKVLGLSSSFTFGGFQVTSKGALHKTVQNLVFSKAD QSEVKLSVLSSGCRYQEGPVRLSPGRLQGHLENEDKDRMIVHVPFGTRGGEAVLCQVH LELPPSSNIVQTPEDFNLLKSSNFRRYEVLREILTTLGLSCDMKQVPALTPLYLLSAA EEIRDPLMQWLGKHVDSEGEIKSGQLSLRFVSSYVSEVEITPSCIPVVTNMEAFSSEH FNLEIYRQNLQTKQLGKVILFAEVTPTTMRLLDGLMFQTPQEMGLIVIAARQTEGKGR GGNVWLSPVGCALSTLLISIPLRSQLGQRIPFVQHLMSVAVVEAVRSIPKYQDINLRV KWPNDIYYSDLMKIGGVLVNSTLMGETFYILIGCGFNVTNSNPTICINDLITEYNKQH KAELKPLRADYLIARVVTVLEKLIKEFQDKGPNSVLPLYYRYWVHSGQQVHLGSAEGP KVSIVGLDDSGFLQVHQEGGEVVTVHPDGNSFDMLRNLILPKRR" BASE COUNT 1666 a 1482 c 1562 g 1755 t ORIGIN 1 ggccgcgtct tctgcgtcac gaacagccag tccattgaag acttgaacaa gtgggcccta 61 tttcttgtgt ctccttttat acttgaagca gaacacatag catttgtgac ggagagcatt 121 tgggtacaaa gtgagaattt acagagatca tcctcttcag aaacaacgga gtgtcgttct 181 gtcgtccagg ctggagtgca gtggtgtggt ctcggctcac cacaacctct gcctactggg 241 ttccagtgat tctcctgcct cagcatcccg agtagctggg actacaggtg cacgctacca 301 tgcccggcta atttttgtat ttttagtaga ggcgggggtt tcaccatgcc atccaggctg 361 gtttcgaact cctgacctcg tgatccacct gtctcggcct cccaaagtgc tgggattaca 421 ggtgtgagcc accgcgcctg gccctgtatt ttcaaattct ttacctctgc gtttccagcc 481 ttgctccatc caatttgtta ctagagcttt ttaacaattt cacgttgaat taagatgatt 541 tggaagagga gaaacaagag tcagaaaaat tagcagcatc cgtgatctga ctctgggtac 601 cagtgggttg ttgagatttc tgttccccac tcactgcaga gctgctgcct gccaatgccc 661 cctgttgggc agggccacct gtgctagttt aacctggggt ctttcactcc cggactgctt 721 cttggatgtg agttttaagt tgagcagaga agatatgcct tgcctgctgc catacattgt 781 aacatcaagt caagagcttt agttacttaa cagatttctg ctggctgcta ttataggagg 841 aaactccttt gtttagggac agccagggct tccttagctt tgatctttgg gagtgagcct 901 tttgcttgac ttcccttctc ttttccatgc ctttgtttcc atctcgaagt gcaaatttaa 961 agtgtcctgt ttgcctttat ccactggaag gcaaacaccc tggacgacgt atcagtgaca 1021 gcgcggttct gctttgatcg gagcacccag cgctgcttga aggcgtttgc atggtccatg 1081 gctcatggac cacgctccta gaaaacggaa atgcacttag attgtcaagt ggtcagactg 1141 ttgtttgcca ttagcttgca gacctgggga tccttatcgg ctaattgctg aagcaagtgt 1201 ggacaacttc agcaagctgg gggtggcgtt catggaagat agactccaca tggataatgg 1261 actggtaccc caaaagattg tgtcggtgca cttgcaggac tccactctga aggaagttaa 1321 ggatcaggtc tcaaacaagc aagcccagat cctagagccg aagcctgaac cttctcttga 1381 gattaagcct gagcaggacg gtatggagca tgttggcaga gatgacccaa aggctcttgg 1441 tgaagaaccc aaacaaagga gaggcagtgc ctctgggagt gagcctgctg gggacagtga 1501 caggggaggg ggccccgttg agcattatca cctccatctg tctagttgcc acgagtgtct 1561 ggaacttgag aacagcacca ttgagtcagt caagtttgcg tctgccgaga acattccaga 1621 ccttccctac gattatagca gcagtttgga gagtgttgct gatgagacct cccccgaaag 1681 agaagggagg agagtcaacc tcacgggaaa ggcacccaac atcctcctct atgtgggctc 1741 cgactcccag gaagccctcg gccggttcca cgaggtccgg tctgtgctgg ccgactgtgt 1801 ggacattgac agttatattc tctaccacct gctggaggac agtgctctca gagacccgtg 1861 gacggacaac tgtctgctgt tggtcattgc taccagggag tccattcccg aagacctgta 1921 ccagaagttc atggcctatc tttctcaggg agggaaggtg ttgggcctgt cttcatcctt 1981 cacctttggt ggctttcagg tgacaagcaa gggtgcactg cacaagacag tccagaactt 2041 ggttttctcc aaggctgacc agagcgaggt gaagctcagc gtcttgagca gtggctgcag 2101 gtaccaggaa ggccccgtcc ggctcagccc cggcaggctc cagggccacc tggagaatga 2161 ggacaaggac aggatgattg tgcatgtgcc ttttggaact cgcgggggag aagctgttct 2221 ttgccaggtg cacttagaac tacctcccag ctccaacata gtgcaaactc cagaagattt 2281 taacttgctc aagtcaagca attttagaag atacgaagtc cttagagaga ttctgacaac 2341 ccttggcctc agctgtgaca tgaaacaagt tcctgcctta actcctcttt acttgctgtc 2401 agctgcggag gaaatcaggg atcctcttat gcagtggctt gggaaacatg tggactccga 2461 gggagaaata aaatccggcc agctctctct tagatttgtt tcatcctacg tgtctgaagt 2521 agaaataacc ccatcttgta tacctgtggt gaccaacatg gaggccttct catcagaaca 2581 tttcaactta gagatctatc gccaaaatct gcagaccaag cagttgggga aagtaatttt 2641 gtttgccgaa gtgaccccca caacgatgcg tctcctggat gggctgatgt ttcagacacc 2701 gcaggaaatg ggcttaatag tgatcgcggc ccggcagacc gagggcaaag gacggggagg 2761 gaatgtgtgg ctgagccctg tgggatgtgc tctttctact ctgctcatct ccattccact 2821 gagatcccag ctgggacaga ggatcccgtt tgtccagcat ctgatgtccg tggctgtcgt 2881 ggaagcagtg aggtccattc ccaagtatca ggatatcaac ttacgagtga agtggcccaa 2941 cgatatttat tacagtgacc tcatgaagat cggcggagtt ctggttaact caacactcat 3001 gggagaaaca ttttatatac ttattggctg tggatttaat gtgactaaca gtaaccctac 3061 catctgcatc aacgacctca tcacagaata caataaacaa cacaaggcag aactgaagcc 3121 cttaagagcc gattatctca tcgccagagt cgtgactgtg ctggagaaac tgatcaaaga 3181 gtttcaggac aaagggccca acagcgtcct tcccctttat taccgatact gggtccacag 3241 tggtcagcaa gtccatctgg gcagcgcaga gggaccaaag gtgtccatcg ttggcctgga 3301 cgattctggc ttcctccagg ttcaccagga gggcggcgag gttgtgactg tgcacccgga 3361 cggcaactcc ttcgacatgc tgagaaacct catcctcccc aaacggcggt aatgccgggc 3421 gtccccgaga cgcggctgcc tgtccgtgcc catgcatctg gaaatctaat ttagagttgt 3481 aggtgaattt tcttttcctc caattcattt gttaagtctt tgttcttttt ctgtgtttct 3541 gtttgttttt aggtttgttt tgttgtcgtt ttctttggtg tttgaagagg ctctgggata 3601 gatggttaag aagtagaaaa tttagtttag ggaaagccct cccacaggtg ggaaattgct 3661 ctcccctctg tggcttggac ttacgtttat tgtcaagggg agtttttaca tggaaatgac 3721 aatgggaaaa ttcagatatt ttcttagtag tgcagacctt tacccctagt ctatgaaaaa 3781 acaaaccaaa atatgctctt gcgcccaggc cagtggtgag ttagaggtat gctatcactg 3841 tttgtaagca tctggggagg tactgaactg taagaacatg cttggacact tagtcattgt 3901 tctgtgtttt tattaatgaa gaaaagggaa gacagacttc caagagttac tgtccacccg 3961 gtggtgtggc cccatagcga agtctaaatg cctgtagaga tagagctagc tggtgtggtt 4021 gcagtgacct tgtagaggaa atcagttcat tactttgaca tcattcagtg agctctcctt 4081 tcctaaggaa gtttaaatgt ccttagttag ggactgactt tcttaagtaa gtttaaattt 4141 actacatatt gtgaagagac aggatcaagt tcagaatcct taaatgtctg attaggcatc 4201 acttggatga ggaggtgggc gatttggctc tgacagctgg agatgaaggc acactcatac 4261 cacatacaag ggaggatttg gagcttttaa gccagtttca gatttactct gaaatgtgga 4321 gcattcctgc aagactgtgc agctcacgga atatagaaga catggcattt tactcagaag 4381 tcataagttt ttgcccccct catttacctc gtattaccaa gaaagaaaat gttatcgata 4441 ctaaacacca tcagttcaga gggaggatgt gtgtgtgtgc ccgcatatgt gtgtgcgtgc 4501 gtgtgtgcgc acatagcttt aaaagaagac attcaaaatt tgatgtgcta caagcctcat 4561 gaaagaacaa aagaaatgaa gccttttgat atgcattcgc tattcccaga tgtacgccat 4621 gccttttcca tgtccctcct atctctgttg aacttatgaa tcatactcat tacttttcag 4681 ctttttaaaa ggccaatttt tgtccagttt tctctcttcc agtcccagct gaaattagtg 4741 gaaagaaagt ttgatggagc tttcagcttt gaacaaaatc ccttcattgt aaactagcac 4801 catctttatc caggtcttac ccagtcaggc taattccaga aacttgtggt ttttagtata 4861 gtctgtctac ctttagccag gcacaggaca gccctatgaa aaaataccca atatatattt 4921 tttggaaatg aaacattaaa agaacttaaa aagtaatttt tggaaatgag gcttcaatta 4981 gaattatttt tctcaaaaaa caaacaaaca aaaaacacaa aaaaaaccac tcttctccaa 5041 atgcccaagc cttctttcaa aattagttag aaacttaagt aaaatacaag tccacaccat 5101 ccccaaatta caaaatggac ttacccttga gagggcatct gcagaatatc atcagggaca 5161 aagatctcga ggctaacgat gtaggtttca tttctcagac tttgtaatat aaggcaagcc 5221 ctctctcaga gctgccatca tcactttttg aatttctttg ggggttattt aatgaaaaac 5281 atgctatgtt ttgttttaag ctgaagtcct attctggaca ctctgctttg ggaaaaaatg 5341 ttatcattta atttcctttc tgcaaattaa aactaatgaa gtgtggcctt gtcaaaggct 5401 atggagatgt tccgggcata ctgctgtgct ctgtgctttc cagcaggcgc tcctccctca 5461 cgcaggagac tcagttgtcc tgagagagat gaagcagcct tgaagcagat gctgcgtttt 5521 ccataaacct gattttgcct cacatgaacc aaagactctc aaaactccgc ttctatagaa 5581 ttagctgaat aaaggcattt tactgatagc tgttcgtgtt agcgaaacct gtctacctgc 5641 tatagcacac tctccgattt gggccattta tgcaccccgc aacctgggat ctcaaggagc 5701 tttaaagtct taatgggaac ttggcatttt cctgatgatc tttaaaatgt ggtcactaaa 5761 ctcaggattg gcgtgtgctt ttagaacact ggagtagccc ttgttttaga ggctgtgcat 5821 tgagtatcga ccgtattttg taaaaggcaa gatatcctcc cttccaggct ggtaacgggt 5881 ttcaagggga ctcttgagga agtgccccct aaaatagaac acagcaataa ctgggcttcc 5941 tgtccccacc cccaccccag cagtgctctc tggcactggg aactctgcta gggagtggtg 6001 gaagtaggaa ggatttgtgt gcaaaggaaa atcgtggttg agtttcactg cagcaggctg 6061 acgttgcctg atgtgagagc aagtggccga ctggggtgcg ggtgcacagg tcgggggagc 6121 acaggccaca gagcgcagct ctgggggtcc cccaaggcac agcatataca gcatggtcgc 6181 cccttgccct ggagtctggg aacaaagaga ggagccagcc tccccgcact gcttcagatg 6241 gaaaagggag gcagggtggg cttccgttct ccagatctgt ttgctcttaa caggcagaac 6301 atgggagaat ccttattcct ggttaatcac tatgcatatt tgaaataaaa gaaagcgtaa 6361 gcctctgcaa ttttaacttc tcaaaggatg tctctgaaaa gaatcacttt aaaccaatgc 6421 ctataaaaag caagtctacc aaaataaact aagactttct atgtg // LOCUS D87343 3252 bp mRNA PRI 06-NOV-1997 DEFINITION Homo sapiens mRNA for DCRA, complete cds. ACCESSION D87343 NID g2589159 KEYWORDS DCRA. SOURCE Homo sapiens (sub_species:domesticus) fetus, 19-23 weeks pool fetal brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Nakamura,A., Hattori,M. and Sakaki,Y. TITLE Isolation of a novel human gene from the Down syndrome critical region of chromosome 21q22.2 JOURNAL J. Biochem. 122 (4), 872-877 (1997) MEDLINE 98060515 REFERENCE 2 (bases 1 to 3252) AUTHORS Nakamura,A. TITLE Direct Submission JOURNAL Submitted (23-AUG-1996) to the DDBJ/EMBL/GenBank databases. Akiko Nakamura, Institute of Medical Science, University of Tokyo, Human Genome Center; 4-6-1 Shirokanedai, Minato-ku, Tokyo Japan, 108 (E-mail:ana@hgc.ims.u-tokyo.ac.jp, Tel:81-3-5449-5623, Fax:81-3-5449-5445) FEATURES Location/Qualifiers source 1..3252 /organism="Homo sapiens" /sub_species="domesticus" /db_xref="taxon:9606" /chromosome="21" /dev_stage="fetus, 19-23 weeks pool" /map="21q22.2" /tissue_type="fetal brain" CDS 240..1133 /codon_start=1 /product="DCRA" /db_xref="PID:g2589160" /translation="MGTALDIKIKRANKVYHAGEVLSGVVVISSKDSVQHQGVSLTME GTVNLQLSAKSVGVFEAFYNSVKPIQIINSTIEMVKPGKFPSGKTEIPFEFPLHLKGN KVLYETYHGVFVNIQYTLRCDMKRSLLAKDLTKTCEFIVHSAPQKGKFTPSPVDFTIT PETLQNVKERALLPKFLLRGHLNSTNCVITQPLTGELVVESSEAAIRSVELQLVRVET CGCAEGYARDATEIQNIQIADGDVCRGLSVPIYMVFPRLFTCPTLETTNFKVEFEVNI VVLLHPDHLITENFPLKLCRI" BASE COUNT 815 a 786 c 798 g 853 t ORIGIN 1 ggggtgaacc gtgacagcgc ttcggcccgc actaaggccg gctcttgtgc cggaaggagg 61 aaggcgtggg gcattcgccc ctcggagcta gggagtgtgt gcgacgccgc tgcgaggtca 121 cgtgagccac tgccggcaga gagggaaagg ggcggggccc agaacgaagc ggggaggcgc 181 cccttgtttc cctggggtca cgcgcagccg gaagtggcgg ctgctgcgga gaattggaga 241 tggggaccgc cctggacatc aagattaaaa gagcgaataa agtttatcac gccggggaag 301 tgctctctgg cgtggtggtc atatcgagta aggattcagt ccaacaccag ggagtgtctt 361 tgaccatgga aggaactgta aacctccagc tcagtgccaa aagtgtgggt gtgtttgaag 421 ctttttataa ttctgttaag cctatccaga ttatcaacag caccatagaa atggtgaagc 481 cggggaaatt tcccagcggc aaaacagaaa tcccttttga atttcctctg cacttgaagg 541 gtaacaaagt tctgtatgag acgtatcatg gcgtgtttgt caacattcag tatacactgc 601 gctgtgacat gaagcggtct ctgttggcca aggacttgac aaagacctgt gaatttatcg 661 ttcactccgc tcctcagaag gggaagttta ctcccagtcc cgtggacttc acgattacac 721 ctgaaacctt acagaacgtc aaagagagag ctttgcttcc caaatttctc cttcgaggac 781 atctcaactc aacaaactgt gtcatcacgc agccactaac gggagagctg gtggtggaga 841 gctcggaagc cgccatcaga agcgtggagc tgcagctggt gcgcgtggag acgtgcgggt 901 gtgcagaagg ctatgcccgc gacgccacgg agattcagaa cattcagatc gccgacgggg 961 atgtgtgcag gggcctctct gtccccatct acatggtctt ccctaggctg ttcacctgcc 1021 ctacactgga gaccaccaac ttcaaagtgg aatttgaggt taacatcgtg gtgctgcttc 1081 accctgacca cctcatcacg gagaacttcc cgctgaagct ctgcaggata tagcccggag 1141 gagggaagca tagagaacgg gagtggccat ctggaaatcc agctggttat ccaaatccta 1201 aggggagcta cagccagcgg catatacttg tttttgtgat tattctgtat cagaaatgaa 1261 acagaccctc aaattaactt tccttcctca tttcttgagg cttctgcttc caacaggcac 1321 ctctaatcag accttttctt tgaaattcaa caagatttct taatgctatt tgccaagacc 1381 atttcacaga aaacattgac tgtggctctt gccttatctg ttccttttta ggtacagtaa 1441 aacaattgtg acagcagttt gagcttgctg gagagtggca tcatggggac aaaaggaaac 1501 ctctgacttg ctaatggatg tagccaggga ctccccatag caaagggtct gtggccagtt 1561 gacatccagg atggctgcaa gcgcacttga tggtcaggaa gtttgcagat actcgccaag 1621 gcagagcgca aagtgctagc cactggaaat gcatgacttc cctccacccc tactctattc 1681 tgtagttttt tggttttgtt tctgagacgg agtctcagtc tgtcacccag gctggagtga 1741 tctcagctca ctgcaacctc cacctcccag gttcaagcga ctctcctgcc tcagcctccc 1801 gagtagttgg gattacaggt gactgccacc gtgcccggct aatgtttgta tttttagtag 1861 agacggggct tcaccatctt ggccaggctg gtcttgaact cctgacctcg tgacccaccc 1921 gccttggcct cccaaagtgc tgggattaca ggtgtgagcc accacaccca gcctctgtag 1981 ttctttttac aacatttttc attataactt taaatttttt aagcaactgg aaaagtgttc 2041 cttgctctct tggggggatt tggctggtgc cgaagtgttt ctgaagtctc aagaactgcc 2101 ataaaatctc acgctgccat ttccctgaac agatacatac atagagagag acagttttcc 2161 aaactgtgtc acgcaggctg agtgcactgg caggatcaca gctcacggca gcctcaacct 2221 ccctggctca agcgatccct cccctcagcc tcctgagtag ctgagactac aggtgagtgc 2281 caccacactc agctaatttt caaatttttt gtagacaggg tctccctatg ttgcccaggc 2341 tggtcttgaa ctcctagact caagtgatcc tcctgtcttg gcctcccaaa gtgctgagat 2401 tacaggtgtg agccactgtg cccagcagtt tcccagaata tatttaaatg caaagttaca 2461 tgaggggaaa acatgtatgt ttgctcctgt tgttactggg taggttctga acagcagaaa 2521 cccatgtgca gggtgggctg gtgaaggccc ctctccgcaa ggtggtagca ggaaaaggtc 2581 cttgacttga tgaatttggt ctgcctctga gccactggag gaagctgttt tgagccaggg 2641 ttttttggcc taaagccagc atttcctcag tctccctttg tggttcgaag gatatggact 2701 attgcaatac atttcttcct tcaaatcctg ccactgtttt gttggcccac aactaatagg 2761 acctcaaaat aagccatgct gctttgcaca cacactagcc ttcttttgta cttttcattc 2821 tggatgggct tggccaaaac aggctcaggc caaagacctc ccaagctgta tgtacttcca 2881 gtatcctgaa acagtgtttg gtgacataat gccaagggta aacaagcctg atttaggcac 2941 tgctttatcc aggggcttca cccatgaaat taataaaact tatctgagtc acttgaaact 3001 tggttcccag aaaacacatt tctggtttat aatctccttt tatgctcacc tgacattaat 3061 tatctatcct tgatgatgtg tttaaactga gtagcagaaa acagaggcca cactttctgg 3121 gaaattttaa aggaagaaac catttttaat gagatgaaaa tatttaacga atttaaaaag 3181 ctaatgacaa ttttgagaaa aggtttggga tgtatattgc tatgtaattt aataaactga 3241 ttttatggat at // LOCUS D87432 6296 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0245 gene, complete cds. ACCESSION D87432 NID g1665758 KEYWORDS KIAA0245. SOURCE Homo sapiens male bone marrow Myeloblast cell_line:KG-1 cDNA to mRNA, clone_lib:pBluescript II SK clone:HA7016. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6296) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (27-AUG-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes.VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from human cell line KG-1 and brain JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohira,M., Kawarabayasi,Y., Ohara,O., Tanaka,A., Kotani,H., Miyajima,N. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain JOURNAL DNA Res. 3 (5), 321-329 (1996) MEDLINE 97191544 FEATURES Location/Qualifiers source 1..6296 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="Myeloblast" /clone="HA7016" /clone_lib="pBluescript II SK" /sex="male" /tissue_type="bone marrow" gene 262..1809 /gene="KIAA0245" CDS 262..1809 /gene="KIAA0245" /note="Similar to Schistosoma mansoni amino acid permease (L25068)." /codon_start=1 /db_xref="PID:d1014066" /db_xref="PID:g1665759" /translation="MEAREPGRPTPTYHLVPNTSQSQVEEDVSSPPQRSSETMQLKKE ISLLNGVSLVVGNMIGSGIFVSPKGVLVHTASYGMSLIVWAIGGLFSVVGALCYAELG TTITKSGASYAYILEAFGGFIAFIRLWVSLLVVEPTGQAIIAITFANYIIQPSFPSCD PPYLACRLLAAACICLLTFVNCAYVKWGTRVQDTFTYAKVVALIAIIVMGLVKLCQGH SEHFQDAFEGSSWDMGNLSLALYSALFSYSGWDTLNFVTEEIKNPERNLPLAIGISMP IVTLIYILTNVAYYTVLNISDVLSSDAVAVTFADQTFGMFSWTIPIAVALSCFGGLNA SIFASSRLFFVGSREGHLPDLLSMIHIERFTPIPALLFNCTMALIYLIVEDVFQLINY FSFSYWFFVGLSVVGQLYLRWKEPKRPRPLKLSVFFPIVFCICSVFLVIVPLFTDTIN SLIGIGIALSGVPFYFMGVYLPESRRPLFIRNVLAAITRGTQQLCFCVLTELDVAEEK KDERKTD" BASE COUNT 1509 a 1511 c 1440 g 1836 t ORIGIN 1 cactgggaga gtttatgtgg ccgaggcaga caagtggaat taggccttgc tgcaggggac 61 ttcatttcct tctcagtact ggacccattt atgaggaggt ggcttatgaa agtgtgatgt 121 tcgcgtattt cttgacaggc agtggcgtga tcttggctca ctgcaacctc cgactccctg 181 gttcaagcga ttctcctgcc tcagcctcct gagtggggat tacaggccac agcaaacaca 241 ggtgtgcagg aaccgtttgt catggaagcc agggagcctg ggaggcccac acccacctac 301 catcttgtcc ctaacaccag ccagtcccag gtggaagaag atgtcagctc gccacctcaa 361 aggtcctccg aaactatgca gctgaagaag gagatctccc tgctgaatgg ggtcagcctg 421 gtggtgggca acatgatcgg ctcagggatc tttgtctcac ccaagggtgt gctggtacac 481 actgcctcct atgggatgtc actgattgtg tgggccattg gtgggctctt ctctgttgtg 541 ggtgcccttt gttatgcaga gctggggacc accatcacca agtcgggagc cagctacgct 601 tatattctag aggcctttgg gggcttcatt gccttcatcc gcctgtgggt ctcactgcta 661 gttgttgagc ccaccggtca ggccatcatc gccatcacct ttgccaacta catcatccag 721 ccgtccttcc ccagctgtga tcccccatac ctggcctgcc gtctcctggc tgctgcttgc 781 atatgtctgc tgacatttgt gaactgtgcc tatgtcaagt ggggcacacg tgtgcaggac 841 acgttcactt acgccaaggt cgtagcgctc attgccatca ttgtcatggg ccttgttaaa 901 ctgtgccagg gacactctga gcactttcag gacgcctttg agggttcctc ctgggacatg 961 ggaaacctct ctcttgccct ctactctgcc ctcttctctt actcaggttg ggacaccctt 1021 aattttgtaa cagaagaaat caaaaaccca gaaagaaatt tgcccttggc cattgggatt 1081 tctatgccaa ttgtgacgct catctacatc ctgaccaatg tggcctatta cacagtgctg 1141 aacatttcag atgtccttag cagtgatgct gtggctgtga catttgctga ccagacgttt 1201 ggcatgttca gctggaccat ccccattgct gttgccctgt cctgctttgg gggcctcaat 1261 gcatccatct ttgcttcatc aaggttgttc ttcgtgggct cccgggaggg ccacctaccg 1321 gaccttctgt ccatgatcca cattgagcgt tttacaccta tccctgcttt actgttcaat 1381 tgcaccatgg cactcatcta cctcatcgtg gaggatgttt tccagcttat caactacttc 1441 agcttcagct actggttctt cgtgggcctg tctgttgttg gacagctcta cctccgctgg 1501 aaggagccca agcggccccg gcctctcaag ctgagcgtgt ttttccccat cgtgttctgc 1561 atatgctccg tgtttctggt gatagtgccc ctcttcactg acaccattaa ttccctcatt 1621 ggcatcggga ttgccctttc tggagtccct ttctacttca tgggtgttta cctgccagag 1681 tcccggaggc cattgtttat tcggaatgtc ctggctgcta tcaccagagg cacccagcag 1741 ctttgctttt gtgtcctgac tgagcttgat gtagccgaag aaaaaaagga tgagaggaaa 1801 actgactaga ggtcagaggt ggctttctga ggcctggaag gcaggccaac cagcaaaatc 1861 ctgataacaa gactctgtgg gcccaactct cctgaattaa aggagccttt tgacccaatc 1921 atatagtggg gctcagggcc agtgctcact cttattggta agctatagga gactcaggat 1981 ctgggccaac ctcaaggtgg gggcttcaga gggtgggggg aagattgggg aacgggggga 2041 atggtcattt agttttactc ctgataggta gatgcagctc ttacagatat ttacttggta 2101 aagtgcagtg gggaagaggg aatgctaggt tgatagggct ggtggcttct gaatttggta 2161 tttgaactag gagtccctat agaggggctg ctttatggga agtttttctc tgaccaggta 2221 caacacctga ctttaaaggc ctgaaatgct accatttctt cctctggctc aaaattcttc 2281 cctggggaga gagttatatt cccttattta ttgatattta gtccagaaca ccagttctaa 2341 cgaagcatgc gtgtctcttc atctacagga tgcaataggc tgattgtatt taaaaatcaa 2401 agtacccaaa actgagtccc tttgggctca gaaatgtctg tggtattggg tcagactctg 2461 accacagatt ttatgctgtt tagcacaatt tctattgagt cttacctgca acaatgaacc 2521 ttaaagattt ttttactcac gtacctgtta cactttagca tacagataga tcatagatca 2581 cgttacaagc acttggctca ggtccagcaa ggacagatga acaaattcct gagtcagaag 2641 tctgttaata ttgctgtttt gaaggacaat cctttatttt acttgagacc ttacatcttt 2701 gttctagctg acagtaaatc tctgggtttc tgttacgaac tctaagaggg ctgaaacttc 2761 tgatattcag gtggatcacc tgaattctct cagctgtcaa tggcttggag aacatctcat 2821 gggcccaagt catcaaataa cctgttcctc tctgtaaggg cagtgtgagg gactgctgtg 2881 cagacccaag caatcccaac ctggtgctag gtcatttcac ttttctgaaa acctcacatc 2941 aggctgcatc ctcttctgtc cctggcacca ggctttgttt acacttggag ccaccttggt 3001 gtgggtcacc gggacagtgt actcctctcc tgccagcctc cccttccccg aggtgtggtg 3061 gctgcagtct caggaagagc ttggtacttg tggggacttc tgttttctcc ctgtggagat 3121 cagtgaagac tgggaggaaa gctgcttcaa cctgagtccg gctcttcagc aggctgcaca 3181 agtggaagca actaattctg gtgctcaggc tgggctctcc acccaagtta ggcctgctct 3241 ggcctaatgg atcttactgt atgagcagga cggctgcatt ggattgtaca actgttttgt 3301 gatgccccca gacactgtca tcctaggccg agaagaacct gctagcttga cataccccat 3361 gggcttatcc ttaggttttg gaattggtca acagtgaggc agtctccctt cctgaccatt 3421 cttctccacc cagtcacaga taagggaata accttggcca tatatttgct caataaagat 3481 tgaaggaagc atggtcatag ttgccctggg ttcagagcat aatgcatatg tgaagcatgg 3541 ggtgacattc ctactgtcat gggtttggga tttgtaacgg caaattcctg cccgacgaca 3601 gggtgtctta tgcaaaggct gacttgcctg aacgctaaga acatgacttc tgtctgagct 3661 aagctggcac ccatcccagg gctcctctgg agctaatcct ttaagcaaaa tgtgcttgcc 3721 ttttaaagat ccctgacccc agctttagct ttctccacca gataaccagc taatcccagg 3781 aatttgctgc cccccaccag tggcttctag ggaaagcaag gacctcacat gccaggtgcc 3841 ctagtacttg cttagtgagc catgtcatcc tcctttcatt tttggatggt gacagcattt 3901 ttcccctctg tgctggatac agacttctcc caggatcctc tctttgggag cgaagccaga 3961 ggatccctac agcactcaag cttcatggtg gaattaattt ctgccagctc tttgttgtct 4021 gtctccttaa atccttttcc tggtgtgctt attatccctt ttgcagtgag tacagtttat 4081 taagttgtca gccctttaat attggggaaa cttaatgagt ataaatagca gggagcacat 4141 tgtaacagca cagtgttttg tttttttcac ccggttgctg tatgagaatg gctttcaatc 4201 ctttgtttct atgcctacag acagaaagca agatgtctaa tattagacat acaagttgct 4261 gcctgttata acggtgaatt atacctttgt gcatgcctag gatgtttgtt gttttaatta 4321 gctgcaatat atacggcctg tgtacacaga atttaatcac ttcggcaggt tgaacaactc 4381 catgtagata agagcaagtg taggcaaagg tttagaaaat ggacataaag tcaaagaatg 4441 atggcaggta ggatgaagga gagatactta ggaaatccta aaagaggcgg caagaaggta 4501 cctccctgtg taactcacct tcccccatga cagtaagaga cactcacagg ctatgagggt 4561 acacccctag ctgaatgttc tgtgttgttt ccttagacct gtggtgtccg ctgcaacagc 4621 tactagccac gtgtagctaa ttacattaaa atgaaataaa attaaaagct cagtttctca 4681 gttgcgctaa tcacatttca agtgctcagc agccacccgt gtctactact acacagtgca 4741 gacacagaac atatcatcac tgcagatagt tctactggac aatgttacgc tagaataaac 4801 accaaggcag tcagttaagg cagctatggt ttggaaaggc atacggacag agtctgctta 4861 gaagagatac aagttgttaa taaaattgat cctgttgata gtagtttgtt tttgtggtgg 4921 gtgctgtgaa gagtaaacat tactcagtgg aaagctaagt tcagaaggta ctttgttttt 4981 cctcccttgc cttaagtcct tggtatttat aatcaatgct gaaccttcta tttcactacc 5041 gctccctgtt ttagatattc agatttaaaa ggttttcaaa gaattacttt cttccatgtt 5101 caaagctaga ttttactaaa cacatgtatc acattcatat atattgtttc ttggccccac 5161 tgccaaagga agtcagtcag taatttcaca accgttatca gagtttggaa gcagaaatag 5221 ctgttaacta aaatctccca ctgctcagac tactttctgc cctaatggcc attactatcc 5281 agtctgtatt gctacaaggg acccactggt acccctttta gattctatca aaaggaacag 5341 ggttttccta gaggcaggca gcctggtggt atggcacagc agaagcttac tgctaatgaa 5401 atgggaacct ccccctccct tgtggtttca gcacagaacc tgaatgccag gaaaaattcc 5461 tgggccaaga agctaaagct aaagaaacct tccttttttc aacgtttttt tttctttcaa 5521 actgtagggt cacttttgat tgaggcaaag gggtcctact gtaagtggaa aagactcact 5581 cccctaacat aagttttcac tgtggtggga tggtgccgcc cgatatgctt gatatgcttt 5641 tccttccaca tgttaagcta ggaaacctaa caggatgtca gcagggcagt taactctgga 5701 ctcagagccc tcaagggcat gtggcagaac ctcatggaca tcacaagacc atcagtctga 5761 atccaggtcg tgggggctgt catagccgaa ctccttctgc acatccagag ggtacttgct 5821 ccacatccgc tgtctgctgc tgcctctttc ctcctcactc aggctgttgt agtcagcaga 5881 gcctagaatg acatcccggg agtggattct aaatgtgatt ttcctaggct actgcaggag 5941 ccccttctct tctcagaaag gtctgttttt gttcccgatt gtaatgcaaa atccttgctc 6001 aataaataaa aaagaatata gaattctttt ttttttaaag aaggaatcac tttcctatca 6061 tctaaaccaa gttccttcac actggagtat tttgtcactt ctcccctccg tggagtattt 6121 tgtcacttct cccctccgta taggattttt tgttgttgta agagttgtag tcatattgta 6181 aatatttttg tacctttctc cttttaacgt gttattgaca aacctcccca aaagaatatg 6241 caattgtttg attcatttct ctgttatcag acaccaataa attctttttg ttgggc // LOCUS D87434 5338 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0247 gene, complete cds. ACCESSION D87434 NID g1665762 KEYWORDS KIAA0247. SOURCE Homo sapiens male bone marrow myeloblast cell_line:KG-1 cDNA to mRNA, clone_lib:pBluescript II SK clone:HA7001. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5338) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (27-AUG-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes.VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from human cell line KG-1 and brain JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohira,M., Kawarabayasi,Y., Ohara,O., Tanaka,A., Kotani,H., Miyajima,N. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain JOURNAL DNA Res. 3 (5), 321-329 (1996) MEDLINE 97191544 FEATURES Location/Qualifiers source 1..5338 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /clone="HA7001" /clone_lib="pBluescript II SK" /sex="male" /tissue_type="bone marrow" gene 269..1180 /gene="KIAA0247" CDS 269..1180 /gene="KIAA0247" /codon_start=1 /db_xref="PID:d1014068" /db_xref="PID:g1665763" /translation="MCHGRIAPKSTSVFAVASVGHGVFLPLVILCTLLGDGLASVCPL PPEPENGGYICHPRPCRDPLTAGSVIEYLCAEGYMLKGDYKYLTCKNGEWKPAMEISC RLNEDKDTHTSLGVPTLSIVASTASSVALILLLVVLFVLLQPKLKSFHHSRRDQGVSG DQVSIMVDGVQVALPSYEEAVYGSSGHCVPPADPRVQIVLSEGSGPSGRSVPREQQLP DQGACSSAGGEDEAPGQSGLCEAWGSRASETVMVHQATTSSWVAGSGNRQLAHKETAD SENSDIQSLLSLTSEEYTDDIPLLKEA" BASE COUNT 1131 a 1419 c 1380 g 1408 t ORIGIN 1 ccgaagccgg cgaccgcccc acctcctccc tccctcccgc ccgcttcctc tgcccacagc 61 gccggccaga gcgagctaga caagggcacg cggggcctcg cctagacccg agaagactgc 121 gggcgcgcgc aagcggcggc gtggaagctg tgagcgcccc catcccggag gtctccgccg 181 gctcccgggt gaatcagctc ccggccgact ttaggattct tctggatttt aaattttttc 241 tttttaaaaa aacttggacg gataaaagat gtgccatggc aggatagcac caaagagcac 301 ctcagtgttt gccgtggcct ccgtgggaca tggagtgttc cttccgctag tgatcctttg 361 caccctgctt ggagacggac ttgcttccgt gtgcccccta ccaccggagc cagagaatgg 421 tggctacatc tgccaccccc ggccctgcag agaccccctg acagcaggca gtgtcatcga 481 atacctgtgt gctgaaggct acatgttgaa gggcgattac aaatacctga cgtgtaagaa 541 tggcgagtgg aaaccagcca tggagattag ctgccgtctc aacgaggata aagacaccca 601 cacatcactt ggggtcccca cgctgtctat agtggcttct actgccagct ccgtggcgct 661 cattctcctc ctcgtggtgc tgtttgtgct gctgcagcca aagctgaagt ctttccatca 721 tagcaggcgt gaccaggggg tatctgggga ccaggtctcc atcatggtgg atggagtcca 781 ggttgcacta ccatcatacg aggaggctgt atatggcagt tctggtcact gtgtgccacc 841 tgctgacccc agagtacaga ttgtgctgtc agaagggtct gggcccagtg ggaggagcgt 901 gccaagggag caacagctgc cggaccaagg ggcctgctcc tctgcaggtg gagaagatga 961 ggccccaggc cagtctggac tatgtgaagc ctggggctct cgggcctcag agactgtgat 1021 ggtgcatcag gcaaccacct cttcctgggt ggccggctca gggaaccgcc aactggcaca 1081 caaagaaact gcagattcag agaacagtga catacaaagc cttttatccc tcacgtcaga 1141 ggagtacaca gatgatattc cactgttgaa agaagcatga gggcagcggc cagcctttcc 1201 tctctgcgag gttctctcag cccttcctcc ctctccctgt gggattgagc accctgtact 1261 ctccagccac cttacctgga tacctgagct gccacctgtg tatctgtgta tctctgaggg 1321 ccctataggc ccaccttgct ggaaactcaa ggaagattct cgccatctgc ctgttggaca 1381 gctggaggag ctggctcttt gcctggcccc gccttcccat ctgtcagaga catatttgaa 1441 tgtgctggat caaaccctcc cttttcctaa gcctctgggt cccctccagc cagctctttg 1501 gcggcagccc ccaccagctc ctgtgggcct gagtgctgct gtgtttactt gtgcctttcc 1561 cccaccctgt ccagtttccc tgtcatgcag acttgttgct gtccacaagc cttagtggct 1621 gcactgctgc cccctgccac acagggggcc gggcctgggt ctgtcctgtt tcctttgagg 1681 gttgccccta ctgccctttg caggaacaga tccaggtgtg agagctcttg agtcaagagt 1741 ggcagaagtg gctctaattg gggtgagagt gtagtccctg ggcttgccct gggttgaccc 1801 tggtggcata tttccttggc cgaggatgga agatttggag aatcatgtcc atgctggccc 1861 aggacccagc catctggccc aaaggcacaa gctcctggcc ctgttgagtt gagagtttcc 1921 aagaagcatc cagaagatcc caagggagag aaggaaaatg gctgataatg attgtcttcc 1981 taatatgcaa gttctcactt cctacttcca gcatcggcct tcctggcctt gtcttttttt 2041 tgtttccctg gagtataatg ggaagttgca tgctgcctcc tgggttttat cccagatagc 2101 tctggctttc ttgctgccca caggggcctg gggcaggaag gagacttgct gagatgccat 2161 ggagtgccca tctggtcact ggcagtctgg gcaggttgcc cctttctggg tttgtggtga 2221 cggaggggag gccgagaggc acagaccaag tccccgggtg gctgcaggca gctccagccc 2281 ggtcctgagg atcctcctca ccatggtcac gtgccttagt aactgtgccc aggaagtggc 2341 ctgctgcttg ctgtgctgct gcttttccta cttctgccct tccctgccac ccctcgcatg 2401 tcacagctga caagcaattc cttgtcttcc ctggccccct gggggaaggg ctgagaaaca 2461 gtccatgtgc accccaacct taatggcctg aggtgggcag aggggtgtgg agcagcctgg 2521 agtacagggc cctgggggag gagcccactg atgaggggcg ctctcccata gccatgtgtt 2581 gaatgctaac taggctgggg tggacgaact ctgccaactg ctgtcatctt agaagataga 2641 tgcagcagta aggaatgttt gttttgcttt tttctgaaat tttctgaagc actgtggctg 2701 ggaaacttcg aagcggaccc tgtgctgcat gtctgctcct cccctgagcc tgtctgcttg 2761 ggggtggtaa aaataaaaat cccagtttat tttcagtacc ttacctaaca gggttggctc 2821 caggcgtggg tggcctagaa gatgagggga gtggtcttct cccagccttt taccctcttg 2881 cctcctgcct ccgcgcttac acacgcactt taccacccgg tcattccctg gcctcttgct 2941 gccacttgta gtcttccttc cttcctctca gggtaagggc agtgcctgct gtgcctgttg 3001 gccactccca cacttcccct cccccaggag ccctcatctg ctgtgctgag tccaggaaag 3061 catagttagg tagggagctg gttggagaag gtgctagaac tagaaggcag atgagactag 3121 catgggccca cctggagggc tgtccctaat ggccccagtc gccttacctc acccacagca 3181 gtgcccttgt cttcctccaa aacagaaagc agtgacaaaa gggggagggg tggtaatctg 3241 aagtctcact gctgagcctt cagcttttat ttttcactgt ttcaaaaccc gcattctatt 3301 ctagaatggt ttttaaaatg gaagatctta cctttttcta tcttgttact ctggggtttt 3361 gtccccctaa gagattgcac tttttgtttg gggtttattc agctgcatag atgaccagct 3421 tgatccctgg tgaaatgaaa agccttcctt ctcctgaagc ctctttccgc cctgccctcc 3481 actaacaaca ctgaggagca caagcccagg cttgcccacc tggtaggaaa ggaagaaatt 3541 agaacaatgg gagccttggc tcccctctcg tctcctcccc tccttcttgt cactggcttt 3601 gatgaggccc acttcccaga ggctcctggg cctgtgagtg caggagctca ttctcccctc 3661 actgctgaag tctgtgacag cttcttcctc cagttatgtc tttcttccaa agcaatttct 3721 taaccatcag ccatgtgctg ctatttctag ggcttctggg ctttgtccct tactgagaga 3781 ttagggactc cacagctgcc ttgaggtagg gcctggctga gagacaaggg tagcagcagg 3841 tggcaggctg ttaaaagaca ggctgcctga ggagcctgga gcaggtggaa acaggtggaa 3901 gaaaccggcc acagccctgc tttaccgggc tcacctctag ggcattccag caagaggctg 3961 atgcaggaga atggccagca ccaaaggaca tttaaaagag tttttgggtt tttttgtttg 4021 tttgttgttg gtgtttgttt tttttttttt ttttttggca cacttgagct gactcagtgc 4081 aggtttaata tcctggtgac ttgcagtcac attctaatga ctttcaaggg ccagaatatg 4141 gtgaaaatca cttaaaatat ccgtcccttc catgccttag tttagcaggt aggctctatc 4201 ttttgccatt tctgtatttt atgtgctgtg ttcccgtttc actgggtatg aactgtgaaa 4261 tggactgaat cctggccact ttatgagttt gtttggtttt ataaggcatt tcaatgtaca 4321 ttctataaat acaagcactc catttgcaaa cagatcttaa gctaatattt tctttcccat 4381 tcatcttgcc ctccccctcc tcccaccagc tttaaagttc agtggagaag ccagatggca 4441 attcagacaa aggtatactc ttcctgcttc atgggtggtg gcacgggaat agatagccct 4501 tagccctttc cctcccagtc ccagctgagc cctcagacca cttgcttccc acataacaat 4561 gtcgcctcca tttccgagga acatccttgc gtagagaatg aaatatgctg caatcatttc 4621 tgcatcctta ctcctcaccc ccaaagaaaa aaaaaaggcc tagcagggaa gcagcatgca 4681 ggcttcacag cttaatgcca aggacagcga gtgaggctgg gagcttctct tgggcctgct 4741 gggtctgtca gctctcggaa tagggacagt ccttactggt gccccaaggt gggacttgga 4801 gaatattttg cttggcatat gtttggtctg aatggtgtag ttgctggttc cctagagagg 4861 aaaaggtggc aggcccagct ttgctgggaa atggctctta atttccagtt gaaaccctag 4921 tagaattgtg aatgaaaacc tcaaggttga gcccctctgc caagcagcag agctagtaga 4981 aggggatgca ggggcaaagc actcagttgc caagcaagga ggagagatgt acgtgggctg 5041 tgtggcagtc cccacaccct gccctggctt cttcaggtta tcgcaccact atggaatcct 5101 ttgcagaatg gtactcatat aatggtttaa aacaacacat tcataattga ctctgtgcag 5161 gatgtcactc aatcagtttg ggtttgcttt attttatttt atatatatat tttttggtat 5221 cctgtacatt gcagtgggtg tgaagatagt attttaatat ttgtacaaag tttaatttaa 5281 ttttaattgt tctatgtata taactgcatt tctaaataat taaaaaaaag ttcttatg // LOCUS D87436 6219 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0249 gene, complete cds. ACCESSION D87436 NID g1665766 KEYWORDS KIAA0249. SOURCE Homo sapiens male bone marrow myeloblast cell_line:KG-1 cDNA to mRNA, clone_lib:pBluescript II SK clone:HA7006. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6219) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (27-AUG-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes.VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from human cell line KG-1 and brain JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohira,M., Kawarabayasi,Y., Ohara,O., Tanaka,A., Kotani,H., Miyajima,N. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain JOURNAL DNA Res. 3 (5), 321-329 (1996) MEDLINE 97191544 FEATURES Location/Qualifiers source 1..6219 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /clone="HA7006" /clone_lib="pBluescript II SK" /sex="male" /tissue_type="bone marrow" gene 240..2930 /gene="KIAA0249" CDS 240..2930 /gene="KIAA0249" /note="Similar to Human KIAA0188 protein" /codon_start=1 /db_xref="PID:d1014070" /db_xref="PID:g1665767" /translation="MNYVGQLAGQVIVTVKELYKGINQATLSGCIDVIVVQQQDGSYQ CSPFHVRFGKLGVLRSKEKVIDIEINGSAVDLHMKLGDNGEAFFVEETEEEYEKLPAY LATSPIPTEDQFFKDIDTPLVKSGGDETPSQSSDISHVLETETIFTPSSVKKKKRRRK KYKQDSKKEEQAASAAAEDTCDVGVSSDDDKGAQAARGSSNASLKEEECKEPLLFHSG DHYPLSDGDWSPLETTYPQTACPKSDSELEVKPAESLLRSESHMEWTWGGFPESTKVS KRERSDHHPRTATITPSENTHFRVIPSEDNLISEVEKDASMEDTVCTIVKPKPRALGT QMSDPTSVAELLEPPLESTQISSMLDADHLPNAALAEAPSESKPAAKVDSPSKKKGVH KRSQHQGPDDIYLDDLKGLEPEVAALYFPKSESEPGSRQWPESDTLSGSQSPQSVGSA AADSGTECLSDSAMDLPDVTLSLCGGLSENGEISKEKFMEHIITYHEFAENPGLIDNP NLVIRIYNRYYNWALAAPMILSLQVFQKSLPKATVESWVKDKMPKKSGRWWFWRKRES MTKQLPESKEGKSEAPPASDLPSSSKEPAGARPAENDSSSDEGSQELEESITVDPIPT EPLSHGSTTSYKKSLRLSSDQIAKLKLHDGPNDVVFSITTQYQGTCRCAGTIYLWNWN DKIIISDIDGTITKSDALGQILPQLGKDWTHQGIAKLYHSINENGYKFLYCSARAIGM ADMTRGYLHWVNDKGTILPRGPLMLSPSSLFSAFHREVIEKKPEKFKIECLNDIKNLF APSKQPFYAAFGNRPNDVYAYTQVGVPDCRIFTVNPKGELIQERTKGNKSSYHRLSEL VEHVFPLLSKEQNSAFPCPEFSSFCYWRDPIPEVDLDDLS" BASE COUNT 1610 a 1479 c 1563 g 1567 t ORIGIN 1 gagaagaagt ggcaggtgat gctgaagcgg gggagaagcg gcagagccgg ccacacagtg 61 caggggatgg agacaggtgc tgggctggtc ctcctgcagc atcctcagtt gttggagggc 121 agtcatcctc aggccgtacc cagccagaga agaaaaagaa cagtgtgaag ccacgtgtga 181 tagccgtcca acatcggctc ttccctccaa ttacattgta gttgattgtg tctcaaacca 241 tgaattatgt gggacagctg gctgggcagg tgattgtcac tgtgaaggaa ctctacaagg 301 gcattaacca ggccaccctc tctgggtgca ttgatgtcat cgtggtacag cagcaggatg 361 gcagctatca gtgttcacct tttcacgttc ggtttggaaa gctgggagtc ctgagatcca 421 aagagaaagt gattgatata gaaatcaacg gcagtgcagt ggatcttcac atgaagttgg 481 gtgataacgg agaagctttc tttgttgagg agactgaaga agaatatgaa aagcttcctg 541 cttaccttgc cacctcacca attcctactg aagatcagtt ctttaaagat attgacaccc 601 ctttggtgaa atcgggtgga gatgaaacac catctcagag ttcagacatc tcacacgtct 661 tggaaacaga gacaattttt actccaagtt ctgtgaaaaa gaaaaaacga aggagaaaga 721 aatacaaaca ggacagtaag aaggaagagc aggccgcatc tgctgctgca gaagacacat 781 gtgatgtagg cgtgagctcc gatgatgaca agggggccca ggcagcacga ggatcttcaa 841 atgcttcctt gaaagaagaa gaatgtaaag agcctttgct cttccattct ggggatcatt 901 accccttatc tgatggagat tggtcccctt tagagaccac ctatccccag acagcgtgtc 961 ctaagagtga ttcagagctg gaggtgaaac ctgcggagag cctgctcaga tcagagtctc 1021 acatggagtg gacgtggggc ggattcccag agtccaccaa ggtcagcaaa agagaacgat 1081 ctgaccatca tcctaggaca gctacaatta caccatcaga aaatactcat tttcgggtaa 1141 ttcccagtga ggacaacctc atcagtgaag ttgagaagga tgcttccatg gaagacactg 1201 tctgtaccat agtgaagccc aaacccagag ccctgggtac acagatgagc gacccaacat 1261 ctgtggcaga gcttctcgaa cctcctcttg agagtactca gatttcatct atgttagatg 1321 ctgaccacct tcccaacgca gccttagcgg aggcgccctc agaatccaaa ccggcagcta 1381 aagtagactc gccgtcaaag aagaaaggtg ttcacaaaag aagccaacac cagggacctg 1441 atgatattta ccttgatgac ttaaagggtc tagaacctga agttgcagct ctttatttcc 1501 ctaaaagtga atcggagccc ggttccaggc agtggcccga gtctgacaca ctctctggct 1561 cccagtcccc acagtccgtg ggaagcgcag ctgcagatag cggcaccgag tgcctctcag 1621 attctgccat ggacttgcct gacgttaccc tctccctttg cgggggcctc agtgaaaatg 1681 gagaaatttc aaaagaaaaa ttcatggagc atatcattac ttatcacgaa tttgcagaaa 1741 accctggact tatagacaat cctaaccttg taataaggat atataatcgt tactataact 1801 gggctttggc agctcccatg atccttagct tgcaagtatt ccagaagagc ttgcctaagg 1861 ccacagttga gtcctgggtg aaagacaaga tgccaaagaa atctggtcgc tggtggtttt 1921 ggcgaaagag agaaagcatg accaaacagc tgccagaatc caaggaggga aaatctgagg 1981 caccgccagc cagtgacctg ccatccagct ccaaggagcc ggccggtgcc aggccggccg 2041 agaatgactc ctcgagtgac gagggatcac aggagctcga agaatccatc acagtggacc 2101 ccatccccac agagcccctg agccacggca gcacaacttc atataagaag tctctccgcc 2161 tctcctcaga ccagatcgca aaactgaagc tccacgatgg cccaaatgat gttgtgttta 2221 gtattacaac ccagtatcaa ggcacctgtc gctgtgcagg gaccatttac ctgtggaact 2281 ggaatgacaa gatcatcatt tctgatattg atgggacaat aaccaagtcg gatgctttgg 2341 gacagattct cccacagctg ggcaaagact ggacccacca gggtatagca aagctctacc 2401 attccatcaa tgagaatggc tacaagtttc tgtactgctc ggctcgtgcc atcggcatgg 2461 ccgacatgac ccgtggctac ctgcactggg tcaatgacaa gggcacaatc ttgccccggg 2521 gccccctgat gctgtccccc agcagcttgt tctccgcctt ccacagagaa gtgatagaaa 2581 agaaaccaga gaagttcaaa attgagtgtc taaatgatat caagaatctg tttgccccgt 2641 ctaagcagcc cttctatgct gcctttggaa accgtccaaa tgatgtctat gcctacacac 2701 aagttggagt tccagactgt agaatattca ccgtgaaccc caagggtgaa ttaatacaag 2761 aaagaaccaa aggaaacaag tcatcgtatc acaggctgag tgagctcgtg gagcatgtgt 2821 tcccccttct cagtaaggag cagaattccg cttttccctg cccggagttc agctccttct 2881 gctactggcg agacccgatc cctgaagtgg acctggatga cctgtcttga ggtggcacct 2941 cagtgggtgg gcagggcttg gtccccctcc ccacagcaag ggaaggcagc tggctcttct 3001 gctgacctca gataccagcc ttccccagcg gggacgggtg cttctggagc tggtcccgcc 3061 atcctccttt gccttcccag gccagctgct caggctcggc aggtctgcag ctcagctcct 3121 ggaaggagaa gggaggaact gggcctgggg ctggaggcct gggatccctc ctttgtgggt 3181 cgcacacatg tttcctgctg tgagctgggg cctccttcca ttgcatcatt ttaaaggaag 3241 aaaaaagcag ctaaaaaaga gtggaccaaa acactgcaca cagtgaagtg ttccagtttc 3301 cactgggcag ttgaggtggc ttctgtaacc agggctgtct tcagatgtca gggtccctga 3361 actgctgcgg gcccagtcag tgatgctggc tgaagctgcc tgtgcacgtt tcttctctgg 3421 tcgcctcatt tcctgctaca ctgaaggggt cagctgctcc agtgggccaa gttgggcagg 3481 acccccgccc ctgcagggcc catgcaccag agccactgag cccagtccca taaacctggc 3541 cctctttggg gaaagatccc cacagagcat cctcctctca tctgtgccaa ctccacgagc 3601 ccttaatttc ttagtcctca ccagaagaac aggtctcaca agtatatatt tgatgtctgt 3661 aataaaagtg ggaaggtggg tcttaaaaca gaccaaaccc cgccccgccc ccaacaactc 3721 tgcttttagg gaggcctccg aaatgcagat aggcggttga gtggggtcct gggaagagcg 3781 ctgaatccct ctgcttgctg cctggtgtgg gcctttggaa agcatcttgc cttgggacag 3841 gatttctaaa attctgtgat tcagatttgt cagggaagca cagtgaagct tgcttaaagg 3901 cactggccag cagtgtgtga ctttggcttt tgggatcaca ccctgtaatc gggcccgtgg 3961 aagcagcgtc aaagaggggt cttggagctc ctatggagca gactgccccc cgagcagtgt 4021 ccccagccta gccctgtgag accccatggg gacacgggtg cctatgtatt ttcactaaaa 4081 tatacatggt agctccattt actgatgcgg ttgtaatgag ctcacatcgt gtctgaagag 4141 atggcaccag ggaaaggtgt gccataagct gctccagagc ttttggtatg ctgagtgttg 4201 acagagctgc actcttaaca tcaagagaac tgtcaggagc ccagaaccaa ccccaggtct 4261 tggtctccat tggcgagaac acaggacgtg gtggttcctg agcagagagg gatctgcaga 4321 tacaggcttg gcgctcgggg tggtctcgtg gccaactctt catgcccctg ccgttgtagt 4381 ggaacctcta catgttttag tttgcttcac ctaaaataat gctgatctag agatagagaa 4441 ataggggtgg ttatttttcc agattggaga gttgaaagtc cctgactgat ttcagccatt 4501 ttcctagtgc ttgtcggatg cagagacaat gttgaaatcc cctaaacaca gttctcagtg 4561 gcaaaaccta ggaaggctca tgttcccaga gaagggacca catgagcctt ctcccatgca 4621 aagcttcccc cagcttaaat agttgataag gactaattgt ttaatgagtt tatttatcta 4681 cagtaggtta gggatcctgg gttctgttta tatgaagttc ttcccagttt gtgaattcta 4741 gtacagcagc catgcagcca ccttatttca tagatgccat ctgtgtgtcc tcttgactac 4801 cttctattta gaggaagaat gagagctttg tgtgtttaac tgagcttata gtaggacttc 4861 tttgcatatg tatggtactg aaaaatctta atatacatct ttaatccttt ttaggttgtc 4921 ctttaaagag tttttgacta gtttcttttt cttgacagct cttctctttg gacacatggg 4981 ccttcttaga gggttcagtc taggacccgg ctctcctggc cctgtgttga gggcagctgg 5041 tccctctgtc cctgtgtctg ctagcactag actttgttgc tgcagattga tccagtgggt 5101 acataggcta attaatgtga gtctttcctt gtttaaagga gtccctcttg ctgaaagtag 5161 atgattacta ttgctgtagt gttaggaaag tattaagttt gtgctgaaaa tccattgcca 5221 tttggtacaa atgacatttg ttctttctgt gaaagagatg ccctcgagtg tgtttgtaca 5281 caaaccctta ggatggtgag ttgaagcatc accctcgcgc tatcttcagt gacgggtgac 5341 ggctcaggga gatggcaggc agattgggct ctaagtcatt attctctcag ttactccatt 5401 ggtgaaatgg ccctttccct gtttgcagtt cagtctaagt ctcgtatttg ctttgctgtc 5461 tgtgtgctga agctcgtccc gtgtgagttg ctgtctgccc cttgtcaggc tgtgaggtgc 5521 tcgtgtagac ctggagcatg caggctgcct ccgtttttgg gtactgtgtt gtgttttgct 5581 ctgtctaaaa acatctgcat agttttcaac tggaaaaaga aaaaacttaa aaatgggatg 5641 tcctaaaatg aaagctgctc aaagtcacag aacaaccgag ggacaaagga gattggatga 5701 ctgggaagcg ctggcccgga acagcccctg caactgtggg gcctgcacac agcccttcca 5761 cagttggcac tgcaggtgca ggccaaccct ttaaagaata aacaaggaag tcagctcttt 5821 cactttttac aagttggcaa aaacagactt ccggggaatt tcgatgtttt cccgtgttgt 5881 agagcttcca gggtttaata aaactggtta aaaattgagt ctttccctga agtaagtgct 5941 ctttccagat gaaaactact cttttggttt tgtttgaaag taagaaaggg aggggaaact 6001 ttgctctttt aataattatg ttcagcctat gatgaagtat ttgattatta gacagcaatg 6061 tcactaataa gttttaagtt gtccaaagtt aattgtaaac atcatcagta cagtactctt 6121 agttacagta aagcaattgt tgcaagatga atggctaata ttttggtgca gtgtttgatg 6181 ttcaaaacaa aatgttacaa caataaacga acataacat // LOCUS D87437 5082 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0250 gene, complete cds. ACCESSION D87437 NID g1665768 KEYWORDS KIAA0250. SOURCE Homo sapiens male bone marrow myeloblast cell_line:KG-1 cDNA to mRNA, clone_lib:pBluescript II SK clone:HA2794. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5082) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (27-AUG-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes.VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from human cell line KG-1 and brain JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohira,M., Kawarabayasi,Y., Ohara,O., Tanaka,A., Kotani,H., Miyajima,N. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain JOURNAL DNA Res. 3 (5), 321-329 (1996) MEDLINE 97191544 FEATURES Location/Qualifiers source 1..5082 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /clone="HA2794" /clone_lib="pBluescript II SK" /sex="male" /tissue_type="bone marrow" gene 425..2833 /gene="KIAA0250" CDS 425..2833 /gene="KIAA0250" /codon_start=1 /db_xref="PID:d1014071" /db_xref="PID:g1665769" /translation="MSFLGILCKCPLQNESQEESYNAYPLPAVKVSMDWLRLRPRVFQ EAVVDERQYIWPWLISLLNSFHPHEEDLSSISATPLPEEFELQGFLALRPSFRNLDFS KGHQGITGDKEGQQRRIRQQRLISIGKWIADNQPRLIQCENEVGKLLFITEIPELILE DPSEAKENLILQETSVIESLAADGSPGLKSVLSTSRNLSNNCDTGEKPVVTFKENIKT REVNRDQGRSFPPKEVRRDYSKGITVTKNDGKKDNNKRKTETKKCTLEKLQETGKQNV AVQVKSQTELRKTPVSEARKTPVTQTPTQASNSQFIPIHHPGAFPPLPSRPGFPPPTY VIPPPVAFSMGSGYTFPAGVSVPGTFLQPTAHSPAGNQVQAGKQSHIPYSQQRPSGPG PMNQGPQQSQPPSQQPLTSLPAQPTAQSTSQLQVQALTQQQQSPTKAVPALGKSPPHH SGFQQYQQADASKQLWNPPQVQGPLGKIMPVKQPYYLQTQDPIKLFEPSLQPPVMQQQ PLEKKMKPFPMEPYNHNPSEVKVPEFYWDSSYSMADNRSVMAQQANIDRRGKRSPGIF RPEQDPVPRMPFEKSLLEKPSELMSHSSSFLSLTGFSLNQERYPNNSMFNEVYGKNLT SSSKAELSPSMAPQETSLYSLFEGTPWSPSLPASSDHSTPASQSPHSSNPSSLPSSPP THNHNSVPFSNFGPIGTPDNRDRRTADRWKTDKPAMGGFGIDYLSATSSSESSWHQAS TPSGTWTGHGPSMEDSSAVLMESLKSIWSSSMMHPGPSALEQLLMQQKQKQQRGQGTM NPPH" BASE COUNT 1367 a 1279 c 1147 g 1289 t ORIGIN 1 gccaggcaga gtcctactat aggcatgcag ctcagcttgt cccctccaat gaagcattgc 61 tgtgaagttc cctttcccag ctgcctccac taatctgcaa aaagcacttt ctaaagcact 121 ggaaagccga gatgaggtga aaaccaagtg gggtgtttct gacttcatca aggcctttat 181 taaattccac ggtcatgtgt acctgagtaa gagcttggaa aagttgagcc ctcttcgaga 241 gaaattggaa gaacagttta agaggctgct attccaaaaa gctttcaact ctcagcagtt 301 agttcatgtc actgtcatta acctgtttca acttcatcac cttcgtgact ttagcaatga 361 aaccgagcag cacacttata gccaagatga gcagctatgt tggacacagt tgctggccct 421 ctttatgtct tttcttggca tcctgtgcaa gtgtcctcta cagaatgagt ctcaggagga 481 gtcctacaat gcctatcctc ttccagcagt caaggtctcc atggactggc taagactcag 541 acccagggtc tttcaggagg cagtggtgga tgaaagacag tacatttggc cctggttgat 601 ttctcttctg aatagtttcc atccccatga agaggacctc tcaagtatta gtgcgacacc 661 acttccagag gagtttgaat tacaaggatt tttggcattg agaccttctt tcaggaactt 721 ggatttttcc aaaggtcacc agggtattac aggggacaaa gaaggccagc aacgacgaat 781 acgacagcaa cgcttgatct ctataggcaa atggattgct gataatcagc caaggctgat 841 tcagtgtgaa aatgaggtag ggaaattgtt gtttatcaca gaaatcccag aattaatact 901 ggaagacccc agtgaagcca aagagaacct cattctgcaa gaaacatctg tgatagagtc 961 gctggctgca gatgggagcc cagggctaaa atcagtgcta tctacaagcc gaaatttaag 1021 caacaactgt gacacaggag agaagccagt ggttaccttc aaagaaaaca ttaagacacg 1081 agaagtgaac agagaccaag gaagaagttt tcctcccaaa gaggtgagaa gggactatag 1141 caaaggaata actgtaacta agaatgatgg aaagaaggac aacaacaaga ggaaaactga 1201 aaccaagaaa tgcaccttag aaaagttaca ggaaacagga aagcagaatg tggcagtgca 1261 ggtaaaatcc cagacagaac taagaaagac tccagtgtct gaagccagaa aaacacctgt 1321 aactcaaacc ccaactcaag caagtaactc ccagttcatc cccattcatc accctggagc 1381 cttccctcct cttcccagca ggccagggtt tccgccccca acatatgtta tccccccgcc 1441 tgtggcattt tctatgggct caggttacac cttcccagct ggtgtttctg tcccaggaac 1501 ctttcttcag cctacagctc actctccagc aggaaaccag gtgcaagctg ggaaacagtc 1561 ccacattcct tacagccagc aacggccctc tggaccaggg ccaatgaacc agggacctca 1621 acaatcacag ccaccttccc agcaacccct tacatcttta ccagctcagc caacagcaca 1681 gtctacaagc cagctgcagg ttcaagctct aactcagcaa caacaatccc ctacaaaagc 1741 tgtgccggct ttggggaaaa gcccgcctca ccactctgga ttccagcagt atcaacaggc 1801 agatgcctcc aaacagctgt ggaatccccc tcaggttcaa ggcccattag ggaaaattat 1861 gcctgtgaaa cagccctact accttcagac ccaagacccc ataaaactgt ttgagccgtc 1921 attgcaacct cctgtaatgc agcagcagcc tctagaaaaa aaaatgaagc cttttcccat 1981 ggagccatat aaccataatc cctcagaagt caaggtccca gaattctact gggattcttc 2041 ctacagcatg gctgataaca gatctgtaat ggcacagcaa gcaaacatag accgcagggg 2101 caaacggtca ccaggaatct tccgtccaga gcaggatcct gtacccagaa tgccgtttga 2161 gaaatcctta ttggagaagc cctcagagct catgtcacat tcatcctctt tcctgtccct 2221 caccggattc tctctcaatc aggaaagata cccaaataat agtatgttca atgaggtata 2281 tgggaaaaac ctgacatcca gctccaaagc agaactcagt ccctcaatgg ccccccagga 2341 aacatctctg tattcccttt ttgaagggac tccgtggtct ccatcacttc ctgccagttc 2401 agatcattca acaccagcca gccagtctcc tcattcctct aacccaagca gcctacccag 2461 ctctcctcca acacacaacc ataattctgt tccattctcc aattttggac ccattgggac 2521 tccagataac agggatagaa ggactgcaga tcggtggaaa actgataagc cagccatggg 2581 tgggtttggc attgattatc tctcagcaac gtcatcctct gagagcagtt ggcatcaggc 2641 cagcactccg agtggcacct ggacaggcca tggcccttcc atggaggatt cctctgctgt 2701 cctcatggaa agcctaaagt ctatctggtc cagttccatg atgcatcctg gaccttctgc 2761 tctggagcag ctgttaatgc agcagaagca gaaacagcaa cggggacaag gcaccatgaa 2821 ccctccacac tgaggccaaa gtggcaacct gggaatgaag gctccataaa ccatggcatg 2881 ttgggtttgc aggactggcc cacacagtcc cctgcaggtg gcagccctct tttctgtttc 2941 tcgctgtcaa gagggtgtaa gtattccacc agcccgctga gtgtgcacga aatgttcgca 3001 gtgcaacaaa aagaaaaatc catcaggaac tctccgtccc cccggggcct tccggaggga 3061 gagagagagg aactgctgtt tatctcactc agttacttgg tatcaccgcc tctcaccttc 3121 tccatcgtgc atgtccccag ccacatggga agtgaaagct gagaagggaa ggcagatggg 3181 agaagccaat gggaacttct cagtcctttt ttcctctttg gggaataaaa taggaatcca 3241 ttaatgattg ctttgctgac tgagaatgta gttgaaatta aacatctttt attattatta 3301 ctctcagtag taaaatatca cactgaattc ttccatacac aggtgtgctt ctagtcagtg 3361 tgtagcaagg aaagccccgt tcactgctcc tgtgagaggt tggtggtgac aggatgggga 3421 accgacctct tcagccagtg gaaatgttcc ataagggaga gttcaaggcc tgtcagaagg 3481 ctctggtagg ccttcctctg gccaggagac tccagcaggg aatgcccttc actctgtagg 3541 tgctcgagcc ccaatcgagg atacagtgtg ggtggtggtg ctgggctgga ctagcaggta 3601 gactgctgta ggatttcaaa ttacgttttt gattcctgta cattttacag tcgcacagca 3661 agcagtctca cagaaggcag gctagtccat tcacagcctg acacgttcta ataggtagaa 3721 gctttcagtg tggttatttt ttctttggtt ggtttttgtg cccccattct acttcccacc 3781 ctcctgcccc atctccatcc cttcttttac ccaatgctgt atgctagtaa ttgtttttat 3841 tcctaatgtg tgcaacatca catctcccca agaagcaaca gcatggggtc cagcagttgg 3901 ggcccaaaag acagtctgaa gaggaaggaa gcagcagtat ctgcgtagcc cacagagggc 3961 ccaggcccct gcccagctgc agtctcccag cctccacttt cagagtgaaa ttcaaggcag 4021 cacggacatg tgcccatcag gcacagaaga aaacacgacg tcgtccattt tggaagagac 4081 gaaagaaagg aaaataaact ctttgtatga tatttattag gaggaaagag gactgaaaat 4141 gttcttgtgt agaaacagaa ggacagcatt tctgttagtc atttcctgga aaagtaatat 4201 tttaagggga aattatggaa acaatctaat tgttcaattg ctgtgctagt ggtagggttt 4261 attttctggg aggtctctcc tttgtgtgtc tgtatgtttg tgtacacaca cgtgcccatc 4321 tgctgtccca gaggggaggg gttgtgtgtg cgagtgtatg gagttagtgt ggaacttaag 4381 agctggaaga cagctgtaga gcaaagcaca tccaggagcc ccagttgtca ctgcagtctg 4441 ggcaacccca gcaatgaaaa ggggtgagat aacgctcatt gctcttcaga gagagtggtt 4501 ggagcccccc ccgccccgta tgcttacatt attgctcttt tagtttgaca tggtgtttgg 4561 gttttgtttt tttgaaaggt ctgaaaaggt gaagccccct acccaatggc aatatgaaac 4621 cttttgtgct tctcttcagc cccttccctg tgtccacctt tctctcctct tcccaagcct 4681 ttttcctact acctttaccc agtttgtgtg tttgagctct gcattcaggc agctgcaaca 4741 ttccagtgtt tgaactgtca ctgattcttg cgccctagac aagctaacca ggtttaccat 4801 ctcactccca gtaataccca gctcctatct aaagccccat tctgcatgag aatttggtgt 4861 ttggaatgtt ttctgactct tggggcggga ttcctcgcct tatcatcctc actgtggagt 4921 aatgaggggg aggagaatct ttatcagaaa ctggttttgt gtagtaaact ttcttttgtg 4981 gttttttgtt ttgttttctg ggttttgttt tttgtttttg tctgtgcaag acctgcagct 5041 gctgaaaatc agctttgcct ttaattaaac catgttctct cc // LOCUS D87443 6049 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0254 gene, complete cds. ACCESSION D87443 NID g1665774 KEYWORDS KIAA0254. SOURCE Homo sapiens male bone marrow myeloblast cell_line:KG-1 cDNA to mRNA, clone_lib:pBluescript II SK clone:HA7011. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6049) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (27-AUG-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes.VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from human cell line KG-1 and brain JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohira,M., Kawarabayasi,Y., Ohara,O., Tanaka,A., Kotani,H., Miyajima,N. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain JOURNAL DNA Res. 3 (5), 321-329 (1996) MEDLINE 97191544 FEATURES Location/Qualifiers source 1..6049 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /clone="HA7011" /clone_lib="pBluescript II SK" /sex="male" /tissue_type="bone marrow" gene 529..3507 /gene="KIAA0254" CDS 529..3507 /gene="KIAA0254" /codon_start=1 /db_xref="PID:d1014074" /db_xref="PID:g1665775" /translation="MKTETVPPFQETPAGSSCHLNNLLSSRKLMAVGVLLGWLLVIHL LVNVWLLCLLSALLVVLGGWLGSSLAGVASGRLHLERFIPLATCPPCPEAERQLEREI NRTIQMIIRDFVLSWYRSVSQEPAFEEEMEAAMKGLVQELRRRMSVMDSHAVAQSVLT LCGCHLQSYIQAKEATAGKNGPVEPSHLWEAYCRATAPHPAVHSPSAEVTYTRGVVNL LLQGLVPKPHLETRTGRHVVVELITCNVILPLISRLSDPDWIHLVLVGIFSKARDPAP CPASAPEQPSVPTSLPLIAEVEQLPEGRASPVAAPVFLSYSEPEGSAGPSPEVEEGHE AVEGDLGGMCEERKVGNNSSHFLQPNLRGPLFLCEDSELESPLSELGKETIMLMTPGS FLSDRIQDALCALESSQALEPKDGEASEGAEAEEGPGTETETGLPVSTLNSCPEIHID TADKEIEQGDVTASVTALLEGPEKTCPSRPSCLEKDLTNDVSSLDPTLPPVLLSSSPP GPLSSATFSFEPLSSPDGPVIIQNLRITGTITAREHSGTGFHPYTLYTVKYETALDGE NSSGLQQLAYHTVNRRYREFLNLQTRLEEKPDLRKFIKNVKGPKKLFPDLPFGNMDSD RVEARKSLLESFLKQLCAIPEIANSEEVQEFLALNTDARIAFVKKPFMVSRIDKMVVS AIVDTLKTAFPRSEPQSPTEELSEAETESKPQTEGKKASKSRLRFSSSKISPALSVTE AQDKILYCLQEGSVESETLSMSAMESFIEKQTKLLEMQPTKAPEKDPEQPPKGRVDSC VSDAAVPAQDPSNSDPGTETELADTALDLLLLLLTEQWKWLCTENMQKFLRLIFGTLV QRWLEVQVANLTSPQRWVQYLRLLQESIWPGGVLPKFPRPVRTQEQKLAAEKQALQSL MGVLPDLVVEILGVNKCRLSWGLVLESLQQPLINRHLIYCLGDIILEFLDLSASVEES AATTSASDTPGNSKRMGVSS" BASE COUNT 1523 a 1525 c 1493 g 1508 t ORIGIN 1 cggccggccg gcggtcatcc ttcacacagg cccggctgcg gagcgcagtc gcccaggttt 61 acggcggcag agggccgagg ctcccggaga ggaccgcgat gagtccactt caggtcgggg 121 ctgagtgtgg gcgcgcggtt cccggcggcg tgtgacccag tcttcgccgg ccgctgcccg 181 ctgcgtgtgt aagtgtccgt gggccccagg gccttgcagc gcccgccccg aggccccgaa 241 gaggccttcc gcctggggca gtgtgaaggc cacagagttt tcctccaaag ccacacctca 301 gctggaaggg gcgagttgga gggctctgtt agtggtagcc atggggagac gcggagctgc 361 agctgtcgct ttgcagtgcc tgtaggactt tgagggcctg gagccccggt tccctgaaca 421 gcacccagat cgccccatat ctgagacaca gttcatgccc cccttcccta actctgaagt 481 aaaatcttga gggctgtcat ctggggaagc caccttgtcc gttcagccat gaagacagaa 541 acagtgccac cgttccagga aactccagct ggatcgagct gtcacctcaa taacctgttg 601 agtagccgga agctgatggc tgtgggggtc ttgcttggct ggctcctggt catacacctt 661 ctggtcaacg tgtggctgct gtgccttctg tcggcattgc tagtggtgct gggaggatgg 721 ctgggctcca gcctcgctgg agtggcttca ggtcgactgc atctggaacg cttcatcccg 781 ttggccacct gtcctccatg ccctgaggca gaaaggcagc tggaacggga gatcaaccgc 841 accatccaga tgattattcg agattttgtg ttatcttggt accgttccgt gagccaggag 901 ccagcctttg aggaagaaat ggaggcagcc atgaaagggt tggtccagga gcttcggaga 961 aggatgagcg tgatggacag tcatgctgtt gcccagagtg ttctgactct ctgcggttgt 1021 cacctgcaga gctacattca ggcaaaggag gccactgcag ggaagaatgg tccagttgag 1081 ccttcccacc tctgggaggc ttactgccgg gcgactgccc cacatcctgc tgtgcacagc 1141 cccagtgctg aagtcaccta tacgcgtggc gttgtgaatt tgttgcttca agggctggtg 1201 cccaagcccc acttggagac tcgtaccgga cgccatgtag tggtcgaact catcacatgc 1261 aatgtaatct taccactgat cagcaggctg tcagatcctg actggatcca ccttgtactc 1321 gtgggtatct tttccaaggc cagagatcca gcaccctgcc cagccagtgc ccccgaacag 1381 ccctcagtgc ccacatctct gccactgatt gctgaggtag agcagcttcc agaagggaga 1441 gcttctccag tagcagcccc agtattccta agttacagtg agccagaggg ttctgcaggc 1501 ccctctccag aggttgaaga aggccacgaa gctgtagagg gagatttggg tgggatgtgt 1561 gaagaaagaa aagtaggaaa caactcatct catttcctac agccaaatct tcgaggtccc 1621 ctgttcttat gtgaagactc agagctggag tctccgctgt ctgaactggg caaagaaacc 1681 atcatgctca tgactccagg cagctttctc tctgacagga ttcaggatgc cctgtgtgcc 1741 ctagagagtt cccaggctct ggaacccaaa gatggtgagg catctgaagg agcagaggct 1801 gaggagggtc cagggacaga aacagagaca ggcctgccgg tctccacact gaattcctgc 1861 ccagagatcc atattgacac agcagacaag gagatagaac aaggagatgt taccgcctct 1921 gttacagctt tgctggaggg gccagaaaag acctgcccct cacggccgtc atgcttagag 1981 aaggatctca ccaatgatgt gagctccctt gatcctactc tgccaccagt tctgctttcc 2041 tcctctccac ctggtcctct cagctcagcc accttcagct ttgagcccct aagcagtccc 2101 gatggtccag ttatcatcca gaaccttcgt atcactggca ccattacagc ccgagagcac 2161 agtggcactg gattccaccc atacacactc tatactgtga agtacgagac agcccttgac 2221 ggtgaaaaca gcagcggcct gcagcagctg gcctaccaca ctgtgaatcg tcgctatcgg 2281 gagttcttga atctgcagac ccgtctggag gagaaaccag atctacgaaa gttcatcaaa 2341 aatgtgaagg gtcctaaaaa gctctttcca gatcttccat ttggaaacat ggacagtgac 2401 agagtagaag cccgtaagag cctcctagaa tcattcctaa agcaactctg tgccattcca 2461 gagatcgcta acagtgagga ggtgcaggag ttccttgctc tgaacacaga tgctcgtatt 2521 gcctttgtca agaaaccatt tatggtctct agaatagaca agatggtggt gagtgccatt 2581 gtggacacct tgaagacagc gtttcctcgc tctgaacccc agagccccac agaggagctg 2641 agtgaggccg agaccgaaag caagccccag acagaaggca agaaggctag caagtccagg 2701 ctgaggttct catccagtaa aatttctcca gcactaagtg tgactgaagc acaagacaag 2761 attctttatt gtctccagga aggcagtgtg gagtctgaga ctctatccat gtctgcgatg 2821 gaatctttta ttgaaaaaca gacaaagtta ctggaaatgc agccaacaaa agccccagaa 2881 aaagatcctg aacaacctcc caaaggacgt gtggacagtt gcgtgtcaga tgcagccgtg 2941 ccagcccaag accccagcaa cagcgatcca ggaacagaga cagagttagc tgacacagcc 3001 ctggatctgc tcctcttgct actaacagaa cagtggaaat ggctatgtac cgaaaacatg 3061 caaaagtttc ttcgtcttat ctttgggacc ctagttcaaa ggtggctaga ggtgcaggta 3121 gctaatttaa caagtccaca gcgctgggtg cagtacctcc ggcttcttca ggagtccatc 3181 tggcctggtg gagttttgcc taagtttcca cggcccgtaa ggacccaaga gcagaaactg 3241 gctgctgaga aacaggcttt gcagagcctg atgggagtcc tcccagatct cgtagtagaa 3301 attcttgggg tgaacaaatg ccggctgagc tggggtctag tcctggagtc actacaacaa 3361 cccctcatca acaggcattt gatttactgc cttggggaca tcatcctgga attcttggat 3421 ctcagtgcct ctgttgagga gtctgctgct accacctctg cctcagatac cccaggcaac 3481 tctaagagga tgggtgtctc ctcttagctg gttattcacg ccttcttccc aggtcaggga 3541 agtagagtta ctcggccacc agagaccagt caggaagccc gtgccctctt caagtaggct 3601 acagctaaga aggcctaggt tctcggtggc tcacttctct cccacttctg ctccatttgt 3661 agcaggtgac aggggtctgt gtgtcccagt tgcatgcacg tttatttccc tgttcctttt 3721 gtgatgttgg gattgttgct ggtgagtaga tcctgtttcc tttgggaaaa gaagctgtga 3781 ggtagaggaa tgaacccctt ttgcttctcc ttttgccatc ctccctgaga ttgtttgcca 3841 gaatcctggc cctgtgcaca catgtgctta atccaccagc acacaagctc tagatagaac 3901 aggaaaaaaa ctgtggccag atttccttgg aggagaaaac aaccttctca gctctctgtt 3961 cctacccgga aactaaaaat tcattcaggg ccattctcaa aaaccagact cctgggtcta 4021 atctagtttc ccaacagttc cagaagaaga aggagggaag aaaatttttc aaggaacttc 4081 ttgctcatgt gttgctgttt cccaacacat cagccttatg gaagatgtca tgaggctgct 4141 tgcctgggag gctacacttt cctttccaga agacttcaaa gctgactgcc aaagcttctg 4201 gtaaatccac cccacccttc acaccctttc ctgaaaggca tttataccac ccattttatt 4261 tattggcttt gctcccaacc ccctgcccta gagcaattga gtttttgaat ttttgaccct 4321 ctctctgtct tcagcaaaac tgcatgacaa gctgcatatt cagcctcagt ttttccaagg 4381 agaatggaaa gcacttgcct cctgtcttaa gaatgcacaa agagtaggac tagcaagcct 4441 gggggatgaa atctcccttc ctaagtcttt gacagagaac ttttattatt ctgaacagat 4501 aagattagca aacttacaag gaatagcttc ttatttccca ttccttacgg ttataatcag 4561 tcctttaaca acaccagcat gaaggcaaag gtcttttgtc agactgcaac atctattaga 4621 aaggcaaggc aactgggaat tgctggagaa ggataaatca ggaggaattg ctggtagaaa 4681 aggctgcaga cagtcacagg gaagtgctgt atattaaatg catgtgaata gctggcaaaa 4741 agatgtcaag gatgtcactg ggagcccttt ggcttcacac tcgcctgtta gcgccttgct 4801 ttcgctgaat taccagagcc taccactcgg aggaaggtct tggagttaga ctgatgagaa 4861 ggcaatgggg tgtcaaaaac aagtgtcaaa gttctcttag attctgaagt cggtgatgag 4921 cacacaatta aagtggtttg tggaagaaag ttgtaaaatc tttttacttt tttgcccctc 4981 tggaggcttt taaccctaaa ttgtgtctga tgtattcggg aggcttaggt gtgcaattct 5041 tgcccaaagg caaggagatg aatgagatgt ctcatggcct cttctagcct taaaagttga 5101 ccattacagt gactttatgg ctccaaagtt cttgaagcat ctccagccct agactttgac 5161 aagaaaaggt agctcagcat ctctactttc cgtgagtacc tgcaggagtc cagccaatgt 5221 attagagctg atcattaagc tccttcagtg agggctaaga tcctattctc gatgagcata 5281 gggggacagc cttgttttat gccacttttc tctccccata ccttcccctc atgtgtactt 5341 agccacctgt gttgctttga atctgctgcc agttctggct caaatgtggc acaaaatcag 5401 tacatcagac acaccatgaa atccctgtgg ctataaattc taaggaatct tttgacactt 5461 ccagcccaaa gaacttattt ttatttcgta ttatataaga gtccccacct ctactccacg 5521 taccccaaaa agataagctg tcctcagatc ctccttctac ccaggaggga aataatggat 5581 attgccaagg gaacaatcag aactcatctg gctttgaaaa cgacctgcca agtttggcta 5641 gcaggttcag acaacaagag agtaacaatt ggtggagagc cacaggtagc tggaaaagat 5701 actcccagct gacagggtga taaaaggcag ggaagaattg ggcctgtgct gatccaggaa 5761 ggttttattg tttgcttgtt ttttgtattt actcgtgact aaattttatt catttaagtc 5821 agaattttga tatttgaaat aagactactt tcccttttct tactatctat aaaaaagacg 5881 aggcagggat gctgtgttct ttctgtcaaa ttctaaatgt gatctttgcc attgcttatt 5941 tgaattcaag aaagctatac tttttcatca catataactc acagtttcat ttttgcattg 6001 ttggattaga cttaataaaa cctaaatgga aaataaattc ctaaatctg // LOCUS D87444 4028 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0255 gene, complete cds. ACCESSION D87444 NID g1665776 KEYWORDS KIAA0255. SOURCE Homo sapiens male bone marrow myeloblast cell_line:KG-1 cDNA to mRNA, clone_lib:pBluescript II SK clone:HA7076. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4028) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (27-AUG-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes.VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from human cell line KG-1 and brain JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohira,M., Kawarabayasi,Y., Ohara,O., Tanaka,A., Kotani,H., Miyajima,N. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain JOURNAL DNA Res. 3 (5), 321-329 (1996) MEDLINE 97191544 FEATURES Location/Qualifiers source 1..4028 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /clone="HA7076" /clone_lib="pBluescript II SK" /sex="male" /tissue_type="bone marrow" gene 341..2218 /gene="KIAA0255" CDS 341..2218 /gene="KIAA0255" /note="Similar to S.cerevisiae EMP70 protein precursor (S25110)" /codon_start=1 /db_xref="PID:d1014075" /db_xref="PID:g1665777" /translation="MCETSAFYVPGVAPINFHQNDPVEIKAVKLTSSRTQLPYEYYSL PFCQPSKITYKAENLGEVLRGDRIVNTPFQVLMNSEKKCEVLCSQSNKPVTLTVEQSR LVAERITEDYYVHLIADNLPVATRLELYSNRDSDDKKKEKDVQFEHGYRLGFTDVNKI YLHNHLSFILYYHREDMEEDQEHTYRVVRFEVIPQSIRLEDLKADEKSSCTLPEGTNS SPQEIDPTKENQLYFTYSVHWEESDIKWASRWDTYLTMSDVQIHWFSIINSVVVVFFL SGILSMIIIRTLRKDIANYNKEDDIEDTMEESGWKLVHGDVFRPPQYPMILSSLLGSG IQLFCMILIVIFVAMLGMLSPSSRGALMTTACFLFMFMGVFGGFSAGRLYRTLKGHRW KKGAFCTATLYPGVVFGICFVLNCFIWGKHSSGAVPFPTMVALLCMWFGISLPLVYLG YYFGFRKQPYDNPVRTNQIPRQIPEQRWYMNRFVGILMAGILPFGAMFIELFFIFSAI WENQFYYLFGFLFLVFIILVVSCSQISIVMVYFQLCAEDYRWWWRNFLVSGGSAFYVL VYAIFYFVNKLDIVEFIPSLLYFGYTALMVLSFWLLTGTIGFYAAYMFVRKIYAAVKI D" BASE COUNT 867 a 1151 c 985 g 1025 t ORIGIN 1 ccaagatggc gacggcgatg gtgagtgaag gagactccgg gagcgggagc tggagcgggg 61 ccctccgggg tatcccagga tcttccagca ccccatgcct ggccctgagc cacctccggg 121 acccctgact caggcctgag ggctacctct gactgggctt gtcttccccg aaatccacct 181 ccctggccct gcccctgcac tcaggcttgt gaaggccccg agttttgggg gaggcgccgt 241 ttcggaggaa gacctcggct gctgccttcg ccggttccca ttctactttt ggtctccgcc 301 cactgattgg ttgccgtggt ctttactgct tttctccctg atgtgtgaaa caagcgcctt 361 ctatgtgcct ggggtcgcgc ctatcaactt ccaccagaac gatcccgtag aaatcaaggc 421 tgtgaagctc accagctctc gaacccagct accttatgaa tactattcac tgcccttctg 481 ccagcccagc aagataacct acaaggcaga gaatctggga gaggtgctga gaggggaccg 541 gattgtcaac acccctttcc aggttctcat gaacagcgag aagaagtgtg aagttctgtg 601 cagccagtcc aacaagccag tgaccctgac agtggagcag agccgactcg tggccgagcg 661 gatcacagaa gactactacg tccacctcat tgctgacaac ctgcctgtgg ccacccggct 721 ggagctctac tccaaccgag acagcgatga caagaagaag gaaaaagatg tgcagtttga 781 acacggctac cggctcggct tcacagatgt caacaagatc tacctgcaca accacctctc 841 attcatcctt tactatcatc gggaggacat ggaagaggac caggagcaca cgtaccgtgt 901 cgtccgcttc gaggtgattc cccagagcat caggctggag gacctcaaag cagatgagaa 961 gagttcgtgc actctgcctg agggtaccaa ctcctcgccc caagaaattg accccaccaa 1021 ggagaatcag ctgtacttca cctactctgt ccactgggag gaaagtgata tcaaatgggc 1081 ctctcgctgg gacacttacc tgaccatgag tgacgtccag atccactggt tttctatcat 1141 taactccgtt gttgtggtct tcttcctgtc aggtatcctg agcatgatta tcattcggac 1201 cctccggaag gacattgcca actacaacaa ggaggatgac attgaagaca ccatggagga 1261 gtctgggtgg aagttggtgc acggcgacgt cttcaggccc ccccagtacc ccatgatcct 1321 cagctccctg ctgggctcag gcattcagct gttctgtatg atcctcatcg tcatctttgt 1381 agccatgctt gggatgctgt cgccctccag ccggggagct ctcatgacca cagcctgctt 1441 cctcttcatg ttcatggggg tgtttggcgg attttctgct ggccgtctgt accgcacttt 1501 aaaaggccat cggtggaaga aaggagcctt ctgtacggca actctgtacc ctggtgtggt 1561 ttttggcatc tgcttcgtat tgaattgctt catttgggga aagcactcat caggagcggt 1621 gccctttccc accatggtgg ctctgctgtg catgtggttc gggatctccc tgcccctcgt 1681 ctacttgggc tactacttcg gcttccgaaa gcagccatat gacaaccctg tgcgcaccaa 1741 ccagattccc cggcagatcc ccgagcagcg gtggtacatg aaccgatttg tgggcatcct 1801 catggctggg atcttgccct tcggcgccat gttcatcgag ctcttcttca tcttcagtgc 1861 tatctgggag aatcagttct attacctctt tggcttcctg ttccttgttt tcatcatcct 1921 ggtggtatcc tgttcacaaa tcagcatcgt catggtgtac ttccagctgt gtgcagagga 1981 ttaccgctgg tggtggagaa atttcctagt ctccgggggc tctgcattct acgtcctggt 2041 ttatgccatc ttttatttcg ttaacaagct ggacatcgtg gagttcatcc cctctctcct 2101 ctactttggc tacacggccc tcatggtctt gtccttctgg ctgctaacgg gtaccatcgg 2161 cttctatgca gcctacatgt ttgttcgcaa gatctatgct gctgtgaaga tagactgatt 2221 ggagtggacc acggccaagc ctgctccgtc ctcggacagg aagccaccct gcgtggggga 2281 ctgcaggcac gcaaaataaa ataactcctg ctcgtttgga atgtaactcc tggcacagtg 2341 ttcctggatc ctggggctgc gtggggggcg ggagggcctg tagataatct tgcgtttttc 2401 gtcatcttat tccagttctg tgggggatga gtttttttgt gggttgcttt ttcttcagtg 2461 ctaagaaagt tccctccaac aggaactctc tgacctgttt attcaggtgt atttctggtt 2521 tggatttttt tttccttctt tgttttaaca aatggatcca ggatggataa atccaccgag 2581 ataagggttt tggtcactgt ctccacctca gttcctcagg gctgttggcc accctatgac 2641 taactggaag aggacacgcc agagcttcag tgaggtttcc gagcctctcc ctgcccatcc 2701 tcaccactga ggccacgaca aagcacagct ccagctcgga cagcaccctc agtgccagcc 2761 agcctctgcc agacctctct ttccctcttc tccccagcct cctccagggc tgcccaaggc 2821 agggtttcca gccaggcctc ggggtcatct tttcaccagg agcaaaccca agtcttagtt 2881 gctacaagaa aatcccctgg aagtactggg ggccaggttc cccagacagc aggaattgcc 2941 cctgttcaga gcagccggag tttgctggac cacaaggaag aagagaagag acttgcagtg 3001 aactgttttt gtgccaagaa accctggacc tggggccaag tatttcccaa gccaagcatc 3061 cacttgtctg tgtctgggaa gggatggcca aggccgctag ggtccttacc cctcaggatc 3121 actccccagc cctttcctca ggaggtaccg ctctccaagg tgtgctagca gtgggccctg 3181 cccaacttca ggcagaacag ggaggcccag agattacaga tcccctcctg taagtggcca 3241 ggcattctct ccctgccctc tctggcctct ggggtcatac tcacttcttt agccagcccc 3301 atcccctcca ccccacacct gagttcttgc ctcctccttt tggggacacc caaaacactg 3361 cttgtgagaa ggaagatgga aggtaagttc tgtcgttctt tccccaatcc ccaggaatgg 3421 acaagaagcc aacttagaaa gaagggtctc acgtggctgg cctggctcct ccgtagaccc 3481 ctgttctttt caacctctgc ccacccgtgc atgtcatcac aaacatttgc tcttaagtta 3541 caagagacca catccaccca gggattaggg ttcaagtagc agctgctaac ccttgcacca 3601 gcccttgtgg gactcccaac acaagacaaa gctcaggatg ctggtgatgc taggaagatg 3661 tccctcccct cactgcccca cattctccca gtggctctac cagcctcacc catcaaacca 3721 gtgaatttct caatcttgcc tcacagtgac tgcagcgcca agcggcatcc accaagcatc 3781 aagttggaga aaagggaacc caagcagtag agagcgatat tggagtcttt tgttcattca 3841 aatcttggat tttttttttt ccctaagaga ttctcttttt agggggaatg ggaaacggac 3901 acctcataaa gggttcaaag atcatcaatt tttctgactt tttaaatcat tatcattatt 3961 atttttaatt aaaaaaatgc ctgtatgcct ttttttggtc ggattgtaaa taaatatacc 4021 attgtcct // LOCUS D87445 6935 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0256 gene, complete cds. ACCESSION D87445 NID g1665778 KEYWORDS KIAA0256. SOURCE Homo sapiens male bone marrow myeloblast cell_line:KG-1 cDNA to mRNA, clone_lib:pBluescript II SK clone:HA4798. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6935) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (27-AUG-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes.VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from human cell line KG-1 and brain JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohira,M., Kawarabayasi,Y., Ohara,O., Tanaka,A., Kotani,H., Miyajima,N. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain JOURNAL DNA Res. 3 (5), 321-329 (1996) MEDLINE 97191544 FEATURES Location/Qualifiers source 1..6935 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /clone="HA4798" /clone_lib="pBluescript II SK" /sex="male" /tissue_type="bone marrow" gene 1425..3332 /gene="KIAA0256" CDS 1425..3332 /gene="KIAA0256" /codon_start=1 /db_xref="PID:d1014076" /db_xref="PID:g1665779" /translation="MEQKKLQEALSKAAGKKNKTPVQLDLGDMLAALEKQQQAMKARQ ITNTRPLSYTVVTAASFHTKDSTNRKPLTKSQPCLTSFNSVDIASSKAKKGKEKEIAK LKRPTALKKVILKEREEKKGRLTVDHNLLGSEEPTEMHLDFIDDLPQEIVSQEDTGLS MPSDTSLSPASQNSPYCMTPVSQGSPASSGIGSPMASSTITKIHSKRFREYCNQVLCK EIDECVTLLLQELVSFQERIYQKDPVRAKARRRLVMGLREVTKHMKLNKIKCVIISPN CEKIQSKGGLDEALYNVIAMAREQEIPFVFALGRKALGRCVNKLVPVSVVGIFNYFGA ESLFNKLVELTEEARKAYKDMVAAMEQEQAEEALKNVKKVPHHMGHSRNPSAASAISF CSVISEPISEVNEKEYETNWRNMVETSDGLEASENEKEVSCKHSTSEKPSKLPFDTPP IGKQPSLVATGSTTSATSAGKSTASDKEEVKPDDLEWASQQSTETGSLDGSCRDLLNS SITSTTSTLVPGMLEEEEDEDEEEEEDYTHEPISVEVQLNSRIESWVSETQRTMETLQ LGKTLNGSEEDNVEQSGEEEAEAPEVLEPGMDSEAWTADQQASPGQQKSSNCSSLNKE HSDSNYTTQTT" BASE COUNT 2199 a 1305 c 1429 g 2002 t ORIGIN 1 gccgcctcct cggccagtgg cgtagccgaa tcggtgtcgc ggccagccag ataggggcgg 61 aggtccggaa cccagtctgg acccgagcgg ggggccatgg agaaagcggc ccgaggcgct 121 gtttacaccg actagcgcgg gcccgttgcg gctgcaggca ccatggaccg agcccccacg 181 gagcagaatg tcaagctgtc agctgaggtg gagccattta ttccccagaa gaagagtcct 241 gatacattta tgatccctat ggctctccca aatgataatg gaagtgtttc tggtgtggaa 301 ccaactccaa ttcccagcta cctgattact tgttacccat ttgtgcagga aaaccagtcc 361 aatagacagt ttcctttata taacaatgat atacgatggc aacaacccaa tccaaaccct 421 actggaccat actttgccta tcccattata tctgctcagc cgcctgtttc tacagagtat 481 acatattatc agctgatgcc agcaccatgt gcccaggtta tgggtttcta tcatcctttt 541 cctacacctt actccaacac ctttcaggct gcaaatactg taaatgctat caccacagaa 601 tgcactgagc gtccaagtca gcttggacag gtcttcccat tgtccagcca tcgaagcaga 661 aacagtaaca gaggatcagt ggtcccaaaa caacagcttt tacaacagca cataaaaagc 721 aaaaggccgc tggtgaaaaa tgtagctact cagaaagaaa caaatgcagc aggtcctgat 781 agtcgatcaa aaattgtgct tctggtagat gcttcacagc aaactgattt cccatcagat 841 atcgctaaca agtctctctc agagaccact gcaacaatgc tctggaagtc caagggcagg 901 agaagaagag catcccaccc tactgctgaa tcttctagtg agcagggggc tagtgaagcc 961 gacattgaca gtgatagtgg ttactgcagt cccaaacaca gcaacaacca gcctgcagca 1021 ggggctttga gaaatcctga ttctgggacc atgaatcatg tggaatcatc tatgtgtgca 1081 ggtggtgtaa attggtccaa tgtaacttgc caggcaactc agaaaaaacc ttggatggaa 1141 aaaatcagac attttctaga ggtggaaggc aaactgaaca aagaaataat tcacaggatg 1201 aagatgggtt tcaagaacta aatgagaatg gaaatgctaa ggatgagaat attcaacaaa 1261 aactttcttc taaagtattg gatgatttac ctgaaaactc accaatcaat atagttcaga 1321 ctccaattcc tattaccacc tcagttccca aacgtgcaaa aagtcagaag aagaaagctt 1381 tagcagcagc ccttgccaca gctcaagagt attcagaaat aagtatggag caaaaaaaat 1441 tacaggaagc tttatcaaaa gcagctggaa aaaagaataa aacacctgtg cagctagatt 1501 taggggacat gttagctgct ctggaaaaac aacagcaagc aatgaaagca cggcaaatta 1561 ctaacaccag acctctgtca tatacagtgg ttactgcagc ttcttttcac actaaagact 1621 ctactaatag aaaaccttta accaaaagtc agccctgttt gacatccttt aattctgtgg 1681 acattgcttc ttctaaagca aaaaaaggaa aagagaagga aattgcaaaa ctaaaacgac 1741 ccacagcact taaaaaggtt attttaaaag aaagagagga aaagaagggg cgcttaactg 1801 tggaccacaa tcttttggga tccgaggaac caacagaaat gcacttagat tttattgatg 1861 acttgccaca ggagattgtt tcccaggaag atactggact aagcatgccc agtgatactt 1921 cactctctcc agcaagtcag aactctccat actgtatgac acctgtgtca caaggctctc 1981 ctgctagttc tggaataggc agtccaatgg catcttcaac aataaccaaa atccacagca 2041 aaagatttag agagtattgt aatcaggttc tttgtaaaga gattgatgaa tgtgtgactc 2101 ttcttctcca agagcttgtc agtttccagg aacgcatcta ccaaaaagat cctgtaagag 2161 caaaagcaag gagacgactc gttatgggtc taagagaagt taccaaacat atgaagttaa 2221 acaagatcaa gtgtgttata atttctccaa actgtgaaaa aatccagtca aaaggtggtc 2281 tggatgaggc tctctataat gttatagcca tggcacggga acaagaaatt ccttttgtgt 2341 ttgcccttgg aaggaaagct ctaggacgct gtgtgaacaa gctggttcct gttagcgtag 2401 tgggaatctt caactacttt ggtgctgaga gcctgtttaa taaattagta gaactcactg 2461 aggaggccag gaaagcatat aaagatatgg ttgcagcaat ggaacaggag caggctgagg 2521 aagccttaaa gaatgtgaag aaggtaccac accacatggg acattctcgg aatccctctg 2581 cagcaagtgc catttctttc tgcagtgtta tttctgaacc gatctctgaa gtaaatgaaa 2641 aggaatatga aacaaattgg agaaacatgg tggaaacttc agatggactg gaagcatcag 2701 aaaatgagaa agaggtatcc tgtaagcaca gcacttctga aaaacccagt aaacttccat 2761 ttgacacacc cccaattggt aagcagccat cattagtggc tacaggcagt actacctcag 2821 ctacaagtgc tgggaaatcc acagcaagtg ataaagagga agtgaagcca gatgacctgg 2881 aatgggcctc acagcagagt acagagactg gctctttgga tggcagttgc cgagatcttt 2941 tgaattcctc catcaccagc accaccagca ctcttgtacc tggcatgctt gaagaagaag 3001 aagatgaaga tgaggaggag gaggaagatt atactcatga acccatatct gtagaagtgc 3061 agctcaatag tagaattgag tcttgggtct cagagaccca gagaactatg gaaacccttc 3121 agcttggaaa aacccttaat ggttctgagg aagacaatgt agagcaaagt ggagaagagg 3181 aagcagaggc gcctgaggtg ctggagccag ggatggacag tgaggcatgg actgctgacc 3241 agcaggccag tcctgggcag cagaagtcca gcaactgcag ctcgctcaac aaagagcact 3301 ctgattctaa ttacacaacg caaactacgt aactcaggaa atgtcggctc tctatctcca 3361 gctgtggaag ggttgcagcc attacctttt atgcttcatc tcaacatttt gcactgtcca 3421 gtatttaata tacgtattta attcccaaca aatatttttg tagcttttac ttgttatgat 3481 ctgtagctta gcttttaatt agtatctaag tgtctttcta agaactgtgt ggaaaattca 3541 gatctgtttc agcttatttt gtaatcaaaa acagtgataa aaagaagacc agatcttaaa 3601 gaaaataaat ttcaaatgct tacttaaaag acattttgaa agttaaagaa caaggttcta 3661 aggatagaag cagttatcag tgtttgcttc aggactccac ctcctctact ctaatttgac 3721 caaaaaattg tttgggcttc tttaaaaaag aactgggggt ggagtcagaa aattaaatga 3781 aaggctgagg gtaactaagt ccaccagtgt tgtatgttaa aaaatcaatg caacttttat 3841 gtggtccaca aatgtttagt cagaagtcac tgattattgt aattaattag tgttgggatg 3901 ggctaaaaca gagccttcaa aacttcggct agcagtggag ccaccatctt agattatagc 3961 tagctagcct catttgtgga aagtgataga tgctgtctat aatagtgaac agtcacccat 4021 gataggacct ccaggttctg tctcatattt gcttcttact tacctcagga atgctcttgt 4081 acatagactt atttacaaaa agctaggcac atgttgacag gtgaataact gtaaccgatt 4141 gtatgactgc tgcacttaca tgtaaactct tcagaaacag agtcttatac tggtgtgttc 4201 tcttgcatgc ttctggttca ggactcttga tttgagatat ggatttgatt gagtatccaa 4261 acttgtcctg agtgcaaaac tgtttcacct tttaaaaaat acctattttg cacctagcct 4321 tgagcacctt ccacatagca atgaccatag ttactgtcag gaggtcaagg aaaggaactt 4381 tgcacaactt gtgacatgta tcctgataat caaggcttag aggaggaagt tttagaagat 4441 aagagaaagt tgttctaatt gtgctgaaac tattagatga tttagagtat acagatatgt 4501 aggtattaat tctctattca ctattattta tctctgccct tctctaggag tttgtatacc 4561 tgcttaggag acaataaatg agctaaatgt tttatttgct agtcagtcac cacctggact 4621 tcagtgactt tacaagttta tgtaatggtg gaagaatgac aaactatgta atttttttgt 4681 cttccatcca actccccacc acccccaact gtccccccca cccccctcac acacatgcac 4741 acatccgtac gtgtgtgtgt tttccactta caagcttcca taagcaggca caaaactgag 4801 aaggaagggg tattatccct gccctgatta tctggggcag ggctttgcct cacagaggca 4861 ggagagaaga attgggcaga ttctttactg aactcattgg gactactgtg ctagttttga 4921 tgttttataa tgctggcatt taattactgg agagattgga ttcttggttg atgatttagt 4981 atttgtgaat tgtgaaagtt caggagctgt gtagaaaatg ttagtcaatc aactttatta 5041 ttgtgctaaa aggggacatt cttatactgt cctgtctaaa ctgttctcca gtatagactt 5101 cctaggcact aaatatccaa tatttaaagg aacacagcag gtaaggaatg aagcctctga 5161 aatagtactc atggatttat acatggcaga tcttactgtc tctacacatt tggaagtgtt 5221 cgttggttta aagaaatgat agaggttttg aactactgac agtcttaaaa gtgaatttaa 5281 aaactgttca tactttttat ggtgtaaatt tcctttgctc gatgtcagtg attcagataa 5341 ctcttgacct tgagatgatg gcttttcaca ggtttcttat attttatatc tcttctgaac 5401 atgaattgtc attttagatt tttgacattt gtatcaaaag agaagttgag gaaatcttca 5461 gaacactggt aacttttagt tttgctatag acttcagaag tgtttattta tatgttcggt 5521 aaatgctctc gcatatgcag tacctcttct gccagcaaat ccaagggacc atagcctttt 5581 tatgagacag gtcacctcta gaggacaccc caagaattat taaaggaaat gttaccattt 5641 tgagagcatg cttaaataaa tattaataat gtctttataa cttgtttcct ttaaattttg 5701 gaatattgaa ttacaggctt tggaggagtt gtgaaaatta ggaaagtttt tatatatttt 5761 ttgaagtggg catggttggc tctttgaaga cctataaaga gatccagtgg gaagagtaag 5821 ggttggttca tcatcacaag aaataaaaaa catagtgatt ttttctctta atgtgtagag 5881 gtggttttac tggcaataat taataataga tttctatttc agtatgtaag catattaact 5941 aaaatatgaa ttacacttcc aaagttagat ttctgcttca gtaggtttgt ttgctgtgaa 6001 gattacttct caaaagacag atgttcatat tagcttaatt ttcggtttaa atatgtttgt 6061 aaatgatgta atatatttct tttgactaaa tgtggaaaag taatgtgtgt tatacattga 6121 gaagttttta ctggctttga ctggaggttg tttttgcaga gatggtattt tatatgattc 6181 cagtatttgg aaaagaatta gtcaaaagga attcacatag tttaaatact gagaaattaa 6241 tatccaaata tgtacttgtc tgatttctaa ataagctggg ggaggaggga ggggtgggaa 6301 ttgaaatgtg caaatgagta gtgaatgcta cactcatttt caactcttta acatgaaact 6361 gttcaatctt aacacattgt tactttaata tatgtataaa gaagtattac tgtttgtaaa 6421 gctgctgttt gcttaaaaaa aaaaaacacc cttgtcatgt attttctgta tgttgggcca 6481 acaggttaga acatcaactc atttaaaaat ttttatcttt ttttgattta aaaaaattct 6541 gtgaaataat ttatttacag acatcttcct cctccctcat cccttccaac ctttacatac 6601 atcacagaat caaccaaact gtttgcctaa tctgaaatct gaatcctaat gagaaaaatt 6661 taaattttgt tggcacatca caccttgaaa gtatttgtat tattttataa tttaatttct 6721 aaatatacca cataagttta taatttaatg tcttaattgt aatgctctaa taaaaaacta 6781 gcaaaattag tgtgagttat aacatgaagg gattttcatc ttttgctgta tgaaggataa 6841 ttgttatatc acatttgggg ggtaataaca gcttttttgc actatgtaaa tactagtggg 6901 gattcttctg tactaataaa atgattattg aaatg // LOCUS D87447 6313 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0258 gene, complete cds. ACCESSION D87447 NID g1665782 KEYWORDS KIAA0258. SOURCE Homo sapiens male bone marrow myeloblast cell_line:KG-1 cDNA to mRNA, clone_lib:pBluescript II SK clone:HA7053. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6313) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (27-AUG-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes.VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from human cell line KG-1 and brain JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohira,M., Kawarabayasi,Y., Ohara,O., Tanaka,A., Kotani,H., Miyajima,N. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain JOURNAL DNA Res. 3 (5), 321-329 (1996) MEDLINE 97191544 FEATURES Location/Qualifiers source 1..6313 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /clone="HA7053" /clone_lib="pBluescript II SK" /sex="male" /tissue_type="bone marrow" gene 86..1261 /gene="KIAA0258" CDS 86..1261 /gene="KIAA0258" /codon_start=1 /db_xref="PID:d1014078" /db_xref="PID:g1665783" /translation="MIEVVAELSRGPVFLAGEALECVVTVTNPLPPTATSASSEALAW ASAQIHCQFHASESRVALPPPDSSQPDVQPDSQTVFLPHRGERGQCILSTPPKILFCD LRLDPGESKSYSYSEVLPIEGPPSFRGQSVKYVYKLTIGCQRVNSPITLLRVPLRVLV LTGLQDVRFPQDEAVAPSSPFLEEDEGGKKDSWLAELAGERLMAATSCRSLHLYNISD GRGKVGTFGIFKSVYRLGEDVVGTLNLGEGTVACLQFSVSLQTEERVQPEYQRRRGAG GVPSVSHVTHARHQESCLHTTRTSFSLPIPLSSTPGFCTAIVSLKWRLHFEFVTSREP GLVLLPPVEQPEPTTWTGPEQVPVDTFSWDLPIKVLPTSPTLASYAAPGPSTSTITI" BASE COUNT 1334 a 1665 c 1775 g 1539 t ORIGIN 1 gccaggtccc tgaggggcgg gcagatgagg cctaggggtg ccgatcccta gtgtcgacta 61 tgcgagatct gattccggag ctgccatgat tgaagtggta gcagagctca gccggggtcc 121 tgtatttttg gctggggagg cgctggagtg tgtagtgacc gtcaccaacc cccttccgcc 181 cacggccact tctgcatcca gtgaggccct ggcctgggcc agtgcccaaa tccactgcca 241 gttccatgcc agtgagagtc gagtagcact gcctcctcct gactctagtc agccagatgt 301 ccagcccgac agccagactg tctttctgcc acaccgaggt gagaggggcc agtgtatcct 361 ttctactcca ccgaaaattc tattctgtga cctgaggctt gatcctggag agtccaaatc 421 atactcctac agtgaagtgc tgcccataga gggaccaccc tcctttcggg gtcagtcagt 481 caagtacgtc tacaaactga ccattggctg ccagcgtgtc aactccccta tcactttact 541 cagagtccct ctgagggttc ttgtgctgac tggccttcag gatgtccggt ttccccagga 601 tgaggctgta gccccatcca gtccattctt ggaggaggat gaaggtggga agaaagattc 661 atggctagct gagctggctg gggaacgcct aatggctgcc acatcctgcc gcagcctcca 721 tctatacaat atcagtgatg gccgagggaa agttgggacg tttggcatct tcaaatctgt 781 gtacagactt ggcgaggacg tggtggggac cttaaactta ggggaaggaa ccgtagcttg 841 tttgcagttt tcagtcagct tacagaccga ggagcgtgta cagcctgagt accagcggcg 901 acgtggggca gggggtgtcc cctctgtgtc acatgtgact cacgcccggc accaggaatc 961 ctgcctacat acaactagaa ccagcttctc cctcccaatc cctctcagct ccaccccagg 1021 cttctgtaca gccattgtgt ccttgaagtg gagattgcat tttgaatttg taacgtcccg 1081 agaaccagga ttggtactcc taccccctgt ggaacagccc gaacctacca cctggacagg 1141 acctgagcaa gtacctgtag acaccttcag ctgggacctg cccatcaagg tgctgcctac 1201 tagccccacc ctggcctcat atgctgcccc aggccccagc accagcacca taaccatctg 1261 aaactggccc accctggtgc tagttccttc cggatactga gaactcagca cctggactct 1321 aatgggaccc actttttcca cctggggtcc aatgtcgtgg acagtgagag tcgggctttc 1381 agctatagca ttaatttatt tgttcagaat acattggcag ctgctagtgg tttccctgga 1441 agtggcagca gcagtgagca gtcagcagat ggatgatcag ttgagtttag ctggagtggg 1501 gagcaggagc cccaggaaca ggggtgttgg ctgagcccca ttctgggtca ggccctcccc 1561 ctttgcaggg cagccgaggg tcagattttt gcaccaagga gaactggcag gttcctgcct 1621 cctgacgtac ctcacaccca gccgggaagt cgatgggatg ctgggacctg gggaaccaag 1681 gataggggaa ggagtcagca cagtgaaagg ctgcctttat ccctgcccac atgttccctc 1741 tctcacagtt ttccccccac agagcccctt tcagtggccc cttggtcctc ctaactaagc 1801 tgtcacctac catatgtggg cctttttgtt ttataacagg agtattttct ctccaggtcc 1861 accccaacct cccctgattt atagcctgaa gccttatctt tcacactagt gttggtccct 1921 tcaggtttgg cccatcttgt attgctcttc tgttcattct tacatcacag caatctagtc 1981 actccctggt catccctcag tcactcatat cagagtcatt ctctctggcc atctttggtc 2041 actcacgtgt cacagcagcc cacgccaaca ggatgcagac aggtgcaatg gaaacagtcc 2101 ttgcggagcc aagactcacc cagggtaaaa tatttcccct catagtgaca gggggctagg 2161 gaagaacggg aaatgttagt aggtgtagga gtgctgatga gaggcagagg ctcttctggt 2221 ctggggtgga gacagtaagt acgcactatc cccgtattta gtttgtcttt cctgtttcac 2281 agctggagga agcctgggta ttttgacacg ggatcatctg taaggcccca tcctccctgt 2341 gccctctctg ctgctcctcc attcctaacg cttcacccca ctttaccttg agcttggaag 2401 tagcacttgc tgtagactcc tgggtgctgg aggagtagag acatcaccaa gcagatgatc 2461 ccccagcctc ctaggatccc cttggcctgt ccagcccaga gcatccttag ggccattgct 2521 gctgcacagc cctctcagac ccttcttggc ctctgctcag ctactctggt cttgactcct 2581 tgactttgct ttgcgttgct ccttgagtct tagtttctgt ctttctcccc tgggctcctg 2641 tctcacacta tctccctgcc ctctgctctc acaggctggg gatgtttata aagtgaggac 2701 cctggccccc tgctgagtag agctggaaaa gttgtaactc tgtttcctga ggtgagggca 2761 tgaaaacaag aggtctagct ttaacaagct gtgagagctg attcatgccc cggcacagct 2821 agagggaggg aggtggccat ggagggggca ctggactggg cacttcccca gcaaggaggc 2881 aggaggggcg agggccccca ggtggtcccc agatctcttc cctgacctgg agagaaggaa 2941 gcattccacc ttcccccttt ctcccccact gccaccacca ggggtgtgta tgctgggatc 3001 cctgcctgga ccggagggag gcatttcctg gggatggtta atcctgtgcc ccagccaaac 3061 ccaggagctg caatagggtg cgacggccag aagctccagg agagtgagca ggcacctgga 3121 gtggagactg tgtttccctc agatcctagg gcagggtttc cctaatgtat ccaagaaata 3181 gggctgcccc tcagagatgg tggggagggt ctcttttcct caggcattcc agaggtgaac 3241 tgtccattgc ttatcacctt caaacataca gcagatgtgg gatcacccca catctgggga 3301 tggttctttc ccctttcaaa gaggagcatc tctaagtgcc ctgatgggat gaatcactcc 3361 aggttcacag aggtgtcctc tctttcctcc catatataat ggagtgaggt ttttaggaat 3421 ttatcatttg gcatcctctg agtttcccac aggttctgga ggagcccagg atggattatt 3481 gagagcatgg gctgtagaga cagtcttctt ggattcagat cctgactcca cttagctatg 3541 taacctggtc agattacttc acctctctga gcctgtttcc tcatctataa attggggata 3601 gtaatgccaa ctcattgggc tgttatgagg attactgaga taatgcgtgc agtgctctta 3661 tcaccatctc tggtgcgtaa gcgtcaggaa atagcagttg ctgtgattgg ggctaaagct 3721 ctgaggcaaa atgggcgaca ttattttctt tgaatgacat taagcagttt gtgcatagct 3781 gagggcttct attggggatg gctgtctcct ggcatagacc tctgcacctt tcacactcat 3841 actccttgtc agcagtcccc aacctttttg gtaccaggga ccggttttgt ggaaaacaat 3901 ttttccacca gtggatggag ggggatagca gcggggagat gattttggga tgaaactgtt 3961 tcatctcaga tcatcaggca ttagattctc ataaggagtg tgcaatctag atcccttgca 4021 tgcggagttc acagtggggt ttgcactcct gtgagaatct aatgcctctg ctgatctgcc 4081 aggaggagga gctcaggcgg taatgctcac tcgcctgccg cccacctcct gctttgtgct 4141 cccgcttcct aacaggccac agactggtac tggcctgtgg cctgggggat ggagacccct 4201 aatccatgtc acctttccca cctctttcaa aaacaggtac ctccaggaac attttggttt 4261 tggcccttgt attgacttct gaatgtctag tttgagaaac tgttcccaaa taagccttct 4321 tcccccagat ctgcaccctc gcctctaccc taggacaaga tgtccttttc tcatcatcct 4381 gccaggctaa ctttaagtct cctgcttttt ctcacttgga tttggatcca tttcttccta 4441 tttccgctca tgtgaactct ccagttctcc tttctcacca ctctcctgct agccatctct 4501 ttggcactaa aggccctggt caaattggat ttctttcatt tttccacact tcaaagaccc 4561 atgttctagg tattctccat agggatagtc tctttggcat ttatttggtt tttctacgtt 4621 ttcagtccca tttactccaa gactcactcc ctgccaccta gtgcatcaga tacagctact 4681 tctggctgac ttttcaaggg ggaccaccct acctgtcatc tcttcactgt tcagaaatga 4741 ctgtgtcagt gcacctcaaa ctcccttgct gtccttttcc aaggagacag ctaaggtgga 4801 tggagatgca gaatggacct cacgttcgcc ctagtcagga ctgataccct ttccgtttca 4861 gaggattgcc aagaaaaaac tcacagttga ggcagggtgc tctgaggtcg gctgcggtgt 4921 gggaggcacg gcctgggcct gctctctggg ctggagcagg tggattcgaa ggcctgtcta 4981 gcacgagggc ccaaaggtct tgtcagtggc cagtagctct gccgcctttc ccagagaggg 5041 ggtccagggg acatcctgga aggctgggcc ctgggccacc ttctgctctt gcaagctaga 5101 gccagcccaa tagggggcgg atgtgagtgg ggagctgggg cgcatgaagg tgggggtgat 5161 gccgaagggg aagggatcgc cagtggggat tggtgcgtgt gcggaaacgg ggacagaagt 5221 gaaggttcat cgcctataac gaagatgagg taggcatata ggggcttctg gaaagctaga 5281 ggctgggctg agccaggagt cctctcccag aagttggggg gcggtgcaga ggtgtgggtc 5341 gagcccgcat gcgtgcctgc tggggagggg gtgagtggtg aggaccaggc ccgctgggtc 5401 ctgggggcgc ggtggctggc gcgcaggtcc cggagggggc ggctggcgcg cactacacgc 5461 ttgggaacaa ggaaaacatc cgccggaggc ccggccgggc ggcgctccag cctcggggca 5521 ggtgcgcgga gaggaagtga gagcattccg gcccccccac cccaaccccg gccgctggcc 5581 ctctggtgag tcacagccga cccccgccgc cggagggaga ggggagctgc gggccagagc 5641 cccggagggt ctggaggagc caggagggtt tctgggagca gagggtcact tagtgggctt 5701 ctgtcgtggt gtcgctacgg gcgcgaaacg gacactgaac acagtctgac tgtatggagg 5761 caggtgggga gggatcccct gggagaactt ggcgggccga gagcagaccc cagggcaagg 5821 aggggccccc gagggggaaa ccgggagtcg ggcaggtggc gtaacccaga aagggaagga 5881 gagccggatt gattggggtg agagaggaag gaagcacgcc aagttaggcc tgggagaact 5941 gagggacctg aggagggagg agggagacca acacagggtg ggaaggcgga aatggccaaa 6001 ccccaggcat caggtctgtc cagaggctga cgtagacagt gaagggtgaa gggtaggttt 6061 taggagtagg gggagttatg attatttggt tacattttgg gattatttgg tctcacaggt 6121 agaagggagc ctgctggtct ctgtgtaacg gatggcttaa aagcaaggtt gtctgcgtct 6181 tggattactg tctgccattc agcctttgcc aaaaaatttg gcactgatct gcacattttt 6241 atagtcattt aaaattgtat gactctgtca aatgatttaa gtaattttgg tggattttta 6301 aaaataaaaa aat // LOCUS D87451 3205 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0262 gene, complete cds. ACCESSION D87451 NID g1665790 KEYWORDS KIAA0262. SOURCE Homo sapiens male bone marrow myeloblast cell_line:KG-1 cDNA to mRNA, clone_lib:pBluescript II SK clone:HA7073. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3205) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (27-AUG-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes.VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from human cell line KG-1 and brain JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohira,M., Kawarabayasi,Y., Ohara,O., Tanaka,A., Kotani,H., Miyajima,N. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain JOURNAL DNA Res. 3 (5), 321-329 (1996) MEDLINE 97191544 FEATURES Location/Qualifiers source 1..3205 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /clone="HA7073" /clone_lib="pBluescript II SK" /sex="male" /tissue_type="bone marrow" gene 699..2984 /gene="KIAA0262" CDS 699..2984 /gene="KIAA0262" /note="Contains C3HC4 type zinc finger signature" /codon_start=1 /db_xref="PID:d1014082" /db_xref="PID:g1665791" /translation="MMDGKNSSGSKRYNRKRELSYPKNESFNNQSRRSSSQKSKTFNK MPPQRGGGSSKLFSSSFNGGRRDEVAEAQRAEFSPAQFSGPKKINLNHLLNFTFEPRG QTGHFEGSGHGSWGKRNKWGHKPFNKELFLQANCQFVVSEDQDYTAHFADPDTLVNWD FVEQVRICSHEVPSCPICLYPPTAAKITRCGHIFCWACILHYLSLSEKTWSKCPICYS SVHKKDLKSVVATESHQYVVGDTITMQLMKREKGVLVALPKSKWMNVDHPIHLGDEQH SQYSKLLLASKEQVLHRVVLEEKVALEQQLAEEKHTPESCFIEAAIQELKTREEALSG LAGSRREVTGVVAALEQLVLMAPLAKESVFQPRKGVLEYLSAFDEETTEVCSLDTPSR PLALPLVEEEEAVSEPEPEGLPEACDDLELADDNLKEGTICTESSQQEPITKSGFTRL SSSPCYYFYQAEDGQHMFLHPVNVRCLVREYGSLERSPEKISATVVEIAGYSMSEDVR QRHRYLSHLPLTCEFSICELALQPPVVSKETLEMFSDDIEKRKRQRQKKAREERRRER RIEIEENKKQGKYPEVHIPLENLQQFPAFNSYTCSSDSALGPTSTEGHGALSISPLSR SPGSHADFLLTPLSPTASQGSPSFCVGSLEEDSPFPSFAQMLRVGKAKADVWPKTAPK KDENSLVPPAPVDSDGESDNSDRVPVPSFQNSFSQAIEAAFMKLDTPATSDPLSEEKG GKKRKKQKQKLLFSTSVVHTK" BASE COUNT 760 a 889 c 834 g 722 t ORIGIN 1 cgactcgtcg ccattcccgg agcaggtcgg cctcggccca ggggcgagta tccgttgctg 61 tgtcggagac actagtcccc gacaccgaga cagccagccc tctcccctgc ctcgcggcgg 121 gagagcgtgt ccggccggcc ggccggcggg gctcgcgcaa cctccctcgc ctccccttcc 181 cccgcagcct ccgccccgcc aggcccggcc cggactcccg agccccggcc tcctcgtcct 241 cggtcgccgc tgccgccggg cttaacagcc ccgtccgccg cttctcttcc tagtttgaga 301 agccaaggaa ggaaacaggg aaaaatgtcg ccatgaaggc cgagaaccgc tgccgccgcc 361 gacccccgcc ggccctgaac gccatgagcc tgggtccccg ccgcgcccgc tccgctccga 421 ctgccgtcgc cgccgaggcc cccgttgatg ccgctgagct cccccaacgc cgccgccacc 481 gcctccgaca tggacaagaa cagcggctcc aacagctcct ccgcctcttc gggcagcagc 541 aaagggcaac agccgccccg ctccgcctcg gcggggccag ccggcgagtc taaacccaag 601 agcgaattac taatttcagc tggattcaat ttgttgtcag ttgattctgt agtaaggcca 661 tatgttgccc ctctggaggt gcttgtcaac tactctggat gatggatgga aagaactcca 721 gtggatccaa gcgttataat cgcaaacgtg aactttccta ccccaaaaat gaaagtttta 781 acaaccagtc ccgtcgctcc agttcacaga aaagcaagac ttttaacaag atgcctcctc 841 aaaggggcgg cggcagcagc aaactcttta gctcttcttt taatggtgga agacgagatg 901 aggtagcaga ggctcaacgg gcagagttta gccctgccca gttctctggt cctaagaaga 961 tcaacctgaa ccacttgttg aatttcactt ttgaaccccg tggccagacg ggtcactttg 1021 aaggcagtgg acatggtagc tggggaaaga ggaacaagtg gggacataag ccttttaaca 1081 aggaactctt tttacaggcc aactgccaat ttgtggtgtc tgaagaccaa gactacacag 1141 ctcattttgc tgatcctgat acattagtta actgggactt tgtggaacaa gtgcgcattt 1201 gtagccatga agtgccatct tgcccaatat gcctctatcc acctactgca gccaagataa 1261 cccgttgtgg acacatcttc tgctgggcat gcatcctgca ctatctttca ctgagtgaga 1321 agacgtggag taaatgtccc atctgttaca gttctgtgca taagaaggat ctcaagagtg 1381 ttgttgccac agagtcacat cagtatgttg ttggtgatac cattacgatg cagctgatga 1441 agagggagaa aggggtgttg gtggctttgc ccaaatccaa atggatgaat gtagaccatc 1501 ccattcatct aggagatgaa cagcacagcc agtactccaa gttgctgctg gcctctaagg 1561 agcaggtgct gcaccgggta gttctggagg agaaagtagc actagagcag cagctggcag 1621 aggagaagca cactcccgag tcctgcttta ttgaggcagc tatccaggag ctcaagactc 1681 gggaagaggc tctgtcggga ttggccggaa gcagaaggga ggtcactggt gttgtggctg 1741 ctctggaaca actggtgctg atggctccct tggcgaagga gtctgttttt caacccagga 1801 agggtgtgct ggagtatctg tctgccttcg atgaagaaac cacggaagtt tgttctctgg 1861 acactccttc tagacctctt gctctccctc tggtagaaga ggaggaagca gtgtctgaac 1921 cagagcctga ggggttgcca gaggcctgtg atgacttgga gttagcagat gacaatctta 1981 aagaggggac catttgcact gagtccagcc agcaggaacc catcaccaag tcaggcttca 2041 cacgcctcag cagctctcct tgttactact tttaccaagc ggaagatgga cagcatatgt 2101 tcctgcaccc tgtgaatgtg cgctgcctcg tgcgggagta cggcagcctg gagaggagcc 2161 ccgagaagat ctcagcaact gtggtggaga ttgctggcta ctccatgtct gaggatgttc 2221 gacagcgtca cagatatctc tctcacttgc cactcacctg tgagttcagc atctgtgaac 2281 tggctttgca acctcctgtg gtctctaagg aaaccctaga gatgttctca gatgacattg 2341 agaagaggaa acgtcagcgc caaaagaagg ctcgggagga acgccgccga gagcgcagga 2401 ttgagataga ggagaacaag aaacagggca agtacccaga agtccacatt cccctcgaga 2461 atctacagca gtttcctgcc ttcaattctt atacctgctc ctctgattct gctttgggtc 2521 ccaccagcac cgagggccat ggggccctct ccatttctcc tctcagcaga agtccaggtt 2581 cccatgcaga ctttctgctg acccctctgt cacccactgc cagtcagggc agtccctcat 2641 tctgcgttgg gagtctggaa gaagactctc ccttcccttc ctttgcccag atgctgaggg 2701 ttggaaaagc aaaagcagat gtgtggccca aaactgctcc aaagaaagat gagaacagct 2761 tagttcctcc tgcccctgtg gacagcgacg gggagagtga taattcagac cgtgttcctg 2821 tgcccagttt tcaaaattcc ttcagccaag ctattgaagc agccttcatg aaactggaca 2881 caccagctac ttcagatccc ctctctgaag agaaaggagg aaagaaaaga aaaaaacaga 2941 aacagaagct cctgttcagc acctcagtcg tccacaccaa gtgacactac tggcccaggc 3001 taccttctcc atctggtttt tgtttttgtt tttttttccc ccatgctttt gtttggctgc 3061 tgtaattttt aagtatttga gtttgaacag attagctctg gggggagggg gtttccacaa 3121 tgtgaggggg aaccaagaaa attttaaata cagtgtattt tccagcttcc tgtctttaca 3181 ccaaaataaa gtattgacac aagag // LOCUS D87452 4461 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0263 gene, complete cds. ACCESSION D87452 NID g1665792 KEYWORDS KIAA0263. SOURCE Homo sapiens male bone marrow myeloblast cell_line:KG-1 cDNA to mRNA, clone_lib:pBluescript II SK clone:HA7068. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4461) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (27-AUG-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes.VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from human cell line KG-1 and brain JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohira,M., Kawarabayasi,Y., Ohara,O., Tanaka,A., Kotani,H., Miyajima,N. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain JOURNAL DNA Res. 3 (5), 321-329 (1996) MEDLINE 97191544 FEATURES Location/Qualifiers source 1..4461 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /clone="HA7068" /clone_lib="pBluescript II SK" /sex="male" /tissue_type="bone marrow" gene 309..1634 /gene="KIAA0263" CDS 309..1634 /gene="KIAA0263" /note="Similar to S.cerevisiae YD9335.03c protein (S54640)" /codon_start=1 /db_xref="PID:d1014083" /db_xref="PID:g1665793" /translation="MCVCQTMEVGQYGKNASRAGDRGVLLEPFIHQVGGHSSMMRYDD HTVCKPLISREQRFYESLPPEMKEFTPEYKGVVSVCFEGDSDGYINLVAYPYVESETV EQDDTTEREQPRRKHSRRSLHRSGSGSDHKEEKASLSLETSESSQEAKSPKVELHSHS EVPFQMLDGNSGLSSEKISHNPWSLRCHKQQLSRMRSESKDRKLYKFLLLENVVHHFK YPCVLDLKMGTRQHGDDASAEKAARQMRKCEQSTSATLGVRVCGMQVYQLDTGHYLCR NKYYGRGLSIEGFRNALYQYLHNGLDLRRDLFEPILSKLRGLKAVLERQASYRFYSSS LLVIYDGKECRAESCLDRRSEMRLKHLDMVLPEVASSCGPSTSPSNTSPEAGPSSQPK VDVRMIDFAHSTFKGFRDDPTVHDGPDRGYVFGLENLISIMEQMRDENQ" BASE COUNT 873 a 1248 c 1274 g 1066 t ORIGIN 1 cttgttgttg atccgtaccc agtgggcagc gccgggagct ggaccaagcg gccggtgaga 61 ggccgctgta gcggtgctca gccacctgtg ctgcctgcca gggggcgggc cgaaacctgg 121 aggcccgggg ggcccagctc ccgtagggag ccgtgggcgc tcggtgcccg ggccgggcag 181 gacagaataa taagctgaat agaatctgac cattggcttt cacctggcca ggaccttcta 241 tgtagctctc cttttgtggc ccatgtgctg catcctctgc cctcagtgtg caactggccc 301 ccaacgcaat gtgtgtttgt caaaccatgg aagtggggca gtatggcaag aatgcaagtc 361 gggctggaga ccggggagtc ctcctggagc ccttcatcca ccaagtaggc ggacacagca 421 gcatgatgcg ttacgacgat cacactgtgt gcaagcccct catctcccgg gaacagcgct 481 tttacgagtc cctccctccc gaaatgaagg agttcacccc tgaatacaaa ggcgtggtat 541 ctgtctgttt tgagggggac agtgatggtt acatcaactt agtggcctat ccttatgtgg 601 aaagtgagac tgtggaacag gatgacacaa cagaacggga gcaacctcgg cgcaaacact 661 cccgccggag cctgcaccgg tcaggcagtg gcagtgacca caaggaggag aaagccagcc 721 tgtcccttga gacctctgag agctcacagg aggcaaagag tccgaaggtg gagctgcaca 781 gccactcaga ggtccctttc cagatgctag atggcaacag tggcttgagt tctgagaaga 841 tcagccacaa cccctggagc ctgcgttgtc acaagcagca gctgagccgc atgcgctccg 901 agtccaagga ccgaaagctc tacaagttcc tcctgcttga gaacgtggtg caccacttca 961 agtacccctg cgtgttggac ctgaagatgg gcacgcggca gcatggcgat gacgcgtcag 1021 ctgagaaggc agcccggcag atgcggaaat gcgagcagag cacatcagcc acgctgggcg 1081 tcagggtctg cggcatgcag gtgtaccagc tggacacagg gcattacctc tgcaggaaca 1141 agtactatgg ccgtgggctc tccattgaag gcttccgcaa tgccctctat caatatctgc 1201 acaatggcct ggacctgcga cgtgacctgt ttgagcctat cctgagcaaa ctgcggggcc 1261 tgaaagctgt gctggagcgg caggcctctt accgcttcta ctccagttcc ctgcttgtca 1321 tctatgatgg caaggagtgc cgggctgagt cctgcctgga ccgccggtct gagatgcgtc 1381 tcaagcacct ggacatggtg ctccctgagg tggcgtcatc ctgtggcccc agcaccagcc 1441 ccagcaacac cagccccgag gcgggtccct cctctcagcc caaggtggat gtccgcatga 1501 ttgactttgc acacagcaca ttcaagggct tccgggatga ccccaccgtg catgatgggc 1561 cagacagagg ctacgtgttt ggcctggaga acctcatcag catcatggaa cagatgcggg 1621 acgagaacca gtaggccctg ttctgggccc ccagaacccc ttcctctcca ctgcaggcag 1681 ggaccattgt tctgaacttg ccgtgaggac acacagactt gcttttaaag ggttatattt 1741 ctctttggtg taaactaaaa gaaatgtttt tagctgtagc ctggaatcca tatatataaa 1801 gtgaaggagg gcagaccaca cgccctctca gccaggctcc tcagctttgt ggctctgact 1861 ggtgtgtcca ggctgcctta ggaaggaaga ggtgcccctg gtgggcttgg cagcagggac 1921 agggtgccct tggacattgg tttctcttgt ctagatcttt gagatctgtg gctgcagggc 1981 cctgctgatt gtaaggtaaa gccctgggct ggtgcagggc ccctccacgc ccactcttcc 2041 cttgttcccc agaagtagag ggctctgggt gcccatttct tgggggcttt ccagtcttat 2101 gctgtgggtg tcagctagct ctttaatagg tgccctcagg gcaccacagg gctgactgca 2161 caaagctgga cccatccttc ggtctgacct tagcatgggg ctagattaat gaagctgggc 2221 tgaggccaac ttatggcaga gggcggcgcc tgggttcccc aggcacctgt tggcacgtga 2281 caggttggca cctgtcctat tcctgaaaca gcctctctca ccaagttccc ttgcctaaga 2341 aggccactcc ctcccacccc actgaagtgg gggatagtcg gtgtcctagc aggcctcagg 2401 gcctctggtg gctctggccc agacagtatt tgcagttctt gtgctatggg tgggagtctt 2461 cttcctcaag tttcggcagc tgtgctgctg ctggatgggc tgctcctccc agggctcaag 2521 ggctgtggtc cgctcagggt ctcatttccc caggccaagt tcaaggcagc agccctttgt 2581 gaggcgctct tggccctggg cctggaggga gaactttaag cttttttgct cacagggacg 2641 tggtatgggc cctgggtgca ggtgcccaca ttctgctaat gagagctttg tctgatcagt 2701 cctgggtcca tcagtttgtc catgtgtccg gctgccagcc cgtcccttgg gatccttccc 2761 ctggggtgta gccttgttca ttagtatata ctcattcctt catgctttcc tcagcagaac 2821 acttccactt ctgaggtgag cttttgcccc gtgcccttcc tccacaggtg ttgccttttt 2881 ataaagacct gatagcagaa taaattggtg tttccctgtt gacccagcac catttctgtg 2941 ggcctagaat atggccctca acccttagag tggggcagtg agggcttgag gagtgaccct 3001 tcctttctca tggttttagt cattttggct gccagccctt aatggcacag atctgctgct 3061 tctaacagat ggccaggagg tgacaccgat ttcagccatt gccaaggtta gcaccctctc 3121 ctttgagcct agggccacac tgttcattgt cactttaggc aagtgcctgt ttggctttaa 3181 aggtaagcct gccagctgtg agaagccttg gtaactgatg gactcatttc ctggtcctta 3241 aagatgcagc ctcttaaggg ctccttgatg gatgccatct ctcctagccc ccagccctgg 3301 tgccactggt gggcaggttc ccattctttg gggctgggag ggacagcttg cctgtttctg 3361 gtcacaaatt acagtcttct ctcctgtacc attctgtggc ttcagccatg ggggcagtag 3421 cccttcatta gtgtagatag tcattccctg gtagggtgga gggtaagaca tagggtctgg 3481 aactgtttgg gaccttttgg ggatgtcctg tgcctcccag attcctagat tctgggagga 3541 gaggctgccg cattctgctg ctcctcacag cgagcaaagc tgcacccact tacattcagt 3601 attttcctgg cactacaaag agtgggaagg cctgggattt gctgctgctc ccttagagca 3661 gggcccctct tttcagcact ttggacacct ggagacccag ccctgttatt taatggtagt 3721 gggcaagtgt gtgtgcatac tgtctgccac tgctttctcc ctgccccatg ccagagagcc 3781 ctgtccctgc caggcccagc cttcttagcc ccaacttggg aacaaagtgc aacatgggat 3841 catgggttgg ggtgctcagg tgagccctct ctatagtgct tccctgggcc aagctgacac 3901 cagcccctga gggtggggtg ggacgggtgg tgcttaaaag aggaagggga ccagtgtagc 3961 aacttgccag ggaccccacc cctccctctc tgggcctgtg cagtgagcat ggggattccc 4021 atcaaggggc ctggcacctg tgctagttac gtagccgctg ctcacgcgct cactcctgac 4081 cacatgcacg ttccctagat gcagactgct ttgaacttta aagctgtaca atttggttat 4141 gtttgtgctg acttaaaata tattttaatg aggaaaaaat aatggagaac cctgggaagg 4201 acctggttct tttgcttctc ggggaactgt aagccctcgc gttctgggaa tcgctctctg 4261 ctgctctttc ctggaagcta agcctgtctc caccgcccga ggcctgcgcc ggtggctccc 4321 gccgcagttg cgtttgcttt ggaccttgcg tgcgggggag ggggtgctcg gtccgagccc 4381 gctcctttct gtacacctag cgctgcccgc cccgcttgtg tctgaggtcg tgtatgtcaa 4441 aaataaagcc gctagaaacg g // LOCUS D87455 5585 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0266 gene, complete cds. ACCESSION D87455 NID g1665798 KEYWORDS KIAA0266. SOURCE Homo sapiens male bone marrow myeloblast cell_line:KG-1 cDNA to mRNA, clone_lib:pBluescript II SK clone:HA2755. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5585) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (27-AUG-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes.VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from human cell line KG-1 and brain JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohira,M., Kawarabayasi,Y., Ohara,O., Tanaka,A., Kotani,H., Miyajima,N. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain JOURNAL DNA Res. 3 (5), 321-329 (1996) MEDLINE 97191544 FEATURES Location/Qualifiers source 1..5585 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /clone="HA2755" /clone_lib="pBluescript II SK" /sex="male" /tissue_type="bone marrow" gene 734..3034 /gene="KIAA0266" CDS 734..3034 /gene="KIAA0266" /note="Similar to S.cerevisiae hypothetical protein 5 (S49634)" /codon_start=1 /db_xref="PID:d1014086" /db_xref="PID:g1665799" /translation="MNVNQVAENLALSHQEELVDLPKNYPLSENEDEGDSDGERKHQK LLEAIISLDGKNRRKLAERSEASLKVSEFSVSSEGSGEKLGLADLLEPVKTSSSLATV KKQLNRVKSKKVVELPLNKEKIEQIHREVAFSKTSQVLSKWDPIILKNQQAEQLVFPL GKEQPAIAPIEHALSGWKARTPLEQEIFNLLHKNKQPVTDPLLTPMEKASLQAMSLEE AKMHRAELQRARALQSYYEAKARKEKKIKSKKYHKVVKKGKAKKALKEFEQLQKVNPT VALEEMEKIENARMMERMSLKHQNSGKWAKSKAIMAKYDLEARQAMQEQLAKNKELTQ KLQVASESEEEEGGTEVEELLVPHVANEVQMNVDGPNPWMFRSCTSDTKEAATQEDPE QVPELAAHEVSASEAEERPVAEEEILLREFEERQSLRKRSELNQDAEPASSQETKDSS SQEVLSELRALSQKLKEKHQSRKQKASSEGTVPQVQREEPAPEEAEPLLLQRSERVQT LEELEELGKEDCFQNKELPRPVLEGQQSERTPNNRPDAPKEKKEKEQLINLQNFLTTQ SPSVRSLAVPTIIEELEDEEERDQRQMIKEAFAGDDVIRDFLKEKREAVEASKPKDVD LTLPGWGEWGGVGLKPSAKKRRQFLIKAPEGPPRKDKNLPNVIISEKRNIHAAAHQVQ VLPYPFTHHRQFERTIQTPIGSTWNTQRAFQKLTTPKVVTKPGHIIKPIKAEDVGYQS SSRSDLPVIQRNPKRITTRHNKEEKL" BASE COUNT 1770 a 1078 c 1275 g 1462 t ORIGIN 1 gcctttgcta aattgctgaa taagaagatg gttgagtcac ctccttcgct taaacttgtc 61 ctcattggag gttgtcgtaa caaagatgat gaacttaggg taaaccaact gagaaggctg 121 tctgaggatt taggagttca agaatatgtg gaatttaaaa taaacattcc atttgatgaa 181 ttaaagaatt atttgtctga agcaacaatt ggtctgcata ccatgtggaa cgagcatttt 241 gggattggag ttgtggagtg tatggcagct ggcacaatta tccttgcaca caattcgggg 301 ggcccaaagc ttgacattgt ggttcctcac gaaggagata taactggctt tctggctgag 361 agtgaagaag actatgctga aactatcgct cacattcttt ccatgtctgc agaaaagaga 421 ctccaaatca gaaaaagtgc tcgtgcatct gtaagcagat tctctgatca ggaatttgaa 481 gtgacattcc tatcatctgt ggaaaagtta tttaagtaat gccatatctg taaaattaaa 541 gatattttat ataaactggt taaacacctt catatgtaaa tatttttcta aattcaatct 601 catttgtcaa atcattttac tttagaaaac agacaaaatt tccttttaga ataaaaggaa 661 gtgttgaaaa gaaaatggat gactagcctt cggcttccat tcttggtata catgagagag 721 gctggctgct gagatgaatg tgaaccaggt tgcagagaat ctggctttga gccaccagga 781 agaactagtg gatttgccaa aaaactaccc cttgagtgaa aatgaagatg agggggacag 841 tgatggagag agaaagcatc aaaagcttct ggaagcaatc atttcccttg atggaaagaa 901 taggcggaaa ttggctgaga ggtctgaggc tagtctgaaa gtgtcagagt tcagtgtcag 961 ttctgaagga tcaggagaaa agctgggcct tgcagatctg cttgagcccg ttaaaacttc 1021 atcttctttg gccactgtaa aaaagcaact gaatagagtc aaatcaaaga aggtggtgga 1081 gttacctctt aacaaagaaa aaattgaaca gatccacaga gaagtagcat tcagtaaaac 1141 ctcacaggtc ctctccaaat gggaccctat catcctgaag aaccagcagg cagagcagct 1201 ggtttttccc ctggggaagg agcagccagc cattgctccc attgaacatg cgctcagtgg 1261 ctggaaggca agaactcccc tggagcagga aatttttaac ctcctccata agaacaagca 1321 gccagtgaca gatcctttac tgactcccat ggaaaaggcc tctctccaag ccatgagcct 1381 ggaagaggca aagatgcacc gagcagagct tcagagggct cgggctctgc agtcctacta 1441 tgaggccaag gctcgaaaag agaagaaaat caaaagtaaa aagtatcaca aagtcgtgaa 1501 gaaaggaaag gccaagaaag ccttaaaaga gtttgagcag ctacagaagg ttaatccaac 1561 tgtggcactg gaagaaatgg aaaaaattga aaatgccaga atgatggaaa gaatgagcct 1621 taagcaccaa aacagtggga aatgggccaa gtcaaaggca attatggcca aatatgacct 1681 ggaggctcgc caagctatgc aggaacagtt ggccaagaac aaagaactga cacagaaact 1741 ccaggtagcc tctgagagtg aggaagagga gggaggcaca gaagtggaag aactccttgt 1801 ccctcatgta gcgaatgaag tgcagatgaa tgtggacgga ccgaatccct ggatgttcag 1861 gagctgcacc agtgacacca aagaggctgc aacacaggag gaccctgagc aagtgccaga 1921 gcttgcagct catgaggttt ctgcaagtga ggcagaagaa agaccagtgg cagaggaaga 1981 aattttgttg agagaatttg aggaaaggca atcccttaga aaaagatctg agctcaacca 2041 ggatgctgag ccagcaagca gtcaagaaac aaaagattct agcagccagg aggtgctgtc 2101 cgaattgagg gcactatctc agaaattgaa ggaaaaacat cagtccagga agcaaaaagc 2161 aagttcagag gggactgttc cccaggtcca gagagaggaa cctgccccag aagaagcgga 2221 acccctattg ctacagaggt cagagagagt acaaactctg gaagagctag aagagctggg 2281 aaaagaagat tgttttcaaa ataaggagct tcccagacct gtgttagaag gacagcagtc 2341 agagaggacc ccaaataatc ggcctgatgc ccctaaggag aagaaagaga aggagcaact 2401 gatcaaccta cagaacttcc tgaccacaca gtctccttcc gtgaggtctt tggcagttcc 2461 cacaataata gaggagctgg aagatgaaga ggagagagac caaaggcaga tgataaagga 2521 agcttttgct ggggatgatg tcatcagaga tttcttgaaa gagaagaggg aagctgtgga 2581 ggcgagtaag ccaaaggacg tggacctgac actacctggc tggggcgagt ggggtggtgt 2641 gggcctaaag cccagtgcca agaaaagacg ccagtttctc attaaagccc ctgagggtcc 2701 tccaagaaaa gataagaatt tgccaaatgt gattatcagt gagaagcgca acatccacgc 2761 agcagctcat caggtacaag tgcttccata tccatttacc caccatcggc aatttgaaag 2821 gaccatccag acccctatag gatccacatg gaacacccag agggctttcc aaaagctgac 2881 tactcccaag gtcgtcacca agccaggcca tatcattaag cccataaaag cagaggatgt 2941 gggctaccag tcttcctcaa ggtcagacct gcctgtcata cagaggaatc caaaacgaat 3001 caccacacgt cacaataaag aagaaaaact gtaggttgtg tagctggaga agtgacagtc 3061 aggggccctg attccacttc ctttggtcca gttttactct gctacagggt ggattccaaa 3121 actggctcag cacattgcat gtagttgagc cacatttttt aaaaaaagaa aatggatgac 3181 cattaattga ctagcatttt agaattgatc agacattaga acacagaaaa attctagtac 3241 atttaaattc taaacaatac agtggatgac ccttttgaat atacctaatg atttccttaa 3301 aaaagaaatt ttaaacagac ttgtttaatc gtgttctcaa agcatacagt caagaggtgg 3361 gactgactga tgctttatag gtgtgtgtag ggtggtagag gccaaggtgc tgccagcaat 3421 cctttccata ctaggtactg gtgaaaattg tttttgttta tgctgtcagc acatttgtgt 3481 gggtctctca ttgtccctta acagtgccgc atctcagcct ggaagtcagc tttaagtcat 3541 tcaagagaac ctcaggctgt ttttctgaca gtgatgatat gatatacaga tacatccaca 3601 gggtatctat taccagcata atgcattgta agatggcaag gtggcatttt gaaagagcgc 3661 tgggcaagca gttagcacat ctgggcctac cttcagcttc ttcatttaca aacttctgac 3721 cttttgacac taaagctgct cctttatctt tctgagtctc agattcttca tctgtaatct 3781 ggagttgtta attccagtcc ttactacctt ttagagttgg aatgagatgg aaagtagatg 3841 aaactacttt gaaaattacg acagttaagg gctgggtgcg gtgtctcaca cctgtaatcc 3901 cagcactttg ggaggccgag atgggtggat catgaggtca gcagttgaga ccagccaaag 3961 tgctgggtgc ggtgtctcac acctgtaatc ccagcacttt gggaggccga gatgggtgga 4021 tcatgaggtc agcagttgag accagcctga acaacatggt gaaaccctgt ctgtactaaa 4081 aatacaaaaa aattagctgg gcctggtggc aggcacctgt aatcccagct acttgggaga 4141 ctgaggcagg agaattgctt gaaactggaa ggcagaggtt gcagtgagcc aagattgtgc 4201 cactgcactc tagcctgggc aataagcaaa actccatctc aaaaaagaaa aaaaggaaaa 4261 agaaaattat gagagttact taaaggtaac atcacatact aaatgtcttc tataatccta 4321 tatttattaa tgcattacaa ctctgtagat tgttagttac taggccagta gctaggaatt 4381 ggtataaatt taatgcacct tctatcctga ataactagca tggaaaagtg aatatatgtg 4441 tgagcagata tggctataaa gacctatagc ttttgcactt tatgcatata taatcaatcc 4501 tttctagttc agtgaattga ccccatccac aggctgattc atctttgtgt taaggggcaa 4561 atgaaacggt atattatttc tttgcagtct cctctcagtc attcatcaat gtggccagct 4621 tatctactcc caattatgtt gttgatacat ctccaagcca tctgtcatca gatcaaaaag 4681 cagcaaacag agggtcagtc acaggatgtt ctgacacacc attgtaactt tttgttagag 4741 atgatcccat ttagaaaaag actggtagaa attggagtga aaggaaccct acagattagc 4801 ccagttctct cttattttca gctttacaga caagaacaat ttaaatctaa agaatttagt 4861 agattccttc agtgtcacaa agctgtttca tgaaagaatc aagattataa cctggatatt 4921 ctgactcctg gcccagtgct ttttcttact ttgtagctac actttgaagt aagattcaaa 4981 ctgttatcca ctcaattgcc ttattcctga ggatgtagtc aaggaagaaa aagttttctg 5041 gaattccgta aattatattt taagcttatt tcttcaaaat tattttcata tatcacagat 5101 atatcattgg aagatataat ttgcatatat gttcattatc agtgttccta atttggtatt 5161 acatgtattc tatttttttc tgaatgatag catgaaaagt gtcaaagtgg tttgtccgct 5221 agcgtctgtc tgcagaactt tcaggatgac tattaattcc tctcagatgt catttttgag 5281 tggtccaagc ctgctgtttt gaacccacag cagtggagat ttgtattctt atttacagtt 5341 gtgtactata aagtgtgtgt tacataggtt ttgtgtaata attatttgta aatattattt 5401 agatttgtat ttagacatga tttatatcta atatagatac aaagtctgtg tctaaatatt 5461 atttaaagaa gtgatttttc attctcttgg attctttcca gtgtggtgcc ttttatatgc 5521 ctcacatagt ctccttgttc tcctactaat attcccaagc tccatatgcc aattaaagaa 5581 gaaac // LOCUS D87457 2109 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0281 gene, complete cds. ACCESSION D87457 NID g1665800 KEYWORDS KIAA0281. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pSPORT 1 clone:HA6725. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2109) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (27-AUG-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes.VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from human cell line KG-1 and brain JOURNAL Unpublished (1996) FEATURES Location/Qualifiers source 1..2109 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HA6725" /clone_lib="pSPORT 1" /sex="male" /tissue_type="brain" gene 174..917 /gene="KIAA0281" CDS 174..917 /gene="KIAA0281" /codon_start=1 /db_xref="PID:d1014087" /db_xref="PID:g1665801" /translation="MQVVKEQVMRALTTKPSSLDQFKSKLQNLSYTEILKIRQSERMN QEDFQSRPILELKEKIQPEILELIKQQRLNRLVEGTCFRKLNARRRQDKFWYCRLSPN HKVLHYGDLEESPQGEVPHDSLQDKLPVADIKAVVTGKDCPHMKEKGALKQNKEVLEL AFSILYDSNCQLNFIAPDKHEYCIWTDGLNALLGKDMMSDLTRNDLDTLLSMEIKLRL LDLENIQIPDAPPPIPKEPSNYDFVYDCN" BASE COUNT 547 a 578 c 459 g 525 t ORIGIN 1 ctgttcacat ggctcaactg gaaacctgtt tcatgaacaa gcttactcag gaaccatctg 61 gtggtattcc agcacattgt tcttcagggg gacgactcta agtcgctttg tggtggcagc 121 agcttagaat cagtatttgt ggttgggaaa gatggactta cgggagcttg gtaatgcagg 181 tggtgaagga gcaggttatg agagcactta caaccaagcc tagctccctg gaccagttca 241 agagcaaact gcagaacctg agctacactg agatcctgaa aatccgccag tccgagagga 301 tgaaccagga agatttccag tcccgcccga ttttggaact aaaggagaag attcagccag 361 aaatcttaga gctgatcaaa cagcaacgcc tgaaccgcct tgtggaaggg acctgcttta 421 ggaaactcaa tgcccggcgg aggcaagaca agttttggta ttgtcggctt tcgccaaatc 481 acaaagtcct gcattacgga gacttagaag agagtcctca gggagaagtg ccccacgatt 541 ccttgcagga caaactgccg gtggcagata tcaaagccgt ggtgacggga aaggactgcc 601 ctcatatgaa agagaaaggt gcccttaaac aaaacaagga ggtgcttgaa ctcgctttct 661 ccatcttgta tgactcaaac tgccaactga acttcatcgc tcctgacaag catgagtact 721 gtatctggac agatggactg aatgcgctac tcgggaagga catgatgagc gacctgacgc 781 ggaatgacct ggacaccctg ctcagcatgg aaatcaagct ccgcctcctg gacctggaaa 841 acatccagat ccctgacgca cctccgccga ttcccaagga gcccagcaac tatgacttcg 901 tctatgactg taactgaagt ggccgggccc agacatgccc cttccaaaac tggaacacct 961 agctaacagg agagaggaat gaaaacacac ccacgccttg gaaccgtcct ttggtaaagg 1021 gaagctgtgg gtccacattc ccttcagcat cacctctagc cctggcaact ttcagcccct 1081 agctggcatc ttgctcaccg ccctgattct gttcctcggc tccactgctt caggtcactt 1141 cccatggctg cagtccactg gtgggacaag agcaaagccc actgccagta agaaggccaa 1201 agggcccttc catcctagcc ctctgcaggc atgcccttcc ttcccttggg caggaaagcc 1261 agcagcccca gactgcccaa aaacttgccc accagaccaa gggcagtgcc ccaaggcccc 1321 tgtctggagg aaatggccta gctatttgat gagaagacca aaccccacat cctcctttcc 1381 cctctctcta gaatcatctc gcaccaccag ttacacttga attaagatct gcgctcaaat 1441 ctcctcccac ctctctccct gcttttgcct tgctctgttc ctctttggtc ccaagagcag 1501 cagccgcagc ctcctcgtga tcctccctag cataaatttc ccaaacagtc cacaggtccc 1561 atgcccactt tgcgtctgca ctgtgatcgt gacaaatctt ccctcctcac cagctagtct 1621 ggggtttcct ctccctgccc caggccagaa ctgccttctt catttccacc cacgctccca 1681 gcctcttagc tgaaagcaca aatggtgaaa tcagtagtct cgctccatct ctaatagact 1741 aaacctaaat gcctctagga cggactgttg ctatccaagc gtttggtgtt accttctcct 1801 gggaggtcct gctgcaactc aagttccaca ggatggtcaa gctgtcagac atccaagttt 1861 acatcattgt aattattact ggtatttaca atttgcaaga gttttgggtt agtttttttt 1921 tttttttttt tgctttgttt ttgtacaaaa gagtctaaca ttttttgcca aacagatata 1981 tatttaatga aaagaagaga tacataaatg tgtgaatttc cagttttttt tttaattatt 2041 ttaatcccaa acatcttcct gaaaataaca ttcccttaaa catgctgtgg aataaaatgg 2101 attgtgatg // LOCUS D87459 2625 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0269 gene, complete cds. ACCESSION D87459 NID g1665804 KEYWORDS KIAA0269. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pSPORT 1 clone:HA6751. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2625) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (27-AUG-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes.VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from human cell line KG-1 and brain JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohira,M., Kawarabayasi,Y., Ohara,O., Tanaka,A., Kotani,H., Miyajima,N. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain JOURNAL DNA Res. 3 (5), 321-329 (1996) MEDLINE 97191544 FEATURES Location/Qualifiers source 1..2625 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HA6751" /clone_lib="pSPORT 1" /sex="male" /tissue_type="brain" gene 243..1922 /gene="KIAA0269" CDS 243..1922 /gene="KIAA0269" /note="Similar to Volbox carteri extensin (S22697)" /codon_start=1 /db_xref="PID:d1014089" /db_xref="PID:g1665805" /translation="MPLVKRNIDPRHLCHTALPRGIKNELECVTNISLANIIRQLSSL SKYAEDIFGELFNEAHSFSFRVNSLQERVDRLSVSVTQLDPKEEELSLQDITMRKAFR SSTIQDQQLFDRKTLPIPLQETYDVCEQPPPLNILTPYRDDGKEGLKFYTNPSYFFDL WKEKMLQDTEDKRKEKRKQKQKNLDRPHEPEKVPRAPHDRRREWQKLAQGPELAEDDA NLLHKHIEVANGPASHFETRPQTYVDHMDGSYSLSALPFSQMSELLTRAEERVLVRPH EPPPPPPMHGAGDAKPIPTCISSATGLIENRPQSPATGRTPVFVSPTPPPPPPPLPSA LSTSSLRASMTSTPPPPVPPPPPPPATALQAPAVPPPPAPLQIAPGVLHPAPPPIAPP LVQPSPPVARAAPVCETVPVHPLPQGEVQGLPPPPPPPPLPPPGIRPSSPVTVTALAH PPSGLHPTPSTAPGPHVPLMPPSPPSQVIPASEPKRHPSTLPVISDARSVLLEAIRKG IQLRKVEEQREQEAKHERIENDVATILSRRIAVEYSDSEDDSEFDEVDWLE" BASE COUNT 776 a 608 c 485 g 756 t ORIGIN 1 cttctcttgc acttgcggat gatgaactgg aataacgatg aaagaaagca catccgatct 61 caacattcac gtcctgccct ataaccgatt aattaattga tccccagcta gactagtgtt 121 ggagaaatca gcatgttaaa acaactgttg atgatagctg ttggagtaaa gttgcagtgg 181 aagctatggc tgcaaaatcg ttaaaatctt caaggtgaac tggcacaaag gttaatctca 241 agatgccgct agtgaaaaga aacatcgatc ctaggcactt gtgccacaca gcactgccta 301 gaggcattaa gaatgaactg gaatgtgtaa ccaatatttc cttggcaaat ataattagac 361 aactaagtag cctaagtaaa tatgctgaag atatatttgg agaattattc aatgaagcac 421 atagtttttc cttcagagtc aactcattgc aagaacgtgt ggaccgttta tctgttagtg 481 ttacacagct tgatccaaag gaagaagaat tgtctttgca agatataaca atgaggaaag 541 ctttccgaag ttctacaatt caagaccagc agcttttcga tcgcaagact ttgcctattc 601 cattacagga gacgtacgat gtttgtgaac agcctccacc tctcaatata ctcactcctt 661 atagagatga tggtaaagaa ggtctgaagt tttataccaa tccttcgtat ttctttgatc 721 tatggaaaga aaaaatgttg caagatacag aggataagag gaaggaaaag aggaagcaga 781 agcagaaaaa tctagatcgt cctcatgaac cagaaaaagt gccaagagca cctcatgaca 841 ggcggcgaga atggcagaag ctggcccaag gtccagagct ggctgaagat gatgctaatc 901 tcttacataa gcatattgaa gttgctaatg gcccagcctc tcattttgaa acaagacctc 961 agacatacgt ggatcatatg gatggatctt actcactttc tgccttgcca tttagtcaga 1021 tgagtgagct tctgactaga gctgaggaaa gggtattagt cagaccacat gaaccacctc 1081 cacctccacc aatgcatgga gcaggagatg caaaaccgat acccacctgt atcagttctg 1141 ctacaggttt gatagaaaat cgccctcagt caccagctac aggcagaaca cctgtgtttg 1201 tgagccccac tcccccacct cctccaccac ctcttccatc tgccttgtca acttcctcat 1261 taagagcttc aatgacttca actcctcccc ctccagtacc tcccccacct ccacctccag 1321 ccactgcttt gcaagctcca gcagtaccac cacctccagc tcctcttcag attgcccctg 1381 gagttcttca cccagctcct cctccaattg cacctcctct agtacagccc tctccaccag 1441 tagctagagc tgccccagta tgtgagactg taccagttca tccactccca caaggtgaag 1501 ttcaggggct gcctccaccc ccaccaccgc ctcctctgcc tccacctggc attcgaccat 1561 catcacctgt cacagttaca gctcttgctc atcctccctc tgggctacat ccaactccat 1621 ctactgcccc aggtccccat gttccattaa tgcctccatc tcctccatca caagttatac 1681 ctgcttctga gccaaagcgc catccatcaa ccctacctgt aatcagtgat gccaggagtg 1741 tgctactgga agcaatacga aaaggtattc agctacgcaa agtagaagag cagcgtgaac 1801 aggaagctaa gcatgaacgc attgaaaacg atgttgccac catcctgtct cgccgtattg 1861 ctgttgaata tagtgattcg gaagatgatt cagaatttga tgaagtagat tggttggagt 1921 aagaaaaatg cattgataaa tattacaaaa ctgaatgcaa atgtcctttg tggtgcttgt 1981 tccttgaaaa tgtttggtca ttctagtgtt ttgctttctt ttccttataa taaatgaccc 2041 ttttcctcca taacttttga tttctaagga aaatattagc atacatttca aactaaatgt 2101 tttacagtgg cttatctttt ttttccccct gaaaagacta atttggtcaa ataaaccact 2161 aagtattaag catggacagc tgttgttaga gtagcagatt cagttttttg atatatctta 2221 attgtgtact ttgtgaattt taatttaaag aaagcaactg aaattgaaat cttgagggca 2281 gctgtatcta ctaatgagcc ttattccatt tcctgatgtt ttaaaagaag aaacactgcc 2341 ttgattatac gaatacactc agaaagtaca tttagcttgt agtgttgaat tctcttaaag 2401 gaatgcttga attttttcat tattgtttta ttgtttttat atacttgcct tatttgaatg 2461 tttagcagta tccccttccc acttatatat tgtgtgatat gattttgctt gcctatagga 2521 gttaaaaact tttccatgtg aaatactctg acttaaacat acatgtaact tacataactg 2581 ttaagaataa cagtctgatt taataaatgg ttcattttaa aagtt // LOCUS D87461 3542 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0271 gene, complete cds. ACCESSION D87461 NID g1944417 KEYWORDS KIAA0271. SOURCE Homo sapiens male brain myloblast cell_line:KG-1 cDNA to mRNA, clone_lib:pSPORT 1 clone:HA6752. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3542) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (27-AUG-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes.VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from human cell line KG-1 and brain JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohira,M., Kawarabayasi,Y., Ohara,O., Tanaka,A., Kotani,H., Miyajima,N. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain JOURNAL DNA Res. 3 (5), 321-329 (1996) MEDLINE 97191544 FEATURES Location/Qualifiers source 1..3542 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myloblast" /clone="HA6752" /clone_lib="pSPORT 1" /sex="male" /tissue_type="brain" gene 177..758 /gene="KIAA0271" CDS 177..758 /gene="KIAA0271" /note="similar to human transforming protein bcl-2 (A24428)" /codon_start=1 /db_xref="PID:d1020443" /db_xref="PID:g1944418" /translation="MATPASAPDTRALVADFVGYKLRQKGYVCGAGPGEGPAADPLHQ AMRAAGDEFETRFRRTFSDLAAQLHVTPGSAQQRFTQVSDELFQGGPNWGRLVAFFVF GAALCAESVNKEMEPLVGQVQEWMVAYLETRLADWIHSSGGWAEFTALYGDGALEEAR RLREGNWASVRTVLTGAVALGALVTVGAFFASK" BASE COUNT 804 a 817 c 1030 g 891 t ORIGIN 1 cccacgcgtc cgctccctct ctccctccct cccagctcct gcaccaggaa acggcccgga 61 tcccggcagc ggcctgaccc ggctccacgc tggccaggag gatgaaaggc cccagctggg 121 ggctccttgc caccagtgct gtgtcttaag agctgccatc ccggctggcc gcccggatgg 181 cgaccccagc ctcggcccca gacacacggg ctctggtggc agactttgta ggttataagc 241 tgaggcagaa gggttatgtc tgtggagctg gccccgggga gggcccagca gctgacccgc 301 tgcaccaagc catgcgggca gctggagatg agttcgagac ccgcttccgg cgcaccttct 361 ctgatctggc ggctcagctg catgtgaccc caggctcagc ccaacaacgc ttcacccagg 421 tctccgatga actttttcaa gggggcccca actggggccg ccttgtagcc ttctttgtct 481 ttggggctgc actgtgtgct gagagtgtca acaaggagat ggaaccactg gtgggacaag 541 tgcaggagtg gatggtggcc tacctggaga cgcggctggc tgactggatc cacagcagtg 601 ggggctgggc ggagttcaca gctctatacg gggacggggc cctggaggag gcgcggcgtc 661 tgcgggaggg gaactgggca tcagtgagga cagtgctgac gggggccgtg gcactggggg 721 ccctggtaac tgtaggggcc ttttttgcta gcaagtgaaa gtccagggcc aggtggggct 781 aggtgtggct gggggccagg agagcaggaa cagaacagag aaatgccctt ggaagaagtg 841 gagttggtgg atgggtgggc atggaacagg atgggcagag aaagggtagt gtgtgaggga 901 gctgagtagg ccaggtaggc gattggaaga gtgagcagga cacagagggg aggggaatgt 961 tttggcaagt ttaggggcac aggagatgta gtcgttccag ggctggggga ggtgggaggg 1021 atcacgccta taggtgtggg cacatgaaac gacctggaac ttgcttcaca gccctgagga 1081 aggtggactt acataagcag ctgtattcca ttagatgagt gggatttagg gaacgcagaa 1141 ggcacatccc tttggaatgg aagcttaggg gttctcaggt gatagggaga ggtggctgtt 1201 aacagtgggc tgcttggaca cgcgtgtgca tgtgcacgca tgctggtgtg catgctgggc 1261 tgcctggcaa atctggtggt ggtgggattc ctcaaggaga aaacattccc tcttgcaatg 1321 gcaagaacta ggggcagttc tctgtccctc ctcccaaccc ctcctttccc ctgcccttgt 1381 cctgatgcct caaggcttag agagaaacat tgtatccaga ccgagggctc tgctgcttct 1441 ttccagaaag tgattggcaa ggctttggag agaagagcag ttctgcagct ggccttgttc 1501 cttcatcatc ccccttcctt gtgcattatg cacttgctgc tgcctcctgg gctctgatag 1561 aagggcaggg ctgttgagcc tggatgggtg gaggcttagg tagccggacc tgcctgccac 1621 cctcctctcc cactcaggca caatggtgcc taaagtgttt ccaatctctg ggacctctgt 1681 acccaaactg aaactctaaa ttggggccct aactaatttt ccttttgagg ttgtgggcat 1741 aagtgctgat ctagaataca gtctgggtcc cacactgtgt ctcagtgaga ctgttgatgc 1801 cttgagatga ccatttcaga tctgaatccc atgggtgtga gggtgatggg tactccagga 1861 ctggcctatg ctgtgttgtg ggctttggtt cggctttatc aggggccagg catatgggtt 1921 ctagagtacc taccatgacc tagaagcatt tatgatttat ttgaagccac actgtttgca 1981 tgggtgttac ttgtctgtac ctcagagtct gaggatgtta actttggaac tcgcagtcct 2041 ctagaacagc ttcagattat ggctttttct tttgaggaag aaattattca ctccagatgc 2101 atgccctgag ccagacctca ctgctgcact ttccaaggtg ctaagattgc tgctctccaa 2161 tgctaacttt ctgacacagt gctctagaac cctgcctgtg gtcctgagca ctgatcagct 2221 tagctagacc atggttgact cttcttggag attttcactt ggtcctagaa tgtggcaacg 2281 tagttgtgct cgccagaacg tgggaccaaa ttggcctcag gtgttgagtc cagacttctg 2341 cttttgagag agggctgcac tttttcatgg tatttctagg ggaggtggta ggctgcatgt 2401 gccacttggt cttgttgtga gtatgctgac accagaaact cagagccagc ttgtggcaag 2461 cagttggggt ggggggtctc tgacttgctc aggacaaact aggccagtgg ttttcaaact 2521 gcttggcaga gccctgaagt ttcctagggg ttgcctcagg agtccttggg gagatgaagg 2581 gggtggggag ctgagcaggc tgggcaattt gccctcaaac agaacagctc cccttgtagc 2641 tgtcttacat attggggttc agggtaagat tttatttgca ttaaggggtt tgctgctgaa 2701 aaaaagttgg aaaaccactg actagaccat cggctccaaa ttggagtctg tgcttccttc 2761 cccaggtatg gagcacactc ttcaccctac cctctaccac aggacacata tccctgttag 2821 cattccccgg gacctttagc caagaggagc tgcagggacc atggccaggt taccaaaatg 2881 ccctgctctg aagccttgac acctgggtgg aaagagaggc tgttttctga aagggtaaag 2941 ggcttggtct ggattcccag aagcatagct tagatgggac cacagtgggc aattttgacc 3001 tgtcctgccc ttcttagctt gaagggaaac cccagagact cttctgtcag ggaaaactag 3061 ggactctctt ctagagccat atagttcctt gggattagct cttggccaag aaggctgagt 3121 atggttccca atttttaaat ccatttcatt ttttaaaaaa taagggaaat aaatgtaatt 3181 gccatttttc aaagattaag taggaggaga ggggtttctt gctctccaga gcccaaaggg 3241 acaaataggg actttgttta ggccaaggaa ggagcggaag tagggcaact cggtcctgcg 3301 attattaatc ccactcccca cttattctag ggcacacaaa cactatttta cttttttaaa 3361 atcataaaac ggcagaacag atttggttag tttagaagaa aagaaagctc tataaatata 3421 aatctatatt cctgtatttt tatttaataa tttataaata ccaagttcat ttgactttta 3481 tttttgtgta atatgtaatg atcgtattaa aaacaataaa taaagcccag aagtttaatg 3541 ag // LOCUS D87463 3040 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0273 gene, complete cds. ACCESSION D87463 NID g1665810 KEYWORDS KIAA0273. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pSPORT 1 clone:HA6723. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3040) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (27-AUG-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes.VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from human cell line KG-1 and brain JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohira,M., Kawarabayasi,Y., Ohara,O., Tanaka,A., Kotani,H., Miyajima,N. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain JOURNAL DNA Res. 3 (5), 321-329 (1996) MEDLINE 97191544 FEATURES Location/Qualifiers source 1..3040 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HA6723" /clone_lib="pSPORT 1" /sex="male" /tissue_type="brain" gene 404..1396 /gene="KIAA0273" CDS 404..1396 /gene="KIAA0273" /codon_start=1 /db_xref="PID:d1014092" /db_xref="PID:g1665811" /translation="MELLSTPHSIEINNITCDSFRISWAMEDSDLERVTHYFIDLNKK ENKNSNKFKHRDVPTKLVAKAVPLPMTVRGHWFLSPRTEYSVAVQTAVKQSDGEYLVS GWSETVEFCTGDYAKEHLAQLQEKAEQIAGRMLRFSVFYRNHHKEYFQHARTHCGNML QPYLKDNSGSHGSPTSGMLHGVFFSCNTEFNTGQPPQDSPYGRWRFQIPAQRLFNPST NLYFADFYCMYTAYHYAILVLAPKGSLGDRFCRDRLPLLDIACNKFLTCSVEDGELVF RHAQDLILEIIYTEPVDLSLGTLGEISGHQLMSLSTADAKKDPSCKTCNISVGR" BASE COUNT 547 a 1007 c 878 g 608 t ORIGIN 1 gcctctcttc ttgcgctgct cagctgggaa catcgtctca ccaggggcag cagcgacgcg 61 ctgcacagcc agacaggagc tggttgcggg gcatggaagc agcctccttg gcagccggga 121 gaggagcaag cgcacgccac tgcccgtgac ccaggcgtcc ggctgctgtc ccctgccggg 181 gagctcatcc acgcagaggt ctctccctgt cctccctgcg agcttttcct ctgcagagcc 241 cagtggagcc agccatgtac cgcggcaacg cttgctgacc ccagtgagga gagaggcctg 301 aatccgaggg ccctggcacc tgtgagctct tggctgtcac ctgccccaag cctcgcacct 361 ccgctccaca gcagtcccca caggagacaa ccctgacggg agcatggagc tgctgtccac 421 gccccacagc attgagatca acaacatcac ctgcgactcc ttccgcatct cctgggccat 481 ggaggacagt gacctggaga gggtcaccca ttacttcatt gaccttaaca agaaggagaa 541 taagaattcc aacaagttca agcaccggga cgtccccacc aagctcgtgg ccaaggcagt 601 gccactgccc atgacggtga gaggccactg gttcctgagc ccccgcacgg agtacagtgt 661 ggccgtgcag acggcagtga agcagagcga tggggagtac ctggtgtccg gctggagcga 721 gacggtggag ttctgcactg gggattatgc caaggagcac ctggctcagc ttcaggagaa 781 agctgagcag atcgcaggcc gcatgctccg cttctccgtc ttctaccgca accatcacaa 841 ggagtacttc cagcatgcca ggacccactg cgggaacatg ctgcagcctt acctgaagga 901 caacagcggc agccacggct cccccaccag cggtatgctc cacggggtct tcttcagctg 961 caacacggag ttcaacacgg gccagccccc gcaggactcc ccctacggcc gctggcgctt 1021 ccagatccca gctcagcgcc tcttcaatcc cagcaccaac ctctactttg cggacttcta 1081 ctgcatgtac acggcctacc actacgccat cctggtgctg gcgcccaaag gctccctggg 1141 ggaccgcttc tgccgcgacc gcctgcccct cctggacatt gcttgcaaca agttcctgac 1201 ctgcagcgtg gaggatgggg agctggtctt ccgccacgcc caggacctca tcctggagat 1261 catctacact gagcccgtcg acctgtccct gggcaccctg ggggagatca gtgggcacca 1321 gctcatgagt ctgtctactg ccgatgccaa gaaggacccc agctgcaaga cctgcaacat 1381 cagcgtgggc cgctagggac tcctggggag ctggggggcg agggagagat gagcggaagg 1441 tggaggtggg tagcccaggt tcagggagct ggctttctct ctcctgcccc cctgctccct 1501 ccactctgcc cagctgccct cccctcgccc cttggcattg gaggcaacag acggtggtcc 1561 tcttggtaag tgtggcccac acccagcctg gtggacatgg aaaggcttct cagcctctgt 1621 agttggcagc tgggctggac ttctggagaa cttccccttt cctctggttt ccttgtccta 1681 ctgttctttg cctcgtccag gtctctatct cccaaagacc tgcccacctc ctgggctctc 1741 ccagggaagc ccctctgggc caaagcccat gggtagcttg gctgcctcca gaagagatgc 1801 agagagctgt ggggcgatct ttcttagtcc cactcacccc actcagggcc aaccacagaa 1861 tgccccctcc ctgctgagct gggtctctgg ggcttcccac tccaggcctg gatcccttcc 1921 ttctataaat agtctcctgc accaccgtga ctccctggct gctccttgcc ccaaagaacc 1981 ctcttcctgc atgggagagc tgtccctcct tctccacctg cccaccggga cactccccct 2041 gctctctaga aaggaactct ctttcttctc ttggcactgg gggcgtcagg ctgtcggatg 2101 ttccttcctt ccatacccca gcataccttc tccttgctta gagtgcaggg gtctgagaca 2161 gcagccactg gggttgagga gggggaggag acccctgaga cccagaggtg ccccttcctc 2221 agctgggagt gggccgcctc atgctctgct ggggcctggc atggaacgtg agacctgtaa 2281 gctgtcccgc gggccccggc accccgccca gctgtgcact ttgctcactc gctctgccat 2341 ggtgccatga ctgtactgtg cccatgcgtg acctggactg tggaccctct gctgctccgc 2401 ctctcccctc cccactggct ctgtctgctc tcctgccacc ctgctggccg ggagcccctc 2461 cccggggagt tcttggtgaa gtccttcccg ggcctccttg tgtttttgcc tcattcctac 2521 tgtcacacag gtcacgaggg tggactccct acaatcaaca aagcaaacag agagcctgtg 2581 ggaggggctg acagcagcag ccggctgttt gggggatgat ggaggtgaca tcaggcagag 2641 gagagtgcag cctcacagtg actttctcag aggtgacaga gatgatggat gagcagctgg 2701 attttcgtga tgaaggacgg aagcagcagc gggccggcaa ggccatacct cggtgaggga 2761 caggtggaca acggtcacct atctgtagcc aggggcagtt gtgtggccag ctgtctctct 2821 gggatgagtc aggaggcctg gaggcttggg gagaggtgtg gagaaggaga gaacatggcc 2881 caggcccttt ccttccccct gtgctgacag cattgctgtg ggggtggccc actgccctcc 2941 cctggccctc atgtcccccc ggggctgggg tccgcctgcc tgtgctgtgc ttgcgacgtg 3001 catcaataaa ccaccatggc ctgagggccc tgctcgtctc // LOCUS D87464 3010 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0274 gene, complete cds. ACCESSION D87464 NID g1665812 KEYWORDS KIAA0274. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pSPORT 1 clone:HA6690. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3010) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (27-AUG-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes.VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from human cell line KG-1 and brain JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohira,M., Kawarabayasi,Y., Ohara,O., Tanaka,A., Kotani,H., Miyajima,N. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain JOURNAL DNA Res. 3 (5), 321-329 (1996) MEDLINE 97191544 FEATURES Location/Qualifiers source 1..3010 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HA6690" /clone_lib="pSPORT 1" /sex="male" /tissue_type="brain" gene 125..2848 /gene="KIAA0274" CDS 125..2848 /gene="KIAA0274" /note="Similar to S.cerevisiae hypothetical protein N0330 (S55864)" /codon_start=1 /db_xref="PID:d1014093" /db_xref="PID:g1665813" /translation="MPTAAAPIISSVQKLVLYETRARYFLVGSNNAETKYRVLKIDRT EPKDLVIIDDRHVYTQQEVRELLGRLDLGNRTKMGQKGSSGLFRAVSAFGVVGFVRFL EGYYIVLITKRRKMADIGGHAIYKVEDTNMIYIPNDSVRVTHPDEARYLRIFQNVDLS SNFYFSYSYDLSHSLQYNLTVLRMPLEMLKSEMTQNRQESFDIFEDEGLITQGGSGVF GICSEPYMKYVWNGELLDIIKSTVHRDWLLYIIHGFCGQSKLLIYGRPVYVTLIARRS SKFAGTRFLKRGANCEGDVANEVETEQILCDASVMSFTAGSYSSYVQVRGSVPLYWSQ DISTMMPKPPITLDQADPFAHVAALHFDQMFQRFGSPIIILNLVKEREKRKHERILSE ELVAAVTYLNQFLPPEHTIVYIPWDMAKYTKSKLCNVLDRLNVIAESVVKKTGFFVNR PDSYCSILRPDEKWNELGGCVIPTGRLQTGILRTNCVDCLDRTNTAQFMVGKCALAYQ LYSLGLIDKPNLQFDTDAVRLFEELYEDHGDTLSLQYGGSQLVHRVKTYRKIAPWTQH SKDIMQTLSRYYSNAFSDADRQDSINLFLGVFHPTEGKPHLWELPTDFYLHHKNTMRL LPTRRSYTYWWTPEVIKHLPLPYDEVICAVNLKKLIVKKFHKYEEEIDIHNEFFRPYE LSSFDDTFCLAMTSSARDFMPKTVGIDPSPFTVRKPDETGKSVLGNKSNREEAVLQRK TAASAPPPPSEEAVSSSSEDDSGTDREEEGSVSQRSTPVKMTDAGDSAKVTENVVQPM KELYGINLSDGLSEEDFSIYSRFVQLGQSQHKQDKNSQQPCSRCSDGVIKLTPISAFS QDNIYEVQPPRVDRKSTEIFQAHIQASQGIMQPLGKEDSSMYREYIRNRYL" BASE COUNT 863 a 628 c 696 g 823 t ORIGIN 1 agtgcctaat gggtgttgtt cctggctgga cttgatgtcc agggcctgag gggttttctc 61 gccgagtctc ctggggcggt ccggaggctc gtgccctgtt gtggggcccc catttgccgc 121 cgccatgccc acggccgccg cccccatcat cagctcggtc cagaagctgg ttctgtatga 181 gactagagct agatactttc tagttgggag caataatgca gaaacgaaat atcgtgtctt 241 gaagattgat agaacagaac caaaagattt ggtcataatt gatgacaggc atgtctatac 301 tcaacaagaa gtaagggaac ttcttggccg cttggatctt ggaaatagaa caaagatggg 361 acagaaagga tcctcgggct tatttcgagc ggtttcagct tttggtgttg tgggttttgt 421 caggttctta gaaggctatt atattgtgtt aataactaaa aggaggaaga tggcggatat 481 tggaggtcat gcaatctata aggtcgaaga tacaaatatg atctatatac ccaatgattc 541 tgtacgggtt actcatcctg atgaagctag gtatctacga atatttcaaa atgtggacct 601 atctagcaat ttttacttta gttacagcta tgatttgtcc cactcacttc aatataatct 661 cactgtcttg cgaatgcccc tggagatgtt aaagtcagaa atgacccaga atcgccaaga 721 gagctttgac atctttgaag atgaaggatt aattacacaa ggtggaagcg gggtatttgg 781 gatctgtagt gagccttata tgaaatatgt atggaatggt gaacttctgg atataattaa 841 aagtactgtg catcgtgact ggcttttgta tattattcat gggttctgtg ggcagtcaaa 901 gctgttgatc tatggacgac cagtgtatgt cactctaata gctagaagat ccagtaaatt 961 tgctggcacc cgttttctta aaagaggtgc aaactgtgag ggtgatgttg caaatgaagt 1021 ggagactgaa caaatactct gcgatgcttc tgtgatgtct ttcactgcag gaagttattc 1081 ttcatatgta caagttagag gatctgtgcc cttatactgg tctcaggaca tttcaactat 1141 gatgcctaaa ccacctatta cattggatca ggcagatcca tttgcacatg tggctgccct 1201 tcactttgac cagatgttcc agaggtttgg ctctcccatc atcatcttga atttagtgaa 1261 ggaacgagag aaaagaaagc atgaaagaat tctgagtgaa gaacttgttg ctgctgtgac 1321 ctatctcaac caatttttgc ctcctgagca cactattgtt tatattccct gggacatggc 1381 caagtatacc aaaagcaagc tgtgtaatgt tcttgatcga ctaaatgtga ttgcagaaag 1441 tgtggtgaag aaaacaggtt tctttgtaaa ccgccctgat tcttactgta gcattttgcg 1501 gccagatgaa aagtggaatg aactaggagg atgtgtgatt cccactggtc gcctgcagac 1561 tggcatcctt cgaaccaact gtgtggactg tttagatcgc accaacacag cacagtttat 1621 ggtgggaaaa tgtgctctgg cctatcagct gtattcactg ggactgattg acaaacctaa 1681 tctacagttt gatacagatg cagttaggtt atttgaggaa ctctatgaag atcatggtga 1741 taccctatcc cttcagtatg gtggttctca acttgttcat cgtgtgaaaa cctacagaaa 1801 gatagcacca tggacccagc actccaaaga catcatgcaa accctgtcta gatattacag 1861 caatgctttt tcagatgccg atagacaaga ttccattaat ctcttcctgg gagttttcca 1921 tcccactgaa gggaaacctc atctctggga gctcccaaca gatttttatt tgcatcacaa 1981 aaataccatg agacttttgc caacaagaag aagttatact tactggtgga caccagaggt 2041 gataaagcat ttaccattgc cctatgatga agttatctgt gctgtgaact taaagaagtt 2101 gatagtgaag aaattccaca aatatgaaga agagattgat atccacaatg agttctttcg 2161 gccatatgag ttgagcagct ttgatgatac cttttgcttg gctatgacaa gctcagcacg 2221 tgactttatg cctaagaccg ttggaattga tccaagtcca tttactgtgc gtaaaccaga 2281 tgaaactgga aaatcagtat tgggaaacaa aagcaataga gaagaagctg tattacagcg 2341 gaaaacggca gccagcgccc cgccgccccc cagcgaggag gctgtgtcca gcagctctga 2401 ggatgactct gggactgatc gggaagaaga gggctctgtg tctcagcgct ccactcccgt 2461 gaagatgact gatgcaggag acagtgccaa agtgaccgag aatgtggtcc aacccatgaa 2521 ggagctatat ggaattaacc tctcagatgg cctctcagaa gaagatttct ccatttattc 2581 aagatttgtt cagctggggc agagtcaaca taaacaagac aagaatagcc agcagccctg 2641 ttctaggtgc tcagatggag ttataaaact aacacccatc tcagctttct cgcaagataa 2701 catctatgaa gttcagcccc caagagtaga cagaaaatct acagagatct tccaagccca 2761 catccaggcc agccaaggta tcatgcagcc cctaggaaaa gaggactcct ccatgtaccg 2821 agagtacatc aggaaccgct acctgtgaaa agagcgcagg tccacctggt ggacacgtct 2881 gattagctta gaacctgtct tgtctcatct tcaaaaggta acttattaaa agtcctttgc 2941 gtctgaagcc tttctccttt tctgtcattt gcaaattcca aattatagct aataaagatg 3001 actagataac // LOCUS D87467 5900 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0277 gene, complete cds. ACCESSION D87467 NID g1665818 KEYWORDS KIAA0277. SOURCE Homo sapiens male brain cDNA to mRNA, clone_lib:pSPORT 1 clone:HA6833. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5900) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (27-AUG-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes.VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from human cell line KG-1 and brain JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohira,M., Kawarabayasi,Y., Ohara,O., Tanaka,A., Kotani,H., Miyajima,N. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain JOURNAL DNA Res. 3 (5), 321-329 (1996) MEDLINE 97191544 FEATURES Location/Qualifiers source 1..5900 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HA6833" /clone_lib="pSPORT 1" /sex="male" /tissue_type="brain" gene 56..1798 /gene="KIAA0277" CDS 56..1798 /gene="KIAA0277" /note="Similar to a C.elegans guanine nucleotide releasing factor homolog (S4 2368)" /codon_start=1 /db_xref="PID:d1014096" /db_xref="PID:g1665819" /translation="MGSSRLRVFDPHLERKDSAAALSDRELPLPTFDVPYFKYIDEED EDDEWSSRSQSSTEDDSVDSLLSDRYVVVSGTPEKILEHLLNDLHLEEVQDKETETLL DDFLLTYTVFMTTDDLCQALLRHYSAKKYQGKEENSDVPRRKRKVLHLVSQWIALYKD WLPEDEHSKMFLKTIYRNVLDDVYEYPILEKELKEFQKILGMHRRHTVDEYSPQKKNK ALFHQFSLKENWLQHRGTVTETEEIFCHVYITEHSYVSVKAKVSSIAQEILKVVAEKI QYAEEDLALVAITFSGEKHELQPNDLVISKSLEASGRIYVYRKDLADTLNPFAENEES QQRSMRILGMNTWDLALELMNFDWSLFNSIHEQELIYFTFSRQGSGEHTANLSLLLQR CNEVQLWVATEILLCSQLGKRVQLVKKFIKIAAHCKAQRNLNSFFAIVMGLNTASVSR LSQTWEKIPGKFKKLFSELESLTDPSLNHKAYRDAFKKMKPPKIPFMPLLLKDVTFIH EGNKTFLDNLVNFEKLHMIADTVRTLRHCRTNQFGDLSPKEHQELKSYVNHLYVIDSQ QALFELSHRIEPRV" BASE COUNT 1677 a 1245 c 1288 g 1690 t ORIGIN 1 gggatggcaa gatcttttag catttagggg atgcctttgt tagtaaccgt tcacaatggg 61 cagctcccgg ctgagggtct ttgaccctca tttggagagg aaagattccg ccgcggcgct 121 ctcagaccga gagctgccct tgcctacctt cgatgtgcct tatttcaaat acatcgacga 181 ggaggatgag gacgatgaat ggagcagccg ttcgcagtct tccaccgagg atgactcagt 241 ggactctctg ctctctgaca gatatgtggt ggtgtccggg accccggaga agattttgga 301 gcaccttttg aatgacttgc acctggaaga agtccaggac aaagaaacag agaccctcct 361 ggatgacttc cttctcacgt acactgtctt catgacaact gatgacttgt gccaggctct 421 gttaaggcac tattctgcta agaagtatca aggcaaagag gaaaactcag atgttccgcg 481 taggaaacgt aaagtcttgc atcttgtttc ccagtggatt gctctgtaca aagactggtt 541 acctgaagat gaacattcaa aaatgttttt aaagaccata tataggaatg tactggatga 601 tgtttatgaa tatccaatac ttgaaaaaga attgaaagaa tttcaaaaga tacttggaat 661 gcaccgtcgt cacactgtag atgaatattc accacaaaaa aagaataaag cccttttcca 721 ccaattcagt cttaaggaga actggctcca gcatagagga actgtgactg aaacggagga 781 aattttctgc cacgtgtata taacagagca ctcctatgtc agtgtgaagg caaaagtttc 841 cagtatagcc caagagatcc taaaagtcgt ggcagaaaag atccagtatg cagaagagga 901 tctggctctg gtggccatca cattctctgg ggaaaagcat gaacttcagc caaatgactt 961 agtcatctcc aaatccctcg aggcatctgg tcgaatatat gtctaccgga aagacctggc 1021 ggacactttg aacccatttg cagaaaatga ggaatcacag caaaggtcga tgaggatttt 1081 gggaatgaac acttgggatc ttgctctgga attaatgaat tttgattgga gtctattcaa 1141 ttcaattcac gagcaagagc tgatctactt cacgttcagc agacagggaa gtggggaaca 1201 cactgcaaat ctcagccttc tgctccagag atgcaatgag gtccagcttt gggtggccac 1261 ggagattctg ctctgcagcc agctgggcaa gcgagtgcag ctggtgaaaa aattcatcaa 1321 aattgcggct cactgcaaag cccagagaaa cctgaattct ttctttgcca ttgtgatggg 1381 tctcaacact gcttctgtca gtcgactgtc gcagacctgg gagaaaatcc ctgggaagtt 1441 taagaaactt ttctctgaac ttgaaagttt aacagatcct tccctaaatc acaaagccta 1501 cagagatgca ttcaaaaaga tgaagccacc aaaaatccct ttcatgccct tattgcttaa 1561 agatgtaaca tttattcatg aaggaaataa aacttttttg gataatcttg tcaattttga 1621 aaagctgcat atgatcgcag acactgtccg aaccctgaga cactgcagga ctaaccagtt 1681 tggtgacctg tctccaaaag agcatcaaga gttaaagtcc tatgttaatc acctgtatgt 1741 cattgacagc cagcaggctc tgtttgagct ctcacacagg atcgagcctc gggtgtgagc 1801 cccactgcct cacctcccct gtatctgcag cactttgagc tacgggaatg tctatgccaa 1861 gcacgttgct ttcctgtgag aaaagaagtt gctgagtttt atcagtataa cccaagacat 1921 tcacaggaaa gccagccaaa gcgtgttcag gaagtgatgt cagccaccag agagggggag 1981 aggtttctcc atgctactct cgggacaaga aggcagaagg agagtcagaa gcattcttga 2041 gatggagaag gctggtttct tatgatcaca ttgttgatcc agtccagttt tcaatatgag 2101 atgtgccagc atcaagacaa gacaacgtct tgacatgcaa tgaccaaata tttcattaag 2161 agcgtgcatg aaacaggaag gagtttttac tttgcctagt tttagattac tgtccataag 2221 ctgtcaaaga agtcattctt ttgaacacct gatgacagag acagcatctc tagatctcca 2281 gggaggagag gtttctgttg atacaacctg tgacatcacc aaaagccact tgtgtctagg 2341 gagttagtga ggactgcagc tagcatccat gctctgatgg gcagatgaac aatgtcaagg 2401 tgtgcatcac tttgcaccac aatcaactat tgacacatgc ttgcaggtga aattagtttc 2461 tgtacaactg atttgcagct ataggcaagg tagatgaagt tgctttgcca gtaaggaaaa 2521 atagtaatct ttaagaaatt gactcattgt ttaatttctg gggattttct ttatacttct 2581 aagcaggctc ttatctttta ttggacataa tatgattttg aaaaagcaca gtgcctgaca 2641 cattgcaaac actcaccaac tgcttgctga ggtgacagag tcacaaaagt ctgcattctt 2701 gtgcctgatg atgcattttg cgtacctcat acaggctcct tgcccacact atggaatgac 2761 agcagccagt gcagggaggt taagtgacat ttaatgagtg aagcacttag cactctctag 2821 gtaataagat agtggtaatt actagtgttt tggcaaatga aaaatgccct gaaatagcca 2881 aatgtctgat taatgttggc aacttagaag tcctataatc caactaccag ccaaagcagg 2941 gagcctttct ataatttgcc tttttttttt tttttttcaa aatctgagtc ttctaaaatc 3001 ttattattcc catttttacc aattgaggct cctgtagcaa ataagacctc ttgatatttt 3061 caaggactgg ttagaggatt tctttcaacc ttcacatgaa caaaacagcc tatgggtcaa 3121 aataatgaaa tccacccctg cctgctagat acttgtcacc ttgctaaaat gcaagggcct 3181 ggtccattca ttttccaaat gcaggagtct tggtgcactt ctcactcttc ctgcctgttc 3241 atctctttca tgcccacaca gacctgtttc ctttttgtct catcaacgcc tcattcatcc 3301 tcattactga ggcgtgtcca atgctttttg acatctttat agcagtgctg tttcctgggc 3361 tcaggaacca cactgagctt gagatactgc tggaaggaac catgtggaga gaaggtttgg 3421 gagaactttg agagagactt agtttggccc agcatgtaaa acttcagtcc tgaacattta 3481 tagggtttta tagaagggca tcctccaggg ctggtccatt cagagaaatg ctgcatgctg 3541 ccgtcatgga atgtggccca caggacacca gagccgtgag aaccggagag cagacttccc 3601 tcacggctgg gctgagcaaa ccctccaaag ccctcctcac gcagttacta acaatagcat 3661 gggcttacag cacaagcacg tgttctcacc tttttcctat gccctggact aaggtttggc 3721 cagtgtaatc atataaggcc atcctgacat tgtttctgtg tttcaaaatt tggattttta 3781 tttacattag aactacattg ctcctagtag aacattacct ttaggggact aattttccat 3841 ggagaactat ttcagcatat tgcatgctgc tcagacccca agtcagatat gcccaccaag 3901 ccagatgaag ctacacaaat gtggtattta aatgcatttt gtacagtgac ttcagagtat 3961 ctcacatgac atgggtgtaa actggctggg gagaaaatga tgcttgttca cctcttcctc 4021 cagccgtggt taggtggtcc taggggtagc agagggaagg gaggattttg tgcagtcaag 4081 atttgctttt ccatccttgt cttctgaatg tctaaaatct ctgcatcttt ctgaagttta 4141 acaactgtct ccagaggttt gccaggcagc agctctcaga agtttccaaa gctttgcaga 4201 atcttagatc tggaattaaa gaattcaagc ccgaattgtg agaaccagat attactcaac 4261 agaaagctct ttctaaggaa tctgagctgt tcactggtgg acagtggtgg ggcttgagtg 4321 ctccttgtta ataggatggg ccatgcaccc tctctggata ttcaccaagg cctcttcaga 4381 atagggtttg ttctggctag aagcgtggtc tagaagatgg ctaagctctt tgccagctct 4441 catttggagt tttattattg cataaaatct tcgctcactc tgcaaatctt acgtaatctg 4501 gcaccttcgg caccaggtgg tgcaggggca cttctaagtg ggctcttttt gttacagcac 4561 aactctcaga cagtcctgtg ggtctttgga ttcgtcagca ttccagcaaa ctagccctgc 4621 ttagaagtta gcacaagacg gcagaatgca ggaccccgta ggcaaaatca caaccttgct 4681 attaaaaaaa attttttttt acatacacat ttgcaggtgt tccctagagt gtggtgtttt 4741 gaatttgctc tttgtcatct gtataattgc caaatgatta tagtgataca catgacctgc 4801 attcactttt ttctagtttc cttaattatg tttagaataa attcgtttcc ctagaccgag 4861 aaccacaaac aggtagtgtg gagcatacac cgaatttaga agcatgtgga taaggtcagt 4921 gctcacactg cctagtccac agggagagga tgctgcatga atatatactt gcctctgagt 4981 ggaggagaaa tcgtggcatg aaagagagag taccagtgat gacttcttat ccctggagct 5041 gggctttcac tgctacccat atcccagccc tgcgagtctg ttctagccag cacagacacc 5101 gcagatccgg aactgaatgt tcctaaatgg cgcagccaat ccaggctttt cagaaactgg 5161 gcaaaaacat taaaatgggg acgatcgggt cttccgcagt ggtccaacac aggatttctt 5221 ttaaatgttt caaaaacatg tccttaaaat ttcagcctgc ttcttagcga gtgggccagt 5281 tttgcttaaa actggtgggg gggcgggggg gaagttttta aaaattgcca aaaagttaga 5341 tgcaaatgta ttactgtata aagcaaagct gtatatacta aacatttttt agcagagtaa 5401 tatttatttg catagtctat ttattgtatt cgtattacac tgttattaaa tactgggatg 5461 aaatcagtga cctgaagcaa gaaatcttgc cttttaatgt atcattaatt agggctgctg 5521 tgatattgtc agcttgcatt aacaattaga agatagagaa cccgccatca gggtgtctac 5581 ctaacttctc agggactaca cttggtagtt ttccaccatt taaagaactg gtaaatatga 5641 aacatttgtt gagttaccag aattgccatt aacagtgttt tctttcccat attccatgct 5701 ttctgcctct gtgtatatat ataatatata tgtatatgac tgtgctgtgt atttatcgaa 5761 gctagtaagc aataatttat atgtaaaaat ggccaagcaa tataaggtta aaacttatat 5821 aagtaaccct taccttatct tgtattttca attttttttt aaaactgctt ttccaaatat 5881 gagactatgt taaagacact // LOCUS D87717 5615 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0013 gene, complete cds. ACCESSION D87717 D13638 NID g1663709 KEYWORDS KIAA0013. SOURCE Homo sapiens bone marrow myeloblast cell_line:KG-1 cDNA to mRNA, clone_lib:pBluescript clone:HA0450. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5615) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (09-SEP-1996) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 5615) AUTHORS Nomura,N., Miyajima,N., Kawarabayasi,Y. and Tabata,S. TITLE Prediction of new human genes by entire sequencing of randomly sampled cDNA clones JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 JOURNAL DNA Res. 1 (1), 27-35 (1994) MEDLINE 96051387 REFERENCE 4 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 (supplement) JOURNAL DNA Res. 1 (1), 47-56 (1994) MEDLINE 96051389 FEATURES Location/Qualifiers source 1..5615 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /chromosome="15" /clone="HA0450" /clone_lib="pBluescript" /map="RH-ID RH25252" /tissue_type="bone marrow" gene 722..3793 /gene="KIAA0013" CDS 722..3793 /gene="KIAA0013" /note="similar to human GTPase-activating protein(A49869)" /codon_start=1 /db_xref="PID:d1014132" /db_xref="PID:g285981" /translation="MWDQRLVRLALLQHLRAFYGIKVKGVRGQCDRRRHETAATEIGG KIFGVPFNALPHSAVPEYGHIPSFLVDACTSLEDHIHTEGLFRKSGSVIRLKALKNKV DHGEGCLSSAPPCDIAGLLKQFFRELPEPILPADLHEALLKAQQLGTEEKNKATLLLS CLLADHTVHVLRYFFNFLRNVSLRSSENKMDSSNLAVIFAPNLLQTSEGHEKMSSNTE KKLRLQAAVVQTLIDYASDIGRVPDFILEKIPAMLGIDGLCATPSLEGFEEGEYETPG EYKRKRRQSVGDFVSGALNKFKPNRTPSITPQEERIAQLSESPVILTPNAKRTLPVDS SHGFSSKKRKSIKHNFNFELLPSNLFNSSSTPVSVHIDTSSEGSSQSSLSPVLIGGNH LITAGVPRRSKRIAGKKVCRVESGKAGCFSPKISHKEKVRRSLRLKFNLGKNGREVNG CSGVNRYESVGWRLANQQSLKNRIESVKTGLLFSPDVDEKLPKKGSEKISKSEETLLT PERLVGTNYRMSWTGPNNSSFQEVDANEASSMVENLEVENSLEPDIMVEKSPATSCEL TPSNLNNKHNSNITSSPLSGDENNMTKETLVKVQKAFSESGSNLHALMNQRQSSVTNV GKVKLTEPSYLEDSPEENLFETNDLTIVESKEKYEHHTGKGEKCFSERDFSPLQTQTF NRETTIKCYSTQMKMEHEKDIHSNMPKDYLSKQEFSSDEEIKKQQSPKDKLNNKLKEN ENMMEGNLPKCAAHSKDEARSSFSQQSTCVVTNLSKPRPMRIAKQQSLETCEKTVSES SQMTEHRKVSDHIQWFNKLSLNEPNRIKVKSPLKFQRTPVRQSVRRINSLLEYSRQPT GHKLASLGDTASPLVKSVSCDGALSSCIESASKDSSVSCIKSGPKEQKSMSCEESNIG AISKSSMELPSKSFLKMRKHPDSVNASLRSTTVYKQKILSDGQVKVPLDDLTNHDIVK PVVNNNMGISSGINNRVLRRPSERGRAWYKGSPKHPIGKTQLLPTSKPVDL" BASE COUNT 1783 a 981 c 1275 g 1576 t ORIGIN 1 gaaactgcgg gtgtgacccc cccgtggtgg ctctgggtgt ctgcggagga gctgggggcg 61 gaagatgagg ctaacggctt ggcttcagtg aacgcaccgg gatgtgcagg ccgggaggta 121 gaggcaggct gatgggggag ggaacgagca gcctgtgaga cggggtgacg gcggctacca 181 gcccgggcgg gcaccgggac tggaagagtt gcctgagcag ccggctggtc cggcggccag 241 gctagggcgg gggcgagcgc ccagttgagc ctgctggggc tggaggagcg agaagggttt 301 tcttcacatt tcagagcgaa ccagacgggg acagtaaggt ttggaggaag ggggatcgtt 361 ggaagtagca agaagtggag agaatctggc aatagacgag aaaccgaaag aatcagaaag 421 aagtctatgt gagtagctga aagcattggg tgaccagaaa gaaggtcggt gtaagtgaag 481 gaagagtgag gtgtggctgg atcaaagggc taagagaagc gggtctgtgt aagtggatgt 541 gagtgaggat caaggaaaag ccgtggaagt ggccgggggt cggggccgca gaagtgccag 601 acggggccgg aaagcagccg agcggagttc aaatttgaga gcgtttggaa attggaagac 661 ttggtggcga acgagggtca ggacctgcat cctgcctcag agagttatcg acgtatccgg 721 aatgtgggat cagaggctgg tgaggttggc cctgttgcag catctgcggg ccttctatgg 781 tattaaggtg aagggtgtcc gtgggcagtg cgatcgcagg agacatgaaa cagcagccac 841 ggaaataggg ggtaaaatat ttggagtacc ttttaatgca ctgccccatt ctgctgtacc 901 agaatatgga cacattccaa gctttcttgt cgatgcttgc acatctttag aagaccatat 961 tcataccgaa gggctttttc ggaaatcagg atctgtgatt cgcctaaaag cactaaagaa 1021 taaagtggat catggtgaag gttgcctatc ttctgcacct ccttgtgata ttgcgggact 1081 tcttaagcag ttttttaggg aactgccaga gcccattctc ccagctgatt tgcatgaagc 1141 acttttgaaa gctcaacagt taggcacaga ggaaaagaat aaagctacac tgttgctctc 1201 ctgtcttctg gctgaccaca cagttcatgt attaagatac ttctttaact ttctcaggaa 1261 tgtttctctt agatccagtg agaataagat ggacagcagc aatcttgcag taatatttgc 1321 accgaatctt cttcagacaa gtgaaggaca tgaaaagatg tcttctaaca cagaaaagaa 1381 gctacgatta caggctgcag tagtacagac tcttatcgat tatgcatcag atattgggcg 1441 tgtaccagat tttatcctgg aaaagatacc agccatgttg ggtattgatg gtctctgtgc 1501 tactccatca ctggaaggct ttgaagaagg tgaatatgaa actcctggtg aatataagag 1561 aaagagaaga caaagtgtag gagattttgt tagtggagca ctaaataaat ttaaacctaa 1621 cagaacacct tctattacac ctcaagaaga aagaattgcc cagctatctg aatcaccagt 1681 gattcttaca ccaaatgcta agcgtacatt gccagtagat tcttctcatg gtttctcaag 1741 taagaaaagg aagtccatca agcacaattt taactttgag ctgttgccaa gtaatctctt 1801 caatagcagt tctacaccgg tatcagttca catcgataca agctcagaag ggtcatctca 1861 gagttcactc tctcctgtac tcattggtgg aaaccatttg atcactgcag gtgtgccaag 1921 gcgaagtaaa agaattgcag gcaaaaaagt ttgcagagtg gaatcaggaa aagcaggctg 1981 cttttctcct aaaatcagcc ataaagaaaa ggttcgaaga tctctgcgtt tgaaattcaa 2041 tctagggaaa aatggcagag aagtaaatgg atgttctggt gtcaatagat atgaaagtgt 2101 tggttggcga cttgcaaatc aacaaagttt aaaaaatcga attgaatctg taaaaacagg 2161 tttgcttttt agcccagatg ttgatgaaaa gttaccaaag aaaggttcag aaaagatcag 2221 taagtctgag gaaaccttac taactccaga gcgactagtt ggaacaaatt accggatgtc 2281 ttggacagga cctaataatt caagttttca agaagtagat gcaaatgaag cttcttcaat 2341 ggtggaaaat cttgaggtag aaaactcttt ggagcctgat attatggtag aaaagtcacc 2401 tgctacttca tgtgaactca ccccttccaa tttaaacaat aagcataata gcaacataac 2461 aagtagccct cttagcgggg atgaaaataa catgaccaaa gagactttgg tgaaagttca 2521 aaaagcgttt tctgaatctg gaagtaatct tcacgcattg atgaatcaga ggcagtcatc 2581 agtaactaat gtggggaaag taaaattaac tgaaccatct tatttagaag atagcccaga 2641 ggaaaatcta tttgaaacta atgatttgac tatagtagaa tcaaaggaga aatatgaaca 2701 ccacactggt aaaggtgaaa aatgtttttc agagagggac ttttcacccc ttcaaactca 2761 aacatttaat agagaaacaa ctataaaatg ttattcaact cagatgaaga tggaacatga 2821 aaaagacatt cattcaaata tgccaaaaga ttatttaagc aagcaagaat tctccagtga 2881 tgaagaaata aagaaacagc agtccccaaa ggataaacta aataataaat taaaagagaa 2941 tgagaatatg atggaaggta acttaccgaa gtgtgcagca catagcaagg acgaggctag 3001 atcctctttc tcacagcaga gtacatgtgt tgtaacaaac ttgtcaaaac ctaggcctat 3061 gagaattgct aaacagcagt cattggaaac atgtgagaaa acagtttctg aaagttcaca 3121 aatgacagaa catagaaagg tttctgatca catacagtgg tttaacaagc tttctttaaa 3181 tgaaccaaat agaataaaag tcaagtcacc tcttaagttt cagcgtactc ctgttcgtca 3241 gtccgtcaga agaattaatt ctttgttgga gtatagcaga caacctacag ggcataagtt 3301 ggcgagtctt ggtgatacag cttctccttt ggtcaaatca gtgagctgtg acggtgctct 3361 ttcctcttgt atagaaagtg catcaaaaga ttcctctgtt tcatgtatca aatcaggtcc 3421 taaagaacag aagtccatgt catgtgaaga gtcaaatatt ggtgcaattt caaagtcaag 3481 catggagtta ccctcgaaat ctttcttaaa gatgaggaag cacccagatt cagtgaatgc 3541 ttctcttagg tctactacag tttataaaca gaagatctta tctgatggcc aagttaaggt 3601 tcccttggat gatctgacta atcatgatat agtaaaacca gttgtaaata acaacatggg 3661 catttcttct gggataaata acagggtcct taggagacca tcagaaagag gaagggcctg 3721 gtacaaaggt tctccaaaac atcctatcgg aaaaactcaa ttactaccaa caagtaaacc 3781 tgtagatttg taattggtaa atgttatact tgtcattaat gtaaataaag tgagtaattg 3841 gtatgacttg caggatgatg tacatgttag tttgtagctc aggatgattg ttaagcaata 3901 gatttgctct attgaaaatg tttcattttt ttcactgtac aagcaactta gatttttatt 3961 tgtacaaatt acttctttgt ttttcttaat gatggcaatt tttaaacttt aattttattg 4021 tgatctctta aagcagaggt tagactttac ctttctgact ctgtcgtcca ggctggagtg 4081 cagtggcgca atctcactgc aagctccact tcctgggttc atgccatttt cctgcctcag 4141 cctcccgagt agctgggact acaggtgccc gccaccacgc ccagctaatt ttttgtattt 4201 ttagtagaga cggtttcacc gtgttagcca ggatggtctc gatctcctga ccttgtgatc 4261 cgcccgcctc agcctcccaa agtgctggga ttacaggcat gagccaccac gcccggctag 4321 actttacctt tctaaagaaa ttgtttactg gatttataag aagttaattt ttgaaaatga 4381 catatttttg tgtgatagaa agaatggagc aagttgtgcc tatttcctcc aagtcagata 4441 aggtttctaa aataaataaa tttctagcat ataaagggta gagataaact ctgcaaatct 4501 tatgtctgga attatattaa tgtttattgt ccttgccaaa attcctagaa attaatttcc 4561 ttcaatagca tcctaaaact ctatttttat ttggggcaga gtaatttcat ttatagtgcc 4621 agtaggtgta ccttgtgttc actcgaacta agaacaatgg ttaaggcaga ataatgacta 4681 aaatatgttc atatattatg atgtggaaat aattgataac ttttaagcca tactatgttt 4741 ttaaagataa tttgcacaaa cacgtttgtg tctgttctgt ccaatataga tttggcaatt 4801 atttaaagag ggataatctt gaaaaaaatt aaccaaggtg atttcttata tgtagatgct 4861 cgattttgga atttgaaata gtagatgcac ctctttacct tttttacttg gataaaaacc 4921 tatgatgatt ttgtcctgtg tgtaaatgtt atttatttag catagacatt aaagataact 4981 ctctggaaaa tgacttgact aaggctctca tgaaattcaa agtgccattt agaacatgca 5041 ccaaattgtc aagtaaatct gtctaaattt atattttaaa ttattacaaa ttacacatct 5101 ttgaggaaag agtattatga acaatagaac atattctcta ggttgtagag gaaggaataa 5161 gcagacagaa tcaaccacta aaggtagttt ttcagattgg ttgttagaat gtcatgttta 5221 gatgttggag cagattagag cagcattcat gccactcgga gcaaccagac ttacagcata 5281 agtatgtacg aggaatttca aatcatcaga tgtttgcttg gctaggttct actttgttta 5341 tttgatatca aataggtttg tagatgttta tggcatttct aattgtaagt agagacaaaa 5401 tattcatata gtcagatata tgttgtctgc tttaaacaat ttttaaattt taaaaatgca 5461 ttaacgtctt tttatatcca tcaagggaag gatgaaatgt tgaatttgaa gactaattca 5521 gtaagaagtc ctaggggttt aactgtacat actacctgaa ctggcttttc tgagagatga 5581 atcaataatg aaacatgtct gttttaaaaa ctacc // LOCUS D87735 722 bp mRNA PRI 14-OCT-1996 DEFINITION Human mRNA for ribosomal protein L14, complete cds. ACCESSION D87735 NID g1620021 KEYWORDS ribosomal protein L14. SOURCE Homo sapiens neonatal male umbilical cord vein endothelial cell cell_line:HUE4 cDNA to mRNA, clone_lib:pCMV-SPORT clone:tHUE4-4. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Tanaka,M., Tanaka,T. and Mitsui,Y. TITLE The elongation of alanine of ribosomal protein L14 in immortalization of endothelial cell JOURNAL Unpublished (1996) REFERENCE 2 (bases 1 to 722) AUTHORS Mitsui,Y. TITLE Direct Submission JOURNAL Submitted (05-SEP-1996) to the DDBJ/EMBL/GenBank databases. Youji Mitsui, Agency of Industrial Science and Technology, National Institute of Bioscience and Human-Technology; Higashi 1-1, Tsukuba Science City, Ibaraki 305, Japan (E-mail:ttanaka@is.icc.u-tokai.ac.jp, Tel:+81-298-94-6070, Fax:+81-298-94-6095) COMMENT Sequence updated (12-Sep-1996) by:Youji Mitsui. FEATURES Location/Qualifiers source 1..722 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HUE4" /cell_type="endothelial cell" /clone="tHUE4-4" /clone_lib="pCMV-SPORT" /dev_stage="neonatal" /sex="male" /tissue_type="umbilical cord vein" 5'UTR 1..17 CDS 18..680 /note="putative; similar to rat ribosomal protein L14: EMBL Accession Number X94242" /codon_start=1 /product="ribosomal protein L14" /db_xref="PID:d1014133" /db_xref="PID:g1620022" /translation="MVFRRFVEVGRVAYVSFGPHAGKLVAIVDVIDQNRALVDGPCTQ VRRQAMPFKCMQLTDFILKFLHSAHQKYVRQAWQKADINTKWAATRWAKKIEARERKA KMTDFDRFKVMKAKKMRNRIIKNEVKKLQKAALLKASPKKAPGTKGTAAAAAAAAAAA AAAAKVPAKKITAASKKAPAQKVPAQKATGQKAAPAPKAQKGQKAPAQKAPAPKASGK KA" repeat_region 465..509 /note="elongation of alanine" /rpt_type=TANDEM /rpt_unit=465..467 3'UTR 681..722 BASE COUNT 218 a 172 c 182 g 150 t ORIGIN 1 cgcctaacgc tgccaacatg gtgttcaggc gcttcgtgga ggttggccgg gtggcctatg 61 tctcctttgg acctcatgcc ggaaaattgg tcgcgattgt agatgttatt gatcagaaca 121 gggctttggt cgatggacct tgcactcaag tgaggagaca ggccatgcct ttcaagtgca 181 tgcagctcac tgatttcatc ctcaagtttc tgcacagtgc ccaccagaag tatgtccgac 241 aagcctggca gaaggcagac atcaatacaa aatgggcagc cacacgatgg gccaagaaga 301 ttgaagccag agaaaggaaa gccaagatga cagattttga tcgttttaaa gttatgaagg 361 caaagaaaat gaggaacaga ataatcaaga atgaagttaa gaagcttcaa aaggcagctc 421 tcctgaaagc ttctcccaaa aaagcacctg gtactaaggg tactgctgct gctgctgctg 481 ctgctgctgc tgctgctgct gctgctgcta aagttccagc aaaaaagatc accgccgcga 541 gtaaaaaggc tccagcccag aaggttcctg cccagaaagc cacaggccag aaagcagcgc 601 ctgctccaaa agctcagaag ggtcaaaaag ctccagccca gaaagcacct gctccaaagg 661 catctggcaa gaaagcataa gtggcaatca taaaaagtaa taaaggttct ttttgacctg 721 tt // LOCUS D87810 1219 bp mRNA PRI 17-MAR-1997 DEFINITION Human mRNA for phosphomannomutase, complete cds. ACCESSION D87810 D85231 NID g1549218 KEYWORDS phosphomannomutase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Wada,Y. and Sakamoto,M. TITLE Isolation of the human phosphomannomutase gene (PMM1) and assignment to chromosome 22q13 JOURNAL Genomics 39 (3), 416-417 (1997) MEDLINE 97224476 REFERENCE 2 (bases 1 to 1219) AUTHORS Wada,Y. and Sakamoto,M. TITLE cDNA sequence and chromosomal localization of human phosphomannomutase JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 1219) AUTHORS Wada,Y. TITLE Direct Submission JOURNAL Submitted (12-SEP-1996) to the DDBJ/EMBL/GenBank databases. Yoshinao Wada, Osaka Medical Center for Maternal and Child Health, Department of Molecular Medicine; 840 Murodo-cho, Izumi, Osaka 590-02, Japan (E-mail:j61638a@center.osaka-u.ac.jp, Tel:81-725-56-1220, Fax:81-725-57-3021) COMMENT D85231:submitted (11-May-1996) by Yoshinao Wada. FEATURES Location/Qualifiers source 1..1219 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 16..804 /codon_start=1 /product="phosphomannomutase" /db_xref="PID:d1014151" /db_xref="PID:g1339916" /translation="MAVTAQAARRKERVLCLFDVDGTLTPARQKIDPEVAAFLQKLRS RVQIGVVGGSDYCKIAEQLGDGDEVIEKFDYVFAENGTVQYKHGRLLSKQTIQNHLGE ELLQDLINFCLSYMALLRLPKKRGTFIEFRNGMLNISPIGRSCTLEERIEFSELDKKE KIREKFVEALKTEFAGKGLRFSRGGMISFDVFPEGWDKRYCLDSLDQDSFDTIHFFGN ETSPGGNDFEIFADPRTVGHSVVSPQDTVQRCREIFFPETAHEA" polyA_signal 1203..1208 polyA_site 1219 BASE COUNT 257 a 334 c 373 g 255 t ORIGIN 1 cgcggacctg cagccatggc agtcaccgcc caggcagccc gcaggaagga gcgcgtcctc 61 tgcctgtttg acgtggacgg gaccctcacg ccggctcgcc agaaaattga ccctgaggtg 121 gccgccttcc tgcagaagct acgaagtaga gtgcagatcg gtgtggtggg cggctctgac 181 tactgtaaga tcgctgagca gctgggtgac ggggatgaag tcattgagaa gtttgattat 241 gtgtttgccg agaacgggac ggtgcagtat aagcacggac gactgctctc caagcagacc 301 atccagaacc acctggggga ggagctgctg caggacttga tcaacttctg cctcagctac 361 atggccctgc tcaggctgcc caagaagcgt ggaaccttca tcgagttccg gaatggcatg 421 ctgaacatct cgcccatcgg ccggagctgc accctggagg agaggatcga gttctccgaa 481 ctggacaaga aagagaagat ccgggagaag ttcgtggaag ccctgaaaac agagtttgct 541 ggcaaagggc tgaggttctc tcgaggaggc atgatcagct ttgacgtctt ccccgagggc 601 tgggacaagc gctactgcct ggatagcctg gaccaggaca gcttcgacac catccacttc 661 tttgggaacg agactagccc tggtgggaac gactttgaga tctttgccga cccccggact 721 gttggccaca gcgtggtgtc tcctcaggac acggtgcagc gatgccggga gattttcttc 781 ccagaaacag ctcatgaagc gtgaccgggg cccacatctg tgtgtcgtga cttctgaaga 841 atttggccta ggcctaaaga gaggtcctgg tgttggatag atgccagggc ccttcttttg 901 gcccaggacg cctgctgcaa gcccacccag atggggccag agtctgtgtg gacaaccgtc 961 cccagccagt ttgctcctag tggcactggc ttcgtcctcc cagggcccag agtgttcccc 1021 atgctccacc tggtggccca ggccacagct gctgcttgta tttcggtaca gaagaggttt 1081 ctttctgcac caggaggagg cgtgctcaag tatcggtacg agatctagcc tgccctgcct 1141 gcctgccctg ggcgatgagg tacggtgggg aaggtgccta ttttagagaa ctttgtcaca 1201 gtattaaagt tcccagaac // LOCUS D87845 2547 bp mRNA PRI 16-JAN-1997 DEFINITION Human mRNA for platelet-activating factor acetylhydrolase 2, complete cds. ACCESSION D87845 NID g1765863 KEYWORDS platelet-activating factor acetylhydrolase 2. SOURCE Homo sapiens brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Hattori,K., Adachi,H., Matsuzawa,A., Yamamoto,K., Tsujimoto,M., Aoki,J., Hattori,M., Arai,H. and Inoue,K. TITLE cDNA cloning and expression of intracellular platelet-activating factor (PAF) acetylhydrolase II. Its homology with plasma PAF acetylhydrolase JOURNAL J. Biol. Chem. 271 (51), 33032-33038 (1996) MEDLINE 97115847 REFERENCE 2 (bases 1 to 2547) AUTHORS Hattori,K., Adachi,H., Matsuzawa,A., Yamamoto,K., Tsujimoto,M., Aoki,J., Hattori,M., Arai,H. and Inoue,K. TITLE cDNA Cloning and Expression of Intracellular PAF Acetylhydrolase II JOURNAL Unpublished (1997) REFERENCE 3 (bases 1 to 2547) AUTHORS Hattori,K. TITLE Direct Submission JOURNAL Submitted (11-SEP-1996) to the DDBJ/EMBL/GenBank databases. Kenji Hattori, Faculty of Pharmaceutical Sciences, The Univercity of Tokyo, Department of Health Chemistry; Hongo-7-3-1, Bunkyo-ku, Tokyo 113, Japan (E-mail:khattori@mol.f.u-tokyo.ac.jp, Tel:81-3-3812-2111, Fax:81-3-3818-3173) FEATURES Location/Qualifiers source 1..2547 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" CDS 204..1382 /codon_start=1 /product="platelet-activating factor acetylhydrolase 2" /db_xref="PID:d1014160" /db_xref="PID:g1765864" /translation="MGVNQSVGFPPVTGPHLVGCGDVMEGQNLQGSFFRLFYPCQKAE ETMEQPLWIPRYEYCTGLAEYLQFNKRCGGLLFNLAVGSCRLPVSWNGPFKTKDSGYP LIIFSHGLGAFRTLYSAFCMELASRGFVVAVPEHRDRSAATTYFCKQAPEENQPTNES LQEEWIPFRRVEEGEKEFHVRNPQVHQRVSECLRVLKILQEVTAGQTVFNILPGGLDL MTLKGNIDMSRVAVMGHSFGGATAILALAKETQFRCAVALDAWMFPLERDFYPKARGP VFFINTEKFQTMESVNLMKKICAQHEQSRIITVLGSVHRSQTDFAFVTGNLIGKFFST ETRGSLDPYEGQEVMVRAMLAFLQKHLDLKEDYNQWNNLIEGIGPSLTPGAPHHLSSL " BASE COUNT 621 a 629 c 685 g 612 t ORIGIN 1 ccacgcgtcc gcggacgcgt gggcgagaag tgcttccaag cgtccatttt gagccttgga 61 aactacgacg accaaagggc cacgggttcc tgggtcgttt ctcatttccg tcgagttaaa 121 cgtctggggc tgcttctgag gaatcagctt ggctggccag caagttcagc tccggcaagt 181 catttgattc acccggtgat gaaatggggg tcaaccagtc tgtgggcttt ccacctgtca 241 caggacccca cctcgtaggc tgtggggatg tgatggaggg tcagaatctc caggggagct 301 tctttcgact cttctacccc tgccaaaagg cagaggagac catggagcag cccctgtgga 361 ttccccgcta tgagtactgc actggcctgg ccgagtacct gcagtttaat aagcgctgcg 421 ggggcttgct gttcaacctg gcggtgggat cttgtcgcct gcctgttagc tggaatggcc 481 cctttaagac aaaggactct ggatacccct tgatcatctt ctcccatggc ctaggagcct 541 tcaggacttt gtattcagcc ttctgcatgg agctggcctc acgtggcttt gtggttgctg 601 tgccagagca cagggaccgg tcagcggcaa ccacctattt ctgcaagcag gccccagaag 661 agaaccagcc caccaatgaa tcgctgcagg aggaatggat ccctttccgt cgagttgagg 721 aaggggagaa ggaatttcat gttcggaatc cccaggtgca tcagcgggta agcgagtgtt 781 tacgggtgtt gaagatcctg caagaggtca ctgctgggca gactgtcttc aacatcttgc 841 ctggtggctt ggatctgatg actttgaagg gcaacattga catgagccgt gtggctgtga 901 tgggacattc atttggaggg gccacagcta ttctggcttt ggccaaggag acccaatttc 961 ggtgtgcggt ggctctggat gcttggatgt ttcctctgga acgtgacttt taccccaagg 1021 cccgaggacc tgtgttcttt atcaatactg agaaattcca gacaatggag agtgtcaatt 1081 tgatgaagaa gatatgtgcc cagcatgaac agtctaggat cataaccgtt cttggttctg 1141 ttcatcggag tcaaactgac tttgcttttg tgactggcaa cttgattggt aaattcttct 1201 ccactgaaac ccgtgggagc ctggacccct atgaagggca ggaggttatg gtacgggcca 1261 tgttggcctt cctgcagaag cacctcgacc tgaaagaaga ctataatcaa tggaacaacc 1321 ttattgaagg cattggaccg tcgctcaccc caggggcccc ccaccatctg tccagcctgt 1381 aggcacaact ggccatttgt aaagtcactt cagccaagtt ttcatttggg agctacccaa 1441 gggcacccat gagctcctat caagaagtga tcaacgtgac cccttttcac agattgaaag 1501 gtgtaatcac actgctgctt ggataactgg gtactttgat cttagatttg atcttaaaat 1561 cactttggga ctgggatccc ttgctgattg acaaacagac tttctgggac cttgatggag 1621 tggggaacaa gcagtagagt gggactgggg gagacccagg ccccgggctg agcactgtga 1681 ggcctggatg tgaagactca gcccagcgaa gctcattccc ttacccccgg ccagtgctgc 1741 tgcttcagtg gaagagatga agccaaagga cagaatgaaa atccctacct tcagagactc 1801 tagcccagcc caacaccatc tcttcctacc tctcagcctt ctccctcccc agggccactt 1861 gttgaagtct gagcacttta tgtaaatttc taggtgtgag ccgtgatcac attttctatt 1921 tatttccaag tcttctcatt gtatggaaca tagtactact tatacttaca gtagtaagtt 1981 atacttgtga gcccacagag tggcagacag catggctctc acagcacagg gagaaaaact 2041 gaggtacaca gaggtacctc agaagctctg gatgtctttg ggggttttgc taagtgtatc 2101 ttgataggaa acaacaaaag caggttgaga tggggaagat gacagaacaa cagtgttaaa 2161 tggccatttg cacaggcctt tgccacaaca gagaagtagt ttggtcagct aaaactcagc 2221 tgcagcctgg acagtagagc gagaccccat cttaaaaata aagaaggctg ggcgtggtgg 2281 ctcatgcctg taatcccagc actttgggag gccaaggcag gcagatcact taaggccagg 2341 agttcaagac cacctggcca acatggtgaa accccgtctc tactaaaaat acaaaaaatt 2401 agcctggcgt aatggcaggc gcctataatc ccagctactc aggaggctga agcagaagaa 2461 tcacttgaac ctaggaggcg gaggttgcag tgagtcaaga tcgcgccact gcactccagc 2521 ctgggtgaca gagcaagact ctgtctt // LOCUS D87920 2275 bp mRNA PRI 23-JAN-1998 DEFINITION Homo sapiens mRNA for sodium iodide symporter, complete cds. ACCESSION D87920 NID g2804569 KEYWORDS sodium iodide symporter. SOURCE Homo sapiens Thyroid gland cDNA to mRNA, clone_lib:clontech human thyroid lambda gt11 cDNA library. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Saito,T., Endo,T., Kawaguchi,A., Ikeda,M. and Onaya,T. TITLE Molecular cloning of the human Na+/I- symporter JOURNAL Unpublished (1996) REFERENCE 2 (bases 1 to 2275) AUTHORS Saito,T. TITLE Direct Submission JOURNAL Submitted (17-SEP-1996) to the DDBJ/EMBL/GenBank databases. Tsukasa Saito, Yamanashi Medical University, Third Department of Internal Medicine; Shimokato 1110, Tamaho, Yamanashi 409-38, Japan (E-mail:tsaito@res.yamanashi-med.ac.jp, Tel:0552-73-1111, Fax:0552-73-7108) COMMENT Sequence updated (30-May-1997). FEATURES Location/Qualifiers source 1..2275 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="clontech human thyroid lambda gt11 cDNA library" /tissue_type="Thyroid gland" CDS 220..2151 /codon_start=1 /product="sodium iodide symporter" /db_xref="PID:d1025402" /db_xref="PID:g2804570" /translation="MEAVETGERPTFGAWDYGVFALMLLVSTGIGLWVGLARGGQRSA EDFFTGGRRLAALPVGLSLSASFMSAVQVLGVPSEAYRYGLKFLWMCLGQLLNSVLTP LLFMPVFYRLGLTSTYEYLEMRFSRAVRLCGTLQYIVATMLYTGIVIYAPALILNQVT GLDIWASLLSTGIICTFYTAVGGMKAVVWTDVFQVVVMLSGFWVVLARGVMLVGGPRQ VLTLAQNHSRINLMDFNPDPRSRYTFWTFVVGGTLVWLSMYGVNQAQVQRYVACRTEK QAKLALLINQVGLFLIVSSAACCGIVMFVFYTDCDPLLLGRISAPDQYMPLLVLDIFE DLPGVPGLFLACAYSGTLSTASTSINAMAAVTVEDLIKPRLRSLAPRKLVIISKGLSL IYGSACLTVAALSSLLGGGVLQGSFTVMGVISGPLLGAFILGMFLPACNTPGVLAGLG AGLALSLRVALGATLYPPSEQTMRVLPSSAARCVALSVNASGLLDPALLPANDSSRAP SSGMDASRPALADSFYAISYLYYGALGQLTTVLCGALISCLTGPTKRQTLAPGLLWWD LARQTASVAPKEEVAILDDNLVKGPEELPTGNKKPPGFLPTNEDRLFFLGQKELEGAG SWTPCVGHDGGRDQQETNL" BASE COUNT 367 a 763 c 685 g 460 t ORIGIN 1 ccgcggggac agggaggccg acacggacat cgacagccca tagattccta acccagggag 61 ccccggcccc tctcgccgct tcccacccca gacggagcgg ggacaggctg ccgagcatcc 121 tcccacccgc cctccccgtc ctgcctcctc ggcccctgcc agcttccccc gcttgagcac 181 gcagggctcc gaggacgctg ggcctccgca cccgccctca tggaggccgt ggagaccggg 241 gaacggccca ccttcggagc ctgggactac ggggtctttg ccctcatgct cctggtgtcc 301 actggcatcg ggctgtgggt cgggctggct cggggcgggc agcgcagcgc tgaggacttc 361 ttcaccgggg gccggcgcct ggcggccctg cccgtgggcc tgtcgctgtc tgccagcttc 421 atgtcggccg tgcaggtgct gggcgtgccg tcggaggcct atcgctatgg cctcaagttc 481 ctctggatgt gcctgggcca gcttctgaac tcggtcctca cgcccttgct cttcatgccc 541 gtcttctacc gcctgggcct caccagcacc tacgagtacc tggagatgcg cttcagccgc 601 gcagtgcggc tctgcgggac tttgcagtac attgtagcca cgatgctgta caccggcatc 661 gtaatctacg caccggccct catcctgaac caagtgaccg ggctggacat ctgggcgtcg 721 ctcctgtcca ccggaattat ctgcaccttc tacacggctg tgggcggcat gaaggctgtg 781 gtctggactg atgtgttcca ggtcgtggtg atgctaagtg gcttctgggt tgtcctggca 841 cgcggtgtca tgcttgtggg cgggccccgc caggtgctca cgctggccca gaaccactcc 901 cggatcaacc tcatggactt taaccctgac ccgaggagcc gctatacatt ctggactttt 961 gtggtgggtg gcacgttggt gtggctctcc atgtatggcg tgaaccaggc gcaggtgcag 1021 cgctacgtgg cttgccgcac agagaagcag gccaagctgg ccctgctcat caaccaggtc 1081 ggcctgttcc tgatcgtgtc cagcgctgcc tgctgtggca tcgtcatgtt tgtgttctac 1141 actgactgcg accctctcct cctggggcgc atctctgccc cagaccagta catgcctctg 1201 ctggtgctgg acatcttcga agatctgcct ggagtccccg ggcttttcct ggcctgtgct 1261 tacagtggca ccctcagcac agcatccacc agcatcaatg ctatggctgc agtcactgta 1321 gaagacctca tcaaacctcg gctgcggagc ctggcaccca ggaaactcgt gattatctcc 1381 aaggggctct cactcatcta cggatcggcc tgtctcaccg tggcagccct gtcctcactg 1441 ctcggaggag gtgtccttca gggctccttc accgtcatgg gagtcatcag cggccccctg 1501 ctgggagcct tcatcttggg aatgttcctg ccggcctgca acacaccggg cgtcctggcg 1561 ggactaggcg cgggcttggc gctgtcgctg cgggtggcct tgggcgccac gctgtaccca 1621 cccagcgagc agaccatgag ggtcctgcca tcgtcggctg cccgctgcgt ggctctctca 1681 gtcaacgcct ctggcctcct ggacccggct ctcctccctg ctaacgactc cagcagggcc 1741 cccagctcag gaatggacgc cagccggccc gccttagctg acagcttcta tgccatctcc 1801 tatctctatt acggtgccct gggccagctg accactgtgc tgtgcggagc cctcatcagc 1861 tgcctgacag gccccaccaa gcgccagacc ctggccccgg gattgttgtg gtgggacctc 1921 gcacggcaga cagcatcagt ggcccccaag gaagaagtgg ccatcctgga tgacaacttg 1981 gtcaagggtc ctgaagaact ccccactgga aacaagaagc cccctggctt cctgcccacc 2041 aatgaggatc gtctgttttt cttggggcag aaggagctgg agggggctgg ctcttggacc 2101 ccctgtgttg gacatgatgg tggtcgagac cagcaggaga caaacctctg atggtggtcg 2161 agaccagcag gagacaaacc tctgaggaca gggccagccg cgggactgac accctgggat 2221 ggaacctcag gatgggccaa acccagacaa cgggcccatg gcttgggctc tgatt // LOCUS D87930 4613 bp mRNA PRI 02-OCT-1997 DEFINITION Homo sapiens mRNA for myosin phosphatase target subunit 1 (MYPT1). ACCESSION D87930 NID g2443337 KEYWORDS myosin phosphatase target subunit 1. SOURCE Homo sapiens adult Brain Liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Takahashi,N., Ito,M., Tanaka,J., Nakano,T., Kaibuchi,K., Odai,H. and Takemura,K. TITLE Localization of the gene coding for myosin phosphatase, target subunit 1 (MYPT1) to human chromosome 12q15-q21 JOURNAL Genomics 44 (1), 150-152 (1997) MEDLINE 97432834 REFERENCE 2 (bases 1 to 4613) AUTHORS Takahashi,N. TITLE Direct Submission JOURNAL Submitted (17-SEP-1996) to the DDBJ/EMBL/GenBank databases. Nobuaki Takahashi, Kirin brewery co., ltd., centarl laboratries for key technology; 1-13-5, Fukuura kanazawa-ku, yokohama, kanagawa 236, Japan (E-mail:ntakahashi@kirin.co.jp, Tel:81-45-788-7200) FEATURES Location/Qualifiers source 1..4613 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="12" /dev_stage="adult" /map="12q15-q21.2" /tissue_type="Brain Liver" CDS 1..3093 /note="MYPT1" /codon_start=1 /product="myosin phosphatase target subunit 1" /db_xref="PID:d1023241" /db_xref="PID:g2443338" /translation="MKMADAKQKRNEQLKRWIGSETDLEPPVVKRQKTKVKFDDGAVF LAACSSGDTDEVLKLLHRGADINYANVDGLTALHQACIDDNVDMVKFLVENGANINQP DNEGWIPLHAAASCGYLDIAEFLIGQGAHVGAVNSEGDTPLDIAEEEAMEELLQNEVN RQGVDIEAARKEEERIMLRDARQWLNSGHINDVRHAKSGGTALHVAAAKGYTEVLKLL IQAGYDVNIKDYDGWTPLHAAAHWGKEEACRILVDNLCDMEMVNKVGQTAFDVADEDI LGYLEELQKKQNLLHSEKRDKKSPLIESTANMDNNQSQKTFKNKETLIIEPEKNASRI ESLEQEKVDEEEEGKKDESSCSSEEDEEDDSESEAETDKTKPLASVTNANTSSTQAAP VAVTTPTVSSGQATPTSPIKKFPTTATKISPKEEERKDESPATWRLGLRKTGSYGALA EITASKEGQKEKDTAGVTRSASSPRLSSSLDNKEKEKDSKGTRLAYVAPTIPRRLAST SDIEEKENRDSSSLRTSSSYTRRKWEDDLKKNSSVNEGSTYHKSCSFGRRQDDLISSS VPSTTSTPTVTSAAGLQKSLLSSTSTTTKITTGSSSAGTQSSTSNRLWAEDSTEKEKD SVPTAVTIPVAPTVVNAAASTTTLTTTTAGTVSSTTEVRERRRSYLTPVRDEESESQR KARSRQARQSRRSTQGVTLTDLQEAEKTIGRSRSTRTREQENEEKEKEEKEKQDKEKQ EEKKESETSREDEYKQKYSRTYDETYQRYRPVSTSSSTTPSSSLSTMSSSLYASSQLN RPNSLVGITSAYSRGITKENEREGEKREEEKEGEDKSQPKSIRERRRPREKRRSTGVS FWTQDSDENEQEQQSDTEEGSNKKETQTDSISRYETSSTSAGDRYDSLLGRSGSYSYL EERKPYSSRLEKDDSTDFKKLYEQILAENEKLKAQLHDTNMELTDLKLQLEKATQRQE RFADRSLLEMEKRERRALERRISEMEEELKMLPDLKADNQRLKDENGALIRVISKLSK " BASE COUNT 1579 a 831 c 965 g 1238 t ORIGIN 1 atgaagatgg cggacgcgaa gcagaagcgg aacgagcagc tgaaacgctg gatcggctcc 61 gagacggacc tcgagcctcc ggtggtgaag cgccagaaga ccaaggtgaa gttcgacgat 121 ggcgccgtct tcctggctgc ttgctccagc ggcgacacgg acgaggtcct caagctgctg 181 caccgcggcg ccgacatcaa ttacgccaat gtggacggac tcactgccct gcaccaggct 241 tgcattgatg acaatgttga tatggtgaag tttctggtag aaaatggagc aaatattaat 301 caacctgata atgaaggctg gataccacta catgcagcag cttcctgtgg atatcttgat 361 attgcagagt ttttgattgg tcaaggagca catgtagggg ctgtcaacag tgaaggagat 421 acacctttag atattgcgga ggaggaggca atggaagagc tacttcaaaa tgaagttaat 481 cggcaagggg ttgatataga agcagctcga aaggaagaag aacggatcat gcttagagat 541 gccaggcagt ggctaaatag tggtcatata aatgatgtcc ggcatgcaaa atctggaggt 601 acagcacttc acgttgcagc tgctaaaggc tatacggaag ttttaaaact tttaatacag 661 gcaggctatg atgttaatat taaagactat gatggctgga cacctcttca tgctgcagct 721 cattggggta aagaagaagc atgtcgaatt ttagtggaca atctgtgtga tatggagatg 781 gtcaacaaag tgggccaaac agcctttgat gtagcagatg aagacatttt aggatattta 841 gaagagttgc aaaagaaaca aaatctgctc catagtgaaa aacgggacaa gaaatctcca 901 ctaattgaat caacagcaaa tatggacaat aatcagtcac agaagacctt taaaaacaaa 961 gagacgttga ttattgaacc agagaaaaat gcatcccgta ttgaatctct ggaacaagaa 1021 aaggttgatg aagaagaaga aggaaagaag gatgagtcta gctgctctag tgaagaagat 1081 gaggaagatg actcggaatc agaagctgaa acagataaga caaaacccct ggcttctgta 1141 actaatgcca acacttctag tacacaagca gctcctgtag ctgttacaac acctactgtg 1201 tcatcaggtc aagcaacacc tacatcacct attaaaaagt ttccaaccac agctacaaaa 1261 atttctccca aagaagaaga gagaaaagat gagtctcctg caacttggag gttaggactt 1321 agaaagacgg gcagctatgg tgcacttgct gaaatcacag catctaaaga gggtcagaaa 1381 gaaaaagata ctgcaggtgt tacacgttca gcttcaagtc ccagactttc ctcctctttg 1441 gataataaag aaaaggagaa agatagtaaa ggaactaggc ttgcatatgt tgcacctaca 1501 ataccaagac gactagccag tacatctgac attgaagaga aagaaaacag agattcttca 1561 agtttgcgaa caagtagttc atatacaagg agaaaatggg aagatgatct taaaaaaaat 1621 agctcagtta atgaaggatc aacgtatcat aaaagttgct cctttggtag aagacaagat 1681 gatttgatta gttctagtgt tccaagcacc acatcaacac caacagttac ctctgcagct 1741 gggcttcaga aaagcctgct ttccagcaca agcactacta caaagattac aacgggttct 1801 tcctcagcag gcacacaaag cagtacctca aatcgtttgt gggctgagga tagtactgag 1861 aaagaaaagg acagtgttcc tacggcagtg accattcctg ttgctccaac tgttgtaaat 1921 gctgcagctt ctaccacaac cctgactaca actactgctg gcactgtctc ctccacaaca 1981 gaggtcaggg agagacgcag atcatacctc actcctgtta gggatgaaga gtctgaatcc 2041 caaagaaaag caagatctag acaagcaaga caatctagaa gatcaacaca gggagtgaca 2101 ttaactgatc ttcaagaagc tgagaaaaca ataggaagaa gtcgttctac ccgaaccaga 2161 gaacaagaaa atgaagaaaa agaaaaagag gaaaaagaga aacaagataa agagaaacaa 2221 gaagaaaaga aggagtcaga aacatctaga gaagatgaat ataaacaaaa gtactccaga 2281 acgtatgatg agacttacca gcgttatagg ccagtatcaa cttcaagttc aaccactcca 2341 tcctcttcac tttctactat gagcagttca ctgtatgctt caagtcaact aaacaggcca 2401 aatagtcttg taggcataac ttctgcttac tccagaggaa taacaaaaga aaatgaaaga 2461 gagggagaaa aaagagaaga ggagaaagaa ggagaagata aatcacaacc taaatcaatc 2521 agagaacgac gacgaccaag agagaaaaga agatctacag gagtttcatt ttggacacaa 2581 gatagtgatg aaaatgaaca agaacaacaa tcagacacag aagagggatc caataagaaa 2641 gaaactcaga cggattccat ttctagatat gaaaccagtt ctacatcagc tggtgatcga 2701 tatgattcct tgctgggtcg ctctggatca tacagttact tagaagaaag aaaaccttac 2761 agcagcaggc tagaaaagga tgactcaact gactttaaaa agctttatga acaaattcta 2821 gctgaaaatg aaaagctgaa ggcacagcta catgatacaa atatggaact aacagatctt 2881 aaattacagt tggaaaaggc cacccagaga caagaaagat ttgctgatag atcactgttg 2941 gaaatggaaa aaagggaacg aagagctcta gaaagaagaa tatctgaaat ggaagaagag 3001 ctcaaaatgt taccagacct aaaagcagac aaccagaggc taaaggatga aaatggggcc 3061 ttgatcagag ttataagcaa actttccaaa taaaaaaaaa aaagcagcaa gtaatggaat 3121 tgcacatatt agtaacccag tggaccataa ttggcagtca ctggaagtct gggaagaatc 3181 cttggagact gtcattttcg gatatcctgc caaatgccct cttatctaga atttttgttt 3241 cattttgttt aattttctgg ggtgtttttg ttgttgttgg tttgtttttt gttttttttt 3301 ttaatcaaga ccattgtttc atgttaatgc agctgctgag aagatttttt tttaatgact 3361 gagaaaactt gtttacagct ccagcatata aggaaagtgt tcaaggccag atatgcctca 3421 gatatttaac cagtaagcct tagttgtaca taaatacttt tgtgtcaaca aaaactttca 3481 gctctcacag aagacagtta ctcaacattt tttgatgtgc cacagtttcg agtttttcga 3541 tatttaaatt ttttggcttt tcatctaagt ttgggtttgt attttttcct tctaaactct 3601 tcatgtggca gagtcttcta tgttttcacg gctttttcat tacagaaaag aacacttgct 3661 cttctgtgat tattgtcatg tattaggcta atgctgtgtt gtctcccacc tggaactgaa 3721 ttgcttggtg gaacatatgc tttcactgtt tgtgcaatat gcatttattt cttatatgaa 3781 tgctttaaag tcatttgagg ttagatcttt taattcctat tttctgcttc attggtcact 3841 ttttttttat tgtagtataa gatgttagat tctgtaatct tcacattcat tttagcaggt 3901 actgagtgat gctgtatata caaataagtg tattgttttg atttttagac caccacatgg 3961 catgcttgac tatttcttat ttcaaatgtc tgctaatgca gagtaggcta ctccatgata 4021 gtgttaaaaa acaaaatttg ctaacaatgt gatataaaga ctttaaaagt tacacattat 4081 gtggagccct atctttacaa aagtttccta ctgtaaagtg cttttatttt cagttttcat 4141 ttgatagtac tcaaccataa ttaaagttgc ataagataat tgctttacat ttcacatacc 4201 tatatttatc tgagtgctgt ctaaaactgt tgtgctagcc aaagtaatgc tatgaaatca 4261 tttgcagaat taacccgtga gttaatgtta aatgcactgt tattgccatg tgaagaggca 4321 tcgactttga taccaccatc atgttcagac cattttatac atttcagtgg cctttttttt 4381 tttaaggaaa aaaaagcgca aaaccaagta catagtgacg atggctttta tttggacaaa 4441 tagcttttat attttcatta aaccatgcaa aaaatactac atctttctgg cacataactg 4501 tctccttaac cactggaaca gttcagccat ttgaataaat tgtacattgt aaagcttata 4561 gtagctgatt gtattattga ttgtattgta tactatatta aatgtgaatt tgt // LOCUS D87953 3056 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for RTP, complete cds. ACCESSION D87953 NID g1596166 KEYWORDS RTP. SOURCE Homo sapiens umbilical vein endothelium endothelial cell cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3056) AUTHORS Kokame,K., Kato,H. and Miyata,T. TITLE Direct Submission JOURNAL Submitted (19-SEP-1996) to the DDBJ/EMBL/GenBank databases. Koichi Kokame, National Cardiovascular Center Research Institute, Department of Etiology and Pathogenesis; Fujishirodai 5-7-1, Suita, Osaka 565, Japan (E-mail:kame@ri.ncvc.go.jp, Tel:+81-6-833-5012, Fax:+81-6-872-8091) REFERENCE 2 (sites) AUTHORS Kokame,K., Kato,H. and Miyata,T. TITLE Homocysteine-respondent genes in vascular endothelial cells identified by differential display analysis. GRP78/BiP and novel genes JOURNAL J. Biol. Chem. 271 (47), 29659-29665 (1996) MEDLINE 97094664 FEATURES Location/Qualifiers source 1..3056 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="endothelial cell" /tissue_type="umbilical vein endothelium" gene 123..1307 /gene="GC4" CDS 123..1307 /gene="GC4" /codon_start=1 /product="RTP" /db_xref="PID:d1014198" /db_xref="PID:g1596167" /translation="MSREMQDVDLAEVKPLVEKGETITGLLQEFDVQEQDIETLHGSV HVTLCGTPKGNRPVILTYHDIGMNHKTCYNPLFNYEDMQEITQHFAVCHVDAPGQQDG AASFPAGYMYPSMDQLAEMLPGVLQQFGLKSIIGMGTGAGAYILTRFALNNPEMVEGL VLINVNPCAEGWMDWAASKISGWTQALPDMVVSHLFGKEEMQSNVEVVHTYRQHIVND MNPGNLHLFINAYNSRRDLEIERPMPGTHTVTLQCPALLVVGDSSPAVDAVVECNSKL DPTKTTLLKMADCGGLPQISQPAKLAEAFKYFVQGMGYMPSASMTRLMRSRTASGSSV TSLDGTRSRSHTSEGTRSRSHTSEGTRSRSHTSEGAHLDITPNSGAAGNSAGPKSMEV SC" BASE COUNT 706 a 844 c 811 g 695 t ORIGIN 1 cccagctggt gctgaagctc gtcagttcac catccgccct cggcttccgc ggggcgctgg 61 gccgccagcc tcggcaccgt cctttccttt ctccctcgcg ttaggcaggt gacagcaggg 121 acatgtctcg ggagatgcag gatgtagacc tcgctgaggt gaagcctttg gtggagaaag 181 gggagaccat caccggcctc ctgcaagagt ttgatgtcca ggagcaggac atcgagactt 241 tacatggctc tgttcacgtc acgctgtgtg ggactcccaa gggaaaccgg cctgtcatcc 301 tcacctacca tgacatcggc atgaaccaca aaacctgcta caaccccctc ttcaactacg 361 aggacatgca ggagatcacc cagcactttg ccgtctgcca cgtggacgcc cctggccagc 421 aggacggcgc agcctccttc cccgcagggt acatgtaccc ctccatggat cagctggctg 481 aaatgcttcc tggagtcctt caacagtttg ggctgaaaag cattattggc atgggaacag 541 gagcaggcgc ctacatccta actcgatttg ctctaaacaa ccctgagatg gtggagggcc 601 ttgtccttat caacgtgaac ccttgtgcgg aaggctggat ggactgggcc gcctccaaga 661 tctcaggatg gacccaagct ctgccggaca tggtggtgtc ccaccttttt gggaaggaag 721 aaatgcagag taacgtggaa gtggtccaca cctaccgcca gcacattgtg aatgacatga 781 accccggcaa cctgcacctg ttcatcaatg cctacaacag ccggcgcgac ctggagattg 841 agcgaccaat gccgggaacc cacacagtca ccctgcagtg ccctgctctg ttggtggttg 901 gggacagctc gcctgcagtg gatgccgtgg tggagtgcaa ctcaaaattg gacccaacaa 961 agaccactct cctcaagatg gcggactgtg gcggcctccc gcagatctcc cagccggcca 1021 agctcgctga ggccttcaag tacttcgtgc agggcatggg atacatgccc tcggctagca 1081 tgacccgcct gatgcggtcc cgcacagcct ctggttccag cgtcacttct ctggatggca 1141 cccgcagccg ctcccacacc agcgagggca cccgaagccg ctcccacacc agcgagggca 1201 cccgcagccg ctcgcacacc agcgaggggg cccacctgga catcaccccc aactcgggtg 1261 ctgctgggaa cagcgccggg cccaagtcca tggaggtctc ctgctaggcg gcctgcccag 1321 ctgccgcccc cggactctga tctctgtagt ggccccctcc tccccggccc cttttcgccc 1381 cctgcctgcc atactgcgcc taactcggta ttaatccaaa gcttattttg taagagtgag 1441 ctctggtgga gacaaatgag gtctattacg tgggtgccct ctccaaaggc ggggtggcgg 1501 tggaccaaag gaaggaagca agcatctccg catcgcatcc tcttccatta accagtggcc 1561 ggttgccact ctcctcccct ccctcagaga caccaaactg ccaaaaacaa gacgcgtagc 1621 agcacacact tcacaaagcc aagcctaggc cgccctgagc atcctggttc aaacgggtgc 1681 ctggtcagaa ggccagccgc ccacttcccg tttcctcttt aactgaggag aagctgatcc 1741 agtttccgga aacaaaatcc ttttctcatt tggggagggg ggtaatagtg acatgcaggc 1801 acctctttta aacaggcaaa acaggaaggg ggaaaaggtg ggattcatgt cgaggctaga 1861 ggcatttgga acaacaaatc tacgtagtta acttgaagaa accgattttt aaagttggtg 1921 catctagaaa gctttgaatg cagaagcaaa caagcttgat ttttctagca tcctcttaat 1981 gtgcagcaaa agcaggcaac aaaatctcct ggctttacag acaaaaatat ttcagcaaac 2041 gttgggcatc atggtttttg aaggctttag ttctgctttc tgcctctcct ccacagcccc 2101 aacctcccac ccctgataca tgagccagtg attattcttg ttcagggaga agatcattta 2161 gatttgtttt gcattcctta gaatggaggg caacattcca cagctgccct ggctgtgatg 2221 agtgtccttg caggggccgg agtaggagca ctggggtggg ggcggaattg gggttactcg 2281 atgtaaggga ttccttgttg ttgtgttgag atccagtgca gttgtgattt ctgtggatcc 2341 cagcttggtt ccaggaattt tgtgtgattg gcttaaatcc agttttcaat cttcgacagc 2401 tgggctggaa cgtgaactca gtagctgaac ctgtctgacc cggtcacgtt cttggatcct 2461 cagaactctt tgctcttgtc ggggtggggg tgggaactca cgtggggagc ggtggctgag 2521 aaaatgtaag gattctggaa tacatattcc atgggacttt ccttccctct cctgcttcct 2581 cttttcctgc tccctaacct ttcgccgaat ggggcagcac cactgacgtt tctgggcggc 2641 cagtgcggct gccaggttcc tgtactactg ccttgtactt ttcattttgg ctcaccgtgg 2701 attttctcat aggaagtttg gtcagagtga attgaatatt gtaagtcagc cactgggacc 2761 cgaggatttc tgggaccccg cagttgggag gaggaagtag tccagccttc caggtggcgt 2821 gagaggcaat gactcgttac ctgccgccca tcaccttgga ggccttccct ggccttgagt 2881 agaaaagtcg gggatcgggg caagagaggc tgagtacgga tgggaaacta ttgtgcacaa 2941 gtctttccag aggagtttct taatgagata tttgtattta tttccagacc aataaatttg 3001 taactttgca gcggaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa // LOCUS D87957 1658 bp DNA PRI 03-FEB-1998 DEFINITION Homo sapiens gene for protein involved in sexual development, complete cds. ACCESSION D87957 NID g1620897 KEYWORDS protein involved in sexual development. SOURCE Homo sapiens male foreskin fibroblast DNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Okazaki,N., Okazaki,K., Watanabe,Y., Kato-Hayashi,M., Yamamoto,M. and Okayama,H. TITLE Novel factor highly conserved among eukaryotes controls sexual development in fission yeast JOURNAL Mol. Cell. Biol. 18 (2), 887-895 (1998) MEDLINE 98107674 REFERENCE 2 (bases 1 to 1658) AUTHORS Okazaki,N. TITLE Direct Submission JOURNAL Submitted (22-SEP-1996) to the DDBJ/EMBL/GenBank databases. Noriko Okazaki, The Okayama Cell Switching Project, ERATO, JRDC; 103-5 Tanakamonzen-cho, Sakyo-ku, Kyoto, Kyoto 606, Japan (E-mail:okayamap@mbox.kyoto-inet.or.jp, Tel:075-712-5406, Fax:075-712-5492) FEATURES Location/Qualifiers source 1..1658 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="fibroblast" /sex="male" /tissue_type="foreskin" CDS 150..1049 /note="protein involved in sexual development" /codon_start=1 /db_xref="PID:d1014201" /db_xref="PID:g1620898" /translation="MHSLATAAPVPTTLAQVDREKIYQWINELSSPETRENALLELSK KRESVPDLAPMLWHSFGTIAALLQEIVNIYPSINPPTLTAHQSNRVCNALALLQCVAS HPETRSAFLAAHIPLFLYPFLHTVSKTRPFEYLRLTSLGVIGALVKTDEQEVINFLLT TEIIPLCLRIMESGSELSKTVATFILQKILLDDTGLAYICQTYERFSHVAMILGKMVL QLSKEPSARLLKHVVRCYLRLSDNPRAREALRQCLPDQLKDTTFAQVLKDDTTTKRWL AQLVKNLQEGQVTDPRGIPLPPQ" BASE COUNT 410 a 422 c 412 g 414 t ORIGIN 1 tgagaggtca gagggccgcg aagtgggcgg agcgagccgg agtcggatgg cggctacggc 61 ggctcattat tttccgctgc aggggtgctg aaggggggac gcgggtcgga cgcgtccggc 121 tgtggaagag agcggcggcc gctcacaaca tgcacagcct ggcgacggct gcgcctgtgc 181 ctactacact ggcacaagtg gatagagaaa agatctatca gtggatcaat gagctgtcca 241 gtcctgagac tagggaaaat gctttgctgg agctaagtaa gaagcgagaa tctgttcctg 301 accttgcacc catgctgtgg cattcatttg gtactattgc agcactttta caggaaattg 361 taaatattta tccatctatc aacccaccca ccttgacagc acaccagtct aacagagttt 421 gcaatgctct ggcattactg caatgtgtag catcacatcc agaaaccagg tcagcgtttc 481 tcgcagcaca catcccactt tttttgtacc cctttttgca cactgtcagc aaaacacgtc 541 cctttgagta tctccggctc accagccttg gagttattgg ggccctggtg aaaacagatg 601 aacaagaagt aatcaacttt ttattaacaa cagaaattat ccctttatgt ttgcgaatta 661 tggaatctgg aagtgaactt tctaaaacag ttgccacatt catcctccag aagatcttgt 721 tagatgacac tggtttggct tatatatgtc agacgtatga gcgtttctcc catgttgcca 781 tgatcttggg taagatggtc ctgcagctat ccaaagagcc ttctgcccgt ctgctgaagc 841 atgtagtgag atgttacctt cgactttcag ataaccccag ggcacgtgaa gcactcagac 901 agtgcctccc tgaccagctg aaagacacaa ccttcgccca ggtgctaaaa gatgacacca 961 ccacgaaacg ctggcttgca caactggtga agaacctgca agagggccag gtcaccgatc 1021 cccggggtat ccccctgccc cctcagtgat ccttccctgt tccctcccac tactccccca 1081 agttggggaa aggaggggga acctacgaga aaaacagctc aggttttatc accgactggg 1141 aatagacaac ctcaatgctg aaccgcactg gagaaaaggg gcaaggtacc cctgctgagg 1201 tgtatgggct gccatctcag gctgtcttga ggacctgggc tccctctgct actcccagga 1261 aatgggctcc tgacacagca gtctgccacc acagccccag gagggtgtca acaccagcaa 1321 atgctgtatt tgcagcatgt ccaagatgac ccttctcccc tacctctacc tagccactgg 1381 cagggagggg agacagtggt gatagcagca gcactctagg catggtgaac gcctgggacc 1441 aagccatgtg gcgtttttta ttttgccttt ctggaagact caagatatgt ctcttcattc 1501 tctctcagta tttgtttact ttggtttttt tgtttttaat ctcagagaga ggtgtgttta 1561 gtgggcacaa gctgtaatat tcagcaaaac tttgtcgact ggcactgttt acaagtttgt 1621 tagctgcata agctcaataa aaagttggtt tgggcatt // LOCUS D87969 1777 bp mRNA PRI 18-MAR-1997 DEFINITION Human mRNA for CMP-sialic acid transporter, complete cds. ACCESSION D87969 NID g1694636 KEYWORDS CMP-sialic acid transporter. SOURCE Homo sapiens 73 days post natal infant brain brain cDNA to mRNA, clone:position 103.. 1777 is from I.M.A.G.E Consortium Clone ID 44875. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1777) AUTHORS Ishida,N. TITLE Direct Submission JOURNAL Submitted (20-SEP-1996) to the DDBJ/EMBL/GenBank databases. Nobuhiro Ishida, The Tokyo Metropolitan Institute of Medical Science, The Physiological Chemistry; 18-22, Honkomagome 3-chome, Bunkyo-ku, Tokyo 113, Japan (E-mail:ishidan@rinshoken.or.jp, Tel:03-3823-2101, Fax:03-3823-2965) REFERENCE 2 (bases 1 to 1777) AUTHORS Ishida,N., Miura,N., Yoshioka,S. and Kawakita,M. TITLE Molecular cloning and characterization of a novel isoform of the human UDP-galactose transporter, and of related complementary DNAs belonging to the nucleotide-sugar transporter gene family JOURNAL J. Biochem. 120 (6), 1074-1078 (1996) MEDLINE 97164005 REFERENCE 3 (bases 1 to 1777) AUTHORS Ishida,N. TITLE Molecular cloning and characterization of an isoform and the related genes of human UDP-galactose translocator JOURNAL Unpublished (1996) REFERENCE 4 (sites) AUTHORS Lennon,G., Auffray,C., Polymeropoulos,M. and Soares,M.B. TITLE The I.M.A.G.E. Consortium: an integrated molecular analysis of genomes and their expression JOURNAL Genomics 33 (1), 151-152 (1996) MEDLINE 96224170 FEATURES Location/Qualifiers source 1..1777 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="position 103.. 1777 is from I.M.A.G.E Consortium Clone ID 44875" /dev_stage="73 days post natal infant brain" /tissue_type="brain" source 1..169 /note="5'RACE products from normal, whole liver from a 33 year old Caucasian female" /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="33 year old" /sex="female" /tissue_lib="Clontech Human Liver 5'-RACE Ready cDNA" /tissue_type="liver" CDS 28..1041 /codon_start=1 /product="CMP-sialic acid transporter" /db_xref="PID:g1669558" /translation="MAAPRDNVTLLFKLYCLAVMTLMAAVYTIALRYTRTSDKELYFS TTAVCITEVIKLLLSVGILAKETGSLGRFKASLRENVLGSPKELLKLSVPSLVYAVQN NMAFLALSNLDAAVYQVTYQLKIPCTALCTVLMLNRTLSKLQWVSVFMLCAGVTLVQW KPAQATKVVVEQNPLLGFGAIAIAVLCSGFAGVYFEKVLKSSDTSLWVRNIQMYLSGI IVTLAGVYLSDGAEIKEKGFFYGYTYYVWFVIFLASVGGLYTSVVVKYTDNIMKGFSA AAAIVLSTIASVMLFGLQITLTFALGTLLVCVSIYLYGLPRQDTTSIQQGETASKERV IGV" polyA_signal 1749..1754 polyA_site 1777 /note="33 A nucleotides" BASE COUNT 519 a 320 c 375 g 563 t ORIGIN 1 agttccgcgg ggggctgtcg gggaaccatg gctgccccga gagacaatgt cactttatta 61 ttcaagttat actgcttggc agtgatgacc ctgatggctg cagtctatac catagcttta 121 agatacacaa ggacatcaga caaagaactc tacttttcaa cgacagccgt gtgtatcaca 181 gaagttataa agttattgct aagtgtggga attttagcta aagaaactgg tagtctgggt 241 agattcaaag catctttaag agaaaatgtc ttggggagcc ccaaggaact gttgaagtta 301 agtgtgccat cgttagtgta tgctgttcag aacaacatgg ctttcctagc tcttagcaat 361 ctggatgcag cagtgtacca ggtgacctac cagttgaaga ttccgtgtac tgctttatgc 421 actgttttaa tgttaaatcg gacactcagc aaattacagt gggtttcagt ttttatgctg 481 tgtgctggag ttacgcttgt acagtggaaa ccagcccaag ctacaaaagt ggtggtggaa 541 caaaatccat tattagggtt tggcgctata gctattgctg tattgtgctc aggatttgca 601 ggagtatatt ttgaaaaagt tttaaagagt tcagatactt ctctttgggt gagaaacatt 661 caaatgtatc tatcagggat tattgtgaca ttagctggcg tctacttgtc agatggagct 721 gaaattaaag aaaaaggatt tttctatggt tacacatatt atgtctggtt tgtcatcttt 781 cttgcaagtg ttggtggcct ctacacttct gttgtggtta agtacacaga caacatcatg 841 aaaggctttt ctgcagcagc ggccattgtc ctttccacca ttgcttcagt aatgctgttt 901 ggattacaga taacactcac ctttgccctg ggtactcttc ttgtatgtgt ttccatatat 961 ctctatggat tacccagaca agacactaca tccatccaac aaggagaaac agcttcaaag 1021 gagagagtta ttggtgtgtg attttagcct cacgtgagac tccttttaag actaaaccat 1081 ttgcattaaa ctagagcctt aagtcaatct cagaaggtag cataaacaaa taaaaattaa 1141 ctgtatggca tgatcagtgc ggttatgtgg aaacaacaac aaacaaacga agctatctga 1201 gtgaactgct aatacagaaa cttaatgtag acctgtttgg ggtctactat tgttttagaa 1261 tgaaggaatt gtattattgt gtgtatatat aatttgtaaa taaaaagtat ggagatgata 1321 cggtgttaaa aaaaatcatg gtaaggctac aatactcaag taacaaggtt tgggacaatg 1381 tctaagggtt aaagtgccaa agccatttct gtactaactg ttctcttgtt ccggtaccgg 1441 ggagaaggat gacccctcct tattctccaa ttcatgtaca gtattttgtc ctagcagcat 1501 aaagacctag ctcttttctt acaagaggca gaaacaagac aggctagttc ataaacaaac 1561 tgtgtaactt ctcaaaatga atctatttca taactcggac aatttctggg tggtgactga 1621 gtaccccttt agtgagtacc cctttagtgc tatatttgtg ccattcatta tctggttcat 1681 atttcttttc tgttagatga tacacatttc ttcaaaaaaa tttctaatgt cacttttgta 1741 cttttttaaa taaagtatgt ttaactgttg ggctctc // LOCUS D87989 1186 bp mRNA PRI 18-MAR-1997 DEFINITION Human mRNA for UDP-galactose transporter related isozyme 1, complete cds. ACCESSION D87989 NID g1694637 KEYWORDS UGTrell; UDP-galactose transporter. SOURCE Homo sapiens 20 week post conception fetus liver and spleen cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1186) AUTHORS Ishida,N. TITLE Direct Submission JOURNAL Submitted (24-SEP-1996) to the DDBJ/EMBL/GenBank databases. Nobuhiro Ishida, The Tokyo Metropolitan Institute of Medical Science, The Physiological Chemistry; 18-22, Honkomagome 3-chome, Bunkyo-ku, Tokyo 113, Japan (E-mail:ishidan@rinshoken.or.jp, Tel:03-3823-2101, Fax:03-3823-2965) REFERENCE 2 (bases 1 to 1186) AUTHORS Ishida,N., Miura,N., Yoshioka,S. and Kawakita,M. TITLE Molecular cloning and characterization of a novel isoform of the human UDP-galactose transporter, and of related complementary DNAs belonging to the nucleotide-sugar transporter gene family JOURNAL J. Biochem. 120 (6), 1074-1078 (1996) MEDLINE 97164005 REFERENCE 3 (bases 1 to 1186) AUTHORS Ishida,N. TITLE Molecular cloning and characterization of an isoform and the related genes of human UDP-galactose translocator JOURNAL Unpublished (1996) REFERENCE 4 (sites) AUTHORS Lennon,G., Auffray,C., Polymeropoulos,M. and Soares,M.B. TITLE The I.M.A.G.E. Consortium: an integrated molecular analysis of genomes and their expression JOURNAL Genomics 33 (1), 151-152 (1996) MEDLINE 96224170 FEATURES Location/Qualifiers source 1..1186 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="20 week post conception fetus liver and spleen" source 1..365 /note="5'RACE products from normal, whole liver from a 33-yr-old Caucasian female (Clontech Human Liver 5'-RACE ready cDNA)" /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="33 year old" /sex="female" /tissue_lib="Clontech human liver 5'RACE ready cDNA" /tissue_type="liver" source 1..365 /note="5'RACE products from normal, whole livers pooled from two spontaneously aborted female Caucasian fetuses, aged 22 to 26 weeks (Clontech Human Fetal Liver Marathon-ready cDNA)" /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="22 to 26 week" /sex="female" /tissue_lib="Clonetech human fetal liver marathon-ready cDNA" /tissue_type="liver" CDS 88..1056 /note="highly similar to UDP-N-acetylgulucosamine transporter of K.lactis" /codon_start=1 /product="UGTrel1" /db_xref="PID:g1669560" /translation="MASSSSLVPDRLRLPLCFLGVFVCYFYYGILQEKITRGKYGEGA KQETFTFALTLVFIQCVINAVFAKILIQFFDTARVDRTRSWLYAACSISYLGAMVSSN SALQFVNYPTQVLGKSCKPIPVMLLGVTLLKKKYPLAKYLCVLLIVAGVALFMYKPKK VVGIEEHTVGYGELLLLLSLTLDGLTGVSQDHMRAHYQTGSNHMMLNINLWSTLLLGM GILFTGELWEFLSFAERYPAIIYNILLFGLTSALGQSFIFMTVVYFGPLTCSIITTTR KFFTILASVILFANPISPMQWVGTVLVFLGLGLDAKFGKGAKKTSH" source 289..1186 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="I.M.A.G.E. Connsortium clone ID 295714" /dev_stage="20 week-post conception fetus" /tissue_type="liver and spleen" polyA_signal 1169..1174 polyA_signal 1173..1179 polyA_site 1186 /note="20 A nucleotides" BASE COUNT 258 a 300 c 296 g 332 t ORIGIN 1 gatgtccggc tggagctgtc gcctccgccg ccgctgctgc cggtgccggt tgtgagcggg 61 tctccagtcg gctcctctgg gcgtctcatg gcctctagca gctccctggt gcccgaccgg 121 ctgcgcctgc cgctctgctt cctgggtgtc tttgtctgct atttttacta tgggatcctg 181 caggaaaaga taacaagagg aaagtatggg gaaggagcca agcaggagac gttcaccttt 241 gccttaactt tggtcttcat tcaatgtgtg atcaatgctg tgtttgccaa gatcttgatc 301 cagttttttg acactgccag ggtggatcgt acccggagct ggctctatgc tgcctgttct 361 atctcctatc tgggtgccat ggtctccagc aattcagcac tacagtttgt caactaccca 421 actcaggtcc ttggtaaatc ctgcaagcca atcccagtca tgctccttgg ggtgaccctc 481 ttgaagaaga agtacccgtt ggccaagtac ctgtgtgtgc tgttaattgt ggctggagtg 541 gcccttttca tgtacaaacc caagaaagtt gttgggatag aagaacacac agtcggctat 601 ggagagctac tcttgctatt atcgctgacc ctggatggac tgactggtgt ttcccaggac 661 cacatgcggg ctcattacca aacaggctcc aaccacatga tgctgaacat caacctttgg 721 tcgacattgc tgctgggaat gggaatcctg ttcactgggg agctctggga gttcttgagc 781 tttgctgaaa ggtaccctgc catcatctat aacatcctgc tctttgggct gaccagtgcc 841 ctgggtcaga gcttcatctt tatgacggtt gtgtattttg gtcccctgac ctgctccatc 901 atcactacaa ctcgaaagtt cttcacaatt ttggcctctg tgatcctctt cgccaatccc 961 atcagcccca tgcagtgggt gggcactgtg cttgtgttcc tgggtcttgg tcttgatgcc 1021 aagtttggga aaggagctaa gaagacatcc cactaggaag agagagacta cctccacatc 1081 aagaatattt aagttattat ctcaaacagt gacatctctt gggaaaatgg acttaatagg 1141 aatatgggac tgagttccag tcttttttaa taaaataaaa tcaagc // LOCUS D88152 2682 bp mRNA PRI 20-MAY-1997 DEFINITION Human mRNA for acetyl-coenzyme A transporter, complete cds. ACCESSION D88152 NID g2114303 KEYWORDS acetyl-coenzyme A transporter. SOURCE Homo sapiens Melanoma cell_line:SK-MEL-28 cDNA to mRNA, clone_lib:Invitrogen. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2682) AUTHORS Kanamori,A. TITLE Direct Submission JOURNAL Submitted (28-SEP-1996) to the DDBJ/EMBL/GenBank databases. Akiko Kanamori, The Institute of Chemical and Physical Research (RIKEN), Cellular Glycobiology; 2-1 Hirosawa, Wako-shi, Saitama 351-01, Japan (E-mail:kanamori@rtcs1.riken.go.jp, Tel:048-467-9614, Fax:048-467-9614) REFERENCE 2 (bases 1 to 2682) AUTHORS Kanamori,A., Nakayama,J., Stallcup,W., Sasaki,K., Fukuda,M. and Hirabayashi,Y. TITLE Expression cloning and characterizaton of a cDNA encoding a novel membrane protein required for the formation of O-acetylated ganglioside: A putative acetyl-CoA transporter JOURNAL Unpublished (1997) REFERENCE 3 (sites) AUTHORS Kanamori,A., Nakayama,J., Fukuda,M.N., Stallcup,W.B., Sasaki,K., Fukuda,M. and Hirabayashi,Y. TITLE Expression cloning and characterization of a cDNA encoding a novel membrane protein required for the formation of O-acetylated ganglioside: a putative acetyl-CoA transporter JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (7), 2897-2902 (1997) MEDLINE 97250462 FEATURES Location/Qualifiers source 1..2682 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="SK-MEL-28" /cell_type="Melanoma" /clone_lib="Invitrogen" CDS 388..2037 /codon_start=1 /product="acetyl-coenzyme A transporter" /db_xref="PID:d1020884" /db_xref="PID:g2114304" /translation="MSPTISHKDSSRQRRPGNFSHSLDMKSGPLPPGGWDDSHLDSAG REGDREALLGDTGTGDFLKAPQSFRAELSSILLLLFLYVLQGIPLGLAGSIPLILQSK NVSYTDQAFFSFVFWPFSLKLLWAPLVDAVYVKNFGRRKSWLVPTQYILGLFMIYLST QVDRLLGNTDDRTPDVIALTVAFFLFEFLAATQDIAVDGWALTMLSRENVGYASTCNS VGQTAGYFLGNVLFLALESADFCNKYLRFQPQPRGIVTLSDFLFFWGTVFLITTTLVA LLKKENEVSVVKEETQGITDTYKLLFAIIKMPAVLTFCLLILTAKIGFSAADAVTGLK LVEEGVPKEHLALLAVPMVPLQIILPLIISKYTAGPQPLNTFYKAMPYRLLLGLEYAL LVWWTPKVEHQGGFPIYYYIVVLLSYALHQVTVYSMYVSIMAFNAKVSDPLIGGTYMT LLNTVSNLGGNWPSTVALWLVDPLTVKECVGASNQNCRTPDAVELCKKLGGSCVTALD GYYVESIICVFIGFGWWFFLGPKFKKLQDEGSSSWKCKRNN" BASE COUNT 708 a 575 c 597 g 802 t ORIGIN 1 gaattcgcag cgagagctgg aggtgttggg tcgggagacc agccattcga tcccgccgca 61 ggtaggagct ggtttccatc ctggcaccac ggcacacacc tccagcctcg agcccggcgc 121 tgctgcccgg gggtctcctt caggctcttt gacgccgttc cagggggcac ctatccaggc 181 atcctctggg cctctagcca gaggactggc tcccggcttc agcactccgg gctgcagtaa 241 gaagtgccct tatcgctctg agccctgcca ccatcccgtg aaccaccgaa accctggtcc 301 agcgcgacag ccttggacct gggactggac ggatccaaaa cgctcagcct cggcccccca 361 cagacggggc tctgcatcgt ctctgatatg tcacccacca tctcccacaa ggacagcagc 421 cggcaacggc ggccagggaa tttcagtcac tctctggata tgaagagcgg tcccctgccg 481 ccaggcggtt gggatgacag tcatttggac tcagcgggcc gggaagggga cagagaagct 541 cttctggggg ataccggcac tggcgacttc ttaaaagccc cacagagctt ccgggccgaa 601 ctaagcagca ttttgctact actctttctt tacgtgcttc agggtattcc cctgggcttg 661 gcgggaagca tcccactcat tttgcaaagc aaaaatgtta gctatacaga ccaagctttc 721 ttcagttttg tcttttggcc cttcagtctc aaattactct gggccccgtt ggttgatgcg 781 gtctacgtta agaacttcgg tcgtcgcaaa tcttggcttg tcccgacaca gtatatacta 841 ggactcttca tgatctattt atccactcag gtggaccgtt tgcttgggaa taccgatgac 901 agaacacccg acgtgattgc tctcactgtg gcgttctttt tgtttgaatt cttggccgcc 961 actcaggaca ttgccgtcga tggttgggcg ttaactatgt tatccaggga aaatgtgggt 1021 tatgcttcta cttgcaattc ggtgggccaa acagcgggtt actttttggg caatgttttg 1081 tttttggccc ttgaatctgc cgacttttgt aacaaatatt tgcggtttca gcctcaaccc 1141 agaggaatcg ttactctttc agatttcctt tttttctggg gaactgtatt tttaataaca 1201 acaacattgg ttgcccttct gaaaaaagaa aacgaagtat cagtagtaaa agaagaaaca 1261 caagggatca cagatactta caagctgctt tttgcaatta taaaaatgcc agcagttctg 1321 acattttgcc ttctgattct aactgcaaag attggttttt cagcagcaga tgctgtaaca 1381 ggactgaaat tggtagaaga gggagtaccc aaagaacatt tagccttatt ggcagttcca 1441 atggttcctt tgcagataat actgcctctg attatcagca aatacactgc aggtccccag 1501 ccattaaaca cattttacaa agccatgccc tacagattat tgcttgggtt agaatatgcc 1561 ctactggttt ggtggactcc taaagtagaa catcaagggg gattccctat atattactat 1621 atcgtagtcc tgctgagtta tgctttacat caggttacag tgtacagcat gtatgtttct 1681 ataatggctt tcaatgcaaa ggttagtgat ccacttattg gaggaacata catgaccctt 1741 ttaaataccg tgtccaatct gggaggaaac tggccttcta cagtagctct ttggcttgta 1801 gatcccctca cagtaaaaga gtgtgtagga gcatcaaacc agaattgtcg aacacctgat 1861 gctgttgagc tttgcaaaaa actgggtggc tcatgtgtta cagccctgga tggttattat 1921 gtggagtcca ttatttgtgt tttcattgga tttggttggt ggttctttct tggtccaaaa 1981 tttaaaaagt tacaggatga aggatcatct tcgtggaaat gcaaaaggaa caattaatat 2041 atatgctact ggacattcta gcaaggtaat tgtagtttag ttttaattcg gagagcaatg 2101 ataatcagtg cacaggagta taaaatatta ttttaaacag cgaaattaat aatataaaat 2161 gccaaatggt tgaaaaaata gaaacctttc tgtatatttg atcatatttt ttttttgcct 2221 tgtcaatgta tttaaagttt acttaaggtc aggaaattct aaaacaactt ttctggcctt 2281 gttatttgat gtatatcttt taaatttact gaccaaagca tgttttaagc tgcaatgcag 2341 tagtcacggg tggtaaccat gtagtcaggt attgttatta gtacctatca ctgctgagct 2401 gtatttaaaa ttttggtaca atatataaaa tggagaagag cttgatattc aggtactaac 2461 cacaactagt ctgacattgt tggcagttaa aatcttattt tgaattgtaa attagttaaa 2521 ttttatgtgg aatttgctga gaaaagaata tagactactg aaatgtcatt ttagttattt 2581 ttcttatgac cacattgtac aaatgaatct gtgttaaaaa gactatttta aatgtatttc 2641 ctgcttttgt aagcattaaa gatttgaatt ccaccacact gg // LOCUS D88153 4763 bp mRNA PRI 31-JUL-1997 DEFINITION Homo sapiens mRNA for HYA22, complete cds. ACCESSION D88153 NID g2289785 KEYWORDS HYA22. SOURCE Homo sapiens tissue_lib:pancreas cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4763) AUTHORS Nakamura,Y. TITLE Direct Submission JOURNAL Submitted (28-SEP-1996) to the DDBJ/EMBL/GenBank databases. Yusuke Nakamura, Institute of Medical Science, University of Tokyo, Laboratory of Molecular Medicine, Human Genome Center; 4-6-1 Shirokanedai, Minato-ku, Tokyo 108, Japan (E-mail:yusuke@ims.u-tokyo.ac.jp, Tel:81-3-5449-5372, Fax:81-3-5449-5433) REFERENCE 2 (sites) AUTHORS Ishikawa,S., Kai,M., Tamari,M., Takei,Y., Takeuchi,K., Bandou,H., Yamane,Y., Ogawa,M. and Nakamura,Y. TITLE Sequence analysis of a 685-kb genomic region on chromosome 3p22-p21.3 that is homozygously deleted in a lung carcinoma cell line JOURNAL DNA Res. 4 (1), 35-43 (1997) MEDLINE 97323004 FEATURES Location/Qualifiers source 1..4763 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="3" /map="3p21.3" /tissue_lib="pancreas" CDS 148..1170 /codon_start=1 /product="HYA22" /db_xref="PID:d1022517" /db_xref="PID:g2289786" /translation="MVTRRRPLEPSGGGRRELGRGPGPPLPERGAGRRARPGSGCERP PAPRPRAPRAAPPRAWLAGGRACGRPPRRAPMDGPAIITQVTNPKEDEGRLPGAGEKA SQCNVSLKKQRSRSILSSFFCCFRDYNVEAPPPSSPSVLPPLVEENGGLQKPPAKYLL PEVTVLDYGKKCVVIDLDETLVHSSFKPISNADFIVPVEIDGTIHQVYVLKRPHVDEF LQRMGQLFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGNYVKDLSRLG RELSKVIIVDNSPASYIFHPENAVPVQSWFDDMTDTELLDLIPFFEGLSREDDVYSML HRLCNR" polyA_site 4763 /note="18 a nucleotides" BASE COUNT 1156 a 1182 c 1146 g 1278 t 1 others ORIGIN 1 cccctcaccc cactcaactg ccccgggccc ccgcgcgcgc ggccgcccct ccactcaccc 61 tgtgtcggcc ccgctcccct ctcccccacc aggcgagcag gcgagcgggc agagcccgcg 121 gcggaggtcg gcgcggctcc ggggttcatg gtgacgaggc ggcggccgct cgagcccagc 181 ggcggcgggc ggcgggagct ggggcgcggg cccgggccgc ctctcccaga gcgcggggcc 241 gggcggcggg cgcgcccagg cagcggctgc gagcgccccc ccgcgccgcg cccccgcgcc 301 ccccgcgccg cgcccccgcg cgcttggctt gcggggggcc gggcctgcgg gcggccgccg 361 cgccgcgcac ccatggacgg cccggccatc atcacccagg tgaccaaccc caaggaggac 421 gagggccggt tgccgggcgc gggcgagaaa gcctcccagt gcaacgtcag cttaaagaag 481 cagaggagcc gcagcatcct tagctccttc ttctgctgct tccgtgatta caatgtggag 541 gcccctccac ccagcagccc cagtgtgctt ccgccactgg tggaggagaa tggtgggctt 601 cagaagccac cagctaagta ccttcttcca gaggtgacgg tgcttgacta tggaaagaaa 661 tgtgtggtca ttgatttaga tgaaacattg gtgcacagtt cgtttaagcc tattagtaat 721 gctgatttta ttgttccggt tgaaatcgat ggaactatac atcaggtgta tgtgctgaag 781 cggccacatg tggacgagtt cctccagagg atggggcagc tttttgaatg tgtgctcttt 841 actgccagct tggccaagta tgcagaccct gtggctgacc tcctagaccg ctggggtgtg 901 ttccgggccc ggctcttcag agaatcatgt gtttttcatc gtgggaacta cgtgaaggac 961 ctgagtcgcc ttgggcggga gctgagcaaa gtgatcattg ttgacaattc ccctgcctca 1021 tacatcttcc atcctgagaa tgcagtgcct gtgcagtcct ggttcgatga catgacggac 1081 acggagctgc tggacctcat ccccttcttt gagggcctga gccgggagga cgacgtgtac 1141 agcatgctgc acagactctg caataggtag ccctggcctc tgcctgcctc ccgcctgtgc 1201 actctggaac ctctggcctc aggggacctg cctgtcctca gctccctggg agctgaaagt 1261 gaggatactc cgtgctccag gccacagggt gaatgtggcc atgcctacct gttttgtttt 1321 tttaagaaca gaaacaacta ttttaaaaga actcttttaa gaaatttcat aaagggacat 1381 gcattttact gggtttgctt ttcttaaaac ataccaaaaa agaaaaaaat agaaaaaaaa 1441 aaaaaaaaag ctgatctcta tcagactctt caactgtcct ccctccaagc agaccacctg 1501 tccccttcta tcccagctca gagcagctga cccaactcag aatctctttc ctacaggatg 1561 aaagtgcctt ttgaatgtta tttttaagcc gagagttaat ttttctacac aacatatttc 1621 cagacatctt ttagtctttt attgtcttag atactataag aagatgaaca tgacaatttt 1681 ctagaacctg gtagcgtgtg tgtgtggttg gcggggggtg ctgagggagg ggagtgagtc 1741 acaggagcct gtcccccaac aggtgtgact gctctgacaa cctgtggcat gctgcagggt 1801 caggctcctg ataggaggat ttcatgacta tgtcattgtc tccactcatt tttgacccag 1861 tttggaatgt atctgcaatt gtgtggctca acactttagg aaacatagat tattttatat 1921 tattatttct gatggtgaca agtttgtctt gaggtcacat tttctccttg aaaagtgaca 1981 tcctgtcact tctgctctca cactactgcc atacatttgt gtttttttgt tgttattgtt 2041 tgggtagagc agttacaaga aaccctaaaa cccttggata taaaagaaat ctgtttattg 2101 atttttaaat ctttcctttc caaaagctgg atacacatgg agctgtttgg gaattttcct 2161 tgctgctacc gcgctgccac caaatggaat tgaccagcgg ctgttacact gttctttgcc 2221 actgtgccta tgctcagaat atgctcactg ctaagctaca aactcggaca gggtcagaaa 2281 cagaggtgtc ccatcccatt gcagcctcca ccacctgtaa ccccttcctg gcattggcca 2341 ctgaagggta caaaggcaaa aggaccacag caccacttag gtgtagcatg gattttaaac 2401 tgcagtcagt atcagatcct gtttgataaa taagctgact gttctctctt gagaacctgt 2461 ggcctcaacc agccaccaag ctgatgtggc ccaagctcca tctcttggtc ttctcctttg 2521 aagcacagcc tatttctgag ccaagggttg gggaagcctg tctagatgtg ggactcattg 2581 ccccaaacca gggagaggaa gagctcccac agggagagcc caggctctct ttgcagcctt 2641 tcccagtttg gtgtttaaag cagtgccatg ttccttgttt gacaacaaga cagtctgtaa 2701 agtattgctc ttaaaaacaa ttaaaaagaa ccctttcata ttggcaccat tgccttagtc 2761 ctctgtgggt tggtcttcag ccagcattct ggtgggagtg actggcatta acaagactgg 2821 aaatcggggg tcaaagtaaa atatctttgt tttgctttca ttcacaaagt aatgaagcca 2881 gctgccaatt acatcctccc aacagcactt tggtctgtga ctgctgtgtg atattcagaa 2941 gggaagtagt attcaggggg taaacaggtc tcccagcatt ctgagtgttc caaaccagta 3001 atccacatgc caattcaaat agaacagccc cttgctagat attaccacag ataatgacag 3061 tacatggtag aactgcccat gccacaaata tttatttgga aaagtagtca ttaaatgaac 3121 ccactgcctt aaatgtcttg aatgttgcag tcaagtgtct gtcatgtgtt gatatccaca 3181 cagaattagg ccctaatgag agccttagac cctcaaccat gcccccttcg ttggcatcac 3241 agggccttat ttggaagagc ggggcaaaga ggatggaaat cataaaatat ttcatgggaa 3301 tcgaacctag ggatagtgct ccacttctga cgatggagtg aagacacttg gcagacttga 3361 gccagacact tcacctagta gttcctgaaa ctgtgagcac cactgcacta agccagtgcg 3421 gagctgttag ggacgggccc agctcctgca cacggacaca gaatgtctgg agagggcagc 3481 aggcctctga gggttctgga atctgtgcca ccttatttga ccacactcca aaattctgtt 3541 tttattttaa cccttgaatc tgctttatgt acataatcaa aatatctata tctatatcta 3601 tatctatatc tatatatttt taatcatcta catgtaaatg aagcaataga attctaacat 3661 aaggccaaga aatgagacga atgtttgggg tttatgtttt ttaaggtaaa tacgggtatt 3721 gtttttaatt attaccatgt attaaattgt gggctttgaa acctaatgaa acctgttagc 3781 cacttctctg tgccatatac ttcccatgtt accaaaatac gcccaactct ttagccaaaa 3841 gagaacnctg acctcctgag tttccatgct cctttctgtc aggtttaaat gtagtcttct 3901 ggagaagtat ttttgacatt gagctctggg acaggacacc ttgggtttgt ggactgcagc 3961 ccactatgat gttattactt ctctggccag gcctccagtg gaagtgcaca ggcactccca 4021 atgttgttaa tgctctgtct tccatttgtt ctggaatcct acgtgttggt ctgtggttcc 4081 atgcattagc tgtttgtaaa taatgcattt gcatactgaa aaaggaatgc cacctgccac 4141 agttgatggt gagaagctcc tttgacgtgg tgcaattttg atgagatgtc tctggggaca 4201 cgaggatgcc ctaatgatgc tgacttgtca tggttgcagc atttgaactt ttggtgttaa 4261 aaaaaaaaac ctgtaagtct gtaacctggc aacattttac aaccctgtat ttttaaagat 4321 ggctttctaa taaaaaatcc agaaccacac agccctatgg tcaaacaatc ctacgtttgt 4381 gcctctgctt ttaaaggtgc tgtgctggac agttggcatg ccagggttcg agaagagtga 4441 atggcttgac gtccttgcag ttaactgtgc aaaattggct ggctgcctct gttcctactg 4501 tactgtaact ttgatcatgt ctgttcctat tccattctcc caggagcttc tctgcagact 4561 gacacaccct cccccacccc gggtagtgga gatgctggtg tctgggtagt catggatttc 4621 tgctgacatt tgaatgtgat aaacaatcca gcattactta ggaaatgcta catgcggaat 4681 gtgcacgttt ccaggggcga gtattgtcaa tcaaaaggtt tgcaatgatt tccttcctgc 4741 caaaaataaa catgtgaaac tgc // LOCUS D88213 2572 bp mRNA PRI 27-DEC-1997 DEFINITION Homo sapiens mRNA for retina-specific amine oxidase, complete cds. ACCESSION D88213 NID g1906805 KEYWORDS RAO; AOC2; retina-specific amine oxidase. SOURCE Homo sapiens etina cDNA to mRNA, clone_lib:human retina 5'-stretch cDNA library (clontech). ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Imamura,Y., Kubota,R., Wang,Y., Asakawa,S., Kudoh,J., Mashima,Y., Oguchi,Y. and Shimizu,N. TITLE Human retina-specific amine oxidase (RAO): cDNA cloning, tissue expression, and chromosomal mapping JOURNAL Genomics 40 (2), 277-283 (1997) MEDLINE 97237047 REFERENCE 2 (bases 1 to 2572) AUTHORS Shimizu,N. TITLE Direct Submission JOURNAL Submitted (02-OCT-1996) to the DDBJ/EMBL/GenBank databases. Nobuyoshi Shimizu, Keio University School of Medicine, Department of Molecular Biology; 35 Shinanomachi, Shinjuku-ku, Tokyo 160, Japan (E-mail:shimizu@dmb.med.keio.ac.jp, Tel:03-3351-2370, Fax:03-3351-2370) FEATURES Location/Qualifiers source 1..2572 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17" /clone_lib="human retina 5'-stretch cDNA library (clontech)" /map="17q21" /tissue_type="etina" gene 27..2216 /gene="AOC2" CDS 27..2216 /gene="AOC2" /codon_start=1 /product="retina-specific amine oxidase" /db_xref="PID:d1019744" /db_xref="PID:g1906806" /translation="MHLKIVLAFLALSLITIFALAYVLLTSPGGSSQPPHCPSVSHRA QPWPHPGQSQLFADLSREELTAVMRFLTQRLGPGLVDAAQAQPSDNCIFSVELQLPPK AAALAHLDRGSPPPAREALAIVLFGGQPQPNVSELVVGPLPHPSYMRDVTVERHGGPL PYHRRPVLRAEFTQMWRHLKDVELPKAPIFLSSTFNYNGSTLAAVHATPRGLRSRERT TWIGLYHNISGVGLFLHPVGLELLLDHRALDPAHWTVQQVFYLGHYYADLGQLEREFK SGRLEVVRVPLPPPNGASSLRSRNSPGPLPPLQFSPQGSQYSVQGNLVVSSLWSFTFG HGVFSGLRIFDVRFQGERIAYEVSVQECVSIYGADSPKTMLTRYLDSSFGLGRNSRGL VRGVDCPYQATMVDIHILVGKGAVQLLPGAVCVFEEAQGLPLRRHHNYLQNHFYGGLA SSALVVRSVSSVGNYDYIWDFVLYPNGALEGRVHATGYINTAFLKGGEEGLLFGNRVG ERVLGTVHTHAFHFKLDLDVAGLKNWVVAEDVVFKPVAAPWNPEHWLQRPQLTRQVLG KEDLTAFSLGSPLPRYLYLASNQTNAWGHQRGYQLVVTQRKEEESQSSSIYHQNDIWT PTVTFADFINNETLLGEDLVAWVTASFLHIPHAEDIPNTVTLGNRVGFLLRPYNFFDE DPSIFSPGSVYFEKGQDAGLCSINPVACLPDLAACVPDLPPFSYHGF" BASE COUNT 538 a 736 c 669 g 629 t ORIGIN 1 tctgatttca cctctcagca tccaccatgc atctcaagat agtcctggcg ttcctggcac 61 tgtccctcat taccatcttt gccctggcct atgttttgct gaccagccca ggtggttcca 121 gccagcctcc ccactgcccc tctgtatccc atagggccca gccctggcca caccctggcc 181 agagccagct gtttgcagac ctgagccgag aggagttgac agctgtgatg cgctttctga 241 cccagcggct ggggccaggg ctggtggacg cagcccaggc tcagccctcg gacaactgca 301 tcttctcagt ggagctgcag ctgcccccca aggctgcagc cctggcccac ctggacaggg 361 ggagcccccc acctgcccgg gaggcactgg ccatcgtcct ctttggtgga caaccccaac 421 ccaatgtgag tgagctggtg gtggggccgc tgcctcaccc ctcgtacatg cgggatgtga 481 ctgtggagcg tcacggcggg cccctgccct atcaccgtcg cccggtgctg agagctgagt 541 ttacacagat gtggaggcat ctgaaagatg tggagctacc caaggcaccc atcttcctgt 601 cgtccacctt caactacaat ggctctaccc tggcagctgt gcatgccacc cctcggggct 661 tgcgctcaag ggaacgaact acctggattg gcctctacca taacatctca ggggttggtc 721 ttttccttca ccccgtgggg ctggagctac tactggacca cagggccctg gaccctgccc 781 actggactgt ccagcaggtc ttctaccttg ggcactacta tgcagacttg ggccagttgg 841 aacgggagtt taagtctggc cggttggaag tggttagagt ccctctacct ccaccaaatg 901 gagcttcatc cctgaggtct cggaactctc caggtcctct tccccctctt cagttctcgc 961 cccagggttc ccagtacagt gtgcaaggaa acctggtggt atcctccctc tggtcattta 1021 cctttggcca tggggtgttc agcggcctga ggatttttga tgttcggttc cagggtgagc 1081 gaatagccta tgaagtcagt gtccaggagt gtgtatctat ctatggtgcc gattcaccca 1141 agacgatgct gactcgctat ttggatagca gctttggact cggccgtaac agccgaggct 1201 tggtgcgggg agtggactgc ccctatcaag ccacgatggt ggacatccat atattagtgg 1261 gcaaaggggc agtccagctg cttccagggg ctgtgtgtgt atttgaggaa gcccagggac 1321 tgccccttcg aaggcaccac aattaccttc aaaatcattt ctatggtggt ttggccagct 1381 cagcccttgt ggtcaggtct gtgtcatctg tgggcaacta tgactacatt tgggactttg 1441 tgttgtaccc aaatggggca cttgaagggc gggtccatgc cacgggttat atcaacacag 1501 ctttcctgaa agggggagag gagggcctcc tctttgggaa ccgtgtgggg gaaagagtgc 1561 tgggaacggt gcacacacat gccttccact tcaagctgga cctggatgtg gcagggctga 1621 aaaactgggt ggtagctgaa gacgtggtgt ttaaacctgt ggctgccccc tggaacccgg 1681 agcactggct acagcgccca cagctgactc ggcaggtcct gggaaaggag gacctgacag 1741 ctttttcctt gggaagcccc ctaccccgct acctctacct ggctagcaac cagactaatg 1801 cgtggggtca ccagcgcgga taccagcttg tggtgaccca gagaaaggag gaggagtcac 1861 agagcagtag catctatcac cagaatgaca tctggacacc cacagttacc tttgctgact 1921 tcatcaacaa tgaaaccctc ttaggagagg atctggtggc ttgggtcaca gccagcttcc 1981 tgcacattcc ccatgccgag gacatcccaa acacagtgac tctggggaac agagttggct 2041 tcttgctccg accctataac ttctttgatg aggacccctc catcttctcc cctggcagtg 2101 tgtactttga gaagggccag gatgctgggc tctgcagcat caatcctgtg gcctgcctcc 2161 ccgacctggc agcctgtgtc ccggacttac cccctttctc ttaccacggc ttctagtcct 2221 gagggtgtgg cgggcggcgt ggttaggcac atgtactttt ccctgtttct actttctatt 2281 ctccgtgttt ttatcacacc tgctccccag attcccaccc cctcaatgtt cctctcacac 2341 gaaaccccca tcagtccctt tggttaattc ttacttcctg ttcatctcta aagtgttaaa 2401 ttataaaaat gatttttaaa tattcaaaga aaaatatcac aaatcctact actcagaaat 2461 aggtggtcac attacatcag acatctcttt atgcatgtgc attcaaaagg aagagtagat 2521 agaattttgt aaaacagatg ttgtatgtaa tttataataa aaagtattaa ag // LOCUS D88214 1934 bp mRNA PRI 19-NOV-1997 DEFINITION Homo sapiens mRNA for myocilin, complete cds. ACCESSION D88214 NID g2627176 KEYWORDS myocilin. SOURCE Homo sapiens retina cDNA to mRNA, clone_lib:human retina cDNA library 5'stretch (CONTECH). ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Kubota,R., Noda,S., Wang,Y., Minoshima,S., Asakawa,S., Kudoh,J., Mashima,Y., Oguchi,Y. and Shimizu,N. TITLE A novel myosin-like protein (myocilin) expressed in the connecting cilium of the photoreceptor: molecular cloning, tissue expression, and chromosomal mapping JOURNAL Genomics 41 (3), 360-369 (1997) MEDLINE 97312692 REFERENCE 2 (bases 1 to 1934) AUTHORS Shimizu,N. TITLE Direct Submission JOURNAL Submitted (02-OCT-1996) to the DDBJ/EMBL/GenBank databases. Nobuyoshi Shimizu, Keio University School of Medicine, Department of Molecular Biology; 35 Shinanomachi, Shinjuku-ku, Tokyo 160, Japan (E-mail:shimizu@dmb.med.keio.ac.jp, Tel:03-3351-2370, Fax:03-3351-2370) COMMENT Sequence updated (17-Nov-1997). FEATURES Location/Qualifiers source 1..1934 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /clone_lib="human retina cDNA library 5'stretch (CONTECH)" /map="1q23-q24" /tissue_type="retina" CDS 65..1579 /codon_start=1 /product="myocilin" /db_xref="PID:g2627177" /translation="MRFFCARCCSFGPEMPAVQLLLLACLVWDVGARTAQLRKANDQS GRCQYTFSVASPNESSCPEQSQAMSVIHNLQRDSSTQRLDLEATKARLSSLESLLHQL TLDQAARPQETQEGLQRELGTLRRERDQLETQTRELETAYSNLLRDKSVLEEEKKRLR QENENLARRLESSSQEVARLRRGQCPQTRDTARAVPPGSREVSTWNLDTLAFQELKSE LTEVPASRILKESPSGYLRSGEGDTGCGELVWVGEPLTLRTAETITGKYGVWMRDPKP TYPYTQETTWRIDTVGTDVRQVFEYDLISQFMQGYPSKVHILPRPLESTGAVVYSGSL YFQGAESRTVIRYELNTETVKAEKEIPGAGYHGQFPYSWGGYTDIDLAVDEAGLWVIY STDEAKGAIVLSKLNPENLELEQTWETNIRKQSVANAFIICGTLYTVSSYTSADATVN FAYDTGTGISKTLTIPFKNRYKYSSMIDYNPLEKKLFAWDNLNMVTYDIKLSKM" old_sequence 104..105 /citation=[1] /replace="gaga" misc_feature 317..361 /note="leucine zipper-like motif 1" misc_feature 413..562 /note="leucine zipper-like motif 2" polyA_signal 1778..1783 polyA_signal 1928..1933 BASE COUNT 510 a 503 c 525 g 396 t ORIGIN 1 cgggccccgg acacccgctc tgcacagcag agctttccag aggaagcctc accaagcctc 61 tgcaatgagg ttcttctgtg cacgttgctg cagctttggg cctgagatgc cagctgtcca 121 gctgctgctt ctggcctgcc tggtgtggga tgtgggggcc aggacagctc agctcaggaa 181 ggccaatgac cagagtggcc gatgccagta taccttcagt gtggccagtc ccaatgaatc 241 cagctgccca gagcagagcc aggccatgtc agtcatccat aacttacaga gagacagcag 301 cacccaacgc ttagacctgg aggccaccaa agctcgactc agctccctgg agagcctcct 361 ccaccaattg accttggacc aggctgccag gccccaggag acccaggagg ggctgcagag 421 ggagctgggc accctgaggc gggagcggga ccagctggaa acccaaacca gagagttgga 481 gactgcctac agcaacctcc tccgagacaa gtcagttctg gaggaagaga agaagcgact 541 aaggcaagaa aatgagaatc tggccaggag gttggaaagc agcagccagg aggtagcaag 601 gctgagaagg ggccagtgtc cccagacccg agacactgct cgggctgtgc caccaggctc 661 cagagaagtt tctacgtgga atttggacac tttggccttc caggaactga agtccgagct 721 aactgaagtt cctgcttccc gaattttgaa ggagagccca tctggctatc tcaggagtgg 781 agagggagac accggatgtg gagaactagt ttgggtagga gagcctctca cgctgagaac 841 agcagaaaca attactggca agtatggtgt gtggatgcga gaccccaagc ccacctaccc 901 ctacacccag gagaccacgt ggagaatcga cacagttggc acggatgtcc gccaggtttt 961 tgagtatgac ctcatcagcc agtttatgca gggctaccct tctaaggttc acatactgcc 1021 taggccactg gaaagcacgg gtgctgtggt gtactcgggg agcctctatt tccagggcgc 1081 tgagtccaga actgtcataa gatatgagct gaataccgag acagtgaagg ctgagaagga 1141 aatccctgga gctggctacc acggacagtt cccgtattct tggggtggct acacggacat 1201 tgacttggct gtggatgaag caggcctctg ggtcatttac agcaccgatg aggccaaagg 1261 tgccattgtc ctctccaaac tgaacccaga gaatctggaa ctcgaacaaa cctgggagac 1321 aaacatccgt aagcagtcag tcgccaatgc cttcatcatc tgtggcacct tgtacaccgt 1381 cagcagctac acctcagcag atgctaccgt caactttgct tatgacacag gcacaggtat 1441 cagcaagacc ctgaccatcc cattcaagaa ccgctataag tacagcagca tgattgacta 1501 caaccccctg gagaagaagc tctttgcctg ggacaacttg aacatggtca cttatgacat 1561 caagctctcc aagatgtgaa aagcctccaa gctgtacagg caatggcaga aggagatgct 1621 cagggctcct ggggggagca ggctgaaggg agagccagcc agccagggcc caggcagctt 1681 tgactgcttt ccaagttttc attaatccag aaggatgaac atggtcacca tctaactatt 1741 caggaattgt agtctgaggg cgtagacaat ttcatataat aaatatcctt tatcttctgt 1801 cagcatttat gggatgttta atgacatagt tcaagttttc ttgtgatttg gggcaaaagc 1861 tgtaaggcat aatagtttct tcctgaaaac cattgctctt gcatgttaca tgggtaccac 1921 aagccacaat aaaa // LOCUS D88308 2362 bp mRNA PRI 29-NOV-1997 DEFINITION Homo sapiens mRNA for very-long-chain acyl-CoA synthetase, complete cds. ACCESSION D88308 NID g2653564 KEYWORDS very-long-chain acyl-CoA synthetase. SOURCE Homo sapiens (strain:caucacian) adult male liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Uchiyama,A., Aoyama,T., Kamijo,K., Wakui,K., Fukushima,Y., Shimozawa,N., Suzuki,Y., Kondo,N., Orii,T. and Hashimoto,T. TITLE Molecular cloning of a possible human homolog of the rat very-long-chain acyl-CoA synthetase cDNA and its chromosomal localization JOURNAL Unpublished (1996) REFERENCE 2 (bases 1 to 2362) AUTHORS Kamijo,K. TITLE Direct Submission JOURNAL Submitted (08-OCT-1996) to the DDBJ/EMBL/GenBank databases. Keiju Kamijo, Shinshu University, School of Medicine, Department of Biochemistry; 3-1-1 Asahi, Matsumoto 390, Japan (E-mail:kkamijo@gipac.shinshu-u.ac.jp, Tel:+81-263-37-2603, Fax:+81-263-37-2604) FEATURES Location/Qualifiers source 1..2362 /organism="Homo sapiens" /strain="caucacian" /db_xref="taxon:9606" /dev_stage="adult" /sex="male" /tissue_type="liver" gene 223..2085 /gene="vlacs" CDS 223..2085 /gene="vlacs" /codon_start=1 /product="very-long-chain acyl-CoA synthetase" /db_xref="PID:d1024525" /db_xref="PID:g2653565" /translation="MLSAIYTVLAGLLFLPLLVNLCCPYFFQDIGYFLKVAAVGRRVR SYGQRRPARTILRAFLEKARQTPHKPFLLFRDETLTYAQVDRRSNQVARALHDHLGLR QGDCVALLMGNEPAYVWLWLGLVKLGCAMACLNYNIRAKSLLHCFQCCGAKVLLVSPE LQAAVEEILPSLKKDDVSIYYVSRTSNTDGIDSFLDKVDEVSTEPIPESWRSEVTFST PALYIYTSGTTGLPKAAMITHQRIWYGTGLTFVSGLKADDVIYITLPFYHSAALLIGI HGCIVAGATLALRTKFSASQFWDDCRKYNVTVIQYIGELLRYLCNSPQKPNDRDHKVR LALGNGLRGDVWRQFVKRFGDICIYEFYAATEGNIGFMNYARKVGAVGRVNYLQKKII TYDLIKYDVEKDEPVRDENGYCVRVPKGEVGLLVCKITQLTPFNGYAGAKAQTEKKKL RDVFKKGDLYFNSGDLLMVDHENFIYFHDRVGDTFRWKGENVATTEVADTVGLVDFVQ EVNVYGVHVPDHEGRIGMASIKMKENHEFDGKKLFQHIADYLPSYARPRFLRIQDTIE ITGTFKHRKMTLVEEGFNPAVIKDALYFLDDTAKMYVPMTEDIYNAISAKTLKL" misc_feature 898..924 /gene="vlacs" /note="ATP-binding domain; putative" misc_feature 1540..1713 /gene="vlacs" /note="hydrolysis domain; putative" BASE COUNT 633 a 554 c 589 g 586 t ORIGIN 1 ggaattccaa aaaaaaaaaa tacgactaca cctgctccgg agcccgcggc ggtacctgca 61 gcggaggagc tctgtcttcc ccttcatctc acgcgagccc ggcgtcccgc cgcgtgcgcc 121 ccggcgcagc ccgccagtcc gcccggagcc cgcccagtcg ccgcgctgca cgcccggggt 181 gaaccctctg ccctcgctgg gacagagggc cccgcagccg tcatgctttc cgccatctac 241 acagtcctgg cgggactgct gttcctgccg ctcctggtga acctctgctg cccatacttc 301 ttccaggaca taggctactt cttgaaggtg gccgccgtgg gccggagggt gcgcagctac 361 gggcagcggc ggccggcgcg caccatcctg cgggcgttcc tggagaaagc gcgccagacg 421 ccacacaagc cttttctgct cttccgcgac gagactctca cctacgcgca ggtggaccgg 481 cgcagcaatc aagtggcccg ggcgctgcac gaccacctcg gcctgcgcca gggagactgc 541 gtggcgctcc ttatgggtaa cgagccggcc tacgtgtggc tgtggctggg gctggtgaag 601 ctgggctgtg ccatggcgtg cctcaattac aacatccgcg cgaagtccct gctgcactgc 661 ttccagtgct gcggggcgaa ggtgctgctg gtgtcgccag aactacaagc agctgtcgaa 721 gagatactgc caagccttaa aaaagatgat gtgtccatct attatgtgag cagaacttct 781 aacacagatg ggattgactc tttcctggac aaagtggatg aagtatcaac tgaacctatc 841 ccagagtcat ggaggtctga agtcactttt tccactcctg ccttatacat ttatacttct 901 ggaaccacag gtcttccaaa agcagccatg atcactcatc agcgcatatg gtatggaact 961 ggcctcactt ttgtaagcgg attgaaggca gatgatgtca tctatatcac tctgcccttt 1021 taccacagtg ctgcactact gattggcatt cacggatgta ttgtggctgg tgctactctt 1081 gccttgcgga ctaaattttc agccagccag ttttgggatg actgcagaaa atacaacgtc 1141 actgtcattc agtatatcgg tgaactgctt cggtatttat gcaactcacc acagaaacca 1201 aatgaccgtg atcataaagt gagactggca ctgggaaatg gcttacgagg agatgtgtgg 1261 agacaatttg tcaagagatt tggggacata tgcatctatg agttctatgc tgccactgaa 1321 ggcaatattg gatttatgaa ttatgcgaga aaagttggtg ctgttggaag agtaaactac 1381 ctacagaaaa aaatcataac ttatgacctg attaaatatg atgtggagaa agatgaacct 1441 gtccgagatg aaaatggata ttgcgtcaga gttcccaaag gtgaagttgg acttctggtt 1501 tgcaaaatca cacaacttac accatttaat ggctatgctg gagcaaaggc tcagacagag 1561 aagaaaaaac tgagagatgt ctttaagaaa ggagacctct atttcaacag tggagatctc 1621 ttaatggttg accatgaaaa tttcatctat ttccacgaca gagttggaga tacattccgg 1681 tggaaagggg aaaatgtggc caccactgaa gttgctgata cagttggact ggttgatttt 1741 gtccaagaag taaatgttta tggagtgcat gtgccagatc atgagggtcg cattggcatg 1801 gcctccatca aaatgaaaga aaaccatgaa tttgatggaa agaaactctt tcagcacatt 1861 gctgattacc tacctagtta tgcaaggccc cggtttctaa gaatacagga caccattgag 1921 atcactggaa cttttaaaca ccgcaaaatg accctggtgg aggagggctt taaccctgct 1981 gtcatcaaag atgccttgta tttcttggat gacacagcaa aaatgtatgt gcctatgact 2041 gaggacatct ataatgccat aagtgctaaa accctgaaac tctgaatatt cccaggagga 2101 taactcaaca tttccagaaa gaaactgaat ggacagccac ttgatataat ccaactttaa 2161 tttgattgaa gattgtgagg aaattttgta ggaaatttgc atacccgtaa agggagactt 2221 ttttaaataa cagttgagtc tttgcaagta aaaagattta gagattatta tttttcagtg 2281 tgcacctact gtttgtattt gcaaactgag cttgttggag ggaaggcatt attttttaaa 2341 atacttagta aattaaatga ac // LOCUS D88378 3188 bp mRNA PRI 30-OCT-1996 DEFINITION Human mRNA for proteasome inhibitor hPI31 subunit, complete cds. ACCESSION D88378 NID g1655487 KEYWORDS proteasome inhibitor hPI31 subunit. SOURCE Homo sapiens cell_line:HepG2 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3188) AUTHORS Sandra L,M., Matsuda,K., Shimbara,N., Tanaka,K., Clive A,S. and Georoge N,D. TITLE cDNA cloning, expression, and characterization of PI31, a prolin-rich inhibitor of the proteasome JOURNAL Unpublished (1996) REFERENCE 2 (bases 1 to 3188) AUTHORS Tanaka,K. TITLE Direct Submission JOURNAL Submitted (11-OCT-1996) to the DDBJ/EMBL/GenBank databases. Keiji Tanaka, The Univ. of Tokushima, Inst. for Enz. Res.; 18-22 Honkomagone 3-chome, Bunkyo-ku, Tokyo 113, Japan (Tel:03-3823-2101(ex.5351), Fax:03-3823-2237) FEATURES Location/Qualifiers source 1..3188 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HepG2" CDS 127..942 /codon_start=1 /product="proteasome inhibitor hPI31 subunit" /db_xref="PID:d1014299" /db_xref="PID:g1655488" /translation="MAGLEVLFASAAPAITCRQDALVCFLHWEVVTHGYCGLGVGDQP GPNDKKSELLPAGWNNNKDLYVLRYEYKDGSRKLLVKAITVESSMILNVLEYGSQQVA DLTLNLDDYIDAEHLGDFHRTYKNSEELRSRIVSGIITPIHEQWEKANVSSPHREFPP ATAREVDPLRIPPHHPHTSRQPPWCDPLGPFVVGGEDLDPFGPRRGGMIVDPLRSGFP RALIDPSSGLPNRLPPGAVPPGARFDPFGPIGTSPPGPNPDHLPPPGYDDMYL" polyA_signal 3163..3168 BASE COUNT 757 a 824 c 784 g 806 t 17 others ORIGIN 1 attcgcggcc gctgcaagaa ccagcgcaag agggaagcgg agttatagct accccggccg 61 cggagccggc tcactgcact acccccgccc ccttctttcc tccagacgcc gaagtcgcgg 121 gcgctcatgg cgggcctgga ggtactgttc gcatcggcag cgccggccat cacctgcagg 181 caggacgcgc tcgtctgctt cttgcattgg gaagtggtga cacacggtta ctgcggcttg 241 ggtgtcggtg accagccggg tcccaatgat aagaagtcag aactgctgcc agctgggtgg 301 aacaacaata aagacctgta tgtcctccgg tatgagtata aggatgggtc cagaaagctc 361 cttgtgaaag ccatcaccgt ggagagcagc atgatcctca atgtgctgga atatggctca 421 cagcaagtgg cagacttgac cctgaacttg gatgattata ttgatgcaga acacctgggt 481 gacttccaca ggacctacaa gaacagtgag gagcttcggt ctcgtattgt gtctggaatc 541 atcacaccta tccatgagca gtgggaaaag gctaatgtaa gcagtcccca ccgggagttc 601 ccccctgcta ccgccagaga ggtggaccca ctccggattc ctccacacca cccacacacc 661 agtcggcagc ctccctggtg tgatcccctg ggcccgtttg ttgtcggggg agaagactta 721 gacccttttg ggcctcggag aggtggcatg attgtggatc ccctgagatc tggcttccca 781 agagcactta ttgacccttc ctcaggcctc ccgaaccgac ttcctccagg cgctgtgccc 841 ccaggagctc gctttgaccc ctttggaccc attgggacca gcccacccgg acctaaccca 901 gaccatctcc ccccgccggg ctacgatgac atgtacctgt gaaggcctca agaatgtaac 961 atcccaggct tccctccatt ctcctggagc tgccaccgct gtccccatca gcaaccatgt 1021 tcttgcaggc tgggggcaag ggattctgct catgtgtgtg gagaccggct gggatagcct 1081 ccccacccct tatcagagnc aagacacctg ctggagctct ccacctagct ggagatagct 1141 cccaaagaga aatcagtgtg tctcttncac catcagctcc tccccttaca ccaccagctc 1201 ctctccactt cccangggag actccggcan ccttcagcaa catatatcct cgaccagatg 1261 cagtgctata agaacagaac gcattttgga tgttattatt aagaaccaaa tgtcaataca 1321 gaattcatgt tgccggtttc ccacttttct ttttacatta atgcatagct gcttccattt 1381 atgagacttt agagtttgag tttctgtagg gctgaatgac tctttttcct gcccagggcc 1441 cattcttgct tctcaggcac cttccgttta ttaattgcca ttgctcctga catcactaag 1501 atgggtcccc ttctggctgc atgaatggaa atgagtgact ggaaatccca taggccacaa 1561 gaatgacttt cacaagggca ggaacattgt ggaaagactg catcattctg atgaggcaaa 1621 atcctccagc tattcctgtc tgggccagtt ttgtaggtcc atctgtgcat gggcagcagt 1681 agtcacaaag ccaagganaa aacagagcag acctgaaggc taatcttatt tttgccacta 1741 acttagtgan tgaccctaag caagttcctt ctcctcttag ggccttgtgc caagcctatg 1801 aaattggagg tgnctttcct gctctaaagc attttgatgt ctcattctgt gtttggtaac 1861 ccctataaac tggggcagag gaaaagaatg atggttcaag gccatacttc ccttgaacct 1921 tgtgtggttc ttgcctaact ctgtggtttt tggaccccat ggggcccaga cagagcacag 1981 gagcatgggc tgcctctgag tgtggtgttg aacttcggga ggagcaggga gccctgcacc 2041 ttgtgtcctg gcccacctga cctttggtgt tctccggatc cttttcagcc cgaggcctga 2101 cagacgcggg cagtgatgag ccctgttctg gagtggaaag agcacgatag agcaccaggc 2161 taagaggcac gagatcaagg cggtagtcac ttccgctctg cagctagcat ttcaaccata 2221 tgtggatcct ttcatttctc agctccctgg attccttccc ctaaattagg acctattatt 2281 tacctgtagg taagcaagct actgtagctc ttctgaggta tctcccaggc tgttttctgt 2341 agcctcagan tgcctatctn cttagcctga gaacaggtag atgnaaacta aactgatgcc 2401 taggcccagg gtcagtctca gatggaagct gggcctgggt ggggaggcta gcatgcgtgg 2461 ctccctgggt atttctgtca gtccccatgg caagcagtga tttagtaaaa caccccagag 2521 tcagggaagc caaccacctt gaaaccttta ggacatctct gctttggaga aagacccaga 2581 gatcaggcag aggtgcagat tcantcatta ctcataacct ttgagagatg tcacntgggn 2641 ggagtgttag tctttgtttt ggagntgggc cattcttgca ccccccagga cttagagcag 2701 tttgntcata aagacatcct ttattataaa aggaagtatt tataggatga tagagaccat 2761 cagatagaag cagggggggt agataacttt taggnccttg atgtgtggag aagataaaat 2821 ttaaacaata aatttgccac ttagattttc tancaccaca gtccgacgaa agatagttat 2881 atacaacatt ctgttttctg ataacaactg tgattcacct tcagaattgg ccattttttt 2941 tgtgagtttc cttgcatcaa ggacactgag aaacacagtc attgtcttag gtgttctatg 3001 ggaggaagtg aatagagcct ttaggaactt cctggtcaag cttatggtgc ttattttgat 3061 ctgggccact tccctccttc cagtcatgag taatcatcaa ggagcaagtt ggagtgtttc 3121 aggtgtatat tttgtagaac ccaaaagatt ggagccttaa caataaacat cagagcggcc 3181 gcgaattc // LOCUS D88435 4331 bp mRNA PRI 08-OCT-1997 DEFINITION Homo sapiens mRNA for HsGAK, complete cds. ACCESSION D88435 NID g2506079 KEYWORDS HsGAK. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Kimura,S.H., Tsuruga,H., Yabuta,N., Endo,Y. and Nojima,H. TITLE Structure, expression, and chromosomal localization of human GAK JOURNAL Genomics 44 (2), 179-187 (1997) MEDLINE 97446136 REFERENCE 2 (bases 1 to 4331) AUTHORS Nojima,H. TITLE Direct Submission JOURNAL Submitted (16-OCT-1996) to the DDBJ/EMBL/GenBank databases. Hiroshi Nojima, Research Institute for Microbial Diseases, Osaka University, Department of Molecular Genetics; Yamadaoka 3-1, Suita City 565, Japan (E-mail:hnojima@biken.osaka-u.ac.jp, Tel:06-875-3980, Fax:06-875-5192) FEATURES Location/Qualifiers source 1..4331 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="4" /map="4p16" CDS 1..3936 /codon_start=1 /product="HsGAK" /db_xref="PID:d1023491" /db_xref="PID:g2506080" /translation="MSLLQSALDFLAGPGSLGGASGRDQSDFVGQTVELGELRLRVRR VLAEGGFAFVYEAQDVGSGREYALKRLLSNEEEKNRAIIQEVCFMKKLSGHPNIVQFC SAASIGKEESDTGQAEFLLLTELCKGQLVEFLKKMESRGPLSCDTVLKIFYQTCRAVQ HMHRQKPPIIHRDLKVENLLLSNQGTIKLCDFGSATTISHYPDYSWSAQRRALVEEEI TRNTTPMYRTPEIIDLYSNFPIGEKQDIWALGCILYLLCFRQHPFEDGAKLRIVNGKY SIPPHDTQYTVFHSLIRAMLQVNPEERLSIAEVVHQLQEIAAARNVNPKSPITELLEQ NGGYGSATLSRGPPPPVGPAGSGYSGGLALAEYDQPYGGFLDILRGGTERLFTNLKDT SSKVIQSVANYAKGDLDISYITSRIAVMSFPAEGVESALKNNIEDVRLFLDSKHPGHY AVYNLSPRTYRPSRFHNRVSECGWAARRAPHLHTLYNICRNMHAWLRQDHKNVCVVHC MDGRAASAVAVCSFLCFCRLFSTAEAAVYMFSMKRCPPGIWPSHKRYIEYMCDMVAEE PITPHSKPILVRAVVMTPVPLFSKQRSGCRPFCEVYVGDERVASTSQEYDKMRDFKIE DGKAVIPLGVTVQGDVLIVIYHARSTLGGRLQAKMASMKMFQIQFHTGFVPRNATTVK FAKYDLDACDIQEKYPDLFQVNLEVEVEPRDRPSREAPPWENSSMRGLNPKILFSSRE EQQDILSKFGKPELPRQPGSTAQYDAGAGSPEAEPTDSDSPPSSSADASRFLHTLDWQ EEKEAETGAENASSKESESALMEDRDESEVSDEGGSPISSEGQEPRADPEPPGLAAGL VQQDLVFEVETPAVLPEPVPQEDGVDLLGLHSEVGAGPAVPPQACKAPSSNTDLLSCL LGPPEAASQGPPEDLLSEDPLLLASPAPPLSVQSTPRGGPPAAADPFGPLLPSSGNNS QPCSNPDLFGEFLNSDSVTVPPSFPSAHSAPPPSCSADFLHLGDLPGEPSKMTASSSN PDLLGGWAAWTETAASAVAPTPATEGPLFSPGGQPAPCGSQASWTKSQNPDPFADLGD LSSGLQGSPAGFPPGGFIPKTATTAKGSSSWQTSRPPAQGASWPPQAKPPPKACTQPR PNYASNFSVIGAREERGVRAPSFAQKPKVSENDFEDLLSNQGFSSRSDKKGPKTIAEM RKQDLAKDTDPLKLKLLDWIEGKERNIRALLSTLHTVLWDGESRWTPVGMADLVAPEQ VKKHYRRAVLAVHPDKAAGQPYEQHAKMIFMELNDAWSEFENQGSRPLF" BASE COUNT 903 a 1340 c 1299 g 789 t ORIGIN 1 atgtcgctgc tgcagtctgc gctcgacttc ttggcgggtc caggctccct gggcggtgct 61 tccggccgcg accagagtga cttcgtgggg cagacggtgg aactgggcga gctgcggctg 121 cgggtgcggc gggtcctggc cgaaggaggg tttgcatttg tgtatgaagc tcaagatgtg 181 gggagtggca gagagtatgc attaaagagg ctattatcca atgaagagga aaagaacaga 241 gccatcattc aagaagtttg cttcatgaaa aagctttccg gccacccgaa cattgtccag 301 ttttgttctg cagcgtctat aggaaaagag gagtcagaca cggggcaggc tgagttcctc 361 ttgctcacag agctctgtaa agggcagctg gtggaatttt tgaagaaaat ggaatctcga 421 ggcccccttt cgtgcgacac ggttctgaag atcttctacc agacgtgccg cgccgtgcag 481 cacatgcacc ggcagaagcc gcccatcatc cacagggacc tcaaggttga gaacttgttg 541 cttagtaacc aagggaccat taagctgtgt gactttggca gtgccacgac catctcgcac 601 taccctgact acagctggag cgcccagagg cgagccctgg tggaggaaga gatcacgagg 661 aatacaacac caatgtatag aacaccagaa atcatagact tgtattccaa cttcccgatc 721 ggcgagaagc aggatatctg ggccctgggc tgcatcttgt acctgctgtg cttccggcag 781 cacccttttg aggatggagc gaaacttcga atagtcaatg ggaagtactc gatccccccg 841 cacgacacgc agtacacggt cttccacagc ctcatccgcg ccatgctgca ggtgaacccg 901 gaggagcggc tgtccatcgc cgaggtggtg caccagctgc aggagatcgc ggccgcccgc 961 aacgtgaacc ccaagtctcc catcacagag ctcctggagc agaatggagg ctacgggagc 1021 gccacactgt cccgagggcc accccctccc gtgggccccg ctggcagtgg ctacagtgga 1081 ggcctggcgc tggcggagta cgaccagccg tatggcggct tcctggacat tctgcggggt 1141 gggacagagc ggctcttcac caacctcaag gacacctcct ccaaggtcat ccagtccgtc 1201 gctaattatg caaagggtga cctggacata tcttacatca catccagaat tgcagtgatg 1261 tcattcccag cagaaggtgt ggagtcagcg ctcaaaaaca acatcgaaga tgtgcggttg 1321 ttcctggact ccaagcaccc agggcactat gccgtctaca acctgtcccc gaggacctac 1381 cggccctcca ggttccacaa ccgggtctcc gagtgtggct gggcagcacg gcgggcccca 1441 cacctgcaca ccctgtacaa catctgcagg aacatgcacg cctggctgcg gcaggaccac 1501 aagaacgtct gcgtcgtgca ctgcatggac gggagagccg cgtctgctgt ggccgtctgc 1561 tccttcctgt gcttctgccg tctcttcagc accgcggagg ccgccgtgta catgttcagc 1621 atgaagcgct gcccaccagg catctggcca tcccacaaaa ggtacatcga gtacatgtgt 1681 gacatggtgg cggaggagcc catcacaccc cacagcaagc ccatcctggt gagggccgtg 1741 gtcatgacac ccgtgccgct gttcagcaag cagaggagcg gctgcaggcc cttctgcgag 1801 gtctacgtgg gggacgagcg tgtggccagc acctcccagg agtacgacaa gatgcgggac 1861 tttaagattg aagatggcaa agcggtgatt cccctgggcg tcacggtgca aggagacgtg 1921 ctcatcgtca tctatcacgc ccggtccact ctgggcggcc ggctgcaggc caagatggca 1981 tccatgaaga tgttccagat tcagttccac acggggtttg tgcctcggaa cgccaccact 2041 gtgaaatttg ccaagtatga cctggacgcg tgtgacattc aagaaaaata cccggattta 2101 tttcaagtga acctggaagt ggaggtggag cccagggaca ggccgagccg ggaagcccca 2161 ccatgggaga actcgagcat gagggggctg aaccccaaaa tcctgttttc cagccgggag 2221 gagcagcaag acattctgtc taagtttggg aagccggagc ttccccggca gcctggctcc 2281 acggctcagt atgatgctgg ggcagggtcc ccggaagccg aacccacaga ctctgactca 2341 ccgccaagca gcagcgcgga cgccagtcgc ttcctgcaca cgctggactg gcaggaagag 2401 aaggaggcag agactggtgc agaaaatgcc tcttccaagg agagcgagtc tgccctgatg 2461 gaggacagag acgagagtga ggtgtcagat gaagggggat ccccgatctc cagcgagggc 2521 caggaaccca gggccgaccc agagcccccc ggcctggcag cagggctggt gcagcaggac 2581 ttggtttttg aggtggagac accggctgtg ctgccagagc ctgtgccaca ggaagacggg 2641 gtcgacctcc tgggcctgca ctccgaggtg ggcgcagggc cagctgtacc cccgcaggcc 2701 tgcaaggccc cctccagcaa caccgacctg ctcagctgcc tccttgggcc ccctgaggcc 2761 gcctcccagg ggcccccgga ggatctgctc agcgaggacc cgctgctcct ggcaagcccg 2821 gcccctcccc tgagcgtgca gagcacccca agaggagggc cccctgccgc tgctgacccc 2881 tttggcccgc ttctgccgtc ttcaggcaac aactcccagc cctgctccaa tcctgatctc 2941 ttcggcgaat ttctcaattc ggactctgtg accgtcccac catccttccc gtctgcccac 3001 agcgctccgc ccccatcctg cagcgccgac ttcctgcacc tgggggatct gccaggagag 3061 cccagcaaga tgacagcctc gtccagcaac ccagacctgc tgggaggatg ggctgcctgg 3121 accgagactg cagcgtcggc agtggccccc acgccagcca cagaaggccc cctcttctct 3181 cctggaggtc agccggcccc ttgtggctct caggccagct ggaccaagtc tcagaacccg 3241 gacccatttg ctgaccttgg cgacctcagc tccggcctcc aaggctcacc agctggattt 3301 cctcctgggg gcttcattcc caaaacggcc accacggcca aaggcagcag ctcctggcag 3361 acaagtcggc cgccagccca gggcgcctca tggccccctc aggccaagcc gccccccaaa 3421 gcctgcacac agccaaggcc taactatgcc tcgaacttca gtgtgatcgg ggcgcgggag 3481 gagcgggggg tccgcgcacc cagctttgct caaaagccaa aagtctctga gaacgacttt 3541 gaagatctgt tgtccaatca aggcttctcc tccaggtctg acaagaaagg gccaaagacc 3601 attgcagaga tgaggaagca ggacctggct aaagacacgg acccactcaa gctgaagctc 3661 ctggactgga ttgagggcaa ggagcggaac atccgggccc tgctgtccac gctgcacaca 3721 gtgctgtggg acggggagag ccgctggacg cccgtgggca tggccgacct ggtggctccg 3781 gagcaagtga agaagcacta tcgccgcgcg gtgctggccg tgcaccccga caaggctgcg 3841 gggcagccgt acgagcagca cgccaagatg atcttcatgg agctgaatga cgcctggtcg 3901 gagtttgaga accagggctc ccggcccctc ttctgaggcc gcagtggtgg tggctgcgca 3961 cacagctcca caggttggga gccgtcgtgg gacctgggtc cccaccgtga ggaccccgtg 4021 ggcgacagca ggtgtggcca gggtggggct ccgagccccg ggtcaccgcc cgcccagcgt 4081 tccaggcaca tgaagagaaa gcattccaaa gcctctgatt gttgtttcct ttttctcctc 4141 ccgaaggaac agctgattca tgctcctccc gcaattgtca cgtctgtgat ttatttggtg 4201 tttcgggcgt ggcctctgga gccccggcac gtggtgggcc acgctgctgg cgctcatggg 4261 ccctggtgtt tgcaccgcac tttgtaatca gtcccgtggt tgtctgtaca gaattaaact 4321 attttccgat g // LOCUS D88460 1792 bp mRNA PRI 07-OCT-1997 DEFINITION Homo sapiens mRNA for N-WASP, complete cds. ACCESSION D88460 NID g2116983 KEYWORDS N-WASP. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Fukuoka,M., Miki,H. and Takenawa,T. TITLE Identification of N-WASP homologs in human and rat brain JOURNAL Gene 196 (1-2), 43-48 (1997) MEDLINE 97464048 REFERENCE 2 (bases 1 to 1792) AUTHORS Fukuoka,M. TITLE Direct Submission JOURNAL Submitted (21-OCT-1996) to the DDBJ/EMBL/GenBank databases. Maiko Fukuoka, Institute of Medical Science The University of Tokyo, Biochemistry; Shirokanedai 4-6-1, Minato-ku, Tokyo 108, Japan (Tel:03-5449-5417, Fax:03-5449-5417) FEATURES Location/Qualifiers source 1..1792 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 255..1772 /codon_start=1 /product="N-WASP" /db_xref="PID:d1020943" /db_xref="PID:g2116984" /translation="MSSVQQQPPPPRRVTNVGSLLLTPQENESLFTFLGKKCVTMSSA VVQLYAADRNCMWSKKCSGVACLVKDNPQRSHFLRIFDIKDGKLLWEQELYNNFVYNS PRGYFHTFAGDTCQVALNFANEEEAKKFRKAVTDLLGRRQRKSEKRRDPPNGPNLPMA TVDIKNPEITTNRFYGPQVNNISHTKEKKKGKAKKKRLTKGDIGTPSNFQHIGHVGWD PNTGSDLNNLDPELKNLFDMCGILEAQLKERETLKVIYDFIEKTGGVEAVKNELRRQA PPPPPPSRGGPPPPPPPPHSSGPPPPPARGRGAPPPPPSRAPTAAPPPPPPSRPSVEV PPPPPNRMYPPPPPALPSSAPSGPPPPPPSVLGVGPVAPPPPPPPPPPPGPPPPPGLP SDGDHQVPTTAGNKAALLDQIREGAQLKKVEQNSRPVSCSGRDALLDQIRQGIQLKSV ADGQESTPPTPAPTSGIVGALMEVMQKRSKAIHSSDEDEDEDDEEDFEDDDEWED" BASE COUNT 498 a 493 c 416 g 385 t ORIGIN 1 ggcagaggga caacgaccat ccggccctag cctggccggg cgggtgccgg gagcttccct 61 ttctcagcgc ggcggaaggt ggctcgccgt cagcgcctgc ttccctcgac ctcgtcctcc 121 tccccgctcc ggacgagccg agatgtggcg cctctgactc cacttctccc cgcccctgtc 181 accgagaggg ggaacgagct ctcgcccact cgccggagag acggccctgg actcccaacc 241 ccgccggcga aaccatgagc tccgtccagc agcagccgcc gccgccgcgg agggtcacca 301 acgtggggtc cctgttgctc accccgcagg agaacgagtc cctcttcact ttcctcggca 361 agaaatgtgt gactatgtct tcagcagtgg tgcagttata tgcagcagat cggaactgta 421 tgtggtcaaa gaagtgcagt ggtgttgctt gtcttgttaa ggacaatcca cagagatctc 481 attttttaag aatatttgac attaaggatg ggaaactatt gtgggaacaa gagctataca 541 ataactttgt atataatagt cctagaggat attttcatac ctttgctgga gatacttgtc 601 aagttgctct taattttgcc aatgaagaag aagcaaaaaa atttcgaaaa gcagttacag 661 accttttggg ccgtcgacaa aggaaatctg agaaaagacg agatccccca aatggtccta 721 atctacccat ggctacagtt gatataaaaa atccagaaat cacaacaaat agattttatg 781 gtccacaagt caacaacatc tcccatacca aagaaaagaa gaagggaaaa gctaaaaaga 841 agagattaac caagggagat ataggaacac caagcaattt ccagcacatt ggacatgttg 901 gttgggatcc aaatacaggc tctgatctga ataatttgga tccagaattg aagaatcttt 961 ttgatatgtg tggaatctta gaggcacaac ttaaagaaag agaaacatta aaagttatat 1021 atgactttat tgaaaaaaca ggaggtgttg aagctgttaa aaatgaactg cggaggcaag 1081 caccaccacc tccaccacca tcaaggggag ggccacctcc tcctcctccc cctccacata 1141 gctcgggtcc tcctcctcct cctgctaggg gaagaggcgc tcctccccca ccaccttcaa 1201 gagctcccac agctgcacct ccaccaccgc ctccttccag gccaagtgta gaagtccctc 1261 caccaccgcc aaataggatg taccctcctc cacctccagc ccttccctcc tcagcacctt 1321 cagggcctcc accaccacct ccatctgtgt tgggggtagg gccagtggca ccacccccac 1381 cgcctccacc tccacctcct cctgggccac cgcccccgcc tggcctgcct tctgatgggg 1441 accatcaggt tccaactact gcaggaaaca aagcagctct tttagatcaa attagagagg 1501 gtgctcagct aaaaaaagtg gagcagaaca gtcggccagt gtcctgctct ggacgagatg 1561 cactgttaga ccagatacga cagggtatcc aactaaaatc tgtggctgat ggccaagagt 1621 ctacaccacc aacacctgca cccacttcag gaattgtggg tgcattaatg gaagtgatgc 1681 agaaaaggag caaagccatt cattcttcag atgaagatga agatgaagat gatgaagaag 1741 attttgagga tgatgatgag tgggaagact gatctatata ttatatatat ac // LOCUS D88532 3371 bp mRNA PRI 05-NOV-1996 DEFINITION Human mRNA for p55pik, complete cds. ACCESSION D88532 NID g1661000 KEYWORDS p55pik. SOURCE Homo sapiens cDNA to mRNA, clone_lib:library of Jurkat cells clone:p55pik. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3371) AUTHORS Suzuki,T. TITLE Molecular cloning of human p55pik JOURNAL Unpublished (1996) REFERENCE 2 (bases 1 to 3371) AUTHORS Suzuki,T. TITLE Direct Submission JOURNAL Submitted (23-OCT-1996) to the DDBJ/EMBL/GenBank databases. Toru Suzuki, Institute of Medical Science; Sirokanedai 4-6-1, Minatoku, Tokyo 108, Japan (E-mail:toru@hgc.ims.u-tokyo.ac.jp, Tel:03-5449-5303, Fax:03-5449-5413) FEATURES Location/Qualifiers source 1..3371 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="p55pik" /clone_lib="library of Jurkat cells" CDS 421..1806 /codon_start=1 /product="p55pik" /db_xref="PID:d1014332" /db_xref="PID:g1661001" /translation="MYNTVWSMDRDDADWREVMMPYSTELIFYIEMDPPALPPKPPKP MTSAVPNGMKDSSVSLQDAEWYWGDISREEVNDKLRDMPDGTFLVRDASTKMQGDYTL TLRKGGNNKLIKIYHRDGKYGFSDPLTFNSVVELINHYHHESLAQYNPKLDVKLMYPV SRYQQDQLVKEDNIDAVGKKLQEYHSQYQEKSKEYDRLYEEYTRTSQEIQMKRTAIEA FNETIKIFEEQCHTQEQHSKEYIERFRREGNEKEIERIMMNYDKLKSRLGEIHDSKMR LEQDLKKQALDNREIDKKMNSIKPDLIQLRKIRDQHLVWLNHKGVRQKRLNVWLGIKN EDADENYFINEEDENLPHYDEKTWFVEDINRVQAEDLLYGKPDGAFLIRESSKKGCYA CSVVADGEVKHCVIYSTARGYGFAEPYNLYSSLKELVLHYQQTSLVQHNDSLNVRLAY PVHAQMPSLCR" BASE COUNT 950 a 657 c 824 g 940 t ORIGIN 1 tttctcccaa cgtgttcttt tttttcctct tcattctccc tccttcgagg acacaaaagt 61 ggcttccgcg gaaagatttg gaggcggtgg gagcttttct ccccggagag cgactgtgta 121 gaaaggattt ttgggaagcc gctttttaac acctctgctc tccgtccccc aagcctctgt 181 gtaatcctct gaggagaaaa gcccatagct tgaaagttcg ggggcatttt gttgtgttct 241 gtaggagaga gggggaggac cctgttcggg tagtttggcc ggactggtac tggccgttgg 301 aaaacccgaa gtacatttcc gtgtggaact tttgcagata tatattttta gatttttaaa 361 taccagataa aaaatatatg ccttctatat atctcctggc gacctgcccc tgacagcgcg 421 atgtacaata cggtgtggag tatggaccgc gatgacgcag actggaggga ggtgatgatg 481 ccctattcga cagaactgat attttatatt gaaatggatc ctccagctct tccaccaaag 541 ccacctaagc caatgacttc agcagttcca aatggaatga aggacagttc tgtttctctt 601 caggatgcag aatggtactg gggggatatt tcaagggagg aggtaaatga caaattgcgg 661 gatatgccag atgggacctt cttggtccga gatgcctcaa caaaaatgca gggagattat 721 actttgactt tgcggaaggg aggcaataat aagttaataa agatctatca ccgggatggt 781 aaatatggct tttctgatcc tctgacattt aattccgtgg tggagctcat taaccactat 841 caccatgaat ctcttgctca gtacaatccc aaacttgatg tgaagctgat gtacccagtg 901 tccagatacc aacaggatca gttggtaaaa gaagataata ttgatgcagt aggtaaaaaa 961 ctgcaagaat accactctca gtatcaggag aagagtaaag agtatgatag gctgtatgaa 1021 gaatatacta gaacatccca ggaaatacag atgaagagga ctgcaataga agcttttaat 1081 gaaacaatta aaatatttga agagcagtgt cacacacaag aacaacatag caaagaatat 1141 attgagcgat ttcgcagaga ggggaatgaa aaggagattg aacgaattat gatgaattat 1201 gataaattga aatcacgtct gggtgagatt catgatagca aaatgcgtct agagcaggat 1261 ttgaagaaac aagctttgga caaccgagaa atagataaaa aaatgaatag tatcaaacct 1321 gacctgatcc agctgcgaaa gatccgagat caacaccttg tatggctcaa tcacaaagga 1381 gtgagacaga aacgcctgaa tgtctggctg ggaattaaga atgaggatgc tgatgagaac 1441 tattttatca atgaggaaga tgaaaacctg ccccattatg atgagaaaac ctggtttgtt 1501 gaggatatca atcgagtaca agcagaggac ttgctttatg ggaaacctga tggtgcattc 1561 ttaattcgtg agagtagcaa gaaaggatgc tatgcttgct ctgtggtggc cgatggggaa 1621 gtgaagcact gtgtgatcta cagcactgct cggggctatg gctttgcaga gccctacaac 1681 ctgtacagct ctctgaagga gctagtgctc cattaccagc agacatcctt ggttcagcac 1741 aacgactccc tcaacgtcag gcttgcctac cctgttcatg cacagatgcc ctcgctttgc 1801 agataaagag gaagtgggaa gagaggtggt cttctggcat ttttttctac agtttttatt 1861 agactacgat gagggcattc tttctacata gactgcttgt tttgcacaag aagtgatttt 1921 gtgaatgtga agtggagagg ccgaggagca gccggccggg atgggggcat tagaggcctg 1981 aggttctcta ggactcagcc atgccgctgc actgacatac taagctggaa gcagatgttt 2041 tttttgaaag tctgtttcat tggggttttt gttttgttta gccagacacc ctcaacagaa 2101 tattaggctt gatggttata gcgggtgggg ttgtatttgg aagcctctga agagaccatg 2161 tctttttaaa atctaactct tgagagtgca gcaggggcat ggctctgctg ggagttgtgt 2221 tttgctttgg cagtctctct tccccccacg aagaaggctg tttaggtttt gtgatagaat 2281 gggatttgat gaaaaagaca accaaaggaa aatggggagg cttgggattt catttaaata 2341 atctaagcca agatgataaa aaaaaccttc aactgaaggt actttgtttc ttacaacata 2401 atttaggctt cagcatctca ccagcccctc cctctgaaga agtattatgt tcagaagcca 2461 acaaaacagt ttgttgccag accaatgttt gatgggaaaa cgtggcactc atagttgaat 2521 gtatacttct gtaccaaaac ttgaacataa aagactagaa tttgtgagtt ttagcaaacg 2581 ctaaattgat cactgtaact aaccccttct gtccttcctg cctgtttctc tgagatgagg 2641 aatagcattc tttttgtggg gatggtgagc tttgaatcat aaaatgaagt tggtgcttgt 2701 atggtgtttc cttagcctaa agaatgatct gttgtttgaa acctttgtaa cttgtttgta 2761 tgagtaaaga aaaggtgcaa tgcagtgctt ttagatggct tgatatacca aataacaata 2821 tagacaacat tattatatgt gcttccccaa gtttaaaggc cctgcagaaa tagtaaacat 2881 ggtttaattt cccttcattt ccccctcctt tgctggatgg ggttttggga gctataggtt 2941 gctaaggagg ggagtcagat tgtggtcagg tgcctcagta aatcacagac ccaggggccc 3001 tgtggtccag ggtgagagtc acaccacatt acacatgtgc ttccatacag tggtttctga 3061 agcttttgca gggagagaag atggcttagt gtttagactg ttagtagaag ccatctggaa 3121 gcttttctct ttgccttttt ttgtgatcct gccattaagg ctatgtgcag tctgccctcc 3181 tgctccagtg gccttgattt taggccagga atcttctgct ccatgtggct taagccttcc 3241 agctgagtga agctaggcaa atggagtggg ggcaggcatc tattcctgcc cccatcatgc 3301 cccacaccca tcagtcaaca ctcatttgac aaatagagtc cagctgcctc tgagccaatc 3361 ctgggaccta a // LOCUS D88613 1651 bp mRNA PRI 08-JAN-1997 DEFINITION Human mRNA for hGCMa, complete cds. ACCESSION D88613 NID g1769819 KEYWORDS hGCMa. SOURCE Homo sapiens female placenta cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Akiyama,Y., Hosoya,T., Poole,A.M. and Hotta,Y. TITLE The gcm-motif: a novel DNA-binding motif conserved in Drosophila and mammals JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (25), 14912-14916 (1996) MEDLINE 97121489 REFERENCE 2 (bases 1 to 1651) AUTHORS Akiyama,Y., Hosoya,T., Anthony,P.M. and Yoshiki,H. TITLE The gcm-motif: A novel DNA binding motif conserved in Drosophila and mammals JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1996) In press REFERENCE 3 (bases 1 to 1651) AUTHORS Hosoya,T. TITLE Direct Submission JOURNAL Submitted (28-OCT-1996) to the DDBJ/EMBL/GenBank databases. Toshihiko Hosoya, University of Tokyo, Molecular Genetics Laboratory; Hongo 7-3-1, Bunkyo, Tokyo 113, Japan (E-mail:hosoya@bio.phys.s.u-tokyo.ac.jp, Tel:03-3812-2111(ex.3034), Fax:03-5684-0785) FEATURES Location/Qualifiers source 1..1651 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="female" /tissue_type="placenta" gene 213..1523 /gene="hGCMa" CDS 213..1523 /gene="hGCMa" /note="GMC protein is a DNA-binding protein with gcm-motif (glial cell missing motif)" /codon_start=1 /product="hGCMa" /db_xref="PID:d1014347" /db_xref="PID:g1769820" /translation="MEPDDSDSEDKEILSWDINDVKLPQNVKKTDWFQEWPDSYAKHI YSSEDKNAQRHLSSWAMRNTNNHNSRILKKSCLGVVVCGRDCLAEEGRKIYLRPAICD KARQKQQRKRCPNCDGPLKLIPCRGHGGFPVTNFWRHDGRFIFFQSKGEHDHPKPETK LEAEARRAMKKVNTAPSSVSLSLKGSTETRSLPGETQSQGSLPLTWSFQEGVQLPGSY SGHLIANTPQQNSLNDCFSFSKSYGLGGITDLTDQTSTVDPMKLYEKRKLSSSRTYSS GDLLPPSASGVYSDHGDLQAWSKNAALGRNHLADNCYSNYPFPLTSWPCSFSPSQNSS EPFYQQLPLEPPAAKTGCPPLWPNPAGNLYEEKVHVDFNSYVQSPAYHSPQGDPFLFT YASHPHQQYSLPSKSSKWDFEEEMTYLGLDHCNNDMLLNLCPLR" BASE COUNT 466 a 422 c 379 g 384 t ORIGIN 1 gagcctgctg ggacttgaac cagcagtaag attttcacga cacagtgctg tctgcttctc 61 cgtaagaagt tagaagccct agaaaacaat ctcctggtcc aaggtgcttg agtgggccga 121 tccagctata tcaagaacct ttgagaacaa aattctcaag catttctgag gggagtcgaa 181 taggtgaaaa ccttggctgg cctgacctta tcatggaacc tgacgactct gattctgaag 241 acaaagagat attaagctgg gatattaatg atgtgaaact gccacagaac gtgaaaaaaa 301 ccgactggtt ccaggagtgg ccagattcct atgccaaaca catctacagc tcggaggaca 361 agaatgcgca gcggcacctg agcagctggg ccatgcgcaa taccaacaac cacaactccc 421 gcatcctcaa gaagtcctgc ctgggtgtgg tggtgtgcgg ccgcgactgt ctcgcagagg 481 aggggcgcaa gatctacctg agacctgcca tctgtgacaa ggcccggcag aagcagcagc 541 ggaaacgctg tcccaactgt gacgggcctc tgaagctcat cccttgccga ggtcatgggg 601 gcttcccggt caccaacttc tggaggcacg acggacgctt tatatttttc cagtcaaagg 661 gagagcatga tcatccaaaa ccagaaacca agttagaagc tgaggcaaga agagccatga 721 agaaagtgaa cacagcacct tcctccgtct cattgagcct gaaggggagc acagagacca 781 ggtctcttcc aggtgaaaca caaagtcagg ggagtttacc tttaacttgg tctttccagg 841 aaggcgtcca attgcctggt agttacagtg gacatttaat agctaacact cctcagcaga 901 actcactaaa tgattgcttt tccttctcca agagttatgg tctgggagga atcacagatc 961 tgactgacca gacttccact gtggacccca tgaagctcta tgaaaagcgc aaattgtcca 1021 gtagcagaac ctacagtagt ggagacctgc ttcctccttc tgcctccgga gtctactctg 1081 atcatggcga tctacaagcg tggagtaaaa atgctgcttt ggggagaaat catcttgctg 1141 acaactgtta ttccaattat ccttttcctc tgaccagctg gccttgcagc ttctctcctt 1201 cccaaaactc ttcagaaccc ttttaccagc agcttccatt ggagccacct gcagccaaaa 1261 ctggctgtcc cccattatgg ccaaatccag cgggtaatct ttatgaagag aaagtacatg 1321 tggattttaa cagctacgtc cagtctcctg cataccattc acctcaagga gacccctttc 1381 tcttcaccta cgcctctcat cctcatcagc aatattcact gccaagcaag agcagcaaat 1441 gggattttga ggaagaaatg acatacttgg gtttggatca ctgcaacaat gatatgcttc 1501 tgaacctgtg tcctttgaga tgacccaaat ctttcactat gtgcacccca gcccctcaaa 1561 aatggggaag ggctgaaaga atttccttag gaaataattt ttaaaacata accacagata 1621 aatgagaatc atgataagca gtagacaagg c // LOCUS D88667 1791 bp mRNA PRI 05-MAR-1997 DEFINITION Human mRNA for cerebroside sulfotransferase, complete cds. ACCESSION D88667 NID g1871140 KEYWORDS cerebroside sulfotransferase. SOURCE Homo sapiens renal cell carcinoma cell_line:SMKT-R3 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Honke,K., Tsuda,M., Hirahara,Y., Ishii,A., Makita,A. and Wada,Y. TITLE Molecular cloning and expression of cDNA encoding human 3'-phosphoadenylylsulfate:galactosylceramide 3'-sulfotransferase JOURNAL J. Biol. Chem. 272 (8), 4864-4868 (1997) MEDLINE 97184132 REFERENCE 2 (bases 1 to 1791) AUTHORS Honke,K., Tsuda,M., Hirahara,Y., Ishii,A., Makita,A. and Wada,Y. TITLE Molecular cloning and expression of cDNA encoding human 3'-phosphoadenosine-5'-phosphosulfate:GalCer sulfotransferase JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 1791) AUTHORS Honke,K. TITLE Direct Submission JOURNAL Submitted (29-OCT-1996) to the DDBJ/EMBL/GenBank databases. Koichi Honke, Osaka Medical Center for Maternal and Child Health, Department of Molecular Medicine, Research Institute; 840 Murodo-cho, Izumi, Osaka 590-02, Japan (E-mail:k62117a@center.osaka-u.ac.jp, Tel:+81-725-56-1220, Fax:+81-725-57-3021) FEATURES Location/Qualifiers source 1..1791 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="SMKT-R3" /cell_type="renal cell carcinoma" gene 204..1475 /gene="CST" CDS 204..1475 /gene="CST" /codon_start=1 /product="cerebroside sulfotransferase" /db_xref="PID:d1014369" /db_xref="PID:g1871141" /translation="MLPPQKKPWESMAKGLVLGALFTSFLLLVYSYAVPPLHAGLAST TPEAAASCSPPALEPEAVIRANGSAGECQPRRNIVFLKTHKTASSTLLNILFRFGQKH RLKFAFPNGRNDFDYPTFFARSLVQDYRPGACFNIICNHMRFHYDEVRGLVPTNAIFI TVLRDPARLFESSFHYFGPVVPLTWKLSAGDKLTEFLQDPDRYYDPNGFNAHYLRNLL FFDLGYDNSLDPSSPQVQEHILEVERRFHLVLLQEYFDESLVLLKDLLCWELEDVLYF KLNARRDSPVPRLSGELYGRATAWNMLDSHLYRHFNASFWRKVEAFGRERMAREVAAL RHANERMRTICIDGGHAVDAAAIQDEAMQPWQPLGTKSILGYNLKKSIGQRHAQLCRR MLTPEIQYLMDLGANLWVTKLWKFIRDFLRW" polyA_site 1791 /note="10 A nucleotides" BASE COUNT 324 a 615 c 523 g 329 t ORIGIN 1 ggcagcctgg gagttggacg tggctcaggc agtgggtaga aaggggcagc cagccacagc 61 ccgagggtct cactgtgtca tccagagctg gagtgcagcg gcacagtcat ggctcactgg 121 aactcaggct caagcaatcc tcccgcctca gccttccaag taactaggac tacaggcatg 181 tgccaccacg cctggtgtct gagatgctgc caccgcagaa gaagccctgg gagtccatgg 241 ctaaggggct ggtgctgggc gcgctcttca ctagtttcct gctgctggtg tactcctatg 301 ccgtgccccc gctgcatgcc ggcctggcct ccacgacccc ggaggccgca gcgtcctgct 361 ctccacctgc actcgagcca gaggcagtga tccgggccaa cggctcggcg ggggagtgcc 421 agccgcggcg caacatcgtg ttcttgaaga cgcacaagac ggccagcagc accctgctca 481 acatcctgtt ccgcttcggc cagaagcacc ggctcaagtt cgccttccct aacggccgca 541 atgacttcga ctacccgacc ttcttcgccc gcagcctggt gcaggactat cggcccgggg 601 cctgcttcaa catcatctgc aaccacatgc gcttccacta cgacgaggtg cgcggcctgg 661 tgccgaccaa cgccatcttc atcacggtgc tccgcgaccc cgcccgcttg ttcgagtcct 721 ccttccacta cttcgggccg gtggtgcccc tcacgtggaa gctctcggcc ggcgacaagc 781 tgaccgagtt cctgcaagac ccggatcgct actacgaccc caacggcttc aatgcccact 841 acctccgaaa cctgctcttc ttcgacctgg gctatgacaa cagcctggac cccagcagcc 901 cgcaggtgca ggagcacatc ctggaggtgg agcgtcgctt ccacctggtg ctccttcaag 961 agtacttcga cgagtcgctg gtgctgctga aggacctgct gtgctgggag ctggaggacg 1021 tgctctactt caagctcaac gcccgccgcg actcgcccgt gccgcggctc tcgggggagc 1081 tgtatgggcg cgccaccgcc tggaacatgc tggactccca cctctaccgc cacttcaacg 1141 ccagcttctg gcgcaaggtg gaggccttcg ggcgggagcg catggcccgc gaggtggccg 1201 ccctgcgcca tgccaacgag cgcatgcgga ccatctgcat cgacgggggc cacgccgtgg 1261 acgccgccgc catccaggac gaggccatgc agccctggca gccgctgggc accaagtcca 1321 tcctgggcta caacctcaag aagagcatcg ggcagcggca cgcgcagctc tgccggcgca 1381 tgctcacgcc cgagatccag tacctgatgg acctcggcgc caacctgtgg gtcaccaagc 1441 tctggaagtt cattcgcgat ttcctgcggt ggtgacgtcc caccgcccag cggcttgcct 1501 gcctgctcgc tccctgcaga ggggctgagc aggacgccgc tggtgctggc cgcccccagc 1561 cccctcctgg tgccacctca gaccccgggg tgaggggggg ctccctgggg ggaggcagcc 1621 agccaagact gggcccatga acacagagag ggcctaaccg agatcagtat ttaactaatt 1681 ataccagttt ttattaaacc cctttccctc cccgataaag aatgttctat ttctgcctcc 1741 ccttaaaggg gagacctcag aagtaaagga atttgatgtt gtgtttttgt t // LOCUS D88674 2619 bp mRNA PRI 21-NOV-1997 DEFINITION Homo sapiens mRNA for antizyme inhibitor, complete cds. ACCESSION D88674 NID g2641951 KEYWORDS antizyme inhibitor. SOURCE Homo sapiens kidney cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Koguchi,K., Kobayashi,S., Hayashi,T., Matsufuji,S., Murakami,Y. and Hayashi,S. TITLE Cloning and sequencing of a human cDNA encoding ornithine decarboxylase antizyme inhibitor JOURNAL Biochim. Biophys. Acta 1353 (3), 209-216 (1997) MEDLINE 98007871 REFERENCE 2 (bases 1 to 2619) AUTHORS Koguchi,K. TITLE Direct Submission JOURNAL Submitted (30-OCT-1996) to the DDBJ/EMBL/GenBank databases. Kazuhiko Koguchi, The Jikei University School of Medicine, Biochemistry II; Mimato-ku, Nishi Shinbashi 3-25-8, Minato-ku 105, Japan (E-mail:koguchi@jikei.ac.jp, Tel:03-3433-1111, Fax:03-3436-3897) FEATURES Location/Qualifiers source 1..2619 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" CDS 710..2056 /codon_start=1 /product="antizyme inhibitor" /db_xref="PID:d1024472" /db_xref="PID:g2641952" /translation="MKGFIDDANYSVGLLDEGTDLGNVIDNYVYEHTLTGKNAFFVGD LGKIVKKHSQWQNVVAQIKPFYTVKCNSAPAVLEILAALGTGFACSSKNEMALVQELG VPPENIIYISPCKQVSQIKYAAKVGVNILTCDNEIELKKIARNHPNAKVLLHIATEDN IGGEEGNMKFGTTLKNCRHLLECAKELDVQIIGVKFHVSSACKESQVYVHALSDARCV FDMAGEIGFTMNMLDIGGGFTGTEFQLEEVNHVISPLLDIYFPEGSGVKIISEPGSYY VSSAFTLAVNIIAKKVVENDKFPSGVEKTGSDEPAFMYYMNDGVYGSFASKLSEDLNT IPEVHKKYKEDEPLFTSSLWGPSCDELDQIVESCLLPELNVGDWLIFDNMGADSFHEP SAFNDFQRPAIYYMMSFSDWYEMQDAGITSDSMMKNFFFVPSCIQLSQEDSFSAEA" polyA_signal 2303..2308 polyA_signal 2365..2370 polyA_signal 2539..2544 polyA_signal 2598..2603 polyA_site 2619 /note="30 A nucleotides" BASE COUNT 726 a 507 c 564 g 822 t ORIGIN 1 agtttttcct tttttcttct gccgtcgcct tctctgcctc ttctcatcct ttctcgctct 61 gctgctctgc agtgtgacga gtccgaatcc tcttcccacc cagcccgcgc ctttcttctt 121 ttgcctgcgc tgttctattt ctccttcggc cgccgccgcc actgctgcac acagctggtg 181 tcggtgccgc gcttttaccc ccaagtcgtt cccgcagcct atggcccagg ccgccttggg 241 tatttctgct caaggtaacc acatccctct ttaaaaattc cgccgaaaaa gagaagacgc 301 tttacccgac tctttgggcc gttatctcac ggcgaacttt ctgaccaagt atacaactac 361 ccagagggcc taggagaagt gctgtataga gagcagttcg acttcaacgc tgagccacct 421 tgggaaccta gctgatgata ggggggttcc atctcccaac ttgtccatgg aggtcttcac 481 ttcagaaatc caagactcat attcatccag cttggtgtca agtgggctgt tgctgccaga 541 attatcttgt gattatttga gagatgtatc agtttcttct gaagtacaat caactgtaga 601 agcctttgta gcagtttgtt gcatattcta aggacccaga cataggcttg gtggcccgtc 661 tcttgtcttt cctggtttat gactttcggc tttgtggaat acggctgaga tgaaaggatt 721 tattgatgat gcaaactact ccgttggcct gttggatgaa ggaacagacc ttggaaatgt 781 tattgataac tatgtttatg aacataccct gacagggaaa aatgcatttt ttgtgggaga 841 tcttggaaag attgtgaaga aacacagtca atggcagaat gtagtggctc agataaagcc 901 attctacaca gtgaagtgca actctgctcc agctgtactt gagattttgg cagctcttgg 961 aaccggattt gcttgttcca gtaaaaatga aatggcttta gtgcaagagt tgggtgtacc 1021 tccagaaaac attatttaca taagtccttg caagcaagtg tctcagataa agtatgcagc 1081 aaaagttgga gtgaatatcc tgacatgtga caatgaaatt gaattgaaga aaattgcacg 1141 taatcaccca aatgccaagg tcttactaca tattgcaaca gaagataata ttggaggtga 1201 agagggtaac atgaagtttg gcactaccct gaagaactgt aggcatctct tggaatgtgc 1261 taaggaactt gatgtccaaa taattggggt taaatttcat gtttcgagtg cttgcaaaga 1321 atctcaagta tatgtacatg ctctatctga tgctcgatgt gtgtttgaca tggctggaga 1381 aattggcttt acgatgaaca tgttagacat tggtggagga ttcacgggaa ctgaatttca 1441 attggaagag gttaatcatg ttatcagccc tctgttggat atctactttc ctgaaggatc 1501 tggtgttaag ataatttcag aacccggaag ctactatgtg tcttctgcat ttacactcgc 1561 agttaatatc atagcaaaga aagttgttga aaatgataaa tttccctctg gagtagaaaa 1621 aaccggaagt gatgaaccag ccttcatgta ttatatgaat gatggtgttt atggttcttt 1681 tgcaagtaaa ctgtctgagg acttaaatac cattccagag gttcacaaga aatacaagga 1741 agatgagcct ctgtttacaa gcagcctttg gggtccatcc tgtgatgagc ttgatcaaat 1801 tgtggaaagc tgtcttcttc ctgagctgaa tgtgggagat tggcttatct ttgataacat 1861 gggagcagat tctttccatg aaccatctgc ttttaatgat tttcagaggc cagccattta 1921 ttacatgatg tcattcagtg attggtatga gatgcaagat gctggaatta cttcagactc 1981 aatgatgaag aacttcttct ttgtgccttc ttgcattcag ctgagccaag aagacagctt 2041 ttccgctgaa gcttaaacag gcattaacgc ttctttagat ctgaagttgc aggttaagct 2101 tgtctggtca acattccagt gtggaaaaat aatttaaaca atcttattct cttaattctt 2161 ttggcaacaa aaactattag taatagctat ttgggaccag acaaaatcag ctttcatcta 2221 taattcattg gggataatgg gagatttaga taatgtatcc agatttaaac ctaccagttt 2281 gtcctacccc ttaagcgttt aaaataaaat atgcaacaaa atggatgact tagtggagat 2341 ggaagcccat taattgggtt ccccattaaa tcgtttacat acaagaacac agtttttata 2401 ctaaggattt gtgtttaaag tcttgtaaag ttcatgtctt tcacccagat atatcaaatg 2461 ttagaagacc agtgtgactt cattagataa cgtttagtgt atttagaatg tgtaaatttg 2521 tgctttgaac tgtagtttaa taaatgtaaa attgcatcat agtatttgtt gacctaatgt 2581 aacccttgta tgattgcaat aaaattttgt gtagatttt // LOCUS D88827 3240 bp mRNA PRI 29-NOV-1997 DEFINITION Homo sapiens mRNA for zinc finger protein FPM315, complete cds. ACCESSION D88827 NID g2342505 KEYWORDS zinc finger protein FPM315. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Yokoyama,M., Nakamura,M., Okubo,K., Matsubara,K., Nishi,Y., Matsumoto,T. and Fukushima,A. TITLE Isolation of a cDNA encoding a widely expressed novel zinc finger protein with the LeR and KRAB-A domains JOURNAL Biochim. Biophys. Acta 1353 (1), 13-17 (1997) MEDLINE 97398134 REFERENCE 2 (bases 1 to 3240) AUTHORS Yokoyama,M. TITLE Direct Submission JOURNAL Submitted (07-NOV-1996) to the DDBJ/EMBL/GenBank databases. Masahiro Yokoyama, Japan Tobacco, Inc., Pharmaceutical Frontier Research Laboratories; 13-2 Fukuura 1-chome, Kanazawa-ku, Yokohama, Kanagawa 236, Japan (E-mail:yokoyama@ikrl.jti.co.jp, Tel:045-786-7694, Fax:045-786-7692) FEATURES Location/Qualifiers source 1..3240 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 293..2344 /codon_start=1 /product="zinc finger protein FPM315" /db_xref="PID:d1022711" /db_xref="PID:g2342506" /translation="MASGPGSQEREGLLIVKLEEDCAWSQELPPPDPGPSPEASHLRF RRFRFQEAAGPREALSRLQELCHGWLRPEMRTKEQILELLVLEQFLTILPQEIQSRVQ ELHPESGEEAVTLVEGMQRELGRLRQQVTNHGRGTEVLLEEPLPLETARESPSFKLEP METERSPGPRLQELLGPSPQRDPQAVKERALSAPWLSLFPPEGNMEDKEMTGPQLPES LEDVAMYISQEEWGHQDPSKRALSRDTVQESYENVDSLESHIPSQEVPGTQVGQGGKL WDPSVQSCKEGLSPRGPAPGEEKFENLEGVPSVCSENIHPQVLLPDQARGEVPWSPEL GRPHDRSQGDWAPPPEGGMEQALAGASSGRELGRPKELQPKKLHLCPLCGKNFSNNSN LIRHQRIHAAERLCMGVDCTEIFGGNPRFLSLHRAHLGEEAHKCLECGKCFSQNTHLT RHQRTHTGEKPYQCNICGKCFSCNSNLHRHQRTHTGEKPYKCPECGEIFAHSSNLLRH QRIHTGERPYKCPECGKSFSRSSHLVIHERTHERERLYPFSECGEAVSDSTPFLTNHG AHKAEKKLFECLTCGKSFRQGMHLTRHQRTHTGEKPYKCTLCGENFSHRSNLIRHQRI HTGEKPYTCHECGDSFSHSSNRIRHLRTHTGERPYKCSECGESFSRSSRLMSHQRTHT G" BASE COUNT 799 a 824 c 968 g 649 t ORIGIN 1 gtctgagtct tgcgtgggtc ctctatatag ggtgagaagc gtggcgctcg gttcctgcct 61 cggggaagtc ctggcgcaga tgggccacgg ggccggcgtg gcggcgcctg ggaccgactg 121 aggcctaggc gccggagccg gccgcgcctg ggctggagcg gggctcctcg gcctggactg 181 ggagcccccg gccccgggct cctgctggcg ccgtccaacc ttacatgggt tcagggcgcc 241 ttcgtaggcg ggcacggctg gtttcgggct aaggcgctct ggagacctga cgatggcgtc 301 gggcccgggc tcccaggaac gggaagggct cctgatagtg aagctggagg aggactgcgc 361 ctggagccag gagctgcccc cacctgaccc aggaccgagc cccgaggcct cccacttgcg 421 cttcagacgg ttccgcttcc aagaggcagc tggtccccgg gaagccctca gccggctcca 481 agagctttgc catgggtggc ttcggcctga gatgcgcacg aaggagcaga tcttggagct 541 gctggtgtta gagcagttcc tgaccatcct gccccaggag atccagagca gggtgcagga 601 gctgcatccg gagagcggcg aagaagcggt gacccttgtg gagggtatgc agagagagct 661 tgggagactg agacaacagg tcacaaacca tgggcgggga acagaagtgc ttttggagga 721 gcctttgcct ctggaaacag cacgagagtc accgagcttc aagctggagc caatggagac 781 tgagcgaagc cctggcccca ggctgcagga gctgctaggc cccagccccc aaagggaccc 841 ccaggctgta aaggagaggg cattatctgc tccctggctt tctctttttc ctcctgaagg 901 gaacatggaa gacaaggaga tgactgggcc ccagttgcct gagagcttag aggacgtggc 961 aatgtacatc tcccaggagg agtgggggca tcaggatcct agtaagaggg ccctctccag 1021 ggacacggtg caggagagtt atgagaatgt ggactcactg gagtctcaca ttcccagtca 1081 ggaggtccca ggcacccagg tgggacaagg aggaaagcta tgggatccca gtgtccagag 1141 ctgcaaggag ggcctgagcc ccagaggccc agctccagga gaagagaaat ttgagaacct 1201 ggaaggtgtt ccgtctgtat gctctgagaa catccaccct caggtgctgc ttcctgacca 1261 ggcccgaggg gaggtgccct ggagtcctga gctgggaaga cctcatgacc ggtcgcaagg 1321 ggattgggcg cctcccccag agggtggaat ggagcaggcc ttggcaggag cctcaagtgg 1381 cagagaactg gggcgaccga aggaactgca gccaaagaaa ctccatttat gtcccttgtg 1441 tggcaaaaat ttctctaaca actcaaacct aattaggcac cagagaatac atgcagctga 1501 aagactgtgt atgggtgtgg actgcactga aatctttggt gggaacccac gtttcctgtc 1561 actacacaga gcacacctgg gagaggaggc ccacaagtgc cttgaatgtg ggaaatgctt 1621 cagtcagaac acccatctga ctcgccacca acgcacccac acgggtgaga agccctatca 1681 gtgcaacatt tgcggaaaat gtttctcctg caactccaac ctccacaggc accagagaac 1741 gcacactggg gagaagccct acaagtgccc tgagtgtggg gagatctttg ctcacagttc 1801 caacctcctt cggcaccaga gaattcacac tggagagcga ccttataagt gtcccgagtg 1861 tgggaaaagt ttctctcgga gttcacacct cgtcattcac gaaagaactc atgagagaga 1921 gagactttac cccttctctg agtgtgggga agctgtgagt gacagcaccc cctttcttac 1981 aaaccatgga gcccataagg cagagaagaa gctctttgaa tgtttgactt gtgggaaaag 2041 cttccggcag ggcatgcacc tcaccagaca tcagagaaca cacacaggag agaaaccgta 2101 taaatgtacc ctttgtgggg aaaacttctc tcatagatcc aatttaatca ggcaccagag 2161 aatccacaca ggagaaaaac cctatacctg tcatgagtgc ggagacagct tctctcacag 2221 ctccaatcgg attcgccacc tgagaacgca tacgggagag agaccctata aatgttctga 2281 atgtggagaa agcttctctc ggagttcccg tcttatgagt catcagagaa ctcacacagg 2341 ttagtaacag tggggtttct ctttgcccca ggtgaggtgg catattcaga ggagcctgtt 2401 ggcaagagct ggtattccct gcccagccga ccaaatgacc tctgcattct tcaggtaatg 2461 ggggctcatt gtgagggagg tgcagaggca gcagaggatt ggcataaaac tgaaaaggag 2521 ttctgtctgc atgagaaagg atggcaagtc tctgaggtga cctcagggtg gaattctctg 2581 ttaagtccac cctgccccag ggtgctccta ccctcttggt ctttttaaag ccaaggtgcg 2641 atttgggcac ctgactgtcc agtttacctt aacaagtttg ggaatccatg tgatgttttt 2701 gatacttctt cctcatttgg gacattcagt aggagcattt gggcttccgg ggcccctgag 2761 accaaagaag aggggccaag taccctggga aatcagctga aggtcaacaa aagactggtt 2821 gtgagttgca gctgtcccga aggccccagt tgggaagcca tgggcagtcc agatcaagcc 2881 accacgtgcc ctacgatggc ctaacaggag tgcccattgg cagattacac atgtaaatat 2941 gacctcagac aaaaaggaac cagaggccca agggcaataa taaggtggaa tttgcaggtc 3001 agcccaggaa ttggcagagg aagtaggtgt ctgataaccc tttgtggaga atgagattcc 3061 ccccacctgt gtgagaaaaa taaacagctc tggagtcttg ttcctgactc cagaggaacg 3121 agagcattcc aggaaagaga gattccctgg aaaattgaaa atgtgaatcc tagggggaaa 3181 ttggggattg tgtctttccc tgttgaaaat gtttggatgg gaataaatat cttcaggaaa // LOCUS D88894 1407 bp mRNA PRI 14-JAN-1998 DEFINITION Homo sapiens mRNA for brain acyl-CoA hydrolase, complete cds. ACCESSION D88894 NID g2780413 KEYWORDS hBACH; brain acyl-CoA hydrolase. SOURCE Homo sapiens (strain:caucasian) 57-yr-old male whole cerebral brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Yamada,J., Furihata,T., Takama,H., Watanabe,T., Hosokawa,M., Satoh,T. and Suga,T. TITLE molecular cloning and sequence analysis of human brain cytosolic long-chain acyl-CoA hydrolase JOURNAL Unpublished (1996) REFERENCE 2 (bases 1 to 1407) AUTHORS Yamada,J. TITLE Direct Submission JOURNAL Submitted (08-NOV-1996) to the DDBJ/EMBL/GenBank databases. Junji Yamada, Tokyo university of Pharmacy and Life Science, Department of Clinical Biochemistry; 1432-1 Horinouchi, Hachioji, Tokyo 192-03, Japan (E-mail:junymd@ps.toyaku.ac.jp, Tel:+81-426-76-5679, Fax:+81-426-76-5679) FEATURES Location/Qualifiers source 1..1407 /organism="Homo sapiens" /strain="caucasian" /db_xref="taxon:9606" /dev_stage="57-yr-old" /sex="male" /tissue_type="whole cerebral brain" gene 62..1078 /gene="hBACH" CDS 62..1078 /gene="hBACH" /codon_start=1 /product="brain acyl-CoA hydrolase" /db_xref="PID:d1025262" /db_xref="PID:g2780414" /translation="MSGPDVETPSAIQICRIMRPDDANVAGNVHGGTILKMIEEAGAI ISTRHCNSQNGERCVAALARVERTDFLSPMCIGEVAHVSAEITYTSKHSVEVQVNVMS ENILTGAKKLTNKATLWYVPLSLKNVDKVLEVPPVVYSRQEQEEEGRKRYEAQKLERM ETKWRNGDIVQPVLNPEPNTVSYSQSSLIHLVGPSDCTLHGFVHGGVTMKLMDEVAGI VAARHCKTNIVTASVDAINFHDKIRKGCVITISGRMTFTSNKSMEIEVLVDADPVVDS SQKRYRAASAFFTYVSLSQEGRSLPVPQLVPETEDEKKRFEEGKGRYLQMKAKRQGHA EPQP" BASE COUNT 313 a 426 c 410 g 258 t ORIGIN 1 cgggccagac acctgcgccc ttctgcagcc gcccgccgca tccgccgccg cagcccccag 61 catgtcgggc ccagacgtcg agacgccgtc cgccatccag atctgccgga tcatgcggcc 121 agatgatgcc aacgtggccg gcaatgtcca cggggggacc atcctgaaga tgatcgagga 181 ggcaggcgcc atcatcagca cccggcattg caacagccag aacggggagc gctgtgtggc 241 cgccctggct cgtgtcgagc gcaccgactt cctgtctccc atgtgcatcg gtgaggtggc 301 gcatgtcagc gcggagatca cctacacctc caagcactct gtggaggtgc aggtcaacgt 361 gatgtccgaa aacatcctca caggtgccaa aaagctgacc aataaggcca ccctgtggta 421 tgtgcccctg tcgctgaaga atgtggacaa ggtcctcgag gtgcctcctg ttgtgtattc 481 ccggcaggag caggaggagg agggccggaa gcggtatgaa gcccagaagc tggagcgcat 541 ggagaccaag tggaggaacg gggacatcgt ccagccagtc ctcaacccag agccgaacac 601 tgtcagctac agccagtcca gcttgatcca cctggtgggg ccttcagact gcaccctgca 661 cggctttgtg cacggaggtg tgaccatgaa gctcatggat gaggtcgccg ggatcgtggc 721 tgcacgccac tgcaagacca acatcgtcac agcttccgtg gacgccatta attttcatga 781 caagatcaga aaaggctgcg tcatcaccat ctcgggacgc atgaccttca cgagcaataa 841 gtccatggag atcgaggtgt tggtggacgc cgaccctgtt gtggacagct ctcagaagcg 901 ctaccgggcc gccagtgcct tcttcaccta cgtgtcgctg agccaggaag gcaggtcgct 961 gcctgtgccc cagctggtgc ccgagaccga ggacgagaag aagcgctttg aggaaggcaa 1021 agggcggtac ctgcagatga aggcgaagcg acagggccac gcggagcctc agccctagac 1081 tccctcctcc tgccactggt gcctcgagta gccatggcaa cgggcccagt gtccagtcac 1141 ttagaagttc cccccttggc caaaaaccca attcacattg agagctggtg ttgtctgaag 1201 ttttcgtatc acagtgttaa cctgtactct ctcctgcaaa cctacacacc aaagctttat 1261 ttatatcatt ccagtatcaa tgctacacag tgttgtcccg agcgccggga ggcgttgggc 1321 aagaaaccct cgggaatgct tccgagcacg ctgtagggta tgggaagaac ccagcaccac 1381 taataaagct gctgcttggc tggaccc // LOCUS D89016 2219 bp mRNA PRI 28-NOV-1996 DEFINITION Human mRNA for Neuroblastoma, complete cds. ACCESSION D89016 NID g1694953 KEYWORDS Neuroblastoma. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2219) AUTHORS Sasaki,S., Takei,Y., Ito,M., Nakagawara,A., Fujiwara,T., Takahashi,E., Muto,T., Tokino,T. and Nakamura,Y. TITLE Isolation and Characterization of a Candidate Gene for Human Neuroblastoma Mapped to 1p36.3, NBR: a New Member of the Rho/Rac GEF Family JOURNAL Unpublished (1996) REFERENCE 2 (bases 1 to 2219) AUTHORS Nakamura,Y. TITLE Direct Submission JOURNAL Submitted (12-NOV-1996) to the DDBJ/EMBL/GenBank databases. Yusuke Nakamura, Laboratory of Molecular Medicine, Institute of Medical Science, University of Tokyo; 4-6-1 Shirokanedai, Minato-ku, Tokyo 108, Japan (E-mail:yusuke@ims.u-tokyo.ac.jp, Tel:81-3-5449-5372, Fax:81-3-5449-5433) FEATURES Location/Qualifiers source 1..2219 /organism="Homo sapiens" /db_xref="taxon:9606" /map="1p36.3" gene 428..1693 /gene="nbr" CDS 428..1693 /gene="nbr" /codon_start=1 /product="Neuroblastoma" /db_xref="PID:d1014441" /db_xref="PID:g1694954" /translation="MFEILTSEFSYQHSLSILVEEFLQSKELRATVTQMEHHHLFSNI LDVLGASQRFFEDLEQRHKAQVLVEDISDILEEHAEKYFHPYIAYCSNEVYQQRTLQK LISSNAAFREALREIERRPACGGLPMLSFLILPMQRVTRLPLLMDTLCLKTQGHSERY KAASRALKAISKLVRQCNEGAHRMERMEQMYTLHTQLDFSKVKSLPLISASRWLLKRG ELFLVEETGLFRKIASRPTCYLFLFNDVLVVTKKKSEESYMVQDYAQMNHIQVEKIEP SELPLPGGGNRSSSVPHPFQVTLLRNSEGRQEQLLLSSDSASDRARWIVALTHSERQW QGLSSKGDLPQVEITKAFFAKQADEVTLQQADVVLVLQQEDGWLYGERLRDGETGWFP EDFARFITSRVAVEGNVRRMERLRVETDV" polyA_signal 2201..2206 BASE COUNT 457 a 686 c 692 g 384 t ORIGIN 1 cccatttcag gtgggacttg tggcatagga gaggtcttgg gactggtgcc cagcctggtg 61 cagtcccctc ccaaagcctc cctctcccac acagccgtga gagcctgtgg gcctagagga 121 ctcagctggc aggttgcagg gaggcgcagc cttttgcaat cccccagggc cacaggttac 181 cctccttctc tctctagacc cccagctcta ccaggagatc caggagcggg gcctgaacac 241 cagccaggag tctgatgacg acatcctcga tgagtcctcc agccccgagg gaacccagaa 301 ggtggacgcc accattgtgg tcaagagcta ccggcccgcc caggtcacct ggagccagct 361 cccggaggtg gtggaattgg gcatcctgga ccagctctcc actgaggagc ggaaaaggca 421 gagggccatg ttcgagatcc tcacgtcgga gttctcctac cagcacagcc tgagcatcct 481 ggtggaggag ttcctgcagt ccaaggagct gcgggcgacc gtgacccaga tggagcacca 541 ccacctcttc tccaacatcc tggatgtcct gggtgccagt cagaggttct tcgaggacct 601 ggagcagcgg cacaaggccc aggtgctggt cgaggacatc agtgacatcc tggaggagca 661 cgctgagaag tacttccacc cctacatcgc ctactgctcc aacgaggtct accaacagcg 721 cacgctgcag aagctgataa gcagcaacgc cgccttccga gaggccctga gagagattga 781 gaggcggccg gcgtgcgggg gcctgcccat gctctccttc ctgatcctcc ccatgcagcg 841 ggtgacccgg ctgcccctcc tgatggatac gctctgcctc aagacccagg gccactccga 901 aaggtacaag gctgccagcc gtgcactgaa ggccatcagc aagctggtga ggcagtgcaa 961 cgagggggcc cacaggatgg agcgcatgga gcagatgtac acgctgcaca cacagctgga 1021 cttcagcaag gtcaagtccc tcccactgat ctccgcctcc cggtggctgc tgaagcgcgg 1081 agagctgttc ttagtggaag aaaccggact ttttcgaaaa attgccagcc ggccaacgtg 1141 ctaccttttc ctgttcaacg atgtcctggt tgtgaccaag aagaagagcg aggagagcta 1201 catggtccag gactacgccc agatgaacca catccaggta gagaagatag agccgtctga 1261 gctccctctg cccgggggcg gcaaccgtag ctcctccgtg ccccacccct tccaggtgac 1321 cctgcttcgc aacagcgagg gccgccagga gcagctcctg ctctcctcgg actccgcgag 1381 tgaccgggca cggtggatcg tggcgctcac acacagtgag agacagtggc agggcctctc 1441 cagcaaagga gacctgcccc aggtggagat caccaaggcc ttcttcgcga agcaagcaga 1501 cgaggtcaca ctgcagcagg cggacgtggt cctggttctg cagcaggagg atgggtggct 1561 ctatggcgag aggctccggg acggagagac gggatggttc cccgaggact ttgcccgctt 1621 catcaccagc cgtgtggccg tggagggcaa tgtccgcagg atggagcgtc tgcgggtgga 1681 gacggacgtg taggcctggc gaggccagcc ggcggcagca cagcctgtcc ccaatcagca 1741 agtggtcgtg cctggctcta gagagcgtgg ggagctggtc tcaaggaccc agcatggttc 1801 cctggggctt cccaagagcc tgtggctgtg gtgccgggct ccagacactt cacggaagga 1861 agatcacatg tccccagaga ggcaccccca ggcaagctcg agggggccac accgtgtccc 1921 agggagccca gcctattccc gttggctggc tgggcccctc tcagctgctg ggccccacct 1981 ccccactgca cccaggggac aactccacct ggactgatgg gcacaggagg caccaatagc 2041 gattattggg ggcaatgcga ggtctcctcc tatgcccttc ctacccctga gtgggacaag 2101 aagggccttg agtgcccagg agtgccccac gttctgagaa ggggccggcc ggagggaggg 2161 gacccggcag ggagatttcg gttttgaggt ttctaaatac attaaagtta tttcttaag // LOCUS D89052 987 bp mRNA PRI 27-NOV-1996 DEFINITION Human mRNA for proton-ATPase-like protein, complete cds. ACCESSION D89052 NID g1694672 KEYWORDS HATPL; proton-ATPase-like protein. SOURCE Homo sapiens adult male kidney tissue_lib:kideney cDNA library cDNA to mRNA, clone_lib:cDNA library clone:HATPL-8. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 987) AUTHORS Nishigori,H., Yamada,S., Fernald,A.A., LeBeau,M.M., Takeuchi,T. and Takeda,J. TITLE Cloning and Chromosomal Localization of the Gene Encoding a Protein Homologous to the yeast protein PPA1, an Proton-ATPase-like Protein JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 987) AUTHORS Takeda,J. TITLE Direct Submission JOURNAL Submitted (13-NOV-1996) to the DDBJ/EMBL/GenBank databases. Jun Takeda, Inst. for Molecular and Cellular Regulation, Gunma University, Department of Molecular Medicine; 3-39-15 showa-machi, Maebashi, Gunma 371, Japan (E-mail:jtakeda@news.sb.gunma-u.ac.jp, Tel:81-272-20-8856, Fax:81-272-20-8896) FEATURES Location/Qualifiers source 1..987 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /clone="HATPL-8" /clone_lib="cDNA library" /dev_stage="adult" /map="1p32.3" /sex="male" /tissue_lib="kideney cDNA library" /tissue_type="kidney" gene 83..700 /gene="HATPL" CDS 83..700 /gene="HATPL" /note="similar to yeast probable proton-transporting ATPase PPA1" /codon_start=1 /product="proton-ATPase-like protein" /db_xref="PID:d1014449" /db_xref="PID:g1694673" /translation="MTGLALLYSGVFVAFWACALAVGVCYTIFDLGFRFDVAWFLTET SPFMWSNLGIGLAISLSVVGAAWGIYITGSSIIGGGVKAPRIKTKNLVSIIFCEAVAI YGIIMAIVISNMAEPFSATDPKAIGHRNYHAGYSMFGAGLTVGLSNLFCGVCVGIVGS GAALADAQNPSLFVKILIVEIFGSAIGLFGVIVAILQTSRVKMGD" polyA_site 987 /note="23 A nucleotides" BASE COUNT 153 a 288 c 287 g 259 t ORIGIN 1 cagactgcgg gacggacggt ggacgctggg acgcgtttgt agctccggcc ccgccgttcc 61 gacccccgcc gccgtcgccg ccatgacggg gctagcactg ctctactccg gggtcttcgt 121 ggccttctgg gcctgcgcgc tggccgtggg agtctgctac accatttttg atttgggctt 181 ccgctttgat gtggcatggt tcctgacgga gacttcgccc ttcatgtggt ccaacctggg 241 cattggccta gctatctccc tgtctgtggt tggggcagcc tggggcatct atattaccgg 301 ctcctccatc attggtggag gagtgaaggc ccccaggatc aagaccaaga acctggtcag 361 catcatcttc tgtgaggctg tggccatcta cggcatcatc atggcaattg tcattagcaa 421 catggctgag cctttcagtg ccacagaccc caaggccatc ggccatcgga actaccatgc 481 aggctactcc atgtttgggg ctggcctcac cgtaggcctg tctaacctct tctgtggagt 541 ctgcgtgggc atcgtgggca gtggggctgc cctggccgat gctcagaacc ccagcctctt 601 tgtaaagatt ctcatcgtgg agatctttgg cagcgccatt ggcctctttg gggtcatcgt 661 cgcaattctt cagacctcca gagtgaagat gggtgactag atgatatgtg tgggtggggc 721 cgtgcctcac ttttatttat tgctggtttt cctgggacag ctggagctgt gtcccttagc 781 ctttcagagg cttggtgttc agggccctcc ctgcactccc ctcttgctgc gtgttgattt 841 ggaggcactg cagtccaggc cgagtcctca gtgcggggag caggctgctg ctgctgactc 901 tgtgcagctg cgcacctgtg tcccccacct ccaccctcaa cccatcttcc tagtgtttgt 961 gaaataaact tggtatttgt ctgggtc // LOCUS D89078 3002 bp mRNA PRI 12-JUN-1997 DEFINITION Human mRNA for leukotriene b4 receptor, complete cds. ACCESSION D89078 NID g2196448 KEYWORDS leukotriene b4 receptor. SOURCE Homo sapiens cell_line:HL-60 cells cDNA to mRNA, clone:HL-1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3002) AUTHORS Yokomizo,T. TITLE Direct Submission JOURNAL Submitted (15-NOV-1996) to the DDBJ/EMBL/GenBank databases. Takehiko Yokomizo, The University of Tokyo, Department of Biochemistry and Molecular Biology, Faculty of Medicine,; Hongo 7-3-1, Bunkyo-ku, Tokyo 113, Japan (E-mail:yokomizo@m.u-tokyo.ac.jp, Tel:03-3812-2111(ex.3448), Fax:03-3813-8732) REFERENCE 2 (bases 1 to 3002) AUTHORS Yokomizo,T., Izumi,T. and Shimizu,T. TITLE cDNA cloning of human leukotriene b4 receptor JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Yokomizo,T., Izumi,T., Chang,K., Takuwa,Y. and Shimizu,T. TITLE A G-protein-coupled receptor for leukotriene B4 that mediates chemotaxis JOURNAL Nature 387 (6633), 620-624 (1997) MEDLINE 97320501 COMMENT Sequence updated (29-May-1997). FEATURES Location/Qualifiers source 1..3002 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HL-60 cells" /clone="HL-1" CDS 1718..2776 /codon_start=1 /product="leukotriene b4 receptor" /db_xref="PID:d1021256" /db_xref="PID:g2196449" /translation="MNTTSSAAPPSLGVEFISLLAIILLSVALAVGLPGNSFVVWSIL KRMQKRSVTALMVLNLALADLAVLLTAPFFLHFLAQGTWSFGLAGCRLCHYVCGVSMY ASVLLITAMSLDRSLAVARPFVSQKLRTKAMARRVLAGIWVLSFLLATPVLAYRTVVP WKTNMSLCFPRYPSEGHRAFHLIFEAVTGFLLPFLAVVASYSDIGRRLQARRFRRSRR TGRLVVLIILTFAAFWLPYHVVNLAEAGRALAGQAAGLGLVGKRLSLARNVLIALAFL SSSVNPVLYACAGGGLLRSAGVGFVAKLLEGTGSEASSTRRGGSLGQTARSGPAALEP GPSESLTASSPLKLNELN" polyA_signal 2984..2989 BASE COUNT 596 a 888 c 808 g 710 t ORIGIN 1 gccattctct cacatcccgt gcggtcagga agcccttcct gaactctgac ttcagttctt 61 gctgcggttt ctgcccattt ttttcatatc ctctgacagc tgcgaggtca tctctgctct 121 ggcttttctc caagcagaac aagtgggggc tctggaaagg ttaagggacc tcagtggcca 181 ccattatact ttgcatcttt cctgagaagt gagagttgaa agggaagcag gaaggcccat 241 ggtcagattg aaggaaggac tttttagttt cttttttttt tttttgaaat ggagtctcgc 301 tctgtcattc aggctggagt gcagtggtgc gatctcagct cactgcagcc tccacttcct 361 gggttcacat gattctcctg cctcagcctc ccaagtagct gagactacag gcacatgcca 421 ctacacccag ctaacttttg tatttttagt agagacgggg tttcaccatg ttggccaggc 481 tggtctcaaa ctgctaacat caagtgatct gctcccctca gcctcccaaa gtgctgggat 541 taccggtatg aaccaccaca acctgccagg aatttttagt ttttagcttt tgcaggagac 601 ttcaaggaaa ggagacattc ctctgtccag gaaacgggta aggggaccat ttctgcattg 661 ctggtttccc ctcttggcag ggtgggcatg aggcatcact gttcctgctc cctcactcct 721 gctcctcatg ctcagcctgc cagctcggcc tcaactttgt gtgtctaaag tggaactgaa 781 tagtagctgt gagaagatag gaaagaggta gtgccaatct ccttgcccag atcataaatc 841 cagactcagc agggtaacca catgggcaag cacaaggtag gtgcttgggg aaaggggaag 901 taattggcat tctgtgtgat accaaggaga ccatttggat tttggcttct accaaagaga 961 atggagaatt ggttgaccta aatggaacca gtccctttaa gtaaggggag gaaagggggt 1021 gctggaagat ggccctcttc ccaccaccta gatcatagct tgaactgaag ccaaggacag 1081 agtgctgccc ccttcggcat ttactgatgt gccctcttta aatcatgatg ttatctaacc 1141 caaacccaga cccaggacct agtcacagct ccaacctaca cttcctatta atcttaaaac 1201 aaagcgaaac aaacacaaaa agatatcagc attgtagcct ccaatctgag cccatttccc 1261 ttctctggct accatacctc cttctcctat atgataccat tcactacttt gttcaattat 1321 ccagtctaga cctgcatctt gaggccacac ccagccttct cactccccac acccctcttt 1381 cctctctcac tgctccttcc tggtctcttc tcatctggcc ccacctctaa ggagtcctcc 1441 tgccttctgg gttgccctgg aaaacagact atcccccctc ctagtgaagg gagtgggtag 1501 gggtttcagc cccaccctca ggaagatgcg tcttccctgt cctctgctct gtggtacttc 1561 ctctctggct gatttagcaa acagcaccta gacctggggc caggcctttg gcagtgggac 1621 agatccaggg ataggctaca ccaccctgcc ctgaccctgg gattggcatc agcttccaac 1681 cagttcctgc caaagcttgt aagtcctccc gacggccatg aacactacat cttctgcagc 1741 acccccctca ctaggtgtag agttcatctc tctgctggct atcatcctgc tgtcagtggc 1801 gctggctgtg gggcttcccg gcaacagctt tgtggtgtgg agtatcctga aaaggatgca 1861 gaagcgctct gtcactgccc tgatggtgct gaacctggcc ctggccgacc tggccgtatt 1921 gctcactgct ccctttttcc ttcacttcct ggcccaaggc acctggagtt ttggactggc 1981 tggttgccgc ctgtgtcact atgtctgcgg agtcagcatg tacgccagcg tcctgcttat 2041 cacggccatg agtctagacc gctcactggc ggtggcccgc ccctttgtgt cccagaagct 2101 acgcaccaag gcgatggccc ggcgggtgct ggcaggcatc tgggtgttgt cctttctgct 2161 ggccacaccc gtcctcgcgt accgcacagt agtgccctgg aaaacgaaca tgagcctgtg 2221 cttcccgcgg taccccagcg aagggcaccg ggccttccat ctaatcttcg aggctgtcac 2281 gggcttcctg ctgcccttcc tggctgtggt ggccagctac tcggacatag ggcgtcggct 2341 acaggcccgg cgcttccgcc gcagccgccg caccggccgc ctggtggtgc tcatcatcct 2401 gaccttcgcc gccttctggc tgccctacca cgtggtgaac ctggctgagg cgggccgcgc 2461 gctggccggc caggccgccg ggttagggct cgtggggaag cggctgagcc tggcccgcaa 2521 cgtgctcatc gcactcgcct tcctgagcag cagcgtgaac cccgtgctgt acgcgtgcgc 2581 cggcggcggc ctgctgcgct cggcgggcgt gggcttcgtc gccaagctgc tggagggcac 2641 gggttccgag gcgtccagca cgcgccgcgg gggcagcctg ggccagaccg ctaggagcgg 2701 ccccgccgct ctggagcccg gcccttccga gagcctcact gcctccagcc ctctcaagtt 2761 aaacgaactg aactaggcct ggtggaagga ggcgcacttt cctcctggca gaatgctagc 2821 tctgagccag ttcagtacct ggaggaggag caggggcgtg gagggcgtgg agggcgtggg 2881 agcgtgggag gcgggagtgg agtggaagaa gagggagaga tggagcaaag tgagggccga 2941 gtgagagcgt gctccagcct ggctcccaca ggcagcttta accattaaaa ctgaagtctg 3001 aa // LOCUS D89092 1291 bp mRNA PRI 15-JAN-1998 DEFINITION Homo sapiens hnRNP JKTBP mRNA, complete cds. ACCESSION D89092 NID g2780747 KEYWORDS hnRNP JKTBP. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Tsuchiya,N., Kamei,D., Takano,A., Matsui,T. and Yamada,M. TITLE Cloning and characterization of a cDNA encoding a novel heterogeneous nuclear ribonucleoprotein-like protein cDNA and its expression in myeloid leukemia cells JOURNAL J. Biochem. (1998) In press REFERENCE 2 (bases 1 to 1291) AUTHORS Yamada,M. TITLE Direct Submission JOURNAL Submitted (16-NOV-1996) to the DDBJ/EMBL/GenBank databases. Michiyuki Yamada, Yokohama City University, Graduate School of Integrated Science; 22-2 Seto Kanazawa-ku, Yokohama, Kanagawa 236, Japan (E-mail:myamada@yokohama-cu.ac.jp, Tel:+85-45-787-2214, Fax:+85-45-787-2370) FEATURES Location/Qualifiers source 1..1291 /organism="Homo sapiens" /db_xref="taxon:9606" gene 109..1014 /gene="hnRNP JKTBP" CDS 109..1014 /gene="hnRNP JKTBP" /function="DNA/RNA binding, transcriptional repression" /note="containing RNP motifs" /codon_start=1 /evidence=experimental /db_xref="PID:d1025273" /db_xref="PID:g2780748" /translation="MEDMNEYSNIEEFAEGSKINASKNQQDDGKMFIGGLSWDTSKKD LTEYLSRFGEVVDCTIKTDPVTGRSRGFGFVLFKDAASVDKVLELKEHKLDGKLIDPK RAKALKGKEPPKKVFVGGLSPDTSEEQIKEYFGAFGEIENIELPMDTKTNERRGFCFI TYTDEEPVKKLLESRYHQIGSGKCEIKVAQPKEVYRQQQQQQKGGRGAAAGGRGGTRG RGRGQGQNWNQGFNNYYDQGYGNYNSAYGGDQNYSGYGGYDYTGYNYGNYGYGQGYAD YSGQQSTYGKASRGGGNHQNNYQPY" polyA_site 1291 /note="14 A nucleotides" BASE COUNT 418 a 229 c 312 g 332 t ORIGIN 1 gatctcttcc gccgccattt taaatccagc tccatacaac gctccgccgc cgctgctgcc 61 gcgacccgga ctgcgcgcca gcacccccct gccgacagct ccgtcactat ggaggatatg 121 aacgagtaca gcaatataga ggaattcgca gagggatcca agatcaacgc gagcaagaat 181 cagcaggatg acggtaaaat gtttattgga ggcttgagct gggatacaag caaaaaagat 241 ctgacagagt acttgtctcg atttggggaa gttgtagact gcacaattaa aacagatcca 301 gtcactggga gatcaagagg atttggattt gtgcttttca aagatgctgc tagtgttgat 361 aaggttttgg aactgaaaga acacaaactg gatggcaaat tgatagatcc caaaagggcc 421 aaagctttaa aagggaaaga acctcccaaa aaggtttttg tgggtggatt gagcccggat 481 acttctgaag aacaaattaa agaatatttt ggagcctttg gagagattga aaatattgaa 541 cttcccatgg atacaaaaac aaatgaaaga agaggatttt gttttatcac atatactgat 601 gaagagccag taaaaaaatt gttagaaagc agataccatc aaattggttc tgggaagtgt 661 gaaatcaaag ttgcacaacc caaagaggta tataggcagc aacagcaaca acaaaaaggt 721 ggaagaggtg ctgcagctgg tggacgaggt ggtacgaggg gtcgtggccg aggtcagggc 781 caaaactgga accaaggatt taataactat tatgatcaag gatatggaaa ttacaatagt 841 gcctatggtg gtgatcaaaa ctatagtggc tatggcggat atgattatac tgggtataac 901 tatgggaact atggatatgg acagggatat gcagactaca gtggccaaca gagcacttat 961 ggcaaggcat ctcgaggggg tggcaatcac caaaacaatt accagccata ctaaaggaga 1021 acattggaga aaacaggagg agatgttaaa gtaacccatc ttgcaggacg acattgaaga 1081 ttggtcttct gttgatctaa gatgattatt ttgtaaaaga ctttctagtg tacaagacac 1141 cattgtgtcc aactgtatat agctgccaat tagttttctt tgtttttact ttgtcctttg 1201 ctatctgtgt tatgactcaa tgtggatttg tttatacaca ttttatttgt atcatttcat 1261 gttaaacctc aaataaatgc ttccttatgt g // LOCUS D89289 2002 bp mRNA PRI 23-APR-1997 DEFINITION Human mRNA for N-Acetyl-beta-D-glucosaminide, complete cds. ACCESSION D89289 NID g2055306 KEYWORDS N-Acetyl-beta-D-glucosaminide; GDP-L-Fuc; alpha 1-6 Fucosyltransferase; alpha1-6 FucT. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2002) AUTHORS Taniguchi,N. TITLE Direct Submission JOURNAL Submitted (18-NOV-1996) to the DDBJ/EMBL/GenBank databases. Naoyuki Taniguchi, Osaka University Medical School, Department of Biochemistry; Yamadaoka 2-2, Suita, Osaka 565, Japan (E-mail:proftani@biochem.med.osaka-u.ac.jp, Tel:81-6-879-3420, Fax:81-6-879-3429) REFERENCE 2 (bases 1 to 2002) AUTHORS Yanagidani,S. TITLE Purification and cDNA cloning of GDP-L-Fuc:N-Acetyl-beta-D-glucosaminide:alpha 1-6 Fucosyltransferase (alpha 1-6 FucT) from human stomach carcinoma MKN45 cells JOURNAL Unpublished (1997) REFERENCE 3 (sites) AUTHORS Yanagidani,S., Uozumi,N., Ihara,Y., Miyoshi,E., Yamaguchi,N. and Taniguchi,N. TITLE Purification and cDNA cloning of GDP-L-Fuc:N-acetyl-beta-D-glucosaminide:alpha1-6 fucosyltransferase (alpha1-6 FucT) from human gastric cancer MKN45 cells JOURNAL J. Biochem. 121 (3), 626-632 (1997) MEDLINE 97279058 FEATURES Location/Qualifiers source 1..2002 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 143..1870 /note="GDP-L-Fuc; alpha 1-6 Fucosyltransferase; alpha1-6 FucT" /codon_start=1 /product="N-Acetyl-beta-D-glucosaminide" /db_xref="PID:d1020545" /db_xref="PID:g2055307" /translation="MRPWTGSWRWIMLILFAWGTLLFYIGGHLVRDNDHPDHSSRELS KILAKLERLKQQNEDLRRMAESLRIPEGPIDQGPAIGRVRVLEEQLVKAKEQIENYKK QTRNGLGKDHEILRRRIENGAKELWFFLQSELKKLKNLEGNELQRHADEFLLDLGHHE RSIMTDLYYLSQTDGAGDWREKEAKDLTELVQRRITYLQNPKDCSKAKKLVCNINKGC GYGCQLHHVVYCFMIAYGTQRTLILESQNWRYATGGWETVFRPVSETCTDRSGISTGH WSGEVKDKNVQVVELPIVDSLHPRPPYLPLAVPEDLADRLVRVHGDPAVWWVSQFVKY LIRPQPWLEKEIEEATKKLGFKHPVIGVHVRRTDKVGTEAAFHPIEEYMVHVEEHFQL LARRMQVDKKRVYLATDDPSLLKEAKTKYPNYEFISDNSISWSAGLHNRYTENSLRGV ILDIHFLSQADFLVCTFSSQVCRVAYEIMQTLHPDASANFHSLDDIYYFGGQNAHNQI AIYAHQPRTADEIPMEPGDIIGVAGNHWDGYSKGVNRKLGRTGLYPSYKVREKIETVK YPTYPEAEK" BASE COUNT 620 a 415 c 467 g 500 t ORIGIN 1 ccagagagaa taatttgtct gaagcatcat gtgttgaaac aacagaagtc tattcacctg 61 tgcactaact agaaacagag ttacaatgtt ttcaattctt tgagctccag gactccaggg 121 gaagtgagtt gaaaatctga aaatgcggcc atggactggt tcctggcgtt ggattatgct 181 cattcttttt gcctggggga ccttgctgtt ttatataggt ggtcacttgg tacgagataa 241 tgaccatcct gatcactcta gccgagaact gtccaagatt ctggcaaagc ttgaacgctt 301 aaaacagcag aatgaagact tgaggcgaat ggccgaatct ctccggatac cagaaggccc 361 tattgatcag gggccagcta taggaagagt acgcgtttta gaagagcagc ttgttaaggc 421 caaagaacag attgaaaatt acaagaaaca gaccagaaat ggtctgggga aggatcatga 481 aatcctgagg aggaggattg aaaatggagc taaagagctc tggtttttcc tacagagtga 541 attgaagaaa ttaaagaact tagaaggaaa tgaactccaa agacatgcag atgaatttct 601 tttggattta ggacatcatg aaaggtctat aatgacggat ctatactacc tcagtcagac 661 agatggagca ggtgattggc gggaaaaaga ggccaaagat ctgacagaac tggttcagcg 721 gagaataaca tatcttcaga atcccaagga ctgcagcaaa gccaaaaagc tggtgtgtaa 781 tatcaacaaa ggctgtggct atggctgtca gctccatcat gtggtctact gcttcatgat 841 tgcatatggc acccagcgaa cactcatctt ggaatctcag aattggcgct atgctactgg 901 tggatgggag actgtattta ggcctgtaag tgagacatgc acagacagat ctggcatctc 961 cactggacac tggtcaggtg aagtgaagga caaaaatgtt caagtggtcg agcttcccat 1021 tgtagacagt cttcatcccc gtcctccata tttacccttg gctgtaccag aagacctcgc 1081 agatcgactt gtacgagtgc atggtgaccc tgcagtgtgg tgggtgtctc agtttgtcaa 1141 atacttgatc cgcccacagc cttggctaga aaaagaaata gaagaagcca ccaagaagct 1201 tggcttcaaa catccagtta ttggagtcca tgtcagacgc acagacaaag tgggaacaga 1261 agctgccttc catcccattg aagagtacat ggtgcatgtt gaagaacatt ttcagcttct 1321 tgcacgcaga atgcaagtgg acaaaaaaag agtgtatttg gccacagatg acccttcttt 1381 attaaaggag gcaaaaacaa agtaccccaa ttatgaattt attagtgata actctatttc 1441 ctggtcagct ggactgcaca atcgatacac agaaaattca cttcgtggag tgatcctgga 1501 tatacatttt ctctctcagg cagacttcct agtgtgtact ttttcatccc aggtctgtcg 1561 agttgcttat gaaattatgc aaacactaca tcctgatgcc tctgcaaact tccattcttt 1621 agatgacatc tactattttg ggggccagaa tgcccacaat caaattgcca tttatgctca 1681 ccaaccccga actgcagatg aaattcccat ggaacctgga gatatcattg gtgtggctgg 1741 aaatcattgg gatggctatt ctaaaggtgt caacaggaaa ttgggaagga cgggcctata 1801 tccctcctac aaagttcgag agaagataga aacggtcaag taccccacat atcctgaggc 1861 tgagaaataa agctcagatg gaagagataa acgaccaaac tcagttcgac caaactcagt 1921 tcaaaccatt tcagccaaac tgtagatgaa gagggctctg atctaacaaa ataaggttat 1981 atgagtagat actctcagca cc // LOCUS D89479 1031 bp mRNA PRI 30-JAN-1998 DEFINITION Homo sapiens mRNA for ST1B2, complete cds. ACCESSION D89479 NID g2826145 KEYWORDS ST1B2. SOURCE Homo sapiens liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Fujita,K., Nagata,K., Ozawa,S., Sasano,H. and Yamazoe,Y. TITLE Molecular cloning and characterization of rat ST1B1 and human ST1B2 cDNAs, encoding thyroid hormone sulfotransferases JOURNAL J. Biochem. 122 (5), 1052-1061 (1997) MEDLINE 98104061 REFERENCE 2 (bases 1 to 1031) AUTHORS Fujita,K. TITLE Direct Submission JOURNAL Submitted (20-NOV-1996) to the DDBJ/EMBL/GenBank databases. Ken-ichi Fujita, Tohoku University, Faculty of Pharmaceutical Sciences; Aramaki-Aoba, Aoba-Ku, Sendai, Miyagi 980-77, Japan (E-mail:Ken@phi2.pharm.tohoku.ac.jp, Tel:022-217-6830, Fax:022-217-6826) FEATURES Location/Qualifiers source 1..1031 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" CDS 25..915 /codon_start=1 /product="ST1B2" /db_xref="PID:d1025466" /db_xref="PID:g2826146" /translation="MLSPKDILRKDLKLVHGYPMTCAFASNWEKIEQFHSRPDDIVIA TYPKSGTTWVSEIIDMILNDGDIEKCKRGFITEKVPMLEMTLPGLRTSGIEQLEKNPS PRIVKTHLPTDLLPKSFWENNCKMIYLARNAKDVSVSYYHFDLMNNLQPFPGTWEEYL EKFLTGKVAYGSWFTHVKNWWKKKEGHPILFLYYEDMKENPKEEIKKIIRFLEKNLND EILDRIIHHTSFEVMKDNPLVNYTHLPTTVMDHSKSPFMRKGTAGDWKNYFTVAQNEK FDAIYETEMSKTALQFRTEI" BASE COUNT 353 a 172 c 209 g 297 t ORIGIN 1 atatttgtac aatctggtat taaaatgctt tccccaaaag atattctgcg aaaagatctg 61 aagttggtcc atggttatcc catgacctgt gcttttgcga gcaactggga aaaaattgaa 121 cagttccata gcagaccaga tgacattgtg atagccactt atcctaaatc aggtactact 181 tgggttagtg aaattataga catgattcta aatgatggag atattgaaaa atgtaagcga 241 ggttttatta ctgaaaaagt tccaatgttg gaaatgactc tccctggatt aagaacatca 301 ggtatagaac aattggagaa gaatccatca ccccggattg tgaaaacaca tctaccgact 361 gatcttcttc ctaaatcttt ctgggaaaac aattgcaaga tgatttatct ggctcgtaat 421 gccaaggatg tttcagtctc atattaccat tttgacttaa tgaataattt acagcctttt 481 cctggtacct gggaagaata tctggagaaa ttcttaactg gaaaagtggc ctatggttcc 541 tggtttactc atgttaaaaa ctggtggaag aaaaaggaag gacacccaat actttttttg 601 tactatgaag atatgaaaga gaatccaaag gaggaaatca agaagatcat tagatttcta 661 gagaagaacc tgaatgatga gatcttggat aggatcatcc atcacacctc atttgaagtg 721 atgaaggaca atcctttggt aaattataca catctaccaa ctacagtgat ggatcatagc 781 aaatccccct ttatgcgtaa agggacggct ggtgactgga agaattactt caccgtggcc 841 caaaatgaga aatttgatgc tatttatgag acagaaatgt ccaaaactgc acttcaattc 901 cgcacagaga tttaaagtgt ctaaatcaca aatctgagaa atagagattg tctgtagttg 961 attgaaacga gggcagttat gaattgattt gggcaatcaa atgaatttat aaaggagaat 1021 aatatgcctt t // LOCUS D89630 2028 bp mRNA PRI 13-JAN-1998 DEFINITION Homo sapiens mRNA for VEGF-D, complete cds. ACCESSION D89630 NID g2780339 KEYWORDS VEGF-D. SOURCE Homo sapiens lung cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Yamada,Y., Nezu,J., Shimane,M. and Hirata,Y. TITLE Molecular cloning of a novel vascular endothelial growth factor, VEGF-D JOURNAL Genomics 42 (3), 483-488 (1997) MEDLINE 97349118 REFERENCE 2 (bases 1 to 2028) AUTHORS Hirata,Y. TITLE Direct Submission JOURNAL Submitted (29-NOV-1996) to the DDBJ/EMBL/GenBank databases. Yuichi Hirata, Chugai Research Institute for Molecular Medicine, Gene search program; 153-2, Nagai, Niihari-Mura, Ibaraki 300-41, Japan (E-mail:hiratayu@tk.chugai-pharm.co.jp, Tel:81-298-30-6211, Fax:81-298-30-6270) COMMENT Sequence updated (12-Jan-1998). FEATURES Location/Qualifiers source 1..2028 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /tissue_type="lung" CDS 403..1467 /codon_start=1 /product="VEGF-D" /db_xref="PID:d1025175" /db_xref="PID:g2766190" /translation="MYREWVVVNVFMMLYVQLVQGSSNEHGPVKRSSQSTLERSEQQI RAASSLEELLRITHSEDWKLWRCRLRLKSFTSMDSRSASHRSTRFAATFYDIETLKVI DEEWQRTQCSPRETCVEVASELGKSTNTFFKPPCVNVFRCGGCCNEESLICMNTSTSY ISKQLFEISVPLTSVPELVPVKVANHTGCKCLPTAPRHPYSIIRRSIQIPEEDRCSHS KKLCPIDMLWDSNKCKCVLQEENPLAGTEDHSHLQEPALCGPHMMFDEDRCECVCKTP CPKDLIQHPKNCSCFECKESLETCCQKHKLFHPDTCSCEDRCPFHTRPCASGKTACAK HCRFPKEKRAAQGPHSRKNP" BASE COUNT 575 a 441 c 431 g 581 t ORIGIN 1 ccagctttct gtagctgtaa gcattggtgg ccacaccacc tccttacaaa gcaactagaa 61 cctgcggcat acattggaga gattttttta attttctgga catgaagtaa atttagagtg 121 ctttctaatt tcaggtagaa gacatgtcca ccttctgatt atttttggag aacattttga 181 tttttttcat ctctctctcc ccacccctaa gattgtgcaa aaaaagcgta ccttgcctaa 241 ttgaaataat ttcattggat tttgatcaga actgatcatt tggttttctg tgtgaagttt 301 tgaggtttca aactttcctt ctggagaatg ccttttgaaa caattttctc tagctgcctg 361 atgtcaactg cttagtaatc agtggatatt gaaatattca aaatgtacag agagtgggta 421 gtggtgaatg ttttcatgat gttgtacgtc cagctggtgc agggctccag taatgaacat 481 ggaccagtga agcgatcatc tcagtccaca ttggaacgat ctgaacagca gatcagggct 541 gcttctagtt tggaggaact acttcgaatt actcactctg aggactggaa gctgtggaga 601 tgcaggctga ggctcaaaag ttttaccagt atggactctc gctcagcatc ccatcggtcc 661 actaggtttg cggcaacttt ctatgacatt gaaacactaa aagttataga tgaagaatgg 721 caaagaactc agtgcagccc tagagaaacg tgcgtggagg tggccagtga gctggggaag 781 agtaccaaca cattcttcaa gcccccttgt gtgaacgtgt tccgatgtgg tggctgttgc 841 aatgaagaga gccttatctg tatgaacacc agcacctcgt acatttccaa acagctcttt 901 gagatatcag tgcctttgac atcagtacct gaattagtgc ctgttaaagt tgccaatcat 961 acaggttgta agtgcttgcc aacagccccc cgccatccat actcaattat cagaagatcc 1021 atccagatcc ctgaagaaga tcgctgttcc cattccaaga aactctgtcc tattgacatg 1081 ctatgggata gcaacaaatg taaatgtgtt ttgcaggagg aaaatccact tgctggaaca 1141 gaagaccact ctcatctcca ggaaccagct ctctgtgggc cacacatgat gtttgacgaa 1201 gatcgttgcg agtgtgtctg taaaacacca tgtcccaaag atctaatcca gcaccccaaa 1261 aactgcagtt gctttgagtg caaagaaagt ctggagacct gctgccagaa gcacaagcta 1321 tttcacccag acacctgcag ctgtgaggac agatgcccct ttcataccag accatgtgca 1381 agtggcaaaa cagcatgtgc aaagcattgc cgctttccaa aggagaaaag ggctgcccag 1441 gggccccaca gccgaaagaa tccttgattc agcgttccaa gttccccatc cctgtcattt 1501 ttaacagcat gctgctttgc caagttgctg tcactgtttt tttcccaggt gttaaaaaaa 1561 aaatccattt tacacagcac cacagtgaat ccagaccaac cttccattca caccagctaa 1621 ggagtccctg gttcattgat ggatgtcttc tagctgcaga tgcctctgcg caccaaggaa 1681 tggagaggag gggacccatg taatcctttt gtttagtttt gtttttgttt tttggtgaat 1741 gagaaaggtg tgctggtcat ggaatggcag gtgtcatatg actgattact cagagcagat 1801 gaggaaaact gtagtctctg agtcctttgc taatcgcaac tcttgtgaat tattctgatt 1861 cttttttatg cagaatttga ttcgtatgat cagtactgac tttctgatta ctgtccagct 1921 tatagtcttc cagtttaatg aactaccatc tgatgtttca tatttaagtg tatttaaaga 1981 aaataaacac cattattcaa gtctaaaaaa aaaaaaaaaa aaaaaaaa // LOCUS D89667 1035 bp mRNA PRI 13-DEC-1996 DEFINITION Human mRNA for c-myc binding protein, complete cds. ACCESSION D89667 NID g1731808 KEYWORDS c-myc binding protein; MM-1. SOURCE Homo sapiens brain cDNA to mRNA, clone_lib:Matchmaker human brain cDNA library (CLONTECH) clone:MM-1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Maeda,Y., Maeda,J., Iguchi-Ariga,S. and ARIGA,H. TITLE MM-1, a novel C-MYC binding protein JOURNAL Unpublished (1996) REFERENCE 2 (bases 1 to 1035) AUTHORS ARIGA,H. TITLE Direct Submission JOURNAL Submitted (04-DEC-1996) to the DDBJ/EMBL/GenBank databases. Hiroyoshi ARIGA, Faculty of Pharmaceutical Sciences, Hokkaido University, Molecular Biology; Kita 12, Nishi 6, Kita-ku, Sapporo, Hokkaio 060, Japan (E-mail:hiro@ph.hines.hokudai.ac.jp, hiro@pharm.hokudai.ac.jp, Tel:011-706-3745, Fax:011-706-4988) FEATURES Location/Qualifiers source 1..1035 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="MM-1" /clone_lib="Matchmaker human brain cDNA library (CLONTECH)" /tissue_type="brain" 5'UTR 1..423 gene 424..927 /gene="MM-1" CDS 424..927 /gene="MM-1" /codon_start=1 /product="c-myc binding protein" /db_xref="PID:d1014706" /db_xref="PID:g1731809" /translation="MGVDVMTVVGLPNMAQSINITELNLPQLEMLKNQLDQEVEFLST SIAQLKVVQTKYVEAKDCLNVLNKSNEGKELLVPLTSSMYVPGKLHDVEHVLIDVGTG YYVEKTAEDAKDFFKRKIDFLTKQMEKIQPALQEKHAMKQAVMEMMSQKIQQLTALGA AQATAKA" 3'UTR 928..1030 BASE COUNT 253 a 240 c 275 g 267 t ORIGIN 1 cactggaatt cgggcccctc ttccacttcc cttcacacta tctcttttgc ctaataaata 61 cggaaggctg tgtacaaggt caggtccctt gtccactaga ggcaaggtgc ttggcgtcag 121 gaagcaattg ccctcagcaa accttctggg gcaggcacag tcatgagttt gcccacattc 181 tgtattcatg ataaacagtt tgctgtttga tcgtatagac tcagtggaat gttggtcacg 241 tcccatgggc ctttggctct ctgtatatcc tcctttctgt ttatgtatta attgaaggag 301 tgtaaggcca gggtgggcag ctctcatttt cccattgatg gtccatccaa ctttacagac 361 tgtccctggt gctccagtag tttctcagcc tcctgtgtgg ttttcttgag ttgtccccag 421 gttatggggg ttgatgtcat gactgtagtc ggccttccca acatggcgca gtctattaac 481 atcacggagc tgaatctgcc gcagctagaa atgctcaaga accagctgga ccaggaagtg 541 gagttcttgt ccacgtccat tgctcagctc aaagtggtac agaccaagta tgtggaagcc 601 aaggactgtc tgaacgtgct gaacaagagc aacgagggga aagaattact cgtcccactg 661 acgagttcta tgtatgtccc tgggaagctg catgatgtgg aacacgtgct catcgatgtg 721 ggaactgggt actatgtaga gaagacagct gaggatgcca aggacttctt caagaggaag 781 atagattttc taaccaagca gatggagaaa atccaaccag ctcttcagga gaagcacgcc 841 atgaaacagg ccgtcatgga aatgatgagt cagaagattc agcagctcac agccctgggg 901 gcagctcagg ctactgctaa ggcctgagag tttttgcaga aatggggcag agggacaccc 961 tttgggcgtg gcttcctggt gatgggaagg gtcttgtgtt ttaatgccaa taaatgtgcc 1021 agctgggcaa aaccc // LOCUS D89675 1575 bp mRNA PRI 23-APR-1997 DEFINITION Human mRNA for bone morphogenetic protein type IB receptor, complete cds. ACCESSION D89675 NID g2055308 KEYWORDS bone morphogenetic protein type IB receptor; BMPR-IB. SOURCE Homo sapiens male prostate cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1575) AUTHORS Terada,M. TITLE Direct Submission JOURNAL Submitted (05-DEC-1996) to the DDBJ/EMBL/GenBank databases. Masaaki Terada, National Cancer Center Institute, Genetics Division; 5-1-1, Tsukiji, Chuo-ku, Tokyo 104, Japan (E-mail:mterada@ncc.go.jp, Tel:81-3-3542-2511, Fax:81-3-3541-2685) REFERENCE 2 (sites) AUTHORS Ide,H., Katoh,M., Sasaki,H., Yoshida,T., Aoki,K., Nawa,Y., Osada,Y., Sugimura,T. and Terada,M. TITLE Cloning of human bone morphogenetic protein type IB receptor (BMPR-IB) and its expression in prostate cancer in comparison with other BMPRs JOURNAL Oncogene (1996) In press REFERENCE 3 (sites) AUTHORS Ide,H., Katoh,M., Sasaki,H., Yoshida,T., Aoki,K., Nawa,Y., Osada,Y., Sugimura,T. and Terada,M. TITLE Cloning of human bone morphogenetic protein type IB receptor (BMPR-IB) and its expression in prostate cancer in comparison with other BMPRs JOURNAL Oncogene 14 (11), 1377-1382 (1997) MEDLINE 97322244 REMARK Erratum:[[published erratum appears in Oncogene 1997 Aug 28;15(9):1121]] COMMENT Sequence updated (21-Dec-1996) by:Masaaki Terada. FEATURES Location/Qualifiers source 1..1575 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /tissue_type="prostate" CDS 19..1527 /note="BMPR-IB" /codon_start=1 /product="bone morphogenetic protein type IB receptor" /db_xref="PID:d1020546" /db_xref="PID:g2055309" /translation="MLLRSAGKLNVGTKKEDGESTAPTPRPKVLRCKCHHHCPEDSVN NICSTDGYCFTMIEEDDSGLPVVTSGCLGLEGSDFQCRDTPIPHQRRSIECCTERNEC NKDLHPTLPPLKNRDFVDGPIHHRALLISVTVCSLLLVLIILFCYFRYKRQETRPRYS IGLEQDETYIPPGESLRDLIEQSQSSGSGSGLPLLVQRTIAKQIQMVKQIGKGRYGEV WMGKWRGEKVAVKVFFTTEEASWFRETEIYQTVLMRHENILGFIAADIKGTGSWTQLY LITDYHENGSLYDYLKSTTLDAKSMLKLAYSSVSGLCHLHTEIFSTQGKPAIAHRDLK SKNILVKKNGTCCIADLGLAVKFISDTNEVDIPPNTRVGTKRYMPPEVLDESLNRNHF QSYIMADMYSFGLILWEVARRCVSGGIVEEYQLPYHDLVPSDPSYEDMREIVCIKKLR PSFPNRWSSDECLRQMGKLMTECWAHNPASRLTALRVKKTLAKMSESQDIKL" BASE COUNT 474 a 335 c 371 g 395 t ORIGIN 1 gcaaacttcc ttgataacat gcttttgcga agtgcaggaa aattaaatgt gggcaccaag 61 aaagaggatg gtgagagtac agcccccacc ccccgtccaa aggtcttgcg ttgtaaatgc 121 caccaccatt gtccagaaga ctcagtcaac aatatttgca gcacagacgg atattgtttc 181 acgatgatag aagaggatga ctctgggttg cctgtggtca cttctggttg cctaggacta 241 gaaggctcag attttcagtg tcgggacact cccattcctc atcaaagaag atcaattgaa 301 tgctgcacag aaaggaacga atgtaataaa gacctacacc ctacactgcc tccattgaaa 361 aacagagatt ttgttgatgg acctatacac cacagggctt tacttatatc tgtgactgtc 421 tgtagtttgc tcttggtcct tatcatatta ttttgttact tccggtataa aagacaagaa 481 accagacctc gatacagcat tgggttagaa caggatgaaa cttacattcc tcctggagaa 541 tccctgagag acttaattga gcagtctcag agctcaggaa gtggatcagg cctccctctg 601 ctggtccaaa ggactatagc taagcagatt cagatggtga aacagattgg aaaaggtcgc 661 tatggggaag tttggatggg aaagtggcgt ggcgaaaagg tagctgtgaa agtgttcttc 721 accacagagg aagccagctg gttcagagag acagaaatat atcagacagt gttgatgagg 781 catgaaaaca ttttgggttt cattgctgca gatatcaaag ggacagggtc ctggacccag 841 ttgtacctaa tcacagacta tcatgaaaat ggttcccttt atgattatct gaagtccacc 901 accctagacg ctaaatcaat gctgaagtta gcctactctt ctgtcagtgg cttatgtcat 961 ttacacacag aaatctttag tactcaaggc aaaccagcaa ttgcccatcg agatctgaaa 1021 agtaaaaaca ttctggtgaa gaaaaatgga acttgctgta ttgctgacct gggcctggct 1081 gttaaattta ttagtgatac aaatgaagtt gacataccac ctaacactcg agttggcacc 1141 aaacgctata tgcctccaga agtgttggac gagagcttga acagaaatca cttccagtct 1201 tacatcatgg ctgacatgta tagttttggc ctcatccttt gggaggttgc taggagatgt 1261 gtatcaggag gtatagtgga agaataccag cttccttatc atgacctagt gcccagtgac 1321 ccctcttatg aggacatgag ggagattgtg tgcatcaaga agttacgccc ctcattccca 1381 aaccggtgga gcagtgatga gtgtctaagg cagatgggaa aactcatgac agaatgctgg 1441 gctcacaatc ctgcatcaag gctgacagcc ctgcgggtta agaaaacact tgccaaaatg 1501 tcagagtccc aggacattaa actctgatag gagaggaaaa gtaagcatct ctgcagaaag 1561 ccaacaggta ccctt // LOCUS D89729 4088 bp mRNA PRI 14-NOV-1997 DEFINITION Homo sapiens mRNA for CRM1 protein, complete cds. ACCESSION D89729 NID g2626839 KEYWORDS CRM1 protein. SOURCE Homo sapiens chronic myelogenous leukemia cell_line:K562 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Kudo,N., Khochbin,S., Nishi,K., Kitano,K., Yanagida,M., Yoshida,M. and Horinouchi,S. TITLE Molecular cloning and cell cycle-dependent expression of mammalian CRM1, a protein involved in nuclear export of proteins JOURNAL J. Biol. Chem. 272 (47), 29742-29751 (1997) MEDLINE 98037803 REFERENCE 2 (bases 1 to 4088) AUTHORS Kudo,N. TITLE Direct Submission JOURNAL Submitted (06-DEC-1996) to the DDBJ/EMBL/GenBank databases. Nobuaki Kudo, The University of Tokyo, Department of Biotechnology; Yayoi 1-1-1, Bunkyo-ku, Tokyo 113, Japan (E-mail:kudo@bio.m.u-tokyo.ac.jp, Tel:+81-3-3812-2111, Fax:+81-3-3812-0544) FEATURES Location/Qualifiers source 1..4088 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="K562" /cell_type="chronic myelogenous leukemia" CDS 19..3234 /codon_start=1 /product="CRM1 protein" /db_xref="PID:g2626840" /translation="MPAIMTMLADHAARQLLDFSQKLDINLLDNVVNCLYHGEGAQQR MAQEVLTHLKEHPDAWTRVDTILEFSQNMNTKYYGLQILENVIKTRWKILPRNQCEGI KKYVVGLIIKTSSDPTCVEKEKVYIGKLNMILVQILKQEWPKHWPTFISDIVGASRTS ESLCQNNMVILKLLSEEVFDFSSGQITQVKSKHLKDSMCNEFSQIFQLCQFVMENSQN APLVHATLETLLRFLNWIPLGYIFETKLISTLIYKFLNVPMFRNVSLKCLTEIAGVSV SQYEEQFVTLFTLTMMQLKQMLPLNTNIRLAYSNGKDDEQNFIQNLSLFLCTFLKEHD QLIEKRLNLRETLMEALHYMLLVSEVEETEIFKICLEYWNHLAAELYRESPFSTSASP LLSGSQHFDVPPRRQLYLPMLFKVRLLMVSRMAKPEEVLVVENDQGEVVREFMKDTDS INLYKNMRETLVYLTHLDYVDTERIMTEKLHNQVNGTEWSWKNLNTLCWAIGSISGAM HEEDEKRFLVTVIKDLLGLCEQKRGKDNKAIIASNIMYIVGQYPRFLRAHWKFLKTVV NKLFEFMHETHDGVQDMACDTFIKIAQKCRRHFVQVQVGEVMPFIDEILNNINTIICD LQPQQVHTFYEAVGYMIGAQTDQTVQEHLIEKYMLLPNQVWDSIIQQATKNVDILKDP ETVKQLGSILKTNVRACKAVGHPFVIQLGRIYLDMLNVYKCLSENISAAIQANGEMVT KQPLIRSMRTVKRETLKLISGWVSRSNDPQMVAENFVPPLLDAVLIDYQRNVPAAREP EVLSTMAIIVNKLGGHITAEIPQIFDAVFECTLNMINKDFEEYPEHRTNFFLLLQAVN SHCFPAFLAIPPTQFKLVLDSIIWAFKHTMRNVADTGLQILFTLLQNVAQEEAAAQSF YQTYFCDILQHIFSVVTDTSHTAGLTMHASILAYMFNLVEEGKISTSLNPGNPVNNQI FLQEYVANLLKSAFPHLQDAQVKLFVTGLFSLNQDIPAFKEHLRDFLVQIKEFAGEDT SDLFLEEREIALRQADEEKHKRQMSVPGIFNPHEIPEEMCD" BASE COUNT 1313 a 693 c 794 g 1288 t ORIGIN 1 ttcaatctct ggtaatctat gccagcaatt atgacaatgt tagcagacca tgcagctcgt 61 cagctgcttg atttcagcca aaaactggat atcaacttat tagataatgt ggtgaattgc 121 ttataccatg gagaaggagc ccagcaaaga atggctcaag aagtactgac acatttaaag 181 gagcatcctg atgcttggac aagagtcgac acaattttgg aattttctca gaatatgaat 241 acgaaatact atggactaca aattttggaa aatgtgataa aaacaaggtg gaagattctt 301 ccaaggaacc agtgcgaagg aataaaaaaa tacgttgttg gcctcattat caagacgtca 361 tctgacccaa cttgtgtaga gaaagaaaag gtgtatatcg gaaaattaaa tatgatcctt 421 gttcagatac tgaaacaaga atggcccaaa cattggccaa cttttatcag tgatattgtt 481 ggagcaagta ggaccagcga aagtctctgt caaaataata tggtgattct taaactcttg 541 agtgaagaag tatttgattt ctctagtgga cagataaccc aagtcaaatc taagcattta 601 aaagacagca tgtgcaatga attctcacag atatttcaac tgtgtcagtt tgtaatggaa 661 aattctcaaa atgctccact tgtacatgca accttggaaa cattgctcag atttctgaac 721 tggattcccc tgggatatat ttttgagacc aaattaatca gcacattgat ttataagttc 781 ctgaatgttc caatgtttcg aaatgtctct ctgaagtgcc tcactgagat tgctggtgtg 841 agtgtaagcc aatatgaaga acaatttgta acactattta ctctgacaat gatgcaacta 901 aagcagatgc ttcctttaaa taccaatatt cgacttgcgt actcaaatgg aaaagatgat 961 gaacagaact tcattcaaaa tctcagtttg tttctctgca cctttcttaa ggaacatgat 1021 caacttatag aaaaaagatt aaatctcagg gaaactctta tggaggccct tcattatatg 1081 ttgttggtat ctgaagtaga agaaactgaa atctttaaaa tttgtcttga atactggaat 1141 catttggctg ctgaactcta tagagagagt ccattctcta catctgcctc tccgttgctt 1201 tctggaagtc aacattttga tgttcctccc aggagacagc tatatttgcc catgttattc 1261 aaggtccgtt tattaatggt tagtcgaatg gctaaaccag aggaagtatt ggttgtagag 1321 aatgatcaag gagaagttgt gagagaattc atgaaggata cagattccat aaatttgtat 1381 aagaatatga gggaaacatt ggtttatctt actcatctgg attatgtaga tacagaaaga 1441 ataatgacag agaagcttca caatcaagtg aatggtacag agtggtcatg gaaaaatttg 1501 aatacattgt gttgggcaat aggctccatt agtggagcaa tgcatgaaga ggacgaaaaa 1561 cgatttcttg ttactgttat aaaggatcta ttaggattat gtgaacagaa aagaggcaaa 1621 gataataaag ctattattgc atcaaatatc atgtacatag taggtcaata cccacgtttt 1681 ttgagagctc actggaaatt tctgaagact gtagttaaca agctgttcga attcatgcat 1741 gagacccatg atggagtcca ggatatggct tgtgatactt tcattaaaat agcccaaaaa 1801 tgccgcaggc atttcgttca ggttcaggtt ggagaagtga tgccatttat tgatgaaatt 1861 ttgaacaaca ttaacactat tatttgtgat cttcagcctc aacaggttca tacgttttat 1921 gaagctgtgg ggtacatgat tggtgcacaa acagatcaaa cagtacaaga acacttgata 1981 gaaaagtaca tgttactccc taatcaagtg tgggatagta taatccagca ggcaaccaaa 2041 aatgtggata tactgaaaga tcctgaaaca gtcaagcagc ttggtagcat tttgaaaaca 2101 aatgtgagag cctgcaaagc tgttggacac ccctttgtaa ttcagcttgg aagaatttat 2161 ttagatatgc ttaatgtata caagtgcctc agtgaaaata tttctgcagc tatccaagct 2221 aatggtgaaa tggttacaaa gcaaccattg attagaagta tgcgaactgt aaaaagggaa 2281 actttaaagt taatatctgg ttgggtgagc cgatccaatg atccacagat ggtcgctgaa 2341 aattttgttc cccctctgtt ggatgcagtt ctcattgatt atcagagaaa tgtcccagct 2401 gctagagaac cagaagtgct tagtactatg gccataattg tcaacaagtt agggggacat 2461 ataacagctg aaatacctca aatatttgat gctgtttttg aatgcacatt gaatatgata 2521 aataaggact ttgaagaata tcctgaacat agaacgaact ttttcttact acttcaggct 2581 gtcaattctc attgtttccc agcattcctt gctattccac ctacacagtt taaacttgtt 2641 ttggattcca tcatttgggc tttcaaacat actatgagga atgtcgcaga tacgggctta 2701 cagatacttt ttacactctt acaaaatgtt gcacaagaag aagctgcagc tcagagtttt 2761 tatcaaactt atttttgtga tattctccag catatctttt ctgttgtgac agacacttca 2821 catactgctg gtttaacaat gcatgcatca attcttgcat atatgtttaa tttggttgaa 2881 gaaggaaaaa taagtacatc attaaatcct ggaaatccag ttaacaacca aatctttctt 2941 caggaatatg tggctaatct ccttaagtcg gccttccctc acctacaaga tgctcaagta 3001 aagctctttg tgacagggct tttcagctta aatcaagata ttcctgcttt caaggaacat 3061 ttaagagatt tcctagttca aataaaggaa tttgcaggtg aagacacttc tgatttgttt 3121 ttggaagaga gagaaatagc cctacggcag gctgatgaag agaaacataa acgtcaaatg 3181 tctgtccctg gcatctttaa tccacatgag attccagaag aaatgtgtga ttaaaatcca 3241 aattcatgct gttttttttc tctgcaactc gttagcagag gaaaacagca tgtgggtatt 3301 tgtcgaccaa aatgatgcca atttgtaaat taaaatgtca cctagtggcc ctttttctta 3361 tgtgtttttt tgtataagaa attttctgtg aaatatcctt ccattgttta agcttttgtt 3421 ttggtcatct ttatttagtt tgcatgaagt tgaaaattaa ggcattttta aaaattttac 3481 ttcatgccca tttttgtggc tgggctgggg ggaggaggca aattcgattt gaacatatac 3541 ttgtaattct aatgcaaaat tatacaattt ttcctgtaaa caataccaat ttttaattag 3601 ggagcatttt ccttctagtc tatttcagcc tagaagaaaa gataatgagt aaaacaaatt 3661 gcgttgttta aaggattata gtgctgcatt gtctgaagtt agcacctctt ggactgaatc 3721 gtttgtctag actacatgta ttacaaagtc tctttggcaa gattgcagca agatcatgtg 3781 catatcatcc cattgtaaag cgacttcaaa aatatgggaa cacagttagt tatttttaca 3841 cagttctttt tgtttttgtg tgtgtgtgct gtcgcttgtc gacaacagct ttttgttttc 3901 ctcaatgagg agtgttgctc atttgtgagc cttcattaac tcgaagtgaa atggttaaaa 3961 atatttatcc tgttagaata ggctgcatct ttttaacaac tcattaaaaa acaaaacaac 4021 tctggctttt gagatgactt atactaattt acattgttta ccaagctgta gtgctttaag 4081 aacactac // LOCUS D89858 1200 bp mRNA PRI 18-DEC-1996 DEFINITION Human mRNA for D-aspartate oxidase, complete cds. ACCESSION D89858 NID g1742023 KEYWORDS D-aspartate oxidase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1200) AUTHORS Setoyama,C. and Miura,R. TITLE Structural and functional characterization of the human brain D-asparta te oxidase JOURNAL Unpublished (1996) REFERENCE 2 (bases 1 to 1200) AUTHORS Miura,R. TITLE Direct Submission JOURNAL Submitted (12-DEC-1996) to the DDBJ/EMBL/GenBank databases. Retsu Miura, Kumamoto University School of Medicine, Department of Biochemistry; 2-2-1 Honjo, Kumamoto, Kumamoto 860, Japan (E-mail:miura@gpo.kumamoto-u.ac.jp, Tel:096-373-5062) FEATURES Location/Qualifiers source 1..1200 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1..1026 /codon_start=1 /product="D-aspartate oxidase" /db_xref="PID:d1014731" /db_xref="PID:g1742024" /translation="MDTARIAVVGAGVVGLSTAVCISKLVPRCSVTIISDKFTPDTTS DVAAGMLIPHTYPDTPIHTQKQWFRETFNHLFAIANSAEAGDAGVHLVSGWQIFQSTP TEEVPFWADVVLGFRKMTEAELKKFPQYVFGQAFTTLKCECPAYLPWLEKRIKGSGGW TLTRRIEDLWELHPSFDIVVNCSGLGSRQLAGDSKIFPVRGQVLQVQAPWVEHFIRDG SGLTYIYPGTSHVTLGGTRQKGDWNLSPDAENSREILSRCCALEPSLHGACNIREKVG LRPYRPGVRLQTELLARDGQRLPVVHHYGHGSGGISVHWGTALEAARLVSECVHALRT PIPKSNL" BASE COUNT 300 a 290 c 324 g 286 t ORIGIN 1 atggacacag cacggattgc agttgtcggg gcaggtgtgg tggggctctc cacggctgtg 61 tgcatctcca aactggtgcc ccgatgctcc gttaccatca tttcagacaa gtttactcca 121 gataccacca gtgatgtggc agccggaatg cttattcctc acacttatcc agatacaccc 181 attcacacgc agaagcagtg gttcagagaa acctttaatc acctctttgc aattgccaat 241 tctgcagaag ctggagatgc tggtgttcat ttggtatcag gttggcagat atttcagagc 301 actccgactg aagaagtgcc attctgggct gacgtggttc tgggatttcg aaagatgact 361 gaggctgagc tgaagaaatt cccccagtat gtgtttggtc aggcttttac aaccctgaaa 421 tgtgaatgcc ctgcctacct cccgtggttg gagaaaagga taaagggaag tggaggctgg 481 acactcactc ggcgaataga agacctgtgg gaacttcatc cgtcctttga catcgtggtc 541 aactgttcag gccttggaag cagacagctt gcaggagact caaagatttt ccctgtaagg 601 ggccaagtcc tccaagttca ggctccctgg gtggagcatt ttatccgaga tggcagtggg 661 ctgacatata tttatcctgg tacatcccat gtaaccctag gtggaactag gcaaaaaggg 721 gactggaatc tgtccccgga tgcagaaaat agcagagaga ttctttcccg atgctgtgct 781 ctggagccct ccctccacgg agcctgcaac atcagggaga aggtgggctt gaggccctac 841 aggccaggcg tgcgactgca gacagagctc cttgcgcgag atggacagag gctgcctgta 901 gtccaccact atggccatgg gagtgggggc atctcagtgc actggggcac tgctctggag 961 gccgccaggc tggtgagcga gtgtgtccat gccctcagga cccccattcc caagtcaaac 1021 ctgtagatga cataaaatga cagcaaagag actgagagac tgttgatcaa agcacagaac 1081 aggttcaaat aacttttcca ctgcatgaaa gtttaattag acatttcttt gttttcaaca 1141 ttagaagtgg tgtaacatgt aagctgagca cggtagcatg cctatagtcc cagctacttg // LOCUS D89859 2896 bp mRNA PRI 22-MAY-1997 DEFINITION Human mRNA for zinc finger 5 protein, complete cds. ACCESSION D89859 NID g2117021 KEYWORDS zinc finger 5 protein. SOURCE Homo sapiens cell_line:HeLa cDNA to mRNA, clone_lib:lZap clone:hZF5. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2896) AUTHORS Sugiura,K., Muro,Y., Nagai,Y., Kamimoto,T., Wakabayashi,T., Ohashi,M. and Hagiwara,M. TITLE Direct Submission JOURNAL Submitted (12-DEC-1996) to the DDBJ/EMBL/GenBank databases. Kazumitsu Sugiura, Nagoya University School of Medicine, Department of Anatomy and Department of Dermatology; 65 Tsurumai-cho, Showa-ku, Nagoya, Nagoya, Aichi 466, Japan (E-mail:g950034d@eds.ecip.nagoya-u.ac.jp, Tel:+81-52-744-2031, Fax:+81-52-744-2041) REFERENCE 2 (sites) AUTHORS Sugiura,K., Muro,Y., Nagai,Y., Kamimoto,T., Wakabayashi,T., Ohashi,M. and Hagiwara,M. TITLE Expression cloning and intracellular localization of a human ZF5 homologue JOURNAL Biochim. Biophys. Acta 1352 (1), 23-26 (1997) MEDLINE 97320628 FEATURES Location/Qualifiers source 1..2896 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /clone="hZF5" /clone_lib="lZap" gene 70..1419 /gene="hZF5" CDS 70..1419 /gene="hZF5" /codon_start=1 /product="zinc finger 5 protein" /db_xref="PID:d1020947" /db_xref="PID:g2117022" /translation="MEFFISMSETIKYNDDDHKTLFLKTLNEQRLEGEFCDIAIVVED VKFRAHRCVLAACSTYFKKLFKKLEVDSSSVIEIDFLRSDIFEEVLNYMYTAKISVKK EDVNLMMSSGQILGIRFLDKLCSQKRDVSSPDENNGQSKSKYCLKINRPIGDAADTQD DDVEEIGDQDDSPSDDTVEGTPPSQEDGKSPTTTLRVQEAILKELGSEEVRKVNCYGQ EVESMETPESKDLGSQTPQALTFNDGMSEVKDEQTPGWTTAASDMKFEYLLYGHHREQ IACQACGKTFSDEGRLRKHEKLHTADRPFVCEMCTKGFTTQAHLKEHLKIHTGYKPYS CEVCGKSFIRAPDLKKHERVHSNERPFACHMCDKAFKHKSHLKDHERRHRGEKPFVCG SCTKAFAKASDLKRHENNMHSERKQVTPSAIQSETEQLQAAAMAAEAEQQLETIACS" BASE COUNT 894 a 521 c 639 g 842 t ORIGIN 1 ctgatcagag ttattctgat ctgaagattt tgggcgttca aggcattaag ataatagcct 61 gagttgttca tggagttttt catcagtatg tctgaaacca ttaaatataa tgacgatgat 121 cataaaactc tgtttctgaa aacactaaat gaacaacgcc tggaaggaga attttgtgat 181 attgctattg tggttgagga tgtgaaattc agagcacaca gatgtgttct tgctgcctgc 241 agcacctact ttaaaaagct tttcaagaag cttgaggttg atagttcttc ggtcatagaa 301 atagattttc ttcgttctga tatatttgaa gaggtcctga actacatgta cacagcaaag 361 atttccgtga aaaaagaaga tgttaactta atgatgtcat cgggtcagat tcttggtatc 421 cgatttttgg ataaactgtg ttctcagaag cgtgatgtgt ccagtcccga tgaaaacaat 481 ggtcagtcca aaagtaagta ttgccttaaa ataaatcgcc ccattggaga tgctgctgac 541 acccaggatg atgatgtaga ggaaatcggg gatcaggatg acagtccttc tgatgacaca 601 gtagaaggca cacccccgag tcaggaggac ggcaagtcgc ccaccacaac gctcagggtt 661 caggaagcga tcctgaaaga gctggggagt gaggaagttc ggaaggtcaa ttgctacggc 721 caggaagtag aatccatgga gaccccagaa tcaaaagact tggggtccca gacccctcaa 781 gccttaacat ttaatgatgg gatgagtgaa gtgaaagatg aacagacacc aggctggaca 841 acagccgcca gtgacatgaa gtttgagtat ttgctttatg gtcaccatcg ggagcagatt 901 gcctgccagg cgtgtgggaa gacgttttct gatgaaggca gattgaggaa gcatgagaaa 961 ctccacacgg cggacaggcc atttgtttgt gaaatgtgca caaaaggttt caccacacag 1021 gcccacctga aagaacacct aaaaatccac acaggatata agccctatag ctgtgaggtg 1081 tgtggaaaat catttatccg tgccccagac ttaaagaagc atgagagagt tcacagtaat 1141 gaaagaccgt ttgcgtgcca catgtgtgac aaagccttca aacacaagtc tcacctcaag 1201 gatcatgaaa gaagacacag aggggaaaag ccttttgtgt gtggctcctg caccaaggca 1261 tttgccaagg catctgatct gaaaaggcac gagaacaata tgcacagtga aaggaagcag 1321 gttaccccca gtgccatcca gagcgagaca gaacagttgc aggcggcagc gatggctgcg 1381 gaagcagaac agcagctgga gacgatagcc tgtagctaga ggcggtggga cagggacact 1441 ttgcctggaa agtggagact gagatgacgt ggatcataat gagtgaatgc cagttacaat 1501 atttttgtgg aaacgtatga acattgtact cactggactt aaggcagtgc ttggttagct 1561 atttttaaga cttttcaagg aaatggtgtt cctcagttct gaccaaaccg tttcactgtc 1621 ttgtctggtg tctagtatta atgttgccag taagcacctc tctccctttt ttttttttta 1681 ttattttaat ttgagaactc ctgtgtccag tttagaagtg agagacttcc atttttagtt 1741 cctttacact caccacccta gcaagtgccc tgcacagagt aataagtaaa ttgatttcct 1801 aatcacaatt ctatgtgact tatggtcaaa agagcagttt taataacttt aaaagtactt 1861 cagatagacg cagaaaattg gtgagtggtt gaccaagaac actgcacaaa tataaaaaaa 1921 gttctggaaa tgcagaaggg cgttagattt atatttggtt tgttaatttt atatcactgt 1981 ttttcactgt ttttgtggac aaataatggt tgctttgctg aagtgttctt cctcaatctt 2041 gattgccctg tacctaccca aaagctgtag tcacacgtcc taaaggccaa gcaaacccac 2101 cgggatggtg gggggtcttg gagccaagct cttaggttcc tcttatttgg ggcagtacca 2161 gtccatacca gctgcgattt gtgagtggac ctgtggtaag aagaatagaa aaggctctca 2221 gagataaggt tttttacatg tgtaacaatc ccaagatttc ctagattaaa atcttaattg 2281 attttgaaat tggattttta tttagaatca aaattaggac aagaacagat aacttcttca 2341 gatacatttg tgtaacttta cagaatgtca tcaagctttg gggctctgtg gggcacatga 2401 tttatccata aaggagatgc agtatgctta cttaaattaa taaatttaaa atcttttaag 2461 tgtgtaaata gtagtgttgg tcttacgtat tccaagtaaa aagtagacag ctgcactttt 2521 tttgcacatt ggattaaaat aacttccatc agcaacaaac atcagactgt ttttaacaaa 2581 tattaaagat tgtcagacca aatgtttatg ttttcgaaat atatttcatc actggttaca 2641 gttttaaata gaagttgatt gccttttcat agccgtaaat gagaattata aactctattc 2701 cagttttggt atactaaatg ttcttttaac catctttagg aatatattga aatgccaaca 2761 atagtttgaa ttgtgttctg taaaaaagta ttagtcaatt atttttcaaa atgtagaatt 2821 gtagaaaatg tcaatttttc aaactcattt ttcattgcta ggatttcttt taaaaaaatt 2881 aaagtaattt cacttc // LOCUS EN1838 1216 bp mRNA PRI 03-FEB-1998 DEFINITION Homo sapiens mRNA for maleylacetoacetate isomerase. ACCESSION AJ001838 NID g2832730 KEYWORDS MAAI gene; maleylacetoacetate isomerase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1216) AUTHORS Fernandez-Canon,J.M. and Penalva,M.A. TITLE Characterization of a fungal maleylacetoacetate isomerase gene and identification of its human homologue JOURNAL J. Biol. Chem. 273 (1), 329-337 (1998) MEDLINE 98079064 REFERENCE 2 (bases 1 to 1216) AUTHORS Penalva,M.A. TITLE Direct Submission JOURNAL Submitted (08-OCT-1997) Penalva M.A., Molecular Microbiology, Centro de Investigaciones Biologicas CSIC, Velazquez 144, Madrid, 28006, SPAIN FEATURES Location/Qualifiers source 1..1216 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="melanocyte" /clone="EST 265310(5 )" gene 104..754 /gene="MAAI" CDS 104..754 /gene="MAAI" /EC_number="5.2.1.2" /codon_start=1 /product="maleylacetoacetate isomerase" /db_xref="PID:e1249844" /db_xref="PID:g2832731" /translation="MQAGKPILYSYFRSSCSWRVRIALALKGIDYKTVPINLIKDGGQ QFSKDFQALNPMKQVPTLKIDGITIHQSLAIIEYLEETRPTPRLLPQDPKKRASVRMI SDLIAGGIQPLQNLSVLKQVGEEMQLTWAQNAITCGFNALEQILQSTAGIYCVGDEVT MADLCLVPQVANAERFKVDLTPYPTISSINKRLLVLEAFQVSHPCRQPDTPTELRA" BASE COUNT 323 a 309 c 336 g 248 t ORIGIN 1 aagacacggg cctgattcgt cgagtctcac tgagccttag tcgtcggcag gtcccaggcg 61 cgaagtttct cggcctggag gagggggtcg cgcgaagtgc cagatgcagg cggggaagcc 121 catcctctat tcctatttcc gaagctcctg ctcatggaga gttcgaattg ctctggcctt 181 gaaaggcatc gactacaaga cggtgcccat caatctcata aaggatgggg gccaacagtt 241 ttctaaggac ttccaggcac tgaatcctat gaagcaggtg ccaaccctga agattgatgg 301 aatcaccatt caccagtcac tggccatcat tgagtatcta gaggagacgc gtcccactcc 361 gcgacttctg cctcaggacc caaagaagag ggccagcgtg cgtatgattt ctgacctcat 421 cgctggtggc atccagcccc tgcagaacct gtctgtcctg aagcaagtgg gagaggagat 481 gcagctgacc tgggcccaga acgccatcac ttgtggcttt aacgccctgg agcagatcct 541 acagagcaca gcgggcatat actgtgtagg agacgaggtg accatggctg atctgtgctt 601 ggtgcctcag gtggcaaatg ctgaaagatt caaggtggat ctcaccccct accctaccat 661 cagctccatc aacaagaggc tgctggtctt ggaggccttc caggtgtctc acccctgccg 721 gcagccagat acacccactg agctgagggc ctagctccca aatcctgccc cgttggcaca 781 gggccacagg agcagaagct gggtgggctg aagaggcctg gaaacgagag tcttaattga 841 ggagatggga gactcgaact ctagccctgg atctgccttc ctgctgaaac ttgttccacc 901 tcagtcccct catctgtcac acgcatgtgg ggtggagtag ggagatgcgg ggagcagggt 961 gggcaggaat actgttatct atgtgacggg gcagtcgtga ggctgagatg agaatgcgga 1021 ttaaaatgcc tggcgtgctc accgtaacac cacggggaag gctgtgtgcc ttttctcatc 1081 cgcttttgtt gtgtgtgact ccaaagaatg cccgcgctga aatttggcgt gaattaaact 1141 gaagcccagg cctctaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1201 aaaaaaaaaa aaaaaa // LOCUS HAAXTRSYV 6972 bp RNA PRI 28-MAY-1996 DEFINITION H.sapiens mRNA for axonal transporter of synaptic vesicles. ACCESSION X90840 NID g1212916 KEYWORDS ATSV gene; axonal transporter of vesicles. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6972) AUTHORS Furlong,R.A., Zhou,C.Y., Ferguson-Smith,M.A. and Affara,N.A. TITLE Characterization of a kinesin-related gene ATSV, within the tuberous sclerosis locus (TSC1) candidate region on chromosome 9Q34 JOURNAL Genomics 33 (3), 421-429 (1996) MEDLINE 96299637 REFERENCE 2 (bases 1 to 6972) AUTHORS Furlong,R.A. TITLE Direct Submission JOURNAL Submitted (16-AUG-1995) R.A. Furlong, University of Cambridge, Dept of Pathology, Tennis Court Road, Cambridge CB2 1QP, UK COMMENT Overlaps with M78444, M78705, T07754, T15633 and T77291. FEATURES Location/Qualifiers source 1..6972 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /dev_stage="foetal" /chromosome="9" /map="q34.1-34.2" gene 64..5136 /gene="ATSV" CDS 64..5136 /gene="ATSV" /codon_start=1 /product="axonal transporter of synaptic vesicles" /db_xref="PID:e197137" /db_xref="PID:g1212917" /translation="MAGASVKVAVRVRPFNSREMSRDSKCIIQMSGSTTTIVNPKQPK ETPKSFSFDYSYWSHTSPEDINYASQKQVYRDIGEEMLQHAFEGYNVCIFAYGQTGAG KSYTMMGKQEKDQQGIIPQLCEDLFSRINDTTNDNMSYSVEVSYMEIYCERVRDLLNP KNKGNLRVREHPLLGPYVEDLSKLAVTSYNDIQDLMDSGNKARTVAATNMNETSSRSH AVFNIIFTQKRHDAETNITTEKVSKISLVDLAGSERADSTGAKGTRLKEGANINKSLT TLGKVISALAEMDSGPNKNKKKKKTDFIPYRDSVLTWLLRENLGGNSRTAMVAALSPA DINYDETLSTLRYADRAKQIRCNAVINEDPNNKLIRELKDEVTRLRDLLYAQGLGDIT DMTNALVGMSPSSSLSALSSRAASVSSLHERILFAPGSEEAIERLKETEKIIAELNET WEEKLRRTEAIRMEREALLAEMGVAMREDGGTLGVFSPKKTPHLVNLNEDPLMSECLL YYIKDGITRVGREDGERRQDIVLSGHFIKEEHCVFRSDSRGGSEAVVTLEPCEGADTY VNGKKVTEPSILRSGNRIIMGKSHVFRFTHPEQARQERERTPCAETPAEPVDWAFAQR ELLEKQGIDMKQEMEQRLQELEDQYRREREEATYLLEQQRLDYESKLEALQKQMDSRY YPEVNEEEEEPEDEVQWTERECELALWAFRKWKWYQFTSLRDLLWGNAIFLKEANAIS VELKKKVQFQFVLLTDTLYSPLPPDLLPPEAAKDREKRPFPRTIVAVEVQDQKNGATH YWTLEKLRQRLDLMREMYDRAAEVPSSVIEDCDNVVTGGDPFYDRFPWFRLVGRAFVY LSNLLYPVPLVHRVAIVSEKGEVKGFLRVAVQAISADEEAPDYGSGVRQSGTAKISFD DQHFEKFQSESCPVVGMSRSGTSQEELRIVEGQGQGADVGPSADEVNNNTCSAVPPEG LLLDSSEKAALDGPLDAALDHLRLGNTFTFRVTVLQASSISAEYADIFCQFNFIHRHD EAFSTEPLKNTGRGPPLGFYHVQNIAVEVTKSFIEYIKSQPIVFEVFGHYQQHPFPPL CKDVLSPLRPSRRHFPRVMPLSKPVPATKLSTLTRPCPGPCHCKYDLLVYFEICELEA NGDYIPAVVDHRGGMPCMGTFLLHQGIQRRITVTLLHETGSHIRWKEVRELVVGRIRN TPETDESLIDPNILSLNILSAGYIHPAHDDRTFYQFEAAWNSSMHNSLLLNRITPYRE KIYMTLSAYIEMENCTQPAVVTKDFCMVFYSRDAKLPASRSIRNLFGSGSLRASESNR VTGVYELSLCHVADAGSPGMQRRRRRVLDTSVAYVRGEENLAGWRPRSDSLILDHQWE LEKLSLLQEVEKTRHYLLLREKLETAQRPVPEALSPAFSEDSESHGSSSASSPLSAEG RPSPLEAPNERQRELAVKCLRLLTHTFNREYTHSHVCVSASESKLSEMSVTLLRDPSM SPLGVATLTPSSTCPSLVEGRYGATDLRTPQPCSRPASPEPELLPEADSKKLPSPARA TETDKEPQRLLVPDIQEIRVSPIVSKKGYLHFLEPHTSGWARRFVVVRRPYAYMYNSD KDTVERFVLNLATAQVEYSEDQQAMLKTPNTFAVCTEHRGILLQAASDKDMHDWLYAF NPLLAGTIRSKLSRRRSAQMRV" BASE COUNT 1495 a 2135 c 2048 g 1294 t ORIGIN 1 gaggtgttcc ccccacactg gggctcccac tactgcgagg agtgacccac gaaggccaca 61 gagatggccg gggcttcggt gaaggtggcg gtgcgggtcc gccccttcaa ttcccgggaa 121 atgagccgtg actccaagtg catcattcag atgtctggaa gcaccaccac cattgttaac 181 cccaaacagc ccaaggagac gcccaaaagc ttcagctttg actactccta ctggtcgcac 241 acctcacctg aggacatcaa ctacgcgtcg cagaagcagg tgtaccggga catcggcgag 301 gagatgctgc agcatgcctt tgagggatac aacgtgtgca tcttcgccta tgggcagacg 361 ggtgccggca agtcctacac catgatgggc aagcaggaga aggaccagca gggcatcatc 421 ccacagctct gcgaggacct cttctctcgg atcaacgaca cgaccaacga caacatgtcc 481 tactccgtgg aggtcagcta catggagatt tactgtgagc gcgtccgtga cctcctgaac 541 cccaagaaca agggcaacct tcgcgtgagg gagcacccac tgctggggcc ctacgtggag 601 gacctctcca agctggctgt cacctcctac aatgacatcc aggacctcat ggactcaggg 661 aacaaggcca ggaccgtggc ggccaccaac atgaatgaga ccagcagtcg ctcccacgcc 721 gtcttcaaca tcatcttcac ccagaagcgc catgacgcag agaccaatat caccacggag 781 aaggtgagca aaatcagcct ggtggacctg gctgggagcg agcgggctga ctccacggga 841 gccaagggca cgcgcctcaa ggagggggcc aacatcaaca agtcgctgac caccctgggc 901 aaggtcatct ccgccctggc tgaaatggac tccggaccca acaagaacaa gaaaaagaag 961 aagacagatt tcattccgta ccgagattcc gtgttgacct ggctcctccg ggaaaacctg 1021 ggcggtaact caaggacagc tatggtggca gccctgagtc ctgcagacat caactacgat 1081 gagaccctta gcacgctgag gtatgctgac cgggccaagc agatccgctg caatgctgtc 1141 atcaatgagg accccaacaa caagctgatc cgcgagctga aggatgaggt gacccggctg 1201 cgggaccttc tgtacgccca gggtcttggc gacatcactg acatgaccaa tgccctggtg 1261 ggtatgagcc cctcatcctc gctctcagcc ctgtccagcc gcgcggcctc cgtgtccagc 1321 ctccacgagc gcatcttgtt tgccccgggc agcgaggagg ccattgaaag actgaaggaa 1381 acagagaaga tcatagctga gctcaatgag acctgggagg agaagctgcg gcggacagaa 1441 gccatccgga tggagaggga agccctgctg gccgagatgg gtgtggccat gagggaggat 1501 ggcggcacct tgggcgtatt ctctcccaaa aagacaccac acctcgtcaa cctgaacgag 1561 gacccgctga tgtctgagtg cctgctctac tacatcaagg atgggatcac cagagtgggc 1621 agggaggatg gcgagaggcg gcaggacatt gttctgagtg ggcacttcat caaggaggag 1681 cactgcgtct tccggagcga ctccagggga ggcagcgaag ctgtggtgac cttggagccc 1741 tgtgaggggg cagacaccta cgtcaatggc aagaaagtca cagagcccag catcctgcgt 1801 tcaggaaacc gcatcatcat gggtaagagc catgtgttcc ggttcaccca ccccgagcag 1861 gcccggcagg agcgtgagcg cacgccttgt gcggagacgc cagctgagcc tgtggactgg 1921 gccttcgccc agcgtgagct gctggagaag cagggcatcg acatgaagca ggagatggag 1981 cagaggctcc aggaactgga ggaccagtac cgccgcgagc gggaggaggc cacctacctg 2041 ctggagcagc agcggctgga ctatgagagc aagctggagg ctctgcagaa gcagatggac 2101 tccaggtact acccggaggt gaacgaggag gaggaggagc ccgaggatga agtccagtgg 2161 acagagcggg agtgtgagct ggcgctctgg gccttccgga agtggaagtg gtaccagttc 2221 acgtctctgc gggacctgct gtggggcaac gccatcttcc tcaaggaggc caatgccatc 2281 agcgtggagc tgaaaaagaa ggtacaattc cagtttgtcc tcctgacgga cacactctac 2341 tcccctctgc cacccgacct gctgccccca gaggccgcca aagaccgaga gaagcggccc 2401 ttcccccgca ccattgtggc cgtggaggtc caggaccaga agaacggggc cacccactac 2461 tggacgctgg agaagctcag gcagcgtctg gacctgatgc gggagatgta cgaccgcgct 2521 gcagaggtgc cctccagtgt catcgaggac tgtgacaacg tggtgaccgg cggagacccc 2581 ttctatgacc gcttcccctg gttccggctg gtgggcaggg ccttcgtgta cctgagcaac 2641 ctgctgtacc ccgttcccct ggtacaccgt gtggcaatcg tcagcgagaa gggcgaggtg 2701 aagggcttcc tccgcgtggc cgtccaggcc atctcagccg atgaagaggc ccctgattat 2761 ggctctggcg tccgccagtc gggaactgct aaaatctcct ttgatgacca gcattttgaa 2821 aagttccagt ccgagtcttg ccccgtggtg gggatgtccc gctcgggaac ctcccaggaa 2881 gagcttcgca tcgtggaggg ccagggccag ggtgcagacg tggggccctc agccgatgaa 2941 gtcaacaaca acacctgttc agcagtgccc ccagaaggcc tcctcctaga cagctctgag 3001 aaagccgccc tggatgggcc cctggatgct gccctggacc acctccgcct gggcaacacc 3061 ttcaccttcc gtgtgacagt cctgcaggcg tccagcatct ctgccgaata tgccgacatc 3121 ttctgccagt tcaacttcat ccaccgccac gacgaggcct tctccacaga gcccctgaag 3181 aacacaggca gaggcccccc acttggcttc taccacgtcc agaacatcgc agtggaggtg 3241 accaagtcct tcattgagta catcaagagc cagcccattg ttttcgaggt ctttggccac 3301 taccagcagc acccgttccc gcccctctgc aaggacgtgc tcagccccct gaggccctcg 3361 cgccgccact tccctcgggt catgccactg tccaagccag tgcccgccac caagctcagc 3421 acactgacgc ggccctgtcc gggaccctgc cactgcaagt acgacctgct ggtctacttc 3481 gagatctgtg agctggaggc caacggcgat tacatcccgg ccgtggtgga ccaccgtggg 3541 ggcatgccat gcatggggac cttcctcctc caccagggca tccagcgacg gattacggtg 3601 acactactgc atgagacagg cagccatatc cgctggaagg aagtgcgcga gctggtcgtg 3661 ggccgcatcc gaaacactcc agagaccgac gagtccctga tcgaccccaa catcttgtct 3721 ctcaacatcc tctctgccgg atacatccac ccagcccatg atgaccggac cttttaccaa 3781 tttgaggctg cgtggaacag ctccatgcac aactctctcc tgctgaaccg gatcacccct 3841 tatcgagaga aaatctacat gacactctcc gcttatatcg agatggagaa ctgcacccag 3901 ccggctgttg tcaccaagga cttctgcatg gtcttctatt cccgtgatgc caagctgcca 3961 gcctcgcgct ccatccgcaa cctctttggc agtgggagcc ttcgggcctc agagagtaac 4021 cgtgtgactg gtgtgtacga gctcagcctg tgccacgtgg ctgacgcggg cagcccaggg 4081 atgcagcgcc ggcgccgacg agtcctggac acatctgtgg cctatgtccg gggcgaggag 4141 aacctggcag gctggaggcc ccggagtgac agtctcattc tggaccacca gtgggagctg 4201 gagaagctga gcctcctgca ggaggtggag aagactaggc actacctgct cctgcgggag 4261 aagctggaga ccgcccagcg gcctgtcccg gaggcactgt ccccggcctt cagcgaggac 4321 tctgagtccc atggctcctc cagcgcctcc tccccgctct cggctgaggg ccgcccatca 4381 cccctggagg ctcccaacga gaggcagcgg gagctggccg tcaagtgctt gcgcctgctc 4441 acgcacacat tcaacagaga gtacacacac agccacgtct gcgtcagtgc cagcgagagc 4501 aagctctccg agatgtctgt caccctgctc cgggacccgt cgatgtcccc tctaggggtg 4561 gccactctca ccccctcctc cacttgcccc tctctggttg aagggcggta cggtgccact 4621 gacctgagga ccccgcagcc ctgctcccgg ccagccagcc cagagcccga gctgctgcca 4681 gaggccgact ccaagaagct cccttcccct gcccgggcaa cagagacaga caaggagccc 4741 cagcgcctgc tggtccctga catccaggag atccgagtca gcccgatcgt ttccaagaag 4801 gggtacctgc acttcctgga gccgcacacg tcaggctggg ccaggcgctt cgtggtggtg 4861 cggcgcccct atgcctacat gtacaacagc gacaaggaca ccgtggagcg gttcgtgctc 4921 aacctggcca ctgcccaggt ggagtacagt gaggaccagc aggctatgct caagacaccc 4981 aacacattcg cggtgtgcac ggaacaccgc ggcatcctgc tgcaggccgc cagcgacaag 5041 gacatgcatg actggctgta cgccttcaac cccctcctgg ccgggaccat acggtccaag 5101 ctctccagaa ggaggtctgc ccagatgcgg gtctgaacct gagccctccc gtgacagccg 5161 gcaggcccag cccatcccct ccctcatcct cgtctgtcct gtcacctgcc gcccagcccc 5221 tctcctgcca gacagcccac gaccgggtcg accccccagg ggacgcccat gccaggcccg 5281 gggacctgtg ccacacgacc agctgtgctc ccagcagagg ctgtgcgtgt cagttcttct 5341 tgcagaatgt gctctggtgg aacaagttgg gagaggctgg gggggccaag ggcacaggtt 5401 acgggggttc ttgctgccgt tctaatattt ttttaagcat agacagactt ataattaata 5461 tacgttagtt agtgacattg aaacagtcaa ctcggaaatt aactataaga cttgttctat 5521 ttataagtat ttatttctaa tgcctccaca tagccctgta atattcagat ggaaccccca 5581 accacctcca ccctgtttgt tcccacatgt gtctcccaag cctgctaggg acaggcaggg 5641 cagggacagc caccttggaa ggccgcagtg aggagctgtc tggaccagtg gggcaccttg 5701 gggctagcac acgggtgtat cgcctgggcc ccaggcttct ccatggccac atgggtcctg 5761 ggtgtatgtg tgggagagtg ggggggtgtc tttggtgcct gaagtctgcg cggcatggag 5821 ggtggtgtga gttcctctgg tgggagggag aacgcacatc tcttctgggc ggccacctga 5881 ggagtgactc caagaagagt tccggcagct ttccccagga aagggtgagg ggtgacactc 5941 ggctctggct ctgagatgag gcagacggca cccaggctgt gatctgtcct gggcggggac 6001 caggagggag cggggtcggg atcacctgcc agtgtgcaga ctctgggact gcgtgctgtc 6061 tccggaccat cagggtaggg tggtgggttg agaccaggaa gtcagggaag atcggaattc 6121 agggcgacgg tctaggtgtc gagggctgtg gcgcagcctc ttcagctgcg gcgagaaatg 6181 gagtgagtca aggtagcttc tgggaagaaa tgctgccatt agcaggtttc ttgcaaagac 6241 tttcctctct ttgttcccag ggcagagagt ttctgtgagt cccactgaga aaatcccatg 6301 gggtgggggt atcctggtcg gtcggcaatg gagggtggct ggcttggtgg ttattgtctt 6361 caaggagctc tttgctgctg catctgcggt gtccctttgt tcttgtccca tttcaccccc 6421 tctgcagaca ccaatgtccg agggccaccc aggacaggac gggggtcagc cccaagctga 6481 gagtctggtc ataggagtca tgtccagagg cctagggagg ttttagggcc ctccccaccc 6541 acacccacag gtcgatttgg tctcttttta gctcaaggaa agacagtagc caagcaacag 6601 agcccctctc ccgccgtggc ccgtgggagc agttacatcg ggtctggtgc tccagaccta 6661 gggcccagca ctttcatcag atcctgcctc ctggagtggg ggaaacgcag caccccactg 6721 gttctgaggc ccctaccctc ccaggctgtc ccacgtgatg ctgacatgag cctcagagac 6781 cccaatccca tgcctggggg tccctgagtg gcaaaacatc ctacagtgga tagtcataca 6841 caacaaaaga taatcctgct caaaatgcca acagtgttcc cattgagaaa cactgaatta 6901 ctgatccttc acaggtcagt tcaaatcata cttgtcttta gaaacagttc tttatgttaa 6961 ccctaagccc gg // LOCUS HALIG4 3325 bp RNA PRI 08-JUN-1995 DEFINITION H.sapiens mRNA for DNA ligase IV. ACCESSION X83441 NID g860936 KEYWORDS DNA ligase IV; lig4 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3325) AUTHORS Wei,Y.F., Robins,P., Carter,K., Caldecott,K., Pappin,D.J.C., Yu,G.L., Wang,R.P., Shell,B.K., Nash,R., Schar,P., Barnes,D.E., Haseltine,W.A. and Lindahl,T. TITLE Molecular cloning and expression of human cDNAs encoding a novel DNA ligase IV and DNA ligase III, an enzyme active in DNA repair and recombination JOURNAL Mol. Cell. Biol. 15 (6), 3206-3216 (1995) MEDLINE 95280920 REFERENCE 2 (bases 1 to 3325) AUTHORS Schaer,P. TITLE Direct Submission JOURNAL Submitted (13-DEC-1994) P. Schaer, Imperial Cancer Research Fund, Clare Hall Labs., South Mimms, Herts EN6 3LD, UK COMMENT cDNA sequence deposited by: Ying-Fey Wei, Human Genome Sciences, Inc.,9620 Medical Center Drive, Rockville, MD. FEATURES Location/Qualifiers source 1..3325 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="prostate" /clone="HGS340221" /chromosome="13" /map="13q33-34" mRNA <1..>3325 /gene="LIG4" gene 1..3325 /gene="LIG4" CDS 475..3009 /gene="LIG4" /codon_start=1 /product="DNA ligase IV" /db_xref="PID:g860937" /translation="MRLILPQLERERMAYGIKETMLAKLYIELLNLPRDGKDALKLLN YRTPTGTHGDAGDFAMIAYFVLKPRCLQKGSLTIQQVNDLLDSIASNNSAKRKDLIKK SLLQLITQSSALEQKWLIRMIIKDLKLGVSQQTIFSVFHNDAAELHNVTTDLEKVCRQ LHDPSVGLSDISITLFSASKPMLAAIADIEHIEKDMKHQSFYIETKLDGERMQMHKDG DVYKYFSRNGYNYTDQFGASPTEGSLTPFIHNAFKADIQICILDGEMMAYNPNTQTFM QKGTKFDIKRMVEDSDLQTCYCVFDVLMVNNKKLGHETLRKRYEILSSIFTPIPGRIE IVQKTQAHTKNEVIDALNEAIDKREEGIMVKQPLSIYKPDKRGEGWLKIKPEYVSGLM DELDILIVGGYWGKGSRGGMMSHFLCAVAEKPPPGEKPSVFHTLSRVGSGCTMKELYD LGLKLAKYWKPFHRKAPPSSILCGTEKPEVYIEPCNSVIVQIKAAEIVPSDMYKTGCT LRFPRIEKIRDDKEWHECMTLDDLEQLRGKASGKLASKHLYIGGDDEPQEKKRKAAPK MKKVIGIIEHLKAPNLTNVNKISNIFEDVEFCVMSGTDSQPKPDLENRIAEFGGYIVQ NPGPDTYCVIAGSENIRVKNIILSNKHDVVKPAWLLECFKTKSFVPWQPRFMIHMCPS TKEHFAREYDCYGDSYFIDTDLNQLKEVFSGIKNSNEQTPEEMASLIADLEYRYSWDC SPLSMFRRHTVYLDSYAVINDLSTKNEGTRLAIKALELRFHGAKVVSCLAEGVSHVII GEDHSRVADFKAFRRTFKRKFKILKESWVTDSIDKCELQEENQYLI" polyA_signal 3248..3253 /gene="LIG4" polyA_signal 3318..3323 /gene="LIG4" BASE COUNT 1099 a 574 c 690 g 962 t ORIGIN 1 ccacagcgct gtagactgcg ccgcattaga agcctggcct cctgatgctg tgctcttcat 61 ctagacccaa gccccaggtc gtgggacgat ttctcccgtt tttgactccc tggaactgta 121 ttgcctgctt tacctgcgta catgttgatt ctttctcatg gcaaccccgc aggaaaccat 181 caagatctca ttttacagct gggattctct ggttcacaga ggtaacggag cttgcccgag 241 gccagttaaa cgagaagatt catcaccgct ttgatggctg cctcacaaac ttcacaaact 301 gttgcatctc acgttccttt tgcagatttg tgttcaactt tagaacgaat acagaaaagt 361 aaaggacgtg cagaaaaaat cagacacttc agggaatttt tagattcttg gagaaaattt 421 catgatgctc ttcataagaa ccacaaagat gtcacagact ctttttatcc agcaatgaga 481 ctaattcttc ctcagctaga aagagagaga atggcctatg gaattaaaga aactatgctt 541 gctaagcttt atattgagtt gcttaattta cctagagatg gaaaagatgc cctcaaactt 601 ttaaactaca gaacacccac tggaactcat ggagatgctg gagactttgc aatgattgca 661 tattttgtgt tgaagccaag atgtttacag aaaggaagtt taaccataca gcaagtaaac 721 gaccttttag actcaattgc cagcaataat tctgctaaaa gaaaagacct aataaaaaag 781 agccttcttc aacttataac tcagagttca gcacttgagc aaaagtggct tatacggatg 841 atcataaagg atttaaagct tggtgttagt cagcaaacta tcttttctgt ttttcataat 901 gatgctgctg agttgcataa tgtcactaca gatctggaaa aagtctgtag gcaactgcat 961 gatccttctg taggactcag tgatatttct atcactttat tttctgcatc aaaaccaatg 1021 ctagctgcta ttgcagatat tgagcacatt gagaaggata tgaaacatca gagtttctac 1081 atagaaacca agctagatgg tgaacgtatg caaatgcaca aagatggaga tgtatataaa 1141 tacttctctc gaaatggata taactacact gatcagtttg gtgcttctcc tactgaaggt 1201 tctcttaccc cattcattca taatgcattc aaagcagata tacaaatctg tattcttgat 1261 ggtgagatga tggcctataa tcctaataca caaactttca tgcaaaaggg aactaagttt 1321 gatattaaaa gaatggtaga ggattctgat ctgcaaactt gttattgtgt ttttgatgta 1381 ttgatggtta ataataaaaa gctagggcat gagactctga gaaagaggta tgagattctt 1441 agtagtattt ttacaccaat tccaggtaga atagaaatag tgcagaaaac acaagctcat 1501 actaagaatg aagtaattga tgcattgaat gaagcaatag ataaaagaga agagggaatt 1561 atggtaaaac aacctctatc catctacaag ccagacaaaa gaggtgaagg gtggttaaaa 1621 attaaaccag agtatgtcag tggactaatg gatgaattgg acattttaat tgttggagga 1681 tattggggta aaggatcacg gggtggaatg atgtctcatt ttctgtgtgc agtagcagag 1741 aagccccctc ctggtgagaa gccatctgtg tttcatactc tctctcgtgt tgggtctggc 1801 tgcaccatga aagaactgta tgatctgggt ttgaaattgg ccaagtattg gaagcctttt 1861 catagaaaag ctccaccaag cagcatttta tgtggaacag agaagccaga agtatacatt 1921 gaaccttgta attctgtcat tgttcagatt aaagcagcag agatcgtacc cagtgatatg 1981 tataaaactg gctgcacctt gcgttttcca cgaattgaaa agataagaga tgacaaggag 2041 tggcatgagt gcatgaccct ggacgaccta gaacaactta gggggaaggc atctggtaag 2101 ctcgcatcta aacaccttta tataggtggt gatgatgaac cacaagaaaa aaagcggaaa 2161 gctgccccaa agatgaagaa agttattgga attattgagc acttaaaagc acctaacctt 2221 actaacgtta acaaaatttc taatatattt gaagatgtag agttttgtgt tatgagtgga 2281 acagatagcc agccaaagcc tgacctggag aacagaattg cagaatttgg tggttatata 2341 gtacaaaatc caggcccaga cacgtactgt gtaattgcag ggtctgagaa catcagagtg 2401 aaaaacataa ttttgtcaaa taaacatgat gttgtcaagc ctgcatggct tttagaatgt 2461 tttaagacca aaagctttgt accatggcag cctcgcttta tgattcatat gtgcccatca 2521 accaaagaac attttgcccg tgaatatgat tgctatggtg atagttattt cattgataca 2581 gacttgaacc aactgaagga agtattctca ggaattaaaa attctaacga gcagactcct 2641 gaagaaatgg cttctctgat tgctgattta gaatatcggt attcctggga ttgctctcct 2701 ctcagtatgt ttcgacgcca caccgtttat ttggactcgt atgctgttat taatgacctg 2761 agtaccaaaa atgaggggac aaggttagct attaaagcct tggagcttcg gtttcatgga 2821 gcaaaagtag tttcttgttt agctgaggga gtgtctcatg taataattgg ggaagatcat 2881 agtcgtgttg cagattttaa agcttttaga agaactttta agagaaagtt taaaatccta 2941 aaagaaagtt gggtaactga ttcaatagac aagtgtgaat tacaagaaga aaaccagtat 3001 ttgatttaaa gctaggtttc ctagtgagga aagcctctga tctggcagac tcattgcagc 3061 aggtggtaat gataaaatac taaactacat tttatttttg tatcttaaaa atctatgcct 3121 aaaaagtatc attacatata ggaaaacaat aattttaact tttaaggttg aaaagacaat 3181 agcccaaagc caagaaagaa aaattatctt gaatgtagta ttcaatgatt ttttatgatc 3241 aaggtgaaat aaacagtcta aagaagaggt gtttttataa tatccatata gaaatctaga 3301 atttttactt agatactaat aaaat // LOCUS HAMLN70 550 bp RNA PRI 27-NOV-1995 DEFINITION H.sapiens MLN70 mRNA. ACCESSION X80201 NID g951232 KEYWORDS MLN 70 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 550) AUTHORS Tomasetto,C. TITLE Direct Submission JOURNAL Submitted (25-JUL-1994) C. Tomasetto, IGBMC, BP 163, 67404 ILLKIRCH Cedex, FRANCE REFERENCE 2 (bases 1 to 550) AUTHORS Tomasetto,C., Regnier,C., Moog-Lutz,C., Mattei,M.G., Chenard,M.P., Lidereau,R., Basset,P. and Rio,M.C. TITLE Identification of four novel human genes amplified and overexpressed in breast carcinoma and localized to the q11-q21.3 region of chromosome 17 JOURNAL Genomics 28 (3), 367-376 (1995) MEDLINE 96039245 FEATURES Location/Qualifiers source 1..550 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /tissue_type="breast derived metastatic lymph node" gene 78..395 /gene="MLN 70, S100 C" CDS 78..395 /gene="MLN 70, S100 C" /codon_start=1 /db_xref="PID:g951233" /translation="MAKISSPTETERCIESLIAVFQKYAGKDGYNYTLSKTEFLSFMN TELAAFTKNQKDPGVLDRMMKKLDTNSDGQLDFSEFLNLIGGLAMACHDSFLKAVPSQ KRT" polyA_site 520..525 BASE COUNT 137 a 184 c 108 g 121 t ORIGIN 1 cagacccgca cgccgcgcgc acagagctct cagcgccgct cccagccaca gcctcccgcg 61 cctcgctcag ctccaacatg gcaaaaatct ccagccctac agagactgag cggtgcatcg 121 agtccctgat tgctgtcttc cagaagtatg ctggaaagga tggttataac tacactctct 181 ccaagacaga gttcctaagc ttcatgaata cagaactagc tgccttcaca aagaaccaga 241 aggaccctgg tgtccttgac cgcatgatga agaaactgga caccaacagt gatggtcagc 301 tagatttctc agaatttctt aatctgattg gtggcctagc tatggcttgc catgactcct 361 tcctcaaggc tgtcccttcc cagaagcgga cctgaggacc ccttggccct ggccttcaaa 421 cccaccccct ttccttccag cctttctgtc atcatctcca cagcccaccc atcccctgag 481 cacactaacc acctcatgca ggccccacct gccaatagta ataaagcaat gtcacttttt 541 taaaacatgg // LOCUS HARNAMLK2 3454 bp RNA PRI 29-JAN-1996 DEFINITION H.sapiens mRNA for mixed lineage kinase 2. ACCESSION X90846 NID g971419 KEYWORDS kinase 2. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3454) AUTHORS Dorow,D.S., Devereux,L., Tu,G.F., Price,G., Nicholl,J.K., Sutherland,G.R. and Simpson,R.J. TITLE Complete nucleotide sequence, expression, and chromosomal localisation of human mixed-lineage kinase 2 JOURNAL Eur. J. Biochem. 234 (2), 492-500 (1995) MEDLINE 96128179 REFERENCE 2 (bases 1 to 3454) AUTHORS Dorow,D.S. TITLE Direct Submission JOURNAL Submitted (16-AUG-1995) D.S. Dorow, Peter MacCallum Cancer Institute, Research Division, Locked Bag #1 ABeckett Street, Melbourne, Victoria 3000, AUSTRALIA FEATURES Location/Qualifiers source 1..3454 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="brain" /map="19q13.2" /clone_lib="lambda gt10" CDS 289..3153 /codon_start=1 /product="mixed lineage kinase 2" /db_xref="PID:g971420" /translation="MEEEEGAVAKEWGTTPAGPVWTAVFDYEAAGDEELTLRRGDRVQ VLSQDCAVSGDEGWWTGQLPSGRVGVFPSNYVAPGAPAAPAGLQLPQEIPFHELQLEE IIGVGGFGKVYRALWRGEEVAVKAARLDPEKDPAVTAEQVCQEARLFGALQHPNIIAL RGACLNPPHLCLVMEYARGGALSRVLAGRRVPPHVLVNWAVQVARGMNYLHNDAPVPI IHRDLKSINILILEAIENHNLADTVLKITDFGLAREWHKTTKMSAAGTYAWMAPEVIR LSLFSKSSDVWSFGVLLWELLTGEVPYREIDALAVAYGVAMNKLTLPIPSTCPEPFAR LLEECWDPDPHGRPDFGSILKRLEVIEQSALFQMPLESFHSLQEDWKLEIQHMFDDLR TKEKELRSREEELLRAAQEQRFQEEQLRRREQELAEREMDIVERELHLLMCQLSQEKP RVRKRKGNFKRSRLLKLREGGSHISLPSGFEHKITVQASPTLDKRKGSDGASPPASPS IIPRLRAIRLTPVDCGGSSSGSSSGGSGTWSRGGPPKKEELVGGKKKGRTWGPSSTLQ KERVGGEERLKGLGEGSKQWSSSAPNLGKSPKHTPIAPGFASLNEMEEFAEAEDGGSS VPPSPYSTPSYLSVPLPAEPSPGARAPWEPTPSAPPARWGHGARRRCDLALLGCATLL GAVGLGADVAEARAADGEEQRRWLDGLFFPRAGRFPRGLSPPARPHGRREDVGPGLGL APSATLVSLSSVSDCNSTRSLLRSDSDEAAPAAPSPPPSPPAPTPTPSPSTNPLVDLE LESFKKDPGQSLTPTHVTAVCAVSRGHRRTPSDGALGQRGPPEPAGHGPGPRDLLDFP RLPDPQALFPARRRPPEFPGRPTTLTFAPRPRPAASRPRLDPWKLVSFGRTLTISPPS RPDTPESPGPPSVQPTLLDMDMEGQNQDSTVPLCGAHGSH" BASE COUNT 594 a 1217 c 1136 g 507 t ORIGIN 1 cgcgcggcca ggccctctta gccctctgcc gtttgggggg cacgggtgaa cctgccgccc 61 cactcccacc ccgccccgcc ccgcccgtac agacaaatcg gaagggacga gcctgccctt 121 tgaaagggtt ttttttcttg ctcctgcgga gggcgcccca gccatggccc tcaggagctc 181 cctagacccc gcagggactg ccctccatcc cggccgccgg ggcccgccct ctgcatcccg 241 cgggcagcct gtgtgaagcg gcctcccgca gcccccggcc cctcccccat ggaggaggag 301 gagggggcgg tggccaagga gtggggcacg acccccgcgg ggcccgtctg gaccgcggtg 361 ttcgactacg aggcggcggg cgacgaggag ctgaccctgc ggaggggcga tcgcgtccag 421 gtgctttccc aagactgtgc ggtgtccggc gacgagggct ggtggaccgg gcagctcccc 481 agcggccgcg tgggcgtctt ccccagcaac tacgtggccc ccggcgcccc cgctgcaccc 541 gcgggcctcc agctgcccca ggagatcccc ttccacgagc tgcagctaga ggagatcatc 601 ggtgtggggg gctttggcaa ggtctatcgg gccctgtggc gtggcgagga ggtggcagtc 661 aaggccgccc ggctggaccc tgagaaggac ccggcagtga cagcggagca ggtgtgccag 721 gaagcccggc tctttggagc cctgcagcac cccaacataa ttgcccttag gggcgcctgc 781 ctcaaccccc cacacctctg cctagtgatg gagtatgccc ggggtggtgc actgagcagg 841 gtgctggcag gtcgccgggt gccacctcac gtgctggtca actgggctgt gcaggtggcc 901 cggggcatga actacctaca caatgatgcc cctgtgccca tcatccaccg ggacctcaag 961 tccatcaaca tcctgatcct ggaggccatc gagaaccaca acctcgcaga cacggtgctc 1021 aagatcacgg acttcggcct cgcccgcgag tggcacaaga ccaccaagat gagcgctgcg 1081 gggacctacg cctggatggc gccggaggtt atccgtctct ccctcttctc caaaagcagt 1141 gatgtctgga gcttcggggt gctgctgtgg gagctgctga cgggggaggt cccctaccgt 1201 gagatcgacg ccttggccgt ggcgtatggc gtggctatga ataagctgac gctgcccatt 1261 ccctccacgt gccccgagcc ctttgcccgc ctcctggagg aatgctggga cccagacccc 1321 cacgggcggc cagatttcgg tagcatcttg aagcggcttg aagtcatcga acagtcagcc 1381 ctgttccaga tgccactgga gtccttccac tcgctgcagg aagactggaa gctggagatt 1441 cagcacatgt ttgatgacct tcggaccaag gagaaggagc ttcggagccg tgaggaggag 1501 ctgctgcggg cggcacagga gcagcgcttc caggaggagc agctgcggcg gcgggagcag 1561 gagctggcag aacgtgagat ggacatcgtg gaacgggagc tgcacctgct catgtgccag 1621 ctgagccagg agaagccccg ggtccgcaag cgcaagggca acttcaagcg cagccgcctg 1681 ctcaagctgc gggaaggcgg cagccacatc agcctgccct ctggctttga gcataagatc 1741 acagtccagg cctctccaac tctggataag cggaaaggat ccgatggggc cagcccccct 1801 gcaagcccca gcatcatccc ccggctgagg gccattcgcc tgactcccgt ggactgtggt 1861 ggcagcagca gtggcagcag cagtggagga agtgggacat ggagccgcgg tgggccccca 1921 aagaaggaag aactggtcgg gggcaagaag aagggacgaa cgtgggggcc cagctccacc 1981 ctgcagaagg agcgggtggg aggagaggag aggctgaagg ggctggggga aggaagcaaa 2041 cagtggtcat caagtgcccc caacctgggc aagtccccca aacacacacc catcgcccct 2101 ggcttcgcca gcctcaatga gatggaggag ttcgcggagg cagaggatgg aggcagcagc 2161 gtgccccctt ccccctactc gaccccgtcc tacctctcag tgccactgcc tgccgagccc 2221 tccccggggg cgcgggcgcc gtgggagccg acgccgtccg cgccccccgc tcggtgggga 2281 cacggcgccc ggcggcgctg cgacctggcg ctgctaggct gcgccacgct gctgggggct 2341 gtgggcctgg gcgccgacgt ggccgaggcg cgcgcggccg acggtgagga gcagcggcgc 2401 tggctcgacg gcctcttctt tccccgcgcc ggccgcttcc cgcggggcct cagcccaccc 2461 gcgcgtcccc acggccgccg cgaagacgtg ggccccggcc tgggcctggc gccctcggcc 2521 accctcgtgt cgctgtcgtc cgtgtccgac tgcaactcca cgcgttcact gctgcgctct 2581 gacagtgacg aggccgcacc ggccgcgccc tccccaccac cctccccgcc cgcgcccaca 2641 cccacgccct cgcccagcac caaccccctg gtggacctgg agctggagag cttcaagaag 2701 gaccccggcc agtcgctcac gcccacccac gtcacggctg tatgcgctgt gagccgcggg 2761 caccggcgga cgccatcgga tggggcgctg gggcagcggg ggccgcccga gcccgcgggc 2821 catggccctg gccctcgtga ccttctggac ttcccccgcc tgcccgaccc ccaggctctg 2881 ttcccagccc gccgccggcc ccctgagttc ccaggccgcc ccaccaccct gacctttgcc 2941 ccgagacctc ggccggctgc cagtcgcccc cgcttggacc cctggaaact ggtctccttc 3001 ggccggacac tcaccatctc gcctcccagc aggccagaca ctccggagag ccctgggccc 3061 cccagcgtgc agcccacact gctggacatg gacatggagg ggcagaacca agacagcaca 3121 gtgcccctgt gcggggccca cggctcccac taaggcctgc ccaccaccgc ccgcctgggc 3181 agccatgaat gtagcgcccc aggccctgcc ccagcccgcc atgccacaag gtgggggagg 3241 ccctgggcag gatgttcact ctatttattg gggaaggagg gagggggggg acacttaact 3301 tattcctttg taccccaggg ggtggagccc tgtgcccacc ctgcactggg gggagggtgg 3361 gcagggatac tcagggacag ggcatcatgg gggatttggc acaaaatgga acaataaagg 3421 taaccccttg cccccccccc caaaaaaaaa aaaa // LOCUS HAST5 1041 bp mRNA PRI 04-FEB-1998 DEFINITION Homo sapiens sulfotransferase mRNA, complete cds. ACCESSION AF026303 NID g2828823 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1041) AUTHORS Zhu,X.Y., Windmill,K.F., Brix,L. and McManus,M.E. TITLE Characterisation of a human stomach sulfotransferase JOURNAL Unpublished REFERENCE 2 (bases 1 to 1041) AUTHORS McManus,M.E. TITLE Direct Submission JOURNAL Submitted (22-SEP-1997) Physiology and Pharmacology, University of Queensland, Brisbane, Queensland 4072, Australia FEATURES Location/Qualifiers source 1..1041 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /cell_type="epithelial" /tissue_type="stomach mucosa" CDS 86..976 /function="sulfonation of xenobiotics" /codon_start=1 /evidence=experimental /product="sulfotransferase" /db_xref="PID:g2828824" /translation="MALTSDLGKQIKLKEVEGTLLQPATVDNWSQIQSFEAKPDDLLI CTYPKAGTTWIQEIVDMIEQNGDVEKCQRAIIQHRHPFIEWARPPQPSGVEKAKAMPS PRILKTHLSTQLLPPSFWENNCKFLYVARNAKDCMVSYYHFQRMNHMLPDPGTWEEYF ETFINGKVVWGSWFDHVKGWWEMKDRHQILFLFYEDIKRDPKHEIRKVMQFMGKKVDE TVLDKIVQETSFEKMKENPMTNRSTVSKSILDQSISSFMRKGTVGDWKNHFTVAQNER FDEIYRRKMEGTSINFCMEL" BASE COUNT 318 a 235 c 249 g 239 t ORIGIN 1 catcatacta cgtgtgaccc ttgagtgggc ctttgagctg ctgactttca gctggaactt 61 gaagggaccc caaccctgag acactatggc cctgacctca gacctgggga aacagataaa 121 actgaaagag gtggagggga ccctcctgca gcctgcaact gtggacaact ggagccagat 181 ccagagcttc gaggccaaac cagatgatct cctcatctgc acctacccta aagcagggac 241 aacgtggatt caggaaattg tggatatgat tgaacagaat ggggacgtgg agaagtgcca 301 gcgagccatc atccaacacc gccatccttt cattgagtgg gctcggccac cccaaccttc 361 tggtgtggaa aaagccaaag caatgccctc tccacggata ctaaagactc acctttccac 421 tcagctgctg ccaccgtctt tctgggaaaa caactgcaag ttcctttatg tagctcgaaa 481 tgccaaagac tgtatggttt cctactacca tttccaaagg atgaaccaca tgcttcctga 541 ccctggtacc tgggaagagt attttgaaac cttcatcaat ggaaaagtgg tttggggttc 601 ctggtttgac cacgtgaaag gatggtggga gatgaaagac agacaccaga ttctcttcct 661 cttctatgag gacataaaga gggacccaaa gcatgaaatt cggaaggtga tgcagttcat 721 gggaaagaag gtggatgaaa cagtgctaga taaaattgtc caggagacgt catttgagaa 781 aatgaaagaa aatcccatga caaatcgttc tacagtttcc aaatctatct tggaccagtc 841 aatttcctcc ttcatgagaa aaggaactgt gggggattgg aaaaaccact tcactgttgc 901 ccagaatgag aggtttgatg aaatctatag aagaaagatg gaaggaacct ccataaactt 961 ctgcatggaa ctctgagcaa gatgtaaata aaattaaaag gtggatggca agagtgcaaa 1021 tactatcttc aatccttcag t // LOCUS HCOX4AL 1061 bp mRNA PRI 02-JAN-1998 DEFINITION Homo sapiens COX4AL mRNA, complete cds. ACCESSION AF005888 NID g2738487 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1061) AUTHORS Bachman,N.J., Wu,W., Grossman,L.I. and Lomax,M.I. TITLE The COX4 gene and a linked gene, COX4AL, are controlled by a bidirectional promoter JOURNAL Unpublished REFERENCE 2 (bases 1 to 1061) AUTHORS Lomax,M.I. TITLE Direct Submission JOURNAL Submitted (29-MAY-1997) University of Michigan, Anatomy and Cell Biology, 1335 E. Catherine St., Ann Arbor, MI, USA, 48109-0616 FEATURES Location/Qualifiers source 1..1061 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="IMAGE consortium ID# 219691" /clone_lib="Soares retina N2b4HR" /map="16q22-qter" /sex="male" /tissue_type="retina" /chromosome="16" 5'UTR 1..116 /gene="COX4AL" gene 1..1061 /gene="COX4AL" CDS 117..749 /gene="COX4AL" /function="unknown" /codon_start=1 /db_xref="PID:g2738488" /translation="MPGVKLTTQAYCKMVLHGAKYPHCAVNGLLVAEKQKPRKEHLPL GGPGAHHTLFVDCIPLFHGTLALAPMLEVALTLIDSWCKDHSYVIAGYYQANERVKDA SPNQVAEKVASRIAEGFSDTALIMVDNTKFTMDCVAPTIHVYEHHENRWRCRDPHHDY CEDWPEAQRISASLLDSRSYETLVDFDNHLDDIRNDWTNPEINKAVLHLC" 3'UTR 750..1060 /gene="COX4AL" polyA_signal 1037..1042 /gene="COX4AL" polyA_site 1060..1061 /gene="COX4AL" BASE COUNT 241 a 314 c 283 g 223 t ORIGIN 1 gcgccatcaa tcgccgccgc ctcgtcccgc ttctcggctg aggcgccgcg cggccaggca 61 gcgggtccag gcctcagccg cgcgcccagg ggcctccggg gccctcccgg gtcagcatgc 121 ccggggtgaa actgaccacc caggcctact gcaagatggt gctgcacggc gccaagtacc 181 cgcactgcgc cgtcaacggg ctcctggtgg ccgagaagca gaagccgcgt aaggagcacc 241 tccccctggg cggccccggc gcccaccaca ccctcttcgt ggactgcatc cccctcttcc 301 acggcaccct ggccctcgcc cccatgctgg aggtggctct caccctgatt gattcatggt 361 gcaaagatca tagctacgtg attgctggtt attatcaagc taatgagcga gtaaaggatg 421 ccagtccaaa ccaggttgca gagaaggtgg cctccagaat cgccgagggc ttcagcgaca 481 ctgcgctcat catggtagac aacaccaagt ttacgatgga ctgcgtagcg cctacgatcc 541 acgtgtacga gcaccatgag aacagatggc ggtgcagaga cccacaccat gactactgtg 601 aagactggcc agaggcacag aggatctcag cctcgctcct ggacagccgg tcctacgaga 661 cgctcgtgga tttcgataac cacctggatg acattcggaa tgactggaca aacccagaga 721 tcaataaagc tgtcctacac ttgtgctagg caggcaccgc tgtgactggg ctccgggcct 781 ttcccactac gttgaagaag aaaacctatt tttaaatgta aataaaatat ctggtagcct 841 gtgtggaaag ctgaccgttt taagaagtgg catgtgcctt gaaagggggc agaatgttca 901 gtcggtcgtg tttttaacac agagtctcta gaagaggtgc agacatcccg tctgactgtc 961 cctgtggact ctctcagttg tatgttgcta taatcctcca aatcaaagct ctttctgctt 1021 gtgcaagatt gttcctatta aacagtttta actaaccttt a // LOCUS HHSCYSDIO 1556 bp RNA PRI 14-JUL-1995 DEFINITION H.sapiens mRNA for cysteine dioxygenase type 1. ACCESSION Z31357 NID g467560 KEYWORDS cysteine dioxygenase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1556) AUTHORS McCann,K.P. TITLE Direct Submission JOURNAL Submitted (18-MAR-1994) McCann K. P., University of Birmingham, Department of Medicine, Birmingham, UK REFERENCE 2 (bases 1 to 1556) AUTHORS McCann,K.P., Akbari,M.T., Williams,A.C. and Ramsden,D.B. TITLE Human cysteine dioxygenase type I: primary structure derived from base sequencing of cDNA JOURNAL Biochim. Biophys. Acta 1209 (1), 107-110 (1994) MEDLINE 95035042 FEATURES Location/Qualifiers source 1..1556 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /cell_type="hepatocyte" /germline CDS 255..857 /EC_number="1.13.11.20" /codon_start=1 /product="cysteine dioxygenase type 1" /db_xref="PID:g467561" /translation="MEQTEVLKPRTLADLIRILHQLFAGDEVNVEEVQAIMEAYESDP IEWAMYAKFDQYRYTRNLVDQGNGKFNLMILCWGEGHGSSIHDHTNSHCFLKMLQGNL KETLFAWPDKKSNEMVKKSERVLRENQCAYINDSIGLHRVENISHTEPAVSLHLYSPP FDTCHAFDQRTGHKNKVTMTFHSKFGIRTPNATSGSLENN" BASE COUNT 449 a 347 c 346 g 414 t ORIGIN 1 cggtcagatt tgtgtgtgca ccgcgtctcc agcgatcccg gatccactgc gctgccaggg 61 gcctgggggt gggtctcttg ctgtctctgc gacgacatcc ttacgtttcg gcactctaat 121 gctgggtttg tgcgtgtgtg tctgcttagc ggtctagcgg gctgttaggc tccctcgccc 181 ccagctcctt ggctcgctca gctcctccac cgcagcccag cagtgagacg cgcgcgcagc 241 cagctcccca cgagatggaa cagaccgaag tgctgaagcc acggaccctg gctgatctga 301 tccgcatcct gcaccagctc tttgccggcg atgaggtcaa tgtagaggag gtgcaggcca 361 tcatggaagc ctacgagagc gaccccatcg agtgggcaat gtacgccaag ttcgaccagt 421 acaggtatac ccgaaatctt gtggatcaag gaaatggaaa atttaatctg atgattctct 481 gttggggtga aggacatggc agcagtattc atgatcatac caactcccac tgctttctga 541 agatgctaca gggaaatcta aaggagacat tatttgcctg gcctgacaaa aaatccaatg 601 agatggtcaa gaagtctgaa agagtcttga gggaaaacca gtgtgcctac atcaatgatt 661 ccattggctt acatcgagta gagaacatca gccatacgga acctgctgtg agccttcact 721 tgtacagtcc accttttgat acatgccatg cctttgatca aagaacagga cataaaaaca 781 aagtcacaat gacattccat agtaaatttg gaatcagaac tccaaatgca acttcgggct 841 cgctggagaa caactaaggg gcaccaaacc ctctgaggtt ttactttaag gttcgctgta 901 tgtttgcctt ggacaaaaag gctacctacc acgtgctatc cagtaatata cttaaataag 961 ccaatactta gatctactgt aaggcagatg ctaattataa ggcattaagt aagcaaatag 1021 tgccctcagc tactgcagaa gaaaagtccc actgaggaaa agaaagtctt gtgattttta 1081 aaggcaagtt ttcaagtgct ctcatagttc tatcctctaa ttccattaaa tccatactag 1141 gagcgtcagt gagggttttc atagcttttg gaaatacttt ggtctctgaa ctgtaattag 1201 caagaagtaa aaacagaaac gtcaaacgtc aaatgtttgc tttgttacct ggaggactaa 1261 atgtagatgt ctttagtata ctttgtatgt tcttaaatat tggaagataa ttttgtgaat 1321 ctgtagattt tattttttca gtcttacctt acaaatttct tttctatgaa taatagagga 1381 actcacggca ctctgccact tgttaatgaa aggaagtgca gaggatttag aaaagtacat 1441 gatccccaga ccacaacaaa ccaaaacata aactcatgtc tgtgtcccat ggtcatagtc 1501 aaagattttg tactgctaaa attaccaaat aatttaaata aagtggattt gaacac // LOCUS HMAIF 2281 bp RNA PRI 16-JAN-1995 DEFINITION Human mRNa for adipogenesis inhibitory factor. ACCESSION X58377 NID g22952 KEYWORDS adipogenesis; cytokine; inhibitory factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2281) AUTHORS Kawashima,I., Ohsumi,J., Mita-Honjo,K., Shimoda-Takano,K., Ishikawa,H., Sakakibara,S., Miyadai,K. and Takiguchi,Y. TITLE Molecular cloning of cDNA encoding adipogenesis inhibitory factor and identity with interleukin-11 JOURNAL FEBS Lett. 283 (2), 199-202 (1991) MEDLINE 91257301 REFERENCE 2 (bases 1 to 2281) AUTHORS Kawashima,I. TITLE Direct Submission JOURNAL Submitted (22-JUL-1991) I. Kawashima, Bioscience Res Laboratories, Sankyo Co Ltd, 2-58 Hiromachi 1-chrome, Shinagawa ku, Tokyo 140, JAPAN COMMENT See M57765, M57766, M37006 & M37007 for related sequences. FEATURES Location/Qualifiers source 1..2281 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KM-102" /clone="20-I" mRNA 1..2281 /evidence=experimental sig_peptide 64..126 CDS 64..663 /codon_start=1 /product="adipogenesis inhibitory factor" /db_xref="PID:g22953" /db_xref="SWISS-PROT:P20809" /translation="MNCVCRLVLVVLSLWPDTAVAPGPPPGPPRVSPDPRAELDSTVL LTRSLLADTRQLAAQLRDKFPADGDHNLDSLPTLAMSAGALGALQLPGVLTRLRADLL SYLRHVQWLRRAGGSSLKTLEPELGTLQARLDRLLRRLQLLMSRLALPQPPPDPPAPP LAPPSSAWGGIRAAHAILGGLHLTLDWAVRGLLLLKTRL" mat_peptide 127..660 /product="adipogenesis inhibitory factor" polyA_signal 1090..1096 repeat_region 1307..1607 /rpt_family="ALU repetitive element" repeat_region 1885..2006 /rpt_family="ALU repetitive element" polyA_signal 2261..2266 BASE COUNT 492 a 609 c 656 g 524 t ORIGIN 1 gaagggttaa aggcccccgg ctccctgccc cctgccctgg ggaacccctg gccctgtggg 61 gacatgaact gtgtttgccg cctggtcctg gtcgtgctga gcctgtggcc agatacagct 121 gtcgcccctg ggccaccacc tggcccccct cgagtttccc cagaccctcg ggccgagctg 181 gacagcaccg tgctcctgac ccgctctctc ctggcggaca cgcggcagct ggctgcacag 241 ctgagggaca aattcccagc tgacggggac cacaacctgg attccctgcc caccctggcc 301 atgagtgcgg gggcactggg agctctacag ctcccaggtg tgctgacaag gctgcgagcg 361 gacctactgt cctacctgcg gcacgtgcag tggctgcgcc gggcaggtgg ctcttccctg 421 aagaccctgg agcccgagct gggcaccctg caggcccgac tggaccggct gctgcgccgg 481 ctgcagctcc tgatgtcccg cctggccctg ccccagccac ccccggaccc gccggcgccc 541 ccgctggcgc ccccctcctc agcctggggg ggcatcaggg ccgcccacgc catcctgggg 601 gggctgcacc tgacacttga ctgggccgtg aggggactgc tgctgctgaa gactcggctg 661 tgacccgggg cccaaagcca ccaccgtcct tccaaagcca gatcttattt atttatttat 721 ttcagtactg ggggcgaaac agccaggtga tccccccgcc attatctccc cctagttaga 781 gacagtcctt ccgtgaggcc tgggggacat ctgtgcctta tttatactta tttatttcag 841 gagcaggggt gggaggcagg tggactcctg ggtccccgag gaggagggga ctggggtccc 901 ggattcttgg gtctccaaga agtctgtcca cagacttctg ccctggctct tccccatcta 961 ggcctgggca ggaacatata ttatttattt aagcaattac ttttcatgtt ggggtgggga 1021 cggaggggaa agggaagcct gggtttttgt acaaaaatgt gagaaacctt tgtgagacag 1081 agaacaggga attaaatgtg tcatacatat ccacttgagg gcgatttgtc tgagagctgg 1141 ggctggatgc ttgggtaact ggggcagggc aggtggaggg gagacctcca ttcaggtgga 1201 ggtcccgagt gggcggggca gcgactggga gatgggtcgg tcacccagac agctctgtgg 1261 aggcagggtc tgagccttgc ctggggcccc gcactgcata gggccgtttg tttgtttttt 1321 gagatggagt ctcgctctgt tgcctaggct ggagtgcagt gaggcaatct aaggtcactg 1381 caagctccac ctcccgggtt caagcaattc tcctgcctca gcctcccgat tagctgggat 1441 cacaggtgtg caccaccatg cccagctaat tatttatttc ttttgtattt ttagtagaga 1501 cagggtttca ccatgttggc caggctggtt tcgaactcct gacctcaggt gatcctcctg 1561 cctcggcctc ccaaagtgct gggattacag gtgtgagcca ccacacctga cccataggtc 1621 ttcaataaat atttaatgga aggttccaca agtcaccctg tgatcaacag tacccgtatg 1681 ggacaaagct gcaaggtcaa gatggttcat tatggctgtg ttcaccatag caaactggaa 1741 agaatctaga tatccaacag tgagggttaa gcaacatggt gcatctgtgg atagaacacc 1801 acccagccgc ccggagcagg gactgtcatt cagggaggct aaggagagag gcttgcttgg 1861 gatatagaaa gatatcctga cattggccag gcatggtggc tcacgcctgt aatcctggca 1921 ctttgggagg acgaagcgag tggatcactg aagtccaaga gtttgagacc ggcctgcgag 1981 acatggcaaa accctgtctc aaaaaagaaa gaatgatgtc ctgacatgaa acagcaggct 2041 acaaaaccac tgcatgctgt gatcccaatt ttgtgttttt ctttctatat atggattaaa 2101 acaaaaatcc taaagggaaa tacgccaaaa tgttgacaat gactgtctcc aggtcaaagg 2161 agagaggtgg gattgtgggt gacttttaat gtgtatgatt gtctgtattt tacagaattt 2221 ctgccatgac tgtgtatttt gcatgacaca ttttaaaaat aataaacact atttttagaa 2281 t // LOCUS HS09008 2057 bp RNA PRI 15-APR-1997 DEFINITION H.sapiens mRNA for uracil-DNA glycosylase. ACCESSION Y09008 NID g1850820 KEYWORDS ung2 gene; uracil-DNA glycosylase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2057) AUTHORS Nilsen,H., Otterlei,M., Haug,T., Solum,K., Nagelhus,T.A., Skorpen,F. and Krokan,H.E. TITLE Nuclear and mitochondrial uracil-DNA glycosylases are generated by alternative splicing and transcription from different positions in the UNG gene JOURNAL Nucleic Acids Res. 25 (4), 750-755 (1997) MEDLINE 97169285 REFERENCE 2 (bases 1 to 2057) AUTHORS Nilsen,H. TITLE Direct Submission JOURNAL Submitted (23-OCT-1996) H. Nilsen, UNIGEN Center for Molecular Biology, University of Trondheim, N-7005 Trondheim, NORWAY REMARK revised by author FEATURES Location/Qualifiers source 1..2057 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="UniZap-XR" gene 71..1012 /gene="ung2" CDS 71..1012 /gene="ung2" /codon_start=1 /product="uracil-DNA glycosylase" /db_xref="PID:e303306" /db_xref="PID:g1850821" /translation="MIGQKTLYSFFSPSPARKRHAPSPEPAVQGTGVAGVPEESGDAA AIPAKKAPAGQEEPGTPPSSPLSAEQLDRIQRNKAAALLRLAARNVPVGFGESWKKHL SGEFGKPYFIKLMGFVAEERKHYTVYPPPHQVFTWTQMCDIKDVKVVILGQDPYHGPN QAHGLCFSVQRPVPPPPSLENIYKELSTDIEDFVHPGHGDLSGWAKQGVLLLNAVLTV RAHQANSHKERGWEQFTDAVVSWLNQNSNGLVFLLWGSYAQKKGSAIDRKRHHVLQTA HPSPLSVYRGFFGCRHFSKTNELLQKSGKKPIDWKEL" BASE COUNT 488 a 489 c 539 g 541 t ORIGIN 1 cacagccaca gccagggcta gcctcgccgg ttcccgggtg gcgcgcgttc gctgcctcct 61 cagctccagg atgatcggcc agaagacgct ctactccttt ttctccccca gccccgccag 121 gaagcgacac gcccccagcc ccgagccggc cgtccagggg accggcgtgg ctggggtgcc 181 tgaggaaagc ggagatgcgg cggccatccc agccaagaag gccccggctg ggcaggagga 241 gcctgggacg ccgccctcct cgccgctgag tgccgagcag ttggaccgga tccagaggaa 301 caaggccgcg gccctgctca gactcgcggc ccgcaacgtg cccgtgggct ttggagagag 361 ctggaagaag cacctcagcg gggagttcgg gaaaccgtat tttatcaagc taatgggatt 421 tgttgcagaa gaaagaaagc attacactgt ttatccaccc ccacaccaag tcttcacctg 481 gacccagatg tgtgacataa aagatgtgaa ggttgtcatc ctgggacagg atccatatca 541 tggacctaat caagctcacg ggctctgctt tagtgttcaa aggcctgttc cgcctccgcc 601 cagtttggag aacatttata aagagttgtc tacagacata gaggattttg ttcatcctgg 661 ccatggagat ttatctgggt gggccaagca aggtgttctc cttctcaacg ctgtcctcac 721 ggttcgtgcc catcaagcca actctcataa ggagcgaggc tgggagcagt tcactgatgc 781 agttgtgtcc tggctaaatc agaactcgaa tggccttgtt ttcttgctct ggggctctta 841 tgctcagaag aagggcagtg ccattgatag gaagcggcac catgtactac agacggctca 901 tccctcccct ttgtcagtgt atagagggtt ctttggatgt agacactttt caaagaccaa 961 tgagctgctg cagaagtctg gcaagaagcc cattgactgg aaggagctgt gatcatcagc 1021 tgaggggtgg cctttgagaa gctgctgtta acgtatttgc cagttacgaa gttccactga 1081 aaattttcct attaattctt aagtactctg cataaggggg aaaagcttcc agaaagcagc 1141 catgaaccag gctgtccagg aatggcagct gtatccaacc acaaacaaca aaggctaccc 1201 tttgaccaaa tgtctttctc tgcaacatgg cttcggccta aaatatgcag aagacagatg 1261 aggtcaaata ctcagttggc tctctttatc tcccttgcct ttatggtgaa acaggggaga 1321 tgtgcacctt tcaggcacag ccctagtttg gcgcctgctg ctccttggtt ttgcctggtt 1381 agactttcag tgacagatgt tggggtgttt ttgcttagaa aggtcccctt gtctcagcct 1441 tgcagggcag gcatgccagt ctctgccagt tccactgccc ccttgatctt tgaaggagtc 1501 ctcaggcccc tcgcagcata aggatgtttt gcaactttcc agaatctggc ccagaaatta 1561 gggctcaatt tcctgattgt agtagaggtt aagattgctg tgagctttat cagataagag 1621 accgagagaa gtaagctggg tcttgttatt ccttgggtgt tggtggaata agcagtggaa 1681 tttgaacaag gaagaggaga aaagggaatt ttgtctttat ggggtggggt gattttctcc 1741 tagggttatg tccagttggg gtttttaagg cagcacagac tgccaagtac tgtttttttt 1801 aaccgactga aatcactttg ggatattttt tcctgcaaca ctggaaagtt ttagtttttt 1861 aagaagtact catgcagata tatatatata tatttttccc agtccttttt ttaagagacg 1921 gtctttattg ggtctgcacc tccatccttg atcttgttag caatgctgtt tttgctgtta 1981 gtcgggttag agttggctct acgcgaggtt tgttaataaa agtttgttaa aagttcaaaa 2041 aaaaaaaaaa aaacccg // LOCUS HS1054RNA 1213 bp RNA PRI 19-JUL-1993 DEFINITION H.sapiens mRNA for HS1 protein. ACCESSION X57346 NID g23113 KEYWORDS kinase related protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1213) AUTHORS Leffers,H. TITLE Direct Submission JOURNAL Submitted (23-JAN-1991) H. Leffers, University of Aahrus, Institute of Medical Biochemistry, Universitetsparken Bygn 170, DK 8000 Aahrus C, DENMARK REFERENCE 2 (bases 1 to 1213) AUTHORS Leffers,H., Madsen,P., Rasmussen,H.H., Honore,B., Andersen,A.H., Walbum,E., Vandekerckhove,J. and Celis,J.E. TITLE Molecular cloning and expression of the transformation sensitive epithelial marker stratifin. A member of a protein family that has been involved in the protein kinase C signalling pathway JOURNAL J. Mol. Biol. 231 (4), 982-998 (1993) MEDLINE 93294871 COMMENT The protein is related to the protein kinase C inhibitory protein (KCIP) and protein 14-3-3, an activator of tyrosine and tryptophan hydroxylases. It is expressed in all the cell lines and tissues being assayed. See accession numbers x57345-48. FEATURES Location/Qualifiers source 1..1213 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="SV40 transformed MRC-5 fibroblasts (MRC-5 V2)" /clone_lib="lambda gt11 cDNA library" /clone="1054" gene 373..1113 /gene="HS1" CDS 373..1113 /gene="HS1" /codon_start=1 /db_xref="PID:g23114" /db_xref="SWISS-PROT:P31946" /translation="MTMDKSELVQKAKLAEQAERYDDMAAAMKAVTEQGHELSNEERN LLSVAYKNVVGARRSSWRVISSIEQKTERNEKKQQMGKEYREKIEAELQDICNDVLEL LDKYLIPNATQPESKVFYLKMKGDYFRYLSEVASGDNKQTTVSNSQQAYQEAFEISKK EMQPTHPIRLGLALNFSVFYYEILNSPEKACSLAKTAFDEAIAELDTLNEESYKDSTL IMQLLRDNLTLWTSENQGDEGDAGEGEN" polyA_site 1179 BASE COUNT 346 a 299 c 321 g 247 t ORIGIN 1 taccgccacc gccgccgccg attccggagc cggggtagtc gccgccgccg ccgccgccgc 61 tgcagccact gcaggcaccg ctgccgccgc ctgagtagtg taccgccacc gccgccgccg 121 attccggagc cggggtagtc gccgccgccg ccgccgccgc tgcagccact gcaggcaccg 181 ctgccgccgc ctgagtagtg ggcttaggaa ggaagaggtc atctcgctcg gagcttcgct 241 cggaagggtc tttgttccct gcagccctcc cacggcagag tctccagaga tttgggccgc 301 tacaaaaagt gcattttgcc cattcggctg tggatagaga agcaggaaga gcactggact 361 tggagtcagg gaatgacaat ggataaaagt gagctggtac agaaagccaa actcgctgag 421 caggctgagc gctatgatga tatggctgca gccatgaagg cagtcacaga acaggggcat 481 gaactctcca acgaagagag aaatctgctc tctgttgcct acaagaatgt ggtaggcgcc 541 cgccgctctt cctggcgtgt catctccagc attgagcaga aaacagagag gaatgagaag 601 aagcagcaga tgggcaaaga gtaccgtgag aagatagagg cagaactgca ggacatctgc 661 aatgatgttc tggagctgtt ggacaaatat cttattccca atgctacaca accagaaagt 721 aaggtgttct acttgaaaat gaaaggagat tattttaggt atctttctga agtggcatct 781 ggagacaaca aacaaaccac tgtgtcgaac tcccagcagg cttaccagga agcatttgaa 841 attagtaaga aagaaatgca gcctacacac ccaattcgtc ttggtctggc actaaatttc 901 tcagtctttt actatgagat tctaaactct cctgaaaagg cctgtagcct ggcaaaaacg 961 gcatttgatg aagcaattgc tgaattggat acgctgaatg aagagtctta taaagacagc 1021 actctgatca tgcagttact tagggacaat ctcactctgt ggacatcgga aaaccaggga 1081 gacgaaggag acgctgggga gggagagaac taatgtttct cgtgctttgt gatctgtcca 1141 gtgtcactct gtaccctcaa catatatccc ttgtgcgata aaaaaaaaaa aaaaaaaaaa 1201 aaaaaaaaaa aaa // LOCUS HS1141O19 52086 bp DNA PRI 07-JAN-1998 DEFINITION Human DNA sequence from PAC 1141O19 on chromosome 1q23-24. Contains Tenascin (Restrictin, Hexabrachion, Cytotactin, Neuronectin, GMEM, J1-160/180, Miotendinous antigen, Glioma-associated-extracellular-matrix antigen, GP150-225) or Undulin 1 like gene and KIAA0040 gene, ESTs and an STS. ACCESSION Z99715 NID g2760553 KEYWORDS 1q23-24; Cytotactin; Glioma-associated-extracellular-matrix antigen; GMEM; GP150-225; Hexabrachion; J1-160/180; KIAA0040; Miotendinous antigen; Neuronectin; Restrictin; Tenascin; Undulin 1. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 52086) AUTHORS Bird,C. TITLE Direct Submission JOURNAL Submitted (02-JAN-1998) sanger.ac.uk/HGP/Chr1/) Sanger Centre, Hinxton, Cambridgeshire, CB10 1SA, UK. E-mail enquires: humquery@sanger.ac.uk Clone requests: clonerequest@sanger.ac.uk COMMENT IMPORTANT: This sequence is not the entire insert of clone 1141O19. It may be shorter because we only sequence overlapping sections once, or longer because we arrange for a small overlap between neighbouring submissions. During sequence assembly data is compared from overlapping clones. Where differences are found these are annotated as variations together with a note of the overlapping clone name. Note that the variations annotated may not be found in the sequence submission corresponding to the overlapping clone as we submit sequences with only a small overlap as described above. This sequence was generated from part of bacterial clone contigs of human chromosome 1, constructed by the Sanger Centre chromosome 1 mapping group. Further information can be found at http://www.sanger.ac.uk/HGP/Chr1/ This sequence has been finished according to sequence map criteria as follows. An attempt is made to resolve all sequencing problems, such as compressions and repeats, but not necessarily within known annotated human repeat sequence elements (e.g. Alu). Where the sequence is ambiguous, there is an annotation using the 'unsure' feature key. The true right end of clone 1114G22 is at 104. The true left end of clone 262D12 is at 51983. 1141O19 is from the library RPCI5 constructed at the Roswell Park Cancer Institute by the group of Pieter de Jong. For further details see http://bacpac.med.buffalo.edu/. FEATURES Location/Qualifiers source 1..52086 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1q23-24" /clone="1141O19" /clone_lib="RPCI5" mRNA join(85..260,5676..9835) /partial /gene="KIAA0040" /note="match: cDNA D25539; match: ESTs N25843 AA348192 M85618 AA296438 AA359242 AA243958 R08958 AA443568 AA343958 N32205 H24925 H95372 T82224 AA359203 H95398 AA443770 AA359242 R48229 T84408 T04985 AA476824 AA465478 AA447496 T98862 T99456 T79953 AA447497 T87595 H02176 H45761 N57302 R48230 H95627 AA593111 AA044572 R08851 AA555171 T28636 AA044894 AA443567" /evidence=not_experimental gene 85..9835 /gene="KIAA0040" repeat_region 388..489 /note="MER5A repeat: matches 39..139 of consensus" repeat_region 1460..1600 /note="MIR repeat: matches 208..36 of consensus" repeat_region 3712..3913 /note="MIR repeat: matches 32..262 of consensus" repeat_region 5996..6034 /note="13 copies of 3 mer 95 % conserved" CDS 6192..6653 /gene="KIAA0040" /note="match: protein Q15053" /codon_start=1 /evidence=not_experimental /db_xref="PID:e1226493" /db_xref="PID:g2760554" /translation="MHYVHVHRVTTQPRNKPQTKCPSGGQSQGPRGQFLDTVLAAMCP IAMLLTADPGMPPTCLWHTPHAKHKEHLSIHLNMVPKCVHMHVTHTHTNSGSRYVGKY ILLIKWSLAMYFVQGSTLSTVTKMSHGKALPDSDTYIQFPNQQGPHTPSIP" misc_feature 6654..9835 /gene="KIAA0040" /note="match: STS G05565" repeat_region 9715..9742 /note="14 copies of 2 mer 89 % conserved" repeat_region 9759..9855 /note="MER3 repeat: matches 202..99 of consensus" repeat_region 9865..10162 /note="AluSx repeat: matches 302..1 of consensus" repeat_region 10166..10229 /note="MER33 repeat: matches 64..1 of consensus" repeat_region 11276..11455 /note="MIR repeat: matches 65..262 of consensus" repeat_region 12133..12255 /note="MIR2 repeat: matches 18..145 of consensus" repeat_region 12314..12406 /note="MER5A repeat: matches 104..2 of consensus" repeat_region 12635..12772 /note="3 copies of 46 mer 86 % conserved" repeat_region 12724..12858 /note="3 copies of 45 mer 93 % conserved" repeat_region 12816..12999 /note="4 copies of 46 mer 89 % conserved" repeat_region 12954..13088 /note="3 copies of 45 mer 90 % conserved" repeat_region 13043..13134 /note="2 copies of 46 mer 90 % conserved" repeat_region 13090..13224 /note="3 copies of 45 mer 90 % conserved" repeat_region 13180..13363 /note="4 copies of 46 mer 82 % conserved" repeat_region 13418..13474 /note="3 copies of 19 mer 90 % conserved" repeat_region 14755..14896 /note="AluSp repeat: matches 303..161 of consensus; incomplete repeat" repeat_region 16793..16859 /note="MER5A repeat: matches 80..146 of consensus" repeat_region 17054..17164 /note="FLAM_A repeat: matches 1..111 of consensus" repeat_region 17274..17388 /note="AluJo repeat: matches 135..251 of consensus; incomplete repeat" repeat_region 17600..17828 /note="AluJo repeat: matches 81..297 of consensus; incomplete repeat" repeat_region 18022..18159 /note="MIR2 repeat: matches 1..146 of consensus" repeat_region 18188..18297 /note="MIR2 repeat: matches 33..146 of consensus" gene complement(19814..49883) /gene="dJ1141O19.1" CDS complement(join(<19814..19891,22272..22435,29834..30001, 30881..30977,38076..38227,38658..38790,39737..39867, 43159..43422,47998..48261,49617..49883)) /gene="dJ1141O19.1" /note="match: proteins O00531 Q05546 P02751 P10039 Q64706 P04937 Q29116 Q90994 Q90824 Q15567 Q00546 Q14583 P07589 Q90484 Q91740 Q05707 P24821 Q60847 P13944 Q28275 P78530 Q02388 Q91008 Q60847 Q62704 P22105 Q99715 Q62308 Q91008 Q05708 P32018 P11722 Q91289" /codon_start=3 /product="Tenascin (Restrictin, Hexabrachion, Cytotactin Neuronectin, GMEM, J1-160/180, Miotendinous antigen, Glioma-associated-extracellular-matrix antigen, GP150-225) and Undulin 1 like protein" /db_xref="PID:e1226494" /db_xref="PID:g2760555" /translation="IDSPQNLVTDRVTENMATVSWDPVRATIDRYVVRYTSAKDGETR EVPVGKEQSSTVLTGLRPGVEYTVHVWAQKGAQESKKADTKAQTDIDSPQNLVTDWVT ENTATVSWDPVQATIDRYVVHYTSANGETREVPVGKEQSSTVLTGLRPGMEYTVHVWA QKGNQESKKADTKAQTEIDGPKNLVTDWVTENMATVSWDPVQATIDKYMVRYTSADGE TREVPVGKEHSSTVLTGLRPGMEYMVHVWAQKGAQESKKADTKAQTELDPPRNLRPSA VTQSGGILTWTPPSAQIHGYILTYQFPDGTVKEMQLGREDQRFALQGLEQGATYPVSL VAFKGGRRSRNVSTTLSTVGARFPHPSDCSQVQQNSNAASGLYTIYLHGDASRPLQVY CDMETDGGGWIVFQRRNTGQLDFFKRWRSYVEGFGDPMKEFWLGLDKLHNLTTGTPAR YEVRVDLQTANESAYAIYDFFQVASSKERYKLTVGKYRGTAGDALTYHNGWKFTTFDR DNDIALSNCALTHHGGWWYKNCHLANPNGRYGETKHSEGVNWEPWKGHEFSIPYVELK IRPHGY" mRNA complement(join(<19824..19891,22272..22435,29834..30001, 30881..30977,38076..38227,38658..38790,39737..39867, 43159..43422,47998..48261,49617..49883)) /gene="dJ1141O19.1" /note="match: cDNAs X78565 X56160 M55618 M23121 J04519 Z18630 D90343 X56304 X78565 X61599; match: ESTs AA558949 W77035 W68429 Z41686 AA442164" /product="Tenascin (Restrictin, Hexabrachion, Cytotactin Neuronectin, GMEM, J1-160/180, Miotendinous antigen, Glioma-associated-extracellular-matrix antigen, GP150-225) and Undulin 1 like protein" repeat_region 21250..21551 /note="AluJb repeat: matches 302..1 of consensus" repeat_region 21605..21958 /note="MLT1A2 repeat: matches 1..367 of consensus" repeat_region 23284..23317 /note="17 copies of 2 mer 91 % conserved" repeat_region 23571..23724 /note="MIR repeat: matches 207..51 of consensus" repeat_region 23834..23905 /note="MER5A repeat: matches 1..72 of consensus" repeat_region 23907..24206 /note="AluSg repeat: matches 1..300 of consensus" repeat_region 24209..24335 /note="MER5A repeat: matches 61..189 of consensus" repeat_region 26889..28088 /note="MER7B repeat: matches 1205..2 of consensus" repeat_region 29423..29458 /note="18 copies of 2 mer 89 % conserved" repeat_region 31464..31577 /note="MIR2 repeat: matches 141..23 of consensus" repeat_region 32258..32548 /note="AluSg repeat: matches 300..1 of consensus" repeat_region 34866..34903 /note="19 copies of 2 mer 100 % conserved" repeat_region 36035..36144 /note="MIR2 repeat: matches 33..142 of consensus" repeat_region 37097..37213 /note="MIR repeat: matches 59..179 of consensus" repeat_region 39456..39589 /note="MIR2 repeat: matches 146..1 of consensus" repeat_region 40423..40721 /note="AluSg repeat: matches 1..299 of consensus" repeat_region 42120..42423 /note="AluSq repeat: matches 303..1 of consensus" repeat_region 44418..44713 /note="AluJb repeat: matches 299..1 of consensus" repeat_region 44860..44912 /note="MIR2 repeat: matches 94..146 of consensus" repeat_region 45585..45739 /note="MER3 repeat: matches 6..166 of consensus" repeat_region 45740..46023 /note="AluJb repeat: matches 1..302 of consensus" repeat_region 46552..46654 /note="MIR2 repeat: matches 1..126 of consensus" repeat_region 46887..47187 /note="AluY repeat: matches 3..301 of consensus" repeat_region 47820..47865 /note="23 copies of 2 mer 83 % conserved" repeat_region 48520..48641 /note="MIR2 repeat: matches 145..22 of consensus" repeat_region 48741..48811 /note="MIR2 repeat: matches 71..142 of consensus" repeat_region 50159..50265 /note="MIR repeat: matches 34..141 of consensus" repeat_region 50563..50673 /note="MIR repeat: matches 106..203 of consensus" BASE COUNT 14634 a 12413 c 11250 g 13789 t ORIGIN 1 tgctgctccc atgtccccca acacagaact tgaccacaga actcatcaca ctatacttct 61 atttttattt ctcccccttg ctagactggg cggtgatcag gatctgaatg cacagggcgg 121 gtgttcagcg attgtttact acgttgaacg tgacctccag gaaagcagtt ctggccgaga 181 tcccctgaca acgcaaagca agaagtaacg tggaaggagg ctccccaagc tggctggcca 241 ttttgctgct gtgtgtggag gtaaacttca tcccccgtct cctcacccat gattcttccc 301 ctcatttagc acatggctcc tgtgctcata aatgaaggtc ggctgctcag cttgcttatt 361 caattattcc caaacctgac aattgtccag aaacactctg gctgcttgct acaaacataa 421 attcctgggc ccatcctaga cccgactgaa tcagagtctc caggagcagg ctgcaggaat 481 ctgtattttt aactccttaa atgactcttc gtagccaaca gtccatcctg ctgtcacctg 541 ggatcctggg cagaggagct aaagcaagca cagtgatcag cgcacccaag gtagaaagct 601 caatggagct gtttgagcaa aggcccaagg ggaggtcttt gctcccaact gtcagttata 661 aaatggcccc ccacacaaat ccaggagtca gcagtcgcag aagaggggtc tcttaaggtt 721 ctcatagaga actgaacctc ccacctcccc ctacaagaag ggtggtgatt tgtaccattt 781 ggaggtcact ctggagttgg tctcagggat tctaattggg tgccattgtg atccttatta 841 ttaacctttc cagaaagtat ttactgtctc acagacactg ttctagagct ttacatttat 901 tattttgtct aaaaaacaag aacaatgaga ggctgatgca gaatagaacg ccctgctccc 961 ttcttggaga gcaccggaag gaacagcagt gtcagggcca gccctggaca cagcctgcag 1021 tagcttttgg ctctcagaga ttgaagcaga cctccctgtg gaatgaccca cccctgtaac 1081 acttccaagg gtcccaaagc ctgggaggga ggctggggcc tgtagatggc tagctgctct 1141 cctccacctc acctatgaac tcagcaatcc tccaggcacc ttcagatgtt ccccaagcga 1201 ccgctctcca tctgatttct ggatgccaga gtctagagca acagccacag acctacagac 1261 caaccatatt agggagatta tttctgaaga ccttttcatt tcaatgcggt ggggacccat 1321 ggttctttgg caatagggat tggcttggag gtgaggaagg aggaagaaaa gcccagcact 1381 ggaattagtc cattgttatt tcaagataac tcactatttt atggattata tttgtgtaag 1441 tattttacag ttacagaatg cattaaatat attgaatcat tttgatcccc acaatgtcat 1501 tatgagggaa gatcattagc ttcattttgt agctgaggtc tggttaactt gccaaggtca 1561 tgcaggttga cccaaggctc aaaaccaggc cttctggctc tgggatcact gtttcccatc 1621 gtcacctcat acatgggtgg atagaagttt gcgacagaaa gagtgaatga atgattctgt 1681 ctccctagct agatactaca gaggatatgg aggtaaataa tatttggttt caaaaaaatt 1741 gaaagaaagt tttcccctct tctctgtcct ctcgtccacc cccacagcca caatcagaca 1801 gatgaggttg gttggcacct attgttggtc ttacaaaaat agatgaatta cttttcaagc 1861 attgtcagtg ctattcaaaa gctaacaaaa ggctagagtg tacttccagg ggccgaagtg 1921 tgtcttcttt tgcattccat ggaagttctc acttttaatt catgccacta aggagatatt 1981 tgattgtttt ctggaaatgc taataatgca agactcataa atactaatga aacactatag 2041 attagcagag ctcagcaagt aacctggggg gtgcctagaa aagaattctt agattaaaaa 2101 atataaagaa taccttttaa tcgaattttc cccagactgg cttcccaaat gttaatgaat 2161 actctttggc tgtatgggga gcagcctcaa ataagtccct ttaggaatcc agaagagaat 2221 tattgactta gctttagctt gaaaagaaat ctatttattc aaaggccatt ttaggtgcca 2281 agggatggat gagtttaaga caaaggttgg gaggggtggg agggccctag atgttgagat 2341 aaacatataa ttcaaatgca tgcttctact tcagccaact aagcaatcta gtttggaatt 2401 tgcttattgt acttggcttt tatgacttga tctatcaata gtaataactc ccctggaatg 2461 gacgaggctt cccgatgtca gaaaccatat ctttgtcatt cactattgaa ttctcaacac 2521 ccaacagtgc atgaaacata ttaggtgctc agtgaatgtt tgataaacgg ataaggagat 2581 ttaaagaact ttcttaacac attttccaga gttcttcaga tcttgttatc ttttcctgaa 2641 aggcagctct gatcacaaca ttctcctgtt gtaaagcctc caaataaacg caaaccctat 2701 gacggcccct ctaccatggc catgcctacc tcctagtgtc ccttctcatc ctctcttctc 2761 ccaccctgag ctccaggctc accagaagac cccctgttct cctgctcatg ttctagaagt 2821 ttctctgcct ggaatgtttg aacatccttt tcacctatat ttctctgtca aaatactacg 2881 catcttcaag gtcgtccccc aggtggctcc accttcatga aaactgacct gatctccatg 2941 agctcgccct ccttccttgc tggctgaacc cagaggggcc cctgttaccc atctgctccc 3001 tgtaggaaca gcataacccc tggcacaaaa tagttggtga acatttgtgg tggtggtgtt 3061 atctaacgtg aatcaggaaa gctgtgagta gaatcaaatt aaagaaaagc acccaaagcc 3121 tttccacaac agagcagaaa gatcccaaca gacaggggca cagacccact caagactgag 3181 gtaaaagcaa aggtcattgc agagggagaa ggaaaatcca gaagagagat gagtggcctg 3241 agatacagca tcaccggggc ttagaagcat gccaaaggct gcttttctct acagagagcc 3301 tggcagcttg atggtgtgtg ggcagatcca tggcatgggc tgttcccagg cctcagaggg 3361 gcttgagcag agccttggat gagaggacat gactcctcag gatacttttc tttgttgcat 3421 gttgagttta agttcaggca gccatatgct ttctataaac ctgcacttgg gcagaacttt 3481 gtttttattt tatgtatctt catgcatttg ggaaggatct aacaatgaga aataaatctc 3541 tttgcagtct ttcctacaat agactgggat aagcatttaa gttaatctgg ggaatagcca 3601 aatattgtaa gattctgtaa atcctgggct gcttgccaaa aatgggttaa ttccacacaa 3661 aggaatagga ggccacagag caacgcctgt gaacagatca ctccttgcgg atctggagcc 3721 aaactctcca gaatgagtga gaccggctct ggcactcact caccgtgtgg cctgagcctc 3781 catctcttca gctgcaggat gggggaacca cagtacctac ttcataaggg tgtcatgaag 3841 atgaatgagt taatagagtg cctagaaaag tgcctggtat ataattactg cttactaaaa 3901 gttggctaca attattattg tcatcctata acatttcttt gggctccatg gaccaaaagg 3961 ctcagtagta ggtgccaatt tgtacgcttg caaaatctat gtaaataatg ccatgagagg 4021 gtgggattac atgccagcag actaggactg aggacagctg gggcttagat aggcacatgg 4081 gcactgggat catgagacat tgatagagca tgctatccaa ggaattagac ttctgaggac 4141 aattgctggg gcatcatgta gaaaacatca cccaggaggg ctgcacatgt atgcaatgaa 4201 gcctgagctg atcttgaatt ccaggcccag gtccccagtg ttcccatccg tagacacagc 4261 tcactcaaaa gattaaatgc agatgaaaat gattagaaaa gtatgaagtg aaatgcaagt 4321 ataaagtgac atcattgtca tccttagttc ctctgcttgc ctggcagcct gcggagagtg 4381 ctgaacccat tgtccagagg tctgggtggg tctgggtggg cctgtgtggc ctgggtgggt 4441 ctctatggcc tcagacatgt cacctcactg ctctggaccc cagattcttt atccttaagt 4501 gagagagttc attgggatga tctctggagt gtccatcctg aaaatcaacc actccttaat 4561 atttgggtcc tggagaatgt cttacaccaa caggcctctt aaatttcctt gaatcatgcc 4621 tgtccttgaa gtcttgtttg tacacaagaa cctgtaattc cagaatggaa ggcagcctct 4681 aagaccttat acttgagagc tgagctttaa gatatatgtg gaggggtgac cagggaagag 4741 gggcagctct gtacaagcaa ccaggagaat ggccccagct gtatgctgag ttcagaacag 4801 gaacagggag gcacccagga acctcactct ggtttccaag caccatcact aagcagcctg 4861 caaggactcc tctcccatag agaagaaaga gagcaggaga ctttggagta tttggtcaag 4921 gaatagaaaa catagagaag ggtaatgagg aagaggaagg tagaaggtag aaagtggaaa 4981 gccagcaaat ggtcacccct ggttctctca gccttgggag gagaggactt ctcaggcaca 5041 ggaatgtgat ataaggaggg gaggagacat ggagcaggga agagatcatg aggggacagc 5101 acggaaagca ggcatggggc aggatgagga ctgatgacct cttagcccag aggagaccca 5161 ggttccctac tacagacagc aagaagtaga catctcactt tgtgcttaag cttcaataag 5221 ctttgaaaat ggcaagttat gttaataaaa ggttacatac agacctgtcc tgtggggagt 5281 gtgtgttcaa catctgtcag gtgagctaca gtggaaaaaa gattgagaac tgctgttgca 5341 gaagggtcca gggaagcaac cccttatcag aacactgaac acacacattg attcatttat 5401 ccaacaacta ctgagaactt cctacgtgcc agacctgata tactattcat ttttatatcc 5461 taagcccaat ataatgtgtg tacatagcaa gaattagttg atgcttattg attacatgca 5521 tgcatttatc ttgtttattg agtcaataca agtttactga accctggaaa tcttcaattc 5581 cttccccttt cctaagccta aaatctatga cactattaaa attgtagacc agtccccctg 5641 catttctctt cagttgtctg ttttaatctc ttcaggtgct gccagtggca tgcccaaacc 5701 caaagctgga agaggaataa attacaagtg gtcaaggttg catccttttg agcccaggac 5761 ctgcttgtaa gccgagaggg ttctctggcc ctaatctagc caagcaccat ggagagaatc 5821 agtgccttct tcagctctat ctgggacacc atcttgacca aacaccaaga aggcatctac 5881 aacaccatct gcctgggagt cctcctgggc ctgccactct tggtgatcat cacactcctc 5941 ttcatctgtt gccattgctg ctggagccca ccaggcaaga ggggccagca gccagagaag 6001 aacaagaaga agaagaagaa gaagaagaag aaggatgaag aagacctctg gatctctgct 6061 caacccaagc ttctccagat ggagaagaga ccatcactgc ctgtttagtt aggcaggaag 6121 cagaggtgtt tcctttctgg ggctaagcct ccttctgacc acacacagac atttcaggaa 6181 cccctgaaat aatgcactat gtccatgtcc acagagtaac tactcaacca aggaacaaac 6241 ctcagactaa gtgtcccagt ggagggcagt cccagggacc acgtggacaa ttcttggata 6301 ctgtcttggc agctatgtgt ccaatagcaa tgctccttac tgcagaccca ggcatgcctc 6361 ccacctgtct ctggcatacc ccacatgcaa agcacaaaga acatttatcc atacatctca 6421 atatggttcc caagtgtgtg cacatgcacg taacacacac acacacaaat tcaggtagca 6481 ggtacgtggg caagtatatt ctgctcatca aatggtcatt ggctatgtac tttgtgcagg 6541 gaagtacatt atctacagtc acaaaaatgt ctcatgggaa agccttgcca gattcagaca 6601 catatataca atttcctaac cagcaaggcc cccatacacc atctattcca taaaccactc 6661 aggttacaga tgcatgcttt cctatttcta actctacaca taaactttta ctggaagtac 6721 tcataattgg acattccagc aacctgctac agtccccacc cttgtgtgtc ttgatacaga 6781 cacaccaagt ttctgtgcct ctgacccctc acctgtgcca agatgtttaa agtgtgatgg 6841 ttcaaaattc attgaaagct cttttcttgt aactcatgac aaagtccgtc ctcattgcca 6901 ctgagaggtg tttaatgtga tccaagacct ctctgtgaaa cattaccccc gcaaaccact 6961 cagcaaagtg cctttctcca agcaagaaca aagagctctt ggtggtgact gctagaaaat 7021 tatggaagcc cactcattta tgtcagtgga ctgcaactgt gtacctgtgc aatgtttaca 7081 gatggaaagg gtgaggagat gctacacctg agctaggtat ctcctatata accaaagttt 7141 ccagcaggga aggaactaga caatcatcag tgcagtctca cagaaggcaa cactggaagt 7201 gatgtcataa ggttgtgatg tgtgcacggt atggcacagg tgggatgcag aggtaacaga 7261 gtttaaatga aagtaggatg aagctataaa gaggtttatt tatatttata ttgaagctca 7321 ggcaagtgcc ttgcacacag taggtactta taactaactg tggttactgt tggatatgtg 7381 atgttgttaa gggtaagctt gtaatacctc accagttctc cccgagtgat cttctcttct 7441 aagtgagccc actaattgct gcaatggatg aaattgggtg tttaatgctg gagagcacat 7501 gtaggtgaca catgtgcctt gaggtatgtg aggacatgta aattagatcc acagtgagct 7561 gaggagggct ttccccgcca gagtgaggtt gggaagcaga gttaatccac ttataggatg 7621 aactgcttgg tatttttatt gtattgtgac tgtattacaa agatggacaa ttcactcctt 7681 gggagcaagt tatgctctag aagtttattt acaaatatgc tgggcagctc tcttgaaata 7741 ttttcccaag gaagctattc tacacagtgg caaaattgct atctaattaa taatgtagct 7801 aaactatgat atttatagta gcaaaaaact aaattctata agattgcatt aaaggaaaga 7861 tatattctat ttgctcactt gggctgcttg gtactcacct gccctccagg tgtactttag 7921 gcctgtggag ggtgggcatt tagtggtgac ccttgcacca gggttttcta acagatgacc 7981 ctgtgaatca taatttaaac ctgcatatat tttatagcca gtcacatttg ccctctcacc 8041 ctatatggcc ataaactgcc taagcactca ggcctcccac tcatcaaccc ctttgaccag 8101 agaaagaagc actctggttc tctatcccct tgtcacatag agagtttgtc atggggcctc 8161 tggctgtgcc cttcacataa cagaatgact tgccatctgc ctgcaccaaa cccagggatg 8221 tggaagacat ctccccacaa ctgccactgc tcaccaggac aagctgccct tcctgtctcc 8281 acctctcagt ccccctagaa tggatggctg gggagaggtg gaggctgaca gctgagacgt 8341 agtgtcagat atgatctagg agggcggatc accgggatcc gggaccatac aagtaacatg 8401 gtttccatgg caactgcttg ctcctttgaa ttaagacagc agtcagttgt cattgccatg 8461 acaaggcctc tatctccagg cacaatgtcc ctgctgtctc ctaatccaat ggacttgctc 8521 tcaccccagg gatgaaacac ccagaaactc acttctcagt cacttccaca gccgatgact 8581 cagaagagcc aaacccagaa tggggcctct cttttcccca tcacagactc ccctgacaac 8641 ctttcctggc gtaactagag gagtcccagt gcaggatagg ccctaaacgt tttgttaaat 8701 aaacaggtgc atgaaaggag cctaaggcca ttgttgatat ccactctctt ctttccactt 8761 ccttctcatc tttttctcca tgttttatgc ttctctgatt ccctcttctg cctgcaccag 8821 accagcccca gccctttatt cctctccatt ttcactcctt ccagcctctg tccctgaact 8881 gccactggca acccatggga cctcaggacc agagactgct tgactcatct ggggagggta 8941 agttcacggg ggacaaaaaa atgattccta aagaagaggc ttcctagacc agcacaggct 9001 cgagaaagac atcccctagg cctggacttc tgagcagctt tagccaggct ccggacggca 9061 gccagaggag gcctttcccc attgctcctt tccccattgc tcaatggatt ccatgtttct 9121 ttttcttggg gggagcaggg agggagaaag gtagaaaaat ggcagccacc tttccaagaa 9181 aaatataaag ggtccaagct gtatagtatt tgtcagtatt tttttctgta aaattcaaac 9241 acacacaaaa gaaaaattta tttaaataaa atactttgaa aatgaaaagt cttgatgtag 9301 tcagatggtt actctcttaa cattaggtat tacccccact cagacatcac tcagaaatga 9361 tcaatgcagg gactctttct gtgacacaaa tgtcccagcc ctccctggtc accgccttcg 9421 ccatggtaga gtcataggtc tgaggatgag gaatgtggct gtctcaccct tgcttgcaaa 9481 acagatggcc ttggagacca gactccctca aaggtgccag ctacaggaaa aatatactga 9541 tgttccttgg caacacttac agaactttcc atcaatgagg tccatcaatg gcttcttaaa 9601 ggaaaagggg ggaaatagca aaaacctaag gaagaatgga cctttgagtt aaatccagtg 9661 tttgttggga aaggagggat caaaaacctc tatagtagcc actagggcaa aaactgtgtg 9721 tatgtgtgtg tgtaagtgtg tgtacactgt tcaatatggt tcaatatggt accaatagcc 9781 acatgtgact atttaaattc attgcaatga aataaaatta aaggtatact agctcagcta 9841 tgtctgccat atttcttttt tctttttttt tttttttttt ttttgagatg gagtcttgct 9901 ctgtcatcca ggctggaata cagtggtatg atcttggctc actgcaactt acacctcccg 9961 agttcaagta attctcatgc ttcagccacc tgagtagctg ggatgacagg catgtgccac 10021 catgcccagc taattttttt tatattttag taaagatggg gtttcaccat actggccagg 10081 ctggtcttga actcctggcc tcaagtgacc cgccagcttc ccaaagtgct gggattatag 10141 gcatgggcaa ctgcgcttgg ccatgccagc cacgattcaa gtgctcagta gctacacgtg 10201 gctagcggct gccatcctgg acagcatagg tctatgatct tgggggcagg gtcagacacc 10261 tgggaaaaac agccatgctg actggatttc aaggagggct taagaaggaa ggcctagagc 10321 acaggtggac tctttgcctt atctcgccca aggcaggtta tgtctgtagt tcagatgagt 10381 tcattcctgg gctcccttcc ccatcaccac acactcatga caaccagaga acagagactc 10441 ttcattcagc tccttgttca tccacccagg acattgctaa acattctggt gcagaggagg 10501 aactgatgct cagaaaccat ctgggagggg agcagctgct gcacccctga ggggatgcca 10561 gtgctcaatg cctgcagccg cacctgcccc ggctccatcc tggccttggg gtataatatg 10621 gatttggaca atgaatgcca tcacgtcctt agcattctca gtttatcaca gcactctgca 10681 ctggcccctt gcccgttgac cttagtagta tttcccaatt caaagtcaca ggtttactgg 10741 actggaaaaa aatcccttca acccgctttc tgtcttagac tcaaagggtt gttttcaaac 10801 acctgtgtgg gcaatgtcac actatatttc ttcaggaatg gctgtgactg tgtacctcca 10861 gccctctttt gaggctgttc atcccattaa catttctgac tgtttactat atgccaagca 10921 cgctaggtgc cagtgatact aagttgtcct agaggggctc agaacctggt aggggggcac 10981 agacaagcaa accatgacaa gacagtgtgg ccattgcttc cttagaggga agctcccagg 11041 ctggcttcag gacaggccgg gggagatcgc ctaggaggtg tctagagact tgcacaggaa 11101 gaaactgaga aaggagggcc ccagctgcac agtctacaaa ggcaagtgca ccctggagta 11161 gtgatgcagg cagacaggat gatggggacg attcagaaac catcagggct ctctggaagc 11221 cataagtgca gtgagctagg actcatgact agggttgtcc caaccctgat gctaactccg 11281 actctcacta gttatgtaat ttggacccat ggcttaatct ctctgtaaga cttacctgct 11341 aaataaagct aatgagagta gctacctcat agggagcatt aaatgatcgt gtgtacaaag 11401 catttagcac aggacgtggc ttatattaag tgcccaataa atgttactta ttattattaa 11461 gggttgaagg aaaaccgtga gagagctgtg tcctaggaac caaaggaaaa gaaatttcca 11521 agaagaaagt gatggtccat agagtcaaat gatgcaggca aatcgaatga gatagggatt 11581 gaaaataatt agcattttct attccccatg gtgtgatgtg ttccctaatt tcaccacata 11641 ctgcactatc tacaggatta ccaatactcc atggattaag gtctgcagat ttgcagatta 11701 ggcaaggctg cctaattatc tgtcccgcct ctgcccacaa cgaagtcgcc ccagcttgtc 11761 agtgccacct ggtggccaga tcaggagaaa gcgtgagtca acatcttttt caccttagaa 11821 gtgtttcctg gagaacatcc atctagctag ttcacctgtt ctcaaattaa atgtgcaaaa 11881 gagtcaccgg gttttatgtg aaattcacaa atctgtcagg gtccggaatg ccgtgctcat 11941 gtgacatatg ggggatttcc ataaatcata ggtctcaact gttgttacaa acatattttg 12001 gtgaagtgtt ctatatatag cctcactaca gtaaagcttc cctaaccaca caagtcttgc 12061 ccattgcagt aagactagac cacattgcaa ttgtgtgttg cttctattcc cagcacccca 12121 cccctatttt gacccccatt agaattagct ttatgagatt agaattagaa ctgtcttatt 12181 cactagtata tcttcaacac ctagcatgga gcctggcaca tactatatgc taaataaatg 12241 tgttgattgg gggaaataaa cactgaggtg tttcttaaaa atgcaaattc cctgtctctc 12301 cctgcccttc tggctgatta ttaagtctag ggtgatgcca gacaacctgc tctaataaac 12361 accaggtcaa tgcaggtggt ccctgaacca caccttgaga aatactatca aaacaacaaa 12421 agaactgtgt acggtcaaat aagtttgcaa acactgcata gtataacttt cccacttcac 12481 ccccttggga tctgcaacgc atttatcaca ttaaagtttc tagcaggtgc tgtaataaag 12541 aatttggttt aactcccctt aaaattttcc caaatttatt taaccacaag cctctttttc 12601 ttttagcaca tctggaggaa caaagattca ggggccacca tttgggaaat gctgccatgg 12661 agggctagta cgcactccac acatcctttg ctcctgactt ccatggaggg ctggtgcgca 12721 ctccacgcat cctctgctcc cgtcttccat gaagggctag tgcgcactcc ggcgtcctct 12781 gcgcctgtct tccatggagg gctagtgcgc actgcggcgt cctctgctcc cgacttccaa 12841 ggagggctag tgcgcactcc acgcgtcctc tgctcccatc ttctatggag ggctagtgca 12901 cactcctggc attctctcct cccatctacc atggagggct agtacacact ccaggcattc 12961 tctgctcctg tctaccacgg aggactggtg cgcactccac cgtcctctgc tcccgtcttc 13021 catggagggc tagtgcgcac ttcgacgtcc tctgctcctg tcttctatgg agggctagtg 13081 cgccctccag gcattctctg ctcctgactt ccatggactg ctagtgcgca ctccgatgtc 13141 ctctgctccc gtcttccacg gagggctagt gcgcactccg gcgtcctctg ctcctgtctt 13201 ctatggaagg atagtgcgca ttccaggcat tctctgctcc tgacttccat ggaggactag 13261 tgcacgctcc acgcgtcccc tgctcccgtc ttccatagag agctagtgcg cactccacgc 13321 ttcctctgct cctgacttcc actacagact cagtcaatgt catctccaga ggaaactgtt 13381 tctgcagtgt atacagattc tcactaacct ttatgaacct ttctctttta ctcttccctt 13441 tctcttttac tcttcccttt ctctttcttc catttgcaac ccataataat cctgttggtg 13501 gctattcata gcaaatatgt tttagagctg ctaaagagac atctctatca aaagaaaaaa 13561 aagaaaaaaa gaaaaagttt taaatgaaaa tatgaagact gaagatccaa gcccactatt 13621 ttaccattcc agagtgcctt cgtacataaa aaccactttc attatttcac ttgattctca 13681 ctttataaat gaagaaaatg aggcccagag aaggtaagcc agtttaccca aaactggaat 13741 agaaacccaa gtcttcagac tcctacccca gtgttccttc tgcacaccag gcatcgagat 13801 ccaggccaat ctgatcttgg ctcctcaaag actgttgaac ttgtagacgc ccctcccttg 13861 agttggaaac ttacagaaca caatcaaggg tcttcccata agcctttaaa gtgcccaaat 13921 gcctccttta tatgtcccac agcaagcctc atctgcctcc ttctccttct ctaattcaac 13981 ctaattttct tcacttacca aaataaagaa ctgtttccat tccattttgt ttctttcttt 14041 tatttaaatt acagtaccgc aatactgttg tttcctattt ttcattatac atacactgca 14101 actcatctat attgtagagt aaacagagtt ttatgggcta acctgagaac ctaccttctt 14161 tatataacat tataattgga actcaaataa tttcacccac agaaacagcc aactgacaga 14221 aggaattgga gaagtgaatt catttgcaaa ttaggagctg cctgaaatct ctcccagggg 14281 tgcccccctt cacctttcat caggctgaca acctctatcc gcctttcctc cccagcagcc 14341 cagtgcgccg ggtctgggga tagagtccaa tctcaccaag caatctcctc aaggttgttc 14401 ccagttattt tttgcgcatt tgtccagcaa gaacgctgcc taagaaacca ccaagtgcag 14461 gcccaatgtt aagctgggcc tcaccgactg actctccaga tagcatcacc caggaaaaga 14521 gaaaagtggg gacctagccc ccagaactcc cctttataga agagagaagt ggcgctgaaa 14581 tgagaggctc ccgacaggac aacactgcca tctcctggcc aggctgaata tagcaaacct 14641 cttgcactaa gctgcggtcc caaccgtatt gatcagctcc gtattgacat gaactccctc 14701 ctgatcccct ccaccaccct agcgaagcgc ttgaggtaat gacttggaat ttcttttttt 14761 tttttttttt ttttgagacg gagcttcgct cttgttgccc aggctggagt gcaatggctc 14821 aatctcggct cactgcaacc tctgcctcct gggttcaagc attctcctgc ctcagcctcc 14881 ggcgtagctg ggattagtgg agggcctttt ccatcaactg gctaaatttc accaagagtt 14941 tctcgctgtt atacttgttt ttccttctgc catgccatcc cttcatctgc atgtcccctg 15001 tccacatctc ctcacacacc acacctggcc tttttccttc ctcaatcaca atttctggag 15061 gcattttcta tctttctcct cagctccttc agcatttcct ggctttaatc atctttaaat 15121 ccctgagaat tcaaatttta aacgcagagc ttttgatctc atttgacaaa cataaactct 15181 gggaccctat tcccagagga agccactgtc aggaacaacc agctctaagc tctgacatga 15241 aacgacccat ctcttgcctg taccacccat cctcctcaaa gcccaccatt ccttcttcct 15301 ggatatctca ctaatctaga ccctcctccc cttctccacc atttctgcca caggccaagg 15361 aagttatgac cgcttgcctg aactatcaca atagcactca acttgcctgt gagtccctat 15421 ccttcctcca aaacttccct caacaccccc agccctgttg tctatcctat caccagactg 15481 atatttccga aatatacatc tgaccttgtc actctcaggt ttaaaaacct gtgtgttccc 15541 cactttccac atgctttcat ctgagcagtg gctctcaagc gtggtctatg ttggaatcct 15601 tgggggcact ttaggaagct caggtgcaca gaactcacct gatacccact ggagcagagc 15661 ccgttgaggg gagtggagct tctgtaccca tggaggcccc ctgccgccgc cgacaggtag 15721 tgagcgctga gaacctcagt actggcctgg caaaggaaga atccccactc atctcaccag 15781 cctggctccc caccaagctc tcctcacccc agtaagaaac cactttgccc ccctcagaca 15841 tgccacactc cacactttct cttccttgct gccaagtctc ctgtacccga tgggccccac 15901 ccctcttggg cctctttttt ctccccccag aactcatcct tccagactta aagccgacat 15961 cacatccccg tgaagctctc tcccttggca cccctggcag gatgagtcac cctgactctg 16021 cccacagtgt tcagtccacc tccctgtgga ctcactctcc ccagtttgtt tccatggctg 16081 cctctgccac cactccaaga gctgctggaa ggccacattt gtgttctgtt tatcactgtc 16141 ttgctcatgt cctgcataaa aggcaacact ttggaaactt taccaagact tctgtctttg 16201 catctaccct agtgcctggc ttgcagcctg tgccttctat gagagctttt gtttgcttta 16261 gtttgttcta aagggtagaa atcgtccctg taagcttcaa aaccttgatg gctcataacc 16321 tggttctaag aaaattgact ctctctaaaa agcaggtcta tccagaaacc cattccacgg 16381 gcccatattc agctaccaga ttggaagttg tgagcactgg tcacgttaaa atgaggttag 16441 gcatggaagt ttcaagttgg aatgttttcc tcagtcagac agaggccctt tgaaaatgag 16501 tttctctcaa cttccttcct ggtccccagt agaacctttg aagggtcagc tgaggttttc 16561 ttcacccgct cctttgtgca ccatgggtcc tacaggcaaa ataatgtaag aaatagctcc 16621 tgtttcccaa aagtcctcct cttgcccagt ggtggagaaa ccctggggtc aatgaacaca 16681 ggggtgtgtg ttttgtaatt tgctaccatc acaggctcct ccagagcaaa aagaccaccg 16741 gcggctcccc cacaccacag ctactcagag tggacaccac tccagcagca gccctcggcc 16801 cagacctact taactagagc gtctggggaa gggaccccgt gatctctgtg ttaacaagca 16861 tgttcaaagt tgagaagcat tgccaacaat caccatcctt ggctcgaagt caatttctgg 16921 gatttaatcc attttgcaaa ggaggcactt ttctaaaaac cctggatcac tgtgatgagc 16981 ccaaagcttc ataattacga agctgagtac cccacatgca aacggggtca cggagcagca 17041 gaagggaatt ccagccaggt gcggcggcta gcacccgtag tcccatctac ttcccgagtc 17101 aaggctggag gatcatttga agccaggagt tcaagaccag cctgagcaat atagcaagag 17161 cctgctatat atctatagat atagacatag atatagatat agatgatata catatagata 17221 cagatataga tatagatgat atagatatag ataaagatat agatatagat gtatagttgg 17281 gcatggtggc actcgcctgc agtcccagct actcaggagg ctgagacggg aggattgctg 17341 agtccaggaa ttcaaggcta caatgagctg tggtcacaca ttgcactcgc ctttttttat 17401 tccctatatt cttcctcaca acaccagaaa ttaaaaggga gagatttaca ctgttttcct 17461 cacacagcct gggtgacaat aagactccat ttctattaaa taaaacaatt aaaaaactac 17521 aaaaatgttt ttaaaagaag agaatttgag gagaagactt ggaactggga agttgtgcag 17581 acagaccttt aagacagatg agaccagcct gggtaagata gcaagactcc atctctacaa 17641 aaaagtaaaa aattagctgg gcacagtagc atgtgcctgt agtcccagct actcaggagg 17701 ctgaggcgga aggattgctg gagcccagga gttcaaggct ggagttcaag tctgcagtga 17761 gccttgatca tgtgaatata cagcagcctg caagacagag agagacccca ttcaaaaaaa 17821 aaaaaaaagg cagacaaaga aattagctca ggctcaggcc ctgataaagt tagatcaaga 17881 ctaggagaaa tttggtcaat ttctgtaaac atctcctcct cagagaggct tcatagtcca 17941 acaccatgca acaccttcat cttctcatcc taccttttct tccacggcat ttcccacttc 18001 ttgtaaccac acatttattt gttgcttaat gtcagcctcc accatgagaa tattagcaca 18061 aggacaggaa tatgatgtta tgtccttgat taacaccaat gcttagaatg gaacttggca 18121 cataataggg cttcatctct attcgttgaa tgagtgaatg taaacgaatg aatgaatgac 18181 agcacacaag ctccatcaat acaaggatag tgtctatctt gctcaccttt gtttccccag 18241 tgcccagcac actgcttgat acaaagcaaa caacaaactc ttgttgaatc aattaataac 18301 atgcaagcac attttttcat cgtcagcatc aacactaaaa ctgtatttca ctgagaccac 18361 tcattatcag agcgagcagt taccatggca ttgctcaatg tgtgtgtctg gggaaggaag 18421 gcaccagagg gggatgctga agactttaaa ggttgacaga aacggtttca ggggtatggt 18481 ttgacattca cccagaggtt aaaacctgga gacatctagg caatagacta caccctcgtc 18541 ctggaggggt acggcagctt ttggagcatt cgccttactt cccattcact cacttctgca 18601 ttctaaatac cattatttcc catcatgcaa atcagagtcc acagccaacc aaattccatt 18661 acaatgcacg aattgtacct tcacttgcat gtaggttaaa tttagaattc aagtgcactt 18721 acgtggagga aaataacaca tggcaatttt tttttaaatc aagtggttta ttattttttt 18781 tcttcaatag gaatttaagt tacttcattg gtcactgatt ttgctacaga aaaccctcca 18841 aagtgataga aaataagcaa aaaggatctg atcttgctaa ctgtccgatc aaacagtaag 18901 actcacggtg ctgagcactc ccacaaaacc cagggctcca atagtgcagg gcggggaaaa 18961 gagggagtct gaggcctggt gggcatgaat gggagttgca gaagagtctt tgggaggaac 19021 caattatcaa gacagtcaaa ttattcaaag ttgaaatgag tgagaattta ttcacaacat 19081 atgcagctct atggacgcaa ttatgcaaat gactgaaaag agggcactta aaaatagcag 19141 taaggcggca gctttggtct ccagcatgct ggatacaagt cagaacttag ctaaacatga 19201 gtccatctta gcccctttta tcccctttat tcttcctcat aacaccagaa attaaaaggg 19261 ggaggttcac actgttttcc taaaaaccca atgggaaagg aggtgacaaa cgtccggttg 19321 tctcagccct aacccctggg tcatcagaac tcctcctctg tgcttctttc cagagaaaag 19381 atttgcatta gcagaacttt aaatacatag atgtaaattt ccagagattt ccaactggtg 19441 aagtgctggg tgattccttc tcttgcccta atccagtata tatgttgctg aagataagct 19501 gagaaacctt gccgttccag tactgtcttt cctacacata ctgtgcaact actattttga 19561 ttttcaagac cctcataaag acgggcaaaa atgcaggtga gagggaagga acctccatgg 19621 cttgaagctt ggtgacatga tttgttctgc gggctatctg agcactccca gaccgcagtg 19681 accactaccc accccgcccc aagctgccac agctggtggt gtctcctgcg aggactgctc 19741 acacgggcca tcagaacgtt cgcagccttc ctctcagcgt ccgcttcttt ctgcccagga 19801 caggctccct gctgtagcca tgagggcgga ttttcaactc cacgtaagga atggagaatt 19861 catgtccttt ccaaggctcc cagttcaccc cctaccaaaa aaaaaaaaaa agccacatca 19921 agaagaggct cttaggagga gggcagggtg ggttatggta caaaggaatt aattaattaa 19981 ttaattacca gtgaaaccac tccctctctg ctcaggagag agaaagagac ggggtgatga 20041 gaagggaggg gagggcagga tccggacaca cgcgccctct gctgggcagc tggggttaga 20101 cgggctgaga ggacttggac gatttggaag ctggcggtct ctatggtggg agcaaggggg 20161 tgtgagcccg ggagctgggt gaggccagag cagccacaga gacaacacag gggtcaaccg 20221 ggccggcaat gggcaaatta tctttccttc actcttttcc tttccaaaaa attaaaggca 20281 aaaataaaca tattaattgg ccagttgtgt aaaatcattt tcaatttaga ctgtttagat 20341 attagctctg gggtaacaaa tagcctgctg aaaactaaat ttcattttat ttcttttgac 20401 tggtgagcgc aataatattt aattcactta ataaccctgc aacctggcca ataaaaagcc 20461 caagtgacca attttatgta ttttaatttg tgaggtaaat aataaacaag aagctaaatg 20521 ccttgcagac tgtgaaaagc gcattaaaca gatgaatgtc ttttattagc aaatatggga 20581 tggttcaggg catcctggct gctcgtttcc aatatttgat ctcctatccc tggcttccct 20641 gagtcagccc acacgctgcc accatgattc atgggcccac cctccccagc ttccctgggg 20701 gcaaatttct tcttcccttt gaaaaatcac ataaagattg ggagtcccga agactggtca 20761 gggccaaagt aatcaaaatg tacctggact tcggatgtgc taatatttac agtaatattg 20821 gtacaaggtg agttctgaag aggtggcttc ctggaggctt aggatcaggg ctgcagatgg 20881 gaaattagaa acctaggatg gggactcagc gtttgtggag gggagaggat taatgacgag 20941 cagagacaaa gatgcaattt tcaagaaact gtgagggagc aggtgggaag tgccccgcag 21001 ctgtcaaaga gtcaggatga tgcactgggg cagaggaggc ttgagaaata gcgggtggga 21061 cagtttcagg gcaatgtggg aagaggagcc ctcctggacc cctgggccct gggcgcacac 21121 tcccaatgtg gagagactgg tctagctcaa tgttctggga aggttttggg tgaagtggat 21181 ccaagagtag gtctgaccta aggcacaaaa gactgttttc ctggtgatca aaggacttga 21241 gacagctttt tttttttttt tttttttttg agacagggtc ttgctctgcc acccaggctg 21301 gagtgcagtg gcttgatcat ggctcactgc agtctcaacc tcccaggctc aagtgatctt 21361 cctgcctcag cttccctaga ggcagggact acaagtgcac gccataatgc ccagctaatt 21421 tttgtatctt ttgtagagac agggtttcac gatgttaccc aggctggtct caaattcctg 21481 ggcacgagca atcagctcac ctcagcctcc cagagtgctg ggattacagg catgaaccac 21541 catgctcagc ctaaagaaat agctcttgtt caactaggag aacaactaga aagtaggcag 21601 tgtttgttat aaactgaatg tttgtgtctc ttcaaaattc atgtagtaaa gccctaacca 21661 ccccccacca acatgatggt atgtagaggg ggcctctgga aggtaattag gtttacatga 21721 gaccatgagg tggagccctc atgatgggac atatgcccct atatgaagaa gagatcagag 21781 ctcctttctg tccgccatgt gaggacgcac agccacaagc cagcaaggtc caccaccaag 21841 aaccaaatct gccagcagtc tgatcttgga cttcccagcc tccagaactg tgagaaataa 21901 atgtctgttg tttgagccac ctggtctgtg ttattttgtt acagcagcta gagctgacag 21961 tgtgtgaact agtaaaatga gacagcgtgc agagcacatc atctgtcatt acatggcagt 22021 gggcagccca agcttttccc gcaccttccc ctaggcaagt gtgcagctct gggaatgcaa 22081 aacaagcagg gccatggcca tcaggaaggg aggccacgac ctcaaggcca tggagcctgg 22141 aggagaagct ctactgcttt gactggaagc ttcagaagga gaggggctgc cagactggcc 22201 tggagggtgt ccatttggaa gctgaatatg gacataccct gggacagaaa agacattcac 22261 ctgtcaccta cctcactgtg cttggtctcc ccatatctgc cattagggtt ggccaagtgg 22321 cagttcttat accaccagcc accatgatgt gtcagggcac agttgctgag tgcgatatca 22381 ttgtctctgt caaaagttgt aaacttccat ccattgtggt aagtaagagc atcccctgca 22441 ggagagaaga gacggaggcg agagcccagg gttagccacc gaggagatga tcaaaagagg 22501 acccagcagg aagatctcta ggcatgaaaa tgccaaagat gcagccagtc atgtcacagg 22561 tagtcaccgc tctctcaaaa cagcccaaat gggaacatga cacatgggaa caagctgtct 22621 ttcctggcct gtaccccaac ttcaggcctg tttctccaac agttctcata gcattttcac 22681 ttggaagctc cacagtcaga tcaaatatcc aaacataaca actctcccca agtccctgac 22741 cacaccacac acaagagcac cactctccac tctacccact ctgccttggg tccacccttc 22801 tccacaccac ccagacctgg aacctggagc catgtttgga aatcccttca tacctcttct 22861 caagccctcc ctctgcaggt cactcacata cacccctatg tctctgctcc catgacagcc 22921 accctcggcc ttctctcatc ctgggtctgt gttaacaaaa cagccttctc tctggcttct 22981 tgcctgccag cccctctgga gtttagacgt ccattttcat catctcaggc ctaacagtct 23041 acttcatacc acgtggaggt gaattcctct tccatgcctt gcaggcctgt aggttggttg 23101 tgccctatgc acagtcttgc atcccaccat ctccaccaca caccctccac tgtggaccag 23161 cccaccttcc aggagatttc tctctgccct catctgtcca aatcccaacc atcccttcaa 23221 agcctagctc aaggccactt ctaaagaaga ttctctttat tggtcttccc atgcaccctc 23281 ctaacacaca cacacacaca cacctacaca caaacactcc tgaacttcca gagcccttct 23341 ggagacacat gtaaaattca gtccttcatt acatcccatc tggaaaactg ctttaattat 23401 tttgcttatg tatctcctgt ttccctatca tatttttcat ccttttacaa gctttctttt 23461 tctgctcata aaaggcaata agggcccata tcaatcacaa gcttcatcaa gctctgttcc 23521 tgcaataaag attcttacat ttatatctca gctctaagtt tagcgatctt cttcgcatgt 23581 gcattctcat ctcactctca cagcaaccct gtgaagtgag cagggcagct cttagtcccg 23641 tggggaaaca aattcaaaaa ggttagactc taggaacttc ttaaggtccc accgtccaca 23701 aacagcagag ctgggactca aatctctgtc ctgagattcc aagtccaggg ttctttctga 23761 gacaccaatc cacctatgcc ttccttgttt caatgggggg cttcttgcac gctagcaaga 23821 gagagctatc tatcaatggc tcttcaagtg gggttcccag accagcagca tccacatccc 23881 ctgagaactt gttagaaatg caaatgggcc aggcacggtg gctcacacct gtaatcccag 23941 cactttgaga ggccgaggca ggcagatcat gaggtcagga gttcaagacc agcctggcca 24001 atatggtgaa acgccgtctc tactgaaaat acaaaaattg gcctggcatg gtggtgcaca 24061 cctgtagtcc cagctactct ggaggctgag gcagaagaat tgcttgaatc cgggaggtgg 24121 aggttgcagt gagccaagat cgtgccactg cactccagcc tgggtgacag agtgagtctc 24181 catctcaaaa aaaaaaagaa aagaaaaaag aaatgcaaat tatttggccc cagcccagac 24241 ctacagaacc ggaaactaca ggttgggccc cggcaaccta tgccttgaca agcccttccg 24301 atgatctgac gcaggctaca gtgtgagaac ctctggtcta caacttcact tgttttttag 24361 ggaacattta ctgaatacct gctgtggagc agcacatgct tggctctgtg aacacaaaaa 24421 taagaaggtc cttgctctca aaaacgtttt tggagtgggt ggggtgaata tgcagagttg 24481 gtagcatatg ttaaaaggga aggtcctaat ttgccccctc caagtctgca atctcaccag 24541 cagctcgttt atgcctcacc cagaaaaagc aggaggattc tggggccacg gaggccactt 24601 ccacgcaaac acagcagaca ggtctgtctc cagtagtttt ctccaatcca agaaaggtat 24661 attgtcatgc tagtcataca catttccgcc tccctctaga ctttgtacag aaagttaacc 24721 acagcttgaa ataacgtgca gtgtctccct tagtaacctt ctttaaatga gcacagctta 24781 gctaatttac atagaaggtt cccagggctg ctttctaaga atctgtgcag aggctctctt 24841 aaagacatgt ttgtgatgat ctaaacctgg gcactgggat aaagaaaggt aaagcaggtt 24901 ttcacccaaa tgtgggaggt ctttttctgc agaaggattt tctaccttct agaaactgaa 24961 gccatcctaa tgttcagagt gtagagagtg agaatgagga ctggaggggt tcatggcagg 25021 cccatgaaag gtctgtgccc aaactcagag gggatcaagt tggtcaccat ccatctcatc 25081 aaaatgtccc tctgaggccc actctccccc tctaggattc accaggttgg gaaagatggc 25141 ccaggcagct gaaccttcaa agatcaggag aagccattaa catcgcatcc cctgtcagtc 25201 acacacagac acagtagtgt gtgtcagaac cttgaatgtc ttttatttca ggattaggaa 25261 tggatgatta caaatgattt gtgccatttc tagctctccc tttcatccag gtacccatta 25321 attttttttt tctttttcag gtgccagcat ttaaaaatcc atccctggtc ccaacccact 25381 aaatgcttcc agagctcccc agtcacattt ccaaatgccc ccaggttggg atccatccct 25441 ggttgtgtcc ccactgtggc ccatgcaggt gggactcgaa atcagctgta aatcattggt 25501 tagccctaga attcattctg caccattctg aatgcctagc tctttatgtt tgcattatga 25561 tctctaaata gatgaataac atcttaatgg tggagctcat aagatagcct ccctgccccc 25621 actagaagac tcagaatcag aaactaaatt tacacatcat taccttgctg gaatttgctt 25681 catgaaggga aaatggactt tctgatataa cttgagtgga agtcaaaagt tgagggccca 25741 cctggacagt ctgagtgggg agcaagcctt aagccacaca ttcctgccag gctgcatgca 25801 tgccagttct aatcagagct aaaatgaaga tagaggctgg ggagagctgg tgcgtgcagg 25861 gaggtatcaa aatattcacc ttccatttga atgccgtgtc ctctaaacag tgcctcagca 25921 ttcaagacat gagtagccag gagctttcta aattctgaaa ccctcctcat ggacaaatca 25981 gtgtacataa ggaacgctaa aactccacta accctaccat tgtaattcag aatttaaaag 26041 cgggaatgat agtacacatg ggaagactag atcgctgcct aaatgctgtg tgaaagtctc 26101 ctattaaaaa ataattctat taaaaaatac atcctctact tgctggcact gttggcctga 26161 acaaccttag aacaggaatt atcccctcat tgcagttaac atgcaaacag agtacctggg 26221 aggagaatta aataattcac caaacatgtt gactcccttt cagtctcttt cccattggca 26281 tcagttcatt tgctggatca ctttaacaca ggcttcagaa aaggcaaatg gtaatggcca 26341 agggaaagga aatgtgcagc ccgggaggac aggcctgagt gaggttcctg accaggaggc 26401 cagggagggc actgggtttc agggagagat gtggggcaca cgggccaaag aaaaacaccc 26461 acttcatatt gtgtccaggg agggtcccac gcatggagct tatgtagact tctttaacct 26521 gagaaggaaa ggctgaggct aaccgatggc catgatcaag agcagacaac agggtactgg 26581 aaaccaaaga aatgatcaga tggactccat gtggactgaa gaggcccaag ggctctggga 26641 cagactggaa tgtgtgtact gtattatacc aaagggttgg aagaacctgg aataagctgt 26701 caggaacata gtctgatctc atttagagat ggggtgggag gatatggaca gtgtgcttag 26761 tcacacaaaa tagcatcttt aaatcccctc ttattcagtt aatatcctca taatcatatg 26821 aatgtccatc tatgtttccc aactgtatta taactgcctt tttctatatc tgactttcat 26881 gttttataca gtcatgccct gcataagaac atttcagtta acaagggact gcatatacga 26941 cagtggtctc ataagatcac aataaacctg aaaattacct agtgacatca cagctgtcgt 27001 aaggttgcag accaatgcat cactcatgtg tttgtggtga tttggtgtaa acaaacctac 27061 cagttataca aaagtcttgc acatacaatt acatacagta aataatactt gataatgata 27121 ataaataatt atgttactgg tttgtgtgaa taatactatg ctttttattg ctattttaga 27181 gtgtatgagt actttacaaa aaaaaatcct gtaaaacagc ctcaggcagg tcctttggga 27241 cgtattcaag aagaaggcat tgttatcaca ggagatgaca gctccatgtg tgctattgct 27301 cctgaagacc ttccagcggg acaagatgcg gaggtggaag acagtgatat tgatgatcct 27361 gaccctgtgc aggcctaggc taatgtgtgt atttgtgtct tagtttttaa cacaaaggtt 27421 taaaaagtaa aaaataaaaa taaaaaatgt ttaaatagaa aaaaaaactt acagaatatg 27481 gatataaaga aagaaaatat gtttgtacag ctgcacaatg tgcttatgtt ttaagcgaag 27541 tgttactcca aaggagtcaa aaaaattgaa aaagatgtta aagtttataa agtaaaacag 27601 ttacaataag ataaggttaa tttttattaa cgaagaaaga acatttttaa ataaatttag 27661 tgtggccgaa gggcagtgtg cctaaagtct acaggagtgc acagtcctgt cctgggcctt 27721 cacattcgct caccactcac tgactcaccc agagcaactt ccagtcctgc aagctccatt 27781 catagtaagt acccttacag attggccatt ttttatcttt tataccatat tctcactgtg 27841 ccttttctat gtttacatat atttagatcc ataagtactt accatggtgt tacaattgcc 27901 tacaatattt agtaacatgc catacaggtt tgtagcctag tagcaatagg ctacaccata 27961 taccttggtc tatagcaggc tctgccatct agatttgtgt aagtgcattc tatgatgttt 28021 gcacaatgat gaaatgctta acaatgcagt tctcagaatg tatccctgtc gttaagcaat 28081 gcatgactat gttatacttc aagtacatgt agcacagcct gtcttcagcc cgtggagacc 28141 caggactact taggaaagaa gtgacagtat ccaaaggcag tgaggatctt ctagctgcaa 28201 tgcttttttt tcaaggattg tcttttccct gaaataattc tttgcatact ttttctcacc 28261 caactcctac tcagaatcct ggaacagttt ccccgtgtct gtagcccaaa tgcctcatac 28321 tctggtaggt gcctacccct ccctctttgt cccctcccat gtcaccatgt caccaaactc 28381 acctttcatt cacccctatg gaacttattt gggtctcagg gactataatg taagatacta 28441 aacatcccta agattatgta tatagaaaag tcctgctagc aatttcattt tttaaactct 28501 cctactctgt accaggctct gagataggca ctgggaaaca agaacaaatg caaaattcat 28561 gtgactattt tttaaatgaa attataagat ctcaaagaag gtttgcattc atagttggcc 28621 acccagatca tggccccaga cactcagaag ttctgttcac agtcttctgc catcaggggt 28681 ccttgagtac atgatcttga gaagtccagc aaggctgtga gataaactcc agcattcttg 28741 cctctgctca gtccctactc accctaaagc ccccaggcct ggagagccct ctatttcctc 28801 ccacttgagt ttgccccttt ttgcagatcc agctccctcc agttgcttcg tactccatgt 28861 gcccaccttc tccacactcc tgtaactgtt tatgtttctc accccttatt tgactctctg 28921 caaaagttac cttctatttg tattcaccta tttacattta agtctaagga gagagaccgc 28981 catgtcttat acagtcggtc agtctggctg gtaacacatt cctgtgtaaa tacaccaaag 29041 tcaaaaacgc aaatgtcaaa acacaggcaa atgtcaaaac acttgttatg gtttaaaaac 29101 cataaatacc gcttttaaaa tattggtatt aaaacaatga ccacacctcc ccctcttaca 29161 taatgtattt aatttactgg tatcattcca gcatttcagt catgttgctt aattttgctc 29221 ttttccctca caaccaaaca caccagcaac tggcttagct ccaatgagca gctactagag 29281 gctaaaggtt ccttcacttt ccactgaact gctcatttct gtgttgtcac agttgaatta 29341 aattaatgag acctatactt cttattattt gaatcagcac tttccttatt ctgtttttat 29401 agcaaagtgc caccaaaagc tcaaataatc acacacacac acacacacac acacacacta 29461 ctgtgcagac aaagatgaca gccacactgc taaagccgaa tgaaagactg acaagcatca 29521 agaaattgtt cagaatatgg gctcatcagc taatgtacaa cagcctgaat aagaggagta 29581 caaggggagg gtttttattg gggtgggggg atgacatata acatacaaaa ttccaatttt 29641 atggtggtca aggattgctt caactatttg ttcactcctg accccctgca gagagtagga 29701 cctccactgt caatgagaac aattcacctc cacagaataa tattccatcc cccagatcag 29761 tggcacacgg caaaccaatc cttacacaac agacagcaag agaccttcgc agtaagaaaa 29821 cattttttct cacctgccgt gcctctgtat ttcccaactg tcagcttata ccgctccttg 29881 ctggaggcca cttggaagaa atcatatata gcataggcag attcattggc agtctgtaaa 29941 tccactctca cctcataccg cgctggagtg ccggtggtga ggttgtgtag cttgtcaagt 30001 cctaaaaata aaaaacggaa ttccaataat caatcaaccc atgtgtgcga gtaagcactc 30061 aacatgtgtg tctgaggctt tcagaaccta aagtcacttt agtctttacc ttctctgcta 30121 ataaggggca gtctcagcac tttctctttt gggggctcat agtcacacta aggagtgcga 30181 agaccacatt ctatggaagg gagcagtgaa gggggcaggg taaccccaga aggaatctgc 30241 ctccattggc tccatccttc taaaacagaa gcagttgtat ttatcatcca tttatctcac 30301 caagtaggag agcaggtcaa cgactcatcc taagatcccc aaagagattt tacactgagc 30361 aaaccacaaa tatcatctac tttattagac actgggtatc tgaactatga ctttttaatt 30421 cttcatttcc caattcatgc ttttggctat ttcactgtct gtgaacatct gctccagagg 30481 agagcagagg gaaaaggggg aaaaaattgc tcataatgcc tggcatcacg aacagaattt 30541 tagagactgc tagagtgacc tcccagtaat ggaccagcat cccaatcact gaactttttt 30601 tctactactg tggcttctaa gatcttcagg ctgcaggcag atcctgcaga gcttaaatca 30661 gcataaaacc cctgtgacca ccctaccttt caacaagttc agagtgcaag acagccttcc 30721 catcaacatg cccaccaaca ccctgctcgg cacagtcctc tgtctccctg ccactggggc 30781 ctctctgtga aggtggtgag cttccctaag cctcctagct agcttgtcct gggagaaatc 30841 accccccaca gccccagcct cctggattct gagatcatac caagccagaa ctccttcatg 30901 gggtccccaa agccttccac atagctcctc catcgcttga agaaatccag ctgcccagtg 30961 ttccgcctct ggaagacctg agaaggaaga tgaatacttc actctgaccc tccaaacaga 31021 cataggtgct gggagacaag agaagaccaa gagcttgctc atacactgcc ttcctccagc 31081 tagcaagcct ttcattctag agaggctttt tatttatatg tcagtagatt cactatatta 31141 ataagcattc tttatatgta ccgtagtatg ccaggtacta agacatgtag agcaaaagca 31201 gagaaggttc ctgccctcaa gaaatttcaa ttctttccat tagttgtgtg attcagttgc 31261 tagtgtgggt tgtctagggc aagatattcc agacctctgc taactctcac atgtaacttt 31321 caacacccat tcaactcaat tggggcacat catgtttatg gcatcttttt tccccttttc 31381 tgaaatagtt tgagggtagg aattattttt taattcacct gattcttaat atagcgatta 31441 tttaatactt atggagaaaa aagttcacac aacaaatatt tactgagcat ctactatgta 31501 ctaggcaata tgcaagatag tgtgcaaagt gaataaatca gaaatagtcc ctgccctcct 31561 ggaactaata gttcagtaaa taaaattact aaattaaata atttttaaat gcagttaaaa 31621 ctgtgataag ggctatgagg gaaagatgca acagaactgt aagagcataa aacaggagga 31681 cctgaccaag gagccaagaa accaccccca ggacatggct gagctgcagg atgagtagga 31741 gtggtcaggc aagagcacgg gggtgaatgt tggggcttct gggggctgag gaagggatgg 31801 aaagggaggc tgagaagatg gcaagagtca gtcctggtga gcctttaaaa gggattgttg 31861 aagacgttgg tccaggtaag agcgctggaa attattgcag agttttaagc aggaaatgag 31921 atccaatcag attcgtctct agcttgtagt gtatggaatg gactggtcag gggcaagagt 31981 ggaagtgaga tgaccaatta ggagaccact gtagagtgca gatgaagggg gacagtagct 32041 tggatgagga atgccctgag aatagatgga agtgcataga aggtaaagtt gacagaggtg 32101 gtgacagaat gtggaactga aagaggagga agtcaaggta agataagttt cttatagtat 32161 gggcttgcag atgactcttc tctcagctat tcactgagat gggggattca gagaggacag 32221 gtggtagttt attgttattt atttattttt gttgttgttg ttgtttgttt gttttttgag 32281 atggagtctc actctgttgc ccaggctgga gtgcagtggt gtgatcttgg cttactgcaa 32341 cctccgcctc ccgggttcaa gcgactctcc tgcctcagcc tcccaagtag ctgggattac 32401 aggcacacac caccatgccc ggctaatttt tgtattttca gtagagacgg ggtttcacca 32461 tattggccag actggtcttg aactcctgac ttcgtgatcc acccacctca gcctcccaaa 32521 gtgcaggtgt gagccactgc gcctggccaa gtacaggttt tatgaatggg tcatgagctc 32581 atttggggat gtggttaagt gtgagttatt tctgaggtgt gctagtgaat gaaaatacca 32641 cagagcaatt ggatgtaagg gctggagttc taaagagagg tctatgctag acttattaag 32701 gtagagtgtg tctcaggcaa tgctaggagt tcactgagtc tcagttcact ttcctcttcc 32761 caggaacaca ggaggaggat gtccatttac agcttccctt gcatttaagt tggagccatg 32821 tgactgtttg cagccaaaag gctgtaggtg ggagcctttt gtgtgtgggt gggttctcca 32881 cactctctct tctactgcca tgcaccatgg ggccatggga aaggccacac actgaaacag 32941 cagaactaca agctggaaac agcctgaatc cctgagtcac acttatatct ggctgccaaa 33001 cctgcacaac acattgcaca tataagaaaa atacactgtg tttgttcagt cgctggaatt 33061 tggagcccat tctatcctaa ttagtacaga gaagcatggg acccagagga gggtaaaatg 33121 aagtcatggg aaaagctgag atcacgtagg aagaaaatgg gaactgaaaa gataaagtgg 33181 cctagaatca agccttgagg acatccaata tgtataatga agaaggacga atccattaag 33241 gagagagaag gggcaacaga agtggagaaa aagaaactag aaaaatatgg aatcacaaaa 33301 gccacaaaga gaaaggtgtg gtcaatagtg tcaagtactg atgagagctc aaggaacagg 33361 aggccaaaac catccctaga atttagtggc acagaaatca ctggtgattt tagcaagggc 33421 tgtgtcagtg tgagatatgg acagaagcca aactggaacg ggccaaagaa tgagtgagaa 33481 gcaagaaaat ggaggcagag ttgggtgcca ttcaccaaag tgtggactgc cttcttttca 33541 aaaattccac tgtttggaga aggaaggcag ttccagtcac agtaaattgg attcttttat 33601 aggaagtaaa tctgaaaagt acccacacac acctgtatac actattggaa ttgataatac 33661 atctaattcc acatgacctt gagagccaag ctgaatgcca tcctagctaa cctcaggaat 33721 gattcccact aaccaaaggc tagaattgca tttctctctc tgcctatttt tctttgcatt 33781 cattcagatg cagaagcccc aacacaaaat ctaatctccc tcctttccac tagacttcac 33841 agcatgtgcc ttgaagtaaa acacattatc ttgaatttat aaggtaaacc acctgatatt 33901 tgtgccttct ttatattttt ctccgggaag agagagtagg ttaccctgca gcaatgctga 33961 aagctccaca tgggagtgca tatttaactc cttccaaagc acccattcat gtctgctggc 34021 caaggagtgg ggcaccatgg atgcagaagg ctccagactc caagtcagac cacagccact 34081 tcctgggcag gggatgggtg gggattgctg aggcagcctc ttctattgct gaccctcagc 34141 cagctcatgg taaaacagga atggaaggga agagaagagc ctacttcata agacttctgg 34201 gaggactaaa tgagacacca cataaaaatg ttctataaat aggcaagagc tactgaacag 34261 aatgtaggaa atatcacggc aaacagctaa caccacatta acattgttgt tctctttccc 34321 ctttgctcaa gtatccatct tttaaaaatg ccctatttcc ttcttggctt ccctcacttt 34381 tcttcactca tttattcact tagacgacaa tcatttcgtg agtgctggcc ctggagcaag 34441 tatttatgat gccatttgac agatgtgaaa gtccagaaaa gcgagtatcc agagtcctac 34501 agctaatcac tgatacaaag accacgcata aatcttatct ttctgtcaga aagtttgaaa 34561 ataccagccc tactgaatct gtttcccaaa tcgaattttc aaagactctg agaacttgct 34621 aatttcctag caaggtgcct ggtccaaccc cttccttggg actcttgttg taccaaggac 34681 agtggaattt taggatacat gtagcaccga tgtatggctc cacatacatc gtgtcaaatc 34741 acgttcccac cacaaggtca ggctcaaatc caaccctgcc cccctagaat gcattacaaa 34801 ggtatttggt ctctatctcc tatcagtgaa cccagtgccc aagcttgata ggagctttcc 34861 agcgatgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtaataat acttcatctg 34921 ggccagggaa gcttccaaaa atgattctgg tgctctttgc atcactgttt tgaaccagaa 34981 ccagttctat acttactagt tgtgtctttg ggcaaccttg acaagcctca cttttctaaa 35041 ctgttagatt aagatgttaa cagcctctcc gtcacgatct taaatcaagg cttcggcaag 35101 atcattaatc atagtaaatc ataaagtagt taagccagac cctggcatat atagatataa 35161 taagttatta tgatctctca ataaatgtta gtttgcattt tattatatca gtttccaaat 35221 gaactgaccc acagggatca tttatacatc ttgatcaggc agccttaatt gactcactca 35281 gagtctaatc cactcccatt ctatgagatg aaaacaccct ataatgaact atttgggtat 35341 cacaaaaccc ccttaccaaa taagtttcaa agggagaaaa acaaacaaac aaaagaaacc 35401 tctttgtgtc tggttctata ctagagttgt cattgcattc tagaacttac tcaatgccct 35461 gaatgactaa atacagagaa aaaaagtcaa acttgcattg aaatgtctgt ggaaagagcc 35521 tctaatgcat taagactttc aagatggaaa acaaagatgg aaatatttga aacattaagt 35581 gttccctgat gttctttcca gcctcccacc agcggcagag tttacctcag ggcacagttc 35641 ttctggccta ttcttcagat tgagttcctg ttgcttggga tggaagaatt accataaaag 35701 gattttgaaa gctcaataaa acagagaaac agaccatggg tcatccttca gtgacaaagg 35761 actcttgaac ccccccaaaa aaactgtaac aactgtcagt aaaatgtttt gtggggaagt 35821 tttcaggact ccatgttttc tggagcatgg tggctttaag taaaaccctg cagaacctca 35881 gttcatgcat gacccccagg agtctggctc agaaaacaag acttggggtg gggttggggc 35941 tggggagggt gagaatgcac agtggtttta ttatttttgg aaccgatctc tgtgtttgag 36001 gaggggtcat ttgcatgagc aggatgtaaa tggcaagctc cttgggggca gggatttctg 36061 ttggtttggt tcactgctgg atcttgagta catgaagcag tgccgacaca tagtaggcac 36121 tcaataaata tatgtgcaat aagtcatgac tattccaggg cagacagact gacagatctc 36181 tttctaaatg ttgccccaga taactccttc tgtgtgggag cttggcttac aaaaaggcca 36241 cactagacaa tacaaacgtc ctgccaaaag tcaattacag gcacaaaggc aggaaattgg 36301 gctgcaggcc tgaagggaca gaatgcttag ccttacaatg gtgctgatga aaagcagcag 36361 gtacagccct atgggtctgg tcaggtggaa gacctcccat gttacattct gctctgttgt 36421 ttttataaat caatcgggaa tttcccccag ccaatccaat tactgccttt caaccgtcag 36481 atgtcgctgt gcagataata agttttcttc attgctgctc atcagatggt atctttttat 36541 tcatggctgc tcaagtgtct ttggtttccc aggcccctga gtgaatgtgt aaatgaataa 36601 ggtttgagca ctatgagatc atgctctctc tttcccacag tgagttgcgg atgcacagca 36661 ggttaaggct gcatgctggc ttggagcagt gagagtcttt atcattcagg cacagggaga 36721 aaaggggcaa ggggagatgg aaggatcaga ttgcaaccct caaaaacagc ttatcatata 36781 attggatcag gctgatgcaa tcaaagaagc tgccatgtgg tttaggcaga caatggccca 36841 agagaataaa acatctacac agaagaggga caattagagc attgcccaag tttccgtctt 36901 tcagacttta gtctttccaa tcaaaacctc aaaggccttg ttgctgtact gtggagacag 36961 ggtacagaac ttggacaatg gactccagtt cagaggtctg aactccccag tcaggctatg 37021 cattcgcatc ttggatgacc tgagatgaac tcactgagaa gaccaatgat taggaggctg 37081 gagttttgaa atctggtccc agctttgtca ttcttgtatg acttaggcca ggcacttctt 37141 tctgtgggac tcactgtccc tatctgtaaa ataagaaaaa caatcctctc tcacttactc 37201 taaagctgtt gtgtgagggt catgtgtaca gaagtgcttt gaagaaggta aaatataggt 37261 aatacataca tataaggcat tactagatga tgatcaactc acaagccttc atagtcatca 37321 gccttagtct aaccaagcct gttctaggac agtggttaag tactaggctc tatccttaca 37381 gttgattggt gtcagaggca tttgaaccag agccatcttg agtgagtgct aagacaatga 37441 ggctgggact tgctgggctg cattcccaga aagttaggta ttcctagcct ccagatgttt 37501 atggttaagg gaacagattg acagtgttta ctaaacagac ccagacttag gaatgtcctg 37561 atatcttgag aacagaagca ctcctaatgt tgctttaaat ataataatat tgattctcgc 37621 aaaatatagt aattaagaaa attaatcctt tatcacgaac cattgtaata gagcacatcc 37681 cctcatgatc tttttttatc ctatatataa acaagtcttg tacatagggt gcatgcattc 37741 ctccttactt tcagaaacgc cctactctgt ctatggagta gctgtacttt caccacttta 37801 ctttcttaat aaacttgctt ttgctttgca ttgtggactc accctgaatt ctttcttaca 37861 tgagatccaa gaaccttctc ttgggatctg gatggggacc cctttcctgt aacattggga 37921 tctctctaaa aacagctaca agccaccaaa actcctctct tccatgggtg gagaagaaat 37981 agggctccaa acactctatg gagcatccat gaccggcatc ctgggctgag aaggaaggcc 38041 ctacacagag ctccccaggg ttctgcgtga ctcacaatcc agccacctcc gtccgtttcc 38101 atgtcacagt acacctgcag gggccggctg gcatcgccat gcaggtagat ggtgtacaga 38161 ccactggcgg cattgctgtt ctgctgaacc tgactgcagt ccgaagggtg tgggaaacgg 38221 gcaccaactg ggagccaagc agagagagtg ttgttaggac aagggcaagg aggaaagagg 38281 tggagtttct tcctatttta aggcaattta ttttgtttgg aaattatttc ataacttcct 38341 acttcagctt cttccaactc ttcactcata tcttattttt ctcatcaact ccccaaatgc 38401 accaggaaaa ggtcaagttc acatcacttc ctcccccaac tcagctccct cacaaagtaa 38461 cactctattt cactcctctc tttagcctcc ccactgtggg gtggatttct tttgaacaca 38521 caaccccatt tccacttccc catgcctaga gaaatcaata gttctgcatc cacttgctcc 38581 tccctggact cctttgctag gctttccggt gctggttctc attgcacggt cgttcagagt 38641 acaggattcc atattacctg tggagagggt ggtggataca tttctgctcc ggcgaccacc 38701 cttaaaggca acaagggaga cagggtaggt ggcgccttgc tcaaggcctt gcaacgcaaa 38761 cctctggtct tcccgtccca gctgcatctc ctaccaaaca agggagagat tagacagcca 38821 gtgactcagt ttatcaccca ctagcaccct aggtgtcaac agcgttccct ggtgctaatg 38881 acagaagctc tctgagagca cagtgttttg tctctccgaa gcctcagtgg cagcccagaa 38941 tcgcagtggg tcccagccac ttctctggag agccatgcca tggagagtca gcactactgg 39001 gagcatcagc tgagaaccag cacccaggac tgcagcttca ggatcaccgg ccaagtttct 39061 gggagacttt ggccttcaga tacacacagg ggctctaact atagtggcta cattgccagc 39121 aggaacaggc tggcaatgcc gcgctaagga gaaattaata ggggaggtga aactggccac 39181 tgatatctga gacaaggaga tagttgatat ttcaataaac ttccttctga accagtctta 39241 cttcctctat aaacataggg ctaaccttcc tagctgtgct gtcataagca ctaattaagg 39301 taacacataa acctaagtgt ccttcctgtt tttatttcct cccttctttc ttcattggca 39361 aggagggaga ataataattt tccattggga caggttttga aaatttgctg gccttcagat 39421 tctaagggga ggtataaccc ttgaaataag ggttaatcaa tttgttcaat aaatatatat 39481 cgaacaccta ccatgtgcta agcactgtcc tatgcacttt agatatggtg ataaatgaaa 39541 cagacaaaca tagagcttac atttattggg gaagactaac aataaacaaa ataacccggg 39601 gcagatgtac aaccagcaat gtgatcagta aagatcagca agttgccgtg aggccagcaa 39661 atgagctgat tggcccaagt tagttgcttc acacagaaga gaaggcacca ggagaaaaga 39721 caaggaatcc ccgtacctta actgtgccat ctgggaactg gtaagtcaga atgtagccgt 39781 ggatctgagc agaggggggc gtccaggtca atatgccacc agactgcgtt acagcagatg 39841 gacgaaggtt tctgggaggg tcgagttctg tacgtagaaa gtcaggtaat tgttattact 39901 aagatacaac tgaggcaagg taatttttga gacatcagtt tatattttcc ccttttcaaa 39961 ccctcccatt tctcctcctt ctttcaagca gtttgcaggc tctcttcctc cctgccacag 40021 ggcctttctg acctttgttt ctacaagtac agacccatta gagacaagaa gaacagggcg 40081 ttgggtgggt cccagaggaa aggaatctgg gcccccattt cccaggcagg ttcccacaaa 40141 gtggtggata ttaacagcct agattggcag atagatcctc agagatggca gggctcccaa 40201 aaggatgtgg aaagataaga gggccacaga tctgaaaggg ccatgcacaa agaaaccagg 40261 gtgacttggc ccgcctggaa agttgcctga aatgctggat gggccccaat gcctgtcact 40321 gggaaactac ccaagactta catacagccc tgaggaagga gcttacccta agtagaatat 40381 gggcttatga ttgaaattaa gttggcatag aaacaaaaag ttggccaggc acggtggctc 40441 atgcctgtaa tcccagcact tcgggaggcc aaggcaggcg gatcacaagg tcaagagttc 40501 gacaccagcc tggccaatat gatgaaatcc tatctctacc aaacatacaa aaattagccg 40561 ggtatggtgg tggtcgcctg tcatcccagc tactcgggag gctgaggcag gagaattgct 40621 tgaactcggg aggcggaggt tgcagtaagc caagatcacg tcactgcact ccagcctggg 40681 tgacagagca agactctgtc ttaaaaaaaa aaaaaaagaa agaaacaaaa agttatgttt 40741 cttacatact gttacatatt gaagtgtgtg gccttagatg catacccagt agagaaagaa 40801 gagaaggaga aggcaaggga acaggtgctt gtatagagga gagagagaga gagagagaag 40861 gaaagaaaga aagggagaga agcgagagaa agaaagaaag agaagagaga gagagcggga 40921 gagagaaaga aagaaaaaaa gaaaagaaag aaagaaaaag aaaaaagaaa gaaagaaaaa 40981 gaaaaaagaa agaaagaaag gagggaggga gggaaggaag gaaggagaga gagaaaagaa 41041 gagagaaagg gagggagaga gagagaaaga aagaaagaaa gaaagaaaga aagaaagaaa 41101 gaaagaaaga aagaaagaga gaaaaaagag agaaagaaag gagggaggga gggagagaag 41161 gaaggaagga aggagagaga gaaaagaaga gagaaaggga gagagagaga gaaagaaaga 41221 aagagaaaga aagagggaga gagagagaag gaaggaaaag agaagaaagg aagaggtggg 41281 ggagggaaga gggaaagaga agggatatag gaaaagcaga gctcttcttc tatgtacttc 41341 ctcacccaaa cactagaggg aacgcgtggg taagcaagtg tgcactgtag accaggaaaa 41401 cagaagctca gcaaaaatac tggtcaacat cactggtcag aaggctcttt ttttttttaa 41461 ccaaagaaga tatgaatgga aatttctctt ttgaaaaagc attatcatag aaggttattt 41521 taatttataa cttcccaaaa agacgtctcc aggctatttt ctacccagta gcatttcagc 41581 tgagagctga attgcagtat tttaatagaa atcagacagc tgttgttttg aattaggatg 41641 ttttgacaaa ttttgagttt ctgtaatatc aacttggttt gattttctaa agcataacat 41701 ctggcatgga ttttattcta ttcgtctcac agaaaaactg aatttcctga tttcacaatc 41761 aaggcaaaac tgtggctatt ttaacatctt aagcaatctc ctgccacctc tttgactgcc 41821 caccactccc tggttatgga cttgagatgt ctgtcttagt tactttcaca tttctatgaa 41881 gcaggaaaat gagaatcttt ttattgctgt cttcaaaaat ggagaaggtg aacccaagag 41941 aagataaaca aggaagagga gaagaaagga ataagagaga aaggaacaga gaaaggaagg 42001 ggagaaggag gtaggaagtg ggaaagaggg aagaagggag aactggctac tggttgcatc 42061 tggagagcaa ccctgcatac tagggagtcc aggacagaga gaggagtttg tttgtttgtt 42121 tgtttgtttg tttgcttttg agatggagtc tcgctctttg ttgcccaggc tggagtgcaa 42181 tggcgcaatc ttggctcact gcaacctcct cctcctgggt tcaagcaatt ctcctgtctc 42241 agcctcccaa gtagctggga ttacaggcac ctgccaccac acccagctaa tttttgtatt 42301 tttagtagag acggggtttc tccatgttgg tcaggctggt ctcaaactcc tgacctcagg 42361 tgatccacca gccttgacct cccaaagggc tgggattaca ggcatgagcc accgcgcctg 42421 gccgtttttg ttttgttttg ttttgtttta acacgtctct atttagtttg aaaatattta 42481 ctataagcat atattacttt ttagattaaa aatgcaagct aattttaact aataaaaacc 42541 atagagaaat tcaagccatc tctaggtccc gtggcaacac ttggtgcgct cagccatgga 42601 tggaatggac agaccttgcc cgtactgcca gaagaggctc atagacctca gggtgggcac 42661 agtgctctct tccaagtgcc tctgggttct gacctcagtg atgctctgcc aagatcagag 42721 gccccatcac acacacaaaa tacatgcaat tcaggcacct ttcatcatca actgaggatg 42781 gaatggagaa agatgcatcc ccattgcctc cattctggaa gatggaaaca gaagaggctc 42841 ctgcctctgt cttctcccaa ctgcagctga ggcccaacca ttttctccat agcaagtgtg 42901 gagattctga gcgctgggcc gggtggaggg gtgagggagg ccacctttga caccagcact 42961 cactccctct atactctctc atcttatctg agctgttgtg tcaacttaat tagctcaaag 43021 acgtaaaaaa ggaaccactt tgaagaatgt cctcgggtaa agcaaccacc ctcctgctcc 43081 cctccccagc ccactgatgg aggaaggttc tctcctgata agtgtgaggt gttcccctgg 43141 cctggtccct ctcagtacct gtctgggcct tggtgtcagc cttcttgctc tcctgggccc 43201 ccttctgggc ccacacgtgc accatgtact ccatgcctgg tctcaggccc gtcaggacag 43261 tgctgctgtg ctccttcccc accggaacct ccctggtctc tccgtcagca gaggtgtagc 43321 gcaccatgta cttgtcaatg gtggcctgaa ccgggtccca ggagacagtg gccatattct 43381 ccgtcaccca gtcagtcact aggtttttgg ggccgtcaat ttctttttaa aaagatttag 43441 aaaaggctga actttagaac aaaggaagac atcgcatctt accacctcat caccaggagc 43501 tctccatcat gttatcatta ataatggtag aattccactt aaaacacgag tcaaaggccc 43561 ctggtctcca tttccaagga aacaaaccac tggatataaa actggaagga atcagatcat 43621 ctcccacttt ttaatataaa aaatttctgt ccttgtaatt ctcatcaaaa taacaataaa 43681 acttcctgag caacaagata agccaccaaa actgctgtgg atataacctc ttttctgatc 43741 tccctggtat ctagttattt tgtttcacag aatttgagag ccaagaagtc atatcctcca 43801 acctttccga cttataaaag gggaggagaa tggaggccca gggaaatgga aggtctaggc 43861 caagttcctg ttgacatctt atgttaaagc caagactctt cgctctgaat tttaggcctt 43921 cccaccctct caaggacttt gctgcagatc tgccctttct ctcctgtgtc atcaatctct 43981 tcttctctgt catcagtccc agacaagcaa ggtctagaat cccccatcct ctcccgtatc 44041 attccatttc tctgcaacct taaactgaca gattgtctca agagaaatgt ctacatacct 44101 taacttcact gctcactctc caatcaatcc tgtttgcatc tttctatgcc acacaactaa 44161 atctgctagt ctaagtcacc aacaacatcc gtgttgccac atctaatgca tacttgccat 44221 tctcaacttt acactgcttc tgggtggcat ttggcagagc tgatccactc cttccttctt 44281 gaaacactct caacccttgg cttccatgac tacatactcc cttgattttc ctaactcact 44341 ggctcttctt cagtcctttg ttagcaacct tcaaacactg gagcacccaa agctcccttt 44401 ctctgttatt tgttgtgttt tgttttgttt tgttgataca gagtctcggt ctttcaccca 44461 agctggagag cagtggtgcc atcataggtc actgcagcct tgacctccca ggctcaagca 44521 atcctcccac ctcagcctcc agagtagctg ggaatacagg tgcatgccac cacactcagc 44581 caatttttgt attttttttt tttgtaaaga caggatttca ccaggctggt cttgaactcc 44641 tgagctcaag caattcacct acctcagcct ccaaaagttc tgggattaca ggtgagagtc 44701 actgcacctg tcccaaagct ccctcttttc ctctctgtac ttttccacct aggtgatctc 44761 actcatgccc atggctttaa ataccaactg gatgctaatc gctccacaat atctattaat 44821 tctgagcccc tgactagttt atacaactgc atctccacca cggtgcctgg cctgtattag 44881 acactcaata aatatttatg taatcaataa atgaatcact caattaacca aggaatctca 44941 caagtatctt gaacttaaat atttaaaagt cttgatgctt ctccccaaac tttttatttc 45001 tcagtctccc ctctcaagaa atggtagtcc aatctcccat gttgctcaaa taaaaaacac 45061 aggtcatact tgagtcctca ttttcttcct accccacatc tatcaacagg tattatcatc 45121 catacctcaa aaatatattc cagacatccc ctcttctctg tcccttttgc catcatgctt 45181 acctagacaa gcagcatttt gcacccagac ttttctagca gcttccctac gaacctccca 45241 acttctatcc ttgtcatttt ctccatttat tctcctcact gtggcctgga aaatgttttt 45301 taaatgaaca ctgcatcata tcactcttct agtattaata agtatagccc attttgtcct 45361 ctaagtacca catggtcctg gctgatctgc ccctgctcct cccgcctcat ctcaggccac 45421 ttgctcttcc accctctatg ctccagcaaa actggcctct tttcattctc caaccatccc 45481 aggtactatt ccagctgggg gacatcacac atactcttcc gtttgccaga aatggctagt 45541 ttttgccaaa atccattctc ccccttcttc tgtaagcaca cttactgtcc aacagatctc 45601 tctgcaatga tgaaagtatt ctataatcgg cacagtccga taccgtagcc attagtcaca 45661 tgtggctact gaccatttga aatgtggcta atccaaatga gaaaatattt ttattttatt 45721 tttaattaat taaaatttag gctgggcata gtgacccatg cctgtaatcc cagcactttg 45781 ggaagccgag gcaggaggat cacttgagcc caggagttca acacagtgag accccatctc 45841 taaaagaaat aaaaaattac aggtttagtg gcacgtgcct atggtcccag ctacttggga 45901 agctaaggtg ggaggatggc ttgagcccgg gaggtcaagc ctgcagtgaa ctgtgatcat 45961 accactgtac tccagcctag ctaacagaat gagaccctgt ctctgaaaat aaaataaaaa 46021 taaaaataat tttaatagct acatgtagct agcagctgct gtgttggtca ggacagcata 46081 accaaagtaa agctgggcac agggccatcc agctaaagac ctcatttcct agcctttttt 46141 gcagtgaggt ctgtctggga cctggcaatg gaacatgaga agggctgata tgtgcaatgt 46201 cccagtgatg ccctgacttt catcatgcct cttcccactg gctacaatgc agacagggtg 46261 gtgaccaggt aaaggagggc aaattgctag ggatgactaa gcaacaatca cagaagaagc 46321 ctgggtctcc tggacgagga gctaccgtat cagccctgga cattctacag agaggaatac 46381 actccattcc tgctgatgca actgttattt gagtgtttct gtctatgtat atcctattac 46441 tactcaccct tcccatctgc aagacatcct taccctgtaa ttctctatac ctacaccatg 46501 ttcatctcct tcacagaatt tagcataatt tacggtttta aatgtgtttt cttatttatt 46561 gtctatcttc actctagact tcaagagagt agagatattt gttttgttca ccactgtgtt 46621 ccccaggcct ggcacttaga tgatgctcat taaaagtcag ctgcctactg caaggaaaca 46681 gggctccagc cacccaggtg aatacttgct cctttctatc acacagcgtc tattcggtca 46741 aacctataca aaatagcttg ccacaaaaca ggtttttaat cagtggtttt aagaatttta 46801 tattggtttt acattatttc ctatagctga gagacacaat gattaaaaac acaggctttg 46861 gaaagagttt taagacattc atgggcccag gcgcagtggc tcatgcctgt aatcccagca 46921 ctttgggagg ccatggtagg cggatcacga ggtcaggaga ttgagaccat cctggctaac 46981 acggtgaaac cccgtctcta ctaaaaatac aaaaacaaaa ttagccgggc atggtggcgg 47041 gcgcttgtaa tcccagctac ttgggaggct gaggcaggag aatggcatga acccgggagg 47101 cggagcttgc agtgagccga gatagcgcca ctgcactcca gcctgggcga cagggcaaga 47161 ctgtctcaaa aaaaaaaaaa aaagaaaaaa aaggcattca tgggactgct cccactgggt 47221 ccattactat agttttgtca gacatgcttg aaccagagaa actccatctt gaataggaac 47281 tgggtcaaat aagggtgaaa cctactgggc tgcattccca gatggttaag gcattctaag 47341 tcacaggata agataggagg tcagcacaag atacaggtca taaagaccgt gctgttaaaa 47401 cagcctgcag taaagaagcc ggccaagacc caccaaaacg aagatggcaa tgagagtgac 47461 ctctggtcat cctcactact acactcccac cagtgccatg acagtttaca aatgccaagg 47521 caatgtcagg aagtcaccct atatggtcta aaaaggggag gtatgaataa tccacctctt 47581 gtttagcata tcttcaagaa ataaccataa aaatggacaa ccacaagccc tcagggctgc 47641 tctgcccatg gctagtagcc attcttttat tcctttactt tcctaataaa cttgctttca 47701 ctttactcta tgggctcacc ctgaactctt tcttgcacaa gatccaagaa ccctttctaa 47761 gggtctggat cgggacgcct ttcctataac acgttgttct ttctttgggt gaatgaaagt 47821 tttttgtttc tttgtttgtt ttttgttttt gttttgtttt gtttttacag caaactctcc 47881 taaaacagtc tgcagccctg aggttacaga tgccaaaagc agctgcatca tcactcagct 47941 gcaatagatc aacctgcagc tactacccgg gtttgctctt ctcttcactt ctattacctg 48001 tctgggcctt ggtgtcagcc ttcttgctct cctggttccc cttctgggcc cacacgtgca 48061 ccgtgtactc catgcccggc ctcaggcccg tcaggacagt gctgctctgc tccttcccca 48121 ctggaacctc cctggtctct ccgttggcag acgtgtagtg caccacatac ctgtcaatgg 48181 tggcctgcac cgggtcccag gagacagtgg ctgtattctc tgtcacccag tcagtgacca 48241 ggttttgggg gctgtcaatg tctgaaaaaa aaaatgctga ccaatatgca ctatgaacag 48301 gtgccagagt gcagaaactg gtggtcattt tgcgtggggt gaaacccttt catgttgtgt 48361 ggggtttccg cctcatatgc tcttgtttca tttttagctg tttcatggac atcatcttct 48421 cccccaagat atacctggct cctcaagtag aaagcacaat agatcagaac ctgagaatcc 48481 aggctgttac ctctaccaag tctttagcaa aacaaatctt ttattcatct agtaagggta 48541 tatcaagccc caactctgtc catgaaatgt tctagacagt gtgcagaatc cagaattgga 48601 aaagacatgg tctctgctca ccaggaacta acaattttgt gtctcacatc atgtcagtaa 48661 agacctccca tttcatacta gaagattcta aaagccactt ctaaatgtaa gcccctggag 48721 gaagagggca gcatcctatt ctcctgtgtt cccagtgcct agtagtgggc agggcacgtg 48781 gaaaatgctc actatatatt actaaaggaa tccgcaaaaa gtaaaaccct ggttgaatat 48841 tctctagcat atcatagatt cactcatcat gctggtttgc tgattaaatc actgtacaaa 48901 atatggttag atatcgcttc cactaaaata agaatataac gtaggagcta tccttatcct 48961 caaaactctg gccaacgtac actctgctat tcaacaagag gaatacgctc ctttcttgtt 49021 taagcaactg ttatttggtt gttttggcca tgtgtatcct attactattt ccccttccta 49081 tctgagaaac atcactactt cattattctc tatacctaca ccatgtcatc tccttcatag 49141 aacttatcat aatttctagt tataaatatg ttttatagtt ataatgtgat tcatttgtca 49201 taaagcaaat gatatacatc tccttgctat ggcagatgtg acatctacgt tcttaaagtc 49261 aaattttcaa tatactatta ttattaacaa aatttgattt tcattttctg ttcccactca 49321 caaactcact aagagaccct cgatactgtc tacagcaaaa gatccatgca atcagaagag 49381 gagttgatgg acatgaccat gtaatcatag cttggagact gccttgtggc cctgacttgc 49441 aaggaaacta gttaaaggaa aatgagggat ggtgtgtgca atctgttatg aagggatgcc 49501 atcccactct tccattaatc attgacttct gccaatgtca gagaaggaaa acaaaacttg 49561 tcatccactg acttagcatg aaagctctta tattcccaaa gaacaatgct ccttacctgt 49621 ctgggccttg gtgtcagcct tcttgctctc ctgggccccc ttctgggccc acacgtgcac 49681 cgtgtactcc acacccggcc tcaggcccgt caggacagtg ctactctgct ccttccccac 49741 cggaacctcc ctggtctctc cgtccttggc agaggtgtag cgcaccacat acctgtcaat 49801 ggtggcccgc accgggtccc aggagacagt ggccatattc tctgtcaccc ggtcggtcac 49861 caggttttgg gggctgtcaa tgtctgaaaa aaaaaattgc tgatcaatga acactatgaa 49921 ctggtaccaa agagtgagat ctgggcattt tacttgtggt aaggcagttt cctgttctta 49981 tggtttctcc atatcatatg ctggttttca gttgtttcat ggatatagac cttatcccca 50041 cagctggatt cttggatttt caacaaggca gcacaggaga gtggtatgag aactaaatcc 50101 aaagtcagaa gcactgggtt ctagttccag cccagagagg cagagcagag gtgcaggttg 50161 gagccaggtt actagcaatc taatcccagc tctgccatct gccagttgtg gctccggaca 50221 gttcagctgt tcctggctgg gcctcagctt cttcatctgt aaaatctgag gtttgggtta 50281 gataaacttt taaggttcat tcaagcatga atattccatg catgtaacta atttcataac 50341 ttactaactt gtttttgtga cttggggcaa gttgctttac ttctgtattt gtaacaaggg 50401 gttgttaatg attattgcac ttatcactct gagtagctct gtaaaccaag agaaaaaaat 50461 gatatgtagc aaaggggctt tgaaaatgtt aaagtattac agtgtttacc actctgaagc 50521 cacagaatat ttactctata atttaaaata taattaaggg caaacctacc tctctgcctc 50581 aattttctcc acaagaaaaa aaaggcaggg aaaaataata ttagtacata ttctacagag 50641 ttattgtgaa tattatatga gctaataaca tgtctcctag tgtgacatca gcaagatggc 50701 cagttagaaa tccctgacac tcatcttctg cacaaaaaca actaaaagaa taaacaacga 50761 catttaaaaa aaaaataact aaaggagagt gcagaattct gtcaaaggag aaacagaaac 50821 tctggtgagc acagaaactc aggatggcca catagagaac agaaggaaac actgggaccc 50881 caccacccca ttgcccaggt aagagcagct ggaaaccagg aagaacttct ccctacagca 50941 aaacgtaagc aagaagatcc cagtagcccc tatcagcacc ttgtacacct acagtcctca 51001 ccactgggat ctcctacagc tctcagaggc actaagccca gctgagggag ctgcctacag 51061 gtacacaact gtgctccctc agaaatggag ccaacactgt gccccacccc ctgtggcctg 51121 catggctatt gtactacacc gtcttggtgt cttggaactg gagctactgt tggaatgtgt 51181 cttgctctcg gggtgagtag ccattcaccc cttcatccct aagggtaaac tgctgctgag 51241 acaccaccac ccagtgacct gacatctcca agcggagctg tgagcaactg ttacaccttt 51301 ccccatgggg ccaagcagca gtggaacgac tccacatccc cctcccctac ccactcattc 51361 cagagctgaa actgcacgct ctttcgtgga gaatcattac tttggtaaaa cagctccatc 51421 cacttctgtc acactaggca gcactctact cgtaggggcc tgagctgaaa ctgcactagt 51481 ttcagctcct cagggaagct ccccagggaa caatgctttt ggcagagccc tatatcccag 51541 gaaaatggtg ttgaggccat ccagcatagt cacatccctc aggactgagc tggagtggca 51601 cattgcccac tggggaatca gtgcctcaac caagctgagc agctgcacgt cctagggctg 51661 tgctgacata gtaccctagg tcccagggaa atacagcagt ggttggctaa gacaccccac 51721 cctactggcc agacaactct agtattattc ttccctagag ctggactagg ccccctgagt 51781 ctgagctcct gagacactct tctcccaggg agtagagtca ttgttgtgct gctccctgcc 51841 ctggtacagc ccaaaaaaca gtgtgcttga ttattttggg atgcttgctg ctgctgcatc 51901 tggccttaca gagactggga tactgccaag tcccaccatt gtagagttta gaatccccac 51961 tacacaattc ctcaccccta gggatccaag ttgccactga gctctattgg tgcaggtttc 52021 caaattacag caattacagc catatcctgc aacctggtct cacacttcca gagcacttct 52081 tcttcc // LOCUS HS1433 1862 bp RNA PRI 18-MAY-1993 DEFINITION Human mRNA for 14.3.3 protein, a protein kinase regulator. ACCESSION X56468 NID g23221 KEYWORDS protein kinase regulator. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa/Eumycota group; Metazoa; Eumetazoa; Bilateria; Coelomata; Deuterostomia; Chordata; Vertebrata; Gnathostomata; Osteichthyes; Sarcopterygii; Choanata; Tetrapoda; Amniota; Mammalia; Theria; Eutheria; Archonta; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1862) AUTHORS Nielsen,P.J. TITLE Direct Submission JOURNAL Submitted (06-NOV-1990) to the EMBL/GenBank/DDBJ databases. P.J. Nielsen, MAX PLANCK INSTITUT FUER IMMUNBIOLOGIE, STEUBEWEG 51, 7800 FREIBURG, GERMANY REFERENCE 2 (bases 1 to 1862) AUTHORS Nielsen,P.J. TITLE Primary structure of a human protein kinase regulator protein JOURNAL Biochim. Biophys. Acta 1088 (3), 425-428 (1991) MEDLINE 91198149 COMMENT abundant acidic protein in the brain; activates tyrosine and tryptophan hydroxylases in the presence of Ca2+/calmodulin-dependent protein kinase II X56468 and X57347 are related sequences. FEATURES Location/Qualifiers source 1..1862 /organism="Homo sapiens" /dev_stage="mature T-cell lymphoma" /tissue_type="lymphoid" /cell_type="T-cell" /cell_line="Jurkat" /clone_lib="lambda-gt11, cDNA" mRNA 1..1862 /note="14.3.3 protein" /evidence=experimental CDS 126..863 /codon_start=1 /product="14.3.3 protein" /db_xref="PID:g23222" /translation="MEKTELIQKAKLAEQAERYDDMATCMKAVTEQGAELSNEERNLL SVAYKNVVGGRRSAWRVISSIEQKTDTSDKKLQLIKDYREKVESELRSICTTVLELLD KYLIANATNPESKVFYLKMKGDYFRYLAEVACGDDRKQTIDNSQGAYQEAFDISKKEM QPTHPIRLGLALNFSVFYYEILNNPELACTLAKTAFDEAIAELDTLNEDSYKDSTLIM QLLRDNLTLWTSDSAGEECDAAEGAEN" polyA_signal 1514..1519 /note="putative" polyA_signal 1659..1664 /note="putative" BASE COUNT 506 a 428 c 413 g 515 t ORIGIN 1 gtggtgggac tcgcgtcgcg gccgcggaga cgtgaagctc tcgaggctcc tcccgctgcg 61 ggtcggcgct cgccctcgct ctcctcgccc tccgccccgg ccccggcccc ggccccgcgc 121 ccgccatgga gaagactgag ctgatccaga aggccaagct ggccgagcag gccgagcgct 181 acgacgacat ggccacctgc atgaaggcag tgaccgagca gggcgccgag ctgtccaacg 241 aggagcgcaa cctgctctcc gtggcctaca agaacgtggt cgggggccgc aggtccgcct 301 ggagggtcat ctctagcatc gagcagaaga ccgacacctc cgacaagaag ttgcagctga 361 ttaaggacta tcgggagaaa gtggagtccg agctgagatc catctgcacc acggtgctgg 421 aattgttgga taaatattta atagccaatg caactaatcc agagagtaag gtcttctatc 481 tgaaaatgaa gggtgattac ttccggtacc ttgctgaagt tgcgtgtggt gatgatcgaa 541 aacaaacgat agataattcc caaggagctt accaagaggc atttgatata agcaagaaag 601 agatgcaacc cacacaccca atccgcctgg ggcttgctct taacttttct gtattttact 661 atgagattct taataaccca gagcttgcct gcacgctggc taaaacggct tttgatgagg 721 ccattgctga acttgataca ctgaatgaag actcatacaa agacagcacc ctcatcatgc 781 agttgcttag agacaaccta acactttgga catcagacag tgcaggagaa gaatgtgatg 841 cggcagaagg ggctgaaaac taaatccata cagggtgtca tccttctttc cttcaagaaa 901 cctttttaca catctccatt ccttattcca cttggatttc ctatagcaaa gaaacccatt 961 catgtgtatg gaatcaactg tttatagtct tttcacactg cagctttggg aaaacttcat 1021 tccttgattt gtgtttgtct tggccttcct ggtgtgcagt actgctgtag aaaagtatta 1081 atagcttcat ttcatataaa cataagtaac tcccaaacac ttatgtagag gactaaaaat 1141 gtatctggta tttaagtaat ctgaaccagt tctgcaagtg actgtgtttt gtattactgt 1201 gaaaataaga aaatgtagtt aattacaatt taaagagtat tccacataac ttcttaattt 1261 ctacattccc tcccttactc ttcgggggtt tcctttcagt aagcaacttt tccatgctct 1321 taatgtattc ctttttagta ggaatccgga agtattagat tgaatggaaa agcacttgcc 1381 atctctgtct aggggtcaca aattgaaatg gctcctgtat cacataccgg aggtcttgtg 1441 tatctgtggc caacagggag tttccttatt cactctttat ttgctgctgt ttaagttgcc 1501 aacctcccct cccaataaaa attcacttac acctcctgcc tttgtagttc tggtattcac 1561 tttactatgt gatagaagta gcatgttgct gccagaatac aagcattgct tttggcaaat 1621 taaagtgcat gtcatttctt aatacactag aaaggggaaa taaattaaag tacacaagtc 1681 caagtctaaa actttagtac ttttcccatg cagatttgtg cacatgtgag agggtgtcca 1741 gtttgtctag tgattgttat ttagagagtt ggaccactat tgtgtgttgc taatcattga 1801 ctgtagtccc aaaaaagcct tgtgaaaatg ttatgcccta tgtaacagca gagtaacata 1861 aa // LOCUS HS14A4BT 292 bp DNA PRI 12-MAY-1995 DEFINITION H.sapiens 14A4BT DNA sequence. ACCESSION X72881 NID g667002 KEYWORDS Alu repeat; L1 repeat. SOURCE human. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 292) AUTHORS Argyrokastritis,A., Leversha,M.A., Ferguson-Smith,M. and Moschonas,M.K. TITLE A cosmid clone mapped to human chromosome 11p15 detects a Taq I restriction fragment length polymorphism JOURNAL Unpublished REFERENCE 2 (bases 1 to 292) AUTHORS Moschonas,N.K. TITLE Direct Submission JOURNAL Submitted (25-MAR-1993) to the EMBL/GenBank/DDBJ databases. N.K. Moschonas, Inst. of Molecular Biology & Biotechnology, Forth Dept of Biology, University of Crete, PO Box 1527, GR-711 10 Heraklion/Crete, GREECE FEATURES Location/Qualifiers source 1..292 /organism="Homo sapiens" /tissue_type="placenta" /clone_lib="pJB8-human genomic DNA cosmid library" /clone="14a" /chromosome="11" /map="11p15" CDS 82..123 /note="ORF" /codon_start=1 /db_xref="PID:g667003" /translation="MRGSEASKPARFR" CDS 197..265 /note="ORF" /codon_start=1 /db_xref="PID:g667004" /translation="MDRPDYQEAHSLGEQRANTEEH" BASE COUNT 102 a 55 c 87 g 48 t ORIGIN 1 tagagagaaa aaacttagag tgagaaggag tggacaacat acgcacaagc acccaaaact 61 aactgccctg ggtttactca catgcgtggg tctgaggcct caaagccagc acggttcagg 121 tagtaaccag tcactgcatc tgggatcaga gaaatcaatt aaagaactgt cagcagagcg 181 gaggagggtg agggaaatgg ataggcctga ttatcaggaa gctcacagtt tgggtgagca 241 gagggcaaac acagaagagc actaggggag caaatggaag agactctgaa aa // LOCUS HS14AGGRE 3202 bp RNA PRI 17-MAY-1996 DEFINITION H.sapiens mRNA for -14 gene, containing globin regulatory element. ACCESSION X90857 NID g984124 KEYWORDS -14 gene; globin regulatory element. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3202) AUTHORS Vyas,P., Vickers,M.A., Picketts,D.J. and Higgs,D.R. TITLE Conservation of position and sequence of a novel, widely expressed gene containing the major human alpha-globin regulatory element JOURNAL Genomics 29 (3), 679-689 (1995) MEDLINE 96121379 REFERENCE 2 (bases 1 to 3202) AUTHORS Higgs,D.R. TITLE Direct Submission JOURNAL Submitted (16-AUG-1995) D.R. Higgs, Inst. of Molecular Medicine, John Radcliffe Hospital, Headington, Oxford, OX3 9DU, UK FEATURES Location/Qualifiers source 1..3202 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" /map="p13.3" gene 286..1995 /gene="-14" CDS 286..1995 /gene="-14" /codon_start=1 /db_xref="PID:g984125" /translation="MRDNTSPISVILVSSGSRGNKLLFRYPFQRSQEHPASQTSKPRS RYAASNTGDHADEQDGDSRFSDVILATILATKSEMCGQKFELKIDNVRFVGHPTLLQH ALGQISKTDPSPKREAPTMILFNVVFALRANADPSVINCLHNLSRRIATVLQHEERRC QYLTREAKLILALQDEVSAMADGNEGPQSPFHHILPKCKLARDLKEAYDSLCTSGVVR LHINSWLEVSFCLPHKIHYAASSLIPPEAIERSLKAIRPYHALLLLSDEKSLLGELPI DCSPALVRVIKTTSAVKNLQQLAQDADLALLQVFQLAAHLVYWGKAIIIYPLCENNVY MLSPNASVCLYSPLAEQFSHQFPSHDLPSVLAKFSLPVSLSEFRNPLAPAVQETQLIQ MVVWMLQRRLLIQLHTYVCLMASPSEEEPRPREDDVPFTARVGGRSLSTPNALSFGSP TSSDDMTLTSPSMDNSSAELLPSGDSPLNQRMTENLLASLSEHERAAILSVPAAQNPE DLRMFARLLHYFRGRHHLEEIMYNENTRRSQLLMLFDKFRSVLVVTTHEDPVIAVFQA LLP" misc_feature 2881..>3202 /note="alternate 3' end" /evidence=experimental BASE COUNT 593 a 1097 c 875 g 637 t ORIGIN 1 ttcccatccg caccagcaag agggcccctg tgccccaaac gcgaaccgta cggctccaga 61 cagcaccgcg gaactcggtg gcttccagaa ggccccgcgc ctgcgcattc cgctgcctgc 121 gcctgcgcct gcgcctgcgc cgttctcccg gccgccgcct tagcacctcc tccggacggt 181 gtcgccgaag tctcgcgagc ccggagcgtg gcacgtgggc cccctccgcc tccggctccg 241 tcctcctctg gccccctccg cccccggccc cggccccacg gcgggatgcg ggacaacacc 301 agccccatca gcgtgattct ggtgagctcg gggagcaggg gcaataagct gctgttcagg 361 taccccttcc agagaagcca ggagcacccg gcgtcccaga caagtaagcc gcgtagcaga 421 tacgctgcca gcaacacggg cgaccatgct gatgagcagg acggcgattc caggttttca 481 gatgttattc tggcaacaat tttggcaacc aagtctgaaa tgtgtggcca aaaatttgaa 541 ctgaagattg ataatgtgcg atttgttggg cacccaacac tgctacagca tgctctgggg 601 cagatctcca aaacagatcc ttccccgaag agggaagcac ctactatgat tctttttaat 661 gtggtgtttg cactgagggc caacgcagac ccgtcagtga taaactgtct gcataacctg 721 tcccgtcgta tcgccaccgt gctgcagcac gaggagcgcc gctgccagta cctcacccgg 781 gaggccaagc tgatcctggc gctccaggat gaggtgtccg ccatggctga tggaaatgaa 841 ggtcctcagt ccccattcca tcacatcctg cccaagtgca agctggccag ggacctcaag 901 gaagcttatg acagcctgtg cacgtcgggc gtagttcggc ttcacatcaa cagctggctg 961 gaggtgagct tctgcctgcc ccacaagatc cactatgcgg cctccagtct gatcccccca 1021 gaggccatcg aacggagcct gaaagccatc cgcccctacc atgccctgct gctgctcagt 1081 gatgagaagt ccttgctggg tgagcttcct attgactgct cccctgccct agtgcgggtg 1141 atcaagacca catctgctgt gaagaacctg cagcagctag cccaagatgc ggacctggcc 1201 ttgctgcagg ttttccagct tgcagctcat ctggtgtact ggggcaaggc catcatcatc 1261 tacccgctgt gtgagaacaa cgtctacatg ctgtctccca atgccagcgt atgtctgtac 1321 tccccgctgg ccgagcagtt ctcccaccag ttcccatctc atgacctgcc gtccgttctt 1381 gccaagttct ccttgccggt ctccttgtca gaatttagga atcccctggc ccccgctgtg 1441 caggagaccc agctcatcca gatggtggtg tggatgctgc agcgccggct tctcatccag 1501 ctgcacacct atgtctgcct gatggcctca cccagcgagg aggagccccg tccgcgagag 1561 gacgacgtcc ccttcactgc ccgggtcggc ggtcgcagcc tcagcacgcc caacgccctc 1621 agctttggct ccccaaccag cagcgatgac atgaccctca ccagccccag catggacaac 1681 tccagcgcag agctacttcc cagcggggac tcgccactga accagaggat gacggagaac 1741 ctgctggcca gcctgtcgga gcatgaacgc gcagccatcc tcagtgtacc cgcagcccag 1801 aaccctgagg acctccgcat gtttgccagg ctccttcact acttccgcgg ccgccaccac 1861 ctggaggaga ttatgtacaa cgagaacacg cggcgctccc agctgctcat gctgtttgac 1921 aagttccgca gcgtgctggt ggtgaccacc cacgaggacc ctgtcattgc cgtcttccag 1981 gctctgctcc cctgagccca ggcggagggc ggaaggctgc tggggtgcgc aggtgggcgc 2041 tcgcgtctcc ccaccccagg gctccccccg tgctgaggct gagccctctt ggccctgagg 2101 cctggcattg ggtggatgcg ggctggccgt ggcccagtga agcctgcaga gccccgctgt 2161 ccttgcccct ggtggttccg ctgtggggct gctgccctct gtgttcctac gcttccctcc 2221 agtccttgcc gcacgcgtag gcatctccac gctggagggg ggcctgccag gactcctgtc 2281 tctgggtgag gccgcatcct tcaaggccca tgtgggctgt gcgtttctca gaccctcctt 2341 ctgtccctac acctgctcct tggacccccc agtctgtggc caccctgaag aatgtgcaga 2401 aacacttgtg tggcctgtcc ctgtctctct gacagccttc catttgtgaa gtgccctgtg 2461 gccccctccc cagcacctct gtctgccatg cgcttcttcc tcccaggcta ccctgagcct 2521 ttcctggccc agtcctcacc acagtccaca gaagccacaa acaggctcat agccaagctg 2581 tgacctggtc ctgaccatct ggggcacgag ggcctgggct ggccctgcta ggctggagaa 2641 gccctgtcac ctgtgcacat cttgctggtg gaggcatggc ccactgtgca ggaccccacc 2701 ctcgggggct ttcggctccc acactgatga ttctccccag catccacacc gggcctggcg 2761 ctacgtactc aggcccccag ctctcgtgtc ctggaggaag agctagctcc agacatgggt 2821 tgatcaccta gaggagctct ggctaaggca cagttttcta gaaataaaac atttattcgg 2881 gtttggaaag gccccttgga ttcctggctg gggacaatga tgacctggac cctggccaga 2941 agagccctgg ccctccagca aggcagcacc tgctctgatg cacctgtgca cccctggccc 3001 tcccagccac ctgggagccc gaagcttagc ctgtaggtgg ccaagccagt ccctactctt 3061 gggctgcggc cacgtgaggc tccgcatcag cagccagggg cgagcactag tggacaaagc 3121 cagcaaatgc ggcgttcctg tgagcagata ccctcaacca gccagctgtc aaaagtattt 3181 caagataaaa tccaggccaa gc // LOCUS HS14KDAPT 1020 bp mRNA PRI 17-JAN-1998 DEFINITION Homo sapiens mRNA for translational inhibitor protein p14.5. ACCESSION X95384 NID g2792003 KEYWORDS 14.5 kDa protein; translational inhibitor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1020) AUTHORS Schmiedeknecht,G., Kerkhoff,C., Orso,E., Stohr,J., Aslanidis,C., Nagy,G.M., Knuechel,R. and Schmitz,G. TITLE Isolation and characterization of a 14.5-kDa trichloroacetic-acid-soluble translational inhibitor protein from human monocytes that is upregulated upon cellular differentiation JOURNAL Eur. J. Biochem. 242 (2), 339-351 (1996) MEDLINE 97129113 REFERENCE 2 (bases 1 to 1020) AUTHORS Schmitz,G. TITLE Direct Submission JOURNAL Submitted (29-JAN-1996) G. Schmitz, Inst. Clinical Chemistry and Laboratory Medicine, University Regensburg, D-93042, Regensburg, FRG REMARK Revised by [3] REFERENCE 3 (bases 1 to 1020) AUTHORS Schmitz,G. TITLE Direct Submission JOURNAL Submitted (16-JAN-1998) G. Schmitz, Inst. Clinical Chemistry and Laboratory Medicine, University Regensburg, D-93042, Regensburg, FRG COMMENT Related sequences: H97319, H62771, R89756, H79856, H61480, R94233, R89124, R92013, D49363, X70825, H71836, H67621, T98630, T98680, H67085, H79855, R91820, H59495, H71835, R91725, T39930, R94329, R92158, D62115, and H62831. FEATURES Location/Qualifiers source 1..1020 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 95..508 /note="expressed mainly in liver and kidney" /codon_start=1 /product="14.5 kDa translational inhibitor protein, p14.5" /db_xref="PID:e1240168" /db_xref="PID:g1177435" /translation="MSSLIRRVISTAKAPGAIGPYSQAVLVDRTIYISGQIGMDPSSG QLVSGGVAEEAKQALKNMGEILKAAGCDFTNVVKTTVLLADINDFNTVNEIYKQYFKS NFPARAAYQVAALPKGSRIEIEAVAIQGPLTTASL" BASE COUNT 342 a 168 c 225 g 285 t ORIGIN 1 ggcgctgctg tggttggtca gtccagtaag aagccagcag ggctggtgct ggggcttctt 61 ctcctgaagg ggctgcaaga gggaaggctt agccatgtcg tccttgatca gaagggtgat 121 cagcaccgcg aaagccccag gggccattgg accctacagt caagctgtat tagtcgacag 181 gaccatttac atttcaggac agataggcat ggacccttca agtggacagc ttgtgtcagg 241 aggggtagca gaagaagcta aacaagctct taaaaacatg ggtgaaattc tgaaagctgc 301 aggctgtgac ttcactaacg tggtgaaaac aactgttctt ctggctgaca taaatgactt 361 caatactgtc aatgaaatct acaaacagta tttcaagagt aattttcctg ctagagctgc 421 ttaccaagtt gctgctttac ccaaaggcag ccgaattgaa attgaagcag tagctatcca 481 aggaccactg acaacggcat cactataagt gggcccagtg ctgtgtagtc tggaattgtt 541 aacattttaa tttttacaat tgatgtaaca tcttaattaa ccttttaatt ttcacaattg 601 atgacagggt gagtttgatg aaaatatctg aagctattat ggaaatacca tgtaataggg 661 agagttgaac atgaatatta gagaaggaat ccagttactt ttttaaatta cacctgtgtg 721 cacctgtatt actgaatata ggaaagagat acccattaca tagttactca gtaaacaaaa 781 gagaaatacc aggtaggaaa gaagagttac tattcctgag aaataatcaa gaacatattt 841 aatttaaact aatgatgtga actatttagt tttgatgtcc gttatgtgat tctgctttta 901 cttgagtaaa attaaagtgt ttaaatttga gatcaaggag aagatagtgg aacaaaatgt 961 tatatagata atatttttct aatggaaata aaataggcag atttccaaaa aaaaaaaaaa // LOCUS HS15E1 57248 bp DNA PRI 28-JAN-1998 DEFINITION Human DNA sequence from BAC 15E1 on chromosome 12. Contains Cytochrome C Oxidase Polypeptide VIa-liver precursor gene, 60S ribosomal protein L31 pseudogene, pre-mRNA splicing factor SRp30c gene, two putative genes, ESTs, STSs and putative CpG islands. ACCESSION AL021546 NID g2826890 KEYWORDS 12; 60S ribosomal protein L31; CpG island; Cytochrome C Oxidase Polypeptide VIa-liver; SRp30c. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 57248) AUTHORS Murphy,L. TITLE Direct Submission JOURNAL Submitted (13-JAN-1998) E-mail enquires: humquery@sanger.ac.uk Clone requests: clonerequest@sanger.ac.uk COMMENT IMPORTANT: This sequence is not the entire insert of clone 15E1. It may be shorter because we only sequence overlapping sections once, or longer because we arrange for a small overlap between neighbouring submissions. During sequence assembly data is compared from overlapping clones. Where differences are found these are annotated as variations together with a note of the overlapping clone name. Note that the variations annotated may not be found in the sequence submission corresponding to the overlapping clone as we submit sequences with only a small overlap as described above. This sequence was generated from part of bacterial clone contigs of human chromosome 12, constructed by the Sanger Centre chromosome 12 mapping group. This sequence has been finished according to sequence map criteria as follows. An attempt is made to resolve all sequencing problems, such as compressions and repeats, but not necessarily within known annotated human repeat sequence elements (e.g. Alu). Where the sequence is ambiguous, there is an annotation using the 'unsure' feature key. The true right end of clone 166H1 is at 1098. The true left end of clone 75N14 is at 55991. 15E1 is from the Research Genetics human BAC library by Birren et al. VECTOR: pBeloBAC11. FEATURES Location/Qualifiers source 1..57248 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="12" /clone="15E1" /cell_line="978SK" repeat_region 1..97 /note="L1MC1 repeat: matches 980..1071 of consensus" repeat_region 132..436 /note="AluY repeat: matches 1..301 of consensus" repeat_region 438..718 /note="AluJo repeat: matches 281..1 of consensus; incomplete repeat" repeat_region 746..1035 /note="AluSq repeat: matches 2..293 of consensus" repeat_region 1039..1337 /note="AluSx repeat: matches 3..301 of consensus" repeat_region 1447..1544 /note="MIR repeat: matches 212..108 of consensus" repeat_region 1830..2117 /note="AluSq repeat: matches 1..291 of consensus" repeat_region 2118..2414 /note="AluY repeat: matches 8..301 of consensus" repeat_region 2780..2934 /note="MIR repeat: matches 233..71 of consensus" repeat_region 3219..3507 /note="AluSx repeat: matches 302..1 of consensus" repeat_region 3561..3862 /note="AluSx repeat: matches 302..1 of consensus" repeat_region 3921..3965 /note="MIR2 repeat: matches 142..97 of consensus" repeat_region 4026..4342 /note="AluJo repeat: matches 302..1 of consensus" repeat_region 4578..4874 /note="AluSx repeat: matches 301..1 of consensus" repeat_region 4948..5076 /note="FLAM_C repeat: matches 133..5 of consensus" repeat_region 5713..5911 /note="MIR repeat: matches 13..204 of consensus" repeat_region 6218..6245 /note="14 copies of 2 mer 89 % conserved" repeat_region 6247..6409 /note="AluJo repeat: matches 285..120 of consensus; incomplete repeat" repeat_region 6417..6716 /note="AluY repeat: matches 301..2 of consensus" repeat_region 6749..7039 /note="AluSq repeat: matches 292..1 of consensus" repeat_region 7050..7165 /note="AluJo repeat: matches 124..11 of consensus; incomplete repeat" repeat_region 7197..7335 /note="L1PB3 repeat: matches 883..745 of consensus" repeat_region 7362..7592 /note="AluJb repeat: matches 237..1 of consensus; incomplete repeat" repeat_region 7756..8058 /note="AluY repeat: matches 301..2 of consensus" repeat_region 8088..8158 /note="AluSg repeat: matches 230..300 of consensus; incomplete repeat" repeat_region 8182..8481 /note="AluSp repeat: matches 301..1 of consensus" repeat_region 8489..8770 /note="AluJo repeat: matches 288..2 of consensus" repeat_region 9057..9223 /note="AluSg repeat: matches 134..300 of consensus; incomplete repeat" repeat_region 9234..9308 /note="MIR2 repeat: matches 108..33 of consensus" repeat_region 9736..9921 /note="AluSq repeat: matches 1..186 of consensus; incomplete repeat" repeat_region 9927..10144 /note="AluSx repeat: matches 216..1 of consensus; incomplete repeat" repeat_region 10177..10242 /note="AluJ repeat: matches 238..302 of consensus; incomplete repeat" repeat_region 10512..10557 /note="MIR2 repeat: matches 64..19 of consensus" repeat_region 10601..10894 /note="AluSq repeat: matches 299..1 of consensus" repeat_region 10895..11031 /note="AluSq repeat: matches 135..1 of consensus; incomplete repeat" repeat_region 11068..11200 /note="FLAM_C repeat: matches 133..1 of consensus" repeat_region 11566..11670 /note="MIR repeat: matches 217..112 of consensus" repeat_region 11733..12029 /note="AluSg repeat: matches 1..297 of consensus" repeat_region 12426..12723 /note="AluSx repeat: matches 1..298 of consensus" repeat_region 13132..13319 /note="MIR repeat: matches 212..16 of consensus" repeat_region 13443..13594 /note="AluSx repeat: matches 299..134 of consensus; incomplete repeat" repeat_region 13595..13895 /note="AluSq repeat: matches 292..1 of consensus" repeat_region 13897..14197 /note="AluSx repeat: matches 302..1 of consensus" repeat_region 14488..14785 /note="AluSq repeat: matches 300..3 of consensus" repeat_region 14804..14854 /note="MIR repeat: matches 260..210 of consensus" repeat_region 14926..15105 /note="AluJo repeat: matches 1..183 of consensus; incomplete repeat" repeat_region 15111..15398 /note="AluSx repeat: matches 6..292 of consensus" repeat_region 15530..15642 /note="FLAM_A repeat: matches 132..20 of consensus" repeat_region 15876..15949 /note="MIR repeat: matches 102..23 of consensus" repeat_region 15970..16270 /note="AluSx repeat: matches 302..1 of consensus" repeat_region 16271..16560 /note="AluSp repeat: matches 301..12 of consensus" repeat_region 16721..16843 /note="FLAM_C repeat: matches 133..7 of consensus" repeat_region 16877..16902 /note="13 copies of 2 mer 100 % conserved" repeat_region 16903..17208 /note="AluSg repeat: matches 1..300 of consensus" repeat_region 17307..17461 /note="MIR2 repeat: matches 146..1 of consensus" misc_feature 17467..18357 /note="putative CpG island" mRNA join(17895..18023,18173..18315,20248..20523) /gene="COX6A2" /note="match: cDNAs X15341 M38520 U08440 X12553 L06456 X79866; match: ESTs AA190489 D52123 AA173875 W95832 AA173876 AA443968 AA588882 D54544 AA593799 H71914 AA122190 N48244 H93458 AA046801 AA259210 H84585 N63557 AA482243 AA662308 N42987 AA444149 W19445 AA659530 AA644566 W96092 AA037779 AA338795 AA339849 H18231 C02469 AA598458 W95966 H48072 AA502162 W32700 AA457100 W32701 H69728 T52256 AA522924 AA483109 AA133892 AA315179 AA046818 AA483741 R98452 AA307397 AA485567 W56867 AA464996 D55139 N48060 N30549 T52180 H78844 AA482340 N95438 AA405296 W96035 W60497 W69346 T29498 D51705 D55107 D55463 H47721 AA664032 AA321519 AA516183 AA431269 AA464455; AA173876 and AA173875 match intronic sequence starting at position 19923" /evidence=not_experimental /product="Cytochrome C Oxidase Polypeptide VIa-liver precursor (EC 1.9.3.1)" gene 17895..20523 /gene="COX6A2" CDS join(17921..18023,18173..18315,20248..20331) /gene="COX6A2" /note="match: proteins P13182 P12074 P10818 P43024 Q02221 P07471 P10817 P43023 P13182 O13085 O13082" /codon_start=1 /evidence=not_experimental /product="Cytochrome C Oxidase Polypeptide VIa-liver precursor (EC 1.9.3.1)" /db_xref="PID:e1248288" /db_xref="PID:g2826891" /translation="MAVVGVSSVSRLLGRSRPQLGRPMSSGAHGEEGSARMWKTLTFF VALPGVAVSMLNVYLKSHHGEHERPEFIAYPHLRIRTKPFPWGDGNHTLFHNPHVNPL PTGYEDE" misc_feature complement(join(18222..18315,20248..20524)) /note="match: STS G06862" repeat_region 18580..18607 /note="14 copies of 2 mer 96 % conserved" repeat_region 18692..18846 /note="AluSx repeat: matches 146..296 of consensus; incomplete repeat" repeat_region 19039..19344 /note="AluSq repeat: matches 1..301 of consensus" repeat_region 19345..19379 /note="U2 repeat: matches 35..1 of consensus" repeat_region 19528..19760 /note="MER20 repeat: matches 217..1 of consensus" misc_feature 20369..20504 /gene="COX6A2" /note="match: STS G29082" repeat_region 20793..21065 /note="AluSx repeat: matches 1..295 of consensus" repeat_region 21260..21560 /note="AluY repeat: matches 301..1 of consensus" repeat_region 21777..22078 /note="AluYa5 repeat: matches 301..1 of consensus" repeat_region 22092..22395 /note="AluSp repeat: matches 303..1 of consensus" repeat_region 22869..23160 /note="AluJb repeat: matches 299..1 of consensus" repeat_region 23331..23593 /note="AluSg repeat: matches 1..292 of consensus" prim_transcript 23755..24305 /note="match: ESTs AA369283 AA631304 AA100694 F21636 AA641805 AA316591 AA187245 AA101703 AA641755" mRNA complement(join(<24350..24749,26020..>26166)) /gene="15E1.1" /note="match: ESTs AA597073 AA457942 AA087475 AA369284 AA220309 AA187377 W53804 AA288126 W10025" /evidence=not_experimental /product="GENSCAN prediction 15E1.1" gene complement(24350..26166) /gene="15E1.1" CDS complement(join(24666..24749,26020..26166)) /gene="15E1.1" /note="predicted by GENSCAN, supported by EST matches" /codon_start=1 /evidence=not_experimental /product="GENSCAN prediction 15E1.1" /db_xref="PID:e1248289" /db_xref="PID:g2826892" /translation="MNSVGEACTDMKREYDQCFNRWFAEKFLKGDSSGDPCTDLFKRY QQCVQKAIKEKEIPIEGLEFMGHGKEKPENSS" repeat_region 24903..25199 /note="AluSx repeat: matches 1..297 of consensus" misc_feature 26039..26694 /note="putative CpG island" gene 26275..39822 /gene="15E1.2" CDS join(26275..26355,26451..26623,36870..36973,39701..39753) /gene="15E1.2" /note="partially predicted by GENSCAN, supported by EST matches" /codon_start=1 /evidence=not_experimental /product="predicted protein 15E1.2" /db_xref="PID:e1248290" /db_xref="PID:g2826893" /translation="MWSRLVWLGLRAPLGGRQGFTSKADPQGSGRITAAVIEHLERLA LVDFGSREAVARLEKAIAFADRLRAVDTDGVEPMESVLEDRCLYLRSDNVVEGNCADE LLQNSHRVVEEYFVAPPGNISLPKLDEQEPFPHS" mRNA join(<26291..26355,26451..26623,36870..36973, 39701..>39822) /gene="15E1.2" /note="match: ESTs AA101291 AA311181 AA310150 R20577" /evidence=not_experimental /product="predicted protein 15E1.2" repeat_region 26871..27168 /note="AluSg repeat: matches 298..1 of consensus" repeat_region 27548..27847 /note="AluSx repeat: matches 302..1 of consensus" repeat_region 27885..28111 /note="AluJo repeat: matches 297..40 of consensus; incomplete repeat" repeat_region 28126..28416 /note="AluSx repeat: matches 297..7 of consensus" repeat_region 28438..28726 /note="AluSq repeat: matches 303..12 of consensus" repeat_region 28752..29050 /note="AluY repeat: matches 301..2 of consensus" repeat_region 29243..29540 /note="AluSx repeat: matches 302..1 of consensus" repeat_region 29560..29855 /note="AluSp repeat: matches 297..1 of consensus" gene complement(30230..30597) /gene="15E1.3" CDS complement(30230..30597) /gene="15E1.3" /note="60S ribosomal protein L31 pseudogene; match: proteins P12947 P04649 P45841 P46290 Q06739; match: ESTs AA534302 F20561 AA301780 AA301836 N87226 AA652879 AA583859 AA614606 AA333817 H78304 AA128386; located in intron of gene 15E1.2" /codon_start=1 /pseudo /evidence=not_experimental /db_xref="PID:e1248291" repeat_region 30654..30954 /note="AluSx repeat: matches 1..299 of consensus" repeat_region 30959..31253 /note="AluSg repeat: matches 1..296 of consensus" repeat_region 31254..31554 /note="AluSx repeat: matches 1..301 of consensus" repeat_region 31670..31970 /note="AluSx repeat: matches 302..1 of consensus" repeat_region 31971..32099 /note="MIR repeat: matches 195..65 of consensus" repeat_region 32263..32495 /note="AluJo repeat: matches 221..1 of consensus; incomplete repeat" repeat_region 32516..32788 /note="AluJo repeat: matches 273..2 of consensus; incomplete repeat" repeat_region 33495..33626 /note="FLAM_A repeat: matches 1..132 of consensus" repeat_region 34092..34172 /note="MIR repeat: matches 196..100 of consensus" repeat_region 34228..34508 /note="AluY repeat: matches 286..1 of consensus" repeat_region 34680..34976 /note="AluSx repeat: matches 302..1 of consensus" repeat_region 34977..35262 /note="AluSx repeat: matches 295..1 of consensus" repeat_region 35484..35778 /note="AluY repeat: matches 295..2 of consensus" repeat_region 37398..37663 /note="AluSp repeat: matches 303..38 of consensus; incomplete repeat" repeat_region 37893..38174 /note="AluSx repeat: matches 299..1 of consensus" repeat_region 38659..38952 /note="AluY repeat: matches 295..1 of consensus" repeat_region 39369..39666 /note="AluY repeat: matches 1..299 of consensus" repeat_region 40518..40807 /note="AluY repeat: matches 1..296 of consensus" repeat_region 40897..41183 /note="AluJo repeat: matches 297..8 of consensus" mRNA complement(join(41462..41956,43744..43916,45421..45581, 49216..49455)) /gene="SRP30C" /note="match: cDNA U30825; match: ESTs H83799 AA054320 W05582 W69646 AA459073 AA054420 AA025338 AA204469 R36365 AA490721 AA138258 AA617652 N30640 AA641542 AA581846 W87911 AA156241 AA491213 AA458883 AA551037 N67420 AA132151 R05766 AA147254 W87822 W56129 AA282730 AA076082 AA036574 N27031 H10272 AA341584 H35515 AA341088 AA155566 AA667802" /evidence=not_experimental /product="pre-mRNA splicing factor SRp30c" gene complement(41462..49455) /gene="SRP30C" CDS complement(join(41813..41956,43744..43916,45421..45581, 49216..49403)) /gene="SRP30C" /note="match: proteins Q13242 Q13809 Q07955 P26686 Q39201 Q23796 P38159 Q09167 Q13245 Q13247 Q24113 Q08170 P38922" /codon_start=1 /evidence=not_experimental /product="pre-mRNA splicing factor SRp30c" /db_xref="PID:e1248292" /db_xref="PID:g2826894" /translation="MSGWADERGGEGDGRIYVGNLPTDVREKDLEDLFYKYGRIREIE LKNRHGLVPFAFVRFEDPRDAEDAIYGRNGYDYGQCRLRVEFPRTYGGRGGWPRGGRN GPPTRRSDFRVLVSGLPPSGSWQDLKDHMREAGDVCYADVQKDGVGMVEYLRKEDMEY ALRKLDDTKFRSHEGETSYIRVYPERSTSYGYSRSRSGSRGRDSPYQSRGSPHYFSPF RPY" repeat_region 42034..42193 /note="MIR repeat: matches 262..82 of consensus" repeat_region 43081..43205 /note="FLAM_A repeat: matches 1..127 of consensus" misc_feature complement(43236..43483) /gene="SRP30C" /note="match: STS G19881" repeat_region 46024..46158 /note="AluJo repeat: matches 134..301 of consensus; incomplete repeat" repeat_region 46213..46279 /note="FLAM repeat: matches 62..132 of consensus" repeat_region 46337..46628 /note="AluJo repeat: matches 295..1 of consensus" repeat_region 47021..47313 /note="AluSx repeat: matches 1..301 of consensus" repeat_region 47607..47904 /note="AluSq repeat: matches 1..299 of consensus" repeat_region 47946..48055 /note="AluJb repeat: matches 1..111 of consensus; incomplete repeat" misc_feature 48670..49885 /note="putative CpG island" repeat_region 50264..50527 /note="AluJb repeat: matches 297..2 of consensus" repeat_region 50557..50851 /note="AluSx repeat: matches 1..297 of consensus" repeat_region 50876..51189 /note="AluSx repeat: matches 1..302 of consensus" repeat_region 51404..51703 /note="AluY repeat: matches 300..2 of consensus" repeat_region 53095..53206 /note="AluJo repeat: matches 25..135 of consensus; incomplete repeat" repeat_region 53207..53506 /note="AluSx repeat: matches 2..302 of consensus" repeat_region 53510..53556 /note="AluY repeat: matches 127..173 of consensus; incomplete repeat" repeat_region 53557..53632 /note="AluJb repeat: matches 87..12 of consensus; incomplete repeat" repeat_region 53634..53776 /note="AluJb repeat: matches 154..295 of consensus; incomplete repeat" repeat_region 53895..54098 /note="AluJb repeat: matches 84..287 of consensus; incomplete repeat" repeat_region 54224..54251 /note="14 copies of 2 mer 89 % conserved" repeat_region 54266..54376 /note="L1PB1 repeat: matches 801..902 of consensus" repeat_region 55326..55531 /note="MER3 repeat: matches 2..209 of consensus" repeat_region 55634..55669 /note="MIR2 repeat: matches 101..136 of consensus" repeat_region 56405..56703 /note="AluSp repeat: matches 301..3 of consensus" repeat_region 56722..57022 /note="AluSq repeat: matches 303..1 of consensus" repeat_region 57141..57244 /note="MIR2 repeat: matches 145..41 of consensus" BASE COUNT 15003 a 13601 c 13307 g 15337 t ORIGIN 1 gtgcctgtgt ctggagagga gagagatata taggaaatct ctgtaccttc tgcttaatat 61 tgctgtgaac ctaaaactgc tccaaaaaat aaagtttttg tttgcttgtt tgtttattaa 121 gacagggtct cggctgggca tggtggctca cgcctgtaat cccagtactt tgggagactg 181 aggtgggcgg atcccaaggt caggagatgg agaccatcct ggctaacacg gtgaaacccc 241 atctctacta aaaatacaaa aacaaaatta gtcaggcgtg gtggcgggcg cctgtagtcc 301 cagctactcg ggaggctgag gcaggagaat ggtgtgaacc cggaaggcgg agcttgcagt 361 gagccgagat cgcgccactg cactcaagcc tcggcaacag agcgagactc cgtctcaaaa 421 aaaaaaaaaa aaaaaaaaga cagggtctca ctctgttgtc caggctggag tgcagtgacg 481 caatcatggc ttattcagca ttgacctcct gagctcctgt gatcctcctg cctccacctt 541 ccgagtagct gagactacag gtgtgcacca ccatgcctgg ctaacttttg tattttttgt 601 agagacgtgg gtctcactat gttgcccagg ctgatcttga cctcccgggc tcaagcaatc 661 ctcctgcctt ggcctcccaa agtgctagga ttacaggcat gagccaccat gcctggccaa 721 taaagcttaa ttaaaaagaa aagaagccag gtgcggtggt tcctgcctgt aatcccagca 781 ctttgggagg ctgaggtggg tgcatcacct gaggtcagga gtttgagacc agcctggcca 841 acatagtgaa acctcatctc tactaaaaat acaaaacttg gccgggcatg gtggcgggtg 901 cctgtaatcc cagctactcg ggatgctgag acaggagaat tgcttgaacc tgggaggcgg 961 aggttgcagt gagccaagat ctgccactgc tctccagcct gggcaacaca gtgagactct 1021 gtctcaaaaa ataaactact gggagccgtg actcatgcct gtaatcccag cactttggga 1081 ggctgaggcg ggcggatcac ctaaagtcag gagttcaaga ccagcctgtc caacatggtg 1141 aaactccgtc tttactaaaa aatacaaaaa ttagccggtt atggtggcgg gcacctgtaa 1201 tcccagctac ttgagaggct aagacaggag aattgcttga acccaggagg cggaggtagc 1261 agtgagccaa gatcgtgcca ctgcactcca gcctgggcaa cagagcggga ctccgtccag 1321 aaaaaaaaaa aaaaagagag agagagagag agaggagatt gtgcagttct caattattta 1381 tcaattttga tttcttctct ctccagacca taatactaat atcggcacca cactttgcag 1441 cttataaagc actttcaatt ctattttatt taattcttgc aaccccaggc agaaggcttt 1501 tattaccccc tttctatagg ttgggaaact gaggctcaga gaggagaaat gaaggttcca 1561 agtagaggta agtaaggcat tcaactgggg caagaattta gagttacttg atgaagtgcc 1621 taattagccc cagggcatgg tgttgacaaa ggaactataa tgaaaacact aaactctacc 1681 ttttgtggag gaagatcccc tggggcgggg gaaggaggta aggagtagag actaagaagt 1741 tgaccatgtc tggttcatct gtgatcctgg gccaatgtag gccctgggga aagatttggt 1801 gctgggttga agaaataaga agtgaaggag gccaggcaca gtggctcacg cctgtaatct 1861 cagcactttg ggaggccaag gcgggtggat cacgaagtca ggagttcaag accagcctgg 1921 ccaagatagt gaaaccccgt ctctactaaa aacacaaaat tagcagggcg tgatggtgca 1981 tgcctgtaat cccagctact cgggaggcta aggcaggaca atcgcttgaa tccaggaggc 2041 agaggttgca gtgagctgag atgtcaccat tgcactccag cctgggcaac aagagcgaaa 2101 ctccatctca acaacaacac ggtggctcac gcctgtaatc ctagcacttt aggaggctga 2161 ggtgggcaga tcgcgaggtc aggagatcaa gaccatcctg gttaacatgg tgaaaccccg 2221 tctctactaa aaatacaaaa aaaaattagc caggtattgt ggcgggtgcc tgtagtccca 2281 gctactcgcg aggttgaggc aggagaatgg catgaaccca ggaggtggag cttgcagtga 2341 gccgagatcg cgcgactgca ctccagcctg ggagacagag cgagactctg tctcaaaaaa 2401 aaaaaaaaaa aaaaaaaaaa ccaacaacaa ccaaaaaaaa aacaaaaagt gaatgaataa 2461 atgaatgaat aaagggagaa acttggagca agaccccttc gccctctctg gactcagttt 2521 ccccattttt tacttgttga cagctaagtt tctcatcctt aggaaaatga gcagacacaa 2581 tattgtactc tatttccaga tgcacacaga ccaccctcct gaacccctac tgtggatccc 2641 ttgggagggg gtgatccact gaaccctatg tagggagccc ctgaatttga ggatctgaga 2701 tcccttccag ccctggtgtt aatagttgta ataatgataa tagtaataat aataggatcc 2761 agaagttttg ggatattggc tctgggtcag gcactgtgct aaaagcttta tctcatgtaa 2821 tctctttaca gccctgtgac gcagccatgg ttattatcat ctccattttg cagatgagaa 2881 aactgaggct tagagaggct aaatgacttg ctcacgttgc acagcctaaa agtgaagtcc 2941 atggtttcca atttagcact gcagacacca gggccccagc actacccccc ctccattacg 3001 gaagagtagg acctccatgg ggcccgccct tccttccttc cttccttccc tccctccttc 3061 cttccttcct tccttccttc cttccttccc tccttccctc cctccccttc cccttcccct 3121 tccccttccc cttctccttc tccttcttct ttccttcctt cccttctctt tccttccttc 3181 ttttcgtttt gttttgtttt gtttcgtttc ttactttctt ttttcttttt tttttttttt 3241 gacggagtct ccctctgtcg cccaggctgg agtgcagtgg cgcgatctcg gctcactgca 3301 acctccgtac cctggattca agcaattctt ctgcttcagc ctcccaagta gctgggaata 3361 caggtgcgca ccacatgcct ggataacttt tctattttta gtagagacgg gggtttcact 3421 atgttggcca ggctggtctt gagctcctga cctcaggtca gccacccaaa gtgctgggat 3481 tacaggcgtg agccactctg cctggcccat ggggccatgg ggctctttca agggcaaact 3541 caagttgctt ttttcttttc tttttttttt tttttttttt gagatggagt ctcatgctgt 3601 cgcccaggct ggagtgcaat agcaccatct cagctcactg caacctccgc ctcccaggtt 3661 caagtgattc tcttgcttca gtcccctgag tagctgggat tacaggcagc tgccaccgtg 3721 cccagctaat ttttgtattt ttagtagaga cagggtttca ccatgttggc caggctgatc 3781 ttgagctgct gacgtcaagt gatcctcctg cctcggcctc ccaaagtgct gggattatag 3841 gcgtgagcca ccactcctgg cccaagttgc tgtatttgaa gagtgtatca taacactggc 3901 aaccattcat taatttaagt attcattcag aaatcatgta tgagcacctc ctatgtgcca 3961 ggcacagact ggttaatgcg ggctcatgag gatcatggtc tctgccctca agcagcccca 4021 cttatttatt atgtatttat ttatagagac agaatctcac tctgttgccc aagctggagt 4081 gcaatggcgt aatcatagct tactgcagcc ttgaactcct gggcttaagc aatcctccca 4141 cctcagcctc ctgagtagct aggactacag gtacgcacca ccatggccag ataatgtttt 4201 tcaatttttt tttttttttt gtagagatgg ggtcttacta tgtggcccag gttgctctcg 4261 aacttctggg ctcaagcaat actcccacct ctgcctccca aagtgctgaa attacaggtg 4321 gcatgagcca ccacacctgg ccagtcacat ttattttatt gttcctctcc acgagattca 4381 ggtgtgagct attgcagtgc agagtctcgg accctgttct ggtcccacag agtcggatgc 4441 tttgtggttg gagcccagga atctgcatct tacaagcttc ccagatgcct tataaacact 4501 gagatgcggg aacccctggg ttagggctga ggcgacctcg acaagcaaaa attcatcaca 4561 cagcaagaaa tgtttgcttt tttctttttt ttttttgaaa cacagtctcg ctctgtcacc 4621 caggctggag tgcagtggca caatcttggc tcactgcaac ctccacctcc tgggttcaaa 4681 tgattctcat gcctcacccc cccgcctgag tagctaggat tacaggcagg cgccaccaca 4741 cccagataat ttttgtattt ttagtagagt ccaggtttca ccatgttggc caggctggtc 4801 ttgaactctc ctgacctcaa atgatcaaac tcccaaagtg ctgggattac aggtgtgagc 4861 caccgtgccc ggccaaggaa cattttctat agggattatg ggggttgcca gggaaggctt 4921 ttgggaggag gtaacccctt tttttttttt tttttttttt tttagagaca gggtctcgcc 4981 acgttgccca agctagttta gaactcctgg gctcaagtga tcctcctgcc ttagcccccc 5041 aaagtgctgg gattacaggc atgagccacc aggcccaccc agaggtgacc ttttaagatg 5101 agacctgagg ggtgagtagt agtgaacaag tcaacgagaa gggggaagag acacagcaaa 5161 gtagaagccc cagtccttgc cccgaaggag ttgacagcca ggtggatgga ggggatatag 5221 taatcaagca tagcagatga tgcttccagg aggttgtgga aggtgaaaga agccctgact 5281 cagcctgagg aatccaggtg actttgccga ggaggaccat gttttccaac acccagatat 5341 gagtgcagac tgcgtcacaa cttgggtgaa gagaatgggt gttcaagaga atcagggcag 5401 gccaggagtg agggaggagt ctcctaagtg ttaaaggcct cttaacagcc aactagtgat 5461 gggtagaagt cccagtcgac ctctgatctg gatactgttt tcacctccat tgtcaatgct 5521 aagcctttgt ttgtttgaac taagtccttc tatctggagt ccttagtgat ccggcagtaa 5581 atggtgtaaa tggtgacatt taagtgagtg gaggattcct ttgcaatggg acccagagaa 5641 cacctgctaa ttcagaggaa tctcaggatc tcatccctct cgaaataacc ttcatggcca 5701 ccattgctga ggagtgggta gtagaactct ggccctggcc ctggccagcc ttcattcagt 5761 cctaacatgg ctgctttcta attatctgtc cttggatacg tgttacttaa ttttctgatg 5821 ctgcaatctc atactctggc gaaggaggct aataatagta cttactactg ccagagagtt 5881 attttgaaaa cagaattaat tttcttatgt attaactttt ttgtgtgagt taaattttca 5941 tttacaaaaa agaaatatat atttatatat aatatataat atataattat atattttatt 6001 tatatataat atatataatt attcattttt aatttatata tatttataat atataattat 6061 atatattaca tatatttata atatataatt atatatttta tatatttaat atatataatt 6121 atatatttta tatacttata atatatataa ttatatattt tatatattta taatatatat 6181 aattatatac attatatata tgtgtattat atatatcgtg tgtgtgtgtg tctatgtctg 6241 tgtgtgtttc agacagggtc tcgctctgtt gcccaggtgg agtgcggtgg catgaacata 6301 gctcactgca gctttgaact gggctcaagc gatcctccca cctcagcctc ccgagtagct 6361 gggaccacag gcatgtacca ccatgcctag gtaatgtctt ttatttatta atttatttat 6421 atttatttat ttatttgaga cagggtcttg ctctgttgcc taggcttgag tgcagtggcg 6481 tgatctcagc tcactgcaac ctctgcctcc ccggttcatg ccattctcct gcctcagcct 6541 cctgagcagc tgggactaca ggtgcatgcc accacacccg gctaattttt tgtattttta 6601 gtagagatgg ggtttcactg ttttagccag gatggtctcg atctcctgac ctcatgatct 6661 gcccacctcg gcctcccaaa gtgctgggat tacaggcgtg agctactgtg cccagcatgt 6721 cttttttttt ttatttttaa tttaaaaatt tttttttgag gtgaagtctc actctgttgc 6781 ccaggctgga gtgcagtggc atgatctcgg ctcactgcaa actccacctc ccaggttcaa 6841 gcgattctct tgcctcagcc tcctgagtag ctcggattac aggtgtccac caccacaccc 6901 ggctaatttt tgtattttta gtagaaacgg ggtttcacca tgttggccag gctggtctcg 6961 aactcctgac ctaaagtgat ccacctgctt cagcctccta aagtgctggg attacaggtg 7021 tgagccaccg cgcttggcca tgtcttttat ttttgtagag ataggggctt gatctgtgtc 7081 accgaggctg gtctcgaact cctggcctca agcaatcctc ccactttggc ctgtcaaagt 7141 gttgggatta caggtgtgag ccacctggct tcacccatgt tttttatttg tagaagttta 7201 tggggcgcat gtgtaatttt gttacttcca taggttgcct agtgctcaag gcagggcatt 7261 tagggtatcc atggcccaag taccatacat tgtacccatt aactaattgc ttatcctcct 7321 ctcaactccc acccttgaat tttcttatga aatcttctta ggatctaaga tcactgcagc 7381 ctctgcctgc tgggctcaag tgaccctcct gcctgcctca gcctcccaag tagccggaac 7441 tacaggctca caccaccatt ccgtgctaat ttttttgata ggaggttttg ccacattgcc 7501 caggctgatc ttgaactcct gggctcaagt gatccacccg cctcagcctc ccaaaatgct 7561 ggaattacgg gtgtgagcca ctgcccccag ccttttatga aatctttgga agagcacctg 7621 atgcacagtg agacctcaga aaatgttaag tattgtcatt ttatcagtta tgtcccaaaa 7681 actgtgcaaa gtgttgggga tacaacaaca acaaagacaa gctaaataag ctaaatgctc 7741 tcgtggaatt tataatctat tttttgtttg tttttgaaac agtctcactc tgtctcccag 7801 gctggagtgc agtggtacag cgtgatctcg gctcactgca acctccacct cccgggttca 7861 agcaattctc ctgcctcagc ctcccgagta gctgggacta caggcgcatg ccaccacacc 7921 catctaatta tttgtagttt tagtagagac ggggtttcac catgttagcc agaatggtct 7981 cgatctcctg acctcgtgat ccgcccacct cagcctccca aagtgctggg attacaggcg 8041 tgagccacca tgcccggcac acccagctaa tttttgtatt tttagtagag atcacaccag 8101 tgcactccag cctgggtgac agagtaagac tccatctcaa aaaaataata aataaaaaga 8161 atttttaaaa ataataaatt attattatta ttattatttg agatggagtt tcgcttttgt 8221 tgcccaggct ggagtgcaat ggcatgatct cagcttaccg caacctccgc ctcccaggtt 8281 caagcaattc ttctgcctca gcctccctag tagctgggat tacaggcatg tgccaccaca 8341 cccagctaat tttgtatttt tagtagagac agggtttttc cacactggtc aggctggtct 8401 cgaactcccg acctcaggtg atccacctgc ctcagcctcc caaaatgctg ggattacagg 8461 cgtgagccac tgcaccccgc caaaatgatt ttttgagaca gggtctcact ctgtccccca 8521 tggtggagtg cagtggtgct cactgcagct ttgacctcct gggctcaaat tatcctccca 8581 cctcaacctc ccactgagta gctgagttta taggcactca ccaccacgcc ctgctaattt 8641 ttttgttttt tgtagagatg ggagtctcac tttgttgccc aggctggtct tgaacacctg 8701 ggttcaagca atcctttcac ctcagcctct gaaagtgctg gaattacagg cgtgagccac 8761 cacgcctggc tttaaaaaaa agttaagaac aattagtgtt tgttattact ctgcagactg 8821 tcgtgggaaa tgcagacagg actgtggtga ttgcctgggt ggagtttaga aaatagctgt 8881 ggagcacatg aatctgctag ataaaaacag aattcataat tcatgacctg ttatattgca 8941 ctgtaagagt ccagacattg tgtcaaaatt taaagaagca gaattgactt gtgtgtatat 9001 gtatgtgtgt gtgtatatat atatattttt ttgatgatac atatatatat gtataaagct 9061 gggtgtagtg gcacatgcct gtaatcccag ctactaggga ggctgaggca ggagaattgc 9121 ttgaactcag gagacagaga ttgcagtgag ccaagattgg gccactgcac tccagcctgg 9181 gtgacagact gagactctat ctcaggaaaa aaaaaaaaag taagcacttt gtttgtgtca 9241 ggtactgtgt ttggtgttgg gatataggag tcagcaagac agacaagctc cctgctctcg 9301 tgaagctttt cccaggcgac ttagaagtta atgggggagg gggaagagca gtcctgggac 9361 agggaaccgc ccgagcgaag gtcctgaggt tggaaggggc ttggcacatt tcagaaacag 9421 aaggaagcct agctcggagg atggatgtgg tgtccctagg gataatggtg gagataggca 9481 gcgatggtat tccttagcac cttggaggtc agagtaagta gtttgagttc tatcctgggg 9541 gcaacagggt ttcaaacagg gatggtggga taggcagatt atactttaga aggatctcct 9601 ggccaggcaa cctggctcac acttgtaatc ccaacacttt gggaagctga ggcaggagga 9661 tcacttgaat tcaggagttt gagaccagcc tgggcaacat ggtgaaaccc catctctaca 9721 aaaaatacaa aacttggccg ggcgtggtgg ctcacacctg taatcctagc actttgggag 9781 gccgaggcag gcggatcacc tgaggtcagg aattccagac aagcctggcc aacatggtga 9841 aacccggtct ctactaaaaa taccaaaatt agctgggcat ggtggcggat gcctgtaacc 9901 ccagctactc aagagactga gcagaactgc ctcccaggtt caagcgattc tcctgcctca 9961 gcctcccgag tagctgggat tacaggcatg tgccaccaag cccgtttaat ttttgtattt 10021 ttagtagaga cagggtttcg ccatgtgggc caggctggtc ttgaactcct gacgtcaggt 10081 gatccgcccg cctcggcctc ccaaagtgct gggattacag gcgtgagcca ctctgcgcct 10141 ggccgaggtc tttctctcaa agcgatcact aacctggcgc cactgaactc cagcctgggc 10201 gacagagacc ctgtctctaa agaaattatt aaaaaaaaaa aaaaaaaaaa aaaaggagtc 10261 cttgaggccc aggaagggtc ctctccagcc ctcgcacaag ctccacctcc cctcctcctc 10321 caaggatcct gagaccagac agccacgtat aaggtggcgg gcgtgactcc tctggccagg 10381 ctcgtgggcg gcgcccgcac atgcggagcc tgccacccac tgggcgagag cggtactgca 10441 gacgcgggtg tcaagagtcg ccccacccgg tggctttgct gtaaatgttc actgagcgcc 10501 ttttctgttc caggcagaag gggtccctgc cctcatgtgg cttacaatct aatggaacta 10561 ataccagcaa cggaagtgcc gctagcgact tgattaaaac tttttttttt ttttttgaga 10621 cggagtttcg ctcttgctgc ccaggctgga gtacaatggc gcaatctcgg ctcactgcaa 10681 cctccgcctc ccgggctcaa gcgattcttc tcagcctact gagtagctgg gactacaggc 10741 gcccgccacc acgccgggat agtttttttg tatttttagt agagacgggg tttcaccgtg 10801 ttagccagga tggtctcatc tcctgacctc gtgatccacc cgactcggcc tcccaaagtg 10861 ctgggatgag aggcgtgaac cacggcgcca ggccaatttt tgtgttttta gtagagacag 10921 ggtttcatca tattagtcac aggctggtat cgagctcctg acctcaggtg atccacccac 10981 ctcggcctcc taaagtgctg ggattccagg tgtgagtcac catgcctgac ctttttaaaa 11041 tttctttaca gcaatgcaag aaaaaacttt tttttttttt aatagagatg ggatctccct 11101 atgttgccca ggctggtctt aaagtcctgg aatcaagcaa tcctcctgcc tcagcctccc 11161 aaaatgttct gattacaggc atgagccacc acgcctagcc tatagcaact ttataatctg 11221 ctttttgatg aaagcttttg ttcttataat acaagtacta tacgttcatg gtagaaaatt 11281 taaaaagaaa atttaaagcc cagacataac ttctgtcaac atgtgagtgt agcccttacg 11341 gaattgtcgt gtgttttaca aaaatagaag tatacattta atattttgta gcctgctttc 11401 ttgtttttaa cagctttatt gagatataat gtaagaggga ccgatatcaa tacattccaa 11461 tataccacct tttcacatac aatatatctg ctttgtttta aacttaatgt aatagtggtt 11521 tattaataat aatataacac aaatcactgc tataggttga ggaaagtgct aagtgcttta 11581 cacacatcat catgttcagt tctcccaaaa accctaacag taggtactat tattttcctc 11641 attctgcaga tgaagaaact gaagcataaa attacttgtc cagcaatagc agcagaacct 11701 ggatttattt ttatatttta agatagggtt ttggctgggt gtggtggctc acgcctgtaa 11761 tcccagcatt ttgggaggcc gaggcaggtg gatcacgagg tcaggagttc gagaccagcc 11821 tggccaagat ggtgaaaccc cgtctctact aaaaatacaa aaattagcca ggcaagttgg 11881 cgggtgcctg taatcccagc cactcgggag gctgaggcag gagaatcgcg tgaaccaggg 11941 aggcagaggt tgcagtgagc tgagattgtg ccactgcact ccagcctggg tgacagagtg 12001 agactccgtc tcaaaaaaaa aaaaaaagat agggtctcac tctgttgccc aggatggagt 12061 gcagtggcat gaccttggct cactgcaacc tctacttcct ggactcaagc aatcctccac 12121 ctcagcctcg caagtagctg agattacagg caggcatcac catgcccagc taatttctgt 12181 attttttgta gagatagggg ttcgccatgt tgcctaggtt ggtcttgaac tccagggttc 12241 aagtgatcct cctatctcag cctcccaaag tgctgagact ataggtgtga gccacctcac 12301 ccagccagag gtggacttaa atccagcttt gtctgagaac ctgagttctt aaccattttg 12361 ttatgcctgg cctgtgccag gcattgagag aatttagagg aatccgaaaa aaaaaggggt 12421 ctcctggccg ggcgcagtgg ctcacgcctg taatcccagc actttgggag gccgaggtgg 12481 gcagatcacc tgaggtcagg agttcgagac cagcctggcc aacgtggtga aaccccatct 12541 ctactaaaaa tacaaaaatt agccgggtgt ggtggcagat gcctgtaatt ccagctactc 12601 agaaggctga ggcaggagaa ttgcttgaac ccaggaggcg gaggttgtag tgagtcgaga 12661 tcgtgtcact acactacagc ctgggtgaca gagcaagact ctgtcttaaa aaaaaaaaaa 12721 aaaggttagg gggtctcctg gggcctgaag tggaatgagg aagtggaata gtcatcataa 12781 gaggtacagt gggctgaaca tcactggggg accagaggcc agggtcagga tgtaaccagg 12841 aaaggcttcc tggaagaggt gagaaccgag tgtgatgtgt gcagacagag gtggggaacc 12901 acgtggggct ggcctaagca gaacctcggg tgtagacgag tgtatgagca tcaacaggga 12961 ccagtaagga aaaataactg agcaccaaat tctatggggt cacagagata gatgatactg 13021 ggggtctgcc ctctaaagct gacaatctag ctcggcagaa ggttgagagg tgggggtagg 13081 aacacagaaa taagtaatag ctatcactgt gcacacccgc ccccacgtgt caggctttct 13141 acctacattg acacatttaa gctcacaaca tctctggaag atccagttat tagccccatt 13201 tatgaatgaa gaaagaaagg ctcagagtaa agcagattgc tctaggtctt cctagcagga 13261 agggctaggg ccaaggctca aacctgggtg gctcaaggtc taaatccagt gttctaacct 13321 gcacagctac gctattcagc ctaccccagc tggaccgccc cctcccactg caaggaaacc 13381 aaccctgaca cccagccttc tatgccttct atgcaaacct ctaggtgcac ttcagtcaga 13441 aatctttttt tttttttttg agacggagtc tggctctgtc gcccaggctg gatcttggct 13501 cactgcaacc tccacctccc aggttcaagt gattcttgtg cttcagcctc ccaggtagct 13561 gggattatag gtgcccgcca ccacacctgg ctaatttttt tttaagacgg tgtttcactc 13621 ttgtcgtcca ggctggagtg caatggcgtg atcttggctc actgcaacct ctgcctccca 13681 ggtcaagaga ttctcctgcc tcagcctccc aagcagttgg gattacagat gtctgccacc 13741 acgtctggct aatttttttt tttttttgta tttttagtag agacggggtt ttaccatgtt 13801 ggccaggctg gtctagaact cctgacctca ggtgatccac ccgcctcagc ctcccaaagt 13861 gctgggatta caggtgtgag tcacagcgcc tggccttttt tttttttttt ttttttttga 13921 cagagtctcc tctgttgccc aggctggagt gcagtagtgc aatcatgtct cactgcaagc 13981 tctgcctcct gggttcaagc gattctcatg cctcagcttc ccaagtagct gggattacag 14041 gtgtgtgtca gcacacccag ctaatttttg tatttttagt agagacgggg ttttgtcatg 14101 ttggccaggc tggtctgaaa ctcctgacct caggtgatcc gcctgccttg gcctcccgaa 14161 gtgctgggat tacagtcatg agccactgca cccagcctct ctggtggtat cttttggaaa 14221 gatatctttc tggaaaactc tttccatccc agcccaaact tgtctctgac tcactcattt 14281 atttgagggt aggtgactta gatgtcatat cctacaggag ggcagactga ctgcccctcc 14341 aaagccaaat gccattttga tcctcttcta ttgaccctgt accacctcga tcaagacaat 14401 tctttgtttt tttttctttt atggtcaaaa aatttaagac tagttccaac ttgaagtagt 14461 tttgtttcaa cagatcaaga taattccttg tttttttttt ttttgagaca gagtttcact 14521 cttgtcaccc aggctggagt gcaatggcac gatctctgct cactgcaacc tccacctccc 14581 aggttcaagc gattctcctg cctcagcctt ccaagtagct gggattacag gtgcccacca 14641 tcacgcccgg ctaatttttg tatttttagt agagacaggg tttcaccata ttggccaggc 14701 tggtctcgaa ctcctgacct caggtgatcc acctgcctcg gcctcccaaa gtgctgggat 14761 tacaggcatg agccatcgca cctggactag atcaagacaa ttctaacagt taacaattat 14821 tgagcactca ttttgtgcca ggcactgtgc taagagtgca gtatctcaga gtggttaaac 14881 atggaggctt tggggccaga cttctagatt taaattctgg ctctgggtca ggaacagtgg 14941 ccgacacctg taattccaga actttgagag gctgaggtgg gaggatcgct taaactcagg 15001 agtttgagat cagcctgagc aacatagtaa gaccccgtct ctaaaaaaaa aaaaattagc 15061 tggatgaggt ggtgtgcact tgtagtccta actactcagg atgcttgtta ggtgcagtgg 15121 ctcatgcttg taatcccagc actttgggag gccgaggcag gtgaatcact tgaggtcagg 15181 agtttgagac cagcctggcc aacgtggtga agccccatct ctactaaaaa attcacaaat 15241 tagccgcgtg tggtggcgca cgcctgtaat cccagctact cgggaggctg aggcaggaga 15301 attgcttgaa cccaggaggt ggaggttgca gtgagccaag attgtgccac cacactccag 15361 cttgggtgac agagcaagac tccatctcaa aaaaaaaatt tgttatacaa aggaatgaat 15421 aaatgcatat tacattatta cattcctatt gtagagaaga ggaagcatag ctggtggcaa 15481 accagagatt tgaacccagg agttcctgat ccctgtcttg aacactttct ttttcttttt 15541 tttttagaga tggtgtcttg ctatgttgcc caagctggac ttgaactcct gggcttaacg 15601 gatcctctgg cctcagcctc tcaagtagct aggactacaa gcaagcccgg ctagcttaga 15661 ccctttgtat ggccccagtg tggggcctca tatacccttt ggctgtgttt gaggacgtgt 15721 ttgacagatt tgatgatctt gattgcaccc tgaagtataa aaacttaggc tactcgagaa 15781 agatttcatg ctaaatcagg tagtaaagct ctagtaatat gatcctaagg ttttgtcttc 15841 atgaaactgc atgaccttaa ggcatcggtt tcagctaacc tgcccaaggc cacacaggct 15901 gtaagggaac aagatgtgag cccagccttc tcacactaga gtccctgttt gttctcctac 15961 ccggactttt ttgttgctgt tgttgttttg agaccagagt ctcgctgtgt cgcccaggct 16021 ggagtgcagt ggcatgatct cagttcactg caaactccac ctcctaggtt caagagattc 16081 tcttgcttca gcctcccatg tagctgggac tacaggcttg cgccacgacg ccgaataatt 16141 tttgtatttt tagtggagac ggcgtttcac catgttggcc aggctggtct cgaactcctg 16201 acctcaagtg acccacctgt ctccgcctcc caaagtgctg ggattacagg cgtgagccac 16261 agtgccggcc ttttttttgt tttgttttga gatgggagtt caactcttgt tgcccagatt 16321 ggagtgcaat ggtgcaatct cagctcactg caacctctgc ctcctgggtt caagcgattc 16381 tcctgcctca gcctcccaag tagctgggat tacaggcatg taccaccatg cccggctcat 16441 tttgtatttt tagtagagat ggggtttcac tatgttggtc aggctggtct cgaactcctg 16501 aactcaggtg atccactcac ctcggactcc caaagtgcta ggattacagg catgagccac 16561 tacatgcagc ccccaggagc tcttgatctg aggcataaag atgcttagag ggtttccaaa 16621 tatattttat ggtcacctat tctcccaaca atgtgtgcta tactgtgtgt ttacacatag 16681 gtgtgtcttt agaggaaagg gttcaaagct ttcattagat tctttttttt tctaataaag 16741 aaaaaagctt tgttgcccag gctggtctca aactctcaag ctcaagcaat ctgtccgcct 16801 cggcctctca aagtgttggg attacagcca tgagccacca cgctggtcct cattacattc 16861 tcaaaggggc ctattgaaaa aaaaaaaaaa aaaaaaaaaa aaggccgggc acagtggctc 16921 acgcctgtaa tcccagcact ttgggaggcc gaggaaggcg gatcacaagg tcaggagatg 16981 gagaccatcc tggccaatat ggtgaaaccc tgtctctacc aaagtacaaa aaaaaaaatt 17041 agccaggcat ggtggtgcgt gcctgtaatc ccagctattc aggaggctga ggcagtagaa 17101 ctgcttgaac ctgggatgca gaggttgcag tgagccaaga ttgcaccgct gcactccagc 17161 ctaagcaaca agagtcaaac tccgtctcaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaga 17221 tgagactgtt gagtcaggat taggatccag gtctgctggc accaatttca gtgctcttta 17281 tactttctgt tgctattcat tcattcatcc attcattcag caaatttata tcaagtatct 17341 ggtatgtgcc aggtatgatt ccaaattaca ccattcccag attattctga gaatgctgca 17401 gtgccaaact cttggtatca cagactttat attctagtgg gggagacaga caaaataata 17461 atataacgac gaaggtgcaa tgttatgtgc tatgagaaaa cgcagtcgga ggatggcgag 17521 gaaggggaac ggaattttag atagggcggt ctgggatcag aacctccaga tgcccgtcca 17581 tggactccaa cgaagggagc gattccaggt accagggcac cctgcacaaa cctctcttga 17641 gcttccgcct cgcccaggtg gggacggtag ggagaccgga gggcacacgc atgcgcacga 17701 aggaaacggt aaagcctgaa gggaggtgca gagcgcatgc tctctcttgc ccgagatgcc 17761 gaggattttg acaaggactc cgtcgtcccg gatgatagtg ctcaggttaa tgccagtggg 17821 agggcggcgc ccaatagtaa cttcctttgg aggttgtagt accgccccca gagccaattt 17881 tccacttccg cttccggcgc tgcggcagtc cagatcaaaa atggcggtag ttggtgtgtc 17941 ctcggtttct cggctgctgg gtcggtcccg cccacagctg gggcggccta tgtcgagtgg 18001 cgcccatggc gaagagggct caggtactgg ggccggggtc gacgggtcga gcctcagccc 18061 cactcgggcg agacagggag ggactgtgac cttggcccga ggccttgcgg gagggaaagt 18121 gagacccggg cccgccccat accggcgctg aacgtttgtg gcttctccgc agctcgcatg 18181 tggaagactc tcaccttctt cgtcgcgctc cccggggtgg cagtcagcat gctgaatgtg 18241 tacctgaagt cgcaccacgg agagcacgag agacccgagt tcatcgccta cccccatctc 18301 cgcatcagga ccaaggtacg cccttgtaca tctcttcaag cgtccgttct cttttcgtta 18361 tgtgtgcctt agtgcaagtt cttcattctc tgaaggcatg ggtgccaggc gtgtacagct 18421 tgtttatcct cacaaacaga aaatgtattt tcttccattt tgtggatgga cagctgacac 18481 ttgggattac gtctcaattc tcttcttcaa ggtcatacaa taagtgctct tcagtttccc 18541 ttttctccga ttcatcctac ctcctgcctt ctgagacagt tctttttttt tttttttttt 18601 ttttttttac ttctgagaca gttctgaaac aggtgtccgt tgtgtaaagc ctggaggttt 18661 gggaaatcca attagagtgg caaagtctga aggtggttcg cgtaagcgaa cccagcagct 18721 acatgggagg ctgaagcagg agaatccctt gaacctagga ggcggagttt gcagggagct 18781 gagatcgtgc cactgcactc cggcctgggc aacagagcga gactctggtc tcaaaacaaa 18841 acaaaacagc aacaacaaaa catgggcatc tggcagctac tagttgcata atgaagagac 18901 tgacagcttt ctctattccc caccaatacc ttttctcgac tgttttatca ttggttgttg 18961 atgtagatat attacaatgt tgtgtcctct gttgtttgct gtatctttaa caagggctta 19021 ttctgattct tcaatactgg ccatgcgcgg tggctcacgc ctgtaatccc agcactttgg 19081 gaggccaagg tgggtggatc gccggaggtc aggagttcaa gaccagcctc gtcaacatgg 19141 cgaaacacta tctctactaa aagtacaaaa agtagccggg cgtggtggcg ggcgcctgta 19201 atcccagcta ctcgggaggc tgcggtggga gaatctagga ggttgaaccc aggaggagga 19261 ggttgcagtg agccaagatc cagccattgc actgcagcct gggcaacagg agtgaaactc 19321 cgtctcaaaa aaaaatagta ataatatact tgatcttagc caaaaggccg agaagtgatg 19381 aaaaaagtta ttattattat tattattatt acatcaccta gcccatatag gactggcctt 19441 gtagaaactt aacctagact cttgaatttg ttgagtcttt gagcggcact cccatccttg 19501 tacaatgcaa ttagaagcca aacagtaaag gtttctcagc ctcagcactg ttgacatttg 19561 gggctgggta attcttgttg gggggctatc ttgtgcattg tgaaatgttt agccatatcc 19621 ttggcctgta cccactaggt ggcagtagta cctgcgcaac cccactcccg cttccccctc 19681 ccccgccccc atcacagtgt ctcccgatac tacccagtgt gccctgaagg ggcaaaatca 19741 ccctgggttg gagaccgttg ccctaaagta atgcttttca aaggtaatag gcaagataat 19801 tcagggtggg ctgcagtttc ccagcccaga tattgtttga agtctgggat ggtgccagaa 19861 atctgcattt aattaagcat ctccctcccc taatactgcc accattttga tataggtgat 19921 ttgaggatta cactttcata aacacagcac tggatatact agttactagg gagaggatga 19981 tgagtaacta acttgagctt gcattttctg ccatccatat ttgtttctgg aattaaaact 20041 gtcctgtagc caggataact ttaagggctc ccctggtagt tataagagct ccttaaataa 20101 tgggaaggtt gaactacacc agtagataaa ggactaattt ccattttgtt tttatatatg 20161 atctgtcacc ccgttataag cagttcatga cggtgttctt tctaaattat tttctggaat 20221 tatctaactg acatcttcac tccacagccg tttccctggg gagatggtaa ccatactcta 20281 ttccataacc ctcatgtgaa tccacttcca actggctacg aagatgaata aagagaatct 20341 ggaccactac ccgggcacca gggaccacag cactggtttg gaccgttact ctgcacatgg 20401 accagaaaaa gtatatggga ccttaagctc accttcttta cttgtatcaa atgatgactg 20461 gtatactggt ctcccatccc tttgcttgtg gcaggagatg gcttaaataa ataacttaaa 20521 tttagattgg tcatgagctg agagttactt tctttggttg tgttttttcg cagaaattag 20581 catttgagct tcaagtcagt gtccaagttg aaaagaaatt ggcattagtt atttctgttt 20641 ccacagtgaa ggtctaggct gctgtttagt gttacaggtc caattcagcc cattgaataa 20701 gtgtttataa cagcccctct tggtttattg tgcctgatgt ggataaaggg accaagtgga 20761 gtggtgctaa ttaagcatca agattagtcc ttggtcaggt gcggtggctc atgcctgtaa 20821 tcccagcact ttgggaggct gaggctggca gatcacctga ggtcaggagt tggagatcag 20881 ccgggccaac atgatgaaat cctgtctcta ctaaaaatac aaaaattagt tgggtgtggt 20941 ggcgcactcc tatagctggg tgtagaggct tgaacccggg aggtggaggt tgcagtgagc 21001 caagatcgtg ccactgcact ccagcctggg caacagagtg agactctgtc tcaaaaaaca 21061 aaagattggt ccttgacggg cctgagatat caaagacatt gaatggctag cagaatgcct 21121 gcaatttaca cctgaaagag gtatctgtat ttctcaagag gaagtgcttg atctgggtct 21181 catctttttc ctgtcccatt ccactagcat ttactgcaga atcttccctg tagtcataat 21241 tttcttgaac atgtcaagtt cttttttttc tttttttttg agacggagtc tctctctgtt 21301 gcccaggcta gagtgcagtg gcacgatctc cactcactgc aagctctgcc tcccaggttc 21361 aagccattct cctgcctcag cctcccgagt agctgggact ataggcaccc gccaccacac 21421 ccagctaatt ttttgtattt tcagtagaca cagggtttca ccatgttagc caggatggtc 21481 tcgatctcct gacctcgtga tccacccgcc tcagcctccc aaagtgctgg gattacaggc 21541 gtgagccacc gtgcctggcc tcgaatgtgt caagttctta ctttgaggta atctccttca 21601 gggctggtag ggtcagaaat cttaagagaa acaaagggag cgtgacacat ttctgcttat 21661 tggcttttct gtggaattaa aactatcctg ctttaatgga tagtaaatgg tagtagtagt 21721 agttacaaaa ccactctgag agggcaatta ttattattat tattattatt attttttttt 21781 tttttttttt ttttttgaga tggagtctcg ctctgtcgcc caggctggag tgcagtggcg 21841 ggacctcggc tcactgcaag ctccacctcc cgggttcacg ccattctcct gcctcagcct 21901 cccaagtagc tgggactaca ggcgcccgcc actacgcccg gctacttttt tgtattttta 21961 gtagagacaa ggtttcacat gttggccagg ctggtcttga actcctaacc tcaggtgatc 22021 tgcccacctc agcctcccaa agtgctagga ttacaggtct gagccactgt gcctggccag 22081 agctggttta ttttatttta ttttattttt tgagacagag tttcgctttt gttgcccagg 22141 ctggagtgca atggcatgat cttggctcac cacaacctcc acctcccagg ttcaagcaat 22201 tctcctgcct caccctcctg agtagctgtg attacaggca tgtgccacca cacctggcta 22261 attttttgta ttttaagtag agatggggtt tctccatgtt ggtcaggcta gtctcaaact 22321 cccaacctca ggtgatctgc ccacctcagt ctcccaaagt gctgggatta ctggcgtgag 22381 ccaccgcgcc cagccagctg gtttattctt aaggctcaaa ggaacacttt cgtggtaagg 22441 ttcaagaggc aacaaggtat ttcatattat attacattaa actggtaacg caaaagtaag 22501 gcatatacag tacggaatag agaatctcat tgtctttgcc attttgtaat aattaacgga 22561 atagagaatc tcattgtctt tgccattttg taatattaca gcctgcactt ctgagtccct 22621 atagagctgt tttggttgtg gccagtctga gtttaggaca gatgcattat acttcagctt 22681 gctgcctggc ctcccttggc tgggttccaa acatctttta taatccttgg ataatggacc 22741 ctagccagtc ataaaatata aatgaagaat gttttgaaat gacattaagg acaagtcaca 22801 ttaaatatga tgtagcaatt ggtaaagtgg agaaaagaca atagaattag ggccttctat 22861 aaatcaagtt tcttggttta tttttcagac agggtcttgc tgtgtcaccc agcctggagt 22921 gcagtggtac aatcacagct cactgcagtc tccacctctc aggctcaggt aattctccca 22981 cctcagcctt tagagtagct gggactacag gcacacacca ccatacccag ctaatttttt 23041 tgtagagaca gggtttcgct atgttgccca ggctggtctc aaattcctga gctcaagtga 23101 tccgcccatc ccagcttccc aaagtgttaa gactacaggc atgagccacc atacccggcc 23161 tgtaattcaa gtttcatact actttgggta tctaataaaa ccaaattact caattaattg 23221 tggggattac gtgacattct gaggtaagta agtgctctcc tgatcctttt ttggtaccgt 23281 agtgtatttg tatagcctat tgtggaaatt ttagaaatca gtgaggtttt ggctgggtgc 23341 ggtggctcat gcctgtaatc ccagcacttc gggaggtcaa gagttcaaga ccagccaaca 23401 tggtgaaacc ccatctctac taaaaataca aaaactagct gggcatggtg atgcatgcct 23461 gtaatcccag ctacttggga gactgaggca ggagaatcat tggaacccag gaggtggagt 23521 ttgcagtaag acaagattgc gccactgcac tccagcctgg gtgacagaac cagactgtct 23581 caaaaaaaaa aaatgtgagg tttctgattg ccatactgta ccacctagct taagacaagt 23641 cactttttct tcacctttaa gttacaaaac acaaaactct ccccactgta tatatgaggt 23701 ttgtagggta acagggtaat agttccataa aatgcactga gttccagagg caaataatga 23761 tagaaactaa atgttacaat ttattccatc ttcaggatta cagacattac agggcaggat 23821 gagtaaacaa ggcaaatgaa gcgagcacct tcagcttccc ccctaccccg acatttaacc 23881 agatgcagca ttttgacatt tttaggatat gcaggttgac aattcactga cttgggttga 23941 gagctggcaa tagcagatct ttgcaattta ggttcttcct ccacagctat tccaagtatc 24001 ttaattcctg aactgcacac tgagagctct gaaatggtgt tcactgcaac attcttgcag 24061 ctttcacatc ttaatctgac acctcttgtg aaggcaggga actgtgttaa aagctgtctt 24121 cctctttgct aatccaggcc accatcaatc ttaaccatta gttactatat aaaaataaaa 24181 gtgtgctcaa aagcactcac tgaaactgtt gtgcccagat cccttttcag agcattagtt 24241 ccctgagagg aaaaaaagag gtcctaacca attgctttca taaatagtga ccccagtaca 24301 gtgtatatgt cttttgcagc agaaatcaag aggtcaggca gctctgctca tcctgactag 24361 tttatcatcg tttgcataca aggtatattt atgtaaagct gattccacgc caagtattgc 24421 aaccataatc tcaaaaaaat tgttcaattt tagcacacag ttcctgcaag ataccacagc 24481 aggtgagaaa tcatctcaaa gagttcatct tttacaactg agaggaaaac atcgaaggag 24541 gaaataaaac tcctctctcc taagttcctc atcaaatctg atggctatgt tcacagagtt 24601 agttgacaaa aatccagagt cctcaatttc tggacttgcg aaatccttca aggtgactgt 24661 caaggtcaag aagaattttc aggcttttct ttgccatggc ccatgaactc cagtccttca 24721 ataggaatct ctttctcctt tattgctttc tggtaggagg agaaaacaca ttataaagac 24781 ctacatgaag cttaagttgc aagctttgta aataaaatat gacagtacaa aagtaaaatc 24841 aagctgatta taaagactta taaccatgat tcaaaccaca tactttttaa aaaacaagta 24901 ccggtcaggt gtggtggctc acgcctataa tcgcagcatt ttgggagacc aaggcaggtg 24961 gatcacttga ggccagaagt tagagaccaa cctggccaac atggcgagac cctgtttata 25021 ctaaaaatac aaaaattagc tgggcatggt ggcgcatgcc tgtaagccca gctactcggg 25081 tggctgaggc atgagaatag cttgaacctg ggaggcggag gttgcagtga gctgagattg 25141 tgccattgca ctccaggctg ggagacagag tgagactctg cctcaaaaat aaataaaaac 25201 cagaagtacc ataatgaata aatcaagtgt aagacctgag ggagactgga aaaggagttg 25261 tgagggaacc gttaaaatta agtgaaattt gaaaaataat catcactttg tgttagtgaa 25321 agacttaggt tcaaatctgg gttccaggac ttactggcta tgtgactaac tgtaaatggg 25381 atgaagacag caactacctc ggggaattgc tgtgaggatg tgttgggtta acttattgga 25441 agtgctttgc ccggtgctgt tcttaacaat tcaaagcagt tccacatcca ttgggccaga 25501 ataaacattt cctggctgca gttgcgggga ctggtcaaga gagagaaaca ggctaacgaa 25561 gggtttataa caacaaaaaa atgtgaccct ccagtctgaa ttctgaagta taatcccgcc 25621 cagaccccag cagtaatagt ggaaaagtgc accacctgcg aaagaaccaa ggaaacctga 25681 tgctagtgaa ctcggattaa tcacttccta tttctgggtg ttagtttttt atgtataaaa 25741 tatgagtaac tatgcccacc ttgcgagttt agggtgaaca ccaaatgaac caaaggatgt 25801 gactgtgctt cttgaaaagc actgccccaa tatttgtcat tcggagagac acccaagtgg 25861 cctcggtggt gatggtaggg gagcgagctc ctgccactgg cctcactgcg acttttctcc 25921 acccttccag tccagtccca atccccgtaa aatacgatga gtgtggtggg acacagcgcc 25981 gagaatgcag ggcctgggaa cagaggcggg agggctcacc tgaacacact gctggtagcg 26041 cttgaagagg tcggtgcacg ggtccccgga gctgtccccc ttgagaaatt tctcggcgaa 26101 ccagcgattg aagcactggt cgtactcgcg cttcatgtcc gtgcatgcct cccccacact 26161 gttcatggcg acagtggtgg cggcggcgac gacggcgcac tctgatgtca tcactctcag 26221 gcgcgtcgct cggcgttacg cgcgggcgca ctgcgggggc caaggaagga agaaatgtgg 26281 tcgcggttgg tgtggctggg ccttcgggcc cctctgggcg ggcgccaggg cttcacctcc 26341 aaggcggatc ctcaggtaaa ggccagggcc atctaggcgg gtggcggagc aagccgggag 26401 gcacttcgga gcgccggtga cccacactcc ccgcctcatc cctcctccag ggcagtggcc 26461 ggatcacggc tgcggtgatc gagcacctgg agcgtctagc gcttgtggac ttcggcagcc 26521 gcgaggcagt ggcgcgactg gagaaagcta tcgccttcgc cgaccggcta cgcgccgtgg 26581 acacagacgg ggtggagccc atggaatcgg tcctggagga caggtaaact cgcggctgca 26641 gccccgaagc cttgaccgtg gcccgttcgc agccgtttaa tgtgacgatt agcgaacagt 26701 tttccagggg gttaaagagt gttagcaaga ttcaggaact tgcccacagt cacctcgcga 26761 gtcagtggct tcactcttcc ccttgttcat tactgatgct ctcgctagtg taagtggaaa 26821 caaaacccga actgctaatc actgcctttg tttttttttt tgtttgtttg ttttgttttt 26881 ttttttttga gacggagtct cgctgtgttg cctaggctgg agtgcagtgg ctcgatctcg 26941 gctcactgca acctccgcct cccgtgttca agcaattctc tgcctcagcc tcccgagtag 27001 ctgggattac aggcgactgc caccaggccc ggcttatttt ttttgtattt ttagtagaga 27061 cgtggtttca ccatcttggc caggttggtc ttgaactcct gaccgtgatc cacccgcctc 27121 ggcctcccaa agtgctggga ttacaggtgt gagccaccgc gcctggccta atcatcgcct 27181 ttaaggccct actccataga gtccctacct actcactcca acgtactctt ctcagtactc 27241 tccttggcta ccttgactgc ttctccagca agacaagttc ttttcctctc tagatctctg 27301 cactggccct tcgttttttt ccttcctctc agtatttgca tggctgcctc cctcttggca 27361 caggtctcag cttatcttct cagaaagggc tctcctaacc actccaaatc ccttctcccc 27421 ttctagttag taactcatcc tattgggttt tttccctaac attttaatgg acttttgttc 27481 atttgttata tctcttttat taaaatgtga gctcattatg tagttactgt tttattcctg 27541 ccttgaattt ttattttatt tgtttttgag atggagtctc actctatcac ccaggttaga 27601 gtgcagtggc atgatctcag ctcattgtaa tgtccacctc ccagactcaa gagagcctcc 27661 catctcagcc tcccaagtac aggaccacag gcgcgtgcca cttggcctgg ctaatttttt 27721 gtatttttga tagagacggg attttgccat gttccccagg cttgtctgga ctcctgagca 27781 aaggtgatcc acccatctca gcctcctaaa gtactgggat tacaggcatg agccacagca 27841 cctggccgtg tttgattctt tcaaatgtct tctctacttt gcaattttct ttttcctttg 27901 ggacggggtc tcactatgtt gcccaggctg gtcgcaaact tctagactca agagatcctc 27961 ctgcctcagc ctcccaaata gctgggacta taggtgtgca ccaccacacc tggctaatac 28021 ttaaaatttt tgtagaaacg gggtcttgcc atgttgcccc atatggtctc aaactcctgg 28081 tctcaagcaa tcctcctgcc tcagcctccc acttcctcaa gaaagttttt tttttttttt 28141 gaggtggagt cctgccccat cacccaggct ggagtgcagt ggcatgatct tggctcactg 28201 caacctccac cctccaggtt caagtgattt ttgtgcctca gcctccggag tagctgggat 28261 tacaggcacc cgccaccaca ctcagctagt ttttgcattt ttagtagaga cagggtttca 28321 ccatgttgac caggctggtc tcgaacccct ggcctcaagt gatccgccca tcttcgcctc 28381 ccaaagtgct gggattacag gtatgagtca ctgcactggc caaaaaagtc cattcctttt 28441 tttttttttt tttttttgag acggagtttc actcttccac ccaggctgga gtacagtggt 28501 gcaatctcag ctcactgcaa cctccacctt ccggtttcaa gtgattctct tgcctcagcc 28561 tcccgagtag ctgggactac aggcgcccgc caccacgccc agctaatttt tgtattttta 28621 atagagacgg ggtttcacca tgttggccag gctggtctcg aactcatgac ctcacagtcc 28681 acccgcctcg gcctcccaaa gtgctgggat tataggcgtg agccacaagt cttcattctt 28741 ttatacttag gttctttttt ttttttttta attgagacaa gagtctcgct ctgtcgctga 28801 ggctggagtg cagtggcgtg atcttggctc actgcaagtt ccgcctcctg ggttcatgcc 28861 attctcctgc ctcagcctcc caagtagctg ggtctacaag tgcccactac cacgcctggc 28921 taattttttg tatttttagt agagatgagg tttcaccgta gccaagatgg tctcgatctc 28981 ctgacctcgt gatctgccca cctcagcctc ccaaaggtgc tgattacagg cgtgagccac 29041 tgtgcctggc tatacttagg ttctctacct gtggctctcg tgatatccaa aagggctgat 29101 ttatacatgc gtttatacat ctatttaagg gatgtgttga gtctctgccc ctctcctgct 29161 cacccacctt ccacagcatg tctgtttttc cttatcatgt tctcagtggt tggcactgag 29221 tattcagtaa atgtgttttg ttttgttttg ttttgttttt ttgagatgga gtctcactct 29281 gttgcccacg ctggaggcag tggcgtgatc tcggctcact gcaacctctg cctcctgggt 29341 ttgagattct cttgcctcag cttcccgagc acccgggatt acaggcatgc accaccatgc 29401 ctggttaatt tttgtatttt tagtagagag gggtttcacc atgttggcaa ggctcgtctt 29461 gaactcctga cctcaggtga tctgcctgcc tcagcctccc aaagtgctgg gaatacagcc 29521 ttgagctgcc gtgcccggcc acagtaaatg tttaatgaat tttttttttt ttagagacag 29581 agttttgctc acgttgccca ggctggagtg caatggcgcg atctcagctc accacaacct 29641 ctgcctccca ggttcaagcg attctcctgc ctcagcctcc caagtagctg ggattacagg 29701 tgcacgccac cacgcccagc taattttgta tttttagtag agacagggtt tctccatgtt 29761 ggtcaggctg gtctcaaact cccgacctca ggtgatctgc ccaccttggc ctcccaaagt 29821 gctgggatta caggcatgag ccaccatgcc cggtcccagt gttctttatt aatatctcta 29881 ggaaaggagg gactagaagc acagagtcat cagaatttcc ccaagttatc tagagggtaa 29941 atacaaagaa atataaccat cattaataag ggaaaaggtc gtgaaaaaag atgaggaatt 30001 tggatccagt agctagcatc tcaacctcgt attcagtgat ctgtactgtt tttcataaat 30061 ttggaaacca gtttgggtta ataattccca ttaatcattt taatgtgaca tctgtaggac 30121 tttaatgctt tgaacacaca gaatgtaaag tgaagacaag tattaaacct tttatttttt 30181 atttttttgt agtttttata acttcatttg atgtatttga tgatcagcag ttagttccca 30241 tccacactga ctgtagattt gtgaaagtgg taacaagtat ataaccaaag tatagagctt 30301 atttggtgaa tttctaacct cattatgttt tctggaccat ccactgcaca tggacacagt 30361 atggacattc cttactactt tggcccagac agctttgttg agcctggtat caatacacat 30421 atctggagtt ttctatctcc ttcattgcaa atttccggtc tctttaagag cctgaggggc 30481 acacttcttg aagcccactc catggataca cttgtgaata ttaatggtgt actcttgggt 30541 caccacctag ttgatggcat aatggccatt cttctcacca tccttctttg caggaggcca 30601 agttgcttaa atgttttagg tagtaggcat cttagctata aaagccctga cttggccggg 30661 cataatggct cactcctgta atcccagcat tttgggaggc cgaggcaggc agatcacttt 30721 aggtcaggag ttcgagacca gcctggccaa catggcgaaa ccctgtttct acttaaaaaa 30781 tacaaaaatt tgccgggcat ggtggtgcat gcctgtaatc ccggctactc tgaaggctga 30841 ggcaggagaa ttgcttgaaa tcaggaggcg gaggttgcag tgagctgaga tcacgccact 30901 gcactccagc ctgggcgaca gagcaagact ccatctcaaa aaaaaaaaaa aaaaggtaga 30961 ctgggcgcgg tggctcacac ctgtaattcc agcactttgg gaagctgagg tgggtggatc 31021 atggggtcag gagttgaaga ccagcctggc caagatggtg aaaccccatc tgtactaaaa 31081 atacaaaatt agccaggcaa ggtggcaggc gcctgtaatc ccagctactc gggaggctga 31141 agcaggagaa tcggttgaac caggacggca gaggttgcag tcagccaaga tggagccact 31201 gcactccagc ctgggcgaca gagtgagact ccgtcttaaa aaatacaaaa caaggctggg 31261 cgtggtgttt cacgcctgta atcccagcac tttgggaggc tgaggcaggt ggatcacctg 31321 aggccaggag ttcgagacca gcctgaccaa tatggtgaaa ctccttctct actaaaaata 31381 caaaaattag ccgggcatgg tggcgcactc ctgtagtcct agctgttcgg gaggctgaga 31441 cagaagaatt gcttgaacct ggggagcaga ggttgcagtg agccgagatc acgtcactgc 31501 actccagcct gggcaacaga gcaagactcc atctcaaaaa aacaaacaaa caaacaaaaa 31561 acgccttgac tggctaatag aaaccccttt gcacactaat aacatgtcat ttggtttgag 31621 gtaaaaacag tatgcttaca tgccaggggc taataaatta tataaattct tttttttttt 31681 tttttttttg agctggagtc tcactctgtc gtccaggctg gatgcagtgg cacgatctca 31741 gctcactgca acctctgcct cccaggttga agcaattctc ctgtctcagc ctcccaagta 31801 gctgggacta caggtgcccg gcaccatgcc cagctaattt ttgtattttt agtagagacg 31861 gggtttcacc atattggtca ggctgctctt gaactcctga cctcaggtca tccgcctgcc 31921 ctggcctccc aaagtgctgg gattgcaggc atgagccacc gtgcccggcc taaattattt 31981 caattttcac atgcagtcct ggaaggtcaa tacaagttat ctattttata tagatggaaa 32041 aaactgaggt tcagaaaggt tatataatgt gcccaaagtt tcacaggtaa taagcagaga 32101 atttggaact cagagcttct gctgccaaat actgttctaa gatatgccag ctaaagtcga 32161 ttcctaggaa ctttactctc tttcatctga aatccagaga atgagttgac agcaactctg 32221 cagccaaata cttaaatgtc cctcatagtc ttatgaacca aaaacttcaa actcttaggc 32281 tcaagccatc ctcctgtctc agcctctcaa gtagctggga ctacaggtgt gtacccctat 32341 ggccagctaa tttttttctt tgttactttt tttttggtgg agacaggatc ttgctgtgtt 32401 gcccaagctg gtctcgaact cctgggctca agtgatcctc ctgcctcggc ctcccaaagt 32461 gctggcatta taggcatgag ccatcatgcc tggcctgaat ccagaacttt ttttttctca 32521 ttctgttgcc caggctgcag tgcagtggcg caatcttggc tcactgcaga ctccacctcc 32581 tgaactcaag tgatcctccc acctcagcct cccgagtagc tgagactata ggcacacgcc 32641 accacccctg gctaactttt gtatttttta ggggagatgg ggtctcacca tgtttcccag 32701 gctggtcttg aactcctggg ctcaaacaat tcaccccctt cggcctcctg aagtgttggg 32761 aatatagatg tgagccacct tgcctggctc agaactttta accaagttaa ttagccccca 32821 ttccagtctg catcaaacta gaaataaagt aggaggaagg gagaaggctt ttagatggaa 32881 gtagaggggt ccacaagaag ctattccttc cagccagata gcagaaaaga agctcaaaga 32941 aagaaaagca gtgggagaac ggtcccatca gtgagtccaa gggctgacag aatcacaaat 33001 ggagccactt tgtcctccct gttgccacag cactttccaa tgccaccgta gcagcatttc 33061 tcacagttac ttaagttatt gttcctgagt tttaaccagc agggacttct agtttacaac 33121 tgaagtcttt ctcaaccagg gttcctcagc caaaccagaa cacagaaaga ttcaagtgac 33181 tttttttcag ttctcccaag gacagtatat gacaaatgtt agcttaaaca cacaggagat 33241 aagttaactt atcatcagaa tgggtgtctt ggggtatgta ggctaactta tcttacagaa 33301 ccctgattgg gaaagacagg tggtgaggta gaagtttagt caatcattaa ttgatctgaa 33361 aagctcatct ggtcctgttc agtgttttat ataattctga ggtccaagaa agtgaagcaa 33421 ctcaccaagt gagaggtatg cttgctgtac cactgtcaca gagaccgtgg gttcaggagc 33481 ctgtaattaa taaagctagg tatggtggtg cacacctgta atcccagctg cttgggaagc 33541 taaggtggga ggatcacttg agcccagaag tttgagacca gcctaggcaa cacagcaaga 33601 ccctgtctca aaaaaaaaaa gtaaaacaac aacaacaaca acaacaacaa aaaaccaacc 33661 ccagaaaatg tgtctatttt aggaacaact ccatcataac caataactaa atagaaatgt 33721 atgtatgtat gtacatacat acataaatag aaaggtatgt aaaggggtga attttgtttt 33781 cctggaaatt tcacttacgc aggaaactta attatgaaac ttgattactt gatgagtaaa 33841 tagattagtt gctattagat ctactcttgg ggaagcagta gcatttaacc gcctattggc 33901 ggtagtagag agtcttccca aaccttcaga tgctaggcag gttgtgcctc atcaggagtg 33961 cacagacttt gaatgagtgg tagggactga gatgtactgg ggtgcactaa ggctttctga 34021 aaggagcagg tgctactcag agccctctta ctgccgtgaa tgctggctca gcattatcag 34081 atcttgtttt tttaacattt catttaatct tcccagaaat cctatgaagc attcaacaaa 34141 caaggaaact gaaatataga gaggataagt aatgtgtagc aatgggatct gaattcagct 34201 gtgacttgag agcctaagct cttagccttt ttgagacgga gtcttgctct gtcgcccagg 34261 ctggagtgca gtagcatgat atccgctcac tgcaagctcc acctccaggg ttcacgccgt 34321 tcccctcctc agcctctcgc tacaggcccc cgccaccacg cctggctaat tttttttttt 34381 ttgtattttt agtagagacg gggtttcacc ctgtcagcca ggatggtctt gatctcctga 34441 cctcgtgatc tgcccgcctc agtctcccaa agtgctggga tcacaggcat gaggcaccgc 34501 gcccggcctc ttagccattt tttaatactg tccaacagaa actataaact ccagaaagtc 34561 caagggttac ctttgaatag tataattatg atttttattt tacttttgtt ctcccaagtt 34621 ttcttaaatg aatacagtcc ttgtgttatt caaaaaaaaa aaaaaatcta gtacaactct 34681 cttttgtgtg tgtgtgtgtg agacactgtc tcactctgtc acccaggttg gagtgcagtg 34741 gtgtgatctc cgctcactac aaactctgct tcctacattc aagtgattct tgtgcctcag 34801 cctcccaggt agctgggatt acaggtatgt accaccatgc ccggctaact tttatatttt 34861 tagtagagac agggtttcac catattggat ggtctcgaac tcctgacctc aggtgatcca 34921 cctgctttgg cctcccaaag tgctgggatt acaggcttga gccactgtgc ctggcctttt 34981 tttttttctt cttgctctgt cgcccaggct tgagtgcagt ggtgccatct cagctcactg 35041 caacctccgc ctcctgggtt caggcgattc tcccgcctca gcctcccgag tagttgggat 35101 cacacgtgcg tgccaccacg cttggctagt ttttgtattt ttagtagaga tgtggtttcc 35161 ccacgttgcc caggctggtc ttgaactcct gaattcaggt gatccacccg ccttagcctt 35221 ccaaagtgct gggattacat gcaagagcca ccacacctgg cctttctccc ccacaccccg 35281 ccgcccccct ccaagaaatt taatgtctga aaagatcata cttgttcaag gccacacagt 35341 taagaatgtg ccaggtcctg taacagactt caactttcta atattcatac catgagtaaa 35401 tatttgttgt gtctagcttt gggctgggtg ccacaaaaaa agaactagtc ctttcctaaa 35461 ataatcttat tgtgacaact tagttctctt tttttttgag gcagagtatc gctctgtcgc 35521 ccaggctgga gtgcagtggc gcgatctcgg ctcactgcaa gctccgcctc ccgggttcac 35581 gccattctcc tgcctcagcc tcccaagtag ctgggtctac aggcgcccac caccacgccc 35641 ggctaatttt tttgtatttt tagtagagat ggggtttcac cgtgttagcc aggatggtct 35701 cgatctcctg acctcgtgat ccgcccacct cggcctccca aagtgctggg attacaggcg 35761 tgagccaccg cgcccggctg acaacttagt tcttatgtgc agccttagcc cagatgtacc 35821 tgtcacatct ttgtccctct ctgacatgac tcaaacgcca ttagggtctg ccctcagtct 35881 gtgcccactc aaagcaggag gagcgtggct actaccgaat accatatcag tgatttaaca 35941 gactgcttat ttttttaatc taagtatctg gaagcaggct tacacacttt tacaccttga 36001 aacaatagtg aagcctcatg caaatgttca agctcttgtg tgttttttac cggtttgcca 36061 tctgaaggct ttagtaaagt cactaggtcc cacagaacac tgggagattc ttgagctgtt 36121 ggtggcacat cttgctcaag ggcaggcaca tccaccaaag tggctgtggt ggaaaaagca 36181 ttcattttag gtgtatcaga ttgggctgag atcccagttg ttacttagca gccaacatcc 36241 ttaagttact cttgtggcaa gtttgcttcc ccatctctgc ttagctttcc agcttcatct 36301 tccaccccgg cagaccacac cccagcctgc taaactccct gtccttctct gaatctctga 36361 gctctcggct ctattgcttt gggattcatg ctgctatacg tgccaggaag tccatttctc 36421 ataccttcat cccccaatcc agctatttct tgcccatcct tgaacagtga catcagctat 36481 tctgaggatg tgctactcac gcaggctgga ctaggtaccc tttctatcct ttggcaggct 36541 gtactaacct ctgttagatt taaaatattg tttttgtaat tgcttttacc tgtccccacc 36601 aaaagacagt tgcccatata atcatcaaaa actatttgtt aaatcaatcc agagaagtaa 36661 ttagcactta cttctcaggg ttgttcggtc aatctataaa ctgtaggctg ttatttggtc 36721 tactgttgat cccagcttct ctatctttga acagtgccca ttaactcaag aatatactgc 36781 ttccttgttc tctccttctt gcccctctcc atcatcccct aaaagccccc ttctctgaga 36841 gggtaacctt gagcttctgg gttttgtaga tgtctatacc tgagatccga caatgtggta 36901 gaaggcaact gtgctgatga attactacaa aactcccatc gcgtcgtgga ggagtacttt 36961 gtggcccccc caggtacgtg ctgcccagaa tggtttaaca gatagtctca cagtaacctt 37021 aggaatgaag caggaagcat gaggaccaaa gatgctatgt gaaggcactt tgaaaaggat 37081 gaagtgcata cagatggtgg cattaaggtg gcagtagggt ccagggtcat actcacctct 37141 agggggcagt tagtaacagg aacaagcaaa acaggattgg gggcatgcgg gaggtttaaa 37201 gtaggtggaa atctggagct caagatctcc ctaaaggggc agctggcatt cagtgtttct 37261 caaagccaca gttaagaccc tgccagatcc tgtaatcgac tccaactttc tagtattcat 37321 accatgaata aatatttgtt aggtctagct ttgggctggg taccacaaaa aaaccagtcc 37381 tttcctaaaa gaatcctttt tttttttttt tttttttgag acggagtttc gctcttgttg 37441 cccaggctgg agtgcaatgg tgcaatctcg gctcaccgca acctccacct cccgggttca 37501 agcgattctc ctgctcaggc tccccagtag ctgggattac aggcatgcac cagcacgcct 37561 ggctaatttt ttgtattttt agtagagaca gggtttcact gtgttagcca ggatggtctc 37621 gatctcccga cctcaggtga tccacccacc tcggcctccc aaaacgaatc ttattgtgac 37681 aacttagtta tcatgtgtgg ccttagccca gatgtttctg tcacatcttt gcctatgaca 37741 tgattcaaac gccattaggg tctgccctca gtctgtgccc attcaaagca ggaggagcat 37801 ggctgctacc ctatcagtgt tgccaggcag gaatgtaaac ccatttctcc agagaacctg 37861 gatacctgac attttgtgtg atatttctta aatttttttt tttttttttg agatggagtc 37921 tcgctcggtc acccaggctg gagtgcaatc ttgactcact gcaacctctg cctcccgggt 37981 tcaagcaatt ctctgcctca gcctgccgag tagctgggat tacaggtgcc tgccaccatg 38041 cctggctaat atttttgtat ttttagtaga ggcaggattt caccatcttg gccaggctag 38101 tcttgaactg gtgatctacc cgcctcagcc tcccaaagtg ctgggattac aggggtgagc 38161 cactgcaccc cgccgatatt tcttaaattt taagtgctag caaccaattc aaaatttaaa 38221 aaacaaaata ccatccaagc caaataaaac acatctgcaa gccaggtgca gtctgtgagc 38281 catcagttta caacagctgg tatagagatt tgcatcatct acctactgta aatattttaa 38341 atagaagctt taaaattact ctgggctgac agtaaaatcc agaaccttgg ataggttgac 38401 tgtttttatc tcctctgcag gtaggtgaat acttggagct gttgtactac ccctctgggc 38461 atctgggagt gggactagcc aaaagcaatc aggtagcctg ctaagtgcca acttccagtt 38521 caggatatcc tgagggagaa tcattctgag cagactccac agaagacatt ggccctggtc 38581 agcttaaggt gccagttaaa ggcaggctct ggttgatgca aggatgggaa agaaagctag 38641 aaattttgtg agctaaactt tttgtttgtt ttgagacgga gtcttgctct gttgcccaag 38701 ctggagtgca gtggcgcgat ctcggctcac tgcaagctcc accttccacg ttcatgccat 38761 tctcctgcct tagcctccca agtagctagg actacaggcg cccaccacaa cgcctggcta 38821 attttttgta tttttagtag agacggggtt tcaccgtgtt agccaggatg gtctctctct 38881 cctgacctcg tgatcgtccg cttcggcctc ccagagtgct gggattacag gtgtgagcca 38941 ccgcacccgg ccgtgagcta aactcttaat catgcctcca atccactaaa ccacctgtta 39001 actgtggaga gaaaggagac tctccatctc agagcctgat agttgtggct ccagcagaaa 39061 tatcttttga tcagacatgc tatctcccac acagaacctt acttcagact ttgctttttg 39121 ggaccacctg agttcatggt cagtgtttcc ttctagtcct atctgcatat tagtggctaa 39181 tcaacctgta tactccttac tgaaaaaggt gaatttggct acacttagaa ctgattcttt 39241 cagatactcc cagttaggaa aacaggacag tagctacatt acttccatat tttccatgag 39301 aaagtagtga tcagttagct cagtagagcc agctgggcgg caggctcagc tataacattc 39361 tctgttgtgg ccgggcgcgg tggctcatgc ctgtaatcct agcactttgg gaggccgagg 39421 agggcggatc acgaggtcag gagatcgaga ccatcctggc taacacggtg aaaccccgtc 39481 tctattaaaa atacaaaaaa ttagccggga gaggtggcgg gcgcctgtag tcccagctac 39541 tctggaggcc gaggcaggag aatggcgtga accccagggg gcggagcctg cagtgagctg 39601 agattgcgcc actgcactcc agcctgggca acagcgagac tctgtctcaa aacaaaaaca 39661 aacaaacaaa caaaaattct cttttgttat tttcaaacag gtaatatctc tttgccaaag 39721 ctggatgaac aagagccatt cccacacagc tgagtagctc attctggaaa gggggtactc 39781 tgtgaacatg tggaagcata atgacagtat ttttttactg tgaatactaa tgttcctgct 39841 tttttcagtc ccctgaaaaa atggatgctc aagcatttct taataacaga ttcttctgaa 39901 gacagaattg ggaaagatct ggccccaaca aggcagtgag ttcctgatgc taactgaggt 39961 gaaagaaaag caaaagtcag cttccaagga attcacttaa caggcctgtt cagtatggaa 40021 gacattattt atctgccttt aactcccccc aaaggaccat accaactgca tgaaagtgaa 40081 cttttctatc tacgtaactg gtagacggag catcttgatc actatgtgac aaccttggct 40141 gtcattttta gttgccattt gcattgattt gagcagccct atctttaccg aacatacctg 40201 aatttgttcc tgggctccca ctttcttccc agaagagggc taacttccta ctaaggtctg 40261 aagagtgttg aaagtagact agagcttggg aactcctaac ctagaactat ctgccatccc 40321 acaaagtgat tatatgccaa agggatacta gtcataccta gtgtttctct ttctgaaaag 40381 agaacttatc ctaaaattag ccctgggcct gggacaaagg agccctctcc gcccccaaaa 40441 tgattattaa attgagatga gtccaggata aaactcagat accaaggata aatgaaactt 40501 atttagggat aaaagtgggc tgggcgcagt ggctcactcc tgtaatccca gcactttggg 40561 aggctgagac gggcggatca caaggtcagg agatcaagac catcctggct aatgcggtga 40621 aacctcatct ctactaaaaa tacaaaaaat tagctgggtg tgatgacagg tgcctgtagt 40681 cccagctact cgggaggctg aggcaggaga atggcgtgaa cccgggagac ggagcttgca 40741 gtaagccgag atcgcaccac tgcactccag cctgggcgac agagcaagtc tcaaaaaaaa 40801 aaaaaaagtg atagaatcag agagtttttc ctctaaccaa actgccaaag ttggttttgg 40861 ctaagaattt cccaataata tttatgctgc tgctcatttt tttagttttc tgagacaggg 40921 gtctccctct gtcacccagg ctggagtgca gtggtgacag atcactgcag cctccaactc 40981 ctgggctcaa gtgatcctcc cacctcagcc tcctaagtaa ctgggactac atatgtgcat 41041 caccacgtcc agttaatttt tttttaagta gaggtggggg ttttgctatg ttgcctatct 41101 ggtcgtgaac tcctggcctc aagggatcct ctggcatgcc ttggcctccc aaagggctag 41161 gatcacaggc aaaagccacc gtgtgtttat ttttcccatt cccttccttt atctgcaatg 41221 cttttttctt taaggcagcc taatttacaa gcttggcctt gaattaaaag agaacccaga 41281 acccgctctt ggagtttacg ttctcaaacc aacatcagag caactgtcta tccccatata 41341 aataaagtta tctacccctg ctcctaactg ggtacaagtt atgactacaa ctcagtgatt 41401 ttttaaatta gtgtgcctat ttggtgaaat ctgtgaactt aatcaaggac aaccacacat 41461 gtcaattact aaacatttaa aatatatttc taaacagaat gggccgactc agtcacagta 41521 actgttgatc tccatagtag agcaacccac aaagacagaa ctgatttttt tcccataatc 41581 aggggtgaaa aatatacaac ttgtttctga accaaaacca caatttctgc agtttaaaat 41641 gtttcactgc taatatggcc ctggtagaaa ttatgtagtt ttttttcttc tttaaaaaaa 41701 aaaaattaaa aaaatttcct aagacactaa atcctcaatc tggaatgtag attctgagca 41761 caaagcagct cagttaacct aaaaaataaa gaaaaaattc ccatcacctg tctcagtagg 41821 gcctgaaagg agagaagtag tgtggggaac ccctgctttg gtatggagag tcacggcccc 41881 ttgacccaga ccgagaccgt gagtagccat agctggtgct tctctcagga taaactcgga 41941 tgtaggaagt ttcaccctga aatgcaaaca aaaacaaaaa gagtaaaggg gaaaaaaatc 42001 agagccagaa gaataagcaa accaacatct aacaataata gttaagtatt gagcacttac 42061 tgtgtactct gtgcctggca tgaggctatc tcattaaacc ttcataacat tatgaggtag 42121 gtactcttac tatcccattt caagaatgaa gaaaattaag acttaagtta tatcatttgc 42181 ccaaggtaac acaataaatg ccaatttgac atgttgatac tcaattaact atgataaaca 42241 atatatctga attactgcca ttgacccccc tcaaagctaa gaactggaaa ggatctcagg 42301 ggtcatatag tttaatcccc tgccacctca tgtatgcatc tttggatggg catctatgga 42361 aggagctagc ctgtatttaa taccttccag atgtgaggac ctgagctaga tgctttagac 42421 gttatcctct tgctctagat tcctggcaag tcttgtgatg gtgctattat gtgaccattt 42481 tgcccatgaa ggaatcgagg tcccaagagt agttcaggta aggaattaat aagcagcaga 42541 ggagtcatgt ccaggctggc ttgtccccag gccctctgcc ttcagccacc attctcagaa 42601 gatccaaaga tgccaagggg aaagaagccg gatgcttttt caccttaggt gaaagtcagt 42661 aattggagtt accctttctg aggccctgct ttgcagtata actgaggatc aggtgctgtt 42721 ctgtttctgc ttcaaattca agtcctagag gacagtctga acaaaagcat cattaaacca 42781 gggaaggacc aacctctaag ccaaaaacta ggcaattata tgaatcctca gaaaaagaaa 42841 tgagcccaga gacctgtgtc cccaaacatc cgtaaatctc atagtaaggc aggacaggat 42901 aaccagaaaa aaacaatgtc caactggtct ggtttctggt ctcaatggta ccaaagggat 42961 agtcaccacc tcaatcaaac tgagatccaa aacagcagct gctttcctac aaggaaagac 43021 atcaagaact actgagtctg aagggtgaca taatcaacca caagaaaatg ttacggttta 43081 gcctggcaca gtggtgcatg cctgcagtcc caactactgg agaggctgag gtgggaggac 43141 tgcttgggcc caggagttca aggtcagcct cggcaatata atgagactgt ctctaaaatc 43201 aaaaattaga gaaatataaa aattataaaa ttaaaaaaga aaatgtcatg atagctcagg 43261 tttcagagag gatgctgcac tactctggga aaaaagtgaa gtgacagctc agccctgaca 43321 gcaggaccta gaagggctga gatgaagagg ctgacctgta gagctcagcc tggacccaca 43381 ggaggggtag ttgagttcac agagggatac tgccatttaa gctggttgtg aatggggtgg 43441 tatgaagacc gactgcacta gtattctctc caggttacaa ataccatgac caaactgtgt 43501 aatgtgatac taatatgcag aataaagcat attaaaaaac aaacgtacca tgaatgcata 43561 cttctttcag ctatgaccct gaaggactta ggcatagcaa attccatggc tttgtccacc 43621 taagatagat ctcttatggt actgtgaggt taccaaatgg catggcctct tgctacctct 43681 gctaggtcag gttcaagtgg agtgatttga gtacaagctc ctcaacagaa ggcaaggacc 43741 cacctcatga gagcggaatt tggtgtcatc cagtttacgc agggcatatt ccatgtcttc 43801 ttttctgaga tactcgacca tccccactcc atccttctgc acatcagcat aacagacatc 43861 cccagcttct cgcatgtgat ccttcaggtc ctgccagctg cctgacggag gaagtcctag 43921 gcatggagaa cataagaaag cccacatcct tcagctatgt taactgccct gttaagagcc 43981 agcaatacct atggtccttt ctccatgatc acttaaatcc tgagggtccc caaatcataa 44041 accttgataa gggacatgaa atcagaaaag agcactgaag tatgtgtggt aaatgaacca 44101 ccttgagaag ctggttttaa gttcatcttc cagagaccag gccttttcaa cagagcaaga 44161 agtccatccc attacaggtt tagtatcttt gagaattcat agcctctttc tttgtccttt 44221 aaaatcaatg ccaaactcct aatactttga gttccaattt tttccattat gaaatttaac 44281 agtaccaatt atttattgct tgacagggtt tcacaggaca tgggaatttt gaaaccagac 44341 atgaaataac caaagagtag ctgaaggtaa caacacaaca acaaaaaatc taaagtttgt 44401 tttcttgtag gtatgaatga caaactatac agagtacctc ttaaaagatc accaacattc 44461 ttcaatacag tgggggggaa ggtaagactc agatgctaaa actagtcggg agtcatacaa 44521 ccctacctga tgtcatgaaa gggaaatgtg ctcctgctct caaaggctgt gtttaccgct 44581 cacagctcag gactgcagcc atttcctaat agttttctaa gcaggatttc agagcagcat 44641 ggagagaaaa aaagataatg tatgacatcc tgaaggaggg aatagagttc tggggttaaa 44701 aaattacaaa gggacacaag ataggcagat actgagagta tcactcttaa gcacatggga 44761 gtatgctgtg attgtgtgac attcacatct taaaaaggtg ccaggctcct cgatttattt 44821 cattccacca gggccctaaa actcccatgc aatatgcaga aggccccagg tctttccagt 44881 gtgttaatat tttgaagata gataaacctc ccagtttagc atttcatcac ttataaacac 44941 tagacagaaa tagtctatac tttccaataa aatctctaag actcttaagt tacggggttt 45001 taaaaaatat ttaattctaa aaccaccaca aaatctgatc ttcttcagga tccaaggctg 45061 ggtaagttgt tcaaatctgg ctccttgaat gcctttctac catgtctgat gccaagtata 45121 ttccaaaaga agcatcttta gaattgtagc tcaagttttc ccatcatctt tccttctcac 45181 cactagaaaa ctcctttaca agaaatatgg ttatgttctt ccaaggctgc actcccagcg 45241 caatttcatt tgcaggatac tttcctcatt ggcaatccag ggcaaggctt cacacaggct 45301 attgaaaaga aagcccccgt gtcccatgca ctgaagaata acagctatca tttcttggaa 45361 gacagactct gtgttaagta ttttacatgt atcatctcat tctgtttgaa aggaacatac 45421 ctgaaacaag aactcggaaa tcagatcttc ttgtaggagg cccattcctc ccaccacggg 45481 gccacccacc ccgacctcca taagtcctgg ggaactccac acgaagccga cactggccat 45541 aatcataacc atttcttcca taaatagcat cctctgcatc tctaaaaaaa acaacaacaa 45601 caaaaaaaac gtttggagtt agtgctgttc aggaggccaa gtgactttcc aagtgagggg 45661 catgagtagt aagcataaaa gggaaaccta gagtgaagaa aaaaacacac aaatcaggat 45721 cctgaagcac ttctgctttg gtttggagac gtttcatgat actggaatta acctttacag 45781 ctgtcacacg cagttttaga ggtatgaaat tataatgaat gagcaaacag attactcata 45841 agagggctaa ttcagagaaa aaaaagaatc aactgctgac agattattga ttttgccaat 45901 aaaccatcag tgtttttcta atcttcgtga gcattttctc gaaaagcaca ccacacacta 45961 accccttaat atttttctac aaagcctagt gccataaagt ctatggaaat aaaaagtagg 46021 cggttagcca ggcatggtgt tgcacgcctg tagtcccagc tacttgagag gctgagatgg 46081 gagtgcagag ccaagatcac actactgtac tccagcctgg gtgacagaga ccctgtccca 46141 aaaaacagga aaaaaaaagt agccagttag ataggagagt atttgagggt aaataaagta 46201 gctgggcatg atgtttgagc ccaggtcagg ccagcctggg taatatagtg agacccggtc 46261 ttttaaaaaa ataaataaat aaggctcaaa ggacaaaaga aaaatgcaaa tgagtgtttt 46321 gtgatgaatg tggaactttt tttttttttg aaacggggtc ttgctctgtc accaaggctg 46381 agtgcagtgg ctggagcata actcacagca gcttctaact cttgggcgca agcaatcctc 46441 ccacctcagc ctcctgaata gctagggcca caggcgtgta ccactgcacc cagctaattt 46501 ttggattttt tgtagagaca ggtctccctc tgctggccag cctggtcttg aactcctggg 46561 ctcaggcgat ccttcccctc agtctcccaa agtgctagga ttacaggcat aagccactgc 46621 gcctggcctt ggaactctaa gtgtaccttt tcagcatggt ttggagaact atttttgtca 46681 aaaaacattc tttggataca attttgccaa aagagataaa tattcaataa aattacttta 46741 agtctttcct aggactctag gagagtcagg aatgccacgt tgattctcca gtattggatg 46801 gtagaaagtg tgaccttgga gcttgggtgg acaagaggag ctgaatgtgc taatgaaggc 46861 attgttcagt gtcacaggca cataatcact tgtaagattt tagaactaaa agcaacaatg 46921 accatatcca gtgcaacctt gactacgtaa aatgcagact gggagaggca gaactgctag 46981 gaggtcagaa caactctgaa taaaaactgg ccagcattca ggccaggcgc tgtggcacat 47041 gcctgtaatc ccagcacttt gggaggtgga tcacttgagg ttagcagctt gagaccagtc 47101 tgggcaacct ggtgaaaccc ctcgtctcta ctaaaaagat aaaaattagc tgggcatggc 47161 atcgcgcagc tgtaatccca gctactcagg aggctgaggc acgagaatct cttgaaccca 47221 ggaggcgaag gttgcagtga gctgagactg tgccgctaca ctccagcctg cacgacagag 47281 ccagactcca tctcaataaa tacaataaat aaataaaatt agcctgcatt caaatcacta 47341 ttctgcaaat tactagtagt catctctggg taagtgttta atcaagtttt tacatctcta 47401 atgtgaacag tatgcctatc tcacagggtt attgtaatgg ttaaataaga tactacatga 47461 aatgctttgc ttgaatcttc acatctagtg acaccaaaag atagtttttc tattatgaaa 47521 ggaatgaggg aagatgacaa tgtagcgtat catatagtaa gtttgcattc gaaaaacaga 47581 agagggagtt caaatccatt gatcccggcc tggcgtggtg gctcaggcct gtaacctcag 47641 cattttggga ggctgaggtg ggtggatcac ttgaggttag gagttcaaga caaacctggc 47701 caacatggag aaaccccgta tctgctgaac atagaaaaat cagccaggca tgatggcggg 47761 cgcctgtaat cccagctact gggaagctga agcgggagaa tcgcttgaac tcaggtggcg 47821 gaggttgcgg tgggccggga tcgcaccaca gcactccagc ctgggcaaca agagtgaaga 47881 cctgtctcaa aaaaaaaaaa aaaatagtac ctacacctca taaattgtta aaaatactaa 47941 atacaggctg ggcgggttgc tcacgcctgt aatcctaaca atttgatccg ctgagacggg 48001 agaatcacct gagctcagga gttccagacc agcctgagca acatggtgaa acccctaatt 48061 tatcaatcta aaaaaataag aaggaaagaa aaaaagctaa atacacacca tggcacttga 48121 cccatagcaa ctgttgaaaa accacaatgg cacggttact aaataagcga ggcatagcct 48181 tctccggctt aggggtaaaa atgcccagtt ccggttttga ggcaaatgag gagatactag 48241 ggatgagctg tcagagtcga ctcttcagcc atcaacgcct cgcggtgctt cctgcctgcc 48301 cttcctcaga tgctcccctt gccctatcaa tacgtccacc ccaggtaggt cggacaccct 48361 acgaacattc cgcgttcccc gggagcaaat gatgctagga aaacagggtg cgggctctta 48421 tttggccgga gtggagtgac caggtcagcg ccgcagctgc taaccaccac caaccacagt 48481 tttgaaagtt cctgggcttt ttccggcctg agggccgacg ctgtttgcaa aatcacaagc 48541 actccttaga gcggggttag gaatctagag tttccaaatc cctggaacct ttagcatctc 48601 gccaagcctc cctcatcaaa ggagggaagt gaaatcagag ccccactccg gctgttttag 48661 aagttttccc gaatccgtga tccctttaca aagcccgggc cctggcctgc ggggcggggc 48721 gcgcgaacgg cgagctattc caccagagat ccctccacga cctccagcgg catccgcgac 48781 cctgcagctt gctctccagc cacggcgagc acgggccggg ggagaggcga gcgggaaggg 48841 cgaacggcga cccccgcgct gccccgcggc cgcttccccc tacggtgctg ccccaggcag 48901 gaggcggaca aggctcattt ggaccccccg aggccggtgc aggcgggctt gggccctcca 48961 aaggctcagc ttttgggttt ctaagtaaaa tattggaata ataaaaataa aggaagtgac 49021 gacggcgcga aggacggccc gggcctgcga ggagaggctg cctcatcctc cacccgtgag 49081 ggggcgccgt gaggggtctg cgcggaggcg gggggagggg aggccggggg gaggggagcc 49141 ctgggggagg ggacagcagg gaaggcgggg gcctcaggca ccgaggagga gagggcaggg 49201 gcgcgggggc ctcaccgggg gtcctcgaag cgcacgaagg cgaagggcac gaggccgtgc 49261 cggttcttga gctcgatctc gcggatgcgg ccgtacttgt agaacaggtc ctccaagtcc 49321 ttctcgcgca cgtcggtcgg aaggttcccc acgtagatgc gcccgtcgcc ctcgccgccg 49381 cgctcgtccg cccagcccga catccgcacc gcccgacgcc gcgggcccgc cgcagcccac 49441 gtcgccgccg ccgcctcagc acgggtcccc ccgcagcgtc cccgcgggct ccgaggcgct 49501 cagccgcact gcattgtggg aacgcggagc ggaagcgaag gggtcggcgg aggcaaaagg 49561 agtcctctta aagagaacgc ggctgcgacc attgcgcgtg cgcgcaggcc ggcgccccct 49621 ggaggtgcga agggccttcg ggcgctgaca gggagagcct ggggccgggc cgtgtggatg 49681 ccatccccga gcgcggttcg cgctcggctg aggcgctgga caagtggctt gggctcccgc 49741 gcctcagttt ctctctgtgg cgccgcctac ctcacagact tgtgagcact cactgacgtg 49801 ggtagcgccc agggcctgcg gggcgcagga gagctggagt caggcggaga ccgcaggctg 49861 accccgcagc ggccgggctg tcgcggcccc cacctcaggt cagtaggccc ccgggccacc 49921 ctgtccccat tttacagttg ctcagaactg acggtgtcag catgagatgg actccatagc 49981 ttggccttga gtttgtcgga gtcgagggag taagactctt aatgagtatc acagcgatca 50041 ctatagagtg catccgactt gtgcttttca tcctctcccg ggcagagtct gaggccacag 50101 aggggtcagg aatggaaccc agagccatgc tcccatccta agtactgagt ccttatggag 50161 ctcaggtaga cacccacctt ggcacttact tttccacatt cacatcccag aacgcggagc 50221 attgaaccca gggaccttac agtgaggcct tgcaccagaa gactttttgt ttgtttttga 50281 tacagggtct tattctgttg ctcaggctgg agtgcaatgg cgcgaacacg gctcactgca 50341 gtctcgatct cgctggactc aggtgatcct cccacctcgg ggcgccgtga ggggtctgcg 50401 ctgcgtccag ctagtttatt ttctattttt tttgtagaga ggctggtctc gaactcctgg 50461 gctcaaccga tcctccagcc ttggcctccc aaagtgttgg attaccggct tgagcccggc 50521 gcccggctgg aatactccta acctaaagat tgttcaggct gggtgcggtg gctcacgcct 50581 gtaatcccaa cactttgaga ggccaaggcg ggaggatcac ttgaggtcag tagttcgaga 50641 ctagcctggg caacatgggg aaaccccgtt tctactaaaa atacaaaaat tagctgggcg 50701 tggtggcgtg ttcctataat cccagttacg cgggaggctg aggcaggaga attgcttgag 50761 cccgggaggc gaaggttgca gtgagccgag atcatgccac tgcacttcag cctgggtgac 50821 agagtgagac tgtctcaaaa ataaataaag atttttcata ttaaaaatga aacctggctg 50881 ggcgtggtgg ctcatgcctg taatcccagc actttaggag gctgaggtgg gtggatcact 50941 tgaggccagg agttcaagac cagcctggcc aacatggcaa gacccgcccc cctactaaaa 51001 atacaaaaaa aaaaaaaaat tagccaagct tggtggcacg cctgtagtcc cagctgcttg 51061 ggaggatgag gcacgagaat cgcttgaatc tgggaggcag aggttgcagt gagccgagat 51121 tgtgccactg cactcctcca gcctgggtga cagagggaga ctctgtctca aaaacaagac 51181 aaaacaaaaa caaacaaaaa aagggactaa aaccacacat ttaatcagaa ggtagttgat 51241 ttctcattat accatctgag cattttaaaa tcaaaacaaa ccaaagacta gacacccaca 51301 tatactactg agtgtatatc ttttttggaa tacaagttga tagtatacat caggctttca 51361 taatatacat atacccttcg atcttgtaat tctgcttgta ggattttttg tttgtttgtt 51421 ttgagacgga gtcttacact gtcacccagg ctgcagtgca gtggcgtgat ctcggctcac 51481 tgcaagctcc gcctcccagg ttcatgccat tctcccgcct cagcctcctg agtagctggg 51541 actacaggtg cctgccacca cgcccggctg atttttttat atttttagta gagacagggt 51601 ttcaccgtgt tagccaggat gatctcaatc tcctgacctc gtgatccgcc tgcctcagcc 51661 tcccaaagtg ctgggaatac aggcgtgagc caccgcgccc agctgcttgt aggattttta 51721 tcagacggaa ataagcaaga gattctcata ataaattaca tgtgtatatt tatcattgca 51781 ttctgtgcaa ttaaaaaaat taaacttgat aataagggag tggctcaaca cattaacttt 51841 gaatgttaca caattattaa gtcatgtttt taatatgaca gagatattaa agtcacagta 51901 atttacagtt aaagtgaatt gaaagtggtg tttaatgtat acacatatat agtgtatttg 51961 ctgcttgtaa caaaaaaaaa atagaaaaag ctttccaaat gttgtggttg tggagatatc 52021 tcttgatggc agtgatttct atgtttttat ttatgctttt ctgtatttct caaattgtaa 52081 ttagcatgca ctacttttat aattggaata aaacataata aatgattcct aaaataaaac 52141 cattataact atctgttgta ttctggatta tgtaaagcat ctagctgttc tctagacaag 52201 caggaaagag ccctggccaa atatgatctc gagtctaagt gtgcagcctt ataggagctt 52261 acatatccta taggcatccg aaagtccaga tgccttctca ttttgggtgg tgtgcttagg 52321 tgtattaata ccattctctg agtcctacca ctcctatgtg attgttgcct ggaaagacaa 52381 aaaaggaaaa aaaagtgctg ttggagacaa ttgggcagtc tcctagggag agagaaatag 52441 agtcagctaa gtgactcaat aggactgtag gtggaaaact gcttgcaaat aataataatt 52501 cagctttaat tgtgcaatgt cctgcagcat acaatctcca aaggagttca gttacccagc 52561 agaaaatgta gggatgtggt tagtacaggc ttcttggatt agatagagca cagctaaatc 52621 cttaggaggc atttatgatc atccatgaaa ccccctttat tcctgtactg acctttgcag 52681 ttagaacaca aatgagacca caattatgac ttcatgttca aagggttcat aggttccatt 52741 ctctgctaag gccagagact ctaggcaggg acacctttta aactttttgt tcatctagat 52801 attcaacatt atttatgatt taggtcctgt atgaggccca gggcatgcaa taatacagct 52861 ctgtccttga gttgtgtata taggttgcta ttcggtcatg tagagtacag ttgtgataag 52921 gataagctgc ttgagactac atcggagggt cacctaccac agtcttagag attctaagga 52981 aggcttctgg gaggagatgt ttagagtctt gaagaaccag taggagttaa ccagggggaa 53041 gagggaaggg catttcagac aaaaatgagg gcatgactgc tgagtgcagt ggcagtaatc 53101 ccagcacttt gggaggctga ggcaggagga tggcttgagt ccaggagttt gagaccaggc 53161 taggcaatgt agtgagaccc ctatctctac aaaaaactta aaaattgcca ggcccagtgg 53221 ctcatgcctg taatcccagc actttgggga ggtcaaggtg ggtggatcac ttgaagtcag 53281 gagttcgaga ccagcctggt caacatggca aaacctcatc tttactaaaa acacaaaaat 53341 tagtcggagg tggtggcgca cacctgtaat cccagctact ttggaggctg aggcaggaga 53401 attgcttgaa cccgggaggc agaggttgca gtgagccgag attgcgccac tgaactccag 53461 cctggtcaac accgagactc tgtctcaaaa aaaaaaaaaa aaaaaaatta aaaaattatc 53521 taggcatggt aacgtgcacc tgtggtctca gctacttgat ctcgaactct tgagctcaag 53581 tgatccgccc atctcggcct cccaaagtgc tgggattaca ggcattagcc acggcacctg 53641 acagtcccag ctacttaaga ggctgaggtg ggaggactgc ttgagcctgg gaggttgaag 53701 ctgtagttag ccatgattgc accactgcac tccagcctgg gcaacagagt gagactctgt 53761 ctcaaaacat aaaaaaggac accatgagaa atcacccaaa taggagagag agtagcagac 53821 cagggaaaat ccctactggt gtgactgggg caaggagttc aagcagggaa gaagagagaa 53881 ggagctgagg caccaccagc ctgggcaaca tgttaaaacc ccgtctctac aaaaaataca 53941 aaaattagct gggcatggtg gcacacaccc atagtcccag ctacttggga ggctgagggg 54001 ggagaatcac ttaagcccag gaggcagagg ttgcagtgag ccaatattgc accgctgcac 54061 tccagcctgg gcgacagagt gagatcctgt tacagaaatg ggcttgtcct tcttttcacc 54121 aaactgaaaa gaagcctttg atgttggcag actattaaga attaatgcag ttcccacagc 54181 aagccattta ttctccacca gagtgtaggg cagtgcccaa ctgaaaaaaa caacataaaa 54241 aaaaaaaaaa atcccaaaac caacaacagc tcgggtgatg agtgcatgaa aatctcagaa 54301 atcaccacta aataacttaa agaacttatc catataacca aacaccacct gttccccaaa 54361 aacctattga aataaaaatt taaaacaaac aaaaaaccaa caacgccagg cataaaacat 54421 gctgcaaacc tgcccatctg gttttctcct tttctctgtg acatcagtcc cctctggcat 54481 cccagcaaaa tattttcctc tctgtcaggc tcagtcatga gctcctccca gaccttggcc 54541 ctaggtatca gtgggcagag aggaagtgat gggggagtag accatctgag aagatgagga 54601 actggtcccg cctggcagga gagctatgga atgagtcact gggaaacgtg ttcctggtca 54661 cagcaccctg ggagctttct ctgtcattct ctaccactgt acttgtgtga tggagtaaaa 54721 taaaacatat tctggaaagc tacataataa agaactgagc catgcagggg aggaagaaga 54781 gaaacagcag cagaactagg aggtttggaa gagagtttgg aattgggggt gagctttagg 54841 aaagcaagtg tcaagtgtgg gaggatgaga agagaaaaat gggaaagata attcaagatt 54901 tgcagtatat tgtgaatata attcccctct ggtgtttttt tcagagactg caaaaaatct 54961 ttaacattat ttttcaataa cttttagata atgaactggt gttttctttg aatatgccca 55021 aacaggtggg aacacaaggg cttgatttta tatgaagtta cttcaagtga acatactgta 55081 tttgtagcag ctcggttgct ttatttttac taacttgtta aattctgtga tggagtcttc 55141 aaacagcttt ttatatcagg gcttcctttt cttggattaa tgggcttctg ggatcagtga 55201 actcctgcct ccctgccccc tcaacctcca cctcctgacc ctgatcctgc aaagttgtaa 55261 cagttgtatg caaaatattg tatgtgggca gaaaagtgtt tttattctgg ggagaatctc 55321 catgtagcac tgtcccacag aactttctag gatgatgaaa acattctata tttgagttct 55381 atgtggtagc cattggttac atgtagctat tgagcactta aaatgtggct actgcaactg 55441 agaaaccaac ttcttaaatt atatttaaca ataattaact ttaattgtaa tagccaaatg 55501 tggctagtgg ctaccataca ggacagcgta ggaatttatc attctcaaag gtctataagc 55561 tgtactcccc cattccccca aaaaaggcca agaaacactg agaccaatca cttcccttta 55621 cagccctatg gtactggcac atagtaagtg cttagtaaac acctgttgat tttattaact 55681 agtatataaa gaacagttat tctagctcca gaaaagtgga actgaggtac aaaatggcaa 55741 atgagctcat caggttcacc aaatttcatt tcctaagctc taaattcctt accctcaacc 55801 cagaactttt tccaaaacat tttccactgt aacaaacctt gaaaactttc ttcacagcag 55861 agtaaatcag aaacacaagc aaagggtagg ggggtgtgaa aagattaata acagtactaa 55921 gaatgttttt caatgcaaca aaaaaggatg catttattta gtagaatcct aattttttta 55981 gacccaagtc agttttatac taagcagatt tttaatgaac tgcgtcaata taattcagta 56041 atggcacctc caaggcaaat acattcctta aaggtaaaaa acaagtatgg gaattatctg 56101 gagttttttt tttttaagtt aaaaaaaaat aataataact gaaagccagg aagcttctag 56161 gctacacatc ccccatgcta acacatgccc agtggctgaa cttccttctg tctgcctgag 56221 gctggctctc cccctcagcc agcacatgtg cagggcttct gtgaatgtca cattgacaga 56281 cattccttgc taggcagctg gggaccagtt tcttttgtaa ttccatcctt ctcatgggct 56341 gccagtctgt cttgttgtga cacactcctt gaaggtgcta gaaggacagc ggatgtttgg 56401 ggggtttttt tgtttgtttt ttgagacaga gtttcgctct tgtcacccag gctggagtgc 56461 agtggtgcga tctcggctca ctgcaacctc cgcctcccag gttcaagcga ttttcctgcc 56521 tcagcctcct gagtagctgg gattacaggc atgcaccacc acacctggct aattctgtat 56581 tttttagtag agatgggatt tctccatgtt ggtcaggctg gtctcgaact cccaacctca 56641 ggtggtccgc ccgccttggc ctcccaaagt gctgggatta caggcatgag ccaccgcgcc 56701 cggactttgt tttttgtttt gttttttttg tttttttttt tgagacggag tctcgctctg 56761 ttgcccaggc tggagtgcag tggcacgatc ttggctcact gcaacctctg cctcctaggt 56821 tcaagcgatt ctcctgcctc aacttcccaa gtaactggga ttacaggcac ctgccaccac 56881 acttggataa ttttgtattt ttagtagaga ttgggtttca ccatgttggc caggctggtc 56941 tcgaactcct gacctcaagt gatccacctg tcttggcttc ccaaaatcct gggattacag 57001 gtgtgagtta ccacgcccag ccaagagcag gtttttttta tctagtgcta tgcttggacc 57061 ccagctttaa gagcccaggg ttatgtacat ggcatctggg actttggtgg tggcttcact 57121 ccttgtctgc tcaacatttg ttcatgtatt caacccacat ttatcgagcc atagaacatt 57181 gagcctggct ctgtgctggg cactggtgct cgggatacaa cagtaagact caaccctgcc 57241 ctcacacc // LOCUS HS15HPGDH 660 bp RNA PRI 23-JAN-1996 DEFINITION H.sapiens mRNA for 15-hydroxy prostaglandin dehydrogenase. ACCESSION X82460 NID g1164906 KEYWORDS 15-hydroxy prostaglandin dehydrogenase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 660) AUTHORS Pichaud,F., Frendo,J.L., Delage-Mourroux,R., de Vernejoul,M.C., Moukhtar,M.S. and Jullienne,A. TITLE Sequence of a novel mRNA coding for a C-terminal-truncated form of human NAD(+)-dependent 15-hydroxyprostaglandin dehydrogenase JOURNAL Gene 162 (2), 319-322 (1995) MEDLINE 96032365 REFERENCE 2 (bases 1 to 660) AUTHORS Pichaud,F. TITLE Direct Submission JOURNAL Submitted (02-NOV-1994) F. Pichaud, U349 INSERM, Centre Viggo Petersen, 6 rue Guy Patin, 75010 Paris, FRANCE COMMENT Related sequence: A35802. FEATURES Location/Qualifiers source 1..660 /organism="Homo sapiens" /cell_line="HL-60" /cell_type="HL-60 leukemia cell line" CDS 1..537 /codon_start=1 /product="15-hydroxy prostaglandin dehydrogenase" /db_xref="PID:g1164907" /translation="MHVNGKVALVTGAAQGIGRAFAEALLLKGAKVALVDWNLEAGVQ CKAALDEQFEPQKTLFIQCDVADQQQLRDTFRKVVDHFGRLDILVNNAGVNNKKNWEK TLQINLVSVISGTYLGLDYMSKQNGGEGGIIINMSSLAGLMPVAQQPVYCASKHGIVG FTRSAAPTIDCQWIDNTH" BASE COUNT 193 a 130 c 166 g 171 t ORIGIN 1 atgcacgtga acggcaaagt ggcgctggtg accggcgcgg ctcagggcat aggcagagcc 61 tttgcagagg cgctgctgct taagggcgcc aaggtagcgc tggtggattg gaatcttgaa 121 gcaggtgtac agtgtaaagc tgccctggat gagcagtttg aacctcagaa gactctgttc 181 atccagtgcg atgtggctga ccagcaacaa ctgagagaca cttttagaaa agttgtagac 241 cactttggaa gactggacat tttggtcaat aatgctggag tgaataataa gaaaaactgg 301 gaaaaaactc tgcaaattaa tttggtttct gttatcagtg gaacctatct tggtttggat 361 tacatgagta agcaaaatgg aggtgaaggc ggcatcatta tcaatatgtc atctttagca 421 ggactcatgc ccgttgcaca gcagccggtt tattgtgctt caaagcatgg catagttgga 481 ttcacacgct cagcagcgcc caccattgat tgccaatgga ttgataacac tcattgaaga 541 tgatgcttta aatggtgcta ttatgaagat cacaacttct aagggaattc attttcaaga 601 ctatgataca actccatttc aagcaaaaac ccaatgaaca gcttatgtgt tagccatagc // LOCUS HS165 4939 bp RNA PRI 11-OCT-1993 DEFINITION H.sapiens mRNA for skeletal muscle 165kD protein. ACCESSION X69089 NID g407096 KEYWORDS fibronectin repeats; immunoglobulin superfamily; sarcomere M line; titin binding. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa/Eumycota group; Metazoa; Eumetazoa; Bilateria; Coelomata; Deuterostomia; Chordata; Vertebrata; Gnathostomata; Osteichthyes; Sarcopterygii; Choanata; Tetrapoda; Amniota; Mammalia; Theria; Eutheria; Archonta; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4939) AUTHORS Fuerst,D.O. TITLE Direct Submission JOURNAL Submitted (29-OCT-1992) to the EMBL/GenBank/DDBJ databases. D.O. Fuerst, Max Planck Institute for Biophysical Chemistry, Am Fassberg, 3400 Goettingen, FRG REFERENCE 2 (bases 1 to 4939) AUTHORS Vinkemeier,U., Obermann,W., Weber,K. and Furst,D.O. TITLE The globular head domain of titin extends into the center of the sarcomeric M band. cDNA cloning, epitope mapping and immunoelectron microscopy of two titin-associated proteins JOURNAL J. Cell. Sci. 106 (Pt 1), 319-330 (1993) MEDLINE 94095665 FEATURES Location/Qualifiers source 1..4939 /organism="Homo sapiens" /tissue_type="skeletal muscle" /clone_lib="lambda gt11 & gt10" CDS 49..4446 /codon_start=1 /product="165kD protein" /db_xref="PID:g407097" /translation="MSLVTVPFYQKRHRHFDQSYRNIQTRYLLDEYASKKRASTQASS QKSLSQRSSSQRASSQTSLGGTICRVCAKRVSTQEDEEQENRSRYQSLVAAYGEAKRH GFLSELAHLEEDVHLARSQARDKLDKYAIQQMMEDKLAWERHTFEERISRAPEILVRL RSHTVWERMSVKLCFTVQGFPTPVVQWYKDGSLICQAAEPGKYRIESNYGVHTLEINR ADFDDTATYSAVATNAHGQVSTNAAVVVRRFRGDEEPFRSVGLPIGLPLSSMIPYTHF DVQFLEKFGVTFRREGETVTLKCTMLVTPDLKRVQPRAEWYRDDLLLKESKWTKMFFG EGQASLSFSHLHKDDEGLYTLRIVSRGGVTDHSAFLFVRDADPLVTGAPGAPMDLQCH DANRDYVIVTWKPPNTTTESPVMGYFVDRCEVGTNNWVQCNDAPVKICKYPVTGLFEG RSYIFRVRAVNSAGISRPSRVSDAVAALDPLDLRRLQAVHLEGEKEIAIYQDDLEGDA QVPGPPTGVHASEISRNYVVLSWEPPTPRGKDPLMYFIEKSVVGSGTWQRVNAQTAVR SPRYAVFDLMEGKSYVFRVLSANRHGLSEPSEITSPIQAQDVTVVPSAPGRVLASRNT KTSVVVQWDRPKHEEDLLGYYVDCCVAGTNLWEPCNHKPIGYNRFVVHGLTTGEQYIF RVKAVNAVGMSENSQESDVIKVQAALTVPSHPYGITLLNCDGHSMTLGWKVPKFSGGS PILGYYLDKREVHHKNWHEVNSSPSKPTILTVDGLTEGSLYEFKIAAVNLAGIGEPSD PSEHFKCEAWTMPEPGPAYDLTFCEVRDTSLVMLWKAPVYSGSSPVSGYFVDFREEDA GEWITVDQTTTASRYLKVSDLQQGKTYVFRVRAVNANGVGKPSDTSEPVLVEARPGTK EISAGVDEQGNIYLGFDCQEMTDASQFTWCKSYEEISDDERFKIETVGDHSKLYLKNP DKEDLGTYSVSVSDTDGVSSSFVLDPEELERLMALSNEIKNPTIPLKSELAYEIFDKG RVRFWLQAEHLSPDASYRFIINDREVSDSEIHRIKCDKATGIIEMVMDRFSIENEGTY TVQIHDGKAKSQSSLVLIGDAFKTVLEEAEFQRKEFLRKQGPHFAEYLHWDVTEECEV RLVCKVANTKKETVFKWLKDDALYETETLPNLERGICELLIPKLSKKDHGEYKATLKD DRGQDVSILEIAGKVYDDMILAMSRVCGKSASPLKVLCTPEGIRLQCFMKYFTDEMKV NWCHKDAKISSSEHMRIGGSEEMAWLQICEPTEKDKGKYTFEIFDGKDNHQRSLDLSG QAFDEAFAEFQQFKAAAFAEKNRGRLIGGLPDVVTIMEGKTLNLTCTVFGNPDPEVIW FKNDQDIQLSEHFSVKVEQAKYVSMTIKGVTSEDSGKYSINIKNKYGGEKIDVTVSVY KHGEKIPDMAPPQQAKPKLIPASASAAGQ" BASE COUNT 1244 a 1212 c 1411 g 1072 t ORIGIN 1 ttctctctcc tccttgcaat tttcctttct gtctgggagc acgccaagat gtcccttgtg 61 actgtcccct tctaccagaa gagacatagg cacttcgacc agtcctaccg taatattcaa 121 acacggtacc tgctggacga atatgcgtca aaaaagcgag cttccaccca ggcatcttcc 181 cagaagtcct tgagtcagcg gtcgtcttca cagagagcct ccagccagac gtccctggga 241 ggaaccatct gcagggtctg tgcgaagcga gtgagcacgc aggaagatga ggagcaggag 301 aacagaagca ggtaccagtc cctggtggcc gcctatggtg aggccaagcg acacggcttc 361 ctcagcgagc tggcccactt ggaggaggat gtccacctgg cacgctccca ggcccgcgac 421 aagctggaca aatacgccat tcagcagatg atggaggaca agctggcctg ggagagacac 481 acatttgaag agcggataag cagggctcct gagatcctgg tgcggctgcg atcccacacc 541 gtctgggaga ggatgtctgt gaaactctgc ttcaccgtgc aaggatttcc cacgcccgtg 601 gtgcagtggt acaaagatgg cagtctgatt tgccaggcgg ctgaaccggg aaagtacagg 661 attgagagca actatggcgt acacacactg gagatcaaca gggcagactt tgacgacact 721 gcgacatact cagcagtggc caccaatgcc cacggacaag tgtccaccaa cgcggcggtg 781 gtggtgagaa ggttccgggg agacgaggaa ccattccgtt cggtgggact cccgattgga 841 ttgcccctgt catcgatgat tccgtacacg cacttcgacg tccagttttt ggagaagttt 901 ggggtcacct tcaggaggga aggcgagacg gtcactctca agtgcaccat gctggtgacg 961 ccggacctga agcgggtgca gccgcgcgcc gagtggtacc gcgatgactt gctgttgaaa 1021 gagtccaagt ggacgaagat gttctttgga gaaggccagg cctccctgtc cttcagccac 1081 ctgcacaagg acgacgaggg cctgtacacc ctgcgcatcg tgtctcgggg cggcgtcacg 1141 gaccacagcg ccttcctgtt tgtcagagat gctgacccgc tggtcacagg ggcccccggt 1201 gcacccatgg acttgcagtg ccacgacgcc aaccgggact acgtcatcgt gacctggaag 1261 ccgcccaaca ccaccactga gagccccgtc atgggctatt ttgtggaccg atgtgaagta 1321 ggaacgaata attgggtgca gtgcaatgat gcaccggtga aaatctgcaa atacccggtc 1381 acagggcttt ttgaaggaag gtcttacata ttccgagtga gggcagtgaa cagtgcgggc 1441 atcagccgac cctccagggt ctctgatgcg gtggctgcac ttgacccctt ggacctcaga 1501 aggttacaag ccgttcattt ggagggagag aaggagattg ccatttatca ggatgacctt 1561 gaaggtgacg cccaggttcc agggcctccc accggtgtgc acgcttccga gatcagcaga 1621 aactatgtcg tcctcagctg ggagccaccc actccccgtg gcaaggaccc gctcatgtac 1681 ttcattgaga agtcggtggt ggggagcggc acgtggcaga gagtcaacgc ccagacggct 1741 gtgagatccc cgagatatgc cgtgtttgac ctcatggaag ggaagtctta tgtgttccga 1801 gtgctgtcag caaaccggca tggcctgagc gaaccttcgg agataacgtc ccccattcag 1861 gcccaggatg tgaccgttgt cccttctgct ccgggtcggg ttcttgcttc ccgaaacacc 1921 aagacgtcgg tggtggtgca gtgggaccga cctaagcatg aggaggacct gctgggctac 1981 tacgtggact gctgtgtggc cggaaccaac ctctgggagc cctgcaacca caagcccatc 2041 ggatacaaca ggttcgtggt gcacggctta accacgggag agcagtacat cttccgagtc 2101 aaggcggtca atgctgtggg gatgagtgaa aattcccagg aatcagacgt cataaaagtg 2161 caggccgcac tcaccgtccc gtcccatcct tatgggatta cgctcctcaa ctgtgacggc 2221 cactccatga ccctcggctg gaaggtcccg aaattcagtg gtggctcgcc catcctgggc 2281 tactacctgg acaagcgtga agttcaccat aaaaactggc acgaggtcaa ttcctcaccc 2341 agcaaaccga caatcctaac ggtggacggc ttgacggaag gctcactcta cgagttcaaa 2401 atcgccgccg tcaacctggc cggcatcggg gagccctcag atcccagtga gcacttcaag 2461 tgtgaggcct ggaccatgcc ggagcccggt cctgcctacg acttgacgtt ctgtgaggtc 2521 agggacacgt ccttggtcat gctgtggaag gcccctgtgt actccggcag cagccctgtt 2581 tctggatatt tcgtggactt cagggaggag gatgctggag agtggatcac tgtcgatcag 2641 acgacaacag ccagccgtta tttaaaggtc tctgacctgc agcaaggtaa gacctatgtc 2701 ttcagggtcc gggcagtcaa tgcaaatggc gtggggaagc cctcagacac gtcggagcct 2761 gtgctggtag aggcgagacc aggcaccaag gaaatcagtg ctggtgtcga tgaacagggc 2821 aacatctatc tgggcttcga ctgccaggaa atgacagacg cgtctcagtt cacctggtgt 2881 aaatcctacg aggagatttc agatgatgag aggtttaaaa tcgaaaccgt gggggatcac 2941 tccaagctgt acttaaagaa tccggataag gaggatttag ggacttactc cgtgtctgta 3001 agtgatacag acggagtgtc ctccagtttt gttctggacc cagaagagct cgagcgtttg 3061 atggcattga gcaatgaaat aaagaacccc acaattcctc tgaaatcgga attagcttat 3121 gagatttttg ataaggggcg ggttcgcttc tggctccagg ctgagcactt atcaccagat 3181 gccagctacc gatttattat taatgacaga gaagtctctg acagcgagat acacagaatt 3241 aaatgtgaca aagctactgg cattattgag atggtgatgg atcgatttag tattgaaaat 3301 gaggggacct acactgtgca gattcatgat gggaaagcca aaagtcagtc ttctctagtt 3361 cttattggag atgcattcaa gactgtgctg gaagaggctg agtttcaaag gaaagaattt 3421 ctcaggaaac aaggccctca ttttgctgag tacttgcact gggatgtcac ggaagaatgt 3481 gaagttcgac ttgtttgcaa ggttgcaaac accaagaaag aaaccgtttt caaatggctc 3541 aaggatgatg ctctgtatga aacggagaca ctgcctaacc tggagagggg aatctgtgag 3601 ctcctcatcc caaagttgtc aaagaaggac cacggtgaat acaaggcaac cttgaaagat 3661 gacagaggcc aagatgtgtc catccttgaa atagctggca aagtgtatga tgatatgatt 3721 ttggcaatga gtagagtctg tgggaaatct gcttcgccac tgaaggtact ctgcacccca 3781 gaaggaatac gacttcagtg tttcatgaag tattttacag acgaaatgaa agtgaactgg 3841 tgtcacaaag atgctaagat ctcatccagt gagcatatga gaatcggggg gagtgaagag 3901 atggcttggc tgcagatatg tgagccgact gagaaggata aaggaaaata cacttttgag 3961 attttcgatg gcaaagacaa ccatcaacgc tcccttgacc tgtccggaca agcttttgat 4021 gaagcatttg cagaattcca gcaattcaaa gctgctgctt ttgcagagaa gaatcgtggc 4081 aggttgatcg gcggcttgcc tgacgtggtg accatcatgg aagggaagac cttgaatctg 4141 acctgcacgg tgtttggaaa ccctgacccc gaagtgattt ggttcaagaa cgaccaggac 4201 atccagctca gcgagcactt ctcggtgaag gtggagcagg ccaagtacgt cagcatgacc 4261 atcaaaggcg tgacctccga ggactcgggc aagtacagca tcaacatcaa gaataagtat 4321 ggcggggaga agatcgacgt gacggtgagc gtgtacaaac acggggagaa gatcccggac 4381 atggccccgc cccagcaagc caagcccaag ctcatccccg cgtctgcctc agcggcaggc 4441 cagtgaaggc gttttcctag cctggagatg ggaaaatatg cttggcagag acaggaatgc 4501 tgtgtgcttg ttccaaatga gcagctggca tccgagtggt gtcctgtgtg ggctgatagt 4561 tgatcacaca ttgtgctttt gatttttgca tttggtgatg aatattttat acccgtctaa 4621 gggagaaagc taatgttttc cacaagactg aacaacgtgt atttacacga gggtagacgg 4681 cagatgcctg acagagagtg ggttggcaga caacacacta gcattttcac gggtgtgggc 4741 acatgggtgt ggcacctgga cgtgtgcagc atgtggcggt ctctgtgtga agccaccgtg 4801 cttctctttg gggggccgcg agatctagca tctctgaaat cctggctgtc gaggctttga 4861 agcatgtgtt acctggttaa gcttgttttc tcttgcttta ggcaaataaa agtttaaaaa 4921 tcaaaaaaaa aaaaaaaaa // LOCUS HS179M20 147708 bp DNA PRI 23-JAN-1998 DEFINITION Human DNA sequence from PAC 179M20 on chromosome 20q12-13.1. Contains adenosine deaminase (ADA), placental protein Diff33, CA repeat, ESTs, STS. ACCESSION Z97053 NID g2813964 KEYWORDS 20q12-13.1; ADA; Diff33; placental protein; repeat polymorphism. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 147708) AUTHORS Ho,S. TITLE Direct Submission JOURNAL Submitted (22-JAN-1998) E-mail enquires: humquery@sanger.ac.uk Clone requests: clonerequest@sanger.ac.uk COMMENT The true left end of clone dJ179M20 is at 1 in this sequence. The true right end of clone dJ179M20 is at 147708 in this sequence. 179M20 is from the library RPCI1 constructed at the Roswell Park Cancer Institute by the group of Pieter de Jong. For further details see http://bacpac.med.buffalo.edu/. FEATURES Location/Qualifiers source 1..147708 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="20" /map="20q12-13.1" /clone="179M20" /clone_lib="RPCI1" repeat_region 1..62 /note="AluSc repeat: matches 62..1 of consensus; incomplete repeat" repeat_region 440..743 /note="AluSc repeat: matches 299..1 of consensus" repeat_region 1357..1384 /note="14 copies of 2 mer 96 % conserved" prim_transcript <1631..>11281 /note="match: multiple ESTs; match: AA312512 AA306020 AA482177 AA306031; match: AA251421 AA060462 AA309696 AA354600; match: AA207166 AA207167 AA213785 T28970; match: AA147375 AA482275 A251422 AA213452; match: AA613112 M78119 AA147345" gene 1634..10969 /gene="ADA" CDS join(<1634..1756,4204..4347,5119..5234,6474..6601, 7725..7796,7873..7974,8151..8215,9656..9785,10402..10504, 10956..10969) /gene="ADA" /note="match: M13792" /codon_start=2 /evidence=not_experimental /product="adenosine deaminase (ADA)" /db_xref="PID:e1246378" /db_xref="PID:g2813965" /translation="RRGIALPANTAEGLLNVIGMDKPLTLPDFLAKFDYYMPAIAGCR EAIKRIAYEFVEMKAKEGVVYVEVRYSPHLLANSKVEPIPWNQAEGDLTPDEVVALVG QGLQEGERDFGVKARSILCCMRHQPNWSPKVVELCKKYQQQTVVAIDLAGDETIPGSS LLPGHVQAYQEAVKSGIHRTVHAGEVGSAEVVKEAVDILKTERLGHGYHTLEDQALYN RLRQENMHFEICPWSSYLTGAWKPDTEHAVIRLKNDQANYSLNTDDPLIFKSTLDTDY QMTKRDMGFTEEEFKRLNINAAKSSFLPEDEKRELLDLLYKAYGMPPSASAGQNL" repeat_region 2260..2560 /note="AluSx repeat: matches 300..1 of consensus" repeat_region 2820..2930 /note="MIR repeat: matches 262..157 of consensus" repeat_region 3323..3621 /note="AluY repeat: matches 1..299 of consensus" repeat_region 5805..5931 /note="MIR repeat: matches 212..84 of consensus" repeat_region 6758..7057 /note="AluJo repeat: matches 1..302 of consensus" repeat_region 7345..7570 /note="MIR repeat: matches 4..262 of consensus" repeat_region 8334..8441 /note="MIR repeat: matches 172..60 of consensus" misc_feature 10970..11280 /note="match: STS G13625" prim_transcript <11766..>12521 /note="match: multiple ESTs; match: AA552387 AA447627 AA533578 AA127939; match: W68287 H62904 H62941 H82167 W01305; match: AA431470 AA150832 W25342 T91415 AA151069; match: W15525 W32158 T24529 T78160 W16436 N38747; match: AA448027 W04656 W39306 AA586530 AA614827; match: R56665 R27694 H02613 W39649 R65999 H02716; match: T77999 AA025302 AA412511 W32222 AA412634; match: AA431607 W68400 AA366243 W19216; match: R28198 AA494417; match: AA465200 AA057373" repeat_region 12905..13040 /note="AluSx repeat: matches 136..1 of consensus; incomplete repeat" repeat_region 14916..14984 /note="MIR repeat: matches 76..144 of consensus" repeat_region 16524..16618 /note="MIR2 repeat: matches 133..25 of consensus" repeat_region 17411..17638 /note="AluY repeat: matches 74..300 of consensus; incomplete repeat" repeat_region 18468..18543 /note="L1ME1 repeat: matches 431..507 of consensus" repeat_region 18591..18967 /note="L1MA6 repeat: matches 305..688 of consensus" repeat_region 18985..19282 /note="AluY repeat: matches 1..297 of consensus" repeat_region 19289..19449 /note="L1MA6 repeat: matches 698..865 of consensus" repeat_region 19450..19750 /note="AluSx repeat: matches 1..300 of consensus" repeat_region 19754..19929 /note="L1MA5A repeat: matches 860..1041 of consensus" repeat_region 20274..20301 /note="14 copies of 2 mer 100 % conserved" repeat_region 20318..20355 /note="19 copies of 2 mer 95 % conserved" repeat_region 21068..21127 /note="MIR2 repeat: matches 13..69 of consensus" repeat_region 21987..22282 /note="AluJb repeat: matches 299..1 of consensus" repeat_region 22808..23117 /note="AluY repeat: matches 1..298 of consensus" repeat_region 23163..23425 /note="AluSx repeat: matches 37..301 of consensus; incomplete repeat" repeat_region 23426..23487 /note="31 copies of 2 mer 81 % conserved" repeat_region 23758..23949 /note="MLT1A1 repeat: matches 363..171 of consensus" repeat_region 23982..24276 /note="AluSx repeat: matches 297..1 of consensus" repeat_region 24282..24410 /note="MLT1A1 repeat: matches 150..1 of consensus" repeat_region 24556..24661 /note="L1MB6 repeat: matches 118..12 of consensus" repeat_region 24885..25171 /note="AluJo repeat: matches 292..1 of consensus" repeat_region 25377..25679 /note="AluY repeat: matches 301..1 of consensus" repeat_region 26122..26230 /note="MIR repeat: matches 213..98 of consensus" repeat_region 27146..27194 /note="MIR repeat: matches 136..88 of consensus" repeat_region 27648..27756 /note="MIR2 repeat: matches 26..146 of consensus" repeat_region 27945..28252 /note="AluSx repeat: matches 300..1 of consensus" repeat_region 28279..28363 /note="MIR2 repeat: matches 146..56 of consensus" repeat_region 30106..30239 /note="AluJo repeat: matches 133..1 of consensus; incomplete repeat" repeat_region 31063..31361 /note="AluSc repeat: matches 299..1 of consensus" repeat_region 31378..31505 /note="MIR repeat: matches 237..96 of consensus" repeat_region 32367..32901 /note="MLT1G repeat: matches 1..512 of consensus" repeat_region 32722..32901 /note="MLT1F repeat: matches 366..541 of consensus" repeat_region 34994..35290 /note="AluSg repeat: matches 3..299 of consensus" repeat_region 36162..36324 /note="MIR repeat: matches 30..204 of consensus" repeat_region 36430..36585 /note="MIR repeat: matches 94..262 of consensus" repeat_region 36671..36781 /note="MIR repeat: matches 57..167 of consensus" repeat_region 36921..36986 /note="L1MA9 repeat: matches 1055..988 of consensus" repeat_region 36960..37404 /note="L1MA9 repeat: matches 983..530 of consensus" repeat_region 37406..37437 /note="16 copies of 2 mer 100 % conserved" repeat_region 37459..37747 /note="AluJo repeat: matches 293..1 of consensus" repeat_region 37755..37958 /note="L1 repeat: matches 4726..4524 of consensus" repeat_region 38552..38717 /note="MIR repeat: matches 61..233 of consensus" repeat_region 39075..39149 /note="MIR repeat: matches 158..85 of consensus" repeat_region 41241..41342 /note="L1ME3 repeat: matches 468..573 of consensus" repeat_region 41450..41612 /note="FRAM repeat: matches 164..1 of consensus" repeat_region 42147..42443 /note="AluSx repeat: matches 299..1 of consensus" prim_transcript 42518..42887 /note="match: 5' EST AA309905" repeat_region 44557..44700 /note="MIR repeat: matches 214..70 of consensus" repeat_region 45232..45394 /note="MIR repeat: matches 56..228 of consensus" repeat_region 45883..46083 /note="AluSg repeat: matches 102..298 of consensus; incomplete repeat" repeat_region 46486..46540 /note="MIR2 repeat: matches 111..56 of consensus" repeat_region 47067..47141 /note="MIR repeat: matches 155..84 of consensus" prim_transcript <48072..>48218 /note="match: 3' EST AA234180 clone 666789" repeat_region 48779..48872 /note="MIR repeat: matches 48..142 of consensus" repeat_region 49371..49517 /note="MIR repeat: matches 102..262 of consensus" repeat_region 49519..49680 /note="MIR repeat: matches 200..53 of consensus" repeat_region 50156..50455 /note="AluSx repeat: matches 1..300 of consensus" repeat_region 50562..50636 /note="MIR repeat: matches 113..190 of consensus" repeat_region 51173..51346 /note="MIR repeat: matches 227..52 of consensus" repeat_region 51776..51911 /note="AluJb repeat: matches 136..3 of consensus; incomplete repeat" repeat_region 51915..52030 /note="MIR repeat: matches 218..104 of consensus" repeat_region 52042..52283 /note="MSTB repeat: matches 425..175 of consensus" repeat_region 52284..52449 /note="AluSx repeat: matches 1..167 of consensus; incomplete repeat" repeat_region 52455..52744 /note="AluY repeat: matches 301..1 of consensus" repeat_region 52767..52920 /note="MSTB repeat: matches 184..31 of consensus" repeat_region 53370..53685 /note="AluSx repeat: matches 1..292 of consensus" repeat_region 54476..54603 /note="MIR2 repeat: matches 14..137 of consensus" misc_feature 54497..54785 /note="match: Z51767 STS containing (CA) repeat; D20S911" repeat_region 54604..54643 /note="20 copies of CA 100 % conserved; differs from Z51767" repeat_region 55356..55583 /note="AluSg repeat: matches 70..299 of consensus; incomplete repeat" repeat_region 55587..55747 /note="FRAM repeat: matches 1..162 of consensus" repeat_region 55934..56225 /note="AluSx repeat: matches 1..302 of consensus" repeat_region 56239..56436 /note="MER42c repeat: matches 1323..1528 of consensus" repeat_region 56993..57157 /note="MIR repeat: matches 262..84 of consensus" repeat_region 57253..57359 /note="MIR repeat: matches 207..90 of consensus" repeat_region 58498..58799 /note="AluSq repeat: matches 1..301 of consensus" repeat_region 59580..59851 /note="AluSx repeat: matches 276..1 of consensus; incomplete repeat" repeat_region 60152..60267 /note="MIR repeat: matches 237..113 of consensus" repeat_region 60658..60956 /note="AluJo repeat: matches 297..1 of consensus" repeat_region 61051..61197 /note="AluSq repeat: matches 155..301 of consensus; incomplete repeat" repeat_region 61703..61785 /note="L1MC2 repeat: matches 993..1075 of consensus" repeat_region 61799..62016 /note="L1ME3A repeat: matches 359..580 of consensus" repeat_region 62056..62330 /note="AluSg repeat: matches 2..292 of consensus" repeat_region 62644..62919 /note="AluJb repeat: matches 301..21 of consensus; incomplete repeat" repeat_region 63942..64238 /note="AluSx repeat: matches 300..2 of consensus" repeat_region 64742..65041 /note="AluSx repeat: matches 1..301 of consensus" repeat_region 66118..66230 /note="FLAM_A repeat: matches 19..133 of consensus" repeat_region 66974..67275 /note="AluJo repeat: matches 2..302 of consensus" repeat_region 67382..67476 /note="MLT1E repeat: matches 461..555 of consensus" repeat_region 67735..68024 /note="AluSx repeat: matches 301..3 of consensus" repeat_region 68183..68460 /note="AluJb repeat: matches 301..14 of consensus" repeat_region 68662..68795 /note="FLAM_A repeat: matches 133..1 of consensus" repeat_region 69049..69187 /note="FAM repeat: matches 24..169 of consensus" repeat_region 69456..69747 /note="AluSg repeat: matches 291..1 of consensus" repeat_region 70534..70879 /note="AluJb repeat: matches 302..2 of consensus" repeat_region 70887..71117 /note="L1ME3A repeat: matches 340..570 of consensus" repeat_region 71884..72049 /note="AluJb repeat: matches 133..301 of consensus; incomplete repeat" repeat_region 72448..72536 /note="MIR repeat: matches 117..206 of consensus" repeat_region 72960..73264 /note="AluY repeat: matches 301..1 of consensus" repeat_region 73265..73561 /note="AluSq repeat: matches 301..1 of consensus" prim_transcript <73567..73664 /note="match: 3' EST AA233804 clone 666277" repeat_region 74141..74440 /note="AluSx repeat: matches 1..300 of consensus" repeat_region 74665..74967 /note="AluSg repeat: matches 1..300 of consensus" prim_transcript complement(74972..75279) /note="match: 5' EST D78734 clone GEN-508C03" repeat_region 76422..76726 /note="AluSx repeat: matches 1..295 of consensus" repeat_region 77095..77204 /note="MIR2 repeat: matches 22..143 of consensus" repeat_region 78027..78169 /note="MIR repeat: matches 261..111 of consensus" repeat_region 79450..79740 /note="AluY repeat: matches 1..299 of consensus" repeat_region 79755..79943 /note="MER42c repeat: matches 1337..1538 of consensus" repeat_region 79998..80130 /note="MIR repeat: matches 70..198 of consensus" repeat_region 80156..80595 /note="HUMAR1 repeat: matches 859..1278 of consensus" repeat_region 80618..80693 /note="MIR repeat: matches 176..252 of consensus" repeat_region 80966..81260 /note="AluJb repeat: matches 295..1 of consensus" repeat_region 81899..81948 /note="25 copies of 2 mer 100 % conserved" repeat_region 82604..82908 /note="AluSx repeat: matches 302..1 of consensus" repeat_region 82947..83050 /note="AluSx repeat: matches 104..1 of consensus; incomplete repeat" repeat_region 83064..83155 /note="MIR repeat: matches 160..64 of consensus" repeat_region 83223..83303 /note="MIR repeat: matches 237..157 of consensus" repeat_region 83423..83535 /note="MER42c repeat: matches 1423..1532 of consensus" repeat_region 83949..84255 /note="AluSx repeat: matches 1..302 of consensus" repeat_region 84538..84835 /note="AluJo repeat: matches 298..3 of consensus" repeat_region 84980..85252 /note="AluJo repeat: matches 287..11 of consensus" repeat_region 85571..85869 /note="AluSx repeat: matches 4..302 of consensus" repeat_region 86719..87033 /note="AluSq repeat: matches 303..1 of consensus" repeat_region 87534..87770 /note="MIR repeat: matches 261..2 of consensus" repeat_region 87912..88220 /note="AluSx repeat: matches 2..302 of consensus" repeat_region 88537..88585 /note="MIR repeat: matches 142..85 of consensus" repeat_region 88815..89110 /note="AluSq repeat: matches 1..301 of consensus" repeat_region 89298..89517 /note="AluJo repeat: matches 84..302 of consensus; incomplete repeat" repeat_region 89922..90234 /note="AluSq repeat: matches 1..303 of consensus" repeat_region 90543..90846 /note="AluSx repeat: matches 1..301 of consensus" repeat_region 90981..91051 /note="MER30 repeat: matches 1..74 of consensus" repeat_region 91021..91083 /note="MER30 repeat: matches 106..163 of consensus" repeat_region 91085..91385 /note="AluSq repeat: matches 1..303 of consensus" repeat_region 91386..91471 /note="MER30 repeat: matches 145..230 of consensus" repeat_region 91568..91880 /note="AluSx repeat: matches 1..302 of consensus" repeat_region 91893..92004 /note="FLAM_A repeat: matches 12..123 of consensus" repeat_region 92447..92549 /note="MIR repeat: matches 111..216 of consensus" repeat_region 92918..93225 /note="AluSx repeat: matches 1..300 of consensus" repeat_region 93270..93401 /note="AluSx repeat: matches 1..134 of consensus; incomplete repeat" repeat_region 93402..93698 /note="AluSp repeat: matches 2..300 of consensus" repeat_region 93699..93875 /note="AluSg repeat: matches 124..300 of consensus; incomplete repeat" repeat_region 94523..94982 /note="L1MB2 repeat: matches 434..888 of consensus" repeat_region 95154..95444 /note="AluSq repeat: matches 291..1 of consensus" repeat_region 95864..96020 /note="MIR repeat: matches 12..198 of consensus" repeat_region 96035..96333 /note="AluSx repeat: matches 1..299 of consensus" repeat_region 97353..97469 /note="MIR2 repeat: matches 1..135 of consensus" repeat_region 97475..97598 /note="FLAM_A repeat: matches 1..124 of consensus" repeat_region 98346..98508 /note="MIR repeat: matches 262..82 of consensus" repeat_region 99251..99551 /note="AluSx repeat: matches 302..1 of consensus" repeat_region 99880..99992 /note="LTR8 repeat: matches 1..122 of consensus" repeat_region 100006..100282 /note="AluSp repeat: matches 301..1 of consensus" repeat_region 100464..100764 /note="AluSx repeat: matches 302..1 of consensus" repeat_region 100800..100944 /note="LTR8 repeat: matches 310..460 of consensus" unsure 100938..100953 /note="Single clone. Confirmed in cutoff." repeat_region 100956..101256 /note="AluY repeat: matches 301..2 of consensus" repeat_region 101258..101498 /note="LTR8 repeat: matches 446..687 of consensus" repeat_region 101499..101770 /note="AluJb repeat: matches 31..298 of consensus; incomplete repeat" repeat_region 101771..102063 /note="AluSc repeat: matches 2..293 of consensus" repeat_region 102200..102370 /note="LTR8 repeat: matches 691..521 of consensus" repeat_region 102373..102673 /note="AluSx repeat: matches 302..2 of consensus" repeat_region 102676..103198 /note="LTR8 repeat: matches 535..1 of consensus" repeat_region 103330..103627 /note="AluSc repeat: matches 1..299 of consensus" repeat_region 103958..104238 /note="AluSg repeat: matches 1..281 of consensus; incomplete repeat" repeat_region 104620..104921 /note="AluSx repeat: matches 1..302 of consensus" repeat_region 105023..105165 /note="MIR2 repeat: matches 2..146 of consensus" repeat_region 105186..105441 /note="AluSx repeat: matches 51..302 of consensus; incomplete repeat" repeat_region 105626..106037 /note="LTR7 repeat: matches 450..1 of consensus" prim_transcript <108718..>131601 /note="match: multiple ESTs; match: AA507396 AA276771 AA609529 AA576628; match: R00401 H23255 AA210244 AA024503; match: AA165221 AA165222 W40317 N73356 R27750; match: R26626 T77450 AA024588 R49654 AA382817; match: AA336819 R96269 AA449278 AA579384 H04517; match: T57899 T10513 F05586 R43240 AA410911; match: N39900 W02541 AA105354 H70958 N24773; match: AA546320 R21629 T05247 AA089600 W45135; match: W05814 T83705 T90195 W56740 N75748 W56705; match: AA373756 W46387 AA294859 C16634 H13790; match: AA053110 H13791 AA310613 AA064205 AA406106; match: AA154313 AA406107 N54825 N75077 AA053196; match: AA345382 W46470 T77236 AA506655 AA163787; match: H69020 H23366 N73500 N26252 N22535 N22536; match: N24471 R21525 AA201048 W52041 R17144 T83005; match: AA090019 H69782 AA089304 N75764 H99188; match: AA377450 AA376443 Z18911 D31033" mRNA join(108768..108890,116763..116924,117809..118002, 119435..119514,120775..120912,123807..123976, 125912..126002,126808..126988,129503..129730, 130369..130732) /gene="Diff3" /note="match: U49188 Q13530; 130508..130578 differs from Q13530" /product="placental protein Diff33" gene 108768..130732 /gene="Diff3" CDS join(108852..108890,116763..116924,117809..118002, 119435..119514,120775..120912,123807..123976, 125912..126002,126808..126988,129503..129730, 130369..130507) /gene="Diff3" /note="match: U49188 Q13530" /codon_start=1 /evidence=not_experimental /product="placental protein Diff33" /db_xref="PID:e1246379" /db_xref="PID:g2813966" /translation="MGAVLGVFSLASWVPCLCSGASCLLCSCCPNSKNSTVTRLIYAF ILLLSTVVSYIMQRKEMETYLKKIPGFCEGGFKIHEADINADKDCDVLVGYKAVYRIS FAMAIFFFVFSLLMFKVKTSKDLRAAVHNGFWFFKIAALIGIMVGSFYIPGGYFSSVW FVVGMIGAALFILIQLVLLVDFAHSWNESWVNRMEEGNPRLWYAALLSFTSAFYILSI ICVGLLYTYYTKPDGCTENKFFISINLILCVVASIISIHPKIQEHQPRSGLLQSSLIT LYTMYLTWSAMSNEPDRSCNPNLMSFITRITAPTLAPGNSTAVVPTPTPPSKSGSLLD SDNFIGLFVFVLCLLYSSIRTSTNSQVDKLTLSGSDSVILGDTTTSGASDEEDGQPRR AVDNEKEGVQYSYSLFHLMLCLASLYIMMTLTSWYSPDAKFQSMTSKWPAVWVKISSS WVCLLLYVWTLVAPLVLTSRDFS" repeat_region 109686..110019 /note="MER21B repeat: matches 5..342 of consensus" repeat_region 110042..110386 /note="L1PA16 repeat: matches 897..553 of consensus" repeat_region 110463..110652 /note="AluSq repeat: matches 262..55 of consensus; incomplete repeat" repeat_region 110656..110954 /note="MER21B repeat: matches 467..794 of consensus" repeat_region 110995..111171 /note="AluJo repeat: matches 301..120 of consensus; incomplete repeat" repeat_region 111428..111513 /note="AluY repeat: matches 92..1 of consensus; incomplete repeat" repeat_region 111817..112120 /note="AluSp repeat: matches 303..1 of consensus" repeat_region 112460..112483 /note="12 copies of 2 mer 96 % conserved" repeat_region 112487..112623 /note="AluSx repeat: matches 139..1 of consensus; incomplete repeat" repeat_region 113037..113467 /note="MLT1C repeat: matches 464..18 of consensus" repeat_region 113557..113704 /note="MIR repeat: matches 262..120 of consensus" repeat_region 113952..114111 /note="AluJo repeat: matches 134..300 of consensus; incomplete repeat" repeat_region 114112..114412 /note="AluSx repeat: matches 1..302 of consensus" repeat_region 114487..114781 /note="AluSg repeat: matches 295..1 of consensus" repeat_region 114849..115152 /note="AluSx repeat: matches 1..302 of consensus" unsure 115075..115108 /gene="Diff3" repeat_region 115159..115450 /note="L1MB5 repeat: matches 860..532 of consensus" repeat_region 115451..115731 /note="AluSx repeat: matches 281..1 of consensus; incomplete repeat" repeat_region 115733..115941 /note="L1MB5 repeat: matches 534..333 of consensus" repeat_region 116023..116324 /note="AluY repeat: matches 301..2 of consensus" repeat_region 117311..117599 /note="AluSg repeat: matches 287..1 of consensus" repeat_region 118470..118772 /note="AluSq repeat: matches 303..1 of consensus" repeat_region 118821..118970 /note="MIR repeat: matches 154..4 of consensus" repeat_region 119590..119891 /note="AluSx repeat: matches 1..302 of consensus" repeat_region 119992..120270 /note="AluSg repeat: matches 283..2 of consensus; incomplete repeat" repeat_region 121015..121271 /note="AluJo repeat: matches 42..297 of consensus; incomplete repeat" repeat_region 122289..122600 /note="AluJb repeat: matches 300..2 of consensus" repeat_region 122917..123214 /note="AluJo repeat: matches 300..1 of consensus" repeat_region 124199..124491 /note="AluSg repeat: matches 300..1 of consensus" unsure 124303..124325 /gene="Diff3" /note="Single clone. Confirmed in cutoff." repeat_region 124503..124776 /note="AluSq repeat: matches 302..9 of consensus" repeat_region 124903..125182 /note="AluSq repeat: matches 1..278 of consensus; incomplete repeat" repeat_region 125188..125480 /note="AluSq repeat: matches 6..303 of consensus" repeat_region 127149..127437 /note="AluSx repeat: matches 289..1 of consensus" repeat_region 127438..127898 /note="L1 repeat: matches 4736..4318 of consensus" repeat_region 127912..128186 /note="AluJo repeat: matches 6..296 of consensus" repeat_region 128199..128498 /note="AluJo repeat: matches 302..1 of consensus" repeat_region 128500..128799 /note="L1 repeat: matches 4313..4004 of consensus" prim_transcript <132057..>133372 /note="match: multiple ESTs; match: N49399 AA027958 AA027959 C02347 W95681; match: AA468575 AA558944 W95794 AA315410 AA025062; match: AA025063 AA151072 T93211 W43008 N80691; match: R33818 W45208 AA047733 AA456325 W07752; match: AA480293 AA047682 AA187821 W52566; match: N49488 AA090929" repeat_region 132211..132510 /note="AluY repeat: matches 298..1 of consensus" repeat_region 132676..132742 /note="MIR repeat: matches 13..84 of consensus" repeat_region 133946..133993 /note="HY3 repeat: matches 98..44 of consensus" repeat_region 133994..134290 /note="AluSq repeat: matches 299..2 of consensus" repeat_region 134580..134718 /note="MIR repeat: matches 175..28 of consensus" prim_transcript 135034..>135359 /note="match: 5' EST AA299344" prim_transcript <135703..>137579 /note="match: multiple ESTs; match: R96555 AA074974 R49379 N78787 AA426597; match: AA426598 H48102 H53254 H70625 H63515 H53255; match: N54141 H63916 H47752 N29427 R97112 R96510; match: N89810 AA436207 AA436208 N64089 N57359; match: R97064 AA353663 AA074871 N57848; match: R34993 W39066" repeat_region 135718..135784 /note="MIR repeat: matches 38..103 of consensus" repeat_region 136755..137051 /note="AluSx repeat: matches 293..1 of consensus" repeat_region 137502..137802 /note="AluSq repeat: matches 302..1 of consensus" prim_transcript 137726..>138175 /note="match: multiple ESTs; match: AA368224 AA325298" prim_transcript <139690..>140543 /note="match: multiple ESTs; match: AA010283 AA243976 H05940 H96407; match: AA249444 AA224902" repeat_region 139708..140006 /note="AluSg repeat: matches 300..2 of consensus" prim_transcript 140583..141214 /note="match: multiple ESTs; match: AA010225 T06328" repeat_region 141605..141640 /note="MIR2 repeat: matches 107..141 of consensus" repeat_region 142294..142401 /note="MIR2 repeat: matches 18..145 of consensus" repeat_region 142556..142622 /note="MER5A repeat: matches 188..121 of consensus" repeat_region 142736..143034 /note="AluSx repeat: matches 1..299 of consensus" repeat_region 143341..143511 /note="AluJo repeat: matches 291..134 of consensus; incomplete repeat" repeat_region 143512..143801 /note="AluSx repeat: matches 297..5 of consensus" repeat_region 144423..144721 /note="AluSx repeat: matches 1..299 of consensus" repeat_region 145671..145972 /note="AluSq repeat: matches 303..1 of consensus" repeat_region 146710..147002 /note="AluJb repeat: matches 1..298 of consensus" repeat_region 147003..147139 /note="MIR repeat: matches 24..169 of consensus" repeat_region 147647..147708 /note="AluSq repeat: matches 1..62 of consensus; incomplete repeat" BASE COUNT 41048 a 33019 c 32946 g 40695 t ORIGIN 1 gatccgccag ccttggcttc ccaaagtact gtgattacag gtatgagcca ctgcacccgg 61 cctcctattt ttctgcttct gctttgtgga taattggatg cttggacctc ctgatttaat 121 cttctaattt ccttaactgt ttactcctat ttttcatcat cttgtctttt tgttctactt 181 tgtggaggat ttcttcactt ttagcttcca gttcttttct tacatcgtga cagttgctgc 241 cgcattctct tgtaaatttc cgagggctcg ttcttgggtt ctgaatgttc cctcctttca 301 aggatcttct catctctttg aggatattca tgtctttttt gttttggttc ttaggttttc 361 atctgttctc tgtgctgttt cctcggagtg cttttgtcta ttctgttgtt ttgtccctca 421 tgttagaagc atttcttttt tttttctttt tttttttgtg atacagagtc ttgctctgtc 481 accaggctgg agtgcagtag catgatctcg gctcaccaca gcctctgact ccctggttca 541 agtgattctc ctgcctcagc ctcctgagta gctgggatta caggcacaca ccaccacacc 601 caactaattt ttgtattttt ggtagagacg gggtttcacc atgttggcca ggatagtctc 661 aatctcctga cctcatgatc ctcgcacctt ggcctgggag gccaaagtgc tgggattaca 721 ggcgtgagcc accatgccca gcctagaagc atttcttaat gtctggtgtt ctctggctgt 781 tgtatcttaa aaaaaaaagg ggggggaaac tgaggctcga ggtgaccttg tgagctggag 841 cagagccggg atgggatgag gaggcaggag cgtgtgcaga agagagggag cccccctgag 901 ctcgcaccct gcttcccgtg gctgggaggg gaggccgaga tgcttgggga gaaatggagg 961 ctccaagcca gaggggctgt ttccagcacg ctcttactga gcgctgctgt agtccagctt 1021 ggtgtggcgg ctgtgggcag ggaggggaga gaggtctgag ctggctggcg gcccactggg 1081 cccctcccct gagcctccac cggccctctc ccagtgcgct gggctgggca agcctctgat 1141 gtgccagcca gatggagggt gaagtcctga tgcctgcccc taccctggga attgtgatgc 1201 tgcagttact gcccctgata acccctgact gggcatagga ccagctggct gagccagctc 1261 ctggggctga ggaggaagcc atgaacttga cctggcactt tccttgtctc caagcatcag 1321 tcaaccaagg atatggaggg ggtgtgtgca tgtgtgcaca catacacaca cacacacaca 1381 cacacttcaa cctgtttatc ccccttgaga tttgctgact tgtgcattgg gggtagaagg 1441 tgctggaaaa attccggtcc tggttctcag tttccccatc tgtccagtgg gagcagctgg 1501 actgagagac gcccatgtct cctgctgtgg tcctgcaagg aggctggcgc tcctgagtct 1561 gctccatcct ggcctgtcag gcctgcctgg atcctgcccc gggttggtcc accactcact 1621 gttttgtttc caggaggaga gggatcgccc tcccagctaa cacagcagag gggctgctga 1681 acgtcattgg catggacaag ccgctcaccc ttccagactt cctggccaag tttgactact 1741 acatgcctgc tatcgcgtga gttgccccca acccacaggt cctagggcag cattgatccc 1801 tatgactagg accaggcctg tccctcagcc tgtgggggcc agagaagttg ctctgaaacc 1861 acagctgtct ttctcaccat tgtgtacact tagtgagtct ctccagtgcc tttaggcctc 1921 agttttccct tctgagatgt gggtgtgatg gactgaaatt gcttcaagtt ctacagagaa 1981 atggcagaat atgggagcta agaacacagg gtcagaggca gtgcagggct tgaacccggg 2041 ccatctatct cctagttcag ggcttcgtgt tgtgagggga ggagaggcct gaatataggg 2101 tgggggcggg gagatgtggg gaagattctc caaaaggctt tttctttttc ttgtcttgag 2161 tcgccaggga acagcactag gtaccgaaaa ggcccagaag gggtatgggc gagtactaga 2221 gagaaatttc catgactgct ttatttattt atttatttat ttatttattt atttattgag 2281 acagagtctc actctgttgc ccaggctgaa gtgcagtggt gcgatctcag ctcactgcaa 2341 cctccacctc ccagtttaag ggattctcct gctttagcct cccaagtagc tgggatcaca 2401 ggcacccacc atcacaccca actaatggtt ttgtattttt agtagagatg gggttttact 2461 atgtttgcca ggctggtctc gaattcctga cctcaggtga tctgcccgcc tcggcctccc 2521 aaaatgctgg gattacaggc gtgagccact gcgcctggcc tccatcctca tcctgaagat 2581 gcaagaactt ctggtgaccc cttctcctga gagtggcctg atctcccctg ggcagggcac 2641 tttcttccca cgctgggctc tcccagcact tgtgtgcctt ccctcacaca ttctagtaac 2701 cacttcattt tcactcttca tggtgggaac ttccagctaa gcacagtcca ccgttacgtg 2761 atcaacacag tggccctggc aggccaattt gtgccttgct tctggaacaa acatgcagta 2821 ataacaacga aaatgttttg agcatttgtc cgctctgctc caagcactga cccgggtggg 2881 gtttatgaag tttgactcat ttgtccccgc aataactcct tgacctaggt gtcagagggt 2941 gactaaccag gggtcacaca gcagataagt gtgggcacaa ggatccaagt ccatgactgt 3001 atcccacgtg tctcccacat ccaggcatcc ctctggactt gtccagctgt gtccttttct 3061 ctcatttctc ttccctgcca gccttaactc catcaccaac aaatattggg ctactctgtc 3121 ctaggcatgg tcctcagctg agaggtcgca gccatcccaa gacagagggg tccttgccac 3181 atggagactg cattctagta gggaatacag caaactggct gataagccat atgacacaca 3241 atgttgagta gtgataagga cctgggagaa aaagaaagcc caggagaatg gtggaggggc 3301 cgttttaaga taaggcggtc tgggccaggt acagtggctc acgcctgtat ccccagcact 3361 ttgggaggct gaggtgggcg gatcatgagg tcaggagatc gagaccatcc tggctaacac 3421 agcgaaacgc tgtctctact aaaaatacaa aaaattagcc gggcgtggtg gcatgcgcct 3481 gtaatcccag ctacttggga ggctgaggca gacgaatcac ttgaacccag gaggcagagg 3541 ctgcagtgag ctgagatggc gccactgcac tccagcctgg gcgacagagc aagattctgt 3601 ctcaaaaaaa aaaaaaaaag ataaggtggt cagggaaggc ctctctgagg aggtgaagct 3661 tcagctggct ctaaaccagg ggagcgggag agacgcagtg taggacagta tcggggaaga 3721 gcaggcctgt gtcttctccg gtggcctcag ggaatgaggg agaaggaagg tgctggggag 3781 gctggcaagg cctggaggat gcaggccttg tgggcaggac ctgggagttg cgatgtcact 3841 ctccgtggca ggaagctact ggggcttcga ggggagaagt gatatgcttt gatttacctt 3901 cttaaaagat tgccccaact gctgggtgga gaacaggatg acaggggcaa gcatggagac 3961 agggaggcca gttagagatg gcgtgattca ggccaggatg gaggggtgag aactggtatg 4021 cagttccaaa gtagagctga taggacttgc ccagtgtctg ggatcttatc cagtggatgc 4081 ccagagcttg ggtctggggg atgaagtggg tttaatctgc caagggttgg ggatgtcatt 4141 tgctcctgga gctcccaagg gacttgggga aggttgttcc caaccccttt cttcccttcc 4201 caggggctgc cgggaggcta tcaaaaggat cgcctatgag tttgtagaga tgaaggccaa 4261 agagggcgtg gtgtatgtgg aggtgcggta cagtccgcac ctgctggcca actccaaagt 4321 ggagccaatc ccctggaacc aggctgagtg agtgatgggc ctggaagggg ccatgctgag 4381 ggtgtggctg ggaggctcag ctctgagact ggaagggcga actgctggga atccctgacc 4441 caagcaagac cttgttcttg cccccagtct ggtccatggc ctcagaaaga tgggtttaac 4501 tctgtcacaa gagacgtggt tcccatcctc cctttgccgt tatgttctta ccttgggcac 4561 aagtgtttgg ctgtgtcttg ctctggccac aggcctgctg tccaggaatg ttaacctgct 4621 tagccaccca ggatttctga ggggtctccc ttgtcactga tgctgatcag atctctaaag 4681 gccctaaagg tcctgctcta acttcataac tgaagtgagt ctggcccatt tctagccccc 4741 tgcctgggcc cccatggatc tctaagtggt atcacaaaac caccctgccc cattttctga 4801 gccatgattc tgatacatat agaatgtgaa catcatggca ggcccaagct tagcaatgct 4861 gtccatctgg gggtggggag ggccatgttg acaccccaca cctcccacta agatctagga 4921 gcacccagct gctttaagag ctagagggac atgttagggc ctgggggcat ctctgccagt 4981 ctttcctctg aggcagtggg tcagtggggg aggagggtcc tccccaaagc ctcctcttcc 5041 tcctctgtcc cagtcccaga gctgcccttt aggccttcct tttgcctcag gcccatccct 5101 actcctctcc tcacacagag gggacctcac cccagacgag gtggtggccc tagtgggcca 5161 gggcctgcag gagggggagc gagacttcgg ggtcaaggcc cggtccatcc tgtgctgcat 5221 gcgccaccag cccagtgagt aggatcaccg ccctgcccag ggccgcccgt ctcaccctgg 5281 ccctgacctc ctggcctagc agtggggctg tacctgatct cccctgtgcc ccacagcccc 5341 atggtgtccc cttgagccca ctggcatgaa cttggggctt catgaaacaa ctggagacct 5401 cctaggcagg ctcagaactt ctggagatgt tctccccagg gacaccatgc ctttatagcc 5461 accctgcagg aagctcaaca ccaaatagga acgtaactat tgaaaaaaaa atctaggcta 5521 gattctgatc agcccatagt cctccctcga gacccagtgg accaggcccc atcctgtctg 5581 ggcctgaata ggtctgattt ccaagatttc tgaggggtct cccttgtcac tgacgcagat 5641 cagatctcta gagtttgtgc ctcatggtgc acagcctcac tgtgtgatat tgggcaggtc 5701 acactgctgc tctggttatg caccaagaca cctcagttgt gcactgtcac aaggagatga 5761 tcacacttac ttcattcctc taccctcagg attagtaaga accaaagagc tacctgcacg 5821 catttcctct aatcctcgca gcagcctgca aagcagaact accattgctt agtcccattt 5881 gacagatgag gaaactgagg tggagtgagg tgcagcctct tgcaaggcac aaaccctgga 5941 tttgtatccg gggacatcta gttccaaagc ctgtgttcat tcattctttc ttaaacactt 6001 cagaataact ttattggtta agagtaccta atacattagc gagatacttc ccaatactag 6061 tgtgagttct attttagatg acgtgttaaa cggtcctccg tttcctcatc tgcgcatggg 6121 aataagccta ccatgagtgt tgttggaaac accaggtgag agaagggtcc gtgtcattta 6181 ctgagctcag gccccgtcct tggtgcttta cacacatggc ctcggcaaag cctggccgtg 6241 accctgtgca atagctggca gggttctttc tgaaaagggc ggaaactgag gccataagca 6301 gagcagtttt ccgcaggcca tgtggttagg acatagcagt taggatttga agacactgag 6361 ccctgttttg tgctggcctc ccatgggggg tttgggtggg acagcaggca ggtaggctgg 6421 gaggtctctc catggtgctg gtgacagagc ctgggtgggc atctcgccca cagactggtc 6481 ccccaaggtg gtggagctgt gtaagaagta ccagcagcag accgtggtag ccattgacct 6541 ggctggagat gagaccatcc caggaagcag cctcttgcct ggacatgtcc aggcctacca 6601 ggtgggtcct gtgagaagga atggagaggc tggccctggg tgagcttgtc tcccacccat 6661 agttgggaga aatcacaaga accagggacc atggtgtctc ctgagttctg aagtgtgtct 6721 ttgttgggtc ttaaggcttg gaactggaat ccccctgggc caggcgtggt ggttcatgcc 6781 tgtgatccca gcactttggg aggcgaggca ggaggattgc ttgagcctag gagtttgaga 6841 ccagccaggg caacatagtg agatccatct ctgcaaatac aaaaaaaagt agtcaggcat 6901 ggtggtgcat gcctgtagtc ccagctactt gggaggctga ggtgggagaa ttgcttgagt 6961 ccaggaagtc aaagctgcag tgagctgtga taatgcgact gcactccagc ctgggtgaca 7021 gagggagacc ctgtctcaaa aaaaaaaaaa aggaagaaag aagaaagaga aaagaaagag 7081 aaagaaagag aggaaggaag gaaaaagagg aagggaggga gggaggaagg aaggaaagaa 7141 ggaaggaagg gagagagaaa gaaaagcctc cacttggtgt tgggagtcct gtgctgagcc 7201 tgcttctggc tgtgatttgc tgtgtgaacc tgggcaacac tgtgtcttct ctgggcctct 7261 gtttcttcta ttgggatgac tgagttggag ccgacatctc aaaagtcgct tccagcgtga 7321 tgatgaatgg gcctcctgtg gagggtgcag catggtggag aagtcagggc tctggagtcc 7381 cactgcccgg gctcagagct tggttccaca cttcctgtct gaccttggtc acattacttg 7441 aatctcctga gcttcagtcc ttcatcataa aatgggtggg ataatagttg tgaatattag 7501 ataatgtata caagtcactt catatactac ctgacacatg gtaactggct aatgagtgac 7561 agctaccact tagataagga cttggagggt aaaagaccag gtttccccat gctgttgaag 7621 caggcagcat gactaggatg gttcaatctc cacagcatgg tcaaggcagg gcctgccggg 7681 gccctcccgc tagggcaccc atgacctggc tctccccctt ccaggaggct gtgaagagcg 7741 gcattcaccg tactgtccac gccggggagg tgggctcggc cgaagtagta aaagaggtga 7801 gggcctgggc tggccatggg gtccctcctc actgcctcct cccatacttg gctctattct 7861 gcttctctac aggctgtgga catactcaag acagagcggc tgggacacgg ctaccacacc 7921 ctggaagacc aggcccttta taacaggctg cggcaggaaa acatgcactt cgaggtaagc 7981 gggccaggga gtggggagga accatccccg gctgtcccaa cttcctgtat agagaggcag 8041 aaagcagggc gggtcccagg aactcgaggg gtggccccag gcccagacat ggggggagga 8101 atcagcatgg cctggggcca tccctgccag ccacacacct gctcttccag atctgcccct 8161 ggtccagcta cctcactggt gcctggaagc cggacacgga gcatgcagtc attcggtgag 8221 ctctgttccc ctgggcctgt tcaattttgt tccaggaagg ccaaagaggg aagaaacttt 8281 agggattggg catcagccca tgccgcgtct tttagatatg aaatctcttc gacaccctgg 8341 gaagcaggca ttgccgtcct catcttacaa atgaggaatc cgaggcccag atgtgctgtg 8401 gcttgactgg gattacccag ctgctaacca gcagagctgg ggccctacag ctcatcagct 8461 ggagcagaac gctccattac tctgagggaa gcttccacac ttccaattct cccaactctg 8521 ccccctgggc atcgcatagg aagcaggagt ccctctggcc agcatgttct ctcttcctga 8581 cacctggccc ttgggacccc tgggcattcc cctgagcgcc atcttgaagc tttccaccgg 8641 agggtctgtt ccaccctgcc tggctcccat cctggagtct aaccagggtc aaggccctcc 8701 ttccgtcctg tcgccaagcc acaggagcag tatcaggcct taggaaaaag ccgccttccc 8761 caagacaagg acagcaagaa ctcagggtga ccatggtcag gccagcactt atccatctgc 8821 caggcatatg agaaggggag gggcttcggc tctgatgttc tgatgacaag ggggtcttgg 8881 ggcttgcctt agggacacgt ggcacctgtg gaggttcttg gaggcatgtg ggtataccat 8941 gggctggaaa aagatccagg agtcatctgc acagatatgg tggctgaagg agaagcagtg 9001 gccccaggag gtggtggagc aagaagggcc taggatagaa cccagaagga caatggtatt 9061 taagggacca gcaaaagaga caagtaggag gaaagtcaaa agtgtggtgt cacagaaatc 9121 cagggaaaag gtttcaagaa acagtcaaca gtgtgaaatt ctgctatgca agtcgattat 9181 ggtcagagct aggaaagatc cattagatac aacaagatgg tggtcaggga tcgtgccaag 9241 aacagcttcc atggtatgtt ggagtagcca gctcccagtg ggactgagga acaagcaggg 9301 tagggtgcag aggggaaggc tggagagggt ggcagccgga gggggatgtt gctttcttgg 9361 ctcccacccc cacgccccca ccggctgcca ttctgcctgg ttcccatgtc tggcccctct 9421 gctgcctttg cccagctctg gtcttcagga tgggctggat tctggacttt ctggttacat 9481 agacttgaac aagtcaccta agttctgaat ttatttcccc ctctgcacaa ggatcagatc 9541 tttcagatct gtttgaggct gctgtgagga tcaaaggcgg gtgaacgtca atgtgttctg 9601 actatttatg taagagtaaa aggaggctga ttctctcctc ctccctcttc tgcaggctca 9661 aaaatgacca ggctaactac tcgctcaaca cagatgaccc gctcatcttc aagtccaccc 9721 tggacactga ttaccagatg accaaacggg acatgggctt tactgaagag gagtttaaaa 9781 ggctggtgag tgggtgtgag ccatactggc cttgactcgg gtttgggagt atggtatcta 9841 caggtccagt ccggggcctg gaatctttgg agagagggag tgagtctgcc tcaacagtcc 9901 aagacaagcc caacctagac actttccaca gagaagacat ctttgtgttg acgtcctgac 9961 ctaggaccag gtttttgatc ctttgcttgg gttgagtgcc tttaaagaat ccagtgaaag 10021 ctgtcaaccc tctccccaga aaggtgtgtg cagcagctat gaagtcttgc acactctctt 10081 caggttgttc ttaaatccca ggctgaataa gtccattcct gcacgtgtct gcgaggtgtc 10141 tctggccccc tacatgccac cctgtctctc aaaggtttct ccaacttcct tctcacagcc 10201 ctttttcatg taatgacaaa ttaagaacac gacctcatgg tctctactct ggcacttgct 10261 gccgtgtgac agtggacaaa tccttccccc tctaagcgta tctgcccatg ttgagtgaag 10321 aggatggact atcactacat tgctaagagc tgccttcttt gttctctggt tccatgttgt 10381 ctgccattct ggcctttcca gaacatcaat gcggccaaat ctagtttcct cccagaagat 10441 gaaaagaggg agcttctcga cctgctctat aaagcctatg ggatgccacc ttcagcctct 10501 gcaggtaggt tcctgtctgg gcttctgggc agttgccctg tcctggcccc agtgtggctt 10561 tctgtgggac ttctagcaag atgcccttcc attcttgggc agcgccatga atgtgtgatg 10621 actccctggt ttctgggccc tggctgggag cagcgtctca ttagatcggt ttgttttcta 10681 taaaagttct tgagaggctg ttctaagggg agactttctg aagcccagtc ccaaaggtct 10741 gggcagttgg ggacacctcc atggctgccc aaagccaagg gcagggagag gggcccaggc 10801 ctgttctgct cctttcttcc tatgtggtct tggcaaggca tcttcttgcc atcataggaa 10861 ggagttcctt tctggttctg gtgttctatg atttttacaa catcctgggt actacaagtt 10921 gcctgatctt tttgcttctc tgaaccaacg agcagggcag aacctctgaa gacgccactc 10981 ctccaagcct tcaccctgtg gagtcacccc aactctgtgg ggctgagcaa catttttaca 11041 tttattcctt ccaagaagac catgatctca atagtcagtt actgatgctc ctgaacccta 11101 tgtgtccatt tctgcacaca cgtatacctc ggcatggccg cgtcacttct ctgattatgt 11161 gccctggcca gggaccagcg cccttgcaca tgggcatggt tgaatctgaa accctccttc 11221 tgtggcaact tgtactgaaa atctggtgct caataaagaa gcccatggct ggtggcatgc 11281 agcaggtggc atgtaatttg gtggtcttgg gcgggccgat gtgggcagga tgagcatgga 11341 gggagctggg tcagcctgct cagcagcagg gcctgagcct aagggtggct gtgaatgcca 11401 ggccagagat cccaatgctg tgggccaaga ggggtccaga ggctgtcctc cttccagaag 11461 aaataaggct tctctggttg ttgctcaaac attccctgaa ctctcagccc ctcctaactc 11521 taggttttaa ggagtaaagc ttccttttgg gttcctgaag ctggcagttg gggtgagagc 11581 agatgagatg gaagagggct catcagacac tggccttgga gggtgctggc ctctgcagaa 11641 cgccagcatc ttctcagaat cgtatgttct agaagcctgg ggcaagtccg gctaattgtg 11701 gacttgggga aaataaggcc caacccctgt ttttgcaagg ttaaggagaa ataatcttaa 11761 accagtcaca caaatcatcg gcatttattt cctgggtcct aggtgtcact tatcctggtg 11821 gacagggcag aggtggtcag atcgttttga gccaaaatcc cttccctaaa aatggatctg 11881 tggagctcca tgagggaacc tcagagatgc acaatgacag tttagctaaa atggcttaaa 11941 aaatgtgaat tgattgtcag ctctctccat atctgctgaa aaaaggttta aaatttttaa 12001 aaagtttaaa agtgttttct aaaaaaggga caagcaggtc tggacccaga agattgggct 12061 ggagaggagg tgttggtgtt gggacagggt ccctgcccgg gccgctcgcg agaggagcgt 12121 tcctgtgtgg gctgctcact gtagggcagg ccccgcccaa gtcccggcgc cagctcaata 12181 aataacatct tgtggttaca gtgtcttgaa agtgtacctg ggcacggtgc gaaggtggtg 12241 ggaatgtgaa gcctgtgggt gtgggcagag ggctttcccg gagtggaggc cacagaggcc 12301 tctggggagc cttctcagtc tggcctggtt tatctgggac atggagctca gagaagaggc 12361 tgctgggcca gtgccagggt tccccctctg ggaggggaca gaaggtctct cgtccagcct 12421 tcttggacaa ggtcagattc aagacgaggt ggtcccatcg ctgctctggg gctggttgcc 12481 agcttccttg tctggggcgc ttccctccac ctgtccttct ggagaggaga gaaagaggtt 12541 tttcttgggc acatatacac aaagtaatgc acagaaggag gtgtgaaagg gcccagccaa 12601 actgatagca gtggatatgc cctggggagc aagacgagct ggagctggac tgaggtggtg 12661 gtggcctatg gactttagcc tggtctgtaa catcttttta catggaaata cacatagtgt 12721 ttgttagtat aattaagagt caattaaggg ccattatgga tgggtgctta aagcatcaag 12781 tgaagggctc ttgaaggttg ctttgtttgt gcaggttccc cagagggaga ggccagatgt 12841 taagtcccca gtgtgtcaca ggtgctcagc aaatggcagc tgttctgttt gtttgttttt 12901 tttttaattt ttgtattttt agtagagacg gggtttcacc atgttggcca gactggtctc 12961 aaactcctga cctcaggtga tctgcctgcc tcagcctccc aaagtgctgg gattataggt 13021 gtgagccacc gtgcccggcc aatgttcttc taagagagta aagagcactg aaccaggcca 13081 gctcagggag cgccagggtg tggcctgcat cagagtctgt gggggccaac cgtgcatcat 13141 ggggtgtgtg ttgtacctcc ttctcaaatc catcctgtca gacatcttgg tggggtgagg 13201 ggcagcagca ggctcacaga gatggagcac ctcactgagg ggagtcagga tacaagccaa 13261 gtctcctcac tttcagcttc ccttgagccc ccctgtgcaa ggtgggacag gcagctcctg 13321 ccacccactg gaccagcagt gcaggttgag gggagtgcca aactcacacc caggccccag 13381 tctccatgct ttcctcctca ggctgcatga taaacaggtg tcttttcata aggtgcacca 13441 cgtggggcct ggcatgtggg cagagctcaa taggtggtac attcttaaac cccagtgaca 13501 ctgtgcaatg ttactaggtg ccctacaagc tcactgaggg cctgagttca aatccagccc 13561 aaccagacca caccttttta tccttcaggc taagcccttc acagaaccag gagaatcacc 13621 accctatagc aagctccagg gtctgcagat ggatgggcac tttctcgtgc aatgcacaca 13681 gtaacatgga ggtgtaacta gggtcatgat atgcatgaga aaaccgaggc ccagcgaggt 13741 gagctgattt gcctggatca agggtggaga gccctgcctg ccaccaaggc tagtacccct 13801 tctaccaaat ttaacctcct ccaataagca gcattcctga aggtttatag aggcctgaca 13861 tcagcactgc ctcactgtgc ccctgcggca agctaggaag tgggtggtag ctcgggaccc 13921 tattccgagc ccgctgcctg gaaagcctgc ttttattccc agcggggtgg cgcctggaac 13981 tctgtactcc ccctgccttg gggagtcgct ggagtgggcg tggcccgcag ccccgcctgt 14041 gaaacggaac tgctccctgc gggcttcctg gaaggctcct tggccagtca ctcttgcctg 14101 gacgccaccg tgagtgtttc ctttccttca gtgagcactg ttctttcctc aggccccctt 14161 tttaaatgtc agtcaccctg gcaggtgaat gtcatgtagc cagaggaccc atactttagg 14221 gaatctgagt cagggaaaac aggtgttagg agagcctggg cacatgggcc agcagctaca 14281 actgctgagg gagagagaga cagaacagag ctcaccagag ctgagggtga ggacacagaa 14341 gccttgagag agagcccagg gcagcgtgcc cagtgtggca gctggacatg ctggtcattc 14401 agtcggtcaa tgaacagcca ttgactcggt acctccatca tgtgccaaac accaccaggc 14461 accgagggca caagacacag agcctgtgct ggagaagctc agtctagagc agggccatta 14521 tgctcagtgg gctcatggag gcagcaggga gaagaggtgc ctccccggtc tgggaggggt 14581 actgcggtca gggaaggctt ctcagctgag cagatgcctg ggtctctaag aatgaagaga 14641 acccagccag gcaaggggag ggaggggaga agcaaacaga ggtgcagcgt gtgaaggccc 14701 acaggggctt ctgagcgatc aaagtacttg actatggctg gcgagaggat gagagttgat 14761 gctggagagg cgagcagggt gagcctgtgg aggttctgcc tgggccacgt gacagaggag 14821 aatgaataga ttatgggaca gggaaggctt ttggggtggg gttccgtcga cagggtgctg 14881 ggggctctct ctcaagccgg ccagattcta tgttaactcc ctgggtggcc cagagcaagt 14941 cctccaactt ctctgggcct cagtttgtcc atctgtaaca tgagagggct gggtccaagc 15001 cccggcaaag tcttctctgt tctgtacttt tgattcactt ctggccagtg aggggcaaga 15061 agactagggc tacccaggga ggggcagggc tgtggccctc cggacagccc acggggagac 15121 agttgcagcc tgatggctgt gctgtgagaa gcacacacgg cctctgtggc gctgtgagag 15181 caggagggaa aggcctcagc cgtgcagggg tcagggaagc ttcccaagga aattcccagg 15241 ctgactgtgg acttgagcca aggggttctc tgggggaatg gagggggtgg gggtgttcca 15301 gccaggactt gggtgggtgg atgctgtttc cttttgtcat tcggtctgtg gttaccaagc 15361 acctgctgtg tgccaaacag aatgcaaggc ctctgccctc ccggcgctca catttagacc 15421 cctgagacct tgtcagagag tgatgagggt catgtgattg gggaaggctg atctagcact 15481 cagagaaggc cccttggaga ggcttgactg agaaacagac caaaggtaga tgtagctgga 15541 gggtgtcagg gagaggagtg gcaccgggtg agacttgggc agggcccggc ctacagcagg 15601 actgccccag ggcaatgtga caagcttagc ttcactcagt gaagccactg ggggattttt 15661 tttttttggc agaagcatga tggacagaat ggtttaaatg gggtctgctg gctgctgcat 15721 agagctgaac aggaagggcg gagatgggtg tagggagccc agggaagccg gccgctaaag 15781 tcagccaggt gagagctgtg agtggccaga gccaggtata gctgtgggga tagcgagaag 15841 tggagggacc tgggctaggg ggcaggggga caggaagagt cccggatgcc tgtccagagg 15901 agtccttggg gcctgagact caaacatctt ccaacctaga gattgctgga ctctgccact 15961 gaccatgggg caggcctgac ttaaatgtgg tttcttctct aagaaagtat gggcagtcta 16021 ggacttggag gctagcccgg ggcccggcag agggcctctg gcatgcagta gtgccaagga 16081 cctgctggct ctaacctgcc ccctcgagtg ccagctcgcc catgtctcca gccagcttcc 16141 tcacgctcac agcctctgag tctccctgga tgtcagggac cgcattccga cggcctgtcc 16201 ggtcacagga gatgaagtcc gagtaggagg actcgacctc catcatgcct gtcgcatcgc 16261 tcctcaggcc tgtggggaca gaaggcaagg gggcagaggt aagtccagat gcattctgca 16321 gagaggacag ctccagttcc ccagcacagt ggcttctgtc aaaatgacag ttattgcctc 16381 ttattgaaaa caaagcaaaa agcacagaaa agtacaaaga acaaaacaag tcactcataa 16441 ttccacaccc agacaaccac catcaacatt ttaaagaaca tgggctcaga cttttcttta 16501 aataaaatta ttgacaaaca tgtacacaca tttattaata ccatgcgcca ggcactggca 16561 cagataccac catgaacaaa ctggccatgg tccctggcct cacagagctt accatctact 16621 acaggtggca taaagtcacc tcgtacctaa gtcatggcct tggtgctcaa ggcttcaagg 16681 gagaatgcag ggtgctggag tgatcactgg gacctgatca aggctggggg tcacaaagtg 16741 acattttagc tgagaaccaa ggaggacgag gcgtcatgca gatgaagagt gaggggaaga 16801 gcactacaga gaggggaaca gcatgtgtga aggctctgag ggagggaggc atgagggctt 16861 ggaaggacag aaagagccag caggcgggag catggcagaa gagggacaat gacaccaagg 16921 acactgctga ggtaggcagg agccagacca catctggaac ttggttcctg aggggcttta 16981 ggcaaggcca tgaagtagca gccgtgcatt ttcaaaggtg actgtggcgg ccgggcacag 17041 cagtgctggg aaggggcaag ttaggaggct cctgtgaagt tcaggtgcca agatggtggg 17101 tggaggaggg gggcagtggg aaggattcta gatataactt ggaaatcgaa ccaacaagaa 17161 gtagtgttgg attgacaggt aatttaggtt tctggtgtga ataagtaggt ggacagaagt 17221 gtggtttgct gacatatgga agaacaaagg aggaatattt gagggcataa agtcctgttt 17281 ggcctattcg gtaactttga gatgcctgtg agccctccaa ggggagaggt aggctaggta 17341 catggaacta ggaggctgga gctcagaaga gaggtggagt ggggaggaaa gatatgaaat 17401 tagtagttct agatcaagac catcctggct aacacagtga aatcccatct ctactaaaaa 17461 tacaaaaaaa ttagccgggc gtggtggcgg gcgcctgtag ttccagctac tcaggaggct 17521 gaggcaggag aatggcatga acccgggagg cagaacttgc agtgagccga gattgtgcca 17581 ctgcactcca gcctgggcga caaagcgaga ctccatctcg aaaaaaagaa aaaataaatt 17641 agtagttctt actacagaga cagcagttaa cagccctggt gcttggtaac acagtctaga 17701 gaaaggagag aagagtgcca catatataaa gaagagccac atacatttag aaatgatttg 17761 ggaccatacc ttatgttgtg ttttgtagcc tactctttgg gacgctttag aacatgtcac 17821 taagtctccc tccacatgaa ttttgtgcca gcatgcttgg atgggccatg atttactcta 17881 ggatgaggcc tccagaaatg aacacagagg tacttccctg ttttctaccg ccacacacag 17941 cactgggact ttatgaaagt ctctgtgccc atccctaatt gtttcctgag agtatatccc 18001 tagaagtaga actgctgggc taaagggaag ggtgaaataa tttttagatt tttgacacat 18061 ttcccagttg ccttcccaaa tagcacagtg gtattcatga tgacaaaaaa ttggcagaaa 18121 gacttataaa gtcagtgctg tttttattta tattcccacc agtagcacac aagattgtcc 18181 atttatgtac tcttggcaac aagaagtact gctactatat taacaaaagg ttaatttgcg 18241 aaggggatac tagtatctct cattgagcac ctatggtttt ctgaaggcaa ttagatagca 18301 gtcattgcat ggactctggc tacctcgcag acctgaatta gtaagtggga ggggtggtct 18361 aagcactggc ctgactttag gagttcatag tccagtgggg tgggcccatg ggcccaccca 18421 gcaaatccta gttctagacc cactcaggca aatgcccaat gttttctcta gtgtgaggat 18481 gtttactgca cctggtcagt aatagcagcc tataggaaac cacccaaatg tccaccagga 18541 ggttaggcat cgcgtgaatt atgaagtagt cataaaaaag aaatgtgcca tacagccact 18601 atggaaaaca gtatggaggg ttcctgagaa aattaaaaac agtaccatat gacccaacaa 18661 tcccactact aggtatatat ccaaagaact gacattggta tgtcaagaag acatctgacc 18721 tcctatgttt actgcagcac tatttataat agccaagata tggaatcaac ccaaatgtcc 18781 aacatgattg aatggataca gaaaatgtat atatacacaa aggaatacta ttcagccata 18841 aaaaagaatg aaatcttgtt atttgggaca aaacggataa acctggagga cataatgtta 18901 agtgaaataa gccagacaca gaaacacaaa tactgcatga ttttacgcat acatggaatc 18961 ttaaaaaaaa aaaaatttgt tatcggctag gcacagtggc tcacacctgt aatcccagta 19021 ctttgggagg ccgaggcggg tggatcacta ggtcaggaga tcgagaccat cctggccaac 19081 atggtgaaac cccctctcta ctaaaaaaac aaacaaaaat tagccaggcg tggtggcagg 19141 cgcctgtagt cccagctact caggaggctg aggcaggaga atggcgtgaa cccaggaggc 19201 ggagcttgca gtgagccaag atcatgacac tgcactctag cctaggtgac agagcaaaac 19261 tctgtcaaaa aaaaaaaaaa aagttgatat agaaggagag aatacaacag tggttaccag 19321 agactgggga ggggagagga gaggaaagga tggaaagctg gtcaacgggt acatagttac 19381 aattagatag gagcagtaag ttcctgttct attgtacagt atggtgatgg ttaacaataa 19441 ggcattctag gccgggtgcg gtggctcatg cctataatcc cagcactttg ggaggccgag 19501 gcgggtagtt cacctgaggt cagaagttcg agactgggct ggccaacatc gtgaaacgtc 19561 gcctctacta aaaatacaaa attagtcggg tgtggtagca cacgcctgta gtctcagtta 19621 ctctggaggc tgagacagga gaatctcttg aacccatgag gcggagactg ccgtgagctg 19681 agatcacgcc actgcactcc accctgggca agacagagcg agactccgtc tcaaaaaaaa 19741 aaaaaaaaaa ggcattctat attacaaaat aactacagaa gcttttgaat attctcatca 19801 caaagaaata aatgcatgag atgatggata tgctaactgc cctgatctga ttattataca 19861 acatatatgt atcaaaacat caagttgcac cccacaaata tatacaatga catgtcagta 19921 aaaaaaaaat aataaagaaa tgtgccagat gggcctgtac tgacatagga caatgtctga 19981 gataaactgg caactgaaaa acagaggttg caaaacaaca cctaaaataa aacccagttc 20041 ctgtggggaa agatagcaga catatgaaaa aaaaatctga aaaaacatct atcaaccctg 20101 tcaactgtgg ttctttgtga ttaaggggaa ctttcgtttt ctaagtctat attttttcaa 20161 tggttgaagt tttataaacc caaattattg tgcaatgctt ttagaaggta gctgacattg 20221 gcagaagtgc ccagaaggca cccagggtgg ggttgagcca gtaagtgctg caggtgtgtg 20281 tgtgtgtgtg tgtgtgtgtg tgagagagag agagagaaga gagagagaga gagagagaga 20341 aagagagaga cagagaagag gatcagtatg gagggcaagg cccacatccc cccacaacac 20401 ctagctcaga agaaacacct aaagaatgaa ttaggggccc tcaaagatca gacaggtgag 20461 acagggaggt gtcctatggt agaaagagca aagaatgaga cacgtgacct taaacccaag 20521 ctctgccacc taaccctggg caagaccctg cctttcttgg ggccttcatt tcccctgtgt 20581 aaaatagagg agggccagga cactgtccac ggcaattccc cccggctaca gcagatggcg 20641 caccctgcct atcttcctag cagccttgcg cgctgcccta gtcacgctct ggctcctcct 20701 tggctaaaag aattcccgga agttccagag gtggctcctt ctacaggtca cggaggttca 20761 ctcctgcgct ctgcctctgg ggaaaccgct caatgcttca ggacaggaaa atggcttatg 20821 ataataacag taataacaga ggataaaaat gatgctgaca attcctacga taaggaaagg 20881 taccatggcc caggtgcctg tgttttcata cacatcacgc actcagtggc cacagctgcc 20941 ctctgaggtg agtgaggccg gtttacagcc acggctcaga agggcctcag ccccacctga 21001 aacccgctgc tctggaaaat gacaaacccc gagcaggcca cactggctcc cggagcctcc 21061 gagtcctcgg ctcttcctca gttagactgt cagctccacg ggaacaggaa tttggtctgg 21121 atttttccca tgtccagctg aaacctaatc ctggcagctg agccaacagg ttctggccgc 21181 tgcattccct gattctcatc cagcacctgg taaagagcag gcgctccatg cacactcgct 21241 ggatttcact ctcgggcctc tgaaggagag ggagtttgtg tctatccctc ttctcccctg 21301 ctgcccgcct gtcctctgtg gggaggagcc gcagttcctt cttggggctg ttgagaaggg 21361 cctccgtgca ccgtggagtg ctgtgcaggt gttcttgtcc taggacctga caccatggga 21421 tggaattaca ggctctatct gaaaggccag cttcatgctg gaatgtctcc tggggctttg 21481 atctcacaat gccatctgat gttacaggct gcccagtgca ggccactggc cctcgggaca 21541 cagagacagc aaatgccaca ctcaaagacc aatcctagaa aacatcagct gggccaacat 21601 gctctggcta ccatatcccc tcagggaaca gcaattccaa agacggctct ggggagctct 21661 ccaatcgaaa gcagaaacac atcgttttat ttttctccca tatttatatt cgtcactcac 21721 cctctcatct ttagagtcaa cagcaaatgg ggaaataaag aatcctcaga gttggaaatc 21781 ctgtcagaaa atggtcattc tcaaaccctg ttaatgagac tgtaaacttc caagacctcc 21841 ttgggcagtg acttgacagt gtgtatccag aaacctcaac tttttccata tcttttagct 21901 actaaatcag tttataataa tctattctaa gaaaataatc caagatctac atgaagataa 21961 aatgtacaaa gatgttcctc atagtatttt tttttttttt tttgagacag ggtctcgctg 22021 tgtcacacag gctggagtgc agtggtacaa tcatggctca ctgtagcctc aacttcctgg 22081 gctcaagcaa tcctgttgcc tcagcctcct gagtagctgg gactattcca tgtctggctc 22141 attttttttc agggatggga atctcattat gttaaccaag gtggtctcaa actcgtggtc 22201 tcaaactcct gggctcaagt gatcctcctg cctcggactt ccaaagtgtt gggattacag 22261 gcctaagcca ctgcgcctgg cccacagtat tatttataat agcaggaaac taaagacacc 22321 aaaatgagtt cctaaatgga tgtgtcacat gtcctaaacg gacaatggag aattattaat 22381 atatacatac ttaactaatt tgtatctaaa tgaaatacca ttcaggcatt aaaatgatgt 22441 ttgtgaatat tttttaaaga aatggcaaaa tgttcacata tagttaaatg aaaaagaata 22501 ccagtgatat aattacatac attaatattt ttataagtaa ttttaaaatc tgttatattt 22561 cctttttata gtgaacacat tcacttcatc ttaaaattac ttaattttag attatgtgtg 22621 catgtggtct aaaactcaaa aggtccaaaa ggacagagat aagagcttag tctcccttcc 22681 catccgatcc agacaccctc cccaaggcaa gcacgttttg gtgtccttcc agacaaaatt 22741 tatgcaatat atatgttccc cctttttctc cttcccttcc tttttccttt atttaaaaca 22801 aaattttggc cgggcatggt ggctcatgcc tgtaatccca gaactttggg aggccgaggc 22861 ggcggggggt gggtgtggat cacgaggtca agagatcgag accatcctgg ccaacatggc 22921 gaaaccccgt ctctactaaa aatgcaaaaa attagctggg cgtggtagtg ggcgcctgta 22981 gtcccagcta ctcgggaggc tgaggcagga gaatggcgtg aacccaggag gtggagcttg 23041 cagtgagccg agatcgtgcc actgcactcc agcctgggtg acagagtgag actccgtctc 23101 aaaaaaaaaa aaaaaaattt attttccatc ttgccttttt ttcatttaaa aatgtgtctt 23161 ggctttggga ggctgaggcg ggtggatcac ctgaggtcag gagttcaaga ctagcctggc 23221 caacatggtg aaaccccatc tctactaaaa atacaaaaat tagctgggtg tggtggtgca 23281 cgcctgtagt cccagctact caggaggctg aggcaggaga attgctcgaa cccgagaggc 23341 agaggctgca gtgagccggg attgcgccac tgcactccag cctgggtgac agtgagactc 23401 tatttcaaaa aaaaaaaaaa aaaaatatat atatatatat atacacacac acacacatat 23461 acacacacaa acacacacac acacacacct tggaaacaac tgtagatcac tctgaagcca 23521 ctgcagcact gattacatca acacactgaa aacaaaccca aagtaataac taaattgttc 23581 cttttattgt ctgcatgata ctctgttgtc tcactgttgt agattgattt aattacttcc 23641 ctgttgatgg acctgaggat tgtttctata aaagtggaat tgccacctca aaggatctgt 23701 gactttttta ttataacagg tattgccaac ttctccccac agaagttagt tcctgtctct 23761 cagtccgttt tgtgctgcta caacaaaata tctgagtctc agtaatttct aaagaacaga 23821 aatttatttc acacagttct ggaggctggg aagtccaaga tcaaggcgcc agcctgtggt 23881 gaggcctgct ctctgcttcc aagagggcac cttgaacact gtgttctcac agagcaaatg 23941 gcagaaaggc aaaaaggggc aaactcactc cctcaagccc atttatttat ttatttgatg 24001 gagtcttgtt ctgtcgccca tgctggaatg cagtggcatg gtctcggctc actgcaacct 24061 cctcctccca gattcaagtg attttcctgc ctcagcctcc caagtagctg ggcttacagg 24121 tgtgcaccac cacgcccagc taatttttat atttttggga gagatgaggt ttcaccatgt 24181 tggccaggct ggtcttgaac tcctgacctc gggtgatcca gccacctcag cctcccaaag 24241 tgctgagatt acaagcgtga gccaccatgc ctggcctcaa gcccttttat aaaaggtgcc 24301 taatcccatt cacaaggaag gagccctcat gacctaatca cctcttaaag gccctacctc 24361 ttaatattat cacattggta ttttggaggg aacacattca aaccatagca atccccagta 24421 ggtatatcag atgagcactt tccttcacca cagtcttctt aatgctgttt tgttcctaat 24481 ctaaaaggtg gttttaaaac cagtgcctga ttgttttagt tcctactgct cctaggagtg 24541 aacctgagca tcttcccact tgcatctcct tttctgttta ctgtttcttc aaattctttt 24601 gcccacttct aatggatttt aggtcttttt cttattgatt tgtaggagtt cttaatatat 24661 taaggcaatt agtcttttgt ctaattagga gttactagta ttttttactt gtcatttgta 24721 ctttgacttt atttttcggc acttctcccc acatacatat ttttatattc attagtaaaa 24781 ttgatcttta ttttatttta tggtttctgg gctttgtaac ataattagaa atgtcttccc 24841 cactctaaga ttagtaaaga aatccctttg tttatttctg gtactttgtt ttttgagaca 24901 gggtctggag tcacccaggc tggagtgtag cggtgccagc ctagctcaca gcaaccttga 24961 actcctgggc tcaagagatc ctcctagctt agcttcctga gaagctggga ctacaggtgc 25021 atgctaatgt gcccagctaa ttcttttttt tttaagagat ggggtctcac tatgttgccc 25081 aggctggttt tgaactcctg ggctcaagtg atactcctgc ctgagcctcc caaactgctg 25141 ggattacagg tgtgagccac cacgcctggc ctctctagta cttttatggc ttcccttttc 25201 acatttaaat ctttgatcta gaatttatct gtaataaggt atgaattagc gagccacctt 25261 aattttttct acctatctca aatcatttgt ggagtaatcc atcatttcct cattgatgta 25321 aaaagctatc attgttacat acaaaactgt gtattggggt ctgtttctga atcttttttt 25381 tttttttttt ttttttgaga cggagtctca ctctgtcgcc caggctggag tgcagtggtg 25441 caatctggac tcactgcaag ctccgcctcc tgggttcacg ccattctcct gcctcagcct 25501 cctgagtagc tgggactcag gcgcctgcta ccacacccgg ctaatttttt gtattttgtt 25561 cagtagagac ggggtttcac cgtgtcagcc aggatggtct cgatctcctg gcctcgtgat 25621 ccgcccgtct cggcctccca aagtgctggg attacaggcg tgagccaccg cacctggcct 25681 gtttctgaat cttttaatct ctccagctga tctgtctact catatactag tatcatgttg 25741 taggatggtt tccttctcat agctcaaatc ttttttgact ttagattttt tttttctggc 25801 tgtttttgct gatctatgtt ttcttataaa cttaatagtc atttaacttg tcccccaatt 25861 tatcttgtat ttatcaggac tgcattaaat atatatagta atatctggta cctgaatgta 25921 gatagaaatc acatttttat gatattgagt cttctttctt tcacaattcc tctcccacaa 25981 ggtcaggcac tcagacttcc cttttgctcc tctgtgtact ccaaagggct ggcccagctg 26041 cctggcatat gctggtgctt gactcttact tacggaatga atgatgaatt attgcatgag 26101 ttctacttct gatatgcaag gtaagtgcct tccacacatt ctctcatgta accctaaaca 26161 caacgagtag gcacgattct aatcctcatt ttagaaatga ggaaactaag gctcaggcag 26221 taaagcaact gccttcaact tctcagaatg ctcttttaca ttctcctagt ccaacacctt 26281 gactacttgt atatctcctt aattccatcc tttactctct tctaatgttt ctctaattgt 26341 tctacacgtc tcgttctctc tcatgacatc atcttccatt catttcaatg tcgatcccct 26401 cattgctgag ctgcagacat ctagatgaat cctcccggag gatatcaaca aaattatgct 26461 ctccctcctc caggactctc tctcattcat tcccttagtc aggagccacc tccacaatca 26521 attatgcaag tcaaaaacct attacccgta actcatgcct gacttcttcc tctctttaag 26581 aaggcatcct gcaccttaaa gctcaattac ccaggttcaa ttattttctc tcttcaggtc 26641 catcttccca ttttctccat gcagtaatca tggcctcagt tacgcttcat catctctatc 26701 cagtctacta caaccttccc caactggctt ttctgccccc agtcttctca cttattcctc 26761 accaccccca cccccaattc atcctcaagc cctctgtcct ctacttcaag agggacattc 26821 cccaacatca gttggctcat cctcatgctc agttcagaaa ccttcaatga ttaaaaagaa 26881 tcaggccaat ccatttatgt atattgacac ggaaggatgt tctcaatata tgattactaa 26941 tatgccctaa cacatgggtc tacagtcttt tctctcctga gttctaaacc tttacttcca 27001 acagcccatg acctatctcc agcagggaga cagctttgat tcattcagga ccatcacagg 27061 gcaaatacct gttcaattaa tgaaaagttg ttgtagaatc taaccgaaag acatcccaga 27121 gctcatctag acaagcctct catcaacagc tgcggaaacg gaggctcaga aagggcaagg 27181 cacttgccca gggttcttga tctagcggtc ctaactcccg gtccagagct cttttcccca 27241 catcacactg cccccttggg agtccctgat gacattacaa atctctgagg ggaatgggtg 27301 gcagggtaga aggcatctag ggtgctcgtc cagcccagct ttgttgttta ccagctgtgg 27361 gccgtgcaca actcccctca cctcagcaaa gcggggctaa cagtcacacc ctgcagttca 27421 gggggctgtt atgaagattc aataagatgg tgatgcaaaa agtgtagtaa gtgatgggga 27481 atgtaagtgg aatgcaatta ataaatgatt aggatggttc atttgggggc tgagttagta 27541 aaacctgcct ttggacattt cattcagctc ctttgttttt actctgttta acggtgctac 27601 agaagattaa aacttccttc tttggtgctt tatggtgttg tgtggtaaga atataaattc 27661 cccagaagtg gtgaccctgg ctggtgtgtt tactgttacg tccagaaggc cctgcacata 27721 cttgaagctc aatgaatagc tgatgaacaa gtgaataata gaagatgttc tagattacag 27781 ggaaagaaga gcaaagaatg tgcggccctg catcataaca gtggctgaag atctggttca 27841 atacccgcgc aatagggtga cccttgatga gtccccttct ctctcagctt cctttcccac 27901 catctaaata atgaagtttg aaatatgatc tccaagtctt ccagtctttt tttttttttt 27961 ttgagatgga gtcttactct gtcgcccagg ttggagtgca gtggcatgat ctctgctcac 28021 tgcaacctcc acctccccgg ttcaagtgct tttcctgcct cagcctccca agtagctggg 28081 attacaggca tgaaccaaca tacccggcta attttttttt ttttgtattt ttaatagaga 28141 tggggtttca ccatgttggc caggttaatc ttgaactcct gacctcaagt gatccacctg 28201 cctcggcctc ccaaagtgct gggattacag gcatgagcca ccgcacctgg cctccaagtc 28261 ttccagtctt gatatttcat ttattcattc aatgtactga gcaacttcta catgccaggc 28321 actgtgccgg gcatagggta tatactggaa actaaaaaag atatttacaa tgtgtaaaga 28381 gaaaaaggca ctgcacaagt aaatgaatct tatcacagat tatggtatgt gctatgggag 28441 aagaggagag ggttctctgc atgtggatca tgcagttaat ggctgtatga tcaggaaagg 28501 cctctctgag gaggtggttt ttgctgaaat ctacagaaaa gttggataga gagggaccca 28561 gagagccaac aggcaatttg agtctaaaaa caatctagaa tcttctagct cagcctgagc 28621 cctacagggt gctaaagaat gaagctcttt cttttcctca tcctcaggag gttttggtat 28681 cttgtgggat ctggatgatt gccagacatg tgattgacct tgaatgccca ggacaaaggg 28741 tgaagccctt atactctgtc cccaggcctt ttggggccct ggggtcagag gttacagtgc 28801 ctgcaggtgc ctgctctggg catttcccca caggaagcca cttctcccgt gggtttctaa 28861 tactcccccc agctttgtaa actgaagagg ccccaggctc tgccaagagc tacatggtct 28921 gttccaagtc tgcagctatt agtcctcatt gctcaggcag attccattac aagctccaaa 28981 ttgttttcca aatatactag aaaggtcatt tcctgttttt ttttcttttt caaaaattca 29041 gtatttgcag aaaccataca aaaagcaaac agacttttgc agaagacatg aagcacagaa 29101 gtgggacctg gtcttatggg agaagtttta tcaggggtgg gcaaaatgct ctctaaggct 29161 tctctgaggg acgaaagggc cgtggaaaca gaaatgccag ggcctactga acaggcattc 29221 atcaaatttc aagggtagcg gcttctctct gtgatagaaa gccagaaaag agaggcactg 29281 aggccaacag aaaactcaga gccccaaaat gagaatattc caagaacctg ggaaaaggga 29341 gggctgccgg ttctatccat gtgtggcctg cctggggctg gctgcaaagc cctacccagc 29401 gccttggggc acatccagca caaggtcacc agagaggagc ctgtcgcctg cctgggcagt 29461 gcctgttgcc attcaaagag tggccctggg tgaggctttg tttggaggca gagaggccgg 29521 gccttggcct ctgcacctgc caggcctaga cccaagcagg cctggatgca ggcagtgata 29581 ggaagggaag gtcagggtgc tcaagtcaga caaggtcctg agtgacttct acggtgccta 29641 tccccaggga cacctttctg tcctcctctt ccaactcatt tcttcccgga ccattctctt 29701 catgggactt ccagacacca caccctttgg gcctccctcc aaccagtctg cacaggcacc 29761 tgctactgcc gggacctttg tgttaggacg ccagtctctt ctgtttgctc ccgaggggct 29821 cttatccttt cccgaaataa tctatgttgg tgaccttcaa agagctgcaa ccctctctta 29881 agagttctag atttctgact cagcctattg agcgtgtcat ttggctatcc cacagcactg 29941 caagcttcag acgtcaagtc accagcctgc tcgccacggc cttcccaact gagcaagggc 30001 accaccatct gcccagctgg aaacccagga ccatccctga cctcgcttct tcttcaccac 30061 ccatggacaa ttctttaccc agtcctgtca tttcttcttc tttttttttt taatgttttt 30121 gtagagacag ggtcttacga tgctgcccag cctagtctta aattcctggc ctcaagcaat 30181 cctcccacct tggcctccca aagagctggg attacaggca tgagcaacag tgcccagcct 30241 ggtccttttg tttctacctg caaaacagag cttgaatctg ctgacctctc tcctccacat 30301 gcaccaccca ggctccggaa tgtctcacca ggactctgag cagagaccct gaccagagtc 30361 ctgtgtctct ctcaccccta gcccagggtc tattttttat atggcaggca gggtaacgtt 30421 ctcaaaatgg aaattggatc actggaattt ccctgcttcg ttcaacttct gatgcttttc 30481 tatttcacat ctagtgaaat ccactctcct gatgccagtc cctgaagccc tgcattgacc 30541 tggccctgcc tgcctctgca gatgagcctt tgtctcattt ctactctctt cttggtccat 30601 tttgctctgg tcacactggg cctctcttta aatcctcaaa cctcatttaa ttcctgacct 30661 caaagccttt actagactat ttcctctgtg tacaacattc tctgtaaccc tgctccctat 30721 agagcctcct aagtggataa atggattcct ggccttggtt tttcaaatgt tcttgctcta 30781 ggacctgaca tggggctcct gtagggcaca aagcctcctg agttcctctg cccccaacta 30841 ttgcatgtca ggaaagccct ggcagagctg cccggagggt aatgtggtgt ggtggatgcc 30901 acaaggactt cagagtctca gctgctgtcc ctacagtggc acatgatgct gggcacctca 30961 cctaactgca gacacccctt ctataaaatg ggagggctca cacttgtctt aaaaagtggc 31021 tgtggggatt aattggagcc aacagttata cagttttttt gttttttgtt ttttgttttt 31081 ttgagacaga atctcgctct gtgactaggc tggagtgcag cggcgcgatc ttggctcact 31141 gcaacctctg cctcccaggt tcaagcgatt ctcctgcctc agcctcccaa gtagctggga 31201 ctataggtgt gcgccaccat gcccagctaa tttttgtact tttagtagag acagggtttc 31261 accatgttgg ccaggatggt ctcgatctct tgaccttgtg atccgcccgc ctcagcctcc 31321 taaagtgctg ggactgcagg cttgagccac ggtgcctggc ctgttacaca gttctttctt 31381 actgcatgcc agactgtgct aagcatctta caagcattta attttatagc aactctatga 31441 cagaaagtat cattatccca tagtacaaaa gaggcagctg aaactcagca aggttaggga 31501 gcttggagcc agatggagcc tgtgctatta ggctataaag caggggagtg aattagtatt 31561 atgcctggca tgtagtatgg atttactgaa aagtacccat tcccttttct gaccttcttt 31621 ttcccctgcc cccagcttca taataaggca tttcttaagg aagggcaaga aagtaagcaa 31681 gtccctggaa aacatagtac aaggctgtct aaaacataca ccagatgcat tttacccatg 31741 cttaagcaac atcctgtttt ggatattcag ctggatacaa accttcagaa tcttttcagt 31801 gatgttgggt tgactctctt gattctacca ccacattcta ctctttggta tctaacttca 31861 cagcaagagt aaaaataatt attttggggt atgaattttt taaaactctt acctcctcag 31921 cagctatatc ctactggcag ggcggtgtta ctacttcaca cagggccagg gctagggact 31981 ggaattgtga cacaaactct cttcaaccca tggaagtctg tgcttccatg gaggtctctg 32041 gaaagctgcg gaacagtcca gctgcatcac tgcttggtgg ccgggcatgc tgggtcctcc 32101 ccaggcctcc tgctgtgtct gtattatcaa acttgctgag gctcagctac acagcaaggt 32161 tcccagagag tggcagtctc tatcttactg tgaggggaca ggaaggaagg gaaagtgaga 32221 tcacttcctg gcaggcctgc caaaggccta ggggctgggt cacactccag gtctgcccat 32281 gccaggctgg gcacctctgg acaaggctct tcccctcaaa cctcgggagt ctcaagtgtg 32341 gaatgaaggg ggaaactggg tatctctgtg gtggttctga aacatgtctg caaattatct 32401 gacattcctc ctctcgggag gtagagttta atttcaattt ccccgaacga gctggcctta 32461 atgacctcta acaaatagaa tgcaacaaaa atgattttgt gtcacctcca agatgaggct 32521 ataaagggtt atacagcttt cacctagctc tctctgggga aacctgcctc tagagcattg 32581 agctgcatat gggaaatctg aggtggccgt gctgtgagga agcccaaaca ggccctatga 32641 agtaaccaca ggggagaggt cctgagacgg ccaggaaaga gcagcagccg gccagccttc 32701 agcggtgctg gccccaagct gctccagttc cagccaccat ctaactgcaa ctgcataaaa 32761 gatccaagcc agaactgccc agccaagctg ttcccaaatt cctgacccac agaaaccatg 32821 agatataaga aactattgct gttgttttaa accactgagt aacaggggtg atggattatg 32881 cagaagtgga taactgaagc aatctctaat gttacttcta gctctggggt aagggaggta 32941 agagtcttct agctctgaag gaagggcagg aaagcaagga agtccttgga aaactgcccc 33001 tcctttcaca gtaggatgga gactgccact ctctgagaac cttgctgtgt agctgagcct 33061 cggcaaattt agatcataca tccgcagcag gaggcccggg gaggacccag catgcccagc 33121 caccaagcag tgatccagat ggactgttct gctgctttcc agagaccttc atggaagcac 33181 agactttcat gggttgaaga cagtttgtgt caattctagt tcctctaatt ctataaaatg 33241 catctccaaa atacaccgtg gagaatgaaa agaagcaaga tattatctct atagtattca 33301 tccattcctt cagaaaaatt tcactgatcc ctcagtgtat actaggcttg ggactagata 33361 ccacagatac aaaaacagta agataccatt gtgcccctga gggaagtcca tggtctagtt 33421 agggagctca gttcataagc ccgtaattct taaccagtgg tagtaggggg gcaaaaatca 33481 caattaacag aatcactaga ggatagtttc tcatagtaat tctaataaag ccaggtgggg 33541 ggatctggtg ggggattatt attattctaa taatgccggg tggggggacc tggtggggaa 33601 ttattattat tctaataatg ctgggtggag cgactgaggg tgtgctctct agctaatgat 33661 cactgtcctc atcagctgct gttactgatg gttgggtatc acagcccaca cacatgatga 33721 gagagaggag tttagaaacc acttctgcat atacatctaa gtatgcgagg cagagggcac 33781 agcatgcaag aggcaagggg agcctgctgc ccagaaagac taataagagg ccatggggct 33841 ggggagcagt gaatgggaga ggaggtgagt ggaatgacag tcactgggga ctttcagggt 33901 cttgaggggc aaagagcaga ctttaaattt tgctctgact ttgattttac ccttaatatg 33961 tgaatgcttc ccttccaaga caatattcag aggcttctca ctagtgtagg gtgagtgaag 34021 cccttcagac agtaagggag tctcttcaca cactgctcag acactgcact gtgtgtccac 34081 ggccactgcc agcagtggct ggggctctgc cgtcttcagc tgcaggaggg atgtccttcc 34141 atccttcctg cctcctgaga gaagtagggt tccgaaccca gaacttgcta cataggtaaa 34201 taccggttga acaacagcct cccttggaac ctggaagagc aacatgtttg caagccaggg 34261 accatcacta cttgagcaga gtgtagatgt catccgtacc tctggcccac attaacatca 34321 tttactcaag gttcctaagc ccttggttgt tcagtattcc agccctcccc ccacacctcc 34381 agtcttcatg ggaatgccag ccccctgtaa acagtcctgt gtaattcctg gcagcctggg 34441 gcgtaggccc tttagcctga gtccctcttc ctcttccctg acccaccaga agagatgaag 34501 tttccttgcc tgccacctgc agcaccttga cttcttccta ataatgactc agtcgaaatt 34561 cattcaacag tcattcgaca aatacacaca gaaatgtccc gtgagccagg catgatgctg 34621 atacatttgt tcattatctg ccattccaac tgattccaag agggtaggta ccctgccttt 34681 cttgtttgtc actgtatctt tgaaatatag cacagctcct gacatataga ttttaatcac 34741 tcagtgacta aataaccatc tgattcctgg ctcctgacca aataaataat aacaatgata 34801 agagtaaaac aaaacaaaaa tgtgtttaaa attaatttat ttaatattta ataaaataaa 34861 aaattggcta ccacttatgg ggtgaatata ctatgttcaa acatgttgac atgtttgggg 34921 tgctgggcat gcattttcgt ggttgtactt ggggcatcca caggacagtt gtacttttac 34981 agcaccaggg gcaccaggcg cagtggttca cacctgtaat cccagcactt tgggaggcca 35041 aggctggcgg atcacgaggt caggagttcg agaccagctt taccaacatg gtaaaacccc 35101 atctctacta aaaatacaaa aattagccgg gcatggtggt gcacgcctgt gatcccagct 35161 actcaggagg ctgagacaga agaattgctt gaacccagga ggcggaggtt gcagtgagcc 35221 aagatcatgc cactgcaccc ctgcctgggc aacacagcga aactccatct caaaaaaata 35281 aataaataaa taaaagtact tggggcatcc caaaggacag ctgtacattt atagttataa 35341 ttgtgaactt cagcaatatt tgatactgta tcccagttca tgcctaagca gaaacactag 35401 aggcctggtg tgctctctca agattctggc atctgagagg ctggtggcca agggagcaca 35461 gtggcccaca acactttggg ttctctctag accagtgttc ttaaacttta agggtcacag 35521 acacatttga gagtttgagg gaagctacag atactttgcc cagaaaagta tatattctct 35581 gcgtgtgtgt gtaaccatac ctatgatcac agggggctct taacccataa gcccctggct 35641 aaaaactcca gctctagaca tcacataacc ctgcattctt aaggtttgga ggctctggac 35701 ctcaactctc ccagcacagc tggaacagaa gccttcctaa aatggccttg gccagaaaaa 35761 gagtcctggg agccgactgc tgcatcctag tcactgcaaa catctgcacc caggggattc 35821 ccaggggccc tgagtcagaa caggtcacaa gatgggtggc ttggaaagtt gctatggcaa 35881 ggccctgaag ccttgctgat aagcagccaa atagttccaa cctcaaggca gcttccctcc 35941 agatgagacc cttgctatga caacagtgaa aaggactctt aagagaggtg ctggaggcag 36001 agtagaggcc aaaaaaggga attcaaagga gctggaaatc ccccagactc ccaggactac 36061 taggacctgg agaaataaag gctggatttt ctgcccccat tgcaaggaga cagagccgaa 36121 gaggggtgag gggtgggaat gggggaggca agtaagctta agctatgggt cagagcctgt 36181 atgaatcggg gctccaccaa ttgctagctg catgacgctg agcaaatcac ttaacatctc 36241 tgagcctcag tttcttcttt tgtaaaacag agatatcaac ccccaaccct tagggttgtt 36301 gtgaaattaa aggaaattca tgcataattt ttttgagtcc tttcctccta ggaacctgaa 36361 aaggtaagga atgaaggaag tacaggatga tagttgagag ctcactctag tctcagatga 36421 aatctgtgag gcaaaattcc tgacctctct gaacctcagg ttcttcatct gtgaaacggg 36481 gacaatgata ctttacttgt aaggttctac tgggattgga aataatggat gtataatgcc 36541 tggctcacat gacaagatgc tcaataaagg gtggtggtta ctattatcag aaaagcaaag 36601 ctaaattaaa gaagtcgttt tggtgtagct gaaaaaatag aagatgtgga gctaaaaaaa 36661 gctcaagtga aatcttagtt atgcaacttt ccagctctgg tacattggcc aaatgatgta 36721 atctccaaaa gcctcagttt cctccttggt aaaacatgga taataacaga aactacctaa 36781 ttagacactg gatgtaaagg tttagcccag cgttggcaca agagcactca tccatatgct 36841 ataaaccata ctctttgagc ttacacaaca gctcagggta tgactgatgg cttcctgtga 36901 cctgggcaga ttctttaaat ttttttcagt tttattaagg tatgaatgac aaataaaaat 36961 gttatatatt cacactgtga catgattatc acaatcaagc taattaacat atccaccacc 37021 tcacacagtt actttttgtg acgagaatac ttaagatcta ctctcttagc aaatttcaag 37081 tatacaatac catattatta actgtaatta ccatgctgta cattaggtcg ccaaaattta 37141 ttcatcttat aactgcaagt ttgtaacctt tgctcaacat ctctccattt ccccaacaac 37201 ccttctactc tctctggaaa ccacccttct actctctctt agttcaactc ttttagattc 37261 cacatataac tgagatcctg tagtatttgt ctttctgtgt ctgccctatt tcacttagca 37321 caatgtcctc caggttcaac catgtttttg caaatgtcag gattttcttt tttaaggctg 37381 agtaatagtc gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgttg 37441 attattatag cattataatt tttttttttg agacataatc tcactctttc agccaggctg 37501 gagtacagtg gcatgactac ggctcactgc agccttgatg cgatgcctgg gctcaagtga 37561 tcctcccacc ttggcctcct gagtagctgg gactacaggt gcacgccacc atgcccagct 37621 actttttgga tttttttttt ttcttttata gaaacagggt ttcgctatgt tgcccaggtt 37681 ggtctcgaac tcccgcctca gcctcccaaa gtgctggaat tacaggtgtg agtcaccatg 37741 tccagcctat aatatttgaa atcaggttgt gtgatgcctc caacttagat cttcttgctc 37801 aagattgcct tagtaatttg gggtctttca tggtttcata ggaattttag gattatttgt 37861 tctatttctg taagcaatgc cactggaatt ttgataggaa tggaattaat tctgtagatc 37921 gctttgggta gtatggacat tttgacaata ttgattctac tgggcagatt ctttaaatgg 37981 cttggtatgg acacagcctt aactctttcc agaacaggct gaactgcctg ttcaactagg 38041 caggattctc actttttact ctctccactg gcttcagagg ctcctctcaa aggagttttc 38101 ctcatggcca ggagacaggg agtgacttca gtgtccacgg taaaccagat tagggggtgg 38161 ccttctggca gaaggagtaa ctatgtgctt ccctggatgc acggcctgcc ctccttgctg 38221 aggtgcaaag gaaagcagcc tgggagatga gcaggcacac tcctagctgc caagcagttt 38281 gcccataggg tgggaaggca gctgccctgt ttaatgcacc tggggaaaag gactgattga 38341 gagaagtgct aaatgggcat gttggcccca ggattcagtt ctggaccacg ttttggttat 38401 ctgagggctc tgtacaggat tttcaaggtt tgtaagagaa cagctgtccc ttgattcctc 38461 tgtgtcctct gcactgtctg aaaggtctga aggagcacca gcagaatggc agatgctacc 38521 cacagaagtc agagagcacg aactccagac tccggccctt ctccttcttc ccaattgtgt 38581 cactaccagc aagcagttcc acctctttga gtctcggttt cctcatctat caaatggaga 38641 taatatatat gtacttcata agatcaaact aaatgaggct gagaaggatg taagtgctta 38701 tcacatctgg cacatggctt aaataatagt tctagcagct ctgcccaagc actcttttgt 38761 ctgtgggtgg tgccaagcac acacctggcc ttcggtcagt gtttgctccc tcccctctgt 38821 agaccggtcc ttcttggctg cagggagagg gagctgtgag actgatcctg aggctagggc 38881 ttcttcactg ctctcatccc aggcgataag ctgcagcgac aggatgcact taatgtgcgc 38941 ttgaggattt caacaaatcc ttacatggca aacgaaacag agagatttct ctggaagccc 39001 tagctgggca tgtaaaatat ccatgatatt ttaagatatc atgatgatat ctcatgagat 39061 gagagaaaag aagagtaata cttattatct tcatttaata gataaggaag ctgaagctca 39121 taggtgtgaa gtgacttgtc caaggcaacc tgtacactct gctttacact gggtgccccc 39181 acccacccac ccagcaccac ttcctctttt ctccccaagg agtctatctt atctcagact 39241 tcgggctctg tcctttgcct tcttgaagtt ccaagggaag gaggagcaca ggcgctaaga 39301 agagaggtga tgagagtggg ccaggcagag tgcggaactc tctcagtccc ttagaactct 39361 ctcagtccct tacccaaact gaagctagac agtctccttc ctaagcctga ctcacaactc 39421 tcccctgctt aaagccctcc agcgactttc cattcagagc catcaggata aagttcagat 39481 tcattcatgg acacctccag actcaactcc agccagggaa ccccctctgc actgtgtttg 39541 gcccatggaa tttccatgga gtgctgcacc ccccaagcct ccaggccttc gctgtgctgc 39601 tccgcaagcc cccagcctgt ccccgcccac tcttctccaa gtcatctcct gcataccttt 39661 caagccgcag caccctctgc ccactagccc tctggaccta ccgcagacta ctgctactag 39721 tctactgtca gccagaccca caagcttcct gagggcaagg actgtgccac ttttgctttg 39781 ttatctggtg tccaatttat agaagatgct ccataaatac ctgctaaatt aatgtaaaaa 39841 tgaaaaaaac ttttcctggc tgaggcagga gtctaaactg agctgcacca gacacagctg 39901 caatgtgccc cctacagtgc tgaagagaga gaaaatcaca agaccttctc tctcccatgc 39961 agctttcagt cctcacacca tccttagctt atggacagat gcacagtgga ttcggcaagc 40021 tcttcagagc gatgcgctca tccccaggct ctttagcctc tttgtgaaaa ttccagctat 40081 ggagaaggtc caggtgggtg taggccaatc tagaatagcc ttcaattctg ataccttaac 40141 aaccacatcc atggatgctg atggatgcct ggtgggtgaa aagatctctg tggacccagt 40201 acttccaact tgacaattca aagaaattca ccagatttgt tgataaatac tgctgcttta 40261 ggtaaaaagt cgattattat gttaccctga gtttaaaatt attaacttat gccagaaaca 40321 tttagatatc ataaactatg gacactggac ccaaggtggc catactgcca ggtgggccac 40381 ttgtgctctt tgaatggcaa tttgttctca gctggaggga ctgcatcttc atcccttctc 40441 accagactac tgggccgaac ccatttcttt gggttgggtt ttgctggctt ttgctcctct 40501 attacagcat taaaatgaaa attcccataa tgtgccctgg tctttgtgaa aacagcttta 40561 cagcaaacac ttggtggaga ggaaagctgg caccaggccc aatgttgcta ataactcctg 40621 tgtatgattc cacctgctta agcttaagta catggccggg aaacccatcc cctcttaagc 40681 caccaaaggt aaattttctt tctgatgcta ttaaaaaaaa aacgcctgaa aacagaatct 40741 tgactgtgaa attgtagcat gctgttatgt atataacttc tgaaaagagc catcctgcct 40801 tttcttcata aggtcaaaat ctcccactaa tttagtttgg aacttacgta ttttcttcct 40861 tcaaccagaa ataaagatat tcaggcaaaa acgtgtgctg ctctggcagt agagtatgaa 40921 cagtgccact acctacctgg tttaatttct ttttcctgct tgcatctctt cagattttcc 40981 actgttgact aatcagtacg ggaattctaa ggaaccagaa gaaaaacaaa aattagagcc 41041 agacatactc tgggattata aattgaaaca atcttttggg aagacaagtt gacaatatct 41101 attaacttgt taaatgtgca tacatatcct ttggggtcaa attcctagac acgtcacatt 41161 cctgaaatgt ccaggaattt atactacaga tatatttacc taagtaaaaa gatactcagt 41221 gaaatactgt atagtagcaa aaccaagggg cgaaaacaac ttaaatgtct attaatagaa 41281 aaatagttaa attatggtgt atcaatatca tgagaaacta tacagcaact tcaaagaatg 41341 aaggagattt cattgtactg atgtggaaaa aagtacatgc ttgattgtta aaaagcatat 41401 tgtaaaattg catatattat cattttttaa ttaaaaaaag aactttacat tttcttttta 41461 attttaagag accaggtctc actctgtccc caggctggag tgtactggtg caatcatagc 41521 tcaatgtagc ctcaaactcc tggcctcaag caatcatcca ctctcagcca cctgactagt 41581 tgagattaca ggtgtgaact actgtgtcct gctcaaatgt gaacatgatt atttatgttt 41641 agaaaaaaaa aaagtacagc atttgtgtac gtgcacattt ttaaaaagta tacatacttt 41701 acactttaaa caattttcag tatatataga aaagggcaaa ctgtaaccaa tggttccccc 41761 tgggaaactt ctctcttttg aaaaaaatta ttctcttttg agaaattgtg agaaaagaaa 41821 tgaaagactt tcactttgta cattatctat ttctgttttg attgaaaact tttaaagagg 41881 catgtataac tttcacattc attacacatg ctatgcaaaa aattctatat ctagctaaca 41941 tatcaagtgt gagggcaaaa gatattttca aacaaagact gggcaagttt accattcaca 42001 gaccccaaga atattctcca atagaaagga aactgaatac agaaaataag tgagattttt 42061 agaatcaata atgagcaaag gaaggaataa tccagcaccc ttctttactc cctctccctc 42121 ctttactctt ggtataagtg actccatttc tttttttttt tttgaaagtc tcactctgtc 42181 acccaggctg gagtgcaatg gcacaatctc agctcactgc aacctctgcc tcccaagttt 42241 aagcaattct tccgcctcag cctcccaagt agctgggact gcaggtgcgt gccaccgcgc 42301 ccggctaatt tttgtatttt tttagtacag acggggtttc accatgttgg ccaggctgat 42361 cttgaaatac tgacctcagg tgatccactc gccttggcct cccaaagtgt tgggattaca 42421 ggcatgagcc accgcgccca gccctcttct cagtttgaat accacctttt caaggaagac 42481 cccagccaca ctctcctctt tcactccccc tgcttattga cttcttaacg ccaccatttg 42541 tgcttgtaaa cttcatgcag gtaaaggcca tgcctgttat atttaacatc atccacctat 42601 caccttgccc agtgcctggc ggatattagg ttctcaacaa atattggcta aatacatgaa 42661 aaatttattg aaggctcagc tcaaatgcta ctgctcctcc aatctcttag gctttctaca 42721 ggactctgtc cccatcttgg ggcacatttt cacaatgtgt tcatgttatc tgagtctttt 42781 tttcctatcc ggccatgagc aacctcaaag tcttatttac ctcgacattc cccaccgcac 42841 cttgcataat gcctaacaca taaagggttc ttagcagtag atgaatctac acacacgtcc 42901 aggaaagtat aaatgatggc cgcaggctct aaccaattgc agatagtagt gattggagcc 42961 cacaacacaa gcacctttga gatatgcaaa caaacatgag actttgccca gggagaaatt 43021 aaaattagag ctaaacttga aatacagcgg tatgtgggtg gggggattgg gagacaggcc 43081 cttattcaaa aacagagaag tgaaatcact gagtgatgat ttattttaag tgtgtttttc 43141 tgtactatct agcttttctt cccttgaaca tattctaatt ttgcaatcag aaaaaaaaaa 43201 tccacaataa atgttataaa aacaaaccaa cggaaaatca gtggtgattt tagactacaa 43261 aagtcacccg tccctctgtt tccttactac aactttggac ttcagattca gggccctctc 43321 aggtggggcc catacatttt gtaaggaagt gttctttcca cccctctgaa gccacggggg 43381 ataggaaggt gccagaaatg agtgcttcag caggggcagc tctcagttgc tccctaatgc 43441 aaatgctggc aaagctccca gagccaggaa ttgtagggag agagcagaag gaatcaaccc 43501 agcgcaagat caggacctcc acaatatgct gatccagtgt ctggtgtttt gtagcacttg 43561 tgcatatttc ttttgcagta cttttcacgt gccattaaaa ttattcactc ctctgtctat 43621 cttcccttgt tgcacagaga attccacaag accagaggca gtgtctttac cctggtcctc 43681 tagttcagtg cttagtacta acaggtcctc aatgtacatt tatggaatgt taaagactac 43741 actgataaag tactaaaaca gtaactgacc taggttcctg ggacatagta agaatacacc 43801 aaatactttc ttcggtgcca tgaagggcca ttattcagtt ctgaagagtc aagctaaagc 43861 ctgtaatttc ttgaaacatt aaagagactt ctttttggaa atgcagcttc ctccagtaag 43921 agcggataca ttatgtgttc taatctgcaa ttcataatca gtcagcatag gagcccacgg 43981 caacttccaa agaaaccact ctttgaagga gcagtactcc aagttttcct aagacatttc 44041 tgaaggattt ccaggaggta gtgagctcag cctagggtca tgattaataa cctgtttccc 44101 agaactttag attttccacc tcagtctcca cttaggaatt gcttctggcc cctcgtgaaa 44161 gcatttggca gagttaattg ggccagttca gcttcttgtc tgtcacagcc agcacctgtt 44221 accagcttct tccttaagtg gctgggctag ggtcctctgg tcccaagaga atagcaccca 44281 cttctgcttt tggcctcagc actggctggc tgatttgagg ttcagcatca cttgactggg 44341 tgacaggaaa taatgctgag gcttggtttg aagagccaac atggcagctc cagggttcat 44401 cacaggggtt tcaaaatgat gagttcaaag ataacttctg gttcctcatg caaatgccat 44461 cctatgtctg agggggcaga tgcctgtgta cattggctga cattctcttt ctggaaattg 44521 agagcagcta atcctaagca tgtatttttg tactaactaa gtgctttatg agccctatct 44581 catttcatcc gcatggcacc cctttgaggt cagtatgact gtgacccccc acttgacaga 44641 aagagaagta ggttctataa agtgaggtga tcagtccagg atctcccagc taggaggcgg 44701 tgaaagcaga gctcttacct gctatcccct tccaatttct cactgaactc gagagctctc 44761 acctgctgtc cctttccaat ttctcactga actcaagagc tctcacctgc tatctctttc 44821 caatttgtca ctgaactcta gagctcttac ctgctattcc tttccaattt ctcactgaac 44881 tcgagagctc ttacctgcta cccctttcca atttctcact gccctctaga ttgtctaaat 44941 cgaaagaaac ggtagacacc aaccagatgc agggctgcaa attcccacag gggccagggg 45001 ccaggcagga gactagaggg cacaggggaa tgcgcaaatg gtaatgaact ggagaacaca 45061 cctgcctaaa ggcattcaaa ttaaaaacaa aacaccactc tgacttctga ctagagttca 45121 cttgtcatca tttaacagat ggagcaactg agatccaggg aggtgatatg aaaaaggcac 45181 tgcagtatgg tgaaatattc aagggcttcg gggacaacca gttctcattg ggaactcaag 45241 ctctagcccc tgccagctgt gagactctgg gcaatccact gcacctctct gtgcctttct 45301 taggtgtaaa atggagatga tctctgcctc acatcattgc tgtgaagatt aagagaagtc 45361 agcatatgca agatgcccag ccctgtgtgt gacatttatt tcaggaagat accaaagcca 45421 aatgagatct ctggacacat cttttccaca acctacctcc actcccaaga tgccctaaac 45481 tcactgccgg gagtgactga ggtgggagag atcagcaaac acacactgat aacaggagca 45541 actttgacac acaaaacaag aactgaaagc caccctcatc tggtacatga aggcctacct 45601 agttgaggag tgtgaggcat tccacatttc cactctccac ttcctggcag ggaccacaaa 45661 gtctaggtac cttccttcca gaacatcacc tcctcaccac caccttagga aaaaggatgc 45721 cttttttgat taccccaaat ctcaagatcc cagcggggct gggatgaagc agaggatgcc 45781 ctggccttcc cactgcccat gcccatgggt gggtccatgg cagcaacagt ccaccttcat 45841 gggatgaaat gctggggaag actgcagctt tagaagtgcc aagagacccc atctctacta 45901 aaaatacaaa aattagctgg gcgtggtggc aggcgcctgt aatcccagct attcgagggg 45961 ctgaggcagg agaatccctt gaacaaaaac gggaggtgga agctgcagtg agccaagact 46021 gcaccattgc actccagcct gggagacaga gtgagactcc gtctcaaaaa aaaaaaaaaa 46081 gaagtgccaa gggcctttga gaaccctctt attaactagg accttaattc aaacaaatag 46141 tttctaaata ccacaacttc ctgaggatta gggtctctga tgcccttaaa gaaaacaaaa 46201 caaacaaaca aaaacaaaac aaaacaaagc aaaaacctta atttgtcaag acccggcacc 46261 ttggaaactc aaaattcaag atctgacgtc ttttgctttt tttttttttt ttttgcactc 46321 tggctgtggc tatccttccc ttgtggcagg agagggtggc tctatgttaa atgatgaaag 46381 aggggcaaat agagggattt tcccccttct tctccctctt tcttgccaaa tgaggaacta 46441 ggatggcggg ctttaagttt aaattaagtt gtgtctcctt tgtacctatg tgccaggctt 46501 tctactaggc agtggggcat agtgatgggc tagacagaca cgcagagaag gtgcagatgg 46561 tctagtggag acatggggaa ttgaagttat gccagtaatt atgtaaatac agtttagttc 46621 aggataatat gaaaacatat aagtagatgg ttgtcctcag tcaatggggt tagggaataa 46681 tactgaccat ctagtgctct gtaaagggtt gagcataaca tagggcctag tgcttaagag 46741 atactcaata atgccaattc cttctttcct ctctccccaa gtgtatgtgc tagtgtatgc 46801 gtgtgcacat acagttacca ctgagacttt ttctttctct ttcttcaaga acacctaatc 46861 tgaaaaagac ccataaaaac tcagaagtaa cttaatgttg cctgttttat gtcttctttt 46921 aactgtccta caggcacccg tttttagcta ctctgggaaa ctgaaaacct agggtccttt 46981 gctcctaaag ccataagggc ccaatacaaa gggaaaagaa atatggacag ttcatttcat 47041 actgaatttg gaaatacaga gatcatctag ttttgtcccc tcattaggta caaataaaga 47101 aacaagaccc agagagtgtg aaacacttgc ccaaggtgac agtttccttg ggtgaccctt 47161 ctaatatatg gtagcttttt gttgaagctg aaatcttccc gttagctgta acttttcaaa 47221 tatacttaac ctgctaagag gggacacctc ccctgttagc acactgcaag ccatctcaag 47281 ccacctagga atcctcctca gttcagggac ccaccaagta agggacctgg gtacagcaag 47341 ggccccctct ttccctctct acagagtagc ttgcttagtt tcaaaccctg aaaacagata 47401 ccaacaaata aaatgcctaa aggcttacat gtcctgttta tattccaggc ataaacagca 47461 aatttttaaa aggaagcaat aaaaagatcg caaacccaag gttaaagatt tccaacccct 47521 ttccatcttg gagttttcca cacagctgct gcagcactgg gtttatatgt ggtaaattca 47581 tggagtctga ccatatctgt aactgaaatt taacacaaca accatgttcc tgagggggaa 47641 gataataggg aattttcact actttcggag acttttcttt ctagaatgtt taaaaattat 47701 tataccacct ttgtaatcag aaaaaaagaa agggttttac aaacacgaat ttcttaggga 47761 agatttattt cttcacctat atttggagca aggagctgct tttaaagctc tgaatactct 47821 gtcaaagttc tttgagtaaa aatgaaacag catctgaatg gagcccattt gtgagactgg 47881 aggctccgac tagccttagg cagtttcaga caaggccata attttcagag caagctcaca 47941 tatatcatta tataaaaata gacattttgt ggccaacagg ctggaaagca agctctccag 48001 agagctctgc ctccaagctc tgcttctccc tgggggtgtt gggcagggga ggagagtaaa 48061 gttctactta caagaccaaa gtttcctcaa aatccagtgg tcctcagtca gaagcggagg 48121 cagaatttaa gaaaccagga ttcaaggtta gcagtccaga cctgtggtgc gggcaggatt 48181 ctcactttac cctcctctct gtctcttccc cgtgtctcct aaatgaaaag cactttgagg 48241 ttttagtggg tgtttcccat gtctcgggcc aagtaagtcc ccatttgttg taatccaatg 48301 ctcagcctga tgatgtcaac cacacaactc actcactcag tttccacctc tctgtccaca 48361 ggagagatcc atttcctttg gacaatgagc cagggggctg taactgcatc ccaggccacc 48421 acgcttgcct aatatccttc cttcaatttt caaccctgca gctttagtga caaaaggctt 48481 cctatccact cactgagcca gggttcctgg gtaaattcct ttgcttctta gtgggttgtc 48541 tgctttcctg tgtgtttaat tcctttcttt ggtatataat atactagtta gggtggagat 48601 cttcacccca ccccacaccc atctctttct tcctgcccaa ttttccaagt tgtttcattt 48661 ctcccaggtc tcctaggtac atgctgggca cagtacatga gtttgtcctt catgtagggg 48721 aggtgatgtg tgcccccggg gaaggcttgg aaacagcact ggtgtgtccc ctgggaaact 48781 gggttccagt cttggtcagg ccttcaactg aatgaatgac ctcaagcaag tcacatccct 48841 ctctggacct cagtttccct atcaacaaaa tgctggcttt ggtgctttcc acttctgaga 48901 ttcaatgagt aaaaatcagt gactggggaa cttataaaca aatgcattat agagcatttt 48961 actacaagct acccaactac atgcttgagg gaaggaagaa ccctggagaa gtcagaggcc 49021 actgcccaag atgccaggct cagctgggca cctttaaccc cctttatgaa ggagatgaac 49081 agattctctg ggagtaaaac caaaagaatt cactgaacaa gtccttaggg acatgaaaag 49141 cagcagcata aactacaaga taagacaaac aagtaacagg ctataagtgt ccttagggac 49201 caacatggaa cagcagtggt tacgagagat catttcttgc atttggagag tgcttgatgg 49261 taaagacaaa gaacatctct ctactttcgt gtacaatggg gagcgagtga catgcatagt 49321 ggttaggaac atcatttggg agccacagat gtggatctga gtcctgtctg acttaatttc 49381 tctgaacttc agttttttta actgtcaagt gggatactag tgcctctgtt ttaggattgt 49441 gagaattaaa aaacacatgt aaggaaactg aaagtatttg gtacatacta agtattgaat 49501 aaatgtttgc tctcgttgta cataatctca tttgattctc ataacatcct taggagaatc 49561 agataagtgc tatgaggcta cattattaat ctctccttcc tggataagaa gacatataca 49621 gagaagttag gtgacatgcc caaggtcaca cagctcaaaa cgaaagggac agaattagaa 49681 atgaagcctc aactcctaac taggtcagca cacatgccac acacatgcca atgccaatgc 49741 catttctcaa agtggttgcc gcagccagcc tgggaaagga aacactcagg ccactaggtc 49801 agagaggctg ctttccttgc tttgctaccc aaatcctgtg tggcctgaag caagactggg 49861 cttcaactaa ctccctcaaa aggagcagat ctgatctgtt tcaatgccac tttgatggtg 49921 atgggaaggt aggaccggag ctttactggg aaagcggctc tctataggta aggaagaggc 49981 tgtggtaatt tgtggtttaa caagaatttt tattagttct cagaaaacac agatgcttat 50041 cagaaaacat ccaagaaaac aggctatatt aggagggagt ttgtcaatct gaaggctttt 50101 caacaatgtg acaatgtgac gtaggactgc tgtggaataa aaaccacacc agcctggccg 50161 ggcgcggtgg ctcacacctg taatcctagc actttgggag gccgaggtgg gtggattgcc 50221 tgagctcagg agtttgagac cagcttgggc aacacggtga aactccatct ctactaaaat 50281 acaaaaaaaa ttagtcaggc gtggtggcgt gttcctgtag tcccagctac ttgggaggct 50341 gagacagaag aattgcttga acccaggaga tggaggttgc agtgagccga gattgcacca 50401 ctgcactcca gcttgggcga cagtgagact ctgtctttaa aaaaaaaaca aaaaacaaaa 50461 aacaaaacaa aacaaaaaaa caccagcctg gtaacccaca gagagcagcc gttcacagct 50521 cagagagatg tgggcttcct tctggtccct agaccttctt gctgtgcttt ggtttcccag 50581 tgtaaaatga gaagaataac agccttactc tccaaggttc ttgtgtgggt tatgtgcctg 50641 gcacatacta tacattgaaa aatatttgaa tacttcttgt ctaattctga ccatttcatc 50701 atcctttgcc ctaatggtga tgtgacaatg aaagaaagtc agagattggt cgggaagtca 50761 aaagggtcct gtatttctcc agaatgtcac gatgagtaat gaacaaaagc ccacccagag 50821 ctgtaaggcc tgttctcagc agcaacactc agtttggagg gactggaggt cagggctatt 50881 gtttggctga gcctttattc cagaagactg agtaggtgtc aggaaatgag gctgggctct 50941 tgaagatggg tgctttgaat catgttttgt tacttgaaca aaaataatag taaggagccc 51001 attcataaaa acaacatcat tttcttcaac aacatgtact ctgtatcact tcccttgtga 51061 acggaataat tctacgtctg tgacagaatc ccggaagaca gaaccatagc aacactggca 51121 cattcatttc ttcttataac tgacaatact ggacacactt atgtgggctg gggccaagct 51181 gtgcactgag cactctatga acaccacttc atgtcatcct cacaataatc gggtgcagga 51241 gacattatta atattcacat cttacagctg tagcaacaga tgcatggtga gccgaccaac 51301 ctgctcaaag gctcacggga gcaaggggca gagcctgcac tctaactagg gaccatgtgg 51361 tgccagtcct gaatccacaa cccccttgca ctaccacctc ttaaagggct tcccttcagg 51421 aaacttggct gtttttcaca gacagcagca tcagctccat ttgaggatga aggaaacttt 51481 ggctgtttaa ttccaccttt tacccccagt ttagaagggg tctctccttc ctctgaattt 51541 ccttagtatg ctctgtttct ctcttaagaa acataccctt ttctctgatg acacatattt 51601 agttgggggc aggtcttttt tctctgccag aatcctaact ttttaaggaa aagatctgtg 51661 tctaatatag cagcattttg tacatattat gagatactta ttaaaataag gaaatccata 51721 aacaaacccc aaaactatta ttgtatattt accatatgcc aggtaatatt ctgtttattt 51781 tatttattta tgttgtagag acgggggtct caccatacgc ccaggctggt tttgaactcc 51841 tgagctcaag tgatctgcct gccttggcct cccaaagtgt tgggattaca ggtgtgagcc 51901 accgtgcctg gttgtattct aactgcttta tataaattac ctcatttaac cccataagga 51961 tctctatgag gtaggtataa ttattatccc tgcttgacac atgagaaaat cacaggcaca 52021 gattgattaa tttgttacgc agtgttagtc cgtttgcact gctataaagg aatacctgag 52081 gctgaataat ttatgatgaa aagaggttta tttggctcac agttctgcag actgtataag 52141 aagcatggca ctagcatctg cttctggtga ggccttagga agcttttaga catggtagag 52201 ggaaaggaga gccggaatgt tgcatggcaa gatggagagc aagtgtgaga gggagaagtg 52261 ccaggttctt tttaaacaac cagggccagg cacagtggct gacgcctgta atcccagcac 52321 tttgggaggc caaggcaggc agatcacctg aggtcaggag ttcaagacag cctggccaac 52381 atggtgaaac accgtctcta ctaaaaatac aaaaattagc tgggcatggt ggtgcacaca 52441 tgtagtccct tgcatttttt tttttttttt ttttgagatg gagtctcagg ctggagtaca 52501 gtggtgcaat ctcggctcac tgcaagctcc gcctcccagg ttcacgccat tctcctgcct 52561 cagcctcccg agtaactggg actacaggcg cccaccacca tgcccggcta atttttttgt 52621 atgcttagta gagatggggt ttcaccgtgt taaccaggat ggtctcgatc tcccgacctc 52681 gtgatccgcc caccttggcc tcccaaagtg ctgggattac aggcatgagc caccatgccc 52741 ggccaaaata atttttttta attaaaaaat aaccagttct cacttgaact aaccgaacaa 52801 gaactcactc attacagcag ggaggggatc acgccattca tgagagatct gtacgcataa 52861 cccacacacc tcccaccagg gcccacttcc aacactgggg ttcacatttc aacatgacat 52921 ccaacagcta tccaccaata tagttagaca tccaaactat atcagctagc aactgacgga 52981 tatctgttga aggagtatat gaataaagaa attctggcca ccactaccca ctttctagtc 53041 cacacagctg ctgtacagtg caatcgctca tcacccatca aaggcaggtt tcagctgtca 53101 taccatttcc aactcactta ttcactgtgc catgcagtta gaaggtccat gctttttcta 53161 gacaattaat tgagaagtat attgattgag atttacgaat aaaagtagat tttgttttct 53221 tccaattata atcataattg cttattgtaa aaagataaga aaatacagaa gaacaggaaa 53281 aaagaaagga gaacaataca tcgtctcacc acaatcctgg tgtaaggccc cagagaagga 53341 accacactta cacttaaaac aatgttcttg gtcaagcacg gtggctgacg cttgtaatcc 53401 cagcactttg ggaggctgag gcgagcaggt cacttgaggt caggagttcg agaccagcct 53461 ggccaacatg gtgaaaccct gtctctacta aaaaaaacac aaaaaaatta gctgggcgtg 53521 gtggtgcaca cctgtagtcc cagctactca ggaggctccc agctactcgg gaggctgagg 53581 caggagaatt gcttgaacct gggaggcaga ggttgcagtg agatgagatc atgccacagc 53641 attccagcct gggtgacaga atgataatct gtctcaaaaa aaaaaccaca cacacataca 53701 cacaatgttt tgaaatcaga aacttaagtc atttttgggg gatggggaat caaacccaca 53761 aaagatgact aagacattgc tcaacgtatt tttttggttt tggagtacac gtggctgctt 53821 tttcttcagg gttcgatgtc ctgaggatcc actttgtggt cccctttgga ctgctgattt 53881 ggtgtgtctg aaggagagcc agaaggagaa gagaagcctg accccagtga ttatgtggct 53941 agaaactgga taggttcgca ggatatcaca ctgtcccctc cctgcctttt ttcaaaccac 54001 ttcccaccag cttgagataa gtttccaact acagctgttt tgggagagat ggaccttcac 54061 ttaaacagtg accttaactt taaacagtct tggactcatt gatttttgca gctgaaagga 54121 gcatgtggtc tcctcatttt aggggtgagg aatgccagat ccagagaagg caagggacta 54181 tcctgaggta gtacagcctg gtgctccaac aggagaaggt ttgaggctct cacttgctac 54241 tctctattgg tctcagactc tcacttgcta ctgcactttg acatacaaat ttgaagggct 54301 cttccctctt attttggtta actcctactc agccatcaga tttcagatta aatatcactt 54361 ccctagggcg cccttttctc tgagccacgt tagctccccc tgctgtactg gtctcatggt 54421 actcctatgt ctcttcttat aacccttaac atatttataa gtacacgttc catgtgtttc 54481 cccaccagac tgtaagctcc acaaagtcaa cagtatgtct ccttggttca ctgaatacat 54541 ctatcccaag tgcctagaag agtgcctggc ataaaatagg aaactcaata aatattttct 54601 aaacacacac acacacacac acacacacac acacacacac acactctact tagtccagtc 54661 tatggtaaaa cattaacata ctttttgtaa aaagcaccac gtcttctgaa ctacaaattg 54721 ggcctttttg ctgaaggaag gatcattaga gaacaaggga tttcatttca aataatttcc 54781 ggaagataga aaaagttcag agaggtatga caacagaagc tgaagtttgg agcacaagga 54841 caagaaagca ccattggaaa tgagagacaa ggaatggtgt gggaccccag ggaggctctg 54901 aacctgaaga cagcaggcgt cgagaagatt cagaagggcc aaaaaattaa gggatccact 54961 gaaggtcctg tgccagccag ctcctctatc ccatacctgc agcacacaga gggcccaaaa 55021 gctctgcaga accacagcca aagacaggag gcctctttaa ataaattgaa tgaactgcct 55081 aggattgatc gagggcttct agtaagagac tggcctcctt atgctaacac gtgaagggcc 55141 ctcagtctaa ctgctgctcc tgctcatgtg ccctaaagca tcttccactt agtaccttag 55201 tacccttcca gtacccagca gcttcccagg cagggcagag gcctatggga caaacagatg 55261 aactgagacc tatcccctta aagaagtctt ttattattga ttgtgaatgg ccaaataaag 55321 catgacagga tgtctgatca aaaccagctg catgacagga gtttgagacc agcctggcca 55381 acatggcaaa accccatctc tactaaaaat acaaaaattg gctgggcgtg gtggcaggtg 55441 cctgtaatct cagctactcc ggaggcagag gcaggagcat cgtttgaacc caggatgcag 55501 aagttgcagt gagccgagat catgccattg cactccagcg tgggtgacgg ccagcctccg 55561 tctcaaaaat aaaaatacaa aaattagcca ggcatggtgg cgcatgcctg taatcccagc 55621 tacttgggag gctgaggcag gagaactact tgaaccttgg gaggtggagg ttgcagtgag 55681 ctgagatggt gccactgcac tccagcctgg gcaatagagc gagactgtct caaaaacaaa 55741 caaaaaactg atgagtcaag aaaaagcagt atggtaatgt gaggtccttg tttggatctg 55801 actagaacta aaaaaataac ttatgagtta aatggagacc actgctaata ataaaggatt 55861 attaattatt ttatgtgtga tagtgtggtt atatttttaa aaggagtcct tattctctta 55921 aaagtacatg ctaggctggg cacggtagct cacgcctgta atctcaacac tttgggaggc 55981 caaggcaggt agatcacctg aggtcaggag ttcgagacca gcctaatatg gtgaaaccct 56041 gtctctacta aaaataaaaa aattagctgg gcatggtggc gtgtgcctgt agtcccacct 56101 gcctgggagg ctgagacagg agaactgctt gaaccaggga ggtggaggta gcagtgagcc 56161 aagatcgtac cactgcactc cagcctgggt gacaggcgac attgcaaaaa caaaaacaaa 56221 aacaaaaaca aaaacaaaaa aaacatgctg aagtaattac tgataaaatg atgtctagaa 56281 tttactccaa aataatctag tggcagggga aaatggctac cagcacagat gaaacaaaac 56341 tggtcatggg tttttgattt ttgaagccgg gtaatgggtg gtacatgagg attcattata 56401 ctattctttc tagttttgta catgtttcaa aatttctgta atacagttaa aaaaaaccat 56461 atctcgataa gcatattatt tagatataga gagataaatg ctagatcaaa cgcctaaaag 56521 gatcaaaagt gtatgccctt gggagcagga taggaggaag ggtaaagaaa gggtgggggc 56581 agggaacaga tggtttttat tacaaatctt ttatgatatt tgaccttaaa aagactaagg 56641 caaactatgt gcatgtacct ttctgacaaa aacaaaattc atttaaaaaa ggaaatggga 56701 gtaacattat ttagagaaat agaagtaacc accaaaagaa gtaaaagcag aaaaataaaa 56761 tggctgcctc agggtgaggg gtaagcagga gacagatgcc tttctagact cagagacttt 56821 ttaaaaagtc atattcatgt attacttgga ttttttaaag ttagctatta ttatcagttg 56881 ctactatcag ttactagcag ggccttttgc taatctctac atctataaac tgaatacttc 56941 cttaggttta gaatcctaaa tcacccagag tgaatatatc cacattaagt gtaataataa 57001 caattattgc tattcaacta ttgtttgcca tttgtcagtg ctttacatat gttatttcta 57061 atccttctaa cactcctgca aggcaagtat agatgttccc attttacaga tgaaaaggtt 57121 gaggctgaga gacattagcc acttgcttga ggtcacagtg tgggcgaggc ctgtgagact 57181 caaacaccaa gctccttcca ttaccctact tctcctttgc attccatttg ttcggtatag 57241 tccagaaagt acctttacat aaattttctc atcaacccca agaaacagga gggacatgac 57301 tatctttact tgacaaatga agaaatcaaa gctaggaaaa gttacgtaac ttgcgcaaga 57361 atccaggctc ctgatcctta atctagtact ccttctacca ttccttatgt cctctcaaca 57421 acaaaatgtt aaattagtca taatctcctc ataggcattt gacataagag tcactgcaca 57481 tgaatgcagc aatgcagcta aagcaggaat tttaaagaga gagagagggg gagacatttt 57541 ttccaaagga aatatactaa taaactatgg actgtcctca gttgctctaa tgacagcatg 57601 ttgctggagt cctcaaaggg actcttgaaa tgtcagcagc ctgcctgcct ggcacagagc 57661 agatgagttg ctggcacagg cagcctgcag cagttctgac aaccagtgcc actctactta 57721 gttccaaaga aagaaatgag ccatgcgtgc ctctgccctg aaaaaagaac ttctgtgctt 57781 cctaaatcta cttcagtcaa cagccatggg gccccagggc tttttaatag ttgccaggaa 57841 tcttcttaga tcagcgctac tcatcagagc agaggcatgt ttatgaaact gtacatcagg 57901 ttgcataggg gtgggagcca aacgtgggat aactaaagag ggtgaacatg tgacccttgg 57961 tcatctcaag tcagggtagc tgaagctctc cccacaagag aagcagccca aaatttgggc 58021 acaagagcct tgagcttggt gtgagatgct cctcagtctg tgctggcatc tgaacaatgg 58081 ccaagcagca gttagggaga agaggctgtt tacttgagtt caaactattt atagcccagc 58141 aatctgcaca tatttggatg acaaatctct gggcttcaag taagtggtgc tttgctcctg 58201 gacaatttca tcaaacacac tccctcccca cgccaccccc agtccccctc actaacacat 58261 gttgccctca ggatgtttta tttaagttgc tgaattttaa aagattataa aaagaaacca 58321 cattttaaga gcaagtgact tttaatccat ttatttccaa gtttgaacta catattttct 58381 ttaaggcttg aatcagaaat taaattataa tacatgtata ttctataaaa atatatattt 58441 gccccataca tgaacaaaac agtaactgag tgatgtctat aatttaacac gtatattgac 58501 ctggcgtggt ggctcacgcc tgtaatccca gcactttggg agtctgaggt gggtggatca 58561 cttgaggtca ggagttggag accagcctga ccaacatggt gaaaccctgt gtctactaaa 58621 aacacaaaaa aattagccgg gtgtggtggt gggcacctgt aatcccggct actggggagg 58681 ctgaggcagg agaattgctt gaacccggga ggcagaggtt gtaatgagct gagattgtgc 58741 cactgcaccc cagcctgggc aacagagcga gactccgtct caaaacaaac aaacaaaaac 58801 atgtatattg tcgtaaagat gctgaggcct gctcaagccc tacaaagaga acgtttctac 58861 gatgattaaa ggctttcaga acagatccac aatatgtaat catatacacg ttacaaacaa 58921 gaggtcttat tacctgtgga agcaatggta gtgtacagtg ttcaccaatg cccattgccc 58981 ccattgcatg ccatacagga aaaaggctca actgtaacaa acatatgtgc agagccttct 59041 gcacagtttt aactcacaga acgtgttaca gccttaacaa tgaatttgtt tggaaaaacc 59101 agtctaccct aaaagctgcc ttgtttttat agcaaaagca ttgctgttct gaatcttgca 59161 tctcccctga cccaggcact gcctaattct aaagacgaaa aagtgcttgt tagttaacat 59221 gcatgttggc agcttccagt cctacaatgt aggcaggatg tggtgtaaag aaaaaggcca 59281 gtaaagtgaa gacaacggtt tatagtataa tggatgatgg tagacaatta acacagggag 59341 aggaaataga agagattcct aaaagggact tacacatttt cttatctttt ccctctgcca 59401 gtaaagctca ttagaaggtc cctaagacaa tctttgaaca accccacttt ttttttacat 59461 taggaaaaaa gttctggtaa tggtaaagaa aaagaggcat gtctatacat agaatgcaaa 59521 ttatttaaaa attttttcag gtcattttgt cattattatt attattatta ttattatttg 59581 agtctctcta tcgcataggc tgtagtgaga gcacgatctc agctcactgc aacctccgcc 59641 tcctgggttt aagcaattct cctgcctcag cctcccaagt agctgggatt acaggcctgc 59701 accaccacat ccggttaatt tttgtatttt tagtatagac ggggtttcac catgttggcc 59761 aggctggtct cgaactcctg gcttcaagtg atccactcgc ctcggcctcc caaagtgctg 59821 gaattacagg cgtgagccac ggtgcctggc ctaatgcatg taatgtttga tcccatgttt 59881 ttatttctag gaatttttga attataaaac tatccttatc actgagacct tagctgagat 59941 gtcacttctt cagaaaaact ttcccaagcc acccaaccta aaataaccac ccatctacct 60001 atcatgtcac tttctgttaa gcgtccatac agcaccactt ttttttgttg ttcattcatt 60061 tctttactgc ctttcactgc tgcctcctcc tgtcccacat taaaatatca gttctatgag 60121 atgaggggcc tggtctgtct tggacaccat tcttcctagg tgctgcacac tagccaggga 60181 cattgcagca gcatctcatt taatcttcac agccatatga ggcttctgtt atccttccat 60241 tttagagata ggtaaactga tgtacagttt cattttagtg agtggccagt aagtggcagt 60301 cagaatctaa atactggtat gtatgacgac aaagtccaca ctctaaacta cctccctcag 60361 gatgatacaa aagtttaaat agtttaaatg tgcacacttt ttaaaccaaa ccatcctagt 60421 tgtggaaatg tattaaatgg agatgatcac acaagaacac ataattagat acatcttatt 60481 tattgcatac aacttatcat aggaaatact ggaaatgttt aggcaagtca ccagttagcc 60541 ataaggttca agtttacgtt ttggtaaaca agggccccca tctcagacac tatgaggatt 60601 taatgagact atgcatagta cctggctcac agtccatgtt ttgttgttac tgatatattt 60661 attgattcat tcgagacagg gtcttgcact gtaactcagg caggagtgca gtgatatcat 60721 catagctcgc tgtaacctca aactcctagt ctcaagtgat tctcccacaa ctcagcctct 60781 ggaataacta ggactacagg catgcaccac cattcctggc taattttttt catttcctgt 60841 ggagacaggg tctcactgtg tcgccaggct ggtctcaaac ttctggtctc aagtaatcct 60901 catgccttgg cctcccacag tgctgggatt acaggcgtga gccactgagc ccagcctaca 60961 tttaaatctc taaaaggata tacaatacac caaattgtat atctctggac agcaggatta 61021 cacggggagt gtgcgatatg gcaaggctca cacctgtaat cccagctact tgggaggctg 61081 aagcaggaga atctcttgaa ctcgggaggc cgaggttgca gtgagccgag atcgcgccac 61141 tgcactgcag cctgggcaac aagagtaaaa ctccacctca aaaaaaaata aataaaatga 61201 caaaataaaa taacaaaaaa aataaaaatt acatgtttat gtatatttta tttacagttt 61261 tggaagtaac caccacaatt acaaaaaagc attaaagtaa aaaacaaacc gctttcagaa 61321 ggcctggcta aagcaagtgg ggattttttt tactttcatc gctatatctt tctgttctga 61381 atgtttattt ttgtgtctat gtattagttt tataattttt gttaaattgg ccaaagaagg 61441 aaggttggtg gaactgtgtg atcactaacc acaataaata ataatggact gaataatgta 61501 aatataaatt tcaggagaga gaaaatttag tcttagcttg actggaatag gtattctact 61561 agacatgtgc cttatcaaag atttctttta aatgatattt ctcagtgttg cagagatatg 61621 agaaaatgta ggtgaaaatg gaaaacggta caacattcct aaagggcaat ttgacgatat 61681 gtgctgaaat ccttaaaaat gtagggggga catgggacct ctctgtactt gctgtgcagt 61741 ttcactggga acctacaact ggtctaaaaa ataaagtttc ttaattaaaa aatgtaaatg 61801 ctctctgacc cactcatctt acatctaaga atctatccaa gagaaaccac tatagatgtg 61861 tgtgcaaaga cttgacttaa agttgttcac ttcagcactg ttcagaaaac tgtaaaactg 61921 aaaatcctta aagtccacta ttaggcagtg gttaaataaa ctttggtaaa tctacagaat 61981 ggaatattac acagtcacaa taaatgatac tatcgaagaa tatctgatga cattaaaaaa 62041 aatagtcaca taagagccag gtgcagtggc tcacgactgt aatcccagca atttgggagg 62101 ctgaggcagg tggatcacaa ggtcaggagt tcaagaccag cctggccaag atggtgaaac 62161 gccgtctcta ctaaaattac aaaaattagc caggcgcggt ggcaggtgcc tgtaatctca 62221 gctactcgtg aggctgaggc aggagaatca cttgaaacca ggcggcagag gttgtgccac 62281 tgtaccccag cccaggcgac agagtaagac tttgtctcaa aaaaaaaaaa gtctcacttt 62341 aatgttaagg tgagccccta tatgtagaag gggctcatgt ctgtaaaacg gaaagtagaa 62401 aaaaatggaa gaaactatta atcagtagca tatcaggatg atgacactgt gggtaatttt 62461 tattttctta ttggtatttt aaattttttc tataaaatag tattttacac taaaatatag 62521 cattacttct acaaaaagaa aaaatacgaa aagttatttt taagttcctt tggaaaatac 62581 ataccaagtg ctacaaaatg tgcatacact gacttataaa tgagagagac atttatatca 62641 gagttattta tttatttatt ttcagacagg gtctcattct gtcacccagg ctcgagtgca 62701 gtggcatgat catggtgcag cctcgacctc ccagactcag gtgatcctcc cacttcagct 62761 tcctgaatag ctgggactac aggcatgcgc caccatgcct agctcatttt gtattttttt 62821 tgtgagacgg gggttcacca tgttgcccag gctgatctca aacacctggg ctcaggcgat 62881 ctgccagcct cagcctccca aagtgctggg attacaggca gagttattta taatagtgaa 62941 ataatagaaa taatcaaaat atcaaacaca ttgtctggcc attaaaaatg gtgaagagag 63001 acataagcca cactctgtca gccgggaaca gtctcagtct ctagggccct gctcctgttc 63061 ttaatcctgg gcctggccca gaaggtgtgc tgcccagcct gtgcccggca ctggaaggtg 63121 cacaagaaac acaagctgga caacctcaat gaggagtagc tagaaaagcc gttgaaggta 63181 cactgaaagt gggaagaaat gagctctaag ctctgttcag ggacctcagc cttgagttac 63241 gccacctgat gataggaatg gccaagatga gtatgaggat aaaggaaaag gagaaggaaa 63301 ggagcaaagt agatagctga agaaaaggaa aataaaatag gagaagaaaa attaggagtt 63361 ttagaacagc gagatgaaaa tgaagcagga ttccaaaagg tagaaatgcc ctgagcagcc 63421 tctgagctcc tgagccagga gagccctcct tcaaggagct ttggtgagac aatcacagag 63481 actgagttct tacggggctg accttgcagg atatctggta gccagaccct agggtcacag 63541 gtgctgcgca atggcaggaa tagggttttc tctagggaag tctcatggtg tagttagatg 63601 gctttcccca gcctgcattt catttgctgc ttaactgcca tctaggtagc actaggatct 63661 gttatctggc cctctataag attctgtaag atccctactt ttttttaaga aatgcttttc 63721 tgcttaaacc agctggagca gacacttatt gttagtagca cctgagaatc ctgacagatt 63781 gagtaatgga aaccagaagg gaatgcagac aaaactccta aggaaatgaa acagatcttg 63841 gataattcat gaatggaaca tgcctattca tgtacctatg ggttacaact gatttgtgga 63901 acatgaaatc aatttagtag actgggacta gcatattttt atttttattt atttatttag 63961 agacagtctt ggtctgtcac ccaggctgaa gtacagtagt acaatcttgg ctcactgtaa 64021 cctctacctt ctgggttcaa gcaattctcc tgccaaagcc tcccgagtag ctgggattac 64081 aagtgtgcac caccacatcc agctaatttt tgtattttta gtagagacgg gatttcgcca 64141 tgttggccag gctggtctcg aacccctggc cttaagtgat ctgcctgcct cggcctccca 64201 aagtgctgga attacaggca tgagccacca tgcccagctg ggactagcat acttttcaaa 64261 acatgaaacg gtatagaata gaatgaaaag aagaaaatgt ttcatttaca tataatatat 64321 gatatgcatg catgagtcat aatataaatt gtgcatctta ttttcaaagt caaaaagttt 64381 gaaagaaact gatttacctc acctgctgga cttaaaaggc agagagaatt taataagcat 64441 tacaacagcg gctagtaaac tttttctgct ttttctgtaa gtggatagac agtaagtact 64501 tcaggctttg tggtctctgt ggtaactact caactctgcc actatagtgc aaaagcagcc 64561 acacacaatg tgtaaacaaa atgaacatag ctgtattcca ataaaactaa ttatttacaa 64621 aaacaagtgg caggctggat tttgcctgca ggccatagtt tcccaatctc tgtcttagag 64681 aatgagatgc gggtagttag tggcatgcaa tgttgtaaga gttttaaaat aatcacctat 64741 tggctgggtg tggtggctca cgcctgtaat cccagtgctt tgggaggcca aggtgggcag 64801 atcacctgag gtcaggagtt cgagaccagc ctagccaaca tggtgaaacc ccatctctac 64861 aaaaatacaa aaattagccg agcatgaagg cgggtgccta taatcccagc tacgcgagag 64921 gctgaggagg gagaattgct tgaacccggg aggcggaggt tgtagtgagc cgagatcgtg 64981 ccactgcact ccagcctggg cgacagagca atactctgtc tctaaataaa taataaaata 65041 atcacctatg attgcctgag gtaaagtcca tattaaagtc agggctattt caaacccagt 65101 agtagttgct atagaccaat gaagggcatg aaggatataa agattatgaa tgggctaact 65161 gcatctaact gcactagaaa acgtaaagaa aattataagc ccatgttcta ggactgttat 65221 gggacagttt ctgatgtcca gtatctgatg tcaccagtgc tggtaaaggc agcagtggtg 65281 ttttctctag aagagtcccc cttagtgggg ttgaccattt ctcatggaaa tattgtctgt 65341 gattgcttca ctttcatctg tgtaacatcc aagcacagat gttaaaacta cctttctgct 65401 tatgccaact acagtgaatt ttgttgtttg ctactaagaa tcctgaccaa tacatgcccc 65461 caataacact tctattatag ttcacagcac tgtacattat aactacttgt tatattctat 65521 ctcccttagt agatggaaaa ttttatattt tattcaactc tatagtccca gcactagcag 65581 aaggcctgac aaacagaagg cgcttaataa atgaacataa gtagacatta gaaaaaaacc 65641 cactaacaca caaaacctat tgctttagtt agaatggtga gaaaggtaat ttttaagtga 65701 tgctttaaaa tattttcata agcgcccagg tcaaagttgg tatcattcat ccaagaacaa 65761 gtagcatagt agtattcgta tcaaggtcaa cacgtcctaa tgaagaaaaa gaattgctga 65821 ttttctttat ttttattaat gatttttact ggttgttcaa aggacttatc ttagactaag 65881 cctcaaaggt ctgagaggtt tatacatgtc tggaacaatg ataggaatgt gaaatcatga 65941 tcatgaaggt ctgttttctc ctatattcac agcctatctt ataatataat aaatgaatgg 66001 agtgttatag aaagttacag aaagaccact gcctgagatg gacttgggat caaacaacaa 66061 aacaagaaag gagagccctc caaaactgat cctagaaata atggctaaag ccgggaacac 66121 ctgtagtccc agatactcaa gaggctgagg caggaggctt gagcccagga gctctgattc 66181 tcgctgtgga aaacacagca agaccccatc tctaaaaaaa ataataataa caacggctag 66241 aattcacagt aactcaacac tatatcacaa tttagagaga gcttatacat aaaatgaagc 66301 tgtgtccttt catgacatca gaaaaaaaaa acactaaaat agtacaaaaa cgagatcaaa 66361 atcataacat gtcatgaggc cagtgatacc acggtagcag aaagtcttag gaaatgagaa 66421 ggaacagtga aaaaggattc agtttaaaac aatttttccc acctgctgta gaggagaggc 66481 agttttgact tcagatgctc attccaattc ctttgggtga aaagagagta attcctcaca 66541 atcccatacc ctaagagatg atcaagatac tttaccagag cccccttctc cacagtcaaa 66601 attaatctct ccatctgact gagaaaagcg aggttaataa gtagtctttt acctcacccc 66661 aaaaatagct cctggaggat ccacaaatcc aaatgtaaaa atatgaaacc ataaaagcac 66721 tggaaaaaca catggcatat tttatgacct cacaatggag aaggtatttc caagtactat 66781 gatataaaat ccagaagtca ttaaaggaaa tattaataaa tttgactatg tataaaaata 66841 ttctgcatgg caaaaagcat gatcagcaaa ataaaaatca aatgtcaggc tagaggaaat 66901 tatatttcct atcaatataa atagcttatt tccttaaccc ataaagagat tttagaaact 66961 ggtaagacag acagccaggt gcagtggctc atgcctgtaa tcccagcact ttgggaggct 67021 gaggtggaag gatcacgtga gtccaggagt tttagaccag tctgggcaac acagtaagac 67081 ctcatctcta caaaaaaata caaaaattag caaggtgtgg tggcatgtgc ctgtagtcac 67141 agccactcag gaggctgcgg taggaggatc gcttgagccc aggaggctga ggctacaatg 67201 agctatgatt atgccactgt actctagcct gggcaagaga gcaagatcct gtctcagaaa 67261 aaagaaaaac aagaaattga cacgacaaag atcaacaagc caagggcttg attccaagcc 67321 ctcctatgag tcttcccagc tgggaaccta gacaaaaaca gaaacaaacc atccttgttg 67381 tgcccagtcc aaacttccga cccacagaat ctctggaaaa aaaaaaaatc agttgttcta 67441 agccacaaaa ctttggtagt ttgttatact acaataataa gtaaaatatt ggattgtgct 67501 ctgctaatat atagttggga atttttcata gtaaaatgta gaaattttcc tgagtaaaag 67561 ggcttatgat ttttcctttc tcataatgtc actgtcaggt tttttgtttt gctaatctct 67621 ttctattctc aggaagcatt tgtgtaatat taattctttc tcttaagtgc ttggtagact 67681 cctagtgatg ggctagtagt cttctttaag ggaagagttt aactgctgat tccatttggt 67741 ttctgttttt ttttaaatgg agtctcattc tttcaaccag actggagtgc aatggcacga 67801 tctccactca ctgcaacctc cgcctcctgg gttcaagcga ttctcctgct tcagtctccc 67861 aagcagctgg cacaggccac cacgcccagc taatttttgt atttttagta gagatggggt 67921 ttcgccatgt tggccagact ggtctcgaac ttctgacctt aggtgatcct cctgccttgg 67981 cctcccaaag tggtgggatt acaggcgtga gccaccatgc ccagatgact ccatttttat 68041 taatagttat attactaata aatttatcca ttcatctaaa ttttcaaaat tttggtttaa 68101 aattattcat agtattcata gtatcattat tttgttaatg agtatactat ctcccttttt 68161 cactcttttt ttttaagcac tctttttttt tttttttttt tgagacaggg tctcactgtg 68221 tcccctaggc tggagtgcag tggcacgatc taggctcact gcagcctccc cctccaagac 68281 tcaagccatc ctcccacctc agcatcctga gtggctggga ctacaggcac acaccaccac 68341 acctggctaa tttttgtaga gatggcgttt ggccatgttg cccaggctgg tctcaaactc 68401 cggagctcaa gtgatttgcc cgcctcggtc ccccaaagta ctgggattaa aggctgggcc 68461 cagccccctc tttttcattc ttgattttgg ttatttgtgc tcctttcttt atttctgtct 68521 tccttttctt gatcaaccta atcagagtat tatcattttt attaatcttt tctaagaatc 68581 atactttggc ttcattgatt ttccctgcta tacatttgct ttatacttca taacttctac 68641 tcttttcatt ctttctttcc ctttattttc tttcttttag agacagggta ggtggggtct 68701 cactatgctg cccaggctgg cctccaactc aagcaatctt cccattgcag ccttccaagc 68761 agcttggact acaggcatgc accaccatgc ctagcttctc tttatcttct tcagaattat 68821 tttgctgtgt tttctctgac ttacaaaatg tgatctgact tacaagatgg atgcttgctc 68881 agtttttagc ttttagcttt tctttaaggc tatacacttt tgaatctgca aattcatctt 68941 tctgcagttc tagaaaatga tcatcaatta tctcttcaaa tattgcttct gcaccattct 69001 tgctccactc attttgtaat tctaattaaa tgtatattaa agctgggcgc agtcccacct 69061 atttacctga ggtgggagga tcacttgagg ccagaagttt gaggccacag tgtgccatga 69121 tctcacctgg gaatagccac tgcacttcag cctggacact gcagcaagaa aaaacttgaa 69181 aaaaaaattt ttttaaatgc atattagaac ttgtcactgg atcgttttaa ttctctaatc 69241 atcttttcta tattttacat tttttaaatc tcttgaaggt ttattctgga gaattttctt 69301 ctatcttcta gttcactagt tctctcttgc taggtaatct actttaaata catccactgc 69361 attcttgttt ttgattatta tatgttttgg ttctagaatg tctatttgga ttttttcaaa 69421 tctgctgtca cttttgacag tgtctactta ctgacttttt ttttttgaga cggagtctcg 69481 cactgtcacc caggctggaa tacactggtg cgatctcggc tcactgcaac ttctgcctcc 69541 cgggttcaag cgattctcct gcctcagcct ctcaagtagc tgagatcaca ggcgcccgcc 69601 accacatcca gctaattttt tgtattttta gtagagatgg ggtttcacta tgttggtcag 69661 gctggtcttg aattcctgac ctggtgatcc acctgcctag gcctcccaaa gggctgggat 69721 tacaggcgtg agccatcaca cccggcccat tgtgacgttt ttatcttggc ttctatttct 69781 ttttgcatag taaatacagt gttttacatt ctgcatctga aattccagta tctcatggct 69841 ttgatacttt tgttgtatat ggcttttgct aattcttttc atggtatctt atttctttgt 69901 gaacctggtt ttaatcccat gtgctagtga ttacataaaa aattatttag aggactattt 69961 tgatgtccgg gatgataaca tcttcctcga gagaaaattt gcatttgctt ctgccagctg 70021 cctgatggca ttattacttg agaaccataa tgatgaaaga tcttggtcta cattccctac 70081 acctagaagc ttggtggcaa ggctattgtg agggttggct tagttcaagt tccctcttat 70141 gcccttaggg gttccattca aactaaaaaa gtggtttatc tgattcccca tccttgacgg 70201 acaacaagct aagaattctg tcctcatata tgaagacacc aaaaatacag cttaacctaa 70261 aaaatgcttt ctcgggatca gcaaatgctt ccagaacaaa aatggctttg gtagctatgc 70321 ttacctgata aatactgaaa ctctatacct cagaaacttg gacttccctt ttcacctggt 70381 gtgacacttt gtggcttttg ctgaatatat ggccataaat aactttgtat cttgtaagtt 70441 aaaagccagt tcactttgta gatgggagtg taaaccagtg aaagcttcta agcatgaatt 70501 acaatataca tcacaatggt atttatttat ttatttattt atttatttat ttagagacaa 70561 ggtctggctc tatcacctag gctcctgtgc agtggcacaa tctctggtca ctgcaacctc 70621 cgcctcccca gctcaaacca tcctcccacc tcagcctcct gagtagctgg aactacaggt 70681 gtgcaacacc atgcccagct aattttcttt tttctttcct tctttctttt tttttttgta 70741 tttttttcat attttcttct ccgggttttg cccatgaaaa aaatgttgcc caggctggtc 70801 tggaactcat gagctcaagt gatccgccca cctcagcctc tcagttttgg aattacaggc 70861 atgagccacc atgcccggcg ctacatcaaa atgttaaata tacatactct ctgatccagc 70921 acagtcttct tgaaattaat ctcacagata tacttacaaa aatgcacaaa gacataaata 70981 attttctttg cagcagtggt ttttgctagt aaaaaattga atatgacaat ctgaatattc 71041 accaagaagt ggctagttat taatacataa attatggcat gtttatacaa tagaatacca 71101 atgtaatgga aatgtatcat tttttaagtt ttccaacatc cagtcacttc tttctatttg 71161 ggggtacttc cccattgtgc atagtctcac tgggactctg aaccttaaaa aatgacacta 71221 tgacaaagga acagttggaa tttattcctg acagtgtcca agggcagtga tggcaacata 71281 ctggcctggc tgcttcagct aaaacaggat gatgattccc tggcctgcct tggttctgat 71341 aatttggaag ccttgtctga caccccttct gtacattccc tgcttctgtt actcaaagcc 71401 aaaaggtttg taagtgatac aaccatgaag tcatttaaaa gaataaaatg tatctatgtg 71461 aacaatagca gtagaatagt ttgtataata tgatctcatt tatagatttt ttaagcaacc 71521 aatagatttt atatttatgt ttggacatgc ataatttctg gaagaattat ccaggaaatt 71581 gttaagaatg gtttacttct acgaaatgag aatggagatt taggggaaag gggagaagaa 71641 atcttactta aatattctgc accttttaaa aactttaatt attttttaaa agtttatttt 71701 aagaagaaaa acaggcagat gagatggact gctctgaagc tgagataaac tgaaatcttt 71761 aggtgcttat acatactgag ggccattcaa agccaaagcc ccctgtcatt ctttaggtga 71821 taagtgataa ggaatgacat gtgcaaagaa gggtttaggc ataacttcag cagaaggagc 71881 tttattagct gggcacagtg gcacgcattt atagtcccag ctacttggga gactgaggca 71941 ggagcatcac ttgagctgtg gaggtggagg ctgcagtgag ctgtcatcat gccactgcac 72001 tccagcctgg gcaacagagc aagaccctgt aagaaagaaa gaaagaaaat gaaaagaaag 72061 gaaggaaggg aaggaaggga aagaagggaa gaaagaaaga gaagagagag aaaaagagaa 72121 agaaagaaag agagagagga aggaaggaag gagaaagaaa aaaagaaaga atgaaaaaga 72181 aagaatgaga aagaaagaaa gaaagaaaga aagaaagaaa gaaagaaaaa agaaaaaaag 72241 aaagaaaaag aaatgtgagg ttggactgga gtgttggcag taggaatgga atagaaggga 72301 atagtccaaa agataagatg attttctctc tcttacaatt atttgttacc caaatactta 72361 ctattttaga gtgtcaagca caggacaaaa gatatctgaa ctttaccaaa atcaaccaat 72421 cgatcaatca tatagtccac atatcaagtc tcagttctcc cacccacaaa atggagagaa 72481 taaaatgcct aactcatagg attattataa gggttaaagg aagctatgta tgtaaaatat 72541 atgaagtgtt ccataaatgt taactaccag tactactaat attactatta ctaccacaat 72601 gatgaggaca atttaaaaag caacctctcc gtatattaga ataggactca agtatatcta 72661 aatagcatac ctttcagaaa cacatttaca gtatggaaag ggatggcata gagtacatat 72721 tcattgtcct ttttaaaaat attatagtca tttaaagaca aagttttatt actaatgact 72781 agtgatttta tgggggttat cttatagaaa gctagtcaac tttggagaaa tagtgaggtt 72841 cctagtccct ctgctatgca aaggctgcct cctgagttct cagacatgga aaagccttag 72901 gaatccaaaa gagcaccaag tctcatgttt gtaccatata ctccaccaca gctttttttt 72961 ttattatttt ttattttttg agatggagtc ttgctctgcc acccaggctg gagtgcagtg 73021 gcgtgatctt cactcactgc aagctccacc tgccaggttc acgccattct cctgcctcag 73081 cctcctgagc agctgggact acaggtgccc gccaccacgg ctggctaatt tttttttttt 73141 ttttttagta gagacagtgt ttcattgtgt tagccaggat ggtctctatc tcctgacctt 73201 gtgatccacc cacctcagcc tcccaaagtg ttgggattac aggcgtaagc caccgcacct 73261 ggcctttttt tttttttttt ttgtcacctg tcacctgtca cccagctgga gtgcagtggt 73321 gcgatctcgg ctcacagcaa cctctgcctc ccaggttcac gcaattcccc tgcagcaacc 73381 tcccgagtag ctgggactac aggcgctcgc caccacacca gctaattttt gtctttttag 73441 tggagacgag gttccaccac gttggccagg ctggtctcca actcctgacc ttgtcaagtg 73501 acccaccttc ctttgcctcc cgaagtgctg ggattacagg ggtgagccac cgcgcccagc 73561 ccaccacagt tcttagagac caaggaataa gggtatagtt ttttttaaaa catttactgt 73621 tcaggaaagg ctgaaaaata gtacaaagaa atgaaaagta caagtccctc tcattaagta 73681 tagattactc taaatacact gaaaatccca ataaaatcat ttatttatat acgaccttaa 73741 tagaatagat agatgcatgg cccccttttt aattttaatt aagtactcat attgcaagat 73801 gcataaaacc cccagcagtt tattttaaaa ctgcaaacaa tcctggaagg cttgaataat 73861 gagcttgaca aattggagac ttttattcca ctatcataaa aataatgagt ggggcttttt 73921 gaaaagcatg gcatattcca aaggtttaga agaaagaaaa tcttatgaag gagacttaaa 73981 cctatatatc tagtattatt tttacatacc aatttgagaa aactaataaa gccaacaaat 74041 atctatcaac cctctactct gccataaata aaaaagaaaa aaagaaatgt ttcacttgga 74101 atccaaattt agtacagatc ttcattaact tactactatt ggccaggcgc ggtggctcat 74161 gcctgtaatc ccagcacttt gagaggacaa ggggggcaga tcacctgagg tcaggagttc 74221 aagaccagcg tggccaacat ggtgaaaccc cgtctctact aaaaatacaa aaattagcca 74281 ggcatggtgg cacacgcttg taatcccagc tactcagaag gctgagacag gagaatcact 74341 tgaacccagg aggcagaggt tgcagtgagc cgagatcgcg ccactgcatt ccagcctggg 74401 tgacagaacc agactccatc tcaaaaaaca aacaaacaaa caaaaatctt actactaata 74461 ttagccaaaa gacactatta tggctaataa ttaagttttg gttgtaacat atgtgggata 74521 atgtgatata taagccaaac acactttttt taagctcaga gattcagcta aatgagaaca 74581 gaaaataaaa agattatttt aggtaataaa tgatgtagtt gttcagagta ataaagcagc 74641 aataaacttt aaattaggtc tctaggctgg gtgtggtggc tcatgcctgt aatcccagca 74701 ctttgggagg ccaaggcggg tgaatcatga gttcaggaga tcgagaccat cctggctaac 74761 agggtgaaac cctgcctcta ctaaaaatac aaaaattagc caggcatggt ggcacgcacc 74821 tgtagtccca gctacttggg aggctgaggc aggagaatcg cctgaacctg ggaggcagaa 74881 gttgccgtga gcagagatca cgccattgca ctattccagc ctgggtgaca cagcgagact 74941 ccgtctcaaa aaaataaaat aaaataaaat aaataaagta ggtctctaat attcctccca 75001 actcttgggt tttatgacct tatcacacaa gtattttctt gttaaaaaaa atgaggttcc 75061 ctagcaagtg gacaaaaaat aaatatacat atatgtggaa caataaaata atctatacac 75121 atatacacac acaaaatgag tgggagtact aagatgaaag aaatattttc agagtatgaa 75181 caggatgctt ttcagatgaa atcctcaagt atcaacaaag aactgtgggt gagcttaaat 75241 catgtgggcc taaactactc tgtaacttat ctaaagccaa caggatggta tctctgaatt 75301 tagacacaga aagatacatg gtagtttgga tatttagaag gatttagttt aaagccaatg 75361 tttacaagaa tatttataaa cttacataaa acctatagaa tttggcaagc tttggagaga 75421 caaacaaaca attctggtca gaaatgaact tcctgtaaat agatgtcagt gttttgacag 75481 tacactgtta agcactattc cttatagttt ttgtaaaagg aataaaacaa aggtagagaa 75541 tacacacatc tctcaataca ttttcatgtt tgtttacttg tttgttttca attgaaatta 75601 aaacatttct taatttctaa tactgacgat aattctatcg atagtatctt ttcttttttc 75661 tttaagaata acatcaacaa aaataataat gacaacatca acaaaaatag cacagtaaca 75721 tgttaaatga tttctggcag tctgtcaaaa atacagatat taaaacagat attaaaatag 75781 gacacttaaa tgatattaag cttcaacaaa tctgtaaaac ccaaaaccgt aacaattttt 75841 aataccctac tctcagaaaa gaaaaaatgt cacattacta tagaacaaaa gtaaatcaat 75901 aagatgatta ttttcctgta tcttatcgag ttgggggaaa tcctgtctct gtaggtgccc 75961 ccacgccccc tgcccttttc agaagacttt cttcaaactg caggtagcct ggaagtcttc 76021 ccagaaaatc tttgaaaaaa cacagtctgt tgtcttctgc atagacactt tggatcttac 76081 aaagcactta gaggtttgtc ctctgctaaa aacaagttag gaaaaccaag aatctgtttt 76141 ttgcccccaa gctggaagct cctgcccacc agcaaactac tgcctattcc agctctgtcc 76201 ccactggtac ttgccacccc ctcaccccct cacccttgtc ccagttctta gctagcaatt 76261 agcatcaagt ctggaccctg ctcaaaaacc catccttccc cttgctctga ctaccaagtt 76321 aaaagttcct actcaaaatg gggtgttggt tacataggta tatgcatttg tcatgagcat 76381 ttcactatat gttaaattta tctggattaa aaaatttttt aggccttgca cagtggctca 76441 cacttataat cctagcactt tgggaggctg aggtgggtga atcacttgag gccggagttc 76501 gagaccagcc tgaccaacat ggtgaaaccc tgtctctact aaaaatacaa aaaaaaaatt 76561 agccaggcat ggtggtgtgt gcctgtaatc ccaggtactg ggaaggctga ggcacgagaa 76621 ttgcttgagc ctgcgaggca gaggttgcag tgaactgaga ttgagatcat atcacagcac 76681 tccagcctgg gcgacagagc aagattctgt ctcaaaaaaa aaaaaatttt tttttaaatt 76741 aataaactgg tctacaatgc tctgctgtca ctgttagctt ctctgaattt attttgtgcc 76801 aaacttccct ctcactgtcc tttggtcact tttttcagct tctccaaggg caagctcttc 76861 tctcgacccc aagactcata gctcatttca gcatgtaata ctcttcagcc aaccccacat 76921 tcccctaact aacccttttc tcctagttaa tgcctactca ccccaccaga ctcagcttgg 76981 gtgccttttc ctccaaggag ccttccttcc tcccctcacc cgtgtgcaca ggaacatgtt 77041 cacatatgca ggccaaggca tctctctgtt cccttagcat ttactataaa cgagcattag 77101 agtataaggt ccatgacagc aggcactatg tttttattag tcactgtcat aaccctagca 77161 ccaagtatat cacagatgat cagtaaatgt tcatttaatg aatgttaaat gaaggcacaa 77221 agatatcagt cattctgatc tataaagctt ccttcaccat tttgctgatc taatgtgctg 77281 tctggcctct ctgttatctg aaatccatct gtgaagctac cacagctctg actctaatac 77341 tgcttgggtt tcctatctca gaagagcaaa ctttccaggt tcaacttaag gcaaccattt 77401 tcaaagagca aaatgtgttt actcttaaca tgtacacttc aggcacttgt ctaacctctc 77461 agccaacaga agtggtttgt ctagacaagg gtttatgtga gatattcact acatggcgtt 77521 ggtcatactg gcactccgag gcagcataaa ctttagcctt tcctgggatg gaattaatga 77581 cctgatacct tcaagctttc aactttggaa aatcacaaaa aagtaagtca actgattccc 77641 taaaaattct ctctcttgca aggttgttat gaggcggtgg gaagaaaaat gagaaagatg 77701 agtcagtgtg agtcagtgtt aagaaaaaag ctctgcttct ggtaaacagt ttgctctttt 77761 actgacagga ctaggattac ccgactccca cttctaaaaa tacacaaata acaggtggta 77821 tacattatat aagtggttcc aaaacacaga ctcagatcat tgactctaaa aagggaacag 77881 ttaatcagca atgagccaga ccttccccag agcctgcttc tcggagaagc ccccagacta 77941 agtaagcatt tccctaaaga caattaagat cctaaatgct tccactgccc actcaagtga 78001 cctaaaggaa tgactgaacc tcccttacaa taacaactac ttcatgagca cctactacgt 78061 gccaataact ctgctaggct tcatacaatt catctttaaa ctttacaaca atcctgtaag 78121 atggttgtta ttatccatgt tttacagata aggaacctgg cgcacagggg ttgggttaga 78181 tttatataca gttgcatagt ggcagcacca gatttgattc aaaagtgcct gcttgtctcc 78241 tttcattcca aactgaattc agaaatatat atgaattacc ctgagcaagg caatgcatca 78301 ggtactgcag gggatacaag ctagaaaagg ctactccaat tatcaagaaa gcaaaattgg 78361 aagtaaatga gactatgtag gagagcttca tctctctatt tccaagatcc caaacaaggc 78421 ccgccacaca taagttgccc aggaaattac tactgaaaca tatgtggaag aaaagggaaa 78481 cacacaatca aatgtaacac aggcagtctg tgcttagtgt cacaagacag aatgtgctgt 78541 gatgatttag attgtatcct gatggatggc tttgagaaat acgcgcaaac aacgtggcat 78601 ttgagataag tctgacaaac agactgacag acaacagaga aaagtaaaga gatcaatgga 78661 atatgtcagg ggtagtaaca gttgagacaa accctttggt acctctagca gcacagggag 78721 aaataaagct agaagaaaca gattctacat tgtgttggcc ctgagtctgc ttcaagtccc 78781 tttttatata gaaaagtgta gcaattacga atatttccta gccgattcca atacacgggg 78841 agacactctt aatggttaaa taaataaaaa tctgctgtat aaccttttgt ggaacaaagt 78901 ggtggtggta gaatacaaca gaataaatga atatttgttt acaaatctag agcatatgtg 78961 gttgtagaac taaaggagaa ttacagataa gttctaccac ctttattacc tcaactctga 79021 atcatgatga taaggattct gtgacagcac attttatttt aatttaacct atagctcctt 79081 cagcagaaga ccagtcaagt caatgacact caacaggcac agctggctag aggactttct 79141 ggaaacggag gggatgggag gcattttttg gttgacacaa cagtcaattg tctcctcatt 79201 caaatgggca tttagtaggt agagaccaag atgttaaaca tccagcaaat tgaggggctg 79261 tcttatagaa acaaaaattg tccctttact ctgcttgatt tttaaatatc aagatactca 79321 tgtaaagctc tttacagtta tctgaaccta gaactccact cctttttgtc tgtagcaaag 79381 tacttttgca tgattttaga ggcaatgtat tttccaagaa tgcatttacc aagtaaaagg 79441 aaaatgttgg gccgggcgcg gtggctcacg cctgtaatcc cagcactttg ggaggccgag 79501 gtgggcagat cacaaggtca ggagatcaag actatcctgg ctaacacggt gaaaccccgt 79561 ctctactaaa aatacaaaaa aattagccag gcgtggcggc gtgtgcctgt agtcccagct 79621 gctggggagg ctgaggcagg agagtggtgt gaacccggga ggcgaagctt gcagtgagca 79681 tcactgcact ccagcctggg cgacagagca agactccgtc tcaaaaaaaa aacaggaaaa 79741 tgttgctttg ttttgtattt aaggagtgaa ttataatgtc cataatatat tttcaaatag 79801 ttcagcaaaa aattgaaata cacacatgca gataaactat atgcagcaaa atgttaacaa 79861 ttgtttgggc taggtttttg atacaggtgc tcattgtact cttgttaatt tttttccaca 79921 tttgaaattt ttcataataa aaagctagag gggaaatctg gccctcatta agtgagaggt 79981 cataatagag gttaagacca ctgactctag agctctggga ccttaggcag gttactgtcc 80041 cgctctctgc ctcagtttct ccatctgtaa cataaggata ataatagtac ctatttcatg 80101 gagcagttgt aaggattaga tgacttaaca aaatgaaagt tgtggctggg cacagtggcc 80161 aaagctttag cacaaaaacg cctgggaaag cttcaccaga gtccttctcc accatgacaa 80221 cactcctcct gctcattcct tttatcaaac aagggcaatt tttgcgagat ttcaatggga 80281 aatcactgtg cacagaaatc attatgcatc caccttacag tcctgatttg gctctttctg 80341 acttcttttt gtttcctaat cttaagaaat ctttaaaggg aatccatttt ttttttcagt 80401 taataatgta aaaaagactg cattaatatg gctaaatttc cagaaccctc agttctttag 80461 ggacgggcta aatggccggt atcatcacta caatattgcc ttgaacttga tggagtttat 80521 attgagaaat aaagtttata tttttcattt ttatctttta attcaatttt ttccatgaat 80581 tttttgaagt ccccttttat gtatctaaca acgcttgtgt gagaattaaa tgagataata 80641 tatataaaca cacagcacaa tggtaagcac atactagatg ctcactacat atttaaatga 80701 aagaataaaa ctggtcctaa agattcaata cataaaaaac taaggtatac ctcaggtgct 80761 agataaaatt cccccaggtc cttcagaagt taaaattagc tccgaattcc atttttaaat 80821 gattaacaag gaagatcaaa gagtacattt agatctcttc ctaactccct tttcactgaa 80881 agagatataa tattatcaaa gaaaagtaaa gaattcctat aattacaaaa atccaaaact 80941 tttctgtcac tttctcaaat ggaagtttgt tttcttttga gaaaagatct catgctgtta 81001 cccaggctgg ggtgcagtgg ggcaaacaca gctcactgca gcttctacct cccgggctca 81061 agccatcttc gcacctcagc ctcccaagta gctgagacta caagcatgtg ataccatgtc 81121 agctattttt ttccattttt cgtagagatg gggttttact ttgttgccca ggctggtctt 81181 gaactcctgg gctcaagtga tcctcctgcc ttggcctccc aaagtgctgc gattataggt 81241 gtgagccaca gcgtcaagcc ttaaatggaa gtttaaatta gaagtaacac atcttctagt 81301 acaacactgg ctggatgact actggagatg atttgtgcgg gagcttttga aaagaaacat 81361 caacaaagta taacgaaatc tccagaaaaa taatccagaa aaaaagactg tgtaatagaa 81421 catttacaca gaacagtata tggggaaaaa aacgaacagt tcatatgtca ctgggaccaa 81481 atgatagctt caggatattg ctcattttga gataaaactg tatcttgtac gtgttttttt 81541 gttatctggg catagagtgg gggtttagtt tgtttgaata agtaagtcta taaaagtaca 81601 agatgctata aatcacaaat actaagaatg ccaaaagttt agtatctgaa aaagcaaaaa 81661 tcaaactttt aaaaagtcaa atccaaaagg caagtggtag ttttaaaagt tttaaaactt 81721 cctgagaatt taagtctgca gcattgactt tctgatctgg gtttatataa ctctcatacc 81781 caggggatgt gacagctgat tttcttgtcc aaacaaacac tgcagaaaat tattaaaacc 81841 atcagggata tttcatttct cttcccacag tatctaatta ttatctcaga cagatatatg 81901 tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtggt gggaggagtg 81961 gctggatgaa tgtcctacat aattagaatt ctttatagta ctaaggagaa ggcaaagtgt 82021 tttgaagtct taaaagaaaa ctgattttct aaagaaattc tggctttagg aaagtagcta 82081 agtgatgtgt ggatgacaga gctatcttta attaggagtt gagcaaattg ttataagcaa 82141 aatctagaca aataaagatg ctgtttatat ttcaaagacg gggaaacaga ctgctacaaa 82201 acttgtccaa cacaaatccc cagcaaatag ttttccttag gaaaactatt tttatccctt 82261 tatcaacttg gttgataagt atttcatttt cagtgcatat atggacaagc tacataaagt 82321 aaaattgttt tcattattta aaaaacattt tgcaaacgga aacaaagcct attgtcctgt 82381 aatgtttatt tgctaggctg acaaatgact ctgccacgaa tattatgaag tctcagaatg 82441 agaatacaca gaaggaattg gcttgactcc cgaaaaaacc tccagaaaca tggtccctcc 82501 ctccagacta gaaaaccact ctggaaaatt gtgattaatg ttaattaacc cagaacatat 82561 ttacatatat tattccactg cctgtgtggg ataagatgct accttttttg ggtttttttg 82621 tttttctaga cagtcttgct ctatcatcca ggctggagtg cagttgtgcg atctcggctc 82681 actgcaacct ccacctccca agttcaagta attctcctgc cccagcatcc tgagtagctg 82741 ggattacagg tgcacaacac cacacccggc taattttttg tatttttagt agagacgagg 82801 tttcaccatg ttggccaggc tgatctcgaa ctcctgatct ccagtgatcc gcctgcctca 82861 gcctctcaaa gtgctggaat tacaagtcat gagccaccat gcccagccta tttttaatgg 82921 agaacagggt tctccattaa aaccaacacc atgttggtta ggctggtctc aaattcctga 82981 attgaaatga tccgcccacc ttggcctccc aaactgctgg gattacagga atgagcaacc 83041 gcgccccgcc aatgctatgt ttaagatgtt attactattc ccattttttc cgacgaagaa 83101 actgcagctc agaaaggtcg gtgtccaaag tcaccaagtt agtaagtgcc acagctccca 83161 cattcagctc ccacaaacta cgcaggagtg atagtaatgg tggtgggagc acttaggctg 83221 tactcaccat gtgtgaggca ttgttcttcc cactttccta tgtcaactca agtgatcctc 83281 aagacaactc ctatgaggca ggtttgtatt tgggccacca gcctcctctt tctctaccct 83341 gctcaccaca catggcctgc ttctctcccc acaaaatcaa aatcagcatc ctgatgtcat 83401 catattctct ctacccttat cctaaatcaa aatttgctct aaaatgctgg taattgttga 83461 agcttgatta tagataatgg aggtttcttt ataccattct ctctactttt gcttatattt 83521 gaaattacct tataaacatt taacattttt tgctccaaca cggaatgtaa gctattgaaa 83581 aggctacaag actagctcca ctgacaatga attcaaagga agtgttactg aagatgtcta 83641 tgcaataata gaagacagtg acattgattc attctctcat tccatgaaca taaattgagc 83701 acccattgta tgttaggatg tacagaggtc caaggaagga ctcctgtcct caacaagcta 83761 agagtttagt agcagacaca aaataaacac aaaggtaatt atggaggagg actatttatg 83821 agaaatgtta taaaaagaaa agtttgaagt gctatgtgat ttggaaaaag aaataacatc 83881 aggacaaatt ccattacatt gaactccctg agacattcca aacttcacat tatttaaaat 83941 cagtgagggg cctgggtgca gtggctcata tctgtaatcc cagcactttg ggacgctgag 84001 acgggtggaa ttacctgagg tcaggagctc gagaccaacc tggtcaacat ggtgaaaccc 84061 cgtttctact aaaaatacaa aaattagccg ggagtggtgg tgcatgcctg tagtcccagc 84121 tacccggggg gctgaggcag gagaatcgcc tgaactggga aggtagaggc tacagtgatc 84181 caagattgta ccattgcact ccagcataag caacaaagta agactctgtc tcaaaaaaaa 84241 aagaactcaa aaaaacaaaa aatatactag agaagatgtc tctggaaaga agttcaccag 84301 ggcacaattt ataatgacca acaataaggc ctggtgatat tcatttcaat tgatacacag 84361 taacccatac ataacaactt aagaaacaaa gtagaagttc tctccttgat ggaatcagaa 84421 ctaataatgg tataactcat tttttaaaaa atacagttaa agagaggaat cattaaaaat 84481 gcatgtttat tttccaaata ctggctaagg gatgaaagcc tctcctctcc tctctcctct 84541 ccttctcttc tctgagacag ggtctctccc tgtcacccat gctggactgc aatggcatga 84601 tcatggctca ctgcagcctc aacctccctg ggctcaggtg attctcccac ctcagcctcc 84661 gagtagttgg gagtacaggc atgcaccacc acaacctgct aatttttgta ttttttgtag 84721 agacagggtt ttgctgtatt tcccaggctt gtgtctcgaa ctcctggact caagtaatcc 84781 acctgcctca gcctcccaaa gtgctgggat tatagacatg agccactgtg cccagtgaag 84841 gccttttctt gagagtaagt tcaagcaatt aaagtttctg catgaactac actcatgatg 84901 tacatctatg attttcagtt tttactggaa tgatacttgt gaagtgttct gaaggcagaa 84961 acatatatgt atatatatgt tttggagaca gggtcttact ctgtgtccca ggctggagta 85021 agtacagtgg cgtgatgata gttcactgta acttcaaact cctgggctta agtgatcctc 85081 ccacttcagc ctcccaagta gctaggacta cagggacaca ccaccacatc tggctaattt 85141 tctttttgta gagacggggt ctcactgttg cccagattgg tctcaaactc ctggcctcaa 85201 gtaatcttcc tgcctcagcc tcccagtgct aggattacag gtgagagtta ccctatctgt 85261 cagaatctcc atatttttat aatttttttc ttcaatcttt tcagaggtct cttccacccc 85321 taatttgaca aaaatatttt ttcttagcca tttgccttta tcaggtaagc acattcaact 85381 tatttttcat aaaagcagta tcttttttcc tttttccctt tacatttcat tggttaacat 85441 aatatttaat tattttgtaa tcacaataat aaaactaaac aggctgggga acaaacccag 85501 ttgattctta ttgattgtca ctgttagcaa cacattatga acaaatattt ataatgcagg 85561 ccgggcattg caggcgcagt ggctcacacc tgtaacccca gcactttggg aggccaaggt 85621 gggtggatca cctgaggtca ggagttcgag actggcctgg ccaacatggc aaaaccctgt 85681 ctctactaaa aatacaaaaa ttagctgggc atggtggggc acgcctgtaa tcctagctac 85741 tccagaagct gaggcagggg aattgcttga acccaggagg tggaggttgt agtgaaccaa 85801 gattgtgcca ccgtactcca gctcaggcca cagagcaaga ttccatctca aaaaaaaaaa 85861 aaaaaaaaaa aaaaaaagaa agaaggaagg aaagaaagaa agaaaagaaa agaaaagaaa 85921 gaggaaagtc aatgtaaaaa ctgcaatttt ctccaaatac agggtatagc tgacattgtt 85981 ggccgcctcc tgtggtgggc actgcccact tccaggcttg tatctatttc tcattcttcc 86041 acagctgcag aatccctgtt ttggggaatg gcaatgtgta cagctaagaa actgcattta 86101 ccagccttct tcacaagtaa gaatggacaa tgagatgtaa actgaacatg gtggttggga 86161 cttagggtac tttaaagggt gtcctctgcc ctcccctttc ctacttcttc ctgcctcaaa 86221 gaccgtttgg ctagagagcc agagccaaca cgtgcgacca cttggtgact gcaagctgtg 86281 tgataagcat ggtgaaacag aaagagagaa agatctaggc tgcctgatgg cactgtggaa 86341 ctgccatgcc aggaacagac tatctacttt ggacttaata ttttttgaga gaaaaataac 86401 ctgtcttctt taaagcactg ttatcagcag ccaacacaat tcctcttcga ccaatctact 86461 cagcgtccat tcacctactc atagaaaaaa aaaaattttc ctaacagaaa cacaattctg 86521 tttctgaatg gtctagagat aatttcatcc ctcttgtaaa tggcttgttc aagaaccagg 86581 cctaagccca tcagggcatc acattgccct actaacaaaa attggctcag gaatgagcat 86641 atgacccagt ccaggccaat gagattctag ggtagttctc tggaggcttc tgagaaaaat 86701 tctcactcta aggaacaatc tttttttttt ttttttttga ggcggagttt tgctcttgtt 86761 gcccaggtcg gagtgcagtg gtgtgacctc agctcactgc aacctccacc tcccaggttc 86821 aagcgattct cctgcctcag cctcccgagt agctgggact acaggcgtct accatcacac 86881 acggctaatt ttttgtattt ttagtagaga cggggtttca ccatgttggt caggctggtc 86941 tcgaattcct gacctcaggt gatccacccg cctcggcttc ccacagtgct gggattacag 87001 gtattttacg ggtgtaagcc accgcgccca gccaagacaa tctttctatt ccttaggtgg 87061 gtgccataca taaatctggg gcctaggcct actgcagcta ttttgctgct agcctgagga 87121 agaagccaat actctaggga gggcacaacc aagaaaatct tagggaaaga agcctgtgac 87181 agtagattaa gtgaaccttg aagtctgtcc aggacaagtt tggctgtatc agccaaaaca 87241 tatacacgtt gcagaggtca tctcagaagc aatggagtag gtgctgagag ttatgcaaat 87301 attttgagaa ggccaaagtc agtcaggtct tcccagtggg catgcaagac agggcatgcg 87361 gtcttcaaaa tatttctgtg ccctttctgt acgaggcctg ataaagaact tactatagag 87421 taagtttgca aatattgagt tattttagaa gtgctttttg aacaggtttt tgcaaaaaaa 87481 aaaaaagttt actgtataag tgtgcacgta tcaaaggctt tacagtaata aacataacag 87541 ctgacattta catgttaagt tctatctgcc aagactgtgc tgagcccttt atttcatgca 87601 atcctcacaa gaaacaagga actctcataa tatccacttt acaagtgagg aagctctgaa 87661 gcttagaaag gttaagtata ttgtgcagta agtagaaagt gcctgggcca agatgtgaat 87721 tcatgtccaa aggactctag ggcctcattc ctaaccacaa ctttgtactg cttccttatg 87781 aatacataaa agcacagcga ccatttctct gagagtgcac ttcttcaaaa tattctgttc 87841 tcctaacctt cctaagtata ctatacttat cacaccacaa gtcccaattt tttggtttat 87901 aaaatatagc tgctgggcac agtggctcac gcctgtaatc ccagcacttt gggagggtga 87961 ggtgggccta tcacttgagg tcaggagttc gagaccagtc tggccaacgt ggtgaaaccc 88021 tgtctctact aaaaatacaa aaattagctg ggtattgtgg cgcatgcctg tagtcccaga 88081 tactggggag gctgaggcag gagaattgct tgaacccagg aggcggaggc tgaggctgca 88141 gtgagccaag attacgccac tgcactccag cctgggagac agagctagac tccatctcga 88201 aaaaaatatt ataaaagaaa ttaaaaaaaa atatggccac tatatttaat atgtaccata 88261 tataacatgc agagatcatc aaaaaccaat gaaagttaag ccttatgcac ataagaaaac 88321 aatagaatta ttttatgctt aactgacact gaaaatcaag attaaaccat cagctttttc 88381 tttttttttt taagtttcat tctggtgatt agcagaaaca cggagtaggt aggtttgcaa 88441 aatcaggact gggagaaagg aagataactc acattcatta agccccttcg agtggccaag 88501 ctttataaca attaactctc acatgacttc atgagacatt ttacagataa gggaacctag 88561 actcagagac cttgcccaag gttaccctga tctccctgaa acccaagttc ttgttcttct 88621 catcatacaa ctctgcattc ctatgagttc tagcaatttc taaggattgg ctaacagaga 88681 acaatgaccc tgaggccaca ggaaagctcc gtaagttatt agcctccccg gaaatatcat 88741 tttcattttc agttgctttc aatcttggtg tctttggctt taataaatta caaataagaa 88801 aataagctat acttggccgg gagtggtggc ttatgcctat aatcccagca ctttgggaag 88861 ccgagggggg tggatcattt gaggtcaaga gttcaagacc agcctgacca acatggtgaa 88921 actctgtctc tactaaaaat gcaaaaatta gccgggcatg gtggcacacc tctgtagtcc 88981 cagctactcg ggaggctgag gcaggagaat cgcttgaacc tgggaggcga ggctgcagtg 89041 agccgtgatc cattgcactc cagcctgagt aacaagagca aaaccccgtc tcaaaaaaaa 89101 aaaaaaagaa cctaaagaac ctacacttaa aaatctaatg cttcaaattt tacttaaaag 89161 ctacaagaaa acaggtatta tgttttatat aagccaatta aaaataatac atatttattt 89221 tcttaggtag taaaatataa tcaataagca gcagaatgtt aagttttggt acaaatattc 89281 aaataaaaac acagtctacc agccaggcca acatacagag atcccgtatc cacaaaaaat 89341 tttaaaatta gctggatgtg gtggcacaca cctgtagtcc cagctactca agaggctgag 89401 gtgagaggat cacttgagtc ccagagtttg aagtggcaat gaactatggt tgcaccacag 89461 taactccagc ctgggtgaca gaatgagacc cagtctcaaa aacaaaacaa aacaaaaaaa 89521 cccatagtct agtcttaaaa gcacacagct gatggagaat cacttaaaaa ataaatcaaa 89581 gtctgtattt cctagtcctt cattgccgat gtcactaaac atcccactgc tcatgtccta 89641 agcctatcct gattagaaag ggcaggaata aaatgagggc tactcttgct accatcatat 89701 gcataggagg tctggctccc tccgctgaat tcccatgcct agaagccaac caaacggctg 89761 agtcagtttc tccacccact cattcctgcc tggtcctaac ttccttcccc tctccaacca 89821 actaatactg acgaggccct acattccttt aatggttcca gtgctggtta gaatcattta 89881 ctctacctca agtataacca gagaatcaaa aacatcagta cggccaggcg cagtggctca 89941 tgcctgtaat cccagtactt ggggaggccg aggcaggtgg atcacctgag gtcaagagtt 90001 caagatcagc ctggccaaca tggtgaaatc ctgtctctac taaaaatatt tttaaaaatt 90061 agccgggtat ggtggtgggc agctgtaatc ccagctactc aggaggccga ggcaggagaa 90121 tctttgaacc caggaggtag aggttgcagt gagtcgacat catgccattg cattccagcc 90181 tgggaaactg gagttttgtt tgagactcca tctcaaacaa aaacaaaaac aaaaaaaaac 90241 aaaacaaaaa catcagtaca tgtgaaaacc cctggaaggt ttttcacata actcatgtat 90301 ccagttcaat ttgaccaggg gattaatttg gtcattggga taacgtaatt aattttggtt 90361 aattggtagt cacccaagaa agcctttaaa ggaaattcta aaatgaatca atatattgca 90421 taattttata aatggatcaa tatactgtat tattttataa gtagcatgct tgaacataaa 90481 tgaatccttg actgggaaac ttgctatagc acaggttggt tgtttaaata ttggaaaaac 90541 tgggccaggc acagtggctc atgcctgtaa tcccagcact ttgggaggct gaggcaggtt 90601 gatcactgga ggtcaggagt tcaagaccag cctggccaac atggtgaaac cccatctcta 90661 ctaaaaatac gaaaattagc caggcgtggt ggcatgcgcc tgtagtccca gctactcagg 90721 aggctgaggc aggagaatag cttgaaccct ggaggtggac gttgcagtga gccaagatca 90781 caccactgca ctccagcagc ctgggcaaca gaacgagact ccatctaaga aaagtaaata 90841 aataaataaa tagatgtaat cagtcaaatg cagaatatga gaagttgtat agggcaaatg 90901 atatgacttt tttaaacaaa taaatggcaa agggggaaaa aagaagagga gaaatggcta 90961 tagattaaga agattttaaa caggggtatc cattcttttg gcttccctgg gccacattgg 91021 aagaagaaaa aaaatcacac acacaaaaaa tctcataatg ttttaagaaa gtttacaaat 91081 ttgaggctgg gcatggtggc tcatgcctgt aatcccagca ctttgggagg ccgaggcagg 91141 tggatcacct gaggtcagaa gttcaagacc agcctggtca acatggtgaa accctgttac 91201 tactaaatat acaaaaatca gccgggcgtg gtggtgggcg cctgtaatcc cagctactca 91261 ggaggctgag gcagaagaat cgcttgaaca tgggaggcaa ggttgcagtg agccgagatc 91321 acgccactgc gctccagcct gggcaacaag agcaaaactt cgttcaaaaa aaaaaaaaac 91381 aaagaaagaa agtttacaaa tttgtgttgg gcctcattca aagctgtcct gggctgcatt 91441 tggcccacag gctgcggatt ggacaagctt gatttaagag atcatcaatc aggtctaata 91501 caggcacctt gtttggatcc cagtgaacac agaccaactg taaaaagaca tttatgaaag 91561 ttgggcaggc cgggcatgat ggctcacgcc tgtagtaatc ccaacacttt gggaggctga 91621 ggtgggcggg cagatcacct gaggtcagga gttcgagacc agcccggcca aaatagtgaa 91681 accctgtctc tactaaaaat acaaaaatta gtggggtatg gtggtacgtg cctgtaatct 91741 cagcttctca ggtggctgaa gcaggataat tgcttgaacc caggaggcag aggttgcagt 91801 gagtgagtgg agatcacgcc actgcattcc agcctgggca aaagagtgag gctctgtgtc 91861 aaaaagaaag aaaaaaaaaa agctaggtac actgacacac atctgtattt ccagatactg 91921 atgaggctga gataggagga tcacttgaac ctaggagttc aagtacagca tgagcaacat 91981 agcgagactc catcttgaaa aaaacatttt tttagaaaga catttataaa attgttgagg 92041 aaactgaata ccaaatggat attatgtaat atttaggact gactactgtt tatttcctta 92101 ggtgtaataa tgaaactgtg actacagggg ggaaaggagg aggaaaaaat ccttctctct 92161 tagatgtgta tattgaagta cagtattcat ggatgaaata tgatgactag gatttgcttt 92221 atgataattt aaggtgggat ggaagtaggc agtatagata aaataaaagt gaccatatgt 92281 tggcaattgc taaagctcag tatattaggt gtcttttatt atcttttcta cttttgtatg 92341 tttaaacttt tccataataa aatgctaaaa tataagaaga aaataaaacc tggattcaaa 92401 ctccaggact tattagatat agtatctgag agggatctgt cacatcctct gatcttcaat 92461 tttctcattt gaaaatggga atgataatcc ctaccccaca gaattgttag gaggactaaa 92521 atgatatcat ggatttacat cttttggaaa tccagctctt ctaaactgca gagctattta 92581 caaatgttag cacaacagtt agctccctct gtctgtaatg ctctggaagt agacagagac 92641 tacaggtttg tgggagtgga gctaagtgac gggctgaatc catctggacc ctccaaattt 92701 tcattttgta agagttcctt aatatgacac tgtatacctt atattgtcct ttttctctca 92761 aaaggaggct gtcagaattc agtaaaatgt gaaaaatttt aaattttaaa attttaacag 92821 aattgtgaca tactctacct tgaattattt ttgtttcatt tattttccca attattcata 92881 cttctcctac taaagtcaga aagtatattt acaggatggc tgggtgcagt ggctcacacc 92941 tataatccca gcactttggg aggccaagat gggtggatca tctgagttca ggagtttgag 93001 accagcctgg ccaatatggt gaaaccctgt ctctactaaa aatgaaaaaa aaaaaaaaaa 93061 tagccggtgt ggtggtgtgt gcctgtaatc ccagctactt gggacgctga ggcaggagaa 93121 tcacttgaac ctcaaaggcg aagggtgcag tgagccaaaa tcgcaccact gcactccagc 93181 ctgagggaca gagtgagact ctgtctcaaa aaaaaaaaaa agtaagtata tttacaggag 93241 actgtgttat tgagataaaa ctgaacacag gcagggcacg gtggctcacg cctgtaatcc 93301 cagcactttg ggaggccgag gcaggcagat cacttgaggt caggagtcaa gaccagcctg 93361 gccaacatgg tgaaacccca tttctataaa aatacaaaaa tgccaggagc agtggctcac 93421 acctgtaatc acagcacttt gggaggccaa ggcgagtgga tcacctgagg ttgggagttc 93481 aagaccagcc tgaccaacat ggagaatgct gtctctacta aaaatacaaa attagccggg 93541 cgtggtggcg cgtgcctgta atcccagcta ctcaggaggc tgaggcatga gaatcgcttg 93601 aacccaggag gtggcagttg cggtgagcca agatcgtacc attgtactcc agcctgggcg 93661 ataagagtga aactccatct caaaaaaaat aataataata caaaaattag ccgggcataa 93721 tggcaagtgc ttgtaatccc agctacttgg gaggctgagg caggagaatt gcttgaacct 93781 gggaggcgga ggctgcagtg aactgagatg gtgccattgc actccagcct gcacaacaga 93841 gcgagactcc gtctcaaaaa acacacacaa aaaaacaaaa aacaaaaaaa cacacactaa 93901 aacattgttt aaaatattcc aggtgctttt agaataatag atatgtgtac ttcagtacta 93961 aaggttcatc tcaaataaaa ggaaattaac agaagttcaa gtataagcaa cacaaaggga 94021 tacctggcac tatctgcctt cagtttaaag aatggttgga aagagccagt gtcatctctc 94081 tatgtgcaaa tcagatcaag tcatttcact tcctttcaat ggtttctctc caccttctag 94141 aagtccaaac tgctaaacaa aggcttgggt ggctctccag ccttatttcc tccctcctcc 94201 cctagtcaaa gccccaccag gactgatgcc ccttcccacc agaggccagg aaagcttcat 94261 cttccactcc tctctccctt acctcttgtt ggtcaagtcc tgttgattct ccttctgaaa 94321 tgtctctcaa atctgccctc tcctttccct ttccgctgcc actgcccttt tttaccctta 94381 tttccccagc ctcctaaaga agatacttcc tcattttttt tctgattatg aaggcaattc 94441 atgctcataa aaaaggaaaa ttagggcaat gaggaacaga ataaaagttt ccccatgatc 94501 tcataaacag cttacctttc tagtacactt atgtttcact gcagtattat tcacaatagc 94561 caaaagatgg aagcaactca aagtgaccat cgatgaatgg ataaacaaag tgcggtataa 94621 acatgcaatg gaatattatt cagccttaca aaaagaatga aattctgaca catgctacac 94681 aacaaggatg aaccttgagg acactatgct agtgaaataa gcccgtcaca gaaagacaaa 94741 taatgcaaga ttccactaac acgatgtatc cagagtaacc aaacacatag aaaacaaaag 94801 tagaatgggt gctgccaagg gctggtgggg aggcggaaat ggggagctaa acctattaaa 94861 ggtatagttt ggcttttgca cttttgcaag atgaaaaagt catgcagatt ggttgcacag 94921 catgaataca cttaatacta caaaactgta aacttaaaaa tcgttaagat ggtaaatttt 94981 atcacaatta aagatttaac atttttaaaa agaactagta tttccagaca tttttattat 95041 aaacatgtat acatttaaga aaacagtaac atactgtacc gtaatctgct ttttacacct 95101 aaccacataa tgtaaacatc tttctaatat atatagctcc cacctaactg tccttttttt 95161 tgagatggag ttttgctctt gtcacccagg ctggagtgca atggtacaat ctcagctcac 95221 tgcaacctct gcctcccagg ttgaagagaa tctcctgcct cagtttccca agtagctggg 95281 actacaggcg tgcaccacta cacccagcta atttttgtat ttttagcaga gatggggttt 95341 caccatgttg accaggctag tctcaaactc ctcacctcag ataatctgcc cgccttggcc 95401 tcccaaagtg ctgggattac aggcgtaagc caccgtgcca ggccaatttt tcaattttat 95461 aaaaacatga tagtcaatat ctcatgatgt cttatgatag acacctgttt tttttttaac 95521 ctaccccttc tccagtatca atactctgat gttcccatgg gaaaactact ttaccttact 95581 catcctgtca tcccatcccc atgcccagcc aactcttact ctgtgtggct aagctgggta 95641 catgtatttt taagtttatt tctaagtatt tatttttcat gttgctattg tcagttctcc 95701 tacaattaca ttttctattt ttgtttgcat aaagctattg ctttttgtct atcttataac 95761 tgctgttcac ttagccttat tatatctaag tatttttagt ttaattctct tgaattttct 95821 cagcagacaa tcatatcaca ggtagagatt tatatcatcc aggtagtggt taagagtaaa 95881 agctctgggt tcaggtaaac ctagattcaa atactgtgtg accctgagtg agcatgttac 95941 ctcagtttct ttatccataa atttgcaatt attagagggc ctatctcaca gggttgttgt 96001 aaacatcaca ttattcatta agaacggtgc atgaggccgg gcgcagtggc tcatgcctgt 96061 aatcctagca tcttgggagg ccgatgcggg tggatcactt gaggtctgga gttcgagacc 96121 agcctggtca acatggcaaa acctcatctc tactaaaaat acaaaaatta gctgagtgtg 96181 gtggctcgca cctgtattcc cagctacttg gggggctgag gcaggtggat cgcttgagca 96241 caggaggcag aggctgcagt gagccaagat tgcaccacta cactccagca tgggcgacag 96301 agtgataccc tgtcccaaaa aaaaaaaaaa gaatggtgca tgaaaagtac ttaacacaat 96361 actagttcaa taaatgcctg ctatctgccg acaatgaaaa ttgtgacttt ccttccacac 96421 ttataccttt atttattttt gttgcattac actggtgcac tcctttcagg aataacctgg 96481 cttcccacgt ctccaaaaag taacaggcat cagactccta ttgaaggatc tgtactgtgg 96541 agctcctcca ctcccacctt ctcaggaaag ttatactatc aattctccct ttcatatcct 96601 gtgtcttctc agcctttctc tctctccact ggctctttgc caacagcatt taatctctct 96661 catgctcaaa aaaacttcac ctattcccca atcgctcctc agccttctct ctacctccct 96721 tcgcagccaa acttctctga gctgcctaca ctttgtcttt ctcacatccc actcttaact 96781 catcccactg atatattata ttgaaccttt gaaagctgct gatattatag attccgcccc 96841 aaccatttca ccaaatctgc tcaatgaggt cccaaccgac ctccatattt atttacccaa 96901 aaggacacat tttaattttt aacttgtatt actcgacatc tcagcaacat ctacaaaggc 96961 caaccaacct ctccctttgg ccttcattac tatactttgc tggtttctct cctgttcctg 97021 tttctctagc catttctttt agtgtagact aagtgcatgg ctcattcttc caagagtatg 97081 caatgcagta gccctgaata atctcagcct tttttttttt gtcaacctct ctggatggat 97141 ctctttcaat attgatcttt aagttccagt tgtagtggac tgcttttggc tccatgaatg 97201 tacttttgaa aatgtttgga acatccccac gatagctaac tcagactaca ccctccatac 97261 attttatcaa aactctacct gaatctccaa actatgctag attgatctta acatgcgccc 97321 cctttcacaa ttctcatcac aattataatc acttgtttga tgaatatttt ctccaccaat 97381 ctgtgatctc cttgagggca gggacctgct tccttactac ctacacagtg ccttgcacat 97441 agaagaagat gctcattaaa tattttttga attagccagg tgcagtggtg tgcacctgca 97501 gtcccagctg cttgggaggc tgagacagga ggatcacttg agcccaggaa tctgagacca 97561 gcctgggcaa catagtgaga cctcatctca acaataaata catacacgca cacactgtca 97621 cacacacaca tatatatgaa tcaccacaaa tgaatgcctg caaagtgaaa aacagatggg 97681 aggaatgata agaccaagag agaaacacaa aaaactgatc tttctgaaac tatacctctg 97741 atcaaagaca cacacctccg gcttaaaacc ctttgatggc ttcccactgc ccccaacaca 97801 aagcccaaac tcctccacac ttacagtctt tcaggaggag cacctgcttg tctctcaaac 97861 accatctagc cagaccccct ccaactctca taatctgctc tccagctagc ctgaaatcct 97921 tgcagattcc caagaagtct gcactctctc cccaaggctt ttctctacct actttctcta 97981 cctgctttat ctacccacag caaggtacac tccaccagcc tcatcttctg gccaacataa 98041 agacagcctt catgtctcat tttggaagtc accttctttt ggcatctctc cctgatcatc 98101 cacactaacc tcacctccag aaatccaggt taactgtttt ctctgtgcat tcacagcccc 98161 cctgtatgtc caaaatcata accggaatca ctgtgaattc ttgtttacga gtctgtctcc 98221 tccggaaaac tgtggagttc cctcaggaca cgatgtctct cgttcacttt tttaccctgg 98281 ctatcagccc gggccgtggc tcacaataaa tgctcagtga tggtcctcct tgtagtagta 98341 gtaataataa aagctcccat gtattgaggg cttacctcca cgaattttaa atataaattt 98401 accttgtaat aacgtcatca gacaggtgct accatgattc ccattttgca gacgataaaa 98461 ctgacgcaca aagaagtcaa tcctctctgg ccccagaaga gtcacacaca caaataaatt 98521 ttctgcacag ttttgagtta ttttctggcc cccaaacaag aagggatttt ttttttaaga 98581 aactttcgaa tgaggtcgac caatggctac aaatataata aacagccgaa aacggactct 98641 ccctgaacgc ggggcggggt caaagggcgt cgggggcgcc gtccccggcg cggctgaggg 98701 acaaagatcg ggccgcagcc tccctccccg ggatccccgg cggctcagcc cctcgccccc 98761 tgcgacgtgt cgacgccagg cccggagcgt tggggccgca accggccgcc cggctcctgc 98821 tcacctgcgg tctcgccgcc tctccgtgcc tgggccgccg gtccgcagcg cctccccggg 98881 gcagcctagc gcccgcagcc cgcaacccgc agcggagccc gttgccttgg cgacctgggc 98941 tgccgaactc ccgcggcact cgcgctacgg cggctcggat gggaccagga cggttcgcgt 99001 ccccttccgc agccgcggag ggggcagagg agggacgaag tgggagtcga gggctgagct 99061 gcgaaggagg gatccgggtt ggaacttggc ccggggaaga tacggaaagg gggacagtga 99121 ggccaagggg aggtcttggg ccgcgggtgc tgcttgtgcc tcagtttatc tctctgtaac 99181 ctcaaatggg gaccgtgcga ggaggagaca gggcctcagg tggtggcttg gtagactggg 99241 tttttgttat tttgtttttg tttttgtttt gagacagaat ctcactctgt cgcccaggct 99301 ggagtgcagt ggcgcaatct cggttcactg caacctctgc cccctgggtt tgagcgattc 99361 tcatgcctca gcctcccgag tagctgggat taccggaagg tgccaccacg cctggctaat 99421 ttttgtattt ttagtagaga cgaggtttcg ccatgttgac caggctggtc tcgaactccg 99481 acctcagctg atccatccgc ctgggcctac cagcatggtg ggattacagg cgtgagccac 99541 cactccctgc caagactggg aatatttaaa gacaagaaaa ctgatgtggg aagggtggca 99601 gattcctctc ttttatagtc tctgaagggg tggtgtggca aagagtgaat aaattggttt 99661 ctttgatgga gaattagatc accagattta atttcaatat gaggaataat ttatttttcc 99721 ttgtgtctag caactacact tcttggaata actctgcaga tatgctgaaa cctgagcaca 99781 aaggatatat tcattaattc atttttgtaa aagcaaaagg gttgcgacca gtctaaatat 99841 ctatcagggg ctgggcgcaa tggctcacac ctgtcatcct gaaaccacct ttgcaaaaaa 99901 tcataactga gatatcagac ctaactgacc ccatcttgct tctaacctct aaagtgtcct 99961 gttcattcct gggtataggc taaaccagtt ttttttgttt ttttgttttt tgtttttttt 100021 tttgagacgg agtttcgctc ttgttgccca ggctggagtg caatggcgcc aacttggctc 100081 accacaatgt ccgcctcctg agttcaagtg attctcctgc ctcagcctcc tgagtagctg 100141 ggattacagg catgcaccac cacgcccagc taattttttg tatttttagt agagacaggg 100201 tttctccatg ttggtcaggt gatcctcccg cctcggcctc ccaaagtgct gggattgtag 100261 gtgtgagcca ccatgccaag ccaggccaaa ctagccttgg gaaggaattt agtgtatcat 100321 ttaaacaata gccctttcca gaaagctaaa ctgttcttgt aaaacaaatg aaaggccaca 100381 ggccaccagc caccaagcca agatgagaag ggctggagtt ctaaatagta cccaccatta 100441 ttcttattct agaggtcata agattttttt gttttgtttt tttgagacag agtcttgctc 100501 tgtcacccag gctggagtgc agtggcacaa tcttggctta ttgcaacctc tgcctcacag 100561 gttcaagcaa ttatcatgcc tcagcctcca cagtagctgg gattacaggg acctgccacc 100621 acgccagcta atttttgtat ttttagtaga gacggggttt tgccatgttg gccaggctgg 100681 tcttgaactc ctgacctcaa gtgatctgcc tgcctcagct tcccaaagtg ctgggatcac 100741 agacgtaagc cactgtgcct ggcccccaat tactcttgag gtaaaatcgc tattgtgaac 100801 ctaagatcag ccttttgaga tgtcttttca ggtttttgca tttctaacaa ccagatggcc 100861 ccactggacc tgccaaccag ttctgtggcc ctcatccagg aactgactca gcagaagaaa 100921 acagctttga ctctctgcaa tttctggttt ttttgtttgt ttgttttgtt tttttgagaa 100981 ggaatctcgc tctgtcaccc aggctggagt gcagtggcgc gatcttggct cactgcaagc 101041 tccacctcct ggggtcacgc cattctcttg cctcagcctc ccgagtagct gggactacag 101101 gcacccgcca ccacgcccag ctagtttttt tgtattttta gtagagatga ggtttcaccg 101161 tgttagccag gatggtctcc atctcctgac ctcgtgatcc acccgcctcg gcctcccaaa 101221 gtgctgggat tacaggcatg agccaccgcg cccagcgact ctctgcaatt tcatccccca 101281 gccaagcaat cagcactccc aattcattgg ccccctaccc accaaattat ccttaaaaac 101341 tctgatcctt gagttttcgg gaggatgatt tgagtaataa caaaactctg ggctcccgca 101401 cagcaggctt tgtgtgaatt actttctctg tcgcaattcc cccttcttga ttaatcagct 101461 ctgtctaggc agcgtgcaag gtgaacccct tgggcagtcc agcactttgg taggctgagg 101521 caggcagatt gcttgagccc aggagttcaa gaccagcctg ggcaacatag tgagacctca 101581 cttctacaaa aaattaaaaa attagccagg tgtggtggtg tgcacctgta tgtagtccca 101641 gctactcagg aggctgaggt gggaggatcc cttgagcccg acaggcggag gttgaagtca 101701 gcagagactg tgccattgca ctccagcctg agtgacagag ggacaccctg tctcaaaaaa 101761 aaaaaaaaaa gccagccacg gtggctcacg cctgtaatcc cagcactttg ggaggccgag 101821 gcaggcggat catgaggtca ggagaccaag accgtcctgg ccaacgtggt gaaacaccat 101881 ctctactaaa aatacaaaaa ttagccagac gtggcagcgc gctcctgtag tcccagctac 101941 ttgggaggct gaggcaggag aattgcttaa acccgagagg cagaggctgc agtgagcgga 102001 gatcacgccc ctgcacccca gcctggacga cagagcgaga ctctgtctca aacgaaaaaa 102061 aaagtctatc cttagggaac tgaggaaata aattatgata catccataaa ataaaataga 102121 ttaccgtagc aagaattaag agggaccaag gtagatatag acgtagatac agtttagata 102181 cagagctgtc ttcaagctgt gtaaccagcc agggggttta cctcgcctgc tacctagaca 102241 gagctgactt atcaagacag gggagttgca acagagaaag agtaattcac acagagccgg 102301 ctgtgtggga gaccggagct ttattattgc tcaaatcagc ctccctgagc atttggggat 102361 cagagttttt tgtttgtttg tttgtttgtt ttgagacgga gtctgttgcc caggctggag 102421 tgcaatggca caatctccac tccctgcagc ctctgcctcc ccagttcaag tgattctcct 102481 gcctcagtct ccagagtagc tgggactaca aggcatgcac caccatgcct ggctaatttt 102541 ttgtattttt tttagtacag acgggggttt caccatattg gccaggctgg actagaactc 102601 ctggcctcaa gtgatccacc cgcctcagcc tcccaaggta ccaggattac aggcatgagc 102661 caccgctcca ggctggggat cagagttttt aaggttaact tggtgggtgg ggggaagcca 102721 gtgagcaagg agtgctgatt ggtcaggtag gagatgaaat catgggaagc tgaagctgtc 102781 ctcttgcact gagttcctgg gtgcgggcca caagatcaga tgagccactt tatcaatctg 102841 tgtggtgcca gctgatccat caagttcagg gtctggaaaa tatctcaagc gctgatctta 102901 gcagaagttc agggagggtc agaatcttgt agcctccagc tgcatgactc ctaagccata 102961 atttctaatc ttgtggctaa tttgttagtc ctacaaaggc agtctagtac ccaggcaaga 103021 ggaaggtttg ttttgggaag gggctgttat catctttgtt ttaaactata aactaagttc 103081 ctcccaaagt tagttcagcc tacacccagg aaggaacaag gacaggttaa aggttagaac 103141 gaagatgaag ttagatcttt ttcactgtct ccgtcataat tttgcaaagg cagtttcagc 103201 tatgtatacg gcaacagtct ccgaactagt cccgccattg cttctactat aaccctcaac 103261 cctaacccat ccctgctcca gtctttattc caaccagcag ccagaatgat ccttttgaaa 103321 ggtaaacacg gccgggcgca gtggcttatg cctgtaatcc caacactttg ggaggccaag 103381 gcacgtggat cacgaggtca agagtacaag accatcctgg ccaacatagt gaaaccatgt 103441 ctccactaaa aatacaaaaa ttagctgggc gtggtggcgt gcgcctgtag tccagctact 103501 cgggaggctg aggcaggaga attgcttgaa cctgggaggc cgaggttgca gtgagccgag 103561 atcacgccac tgcactccag cctggtgaca gagtgagact ccgtctcaaa acaataaata 103621 aaataaaatg taaacatgat tggcgcggtg gctgaaggct tttttttgaa actccagctt 103681 aaaaaaaaaa aagtaagcat gaccatgtcc ctcacctgct caaaagcttt caactgtttc 103741 cccattacat ttagaataag atgtagagtt gttgtcacca tcaacaaggc tggaagtgaa 103801 ctgccccctg aatacttgtc cagcctcatc tctcaccatt ttccctctca gctgtcacca 103861 ttggagggtg tccaggttct tggcatcttg aacaaagaat tggacacaac acacaaagca 103921 agaaggaatg aaggaattta ttgaaaatga aagtatgggc tgggcgcggt ggttcacgcc 103981 tgtaatccca gcactttggg aggccgaggc gcgcggatca cgaggtcagg agttcgagac 104041 cagcctggcc aacatggtga aaccccgtct ccactaaaaa tacaaaaagt ggctgggcat 104101 ggtggtgcac acctgtaatc ctagctactc aggaggctga ggcaggagaa ttgcttgaac 104161 ctgggaggtg gaggttgcag tgagccgagt ttgagccagt gtactccagc ccgggccaca 104221 gagcaagact ccatctcacc aattcactcc gcagtgtggg agggggccca agcataggag 104281 ctcaagggcc cctgttacag aatttttggg agtttaaata ccctctactt gggacacgcc 104341 ctatgtaaat gaaaaggatg aagtaaagtt acaaagtcat ttacttggcc tacaccctat 104401 ggagaggata tttcctgtca tagctgaagt gtgaatcagc cttttgttcc ccgactccag 104461 actctgtttt cctgccttaa agcatgcttt catctcaggg cattttctcc tgctcttcct 104521 ccaggtatct ttatggcttg cttcctcggt tccctcatgt ctgtgctcca gtgtcaagcc 104581 ctctaaaaga cctgcccaga ctactcttga aaatagtagg gccgggcgca gtggctcgtt 104641 cctgtaatcc cagcactttg ggaggcctag gtgcgcggat cacttgaggt caggtgttca 104701 agaccagcct ggctaacatg gcgaaaccct gtctctacta aaaatacaaa aattggccag 104761 gcatggtggt atgtgcctgt aattccagct actggggagg ctgaagcatg agtattgctt 104821 caacccaaga ggctgaggtt gcagcgaggt gagatcatgc cactgcactc cagcctgggt 104881 gacagagtga gaccctgtct caaaaaaaaa aaaataaata aataaaagaa aagaaaataa 104941 tggaccccca ccaactctgt attctccttt ctcttttttc acagctcttg tcagtgtctg 105001 agtgttatat tatatacgtg tgtgtttgct gcttgtcttt tttccctaca gcataaggtc 105061 ttttaagggt gggcactttg ttttgttctc tactattttc ttatcactgg caacagtgcc 105121 tgacacatag taatagcatg gtaaatattt atcgaatgaa ctaataagtt aagtgaaaaa 105181 aaagcggcgg gtggatcacc tgaggtcagg agttcaagac gagcctggcc aacatggtga 105241 aaccctgtct ctactataaa tacaaaaaaa aattagccag gcatggtggt gggcacctgt 105301 aatcctagct actcaggagg ctgaggcagg agaatcgctt gaactcagga ggcggacgtt 105361 gcagtgagcc aagattgtgc cactgcactc cagcctgggc gacagagcga gactctgtct 105421 caaaaaaaaa aaaaaaaaaa aaagtgccat aatgctggca tatgcatatt cctgtaggaa 105481 ctctgaaagg agatctggga gatgggggtg ggggaagggt gggaaggaca cttcattttc 105541 accgtgtgct cctttatgca gtatgttata accaagtaca tgtgttatct tttcaacaac 105601 atatgggggg aaataaccta ggttttttta ctcgcgtccg tgtgaagaga ccaccaaaca 105661 ggctttgtgt gagcaacagg gctgtttatt tcacctgggt gcaggcgggc tgagtccgaa 105721 aagagagtca gcgaagggag ataggggtgg ggccgtttta taggatttgg gtaggtaaag 105781 gaaagttaca gtcaaagagg gttgttctct gatggtcagg ggcgggggtc acaaggtgct 105841 cagtggggga gcttttgagc caggatgagc caggagaagg aatttcacaa ggtaatgtca 105901 tcagttaagg caggaacagg gcattttcac ttctttcgtg gtggaatgtc atcagttaag 105961 gcaggaacag gccatctgga tgtgtatgtg aaggtcacgg ggttatgatg gcttagcttg 106021 ggctcagagg cctgacaggt ttgacaatgg atgtgtaaag atgggaggag ctgccttggg 106081 tggcagtgag ccccctaccc tagaagagat gttgctaagg ggtttctgcc tgaggcggga 106141 aattggagta gatgatcaca aagatctgct gtcctcgctt tggccaagta gaagtgacac 106201 actcaaccag ttagggccag aaacactaag agatgggggg atcctacatg tgggaggaca 106261 aatgcttaca ggtaagaaaa tgaaaaactg ttcagctgac agctagacct aatgtttcta 106321 actgaacaaa aacagtgatc acagaatata aacaaagatc agacaagacc accatgtagt 106381 tggatcaaga tacagacaaa gccactctgg taccgcagat aggactcaac atcttccaat 106441 tctgactaat aaatgagtgg attttgcttc tttttttgta accgagcgaa ttatagagaa 106501 acgccacact ctgagacgaa ttcaggagtc ctttattagc cggcaactga gagatggcta 106561 gtgcttaaaa ttctctcggc cccgaagaag gggctagatt ttcttttata ctttggttta 106621 gaaaggggag ggggagccta gctgaggcaa tcttacagaa gtaaaacagg caaaaaagtt 106681 aaaaagacaa atggttacag gaaaacaaat agttccaggt gcaggggctt taaatccatc 106741 acaaggtgat agatgtgggg gctttgggta ccatcaaccg gacaaaaatg caggggctta 106801 gggtactatc aaccgggtga attcctggaa actgcagcta tcgcttgcca cagtatctta 106861 tcagttaatt gcattctttg atgtgctggg agtcagcttg cacaagttaa atccttgagg 106921 aagtgggatg ggtaaggagc ggcaagtgaa ggagccaaaa tggagtttgt ctgtctccct 106981 cagctaagag agagtcaatt caggttaaga caaggtaggg tatcacactt ttccaaggac 107041 taatctagcc ttgttttaat ccttcttcct tctggataaa aatttgtaag atacctaatc 107101 actgaattgc ctctactcct tgactgtgtc caatccaaaa ctgttctcct cctccttaaa 107161 ctctccttag atttagttac ccagcataaa cccaaatcct aaaataagtt ctccccagtc 107221 acctcttacc aagacacctt atattttctc tgatatgtgt tctccctttc tgaacaagct 107281 aaataaacct aattcttttt gactacaggt atgtttctgg tagaattcga tctgtaaaat 107341 ttatcacagt cagctctctt aagtccagac cagcaatact cagacaggaa gaagcaagaa 107401 tgttaaggac tacctggaaa gttcagtggt cctgaagcaa cttgacttga gctggtattt 107461 gtgtatgtgt ggtgaggaga gggatctggg aaaagagagg taggaattat taaatgtgag 107521 tagttctctt tcttctttca tcagcccttt tcatggcctt gggcaaactt tgagcctcat 107581 tccatttctg taaaagatga atttggaata cattttcaag gaatccagga aataacatcc 107641 ccaaatatgc cacactggta tgatgattgc ttggggaaca gcacatggag agaggggctt 107701 ttcatgaatt tcccttatca gactaaaggc tgacaatcca gaaggaattc aattatgaat 107761 ccccttccag agagttttat ctatcaggga agattaacac accacaggag aagatattga 107821 agtgtgacac cttgttgtcc ggacatcatc attgtttgcc acattccacc taaactatta 107881 tatgtgtaag taaaaaatgt tttcattttc ttctttaagt gaactctcaa atttgacatg 107941 tattttgaaa ggaatattta catcactgct taaatgggaa atcagcattg ctcggaagaa 108001 atgaaaaggt tattgctaac acacacctca aaacaaaaca atattttaaa actgcaggca 108061 tggttgcctg tttgagagat ctggaaaatg aaggctttac tctgttagaa agggagataa 108121 atactagaaa aatgttaaat accatactgg cctcaaacaa ctagcctgtt ggtttgaaat 108181 gggacgttct cactctgctg tttaatatcg cttaatgtcc ttcatgcacc cttagagcca 108241 ttcatttttc ccaaatgggg tgcatcccac attttaggat acgatttcca tcgtttctag 108301 gttaatttgg actcatcctt tagttcgcag attaaaaaat aaatccttca gagaagcctt 108361 ccaagactac tgcccagcta aattactttc cccttacaat aggaggcact aaatacacct 108421 ttattgaaag cctattcccc catactactt catcgtattc attacattgg ggcacggtgc 108481 atttatgttg tcagatctcc tattaactag agaataaatt ctgagcgtac tgtgcctctc 108541 tgtacacttc tgtaacttcc agatctttgc acaatgaata caaatgatgt aaggaatgac 108601 accttcacag ttcctgcaac cccaggatgg tttagcccca ccttccccct cagccaatga 108661 caggcgcccc cctctgccgc tccggtcgcg gacggtgggc ggggccacgc cgtgacgcat 108721 ccgtgcgtct gtggaaggct gcgtttccgg cctgagaaac cgtcatgttt ctggggagtc 108781 acctcagctg gcagttacca ccgtgttaga aagcagcctc aggaccggcc acctccatca 108841 ctggcgtcac catgggggct gtgctgggtg tcttctccct cgccagctgg gtgagttcgg 108901 ggtcccgagt cctcgccgga cctgcggttt gcggggaaag gcctcgagac cagagctgca 108961 cgggccttgg ctccagaggc tctcgggatg tcagcaggcc ctgggctaca gagggtctcc 109021 ggggccccag acgcaaatcc gcgataccct caggtccagc ttaggcggct cgccctccgg 109081 gttcctcggg gaacggaaga ggctaggccg ccattgttta caattcggat ggctccgccc 109141 tgcgaggctt cctgactagg ttcctctgga gctggagagg gaccaaggaa tgacagctcc 109201 cagcgagggg ctccggttcc tcccgaagca cctcttaagg tgccatcatt tagttctagg 109261 tggctcaagt aatattttta tcccctagca cgggatttgc caaaactaca gagtgtgtgg 109321 caaagctggg gttttagcgg aggcttgtct gcagttcagg tgcaagggaa agaagccctt 109381 gtggattgtt attcaaaatt ggaaaccagg gcctgaatat tttgggactg ctgtaagtca 109441 tttatccgag gataatcacc ccagcattcc tctcttcgag aacgattgtg acagtgaagt 109501 gaaggtttaa tcaccttgta aactgtaaac attttaaaga gaaaattata gggcctataa 109561 aagaatatgc tgagttccct catttaccca cttcagagga gcgtgattcc ttgattcctc 109621 attctccagc tgatttaatt gaccaagtgt gtagaaaaag cacactcagt ttctcctgct 109681 gtactcacaa cacacttttg acacctgatg tgtggtgtgt ggggattttc ccccaaacat 109741 caggcaaaca gtcagctctg caggggatgc caacttggtg ccctacaagt caattccgaa 109801 gctctctact tggagatagc atcaaatcac acaagtggag ggctcagtct cacaagactg 109861 tgcccacttt ctgtgccaat ggcaagcctc aggttgtttt acctgtgctt ctgaccggcc 109921 acctataaat caaagttccc gtgacctccc tccttggctt tgattaattt ggtcacaaaa 109981 ctcaaggaaa catgtttact ggtttattat aaaagtattt tattttgtgt atttatgtat 110041 ttttaatttt agattcagag ggtacatatg cagttttgtt acatgggtat attgccttat 110101 cgtgaggttt gggtttctaa tgattccgtt gcccaaacag tgaacacagt atccaacagg 110161 tagtttttca acccctatcc ccccaccgtc cttccccact cttggactct ccagtgttta 110221 ttgttgccat cttcgtgtct gtgttcccag tgtttagctc ccacttgtaa gcgagaatat 110281 gctgtatttg gttttctgtt tctgtattaa ttcacttggg atagtggcct ccagctgcat 110341 ctttgttgct gcaaaggaca tgattttctt ctttttttat ggctactatt aatatatttt 110401 taaagataca gatgaagtga tatgtaagac aaggtgtaga gggaacgggt ctggagcttc 110461 catgctctgt ctggggcagt ggcgccatct cggctctctt caacctccaa gggattctca 110521 tgtttcagcc tcccggagta gctgggatta tagtgtatgc caccacgccc agctaatttt 110581 tgtattttca ataaagacgg gatttcgcca tgttggccag gctggtctcg aactcctggg 110641 gtgtgagcca ccatgcccag ccaaaagttc ttctgaactt tgtccttctg agcttttatg 110701 caggcttcat tatgtagtca gcattcatta aatcattggc ttttggttac tgacctaacc 110761 ttcagcccct ccttggaggg tcaggggtga gactgaaaag tcccaacctc ctactatgac 110821 cctcacccat catgaagcca cctaggggct gccagtcacc agtcatctca ttagcataca 110881 aaaagacgct ggttattcca aggattttag aaattgtatg ccaggaaatg ggaaccaaac 110941 atacatatat ttgacagtat tacaccaagg attctgtttc agctctttct actatttttc 111001 tttttttttt ttggaggggg acagagtttt gctctgtcac ccaggctgga gtgcagtggc 111061 tcagtcatag ctcactgcag ccttgaactc ctggattcaa gtgatcctcc cacctcagcc 111121 ttcccagtac aggtgtgtga caccatgcct ggcttcattc cactatttct taggaacaga 111181 atgtatttta aggttgttgg tcctaaatca aaagagtgtt ccatttttaa gggctttctt 111241 gagagcagtg gttttctaat tattctgacc ttgacctaaa gaaagattgt acatagttga 111301 tctgtgatct gatgcaccca tatagagaca aagttttatg aaataatcgt ttactgtaca 111361 atttttcttt tatcttctat cacattagaa aatcctggtg agttgaaggc caggcatggt 111421 ggctcacgcc aggatggtct caatctcctg acctcgtgat ccgccggcct cccagagtgc 111481 ttggattaca ggtgtgagcc accgcgtctt gcccggcttt tactcttaag ttatatggga 111541 atctattaca gagtttgaaa aaaggagtga tgtgatctgg gcctgtgttt taacgaggtg 111601 gtttctggtt gctgtgtgga gaatagatgg tagaaaagta aaggtaaagc agtgtaggct 111661 gggaagaatc ttcctccagc cttttgggtt ctgtttctgg gggcttgtga attaaactga 111721 caaaacagat tatttagagt ttatgcatgc aagacatgta cacacaggag tgctcagtaa 111781 tagtaactca aagggggcgg ttagaacttg aggctttttt tttttttttc gtttttgaga 111841 aaacaagttc cactcttgtt gcccaggctg gagtgcaatg gtgggatctt ggctcactgc 111901 aatctccacc tcccaggttc aagcgattct cctgcctcag cctcccaagt agctgggatt 111961 acaggcatgt gccaccacgc ctggctaatt ttgtattttt agtagagatg gggtttctcc 112021 atgttggtca ggccggtctt gaactcccga cctcaggtga tctgcccacc tcagcttccc 112081 aaagtgctgg gattacaggt gtgagccact gcacccagcc aaggctttct aatttagtag 112141 gaaaaaggaa gctgggagaa aggcttctat gagaagaaca aataggtttc tttaggaaag 112201 acaagttttt ttaggagaat aaataggaaa tacgtttgtg gtagtacttg tttatgcagg 112261 tatggtgggt ctttccagct tcttcatgat cttaaaactg ccctggagag ggactttatg 112321 gcagcctcgt ttcccagaag ttgctgcttt tagtcagata agagaagctc ccagaaggct 112381 tttttttttc ctctgcatct gttgaagctc agatgcttca gcttaaaagc ttaaaataat 112441 ctttataaca attcagggct tctttttttt tttttttttt ttttgaggct aattttgcat 112501 ttttagtaga gacaggattt cactgtgttg gcctggcggg tcttgaactc ctgacctcag 112561 gtgatctgcc tgccttggct tcccaaagtg ctgggattac aggcgtgagc cactgtgccg 112621 gccaactcag agcttctgaa tgggtcgcta cagctgtgag atgggaactt ttgcgcttaa 112681 tccagatgag agatgatagt tgtggcttgg acaaaggtga tagctaagga tactgtgaaa 112741 atggtcagat tctggatata ttttaagggt agagctgatg tactgacaga ttggatgagg 112801 attttgggaa aagggaaggc attcgggatg actccaggtt ttgggcccaa gcaacaggaa 112861 gaatggagtt gtcatttact gaaatagaaa aatgacataa catatagtag gacagatttg 112921 tcccacagat atttactgag tacctgctct gtaaagatac aaaagatata acgttgtaaa 112981 gatagtctcc ccttcgagat atttacattc ttaagacaac tataggtaag taagtttatt 113041 agtttgccag ggctttataa caaagtacca cagaccgagt ggcttaaacc cagtgatttc 113101 ttatctctga attctagagg ctggaagtct gggatgaagg tgtcagcagg gttggttcct 113161 tctaagggtg gtgagagaga atctgtatca tgcctctccc ctggcttttt ttttttttgg 113221 tttgctggca gtctttgatg ttccttggct tgtagatctc tgccttcaca tggcatgtgg 113281 aattctccct gtgttgtaag cctttgtcca agtttcccct ttttataagg acacagttat 113341 attgaattag ctgctcactc tactccatat aacctcatct taacttacta catctgcagt 113401 gaccaagtaa ggatacattt tgaggaattg ggcattagga cttcaacata agaatttgtg 113461 gaggacatgc ctctgcagtg tttggcctat actgtagtta agaattggag actgagactg 113521 gctggtcaat ttcccattgc tattgcaatt acaaataata atggctacca cgatatattg 113581 agcctgataa atatgtcagg cattatacta ggttctttat attcctgaca tcatctcatt 113641 ctttgctcaa tcaccctgtt cgagtaggtg gtattatgct catttttaca gatgagaaaa 113701 ctgaactttt ttagaatgtt agcaccctgc tgcaaaccac acaaaattat cataaagtgc 113761 acacaagata ccatctataa aaaaatgaag attattcctg ggaccatatt ttaagaccag 113821 agttatccat tgtttacttc tgctaattct gaattagctt gtccttgtaa tatgacttga 113881 taatgtcacc ccctcagagc aatttcaagc agtttcctaa tcaagcataa tgactcaaat 113941 tatatcctct cttagctggg catggtggtg cttgcttgta ctcaggaggc tgaggccaga 114001 ggatcgcttg agcccaggag tttgatgctg cagtgagctg tgatcacaca actgtactcc 114061 agcttgggca acagagtgac gccttgtctt tttaaaacaa aacaaaacaa aggccgggtg 114121 cattggctca tgcatgtaat cccagccctt tgggaggcgg aggcgggcag atcacttgag 114181 gtcaggaatt cgagaccagc ctggccaaca tggtgaaacc ctgtctttac acaaaaatta 114241 gctgagcatg gtggcacgtg tctgtaatcc ctgtaatccc agctacttgg gaggctgagg 114301 tagaattgct tgatcccagg agggtgaggt tgcagtgagc cgagatggcg ttactgcact 114361 ccagcctggg agacagagtg agaacctgtc tcaaaaaaaa agaaagaaaa gaaaaaaaga 114421 aataaaatta tatcttcttt catgtagtgt ggtcactaac agattttagc agactaaaga 114481 tctaggttac tctttttttt tgagacggag tctcagtctg tcgcccaggc tggagtgcag 114541 tggcacaatc ttggctcact gcaaccttgg ccacccgggt tcaagcagtt ctcctgcctc 114601 agcctcctga gtagctggga ttacaggtgc ctgccactgc acccggctaa tttttgtatt 114661 tttagtagag atggggtttc accatcttgg ccaggctggt cttgaactcc tgaccttgtg 114721 gtccacctgc cttggcctcc caaagtgctg ggattacagg cgtgagccac cacgcctggc 114781 cctaggttac tctttaaaaa aaaggaggta aagttgacat aaaattaacc atttcaaaat 114841 gtttagtggg ccgggtgcag tatctcacgc ctgtaatcca gcactttcgg tggccagtgt 114901 gggtggattg cctgagctca ggaattcaag accagcctgg gcaacatgac aaaaccctgt 114961 ctctactaaa aaatacaaaa aaaaatcagc tgggggtggc attgcacacc tgtagtccca 115021 gctacttggg aggctaaagc acaagaattg cttgaactca ggaggtggag gttgccgtga 115081 gccgagattg tgccactgca ctccagcctg ggcaacagag tgagactgtc tcaaaaaaaa 115141 aaaaataaaa aagcgtatag tggcatttag tacattcaca atgttttgct gccatcacat 115201 ctgtctagtt cgtcgcccta aagggaaatc ctttacccat taaaccatca ctcccatttc 115261 ccttcccttc aacccctaat cttctgtctc tagattcact tattctggat attttattat 115321 ataaatgcac taatacagta tgtgaccttt catgtctggc tttttaaata actttgcatg 115381 atgttttcag ggttcacctg tgttctatca tgtatctatc tattatgtct gaataagcca 115441 ttgtatagat agaccaagtc tcactctgtt gcccaggctg gagtgcagtg gcatgatttc 115501 agctcaatgc aacctccacc tcctgggttc aagcaattct cctgcctcag cctcctgagt 115561 agttggggcc acaggcatga gccaccacgc ccggataatt tttgtatttt tagtagagat 115621 gggatttcat catgttggcc aggctggtct tgaactcctg acctcaggtg acctgcccac 115681 ctcagcctcc caaagtgctg ggattacagg tgtgagctgc cgcacccagc cagatatacc 115741 atatttttac ttaggccata tttacttatc aattaatgga cagtgggttg tttccacatt 115801 ttggtgattg taaatagtgg tgctgtgaat attcgtgtac aagcttttgt tttaatagca 115861 gtttcaaaca aaacagctat tggggtcggg tatatacttt ctgggtcata tggtcattct 115921 gtatttaact ttctgaggaa cctggttcct cttgaattga gctaaatggc ccttgctgtt 115981 attatttgtc agagggtttc catgtagtgt tgctttattt ttttatttta ttttattttt 116041 ttgagatgga gtctcgctct gtagcccagg ctggagtgca gtggcgcaat ctcagctcag 116101 tgtgagctcc gcctcctggg ttcatgccat tctcctgcct cagcctcctg agtagctggg 116161 actacaggcg tccgccacca cacccggcta attttttctg tatttttagt agagacaggg 116221 tttcaccgtg ttagccagga tggtctcgat ctcctgacct tgtgatccac ccgcctcggc 116281 ctcccaaagt gctgggatta caggcgtgag ccaccacgcc cggcatgttg ctttatgttg 116341 taatatcagc tttggtaact tggattcgca gtaagcagca agtggagtta ggtttatatt 116401 tttctcatat tttaaaacaa atgatgctct cagaatataa ttaacattta ttgaactgtt 116461 aatgcatttt aggtagtgtt gagtgctcat ttaatacagc agagtctgag ggaggaaaat 116521 tccattattg agctcatttt acaaatggtc aaatggggtt agagtacgta acttgctggg 116581 gtaaaaagcg gcagaaataa tatttactcc agagcccgta cttttaacca cttcctccca 116641 gccatgattc tctgagttaa aactataaag cctgatttgc ctcgctttct cttctgcgac 116701 atctgtgtca taaaggagta tactgaaggc ggtctcaggt gaccatggtg tgctcattcc 116761 aggttccatg cctctgcagc ggtgcctcat gtttgctgtg tagttgctgt cctaacagta 116821 agaattccac ggtgactcgc ctcatttatg ctttcattct cctcctgagc actgtcgtat 116881 cctatatcat gcagagaaaa gagatggaaa cttacttgaa gaaggtaaga agtaacagtg 116941 cccttaaggt attggttatt tttattttta tattctgcaa aattctagtt ttatttcgtc 117001 gttgttaatc aagttggttg taatttggag attttttact tatatggggg tgtggtcaaa 117061 acacagaaat atgaggacat tagtcctttt ccagaaatac agaatgaaca tctgggctcc 117121 agtcgaacaa ctgagagaaa acattaaaag ggtggctcca aaacattttc tttccttatg 117181 tgttaaaggt cgaagcatct aaactggact agttcttcac ctttggaagg actcagtcgt 117241 agctggaaaa tctcacagga aagtggcaca cagcctcttc atcctaaaag ggcaaagaag 117301 agagaccata tttttttgaa actaagactc actctgttgc ccaggctgga gtgcagtgac 117361 atgatcttgg ctcactgcaa cctccgcctc cctggttcaa gcaattctcc tgcctcagcc 117421 tcccgagtag ctgggattat aagcatgcac caccatgccc gggtaatttt ttgtattttt 117481 agtagagata gggttttcac catgctggcc agggtggtct cgaactcctg acctcgtgat 117541 ccacccacct cggtccccca aaatactggg attacagatg tgagccaccg tgcccagccg 117601 agagagcatt tttattttta aaatgcatgt tatttatact gtttggtaac aagattgtaa 117661 atgaattgga agacaaagag aaaaattcca tagtcagtct tgcagttgtt gaagtagtta 117721 attttctaaa tccaggcttt cctgcagtgg gttagagtct ctgtctaaat gtagatattc 117781 tcgtaacatt gaaattgctt ttttccagat tcctggattt tgtgaagggg gatttaaaat 117841 ccatgaggct gatataaatg cagataaaga ttgtgatgtg ctggttggtt ataaagctgt 117901 gtatcggatc agctttgcca tggccatctt tttctttgtc ttttctctgc tcatgttcaa 117961 agtaaaaaca agtaaagatc tccgagcggc agtacacaat gggtatgtta gatttatgat 118021 taactggtta cattatggtg tggctcttcc ttcccttcag cagttaatat agccattctg 118081 atgaaatcaa tggaatatgt aaatgagtgg ctcagatgca tttctgatct attatgcatg 118141 tgaaattgtt ttaatgtcat tttcagttag attgctgtaa catggagggg atagggggtc 118201 ttgtcttttt attttcttct attaaattcc tttgcttcat gttcttagat ttttattgaa 118261 cacctgttaa atagttgaag cactatgcta gatactacgt acacacacac ttttttcttt 118321 ctgttgttta aagataaaca tgtacctgat aaatttaaac agtgtacaaa agtcagtgaa 118381 tatatatgtg ctactgattt acatattcta ttgcttctga ctcctgacca ccttttacct 118441 atgtttctaa ttctctcctt tttttttttt tttttttgtt ttgtttttgg agatggagtt 118501 tcactcttgt cacccaggct ggagtgtaat ggcgtgatct cggctcactg caacctcccc 118561 ctcccgggtt caagtgattc tcctgcctca gcctccctag tagctggcat tataggcacc 118621 tgccaccatg cctggctaat ttttgtattt ttagtagaga tggggtttca ccgtgttggc 118681 caggctggtc tagaactcgt gacctcaggt ggtccgcccg cctttgcctc ctgaagtgct 118741 gggcttacag gcgtgagcca ctgtgcccag ccgtatttct aattctcaaa gactgaatgg 118801 cgttttataa acttgatgtg tattataatc ccctttttac agatgtagta actgaactgc 118861 agagaggtta ataaattgtc tgagacctgt taaccagttg gaggttggag ctaagattga 118921 aacttagatc tgtccagctc caaagcctgg ttctttccac tctgccatac aaggcagtca 118981 ttaaagatca tgaactgaaa atcaggtgcc agtgatgagc tgtgtcattg tgggcaaggt 119041 atctgtgccc tgtgttgtaa atatttgtga ttacttttca tccagacctt aacagttaat 119101 ggctgtgtgt gtacctaaca ctgttatttt agaactccca gaatgctaga aaaatgccta 119161 cctttaagtg ctaaaagtag ggagactcta aattttcagt tttctctggt aataaatgtg 119221 agtgatttat cttctaggct aatgttgtcc tttaattgtg tgaaatatta aatgaattag 119281 cctattaaat gaatagtcaa atgaagaaaa tagtccagac aatcatgttg ggaaaaaagc 119341 tattaagtta ttatttttgc atttcagaaa tgaatttttc tcagtataag ggtgaaactt 119401 agcatttcat aaatgcattt ttatatttta ataggttttg gttcttcaaa attgctgccc 119461 ttattggaat catggttggc tctttctaca tccctggggg ctatttcagc tcaggtaggg 119521 ttcaccattg agggggttaa ttttaaacta tataaactct atagatgatt gagttaaagc 119581 tacattttag gccgggtgca gtggctcaca cctgtaagct cagcactttg ggaggccaag 119641 acgggcggat cacctgaggt caggagtttg agaccaacct ggccaatatg gtgaagccct 119701 gtctctacta aaaatacaaa aattagccag gcgtggtggt gcacgcctgt gattccaggt 119761 actcagaagg ctgaggcagc aaaattgcct gaacctggga ggcagaggtt gcagtgtgcc 119821 aagatcgtgc cactgcactc cagcctgggc gacagatcaa gactctgtct caaaaaacaa 119881 aacaaaacaa aactaccttt taattactgt ttttgtagaa ctgacaagta aatgagatgg 119941 tatccgatta atatgactta gtacctgcaa ctcactatat atatatatat attttagaca 120001 gtctcactct gtctcccagg ctggagtgcc gtggcacgat ctcagctcac tgcaacctcc 120061 acctcctggg ttcaagcagt tctctgcgtc agtctcccga gtagctggga ttacaggcgc 120121 ccaccaccac gcccagctaa tttttgtatt ttaggcagag acggggtttc accatcttgg 120181 ccaggctggt cttgaactct tgacctcatg atccacctgc ctcagcctcc tgaagtgctg 120241 ggattacagg tgtgagccac cacgcccggc tgcaactcat tattaatatg tggatttagg 120301 tcaggaggca ttcactgcac ctctcagatt ttctggaacc atatgttaat cttttggcaa 120361 tatcttaata gtttgtgttc tcacttgatg gaatacaaag ctaatcagag agcctgtcaa 120421 aataaggact tacaactctt gtgctcccct atctctgtct taacctactt cctaagcatg 120481 gcatcaggtg acatctttag atttctactg tgatttaaaa gaaaagactt atagtatcat 120541 ttggaactca ggtcccaggc ttcccttggc acaagctaaa acatatttta aaaggtctga 120601 aacccagtag aaagctttta agaatgccgt tgagtttgga atcttgctgt actgctggag 120661 acagaatgtc tttaccacct ctgtcttgtt ttaattgaaa tccgttcaga atgttggatc 120721 ttctgaaatg ttactctatg gtagagaata actttgttct cttggttctt gtagtctggt 120781 ttgttgttgg catgataggg gccgccctct tcatcctcat tcagctggtg ctgctggtag 120841 attttgctca ttcttggaat gaatcatggg taaatcgaat ggaagaagga aacccaaggt 120901 tgtggtatgc tggtaggtat ctctactcac ctatgttagc catactgcta cattaagcac 120961 ggtgtacaat aaaaatcatc attatagtcc cagcagctat agtcctcagg gacagaagtc 121021 caaggtggga gaatggcttg aatccaggag ttcaagacca gtctaggcag catattgaga 121081 ctccatcttt acattagaaa aaaaaaattt gccaggcatg gtggcatata cctatagtcc 121141 tagctactca gcacgctaag gcaggagaat ctctggaatc caggagttca aggctgcagt 121201 aagctatcat catgccactg tactgcagcc tgggtgacag agcaagaccc tgtctctaaa 121261 aataataata acattaaata aaaggaaaaa aatccacaat ttccagaatt tggctctgaa 121321 ttatatgcat cactggaaga aagtagcaca gataatataa attgttcgat tttcccaatt 121381 tatgtatttt gagagtcata atttaattac atgtgtttag ataaggaaaa acagttatat 121441 ttagtgactt agttctggtt ttgataacct tgtttccatg gagaggaaac tcagatgggt 121501 tcaacatatg ggacagctgc ccacctactt taggcagata ggagtgagta gtccttatgg 121561 ccttattcag agaggggccc catacctgct atgaaaaact ttctggtggg aaaggggcac 121621 ttacaatttc tagagtgata atgggattta acagtagctt gcgaatatat tttggctatg 121681 ttagagcact tactacttca taatatacat ttgcacacat ttctgattat ttccttagga 121741 cagattccca gaaatagaag agtcaaagtg tatgaatatt ttaaaatgct ttcagtacat 121801 actattgaat tgcgcagttt tattttcatt tgatttagtt ttttaaaata tttacatgtt 121861 tttaaaaata ctatgtttac cataaagaca aatgatagaa ctttataaag actaaaacat 121921 gcaaatctcc acctatctgc aacccatttt cctttccaga tgtagccact cttgtgaacc 121981 tttatgtatt tatgcattca ccttgctttt gtatttaaat gtaaacagga ttgtactgta 122041 tgttgttcca tgacttgctc ttttcacttc ctacatgatg aatatccttc cttaccagtg 122101 agtgtaggta tgtcttactc atttaacctg cattcaagct cctggcatgg agtttactgt 122161 aagctgcatg gtattccatt ccatacaatt ccatggcatt ccatagcgga gtataccata 122221 atttatttaa cctttaccct agtgttggtc atttaaattg tttgtaatga caattttatt 122281 tttatctatt ttttttcttt ttttttgaga ttgggtctca ctctctcacc caggctggag 122341 tacagtggca caatcatggc tcactgcaac ctctgccttc ctggctcaag ctatccacct 122401 cagcctccca agtagctggg actacaggtg cacaccacca cacccacact accacgccta 122461 caccaccacg accggctaat ttttgtgttt tttgtagaga cagggttttg ccatgttgcc 122521 ggaactcctg gactcaatcg atccacctgc cttgccctcc caaagtgtta ggattgcagg 122581 cgtgagccac cacacccggc tgttattttc tttaattcat tttggatcat acaactctag 122641 aagaatttat tgagttgcaa attggtcata ctttacagta actaagtgta gaaaaatctt 122701 gtatattcag aggtgaaaac attaagtgtg ctagtaactt aatgtacatc atttaatttt 122761 agacatgcaa aattcattga gataagtaga aactagtttt cttaatctgg gctttgactt 122821 gaatacaagt tagctgagct catcattctt gcagttggtc tgagtgtact gtatgtgctg 122881 tttcatgttt tctaattttc agagctacac agctaattgt tttgtttgtt gtttgagatg 122941 gggttttgtt ctgttgtcca ggctggagtg cagtggcacc atctcggctc actgcaacct 123001 ctatgtcttg ggctcaagca atccccctgc ctcagcttcc caagtagctg gggctacagc 123061 cttatgccac catgcctggc tgatttttgt attttttgta gagacagggt ttggccatgt 123121 tgcctgggct ggtctggaac tcaggctcaa gcaatccagc caccttggcc tcccaaagtg 123181 ctgggattac aggcatgagc caccatgttc agccaacaca gctaattgtt gatttttagt 123241 gtgaatggaa aagaccatta ctttagatat taaaaaatag caaatgagcc acttttccag 123301 ttcagggaca gtcttattgt ttgtgatggg aaatggattt agatagccaa tggctgcaga 123361 ggaaaaaact gtcataaaaa gataccagtg tgtgtattct gttttcctct aacaccatcc 123421 tggcatggag cttgctgtga actgggtcat tttgagggta aaggaatcat ttaggttcct 123481 aagaaaagct cacaatacac attaattaac atataggtag cagtaaactc tagctggaat 123541 gatcaaatgt ctggtaatca acccatgaaa tcaaatcagg taattcagga agtgaattca 123601 gttgcttggt ggattttgga atttaacatc aagccaattg aatacaatct tgatacttta 123661 accttgagac ctctgtaatt ttactgtaaa ttaccccaga aaagtgacat aatgtctcct 123721 catttttata tggagtggaa gaaaattggc cttcattatt agtttattat agttgtggtc 123781 attcatattg gtttcccttt ctccagcttt actgtctttc acaagcgcct tttatatcct 123841 gtcaatcatc tgtgtcgggc tgctctatac atattacacc aaaccagatg gctgcacaga 123901 aaacaagttc ttcatcagta ttaacctgat cctttgcgtt gtggcttcta ttatatcgat 123961 ccacccaaaa attcaggtat gattgtttac tacttctttc tcctgtgaaa ggtttttaat 124021 tctaagctta aaatctaggt accctttctt atgaatttat gtctaagttt atccacttga 124081 acatttttcc atattataaa gtcttcataa ttctatttca tgcctatatc ataattaaat 124141 gaatttcact atcattaaaa aaaaacccag gtaacttgaa aatttatagt gctttttttt 124201 tttttttttt ttttttttga gatggagtct cgctctgtca accaggctgg agtgcagtgg 124261 cacaatctcg gctcactgca acctctgcct cctgggttca agcagttctc tgcctcagcc 124321 tcccgagtag ctgggatcac aggcagccac caccgggcct ggccagtttt tgtatttttg 124381 gtagagatag ggtttcacca tcttggccag gctgaactct tgacctcgtg atccacccgc 124441 ctcggcctcc caaggtgctg ggattacagg cgtgagccac cacgcctggc cgggaatgag 124501 tctttttttt tttttttttt tgagacaaag tctcactctt gtcccctagg ctggagtgcg 124561 atggtgcaat ctcagctcac tacaacctcc acctcctggg ttccagtgat tctccagcct 124621 tggccaggcg cctgccacca tgcccggcta atttttgtat ttttagttga gacggggttt 124681 caccatgttg gcaaggctgg tctcgaactc ctgacctcag gtgatccacc catctcagcc 124741 tcccaaagtg ctggttttac agttgtgagc caccgcatgt ggctgggaat gattcttatg 124801 agtcattagc caagatattt taggggaatt aaactgaaaa ataattgatt ttaatatttc 124861 aaaacaaaac tcatgctctt gagttctaaa taccatagtt taggctgggt gtggtggctt 124921 aacgcctgta atcccagcac tttgggaggc caaggcgggt ggatcacttg aggtcaggag 124981 ttcaagacca gcctgaccaa catggtgaaa cgtcatctct actaaaaata caaaattagc 125041 taggcgtggg ggcgcacacc tgtaatccca gctacttggc aggctgaggg gcaggagaat 125101 agcctgaacc ctggaggcgg aggtagcagt gagctgagat tgcgccattg cactccagcc 125161 tgggcaataa gagcgaaact ccagcttggc gtgatggctc atgcctgtaa tcccagcact 125221 ttgggaggcc aaggcgggca gatcacctga ggtcaggagt tcaagaccag cttggccaac 125281 atgacgaaac cccatctcta ctaaaagtac aaaaattagc cgggtgtggt ggcgggcact 125341 gtaatcccag ctactcagga ggctgaggca aggagaaatt tgaacccagg aggaggttgc 125401 agtgagctga gatcgcacca ctgcactcca gccggggtga caaaagcgag actccgtctc 125461 agaaaaaaaa gaaaaaaaaa ccgtttaaaa aaattcatgg agttatagat atgaaatatg 125521 aaaattctat ctagacatca cctttccttt taaaagtgag aaaacagaag tctcaaagaa 125581 gtgataggcc tctcccagct actaattagt tgcagagctg atcttagatc ttttcactac 125641 accaagggag tggaaaatca aagtccttgt gtgatgcaca ggaatttcta ggattgtgaa 125701 atgcgttgag aaccagtagt acttgcttgg tctttctggg ttcttttgcg agctaactga 125761 acagaaggtt acttgtcacc tttcaactct caagaagaat ccgaaagttc tcagttactg 125821 atagttgaat taactccaga gaagtgcact gcattttgga aagtcagccc ttggcagtcc 125881 ctgtgccact gttttccttt tgattccata ggaacaccag cctcgctccg gcctcttgca 125941 gtcctccctc atcaccctct acactatgta cctcacctgg tcagccatgt ccaatgaacc 126001 tggtaaggga atgtcaataa cacattcatt tgataaaaag aaagtaggaa atcagccttt 126061 acatagttgg tacaaaaact agagctagag caaatgtgaa acaagcaaac aaaacttaaa 126121 cgtacagtgt tcactgtagg tttgtaggta tttacctttc acattttagt ccccacatct 126181 attgatagct atttataact gacagtctaa agggtcagag tcctttagat ggtctaggct 126241 ttacctaacc taggctgagt caaactagtt aagagctgtg gtcaagggct ttggcccttt 126301 gttgtgactg aattttagtt tgatgtcaca ggctgtagaa gaaacagctg taggaaatct 126361 ccttttgtag ttattatagg attgtttctc agtaggatat tctatgggct tcttttaaag 126421 gaaaacaatg cagacccccc ttccaccaat gataacttat attttgttct taagtaacta 126481 tgtaaaaacc aaaatagaga taaacttctg tgcttaaaac cttgacactt aaagaattaa 126541 aaagatttgt caaactacaa aaccaataaa atcaataatg atgtttatga aatttgacag 126601 attctgttgt tgttttcttt ttttttccta attctgtcat tgattgtgac caaacgtgtt 126661 actactgact gctaagtttc agatttttta agtaaagcat tcccatgttt cacatcatga 126721 ctgaatgata tgtagggaac tcctcaagtt ctttcttgga acaagatgag ctgtaaccaa 126781 ggataataca aattttctca ttttcagatc gttcctgcaa tcccaacctg atgagcttta 126841 ttacacgcat aactgcacca accctggctc ctggaaattc aactgctgtg gtccctaccc 126901 ctactccacc atcaaagagt gggtctttac tggattcaga taattttatt ggactgtttg 126961 tctttgttct ctgcctcttg tattctaggt aagttaaaag gtacttagaa caagattttc 127021 atggaggttg ataataaagt gaagaagctg ttttgctttc aaaaagcagc atgagaaatt 127081 taaaaacttc actctcttta atgaaaagtt ctttacttgc ttcagttaca gagctgtgta 127141 catatatatt tttttgagac agtgtcttgc tctgtcacct aggctggagt gcagtggcag 127201 gatctcagct cactgcaatc tccacctcct gggttcaagc agttcttctg cctcaggctc 127261 ccaagtagct gagattacag gcacctgcca ctatgcctgg ctaatttttg tatttttagt 127321 agagacagtg tttcaccatg ttggctaggc tggtctcgaa ctcctgacct caggtgatcc 127381 tcctttctca gcctcccaaa gtgctgggat tacaggtgtg agccaccatg cccagcctgt 127441 agtaagtttt gaaattagga agtatgagtc ctccaaattt gttctttttc aaaattgttt 127501 tggcagttat gggtcctttg cattcccatg tgaatttcag gatcagcttg tcaatttctg 127561 caaaaaaata tgtagctggg atttttatag ggattgagtt gaatctgtat agaccaaatt 127621 ggggagtatt gtcatctcag caatattaag gcttccagtc tgtgaccatg aagatacctt 127681 actgaagtct tctttactta aaccatttat ttaggtcttt aatttctttc agtggtttat 127741 agttagttat tagtatacaa atcttaaact tattttgttt ttaagtgttt tattctaaat 127801 acaattctaa gtattttatt cttttcgatg gtattataaa tgaatgtatg ttttcttagc 127861 ttaacttttg ggttgtttat tgctagtgta tagagatatg attggtgacg aggcatggtg 127921 gctcatgcct gtatttccca cacgttagga ggctgaggtg ggaggatcgc ttgcatccag 127981 gagtttgaga ccagtcagag caagatagta agagcccgtc tctataaaaa ttagccaggt 128041 gtggtggtgc aatctgtagt cccagctact tggggaggtg ggaggaccaa ttgagcccag 128101 gaggtcaagt ctgcagtgag ctgtgattgt actactacag tccacctggg tgacagagtg 128161 agacctcatc tcaaaaagaa atacaattga tttttgtgtt tttttttttt tttttttgaa 128221 gagaagaagt gttgctctgt catccaggct ggaatgcagt ggtgtgatca tagctcacta 128281 cagcctcaag ctcctggtct caagtgatcc tcccacctca gcctcctgag tagctgaaac 128341 tacaggcaca caccatcatg cccagctaat ttttaatctt tttaggaaca gtcttgctgt 128401 gttgcactgg ccagtcttga gctcctgtcc tctggcgatc ctcccacctc agcctctcaa 128461 gtagttggga ttataggcgt gagccaccac acccagtcct gatttctata tattgatctt 128521 gtatcctgca actttggtga acttatttat tggttctaat agtttgcatg tgtatgtatt 128581 ccataggatt tcctataaac aagatgaagt catctacaga tagagatagt tttacttctt 128641 ttctagtctg gataccattt atctcttcta tttgctctgg ctagaatttt actacattgt 128701 tggatagaaa tggtgagagt aggatccttg tcttgttcct gatctcaggg ggaaagtttt 128761 tagtctttca ctgttaagta tgatgttagc tctgggtttt ttgttccttt tgtcagcctg 128821 aggaagttct actttgactg tttttatcat ggaaaggtat tggatttgtg agatgttttt 128881 tctgtgtctg ttgaaacgag catgtaattt ttgtcctgtt tttgtattta catttttatt 128941 cagtcattta acaaactttt agtgaacatc tgtagtgcta gaattacata gactaaggtc 129001 actgttaagg cattctttga atagcaagga agataggagt gatatgtgca gtggcagaag 129061 gaagcctagg aagctgtggg aagtcggagg agtgtcttgt aacctagcca gtgaggggtg 129121 atgtcaggga aggccttcca gaggacatga cagcaagtca gagaagtgaa ggatgagtaa 129181 gttagttgga aggaggttgg cacgtggtac cttccaatca tagaagcggc acttgtgaag 129241 tcacggaaac agacacagag agcatagtgt gttttatgaa actcctaagc aattaagcca 129301 tcacatcaag tagtaactga tgacagtgac gtcagtctta tgatgacagg cagcagttca 129361 gaactttaag caggatcttc agaaagcttt cagtctccaa agtaggaaca gctcagcagt 129421 aagtgtcttg tatacaggcc agtggcagca gttcattgtc atggcccctt tctgatttgt 129481 cttccttggg atcactttcc agcatccgca cttccactaa tagccaagta gacaagctga 129541 ccctgtcagg gagtgacagc gtcatccttg gtgatacaac taccagtggt gccagtgatg 129601 aagaagatgg acagcctcgg cgggctgtgg acaacgagaa agagggagtg cagtatagct 129661 actccttatt ccacctcatg ctctgcttgg cttccttgta catcatgatg accctgacca 129721 gctggtacag gtaggagaca cagacaaaac agacagaaga ccataaaacc cttggatggc 129781 ctataaaatc tcatccattg tcctgacctc agaacccttg ctcttaccca tataatgaag 129841 ttgggttaaa tactcttgac tttatcccct ctagcattta aaacccattg gaattctttg 129901 caatgcaaat cattgggagc tcttacagag ttgagttaat tgagagagaa aatcatgaca 129961 taaagcaatc tgacccatct ttgttcacta attttctgtg tatagagatg gccatgtcag 130021 agcattctgg ttcccgtaat aatgtactaa ggaaggaaag tttcagggag gctatctcaa 130081 tctagacctt tgccagtaac taggaacatt taaagaactt tgccctgata gggtctttca 130141 tgctggcaag aagaaaagtc atttgagagg cagagtcaat gagctgttta gttcttggtc 130201 ataagaacta aaaatagcag cttcctctgg tccccttcaa ccctactggg gggtagtgcc 130261 agccaagttt cactggagaa tttcttggcc ttggctgggg ggctgcagga ggctgataga 130321 agtcaggtag accaggcttt tctacctgac tctctcttgt tttcttagcc ctgatgcaaa 130381 gtttcagagc atgaccagca agtggccagc tgtgtgggtc aagatcagct ccagctgggt 130441 ctgcctcctg ctttacgtct ggacccttgt ggctccactt gtcctcacca gtcgggactt 130501 cagctgaacc tctgagtgcc aaggacacca ctggaactca caaaggtctc cttcaccgaa 130561 aacccatata ccttttaagt ttgtttcaac taaaatatta agtgaatgct ttgcaagttt 130621 gactgtatgc aggtttatat cagaaggtga gattgaataa tgcttgatgc agaatcgaaa 130681 cttctcattt atctgtatat tatgtttact tctaaggata tagcacaaag ggaacatttt 130741 ttgtttaaag tgaactacag ctgtgctgtg aagagagttc tttataaagc ctgtaggttc 130801 ttttaacttt ggtttaaaat gtaagatagg aaaatgttgg atatttgagg ccatgcttaa 130861 tatatttata ttgcagtatc ctttaaaagc aaaaaaaaaa aaatgcattt atattacagt 130921 tttcctctat gaaagtcctt acttatatga tacaagcact gtgttttgtg cttaaactct 130981 tcagcggggt agcatcaaag ttcttgggga aggatcgtat atgtgggtcc cttccctaga 131041 agaatggttg ctgatatggc tactgcttct acatcttgag ttttttaatt tacttttttt 131101 acactgtagc attgagactg cttgattcaa gtctggtgct ttgccagatg tattaatttc 131161 cataaatgct ttgtgagttt ggttaaaatg aagattcact tgggaaaaca ctgcagcttt 131221 agtctgtgtt actatcttgt tatgagtatg taaaagtaaa atgcatgtga atttatcata 131281 tttgcactat gaaggtattt ggttaaaata caaagacttt taagatttta aggccctttc 131341 ttccaacagc ttttatagtt agcagccatt ctttattttc tggatagcca ggttttatca 131401 cgcttctagt caggatgctc ctattccttc taaaaattac ggtctgacta gtgagcaaag 131461 tcttgaattt attcaaaagt cctaaatacc ttctctaggt aagacacttg gtagatgaga 131521 gacggaaggc attgtcaaga accattttca tgagaggtgg tgtgcaaaaa ggtagaataa 131581 aagagttctt tcaacaaaga tttactgtct attctgtact agaccctgta ggttttgggg 131641 tacagtgtta aacatgatag aggctctgcc gtcttggact ttaatagctt agagaagaga 131701 gcaaatgagc tgacaggtgg ttataatgtg aattagtgct gtggtttagg aattggagag 131761 aactcaaagg agaggtattt ggtgtaatgg taggctttct ggagaaaatg atatttaagc 131821 caagaactct tagaagttag ctaagagaga gatgggaaaa tgagacgaca ttgctggagt 131881 agataaaact gcatgttaaa ggcaggaaga tggggaaaaa aagttcagta aagctggaat 131941 ggggaaatgt agtcagggac tgaattttaa agggctttat caacctcagt aaagagtttg 132001 gaccttatgt tgagggtggc tgaaaacata ttcatagtgt catgaacaaa ttttatcttc 132061 agtcacttgg gctgatatat agagaatgga tttagagaga tgagaccagg tgcagtccat 132121 atgagatgtg aaatagagaa gtggaatcgt agggacgggg agaaattgac aggtgagggc 132181 tacttagcaa ttagaatttt tttttttcaa ttttaatttt tttttttgag acggagtctt 132241 gctctgtcgc ccaggctgga gtgcaatggt gcgatctccg ctcactgcag gctccgcctc 132301 ccgggttcac gccattctcc cgcctcagcc tccctagtag ctgggactac aggcacccac 132361 caccacgcct ggctaatttt ttttgtattt ttagtagaga cagggtttca ccatgttagc 132421 caggatggtc tcgatctcct gacctcgtga tccacctgcc gcggcctccc aaagttctag 132481 gattactggc atgagccacc gtgcctggcc agcaattaga attttaacac tggcagttat 132541 gaataatatg aaggagaggt agatttctga gtgattctgg tttaaccagc tgggtggatg 132601 gtggttccac gtattcaggt ggcaaacagg aaaaacatgt gttcgaagaa gaatggaggt 132661 aggtggtctc ttaagaatgg ttaagaggct tgggagtcag actgcttggg tttgcatccc 132721 agctttgccg ttttctggct atcaaacttg tcagctatta tttgttgagt acgtactatt 132781 tgatttatga ccacaggcag ctgagcctca gtgttggtgc ctagtgtaca agattgttaa 132841 agaataaagt tattttgcaa agtgtaaccc atttttagca ctgacatagc actgacagta 132901 gctgctgatc tcattatggg ctaaaataag acaatattca aaggtcagag atatctagcc 132961 agaatctgat ggaggctgga tttcagattt tgttacagaa ttagacagag gaacacagag 133021 gggacaggct cagttagggt ggaggtgtgg ggtagggaag caggacttga tataaattat 133081 tggaatcatt gtcttttaaa ccagtggttt atgtcagggt atagcgtttc aagggatttg 133141 agggtcagat ggggaaatgt agccccttat tttgccagtg tgaagcagat accctgcttt 133201 tctttacagt agcggagtca gcttaagagc tttaaaggtc ctaaacttca aaaacattac 133261 agtgccccat cctccgcctt aatgtaattc aaaatacaaa caatactaaa ctgtaaaata 133321 aatgtaacaa agtccaataa agtttttatt tttttctcat gatgataact gatattaatt 133381 tgaagtaaca aatgctcttt aaaagtcggg aggcagtgtc cctgctttgc ttggtcctac 133441 cttaagaagt agtgaagtct gcctcttagt aggaaaacca gtagtctaat caagactcaa 133501 gtgaattata catgtgaatg ctgaattata tacaagcatg agtaagacag cattctgaat 133561 aaggccagga tcatcattta aagcagggga gtattttgag tcatcagatc tcagggtggc 133621 ctttcattcc agatgagtac ctttatgttt catcactatg aaagggcttg cctagccttg 133681 gggaatgcgg gttggggccc tggctttcag atcagggaca tgctctagga actcatcaaa 133741 ggcaatagca gtggctgcca cttttacctc gttaaagccg ggagaaagct tctaggtcca 133801 tgctggactc ctttattttc taggttggtc ttgcgggtta tctggttggg aagtaaaact 133861 ttctcttggt aatcaaagaa actgcctgcc aaagagtagc catggacctg gttctcagta 133921 tagaggtttt gttctttttc tttttggcta gccaagtgag gaatggagaa ggaacaaaga 133981 aatctgtaac tggttctttt ttcttttttt agacagagtt ttgctctgtt gcccaggctg 134041 gagtgcagtg gtgccatctt ggctcactgc aacctccacc tcccatgttc aagcaattct 134101 cctgcctcag tctcctgagt agctgggact ataggtgcgc gccaccacac ctggctaatt 134161 tttgtgtttt tagtagagat gggatttcac catgttggcc aggctggtcc cgaactcctg 134221 gcctcaggtg atccacctgc ctcagcctcc caaagtgttg ggattacagg cgggagccac 134281 tgcacctggc tgtttttaca cattcaggct aaagtgagta ttcacataat gagttgtgca 134341 tttgatctgt gcagattgat aacagatagt aggcttatgc ttattaatgg aagctgctgt 134401 tcatctcttt actcctgttg tgttggtctg tccgtcctgt tacctaggca cccaactcag 134461 tgatgtccta attactgctg gatgactcaa gccacaccat gcctatgttc tcaggtgaag 134521 aagaataata gaaaccacat agatgggtag gtgtaccaag cattaaacag acagtatcca 134581 catgcctgta aggtggtgat tatttccctc cctccttcct tttacgtatc agtttgaggc 134641 ttagagaagt tcgtgacttg cctatggtta ctcagtaaga ggtgaaacag agaaacccag 134701 ggcttggctt aagagttctt ttttcaatta gacctcagtg tctcttgggc cataagaata 134761 taagagacag gatttattag agacatgtag gggagtcatc tggctcacta atgactcatc 134821 accaactcac cagatctacc atgggaacct agtttttgtc tactctcagg cacagagtgt 134881 ttgctcctct acccatcact gacaaagaca agttagtagt cacctgaagg caagtgactc 134941 atgtctttta ctaggtcagc tatccctagt ggctggcacc tactcaggaa aaagcatgaa 135001 tattaggtta aaaacgtttt ccccagaaga gtggtaccct aaagggtctt attacttacc 135061 ctttaaatgc gataatgtga aactttattg ccattgagtg tatgggtacc aaaccagtac 135121 tgtgtaagga gggggcaggg ggcatcaaga aaatactgac tcatggaaaa gggcctcata 135181 agactgctgt gagagaaaag gattttaaca gttagcttca tttttctata tgtgagaaac 135241 agactgaact taagttccac aatttcctca ggaaaggctg acctgaaggg gtggttatgc 135301 tgcaatataa gtacttaacc tgttgcatca tcaaaagtgt acctcctgtg ttagagtatt 135361 tctcaactga ctcatcaagt caccagatct accatgggga cctagttttt gtctactctg 135421 gggcacagag taccaaataa acaggggaca tagaaacagc ttagcaggcc tgggaccata 135481 gttgggcaag ccgtgctcca gttacccctc aaggagtctg ggcttacttc tggaataaaa 135541 tgttgttaca acttgggttt cagagaggtt aacaaccact gcttgtattt tcaagaagcc 135601 agttaaaaag tcaatttctg gttctttctt actctatgaa agagtcctgc cttctctctc 135661 tcttctatct ggggactcgt tatttgtctc cagccatcag gagaaaacag tgggcttgtc 135721 atacttcctg ggtttgaatc ccatccttag tatttggcta gttgtgtgtc actgggcaag 135781 ctacattttg ccaacaggaa cagtaatgca gcttcattaa gggtagctga aggttaaaat 135841 acattattta tgagtgtcct taacacatgt gtgcccagcc cacatgcgca cattgaggaa 135901 atcgctggct atcactggaa agactcaagg ccagtccaag aagtatagct gggcctcaag 135961 aatgtaaaaa tttgcatatc acctctactg ctctgtgtag ctagctgcat tctcctctcc 136021 atactgcagc agtctaattg tgtatgatct tcagaggcag ccagcgcatc tacagaatgc 136081 actgtcctat cacagtgccc ctccctccct cacctgcaga ggaggtggtc tacccgggct 136141 gtgaataatg aaccttggaa cctttatcca aactgtcatc atcacaaaag aaccccatgc 136201 acatttttca gagattttta ttgtttcaga ctgagtcatg catactaaaa ttattacata 136261 ttttcaccac tttgacttag aaaatgcact agaaaaataa actttggtca aaacaaacac 136321 tgaagtacat gaatccacca tgtatcccta tactcaaagc caaactgaat ttcagtttga 136381 agcaaggaat gtgaccagtg gctgaaacag tgccccaagc tggtcagaga attagctcac 136441 ctcccactcc atcagaggct cttggtcaga gaggtttcaa gtatttcact tgtaacaggt 136501 tcctacctga tatgccaaga agccgaggca tagctacaag aatccacagc agcagcatct 136561 cactgcctca ctaaacctgc tgcccagtga gatgaaaaga aaaacagccc catgaaacag 136621 gaatttcatc actatcatct ccatccacaa atcacattga tccttcgcat gaagcaagct 136681 cctgttgact gtgatgtttg ttactagcat tttaagcaaa ccatttctct aagtcaggag 136741 agcagaagaa tgaatttatt tatttgacac agaatattgc tttgtcaccc aggcgggagt 136801 gcagtggcat agtcttggct tactgcaacc gctgcctccc aggttcaagc gattctcctg 136861 cctcagcctc ccaagtagtt gggattacag gcacgcacca ccacacccag ctaatttttg 136921 tactttttag tagagatggg gtttcaccat gttggccagg ctggtctcaa actcctgacc 136981 tcaaatgatc cgccccccct tcagcctccc aaagttctag gattacaggg catgaactac 137041 tgcacctggc caagagcaga agaatttaga tttacaaaat taacagatcc caatacagag 137101 aggtgttacc agaactgaca tctttcctga acagtaggca tgtttctgtt cggtatagcc 137161 acacctctca tggccacagg agatacagtg tatagtatat agtatataat tggtttagct 137221 aacattttac tccctaaatt aaaggaggct tcaaaggaaa accttaatta aatgttggca 137281 actttaaaat ggtaaaactt tttaaaagta ttataatcgc cacagcaata ctctagcttt 137341 tgctacccaa atgaaaaaaa ttcccaggat tgatctgagg gtcaaatact atttaaagaa 137401 attgcccacc cagggagaat tctgaaaggc tcaatataaa ggaattcaga gaaaagcctc 137461 ttattgtctt atttcgggaa aaatggtgtt ttgtaacctg atttttcaat ttttattttt 137521 gagatggagt ttcactcgtc acccaggctg gagtgcaatg gcgcgatgtc ggctcactgc 137581 aacctccgcc acctgggttt aagtgattct cctgcttcag cttcccaagt agctgggatt 137641 acaggcaccc accaccatgc ccagccaatt ttttgtattt ttagtagaga cggggtttca 137701 ccatgttggc caggctggtc tcaaactccg gacttcaggt gatacaccca cctcagccgc 137761 tcaaagtgtt gggattacag gtgtgagcca ccgctcccgg ccttgtaacc tgacttacag 137821 acctcctaag ataagtgact ccttgtgaaa ccaggatgaa atatgaaaga tcgaaagcta 137881 cttttgcctt aatttttgca gcagcaatag aataaaaaag aaaatgcaag gaaaaaaatt 137941 aaaaaataaa gcgacctttg ccaagagcaa gttagaaatt caagacatga agccagggtg 138001 gcacttttgg ctgtttttgt agatttcatc cagtctctcc cccatgaagg cagaacaggg 138061 gaagaggcac agttttccag aattccatgt tgggatatgg gggaagaaga ggaagaaaat 138121 tcccatattc tgaaatgtat agatttggaa gcttttaggt accaattcat tcagacctaa 138181 atttactaaa taagccaatt gagggggaaa aaaaaggaaa aaaacttcta aaaagagaga 138241 gtacagtgtc tgtggcagtg aagccaacta gataccagga ataaagtttg gagtatagat 138301 ctctatcatg gaatcactca gggatgttaa gccagctcat caaaggccac gtagtaaggt 138361 aaaggtggag cccgcgtctc ctaatttact gtcccatttc cttccgacac cacactatct 138421 tgaatttagg gaaaagttaa tttccaagtg tttgtgcaat gggccagaac ttctccctga 138481 aggaacatac tcccagttct tgaggttcat gggtttgcct caccagtgcg ctccccagtg 138541 ctgagtaagg ctcactgtgc tccctggctg gtcattcccc accctgctga gacagcctga 138601 cacaaacatc ctggtgaagc cacaaggaga ccaccttgat tttcacgcag ggaggtcttg 138661 gtttcacagc ttggttcaag tttcctgagg tgtcaaaact atctgatacc tcttcggagc 138721 caaaacaaag caagcagctc ccggaggcag ataaacttag gctccaactc cttctagaag 138781 ctgagtgaaa ccactccttc cttgtccaac tggaaaaacc taagaaatag gcctctccag 138841 gtcccttctg gtctttgaga taattctcta ggagagctgt ggccatgcct gcattaaaac 138901 aaatgcgact ggccaggaat gcagtgtaga attgaaacat cagggctgag tgattcattc 138961 ctctttaccc agtttttgtt ttgttttgtt tatctgtctt ccctcttggg gaagcctcca 139021 cccagttctc ccatccaggc cagtgggttc tgaaccccct ccttccctcc ttcagccatc 139081 ccacttgccc tgcttgggcc ttttcagcag tgtgtgtcac atcatcatca gatggaacca 139141 gaagcagcca ggagtgcttt ttggcacttc cagctcagcc agtaccgtag aggatccagg 139201 gttactatcc atggtgaaaa gaactttctc tagaattggt attaatgagg ttaagtaact 139261 taatttccac caacaactta tccagacatt taaaaaggca atagagataa acagcttagt 139321 aacaacagat tggccttcaa atttcagagt attgttttct ccctaactga gctgtctgct 139381 tgactggtta gtctaataaa tgcacggaga aggtgcacac acaacacagt acttttcact 139441 atttaacacc atgatcatgt ggaatacatt agaaaggaaa taggtcaaaa accatcatgt 139501 gaaccgtcta atttgatgtt tgggctaagg ctaaaatctt cacagggagg ttctaaagtg 139561 ctgaaatgca gaacattcag gctgactggt tcacccacac cagagagaga aactcaagtg 139621 gggctgggag ggcagggggg cccagaagtg acccaggaat aacctgccct tgtgttcatt 139681 tccagagacc aaaaaaaaaa aaattttttt tttttttttt tttttttgag gcagagtctt 139741 gctctgtcgc ccaggctgga gtgcagtggt gcgatcttgg ctcactgcca cttctgcctc 139801 ctgggttcac gtgattcttc tgcctcagcc tcccgagtag ctgggactac aggagcccgc 139861 caccacgccc agctaatttt tgtattttta gtagagatgg ggtttcacca tcttggccag 139921 gcgggtctcg aactcctgac cttgtgatcc gcacaccttg gcctcccaaa gtgctgggat 139981 tacaggcatg aaccaccacg cccagctcag agaccaaaat taaagaatct caagtaatta 140041 actactgtag ccaggtttac aatatagtac cagagtctca gggatggaag gaattcatca 140101 tacaagatcc tggtcacctt aacctggcct cacaaaagcc atcttttcaa aattgaacca 140161 tcactataaa aagaaaattg gtgtcagtga gtcccaggaa tacacataat agtttcttta 140221 aaggcagggc taattggtga cacatctctc ttggaaagac acatgtacca caaacaaaaa 140281 caggatccaa ccacttgtat ctagatagtt atctttctag gcagaaaatc taacattcag 140341 taactaaacc aaatcccaga gcctcagcag ccaaaatggc cctagctgac atttcatcca 140401 gcctcatttt atagaggaga aagtagaggc ccatatcttg tccatagttg tacacctggg 140461 atccaggcca gaattcttat agctacacgg gcctgggatc cagcatttta ggttgagtaa 140521 aatcatttcc tgctaggtcc ctgattctat tgagaaccag ggctaaggct ttatggattt 140581 gggaaaactt aagtttctat tgcaaacttc aaccagtggg tcccagaggt gctttgaagt 140641 ctccggcttc cacatttgac cacaaagtgc tgcttatgtc ctgatgctta catgattcag 140701 ttaaggtgct ttccaaaccc ggtgacaaca gctggggcct tcatctccag taagtttcct 140761 gatgagcagc agaggtctat acttagggtt cctgtgaaaa acacaaactc ccagtctcac 140821 atttggtcac actgaaggtt tggccaaatt atttcataca agtagttctc actggccaaa 140881 caggtggttt ttctcctgga aaagtgggat caaagtgact ttggtcctga ttgttcttgc 140941 ttccaagatg tgtaagaacc agatcaccac cccccaacca aaaagcaaaa caaaatagat 141001 atagcctcca gattaatagt tgcagactgg gattccaaat gtttaagaat cagtggggac 141061 attttccatg tcttccatgg agtaattaaa gtaaccaaag gcttacccca aaatgggctc 141121 atgctcatat ctcaggctgt tcctccatcc tgcagtttaa ttacaaggag caagactgaa 141181 tccatggacc cttaaattct ccttgtgcct ctccaaagaa aagaaggaaa agaattaaag 141241 atggtgaccc tgggggacgg gctagtagca ggagtacagc tgtgacttca cagctcgcaa 141301 ggagtcgtca cactgtgcat ctgaggtcag gccctcgggc agcagcgtct ggcccaggat 141361 gctgtcacag gcaggaacag gttggcagaa ctctttcaca aaatcgtctt ctgaagccag 141421 cagtaccgcg ttccaggtgg cagtgtccag ctccccagcc gtgcccccat actccttggg 141481 gaggatgctt cttggaaggt ttgtgtggag agagttcaag tcagacccat ggaggaagaa 141541 ctaaaaggga atgaaatgaa aacacacatg tccttaggtt aatcaggaat gcagaaatct 141601 cctccatggt aggtgctcaa ataaatatct gttgaatgaa gaaaggaaat gttaatggca 141661 actctaccct tactttttca gcccagaaag cctggagtat ccttaattct cttttcctta 141721 taccatacat cctgttggct ggagcttcag aatatatcca ggatccagcc gcctcaccac 141781 cacactattc aattaatggc agcagccaca tcgcttctcc ctggattcct ccagtggcct 141841 cctatctggt ctccctgctc ccactcttgt ccctctgcaa tctaagttct tttcagtgaa 141901 actcagaatc acagtcgtgc ctctatgtac agccttccag cactcccacc tcagactctg 141961 aggcccatca taatctagcc tcattacttc tctgagtgct cctctcacca cctgcccctc 142021 cagtcttgac ccagctgcgc cagcctagct gctctcccag catgccggac aggctcccac 142081 gtcagggttg tgtccttgct ggcctcccag atacctagat ggctcactct ctcacttctt 142141 gtctttgctc aaatgtcagc ttctcaaggc tgccatctct aaccactcag ttgaatactg 142201 caacccttcc cacctcacac acgctatact tttctctgct ttatttttct ctgtagcact 142261 taccacattc taatactaga taattcactg tggctccctt tagaatataa ttttcctagg 142321 ggcaggggtt ttggtctgtt ttgttcatct agaacatgct tgtacataga gttgatcaat 142381 agatatttct gaataaagga acctatgaag gaacaaacga agaacattgt gggccccaaa 142441 tcagcgcatt tcacagatgt aaaagacctg tccaaaggct gctttggtga aagttccaaa 142501 cagaatcttt agggacccac atgggttcac atacagtcag ggaagccact aagttagtgg 142561 tttgcaaatc tagctgtgca tcagaatttc ctaaggggat tttaaagcac agatccctga 142621 gctgatgaat caaaatcttc agggaaagag cctaggaatc tggagctgga tttcaagttc 142681 cctcccttac ctccttgaga ttttccatct atatatgtta ctttgaaaaa gcaagggccc 142741 agtgcggtgg ctcacgcctg taatcccagc actttcggag gccaaggcgg gtggatcacc 142801 tgaggtcagg agtttgagac cagcctggcc aacatggtaa aagcctatct ctactaaaaa 142861 tacaaaaatt agctgtgcgt ggtggcacgt gcctgtaatc ccagctactg gggaggctga 142921 ggcaggagaa ctgcttgaac ccgggaggta ggggttgcag tgagctgaga tcgcaccact 142981 gcactccagc ctgggtgacg gagcgagact ccgtctcaaa aaaagaaaaa gcaagtaaaa 143041 ttcagggccc agtctttgca gtgaccttcc taggcctagg gaaggagaaa aataccaccc 143101 atcccatttc aatcagtggg actgggaaag gctgaagagg gaattggctc ttctgggtca 143161 gaaaggaaga caaagtgcta tttatccctt cttaaataag aggtaagcta aaaactcaca 143221 tagctcgtaa accttgaatt gctccacaaa ctgtacagtg ggacaacata ttaggcaaat 143281 ggaaaggctt aagtagcatt tttaaacccc acagttctaa tgaccataac ttaaaaaaaa 143341 tttattttag agacagtgtt ctgctctgtt acccaggctg gaacacagtg gcaagaccat 143401 agcccattgt tcaagcccat ggcagccttg aattcctgag ctcaagcagt cctcccacct 143461 cagcctcctg agtagtggag actacagaca tgtgccacca cacccaccta attttttttt 143521 ttttttgaga cagtctcact gttgcccaag ctggagtgca gcggcatgat cttggctcac 143581 tgcaacctcc gcctcccgga ttcaagtgat tctcctccct cagcctccca agtagctgca 143641 attacaggca tgtgccacca tgcctggcta attttttgta tttttagtag agtcggggtt 143701 tcaccatgtt ggccaggctg gtctcgaact cctgacctca agtgatccat ccacctcggc 143761 ctcccaaagt gctgggatta caggcgtgag ccactgtgcc ccaggcacac ccacctaatt 143821 taaaaaattt ttttagagat gagtcttgct atactgccca gggaggctct tatgataaaa 143881 atcctggttc tgctctctct actggcaaat attaaattct tgttcattta ctcttccatc 143941 ttctcctcac aaaagggact gttcatttat tttatctgta actctgtact tttccaaatt 144001 agttgttcca gggcaaaaat aataaacaag taccagatgt cacaaataaa tcaacagagg 144061 aaaagaaaaa tctgaagtca ataggatcat cacttactct gtttgctatt ttctccttta 144121 gaaatggttt tatgatggca aaaatgcctt taaatattcg aggttcattc accacatgga 144181 ctgcttttat ccgaatgggg aaaccatcct gtgtgggaaa ggcaaagggg tttagaacat 144241 ctggagaatc tttccacacc ttcctccccc aagttaaatc ttcactttca gccccaacat 144301 cagtctgatc tgatccttat ggcctcctca atgacagtcc acgtggggcc taactaggaa 144361 gacggacctc agggaaagtc tttcctggga cccaaaactg ggttagaaga gaggcaaaca 144421 gaggctgggt gtagtggctc atgcctgtaa tcccagcagt ttgggaggcc gaggtgggcg 144481 gatcacttga ggtcaggagt tcgagaccag cctggccaac atggtgaaat cccatctcca 144541 ctaaaaatac aaaaattagc tgggcatggt ggcacgtgtc tataatccca gctacttgga 144601 gaggctgagg ctggagaact gcttgaaccc tggaggtgga ggttgcagtg agccaaaatc 144661 gtgtcactgt actccagcct ggatgacagc ctggactcca tctcaaaaaa aaaagaagag 144721 aggcaaacat atgaagagct gctgctctta cagcagaaaa cagtctgagt ctacagggac 144781 agatctggca aataccagac cttccagatc agtgttgccc aaagtatatt tctcaggcta 144841 ttagaggtta tataaaacag ggctccacag caggcaaatg agtttgagaa acgtagtcca 144901 ctaagcataa tcaagtgagt ttccttaact gcattcctgt tcagagcctt taacatgtaa 144961 attcttacag tgaaattcac aagaaaggga catgctatac agcatttccc aaacttactt 145021 aattttttct cctctccaga tgatcttacg gtactaacac tgatccacac tcttgatcct 145081 gacacttgaa tctacaaggt attcaggagg aggcatcaaa aggagtgcca gcgaaggaaa 145141 gccagggttg gctcaaagcc agaatccaaa tgtttgataa tgtccaaatg tgcactgatt 145201 tatttaaata ataaacctgg tgcaccacag gggctgaaac atttgtggaa tcagtgagaa 145261 gagctactat accaccggtg ccccatagtt tacaagggaa aacaccccac gtggtgggtc 145321 agaatgaaca agtaatcaaa acctgtcaca gctgtgacag gctatatttt ctctaggtct 145381 ccatcttccc atctatcaaa aaggaaaaat agtttttgcc ttgcctatct cagaggattc 145441 gctgagggac tgaataggcc ctgatgactg taaaagcact gtgcaagctg ttaaacacgg 145501 tgcagatatg gagggagcac cacagtcccc agtgactcat gttctgaaac acagaacatg 145561 aggtgtgaag catctgacca aaatacagcc caacagaaca ccatatccct caggtccaca 145621 gagggcagga ggggctccag caccttacgc agaagcataa tttcttaggt tttttgtttg 145681 tttgttttct gagacagagt tttgctgttc tcacccaggc tggagtgcaa tggcgcgatc 145741 tcagctcact gcaacctccg cctcctgggt tcaaatgatt ctcctgcctc agcctcccaa 145801 gtagctggga ttacaggtgc ctgccaccat gcccagctaa ttttcgtatt tttggtagag 145861 atggggtttt accatgttgg ccaggctggt ctcaaactcc tgacctcaag tgatccacct 145921 gcctcggcct cccaaagaat tggattacag gcgtgagcca ccgcgcccag ccagttttgt 145981 ttgttatgag aattcttcct ctgccttcca gaccaagaaa aggcaacaag caaattctct 146041 agttttttgg actcttgtct tactgaagta cctactccag agagtacttg agagactgat 146101 gagaacaagt ggtaaatagg acttttaaat ttcttcccat aatgcttttg tgtttttcag 146161 attgcaggtg tgggatgtgc agtttgtgta tcatgtgaga cccacagtaa atggacaggc 146221 cctggggaaa gccagagcca ccacgaaaca ggcaggacac atacgggtct tacctggagg 146281 atgccaatca cctttttggc tataaaaggg ccaaagtgag atgcttttga taaactcact 146341 cctttgtagt ctgcaagaat tacaattcca ttcacctggg tttcttcaga ctgaatgagt 146401 ttttctaagg tcaagtatat ggctcggatg ttttcagtaa ttggatagtt gcttggtatc 146461 catctgtcta aggtcataag agattgacag tatgttacag aagttaatat aaataagtgg 146521 tcaaacaaag atttatctcc tttctatcac tgttgtttca aaatggaaaa gaaagatatt 146581 ctagtagcca tgactcttat ttaggttagt aaatggaaca ataagcagca gctggtttct 146641 taactccaaa cagcagcaca ttttgtgatg gaaatacaat tgagtgttga ggttaagagc 146701 atgagctctg gctgggcaca gtggctcatg cctgtaatcc cagtactttg gaaggccgag 146761 gtgggaggaa ttaagcataa gcccaggaat tcaagaccag cccaggcaac atggcgaaac 146821 cccatgtcta caaaatatac aaacactagc caggtatggt ggggtgtggc tgtagtctca 146881 ctacttggga gactgaggtg ggaggatctc ttgagctcag gaggttgagg ctgcagtgat 146941 catgccactg tactccagcc tgggtgacag ggtgagaccc tgtctcaaaa atataaaaaa 147001 aagtatgcac tctgaagtca gacacacctg aggtttgaat gttactagct atgtgacctt 147061 gggcaagtta ctataatacc tcactctgag ccccgatgtc ctcataagta gaatagggtg 147121 ataatgccta ccttcatgga catctgtgct ttctgccatc atgtatttcc ccttatcttc 147181 atagcactaa cctgattttc cttagggaac cacctcctcc cactctagcc atgcagtttg 147241 gatagaactg acttgactcc tgacttcaga ggtgagcaag tgactgggga ttgattggcc 147301 aatctgacag cacatccttc tggccacagt gtgtgatcca agccaggtca agcagagtca 147361 aggagactca attctaggac ccctggccct ttggaaagag ttctgtcttc cctcttgtag 147421 ttttgaggcg gtgagtctga gattcacggg ccccatgagg caagagagct ggcctgggag 147481 tgaagcccct gttgaggaca gcagtggtgg ggacagcttt ccagtgccag agtttgggcc 147541 cctgaatgga aatacggcta gacctcagtt atctgagcaa ctgaggtttt tttgtttgtt 147601 tcagctaacg tgagttggct gtcacttgcc acacagactg agttttggcc agatgtggtg 147661 gctcacgcct ggaatcccaa cactttggga ggccaaggtg ggtggatc // LOCUS HS17BHDEH 2593 bp RNA PRI 01-NOV-1995 DEFINITION H.sapiens mRNA for 17-beta-hydroxysteroid dehydrogenase. ACCESSION X87176 NID g1050516 KEYWORDS 17-beta-hydroxysteroid dehydrogenase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2593) AUTHORS De Launoit,Y.P. TITLE Direct Submission JOURNAL Submitted (11-MAY-1995) to the EMBL/GenBank/DDBJ databases. Y.P. de Launoit, CNRS URA 1160 - Unite d Oncologie Mol., Institut Pasteur de Lille, 1 rue Calmette, F- 59019 - Lille Cedex, FRANCE REFERENCE 2 (bases 1 to 2593) AUTHORS Adamski,J., Normand,T., Leenders,F., Monte,D., Begue,A., Stehelin,D., Jungblut,P.W. and de Launoit,Y. TITLE Molecular cloning of a novel widely expressed human 80 kDa 17 beta-hydroxysteroid dehydrogenase IV JOURNAL Biochem. J. 311 (Pt 2), 437-443 (1995) MEDLINE 96033037 FEATURES Location/Qualifiers source 1..2593 /organism="Homo sapiens" /tissue_type="liver" CDS 49..2259 /note="pid:e158977" /codon_start=1 /product="17beta-hydroxysteroid dehydrogenase" /db_xref="PID:g1050517" /translation="MGSPLRFDGRVVLVTGAGAGLGRAYALAFAERGALVVVNDLGGD FKGVGKGSLAADKVVEEIRRRGGKAVANYDSVEEGEKVVKTALDAFGRIDVVVNNAGI LRDRSFARISDEDWDIIHRVHLRGSFQVTRAAWEHMKKQKYGRIIMTSSASGIYGNFG QANYSAAKLGLLGLANSLAIEGRKSNIHCNTIAPNAGSRMTQTVMPEDLVEALKPEYV APLVLWLCHESCEENGGLFEVGAGWIGKLRWERTLGAIVRQKNHPMTPEAVKANWKKI CDFENASKPQSIQESTGSIIEVLSKIDSEGGVSANHTSRATSTATSGFAGAIGQKLPP FSYAYTELEAIMYALGVGASIKDPKDLKFIYEGSSDFSCLPTFGVIIGQKSMMGGGLA EIPGLSINFAKVLHGEQYLELYKPLPRAGKLKCEAVVADVLDKGSGVVIIMDVYSYSE KELICHNQFSLFLVGSGGFGGKRTSDKVKVAVAIPNRPPDAVLTDTTSLNQAALYRLS GDWNPLHIDPNFASLAGFDKPILHGLCTFGFSARRVLQQFADNDVSRFKAIKARFAKP VYPGQTLQTEMWKEGNRIHFQTKVQETGDIVISNAYVDLAPTSGTSAKTPSEGGKLQS TFVFEEIGRRLKDIGPEVVKKVNAVFEWHITKGGNIGAKWTIDLKSGSGKVYQGPAKG AADTTIILSDEDFMEVVLGKLDPQKAFFSGRLKARGNIMLSQKLQMILKDYAKL" BASE COUNT 757 a 471 c 643 g 722 t ORIGIN 1 ggccagcgcg tctgcttgtt cgtgtgtgtg tcgttgcagg ccttattcat gggctcaccg 61 ctgaggttcg acgggcgggt ggtactggtc accggcgcgg gggcaggatt gggccgagcc 121 tatgccctgg cttttgcaga aagaggagcg ttagttgttg tgaatgattt gggaggggac 181 ttcaaaggag ttggtaaagg ctccttagct gctgataagg ttgttgaaga aataagaagg 241 agaggtggaa aagcagtggc caactatgat tcagtggaag aaggagagaa ggttgtgaag 301 acagccctgg atgcttttgg aagaatagat gttgtggtca acaatgctgg aattctgagg 361 gatcgttcct ttgctaggat aagtgatgaa gactgggata taatccacag agttcatttg 421 cggggttcat tccaagtgac acgggcagca tgggaacaca tgaagaaaca gaagtatgga 481 aggattatta tgacttcatc agcttcagga atatatggca actttggcca ggccaattat 541 agtgctgcaa agttgggtct tctgggcctt gcaaattctc ttgcaattga aggcaggaaa 601 agcaacattc attgtaacac cattgctcct aatgcgggat cacggatgac tcagacagtt 661 atgcctgaag atcttgtgga agccctgaag ccagagtatg tggcacctct tgtcctttgg 721 ctttgtcacg agagttgtga ggagaatggt ggcttgtttg aggttggagc aggatggatt 781 ggaaaattac gctgggagcg gactcttgga gctattgtaa gacaaaagaa tcacccaatg 841 actcctgagg cagtcaaggc taactggaag aagatctgtg actttgagaa tgccagcaag 901 cctcagagta tccaagaatc aactggcagt ataattgaag ttctgagtaa aatagattca 961 gaaggaggag tttcagcaaa tcatactagt cgtgcaacgt ctacagcaac atcaggattt 1021 gctggagcta ttggccagaa actccctcca ttttcttatg cttatacgga actggaagct 1081 attatgtatg cccttggagt gggagcgtca atcaaggatc caaaagattt gaaatttatt 1141 tatgaaggaa gttctgattt ctcctgtttg cccaccttcg gagttatcat aggtcagaaa 1201 tctatgatgg gtggaggatt agcagaaatt cctggacttt caatcaactt tgcaaaggtt 1261 cttcatggag agcagtactt agagttatat aaaccacttc ccagagcagg aaaattaaaa 1321 tgtgaagcag ttgttgctga tgtcctagat aaaggatccg gtgtagtgat tattatggat 1381 gtctattctt attctgagaa ggaacttata tgccacaatc agttctctct ctttcttgtt 1441 ggctctggag gctttggtgg aaaacggaca tcagacaaag tcaaggtagc tgtagccata 1501 cctaatagac ctcctgatgc tgtacttaca gataccacct ctcttaatca ggctgctttg 1561 taccgcctca gtggagactg gaatccctta cacattgatc ctaactttgc tagtctagca 1621 ggttttgaca agcccatatt acatggatta tgtacatttg gattttctgc caggcgtgtg 1681 ttacagcagt ttgcagataa tgatgtgtca agattcaagg caattaaggc tcgttttgca 1741 aaaccagtat atccaggaca aactctacaa actgagatgt ggaaggaagg aaacagaatt 1801 cattttcaaa ccaaggtcca agaaactgga gacattgtca tttcaaatgc atatgtggat 1861 cttgcaccaa catctggtac ttcagctaag acaccctctg agggcgggaa gcttcagagt 1921 acctttgtat ttgaggaaat aggacgccgc ctaaaggata ttgggcctga ggtggtgaag 1981 aaagtaaatg ctgtatttga gtggcatata accaaaggcg gaaatattgg ggctaagtgg 2041 actattgacc tgaaaagtgg ttctggaaaa gtgtaccaag gccctgcaaa aggtgctgct 2101 gatacaacaa tcatactttc agatgaagat ttcatggagg tggtcctggg caagcttgac 2161 cctcagaagg cattctttag tggcaggctg aaggccagag ggaacatcat gctgagccag 2221 aaacttcaga tgattcttaa agactacgcc aagctctgaa gggcacacta cactattaat 2281 aaaaatggaa tcattaaata ctctcttcac ccaaatatgc ttgattattc tgcaaaagtg 2341 attagaacta agatgcaggg gaaattgctt aacattttca gatatcagat aactgcagat 2401 tttcattttc tactaatttt catgtatcat tatttttaca aggaactata tataagctag 2461 cacatgatta tccttctgtt cttagatctg tatcttcata ataaaaaatt ttgcccaagt 2521 cctgtttcct tagaatttgt gatagcattg ataagttgaa aggaaaatta aatcaataaa 2581 ggcctttgat acc // LOCUS HS18D 905 bp RNA PRI 26-MAY-1993 DEFINITION Human 1-8D gene from interferon-inducible gene family. ACCESSION X57351 NID g311373 KEYWORDS 1-8 gene family; 1-8D gene; interferon inducible gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 905) AUTHORS Lewin,A.R., Reid,L.E., McMahon,M., Stark,G.R. and Kerr,I.M. TITLE Molecular analysis of a human interferon-inducible gene family JOURNAL Eur. J. Biochem. 199 (2), 417-423 (1991) MEDLINE 91301153 REFERENCE 2 (bases 1 to 905) AUTHORS Kerr,I.M. TITLE Direct Submission JOURNAL Submitted (22-JAN-1991) I.M. Kerr, Imperial Cancer Research Fund, 44 Lincoln's Inn Fields, London WC2A 3PX, U K COMMENT See X02490 for overlapping cDNA sequence. The Human 1-8D gene shows sequence identity to 1-8U (See X57352) and 9-27 (See J04164 for DNA sequence, X02491 for cDNA sequence). FEATURES Location/Qualifiers source 1..905 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="lymphoid" /clone_lib="loristB" misc_feature 189..202 /note="ISRE (interferon-stimulable response element)" /evidence=experimental gene 280..678 /gene="1-8D" exon <280..626 /gene="1-8D" /number=1 /evidence=experimental CDS 280..678 /gene="1-8D" /codon_start=1 /db_xref="PID:g23396" /db_xref="SWISS-PROT:Q01629" /translation="MNHIVQTFSPVNSGQPPNYEMLKEEQEVAMLGGPHNPAPPTSTV IHIRSETSVPDHVVWSLFNTLFMNTCCLGFIAFAYSVKSRDRKMVGDVTGAQAYASTA KCLNIWALILGIFMTILLVIIPVLVVQAQR" exon 627..>678 /gene="1-8D" /number=2 /evidence=experimental repeat_unit 698..765 /rpt_type=TANDEM /evidence=experimental repeat_unit 766..833 /rpt_type=TANDEM /evidence=experimental polyA_signal 886..891 BASE COUNT 189 a 304 c 206 g 206 t ORIGIN 1 caacacaggg gcagtctcca ggacctccac accattaaca agatgagcct tgtgctccct 61 tgggctctag agaggaagcc cctctgagcc ctcagcccct ctttcctccc tctcctaaag 121 taatttgatc ctcaggaatt tgttctgccc tcatctggcc ctggccagct ctgcatttga 181 caaatgccag gaagaggaaa ctgttgagaa aacggaacta ctggggaaag ggagggctca 241 ctgagaacca tcccggtaac ccgaccgccg ctggtcacca tgaaccacat tgtgcaaacc 301 ttctctcctg tcaacagcgg ccagcctccc aactacgaga tgctcaagga ggagcaggaa 361 gtggctatgc tgggggggcc ccacaaccct gctcccccga cgtccaccgt gatccacatc 421 cgcagcgaga cctccgtgcc tgaccatgtc gtctggtccc tgttcaacac cctcttcatg 481 aacacctgct gcctgggctt catagcattc gcctactccg tgaagtctag ggacaggaag 541 atggttggcg acgtgaccgg ggcccaggcc tatgcctcca ccgccaagtg cctgaacatc 601 tgggccctga ttttgggcat cttcatgacc attctgctcg tcatcatccc agtgttggtc 661 gtccaggccc agcgatagat caggaggcat cattgaggcc aggagctctg cccgtgacct 721 gtatcccacg tactctatct tccattcctc gccctgcccc cagaggccag gagctctgcc 781 cttgacctgt attccactta ctccaccttc cattcctcgc cctgtcccca cagccgagtc 841 ctgcatcagc cctttatcct cacacgcttt tctacaatgg cattcaataa agtgtatatg 901 tttct // LOCUS HS190 4949 bp RNA PRI 30-NOV-1995 DEFINITION H.sapiens mRNA for skeletal muscle 190kD protein. ACCESSION X69090 NID g407098 KEYWORDS fibronectin repeats; immunoglobulin superfamily; sarcomere M line; titin binding. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4949) AUTHORS Fuerst,D.O. TITLE Direct Submission JOURNAL Submitted (29-OCT-1992) D.O. Fuerst, Max Planck Institute for Biophysical Chemistry, Am Fassberg, 3400 Goettingen, FRG REMARK Revised by [3] REFERENCE 2 (bases 1 to 3287) AUTHORS Vinkemeier,U., Obermann,W., Weber,K. and Furst,D.O. TITLE The globular head domain of titin extends into the center of the sarcomeric M band. cDNA cloning, epitope mapping and immunoelectron microscopy of two titin-associated proteins JOURNAL J. Cell. Sci. 106 (Pt 1), 319-330 (1993) MEDLINE 94095665 REFERENCE 3 (bases 1 to 4949) AUTHORS Fuerst,D.O. TITLE Direct Submission JOURNAL Submitted (24-JUN-1993) D.O. Fuerst, Max Planck Institute for Biophysical Chemistry, Am Fassberg, 3400 Goettingen, FRG FEATURES Location/Qualifiers source 1..4949 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="skeletal muscle" /clone_lib="lambda gt11" CDS 118..4473 /codon_start=1 /product="190kD protein" /db_xref="PID:g407099" /translation="MVPIFSGRQKHVSGITDTEEERIKEAAAYIAQRNLLASEEGITT PKQSTASKQTTASKQSTASKQSTASKQSTASRQSTASRQSVVSKQATSALQQEETSEK KSRKVVIRGKAERLSLRKTLEETETYHAKLNEDHLLHAPEFIIKPRSHTVWEKENVKL HCSIAGWPEPRVTWYKNQVPINVHANPGKYIIESRYGMHTLEINACDFEDTAQYRASA MNVKGELSAYASVVVKRYKGEFDETRFHAGASTMPLSFGVTPYGYASRFEIHFDDKFD VSFGREGETMSLGCRVVITPEIKHFQPEIRWYRNGVPLSPSKWVQTLWSGERATLTFS HLNKEDEGLYTIRVRMGEYYEQYSAYVFVRDADAEIEGAPAAPLDVKCLEANKDYIII SWKQPAVDGGSPILGYFIDKCEVGTDSWSQCNDTPVKFARFPVTGLIEGRSYIFRVRA VNKMGIGFPSRVSEAVAALDPAEKARLKSPLSTLDWTVIVTEEEPSEGIVPGPPTDLS VTEATRSYVVLSWKPPGQRGHEGIMYFVEKCEAGTENWQRVNTELPVKSPRFALFDLA EGKSYCFRVRCSNSAGVGEPSEATEVTVVGDKLDIPKAPGKIIPSRNTDTSVVVSWEE SKDAKELVGYYIEANVAGSGKWEPCNNNPVKTHRFTCHGLVTGQSYIFRVRAVNAAGL SEYSQDSEAIEVKAAIAPPSPPCDITCLESFRDSMVLGWKQPDKTGGAEITGYYVNYR EVIDGVPGKWREANVKAVREEAYKISNLKENMVYQFQVAAMNMAGLGAPSAVSECFKC EEWTIAVPGPPHSLKCSEVRKDSLVLQWKPPVHSGRTPVTGYFVDLKEKAKEDQWRGL NEAAIKNVYLKVRGLKEGVSYVFRVRAINQAGVGKPSDLAGPVVAETRPGRKEVVVNV DDDGVISLNFECDKMTPKSEFSWSKDYVSTEDSPRLEVESKGNKTKMTFKDLGMDDLG IYSCDVTDTDGIASSYLIDEEELKRLLALSHEHKFPTVPVKSELAVEILEKGQVRFWM QAEKLSGNAKVNYIFNEKEIFEGPKYKMHIDRNTGIIEMFMEKLQDEDEGTYTFQLQD GKATNHSTVVLVGDVFKKLQKEAEFQRQEWIRKQGPHFVEYLSWEVTGECNVLLKCKV ANIKKETHIVWYKDEREISVDEKHDFKDGICTLLITEFSKKDAGIYEVILKDDRGKDK SRLKLVDEAFKELMMEVCKKIALSATDLKIQSTAEGIQLYSFVTYYVEDLKVNWSHNG SAIRYSDRVKTGVTGEQIWLQINEPTPNDKGKYVMELFDGKTGHQKTVDLSGQAYDEA YAEFQRLKQAAIAEKNRARVLGGLPDVVTIQEGKALNLTCNVWGDPPPEVSWLKNEKA LAQTDHCNLKFEAGRTAYFTINGVSTADSGKYGLVVKNKYGSETSDFTVSVFIPEEEA RMAALESLKGGKKAK" BASE COUNT 1415 a 1044 c 1324 g 1166 t ORIGIN 1 cggacagatt ccagtctgct gttagatgat tattcatcca agttgagccc caaaccaaag 61 agagccaagc acagcctact gtctggagaa gagaaagaaa atttgcccag tgactacatg 121 gtacccattt tctcaggacg tcaaaagcat gtcagtggaa ttactgatac ggaagaagaa 181 agaattaagg aagctgctgc ttatatagcc cagaggaatc ttcttgctag tgaggaagga 241 atcacaacac ctaaacagtc cacggcatcc aagcagacca cggcatctaa gcagtccacg 301 gcatccaagc agtccacagc atccaagcag tccacggcat ccaggcagtc cacggcatcc 361 aggcagtctg tggtttccaa acaggccaca tccgctcttc aacaggaaga aacttctgaa 421 aagaagtcaa ggaaagttgt gattcgagga aaggcagaac gcctgtccct gaggaaaaca 481 ttagaagaaa ccgagacata tcatgccaag ctgaatgaag accatcttct ccatgctcct 541 gagtttatca ttaaacctcg ctcccacacg gtttgggaga aggagaatgt aaaattgcat 601 tgctccatag caggatggcc agaacctcgt gtcacgtggt ataaaaacca ggtgccaata 661 aatgtccatg caaaccctgg aaagtatatt attgagagtc gatatggaat gcacactctg 721 gagattaatg catgtgattt tgaagataca gctcagtacc gggcctcggc gatgaatgtt 781 aaaggagagc tttcggcata tgcttcagtt gtggtaaaaa ggtataaggg agagtttgat 841 gagactcgct tccatgctgg ggcttccacc atgcccctca gctttggtgt gaccccatat 901 ggttatgcat cccggtttga gatccacttt gatgacaaat ttgatgtgtc ttttgggaga 961 gagggagaga caatgagtct aggctgtcgt gttgtcatca ctcctgaaat taaacatttc 1021 cagccagaga tccggtggta cagaaatgga gtacctcttt ctccatcaaa atgggtgcaa 1081 acactttgga gtggagagcg ggcaacgctg acattttccc atctcaacaa agaagatgaa 1141 ggcctctata caatccgtgt acggatggga gaatattatg aacaatatag tgcttatgtc 1201 tttgttcgag atgctgatgc agagattgaa ggagccccag ctgctccctt ggatgtgaag 1261 tgcttggagg ccaacaaaga ttatatcatc atctcctgga aacagccagc tgtcgatgga 1321 gggagtccta ttctcggata ttttattgat aagtgtgagg tgggcacaga tagctggtcg 1381 cagtgcaatg acacacctgt gaagtttgct cgttttcctg tcactggatt gatcgaaggt 1441 cgttcctata tcttccgagt tcgagctgtg aataaaatgg gaataggttt cccatctcga 1501 gtttccgagg ccgtggctgc tctggatccg gctgagaaag ctagactaaa gtcgcccctc 1561 agcaccctgg actggacagt cattgttact gaagaggaac cttcagaggg tattgtgcct 1621 ggccccccga cagacctctc tgtcactgag gccacccgga gctatgtggt gctcagctgg 1681 aagccccctg gccagcgtgg tcatgagggc attatgtact ttgtggaaaa gtgtgaggca 1741 ggaacagaaa actggcagcg agtgaacacg gagctccctg tgaagtctcc ccgctttgct 1801 ctgtttgact tggccgaggg gaaatcctac tgtttccgtg tccgctgttc taattctgca 1861 ggagttggtg agccctcaga ggcaacggag gtgactgtgg taggggacaa acttgatatc 1921 cccaaggctc ctggcaaaat catcccaagc agaaacacag acacctcagt ggtagtttcg 1981 tgggaggagt ccaaagatgc caaagagctg gtcgggtact acatagaggc aaacgttgct 2041 ggctctggca agtgggagcc ctgtaacaac aaccccgtga aaactcaccg attcacttgt 2101 catggattag tgactggtca gagttatata ttccgggtca gagcagtcaa tgcagctgga 2161 cttagtgaat attcccagga ttcagaagct attgaagtca aagctgctat tgcaccacca 2221 tctccaccct gtgatatcac ctgtcttgaa agttttcgtg actcaatggt tcttggatgg 2281 aagcaaccag ataagactgg aggggcagaa attactggct attatgtgaa ctatcgcgag 2341 gtcattgatg gggtaccagg aaaatggaga gaagccaatg tcaaggctgt cagagaggag 2401 gcatacaaga ttagcaactt gaaggaaaac atggtgtatc agttccaagt ggcagccatg 2461 aacatggctg ggctgggcgc gccctccgca gtaagcgaat gcttcaaatg tgaagagtgg 2521 accatcgccg tcccaggacc accgcacagt ctcaagtgta gtgaagtcag gaaagactca 2581 ctggttctcc agtggaagcc gccagtccac tccgggcgga ctccggtcac tggttacttc 2641 gtggacttga aggagaaggc caaagaagac cagtggcgag ggctcaatga ggcggctatt 2701 aaaaacgtat acctgaaggt tcgaggcctc aaggagggcg tcagctacgt gttccgtgtt 2761 cgagccataa accaggcggg agttgggaag ccatctgacc ttgctggccc tgttgtggca 2821 gagacccgtc caggaaggaa agaggttgtt gtaaatgtgg atgatgatgg agtcatttca 2881 ttgaacttcg agtgtgataa gatgactcca aagtccgagt tctcctggtc caaagattat 2941 gtatccactg aggactctcc acgattggaa gtcgaaagca agggcaacaa gacgaaaatg 3001 accttcaaag accttgggat ggatgacttg ggtatttact cttgcgatgt aacagacact 3061 gatggaatag catcaagcta cttaatagat gaggaagaat tgaaacgttt acttgctctc 3121 agccatgaac acaagttccc aactgtccca gttaaatcag agttggcagt tgaaattttg 3181 gagaaaggcc aggtccggtt ttggatgcag gctgagaaac tgtctggcaa tgccaaagtc 3241 aactacatat ttaacgagaa ggaaattttt gaaggcccga aatataaaat gcatattgac 3301 cgaaacactg gcatcatcga aatgttcatg gaaaagctac aggatgagga tgagggaacg 3361 tacactttcc agcttcaaga tggaaaagca actaaccatt ctactgttgt tctcgttgga 3421 gatgttttca aaaagctcca gaaagaagct gaattccagc ggcaagaatg gatcaggaaa 3481 caaggtcctc actttgttga gtatttgagc tgggaagtga ctggtgaatg taatgtacta 3541 ttgaaatgca aggtggcaaa tattaagaag gagactcata ttgtgtggta caaagatgag 3601 agggagatat cagtggatga aaagcatgac tttaaggatg gtatatgtac cctgcttata 3661 acagagtttt ccaagaaaga tgctgggatt tatgaagtta tcctgaaaga tgaccgagga 3721 aaagataaga gcagactgaa gcttgtggat gaagccttta aggaactgat gatggaagta 3781 tgcaaaaaaa tagctttgtc tgctacagac ctgaaaatcc agagcacagc cgagggcatc 3841 caactgtact cttttgtaac ttactatgtg gaggatttga aagttaactg gtcccacaat 3901 gggtccgcca ttaggtactc agacagagtt aagaccgggg tcactggaga gcagatctgg 3961 ctacaaatca acgagcccac cccgaatgac aaagggaagt atgtcatgga gctctttgat 4021 ggcaaaactg gacatcagaa gacagtggat ctctctggac aagcatacga tgaggcctat 4081 gctgaattcc agaggttgaa acaagctgcc attgccgaga aaaatcgtgc ccgggtgttg 4141 ggaggtctcc cagacgtggt caccatccag gaggggaagg cccttaatct cacttgcaac 4201 gtgtggggag acccgcctcc ggaggtgtcg tggttgaaga acgagaaggc cctggctcag 4261 acggaccact gcaacctcaa gttcgaggct gggaggaccg cgtacttcac catcaacggt 4321 gtgagcaccg ctgactcggg caaatacggg ctggttgtga agaacaagta tggctcggag 4381 accagcgact tcaccgtcag cgtgttcatc ccagaggagg aggcgaggat ggccgccttg 4441 gagtccctga aaggtggcaa gaaggccaag tgaccggagg tgcgaggaga gcagccggcc 4501 tgtgtgactt gggtgtgaat ggtttgggtt aaggatgaga cgtccttcat gcttctcctc 4561 cctattattt ctggcttgag ggaaataatg tcaggtcttt cactcatata aaaaagcacc 4621 aactaatgac actttaattg tttttcttta tctacaaaat tatgtgttaa gaaaatacca 4681 ttcatagcat gaagattagg aaacagtttt aaggagaaga cttgaatgaa gttggaggga 4741 cattgaatga tggtcagagg gcagacgaat gtgtcgtggg gcggaattgg gatttgctgc 4801 agctgtgaag ccatggccgt gtctcgtgtg ttgttacaga ggtgatgtgc ttttcgacgg 4861 gccctcgtgg cttggaacct cctctgtatg aataaacagt tttcacgtct gtcctcttcc 4921 ccgaaaaaaa aaaaaaaaaa aaaaaaaaa // LOCUS HS1R20RNA 1227 bp RNA PRI 30-NOV-1993 DEFINITION H.sapiens 1r20 mRNA for alpha helical basic phosphoprotein. ACCESSION X73427 NID g313214 KEYWORDS early response gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1227) AUTHORS Newton,J.S., Deed,R.W., Mitchell,E.L., Murphy,J.J. and Norton,J.D. TITLE A B cell specific immediate early human gene is located on chromosome band 1q31 and encodes an alpha helical basic phosphoprotein JOURNAL Biochim. Biophys. Acta 1216 (2), 314-316 (1993) MEDLINE 94060109 REFERENCE 2 (bases 1 to 1227) AUTHORS Deed,R. TITLE Direct Submission JOURNAL Submitted (18-JUN-1993) R. Deed, Paterson Institute for Cancer Research, Dept of Gene Regulation, Christie Hospital NHS Trust, Wilmslow Road, Manchester, M20 9BX, UK FEATURES Location/Qualifiers source 1..1227 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /tissue_type="peripheral blood" /cell_type="lymphocytes" /clone="1r20" /clone_lib="gt10" /chromosome="1q31" gene 15..605 /gene="1r20" CDS 15..605 /gene="1r20" /codon_start=1 /db_xref="PID:g313215" /db_xref="SWISS-PROT:Q08116" /translation="MPGMFFSANPKELKGTTHSLLDDKMQKRRPKTFGMDMKAYLRSM IPHLESGMKSSKSKDVLSAAEVMQWSQSLEKLLANQTGQNVFGSFLKSEFSEENIEFW LACEDYKKTESDLLPCKAEEIYKAFVHSDAAKQINIDFRTRESTAKKIKAPTPTCFDE AQKVIYTLMEKDSYPRFLKSHIYLNLLNDLQANSLK" misc_feature 1196..1202 /note="immediate early motif" polyA_signal 1202..1207 BASE COUNT 423 a 221 c 249 g 334 t ORIGIN 1 gggagttaga caaaatgcca ggaatgttct tctctgctaa cccaaaggaa ttgaaaggaa 61 ccactcattc acttctagac gacaaaatgc aaaaaaggag gccaaagact tttggaatgg 121 atatgaaagc atacctgaga tctatgatcc cacatctgga atctggaatg aaatcttcca 181 agtccaagga tgtactttct gctgctgaag taatgcaatg gtctcaatct ctggaaaaac 241 ttcttgccaa ccaaactggt caaaatgtct ttggaagttt cctaaagtct gaattcagtg 301 aggagaatat tgagttctgg ctggcttgtg aagactataa gaaaacagag tctgatcttt 361 tgccctgtaa agcagaagag atatataaag catttgtgca ttcagatgct gctaaacaaa 421 tcaatattga cttccgcact cgagaatcta cagccaagaa gattaaagca ccaaccccca 481 cgtgttttga tgaagcacaa aaagtcatat atactcttat ggaaaaggac tcttatccca 541 ggttcctcaa atcacatatt tacttaaatc ttctaaatga cctgcaggct aatagcctaa 601 agtgactggt ccctggctga agggaattaa cagatagtat cagcgcagaa ggaatgtgcc 661 agtatggatc cctgggtgaa cagcttggcc ttttttgggt gtcttgacag gccaagaaga 721 acaaatgact cagaaccgga ttaacatgaa agttatccag gcgcagagtt gaagaagcat 781 aagcaagcaa gacaaaaaca gagagaccgc aaggaggaag atctgtggta ctgtcataaa 841 aaacagtgga gctctgtatt agaaaagccc ctcagaactg ggaaggccag gtaactctag 901 ttacacagaa actggtacta aagtctatca aactgattac acagactgta agaattcaaa 961 gtcaactgac atctatgcta catatattat atagtttgta cttgactatg agccattaac 1021 ttaaagcata tgtttcaaat agccattgct actattcctt gtccggtgta attttatttt 1081 attgttttta ctttggaaga gatgaactgt gtatttaact taagctattg ctcttaaaac 1141 cagggagtca gatatatttg taaggttaaa tcattggtgc aataataaat gtggattttg 1201 tattaaaata tatatgaagc aaaaaaa // LOCUS HS219MRNA 1606 bp RNA PRI 02-JUN-1995 DEFINITION H.sapiens mRNA for 2.19 gene. ACCESSION X87193 NID g854081 KEYWORDS 2.19 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1606) AUTHORS Bione,S., Tamanini,F., Maestrini,E., Tribioli,C., Poustka,A., Torri,G., Rivella,S. and Toniolo,D. TITLE Transcriptional organization of a 450-kb region of the human X chromosome in Xq28 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90 (23), 10977-10981 (1993) MEDLINE 94068527 REFERENCE 2 (bases 1 to 1606) AUTHORS Toniolo,D. TITLE Direct Submission JOURNAL Submitted (09-MAY-1995) D. Toniolo, Instituto di Genetica Biochimica ed Evoluzionistica, CNR, Via Abbiategrasso 207, 27100 Pavia, ITALY FEATURES Location/Qualifiers source 1..1606 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="fetal brain" /clone_lib="lambda ZAP" /chromosome="X" /map="q28" gene 278..970 /gene="2.19" CDS 278..970 /gene="2.19" /codon_start=1 /db_xref="PID:g854082" /translation="MRLAGPLRIVVLVVSVGVTWIVVSILLGGPGSGFPRIQQLFTSP ESSVTAAPRARKYKCGLPQPCPEEHLAFRVVSGAANVIGPKICLEDKMLMSSVKDNVG RGLNIALVNGVSGELIEARAFDMWAGDVNDLLKFIRPLHEGTLVFVASYDDPATKMNE ETRKLFSELGSRNAKELAFRDSWVFVGAKGVQNKSPFEQHVKNSKHSNKYEGCPEALE MEGCIPRRSTAS" polyA_signal 1588..1593 BASE COUNT 301 a 521 c 503 g 281 t ORIGIN 1 aaccacatct tcgtcccagc cccggaggct cctgtgggca agatcgtgag ccaacgggtt 61 cctgaggccc ctcctggcca ggcagggttt ccccgcgcgt ttccgaggag ccctgcctgg 121 ccgggcggct ggacaaacag gtcgtagcac cgatcgcgcc cgcccccagc aggggtcccg 181 cacaggcttg cccctgaccc ccacccaaac ctgtccttcc gctttgcccc caaacagtgc 241 acttgccggc ggtcccaacc cagcaggaga agtggacatg aggttggcag gccctctccg 301 cattgtggtc ctagtcgtca gtgtgggtgt cacatggatc gtggtcagca tcctcctggg 361 tgggcctggc agtggctttc ctcgcatcca gcaactcttc accagtccag agagctcggt 421 gactgcagcg ccacgggcca ggaagtacaa gtgtggcctg ccccagccgt gtcctgagga 481 gcacctggcc ttccgcgtgg tcagcggggc cgccaacgtc attgggccca agatctgcct 541 cgaggacaag atgctgatga gcagcgtcaa ggacaacgtg ggccgcgggc tgaacatcgc 601 cctggtgaac ggggtcagcg gcgagctcat cgaggcccgg gcctttgaca tgtgggccgg 661 agatgtcaac gacctgttga agtttattcg gccactgcac gaaggcaccc tggtgttcgt 721 ggcatcctac gacgacccag ccaccaagat gaatgaagag accagaaagc tcttcagtga 781 gctgggcagc aggaacgcca aggagctggc cttccgggac agctgggtgt ttgtcggggc 841 caagggtgtg cagaacaaga gcccctttga gcagcacgtg aagaacagta agcacagcaa 901 caagtacgaa ggctgccccg aggcgctgga gatggaaggc tgtatcccgc ggagaagcac 961 ggccagctag cacggccagt gccaggaccg ggccgaggga ggccagacca agggaggcac 1021 gcgcgctgcc gggcggacag aggctgaggc tcacacccca cacccgggca ggagcgctcc 1081 ctggccccaa cacatcgggg ctccgaggca gtgaccagaa cgtggtctca aggtggtggg 1141 ggctatgggg gctgcagggg gtagccctgc cgcactttgt cacgggagcc cagggtaccc 1201 gcctcctttt cgtaacactg ttccccccgg tcagcccatc tagccctgtc ctccattcct 1261 cacgccatct ccatccccat cttgagtcct ggaacggccc tgggtgcctg cccctcactg 1321 tgcatctctg ggagcagccc ggcaggttgg ggcgtcttcc agaacctctc ccttctggag 1381 ccactctgca ctgcgggcta aacatgtttc cagtgtgatt ccttccagtg agccaaaccc 1441 ggtggctgct tcatgagcct gactgcctct cgcctgctct cagcaggaag ggacccctgg 1501 agcaggctgg cccggggtgg tgaagtagct ggagcccgat cacagtcccg cggtttgtca 1561 gggggcccac cttctagatg accccttaat aaagtgatgg ccccac // LOCUS HS23KDHBP 672 bp RNA PRI 28-OCT-1992 DEFINITION H.sapiens mRNA for 23 kD highly basic protein. ACCESSION X56932 NID g23690 KEYWORDS basic protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 672) AUTHORS Price,S.R. TITLE Direct Submission JOURNAL Submitted (29-JAN-1991) S.R. Price, National Hearth Lung and, Blood Institute, Lab. of Cellular Metabolism, Bldg 10 Room 5N307, Bethesda MD 20892, U S A REFERENCE 2 (bases 1 to 672) AUTHORS Price,S.R., Nightingale,M.S., Bobak,D.A., Tsuchiya,M., Moss,J. and Vaughan,M. TITLE Identification of Multiple mRNAs that Arise Through Utilization of Alternative Polyadenylation Sites and Encodes a Putative Human 23 kDa highly Basic Protein JOURNAL Unpublished FEATURES Location/Qualifiers source 1..672 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 18..629 /codon_start=1 /product="23 kD highly basic protein" /db_xref="PID:g23691" /db_xref="SWISS-PROT:P40429" /translation="MAEVQVLVLDGRGHLLGRLAAIVAKQVLLGRKVVVVRCEGINIS GNFYRNKLKYLAFLRKRMNTNPSRGPYHFRAPSRIFWRTVRGMLPHKTKRGQAALDRL KVFDGIPPPYDKKKRMVVPAALKVVRLKPTRKFAYLGRLAHEVGWKYQAVTATLEEKR KEKAKIHYRKKKQLMRLRKQAEKNVEKKIDKYTEVLKTHGLLV" polyA_signal 634..639 polyA_site 655 BASE COUNT 166 a 190 c 195 g 121 t ORIGIN 1 ccaagcggct gccgaagatg gcggaggtgc aggtcctggt gcttgatggt cgaggccatc 61 tcctgggccg cctggcggcc atcgtggcta aacaggtact gctgggccgg aaggtggtgg 121 tcgtacgctg tgaaggcatc aacatttctg gcaatttcta cagaaacaag ttgaagtacc 181 tggctttcct ccgcaagcgg atgaacacca acccttcccg aggcccctac cacttccggg 241 cccccagccg catcttctgg cggaccgtgc gaggtatgct gccccacaaa accaagcgag 301 gccaggccgc tctggaccgt ctcaaggtgt ttgacggcat cccaccgccc tatgacaaga 361 aaaagcggat ggtggttccg gctgccctca aggtcgtgcg tctgaagcct acaagaaagt 421 ttgcctatct ggggcgcctg gctcacgagg ttggctggaa gtaccaggca gtgacagcca 481 ccctggagga gaagaggaaa gagaaagcca agatccacta ccggaagaag aaacagctca 541 tgaggctacg gaaacaggcc gagaagaacg tggagaagaa aattgacaaa tacacagagg 601 tcctcaagac ccacggactc ctggtctgag cccaataaag actgttaatt cctcatgcgt 661 tgcctgccct tc // LOCUS HS25ABP 3568 bp RNA PRI 15-SEP-1995 DEFINITION H.sapiens mRNA for 2-5A binding protein. ACCESSION X76388 NID g608721 KEYWORDS binding protein; RNase L inhibitor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3568) AUTHORS Salehzada,T. TITLE Direct Submission JOURNAL Submitted (29-NOV-1993) T. Salehzada, Inst de Genetique Moleculaire de Montpellier UMR-CNRS 9942, 1919 Route de Mende BP 5051, 34033 Montpellier, Cedex 01, FRANCE REFERENCE 2 (bases 1 to 3568) AUTHORS Bisbal,C., Martinand,C., Silhol,M., Lebleu,B. and Salehzada,T. TITLE Cloning and characterization of a RNAse L inhibitor. A new component of the interferon-regulated 2-5A pathway JOURNAL J. Biol. Chem. 270 (22), 13308-13317 (1995) MEDLINE 95286622 FEATURES Location/Qualifiers source 1..3568 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lymphoblastoid" /cell_line="daudi" /clone_lib="lambda zap" gene 118..1917 /gene="RNase" CDS 118..1917 /gene="RNase" /codon_start=1 /product="RNase L inhibitor" /db_xref="PID:g987870" /translation="MADKLTRIAIVNHDKCKPKKCRQECKKSCPVVRMGKLCIEVTPQ SKIAWISETLCIGCGICIKKCPFGALSIVNLPSNLEKETTHRYCANAFKLHRLPIPRP GEVLGLVGTNGIGKSAALKILAGKQKPNLGKYDDPPDWQEILTYFRGSELQNYFTKIL EDDLKAIIKPQYVARFLRLAKGTVGSILDRKDETKTQAIVCQQLDLTHLKERNVEDLS GGELQRFACAVVCIQKADIFMFDEPSSYLDVKQRLKAAITIRSLINPDRYIIVVEHDL SVLDYLSDFICCLYGVPSAYGVVTMPFSVREGINIFLDGYVPTENLRFRDASLVFKVA ETANEEEVKKMCMYKYPGMKKKMGEFELAIVAGEFTDSEIMVMLGENGTGKTTFIRML AGRLKPDEGGEVPVLNVSYKPQKISPKSTGSVRQLLHEKIRDAYTHPQFVTDVMKPLQ IENIIDQEVQTLSGGELQRVRLRLCLGKPADVYLIDEPSAYLDSEQRLMAARVVKRFI LHAKKTAFVVEHDFIMATYLADRVIVFDGVPSKNTVANSPQTLLAGMNKFLSQLEITF RRDPNNYRPRINKLNSIKDVEQKKSGNYFFLDD" misc_feature 206..239 /gene="RNase" /note="4fe4s-ferredoxin" misc_feature 373..397 /gene="RNase" /note="p-loop" misc_feature 1178..1202 /gene="RNase" /note="p-loop" BASE COUNT 1191 a 603 c 678 g 1096 t ORIGIN 1 ccggtcctga gacacgctgt gtggctgaaa agtgaaggca agagctcatt tggcctctgt 61 gctcccctcc gcaagggatc gtttctccag aagagctgga tattctttcg cccagttatg 121 gcagacaagt taacgagaat tgctattgtc aaccatgaca aatgtaaacc taagaaatgt 181 cgacaggaat gcaaaaagag ttgtcctgta gttcgaatgg gaaaattatg catagaggtt 241 acaccccaga gcaaaatagc atggatttcc gaaactcttt gtattggttg tggtatctgt 301 attaagaaat gcccctttgg cgccttatca attgtcaatc taccaagcaa cttggaaaaa 361 gaaaccacac atcgatattg tgccaatgcc ttcaaacttc acaggttgcc tatccctcgt 421 ccaggtgaag ttttgggatt agttggaact aatggtattg gaaagtcagc tgctttaaaa 481 attttagcag gaaaacaaaa gccaaacctt ggaaagtacg atgatcctcc tgactggcag 541 gagattttga cttatttccg tggatctgaa ttacaaaatt actttacaaa gattctagaa 601 gatgacctaa aagccatcat caaacctcaa tatgtagcca gattcctaag gctggcaaag 661 gggacagtgg gatctatttt ggaccgaaaa gatgaaacaa agacacaggc aattgtatgt 721 cagcagcttg atttaaccca cctaaaagaa cgaaatgttg aagatctttc aggaggagag 781 ttgcagagat ttgcttgtgc tgtcgtttgc atacagaaag ctgatatttt catgtttgat 841 gagccttcta gttacctaga tgtcaagcag cgtttaaagg ctgctattac tatacgatct 901 ctaataaatc cagatagata tatcattgtg gtggaacatg atctaagtgt attagactat 961 ctctccgact tcatctgctg tttatatggt gtaccaagcg cctatggagt tgtcactatg 1021 ccttttagtg taagagaagg cataaacatt tttttggatg gctatgttcc aacagaaaac 1081 ttgagattca gagatgcatc acttgttttt aaagtggctg agacagcaaa tgaagaagaa 1141 gttaaaaaga tgtgtatgta taaatatcca ggaatgaaga aaaaaatggg agaatttgag 1201 ctagcaattg tagctggaga gtttacagat tctgaaatta tggtgatgct gggggaaaat 1261 ggaacgggta aaacgacatt tatcagaatg cttgctggaa gacttaaacc tgatgaagga 1321 ggagaagtac cagttctaaa tgtcagttat aagccacaga aaattagtcc caaatcaact 1381 ggaagtgttc gccagttact acatgaaaag ataagagatg cttatactca cccacaattt 1441 gtgaccgatg taatgaagcc tctgcaaatt gaaaacatca ttgatcaaga ggtgcagaca 1501 ttatctggtg gtgaactaca gcgagtacgt ttacgccttt gcttgggcaa acctgctgat 1561 gtctatttaa ttgatgaacc atctgcatat ttggattctg agcaaagact gatggcagct 1621 cgagttgtca aacgtttcat actccatgca aaaaagacag cctttgttgt ggaacatgac 1681 ttcatcatgg ccacctatct agcggatcgc gtcatcgttt ttgatggtgt tccatctaag 1741 aacacagttg caaacagtcc tcaaaccctt ttggctggca tgaataaatt tttgtctcag 1801 cttgaaatta cattcagaag agatccaaac aactataggc cacgaataaa caaacttaat 1861 tcaattaagg atgtagaaca aaagaagagt ggaaactact ttttcttgga tgattagact 1921 gactctgaga atattgataa gccatttatt aaaaggagta tttactagaa ttttttgtca 1981 tataaaactt gaatcaggat tttatgcccc acatactctg gaacttgaag tataatatac 2041 ttaatataac ataaaaagcc agttgggttc taaattgtag ttgaaacaca gaaaatgcca 2101 cttttctgtt cctgaagagg ctcttttgtg cataatattc taaaatgaag acatttcaag 2161 ctatacaaat tacttccaag ttttcatgat gtatgggaag attttcagta ggtgtattat 2221 attcacggta ccaaatgctg accagtgttg ctccattttt taaatcttga aaagggtttc 2281 tgtacttacc tggtttgcca agtatgccag tgtaatgaaa ctgcccttat tttaaaagcc 2341 agtcaaagat tccactgatt gacatttgat aaataaacat caggattatg tttattgttt 2401 gttttcagtc tttgcactat attaccagta tatggtttcc gaggaagatt atctactgca 2461 aaacaccact gttggaaaaa taggtatttt taaattgttt ttaatccttt tttggtgctt 2521 ttaaacatgt ttaagcaaaa accaattcag tccattcccc gcaaaaaacc cctaacttta 2581 ctctgaactt tttttgtttt tgcattccat gaggttctgt attcagtcat tctctaggta 2641 atgtcatttt tgtacacata tatttatata atcactgatt gagatttagg aaaaagcatt 2701 tctaaagaat atttgcttcc cttagaacta cagactcgaa atctttaaag atggtgccta 2761 agcatctatg tatttttttt aagttccaca gatttttctg ttgggcaggc caaggattat 2821 aaaccacttc cctaaaggca acattaatgc aaaagtcccc agatggcaat acaaagtatc 2881 ccctggtacc acatatattc atttgtgagt ttggatatag agcacattat ctaaaccatt 2941 ttgtagttcc aaaaacccat ctaaatttct tgagttcctg aattttgaac aggattacct 3001 ggagcctgga gccactttaa gttgtacttc tgactaaact ggaattatga gtgaggaaga 3061 gtgtttacta aataaatgac tggggcaagc aaaattgagg aggaaattag aaactgtttg 3121 acaaacttta agagctactt gaaataacag aagtcttgat taatatgcaa ataatggcta 3181 gaaagtatgg tttaactgga ccctattatg ccttttaaaa ataatttcag taacccataa 3241 atacatgttg taaaaaattc aaatatacag aatggaataa aaaaatgatc tccctttatt 3301 accctcccaa aggttaccag cgtttgaatt taataatgta tattctttca tgcttttttc 3361 tgtgcactta cctaagtgtg aatatgtaaa gggtttgttt tgtatacaaa tgggattata 3421 ctaaaataag taatgcctat ttttaaggat aggttaaatt tgtgaatgat catttcaaat 3481 atattgaata aaataagcaa aagctattgt tatttactga tcctgaaaaa aaaaaaaaaa 3541 aaaaaaaaaa aaaaaaaaaa aaaaaaaa // LOCUS HS296K21 111994 bp DNA PRI 03-MAR-1997 DEFINITION Human DNA sequence from PAC 296K21 on chromosome X contains cytokeratin exon, delta-aminolevulinate synthase (erythroid); 5-aminolevulinic acid synthase.(EC 2.3.1.37). 6-phosphofructo-2-kinase/fructose-2,6-bisphosphatase (EC 2.7.1.105, EC 3.1.3.46), ESTs and STS. ACCESSION Z83821 NID g1869771 KEYWORDS 5-aminolevulinic acid synthase; 6-phosphofructo-2-kinase; cytokeratin; delta-aminolevulinate synthase; fructose-2,6-bisphosphatase; kinase; phosphatase; phosphofructokinase; X. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 111994) AUTHORS Isherwood,J. TITLE Direct Submission JOURNAL Submitted (24-FEB-1997) Sanger Centre, Hinxton, Cambridgeshire, CB10 1SA, UK. E-mail enquires: humquery@sanger.ac.uk Clone requests: clonerequest@sanger.ac.uk COMMENT de Jong P.J., enquires: http://bacpac.med.buffalo.edu/ IMPORTANT: This sequence is the entire insert of clone 296K21. This sequence has been finished according to sequence map criteria as follows. An attempt is made to resolve all sequencing problems, such as compressions and repeats, but not necessarily within known annotated human repeat sequence elements (e.g. Alu). Where the sequence is ambiguous, there is an annotation using the 'unsure' feature key. The true left end of clone 296K21 is at 1 in this sequence. The true right end of clone 296K21 is at 111994. 296K21 is from the human PAC library described in Ioannou A.P. et al Nature Genet 6, 84-89. FEATURES Location/Qualifiers source 1..111994 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /map="X" /clone="296K21" /clone_lib="RPCI-1" repeat_region 1..1424 /note="L1 repeat: matches 3783..5212 of consensus" repeat_region 1373..2227 /note="L1PA4 repeat: matches 28..891 of consensus" repeat_region 2753..3052 /note="AluSx repeat: matches 1..300 of consensus" repeat_region 3170..3315 /note="MIR2 repeat: matches 2..146 of consensus" repeat_region 4557..5173 /note="L1PB1 repeat: matches 902..232 of consensus" repeat_region 5181..5480 /note="AluSp repeat: matches 302..1 of consensus" repeat_region 5481..5715 /note="L1PB3 repeat: matches 242..1 of consensus" repeat_region 5573..6481 /note="L1 repeat: matches 5390..4467 of consensus" repeat_region 6493..6766 /note="AluYa5 repeat: matches 301..27 of consensus; incomplete repeat" repeat_region 6767..7290 /note="L1 repeat: matches 4471..3949 of consensus" prim_transcript <7284..7651 /note="match: multiple ESTs; match: C06016 W70298 H61289 W60456; match: W39377 R77904 N78835 D12240; match: R32006" CDS <7303..7404 /note="similar to M26324" /codon_start=1 /product="cytokeratin" /db_xref="PID:e304605" /db_xref="PID:g1869772" /translation="SFSCTCSSRAMVVKKIETRDGKLVSESSDVLPM" repeat_region 7786..8086 /note="AluY repeat: matches 1..301 of consensus" repeat_region 8331..8698 /note="L1MB3 repeat: matches 931..551 of consensus" repeat_region 8691..9014 /note="L1PA5 repeat: matches 567..892 of consensus" repeat_region 9316..9427 /note="MIR repeat: matches 20..140 of consensus" repeat_region 9681..9808 /note="L1MB7 repeat: matches 578..451 of consensus" prim_transcript complement(<9859..>10014) /note="match: 5' EST T95185 clone 120153" repeat_region 11113..12002 /note="L1PA2 repeat: matches 891..1 of consensus" repeat_region 11853..12612 /note="L1 repeat: matches 5390..4633 of consensus" repeat_region 12600..13359 /note="L1 repeat: matches 3877..4635 of consensus" repeat_region 13363..13537 /note="MIR repeat: matches 197..11 of consensus" repeat_region 13958..14358 /note="MLT2A repeat: matches 1..413 of consensus" repeat_region 14359..14390 /note="16 copies of 2 mer 88 % conserved" repeat_region 14402..14473 /note="MLT2A repeat: matches 386..453 of consensus" repeat_region 15226..15331 /note="L1MB8 repeat: matches 815..920 of consensus" repeat_region 16378..16450 /note="MIR2 repeat: matches 74..146 of consensus" repeat_region 16515..16681 /note="MIR repeat: matches 192..2 of consensus" repeat_region 16683..17161 /note="L1MC2 repeat: matches 54..536 of consensus" repeat_region 17462..17695 /note="MIR repeat: matches 20..262 of consensus" repeat_region 19657..19779 /note="MIR repeat: matches 10..133 of consensus" repeat_region 20081..20319 /note="MIR repeat: matches 261..20 of consensus" repeat_region 21065..21188 /note="MIR repeat: matches 183..57 of consensus" repeat_region 21194..21288 /note="MIR repeat: matches 262..168 of consensus" repeat_region 22337..22370 /note="MIR2 repeat: matches 138..102 of consensus" CDS join(23181..23346,24326..24448,25298..25408,27891..28113, 28661..28845,31500..31679,33423..33587,34150..34418, 35517..35679,39822..39985) /codon_start=1 /product="5-aminolevulinic acid synthase" /db_xref="PID:e304606" /db_xref="PID:g1869773" /translation="MLLQCCPVLARGPTSLLGKVVKTHQFLFGIGRCPILATQGPNCS QIHLKATKAGGDSPSWAKGHCPFMLSELQDGKSKIVQKAAPEVQEDVKAFKTDLPSSL VSVSLRKPFSGPQEQEQISGKVTHLIQNNMPGNYVFSYDQFFRDKIMEKKQDHTYRVF KTVNRWADAYPFAQHFSEASVASKDVSVWCSNDYLGMSRHPQVLQATQETLQRHGAGA GGTRNISGTSKFHVELEQELAELHQKDSALLFSSCFVANDSTLFTLAKILPGCEIYSD AGNHASMIQGIRNSGAAKFVFRHNDPDHLKKLLEKSNPKIPKIVAFETVHSMDGAICP LEELCDVSHQYGALTFVDEVHAVGLYGSRGAGIGERDGIMHKIDIISGTLGKAFGCVG GYIASTRDLVDMVRSYAAGFIFTTSLPPMVLSGALESVRLLKGEEGQALRRAHQRNVK HMRQLLMDRGLPVIPCPSHIIPIRVGNAALNSKLCDLLLSKHGIYVQAINYPTVPRGE ELLRLAPSPHHSPQMMEDFVEKLLLAWTAVGLPLQDVSVAACNFCRRPVHFELMSEWE RSYFGNMGPQYVTTYA" repeat_region 23535..23833 /note="AluSg repeat: matches 1..300 of consensus" repeat_region 25940..25973 /note="17 copies of 2 mer 100 % conserved" repeat_region 26272..26592 /note="L1MD2 repeat: matches 632..299 of consensus" repeat_region 26698..26857 /note="L1 repeat: matches 5366..5210 of consensus" repeat_region 27119..27304 /note="MIR repeat: matches 71..261 of consensus" repeat_region 27379..27546 /note="MIR repeat: matches 236..65 of consensus" repeat_region 30597..30639 /note="MIR2 repeat: matches 146..104 of consensus" repeat_region 31730..31759 /note="10 copies of 3 mer 90 % conserved" repeat_region 31765..31952 /note="MIR repeat: matches 261..65 of consensus" repeat_region 32197..32332 /note="MIR repeat: matches 2..141 of consensus" repeat_region 32625..32799 /note="MIR repeat: matches 25..213 of consensus" repeat_region 33129..33212 /note="42 copies of 2 mer 91 % conserved" misc_feature complement(34474..>34749) /note="match: STS G21476" repeat_region 36354..36658 /note="AluSg repeat: matches 299..1 of consensus" repeat_region 37060..37137 /note="MIR2 repeat: matches 146..84 of consensus" repeat_region 37085..37277 /note="MIR repeat: matches 262..68 of consensus" repeat_region 38222..38361 /note="MIR2 repeat: matches 143..1 of consensus" repeat_region 40576..40872 /note="AluSq repeat: matches 302..1 of consensus" prim_transcript <41396..42876 /note="match: 3' EST C01178; match: 3' EST N59092 clone 246817; Paired with EST N59497 matching this clone; match: 5' EST N59497 clone 246817; Paired with EST N59092 matching this clone; match: 5' EST N59517 clone 246865" repeat_region 42730..42813 /note="MIR repeat: matches 142..56 of consensus" repeat_region 43204..43266 /note="MIR2 repeat: matches 146..85 of consensus" repeat_region 44217..44346 /note="MER5B repeat: matches 178..50 of consensus" repeat_region 45726..45823 /note="MIR repeat: matches 216..116 of consensus" repeat_region 46525..46614 /note="MIR repeat: matches 145..47 of consensus" repeat_region 47153..47348 /note="MIR repeat: matches 205..1 of consensus" repeat_region 49817..49940 /note="MIR2 repeat: matches 21..145 of consensus" prim_transcript 50369..>50915 /note="match: multiple ESTs; match: T67107 R08790 T67108 R08791 T98932; match: T98977 W87537 T67104 W87619" repeat_region 51095..51201 /note="MIR repeat: matches 80..180 of consensus" repeat_region 53448..53655 /note="MIR repeat: matches 208..1 of consensus" CDS join(55158..55254,85781..85906,88246..88339,89270..89336, 90238..90312,90801..90857,92889..93010,97051..97258, 99934..100080,103612..103716,111430..>111559) /codon_start=1 /product="6-phosphofructo-2-kinase" /db_xref="PID:e304633" /db_xref="PID:g1869774" /translation="MSPEMGELTQTRLQKIWIPHSSGSSRLQRRRGSSIPQFTNSPTM VIMVGLPARGKTYISTKLTRYLNWIGTPTKVFNLGQYRREAVSYKNYEFFLPDNMEAL QIRKQCALAALKDVHNYLSHEEGHVAVFDATNTTRERRSLILQFAKEHGYKVFFIESI CNDPGIIAENIRQVKLGSPDYIDCDREKVLEDFLKRIECYEVNYQPLDEELDSHLSYI KIFDVGTRYMVNRVQDHIQSRTVYYLMNIHVTPRSIYLCRHGESELNIRGRIGGDSGL SVRGKQYAYALANFIQSQGISSLKVWTSHMKRTIQTAEALGVPYEQWKALNEIDAGVC EEMTYEEIQEHYPEEFALRDQDKYRYRYPKGESYEDLVQRLEPVIMELERQENVLVIC HQAVMRCLLAYFLDKSS" repeat_region 56654..56869 /note="MIR repeat: matches 29..258 of consensus" repeat_region 57241..57468 /note="L1ME3 repeat: matches 684..434 of consensus" repeat_region 57294..57468 /note="L1ME1 repeat: matches 632..434 of consensus" repeat_region 57362..57468 /note="L1ME3A repeat: matches 548..434 of consensus" repeat_region 57578..57623 /note="23 copies of 2 mer 94 % conserved" repeat_region 57624..57653 /note="15 copies of 2 mer 97 % conserved" repeat_region 59917..60242 /note="L1MB6 repeat: matches 921..581 of consensus" repeat_region 60250..62254 /note="L1 repeat: matches 5052..3097 of consensus" repeat_region 62301..62849 /note="L1 repeat: matches 2810..2247 of consensus" repeat_region 62858..62991 /note="FLAM_C repeat: matches 133..4 of consensus" repeat_region 64789..65122 /note="L1 repeat: matches 4310..3978 of consensus" repeat_region 65121..66002 /note="L1PA12 repeat: matches 22..912 of consensus" repeat_region 66230..66421 /note="L1 repeat: matches 3990..3781 of consensus" repeat_region 66430..66742 /note="MER2 repeat: matches 1..345 of consensus" repeat_region 66644..67178 /note="L1 repeat: matches 3860..3339 of consensus" repeat_region 67592..67852 /note="L1 repeat: matches 1972..1706 of consensus" repeat_region 68408..68453 /note="23 copies of 2 mer 100 % conserved" repeat_region 68820..68849 /note="15 copies of 2 mer 93 % conserved" repeat_region 68881..69406 /note="MLT1E repeat: matches 568..59 of consensus" repeat_region 69177..69478 /note="MLT1F repeat: matches 292..6 of consensus" repeat_region 69981..70506 /note="L1MC3 repeat: matches 2486..1938 of consensus" repeat_region 70522..70634 /note="MLT1G repeat: matches 123..10 of consensus" repeat_region 70642..70992 /note="MER42c repeat: matches 787..417 of consensus" repeat_region 70996..71372 /note="L1MC1 repeat: matches 1079..701 of consensus" repeat_region 71330..72919 /note="L1MC3 repeat: matches 1620..3 of consensus" repeat_region 73160..73491 /note="L1 repeat: matches 5042..4712 of consensus" repeat_region 73528..74448 /note="L1PB2 repeat: matches 902..1 of consensus" repeat_region 74299..75295 /note="L1 repeat: matches 5390..4373 of consensus" repeat_region 75323..78076 /note="L1 repeat: matches 2643..5390 of consensus" repeat_region 77927..78819 /note="L1PA2 repeat: matches 1..893 of consensus" repeat_region 78921..79434 /note="L1 repeat: matches 3333..3849 of consensus" repeat_region 79456..79884 /note="MSTD repeat: matches 1..392 of consensus" repeat_region 79893..80236 /note="L1 repeat: matches 3884..4233 of consensus" repeat_region 80253..80551 /note="AluSx repeat: matches 300..1 of consensus" repeat_region 80691..81580 /note="L1PA2 repeat: matches 891..1 of consensus" unsure 81260..81330 repeat_region 81431..81801 /note="L1 repeat: matches 5390..5019 of consensus" repeat_region 82410..82465 /note="28 copies of 2 mer 80 % conserved" repeat_region 83619..84212 /note="L1 repeat: matches 1965..1363 of consensus" repeat_region 84424..84625 /note="L1PA16 repeat: matches 704..904 of consensus" repeat_region 86265..86359 /note="MIR repeat: matches 146..47 of consensus" repeat_region 86646..86731 /note="MIR2 repeat: matches 113..24 of consensus" repeat_region 87917..88106 /note="MIR repeat: matches 7..198 of consensus" repeat_region 88417..88604 /note="2 copies of 94 mer 100 % conserved" repeat_region 89385..89635 /note="MIR repeat: matches 257..9 of consensus" repeat_region 89908..89999 /note="MIR repeat: matches 64..162 of consensus" repeat_region 91101..91252 /note="MIR repeat: matches 187..29 of consensus" repeat_region 92738..92795 /note="MIR repeat: matches 84..145 of consensus" repeat_region 93355..93495 /note="MIR repeat: matches 190..53 of consensus" repeat_region 94631..94930 /note="AluSx repeat: matches 302..2 of consensus" repeat_region 95608..95705 /note="MIR repeat: matches 85..185 of consensus" repeat_region 96257..96324 /note="MIR2 repeat: matches 69..143 of consensus" prim_transcript <96387..96600 /note="match: EST F17539" repeat_region 98092..98278 /note="MIR repeat: matches 258..46 of consensus" repeat_region 99699..99740 /note="21 copies of 2 mer 91 % conserved" repeat_region 100177..100278 /note="MIR2 repeat: matches 142..40 of consensus" repeat_region 100933..101228 /note="AluJb repeat: matches 1..297 of consensus" repeat_region 102286..102433 /note="MIR repeat: matches 245..96 of consensus" repeat_region 103121..103152 /note="16 copies of 2 mer 88 % conserved" repeat_region 103409..103462 /note="27 copies of 2 mer 93 % conserved" repeat_region 105387..105685 /note="AluY repeat: matches 299..1 of consensus" repeat_region 105764..105914 /note="L1 repeat: matches 1902..1744 of consensus" repeat_region 106758..107649 /note="L1PA2 repeat: matches 893..1 of consensus" repeat_region 107500..110779 /note="L1 repeat: matches 5390..2110 of consensus" repeat_region 111234..111305 /note="MIR repeat: matches 154..83 of consensus" repeat_region 111630..111859 /note="MIR repeat: matches 5..262 of consensus" prim_transcript complement(111753..>111994) /note="match: 3' EST W90029 clone 418071" BASE COUNT 31178 a 23416 c 22613 g 34787 t ORIGIN 1 gatcaagtgg gcttcatccc tgggatgcaa ggctggttca acatatgcaa atcaataaat 61 gtaatccagc atataaacag aaccaaagac aaaaaccaca tgattatctc aatagatgca 121 gcaaaggcct ttgacaaaat tcaacaaccc ttcatgctaa aaactctcaa taaattaggt 181 attgatggga cttatctcaa aataataaga gctatctatg acaaacccac agccagtatc 241 atactgaatg gacaaaaact ggaagcattc actttgaaaa ctggcacaag acagggatgc 301 cctctctcac cactcctatt caacatagtg ttggaagttc tggccagggc aatcaggcag 361 gagaaggaaa taaagggcat tcaattagga aaagaggaag tcaagttgtc cttgtttgca 421 gatgacatga ttgtatatct agaaaacccc atcatctcag cccaaaatct ccttaagctg 481 ataggcaact tcagcaaagt ctcaggatac acaatcaatg tgcaaaaatc acaagcattc 541 ttatacacca ataacagaca aacagagagc caaatcatga gtgaactccc attcacaatt 601 gcttcaaaga gaataaaata cctaggaatc cagcttacaa gggatgtgaa ggacttcttc 661 aaagagaact acaaaccact gctcaatgaa ataaaagagg ataaaaacaa atggaagaac 721 attccatgct catgggtagg aagaatcaat attgtgaaaa tggccatacc tcccaaggta 781 atttatagat tcaatgccat ccccatcaag ctaccaatga ctttcttcac agaattggaa 841 aaaactactt taaagttcgt atggaactga aaaagagccc gcatcaccaa gtcaatccta 901 agccaaaaga agcatcacgc tacctgactt caaactatac tacaaggcaa cagtaaccaa 961 aacagcatgg tgctggtacc aaaacagaga taaagaccaa tggaacagaa cagagccctc 1021 agaaataatg ccgcatatct acagctatct gatctttgac aaacctgaga aaaacaagca 1081 atggggaaag gattccctat ctaataaatg gtgctgagaa aactggctag ccatatgtag 1141 aaagctgaaa ctggatccct tccttacacc ttatacaaaa attaattcaa gatggagtaa 1201 agacttacat gttagaccta aaaccataaa aaccctagaa gaaaacctag gcaataccat 1261 tcaggacata ggattgggca aggacttcat gtctaaaaca ccaaaagcaa tggcaacaaa 1321 agtcaaaatt gacaaatggg atctaattca actaaagagc ttctgcacag caaaagaaac 1381 taccatcaga gtgaaacaaa caaccccatc aaaaagtggg caaaggacat gaacagacac 1441 ttctcaaaag aagacatttg tgcagccaaa agacacatga aaaaatgctc atcatcactg 1501 gccatcagag aaatgcaaat caaaaccaca atgagatacc atctcacacc agttagaatg 1561 gcgatcgtta gaaagtcagg aaacaacagg tgctggagag gatgtggaga aataggaaca 1621 cttttacact gttggtggga ctggaaacta gttcaaccat tgtggaagtc agtgtggcga 1681 ttcctcaggg atctagaact agaaatacca tatgacccag caatcccatt actgggtata 1741 tacccaaagg attataaatc atgctgctat aaagacacat gcacacgtat gtttattcac 1801 aatagcaaag acttggaacc aacccaaatg tccaacaatg atagactgga ttaagaaaat 1861 gtggcacata tgcactatgg aatactatgc agccataaaa aatgatgagt tcatgtcctt 1921 tgtagggaca tggatgaagc tggaaaccat cattctcagc aaactatcgc aaggacaaaa 1981 atccaaacac cgtatgttcc cactcatagg tgggaattga acaatgagaa cacacggaca 2041 caggaagggg aacatcacac accggggcct gttgtgggat ggggggaggg gggaggtgta 2101 gcattaggag atatacctaa tgttaaatga cgagttaatg ggtgcagcac accaacatgg 2161 cacatgtata catatgtaac aaacctgcac gttgtgcaca tgtaccctaa aacttaaagt 2221 ataaaaaaaa agtgaatctc tgcaaaagtt atactgtctg acacatagga agcaactagc 2281 aaatattctg ttcatttaat ttcaattctc tttatgaatt ccttttctgt gtctccaagg 2341 ccattgttgc agttcaggct tcatcatctg catcctcccc actggatgtt ggcatgatct 2401 ttctatccca gaaatatggc catgtcactc tgctgtttag actccttcag tggctcctca 2461 acgttgccag gaacaacaac aaagtatttt acaagatcta taaagttggt cataaactgt 2521 cttctgctta tgtcctgtgt cttacctgtt cacacttttc tgtaagcttc ctttagtcca 2581 cctatgcttg ctttcttcct ggtctttaca ttcttttctc tggctagaat tatcttctcc 2641 aatccctctc ttttttccct tatataatcc ataggttcac agaatccatc tttgttgttg 2701 ttgttttggt ctagcttact aactcttgct tgtcctttaa gactcaattc agggccaggc 2761 gtagtggctc acgcctgtaa tcccagcact ttgagaggcc gaggcgggca gatcacctga 2821 ggtaaggagt ttgagaccag cctggctaac atggtgaaac tccgtctcta ctaaaaatat 2881 aaaatttagc tgggcgtggt gacaggtgcc tgtaatccca gctactcagg aggctgaggc 2941 aggagaattg cttgaacccg ggaagtggag gttgcagtga gccaagatcg tgccactgca 3001 ctccagccca ggcgacagta tgagactctg tctaaaaaat aataaaataa aataaaataa 3061 ataaaaataa aagagattca attcaaatat cacctattcc agaaatcttc cttaagttgc 3121 tctcttgtga tagtggcaga tgtttctcct ccgtgcctct atgatttcct gtttatgtgc 3181 ctgtttctcc tacaagacct tgagcccttc aaggacagac ttcttatgtt atttgacttc 3241 tttgtatccc cagtaaccta aaatagaccg tgcaacgtag cagccattcc acaaatactt 3301 gttgaaggaa tgtatattga aaggaacatg tagtgttttt ctgtttcttg ttttcaaaca 3361 gtgggggtac tggttagcta gctgtggctt agcaaaattt acagggtttt ctggaggtga 3421 cactgcttag atgtctcaga cttgggcctc tatcttcagc ctccatacat gtgttcctgt 3481 atctgaggat ttccatcctc ttaggttcct aggaacctat ttctgagttc tcagaaggtc 3541 ctgtctaaat ttttcctgca aattctcctg tatggctagg gatgggtttt aagagggctc 3601 agcatgccaa agccaattac cacatatacc tatttttttt tcagatgata agactgacag 3661 caagaattct attccaaagc aagacatgtt agtcatagaa aaatgtctag agatgttaca 3721 agcagggtca ttccttaggc ccccaactct ggaacaactt ctgagtatta aggcaagtca 3781 gagtgctagg ttgaacatag cgtgatatgt gtttccttcc aagaaaactg ttttggttct 3841 gtcactgaac agccagttct tcctggggga gagttcctga aataatatcc acgagaagaa 3901 ctgaagcctt atgacccaaa ggagagcagc tatcggctag agagtctgga gtagaaattc 3961 acaaaatttt atcatcatac cgaactctga attttttagg tagtcagata attagtctta 4021 aggaaataca ttaaaagtat aactagtgtt taaaagcttt taggcagaat tcaattctta 4081 ttctccgcca gagaattcac agtaggaaaa agtcctataa atgcagtgaa tatgcaagga 4141 taatcatcag gagctctaaa cataaaagaa tcaatagtag agacaaatcc tagggatata 4201 gtgtctatgg gaaaaacttt tacttggagc tcaaactgtg taaacttcaa gattattcac 4261 agtagagaga agccctttga gtgtcatcta tgcatcaggg ccttaattta ggcctcaggc 4321 ctgattatgc atcagcaatc tatagcagag aaaaacttta tgaatgtcat gcatatgaca 4381 agccttaacc aggacaacct tattaatcac cagaaaattc atgacagaag tcattgtagt 4441 agatactatg actattgatt cctcctaatt caagcctggt gtcttgcctg gactaatctc 4501 ttcactggtc tgtttacctc caattcttac tccttgttac accagaactt ttttttttta 4561 ttttaatagg tttttgggga ataggtgatg tttcattaca tggataagtt ccttagtggt 4621 gatttctgag attatggtgc atgcatcacc caagcagtgt acgctgtact caatgtgtag 4681 tcttttatcc ctcaccccat ccacccttcc ccccaagtcc ccaaagtcca ttgtatcatt 4741 cttatgcctt tgcatcctca tagcttagat cccacttata agtgagaaca tacaatattt 4801 ggttttccat tcctgagtta ctttacttag aataatggtt gctgtgaatg ccattatctc 4861 attccttttt atggctgagt agtattccat ggtgtgtata caccacattt tctttatcca 4921 ctcattggac ggtgagcatt taggctggtt ccacagtttt gcttttgtga ttgtgctgct 4981 ataaacattc atgtgcaagt gtccttttca tataatgact tttttttttc atttgatcaa 5041 atggtagatc tacttttagt tctttaagga atctccacac tgttttccat agtggttgta 5101 atagtttaca tttccaccaa cagagtaaaa gtgttccctt tttaccacat ccatgccaac 5161 atctattatt attattatta ttattattat tattatttag agacagagtt tcgctcttgt 5221 tgcccaggct ggagtgcaat ggtgcgatct tggctcaccg caacctctgc ctcctgggtt 5281 caagtgattc tcctgcctca gcctcccaag tagctgggat tacaggcata caccaccaca 5341 ctcgcccgat tttgtatttt tagtagagcg ggcgtttctc caggttggtc attctgtctc 5401 aaactgccga cctcaggtga tccgcccacc tcagcctccc aaagagctgg gattgcaggc 5461 aagagccact gcgcccggcc ctattatttt ttgattttta aaattatggc cattctgcag 5521 gagtaaggtg gtatcacatt gtggttttga tttgcattta tctgataatt agtgatgttg 5581 agcatttctt catatgtttg ttggccattt atatatcttc ttttgagaac tttctattta 5641 tggacaatac tttcggatgg gatttttttt tttcttgctg atttgtttga ggaccctgta 5701 gattctggat attagtcctt tgtcgaatgt gtagattgtg aagattttct cccactctgt 5761 ggattgtcta tttactctgc tgattatttc ctttgctgag cagaaacttt ttagtttaat 5821 taagttgcat ctatttatct ttgtttttgt catgtttgct tttgggcact tggtcatgaa 5881 gtctttgcct aagccaatgt ctagaagagt tttcctcatg ttatcttgta caatttttat 5941 ggtttcaggt cttagattta agtcctatct tgagttgatt tttgtataac gtgagagatg 6001 aggacccagt tttattctcc tacatgtggc tcgccaatta ttccagcaac atttattgaa 6061 taggatgtcc tttccctact ttatgttttt gtttgctttg ttaaaggtca gttgactcta 6121 agtatttggc tttatttctg ggttctctgt tctgttccat tgatatatat gcctattttt 6181 ataccagtac catgttttga tgaccatggc cttataggat agtttgaagt caggtaatat 6241 aatgcttcca gatttgttct ttttgcttaa tcttgccttg gctatgtgga ctcctttttg 6301 ttccatatga attttaggat tgttttttct agttctgtga agaatgatga tggtattttt 6361 atgggaattg cattaaattt gtagattgca tttggcagta tggaaatttt cacaatattg 6421 attctaccca ttcataagca tgtgatgtgt ttccatttgt ttgtgtttct atgatttctt 6481 tttttttttt tttttttttt tttttttttt ttgagacgga gtctcgctct gtcgcccagg 6541 ctggagtgca gtggcgggat ctcggctcac tgcaagctcc gcctcctggg ttcacaccat 6601 tctcctgcct cagcctccca agtagctggg actacaggcg cccgccacta cgcccggcta 6661 attttttgta tttttagtag agacggggtt tcaccgcttt agccaggatg gtctcgatct 6721 cctgacctcg tgatccgccc acctcggcct cccaaagtct atgatttctt tcagcagtgt 6781 tttatagttt tccttgtaga ggtattttac cttcttgatt aggtatattc ctaattattt 6841 tattttattt tttgcaactg ttgtaaaagg ggttgcattc ttgatttgat tctcagcttg 6901 gtcgctgttg gtgtatagca gagctactga tttgtgtaca ttaattttgt atcctgaaac 6961 tttgctgaat tcatttatca atcctaggag atttttggag gagttagtag ggttttctag 7021 gtataggatc atatcatcag caaacagtga ctgtttgatg tcctctttac tgattttgat 7081 gccgtttatt tctttttctt atttgattgc tctggctagg acttccagta ctatgttgaa 7141 tagaagtggc gaaagtgggc atccttgtct tgttccagtt ctcagggtaa tgttttcaac 7201 ttttccccat tcagtataat gttggctgtg gatctatcat agatggcttt tattacctta 7261 agttacatcc cttctatgcc aattttgctg ccgggtctga gctccttcag ctgcacctgc 7321 tcctccaggg ccatggttgt gaagaagatt gaaacccgtg atgggaagct ggtgtctgag 7381 tcctctgatg tcctgccgat gtgaaccgcc acggcagccg ctcccagcct acccctactg 7441 tggctgcccc agagcctgtg ggggagacca ctgtgtgggg gagcataggg gacaggagac 7501 ccaccagagg ctcagcccta gccctcagcc cacatgtagg ggcagtttac tgcctggggt 7561 acttgccttg cccatgcctt cagctacaaa acaattcagt tgggtttttt ccaaaataaa 7621 acctcagcta gctctgccaa ctgtcaaaca acagcaacaa caacaacgaa acaaagaaat 7681 aaaataccag cttaaccttt taacttgtat attattagga aaaaattgga aattataaaa 7741 ataaaccagt atttttagcc tttgtatttc aataagaatt attgaggccg ggcacggtgg 7801 ctcacgcctg taatcccagc actttgggag gctgatgggg gcggatcacg aggtcaggag 7861 atcgagacca tcctggctaa catggtgaaa tcccgtctct gctaaaaaca taaaaaatta 7921 gccgggcatg gtggcaggcg cctgtagtcc cagctactca ggaggctgag gcaggagaat 7981 ggcatgaacc cgggaggtgg agactgcagt gagccgagat cacaccactg cactccagcc 8041 tgggcgacag agaaagactc cgtctcaaaa aaaaaaaaaa aaaaaaagaa ttattgaaca 8101 cctactaatg ctaagtgtaa cactgtatgg gagattaagt tctaccagag atgctgccag 8161 tctgaatagg taaatagcat taaataccat ttcacaatgt agaaactaaa ggtagttaaa 8221 tagatacagc taaagtacta tagaaggtca gacctatcag agatcaggtt cagatttggg 8281 ggaagaacgc aagcaaacct tacagcattt tatataggtt ttgttgatta ttctttttag 8341 tcatggtaaa atatacttaa cataaaattt accatattta aatatagtta gttgagtggt 8401 taagaacaat cacattgttg tacaactatc ataatcatct atctctagaa ctttttcatc 8461 ttcccaaatg gaaactctgt atccttaaac agtaacttcc attctcccca ccgccccgcc 8521 cctgacaacc accattcttt ctatctctat gaatctgatt gctctaagta cctaacttat 8581 gtaagtggaa tcatacaatg ttcatccttt gtgactggct tatttcacat agcataatgc 8641 ttttaagatt catccatgtt gtagcatgtg tcagaatttt cttccttttt ggatgagttc 8701 atgccctttg tagggacatg gatgaagctg gaaaccatca ttctgagcaa actatcacaa 8761 ggacagaaaa ccaaagactg catgttctca ctcataggtg gggattgaac aatgagaaca 8821 cttggacaca gggtggggaa catcacaccc cggggcctgt cgtggggtgg gggagggggg 8881 aggaatagca ttaggagata ttcctaatgt aaatgacgag ttaatggtgc agcacaccaa 8941 catggcacat gtatacatat gtaacaaagt tgcacgttgt gcacatgtac cctagaactt 9001 aaagtataat aataattaaa aaaaagagag aagtagctaa agtagacata tcctgatcca 9061 aaattttaaa aatagattaa aatccgcctc tggtttatga tctcttcaaa aaaaaaaaaa 9121 gaaaatattg agttccttta agtgaattga cagtagtggt gttttgttta atacgtaata 9181 tacttaagct atattttcaa tttttttatt attttcctga tgcttttaaa caaaaacaca 9241 aattcctacg tcataaagca tggtccattg gcaaatggaa agcaataagc aagtgacagt 9301 gaagaagcta ttgaaaagaa caaggacatg ggccagcccc gggctgaatc ccagcctggc 9361 ccacactgtc tatgtgacct taggcagtga ctcagttctc taagaccagg tttcctcttc 9421 tggaaaagca gtatttgcaa aatcccatga gaaatcgaca agataataat atttgcaaag 9481 aatccacact gctccgggca ccataaatag taacagtagc atgtaacaat aacattcatc 9541 tcagaaaaat gtggcatcag caaagaactg gaaatgcttg ttttcagggt taaaaaaata 9601 ccattagggt ttcttcacta aataaactag taattttagt ggcccaatct ggaggcctgt 9661 ggtttaaaaa aaaaaaaaaa agaattttct tccttttcaa ggctgaacag tattctttgg 9721 tatgtatata ccacattttg tttatccatt tatccttcaa tggacatttg ggttgtttcc 9781 accttttggc tattttgact aatgctgcag acattacttt taaatatatc tcagttccat 9841 tcacttctgt ccatcttcat cgctactgtg aaagctgatt atatgaattg ggtcattctt 9901 gtcataccca actaaaacag agtcaagaag ccaggggggg aaaatcactc agggcacata 9961 acattgctcc aaaaatataa ttctctgcaa gcctggctgc tgaaactgcg tgctgtaacc 10021 tgaaagcagt tttatctcat ggctactgaa acagcctgct gcaactttta cctactacga 10081 ccactcaaca ataagagctt gccacctccc caaaaatctt actagtgcca atgtactgtc 10141 tcaaagagca atgtgtagta tttctatttt taaataaaac ctctaaattt ctctttgttc 10201 tttggacata ctgaagaaca cctggtctgc ctgtatactc caaattgcaa ttctttcttc 10261 ccaaattaaa agttttaatc tcagagattt gtctctatat tttatttgac gtcagtgcta 10321 cctccttatt ccaggttacc atcaactcac acatgaacta gtacaacaaa cttctaacca 10381 taaaatatct aattttaaaa tactaaatat aagaatatta aatatctaat attggatcta 10441 aatgcatttg aagcatcata cctagtaaat tatgaaatta caaaaacttc agaaaattca 10501 actccatcat cttatttatt tatacctgcc tcttcaagga taggtcaatt tcactgattg 10561 aaagtaagag acagctaaac cctcatgtgc cattcataca agtccctatt taaacttccc 10621 catttaaaag ttatcaacaa gtgtaagttg acaatttgat tatttccaaa aacatagtat 10681 ctcagtaaat atatgctagg tctaacagtg ttgagcctat cattttattg agagtaaata 10741 taacgaacat attcacattt tataacttgt tgagtccagt gatgttttta atccagccag 10801 tttaacctaa tcagctgcca actcatatag gatgggtgct ttaaaaattt ctttattgac 10861 aaatgatgct tttttaggca taaagtaaat atcacgcatc atttattaaa tggggaaagg 10921 caaaattttt tttacatatt atgaatatgt gaattatatg ggtagagagc agaattggat 10981 aagctttttt ctttccaggt tttcttgggt ggtggggcag ggagggattt gctggggata 11041 ggggaggatg tggatggaat ttgttgacag ttttcatctt gcagtttttc ttttttttat 11101 ttttattttt tattaatata ctttaagttt tagggtacat gtgcacaatg tgcaggtttg 11161 ttacatatgt atacatgtgc catgttggtg tgctgcaccc attaactcat catttagcat 11221 taagtatatc tcctaatgct atccctcccc cctcccccca ccccacaaca ggccccagtg 11281 tgtgatgttc cccttcctgt gtccatgtgt tctcattgtt caattcccac ctatgagtga 11341 gaacatgcgg tgtttggttt tttgtccttg tgatagtttg ctaagaatga tggtttccag 11401 cttcatccat gtccctacaa agaaatgaac tcatcctttt ttatggctgc atagtattcc 11461 atagtgtata tgtgccacat tttcttcatc cagtctatca ttgttggaca tttgggttgg 11521 ttccaagtct ttgctattgt gaatagtgcc gcaataaaca tacatgcgca tgtgtcttta 11581 tagcagcatc atttataatc ctttgggtat atacccagta atgggattgc tgggtcaaat 11641 ggtatttcta gttctagatc cctgaggaat cgccacactg acttccacaa tggttgaact 11701 agtttccagt cccaccaaca gtgtaaaagt gttcctattt ctccacatcc tctccagcac 11761 ctgttgtttc ctgacttttt aatgattgcc attctaactg gtgtgagatg gtatcccatt 11821 gtggttttga tttgcatttc tctgatggcc agtgatgatg agcatttttt catatgtctt 11881 ttggctgcat aaatgtcttc ttttgagaag tgtctgttca tatcctttgc ccactttttg 11941 atggggttgt ttgttttttt cttgtcaatt tgtttgagtt cattgtagat tctggatatt 12001 agccctttgt cagatgagta gattgcaaaa attttctccc attctgtagg ttgcctgttc 12061 gccctgatgg tagtttcttt tgctgtgcag aagctcttta gtttaattag atcccatttg 12121 tctgttttgg cttttgttgc cattgctttt ggtgttttag acatgaagtc cttgcccatg 12181 cctatgtcct gaatggtatt gcctaggttt tcttctaggg tttttatggt tttaggtcta 12241 acatgtaagt ctttaatcca tcttgaatta atttttgtat aaggtgtaag gaagggatcc 12301 agtttcagct ttctacatat ggctagccag ttttcccagc accatttatt aaatagggaa 12361 tcctttcccc attgcttgtt tttgtcaggt ttgtcaaaga tcagatggtt gtagatatac 12421 agcattattt ctgagggctc tgttctgttc cattgatcta tatctctgtt ttggtaccag 12481 taccatgctg ttttggttac tgtagccttg tagtatagtt tgaagtcagg tagcgtgatg 12541 cttccagttt tgttcttttg gcttaggatt gacttggcaa tgcgggctct tttttggttc 12601 catatgaact ttctcaatag atgcagaaaa ggcctttgac aaaattcaac aacccttcat 12661 gctaaaaact ctcaataaat taggtattga tgggatgtat ctcaaaataa taagagctat 12721 ctatgacaaa cccacagcca gtatcatact gaatggacaa aaactggaag cattcccttt 12781 gaaaactggc acaagacagg gatgccctct ctcaccactc ctattcaaca tagtgttgga 12841 agttctggcc agggcaatca ggcaggagaa ggaaataaag ggcattcaat taggaaaaga 12901 ggaagttaaa ttgtccctgt ttgcagatga catgattgta tatctagaaa accccattgt 12961 ctcagcccaa aatctcctta agctgatagg caacttcagc aaagtctcag aatacaaaag 13021 caatgtgtaa aaatcacaag cattcttata cacaaataac agacaaacag agagccaaat 13081 catgagtgaa ctcccattca caattgcttc aaagagaata aaatacctag gaatccagct 13141 tacaagggat gtgaaggact tcttcaagga gaactacaaa ccactgctca aggaaataaa 13201 agaggataaa aacaaatgga agaacattcc atgctaatgg gtaggaagaa tcaatatcgt 13261 gaaaatggcc atactgccca aggtaattta tagattcaat gccatcccca tcaagctacc 13321 aatgactttc ttcacagaat tggaaaaaac tactttaaac agattcactc ttttacactt 13381 cataaaaaca ctataaagta gatattatca gtatcttcat attacagaca aggaaactga 13441 aattcagaca ggtgaaataa cctcccaaag ttcacacagt taataaatgg cagaacccag 13501 ttaatctggc tcgagggctg atgttcttaa tcactgttca ttcttgcagt aacatttaga 13561 agaaccaagt accaggatga gatctcatct tccataaagt gacggagcaa gtaattccag 13621 actcttcctc ctccctcaac atctacaaaa ctcactgttc ctgaaatgca gtaaacatct 13681 taagacctcc ctgattttgt tgatattgta ccctctgcct ggattgcccc tttccctgtg 13741 ctttatccag taaactcatg cttcataacc catttaaaat atcatctgtg ctctgtagtg 13801 tttcctgaaa taccttaggg acttttaaac ttttttcctt ttgctcatat tgctttctct 13861 gcctgctgaa taggggattg agaaagaaaa cacattcttc tttaccaaaa gacaaaaact 13921 ataaatagtt actagctgta ttaagatata tgccaggtgt gatggttaat atcgagtgtc 13981 aacttgattg gattgaagga tgcaaagtat tgattttggg tatgtctgtg agggtgttgc 14041 ccaaaggaga ttaacatttg agtaagtggg ctggggaagg caaacccacc cttaacctgg 14101 gtgggtatca tctaattggc tgccagcaaa tataaagcag gcagaaaaac atgaaaaggc 14161 tagactggct tagcctccca gcctacatcc ttctcccatg ctgcatgctt cctgcccttg 14221 aacattggac tccaagttct tcagctttgg gactcagact ggcttcgttg ctcctcggct 14281 tgcagctggc ctattgtggg accttgtgat catgtgaatt aatactactt aataactccc 14341 atatatatat acatgtatat atatacataa atatatatat gtatgtatat gagagttata 14401 tataaactca cgtgtgtgtg tgtgtgtgtg tctcctatta gttctgtccc tctagagaac 14461 cctgactaat ataccaggac cttttttaat tctcctaata ggaagatatt cctactaagc 14521 atatagatta cttataaaag gaagagtatg agcctgacat tgaacatttt agctctaatt 14581 ataagtgtta ttaaacaatg aaataatatt tataacactc tgaggggtga aaaactggac 14641 tttggaattt tatacctaac caaaaatgat aagagaattc tctgtattac tctatgatat 14701 tttgaaaaat attgattact aaaattgata tgggaagaag ttttttaaaa aacgtataaa 14761 caaattatca tggaaaccat tggaaaagct gtctaagaac aacccagcta ccatcaccaa 14821 atggtattag ggcaaataat tatattggct ggtattacca aaacttacaa gaacagatgt 14881 gttatttaaa ttaaaatgga aactatagac caatcttaca tgaagaagaa accacagcac 14941 gtctgggaca tagctatcat gctactctct gagtttcatt ccagaatgtg ggtaagggat 15001 atagctaatg aaggagagag atgctgatag tgagaccaac ccttcaggaa gtaacccaac 15061 aaagcctgta tgtcttcctt aatccccact catcacaagt ggttgagact atcgttaact 15121 gggacaacct tttacagaga tattggataa aatgtaggaa aaaatcttta aaagatccat 15181 atcctttggt cgaataatcc cattactaaa aatcaatact aatatttgtg gtgatggttg 15241 cacagttctg taaatatact aataatcatt gaattacaca tttaaaatgg gtgagtttta 15301 tggtttataa ataaactttt agaaaatcta tgctaagaaa ataacctaag atgcagagaa 15361 aactttaagt accatgatgt ctactgcagt gcaactgcaa agtaaaaagt tgaaactcct 15421 aagtcaccga agtctattca atcaaacatt gtctgaatag ttacttattg tcaggttgac 15481 actggggttg cagaaatgaa tcagacctag actctgattc cagaggaatt cccacatggg 15541 taggaagtca gccaagacac agaacaaatc aatgcagaga gagagagggg actatgtaga 15601 gaagggccag gaaagtgttt cctttggaca cgctacatga gcatatggga ctgaaactta 15661 cttctccctc cagactacat tgcctcataa aaaggtgaga atcacacaac caccacccaa 15721 agagaaaatg agacattttc ctctaaagga gaggagatat caggcacaac cacagccacc 15781 tacactcacg ggctcagctg gaccaggaat agtacagaga ggcaccacct aagtatcctt 15841 cccaatgacc tgggaagtcc cacacacttt cctccattag ggcacttcga gtttttctct 15901 acgtcacctc agaggtttgt taacctcagt agattttatt cttccagttc cttttctggg 15961 tacaggcaga acagagtaga aagaacggag gaaatgcatc tgaatgcaac aggcatttct 16021 gacagccccc ctagtcaatg gttccagatt cctgtcagca agttgctcaa ggtctgatgt 16081 gggaggaaga cataagacaa ggtgccagta gtacagtgtg ataaattctg agagagaaga 16141 tgtgggggcc cagaagaggg aacaagatcc agtgtggggg tttaaaaggg cctcccagaa 16201 gtgatgacct ttgagctcac ttttcaaagt gacaaatggt tagccagcta aagagttgga 16261 ggtcaaggtt attgcagctg catttgaaat accttctaga agaactggaa ttaatcttta 16321 gttttaaaaa tggttaaata caccatgcta tacccatact ctccaattta catgccactg 16381 catctccact acctagcacc acacctgaca tgctataggc actcaataaa catttgttga 16441 attaataaat taatccagat gaagcagcaa atatcacaga aaaggagtta atattaatat 16501 tattcaaagt tcttctcatc tcatcctcac aataatccaa tagggtaggt tctatgactg 16561 tcccatttta tatatgagag atctggagat aagtgaacca aatagctaag gtgttggtga 16621 cttggtgact tgggtgagtc tgactccaga gtgcatgctt ttagtccact atattttact 16681 gccaacccac ccccaaaatg ggcaaaaagt agaccactca gaagagaaat agacaactga 16741 taaagaaaca tgaaaagaca cccaatctca ttaataatga aagaaaagca aatcaaagca 16801 acactgatat accattttta gtgcctataa gattaacaaa tattttaaaa attgaaaata 16861 acaaatggta ttggtgagtt tgtgagaaaa tagatacaca tttactctta tacaccactg 16921 ggtggggagg ggtgtataac acatgtatag gacaaattgg cagtatcaaa aactttaaat 16981 gtctattacc tttagatcca ggaatgctta tattttagga tgccaccctc taaaaaaatg 17041 ctctcacaaa tgccaaaaga tgttcattgc aacattaatc ataatggagg ggggaaaatc 17101 cttgggaata acctaaattt atatcaataa gggagtgttt aaataaatta cagtgcattc 17161 agattttgta acctccatgg tatattggca tatggttgaa tgaaaaacaa acaaaaaaaa 17221 gaaattgcaa atcaatatgt agactatggt cttatttatg taaatataat attaatgatg 17281 acaatctgct ctaaagatga aattgtctgg ttataacatg gtcctgctgc cttaggattg 17341 ataagggaag gggaaaaaga gagtgacaaa aggggaagaa aatgaaaaag ggaggagaag 17401 gtaatagaaa agataagaca gagaacttca catttttaat tgtatattat gagaggaaga 17461 gaagagtgtg ggttctgtaa ctacattgcc taggttcata ggcgggctct gtactttcta 17521 gttgggtaat tatcaacaag ttacctcacc tctctgagcc tcagtttctt catctgtaac 17581 atagggataa tagtacctat cttttaaggt ccttatgtgg ctaaattaat tcatgtaaag 17641 cacttagtga agggtctggt gcattgtaag cactcagtca acagtaggtg ttatttattt 17701 gggccaggca ctgtggtatg cgttttccat acatgatatt accaaattgt tacaacaacc 17761 ggggatctag tctgtggaag ctgactgttt gagctcataa gcacaacact gtattctagg 17821 tctgactcca gagccctccc aacttgtgcc acagtcactt ctgggtttta ttgccctcct 17881 tttatagttg aggagatcta tagtcagaga ggtggttttg ctcaaggtat cccagctcta 17941 tgtggatggt ctgattccaa agcccaaatg agctaatttt actgtcctat agagagaaga 18001 aaatagggaa gagccagagc tgggggaggg ggtacactag agggaggggc tactttgggt 18061 tttatctcta gcaaggaagg gactgagata cctttggggc caatgcaggg ccaggccggc 18121 agaaggcagg gtgggtgggg ctgagtcaga ggagaaggga taaatgccag gtcctaaccc 18181 aagtacccac ctgtcattcg ttcgtcctca gtgcagggca acaggtaaga gctgctttca 18241 gcctggcacc ctatctctgg tctgccagct ggtctctcag ggctgtacac actgactctc 18301 tggtctgagt agatctgact ttttcctttg tttgtttctt agaatctgtc tctttttcat 18361 tttcttttta tctcccatgt ctctttctgt ctttcctcat tttcagcttt tttctctctt 18421 tttcccttcg ttactttctt ttgttagttt tcaagatcat tcatttcatt tcatcattct 18481 ctgacactct tgctttctct tatttttccc tctgaattct aactatcttt ttctctaaat 18541 ttctttctct cccccttttt gtctctttcc tcggctttgt atctctccgt ctctgtgttt 18601 ctgtctctct cttcctctct atcaagaacg atggcttaat atttcttcct gcaattcccc 18661 attcctctct ccctttgact ccctctacct gctgggctga cagcagagct cagtgggtca 18721 gagcccatgg ggagcctagg ggtgggggaa gagctaggga gggaaactaa gaggatgtgg 18781 gggtgatggg aatgatgaat tgggtaagga gagatttggg gaattgagag atgaataatt 18841 agcagaaata agtgaagaaa gtggaagagg aatgtagtgt cactatacag aaagtaaaca 18901 gatttctatt ctcatcctaa ttcactgtga gaccctaggc aagtcattca ctctctgaaa 18961 aaaaggcttg gcctgtaatt tccaccaccc tttctagttt tgattttgtg atcttctaaa 19021 ttttcctgtt tctaagaatt tctgattctc tgattacagt tatctaaagt tctgtatgat 19081 tctttcatgg tgggaaaggg gtactaggaa gagaagtaag gcctgatgtt tccaactcct 19141 gaagagaaat taccacttcc cttccagacc taattgactt ttgcaaagca ggccacaaaa 19201 ggggtggggg ggtgggggac aaggaatgct gcaatgagtg ttttctggct gtctgctggg 19261 gtagagttgc agttggccct tttcacctct gggagtacag attgggtgct gacacaagag 19321 aggattttaa agtcgtaggg aaaaactttc agtaatgatc tgttacttgg tctcaaattt 19381 caccatcatc tctttggtta aaagtattgt tttaagaaga tgcctggcaa gcattatcac 19441 acattaggta cataagttat tgaatggtag agtaaatgaa tattcaacag tacctgaaat 19501 tccactgtag ttacagatct gttcctttgg taaggcattg gtgacaaatg gcatatgacc 19561 tggaaagagg cctatgttag tgcagcagag gagataaatg tctagagtca ggccctcagt 19621 caagaaaaaa aggtagtaat atttgaatca cagatccata atggttaagt taggaatctc 19681 tggaaacaga ttgcctaggt tcaaatcctg cttctcctat gtactagctt tctgatctag 19741 acaggttact taatcttttt gggattcagt ttccctatca tcacagggtt gacatgagaa 19801 cacggcctgg cacagagggc tctgtaagtg tttgactatc agaactaggc ggaatctatg 19861 aaattatcta gtccaatgtc agtggagaaa cggaagccca gagaggggaa ttacagagcc 19921 caagttcaca caataaattg taacaggatt gggacaagaa tcaattctct agcttcccaa 19981 acccagcctg gtatattcat gtgacttccc ttggctgtac gttcattttt tctacatggg 20041 aaatggagaa aataaaaata ataaagtcta tcaattaaat ataatattta acactttttt 20101 actgtttact ctgggatagg tactctgcta aatgctttat atggattatc ttactgaatc 20161 ttcacaacat tcctgtgatg cagattgtcc ttgttattac caacattttc cagatataag 20221 atgtacagca gggaagtgac ttttctaagg tcccaaagct agtgagtggt ggagccagga 20281 ttcaaaccca agtagtttgg ctctagagcc tatactcttt ataccctaaa ttgactaaaa 20341 tgcttccttg attcaatttt actcactcta gtctcttggt aggtaatgag atggaataga 20401 aacagagccc atggtaacta gactacaagg tcatgggtat aatgatggcc aggcagagtg 20461 aggcagagca aatttcagga aaggagtaac agaacaagag aaatgagaac aggagcttga 20521 aagaacttga gaattcaaca aattccaaga agtggtctat attttcccag gaccctgagc 20581 atatcatggc caaaagcccc ctagtaatga tgtgtgttaa tttctcctgt ttttatatac 20641 aggaggtagg tcttctccac catcccaagg caggactgga ctttgcctcc aatattgggg 20701 gctttccttc ccactacata ccccaatgtt gttggcatta ttgttgccag tattgatgtt 20761 aggggagttt acaggagcct ggagccttgt catctgcctt gcctgcactt ctgggccatc 20821 catttcttac caccaatagc cagggccagc tctagccaga tgctcagacg tgattccagg 20881 aaggggctcc tcttctctcc cacgccctgg tctcagcttg gggagtggtc agaccccaat 20941 ggcgataaac tctggcaact ttatctgtgg tctgcaggct cagccccaag tgctttagct 21001 ttcacaagca ggcaggggaa gggaaacaca tatctccaga tatgaggtag gcactggatc 21061 caattcctta cctaccttgt gaagtggcca taattacctc acgtttgaca gctgatgaag 21121 gccaagatcc agagagggga agtgatttga acaagaacat ccaacaatga aattggagag 21181 ctggaatttt aataagaaaa gctaacattt attgaagatt tactatgtgc caaaaactat 21241 actaaaggct taacttggat tgtttcattt agtccctcca acaacccttc tgtcttttcc 21301 aatttcaggg cccacatgcc ttggccccac ataccaaccc aggctgctgt gacagcccat 21361 gagaggggga gaggttgctc tgggatggaa caagaaaaag aggttgtttt gtgaggtacg 21421 gggagggtgc ttgttctatg agatcaggaa gggagggaga tgaaggaggt tgccatatga 21481 gggcagggcc atgagctgac ctgtccctca aaacataagg ctgagggtgc tagtagattc 21541 tactcagtaa ctttcttcac agtgtcagtg ctttagtctt ctcacattct cccatgtctc 21601 tcccattgta ctgtccctta tcttgtctca ctttttgact ctgtctttcc aatttgccct 21661 ttttctttac atctgtctct ccttcttgct ctctctagct gtctttctct tggtgtctct 21721 cagctctcac ccctcttaac cctcatcccc ctgctttagt cacctctctg tctctatcct 21781 ttgatcttgt cattttctct actctcttct ctctgtccct cagtctctct ctcatctccc 21841 tcaattaggg ccatgattct cttccctaaa cttacttagc cttttgcaat ttctggcagc 21901 atttttttat gtttgtgtct gactgactct ctacccctgc tggatcctct ccactcctgt 21961 tctcacttct atgaatcttt gtataatcct ctagactcat tgatccctcc tcatgtccct 22021 ttcgtgcccc ttggtctatc tgtctctgcc tttatccctg tgtgcactat caccaccccc 22081 tttttctttt ttcattttct ctttctctcg actcaatctc tgttttcatc tctaccctgc 22141 tccctttccc tctacctttg atctcttttt ccccctcaat ttctgttctt ttaactctac 22201 caccaccacc acatctttgt tctctctcta ctttcctcct tttatctttc ctaaattttc 22261 ttttcttctg gcttttctcc tagtcccttc tccttcctca atttcagact ctgttcattc 22321 atcaatttac cccaaaattc aacaaatatt tattgagtgc ctgtgtgtca tttgctttct 22381 ctttttctga tctctttgcc ccctttctct tctctgtctt ggcctctgcc tgtttcacta 22441 atccatagac tatgtctttg tccctgtttt ccagccccac tgggacttgc tttcacctct 22501 tcctatatct gtgcttatcc aagagacagg agcaaattca aagacagcat aatatcaggc 22561 tggtggtaca cattctgtag gacctagggc ctacccttcc ttccggatcc cttgatttcc 22621 ttaaactgat acatgtgacc tcaagctcct tctcccctct ggctgatcct gcttaggaaa 22681 caccctgggc caagcctcag gagctctact caatgacata tgtttgcatt agcaggctga 22741 atcttcactt ggctaagacc aacattctta gaaagattct tggccttaag tattgatcaa 22801 agggttagtg ggttggcagt tctcatcctg ccacacaaaa acacatttca gtgatcctca 22861 tcatcacaga ggtagtcagt gccagaatgt gagtcagaat ccaggctttc tgacctccag 22921 ttagaactgt ttccttcacc cctttgccca gtagtcagtt tcctatttct tcctccctca 22981 tgttttattg gtacatgtta acattgggaa agaagttctt tccctggaag ggcaataaga 23041 gcatctcgga ggcagcaagt tttgggtggg aagctgaaga cgaggatcaa aggcttggct 23101 ttttgccagg ccctcatgat ggaacctcat ctcttccatg tcttctgcag gactttaggt 23161 tcaagatggt gactgcagcc atgctgctac agtgctgccc agtgcttgcc cggggcccca 23221 caagcctcct aggcaaggtg gttaagactc accagttcct gtttggtatt ggacgctgtc 23281 ccatcctggc tacccaagga ccaaactgtt ctcaaatcca ccttaaggca acaaaggctg 23341 gaggaggtaa gaagaggctg ctagcaaaag gggagaatgt tagggtcctg gggtaaaagt 23401 tccaagttat actggccatc tttgcctaat aattaggacg gttcatgtga aaagtgtcaa 23461 gatagcatga actggcccca aaatataccc agaatctgtc ttctgccagg ttctctagaa 23521 agagtctcat tctcggccag gcacagtggc tcacgcctgt aatcccagca ctttgggagg 23581 ccgaggcgag tggatcacga ggtcaggagt tcaagaccac cctggccaag atggtgaaat 23641 cccatatcta ctaaaaataa aaaaattagc caggagtggt ggtgggcgcc tgtaatccca 23701 gctgcttggg aggctgaggc agagaattgc ttgaacccag gaggcggagg ttgcagtgag 23761 ccaagatcat gccactgcac tccagcctgg gcaacagagc gagaatctgt caaagaaaag 23821 aaaagaaaag aaaagaaaca gtctcactgt catgtccctc acacactata ctccagacat 23881 gctgaaacta cttaaaattg cctaaatcaa ctattctgtc aagagtttgt gcctttgctc 23941 ctgtcagatt accctctcct agaccctgta ctggagaatc tcatacttct catttgacac 24001 taagcttggc catcatctcc tctgcaaagc ctgcttagac ctccaaactg tctaattcca 24061 attctggctc atttcccctc cctcttctgg acttctgtag cccatgtact tcctctatcc 24121 cagcactgtt cacaatgtgt cttcagtgta tgccattccc accagtttag tagctcccct 24181 agcacaggga ccagactcat ctatctctgt gtctctacaa tagcctgaga tagggcttta 24241 ggggtacatt agatctcagc aattattgtt gagctgaact tatgactaga aatgcacccc 24301 aaattactct cttacctttg catagattct ccatcttggg cgaagggcca ctgtcccttc 24361 atgctgtcgg aactccagga tgggaagagc aagattgtgc agaaggcagc cccagaagtc 24421 caggaagatg tgaaggcttt caagacaggt tggagtcaag ttccacctta tgcaaccttt 24481 actcctaatg cttgaacaca ctacgtcaca gtcctgagct aggctaatac aaaagcagcc 24541 agtacacatc ccatgatgag aagtccagtc tttccagggg agccatggta ggcaacagtt 24601 taggctgtat gctgaagcac accatacctg acaaacacat atgtacgggc tcctgaaact 24661 tttagtcatt attctaagat gagccctcta gaattttgac tcctcttttt caggtggcta 24721 aactgatccc aacaggctgg ggtcccacat ttcagcaaga ccactctatg agaatatgga 24781 tttgcatgaa agagaaagag ctgggagtag gtacctcctt taaccagggt gcagatcccc 24841 aggtcaatta attagtgcag accacccaag ataatcaccc ttgagatatg gccacactgt 24901 tgacatcttt cataggcccc tttgggatat cattaaggac aaaaacttca aaatggaaat 24961 ttaatgatgt ttagaaaaga agagtaaggt acattatcct gcatctactt tctaaatgca 25021 ggacccaggg tggctgctcc agttacctga gccaagggaa aatcctagtg gagagaagta 25081 tgattcacct tatagaaggt ttcctaacaa tgtaatagtc tccattcggg gggataaata 25141 gaagctcacc ttggagaaga tttcttctcg ctgtagaagc tgcccttacc ttataaactt 25201 gaattttcat gtgttgcatt gagcttaaag aggacaacac atgctttctt tttcccccat 25261 tctcttcacg gccaatgaat ctcacattcc gtctcagatc tgcctagctc cctggtctca 25321 gtcagcctaa ggaagccatt ttccggtccc caggagcagg agcagatctc tgggaaggtc 25381 acacacctga ttcagaacaa tatgcctggt gagtttgctg aggtggaaaa aaaggggacc 25441 ggaataggga aggcattctg aaagggcctc tgtcacagta ggggaaacag tacagaaggg 25501 ccttggaacc aaaggaaatt tgagtttaaa atttaatgct ggcacttgct ggatctaggt 25561 gttttggcaa gtaagacact ttccttcagt ggcatttaat acctacctca ataggttacc 25621 atgagaagaa agtgaaatta catttatgga agtgtttcta atgaggcttc attaaatatt 25681 aggcttattt ccattatttc ttctctatgc ttccctcaaa aactttcacc cttcatacag 25741 caccttttcc ccattcttat atgtgtttat attcctttcc ataatgacat ttacattatt 25801 ttctaatgta aaaggaatat gattcatggt aaaatatttt tcaacatata caggaaagta 25861 taaggaggga aatttaagtc atgcagagtt ccaccattaa gtttttgtta tattttctcc 25921 cagatatttt tctatggcta cacacacaca cacacacaca cacacacaca caccctctgc 25981 tctcttcacc acacccatgc ttttgttaga agtgtgatct tattttacct ggagttcgtt 26041 atgctgtttt gttcacttaa aaatatgtca tgggtatagt atggattcaa tatcattcag 26101 ttaatcaagc atctataatt taagttgttt ccaatttttt gtattctctc agtttagatt 26161 gtaggttggt tttacataca tacaaatgta ctcaaagaaa atgtatagta ttactttttt 26221 caatttttat ttttacctaa taatatcttg ctatatattt tactctgtgc ccttttttca 26281 ctcaacaata tactgtggaa atgcttccac tttaacacat atgtatctac cttatttttc 26341 aatgcttcaa aatattttgt agtatagata taatagagat tatttggcta ctcctctatt 26401 tggttgcttc caattttttc tattacaaac agtggtgcaa caaacatcct tgaatgtatc 26461 tccttgtgta cacaggcaag tgtttctcca ggataaacac tcagtggtgg aaattcttgg 26521 gatgtaagga tgtgtacatt tttgatatta atacattttg tcaattagcc ctccaacatg 26581 gctgtaccag ttatcaagga gggtatccat agtctcatac ccttaccagc ccttgatatt 26641 atcaaacttt aaatctttat caattgatag gtgaaatttt gttttcccag ttttattttt 26701 cctgattaag aatctttttc tacatttatt gaattgtctg ttcatattct atgcccattt 26761 ttctactgag ttgaaatttt tcatgttaat ttttcagaga ttatataata aattctgagt 26821 atcaatcatt tgtctgttaa gtatgctgca aatatttctc tagatatgtc agtatgtgca 26881 tttaaaaaac ttttgatatg tatttccaaa catctctgca gcaaggatgt taccagtttg 26941 cacctccagc agccatataa attgctgtct gcaacatgat ttctgtctca cgtaaagagt 27001 tctagagttt aacaagctct ttggcaaacg ttatttcaat ttatcctaga aataaagtta 27061 ccccattttg tagtggtaat ggttaaagaa gtgggctctg agttacttac ttgatgaaca 27121 cttacttgct gcatgaccct ggtcaagttg tctaacactt aatgccccag ttccctcatc 27181 tgtaaaatgg agatactaat agaactgtcc atggagcatt gttgtgagga ataaattaaa 27241 tatttataaa gttcctagga aagaacttac atgtactagg cattcattaa atgttagcta 27301 taatgatgta attgaatatt agctatcttt attagtatta ttatgactac taatactata 27361 gcagtaataa tactactatt accatgtgcc atttattagt ttgaatatat tacatgttgt 27421 tggttgtcag atgctcacaa ctctccaagg aaagtattat tagcctcatt ctacaaataa 27481 agaaatttaa agtaagaaag aagattcatg acttgttcaa ggccacacag ctaggaagtg 27541 gcaaagagat cgctagaaac aagatctgtt gatactcctt ccagtgagac tgaaagcagt 27601 gattctagta aggaggctgc cacaccaacc cgggaagaga gatgaggcca taagaaagtc 27661 taaatgaatg tgtgaatgaa ctactgagtg aatgagtgaa tgagtaagca aaaggatggc 27721 tgaatgaagt agtagagagt taatgtggtc cataagtcaa tgactgagca aataaatgaa 27781 tatgtggaaa aagagttgga gaactcaaaa tcagcaacat gggtaaaata cagactagcc 27841 agggagagac ttaaaacgaa ttcttttcat cctcatatct gctcctgcag gaaactatgt 27901 cttcagttat gaccagtttt tcagggacaa gatcatggag aagaaacagg atcacaccta 27961 ccgtgtgttc aagactgtga accgctgggc tgatgcatat ccctttgccc aacatttctc 28021 tgaggcatct gtggcctcaa aggatgtgtc cgtctggtgt agtaatgatt acctgggcat 28081 gagccgacac cctcaggtct tgcaagccac acagtgagta gtaggctttc agccatcagc 28141 agtggccaga ggagatgaaa aaccacacat ggaaaaaaaa aaaaggcaga gctggcagtg 28201 gaaacttggg ttctatcacc acttcttttg tccaaggtcc tccatcatat ctattccttg 28261 gatatgaaat aagtcaacac accatgtttc ccaaactctt cggtgtccaa tgctatggag 28321 gggaaggatg ggagaccaag caaggcccac tctgcctgag tttttaatct agctgcagaa 28381 ttagtattgc cagagatgga gtgtgacttc ctctaggtct tccaaactac tcaagctcaa 28441 cctagcttct ccctctctcc ctgagtacct ccagtcctag aaggaaggca catgtctccc 28501 tatcctcccc atccttccct ctactttgtc tcataggaca cagtttatat aggatcacta 28561 actcaacatt gactcccatc aaggaagaga aacctaccca gttcctcgat gcctgacaag 28621 agtttctttt tctccttttc tcctgttttc tcctggccag ggagaccctg cagcgtcatg 28681 gtgctggagc tggtggcacc cgcaacatct caggcaccag taagtttcat gtggagcttg 28741 agcaggagct ggctgagctg caccagaagg actcagccct gctcttctcc tcctgctttg 28801 ttgccaatga ctctactctc ttcaccttgg ccaagatcct gccaggtaag cctgaggcct 28861 gagctttgtt cagggctggt atcctgcaat acagcatcca gtttcactgg ttccatcact 28921 ccttccctgt atttggagtt ccctcactcc cattgttctt ccttcttatc caccttgcat 28981 atcctcaaca ctggataatt atatccctct gctttctctc cttctgcacg tagagaggac 29041 cattaccggg gaacattacc ccacctcaca gaaaggaaac actataaatt catcacctcc 29101 caactcaact gagctcttaa cacacataca tagttatttt atgtctccac aggagctttt 29161 tcaaacttct tctcctcttc taaaacctct gactaccttc tcctccacac ttagcaaata 29221 acctcacatc ttacttcaca ataaaaacag aagccccaga cagagaatcc ttatttattg 29281 ccaccaaacc tacgaactta tctaattgtt tatctagcct tgcctcattc tttcctttta 29341 caatggaagg catatctctc cttctgccta aaaccaatcc cttcacttgt acactggttc 29401 ccatattccc agtctcctac tctctagtct gtaatgtcct cacctcatac gccttgttgt 29461 ccttccgcca aggcccaatc cagaatgaat acaaccctcc atcttcacta tatcaattcc 29521 gggctcatac agttgctcag acaggagtca ctaaaaattc atactcttaa cctctactgg 29581 gttctccatg gtctctgaca atcccatttc cctggtcagt tctcgaagtt tatggggcag 29641 ttttgccaaa ccaccattat cctcagcctt cccacacccc ctcctcccca tctccctcag 29701 cagacaactt catgttctac tacattcaaa atagaagata ccagacagca atgtccttga 29761 ctcccagcca caaagcacct acaaactcat aagcatcttc aaatgtcctc tcctcactcc 29821 ttctcttctg tcatagtgga agaagtatcc tttttcttgt gactaatcct tccactgttg 29881 ctctgtgccc cattcccctc taccacctta ggaatcttga cctattggct ctctcctcct 29941 ctcctgtatc ttcagcctct ccctctcttt aaacatgttt tcaagtctct tgtatcttat 30001 aaaaaaacat tgcctcaacc cctgatcact ctctagctac tgccctcttt cctccctata 30061 acaggcaaac tgcttgagag aagtcttcgc tcttactatc tacttcctca cctcctgctg 30121 attcttcagc acagcaaaaa tattaccacc acttctcaga aacttttttt gagtccaccc 30181 ataagcccca actaaactca acatctttaa gttgttttta gtccatcccc tcctcaacca 30241 ttaaacttct ttccatctct actgccagca tcctagcctg atccaacatc attttttaaa 30301 gaaaatttta cctttgccct ccgataatct attctttaca acagtcagaa ttttttttaa 30361 tgcaaaacta tctttgtcac cccaccctca gccctggtca aaacccttta gtggaccccc 30421 attcccccag gaccaaatcc aaatttctta tcacagcttc taaagttctc aataatctgg 30481 cttctatgta tctcttcggt ctcacctttt tgcatccctc ctctcactat ttcattcagt 30541 aatacattca ttcatatact cattcactta cttataaatc tgtcatcagt ttatttatcc 30601 attcatttaa taaatgttta cttagcatct actgtgtgct tactcttata ctggacacca 30661 gagacagaga gataataaga tgtttttgct cccatgcaac tcccagtctg cttgtctttc 30721 aagccatttt ctccagaaag ccataactca ttttctcagg tggaagttat cccttaatct 30781 tataataagg ccacagttcc ttgatggcag tgcagttggt ggcaggggtt ggggaggtcc 30841 aggaatcaac tccctctacc aatttcacat gcccacctgc cccaccagga ttgcccagta 30901 aaaagccctg cattcttcaa atctttctgg accttagctt tctcacttgt atagtaaagg 30961 gatgaatccc atgatcacta acagccctgc cagctctgac atgccataag cttatgattc 31021 caacagtaaa agcctgataa atatccatcc ctgtaaccac aagcagatgc tacctggaat 31081 ggatggaatt tcatctagac taggaacaat ctagcatcag tccgagtcaa caaacattcc 31141 ctggggtaat ccctttttca agtcttgatc ttatatattg gggagaagga aaataggtcc 31201 cgtcctcaaa aaactctgaa gcttcttggg aaattaaatg ttcttccacc ccaaggcagt 31261 cagaggctag accagggtta caaatgactg gagggaagga tgtaggggtc agaatttggg 31321 aacagtgaag tccttccaag ggagaaagaa gtgtcacaaa agttcccaga gaaggaagaa 31381 gcagagcaag gtcttcaaag ggaagaaagg gttggccctt ttctttgcca ggtcaaacct 31441 gaaggttgaa gtgggagtac tgggacagaa gcttaaggat tatacatctg cttcctcagg 31501 gtgcgagatt tactcagacg caggcaacca tgcttccatg atccaaggta tccgtaacag 31561 tggagcagcc aagtttgtct tcaggcacaa tgaccctgac cacctaaaga aacttctaga 31621 gaagtctaac cctaagatac ccaaaattgt ggcctttgag actgtccact ccatggatgg 31681 tatgtatatg agtgagtgta tgtttactag tgttggtctc acaaaaacca tgatgatcat 31741 gatgatgatg atgacgataa cattataaca gctaatattt atagtgttta ttatgtgcca 31801 agcaaaatta ttagtatttt acatgtatta attcatttaa ttttctgaac aattctatgt 31861 gataggtgtt attattattt tgatttttta catgaggaaa ctgagacata agagtaattt 31921 gtccaaggtc acacagctag taaatgccaa agaatggagg cagctattac attcatctta 31981 taggtaaaga aactaaagtt cagagttggc atccaattca tcttgagtgg ctcagcaagt 32041 tggtgctaaa gtgagtatct gcaccctaac acatataact ccaattcctc gagtaacact 32101 tctcttgtta gaaatgatat gtaaatcaat aatcccagtg tttggttttt atgaaggaaa 32161 tttcaaaaac cattgcctag gatttttttc aaggtccagt atgaagcatt ggggtcaaaa 32221 caggttttca agtcagagag acctgggttc aaatcccacc tttgacagtt actggctatg 32281 accatgggta actctttaac tgtctaagcc tcaattttcc caaaggtaaa atatctggtt 32341 gtaagaatta gagatgatag aaaccattct agttattatg ctttagtaga attaaatgat 32401 cttcacactc ctacctcctt tctttgctca attgaaacaa tgtccaaagc tttctattgc 32461 tggccctgtt gtgtagaaat catgtgtttt aggcatcctc ttatggattt atttaaggga 32521 agaggtcctc aactcatttc agtttgtccc ttttccaact gaaacaaaag agtccatagt 32581 attccctgat ttaggtatct taagtggcat gtaatgacta tacacacagg ctctaaaacc 32641 agactatcca tgttcaaatc ctagcatgac catttactag cttgggcaag cttcttaatt 32701 gctctgtgtc tcagttctca gttgcttatt tgaaaaatgt aagtgataat aattaaatag 32761 gtatgcaaat taaatgagtt aatatatgta agaaacttac tattatgccc actcccacat 32821 ttctaacact agcaataaag taaaactatc ctatcccttt tgtatatttc taccactgag 32881 actattcaaa ttcattattt ctctagtgga aactatgttg gtaccattct acctcgttac 32941 atttgcaaat aaatagttat ttacctattt ttggggtgca aactctgccc aaactgttga 33001 tccttaggct gaatctctcc cattgaaatg atgctaggct gaacacagca gaaacaggaa 33061 aatagacatt gtcagaatga agtaaaaaca gaaagacaaa gagtcaagcc ttgatcccag 33121 gctggggaac acacacacat gcgcacacac acgtacacac acacacacac acacacacac 33181 acacacacac acacacacac acacagagag acagagagag agagagagaa ggcagggatg 33241 agatacaggc aatcgatcca tacacagagg tttgtaatag ttctaaatga aggcgcacat 33301 cctccttcct ctctacaaca cccttttcca acccaaagta ggcatgtatg ggaaattcca 33361 cattggagat ggagctgggg aagggttatg atgtcctacc tctatccctt ggctttgctc 33421 aggtgccatc tgtcccctcg aggagttgtg tgatgtgtcc caccagtatg gggccctgac 33481 cttcgtggat gaggtccatg ctgtaggact gtatgggtcc cggggcgctg ggattgggga 33541 gcgtgatgga attatgcata agattgacat catctctgga actcttggta agtgaatgct 33601 ttgggccttc ttatataccc tccagagagg aggcccttac aaaattcttt tctgcctcct 33661 ccccaaagct ataggggttg tttggacaga attcacagcc ccaggctgct gccatcctgg 33721 actccctctc tccactcgca tcccactgca gagttgatga gaaagtctgg tagagttttt 33781 tgaaaagacc ttgaactagg ccaaatagtt agattcaact tgagtatgtg aagagctgtg 33841 tttctaaacc cctcccccac cctagcccca agcttcatct tagctccact cctgacccta 33901 tccagctaaa ggtccccacc cagctcctgc ctatctagtc attgcatatg gcaagacttg 33961 aaagtcctat ctcaaagcag cagaattatc agctacgact gccttgtcat ggacagatga 34021 gcagaggcct gggaagacag cctggagccc caacttctgg tgcaccccct tgtgttatct 34081 ggcacatgat cctgttgctc tgggactgat tatgggatct gtgtatatct tattcctttc 34141 tgtctccagg caaggccttt ggctgtgtgg gcggctacat tgccagcacc cgtgacttgg 34201 tggacatggt gcgctcctat gctgcaggct tcatctttac cacttctctg ccccccatgg 34261 tgctctctgg agctctagaa tctgtgcggc tgctcaaggg agaggagggc caagccctga 34321 ggcgagccca ccagcgcaat gtcaagcaca tgcgccagct actcatggac aggggccttc 34381 ctgtcatccc ctgccccagc cacatcatcc ccatccgggt gagagcccca ccatgcccat 34441 tgccctctcc acctatttat tctgggagcc tcacgctccc aacaaaccta catctgttgc 34501 tgtcttcaat tatttgcttt cctgctaacc attcccttta ttgccagctt tgtttccctt 34561 tttgaaaaat tatcagccat tctggattaa ccagtctttt ccttgcatca gccattacct 34621 catgcttatt agattatcct aaccctaaca atagcgagtg ctcacagcct ataattcaga 34681 gtttttcaaa ctggatcaag acaattaatg ggtcacaaaa tcagcttagt gggttatcat 34741 tagcattaaa aaaagaaaag aaacagaaaa tgttggagta catcacatac taagggtatc 34801 atcaatttgt gaaaaatttg tatgcatttt gggtatttgc atatacacat gtatgtgtat 34861 gtgtgcgttt atggtcacgg tgtaaaacgt acttcttatt gagaaatgag ggcagaaaaa 34921 taaaatcaaa agccatagga ttagctgcta ctttggatcc tcaatatgag catttactgc 34981 ctttaaaaat gaactgctac ttctttctta aataacacgt atttgtgtga gtcagtaagc 35041 cagggcaggg aaaggacact tatttgtgac aattttgtgg atgagaaata gtcactgctc 35101 tttagactaa cctagtattt cctttaaaca ctcattttat gaattaattt agtgacagca 35161 ccccagaatt ggcttggcgg gggttccaga attggcttgg tggggggtat cttctcaccc 35221 agaaccatcc caaactaaga tattagctaa gtaaaatcag tgtgcttgct ctgcaaacag 35281 cttccaaaca gggctcctgg taccacctct gctccatcct tttcaaacca aattgctagc 35341 tctgagctcc tccttgatag aaattctgga gctgccacta agcccctaat ggaaaaaaaa 35401 aatctatccc aaaattcagt gatgttccct catctagttc cctccatctg cttaatggag 35461 ctagtgatgg tggagccaga gtggcaggta ctgattagcc tttctcctga gtccaggtgg 35521 gcaatgcagc actcaacagc aagctctgtg atctcctgct ctccaagcat ggcatctatg 35581 tgcaggccat caactaccca actgtccccc ggggtgaaga gctcctgcgc ttggcaccct 35641 ccccccacca cagccctcag atgatggaag attttgtggg taagttctca acatgggtgc 35701 ctacaggacc tccctcccct cagccccagg atctgaaaga gaagctgaga ggacagagac 35761 cactgagttt acaaaatatt tctggaacat ctaatgtgtg ccagcaccta tactagggtc 35821 acaaataaat gagaagcagc ccctacactt gtagggctcc agtttggttg gggataccat 35881 agtgaacaca aacaatgaca ctaagggatg atcaaagctc cacaaggcag tgcatgatag 35941 agttgtcgga gcagagagga ggggcctgac tcagcctgag ggatgcaaga cccacttcct 36001 agtagaggtg acacctgagc tgagtcttgc aaagtgagtg gtattaaaag aaagagggca 36061 tggaagaagt attcctacca gagggaagag catgaagata ggtgaggaga atgagaagca 36121 gccagggata tatcaagaac aataagcagg tggtattgga atgtagggtc ataggaatgg 36181 agtggggcag gggagtatca atctatgagt ctacaaagac aacatgagat agagactgga 36241 ttgagaggct tgtagagctg agtagtttga gatttaccct gaaaatgcca gtttagtcaa 36301 ttcacctaat gtttgttgga tttctgttgg gtagttttgt ttttgtttgt ttgtttttgt 36361 ttttgttttt ttgagacaga gtctggctct gtagcccagg ctggagtgca gtggcacgat 36421 cttggctcac tgctacctct gcctcccggg tcctggctca agcaattctc ctgcctcagc 36481 ctcccaagta gctgggatta caggcacgtg ccaccatgcc tagctaattt ctgtattttt 36541 agtagagatg gggtttcacc atgttggcca ggctagtctc gaactcctga cctcgtaatc 36601 cacctgccta ggcctcccaa agtgctggga ttacaggcgt gagccaccat gcccggcctg 36661 ggtagttttt aatgcagggc ctgacattga ataggtgctc attccaggcc tgttggatga 36721 aagacatgta ggcagttgat ggtctagcag aggagccaga tatagatggt actggtccag 36781 tatgatgagc tccagtattc tgggagctag agggagtgga cacattatgg agagagaggg 36841 tgggaaggat gaaattggag aggctttgtg agtaaggaag tttttatgat gcatgttgaa 36901 gtacatgtga atatgttgta agaatattcc agaataaggg aattccacga gcaatgacct 36961 agagatagga aagcagtggg tatgtattga caacataatt ctgtttgtct gaagcatggg 37021 cagtatgaga attcaaggaa gacaagctag gtaggcgcca ttcattcatt caaaaacatt 37081 aaataatgct ggctaacatt aagtacttac catgtgccaa gcactgttct aaacacttta 37141 cacgtattaa ctcatctaat ccccacaaca acctcaagag ttagagatcc tcttatcatt 37201 tccattttgt acatgtggaa attgaggcac aaaaatatat agtcgctgat ccaaggtcac 37261 acagcttcta agttgcaact gggaggtctg tctctacctc catggtcata actgctaggt 37321 ctaccacctc tctgagctga tgacccagac tcctgggcct tttgttcagt attctctttt 37381 gctctgggct tcaattgtag agctctcagt attcttggtt ctctgaatgt ccacctaggc 37441 taggcttttg taagaatata tgaggcatcc acgatggctc caccagtccc taagttccat 37501 agccaatcca tcctgaaatc ctgcaaaagt tatctataat ctctctcaaa cctatttgct 37561 tttctcccct gccacttctt taatccatgt caacatgatt tttttcctaa tttctctgct 37621 tctctcttgc tcctctcaaa tcctttctcg atgatgacca ctagagggat ttttctaaaa 37681 ttctgactat attgctccct tgcttaaacc ccttcatgtt tccctctaga ctctaaagca 37741 gtgacctcca aggggtatgc aaaatgatta cagggtgaag gaacagaata tgtattagaa 37801 ttttatgttt ttttatctta aaaataggaa atcaagcatc actgatactg atctttaata 37861 tacagactga cagttataca tgtatataat atataaacaa atatagagat tggaggtaca 37921 tgctaaaaca tttgtactga tagggatgta tagtccaaaa tttggaaaca ttgacatata 37981 ggacagagtt gaagctcttc agcatagcat tcaatgcctt ccacatggtg atctctatgc 38041 cctcacctcc tccccacatg cattttgttt tttcagctac actgaaggac ttgtcgttcc 38101 ctcatttttt tctgctctct tacctctggg actttgctca tgctgctctc ttttgattgg 38161 aatgccctcc ctcacacttt cctctggctt actttccttc atcttgtaga cttaacttag 38221 gcattctttc aacaaatatt tattgagtac caactgtgta ctagatactg ttctaggcac 38281 tggggatgca gtagcaaaca aatcagacac aaaattccta ccctctggag cttacattct 38341 agtggaaggg gtagtaaaaa aaattaccaa aaataagcaa attaagtagc acattagttc 38401 taagtgctat gggaaaaaat aaagcaggat aaggagaatg ggataagggg ccaggggcga 38461 gttcagagaa gggttgtagt attagagtgg caagggtaga agacgctgag gtgaaacttg 38521 agcaaaaatt tgaaggaggt gaagttagtg aggcagatat ctaagggaat ggcatcgcag 38581 gcagagggaa catcctaagg cagggaagac acaggagtat tccttttata tttgaggaac 38641 agtaagaaga tgggtgtggg tggaatggta taagcaagtg ggagacagaa aaattgagta 38701 catagaggca atgtgggacc agattgtata gggtatggta ggccattaga aggagtttgg 38761 cttttactct gagagccctt gaaaggattt gaacacagga ctgatatttc tgactcgggt 38821 tttaacaaaa ttgctccaac ttctatgtag agaatacact aaaagggagc aagggtggaa 38881 gcagggagac ccaagagtgg gctacagtaa tatcccaggt gagagatgat ggtggctcag 38941 acttgatcat aatgaaggca ataagaagtg gtcagatttt gaaggtagag ccaagggtct 39001 ttgctgatag atgggatata gggtaagaga gaaagagaaa aataaaggat agctctgaaa 39061 tttttggact gagcaactgg aattgccatc cactgagatg ggaaaagcta aaagtagaat 39121 agcttggtgg agggtaggga catgagtagc tcagttgtac tcctaagtta gaaatgcata 39181 ttagacatct aggtggagat ggagaaaagc cattggatat acaagattgg aaaccagtag 39241 agtggcgtga gctggagatt aaaatttctg aaccatcagc atatagatgg tctttaaagt 39301 catgtgacta gacaagatca acaagggcat gaacacagaa aaggccaaga acagagccct 39361 ggaacgtacc tggggtactt cctccagcta ggtcaggttc ccttctctgg gttttcacac 39421 ccccaggtgg accccctacc ccaggtttcc tggtcatagc accaatgaca cagtatagtt 39481 actgtcatta tcattgtcct catagggctt agagttccca agcagacagt cattcttggg 39541 ccacagcaca tcctatactt agggagtggt ccaggccagg acagtatggc ttcaaattgt 39601 gtcaaaggag agcttccaaa tcttttataa tatatatccc agcatccaga tacaaatggt 39661 aatattcacg gcacacacag aagcaaacag taggctactt ctggccctga ggtatcttga 39721 agggttgagg gggatcaata tcttggctca tctgtactgt gacagatttg gaagatctag 39781 tctaacccat tttttccctc ccctccccct accaccttca gagaagctgc tgctggcttg 39841 gactgcggtg gggctgcccc tccaggatgt gtctgtggct gcctgcaatt tctgtcgccg 39901 tcctgtacac tttgagctca tgagtgagtg ggaacgttcc tacttcggga acatggggcc 39961 ccagtatgtc accacctatg cctgagaagc cagctgccta ggattcacac cccacctgcg 40021 cttcacttgg gtccaggcct actcctgtct tctgctttgt tgtgtgcctc tagctgaatt 40081 gagcctaaaa ataaagcaca aaccacagca tgtgaagcct tttattggac agggaacaga 40141 caagtgcatt ctgactccct cagacaagtg gcagatctat gaggtaacca taggtcactt 40201 gttggtcacc attccatttt accaacaggg aaacagaatg agaaagagga aggaaatgcc 40261 caaaaactac acagtgagtt gcaaagagct aggatacttg gcagatttat ttgagatggc 40321 tcttctccaa tggtttcaat aaaaatccag ggattgacgc ttcactggta atccgggccc 40381 tggaaatctg ggcccagggt agcccctgaa gccaggaaat gagtcagccc taaccaaatt 40441 ccctagaaag gagtggttcc ccaaaaccaa agaggtattt tccataaggg gatatggatg 40501 tcttcctcaa tccattatcc tctctccctc ccacggcatc ccacagaaca ggatttacca 40561 gtcccattct ttttcttttc tttttttttt tttggagact gaatttcact cttgttgccc 40621 aggctggagt gcaatgcatg atctcagccc actgcaacct ccgcctccca ggtacaagct 40681 attctcctgt ctcagcctcc taagtagctt ggattacagg cacgcgccac cacgcccagt 40741 taattttttt gtatttagta gagacggggt ctcaccatgt taggctggtt gcgaactcct 40801 gacctcaggt gatccaccca tctcagcctc ccaaagtgct gggattacag gcgtgcacca 40861 ctgtgcctgg tctaccagtc ccattttcaa aggtgagaag aggcttggct agagcaagca 40921 ctgtgttcaa agcaatgctc ccactttcca ggtcacagta tacaggaaac atttccatgg 40981 cacacttagg ggtactaatg atactacttg ctaaggaggc aatcgtccct gaaacttgtg 41041 tggtcccagg ccctgcctgg caactctgag ggctgaggac cgtttctttc aaccctgtaa 41101 cccactgtgg cagctgtggg acgcactagc tggaaagctc tggtttaagg ctacacagcc 41161 caccttcctt gctgatttca ccttagccgt acaggaaatt cctacaactg agcagagaac 41221 cctttatggt gagcactgct gctcacgcct cactattccc tcctcccccc tccctcctcc 41281 cctttctctg ggaattacat cattgggcta cagccagccg gcaaaccaag tattgagata 41341 gcttctctga ctcaacttta tggagtacct ccacaatgtc caaagtgcaa ccctgaaaca 41401 aatacacaga ctttatttat gcacaaatac actttccagc aaagacagaa gtgatttaaa 41461 atcacacaac ttcctggggc agctagtgtc cgtgtacttc ggacaggaag gtggaaagtg 41521 gtgtaagggg ctgggtcaca ggattaaggc acaagaagct caccaacaag gagctaggta 41581 ggaaggaaag cgaggaagag agggcttaaa agaggaagag aagggtagga gactttgagg 41641 gagaagcagg aggcagctca gggaagggga gctggcctca gatcatgtgc aggggtgacc 41701 atgccagatg tccccaggcc tccattggtt cagctgggcc tgctccagag gaagaagttg 41761 caccgggagg aggggtcagt gggaggaccc cggggcctgg cacacatgta gaagcggcgg 41821 cccaagttgg gtcctggctt cttcacagta cgcatcacac atggctccct gtggccccca 41881 cagaggggtg tgcgcaaggg ccccgccagc acagacttcc agaatgaggt ccgtaactcc 41941 ttctcatctt tggcttctga agtcttggcc tgccccttca ccactttggc cactgccttc 42001 tcttctggag tcttcggggt catgagggcg ctcatcagtg gtaggctagg cagctctatg 42061 tcaggagagg cttggggaca gctaggggaa ggctgaaagt agctcttcag gtttttctgg 42121 cctctgctag agccaacctg actgggctga ggcctggttg agcgcacttg ggctttgttt 42181 tggcatgtct gtacccgggt ttgattgttg tgctgcagcg tcgactgctc caacacagga 42241 ctttgttcga gaggaactag gaagcgaagg atcttgagct gggtgcctgc aaactcaggg 42301 aggaagcggg tgcacagagg tgggcactgt tttgcaggca cagaggacac actcaagact 42361 gcacccacag ggcagtggtc agagcccatc acctcaggca gcaggaaaga ggcctgaaag 42421 gtgtctatga ccagggtcct gtcccccagc acatagtcaa gccgggagcc atagttgaga 42481 tggcgggcgc cagtgactgc tgaccagcag gtgaaggccc cctcctgctt tggttggaag 42541 cagcggtagc tatcgatgaa gggccctaca tgagaggcag actggcaccc caagttactg 42601 agcaagctgt ccatccactt gcgccctggg tcctcttcaa agcattccta ggtgaggagg 42661 aaagggggtt ggaagagaga gagacagaac tagataccta gaactggaag ggacttagac 42721 taactgctgc attttgcaaa taagaaaatt tggttgtccc aagaggacta ctgctcaagg 42781 tcacacagtc cggaattggc agagctggga ttctgatttt catccttact gatttaaaag 42841 accatgcttg tttttccctc tgcacgagct tgcatttgtc ttgcattggt ccgtcaatgg 42901 caaatctgaa agtttgctta ctcaaagcaa ttcctggaac tgcttttcaa aaggatggaa 42961 aacaggaagt taaaagtgtt tgcctctagg aagcagggct ggggttaaga gttaggaaga 43021 ctgtcagctt cttcctttaa accctctggc actatttggt ttatctagtg tgtcaaagcc 43081 ctcaatgacc attcctacct cctgagctga ggacctgcaa ggatccaggt ctgaatactt 43141 ccttaggctc tgatccccaa tccatcatcc atcagtccat ccattcaatc cacctaccca 43201 cccactcatt caatgcaaga aatgttcatg gaatacctat tatgtgccat gcatgatgcc 43261 agggacaaag taggaacctg atctactcct ggggaccaag gaggcttcct ttcgagaagt 43321 gatctggagc tgaggtgtgg gggctgagta agagtaacct aggtggaggg tgctcctaac 43381 tttaagaaca gtctctgtgc tgggcctgtg gagggagtgt gctgcgtacc caaaagactt 43441 gaggaaagac cagactggag accagaatac cagcaggaaa tggctgccat agtcacacaa 43501 gcaagatgat ggtcacttgg actagggtgg gtgccagagg agacagaaac atgaaattat 43561 gagatgattt gtagctagaa tcaacagtac ttgctgacta actgaatgtg ggaagtggag 43621 taagaaagaa agacaatact agcacaattc ccaggcttta agttgaagca atcaagggga 43681 tgatggtgcc atttacagaa atgacaagac taaaagagca agcctggtag aaagtgggta 43741 tgttctgcta ttttggaaag actgtgtaag tcatctgagc tggcactatt agttctcttt 43801 tcctggagga gaaaatgaaa gaggtataca taagagatct aggatacctc tgaactaagt 43861 agaaaaggca caaagatgta tgatgctatc acaggaatct gaaagaacac actgaatgaa 43921 tgcataagag gaatgaggca gggagggact gaaggctcac actttgtagc attatctccg 43981 ctccccacca caccaccacc accattctct cattctctca cttctgctca ttttcccagc 44041 tgggagccat atctacattt atgtagctct tctgtgaatt aatcaaggaa tattcccacg 44101 gtcccttctc taagccaagc ccagtgctga gtaggactgt accagcataa atcactgatc 44161 ccccatcgtg agacataagg caagaaaaga tgagagctgg cttgtggatt ctagggcagt 44221 ggttctcaag cctggctgca caataaagtc acactggggg cttttagaac gtgccactac 44281 cctggcccca cactcgagag ttctgagttc attgatctgg agtagtgcta agatattgat 44341 atttttgttc taatatgcag ttaggggctg aaaaccatgg gcctagggga acttagagat 44401 aagaggttca tataacaaaa agacagggag ggggtcatgt ccaccaattc agggcacagt 44461 gagaaaagcc agaatgagaa taacttttac aacgggccga caggagacaa cctgtctaga 44521 gaggacaaaa tgtgtaggga cactgaaggt ggaaataaga gacaggtcac agaccaagga 44581 gagccttgaa tgctaggtct ccatgttttc tctgcatcac agaggtaaca gggaggtact 44641 gaaagttcat aaacaagaaa tacagtgtat taaggaaatg accaaggcaa tgtggaattc 44701 acagtccttg aaatctgacc cccaacaact ctccagcccc accttctgcc actcacctcc 44761 cttcctgtgt cactaagaag aaagggactt gtggttcccc atgcacacaa tgcaaattct 44821 caccttgaaa cctttgccca tgctagaatg ctttctactt caaagctctg ttctactcat 44881 ctgttaagat tcagccattt tcagctgcct gtcctgatgt ccgagctcac ctaggaatcc 44941 catctctgag ctcctgtagc tttctgggat gccatctgtg cctctgccct gagaatacca 45001 tcctataatt ggctagccat gtgtctgtag cccaccacca tgctggagag ctcctaggag 45061 ccaagggctg gatctgactc ctgagtcctc agtatgtggc tcagggccag gccatgggca 45121 gagtcagtat tgtctattga atgaatgacc aatcctcagt actataagaa tagtattgca 45181 agccaagctc cgctttccca taaaggaagc tgtccctggg ccaaaaactc agggttggtt 45241 ggctggacaa acagaatcag tcaggagtgg ggccaggcac ctagagggag ccttaccagg 45301 ttgactgcat cccagtggtc aatggggcgg tgggctgtat tcaggtcacc cagaatgatc 45361 acatggctga aaaacacaat gtgggattgt tagaaagggg tgcagcagtc ctggcccata 45421 tactttttcc tgggcccaag agctcagaca ggccctctcc catattgagg cttcctagcc 45481 aacagcaact cataacagcc tacctctcca atcttaccac aaacttgctg cctttcatat 45541 ctgaccaacc caattctgat ctccaaaggt gacagtgttc catgtttgac atgactgcct 45601 cactaatcag gagccaatgg gcatttttct tgtccttcaa agctgagctc agataccacc 45661 tccccttgat tgctctaacc caccctcact ttcctgtcct ccgctggaac aagactttcc 45721 tttgctccca cgcactttac cattctcatc tcattttacc cttaaaacta ttgtgtaggg 45781 tagagattca tatattcatt tgacaaaaga gaaaacagag gcaagactag ccctagggct 45841 cacaatgagt aagagcagga gcctgaatag gtgctcagag tacaaaaccc atgtttgttt 45901 tgagcagagc ctattcaggt gggcatattg gtgtttgcaa ctatgctgtg aaagaactca 45961 gagacaaagg gtccatatcc ctttcaatgg agcttgacac agggctgggc acccaggacc 46021 aactcaagga agccttactg cccaggcttg cagtacctgc ctgccgccag gagggcttct 46081 gctcggattt gcagcaaacg atagaagcgc atcttaaaga ctagccgctc aggcctccca 46141 gggtccgcat gggggcagta cacgttgatt agggtcaagg tcttctcctt accttcccat 46201 gtgctggaga aaagagaaac ccccaaaagg ggtcagtttc aaacctcccc ttctgtccct 46261 tccccatacc acatggcttc cattttgtga ctgtctggga tggaaaagaa aggggttaaa 46321 gagacacaaa tgatctggca tgcacaggac ttggggactg actactgata aggcagggag 46381 agggatatct aaggattacc accaggtaat ggtgctgcta ttcacacaga acacccagaa 46441 agacagcaac ctggggcact gagttcagca ctaagttgct tttgagatgt ctatggtaat 46501 aagagctgga tcactaggcc agccccacat tgtagagatg gaaactcaag cttcagaggg 46561 gggaaaaatt tgccagggtc acacactgag tagcccagcc aggactaaac caggaaaggc 46621 aaatttggga acttctccac ggtgcatgta tgtttacatg gcaagcctta gaatcttcac 46681 actcaaagag tagcaccagc agtgctcagt agcagggaga tatgagctat taccggatct 46741 tatgctgtgt gaggagggcc ctgccctcac tatccagagc ccggagttcc tcttgggtaa 46801 actcatccat gtttccatag caaccaacat ccccattctg ggtggcaaac aggccactca 46861 ggccttcttc agcagccact ggggtagcat tgtccttaca gaaggtggct acacctggga 46921 acagacaggg cactctgagc agacctggca gtcaatagcc tgttgcacag cctgtcccac 46981 tccaattgtc cccacacaag tcatctaagt catcttcata aagtgtggtt cttgggcaac 47041 cttattcaaa ccctggagtc tcagttccta catctaaaaa agccacagaa tgttctaagt 47101 gttttacatt gatacatggt gagtgctcaa gacgtattag caattattat tatcacatgg 47161 attaattcat ttaatccttg taacagcccc ggaaggtaaa cattatcctc attttctagc 47221 tgaggaacct gaggcaccaa gaggtaatgt aacttgccag agggtgcaca gctaggaaat 47281 gagccatgat tcaaacccag gctgtctggc tccacagttc acacaattaa gcaccacact 47341 actactgtac ttctgaggtc cagtaggata attctatcca taaaaattat atttcatgta 47401 atttactatt atgcagcttt agctctgtaa actaaatgtc taagatttct tattaccctc 47461 tcatcaaata ttggaaaccc ttttccccta tctgcctccc tcactaattt aaagtccaca 47521 gaaagcccca tttaccagaa tagccgctac ggttgcggct gaagctgaaa taggagttat 47581 aaccctcaac gatagccagg ggctctgtca gtgcatcccc tggaaaaaaa aaatgtaaga 47641 ttaagtcaga ggtaaaacta gcatccgaga aacacaaagg cctggaatta cagttgtagg 47701 ctaattcctt gcacacatca gagccagagt tgggggtgac gggaatgaaa tatccatgat 47761 tagaggagga cctagaaaga tgggaaatgt cagagaggaa attgatatca gaattagaag 47821 gcttggtggg gggcggttag agactatgaa gtggaggtgg agaattaggt gggaagggga 47881 aagaatcaga gctaagcctg ggggtctgaa cgataggagg aagctgggga tggggatgga 47941 catacgagag gatggtggga atctgagcta agggtgagag gtaagagaga gggaattagg 48001 ggatctgaac tgggggtgga gggaaaagga attaggggat tctgagggga gataagaggg 48061 tgaagaaata agaggacatc taagcttggg gtgggaaatg aggtgggaat tagggacttt 48121 ccgctgggag gagagatggg aatagataag gggtactggg gttctgaggt ggaaatgtga 48181 aaaagaggaa attagactga gctggggtca aagatggggt gggaattaga ggatctgagc 48241 ttggggcagg gatgaagaat tcgggggttt gacttggcgg cggggagatg gacagcgtat 48301 ggaaggagga aattcagggg actgagtttg gggccggaga atgagtggga atctgactag 48361 atatggggtt tcgagaagga gctaaagttt gctgaagaca ggagatggag tgcaattagg 48421 ggatctagct agggtgggaa atggaagttt aagagggaag ggacctgggt ctaaggacag 48481 gactctaaga ataggaaact gggaaagtaa aattacatgg gacaggggac aaaagttagc 48541 gggaaagaaa aagtatgtag ggatcctgga atccagagcg ctcactggtc actttggttt 48601 cctggagaca gacgatatcc gcatccagct cgtccaaaat gcgccccacg gccacggcgg 48661 cacagttgct gggttcctga tttgccaccc cttgcagggg tctccgaatc ccattgatgt 48721 tccagctcac cacgcgcaac atcttaaagg gttggaacac ctcccagccc gcgccaacct 48781 aggcgcgagc gaactgcttc ctgttcagaa gttggccagg cccgcctccc gcgcgcgttt 48841 gcgcaggcgt tctcgtgcgt accactaaag cgcctgacgc acgtcgggcg ggggcttcaa 48901 cttcctcagc tctcattggc gggaggagga actgggtgga cgctgaagga agcaaagcgg 48961 aagcctaggg gcaatgaatg gcgagaggag tttccagttg ggctcccgct ccttccgctc 49021 ggacagctga gtctcaatgc gtttttcctc tactcagaag tccaagtggt gcgtccggtt 49081 acttacaaac ggcattttgc cactttttaa gcgacggaac cctgagcgga actgctcagc 49141 tgccaaagcc ccgccctcag tgcggcggac cgcccaccta agttcacaga tggctgcagc 49201 gcttgcgccc caggaatagc tggcgctaag tgctcgcgtt ggactgggca aagggaagtt 49261 catggttgga ggagggagca gtaagagtcg ctccactgag ataagactta ggaggagggc 49321 ctttggcttt attggggcag aggggaggag gaggaggctg tctcaagcct gccctgtagc 49381 ttttgtcttg gctcttaaca ttctgaagga tgtagccgga cacatcctta ctcttggttc 49441 tgtttctcag ataagtgaaa tcgaagccca ggaaatgtaa gctcgtgagc agcattgtct 49501 ccggatgcat ccatggctct tctgtatatg ttatctcatt tcattctcaa gacaacccag 49561 ttcgggacat ggaggctaag ttagagacgt cgaatgaatt catttcattc tcacagttca 49621 ctctctctta cactgaagta gtagagccag gattcacacc ctgtcttaat tagctcaata 49681 attacttttc tgagtgtttt tactgaccac acacctcccc agttaagtct tttcctctgg 49741 ggccccacgg caccctatgg tccattctgt cacagtactt ataatacaca gctatcattg 49801 acttctattt ctgctgccat tagaatgcca gcaacttgag agcaatgacc atgtctggtt 49861 catcttgtta gccccatccc ctaatactgg acacagcata tgttgggaag caatagatat 49921 ttgtcaggtg aatgaatgaa caaaggataa tagcaagagc atcgacaggt tcatggcttg 49981 tttttgagag cacaaaaatt agaagtaatg gatatgccca atgtccaggg aaaagcataa 50041 cgatgcatct ttaggttcct cacccagaat tgcagcataa ttatttgttc cccagaattt 50101 aagtcttaga gccatcatcc tcaatggtta cagcaaagac gcagtaaact gcttttcctc 50161 ctggaggact taaaaggtta ccagttgttg ctatcagaaa atagctctgt gattaaaata 50221 aaaatttgca tgtcctgagc ttcttcatgc tttgggagtt attgactcag gattgactcc 50281 catgagtgcg ataactagga caaaggaaat gaacattttt attgctcttg cttcatattg 50341 ccagattgct ttccagtaag ggattatgcc catttacact gaagatcgat ctgaaactca 50401 gcaccagcga aatccagaac ttgcctgtct ccatggctgg ttttaatttc cccattctgc 50461 agtggcttgt taatattagt tctgaccttt ggggcaaggt gaacacatgg ttggactgaa 50521 gagaaaaggc ttctggtggc tcaggaacgt ctttggcaac tacaacagct gatatttcaa 50581 cagagcacat acatccccca cttaacaagg gtacgtcctc agccttctca gggaaccaac 50641 gaacacctcc aggcttcctc tttgatgcca cccactggac ctgccttggg ggtctgtaaa 50701 tgcaagagaa ccgagtgttg gataattagc gatggaagaa aaaacctcta gaataaaagg 50761 taggtgagaa agaagggaaa gtaggcagga gttgcacaaa tccacatcca ttcttcaggc 50821 cctctggaaa gttttctttg catagatggt tggttgtcgt gtgcattaaa gggcatggga 50881 aagtggggga aaagattata tctatatgta acctatttat ctgtctgtat atttttaaaa 50941 gtccattctg gcctttatgg gacactggcc cccagggcaa ttcactagca cctgtgaaag 51001 tctctctaac ttggtcaagt ctggttttga atgaactgag tgattgagat tatagctctt 51061 acctccccac tgatgtgtaa agtagggttg atttgctgtg gcgctttgaa caagctctct 51121 tgcttctctg ggcatcagta gcttcctcag taaaatggaa atcaaagtct ctagccaacg 51181 ttcctcagag ggctgtcacg atcaagaaga gtggaaaggc agaaaccaaa gctctttgga 51241 gacatttcag gacttcctcg ctctaagagt agtcatcttg ctttttctat aggctccaca 51301 agcaatccca cccaaaaccc tccagctgct gtgcgactat agtctgtgga gagccacttc 51361 catgactagg attcagtgag gtttggggcc caggcatgag gccgaaaata gtggagagat 51421 ggtgggccag gacagtgggt gagaaggaag gggatcctag gctgctgtca cagaaggcct 51481 gacactcata tacatgcacc tagcttgtaa gtgtgtggtc gatctcccct cacaccttca 51541 ctctccacaa atgcattcat acccaaacag tctgagagct accaataggg agaccccatt 51601 cgtgacttta ttatccatca gcactgtcat atgctgcttt acatggtatt ctcccccgat 51661 acacatgccc tatactttgg gaggggtatg gggtcaggat ctgggggcct tgggaaatca 51721 cattgtaatc ttttcttccc tagagatagc cccattttag aattttagac caagagctgc 51781 aggtcagagc ttgatcttga atgagaatct ggtactgact gaaccctttc tctgccacga 51841 gttcattggc ccttttcaga gaggacttca ttcttttatc tgtttctgtc ccattcaagg 51901 tagagaaacc tttctggatg ccacctccta cctcagacaa attagttctc ccatccaatt 51961 ccctgcacag ctctctgtta taatatttac cttgttgaac taggattagt tgtccacaag 52021 ttcatctcat ctagggcaga gagatgtaga ctgcacaaac acaataagca agtgtcttct 52081 ctgtcctctg tgccagggcc tgagttggac actggaaaca caaattacag atcactgaga 52141 tctagtctcc atactcaagg agttcacagt ctagtgggga agcagaccag gaaaatactg 52201 gtggatactt gctcagatag gagcatgtgg gcaatataag aaccaaatac aaggcctgtc 52261 ccaaggatac aggggtcact tcccagagga agtgaacatc aaaccagacc ttgaaaagtg 52321 gatcagattt tgttgggaat agaaggagga aaatggcatt tcaggctgag caaaggccaa 52381 aaaacaggaa aatacaccat gtattttggg gcccagtgag tcagccagtg cagtttaagg 52441 atgatctaag gagagatatg gctggacaga taggctgaag cttgattgaa aagtcttgag 52501 tgccaagctg aggtctttcc ataaagcttg ttttggtctc ctgcttctgg gatcagaatt 52561 ggtaacagta gtgaggaatc tcatagctaa ttccaggcaa ttttaagtta ggctttcaag 52621 cctggacccc aggtttaccc tttaagactt tgtcaaaagc ctgccagctt cttatctcag 52681 atttcctgct gctaaaatta tccttctgcc tgttggcagg gccatgagga caaaattgta 52741 aacataaaag agatgttcat gtgaccagct agattacctt gaacatgcca cttcccctcc 52801 ccagacctca gtttccctgt aaggctaaca aggatgtttg caaggatcaa ataagataag 52861 gaatatgttg ttgctttgtg ggcaatggaa aacatacagg gaaagggtta taattaagat 52921 tattaatatt aacagattta taccagttag gaaatttatt agaattaaaa gagttgtatc 52981 tcttacaaag tctagttctg agactctaga ggttctgagc taacctgtaa gaccacattc 53041 ttagatcagg cttgtttgat taattaaaga aaagccctgg cctgggagaa attaggtccc 53101 tagtgttcta ttcctgactt ggccactgat ttactacatg accttaggaa aatatttcta 53161 gacctcagtt ttcttacctg agagggtgag atgaaatgat cttgcaagtc tttacacaaa 53221 atcaaaagag taaaaccaaa ttcatccttc taattgattt gcaggagctc tttgtgtatt 53281 ataaatgtta accctttccc ataatattgt aagtttcctt aactgagagt tttctgtatg 53341 gtcagcatta tattagaagt ttcagattag tcaaggaggt gtctgatcat ttacctccca 53401 aagggagtaa cagtgttaga taatatattt ttaacatttc acagattgct ttacattttt 53461 taacacattg aatcttcaac agtcctaagg agggtaggta ctcctatcat tcccatgttc 53521 cagataagaa aactgaagca ctgaaaagtt tagtcttctg ctcattgcct catagccaac 53581 aaggggcaga ggtaggattc cactcaagaa atattgctcc agagcctatt gctcttaacc 53641 atcatgatct gctatctcag ttgattcatt tagataggtg ggctctggaa tgtattaaaa 53701 atttggtatg atttatacac aacaactgga gggaatctga cccttgtgta gactacggct 53761 tgcaaatgta ttttagggaa cttcttaaaa ggcaaagtgg aactgagagc atttgagggc 53821 agggattgga aatttctgtc cccaagagat tgcccttgtg tagcccagct cagtggaaga 53881 aagttgagct tcttcatagc caatcgttgg aggttcattg ggggtgggaa ggttgcaaaa 53941 gagtgaagta agaggacact tgtgcatctt cgaggcaagc tgtgcagttc tatgagttat 54001 acacataaaa gcctcaaaac aacgggtcag gggagcggga gagggcaggt gagagtgagt 54061 catctgggct ggatgactct ggcctatgga cttgtcctac tttgaggcca ttgctctctg 54121 ttccctgctg cagcctccac cctcctaaat ctccatccta atagcttggc atgaggaact 54181 ttagtgaaat tttctatcag ttcatgatgc cagtgtcatt ctccttaggg catctgatgg 54241 tggctaacag ccatgtgggc agagcttacc attcccccac ctccagcccc aggcaatggc 54301 tgcactcttc caaaggtgtc agtttactcc tatttattta gcccttggct ggagagagta 54361 gaaagatgat cggatagtgc ccatctcttc aagagtgctc caggcagcac atttcaggtg 54421 ggagaggccc tgtggcaagg aaggcagcag aatccagaga agacaaggga tgcaatgcca 54481 ttttgggtcg gggaagcaga gaagtagtgg accttccatg tccagtttgg gcctctaatt 54541 gccagctcaa aacactgcat tccccagtag ggactttgtg tttgtgtttt tgtccttggg 54601 aaccttgcat ttctgatgtg aatcttggtt tctgggttgg agcttgacat tctacttggg 54661 cccatttttt tatttcaaag ctgggaaagc cgctgagacc tacctccatc acctttgcct 54721 ctatacccca cacaggctgt aagcacatga tctttggtct tttatttgca tactctatta 54781 gtcttgtctt cattggaggg aagtgactgg tcagacctgg gctgtgttgc tggctgcctt 54841 ccttgttggg ctccaactta cctcctatgt acacagccct tggagttcag aggcctctcc 54901 tgacctctgc ccaacctctc gccctctgca cccaataccc cactagtaca tacaggagca 54961 gaggcaggct aggaccagag gaaagaagag aggggactga agacagactt tgagagggct 55021 cagaaaccca gtacaccccc tgtcccagcc ctgtccacct gctgctgtgg ccacagcgag 55081 agaaaagaca gctagtaaga taggaagtga ggccaggtac cttgtgggca gtgatgtcat 55141 tcggtgcgac tcctaagatg tctccagaga tgggagagct cacccaaacc aggttgcaga 55201 agatctggat tccacacagc agcggcagca gcaggctgca acggagaagg ggctgtaagt 55261 ggggtttgct tccgtgggtt taaaggtggt ggtggtagag tgggtgtaga agagcaaagg 55321 atgcacctcc ccttgccatc cactcctacc tcaacctaaa agttggaata gatgaacaac 55381 tggatgaaat agaggctagt gtcaaaaacg tgaggaaaag aatgagagag aagacaggcg 55441 gaaagagaga tgaagacaaa ggaggattag gttaagacca tcttaatcag agggatcttt 55501 aggaactttt cctttcttat gcagagcttg tttggaatgc ttcttgtttg gaatttaaga 55561 tttctagttt aggactgttc agtgatagcc tagtatgttt cattttggtg acttctgtga 55621 agtgtctggg agttggtctt gcctgactga tagtgcctgg gtgaaataag gtgatggctc 55681 ctggcatcat ctttcagggg acaggtgctg ggtacacaac agtgccatgt cactggtcag 55741 tattgctcct ctcctttctc aggccttttc tgagtctata cctgcctagg ttgggaaaca 55801 cttaatgtcc tgtctccagt gttgccttcc tggcacatat ggacttcctt gctgctctgg 55861 aaatttgaga gaggcagtct ctcagggcca gcccactggt gtggtgctca ctttggaaat 55921 ttaagaaaga tcaagggcaa aacaaaacct gtacatattt aaaacaaaat ctggagtgta 55981 ccatttccca tagataacag gcgacagtgg tggaccaagg tgcatccagc ccagggattg 56041 tggcctgagt ggcataaggt gggttaagat aaggcagctg tggactgtgg gactcagaca 56101 ccccaaaggc cagctaacca agtttccagt ctgctaatgc tagaggaggc ccaggcttag 56161 ggatattggg ggtagcagta gttgcttaga ggatctagga gcatatagga tgttaaatat 56221 agttataata tacaggaggt agggtgtggg tggaggaaga gagagaaaaa agactaggct 56281 cctattctct ctctttctta tctccaccag ctctctcact cactcatctg gagatctagg 56341 agcctggaga ctgagaggta ccctcgccca gccagccact cttggccttt tccaaggggc 56401 tgtttagttg acaaacaagc aagccattct ctgcatacac gggtcctcca gagctagaat 56461 gcttatagtc ctgggaactg tgtctttcaa gttggagaga taatatacat aaatcaccta 56521 atatagtctc tatcaaatag aaggtgaaaa ataaatatgt gcttactttt gtttggagtg 56581 tcactttcct cagctataaa acaggattaa tgcctgcaac ccagtgttat tgggagtaat 56641 tcactgattc agaggttctg gatccagact gcctaggttt gagttccagc tctatctccc 56701 cctgtgtctg tgaccttgga caactctctt aacctctctt ggcttctgtt tcctcacttg 56761 taaaaggaag ttaataacag tgtagttgtg aagattaaat gaactggagc atgggatgtg 56821 cttagaagag tgtccagcac atagtacata cgcagtaatt gttaattatc ctaattatta 56881 ttgttcttgt tatataaaca agaaagcggg tgtgtggtct tatttaccat catatcctca 56941 acatcaggcc ctgatcctgg ctcatagtag atgctcagta aatggtcact ctgtgctact 57001 tccactctac cactcaccat agccaagaat ggggctctgt gtgtacaaga cagtcactca 57061 ttaatatttg actaaatttt atcagaccat agatgcaaaa aacttagata gtggttgcca 57121 agtctggaca atgtctggtt cacttcttgc tagggaggcc agtctattgg tccctgtaag 57181 gtctgaatca cctttttctt ggagcatcct ctctagcaga cacacagtta cttaatgtta 57241 ttttgaagat taaatacatt aacatgtaaa gtacttgaac agctgtgcat attcttttct 57301 catccaacat tatgtttatg cgaaccaccc attttatcgt tgcaagcagt tgtagtttgt 57361 ttattcccct tgtgaatatc ccacagttta tccatcttac tattgttgag catttggtta 57421 gtttctaatt tttgattatg aagaataatc caatgggcat tcttgtacct ttatcctggt 57481 agagcaagtc ccttcatctt gtccttctta tggagtgtct tggctattct tggtccaatg 57541 cccttctcca taaattttgg aatcagcttt ccaagttaca cacatataca aacacacaca 57601 cacacacaca cacacacaca cacagagaga gagagaggga gagagagaga gagaactttt 57661 gggattttga tggaaattac attcaatctg taaatcagtc aatttgagga gagttgccat 57721 ctttgtgaca ttgagtcttc cctaaacacg gtatatcact ccatatattc agactttttc 57781 atgttgagtg aagttttata atttaatcta taaaaggttt gcatgccatt gtaagatttt 57841 tttggcttat tttttgttgc tatcttaaac agtacctttt tagaaaatta aattttatag 57901 cagtatgttg ctaatgtata caactgtatt tgtttttcat gtaatgaatt tgtaactagc 57961 aaacctgcta aactttctta tgagttttga taatttgtat gtaaatgaca gctttcattt 58021 tccccacaac cctcattttt atctttctta tattattaca ctgctaagac ctcccgcctt 58081 taatatacac acacgagatg tgcagtacac gttgaaaaga aatgttaata atgtatattc 58141 ttatgctgat cccgatttta agtttcacca ttgagcaaat ttgctgtagt ttttttggta 58201 gatatcattt aataggttaa agacgttcct gtttattctg agtttgccaa tacaattttt 58261 aaaagatgaa tgaatattga attttattaa acatgttttc tgtattcact gaaatgacat 58321 taattttcta ttttaaccca ttaatgtggt acaatacatt agagtttttc ttgtgttgaa 58381 ataatcttgc agtcctggga taaataaaaa cttctgaatt tctttcacta attttttgtt 58441 taggatttta gcattttgct caggaaagag attggcctgt gatttttcca agagtgtcaa 58501 gattacacta acctcataaa atcaggtggg aactatttcc tttcattctg tcctctgaca 58561 gaggttgtct gcagaactag ccattcctca aatattttac ctggcagcat ttccaggtaa 58621 aatatctgga cctggtgtct cctttaacat tttaactgat acaatctctt ttatcagtta 58681 tagagtcctt tcagtttgct atttccacta aagttagctt tcttaagctt tcctaagtta 58741 tggttttcta tgaatttttc tacttcacat agttttcaaa tttattttgt gtaaagttct 58801 tatctttgta tcttttttat ctctgttgca tttgtagtta tagcttctct ttcattataa 58861 tattatttat ttgcaccttt tttctcttga tcaatcttgc tataagtttg acacttttat 58921 aattctaaaa taatcaactt ttgacttttt gatccttttt tatctttatt ttctatttaa 58981 ctattttttc atcattattc tgtttctcct actttctttg gatttattct gttgttcttt 59041 tttaactcat agagttgcct gctgctcagc tcattcattt tcagactttc ttcttttcta 59101 atttaagtgt ccaatgagat aattggccta ctaaacaggg ctgatattcc acagattcac 59161 aatcgtatgg ccatatgaat tttcagaata tttctagtcc attttataaa taggcactct 59221 cccaggcttt tttagtctct ccaacactta cccagttacc ctggccacct caattccttt 59281 cccaggatag ccaggaccta ctgattatca tctagcaact aaaggattaa tgcctaccac 59341 agcaggaggc ctccagtact tggtgccagc ccttggacca gcttctgttt tgatagatca 59401 gtgctcttgt catttctatt ttaattccat tctggtcaga gaacaaactt tgtattattt 59461 cagttctttg aaatctgttg agacttgctt tatggaccaa cttatggaca atctggtaaa 59521 tgttccaaga gcacttgaaa agaatgtgcg ttttacagtt gtgtttagtg tactatatac 59581 ctcgattaag ttgtgtttgt taatctggct caaatgttct atattcttct ggggttttgg 59641 ctgcctgttg atatggaaaa tttgttaatg tttccttgta attctgtcaa tcttgaaagt 59701 tcgtagttac aaacaaattt agaattgcta tatcttcctt gtgaattgaa tattttatca 59761 ccatgtaagg aaattattaa ctctggtaat acctggtaat ttggtatcat tgatgttttt 59821 gccctataat caatcttttt tctcatatta atatacctac attagcttta tttagcttaa 59881 tatttgcctg gtatgtgttt tcctacattt aaaaaattgt ttttgataga gatataattc 59941 acataccata aaattcacca ttttgaagta tacagtggtt ttgagtatat tcacatagtt 60001 gtgtaaccat caccactaat tccaggacat tttctttatg ccaaaaagga attccatact 60061 ctagactcac tccagatgac cctctcccct tagcccctgg caaccactaa tctactttca 60121 gtctcaatga atttgcctat tctggacatt ttatataagg ggaatcatac aatatgtggc 60181 cttttgtatc tggctcgttt ctttgggcat gttttcaagg ttaatccatg ttgtagcatg 60241 caaagtcaca aagatatatg cctaggcttt tttctaagag ttttatagtc ttggctctta 60301 catttaggtt tttcaattta ttctgagtta atatttgcat gtggtgtgag atgggaatcc 60361 agcttcgttc ttttgcatgt ggctatccag ttgtcctagc agcatttttt gaaaagactc 60421 ttctttcccc tacttaattg tcttggcacc cttgatgaaa atcaggtgac tataagttga 60481 agggtttact tctgggctct cagttctatt ccatttgttt atatgtatct ctacccttat 60541 accagtacca cacattcttg attattgtag tttaattgta attttgaaat tgggaagtat 60601 gtcttctaat tttggtcttt ttcagattgt ttttatgatt gttttggcta ttctaggtcc 60661 tttgcatttc catataaatt ttaagaccaa tttattaatt tctgcaaaga attaagctga 60721 aactttgaca gaagctgcat ttaatctata gattagttag ggaatattac catcttaaca 60781 ataataagtt ttaggtccct gaacatgaga tgttgttgca tttacttagg tctttaatta 60841 tattcaccaa tgttttgcag ttttcagagt gttttacatt ttttaaagtg ttcttttaag 60901 cattttattg ttttgatgct attgtaaata aaattgtttt acttaatttc attttgtttg 60961 ctcatggcta ctttatagaa atgtaattga tttttatata ttgatcttat attctgcaac 61021 attgctggac tcatttatta ggtccaatgg tgttttttag tgcattcctt gggattttct 61081 atatacaaga ttatgttatc tgcaaataga aataatttca tctcttcctt tccaatccag 61141 atgcctttta ttttcttttt cttacctaat taccttggct ataacctcta ctagaatgtt 61201 gaaaagatga gatgaaaaca gacatccttc tgttgttact gatcttggca ggcggcatgg 61261 gagaaactat tcaatctttc accattaagt gtgatattca ctgtgggttt ttcattggtg 61321 tcttttatta ggtttagggc ctttccttct agttttaatt tgttgttggt tttatcatga 61381 aggggtattt aatttcccaa atgctttatc tgtgcctatt atgatgatca tttggttttt 61441 ttgtccttaa ttctattgat gtgatctgtt gcattaattg attttcaaag gttaaatcaa 61501 gcttgcattc ctataataaa tctaacttga ttgtgatgta aaaagtataa gtataatata 61561 tacatatata taatgtatac atataaatcc ttttatacat tcctgggttc agtttgctag 61621 tgttttgtgg ggatttttac atccatattc acaacagcta ttggtctata attttctttt 61681 cttctaatat atgtgcctaa ctttggtagc acggaaatac tggcctcata gaatgagttg 61741 ggaagtgctt cttcctcttc tactttttga aagagctttt gaagaatcgt attattcttt 61801 aaatgtttgt agaattcacc agtgaaacca tctaatcctg gacttttctt tctggcaagg 61861 tttttcatta cttattcaac cccttgttat agatctattt agatttcatt tcttttttag 61921 tcagttttga gagtttgtat cattctaata atttatctat ttcacctaag gcatttattg 61981 tgttggcata cagtttatga ttgttcataa tattccctta taattctttt tattttcata 62041 aagtcagtag taatgtctcc tctttttttc ctgaatttat taatttgagt tttctctttt 62101 ttttcttggt cagtctcact aaaggtttgt caagtttatt aattttttca aataaccaac 62161 ttttgatttt gttgattttt ttctaatgtt ttcctagtct ctgttttatt attttcacct 62221 ctaatcctta ttatatcctt cttctgtttc ctttaggttc ccatatttgt gaatttccca 62281 aagttctctc tgttattaat tttttatttt actccattat gattggagaa catactttat 62341 atgatttcca ttcttccaaa tttattgaag cttcttttat ggcctagaat attgtctctc 62401 ttggagcatg tttgatgtac actttggaaa aatgtatagt ctgctgtttg gagatggagt 62461 gttttacaga tgtttgttat tactactttg tgttgaagtc ctctatttcc ctgttgttga 62521 tcttctatgt agttgttcta tccattattg aaagagtggt cttgaagttt ccaactattg 62581 tcagtggatt gtctatttct cccttcaatt ctgccagttt tgcttcatgt atttggcagc 62641 tctgttgttt aggcatatac gtttatactt gtgatatatt cattatggat tgaccctttt 62701 gtcattattg aatgtcccta tatatctcta gtaatgtttt tgaacatcca ttttgtctca 62761 tattagtatg gtcactccaa atttcttgtg gttgttgttt gcatgatgta ttacttacgt 62821 tattttactt ttaatctatt tgcttctttc tcttttttct tctttctttt ttagaataga 62881 gatggagtct cactatgttg cccaggctgg tctcaaactt ctggactcaa gggatcctcc 62941 caccttggtt tcccaaagtg ctcggattat aggcgtgagc caccaggcct gccttgtatc 63001 tttcagtcta aaacctatct tctgtaggca gcatgcagtt ggatcttgtt tgtttgtatt 63061 cagttagata ttctctacct tttgattggg ttgattaatc tgttctgatt taaaatcatt 63121 attgatatag ttggtttaaa tatgcagttt tatttttcca tagagctcat gtctttttct 63181 ctctattctt cctttactgc tttcttttgc attaaatgaa ttattctagt ttaacttttt 63241 aattcatttg atcatttttt attatatatt ttttaggtat attcttcgtg gttgttctag 63301 ggcttttcac atacatcata catcagtgtg cttcagattt gtactaattt aatttcaatg 63361 agatataaaa attttactca tatatagctc tctttcacct tcctttttga gctattataa 63421 tatatatatt atatctgtat atgctaaaaa ctaaataata catcattata agtattattt 63481 tatataattt tatgttttaa ggaagctgag aaaattaagg agaacatgta tgtatttatt 63541 attttctatt attttcattt gttcctgtgg gtttgaatta ccttctggta tcatttcctt 63601 atttctatat aactctttgc tttgtgactt ggctaggcta tttttaaagt atgtttccct 63661 gcagtgtgaa cccttcagtg ttgcttcttg gagtgcatag tttttggctt gcaggtaatt 63721 tgggctttct ttgacttttt cccagtctct ctattaagct gtcagcccac atgggggtat 63781 tacactctag gctccactaa ttactagctg aatgctgtat tttgtttttg ttttttgttt 63841 gtttgtttgt ttggcagtgt cctgaagcat gcattgccca gcagccttat ctaattaaac 63901 tctagcagga gtagtttgtg aagtaagtct tggaggtttc ttcatacccc agaagggctc 63961 ttcttacctg tccctctctc tcttgttctc tctgatgaac aagctggcct atgttgctgt 64021 tagttaagtg aacttgtagt ttttgagaac atccttgggc ttgagcttct ttgtactatg 64081 ttaaaaataa agttagtttt ggggggagag tttcagagct ctctcatctt atgaatttct 64141 tctctccatg ggtatactgt ctgagcactg ttctgggtgc tgggcgagac agcagcttct 64201 ggtcttagtt tgcctatcct ggtgtggaac ctaccctgtg agcaagatgg aatgagggta 64261 atcaggacct cagtattctt agcctattgc acctggtata tagcctccat tctatgagta 64321 aaaactggga ggaggaaggg atcctccaac ctcttaacca cacttgccag gaatttagcc 64381 tctgcaattc tgagctgaag ggaatgctaa atgctagctc tctgctcctt ccagtaagat 64441 gccaaagtat tgatgggaca ttctctccaa gacgaaaggg agcttcatct tggctataca 64501 caccttaagt ggagcttcta tcaagctaag cttggagcca gtagtgaggg gacagtagta 64561 aggtcgggaa cagtagcatg gcacatgtat caaagactgt gttcttacca catttcagta 64621 gattttcttg aataaaaata tcttcatgta ctgtattcca ttaagacaat ttccagagat 64681 tttaaaaggt tgttgtgttg taattttgcc agctgagctt gttttattag agaatgggtc 64741 tttagagctc ctcatgctgt catgccagaa atgaaagtct caagattgtc tttgtgtgtt 64801 gatcttgtat cttgcagcct tgatgaactc acttgttagt tcgaggaatt tctgtagttt 64861 agggggatat ttctatgtag acaatcatga catttgcaaa tagggatttt ttaaaaatta 64921 ttttccaaac cgtatgcctt ttatttcctt ttattgcctt attgcagtgg ttagaacttc 64981 tagtactatg tcaaaagagt ggagaaggtg gaaatcctta cattagtcct gatgttaggg 65041 ggaaagcatt tagtctttca cttttaagta taatgttggt gatagttttt gtttgcttta 65101 tagaagtttc ttatcaagtt gaacttaaac aaatttacaa gaaaaaaaca accctattaa 65161 aaagtgggca aaggacatga acagacactt ttcaaaagaa gacacacata cagccaacaa 65221 gcatatgaaa aaatgctcaa catcactgat cattagggaa atgcaaatca aagccacagt 65281 gagataccat ctcacaccag tcagaatggc tatcataaaa aagtcaaaaa ataacagata 65341 ctggtgagat tgcagaagaa agtgaatgtt tatacactgc tggtgggaat gtaaattagc 65401 tcaaccatgt ggaaagcagt gtagtgattc atcaaagagc taaaaacaga actaccattc 65461 aacccagcaa tcccattact gggtatatac ccaaggggat ataaatcgtt ctattataaa 65521 gacacatgca gacatatgtt cattgcagca ctattcacaa taccaaagac atggaatcaa 65581 cctaaatgtc catcagtgat agactggata aagaaaatgt agtatgtgta tgccgtggaa 65641 tactatgcag ccataaaaaa gaatgacatc atgtcctttg cagggacatg gatgctggag 65701 gccattatac ttagcaacct aaaaggaaca gaaaaccaaa tactgcatgt tctcccttgt 65761 aagtgggagc taaatgatga gaacacatgg acatagaggg gaacaacaca catttgagac 65821 ctacctgagg gtggagggtg ggagaaggga aaagatcagg aagaataact aatgggtgct 65881 agttaatagg taataggtac acctgggtga caaaataatc tgtacaacaa actcttgtga 65941 cattaattta tctcactaag aaacttgcac atgtacctct gaacttaaaa taaaaaataa 66001 aaatataaaa tcttaagctt aaccattcag aaaccaccaa ctaacctcta actatggact 66061 ttctacttta agcaatcaaa tatttatttc gtctttcttc tgagaacacc ttataaaagt 66121 tttctcttga tccccctcag tggagccctg aactgcttgt gtattgtgct tcccaattca 66181 caaattgctg aatgctcaaa ctcattaaaa ttatttaaag aaaggagtgc ctatcaagtt 66241 taggtaattc atctctattc ctaacttgct gaaagttttt atcattaata ggttttatga 66301 agtgctttgt cagtatgtat ttatttttct tctttagttt gttgataggg tggattacat 66361 taattgattt tttgaatttt attctagtct tgcatacctg tactaaatcc cacttaatca 66421 ttgcatatac agtcatccct cagtatctgc agaggattgg ttctaggaac ccctgcagat 66481 accaaaatcc atgaatgctc aagtcccttt tataaaatgg cctagtgttt gcatataact 66541 tacacatatc ctctcattta ctttaaatca tctgtagatt atacctaaca caatgtaaat 66601 gctatgtaaa tggttgttat actgtattgc tttttatttg tagtgtttta ttcttgttat 66661 ttttattgtt tttaaaagta ttttctatct gctgttggtt gaatctgcag atgtggaacc 66721 cacagatatg gaggactgac tgtaattatt ttcatgtaat gttggattct gtctgccaat 66781 attttgttga ggattttttc atctaagttt atgacagagt ggactataga gtttttgggt 66841 ttttggtaat tcatttgtct ggttttgata tcagggtaat tctggcctca taaaattagt 66901 tgaaaatatt ccctcctctt cttatttctt tgaaaatgtt gtttaaaatt tgcattaatt 66961 cttctttaaa agtgttacag acttttgaag taaacacatt tgggtctgga gatttctatt 67021 tctggagatt ttatgttgaa ctaaattatt ttaatgttta taggattatt cgagtgcctg 67081 ttttattttt ggttgacttg tgagaggctg tggttttcaa ggaattgatc catatattct 67141 aaatagataa atttacaagt gtaaagttgt ttgtagtagt taagtattaa atagcttctg 67201 gcatccagtg ctttccgcac gagataagca aatttcagat gctttctctc tctctgaaag 67261 tgcctgactc ttcagatttc aaggtagtag ttttctctat ggcctcagtt ctctaatggg 67321 tcccagaaaa gtcattgatt ttcagtgtga cccaattttt cttattgtaa agatgagaat 67381 aagaactccc aagctcttgg aactgaaact agaagttcct tgcttagttg ttttataaac 67441 ccatgaaata gacattttta ttgttgtttc tcaagtcagt gttggcttag tttgacacac 67501 atgtttacca ttttacttgc tcaccatttc ttcttgtctc ttagcccttt cttttggtac 67561 tatttttctt ctgtctgagg tacattccat agaaccctta ttgagggtct gttggtgata 67621 aattctctta gagttgtttg catgaaaatg tgtttatggc atcctcattc ttgaagagac 67681 ttatttctga gtatacaatt ctgggttaaa agttattttc tcatagaact tcagtaatat 67741 tatttcacta tctctggttt tcattattgt tgttgagaag tcaactgcga atctaattgt 67801 tactcattag gtaaactgtc ttttatcttt gattgctttt taagatctat tcagctttag 67861 tgttctgaaa ttttattatg atttttctaa atgttgcttg tttttttttg tttgtttgtt 67921 ttgtttaaag ttgggcttcc ctaacttgag gactggtatc tttcatcaat tcttttcttt 67981 ctggaactgt aattaaaccc atggtagact ttatcattct agccatgtgt tttaatctct 68041 attttatatt tccatatctt gttattccta cagtactttc tatataattt cttcagattt 68101 attttccagt tcattaatta tctcatcacc tctgtttaat ttgctgttta accctcatac 68161 attgagtttt taattttaat gctctcattt ctaatttcta gaagctgtat ttgattcttt 68221 taaaaattca tgtagtcttt ctttgatagt ctcacatttc gttgacatac tgttgatacc 68281 ttcttcattt ctttaaacat acaaccaatg cttatttcac agtataaatc agataattcc 68341 aattatagct tttgcaattc tgattctttg gtttgtttct gtggattctt attcatacag 68401 gtttttcgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgttggcttt 68461 ttttcttcct acacgggtaa tgtacatccc agccccaaac ccacataaat gcaggactgg 68521 gctaggaaat ctcaggaact gttcactctt ttgtttccct ccacgattga gccaaggttg 68581 agacagtctc agttaggtgc ttcttcttca cccgctgatt acgtcacctt tgtctggtcc 68641 acactttagt caggctctca gatccaactt cctactttga gcccaaagag ttttgtttac 68701 tatcttctat acgtgctatc tcttaaagcc caagttttaa gccagaaatg tgtcaatcct 68761 ttgacaatgg cccagagaag aaacattgtt ttttactttg ccactcaata aacacacacg 68821 tttgtgtgtg tgtgtgtgtg tgtgtatgtt aaatcaaaaa atatgctttt tatgatccac 68881 tgtattattg tgtcaactac tgtgtaataa attaacctca aaacgtagtg gcttaaaaca 68941 gcaaacatct attttgtctg tttctgtggg tcagagatca gtgcacagct cagctgggtc 69001 ctctggctca gtatctctaa taaggctaca atcaaggtat tggctgaagc tttggtcatc 69061 tcaaggttct actgggtagg agtgagtgga tagggatcca cttccaactt actcacaagg 69121 ttgttggcaa gattcagttc ctcataacat attggaatga ggtccttagt ttctcactag 69181 ctcttgacca gagatcatcc tcaacttctt gccatgtggg tctctcccta gggaagctca 69241 caacatgaca gctggcttcc aacagagcaa gtcaatgaga gagccaaaga ggaccaacaa 69301 aatggacggc cagtcttttt gtgataaaat ctcaggagtg atatcccatc acttttgcct 69361 tattttatgg gttaaaaaca agtcactaga ttcagcccac actcaaaagt aggtgactac 69421 acaatgtcat gaataacaag aggtatggat cattaagggt gatattgtat gctgcctatt 69481 gcacccactg aattgatttc atgtttgtga aacactgctc caggctacca gggtaaacat 69541 ggattttagt gcttttttat ctctttagat cagtaattat tttctcattc ctggcctctg 69601 aggatttctg ttggtttcac accagctcag ccatgcttta tttgttttaa atccattatt 69661 tttaggtgtt ttgtatttgg aaggactctt tacaaatata gcctaaaata tcaccagaaa 69721 tagaaatctc tcttctatat tctaccatat acaacttgag taagataccg tactacttaa 69781 aattacctga aagtttacag aagagcttga ggaaaatgtg accagctgtc ataagatcct 69841 tttctaaaaa catgattagg tgataggata aataagagtt ttatccacct gaagtgtcta 69901 gataggaagg gaaagcattc aggaagtgaa tggctactac ctaagtctgt cctgggagta 69961 agctttttta ttttttagct ttttattttt aaataatttt agatttacag aacaattgta 70021 gaattgctgt atacctttca ttcagcttcc accaatgtta tcactttaca taagcatggt 70081 atatttgtca aaactaataa attaacatgg gtaagttact attaactaaa ctacataatt 70141 tattcagact tcttcagttt tttcactaac gtcctttata tgttccagga tccaatccag 70201 cataccacgt tgggattaat tgtcttattt ccttagaatc atccaatctg tgacagcttt 70261 tcagtctttc cttgtttttc ataactttga caattttggt caagtgtttt gtgaaatgac 70321 cttcaatttg agattttctg atattttctc atgtttagac tggggctgtg ggttttaagg 70381 agaataccat agaggtgagg tatccttctc atcacatcac atcagaggga agatgatatc 70441 aacatgactt attgctgatg atattaacct tgaactgtta gattagttgt ggtctgccag 70501 gttactattt gtctttccca tactgtattc tttggaagtg aatcattatg tccagcccac 70561 actcaagggg aaggaaatta aatgccaccc cctagaggga gaagtatcaa ctaatttgtg 70621 gaaatttatt aaaaacactg taataatttg ggtaaaatta tttgagccta tgcaactctt 70681 cagattctcc ttaaagtttc actcactaat tttagcatga atcagtggat tgtgcctgtg 70741 gcaattatta cagtagtgtt ctaatgatga ttttctgctt cccttattta ttctacattt 70801 attatttgga atccctctat aaagattttt cctttctccc ctacttattt attcaacaat 70861 gtatatatat aagtgtgtac tcataaatat ttaattattt gggttataat actcaactta 70921 tttattttgt tggtcacctt gtcccagctt tgcccattag gaactctttc atattggctt 70981 ctgcgttctg ttctattttt agtagacttt atattttaca gtagttttag gtttgcagca 71041 aaattgagca gaaggtacca agatttccca tattctccct tcctcccctc cacatgcata 71101 gcctctccca ttatcaatag ccactgccac aggggtattt gttacaaatg atgaacctac 71161 attgacacat tgctatcacc cagagtccat agtttacttt agggttcact cttggagttg 71221 cacattctat aggcttgagc aaatttataa tgacatgtat gcagtattaa agtagtttcc 71281 ctgccctaag aatcctcttt gctccactta ttcatccctg tcttttcccc tcctgtgtcc 71341 ctttgatata cttctattct ccttattttt ctagcacttc gttacttttt tacaagataa 71401 caccagcctc ttctccaaga aaccatggtt cctgtaattg aaaattggta tttggaaacc 71461 aagatctgga tgcttttgtt acttttaggg tatcattaat tctagcccgc tcagcagaca 71521 gagctaagaa atatttgtgt gcatgctaac ccgtctgtat tttgatgtct atttatatat 71581 acgttaaaaa taaacatgag ttaataccat ttctgaccct aaattgcact ataaggcatc 71641 attctagacc ttctcccttg tttactatga gcttctttct tagactgtga aatacctgac 71701 tctcattatc tacaagttat ttatttattt atttatttgt ttgtttgttt gtttatttat 71761 ttattccacc ctactacact ggtagtttca gaattgttaa ctcatacccc tgggagaaac 71821 aattttatca attagagtac agtgtctaca cacaattact ttctgtcatt agtcttaaaa 71881 tattcagcaa aacactattt tcgtggttat ttatttcagg ttttctttct ccaccccatt 71941 tagtgagatt atgatatata tacttgtaat atagttagaa ccatttgttg caaactgcat 72001 tcaatattga gttgctcaga cattttggtt gatttataaa aacttgaatg caataaaatt 72061 cagttttgtg agttttgaca aacgtataaa gtcagatatc ctcttccata gtaccgtata 72121 gaacaattcc atcaccctaa aaattctcct gtacagccca ttatagtcaa cccttcagac 72181 ctcccccaac ccctggcaac tactgagcag ttctctgttc ctataatttt gccttgtcca 72241 ggatgtcata taaatggaat tatatcacat gtcagctttt ggatatggct tctttctctt 72301 agcataatgc atttaagatt catcctagct gtatgaatat caacagttat cttaattgcc 72361 gtatggtatt ctagtttatg attgtaccac aatttgttta tccattcacc aactgaagag 72421 catcttggtt gtttccagtt tttagtgatt tttagtaagg ctccataaat attctcatat 72481 acatttttgt gtaaacttaa tttttcaatt cccttccaaa ggtatgtatg agtgaaaatg 72541 ctaggtcata tgtcaagtgt atgttaaatt ttaaaagtaa ctgccaaact gttttccaaa 72601 atgcctgaac cattttgcat ttccactagc aatgagtgag agttcctttt gtaatacatc 72661 cttgccagca tttcatattg tcatgtaaat ttatatgtgt atttaattct agccatttta 72721 atagctgtat cttgaagtgt ttctcattac acatttaatt ttcattttcc caatgattga 72781 tattatcttt taatacactt gtttgccatc tgtacatttg ggttggtgaa atgtttgttc 72841 agatgctttg ctaattttta aaactaaatt ttatctttta ttgttcagtt ttaagtgtta 72901 tttatatatt ttggatatac cagatatata atgtacaagt atattctttc attctgtggc 72961 atttttcatt cctttgtgtc tttcacagat aaaaaaatgt taatatgtaa atatttaaca 73021 atttttattt taaaattttt cgtaaataaa aatgtttaaa tttttatagt tcaatttatc 73081 tatttttttc ttgtgtggat aatgcttttt gtgatgtaac taaaaactaa ttggcatatc 73141 caatatttta cattctctcc caagtccttt ttttagaagt tttatagttt tacattttat 73201 atttagacct atgatccact ttgaattatt ttttgggtaa ggtttaagta ttcaagttca 73261 ttttttgcag tggacttcca atcgttctag caccatttgt tgaaaagact ctattttctc 73321 tattgaattg actttgtacc tttgtccaaa attagttgac tttgtccaaa atttgagtgg 73381 gtctatttct gggctcttta ttctatttca ttgatctaga ttgctatcct ttttccaata 73441 ccacactgtc ctgagtattg tggctttata ataagtctca aaatcatgta gtaaagttca 73501 gtccaccaac attgttttat ttatttattt attttcaata gtttttaggg tacagatggt 73561 ttttggttac atgggtaagt tgtttagtga tggtttctga gattttggtg catccgtcac 73621 ctgagcagtg tacactggcc ccaatgtgca gtcttttatc actcaccctc cttccatcct 73681 tccccgcaag tccccagagt ccattatatc attcttaccc ctttgcatct tcataactta 73741 gctcccactt ataagtgaga acataacaat atttggtttt tcattcctaa gcgacttcac 73801 ttaaaaaaat ggccaccatc tccatccaag ttgctgcaaa ggccattatt tcattctgtt 73861 ttatggctga gtagtattcc atggtgtata tataccatgg tatatgatat aaagaaaatg 73921 gatacatttt ctttatccac tcgttggatg atgagcattt aagttgattc catagttttg 73981 caattgtgaa gtgtgctgct ataaacatgc gtgtgcacat gtctttttca tataatcact 74041 tcctttcctc tgggtagata cctagtagtg ggattgctgg atcaaatggt agttctactt 74101 ttagttcttt aaggaatctc catactgttt tccacagtgg ttgtactagt ttacattcct 74161 accagcacta taaaagtgtt tccttttcac cacatccatg ccaagatgta ttttttttta 74221 tttttaaatt atggccattc ttgcaggagt aaggtggtat ctcattgtgg ttttaatttg 74281 catttccctg aaagatagtg atgttgagca ttttttcatg tgtttgttcg ctgtttatat 74341 atcttctttt gagaattcta tattcatgtc ctttgcccac tttttgatgg gattatttgt 74401 tttttcttgc tgagttgttt tgagtttctt gtatattctg gatgttagtc ctttatcaaa 74461 tgcatagttt gcgaatattt tctcccactc tgttggttgt ctgtttactc tgctgattat 74521 ttcttctact gcacagaagc attttagttt aattatgtcc catttattta tttttatttt 74581 tgttgcattt gcttttgggc tcttagtcat gaattctttg cctaagccaa tgtctagaag 74641 agttttccca atgttatctt gtaaaatttt tatgatttca ggtctgaggt ataggtcttt 74701 gatccatctt gagttgattt ttgtataaag tgagagagga gggtccagtt ttattcttct 74761 atatgtggct tgccaattat cccagcatca ttaattgaat agggtatcat ttccctactt 74821 tatgttattg catgctttgt taaagatcag ttagctgtaa gtatttggct ttatttctgg 74881 gttccgaatt ttgttccatt gctctatgtg cctgttttta taccagtacc atgctgtttt 74941 ggtaactata gccttgcagt ataatttgaa gtttcagcaa cattgttttt attttgcaaa 75001 ttgttttggc cattgtagtt tcttagccct tccatatgag tttcaggatc agcttgtcta 75061 tatctaaaat atcctgatag ggtttttatt ggtattgtgt tacgtatata ggtcactttg 75121 gggagaatag acatctttac catattgagt attccagtcc atgaacacag tatgtctatt 75181 tatgtagctc tttgattttt taacaacatt ttgtaatttc cagcatacag atcttttata 75241 catgttttat tggtgttata cctaaagatt ttgtcttatt agaactattg taaatttttt 75301 ttaactttga ttttgaattg cttacagaac tctccacccc aaatcaacag aatatacatt 75361 ttttccagca ccacaccaca cctattccaa aattgaccac atagttggaa gtaaagctgt 75421 cctcggcaaa tgtaaaagaa cagaaattat aacaaactgt ctctcagacc acagtgcaat 75481 caaactagaa ctcaggatta agaaactcac tcaaaactgc tcacctacat ggaaactgaa 75541 caacctgctc ctgaatgact actgggtaca taacgaaatg aaggcagaaa taaagatgtt 75601 ctttgaaacc aacgagaaca agacacaaca taacagaatc tctgggacgc attcaaagca 75661 gtgtgtagag ggaaatttat agcactaaat gcccacaaga gaaagcagga aagatccaaa 75721 atggacaccc taacatcaca attaaaagaa ctagaaaagc aagagcaaac acattcaaaa 75781 gctagcagaa ggcaagaaat aactaagatc agagcagaac tgaagaaaat agagacacaa 75841 aaaacccttc aaaaaattaa tgaatccagg agctggtttt ttgaaaggat caacaaaatt 75901 gatagaccgc tagcaagact aataaagagg aaaagggaga agaagcaaat agacgcaata 75961 aaaaatgata aaggggatat caccaccgat cccacagaaa tacaaactac cgtcagagaa 76021 tactacgaac acctctatgc aaataaacta gaaaatctag aagaaatgga tacattcctc 76081 gacacataaa caccctccca agactaaacc aggaagaagt tgaatctctg aatagaccaa 76141 taacaggctc tgaaattgta gcaataatca atagcttacc aaccaaaaag agtccaggac 76201 cagatggatt cacagccgaa ttctaccaga ggtacaagga ggaactggta ccattccgtc 76261 tgaaactatt ccaatcaata gaaaaagagg gaatcctctt taactcattt gatgagacga 76321 gcatcatcct gataccaaag ccgagcagag acacaaccaa aaaagagaat tttagatcaa 76381 tatccttgat gaacattgat gcaaaaatcc tcaataaaat actggcaaac caaatccagc 76441 agcacatcaa aaagcttatc caccatgctc aagtgggctt catccctggg atgcaaggcc 76501 agttcaatat atggaaatca ataaatgtaa tccagcatat aaacagaacc aaagacaaaa 76561 accacatgat tatctcaata gatgcagaaa aggcctttga caaaattcaa cagcccttca 76621 tgctaaaaac tctcaataaa ttaggtattg gtgggacgta tctcaaataa taagagctat 76681 ctatgacaaa cccacagcca atatcatact gaatgggcaa aaactggaag cattcccttt 76741 gaaaactggc acaagacagg gatgccctct ctcaccactc ctattcaaca tagtgttgga 76801 agttctggcc agggcaatta ggcaggagaa ggaaataaag ggtattcaat taggaaaaga 76861 ggaagtcaaa ttgtccctgt ttgcagacga catgattgta tatctagaaa accccattgt 76921 ctcatcccaa aatctcctta agctgataag caacttcagc aaagtgtcag gatacaaaat 76981 caatgtacaa aaatcaccag cattcttata caccaataac agacaaacag agagccaaat 77041 catgagtgaa ctcccattca caattgcttc aaagagaata aaatacctag gaatccaact 77101 tacaagggat gtgaaggact tcttcaagga gaactacaaa ccactgctca atgaaataaa 77161 agaggacaca aacaaatgga agaacattcc atgctcatgg gtaggaagaa tcaatatcat 77221 gaagatggcc atactgccca aggtaattta tagattcaat gccatcccca tcaagctacc 77281 aatgactttc ttcacagaat tggaaaaaac tactttaaag ttcatatgga accaaaaaag 77341 agcctgcatc accaagtcaa tcctaagcca aaagaacaaa gctggaggca tcacgctacc 77401 tgacttcaaa ctatactaca agcctacagt aaccaaaaca gcatggtact ggtaccaaaa 77461 cagagatata gatcaatgga acagaacaga gccctcagaa ataatgccgc atatctacaa 77521 ctatctgatc tttgacaaac ctgagaaaaa taagcaatgg ggaaaggatt ccctatttaa 77581 taaatgatgc tgggaaaact ggctagccat atgtagaaag ctgaaactgg atcccttcct 77641 tacaccttat acaaaaatta attcaagatg gatgaaagac ttaaacatta gacctaaaac 77701 cataaaaacc ctagaagaaa acctaggcat taccattcag gacataggca tgggcaagga 77761 cttcatgtct aaaacaccaa aagcaatggc aacaaaagcc aaaattgaca aatgggatct 77821 aattaaactc aagagcttct gcacagcaaa agaaactacc atcagactga acaggcaacc 77881 tacaaaatgg gagaaaattt tcacaaccta ctcatctgac aaagggctaa tatccagaat 77941 ctacaatgaa ctcaaacaaa tcgacaagaa aaaaacaaac aaccccatca aaaagtgggt 78001 gaaggacatc aacagacact tctcaaaaga agacatttat gcagccaaaa aacacatgaa 78061 aaaatgctca ccatcactgg ccatcagaga aatgcaaatc aaaaccacaa tgagatacca 78121 tctcacacca gttagaatgg caatcattaa aaagtcagga aacaacaggt gctggagagg 78181 atgtggagaa ataggaacac ttttacactg ttggtgggac tggaaactag ttcaaccatt 78241 gtggaattca gtgtggcgat tcctcaggga tctagaacta gaaatgccat ttgacccagc 78301 catcccatta ctgggtatat acccaaagga ctataaatca tgctgctata aagacacatg 78361 gacacgtctg tttattgcgg cattattcac aatagcaaag acttggaacc aacccaaatg 78421 tccaacaatg atagactgga ttaagaaaat gtggcacata tacaccatgg aatactatgc 78481 agccataaaa aatgatgagt tcatgtcctt tgtagggaca tggatgaaat tggaaatcat 78541 cattctcagt aaactatctc aagaacaaaa aaccaaacac cgcatattct cactcatagg 78601 tgcaaattga acaatgagaa cacatggaca caggaagggg aacatcacac tctggggcct 78661 gttgtggggt ggggggaggg gggagggata gcaatgggag atatacctaa tgatagatga 78721 cgagttagtg ggtgcagcac accagcatgg cacatgtata catatgtaac taacctgcac 78781 attgtgcaca tgtaccctaa aacttaaagt ataataaaaa aaaagaatca aatggatatt 78841 ttggactaaa aatataatag ctgaaataaa aacttaggaa tgaaagagta gatagaaatt 78901 tcataggtat tacaatgaca agagtatata atagcaattc taaacacata aattcgacaa 78961 catagatgaa acaaattcct caaaaagaac aaactggcaa accttatcca aaatgtaaca 79021 aataatctaa ataatctaaa tccttctata aatattactg aaattgaatt tgtaattaaa 79081 aatcttcccc caaaataccc aggtccagac ggtttcactg gcaaatttta ccattcattt 79141 aaagaagaaa tagcatcaat tctatacaat ctctctatga aactagaaga caggaaaata 79201 catatcaact tactctgtga ggccagcatt accctgatat aaaaaccaga aaaataagtg 79261 atacaaaaaa agaaaactcc aagtcaatat ccctcatgaa cataagaatc ctaaaaaaat 79321 gcaaatgaaa tttggctata tataaaatag ataattaata taccacaaga aagcacggtt 79381 tttcccagga atacaaggct ggttcaatat atgaaaatca atcaaagcaa tccattctaa 79441 agatgtaaaa ccccatggta tggtttgaat acatcctcca aagttcatgt gttggaactt 79501 aatctccaat gcaacagtat tgagaggtgg gacctttaag aagtgattag gtcatgaggg 79561 ctctaccttc atgaatagag gcttcttccc tcatgaatga attaatggca ttaataatgg 79621 aatggtttag ttaactcaga aaacagttcc tgataaaagg ataagtttgg cccccacact 79681 cgcatgcaca ggcacgcaca taagcacact ctctcttgcc ctttcacctt ccacaatggg 79741 atgatgcagc aagaaggccc tcaccagatg tggccacgtg atcttggact tcccagcctc 79801 cagaactgta agaaataaat ctctgttctt tataaattag ccagtctgtg gtattctgtt 79861 ttagcagcac aaaatagatt aagatgcccc aaattatatc agcagatgaa aaaaactttt 79921 tacaaaatac aatatccatt cgtaataaaa ctcttagcaa tttggaatag aagggagctt 79981 ccttaaccag ataaagaata tttacaaaaa atctacagct aatgtcatac tttatgttga 80041 cagactaaat gctttctctc taatatcagg aacaaggcaa ggatgtccac tctcacgact 80101 cccattcaac atagtattgt aactcccagt caatattata aggcaagaaa ggtaatataa 80161 ggcatgcaaa ttgtaaagga agaaataaat ctgtccctgt tcatccataa tatgattgtc 80221 tgtgtagaaa atcccaggaa atctataatt tattttatgt tttatttttt gagacacagt 80281 cttgctctgt cacccaggct ggagtgcagt gccatgatca cggctcactg caacctctgc 80341 ttccgaggtt caagggcttc ttatgactca gccttcatag tagctgggat tacaggcatg 80401 tgccaccaca cttggctaat ttttgtattt ttagtagaga tggggtttca ccatgttggc 80461 caggctggtc tcgaactctt ggcctcaagt gatctgccca cctcagtctc ccaaagtgct 80521 gggattatag gcatgagcac tgcagccggc ccaaaatcta tttttaaaaa cctttctagt 80581 accaataagc aagtttatca agttcacagg atacaatgtc aacacataaa agtcaatcat 80641 aatattatat acaatcaata attcattttt ttattttttt agttttatta ttattatact 80701 ttaagtttta gggtacatgt gcacaatgtg caggttagtt acatatgtat acatgtgcat 80761 gctggtgtgc tgcacccatt aactcgtcat ctatcattag gtatatctcc taatgctatc 80821 cctcccccct ccccccaccc cacaacaggc cccagagtgt gatgttcccc ttcctgtgtc 80881 catgtgttct cattgttcaa tttgcaccta tgagtgagaa tatgcggtgt ttggtttttt 80941 gttcttgaga tagtttactg agaatgatga tttccaattt catccatgtc cctacaaagg 81001 acatgaactc atcatttttt atggctgcat agtattccat ggtgtatatg tgccacattt 81061 tcttaatcca gtctatcatt gttggacatt tgggttggtt ccaagtcttt gctattgtga 81121 ataatgccgc aataaacaga cgtgtccatg tgtctttata gcagcatgat ttatagtcct 81181 ttgggtatat acccagtaat gggatggctg ggtcaaatgg catttctagt tctagatccc 81241 tgaggaatcg ccacactgaa ttccacaatg gttgaactag tttccagtcc caccaacagt 81301 gtaaaagtgt tcctatttct ccacatcctc tccagcacct gttgtttcct gactttttaa 81361 tgattgccat tctaactggt gtgagatggt atctcattgt ggttttgatt tgcatttctc 81421 tgatggccag tgatggtgag cattttttca tgtgtttttt ggctgcataa atgtcttctt 81481 ttgagaagtg tctgttgatg tccttcaccc actttttgat ggggttgttt gtttttttct 81541 tgtcgatttg tttgagttca ttgtagattc tggatattag ccctttgtca gatgagtagg 81601 ttgtgaaaat tttctcccat tttgtaggtt gcctgttcag tctgatggta gtttcttttg 81661 ctgtgcagaa gctcttgagt ttaattagat cccatttgtc aattttggct tttgttgcca 81721 ttgcttttgg tgttttagac atgaagtcct tgcccatgcc tatgtcctga atggtaaagc 81781 ctatttatat cttctgtttc ttcactgagt gcttctgttt taaatatttg ttccaaaatc 81841 gcttgttatt gttcattcaa gcatttttat tatagctaat ttaagataat ttaatgtaat 81901 ctcatcattc atgtctgttg tctttttcca tgttaattga aatttttatg gttatttgca 81961 tgctgattaa ttgtagtttg tatcctggac attgtgagta ttgtgttgag attctgtatc 82021 ttgtttatat cctatggaga aggtctatat ttttgtttta gccttttata tattttggtt 82081 cccacatcaa ttcaattttc cagcctttct cagtactata taggtctgtc ctgcatgtgt 82141 accatctagt ggttagtctg agacttgggt ggaggtgtaa tttgtagttg agtttccaaa 82201 tctttggtat gctaatgaag atcaaaggag tcacaagtca gaggtaagcc caggggttca 82261 aatcaacttt acactgttgc tttttcaaaa ctcttccctt tcctttatct tttcagtact 82321 ttccagttcc ctgggctctc atttctgttt tctcttcaaa aactacccac tgtcatggtt 82381 gtagcatttc cagaaacaag tagcaggagg agagagtgag gatgagagag ggagagggag 82441 agtgagaacg agagggagag ggagaggaaa aactggtata aaggtgtctt cttgacatcg 82501 tggcttcacc tagaggaagg accttatccc acagttttga ttcctggagg ctcccattct 82561 gggccactat tgcttctgag ccacctttgc ttttactacc tccactgtgg aaccaaagaa 82621 ggagtaaata aaaacccaat ggatttcccc cactctccga attttagttc tcttttacat 82681 tccttgagcc aaaactccaa gacttctcct gcatctctgt ctgctaaaca atgctcattt 82741 tggattgcac accatgttga attcaggctc aggatactga atgggggaaa agtaatgttg 82801 aatttgcaat tgtttagttg gtacttcaaa ctctggtgtc ttctctgatc tacctactta 82861 gattttcctt tcaattttca tatagttgtc catgcattct ctcagagttt tatatttgag 82921 ttagtgagag ataaaaggtc tgagaattgg acccaactat atatgttttg taaatggcat 82981 gtagttggat cctacttttt aatctaatct gaccatctcg gtctttaatt ggaatgttta 83041 gtcgattcac atttaaagtg attgtcaata tacttagatt aaaatctacc atctttctgt 83101 tttcttttca ttatatttgt tcttgatgtt ttctatttac tgtatttggc tttcgtttcc 83161 tttttccctc tttttttctg cctttcatta ttttatttgt tcattttgtg tgatcccatt 83221 ttgtctcatg tatttattta ttatttagat atcttataaa gaaacttttt tagtcatttc 83281 cattaagttt gtaatgtaca tttttatata atttggttca cctttaaata atactatact 83341 gcttcatgtg tagtacaaga ctatccccaa ttccttcctt ccattctttg tttcattgtt 83401 gtattccatt atttttttta caaatgcaat aaacatataa tacattggta ctattattgc 83461 tttaaaagat cagttatatt ttagaacaat taagaatatg atagttatat ttttcttcat 83521 ttattccttt tgcaacactc ttcctttaat tatgtacacc caattttctg acctatatca 83581 taaccctctt tctgaagaat ttcctttaat atttagactg taagacagtc tgctgatgtt 83641 gagtttcctc agtttttgtt tgtctgaaaa agtattttat cttaattttt gaaggatatt 83701 tttgcagaac ttggaattct ggattggcat ttttttctta caatacttta aaggtttcat 83761 ttcactgtct ttttgcttat gtggtttctg acaagaagtc tggtataatt cttctctttg 83821 ttcttctgtg ggtaagatgt tccccaccta ccgccttctt cctggcaatc ttcaagattt 83881 tatttctgtc ttgattttct gcaatttgaa tttgatattg tctgggtgtg gtttttggtt 83941 tggttttggt atttaccctg gttggtgttc tctgagcttc ttgaatctgt ggtttggtgt 84001 cagtcattaa tttggaaaat tttggccatt atgtcttcaa atatttcttc tgccttattc 84061 tttcttcccc ttcttgtgtt ctaaatacgt atatgttata ctgtttgata ttgtcccaca 84121 gttcttggat gttttgttct ggtttctttt tttgctcttt ttttctcttt gcatttcagt 84181 ttgtgtgatc tctattgacc tatcttcaag ttcactgatg cctcagctgt gcggagtcta 84241 ctgatgagca catcaaagat agtcttcaga tctgttactg tgtttttttc tctcccattt 84301 atatttgatt ctttcttata cttttcacct ccctgccgaa attacctatc tgatcttgca 84361 tgataaactt ttccaataga gcctttaaca tactaatcat agctatttta agtttcctat 84421 ctgataaaga tggaaacagt agacaaaggg gactccaaaa gggggaacag tgggagggag 84481 caagagttga aaaactacct agtgagtact atgttcgcta tttgggtgat gggttcacta 84541 gaagcccaaa ccccaccatt acacaatata cccaggtaac aaacctgaac atataacccc 84601 tgaatctata atgtaaaaaa agaaagtaaa ataatccaaa taaataaaca aattccttgt 84661 ctgatatttc caacatctgt gtcataactt actctggtcc taatgattgc ttttctcttc 84721 agacattgtt tttattcctt gctttgttcg ccctttgtgc tacaccctgg agataatata 84781 tctcttaaat ctcttataga tgtattccac tgttattttt actgatgctt tttagcctgg 84841 tagtggggga atggggacct gggacattct atgctatgct gattaagtct cagtcttagg 84901 caggcttacc ctgggtgttg gaggtatgac cttcacaagt gttcctgccc ctcctcctat 84961 gttctaaccc aagtgggctt agcacatttt cctgcccctc ctgggttagg actttttttt 85021 ttttcaattt tctcccccag ctgtagtgag ttttcaccag tgccctcagt ttctgttgcc 85081 ctttcttctg aggactaaga cttttttccc tttgggaaga taaggtagat tggtatgggc 85141 agagtttcag tgatgttggc cattctgctc tgccagctgc agtgagtttc tcccagtact 85201 ttcaagctgg tttttgttgc cttttccctg aatagtaagg cttttgctcc atagagaata 85261 tagggaaact agtctgtgca gaatttcagc aacagctatt atactgttct gcccgagcag 85321 aactacagga tatctttctc aggattcttc tcagtctttc ttgtcagcac ctggggcagt 85381 tcctggagga aaactctgca aaggggtgca aaattgccta tgtctgcagc ccctacaggc 85441 ttcacattct cacactagcc cataaccagc ctttagcaat tcattataat ttttagctga 85501 atcctcttat aaagtttgta tgttgttcag tggtatctgc accaggtaag caaatgcctg 85561 ttcctgtggc tcccttgaag atggttgttt ctacaggttt gtggtactgc aacctcagtt 85621 ttctgatgag tctgagagaa gttcttaatt ttcactttcc cggttttttc gttactgtaa 85681 tggtgggaga catgttcttt acaactctac atttctggcc taataccaga agagcctatg 85741 aatctgagga taagcttttg agggtctgtt tatttttcag catccatacc ccagtttacc 85801 aattccccca caatggtgat catggtgggt ttaccagctc gaggcaagac ctatatctcc 85861 acaaagctca cacgatatct caactggata ggaacaccaa ctaaaggtat gtctctgaaa 85921 gcctttgttc aacaacccca taaaagctac tggctgaaaa ccttacttag gtagaaggcc 85981 aaagtcctca ataccagaaa tgccccatca ttattattac cagcattagg actactgcta 86041 ccactattac tacattctta tacattttat aacttcttga gcacatactt tgtgcccaac 86101 accactctag gtgctaagca taggaaaact agtattaaga ttaaatgtct tctctcaaga 86161 agttccatga cctcaaaaca atcctgtgag taacatatta tttttattgc cattactatt 86221 attgccatac caatattact acttttatta gcattaccac ttcttcacca tttcatagct 86281 gaggaaatta aggcccaagg agtgaagtgc cttgtctaaa ggcatatatc cagttacaga 86341 accaagatat gaacccaggt ctatttgatt ctaagcctga tattcactgt gccaagtcta 86401 atgcatgatt ctactgattg gaagagaaaa gaatcccatg ttagccctat agtctccttt 86461 tcttcattct ttttgtctcc ttcttttcct tgttaatctc aacatcctgt ctctctgccc 86521 cacttcactc tatctctccc cctcttcctt cctctcccca atttttgttt ctttttccca 86581 tttcttcttt ctgttcatct atgttctctt tatctaattc actggtgagg agtttcccga 86641 tggtttacca tgtgcccaaa ctcctgctag gcagtgggga tacaaaactg gttgagactc 86701 acaccctgtc cttgaggagt tctgagtcta gctgggatgt ataagaatac ctcatatcat 86761 ctaggaattc tagagcagga gcagggcgct gtatgaaatc acaaagggat atgccaactg 86821 gaaggcaagg aaagcttcct acataggata tgtttgagct gagtcttgtg ggaggaataa 86881 gacattggca gaagggaagg agagaaggag cagctctcat ggaggactgc aagctctcag 86941 gaaaatctgc tgagtgtaga ggcacatggg acactggcag acgggtttct gccttaaggg 87001 ccccatcatg ggttctgatt cccctgcttt ctatcttctt cactcaattc tctcaaaggc 87061 ctccagagtc atatttcaaa gcacaaagct gaacatgcct gtggcctcca aaggtcccct 87121 tggcctgcag gtttaaatcc aagctcctta ccctgttgtt caaggccctc cacaattggc 87181 cccttccagc atctcaggcc tcagctatag tggatgactc accatcctcc aaacatgctc 87241 ttccttctgc caatgccata ctcttgctca agctgttcct ctacctggaa tgcctttgtt 87301 cctgactccc gctgcccata tctgcctgta aaaggtgcct accctaaaaa gcctggttca 87361 agtggtgaaa ccttctcaga tcattccaag tacaattagt ctttccttct ctctcactgt 87421 cttccgtggc ttaatattgt ctttgtcaga gcattgaaca aactctgtgg gagctctcca 87481 tttactggtt tactcactgt cttacctgtt ggactgtgag tgccttaaag gccccatcat 87541 gaagatcggg ctaatccctg agacccagag ctcatatcct gattcacaga gtaggagtca 87601 aaaagtattt gttagatgag tgaatgaaat attaactctt tggctgctta tcttctttct 87661 cctcctttcc ttccctcctc tctagttttt catcattgca agtgggaaag tatggcctag 87721 agatttagat ttagaatctt cctcataata tagaaaaata aaataacttt ggactagaag 87781 tcaggaggtc cagtctatat ctgtcaccag ctcactcact gtttgaccct tggtgattaa 87841 ttttactccc tagatctcca tttcttcatc tttccactgg agtagggcat tttaaggatt 87901 ggcatttatg ataaagtaga ataatggaaa gaccaccagc tatggagcta aatagacctg 87961 ggtttgatct cagatatcca cagataggaa ataacttaca ctgtgtaagt tatttcctat 88021 ctctgaactt tcagtttcct catttataaa atggtaatag tgatctctac tttgcagggc 88081 tgttgtgagg aataaataaa ctggtagcca taatttgaag ttttaaaagc acattttgta 88141 gctacaaatt ataccttctt ctacctctga ggtccctccc agcaaggaca tttttttggc 88201 cactgggatc ttcccaacct cttcactggc actccctgct ttcagtgttt aatttaggcc 88261 agtatcgacg agaggcagtg agctacaaga actatgaatt ctttcttcca gacaacatgg 88321 aagccctgca aatcaggaag taagtaccca atattttagg aacctgtgct gtctcatggc 88381 tgaaaggccc ttgttcagat attgggggag tagggcagaa taatctggac cagaaaggag 88441 gccctctaga ctctcaaaca gtagttggaa cagcccttcc attttcctgc ccaaatcctg 88501 tttctggtca agaataatct ggaccagaaa ggaggccctc tagactctca aacagtagtt 88561 ggaacagccc ttccattttc ctgcccaaat cctgtttctg gtcataacca tccacttatc 88621 aagaaccttt aatggtcttc tacttctgac aggagagagt agaaacttct cagcttagca 88681 tttgaggcat gaatctcaat accttggcct actctagttt ctgtagtctc atcagttcaa 88741 cctcaaggtc tctctgcact acaagactct gagtacttgc agaatagagc ctgtgcctta 88801 tgccttcatt atagaacctg gcagtgagta atcactaaag agatttttgc tgaaagcata 88861 aatacattgc cactgtcatg cctgtgtgtt acccatgcca ggaatgcctt ttacttcaca 88921 gaccacatgc tctacttggg tccataaaac agttcagctt gagttccatt agctcaaggg 88981 ttggagccct taatattatg gcttctgtca ttgctcagct tttcctgatt gtgtcacttt 89041 atacctgccc gtaaaccgtg taagctgtgc acacatagcc actttcaact gtttgacccc 89101 tctgtgctta gcacggtgcc taaagcgtag ggcaccctgt gtagatgttc aatacatgat 89161 aagttagtct aagacatgag agatggtaca gaagatactg cagacaatga gagagaaatc 89221 catcctgggg gataagggtt gcctcttttt ctttcctggt ctatttcagg cagtgcgccc 89281 tggcagccct gaaggatgtt cacaactatc tcagccatga ggaaggtcat gttgcggtaa 89341 agaaattttt tattttttcc ttgggtaact actgggaagg gaattagcta atatgtactg 89401 aacatttacc aatgtgtaca gaacattatg ttctgagtat ttgcatatat taataattta 89461 attttatgct ccccccttga gataggtgtt gttattatgc cccaattcag agcccagaag 89521 aatggttcac agagacatta aataatttcc caaggtggta caacgtatac ctggcagggc 89581 cagaagtgga atccaggttg catagcttca gaatccccgc tcttaaccac tatacactac 89641 cagtctaccc ccaataattc agcaggacta tattagaatt taaaactgca gtacaaacag 89701 ttttgagtta tagagactta cctaaatatc tagtcaatat tttggctagg atttgcaaat 89761 tataaaaggg gaggagattc tcaaatatgc aaatggtcca atctcaggtg ctgaggttag 89821 ggaattgttt cctgttttag aggacttcta aagtgataag gaaggaagaa gcttcagcta 89881 ggaattgaga gaaccagttc tagctctgct ctgtcagcaa ctctcggtga ctttggtcaa 89941 gccaacttct tctaggcctc agttacctca cctataaaat gaccctaata aaaacccaca 90001 atttcctctt gccttattac actcacaggg tagcatacat ttctggacag accataaggg 90061 cccaatatgt gcttgtgaaa ttgtcataag ccaatcatga atccttccta tccttcaaga 90121 ccttgttcaa gttctacctc catatggaag ccctccactt agcacaggga tgggcaaaga 90181 gtaggtgttc tgttagcacc agattcagag taagccaatc tcattacctg cctctaggtt 90241 tttgatgcca ccaacactac cagagaacga cggtcactga tcctgcagtt tgcaaaagaa 90301 catggttaca aggtatggta acactccaac tcttatctag agcctatttc tccagcacaa 90361 accccaaaga aaggcaaacc aggggactca gagcggccaa atccgcaacc ataagctcaa 90421 cattttaatt tggttctcta actttccagg atatagtcag aagcagcaaa gcagaacaag 90481 aaaggagggg tgccacatta ggcaatggaa gttgactgca gacaggctgg ctatggggaa 90541 agagtggaga gagtcactac aaaaaactta tataagacct aaacccagta aagatataaa 90601 acaacagtct ggggtcattc agtttagagc caatttgcct catgaaactc agaagcctgg 90661 aagtaccact gtatgagagc attatgttgt cagagatgcc cctggagatg acttagagcc 90721 attgaaaagg agtaaagcta tgttcagctc ctaccaaaca taatttcctg ctggaaatct 90781 cttttctgtt tcttttatag gtgtttttca ttgagtccat ttgtaatgac cctggcataa 90841 ttgcagaaaa catcagggta aggaccacca gttacttttc cactttgctc tgccttaggc 90901 ttagatcatg cctaaggcag aggtgagatt atgcctaagg caactccatg agattaaaga 90961 aagtattcct ccctgatctt ctctgagttt tcataggaag cattccctaa ccatcctggc 91021 ttacaatgac tctattaatt ttaatgaagc caacatatat agtgcccttt gtatgctcca 91081 gatatcaaac ttgactgtca ttaattttca caacaaaact gcatattaga gattatttcc 91141 atgttagaga gaggaaactg aggttccaaa agcataaatg aattgtcaaa aattcataca 91201 aagggatgga agtgggactg gacctcaggc ttggctgtct gcctcaaaag cctacatttt 91261 ttattctgcc ccaaaattca taccagaact atcactgctc ttagattcta gacatgtggt 91321 ctagaactag gtcatttacc acattttgtt gttccctact tcatcaaatt actggagctc 91381 aagaacagaa gagaccttaa gggtaatttt cttctaccct ccatcagata cttgaaatct 91441 tacctaaatg tatatctata ttttcttttt atacctgcaa tgacaagggg cccattgctt 91501 tcaacaaatg cggtccagtc aagctggaat ctgcctccct gtaattttta ttaataggtc 91561 atttttttaa ctttttggga catacagcat gtccatttcc tcccttcctt cttcatcctc 91621 ccatctatct atctatctat tgattcaata gaaattgact atatacctgg tcttggccat 91681 tctgggcatt agagacatat atatggatag tggagagtcc ttcccttgag cgcacagtct 91741 agtaaggaag actggcaagt atatatacag aactctgaga actgaggtga gatgagggaa 91801 cagaggaact ttgggaagac agtgctgagg gcctaacttt gcatgggggc agaaatcagg 91861 gaagccttcc taatggaggt aaaattaaaa tgaatggtac cttgaagaac acataagaat 91921 ttggtgggtg aagaatggaa aataaggata taagcatata ctaagtatgg catctccaga 91981 gaactataag tagcttaaat tggctaaagc caaatgtgtg gaggaataag ggagccatgg 92041 gagataagga tgaagaaata atttggggcc caatcatgac aagccatgag ccacctgcta 92101 aaataattca cacttttttc taaaaggtat gacaaaatgt agaagggctt taggcaggag 92161 aatactttgg ccagatgttc atattagaaa tacccttctg gctgccagta tggtgaatac 92221 attagaacta aaaagaatat agacaaagta attgggcaca aagctatagt gagcacaagc 92281 aagaaagaat ggatatttag aggagatgat agagtggaga ggatttggta cctgctaaga 92341 ggtgagggat gatagaaaga cagacatcta ggatgactcc caggttccat ttaggatcat 92401 atgactgggt aggtgatagt gccagtacct gagatgagga ctgtgggagg agacgtgggt 92461 attggtggat taggagaggc cagtcttcag aaatccaaag acagcacatc ataatctccc 92521 ggtaaacact tgatacatgt aagtcataat gacccctggc caagatctct agttcctctt 92581 cttcagccat ttctcatatc acaatttctg gtcctcttat aaatgttctt cttttataag 92641 gagtgtattc tctgggactg agtataatgt aacaggcctt gtttgagatt aggaatgagg 92701 agtcacacaa ccagcttctt ttcaattggc tgatttttgt aaacttgggc aagatgtgtt 92761 ctccaggtct caatttcctc atctttataa tgaggcatct tgtactcaat ggtctctaag 92821 ggaacttcca aagaaaacca tctgtctccc aatgtctccc ttgatacctc tgttctccct 92881 gctcccagca agtgaaactt ggcagccctg attatataga ctgtgaccgg gaaaaggttc 92941 tggaagactt tctaaagaga attgagtgct atgaggtcaa ctaccaaccc ttggatgagg 93001 aactggacag gtaagagcca gcaccccaaa gtcaaggtct agatctttca cccagttttg 93061 atatcctgta gaacctagag taaatatcct cccttgctgg gcctcaactt ctctatctgt 93121 aatatcagga aactgaactt gatgatcttt cttcaaggat cctttcagct ctgattcctt 93181 gatgtgtaag gtgttcctaa gatgactaaa atagtgcttt cctgagcaat atgaacagtt 93241 caggaccctg ggaacaaatg cccccctcct gtgggtgttg cgattattat tacaaaagtg 93301 ttcttcacac gtgaatgaca cttcagtttc caaacatgta taccttcttt cttacattta 93361 atcctctagg actcctgtaa aggaggctga gataagtctt atggtcataa aatttcaaga 93421 aaaagaaatt gagtcttaga gagttgaaat tactttccca agctcccacc acaaggggta 93481 gagctaggat tctaattaca tgttctgtct cctagtcatg tctcttttca caacactccc 93541 atgtctcatc aatcaacaag tatttggacc ctttgggctg tggtatatgt agggggtata 93601 aataggggca aaagaggtgg gttctatcct cgggtagctt ctatagtgct ttgacatatg 93661 agaagaccca taaaaaaaaa caagaaatgt tctgtcatca taccaactaa agataggaga 93721 gagcagatgt gagttttaga tatagaatgt tcccagaacc ctactctctt tgaaaatgta 93781 ttatgatgag aagagacccc aagtgagaag tataaagagg cttcaaatgg tcatcaccac 93841 tctaagaaat ctaaagagtg gccttaccca actgaagtac caagaaatga gcactgttac 93901 tgccccactc tttagatgga ggaactgaga cccagatcaa gtcaaagtct taccttgggc 93961 tacacaatga atgagtcagt agcacaatat atactcgaac ccatatccca gttgtgcaga 94021 ttgttcattg tacaggcacc ctggtcaagg gtacactcgc ttctctcacc aagatgtgtg 94081 ctgaagtagg gctgtgtgac cctgaggaaa gtacggcttt ttaaaattat cacaaaggca 94141 caatatgggc taacagcata tggtgaaagg agcctgaagg ccggggtgaa ggatgtaagt 94201 gaaagaacgg cattgacaaa gccccagccc tgggcctaga ccctgcagtt gctaggttgc 94261 tgtttgatga attagtttgg gccagaagaa gacagatcta gaaggtcagc tttatgtcat 94321 catttctcca tcaacagaca tttacagtct atgcatcagg tcccagccct gaacgtggaa 94381 tcaaaagagg aaagggctgt ccctgatttg acagaatgtc cagttagttg gagaagacaa 94441 tgagtacaca gttggtcaca actctggtga cagtaaacca actgggctgt aggaacatag 94501 gacccagtcc agcttagagg aagaaggtct aaacgagctg gattacccaa taggcagact 94561 acactgaggt taccagcaag ataagggcta ccaaactaag aaaaaaaact atcttaaaaa 94621 cttttattta tttatttatt tttaattttt gagacagagt cccgctctgt cacccaggct 94681 ggggtgcagt tgcgtgatct cagctcactg caacctctgc ctcctgggtt caagcaattc 94741 tcctgtctca gcctcctgag tagctgggac tacaggtgcc cgccaccgtg cctggctaat 94801 ttttgtattt ttattagaga tagggtttcc catgttggcc aggctgttct cgaactcctg 94861 acctcaaatg atctgcccgc cttggcctcc caaagtgctg ggattacagg catgagccac 94921 cgcacccagc tggcttaaag gcttttataa tattgactat ttcaaaaaca gcatcaaggt 94981 agtcaaagtg gaaaaatata aaaccagaac ttgggaggac ctccccatat tctgtagttt 95041 ataattttta ttttaaataa aaatttaaaa attttttaaa aaatttttaa ataaaattaa 95101 aagtttggat gagggtaggc attagctact ctgctatttt tggtgtggaa cacctctaaa 95161 tgacttcaca tgatcctggt ctcaaaaaag agagtgaata tgccggctct tgaggtataa 95221 atgcatagca gctcaacttt ggaaatcggg caaagagaat agcactttgt aagtgcctag 95281 ttaaatctga acaagagtag agaaatacag aaaatattcc aagagtaata gcaaaggtat 95341 gaagacaaga atgaaacaga gaagatatgg gtatgggcat gggaccataa agtgaggcca 95401 actgggctca gtgtgagaag gcaaaaccag aagatcagac taagctgagg aatttggaag 95461 ctcctagagc ctcaaaggac ctagtgttag gaacagacag tagccaggca acaagtttac 95521 atcagatatt ttctatccct aggctctgag ggttctagaa gaaatgattg gagaggggag 95581 cattctgggg ttcttcttcc cgagaaggtg gccttggaga aatcactttc tctctctatg 95641 cctcagtttc ttcatctttg aaacagaggt gataatctct gctctgtagc aatattatga 95701 ggattcatgc gcttaaaagt tcccactccc tgccagatgc acagaaagtg ttcaacaaat 95761 acaagtgccc tgtggaagcc attaggcaca gatgggcatg atgagtgtgg tgcagccata 95821 aagggcttct ctaggaagaa ctagggagag gctttctgaa accaaaccta tcagcaacct 95881 tacctgcctg gagttagtgc agtgcagctc tttccattca ctcggttttg ttaccatgca 95941 accagggctg ctggtgtcac cccattgccc cagccttcca ctcctcttcc cagggatttc 96001 tctgaactct gtacaggcca acaactaaag gatttaatgc ctggtacagc gtaggcctca 96061 gaagtcttgc tccatgtttc tagtcttctg attagtcagt gcctatactc cacccttcag 96121 cttctacaca aacaggtggt tgaacagttt aaatatcatt cagtatatag gtattagttc 96181 atgacagagc acagaattga ggatacagta aggaacttgg gacaaggacc acaagttccc 96241 caataatgtt gtggaccacc tctgtatccc catgccccac tgcttggcac accttagagg 96301 gagtgactat ttatggaatg aatgttgagc actgatcatg tgtttccaag tatttgttcc 96361 tgactggcca gagagctccc aaagcacaga cagtgggtct ccatgccaag cacaatgcct 96421 agaacagagg agtatgtttg gagtatgttt gttgactgca tagcaaatag gaaaataacc 96481 caggccaacc ccacttctga gtgaggagag aaacacacac atgcacagac acacatatgc 96541 acacagaggc aatctggtag ctgtgtgtag agtataaaag acaatatgta gcagaggcta 96601 aaaaaaaaaa aaagaggagt gaagagctca actcaggatc agaaagggta gaggagtgaa 96661 caagtcagga gctgccagta agacattgga gctagccagg gatgcaaagg atactgcttc 96721 tctctgacta tgttaggggt gtgaaccaca gaagagagaa cccagctgcc tgtaacagga 96781 tgagtgaagg tgaaaaggca gtttggctca agttgaggct gaaggaagga gacaggggga 96841 aatggtagca tgcaatgagc tcctttgggt ggccaataac caagtctcta ggacaggtac 96901 tcggtacagg gcaaggcatt gcatcacatt aggtgacagc tgggttgtgg aagaggtttg 96961 aaagggtgtg tggttcgggg tttctctcgg ggtcaaccct accaccatta gcccctgagc 97021 tgcccttgct cctgcttggg tctggcccag ccacctgtcc tacatcaaga tcttcgacgt 97081 gggcacacgc tacatggtga accgagtgca ggatcacatc cagagccgca cagtctacta 97141 cctcatgaat atccatgtca cacctcgctc catctacctt tgccgacatg gcgagagtga 97201 actcaacatc agaggccgca tcggaggtga ctctggcctc tcagttcgcg gcaagcaggt 97261 agggtgggcc acacacaccc agatgggttg gctgggctgt cctaggtggg ctgcaggtgt 97321 agtggtggcc acattctggg tcctgaggtg gaatgaccca gggatggggg agaagcagat 97381 tccatcactc ctgctcttcc acccatcagc tgctcagtct ggacccagcc gccgaatggc 97441 tcccaatgcc acctgttctg caggtctaag agcttggccc tggcacccaa ggcctgccca 97501 aagccagctg ctccatgaca gtgactccct tggccctgcc tgactggttc cctcatcttt 97561 ctcacgagtt tgtgcctttg ctagtgcagc ttccccacct ggtctaccct cttctgtctc 97621 ctctgtcagt gggtgactgt cagtggcaca tacagctgcc ctgagagacc cacccttgaa 97681 tcactcttct tcccaattct gccctcccga ccttatccca acgcaggtgt cacactggca 97741 gcccctgtcc ccactcatga cagtgtcctt cctgccctcc ccactccctt ccaaggccca 97801 gcacctgctc caggaagccc tccccatcct ctcctccctc ccaggcaggt tgccatgccc 97861 tgggctccag ctctcccagt tctggcctat gtgtggctct cctggcctga gtatcttctc 97921 cccagcccgg cacagtgacc tactggctgg ttgctggctg aatacaggga tccatttcca 97981 cagcatttgg ccctgtgcct gggcttatta gcacaggtat tgttcctgct ggcaggaccg 98041 cttggggatg gaacaggtgg gaaatgaagt gcaggaagga cagttgttgt cataattgtc 98101 atttattgga cacctatgca cagtcacggt gccatctgct tccgcatgtg tggctcattt 98161 catcttcacc acaggtctgt ggggtgggca ttcctgccac aagtttatgg atgaagaagc 98221 agagagcagc ccagggcagg cggccaggat gagacagagc caggactgaa agccgggcta 98281 ctgcctgccc tcagctcccc cactgagatg gggggagtcc cagcaaggca ctatcctgga 98341 tggccacacc ctggtgtctc cctttctgtt acctccagac tgacagcctc ccctcgaggt 98401 agacgctgac ctgtgactcc ctcatgtccc actcctcaga gcagagtcag aggctgctcg 98461 aggtgtgacc ccagcccatg gcagcccctg cagccagggc cctcttgggt ggagccagcc 98521 cggccctgcc cacacagctc ggagcccagc tggacatgtg gctgtctcca cagggcacgc 98581 ctccctgcac acaaacctgt gtgtgtgcat gcacacagct ggcgccacct tgacgcttct 98641 cacagagcac caggcccagt ttacaaagca cctccccgtc gaggtagctc tcacttgcgc 98701 agcacccact gtgtaccagg ctgtggtctg ggtgctcccc atccactatc taaagtgcag 98761 cccccagaca gcaggccaga tgagacaatg ctctttggat atgaggccag agaggggcag 98821 ggatgtgtcc gagggcacaa gggaccatgt aagggagctg agggagttta acctatggct 98881 catacctccc tgtcaagggc tccttccacc agagcacgct gactcctgac tcaggtccat 98941 aatgaaatgt atgtatgaaa cgcctatagc tgcatataga aatgtgtgcc tggaactttg 99001 cacctgtgca gatgttccta acacatgccg tcagcagtta acgacacacc tatacacaga 99061 ggaggaaatc actagctcct ctgggctccc acagcccttt atggttcccc tcggtgctgc 99121 cagtgtttcc ctcactggac tggccatgtc ttgagaacaa gaacacccca gcacccagca 99181 cagggccacg aacactcaat acatgtgtga atgcccttag aacctgggta ccaggaaccc 99241 tgcttaagca ataagtgttc acagtgtcaa aaagtaatgt ctcccaaatc ccacaacccc 99301 tacttccagt ccatactctg ggtaacccca aacctagaat tctttgccct ttttgtctag 99361 agtccgcttt cagtgcctac atgggcctat cagtgcctac atggcctgtc cacttgaggt 99421 caggccatat cacacgacca aggggttgct gtttgtggag catggggatc agtggaggtg 99481 gaaatgtgga cagaacttgg tcatgtgact tggttatcca caagcatgta ctaaaggcac 99541 gtagaagtgc aggatagagc cagaaatgga gagatagact gtgcatggag tgcctccatt 99601 ggtactcttg tcccaggctc cacaaatgtt agaggtgggc ctgtttgaat gaatgaataa 99661 ccaagtaggg gcttcggtga gcaatcgtca aataagcgct ctctctctct ctctctctct 99721 ctctctctct gtcatgctct tttctattcc cttggcccac catcacattg gaaagagtat 99781 ggctttttat aagtttggaa catttaaact ctttgctctg cttgttcatc accactcaat 99841 ttgtggcaga tctgagcttg ttctgatggc caccctgctc tctctcacct ctccttttcc 99901 ccttctctcc tcctctgcat ccccacatcc tagtatgcct atgccctggc caacttcatt 99961 cagtcccagg gcatcagctc cctgaaggtg tggaccagtc acatgaagag gaccatccag 100021 acagctgagg ccctgggtgt cccctatgag cagtggaagg ccctgaatga gattgatgcg 100081 gtgagatgca tggggtgaat ctcctgtgtg tgggggtgac ttgagtctgg gattatcccc 100141 tgggttgcct ctgctccagc ccctatgggg agcctaatta attcaacaaa cttttttgac 100201 aactagccat gtgctagacc ttattctatg tcttggggat ccagtgaagg ccagaataga 100261 caaagtctct gccctcatcc taaagacaga cagtaaatag gacattacat ctcagtgtga 100321 caaagggcta tgagaggaga agtagaaggg attatggagg atcagagggt aagggtgctc 100381 ctgactcaac ttggtgctgg ggaatttaca aggaaagctt cctggaggaa gctagaggtg 100441 atataagaag gagatgaaat ggtctgggga gtggggaaat aaatttaatt cagagacaga 100501 aaataactca tgtgaagacc cagagatgag aagagagcat ggcaatttca ggggattgaa 100561 ggaaagaata gcacagaggc tgggacagca atagtgagat ctgaagaaaa gggagatgag 100621 gctagaaagg aagccaggcc accaggcttt atagttcatg ttgagggatt caaatgctgc 100681 ccaaacaaca atgagatgtc attgatgggg tccacatcaa gtaggagtaa tctgaatgtg 100741 ttgcacattt ttcaaaaaag cccactctgg gcatgtgtgg aaacttggtg ggaggaaggc 100801 cagaatgaac aagtagacca gtaagaagtt gacttcatta gtctggcaag aggtgggggc 100861 atgttgaacc agagtagtgg catcggagat ggaatgaaga aggactttgg gaatgtttca 100921 aaggtggaat caggccaggt gcagtggctc atacctataa tcccagcact ttgggagggc 100981 taggtgggag gatcacttga gcccaggagt tggagaccag cctgagcaac atagtaaaac 101041 cccatctcta caaacaattt aaaaattagc tgggtgcagt ggtgcatgcc tgtgacccca 101101 gctactctgg aggctgagat aggaggattg tttaagcctg ggaggttgag gctgcagtga 101161 gctgtgatca tgccactgca ctccagcctg agtgacaatc aaggccctgt ctcaaaaaaa 101221 aaaaaaaatt aaggtggaat cagcagggct tgctggtgca ctggctgtaa gggatgaggg 101281 agagggagac atgaaggacc caaccctggt ggtatagcag tctctcttca gggtcaggtt 101341 cttccatgag ctctttgaga aaggagctgt ttcctctcca ttgttcacct gcactgtagg 101401 tacaatggca ctgggcagca gtgcacaggt tatgattgtt aggcgtcaac ataaaagctt 101461 caccacttgt gtaaataact tgcgtgtggg taggttaact tacttgcctg ctagatatgc 101521 ctctatagaa acagacatat atgtgtgtgt acattcatac atatgtgtat ataaatatag 101581 aggtacagaa atacacatag ttagaaaagg gctatacagc attgttctgt ataccacata 101641 catttgctgt aaaatatatg caagcaaata tgaaatttct tcatcataag tttggtggct 101701 tcttatttct ggcccacatg tggtataaac atttattgaa cccctccttg tttcaggttc 101761 tggagtgatg ctggaggtgg aaattgatct aattcattcc tgtcttcact gaattttcag 101821 gctaaatagg gtagacagag gaataaacag agaaatagag aacagcatgt gtgatacagg 101881 ctctgatagg taagactatg ggtgcataga ctttggggca ggacacctaa ctcagccaag 101941 atctacctgg aagaagtgat acctagctga agcctcaggt ttgagtaaca gtgaaccaag 102001 tagaaaggat caagaatgac attccaggca gagagacagg gatagggagc ttcataaaca 102061 aagagggtat tgtatgtgtg tggaaatgct aaggttgtag tagtacaaat ggagaggctg 102121 gaaaaaagga atccatttga gagagcttta ggaggtggaa tctcaggaat tggctatgag 102181 gggtgagtga ggaagaagca agaatttagg atgaagataa agcaagaatt tacgtttctg 102241 gtttattttc catgagtaat aaaatattaa tacttcattc taatgattga gtctttaaaa 102301 tgtcccttgc gcattctgta ctagtatgtt ttacttacat tttcccaata ttcaaaacac 102361 ttctgtgaag tagttctaat tctccacatt ttgcagataa ggaaagtaaa attcggagag 102421 gttaagcaat ttgactcaat aaggagtcaa tgaccatttc cataggcgac aggctcctag 102481 tactcctcag aatagaaagt tgaagccaat cttgaacgac ctcgtgtacc cagtagggtg 102541 cactttcttc catgggcagt gtgggttgga gaagtttcac actgcagaga tgtttgtggg 102601 ttggggtagg ggttcacttc ctcacccctg atgggggcat ggtagatggg cccaatagac 102661 aacaaccaga ggccactggc cccctacagg gagatttaat tggaatccac atacccagaa 102721 ggaaacctta gaaacccaac atcactttcc accaccagca gcagtcacct cacctctggg 102781 agtttctcag tagacttgga atctctcata agctccttag gaactcattc tctgagtact 102841 caattccttt tttgagtgag gagagtgtat ctcccccaaa tccactctga aaccattagg 102901 aatttgttaa tttttcctgt ctgagatgtc tatggatgcc tgtttatctg ccagagttcc 102961 caaagtagag gtcttcagac acaggcacac agatactctc acatatacac cagagacata 103021 atgagggaca tatacattaa cagacacaca tgccacagac acactgagac ctgcatggaa 103081 agaccacatg gacacattct cagatgctta gacaccctat acacacacat acacacacac 103141 acgcaagcac acaacatatg ctgagacaaa tacacaaaca caaggcataa gtctttaaac 103201 ataccacaag tacatatcat aagcatgctg cacacagact tgtaaacaca gccacaaatc 103261 cttatgccca tatatttata aaaatacata cttatatgta taatgtacac ctacagacct 103321 atctgtacaa tcatagaaaa acatcataca ctcagcctga acacatctac tcacatacac 103381 tcactgtcac atctacacag ccacacacct cacacacaca cacacacaca cacgcacaca 103441 cacacacaca cacgcataca cacttatgga caacaggcac aaacccagag ctgtgttcac 103501 agagacccag gtgctcatgt gctactgttt cacaatcatc gaagaattat ggcaaagact 103561 ctgtggaatt tcatcttgtg aatactttcg cccccaaata ctttttctca gggtgtctgt 103621 gaggagatga cctatgaaga aatccaggaa cattaccctg aagaatttgc actgcgagac 103681 caagataaat atcgctaccg ctatcccaag ggagaggtaa ggtttgcggg gtggcctaat 103741 gcctaggact actccccagt gtcttcattg caactgctag gattccacaa ttgtctagac 103801 agctttagaa acagagtaag catatcttgc agcagctgcc atcccacaag gcacatgatt 103861 atctggccct tggttgtttt agattcttgg ctataggcaa tagatccatc cctttctttt 103921 aaaagctaca taattttctt gcttattctc tttctgcttc agaagtgctg actttatcct 103981 ggaccttgtc tctggtatcc cttcatgaag acactttaaa tagttcaaca tactctgatt 104041 tcctgatatt ttgttgccaa gttgcctaaa gctccagcct gtctgggcag ttggcattgt 104101 cccaagggac aatggctaat aatttaatca acttctttgc cactacatga accagaccat 104161 cagttcccta tccccatatg agatgaatgc tcattgcttt taatcaagcc tccacttaac 104221 cagttccaca ttttagagca tttattagtc atactttatt tctgatataa atggctggga 104281 aaagttatta gaaaagaagg acatctggga ggataaaaca atagatattc tattcagtta 104341 tacaacactt tcacattttt aatcttcacc cacagagatg acattataaa attcatctta 104401 atttgccctc attttccttt aggtatttag aggctatgtt attagaggta tacaaaatta 104461 aagcaattat gtcttctcag tgaattgcac attttattat tacatagtga ccaaacctct 104521 ttctcctcga taatcctttt tgcctaaaaa cgtattttat ctgatattaa tatagctgta 104581 caagattcag tgacttctgg tgactcctta cctagttcac ttttagctat tcatttactt 104641 tcaacttctc tgtgtatctt tgttttgtgt ctcttataaa cagcttatag ttgggtttcg 104701 ttgtttttaa tccaaactga aaatatatgt cttttgaatg gtaagtttgg tatatttact 104761 tttattgtaa ttactaactt ttttagttta ttactattat cttgtcttga gctctctgct 104821 tagttttttc tcttttcctg ccctctttag gattgataaa gtttttatat ccatatttaa 104881 gttatataat ttgtttccat tcatttagag attaccctta acattttaaa aatttttcat 104941 catgattctt tcaattttta gttgatatct caataaggat gtaaggtcaa aatgtatttc 105001 aaaatcactc ataataattt tttgctgagt gctattatgc aaccaacttg aatccttaac 105061 attttaatat tcattattaa caaaatctaa aatcaatatc tctaccctct acctaaacaa 105121 cagaagaact ttaaaaccct ggccattgtg tctcctaatt tacatgttaa ttgtttgtat 105181 tttaggtcaa acttggtctt aacccccaag tgaattattc ctgttgttgt tattattgta 105241 tgcagtcaat gtttatttag atttaaccac atgcttacta attcctctga tcattcttgc 105301 cccttgcacc tcaaatcttc cctctgagtt cattttccgt ctttcttttt tatttttatt 105361 tatttatttt ttattttatt tatttattta tttatttatt ttttgagacg gagtctcgct 105421 ctttcaccca ggctggagtg cagtggcccc atctcggctc actgcaagct ctacctccca 105481 ggttcacacc attctcctgc ctcagcctcc cgagtagctg ggactacggg cacctgccac 105541 cacacctggc taattttttg tatttttagt agagacgggg tatcaccgtg ttagccagga 105601 tggtctccat ctcctgacct catgatccac ccacctcggc ctcccaaagt gctaggatta 105661 caggcgtgag ccaccacgcc cagccttcca tctttcttta gaagcttttt tagtgagtaa 105721 acacaccctt ctttttgtct aaaaatgtct tggtaggagg gggtcctcaa tcctagtgat 105781 gcttggctgg gtaaaaactc taggatgaca gttgttttct ctcagcactt tgaaagtatt 105841 accctcactg tcttctggct tctactgtgg ctaggaaatc tgctgtcagt ctagttttct 105901 ttcctctctg gctactttta aggtttttgt atcattgttg ctttaaagtt tccctctaat 105961 atgtctagat gataatttct tttcatttat taatgtatag cttgggatta gtgagacttc 106021 ctgaatctga ggactgggat tcttcttcaa gtctggaaaa ctctcagaca atttatcttt 106081 aaaaattcat ctctcacatt ctattatctt cttctggaaa tacaattgga tgtatgttac 106141 tcctgcctct attcttcatg tctcctaact tttccttcct attttctatc tctttgtatc 106201 ttttgctgca tcctgagtaa tttcttcagc tttatcttct agtcactatt taagtagtat 106261 ccaatatgct ccttacccca tacactgagt tttcaatttt aatgactatt ttttatttct 106321 gaaattctat ttagttattt ttcaaatcta cattgtatat ttgatggcat cttgtttctt 106381 ccttacattt tttattccac cttttatttc tttaaaagtt tcaaacatat ttattttata 106441 ttgtggcatc ccttatctta agtttttagg gttctcttct tgtttctgct tattctcact 106501 catggtagct tatttccttg tgtgtatatg agctccagct tgtcaagaat ttatctatgg 106561 aaatctcacg cagcctgagt tgagagaatc cagagagatt tcatttgctt cttctaagca 106621 gtcctgggag ctaacgactt gggatcactt tttattaaac tctcagcata acattcttca 106681 aaccaaatgg atagtgtagc tctgaatcac cagcccacct gagggaagtc tggtggttat 106741 aaattcttac aggatttttt tattataatt taagttttag ggtacatgtg cacaatgtgc 106801 aagtttgtta catatgtata catgtgccat gctggtgtgc tgcacctatt aaatcgtcat 106861 ttacattagg tatatctcca aatgctatcc ctcccccatt cctccacccc acaacaggcc 106921 ccggggtgtg atgttcccct tcctgtgtcc aagtgttctt agtgttcaat tcccacctat 106981 gagcgagaat atgtggtgtt tggttttttg tccttgagat agtttgctga gaatgaaggt 107041 ttccagcttc atccatgtca ctacaaagga catgaactca tcatttttta tggctgcata 107101 gtatttcatg gtgtatatgt gccacatttt cttaatccag tctatcattg ttggacattt 107161 gggttggttc caagtctttg ctattgtgaa tagtgccaca gtaaacatac gtgtgcatgt 107221 gtctttatag cagcataatt tatactcctt tgggtatata cccagtaatg ggatagctgg 107281 gtcaaatggg atttctagtt ctagatccct gaggagacgc cacactgact tccacaatgg 107341 ttgaactagt ttccagtccc accaacagtg taaaagtgtt cctatttctc cagatcctct 107401 ccagcacatg gtgtttcctg attttatatt gatcaccatt ctaactggtg tgagatggta 107461 tctcattgtg gttttgattt gcatttctct gatggccagt gatgatgagc agtttttcat 107521 gtgtcttttg gctgcataga tgtcttcttt tgagaagtgt ctattcatat ccttcaccca 107581 ctttttgatg gggttgtttg tttttttctt gtaaatttgt ttcagttctt tgtacattct 107641 ggatattagc cctttgtcag atgagtagat tgcaaaaatt ttctcccatt ctgtaggttg 107701 cctgttcact ctgatggtag tttcttttgc tgtgaagaag ctcttgagtt ttaattagat 107761 cccatttgtc tgttttggct tttgttgcca ttgcttttgg tgttttagac atgaagtcct 107821 tgcccatgcc tatgtcctga atggtaatgc ctaggttttc ttctgggttt ttatggtttt 107881 aggtctaacg tttaagtctt tcatccatct tgaattaatt tttgaataag gtgtaaggaa 107941 ggggtccagt ttcagctttc tacatgtggc tagccagttt tcccagcacc gtttattaaa 108001 tagggaagcc tttccccatt gcttgttttt ctcaggtttg tcaaagatca gatagttgta 108061 gatatgcagc attatttctg aaggctctgt tctgttccat tggtctatat ctctgttttg 108121 gtaccagcac catgctgttt tggttactgt aggcttgtag tatagtttga agtcaggtag 108181 catgatgccc ccagctttgt tcttttgact taggattgac ttggcgatgc gggccctttt 108241 ttggttccat acgaacttta aagtagtttt ttccaattct gtgaagaaag gcattggtag 108301 cttgatgggg atggcattga atctataaat taccttgggc agtatggcca tcttcacgat 108361 attgattctt cctacccatg agcatggaat gttcttccat ttgtttgtat cctcttttat 108421 ttcattgagc agtggtttgt agttctcctt gaagaggtcc ttcacatccc ttgtaagttg 108481 gattccttgg tattttattc tctttgaagc aattgtgaat gggagttcac tcaggatttg 108541 gctctctgtt tgtctgtttt tggtgtaaag aatgcttgtg atttttgcac attgattttg 108601 tatcctgaga ctttgctgaa gttgcttatc agcttaagga gattttgcgc tgagacgatg 108661 gggttttcta aatatacaat catgtcttct gcaaacaggg acaatttgac ttcctctttt 108721 cctaattgaa tgccctttat ttctctctcc tgcctgactg ccctggccag aacttccaac 108781 actatgttga ataggagtgg tgagagaggg catccctgtc ttgtgccagt tttcaaaggg 108841 aatgcttcca gtttttgccc attcagtatg atactggctg tgggtttgtc ataaatagct 108901 cttattattt tgagatacgt cccatcaata cctaatttat tgagagtttt tagcatgaag 108961 ggctgctgaa ttttgtcaaa ggccttttct gcgtctattg agataatcat gtggtttttg 109021 tcattggttc tgtttatatg ctggattatg tttattgatt tgcatatgtt gaaccagcct 109081 tgcatcccag ggatgaagcc cacttgatca tagtggataa gctttttgat gtgttgctgg 109141 attcggtttg ccagtatttt attgagcatt tttgcatcga tgttcatcag ggatattggt 109201 ctaaaattct cctttttttg tgtctctgcc tggctttggt atcaggatga tgctggcctc 109261 ataaaatgag ttaagaagga ttccctcttt ttctattgat tggaatagtt tcagacggaa 109321 tggtaccagt tcctccttgt acatctggta gaattcggct gtgaatccat ctggtcctgg 109381 actatttttg gttggtaagc tattaattat tgcctcaatt tcagagcctg ttattggtct 109441 attcagagat tcaacctctt tctggtttag tcttgggagg gtgtatgtgt ccaggaattt 109501 atccatttct tctagatttt ctagtttatt tgcgtagagg tgtttatatt attctctgat 109561 ggtagtttgt gtatctgtgg gatcggtggt gatattccct ttatcatttt ttattgcatc 109621 tatttgattc ttatctcttt tcttctttat tagtcttgct actggtctat caattttgtt 109681 gatcttttca aaaaaacagc tcctggattc attgattttt tgaagggttt tttgtgtctc 109741 tatctccttc agttctgctc tgatcttagt tatttcttgc cttctgctag cttttgaatg 109801 tgtttgctct tgcttctcta gttctttcag ttgtgatgtt aggtgtcaat tttagatctt 109861 tcctgctttc tgttgtgggc atttagtgct ataaatttcc ctgtacacac tgctttaaat 109921 gtgtcccaga gattctggta tgttgtgtct ttgttctcgt tggtttcaaa gaacatcttt 109981 atttctgcct tcatttcgtt atgtacccag tagtcattca ggagcaggtt gttcagtttc 110041 catgtagctg agcggttttg catgagtttc ttaatcctga gttcttctag tttgattgca 110101 ctgtggtctg agagacagtt tgttataatt tctgttcttt tacatttgct gaggagtgct 110161 ttacttccaa ccaggtggtc aattttggaa taagtgcggt gtggtgctga gaagaatgtg 110221 tattctattg atttggggtg gagagttctg tagatgtcta ttaggtccgc ttggtgcaga 110281 gctgagttca attcctggat atccttgtta actttctgtc tcattgatct gtctaatgtt 110341 gacagtgggt gttaaagtct ctcattatta ttgtgtggga gtctaagtct ctttgtaggt 110401 cactcaggac ttgctttatg aatctgggtg ctcctgtatt gggtgcatat atatttagga 110461 tagttagctc tccttgttga attgatccct ttaccattat gtaatggcct tgtctctttt 110521 gatctttgtt ggtttaaagt ctgttttatc agagactagg attgcaaccc ctgccttttt 110581 ttgttttcca tttgcttggt agatcttcct ccatcccttt attttgagcc tatgtgtgtc 110641 actgcatgtg agatgggttt cctgaataca gcacactgat gggtcttgac tctttatcca 110701 atttgccagt ctgtatcttt taattggagc atttagccca tttacattta aggttaatat 110761 tgttatgtgt gaatttgata caggaggatt ttcacttatt tcccctgaga gccaaggaga 110821 gacagacaca ttttcttgtt gtttcttttt gcttgttaag gaactttttc tagtttgcac 110881 ttagagcgta aagatttggc ctttgagcat ctcagattta tgtgagagtc tcagtcttaa 110941 ctccccacct tgtgcacacc caaggccttg tcttctactt ccgtacggtt tttaaaactc 111001 aaaactctta gttactatga ttggcaaatg cctgcagggc agctgtggct gctacatcca 111061 cttatcactt gtttttttcc gtttttctct attttggggc cctatattta taaaattttc 111121 tttatttttg tgaaaggtca gcagtgcatt taagaagaca atttgcagtg gggagttgta 111181 ttgttgttat tgttatttag tccactgtat tgatacattt atttggtcca ctatattgat 111241 agcccccatt tagcagataa tgaaactaag accctaagaa ggaagtgagt tgcctaaggt 111301 catactggtg ggtcaacaat tgaactggca ctggaaccaa atgttcttga ttcctagttt 111361 agagttcttc ccctctgggc aaatgtgcct catcttcctt tctcactcaa gtatctctct 111421 tacgtttagt cctatgagga tctggttcag cgtctggagc cagtgataat ggagctagaa 111481 cgacaggaga atgtactggt gatctgccac caggctgtca tgcggtgcct cctggcctat 111541 ttcctggata aaagttcagg tactaccctc atctctgctt tgggaatgtt tgatggtatt 111601 tgagacactg acaccaacaa ttagatatct atatcttagt agttaagaga agagtatcta 111661 gaacaaggat gcctaggttc agattctttc tctttcacat actagctgta ctgcctttca 111721 caaatttctt taacttctct ggcctcagtg tcctcatctg taaaatgggg agaataatga 111781 tgaattttaa ataacataag taaaaagttt agaacagttc atgacacatt gtgtgggcta 111841 tatattaact gctattatta gtatagtatt attcttagct aacattttca aagccaacgt 111901 taacagtctg cctcaatctg gaaaagatga gctagtattt ctgtgaagca tgctaccatc 111961 aggtgatagt agggcctttc atagcttcat gatc // LOCUS HS2CGMPPK 3328 bp RNA PRI 15-MAY-1996 DEFINITION H.sapiens mRNA for type II cGMP-dependent protein kinase. ACCESSION X94612 NID g1181224 KEYWORDS type II cGMP-dependent protein kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3328) AUTHORS Orstavik,S., Solberg,R., Tasken,K., Nordahl,M., Altherr,M.R., Hansson,V., Jahnsen,T. and Sandberg,M. TITLE Molecular cloning, cDNA structure, and chromosomal localization of the human type II cGMP-dependent protein kinase JOURNAL Biochem. Biophys. Res. Commun. 220 (3), 759-765 (1996) MEDLINE 96183022 REFERENCE 2 (bases 1 to 3328) AUTHORS Orstavik,S. TITLE Direct Submission JOURNAL Submitted (02-JAN-1996) S. Orstavik, University of Oslo, Institute of Medical Biochemistry, pb 1112, Blindern, N-0317 Oslo, Norway FEATURES Location/Qualifiers source 1..3328 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="4" /map="q13.1-21.1" CDS 15..2303 /codon_start=1 /product="Type II cGMP-dependent protein kinase" /db_xref="PID:e217761" /db_xref="PID:g1181225" /translation="MGNGSVKPKHSKHPDGHSGNLTTDALRNKVTELERELRRKDAEI QEREYHLKELREQLSKQTVAIAELTEELQNKCIQLNKLQDVVHMQGGSPLQASPDKVP LEVHRKTSGLVSLHSRRGAKAGVSAEPTTRTYDLNKPPEFSFEKARVRKDSSEKKLIT DALNKNQFLKRLDPQQIKDMVECMYGRNYQQGSYIIKQGEPGNHIFVLAEGRLEVFQG EKLLSSIPMWTTFGELAILYNCTRTASVKAITNVKTWALDREVFQNIMRRTAQARDEQ YRNFLRSVSLLKNLPEDKLTKIIDCLEVEYYDKGDYIIREGEEGSTFFILAKGKVKVT QSTEGHDQPQLIKTLQKGEYFGEKALISDDVRSANIIAEENDVACLVIDRETFNQTVG TFEELQKYLEGYVANLNRDDEKRHAKRSMSNWKLSKALSLEMIQLKEKVARFSSSSPF QNLEIIATLGVGGFGRVELVKVKNENVAFAMKCIRKKHIVDTKQQEHVYSEKRILEEL CSPFIVKLYRTFKDNKYVYMLLEACLGGELWSILRDRGSFDEPTSKFCVACVTEAFDY LHRLGIIYRDLKPENLILDAEGYLKLVDFGFAKKIGSGQKTWTFCGTPEYVAPEVILN KGHDFSVDFWSLGILVYELLTGNPPFSGVDQMMTYNLILKGIEKMDFPRKITRRPEDL IRRLCRQNPTERLGNLKNGINDIKKHRWLNGFNWEGLKARSLPSPLQRELKGPIDHSY FDKYPPEKGMPPDELSGWDKDF" BASE COUNT 1072 a 618 c 775 g 863 t ORIGIN 1 ggtccctgag caaaatggga aatggttcag tgaaacctaa acattctaag cacccagatg 61 gacactctgg gaacctcacc actgatgctc tgcggaacaa ggtgacagag ctggagagag 121 agttgaggag gaaggatgct gagatccagg agcgggagta ccatttgaag gagctgcggg 181 agcagctgtc gaagcagact gtggccattg ctgaactcac agaggagctc cagaacaagt 241 gcatccagct gaacaagctg caggatgtgg tgcatatgca gggaggaagc ccgcttcagg 301 cctctccaga taaagtgcct cttgaggtcc accggaagac ctctggattg gtctctctcc 361 atagcaggag gggagcaaag gctggcgtgt ctgctgagcc aacaacccgg acctatgacc 421 tgaacaaacc ccctgaattt tcctttgaga aagcaagagt cagaaaagac tccagtgaga 481 agaagctcat tacagatgcc cttaataaaa atcagtttct gaaaagactg gatcctcagc 541 agatcaaaga catggtggaa tgcatgtatg ggagaaacta tcagcaaggg agttacatta 601 ttaagcaagg agaaccagga aaccatatct ttgtgctggc agagggtcga ctagaggtgt 661 tccaagggga gaaattgctg tcctccatcc ctatgtggac cacatttggg gagcttgcca 721 ttttatacaa ttgtacaagg actgcctctg tgaaagctat taccaatgtt aaaacatggg 781 cactagatcg agaggtattc cagaatataa tgaggaggac agcccaagct agagatgaac 841 aatacagaaa cttcctcaga agtgtatcct tgctgaagaa tttacctgaa gataaattaa 901 ccaagatcat tgactgcttg gaagtggaat actatgacaa aggagattac atcattagag 961 agggcgagga aggaagtacc tttttcattt tggcaaaagg aaaggtaaaa gtaacacaga 1021 gcacagaagg ccatgatcaa ccacagctga taaaaacact gcagaaagga gaatactttg 1081 gagaaaaagc tcttatcagt gatgatgtca ggtcagctaa cattattgct gaagaaaatg 1141 atgttgcatg cctggttata gatcgagaaa cattcaacca aactgtcggt acatttgaag 1201 agctgcaaaa ataccttgaa ggatatgtgg caaacctgaa ccgtgatgat gaaaaaagac 1261 atgcgaagcg gtccatgtct aactggaagc tgtccaaagc actctctctg gaaatgattc 1321 agctgaagga gaaggtggcc agattttcct catcatcccc attccagaac cttgagatta 1381 ttgcaacact gggcgttggt gggttcggaa gagttgagct tgttaaagta aaaaatgaga 1441 atgttgcttt tgctatgaag tgtataagga agaagcacat agttgacacc aagcagcagg 1501 agcatgtcta ctcagagaag aggatcctag aggagctgtg ctctccattc attgtgaaat 1561 tatatcgtac tttcaaggac aataagtatg tatacatgct tctggaggcc tgcttaggtg 1621 gggagctctg gagtatatta agggacagag gcagctttga tgaacccacc tccaaattct 1681 gcgttgcttg tgtgacagaa gcatttgatt acctgcatcg actaggtatt atctacagag 1741 acttgaaacc agaaaactta attctagatg ctgagggtta ccttaaattg gttgactttg 1801 gatttgcgaa gaaaataggg tctggacaga aaacatggac attctgtggg actccagaat 1861 atgtagctcc tgaagtcatt ctcaacaagg gacatgactt cagtgtggat ttctggtcac 1921 tgggaattct agtgtatgag ctcctaacgg gcaacccacc cttttctggg gttgaccaaa 1981 tgatgaccta caatttgatt ctcaaaggaa ttgaaaaaat ggattttccc aggaagataa 2041 cacgacgacc tgaggatttg attcggaggc tttgcaggca aaatccaaca gaaaggctgg 2101 gaaatctgaa gaatggaata aatgacatta agaaacacag gtggttaaat ggttttaatt 2161 gggagggact gaaagcacgg agccttccat cacctttgca aagagagctc aagggaccca 2221 tagatcacag ctactttgac aaatatcctc ctgaaaaggg aatgcctcca gatgagctat 2281 caggctggga taaagacttc tgacagaaga aaagttgatt actgcctgta ctctacagaa 2341 gaggacctca aggatcaata atccaacaca ttattttctt ttcagagtat tataatatct 2401 ttggaagacc attagggaaa agaaattcct gcacaatggg aagaggagaa tggtgtggat 2461 atggttctga gttatagtgt cttatttaga tgctgtgaat tattgatgta ttacattatt 2521 tgcttttctc aactgctaga ggctacccca ttttcctttc cacaatcaga gccatttttg 2581 ttaaagtggc agttttttct gcaatctatt gttccattcc aatcatatcc ttctcgtttg 2641 agtactacta acgtttaaaa agggtcttgc ccttgaataa ctaagtcata caaatagaag 2701 gaaaaacaat ggtgaatttt ggagagctcc ccagctctca gcaacttcat ataagcatgg 2761 tggtatctta aaatggtggt ttgcagaaac catggtagcc aacagaacct cctgactttc 2821 acctgctttt aacctgaaaa tatattaata cgcctctgac atgagtccag ggagaggcaa 2881 gattgagcaa tagtcagtgg caccatttcc taaagtaact gaatcaatca atccagtcag 2941 gtaattattt tctattgggg aacttagaaa aaaaagtaaa gcaagaaata ttttctctcc 3001 tttatgatca gtgatacctt agagactttt cgaaagtcct tgttaaagat agttacaatg 3061 tgtggttaaa tgatgctaaa atattttcac aaccagagta gaaaacgtgc tggaacaaaa 3121 ttgcgaaggc ttttgcctcc cagtaaatag aaaggaaaat atattctagt aggtaagact 3181 gctgactctt tgaaaatctg aggtattttg aaaaattctg tgatagttgc tattaataaa 3241 ccaaaattgt atagaatcat cgttaatttt taatataaat cccattgaca aatgttgatg 3301 cactgtagtc actgctgagt taatagcg // LOCUS HS308937 2032 bp RNA PRI 20-SEP-1997 DEFINITION H.sapiens mRNA; IMAGE cDNA clone 308937. ACCESSION Z93322 NID g2425154 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2032) AUTHORS Thiel,C., Lehrach,H. and Yaspo,M.L. TITLE Isolation of candidate genes mapping in the APECED disease (PGD type I) region on Human chromosome 21q22.3 JOURNAL Unpublished REFERENCE 2 (bases 1 to 2032) AUTHORS Yaspo,M.L. TITLE Direct Submission JOURNAL Submitted (20-MAR-1997) Yaspo M.L., Max Planck Institute fuer Molekulare Genetik, Ihnestrasse 73, Berlin D-14195, Germany FEATURES Location/Qualifiers source 1..2032 /organism="Homo sapiens" /note="cDNA clone 308937 was cloned in reverse orientation" /db_xref="taxon:9606" /tissue_type="lung" /dev_stage="19 weeks" /clone_lib="cDNA library constructed by Bento Soares" /clone="308937" CDS 8..778 /codon_start=1 /product="c21ORF-HumF09G8.5" /db_xref="PID:e308951" /db_xref="PID:g2425155" /translation="MKLTRKMVLTRAKASELHSVRKLNCWGSRLTDISICQEMPSLEV ITLSVNSISTLEPVSRCQRLSELYLRRNRIPSLAELFYLKGLPRLRVLWLAENPCCGT SPHRYRMTVLRTLPRLQKLDNQAVTEEELSRALSEGEEITAAPEREGIGHGGPKLCCT LSSLSSAAETGRDPLDSEEEATSGAQDERGLKPPSRGQFPSLSARDASSSHRGRNVLT AILLLLRELDAEGLEAVQQTVGSRLQALRGEEVQEHAE" 3'UTR 779..>2032 polyA_signal 1064..1069 /note="Alternative polyA signal (as found for IMAGE clone 303091). No polyA signal identified for clone 308937" BASE COUNT 376 a 601 c 669 g 386 t ORIGIN 1 ggccgccatg aagctgacgc ggaagatggt tctgacccga gccaaggcct cggagctgca 61 cagcgtgcgc aagctcaact gctggggcag ccgcctcaca gatatctcca tttgccagga 121 gatgcccagc ctggaggtga tcacgctcag tgtcaacagc atctccaccc tggagcctgt 181 gagccggtgc cagcgcctga gtgagctgta cctgcggagg aaccgcatcc ccagcctggc 241 tgagctcttc tacctgaagg ggctgccgcg tctgcgggtg ctgtggctgg ccgagaaccc 301 gtgctgcggc accagccccc accgctaccg catgaccgtg ctgcgcaccc tgccgcgcct 361 acagaagctg gacaaccagg ctgtgacgga ggaggagctg tcccgtgcac tgagtgaggg 421 agaggagatc actgcggccc cagagagaga gggcataggc cacggcggcc ccaagctatg 481 ctgcacactg agctccctca gctccgctgc tgagactggc cgggacccgc tggacagcga 541 ggaggaggca accagcggcg cccaggatga acgtggcctg aagccgcctt cccggggcca 601 gtttccttcc ctctcagcca gggatgcctc gagcagccac aggggcagga acgtcctgac 661 tgccatcctg ctgctgctgc gggagctgga tgcagagggg ctggaggccg tgcagcagac 721 tgtgggcagc cggctgcagg ccctgcgtgg ggaagaggtg caggagcacg ccgagtgacc 781 gcaggacctg aacgccgctc cagcctccac ggggacccca gcgtcttccc cagcccccgg 841 gagctggagg gtggctgcca tggccgcagc cccggcccca cacaaaagcc tccccggttt 901 gccacatcgg ccgagggcag gagtgggtgt taggtactgg ctaaccgggg cggtggagat 961 gcctgtctac accagtcctg tccccaggac tccccttctg tggtctggag gttctaggct 1021 ggcctgggct cttaaaggga ggattttgca ggctgtcctc cctaataaaa gattttccca 1081 aggttgatct ggaggtgacg tttcccaagg catggttgtg acagtcttcc tgaaagcaag 1141 ggccgtttct ccacagaatg aggaggtctt cacggatggc tttgccgatt ccagcccttg 1201 gtccttggat agatttctgt tgggcaaggg agacttcgag aatgctggca cccatagcga 1261 catctgccct gagctgctca gcaggcaggc agcagcctcc cgcacccatg ggagagagtt 1321 ttgttttctt ggccgggctg ggcaggtgcg ccttcctttc tctctcctgt gaagaactgc 1381 aggagggccc tcagctgtga ggttgtggat gaacaggcct gttcgccctg gggtccgggc 1441 ctggcacagg aagcagctag actaggggag tccttggagc atctgagagg ggttttcttt 1501 cctctgggaa gagtcgcaag tcttatgagg atgccacaat ctgggcagtg acccgtggcc 1561 ctggggacct tcctagtgcc ccccaaaacc aaggaagcat tcacgtgaga tggggagtca 1621 gccttccagc tggtgtgtag gaaagccagg tccctgggcc cttgtcccat gtggggttga 1681 gatccagatc tccatgagat attcccatcc tccgccgagc ggtgatgcgt gaccgtgtgt 1741 gcaggccgct gttctcccaa gctcggggcc atgccgtgcc accgtcacca gagccagctg 1801 cagcactggg gcagccactg tcgccggact tggggaagcc cagggtgaag acagtcactc 1861 atccgtgcaa gctctccaga agggatgcaa gctctccaga aggggtgtga tctctgagca 1921 gtggggagtg gagtcaccac tcgggacata gagggtggac agttcaggcc tgtgtctgcc 1981 tccttgggac tgtggagtgg gtgccctcag taaagcgctt tgttggaatg gt // LOCUS HS326L13 127247 bp DNA PRI 19-DEC-1996 DEFINITION Human DNA sequence from PAC 326L13 containing brain-4 mRNA ESTs and polymorphic CA repeat. ACCESSION Z82170 NID g1730463 KEYWORDS brain-4; repeat polymorphism; X. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 127247) AUTHORS Deadman,R. TITLE Direct Submission JOURNAL Submitted (04-DEC-1996) Sanger Centre, Hinxton, Cambridgeshire, CB10 1RQ, UK. E-mail enquires: humquery@sanger.ac.uk Clone requests: clonerequest@sanger.ac.uk COMMENT IMPORTANT: This sequence is the entire insert of clone 326L13. The true left end of clone 326L13 is at 1 in this sequence. The true right end of clone 326L13 is at 127247. 326L13 is from a whole genome PAC library. FEATURES Location/Qualifiers source 1..127247 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /map="X" /clone="326L13" repeat_region 1..290 /note="L1 repeat: matches 2109..2400 of consensus" repeat_region 292..705 /note="MSTA repeat: matches 1..426 of consensus" repeat_region 706..780 /note="MST-INTERNAL repeat: matches 1..75 of consensus" repeat_region 786..1166 /note="MSTA repeat: matches 1..426 of consensus" repeat_region 1166..2731 /note="MST-INTERNAL repeat: matches 75..1651 of consensus" repeat_region 2728..2853 /note="L1 repeat: matches 2273..2398 of consensus" repeat_region 2857..3270 /note="MSTA repeat: matches 1..426 of consensus" repeat_region 3266..6050 /note="L1 repeat: matches 2392..5195 of consensus" repeat_region 6008..7510 /note="L1 repeat: matches 3888..5390 of consensus" repeat_region 7359..8399 /note="L1MA1 repeat: matches 4..1046 of consensus" repeat_region 8434..9146 /note="L1MA2 repeat: matches 220..949 of consensus" repeat_region 9142..10766 /note="L1 repeat: matches 4641..3043 of consensus" repeat_region 10752..10900 /note="MER11A repeat: matches 4..151 of consensus" repeat_region 10899..11523 /note="MER11A repeat: matches 60..738 of consensus" repeat_region 11208..11819 /note="MER11B repeat: matches 2..632 of consensus" repeat_region 11828..14353 /note="L1 repeat: matches 3099..589 of consensus" repeat_region 14500..17731 /note="L1 repeat: matches 3949..757 of consensus" repeat_region 18449..18681 /note="L1 repeat: matches 256..12 of consensus" repeat_region 18675..18922 /note="L1HS repeat: matches 257..1 of consensus" repeat_region 18774..19055 /note="L1 repeat: matches 5390..5110 of consensus" repeat_region 19056..19338 /note="AluSc repeat: matches 1..279 of consensus; incomplete repeat" repeat_region 19382..19562 /note="L1 repeat: matches 5097..4907 of consensus" repeat_region 19985..20113 /note="L1MA7 repeat: matches 900..1031 of consensus" repeat_region 20498..21088 /note="L1 repeat: matches 4802..5390 of consensus" repeat_region 20939..21827 /note="L1PA2 repeat: matches 1..889 of consensus" repeat_region 22413..22848 /note="L1PA5 repeat: matches 890..455 of consensus" repeat_region 24222..24577 /note="L1PB2 repeat: matches 902..530 of consensus" repeat_region 24677..24990 /note="L1PB1 repeat: matches 538..216 of consensus" repeat_region 25604..25673 /note="MER25 repeat: matches 1513..1581 of consensus" repeat_region 26123..27978 /note="L1 repeat: matches 3950..2079 of consensus" repeat_region 28491..29124 /note="L1 repeat: matches 1982..1341 of consensus" repeat_region 29544..29794 /note="MER43 repeat: matches 1..272 of consensus" repeat_region 31871..31904 /note="17 copies of 2 mer 85 % conserved" repeat_region 34282..34514 /note="MER8 repeat: matches 7..238 of consensus" repeat_region 34533..34560 /note="14 copies of 2 mer 100 % conserved" repeat_region 36367..36469 /note="AluJo repeat: matches 11..113 of consensus; incomplete repeat" repeat_region 36472..36959 /note="L1 repeat: matches 4880..5390 of consensus" repeat_region 36812..37658 /note="L1PB3 repeat: matches 2..845 of consensus" repeat_region 37656..37797 /note="L1PA5 repeat: matches 749..890 of consensus" repeat_region 37852..37908 /note="L1PB2 repeat: matches 845..902 of consensus" repeat_region 37916..38081 /note="AluJo repeat: matches 129..302 of consensus; incomplete repeat" repeat_region 38838..38884 /note="U2 repeat: matches 49..1 of consensus" repeat_region 40029..40333 /note="AluY repeat: matches 1..301 of consensus" repeat_region 40339..40447 /note="MIR repeat: matches 8..114 of consensus" repeat_region 40726..41031 /note="AluJb repeat: matches 1..302 of consensus" repeat_region 41045..41176 /note="FLAM_A repeat: matches 1..133 of consensus" repeat_region 43923..44491 /note="L1PA2 repeat: matches 893..322 of consensus" repeat_region 44620..44924 /note="AluY repeat: matches 1..301 of consensus" repeat_region 45353..46200 /note="L1PA9 repeat: matches 909..35 of consensus" repeat_region 47540..47857 /note="L1 repeat: matches 3369..3683 of consensus" repeat_region 48795..49488 /note="L1PA9 repeat: matches 783..79 of consensus" repeat_region 49487..49595 /note="L1PA16 repeat: matches 796..904 of consensus" repeat_region 50564..50877 /note="L1 repeat: matches 1729..2042 of consensus" repeat_region 51445..51673 /note="MER46 repeat: matches 227..2 of consensus" repeat_region 51713..51911 /note="L1 repeat: matches 2251..2451 of consensus" repeat_region 52321..52930 /note="MER25 repeat: matches 1514..2124 of consensus" repeat_region 52713..53946 /note="L1 repeat: matches 1403..2650 of consensus" repeat_region 53949..56515 /note="L1 repeat: matches 2833..5390 of consensus" repeat_region 56368..57238 /note="L1PA15 repeat: matches 1..896 of consensus" repeat_region 57421..57721 /note="AluJo repeat: matches 1..302 of consensus" repeat_region 58610..58915 /note="THE1B repeat: matches 362..52 of consensus" repeat_region 59121..59282 /note="L1 repeat: matches 3656..3812 of consensus" repeat_region 59291..59501 /note="L1MB1 repeat: matches 706..916 of consensus" repeat_region 60538..62078 /note="L1 repeat: matches 4399..2821 of consensus" repeat_region 62147..62332 /note="L1PB2 repeat: matches 706..902 of consensus" repeat_region 62590..63179 /note="L1 repeat: matches 1406..1986 of consensus" repeat_region 63594..65329 /note="L1 repeat: matches 1965..3707 of consensus" repeat_region 65328..65819 /note="L1 repeat: matches 4904..5390 of consensus" repeat_region 65671..66407 /note="L1MA3 repeat: matches 1..758 of consensus" repeat_region 66427..66510 /note="L1MB3 repeat: matches 834..921 of consensus" repeat_region 66515..66664 /note="L1PA2 repeat: matches 744..893 of consensus" prim_transcript 67762..68425 /note="match: 3' EST D59947 clone GEN-076F02; Paired with EST D59948 matching this clone; match: 5' EST D59948 clone GEN-076F02; Paired with EST D59947 matching this clone" repeat_region 68456..68483 /note="7 copies of 4 mer 96 % conserved" repeat_region 68708..69022 /note="AluJo repeat: matches 3..301 of consensus" repeat_region 69141..69269 /note="L1 repeat: matches 4889..5021 of consensus" repeat_region 69296..69445 /note="AluJb repeat: matches 133..292 of consensus; incomplete repeat" repeat_region 69794..70201 /note="L1ME3 repeat: matches 295..713 of consensus" repeat_region 70488..70563 /note="MIR repeat: matches 79..154 of consensus" repeat_region 71132..71317 /note="MIR repeat: matches 235..49 of consensus" repeat_region 72477..72817 /note="MER1B repeat: matches 1..337 of consensus" repeat_region 73089..73110 /note="11 copies of 2 mer 100 % conserved" repeat_region 73948..74135 /note="MIR repeat: matches 261..64 of consensus" misc_feature complement(80796..81180) /note="match: Z16949 DNA segment containing (CA) repeat" repeat_region 81033..81070 /note="19 copies of AC 100 % conserved" repeat_region 81033..81070 /note="19 copies of 2 mer 100 % conserved" CDS 87087..87761 /note="match: X82324" /codon_start=1 /product="brain-4" /db_xref="PID:e286617" /db_xref="PID:g1747362" /translation="MATAASNPYSILSSTSLVHADSAGMQQGSPFRNPQKLLQSDYLQ GVPSNGHPLGHHWVTSLSDGGPWSSTLATSPLDQQDVKPGREDLQLGAIIHHRSPHVA HHSPHTNHPNAWGASPAPNPSITSSGQPLNVYSQPGFTVSGMLEHGGLTPPPAAASAQ SLHPVLREPPDHGELGSHHCQDHSDEETPTSDELEQFAKQFKQKKNQVGLHAGRRGVG AGHTVW" repeat_region 89944..89975 /note="16 copies of 2 mer 84 % conserved" prim_transcript complement(90619..90846) /note="match: 3' EST H88604 clone 252959" repeat_region 91204..91247 /note="22 copies of 2 mer 89 % conserved" repeat_region 97042..97079 /note="MIR2 repeat: matches 142..106 of consensus" repeat_region 97447..97585 /note="AluJb repeat: matches 293..156 of consensus; incomplete repeat" repeat_region 98815..98975 /note="L1MD1 repeat: matches 810..971 of consensus" repeat_region 99416..99461 /note="MIR2 repeat: matches 146..101 of consensus" repeat_region 99518..99724 /note="MIR repeat: matches 49..262 of consensus" repeat_region 100012..100299 /note="AluSx repeat: matches 301..1 of consensus" repeat_region 101681..101946 /note="AluSx repeat: matches 37..302 of consensus; incomplete repeat" repeat_region 102925..103178 /note="MIR repeat: matches 262..2 of consensus" repeat_region 103226..103267 /note="21 copies of 2 mer 100 % conserved" repeat_region 103712..103857 /note="MIR repeat: matches 246..97 of consensus" repeat_region 104638..104939 /note="AluSg repeat: matches 1..300 of consensus" repeat_region 105034..105107 /note="MIR2 repeat: matches 141..68 of consensus" repeat_region 105524..105568 /note="MIR2 repeat: matches 89..133 of consensus" repeat_region 105769..106072 /note="AluSx repeat: matches 1..302 of consensus" repeat_region 109655..109902 /note="MIR repeat: matches 251..5 of consensus" repeat_region 110094..110157 /note="MER45 repeat: matches 1..66 of consensus" repeat_region 110397..110517 /note="MER45 repeat: matches 59..175 of consensus" repeat_region 110565..110712 /note="MIR repeat: matches 36..177 of consensus" repeat_region 112221..112439 /note="MER20 repeat: matches 218..1 of consensus" repeat_region 112798..112865 /note="MER45 repeat: matches 1..68 of consensus" repeat_region 113445..113569 /note="MER45 repeat: matches 63..178 of consensus" repeat_region 116221..116470 /note="MIR repeat: matches 9..261 of consensus" repeat_region 117515..117618 /note="MIR repeat: matches 3..102 of consensus" repeat_region 119322..119349 /note="7 copies of 4 mer 93 % conserved" repeat_region 119350..119649 /note="AluY repeat: matches 300..1 of consensus" repeat_region 119869..119967 /note="L1PA4 repeat: matches 891..793 of consensus" repeat_region 121161..121938 /note="L1 repeat: matches 4..772 of consensus" repeat_region 121934..126433 /note="L1 repeat: matches 896..5390 of consensus" repeat_region 126288..127176 /note="L1PA2 repeat: matches 1..893 of consensus" BASE COUNT 42139 a 23700 c 23416 g 37992 t ORIGIN 1 gatcaaattc acacatacca atgctaacct taaatgtaaa taggttaacc gccccagtta 61 aaaggcacag agtagcaagg tagataaaac agcaagaccc aatttaattc tttcttcaaa 121 agactcgtct cacatgcaat gacactcata gtctgaaaat aaagggatgg aagaaaatct 181 accaagcaag tggaaattag aaaaaagcag gggttacagc cctaatttca gacaaaacag 241 actttcaact aaaaagataa aaaatgacaa agaagggcat tccataatgg ttgatatggt 301 ttggatctgt atccccacct aaatctcatg tggaattgtc atccctaatg ttgaaaaagg 361 ggcccggcag gaggttattg gatcatgggg gtggagatct cacaaatggt ttagcaccgt 421 ccccatttgg tactgtatag tgagtaaatt gtcacaagat atgattgttt aaaagtttat 481 agcacctccc atttctctct cctcctcccg ctctggccac gtgaagatgc cttctcctgt 541 ttttcatgct gccatgagta atgtctccct gaggactccc caaaagcaga tgctgccata 601 cttcctgtac ggcctgtgga actgtgagac acttaaacct ctttttctaa taaattaccc 661 agtctcaggt atttctttat agcagtgtga gaatgaacta atacagaaaa ttggtactga 721 ggagtggagt attgctataa agatacctca aaatgtggaa gcagctttga aattaggtaa 781 tcagttgata cagtctggat ttgtgtccct gcccaaatct catgtcaaat tgtaattccc 841 agtgttggaa gaggggcctg gtgggaggtg attggatcat gggggtgcat ttctcccttg 901 ctgttcttgt aatagtgagt gagttctcac aagatctggt tgcttaaaag tgtgtagcac 961 ctgccccttc tctctctcct tctccaacca tgtgaagaca tacctgcttc tcttttgcct 1021 tctgccatga ttaaaaattt cctaaggcct tcccagacat gatacctgta cagcccatgg 1081 aaatgtgaga taagaaaacc ccttttcttt ataaattaca cagtctcagt tattttttta 1141 tagcaatgtg agaatggatc aatacactgg ctaaagttgg aagagtgtgg agggctcaga 1201 agaagagagg aagatgaggg aaaattcaga acatcctaga gactggttga atggttgtga 1261 ccaaaatgct gatggtgata tggacaatta agtctaggtt gaggaggtct taggtggaga 1321 tgaggaactt attgggaact aaagcaaagg tcacattcat tatgagttaa caaagagatt 1381 ggctgcattg taccccagcc ctagggagct gtggaacttt gaacttgaga atgatgattt 1441 agggtatctg gtataaaaac atttctaagc ggcaaagcat ttaagatgtg acctggcttc 1501 ttctaatcac ctatgctcat atgcatgagc aaagaaatga tgtaaaactg aaagttatat 1561 ttaaaaggga atcagagtgt aaaagtttta aaaatttgca gccgggtcat gtggtaaaaa 1621 tgaaaagcct atttctgtgg ggaggaattc aagcaggctg cagaaatttg cacaagtgaa 1681 aaggagtcaa ctgttactat tcaagacaat ggggggaagg cttgaaagac atttcaaaga 1741 cctttgtggc atcccttttc atcacaggcc tggaggccta ggagggaaga atggtttcat 1801 gagtcaggct gagggaccca ctgccctgca caacttcagg acactggtcc ctgcatccca 1861 gctgctccag ctccagctat tcctaaaagg gtcccagata caatttgagt cactgcttca 1921 gagagtgcaa accataagcc ttggcagctt ccatgtggtg ttaagcctgc aagtacacaa 1981 aatgcaagaa ttaaggctgg ggagcctctg cctagatttc agaggatgta tgaaaaggcc 2041 tggatctcca tgcagaagcc tgctgcagga gcagagccct catggagaac ctcttctgag 2101 gaagtgagga agaaaaaaat gaggttggag cccccaaaca aagtccccac tggggcattg 2161 actagtggat ctgtgagaag agggccatca tcctccagac cccagaatgg tagattgact 2221 agccatttgc aatttccttt tagaaaagct gaagacactc aatgccagct catgagagca 2281 gccaaagggg caaaaccctg aaaagccaca ggggcaaagc agaccaaaga ttgggaatcc 2341 acccattgca tcagtgtgac ctggatgtga gatatggagt caaatgagat tattttgtag 2401 ctttaagatt taaagcttaa atcttgactg ccttgctgga tttcagactt acactgggcc 2461 tgtaggccct ttgttttggc tgatttctcc cttttagaat ggggaaattt acccaatgcc 2521 cataccttcg ctggatcctg aacataacta acttgttttg attttgcaga ctcatagtcc 2581 aaagcaacac tgcttgtctc agataagagt ttgaactatg gacttttgag ttaatgctgg 2641 aatgggttaa gactctgggt aacttttggg aaggcatgat tgtattttga aatgtgagaa 2701 agaggagatt tgggagaggc caggagcaga atgaagaaaa tctaacaagc aaatggaaat 2761 cagaaaaaaa gcagagattg ctatcataat ttcagacaaa acagacttta aaccaacaag 2821 gatcaaaaac aacaaaggag ggaattacat aattcttgct atggtttgtg tttgtgtccc 2881 ttcccaagtc tcatgtggaa ttgtaatctc cagtgttgga ggagaggcct ggcaggaggt 2941 gattggatca tgggagtaga gttctcctaa atggtttagc aacatcaccc cttggtaccc 3001 tatagtgagt gagttctcac aagatctgat tgttttaaag tgtatagcac atcccacctt 3061 ctcggccttt ctgttgctct ggccatgtta aaatgcctgc tcctgctttg ccttctgcca 3121 taagtaaaat attccagagg ccttcccaga agctgatgct gccatgtttc ctgtacagtc 3181 tgtggaagca taagccaatt aaaccttttt ttaaataaat tacccagtct caaatatttc 3241 tttataacaa taagagaatg aactaataca atggtaaagg gtccaagtca acaagaaggc 3301 taaactattc taaatatata tgcactcaac acaggagcac ccagattcat aaagcaagtt 3361 cttagagacc ttcaaagaga cttagacttc tgaacaataa taatggaaga cttcaatgtc 3421 acactgacag tattagacag attataaaga cagtaaatta aagatattca ggaccttaac 3481 tcatcactgg atcaaatgga tctgatagac atctgcagaa ctctccacct caaaacagca 3541 gcatatacat tcttctcatt gccacatgcc acaactctaa aatcgatcac ataattggac 3601 ataaaacact catcagtaaa tgcgaaagaa ctgacataac aacaactctc tcagaccaca 3661 gcacgatcaa attagaaatc aagcaaaaga gatttgctca aaacaataca attacatgta 3721 aattgaataa actgctcctg aatgacttct ggttaaataa taaaattaag gtagaagtga 3781 ggaagttatt tgaaagtaat aaaagacaaa acataccaga atctctggga aataggtaag 3841 gcaatgttaa gagagaaaaa tataacacta aattcccaca gaaaaatgtt aaagagatct 3901 caatttaaca acctaacctc ataactaaaa gaactacaga accaagagaa actcagtgcc 3961 aaggccagaa gacaagaaat aaccaaaatc tgtactgaac tgaaagagac tgagatataa 4021 aaaaaaattt aatagatcaa tgaatccagg agttggtatt tataaaataa ttaataaaat 4081 agactgctag ctaaactaat aaagaataag agatagaaga tccgaataaa cataattaga 4141 aatgacaaag ggaatattac cactgaaccc acagaagtac aaataatcat cagagaaaat 4201 tatgaacacc tatatgaaca caaactagaa aatcttgaag acatgaataa attcctattc 4261 acatacactt ttctgagact aagtcaggaa gaaattaaat ccttgaacat gccaataatg 4321 agcttcaaaa ttgaatcagt aataaataaa ctagcaacca aaaaaactcc aagaccagat 4381 gaatttttag ctgcattcta ccacatgtac aaagaaggga tggtaacatt cctaccaaaa 4441 ctgtttcaaa aatattaaga aggaggaact cttccctaac ttactctatg aggccagcat 4501 cattctcata gtaaaccaga cagagactga ttccaccacc gtgtcccaaa ggaaaaaaaa 4561 aaaaaaactc tttagtctaa tatccttgtt taacatgatg caaaaattct caacaaaata 4621 ctggcaaatc aaattcagca acacataaaa aagctaatcc accacaatca agtgggcttt 4681 tttcagagat gcaaggttgg ttaaacatac acaaatcaat aaatgggatt catcacataa 4741 acagaactaa aaacaaaaac cacatgatta tctcaattga agcagacaag acttgatgaa 4801 attcaacacc ccttcatgtt aaaaactctc aatgaactag gtattgatgg aatgcacttc 4861 aaaataataa aagccatcta tgacaaaccc acagctaaca ttatgggcaa aagctgaagg 4921 cagtcgtctg gaaagccagc acaagacaag gatgtcctct ctcaccattc ctattcaacg 4981 tagtattgga agtcctggcc agaacaataa ggaaaaaaaa aaaaaagaaa gagtatccaa 5041 atagtaagag aggaagtcaa actatctcat ttggcagatc tcatgatccc atatctagga 5101 aaaaatctca ttgctttggc tcaaaaactt cttaagctga taaacaactc cagcaaatct 5161 caaaatattt aatgaacata caataatcag taggattcct atacaaccat aacagtcaaa 5221 ctgagaatca aaccaaaaat gcaatgccat tcacaattgc atcaaaaaac taaaatatct 5281 ataaatacac ctaactagga aggtaaaaga tctctacaag gagaactata aaacattact 5341 caaaaataac agagatgaca taaacaaata aaaaaatcca tactcatgga taataagaat 5401 caatatcact aaaatggcca tatggcccaa agtaatttac agatgcaatg gtattcctat 5461 taagctgcca ttgatgtttt tcacagacct agaaaaaaat atttacaaat tcatacagaa 5521 cctaaaaaca gcttgaattg ccaggggaat cttaaccaga aagaacaaag tttgaggcat 5581 catgttacca cttttcaaac tatacaacac atctacagta atcaaaacag catagtactg 5641 ctacaaaaac ggacagatag accaatggaa caggaaagag agcccagaag taaagccaca 5701 cacttacaac taactggtct ttggcaaaag tgacaaaaac aagcaatggg aaaaggatgc 5761 tgggataact ggctagccac atacagaaga ttaaaactgg acctctttct tacaccatat 5821 acaatagtca aataaatata aattaaagac ttaaatgtaa aacccaatac cataaaaagc 5881 ctggaaaaca acttaggcaa taccattctg gacataggaa tgagcaaaga ttttaggatg 5941 aagacgccaa aaccaactac aacaaaagca aaaattggca aatgatatct aattaaaata 6001 aagagcttct gaacagaatg agaaatgatc tacagagtaa acagacaacc cttaacaata 6061 aatgtcctca acacagagta tagaagtaac atacctcaac atattgaaag ccatatagat 6121 cagaatcaca gctagtatta tactcaatgg ggaaaaacag aaagccagga tgccctggtt 6181 gtgtatggaa cacaaacagg atgcccactt tgaccacagt tattcaacat agtactggaa 6241 gttccagctc aaacagtcag ataagataaa gaaaaaaagt gcatccaaat tggaaagaaa 6301 gaaatcaaat aatttttcca aatatcaaat tacccttgtt tgcagaaaaa ctggaacact 6361 ttacaaaata actattagaa atgttaaaca aattcagtaa agttgcagga tacaaaatca 6421 acatgcaaaa atcgatagca tttttatatg ccaaaagcaa aaagtctgaa atataaatat 6481 aaacaaattc catttatagt agccacaaat aaaattaaat acctaaggat taactttgcc 6541 aaagaagtga aagatctcta caatacaaac tataaaactt tgatgaaagt aactgaagag 6601 gacaccaaaa tttggaaaga tattccaagt tcatggattg aaagaatcta tattgttaaa 6661 atgtttacac tacacaaatc aatctacaaa ttcaatgcaa tacctttcaa aatatcaatg 6721 acattctcca cagatataga caaaacaatc ttaaaatgta tatagaacca caaaagacca 6781 agaatagcca aagcaatcct gaagaaaaag aacaaaactg gagggatcac attacctgat 6841 ttcatattat actacagagt tctagtaatc aaaacggcag ggtaatggca tggaaacaaa 6901 catgtagaag agtggaacaa aataggaacc ccccaaaatt catatattta cagtgatttc 6961 attttcaaca aatatgccaa gaacatacat ttgggaaagg atatttcctt caataaatgg 7021 tgctgggaca actggatatc catatgcaga tgaatgaaac tagatgctta tttctcatca 7081 tatacaaaaa tcagatcaaa atgaattgaa attttaaata taagtcctca agctatgaaa 7141 ctactaaaat aaaactttgg ggaaactctc cagggcactg gactgggcaa aatatttctt 7201 gagcaatacc ccataagcac aagcaaccaa aaaaaaatgg acaaatggga tcaagttaaa 7261 aagcttctgc taaggaaaaa aaaaaaagcc aacaaagtga agagacaaca cacagaatgg 7321 ggaaaaaaat tggaacctac ccatcttatg agggagagat aaccagaatg tacaaagagc 7381 acaaacaaat ctatagaaaa taaatctaat aatctgatca aaaagtaaac caacaaatga 7441 tctgcataga catttctcaa aagaagacat acaaatggca aacaggcata tgaaaatgtg 7501 ctcaacatca ttaatcatca gagaaatgca aatcaaacta caataagcta ttatctcacc 7561 acagttaaaa tggcttatat caaaatgaca ggcaataaca gatgcagggg aggatgtgaa 7621 gaaaatggaa cttacgtgca ctgttggtgg gaaggtaaat tagtacaacc actatggaga 7681 acagtttgga ggtttcttaa aaaactaaaa accgagctaa catataatac aggaatttca 7741 ctgctaggta tgtacctaaa agaaagaaaa tcagtacatc gaagatatat ctgcactcct 7801 atatttgttg cagcattgtt tacaataggt aacatttgga agcagcgtaa gtgttcatca 7861 acagatgaat ggatagagaa catttggtac atatacacaa tggagtacta tttggccata 7921 aaaataatga gatccagtca tttgcaataa catggatgga actggatatt attacgctta 7981 ataaaataag ctaggcacag aaagacaaac atcacatgta tttacatatt tgtgggatct 8041 aaaaatgcaa tcaattgaac tcgtgaacgt aaagagtgga aggatagtta ccagagtctg 8101 ggaacggcag tggggcctgg aaggaggtgg ggatggttaa ttggtacaat aaaattagaa 8161 agaatgaata agacctacta tttgatagca caatagggta actatagtca ataataactg 8221 aattgtatat tttaaaataa cttaaaaggt gtaattggat tgtttgcaac tcaaatgata 8281 aatgcttgag gggatataca ccccattctc catgatgtgc ttattttata ttgcatgaat 8341 gtatcaagac atctcatgta ctccataaat atatacagct actaaatatc aacaaaaata 8401 ttaataaata aaaatttaaa aattttttta aaaactcaaa agacaggtaa taacaaatgc 8461 tggtgaggat gtggagcaaa gaaaagcctt gtacagtgtt ggtgtgagtg tagcttagta 8521 caagcactat ggagagtttg ggggctcccc aaaaaaacta aaattagggc taccatttga 8581 tataacaatg ccactaagta tataccctcc aaaaggaaat cagtatatca aagttatatc 8641 tgcactccca tgtttattgc tacactattc acaatagcca agatttggaa acaacctaag 8701 tgtccatcag cagaaaaata gataaagaaa ttgcggtaca tatacacaat ggaatactat 8761 ttgggcatga aaaagaacga gatcctgatt tttacaacaa catgattgca actggaagtc 8821 atcatgttaa gtgaagtaag ccaggcacag aaagacaaac ttctcatgtt cccacttatt 8881 tgttgaagct aaaaattaat acaattgtac tcattgagat agagagtagg atcattgtta 8941 ctagaggctt caaatattat tgggagggta agggggaagt ggggatagtt aatgggtata 9001 aaaagtagaa agaataaaaa gattagtatt tgatagcaca acagggtgat tatagtcaat 9061 aattatttaa ttgtaaattt aaagataatt aaaattgtgt aaatggattg tttgtaacac 9121 aaaagataaa tacctgaagt gatgaaattt aaagtagatt ttctaattct atgaataaag 9181 tcaatggcag cttgatgggg atagcattga atctataaat tactttggga agtatggcca 9241 ttttcacgat attgattctt cccatccaca agcatggaat gtttttccat ttgtttgtgt 9301 cctctcttat ttccttgacc actggttcat agttctcctt gataaggtcc ttcacgtccc 9361 ttgtaacttg gattcctagg tattttattc tctttgtagt aattgtgaat gggagttcac 9421 tcatgatttg gctctctatt attggtgtat aggaatgctt gtgatttttg cacgttgatt 9481 ttgtatcctg agactgcaga agttgtttat cagcttaagg agattttggg ctgagatgat 9541 ggggttttct aaatatacaa tcatgtcatc tttaaacagg gacaatttga cttcctctct 9601 tcctatttga atacccttta ttgctttctc ttgcctgatt gccctggcca gcacttccaa 9661 tactatgttg aacaggagtg gtgagagagg gcatccctgt cttgtgcccg ttttcaaaag 9721 ggatgcttcc agtttttgcc cattcagtat gatattggct gtgggtttgt cacaaatagc 9781 tcttattatt ttgagatgcg ttccatcaat acctagttta ttgagagttt ttagcatgaa 9841 agttgttgaa ttttgtcaaa ggtcttttct gcatctattg agataatcat gtggtttttg 9901 tcattggttt tgtttatgtg atggattacg tttattgatt tgcgtatgtt gaaccagcct 9961 tgcatcccag ggatgaagcc aacttgatca tggaggataa gttttttaat gtgctgctgg 10021 atttgctttg ccagtatttt attgagtata acagcattga tgttcatcag ggatattggc 10081 cagaaatttt ctttttttgt tgtgtctctg ccaggttttg gtatcaggat gatactgatt 10141 taaaggatga tgttggttta aagcctgttt tatcataaaa tgagttaggg aggattccct 10201 ctttttctat tgtttggaat agtttcagaa ggaaggtatc agctccttct tgtacctcag 10261 gtagaattcg tctgtgaatc cctctggtcc tggacttttt ttattgttgc tgtttgtagg 10321 ctattaatta ctgcctcaat ttcagaactt gttagtggtc tattcaggga tttgacttct 10381 tcctggtttt gacttgggag ggtgtatgtg tccaggaatt tatccattcc ttctagattt 10441 tctagtttat ttgtgtagag ttgtttatgg tattctctga tggtagcttg tatttctgtg 10501 ggatctgtgg tgatatcccc tgtatcattt tttatttcat ctatttgatt cttctctttt 10561 ttcttcttta tttgtctgaa tagtgatcta gctattttgt tgatcttttc aaaaaaccat 10621 ctcctggatt tactgatttt ttgaagggtt tttttgtctc tatctccttc agttctgctc 10681 tgatcttagt tatttcttgt cttctgctag gttttgaatt tgtttgctct tgcttttaat 10741 ttgtggtttg ttgttgtggg aagtcaggga ccccaaacgg agggactggc tgaagccatg 10801 gcagaagaac atagattgtg aagatttcat ggacatttat tagttcccca aattaatact 10861 tttgtaattt cttatgcctg tctttactgc aatttctaaa cacaaattgt aaagatttca 10921 tggacactta tcacttcccc aatcaatgcc cttgtgattt cctatgcctg tctttacttt 10981 aatctcttaa tcctgtcagc tgagaaggat gtatgtcacc tcaggacctt gtaataattg 11041 cattaactgc acaaattgta cagcatgtgt gtttgaacaa tatgaaatct gggcaccttg 11101 aaaaaagaac aggataacag caatgtttag gaaacaagag agataacctt aaactccgag 11161 cactggtgag ccaggcagaa cagagccata tttctcttct ttcaaaagca aatgggagaa 11221 atatcactga attctttttc tcagcaagga acatccctga gaaagagaat gcgcacctcg 11281 gggtgagtct ctgaactggc ccccctgggc gtggctgtct cttatggtcc agactgcagg 11341 ggtgaaatag accccagtct cccatagcgc tcccaggctt attgggaaga ggaaattccc 11401 gcctaataaa ttttggtcag accagttgct ctcaaaaccc tgtctcctga taagatgtta 11461 tccatgacaa tggtgcatga aacttcatta gcaattttaa tttcgccccg gtcctgtggt 11521 cctgtgatct caccctgcct cgacttgcct tgtgatattc tattaccttg taaagtactt 11581 gatgtctgtg acccacacct atttgcacac tccctcccct ttgaaaatcc ctaataaaaa 11641 cttgctggtt tttgtggctt gtggggcatc aaggaaccta ccgacatgtg atgtctcccc 11701 cagatgccca gctttaaaat ttctctcttt tgtactctgt ccctttattt ctcaagccag 11761 ccaatgctta aggaaaatag aaaagaacct atgtgaatat cggggcaggt tccccgatag 11821 tttgtgattt tgaatttgtt tgctcttgct tttaattgtg atgttagggt gttgatttta 11881 gatatttcct gctttctttt gtgggcattt agtgctataa atttccctct aaacactgcc 11941 ttaactgtgt cccagagact ctggtacatc atgtctttgt tctcattggt ttcaaagaac 12001 ttatttattt ctgccttgat tttattattt acccagtagt tattcaggag caggttgatt 12061 agtttctatg tagttgtgcg gttttgagtg agtttcttaa tcctgagttc taatttgatt 12121 gcattgtggt ctgagagatg gtttgttatg atttccattc ttttgcattt gctgaggagt 12181 gttttacttc caattatgtc gtcaaattta aaataagtgt gatatggtgc tcagaagcat 12241 gtatattttg ttgatttagg atggagagtt ctgtagatgt ctgttaggtc cacttggttc 12301 agagctgagt ttaaatcctg aatatccttg ttagttttct gtctcattga tctatctaat 12361 atggacagtg aggtgttaaa gtcccccact attattgtgt gggagcctaa ttctctttgt 12421 aggtctctaa gaacttgctt tatgaatctg ggtgctcctg tattgggtgc atatatattt 12481 aggatagtta gctcttcttg ttgcattgat ccctttaatg ttatgaaatg cccttcttgg 12541 tctcttttga tctttgttgg ttgaaagcct gtttttatca gagactagga ttgcaactgc 12601 tgcttttctt ttttctttcc gtttgcttgg taaatattcc tcaatcagtt tattttgagc 12661 ccatgtgtgt ctttgcacgt gagatggctc tcctgaatgc agcacaccaa tgagtcttga 12721 ctctatccaa tttgccagac tgtgtctttt aataggggca tttagccctt ttacatttaa 12781 ggttaatatt gttatgtttg aatttgattt tgtcattatg atgctggctg cttattttgc 12841 tcaatagttg cagtttcttc atagtgtcta tggtctttac aatttggtat gtttttgcag 12901 tggttggtac cggttgctcc tttctatgtt tagttcttcc ttcaggggtt cttggaaggc 12961 aggtgtggtg gtgacaaaat ctcttagcat ttgcttgtct gtaaaggatt ttatttctcc 13021 ttcacttatg aagcttagtt tggctggata ttaaattctg gattgaaaat tcttttcttt 13081 aagaatgttg aatattggcc cccactctct tctggcttgt aggatttctg cagagagatc 13141 cactgttagt ctgatggtgg gtaacccgac ctttctctct ggctgccctt aacatttttt 13201 ccttcattac aaccttggtg aatctgacga ttatgtgtct tgggattcct cttttcaagg 13261 agtatctttg tggtgttctg tgtatttcct gaatttgaat attggcctgt cctgctaggt 13321 ttgggaagtt ctcctgaata atatcctgaa gagtgttatc caacttggtt ctgttctccc 13381 cgtaactttc aagtacacca atcaaactta ggtttggtct tttcacatac tccaatattt 13441 cttggaggct ttgtttgttt attttcattc ttttttctct aatcttgtct tcacacttta 13501 tttcattaag ttgatcttca atatctgata tccttccttc cacttgatcg atttggctat 13561 tgatacttat gtatgcttca cgaagttctt gtgctgtttt ttggctccat caggtcattt 13621 atgttcttct ctaaagtagt tattctagtt agcaattcct ctaacctttg ttcaaggttc 13681 ttagcttctt tgcattgggt tagaacatgc tcctttagct cagaagagtt tgttattgtc 13741 caccttctga agcctacttc ggacaatttg tcatactcat tctccgtcca gttttgtttc 13801 cttgctggcg aggagttgtg atcctttgga ggagaagagg cgttctgttt cttggaatgt 13861 tcagcctttt tgtgctggtt tctccccatc tttgtggatt tatctacctt tggtctttga 13921 tgttggtgcc ctttggatag ggtttctgag tggacatcct ttttgttgat gttgatgtta 13981 ttcctttctg ttagttgttt tcctaacagt cagaaccctc tgttgcaggt ctgctggagt 14041 ttgctggaag tccactccag accctgtttg cctgggtatc accagcggag gctgcagaac 14101 agcaaagatt gctgcctgtt ctttcctctg gaagctttgt cccagacggg cacctgccag 14161 atgccagcca gagatctcct gtatgagggg tctgttggcc cctgctagga agggtctccc 14221 agtcaggaga cacaggcgtc agggaccagc tagaggaggc agtctgaccc ttagcagaac 14281 ttgaacactg tgctgggtga tcccctgctc tcttcagagc cgtcaggcag tgacgtttaa 14341 atctgctgaa gctgcgccca cagccgccct ttcccccaga tgctctgtcc cagggagatg 14401 ggggttttat ctataagtcc ctgactgggg ctgctgcctt tttttcaggg atgcccttcc 14461 cagagaggag gaatctagag aggcagtctg gctagagccg agaattttta atatgaagag 14521 atgttgaatt ttattgaagg ccttttctgc atctattgag ataatcatgt agtttttcct 14581 ttagttctac ttatgcaatg aattatgttt actgatttgc atatgttgaa tcaggtttac 14641 atcccaggaa ttaagccaaa tttatcatgg tggataagct ttttgaggtg gtgctggatt 14701 ccatttgcca gtaatttttt gagaattttt gcatcaatgt tcttcaaata tattggccta 14761 aagttttctt tttttgttgt atctctgcca ggttttggta tcaggatgaa gctggcctca 14821 taaaaagagt tatggaggag tctctctttt tcaattgttt ggaatagttt ctgaagaaat 14881 ggtaccagct cttgtttgta cctgtggtaa aatttaggtg taaatccatc tgattctggg 14941 cttttgttgg ttggtagact ttttactact ccctcaattt caaacttttt gttggtctct 15001 ccagggattc aatttcttcc tggtttagtc ttgggagggt gtatgtgtcc agaaatttat 15061 tcatttcttc cagattttct agtttatttg catagggggg ttcatagtat ttgctgatgg 15121 ttgcttgtat ttttgtggga tcagtggtga tatccttttt atcatttttt gtagtgtcag 15181 tttgattatt atttttttct tttttattag tctagccagc agtctatcca ttttattatt 15241 tttttaaaaa ccagctcctg gatttattgg ttttttgaaa gtttttttgt gtctctatct 15301 ccttcaattc ttctctgatc ttaatcgttt cttgtcttct gctagctttt gggtttcttt 15361 gctattgatt ctctagttct tttttttggg atgttagggt gctgatttga gatcctttta 15421 gcttttcaat gtaggtattt agtgctataa gtttacctct taacactgct ttagctatgt 15481 ctcagagatt ctggtacatg gtctctttga tctcattggt ttcaaataac ttcttcattt 15541 ctgccttaat tttattattt acccccagag tcattcagga gcagattgtt caatttccat 15601 gtgtttgtgt ggctttgtgt gagtttctta atcttgagtt ctaatttgat tgcactgtgg 15661 tctgagagac tgttatgatt tcagttcttt tgcatttgct gaagagtgtt ttacttccaa 15721 ttctgtgata gattttagag taagtgccat gtggctctga gaaaaatgta tattctgttg 15781 gttttgggtg gagagttcta tagatattta ccaggttcac ttgatccaga gctgagttaa 15841 agtcctgaat atccttgtta tttttctgtc tcgatgatct gtctcatatt gacaatgagg 15901 tgttacagtc tcccactata attgtgtgaa gtctaagtct ctttggagat ctctaagcac 15961 ttgttttgtg aatttgggtg ctcctgtatt gggtgaatat atatttagga cagttagctc 16021 ttcttgttga attgatgcct ttattttatg tcataccctt ctttgtgttt tttcttatct 16081 tttttagttt aaattctgtt ttgtcataaa ctaggattgc aacccctgct ttttttctgc 16141 tttccaatta cttggtaagt tttcctccat ccctttattt ttagcctatg tgtgtcttca 16201 catgtgagat gggtctcttg aatatggcac accaatggtt cttgactctt tatccagttt 16261 gccattccgt gtattttaac tggggcattt agcccattta catttaaggt taatattgct 16321 atgtgtgaat ttggtcctgt catcatgatg ttagccggtt attttttaga cttgttgatg 16381 cagttgcttc gtagtgtcat ttgtccttgt atttcagtgt gtttttgcag tggctggtaa 16441 tggtgtttcc tttctacatt tagtgcttct ttcaggagct ctttgtaagg caggcctggt 16501 ggtgatgaat tccctcagca tttgcttgtc tgaaaaggat tttatttcca ctttggttat 16561 gaagcttagt atggctgggt atgaaattct gggttgtaaa ttattctctt taagaatttt 16621 gaatattgga cttcaatctc ttctggtttg tagggtttct gctcagaggt ctgctgttag 16681 tctaatgggc ttttctctgt aagtaacgtg gtctttcttt ctggctgccc ttaacatgat 16741 ttccttcatt tcgaccttgg agaatctaat gattgtgtct tgggcttgac cttctcatag 16801 agtttcttac ttgggttctc cagattttct gaatttgaat gtttgcctat cttgttaggt 16861 tgaggaggtt cttttggatg atatcctgaa gtatattttc caacttggtt ccatcctccc 16921 catctctttc aggtacccca atctgtcata agttcagtct ttttatataa tcccatagtt 16981 cttggaggtt ttgttcattc cttttctttt ttctttttct tttctttttt tttctcctct 17041 aatcttgcct gcctgtctta tttcagcaag atagtctccg agctctgaaa ttattttctc 17101 tgcttggtct attctcttat tgatacttgt ggttttattg tgaagttctc atgttgtgtt 17161 tttcagctcc atcaggtcat taatgttctt ctctacactg gttattctgg ttaacaactc 17221 ctataatgtt ttatggtggt ccttagcttc tttgtattgg gtaagaacat tctcctttag 17281 ctcagttaag tttgttatta cccaccttct gaagcttact tctgtcaatt catccatttc 17341 agcttccgca tcagttctgt gcccttgctg gagaggtgtt gcaatcattt ggaggataat 17401 aggcatttgg gcttttggaa ttttctgtga ttttactttg attctttctc attttcatga 17461 gtttctctaa ttttgatctt tgaggctgtt aactttggat gtagtttttg tggggacatt 17521 ttgtgaatgt tgtcactttc tgtttgtttg tttttctttt aacagtgagg ttcctcttcc 17581 gtagggctgt tgcagttttc tgggtgtcca cttcaggcac tattcacctg gctctcttcc 17641 cccctggaca tgtcacctga catgcagaac agcaaagatg gctgccttct ccttcctctg 17701 ggatccctgt ccctgaggga cacagacctg ttgtagcagg aatgttcctg tataaggtat 17761 ctggcaaacc ctgttggggg gtcccgccta gccaaggggc acagaatcta ggacccactt 17821 aactagcact cgggctgccc cttggtggag acagtgtgct gcactctgag gagcccactc 17881 atctggactg tccagatttt ttagagtcag caggggaaag actaagtctg ctaatacaca 17941 gagaccacag ccacacaggg gctcagtccc agggggatca gaattctgtc cctatacccc 18001 atggctgggg ttgctgaaat tcctgcaggg agtccccatc cagtgagaaa gaatgggtca 18061 agggccagcc taaagaggca gtctggcatg atatgccaca gctggtgtgc tgtagctctg 18121 tgaaattcct gctgggctca aaccatcctg tctccccacc accagcaggg gaaaaattgc 18181 agactggagc ttcagtgatg gctgctgccc ctcctccagg gagctcagtt gtcttaggca 18241 gcaagcagcc acagtgatga tggctgtccc tccccttggg aacgcagtag ccttaggcag 18301 tctccagcca aatagccact gagaatctgc acagctctgt gcttgagacc caaggccctg 18361 gtgacatggg cccacaagtg ggatctcctt atctgtgggt tgcacagatc tatgaaaaaa 18421 agcaccattt cccatgcagg gtagcacaat cactcaccac ttcccttggc tgggggtggg 18481 agcttccctt gccccatgtg gctctcattt gggctgtcac actaccctgc tctcactgac 18541 tgtctgtggg tcacaccaac tacctaatta gttccagtga gagaagctgg atatctcagt 18601 tactggtgca ggattcactc gcccttttgg ttcttcttgg tgggagcctc caaccacagc 18661 tctttctagt cgaccatcct gtccagaata tctgactttt taataatagt cactctgact 18721 ggtgtgagat gatatctcct tgtggttttg atttgtattt ctctaatgat tactgatgtt 18781 gagcagtttt ttatatactt gttggtggca tgcatgtctt aattgtgaaa gtgtctgttc 18841 atgttatttg cacacttttt aatggagttg tttgtttctt gcttgttagt tgtttacatt 18901 ctttaaagat tctggatatt agatgtttgt tggatgcata gtttgcaaat agattctccc 18961 attatgtgtg ttgtctgttt actctgatga tagtttattt tgctgtgcag aacttcttta 19021 gtttaattac atcctatttg tcaatattta tttttggccg ggtgcagtgg ctcacgcctg 19081 taatccctgc actttgagag gtcgagatgg gtggatcacg aggtcaggag atcaagacca 19141 tcctggccaa catggtggaa ccccgtctct actaaaatac aaaaaattag ccgggcgggg 19201 tggcacactc ctgtagtccc agctacttac tcacgaggct gagccagggg aatcgcttga 19261 acctgggtgg cagaggttgc agtgagccga gatggtgcca ctgcactcca gcctggcgac 19321 agagcaagac acactctctc tctctctctc tctctctata tatatatata tatatatatg 19381 cttttggcct cttctccata aaatatttgc catggcttat gatgagaatg gtatttccta 19441 ggttatcttt caggaatttt ataggtttaa gttgtgcatt taggtcttta attccccttt 19501 gtgtatggtg taaggaaggg gtctagtttt agtcttctgc atatgactag cctgttatcc 19561 cacgtcgttc agcttttcta ctactccata ataacatcag ctgctggaac atcatttcaa 19621 ttaaaaacct tgaatggatc agattttgac acataatata attgggatat tttagtaagt 19681 gaaataaata atctgcatta acttttaaac acattcatta acctttttgt aaagagaaag 19741 gaaatgaaca gctttgaggt atttggaata gtattgtttt cgaaaaacat ttcaaagaca 19801 gagttttaat tcagaaattt ttacatatct agacctccag ctttggttac caattttctt 19861 ctcttttact tctgggaaaa tcaacacata ggacttttag tgatgttgtc acatttatat 19921 ccataaaaat agtggcatta aaaaaattgt attacattaa acggaattta ctttatgtct 19981 atttaaaaca atgataagta tggtgagatg atgtgcatct taattagttt gaatgaatct 20041 ttctgtaatg tatatgtaga tcaaaacatc acgttgtact ctgtaaatat ccacaattat 20101 tgacaattaa aaatacataa aaataaatat gaaaaacaaa acaacatttt ctaaaggtag 20161 ccatgaaacc cagttttact gagtgaagag gattctcata cttgaagcat ttgagataca 20221 ttacataatt tttcctcata aattgtcttt agaataatat tttaatacat acaaatgtca 20281 aacaattttg tttcaactgt tagtgaggaa aaatgaactg aacttataaa ttgatttttc 20341 tatcataaat atatgatatt agattctaaa ttgtacacat tatggttaat aagtttggct 20401 aacaagtagt atttacttaa aatttaagca ccacaggcag taaaaacttg taatgtcttc 20461 atattttcaa tcaaatgttt acacataaac attacataca gagccctcag aaataatgct 20521 gcatatctac aactgtctga tctttgacaa atctgacaaa aacaagaaat ggggaaagga 20581 ttccctattt aataaatgct gctgggaaaa ctggctagcc atatgtagaa agctaaaact 20641 ggatccctcc cttacacctt aaacaaaact taattcaaga tggattaaag acttaaatgt 20701 tagacctaaa accataaaaa ccctagaaga aaacctaggc aataccattc aggacataag 20761 catgggcaag gacttcatgt ctaaaacacc aaaagcaatg gcaacaaaag ccaaaattga 20821 caaatgggat ctaattaaac tgaagagctt ctgcacagca aaagaaacta ccatcagagt 20881 gaacaggcag cctacagaat gggagaaaat ttttgcaatc tactcatctg ataaagggct 20941 aatatccaga atctacaata aactctaaca aatttacaag aaaaaaacaa acaaccccat 21001 caaaaagtgg gcaaaggata tgaacagaca cttctagaaa gaagacattt atgcagccaa 21061 gagacacatg aaaaaatgtt catcatcact gaccatcaga gaaatgcaaa tcaaaaccac 21121 tatgagatac catctcacac cagttagaat ggcaatcatt aaaaagtcag gaaacaacaa 21181 gtgctggaga ggatatggag aaataggaac aattttacac tgctggtggg actgtaaact 21241 agttcaacct ttgtggaagt cagtgtggca attcctcagg gatctagaac tagaaatacc 21301 atttgaccca gccatcccat tactgggtat atacccaaag gattataaaa catgctgctt 21361 taaagacaca tgcacacgca tgtttattgc gagactattc acaatagcaa agacttggaa 21421 ccaacccaaa tgtccaacaa tgatagacta cattaagaaa atgtggcaca tataaataat 21481 ggaatactat gcagccataa aaaaggatga gttcatgtcc tttgtaggga catggatgaa 21541 attggaaatc atcattctca gtaaactatc gcaagaacaa aaaaccaaac accacatatt 21601 ctcactcata gatgggaata gaacaatgag aacacatgga cacaggaagg ggaacatcac 21661 acaccagggc ctgttgtggg gtgaggggaa gggggaggga tagcagtagg agatacgcct 21721 aatgttaaat gacgagttaa tgggtgcagc acaccaacat ggcacatgta tacatatgta 21781 acaaacctgc acgttgagca catgtaccct aaaacttaaa gtataatttt aaaaaagtac 21841 atacatttat gagagtttta aaacctcatt ctgctctatg tgaaagctaa ccatttaatg 21901 aaagtacgtt tctagttgtg aaattttcaa ccatttttac atgcagacaa atgtgggagg 21961 gagctaatct gagaaattcc tattaaatga tcagcactga aataccttca gcacagtgca 22021 ctgccaaaca agcatataag attttaaaaa gtgaataaaa caacacctgt gagactctga 22081 tattggaaat aacactttcc aaaatgactc ctttaagtgc atggaatcct ttacttagca 22141 gcaacattga cctattttgt aatgatatct tctaattgaa taattattta aatcgataga 22201 gattttaagg cagagcaatg ctctacagag aagttcaaag ggaaataata agataataga 22261 gaagaacaca aaataaagtt accaagagca aaatgttaaa gcagttagtg tttggcttag 22321 atgtcctctg catcttttgc ttttctttcc agtaccttct ttccatgcca gagttatgga 22381 gctcttacgt tgatcaggcc tacttgtttt ttttgttata cttcaagttc tagggtacgt 22441 gtgcacaaca tgcaggtttg ttacatatgt atacatgtgc catgttggtg tgctgcaccc 22501 attaactcat catttacatt aggtatatct cctagtgcta accctccccc ctcctcccac 22561 cccacaactg gccccggtgt gtgatgttcc ccttcctgtg tccatgtgtt ctcattgttc 22621 aattcccact taagagtgag aacatgtggt gcttggtttt ctgtccttgt gatagtttgc 22681 tgagaatgat ggtttccagc ttcatccatg tccctacaaa ggacatgaac tcatcatttt 22741 ttatggctgc atagtattcc gtggcatata tgtgccacat tttcttaatc cagtctttca 22801 ttgatgaaca tttgggttgg ttccaagtct ttgctattgt gaatagtgac caggcctacc 22861 tttaagggga aaggcaatct gatgaaacca ttcacagcaa atagttaata actgcaagga 22921 ggcttcaaca ctttaccatt caccatcccc gatatctaaa ttatagatac tcgacataga 22981 aaagaatctt gagagtcatt ttattcagtg tttcaaaaaa ctttgattcc actggccagc 23041 aatttatact ttaattttca aatgtttata agaagactaa cattgttgcc atctttttgg 23101 ttaaataaaa acatttttaa agtttttcat ctaacatcat ttttaaagac atttcataac 23161 taaaaaagta aataaattta tcttcattaa actttatata ttttacttgt ttttttaatt 23221 atgccattac ctgatttttc cctccaacat ccctacatac agttatcttg agatgtttga 23281 acatcacaag agacatagac cgtccattta atttagggtc agctgttact cttagaaaat 23341 tcctgtttta atatagacaa catttgactc cctttaactt ccatgtattg atgtaactct 23401 ccctgttgga gtcataaaga tcaacttgaa tccttcttct atataatagt tctttgaatg 23461 tttgaagacc tctattctgc tttaatcccc caacccccag cctgcattaa acaaacagaa 23521 ttgtctcttc taaaactaaa tatcctccaa ttttttttca tgtgatgtgg gctttcttca 23581 tttttatcat cttagtcact ctcctactat ttcaagtatt ttttataatg gctgacttga 23641 ctgattgaat gactaaaatt atgtccccct gaatagactt cagtttttca ataactcttt 23701 tcaaacgtgg cactaatatc tggattgaga cctttagagg tcttacatat caaaaagatt 23761 atgatgccct tcctttgcac ctccatgtgt aatgtaaaat aaaacataac tgaaaaccag 23821 atgaaatata ttgcttttcc tgaatgacac aacattacag gaatgctgat aatggaggca 23881 cctaatatta agaacagaac tcgagatgta cattgatgca catggagcaa aataaagctc 23941 tcattaactc ccttacaaac aacaccttga ctacaagaga gctaaaacat gttgccttaa 24001 ccagtaaatc aacagaataa tcaactctgt caaatagttg agtaaatttt tacgactaga 24061 acagttgttt gaatccgtag actctgccag ttgtaacata tgaagactca gtaggattat 24121 tctccaaggt tttattcttt gaacatgcag aataactcac cttcaataac ttcataaagt 24181 tagcaataaa aaagtagcat attctttttg attttaactt ttttatttta atagttttgg 24241 ggatataggt ggtttttggt tacatggaca agtttgttcg tggtgatttc agagattttg 24301 gtgagcccgt tatctgggaa gtgtatgctg tatgcagttt tttatccctc accccccctc 24361 aaccttcgcc ccaagtctcc aaagtccatt atataattcc tatgccttta caccctcaaa 24421 gcttagctct cagttacaag tgagaacata tgatatttgg ttttccactc ctgagctact 24481 tttcttagat ttatgggctc cagctccatc caaggtgctg caaatgccat tattttgttc 24541 ttttttatgg ctgagtagta ttccttggtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 24601 tgtgtatata tacacatata tatacacaca tatatacata cacatatata tgtatatata 24661 tatatacaca aatatatata catatctcac atattcttaa tctgcttgtt ggttgatgtg 24721 catttaggtt agttccatat ttttgaaatt gcagttgtgc tgctataaac atgtgtgcgt 24781 gtgtcttttt catataacat ttttctttgg gtagatgccc agtagtagga ttgctggata 24841 aaatggtagt tctattttta gtcctttaag gaatatccat actgttttcc atagtggttg 24901 aactagtttg cattcccacc agcattgtaa aagtgttccc ttttcacaac atccatgcca 24961 acatctatta ttttttgatt ttaaattatg aataaaagat gcctgtcaga tttgaatagc 25021 tgaaacactg ggacaggagc aaggctatga ggtgaataac tttcctttct ggcctggcag 25081 ggaatctgag gtagcttcca ctctttaccc tgattaaacc tcagtgcatg ttataaagat 25141 ctctctccaa ccaacctcat taaggccaag acctcagcac atcattgggt attaatctac 25201 ccacctgcct tagctacaac tgccttagct acccagggat atctccctta ttggcctgaa 25261 gcctgaattg tcaactcagt aaataaagta ctgggtaaag tgcaaataaa taaagtgtgc 25321 accactagag aacaaggtac acttcaaggg gacccggcca tttcaacccc gtaagagaca 25381 gtgaactctc cccacaccaa gaatgtaact actacaacta gcatctggta aagccagtgc 25441 acaaagactc tctataacta aggaactcat aaagttttaa cccctaaatg caccaagaat 25501 catattaggc tcaaataaat attgaagtca attcttagag aacaaaaaaa aactttaaaa 25561 atgcagtcca attaaaaata aattcaagaa caatttaaaa aaaatagtct acccaaatga 25621 gaaggaacca gaaaaagtaa ctcaggtaat aagacaaaac aggattccat aactttaatt 25681 ttgattcaac atagtatttg tttaacttac ctctgggcct gctgctggtt ttttatcact 25741 ataaacccat tggtgtggga aaaactattg ttacattgac tacgttttcc tgtttttttt 25801 tttttttaac ttttcaacaa aggttacagt gaataagaaa tagaaaagag tctttccact 25861 tttgcttatg tcactttctg tcttctacaa ctttaaactc ttagaaaaag taccctatac 25921 cattgggttg gtttgtttct ttttctgatt gttttgtttt ctatttcttt atctttttga 25981 aatgacataa acaacatcct tttttgtttt ggggtttttt ttttgcacat actactctta 26041 agaaagtatg tttggagagt atatattaca cttaaagaaa ggcaaaaaca ggaagaaaaa 26101 gttttactct atgcaaatac tataaaaatt ttcaaatgaa gggatgctga atattatcaa 26161 ctgctttttc agcattaatt gacatgatca catggttttt atcattcgtt ctgttgatat 26221 ggtttatcac atgaattgat tggtgtgtgt tgaaccatcc ttgcatacct ggaataaatt 26281 ctacttggta atgatgaatg atctttttaa tttattattg gattccattt gtcagtattt 26341 ttggggaatt tttatatcta tattcatcag cactgttggc ctgtactttt ctttttctga 26401 tgtatctttg tgtggttttg gtttcaaggc aatactggcc tcatagaatg tgtttggaag 26461 tattttctcc tcctctatgt tttagaatag ttagaacagg gttggtatta gttatttttt 26521 aaatgtttgg ttgaattcag tagtgaagtt attgggtctc aggattttta ttttatggag 26581 aaatttgtat tgtgactttg atctctttac ttgttattgg tctgtttagg ttttggattt 26641 cttccttgtt cagtcttggt atgttattat gtgtccaggc atttgtcaat ttctcctaga 26701 tttgtgaatt tattggcata tagttgctga cagtagccac taatgactct ttcaatttct 26761 gcagtatcag ttgtaatgtc ttctttttca tttctcattt tatttatttg gattttctct 26821 ctttattttt ttctttttta gtcaggttag aggtttgtca gttttgttta acttttcaaa 26881 catccaactt tttgtttcgt tgatcttttg tattgttttc ttcatttaaa ttttatttac 26941 ttcttctttg atctttattt cttttcttct actgattttg agttttgttt gctcttgctt 27001 ttctagttct ttaagatgca ttgaagtttt tttttcctac ttttgatgta ggcacttata 27061 gctataaact tccctgttag tacggttttt gctatatccc ataggttttg gtatgttgtg 27121 tttccattat aatttatttt aaaaaaatca acttctttct catgacccac ttgtcattcg 27181 ggagcatatt gtttaatttc tatgtatttg aatagtttcc aaaattttct tcttattaat 27241 ttctagtttt atttcattgt ggtcaagaaa gatgcttgat tttttttttt caagtttttt 27301 ttaatgtttt aagacttatt ttgtaaccta acatacagtg tatccttgac aatcatttat 27361 gtgctgtgga cagtcatgtg tattttacag ccattggata aaatgttcta taaatattta 27421 ttagatcctt ttggtctgta gtgcagatta agtctgatat tcctttattt ctgttctgga 27481 atatatgtca aatgtcgaaa gtgaggtgtt gaattcttca gctattattg ttttggggct 27541 tctctctctc tttagctcta gtaatatttg ctttatatat ttggtgctcc agtgttgggt 27601 gcatatatat taatgtttta aattgttata ttatcttgct gaatagacca ctttatctca 27661 tatagtgacc tttttgtctc ttcttatagt tttcatcttg aaatctattt tgtctgatat 27721 aagtatagtg actcctgctc ttttttggtt tctattcaga tggattatat ttttacatcc 27781 ctgtattttt agtctttgtg tgtctttata gcaaaagtgt atttcttgta gacaacagat 27841 caatgggtcc tgttttttca tccattcagc cactgtatga gttttgattg gagagtttgg 27901 tcgatttact tccaatgtta ttattgataa gtaaagactt actgttgcca ttttgttatt 27961 tgttttctgg ttgttttgtg gtcttgtctt ccctctttct ttccttcctg tgttcattta 28021 gtgatggtga ttttctctgt tgatatgatt tagtttcttg ctttttattt tatgtgtatc 28081 cattttatgt tttctgcttt gaggtcacca ttaggctttc aaatactatt tcataaagca 28141 ttattttatc ccaataacaa attaacatcg ttttttataa acaaaccaac aagcaggcaa 28201 aaaaaaaaaa ctaataaaaa ctctaagcct taactgcatc cccctacttg ttaacttttt 28261 gttgttgcta tttatatctt atggtatggc ctacatcttg ctaagttgtt gtggttatta 28321 ttttgatttg ctaattgctt agtctttcta cttaggataa gagtggttta cacagcacag 28381 ttatagtgct acagtaatct gtttttgtgt tcttactatt actggtgagt tttgtatgtt 28441 caggtaatta cttgtttctt attaacatca ttttctttct gcttgaggaa ctccctttag 28501 catttattgc aggaaaaccc ttgtgttaat gaaatccctc agcttttgtt tgtcttggaa 28561 agtctttatt tatccatcat gtttgaagga tattttcacc acatacatca ttctagggta 28621 gttttttttt tcttcagcaa tataaatatg tcatgccact ctctcctggt ctgtaaggtt 28681 tccactgaaa aatctattgc cagatgtatt ggagctccat tgtatctatt tgttttttac 28741 ttttgctgct tttagggtcc tttctttatt cttaattttt gggagtttga ttattaaata 28801 ccttgaggta gttttctttg ggttgaatct gcttggtatt ctgaaacctt cttgtacttt 28861 aaaattgata tcatttctta ggtttgggaa gttctctgtt gttatcttat tgaatatact 28921 ttctaccccc tatctctctc tctaccacct cattaagtcc aataaccatt agaattgccc 28981 ttttgaggct attttctagt tccagtagga atgcttcatt gttttttatt tattttactt 29041 ttgtctcctc tgactgtgta ttttcaaata gtctgtcttc aagctgattt ttttctgctt 29101 gatcaattct gctattgaaa ggctatgatg tgttctttaa tatgccaatt gcatttgtca 29161 gctccagaat ttttattgat tcttttttaa tctctttgtt aaacttatct gatagaattc 29221 tgaatgtctt ttctgcatta ctttgaattt ctttgaattt cttcaacaca gctattttga 29281 attctctctc tgaaaggtca catggctctt tttctccagg agtgatccct gctgctttac 29341 ttaattcatt tggtgaggtt atgttttcct ggatggcgtt gatgttaata gatgtttttc 29401 agtgcctggg cattgaagag gtaggtattt attgtaatct tcactctctg ggcttatcag 29461 tacctgtcct tcttggaaag gctttccaga catttacaag gacatgggtg ctgtaattta 29521 agctgtatca gctttgaggg cacccaaagt gcagtaacat tgtggttatt ttagacatgt 29581 agaggtactg cctttatggt catgattgtg gagaattctc tgcattacca ggctgagatt 29641 ctttttctct tcctttactt tcttccacac aaagggagtc tttctattct gagccaccta 29701 aagttgaagg tcaagtgaca caaacacctc ttgggccaac actatgactg cactgggtca 29761 gacctgaagc tagtacaaac gtgggtctca ctcaaggcct gttgtaaacg ctccctgtct 29821 actgcctttt ttgttaaggt cttagggccc tacaatcagc aggtagcaaa tccaaccaga 29881 tttgtgtcct tcccttcagg ttaatgagtt ccctcaggcc ccaggtgtgt ccagagttac 29941 catacggggg tcagggacta gattcaacaa ccttagaatt ctactttgtg ttctattgtg 30001 ctgcagctga gctggcactc aaactacaag atacagacct tccctctctt ccctcccctt 30061 tccaaactta ctttaccaca tagccactgc caccacaggc cacaggatac tcccagctat 30121 caccagtgtt ttcttaaggc ccaagggctc ttaagtcacc ttgtggtgaa tgttacttca 30181 cctgggattc atgcttcagg gcagtgggct ccattttggc ccagagcagg tcccaaatgc 30241 catccaagag tcaaatcctg gaatcaggga ccccaagagc ccacttggcg ctctatctcc 30301 ctgtgtcgat gctggtatct gaagccagca agcctcagaa gctcatccaa agccctggac 30361 atagtacctg ggtattcttg ctggttattc tggtcccaag ggctcttaat ttagcaggtg 30421 acaaattctt ccaggactgg gtccttccct tcaaggcagt aggttccctt ctggccgagg 30481 gtgtgtctag aaatgtcatc caggaactag ggcctggaat gaggccctca tgactctgac 30541 tagtgcccta tccttctgtg agtgagctgg tatccaagat gcaagacaaa ttcctcccaa 30601 ctctttcctc tcctctcctc aagtggaagg aaggggtctc ttttggaggt gcaagctgtg 30661 cagtctagtg ttagaggaag ggtgatgcca acattccctt agctgcccca gctggtgtct 30721 cagtagatgg gatgctcccc catgatccac tgtctctggg cccagttcag ccctaggact 30781 catctaagag tttaagtcct tatagcctaa actgcctttc aattttactt agagattcag 30841 agcagtttat acctcagtgg caaggtttgc aggaactcaa gtttagagca ccgagaacag 30901 taattcccct atggctaggg ctggtttaat ttctccctct gaggacgggt gtcaattgag 30961 ttttctcgaa ttttcttttc tgctctaaca ggacagtact gaatttgctg cctcacaatt 31021 tctatgctct ccctctccca gtgcccagag atgctcttgg catcatgcag ccactgccag 31081 gggatggggg aggggtgaca tcagcaattc aagattgatt attattactc ttctgtgcct 31141 ctttcagcaa tgtgtaattc aaaccaggta ctatgagggc ccacctgatt tttgcaccct 31201 actcccatac cgaccagaag gatctcaacc accaaaaatg ttacaaaatc actgtcactg 31261 tttacctagt gatttgttga gtattatatt attgtagtgt atgttttcat aatgtgcacc 31321 gttaactttg cgcaataatg taaaatatct cttgcctttt tcagagattt accgtctctg 31381 gaagctgtta gaagattgtt agctatttta attagtacag actgaaaggt ctcctgtagt 31441 ttaatctgca tagaaagtca caataatctc tggtaaccag tttggctgct taaaaatgtt 31501 ttttaattgt tacttttgac ctggaaatgc atatagtcat aagctatatt caactaggga 31561 atgtgtgagg aattactata tgatataaaa agaggaaatg taacttcctg acttaaatta 31621 aatctgatga agttagtagc catgttatta gagacaagat aaatgtccag aattcattat 31681 tttgtttcaa acttagttta tcgtaaatct tccccatgac atataataac ttttgtactg 31741 tttttcaagt tatttgatag caagttcaac aatgtaggtg aatgacaatg tgagcagcac 31801 gataatgact cttcaggaaa atcagccttt agggcacaag gttagaacta catgggcttt 31861 tattcaaata atatatatat atatatatat attcaaaaat ttatgattga tctcataaga 31921 ccatcagcta atatactgtg tggttaaagg cacaagctga ttatatataa caaaatttat 31981 attatgtggt cttgttatgt tacagaacta tgtattaaaa gaaatggcag tccaatttga 32041 tctggtataa aaagaaaaaa tagttagaaa gcacagaaac tagttttaat ttttccctat 32101 acaaaagaaa tatcaaatca cgaatatgaa cagcactagc tcaaaggttt gctgaaggta 32161 aatcagttga gatatattaa tttttccaca tattttatcg atggacatta ttatgtagta 32221 ttgcttttct acaggaagat acattaggct gaaatttaaa atttggtagg taactgatta 32281 tttcccacta tgttaaaaat gtttacactg gaaatgtgta gacagaaatt tttatttaaa 32341 catacatatt attcttctca aatgccagag acagcctttc caagtcctta actagggaga 32401 tgctgacaaa ctgccaccca cccttgtata gtagtataac aactcccaca acatgaccct 32461 tcctcttgag ttgttgctgt gcattggaac attgagtagg ttaatttagt ctctcttttc 32521 actagggttt gcagatttac aactttaata tttccgtttg ttgcaagcac atacatgatt 32581 acttgaaaaa tttgcatgag gaaggataat gcatttttga gggggttaaa atgattgtgt 32641 taaagaacag tgccaaaatt catctttaaa aacccctgag attgtagaga taatcatata 32701 tgacttgagc cattaggtac gtcgttatgc ttgtcaaata ctctattttg ctcctaataa 32761 tacattcaca tatccctttc tgtacattga aaggctgact cagaacacat ccctggtcac 32821 tgaaataaaa attaggcttt actttcaaat catcttcttg tgttttaaac aataccaccc 32881 acatctctca catttcatta catactctaa catatgtctt cacggcaaag aagcagaaat 32941 ttcttggtag attttatcac acatataaag agaatcatgt gactgaaaaa gaccatagaa 33001 gtcaccaata aaaagagaat ggtggtttta tacatatgtt gggaagttac tcatttccaa 33061 cattttcagt taacaatttg actatgttct tttcatttcc ttcacaggga gggttagcca 33121 ttaaaagaat agaaacagga attaatgcaa gtgccataca atacatgtag tataaataca 33181 gcacaaacat gaatcctaaa ataaacaaca ctcatttaat agcttagcat gagacaagca 33241 gtgtaagaat aaaactataa tattaccact taagctctct attcttaccc agtattaaat 33301 aaattgttat ttaagagtta tttaaagaaa atttaaattt gaatggaaaa ctgtgtaata 33361 aacatgtaaa acattactga taataccata atactaaact accactaaat gatttatgaa 33421 ccattatttg catgcattaa attttacctt acaactttct gataagaata aacatattgt 33481 gatgaattgt aaatcaatag attgctaaaa attgttcttg aatgttcaaa tattaaaaat 33541 aaattaataa ctattttcaa tagtgataat gggctctcat tactgatgaa gtttattatt 33601 tatccttaag taattggttt aaaatgtgaa tttgtagtat ctgctgaatt atgtaatatg 33661 aaaacattta taaatcatta tgtaagaatc ctagtaagca tgaagcttct gagagctagt 33721 ttaacaaaag tattcacaga tatacagaca gcaggtctat tcttggttct tcctgtgtga 33781 tattttagaa atgtcattat caattttctg tttattattg tgctttgttt aaagggaatt 33841 tgccaagtga tccctggact ccaatcttta ggcctattgt caagacaaat taatttccgt 33901 ggaaacccac tatggctcag atactaccta ggaaatgcag gataaatgga gtctgtatac 33961 aaatatctct gtttagcttc ttggggttgg gggaacaggg aggaacacct ggaagaagta 34021 gaaattttac catatttacc atttaccatt ctcatatttt gacgtaaaaa gcatagtata 34081 ttaggaggca atatgcaaca acactcttcc cagaacttct tggactagcg ttacgtgcat 34141 aggaatcacc ggggaaattt gttaacatta agtttctgag cagtaggcct ttgatgcact 34201 tggaatactg cctttataac aagtgatgaa tgacttctca tccaagagac acacttcgaa 34261 gagcaagggc atatatactc atcccttggt atacgcaggg attggttcca ggaaccccgt 34321 gtgtacccaa atccatgcaa tactcaagtc ctccagttgg acctgtgagg tggcctttct 34381 ctatatggga gttatgcctc atgcagaatt ctgtaattgt tgataccggc tttgttgaag 34441 aaagtccaca tataagttgc tccatgtgta agtgggcatg agcagttcaa atgtgtgttg 34501 ttcaaaggtc aactaaacat atatatgtaa ctacacacac acacacacac acacacacac 34561 atttaacctt catgagtctc agttttctaa ttggtccaat tgaactaaca acttctgccc 34621 tatgtcctca aaagtttatt gggaagatca aaataaatat ttgggaattt gttttataca 34681 aatttaaagc aaaggtttcc ttttaagcaa gaatcattat ttatgattga caggtattag 34741 ctgagtggct accatgtgtt cagcactgtg ctagatgctt taagaccaaa ggaataaatg 34801 atccctattc taagagttta taatccagct atatttaaaa ctagtccaaa gaagtgcagt 34861 attaatttgt acttcattta caattttttc tcactacatc cagtacgtag taactatttt 34921 attacactat gtcatgctgg taaatattta tcaaccagtt ctctttaaaa ggtatgaata 34981 tattaattat atatatatgt gaattataga tttttctgac ataaaggact tgaggcacat 35041 aatttacaaa caataataaa atacacaata tgctttatgt aaattccaca taattggttg 35101 cttctcagaa aacactttgg ttgatttttg gcaacatttt tttaactata gtcaaactat 35161 gtttgtaagt gatgaacaag tgtaattcaa gcatgaatct tggatgatat ttttgtttat 35221 gttaatgaga aagatgaaag tgaaacaaca aagtcatgtg aaagaacttc actcatttat 35281 cattacttga atgacttact ttctgagtta gatgatcgtt ttcaaacatt aaaacaatag 35341 tattttcaat tttgggggct actcaacaat attgcagcta caaacatgct acacttgtga 35401 atttaatctg aattattaac attttctcca acattttctc caaattaaac caagagttgg 35461 caatcctttt atgtagaggg atagatggga aatatttggg gctttacaat ccatatagtc 35521 tctgcttcaa ctactcaact ctactgttgc agcactgcag ctggaactgg ttatgctcca 35581 gtcaaatttt atttactaaa tcagatggca agctagattt ggcccattaa ctgaagtttg 35641 ctgaccctta ggttaaagaa tcaagaaaac aataaatcat accctgattt gtagctgttg 35701 cctatttcct tagtgcaaat acttggaaag agttgtacag tggcatatca ttataaagta 35761 ttcccattat aaaattctct cattgtaaag tattcctatt gtgtaaatac aattaccatt 35821 atgtaactat gatagctgta aataacttca agagcataca taatagtaaa atgtagtaaa 35881 ataattaaga agtgataagt acttgttatc tttgttttta ttataattta tttaattgta 35941 agtttacaaa atttaccttt aatgatggct ctgtttaaca tctggctcaa aaaattcttg 36001 aaattttaac tcagctcttg ttagccaata acagccagct ccaggatgcc attaattaac 36061 agaaatggat ttaattgcac atcttcacct ctcaaccact tgggttgata ccacatggat 36121 gagggcttga gtggggaaaa aattttgggt tttcctactt ctacaccact ctcctcacct 36181 catcaccaac ttcctgcttc cttgagctaa aaagagaaaa taaatattga agaaggggaa 36241 atggagaaaa agaaagaagc atagatgaca gggaaagtag tgaaagctgg cagaagtagt 36301 gaaagcctta cataccaaat tttctattct atcactagca taaaaataaa ataatggacc 36361 tcgtatggtg gctcgcacct gtcatcccag cactttggga ggctgaagca ggaggattac 36421 ttgaggccag gagtttgaga ctagagtggg caacatagtg agactccata gaaggacact 36481 ctattcaaca aatggtgctg ggataattga caagccacat gcagaagaat gaaactggat 36541 cctcatctct caccttatac aaaactcaag ataaatcaaa gacttaaatc taagacctga 36601 aaccataaat atgctagaag gtaatatcaa aaaaactctt ctagacattg gcttagcaaa 36661 gaattaatga tcaagaatgc aaaagcaaat ccaataaaaa caaaataaat gagacaatta 36721 aaccaaaaag cttttgcaaa aacaaaataa taataataat cagcagagta aatagacaac 36781 ccacagtaca agctttgcat gcgaaaaaga ttaacatcca gaatctacaa ggaattcaaa 36841 caaatcagta agaaaaaaca tataacctca acaaaaagtg gacaaaggac atgaatagac 36901 aattctcaaa agaagataca ctaatggcca acaaacatag gaaataatgc tcaatatcac 36961 taactatcag gaaaattcaa atttaaacta caatgagata ccaccttgct cctgcaaaaa 37021 tggctgcaat aaaaaaatta cagatttagt tgtggatgtg gtgaaaaggg aacactttca 37081 cactgctggt gggaatgtaa gctagtacaa ccactgtgga aaacggtatg gagtttcctt 37141 aaagaactaa atgtagaaat accatttgat tcagctatcc cacgactggg tatctaccca 37201 aaggaaaata aatcattata tgaaaaagtc actcatgcat gttcacagca gcacaattcg 37261 caagtacaaa aatatggagc caacctaatt gtccatcaac caatgagtgg ataaagaaaa 37321 cgtggtacac acacacaaac acacacacac gcacactatg gaatactact caaccatata 37381 aagaaaggaa atattgacat ttgcaacaac atggatgatg ttgggaacca ttattctaag 37441 ccaagtaact caggaataga aagcgaagta ttgtatgatc tcacttataa gtgggagcta 37501 agctatgagg atgcaaagga ataagaatga tataatgaac tttggggact tgggggaaag 37561 gttgggagag ggctgagaga tagaagacta cacagtggta cagtgtacac tgctcaggtg 37621 atggatgcaa caaaatgtca gaaatcacca ctaaagaagg ggggcgggat agcattagga 37681 gatataccta atgtaaatga cgagttaatg ggtgcagcgc accaacatgg cacatgtata 37741 catatgtaac aaacctgcat gttgtgcaca tgtaccctag aacttaaagt ataaaaaaaa 37801 atatgtatat aaagaaaaga aatataatat ccaagaaaaa tagaaaaaaa aaaagaactt 37861 atccatgtaa ccaaaaacta cctgttcccc ccaaactatt gaaatgaatt ttaaaaaata 37921 ttagccgggt atagtcttat agtcctagct acttgggaag gtgaggcagg gtaattactt 37981 gagtccagga gttcaaggct gcggtgagtt atgatcacac cactgcactc cagcctgggt 38041 gacagagtca gaccctgttt caaaataaat aaataaataa ataaataata taatgagctg 38101 ataccatgag tattcataag tgtataagtg aggcatgaaa aacgctatcc ctcccccaga 38161 aatggtcttc aaaatggagg tattttgcat taaatgggaa aattgaagaa gtaagaaatg 38221 gtaactaaat aggttggtac aaacccaagt agatcataat gagacgtgga aagatgaaga 38281 ggaaagaaat ggcaccatta tcttcaggtc aaaggctgta tataaatcaa atcttacacc 38341 acaagatgtg aagagcgcat tcttaaacta ttaagggttc tttgttagaa ccttcttttg 38401 tttccacata cagcactgtt gtatgttgca gtcacataac actaccttgg acagatgaac 38461 ttataagtgg aatgatcctc accaaatcag tcctactgga aaatatttaa atttcatctg 38521 tgttctttac aacttaactt gaaggaaagg aacaacccat gggtttgtac cagaagtttg 38581 tctggtacaa acttctctgt gactgaatat ttttgaacca cattcatcat caaaagaaga 38641 aaatgttcac acacgatgtc ttctttctgc atgaagcctc tttgacataa ataaaaacct 38701 ctaaactcca gaaataatta tgagcatata gtttaaccaa gtagagaatt ctggcaatat 38761 aggatatgta ataaagtgaa atagcagtag tcaaaatgga tttaaaagaa aataatatat 38821 acaatatata gcatatataa taaataaatt acacttgatc ttagccaaaa ggtcaagaaa 38881 tgatgtgtaa tatacaaatt agattatttt atttatgtaa ttaaattata tattttaata 38941 cattttctca ctacccttca gattacatcc cctgctctgt gggaaacaaa cgagtctaaa 39001 gagtgctttg gcacaccaaa agctacgcat gttgtatcca gtcataactt ggaacactca 39061 ctaaacgtat acaaagggtg tgggtcatag actcctgtat aattttgctg aattgttgtt 39121 tctgagaagc caatttcaaa gctaaggaaa atatcataaa actatttgga gtcttttaaa 39181 gtttacgaat agcatttaca aaggcaatga ggaacggaag gccaaattat caaacaacat 39241 acgaaagcaa ggttaatttt accttgtaat aaaccccatc aggtgattta ttttgctttg 39301 aattaaaata tccatacatg aacacaataa tcagataaca atgaatcatt tccgttacct 39361 ccccacaatt cagaagggag agtcatcttc tcctaagggg cattaagttc agactcactt 39421 catctcttgg tgaagcaatt tccccctaga tgacttacaa tcatctggcc tttctgatcc 39481 ttcatgacat tctgcacact gcagtcagaa atattttttc taaaggcaag ttttcttaca 39541 gcactcttct gatgaaaatc tttcaacaac tccctattgc ttattggtta atgcccaagt 39601 tctttagcga aaagcaggcc aaaactactc tcctagaatc atcatttcct tcctccatct 39661 cttttgtggt ctcactcctc tcttctttgg ttgatggatg tctgtttctc tctcattcaa 39721 ggcccaggtc taatgacacc tccattatga atcttcttct gtcttctacc aaatgaatta 39781 gccactctct cctgggttcc cataaatctt tacccatgca atgtgcagga ttgggagaga 39841 gagggaggga agaagtgtgc aggtagagca atggatttca ccagagaggt aactacgggg 39901 ctgcgaaaca taggccctac ttcaaacaaa gcatattaaa tcttaaaatt tgtatacatt 39961 ctgaataaga ttttatttta aaaagtattc tccctcaaca aaagaagcat aataaaatgc 40021 aggtataggg ctgagcgcgg tggctcatgc ttgtaatccc agcactttgg gaggccaagg 40081 cgggcggatc acgaggtcag gatatcgaga tcatccttgc taacacgggt gaaaccccgt 40141 ctctagtaaa aatacaaaaa attagccggg cgtggtggcg ggcgcctgta gtcccagcta 40201 ctcgggaggc tgacgcagga gaatggcgtg aacccgggga ggcggagctc gcagtgagcc 40261 gagatcgcgc cactgcactc cagcctgggc gacagagcga gactccgtct caaaaaaaaa 40321 aaattaaaaa aaaatgcaag tatagtggaa aaagcattat ctttggagct agacagagat 40381 aggttttaag caatcactct tctactttac tgaatgtgtg gctttgagca aatcaattat 40441 cctacctact tacctaacac cattatctgt cacctacttc ctttcaactg atatgccaat 40501 tgagcagagt aattagagga tgatcttact gcgctacctt catgtctaac tcttactgat 40561 acgccaagac tcagttagtg ttacttccac tgtgaggtct aacactagat ggagcgcaat 40621 actttataca aacctctatc atgttttgtc attatttgca tgtcttgaac ccagagctta 40681 ttaagcccta atgtaatcct aatagttgtt ataatattaa aatatggccg ggtgtggtga 40741 cttatgactg taatctcagt acattgggag gcagaggcag gcaaattgct tcagcccagg 40801 agtttgagac cagcctgggc aacatggcaa aaccccattt acacaaaaaa atacaaaaaa 40861 aattggcctg gtatggttgt gcatgcctgt agtcccagct acttgggagg ctgaggtaag 40921 ggtatcacct gagcctggga agttgaggct ggagtaagtt gtgatcatgc cactgaactt 40981 cagcctaggg gacagagtga ggccctgtct caaataaata aataaatata ataaaatact 41041 tccagctgtg tgaggtggtg catgcctatt gtcccagcta cttgggaggc tgaggcagga 41101 gaatttcttg agcccaagaa tttgacccag cctgggcaac acagcaagac cctgtctcta 41161 aataaataaa ataaaaataa aaataaaata atgctgtaca ggttggtaaa tatccagtgt 41221 cctaaatctc acctcatatg cctcctagca cttttaggac atccctccaa aaaaagcctc 41281 cagtgaccag gcaactaaat gattaatatc taaaacagca agggtcctcc agaactctgc 41341 catagctaag gcttgtgttc atttcaatcc atcagttcct gtgcattcct ggttgttaat 41401 atttgaatat agttgttgtt catctcccta atagaagatg aatggcctga gagcagaaat 41461 tgggttattt acccttgtgt ttttgaaggt aagccttcta ctgttccagg tgttaattat 41521 ttagttaatc tttcttttca ttactcctct cctctctgat attaactact ctatagcttc 41581 aagacatcct cagtgacctc atattctcct tttgaggtgt cacggggtct ttacctgggc 41641 ctaacctact tggacctata attggatctt attcttcctc tatccttaca tgcaaaaact 41701 aatccctttt ctcagaagca cacaaaaaga atcataaaat ttattgtatt tttaagaaga 41761 aaaaatgcaa ttgagtatgt atagttagct ataaactata ctaaatatta gcatggttta 41821 caaatgcata tcacataaca cagatgctct tcaattatta atcagtaggt gctgttttat 41881 tcatctgtcc ttttgaataa ctcatttcaa aaatgttata acagtaaata aatatattta 41941 gacattttag gagtccttac cactcttacc ttcaccccaa tatattgtcc aaaacattac 42001 cacaattact atatatagca cctgctgtat tttctgaaat gattattcca ctttatatat 42061 taaacacaat tgaatataat tcttaaaatt ctgcgacaac caaaatcacc ttggaaagag 42121 agctgccatg aatgaaaaaa atgataattt ctttcattta atgtccagat atatgacata 42181 gtatgaaaga agtgtgtttc taaacaaaac ttctattact aatctagaaa cttgaagtag 42241 aatttgattt ttgtccaaaa gatactatta ttacagtctc agaagattct agtggatgct 42301 agtttccact acagaatcat tctaatatat tttatatcat tagtagtcac agctatctag 42361 agttgtatca gaccaaacgc aaagtaaaat atgatgggat aatagtagta agatgttcat 42421 tatttggaac atacttgcta tgttgttttt agttaattga ttcattcatt tatcaaatat 42481 tcattaaatg tcagtgccac atgctaaaaa tgcaaagtat agtacccaat attaaagtgt 42541 tcacattagc aaggaaaaat tgttattttt aaaaaacata tgtttctgaa agtcaatgta 42601 ttactaagac tcacattcca tagcacagtc aagaagcagg aaaaagataa aaataaaatc 42661 ctctgaagac acgtgctttt tataagggca ggcaagttga tttaacttaa ttttctctgg 42721 cttttgattt tcacaaagct agaaaaaaca ttttcagtgt tcttaaaatc ataaaaaaaa 42781 attggacgat aatgagaaat ttctggtttg agatatggca agggaaaaga accaaagcta 42841 gttttaccct tgaggtattt tcagatctag aataagcagc tgaaaaactg aactgagtat 42901 ttagttgtct cccaacatta gacagaaaac aattcgaagt gcagagtctg caaaggatgg 42961 aggcccaaca tattttgctt taatttggat taaaaaagta aggggttaaa cactaggtgc 43021 aagggcaaga cagaaggaaa ctaaccttca caaaattgca gcctgaattt gaggtccaga 43081 aaacaataag ccttgaactt gctgacttgc tattgtcccc gggaacctaa caaaagcaaa 43141 ctgacatcat ctttggagaa agatagaatc atcccaggct tcaaattatt tgtacaaaat 43201 gcagttaatg tgcagaacac atatagagac agctaaacac acacacaaaa agacctcaca 43261 tgctagaaac agaagaagaa aacagaaaag aaagacacaa aaggatttca gatattggta 43321 ttatgcacac atccggtaaa acaactatgc atattttatt caaggaaata accatcaagc 43381 ttgaaacttt tagcaggaaa gtgaaacaga actgatataa acaaatacaa accaaaattc 43441 tggacttgaa aaatactatg ataaattaag gactgaatga ttgggctaca caataaatca 43501 tacacagctg aatataagct tagtgaactg gagaaaaagt cagaagaaat tatgcaaaat 43561 gaattcttaa gagacattat gataaaaaat atgaaagaga tgatgataca taaaaaatac 43621 aatctgatac tgttaattgg agtcccacca ggagggaagg gaaaaaatat acatagtcaa 43681 tgctagaaaa aactatttct ggtaattcat aatttcaata cctccaaaaa attccaagca 43741 gaaaaattaa aagaaatcca cacaccatag agaatctatg aaaaaccaaa aataaatata 43801 ttttaatatt tgtcagaata taaaacagtt taccttcaaa gaaacagcag ttaagaagac 43861 atctgatttc tttttttttt tttttgaggt aggggtccta tttcattgtt ttcattttac 43921 ttttttatta tactttaagt tctagggtac atgtgcacaa catgctggtt tgttacatat 43981 gtatacatat gccatgttgg tgtgctgcac ccattaactc atcgtttata ttaagtatat 44041 ctcctaatgc tatccctccc ccctcccccc acccctcaac aggccccggt gtgtgatgtt 44101 ccccttcctg tgtccaaatg ttctcatttt tcaattccca ccaatgagtg agaacatgcg 44161 gtgtttggtt ttctgtcctt gcaataattt gctgagaatg atggtttcca gcttcatcca 44221 tgtccctaca aaggacatga actcatcctt ttttatggct gcatagtatt ccatggtgta 44281 tgtgtgccac attttcttaa tccagtctat cattgttgga catttgggtt ggttccaagt 44341 ctttgctatt gtgaatagtg ctgcaataaa catacatgtg catgtgtcta tatcagcatg 44401 atttatagtc ctttgggtat atacccagta ataggatggc tgggtcaaat ggcatttctt 44461 gttctagatc cctgaggaat cgccacactg aaatacatct gatttcttaa tagcttcagt 44521 ggaagacaaa agacaaggga acattatctt caatgtactg ttaggaaata actgccaacc 44581 tattcaagga aaatgtcctt caagaataaa gacaagatag gccaggcgtg gtggctcacg 44641 cctgtaatcc cagcactttg ggaggctgag gcgggtggat catgaggtca ggagatcgag 44701 gccatccggt taacatggtg aaaccccgtc tctatgaaaa atacaaaaaa aaaaaataac 44761 cgggcatggt ggtgggcacc tgtagtccca gctacttggg aggctgaagc agaagaatgg 44821 catgaacctg ggagatggag cttgcagtga gctgagattg cgccactgca ctccagcctg 44881 ggcgacagag caagactccg tatcaaaaaa aataaataat aaaaaaaaga atgaagacaa 44941 gataaattac aatttcaggc aacgactgag aatgctcacc acaagaaacc atacactaaa 45001 gggaatacta aaatattcat cagttagaaa taaaaaaaaa attaaataag taatcacagt 45061 tatgggaaaa aatgaaaatc ttaaatatgt aggaaaatct atgttaatat tcgatgtata 45121 agaatatgat atcttgtgct aatatatgta tatgtggaga gagggagaga gagagtaaaa 45181 acgagagaag agaaagagga aaaagagaag gattaaaata tagtacacag cattagagtc 45241 aggaacagag gaaaaaaaca gttaaaatgt tataaattcc tcatattgtc ctggataagg 45301 gtagtatata ttatcattca gttttcataa tgtaaagata tatatttttt cctttatttc 45361 ttctaaaaaa gcaggataca tgaacagaac atgcaggttt gttacatagg tatacgtgtg 45421 ccatggtggt ttgctgcacc tattgacccg tcttctaagt accctccctt caccctgaca 45481 cacccccctc accccccacc ccccaacagg ccctgctgtg tgttgtttcc ctccccatgc 45541 ccatgtgttt tcaatgttca gctcccacct ataagtgaga acatgaggtg tttggttttc 45601 tgttcctgtg ttagtttgct gaggatgatg gcttccagct tcatccatat ccctgcaaag 45661 gaaatgaact cattttttat gactgcatag tattccatgg tgtactcata ccacattttc 45721 attatccagt ctatcattga tgggcattta ggttgattcc atgtctttgc tattgtgaat 45781 ggtactgcaa tgaacataca cgtacatgta tctttataat agaatgattt atattccttt 45841 gggtatatat ccagtaatag gactgctcag tcaaatggta tttctggttc tagatcctta 45901 agcaatcacc acactgtctt ccacaatggt tgaactaatt tacattccca cagacagtgt 45961 aaaagtattc ctatttcttc acagcctctc cagcatcagt ttcctgactt tttaataatc 46021 gccattctga ctggcatgag atggtatctc aatgtggttt tgatttgcat ttctctgatg 46081 aatagtgaag ttgacctttt tttatatgtt tgttggccgt atgaatgtct tcttttgaga 46141 agtgtctgtt cgtgtccttt gcccactttt tgatggggtt gtttgttttt ttcttgtaaa 46201 agacatgtat aatacatttt aaagtaacta aatgaagtgt ttatgacttc caaacaaaaa 46261 aaagggaatg taaataataa caaaaattac ttaatacatt aaaaatgaag catttaaaga 46321 cagaaaaaga aagtagaaaa gataaggcat taaaagcaca acataagact gtagatttaa 46381 atagcaaatg tatcattaat cacattaaac ataaatggac aaaatgctat catttaaaaa 46441 aattgtcaca ctggaaaaaa aagaaagctc aaattaatat taactgcaaa taacagattt 46501 aaaacataaa gatacaaaaa tgctaaaata aagagacaaa aagaccatta aagcaggaaa 46561 taaaataaag cagatttggc tataaaaaca tcagccaaat ataaatgata gggcaaaaag 46621 atgactagaa acagagatca aaatattatg ataaaacatt tatttcatca ggatgaaata 46681 aaagttatct gtgagtgtac acctaatacc ataaagcaaa acttgatagt gcaaattcac 46741 aatcataata gaagacttta acaaacttct ctgtttgtta cagagataga gagacaaaaa 46801 aatcattaag aatatagatt taaataacac aattagcaaa cttaatgcag atatgtttct 46861 atctatgcat acaaaactgc agagtgctca tttttttcaa gaatatatta aatacttgga 46921 aataaaaaga ccgcatttgt ggccccaaag ctaacattaa cacatataaa agaactaaaa 46981 taatacaaaa tacattctca tgtcacaatg caagtgtgct tgaaaccaat aacaaacaaa 47041 aaaatttcca tatgtatgga aattaagaat atacctttaa atacttaaat agcagaagat 47101 agataatgaa aaaagtttta actgaatggt aataaagtta acaaatatca gaacttgtga 47161 aatgcagctg atgcagaaca tgataaaaat aaagagcctt acaattttat gttagaaaaa 47221 atatggagat taattagcta ataatatttc ctaagaaaat aagttattga tattaaaata 47281 aacccaaata tatagaaagg aaaataagat aaatgcagaa attaatgaaa aaacaatgtt 47341 acaataggga aaaacaatga aggtaaaaat tagtttgctc taaggtaaat aataaacaca 47401 cttctcacaa atgagcaaga ataaaagaaa ataagaacag ttaaccagca tctggaataa 47461 agcagtggca aaatcacaat ttctgcagta attataaaga taagagatta taactgataa 47521 atgtatgaca caccagattg aaaatctagt tgaacatatt tctagaagca gacattacca 47581 aaactaccca aaagaggaac agaaagactg aatttttcta taactgattg aaaattgaat 47641 tcataagcaa aagctatcca ctacggaaat ggaaattcat aaccaggtta cttcactgct 47701 gaattctact gaacattcaa agaggaagaa actttaatcg tacataagtt ctcccaaaga 47761 atagagaaag cagaaatgtt tcacaattca ttttatgagg ctcacatatt cttaatatca 47821 aaaccacaca aagcagacgg tgacatcaaa aagatagcag aataggaagt cccacagaag 47881 caacaattga acaattatat atagaccaaa atacttttat gatatttcca aactccagat 47941 aggaatttgc tgtaccccag gtaagcatag agctgagaac aggcacattg aaatgggaaa 48001 gaagagcaat taagatttac caacattagg cacttcccaa gacagcacag gttgttccag 48061 agaaaaaaat gtctcagctc gttatttctc cctagaggaa gggagaggga gaaaaaagtg 48121 gaaaatgtat acaaattttt ggcttttctg cagccttcct gaaagaatga cttttcattt 48181 cactcagagt gctgatgggc ttggcatagt ttggatgcct aggggctact aaaagcaaag 48241 atgagcagga ggggcagctt actgaggcca gcacagattg gcaaaatcaa gagaaggtgc 48301 aaaatctgag gctcctccca tgaaagggag ggagaagact gaaattttta tccactgttt 48361 catagtttta aagggctgcc taagggatta gcatctgtct cattttactt gagacactga 48421 taaagagctg acataatttg ggtgcctagt ggatgataag aacaaaagag gatgtgtaat 48481 gccattacag aagcagacaa cttggagtgc cacaaacaga caccagaggg agcaagaaat 48541 taccagctat gagcccaagc aattgagaaa ttgcatgccc atacccagag aagtgacatc 48601 cctccaaaac tggtttcaga ggccctcaga atctaaagat aggctaattg gtgagggtcc 48661 ctcctggtat gaagccagtc cctaagtact gggaaaggtg actttttttc taatgtgcag 48721 aaccgagtac aaagttgtga gacacacaaa gaagccagga aacatggctt taaaaaatga 48781 tgcacaatac ttgataggta gtttttctac cctcactcac ctcccaccct tcttcctcta 48841 gtagtcccct gtgtttactg tttccatttt tatgtcaact tggacacaat gtttagcttc 48901 cacttatagg tgagaacatg gttttctgtt tctgtgctat ttgcttcaga taatgacctc 48961 cagctgcatc catattgctg caaagtacat aattttattt tttatggctg catagtattc 49021 catgctatat atgtaccaca ttttctttat ccagtctgcc cttgaaggta atcgttgaag 49081 ttgattccat gtcgttgtta ttgtgactag tgctgtgata aacatactag tgcatatatc 49141 tttttggtag aatgatttat attcctttga gtacctaccc aataatgaga ttgtaaggtc 49201 aaggggtagt tctgttttac attctttgac acatcaccaa actactttct acagtggctg 49261 aactaattta cattcccacc aacagggaga aagagttccc ttttctctgc aacctcacca 49321 acatctgtta ttttttgtct ttttaataat gaccattcca actggtatga gatggtaact 49381 tgttgtggtt ttggtttgca tttctctaat gattaatgat gttgagcatt ttgtcatctg 49441 tttcttggcc acttgtatgt cttcttttgc aaagtgtcta ttcatgtcac tacctgggtg 49501 agaggattat ttgtaaacta aacctcaatg acacacaatt taccaacata gcaaacctgc 49561 acatgtactc tctgaatcta aaataaaagt tgaaaaaaaa ttgttttaaa gtggcttaaa 49621 aaatcaaccc tgaatgaaaa ttatagcaat ctctgaaatt atgttaaaat gtatttcact 49681 ttatgtgtgc attatcaacc tgagagggtg gctaagaata tctaaattgt atgtttatat 49741 gagaaacata agaattaaca acaaatataa ataattattg attcttgctt aagaatctgt 49801 gttgccagtc caaaattctt taggaatcac aaatattaaa tgaatgtact atacaaaaaa 49861 tacaaaagta gaagagcata ggtaatcatg aaatcatatt tagaattaga attagaatta 49921 attggaactt gcaattaaat acaactaaca cttttaaact ttgaaaaaaa tggaacaata 49981 taaatctcca gaaattgatc tttaaaaaat ggagatatac aacttacctg aaacaagttc 50041 aaaacaatca taataaagat gctctgtgag ctcaagaaaa ccattcatga agaaaataag 50101 aatacaaaca aagagataga acatattttt taaaaagcaa acaaatttta gagttaaaga 50161 gtacaaaaaa ctgaactgga ccattcagaa gaagaaccca acagcaggca agatcaaaca 50221 gaagtaagaa ttagcaaaat caaaaatcag tcatttgaaa ttatgcagtc ataagaacga 50281 aaataaaata aaatgaaata tagggaaaaa cgctaaggaa cttatggaag acaatcaaga 50341 ggaacaaatt aagaattaag aaagttctag aaagaaaaat gagagaaaag aaggaaaaag 50401 cttattccaa gaaatagtag tcctaagctt cctaaatctg ggaaagaaaa tggacataca 50461 tattaaagaa gtccaacaaa ctctaactag agtgaatcca aagaaatcca caccaagata 50521 atcaaattgt caaaagtaag acagcatttt ttttataata gcaagagaaa agtcttgaca 50581 catacaaaat aactacaata agactataag tggatttctc agcagggaca tttcagaaca 50641 gaagaaagcg gaatgatata ttcaaagtgc tgaaagttaa aaacaaactg cccaccaaaa 50701 aatactctat tagggaaact atcctttaac aatgatggag aaatacttgc tataacaaag 50761 aaaagctgaa ggagttcacc aacactacac ttgccttaca agaaatgcca agaggagtca 50821 ttcaaggtga aataaaaaca ttctaaacag caacacaaaa acatataaaa gtataaaact 50881 cattcgtaaa agtaaatata tagaaaaatg aagaatgctc taatactata atggtgataa 50941 acaaatcctg ttttaattct gctatagaag ataaaagaca aaagttcaaa aataaatgtt 51001 actgtaaaaa tgtcagtgga aacaaaatat aaaaatgcaa tttttgacat taatagcata 51061 aagtgtgggg tagaagtaaa ggtgtacact atttgcatgc aattgaagtt aagttgtcat 51121 ctgcttaaga taggttgtta taactataag gtgttttatg tatgacccat ggtaagcact 51181 aaggacagat caatagaata tacacaagag aaaatgagga aggaaccaaa acatgtaatc 51241 acaaaaaaat caacactgta caaaggaata caacaggaga ggaaaagaga gaaaaatagc 51301 tacaacagag acaatgaaaa attaataaaa atgtcaatag caagtccctc attatcaaga 51361 attactgtaa caaaatggat taaaatcccc aattaaataa taaagtggct gaatggattt 51421 tttcaaagtt ataactataa accaagcatc actaatgcaa aaacccaaaa ttttaaatgc 51481 tacagatacc aacacttttt gagtgctaat atgatgctca aaggaaatgc tccttgcatc 51541 atttcagatt ttgaattttt gaattaggca tgtctaacca gtaatagcaa tgcaaatatt 51601 tcaatatccc aaaacatcca aaatctaaaa acacttcttg tcccaaacat ttttaataaa 51661 agatacacaa cctctatatt atatttataa gagatttgtg ttagatttaa taacacacat 51721 aggctgaaag tgaaaagatg ggaaaatata tcccatgcaa gtgctaacaa aagagagcag 51781 gttggacata attatatcag aaaaaaatag attttaagtc aaaaattgtc aaaagagata 51841 aagagaccat tatacattga tgaaagtgtc agttcaccag gaatatatag caattataaa 51901 aacataggta caagatgccc tgccccagca gatatgcaag caccccacag cactgccatg 51961 tctgctggca tgtgcaggcg agcatggttc ccactgccac tgccatgacg aagtgctttg 52021 gctggcacca cctattaaaa tgttgtcagc agaccagaaa cacctcagct gctccagcac 52081 aacaggttcc taacctgaga ggccagagga aaaaagccaa aggcctcgtt caagtccccc 52141 aaagttagag aacacagctc tggagtgctg aattgagcct tggtccccta aaatctccca 52201 gaaataaaac catttaactg aacccaactt ataccacaat taaaccccaa aagacatcaa 52261 agaaggtaaa agcaaaaagt tccatccaaa ggagagtaat ttcaaatatt gaaggaacat 52321 tagcctgtag atatgagaaa gaatcttcac aagaactcca gcaactcaaa aagccagatt 52381 cccttcttac cttagaacaa ccataatgct ttcccatcaa tggatcttaa tcagactgaa 52441 atggctgaaa tgacagacat agaatttgga atatggatag aaatgaagat cactgagatt 52501 caggagaaag tgaaaaccca attcaaggaa tctaaggaat acaataaaat gatacaggag 52561 ctaaaagagg aaatagtcat tttaagataa aatcaaactg atctgataga actgaaaaac 52621 taacttcaag aatttcataa tataattgcc aatattaaca gcagaatcaa ccaagctgag 52681 gaaagaatct cggagctcga aaattggcct tctgaaataa gtcattcaaa aaaaaaataa 52741 agaaaaagca ataaagaaaa acacaacttt tgagcaatat gaatttatgt aaagatgaca 52801 aatctatgac tcattgacat ccttgaaaga gagggagaga aagcaagcaa ctcaaaaaac 52861 atatttgagg atattaccca taaagatttc cccaacctcg ctagagagac caacattcaa 52921 attcaggaaa tacagagaac cccagggaga gatgaccatc cccaagacac atagtcatca 52981 gattctccaa ggttgaaatg aaagaaaaaa aggtaaaggc ctctggagag aagggtttgg 53041 acacctacaa tgggaacctc agcaggctaa cagtggaact ttcagcacaa gccctacaag 53101 ccagaatata ttggaggcct aggttcagca ttcttaaaga aattccaacc aagaatttca 53161 tatccagtca aaccaagctt tcataagcaa aggaaaacta agatcctttt cagataagca 53221 aatcctatgg aaatttatta ccaccagccc tgtctttaag tgaatgctaa atatgaacag 53281 aaaagaccat tactggatgc cacaaaaaca cacttaagac cttaaccaca tacacaattg 53341 ataatataaa gcaacttcac aatgaagtct acatgataac cagctaaaaa cacattgact 53401 ggatcaaatc tgcaaatatt gatattaact ttgaaggtaa atgggacaaa tgcccaaatt 53461 aaaaggcaca gagtagcaag ctggctaaaa atgcaagact gaatgaactg tatgagatct 53521 tcaagaaacc catctcacat acaatgacat tcatagtctc aaaatataga gatggagaga 53581 aatgaaccaa gcaaacagaa aacagaaaaa gaaaacaggg ttgtagtttt aatttcagac 53641 aaaacagagt ttaaacaaac aaatatcaaa aagacaaata aggactttac acaattgtaa 53701 agggttcaat tcgacagaaa aacctaagta tcttaaatat gtgtgcactc aacccaggag 53761 catccagatt cataaagcaa gccttacaga cctatgaaga gacttagata actacacaaa 53821 aatagtggga aacataaaca tctcattgac agtataagac agatcatcga ggcagaaaac 53881 taacaaaaat atttgggaac tgaactttat gcttgaccaa atgggcctga cagacatcta 53941 tagaacagaa aaccatacga ttccatggaa attaaacaac ctattccaga atgactttgg 54001 ggtgaacaac aaaattaaat aaaaaaacaa aaaaatcttt caagctaatt tgaatgaagc 54061 tacatcatac aagaatctct agaacatagt tacagcagtt ttaggagaaa agcttatagt 54121 gctaaatacc cacatttaga aagacttcaa gttaacaatg taacattaca cctagaggaa 54181 ctagaaaaac aagagcaaac caaccccgaa gctagcagaa gacaaaaaat aaccaaaatc 54241 agaactgaac tatacaaaaa aaatcaatga acccaggagc tgattcttag aaagaataaa 54301 taagattgat agatcacagg ctgatccaaa ttaacacaat cagaaatggc aaaggggact 54361 ttactactga tgccacagaa ataccaaaaa ccctcaggac tactacaaac acctctatga 54421 acacaacctg gaaaacctag aaaaatggac aaatgcctgg aaatgtacaa ctttccaaga 54481 ttgaatcaga aagaaactga atccttgaaa agaccaataa tgcattccaa agttgaatta 54541 gtaagaaaaa gcctaccaac taaaaggacc agacagattc acagatgaat tctgctaggt 54601 gtataaggaa gagttagtac catttttact gaaaatattc cccccaaaaa tgagaaagag 54661 ggactcccta atttattcta tgagggcagc atcattctga taaacctggc agaaacacaa 54721 taaaaaaaga gaaaacttta ggccaaaatc cttgatgaat atggatttaa aacctcaaaa 54781 aatactagca aactgaatcc aacagcccac caaaaagcta atccaccatg atcaagtagg 54841 ctttatctct gtgacataag gttggttcaa tatacacaaa tcagtaaatg aaattcatca 54901 cataagcaga actaaaaaca aaaaaaaaga tcatctcaac agaggcagaa acagctttct 54961 ataatattca acattccttc atattaaaaa acttgcaaca atctaggtat tgaaggaaca 55021 tacctcaaaa taataaaagc catctctgac aaacccacag ccaacattat actgaataag 55081 cagaagccga tgcattcctg ttgagaacca gaacaagaca agaattccca ctctgaccac 55141 tcctattcaa ggtagtattg gaagaccaaa ccagagcaat tgggcaagag aaagaaataa 55201 aagctatcta aataggatgt caaactattt gtttgcagac aacatgattc ttcacctaga 55261 aaataccata gtctctgcca aaagctccta gatctgataa accacttcaa caaagtttca 55321 gtacacaaaa ccaatgctca aaagtaagta atatttctgt aaacacacca caacaaagct 55381 gagagccaaa tcaagaaaac aatcctattc acaatagaca caaaaagcat aaaataccta 55441 tgtatatagc taaccaggga gggaaaagat atctacgatg agaattacaa aacattgctg 55501 aaacaaatca gagaattcac aaacaagtgg aaaaacatta catgcttatg aagagaaagc 55561 atcagtatcg ttaaaaatgg atatactgcc aaaagcaatt tacaaattca aagctatttc 55621 tatcaaacta tcaatggtat ttcctaaatc attagaaaaa aaactgtttt tatgtacaca 55681 gggaaccaaa aaaagagcct gaatagccaa atcagtccta agcaaaaaga acaaactgga 55741 agcatcacat tacctaactt cgaactatat tacattacct aacttcaaac tatattacag 55801 gggtatagta atcaaaactg catggtactg gtacaaaaac agacacatag gccaatggaa 55861 cggaaaatag tccagaaata aagccagaca cctacaacca tatgatcttt gacaaagtca 55921 ataaaataag caatgaggaa agaactccct attcaataaa tggtgctgga ataactggct 55981 agccatttgc agaagatgga agggagtaga ccacttactt acatcatata aaaataccaa 56041 ctccagatgg attaaagact taaatgtaaa acctacaact ataaaaatcc ttgaagaaaa 56101 cctaggaaat acaattctgg acataagctc tggaaaagat ttcatggaga aaaagccaaa 56161 agcaattaca atgaaaacaa aaattggcaa atgggacctg tttaaactaa agagattctg 56221 tacagcaaaa gaaactatca acagagtaaa cagataaatt agagaatgtt tggaaaccat 56281 gcatctgaca aagaaataag aaataagaaa tctgaataaa gaaatctgaa taagaaaatg 56341 tttggaaacc atgcatctga caaaagtcta atatccagaa tctagaagaa atttaaatta 56401 acaagtataa taataaacaa tgccgctgaa cagtgggcaa tggacttgaa cagataatct 56461 tcaaaagaag acatacagtg gtcaataagc atacgaacaa aaatgctcaa aatcactaat 56521 cattagagaa atgcaaataa aagccacaat gagataccat tccacaccag tcagaatggc 56581 tattactaaa tgttaaaaaa taacagatgc tgacaaggtt gcagagaaaa tggaatgatt 56641 atagtttgct agtgggaatg taaattattt cagccattgt ggaacacagt ttggtgattt 56701 ctcaaagctt aaaatagaac taccatttga cccagcaatc ttattatcgg ttatatacac 56761 aaaggaatat aaatcgttct atcataagta cacatgcaca cacatgttca tcgcagcact 56821 attcacaaca gcaaagacat gaaatcaacc aaaatgccca tcaatggtag agtggataaa 56881 gaaaatgtgg tacacatata ccatggaata ctcttcagcc ataaagaagg atgagataat 56941 gcccttttca gcaacatgga tggagctgga ggccactatg cttagcaaac tagcacatgt 57001 tctcacttat aagtgggcac taaacactgt gtacacatga acgcaaagaa ggaaacaaca 57061 gacactggag cctatttgag ggtggacaaa gggaggagga tgaggattga aaaactatca 57121 ttttattata tcctatgctt attacctgga tgattaaata atctgtatca ccaacccctg 57181 tgatacacaa tctacctata taataaaact acacatgtac ccttgaacct aaaataaaca 57241 catatgcaca cgcacaccag atgtcgggag accccaaaat ataaagcaaa tattgaaaga 57301 actgaaggga gaaataggca acagaatatc agtaggaata atagtagggg atttcaatat 57361 tccactttca atcccaggtt gaacatctac gcagaagatg aatacaaaaa cagagtatgt 57421 ggctgggttc agtggctcac tcctgtaatc ccagcacttt gggaggctga ggtgggcaga 57481 ttgcttaagg ttaggagtcc gagaccaacc tgggcaacat ggcaaacctt gtctctacaa 57541 aaaatacaaa aattatctag gcgtggtggc ttgtgcctgt attcccagtc acttgggagg 57601 ctgaggtgga aggatcactt gagctcagga ggtagaggtt ggaaaaagct gagatcacgc 57661 cactgtattc cagcctgggc gacagaggga caccctgtct caaaaacaag caagcaaaca 57721 aacaaacaaa caaaaaagag gatgtaaaca acactatgga tcaaatagat ctaaaagaca 57781 tatacagaac attccaccca aaagcaacat cataaacatt cttctcaagt gcataagaaa 57841 cattctccaa gataggtcac atcttatgcc acaaaacaaa tttaagaaga ttgacatcat 57901 accaggtgta ttttttaacc atgacagaat aaaagtggct gaagaaaata gcaaaattta 57961 caaatatgtg gaagttacac aatatactta tgcatagcca aataactaaa aacaaaaagt 58021 aaaagaaaaa atagaaaatg tattgacaca aattaaatac aaaacactgt catttgccaa 58081 aacttatagg atgtttgcca aaacttatat gatattacta tagcagtatt aacataaaac 58141 tttattttga taagtgccta tattagaaag aagaaacatc tcttaacaat ctaagtttat 58201 gcccatagga actagaaaat gaaaaacaaa ctctgcttag agttagcaaa agaaataaaa 58261 tgaaattgca gcagaaataa atgaaacaga gaagagaaaa taaccaatga aactagggat 58321 tgattttttt agaagaccaa taaggttgaa aaacatgtag ttagattaac agagaaaaaa 58381 gggagaaagt gcaattaaat aaaattagaa attaaagata cattgcaact gatgccataa 58441 aaataaacag gattacaaga gactactatg aacaaagtac acaccaacaa aatggataac 58501 ctagaaaaaa aatggatcaa ctcctataaa catacttgtt ttacttgttt ttttttctga 58561 cattagagga aaatatttca gcttttcatc attaaatatg atgtgatgtt attagtcagt 58621 tttcatgctc ctgataaaga catacaccaa acagtaattt ataaagaaaa agaggtttaa 58681 tggactcaca gttccacgtg gctagggagg cttcacaatc atgacagaag gtgaaaggcg 58741 tatcttacat ggcagcaggc aagagaacga gatccaagca aaaaagggtt tccccttata 58801 aaactatcag atatcacaag acttattcac taccacgaga acggtatggg ggaaactgcc 58861 cccgtgattc aattatctcc cactgggtcc ctctcataac acgtgggaat tatagacggc 58921 actatgagta gaaaacacaa acagacctat aaataatgaa cttgaatcag taatcaaaac 58981 ctcacaacaa agaaaaaccc aagtccaggt tttttcactc attaatttta ccatttgaag 59041 aataattaat gccaattctt ctcaaactct ttcaaaaaat tgaagagcag gagcattttc 59101 aagctcatta catgaggttt gccagaaaaa gacactacaa aggaagaaaa caacaggtca 59161 atatccctaa tgtatataca tacaaaaatc gtcaataaaa tactagcaaa ctaaacataa 59221 cagcacatca aaatgatcat tcaccatcaa caatttgaat ttatccctgt tgaaccatgc 59281 aatgtaaaaa aaaaaatagt atagtgtttg tcagggactg tagaaaaaaa gaaatgagga 59341 gttgccgttc aatgaatata gagttttaga cgtgcaacat gtgaaagttc tggagatcat 59401 ctgtacaaca ttgttcctat agttaacaat attgtgctgt acattaaaaa tgtattaaca 59461 ttatagatct catgttatat gctttttacc acaaaaacaa atacaaagaa gaaagaaaaa 59521 acaccagcca atctcatata taaaagtata tgaaaatatc ctaaacaaga acttaaacgg 59581 ttttaaaatg tcagctttat tataagcaag ctggcttcac tcttcaccac caacagaaga 59641 caaaaaacaa tacagcacca agtttttcaa cagaaacaac ccagaactca aatataagga 59701 tgagacagat cccagggctg cagagaagtg gaaaacccct gagcagatgg taggagaatt 59761 agacttccac atctgcagtg ctcctactct tgtattctgc ttggcaacaa gtgtatgaaa 59821 attatttccc caattcgcag tttctacact ggagaaagtg agattgaggt ggtaaacaag 59881 ctctgtcacc agtctgggtt ccctggcagg agatctgttc ttgccttaac ctacaggaag 59941 cattgtaacc gcctgaagga agaaatatcc ctgaggacaa gcagaaacaa aatggggagt 60001 gggagtacca tccccagtcc tggaaacact tctgtgtaac tcagccaaag gatacatcaa 60061 attagagtga ctgttcaaca gcaccaagct gtataaggta catttcacta gtctcttggg 60121 tgcaaaccct tccttagccc gccttcccac acggcttgta taatcccatt agatctctcc 60181 tattctggac aggcagcact tctattattt actagagcca aagtgaaact gggcttaagg 60241 tgtgacctaa aggtgaaaag gacaaagcaa ctgagcagta aatatttact aagcacatat 60301 tgccaatttt ttttgtaaaa tctgtacaga gaaaactgac ataaataacc aattaatcat 60361 ttaatgaaaa gatatagaca tatacctaca agaaacaacc acgaacagga aacctgactg 60421 ccacaaaggg acaaagcaaa aatccagtta cttaccccaa ctaaacagca atttgtgagc 60481 actctgacca aaaattcaaa atagtttttt gtttgtttgt ttgttttgtt ttgtttgttt 60541 gtttgtttgc agctgttgta aaaggggttg agttcttgac ttgattctcg gcttggtcgt 60601 tgttgatata cagcagtgct actgatttgt gtacgttgat attgtatccc aaaactttac 60661 tgaattcact tatcagattg aggagttttt tagacgagtc tttaaggttt tctaggtata 60721 gactcatatc attgggaaca gcgacagttt gacttccttt ttaccaattt ggatatcctt 60781 tacatctttc tcttgtctga ttgctctagc tagaacttcg acagaagttg tgaaaattgg 60841 catccttgtc tcgttccagt tctcagggaa aatgcttaca acttttccct gatcagtata 60901 atgttggctg tggatttgtc atagatggct tttattatct tgaggtatgc tcctttgatg 60961 ccaattttgc tgaggggttt aatcataaag ggatgttgga tcttgtcaaa tgttttttct 61021 gtgtctacta acatgaccat atgattttgt ttttaattct gtttatgtgg catatcacat 61081 ttattgactt gtatgtgtta aacaatccct gcatccctgg taagaaaccc acttgatcat 61141 ggtggataat ttttttatat gcttttggat tcagttagct aatgttttgt tgaggaattt 61201 tgcatctaag ttcttcagag atactggtct gtagttttct ttttttgtta tgtcctttcc 61261 tggtttgggt attagggtga tactggcttc atagaatgat ttaaggagga ttccttcttt 61321 ctctatcttt tggaatagtt taatcaggac tgataccaat tctcctttga atgtttggca 61381 gaattcagct gtgaatctgt ctggtcctgg atttttttgg ccatttttta attaccattt 61441 caatctcgct ttttgttatt ggtctgttca gagtttctat tttttcctgg tttaatctgg 61501 aaagatatat ttcaaggaat ttatctatcc cctagtttgt gtatgtacag gtgttcatag 61561 tagcctttaa taatcctttt tatttatgtg atatcagttg taatatctcc catttcattt 61621 ctaattgagc ttatttggat cttctttctt catttcttgg ttaatttcac taatggtcta 61681 tcaattttgt ttatcttttc aaaggatcag cttttgtttt attcatcttt tgtatttttt 61741 gtctcaattg catttaattc tgctctgatc ttagttattt cctttcttct gctgggtttg 61801 ggtttggttt ttttcttgtt tctctagttc cttgaggtgt gaacttagat tgcctatttg 61861 tgctttttca gacttttgga tgtaggggtt taatgctatg aacttttttc ttagcacaac 61921 ttttgctgta ccccggaagt tttcataggt tgtgtcacta ctatcgttca gttcaaataa 61981 ttttttaaat tttcattttt atttcattgc tgacccaaag atcattcagg aggagattat 62041 ttaatttcca tgtgtttgtg gaggtatgag ggttttctgg ggttttaagc aaacagcaac 62101 aatttaactt tctttttatc aatttggatg cctttactgt tttgtgaaga acaatataat 62161 ggatgttggg gattcatggg gaagggttga agatggtgag caataaaaga ctacacactg 62221 agtacagtgt acactgcttg ggtgataggt gcaacaaaat ctcagaagtc accaataaag 62281 aacttatcca tgtaaccaaa aaccacttgt tcccaaaaac tattgaaaca aaaaataaat 62341 taaaaaataa accctccccc aaaaatagga tttttaatga aactcagtgg tcttcaacac 62401 agaaaaccaa ctcagaaatt tattagataa atttaccaaa aaggttgaaa taatttaaaa 62461 aatcaaatgt agaagtctta gagctgagaa atatatttgc taaactgaaa aacaagttag 62521 aggctacata caggagaata tatcaagcag aggaaaaaat tcagggacct caaaggccat 62581 ctatttgaca atgaaccgtg agagggaaaa aaaataacga aaaaaaaaaa aaacgaagat 62641 gcctacaaga tatattaata gaaaatactt caaaaaacca tatctaagaa ttggtgttaa 62701 agaggaagct gagcaagatg cagggataga acgcttaata aaagaaataa taaccgaaaa 62761 ctattcaaaa cttgaaaaaa tataaatatt cagatgtagg aaagtctgag aacaccaaat 62821 agattcaacc taagtaagac aacccccagg tatataataa tcagactttc aaggggcaag 62881 aacaaataga ggatcctaaa agcagcaaga gaaaagaagc aaataacatg ttaagtagtt 62941 ccaattcatc tggcaacagc cttttcaagg aaaatcatac aggccagtag ggagttgtat 63001 gatgtttcat tttcaaggtg cttaaagaaa aaacacacta tccaagagta atgtactaaa 63061 caaaattacc cttcaaatat gaagaagaga taaagtcttt cacagaaaat gatatctaag 63121 tacctaagat atttcaccac caccagaccc atcttacatg aaatgctaaa gggagctctt 63181 caatctaaca aaaaacaaaa ctcctatgtg caaaaaaaga aagacagaaa aaacatttga 63241 aggcataaaa cccactggta aaattaagta cttgaacaag ccaataatac tctattacta 63301 tagagtattg tgcagtccac tcgtaactct attatgaagt ccaaaagaga aatctaccaa 63361 aaaaatcata gctatggaag cctattaaga tataggtatt gtaaaataca taaattgaga 63421 cacctaaaag tcaaaatatg gacgatatgg aaataaagtg tagaatttgt tgtgtgtgtt 63481 ttgcctttgt ttctatttct atatttgtga cctaagataa gttttcatct ctttaaaaat 63541 accgtgttaa atccataaac tgtttttgta aacctcatag aaaccacagc actaaaagct 63601 atgatagatt cactacacat aagagcaaca aattaaaaca tattaccaga gaaaattact 63661 taactacaaa agaacatagt aagaaaggaa gaaagacagg agtcccaaaa taactggaaa 63721 acaggcaaca aaatgacagt agtaagttct tacttagcaa taataatgct gaatgtaaac 63781 agtctcaatt atccaattac aaagctgaat ggataaagaa cgacctgact atatgctgcc 63841 ttaaagaaac atacttcacc tacaaaggca aacttagact gaaagtgaaa gggtagaaaa 63901 acatattcca tgcaagtgga aactgaaaaa gagccgaagt gtctatattt atatcagata 63961 aataggctac aaatccaaga ttgtagaaaa agacaaagaa ggtaactata taataataaa 64021 ggtgtcaatt cagcaagatg atataacaaa tataaatagc tatgaaccca atactggagc 64081 tcccccaagt atataaagca aacatcaata tatctaaagg aaaagaaaag ttgcagtaca 64141 ataatagtag gcaactttaa cactccaatc tcagtaatga acagatcttc cagatagaaa 64201 atcaacaaag aaacagaaca tttaaactac acacactaga tctaataagg ctaaatgata 64261 tttacagagt tttcacccag ctgccgcgga atacacattc ttttcatcag tacatgaaca 64321 ttctccagaa tacaccatac cttaggccac aaaacaagtc tgaataattt tttttaaaaa 64381 atagaagtca tatagggtat cttttctgac cacaatgaaa taaaactaga aataaattcc 64441 aagagaaaca agaggaatct caaaaaatac acaaacacat gtaaatgata caacatgctc 64501 ttgagttatg aatgagtcaa tgaagaaaat aagaagaaaa ttaacatttt tcaaaacaaa 64561 tgaaaatgga aacactaaaa tttgtgggat atggcaaaag cagtactaag aaggaagttt 64621 atagcaataa acacctatgt aaaaaaaggt agcaagactt caaataaaca acctaatgat 64681 gcacctcaaa aactagaaaa gcaagaacaa aatgacccca taattaggag aaggaaagtc 64741 atgattaaga ccagagaata aatcaatgaa acggtgacta aaacaaaaat atagatcaat 64801 aaaatgaaaa gttggttttt ttgaaatgat caacaaaata aaacaaacct ctagctagac 64861 tatccaagca aaaagagaga cgacccaaat aaataaaatc acaaacaaca aaaagggagg 64921 cataacaact gaactcttgg aaatacaaag aatcattaga gaatattttg aacaactata 64981 tgacaacaaa ttggaaatcc tagaaaaaaa atggataaag ttctggacac atacaagcta 65041 tcaacactga accaggaaga aagagaaaac cttgacaaat aacaagtaat gagatcgaag 65101 ccataataaa aagtccccca tcaaagaaaa gctcaggcct tcatggcttc actgttgaaa 65161 tctaccaaac atttaaagaa cttatgccaa ttcaactcaa actctttaaa aaaaaaatgg 65221 aagcagaagg aatacttaca gactcattct acaaggtcag cattaccctg atactgaaac 65281 cagacaaaga cacaacaaca acaaaaacaa taggccaatg tcactaatgt tgggaaagct 65341 ggataactat atgtggaaga atgaaaatag gaccctgtct ctcaccacac acagaaatga 65401 aatcataatt gattacagat ttaaatctaa gacgtgaaac tatgaaacta ctagacaaaa 65461 cctttgggga attgctatag gacattagtc agagaaaaga ttttgtgtgt gtgtgtgtta 65521 agacctcaaa aacacaggca actaaagtaa aaaaaataag attgcattga cctaagaagg 65581 ttctacacag aaaaggaaac aatcaacaaa gtaaagagat aactcacaga gtggtagaaa 65641 atatttgcaa actattcatt tgataagaga ttaatagcca gaatatataa ggagctcaaa 65701 caactcaata gtagaaaaca aatacttcag tttaaaaatg agcaaaagat ctgaacagac 65761 atttctcaaa agaagacata caaatgacca tcaggtatat gaaaaaatgt tcaacatcac 65821 taatcaccag aggaatgcaa gtcaaaacca caatgagcta ttatctcact tctgttaaaa 65881 tggcttatat caaaaataca ggcaataaca gatgctggtg aggatgtgga gaaacggaaa 65941 ctcttataaa ctgttagtgg aaatgtaatt tagtacaacc actatggaaa acagtatgga 66001 agttcctcaa aaagtgaaac atagtactac tgtatgatcc agcaattcca gtactggata 66061 tatacccaaa ggaaatcaat ataatgaaga gatatctgca cttccatcat tattgcagca 66121 ctattcacat agccaaaata cgaaatcaat ctaagtaccc atcagtggat gaatggataa 66181 ataaaatttg gaatatatac acaatgcaat attattcatc cattaaaaat gaaattattt 66241 tatttgcaac aacatggatt aaactggaag ccattatgtt aaatgaaata ggccaaacac 66301 aaagataaat atcacatgtt ctcactcctt tgtgggagca aaaaatgtgg atatcatgaa 66361 gacagagtag gttggtggtt acccgagact ggtaagggtg gggagaggag agatgaagaa 66421 gaaaaaaaga atgtaaatgt aattattacc actgaactgt atgcttaaaa tttttaagag 66481 ggtaaatttc atacgtatat tttacctcaa aaaatggggg gaggggggag ggatagcatt 66541 gggagatata cctaatgcta gatgacgagt taatgggtgc agcacaccag catggcacat 66601 gtatacatat gtaactaacc tgcacaatgt gcacatgtac cctaaaactt aaagtataat 66661 aaaaacaatt aaattaaaaa atcctaaaca aaatatttgc aaaataattt tcacaatatg 66721 taaaagtaat aatatgtcag ggccaagttg attttatcca agggattctg tgttgattta 66781 atactcagaa actcacaata tcagactaac gaagaaaatt cataggatta tttaaatata 66841 tggaaatttt ttttgtaaaa tcaactttct ttaatgataa agctttccgg aaaacaagga 66901 ataaaaagaa acctccttga ttttttttat aaatgtaaaa cagaaaaatc taccgcaaaa 66961 tcatacttaa tgatgacatg ttgaatactt tcaatttgac atagggaaca agacaaaaac 67021 acattgttct aaaggtctta gccagtgcag taactcaaga aaatggtcca tttagcagag 67081 aaaagttaat aatgtgagtt atataaatag gcgtcagaag actgaaaagg ctgttgcaaa 67141 cactgtggta acaagagagc aactgcagaa acagataccg cccctagacc tagagagaca 67201 aaggagatag attagagtta tcagaaccta gaagcttgga gaaaaagact tgagaaatcc 67261 tctctgagag gatgctgcct gactagtgat ggtattcagg agcttggaag gggatctcat 67321 ggatcaggga ctcagacctc tgaaaatagg ccacaggtaa tacctccagt ttgtgttaat 67381 acctccaagt atttataatg gggttggatc tgggagtgtc agggaatgct ggacattgga 67441 accaactgct attgaaacca aatgccactg ctggggtgaa gagccattgc tggagtgaaa 67501 cagggagcaa ggaagaaaga acaaattcct tccccctcat ctccaaccgg aaaatctctt 67561 cctaaggccc actattgaag aaatataatg cagaaccaag ttgcaaagaa aaaatgtgct 67621 atgctgagct tcacctcagt cccacggagc agagaataga agggtaaatt tagagcagaa 67681 agacaatagc ttaataactg ctgcaaaagt aaaagctaaa atgattgtat tttatttcaa 67741 cttttactat aataacagtt ccaataattt cagcaacact tatttcttgc ttacatgaca 67801 tgaaggttgc gggttgattg cagttctaca cagctgtgct tagcttcaca tgtcttctta 67861 ttcttagccc caggttaagc accagcccca ttcgagttat gttcttcctc agacagggga 67921 catacaggtg caaaagaatc taaaccaatc tacaagatgg cagttaaatc ctttgttcaa 67981 acatggcaca cactatattc attcacatga tatcacccaa accatgttcc atttcaaatg 68041 atcaagtcat tgatgcagaa agtatagccc tcccacagct actggcgggt ggaaagggat 68101 gcacaagcaa gtcatatgac aatgttctgg gttgtataat cctattgcac aggagtagtg 68161 aatagatgaa aaaatctaat atttcataca tatgaagcaa atagataaaa cttattattt 68221 taagttatat gtttgtgttc ctggaaaatc caacaaaatc tatatataaa atattaaaat 68281 taattaaaga acttggcagg ttgagggaaa ctaaacaacc aattgcaaaa tcgatcatat 68341 ctctataaaa tagcaacaaa tgtttataaa gtgatatttg taaaagggtg ccctatacaa 68401 aacatgacaa aataatgaac aagatttcta tggagagata taaaacttca ttaacagaga 68461 gagagagaga gagagagaga gatgtcatgt ttatggattg gaagaccccg tatggtcaaa 68521 aaatgaataa attcaatgaa atctccatta atatctcaat gggcttttct ctaaattgac 68581 aagattatct taacatttat atggaaacac aaaagtctaa aagtagaaaa gatattcttg 68641 gagaaaaaaa caaattttag caaatttatt taggaaatta taaaattata gtaataaaaa 68701 ctatggtctg ggcacggtgg ctcatgcctg taatcttaga actttgggaa gaggaggctg 68761 gaagatagct tgagctggaa gatagcttga gtttaggagt ttaagatcag cctgggcaat 68821 acagcagaac ctcacctcta caaaaaaata caattagcca ggcatggtag tatgcgcctg 68881 tagtccaagc cactcaggag gcagaggtgg ggggaatctc ttgagctcag gaggctgagg 68941 ttacaatgag ccatgatcgc gccactgcac tccagcctgg gtaacacagc aagacccttt 69001 ctcaggaaaa aaaaaaaaaa aagaatatgg caattgcaca gatttatata tagaccaaat 69061 gagagataat tgaaaacctt gaaacagaca aatgcatgta ggatcccttg atttatacaa 69121 cattaacaaa gtagatcagt cttttcaata aataatggtg aaacaagttg cccatatagg 69181 taaaaataaa attggagccc tacatcacat ggtcacaaaa attaatccag gtagattata 69241 gacctgtgtt tgaaacacaa aacaacaaag ctttcagata ataaaataat atattatcag 69301 ccaggcacgg tggtttattc ctgtaatccc agcactttgg gaggccaaag tgcttgagcc 69361 tgggaggttg aggatgcaat gagcctaatc atggccctgc attccagcct gggtgacaga 69421 gcaaagctct gtctcaaaaa acaaatatat ttatgctctc aggacaagaa aatatttctt 69481 aatttcataa gaagcactaa ctataagggg aaaaatgaat aaatttgact ttttaaatta 69541 agaatttatt ttcatttaaa gaactattaa gaaatgtaaa aggtaagtca caggatgaaa 69601 gaatatagtt gaaaaaatat aaccaacaaa gggatccttc taaggacaca gaaagcatta 69661 ctacaaagat gtaatatgaa aaagtaagac aacccaatat atgtgcaaat gacagcagga 69721 atgtcataaa aaaggaaaga taaaatgaaa caatgagaaa tctcatattc tgaatgagtg 69781 tggaataaaa ggctgttaat tgatacgatc aatcattttg gaaaaaaatt tggagtaatc 69841 tagcaaaatt ggagatacct tacaacacaa cattccaatg ttagatctat atgccttaca 69901 gaaatataag ttatgtgtaa aagacccatg tgcaaaaatg ctcatagcag cattccttag 69961 agcaaaaact agaagcaaca taaatatcca tcaacaatac aaagggataa ataaattata 70021 gtgtattcac gcaatgaaat actatgcaga agtgttaatg aactacaact aatgctacaa 70081 tgtgggtgac acacactcat attatattga tcaaaatcag ccagttataa aatagcacat 70141 actgtatgat ttcatttgaa taacattcaa aaataggcaa aactgaactg ttatttcaga 70201 acacgggtgg aaaaactaca aagcgaacca tggaagtcag aagagggtta ctcttagtgg 70261 aggagagggc tacatttaag aaaatgtaca caggtgttga gagtgtttta tttctttccc 70321 taaggaatta catgggggtt gctttacaat catttccaag ttgtactttc agttttttat 70381 atatggtatg tgggttataa ttagcaatac aaatgtaaag ataagttctc tgattagtca 70441 aaatacattt tttgaagcct ttagtcaaaa ggattggttt gaatctgagt tgtaaaacct 70501 tgagcacatc tcttaccttt tctgtgcctt cattttccca cctgtaaact gaggataatt 70561 atatctattc actccaaagt gtcaaatgtt cagtccaaag tgttaaatat gaatgcacac 70621 tgtagactgt aaagcacaat gtagataggc tgctgttatt gttttctgag cctacagcag 70681 gttagagaaa aaggggaaag aaaatgaggt cgaggttact acaggcaggt atattcacag 70741 catcttttaa ctatgacagt tgaacttaaa aagaaaaaaa actggaaaat cagataaaga 70801 tctgaaaatg cacagcttag aacaatttta gaaatgggac aaacacagct cttaataaat 70861 tagcaggcca aggtcattga cacatgagag aagagcaaag acatgagaat ttcttgtgat 70921 tacccaggac cattaagaaa catatcagtt atgttcacta tagctcactg aaagaacgag 70981 agaatgcaat ataaatgttt caatccaaat tgataaattt ttaaaggctt ttgtgggcca 71041 tttagaggta aaatttttat ctgtcaaata cacatttaca cagaaaaagt tatagcaata 71101 aaaataattc cttccctttt gctgaaatat ctgccatgtg tcatgcacag atctaaaagc 71161 tattcagatt tttaaaattt aattattaag ataaaccagc aagctatatt attttgttat 71221 ctccatctca caaattaaaa aaaaaaaact gaggtttaga gagttttaat gattttctct 71281 tacattgaag gaggagtaga gatgaaattt gaacccaaat atgattcatt ccaaatcttg 71341 gacttcttct atcataaaag ctcctgttta ctacttaaaa ataatagata acgtagtgga 71401 gcaggtattt atatggtcag tcacatatcc attcatttta ctttacacaa caatttaatc 71461 acataagttt ttattaatca tcttaaaggc ttcacttctt aaaccattga ttctttcacc 71521 acttgtagat aatgtcatat cattttagtt catccacaca aggcattttt tctgagtaat 71581 ttagaatctc agttcatcat ctccatcagt atctcaatta gcccttatat gtacttgtat 71641 atatttccag tttttgctaa aacaatttaa aattctattg gggtaaagtt ttaaattgct 71701 tttaaaaata catttgagat atttggacta cataattatc tcacaatttg aactttattt 71761 agtaaatact gctactttaa aatagtgaat aaaaaataat ttattttttg atgaaaccta 71821 aatttgtcat ttaaaacatc tctaaagaca aaaccttgaa attcattttc aacaatttat 71881 aatcggttaa ttagcatatc ttagtcttag aataatttta gtaaatgtct tttagagcat 71941 ctgagtactt aatatcttaa aagtcatcta gtctaatcat gattcaccca cttctatttc 72001 tatacatgct ttataaacct cctgctctag taccttgact ggaggagata atgattctag 72061 tacatgtaat ttttattgct atctctataa tatttttact gcacaacgtc attgctgatt 72121 agcaatacaa aattatagtt ctgcaatttt ggcatattat tgctttaagt aagagctagc 72181 cattggcagt gaacaactaa atctcaacaa gggggaaatg atagattcta caaaatatag 72241 accaattaat gccccccaaa attctcaaat ttattattaa aaacttttgt aagaacctaa 72301 aaatatccaa ggatgttcat tagaagccat tatagattca ctaaaaacaa gtcatgttga 72361 accagcccca tttctttgtc ttataaaatt caggtggaac tggtacagat gtggcggcct 72421 tttaacaatc attcaaaatt ttatgttaag gtcctggact gtataatcat ctaaagcagg 72481 ggtacccagc cactgggctg gggaccagta ctggtctgtg ccctcttagg aatcaggtgg 72541 cacaggaaga gatgagcagc gggcaagaga ggatggccac ctgagctcca cctcctgtca 72601 gatcagcagg ggcattagat tctcatagaa gtgcaaactc tattgtgaac tgtacgtgca 72661 agggaatcta ggttgcacac tccttatgag aatgtaacta atgcccgatg atttgaggtg 72721 gaactgtttc atcctgaaac tgtccccagc acctgtctgt ggaaaaattg tcttccttga 72781 aactggtccc tggtgccaaa aaggttgggg atcactgatc taaagtttct tttgactttc 72841 aaaatcccat gggtttcacg tggtaaaaat aaagatacat ttttgctgga tgctaaggta 72901 tttataattt gatatacaag tttgtaaacc tgaccaaaat ctcaagtggt aaaactttga 72961 ctctctcttt tgccttatgc ttttcaatat tattttcaat gtggaccaag atatagaaga 73021 gtctttaatt tgcacatgac acaatcatag ggatagctaa tgtgtgagat ttaaaaaaaa 73081 aaataagatg tgtgtgtgtg tgtgtgtgtg tccgtagact gaaactaagt accaaaaaag 73141 gtaataaaac attttacaaa tatatttgtg ttcaaatata aattacactg tattcataat 73201 atttgaaaag tcaacaaaaa ggaatgttta tgtaaattat gttacacttt taatatgaaa 73261 taacatgata tctttaaaaa atattaatag taagagtttg caatgacata gggaaatcat 73321 gtaatataat gtcacgttga aaaagaaaac acatgaaatt gcttataaag cagtatcaca 73381 atcatattta tagatattta aaaccaaaga aaataataca gggaaaatta ccaaaatgat 73441 aaatgtgaat ggtgggattg ttattttcta catcagtatt ggaatcacat tcccagtctt 73501 ggcataggta acaagacccc tcccccaatc ccaccagctt aagaagactg tgattccaga 73561 gggaactcaa caggtcaaag acaaaagttg aaatgtttgg aaaagctcca caaggtttcc 73621 tatattaggg agaaagattt gcaccagaga tggttagatt ccagaagaac tgccagaaaa 73681 caaagcaaga aaacagtatt ttggggtgaa gaagagggtg atcatgatgg caactgaatg 73741 gtaatagggg cactagtaaa aataacttag gagtagggca gcaggggctg ttctcataca 73801 tggttgttca aaaaccccac gttgggaaga acatgaagag tggatttgac atttattcat 73861 ttattttgga gaggaattga tatttaaata aaaagtaaga aaaactctca aattctctca 73921 acaaaataat ggaacaagtt tttttttaca attattaaca ctaattgagc atttctatat 73981 gtcaggcact atgatgaact attttatgcg ttatgtcatt taattctcat aacaattcca 74041 tgaggtaggg tctaatgtta ccctcatttt cctaaaggga agtcagagtc tcagagatga 74101 catgagattt aaggtcattt agtcaatagc agggctcagc tgactcagtt aaccattata 74161 gtactctgta aaggctttca catttaggcc cgaaggtttg tgacaaatag cataatagtt 74221 tagtacacat caaaatgttt gctggtaaga aaggagtttc tttaaatttc tttttgaaat 74281 aaaccaaaaa tgagaatatt tatcttttta aaaatcattt tgtttataga tttgttggac 74341 attcctgctt tcagcagcac ctaccactat aggtctcatt gattatacat atgtattaac 74401 caaggatgga atctagttca ctttacaagc tctgcagttg ctcgcactga gaactaagga 74461 attatttggc tttctcccat gattgttatc tcccagtttg aggaaactac aatcagtcaa 74521 ttggtaaaat attaaaaaga aactttctac actcaatact gtctaaaaat cattttattc 74581 cttctatgtc cctcaatttt tcacaaagta ttttactgta aacaaatttg ggggccactc 74641 tgtgattatt cccacttgga aaaatatact gttttctctt cattgaaact agaacagaag 74701 gtctaaaaga aagaaaatta ttaagcattc tactgagttt ccccccctcc accccaacaa 74761 aaactaagat agaggtcaat gaataagtat gtgcagactg gaacttgaat cctctagggt 74821 gactgtaggc taattaataa ttaatgccag agccacggtg tttgaaatgc agatgtgata 74881 actagaggcc tggcagagct gttcagagga agcagattta gatgctatag ggaataggca 74941 ctcccttgcc ggctggctca gagagccagc ctgggtgcaa agcagggcct atggtttaga 75001 ttatattccc ctcagctgtt aattacctga acctactttg aagttcttct ctttttatcc 75061 ttgcatgaaa agagattggt tcagccaagg gacttgaaga gatgatcaaa tacatggggt 75121 ggagggaaga ggggtgggaa tagaacttca gatataagaa gataacgaag aaaaacacat 75181 agtagaacat gcaagaacat acattattag gaaatatcga attgtggctc attgtctcaa 75241 gcacattatt tctctcagtt tttctagtta agttgatttt aagtaaattg taaactttct 75301 ctgaatagtt ataacaacaa atttccatca ttgttactat atagagattt gataagagac 75361 taccacaggt gctttgtaat gctattttta acttaataaa gtcatttaac actctttata 75421 tttaaagcca agcagtgata aggataaaaa ttaatagact cagcctcaaa atgcttctat 75481 gtaccttgtt tgtgttcaaa taattcattc tcagaacaca aatagtaagt gctattattc 75541 aatgggagaa gctacaggat attagaaaac tgtggaaata aaatgcagtg gaaaaaatat 75601 gaaaatatat tgtggggggt agctgcatta aaaagcaccc taattttttt tcagaatgtt 75661 tgtgtgtgta tgcacaaaat ttaaagaagc atgtctgtta actaccttgc aaatggatta 75721 cagcatacct ggattaaatg taaatgctga cattttttgt ctctttgtgc tttatttttt 75781 ttaaatgtta tttttagcta ttttaggtag ggattatccc tgagacacat gctactcaca 75841 catgaacact gttcgctctt ttagtgggag agcatgcact catgaaatag ctaactttca 75901 attgtatctc tgtatctttt tgataggata atagtactaa tattgccatc aaggaaaaat 75961 gtgtacctat attgaaattg acacctataa gtgcaatatc atctgactaa ggtgacatca 76021 aacgcataca agcaaatgag tgacacaaca catagacctc ccgtgttcag cgtagacaca 76081 acacatagac ctccatgcag tgattttgtt gggggggggt cctaatttat ttacagttca 76141 tttctgttta aaagaatact tttatgagca gaattgttac attttttcag aataaaaagc 76201 tagaaacttg ctcatttgaa tctattattc aaaatgacaa attccaaaaa aataaaaaat 76261 aaataaaata aaatgtagga ctgtattatt gcacaattta aatttcctta ggcagaacaa 76321 caacaatatg actaaaagtc ttctatcaaa ggcaaaacac attgggaatt tgtacaattt 76381 atgtgtaaca cagctcttcc agttgtggtt ttcattttaa gaatatatac acattaacaa 76441 taagattgtg caaaactaaa acaaaaaaaa atctccaagc aatttcatct gtaacttgta 76501 ctgacataaa gcaggagggc aaaatcacag aaaacagagt ctacttccat gctgatttta 76561 tcatcaatca ggtatctctg cagtattata ttgtcggagt ccagagtgct aatgaaaatg 76621 tggcattttc tcaaagcttt acatataaac tttgtatgtc gagtcccctc taataagaaa 76681 atgagggaga gagaagtgtt tgaaaactgt gagcagtaag ccagaggtaa aatatgaaaa 76741 atcaccggct tatatcagca cctaagatgt tcatggccat agccctttct tcaatgggca 76801 ttcatagttg ttcacgtgtt ttgggtaact attttaacac acaaacatag aacaaaacac 76861 acttattaca atgcaacagc agttatcttt caaagttgag gtctgaactt ttgaattaca 76921 cttcctcaat accaaattaa gctacattgg agaatgatta cattccgatt tcatttatat 76981 gtgcttttat ttttgttttt gttatattaa acaatatgct aacaaataaa tgtattcatg 77041 tatatttttc cacttctata tttttcacat tctattcctt tggccaattc aatttaagca 77101 aaaagatatc tgaaattatg acatctagct aactttcagt gcaaaaaagt gccagcttag 77161 agacatatga agagcagaga tataaattat aaaacagcac aatggtgtca ccaagaatat 77221 taggttaaac tttctacttt atttcaactt ggctgtttta catcccccat ttttctcatt 77281 tccacatgac tgtctcttct actaacagtt gtatattttg caaggactta ttcaatttct 77341 taaagtacat tttaaaagtt ttaatataat ctagtggcac aggcttgtat taatctcaag 77401 ggattaatat attttttaag taatgaaaaa aaaatctgca tatcaaacaa tatttaagcc 77461 atggaatctg aaggccataa tttatatatt tttcttatga atttggaaca actaaagcat 77521 tcttctctat gtcacatata ttttaaatat ttttctccta ttgctatatg tttgccacag 77581 ttcagtttcg attatgtgtt tatattatga aaatgcttgt gactacaggc tcctacatat 77641 ctgtacaata ttttgcttct gtctactgca atctaaaatg tggctatatt ttctagtttg 77701 ttttgtgtcc ttttcacctg cggctacaga gaaactgaaa tgaagttctc agctctttat 77761 tagtaaaagc atcagaaaag taaatagatc ataacataaa tgtttttctg cctccagtat 77821 gagtgatgta tgttgcatga gctatgcacc gcaaagcgga ggaagagttc tttaccctgc 77881 aaaaggaaaa gtaaagcatt gaaaaaggat tttaatgaaa ttataacatt agaatacttt 77941 atattaaatc tcactaattt tctaggtgaa ggcggtgaag tgaatgtgag gaaacattaa 78001 aagggaattg aatttgcatt ccttatgcta caggcacatg tttacatgac ctttaaaaca 78061 gaattaaagc tgtatcatca gcagctgtag gcaattcaca ttactaaggc aatcaaatca 78121 gaatgctaaa ctaactataa aacataaatt aaactgatag agaatcatct taattcacta 78181 ggtttacttg caatctactt atactctact agtcatttca aagcagtttg tggatccacc 78241 aaatgagaaa tactgcaaca tatctaaaaa tagagtgtcc aactatacaa atgtaagcta 78301 aaacacttta acatatcttt ttaacatact tccatgaatc atctgagtta gcttacttca 78361 tgaacattat ttaccttcaa caatcaccta caaaagtaag cattattcca taaagacata 78421 atactcttga tatgctatac atagcagata taatttttat ttcaaatgag cacttggaag 78481 agttaataat gaatgtgtaa aggctagtaa aaattaactg aacatatttt tattccaaac 78541 ttcattgtat tattcctcaa tatccttcat accttaacaa taaatatatt ctggtttcct 78601 ttcatagcat ggtacagata gtcctccttt taaaatgtgg taggatggaa tattaaatag 78661 aatttaaaat tcaattgtgt gggatgagac ttattctaat gacataaaat gagaagtgta 78721 aatatattta taacgcaagt tatcatctca caaaaactgt tttaagaaag cttgactttt 78781 tttccatttg tagtaaattt taggaaaaca aacatattct atgtttccca aagtgtttct 78841 gtttgcatgt tgtaacttaa tttcctcctc tgtaaaaaaa atttctaaga ccccttgatg 78901 ctactgaaaa catacagaat atgtttgaaa atgcaattct tctactccaa gggatgaata 78961 aaagtagggg ggactcaaat gcatctttcc atcactaatg tgggtgtctg tataggacaa 79021 tctcggttct ttacaaaact gtcaactagc tgttaaaaga aaaccaatcc tccacaactc 79081 taagattctg cttagctgag ctgagatcca cactcaagag ttgagatcat tattgaatgc 79141 aatttatatt caaataggtt tgttctccaa tatgcaaagg tccaaccctg taaattcaca 79201 cccactgccc ctcaccacat gcgcaaacat gtgagttatc catttgttta agagaggtgg 79261 taacatacct aatgtttaca tcaagcacct ggaaaaaaaa tattgtattt aaaggttaag 79321 atgagagcag atgtgattga actattgttt attggcaaac aagaattcca atctctcccc 79381 aaaagtagaa atactaagaa gaagataggt gaggaaaagc aaaagcctca ttctcactat 79441 tatttgaata agttaaaaga aatatttagg atgtaccaaa gacatctgag tattgatatt 79501 tagattgccc taaatgtctt tccttagaag cctaaggttt tcttgcccct tcaattgttg 79561 aaaacatgaa acttcttatt tttgaagagg aataggcaga tttaaattcc tcccttaatg 79621 ttatctaagt taacacctta ttgccggaac aaaatatttt agggcaactg tcactttaac 79681 acaccagggg aataataggg gtgctagtcc tgagcatatc ccccgagggg gggtgtcact 79741 gctgcagagc tgtggtgaca cacatcattt gttggggaag gagatgggcc ttgtctcctg 79801 agaaagcact accttgagcc ctttacatag gtgcaggttt agagagcagg acaactggct 79861 tggtagtgtt caaggggcaa aaagaacctt cgacctagga ggggctcgcc tggcggagca 79921 gcagaggctg cccagggact aggatattcc tttggccagg ccgaaaaggc ccagggctct 79981 ccaaaagaac agcaggtggc gactattgtc ggggtggggg gccgggagct ctaggctaat 80041 acagacactt tggtgtcttg cttctagatc cttgccccct ccctccgcct ccaacagagg 80101 gaaagtgacc ttgccaccaa tccttcagat cgtgcttcaa atagaagaga ggctggggag 80161 gtttaggggg caatgaagtc ctgaaggctg tcaaaataag cccctctagc cagctgccct 80221 gtgtcgcaat agaatccatt tggcctaaag aattgccacc cgcggagtta tcttaaaaca 80281 atatatctca gtagaatgct cctgatcccc accacaacac ctctattcag tgtatgtcca 80341 ggaggctaag gtcaggtcca ccgttccgaa acatctggac attttagcca agctctccag 80401 aaggcgaggg atcctgttgc tgcccaaatc gcgccccctc ctgccgatgc cttgtctgat 80461 gttttgggga agctccttga ttctttatgg ctttgcatct cggcagaatg gagctagctg 80521 acttgggtta gcggcaacac cagaacggga gcgagctggc tgggggagtc acaaccacct 80581 gagccgggag cagaatcacc aagggcaaag cactggtcag gggctaggtt ctgctgtcta 80641 cctgagctct tgtacttgga aaagagattc caaaaagtta ttggatatta aatgtgtgtg 80701 tgcacgcaca acacccacga tggcctgact gatgtattca caggcagctg gaaaccggaa 80761 tacccgaccg aagggaactc ttctcggggc aggggagcta aatcgattcc tcaaatccat 80821 agtgaggcaa agtgcaagag atgtcggtcc aaagaaaagt tacagaaaag ggaaatttta 80881 agaccactca ggaaactcac acccctcccc cagaagaact aacacaacag caatcacaac 80941 agaggaagca ttaatattct tgttaaaaat gcgttcccca aatgtccttg caaattggac 81001 ttttccccaa actctctctc tctctctctc tctgtgtgtg tgtgtgtgtg tgtgtgtgtg 81061 tgtgtgtgtg tatacatttt gttgggtggg gggccacaat ccccccatac atacacacac 81121 atacacacac actacacatt attttcataa tcatcagcag ccccttctct actctttcct 81181 cctggaaaat aatgataata gtaattgaaa tttctgaaaa atgacagcag gttaaatctg 81241 gtcggttgct cattttcaaa gtgataaact tacattcttg ttctttgtgg agccctgaaa 81301 tggtaatgtg agctccagac actgagcaga ttacttacta aaagtcattt ccagaaggat 81361 cccataatca atctataatc aagcttcctg ttgtcccata accacaaagg gacgtggaac 81421 gcacgccgtt tcagacaaat aaataccatt tactaagggg cggggagaca gggtggcagg 81481 gaacacagga cctcagttag gaatgaagcc taagcattct tacaattttg cacattaaaa 81541 ccttaatcca tccatgaaga aggaagtgat ttgcaatttg gggaggctct attaactact 81601 atagtgactt aagaaaagct tctaatgttc agccagaaga cttattgatg actggaatac 81661 cgatgcgcgc agggtttaaa gtaagagatt tctacccgcc ctctgatcaa tcgcgcagct 81721 atgactattt tttttttcgt ttttaaatac ctctaccact tcttactttc aaaaaaataa 81781 ttaaaatgaa aatcaaaagt accattgcat tttaaatttg gaagcattct gcggcttatt 81841 tttttttttg ataaaagtgt aattccacgt gtgttatata ttttgtaaaa ccatccgctg 81901 ggttcctcga ctctaggctt ggtctgcagt gcaaaactgc acaatgattc ctaaacaccc 81961 caaatcctgg tgccaggaaa atgatagcta tgctgagtat gacacaatag ctctgtctct 82021 gtaaagctct tttctctggg atactgtaaa attagggttc acccaccgac attgcaacgg 82081 caaaaatgtg atccctaaag agaaactgag actggacgga cacgacctgg ccaacgcagt 82141 tataggagca agcatccttt cccttggtag tgagggtctc tattgtccca gcctgtcctc 82201 taaccatcct gtcagaggcc cctgccacac aatgcctcag cacttctccc gagcattcaa 82261 ttttgctagc aaatgactag tggttctcac agattggggt atttgtatta gatttacatg 82321 aggagtattt taagtttccc tgtttctgct gcagcattca ctgtgtgtac tcaccccctc 82381 ccctcttcct ctttcctttt tctctgaaga ttctttttct tacgttcagg aaagccaata 82441 gagacctgaa aaaaaaaatg aactaattga gcttgagctc aattggttca tgcctatgaa 82501 ctattttgcc atgtcctacc atcagtattg gcaatgtctc ctaatgtcat ctaaattctt 82561 ttttaaaaat tttccccaga atattaggag ccccttacct cccccccatc ccatacagct 82621 tttgaaaaag aaaaaaggag aagaagaaag aaggcgacta taagaggagc tgcagtgggc 82681 tagtgctctg aaggacagct gagtctcaca ccgagaacag tcttatgaga tcaatatgca 82741 gaaaccagga aaacagcttt ttttccctgc tgctttgaga agtctgttca attgtttcct 82801 tcccttatga gctcccggtg ttaaaaagaa aatgtacatt tgtccctacc aagctcaatc 82861 taacttgttc tgtttccaaa gtgaacttgg gttgaaggtg aagttttcat cttgagataa 82921 tgcctatgtt ccatgtcctg ccatttggag cttcatgcta tattgcacac aatcaaatct 82981 ggccttttga acttatttcc tgaaccccct ccacccccca ccgctactcc ctgtcctgac 83041 cccctgccag cttcagcccc aaaccacctt cctctcccca ccctcaattt gtacacataa 83101 tgagaatgct tcataatcta cactcacata tgaaatctgc aagagacaca atccccatgg 83161 ctggggtact ttccaaagga gccaggacgc aggctaccca gctgcggccg tgctcagatg 83221 tttccacagg aacacaacct gaaccacatt aatctgctta atttcttttg tgaatctttc 83281 cttgcttaga aagaaagggg agtgggtatc tgtgaagacg ttcaggagct ggtatcccta 83341 aatcttgtga ggaagaaaat ccactgcaaa ctctgaagtt acaaataaat gaacatatat 83401 aaataaagcc aatctgactt tcttttaaaa atgatctgaa ttgccctctt agaaacaaag 83461 caaggggaaa gggtaaggtg gggtgtgtaa ttggagtgtg agggcaactt aacaaggaat 83521 aaaaattgtg ccaggttctt gcctggcaca ggaattgaaa cctgtgctgc tttgctgaag 83581 ataggtgggg attcgctaca attacagttt gtctacttca aagctgggct cctctctagg 83641 gcgactttta actgccccta cccggtaggt tccacagcta ttgttatatt attcaaacca 83701 aagctctaga gggggtgggg agaagcagat ctgaggaccg ggtttgggct tcaagatcag 83761 gatgtggagc acatcaagtg tgtatgtagc ccagctgcat gagagagcaa gcttgccagt 83821 ggtaacttcg agtttaaggc aaaatgcagt ctatgaaaaa ttagttacca gctctttgca 83881 acagtagctt cgtatcaact tatttttctc ccatgtccag acgggtcgtc tattgaatgt 83941 ggctctttta ttttgcatgg ggcaggggat ttgtgaccat ggtcccccct tttattcctc 84001 tctctatgga aacccagatc atggggagaa atatgtgaag ctgaataaag agaagctgca 84061 gaaaggagcg cttgagagcg gcatctgtta cacttgggaa gttaaactga caggagaggg 84121 cgaaggaaac ctctccagca gattagttag aacagcaaga agatttgact atatgcagat 84181 attttattca aaaaacaaaa caccttacat gcaagagaag attctatcac agttactaga 84241 ggatgtgatt ggacctgaat tcaataggca gtatctatat tcatggaaac atttttttta 84301 aataaaagaa agaaggaaag agaaagaaag aaaggaagaa agaaagatgg aaagaaagaa 84361 aaaaagaaag aaagaaaaag aggagagaga aaagaaaaat aaaggcgata acgacagttt 84421 gtgctcctga cgaaagctat tatttccatt aaaaagagag tttttacatt aaaaaactgc 84481 aaagtaaggg tgtaaaaaac ctgtccctgg gatcagatct ccctccttac ctctccccct 84541 ccaagcctct ggcaaatggt ctgaaccact tgatgcggct ggcaaaatag ggtaccagga 84601 gcagaggcca tgggcaagag gaaggttgat tgtaagaact caggcaagtc aagattgaaa 84661 aaagatcccc ttcttgatgc aaaagtctta tcatttcata attaccttaa ccccctgaat 84721 ggcttaagat tagcaactct cgcccaaagg cagctgcaag gtgggggtgg ggaacctaca 84781 acaacaaaaa cagcatggca atgatcatga tcgttttaca cgtaaggttt cttcaacgcg 84841 tttccctctg acctctaaat tgagccgaca cagactcaca aatggatctt ccaaaggcca 84901 gcttaacaga gcacctaaaa actttacttt tgtttgattt tgacaagttc ttgtttcttc 84961 cagaagcgga atttcaggca caaatgagat tgcattcttg caaaggaaga tgcaaacagt 85021 tagcagtaaa gaatgagttc cttagcagaa aaggtgggga gttggggggg gcgggagaga 85081 gagaaagaga gagagagaaa ggaaaactca atttctaaaa gcagataagc ctctacaata 85141 agacgtgggc aaatgaagct acagtttcaa tgagagagca ggaagagggg gtgagatggc 85201 aaatgatcct ctctgccaat aattactaag tgtttacctt aaaaaaaaag agtgaaaaaa 85261 agaaatagat ctgaatcaga tgccaggttt tcatgcctaa taaacacgtt tacacttacc 85321 aatatcagct cctattgctc tgtgctctca taagaaatta ctcatttcac aggaaataga 85381 tgctctttag ctatataaag gaatggcaga ataagctcac cttgttttta tttttttctc 85441 attttcttgc acattcttca gatctgtctt cttagagaaa ctaatgcctt aatgactttc 85501 tggtagcagc aagccttctt ttatttctgc taaatatatg aaaatgtatg caaatttaaa 85561 tcaagcttaa cctaggcgct cttgcaatat atttaatgga atttcaaacc aataagggaa 85621 ctctgatgcc gtcttctgta agtgcaaata taattaggca gctaggttac ctccaaaatt 85681 attcaacgaa aggtcatttt tttaacttca aagaccatgt tgggtatctg gtgttgcctt 85741 ttctcagcta gaacaaatca aatctttgcc tttctcccca ctctgcttcc aagcactgcc 85801 tttgcgccgg gctgtggagc acacggactt gtcctttcct aacttataaa agcttagaag 85861 gagcacaagt gtttcacctc ttttattcta tccaaaatgt aaaagcagtc ctatcttttc 85921 aagctgaaag caaattcaag cttgatgggg actggggaag tgggggcggt acttgatgcc 85981 aaacatcccg ccatttctct ctctctgaat ttcctgggca cattaaatgc agggcgggtg 86041 aggagggtgg gggagcggtg aaggttaaaa acaattttcc taggggatgg catcggggcg 86101 cggggtcacg aacgttttcc tgtataccta ttcgtagtgg gtctgcagag tgttttgggt 86161 gagtgtgagg gctgcccgct tcagcaagca gaacctttga cagaagttgt gggatctgtg 86221 atgggaactg atatggagca gtgagaaagc aaggtttcca attgaccagc tagattggga 86281 agcaaacttc gaaataactg aatattgtca gtatcacatt agattttact ttcccagaat 86341 aattcagttc ttcgtgacct aaaaaatttt tcagaaaaat tgaaagaaca ggatatctct 86401 tagaagcttt cagaccttga tatcgaggtt cttaatacaa tgaaataaac tgaaatgttc 86461 ttattggaat cattggtttc aactttattg gtttagggtc caatcgaaga cgttactttg 86521 ctcaaagttt gtagcgacgt tttgttttgc agtcctaggc acacaactag cccgcccaga 86581 cagccgagtg agtctgaaaa atccctattc actgagatca aaatgacatt taaacttgct 86641 ttactttttc tttatttaga ttaatctaaa taatcaaaaa ggagttctac aaccttaatt 86701 ttccacctca ctccctgact gccccccaac aacacatccc tcacatcttt tccccatcaa 86761 cataaacccc gcggtggggg gggtagtaat caaatgtacg gccgccactc tgcgctaggc 86821 agccaatgga gctgaaagat tgattcactt cctgcttggg tctcattgaa actccggagc 86881 ttcaccagcc taatttggaa agcgagcccg acctcccgca ccgctattgg ccgggcttac 86941 tcctggccag gcggagacgc gctgcgattg gccgtggtgg gtgccggtaa cccgtgctag 87001 cgtctttggc tccccgcacc gtagatgtca aaggctgaag ctgctccctt tgccacatta 87061 taactagtag gggatcctca ccgaccatgg ccacagctgc ctcgaatccc tacagcattc 87121 tcagttccac ctccctagtc catgcggact ctgcgggcat gcagcagggg agtcctttcc 87181 gcaaccctca gaaacttctc caaagtgatt acttgcaggg agttcccagc aatgggcatc 87241 ccctcgggca tcactgggtg accagtctga gcgacggggg cccatggtcc tccacactgg 87301 ccaccagccc cctggaccag caggacgtga agcccgggcg cgaagacctg caactgggtg 87361 cgatcatcca tcaccgctcg ccacacgtag cccaccactc accgcacact aaccacccca 87421 acgcctgggg ggccagcccg gcaccgaacc cgtctatcac gtcaagcggc caacccctca 87481 acgtgtactc gcagcctggc ttcaccgtga gcggcatgct ggaacacggg ggactcaccc 87541 cacctccagc tgccgcctct gcacagagcc tgcacccggt gctccgagag cccccggatc 87601 acggcgaact gggctcgcac cattgccagg atcactccga cgaggagacg ccaacctctg 87661 atgagttgga acagttcgcc aaacaattca aacaaaagaa gaatcaagtt gggcttcacg 87721 caggccgacg tggggttggc gctgggcaca ctgtatggta acgtgttctc gcagaccacc 87781 atctgcaggt tcgaaggctt gcagctgagc ttcaaaaata tgtgcaagct gaagcccctg 87841 ctgaacaagt ggctggagga ggcggattcg tccacaggga gcccgaccag cattgacaag 87901 atcgctgcac agggccgcaa gcgcaagaag cggacctcca tcgaggtgag tgtcaagggc 87961 gtactggaga cgcatttcct caagtgtccc aagcctgccg cgcaggagat ctcctcgctg 88021 gcagacagcc tccagttgga gaaggaagtg gtgcgtgtct ggttctgtaa tcgaagacaa 88081 aaagagaaaa gaatgactcc gccaggggat cagcagccgc atgaggttta ttcgcacacc 88141 gtgaaaacag acacatcttg ccatgatctc tgactggagg aagcgaggag gcggccggcc 88201 gcactgggag cagcgcggat ttctctttct ctctcactct cttcctttca ttctagtatt 88261 ctttattatt tttctctctc tctcgttcgc tcgctctctc gtactctctc tcttttccct 88321 cctttccttt ttctttcctt tccccttttt ctttcccttc tttttccctt tcctttcctt 88381 tcattttctt tcctttcccc ttcccttccc ttcccttcca tctcttcctt tcctttcctt 88441 tcttttcttt tgctttcctt tccttttttt cccttttctt tccttttcat aagaggttct 88501 aacttctgtt gacaaaggaa acacatactc tctcattcag gcttctcaat gctgatacac 88561 agttacatta agcagtccaa gctgggatct catattcctg gctccccggg caacagttcc 88621 ctttagcctt ttctgctgat caatacatat tgtttactca gagtaaggtt tgtttggtcg 88681 ctcctctcta ggaagaacaa ggagtgggat aacgtggggg cggggtgggg cgggggaggg 88741 attgggaaga cagatgtgtg cctctgaata acctttcagc gccttggtta tagcagctgt 88801 atttcaggtg aaatttgttt tacaatagac tagttttgca tttttaaaaa cttctatagc 88861 gtttctaaat gtctgcggtg ttttactaca atctgtacac aatatttgtg agataatttg 88921 tatctaatcg accactccat gttattattg ctattattat tactacttcg ttgagctgat 88981 ggctttttaa ttgcttagaa aagggaggag atgaatggga ataggcgggg agagagatgc 89041 ctatctactc accgcccccc cccttcccaa atctctatga ggaggagagg atacaacgaa 89101 gagagttatt gatgatgata ttggttatat tttttttccg gggtggagtg cctaggaagg 89161 gcagaatagg gaagcctatt aggtttgcaa actagtgcga agtagcagcg gcgatttcgc 89221 agctttctgc cctccctcct ggaagtgtgg agcccgcgca ggctaccgca gcatagatcg 89281 aggtatctga gaaggattcg gcggcaactc tgggccaaag aggtggcctg acagtctggg 89341 agggctccgc actcaccgga ctccaggatt cctgcgcgca gaaatccagg cccccagcct 89401 cgctagggcg accttccgcg cgacctcact tcggccctcc tagaggaaag actttgcctt 89461 cacacgcgat tactttcact ggagatcacg acccggagac gctggcgact gggccagcgc 89521 ctcccatttc ggttcacctc cagttcaccg ccccctctga gtaagaaacc cttggggagg 89581 ggtggcgagg gagtggggga taggcattct ctgctcagtg cagaacggca atgaaatgat 89641 tggcaaaggt cacagggatt ctggtccatc cgcgtaccac ctctcccccc atgtatatac 89701 cgctgtctgt gtgtgggctc gtggggagct gtctcggcgt ttctgataag cacagctggt 89761 ggaagcccac agcgacgatc cagcgagcta gcgagcgccc gggaggctgg gttcctgctt 89821 tttaaaatat aaatatatat atatttttaa atctctgtgt gtatgtgtct ctgtgcgtgt 89881 gtgtgtgtgc gtgcgcgtgc aaacgtgtgt gtgtagatag ctgtgtgtat cgcgttacta 89941 tgtacaaaaa caaaaagtac aaaaaaaaaa aaaaacctgg ctaggctgtg tagagatccc 90001 aaaaatatgc agttgctgtg gctttattgc tagctagtat ccacacacca ggtccacagg 90061 cgaacctcat tcccaaggag cacacacgtg tgaggccggc caagcaccca acacccagcc 90121 gccgccggcc agacctggct gggggcgctg cacgccactg ctggccgcct agtccggcct 90181 ttaggcgcag ctgctgtaga caaagcttca aactacctaa ggaccagaaa gcccggcccg 90241 gttgtcgagg atgcgttagt atgagttgcc aggcctctgg tccaccacac tgtgtctccc 90301 tcagctgcag gtcgaagagc atgtgttaat tcaaatcccg gacccttcta cttttttgtc 90361 aagtctctgc attctgttgg tagttaggtt ttagttctgg gattttattt ttttttcccc 90421 cacactgagt tcatgtttta caactgttgg gagttttttg tttgtttcga ttttttttct 90481 ttgcagttaa ttgctatggt tcgaacctga gttaacccaa atatttctgc cgactttata 90541 actgtacagt tttgtatatt aaacgttttt gaagttatta atttaaagaa agatcattta 90601 gtttattcat ttagtaactg atgtataata aacgctattt tcattaagag ttgctttttc 90661 aactatttta ttcaaattgc ctattgcagt tgccaacaat tatttatttt caataaattc 90721 tgcattgtgt ttaaaaggaa attctttcac agataagttg gatgtcaaaa agaaaacaat 90781 tcgctgtaat gtttgatgtg aagttttagt caaggggttt atattgacgc aatattttat 90841 tgttgtaaaa gattaaaaag gtttaaataa aacatttttc caaaaaaaaa gttttcttct 90901 cctggtctta ttgtgtgctg tgtgaggtgg agcgtttaaa cagaggcaca ttccgggtga 90961 acagcacttc tgctttgctt cagcttataa atcattagtc attgttttga ctcttgattc 91021 ccttcctctg actttcggcc tttcagctca gctctgaaat acactggcgc acacggcacg 91081 gcctattctg gagcccacgc tcacgcaaag cacagcacaa attgtgggaa cccgcatagg 91141 gaaggctcga ggtgggtttt attttaggag gtgcaacctt actaatgttc tccaaaaaga 91201 gagtctctct ctctcgctcg ctcgctctcg ctttctctct ctctctctgc cagcagatag 91261 acctttctcc aaacctgcct ctcctgaaac tgagctcaga tcttctataa ggctgtcatt 91321 tcgcctcact gcacccatcc ttcttcttcc tctccattct gttaggttcc cttttggtac 91381 ctgatccctc attcccctga ggagggatgg gaggatgata ataggtaagg agaagcactc 91441 tccctcttaa ggctcttcct aggcccagaa acgcaggtaa actgtttatc ccaacaatgc 91501 tgaactctgt ccattcctat ctgagtatgg ggggaaggaa aacataatac aaggagatat 91561 tttgggggtg ggagtttgat tacactcttc ttcccactcc catacacaat ttcatttgtt 91621 ttattttatt tcacaagacc ctccataaaa taagtgactg ttacctttag tgatagttga 91681 ggagtaaatt accaaatctg aaccagttct gttttaatgt tttgtatgta caggcatgcc 91741 ttattcaact ttttttggta gatgcatatg aagcaatgaa caggacattc aataatttcc 91801 cagcagtaaa cccccctcaa ttcatagtct gaagtaggtg tttttgttag aggacccaaa 91861 atgatttaat catttgaata ggtaatgatt acagcatatg atgtcttgta tttttcaaag 91921 tgcctaataa tttacagctc catggtgcat cacacacgat tgttgtgtat gaaatacggc 91981 acatgctgtg tgtatagtca tgggcatatc actctttata aataatattt attctataca 92041 ggtatatgta caggcacaaa actcaaagtt ttaagcctct tggttctctt tcagttcata 92101 ccaactaact tgcttctctt tcaatatact ataaaaatta gaattggtta aaaagttata 92161 gcttttctca caaaactgca caaatatcaa ctttcaatgt atcctgaact ctttaatgga 92221 accttttgta gatattttag gagatacaaa aatatatata cacatttatc actatcatca 92281 gtacttttat taatatacat tactcatatt tttatatgta ttttcagatt tgttttcttt 92341 actgctcctt ttctttctcc tgaaatataa tcaagtccta actggttttt ctgatcaaca 92401 tttttgcaac atgaacatat tttcactaaa aatcgggaga tgttggtgtt tctgcctggg 92461 gcttccttta tcttgagttg ctgctaatcc tgcttcaaac caaggccttt gtaaaaaaaa 92521 aaaaaaaaag tggctttgtc ttcagttttt ctcaggcctt cacaacagta gctctctttc 92581 tcatgaaata agaattggta ggttccaaag aacaacaaca gtaaatcata tctaatgggt 92641 aaaagcaaaa aagttccctc tccttcacca gatcagaaca atagaaaatg cagaggttgt 92701 ggagggaggt aggatgtgaa aagagtgaat atacaggtgt tacagatgtg taaacaaagt 92761 caaatatgca aatcaactat atatttgcat ataaacatac cttggttgtc taaatggctg 92821 tggctctctg catgcataac aaaagggaaa tagattacaa aaatcaagca tttatatttt 92881 ggacaataaa ataagaaaat gaatgttatc cctggaaatt tgtttcttac ctttaaaaga 92941 aaagggtaag gggaaatatg cttaaattag tataggaagt gtgcttctct ttaataaaat 93001 caaagactat tcccttggaa gtatttagca ttcccatatt tctcttgctc ccaaatcagg 93061 ctgctgttgt gttaattgtc tgtaggctaa aagtagaaat atacatttct ggttttattt 93121 aaaaaaaaat ttttggcggt gggggaagtg gggaggagaa gaaagaggag aatggagaag 93181 tagaaaatgg ttacaggtta aacagttcag ttttccaaag agaataacgt attttcccac 93241 ctgtcagata ttgttaaaga cacatatgga acttcagaat tgtggaaatt ttttctgtga 93301 ctaagtactt aaaaatccaa acatgtgacc ctttcaaaaa aagtcttgaa aggaataaaa 93361 agtaacaatg cagcacagtg ccattagaat tgatgtcact cacactcatc cattctgtaa 93421 aataaaagag tagacattct ttatacatgc ccctgatagc agttttaaaa tagtaatccc 93481 ccagcttatt cacctactga tattggtttt taaagcatta agaacagtga atataaaata 93541 tatttgaaac caattaaggc ttttttctaa cataaaagta tcttttctta agagcactgc 93601 tggtgcagca tttcccaaac taggggatta gttaaaagaa tacagtcttg aaaacaaaat 93661 ttagttaatt tctgcaatct ctaagaaggg gtagaaacaa gaatctagac tactgttcct 93721 gtaaaccaaa tccacaaaat taaacatttg tgcattccag taaaaaatgc attgccttac 93781 tatgctttct ttcatcaaca tgagaatgga actacaccta ttcacacttc acacattatt 93841 taattaacta gatagaagac tggattctct ttacaacact atttgttatc aggagcattt 93901 ggtataaggt tcatcttata tggcttccaa ggcgggtgtt tattcaacta ccaaaagggc 93961 cttattatga agtgcactca tccttgaagg gagggaatct caggacagct tgctttcggc 94021 tgaaattcta cagtatttaa tacaagtata ttgtgtagac acatacccac agtcataaat 94081 gcattccttt tccttcccct caaaccctac caccccaccc ctgcacttgc acacagacac 94141 tatgatttaa caaaaagcag actgtccctc agagaccaaa aaggaagaga tcaactcatt 94201 gaccaattta ttaagtttgt cttttcatct gtcttcctag tcaagctgtg tccacaccat 94261 tctggtgcag tcaactgaaa agagaaaaaa atctctctgt caaactctgt tttatctatg 94321 taatgtctca aacctagttt gagtgattct ttctccaact ctgatacata cagaattact 94381 gtttgtgttg gcattttgta gcaatttcta ttatatcagc attgcttctt gggagaaagg 94441 gcaccatcta aacattttta atattgatta cttctcatct tccttagtta tttttcacat 94501 actgtagaat cccattctaa tgtagctgat taactctcac atttggccag actggagacc 94561 agcccttacc ctgacccctt ttcacagaca tggaaaagcc tgctataaat gtatagtgta 94621 aagcttgtaa agtggatgtt agcaaccccc caccagcaga ccaatattaa aaagaggttt 94681 ttacacataa tttacatttg gtaagagatt ttcctctacc ataattcttg caatctgagc 94741 acagaaaacc acacacatac ccataccacc tgcaaagatc actgctagac atcatatttt 94801 taaagcacag aaaatagaag gctatggacc agcggcagta gttcgttggt cttcaaatgg 94861 catgtaattc caagttgcag aaatggctat ctgagtggat gtcttgcaga gaggaatagt 94921 atttttcttc atgatttgct attcatactc tacctgtcat ggtactggta gtattcatgt 94981 ttaattaatg tcattttttt ctataaaatg tattaccaca agagcatatg tatacatgca 95041 tatagaaaag atctgaatca aaggcagaag tatctctcaa acctctagtg gtgtagaaat 95101 taagcagaaa aaaaatcata tttgtgaact atattggcat aatcagggac tacgtctgtt 95161 cagaacagtg tacagcagcc acttctaagc aatagggttt gacttgctcc ccagtggaca 95221 gtagttctct acaaatgaat ttctctcacc ataagagagg agttgagtgt gtttaacctg 95281 aaattctgaa ttctgagcct ttcactggag taagagtgtt tgaaattcta gataaacata 95341 cttgtattaa tcaccttcat tttccatgtt atttcctggg tgacaatact ttaactcctc 95401 cagattttat actaaaagtg aaaacaaagt aatttttaag gtagttattt catgtgccct 95461 gagtcttccc cccatcactg tccatggaga agtaggcaat attggctcta tagatgctcc 95521 ccagcagcct cctcagggca ggtaaaggaa gttagtgtag aaaggattgc aagacaaagg 95581 atctgctgcc tacactgaga gctccagcct ctcatctaca gtggaaaatc aactgataaa 95641 atacagtcct atagtctctt ctactcctta atgatctgct tctgcatgtc ctactatgat 95701 gttttcgatt gaattagaaa actaactagc tttgttttca atggattaaa gttatgtcta 95761 aacaagtaat ttcaaaattg ctgacttgaa atctttttat tttctagagc tttctctgca 95821 caattatttg cactttgtat tactgtgttc ctttatgcat gattttgaga actcacaata 95881 gaaaaattta agttaaaggc aataataatc ttttaaattt acatactgtc tttcctccct 95941 aaagattcct aggcaatttg gaaatgatgt gcacactata ctttgcaaca atcaatatgc 96001 aaagcacaac aaaatgacca cttcaggcca cacaaaatga tcacactaag accaggcaaa 96061 aagatcatta atatcaatag gcatcctgtg cacaactaaa gagcagtata tgccatgggt 96121 ctcattcatt gctgtgtgag agtatgaaag tgcttgtact cacatctcaa acaaatcaag 96181 tccaacattt tcaactcgta tcatataaga gataataggc aaataaagtg aacacaaaaa 96241 catttgaaaa ttattgatta aagttgtctt tgagtcttct ggagaaaaac aagtattttg 96301 gtaactgaaa gaccaacaat actcaaaggt aattgtgcct ttaagatatg tgatattgta 96361 aattaaaatt gttgtaagca tttgtgaaaa acgtaatgac aaaatagcca atttgaaaat 96421 cacagactct catttgcgtg tgtgtgtgtg tatgctaccg cctatctata tttcagtctg 96481 tattttgtaa aaatgaccaa gctgttcaga ttgtggggga gggggtggtt agttttaagt 96541 aaattaagtt aaaacaggac tctcaggtca gagactctta agagtttaat agagaccaac 96601 ttatcttttg gctcattttt aacacctatg acaatgagat atgtgataca aatggtgact 96661 cctctatagc aaaataagtt taaattatat taccattacc cccgagagta tgcaaccatt 96721 ttaaaaaatg ttttgaaaaa catatcaaga atcaaactag ctgtgatgat attgaatatt 96781 cgctggaata tatgtgttgc tctaaaactc tttgcatgtc ctaaccctgt ttcatgcctc 96841 tggctgtcag aattttgcag cgcccactca aagttcagtg tcaaattaaa aagtgacaag 96901 aatggagcag cataaataac agcttaaagc ttttctctac tttcccacat acaaacatat 96961 gaatttcaaa acccagcagg ttctgctgaa taaatatgtt cagatttcat atataagttg 97021 cagaataaaa atggcaacaa tattcaattc aacaaaaatt tattgagtgc ctgctttgtt 97081 aaagtggcat gtgaagcatg tggagtaagg caaggagctt gcttcaaagg gctaacaata 97141 cctgcggtaa gataagaata agaaatgtac ataactaact gttaaataaa ggagtgctag 97201 agatatgtac tattatcaat gtggatgcca ctactctcta tttttattgt tcctttttta 97261 ataatacaca ttcataccca aacatagcat ttatttgtgc tattttaata atattgtgca 97321 tccttgttga aacccaggta gatgcaatta ttagttgatt tatatgtcta gacaataaaa 97381 gtgttaattt gaaaattcct ataactgaat ttagctctgt tgagcttgaa tgacttttgt 97441 aaggtatttt ttgttttgag acaagctcta actttgttgt ccagactgaa gtgcagtggg 97501 gtgatcatgg atcactacac cctagacctc cggggcttaa gtgattcttc ccattttagc 97561 ctcctgagta gctgggacta caggcctcaa atgatcctct catctcagca tcccaaagtg 97621 ttgggattat aggaataaac taccgagcct ggcctctaag aaaatgtttt taaagggata 97681 aaatgttgaa gaataaacac actgcaagct tctaaaagtc accttctatt cctctgcttc 97741 ccattggaaa actgggaatt aatggcaggt aacctgagca atgtgaaata ttcacctccc 97801 ctaattttca gtccttttct aattagagat tttgtgtgtg tctcacaatt ctgtaactag 97861 ttacatgagt taagttagta cttgagttaa ggaggaggga aagactcagc agcaggtgct 97921 aggaatcaga ctgcaaagca aggaggttct gtttgatggt tctgctccac tgtggacttc 97981 attccatctg atttataaca tggtaagaac aataggcaaa gagaagcatg aaaaaataca 98041 tttctagact agaaatccct gtttgaaaaa ttacgggtat tgtgatttaa aacaagtaag 98101 gaaactaaaa acatccacca gattggtaac agttttttgt aaccaaattg tccccagcta 98161 aaagtcggcc tcattgagat ttttggtagt acatttctcc attgactagt aatgtatgga 98221 ttcatttcta tgaattcagc aacctatggg gaaaaggcat tgatgtttaa agtatgacta 98281 tttagttgac ttgttaggat gatattcaga gcttgaagaa acctaaatct ctaacaaaaa 98341 ctcattaagc atggaatgca ttctgtgctt gttttctttt ctttatgtat ttttcccttt 98401 ttgaaatgca ttctacaaag caaagcttgc agcgctattt caaaatttcc aattattttg 98461 tggaaaatat gctcagcgtg aaatatgttc cctaacactg ataaaagaca taatcagaga 98521 ggaataagaa aatcattctg aatgggcctg catttgcact cttaacacag agtaatgaga 98581 gacaggagca tgcttatctt aggggaaagg actagaaata cagtagacaa tctcatgaaa 98641 acgctgtctg cctcactaag cttttcagga cagactgcta ctgtatcagt catgctcttc 98701 agggaacaga gatgaactcg ggtagagttg catcagtgaa atagtgagca atgctccaac 98761 aggatcctaa aagactgaac gaatgcattt tgagtttgtc aggcttaacc cagatgtatt 98821 tatgtccatc aatattttgg ttgtgatatt gtactatagt tttgcaatat gttactattg 98881 ggagatacta ggtaaagggc acatgggtgt ttctgtacta gccttaaaac tttctgtaga 98941 tctacaatta cctctaaaaa attttcaatt aaaaacttca aagcacaggt ttaacaatta 99001 aataaacgct aactcaaatt atttatctaa cattcagtaa ttttcatgac aataaatgtt 99061 gacctttcat ttttctgttt ttgattttct ttttttgttg tgggggaaag ttttgggtga 99121 agcagtgtct gaataaaacc aggctcttct gagtttttct tgaactttca ataagctcct 99181 gtctctggca cctacagagc cctcaatccc acactgggac actagaaatc ctgatcatac 99241 acctggaatg ctttctgatt gatttcttgt cctagctgag gtaactgcct acaatatcag 99301 ccacaatacc aacccaggga ctggcacccc aattaaatga gtgcctgagg aataacccct 99361 caaaaactag gtatggaatc ccttgtggaa atatgagttt aataatttca gaaccattca 99421 ttaattcaac aaacatttat tgagcactta ccatatacca gaattttgaa atcggtattg 99481 cttttgggga tttattttta tagtctggat tttgatttga gtttgaattc taggtttgca 99541 gcctatctag ctgtgggaga atgagcaagt cctttaacat ctgtatctca gtctttctgc 99601 ctataaaatg ggaataataa ttgcacctat ctaataaggt tgttgtaaaa tttaataata 99661 taatcacaca aaacgcttac cttggtgcct gacataaagt gctcacttga tattagctct 99721 tattgctctg tttgtttctc ctcctattct atgttctgta ttatgcataa aattgcaata 99781 tggttgaagt tcatttaaac agagtggaga ggaagtttcc tcttcattat tgactactgt 99841 agtgcttcac aacatctttt tttccccatt ttaacacgca tgacaattga cccttatatg 99901 agcaacatac tttttttccc ccagcattat gtgcaattcc ctggacatta actgtaatgt 99961 aggaccttac gagtgctata tcatcaagaa aaaccttgaa tctgttctga cttttttttc 100021 cttttctttg gaggcagagt cttgctctgt caccctatct ggagtgcagt ggtgagatct 100081 ctgcctactg caacctccac ctcccatact caagcctcct gggtagctgg gatcacaggc 100141 acacattgcc acatgtggtt tttttttttt tgtattttta gtagagatgg attttctcca 100201 ttttaaccag gatggtcttg aaatcctggc ctcaagtgat ccgcccgcct cagcctccca 100261 aagtgctggg attacaggtg tgagccaccg tgtccggcct gattttattt attttttaac 100321 aaggcacata agtctcacac ttcggtactg caactaccat tagcaacaca ctctttttga 100381 tgcaccccac aacatattga aacacaatag tttgttgtcg caccaagatt gaaaattgct 100441 ggtgtacagg gatatctatg ggttttccaa gctaaaccat aagcaagtaa cactcttatc 100501 ccagcttaac ctttataaat aactactatg tcttattccc attaacgaag aaaattgaaa 100561 agtgactcca ctacaataac atagaaattc ctatctcctt tgtgttcacc atcacttata 100621 aaaaaagatt catatttctt atttctatac ctatttctaa cttaaatcca agaaggcttc 100681 cctgaaagtt gtgcagttgc ttattccctt tcttagaata tttgaaatat attgaatatt 100741 tgaaattgtt aaatattcta gggaagagaa taagcaactg cacaactttg accttttcag 100801 ggcttgtgca atatgattca ctatccattc cttcctgttt acactctcaa taacctgtag 100861 agataaagca gtatgaacag ccaatcaaaa tgcagaatat tgaaaaggtt aaagtctatg 100921 tttatgagtc tattaaatgt ttatagtctt tcacagactg cactgggcat taaaactgtt 100981 gtatcttgtt ctggtcataa acattgcaac aagaaagaag tgggtaaaac taatttcatt 101041 cgcaagacaa aatttgtaat gagaatacaa aatggcataa tatgtattaa tttgttccca 101101 tgacaataac tgcaagaaaa acagaaaatg gtgtcaacct aaaccactca ttagggtgga 101161 aacaggagtt attaaataat tgttaagaaa ctcaatttaa aagtctagta cttagcgtga 101221 taacttacca ttcatgcttt ggaaaaagat gagggacatt tgaaccccat aaatccataa 101281 ctataaatca gctgactcct ttctaacctt tctagccaaa gaccaataat tgatgtgact 101341 tcatttttag gtagccaatg atgatgcacg tactattatt tctacatctg ctttaaattt 101401 gaaatgtatt attattttcc tggataatac cttgaagtat ctagagtttc atagcataat 101461 tatatgttat ataaaaacat actgttttgg tcttgtcatt aggtcaactg ttaggcctta 101521 ttacaatgaa agaaaaattg ccttttatat actatgctta tccaaattaa ctgtacttag 101581 aatgtctcct ataagaaaag ctaaaatatc ttggtataaa tcacacagac tgctagatgt 101641 tagctgcagg catacgatcc ttgattaaaa tacattaatc ctttgggagg ccaaggcggg 101701 tggatcacct gcagtcagga gttcgagacc agccttgcca acatggcaaa accccatctc 101761 tgcaaataat ataaaaatta gccaggcgtg gtgacgcatg cctgtagtac cagctactcg 101821 agattcttag gcacgagaat ctcttgaacc cagaaggctg agaatgcagt gagccaagat 101881 tgccccactg cactccagcc tgagccacag agtgagactt tgtctcaaac taaataaata 101941 aataaataca ttaatctgct ttttgtttta aatgtaaaaa ttattaacat ttcctagtat 102001 tactaaaaat cctagcaaaa acaattggat acttttattg taattgtact ttcttctacc 102061 agagaggaat aaaggggtat tttagggtat ttggaacagt gatctgtata atgtaaagct 102121 aatgaaaaat atgttagcaa tttgacgaca cccacattta caattatata attttgagaa 102181 tctttttttt ttttgctgta ctctttcagt tgtgaaaaaa ttgaagactg aacacctaac 102241 acaattaaag acctagtggt taaaacataa tttcatttga aaggtaacat tctaagctga 102301 tttaaaatta tatgaattat tctttcctat tttcaacaga tttggagaag gcagccacta 102361 aatttggaaa gaaaagaaaa atgaatccat gactatcata atgacctaat tttaacacat 102421 agtggttttc tgaagtcctt ccaagtctag acacttcatt ctaatacaac aatactcagt 102481 atgtggaatg aataaataac aacattttca tactaaaggc ctctatgttc ctctgagaaa 102541 cattttggag agggtaaaat taatcttggc tacttatctc tgggtttgtt gcagttgctc 102601 ttggctcccc tagccctata aaaccctatc aaagcctgac gatcacgtaa agcattcttc 102661 ctctctcctg ccttctcaag tgaaagaccc attccctcat tggatctact catcattact 102721 ggatcaaata tactcttagt attgttccca ctgaaaaaaa gaatgccaac ctttttcttc 102781 ccctgtcgca caagcccacc tgccctccaa ttaagagtaa ttgcaacaaa gattcttata 102841 ttctctattt tgatatcaga tatttatcag aaaggtgtgg ctttaatgaa gtcactctta 102901 ccttatgttt ctggcataat caacagtaat agctagcact tatatagcac ttgttatgca 102961 ccatcatcct aagtatttta tatatattag ctaatttaat cctcattata actctaaaat 103021 atggaaatta tttccttgct cattttgcag atggggaaat ttagatcaga gacgttgtgt 103081 taaatttata agattttata actaataaat actagagtga gttttcaagt tagacatctg 103141 gctctagaat ctgtactctt caccactgta ctacaatgat tccctacagc aagtgctacc 103201 tccacaaaca ggtatatcac ccctctgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 103261 gtgtgtggtc acataatggg gtatcctaag ttttgaggat ctagaaagga agtaatgaat 103321 taaaaattag aaagtaccag gacctgggac ctatgatgat aatcaaaccc taaaaagatg 103381 aggatttcca aaatctattg gtgttatatc tacaatttaa gaaaattaaa ggttctacat 103441 gaaattgttg attggaaagc acaaacatta caaaatttag taagttataa aaattaagta 103501 aatacaatct actatttttt acataattat tttcatctta tgatcaagtg ctaaaacaat 103561 gatttcagtt atctgtcttg gaatacagtc actgctagag ttctaatctc tgaagacagg 103621 gactgggaat tatttcctca gaaactagag acatgcctga cataaaataa gtactcaata 103681 tactgaaaga atgaataaca agagctatta ataatgatta cctactaagt gttaggcact 103741 gtgtaacatg ctttgcatat atcatgttga atcctttaac aatccaaatc taagacaggt 103801 tccattatcc tcagtttaca aattagaaaa ctgagtttta gaaagtttta gtcatttata 103861 taaattattc actgacagaa taataattca tacctatgtt atattccaaa ttctaggtgc 103921 ttttcattat gttatttggc aagaatgaat aaatgtctag atggtttatt ttctctcatt 103981 ctgaatgagc agaaattatg acccaactta tctaattcct gtcttcaaaa tagttcttat 104041 ttccaaagaa ataagaaaga aaaaagggga aagaaaagaa atagttaaac aggaaaatga 104101 aacagataaa gttaatgagt agctatacat tcttatagaa tgctagtcat aattctaatc 104161 aaagtgtgtc ataagaggtt atttactagt cagtctcccc aactatactg tgagctctcc 104221 caggacagag aatgtatctt ttattcatta cagtagtacc taccacagga tctagttcat 104281 agtaagtcat caaataactt ttttaaataa atgataggag agcattgata tggagggaat 104341 ggacagaaat gggggcaaga cagataagaa caactctaga gcaaagagct gaggtgggaa 104401 caaaagtata ttttacaggc ttacaatttc aaatgccttt aggccagaaa aagtaatttc 104461 agggagtggc atagaccagg cattagatag tagggaggag caatgactga ggcgaactgg 104521 acagttcatg ctccttctca aatcacctat ggaaaagaga caattaaaac tagatattgc 104581 ttggacaagt tgtgttatta gcttaatttg cctaatgtgg gaacagttta agactatggc 104641 caggcacagt ggctcatacc tgtaatccca gcactttggg agaccgaggt gggtggatca 104701 cgaggtcagg agatcgagac catcctggct aaaatggtga aaccccgtct ctactaaaag 104761 tacaaaaaaa ttaaccaggc gtgttggcac gtgcctgtag tcccaactat tcaggaggtt 104821 gaggcaggag aatcgcttga acctgggagc cggacgttgc agtgagctga gattgcacca 104881 ctgcactcca gcctgggaga cagagtgaga ttccatctca aaaaaaaaaa aaagaaaaaa 104941 aaaaagacta tggcccccac tgacttacat ataagtagaa atagatgcaa aaacagcagt 105001 tatacctatt ttgaacagat acctgcattt ttttccattc aacaaatgtt tcttgaatgt 105061 ctactatgta ccagatactg ggacaggcct tagaggtaca atggtaatct ccgccctcac 105121 agtttacttg aagagataga aacatgacaa acagatctat cgtcatgccc tctattgtct 105181 aggagaaaca gaagccatgc taaaatagta ttcaaaagtc tgtttttact atcttccttt 105241 ctttctcttt cattcttttc caactcccca ctcaagacca tcatgctcct gtcattttcc 105301 atattcactc tctggctatg ccaaatgtct ttccaggcct tccctcctca gaaatgcttc 105361 aatataatct ttcactctag cccttaataa tcaaaatttt gtaaagactc tgagaataaa 105421 aatgtcttat catatgaata aagtaattca ggctaaatat atgaatgaca gcagtgtgct 105481 aagtttactg agagcataag ctttgcatta catttcctct tttccacaac aatgacttgt 105541 aaatagtagc agctcaataa atacttgtct cacctttaag gtctttaaaa acatctatct 105601 cccctggcct atttacattt taaaagaaaa gcaaataaat tggaataatg gtgggaagta 105661 gtcagttgct aagacagtct cctggtgctg tcatcctaat acaacccctt tggatactgg 105721 gggttatgga cagaattaca gggacaattc agaattgtac tatagactgg tctggcatgg 105781 tggctcatgc ctgtaatccc aacactgtgg gaggctgagg taggccgatc acttgaggcg 105841 aggagtttga gaccagcctg gccaacatgg taaaaccccg tccctactaa aaaatacaca 105901 aattaggccg ggcgtggtgg tgggtgccta taactccagc tccttgggag gctgaggcag 105961 gagaatcaca tgaatccggg aagcagaggt tgcactgagc caagatcccg ccactgcaat 106021 ccagcctgga agacagagca agactctgtc tcaaaaaaaa aaaaaaaaaa aaaaaaaaat 106081 ttatgataga ctatcctcta ataactccaa agctttagaa gaactgtttg caactgtttg 106141 caattatagg ttggttgtca ccttgattaa tattttagag taaatcagcg aggggacaag 106201 tagtaaagct acatttggtt ttgacttttt tttaaaaaat gagaagtagc ccatggattc 106261 tcaagtggtt ccttttttgt tctgtgctag atttgacatg ctaaatattt tcaaaccctc 106321 taaacagcag tgcaaatagg ctgtatttaa tatcaaaagg ggagataaaa ataaataaaa 106381 taagaatgtt gttttctctt tcacagtctt cttatagttt tataaaacca taaattctgg 106441 agcataaaaa aagtagaaaa agttgcaata aaaataaata ttaaataaag aggctgctcc 106501 atcttcctct ctctctcgtg aaattctctt gatctttcca attttatgtt atattgtata 106561 ctaaagaaac ctacacattg gtttgggcaa tgtgcgagtc ttactaataa cggacattga 106621 aagctgctga tttaggacac tgtggtttat tttattttaa acatattttg taaagagcca 106681 acgtctaata tactgccttt attgtctttg ttccatgtgt acacacctta tctcagcaaa 106741 attctaagtt ccttgagggc aaaaacagtt tatctccgta tcttcgtagc acaatgatta 106801 attgaaaaca gtcataaata cctacatgta taaatacata gataaacaaa tgtttgttaa 106861 tatctggtca tgattgttgc ttccttcatt aagtaataaa cacacagtat gtgtaagtta 106921 ccctaagact caacatgtta aagaagtctt cctgttttgg ggtgtacatt tttgcttctg 106981 aacttggctc ctggtgtgca taattagcca cttgccaaca taattggcca cttgcttttt 107041 ttttaaggat attcagtggc aaagtgcagg tgcacttttg ggacccataa ctagttgcag 107101 ctgaaaaata aatgaaaaaa aagaagacta attgtgatcc tctgacaatt gggctcttta 107161 gggttataac ccccacagcc atcaaacctt ccttaaaaat cttccctcag ttccattgtt 107221 aagaatttta tagatgtttg gaaaatataa tttgttctaa aatacaatta cagcatgtaa 107281 gtaaaaaaaa aagaagaaga agaaaaatat tgaaaccaat ttctacagaa actctctcaa 107341 ggagcatttg caagattcag tgtagatttg aggaaagtct ccatgttata aaaccacctt 107401 tctcttctgc agaaaatttt catttgagca acttgtcatt ttgtccagta ttagaacaag 107461 ttatgcaagt caggagctca gtggagaatc ctttcagctc acttaaagtt tcacataatg 107521 gtgatttatc aaaaggaaaa atgtaaaact aaagtttgtc acacatcaag agtgacttcc 107581 ataaaagaaa aataaaaaag ctttcaataa aaagaatgaa gaaacctact aattagtcta 107641 caccatatct agattctgag caactctact gctattgtca gtagtcaacc actgaaggtc 107701 ttggtttatt aactggctat tgttgaaatg atactatata ataatgcatt gcagaaacac 107761 agcgaattat aattaaaaaa ttgtatttct tattatgggt ctgtgaatca tttggtaaac 107821 ttctgactag gacttcgagt tgattagtcc tggatatacc atgaattgta tttaagtatt 107881 ttcctcatat tccttcttct tggactagca gaactaatca gagtgtatat ttctcatgac 107941 ggatcacaga agggcaacct gacaagccaa atcatgcaca tacattttaa acatctattg 108001 gcattatttt cactaatctc ttattggaca aatcgagtca aaaggctaag aatacactgc 108061 acgttgagag gatactacaa atgtacctgg caaagtgtgt ggatgtacaa ttttattaac 108121 aatagagggg aaaatcaaga ctgataatcc aatttatcat aattgggtac aaccgtattg 108181 actctgataa attacagtgc atttaaaagt ttcagaaaat aattttgtac aaagaattat 108241 tgaaagaact gaggattttt agcctccaga atgtaaaagt catcggttag agatgctaga 108301 caaggaattc ctgcctttgg gaaaatggtt ggataaaagt ctcatttagc tctaggagag 108361 tgtgtgatta tttgagacat ttccttgtag gaacattggt taacagactg gagaaaagtt 108421 ggcaggggtt gggggagttg caggggcagc aataggtgag tctttgtttc aaagggtggg 108481 atttctataa gttgttgtat aggctgcttt tgtttgcttt gctgttgaag gagaggaagc 108541 tgtcattatt catgttgttt tatattccaa gctctcctct cagggttcat cagtgtattt 108601 caaaatgaag agcatcatct gagtcttctg agacattttt tagacattct aattactgtt 108661 caaccagaaa aataaacctg cataactttg gcaagtcact ctcatttctt catttaaaaa 108721 ttgaggggct ggatgatata caatctcact ggaagaggac aaatatacta gcatttgagt 108781 accttctatg tgcctaatta tggtgctctg tgctttcaca ggctctggaa gaccttatat 108841 tctatgcttc tgtgatttta tactgtcatt cagcacctgt aggtaacaat gttaaatagc 108901 agaacaactt tgtgaactac tgttgccatc taatggattt ttaataaaac tgtccataaa 108961 attcacatct gtgacctttc tcataatatt cctctcctca ctttttcccc aaaaggtaat 109021 agctactgat ttatttggcc ataaaatgca tcatcataat gaacgatgtg aaatccttta 109081 aaagagcttt agtttcaatc actcccaagt gatcagagaa ctgaattaca tcctagccac 109141 tgaggtttca aaagatttca atattaatac taacagctat catttattga gcatttattt 109201 atgtcaggaa ctacactagt agttacatgt gtaattttat ttaagcaata atatcttctg 109261 tgtgaattta taatcacagc ctgttaggct gcaagactaa atagatacaa ttacaaaaac 109321 ctcatgtgct gaacttgcta ctgaagttca acttaatttc aaatgttttc cttgggatca 109381 aaatactttt caaaattaaa tgtgctttgc acagtacccc atctatgttt ttcatggaat 109441 gaaattttat tatcttgaac cttttatttc ataatttagt aagcattaga aacagtttct 109501 ggatgactta cactgttagt aagcttgtag atatacttaa aagttcaggg ctgaggtcaa 109561 tagtagtttt ttggacacaa gctctgtgat caccttcatt aatcatgcta tagtgtcatc 109621 ctcatatgct gggtccatag aattcagaat aaagacattt cctaagtatg tattaagtat 109681 caggcaatat tcaaagcatc ttatatatat caatttattt catcctcata acaattctat 109741 aaaataggta ttattgttat tatcatcata cttattttgc agataaggaa attaagacac 109801 agagatgcta actaacttgc tcaaggatac acatcaaaga aggttaggtt gtcaactgag 109861 ccatttggct ctagaatttg cactcaaaat tactatgctg tgatacttta ggatagttag 109921 gtaatggggc ccaacatacc tgccttaata gtatctgatg ttgttaggta tttgtttctt 109981 ggatagtgat ctgacatggc tagaatacag gcagtatttt ttctctaaca ggcagttatt 110041 ttgggatcac agaaactcta tttgggcata aaatcatgtt tctaacaact gttcagggtt 110101 tgctacataa ttttcagggc ttaatgcaaa acgaaaatcc aggactctgt taaaaaagca 110161 aggtgatggg gtgctgttaa agatactaaa atatacagct ttgtttcttt attccatagt 110221 ctctcttaca acttgccatg gtacttttta tttgccattg aacgtcacac acccccagga 110281 acagggatat tcacaggaag gcagggaggt agaatgctga gattcacatg aaccaaaatt 110341 ctaagccttt gtctcacact tcattgtcac atctgacttc ccttataaaa cacaaattca 110401 aatataaaaa taaagaattt cagccaggga ccacagagca ttaaatccca agtatgaggc 110461 ccttctaagc atatgaccat atgtgattgt actcatcaca cgcccatgaa gctggccata 110521 caactatttc acaaaaatat ttgaggcagt attgtgttac cattgagtca gagtatctgt 110581 gtttgaatca tggctctcta ctttatcaaa tttatatatt accttgatca aattattttt 110641 atctcgtata aactaggttt gctcatctat aagatttgta gtaatagcag tacatttttc 110701 ataagactct tggaatttag gtgagacact ctctggcata tattactcta taaatgtaag 110761 aaatgtcatt taatttcaaa gatgattgga taacgttata ggaataatgt agaaacatct 110821 atgtatgtct ggtggacacc tttcatccag tatactccat tattccagca aaatataccc 110881 cttctggaaa ttgccacccc acttcaactc catctatata gcagctgaga gtgctgagtt 110941 gtgagtacat gtatctccac ctcttaccag agtcgattga tctagggtgg gcaccaaacc 111001 caagcaaggc ccagtccatt tttgagattc ataggaaatg catctaggtc tcagaaatca 111061 ctggaaccag agtctggaac tcaataagga ctcttctctt ctagtctctg tttttttgtt 111121 ttgttttgtt ttgtttttct gcgcacatgc tcatttttct acctttcttt aagttctttc 111181 cttggttttc atgagatacc tcaggatcct taagataaac tatcctttta gccttaatta 111241 gtttgagttg ggcttctgat accgctaata aaaaagtccc ctctatatac cattccatta 111301 cagaaatctg attctaagaa tgtatttgca ttttttacct tcttcacctg gatattgcca 111361 ttcccttcac aatatggcag ataaggcact atccgaattc tacctcatga ataaaatcca 111421 ggtcgataat caccaatatc ataaagactt cccgatcaca ccagtcagaa acaacttctt 111481 ttttcttcct tgaacaccta aagaattcgc tctcttttgt tttacaacct gtgacactta 111541 ctccatactt tcttcatagc tatctcatta ttagctacgt atttggttta gtgattttgt 111601 gtcttgctgc aattattaga cgacaagtaa ggatctatgt agatttttaa agctattact 111661 attcacttta ttctctgaaa gaatagaaaa aggtaagata ttaattaaaa cttattgaat 111721 aaatgaatat attaatgagt tagtgaataa ataattatcg gagttagcca gtagtgtctt 111781 ccttattcat ctcaaagtta ctagtgatag taacgactag ccttgttaga acactctatt 111841 tattgtgtaa aaatactgtg tctctaaaga aattatcacc tttaactttt ctccaataca 111901 acgaagtata ccagtggaaa aattgcttaa tcattagccc aagaaagcag ttatctctat 111961 gcttctccat gcattatgtg tgagatactt ttataacaac acttcatcat tctctttcta 112021 tcttgacaaa caaaacactg caggagacat atgttcccat ttccagatgc cagaattcaa 112081 taacgatgtt agttaacatc tttcatgttt tgattttgtc acatagcctt gatcctaaca 112141 ttatgaagcc aaaatccatt gttagaggaa aagttggatt ttccatatga attcaaagag 112201 atagtatgct tttcacaggc cagaggttgt tagccttggc acttgacacc ttggattgga 112261 taattctttg ttgtaggggg ctgccctgtt cattgtaaga tgtttagcag tatccttgtc 112321 ccctgtacac cagaattcaa tagcacactc tcttcccttg aggtgcagcc atcataaatg 112381 tctccagata tttccaaaca tcctctgggg gacaaaattc cctcaattga gcaccagtga 112441 tatagagaga atagaattgt tgtactgtac cctacttgct aaagcaaaaa tcatttgtaa 112501 taagacatgt acagaaaggc agaattctta catttggatg gctttggaaa acacacaaaa 112561 cataaccctt ttattttact aatgaaaaaa ttgaacatca gagagtgagg agacttcttc 112621 atgattacag agtgaggtag tttgcagagt tattttaaat ccagatttct gaccccccta 112681 gttagggttt tggccatcac accatgctca aattaagaat tctattaaca tggtcttagc 112741 ttgccagtag gccacttgga ttgccaccta aaattggttg gtggactacc tcaatagcag 112801 gactggctgc ataatttgtg agacccagtg caaatttaaa atgctgagct ccttgttcaa 112861 aaactgggca caagtatcat taaagctttt tttcttcctt ttatgattca tctctcttga 112921 cctgtcatag cttttaaaaa tttgctatct aatgttccaa tccttcaggc atagggatat 112981 tcatgaggtg agttcatgcc ccatgccttc gtgtgtgcct atttgctgct aggttctccc 113041 atttgccagc cactgggcca aaatgccatg tccttccagg gagagggaag aaataaagca 113101 agacatatct ctttcccaca ccagtactca ctcctggatt gggggagatg gctggatgag 113161 aggatcactc ttacagaggc acctgacaaa ttcgttgtgg tgactgccag ctcagggcag 113221 gaatagcagt caacaagccc cagcccaaga ctcacaggat cttcaaccct ttctcctgga 113281 gtgtagacaa cagatgttgc tgggcaacct tgagttccaa gaactcgtcc tgggcaggca 113341 gagaggcagt gggaggagga attgcaggta aggccaggct ccaaactcct gtcacatgct 113401 ccattgtctc attggacact atgtacaaaa cacaagtcaa aaataaaatt accaagattt 113461 tcaagatagt gactgcagag cattaaacct caagcacaag cacagaacac aattttgagt 113521 gtggagttct tgtgatggca caggtaacat gcccatgaag ttagccctgc tctcaagggc 113581 cagagtgcaa aattttacac agctgcctaa agtgacaatt ctaaaaataa aatttatttt 113641 attaaaagat aatgattatt accactttct gatcatgatc cactctgcag tcatactgat 113701 cttttaaagc attgaggaga atttttgaaa agcaatttta ctatgccatt agtgattcct 113761 tgaaagattt tgagctctga gcaaatatgc caaatatcag tataaatcat gcataattat 113821 ggatgtcttc attgtatcca gtagaaagga aacacagttt taagtatcat gaatttgtaa 113881 taagaaaaat atttaaaatt tttctcacag tgaggctata gtgcgggaaa tagggaaaac 113941 tacatttttg ccttttctaa tcgggtactt aacctctgca cttctatttc cctagagaaa 114001 atgtaaacaa aacaagaaat ttgggaaacg tgatcatata ttcaacaatc tgttcagaca 114061 ggatcatgaa acttctgaat gtagtctagg gatggcccgt taatttcccc ttcatattca 114121 aatgatgcaa agatagttaa aagacactac ctacagatat ggctgtcaac tcagtttggg 114181 aaattctacc aatgtgatta tagctcatca gcaagttcat taacatgtta tctggacaag 114241 ctaaacctag ggaggcagaa ttggtttgat gacaaccttt ccagggaaag ggtgaacaat 114301 ttaccattac tgtcttctat ttgcaagtga aacctagctc tggtttcgat ttggcttggc 114361 tccataagaa gagctacata agcagttgag agtgtttaca tgggaggata acagcaggct 114421 actgtagaaa cttggactcc aagtaattgg catggaaaat gaattcaact cattaacagg 114481 tctctgaaac acagcccttt gaaaatcagc ttgacagttt cgtttaataa gtattcggca 114541 cctccaggca acctcactag actctcacct ttcactttct tttttgtgtt ccttgcagcc 114601 agttttcgct gcctgttaac agtcactcct ttggaaagaa aatgttttgc aggaatagca 114661 acagcagaca ctcaaatgta tgtatgcaca aacacatgga atatttttct tgtgaaaatg 114721 atggtgctct cctagccaaa tgttcagagc aaggcagaag ccagaaagtc tctgtagcaa 114781 agaagagatt gttcctggga ctgcagttct ggatatgaga ctcagccaca gactgtcggt 114841 atctggcatc ctcagcatgg atataagttg cacggataga tggggcctga cacttagaga 114901 aaacatcact gttttctcta agtctagtcc gatgagacaa aatcattcat agaatgtcag 114961 agctgggatg gttctgagat tattacattt tgcataaagc cctgagaggt accactaact 115021 gcccaaggtc acattggaat tcaactctcc aagtctgaaa ttatctggaa ttccccccag 115081 ggactgaatt tttgaaaggc caggctggtg taaaaatcag taaatggaag tggggaaaga 115141 agtagatttg gggagatata tttatcagtt aaacactgca gcaaattgtg ggcaccccag 115201 acaaagtgaa gtggaaaatt gaagcaatta ggtggataat tcagttcaaa acacagtcat 115261 ctgaaatagt cagataatta aagcaagaac cattttttgc cttgaattat cttgatagtt 115321 gtctcaatta tccacttcag cgtgtgtgta gactcagcca gccagagaga tcataattat 115381 cactacttga ccttaatctg atctctttac ttgtaaaaca gttaccagta aatgtgaacc 115441 taaaggtaac ttgggagaga tgtggtaaag aggtgagagt accctttaag tatctccagc 115501 aagaatagaa aatggtaaaa tagagtgtgt gaattgtgtg tgtataaaac cactgtgaat 115561 taagcaattt tacctgtcat caaaatcagc catcaggaat tgggcattac ctaagtacaa 115621 aagagcaaga gggctttgtg tgttctttga ttgaagggat ttaatttagt tgcaaataca 115681 gtttttttct gttatgaaaa attaaggaca atagctttta aaaagaggtt aaacctatga 115741 ggcttttgaa cattacacat gttacattat agcctacttt cagaaaacag ctcagttgca 115801 aataggcaga tgagagtaat tctgttcttc ccacaaggag tgtgtgaaga acacatccat 115861 ccatatttac aaatagatga ataagtctgt cctggtacaa tccattccct tatgactttt 115921 tactgtgttc tcattaaggt agacttgaaa atgtgaaact aactaacaaa aagttctatt 115981 gaactttttg ttcaatagaa cttttgaaca aacatgtgaa cacttgtcca cgtgttttca 116041 actttgggga aaggaaagcg atgccgaaat agatgggcat ttgtatataa actttggaat 116101 cctgtggaga gaaggggctc tgtaaatatt tgctaatttt atcttttttc agttaacact 116161 tgcatccttt tcaattaata ctcactattt tttcaattac tattaagaaa cacagctaat 116221 gtataacagt tgagtgcatg gttgctggag ccagtttggc tgaattcaaa ttgtaactct 116281 gtcacttact agttttatga atttgcctaa gtgtttttag cctcacctgt ggataggggt 116341 gtcctcagct gtaagatgaa ggaaatagta gtgtctctat cctagggttg ttgtgtagac 116401 tgaatagctt ggaattaatt taaagggtgg ctgccatgta gtgaacacca ttctaagtat 116461 tagatattat gatttaattc tccttggtaa tatgtattga gacatggatt aacattagaa 116521 tattaaagtc agtacaaatt agagatgctc atgcataatt ccttgctgga gtttactttt 116581 gtattaaaga acttctttca ttggccactc tatctcattt attaatgagc aaatgggcta 116641 agagaagagg attctatttg ctcaaagtca cacagttatg aaccaacaga gctgaaccaa 116701 atggctgaga cctagaaaac ctaacatcaa gtcaccagct tgagtcttat aaaagacata 116761 tgctttcatt tctgtcaccc agcttatata agcattgact tatatatggt cattttgtcc 116821 tcctttttta tttaaaattt tataattttt tttctatttt tttttttttt ttattgtagg 116881 aggactggaa aagaaagggt ttggacattt aaggagcaag ccaaaatggg ccatcctcaa 116941 aaacaatctt atggaaacct gaaggcttgt cagaaacatt gtaatatctt gagaagtgat 117001 ggggaattag ataagaataa cagatttttt tttaaatcat aaataacagt ggatcagaaa 117061 gcaaaaacaa cagtgataat gaagacaaag cccactgact gaactagagt aagctaatag 117121 ggaaatgggt ttcctctatg gccccgtgcc attttaagtc aggcttgcat agctaatttc 117181 atctcctctg gctttggggc ttccaagata gccctgagca ggaacacttg ttcaattttt 117241 accactatcc ctctccctaa tccctacatc cccagcaaat agacagacat ttttaaagtc 117301 agtttgaaac ataacagttt tctcaggtac tttcacattc cttcgaacca acaaatttta 117361 aagccattga atgtaacggc atactttgaa taaaatgaaa ctctgagatg aaagcaagcc 117421 aacatgcaca gccctcttct ctgtttttct tccagagttt aaattcagct aatcaaagtt 117481 tttcctcata ttagagggcc tgtcatgaaa aggtagtata gaatggtgat taagagttca 117541 ggcagactct gaagccaaac tgcctgggtc aaaatcctag ctctgttgtt taactgttaa 117601 gtgactttag gcaagttaag ctgtcctaaa attgcctaag atgaatattt ggaaatacgg 117661 gaacttgaaa aagcagagac cagtggaggt tgggtagtca ggagaggctt ctcaaagaag 117721 gtgaggctaa aggataacca gtgtttgttc agacatgtat ctctttggga aaaaattgat 117781 caaactacat ctaccttgtt tttaaatctt tttttttttt tcagtttcac taatgggaca 117841 ctgccaaaac tatcacggca ggtcctttct gttcatttac aaaaagccac ttttccaaaa 117901 ccaagagatc atatttccat ataaaaatgg atgaaaaaca tacatatgga aagactttaa 117961 gacacatttt agtatttaag aaaaatattt ttttttcctc aaagcaaaag atctggtaag 118021 agattggcag ctgcaactat ggtattgaaa acggcatcaa ggaaaaggat gtattacaca 118081 tgtctcttaa aactaagcca tctgatagcc aagggagaaa taaataacag agaaggaagc 118141 cctgtttccc ttccttgttc ttttttaaat cattgtaaca gaaataaaat caacacagag 118201 gggcataaaa ggaaatagga aaatagatat caaaaagaga agctggaaat tctgtttaaa 118261 atgatacact acagtcacaa atatctctgt atctttcttc ttgctaagta aaagagacaa 118321 ttgcagatgt atttccccca cattgtaaat ctgaacatgt ttttcagcgt cagtttttat 118381 ccctagcctg tccctaatga tcagcaccaa gttatttctt cactgtgttg gaaatttgac 118441 tgtatattgt agataacact tttatatcaa aaaatgtcaa cattttttca aaaaatggaa 118501 aaatgtctta gaaatggtaa tctcatagaa aaacaggaca ttatgtttga gaggaactct 118561 ttcaatgtga agccactttc catttcttgc ctattttcct ttgtgattgt catttggtga 118621 tggtgcttag atatgcctcc aggaaaagaa aaaaaagaat gagagctacc aagtgagtgt 118681 gagaaagaga aatgaagata ggaagaaaaa atgtgtatgt cctactcagt ggcctaatta 118741 cggcacttta taataaaacc aatttatcat gactcttaaa aagtaataca agatagaaat 118801 atcatcttca tcttgtcatt caataaatat ttttaaattt ctaaggcttt acaacacaga 118861 tatcccagaa aggagtccag caggaaggat aaagtagcca gatgattaag tatttgtaaa 118921 cagcccaaaa tagtgccatg catcagtaag ttaggtctca ccttctttcc ttttctgtat 118981 ccagaaaagg ggtgtggggg aatgaggacc agtcggagta aggacacata gctgaagcca 119041 acattagcat cattctcttc taccacacct ctatgcctgc atatgagata caaataaact 119101 acgactttac aggactgaaa gcccttcatc ttcaaatctt taagattcat taaaaactct 119161 tgtgttcata ttactaccaa atatttcaaa gttcttcagg tttctaaact aaatctttca 119221 gaaaaacaac aaaaattatt tcaagaaatc aaaagctcct ttcaatggct acattaatgc 119281 aaaattcttt tttaaaaaga atgcatttgg tatatttaac tttaattaat taattaatta 119341 atttatttat ttatttattt acttttttga gacggagtct cgctctgtca cccaggctgg 119401 agtgcagtgg cgcgatcttt gctcactgca agctccgcct cccgggttca cgccattctc 119461 ctgcctcagc ctcccaagta gctgggatta caggcgcccg ccacaacgcc cggctaattt 119521 tttttatttt tagtagagac ggggtttcat catattagcc aggatggtct cgatctcctg 119581 acctcgtgat ctgcccgctt cggcctccaa aagcgctggg attacaggcg tgagcccctg 119641 cgcctggccg gcatatttaa ctttaaaatt tgctttgaag gaaatgtttc ttagtctgag 119701 acatgcccac cagttcataa aacagggcat tttttcctgg tcacttaaag agcctctcaa 119761 ttagaaagtt gaacaaggat tagcttcagt gatcctgagt tatttcccta agtgtattag 119821 cataatgacc tactctcatg caatatttta ttttatttta ttttattttt attatactct 119881 aagttttagg gtacatgtgc acaacgtgca ggtttgttac atatacatac atgtgccgtg 119941 ttggtgtgct gcacccatta actcgtctct catgcaatat tttaaatgag aaggcctata 120001 attagattat gagagagaaa atatatgttt acaaagtcgt tagcataaaa taacacatgc 120061 agactaggca atatccaaat actggctcct ttttatatat atgacccatc catctagttt 120121 ttgggataga agtgtatcag aggaaaacac ccagtacata atcttattga acatccacac 120181 agggaatgtg cagtcacata caaaaaaaat cccattgatt tctacttgag atgctgatct 120241 attgattttt gagtggctct ggttacctca ccataggagt gtggcttcag gaaaagtgaa 120301 ctaactggga aatctgcaaa gcagcaagca gagcaggaca aattggtaaa attatttaat 120361 actgaaaccc aacctcactg acctcctggc aagactcaca attagaaagt agggaataaa 120421 tgttaattat caataatgaa ctatttgggg tcataatcat gaaggtggaa atattagagt 120481 acaggaacag gcatagttag gtactctatg tattcttata atgttcttaa ttagtgcagc 120541 aatctaggca gcagtaagct gtggaaagag ggccaagagg gacactctca gactttctgt 120601 tgcatgtaca tttgctgaaa ttgcaatggt acgttctcac taaacttttt gtacccctgt 120661 ctcacctcag atttcctcag gtttttctct ggctgtctga atccagttcc ccattcctcg 120721 gccagcttac cctaatatct tgtttacccc aacatgcaac tcacctcttt tcaaaagtct 120781 tctccattcc ttcccccact ccccaaagtc tcttctgtaa ttgtaaggct agatttttca 120841 gagaaataaa aatttgatta attactatta aggcgtgtgt ttttattcat tttaaattgt 120901 aactggcaga ccaaatggat tatacatcaa tagtctttaa ttgcctacac caatcaatct 120961 tctgtatttt atacatattc acagtaattt cactaacctt atatggaaga atagggctca 121021 atatattgcc aatatatcag caactatttt tggaaagatg ccttccttag gtcacacatc 121081 catatacaac ttgtttcggt agttctatgt accagaactc tgaagtgtta ctttctacca 121141 agggaaaatc tataaccttg ggaggagcca agatggccga ataggaacag ctccggtcta 121201 cagctcccag cgtgagcgac gcagaagacg ggtgatttct gcatttccat ctgaggtacc 121261 gggttcatct cactagggag tgccagacag tgggcgcagg ccagtgtgtg tgcgcaccgt 121321 gcgcgagccg aagcagggcg aggcatcgcc tcacctggga agcgcaaggg gtcagggagt 121381 tccctttccg agtcaaagaa aggggtgacg gacacacctg gaaaatcagg tcactcccac 121441 ccgaatattg cgcttttcag accggcttaa gaaacggcgc accacgagac tatatcccac 121501 acctggctca gagggtccta cgcccacgga atcgcgctga ttgctagcac agcagtctga 121561 gatcaaactg caaggcggca acgaggctgg gggaggggcg cccgccattg cccaggcttg 121621 cttaggtaaa caaagcagcc aggaagctcg aactgggtgg agcccaccac agctcaagga 121681 ggcctgcctg cctctgtagg ctccacctct gggggcaggg cacagacaaa caaaaagaca 121741 gcagtaacct ctgcagactt aagtgtccct gtctaacagc tttgaagaga gcagtggttc 121801 tcccagcacg cagctggaga tctgagaacg gtcagactgc ctcctcaagt gggtccctga 121861 cccctgaccc ccgagcagcc taactgggag gcacccccca gcaggggcac actgacacct 121921 cacacagcag ggtattccaa cagacctgca gctgagggtc ctgtctgtta gaaggaaaac 121981 taacaaccag aaaggacatc tacaccgaaa acccatctgt acatcaccat catcaaagac 122041 caaaagtaga taaaaccaca aagatgggga aaaaacagaa cagaaaaact ggaaactcta 122101 aaacgcagag cgcctctcct cctccaaagg aacgcagttc ctcaccagca acagaacaaa 122161 gctggatgga gaatgatttt gacgagctga gagaagaagg cttcagacga tcaaattact 122221 ctgagctacg ggaggacatt caaaccaaag gcaaagaagt tgaaaacttt gaaaaaaatt 122281 tagaagaatg tataactaga ataaccaata cagagaagtg cttaaaggag ctgatggagc 122341 tgaaaaccaa ggctcgagaa ctacgtgaag aatgcagaag cctcaggagc cgatgcgatc 122401 aactggaaga aagggtatca gcaatggaag atgaaatgaa tgaaatgaag cgagaaggga 122461 agtttagaga aaaaagaata aaaagaaatg agcaaagcct ccaagaaata tgggactatg 122521 tgaaaagacc aaatctacgt ctgattggtg tacctgaaag tgacggggag aatggaacca 122581 agttggaaaa cactctgcag gatattatcc aggagaactt ccccaatcta gaaaggcagg 122641 ccaacgttca gattcaggaa atacagagaa caccacaaag atactcctcg agaagagcaa 122701 ctccaagaca cataattgtc agattcacca aagttgaaat gaaggaaaaa atgttaaggg 122761 cagccagaga gaaaggtcgg gttaccctca aaggaaagcc catcagacta acagcggatc 122821 tctcggcaga aaccctacaa gccagaagag agtggaggcc aatattcaac attcttaaag 122881 aaaagaattt tcaacccaga atttcatatc cagccaaact aagcttcata agtgaaggag 122941 aaataaaata ctttatagac aagcaaatgc tgagagattt tgtcaccacc aggcctgccc 123001 taaaagagct cctgaaggaa gcgctaaaca tggaaaggaa caaccggtac cagccgctgc 123061 aaaatcatgc caaaatgtaa agaccatcga gactaggaag aaactgcatc aactaatgag 123121 caaaatcacc agctaacatc ataatgacag gatcaaattc acacataaca atattaactt 123181 taaatataaa tggactaaat tctgcaatta aaagacacag actggcaagt tggataaaga 123241 gtcaagaccc atcagtgtgc tgtattcagg aaacccatct cacgtgcaga gacacacata 123301 ggctcaaaat aaaaggatgg aggaagatct accaagccaa tggaaaacaa aaaaaggcag 123361 gggttgcaat cctagtctct gataaaacag actttaaacc aacaaagatc aaaagagaca 123421 aagaaggcca ttacataatg gtaaagggat caattcaaca agaggagcta actatcctaa 123481 atatttatgc acccaataca ggagcaccca gattcataaa gcaagtcctg agtgacctac 123541 aaagagactt agactcccac acattaataa tgggagactt taacacccca ctgtcaacat 123601 tagacagatc aacgagacag aaagtcaaca aggataccca ggaattgaac tcagctctgc 123661 accaagcaga cctaatagac atctacagaa ctctccacca caaatcaaca gaatatacat 123721 tcttttcagc accacaccac acctattcca aaattgacca catagttgga agtaaagctc 123781 tcctcagcaa atgtaaaaga acagaaatta taacaaacta tctctcagac cacagtgcaa 123841 tcaaactaga actcaggatt aagaatctca ctcaaagccg ctcaactaca tggaaactga 123901 acaacctgct cctgaatgac tactgggtac ataacgaaat gaaggcagaa ataaagatgt 123961 tctttgaaac caacgagaac aaagacacca cataccagaa tctctgggac gcattcaaag 124021 cagtgtgtag agggaaattt atagcactaa atgcctataa gagaaagcag gaaagatcca 124081 aaattgacac cctaacatca caattaaaag aactagaaaa gcaagagcaa acacattcaa 124141 aagctagcag aaggcaagaa ataactaaaa tcagagcaga actgaaggaa atagagacac 124201 aaaaaaccct tcaaaaaatc aatgaatcca ggagctggtt ttttgaaagg atcaacaaaa 124261 ttgatagacc gctagcaaga ttaataaaga aaaaaagaga gaagaatcaa atagacacaa 124321 taaaaaatga taaaggggat atcaccaccg atcccacaga aatacaaact accatcagag 124381 aatactacaa acacctctac gcaaataaac tagaaaatct agaagaaatg gatacattcc 124441 tcgacacata cactctccta agactaaacc aggaagaagt tgaatctctg aatagaccaa 124501 taacaggctc tgaaattgtg gcaataatca atagtttacc aaccaaaaag agtccaggac 124561 cagatggatt cacagccgaa ttctaccaga ggtacaagga ggaactggta ccattccttc 124621 tgaaactatt ccaatcaata gaaaaagagg gaatcctccc taactcattt tatgaggcca 124681 gcatcattct gataccaaag ccgggcagag acacaaccaa aaaagagaat tttagaccaa 124741 tatccttgat gaacattgat gcaaaaatcc tcaataaaat actggcaaac cgaatccagc 124801 agcacatcaa aaagcttatc caccatgatc aagtgggctt catccctggg atgcaaggct 124861 ggttcaatat acgcaaatca ataaatgtaa tccagcatat aaacagagcc aaagacaaaa 124921 accacatgat tatctcaata gatgcagaaa aagcctttga caaaattcaa caacccttca 124981 tgctaaaaac tctcaataaa ttaggtattg atgggacgta tttcaaaata ataagagcta 125041 tctatgacaa acccacagcc aatatcatac tgaatgggca aaaactggaa gcattccctt 125101 tgaaaactgg cacaagacag ggatgccctc tctcaccgct cctattcaac atagtgttgg 125161 aagttctggc cagggcaatc aggcaggaga aggaaataaa gggtattcaa ttaggaaaag 125221 aggaagtcaa attgtccctg tttgcagacg acatgattgt ttatctagaa aaccccattg 125281 tctcagccca aaatctcttt aagctgataa gcaacttcag caaagtctca ggatacaaaa 125341 tcaatgtaca aaaatcacaa gcattcttat acaccaacaa cagacaaaca gagagccaaa 125401 tcatgagtga actcccattc acaattgctt caaagagaat aaaataccta ggaatccaac 125461 ttacaaggga tgtgaaggaa ctcttcaagg agaactacaa accactgctc aaggaaataa 125521 aagaggacac aaacaaatgg aagaacattc catgctcatg ggtaggaaga atcaatatcg 125581 tgaaaatggc catactgccc aaggtaattt acagattcaa tgccatcccc atcaagctac 125641 caatgacttt cttcacagaa ttggaaaaaa ctactttaaa gttcatatgg aaccaaaaaa 125701 gagcccgcat cgccaagtca atcctaagcc aaaagaacaa agccggaggc atcacactac 125761 ctgacttcaa actatactac aaggctacag taaccaaaac agcatgggac tggtaccaaa 125821 acagagatat agatcaatgg aacagaacag agtcctcaga aataatgccg catatctaca 125881 actatctgat ctttgacaaa cctgagaaaa acaagcaatg gggaaaggat tccctattta 125941 ataaatggtg ctgggaaaac tggctaacca tatgtagaaa gctgaaactg gatcccttcc 126001 ttacacctta tacaaaaatc aattcaagat ggattaaaga tttaaacgtt agacctaaaa 126061 ccataaaaac cctagaagaa aacctaggca ttaccattca ggacataggc gtgggcaagg 126121 acttcatgtc caaaacacca aaagcaatgg caacaaaagc caaaattgac aaatgggatc 126181 taattaaact caagagcttc tgcacagcaa aagaaactac catcagagtg aacaggcaac 126241 ctacaacatg ggagaaaatt ttcgcaacct actcatctga caaagggcta atatccagaa 126301 tctataatga actcaaacaa atttacaaga aaaaaacaac cccatcaaaa agtgggcgaa 126361 ggacatgaac agacacttct caaaagaaga catttatgca gctaaaaaac acatgaagaa 126421 atgctcatca tcactggcca tcagagaaat gcaaatcaaa accactatga gatatcatct 126481 cacaccagtt agaatggcaa tcattaaaaa gtcaggaaac aacaggtgct ggagaggatg 126541 tggagaaata ggaacacttt tacactgttg gtgggactgt aaactagttc aaccattgtg 126601 gaagtcagtg tggcgattcc tcagggatct agaactagaa ataccatttg acccagccat 126661 cccattactg ggtatatacc caaaggacta taaatcatgc tgctataaag acacatgcac 126721 acgtatgttt attgcggcac tattcacaat agcaaagact tggaaccaac ccaaatgtcc 126781 aacaatgata gactggatta agaaaatgtg gcacatatac accatggaat actatgcagc 126841 cataaaaaat gatgagttca tgtcctttgt agggacatgg atgaaattgg aaaccatcat 126901 tctcagtaaa ctatcgcaag aacaaaaaac caaacaccgc atattctcac tcataggtgg 126961 gaattgaaca atgagatcac atggacacag gaaggggaat atcacactct ggggactgtg 127021 gtggggtcgg gggagggggg agggatagca ttgggagata tacctaatgc tagatgacac 127081 gttagtgggt gcagtgcacc agcatggcac atgtatacat atgtaactaa cctgcacaat 127141 gtgcacatgt accctaaaac ttagagtata ataaaaaaaa aaattaaaaa aaaaaaagaa 127201 aatctataac cttataataa tatcacattt tgtctgatat ttggatc // LOCUS HS37LIM 1130 bp RNA PRI 28-NOV-1995 DEFINITION H.sapiens mRNA for 37 kDa LIM domain protein. ACCESSION X93510 NID g1085021 KEYWORDS 37 kDa protein; LIM-domain protein; ril gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1130) AUTHORS Scharm,B. JOURNAL Unpublished REFERENCE 2 (bases 1 to 1130) AUTHORS Schaefer,R. TITLE Direct Submission JOURNAL Submitted (29-OCT-1995) R. Schaefer, Division of Cancer Research Dept., Dept. of Pathology, Schmelzbergstr. 12, CH 8091 Zuerich, CH-8091, SWITZERLAND FEATURES Location/Qualifiers source 1..1130 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="lung" /cell_type="fibroblast" /clone_lib="Clontech #HL1011b" gene 42..1028 /gene="ril" CDS 42..1028 /gene="ril" /codon_start=1 /product="37 kDa LIM domain protein" /db_xref="PID:e211802" /db_xref="PID:g1085022" /translation="MPHSVTLRGPSPWGFRLVGRDFSAPLTISRVHAGSKASLAALCP GDLIQAINGESTELMTHLEAQNRIKGCHDHLTLSVSRPEGRSWPSAPDDSKAQAHRIH IDPEIQDGSPTTSRRPSGTGTGPEDGRPSLGSPYGKPPCFPVPHNGSSEATLPAQMST LHVSPPPSADPAEASRGAGSRVDLGSEVYRMLREPAEPVAAEPKQSGSFRYLQGMLEA GEGGDWPGPGGPRNLKPTASKLGAPLSGLQGLPECTRCCHGIVGTIVKERDKLYHPEC FMCSDCGLNLKQRGYFFLDERLYCESHAKARVKPPEGYDVVAVYPNAKVELV" BASE COUNT 217 a 388 c 345 g 180 t ORIGIN 1 tgagagtccg gctcaggctc cggctgcggc tccagcccgc gatgccccat tccgtgaccc 61 tgcgcgggcc ttcgccctgg ggcttccgcc tggtgggccg ggacttcagc gcgcccctca 121 ccatctcacg ggtccatgct ggcagcaagg cctcattggc tgccctgtgc ccaggagacc 181 tgatccaggc catcaatggt gagagcacag agctcatgac acacctggag gcacagaacc 241 gcatcaaggg ctgccacgat cacctcacac tgtctgtgag caggcctgag ggcaggagct 301 ggcccagtgc ccctgatgac agcaaggctc aggcacacag gatccacatc gatcctgaga 361 tccaggacgg cagcccaaca accagcaggc ggccctcagg caccgggact gggccagaag 421 atggcagacc aagcctggga tctccatatg gaaaaccccc ttgctttcca gtccctcaca 481 atggcagcag cgaggccacc ctgccagccc agatgagcac cctgcatgtg tctccacccc 541 ccagcgctga cccagcagag gcctcccgcg gagccgggag cagagtcgac ctgggctccg 601 aggtgtacag gatgctgcgg gagccggccg agcccgtggc cgcggagccc aagcagtcag 661 gctccttccg ctacttgcag ggcatgctag aggccggcga gggcggggat tggcccgggc 721 ctggcggccc ccggaacctc aagcccacgg ccagcaagct gggcgctccg ctgagcggcc 781 tgcaggggct gcccgagtgc acgcgctgct gccacggaat cgtgggcacc atcgtcaagg 841 aacgggacaa gctctaccat cccgagtgct tcatgtgcag tgactgcggc ctgaacctca 901 agcagcgtgg ttacttcttt ctggacgagc ggctctactg tgagagccac gccaaggcgc 961 gcgtgaagcc gcccgagggc tacgacgtgg tggcggtgta ccccaatgcc aaggtggaac 1021 tcgtctgagc tgggaccctg ctcccacccc tgcttcttaa ggtccctgct cggccggtgt 1081 aaatatgttt caccctgtcc ctctaataaa gctcctctgc tcaaaaaaaa // LOCUS HS3OCOAT 1391 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for 3-oxoacyl-CoA peroxisomal thiolase. ACCESSION X12966 NID g23873 KEYWORDS 3-oxoacyl-CoA thiolase; thiolase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 867; 869 to 1288; 1290 to 1391) AUTHORS Bout,A. TITLE Direct Submission JOURNAL Submitted (20-SEP-1988) Bout A., Dept of Biochemistry, University of Amsterdam, Meibergdreef 15, 1105 AZ Amsterdam REFERENCE 2 (bases 1 to 867; 869 to 1288; 1290 to 1391) AUTHORS Bout,A., Teunissen,Y., Hashimoto,T., Benne,R. and Tager,J.M. TITLE Nucleotide sequence of human peroxisomal 3-oxoacyl-CoA thiolase JOURNAL Nucleic Acids Res. 16 (21), 10369 (1988) MEDLINE 89057483 REFERENCE 3 (bases 1 to 1391) AUTHORS Bout,A. TITLE Direct Submission JOURNAL Submitted (30-AUG-1989) to the EMBL/GenBank/DDBJ databases FEATURES Location/Qualifiers source 1..1391 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" CDS 7..1281 /note="3-oxoacyl-CoA thiolase propeptide (424 AA)" /codon_start=1 /db_xref="PID:g23874" /db_xref="SWISS-PROT:P09110" /translation="MQRLQVVLGHLRGPADSGWMPQAAPCLSGAPQASAADVVVVHGR RTAICRAGRGGFKDTTPDELLSAVMTAVLKDVNLRPEQLGDICVGNVLQPGAGAIMAR IAQFLSDIPETVPLSTVNRQCSSGLQAVASIAGGIRNGSYDIGMACGVESMSLADRGN PGNITSRLMEKEKARDCLIPMGITSENVAERFGISREKQDTFALASQQKAARAQSKGC FQAEIVPVTTTVHDDKGTKRSITVTQDEGIRPSTTMEGLAKLKPAFKKDGSTTAGNSS QVSDGAAAILLARRSKAEELGLPILGVLRSYAVVGVPPDIMGIGPAYAIPVALQKAGL TVSDVDIFEINEAFASQAAYCVEKLRLPPEKVNPLGGAVALGHPLGCTGARQVITLLN ELKRRGKRAYGVVSMCIGTGMGAAAVFEYPGN" BASE COUNT 295 a 374 c 460 g 262 t ORIGIN 1 tgcgcaatgc agaggctgca ggtagtgctg ggccacctga ggggtccggc cgattccggc 61 tggatgccgc aggccgcgcc ttgcctgagc ggtgccccgc aggcctcggc cgcggacgtg 121 gtggtggtgc acgggcggcg cacggccatc tgccgggcgg gccgcggcgg cttcaaggac 181 accacccccg acgagcttct ctcggcagtc atgaccgcgg ttctcaagga cgtgaatctg 241 aggccggaac agctggggga catctgtgtc ggaaatgtgc tgcagcctgg ggccggggca 301 atcatggccc gaatcgccca gtttctgagt gacatcccgg agactgtgcc tttgtccact 361 gtcaatagac agtgctcgtc ggggctacag gcagtggcca gcatagcagg tggcatcaga 421 aatgggtctt atgacattgg catggcctgt ggggtggagt ccatgtccct ggctgacaga 481 gggaaccctg gaaatattac ttcgcgcttg atggagaagg agaaggccag agattgcctg 541 attcctatgg ggataacctc tgagaatgtg gctgagcggt ttggcatttc acgggagaag 601 caggatacct ttgccctggc ttcccagcag aaggcagcaa gagcccagag caagggctgt 661 ttccaagctg agattgtgcc tgtgaccacc acggtccatg atgacaaggg caccaagagg 721 agcatcactg tgacccagga tgagggtatc cgccccagca ccaccatgga gggcctggcc 781 aaactgaagc ctgccttcaa gaaagatggt tctaccacag ctggaaactc tagccaggtg 841 agtgatgggg cagctgccat cctgctggcc cggaggtcca aggcagaaga gttgggcctt 901 cccatccttg gggtcctgag gtcttatgca gtggttgggg tcccacctga catcatgggc 961 attggacctg cctatgccat cccagtagct ttgcaaaaag cagggctgac agtgagtgac 1021 gtggacatct tcgagatcaa tgaggccttt gcaagccagg ctgcctactg tgtggagaag 1081 ctacgactcc cccctgagaa ggtgaacccc ctggggggtg cagtggcctt agggcaccca 1141 ctgggctgca ctggggcacg acaggtcatc acgctgctca atgagctgaa gcgccgtggg 1201 aagagggcat acggagtggt gtccatgtgc atcgggactg gaatgggagc cgctgccgtc 1261 tttgaatacc ctgggaactg agtgaggtcc caggctggag gcgctacgca gacagtcctg 1321 ctgctctagc agcaaggcag taacaccaca aaagcaaaac cacatgggaa aactcagcac 1381 tggtggtggt g // LOCUS HS40KDAP 2791 bp RNA PRI 30-JUN-1993 DEFINITION H.sapiens 40 kDa protein kinase related to rat ERK2. ACCESSION Z11695 S38869 NID g23878 KEYWORDS protein kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2791) AUTHORS Gonzalez,F.A. TITLE Direct Submission JOURNAL Submitted (27-JAN-1992) Fernando A Gonzalez, Biochemistry and Molecular Biology, University of Massachusetts Medical School, 373 Plantation St., Worcester, MA, 01605, USA REFERENCE 2 (bases 1 to 2791) AUTHORS Gonzalez,F.A., Raden,D.L., Rigby,M.R. and Davis,R.J. TITLE Heterogeneous expression of four MAP kinase isoforms in human tissues JOURNAL FEBS Lett. 304 (2-3), 170-178 (1992) MEDLINE 92316223 FEATURES Location/Qualifiers source 1..2791 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /clone_lib="Lambda Zap II" /clone="pBluescript II" CDS 135..1181 /codon_start=1 /product="40kDa protein kinase" /db_xref="PID:g23879" /db_xref="SWISS-PROT:P28482" /translation="MVRGQVFDVGPRYTNLSYIGEGAYGMVCSAYDNVNKVRVAIKKI SPFEHQTYCQRTLREIKILLRFRHENIIGINDIIRAPTIEQMKDVYIVQDLMETDLYK LLKTQHLSNDHICYFLYQILRGLKYIHSANVLHRDLKPSNLLLNTTCDLKICDFGLAR VADPDHDHTGFLTEYVATRWYRAPEIMLNSKGYTKSIDIWSVGCILAEMLSNRPIFPG KHYLDQLNHILGILGSPSQEDLNCIINLKARNYLLSLPHKNKVPWNRLFPNADSKALD LLDKMLTFNPHKRIEVEQALAHPYLEQYYDPSDEPIAEAPFKFDMELDDLPKEKLKEL IFEETARFQPGYRS" BASE COUNT 717 a 679 c 599 g 796 t ORIGIN 1 tgcgttgggg tgaggccctc acttcatccg gcgactagca ccgcgtccgg cagcgccagc 61 cctacactcg cccgcgccat ggtcttcatc tctcgacctc gtgatgggcc gccgcggcgg 121 gcgcgggccc ggagatggtc cgcgggcagg tgttcgacgt ggggccgcgc tacaccaacc 181 tctcgtacat cggcgagggc gcctacggca tggtgtgctc tgcttatgat aatgtcaaca 241 aagttcgagt agctatcaag aaaatcagcc cctttgagca ccagacctac tgccagagaa 301 ccctgaggga gataaaaatc ttactgcgct tcagacatga gaacatcatt ggaatcaatg 361 acattattcg agcaccaacc atcgagcaaa tgaaagatgt atatatagta caggacctca 421 tggaaacaga tctttacaag ctcttgaaga cacaacacct cagcaatgac catatctgct 481 attttctcta ccagatcctc agagggttaa aatatatcca ttcagctaac gttctgcacc 541 gtgacctcaa gccttccaac ctgctgctca acaccacctg tgatctcaag atctgtgact 601 ttggcctggc ccgtgttgca gatccagacc atgatcacac agggttcctg acagaatatg 661 tggccacacg ttggtacagg gctccagaaa ttatgttgaa ttccaagggc tacaccaagt 721 ccattgatat ttggtctgta ggctgcattc tggcagaaat gctttctaac aggcccatct 781 ttccagggaa gcattatctt gaccagctga accacatttt gggtattctt ggatccccat 841 cacaagaaga cctgaattgt ataataaatt taaaagctag gaactatttg ctttctcttc 901 cacacaaaaa taaggtgcca tggaacaggc tgttcccaaa tgctgactcc aaagctctgg 961 acttattgga caaaatgttg acattcaacc cacacaagag gattgaagta gaacaggctc 1021 tggcccaccc atatctggag cagtattacg acccgagtga cgagcccatc gccgaagcac 1081 cattcaagtt cgacatggaa ttggatgact tgcctaagga aaagctcaaa gaactaattt 1141 ttgaagagac tgctagattc cagccaggat acagatctta aatttgtcag gacaagggct 1201 cagaggactg gacgtgctca gacatcggtg ttcttcttcc cagttcttga cccctggtcc 1261 tgtctccagc ccgtcttggc ttatccgctt tgactccttt gagccgtttg gaggggcggt 1321 ttctggtagt tgtggctttt atgctttcaa agaatttctt cagtccagag aattcctcct 1381 ggcagccctg tgtgtgtcac ccattggtga cctgcggcag tatgtacttc agtgcacctt 1441 actgcttact gttgctttag tcactaattg ctttctggtt tgaaagatgc agtggttcct 1501 ccctctcctg aatccttttc tacatgatgc cctgctgacc atgcagccgc accagagaga 1561 gattcttccc caattggctc tagtcactgg catctcactt tatgataggg aaggctacta 1621 cctagggcac tttaagtcag tgacagcccc ttatttgcac ttcacctttt gaccataact 1681 gtttccccag agcaggagct tgtggaaata ccttggctga tgttgcagcc tgcagcaagt 1741 gcttccgtct ccggaatcct tggggagcac ttgtccacgt cttttctcat atcatggtag 1801 tcactaacat atataaggta tgtgctattg gcccagcttt tagaaaatgc agtcattttt 1861 ctaaataaaa aaggaagtac tgcacccagc agtgtcactc tgtagttact gtggtcactt 1921 gtaccatata gaggtgtaac acttgtcaag aagcgttatg tgcagtactt aatgtttgta 1981 agacttacaa aaaaagattt aaagtggcag cttcactcga catttggtga gagaagtaca 2041 aaggttgcag tgctgagctg tgggcggttt ctggggatgt cccagggtgg aactccacat 2101 gctggtgcat atacgccctt gagctacttc aaatgtggtt tatacctcgc agatacaaga 2161 atctttatga atatacaatt ctttttcctt ctacagctta gctccgtctt ttcaaccacg 2221 aacatttaaa acccgaccta ctagcactgt tctgtcctca agtactcaaa tatttctgat 2281 actgctgagt cagactgtca gaaaaagcta gcactaactc gtgtttggag ctctatccat 2341 attttactga tctctttaag tatttgttcc tgccactgtg tactgtggag ttgactcggt 2401 gttctgtccc agtgcggtgc ctcctcttga cttccccact gctctctgtg gtgagaaatt 2461 tgccttgttc aataattact gtacccctcg catgactgtt acagctttct gtgcagagat 2521 gactgtccaa gtgccacatg cctacgattg aaatgaaaac tctattgtta cctctgagtt 2581 gtgttccacg gaaaatgcta tccagcagat catttaggaa aaataattct atttttagct 2641 tttcatttct cagctgtcct tttttcttgt ttgatttttg acacgaatgg agaatgggtt 2701 atataaagac tgcctgctat tgacagaaat gcatttgtaa ttcatgaaaa taaatgtaca 2761 tcttctatct tcaaaaaaaa aaaaaaaaaa a // LOCUS HS434P1 124990 bp DNA PRI 03-FEB-1998 DEFINITION Human DNA sequence from PAC 434P1 on chromosome 22. Contains inward rectifier potassium channel 4, (potassium channel, inwardly rectifying, subfamily J, member 4) (hippocampal inward rectifier) (HIR) (HRK1) (HIRK2) (KIR2.3), ESTs similar to lumen protein retaining receptor 2 (KDEL receptor 2), DEAD-box protein P72, ESTs, CpG islands. ACCESSION Z97056 NID g2832593 KEYWORDS 22; CpG island; ion transport; ionic channel; P72; potassium transport; transmembrane; voltage-gated channel. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 124990) AUTHORS Connor,R. TITLE Direct Submission JOURNAL Submitted (02-FEB-1998) sanger.ac.uk/HGP/Chr22/) Sanger Centre, Hinxton, Cambridgeshire, CB10 1SA, UK. E-mail enquires: humquery@sanger.ac.uk Clone requests: clonerequest@sanger.ac.uk COMMENT IMPORTANT: This sequence is the entire insert of clone 434P1. It may be shorter because we only sequence overlapping sections once, or longer because we arrange for a small overlap between neighbouring submissions. During sequence assembly data are compared from overlapping clones. Where differences are found these are annotated as variations together with a note of the overlapping clone name. Note that the variations annotated may not be found in the sequence submission corresponding to the overlapping clone as we submit sequences with only a small overlap as described above. This sequence was generated from part of bacterial clone contigs of human chromosome 22, constructed by the Sanger Centre chromosome 22 mapping group. Further information can be found at http://www.sanger.ac.uk/HGP/Chr22/ This sequence has been finished according to sequence map criteria as follows. An attempt is made to resolve all sequencing problems, such as compressions and repeats, but not necessarily within known annotated human repeat sequence elements (e.g. Alu). Where the sequence is ambiguous, there is an annotation using the 'unsure' feature key. The true left end of clone 434P1 is at 1 in this sequence. The true right end of clone 449O17 is at 36646. The true right end of clone 434P1 is at 124990. 434P1 is from the library RPCI3 constructed at the Roswell Park Cancer Institute by the group of Pieter de Jong. For further details see http://bacpac.med.buffalo.edu/. FEATURES Location/Qualifiers source 1..124990 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="22" /map="22" /clone="434P1" /clone_lib="RPCI3" variation 240..242 /note="clone 449O17; TCT in this entry; substitution" /replace="tat" repeat_region 828..1120 /note="AluSx repeat: matches 1..295 of consensus" repeat_region 1697..1797 /note="MIR2 repeat: matches 43..143 of consensus" prim_transcript 3453..3874 /note="match: 3' EST AA489791 clone 839545" prim_transcript complement(5202..5634) /note="match: 5' EST AA489921 clone 937202" repeat_region 7722..7932 /note="LTR7 repeat: matches 450..238 of consensus" repeat_region 7994..8045 /note="LTR7 repeat: matches 450..399 of consensus" repeat_region 8044..8073 /note="LTR7 repeat: matches 30..1 of consensus" prim_transcript 8168..8495 /note="match: 5' EST R31219 clone 134314" repeat_region 8701..8726 /note="13 copies of 2 mer 92 % conserved" misc_feature 8911..10548 /note="Putative CpG island" variation 9314..9342 /note="clone 449O17; CCCCCGCTCCCGGCGGAGATTCAGGGAAC in this entry; insertion" /replace="cc" variation 9837..9839 /note="clone 449O17; AGC in this entry; substitution" /replace="aac" variation 10447..10449 /note="clone 449O17; CCG in this entry; substitution" /replace="cgg" variation 10477..10479 /note="clone 449O17; CTC in this entry; substitution" /replace="ccc" repeat_region 13268..13375 /note="MIR repeat: matches 82..197 of consensus" variation 13364..13366 /note="clone 449O17; ACT in this entry; substitution" /replace="att" repeat_region 13720..13975 /note="AluSc repeat: matches 2..257 of consensus; incomplete repeat" repeat_region 14653..14952 /note="AluSx repeat: matches 1..301 of consensus" repeat_region 15610..15909 /note="AluJb repeat: matches 301..1 of consensus" repeat_region 15961..16261 /note="AluSq repeat: matches 303..1 of consensus" repeat_region 16353..16442 /note="MIR repeat: matches 186..94 of consensus" repeat_region 16612..16913 /note="AluY repeat: matches 2..301 of consensus" repeat_region 17001..17298 /note="AluSg repeat: matches 298..1 of consensus" repeat_region 17530..17832 /note="AluSq repeat: matches 303..2 of consensus" repeat_region 17862..18140 /note="L1ME3A repeat: matches 419..136 of consensus" repeat_region 18155..18453 /note="AluSp repeat: matches 303..1 of consensus" repeat_region 18771..19074 /note="AluSp repeat: matches 303..1 of consensus" repeat_region 19075..19130 /note="AluJ repeat: matches 56..1 of consensus; incomplete repeat" repeat_region 19270..19757 /note="L1MA8 repeat: matches 217..698 of consensus" repeat_region 19758..20050 /note="AluSp repeat: matches 2..293 of consensus" repeat_region 20051..20392 /note="L1MA8 repeat: matches 689..1038 of consensus" repeat_region 20478..20557 /note="AluJo repeat: matches 84..1 of consensus; incomplete repeat" repeat_region 20599..21908 /note="SVA repeat: matches 4..1345 of consensus" repeat_region 21938..22238 /note="AluJo repeat: matches 1..298 of consensus" repeat_region 22282..22376 /note="MIR2 repeat: matches 142..50 of consensus" repeat_region 22407..22503 /note="MER5A repeat: matches 162..61 of consensus" repeat_region 22606..22904 /note="AluSp repeat: matches 1..303 of consensus" variation 22884..22885 /note="clone 449O17; CA in this entry; deletion" /replace="caa" repeat_region 22916..23215 /note="AluSx repeat: matches 1..302 of consensus" repeat_region 23260..23556 /note="AluSx repeat: matches 301..1 of consensus" repeat_region 23557..23683 /note="MER33 repeat: matches 137..1 of consensus" variation 23944..23946 /note="clone 499O17; GCT in this entry; substitution" /replace="ggt" repeat_region 24618..24687 /note="MIR repeat: matches 188..119 of consensus" repeat_region 25155..25311 /note="FAM repeat: matches 174..4 of consensus" repeat_region 25550..25837 /note="AluY repeat: matches 298..1 of consensus" repeat_region 25941..26234 /note="AluSx repeat: matches 299..1 of consensus" repeat_region 27121..27269 /note="MIR repeat: matches 203..39 of consensus" repeat_region 27609..27916 /note="AluSx repeat: matches 302..1 of consensus" variation 28177..28179 /note="clone 449O17; GGA in this entry; substitution" /replace="gta" variation 28246..28248 /note="clone 449O17; CCG in this entry; substitution" /replace="ctg" variation 28352..28354 /note="clone 449O17; ACC in this entry; substitution" /replace="atc" variation 28985..28987 /note="clone 449O17; CGC in this entry; substitution" /replace="cac" repeat_region 29042..29139 /note="MIR repeat: matches 89..192 of consensus" repeat_region 29493..29788 /note="AluSx repeat: matches 302..1 of consensus" variation 29510..29513 /note="clone 499O17; TTTG in this entry; insertion" /replace="tg" repeat_region 31445..31740 /note="AluSx repeat: matches 298..1 of consensus" repeat_region 31745..31809 /note="MIR repeat: matches 146..81 of consensus" repeat_region 31978..32200 /note="MIR repeat: matches 249..20 of consensus" repeat_region 32348..32646 /note="AluSg repeat: matches 300..1 of consensus" repeat_region 32973..33160 /note="AluSg repeat: matches 1..195 of consensus; incomplete repeat" repeat_region 33165..33467 /note="AluSp repeat: matches 303..1 of consensus" repeat_region 33468..33580 /note="AluSp repeat: matches 189..302 of consensus; incomplete repeat" repeat_region 35293..35382 /note="MIR repeat: matches 110..191 of consensus" repeat_region 35832..36023 /note="MLT1G repeat: matches 31..226 of consensus" repeat_region 36529..36603 /note="MIR repeat: matches 153..64 of consensus" repeat_region 36899..37199 /note="AluSq repeat: matches 1..301 of consensus" repeat_region 37217..37240 /note="12 copies of 2 mer 100 % conserved" gene complement(37978..39315) /gene="KCNJ4" CDS complement(37978..39315) /gene="KCNJ4" /note="inward rectifier potassium channel 4; (potassium channel, inwardly rectifying, subfamily J, member 4); (hippocampal inward rectifier); (HIR) (HRK1) (HIRK2) (KIR2.3); match: SWISS-PROT; P48050; IRK4_HUMAN; match: EMBL U07364 S72503; match: MIM; 600504" /codon_start=1 /product="dJ434P1.1" /db_xref="PID:e1249590" /db_xref="PID:g2832594" /translation="MHGHSRNGQAHVPRRKRRNRFVKKNGQCNVYFANLSNKSQRYMA DIFTTCVDTRWRYMLMIFSAAFLVSWLFFGLLFWCIAFFHGDLEASPGVPAAGGPAAG GGGAAPVAPKPCIMHVNGFLGAFLFSVETQTTIGYGFRCVTEECPLAVIAVVVQSIVG CVIDSFMIGTIMAKMARPKKRAQTLLFSHHAVISVRDGKLCLMWRVGNLRKSHIVEAH VRAQLIKPYMTQEGEYLPLDQRDLNVGYDIGLDRIFLVSPIIIVHEIDEDSPLYGMGK EELESEDFEIVVILEGMVEATAMTTQARSSYLASEILWGHRFEPVVFEEKSHYKVDYS RFHKTYEVAGTPCCSARELQESKITVLPAPPPPPSAFCYENELALMSQEEEEMEEEAA AAAAVAAGLGLEAGSKEEAGIIRMLEFGSHLDLERMQASLPLDNISYRRESAI" misc_feature 38058..39371 /note="Putative CpG island" repeat_region 39815..39985 /note="MER3 repeat: matches 191..15 of consensus" repeat_region 39986..40286 /note="AluJo repeat: matches 1..302 of consensus" repeat_region 40635..40713 /note="MIR repeat: matches 88..170 of consensus" repeat_region 41339..41401 /note="MIR repeat: matches 81..145 of consensus" repeat_region 41404..41700 /note="AluSp repeat: matches 1..298 of consensus" repeat_region 43158..43458 /note="AluSg repeat: matches 2..300 of consensus" repeat_region 43460..43493 /note="17 copies of 2 mer 82 % conserved" repeat_region 43870..44163 /note="AluSx repeat: matches 1..292 of consensus" repeat_region 44349..44649 /note="AluSp repeat: matches 1..303 of consensus" repeat_region 44673..44798 /note="AluJo repeat: matches 127..2 of consensus; incomplete repeat" repeat_region 45992..46103 /note="MIR repeat: matches 60..177 of consensus" repeat_region 46595..46895 /note="AluSx repeat: matches 2..302 of consensus" repeat_region 47265..47426 /note="MIR repeat: matches 72..247 of consensus" repeat_region 47883..48071 /note="MIR repeat: matches 225..27 of consensus" repeat_region 49004..49139 /note="AluSx repeat: matches 1..136 of consensus; incomplete repeat" repeat_region 49140..49442 /note="AluSx repeat: matches 1..302 of consensus" repeat_region 49443..49589 /note="AluSg repeat: matches 134..297 of consensus; incomplete repeat" repeat_region 49591..49889 /note="AluY repeat: matches 1..300 of consensus" repeat_region 50147..50357 /note="AluJb repeat: matches 1..213 of consensus; incomplete repeat" repeat_region 50358..50634 /note="AluSg repeat: matches 24..300 of consensus; incomplete repeat" repeat_region 50678..50977 /note="AluY repeat: matches 1..301 of consensus" repeat_region 50993..51298 /note="AluY repeat: matches 2..299 of consensus" repeat_region 51312..51439 /note="FLAM_C repeat: matches 2..129 of consensus" repeat_region 51443..51744 /note="AluSx repeat: matches 1..302 of consensus" repeat_region 53450..53745 /note="AluY repeat: matches 1..299 of consensus" repeat_region 53777..53852 /note="MIR repeat: matches 71..154 of consensus" repeat_region 53926..54216 /note="AluJb repeat: matches 302..1 of consensus" repeat_region 56233..56270 /note="19 copies of 2 mer 87 % conserved" repeat_region 56668..56834 /note="AluJo repeat: matches 299..129 of consensus; incomplete repeat" repeat_region 57856..58157 /note="AluSx repeat: matches 1..302 of consensus" repeat_region 58195..58278 /note="MIR repeat: matches 120..35 of consensus" repeat_region 58969..59262 /note="AluSg repeat: matches 3..300 of consensus" repeat_region 59269..59449 /note="AluSp repeat: matches 122..303 of consensus; incomplete repeat" repeat_region 59554..59655 /note="MIR repeat: matches 145..46 of consensus" repeat_region 60509..60621 /note="MIR repeat: matches 109..233 of consensus" repeat_region 60920..60974 /note="MIR repeat: matches 95..150 of consensus" repeat_region 61819..61944 /note="MIR repeat: matches 25..144 of consensus" repeat_region 62108..62301 /note="MIR repeat: matches 21..227 of consensus" repeat_region 63236..63306 /note="MIR repeat: matches 151..81 of consensus" repeat_region 64364..64520 /note="MIR repeat: matches 237..80 of consensus" misc_feature 65945..67525 /note="Putative CpG island" prim_transcript <66003..>107200 /note="match: multiple ESTs; match: AA364467 W04949 AA317696 W74547 W28864; match: D20347 W79753 H27811 D58580 N40642 H44626; match: H27375 H50947 T36041 H96047 N59053 AA436122; match: N55931 AA320753 AA313229 H88068 AA316092; match: R01497 AA226146 AA173560 AA603833 AA526433; match: AA125862 AA173724 AA125863 W47342 AA361529; match: T32847 T57093 AA076086 N92984 T08586 R95179; match: W68471 AA148819 W80954 AA548375 AA089558; match: H86131 H45560 N40614 AA300721 N20096; match: AA335050 N44138 AA134540 N27840 W52023; match: N29227 N79303 AA555111 AA136441 AA341371; match: AA247520 AA436016 AA064950 W28280 W80606; match: AA565910 T15649 C04214 W19256 AA492535; match: AA333596 T35964 Z42478 AA359496 T93985; match: W74488 AA579394 AA573697 T69888 N24145; match: N29232 N22684 W21343 AA043434 N23774; match: AA355304 AA600709 N53972 T35094 N26964; match: N93537 AA181085 AA622923 AA354616 AA491812; match: AA603774 N75793 AA366832 AA128872 W80852; match: W30977 AA393448 W52757 AA442787 AA186892; match: AA578079 C04664 N31162 H05269 L44416 N27902; match: AA311434 U46221 AA326161 AA345070 AA225523; match: W31236 AA622931 AA094812 T57163 AA150288; match: AA167118 N67385 R92096 N90759 N27867; match: R74556 AA535537" unsure 66470..67054 unsure 67409..67434 repeat_region 67595..67683 /note="MIR repeat: matches 145..50 of consensus" misc_feature complement(68676..68852) /note="match: H55400 5' clone C22_438" repeat_region 69246..69319 /note="MIR repeat: matches 81..154 of consensus" repeat_region 69338..69381 /note="22 copies of 2 mer 100 % conserved" repeat_region 69383..69684 /note="AluJb repeat: matches 302..1 of consensus" repeat_region 69853..69909 /note="MIR repeat: matches 81..137 of consensus" repeat_region 70271..70426 /note="MIR repeat: matches 197..46 of consensus" repeat_region 70688..70989 /note="AluSx repeat: matches 1..302 of consensus" misc_feature 72170..73164 /note="Putative CpG island; match Z79829; Z79830 22 CpG island DNA" repeat_region 73113..73407 /note="AluJo repeat: matches 2..299 of consensus" repeat_region 73409..73497 /note="L1ME2 repeat: matches 858..773 of consensus" repeat_region 73666..73809 /note="MIR repeat: matches 212..65 of consensus" repeat_region 73880..74182 /note="AluSx repeat: matches 1..302 of consensus" repeat_region 75493..75572 /note="MIR repeat: matches 86..171 of consensus" repeat_region 75622..75796 /note="AluJb repeat: matches 295..125 of consensus; incomplete repeat" repeat_region 75800..76100 /note="AluSx repeat: matches 302..1 of consensus" repeat_region 76119..76250 /note="AluJb repeat: matches 133..1 of consensus; incomplete repeat" repeat_region 76253..76569 /note="AluJo repeat: matches 301..1 of consensus" repeat_region 76581..76664 /note="MIR repeat: matches 152..59 of consensus" repeat_region 77417..77550 /note="MIR repeat: matches 41..176 of consensus" repeat_region 77617..77709 /note="MER5A repeat: matches 105..9 of consensus" repeat_region 77727..78018 /note="AluSx repeat: matches 1..292 of consensus" repeat_region 78121..78236 /note="FLAM_C repeat: matches 1..117 of consensus" prim_transcript <79261..>85809 /note="match: 5' EST W23902 clone 309865" gene 79417..93719 /gene="dJ434P1.2" CDS join(79417..79507,85706..85806,90776..90934,92395..92647, 93679..93719) /gene="dJ434P1.2" /note="similar to ER lumen protein retaining receptor 2; (KDEL receptor 2) (ELP-1).; match: EMBL; X63745; G31218; match: EMBL; M88458; NOT_ANNOTATED_CDS.; match: PIR; S28975; S19881; match: PROSITE; PS00951; ER_LUMEN_RECEPTOR_1; match: PROSITE; PS00952; ER_LUMEN_RECEPTOR_2" /codon_start=1 /product="dJ434P1.2" /db_xref="PID:e1249591" /db_xref="PID:g2832595" /translation="MNVFRILGDLSHLLAMILLLGKIWRSKCCKGISGKSQILFALVF TTRYLDLFTNFISIYNTVMKVVFLLCAYVTVYMIYGKFRKTFDSENDTFRLEFLLVPV IGLSFLENYSFTLLEILWTFSIYLESVAILPQLFMISKTGEAETITTHYLFFLGLYRA LYLANWIRRYQTENFYDQIAVVSGVVQTIFYCDFFYLYVTKVLKGKKLSLPMPI" repeat_region 80919..80995 /note="MIR2 repeat: matches 111..26 of consensus" repeat_region 81004..81304 /note="AluSx repeat: matches 300..5 of consensus" repeat_region 81307..81541 /note="AluSg repeat: matches 298..64 of consensus; incomplete repeat" repeat_region 82045..82347 /note="AluSq repeat: matches 1..303 of consensus" repeat_region 82865..83165 /note="AluJb repeat: matches 1..301 of consensus" repeat_region 83259..83358 /note="MER44A repeat: matches 1..100 of consensus" repeat_region 83381..83677 /note="AluSx repeat: matches 3..300 of consensus" repeat_region 83687..83872 /note="L1MC3 repeat: matches 2207..2394 of consensus" repeat_region 83880..84182 /note="AluSp repeat: matches 303..1 of consensus" repeat_region 84187..84279 /note="MER42B repeat: matches 1200..1300 of consensus" repeat_region 84326..84643 /note="AluJb repeat: matches 1..302 of consensus" repeat_region 84874..85176 /note="AluSx repeat: matches 1..302 of consensus" repeat_region 85886..85996 /note="MIR repeat: matches 31..148 of consensus" repeat_region 85998..86282 /note="AluY repeat: matches 1..301 of consensus" repeat_region 86289..86389 /note="MIR repeat: matches 145..248 of consensus" repeat_region 86390..86673 /note="AluJb repeat: matches 14..295 of consensus" repeat_region 86807..87116 /note="AluSx repeat: matches 298..1 of consensus" repeat_region 87167..87390 /note="AluJb repeat: matches 249..25 of consensus; incomplete repeat" repeat_region 87407..87700 /note="AluY repeat: matches 295..1 of consensus" repeat_region 87706..88006 /note="AluSg repeat: matches 300..1 of consensus" repeat_region 88012..88090 /note="AluSc repeat: matches 295..197 of consensus; incomplete repeat" repeat_region 88091..88395 /note="AluSg repeat: matches 300..1 of consensus" repeat_region 88394..88685 /note="AluSg repeat: matches 293..1 of consensus" repeat_region 89382..89443 /note="MER5A repeat: matches 14..75 of consensus" repeat_region 89447..89734 /note="AluSx repeat: matches 1..299 of consensus" unsure 89744..89750 /gene="dJ434P1.2" /note="Single clone area" unsure 89748 /gene="dJ434P1.2" /note="Potentially a C not T at this point" repeat_region 89823..90123 /note="AluSq repeat: matches 303..3 of consensus" repeat_region 90291..90365 /note="FLAM_A repeat: matches 24..99 of consensus" repeat_region 91222..91310 /note="MER5A repeat: matches 119..30 of consensus" repeat_region 91328..91613 /note="AluSx repeat: matches 299..14 of consensus" repeat_region 91693..91953 /note="AluY repeat: matches 45..299 of consensus; incomplete repeat" repeat_region 91961..92133 /note="AluJo repeat: matches 134..301 of consensus; incomplete repeat" repeat_region 93022..93203 /note="MER30 repeat: matches 1..230 of consensus" prim_transcript <96855..>117488 /note="match: multiple ESTs; match: R96695 H70107 H61512 H04507 H53252 H96560; match: H65636 H65637 H37935 W01164 AA044183; match: AA503841 N72548 AA262149 AA333349 AA348550; match: T54581 AA572830 R81737 AA252746 N57874; match: T77731 AA324199 R94367 H05403 H01443; match: AA468827 H01444 H05404 H22735 AA010513; match: AA324200 R31225 AA210334 H47426 H16817; match: AA030353 AA332144 R07082 R63047 AA461244; match: R02397 W27316 AA328644 T79920 R94376; match: AA331896 T99443 AA096498 H73433 AA310452; match: AA515040 W00452 AA052703 W20451 W22154; match: H69690 N77335 W80931 AA262882 AA282961; match: AA225397 AA282962 W27809 R96646 N72172; match: N74035 H83089 W38101 AA521056 H58854; match: H37885 N77264 AA455342 AA514565 N89086; match: T66834 AA460938 AA552865 AA552868; match: AA485944 T99547 D20629 AA290999 W75994; match: N44240 H80184 R16500 N55330 H16925 H62681; match: AA317102 AA311208 R02509 AA093452 N28449; match: R09376 H96686 R09377 W39009 AA377344; match: T77373 AA507288 W57781 AA358476 R99657; match: H53143 R02510 H75563 AA044072 AA001488; match: AA290614 N77284 T95441 R62991 H99687; match: AA337358 T95681 AA189759 AA595879 AA010407; match: H37831 M78310 N64108 AA332350 AA283014; match: H78808 AA491331 AA074363 AA232089 W38054; match: R99115 AA004969 N35999 AA337768 H24017; match: R08111 R08112 N55206 AA283101 R08431; match: R08432 AA331632 AA056074 AA018257 N86851" gene complement(97118..117183) /gene="P72" CDS complement(join(97118..97623,99062..99298,103239..103298, 104893..104954,105204..105314,105813..105985, 106069..106229,106979..107120,109268..109333, 109623..109756,110583..110682,112313..112463, 117134..117183)) /gene="P72" /note="DEAD-box protein P72; match: EMBL; U59321; G1592565; match: TREMBL; Q92841" /codon_start=1 /product="dJ434P1.3" /db_xref="PID:e1249592" /db_xref="PID:g2832596" /translation="MRGGGFGDRDRDRDRGGFGARGGGGLPPKKFGNPGERLRKKKWD LSELPKFEKNFYVEHPEVARLTPYEVDELRRKKEITVRGGDVCPKPVFAFHHANFPQY VMDVLMDQHFTEPTPIQCQGFPLALSGRDMVGIAQTGSGKTLAYLLPAIVHINHQPYL ERGDGPICLVLAPTRELAQQVQQVADDYGKCSRLKSTCIYGGAPKGPQIRDLERGVEI CIATPGRLIDFLESGKTNLRRCTYLVLDEADRMLDMGFEPQIRKIVDQIRPDRQTLMW SATWPKEVRQLAEDFLRDYTQINVGNLELSANHNILQIVDVCMESEKDHKLIQLMEEI MAEKENKTIIFVETKRRCDDLTRRMRRDGWPAMCIHGDKSQPERDWVLNEFRSGKAPI LIATDVASRGLDVEDVKFVINYDYPNSSEDYVHRIGRTARSTNKGTAYTFFTPGNLKQ ARELIKVLEEANQAINPKLMQLVDHRGGGGGGGGRSRYRTTSSANNPNLMYQDECDRR LRGVKDGGRRDSASYRDRSETDRAGYANGSGYGSPNSAFGAQAGQYTYGQGTYGAAAY GTSSYTAQEYGAGTYGASSTTSTGRSSQSSSQQFSGIGRSGQQPQPLMSQQFAQPPGA TNMIGYMGQTAYQYPPPPPPPPPSRK" repeat_region 98078..98348 /note="AluSx repeat: matches 5..282 of consensus; incomplete repeat" repeat_region 98406..98506 /note="AluSq repeat: matches 11..111 of consensus; incomplete repeat" repeat_region 98530..98821 /note="AluSx repeat: matches 11..302 of consensus" repeat_region 103669..103968 /note="AluSp repeat: matches 301..1 of consensus" repeat_region 104374..104659 /note="AluJb repeat: matches 293..11 of consensus" repeat_region 105453..105657 /note="MIR repeat: matches 230..8 of consensus" repeat_region 106266..106380 /note="AluJo repeat: matches 6..124 of consensus; incomplete repeat" repeat_region 106430..106721 /note="AluY repeat: matches 298..7 of consensus" repeat_region 107163..107202 /note="20 copies of 2 mer 88 % conserved" repeat_region 107445..107527 /note="MER46 repeat: matches 86..175 of consensus" repeat_region 107534..107832 /note="AluSx repeat: matches 297..1 of consensus" repeat_region 108201..108500 /note="AluSq repeat: matches 1..303 of consensus" repeat_region 108519..108820 /note="AluSq repeat: matches 2..302 of consensus" repeat_region 108828..108960 /note="AluSp repeat: matches 1..133 of consensus; incomplete repeat" repeat_region 109056..109124 /note="MIR repeat: matches 36..107 of consensus" repeat_region 109926..110214 /note="AluSq repeat: matches 1..302 of consensus" repeat_region 111066..111371 /note="AluSx repeat: matches 1..295 of consensus" repeat_region 111544..111658 /note="AluSg repeat: matches 1..115 of consensus; incomplete repeat" repeat_region 111710..112020 /note="AluSq repeat: matches 1..303 of consensus" repeat_region 112033..112164 /note="AluJo repeat: matches 2..133 of consensus; incomplete repeat" repeat_region 112710..112946 /note="MIR repeat: matches 3..262 of consensus" repeat_region 113313..113611 /note="AluSx repeat: matches 2..299 of consensus" repeat_region 115456..115657 /note="L1MB8 repeat: matches 578..365 of consensus" repeat_region 115948..116057 /note="MIR repeat: matches 205..82 of consensus" repeat_region 116111..116391 /note="AluSp repeat: matches 283..1 of consensus; incomplete repeat" misc_feature 116732..118083 /note="Putative CpG island" repeat_region 118268..118559 /note="AluSx repeat: matches 1..292 of consensus" repeat_region 118992..119118 /note="MER4C repeat: matches 350..469 of consensus" repeat_region 119125..119369 /note="AluSg repeat: matches 252..1 of consensus; incomplete repeat" repeat_region 120057..120339 /note="AluSx repeat: matches 293..1 of consensus" repeat_region 120347..120560 /note="L1PA10 repeat: matches 910..691 of consensus" repeat_region 120558..120850 /note="MER4A repeat: matches 392..123 of consensus" repeat_region 120851..121144 /note="AluSx repeat: matches 302..1 of consensus" repeat_region 121200..121335 /note="AluSx repeat: matches 87..226 of consensus; incomplete repeat" repeat_region 121355..121656 /note="AluSx repeat: matches 1..301 of consensus" repeat_region 122143..122336 /note="AluJo repeat: matches 115..302 of consensus; incomplete repeat" repeat_region 122485..122787 /note="AluJo repeat: matches 6..294 of consensus" repeat_region 122804..122954 /note="AluJo repeat: matches 281..130 of consensus; incomplete repeat" repeat_region 123156..123455 /note="AluSx repeat: matches 302..3 of consensus" repeat_region 123985..124280 /note="AluSx repeat: matches 1..296 of consensus" repeat_region 124758..124990 /note="AluSq repeat: matches 303..59 of consensus; incomplete repeat" BASE COUNT 31879 a 32311 c 31060 g 29740 t ORIGIN 1 gatctttaac attaagcttt tctaaataag actgaagatc aaggattagg gtaaaggaaa 61 gaatgtcctg aagtctcttc tttgttgcta tcacaatgga ggatcaacac atgggctggt 121 gaccagcttg gtgtgatata atagaacact ccagagaccc gagttctagt cacaaccttc 181 aagttgtcac tgccacatac cacatctttc tattatctat ttaacgcagt tggttaaact 241 ctaggatggc ctttacagtt ccaatattct gtggttctgg ggatagatct acacactgaa 301 aagaatattc aattactctg ctaatcagaa tgtgaagtga cttaggtttc tatacataga 361 agcttaagtt tcatttgctt attcaacaag tatttactga atgctagcgt cccaggcaag 421 gttttaactg aataacagca cataccctag aaaaacagtt aacaaatgct gacaacgctg 481 ttttccctgg ccaagaacaa agactcattt atttgaggcc cttcctactt gcagatgcta 541 aaatacgctt attttttaaa aaatcaaata gtggcaatat gcaatagtgg agttttttgg 601 gtttttttta atgtccttat ttagcaaaat caaagttagg tgagaatttg aaaaaaaaaa 661 aacaaaaact aagttagtcc ctacgtcagc tctcaacatt attactgtat tttagaaact 721 tgactaatat gtgaaactgt aattttgggg aactccactt ctctactgac atcctttcca 781 aaaagagaca ccagctctac agtctcggga ttagaaatat gatgtgaggc cgggcatagt 841 ggctcacacc tgtactccca atgctttggg aagccgaagt gggtggatca cttgaggtca 901 ggagttggag accagcctga ccaacatggt gaaacccctt atctgctaaa aatacaaaaa 961 ttagctgggc atggtaatac acacctgtaa tcccagctac ttgggaatca cttgaaccca 1021 ggaggtggag gttgcagtga gccaagatcg cgccactgca ttccagcctg ggcaacagag 1081 cgagactttg actcccccct cttgcccaaa aaaaaagaaa tatgatggag tgttatttcc 1141 cactcaggga aacagcataa cttagttact gaaaatattt attaacagac agatgttttg 1201 gttccaattc tcaattatct gggttaaaga agttaccatc agtctccaca caagaccttg 1261 taggattatt tctctgcagt gaaggacaat gaaggaggtt cagtgggaaa cgatcttagg 1321 acagaggaag tttccacttt gcctgggagg gtgaacagcc tgggagacct ggagatgagg 1381 gcaggcattc taggccaagg ggtcttcttg tgtggtcaga tggggaagga aaccctcctg 1441 gcaaaacagc agcaattctt tctccaggaa agcttctgag actgactcta atagtcaatt 1501 gtgatcaact gtcctaggct cccagggcca cagcaatcct taggcacccc aaatgtgcca 1561 tccctgacta cccagcacac cctaacccca aagcattgtg acagctcaag gatgacttcc 1621 tctgaaaacc tctgtccaga tctaaaacca tcctgtttgt ttgcttgtaa ctgtcttttt 1681 ctaaaagtaa cacctcagga tggagactct gttgccttgt ccactgctgc ccccaatgcc 1741 tggaacaagg tgcctgacac aggagtaggc acgcgacaca tagctggtga ataaacgtca 1801 cggaattagc acacgtgtag aatcagtacg cacatgtttg gccacagtca ctaaacgtaa 1861 ctgatcagaa attaaaatta agttgcatgt tcatcaaaac ttctacatgg ctatcagtga 1921 ctgccatcta attgccattc tgcaacttca agagtacaat ttcaagagcg taataaaaaa 1981 atcatattta atcacaccat aaagttcatc agcacaagca aaacaaaaac tgcactattt 2041 ggatttatac tcactctcct tacgaccctc caaaacaaaa atgctgggca tcctagggct 2101 atctctgaca gcaaatgcct tcagctcttg acttagaaac tgctaacata ctagacaaaa 2161 cactttcctt aaactctaag acaatgaata atactttaaa ccaggtttac atcgcttaaa 2221 accaattcgc tcatcagtaa tcgttaaagt ggttgtaaca ataaagatgt taataacaga 2281 atacagcaaa atagaaattc taatctacta acagcaaatc accattccaa ttgaatggtg 2341 tgtgtgcaca aacacaagcc ttcccagaag cctccagctc tccacactcc tgcacacaga 2401 aggcagcagt aaaccgatgg cactgctgtt aaagtaaatt gtctttctca acaaagaact 2461 ggaggggaaa aggaggatat tataatccaa gactataatc taaatgcttc tagcttgcaa 2521 gaaagtgagc attacgaaca cgctaagctt actgacaagt acttcacaaa aatggtttcc 2581 aatgtttaaa aacattttaa aatctttact caggtaaacg acagaactta actcatatac 2641 cccctaacac ctaaggcaag tgggctattg agtatagcca tattttgaaa gaaacttcta 2701 ttgggattat ttcagtagat acttttaaaa tttaccagag cataatcttt gaaacccagt 2761 gggaaccaca ctgctacaaa aggctggaat agaagtgtga tgttctttag gtcactaatt 2821 acacactgat ttgtgactaa agctaccatg aattctacca ctacctatag atcttgaaat 2881 tagcaaatag gacacattat acactgctta ataaaaagca cagtaatctg tgcttaagta 2941 caactaacta ggtacaacag ccactttgct gacaattact gtaacttggg aatatttttc 3001 tttacatcct gatgtactca aactgttaac ttttagaaac agcacagcct ctttgtttct 3061 tctgcttaat aggttatctt acactgttga tatggaatat gtatctacca taatacaaag 3121 caataaggta ttcaagtcct caaaaacaca gaaaaattat atttatacac aatttgtcat 3181 actgctaaca aagctacttt ttcccaagtt actgaggtgt ccaaataagg agttattaaa 3241 aaattcaact taaattgcta ttctctggcc aatgcatttg taccctgaaa actacaaaat 3301 tatctccaag acctggctag aagtttacat tttctctgaa agtttttcct ctgtctttta 3361 cacacagtat tttttcttaa tctgacaagt ggagaaaact ctgcatagcc atctactaac 3421 aatcaaaaag ctacaggttt ttttattttt tatttttaat gctctaaatt cttcaacact 3481 gtttcaagga aattactagg ggtcacaccc acgaggctaa atggtcaacc caactttcct 3541 aatttagctg tggaaagatg cttccaagtt aagcgaaaga aaaagattgg aatattcaaa 3601 gtttatattt aaggttacaa cctaaactct tcaacttggg aagcttgtaa aggaaataat 3661 ttcattttgc tgaaattgtt cataaattac caattttaaa taggattaga gaccaaagta 3721 ctataaaaat ataatcatat agcatatttg agtgtcatta actagaaact gcctgaattc 3781 atcctaaatc agttaaggtc agttaaggga tcttagtaag aacttaaagt ataacatact 3841 agttttagtt tcaatattaa cagatactgt catctgcaaa gtggttttta aaaacatcct 3901 gatctgaaaa aaagttataa ggaaagtgaa tttttaaaaa ctttaattaa gcaagactaa 3961 gcattgctat ccatttctcc tattactact acttaaccag tacaacctta aatatctgta 4021 aaaatttgac aggcctaact gcaccaaaaa aaaaagattt cagcaaatta attaactgtt 4081 aataaactgt tctatcttta cctgcactaa ctctctgcaa tgaaaacact gaacgtgtgt 4141 atacacacat ctatctatat atatatacac acacaaataa ctatacccac tactagacca 4201 gtacaccata ccactttttc caatgagcgc tctgaattaa acacaattca agtaacctag 4261 ctaaatacaa gcttaggcaa tgccttttaa ggcataagat cattcttctg caagatttaa 4321 atattcccct ccccatccta tcttttgctg actacatact aaagttgtat atctggctct 4381 ttccagtcac ctgtttgctt tcaaaatcaa gggggaaagt ctcctttcaa ttaataatac 4441 aatcattgtt cagtcgattt ataaactgaa gaaaaactct atattgatta caaaagaaag 4501 aaacacttct tcagtataac tgagatgctt tatgttatcc ttatttgata gaggtagaag 4561 taaaagtaac tattattcct aatgctgatt caagaaatgt ctagagtcta catttccagt 4621 ccatttaact acattttaaa ttccaaaggt ttccagacac accagaaaat aatggaacac 4681 caatacagaa acactacacg agtctgttaa aaacctagct tctgagtctg ggggagcttt 4741 tctataggaa gtacagtcat gtggaggtca gaaacaagtt ttgtaaagct taataagaac 4801 aaaggctaag taaggaatcc aactaactct ctcatacaac ttcttcagct gccctttcag 4861 gtttagtaat tttacttctc aactttatca taaaattacc aaactgtccc caaatttcca 4921 gtaactcaat tatgctgata ccctgggcag gaagtaatgg agatgttcag aacagtcagt 4981 caacaaatgt tcaaatggaa agcattccaa cattaaatat aaaagggtaa tatgtggcgg 5041 cagctggcta gcgggggctt cccccctatt tatctggaag atagatgtgt acaataattg 5101 agcatgtcat cgtgagctgc agattttagg aaagaatggt ccctgttctt ccactttctt 5161 aaagtcagat ttctatctct cttcgtggtc taaattcaaa aacctttgcc ccagatacca 5221 ctgcaaattt gtctgttatt actggttggg gaaaaataac ccagttcacc ttactccaaa 5281 aacatgagca tttcagaaaa gtttttattc tgcctctaaa atgaggccat taagtgatat 5341 tcaataatca ggcttccttc acccgaattc aagaaatttc ctgccgtctt tgtgtaataa 5401 acttttttct tccatgtgca ttttaatttt aacacccctc cccgccgcag aatctggaaa 5461 agaggtgtgc ggatcaagac ccctccaccc gccattgtct caagtttaat ttacatgctg 5521 taaccaaacc cttcactctt tcgaatattt aacaagattg agattccttt aattccatgt 5581 tggtagctgt tctgtaaata cctttcaggt ttttattttt taatatccca gtaatctaaa 5641 aactattctg ttgttagatt tcaattgaag gaactggatt taaaatctta ttacgtttat 5701 tttatcagtg gaatactcag tttatgcaac tgtctcatgt ttttcattcc tactttctaa 5761 ttctaaggct tgaaaacagc cattgctcag tccttcagct tcaccttccc aattcattca 5821 ttaaacattc tgctcaggga ttaaactgaa atgggaaatt aggaaagaac catttaaaac 5881 aaatcatacc cagcaacctc aaaatctata acatacaggc catatgggca tacaggaaat 5941 gggacaattt gaattttttt ttttaagcaa atggaaacaa agcttcatat aaaacataaa 6001 aagggtccaa tcatttactt tcaaatagag tcagctaaaa gtagctcctt ttttaaagtg 6061 gcatctttca ggcagtcaag agtcattaaa actactccaa agttcttatc aaccaatatc 6121 atgtttgtga caactgtgag aaaagaatgt aattttataa agtgggaaaa cagcacatct 6181 tttgggaccc ctgaggctct aatcaggcat ctcagaatta agagtgctcc agggttggga 6241 aatggctacc ctgctgtggc gaaaacatgc catttcataa taaacgttgt taagagagaa 6301 aaaagtgata agcatgtacc tgcaatgagg agatagaacg acttagtcat tgccaggcca 6361 ttatattcct taattatttc atcagtctat aagtgacttg atgattaagt cttattttca 6421 gatttcctcc aaaacaagtc cactggaagg ggattagggg ctcctactcc tccctgaact 6481 tttcctgcca aaaatggagg ctcacaaaga aaatgcaaat ctacccccac cttaaaaaag 6541 taaaggcagt caaagtcaaa accttcactg ctcctccttc ctcccctact acgtggaaaa 6601 aaattttgac ctgttaccaa aatagattcc tcagggtaaa ggggagaaaa gttttctaca 6661 aattttttct aagtggtgta ttagttcctt tttttgagta agaagtgaaa gggaatagga 6721 acccaacaca actgctaaac ttcgatctca caagttatta aaaaacagca aaggtcaaac 6781 tgcaaagcgt tagaaaaatg tgaagacaaa aggattaaaa gagtgggaca ctaaagaaaa 6841 aagtgtgtgt ccttcaccaa taaaagtcgt atcttgtcct ctacttcctc cttctgtagg 6901 caaataaaga caattctaaa ggactagctt tcctcatcat gccaaataat taaaattcca 6961 aacagattat taatcctatt ctaacaatta aaataaatga taaatctgcc ttgtaactac 7021 tgacaactgt attttaaata acttacggcc tcttgaattt aaatataacc gattaaaaat 7081 ggcagcaaca tcagggttaa gaaacaaaat gaaacactat tagtctccag aatattgaac 7141 aaaacaatat ctaaaataaa tatgacttct cagaaggaaa aaagttatct accattaagg 7201 acatacctta aaagttctta gcacacatta agtaagcaca aactagtggc ttgcctgtta 7261 tattttcaga caaaaagcaa tgaggttatt cctctgcact tgtcactgca gacacataaa 7321 ggcttttaga aactacatct attactcatt gtctaagtat cataaaactt agcattttaa 7381 gtcttgtttg atatcaatca agaaagaaca tccacttcag gggaaagaag tacagtaaaa 7441 taagaacatt gtaaatgtcc ctcttgacat atacaaaaga gaaggtaaaa ggcctcctaa 7501 gacccctacc tagtggccag gcctggacgg cagctgttgc tacctcacct tttcttttgt 7561 tgagcttaat ctcatgtcaa gtcattcaac caactcaaaa gcgatgaaga cattattgaa 7621 tcaatctgaa ctaaatcaga cctaggcttc ttaaaacata cagcttaatg cttccaaacg 7681 atctagaaaa ctaagaaacc tagctacgct gtaggacaca ctttcatatg cgtccgtgtg 7741 aagagaccac caaacaggct ttgtgtgagc aacatggctg tttatttcac ctgggtgcag 7801 gcgggctgag tccgaaaaga gtcagcaaag ggagatgggg tggggccatt ttataggatt 7861 tgggaaggta atggaaaatt acagtcaaag ggggttgttc tctggtgggc aggggtggat 7921 ctcacaaagt acattctcaa gggtggggag aattacaaac aaccttctta agggttgggg 7981 tcacggcacc aaatttcata cacgtccgtg tgaagagacc accaaacaag ctttgtgtga 8041 gcaatggcct ggcctgggct cagaggcctg acacacacag tggccaatta tacaggaccc 8101 ccaaactggc cagtggacca ctgcaaccgc catttacttt ctccgctgca agcaccatag 8161 gctgattcag gaagataaac tgaggctcaa ggaatttcga agtggaacaa tacaccaaag 8221 ccttaaacct gaaatgacta tccttttctg gggggtgggg gggaaagaaa aagaaaaaaa 8281 agtttctggg gctctcaggg tggcccagat gccagggtcc cagaagtggc cttttctagc 8341 tcctgtaact aaacctggcg gaaaactccc cgcctgctca ccccatccgc ccaagaatgc 8401 gccttcccgt cttcggtggc cctacccaga atcccaaaat gtggattcca acccaggccc 8461 tgaacgtctt ctcaaatccc cgggacttac gttgcggcgc gcgccccgct gtcgggtctt 8521 gcccctcggg cggtaccgcc caggcagccc taaatccagc ctcccaggcc cccagcagcg 8581 ccctccgccc ctccaccatc ccgtccggct cgcagtcggg gccaaatcga gagacaagag 8641 ggctgtgcct gaaactgagc ggtttcacca cttggcacct cctggcagaa acttcccttt 8701 aaaaaaagaa aagaaaaaaa aaaaaagcag cagcactttt gggctagcat ttcaatcctt 8761 cctgcccttt agagttccca gttctgcttc cagctgcctt tgggtgttcc actacaattg 8821 agttgtaaag atattcttta agtgtttata gaacattaag acttaagaaa aatctttaaa 8881 attagaggag ggaaaaaaaa gccaccttat cgcacacatc caggaaacgc agccccgtgc 8941 atccctgctc agggataagc aggcgcccca ggactcccgg ggacagattt ttgggcaccc 9001 gagggagtca cccggcgcac ctcggggtcc gcggagaggc ccagcccctc ccgcggtccc 9061 ttagacgcgc cctccgcctg gcccgtgtgg accgtcccgg ccattgttta cgggggaagc 9121 ctgtcccgac gcattgtttt ggccatttcc aacttccccc ggcccttccc ggggtatcgc 9181 gggggaccct acgccaacgt cccccctccg cccgcgcccc aagggccgac tgggcaaatt 9241 gggagacccg ccccgcgggg cgacccaact tttcggaaca gccccccacc gcccaccccc 9301 gcagaccccc ggacccccgc tcccggcgga gattcaggga accccgcacc ccaagccctt 9361 ctgaatcgtg cggcgtgagt gtgacggcca agagcggatg cagcccggga tcgcccgcac 9421 cttcccgcca gcggaagcgc aggagccggc tggggagggg gcgccctaga aagagcagct 9481 agaaagctga gacggggaac tgaggtcatc ctgggggggg acaagacaac gagagccggg 9541 cgcctcgggg gcggcgcggg agcctccgca ggaccaggcg ggcgccccgg ctggcgcggg 9601 cggggggcgc ccccctttac ctgtggctcc ggcccctcgg ccatttcctc gcgcggcggc 9661 tgccgggact gagctgactc cactcgggcc ggccgggttt gaaagaggag gagcgggcgc 9721 ggaggggagg gggcggggag ggcggaggga gggaggcctc gcgcagtttt ctcggccttt 9781 tgtgcggaca cctcccggat tccgcgcccg cacccggccc cccaaaagac acggggagcc 9841 gcaagcgagg ggtgcagcca tctgccgagg cgcctagtgc cttcgcgcct ccaagacccc 9901 caacaaaaaa ggagcgtccc gcacccccac ccccgcccgg aggatttagg ggccgggctc 9961 acctcgggcg cggggctaag tgcaggcgcg gggggggtcc ctagagccgc cagggcgcgg 10021 cgcgtccggc gctgggggac tgttgggtca gaaagtgttc agggagcagc tgctgcgccc 10081 tccctcggcc cgccgctcgg agacgccccg ccccgcccca ccccgccccc gcgcccacgt 10141 gactagcata ggcccgcccc cgctccgccc cccgccacag actccgcctc cgggacgcga 10201 gcgagcggcg agcgcgcgca ctcccagtta tcgctcggcg actcccgcgc acgcacgcgc 10261 cgtgccaccc tccccgcgcc ccccgcccca tacccctact cccgccatcc gatttaacgt 10321 ggcgggcgag cgccgcggcg gtagccgtga caggtacccg gcgggggggg cggggggggc 10381 cgcgagggtg tgcgcaggcg cagacccggg tcccgtcccc gccgccccct cctctgcaag 10441 gtgtgcccgg gcgaggggag gggcccgcgg cccgaactcc tgggtcaccc cgaattacaa 10501 acaaaacctt aacgccgttg ctcgcgggtt agaaggcagc tgtgcgcgaa aaacacctca 10561 gattttcttc aagcgtgagg aaggtcgaga agataaagtt tttaaatgag tatcttcaga 10621 aagctattta attgttctga tttttttttt tctgaaagga ctgggcttgc gctattctaa 10681 cttgagattt ctaaacttag actaggttcc aagaatctac ttggttttga tagaaatcca 10741 tttggaggaa accggggaca aaaaaaaaca aaaaaccgct accctttccc ctcctcccta 10801 cgcacaactc atttaaatcg tttcccaact gttttttttt tcttactgaa aaaaaggaca 10861 aaaaaaatcc tgattcctaa ggcctgtgat atttccccat tttaagtaag cctcgaggaa 10921 ccacagacat aacaaacata tccacaattt ttctacccca aaatgcaaac taatgcaaat 10981 gtgaaatgga tctatgcaag tcgctaccaa caaaatagcc aaatatttgt ttttttactg 11041 gccttcggca tatccaaaat gaaaaaactt taaaacacat caagaaaagc atttaaaata 11101 tctatttcct aaatcaaaag aaagaacgca ctacagaaac gatccggagc tggtgagcaa 11161 tgcaatctta ataaatagga aaaggaattt gagaagacag ctcagcaaag gggaagaatt 11221 tgccaagtgt catgttttgc tgtcatttta tatataacta ctaacccccc atatggtctg 11281 gttagtcttt tttaaaaagg aagccaatca attactcaca attaattagc agtgccggta 11341 gattcttagg ccagcactaa aaaactaaat tgggtgaatt ttttttcact caagtccact 11401 gatattgatc acttttagta ttaggagtcc ttttctgaca aaagatgaat acctatcttt 11461 aaaatatatt ttttaatctt attgtcacat tctaaaacac agtccaaaaa cacttcataa 11521 acaagaaact tcatttaact ttatgaaaac aaatgactat gagccttgct acacaacaaa 11581 gaaactatga acaagtgcaa ccaggttgag cctttaagtg tagcctcatt ctacatgtta 11641 atggataacc tttgaactct gagaacttga atgctgtaaa cacaatagca ctgacagtta 11701 aaaattccag tgtcaaaatc gagcaaaaag aaaagtccaa acttttcagg gggaataggg 11761 aaacaatcaa atattttgac aaataatcgt ttcagtaaga acaaggagat tcctggctct 11821 cagaccttgt tgctactgaa ttattttctg agcaaaaacg gttgtgcttg ttttccccct 11881 gtgccctcca aagctaccac tttgttcaca ttatagcatc atgatttgtt actgtttcgc 11941 tgcctccctc aaaaaaaaaa aagtgcctta ttatgttgta gcttttaaaa ttgaaagatg 12001 attatattca ctgatctatc cgctaagagt tttatttata tgtgatcctc cttatgtttg 12061 agcaaagaat tacaaagaac cttgtcctgg agtcgtgcca acgaaactgg gaaatcgttg 12121 caaaattctg ccagtgactg actgctgggt tgcaaataca tactgtgttt tcagaagaac 12181 atagagggat ctgaattcgt ccgggagggc ttccctgaga cacattcttc ccttagaatg 12241 tgagactcat acctcccatt tatttggaaa gcatgtatag ttcaaacaat taaaatatgg 12301 aaactccaaa agtaacagga gaacactgca aacatttggt tttgtaataa aagctagaaa 12361 ctcataactg catgaagtga aaattgaaaa tcgaacgttg gtcttcctat gcttagctcc 12421 acccacaaga aatccagctt tgccatcact cccttcctag gccatctcta aaggagtcag 12481 gtgctactag atgccagcaa aaaaaacaaa gaatggaacc aaaagagaag ctctgcacat 12541 ggtacttgga gaaggggcag aaaagcccca cttcaccttg gttcaatgca gcaaacatca 12601 gctacacctc atttttcaca attgtccaaa cagctcaggc agagctgacg aggagggtca 12661 gcaccttctg ccacctcctg ctgctgagct gtgcacctgt cccgcagtta actttctctg 12721 agctgagtca acctgagccc acctgtgggg agcccaaagc ttggtggaga cagatgccag 12781 ctggggccct ttgttgggag aagctaaagg acctggtgac tgcatgtggg ggcaagaaaa 12841 ggaagactcc aagggtttgc tcaggactct ggcccaggaa atgggggact gtgggactgg 12901 aaggggcctg gtggggagca gcagggggtg ctgagcttgt ttggcaccct ctgagctgga 12961 gaagtgtgtg gaggctggga gaggggccca gagggactgg aggtatagac aagacatgct 13021 cccagcttcc ctgggtcgca ggccaggcac tatttgatag atgagatcat cgagttccag 13081 ggaggaagga gagcgggcca tgtcaccaag gtcacccagt gatgagcatc cctgccgaca 13141 ggagtccagc tatcccagtc cctccacagt ccctgggact ttgctgctga ggggctcggg 13201 ctctccgggc gggagtccca aagacaggaa cagggcagac ttagagccca catggaggct 13261 gattctttgg gcggccttgg gcaagtctct tcacctttct gaggctcagt tgccgcacct 13321 gtgggtgtga aatgcctcct tccataaggg ttggaaggat taaactagag tttattttta 13381 ttttttattt atttatttat tttaatgtca tagcgctgga cccagcagca ggtatttagt 13441 aagcccggcg ggtggtgagc atgtcctccc cttccccttc tggctgaagc agttaggaag 13501 gctgcctgga ggaggcaggc cttgaaggta agggctgatt cagagagact cagaatgagc 13561 acaggagaga ctgccgggct ccccctggcc ccggggcccc tgaacacagg ccaagggctg 13621 tggaccttct gttgtcacag ctgcccatcc aggccccaca gcactggaac cgtagccaga 13681 gagagctctg cctgagaatg ggacaagaaa acaagcccag ccgggcgcgg tggctcgagc 13741 ctgtaatccc agcactttgg gaggccgagg tgggcagatc acgaggtcaa gagatcaaga 13801 ccatcctggc caacatggtg aaaccacatc tctactaaaa atacaaaaat tagctgggtg 13861 tggtggtgct tgcctatagt cccagctact cgggaagctg aggcaggaga atcgcttgaa 13921 cctgggaggc ggaggttgca gtgagctgag atcatgccac tgcactccag cctgggtgaa 13981 aaaaagaaaa gaaaagaagg aaggaaggaa agaaagagag aagaaagaaa agaaggaagg 14041 aaggaaggaa gaaaggaagg aaagaaaaaa aaagaaaaga aaagaaaaaa gaaaagagag 14101 agaagccctg attgccacca gcagctcata ccgattggcc ccaagcctgg ccagcgaatg 14161 ttcaaaaagc ctccctgacc ccattgcatc tgctctaggc aaagtcttgt tttctggaag 14221 gaaagcagca tttattaaag acactgctct gctctcacgc ttgcatgagc ctcttgccca 14281 tgacgttgct gccctgagat gcaagtacca ttactaccct cagggaagga gaggtcttgc 14341 cgaacccagc atcttgactg cagagctggg gctctaacca ctgggctgct gccttttgta 14401 aatgaatggc tccccatctc ctgcgagctc ctgagggcag agcggagtct cccctgtgcc 14461 tggcatgggc cctgcattca gcagcctcct caggaatttt gagtaagtga ataaagacat 14521 gtaagggtga cagttcagtt tcagagtgac ccagaggcat cgccctaaga tgcaaagatg 14581 cagtggatct gaacagttgg aggcggaatc acctgggaat gtatagagta gcttccagaa 14641 ggacacacaa gggaccaggc acggtggctc atacctgtaa tcccagcatt ttgggaggcc 14701 aaggcgggca gatcacctga ggtcaggagt tggagaccat cctggccaac atggcgaaaa 14761 ccgtctctac taaaaataca aaaaatagcc gggcatggtg gtgcacgcct gttagcccat 14821 ttactcagga ggctgaggca ggagaattgt ttgaacctgg gaggcggagg ctgcagagag 14881 ccaagatcac gccactgcac tccagcctga gcaacagagt gagactctgt ctcaaaaaaa 14941 aaaaaaaaaa aaggacacac aaggaattgg caccgggggt tgtgtctggg aatgggggtg 15001 ggagggagac atacttttct gcttgaatgt ctgtgttaca tattataatg tggtatcatt 15061 ataatgcgtc accacgcgca ggtccatgaa tgaatgcgga agtcagtcgt cacccgggag 15121 ctccctgggc atgtggactc ctgggctgca gccccaaagg gaggaagcag cagggctatg 15181 ggtggcaggg gccaggatga gctcctgggt gattctgtca caggtggccc tggaccacac 15241 ttaggaacaa aatcgccagc tgggtcctct tcccaccggc cacccagcca ggctcatttg 15301 catgatcttt tgcatatatt tgcatgactt ccctggttcc aaataaagcc tgttagtcct 15361 tatcctcact tctgctgggt gctgagcaaa gcttcttccc tcccgccctc ccctagcact 15421 gcccccgccc agacctggct ggcctggact aaccccgtcc ttctcacctc ctagccccag 15481 tcaaatatgt catcctttga ccccatctcc actttctcct agtcccccaa ggcctggcac 15541 acacacatca cctcccccac cccagagccg aaagcctgtt ctcagtccca cactcgtttt 15601 ttttttttgt ttgttttgct ttgtttttga gacagggtct cactctgttg tgcaggttac 15661 agtgtagtgg tgtgatctca gttcactgta acctctgcct ccagggctca agcgatcgtc 15721 ccacctcagc ctcccaaata gctgggacga caggccctcg ccaccacacc tggctaattt 15781 ttgtattttt tgtagaagca aggtttcgct atgttgctga ggctggtctc aaactcctgg 15841 actcaagtga tctgcccgcc ttggcctccc aaagtgctgg attacaggag tgagatacca 15901 tgcccagccc agtctcataa tctttagccc ctcttctgaa ctcccaaaac actggtcccc 15961 tctttttttt tttttttttt gagatggagt ttcattcttg ttgcctaggc tagagtgcag 16021 tagtgcagtc tcggctcact gcaacctcca cctcccgggt tcaaaggatc ctcctgcctc 16081 agcctccgga gtatctggga ttacaggcac gtgccaccat gcctggctaa ttcgtatttt 16141 tagtagtgac ggggtttcgc cacgttgacc aggctggtct tgaactcctg aactcaggtg 16201 atccgcctgc cttggcctcc caaagtgttg ggattacagg cgtgagccac tgcatccggc 16261 ccggtccccc tcttttcagt gcttctgtgt cctcctaaac atccactgag cacccagggc 16321 taaggcaggc cccttgccca tctcattctc cctaatcctc cagggactct gtgagacagg 16381 actccactgg gcccatttta cagatgaaga aactgagcct caggaaggtc ccatcacttg 16441 ccagaaagtg gcggctcttt ccatcccacc agcctgcctc tgcaaatggc cccgtggcat 16501 tttcgctcat ctcaagtgaa tgttatttgg gttttgtttt gcttcacttg ctgtaattag 16561 gtctttgaaa ttatgcaagt catatgcact tactgtaaaa acattaaaac cgctgggtgc 16621 ggtggctcac gcctgtaatc ccagcactct gggaggccga ggcgggtgga tcacgaggtc 16681 aggagatcaa gaccatcctg gctaacacgg tgaaatcccg tctctactaa aaaaatacaa 16741 aaaattatcc gggcgtggcg gtgggtgcct gtagtcccag ctacacggga ggctgaggca 16801 ggagaatggt gtgaacccag gaggcggagc ttgcagtgag ccgagatccc gccactgcac 16861 tccagcctgc gtgacagagc tagattctgt ctataaaaaa acaaaaacaa aaacaaaaaa 16921 aacacaacac taaaaccatc caggaatatt tgggtaaaat ataagtctcc tccttgaatg 16981 cacctgtccc ccaataacac tttttttttt ttttttttga gacagagtct tgttctgtgg 17041 cccaggctgg agtgcaatgg cacgatctcg gttctctgca aactccgcct cctgggttca 17101 agcgattctc ctctctcagc ctcctgagta gctgagatta caggcacccg ccaccacccc 17161 tggctaattt ttgtattttt agtagagatg gggtttcacc atgttggtca ggctggtctc 17221 aaactcctga tctcgtgatc cgtccacctt ggccttccta agtgctggga tgacaggcgt 17281 gagccaccgc gtccggcctc ccaataacac tttctaggaa gagccacagt gaacagtctg 17341 gggtctcctt ccagaactat tgacctacag acatgatata ttcaaatata tagacataat 17401 ttttattgta agatatacat gcaaaagaca tacaatttta aaaaaataga gataccatgt 17461 actcatcacc cataaaaact gctaacacct gaattatacc cctgtatgtc tctatgcctg 17521 cacctcctct tttctctcta tttatttctg agacagaatt tcattcttgt tgcccagact 17581 ggagtgcaat ggcgtgatct cagctcactg caacctccgc ctcccgggtt caagcaattc 17641 tcctgcctca gcctcccaag tagctgggaa tacaggtgcc cgccaccacg cctggctaat 17701 tttttgtatt tttagtagag atagggtttc accatgttgg tcaggctggt ctccaactcc 17761 tgatctcagg tgatccaccc gcctcagcct ctcaaagtgc tgggattata ggtgtgagcc 17821 accatgccca gctgtatttc tcttttttct tactattagc cacatatatg ttccttaagt 17881 aataagtgta agatgagaat tgctcaatta ttgtgtgtgc aacatgttca actttattag 17941 gcaaggctaa actatttttg caggagattg aaccagttca tactcccgcc caaagcagat 18001 gagagtcctc actgattcat ttccttgcca aatctgaaac cgttcaattt cataatatct 18061 gccaagccag tgagagtaaa atcccaaccc actgcggctt taatttacat ttcccttatt 18121 actaatgata ttgaatatct gtcttgttta tatcttcttt tttttttttt ctttgatgga 18181 gtttcgctct tgttgcccag gctggagtgc aatggcatga tctcagctca ccacaacctc 18241 cgcctcccgg gttcaaacta ttctcctacc tcagcctccc aggtagctgg gattataggc 18301 atgcactacc atgccggtta attttgtatt tttagtagag acagggtttc tccatgttgg 18361 ccaggctggt cttgaactcc ccacctcagg tgatccgcat gcctcagcct cccaaagtgt 18421 taggattaca ggcgtgagtc accacacctg gcccctgttt atattttcta tatccttagt 18481 gattctttca tctttttgat gtgtcagtca ctgaatgttt gtcataaaat ctctaattat 18541 attggcacca gcattatcga aagaaaggaa aaaaaactct gtttagaata atggatttgt 18601 tagcttctcc ttgatttgct ttatatagtt tgaggttgtg ttatcaggtg tataagagtt 18661 cagaattgtt tttatcttcc tggagttata tactccttct ttgtttctaa taatactttt 18721 gccttgaagc ctagtttgtc tgttattaat acaatgaaaa atcttttggt tttaatcttt 18781 tttttttttt gagatggagt ttcgctcttg ttgcctgggc tggagtgcag tggcgcgatc 18841 tcagctcacc gcaacctctg cctcccgggt tcaagcgatt ctcctgcctc agcctcccaa 18901 gtagctgaga ttacaggcat gcgccaccac acccagctaa ttttgtattt ttttagtaga 18961 gacggggctt ctctgtgttg gtcaggctgg tctcaaactc ctgacctcag gtgatctgcc 19021 tgcctcagcc tcccaaagtg ctgggattac aggcgtgagc cactgcgccc ggccctcacc 19081 tcggcctttc aaagggctgg gattacaaac gtgagccacc gtgcctggcc tcaatgtctc 19141 ttaaatctca ttcaatctat aggttcctac tctatctctt ctattttcct tgcaatttgt 19201 tagttgggga aaccaggtca tttgtccttc atataaaata ttttgtttaa tgaaattata 19261 tagataatga tctttagaaa tacatcttgg atataacaag tgttggagag gacgtgaaga 19321 aaagggaatc cttgcatatg gttggtggga atgtaaatta gtatagccat tatggaaaac 19381 agtatggagg ttcctcaaaa acaaaataaa aatagaacta ccatatgatc cagcaatccc 19441 actactgggt gtgtatccaa aggaaatgaa atcagtatgt caaaaagaca tctgcacccc 19501 atgctcattg cagttttatt cacaatagcc aagctgtgga aacaactgaa aatgtccatc 19561 ggtggatgaa tggatcaagg aaatgtggta tatatacacc atggaatact attcagccta 19621 aaaaaagaga cagtcctgtc actgtcagca ccgtgaacga acctagagga cattatgtta 19681 agcgcaataa cacaggcaca gaaagacaag tacgcgtgat ttcacttaaa ctatggaatc 19741 taaacaagtt gaactcagcc gggcacagtg gctctcacct gtaatcccag cactttggga 19801 ggccgaggca ggcagatcac ctgaggtcag gagtttgaga ccagcctgac caacatggag 19861 aaaccctgtc tctaccaaaa atacaaaaaa ttagctgggt gtggtggtgc acgcctgtaa 19921 tcccagctac tcaggaggct gaggcgggag aatcacttga acccgggagg cagaggttgc 19981 agtgagccga gatggcgcca ttgcactcca gcctgggtaa caagagcgaa actccgtctc 20041 aaaaaaaaaa gttggactca cagaagcaga gaggagaacg gtggttgtgc tgaggctggc 20101 agagggagtg ggggtgttgg gagatgttgg tcaaaggata caaaatttca gttagatggg 20161 aggaataagt acaagagatc cattgtacat ggtgaccaca gttaataaca atgtattgta 20221 ttcttgaaaa ttgctgagag tacattttaa gtgttctcac cataaaacaa taagtatgtg 20281 gtgacaccta cattacttag cttgatttag ccattccaca gtgcatacat aattcaaaac 20341 aacatgttgt gtacaataac tacaattttt atgttaattt ttaaaataaa aaattttaat 20401 gattttaaaa ataagtatat aaaatatatt ttattttatt attatactat atgttaattt 20461 ccttgtgtat atatagctct caaactctgg agctcaagca gtcctcctcc tcaacctaaa 20521 gtactgggat tacaggcctg agccactgca cctggcctac agttattatt attattatta 20581 ttatttcttt tttcagtatt tattgatcat tcttgggtgt ttctcgcaga gggggatttg 20641 gcagggtcat aggacaatag tggagggaag gtcagcagat aaacatgtga acaagggtct 20701 ctggttttcc tagacagagg accctttggc cttccgcagt gtttgtgtcc ctgggtactt 20761 gagattaggg agtggtgatg actcttaacg agcatgctgc cttcaagcat ctgtttaaca 20821 aagcacatcg tgcaccgccc ttaatccatt taaccctgag tggacacagc acatgtttca 20881 gagagcacgg ggttgggggt aaggttatag atcaacagca tcccaaggca gaagaatttt 20941 tcttagtaca gaacaaaatg gagtctccca tgtctacttc tttctacaca gacacagtaa 21001 caatctgatc tctctttctt ttccccacat ttcccccttt tctattcgac aaaaccgcca 21061 tcgtcatcat ggcccgttct caatgagctg ttgggtacac ctcccagacg cggtggcggc 21121 cgggcagagg ggctcctcac ttcccagacg gggcggccgg gcagaggcgc ccctcacctc 21181 ccaaacgggg cagtggccgg gcggaggcgc cccccacctc cctcccggac ggggcggctg 21241 gccggacggg ggctgacccc ccacctccct cccggacggg gcggctgccg ggcagagacg 21301 ctcctcacct cccagatggg gtggcagtcg ggcagagaca ctcctcagtt cccagacggg 21361 gtcgcggccg ggcagaggcg ctcctcacat cccagacggg gcagcggggc agaggagctc 21421 ctcacatccc acacaatggg cggccaggca gagacgctcc tcacttccca gacggggtgg 21481 cggccgggca gaggctgcaa tctcggcact ttgggaggcc aaggcaggcg gctgggaggt 21541 ggaggttgta gcgagccgag atcacgccac tgcactccag cctgggcaac attgagcact 21601 gagtgagcga gactccgtct gcaatcccgg cacctcggga ggccgaggca ggcagatcac 21661 tcgcggtcag gagctggaga ccagcccggc caacgcggtg aaaccccgtc tccaccaaaa 21721 aatgcaaaaa ccagtcaggt gtggcagcgc gtgcctgcaa tcccaggcac tctgcaggct 21781 gaggcaggag aatcaggcag ggaggttgca gtgagccgag atggcggcag tacagtccag 21841 cctcggcttt cacaactttg gtggcatcag agggagaccg gggagaaggg gagggggagg 21901 gagagggacc agttattatt ttttaaaatg tcttctcggc caagcacggt ggctcaagtc 21961 tgtaatccca gcactttggg aggccgagac aggtggattg cttgagcctt gggagtttga 22021 gaccagcctg ggcaacatag caagactctg tctctacaaa aaaatacaaa aattaactgt 22081 gtgtggtggt gtgtgcctgc agtcccagct actctggagg ctgaggtggg aggatcatct 22141 gagcccaggg aggttgaggc tgcagtgagc cgtgattgga caactgcact ctggtctggg 22201 tgacagagtg agacccagtc tcaaaaaaaa aaaaaaaagt cttctctccc ctacttacct 22261 actatgagta tagatgaggc aatttacttg acaaatattt aatgagcacc ccactatgtg 22321 ctgggcccca tgccagatcc tgagaaatat aacggtgaac aggcccgaca tattcctgga 22381 gcttcaatct gagatttggt agtttgcaga atcctgaaga gagcttatta aaatgcaggt 22441 cccaggaccc atctcacaca ggatgtcagg ggtgtgaggt ggggaccagg agtgggcacc 22501 ttttgccaat tgagtctagc cctgtcctac aatatgtaag ccacaaaatg atccacatca 22561 catcaattta aattttccag caattacatt aaaaagtaaa aagaaggccg ggcatggtgg 22621 ctcggctcac gcctgtaatc ccagcacttt gggaggctga ggcaggcaga tcacctgagg 22681 tcgggagttc gagaccagcc tgaccaacat ggagaaaccc tgtctctact aaaaatacaa 22741 aaattagctg gacgtggtgg tgcatgcctg taatcccagc tactcgggag gctgaagcag 22801 gagaatcact tgaacccggg aggcagaagt tgcagtgagc gctattgcac tccagcctgg 22861 gcaataagag agaaactccg tctcaaaaaa aaaaaaaaaa aaaaagtaaa aagaaggcta 22921 gacacggagg ctcatgcctg taatcccagc acttcgggag gtcaaggcag gaggatcgct 22981 tgaggccagg agttggagac tagcctggcc aacatggtga aactcgtctc tactaaaaat 23041 acaaaaaatt agccaggcgt ggtggtgggt gcctgtaatc ccagctactc aggaggctga 23101 gggaggagaa tggcttgaac ctgggaggcc gaggttgcag tgagcagaga tcgcgccact 23161 gcactccagc ctgggctaca gagagacttc atctaaaaca aaaaacaaaa aacaaaaaac 23221 gataatttaa agatagtttg caatctttct ttctttctct tttttttttt ttttttttga 23281 ggcagtctcg ctctgttgcc aggctggagt gcaatggcgc gatcttgggt cactgcaacc 23341 tccgcctccc gggttcaagc gattctcctg cctcaggctc ccaagtagct gagattacag 23401 gcgcccgcca ctgcgcccgg ctaatttttg tattttagta gagacggggt ttcactatgt 23461 tggtcaggct ggtctcgaac tcctgacctc aggtgatccg cccacctcag cctcccaaag 23521 tgctgagatt acaggcgtga gccaccacgc ccggcctgca atctttctta ctacaaagtc 23581 tttgaaatcc ggtgtgtatc ttacattcac agcacacctc agttacacca gccacatttc 23641 aagtgcgcag tgtgactcgg ggctactgtg ttggacagcg cagcccggag gctgatccgg 23701 gtagcccggg ggccacgctg ggagcagcgc agggctggat cgtgtctgtg aaactcactt 23761 tgcagactcc ggtgctccaa acaaaatgag ggaacaaagt catacccgga ctccaattct 23821 gcctcccact cctggctgtc tgccccaagg tccttccgcc ccgtcctggc cacgtgacct 23881 tgggagtccc ttcgcctcca gagccttgac ctgtacacct gagaaatggg cgtggtgact 23941 cctgctggca gcagcagtgg tgcgcgtggg gacacgcgtg cggggagtca ggcgcccccg 24001 ggtgctcgcg gcaggaagga gtgaagcctg cgggcgggtg tgggggcaag aactgaagcc 24061 gggggaacca gacgggctcc gggtgcgggg gctccgctgg ggccacgcca gtccgcctgc 24121 agcagccggc agctccggag gcggccaccg ggcggcgctg tggagcaggg accccggggc 24181 ccgcaggggc ccgcggagga gtgagaataa ttagtaacag cccctgtctg gggtgcactt 24241 cccgggggag gggagaatga ggtcactctg gtccttctgt gacaggaggg aaaacaggcg 24301 aagaagccca tccccccaac caaccccccc taattcaagt cggccaacac tgcggccatg 24361 ggaggatccg ggtttggagt cgggggctct gggtgaggtc ctgattctga cctctgaggg 24421 acctcgggct gctcggcctg gtctcccacc tgtagacggg gacacctgtg cctccctgca 24481 gatgccggct gcaaaggggc gttgccaact gtctggtgcc gagccagagt cagtgcgtgc 24541 ctgggattga gaggcgaact ttgagagatg aggataaaaa taacagctgc cgatgctgag 24601 ctaaatgcct actcttcttt atactctctt aacagccggt gaggttggta tgactgttgc 24661 ccccatctga cagatggggt agctgggcct ccaaggtctg gtgtggcctt acctggccca 24721 gcagagcggg accctgattc cagggttctg tgctgccttg tgaccagacc ttggcttaga 24781 cgggttcgga tgaaccttct cctgcacaca cccccaccca tggctttttc ctccctcccc 24841 tcctgggctc tcagaccctc tggccttccc tcccctcccc aaaggctgcc agggcccaca 24901 gccaggaccc ccagtctccc cattaagatc agaagcctct ccatgcccac agctcaggcc 24961 tagccagtga gagtggtcct ctggccgtaa tagtggtttc aagacaggca tgggactcaa 25021 gcctggccag tgtggctgga tgttgggact ctggcaccta gagcagttgg gaaacctgcc 25081 tggaggccta ggggctggga gcctggagcc cggaggcctt gggagatttg cttgagtctt 25141 gtcttcacct tctattttgt atttttctag agagacaggg tctcactatg ttgtccaggt 25201 tggtgggcag tggctattca cggggcaatc agactcgaac tcctggctca agccatcctt 25261 ccacctcagc ctcctgagta gctgggacta caggtgtgca ccactgcgcc ccacttggct 25321 tcccctttaa tggaggagct gctgactatg gtcttgtgct ctggaatgca gccaaagcac 25381 ctccagcttt ccacctgccc gcaaccccag ttctgtgccc acagtctgcc tcccaccctc 25441 ccgtccatct gcaccccgtc cccacattga ctcaccaaat ctagagtctg atcattggag 25501 ggtccctcgc aaccccttgc ctcctccaga aagaaaggct tggctccact ctctttttta 25561 ttttttgaga tggagtctcg ctctgtctcc caggctggag tgcagtggcg cgatctcagc 25621 tcactgcagg ctctgcctcc caggttcaca ccattctcct gcctcaacct cccgagtagc 25681 tgggactata ggcacccgcc accacgcccg gctaattttt tgtattttta gtagagacaa 25741 cgtttcacca tgttagacag gatggtctcg atctcctgac cttgtgatct gcctcccaaa 25801 gtgctgggat tacaaacctg aaccaccatg cccggcctgc ctttgttttt tatctgtaaa 25861 atgtggcaaa gggctgaaag ccccctagct cctctggaag ccagaaacac cctgcccgat 25921 gatggcagcc ccctttttaa tttaatttat ttattttgag atggtgtctc attctgtcgc 25981 ccaggctgga gtgcaatgga gtaatctcag ctcactgcaa cctctgcctc ttggattcaa 26041 gtgattcttg tgcctcagcc ttctgagtag ctgggattac aggtgcccgc caccacgccc 26101 agctaagttt tgtattttta gtagagacag tgtttgaggc caggctggtc tcaaacacct 26161 gaccttaggt gatccgcttg ccttagcctc ccaaagtgct atgggattac aggtgtaagc 26221 caccatgccc ggcccagccc cctcttttat aaggggactt tgtatatgct gtccctctgc 26281 ttggacgtcc ttccttagtg tccctagagg gggtaagagc atgactggac gcaggcaatc 26341 tgcaatccag tcccagctgg agacaacaaa gggcgctctg ggctgggggc actgcaggtg 26401 cgaggcctct ggagcgggaa ggcataggta gctcgggggg aaggttctgt ggggttggac 26461 gaatatattt tatggctcaa gttctaattc agcagccctg tctccctcct gcccctggga 26521 gaacttctgc tttccctgga atggggctgt ggaaggacag actggtccgt gggtccgggt 26581 gacctaattt gaataatgcg aacgggaatg ctgagattct gatatgcacg atggtcttgg 26641 ggcctatggg tcggggggaa tgaagaccct ggattggaat tggaattgga acccaaccag 26701 gaggaacagg cagcccaggg gtttggtgga gggttttggg tgacagggac gttaccagga 26761 ggcaggagag gcccaagcac accacttggc agtgcctggc agagcagggc agggcgcggg 26821 agggcgggtg acagtggctg gtacaggttt cagcagtgaa cacctcccgc tgcactctgg 26881 gcctccccat ggaggagctg gaagctaaaa taggactcta ttttgggggc ctgaggcgaa 26941 ctggagtcac ttccctgcca gaagtgagaa gttgcttcag cctctaccag ggcaggccct 27001 cccctatgcc ggagtggcag ggggtgggcc ttatgggtgc ggcctgaggc agagggtgcg 27061 ggccaaggag tgggagagga ccgtggctgc tgaggggctg aggggctggg ggcctggccc 27121 acagacatca tctccctccg tcctcagggc agcggtgtga ggtgggagct ggggcttccc 27181 catgttacag atgaggctca gagaggccaa gtcacctctc caaggtcaat cagtagcaag 27241 gggcagagcc aggatttggc caggtctgga tgacgcctga gatgtaatcc caaccccagc 27301 agtgggcgtg agcgggacct ccacaggcag aaggagcagc tgtttcctga ccacctgccg 27361 ggtgcctggc cctctgcacg cgcgatggct cagtgaggcc ctgtgtcccg ccccacagga 27421 ggctgcagcc tcgccccttc tcacagacag gcctgaagcc aagaaggcag gcctgggaag 27481 ctccagcctc aggtgagcgt ctgagttact gcccaaacgc aggcaaaccc ctttcctgtc 27541 tgggcctcag tgtccccatc cgtacttcca ggagatcggg ccacagtccc aaaagaccca 27601 tccaactttt cttttttttt ttttttttga gatggggccg ctcgctctgt cacccaggct 27661 ggagtgcagt agcgcgatct tggctcactg caacctctgc ctcccaggct caagcgattc 27721 tcttgcctca gcctcccaag tagctgggat tacaggcgcc caccaccatg cccagctaat 27781 ttttgtattt ttagtagaga caggatttca ccatgttggc caggctggtc tcaaactcct 27841 gacctaaggt gattcacccg cctcggtggc ccaaagtgct gggattacag gcgtgagtga 27901 gccaccgtgc ctggccaagg cccatccaac tctgatgttt ataacatgtc ccacggggag 27961 gctggcagag agctgggctg ggaggcagct gcaatgaccc ggggctttcg aagcctgcct 28021 tccccagcaa acccattctc caggtctctc agtgaggccc acctgcggcc gggggaagag 28081 gaggagtgtc tttcctgatg cagaccccgc tccatgctgc agtggatcca cgtgcacgat 28141 gccatgtagt tgcaggttgt cacccctgtg agagagggac tgttgttatg gcctcttttt 28201 tatagttgaa gcccagggcc aggaagtccc tgggcccagg accacccgag agggaggtca 28261 tggttgggcc cctcacccag gctgcgggct ccaggcccca cccttctgcc gccaccttca 28321 tgatgccaca aagtcgaagt ctgatttcat tacccccctg accccaagct ggtctttggc 28381 tgccaggagg tggcaaggcc atttgcggcc ccctccgcac ccagtccttc cctgcagtga 28441 ctgggctgtg aatagcctgg cccatcctct ccacggcccc acctttgccg tccccatgct 28501 caattgccca tatcctccct cctctcacct gccatttccc agtcattcca ttgtctcgcg 28561 ctctcctgcc cactgaggtg cctctggcag cttcccaggt cctccctgac tggttttgga 28621 accccaaaag tggcccccct attcctgctg ccccattcaa tgtcccacag cagccccaga 28681 cacctggaga ctggcagagg ctgcgagaag gctggccccg tccaggctgc cactctgccc 28741 acccgggcct gctggcttcc catccctgcc cagcccctgc tccaggccac tcagctgcca 28801 cgtgcagctc ctgaatgcca cctccgtcac ccactcctct gctcccagcc tgcccagagc 28861 tccccactgc ctacaggaaa aggtcccagg tcattagtgt gtcgtcctca cctgctcacc 28921 tggcccagcc cccagtacgg ccccacccat gccatgcacc ccccttcagc ctcactcgcc 28981 tccccgccaa cctctgccag ggtacagagg ctgaagagtc ttcaacctta aggccagaaa 29041 gccttgggca agacgtgtca cccctcagag cctcctcatc tgtacaatgg gggaatcaca 29101 gaacccccct cacagggcag atggggggtg tcaaatggga aatcagagag taaggggctc 29161 catcacctgg aaagtgcacc attcactgtg acccttaccg taggggattg ctcagattcc 29221 cctccagcgg tgataagaga ctgggggtgg gcctgacagg gtaggagtca ggggcggtgg 29281 aggagatctc acagccagga cagagtcgcc gcctgctcca gcccacaggg cagacatcgg 29341 ggtccccatc tgcagagaag aagataggag gcttggggca cacagctgcc aagcggaact 29401 ggcattggca acaggcctgt cctttgcccc caacacgcac gcagcggggc ttctctgtcc 29461 cctccagcct ctgggttttc ttcatctcct tttttttttt tttttttttt ttgagatgga 29521 gtctcactgt gttgcccagg ctggagtgca gtggtgccat cttggctcac tgcaacctct 29581 gcctcccagg ttcaagcaat tcttctgcct tagcctccca agtagctgga attatagcca 29641 ccacacctgg ctaatttttt gtatttttag tacagacggg gtttcaccat gttggccagg 29701 ctggtttcaa actcctgacc tcaggtgatc cacccacctc ggcctcccaa agtgctggga 29761 ttacaggtgt gagccaccgt gcctggcctt catctccccg atttgggttc cagccgctgc 29821 tgtagttgct gggctgtctt ggtctctgta tctccttagc ctcctctggc ttttccgtgc 29881 agctcccgca ccgctcagcg cccccggcct tctcccgggg ccatcccttc tttggcccta 29941 gctccttttc tctaatgccc accttcctct ctgcctggct gcatgcgggc ctagggtttt 30001 gccatcctgg ggtccctgag cacccagctg cttccccaca gacatgccag tccttgtagt 30061 gggtgggagg ctgtcacttc ccctccgtct gtgccccgac acctgttgct gtccacctgt 30121 gactcagtgt ggtgtctccg gacccacaag ccggtgcctc ctctgctgga gctccgcttg 30181 atctgttttg cgttctcttc tgcctccctc gggctcactc ttggcccctg ggcctcagta 30241 ggcctccctc tctcagccct tgtctctctg tcattttttg tctctgactc tctcagtgtc 30301 tctttctgtg tttctcctct ctctccccat ctctgtcaca acctatccct gtcacctcct 30361 cccacatcag tgtgtaatat cccttccatc ctcacagctc tgtcgcctct ccctgaaccc 30421 ttcccgcctg tgtgcactca cccttaggcc ctgcaccagg tgtagacaga ggccccgggg 30481 ggaggtgagg caggaggggc tcagacaggc ctgggggacg ttccagggat agcaagggga 30541 ggacagtgag ctgagcctct gccctgctta gcaggagccc caggaggcca gaggggcagg 30601 tgctggcctc cagaccctca tcccaagggg ccacatccca gaaggtaacc ctggtgacag 30661 gcggagggtc aggagcctgg ctctggatag ggctctgtgg ctgttcctct gtgtgatgtc 30721 gctaattccc ccaccctccc ctctcttcct ctgaactggg gacaaccaaa ggtctcttct 30781 tggcctcctt ctcactgtgg gcactgtgag agcggaaagc aacctccacc cagcaggggc 30841 tctggcttac ttgccccatt atcaaagccc ctgcaaccgc agtggacggc tccagcgacc 30901 atgaccaaga agccctggct gctgtcccaa gcctgagcag gtagcggcag gggtttgccc 30961 atggtcccac ctggcgtcgg tggcagggcc caggccagag gccggcttca ggactctcag 31021 cacagggctg aagggtgggg tcccagctac atctttgtcg gtttggttca gtgctgaacg 31081 gtttctgcta aaacaggcct tccttactca cagaaggtgc tcaacgggtt ttcatcgcgt 31141 gaccttgaga cgccctttag aatctgtggt gggtggggcc ggcgtgtgag ctgagggcca 31201 ggatccaggc acttgcttct ggctgtgtgg gaaggttctg ccaggttcca ggcagcaagt 31261 ggagtgaggg gtggccagtg attgggggag gggcaaagct aggcttggcc aattttgcta 31321 agtgcctatc tctgtctgtc ctcacctctc ccccagacct cctgccagct tctgaaggcc 31381 aaaggaacaa aatcctaggt ggtcaggcac atcaggaccc tggaggtcat caggtcccag 31441 ctcctttttt tttttttttt gagaccgagt ctcactctgt tgcccaggct ggagtgcagt 31501 ggcatgatct cagctcactg caacctccgc ctcccgggtt caagtgattc tcgtgtctca 31561 acctcctgag tagctgggac tgcaggtgtt caccaccatg cctggctaat ttttgtattt 31621 ttagtagaga tggggtttta ccatgttgat caggctggtc tcgagctcct gacctaaggt 31681 gatccaccca ccttgggatc aaagtgctgg gattacaggc gtgagccact gcacctggcc 31741 tagctcctaa ctttgaagat ggggaaaatg aggcccagga aagagaagag aacacccaag 31801 atcacacagt ggtcaaaccc aagctctctc tctagccttc cagttttcat agactgagag 31861 caaagtaacc atcaggtgtc cctttttcca tggctgccag gtagctcaga ggtccatttt 31921 caaattgtgt gtcttatatt gtaagccaat ccaaatccct tccagaaggc aattcgtatt 31981 tatttaggtc cagtaatgta cctgacacca tgccatgcct ggtcacatat gggatctcat 32041 ttaacgcaca cagcaagcat gtgagataga tatgaatggc tctttcagat aaggaaacag 32101 gctcagagac atcaattaac ttgcccaaag ttgcacagct agtaagtggt aaaggcagga 32161 ttcaaatcta ggaccctctg gctctgatac ccgtgatctt tcctctctgt gaataggcac 32221 atagaggcca gtgctctcgg ccccatgccc tggtctcatt attgtgccca ttgcctgcca 32281 ttcttcctca tcttcattca ctctcttgac taggagcccc tttcgtggct gtgaagggga 32341 atattggtta tttatttatt tatttttgag acagaatctc actctgtcac acaggctgga 32401 gtgcagtggc atgatctcag ttctttgcaa cctctgcctc ccgggttcaa gcaattctct 32461 gcctcagcct cctgagtagc tgggattaca agcccacgcc accacgcccg gctaattttt 32521 gtattttcag tagagaaagg gtttcgccat gttggccagg ctggtcttga actcctgacc 32581 ttgtgattca cacacctcgg cctcccaaag tgctgggatt ataggcatga gccaccgtgc 32641 ctggccagga tattggattt aaagctcctt cgctttccaa ttcctgggac ccaggtctac 32701 atcgtccttg taggaagatt caaaggctga agggtgcatt tctacagcct gccagacaga 32761 gcccagaatt ctgttagctt ttgaaccttg tactcagctt atgcaaacag gcacaagaag 32821 aaaaaatagt gaacaaagat gttcatcact gcgttgctcc aggtctcagt ttctctatct 32881 gtaaatgagg ggttgccttg atgatccata atggtgtata atttttgatc ttctataatt 32941 tgatgaacat tctatagtca ttaaaaattg aaggccgggc gcagtggctc acatctgtaa 33001 tcccagcact ttgggaggct gagggcggat cacaaggtca ggagttccag actggcctcg 33061 ccaacataac cccgtctcta ctaaaaatac aaaaaattat ccaggcatgg tggtgcacac 33121 ctgtaatccc aactactcgg gaggctgagg caggagaatc tatttttttt tttttttttt 33181 ttttgagatg gtgtttcact cttgttgccc aggctggagt acaatggcat gatctcggct 33241 cacttcaacc tctgcccccc cgggttcaag cgattctcct gcctcagcct cccaagtagc 33301 tgggattaca ggcatgtgcc accatgcctg gctaattttg tatttttagt agagacaggg 33361 tttctccatg ttggtcaggc tggtctcgaa ctcccaacct caggtgatct gcccaccttg 33421 gcctcccaaa gtgctgggat tacaggtgtg agccaccacg cctggccagg agaatctctt 33481 gaacccagaa ggcagaggtt gtggtgagcc gagattgcgc cattgcactc cagcctgggc 33541 aacacagcga gactccatct caaaaaaaaa aaaaaaaaaa ttgaaaagat gtagaatcca 33601 tcaataccca gaaaagtcaa tacctggaaa aaactgcaat gagaaaagct agacgcagga 33661 agcagtcatc tgatcgcagc catgggctct aacaaagaca ggggtgaacg caatgaggtg 33721 aaatagctga gtcggatgcc tcctaagttc ttttctattc ttctgtgagc tatttgagtt 33781 cataaggaag ataaatcttc cagaagctgg tgggaaggaa gccatggtga atctacaagg 33841 attaggcctg aaccgtggaa ttccaggcag gcaggcaggt gtggccagat gagggcgggg 33901 ttttttgctc ccattagatc cgccctcttc ccccctccct gtcttcctcc ctttcttcct 33961 cttcctcctt ccttgtcttc ctccctttct tcctcttcct ccctccctgt cttcctcctt 34021 ttcttccaca tgcatgatac cctcttcatt ctctccatcc tgcccaccac tcctgctccc 34081 caagaggctg tgaccgaagt ggggaagaga gggcagggca gggcagaggg ggtctcctta 34141 ccctggctct gcagggaagc tggtgtggca tgaggagggc ttggcagcac tttaaggccc 34201 cttcagcctg cccgttggca caattccaag ctccgtgaca tctaaggttc ttacaccccc 34261 agacgcagca actcccccca ccccgctccg tcagcgccgc aggagccgga gtggagtgag 34321 ggatcagagg agcggcacga ggagggaggc aggtgggagg gctgtgaaag gggaacaggt 34381 gggagaggga aggtgagtca gccccgagca gagctgagcc agcaggagtt tgcagcgcag 34441 ccaggcctgg gagcctccct gcagctgcgg gctttcggca ccgcgtggcc ccctccccca 34501 gtgcccacac tggcactgcc ctttggcact ccagaagtcc tggctttggg ctttcacttg 34561 cctcttctgt gtgaccctgc cacaggtcac cttgcatgtg accagagaga ccaggcctct 34621 cccaaaagag caactaccac aagcatatga gcaagctgca agctggtgaa gggcctgggg 34681 ggaccttgga acgccccatg ttgctctgga tggtggatga tgggccctct tgggggtaac 34741 cccacatgta ccccttcccc accctgcatc cgggcctctc gcacatctta gaggagggta 34801 gacatcccct aagacgtgct ctaggtggag gcttctgagt ctgggagcac ctgagcctga 34861 atcctcttca gttactgctc gtagcccagg aagccgtggc atcctgggga tgtggtggga 34921 ggagtgggga gacctggcga gacccaccca gggcctgcag ggctggatag agggacatct 34981 ctcatgagcc tccagggtca ggattgagac cctggggtac tcacctgaca gaaggggacc 35041 aggggtcctt ccagaatgtg ggcagatgcg ccactgaatt tccttcttgc tgagcggccc 35101 ccacaggctc ttcggtgggg gtgggggcca aaggccacag gtgcactctg ggaaatctgg 35161 gagggcacag aggaggaaag gaagcagccc ctccccctca ttgagccccc acctgggaga 35221 ggatggagtg aaggagaagc aggctcccgg gcctctgact gtcctcactt tgtttccagc 35281 cagccccctg gttctctggg cctctattta tttcctcatc agtaaaagga ggccataatc 35341 cttgtcctgg cttcctcttg gagtttcgta agagttcaat gaacccaagg acgtgaaagg 35401 tgttatgcaa atgagagaca ttatgagtta tctattggga ggcttggatc ctgagcttgg 35461 ctctgtctcc aggaaagagg cctcaagttt ccttctgcag cagcgcagca tacggggtgc 35521 acagtgatcc acccaaacgg cagctatgct gccggggctg gtgagacgtg gtcattcaca 35581 aggcagctcg ctgtgctgca cagcaggtaa gcttgcctgc tggttccctg tttctagagg 35641 cagctggcac ctttgtgtgg aagatccctg ggacccagaa gggtgccttc tgcttccttt 35701 cagcctgggg cacacagtca gcctgtagag tagaagtcgg gtgggaggaa ccctcaaact 35761 ccaaggctct gggatagatg accaactatg gccacagctc cctcctgctc ggcatgtcct 35821 tggcaatgtg actttgcctc tttcccaaaa ggtggaaact aatttcccac cctttgaatc 35881 tgggctggcc tcgtgacctg gactctggcc gatagactgc agtggaaatg atgttatcca 35941 agttcaaggg gctaggcctc cggagacctt gcagcttcta cctttgccct cgtaggttgc 36001 tgcccggagt cgccatgaag gaatgtgatg cagcctattg gaggagaacc aaaccccacc 36061 accaccccag aaagccatac cagctgccga catgtacatg ggctcatcgg gggttttcca 36121 ggcccactga tgttccagct gagcacagct gccgggtaag cccaggagaa atcagctagg 36181 aaccccagcc cacccacagg accctaagag aggatgcgtc attggtctct gttttgtgct 36241 ggctggttat agagcatagg ctaaccgaag gcagagaagg gaagggactt acccgaggtc 36301 ccacagagag cagagaccaa gcctgtcttg gttaagtctg tgaccaggtt tcctggcccc 36361 tgtgcccttc ccatcagtgt cctgccgcca gctctaccag cggtgtgtgg atctgcatgt 36421 atgtagggag gtgcggatcg ggtgggccct tcccacttcc acctgagtta ggaagggttg 36481 gctgacgtcc acacagggct gtcctgccca tcattgcagg cgggagggat ggttaatccc 36541 attctgcaga ggatgaaact gaggccaaga gctgacagtc acacagctac taggtaacaa 36601 agctctgaca tcaaagcttt tggttctttg atgccaactc cagatccttc tattgactcg 36661 ccggaaggag aatggcaggt ggtaaagttc cctccaaggg tcagtcactc agaaagggcc 36721 acatacacag gcacacacat aacacacagg cacacacatg acacacaggc acacacacga 36781 cacacacagg cacaattaca tgtgtgcttg ctcactctct ttctctcctt ctcacccctc 36841 catcttcaac cagcctaccc ctaatggacc atgtacttta ttaaaaatat atagttacgg 36901 ctgggtgcgg tgactcacac ttgtaatccc agcactttgg gaggccaagg tgggtggatc 36961 acctgaggtc aggagttcga gaccagcctg gccaacatgg tgaaacccca tctttactaa 37021 aaatacaaaa cttagctgag tatggtggct catgcctgta atctcagcta ctcaggaggc 37081 tgaggcagga gaatcgctcg aacctgggaa gcggaggttg cagtgagccg agattgctcc 37141 attgcactcc agcatgggca acaagagcaa aactccgtct caagaaaaaa aaataataat 37201 aaaaaataaa aaaaaaatat atatatatat atatatatat ggttacaaac tctgttgtca 37261 atctgggctg ggcccggggc tggagggact gcagggaggg agggatggtc cctcatccca 37321 cccctcagag tccctcctct gcacccaaac ccaacaccat ccaagggcca aggtggctgg 37381 cgttgagggc ctcaaggggc cagccagctg ggcaggggaa ggcgctaggc ttgatgggct 37441 gggtggggag gggtgcacag agagggactg catctccctg tccagcaggt aaaaacggca 37501 gaaagtgaat ttggtgtctc catctgtctc catttataga tatttacaaa agaaagtcat 37561 gagcaacagg ccacagtcat gcaaaagaaa cgcacccccc cctccccaaa ctaaccccct 37621 gcccaccctg aaaccccgag atgtcccccg cccagcagtc cagccacctt ccccaaggtt 37681 ctgagagcaa agagggcacg tccttgaaga gtcagtgggg gaagggtggt ggatccgggg 37741 acctagagcc accagaagta ggcgctggcg gaactcaggc tgatcggggc cgagctcttc 37801 ccaggcctgg gtgctggagt caggaggaag gggtccccca gtctttgaaa cccccacctt 37861 cctccagaag gtccacggag ccagggttgg ctctgtcctg agtgtgggag ggggtgtcct 37921 ggcatcccac ccccggcaga ggctcttgtg ggcagtggtg agggccgggc ctggaggtca 37981 gatggcagac tccctgcggt aggagatgtt gtccagcggg agggaagcct gcatgcgctc 38041 caggtccagg tggctgccga actccagcat ccggatgatg cccgcctcct ccttggaacc 38101 cgcctccagg cccaggcctg cggccaccgc ggccgccgca gctgcctcct cctccatctc 38161 ctcttcctcc tggctcataa gggccagctc gttctcgtag cagaaggcac tgggaggggg 38221 cggtggggcg ggcagcacgg tgatcttact ctcctgcagc tcccgggccg agcagcaggg 38281 cgtgccggcc acctcgtagg tcttgtgaaa acgtgagtag tccaccttgt agtggctctt 38341 ctcctcgaag accacaggct caaagcggtg gccccacagg atctcgctgg ccaggtagga 38401 gctgcgggcc tgggtggtca tggccgtggc ctccaccatg ccctccagga tgaccacgat 38461 ctcaaagtcc tccgactcca gctcctcctt gcccatgcca taaagcgggc tgtcctcgtc 38521 gatctcgtgg acaatgatga tgggcgacac caggaagatg cggtccaggc cgatgtcata 38581 gcccacgttg aggtcccgct ggtccagggg caggtactcg ccctcctggg tcatgtaggg 38641 cttgatgagc tgggcccgca cgtgggcctc cacaatgtgg ctcttgcgca ggttgcccac 38701 gcgccacatg aggcagagct tgccgtcgcg caccgaaatg accgcgtggt ggctgaacag 38761 caacgtctgc gcccgcttct tgggccgcgc catcttggcc atgatggtgc caatcatgaa 38821 ggagtcgatg acgcagccca cgatggactg gaccaccaca gcgatgactg ccagcgggca 38881 ctcctctgtc acgcaccgga acccatagcc gatggtcgtc tgcgtctcca ccgagaacag 38941 gaaggcaccc aggaagccgt tcacgtgcat gatgcagggc ttgggggcca ccggggctgc 39001 tccgccacca cccgccgccg ggccccccgc cgcaggcacc cctgggctgg cctccaggtc 39061 accgtggaag aaggcgatac accagaagag gaggccgaaa aagagccagg agacaaggaa 39121 ggccgcggag aagatcatga gcatgtagcg ccagcgcgtg tccacgcagg tggtgaagat 39181 gtccgccatg tagcgctgcg acttgttgct caggttggcg aagtacacgt tgcattggcc 39241 gttcttcttg acgaagcggt tgcggcgctt ccgccggggc acgtgggcct ggccgttgcg 39301 gctgtgtccg tgcatgtcct gaagccggcg tggtcacctg ggaagacgca gggcctgcgg 39361 aggagaagcc ggacaggtga gatgctgggg agcgaggggc tccctcgggg ccactcagac 39421 tcagccccaa gactctcaga agcaggtgag cctggaccag gacctaagac ttcctctgcc 39481 ccggcaggcg gagggtggca gcagaggtgg ctctgatgac agggtctgag ccagtgccac 39541 ccagtccggc ccagcttaga gactggggtc catagaccag gaagctaaag ccacaaggat 39601 ccaggtccta gtccctgccc ctcaacactg cccctccggc cctcctatgc tcactgttct 39661 ggaagcttcc tgatgcccat cttggttcct ccctgtccct ctgtcctgct tgccctgagc 39721 atagggagcc acggaggcgc taagcccgct tccctccctg gattccatct ctgcactttt 39781 gggcttctcg ggggctggac atcctcctgt gcactagcca ctggccacgt gtggctcttt 39841 acatttaatt aaaacgaaat aaaatattaa ctgcagctcc ttccttgcat gagccacatt 39901 tcaagtgatc aggagccaca tgtggtcaac ggccaccaca ctggacagag aagacacagg 39961 agagttctat cactgagaaa tttctggctg ggcacagggg ctcatgcctg tgatctcagc 40021 attttaggag gccgagatgc aaagattgtt tgtgccagga gtttgagacc agcctgggca 40081 acagagtgag accccatctc tacaaaaaat aagaaaatta gccaggcatg gtggtgcatg 40141 cctctggtac cagctactca ggaggctaag gcaggaggat ctattgagcc cagggggtca 40201 aggctgcagt gagctgtgat catgccactg aactccaacc tgggtcacag agcaagaccc 40261 catgtcaaaa aaaaaaaaaa aaaaaaaaag aaagaaaaag ttctactggg acccgcagcc 40321 ccacctgggt ctgagctcct ttcttaaact cttctctccc agtgttgctt tgagtccttg 40381 agcccaaatt cctgcccagt gcccacactt gttaggagca gtggtgaggg gagaatccaa 40441 tgtgacagtc agggtggagg aagggggagc ccccagctct gactcctctg tagacatcgc 40501 tattcccatg tcccaggtgg ctcagagatg gaaagccacc tgccagccac aaagctggca 40561 gtggatacaa acttgcgtgg caccaacatg aatgagttga tggcggggag acagggagtg 40621 ggaggggagg acacacctcg ggcaattcag ttcacctctc agagcctcca tgtcttcatg 40681 taacatgggc accaaaactc cctacctcat gggacctacg gggacagcag gtgctggggt 40741 aaagccatca ccctcctgcc ttccttccca cacttcccag gagctggtga caggcaggat 40801 ttgcctgtca agcatcccca gctgttgcag cagaagaggc ccctcacccc actgggtggc 40861 gggatgcgcc tcgggcccgt agaggacaga ccttctctcc acccctctct gccattgcgt 40921 ccccagggaa gacatgctgc tgccccaggc cccagcagct ggcagggcca ggactgggcc 40981 tgggttccca ataacatttt ggcccagaat tacataaggc tggaagtttc tattttctgt 41041 tctattcttc ttcctgatct aattgtgcat ctgtggcaat gtttatacac ttccccggtt 41101 caggacaact cggttgccgc agcaaccggt acgcgccaac gctataattt agagatcaac 41161 agcctcagag cctgcctctt cccaagagaa actccctctc cgggcccagg gagagtgagc 41221 cctgccccca cctcctcccc cgcggtcacc caccctgact ccgagttctt ctccctattc 41281 ccctggttct caaagagctc cctggctcct acagtcctgg gttgaagtcc tagatccact 41341 gtgtgacttg ggcgagctgt tgactactct gaacctcagt ttcctcatct tgaaaatggg 41401 gctgactggg tgcggtggct cacgcctgta atcccagcac tttgggaggc cgaggcgggt 41461 ggatcacctg aggttgggag tttgagacca gcctgaccaa catgaagaaa ccccgtctct 41521 actgaaaata caaaaattag gccgggcatg gtggcgtgtg cctgtaatcc cagctactgg 41581 ggggctgagg caggaggatc gcttgaacct gggaggcaga ggttgcagtg agccgagatc 41641 gtgccactgc actccagcct gggcaacaaa gtgagacttc gtctcgaaaa aaaaagaaaa 41701 tgagaaaatt gggccaatga gatatcatgc tggagacccg acaggtcatg caggcacggc 41761 ctctaacgta atgctggccc ttcagagctg ggaggcccag accctctttg ccccctcccg 41821 cagcatccca gcagccaggg ctccaggaga aaggcttggg gttctggggc caggcacacc 41881 ttctgctctg taaatagagt tttcagggag gcctttcttt gctaaatgta ttccccctgt 41941 tctgtcccac tgcctgaaat ggctccaggg gcaccgcgga ctgtcaggac aggaggagca 42001 tagggtcacc gagttcagga ccctcatttt atatcggaga ccaaggacat ccagagatag 42061 gaaggacttg ctaagagtcc cagagtggag ccaggtgggc cagccctgaa caaggcctca 42121 cccccacccc agagctcctc atccacttcc tccctgcagg gggctggagt gggttttggg 42181 gcaagggaag agaagaaacc agccagtcca aactggacca ccatccacag gtgtgcagga 42241 tagaggcaga ggggacaggg tgtggcggcc caggccttca acgggtgagg tcactgagct 42301 ccagagaagg tggggggccc tgcccaaggc caaacacaag cccagggttc caggggatcc 42361 ctggttctct gcccggtggg attgcctcct gacatggcga gttcacctgg tgctgcagac 42421 gagatctctt cctggacttc cccacctctg gaggggcctt gggggtaagg gagccgatcc 42481 cctgatggct acagagattg ggcatcccct cgccagcagt gcccactcct ccccattatc 42541 gagtgcccag ggtcaattct gagcttccac aatcctcagc cccagcggca acactcactc 42601 cccaggccca ccagcccact ggcctccaca gtgcagcccg atggggacat ggtttacccc 42661 tcaccccggt gcctccccga cagcccacag caggtgtcct gagcccttgc tatgtgctgg 42721 gccctgtctt gggttcccag tgcccccagc acatcctgga gcttagcagg aaagctgctt 42781 ccctggcact tctcagacct ccacccttcc ccatggagca gcctctgtga catcccatcc 42841 ccctgaaaac ccttggtctg cagtgacttg ctcaaataag agcacgctgc tgccaccaca 42901 gaccatttta gcaggagaga gaggttcagc tgctaattcg aggctgaccc gtgcaaggcc 42961 tggccaaaaa gtgaagggta tgagcttggc attgtgttct catgtgacaa ccaccaggag 43021 atggccagat tttctcttcc ttgtcacatg gggcttccca gggaaggctg acgatggcag 43081 gagagggtca aaaggccttg acaattcagt accatcctcc tctgactaga gatgtttttt 43141 tttttttaaa gggccctgct gggctaggtg actcacgcct gtaatcccaa cacttttgga 43201 ggctgaagcg ggcggatcac aaggtcagga gttcaagacc agcctggcca acttggtgaa 43261 agcctgtctc tactaaaaat tcaaaaaaat tagctgggca tggtggcggg cgcctgtaat 43321 cccagctact agggaggctg aggcagaagg attgcttgaa cctgggaggc agaggttgca 43381 gtgagccaag attgagtgac cgcactctag actgggtgac acagagagac tccatctcaa 43441 aataaataaa taaataaata aataaataaa ataaaataaa ataaaataaa aaagacccct 43501 ggttgtttag gtgtctgctt tatccctatg attaaaccac ttttgtggat gcagggatgc 43561 agggcctaat ttgcttgcat ggaatgtgca ctagaagaat gcagtttgag gcttgttttc 43621 tgcgatggag gccctgggag gtggagaggc cctctgtcac agccctggta tgtgaccttg 43681 aacaagttct tcctctccct ggcatcaggt cctcctccaa ttaagggact actcatcctc 43741 cccaactttt ttgaaggtca cagagcttga catagaacca gagggattgg tgaaatgggg 43801 cctgaggttt ccaaaacccc cagtgcaggc cactagccca tggcaaaatg agcctaagaa 43861 ggatcaatgg gccgggcgcg gtggctcacg cctgtaaccc cagaactttg ggaggcagag 43921 gcaggcggat ctcttgaggc caggagttca agaccagcct gagcaacatg gcaaaacccc 43981 atctctacaa aaaatacaaa aaaattagcc gggcatgatg gtgcacgcct gtaatcccag 44041 cttctcggga ggctgaggca tgagaatcgc ttgaacccag caggcagagg ttgcagtcag 44101 ccgagatcac accactgcac tccagcctgg gcaacagagc aagactccat ctcaaaaaaa 44161 agatcaatgg tactgctgtc ctttatatcc tgggactctt tcagtggcaa acattttttt 44221 agaaagtgat ggacatagga gatgctgact gctttttaaa tgttaatgtc ttttcttggc 44281 aaaatgaaga agtgacaatg ctacaccagt taccagactt ttcttttaat ttttaaaaat 44341 ttattattgg ccgggtgcgg tggctcacgc ctgtaatccc agcactttgg gaggccaagg 44401 cgggcggatg acctgaggtc aggagtttga gaccagcctg accaacatgg agaaaactgt 44461 ttctactaaa aatacaaaac tagccaggtg tggtggtgtg tgcctgtaac cccagctact 44521 cgggaggctg aggcaggaga atcacttgaa cctgggaggc ggaggttgca gtgagccgag 44581 atcgtgtcat tgcactccag cctgggcaac aagagtgaaa ctccgtctca aaaaataaat 44641 aaataaaaat aaacacaaaa attgattatt actatttttt atagagatgg ggtcttacta 44701 tgtttcctag gctggtctca aactcctggt ctcaagctat cctcctgcct cggtctccca 44761 aaatgctggg atcacaggca cgagccactg catccggctt tctcttaatt ttactgtgct 44821 gtaaacatcc aaaatctgca agtctttagt ccagacttca ttagccagct ggggaaaccg 44881 aggccaagaa cagggcatgg caccgtgtcc gaggtggcac agtaagtcaa cagctgtgga 44941 aggaagctca ggcctcacca ctgacaccag ccatcctgcc ccatcctgag gaattaggga 45001 ggggctggag actctgcctc ctgggggtcc agaggaggca ggaagaccac cccaaccctg 45061 tcaactttta ggttttcccg acaattttag gacactaact tatctgtcct tgcccacaac 45121 tccctttgca aaagaaaagc tgatcagctt gcagcagcct cggcaggtcc cagctacctg 45181 gaaggcttcg agggcaaatc tcctccccga gatccgtgcc actgaccaga gacaccatgg 45241 ccactgcagg ggcacctccg ccaggccctg ccagagccag gcagcagctc cagccctgcc 45301 tgaaactctg ctcctttaga caggagcagg aagctcacgc ctagagagca gtggccagtg 45361 ccagcctgcc aggttgtcat gcatctcatg tcatcccacc accagacttc gaggtgacta 45421 ttattgggtg agacaaccag ggaggtgatg tgactggcct gaggtcacag agctcagcct 45481 tggggatccg aggaaccagt catcctctct ctccagcccc tctcagggaa tttctgccaa 45541 ctccatgcca ggcgctgcct cctctctctc attgaagcat cactgtatcc tttgtgtatt 45601 taacaagtat gtattaagca cctaccacag taacaagaag ggttgttatt tctcctcccc 45661 agctgaagaa ttgcaggcta cagagatgcc ggggggttga ggggtgatgc tgagcttgag 45721 ctggaaatct aacttgagtc tgactagctc caggtccttg gctcaccctg gtccccagcc 45781 tctggcttcg ggaccccttc tcctgccaca tcaccctgct caacctggga cccccaggct 45841 ctccccatcc ctccagacct accctgagcc cctgaggctc tctgaccctg ctactctgag 45901 ccctcaggcc tcactgtcta tctgcagcac atggcctctg ttcaatatga gaaaatgtgc 45961 tggttaaacc catatgtttt gggtggcaag acctggctct cagtctcagc tctgtcacct 46021 cccagcatat cctctttgaa cctcagtttt ctcatctgca aactggagat gacgatactg 46081 gtaccttcct caccaggccg ctgctgaatg aacatgacaa ccaccatgcg ttattacatt 46141 gcaggtaaag tgcaaaaact gatgttgata atggtgatga caaagatgca gaagctgatg 46201 accaagaatg ctggactggg agctggggaa ccccttccac tcctgactcc accactaaca 46261 tgctctggtc agaagttcct tccctctgtc tgcaagatgg ggagaccatc atggctccct 46321 ggcaggagag gtgggaaaat gtacaaaacg aggcacccag acagagcttg gagtcaggtg 46381 acagatgggg ggcgctcctg agtctcctta gccccagcca tccctgagcc ctcctgggcc 46441 tagatgggga ggctgactct ccagcaaatg tcctaccaga gggagccagc acagggcagt 46501 ggaggtcagg ggacgcagcc cagggcatag ctcagcgcac ccaggatggg ttctgaaggg 46561 tagaggattt tcctggaaga aatagacaga atttgccggg cacggtggct cacacctgta 46621 atcttagcac tttgggaggc cgaggcaggt ggatcacctg atgtcaggag ttcgagacca 46681 gcctggccaa catggtgaaa ccctgtctct actgaaaata caaaaattag ccaggcgtgg 46741 tggcaagtgc ctgtaatcct agctacctgg gaggccgagg aaggagaatc gcttgaaccc 46801 aggaggcagg ggctgcagtg agccaagatc gtgccactgc actccagtct gggcaacaga 46861 gcgagactct gtctcgggaa aaaaaaaaaa aaagaaatag acagaattaa caccactgtc 46921 ccatttccca gcccttgtgg agcacagagt ctgtccccac tccagctgag gccctcaggt 46981 ctggtttgat tggaggggct ttccactcca acaagtggaa ctttgcaggg aggaagagaa 47041 ctcaagaccc caggcccacg gctctttgag ctgtacccct caaagagtaa ggggagccaa 47101 agacagactg agccacaggg gaaactgtgg cagggtcccc acctgccact gccctctgca 47161 ggccctgcag tcccctgtcc ctgggaggcc tgactccctc ctcttcatga gtctcaggaa 47221 ccccccgcct ggtgcagcag cttcagctct ggagcctcta aacaactttg tgcccatgtg 47281 accttatgcg aggaacttaa cctctctagt ccctgcatgc cccatcttta ggagaagcta 47341 ataatagagc agacttccta ggactgtaat gaggacaaat gggatcatgc acaccacgtg 47401 cccagcccag agtaagtgat cgataaggtg cattttactc aaggctgaac attgctccct 47461 cttgatttcc tcccctcaat ttcagatgca gcatcccaga gcctgcctgg gccccatcca 47521 cagggccctg cccagtttgt cttcattcac tgcactaggg agctggtggg caagaagagc 47581 agcctcggtg ctgttccaca cagagctggg ctggaaccaa gccgagtccc acttggctct 47641 agcaggtcac tggattctct gagcctcggt atctccaatg gtgaaaggag catcgcagtg 47701 cctacctgca gggcctgatg tggattgtcc caaaagcgca tgccacatgg tgggggctca 47761 agaaagctgt ccccctcccc accttcttct ggaccacgaa atcctgtggt acacctagcc 47821 cctccctcac caggcagccc tctgacatcg caaaaattca ggctgagcat ctgagcattg 47881 agcacgcatt tactcagtac tttaccctga tcgattccta tcattctcac aatgatctca 47941 gaggaaatta tcattacccc tgttttagaa gaaaaccaag gcacagagag gttcagcaac 48001 ttgcccaagg tcacacagtt tgtaagtggt aaagccagga tgtgggcctg ggcaatgggc 48061 atcaaagcac gctaacaagg catgctactg tcctgcctcg aagaaatact gtcacacgct 48121 tgcttccacc tggtacacat ccagtcctac tacgtgccag gttctgtgaa gtcagtccta 48181 tggggatgga gaaactggat cacagggcaa gttggagcca ggacccaggt tcctgatttc 48241 aggtccagcc aaacgctgga gaccaagctt tgggctccac aggggcttgg agcagcaagg 48301 ggtgaaccgg ctcattcctc ctactctgca ccccaccggg ccctcccaac cacttggcca 48361 gcgaatgtgg ggcgggggag caggctgaca agcaagctcc ccaaaatgtc cagaggggcc 48421 tttggtgagg gccaaatcac aggcgctggg ggctggagga cccagaggcc acctgtttca 48481 acccacaggt gtgccctttg ctgaattgct gacgccagtc ctctgccctc cctcgagagc 48541 ttcccaccct ccacaaatac ctgtgcccat cttccccaca gggctgtgaa ggccagtgga 48601 tccctgaagc ttctcttttc cagaacaacc gcagctccct cagtggcggc aggtggggct 48661 tcaacacccc actgcctggc tggggcttcc actcagttac tatggactgc acgctagctg 48721 tgggccagac acaggccttg gactcggcca gcttatgcaa gagctggaga gacacactct 48781 ggatggtagg catttgccat cacaggcggc cggatgtgtg tggcgttaag cagtttgaag 48841 gaaaagtaca acatggggtg agtcagaggg cgggcctctg gtttgatggg catctccaaa 48901 ggcctctgtg agaaaatggc agagccaaga cttgggtagt aagtgggagt ctgtccggag 48961 aactgtatat ctttgtggaa gtggggtgta gaatactcca ggaggtcggg cgcagtggct 49021 catgcctgta atcctagcac tttgggaggc cgaagtgggc agataacttg aggtcaggag 49081 ttccagacca gcctggccaa catggtgaaa ccctgtctct actaaaacta caaaaattag 49141 gccaggcatg gtggctcacg cctgtattcc cagcactttg ggaggtcgag gtgggtggat 49201 cacctgaggt caggagtttg agaccagccc agccaacttg gtgaaaccct gtctctacta 49261 aaaatacaaa aattagccag gtgtggtggc aggcgcctgt agtcccagct actagggagg 49321 ctgaggcagg agaattgctt gaacctggga ggggggaggt tgcagtgagc cgagatcatg 49381 ccattgcatt ccagcctggg cgacagagcg atactctgtc tcaaaaaaaa aaaaaaaaaa 49441 aaagccaggt gtggtggctg taatcccagc tacttgggag gctgaggcag gagaattgct 49501 tgaacctggg aggcagaggt tgaggtcgtg ccactgcact ccagcctggg tgacagagca 49561 agactctgtc tcaaaaaaag aaagaaaaag ggcctggtgc gatggctcac gcctgtaatt 49621 ctagcacttt gggaggcgga ggtgggtgga tcatgaagtc aggagataga gaccatcctg 49681 gctaacacgg tgaaaccctg tctctactaa aaatacaaaa agttagccgg gcgtggtggc 49741 aggcgcctgt actcccagct acccgggagg ctgaggcatg agaatcgctt gaatccggga 49801 ggtggaggtt gcaatgagct gagatcgcgc cactgcactc cagcctggca acagagcaag 49861 agtccttctg aaaaaaaaaa agaaaaaaag aaagaaagaa agaaagaaag aatactccaa 49921 aagtctgagt ctggtgttga gaagctgcag gaaggccagg ttgtggctgt gaagtgagcc 49981 tgggagtgag tggaccagga tgaggccaca gaggcggcag atggcaaggg ctgcggatgt 50041 ccccctcagg gctggcgggc tgtcaggctt tgcctggtct ggttttaatt tccatgtgaa 50101 atatttaaga caaataggaa taaggactaa tacaataaag cccttaggcc agacatggtg 50161 gctcacaccc atagtcccag cacgttggga ggcagaggca gaaggatcac gtgaggccgt 50221 gaatccaaga ccatgctggg caacatggta aaaccccatc tctacaaaaa atttaagtta 50281 gctgagtgtg gtggtgcatg cctatagtcc cagctactca ggaggctgag gtgggaggat 50341 ggcttggacc tgtgaggtgc aatcccagca ctttgggagg ccaaggcgga tggatcacga 50401 ggtcaggaga tcaagaccag cctggccaac atggtgaaac tctatctcta ctaaaaatac 50461 aaaaactagc cgggcatggt ggtgggcatc tgtaatctca gctcctcagg aggctgaggc 50521 aagagaatca cttaaacccg ggaggcggag ggtacagtca gccaaggtca caccattgca 50581 ctctagcctg ggcaacagag tgagactccg tctcaaataa ataaaataaa ataaagcctt 50641 gtgcatctac caccaggctt aagaaataca acattccggc cgggcatggt ggttcacgcc 50701 tgtaatcccg gcactttggg aggccaaggc aggcagatca cgaggtcagg agattgagtc 50761 catcctggcg aacacggtga aaccccgtct ctactaaaaa tacaaaaatt agccgggtgt 50821 ggtggtggac acctgtagtc ccagctactc gggaggctga ggcaggagaa tggcatgaac 50881 ccaggagacg gagcttgcag tgagccgaga ttgcgccact gcactccagc ctgggcgaca 50941 gagcgagact ccatctcaaa aaaaaaaaaa aaaaagaaat acaacattcc tagccaggtg 51001 tattggctca cgcctgtaat cccagcactt tgggaggcca aggcaggcag atcacgaggt 51061 caggagattg agaccatctt ggctaacacg gtgaaacccc gtctctacta aaaagtacaa 51121 aaaattagcc gggcatggtg gcgggcacct gcagtcccag ctactcggga ggctgaggca 51181 ggagaatggc gtgaacctgg gagacggagc ttgcagtgag ctgagatcgc gccactgcac 51241 actgcactcc agcctgggtg acagagcgag acttcgtctc aaaaaaaaaa aaaagaaata 51301 caacattcct agccaggtgt attggctcat gcctgtaatt ccaacacttt gggaggctga 51361 ggcgggtgga ttgcctgagc ccaggagttt cagatcagcc tcggcaacat agtgaaactc 51421 tttccctaaa acataaaaag taggctgggc atggtggctc acgcctgtaa tcccagcact 51481 ttgagaggct gacgtgggca aatcacctga ggtcaggagt ttgagactag cctggtcaac 51541 atggtaaaac accgtctcta ttgaaaatgc aaaaattagc caggcatggt ggtgcatgcc 51601 tgtaatccca actactcggg aggctaaggc aggagaatcg cttgaaccca ggaggcagag 51661 gttgcagtga gccaagatct tgccactgca ctccagcctg gacaacagaa tgagacgctg 51721 tctcaaaaga taataataaa taaaaataaa agataaacat tcccagtata atcccagtgg 51781 caccctggcc cttgcctcag ctagcctttg aagtctgcac tgtccagagt gcagtgccag 51841 tcgctgtgat gaggtatatt cggctgggga gaagaccatg atggtagctt ggagctacca 51901 ggatggtggc catggagctc aaaaggagat ggatttggga cacatgcatt ctgggtgaag 51961 agtccgcagg gtttgagggt ggaagggggt tgtcgagagt ggcctgtagt ggtggaagac 52021 tgaaacgggg ccaggtttcg ggagcgacgg tttctagtgt gcagtggctt tgaacatcca 52081 aatgcagatg tctacctgac gtcagatccc gagcctccag agggagagtg ggcagagctg 52141 ggtgcatgag atggtgaacg ctggtgacag gttgagttgt gagggtgggc gagactggct 52201 gggccaaggg catggaaaag gagagggggt cttagacccg gccccagcag ccccacagat 52261 gggagagaag ctgcagagca gatgaggagg accggccaga gagcaggagg agtaggggag 52321 ggaggccagc tcacagacag gagagggagc tctcagcaga gttgcccggt aggccctgcc 52381 cagcctggag cacctgagga ctgaccatgg tagccacaag tccctcctcc acttggcttc 52441 cctccctccc cagccagaaa gggggaaatc tcctcatggg ctgggcagaa agggaggtgc 52501 tggggaagag ccaggagttc tgctgggcgg gggtcaggga agttgcagca gaggtgacat 52561 ggtaacagcc catgtctgat catcatctga ccctcatccc cgcactccag catgggtggg 52621 gctgggggtg ggggtgaggt ggtgtttgct gctgccccat cccattgcct catccttagc 52681 tgtcagcaaa caagactcca ggagttgggg ctgccggctg gtggggggtg tctcccagaa 52741 cctaggggga tgggggctaa tgtgcccctc tcactcacac acaggatttc tctgggatgc 52801 tcggctggcc ccagcaggtg ccccctaaag aagtggcgtg ggagcaggga ggagtgagaa 52861 cagtcagggc tgagttcgag ccccactccc tcacttatcc agtggaggcc actgggtggt 52921 atcgcctgtg acgcctcctg ggccctcctg ggactccagc ctgagactca cacgtgggga 52981 atgagcagcc ttgccttcag cagcaccgcc tggaagcggc ccaaatgccc atccacgaat 53041 ggcacagcac acagacagaa aaagccagga gagagctctg tgaggagcaa tgtaggtctt 53101 caagatgtat ttttaagcaa aagaaggcaa cagtgatgag cggtgagtat agtagctacc 53161 atttgtgtaa aaaaggagag agaagaaatt tatagataca catatttgct tgtctctgct 53221 tgtctgtgga tggagaccca ggaagctggt gacattggct gctccgggga gggaaagcag 53281 gtggctgggg acagggtggg aggaagaact gtcactgtat tcccgtttgt gtctttttaa 53341 attttagaac cgtatgaatg tattacctgt gggaaaatgc acatgaaaag ttttaaaatt 53401 aatgtgtaca catgcatatt ttccagagaa ggcattcata aggttgtgtg gccgggcgcg 53461 gtggctcacg cctgtaatcc cagcactttg ggaggccgag gcgggcggat cacgaggtca 53521 ggagatcgag accatcctgg ctaacatggt gaaaccccat ctctattaaa attcaaaaaa 53581 ttagccaggc gtggtggcgg gcacctgtag tcccagctac tcgggaggct gaggaaggag 53641 aatggtgtga acccggtaga cggagcttgc agtgagctga gattgcacca ctgcactcca 53701 gcctgggtga cagagcaaga ctccatcaaa aaaaaaaaaa aaaaaggact gtcaccccga 53761 aagtctgaga acacttcacc tatgggctgc gtggctttgg gaaaatttct taacctctct 53821 gagtcacatc tgtaaaatgg gaaggaataa cacccagtga acaggcatgc tgccaaggcg 53881 aacagtgtgg gcaggggttt gtaaactgta aagtgcagaa gcctttcttt ctttttttat 53941 tttttgaggc agggtctcgc tgtcacccag gctggagtgc agtggcacta tcacagctca 54001 ctgtagcctt gacttcccag gctcaagcga tcctcccacc ccagcctctc gaatagctgg 54061 gacttgaggt gtgtgctgcc atccctggct tattttttaa ttttttgtag agacagggcc 54121 tccctatgtt acccaggctg atcttaaact cctgggctca agtgatcctc ctgtctcagc 54181 ctctcacagt gttggcatga gccaccgtgc ccggcctgca gcccttcttg aagggcatga 54241 cttactctcc actcttcctc acctcctagt ctcccctcag ccccctccag agctcctctg 54301 tgcccccttc cctctcagca gctctgatgg aggctctgga ctggccctcg ttttcctcct 54361 tgagtcctcc aggcctgagg gtccctctcc aggtcgcccc ttgctctcct gcctctggcc 54421 tctccatgac ccacaccgtg cagtgcccag ctccgccatc ccacctccct tctcttctcc 54481 ccagtcctgt gatctcatcc agtcctgtgg cttaaaatac cacctagcct gcgcctccct 54541 cccaggcagc tacaagccca cttggctcct gctccctgtg tctgctggac agtgaagagg 54601 catcgcgaac ccggcactgc caagcccaag ccatgccccc gtcctcccct ctcctgcctt 54661 ccccatctca gtctgtgact cctcccagga ggtcgctcag gcctgacacc taggcatcat 54721 cctccagccc tccctaagcc caaccgcaat aacaatgggt taataactaa catgcactta 54781 gtgttcactg agggccatgc actgactggt ccacacctgc aaggcaggtg ctgtcatcat 54841 catcccacag tgggcttggg agcttgccca aagcgggtga acctcggggc cccggacacg 54901 tcccctcacc tagtcatcca ccccaaagag ccacatcact attgcagccc acagtctctg 54961 cccaccacct cctcccacag cctccaccca cctcccgcag tccatctcca catcactccc 55021 tgctcagagc tctcgtctcc ccagggcact cgggatgaac gccccgcctc agagaggtcc 55081 tgccctctct gactgcaccc cctccactct ccaaccacgg gaagagacct tccttcccgt 55141 ccctttaacg caccaagttt attcctgtct ctgggccttt gcactggctg ttctttctgg 55201 ttgaaattct cttctccaag actccaccgc ttgtgccttc ctaccattca ggtcttgatt 55261 caaaggctgc ctgttccaag aggccccgag gccctccctc ccatccagct ctcagatctt 55321 cctggcagcc cttagcagtc aaaccagtct catctgcaac atcattacag gcaggaacct 55381 tgcctcccta cttcaccccg actcccagag cctggcatag agtagctgct tcataaatat 55441 ctaacgactg acaaacaaga taacccagcc cgtcaaaatt cccggaaagg gtctaactgg 55501 gtcatgtgac acattgtgac caaggagaag ctatttttga cacggtagtc agggagaact 55561 tcccagcgga ggtggcctct gagcagtgac caggagaccc tctgtgtcac tgtacatgag 55621 cacaatggcc atgagcagag gggacagagg acaggatgtc agagggctca tgcctctgcc 55681 tcaccctcca cggagcatgg gcttctcttc ccaggtgtcc cccaggtgtc aggggctcgg 55741 gggaggatgt cctggcctca cccagggatg ggtggcaggg acatgtgtac aggacctagt 55801 gggagagggc atgggagaca ggcctcagaa ggaccagaga cagcaggagg aacagtcagc 55861 ccctgcccat gactcaatgc tgggccacag ccccaaggga ctgtcactga cagaacgcct 55921 gccatggggg gggccacggg ctgcatgccc ctctttactg ctgaccacct tcccactggc 55981 tcctcaggcc agagaatctc caactccagc cgacccccat ccatccatcc attcattcat 56041 caccctaaat gtcccctgag ctggaacagc atctgccttc ctgccaggtg ccagatgtgt 56101 ctcaacacag cgcagtgcct tgccttggag ggcggtctgt aaataattga gtgaacattt 56161 attgagcacc tactgtatgc tgaagggaag caggtaagtc acaaaccggc tttgctgagc 56221 tccgtgcacg catgtgtgcg tgcatgcgtg tgtgtgcgtg tgtgtgtgtg aaagggcata 56281 aagagaaggt cctaggaaga tgagggcaag caggagggca caaggacaga gcagggctgg 56341 gcctgggctg agcggaaagc agggaatttg gcccggcctg ggatctaggc cctggggaga 56401 ggtggtggct ctggaggagc atgggtgcct ggacctgggt ggtccactga gcaagggaca 56461 ggcggctact tctgagggca ggacagcagc atcctggaag ttgtggcacc tatgtcttta 56521 ggaaccaaac atccagggaa atcggaacct gagcatcaga acctctatca gggcgcctcc 56581 tggatttcta cacaggacag tagctttcaa gttttttgac caaaatccgt agtaaaagca 56641 aacatcctgt ttgttatttt aacttacttt atatttattt tttagagaca gagtcttgct 56701 ccaatgccca ggctggagtg cagtgatgcg atcacagctc aattgcaacc tcctgggttc 56761 ttgccatcct cccgcctcgg cctcccaacg tgctgggatt acaggcgtga gccactgcac 56821 ccggctatat atttaaagct gaagtaagag cgtcacaaaa taatatttac cctcactaca 56881 tgcatagtat tttgaaattt tctttttaat ttcattttcg gaaaacaagt agtgcaactc 56941 actaagtcaa tttcccaacc cactaacggg gcaggacctg tgtgcaattt taaagcactg 57001 gcctgagtgg gggttaggag cggtgaaatg ggatccggcg tgtgtcttga agctgtgacg 57061 ggcggtgctt ggactccggg ctttgtgaca cacaaccatg tggggaattg ttggatgtta 57121 tgggggaata ctgggaaaag tctgttagac cccccagctc acaggaccgc cactctgcag 57181 cccaaataac agggactctt tggggtctgg ttccaatttt ttccccgggt cttttaatag 57241 agccccctcc tgccctgata acagatgcca ttcagcagtt gcttacacat ctcccaggac 57301 agggcgctca cccacttctc aggcagccca cttgactgtg aggagtaagc tctgccgggc 57361 tgctgatgta gctgctaagt ggggccccgc ccgtggctgg ctggccattc tcctctgtcc 57421 tctagcctgt tgtctcaggg agccgggaca gctctggcaa gtggtcctct ctctgcccca 57481 cccagttctc tctctctgct gcccagccac tgccgggtgt ctgttggggg cagccttgga 57541 gcccagccag agagaggccc tttccaaggt caggtgggct caggatctga agcctcatgg 57601 cggggctggc actcagccag ccttgcccac caaagcagtt ctgggggcct gggtgaggga 57661 gggccggggg aggctcagct gtccagagaa gtggtacccc aaggcagtgg ctggtgatgg 57721 gggtctggaa atcaggccca gcagtgccct tgagtggttg ggtcacaaag tagttcatat 57781 caacgagcta tgatacattc attcactcac aggctcacag gatcttccca gtcaccctca 57841 gaaacagatg caaagggcca ggcacggtgg ctcataccta taatcccacc actttgggag 57901 gctgaggtgg atggatcaca tgagggtcag gagttcgaga ccagcctggc caacatagtg 57961 aaaccccatc tctactaaaa atacaaaaat taaccgagca tggtggtgcg cacctgtagt 58021 cctagctact tgggaggctg aggccagaga attgcttgaa ctgggaggcg gaggttgcag 58081 taagctgaga tcgcgccact gctctccagc ttgggcaaca gagcgagact ccatctcaaa 58141 aaaaaaaaaa aaagaaacag aggcaaacgc ccacccttac ggggcagggg gtaaaggctc 58201 agagaggtga aaaacctgcc ccagattccg cagctgctaa ggggtggagc tggacatgaa 58261 cccagatctt ctgtttccca gtcacgggcc tcgcctggga caactgaggc tggaggagcc 58321 ttggggtagg ggagggggag gtgggccact cctcctgtct tgctgagagc aggctgggtt 58381 tctctgcccc agagggcagg gcgaaggaaa gagggagtca gggatcaatt agcagaacct 58441 tccagacacg agtgggcttc cccggagatg gggagctccc cgtgaggaac agtgaggagg 58501 gacttgaggc ccatttctca ggataactgc agaagggtat ccatgggcac gaggtttgaa 58561 ggatgcccaa agcccctctg cctgctccat ttcaccccca gccagggcct ctccagcctg 58621 tgcccaccac ctcccctgta tggccccaag gagcccctaa ttgcggaccc catcccattt 58681 gtctcccagg ctgagtggta ccttcacact cacagggcag agggtcctgg gcttctcagg 58741 cctatgagga gaggggaggc tcctggggga gtctagggtg ggggctggga gaggagctgc 58801 agggggttgg gggaagcttc actgaggtgt cccaggggcc aactctgcca ccaacttgaa 58861 gatgggcaca cccctcccct aggcctgaat ctccccatca gtaaaatggg gtcagctccc 58921 cacagcacat gcaccctctc ctgtggccca gctaaagaac cctcccgtcc gggtgcagtg 58981 gctcacccct gtaatcccag cactttggga ggctgaggca ggcggatcac aaggtcagga 59041 gttcaagacc agcctgacca acatggtgaa acttcgtctc tactaaaaat acaaaaatta 59101 gccagacgtg gtggcacacg cctgtagtcc cagctactcg ggaggctgag gcggaagaat 59161 cacttgaatc tgggaggggg aggttgcagt gagccgagat tgcaccactg cactccagcc 59221 tgagagacac agtgagactc tcaaaaaaaa aaaaaaagaa aagaaaagaa aagaaaaaag 59281 tagccgggca tggtggcgca tgcctgtagt cccagctact tggaaggctg aggcaggaga 59341 atcgcttgaa cccgggaggc agaggttgca ctgagctgag attgtgccat tgcattccag 59401 cctgggtgac aagagcgaaa ctccgtctta aaaaaaaaaa aaaaaaaaag gaaccctccc 59461 atccctgcag cccgggtcgc tggtgtgtga gcttctccca ggacaggcaa aacaggaaaa 59521 ggctgagcac gccccaccat ccccatgggt ggcccccgat acacagaggg gcaaagagag 59581 gcccccagat ggacagtggc ctagtcaagg tcatgcggag ccagtgaggg gctgagctgg 59641 gattcgatcc caggcccctg acgcaggcca cccttcctac tgctgtgacc agggagtgct 59701 gcccttcccc tcgctccgat ctggggattt tctggggaga aggagcttaa agagacccag 59761 accctgctcg cctcccacag accccacagc tctggggagt gtcagtgaga gaggaggggc 59821 cggcagccct gggagccatc aggacacaga ggagtcagga gcagctccgg gttgagaggc 59881 tctgccacca ttgctgagtg accctgggca aaccctggtg gtctccggac tcggtttctc 59941 catctcccag gagaggaaat ggagagaagg ccaatgcagc cgcagacagg agctgctggg 60001 ccaggcttgg tgggggcttc tcctccatcc cagtcccagg gagctcctcc gcccaagtgc 60061 acctcccatg ccccagctcc cctctgccat gctgggggcc ccccaggcct agaccaaagc 60121 tcccagctga caggtcacca ttacaacgct gacttctccc tccacagctc ctgctggggt 60181 cgggaacacc aaggaccccc tttcctggta gcccctaagg acctcttgac ctgccaatgt 60241 ctcatctggg gcacctcacc ctcccctgcc tgccaaggcc agacgtccca gctgttggca 60301 ccagggctgg tcagggactg gccagcagcc acagggagga gtggctgagc tgttgcatgg 60361 aaggaaaggg gaggcctagg ctggcagaca tgtacccacc tcccttcagg tggccaatga 60421 gggtcctgga ggggctggca ctgggtgcac agtaggacaa agcatctgtg gctgtttcca 60481 cccaggagcc tggaaccaat ccttgcttct ctctgagcct cagtattctc atctgagaag 60541 tgggggtgat ggttctccca gggtggtgga ggattaagtg aaatcatgaa tatgaagccc 60601 agcgctgagc ctggcactca gcttccctcc tgcagcaata ccctgagaac actgctctca 60661 ccccagccac caagcaacgg caccagccat ctgagcagcc ctcacgcatg cgcatgcgtg 60721 tgtgcctgtg tctgtgcctg agcatgcacg cgcaggctga aagggggctg cgagcaggtg 60781 gggcaacggg tgcagaggga ggggcagggg ccacacgcca gcactcctcc ccaccggttt 60841 cccaccctgt gtgtcccaac agagaacagt ggtcttagaa tcaggcaaaa cccatccatt 60901 caaatccagc ctgtttccag ccagccactt ccccttgctg agcctcagtt tccccttctg 60961 taaatgagaa tcatctatgt ccaaagtctt ccaggttcac gacagggacc ccgtgttggg 61021 ccacggggcc aggattggtc aagacctttg ggagtggggg gccaagacca gagcaagcca 61081 gagcgtgagc tcacccctgc tccccatgct gcaccccacc caccaccccg tgagcgcccg 61141 tgtgcgcgcc tgcaggagcg cgcgtgtgtg actcaaggtc tattcatagc cgagctccac 61201 cagctgcagc gtcacaagca gaggggcctc agcccatatt attatttcaa ttaccatctg 61261 aattaaaagt ttgagcagcg atcagcccac gtagcgaggg gaagagggaa caggagaaaa 61321 ataaagatga aaaggagaag aagggctgtg actggcgatc ccccacaccc caaggagggc 61381 tgagtccgca gaggaggctt aaaggttatg ccaaccccta ggattcccgc ctgccccttg 61441 ccaggaccca cagtcctccc atcgttggga ggagcgcttt tcgacagcag cccaaccaca 61501 gctactcggt ggaaatagat ctttcagccc aggttcacat ggatgggtgg acatcactcg 61561 gagagggtga gggactgacc taaggtcaca cagagagcca cccaaggaag ttgtctccaa 61621 gaccctctcc tgccagccag gtcccactga cctcaccagg ctctgtagca catgtgccat 61681 ttcccttctg ctcccctctc cctgcgccga taactttcct cctgcagccc tagcttggag 61741 ccagccctct ctcagccctc tgcattcccg acctcacacc tcatgccacc accgtcagcc 61801 caaaggcccc tcctcactca ctggcacagg ggacagagcc ctgggccagg agacaggcag 61861 cctggcccta atctaactct ctgggcacct tcgggcaggt cattccttgt ctctgggcct 61921 cagtcttccc atctatgaaa tgagttctcc cctaagatgc atggtggtat ctgtcttcag 61981 aatggagccc ccagaggagc cagaggatcc ctgagggggt gtcacgctgt gggggcgtct 62041 agaccatctc ctgccaagcc ccagccccag ggaggaacgt tttccactct ttccccaggg 62101 aagccccaga gacagccccc gagaatcaga ctagtctgaa ctcacaccca ggctcctctg 62161 ctgtcttact gtgtgaccct ggacaaatca catagattct ctgagccatg gcccttccat 62221 ctataacacg gggacatcaa atcaacattg ccatggagaa ttaaatggga atttgcaaaa 62281 gctcctagcc cagtgcctgg cgagcaaatg tgtttcctcg ctgggaagcg gggaagcgat 62341 gggcactgct tcccagctcc tctttgggaa acaccaaaga ctcctaggat tttggaagtg 62401 gggatgttat tcggcctcac ccgtaaggcg agagtcacct ggcaggtgct catctggctc 62461 tgcttgaata cctctaggat tgaggagccc actatctcac aaaacaacca cagctgctcc 62521 ctatcttgct ccacctccag ctctgggaat tttctagaat ggagcataag cttccagtta 62581 accccaaatc caacctgtcc ttgcctggga ctgcctgaaa ctggctttaa aatgagactg 62641 aaaggagttc ttcaaagaga ccaatagagg gaggggatta gtgcccaagg acagtggggt 62701 gaggggatct ggagagaagg gagggagaaa ggaggtggcc gaattcagag ctgggccctc 62761 aaagctcctg ggtcctgtct gggcttgcca cagtgttaca cagggacatc tgtctgtctg 62821 cttccagata gagctgggag ccagaagccc tggattcagg ccctggccct gcttcctcca 62881 cccccaaacc atcactgtga ccctcatggg tccacttccc cctcaggcct cagtttccac 62941 ttttgtaaaa caatggccct ggctctgctt cctccgccca caaaccatca ctgtgaccct 63001 gatgggtcta cttccccctc aggcctcagt ttccactttt gtaaaacgaa ggcatcagac 63061 cagtgattta agctccaaac cttctgtctg aatccaggca ccctcccact ctaccctgtc 63121 caacccctgc ccagcatgct ggatccacgg gcccctgagt gtctatggtt tcagagtcag 63181 aggccttatc tctactcctg tctcctgctt tcatagctgt tgcctttgag gattctgtca 63241 tccccatttt acaggtgggg agatggaggc atgaggcatt gaagccaatt gctcaaggtc 63301 acagagggtg atttgaaatc cagggctccc tcctctgggc ccaggagagc cgcagaactt 63361 aagggcacct tctttcagga gaatcgccaa gatcttaagt ggccagccgt tctctataca 63421 gatgtggact ccacatgagg caaggggctg cccaaggtca cagagcaagg cactgacaga 63481 ccagggtcct gaaccaagcc tccctgcccc caggctccac ccagacaggg ggctggcaac 63541 atgacctgac tttctgccct ggtcctggga tctccccaag actggagccc acgtcctgcc 63601 tctagccgct gcccccagaa gcccccgagc atctagggct gagatgggaa gagtgagcag 63661 gaagacagga aagaaagttg agggctggcc acttgccccc ggccctctgg ccctccccta 63721 ctgctctcct cgtgcagggg gagcggcggc ggtggtgatg taatgcactc ggctttcgtc 63781 ccggtgccaa tctcagaaga gagacgacag attgcgtgag ccaaatttgg catcctaggg 63841 ggagaggcat gaaaagtgga gatgtggagg agcagagagg agcctggggt ggtgcaggag 63901 gagccccctg ccctacagcc ccgaggtgcc tcccctccct cccctcaggg cccccatctc 63961 ttccccagct agcctggccc tggggcctcc cctcctcact ctgaggggcc agcctggctt 64021 ctccccggac cccttgagcc cggcactgcc aggcctaccc gccaccccca ccccacactg 64081 ctccagctct gtttcccacc gccgctagtt tcctggttac catgttgccg gttgtcatgt 64141 cgcttacatg tcctcgtgac tcagtggtcc ccgcagaggg gccccctcct cctcacaccc 64201 cccttagcac tcgcccccgc ctcccccaga tccacattaa gaaccagccc gccctgtaag 64261 cctacactgt tcctcctccc cttccttctt atcctaagct aagacatctt ctcacagcat 64321 ctcccagagc tacttcattc tctctttcgc aaatgtctac gaacctacta tgctccaggc 64381 tggcgcacgc ggctttcaca gacctgacac agtctggaaa ctgctcaaca ccctgatgaa 64441 gtcagtatca tcatacctat tctccagttg aagaaactga ggttcagaga ggtcaagcca 64501 cccacccaag gtcacccagc agaaactcaa ccagctctgc ctggtcacag ccactccaga 64561 ggattttctg agagaagagc tttcttttca aggataaccc aaagggccat gaggaattgt 64621 accccagatt ccccctctaa tcctggacac tcctccttcg tcagctattt ccgccatatt 64681 cccactgccc cacgcccacc tccacccact agctgttcat ctctcttaag gtaaagtgag 64741 tctttactcc catctcccca gtgctttccc cagcctgtga acagcccact gctcattgag 64801 cccagggccc gccttggagg tcactggggc agggagtgac gtccccatgc cacagatgtg 64861 gaagatgacg gttgggagga gagtgaccag cccagggggc tgctctccca gccacctgag 64921 ctaatttctg ggaggctgaa tggaggacat atgagaccca ctgcccagtg tggaacaccc 64981 cagaaagagt ccctcctccc agtgagaatg gagtggaatg ttctaggctc agcccttcca 65041 actccagttg ggatatgggt cagttcagtg gccaacagtc tctgttccca ccatgctgca 65101 cccatctctt accccagctg ccctggtcca ccttttgctc ccctcagcac aggccaagcc 65161 agggccacag tgaggctctt gggcccctaa agtcctgcta actgccccca aaaggcctga 65221 agccatggag gtcatgggac tggatgtggg ggagaacaca cacacacacg gctccaaccc 65281 ttctctgggc ctccctccct gagttcccca cttcccccaa acctgctgct gaacattcta 65341 ctgctcttga ggccctgctc tgtcagctgg ccaggctgcc ctgcctctag cccgccgctc 65401 caccccaccc agcttgccac ctcccaactc cctgcatcct gtccatcctg aagcagggac 65461 actgggggca gaaatgtggg tagggggcag ggaggctgac actggtgggg ggacatttgg 65521 caggttgcag gggaagttct ccctgcggga ggctcattga gacctctctt gccttggggt 65581 gctccctaca cacggaggaa aaagcagcct ctgcctccct ccccagcctg tgtctatcac 65641 tgtgaacgtg cagtgcctcc tctaccaatc cctgcccctg cgccctgggg tcgtgtggcg 65701 ccaagtaggt gctcaaaaca tgagaagtga gatttcgttc ttcagatcac agggccattt 65761 gcaacgagga tgcagctggg gaggcggaaa ggcggcatcc cccaccttgc cacaagtcac 65821 ctgccttgaa gctccccggc atgacaactg gggtgggtgg aggcagggaa aggtcatggc 65881 tggggcagga ggtcaagacc ccttccaccc acctctctcc ccctttccag gccacaaccc 65941 agaccgcgac gctcccagac ggacagacgg acagagcccg cccgtggagc ccgcccgctc 66001 ccggggagtc acgggaccgc gcgcgcggcg ccgggcagcc gggacccggc tccggatcag 66061 gtgcggagcg ggcggggccg gagcggcggg cccgggacac acagacacgc acgcggcggg 66121 acaaactgac ggacagcgcc cggggtgggc gagcgcggct taccgctggg cggagggtcc 66181 gacgagagtc cgggaggcgg cgagcgggcg ctacatccca aggcgcgatc cggcgggcgg 66241 cgtcccgggc gcggaaaggg gagtcccggc gccccctgcc cgcgaactcg gccggccggc 66301 ccgcgctccc ctccccggct gccgctcggt ctgcgcgtgg gtcttggggt ctccgcgcgt 66361 cccggccgtc ccgcgccgct ccccgcgggc cgcccaactt tcagcgctgc cccgcgccgc 66421 gccctcggcg ccgcctcggc gtcagcatcg tcgggcccgg ccgggctcac agcgcggggg 66481 tccgggggcc cggggcgccc cgccgcccgc gccccccacc cccgctgcag gcgccctccc 66541 cgccgccgtc ccccgctgcg cccagctccc tcgctgtcac ccgctcctgc tcgcgcgctc 66601 tgcccccggc cctgcgctcc actcccggcc ccgctgcgct cggccccggc cccgcgaccc 66661 ctcggcctcc cggcctgggg tcgccctccc cgcccccgcc ggccgggctc tgtcgccacc 66721 tccccgaccg ctctcgcgtc tctctcctgc cgccggggtc tccgccgcgc tcgcgtcccg 66781 gtccctctcc cggcctccct gcctcctgcg gccccgctcg ccctctgccc gcggggtctc 66841 cgtctccgtc gcggtcgccc gccggcctcc ccccgcctcc ctctgcccgc gctgcctctc 66901 tcccctgcgc gtctctgtcc gtctccctcg cggcccctgt ccttctctgt ctctccccct 66961 cccactctcc gtgcggcccc ctctgtctcg gaccccggcc cccgccttgg ccgcgtcccc 67021 ggctcgcccg ggagcccgag ctcccgccgc ccgcggcgcg ccgagaactc ccaactcgcg 67081 gcacgccgcc ccgctttgcg agactttcgc ccctcctcac acgtgacccg cgccccactg 67141 cggaggccgc gcgcccccca gcccgggagc cccggctgcc cgcgggaccc cgccctgccc 67201 gcccctcggc ctaacccggc ctcaagcccc ctcccgcggg gcccgcggcg cggagcaggg 67261 gctggggcgg ggggctgctg gccccgctcc tgccctggat cctgcccggg ttttcctcct 67321 cgccccagga gcgctcctcc cagaacccgt tcctggatcc gctcccgggt tttgtgcccc 67381 ccccccaaaa gggggcaggt ggagagggtt cctccgagcc gggaaggggg gtgggtggga 67441 gctggtccct ctggggtccc acggagtccg cccctctcct tagcggtcct cgtactctcc 67501 caaagtaccg ccggtcaagg acccgaagca cctcactctg ggagtctgag ggggccctat 67561 tggagggacc ctcgcgatca ctcagtcacc actgccccat tttacagatg gcgagactgg 67621 gtaccagaga gggtcagaaa ctcggccaag gtcacccaga gatgacaaag ccaggccaaa 67681 ccctagtctc ttattctctg tcgctctcca ttaccccaag ctgtcaggcc tgggccccac 67741 actctggggt gtcagggaac ccccagcacc cacagcacct gtctagccag ctacatcctg 67801 accatcctgc cacgtcactc tagccctccg aggagacagg agggaaagtg ctcctccttc 67861 cagggtggta ggacgagaga acacctgcag ttagacagcc agcaaaactt ccctgggagt 67921 ctggccatca ccgactagcc ccattacaga gagggagaga ccgaggcacc ccaaagccag 67981 gtggagaact gatgaaatct gaaatcccct ctggctctcc tgagcctcga ggatgtcagc 68041 ataagaacag caacacacaa accaccacaa gtgtttccct tcttcgtggg ctctccctac 68101 tcaaaaggat tcagtcccaa ctcctcagtg tggtatagtc aggccccacc ttatccggca 68161 tcccacatcc cataaatcca caggtacacc ctgattccac agattccaga acacgccaca 68221 ggcctccaca cctctctgcc tgcacgcact gttcctcctc actaagatat ccttccacct 68281 tttttgccca gagaattcct gctgttcccc aagcccagcc tcagtgccac ttccttgaga 68341 aggcctcctg atgaccccag cagaagtcat ttatcaccca ctcattcatt caaccagcac 68401 taggcaccca ccacatacca gcccctgggc ggggcacaag tatggttctt gcttagtggg 68461 atgaggagtc aggctaacct aggctcaaaa gcacctagga gctacttgac cttgaacaag 68521 tttttattta atctgtctgt aaattgagga ccctaacgtc ctccattccc aaggggtaga 68581 ttaagtgagg tggatgcaca agaaatacac agatcatggt aggcgctcaa taaatgtggc 68641 catcctttct cacggctctc tccgactgtg cagacctcca gaacgcacgt cctgttccac 68701 gcctctccct ggttaccctg ctgtgtcctc acaggcagac agccaggact gatgttccat 68761 cacatgaatg ctcagtgcgt ggatgcaggg gagactcaag cagcctgagt gccctgtggg 68821 caagagatgt gtgcctggtc tttttcacct tttctgtgga tggaaagtgg gaggatgagg 68881 gtgagacaaa tgacctgcct ggccacgggc tgacctggta gccatcctgg gagtgtgggg 68941 tgcagggaga ggaggaaggg gcaggaagga ggggaaaatg cagacagaga tgtctgtaga 69001 tccaaagcat ctactgtcac agaatcagtc tcctgatgcc aagtccctgg gcacaaactt 69061 gaagatggga tgggggctct ggggcaagga gggttagctt gccgggcctg ggccgcaccc 69121 tggggctgct gttcatcctg caaggagact ggactgggag gaggtaggga gaggcaagca 69181 gaggagctcc aggcagggag aggttcggag gccccaggaa agagcccttt ctctcctctc 69241 cagccctgtg tgaccctggg cagctcaact ctcctctctg ggcctttgtt tcctcatatg 69301 taaaatgaga actataatac ccactttgag cttcggggtg tgtgtgtgtg tgtgtgtgtg 69361 tgtgtgtgtg tgtgtgtgtg tgtttgttgt tgttgctgtt tttagatagg atctggctct 69421 gttgcccagg ctggagtgca gtggcgcaat ctcagctcac tgcaacctcc gcctcctggg 69481 cttaagccat cctcctacct cagcctcccg agtagctggg accacaggca tgtgccacca 69541 tgccaggcta atttttgtat tttttgtaga gatggggttt tgtcatgttg tccaggctag 69601 tctcaaactc ttgagttcaa gcgatccacc ttccttggcc tcccaaagtg ctaggattac 69661 aggtgtgaac caccgcacgt ggtcgagcag ctgttttgag gataatcaaa gactaaagga 69721 agacttcaca ttgttcacct tccgtgtgag tagcctctag ccctgtgtga ggttgggggc 69781 agagatggaa atgggcatgc agaggcctgg gggcagtgtg agggagttga gaaaagtctt 69841 gggtgtcaga acctgtgtga ccctgggcaa gatgcgtgtc ctctctggac ctcagaggcc 69901 tcgtctgtag tgggacagtg tggactaggt ctcccaaggg taagtgggct ttgtcatgat 69961 ccaccacaga tgcgcccccc caccagacgg aatgaggaca acctgcttcc cctgcactcg 70021 tacccctagc tcagcactat ccagaccttc gcagggattg tctcctttga ttcttgcaag 70081 aattctccag gcacctgtta ccaatgggga aactgaggca cgggcagtaa tgccatgagt 70141 ggcagggccg gtgtggccag gaacaatggt ctggaggaag gaggttcgct ctgaagtttg 70201 gaggagaatg cttgtaaaac ctctgtccac catacacttt ggtaaaacgt cctggccatc 70261 tccattctcc attctctcct ttgatcctca caagatctcc gtggggtacg ggctatgcct 70321 gtctttacct cacaggtgag gactctgaac tgacacacag agaggggaag tggcttgtct 70381 ggggtcaggc agtaagcaac tgtctgggcc aggatttgaa ccaggctcaa cttccacttt 70441 ctggggaaag gtcattccag gctagcagag cactcactaa ccgatttact gtggggtcct 70501 gctcacacct cctgggtcct acaggggccc tgagagcctg tgtagagggc tgagaggagt 70561 ggaagggagg gcaacttggg ccaggcaggt ggacagctgg ggaggagacg gaaaaggcca 70621 cttcaaaatg ctagagtcac agaggccaga ggtttttgtc ctagagagtg tcttaaagtt 70681 cagccagggc cgggcgcggt ggctcacgcc tgtaatccca gcactttgtg aggccgaggc 70741 gggcagatca cttgaggtca ggtgtttgag gccagcctgg ctaacatggt gaaacctcga 70801 ctctactaaa aatacaaaaa ttagccaggc actgtggtgc gcatctgtaa tccaagctac 70861 tcgggaggcg aaggcaggag aatcacttga acccgggagg cagaggttgc agtgagctga 70921 gatcgcgcca ctgcactcca gcctgggcta cagagcaaga ctccatctca aaaaaaaaaa 70981 aaaaaaaaaa aaaaaaggtc cagccagccc attctgcaga tgaggcaaac tgaggtagcc 71041 cagatactta gattagaaat cagatcagga cacagctggc ctgcctcgga gctctaaagt 71101 cccaggagct tccagggggt gaggtccaga ggtgattaaa gggacactgg ggggccagag 71161 gcccagaacc catcccagca ggctgtgccc tggggacaga ggactgtagg ggtctctcct 71221 gccctccacc tccaccctcg cccacccccc agcactcagt agtcaccagg gtggaatatt 71281 ttagcaccta tagaaaaacc gccacagacg gccgtggcag ctgttgctaa ggagacgctc 71341 ccaaaatagc ccccctccgc caacctttcc cgctgctgct cctgctgtgg ggaggggagg 71401 agaccaggca gggagggggt ggcacatgag tgttcgccct aggctcaccc acccaccctg 71461 gcccctggtg cccaccctgg cccaagacct gggcagccac aggtctgtgg aagggcagga 71521 gctgtcccat gcatgcctgt gatctcagtc acagtcctca gctagggaag aaggtcactc 71581 aggtgatagg agagggggga gcagaacccc cgcacccaag accccccagc acactcccac 71641 acatacacaa ttacaggtta tcacagttct ccacatccac tgtacacacc acaaatacag 71701 tcccacgcat gcccaccgag gggtacacac acatcagcaa agctgcatca gatgcacggg 71761 taaacgtctg ccacaactac acaacatcca cctgcacaca cacaccatgg agccgcaaag 71821 acatgctttt atatccacat attcacacac ccatcccaaa tacacacgta caccaatata 71881 tgcatgccag gcacacggac acacacacac accagtgtac atataccctc acagcacatg 71941 gacacacata ttcacagata cacacaactg tccaaacaca tgcccagcct gtggcatcta 72001 aagggcaatc agttgtgtta cccagtagac tctgtctggg gtgctgtgtc ttagggtttc 72061 cagaaatcct tacttagtgg ggaacttcta caaccacagg gcagagggta aggaggaggt 72121 ggcaggcagg tggatgaagg gttgtgtctc tctgaacccc tttaagtctg gggactagat 72181 gatgactggt gctcaggaga gctgccctgg acccacttct cgaaggggaa agtggagtag 72241 actctcaccc gcagacttga aacttctggg cacagagatt tctgagcccc aggggctggg 72301 gcgataaaga ggacaagatg gcaattgggt ggcaactgtg atggtctgtg gcatgcgcaa 72361 ggcacgttcc agttggtgag ccgctttcac actccattat ttcacacgct tggtaacagc 72421 tactctgaga tactggtgga ggtgtcccca tgtctgagtc tcagagagga gagagaactc 72481 cacgaccaca cagtaagtta ctagcagagc tggacccata acccaagggg gtgccagcac 72541 ccagccccag tgcctggcac ttggctgcac tgcacgaact tggaaggagg ccaccgggtg 72601 cagggcctag ctctgcccct gatgagcccc gagttactgg gggggcagcc aggccctcca 72661 tcttcccacc tggaaagctg agaatgaatc cccaggggtg ttttcaagca gcggcttgtg 72721 gccaccgtca gggctgggga gtggtgtttg ccggaagtcc tcggaatgcc agcgcaagga 72781 cgaagggtgg cccagcctct cccctcctcg tttggccagc cgcggtctct ctccagcctc 72841 gctctgtgac tcctgaatcc gggctccggg tcttcgttcg cgcctcgtca ctctggagac 72901 gctgcgagcg gcttgcggtt gccgtaggaa cctcgccagg gcctccgcga ggcgcacgac 72961 tccgcgcgct gagagccgcg ttttggtcac cgagagacac ctgtcgccag cggcgagggg 73021 tgggggcgtc gcggcccggg ttattatgct tttcttttta attgaggtgt aacttacaaa 73081 tagtacagcg aacatcttaa aagtacagct ctgcggggcg cggtggcgca cgcctgtaat 73141 ccagcacttt gggaggccga ggcgggaggc tcacttgagc ccaggagttc gagaccagcc 73201 tgggcaacat agcgagaccc tgtcttttag aaaaaaataa cagtgtaatg gcgtgcacct 73261 gtagtcccag ctactcggga ggccgagccg ggagaatggc ttgagcccag gagttcgagg 73321 ctgcagtgag ctatcattcc tgtcactgca ctcctcctcc agcctgagca acaaggcgag 73381 accctgtctc ttaaaaagaa aaaaaaaggt acagctcgat taatttttca gacatactca 73441 taaaactcgt gttaccacca ccatgatcaa gacacagaag acttccaggc cctagaagtc 73501 tcacatcttc aggaataaac cagtgaggga gaggtcacca tctgaaaccc tggagttgtc 73561 tgtttttgcc cctctctgct ctggcagacc ctctttgggc ctcagttgcc tgtgaaggga 73621 agcaaggagc tgaggtgacg tgcattgcgt ggtttatatc ttgcaaatcg ctttagatat 73681 ttagctaact ttaacccgga taactattgt atgaggtcgg tacttggttt atacccaggt 73741 taatgatgtg aacactataa cttggagaga ttagctagtc taaattcaga tagctcttag 73801 gtggtggagt ttgagggcct aactcatctg tctaacctta atgctcatct tttccttctt 73861 cccttccttg gcctctacag gccgggcgcg gtggctcacg cctgtaatcc cagcactttg 73921 ggaggccgag gcaggtggat cacctgatgt aaggacttaa gagacccgcc tgaccaacat 73981 ggtgaaaccc tgtctctact aaaaatccaa aaattagccg ggcgtggtgg caggcgcctg 74041 taatcccagc tactccggag gctgaggcag gagaatcgct tgaacccggg aggtggaggt 74101 tgcagtaaac tgagatcgca ccactgcact ccagcctggg tgagagagtg agactccgtc 74161 tgaaaataat aataataata aataataatg aatggtacaa gtaaaaggca ctgaggggcg 74221 ggggggcagg ccacgtctta tctcctcgcc tttgtgcaat tagtaccttc tactggggac 74281 tctcttctct tccccaaggt tttgtctccg aggcaagctc ctacacaccc tccaaagccc 74341 actagagcat catttgccct aaagccttct ttaagagcag tcctcccagg gactccctcc 74401 aacagacagt gattgccatt tttcctctgt gcggtccagt ccgtgcacag gcacccacct 74461 cccacagcac cagaaaggcc cctggaggct gcacctaagt tcaaccctca aggcgcctgg 74521 tgcccagctg gtcttcagta aatgtgtttg taactaaaac gggaagagag gttttgaacc 74581 ttggtgttta taaccttggt gtgtcagtaa aatgcggccc gctgcccctg cctcctgtca 74641 ctcaaaatat attgagctgg ggaaagagct tcccgaacac ctggggtgga accgcaggga 74701 gaggaaggct gggcccaccc cgcactcggt taggaaatgg actcgtccgc gcctccgggc 74761 gggcgggacc tcaggggctg cctgggaaga ggtcattcct tccacggggg agcgtccact 74821 cgcctaggcc agaggaagac ttggcggcct agaggaagct ctagggagcc ccggaagccc 74881 gggacatagc tttccttcca ggcccctgcc cggtgagaga cagccaatta agtggagtga 74941 ttttcacttt acacccctat ttccttccac acagaattgg aggcagtttg caggatattc 75001 caaacagtgg tgaaatataa gaaagggcta ggaccagaaa aagagagcga gcgagagaga 75061 ttatatatac ctatttttaa aaaataattg agcaagtccc ccattctctc ttccataaat 75121 ctgactttga acgtagaatt tgctttgtta agtgagaaga aaacagtcct tgccctttga 75181 gctaaaggaa atttccctct ttgctttccc ctgcagcagg cacatttttg agtgttggga 75241 aattgaggct cagagacgtg agggacttgc tggaatccac ccagcagcag ggacagatcg 75301 tgaatgacca tgcaggtggc cagagtgctg gcaggggcta gagccaaatg ttttccgtga 75361 gtaaagggtt tccccttttc tccatgggga gctttgaaag tctccacctg ggagcctagt 75421 gggcacaaaa gaggtggggc taattactgt ggacagagcc ttaggacttt ggggagagga 75481 gagccatgtg actgaccttg gggagcaaga gacaaatttg agccccgatt tcctcatctg 75541 taaaataagg acatagtatc catttcatag ggacatcagg aaaactggat gcaaagagca 75601 atggctaagc acctgcaata cttcatttct ttttgagaca gggtctcact gtcacccacg 75661 cccagtttca agtagagtag cgcaatctca gttcactgca accttgacct cccaggctca 75721 ggtgatcctc ccaactcagc ctctcaagaa gctgtaacta taggcacgtg ctaccacgcc 75781 tggctaattt gtgtgtgtgt tttttttttt tttttttttg agacggagtc tcactctgtc 75841 acccaggctg gagtgcagtg gcctggtgtt ggctcactgc aacctctgcc tcctgggttc 75901 aagcgattct cctgcctcag cctcccaagt agcttggatt acaggcgctc gccactaatc 75961 cggctaatct ttgtattttt agtagagacg gggtttcacc atattggtca ggctggtctc 76021 gaactcctga cctcaggtga tccgcccacc tcagcctccc aaagtgctga gattacaggc 76081 gtgagccact gtgcctggcc ataatttttg tctttttttt tttttttttt ttgtagcaat 76141 agagtttcac caggtcgccc aggtttgtct caaactcctg ggctcaagag atttgcccac 76201 ctggcctccc aaagtgctga ggttacaggc atgagccact gcgcctggcc tctttttagt 76261 tttttctttt tgagacaagg tctcactctg tcacttagtt tggaatacag tgatgtaatc 76321 atagctcact gccgcgttca actcctgggc tcaagcgatc ctcctgcttc agcgtctcaa 76381 gtagctggag ccacaggttt gtcaccatgc ccggctaatc ttttttttat tttctgtttt 76441 cttaattttt tgtagagaca agtcttgctg tgttgccttg actggtcttg tactcctggc 76501 ctcaagtgat ccccctgcct ctgcctccca aagtgctggg attacaggtg tgaaccaccg 76561 tgactggcct acatctgtct ttatttttcc catttatcag atgagaaagc tgaggcccag 76621 agaggggacg tggcaggctc aaggccacag tggcaggcct gggacaaggt ctagcccctg 76681 gagccattca gcttgggttc cttcccctcc agcaccacgg tccctcctca gactctccct 76741 ggccttggcc agtttccctg tggtcttgag gaaataaagg gaaggggagg atccagcctc 76801 attccagccc cacgtccccc ccaacacaca cagtgggggc tgcttcccag gaggacagat 76861 gttttcacca aatggcactt gcacgtgctt ttctctggct tggggaggag gtgggcgctg 76921 agaaggaaat gccagctggt tgggcagaga caaccgcagc cacacctgag ggatttaaat 76981 agcctccgcc ttctctgggc accggctgag gaggaggcat ctgggcaggg ttggagggga 77041 gcggggaaca gaagggccat tcggaacctg ccgcagctca gggatgagcg cctggctcct 77101 cctcccagag ggacctggtc taggcacctc acctccctgg acctcgaggt tcatcccctg 77161 caaagcgagg ctctgaaggc cgctgaacca cgtgcctgtt gatggaaaga acctgagtaa 77221 caggcctgac ccatgcctgg cacccagatg ctcagtactt gacggctatt ggggtcacca 77281 cctccctgcc agggcaggag ggcctgctgc tgagtctctg acaaatgagc gtccaccggc 77341 cgctcaccca cctctcgcga cagggtgctc atcacctgcc atggcaaccc acagtgtaag 77401 ggaatgattt catggtagac cgtgtgttgg aatctgggct acaggactgc ttggtgtctg 77461 tgtgacctca agcaagttat ttagcctccc tgcctcgctt tccccatgtg tgaaatgaga 77521 atgaggccac tgcttcctca cagtgctgtt tagatgagtg cctttggaac tttacagtta 77581 aacacgatcc ccaggggatt cgtgtgcaca tcggagtctg attcacccgg gaatgcctgg 77641 ggaggggctg agattctgca ttcccaacaa gcggtagcaa tgctgctggt ctttggactg 77701 cgtttggggt ctcaaaggac tgccttggct gggcgcggtg gttcacgcct gtaatccccg 77761 cactttggga ggctgaggca ggcaaatcat ctgaggtcag gagttcgaga ccagcctggc 77821 caacatggtg aaaccccatc tctactaaga atacaaaaat tagcctggcg tggtggcggg 77881 cgcctgtaat cccagctact tgggaggctg aggcatgaga atcgcctgaa ccagggaggc 77941 agagattgca gtgagcagag attgtgccac tgcaccccag cctgggccac agagcgagac 78001 tctgtctcaa aaaaaaaagg actgcctttc cctgggagga aaagtaggga aagtaggggg 78061 catagggagc catagaaggg tcttgaccac tgaagtgaca gatttttatc acagaaaaga 78121 ggctggccat agtggcttat gcctgtaatc ccagcatttg gaaggctgag gcaggcagat 78181 ggcctgagca caagagttcg agactagcct gggcaacaca gtgagacccc atctctgttt 78241 aaatacattt gtcatggaaa agcctctctg gttatctgtg gcaggaggtg ggagtgggag 78301 gaggtccgtt atttagagag agatgagggc gctgaatggg aggagggctt attcaggtca 78361 aggaggggag gctaagggcc ctcagctcac ctggtccact ctttcagacg cagagactta 78421 tatccagagg gggtggtgga cctgctcaga ccacacagcc ctgggaggac tccctcccac 78481 cagggaccct tcactcccca cccttcctgg tgctcagccc tcctgggtct ctcctgtcct 78541 gtgtgtccct gtttaacatc ctagtaacac caagaagagg gaaggcacga acgagggccc 78601 tggatctccc tgaccagctt gacctcctgc ctacccacaa caacccgatt ttacaggtga 78661 gaaaattgag accaagagag aggaaggccc accgacccag gctgcagccg aggtcagtct 78721 gattctgctg ctgagtggtg tgccattacc attgtccacc tgcagcaaga tcgggtcccg 78781 accgggtgga caaccaaatt aacagacgga ggtggggact ccgcttccgg ctcgccccgc 78841 cccaccccac tctcccaggg agggagctcc agctccggga agatgaggac aatagcagga 78901 atcgccccac agggtacttc cacgtgctgg ctcctgccaa gcgctggcac cagtgttatg 78961 gggcctgtgt acagatgagg aaactgaggc agagattggg gcttggtggc aggcctccag 79021 tttctgcgga gggaaggcag agggcacttg gggcatgggc gacatgggtc gtggcgggca 79081 cccggcgcac acggcagccg ggcgcaggcc agaagccccc ccgcgtcgcc gggccgccca 79141 gtccgggagc cgggcgctgg gatgggcggg aggcgggatc tccgggccgc gccgcttcct 79201 ggctccccac cctgcgccgg cggccgccct ggccacgtca ccgcccggcc aagagtgcgt 79261 gggcggcggc gcgcgggtgc gatcgcggag ctgtgaggcg caggcagggc tctggggcac 79321 ctagagaccg gggccggaga cgtggcagcc gccctgcccg ccagaaagtt tcctagaagt 79381 ttgctgggcg cgggcgcacg actgactggc tggaccatga acgtgttccg aatcctcggc 79441 gacctgagcc acctcctggc catgatcttg ctgctgggga agatctggag gtccaagtgc 79501 tgcaagggtg aggggcgcct ggcagggagg tgcgggaccc cctctctggc cagcctcatg 79561 cccctgcgtg cagggcaggg cgctccaggt gtctgctcag ggtggggact ccggcgtggg 79621 ggtccttgaa actttgcgga gctccacttc caaaaacaat gtaaagttgt ataaaatcag 79681 aggaggagga cccctcgaga tgatgggagg cgggtgggtt tgttcctgca cagacgcccc 79741 ctcaggttgg cagtgtcact gctggttgga agagggggtt ataagggagg tgacaacccc 79801 tccctcccac cagttacacc cattgttcct tccaccatcg actgtgggga gaagcagata 79861 tgcactcagg aacggcccac gattctcaag gtcttctctt ccctgcccct tcccagaccc 79921 caccccgccc cccaaagtga gccctcgtcc ctcccctgcc tgcatgtgtg gcaccggtgg 79981 ctgtgatgtg gcaggagggc accagccggg ctcaggcggg tatggagctg cctctaaacg 80041 gaagccgggg aaggctggag gggtcccctg tttgcattct ggctgccctg ctgggggttt 80101 ggagaagctt ggaggctgga ctcgcggtcg ccaacagatc cagagccacg aaaggctgcc 80161 agagcctcct ccagccatct ttgtccaccg gcatcctcgt ggtgccctcc ctgtgcctgg 80221 cactgggtgt gcagaggcag gagtgagctc aggctcctcg tggggctgag gaggctgaga 80281 aggctgagag tgggccccga aggccctgtg gaaggagcag ctccgtgctg gtcccggagg 80341 acgagcaggt gtcaatccag gaagtgtgtg ctgtggaggg ctccggggat gaagaggcat 80401 ggaggtcacc aagcaccggg catcccccgg gaagggtggg cagccagatt tggggtggtc 80461 tggagggaga tgagtccagg agggaggctg gtccagatgt gaaagggcct tttctgccag 80521 gccagggagt tgggtctggg gagcccttca tggtttggga gccgggaagg ggcctggaag 80581 ggggcctgaa caggttgctt tgcaagcttt gagagaaaag gcacagagca gggggaggcc 80641 tggcagggtg ggggttggtg cagacatgag caacagtacg gtgtggaggg agtggttcta 80701 ggcctggagg ccatttgggt ttggggtctc tggagagaca ggtacagaga cagttcaggt 80761 tcttcctcag gggaggtggc caggaatgct gagtggggga cccggagcag caggccctgg 80821 tgaggggaga atcagctcca acctcttcct gcacttttgc agaaggccct gcacggagtg 80881 agcaaccatg accaagaggg gacactggta cctggccgct gtctgccaga cactgctggg 80941 cacctagaat acagcccgga cagcagaggc ccagccctgg tggagctgac tttcttttat 81001 ttatttttta tttttatttt ttaacagagt ttcgctttgt ccctgaagct gaagtactgt 81061 ggtgtgatct cggctcactg caacctccac ctcccaggtt caagcgattc tcctgcctca 81121 gcctcctgag tagctgggat tacaggcacg caccaccaca cccggctaat ttttgtattt 81181 ttagtagaga cggggtttcg ccatgttggc caggctggtc ttgaacgcct gacctcaggt 81241 gattcaccca cctcggcctc cgaaagtgct ggtgctggga ttgtaggcgt gggccactgc 81301 gcccacttct tttttttttt ttttgagaca gagttttgtt ctatccccta ggctggagtg 81361 cagtggcgca atctcggctc actgcaacct ctgcctccca ggttcaagct attctcctgc 81421 ttcagccttc caagtagctg ggattacaga cgcccgccat cacactcagc taatttttgt 81481 atttttagta gagacggggt ttcgccatat tggccaggct ggtctcgaac tcctgggctc 81541 gagtcacagg tgtgagccac catgcccggc caagtggtgg acctgacttt ctagtggtag 81601 gaagccatca ataaacaata gtcctaggaa actgagcaaa tgacacggtt tgttggaggc 81661 catagtgagt actatgggaa cagaaaaagg gcagctgggt ggcagggatg ggcctgtgtg 81721 tgcacagaga gaagtcagag gtgatggagg cagggcctgg tatgacctca acagacccag 81781 ccccaaagac cagagacttg ggcttcactc tcactcctcc ccttgactcg ctgtgtgacc 81841 ctgggcctgt gttttcccat ttgtcaaatg ggagtgaggg ctgcccctgg cagtggggtc 81901 cacctatgct tagtggaccc cggagggact ctggccagtc caggctcaca ttgtcctctg 81961 tgtcccgggt ctccatgtgc agcttcctgg tggaaatgat tgttttcaaa agtgtgaaaa 82021 ctattgtgct caagatcacc ggggggctgg gcacagtggc tcacacctgt aatcccagca 82081 ctttgggatg ccaaggcggg tagatcacat gaggtcagga gttcaagacc agcctggcac 82141 gtggcgaaac cccgtctcca ctaaaaatac aaaaaaatta tctgagtatg gtggcaggtg 82201 cctgtaatcc cagctactca tgaggctgag gcaggagaat cgcttgaacc tgggaggcgg 82261 aggttgcagt gagctgagat tgcaccattg cactccagcc tgggcaacaa gagcagaact 82321 ctgtctcaaa caaacaaaca agcaaaacaa aactgaagat cacaagggtt cccaaagtgc 82381 taaggaaaat gtgagtaaaa cccaggggac tggcaggctg aggatgaaga gaaaagaggg 82441 gtaggggaag gcccaagcgg gtgaccaagc gtcccccacc agggctgcct cccaggcagt 82501 ccttcgtggg gctgagggct tctctgtcaa agtctgtgtg agggctcagt gagatgaaca 82561 tcaggccatc ccagagctgg cctcaggtca gtatggcctc ccccagtggg gttcattacc 82621 cactgccagc cccactgttg ccctcacccc ggctagtgca cgtgcgcgcg tgctagaccc 82681 cccagaggga ctcaggttct cactgagcac tgtgtattcc tcagctcagt cctgccactc 82741 tgcatgaggg gccgtttggc aggtgaggaa tgtgagaccg aggctcgccc taaagccagc 82801 acattctcca gaatgttagg gacttcaggg gtgttccttc ttgctcatta aaaatgttcc 82861 cccaggctgg gtgcagtggc ttacccatgt aatcccagca ctttgggaag ctgaggcagg 82921 aggatcacct gagcccagga atttgagacc agcttgggca acatggcaaa atcttgtctc 82981 tagaaaaaat ataaaaatta gctgggcatg gtagcatgtg cctgtagtcc cagctactca 83041 ggaggctaag gtaggggggt catccgagcc tgggaggttg aggctgcagt gatctgtgat 83101 tgtgtcactg cactccagcc tgggtgacag agtgagaccc tgcctcacaa aaaaagaaaa 83161 aaaaatgttc tcccacccca aatgtctgga accagaggga tggttagaga aggcacagtt 83221 caggggtaca ggggaaccag accctgaatt agacagcaca gtagttcccc cttatccacg 83281 gaggatatgt tccaagactg ccagtggatg cctgaaacca cgggttaaac caaaccctat 83341 atatatactg tttcttccat ctgataacca agactgctgt ccgggcacag tggcttacgc 83401 ctgtaatccc agcactttgg gaggccgaag tgggcagtca cttgaggtca ggagttcgag 83461 accagcctgg ccaacacggt gaaaacccat ctctactaaa aacacaaaaa ttagccgggc 83521 atgatggtgg gcgcctgtaa tcccagccac ccgggaggct gaggcaggag aatgacttga 83581 acccgggagg cagacgttgc agtgagtcga gatcgcacca ctgcactcca gcctgggtga 83641 cagagtgaga ctctgtctca aaaaataaat aaataaataa aaagtggcca aaggacacta 83701 aggagcatga caacaaaatg cagcatggat tccgggatgg aatccaggaa tggaaaaagg 83761 gcagcagtgg gaaaactggt gaaatcagaa taaagcctgg agttcagtca acagtgctct 83821 actgatgttg atttctttgt tttgataaag tactgtcact aaatacgatg tttttttgtt 83881 ttgttttgtt ttgtttttta agacagagtt ttgctcttgt tgcccaggct ggagtgcaat 83941 ggcacgatct cggctcactg caacctccac ctcctgggtt caagcgattc tcctgcctca 84001 gcctcccaag tagctggggt tacaggcatg agccaccaca cctggctaat tttgtatttt 84061 ttagtggaga tggggtttct ccaagttggt caggcttggt ctcaaactcc caacctcagg 84121 tgatctgccc gcctcggcct cccaaagtgt tggattacag gcatgagcca ccgcccccgg 84181 cccaaatacg atgttaacac tagatatagc tggatatgtg ggattctctg tactgtcttt 84241 acaactctgt tgtaaatcta acattctttc aaaataaaaa gttaaaaaaa gaaagagacc 84301 aaagtgggtc aagaaacagc atttagactg ggcgcggtgg ctcacgccta taatcccaac 84361 actgagagaa tgaggtagga ggatcacttg ggcccaggag tttgagacca ggctgggcga 84421 catagtgaaa ctctgtctct acaaaaataa taataataat aattaataat aaaatttgcc 84481 agtctgatgg tttgtgtctg tagttaccag ctactcggga ggctgagata ggagaattgt 84541 ttgagtccag gagatagagg ctgcagtgag ctatgattgc accactgcac cccagcctgg 84601 gagacagagt aagacccttt ctctaaaaag aaaaaaataa aaaagaaaga aacatcatgg 84661 agagtatgaa cccttttttg tcaactcagt gcatccatgt gcatcatatg tgtacacata 84721 ggtgtcatgt gtgtacatgt tcacaaagat atggaaggat gaacactagg ggtttggaat 84781 atttttatct ttgcatcttt gtatattgtt tgatttttgt tttgcaaaga gtaaatatag 84841 gcataacttt tgtaactaga aaaaaggctt tttggctggg cgtggtgggt catgcctgta 84901 atcccagcac tttgggaggc caaggcgggc agatcacttg aagtcagaag tttgagacca 84961 gccatgacca acctggcgaa actccgtctc tactaaaaat tcaaaaatta gccaggcgtg 85021 gtggcaggca cctgtaatcc cagctacttc ggaggctgag gcaggagaat cacttgaacc 85081 caggaggcag aggttgtggt gagccgagat cacgccactg cactccagcc tgggcaacag 85141 agtgagactc catctcaaaa aaagaaaaaa gaaaaaagaa aaaaggcttt ttaaatctta 85201 cagtttgaaa aataaaatag aatgctcccc tggccctgaa ccttcctctg cctgctctgg 85261 tctccctggt gaatcaacac cttgtaaagt cagtgagttg ttcctcctaa gactgagctt 85321 tggggactgt cagtggaact ctgggggtgg agtgaagaga agcctcaaca acccccacca 85381 agcccgactc acaccacccc cgacatcctc cccagagcaa gagcccaggc aggcacacgc 85441 agggggactg ggaactcggt gttcgtgtct ttatttggag actgggagac agattacagt 85501 ttaatgagag gaacgacgac tcaagtgatc cgatgggaag ggtgagtttc ctggccctta 85561 gggagcgaca gtgccctgga caggaggtct ttgagtggga gtggacactc tagaggcctc 85621 cccagctcca ggctgtgcac ctcttgaggg tgggcaggct tgggagtctg tgtcctcatt 85681 gtctcctgtc tacccttggc cacaggcatc tctgggaaga gccagatcct gtttgctctc 85741 gtcttcacca ccaggtacct ggacctgttc accaacttca tctccatcta caacacagta 85801 atgaaggtga ggggctgggt gatgatggtt gggggaagcc accaagcccc cacaaactgt 85861 gaggtagcct gcttggaaac tcattctgtg gagccaggcc gacctgggtt tgaatcctga 85921 ctttgctgct tagtggctgc aagacctcat cttgggcctc tttgtatgtc agtgtcctcg 85981 tgcaaaaatg aggacagggc tgggcactgt ggctcatgcc tgtaatccca gcaatttggg 86041 aggccgagtc gggcggatgg agaccatcct ggctaacaag gtgaaaccct gtctctacta 86101 aaaatacaaa aaattagccg ggcgtggtgg ctcatgcctg cagtcccagc tactcaggag 86161 gctgaggcag gagagttgct tgaacttggg aggcggaggt tgcagtgagc cgagatcgcg 86221 ccactgcact ccaccctggg tgacagagtg agactctgtc tcaaaaaaaa aaaaaaaaaa 86281 aaaaaaaaga cagtaccatt acttagctca gagggtttgg ttgggaacat taaatgagat 86341 aataaatgta aatgtaaaaa accctgccat atattaaaca ttcaataaag ggtcacacct 86401 gtaatcccag cactttggga ggccaaggcg agaggaccgc ttgagctcag gagtctgaga 86461 ccagcctggg caatgcagtg agactccgcc tctacataaa atttaagaaa tttagctggg 86521 cgtagtggtg ctcatctgtg gtcccagcta ctcaggaggc tgaggtggga caatcacctg 86581 atcttggtag atcaaggctg cagtgagcca tgatcatgcc actgtactcc agtctgggca 86641 acagagtgag atcccgtctc aagaaaaaaa aaattattta ggtgatagag aaggattaga 86701 gagctacttc agggagtgtg gtcagggcat cttcaaagag atgacatgta agctgacagc 86761 ttagtgacaa caaggaacca gccatgtaaa gatcaaagga cacaccttct ttctgttttt 86821 ttaagacgga gtctcactgt tttctgttgc ccaggctgga gtgcagcagc gcaatctcag 86881 ctcactgcag cctccacctc ctggtttcaa gccgggagct tgccttgtgc ctcagcctcc 86941 caagtagctg ggactacagg catgtgccac catgtccagc taatttttgt atttttagga 87001 gagacagggt ttctccatgt tggccaggct ggtctcacgc tcctgacctc aagtggtcca 87061 cccacctcga cctcccaaag tgctgggatt atgggtgtga gccaccatgc ccagccaaga 87121 tgtagatttt agttggacac attatctaat aggaatctta tttcttgtgc agtggtgcga 87181 tcatggctca ctgcctcaac ttcccgggct gaaggggtcc tcccacctca gactcctgag 87241 tagctgggac tacaggcacg caccaccatg cctggctaat ttttttgtat ttttaataga 87301 gacaagattt caccatgttg cccaggctgg tctcaaaccc ctgggctcaa gtgatctgcc 87361 tgtgtcagtc tcccgaagtt ctggaattac catgatcagc taggagtctt tttttttttt 87421 gagacggagt ctcgctctgt cgcccaggct ggagtgcagt ggggcaatct cggctcactg 87481 caagctccgc ctcccgggtt cacgccattc tcctgcctca gcctcccgag tagctgggac 87541 tacaggcgcc tgccactgcg cccagctaag ttttgtattt ttagtagaga cagggtttca 87601 ccttattagc caggatggtc tcgatctcct gaccttgtta tctgcccgcc tcggcctccc 87661 gaagtgctgg gattacaggt gtgagccatt gcgcccggcc aggagtcttc ttttttgttg 87721 ttgttgaaat ggagtcttgc actgtggctc aggctggggt gcagtggtgc aatctcggct 87781 cactgcaacc tctgcctcct ggattcaagc aattctccta cctcagcctc cagagtagct 87841 gggattacag gcacccacca ccacgcccgg ctactttttg tattttttga tagagacggg 87901 gtttcactat gttggccagg ctggtctcaa actcctgacc tcgtgatctg cccaccttgg 87961 ccccccaaag tgttggaatt acaggcgtga gccaccacac ctggcccccc cttttttttt 88021 tttttttgag acagagtgga gtgcagtggc acgatcttgg ctcactgcag tctctgctcc 88081 cgggttcaag ttatttattt atttattgtt tagacagaat ctagctctgt cgcccagggt 88141 ggagtgcagt ggcgcgatct cggctcattg caacctccac ctcccaagtt caagagattc 88201 tgcctcagcc tcccgagtag ctgggattac agatgtgtgc caccatgcct ggctaacttt 88261 tttttttttt tttttgtatt tttaataaag acggggtttc accatgttag ccaggatggt 88321 ctcctgacct cgtgatctgc ccacctcggc ctcccaaagt gttgggatta caggcattag 88381 ccaccatgcc cagtcttttt tattttgaga cagtctccct ctgttgccca ggctggagtg 88441 cagtggtgtg atctcggctc attgcaacct ccgcctccca ggttcaagcg aattctcctg 88501 cctcagcctc ctgagtagct gggattacag gtgcacacca ccatgcctgg ctaatttttg 88561 tatttttagt agagacgggg tttcactatg ttggccgggc tggtctcgaa ctcctgacct 88621 cgtgatccac cctcctcggc ctcccaaagt gctgggatta caggcgagag ccaccgtgcc 88681 tggcctagga atcttatttc taaggaacaa gggttccatg ggggaattct atcttgcagg 88741 gctattgaga agatcacatg agataacata cagcacctag aaaatattta actaaagtga 88801 gttgtccttt tcttccatat cagagacatc cctttgggat ttcaggcatg ttgttactag 88861 ttggtagagt ggaggttgct gttacttgtg aggtgttgaa ctcattcaga tttagtgaat 88921 gtgaatctga ggactggaaa tcaccgtcta cttaagttca cacatgtatc aaacatcacc 88981 tgctagttca tttctcagcc caagtcagcc ttgggcattc ctggcagaga cagcagactc 89041 agcccttgcc atagctgaat gaggtctagg aagctcaaag aggctagttt ctacatttat 89101 gaaatgaaaa agtgaccacc tcattcactc attccagcat ttgttttgtg ccagggactg 89161 agttaggagc tggggattag gaggtggacc ttgccacgga gtcctctctt agaggggaga 89221 tggacttgta cctagacatt tacagagcat gtgatcacgg gctaaaatga gtatgttcac 89281 atggagctat ggcgacagaa tgtgtgcctc attgtagtga aaaaaagact gcttcgagga 89341 aataactcca aataggagta caaataggag tgctctaggc cagtgtggtc ctgctccggc 89401 agtatcagcg tcacttggga aacttgttag aaatacaaac ttttaaggct gggctcggtg 89461 gctcatgcct ataatcccag cactttggga ggccgaggcg ggcggatcac ctgaggtgag 89521 gagttcgaga caagcctgac caacatggtg aaactaaaaa tataaaaatt agccgggcat 89581 ggtggtgcac gcctgtaatc ccagctactc aggaggctga ggcaggagaa tcacttgaat 89641 ccaggaggcg gaggttgcag tgagacaaga ctgcaccact gcactccagc ctgggcgaca 89701 gagcgagact ccatctcaaa aaaaaaaaaa aaaaggaaga acaaactttc aggcccatcc 89761 agacctatga aatcaaaagc tcttggggca agggaagaga cagcatctgt gatttttttt 89821 tttttttttt tttttttttt ttgagacaga gtctcactct tatcccccag gctggagtac 89881 aatggcatga tctcggctca ctgcaacctc tgcctcccgg gttcaagcga ttctcctgcc 89941 tcagcctccc aggtagctgg gattacaggc gcccgcatgc atgcctggct agttttttta 90001 tttttagtag aaacagggtt tcaccatgtt ggccaggctg gtctcaaact cttgacctca 90061 ggtgatctgc ccaccttggc ctcccaaggt gctgggatta caggtgtgag cctccacacc 90121 caggcagcat ctgtgttttg atgcacactc atttgagaag tgctggcctg gaaggttgga 90181 ggccatatca aaatgaaatc attatatgcc agaggacagc tactgtgatt aggagagctt 90241 ttctgaagga gcaatggtca gttagttcaa gggatgttag caggccataa gtagtcccag 90301 cttctaggga ggctggagca ggaggatcac tcgaccccag ggtttgaggc cagactgggc 90361 aacatttaaa aaaaaaaagg ctagtccttg tggaagaatg ctcagaagtc actgatgcat 90421 atccaggaaa ttgttttaaa attgtttcaa gagccctgga atgcctctgg taaatcccct 90481 gaaatagacc aatcctgttt cacctaatgc tccacaaatg aaggggacga caagggcacc 90541 tttgttcatg gcagcactgt cagtcccagt atagaagcat ggagttattt aatgaaaggg 90601 tcactgattt caaattctag gcaactggac ccaagataaa ttgcaaacac atttaacctc 90661 ttcatgttac attatggcag aacactgacc cttaggtttg gtaggctagg agcgtagtaa 90721 ataggccggg aaatgacttc atagattcga ttcccatgtc tctctcccct tttaggtggt 90781 ttttctcctc tgtgcctatg ttacagtgta catgatatat gggaaattcc gtaaaacttt 90841 tgacagtgag aatgacacat tccgcctgga gtttcttctg gtcccagtca ttggcctttc 90901 cttccttgaa aactacagtt tcactctgct ggaggtaagg gaatggactg agtaccagtt 90961 ctcaaaggga aatatgctcc ctgcatcctt ccctccagac cctgggcttg ccttctgctt 91021 acaactgtga ccattactca tgtatctttc agttataagt tgagaagaaa ttgacaagct 91081 atttacaatg ctttagttgg aaatgagcac atctcaatag tcatgatgcc cataaaccaa 91141 gcaggcatca cagccctact tctggctgtt gagacgttct gttcaaccga ttagcagatc 91201 cctgtatccc tgtagcacag tccaggccca gaaatttcaa ctcagtaagt ttggactgtg 91261 gcccatgaat ctgcaggcaa caagcatcat gggagattct gatgctggtg tttatggaca 91321 tgactaattt tttattattt ttttgagatg gagtctcact ctgttgccca ggctggagtg 91381 cagtggcagg atctcagctc actgcaacct ctgcctcctg ggttcaggtg attctcctgt 91441 ctcagcttcc caagtaactg ggattatagg cacgagccac cacactcagc taatttttgt 91501 atttttagtt gagacggggt ttcaccatgt tggccaggct ggtctccaac tcctgacctc 91561 aagtgatcca cccacctcgg cctcccaaag ttctcggatt ataggtgtga gcctggcaac 91621 atgactagat tttaagaaac tgtggcttca attatgattt tatactatga aactatcatt 91681 taagttattt ttggccgggc acggtggctc atgaggtcag gagatcgaga ccatcctggc 91741 taacacagtg aaaccctgtc tctactgaaa aatagaaaaa aaaaaattag ccgagcgtgg 91801 tggcaggcgc ctgtagtctc agctactcgg gaggctgaga caggagaatg gagtgaaccc 91861 aggaggtgga gcttgcagtg agccgagatc gcaccactgc actccagcct gggagacaga 91921 gcgagactcc atctcaaaaa aataaataaa aaagttattt ttggctgggt gtagtggcac 91981 atgcctgtat gtagtcccag ctactaagga aggtgagatg gaaagatcac ttgagcccag 92041 gagtttgaga ctgcagcaag ctatgatcat gccactgcac tccagcctgg gtgacaacag 92101 caagacccta tcttaaaaaa aaaaaaatta aaagtagttt tctatataaa tgctcacgct 92161 acactgtgaa aaaagcagtg tactgtataa tctttcaaaa attagaaaaa aaaaccctta 92221 tatttaccta gagatggtag gttaaactga ttaaaatatt ataattttta ttgagaactt 92281 tttccctcct tactgtacct tctccagcaa gctattgttt tagtaattat gttcttttaa 92341 gatgggcctc ttcttggtct tgctcagtct ctggttgctt tctctttggc tcagatcctc 92401 tggactttct ctatctatct ggaatcagtg gctatcctgc cccagctctt catgatcagc 92461 aagactggag aggctgagac cataactact cactacctgt tctttctggg tctgtaccgg 92521 gcactctacc tggctaactg gatcaggcgg taccagactg agaatttcta tgaccaaatt 92581 gcagtcgtgt ctggagtagt acaaaccatc ttctactgtg acttcttcta cttgtatgtg 92641 accaaaggta ggtcctggga tgacagcaat gctgacactg gcctaaggag ttactcatcc 92701 atttaataag tattccagca gatacagatg tgaacagtca agtctctgcc atccacaatg 92761 cttgtgttct aatgcaagaa gacaaatatt ttcaataaag aaacaaatgc cataaaaaca 92821 tgcaggccaa taggttatgt gtactatgca agacagctgt gagatgacat ttgacagaat 92881 acttggttag tagtgaaaaa cattccagaa cttattacaa gccccaaggc aaccaccagc 92941 ttggtgtgat ggacaaaaag cctagtgggg tcaaaatcta taaacctgga gatgagtcaa 93001 ccaagaccag ttcacatagg gcaggggtgt ccaacctttt ggctgccctg gcccacactg 93061 aagaattgtc gtgggccaca cataaaatac actgatagct tatgagcaaa aacaaacaaa 93121 aagacacttc atatgtttta agaaagttta caaattcgtg ttgggccaca aagccatcct 93181 gagccgcggg ttggacaagc ttgacctagc gcctcgtagg ccagggtaag gggcttgcac 93241 tacattctga ttgccaaggg caggtattgg gtcattaaaa aacccaatcc actagctgct 93301 atgtagggta gaaagcagtc caagtacagt gactagctat ggggcagttg cagttgatca 93361 ggaaaaaggc tctaggtaga aagacatgtg gacacaagtt ctaggacttg agaaggactg 93421 ctgatttgat tgtatgagat gtacaataga caatttgagg ccaatcgcta agtttacagc 93481 tggggggaag ggtgccagta accaagtcgg ggatcagtag gtccatagtc aaccggagat 93541 ggactccccc tttgctttaa caatcgaaca tatactgtga gccggccatt aagagatgat 93601 accaagatta tgcatcaagt tataaagcag tcctttcatc agatggtaaa ttcttatttc 93661 atctccattt tcttccagtc cttaagggaa agaagttaag tcttccaatg ccaatctgag 93721 gaccttcaga gacagtctac gccttaacaa gcacatgaag gaaactattt tgaatgttct 93781 ctttggcaac ttatccataa tttgggatca aatgttaaaa ccagaaaagt gtttagtgtg 93841 gatttcagca aaacctgatc atcccaccca gaagaccttc tcatcaatag atcgccctta 93901 aagacccatt gtaaggtcat aaaaaacctc ggccacctgc acaaagatgg tgcctcactg 93961 caacaagaaa ccttaaggtg tcttaccgac gaaataaaaa acataaatga ttgttctcca 94021 aggcctgagg gcaagactca tgatgagcaa gtcaacccca atctggaaca atgtccctcc 94081 tcttagaatg tcccaactaa agaccagtta aaatattagg gtacgttctt gtgaatttcc 94141 actttccagg tagatgacca aatttaggtg gtcaagatat aaaggtgtca gctagtttta 94201 agtgtgaaac ttatttcact ttcacactgc cttcaggcca gaagcaaacc aaatttacca 94261 ggtttggctg gaggagtttt gtgactcatc ttttactggt ttgaattttt tcaaaccagt 94321 ggctgatacc tgccttgtac ttagtacctt aataccaata acctaatggt acttaggcga 94381 gtaccatttg cacaatcact gttttactta tgagcagata cagatatatc caaaccctta 94441 cctactaggt atcctgctag ggttttcaat tccaattctt gtattaagtt ttttcctttc 94501 agttttaggt gcgaaagtaa tcagtcaatc caatatcccc catctttgtc ttgaaacaaa 94561 aactgtttta agacgtctac gttgaattat tcagagaatt aagcaataaa agctcacacc 94621 ttattgtcaa cagtgttttt atttatacct acaaaaagaa aacaagatga tggtatcaaa 94681 aggacaattt acaaactaag aatagtaaca tagctttcag catcctgtgc ctgaacatca 94741 cacatctaca agtctttcaa gtcttaatgc aacaggaatg tgtctggaga ccagcaagaa 94801 catcaataga gagcactgat cccaagcaaa agccactaac cttttagatg agaagtccac 94861 acaacgaatt gttagggagg attggggaga agcagcccat tgcttaatac attggaaccc 94921 tttccctaag ttgagtttca accatgaatg caataactag cataaaacga ttcttctgct 94981 catgttctga agccaacagc agaacctgaa ttataagtga cagacatgga ggcagaagag 95041 ttaaactctg ctagatttca gctgtgctca ggccataata gtttttgagg tttggaattt 95101 actgttattt tatgattaca atgtcccagg tggaaaaagg gaagcaagca atccaaataa 95161 ccacttgctt gcccagagac ctttccctca aacagatgct ttcaagagct gcgagagagt 95221 agggcatccc ttgtggtggt acctctatgt ttaaagaaag aagaaaaaaa cccagaattt 95281 ggttgtagaa aacaatgccc acaacagact ggccagtgct tagacaaatt tggggttggg 95341 gggaacactt tggtttgaaa gcacagagca gtttgccatg tttcttctgt gcctaccatt 95401 ctcccttggc ctcaacttct gtaagatggg ggggggacaa aaagagaagt aaagttaaga 95461 agaaagtgga aaattaaaaa aaaagatgtc aaagttttta catgcatata tttcagctta 95521 tgctgaagac ctacctgtat gttgcacatt gaatcatact ttcagaaccc ctcagaaacc 95581 atccctctct ccctaaagaa ttttaaaagg aaaacaaaac ctcgagtata gatcttacag 95641 atgagcaagc attcaggcct tagccaaaga atgcagtgga gccttccccc ttcaactgca 95701 ttgtgaatga ataccaatta acagcataaa aattaatagt cccatatcag atctggaagg 95761 ggtttctggg gctgtctgat gtccctatcc tgttgtagtg aacacaatag cagaaaattc 95821 tttctgggtc catctgctat aaagtcttgg taaaacagca ttaccatgaa gaggatgaac 95881 tcacctacct tcagatggag gaaaagtgaa aaggacttag gctttagtcc tccatgactt 95941 ttcttaagca ctacctacct gtaataagct gagtgcaaaa ggatgccgaa gaaaatctgc 96001 acccagaagc tgttagaaag cactgcagag aacagggtat gaagaaaata aagagttctt 96061 aataaaccct taagattctt tgttcaaggt aactttgcca aaagggcaga gtaggtggca 96121 aagagttgct tttaatctag ctctacactg catttgaaaa taaaatttgc ccattttgaa 96181 tatattgttt ataattaaat gtgcttttta cactgcaggt caatataaaa actggttagt 96241 aaatttccag cgagcattta tgttcatttg ctcacagcag ctgtctggat ggaaaattaa 96301 tttcacagat ggtccccagt taaactggga agaaatatag gcagcttcca cccagatgct 96361 gagatgctac agtttaacca ctacaataaa cacttgtggt ttttaattta aaggatacac 96421 aaaactcagc aattcaattt ctagcctgac actaaaatgg ttatttttca gtaacggggg 96481 gagaagtggg gaggcagagt gtgaagggaa ataaaaccaa ttagtaattt ttaactatca 96541 aatgcactcc agcaatcagt caaaacaggc ccgaggaaac ctgttccaac ttaagaaaca 96601 tttaaaagca caaaagaaaa tgtgcagggc aatccctctt tgttttctaa cccagtaatg 96661 agaaatttaa agcacagatg ccatcttcct ctgcaaaaca attcagccct ccccctccct 96721 ccctccccac caactcagaa aaacaaaaca tggggctccc acaaaagggc ctaatacctt 96781 actcttttag atgaaaaata gctcaaatat cttcaacatg caagaagttg ggggaagaaa 96841 ttaatctctt ccagtcagct atatatatat atatatattt tttttttttt tacaaaatgt 96901 tattccagac tggatcattt tggtgggcag aagaaacctg gcagtgaatt ctaactaatc 96961 tgcatgaaaa gacaaatcac gatggttggg gggaaaaatt aaaaaaaaaa aaagaaaaaa 97021 ggaaaaaaaa agaaaaggcg aagaggaaaa aaaaaggaaa gacagtgttc cttaaaatgt 97081 aattaagtct gctggagtca ctaccacttg agtggtttca tttacgtgaa ggaggaggag 97141 ggggaggagg aggagggtat tggtaggcag tctgccccat gtaacctatc atattggtag 97201 ctcccggagg ctgtgcaaac tgttgtgaca tcagtggctg tggctgctgc ccagaccggc 97261 ctatcccact aaactgctgg ctagagctct gtgaacttct cccagttgag gtggtgctac 97321 tagctccata agtgccagca ccatattctt gagctgtata gctactggtg ccataagcag 97381 ctgccccata ggtgccttga ccataggtgt attggcctgc ttgtgctcca aaggcagaat 97441 ttggacttcc atagccactg ccattagcat aaccagctct atcggtttca ctacgatccc 97501 gatagcttgc agagtctctc cggccaccat ccttgactcc tcgaagcctt cggtcacact 97561 catcctgata catcagattg ggattgttgg ctgaagaagt ggtccggtaa cgagaacgac 97621 cacctaatgg gaagatacaa gaagattttt aatggcagat ataggaccta aacataacaa 97681 tgtaaaaatt ctactgggta caaaagtatt aaacatgtct ccttgatatt tagttatgct 97741 ggccagcctc ccaattagta ctgtaaaaat atactcataa tttgattctt taatggggca 97801 cattcaaaag cacagtaaaa attaaaggac actagacctt ttcacccttt taggcccctt 97861 acttgaggat gatcaagaaa cacggtatta caatatgaaa ccttgagtca ggaagcccag 97921 atcctaggat attctgcaac tgtagatgca actgctatgt ctcagtttct acccaccact 97981 aaacagtgta aaatgactga cctctccgta tggaaaactg gaaacctaat atactactaa 98041 gcatttttga aagtattaaa aaagcaacca tgaccggggg cacactggct cactctcgta 98101 atcccagcac tttgggagat caaggcagga gggtcacttg aggtcagcag ttcaagacca 98161 gccaacatgg tgaaaccctg tctctacaaa aaatacaaaa attagctgga catggtggca 98221 catacctgta gtaccagcta cttgggaggc tgaagcagga gaatcacttg aacccaggag 98281 gcagaggttg aagtgagccg agatcgtgcc actgcactcc agtctgggca acagagtgag 98341 actgtctctg cccccgcccc gccaaagcaa ccacattaaa actcagacca gcctcagcta 98401 ggtttggtgg ctcactcctg taatctcagc actttgggag gccaagacag gtggatcacc 98461 agaggtcagg agttcgagac cagccaggcc aacatggtga aactccacag accagcctca 98521 gctaggtttg gtggctcact cctgtaatct cagcactttg ggaggccaag acaggtggat 98581 caccagaggt caggagttcg agaccagcca ggccaacatg gtgaaactcc acctctacta 98641 aaaatacaaa aattagcagg gtgtgctggc gggtgcctgt aatcccagct acttgggagg 98701 ctgaggcagg agaatcactt gaacctggga ggcagaggtt gcaatgagct gagatcgcgc 98761 cattgcactt cagcctgggc gacagagcaa gactccgtct caaaacaaac aaacaaacaa 98821 aaccaaacct cagaccagtc tctaagaaaa aaaaattcac aatttagctt aactaatagg 98881 actatcacaa caaacttcac gttttcttag ttgaaaaagg ttataaaaag cattaggcag 98941 tgtcctttgc tattaagcaa catacttaca gccttcttaa aaacagcaac taaaagcaag 99001 tcacagaagg cgtaaagaga taccttggca gtacctggaa aactccagga cttaccctta 99061 cccccgcctc cgccgcctcc tctgtggtcc acaagctgca tcagttttgg attgatagcc 99121 tgattggcct cttccagcac tttgataagc tctctggcct gttttaggtt ccctggggtg 99181 aagaaggtat aggcggtacc cttgttggtg ctacgggctg ttcggccaat acggtgcaca 99241 taatcctctg agctgtttgg atagtcatag ttgatcacaa acttgacatc ttccacatct 99301 tccacgtcaa tgatgagtca gtgtgtaggt tgatgtggca gggaaagaga aggtaaagga 99361 catgccaaca acaggtggga agcacgaatg cagatgacag caagaggaaa gaaggtgaag 99421 ttagtaacca ctctgaacag aaacaaaaaa agactcatcc caacagagta aaagaggatt 99481 actgattcct gggacatgca taggaaatag gatctttatt ggctgtagaa tcctaaggga 99541 atcccatcca aacatccaaa ctcctgcctc tgctacattt taccaatata ctgacaggag 99601 ttacaccaat attggtcttc agacgcaaat tccagtacta tcctttacaa aaccagaaca 99661 tagaacaaag tggtaaacca ctgtgaagtt atgaggccaa atcacttcta ctctgttggg 99721 agaagcctgg caaaatctac caacaacata acaaacttac ctttaaacca gtcacagagc 99781 aaagtaaaaa aaggagcttc atatatttta ttactacaat ttaaaggaaa cttaaatgtt 99841 ttgattaaca tttaactttt tttgctctta tctttttaag aaactgagta gagccaaagc 99901 tgttaagtac acacctattt tcctttgcag aaacatccag attatctatc ccctttcaat 99961 tacaagagca cattacacta aaattttact gtactaccaa gcccagcttc tgccaaaatg 100021 aggatatttt tgctcaactc aaggttattt caactcagga cttgtttcaa ttatcaatga 100081 ttgtctacgt gacctgggaa attaaattaa tagccaaaga acgtgggaaa accatgaaac 100141 cttgagacac aagtttagta ttctctctac tgctgggaat tgggctgaat aacctcctaa 100201 ggcttaaaat gtaaatgtta gtgtaatgct ttcagctgaa tgtatatttt tacctactta 100261 aacaaacaat tttaagaata taacaaagaa tcctactgtt ttgtggaaaa aaatctgcat 100321 tttacaaaga atattcagaa aatatcctag ataaaacaca gatttttgtg atataaataa 100381 ttaaaaaaca ttctctgttt gttaccagac cgatgcacac tccctcctcc tttgggaaac 100441 cgctgcagcc gatcccgtct ctttgccttt tatttttggc ggcctccttt cgaaaactcc 100501 gccttctgac acgggaccac tggagcgcgg ggagcatgca tcaagggtcc gtggcagtga 100561 gaagtcccca cacacgctcg ctctgtgaac tggcttagat gcttctctgt agccccattt 100621 ggttgccaag cgccggagaa cctgggttcg ggtaaaaatt tcaggtcctc caacgcctcc 100681 cccaaggata attggggtga ttgtagaaaa aaataattaa taaaaaaaaa tgtaagttac 100741 atccaaaggg tcagactgca tctattgccc agactggatt aatttcaagg gagaaagggg 100801 agattaaaaa aaaaaaatta gctgttgtta cagggcttgt gaagaagggg cattatgtct 100861 gtttcttatt acaataaagg ctcctcggtc tttactgacc cctaaagtcc tgaatcacac 100921 cagaaagacg agaaggcact tatcacaggg ggcagcttga gtctccggct agggtcgttg 100981 acgcaacaaa ggaaaaaaga atttacaggg ccagggtgga tgtaagacag ggggtgggga 101041 attctactcc atggtatctt cagagctagg ataatgctcc ttatgcaatc ccactgcata 101101 tgaccatggc agtagaacaa gttcaattac tacactggat gcgttaagtg tgctttccta 101161 gcagaaagca ccagggtgga gtcaacagtt cacatgctaa tacttggaag tatttctaga 101221 agggggtgct caatagaggg cagacatgat gcaagttctt catactagaa aggtgtcctg 101281 tgtgtgcatg cacagctgga tgggggcaca caggagcaag cgcagaattt ggttttctcc 101341 agtcaagtct accctgatgt tatctgtgca ctgccacaat attatcttgc tgcctttcta 101401 gaagatgctg tagcagggta agaaatccaa agttaatgtt ttggccagct gactgggata 101461 agaacctttt gtctgaaagg ccctatgaca tggcctgaat cgccaatcac taataaggag 101521 ggactaagtc tagccaagag gctcatctag agctgcttcc ggccaatgta gatttgtctc 101581 ggccatgtat tgagtgatct gcagctgata tcaaagcttc tgttaaaaaa caaaacaaaa 101641 taaattcaat acttttagtt taacagtgtg tagttcaaac aaaaaaaagg aaaaataaaa 101701 aatatcccag tggaaaggga agtggtagcc taaggatgac tattataaaa ttatgggaat 101761 caaagaactt acctgaaggg aacatttaag agaccatcgc ttgacaggaa gtgaaaaaag 101821 gccctggtga ctcagctgaa aattgcaccc ctgtctatga ggtagaagcc ttcactttcc 101881 aggatcttgt aggacattgg tcagaattct gccattttgg tgggttttct ccctcttttg 101941 attttttaca aggcagcaaa caaaggggcc caaaacaagc actgatgttg agtaacaaga 102001 aacagactcg ggtaccttcc ctctcccctc acttggatga gtgggggtgg gatgaagagt 102061 ctaagttatt ctaatggtag ccaattaatt tggtttagaa atgcaagggg acatggagca 102121 aatattttta attgtctagg caagatgcac aaagaagcta ccagcacata gcaagcctgt 102181 ttaaaaacca accattactg aaagctttat gtaaactgct ttcaggcctc cgcctcctct 102241 cccaacaggc ccaagttcta atggcacaga tccagacatg acctcagagg gcagaggggc 102301 agagagaact ttccatcatg gctgcaggtc accacctctg aaggagggaa tgatgagtgc 102361 aatgcataaa aaggcttcat tacttttgta ttccagtttg aatacctgtt ctctttatat 102421 tatatagtat aagggaataa tatggtactt ctcaccttac tgggcaaatt aaatacctgt 102481 caataattta aggatttcct tgtaatacaa ataatctttt tttttaaaat atagaagttc 102541 tgagttagac ctgtttagct cagaatagtg ggctaaacta ccataaaatt ctctgtatat 102601 cttaaatggt aatgggtcaa aaactccaga aaatcatcag ttgataacac acctacagat 102661 aagtgcatgg gtaggagggg atagccaagt gcccatgata atttgacctc agtaaattaa 102721 actgggcaat acacatattt gctattctga tactgcatta gacttataaa attccatcta 102781 ataagcattc ataaaactgg acctctctgt atatatctag cttagacagg gatagggaaa 102841 agaataactg aagaaactag cttacaatag ctaggtttcg tcaggcttat tctatccagc 102901 cagaaaccac caccagagag aagctgagcc attcagctgt ctgtctcctc tccctctgtt 102961 tgaatagtca tgcctaggcc ttgctgcaga ccaaagcctg tttgttagga gcaggaagag 103021 actttgaaaa taaaaggggg gtgggggaaa ccaaagttaa aaaaaattgt ggggggggca 103081 gggaatgggg agaatcacgc tttgtatttt tttctttcaa atatttatct aaccacatgt 103141 atatacaact ttctaaactt gaagtctgaa tttgaaatga cgaatcttta aaccaaaaga 103201 tacatatacc attgacagag acacctatct atacaaacct agcccacggg aggctacatc 103261 tgtagcaata aggatgggtg cctttccaga acggaactct gtaaaaacaa acaataatca 103321 tcatcaaata agcattcctt gctatctgat ctgtttacat aaccatgtta aaatttcaaa 103381 taatggtgcc agaaaatccc aagctcttgt aatgactgac agtgattctt ttgaaaatct 103441 ttattacact catattcagc agcacaatag ttacctctac taataatgaa ataccgttaa 103501 gagagtgtgg caatttgtta cataattaat atgagctaga tttttttccc cttaaagtta 103561 tttcccaaat gagacacaac caaaagttgg gggtcagtag gaaacaagtt caaaagatgt 103621 agaatgaaaa gaactgaatt tccccccaaa aggtatgcat attatttatt tttctttctt 103681 ttttttgaga tggagtttcg ctcttattgc cgaggctgga gtgcagtggc gcaatctcag 103741 ctcaccgcaa cctccacctc ccgggttcaa gcgattctcc tgcctcagtc tcccaaatag 103801 ctgggattac aggcatatgc caccatgccc ggctaatttt gtatttttag tagagacggg 103861 gtttctccat gttggtcagg ctggtctcga actcccaacc tcaggtgatc cgcccacctc 103921 agcctcccaa agtgctggga ttacaggcgt gagccactgt gtccggccac attttatttc 103981 ttaaacaaaa tatggcctct tccatttctc aatggcttcc cttaagagta gtcctgaaaa 104041 aaaggactga aaggaaatat gctaaaatat gactcattaa gaaacatttt tctttaaata 104101 tttcagtaac agtttaattc tccccaaacg gtagttatat ttttataatc agaaggaaaa 104161 aattgaagac tttattaata aagggagggt ggtgaaagtt gtgaaaaagt ttaaagagcc 104221 agcatcgtgg cccttaaaga accatgagca cacgctaaaa gtaccctcag taattcaatc 104281 aactatacaa caggggtaag acaaactggc ctttgggcca aatctggcac aacgcctatt 104341 tttgcaaata aatattgtct gaacatagcc acgtttatct ctttgagaca gggtcccacc 104401 ctgtcaccca ggctggagtg gcacgatctt ggctcactcc aacctgtctc ctagactcaa 104461 gcaatcctcc cacctcaatc tccggagtcg ctggggttac aggcgtgcac caccaagctt 104521 ggctaagttt tgttttgttt tgtttttctt ttagagacag gatttcacta tgttgcccag 104581 accggtcttg aactcctggg cctaggtgat ccacgcgcct cagcctccca aagtgttagg 104641 attataggca tgagccacca acggtgccca gtccaggcca cttctttaca tattatctac 104701 agcctacagc ttttgtccta aaacagcagt gctgagtggc tataacagag accaccattt 104761 ggtttacaaa gccaagcata cttactatgt ggccctttac agaaagtttg cctatccctg 104821 aaatagagca tgaacattca cttatgggca gggggtgggg aatcaacaga aaatgcttag 104881 ttttaaactt accattaagt acccaatctc tttctggttg actcttgtct ccatggatac 104941 acatagctgg ccaactatca gaagaaaagt gaccaatatt ttaaaaatct tttaaagtgg 105001 agaaaatcga actttaaagc caaatgctat catgcaattt ttgtaaggtt tgtcaggctt 105061 catgaataga tgactgtgct cattaaaaga gcaaactgca tggataggca ttattgaaat 105121 attacacaac atttggaata ccaatttaga aatttccagt taactataaa accagaggtt 105181 aattgccttg gtgctatact cacccatctc tgcgcatcct tcgagtcaga tcatcacagc 105241 gtctctttgt ctccacaaat attattgttt tgttttcctt ttcagccatt atttcttcca 105301 ttagttggat caacctgaaa acatgaccaa caatgcagtc atcacacaga aagttattat 105361 gtggacgatt atgtctgcaa tatcaactcc caaggctaca gtactttctt tgcacattac 105421 gcttttaaaa tttttaataa ctattagagt actgagtcag gttctgctat gtgctcaaga 105481 aatactaact cataccatcc tcacagcaag tctcatgtaa gtggtattac tcccatttca 105541 cagatgaaga aactgcagca ccaagacaca aaattgtgca aaattataca gcttgtaatc 105601 agcagagcta aatttcaaac tcaagaattc agaatccttg catttaacca taatgctctg 105661 acaaatgata tgatgatgga ttatcagaaa cctagcaatg cttttaatgt tctcaggcct 105721 caaaatatac tggtttctac agtactaatt ttggagggtt attagaaaag gtttatcagc 105781 attatcaaat cagagtaaaa tatctttcat acttgtggtc tttttcactt tccatgcaga 105841 catccactat ctggaggatg ttgtggttgg cactcaactc cagattgcct acgttgatct 105901 gggtgtaatc acgaaggaaa tcctctgcaa gctgtcttac ttcttttggc caggttgcac 105961 tccacatcag tgtctgccta tcaggctatc aaaaaaggaa aggctttctt tcaggctaag 106021 gaacttggac aaatggcata aagactgctt attattagct ttacacaccc tgatttggtc 106081 aacaatttta cggatctggg gttcaaaccc catatcaagc attctgtcag cttcgtccaa 106141 tacaaggtaa gtacatcggc gaagatttgt ctttcctgac tccaggaaat ctatcagacg 106201 tccaggagtg gctatgcaga tctcaacacc tgtaaaacaa aaccaatgat cgagccaatc 106261 gactaggtgc agtggctcac agctgtgttc tagcactttg ggaagaggag gcgggaagat 106321 cgcttgaggc cagttcacaa ccagcctggt caacatagca agaccccgac tctacaaaaa 106381 tgcagcagca gcagccctgg tggctgtact caatgctgca gagaagctat tttttttttt 106441 ttttttgaga tggagtcttg ctctgtcacc cagactggaa tgcagtggca cgatctcggc 106501 tcactgcaag ctccgactcc caggttcacg ccattctcct gcctcagcct ccccagtagc 106561 tgggactgca ggcacctgcc accacgcctg gctaattttt cgtattttta gtagagacgg 106621 ggtttcacca tgttagccag gatggtccca atctctggac ctcgtgatcc gcccacctcg 106681 gcctcccaaa gtgctgggat tacaggcatg agccaccgca ctggcccaga gaagctaaat 106741 tttaacaaag taaccgcaaa accatgtcag gacttatgta caatcaccaa atcctcacaa 106801 agtttccagc catgctgaac cttctctctc aaaatagctg ttttagtcac atctgagaat 106861 ttctaagctg ctgaaattct gagtttatca caagaaccca ctttttttct ccaatttcct 106921 ccacaggaat tggtaaagta agataaacta acaattcttt tacactatat attattacct 106981 ctttccaagt ctcgaatctg gggaccttta ggagcacctc cataaataca agtactcttc 107041 aatctagaac atttgccata gtcatcggcc acctgctgta cttgctgggc aagctctctg 107101 gtaggagcca gaactagaca ctgaaaccaa caaaaagaca cttgagacct catccaaggt 107161 ttaaaaaaaa aaaaaaaaaa aagaagaaac aaaatacaaa aagctaattt aattctttgc 107221 tgtctaatct agcacattaa aagtgatggg gtgaagatca gaagtgggga atattagcac 107281 aatggaatat agaaaaaaat tattagaact tcatgtttag catttggaat gatactggta 107341 tcctcgtcac ttttactatg acacattatc aatcattata gtttatatat actgtttaag 107401 tggagatata gattgtatta cctggtttta agtatacaac tctcatttac aggttgagca 107461 tcctaaaccc ccaaaaccaa aatgctccag caagcattta ctttgaggat catattggtc 107521 ctcaaaattt ggatttgttt tgttttctga gacaaagtct cactctatcg cccaggctgg 107581 agtgcagtgg tgcaatgaag gctcactgta acctctgctt cctgggttca aatgattctc 107641 ctgcgcctca gcctcctgag tagctgggac tagaggcagg caccaccatg cccagctaat 107701 tattgtattt ttgtagagat gggggtttcc ccatgttggc caggctagtc tcgaactcct 107761 gacttcatgt gatctgccca ctttggcctc ccaaagtgct gggattacag gtgtgagcca 107821 ccatgcctgg cctaaaattt ggaattttgg atggtcatcc tgacttaaac tggcagaatg 107881 acacattcct gacctctaat ttaagcacat caccaggtag ataaggcaaa aacaatgaaa 107941 atttacatgt aatctgacaa gactgtgtac aatctggcca aataatacaa aaatcaaggt 108001 atcatatacc attcaaatta aattatgaaa cttgctttat atactgatcc attagatatc 108061 aactaataat aatgctgtca gacatatact atgtattata agtactgcat tatttctttt 108121 tgtgaatatc ttgagacata ttgtacatgt tcataaatat ataggacaaa tgcaagctta 108181 tttcttaaaa aagttaagta ggccgggcgt ggtggctcac acctgtaatc ccattatttt 108241 gggaggccga ggcagatcac ctgaggtcag gagttcgaga ccagtctggc caacatggtg 108301 aaaccccgtc tctactaaaa atacaaaaat tagctgggtg tggcggcgca tgcctgtaat 108361 cccagctgct tgggaagccg agacaggaga atcgcttgaa gccaggaggt ggaggttgca 108421 atgagccgag attgtgccaa tgtactccag cgtgggcaac caagagcgaa actccatctc 108481 aaaaaaaaaa aaaaaaaaaa aaaaagtaca cattcaatgc caggcgcggt ggctcacacc 108541 tgtaatccca gcaccttggg gaggccgagg caggcagatc acctgaggtc aggagttcca 108601 gaccagcctg gccaacatgg tgcaaacctg tctctactaa aaatacaaac attagctggg 108661 agtggtggtg cacgccttta atcccagcta ctcgggaggc tgaggcagga gaatggcttg 108721 aacctggcag gcggaggttg cagtgagctg agatctcgct actgcactcc agcctgggca 108781 acaagagtga gaatatatct ccaaaaaaaa aaaaaaaaaa gaaagaaggc tggacgcagt 108841 ggctcatgcc tgtaatccca gcaatttggg agggcaaggt gggctgatca cctcaggtca 108901 ggagtttgag accagcctga ccaacacgga gcgaacctcc atctcaaaaa aacacaaaaa 108961 atcaaaacaa aaaaacaaaa aaaaccacac caaaaccaga aaacacttga tttactgcac 109021 agtaaatgcc catacagcat gcatggtttc tttaagagta agactttggt tcaaatgtaa 109081 gctcagctat tcaattgctg agtgatcttg ggtaagtcat ttaattccaa gtaagatgag 109141 aatgaaacca tttccaccta tggagggtgt ggttaagagt acaaatggat aatgtaaagc 109201 accaagccta aatagtcgct aaattgtagt ttttcttaga attacaaaga aactgaaaca 109261 cacttacgat tgggccatct cccctttcca agtatggctg gtggttaata tgaacaattg 109321 caggcaggag atactgtgga ggggggaaag aatgacaacc ttacgcttag aagtacaatc 109381 ttacttagat acttagtaat tactgaatta atgcataaat aggaaagtga aaaacagaac 109441 ttaccactgc tatatacagg aagcaataca aatttgaaca taaaatgctt ccatatattt 109501 aataactaaa atagtatcta actatctgaa tggcacttct acgaagaact gagaacgtat 109561 gacaactaga atagcaaaat aagttaccac aatatcaagg atcttctctt gctcatacat 109621 accgccaacg tcttcccaga gccagtctga gcaatgccca ccatatcccg gccactaaga 109681 gccaacggaa atccctggca ctgaattgga gttggttctg taaagtgctg atccatcaac 109741 acatccatta catattctgt aagagaattt tattttgcgt attttattgt gtttaaagac 109801 acaaaccaag agtttttaaa aaaactttta agctcactga agataagatt tacccagctt 109861 catctctcca aagtgctttt tctttatcta ctagatctcc gaccaactca ctgttagctt 109921 ctaagggcca ggctaggtgg ctcacgcctg taatcccatc actttgggag gctgaggcca 109981 gtggaggcca ggagttccag accagcctgg tcaacatggt gaaccctgtc tctactaaaa 110041 atacaaaaat tagccagagt ggtggtgggc ccttgtaatt ccagctactc gggagggtga 110101 agcagaatcg catgaaccca ggaggcagag gaggtagtga gctgagatcg cgccactgca 110161 cttcagcctg ggtgacaacg ttgagactcc gtttccaaaa acaaacaaac aaaagctgct 110221 acaatagtct acctctccca gaaatttgaa aaaccattac cctaaagcat ctaacaggac 110281 atgggaacat ggtagtcatt tattgtttag caaattcaca ttatatccag ctagaaagca 110341 atttcattct gctgttttct aaaaggcaga gggcaaaacc ctagaaaaag acaaaaacct 110401 caaaaaaaca aaacaaaaac aaaacctcag tctaaatact agaacagaac tgatgaccat 110461 aaaaacaatc tcttgtcctt tcaacaaaac cccaagcctg gctcaaagag tttaacctac 110521 tacccttgtc tcccattcaa cccaatttaa caaattaagg ttctagattg aagaacactt 110581 acgtgggaag ttagcatgat ggaaggcaaa cacgggttta ggacaaacat ctccccccct 110641 cactgtaatc tccttctttc ggcgtagctc atcaacctca tactattgaa aaaaaatgaa 110701 agaagttagt aaactagtta ttctaaacat tcagaattta tgttccaagc tagctgagca 110761 attatccagt ccacagctat gttggcaaat atttaatagt ttacatggta ctttcatgtg 110821 tatcacttta actctgcaag gctcctactt actttgtaag ttaattaatg atgtaggtga 110881 ttaagagttc cattttctat aggagaaaaa agataagtaa cttgctcaag gtcagaatga 110941 gaaggaaaaa aaagtggaac ttggacccaa gtactctttc cattacactt tcttctgaaa 111001 gagtaaccat ttcatgtgta gagtccaaaa aacaatggta atgaggtaca agaagagaca 111061 atgttggctg ggcacggtgg ctcccacctg taatcccagc actttgggag gctgaggtgg 111121 gtggatcact tgaggtcagg agtttgagac cagcctggcc aacatggtga aactccgtct 111181 ctactgaaaa aaaaaaaata aataataaat gaaccaggcg tggtggcaca tgtctgtaat 111241 cccagctact cgggaggctg aggtgggaga attgcttgaa cctgggaggc agaggttgca 111301 gtgagccaag atcatgccac agcactctag cctaggggac agagtgagac ttcacctcaa 111361 aaaaaaaaaa agccaatcta tctttttcac taacagatga aaaaacatga cctaatatac 111421 aatatttcaa tcatgtataa taaacagatt agtaagcatt atcaacaaga gaacaaatta 111481 aatgcatatt taaactggtc cccttttatg agtattaggg gtcttagata agaatgtaag 111541 aaaggctggg catggtggct cacacccata atcccagcac tttgggaggc tgaggcgggc 111601 agatcacaag gtcaggagac ggagaccagt ctgatcaaca tggtgaaacc ctgactcttg 111661 cactccagcc tgggtgacag agcaagattc cgtctcaaaa ataaaaagag gctgggcgtg 111721 gtggctcacg cctgtaatcc cagcactttg agaggctgag gcaggtcgat cacctgaggt 111781 ccgaggtccg gagtttgaga ccagcctgcc caacatagtg aaactccatc tctgctaaaa 111841 atacaaaaaa ttagccgggc atggtggcgg gtgcctgtga tcccagctac ctgggaggct 111901 gaggcagtag aatcgcttga acccaggagg cggaggttgc agtgagccaa gatcacacca 111961 ttgcactcta gcctgggcaa caagagcaaa actccgtctc aaaaaaaaaa aaaaaaaaaa 112021 aaaaaaaaat cagctgggtg cagtggctca cgcctgtaat cctagcactt tgaaaagctg 112081 aggcaggtag tttgcttgag ctagggagtt caagaccagc ctggggaata tggcgaaacc 112141 cggtctctac aaaaaggaaa aaaaaaaaaa aattaaccaa aaaatttttt aagaaaaata 112201 tcaccacact agaataaaat gtaaaacggc aacagtagcg tatgaataaa ctctaaccaa 112261 tatatctact tggttcaata atctgtacat taaaacacac atgtaaactt actggtgtca 112321 gccttgctac ttccggatgt tccacataaa aatttttctc aaacttgggg agctcactca 112381 aatcccactt ttttttacgc aaacgctccc caggattacc aaatttcttc gggggaaggc 112441 caccaccacc tcttgctcca aatctaggaa cagaattttt agttagttca tcagtatttt 112501 taaagcaacg tatcgtacat gccataagaa tattcaaaaa ggatggagtt ctagggatac 112561 taccagccta cctccaaaaa gaccatttta tttgctgtga tttaaaggta tggagaaatg 112621 tgtaaaacct cccttactac ccaacaataa atactgttta tgaatatagc tacaataggc 112681 aaaaaattaa aaatcccttt attgagagaa atacatccta gggattacaa tcacagattc 112741 tggatccaga catggtttga gtttttattc tgccacttac tgtataattt gaagagaatt 112801 atttaacttc cacacacctt agtttcctat gcaaagtaga aataatacca cttatttcag 112861 aggctgctga gaaaatgacc taataatgtg taatgtacta taaaagtatc tggcatagcc 112921 agcactcagt acaagttagc atcatttcat tcaacaaata gttattaggt accaagttac 112981 taggtaccaa gtatgtatcg gctactgttt gaagtggtgc tgtgctggag ttactaaaca 113041 aaacagacac aagtctctat ttttgtgtaa cagacatgca ggcaaaggat gacattaagt 113101 taagtttcaa ttaaagggaa gattgtggac aagaaggaag atacactcca agcaagtccc 113161 tcaatcttag aatagcttct tgggcctgct cccctattaa actgtgataa gcactccacc 113221 tgtctgctta tctgcaaagc aaaactgggt cttggaaagg actggtagac tggtcttcat 113281 ttttagctta cgctaataaa agagctgagc ctgccggggg cagtggctca gtcctgtaat 113341 cccagcactt tgggaggcgg aggccagtgg atcacttgag gtcaggagtt tgagaccagc 113401 ctggccaata tagtgaaacc ctgtctctta ctaaaaacac aaaagagtag ccaggcgtgg 113461 tggtgcacgc ctgtaatccc tagctacttg ggaggctgag gcaggagaac cgctggaact 113521 cggaaggcgg aggttgcagt gagctgaaat ggcaccatag cactccagcc tggtagacag 113581 agggagggtc cgtcaaaaaa aaaaaaaaaa agctgaacat agatttatga cctttccttc 113641 ttctttgatc tctactttta ccttctatct ccttcttgtt cccaaatggt attattaatt 113701 tacagagaaa aaaacactaa caagacttat ttaaggcttt aacactcaac taattactta 113761 agatcaactt cttgcttttt atataccttt cacaacagac aagctcaaga ggaatacaaa 113821 tatgaccact agccaaagct aatatttaag aaatcagaag ctacttttcc tagaagaatc 113881 ttaacactaa gatggcctta attttcccta tttcgatctt gtaaagtaca ggtctttgtt 113941 tggcatacaa actgtatagc attttctcag tttctttact ccttgtaaaa tattgtactg 114001 aatttagttc ccttttccca ctctaaccca cttgtaactt catcttatat aaagttccac 114061 tttatcattg ataggcaagt aggtgtatta ttctacaggg cctcaatttc ctgcaattca 114121 aagccattag aactctggct taaccagcca gacattcctc taaatacata cgtaacaaaa 114181 attgatggca ggagatttta ttgggatcac tggataccta cactcaacag gcagaacttg 114241 gctaattcca tttataccac tactgagaca gtgaagacaa attaagaatc agaacaggaa 114301 gagcacatag catagggata cccgtgtcag cgacatattt aatatgcccc tcagcttaag 114361 actcaatact gaacagacca aaaggccacc tcaattgccg ctcccgccac acttgcaaat 114421 gaagcagcta aataaggtac ttttcaagag gtagaaagtc tttggattat cactcttgca 114481 aattataaaa gacttcattg ttattgccac attgcaatgc tacccaaacc ttagtgactt 114541 aggaaaaaat aaaacttgaa agtaagattc ctgttaaggc tttaaactga tgattatcat 114601 tcatgtattt ttttttcctc tctccttact tccctggcta tttattcaag acattctatt 114661 ctacactaaa catttaattt gaaacatgtg gttcttggaa aatatgccgt cttccatgtt 114721 tataattaat gctgacataa ttaatgacct caaaattcaa gaaagccttt tacttttgag 114781 catatccatg ccatctttaa atacgcacac tgtactctct ggtatactat gctgctcaaa 114841 tgtttttatc cggtcagtaa ttagtttaat ttggctttac aaaaaaattc acctttgaag 114901 tcatatatta acattaaaaa ccatactact tcaaatgtac aatgcctatc atttttgcat 114961 cacacatgtg aaatacatga actgacctca cctattcctt tttcaaaata accaccactt 115021 caactgtgta acactcagtt gaaacaacag caattcaaat aatcaagaac atttcttggg 115081 aaagggagag ttggggcaca gatcttatga aagaaggcta gttcgtttga aatttttaaa 115141 aaatgtcatc tgatactcaa agtatggatc agtaattcac ttttttcctt tcaaataact 115201 tattaaagca tatatatggt gaaaggaaat attaaaccaa acaccaatgg taaagaaata 115261 gaacactatt agtaacttgt agcccctcta tgtgcctatt tcaagcttac aactttcacc 115321 ctaataacca ctaccttgaa ttttgttaac cactcccttt cctatcatat ttgcacatat 115381 ccttaattaa atgtgtcacc ctaccacaac gtgcttttta actcaacact tctgtgactt 115441 atccacatta atccaagttc ttttctcttt ttcacggctg attcaattgt acgaataccc 115501 acaatttatg gagacatttg cgttgtttcc aatatcctgt tagcacgaat gctggtatat 115561 aaacttttct gtacaaggat cctggggtac ctgtgcaagg atttctctag gcattacagt 115621 ctagggtata aagcttaggg aggaattgct gggtcgtttt caactttcct agataatctc 115681 aagttctttt tctaagtcaa tgaactgaaa ttcacttcta aacttagcaa tactgtcaca 115741 cgcgaagcaa acattccacc tctcatcctc taaacaatga gataaaatat tttccttcct 115801 aataaggtat aaatcaaaat aattttgtaa aaagtggcaa ctgaagtgct tgagactagt 115861 aaatccagca gttgtggatc tgaaccacaa aagacaaaaa cgtttggaga aaatatcgtt 115921 aacagagcgc ctactacagt gagactatta catccattat ctcttaattc ctgacaacac 115981 agcaaagtaa aggcaattat cacgttcctc agaggaaaca ggctcacaaa aggtaggatc 116041 ttgaccaagg tcacacacac acatatcaag tggcgtcacg taactctttg gggaagcggg 116101 ggggtcgggg gagacggagt ttcgctcttg ccacgggctg gagtgcaatg gcgcgatctc 116161 ggctcactgc aacctcctcc ccccgggttc aagcgattct cctgccttgg cctcccgagt 116221 agctgggatt acaggcatgc gccaccaagc caggctaatt ttgttatttt tagtagaaac 116281 gggatttctc catgttgatc aggctggtct cgaactcctg acctccggtg atccgcccgc 116341 ctcggcctcc caaatcgctg ggattacagg cctgaggcac cacgcccggt ccacaatacc 116401 aagaactttc tagcgaggca gaatagttga cgctgcagtc caattagaga aaaaaggctg 116461 aaatattaag attaaaacta aagtaacgac ccaaaaaccc atccttcccc caaacacggt 116521 catttagatg gcaagcaact ccactgcttt acatcccaat gcatttcctc cgacttaaaa 116581 tataactgaa gagaattaaa atctatttct aaaaatgaga agttggtctt ttcgtctccc 116641 gtgccttaaa cagtgaactc tggggagaga acgtcaaggg tgccatttcg tgtaaggctt 116701 tcctgggctg aagtgttctc tcaggaagat ccgcgttttt cagatgaacg ccgaggcctg 116761 gagacatcga acagcccgcc cgaagcggcc cggctcgaga gccgggaaac caggcgaggc 116821 gccaaagccc gggcctgggc tgatgcggcc aggccgcccc tcccgatccc ccgcggggct 116881 gggatggggc cgggccgcgc cacgacggcc gtccgcacgg agaggcccag cgtcgccaag 116941 cggccgccct cctcgggtcc cgagtgaccc cggggccgag gtccgcgcac gaaaccgctg 117001 tctcgcctcg agtcgcctcc cctcgcctcc cgggtcgaga gcaaggctcc caggctcgca 117061 gtccgccggg cctccccaag aagcaactcc cccaccccca cgccaggcct ctcctctcct 117121 cccccaccct taccctccac ggtcacgatc ccggtcccgg tccccaaagc ctcctccgcg 117181 catggtccca aaaggataga gatctgggag cggggcacgg atggccgggc tcgggagggc 117241 ctgcggctcc ggtctggtga cgaccgatgg cggcggcgcc tccgctgttg gggcggcggc 117301 aggcgcagcg ctctctcgct ccgacgcgct gtctcccgtc gcagacgcca ccgtcgccgc 117361 ctctctcgtc ggagacggga gcaaaacaca gagaatcggg gctacaaagc cggtgggcag 117421 gtttggctac gctcaaaccg ggcagtgccg cggtttaggc gtctccttcc ttcccagcga 117481 ctgcacaaaa tggcggccgc cgctgagtcg gctccaactt aacgctgcgt acggcggaga 117541 cgatacgtaa acagcgcctc ggaatagccc gagggctccg cggggaacta cgggtattct 117601 tggaagcact ttacgtcagt tgcgacgatg ggtgtatttt gcggtctgta caagaaaaac 117661 aaagtacgga gcgagcgtag agttctcgcg ccacccgaac tagaacgtgg gatgaggtgg 117721 gaggaagcgc ctctggtgcg acgtaacatt ctttgcgtcg ttagccggcc tgctgttgct 117781 aagatgcgca gtatgcgtaa cgcattctgt ttagagaata gaggattttg cctgattccg 117841 atctcggtac agaccccgcg gccggcttga agaagttgct ttgcagagac aataaccgtc 117901 ctccatattg ttatcaagtt tcagagtcct ctggcgccag aggccgggag ggtcttaaat 117961 tccgtggctg gccttcctat atggcgccgc cagccctctc gccccaccgc cctgggcccg 118021 ctgagaatgg cggcgggccc cggcgccatt tcctttccgt tgcctttcgc tgctctgtct 118081 ccggcagcca tgatggaaga gccgtgtccc tttctttccg gagtttgtga ggtggcgatg 118141 tcccttccag gttaaagtgc cctcagctac atttggttca cactgaggtg cattttctga 118201 accgtatatt tttaccgtcc ctggccgagc acagctgttg ctaatacaga aacaaaatgt 118261 gcttggaggc caggcatggt ggctcacgct tgtaatacta acactttggg aggccgaggc 118321 gggaggatca cctgagtccc ggagttcgag accagcctgg ccaaaatggc aaaaccccgt 118381 ctctacgaaa aatacaaaaa ttagcaggac atggtggcgg gtgcctgtag tcccagctac 118441 ttgggaggct gaggcaggag aatcgcttga acccgggagt cggaggtcgc agtgagccga 118501 gaacgtgcca ctgcactcca tcctggacga cggggcaaga atccgtctca aaaaaaaaat 118561 tgcttggagc cccagtggac tccgtgggcc ctttgctgcc gttgaacggc gctcctcaga 118621 aggcccggag aagaccccag cactctcagg agaacaaggt ttcagagcga tggtctttag 118681 acacagaaag ggcgaaggct aaggacaagt gtttaaccag gctagtccac gttctccgct 118741 caattccttc cagactggct tccctctcct tacctgcttc ctgtcgtctc aaactgcatt 118801 ttgaagcccc gcaagcttgg ccaaatggcg tcacacgact gttgagctgt tagggccctc 118861 gagacgggcc catctaacgt ctcaattcaa cagatgggta gaagcgggga gatttggccc 118921 ctcagtgtgg gtgctgagtg tttgtttccc aaatttttgg gaacatagca taagaaaagg 118981 agcttcaaaa agaaaaaaaa catggaccac tgatttttct gtggttatgt gttctttttc 119041 cccaagtacg ttcttaacat tggcaagtga acctcttaaa gtgattgaga ctcgcctggg 119101 tcatattctt tttctttttt ttgtctggag tgcagtggca caatctcggc tcactgcaac 119161 ctctgccgcc cgggttcagg caattctcct gcctcagcct cccgtgtagc tgggatcaca 119221 ggtgcttgcc ctcgcacttg gctaatattt gtagtttagc tggggtttca ccatcttggc 119281 caggctggtc ttgaactcct gacctcgtga tccacccgtg tcgacctgcc aaagtgctgg 119341 gattacaggc gtgagccacc gcaccggcct gggtcatgtt cttttattta cactgttaac 119401 ctgccccaat cctcatcatc tcttgcctgg atgactgcgc tcgcctcttt ttaatctgcc 119461 tgctcctgtc cttgctcact ctgaatttat tcttcacact gcagtcatag gggtcctttt 119521 attatgtcac tttacttaaa attttccaga ggtttccact tgaaagaaaa gacaaaggcc 119581 ttacagtggc ctacaagccc ttgaggatgc agccctcttc tgtctttcct ccttgttcta 119641 ctctagccac actcggcctc agggcctttg cactcgctct ttactctatg tcagcaaggc 119701 tagatacctc agtttaggtc tgtgctcaca tgtcactcaa tcagggaggc ctccccagaa 119761 cccaatgcaa cactgcataa cctgcatact ctcctccttc acaagtacac atgcattctc 119821 tcttttccat ttttctttgc tttcatttga attagtaact gcttgacata aacacttttt 119881 gtctccctcc tagagcgtaa aatctgtgag tgcaggaact tacttgttca ccaccgtgac 119941 ccagtgccta cctggcacag tgcctgacct tttagccatt cgaccacgcc atacccttgt 120001 ccagtgagtc tgtgactgtc cctttgtgtt tctgggccaa ctgacatctc aacggctttt 120061 gttttttgag acagagtctc gctctgtcgc ccaggctgga gtgcagttgg ctcactgcaa 120121 cctccgcctc ttggtttcaa gcgattctcc tgcctcagct tcctgagtag ttgggattac 120181 aggcatacgc caccacacct ggctaatttt tgcattttta atagagacag ggtttcacca 120241 tgttggccag gctggtctga aactcctgaa ttcaggtgat ctgcctgctt cggcctccca 120301 aagtgctggg attacaggcg tgagccacct tgcctggcct ttgtcatcaa cttttatttt 120361 aattctgggg tacatgttca ggatatacaa gtttgttaca caagaaaacg tgtgtcatgg 120421 tggtttgttg cacagatcaa ccccttgcct cggtattagg cccagcatcc gttagctatt 120481 cttctcgatg ctctccctcc gccgaccccc aacaggcccc agtatgtgtt gttcccccgc 120541 tccatgtgtc cgtgtcctct tctgggtctt tgatagtctt cactgatgct ttctgctcac 120601 gctcagataa tctgcatttt cacgtaagat aaaataatca tagagcagag gcagcagtca 120661 gctatgtgtt tgtttcaggc gagcagaggg atgacttttt agttttgtcc tttgtcctgt 120721 gcttgtgatg gtaaaccatc aatgtacatt gtcggggtga aattcaacgg aactgtttta 120781 gtgtaaagat cttggaggcc cacaaggaat ttctttgtgg gcaaattgtg agaggcctat 120841 gtagcttttt ttgttttttg ttttttttta gagacagagt ctcactctgt cgcccaggct 120901 ggagtgcaat ggcgcgatct tggttcactg caacttctgc ctcctgggtt caagcaattc 120961 tcctgcctca gccttctgag tggctgggat tacaagcatg cgacatgacg ctaattttta 121021 tattttttag tagagactgg cttttgccat tattagccag gctggttttg aactcctgac 121081 ctcaagtgat ccacccacct tggcctccca aagtgctagg attacatggc caccgcatct 121141 ggccttcatt atatattacg gtgtaataat aatataaata ggccgggcct aatagaaata 121201 gtctggctaa catgatgaaa ccccgtctct cctaaaaata caaaaaaatt agctgggcgt 121261 gatggcgcat gcctgtagct actcaggagg ctgaggcagg agaatcgctt gaattgggga 121321 ggtggaggtt gtagtcttaa aaatggtatt tacaggctgg gcacagtggc tgacgcctgt 121381 aatctcagca ctttgcgggg ctgaggtggg cagatcactt gaggtcagga actcgagatc 121441 agcctggcca acatggtgaa accccgtctc tactaaaaat acaaaaaatt agccaggtgt 121501 gatggtgggc gcctgtaatc acagctactc gagagcctga ggcaggagaa tggcttgaac 121561 ccaggaggcg gaggttgcag tgacccaaga tggcaccatt gcactccaac ctgagcgaca 121621 gagcaagact ctgtgtctaa aaagacagaa aaaagagaag tttcctctag agcgctgaat 121681 gacaagagac aatgtggaga gagaagtctc atacttacaa gtatatcctg ccaagacccc 121741 agatatacaa gtgaggccat cccagaccct tcagccagcc tactagctga atgtagctgc 121801 gtgagtgagc tatttccttc ctccccgcaa aatcccttat ctttttcttc atgttttcat 121861 cttaggatat gaaaaggaaa ctgctgtagt tgttttttaa ccatgaaggg aacttttgga 121921 gaacagaacc agcacaccga aggtggggaa agtggagaga aggaaagaac ctgggccctt 121981 gatgacatta tagagtcact gtattagcca actgtggagc tggttgactt ctggacttct 122041 tatttgaaat gacatgcttc ctcattgttt aatctacttt tttgagcttt gtgttacttg 122101 tagctgaaag cattctaact gatacactcc tcaaccagct cttctaatac aagtacaaaa 122161 tctgaaacaa gtcaggcatg gtggcatgtg cctatagtcc cagctacttg ggaggctgag 122221 gcgggagggt cgcttgagcc caggcatttg aggttgcatt gagctatgat tgtgccactg 122281 cactccagcc tgggtgacag actgagaccc catctgtaaa acaaacaaac agaaaatctg 122341 aaacaccttg catagtatac aataggcatt caataagttt atcattaatg aggaagtaaa 122401 agaagatttt gttgtaaaga atgctaattg cttatccaac aaccattctc cctttctccc 122461 ttttcaatta agagtcagga ggctggcgct gtggctcaca cctgtaatcc cagcactttg 122521 ggaggccgag gcgggaggat cccttgagcc caggagttca agatcagcct aggtaacata 122581 gtgaaatctc agctctacaa ataataaaaa aattagctgg gtgtggtggt gtgtgtgtgt 122641 ggttccagat actttccata tacgtggaag gctgaggtgg gagcgtcact tgagcccagg 122701 tgcttgaggc tgcagtgagc tatgcttgcc ccactgcact ccagcctggg tgacagagag 122761 agaccctgta acctctggaa aaaaaaattg agcgagagag agaagacagg gtcttgttct 122821 gtcacctggg ctggagtgca gtgacacgat cacagctcac tgccatcttg aactcctagg 122881 ctcaagcgat cctcacgcct cagctgctga gtacctaggt ctacaggtgc tcatcaccat 122941 gcctggctcc ttttcctcct taataacaga actccaaatt ttagctgggc atattgatgt 123001 ctggaacaaa agtctgtatt ttcctacctc ttttgtgcat gggtatggcc aatgagatat 123061 aagcacaagt gttgtatggt agggctgctt tacaggagct gagtcagcta gacagagctt 123121 ttccttcttt ttttttcttt ttgcccttct gccttttatc tttctttctt tctttgagac 123181 agggtctcac tctgtcaccc aggctggagt gcagtggcac aatcttggct cactgcaacc 123241 tctgcctccc gggttcaaga gattcacccg cctcaacctc ctgagtaact gggagtacag 123301 gcgcatgcca ccacacccgg ataatttttg tatttttagt agagatgaga tttcaccagg 123361 ttggccaggc tggtctcgaa ctcctgtcct aaagtgatcc acctgcctca acctccctaa 123421 gtgctgggat tacaggtgtg agccactgca cccagactgc cttttatctt tcttctctca 123481 acttggtttc tctgactaga gctccagctg ccaacttagg caatgagctg accttgaaga 123541 cggaagtcaa aaattaagat ggtcaagcag aaagatagag gcctgcgggc ctgatgacaa 123601 agtggaccca ctgtagtagc atctctggac tgcctagctt tagatttctt ttctatgaga 123661 gaaaaataaa cctgtatttt gtttgggaca cttattttgg gggttttccc ttatattcag 123721 tcaagtaaaa tctaagctat ccctttaaca acttccctct caaaatattg aagcctggcg 123781 taggaaagca cgtgaagagt atgaataatg agagctgtcc cacccgagtg ttaagcaagc 123841 ccctcctggc ttccctttct ccagtccctg aaggcggcac ttgagaggca ctctgttgtt 123901 accttggttg gagttctaga tggcaagtga tagaatctaa ctggctaact aaggcaaagg 123961 gaaacttatt agaaagacat taggggctga gcgtggtgac tcatgcctgt aatcccagtg 124021 ctttgggagg ccaaggcggg tggatcacct gaggtcagga gttcaagacc agcctagcca 124081 acatggcaaa accccatctc tattaaaaat acaaaaatta gctgggtggg gttgtgcaca 124141 cctgtaatcc cagctactcg ggaggctgag acaggagaat cgcttgaacc caggaggtgg 124201 aggttgcagt gagccaagat cacaccactg cactccaggc tgggtgacag agctagagtc 124261 tgtctcaaaa aaaaaaaaaa ggacattagg gttcattgaa ttagcagaag actgcagaaa 124321 cagcctgggc atggatggga gtctgagcca ctcagcagtc taggagccgg aaaccccagt 124381 ctagcagcag gaacagcctg gtcaggcctc tgccactgct gctaggccct gctgcccctg 124441 ctccccctgc tgtcactgca ggtgtgagca gcctctaacc cctgctgtca tccttgtgac 124501 tgtcacttca tattcaatgt ctcaggatgc actacctgaa ttggggagag gtgactcttt 124561 gaaagaaaac aagtgttatt atgatggaga aatgcaaact gtgcaaccag aaaatgtgaa 124621 acatttgccc taacaagcca gggtcagttg aaggagtcct gactagccac acttggtgtc 124681 taaggcccaa tccttctcct caccatgatg aaagatatcc cttgtatcta atctctttct 124741 ctttctctct ctctttcttt tttcttttct tttttctgag acggagtttc actcttgttg 124801 cccaggctgg agtgcagtgg caccatctcg gctcactgca acctccgact atccggttca 124861 agcgattttc ttgcctcagc ctcctgagta gctgggatta caggtgcgca ccaccaaacc 124921 cgactaattt ttttgttggg gtttcgccat gttggccagg ctggccttga actcctgacc 124981 tcaggtgatc // LOCUS HS445C9 131398 bp DNA PRI 02-DEC-1997 DEFINITION Human DNA sequence from BAC 445C9 on chromosome 22q12.1. Contains CRYBB1, beta B1 crystallin, CRYBA4, beta A4 crystallin, high mobility group-1 protein (HMG-1), ESTs. ACCESSION Z95115 NID g2661905 KEYWORDS 22q12.1; beta A4-crystallin; beta B1-crystallin; CRYBA4; CRYBB1; high mobility group protein 1; nonhistone protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 131398) AUTHORS Bridgeman,A. TITLE Direct Submission JOURNAL Submitted (02-DEC-1997) E-mail enquires: humquery@sanger.ac.uk Clone requests: clonerequest@sanger.ac.uk COMMENT IMPORTANT: This sequence is not the entire insert of clone 445C9. It may be shorter because we only sequence overlapping sections once, or longer because we arrange for a small overlap between neighbouring submissions. This sequence was generated from part of bacterial clone contigs of human chromosome 22, constructed by the Sanger Centre chromosome 22 mapping group. Further information can be found at http://www.sanger.ac.uk/HGP/Chr22/ This sequence has been finished according to sequence map criteria as follows. An attempt is made to resolve all sequencing problems, such as compressions and repeats, but not necessarily within known annotated human repeat sequence elements (e.g. Alu). Where the sequence is ambiguous, there is an annotation using the 'unsure' feature key. The true left end of clone 445C9 is at 1 in this sequence. The true right end of clone 1048E9 is at 22657. The true left end of clone 373H7 is at 131295. 445C9 is from the human BAC library described in U-J. Kim et al. (1996) Genomics 34, 213-218. VECTOR: pBeloBAC11. FEATURES Location/Qualifiers source 1..131398 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="22" /map="22q12.1" /clone="445C9" prim_transcript <1..121 /note="match: 5' EST AA299245" repeat_region 15..75 /note="MER30 repeat: matches 197..144 of consensus" variation 392..394 /note="clone 1048E9; ACA in this entry; substitution" /replace="ata" variation 463..465 /note="clone 1048E9; AGT in this entry; substitution" /replace="act" variation 975..977 /note="clone 1048E9; CTA in this entry; substitution" /replace="cca" prim_transcript complement(<1134..3898) /note="match: 5' EST T08106" repeat_region 1896..2154 /note="AluJo repeat: matches 1..299 of consensus" repeat_region 2262..2557 /note="AluSq repeat: matches 1..295 of consensus" repeat_region 3187..3481 /note="AluSx repeat: matches 302..1 of consensus" variation 3188..3199 /note="clone 1048E9; TTATTTATTTAT in this entry; substitution" /replace="tcagggatggag" variation 3200..3211 /note="clone 1048E9; TTATTCAGGGAT in this entry; substitution" /replace="tctcgctctgtt" variation 3212..3218 /note="clone 1048E9; GGAGTCT in this entry; substitution" /replace="gcccagt" variation 3217..3220 /note="clone 1048E9; CTCG in this entry; insertion" /replace="cg" variation 3222..3227 /note="clone 1048E9; TCTGTT in this entry; substitution" /replace="tggagt" variation 3229..3233 /note="clone 1048E9; CCCAG in this entry; substitution" /replace="cagtg" variation 3234..3241 /note="clone 1048E9; GCTGGAGT in this entry; insertion" /replace="gcgt" repeat_region 4894..5117 /note="AluJo repeat: matches 85..302 of consensus; incomplete repeat" variation 5097..5101 /note="clone 1048E9; CCAAA in this entry; insertion" /replace="ccaa" repeat_region 5310..5589 /note="AluSx repeat: matches 2..298 of consensus" repeat_region 5695..5907 /note="MIR repeat: matches 3..262 of consensus" repeat_region 6043..6331 /note="AluSc repeat: matches 2..299 of consensus" repeat_region 7042..7341 /note="AluSq repeat: matches 1..301 of consensus" variation 7209..7211 /note="clone 1048E9; ACC in this entry; substitution" /replace="agc" prim_transcript complement(7558..>9917) /note="match: 5' EST AA001171 clone 362209" repeat_region 8502..8951 /note="MLT2B repeat: matches 444..1 of consensus" variation 8702..8704 /note="clone 1048E9; CAG in this entry; substitution" /replace="cgg" variation 8812..8820 /note="clone 1048E9; GAATTGAGG in this entry; insertion" /replace="gagg" variation 9400..9402 /note="clone 1048E9; CCG in this entry; substitution" /replace="ctg" repeat_region 9484..9588 /note="MIR repeat: matches 180..73 of consensus" prim_transcript <10042..10533 /note="match: multiple ESTs; match: AA308338 R60841" variation 10471..10473 /note="clone 1048E9; GGA in this entry; substitution" /replace="gaa" repeat_region 10687..10755 /note="MIR repeat: matches 68..143 of consensus" prim_transcript <10752..11138 /note="match: 5' EST R13001 clone 27575" repeat_region 11022..11094 /note="MIR2 repeat: matches 60..126 of consensus" variation 11251..11253 /note="clone 1048E9; CAT in this entry; substitution" /replace="cgt" repeat_region 11442..11958 /note="MER42c repeat: matches 854..1395 of consensus" prim_transcript <11740..>12036 /note="match: 5' EST AA281860 clone IMAGE:712412" variation 11796..11798 /note="clone 1048E9; CAA in this entry; substitution" /replace="cga" repeat_region 12007..12211 /note="MIR repeat: matches 30..220 of consensus" repeat_region 12250..12397 /note="AluSg repeat: matches 298..152 of consensus; incomplete repeat" repeat_region 12404..12476 /note="AluSp repeat: matches 200..120 of consensus; incomplete repeat" repeat_region 12481..12774 /note="AluSg repeat: matches 299..6 of consensus" variation 12494..12496 /note="clone 1048E9; TTC in this entry; substitution" /replace="tcc" variation 12786..12788 /note="clone 1048E9; TTA in this entry; substitution" /replace="taa" variation 12791..12793 /note="clone 1048E9; AAC in this entry; substitution" /replace="acc" variation 12919..12921 /note="clone 1048E9; CAT in this entry; substitution" /replace="cgt" variation 13048..13050 /note="clone 1048E9; AGA in this entry; substitution" /replace="aca" repeat_region 13914..14307 /note="MLT1F repeat: matches 534..111 of consensus" repeat_region 14309..14609 /note="AluSx repeat: matches 301..1 of consensus" variation 14325..14329 /note="clone 1048E9; TTTGA in this entry; insertion" /replace="ttttatttattga" repeat_region 14642..14938 /note="AluSc repeat: matches 297..2 of consensus" variation 14658..14661 /note="clone 1048E9; TTGA in this entry; deletion" /replace="ttga" repeat_region 15150..15248 /note="MLT1F repeat: matches 93..1 of consensus" repeat_region 15469..15769 /note="AluJo repeat: matches 302..1 of consensus" repeat_region 15936..16125 /note="MER42B repeat: matches 1108..1300 of consensus" variation 16144..16146 /note="clone 1048E9; TTA in this entry; substitution" /replace="taa" repeat_region 16206..16504 /note="AluSx repeat: matches 1..302 of consensus" variation 16336..16338 /note="clone 1048E9; AAA in this entry; substitution" /replace="aca" variation 16499..16509 /note="clone 1048E9; AAAAAAAAATC in this entry; insertion" /replace="aatc" repeat_region 16579..16873 /note="AluSp repeat: matches 297..2 of consensus" variation 16666..16668 /note="clone 1048E9; CCC in this entry; substitution" /replace="ctc" repeat_region 17290..17616 /note="AluJb repeat: matches 299..2 of consensus" variation 17354..17359 /note="clone 1048E9; CACAGT in this entry; insertion" /replace="cagt" variation 17465..17467 /note="clone 1048E9; TTT in this entry; substitution" /replace="tct" variation 17544..17546 /note="clone 1048E9; CTC in this entry; substitution" /replace="ccc" repeat_region 17622..17929 /note="AluSx repeat: matches 302..2 of consensus" variation 17938..17941 /note="clone 1048E9; AATT in this entry; deletion" /replace="aattt" repeat_region 17951..18082 /note="AluSx repeat: matches 133..2 of consensus; incomplete repeat" repeat_region 18087..18165 /note="AluJb repeat: matches 138..61 of consensus; incomplete repeat" variation 18223..18225 /note="clone 1048E9; ACG in this entry; substitution" /replace="atg" variation 18404..18409 /note="clone 1048E9; GGTTTT in this entry; insertion" /replace="ggtt" repeat_region 18407..18687 /note="AluJb repeat: matches 302..21 of consensus; incomplete repeat" variation 18553..18555 /note="clone 1048E9; AAG in this entry; substitution" /replace="atg" variation 18578..18581 /note="clone 1048E9; TCTT in this entry; substitution" /replace="ttct" repeat_region 18748..19047 /note="AluSq repeat: matches 1..302 of consensus" variation 18919..18921 /note="clone 1048E9; ACT in this entry; substitution" /replace="agt" repeat_region 19060..19118 /note="MLT1G repeat: matches 454..512 of consensus" prim_transcript <19459..19853 /note="match: multiple ESTs; match: R34314 AA397904 F02137 F01475 F02797 F02698" repeat_region 19649..19766 /note="MIR repeat: matches 149..30 of consensus" prim_transcript complement(<20156..21000) /note="match: multiple ESTs; match: R34195 AA394098" repeat_region 20267..20485 /note="AluJo repeat: matches 2..239 of consensus; incomplete repeat" repeat_region 20581..20676 /note="AluJo repeat: matches 207..302 of consensus; incomplete repeat" repeat_region 21052..21350 /note="AluSx repeat: matches 301..1 of consensus" variation 21114..21116 /note="clone 1048E9; ATG in this entry; substitution" /replace="acg" repeat_region 21779..21976 /note="AluJo repeat: matches 86..284 of consensus; incomplete repeat" repeat_region 22695..22981 /note="AluSx repeat: matches 302..1 of consensus" prim_transcript complement(<23216..23584) /note="match: multiple ESTs; match: R45973 H94111" repeat_region 24269..24557 /note="AluSx repeat: matches 6..293 of consensus" repeat_region 25085..25219 /note="MIR repeat: matches 66..200 of consensus" repeat_region 26169..26468 /note="AluSx repeat: matches 1..301 of consensus" repeat_region 26489..26594 /note="MER21B repeat: matches 716..616 of consensus" repeat_region 26595..26897 /note="AluSx repeat: matches 296..1 of consensus" repeat_region 26898..27008 /note="MER21B repeat: matches 628..529 of consensus" repeat_region 27191..27257 /note="AluJo repeat: matches 298..232 of consensus; incomplete repeat" repeat_region 27270..27328 /note="AluY repeat: matches 146..90 of consensus; incomplete repeat" repeat_region 27593..27625 /note="11 copies of 3 mer 88 % conserved" repeat_region 27635..27700 /note="FAM repeat: matches 71..4 of consensus" repeat_region 28008..28114 /note="AluSx repeat: matches 193..297 of consensus; incomplete repeat" repeat_region 28196..28497 /note="AluSq repeat: matches 1..303 of consensus" repeat_region 29121..29401 /note="MER21B repeat: matches 76..346 of consensus" repeat_region 29406..29576 /note="AluJb repeat: matches 87..257 of consensus; incomplete repeat" repeat_region 29748..30031 /note="AluSg repeat: matches 299..13 of consensus" repeat_region 30321..30470 /note="MIR2 repeat: matches 146..1 of consensus" repeat_region 30601..30907 /note="AluJb repeat: matches 1..302 of consensus" repeat_region 31062..31141 /note="MER33 repeat: matches 271..190 of consensus" repeat_region 31142..31442 /note="AluJb repeat: matches 300..1 of consensus" repeat_region 31443..31637 /note="MER33 repeat: matches 196..1 of consensus" repeat_region 31593..31676 /note="MER3 repeat: matches 92..7 of consensus" repeat_region 32974..33265 /note="AluJo repeat: matches 11..302 of consensus" repeat_region 34245..34537 /note="AluJb repeat: matches 1..296 of consensus" repeat_region 34710..34925 /note="MIR repeat: matches 13..237 of consensus" repeat_region 34926..35220 /note="AluSx repeat: matches 295..1 of consensus" repeat_region 35348..35462 /note="L1ME3 repeat: matches 580..457 of consensus" repeat_region 35387..35462 /note="L1ME1 repeat: matches 528..457 of consensus" repeat_region 36008..36310 /note="AluSx repeat: matches 1..302 of consensus" repeat_region 36969..37255 /note="AluJo repeat: matches 21..301 of consensus; incomplete repeat" repeat_region 37279..37481 /note="MIR repeat: matches 47..250 of consensus" repeat_region 37543..37734 /note="MIR repeat: matches 211..16 of consensus" repeat_region 37786..38055 /note="AluJb repeat: matches 1..296 of consensus" repeat_region 38110..38221 /note="MIR repeat: matches 140..29 of consensus" prim_transcript complement(<38257..>39186) /note="match: multiple ESTs; match: AA459614 AA316165 AA463081 W21315 AA304841; match:AA051791 AA168531 AA563154 AA615526 AA277588" repeat_region 39358..39477 /note="AluJb repeat: matches 133..11 of consensus; incomplete repeat" repeat_region 39485..39726 /note="AluY repeat: matches 56..298 of consensus; incomplete repeat" repeat_region 39727..39876 /note="AluJo repeat: matches 256..101 of consensus; incomplete repeat" repeat_region 39877..39944 /note="AluJb repeat: matches 68..1 of consensus; incomplete repeat" repeat_region 40489..40601 /note="MIR repeat: matches 72..192 of consensus" repeat_region 40897..41074 /note="AluSx repeat: matches 174..1 of consensus; incomplete repeat" repeat_region 41527..41647 /note="MIR repeat: matches 34..154 of consensus" repeat_region 42718..42782 /note="MIR repeat: matches 154..89 of consensus" repeat_region 43014..43045 /note="MIR repeat: matches 145..114 of consensus" repeat_region 43266..43564 /note="AluJb repeat: matches 1..302 of consensus" repeat_region 44079..44246 /note="MER3 repeat: matches 209..40 of consensus" repeat_region 44464..44507 /note="MER3 repeat: matches 44..1 of consensus" repeat_region 44554..44867 /note="AluJo repeat: matches 300..1 of consensus" repeat_region 45130..45299 /note="AluJo repeat: matches 132..301 of consensus; incomplete repeat" repeat_region 45442..45561 /note="MIR repeat: matches 34..154 of consensus" repeat_region 45562..45662 /note="MER46 repeat: matches 234..134 of consensus" repeat_region 45667..45796 /note="FLAM_C repeat: matches 5..130 of consensus" repeat_region 46085..46142 /note="MER46 repeat: matches 115..58 of consensus" repeat_region 46150..46449 /note="AluJb repeat: matches 2..296 of consensus" repeat_region 46450..46507 /note="MER46 repeat: matches 57..1 of consensus" repeat_region 46680..46795 /note="MIR repeat: matches 153..34 of consensus" repeat_region 46846..47143 /note="AluSg repeat: matches 2..299 of consensus" repeat_region 47220..47527 /note="AluY repeat: matches 4..301 of consensus" repeat_region 47640..47762 /note="MIR2 repeat: matches 146..21 of consensus" repeat_region 48481..48773 /note="AluJb repeat: matches 2..302 of consensus" repeat_region 48781..49009 /note="MIR repeat: matches 262..21 of consensus" repeat_region 49147..49248 /note="MIR repeat: matches 50..151 of consensus" repeat_region 49355..49645 /note="AluJo repeat: matches 302..1 of consensus" repeat_region 49809..49969 /note="MIR repeat: matches 188..28 of consensus" prim_transcript <49837..50087 /note="match: EST T12361" repeat_region 50247..50548 /note="AluSq repeat: matches 2..303 of consensus" repeat_region 50838..51124 /note="AluJo repeat: matches 302..2 of consensus" repeat_region 51140..51474 /note="MER2 repeat: matches 1..343 of consensus" repeat_region 51482..51660 /note="MIR repeat: matches 188..14 of consensus" repeat_region 52047..52094 /note="MIR repeat: matches 99..146 of consensus" repeat_region 52132..52417 /note="AluSx repeat: matches 9..294 of consensus" repeat_region 52482..52606 /note="MER5A repeat: matches 188..58 of consensus" repeat_region 52698..52771 /note="MADE1 repeat: matches 79..5 of consensus" repeat_region 52884..53006 /note="MIR repeat: matches 69..182 of consensus" repeat_region 53725..54017 /note="AluSq repeat: matches 1..292 of consensus" repeat_region 54406..54520 /note="FLAM_A repeat: matches 1..109 of consensus" repeat_region 54787..54892 /note="MIR repeat: matches 87..195 of consensus" repeat_region 55233..55521 /note="AluJo repeat: matches 1..289 of consensus" repeat_region 55587..55634 /note="24 copies of 2 mer 98 % conserved" repeat_region 55642..55908 /note="AluSx repeat: matches 265..1 of consensus; incomplete repeat" repeat_region 56489..56550 /note="L1MB8 repeat: matches 920..859 of consensus" repeat_region 56553..56853 /note="AluSq repeat: matches 1..303 of consensus" repeat_region 56925..57176 /note="L1MB8 repeat: matches 850..594 of consensus" repeat_region 57775..57890 /note="MIR repeat: matches 79..195 of consensus" prim_transcript <57896..>59092 /note="match: multiple ESTs; match: R68475 AA275567 H50065 AA150131 AA220281; match: N71219 W02810 C75489 AA101746 AA101747; match: AA524370 AA315282 AA132394 AA602941; match: AA029435 AA370554 AA152359 AA360952; match: W53706 AA083771 N72984 AA459233 AA054783; match: AA384714 AA356536 T64628 AA197248 R79294; match: AA600212 M62207 AA130703 AA312820 AA111473; match: AA311571 W32423 AA083026 AA384004 W40326; match: AA315731 AA204770 H96457 AA049323 AA045606; match: H99805 AA432138 AA050755 AA564621 AA316546; match: W02982 AA054719 AA206991 AA130788 AA557452; match: AA313759 AA133897 W85536 AA269429 W79255; match: AA576844 D57151 AA120232 W33201 AA312711; match: N24382 H99812 AA526214 R72803 AA607172; match: AA319424 AA313809 AA565522 AA117745; match: AA577106 AA026549 T90564 AA541647 AA507781; match: W07682 AA188911 N38949 W73923 AA546572; match: AA209397 AA146973 C02724 AA118599 W66758; match: AA586855 F22618 D52874 N75400 AA304303; match: AA123232 AA330028 W15510 D83849 AA363018; match: AA612057 N80559 W58023 AA311991 N93269; match: N26696 AA342484 AA196344 AA357094 AA467444" mRNA 57994..58629 /note="match: X12597; P09429 (HMG-1); high mobility group-1 protein" repeat_region 59671..59734 /note="MIR repeat: matches 80..143 of consensus" repeat_region 60090..60371 /note="AluSx repeat: matches 302..20 of consensus; incomplete repeat" repeat_region 60435..60550 /note="MER20 repeat: matches 102..218 of consensus" repeat_region 60620..60751 /note="MIR2 repeat: matches 145..14 of consensus" repeat_region 60797..61096 /note="AluY repeat: matches 301..1 of consensus" repeat_region 62155..62273 /note="MIR repeat: matches 205..82 of consensus" repeat_region 62336..62416 /note="MIR repeat: matches 66..146 of consensus" repeat_region 62606..62656 /note="MIR repeat: matches 115..65 of consensus" repeat_region 63118..63343 /note="MER33 repeat: matches 324..65 of consensus" repeat_region 63483..63583 /note="MER3 repeat: matches 154..54 of consensus" repeat_region 63603..63738 /note="AluSx repeat: matches 1..136 of consensus; incomplete repeat" repeat_region 63739..64039 /note="AluSc repeat: matches 1..299 of consensus" repeat_region 64058..64239 /note="AluSx repeat: matches 121..302 of consensus; incomplete repeat" repeat_region 64339..64558 /note="MER20 repeat: matches 216..1 of consensus" repeat_region 65146..65783 /note="L1ME3A repeat: matches 2..667 of consensus" repeat_region 66330..66629 /note="AluY repeat: matches 301..1 of consensus" repeat_region 66707..66954 /note="L1ME3A repeat: matches 662..906 of consensus" repeat_region 67006..67129 /note="AluJo repeat: matches 174..298 of consensus; incomplete repeat" repeat_region 67135..67438 /note="AluSx repeat: matches 1..302 of consensus" repeat_region 67439..67554 /note="29 copies of 4 mer 89 % conserved" prim_transcript <69398..69960 /note="match: multiple ESTs; match: N79214 N79207" prim_transcript 70266..>71001 /note="match: multiple ESTs; match: AA504725 AA504628 N62546 H55551 N62539" repeat_region 70404..70495 /note="MIR repeat: matches 46..144 of consensus" repeat_region 70584..70797 /note="AluJb repeat: matches 213..1 of consensus; incomplete repeat" prim_transcript <71156..>71371 /note="match: EST AA526627 clone IMAGE:980592" repeat_region 71634..71685 /note="MIR repeat: matches 93..144 of consensus" repeat_region 71779..71912 /note="MIR repeat: matches 156..16 of consensus" repeat_region 72991..73110 /note="MIR2 repeat: matches 20..144 of consensus" repeat_region 73181..73482 /note="AluSg repeat: matches 1..300 of consensus" repeat_region 73484..73523 /note="8 copies of 5 mer 88 % conserved" prim_transcript <74020..>74354 /note="match: 3' EST T03270 clone FB8A11" repeat_region 74156..74281 /note="FLAM_C repeat: matches 2..126 of consensus" repeat_region 74282..74582 /note="AluY repeat: matches 1..301 of consensus" repeat_region 74588..74751 /note="AluJo repeat: matches 131..302 of consensus; incomplete repeat" repeat_region 74812..75115 /note="AluSg repeat: matches 298..1 of consensus" repeat_region 75120..75420 /note="AluSx repeat: matches 302..1 of consensus" repeat_region 75571..75623 /note="AluSx repeat: matches 91..39 of consensus; incomplete repeat" repeat_region 75627..75740 /note="MIR repeat: matches 192..76 of consensus" repeat_region 75784..75924 /note="MER5A repeat: matches 6..152 of consensus" repeat_region 76227..76290 /note="MIR repeat: matches 77..140 of consensus" repeat_region 76317..76622 /note="AluJb repeat: matches 302..1 of consensus" repeat_region 77434..77718 /note="AluJo repeat: matches 5..302 of consensus" repeat_region 78466..78535 /note="35 copies of 2 mer 84 % conserved" repeat_region 79064..79176 /note="MIR repeat: matches 88..217 of consensus" repeat_region 79507..79600 /note="MER5A repeat: matches 10..106 of consensus" repeat_region 79612..79913 /note="AluSq repeat: matches 2..303 of consensus" repeat_region 80452..80597 /note="AluSg repeat: matches 155..300 of consensus; incomplete repeat" repeat_region 80619..80922 /note="AluSx repeat: matches 1..302 of consensus" repeat_region 81041..81318 /note="AluSg repeat: matches 2..292 of consensus" repeat_region 81621..81710 /note="MIR repeat: matches 144..55 of consensus" repeat_region 81986..82150 /note="AluJo repeat: matches 137..302 of consensus; incomplete repeat" repeat_region 82177..82260 /note="MIR repeat: matches 48..131 of consensus" repeat_region 82260..82355 /note="MIR repeat: matches 161..258 of consensus" repeat_region 82456..82567 /note="MIR repeat: matches 228..110 of consensus" repeat_region 83235..83528 /note="AluY repeat: matches 1..301 of consensus" repeat_region 83752..84014 /note="AluJo repeat: matches 273..1 of consensus; incomplete repeat" repeat_region 84824..85105 /note="AluSx repeat: matches 279..1 of consensus; incomplete repeat" repeat_region 85191..85278 /note="L1MD2 repeat: matches 265..351 of consensus" repeat_region 85279..85408 /note="FLAM_C repeat: matches 3..131 of consensus" repeat_region 85435..86027 /note="L1MB8 repeat: matches 341..920 of consensus" repeat_region 86563..86674 /note="MIR repeat: matches 37..146 of consensus" repeat_region 88087..88210 /note="MER20 repeat: matches 39..159 of consensus" repeat_region 88343..88426 /note="MIR repeat: matches 97..175 of consensus" repeat_region 88557..88688 /note="AluSx repeat: matches 1..134 of consensus; incomplete repeat" repeat_region 88690..88949 /note="AluSx repeat: matches 36..299 of consensus; incomplete repeat" repeat_region 88950..89224 /note="AluSg repeat: matches 24..298 of consensus; incomplete repeat" repeat_region 89227..89354 /note="AluSx repeat: matches 136..263 of consensus; incomplete repeat" repeat_region 89357..89394 /note="19 copies of 2 mer 100 % conserved" repeat_region 89396..89437 /note="MIR repeat: matches 127..86 of consensus" repeat_region 90822..90940 /note="AluJb repeat: matches 1..119 of consensus; incomplete repeat" repeat_region 90990..91291 /note="AluSx repeat: matches 302..1 of consensus" repeat_region 91298..91498 /note="MIR repeat: matches 252..47 of consensus" repeat_region 92072..92176 /note="AluSx repeat: matches 198..302 of consensus; incomplete repeat" repeat_region 92761..93062 /note="AluSp repeat: matches 1..300 of consensus" repeat_region 93110..93241 /note="AluJb repeat: matches 132..1 of consensus; incomplete repeat" repeat_region 93242..93537 /note="AluJo repeat: matches 297..1 of consensus" repeat_region 93565..93862 /note="AluSx repeat: matches 302..1 of consensus" repeat_region 95348..95648 /note="AluSx repeat: matches 302..1 of consensus" repeat_region 95844..96149 /note="AluJb repeat: matches 301..1 of consensus" repeat_region 96366..96470 /note="MIR repeat: matches 20..132 of consensus" gene complement(96956..113785) /gene="CRYBB1" CDS complement(join(96956..97139,99345..99487,105355..105487, 109538..109656,113606..113785)) /gene="CRYBB1" /codon_start=1 /product="CRYBB1" /db_xref="PID:e1202388" /db_xref="PID:g2661906" /translation="MSQAAKASASATVAVNPGPDTKGKGAPPAGTSPSPGTTLAPTTV PITSAKAAELPPGNYRLVVFELENFQGRRAEFSGECSNLADRGFDRVRSIIVSAGPWV AFEQSNFRGEMFILEKGEYPRWNTWSSSYRSDRLMSFRPIKMDAQEHKISLFEGANFK GNTIEIQGDDAPSLWVYGFSDRVGSVKVSSGTWVGYQYPGYRGYQYLLEPGDFRHWNE WGAFQPQMQSLRRLRDKQWHLEGSFPVLATEPPK" repeat_region 97260..97444 /note="MIR repeat: matches 59..237 of consensus" repeat_region 97586..97880 /note="AluY repeat: matches 1..294 of consensus" repeat_region 97911..98151 /note="MIR repeat: matches 9..253 of consensus" repeat_region 98430..98509 /note="MIR2 repeat: matches 31..111 of consensus" repeat_region 99650..99759 /note="MIR repeat: matches 137..250 of consensus" repeat_region 99768..100069 /note="AluJo repeat: matches 1..295 of consensus" repeat_region 100107..100164 /note="MIR repeat: matches 120..63 of consensus" repeat_region 100308..100610 /note="AluY repeat: matches 1..300 of consensus" repeat_region 100618..100712 /note="MIR repeat: matches 55..148 of consensus" repeat_region 100715..101012 /note="AluSx repeat: matches 2..301 of consensus" repeat_region 101089..101407 /note="AluSg repeat: matches 1..298 of consensus" repeat_region 101650..101787 /note="MIR repeat: matches 47..186 of consensus" repeat_region 102768..102843 /note="MIR2 repeat: matches 32..106 of consensus" repeat_region 102858..103157 /note="AluSx repeat: matches 1..302 of consensus" repeat_region 103378..103595 /note="MER30 repeat: matches 230..1 of consensus" repeat_region 103607..103736 /note="MER5B repeat: matches 175..47 of consensus" repeat_region 103740..103959 /note="MER33 repeat: matches 226..5 of consensus" repeat_region 104487..104786 /note="AluSx repeat: matches 302..2 of consensus" repeat_region 104877..105045 /note="FRAM repeat: matches 1..166 of consensus" repeat_region 105621..105765 /note="MIR repeat: matches 45..203 of consensus" repeat_region 106840..107043 /note="51 copies of 4 mer 80 % conserved" repeat_region 107213..107307 /note="MIR repeat: matches 12..110 of consensus" repeat_region 107670..107720 /note="MIR repeat: matches 134..84 of consensus" repeat_region 108201..108359 /note="MIR repeat: matches 200..26 of consensus" repeat_region 108586..108830 /note="MIR repeat: matches 2..262 of consensus" repeat_region 108932..109239 /note="AluYb8 repeat: matches 308..1 of consensus" repeat_region 109296..109483 /note="MIR repeat: matches 73..262 of consensus" repeat_region 109834..110118 /note="AluJb repeat: matches 301..1 of consensus" repeat_region 110790..110835 /note="L1ME3A repeat: matches 363..406 of consensus" repeat_region 110856..111006 /note="MER11A repeat: matches 2..151 of consensus" repeat_region 111005..111629 /note="MER11A repeat: matches 60..738 of consensus" repeat_region 111314..111929 /note="MER11B repeat: matches 2..635 of consensus" repeat_region 112176..112272 /note="AluSg repeat: matches 1..114 of consensus; incomplete repeat" repeat_region 112981..113279 /note="AluSq repeat: matches 302..1 of consensus" repeat_region 114177..114384 /note="MIR repeat: matches 242..18 of consensus" repeat_region 114406..114496 /note="MIR2 repeat: matches 58..146 of consensus" repeat_region 116011..116159 /note="MIR repeat: matches 248..79 of consensus" repeat_region 116823..116916 /note="MIR repeat: matches 144..53 of consensus" repeat_region 117760..117981 /note="MIR repeat: matches 232..16 of consensus" repeat_region 118029..118328 /note="AluSx repeat: matches 302..1 of consensus" repeat_region 119032..119134 /note="MIR repeat: matches 191..73 of consensus" gene 120063..127953 /gene="CRYBA4" CDS join(120063..120101,120700..120818,122947..123088, 125754..125896,127806..127953) /gene="CRYBA4" /codon_start=1 /product="CRYBA4" /db_xref="PID:e1202389" /db_xref="PID:g2661907" /translation="MTLQCTKSAGPWKMVVWDEDGFQGRRHEFTAECPSVLELGFETV RSLKVLSGAWVGFEHAGFQGQQYILERGEYPSWDAWGGNTAYPAERLTSFRPAACANH RDSRLTIFEQENFLGKKGELSDDYPSLQAMGWEGNEVGSFHVHSGAWVCSQFPGYRGF QYVLECDHHSGDYKHFREWGSHAPTFQVQSIRRIQQ" repeat_region 121232..121259 /note="14 copies of 2 mer 89 % conserved" repeat_region 121338..121539 /note="MER3 repeat: matches 1..204 of consensus" repeat_region 121576..121604 /note="MER3 repeat: matches 176..204 of consensus" repeat_region 121638..121666 /note="MER3 repeat: matches 176..204 of consensus" repeat_region 121671..121798 /note="2 copies of 64 mer 93 % conserved" repeat_region 121822..121860 /note="MER3 repeat: matches 165..204 of consensus" repeat_region 122649..122848 /note="MER20 repeat: matches 216..15 of consensus" repeat_region 123181..123267 /note="MIR repeat: matches 69..159 of consensus" repeat_region 123522..123577 /note="MIR repeat: matches 165..107 of consensus" repeat_region 123586..123889 /note="AluSx repeat: matches 1..302 of consensus" repeat_region 124019..124249 /note="MER46 repeat: matches 3..233 of consensus" repeat_region 124346..124510 /note="L1MD2 repeat: matches 677..508 of consensus" repeat_region 124578..124716 /note="MIR repeat: matches 216..67 of consensus" repeat_region 125113..125413 /note="AluSp repeat: matches 303..1 of consensus" repeat_region 125995..126035 /note="MIR repeat: matches 147..107 of consensus" repeat_region 126926..127220 /note="AluSx repeat: matches 1..302 of consensus" repeat_region 127321..127470 /note="MIR2 repeat: matches 145..1 of consensus" repeat_region 128134..128430 /note="L1 repeat: matches 3244..2930 of consensus" repeat_region 128680..128973 /note="AluSx repeat: matches 302..1 of consensus" repeat_region 129110..129192 /note="AluJo repeat: matches 299..217 of consensus; incomplete repeat" repeat_region 129439..129905 /note="LTR7 repeat: matches 450..1 of consensus" repeat_region 130726..131021 /note="AluSx repeat: matches 298..1 of consensus" BASE COUNT 35318 a 31579 c 31197 g 33304 t ORIGIN 1 aagcttgtcc aacccgcagc ccaggacggc tctgaatacg gcctgtttac gaacacaaat 61 ttgtaaactt tcttagtgtg acggtagtaa agtccagccc acagaagttg gttttggggt 121 tgtcactatg tttttctcat actggttaag acttgtcatt ttttctctaa aagtatctag 181 acatctggct actttgggaa aatcagagga tcaggcaata ctgagcccac atcctcataa 241 gcaacaatca cctagagctg aataaagatt taccttctag acagactaca gcctcacctc 301 tgtgccacag tccccaccac tctttattgt ctagcacctg gctcatctta cttgcttctg 361 tggtgtgcct ggcacctgta ggcttctgag aacacctgac tcacagtggt caatcccaca 421 tttatcaaat aactctttca tgaaaaggtc agtgacatga agagtaacat gtctctttgc 481 agcatcctgc tttcccccat aaattcaaac aggaaaagag taatacttca atcaagagag 541 ctggcccttt ctatcactgg ggaaaaggga cacacagctg ccagcagatg gctccctgct 601 gcacttggga tgaaacctga gcacctaaaa atggctgtca tggctgtctt gtggaccacc 661 cacccctctg aatgcacttc cttcccctct gctttgtgta ccgccctccc accgcaaggg 721 cttcctttat ggtttcttaa ccaccaagct cagagcctct cagtgtcctc tggcttgtca 781 gctcaccctt ctgccaagtc tcaactgaaa tgtttccttt cccaaaaggt ttccctgacc 841 agcctattta taaaagtagg tcaacaccac ctcccaaccc ccagccttat tttctgtcta 901 ggcagctcct ggcaccttct gccatttcaa ctgccacttc cacaccacag ctgcttgttt 961 actctctgtc caccctagcc ctctcatcat ggtgcaccta gccttcagcc cagtacctgg 1021 catacggtac atgtacaata aacatctgag gactaacaga atgaagggat aaatgaacag 1081 ccattagaaa tggacaggac accatccacc catgtctcct gacttccagt tacctcttca 1141 gcttcttcct ctgagtcaac cacagggaag tcttgcatgg actgagtggt gcgctcggat 1201 ccataagccc ccacagcacc ttttcccttt ctctgcttgg cttcaattgg gttaatgata 1261 cctgaggaat ttcagcaaaa aaaaaagaga gtcttacgat cacagcaatt catgtatgtg 1321 aatcagccac tcagtgaaat gacagtttct tgctccttcc acaaaataac atgctactca 1381 ggaaggagag ctattctctg ccctcagaac actactcttg tcccttgtga tttgaaactt 1441 atgagcattt atgctaccac agaacagtct gctgactagt ttatttatat cccaaataga 1501 tttccatttc cacttggatt gtttagtgga caggaagaaa acagaacact acatttataa 1561 ctgagaactc cagcacacta aattctatgc ttcacttctt tgggagcctc tattagggaa 1621 aatgcaataa ttccaatact atactgattc tgactgaagt gaagctgcaa tttaatataa 1681 gactcctcac atattttact taggagttcc tatctcctgt gtttaacttt tttataacaa 1741 gtttaagccc tggaaataaa cgttgttttt ctcataaaag agtttacgaa taggaccaca 1801 acatttcaat gaagacgttc ccctgactcc aaattttcca tccctgcccc ccttccattt 1861 agttccactg aagtcttaaa taagagctat ccctgggcca gatgtggtgg ctatgcttgt 1921 aatcctagca ctttgggagg ccaaggtctt gaaccagcct gggcaacata gtaagacccc 1981 atctacaaaa aaagaaaaaa ttagccaggc acagcagagt gtgcctgtgg tcccagctac 2041 ttggcagggt tgagcccagg aggttgaggc tgcagtgagc tatgatcatg ccatgccatt 2101 gcactctacc ctaggtaaca gagtgagact ttgtatcaaa aaaaagcaaa aaaactgtcc 2161 cctcattctc attcagtgta tctttactca agttcatgta tttttcatgt acctaatgtt 2221 ttcaacacat aacctggatt tcactttaat aattatttag aggccaggcg cagtggctca 2281 cacctataat cccagcactt tgggaggccg aggaggatgg atcacttgag gtcaggagtt 2341 cgagaccagc ctggccaaca cagtaaaacc ctgtctctac caaaaataca aaaattagct 2401 gggcgtggtg gcgggtgcct gtaatccctg ctactcggga ggctgaggca ggagaatcac 2461 ttgaacccag gaggtggaag ttgcagatga gttgagatcg tgccactgca ctccagtctg 2521 ggtgacaaga gtgaaattct gtctcaaaaa ataataatgt tattattatt atttagagac 2581 tggaatttga atatgtagga tacatttgat tatatctact tggcaatcat gcataggaaa 2641 tagctcaaga agagggggaa atgttcctgt aggaagaatc caactgtttg gtttgagagg 2701 ctttggtctc aagaaggtaa gagtgtgaca ggggctgtgt ctcaatgggg ctggccatga 2761 gaagacagga ggagactggg ctactagagg aagatggaag taaccaatgc aggatttgct 2821 cagccagttt tatagccata tggctctagc tattacaaaa actatgcacg atgttttact 2881 gatattcctg aaggacaaat agggttattt ccacaaaatt aaaacccttg tgagtaggga 2941 tggcaaactc aaatgactct gggaccaaga aaagttcctg tgggttgggg tggaagacaa 3001 cagggggtaa tggagactgc ctccaagaga aggccatgtc catcgtccca ggagcagttc 3061 tcacttgcct caactaacta ctgccaggca aaatctagaa ttttaggtga aattgtctgt 3121 tttttaaata ttagcaacta attcattttt gtttttattt attctattta tttatttatt 3181 tatttattta tttatttatt tattcaggga tggagtctcg ctctgttgcc caggctggag 3241 tgatctcggc tcactgcaac ccccacttcc tgggttcaag cgattctcct gcctcagcct 3301 cctgagtagc tgggattaca ggcacatgcc tccacaccca actgattttt gtatttttag 3361 tagagacggg gtttcaccat gttcatcagg ctggtcttga actcctgacc tcaagcgatc 3421 caccacccgc cttggcctcc caaagtgctg ggattacagg tatgagccac cacacccagc 3481 ctcatttttg tttttagaag cactggtcag gaccagaaaa aaagcctgtt ttcaacctgt 3541 ttgcagcctg ttttagaccc taaccctttc cttgctgttg aaacaattac ttatttaact 3601 tgagctttga taaagcaagg tagcttagga acgcaaatgt cttgtgctgg caaaccaggc 3661 tggagagata aagtgctttt aaaagggaaa agcaaacctc ctctcccaac gcccctctcc 3721 ttccctatgc ctgccctcct ccatcatgca agacgacaaa ggtgcaatac cttgtgcatt 3781 cttcccgagg ccccgtccag ggacgtagcc catcttctga agaagcttct gtccaattcc 3841 ttttgtgtgt ctttcccagc tgccgaagtc catgaaagat ttggttcctc ctgcaaaacc 3901 tttctggctg ggcttaaaat tgccaccctg caaaaaagaa aaatcaaagc ttagttaatg 3961 gaaaaacaaa atgaagaaaa atctaagaaa gcttggccaa caggaaaatg gcctaagtta 4021 tacagcacta tgaaaagtgt catatgaacc aaaaatgtgt gtgataacgt gtcaggaagg 4081 aagagactgc actacacact tcacaactca cagaaacctg ccaccaaaag gagacactca 4141 cgaaattcat tccagatgag tttctccctg gctagaaaat gcttgggcca caatgacctt 4201 tgaaggatat tcctagggtc acaatcctgc aatagagact accgttttta gcttccttgg 4261 tccaaaatcc ttaggaaagt cgtcctgctt aacaggtttc tcttcgtcat cagaatcttc 4321 caactctgcc tcctccgctg cccctttctt gagccctgcg ctgatgaagt tgactggcgc 4381 agagtagtca cgggccctgt aggcacagaa acaaagcccc accaggtcgg taaagaactc 4441 ttttctctgc agcgatcctt ttgcagaaac tacgtgcttg aagactcctt agagtgaatc 4501 tcatgggagg ttttttaaaa aagtaacaca ttttttaatg tgtcaaaaaa tccaaaggac 4561 cccagatttg gcttcctgta gtcacattta acattttacc tgacatatcc ctctacggct 4621 gaagagttat cacatcaatg tagggttgat tacaaattca aaatatgatg tgataataca 4681 attttcctac attttaaaaa attaatttat cataaaaagc aattgttttg aaaactattt 4741 caagtacaaa aatccaacag gagcagaaat tttgatggaa aatcataaat aaaacccatt 4801 gaaagattat atgaatgtaa agcatgtaac aattatatta atcaaactta gaaaaagctc 4861 ctagccatta aatttatttg aatgtgaaaa tttccagcca gggcaacatg gtgagacttc 4921 gtctctataa aaaaaaaata aataaaaatt agccaggcat ggttgtgcaa gcctgtggtc 4981 ctaactacat gggaagctga gatgggagga ttgcttgagc ccaggaagtt gaagctgcgg 5041 tgaactataa tcatgtcact gcaccccagc ctgggcaaaa gagtgagacc ttatctccaa 5101 aaaaaaaaaa aaaagaaaag aaaattacac taaagtcccc taagaaataa gaaattaact 5161 ctataaaatg aaatatattg atttaaactc tgactgcttc aagtacttac attaagccat 5221 taacctcagg tcttttttgt ataaaataaa attgatatat ttgtactgtg tggtttaaaa 5281 aagaaaaaat tacaataaaa aaccctggag ccaggtacag cggctcatgc ccataatctc 5341 agcactttgg gaggcttgag gccaggagtt caagaccagc tgggccaaca tagcaaaatc 5401 ctatctctaa taaaaataca aaaaactagc tgagtgtggt ggcacacgcc tgtaatccca 5461 cgccactcag gaggctgacg catgagaaac gcttgaaccc aggagacaga gggtgcagtg 5521 agctaaggtc acgccactgc acactccagc ttgggtaata gagactgtgt ttcaaagaaa 5581 aaaagaaaac ctgggaatac agcaaataaa gggatgcctt gtaaaatgac caacctagta 5641 attcaacacc ctcagggcaa caacagaatt tagggagttt tttcctaaaa ttgtaatata 5701 acttagtgat ttagatccca aactctagag tcagataggt ccaggttcca attctagctt 5761 agcctctgcc tagttgtatg accattagcc tcagtctctt aatctgtaaa atggggacaa 5821 taatggtacc tttctcataa ggaaaattaa tccaggttta tgaaatagcc aggacaagta 5881 agcagccaag agatgttagc tattattatt atcttatgat tatgcaattc ttgttccctg 5941 atctatttgt catccacttt cagacttcct tgtggatgag aagtttggac taatctctat 6001 actctttctc ccccagcctc ctttgaaaaa ggcacattct cagctgggtg tggtggctca 6061 cgcctgtaat cccagcactt tcggaggccg aggcaggcgg atgacaaggt caagagatcc 6121 tggccaacat ggtgaaaccc cgtctctact aaaaatacaa aaattagctg ggtgtgatgg 6181 cacatgcctg tagtcccagc tactcgggag gctgaggcag gagaatcact tgaatccagg 6241 aggcagaggt tgcagtgagc caagattgca ccactgtact ccagcctggc gacagagcaa 6301 gacttcatct caaaaaaaaa aaaaagaaaa agaaaaaggc acactctcaa taggatagag 6361 cctagaggcc acaggtgagg ccatggagga aaaaaccccc aacctggcac accctggttg 6421 agtgaggttc agccagggca cacataacct tgtgctgggg agaagacgtg cagcctgaat 6481 cctcattgcc caccactatc ctagaccagt cctccactac ctgtgcaggg accactggct 6541 tggcctcctc acagtcttcc actgtggctc catgcctatt gagcacacca gcaaaagggt 6601 cttctcaaaa cataacccag ccccacatca caccctctct atcacacaca ctatgctacc 6661 cttccccctc tctgctccag cccccaagcc tttcatatcc ttgtggatac cgtgatgctt 6721 cccatcaagg gcctttgcat attgggacct cttctaccac cctagttacc ttctactgtc 6781 agtcatatca gcttaagggt cacttttctc cacagaaatc ttccttgacc tctcagaata 6841 gatgaaatgc cctaaaatgt gttctccctc atgtcctcat gtccctcatg tcctgcatca 6901 cagtcacaag tttgatcgtc tgagttgtta atgcctattt ccatcttcta atgtctgggc 6961 tctaggaagg caagaatggg tctatagcac aggcctaata catagcaatc gctcaataaa 7021 cactgataat taaatcagtg aggccaggcg tggtggctca cgcctgtaat cccagcactt 7081 tgggaggctg aggtgggtgg atcacttgag gtcaggagtt caagaccagc ctggccaaca 7141 tggcaaaacc ccgtctctac taataataca aacattagct ggaggtggtg gcgtttgctt 7201 gtaatcccac ctactcggga ggctgaggca cgggaatcac ttaaacctgg aaggtggagg 7261 ttgcagctag ccaagagcac gtcactgcac tccaacctgg gcaacaaagt gaaaccgtgt 7321 ctcaaaaaaa ataataataa atagaataaa tcagtgaaag agcagcactc tagcctaaag 7381 aggaaaccag acctctcgag atgatgtcac agctaacagt ctcaacaagg ccaggtgaga 7441 agtatactta tcagggtatc tcctccactc cctgtccatc ttccccctct tccctaaagc 7501 aaggtgaagc agcccccatg ccaggcaata ccgtttgcct ccaaagctgg gcctctcatc 7561 atccgagtct cgctctgccc acaccccgta ggtggcttct tccttggtct gccagtggcg 7621 ctgtcggttg gggttgaact cattctggag atcccagtca gtgatctcaa agttctcccg 7681 ctcgtcatca tcatcatcaa tgcggccttc cccatcccgg tataagtggg acaatgacat 7741 ggccagtcac taaaggaaga gacaaaaaga gcacaggccg taagtcctgg gggcagcagt 7801 cgaaggattc cttgtgagtc tctattcctt gctgctttca gaactttctc caaattcaaa 7861 gtccattaaa gtcagcatta ctattcaata aaaaagatat taaaaaccta cttttagata 7921 acaatcagta ctaaaacagg tgaacaaaat actactccta acagtctatg attctttaaa 7981 agtacatatc tccttgtcct agtcaatatc ataaatatca cacatattca cacatacaca 8041 aaccacacaa aaataactaa gtaggttatt ttgtacatta ttaccttaga aaatgagtag 8101 gtatcttctt agatcccaga ttcttttttc tattactgag gaaagaatta tatctggatc 8161 cttggtggtc ctaggagagt gatttttttt agaatatgct ttttgaaata cattacttga 8221 tgaaacaaaa ttttcagttt tcctcacaaa acatgaattt cttctgatga actgaaagac 8281 tgtagtaatt atgaacacta ttttaatagt cagatgtcca gagattgata tagagaatct 8341 acctcacagt agctgggtga ccagattcag ttgtctttct taaattggga ttaactacca 8401 taccaacctt agagaaggct tacaaagcaa attcaatgag ttcaaaataa gtgctcaaga 8461 aacgatagct tctactttag tcccatgatt aggttaatca ttgtattagt cagggttctc 8521 tagagcgaca taactaatgg aatagacata catatataaa ggggagctta ttaactatta 8581 actcacacga tcaaaaggtc ccacaatagg ctgtctgcag gctgaggagc aaggagagcc 8641 agtcaagttc caaagctgaa gaacttgaag tccaatgttc gagggcagga agcatccagc 8701 acaggagaaa gatataggct gggaggctag gccagtctct cttttcacat ttttctgcct 8761 gcttatattc ttgccgtgct ggcagctgat tagattatgc ccacacagat tgaattgagg 8821 gtgggtctgc ctttccccac ccacagactc aaatgttaat ctcctttggc aacacccaca 8881 tagtcacacc caggatcaat acttcgtatc cttcaatcca atcaagttga cactcagtat 8941 ttaccatcac aatcatcatt attattatgg gactagggag taaacttagc aaagtgactg 9001 taaaatcagg caggctaaaa tcgaagcctg agttcaaatt caactctatc agatacccag 9061 tgtatgagaa ttaaatatgc acgtgaagcg ccaagcatgg tgcactacac aataatacac 9121 tccataggtg acagcaattg tgattcttgt cttgctgaaa gcggccttaa ggcacacgga 9181 gttcactact tccacaggca gtgtcaaact ctggtcacct ctggcaaaac gttatctctg 9241 ccttcttcta ctcactggtg caccggttgg gtgcaacggg tcccaacctc ttccaggagt 9301 gtttaggaca gaaggatttt cagaaaagac ggggaatctc aaatacataa agttaaataa 9361 taccactgcc tgcgtgggca tagtaaatgg taataagccc cgagtagtca caagttatga 9421 agcctttatc agtagatgct aggtacatgc tggtcgcttg agaccgatta tcaactgtct 9481 gcatcaagac acttctgcaa ggtaggtacc gttagttttc atgcttcaga taagaaaaag 9541 gaggttcaga gaagttagta acttacccta ggttccactg ctcatagggc tcaaactcag 9601 gagtgctaga agtcaggcct cgggcttgtc tctctcttgc tgtctaggtt tggtgaagga 9661 agctcaggtt cttaaatgaa tccaagccct aaatcccagt tgaccccaag cagcaccgcc 9721 cctctccgtg cctcggcaga gctgtctacg aaatggattc ccctccccac accccaacgg 9781 cttggaccga agctcgcctt ccctgagcct tcgtccctac tctcactggc gctgagtccc 9841 cttgtggcca gccgtgctca cctgtccgag ggtccagcgt accaaattca gcttcaccat 9901 ccgcgcgaga agacgccgct cctacaccag aacccggaag caccgtggcc ggcgcgccgg 9961 aaatgacgtt agtcgacagc ggatgtcctg ctggctgcca gggcagcgtg ggacgctacg 10021 gcggatatgg ctgcagagcg gccggctggg atcttagata ggaggggtgg atttgcaagg 10081 cctagaatag ctggggagtg gtttccccgc ggaatcggcc tccctgccgc tcctgctttg 10141 tactgtgacg ctcagcctgt gatgactggt gtggaatccg ctgagccacc ttggcctaag 10201 gagactttac cactctgaga ttgtaaatct gtaaaataga gatgtaggat tagcccatac 10261 ggtagttgtg gtaaatactg tgagacaata aggggcctgg gacacagcat tcaaatggga 10321 ataatgaagg tcaagactgt gattcctgta tctttgacgc tctcggtata agcaccgtcg 10381 tgggcacagg gcagtggcct ttatgcagga gtttaagagg gaatgaagga atgaatgggc 10441 aaactctgga gttcccaagt attctctcca ggagctgttt ccattctttt cgtttccagc 10501 aggttggtaa attcattaat ttattcattg atctaattaa aatatactaa gtgcccctca 10561 cctgtgctag gccaatgtga tacaatgagc agaacagtca tgggccctcc ctgggaagcc 10621 ctcactagcc caaggactcc ttgtagacat ttaagtgtcc acaggctctg gagttccaac 10681 cttgagtgca atttagcagc tgtggacctt gggcaagtca ttacatctaa gcctgttttc 10741 tcttctgcaa aatggttaag gattcaataa gataaaactg taggcaatga aaaccgtacc 10801 tggtaacagt aggtgctgaa gaagtgttag ctattaattt ttgcttaatt tttctctctc 10861 tgctctatgt gatgaaaaga ttcaagaggc aattgttgga atgtaaaaag agcacgggac 10921 ttggagtcaa atacttaagt ctaccatcaa gtagttgtta agaattaaac aacaattttt 10981 gtgtacccag ttaaatgtgg gctgcttagg aatgatgact gtgtcttaat gatctctgta 11041 ttcttagtga catgtagaat cattgtgcct gacacatagt atgtactcag gaaagaaatg 11101 gaaaatgtgg ttttagcatt gaaggccggg agagagggtc taacagacta caagccctgc 11161 caggagcaga gtaagggaaa cagaggagaa aagtgttttt agtctgtgcc tgaatgtatt 11221 tacatctgtt tgtagcccaa aagccaaaag catacatacg cttggctttt ctgtagctat 11281 gtttatggct ttacagcaga ttttatggag ctgcaattac tttgatcatg agggactgat 11341 gctagtggat ttacttcacc aaatggaact cactttgtgg cttctgaaga agggaccttt 11401 gtggactgtc atggagtagt taagagtgca ggctctgatt tagtgatcag agtctgcatt 11461 gtcaggaatg ggacaaagtg aagttatgtg gcacttgata ggatgccctg agaagattgc 11521 aacatcaccc ctgtgatatt cctgctgaag atccataacc tggatgtaat catgaggata 11581 tatcagacaa acccacgtaa agagacatgc tgtatacaaa actgtaatct tagaaagtgc 11641 caaggtcatg aaaatcaaag atagaccctg gaactgttcc aaactggagg ggaccaaaga 11701 ggcatgacaa ctaaacacaa cacatgattc tgaactggat ctttttgctt gaaaggaagt 11761 tacagggaca gttggaaaag tttaaatggg gcctacaatg ccgtggtaat gatgtgtccg 11821 tgttaatttc ctgattttca tggttgcctg ttaagttaca tcagaggatg ttcttgtttg 11881 ctggaaagta aatcaatgta tttggcaggg gataaggcat caaatggtca ccttaatttc 11941 aaattattac agggaaaatg tttctctctg tacttaataa cttttttgca atttcttaaa 12001 atgaaagctc tggagtaaaa acttcaagga tccaaatctc gactttgcta cttcttaggt 12061 atgggcccga gggcaggttg cttcacctct ctacacctca atttttccat ctgtagattg 12121 gggatagtga cagtactgat ctaagagagt tgtttagaga aaaacaaaaa cagaaagtga 12181 taattagtgc atgtaaagct cctggcatgg taagagcttt ctaaaggatt gtggctatat 12241 tcttatttat ttatttattt atttattgag acagagtctc actctgtcgc ccaggctaga 12301 gtgcagcagc acaatcttcg ctcatttgca acctctacct cccagattca agtgattctc 12361 atgcctcagc ctcccgagta gctgtcatta caggcgcagc tgtagtgatt ctcatgcctc 12421 agcctctcga gtagctggcg ttacgccgcc acgcccagct aatttttgtt tttttatttg 12481 ttttaatttt tttttccccc agacagagtc tcactgtgtc acccaggctg gagtgcagtg 12541 gcacgatctc agctcactgc aacatatgct tcctgggttc aagcaattct cctgtctcag 12601 cctcctgagt agctgggatt acaggtgtgc gccaccacgc ctggctactt tttgtatttt 12661 tattagagat ggggtttcaa catgttggcc aagctggttt cgaactcctg acctcgtgat 12721 ccacctgcct tggcctccca aagtgctggg attacaggcg tgagccactg cgccagcctt 12781 catttttaaa aacccaatgt catacccctt tcagtctcaa aaatccatgg tttttcagga 12841 tttctgggat caaatgccca gcctttgaaa aacaaggaaa agttttacaa ggcaaccaga 12901 gctttaggtt caaggtctca tatctatcgt gatctcctgt catttcctag aggggaaaaa 12961 aaaaaacaaa aacattgctt tgagatttgg gaacatccac actcatttct acattcccaa 13021 tgaggtgctc agacatgtgg ccttcgcaga aggggttgcc aaagcaacaa ggctccttgg 13081 gcagaaggag tctggcaggg ccctccagca ctcatgggtc tatacctctc ttttcgcagc 13141 tgaggcggtg ggggagcccc tggcagagtg gaggtgccag caccacacag tgagtttaag 13201 gacagggcca ggcctgaaac ccaggagctc aaactcccag gcaggtgctt tctttgcaaa 13261 accatgcatg gggtcccaca tcatgattcc tgtgtgttca gggccacaga gccaaactct 13321 gcagacatca tctctttgat tcttcagcct tctaaggtag tttgttataa ccccagagtc 13381 cacattccta acctcttatt tctacagcct atgccataca gcacactttc catattgcct 13441 acaaaagatt tcgggaaaaa aatatcgtca cttagttaca actttaaata ggttggtacc 13501 atagaaacaa tccataagtg gcatttcaaa gctgatatat ttaaagacta gcacatgtac 13561 tttttctagg acagtgacat tatacagttg cacatggctt attcttattt tgcattttgt 13621 cccaaaaata tatatggtgt ccttaacttg gcagaattga gaacccagat cagctgagat 13681 tagcaatttg atttccctct aaatgtttct tacagatggt ttctacccta cttatctttc 13741 aagaattcgt gaagactcta tccttatatg gaaagctgga acatgaggtt ccttaggctc 13801 ggggtgttta tctggaaatt aaatatccgc aaaacacagt aaggggagaa accacagcct 13861 tcatcattca tttattcatt tgtcattaac aaataattaa tgagtaccta catgttatct 13921 attgcagtat aacaattacc tccaaacgta gcagcttgca acaccagtaa acattatctc 13981 acataatttc ttttggtagg aatttgagag tggcttagct gggtggttct gactcagggt 14041 ctttcatggg gttgcagcca tctgaaggct tgaccagggc tggatgacca acttccaagg 14101 tgcttctctc acatagctgg caaactggtg ctggttgttg gcaggtttca gtttcttgcc 14161 atgtgagcct ctacacatgg ttaccaagtg tcgttatgac atagtggcta gcttccctca 14221 aaatgagcaa agcaagagag agcaaggcag aagccacaag gtcttttatg gcctagtctt 14281 ggagtcacac tgtcatttct gcaatatctt tttttttttt tttttttgag actgagtttt 14341 gctctgtcac ctaggctgta gtacagtggc gtgatctcgg ctcactgcaa cttctgcctc 14401 ccaggttcag gcaattctcc tgtctcagcc tcctgagtag gtgggactac aggcgcatgc 14461 caccacaccc ggctaatttt tgtattttta ttagagacgg ggtttcacca tattggtcag 14521 gctggtcttg aactcctgac ctcaggcaat ccacctgcct cagcctccca aagtgctggg 14581 attacaggca tggggcacca cgcctggccc atttttttat tttctttatt ttttattttt 14641 atttatttat ttatttattg agatggagtc tcgctctgtc accgggctgg agtgcagagg 14701 cacaatctca gctcactgca acctccacct cccaggttca agcaattctt ctgcctcagc 14761 ctcccgagta gctgggacta caggtgtgcg tcaccatgcc cagctaattt ttatattttt 14821 agtagagatg ggggtttcac catgttggcc aggatggtct tgatctcttg acctcgtgat 14881 ggacccgcct cggcctccca aagtgctggg attgcagggg tgagccaccg tgcccggctg 14941 cgcctggccc atttttgcaa tatcttattg gttacacatt cagccctata tagggttgcc 15001 agatgtagta aataaaaata caggatgccc aattaaactt gaagtttaga taaaacaatg 15061 aataatagta tatgtatgtc tcgtaaaata ttgcgaaata ctaaaaaatt atctgaaatt 15121 caaatttaac caccatcctg tatggaaccc taaccctatt cagtgcacaa aggaaaaaca 15181 tgccaaagtg ggtgtgaata ccaggaggcc aggttcactg ggggccatct tggaagctgg 15241 cttctacaac cttaacatgc tgggcatggt agactatggg gataaatggt aaacaatgaa 15301 aaataatgat tcctgttctc atgcagtttg tagcctgttg aataaaatca acttgaatca 15361 acccagcagg aaaataaata caaaacctta acgattataa aagtgcttca gagagaaata 15421 gctgatgctt gggacagtga tggggtcctc atcaaaatag ccactttttt tttttttttt 15481 ttttttttga gacagggtct tgctctgtct cccaggctgg agtgcagtgg cataatcata 15541 gctcactgca gcctcaacct cctgggctca agtaatcctc ctgcctcagc ctttcgagta 15601 gcttggacta caggctgcag tagccaaact cagctaattt tttaagatct ttttgtagcg 15661 atgagggctc actatgttgc ccaggctggt cttgaactcc tggcttcaag tgatccacct 15721 tggcctccca aagtgttggg attataggcg tgagacacct cacctagccc acttttgctt 15781 tatagatttg agtttttaca tgtgatttga tttgtctcta tgggcgtata aggagtatat 15841 ggagggatct gttgctaaaa caaggaaata aaaccctgaa atgcaaggtt ttgaatctta 15901 gcaaaatccg taagcacaca caaatgaatg cacataaaca tggcgaaaac tgaataaggt 15961 ctgtagtcca gttaacagtt tttactaatg ctaatttcct ggttttgcta ttgcactaca 16021 gttgcatcag atatcagcat tgggggaagc caggtgaagg acacatggaa ccctttgcac 16081 cactttttcc acttcacatg agtctacaat tatttcaaaa taaaaagtta ataaacaacc 16141 catttaaaaa aaagcagctg ttgtttggaa atggaaacaa ggttagatca aaataatcca 16201 gacttggcgg ggtgcggtgg ctcatgcctg taatcccagc actttgggag gccgaggtgg 16261 gtggatcacc tgcggtcagg agttcaagac cagcctgacc aatatggcaa aaccccatct 16321 ctaccaaaaa tacaaaaatt aactgggcgt cgtggcatgt gcctgtagtc ccagctcctt 16381 ggggggctga gactggtaag ttgcttgaat ccaggaggcg gcggttgcag tgacccgaga 16441 tctcgccacc gcactccagc catgatagag caagactcag tatcaaaaaa aaaaaaaaaa 16501 aaaaaaatcc aaattcatga aaggactgga ggagggcctg atgtggtgga gccagggatt 16561 ctgtctgaag aaaaagaatt cttttttttt ttgagatgga gtttcgctct cgttgcccag 16621 gctggagtgc aatggtacaa tctcggctca ctgcaacctc cacttcccgg gttcaagcca 16681 ttctcctgcc tcagcctccc gagtagctgg gattacaggc atgggccacc atggccagct 16741 aattttgtgt ttttagcaga gatggggttt ctccatgttg gtcaggctgt tcttgaactc 16801 cctacctcag gtgatccgcc cacctaggtc tcccaaagtg ctggcattac aggtgtgagc 16861 caccatgccc agcaggaatg gaatgcttat tgttaattat cctgagcagt gtgggtttag 16921 aacacagaat gaggccaact ttgaggacag gggtaagagg tgtctttctg gttctgggcc 16981 cagatgtggg gtccggagag gcctggaaca cttccattgg taggaccagt tgtctctggt 17041 cccaaggaac aggctgggga aaccctcaaa gaagcctggg ttttttggtg acctagcctc 17101 ttaatttaaa tagtagacga aagaaatgtt ttcattttct accccaaaat gaagaaaggg 17161 aatattctgg tgacattggt gtatcccaga gatctgaaag ccatttgctg acccacaatt 17221 gaggctgatc aattgttttt taaggataac tgtggccaat tgttgatggg ttttctgttg 17281 ttgttgtggt ttgttttttg ttttttgaga cagggtctcg ctctgtcaac ccaggctgga 17341 gtgcaggcag tggcacagtc aaggctcact gcagccttgt cctgctgggc ttaagtggtc 17401 ctcccacctc agccacccaa gtagctggga ctacaggtgc ttgccaccat gcccagctaa 17461 tatatttttt tgagacggag tctcactctg tcgcccaggc cggagagcag tggcaccatc 17521 tcggctcact gcaacctttg cctctcaggc tcaagtgatt ctcctgcctc agcctcctga 17581 atagctggga tacaggcgtg agccaccacg cccggctaat ttttttgttt tgttttcttt 17641 tgagacaggg tctcactctg tcacccagat tggagtgcaa tgcctcaatc ttggctcacc 17701 accacaacct ctgcctccca ggctcaagcg attctcctgt ctcagcctcc tgagtagctg 17761 ggattacagg cgtgcgccac taccacccag ctaatttttt tgtattttta gtagagagga 17821 gatttcacca tgttggccag gctggtcctg aactcctgac ctcaagatga tccacccgcc 17881 ttggtctccc aaagtgctgc gattacaggt gtgagccacc acacccagct tcccgctaat 17941 tttttttttt tttttgtatt tttagtagag acgggttttt accatgttgg ccaggctgat 18001 ctcaaactcc tgacctcagg tgatccgccc gcctcagcct cccaaagtgc tgggattata 18061 ggcatgagcc accacaccca gctcgagcta attttttaaa ctttttgtag agacaaggtc 18121 tcattatttt gcccaggctg gtctcgaact cctggactct tgtgacattt gttcaggggg 18181 aagccaacac catgtatgaa gtttaaccac cctgagatgg ccacgtcctg aagaagccca 18241 gtctagccat gcggagagac tacatgcata agaggtaccc cactaagcct ctagatgctc 18301 caaccatccc agccgaggta ccagacaaat gaatgaagaa atcttcagct gccatctgac 18361 tagagaacca ccagctcact ctagttaatt cccaaaactg agcggttttt tttttttttt 18421 ttttttcaga cagggtttca ctctgtcccc cgggctggac tgcagtggtg taatcacagc 18481 tcactgcaac ctccacttcc aaggctcaag tgattctacc tcagcctccc cagtagctgg 18541 gactacaggc acaagccacc gtgccttttt ttttttttct tttttttcta gagaccaggg 18601 tttcaccatg ttgcgcaggc tggtcttgaa tttctgggct caagtgatcc tcctgccttg 18661 gcctcccaac ttactgggat tacaggccac tgggattaca gttctagcca ctgcacctgg 18721 ctagaactga gagatttaac aagttgtggc caggcgcggt ggctcacgca tgtaatgcca 18781 gcactttggg aggccaaggc aggtgaatca cctgaggtca ggagttcgag accagcctgg 18841 ccaacatgat gaaaccccat ctttactaaa aatacaaaaa ttagccaggt gtggtggcgg 18901 gtgcctgtaa tcccagctac tctggaggct gaggcaggag aattgcttga acccaggagg 18961 cggaggttgc agtgagccaa gatcacacca ttgcactcca gcctgggcga caagagtgaa 19021 acgctctcaa aaaaacatac aaacaaacaa aaaaacaagt tgttttaagt cactcagttt 19081 ggggctgctt ggttgggccg caagagatga gtggaacaat gaatgagact ggatgtcctc 19141 tgctacccaa tgactggcaa ccatgagctt cctgggtctg taattccatc agctgtccat 19201 cccttaccct ggaagggtag aggatttatt ctttgggtag acatagttta agagtttttt 19261 tgtttttgtt ttttgttttc aatgaaaaat tgccaagccc tttggaggcc ctgatggctc 19321 aaccaggttg gtggctgcta acgagagtga agatctatat tcagtccatc tggcacatgt 19381 gcagaagacc cttctccaac cccagatcat gactacaaat ggctatcagt tggacaaaca 19441 tacatcctga tgagaagata tggttggttc atcacaacct tttatttcac aatattaata 19501 gtaatcatta tagctaccat gtattaagcc ttcaccatgg ggctgttaag tcttcattgc 19561 ttagctttac taaactagat aatgtgtttg cttatctcat ggatttctct taaccatccc 19621 ctgagatccc cctatagttc ttatcccttt acccccattt tacagaagag gaaaactgag 19681 gctcagagaa ggggagtcac ttggccacag tcgcacagtt ggaaagtggt agagccagga 19741 tgagacgcta attctgactc caaagctagt tatatataat aatagcaccc aagtaatgta 19801 tagattacca tgttttcaca tatggcaaat taaggaatca gaatgaatga tgcatgctgc 19861 ccaagctgga cagaggatgg gccagggcac aagtgacggc tgacccactg atccatggaa 19921 cccccaagaa aggctctaga acaaagctct gatttcacag tgagattaaa attctgccca 19981 aatcagggta catgggtgga attaatcaaa aaagttagta tgtatgcctt tcaccccagc 20041 aattctactt ctcattttgc agagactcac tcatacagtg gcatcaagag gttatatgtg 20101 caggagaacg tccactgcag ctttgccagt agcgatgaaa actgctaaca gcctaagggc 20161 ctagccaaag tggaatgctc aaaaaagatg gtttgctcta gaacaggcag ctggtaaaag 20221 gagtggtgtt agatcagcac atgttcacat cagaaatgct ccaggagcca ggtgcggtgg 20281 ctcacgcctg taaacccaac tctttgagag gctgaggaag gattgcttga gcccaggagt 20341 ttgagaccag cctgggcaac atagtgattc ccctctttac aaaaaataaa aagttaggta 20401 ggtgtcatgg tccagctact caggaggctg aggtgaaagg actgcttgag cccaggagtt 20461 agaggttgca gtgagctgtg tttgcaggag ttagaggttg cagtgagctg tgtttgcagg 20521 agttagaggt tgcagtgagc tgtgtttgca ggagttagag gttgcagtga gctgtgtttg 20581 caggagttag aggttgcagt gagctgtgtt tgtgccactg cactgcagcc tgagcaacca 20641 agcgggactc tgtctccaaa acaaacaaac aaacaatcct ccagagcaat ctgttaaatt 20701 acattttgga ataacctatt caaaatggtt ttttccatac gtctttaaga tgataatatt 20761 tatatgttca aaaatttaaa gtctggaaga ctatgtacaa acagaaaatg gtagcattac 20821 ctttaggagc tattgagagg aagtacaaat agagccactt gccttcttac tatcaattct 20881 gattgtccca tgttttataa tatattcatg tagtactcta aacaaaatct tttcagatca 20941 aatcagggga gatgaaatga gagtggattc tgagcctagt ttttttcagg gaacaagacc 21001 cagtgctacc acttaggggc tagaaacaga aattgtattt attttatttt gttttattta 21061 ttttattttt gagatggagt ctcactctgt tgcccaggct ggagtgtact agcatgatct 21121 tggttcactg caacctctgc ctcctgggtt caagcaattc tcctgcctca gcctcccaag 21181 tagctgcaac tacaggcatg cgccacgacg cccagctaat ttttgtattt ttagtagatg 21241 gggtttcgcc acgttggcca ggctggtctg gaactcctga cctcagctta tctgcccacc 21301 tcagcctccc aaagtgctgg gattacaagc atgagccacc gggcccggcc tctttatttt 21361 ctttcagtgt tttcccagga agttctctga acatccaggg tcttcacaga tggacttctc 21421 ataaaaggta actcctggaa ccagacataa attttttttt tggggtgtgt ggtgatggga 21481 tacatagctg tcgccagatt ctcaaaggga cagtgtcaca cttaacagag caaggagagt 21541 aatcggtgat tggtgaatga tcatttatca gagactgtgt gtcaaacatt taaaaaataa 21601 aaagtaactg ctgaagaaat gaacaaaaag ggtcacacct tcaaaaaagc ctaatttcca 21661 ttccagagca atctcaagac aaggaaaaat aagagctgct tcacactgag ggcttatctg 21721 gtatcaagga ccaccgccaa tgtcatccat tagccatccc caataaaccc atgaagttca 21781 gcctgaacag catagaaaga cctcgtctct acagaaaaaa aaaaatttaa aaagtagcca 21841 ggtgtggtgg cacatgccca gctactcagg aggctaagat gggaggatca cctgacccta 21901 ggagaccgag gctgcagtga gctatgatca cgccactgta ctctagcctg agtgacagag 21961 cgagattctg tctaaacaca cacatacaca catacacacc atgaagtttt aacttcccca 22021 ttttacagat gaagagacaa agtctcaagc catacagcca gtaggtcagc ctgactccaa 22081 aaacccatgc cccaagccac aataagtcac tgccttccaa aagcctcata ttatcatgag 22141 atataaggca aggaggtatg ggtaaaacac cttaaccctt aaagatacaa acttattgta 22201 ggcccagttg gtccctcagc atttggggac tgtgctataa gggttaactc agcaggcttg 22261 aagtgtccaa acccagcaca tctcaaagaa aaaactggcc ttaatgggct cctgggagat 22321 aacctctaag tccttggaat atcctcccaa gtaggggtgt ctttgtatac ctcaggcctt 22381 gggccatacc aggccagata gtttgtgcta ataatgtgat ttatggtggg gaccttgggc 22441 catgctgtat cattttgacc tctggagcag ctggtgatta agtaactaaa gtcggtcatg 22501 tgattgccct atgtgactga catgcaataa aaacctagac cccaaggcta gggtgagctt 22561 tctggctggt aatatttagt acgtgttgtc acatatcatt gctgggaaaa ttaagtgtgt 22621 ctgtaagact ctaccaggag atgacaactg gaagcttgag gccggtctct cccggactct 22681 gccctatgca cctttttctt ttgttgattt tattttgaga tggagtcttg ctctgtcacc 22741 caggctggag tataatggca cgatctccgc tcactgcaac ctccgcctcc caggttcaag 22801 cctcctgagt agctgggatt acaggtgcac accaccacgc ccaggtaatt tttgtatttt 22861 tagtagagac ggggtttcac catgttggcc aggctggtct cgaactcctg acctcaggcg 22921 atctgcctgc cttggcctcc caaagtgctg ggattacagg cgtgagccac tccacccggc 22981 ctcttttgct gattttaatg tttatccttt tgctgtaata aaccataacc ataaggataa 23041 tagctcttct gaattctgag ttcttctagc caatcatcga tcctgggggt ggccctgggg 23101 accctgacac aggaattcag ctgtgcctta catgcaccta tattctccag ctttttgctc 23161 atcacaaatc tacctgacaa aatccagtat ggaaagaaaa aaaaaaccct attcattcag 23221 taaagtaagg cacaaacaaa tgagcctctg ttcactttat tcaaagagta tttttttcct 23281 ctgtgaaact aggtagaact gaggaggaat caaagaaagc ctcatatatt aaactcttaa 23341 atagattctt tgaattcaaa gtaagggtca ataggagagg cacaggttgt gggccttgtc 23401 ccagcaaaca aagccaccaa ggcagtcctg caaattaagg aggatggcaa atctgtctct 23461 taaaaaaaag ttcttgaggg gaaaaatata aaatacctaa gtttcaaaag ccggactact 23521 tccataccct tcattctcta ccctggagtg tccagagata gagagtttca aggtacgttt 23581 tcaatgattc agcttttgca aacatatgca aatacattcc tcatcagcag tcctgttttg 23641 gcgattaata gcaagcaata tgcttggatt agaggtccga tttccactta aatgcatttt 23701 cttctcttct tggcaatgaa gtcatttgcg gagatctgaa agtggggaag actgtgttag 23761 gctcagagta acaattctga aatagatcat ttatatttat acatgtgatt gcagaacata 23821 ctgtcttcat ccattcgcat aagcatgtaa tgttttaaga atgcatccca gatggccaga 23881 gagagtttaa atttaaaaac gtttcctggg ttcttcccca catctcatct tgcatccact 23941 tgacctcaaa tatacacttt tcttatggac acagagcaaa tgacagtgat tatgtgataa 24001 atactgatga attcttggag gcaagagaag gactcatgtt tgttcaaggc caactctgta 24061 cccaggattc agctaaagca cctctatatg gtatctcatt taaacagtaa aacaagtaac 24121 aaatctgctt ttgtcaatga gaaggtccag gattctacta atgatctgca gaaaggcagc 24181 gagggaacag ttagtgagga acagggagga aagctgtttt tcatgagagg gttcttttct 24241 ttggaataga actatgggat cttgggtagg tgtggtggct catgcctgta atcccagcac 24301 tttgggaggc tgaggtgggc agatcacttg aggccaggag tttgagacca gcctggccaa 24361 catggtgaaa ccccgtctct actaaaaata caaaaattag ctgggtgtgg tagcgcacac 24421 ttgtaacccc agctacttgg aaggctgagg cacgagagtc acttgaaccc cagaaggtgg 24481 agattgctgt gagctgagat cgtgccactg cactccagcc tggggaacag ggtgagactc 24541 tgtctcaaaa acaaaaactc tgggatctta gtgcaactca agatgggaac cctaataatg 24601 acagagtcct tgcaggttgg agcaaagtaa attgaaagaa aaactaaaac atatgctgat 24661 tgtttaaaaa tagatacaga gactaaaaat aagagtccaa gctggaaggt gtttccatgt 24721 aaaataacct ctcttgaggt ttcctttaac ctgatcatag aggctggggg acagaaggaa 24781 gtgagctcac tcatgcagcc ttaactccag tgcccaagtc cacagcgagg gcccagcact 24841 tcacggtttc caagcactcc cacctgggcc agctcacgcc atctcatggc caacctcaga 24901 ggtgggcaat gatgaatccc cattctccag atgaagaaaa caggggctga agggagttaa 24961 acctggagat cttaaatctt tgaattacca agttcaaaaa aatccctttc cccacccctt 25021 agtagaatat ggatggataa atgtgcactt aaaatcattc actaaaagca aacctgggag 25081 gaattctgtc ccttggtggc tgtgtagcct tgggcaagtc aaagcccttc cttctctgtg 25141 ctttggtttt ctcttctgta aaatggaatg atcatgtcat ccgcctccca aggatgctga 25201 gggttaaatg agtaacacag gaccagtacc tggcccaacg gccatggggg ttattcttag 25261 aacagaacga cttgtgactg ttattagaac attaattgct ctggcccaac gctgatgctg 25321 tttttagaag ccagactggc agacagcatt ctgtgcaaaa atggttccag aatctccccc 25381 tcccccatca tagtggtgct tcctgccctg tccccagtgt cagacaaacc tttttctggt 25441 tggttcagcg ttaatccctc tccctgtcct tgatccagag ctagaaaact aagccttgat 25501 tctataaata gcacccccag tctgtcacca tggaaacctc tgatggggct gggtcagccc 25561 accctagtgg acatgtctac cccagggagg ctctggaagg acagaaaagt ggcttttcag 25621 aaaagtggct ccggggccac ccagaagcag cgggaacccc cagtgaagtc cacaggctta 25681 cctggaaatc acgagcttcc taagtgggag gaggtgctgt tctggttcac ctggagaaag 25741 acaatacaca gaagaagggg tcacggccag gtgagggatg ccttgctgta ttgagggtca 25801 cccctcagca tcactgagtc atgcacatgc ttattcacgt agtatttact gtacatctac 25861 tatgtgccag gtgctgggat agcgtggtga gcgggacaga cccaaaccta ggatcacgga 25921 acttacgttc gagttgaaga aaaagcctac tgaacacgca cagatgcaag aaaatgtcaa 25981 acttgggcaa tatgacaggc tgctatgagt gatcagtatc tgtgctactt caaaaacggt 26041 ggccaggaaa ggactctaca agatggcatt tcagctgtgg tataataata aatatttgtt 26101 cattgtcccc agttcctggc acatagctcc ccaaactctt ggaatctctg aaggaataag 26161 ggtatcttgg ccgggcgcca tggctcatgg ctgtaatccc agcgctttgg gaggccgagg 26221 agggtggatc acctgaggtc aggagtttga gaccagcctg gccaacatgg caaaactctg 26281 tctctactaa aatacaaaaa ttagccgggt gtggtggcgg gcacctgtaa tcctagctac 26341 tcaggaggct gaggcaggag aatcgcttga acccgggagg cagaggttgc agtgagccaa 26401 gatcatgcca ctgcactcca gcctgggtga cagagggaga ctaggtatca aaaaaacaaa 26461 aacaaaaaca aaaaaaaaac aaaacgtatc ttttgcatgc taatgacatg actaatggct 26521 gggggtccct agatagcttc aggattaggg gaaggggtct cagaaagacc aaggcatgat 26581 tagagggttg gaactttttt tgttgtttga gacggagtct cactgtgttg cccaggctgg 26641 atggagtgcg gtggtgcaat cttggttcac tgcaacctcc gcgtcctgag ctcaagcaat 26701 tctcctgcct cagccttcca agtagctggg actacagacg tgtgccacca cgaatggcta 26761 atattttttg tatttttagt agagacaggg tttcaccatg ttggcccggc tggttttgaa 26821 ctgctgacct caagtgatct gcctgcctca gcctcccaaa gtgctaggat tacaggcatg 26881 agccaccaca cccagccaga aggttggaac tttaagcccc acctgccttc tcccctccct 26941 gacctcctgg gaggggagag gggagagggt ctaggggcta ggagctggag attgggcaat 27001 cacccatgac ccatgattta atcagtcatg cctctatgat gaaatctcta aatgacagaa 27061 tttggaaagc ttcctggttg gtaaacacat tgagaggctg ggagggtgtc atgccgggaa 27121 agggcatgga tgttctgtga cccatccccc atacctcacc ctacgcttct tgcatttgac 27181 tgtacctgag ttgtattctt ttttatgaga cagggtctca ctctgccacc caggccagag 27241 tgcagtggca cgaacatgtg ccacatgtgc catacccagc taaggctttc tcttttttgt 27301 agggagatgg ggtctcactg tgttggcctt gaatttaaaa aaaaaaaatt aatgcgatct 27361 gagagctagg gaccatggct tttgagcatt tcctggttgt ttggggagag gtcccactcc 27421 ctttgtggga atcagagatc tatcttttgc tgacgtcaag aaagtagcca gaattagtga 27481 gcactaattt agtatttgtt ttaaccaaaa agatgatgta ggcagaacaa cacagtgggg 27541 atgaggagaa ttatccacaa actgaaactt ctcatttttg cacgttgcct tcgtttttgt 27601 tgttgttgtt gctgctgttg tttttaagag acaggggctc cagcgatcct cccacctcag 27661 cctcctgaga acctgggact gcaggcatgc cactgcaccc aactcctgcg tgttgccttt 27721 ttttgtccac ctgaatatgc atttttacat tagttgcaat tgtatagtat gagctgcttt 27781 gttttgcttt cttactattt tttcatcatt tgggtttcac tagcaccatt ttggatggct 27841 acagatattc tgtggagtgc atgtaccacc gtttactgac tgcttaccta agaatggttc 27901 caatttttgc agtattttaa atgacatcac atcctgattc tgcctctctt ctgttgggta 27961 ctgaagctct actgaatcat tcccttagaa ttcacttcta aaaaatggaa tcatgcttga 28021 acccgggagg tggaggttgc agtgggctga gatggcgcca ctgcactcca gcctgggtga 28081 cacagtgaaa ccccatctca aaaaaaaaaa ggaatcagca ggagcaggtg gaagggttca 28141 gatcttttga agactcttga acaacaaggc taaaagtgtt ttacaaaagg gttggggtca 28201 ggagcagtgg ctcacgcctg taatcccagc attttgggag gccgaggtgg gtggatcact 28261 tgaggtcagg agttcgagac cagcctggcc aacatggcga aacccctttc tactaaaaat 28321 acaaaaatta gccaggcgtg gtggccggtg cctgtaatcc cagctacttg agaggctgag 28381 gcaggagaat tgcttgaacc tgggaggcag aagccgcagt gagccgagat caccgcattg 28441 cactccagcc tgggcaacaa gaggaaaact ccatctcaaa aacaaaaaca aaaacaaaaa 28501 caaaagggtt gggccagttt atacttcctg ccgtgtatga catatgcaag atgtctagct 28561 gacctgggac tggtttttgt tgcttttagg aacaaaatat gaaggaatgg cataccccca 28621 cgctcaggag ggctgccctg cagataagtt ttgtcaagcc tctatttcca accctcatct 28681 ccctactgtc ctccactgtg cacataatgt cattctggat gttgtaaaaa ctggaaatgg 28741 tcttgaagaa gctgtttagt aaatggtagc acatgcattc cacagaacac tacgtagcca 28801 tccaaaatgg tgccggtgaa actcaaaaga taggaaaata ttaagaaagc aaaacaaggc 28861 agtttgaaca tgaaaattgc aatgaacatt cctgtaaatc acattggggt agaaacaaag 28921 aaaaatataa ttgcaacgaa tgcaaaagtg catacgcact ccagaggttt cggccacact 28981 ccccactttc aggcgtttac agttccctct gccgggaagc cttttctatc cagatctttg 29041 tgtggcagac ctagttttaa tccttcagat attggcttat gggtgctttc caatatcagt 29101 gaccaattct ctgattatct ggataccaac tgggtgttca acaattcagt tcagttctga 29161 caccagctcc cagacttact tagcatcagc tgccacagat taagggccca gttccacgac 29221 tgccccactt tagatgccag ctgcaatggg gtgcgcaggc tatccacact tcttcccagc 29281 caacttcaaa tctggaggtt cccacaaccc ccactctggt ttgataattc actagaacag 29341 ctcacaagac actagggaaa acactttact aactgctacc tgtttattat aaataataca 29401 actcaagctt ggacaacatg ccaaaaccct gtctctacaa aaaacacaaa aattagccgg 29461 gcatgatggt gtgtgtctgt agtctcagct tcttgagggg ctgaggtgag aggattgctt 29521 gagcccagga gattgaggct gcagtgagcc atgatcacac cactgcactc cagtctcaac 29581 aagaaagaaa gcccaagaaa aagtccccac gaagtccctc caactgcatt caagccggaa 29641 gacgaagtga gaggctgctt cctggcagaa gggccactga gatgtggccc cagctggatc 29701 agtgcttaga gaccctctgg tccagcacgc atccccacag ccctttattt tttattttta 29761 ttttttgaga tggagtctcg ctctgtcgct cgggctacag tgcaacggcg tgatctcagc 29821 tcactgcgac ctctgcctcc cagggtcaag cgattctcct gcctcagcct cccaagtagc 29881 tgggattaca ggtgtgtgcc actacgcccg gataattttt gtatttttag tagagacggg 29941 gtttcaccgt tttggtcagg ctggtcttga actcctgacc tcgtgatctg cctgccttgg 30001 cctcccaaag tgctgggatt acaggcagcc agctcacatc ccccttttta taatgggaaa 30061 accatcaacc agagtggaga agtgacacat ctaagggcat acagccagat ggtggtatag 30121 ctgagaaaga cagaattcgg aattcccctt ggggtttcta acctgaaaat atcctttcag 30181 attggctggt gttttatagt cccctttcaa gacctaagga agagaaaaag aaatatcact 30241 attatgaaaa tggccaaaaa ataaaattta gtcattccca acacatccag tcatccaccc 30301 atccacctcg ccacccaccc atccctccat ccaacaaaca ttgcttaagc tcctcttctt 30361 ctaggcatta ttctaagcat tgtgtctggt tgacatgatg ttaaccagat aatggaaatt 30421 cctgttcccc tggagctacc attctagagg gggcaccaga taacatacaa gaaaaataaa 30481 cacagataaa caaggacata gcaaaccatg ggaactgttg tgaaaaaagt acagtaaaga 30541 ggtgggatta ggagtaacca gggggcttct tttgagtggg tggtcaagag gggactcttg 30601 ggccaggcgc agtgactcat gcctgtaatc tcagcacttc tggaggccaa ggcaggagga 30661 tcacttgagt ccagcagttt gagatcagcc tgggaaacac actgaaaccc catttctaaa 30721 aaattaaaaa ttagctgggc atggtggcat gcacttatcg tcccagctgc tctggagcta 30781 ctccagcctc agctggagga ttggttgagc ccgggaggtt gaggctgcag tgaagtatga 30841 tcacatcact ccactctagc ctgagtgaca gagcgagacc ctgtctctaa gaaagtgaat 30901 aaataaaaat aaaaagggcc tcttgaaata gataaccttt gagctgagac ccaaacgaca 30961 aggaaaaata ggtaggctaa gctcagaaga aaaagcactc caggcagagg caacagagca 31021 aaggccctga agttagaacc acaacatttc aggcctttcc gctagtagtc acattttaaa 31081 ataaaatgaa acaagggaga ttaattctaa caatatcttt tatttaacct gatatagcta 31141 attttttttt ttttctattg agacatggtc tcactcccat agcccaagct ggagtgcagt 31201 ggtgcgatca cggctcactg cagcctccac ttcccagcct caggtgattc tcccacctca 31261 gccttctgag tagctgggac tataggcacg tgccaccatg cctggctaat attcaatatt 31321 tttagtagag atgaggtttc accatgttac ccaggctggt ctcaaactcc taggctaagc 31381 agtctgcctg ccttggcctc caaaagtgct aggattatag acatgagcca tcatgtccag 31441 cctatgtaaa atattaacat ttcaatatgt aatttatata aaagtgacta ataagaaacg 31501 ttacatactt ttttgtccta agccttcaaa atctagtgtg tattttacac ctaacaggac 31561 ctctcacttt ggactagcct cctgttacag aatcaggggc cacacatggc tagtggctac 31621 cgtatgggac actgcaggca gagacgattt catcagcata gatgtccttt tagccactgt 31681 gtttactgag ccccaggatc cattattgga acacacaggg tccacagcac tcacagaagc 31741 acacaggcat gcatgcaaac tcacagggct tagtgagagg acttttcttt gcagtaacag 31801 ccaagctatg gcaaaaacta aaccagggat ctagcacttg ccaagacttg ttgaccgccc 31861 cagggaagaa atctctttgg atgaaaatga gagacagctt ccattttaaa taaacaaaca 31921 caaaaagctc caaagagttg atgttaacca ccagtgctca tgaaaattat ttgtgtttta 31981 taaaatacat aaagtcaggc tttactccag gaacgtccga cccatagcct ggggcaagcc 32041 aggggcagcg agccaccttt taaggagacc ccagaggagc cactgctggg cccttggctg 32101 agaacagtga agtaagcaac atccagccca ctactgcagt gtccccagcc ggccctcgcc 32161 tctgcttctg agaagcacct ggggctggat tgtgagatca ctaccatctt tttaatccag 32221 ttaattaatt ttttattttg ataatacatg tacaacccac aggcagataa aagtatgtaa 32281 aacacataag tagggtttaa agattactat tattaaagaa taataaaata aaaattaata 32341 atatttataa taaaaataac aaaccacatc tgggttgcag ccattcacct gtcaatgtcg 32401 gctggtatgt acgttgaatc tctgagtatt tccataaata gatgtctgca aaatcaacta 32461 atgtcaagag gaagaggaac tcacgagacc cacagacaaa gcttacactt caattacaag 32521 ggcacagccc ccaaagctca gcgttagtaa attctctttg ttaaagagag atcagtgagt 32581 gctagagagg tggtaatgac tttttagcac caaatggaaa ctttcttctt gagccaactg 32641 aagagttacg gaactaagag acgattaact tttttccgtt gtgattcaat acgataactg 32701 tttatttcct aaaatcagca cacgctcaga aatgctgagc ttagcttaaa ttaaagactc 32761 tgggtcttaa tgaagtaagg gctgtgaaag ctcttcctca ggcctgggtc ccattttcaa 32821 atatttgcct ataaggaagt tagaacacag cacttaaagt aaactagaag ctgtaatgga 32881 aacgtataaa taaggccatc aacttgttaa ctgttctttt acccagtggg tgttagccta 32941 ctttaacact tcaaaagcat tttgtcagtc actggaggct cacacctgta atctcagcac 33001 tttgggaaga gtgaagcagg ggcatccctt acggccagga gttctagacc agcctgggca 33061 acgtagcgag accccgcctt tataaaaatt aaaattagct gtgcatggtg gcgtgcacct 33121 gcagtcctag ctactcagga gcctggggtg agaggatggc ttaagccgag aagtatcagg 33181 ctgcagtagc tatgactgca ccagtgcacc cagcttgggc aacagagcaa gaccatgcct 33241 caaacaaaac agtacaaaac aaaaaagtca tttcccttca ccatcttaac taaggagaat 33301 tttaagcata atttaaacag ccctaaactc acatttgtct gagaaatcca cattaaacct 33361 tataagacaa gaaagggacc aatagtttat ttatttatac cccctctcaa caaagacttg 33421 aggtgcctta caaacagata aacaaatagc cagggaaata ggggtaaaaa tgtaaataaa 33481 attagagaaa taagatgaaa ttagtgtaca agctggcctg ctgttagatg ttagagtatg 33541 agagacaggc taccagttgg ctttgagctt ccctgcagcc caagcaaaat ggaaatacaa 33601 gcaatcatgt agttcactgt gtccatcaga caggaacaaa tcagatgacc aggaattgag 33661 agactgaaca tttcctcagg tggctggagt ttccacatct gcagaagccc catatcccca 33721 agagtacact tccacaagta caggtgcaca ctcacccgct gtgtgttgtt gatgacgaag 33781 gggtcagggt tgccatagtt gggggggttt gcataagggt catagccgag ctgagccagc 33841 atgggggcga tctgggccat gtcccgcacc acatccccag ggatgtggcc agtccacttg 33901 gagagcgctt ccaggttaac aggcttgatg acctggtccg tggaccgctc gatcctgggg 33961 agagaggaga cgctggagag ggtagggcag acccagatgg cgcctgagcg gattccctgc 34021 tcagccactc tggggcagca ggaaagactg gagaaacctg gcctccaccc agctctgccc 34081 catcaactgt gagatggcag gtgagtcatt tttgcctcct catacctcct ttgtacaatg 34141 ggaacacagc atgcagtgag aagtcagaag aaagtactag cagtatacac cattggtaca 34201 tacattctgc tttcttctcc agcttaaaaa ataacaacag tttaggtcag gcacagtggc 34261 tcacacctgt aatcccagca cttcagaaag ccaaggcaga aggatcactt gagctcagga 34321 gtttgacacc aacttgggca acatggtgaa accccattac aaaaataagc tgggcatggt 34381 aacaagcgcc tgtagtccca gcagctgcgc aggaggctga gatgggagaa ttgcttgagc 34441 ccgggaggtt gaagaagcag tgagcctgag ctgtgattgc accactgcac tccagactgg 34501 gcaacagagc aagaccctgt ctcaaaaaaa aaagaaacac tttttaatgt gtatgactaa 34561 catgaatatt ttgtccctaa aaaaagttat aaaacactaa tgataaagct gaaagccact 34621 gtgttcatca ctgccagccc atcggcttcc tacaggcact cagtctttgc actgtggagc 34681 acaccacctt ctctaacctt ttcttgtaga gtagttaaga gcaaaccagg gagccagatt 34741 gcctgggttc agacgccagc tgttccccag ctgtgtggcc ttgggaaagt tacttcacct 34801 ctctgggcct cagtttcctc ttctgtaatt ggcaatgata gtagtaccta cctcactggg 34861 gttgtctgaa gagtaaatga gttaatacaa agaaagtgct tagaagagcg taggcacatt 34921 gtaagttttt tctttttgga gatggagtct cactctgttg cccaggctgg agtgcagtgg 34981 cacaatcttg gctcactgca acctccacct cctgggttca aacgattctc atgcctcagc 35041 ctcccaagta gctggaacta caggcatgta ccaccacgcc cagctaattt ttgtattttt 35101 agaagagacg gggtttcacc atgttggcca ggctggtctc gaactcctga cctcaagtga 35161 tctgcccgcc tcggcctccc aaagtgctgg gattccagac ataagccact gtgcctggcc 35221 aacacattgt aaatttttaa taagggttag catgctctct ctcctgtgca cacgctctct 35281 ctctcacaca cacacacaaa cgcgcgtctg cagctgtaaa acaaatacgc tagttttttt 35341 tccccactca atacctcact ctttttgcca gctatgtagc cttcctccat cacttatttt 35401 tccattcccc tgaagacaga catttaggtt gtttccaact tttcacagct accatggaat 35461 aaccatcatc ttcaacctgc ttcctaactc tccctgtaac aggcttccca agccacctga 35521 atgatgcaga tcttaccaat tataactcag gggcctgaca ggatgaatgg tggggaaccc 35581 ccaggatcaa gggcctagtt ctaaatgttt tcatcccaac taagaggtga cttgggaaag 35641 aacagaaaga ggcagcaaaa ggaagacaag accgggcgtg aggtgaggag aacaggggaa 35701 ggtgaaaggg tgtggactgg tgggtagaac acagattcag aggctgcaaa cctggattta 35761 aatcttaact ctacccaata tgtcaaaaat cttgagtatg tgcatggctt ttgccaaaat 35821 ccttgtgtct ctaggacttt atggtgcaga aatgaggtgg acacatccca gaacatgaat 35881 gagggcaatg cttgatacat atcagtaaaa ctggaaataa ccgaagtgtc caacaatagg 35941 gaatgtgaag tcgaatatgg caatccgtgt agcagcctgt cgcacagcta ttcaaaatga 36001 tgctgcaggt caggcgcggt ggctcatgcc tgtaatccca gcacttgggg aggctgaggc 36061 gggcagatca cctgaggtca cgagttcgag accagcctgg ccaacatggc aaaaccccat 36121 ctctactaaa aatataaaaa ttagcagagc gtggtggcgg gtgcctgtaa tcccagctat 36181 ttgggaggct gaggcaggga gaactgcttg aacccgggag gcggaggttg cagtgagccg 36241 agatcacacc actacactcc agcctgggtg acagagcgag actccgtctc aacaacaaaa 36301 caacaacaaa aatgctgcta ccaaagagtg ctggctgcca ctggcaaggg ctcccacacc 36361 accgtggaaa aaggctacag aggagtatgt ttcgttggac tccagtttca tcaacaggcc 36421 ctgaaaatgg tgcagtaggc cacacatcaa gatctgggtg gtgagatcat gggttgtttc 36481 tccgttcttc tttggtgtat cttttctaat tttctatagt caagatgtat tactttggta 36541 atctaatatt tttgtcataa taatttataa aatcctagct ccagcaccaa ctggctgggg 36601 tacccggcat gtaggaactt tgttgctagc ggagagcagc tgcgagcctc tagctctgct 36661 ctgggcctgg gaaggggccc agcagggacc agccatggga aggaccagcc atgggaagga 36721 ccaccctttc actgagctct caggctatcc agagcctgat caggatggga aagcgtggaa 36781 actgcatatt ctgctccagc attgttcaaa gtgctctggg tctcaggata gggacagcag 36841 cttttccaat ggaagaaggc agggagcaag ggacagaggg aggaaggaaa ggaaaggatg 36901 agggggagtg aaggggagga tgggaggagt tctggacatg gggagtttaa gaacacaata 36961 ggccaggggc ctgtaatccc agcactttgg gaggccaagg caggaggaag gcttgagctc 37021 aggggttgga gaccagcctg gaagacatac tgagatctca tctctactag gaaaaaaaaa 37081 aaaaaaatga gctgggtgtg gagatgctcg cctgtagtcc caactactcg ggagactgag 37141 gtgggagaat tgcttgagcc caagaggtcg aggctgcagt tagctgggat cgtgccactg 37201 cactccagcc tgggcaacag agtgaaaccc tgtctcaaaa aaacaaaaac aaaaacaaga 37261 aaaaccccaa aaaacagacc tgggttccaa tcccagcacc atgctttgtt agctgtgtag 37321 cctgaggcca gtaacttctc ctctgtgaga ttcagttttt tggtctataa aatgtggata 37381 agaggtgttc ctatgtcctc aggttgctgt gaggattaaa ggagataaac ttgttggggg 37441 cttaacccaa tacttgccat acagtaggca ctcattaaat gacaatggtg aggatgggta 37501 ctccatcaat aaaagcagta tcaacactaa aatagtaata atagcaccta acatttatta 37561 acctatttga tgcccacagg tatctataag gtaggcaaat tatatttcca tgttacagat 37621 gaagcaatgg aagcacaggg gttgagtcgc tttgcccagg gtcacacagc taagaagtgg 37681 cagagctgag gtttgaaccc agacatcctt gttctggaat ctgtgctctt aacctttacc 37741 cactattgcc ttactacaaa gcatattcaa agtttgcaga ttctaggctg ggtgtggtgg 37801 ctcatgcctg taatcccagg actctgggag gctgaggtgg aagaactgct tgaggccagg 37861 agtttgagac cagcctgggc aacagggcaa accccatctc taccaaaaat acaaaaatta 37921 gcccagcttg gtggcgcatg cctatagtcc cagctactca ggaagatgag gtgggaggat 37981 cacttgaaca gggaggtggc ggctgttact ccagccgggc aacagaccga gacactgcct 38041 caaaaaacaa acaaacagcc cacgaagttt gcagattctc tcagaaagga gaggagtaat 38101 tgttgtcatt ttttgggata atgaaacgga ggctcagaga gatatagtga cttgcccaag 38161 gccacacagc tcaagaaagg cagtgctgat tttcttgcct ggcgcctctg actccagggc 38221 cagagtctga gtggaagcat caggggctcc actcacttgg acagggagac accaccgggc 38281 ttgccaatga ggtcttcatg gtggaggaca gcgtcgctcc aggcgatgcc gaggaagtcg 38341 aggatgagct tgagtgagcg cctggggtgc agcaccagct gctcgtagta cacaggcagg 38401 cacttctcct tgcctacctc catgcactgg gcgtacatca cctcgatggc cttgttccac 38461 ttggtgaggc agtcacggta gctgctgagg tcaaagcccg caatggtgac tttgcgcgtg 38521 atcatggagt gcacggaggc ccggccgtcc cgcaccatca gcaggaactt ggagttgggg 38581 aacaggcgcg acaggtagac cgaggacttg agcgtaaatg ggtccttgtt gcagagcacg 38641 cgggccggct ctccgtgctt ggcaatcacc tccaggatga aggcctgcat ggcggcgtcc 38701 agcacctcat ccgtcacccc cgcctcatcc agccgcagct tctcacggcc agacttggac 38761 caggcctggc gcatggccag cacgcgcggg atgatgcggg tctcctcgcc gcagcgcacc 38821 tcggggtgcg cgtccagcat ggcgcgcatc aacgtggtgc cactgcgagg cacgccaccc 38881 acgaagatga gcggcatggc cttgccatag cggtattcca cgtggttggt gcccaccatc 38941 accagctcct cctgctcagg ccgcatggcc ccccgggggc tccgcaggcc cgccagcacc 39001 gcccggcact ctagcacctg ctgtcccagc tgaaccgcca gcaccaggac cagggcgcag 39061 ccggctgcca gcagcaccct ccgcaccgac aggcgcatgc tgggccggag gcagggtagg 39121 cctggcctga gggcccgctt ctggggcttc agcgacaggt tagcgggcag cccgccaggc 39181 tcacatctgg ggagagaggg ggacatgcat agtcagggag gtctgtgagc tgtgagctct 39241 ccaataactg caaagcccta taactgcaaa caagaggctg ctgacatgaa tgcagagggc 39301 acctttcata agcactagct gtgtgcctgg caccctgctg ggcacttttt tcaatttttt 39361 ttgttttttt aaagagatgg ggtctcacca tgtttcccag gctgctcttg aactcgtggg 39421 cctgggtgat cctcctacct gtctcccaaa gtgatgggat tacaggcatg agccacccgg 39481 tccagcggat catgaggtca ggagatcgag accatcctgg ctaacacggt gatatcccat 39541 ctctactaaa atacaaaaaa ttagctgggc atggtggcgg gcgcctgtag tcccagctac 39601 tcgggaggct gcggcaggag aatggcgtga acctgggagg cagagcttgc agtgagccta 39661 gactgcgcca ctgcactcca gcgtgggcga cagagcgaga ctccgtctaa aaaaaaaaaa 39721 aaaaaaggct ggagtgcatt gagtggtgag agatcatagc tcactgcggc ctcggactcc 39781 tgggctcaag caatcctctg cctcagcctc ccaagtacct gggactacag gcacacacca 39841 ccgcacctgg ctaattttta gaaacagggt ctcgcttcaa gtgatcctcc cgcctcagcc 39901 tcccaaaatg atgggattac aggcatgagt taccccatct ggcccaagaa tattagtgat 39961 ttctatcata atcattggtt atttacttat tccaacattt aatgagcacc tgctatgtgc 40021 tgggagcctc agcctccaaa gtctactctt accatggacc tgtgcctgag agagattgat 40081 gggttctacc tgcgtaggtg agggaaggcc tcctgcagga ggtagccttc atgcagaatt 40141 ttcagtggtg ggtaggaaac aggcaggtgg agaagggaaa caaggatgtt accagcaaag 40201 ggaatagcca gcgcagagcc acaggcagga aaggatgcag cacgactggg gaactgcaag 40261 tgacttagtg aacggagggt ttggtggaag ccagtgggaa gacagcatgc ctctttggat 40321 gtgaggttcc acccaaaacc acccaagggc ccagtctgca gctgggccta tccacctgca 40381 agggcttggg ctgcctgggc cttcttaggc tatagcaagt ggaaaataaa catcaaagcg 40441 gagggaacac agtctcaggc ctggggtgga aactgctaag aggctaggac ttcctggcca 40501 tgtggcttgg gagagtcact tccctcatct tagccacagt ctcttcagct gcaaaatcag 40561 ggtggatcca ctgacctcag aggctgctgt gaaaatatga gaagcagctg acaaaccaga 40621 gatggatctc aggctggggt caccggcctc tctgtgcatt ggataatttc cctctcttca 40681 gaaaaacgtg ctgatgaagt tctgctgaag aggttggggc ttcatttgta ccctcgattc 40741 agggtctgtg atggagaaag ctaattcaga gctacttcct gcggggaggc aggctgaaca 40801 ggacatgaag ccattccgaa atccctgcaa cgaccctttg tcttcatcag gtgctcagtg 40861 aaattctccc ctgagacctg tgaccagaat cctgtcagtt attaggattt cagtgacagc 40921 atccccacct tggctgattt ttttgtgtgt ttttagtaga gatggggttt caccatgtta 40981 gccaggctgg tctcgaactc ctgacctaag gtgatctgcc cgcctcagcc tcccaaagtg 41041 ctgggattac aggtgtgagc caccgcaccc ggccagcttc agcattttat gttcagcaaa 41101 ggtgttcgaa attatggagc ataaactctg ctaaagcatt ttccctgcag tatctaagct 41161 caaagagatg aagcagcttg cccaaggttg catggccagg gcaggaatcc tggatcagac 41221 caggccaagg ctgaagaaaa cagctcaaag caggcaagtc agaagtgagc agaagacagc 41281 aatgacctgc acgatggctt aagaagaaaa agcagggtga cttgggacac agaaagcagc 41341 ctggaaccca caaagcagaa agtagcttcc tggttaccca gagctaggga ccccactgtg 41401 gaaggagcta gagggcccag cccagggaac taggcctgcc caacaccctc atcaccctgc 41461 tggaccctgg caagggaggt atcctggcat cctcgcagtg aggatgatag gtgagggctt 41521 ctaacttgga gtactacact ccaaggttca aatctcactt ctgctgccac ttgccagctg 41581 gtgatctgga caaaaggtat cacttctctg agcttctact ttcttctcca aaaaaggaat 41641 aataataccc ttttctcaag gaagctattc agaaggccaa ggaactcctg tggggtactg 41701 agaaagcata cctaaaacac taacattagg agaacgggac acaacagcgt ttgggttctg 41761 gcttgaatct gctccgtgac actgggcaag tcccccgccc actttgggac tcagccttct 41821 ctctcctatc acctggagaa tcgtcttcct aatagacaag gaagatgctc atctctctat 41881 gatgcttgct gggatgcagc tttttctagt accagatttc caacctgctg gctgaacaga 41941 aaggtgaaaa ccagccaggg gagtttgccc aggcacccaa tcatggaggc tgggggcatg 42001 gaggggactt ttggagggcg gtctcaggag gggccaggac tccaggggaa gttccttcta 42061 ggtgcactta cctcccttgg gcactctcat ctctgggctc cagcccttgt gcctgtgtgc 42121 caaggctgtc tgctggcaga gccctggaga ggcaaaggag atattgcatc agggatgggc 42181 actgtggaga ggggccaagc cttcaaagag gcagagtgag ggtttaggct cccaagagca 42241 ctggacagtg acatttggga gaaggactct ggaatgcagg ggaggctgct tcagctcaag 42301 ataccacagc tggagtctca acttatctac tcctacaggg aatgaactgg ctcacttctt 42361 aggcagagag gtgtgagctg aagtctgagc taaggcgccg ctgccttagg gagcccctgg 42421 tttatgccaa ccaggcctct cccctcctct tggcctaggg accttggact ccaggaaaag 42481 taaatataac aataagcaac aacaaccacc actacacacc actattgcca cctgagatgg 42541 ctgggtatgg ttgatcaggt tgtggactgc acaattccaa atggtaccac aggcccagcc 42601 tacaaggaat atgctgcttc ctggaattga acacaacatc ctactgtcac cacctaacac 42661 tgaccaaatg ctgaccacgc aaatacctgt gtcatttaaa ccccaataaa aggaaagtgt 42721 gatgatccca attttacaga tgagaaagca tgacaccaag aggaagagca acttacccaa 42781 ggaatggatg gaaagagttt caccatccag atctgtaaga ccttcctaac acttaatctc 42841 ccatggttac aggaacgagg ctgtgtccac gatgagtcct ggcatgtagc cagaaccagg 42901 ccaaagtgga atcaagggga tccttatagc tttagggcag ggagtatata tctcccagtg 42961 ctgtgtcctt accatttagg gtagggaaga tttttagtcc aattcaaacc ttcccccatt 43021 ttacagatgg gacaactgag gcccacaagg aaaaatcatt ttacagactc acatactgag 43081 tctgaagcta aacctcccac ctgtcagtcc tgtttcagga ggtgggtagt ttggaatgca 43141 gctctggagt tggccaactc cttgagtata catatcactc ctctcagact tttcctattc 43201 tgtaaacagg agattttaac cagggtaaca acactttaaa atgtcagctt tttaaatctc 43261 agccaggcct ggggtggtgg ctcacgcctg tgatcccagc actttgggag gcagaggcag 43321 gaggatcact tgagcccagg agttagagac cagcctgggc aatatgggga gacccagtct 43381 ctacaaaaaa taaaaagtta tccaggtgat ggtacctgcc tatagtcccg gctacttggg 43441 aggcttagat gggaggattg ctgaagccca ggaggtcgag gctgcagcga gctgtgattg 43501 catcactgca ctacagtctg ggcaacagag caagaccacg tctcaaaaaa aaaaaaaaag 43561 aaaaaaaaaa ctcagctttt attatttgtg ccatgatggc aataaaccgg ctgcagggtg 43621 ttgactgcat ttcttggtcc tccattcctt cctattcact cactcctttt ctccatacca 43681 ttttcctatt cctgaaatat caggccaatt aggacatcta aacccagatg gaaaagggaa 43741 ccaaggtgta accacagcaa ctagaccccc ccacagcagg ccacccccaa actatagata 43801 ctcaagcccc ggcactcctg ggacatgggc ggtcaatgat gtgaggcctg ctctcttgga 43861 ataaacaaaa ccaattagga accaggcact tgaagctctt ccagaggcag gatggctcgc 43921 ccagctgccc tcagacagct cttgaaatgc cagcgagccc aagaggcctg ccagccaggc 43981 agctcccatg gcccacgagc ccagccatcc ccctccaggg aacacaggcc ctttcagagc 44041 aatggaacag ggctgtcagg gctgtcagtc atttggagct gtgcatccaa catgcagcta 44101 ccagccacat gtaactattt gaactcacct tatgcaacat taaatacaat taagaatcca 44161 gcttctcatt tgcactagcc acactgcaag tgcccagaag ccacatgtga ccagtggcta 44221 ctgtactgga cagtgcacat ttaggatggg gtaggccaat gatggcccaa tccaacccag 44281 ggcctgtttt caaatcactc ttgaactaag aatgatttgt aaattttatc aagggaagag 44341 ggtgaaggga agaggaagag gagagagagg aggaggagca atggcaatag aaaccacatg 44401 aacaccacaa aattggaaac atttactatc tggacattta tagaaatttg ctgacctctg 44461 cactagaaca ttcccatcac tgcagagtgt tctgctgggc agtgctgatt agagggcaaa 44521 tgagagggag aggtaggagt caaagagctc tccttttttt ctttttttgt tgagatgggg 44581 tctcactctg tccagcccag gctggagtgc agtggcatga tcttcgttca ctgcagcctt 44641 gacttcctcg agctcaagcg atcctcccag atcctcccat ctcagcctcc caggtatcca 44701 ggactacagg tgcatgccac catgcctggc taatatttgt atttttgtgt agagatggag 44761 tcccgctatg ttgcccaggc tggtctcaaa ctcctggact caagcaatac tcttgcctgt 44821 gcctcccaaa gtgctgggat ttcaggtgtg ggccacttca cccagcccct ccttttcttg 44881 taggaggaat atattaaata cacagcaccc ctgtgtattt aacaccaaag caagggcttt 44941 ccatgcccta cctggtttca tactcacatc agcccagtca ggcaggtggt ggcatgttct 45001 ccattttaag ctaagaaagg cccacgtggg tgcagggtga gcacgactct accacacagc 45061 aaaatcgcca acaaatagaa accacactta agaaaaaaga caagaaggaa atatgactca 45121 actgctacca gttaaccaag cagggggtgg tgcacgcctg tagtcccagc tactcgggag 45181 gccgaggttg ggggattgct tgagcccagg agatcaaggc tgcagtgaac tatgatcgag 45241 ccactgtact ccagcttgga tgacagaatg agactgcctc aaaacaaaga aacaaaaaac 45301 attgctacca ggggtggtac tcgggtggta ggtgatattt ttctcctaaa aacataactt 45361 gatggtgtca tcttattgtc tttccatatg ataaaataat aaagtgacta gtaatttttg 45421 aattgtatgc catacatgtt atgggcccag actgcttggg ttcaacattc tgctgcacca 45481 gctactggct gaaggacctt ggcaagtcac ttagcctctc tgagacttgg ttttctcgtc 45541 tgtaaaatgg ggatgataat acaggtcaag tagccctaat ccaaaatttc aaaatctgaa 45601 atgctccaaa atccaaaact tgttgagtgc caacatgatg ctcacagaaa acgctcatcg 45661 gatgtggggc atggtggttt acacctgtaa tcccagcact ttgggaagcc gaggtgggag 45721 aatcacttaa gcttgggagt tcaagaccag cctgggctgt taatacagcg agaccccatc 45781 ttgaaaggaa agggaagggg aggggagagg gagggggaaa ggaggcgggg gaggggaagg 45841 gggaggacat ggcaccaccc gcctgattgg gctgatgtga gtatgaaacc aggtagggca 45901 tgggaaagga aaggggaaag gtaaggaaag gagaaagaaa aggaaaaaga aagaaagaag 45961 aaaggaagga aggaaaaaag gaagaaaaag aagaaaagca aggaaggaga aagaaaaaag 46021 agaaagagag aagaaagaga aagaaaaagg aaggaaggaa gcaagcaagc aagctcatca 46081 gacattttgg agtatggatg ctgaactgat tagtatgatg caaatattcc aaactcggaa 46141 aaaattcaag ctaggtgtgg tagctcatgc ctgtaatccc agcactttga gggctgaggc 46201 aggaggctcg cttgagccca ggagtttgag accagcctgg gcaacaaagt gagaccccat 46261 cactacaaaa aaaaattttt ttttaattag ccaggtgtgg tggtgcttgc ctgtagtccc 46321 agctaattgg gatgctgagg caggaggatc acttgaaccc cagaggtcaa ggctaaagtg 46381 agccatgatc atgccactgc actccagcct cggtgtcaga gggagcccct gcctcaaaag 46441 aaaaaaaaat ctgaaatcca aaacctttct agtcccaaga atttcagata aagggatacg 46501 cagcctgtaa cacctcctag aagggcactt tgaaatgtgc ctagcgcaaa tgagaaacta 46561 gtcttgcagg agtgctggga ggtataaata aattagggca tgtcggaaca gcacctggcc 46621 cagagcaagt gctacctgcc tgctggctct tgttttcagc aaatccttgt atggtggaga 46681 tttttatccc catctgagaa gtgaagaaat tgaagctcca agagatggag tgattagcag 46741 aagccaaaca acaagtaaga ggaaaggcca gtttccaaat gcggttctgc ctccaaggtg 46801 ccaaccctag cgtggtgtca tgcggctcat aaaacagggg cagcagccag gcacaggggc 46861 tcacgcctgt aatcccggca ctttgtgagg ccaaggtgga cagatcacga ggttaggaga 46921 tcgagaccat cctggccaac acagtgaaat ctcctccgta ctaaaataca aaaaattagc 46981 caggcatggt ggcgcgtggc tgtaatccca gctactcagg aggctgaggc agaggaatcg 47041 cttgaacctg ggaggcagag gttgcagtga gccgagatca tgccactgca ctccagcctg 47101 ggcgacagag caagactccg tctcaaaaaa aaaaaaaaaa aaagaaaaag aaaaaagaaa 47161 aaaaaaataa tcagattaag ccagaatcgg ttgttggaaa cagcggggtt tcatttcagc 47221 aggtgtggtg actcacacct ataatcccag cactttggga ggccgaggca ggtggatcac 47281 gaagtcagga gatcgagacc atcctggcta acacggtgaa acccccctct ctactaaaaa 47341 aaaaaaaaaa aaaaaattag ccgggtgtgg gggatcgcgc ttgggagctc ccagctactc 47401 aggaggctaa ggcaggagaa ttgcttgaac ctgagaggca gaggttgcag tgagccgaga 47461 tcatgccact gcacttcagc ctgggtgatg gagcaagact ccatctcaaa aaaaaaaaca 47521 aaaaaaacaa atgggcagca ggagaccatg ggaggaaact ttgctccctc tttccattga 47581 ccctgaaatg tccctggaag ggaggggcta atcaatctaa gagatcccct tagactcaga 47641 ttcattcatt ccaccaatgt ttactgagca agtaccatgt gcggggccct gctcaagata 47701 ctgggataca tcaagggcaa cagacaaagg ttcctgcctt catagagctg acattccagt 47761 ggggagccaa gaaaccagta tgtgagacaa tgacaccagt gataaaggct cagaagcaaa 47821 tgcccatgat gagaggtgga ggcattcccc aggggttaaa gaaggaaaga agggagtgac 47881 atttgcattg agaccacact gaattttggc tcaaatctat aggaactttc taaatcatca 47941 tttctccaag ctcagcagat caggaggttc ccaagagaaa agcggcaggg gggcagggag 48001 gggacaacag cggggagctg gcagtgcaca aagccagctc aggaggtcac tgcacttaca 48061 gagtggccct ggagcagggc ccagtgccca cgtgctcagg ctcagccatc agccctgcaa 48121 ttcctcccca tgggaaccag ctcccaacat tccattgcca tggctattcc caaccctacc 48181 agaccacgct gaccacacca actcccaccc accacgcctc aaccttctgc agagctgggg 48241 agacgcaggt ctcttgggca gggtccctgg gaatcaggga cctctgacct attgcggggg 48301 gaagccagcc aagaaagccc tccagattcc aatggggtgt gagtaaggga aaaataattc 48361 tgaagctctg gacactgctt gagcataaaa agcatgaaga gtggttcaag agggaaaaag 48421 ggcagggagg ctggcatgca gccttaaaaa tattagctca ggccaggcac agtggctcac 48481 gccaggcatg gtggctcatg cctataatcc cggcactttg ggaggctgag ggggatggat 48541 tgctacagca aaggagtttg cgaccagcct ggaaaataag acaaaaccct ctctttacaa 48601 aaattagctg ggcttgatga taatacaccc atagtcccag ctacttggga ggctgaagtg 48661 ggaggatggc ttgagcctgg gaggcagagg ttgcagtgag ccaagatcac accactgcac 48721 tccaatctgg gtgacaggga gagattctgc ctaaaaaaca aacaaacaaa caaacaaaaa 48781 aacattagtt cagacttaga gcccttatta tataccaggc actgaactaa gggttttaca 48841 tatatcattg cattttgtct taaccattga ccctatttga ccctccctta ttaaccatcc 48901 atttcctggg gctcagagac gtaaaggaac ctgccgaagg ccatgcagct gggaggcagg 48961 ggagctagaa ggaaaactca agtctctctc tttccagagc ctgtgttctc ctccctgtct 49021 tgctttgcat ttttttttac ctaaaaccat caaagcaggg tcccacacac tgtcccagtc 49081 tcccgttcag aggcccagtt cagcactctg ttagggacta gccaggggtg gggactgatg 49141 aggctggggg tcacatcctg gctcccccac tcaaaggctg ggtgtcctgg gtcaatcttg 49201 ggcagctctc tgagcctcag tgttctcatc tgagaaatgg gaataacacc tggaagagtg 49261 tcatgggaac tatgtaactt ccatcaagcc aggcagagcc agcataagaa aactattacc 49321 atttaattct tttccttttc ttttcttttt cttttttttt tttttttttt ttttgagaca 49381 gtttcttgct ctgttgctca ggctagaata cagcggtgca atcatggctc actgcagcct 49441 caaacacctg ggctcaagca cattctccca tctcagcctc ccaagtagct agtgcaccac 49501 catgcccggc taatatttta ttttttgtag aaatggggtc ttaatacatt gtccaggctg 49561 gtctcaaact cctgggctca agtaatcctc ctgcctaggc ctctcaaagt gctgggattg 49621 caggcatggg ccactacgcc tggccgcctt ttgatgttag gatcagtcat tacagcacat 49681 ggataaagac tggcagggac atacacacaa actaaggaac gcgtggtccg gtcaggtggg 49741 atcaggggtg acttcttctc cctgttcgtt gtcatttaga gtcaaaatgt acagggccaa 49801 ctgtgcattt ttactctaat gacaaccctg gagggtgggc gctattatta gccacgtttt 49861 accgaagggg aaactgaggc ttagagcggt gaaatgcttt gttggaggtc acacagatgg 49921 taattggttg agcctgaact gaaactcagg ctatctgatg ccagaaccca aactgtttcc 49981 ccaactttcg acgaccctgc tttatcatct ctcttggggg taggcaagct tctgtgaagt 50041 cataaaaccc atgtttttgg ctctgtgggc cacaagagct ccactgcgag tttgcaactg 50101 taatgcaaac aaccaagatg acacatacat gaaggcaagt ggctgcgttt ccataagact 50161 agttacgaaa acaggcggtc agctagattt gtcccgtggg catagtttgc caaccccctg 50221 gtctatatca cgaaaagaca ctatcagcca ggagcagtgg ttcacgccta taatcccagc 50281 actttgggag gccaaggcag gaggatcacc tgaggtcagg agttcgagac cagcctggcc 50341 aacatggtga taccccatct ctactaaaaa tacaaaaatt agccgcgcat ggtggcgggc 50401 gcctgtagtc ccagctactc gagagcctga ggcaggagaa ctgcttgaac ccgggaggcg 50461 gaggttgcag tgagctgaga ttgtgccatt gtactccagc ctgggtgaca acagcgaaac 50521 tccatctcaa aaaaaaaaaa aaaaaaaaaa aaaagacact atcagcagca cctgccctca 50581 gtgcaacctg ggcaaactca caaagcacaa atacagtgag ccgaggtgga agccagtttt 50641 ggttctggga cttttctcag tagggactgt gggaggaccc aggtggccag cctttggtaa 50701 gagaaaaatg tacaattcgg gaaaacaaat agttgattga aaagaaaaag aagtgcacac 50761 tcatgcatag ggaatcatcc agcatcctgt ctgttgccac gtaaggacta gcagagaggc 50821 ttgtgcatgc ctggcagttt tttttttttt ttttaaagag acagggtctc cttctgtctc 50881 aaaggctgga gtgcagtgga gtgatcactg ttcactgcag ccttgaaact cctggtctca 50941 aatgatctgt ctgccccagc ctgccaagta gctgggacta caggcaagtg ccaccatgcc 51001 cagttaattt tttatttttg tggagacggg gtctcactat gttgcccagg ccagtctgag 51061 tttaatcctc ccacctcagc ctcccaaagt gctaggatta taggcatgag ctactgcacc 51121 tggctggcag ttttatatac agtcgcccct cggtatcata agggactgat tacagaatat 51181 ccctcgaata ccagtataca tcaagtccct tatataaaat gatgtaatat ttgcataaaa 51241 cctacacgta tcctcccata tactttaaat cagtgctaga tcaattatac tacctaatac 51301 aatctaaatg ctatgtaggt agttgttaca ctgtattgtt tagagaacaa tcacaggaca 51361 aaaaagctct gttcatgttc agtacagatg caaccaccca ttttttcccg aatatttttg 51421 atccactgtc ggctgaatcc ccgcatgcag aacccaagaa gacggacagc caacaatata 51481 ttttgaccct catgacaggc ttggagggta ggcactatta ttattagcca cgttttacag 51541 aaggggaaac tgaggcttag agaggtgaag tgctttgttg gagatcacac agatgataag 51601 tggctgagtt tgatctggaa ctcaggtcta tctgatgccg gaacccaagc tcttaactac 51661 ctcttccaaa tctcgacccc accccataca cagatgggga agctatgtgc cccaggtcag 51721 acagctgctc agagtagtgc ccggctcaga ctcctggacc atctcccagt cttgcccctg 51781 acacctggca cagcccctgc tcaggacttc ctgcccagaa acacagaggg cccctgcctg 51841 gtaaacaagg caaacaggaa agggccagcg tgatcttgta gggaccagtg cttcctgagc 51901 cctcaccccc acccaccatg ggatcccgtc tctactcctc tgtccagaag ctgtcagagg 51961 taggggactg agcatagggg caggaatcct ccacacctgg ctgattccta gcaatgcagc 52021 cctgggttgg gggtcagtgg ggaggtgtta tctaactact ttgtgcctca ctttcctcat 52081 ctgtgaaatg gggaggccat taacacttgt gaagactaaa taaataaact tgcagtggct 52141 catgcctgta atcccagcac tttgggaggc caaggtgggc ggatcacctg gggtcaggag 52201 ttcaagacca gcatggccaa catggtgaaa ccccgtctct gccaaaaata cagaaattag 52261 ccaggcgtgg tggtgaatgc ctgtaatccc agctacttgg gaggctgaag cacaagaatt 52321 gcttgaaccc aggaggagga ggttgcagtg agccaagatg gcgccactgc actccagcct 52381 gggtgacaca gcaagactct gtctcacaat aaataaactc acataaagca ctcagaacag 52441 agctttgtta ctacctgact gttattcctt ccctttaggg tagtggttaa actttggtgt 52501 gcttctgaat agcccaaaac tttaaaaggc agatttcctg gggcctgccc tcaaatattc 52561 tgatttggaa gaggcaaggt ggagctgagg gaatttgcat tttaacgtgg agctgaggga 52621 atttgcattt taacaagcgc tgccccttgc tccctctctg tgaaacaccg cttaaggcac 52681 tgaactcagc gttctagtag gctggtgcaa aagtaatgca gtttttgcca ctgaaagtaa 52741 taccaaagac cgcaattact tttgcaccag ctaactattc actggctgcc cacctgggcc 52801 aggctcagga atgcaatcat cgtggagtgc aggggtcagg agtgtagacc taaagttgga 52861 cagacccagg cgatcttggt gtggccactc accagtggcc ttgagccagt gacttcttgt 52921 ctctgagcct cactcacctg ttataacgtt taacatttcc aaaataagtc tgaaaacggc 52981 acctaccttg tggagctgtt gtgaagctgg caaagagtgt gagctcagag gcatcagcta 53041 caattctgta ccattattta ttaggacgga cacagtcctg cccataatga gtgtcacggc 53101 cattgagccg gcagacaatc aacccagtgg gatggatgcc cagacagact ggggtgtgca 53161 ggaggcggtg ggaacacgtt cagggcacgc tggagcaccc agtgtcacgt cctgcccaga 53221 agacctaata acacccaaca gacaggcacc acaacaaagt ggcatcgact cccactgcag 53281 gcctcccacg gcaaccctgt gggggagaca gcccctctct ggctggatca tgaataagaa 53341 atccagaggc ggaatgggct gctcacactc tgctcacact cacacagcag gtgcagggca 53401 gcatgaggac tcaaacccag gtctaatggg ttcttccacc attttgtgcc atggatctct 53461 tcaaccatct ggtgaaatct atgggcccct tcccagaata atgcttttaa atggaataca 53521 ataaaatata caggatgaaa aggaaaccaa catagtaaac tttggctaag aatgtgatta 53581 caaaaattgc tgtgaaatat tatctgtaac ctatctgctc ctttattaaa gcaatcaata 53641 acaagatcta gtagcagaca gaataatcac tgtcatctgc aagcagacac atctgtggca 53701 cctaaaatgt gatataaaaa tatgggccag gagcagtggc tcatgtctat aatcccagca 53761 ccttgggagg ccgaggtggg cggatcacct gaggtcagga gttcaagacc agcctggcca 53821 acatggtgaa accccgtctc tactaaaata caaaaaaatt agccaggcgt ggtggcagcc 53881 acctgtaatc ccagctactt gggaggctga agcaggtgaa tcacttgaat ctgggaggca 53941 gaggttgcag tgagccgaga tcgcgtcatt gcaccctagc ctgggcgaca agagcgaaac 54001 tccatctcaa aaaaaaagtg tggtttctac tggtgatgcg agtgctgcca ctactactgt 54061 gatttatcac ctatgttgtc acctggagga aatgttaaac tagggtcaat aaagatgtaa 54121 ttttcccctc atccaaattc ctaggccccc gaatgcagcc tggaacccca attaagaatc 54181 gctgccagca tctggctccg ttatggacac aaaatctatg gctgttaggg ccagggagcc 54241 atggtcctgc ctcaggagtc tagattcacc ttccttccca ttcctgcccg tttcctcgcc 54301 ccaccatcaa acacaagagc agtagggatt tgcaccagaa acacttgatg gcatattgtg 54361 gagcatggtt tgcatgttct gattttattt taagaagtaa aagtggccag gagcagcggc 54421 tcatgcctgt gatcccagct gctcacaaag ctgaggcagg aagacagctt gagcccagga 54481 atttgagacc cagcctgcgg ctatcaacat agcaagcccc aaatataagt aggtaggaga 54541 tggcaactgt tataaaagaa ccatggctcc tctccacgct ggcatcacac actggcatgt 54601 gaccttggct actaagttag cagtgtgaag gccaacaggg accaaagtgg gggtgaccct 54661 ggcagaggca ctgtacacat ggccctgagg gaaagcaaac atccgcagtg gatgagggga 54721 ggcacaggca ggtggtcaga gtcccatgct gccagccaga ctgggcgtgc acctgcatct 54781 accccagacc cttggcagca tcacctccac tctctgggcc tgagtttcct catctgtaaa 54841 atgggcccaa taattcttac tagcgatgct gctgtgagga ttaagtgaga taggggctta 54901 ataagggaaa gaccagaggg gtgtggctaa tgccttgcag ctcttcaata actgtgagtc 54961 ccatctgaat ctaccgatga gaaatcagca gtggtgagca ggaaaagtgg ggaaaggaga 55021 ttggcaagct ctagactgtg ccaggctttg aatgccaggt aaggagcttg gagttatcct 55081 ggggacggtg ggaaattgtg agcattggtc tggttgtgac ctgggagggg acacaggctc 55141 tctctggtgt aggatgcctg aaaagacaag ggctggaggc caagtgatag ctgctggggg 55201 acccagggag ccatctcaaa gaagagagat ggggtcagac gtggtggccc atgtctgtaa 55261 tctcagcact tcaggaggcc gaggagggag gattgcttga gctctggagt tcaagaccag 55321 cctgggcaat gtagtaaagc tccatctcta caaaaaaatt aaaaattagc cgggtgtggt 55381 ggcatgctcc tgtagtccca actacttggg aggctgaggc cagaggatca cttgagccca 55441 ggaggtcaag actgcagtga gctgtgatca caccactgta ctctggcctg gttgacagag 55501 ggagactccg tctcaaaaaa atatatattt tatataaaat ttatatatat atattgtata 55561 aaaattatta tataaaaata tatataatac acacacacac acacacacac acacacacac 55621 acacacacac acacatatat atgtcgccca ggctggagtg caatggagca atcacagctc 55681 actgcaacca ccatctccca ggttcaagtg attctcctgc ctcagcctcc tgagtagctg 55741 ggattacagg cactgggcca caacactggc taattttttt gtatttttag taaagacagg 55801 gtttcatcat gctgaccagg ctggtctcca actcctgacc ttaagtgatc tgcctgcctc 55861 ggcctcccaa agtgctgcga ctacaggcat gagccactgt gcttggcctt aaacaaaaca 55921 aaacaaaaaa attaaaaagg agggagatgg gagagaaggg agaaagcagc aaaggagaca 55981 ctgtgggcag gaggaggatg tgatgtggag agagccccct actgtggtcc tcacacatga 56041 ctcactggct ggacagacaa atcatagctt ctcttggggt gggcttgctg gtctcagccg 56101 tcagatgggg agacagagag aaggaaggga cagagtggtg ggggagaccc tggccaccct 56161 agggagagca aaaggcttgc ctggcagggc aggaacaggg tgaggcaagc agggcattgt 56221 ccttgagcac aaaaactcaa gggggtgcca aaaaactcag taaccaagat caataatgtt 56281 tcaatgcaat cttttttaga tcagtggaaa aaaatccatg acaaacaaaa aatataaagt 56341 gtttaaatag gacaagtgat gttgcgctgt gctaagccac actggagcca aggcagaaag 56401 aaaaaaatca gtaatactga tcctgacatt atttacaatt tttatatgtt gtttttcatg 56461 gatatttttg catttagatt gtttcataac agctttactg agggacaatg cacatatcat 56521 ataactcacc tatttaaagt gtacagttca tgggcccggc gcggtggccc acgcctgtaa 56581 tcccaacact acgggaggcc gaggcaggcg atcacctgag gtcaggagtt cgtgaccagc 56641 ctggccaaca tggtgaaacc ctgtctctac taaaaataca caaattagca gggtgcagtg 56701 gcagatgcct gtaatcccag ctgctcaaga ggctgaggca ggagaatcgc ttgaacttgg 56761 gaggcagagg ctgcagtgag ccaaaattgc gccactgcgc tccagcctgg acaacagagc 56821 gaaactccat ctcaaaaaaa ataaaaaaat aaagtgtaca gttcaatggt ctttagtata 56881 atcacagggt ttaatatatt cacaggtttc tagtatgtgc agagtagtac actcacagag 56941 ttttcgtata gtcatagtca attttgctac attttcatca cccaaagaga aacctcgtac 57001 ccttacgtat caaactgtaa cccccttctc ctcccagccc caggagacca ctactctact 57061 ttccgtctct atgagttcgc ctattctgaa catttcataa aaatggaatc acacaaagca 57121 tgttctttaa tgaccagctt tttccactta gcacactctg ttcaaggttc atccgttagt 57181 tttgattttt caaattttgc attaaaatat tatcttgatc acttgatcat cccctgctgc 57241 ctagggctct cccaggtcaa accctggttt ctgcaaaccc ccacttaaca gcatggttct 57301 attcacacct tcacgcagaa gccagggagc ttggggctgc cagaaacctc ctagcctgga 57361 cagcagggcc ttctaggagc ctctgaggaa atccacacgg tgtgacagac cgggtataga 57421 agcatatcat atcaatgtat atgtatcata tacatatcaa tgctgtaata cgatataaaa 57481 taagagaaga tgatttctag atttgatgca taactttggg tccttgagct cgggaatgca 57541 gagccagcag tggcctcgga gaaagcctgg gcgagctcag aaaggagagt gtggccaagg 57601 tcacacagca agttaatggc acggccaggc ctgaacacct ggctgggttc ttcccatgta 57661 cccagaggcc ttctctgctg gcatcatgct ggcttgctct cctaacaaga tacggtagga 57721 tggtctcctg gttaaagtgt ggctccggag tcaggctaca gatgctggct acagagctac 57781 atgactttgg gcaagacact taacctctgt ggtcatcagg ttcctcattt gtaaaatgag 57841 aatcatattg cagcagtgcc atatgacttt tgtaaggatt aaataagata ttgtgcattg 57901 cagtacactg agctccacag agacagcgcc ggggcaagtg agagccggac gggcactggg 57961 cgactgtgcc tcgctgagga aaaataacta aacatgagca aaggagatcc taagaagctg 58021 agaggcaaaa tgtcatcaca tgcatttttt gggcaaactt gtcgggaggc gcataagaag 58081 aagcacccag atgcttcagt caacctctca gagttttcta agaagtgctc agagaggtgg 58141 aagaccatgt ctgctaaaga gaaaggaaaa tttgaagata tggcaaaggc ggacaaggcc 58201 cattacgaaa gagaaatgaa aacctatatc cctcccaaag gggagacaaa aaagaagttc 58261 aaggatccga atgcacccaa gaggactcct tcggccttct tcctgttctg ctctgcgtat 58321 cgcccaaaaa tcaaaggaga acatcctggc ctgtccattg gtgatgttgc gaagaaactg 58381 ggagagatgt ggaataacac tgccgcagat gacaagcagc cttatgaaaa gaaggctgcg 58441 aagctgaagg aaaaatacga aaaggatatt gctgcatatc gagctaaagg aaagcctgat 58501 gcagcaaaaa agggagttgt caaggctgaa aaaagcaaga aaaagaagga agaggaggaa 58561 gatgaggaag atgaagagga tgaggaggag gaagatgaag aagatgaaga tgaagaagat 58621 gatgaataag ttggttctag cgcagttttt tttttcttgt ctataaagca tttaaccccc 58681 ctgtacacaa ctcactcctt ttaaagaaaa aaattgaaat gtaaggctgt gtaagatttg 58741 tttttaaact gtacagtgtc tttttttgta tagttaacac actaccgaat gtgtctttag 58801 atagccctgt cctgatggta ttttcaatag ccgctaacct tgcctggtac agtatggggg 58861 ctgtaaattg gcatggaaat ttaaagcggg ttcttgttgg tgcacagcac aaattagtta 58921 tatatgggga tgatagtttt ttcatcttca gttgtctctg atgcagctta tacgaaataa 58981 ttattctgtt aactgaatac cactctgtaa ttgcaaaaag aaaaagttgc agctgttttg 59041 ttgacattct gaatgcttct aagtaaatac aattttttta ttagaaaaaa aagatattgc 59101 atgtgaagca cttagttcca cacctagcaa acagcaggtg ctctaggtct tatcatcatt 59161 attactatta ttagtatctg gaatctagca tctcacggag agtcctttgt aggacatgga 59221 atgctatccc agcaggcaca ggaaggaaaa tttaggctta ggtactaggg agccatggca 59281 gatgtttgag cagtgggtga gcagcatgac tttgaagctg catagggctg gccgccagtc 59341 cagtacccac ctacagggaa catctttgcc aaggtccttc ctcgcaggct ccaaatccct 59401 ggcagtcctc ccagtgctgc tgggcaaagc cagagcacaa ctcccacagg ctacaccccg 59461 tgacatttcc agccttgttt gcttcctggg tgccagccgg gctggtgaac accaagcagt 59521 caacaagctg ggggacttga agcagcccac acagcctcaa tccacagata caaagaccag 59581 gaggcccagc ggcccggcgg gggccacagg gtggcagagg agaaactgtc cttattttgg 59641 agtcatgaag gcaagcctag ctcctccttt gctgtgtgac cttgggagag tccctttccc 59701 tctctgagct ccagtctgcc catatgcgag aaggacttat ctgcttctgt atgcttaggg 59761 gctgctggag agatgcaaca caataaaaca acaatggtgt cctttggcag cacccccacc 59821 atgagccaga cactgtgctg aacgttccag gtgtgtaact tcactaaacc ctcacagctc 59881 tacaagccag tccatgtgcc atttctactt ttaagatgca cctaattgct caactcattg 59941 gtacaacctc tatattacaa actggaggga gaaggcataa gtctctttgg agtaaaagat 60001 ccaatcctat gttcttcccc cttttctaag atgtaactgc tgttgattag tttggagggg 60061 agccatctac acttgctttt tcttttcttt tcttttcttt tttttttttg agactgactc 60121 tcactatgtc acccaggctg gagtgcaatg gagcgatctc agctcactgc aacctccgcc 60181 tcccaggttc aagcgattct cctgcctcag tttcctaagt agctgggatt acagacatgt 60241 gccatcatgt ccagctaatt ttgtactttt agtacagaca aggtttcacc acgttggtca 60301 ggctggtctc aaactctcga cctcaagtga tctgcccgcc tcggcctccc aaagtgctgg 60361 gaccaggggc gattttgcca ttcagggaac catcggtaat gtccagagat agttttggtg 60421 ttacaattag ggatcatctc atgggtagag cccacggatg ctgctaaaca tcttacaatg 60481 cccaggtaat ccccagcaca aagaatcatc cagtcccaaa tgtcactagt gccaaggtta 60541 agaagtcctg gattctacat agcctcctga aactgactgg gttcactgaa cacgaggtct 60601 ttccgtgtca gcatgtttgt tctatcatcc agtaaacatt gcttgagcac ctactgggtg 60661 ccaggccttg ttctagcacc ggggccaaat cagtaaacaa gacaaagccc cagcctccag 60721 agggcactga tatccttcca gaagaaaaga cacatcaaca tgcaacacct tgatggacct 60781 ccttccctcc ctcttcttct tttttttttt ttttttaaga cagagtctca ctccagccca 60841 ggctggagtg cagtggcgtg atctcagctc actgcaagct ccgtctcccg ggttcacacc 60901 attctcctgc ctcagcctcc caagtagctg ggactacagg cacccgcctc cacgcccggc 60961 taattttttt gtatttttag tagagatggg gtttcaccgt gttagccagg atggtctcaa 61021 tctcctgacc ttgtgatcca cccgcctcgg cctcctaaaa tgctgggatt acaggcgtga 61081 gccaccacgc ccggccctcc ctccctcttc ttaattgttc cagagcatcc cagcaggaac 61141 aacacaccat tcagttagcc aaggccttct gacagacatg taagctgctt tcatcttctt 61201 gcaatgacat accatcacaa aggaccatgc ctccttgtgc ccatgcatga aaggtgccct 61261 agggcctcac tagggtcttg caagcagtaa ggccacctgg gaggctgcta gtcccatgcc 61321 ttagaaggta cttaaccggc tttttaacca gaccccaggc gtttccatgc acaatacgat 61381 ttgagaagca cagctctagg tagctacctg tagggggtcg gcgggctcag gggccactct 61441 ccctttatat ttcagcagat actgcaagtg ccctcccaaa gaccctcctg ccagcagtct 61501 gcatgttccc cttctgcacc aactttgtgt tgccggaggt tgccgttttt gccaagctga 61561 cgattcctcc cattttccaa ctatggagac tgaggctgaa cgaggtgtct agacaaccag 61621 cgccaggggc caatgacctc taaggcctct cccagttctg acatttagga tctggccaca 61681 gcctaggtgt cttcgtacca gcctcctgat caaccactct cactctcagt agctgcttgg 61741 agcaggaagg aggaaaggtt acagctcacc cagcaaagga agcagagttc ctgagggagg 61801 gagcaggctt agtgggctgg acatctgtga gcgccactgg caaggctgct gctatgcccc 61861 ttcatgtcca gtcatgcagg acggaggacc aggctgggga cagcgcaagc cttgccccgt 61921 catgaataac aggtaacgcc ggagggccga gcagaggagg gaaagtgctc cagggccaag 61981 aagccagatg agtccccaac tttgctctca ccagttgcac atcttagcct cagtttcctc 62041 ccctaaaaat gggctcctac cactcactgc agaaggctgc agtgaacact tatgcacaga 62101 tcagtcaaca aacagcagcc accacctgaa gccttgcctt gggccggtgc tctgttactt 62161 ttattcaact tactatagcc tcacagcagc tgactggcac tgttattctc cccatttgac 62221 aaatggggaa attgaggccc agagaggtgc agtgctctgc ccttgatcga acactgagtt 62281 atagccatgg aaggggtatc cttggacctg caaagtcagc attactcatc tgaattctgc 62341 ctctgtggag ctgtgtggcc ctgggcaaac tgcttcacct ctctgagcct ccactttctc 62401 agcagtgaaa tgaggacgta acacacagag tggagagggg aagccatctg caaacagaga 62461 atactgtctg cagacccccc actgagactt cataatgcca cctccaaaac acccagccca 62521 ggaagacctg tcacctgtgg gcatctctga tctttcctga gctctagctg caacatgggc 62581 caggccatgc aaagctggta gcccccagag agggaaagcc acttgctcta gttgacacag 62641 ctggtaagta gtagagagca gagccaagca gcaagagatg aggtttcaaa cctgagagcg 62701 gtacaggcag cagccagggt ggaggcgagg gtcaggctcg agctggcagg tggctggagt 62761 ggggaccacc aggagctggc agacccttgg tggccaaggt ctctctcctg ctggctttta 62821 gctgggaggc tcctttgaag gatcagtcag ccttgcgatg ggcctgcctc tcctaaaggc 62881 tcctccctct tctggatcca gtcttgttgc tcacctgtgg actggcagcg ggcacctcct 62941 ctctggggaa tcagggcttc tcttggcggc tgtggctcag gtctcctagg ccatgcagtc 63001 tccctgtggc cagacttgcc ctggtctctg gaattccctc ttgtcatttt ggccagagga 63061 tgtcagcccc tgcctaaaac cttccacaga attctagctc cttcctgtag tttgtaacag 63121 cgccacccaa tagaaatgca gtgtgagaca cacttcaaat tctcagtttt ccagtagcta 63181 tgttacagaa gtaaatttaa taacatttta tttaactcaa tatatataaa atattatcat 63241 caatcaatat aaacattact gagatagttt atagtctttt ttttttaatt acttagtcct 63301 tgaaacccag ggtaaaacta ctttcagcac atctcaattt ggatgctaaa ttctcagtag 63361 aaatggctaa tctgtacaga gatttcataa aattgatagt tgaacaaagt atatccacac 63421 acccaaactg ttgtacacac acttaaaagt tttccaagaa tgggaatgag tatctgttct 63481 gaaattaaaa tagataaaga cttgaaattc agttcctcag ccacagcagc cacatttcga 63541 gggctctgca gccacacgtg gtttgtggtt gacgtggtgg gcatgaaggt taataaactc 63601 ctggccaggc acggtggctc atgcctgtaa tcccagcacg tcgggaggct gaggcaggca 63661 gattacctga ggtcaggagt tagagaccag cctggacaac gtggtgaaac cctgtctcta 63721 ttaaaaatac aaaaattagg ctgggtgcgg tggctcacgc ctgtaatccc agcactttgg 63781 ggggccgagg aaggcggatc acaaggtcaa gagatcgaga ccatcctggc caacatagtg 63841 aaaccccgtc tctattaaaa atacaaaaat cagccaggcg tagtgacacg cacctgtagt 63901 cccagctact cccgggaggc tgaggcagga gaatcacttg aacctgggag gcagaggttg 63961 cagtaagctg agatcgcgcc actgaactcc aacttggtga cagagcaaga ctctgtctca 64021 aaaaaaaaga aaaagacaaa gaaaaagaaa aagaaaaaaa aatacaaaaa tcagctgggc 64081 atggtagcag gcgcctgtaa tcccagctgc tcaggaggct gaggcagaag aatcacttga 64141 acccaggagg cggagtttgc agtgagccga gattgagcca ctgcactccg gcctagatga 64201 cagagtaaga ctctgtctca aagtcagtaa ataaacaaac aaacaaacaa acaaactcct 64261 taaggccagg acaacactcc caatccctct gctctcctct cttcccccgc tcatatcgta 64321 ttgtaattgt ccatcttgga gtttctcagt ctctgcacta tagacatttg gaccagatca 64381 ttctttgtgg tagtatgggg tggggctgtc ccatgcattg taggatgttt agccgcatcc 64441 ccggcctcta ccaaggagat gccagtagca tcccctccct gagtcctgac aaccaaaaat 64501 gtccccaggc attgctgact gccccctggg ggtgcagtca tccccatcaa gaacactggt 64561 ctattgcggg gtaagagggg gtgagactaa gccgattctt tccatctccc aggtacctgt 64621 ccagtgcctg atgcacaaag gtgcccaagc ctgtacctga acagaagagg cttactccac 64681 cctgagactc acctcttcca ggaagctttc cacaatgcct tagcccagtt agtgtttctg 64741 ctgggatccc acagccccct gcgctcacca ctggtgcagt cccctccaag ggtaggaaat 64801 ggccttgctc actgtttatt ttcaaggcct agcctggaag ttggtataca gacctgagaa 64861 acagggatga cttaggttga aagccatgca aggccagttt tgcccatttt aattcagggc 64921 ttcctcatcc cagcaaaaga tgctaatatc agtcacagtt ccatctatac accagtcaat 64981 gagctaatat ccacaggatt tagagaggga aactgaggct cacacatgtg aagcttgtcc 65041 tggattccaa tgaccttgag gcttaaaatt cccggttcct cctcttatgg atcaaatgaa 65101 gtcatgcatg aaaaaagata tttataatac atctcaaaca aatatttata ctgagaacat 65161 ataaagagtt ctgtgaaatc aatttaaaaa aggacaggca acacaacaga aaaatgggaa 65221 cgtctgaaga catacttcat aaaagaggtt atctaagtgg tcaataagca tatgaaaggt 65281 gctcaaactc atatccatcg gggaaacaaa tgcaaattta aagtacattg tgggatcatt 65341 acataactaa ataactaaaa aaagaaaagg ataccaagtg ttggcaagga tgtaatgcaa 65401 ccagaacttt catactctgc tggcaggagg gaaacctggt tcaacccctt tggaaaactc 65461 tggcttaact gagacaaagc ctgaatgtgc accttctcta caacccagca tttctactgc 65521 tgattataca tccatgagaa atgcattcaa ctgttcaccg aagggcacat atcaaaatgt 65581 tcccggtggc atgatcgtaa ttgcccccaa ctggacacta tccaaatgcc cattggcaat 65641 agaatggata aacaagtaaa tacaaataaa cagatatata ctaaacagcc aggagaaaga 65701 acaagccaga accttacgca ataacaacac ctgtgatgag cgaaacaagt cagcacaaga 65761 aagaccataa cgcgtggttc tgtcagaggc actggaacca cagcaactcc atctctccag 65821 tacgggctgg gtaaatgagg ctgagacctg ctgggttgca ttcccaagag gtcaggcatt 65881 ctttgtcaca gagacaggaa gtcagcagga ctgctttcac aagatacagg tcgtaaagac 65941 cccattaata aaacatgatg caataaagaa gccggacaaa accgccaaaa ccaagatggc 66001 aacaaaagtg acctctggtc atcctcactg ctcattatag gccaattata atgcatgagc 66061 gtgctaaaag acactcccac caccagcgcc atgacgttta caaatgccat agcaatgcca 66121 gaagttaccc tatatggtct aaaaaggtga ggaaccctca gttctgggaa ctgcccacac 66181 ctttcccctt atttagcata tgatcaagaa ataaccataa atatagccaa ccagcagtcc 66241 tctgggaggc tctgtctatg cagtagccat tctattgttt gtttacttcc ttagtaaact 66301 tgctttcact ttatgaactc accctgaatt cttttttttt tttttttttg agacggatct 66361 cgctctgtct cccaggctgg agtgcagtgg cccgatctcg gctcactgca agctccgcct 66421 cctgggttca cgccattctc ctgcctcagc ctcccgagta gctgggacta caggcgccca 66481 ccaccacgct cagctaattt ttcgtatttt cagtagagac gggctttcac tgtgttagcc 66541 aggatggtct cgatctcctg acttcgtgat ccgcccgcct cagcctccca aagcgctggg 66601 attacaggtg tgagccaccg cgcccagcct caccctgaat tctttcttgc acaacgtcta 66661 agaaccctct cttggggtct ggatcgggac cccgttccag taacagttcc atggatacaa 66721 agtacaaaaa caggtgaaat tctaggggga gatgtcagga tggtggttac cccggtgggg 66781 caaggggtaa ctgcaggggg agcaaggtgt cctggggcag ggaggggtgc tatgtacgtt 66841 cagctattga tccaggtgct ggtcacacag acacattcgt cttttgaaaa ttcattgagc 66901 tgtatgcgta tgaaatatgc atttttctgt atacacgtta gacttgaata aaaatagtct 66961 taacgaatca cactttcagc aagcttggtg gctcacacca acactttggg aggctgaggc 67021 aagaggatca cttcaaggcc aagagctgga ggctgcaatg agccatggtt acaccactgc 67081 actccagcct gggcaacaga gagactttgt ctcctaaaaa aaaagaaaat taatggcctg 67141 gcacggtggc tcatgcctgt aatcccagca ctttgggagg ctgaggcagg aggatcacct 67201 gaggtcagga gttcaagacc agcctggcca acatggtaaa accctgtttc tactaaaaaa 67261 tacaaaaatt agccgggcgt ggtcgcaggc acctgtaatc ccagctactc gggagactga 67321 agcagggaga actgcttgaa cctgggaggt ggaggttgca gtgggccgag atcatgccac 67381 tgcactccag cctgggtgtc agagcaagac tctgtcaaaa aaaaaaaaaa aaaaggaaga 67441 aagaaagaaa agaaagaaaa agaaagaaag aaagaaagaa agaaagaaag aaagaaagaa 67501 agaaagaaag acagaaagaa agaaagaaag aaggaaagaa agaaagaaag aaaagaaaga 67561 gagaaaaaaa ttaattaatc acactttgtc aaatgctgag gaatggtgtt gctggctggg 67621 gtatgctacc cgccccactc caccccacac actttttttt cccggctgct gcagaggaac 67681 acacaagact ctttgcattc attcaatcaa cacttgcagg acggcgctct cccagggcta 67741 gtcggtcctg ggcgccagga acttcaagga caattgagct tcagcctctg atcatgccac 67801 ccacaccccg gctcctggca tggacacctc ccagacccac actcaataaa tgagcagcca 67861 ggccacggtg gcttttgatc agatgccaag gaggttatgt ctgcttgtct tcctctttct 67921 ctacagcccc atgaaaacgt ttacagggtt tttccttatg cggggccaaa cccagccatt 67981 ctggcacagg tatcaggagg tcacacttgg ctactgtggc cctctctcct ccctccagtt 68041 caggagatac tagcgcccct ggatgctccc cacaggctga catctcctga attgtcatct 68101 tgaacccaaa tctctctcct cacttggaca cagttgccca ctctcttcct ccctgcacct 68161 gggtgttgag cctggtcctt tccccagcat accccacctg cagctggtag cagcagcata 68221 ccccacctgc agctggtagc agctccatcc tttcatgctg gtgccaaaaa ccttgaaggc 68281 agctttgatt cttctttctc accctaatcc aatccagcag caactcctgt taacttcacc 68341 tttaaaacag agccagaatc cgccatttct caccacccgc actgccagac gtttgtttca 68401 ggcgttcact tcttaccact ctggtccaag ccacctcatc actagcttcc ctgattctgc 68461 ccttgtctca tcagttgatt ttcagcaagc agccacagtg accttaggaa aacggaagtc 68521 aggctatggc cctcctctgc tccgcatcct cccatggctc ccactttact caggaaaaac 68581 cagtcctcat ttgccttaca ggtttgaccc caccccttac ctttctgccc ccaaccccac 68641 tggcccttgc ctcttctcca gccacgctgg cccccttgct gtcactcaaa cacacaagca 68701 ccctccctcc tctgggccag tgccctggct gtgcccgctg cccggatgcc ctttccccaa 68761 acatccactt ggctcactct ctcaccctcc tgtcctctct cacatcgcac ttcttcaagg 68821 cctcttctga tcaaccagtt agtgctgcaa cctgccccgc caacacggca ctccagaagc 68881 cctcaccctg cgtgattttt tttttccccc agcactcacc accttctaac acatcatatg 68941 attgacttac atattctgca ttcattgtct gggagccttg aatgctctgt ttccggagca 69001 ttgcagctca gtaggtggtc aattaatatt tactagatag agttgtcact ctgcacccct 69061 tctcctaagg ccacagaatc ctgaccggcc cctctcagca tagcaaccgt gaaatggcca 69121 attctccact agtctgtcac ttggatgctt gacagcattt caaactcaac acaatcaacc 69181 taaactccta ataggatctc ctgagcttgg tttctcttcc aggcttgcct gttggaacaa 69241 gccagaaacc caagatcatg aacctcttcc tccccatgaa cccctgatcc aatcaccact 69301 gagtcctgtg cattacactc aacaccccca ggatccaact gtttctctcc atcaccagca 69361 gcccagtcca agtccccact gtccgccacc tgatcatgct tccacccctg cctgtcccca 69421 tatggtggca tgtcaccccc gccttcacta tactttcaaa gcttcccact gtcttcagta 69481 tgcaatttct tcacgtggct tagaggggtg tcatgatcag accccttctc cagccccatg 69541 gaccttctat cagccccaag ggtgccatgc atccctggct ttcttatttt cattctcagt 69601 ccaccttcga cccctggtgc agacacataa cctaggcctg gccaatcaca gcatttcatt 69661 cccagactgc agtgttggtt caaagatgga cgtttgaccc aatctaggtc aatgaaactc 69721 aactggagac ttttacagaa ttctgcgggg aggctaagca agtagggtat agggctggag 69781 gtgtggggcc atctctagtc accacctgca aagctggcat gagagcaaag ccaacctagg 69841 aaggtggagc cgagatggag acacaaatcc ctaaaattac ttgagtcctt ggatctagcc 69901 atacctgagg ctcacctctt acataatcca ataaatttac ctttttgctc aatgctatgg 69961 aagctgggtt tctgtctctg tttatcaaga gtcctagcta atagatgggt ccataccctc 70021 ttcccacctt atcttgggct gcttatctcc tcagctaaag tatctccttc ttcaggaagc 70081 cttccctgaa taccccgact agattaagct ctaccctcct atgtctcaca gtcccctctg 70141 cctaccactc accattctgt atggtaaatg ctgtttgccc ttctgtttcc catattccac 70201 tgtgagtgct cacgggctgg gaggcattgc agtctggtgg gttttttttc ttcttcttct 70261 ttttcttttt ttcctacact tatcacagag cctgacactg agaggcattc acattttatt 70321 tttttcaggt cattttgctc atcaataagt ttcctcagat atcagggagt agtgaaaaaa 70381 gaatgcgttt tgcagaaaga tgcgcctgag tttgaattcc ccacctatta gttttgtgac 70441 ctcaggtaaa tcacttaacc tctctgggcc tcatttttct catctgtgaa tcgggagagc 70501 agctaaatac acataagggc tttactacca tattagagct ggttaagtgt ttaattaatg 70561 atagttcctg gctctacctt cttcctccag ggactgctca agcaatcttc ccacctcagc 70621 ccctcgagta gagaggacca caggtacaca ccaccacacc tggctaattt tattttttgt 70681 cgagatgagg tctcactatg ttgtccaggc tggtctcaaa ctcctgggct aaagtgatcc 70741 accctccttg gcctcccaaa gtgctaagat gacaggcgtg agccacggcg ccctgcctag 70801 ttccaccgtc ttatctgaag gctcggggtg acagctgggg ccagtcactg actccgactt 70861 ccgccccaac aatgcggatt tgagtttcca cacaagatgt gactctagag tatctttatt 70921 aacatgtcag aaggtttcct gatgtttctg cttacaaagt cttcagctaa cttccctctt 70981 cccctggccc cagctgtagt ttatggcttt tctgattgca tgactttcca cttctttgtt 71041 gtggaggttt tgcaatggct gatggccaaa aagtgaaaag gcttgaccaa gaacaaagca 71101 agcaataaat aaatacttgc tggatgtaat gttcctcaac aagatggcct caaccttttt 71161 ttttgtggtt catccaggaa gcatgaggcc ttcgtcctgg taatattttg ggcatattat 71221 taggggcagc agttggctct aggtgggggt tttaagagca gaatttgcca gccccaattc 71281 tggggtagtg catactacga atccggtttc ctgagcattc acctgctcct ctgtttcctt 71341 cctggacaga ctgtgatatc ttcttcgttt tcactcactc ctagaagcag gaagaaccgg 71401 aagaattgca aaggctttaa gagctgctca gagtatgaac acttaaaaca tcacagggag 71461 aaacccacga gagagaaggg gcatgatctc atggttcact gttttggggg acagatttac 71521 aattggggaa cagccttttc ctaatggtgc agggctttgg gggacctggg ttttatggaa 71581 agtatttcca tctgcaaact tgcctggcca ttaccaggaa gcaaaatgcc atggggagag 71641 tcccttaact tctctgagcc tcagtttcct tatctgtaaa atgggtgaag aacaacaaaa 71701 ttttcgttgc tatgaattac tgagcacttc tatgcattag atgccatata actattatct 71761 gtcacccaca aaataatcat tattatgatc cctgttgcac agatgaggaa actgaggacc 71821 agagaagtgt ctggagcaag gtctcaaagc tcatgcataa gccagctgga gttctaactc 71881 aagtgtgtct gacttgagag ttgctatcat cccacaacaa caggccttct cttgatgcct 71941 gcctcacggg tcactgtgag gttccaatgg catcacacac acaaaagata aagatactgc 72001 ctgaggcacc tgtttgccca ggaacgagca agccataggt atttgggagg cgactggctg 72061 gagacaaata caagtccctt ggatcctcat tatctgcttt tctcctgggc tcccctcatg 72121 cccagtccta agtagagagg gtctcagaga gcagaaacga ggaaaaggcg acagtgagca 72181 actgcatcct tgcaatgaag gagcactcac cagcatcctc accacccaga atgtaggctc 72241 ccaaccatgg gctgcactgt gcctatcacc tcccacaact ccagccccgc cagggacttg 72301 ccaccaacca ggagaaccca actagaaggg gtccaggagg agccatcctt tcctacccac 72361 tcttggaggg gctgcatgcc actggctttc tggaccataa aatgagatgt gccatgaact 72421 tccccaccct gtccatcccg ggaccatcta agggactcca gcaggaggac agtcaagatt 72481 tgagccctcc ctatgaggac ctgtccccac tcagtctcat ttcccatcac acaccctaga 72541 tgcaggggta agaatctagg tttgactccc gcccctccac ttccccagac aagtggttca 72601 gttcccccgt gccctatgtc tttcatctgg aaaatgggga taacacagca cttccctgga 72661 ggttgagatg aatttgcacc taggaagtcc ttagcagagt acctgacaca cactagggca 72721 tccataaaca ttggctgtga atattacgac tttgacaccg aatgccttct tgaccctgta 72781 ctgaaactat tccctcttcc cacaatgcct cccttcccct tttgcctcct ctcccaaata 72841 attacggcaa taacagtaat catcacactt accaattgct aagggcctcc tacttgcaag 72901 accacgcaca gaacacttct caaagaccat ctccttcctg ctgcatcctc cggagacagc 72961 gacactatca tttctatacc tccccttgtg tccatctaaa atatcagctc tcccagggca 73021 ggaacagggt ctctcctgct cactgctgga ttcccggcat ccggcataga gcaggcacac 73081 aggaggtgct catatttgct gcagaaatga gaccccactg aatcctcaca aaaacccgct 73141 gctgtaaatg atgatggcat ccttttaaaa tgaagatgca ggccgggcgc agtggctcac 73201 gcctgtcatc ccagcacttt gggaggccaa ggtgggcgga tcatgaggtc aggagttcca 73261 gaccagcctg accaccatgg tgaaaccccg tatctactaa aaatacaaaa aaattagccg 73321 ggcctggtga tgtgtgcttg taatcccagc tactgaggag gctgaggcag gaaaattgct 73381 tgaacctggg aggcggaggt tgcagtgagc cgagatcgtg ccaccgcact ccagcctggg 73441 tgacagagcg agactccatc tcaaaataaa taaataagta aataaataaa aaataaataa 73501 ataaaataaa ataaaataaa atgaagatgc aaaggacaca caaaaaaatt taaggaaatt 73561 atccgaggct gcatcacact aaatggcaaa acacagacat gaacctggga ctgtggagtg 73621 ctaagttcac actcttacta attaaggaaa tcccacgtcc caatggggct ccaccccata 73681 tgccttggaa ggcttctctg acagctgcca ctgacaacga tggagctggt cttcttgaaa 73741 caccatcaaa gctggacttt gaaagtaaca ttcctgggag actctctgta ccagcaaagc 73801 cccaagtcag atctacagga gcctgagggt cagcacagga gattgacacc caagtaacag 73861 gataattcac agaatgaaat aagagcttgg ttaggggaag caagtcatgt catcacagac 73921 atcagaggaa ggaatctgag ctgagtcgaa aagaatgcca gcattttaac acgtgcagaa 73981 gttagaggaa gcgtcccaag tagaaggaac agctcgagca gcgggtagaa gtgtgaaagc 74041 acactggaaa tttggagaat ggcaaggaat tcactgtgaa cagaattgtc tttgtggagg 74101 aggggaaggt aaataatcat ggcaataaga ataacaacaa taatagtaag acaaagccgg 74161 gcacagtagc tcatgcctat aatcccaagc attttgggag gctgaggtgg gagaatcact 74221 tgagacaagg agttcaagac cagcctgggc aacatagcaa gaccctgtct ctaaaaaaaa 74281 aggccgggcg cggtggctca cacctgtaat cccagtactt tgggaggccg aggcgggtgg 74341 atcacaaggt caggagattg agaccatcct ggctaacaca gtgaaacccc gtctctacta 74401 aaaatacaaa aaattagccg ggcgtggtgg cagatgcctg tagtcccagc tactcgggag 74461 gctgaagcag gagaatggtg tgaacccggg aggcggagct tgcagtgagc tgagattgcg 74521 ccactgcact ccagcctggg cgacagaacg agactctgtt gcaaaaaaaa aaaaaaaaaa 74581 gaaaattaaa ttagccaagc gtggtggtgc atacttgtag tccctgctac ttgggagggt 74641 gagatcactt gagcccagga attcaaggct gcagcaagct atgatcatgc caccgcaatc 74701 cagcctgggt gacagggcaa aaccctgtct ttaaaaaaaa aaaaaaaaaa agtgaacatg 74761 catatagtgc ttactgtgtg cctggggggt tttctacaca ctttacaaat atttgctctt 74821 tttttttttg agacaaggtc tcgctctgtc gcccaggctg aagtgtagtg gcactatctt 74881 ggctcactgt aacttccacc tcccaggttc aagcgattct cctgcctcat cctcccgagt 74941 agtagctggg actacaagtg tgtgccacca tgcccagcta attttttgta tttttttagt 75001 agagatgggg tttcaccgtg ttagccagga tggtctcgaa ctcctgacct tgtaatctgc 75061 ccgccttggc ctcccaaagt gctgggatta caggtgtgag ccactgcacc cggccatttt 75121 tttttttttt tttttttttg agacagagtc tcactctgtc tgccacccag gctggaatgc 75181 agtggcacaa gctcggctca ctgcaacctc catctcccag gttcaagtga ttctcctgcc 75241 tcagcctcct gagtagctgg gattacaggc ttgagccacc atgcctggct aatttttgta 75301 tttttagtag aaatggggtt tcaccatgtt ggccaggctg gtctcaaact cctgacctca 75361 ggaggaggtc tagacctcct aaagtgctag gattacaggc attagccacc atgcctggcc 75421 gagtttcatt tcttagatct cacccacagg agagaaaaca ttatcatgta gctggaggaa 75481 ggaatatcag taatgttcat gagtaaacat caaaatgtat tattatcttt aagaattacc 75541 aaaaggaatt tcaattaaaa tcaattatat aggctggtct tgaactcctg acctcaggtg 75601 atccgcgcat ctccgcctcc caatcgctca tttaatcctc acaactaatc tatgaggaca 75661 gtgctctgtc attccatttt acagctgagg aagccaaggc acagaaaggt gaagtcgttg 75721 cctgagatct cacagcaggt tttagcgccc gtgtgcttcg tctgctgcac ctctactacc 75781 ccagctctca acgtgggttc tctgaccagc aggaacagcg cttaggaacc tacacaaaat 75841 gcagattatc agggctcaag gagaccacgc tgaatgggaa ctctggaagt ggggtacagc 75901 aacctgtgtt gaagaagccc tccatccacg gtgaggtcag caacctccga ccctgcaaac 75961 ctgaccacgc tctgcgatcc tcaggactgt gccatgggga ctatgcctgg ggaccccaag 76021 gtcctcagag aaggaagatt ttgcccagag tcccgcagta ggtagaggag agcaaaccag 76081 gcctcgaacc agaatctccc atctctcaga agtcggtgtc tacacaacaa ccgctgaggg 76141 atgtcaaagc agccaccagc tcaacacagg ggtgcactgt tcccagtttc tgaggcatgg 76201 aagacccaga ccccattatt tccaagctat ttgcatgatc cagaacaagt tacctcaact 76261 ctctgggcct cagtttcccc atctgtgaaa agaaaaagct ggaccctatt tctttctttc 76321 tttttttttt ttttttgaga cggggtctca ctctgtcacc caggctagag tgcagtggcg 76381 tgatcttggc tcactgcagc cttgacctcc caggctcaag caatcctccc acctcagcct 76441 ccaaagtagc tgggaataaa gcatgcacca ccatgcccgg ctgatttttg tacttttttg 76501 tagagacagg gtttcgccat gttgctcagg ctggtcttga acccctgagc tcacgcaagt 76561 gatccgcccg cattggcctc ccagagtgct gggattatgg gtgtgagcca ctgtgcccag 76621 cctggaggca agactcagtt tctaaccttg tttttcttcc caaaactctt agccatcaaa 76681 tatgcactgc tggtctactg tgcacttgag caaggaatga ccacgtagca gaatgaaggg 76741 caaaggcacc tactgcatgc agcttccagc agtgctcaga ggaagggtag ccgacactcc 76801 cccagtaccc caaaccaaat caaaccaaaa acagcatctg ccgtcctgct gctagcccag 76861 gcttcctctt cttccccaca gccctggggc ccggtgggga agggagggcc ctgggggcgt 76921 ggaaagggtg tgggttgaga ggaaaccatg gccttgtagg gcggccttgc caggtgcctg 76981 actccacagg gcctctgtct tccacctggg aagtagctac taccactttc tcacacctcc 77041 aaaggtttct ggctgagccc ccgggctctg caagggtccg gatggttctg tggcaggatg 77101 tcagccacca gccaaagcaa gtgggatgaa aagtttttaa tgagagaaag gaagggccgt 77161 gctggcacct gtgtgggctt ggggtcggca gcgggaagga agggggaagt ggtgcagatt 77221 ggggcggaag aagacagaga gagaaaaaaa cagacagagg cagagacaga gggagataga 77281 gacagaggat gtagatgggg agagagagtc agaggagagg caaagatggg agtaagacag 77341 aaagagtgac agagacaggg aaagacgtga gttgggggga tgggtgggag ggagagagag 77401 gcagggagac aaaaatagaa accaagacaa gttgggtaca gaggcttacg tctgtaatcc 77461 cagtactttg ggaggctgag gcaggtggat tgcttgagcc caggagttca agaccagcct 77521 gaacaacata gtgagacccc atttctacaa aaaataaaaa atgaaccagg tgtggtggtc 77581 tgtacctgta gtcccagcta ctcgagaggc tgaggcagga gaattgcttg agcccaggag 77641 ttcaaggctg caatgctgcc cctgcactcc agcttgggca acagagcaag accatctcaa 77701 aaaaagaaag aaagaaaaga agaataagaa aatcaccaga cagagaagga gagagatgag 77761 cactagcagc agcctggaac aggaacctca gctcctacac catcaacggg cacctacatc 77821 cctgcaccca catcttagct acacagccat cccgcccttc agtcaaggcc cctgggtatc 77881 cagggaagaa ggggaatgtt gagagagagg tgactactgg cccagccccc tccagccctc 77941 tcacctgttg tggggctctc caaattccaa gcttctgaac actaccttcc caatgttaat 78001 tacatccaga acccccttta atcgtattca tttttgttca atggatccaa tatttttaaa 78061 acatattttc aagggaaact ataatccact ggattaaatt taaaacatgc cctaattaaa 78121 aaattaatgc aaaaataaaa tcgaaggaag caatgttttt ttatgccttc tggctagaga 78181 cggccgccag cctaaggaag gatccccaaa ggagccggtt gtgtccgatc tttgttaaac 78241 aagaggagtc tggggagaag cactaaagac gcgttagcac ctaatggagg ctttctccgg 78301 aaagtcatca gagggtgggg aggaagactt tcgctctgtg atttagcact tcatactagg 78361 caacagcagc ctccagggca aaattcaagt cctttggcat cgcactgagg ctctgttgtg 78421 acttaactcc tgctgacctc caattttatt ataacaccac cgattccccc aaccctcaac 78481 atacatacac acacacacac acacacacac acacacacac gcacacacac acacactata 78541 gtctttgtgc cctgaactct accaaacata ccaggccaca gagtctcaca cctccaaact 78601 tttgcacttg tgattctctc tccttggaaa acctttcctg acctcctcta caaagctaac 78661 tcctactggc atcatttgca ccccttcctc caggaagtct tccctgactg cctcctgtcc 78721 ctgtaccagt atcccccaag gtgcttagca tcctgtgtgg cttctggtgt tgcttccctg 78781 ccaaccccat ggaatgtgca gggctggacc ccatcctagt ccccacatcc tctcagaacc 78841 cagcatggtc cccaaacttc cagcacaggg tgcacaggta tttgttgagt aaacagaagg 78901 atggattgag aaagaaatga aggcaccaga acactctggg acagaaggaa atctaaacta 78961 aaccctctct gtgctctggg aaaaagttct aggcagaagc tcctgtccaa taagaatcag 79021 tcaacttgac tcaaattggc ctcacctact cattggtcca caaactctgg gccaatggcc 79081 cagcctctat tcagttttct cgcctgtaaa gtggagacag tagtattgcc tatgatagaa 79141 ttgagctaag aaaaataaac gtaaagagct aagtacccag aaagcaccca agtcttaact 79201 gcaatcatta ctatttttat cagcatctaa cactcaccaa atcaaagttt tataacttgt 79261 agggactcaa aagatgatta gaaaagcaac tggcagactt tctcttcaaa ggccagcaat 79321 gtgggctctc tggtctctat cacaactact ccactctgct tttgttacac aacagcagcc 79381 acagacaaaa tgtcaataaa taagcatagc tgtgttccga taaaacttta tttacaaata 79441 caggtagcgg gccggatgtg gcctatgagt ggtaatctgc caatttctga tccagaccct 79501 cactattcca aatgtggtct ttagacaaac accatcagag gcccctggaa actggttaga 79561 aacacagtct tagtctctgc cccaaacctg ctaatcagaa tgtgcacttt agccgggcgc 79621 tgtggctcac gcctgtaatc ccagcatttt gggaggtgga ggcaggcaga tcacttgagg 79681 tcaggagttt gagaccaggc tggccaacac agtgaaatcc cgtctccact aaaaatacaa 79741 aaattagccg ggcatggtgg caggtgtctg taatcccagc tacttgggag gctggggcag 79801 gagaatcgct tgaacccggg aggcagaggt tgcagtgagc cgaggttgta ccattgcact 79861 ccagcctggg caacaagatc gaaactccat ctaaacagaa acaaaaacaa aaacaaccaa 79921 aaacacagaa tctgcacttt aacaagttcc ccaaggggat catatgcatg ttaagatctg 79981 agaggcctgg gtactcaact tcataggctg acatctcacg tcagagatgt gggttacttt 80041 ttcttgcctg atgcctccac cagaacgtaa actccaggac aactaggacc tcgttgcttc 80101 ttaaatcttg caccccaaca tccagttgag ggttctcatg tgtccttatt ctagtggaag 80161 acgtcactca gtggctatcc gtgcagtgct gggatcaagg catgacttcc actgacagcc 80221 tctgaagttg ggcagatggg ctttgagtcc tggtgctgaa tctcaggcct gtgtgacctc 80281 gggaaagtga ctatttccaa gccttggctt ctgggaaatg gcgtggtcaa taacaaatat 80341 ctcactgtgg ggaacaacag ttaggtaggg gcctgtacca ggtaaggtgg cacggcaccc 80401 tagcgcccag agacaggaac tccccacaca gggagtatgc aatggggaca acctccagtc 80461 ccagcactct gggaggctga ggtgggagga tcgcttgagc ctgggaggca gaggttgcag 80521 tgagccgaga tcacaccact gcactctagc ttgggcaaca gagcgagact ccatctcaaa 80581 aaggaaaaaa aaaaaaaaaa agaacaaaac aaataaaagg ccaggcacag tggctcacac 80641 ctgtaatccc agtactttgg gaggctgaca cgggcagatc acttgaggtg aggagttcaa 80701 gaccagcctg gccaacaggg tgaaaccccg tctctaccaa aaaaatacaa aaattagctg 80761 ggcatggtag taggcacctg taatcccagc tactcaggag gctgaggcag gagaatcact 80821 tgaaccgagg aggcaggggt tgcagtgagc caagatcata ccactgcact ccagcctggg 80881 cgacagaagg aaactccatc tcaaaaaaaa aaaaaaaaaa aatgccttga gggcttaaaa 80941 tggcagacag ggtatcagga ggcctcagac aacctccaat atcctagaaa cacagagctg 81001 gaagggctct tggagcgtgt caaaaaaaaa aaaaaattgt gccatgcgcg atggctcaca 81061 cctgtaatcc cagcactttg ggaggccaag gcaggcggag ttcgagacca gcctggccaa 81121 tatggtgaaa ccctgtctct actaaaaata caaaaattag ctgggcgtgg tggcatgcga 81181 ctgtaatccc agctacttgg gaggctgagg cagaagaata gcttgaaccc aggagacgga 81241 ggttgcaagg aaccaagatc atgccattgc actccagcct gggcgacaag agcgagactt 81301 cgtctcaaaa aaaaaaaatt gtttttaaag caatggaact gcttattcag gtggcacgag 81361 gcaggcctaa ggcatcctca gatcccctga aaagccgcac ttacgtgaca atggggagta 81421 gggggaacca cagtgactgg gagagtccag gccacctaag gggcagcttc aggccactcc 81481 agtttggcct aagctcctgg gtttccaaga gaaatgagaa atcagaattt ttctctagaa 81541 atgtttgaat gttgactcaa ttttaaaaac acacacgtag catgggagct caacaaaaaa 81601 catgcagcag ggtgaactca cccatttaac atgtacggaa actggggctc agagagggga 81661 agctccttgc ccaatgtcac acagcaattt ggaggaagag ttgggacttg gctgcttcca 81721 agctctgctg tgcatgagag tgtgggagtg ggaggaacag agcgagtagg gaggagtggg 81781 tgggaatctg gactttacag gccaagagcc ctgcatgttc tgatcatcag ggaaggagtg 81841 tggacagagg gatggagagc ggatcaatat ttattcagaa tagttataac aacacctgat 81901 gttttctgtg tgttttctat gtgctaggcc ttaagatggg atatcaagga ggagtaagaa 81961 tgagaattca gagtgtagag tctgggctgg gtgtggtggc tcacgcctgt aaccccagca 82021 cttgagaggc caaggtggga tgatcgcttg agcccagaag ttcaaggtta tagtgagcta 82081 tgattgtgcc acagcactcc agcctaggca acagagcaag accctgtctc aaaacaaaca 82141 aacaaaaaaa cccaaaacaa aaaaaaaaga aagattctgg atccaaatcc tatgtctccc 82201 acttcctagc tgggtgacct tgtgcaaatt acttcacctc tctgagcctc agcctgttca 82261 catcatatgg ctgttggggg aattaaatga gttaaaccac atacagtgcc tagaacagtg 82321 tctggcaccc agtaagccct gttgatattg gctgtattta taattgttgc cttttgacag 82381 acggaggaac taaggttcac aaacgaacat gttcaaaatt ccaaagagcc tgtttaaccc 82441 cagggcctca gcttttgcct ggtaccaaga taagtgcttt accaccattc ccttccatcc 82501 agcacccctg gaagataggt cctacttata cctgcattca acagatgaga aaaccaaagc 82561 atagagaaac agccaacagc cccagaaaga tgtggagcac ctggattcta atacaggcca 82621 gtgcccccta gctccctatc acactgcctg ctttctgggc tgcctgcctg ctttctggct 82681 gcagaagccc cagtcaaaag tcttactgat acaaagtcac tcggcagacg ggctggctgc 82741 ccgccatctg agcacatagg ataagaagtt tctccctggg ccatcagccc cagacgttcc 82801 ctagggcctg tgggcactgc ctctgaggtc cttccctgca atttctgggg gctggccaag 82861 atctcaagca gctctgaatg aatcattccc acaaatcctc ctctggctgc agtaaagggt 82921 gggggagacc tgagaagtga cgggagggga cagtcatccc agatctctgg aggggcccca 82981 gaccgggtac atgcaggtcc tcagagggca cggggacttg caagcaatcg gaataggcag 83041 caagtcgggg agcaggagct gctgctaccc agagaagctt ccctggtgcc acgtggggca 83101 tcaaagccac acaagccaca tgcatttttc caagaaagta cattctggtc accaacccca 83161 ggctgtgaag ctacatctga tctgccactc ttactacaaa tacacccaca aagtcacatc 83221 aataatacat ttcaggccag gcgtggtggc tcacacctgt aatcctagca ctttgagagg 83281 ccgaggcggg cggatcacaa ggtcaggaga tcgagaccat cctggctaac acggggaaac 83341 cctgtctcca ctaaaaatac aaaaaaatta gtcaggcata tgcctgtagt tccagctact 83401 cgggaggctg aggcaggaga atggcgtgaa cccgggaggt agagcttgga gtgagccgag 83461 atcacgccac tgcactccag cctggacaac acagcgaaac tccgtctcaa aaataataat 83521 aataataatc ataataatac atttcaacag cctaaccgat cccaaggcac caccacatct 83581 tagttctgaa tggtctatgc ccattcctca gtagggtaca ctgacaccca gagagcggca 83641 gttccttgcc tagggagcct tcccaccccg tctccatgtc cctaggctgc tttctttctt 83701 tcttttttta atatttaatt gatttttatg tactgaccta ataccctaca atctcgccct 83761 gttgcccagg ctggagtgca gtggtgcaat ctcagctctc tgcaacctct actcaagcga 83821 tccttccacc tcagcctccc ggctatctgg gaatacaggc gcgcaccacc acgcctggct 83881 aatttttgta ttttttgtag agatggagtt tcactatgtt attcaggctg gtctcaaact 83941 cctaggttca accaatccat ccacctcagc ctccaaaagt gctgggatta caagcatgag 84001 ccatgcaccc ggcctaggct gctttcttaa tgcacaggaa ctcaggattt taaaaggaca 84061 aggaaaggca agtatgcaaa aaagaataaa gtcacgaggt gccagataca gcggctggga 84121 ggcgtggtct taacaccatt cccaaacagc agctgcccag atgaaccaag ttgtcaggaa 84181 actggaacac gcagataagt gctggtctaa gttcctgaac cagcagagta caggttcaag 84241 ctggggtgaa ctacctacct tctcaaggaa ctcttcctct gtgaaaacgc tgggttctga 84301 cctgagttct tccggtgcag cctcttgctg aagcaaagag tgtgaaggaa gaaacacagg 84361 gtcctagggg tggggaaggt gctgagagga gggggaagat gtctgtgtgt gcagaatgtt 84421 cacagatgaa gcgtctgtaa ctcaggactg ctcatctcag gcctgggggg tggcccctct 84481 tgaaataagc agcaactcac ataggacact ccggtggcca cccagagaag tcaccttcag 84541 agtagcgtgg ggggttccag atagggtgtg ggcaggacca gacttaattt tggggtcaag 84601 ggacaagtct tccccttcag ttatggtcta ggagagccat ggagggtctc cctggcctcc 84661 acaaagcctc agctggcaag ggggctccag gctggcccgg gatgggaaga tccgagcagc 84721 ttccaccttc ctttcacccc ctcctaatac tgtgtgccag ctcctggggt gggctctgta 84781 agccactggg tcctacactg cctctccttg cttcggtatt ttcacagagt cttgctctgt 84841 cacttagacc ggagtgcagt ggtgtgattg atcttggctt actgcaacct ctgcctcctg 84901 ggttcaagcg attcccctgc ctcagcctcc caagtagctg gggttacagg cacgcaccac 84961 cacacctggc taatttttgt atttttagta gagacggggt ttcaccatat tggataggct 85021 ggtctcgaac tcctgacctc aggtgacctg cccaccttag ctcccaaagt gctgggatta 85081 caggtgtgag cctccgtgcc cggcctgaaa atagtttaca actaacacat caaagccagt 85141 gatttcaagg gttatgactg cttcagagaa cactaaatgt atttaagatt attggaaccc 85201 tcatacactg gtggtaggaa tataaaatgg tacagctgtt ctggaaaaat agtctggcag 85261 gtcctcaaca ggttaaacct gggtgcagtg gctcacacct ataatcccag cactttggga 85321 gcctgaggta ggaggatcgc ttgagaccag gagttcaaga ccagcctggg caacatagtg 85381 ggacccctat ctctacaaaa taaataaata ataaaataaa ttaaaaaaat atttaaaggt 85441 gaaacctaaa gttaccttac gacccaatca ttccactcct agaaatatac ctgagagaac 85501 tgaaaatgta tgtccaaaga cctgtacatg aatgttcata gcagtattat tcataatagt 85561 caaaagttag aaacaaccca aacatccatc agctgacaaa tggatagaat gtggtatatc 85621 cacataatag aatattatgc agccataaaa aggaacgaag tacttaaaaa aaaaaaaagg 85681 aatgaagtac tgatacaggc tgcaaggggc tgaaccttag aaacattctc tgaagtgaaa 85741 agcatcagat accaaaggcc acatagtata tgactgcatt tatatgaaat gtccagcaca 85801 ggcaaattca tagagaattt gagagtcagt aagtggttgc caagggatag gggaagaaag 85861 gattggggag tgaccatgaa acagcctttc ttttgggcat gataaaaatg ttctaaaatt 85921 agacagtggt gatgactatg caactctgaa catcctaaaa attacggaaa catatacttt 85981 aaaagggtga attgtacggc atgtgaactg tatctcaaca aagctgttat aaaaaaagtt 86041 tgcatgagat ttttaaaaat caggtgatac aaagatgtat attaagtaaa caacagtaca 86101 attggggttc agtggggtgg ggggattcag aaatgaccca acagttcctt taggacaatg 86161 aaataaaccc agctttacag ttatggaaac tgagactaga tcggaaggaa aatgcgtcgc 86221 ccgggccaca tcactactaa atccggtgtg tcacagatct gttcaaatcc agaagcctac 86281 cctcgccagc agaccaccct gctgagatga aagggttatc acatggaggc tagggagagg 86341 tgtccaccca cagcccagtc tggcagacag acatgcaagg gaatgtctgt tgggaagtga 86401 gaaaaagggc tggctaactc tgtggggttc aaaacaggaa gcacccaaca ataccaggag 86461 aatgccaggg aaggcttccc ggaggaggtg ctgcttgagg gggcttctgt aggataagta 86521 ggagttcaac agcccgcctc ctcaacatcc ccaaggacat taagtcagga ggcctgtgag 86581 tgagtttcag ctctgccaca tcctagctgt gaatcctagg gtcacgggca ggtgacctct 86641 ccaagcctca gtctccccat ctgtaaaagg ggaagggggc aagaggttag gcacggttct 86701 ttctaatgac tctgctgagg tggatggttc cttccctagg gaacgggcct ttgagtggga 86761 atgccagacc tcgacctctg tcccatctaa caacgactct gccaggtcag gcgtgctcag 86821 cctctgagcc catgctccca gtcccctgct accagtccca gtcccggctt tccgggcacc 86881 tcccatccag ctcacaccga gaagcggagt ttacggaggg aggttggggg ttggggtggg 86941 atcctcgggt caaattcctc tccactcctc accagtgacc tccaagttca ctagggctcc 87001 ccaacccagt gctcccagcg cccacctccc cagccgacga tttccgcccc caacgcctcc 87061 acttctgacg ctcctcgcaa ctccacggct gctgcaagtt aacttccacg cgccctggcg 87121 cccgcctggg ggtccccgcg cgcctgggcg aggttgggga ctccggaggc ggggacgcgc 87181 cgccctcccc cccagtcccc ccgccaacag acgctgagcg tctcccaggc gctgggtccg 87241 aagcgggaag cgtggtaacg ggggcccggt ggatgccgca ccccgcggaa agcgctcggg 87301 acgccgagcg ggtgcggatc ccagaaccgc gcgcccgcgc cgcctgcctg cctcccagat 87361 cccagctccc agccccggag cccgagcagt cgcgccgggc ccggaggcag ccgccaggac 87421 ggagggctgc caggcgcccg aggccgaccc gcacggcccc cagcccgcgc ccccggttgc 87481 aacccgaggc caccccctgc cgccccaggg ccacctaccg tggcgagacg cggcgaggca 87541 gccccacgca cccagcgact cccggctctc cagccgcccc agccggggaa gggggaggtg 87601 cccgcgcggg ggcggggccg cggcgacgtg acgtacgggg gcgggcccga tacgcggccc 87661 cgccccgccc cgccccatcc cctccactgc gcctcgggac cgcgaggtga gcagaggctg 87721 gtcgggaagt ttcccggagg ctcgccccgg gaaccccgcc ccgccgccct ggaagcccct 87781 gtctgctcgc tggcttaccc ctaggacggg cagctcacca cttctcccgg tcagttctgt 87841 ttaagagaag ttcctttctg cctgtaagca atgaagtgga atgatcacca aaaagtggtg 87901 taagaaaaga aaagtagacc cgcagcgggg tgtattcgtg ctactgtttg tgtaaaacac 87961 agaaaaaaga aaggggtggt ctgatgtata gtcatacaca ccagtggttt tggagggagt 88021 ttttgtggat ataccacatt ctctccattt atcagggatt ttgccctgac atccctccac 88081 caccacgaca tttggcaatc tctgaagaca tttttggttg tcataactgg ggaagtggag 88141 aggcgatact ggcatctagt gagtagaggc ctggaatgct gctaaacatt ctgcggtgca 88201 taggacaccc tttccacccc cagccctgcc actacccccc caaaaaaaac cctgcaaaaa 88261 atgccgaata gcgtaaaggt tgtgaagccc tgatatacac aaatgaaaca aaactcctga 88321 ctgataaatc ttagcacagt ctaagttcta taacctttca gtgcttctgc ttcttcatgt 88381 gtaaaattgg aaaaataata gagctatcta tgcatcgtat gcttgtgcat tgaatacctg 88441 aagaaaaggt agggtaggca tgcaggctta cttttcattg tgaactttgg ttctttgttc 88501 tcttttaact ttctatttgc catatgtata tattatctgt ttaaaataaa tttctgggtc 88561 aggtgcatgg ctcacacctg tgattctagc actttgggag gcgaggcggg cggatcactt 88621 gaggctagga gttcaagacc agcctggcca aaatggtgaa acctcgtttc taataaaaat 88681 acgaaaatga ctttgggagg ccaaggtggg tggatcacat gaggtcagga gttcgagacc 88741 agccaggcca acatggcgaa accctgtctc tactaaaaat acaaaaatta gccgggtgtg 88801 gtggtacatg cctgtagtcc cagctactca ggaggctgag acaggagaat cgcttgaacc 88861 tgggaggcaa gggttgcagt gagctgagat tgcactgcac tccagcctgg gcaacaagtg 88921 agactctgtc tcaaaaaaaa aaaaaaaaat gcaatcccag cactttggga ggccgatgcg 88981 ggtggatcac caggtcagga atccaagacc agcctgacca atatggtgaa accctgtttc 89041 tactaaaaat acaaaaatta gctgggcatg gtggtgcatg cctgtagtcc cagctactca 89101 ggaggctgag gcagaagaat cgcttgaacc caggaggtgg aggttgcagt gagccaagat 89161 cgcaccactg cactcccacc tgggcgacag agtgagactc cgtctcaaaa aaaaaaaaaa 89221 aaaatgagca gggcatggtg ctgctcacct gtaattcccc ctacttggga ggctgaggca 89281 cgagaattgc ttgaacccag gaggcagagg ttacagtgag tgtagatcgc accactgcac 89341 tccatcctgg gtgagtacac acacacacac acacacacac acacacacac acacagaaac 89401 tagggcacac agatattaag taatttgccc atggccagcc acatgttcat tcattggtca 89461 ttcattaatt cattcatgtg tccaataaat agttagtgct tgccttatgg aaaatgctgg 89521 gactgataaa gctgcctcca tggagccagc cattactggt ctggaaacaa tacaacaaca 89581 ttacatagtg tttattgcgt gcatgttaga gacttcatat gactgatttg atagacacat 89641 agattaaaca ccagcagggt gttttataat gtccttttgt ggatgaaaaa aatggagttc 89701 agaggaagag gtgtgaatgg gggcatttgt ttaataacat tctagcaatg gcatttcctg 89761 agctctctat gctaggcaca gattcagcag ctaacagggg gacaaaggtc caagttgggg 89821 aagctagatc cttcaacaag ccttcacagt aaagtgagca cctgtaactg atgaggcagg 89881 agcagcagag attaacctgg tccagggagg gagggccagg cgtgtgtata catgccagag 89941 gcatgggtgg ggatccttgg aagagtgcac tggtcacaga atctagcagg tgcaaaggcc 90001 tagcaggaag gaagcgagcc tggggccttg catgaagtgc agagtggaca ggatgtggtg 90061 gaggtgagag aagagatcag aggtgtctgc agggctggat gctgcagagc tttgtgggct 90121 gtgggaagga gtcggggctt ttctcttgca gtgatggaga gacagggaag gtgttcaaca 90181 ggggaagggc atggggagcc tgactttctg caaagatccc tctggtgctg tgtgaagaag 90241 aaacaggcag aaactggttt ggagggtgtt gaaataattc aggaacccag agaggggatg 90301 tgggcatgga gggggtgttt tggttccgga gaggtttgga agacagaatt ggcagatctc 90361 agttgagagg ctaaggagga ggatcagaat gatcgtcttg ttgtctaggt tgggacttct 90421 ctcagtaccc ctagctcagt gaacctttcc aggtctcaat ctgtctgtca cctccttgag 90481 gaaggctaat tgcccctcag gacatcgtgg tggaccagac accagactgt gcagcccagg 90541 aatttgatgt gactttgaag gcagacaaaa taattcattc ctcctgtctg catcttcagc 90601 cgggaacacc tagagcagat acctcttctg tcttcctccc caacctatct tcccagcatc 90661 taggcaaaac ccagtactta agaggtttta tcaaaagcag gaagaatatt tatcaaaagc 90721 aggaagaata gatgtagcat aatacacatt gatccaaaaa ttctttccag ctctgacatt 90781 caagaggatg aggttgatgt tatgtcaata attacacttt aggccaggct caatggctca 90841 ctcctgtaat tccagcactg tgggaggctg aagagggagg atagcttgaa cccaggagat 90901 tgagaccagc ctgggcaaca tggtgagacc ttgtctctac tttttttttt taattaaaat 90961 ttttttaaaa accattttat ttatttttat tttatttatt tatttctttg agacagagtc 91021 tcactctgtt gcccaggctg gagtacagtg gtgtgatctc ggctcactgc aacctccgcc 91081 tcctgggtcc aagcagtcct cctgcttcag cctattgagt atctgggatt acaagcatgc 91141 gccaatatgc ctggctaatt tttgtatttg gagtagagat gggatttctt catgttggcc 91201 aggctcgtct ccaactcctg acctcaagtg atccacctgt cttggactcc caaagtgctg 91261 ggattacaag tgtgagccac catgcccaac cacagaaaac actttattca ggtgcttgct 91321 atgtgctaaa caccttataa ttattatatt caagctgatc atttaccctc tccaagcttt 91381 tggtgtttta ttatcattac tcccatttta cagatgtgga aacagaggct ccagcaggta 91441 atttgatttg gccaaggata atagctggta attgtcagag ctggaatgca aactcaggtc 91501 tctctcaccc tttctagtac actgatagaa ggcagtggct cggcagtggg caggaagtaa 91561 aagtaagtgg gcaggttttc ttctcctgct ggaagggtgt agtattattg ttcctagctg 91621 gggatccagg cttcacaaag cattcgttga tcctaagtat ctgacccagg ctattggctg 91681 tcgtggcagt ttggctctaa gaaggctcat ctctttagaa cctctgcagg gcaccctgga 91741 accatgtctt ttccccagag cccagttgct gtgtatttat cagcatctca tgctggcaca 91801 aaagcattca atgcctctgg gtgatcctcc tgcctccacc ccacatatac acactgccgg 91861 cgttgggctt atgctccaat gtctaaaaga ttacgggttg cctcttgtcg atgagaatga 91921 gagccgctct ctcataatca agatgctttg cctttccaaa ggcctgagac tcaaagattc 91981 acaagctggg atctatctag ctttgaaaca atatgaagtc cccagtttcc tcaggcattg 92041 tattttaaag ccagtgctta gggacactcc tgcttgaacc cgggaggtgg aggttgtggt 92101 gggctgagat tgcgccactg cactccagcc tgggcgatag agtgagactc tagctaaaac 92161 aaaaaacaaa aaacaaaaac caaaacaaac aaacaaaaag ccagcattta gagccaaacc 92221 catgcagtca aaattgcctt ggagaagccc ccgaaaaggc tgaaggcaac atagcccctt 92281 ggaagtccag cttcactggt ttccgtcaat attaaacttg gactcatcat gatttctggg 92341 gttgagcacc tgatgaaatg aaaccttagt tttagtttga aggcaaaagt gtggcttatc 92401 tttaagctct gaaattctgc tccaggtggt gagatgctca tgttttgaga gatatgttga 92461 aactaggacc catggaagga acaccaggtt tatggttcct aagtcacact cactctcagc 92521 cttactacat gttattttct ttaacttgct tcttttagtc tctattcttt attttttgct 92581 tataatttta tatgtcctcc tttgttatta caaggcattt tgtttattca taaattcatt 92641 agcaaacaat acattaaata taaaatgact ttaattgtta ttttcttctg gtgtttattc 92701 tgccaagtgt acatatcttg tttcagtgat gctaatatgc acattttaaa atgcctgtga 92761 ggccaggcgc agtggcttac gcctgtaatc ccagcacttt gggaggccga ggcaggagga 92821 tcaagtaagg ttgggagttc gagaccagcc tgaccaacat ggaaaaaccc ccgtctctac 92881 taaaaataca aaaaattagc caggcatggt ggcgcatgcc tataatccca gctactcggg 92941 aggctgaggc aggaaaatcg cttgaacctg ggaggcggag gttgcagtga gccaagatcg 93001 ggccattgca ctccagcctg gacaacaaga gcaaaactcc gtctcaaaaa aaaaaaaaaa 93061 aatgcctgtg agatgaaagt gatccagaag acctggactc tgtgttttgt ttttaatttt 93121 ttgtagagat ggggtctccc tgtgttgccc aggctggtct caaactcctg ggctcaagtg 93181 atcctcccac ctcagcctcc caaagtgctg ggattgcagg cgtgagccac catgtatggc 93241 cttgtttttg tttttagaga cagggtcttg ctctgttccc cagactggag tgcagtggca 93301 cattcatagc ttactgctgc ctggaactcc tgggctcaag ggatcctcct gcctcagcct 93361 ccacagtagc tgggactaca ggcatgcacc actgcaccca gcttattttt tattttttgt 93421 agaggtaagg tctcaccatc ttgcccaggc tgatctcaaa ttcctgggct caagtgatcc 93481 tcccacctta gccttccaaa atactgggat tataggtata agccactgtg ccaggccaga 93541 atctggacca tcaatgcaaa gagtttttat tttattttat ttttgagatg gaagcttgtt 93601 ctgtcgccca agctgacatg cagtggcacg atctcactgc aacctccacc tcctgggttc 93661 aagctattct gctgcctcag cctccagagc agctgggatt acaggtgtgc accaccatgt 93721 ccggctaatt ttttatattt ttaatagaga tggggcttca ccatgttgac caggctggtc 93781 tcaaactcct gaccttaagt gatccacctg cctcagactc ccaaagtgct gggattacag 93841 gcatgaacca ctgtgcctgg ccgcaaagag gttttagaac tgacttaact gatgaatttc 93901 tagtatgttt accttttata ggacttatca gattgataca ctgaaaaaaa atctgttaaa 93961 tctaaataag ttgaaaataa ttctttcaat aagtataaaa attctgagtg atcagaaaac 94021 aattgtgtta tgttttgatt ggtattattt ctctttttcc tcctggaatt ataaaatcat 94081 agtgcatctt accaacaata agaccttgaa ggtaaagaac tatggaagtc aatttcaggg 94141 ggctcatatg actgtgaaga tggactccac tgcagtagaa ttttagagct ttactcacag 94201 accttaagaa cttagaatgg ggatccatgt ggcttggctt tggaatcaga gaatcacaga 94261 agcttagaga taaatatcag aaccttagag agtcatagaa ctttggaacc tcagaatctt 94321 ggggctggat gggtatttga gctgtcgttt tatagaagag aaaaacaagt cccagggaat 94381 ggatgcaatt tgcctagttc tttccaactt ctaacagtac agcagggtta ataccctttt 94441 attcccgatt tctgtccagg aatcctggtc cgcagacagc cctgtcttct agtccaagat 94501 gctggccaag agtgttgagg tggggagaat gaaagtgaga tgagggaagg tgagtgagac 94561 ctgtccaaga ttatgtgagg tggtgtctca actttgaagc tcagcaaaac cacagggtgg 94621 ggtgtcatga ggccctatct ttcttgccag ctttatctcc tgacgtacaa gtgtatccgt 94681 cccaccctgt gcttctgccc acacaaccac caagctaatg ttcccacctc aataagcttc 94741 accggggccc accacaactc tctcaacctc tatgcctttg ttcatggtgc tccagggctg 94801 ccaagttcag ctgagcatgt tgggcacagc acaagggtat cacatttaca tggtagacgt 94861 cccagattta taccaataat ttctaacaga tggcagtaaa atgtcttgag gaagagacat 94921 ctttttatca tttgcacaaa ggtgccatat ggggtagcgg ggacactgct gtttgctctg 94981 cctggaatat catctcccct attctccatc aagcaagctc ctgctccatc tttcaaggcc 95041 cccttcagat attccctcct ccaggaagat atccatagca accccgacac cccgttccac 95101 caggctgtct gcatagatta atgattaata acatcaatca atattaactg ttgatatggg 95161 aaacaatagt gtctggcaca gatttccaaa tgagatgatt cctgatgtac cccagctcca 95221 ggattgagta acactgaatc agatacaaga aaacgatgct cttttcaacc tttattgatt 95281 aaatcaagga ggaaaaaagt ctcagtttgg tgcaaacatc tttcacactt ttctttgttt 95341 tgttggtttt ttttcttctt tttttttgag gtggagtctc actctgtcac ccaggctgga 95401 gtacagtggc gcaatcttgg ctcactgcaa cctctgcctc ccggttcaag cgattttcct 95461 gcctcagcct cccaagtagc tgggattaca ggcacccgcc accacagccg gctaattttt 95521 tatatttttg tagagacggg gtttcaccat gttggccagg ctggtctcga actcctgacc 95581 tcaagtgatc tgcctgcctc ggcctcccga agtgctggga ttacaggcgt gagccaccgc 95641 acccggtcct ctttcacact tttcaacact ggctaatctc ccattttgtc gaagagtgaa 95701 caggttgcag gcagtggtat ctgctggaaa tttgatactt ggccttcttt aaattgtatt 95761 tgatttattt atacgactgc ctgcttcctt ccttcctttc tctctctctc ctttccttcc 95821 ttctctcttt ctctttctcg ctctctctct ctttctttct ttgacaggat ctcactctgt 95881 tgcccaggct ggagtccagt ggcatgatca cgactcactg tagcctcaaa ttcccaggct 95941 aaagcctcaa gtgatcctgc cacctcagtc tcccgaatag ctgggattac aggcgcacgc 96001 caccatgctg ggttaacttt tggaattttt gtagagatgg ggtttcgcga tgttggccag 96061 gctggtctcc aactcctgac ttcaagtgat ctacccacct cagcctccca aagtgttggg 96121 attacagggg tgagccacca cgcccagcct gtgtgattgc tttctatgta tatgatgctg 96181 attttccact tcttggagca ttataaggat ttatttacaa atcagtgcat ttaaggacaa 96241 aaaataagtc tatttaaata aaaatatgtc aaaagtagac caggtggcat gcagacagtg 96301 caaaaaatcc tgagggggga tgctcaactg gctacagttt gaaaaaaatt tatagatttg 96361 tcttgaagag tacaggctct ggagccagac agcctgaatt caagtcttga cttgctaact 96421 gtgtgacctt gggaaaatca cttcccctct ctgaacctca atttcctcat accaacctcc 96481 tggagcagtc ttcataatga gtaaggccaa gtacttgtcc tccaccccaa cacatccatt 96541 caggacccag aaaccaacct gcctcctctg acctgctgca cgaatgccct ggctggacca 96601 tctcttacct agtatccttg gacagcatct cacaggtgta tctgggtttc ccctgctagg 96661 tggagcaaac tccttgacgg caggaacttg gccttattca tttctgccta cacatttggt 96721 agactcacat aacaggcaga aagaactttt cagtgtttgc tgcttttatt atcgttgtaa 96781 ttattaagag cgaggaagtc acatcccagt aactatgggg gacctcaagg cacacatctg 96841 tttacagatc caggagaaat tttggcttta gggaatttta tttgcctggg aaaaatgggg 96901 gaaataattg aacatgaaga agggttgggg caaggtagca gagtgaggtg tggactcact 96961 tggggggctc tgtggccagg acagggaagg acccctcgag gtgccactgc ttgtcacgca 97021 ggcgacgcag ggactgcatc tgtggctgga aggctcccca ctcattccag tgccggaagt 97081 caccaggctc taggaggtac tggtacccgc ggtagccagg atactgatag ccaacccatc 97141 tgagagaaaa gtgagaagga cagagtggag acagcctgtc tcgttgcctg gcacccagac 97201 ccctgccaga ccagcctgtc cttcattgat ccctgcagtc ggctgctctc tcacttctgt 97261 cctggctata tcccttcctg ctctgtgccc tggggcaagg actttgcctc tctgataggc 97321 ctcggtttcc ttttctagga agtgagcatg gtcacggtgc ctggttttat taagagtgct 97381 gggggggacc aaatggagaa tgcacctcaa gtgtttgcta agcatgttgc ctgtgcatgg 97441 taggaatatg aatacacatc ccattaaacg tgatgatgca gggctcagag aacaaatcca 97501 aggaagcaag aataggactt agggactggt taacaatgaa gcaaatgaaa gagaggaagt 97561 gttatttcaa aaaattggtt ttcttggctg ggcgcggtgg ctcaagcctg taatcccagc 97621 acttagggag gctgaggcgg gcaaatcaca aggtcaggag atcgagacca tcctggttaa 97681 tgcagtgaaa ccccgtctct actaaaaata caaaaaaatt agcagggtgt ggtggtgggc 97741 acctgtagtc ccagctactc gggagactga agcaggagaa cggcgtgaac ccaggaggcg 97801 gagcttgcag tgagctgaga ttgcaccact gtactccagc ctgagggaca gagcgagact 97861 ccatctcaaa aaaaaaaaaa ttggttttct ttgtaggtaa gtgcatcagt gcgtagcagc 97921 tgggaacaca tactctggta tcaaatatat gtgggttcaa atcctagctc tgccattgat 97981 ttactagctg tgagaatcta ggtggataag gtatcagaac ctcaatgtac tgtctgtaca 98041 gtggggatga taatgcaccc acctccttgg tttgttatga agatttaata gttattcatt 98101 taaagtggtt agcacagtcc ttggtacata gtaagcactc aatagaagtt atcatcagtg 98161 attaatttct aacttctctc attgtctttg ctgagagggt ctttccaggc ttctgccaga 98221 ccccctccct atattgtaaa gcctggttta agttccatcc tctgcagata aacctttgac 98281 ctgccctctg ccctaacata cacagccaaa tcctagaaac ctctcctcct ctgacctcct 98341 acaacacagt tgccctggct ggcccattct ttacctagta ttgttggaca ccatctcaca 98401 ggtatatctg ggtttccctg ctaggccaag taagttcctt gaaggtagga acttggctgt 98461 attcatcgct gctttcccca gtgtacctag cacagtgcct ggctcacaga caaggctccg 98521 tatatttcag atggcttgaa tggcagggat ataggtttca gggaggcaga aaggaacaag 98581 aaaaaggcca tagccatgat ggtgttacag gagctcagaa caagtctggg aatggagaga 98641 cagagcctta cgtttaccta gaggagatca agggtgtctg gcatggtcat ttggaggcag 98701 ggggatggat aaagtgactc ctttgatctg gcccttccat gtattctcat gacataaggt 98761 catggggctc cgttttgata gcttcagatc ctgagcgttc agagcaggaa gggcactctt 98821 ggatcaggga attttgtgtg cccattgcct agctggcaag actgagaggc agagaggtgt 98881 ttaggctcac tcgagttcct cagcatgtca gtagcagggt cagggccatc ttatgatgca 98941 aattcactgc tcagaactca ttctgttcat tctcccagaa gcttctgatt tccctgaata 99001 ttccctgctc ctggatgcat ccagcctcat ctatccacat cagcgccggc ccttttagtt 99061 gattactcct tcaaccccaa ctaggcttag cagctcctat tttctccatc taacagcttt 99121 ttagagctga tttgtttaca ctgacctgga attcctcaaa gtgtgactct attccccaga 99181 ggaatagagc tctgttggct gatcccagga accagcactg ggagactgtg gaaggggcca 99241 tggccttctc taggaatctg atgttataac ctgtaagggg agcagcctct gattctgcct 99301 gtgcttgaag caggggacgc ggacctggca ggagggatgc ttacgttcca ctggagacct 99361 tcacgctgcc cacgcggtca ctgaagccgt agacccagag actgggtgcg tcgtccccct 99421 ggatctctat ggtgttgccc ttgaagttgg ccccttcaaa cagggagatt ttgtgctcct 99481 gggcatcctg gggaaagaga ggccgggtca ggtgtgtgaa gtacaaatgg gacaaagaga 99541 agaaacttag cggggccttc tgggtgtgga gcgagagaga tgagcctgtt caggctgctg 99601 ggggactctc cagggaccac tgctgtctag tctggaagga gacataagga aaatgggatg 99661 acaactccca cgtcataggg tcctcgtgag gatgaaatga gtgtatccat atagagcact 99721 gagcacgatg cctggcatac atcaagtgct caataaatgc atttattggc cgggtgccat 99781 ggctcatgtc tgtaattcca gcactttgag aggcccaggt gggaggactg cttgaggcca 99841 agggttcaag accagcctgg gcaacatagc aagatcctgt ctctcaaaaa aataattttt 99901 tttttaattt gccaggcatg gtggtgcatg cctgtagtct cagctacttg gaaggctgag 99961 gcaggaggat tgcttgagcc caggaagttg aggctgcagc gagatatgat tacaccactg 100021 cactccagcc tgggtgacag agcaagaccc tgtctcaaaa aaaaaaaaag tatttattat 100081 ttttaaaatt atgatcaggg acaaaaaggc tcagagaggt caagcttatt gatcaacatc 100141 acacagccag ggaggggcag aacctttcag agaaacactt cattcctaag cagtgtccct 100201 gtgccatcag gagtttaacc aattgccctt ttgtaaatgc ttcttacaaa ccaggtgaaa 100261 cgtggaacat tggtggaaca cagatctagg ttcaaatcct gcctctaggc cgggtgcgat 100321 ggctcactcc tataatccca gcactttggg aggccgaggc gggcagatca tgtggtcagg 100381 agatcaagac catcctggct aacacgatga aaccccgtct ctactaaaaa tacaaaaaaa 100441 attagccggg cgtggtggtg ggagcctgta gtcccagcta cttgggaggc tgaggcagga 100501 gaatggcgtg aacccaggag gcagagcttg cagtgagccg agatcgtgcc acttgcacac 100561 caacctgggc aacagagcga gactccgtct caaaaaaaaa aaaaaaaaaa caaaaaacaa 100621 atcctgcctc tgacatttat gcactggtga tcttggacat tttactctgc ctccttcaga 100681 gcctcagttt gttcatctgt agaatggggt tattgccagg catggtggct cacatctgta 100741 atcccagcac tttgggaggc cgagacaggt agatcacctg aggtcaggag tccaagacca 100801 gcctggccaa catggtgaaa ccccatctct actaaaaatg caaaaattag ccaggtgcaa 100861 tcacatgtgc ctgtaatccc agctactcgg gaggctgagg caggagagtc acttgaacct 100921 gggaggtgga ggttgcagtg aggcgagatc acaccactgc actccagcct gggcgacaga 100981 gcaagactgt ctcaaacaac aacaacaaaa aagaatgggg ttactattgc ttcgtgggat 101041 tgtcatggct actcaactca cggttcagtc caactataaa acaggcctgg ccgggcgcgg 101101 tggctcacgc ctgtaatccc agcactttag gaggccaaga tgggtgggat cacaaggtca 101161 ggagttcgag accagcctga ctaacatgga gaaaccccat ctctactaaa aatacaaaat 101221 aataataata ataataataa ttagctgggc gtggtggcac gcacctgtag tcccagctac 101281 tcaggaggct gaggcaggag aatcacttga acctgggagg cagaggttgc agtgagccaa 101341 gattgtgtca ctgcactcca gcctgggtga cagagcaaga ctctgtctca aaaaacaaac 101401 aaacaaacaa ccaggcccac ccttgaatgt gggcacatga ttcagttctg gcttgcactt 101461 atttttaatg ctttcatgac cctgtatctt gtttttaaag ttcgtctgtt atttctgagg 101521 gcagggatca tccatctttt caaaagtcat catgcactgt tttaggccca agggtttgaa 101581 gagccaaaga gatgaacagg aaatagcaga gtgatgccgg acctcagtgt ctaggttcaa 101641 acccacagac cttggttcaa atcccagccc cagcaactcc cagctgtgtg gccttaggcc 101701 agtcattttc tacttctgag tctcagtttc cttgctttta aaatgcatgt tatcatagcc 101761 tgtagcatag gacagttgta gggattatgt aaaaggcagg taatataccc agtacgtagt 101821 tggtgcttaa aaatggatgc tgtgattatc tactgttatt atctactgtt attcaccacc 101881 actaggagga agaggagccc agaacgtaag tgcatccttc tgtgaaagac tccagtagga 101941 gattatagat gaagaaagaa ctggttaatt cttccagaga aaagcagagc agatcaggga 102001 aggcttcatg gaagaagtga cactagagag ggaccttgga ggttaaggag gagtgcacaa 102061 agctgagaag ggaaggaggg ttgagggaag gcactgaggc agagaaggca taggcaaaag 102121 ctgggaggtg ggaatgtgca agacaatatc tgtgtggcca gggtgcagga aggcaagtgc 102181 aaggtgccag gcagggatgt aagcacaggg gaccctttgg ggcatggcaa gacaaatgcc 102241 cttagtgcaa ttaccagcca gtggtaaaag tacaacaatg attacttatt actgttacgg 102301 tcttgagggc cttgggaagg ttgttagtca atggctcccc actgctagag gggtcaagac 102361 tgaacttgcc accacctatg agaccctaca agaccctggt ctccaccacc acctcccctc 102421 ccatcttgcc tccctctttc ctccactccc tctgctgcag ctttgttggc cacctttaat 102481 tttcttggtc ccaccagtcc atcgttgctc taggagtttt gcacacactg ttccctggaa 102541 ggctttacca tctgctcttc acgtggctga gttggtttcc aaactttggg agctggcttc 102601 cctgatgacc ccctgaccac tcccacctct aacaatatct gagcccccag ttcctgcttc 102661 tgctttccta gttagattgc atggcctttg tgtaacacct accacagttt gcaattgtag 102721 atggattttt atgattagct ttttgatagc tgcctccctg ggtagcttaa gtcccatgaa 102781 gacagggacc tagtctgttt gatgcacagt tttgcactca gagcctagaa ccagagcctg 102841 gcagaaacag gccaactggt tgggcgcact ggctcatgcc tgtaatccca gcactttggg 102901 aggcccaggc gggtagatca cttgagatca ggagtttgag accatcctgg ccaacatgat 102961 gaaaccctgt ctctataaaa aatacaaaaa ttatctgggc gtggcggtgc acacctataa 103021 tcccagctac ttgggaggct gaggcaggag aattgcttgc cccagggagg tgaaggttgc 103081 agtgagccag aacgtgccac tacactccag cctgggcaac agagttagat gtgtctcaaa 103141 aaaaaaaaaa aaaaaaaagg aaaatagaaa caggccaacc aaaatgtctg ggttcaagta 103201 gaagatggtc aggtttattt ttttttttta ttttgaaaag atccctctaa cttcccacga 103261 cagggggctg gaaaggagtc agggaaccca gcagggagag agctgaagat gaccaggtca 103321 gagatgatgg tggcctggac caggactgtg gcagggcaga gagaggaagg gttagaccaa 103381 gcttatttaa cctgcggcct gcaggccaca tgtggctcag gatggctttg aatgtggccc 103441 aacacaaatt tgtgaacttt cttaaaacat tatgagattt tttttgcaat tgttttaaag 103501 ctcatccgct atcgttagtg atagtgtatt ttatgtgtag cccaagacaa ttcttccagc 103561 gtgggccagg gaagccaaaa tattgggcac ccctgaatta gatggttgga tctcagccct 103621 ggctgcacag acggactact tagagggcat agaaaagata gcagtgccgg agttccagcc 103681 ctagagattc agaattaatt ggtctggggt ggggtctagg ctttgagatt tctaaacatt 103741 tctaataaca tattttattt aaacaaatat acctaaaata ttataatttc aatatgtaac 103801 caatataatg atatcaatgc attgttgaca ttctttaaaa ttttttttac tatgtattca 103861 aaacctaata tgtattttac acttatgaca catctccatt tagccaaaat tcaagtgtcc 103921 aatggccaca tatggctact gcctattatg ttgaatagca agcttctaga agctactgga 103981 aatgccctgg ttgatcactc ttccaaacaa accccttaac ataggttggc atataagaac 104041 cattcaatga ttgagagaga catctgtgaa ttcttacaga gagtgcatca tgaaggcaaa 104101 taattcagtc ttacattctg cgaaaagtgt tgggaatgat ctgtttgaat aatccattaa 104161 gataaatttg gtgatgtcaa aacagctggt ggttggcact tgatagcctc aaatgcacat 104221 tcatgaaaat tcttataaat gttcccatca tggaagacaa aagtcggtta ccatcagcaa 104281 ctaacattgg tgagtaagat gtagctggtt catgactcat catcaactta attctggaaa 104341 aaactgtgaa tcacttggag atatcgccac cattttgacc ttgaatcttc tggaagttga 104401 cttagttaaa tattataaga aactaattca gtatattgtc ttattattaa gaggttcaag 104461 ctgcgtttct acagtatccc tctccttttt tttttttttt ttttttgaga tggagtctgg 104521 caccgtcacc ccggctggag tgcaatggca tgatctcggc tcactgcaac ctctggctcc 104581 cgggttcaag taattctcct gcctcagcct tgctagtagc tgggattaca ggcatgcgcc 104641 accacgcccg gctaattttt gtatttttag taaagacggg gtttcgtcat gttgatcagg 104701 ctggtcttga actcctgacc ttgggtaacc acccacctca gcctcccaaa gtgctgggat 104761 tacaggcatg ggccaccatg cccggctaca atatccctct tcttagccat tgcatgatct 104821 gcaaccagca tgtttggtgt ttcgaatggg cactgaaatt tgaaaagttt tcccaggcct 104881 ggcatggtgg ctcacggctg taatcctagc actttaggag gcccaggcgg gaggatcact 104941 tgaggccagg agttcaaggc tgcagtgagc tatgatcaca ctactgtaca ccaggctggg 105001 ggacacagca aaaccccatc tttgggaaaa aaaaaaaaaa aaaaagctcc cagtaaggtc 105061 taaatgtgca gccagggata tcaagagaat caaggaagat gctgaagttt ttagctctaa 105121 cttggaagct tttgtgctta tcatttgccc agggttccta gctgggtatg tagtgagtgt 105181 gttaaggaca ctgatgaatt gatgagctca agaatccacg gtcctttgac aactcggagg 105241 ctgccacgcc tccctaccca ccatcatctc cttcttgccc ttgtcagatc tcagacttac 105301 acctgccctg ccccgcctgg ctgattctcc agccccagcc ggagagccac tcaccatttt 105361 gatgggccgg aaggacatga gccgatcact gcggtagctg ctcgaccatg tgttccagcg 105421 agggtactcg cccttctcca ggatgaacat ctccccgcgg aagttggact gctcaaaggc 105481 gacccagctg gatacaagaa ggaccatgag gcagacagga gacatatggt tagtagaagc 105541 ccccactccc tacttgcctt tctctctccc ctggcaaggc agccctgagg acagtaggaa 105601 agtccaaagg gggctgaaca tgtctgggtt tgattttagg ctctgacact tacaggctgt 105661 gtgactatgg gcaagtgact ttacctctct gagcctttgt ttctcatcta cactcatatg 105721 cctgtttcct tggagttggt gaaggattaa gtgagataaa tgtgtgtgat tatctgagca 105781 cactgtttct ctagatgtaa ggttcataca tgcttcaagt actatcagga cttggagcca 105841 aggaacacat agacatttga aacttgacgc cagctaggag aaaggctcgt ttgatgttaa 105901 tgtgccttta acacttacta atctctcctt tttttctttt gtaacaaaga gagaaaagac 105961 cactagccca gaacctccag cagacaatgg tgtaatagca acataataat accacttttg 106021 tgtttattgt taattttgta tggatacctt ttacttatag gaaataatac tagccttcca 106081 tttgtaaaaa gtggtgtcag ggttctttta aaatgtgttt atttgtggtt agcaggatca 106141 ttttaaatga aatatattaa atgaacgaca gtctcggagg tacaggggta agggaaaaat 106201 tgatacaggt gtttgcatcc ttgctgggta aatactggcc tggtattcgg gagatactca 106261 gtaactagta tgaggcaatg ggaagggaat gatggtaata tctaccctca gatacctatg 106321 gtaatataat gtgatagcgc ttatgaagat gctaggttcc ttacatggac taattagaag 106381 tttgaaatta gatggaagat tttcctggct ttcccagtga agattgatta aggacaagaa 106441 gtgagcttgt gttacaggga ccagggattg tggcagtgca ggtggggtag ataagacaat 106501 ccccattccg tagatgagaa agctgagact aaggaactcc ttgacctttg ctccccgtgt 106561 gttcaagagt tcttaggaaa aaagagacgg atgaggggac actgacacta gcatcttctc 106621 ccagggggct gctgggaccc tgctctcccc agaaggagac aaggatgatg ggctatcacc 106681 atgggctgag acagaatgca tcacaggagc ctcaggtctt tgtctgagag atgagttaac 106741 cacactggag taaattaaag atatgtttgt cttcccagaa tagagttgca tttgcctcag 106801 ccctgtctgg tctggagtag aagagaaata aatgggagaa tagatggatg gatggacaga 106861 tgaatgggtg gatggatggg tggatgaaag gatggttgaa tagatggata ggtggatggg 106921 tagatggaca gatgaatgga tgcatggata aagggatgga tgaagggatg gttgaattga 106981 tggatagcta gatgggtaga tggatgggtg ggtagatgga tagatttata gatggatggg 107041 tgggtgatgg agaacacatg gtgggtacgt gtgtataaat ggagaatgga tgtatgtatg 107101 gatagatcaa tagacgaatg ggtagatgca tgggtaagta gatggatgat gaataaatgg 107161 gtggatagac gcatgggtga ataagaaggg agccttggaa ggtggtttca tgtaggagtt 107221 aagaggacag accctggaaa cagacagacc tgaattctaa tccaagcttt gccatttgct 107281 gcatgcattt gggcagcact tacacctgct aagctttagt aacagagaag aaatgataca 107341 tctactttgt tgcatctacc ttgttgggag attatgaggg cctaaaagag ggaatgcatg 107401 gaacagatac agcacagagt aagcctccaa attaatactg ttcctgatta aagggagaag 107461 gaaactggag gtctaggctt ctggaaaaac agacactgat gtggtcttgc caccatgccc 107521 aagtccccct aaaatctccc ctatcttctt cagcaggacc agtgaggatc taaattttaa 107581 atgtttatat ctatgccttt tcttatctcc catggctcat ttcaacccgc aagcagggag 107641 agagggtagg gcaggaattt taatttataa gatatggaaa ctgaggctaa gaggggttaa 107701 gtccccagcc taaggtcaca tgatggtgtt gggtgtcagg ctgcagtctc ctgggtttca 107761 ggcaggcagg cctcctcact ttccccatgc ctacctcacc ctcttcattc tttttttccc 107821 atggccctgc cagcgcccac acagacccag gcccccggac cacaggcccg tttccatgcc 107881 catcaggcac ggctccaagg cacacgtcag acaccagctg catctttgtt tgaggagaaa 107941 tactctgggt ttgagtcact ggtaccaaaa aaagcctgaa aacccatgct ggccacatcc 108001 ggtagcttct gcttccagct gggttttggg gctgagggtg gagctgaggc tgcgttgacc 108061 acactgaggg tgcaggaggt agaaatggtc ccgggtagcc tccaggataa gttggctaaa 108121 ttcaccgggc agggaggtgg gtacgtgacc atcagagtca cctgccatgt ttgggtgccc 108181 acaaggtaac aggcccatgc tgtatgatct ccctcagacc tcagcataat cctgtgagga 108241 gtggtgaggg ccatccttct ccccatttca aagatgggga aactgaggct tgcacactga 108301 acaaggcaaa caggggaagt caggactcaa acccaggctt gtctgacctc aaagctcatc 108361 tcggaatggg ctcagaatac aggatgactg gaacccttgc aggctttcaa agtgggggag 108421 acaagagcaa acgagctgcg tggatcttgt cctggggctg cccctccacc ctgacctcag 108481 cacacacagg ctggacatag ggcaaccact ctgcccaagg atgactatcc ctgttttata 108541 tgggagtcac taaggcccag agacacccac agagacagga acaggcagtg aagcatggtg 108601 attaagagct ttaggaatga actcaatcaa cctgagttga atcctggctg tgccacttac 108661 caataggtga ccttgggtaa gagactcacc ccctctgagc ctcagtcttt ccatctgtca 108721 agtgggcata agaatagcct ctgcctccta cagcactgtg aatcatctag gtgtggtccc 108781 gagtcagtgc ctggcacctg gtgggtgtgc cacagatgct tatcattact cccaggcagc 108841 tggacgccca ggagagggga cagcgagtgg acacttcact ctttgacctc tctcagccct 108901 cggtaatggg ctagagtttg tttttttttt ttttgttttt tttttttttt tgagacggag 108961 tctcgctctg tcgcccaggc cggactgcag actgcagtgg cgcaatctcg gctcactgca 109021 agctccgctt cccgggttca cgccattctc ctgcctcagc ctcccgagta gctgggacta 109081 caggcgcccg ccaccacgcc cggctaattt tttgtatttt tagtagagac ggggtttcac 109141 cttgttagcc aggatggtct cgatctcctg acctcgtgat ccacccgcct cggcctccca 109201 aagtgctggg attacaggcg tgagccaccg cgcccggccg ggctagagtt tttgctatag 109261 tagaagcagc atttctccag agcccagaac catggcttac tgccagtgtg atcttatgct 109321 aattggatca cctctccgag cctcagtttc ctcatccgta aaatgagatt gagaacaatt 109381 cctccctctt cagtttgata tgaggattgg acataatgta tgtgccagga gtacgaacgg 109441 ccgcaggcac agagcagatg ctggagaaat ggcagctact gttgtgtggt cattttactg 109501 tggcagagag tgtgcccctc cgccgcccag tactcacggt cccgcggaga caatgatgct 109561 gcgcacacgg tcgaagccac ggtctgccag atttgagcac tcccccgaga attctgctcg 109621 acggccctgg aagttttcca gttcgaagac caccagctgc aggagagaag cccccatgcc 109681 aagggcagag tgagggggga gtcaaaaatt cattaagtat ctacataaat aaaagccagc 109741 agtgcaggag tcacataaaa gtgtgatgag ccacagtttg tgaaatgatc ctgtcctccc 109801 atcccccaat tctttactga tttttgtttt ttgtttgttt gtttgttttt ctgagtctca 109861 ctctgtcgcc ccagctgaag ttcagtggcg caatcatggc tcactgcaac ctccgcctcc 109921 caggctcaag cgatcctccc acctcagcct ccgagtagtt gagattacag gcacccacca 109981 ccatgcctgg ctattttttg tagagacagg attttgccat gttgctcagg ctggtctcga 110041 attcctgagc tcaaatgatt ggcctgcctt ggcttcccaa agtgttggga ttacaggcat 110101 gagccattgc acctggcctc atgccctaat tctgaagcta agatctgcgc catcattagg 110161 aacttatgaa cctccctttc aggcctttct ttgtgctatt cctcctgcct ggaatgcctt 110221 ctccaccaca catatccagc ttggagctcc cactgcctaa ctgaagttcc ctaatacgca 110281 ttccagctcg ttattctgtt ttaggctcag gtgccacctt ctccaggaag ccttctctga 110341 ttgccccttc ctccatcggc cttggttgag catcccccac tgtcctctcc ccattagcag 110401 cacaactaac ccaccacaca gtcactgctg ttttgctttg gggtctccct ctctctctgc 110461 actccctctc tcccattacc cagcccctcc aaagaccaag gactcctcca gggcagggcc 110521 tgaggttgat caatttctgt atctccagcc ccagccccag tacctgccac agagctgatg 110581 ctcaggagac acttgcaaat ccaaccctcc acaacatagg ggaaaacttg agctaatctt 110641 ctctcctcac taagcctcat gtttggttcc ttgtccactt gctgttccaa gttgggacaa 110701 cgttgccatt cacatggacc aaggcacaat tttctgcctt ctttgagcct cagtttcccc 110761 agctgtaaga ggaggggagg ttcatataac catgacccag cacttccact tctacctttg 110821 gttgcccaag agaaactcct gtatgtgtgc cccgggatgt tgcgggaagt cagggacccc 110881 aaacggagga actggctgaa gccatggcag aagaacgtgg attgtgaaga tttcatggac 110941 atttattagt tccccaaatt aatacttttg taatttctta tgcctgtctt tactgcaatc 111001 tgtaaacata aattgtaaag atttcatgga cacttatcac ttccccaatc aatacccttg 111061 tgctttccta tgcctgtctt tactttaatc tcttaatcct gtcagctgag gaggatgtat 111121 gtcgcctcag gaccgtgtga taattgcatt aactgcacaa attgtacagc atgtgtgttt 111181 gagcaatatg aaatgtgggc accttaaaaa agaacaggat aacagcaatt gttcagggaa 111241 taagagagat aaccttaaac tctgaccgcc agtgagccgg gcagaacaga gccatatttc 111301 tcttctttca aaagcaaatg ggagaaatat cgctgaattc tttttctcag catggaacat 111361 ccctgagaaa gagaatgcgc acctggggtt aggtctctaa actggccccc ctgggcgtgg 111421 tcgtctctta tggtcgacgc tgcagagatg agatagactc cagtctccca tagctctccc 111481 aggcttatta ggaagaggaa attcctgcct aataaatttt ggccagaccg gttgatctca 111541 aaaccctgtc tcctgataag atgttatcaa tgacaatggt gcccaaaact tcattagcaa 111601 ttttaatttt gccccggtcc tgtgatcctg tgatctctcc ctgcctccac ttgccctgtg 111661 atattctatt accttgtaaa gtacttgatg tctgtgaccc acacctattc tcacactccc 111721 tccccttttg aaactcccta ataaaaactt gctggttttt gcagcttgtg gggcatcacg 111781 gaacctactg acatgtgatg tctcccccgg acgcccagct ttacaatttt tctcttttgt 111841 actctgtccc tttatttctc aagctggccg acgcttagga aaaatagaaa agaatctatg 111901 tgaatatcgg ggcaggttcc ccgataccag gagacaggga taagaacgtc caagccatgc 111961 tgttcacgag agcaaagcta gcaaccactg aaatcgtgct tagcaggaaa gagatggtca 112021 cgctctgaaa tatcaggcag cagccagaat gaatggccgg cagtcatgcg caacaatatg 112081 gatggatatt agggatctca tagtaagtga aaagtacaaa gaaaagccca gaataatcac 112141 attgcatatt tcaataaatt aaaaacaagt aaatcggcca gttgcggtgg ctcacaccta 112201 taattccagc acttggggag gcggaggcta ggagttcgag accagcctgg ataatatggc 112261 aaaacccgtc tcataattcg gtctcaaaat aaatgaataa ataaaacaag taaataaata 112321 tacatgtgta tgtgtgtata catttttaaa ggaataatgc ataaaattgt atacaaaagc 112381 aagcaagcag attacagaca caggttgagg ataggacagt atttcctgga gttggggagg 112441 aaaatgatgg actcaaggga gacaccttag agttaggtag aaagttttta accaagtcct 112501 ggcttttata tgggtggtgg gtttgtaggt gcatattgca ttattagaaa ttaaataaaa 112561 ctaatgatta aataggtaca taaataaaaa cctgccacgc actgaccaca tgagagcgtg 112621 tcatgaacta ggggttgcaa aatggcctac ttctgtgacc tgaggttgaa accacaccaa 112681 accaaatccc aggggactgg acaccttgct ccacaggatg cctttcagcc cagaccacct 112741 cgagcaattt tgagggtctg aggccatcct gctgccccag tgagcctcca aggacagctg 112801 gttccctgct gctaagccag cccacttcct tttcttcttt ttttttcttg ctttcgataa 112861 actctaagca tgtgcgtgtg caggtgtagc ctgattgcca ggtgtccctt cagctgtgaa 112921 gcggtatggt ttggttgttg ttattgcaca actcaactat aggtcccctt tttctttcta 112981 ttcttttctt tttttttttg atggagtctc cctctgttgc ccaggctgga gtgcagtggc 113041 atgatctcag ctcactgcaa cctccgcctc ctgggttcaa gtgattctcc tgcctcagcc 113101 tcccaagtag ctgggattac aggcgcccac cactacgcct ggctaatttt tgtattttta 113161 gtagagacag ggtttcacca tgttggccag gctggtctcg aactcctgac ctcaggtgat 113221 ccacctgcct cggcctccca aagtgctggg attacaggca tgagccactg cgcccggccc 113281 cactttctcc aagtgtgtca gaacccctgg ctattttatg ggctcccaga ttggctgggc 113341 cacacttgtg tcccagaagg aggagggagg aggaggccaa ggtgagggaa aaagagaata 113401 gggacagagg ataagagtct ggggaggtgg agaagaagaa aagggagagt aaatagaaag 113461 gagagaaaat aagccaagaa aaaggaggag aaagaggtgc ggaggagtaa gaggtgaaag 113521 aggaagagga ggaggagaag aaggaggagg agggaaggaa ggaaagaagg catggcaccc 113581 agccactgca cccccaggtc cttaccctgt agttcccagg aggcagttcc gccgccttgg 113641 cgctggtaat aggcacggtt gttggggcca gggtagtgcc gggactaggg gatgttcctg 113701 caggtggggc ccccttcccc ttggtgtcag gccctgggtt caccgccact gtggccgagg 113761 ccgaggcctt tgcagcctga gacatggttc ccgcctgcaa aagtctgtaa agaaactctg 113821 gccttcaggg atgacacccc caaactgcct cctgccctca tagccccaca tcctgtgctt 113881 tcagcctcct tggagctcgt ttcctctcta tctccttctg aaacggttaa gcctggaagt 113941 ttttctttca tccctacttc ccagccaaga ccttcccgtc tgggggccac tcatcccaca 114001 tatatccttc ctccaggctt ttgaccatgc ggttccctcc actagctgtt ccaccctcca 114061 ttaccccgct aggtccttcc ttagccccct gcaattttcc acactgctta tcacactgat 114121 ttgcttgtgc aactccaatt tgcctcccta gactaataac aagaagtccc atttgggagt 114181 acttaaggtg tgccaggcac tgtgacatac attaattaac ccatctgacc atgaacctca 114241 cccgatggag tagctgctat catgatcccc tctgagcagc agggggctgg agaaggaagg 114301 ggcccagctc aggcccaaac agctactaag tggtagggct ctgatttcaa ctcaaggaaa 114361 tggagccagc aactgggctt ttaaagtatt ctgccatttg cttaatctga tttattcacc 114421 gtccatccct aacacttgct ctgcacctgg cacataggag atgcccaatt aacatttgtt 114481 cagagaatga atgaatgcat gctcccagct cctttccccg tctgcttcct ctgaggggtt 114541 ggtgtatggg tgcaaataca ggccggggac tgaggcttat gggaggaccc aagaggatgt 114601 atttactgca tgtggagggt tctgaaaatg aggactgggg ctgggtaggc tttgggtgga 114661 gagggctttg gtcaattcca gtggttcctt gttgctttgg ggtcgtggac tcctttgaat 114721 gtttgatgaa tacctcaact ttgtccccgg acaaaaataa tgctctcttt aactttggag 114781 acaatgttaa gggatttgag gaccccctgc tagcacagat taagaccctc tggtcaagct 114841 gattcccgaa cacattctgt aaggcggagc tagttgggat aagcacatcc ctcccccagc 114901 ctcctccagg aaggctgtgc atattctggg gtgccccctg ttcaccaggg gtgaggccag 114961 acatctggac cacatctgtg gctttaaaca ctcccccagg gctggcagct ggactcatgc 115021 cttcctggag atattctcat tctctctctc tccctgcttc accctctccc tctgttcctc 115081 ctctcttccc tgtctttgcc tctctatctg tttcttactt ctctccctct ttctgcctat 115141 ctgcctgtct gtttctctcc ctctctgttc ctctctcccc cactatctct ccctcattgt 115201 atctttgctt atctctctat ctgtcctttt taattatatt tacttctctc tctatttctg 115261 tcttttctca ctctctcccc tctccctctc cctccctttc ttcctctctc cacttccgtc 115321 tttatctcca tggccctgct ctctgactct catggcactg ctatctctgc ctccctctct 115381 ctgtctccct tttctcttcc cctccccttc agctctcagc tgggctgtgg gttccctctt 115441 acctggggac ttgctacttc ctgctggagg acaggcaggc gggcaggtgc tatgggtgac 115501 tagagtccga ccccccattt tatagtctgg cagacatcag gccctatagt ctgtggcaca 115561 aaggggaagc tggctcagca ggcctgggca gggagtgggg gactggggcc ctgacctcgc 115621 ctggatcact gcagacagca gcactttcct ttgtgagacc agcacacaca gcagaaaccc 115681 tggccacagc agcttcacct tccaacccta ccctgactcc tttctcaaca accccagcca 115741 gggatgggcc cccaaaaggg tccctgctgg ctgccaacat ggggcaggga aaaggggaga 115801 ccctgggagg caggagacca gagtttgaat cctgactcag ccactgcgcg ctgtgtggct 115861 tttggtaggt ctctacccct aggggggcct ctgtgtctgc aattataagg tgggcacgtg 115921 tccctgttgg atgccatata gaaaagaagg tgacaatgag cacagaagcg ttttgtaaag 115981 gcagatgtgg gtttgatatg gacagccttt tttacggagt gcttactgtg tgccgaaccc 116041 gtgttatctc ggtcagtccc cgggacaatc tgccttattt ggtactttca ttatgcccat 116101 ttcacagaag aggaaagtga aggcacagag aggtcagagc cagcccaggc cagccagctc 116161 tgggcctgtg tgctttcttc tcctgctgtc cagccttgac cttggcctga ggccccctgg 116221 cctgacttcc tctttgtagg ttggggtggg aatgggagag tcgcactgtg ctcagctggg 116281 ttggatatcg gagacctggg tctttttctt ttccccacat tgggccccat cctttcctga 116341 cctatcgaca gtgatccctc tcgccacccc agttccccgc ctctacccag gccacgttat 116401 ctgacctgga ggacagtgct ggtggccaga aatatgactg cctgtcccag agcctcctgc 116461 cctggcaagc cccagatgcc caccacatct tgttcaccac tggccccagc acctagctgg 116521 gcctggctag gtagagcttg gcacatcatg ggttctcagt caataactgg tgactcaatg 116581 gtggttccaa tggatcggaa cagaaggtag ggtacggagg agatagggaa accaaggcaa 116641 ggagagagaa gtaaggaggc agatagaagt aacagacaaa gagagaggtg gagacagaga 116701 cacagagagg cacaggggac atggacagtg aaggagcgag agacggagag acagaggtca 116761 agttggggtg atcaagcctc aggacggagg aacagacgcc agaaatcaca gggcccaatc 116821 ctcccatttc acaaatggag aaaccgaggc ccaggtggaa caaatgactt caacatagtc 116881 acatggcggg aactcacagt acagctgggg tcagaaggag cctggggagc tgctcagcta 116941 ggtcacctgc agtcggctcc ccagtcccag gcagggtata ttgcgggggc atctggatag 117001 gctcctgtct cttctttttc ggggttctag gctccccgtg gagcccagct ctgtcttgaa 117061 ctctccctgg cctggagcca gctttgcctc tgtccctcca ccacccagca ccccctcgct 117121 tcctagcctc tccttggcct ttcatcctgg gccaagactt ggtgggtggg gtgtgggcag 117181 gaactgcaag atggggtagc cactggaact ggggccaaga cgcctgtggt gagggcccac 117241 ccaccagtct aggcttcaaa gccaggtcat ctggggcagt tccgtgcccc ccctaaagcc 117301 ccaggaatgg gctgtctgcc agagggtggg ggcagagaag ggggatggcc caccatgttg 117361 gtggctccac tctgagcatt tattattatt attgtatcct tcattatatc tttttctttt 117421 ttgccctgtc ttatggagcc aatttatttt cttttctttt tttcttttct tttttttttt 117481 ttgcattgtc ctgtactcct cagtataagg atgaagtctc tgggattaac cccaattacg 117541 gataaggaaa ctgaagcaac tccctttcct tcctctctct cagccacttt ctcataaaaa 117601 tggaaatgta agataaaatt ggaactcaca tcttctttac tcatgaaccc aggctcttaa 117661 aaactatgca cccctgcttt ttgatcaaca attaataata atcatctttc ttattagcaa 117721 tggtgatgat gaagatgatg gaaataccac aatatttttt atgttgcagg agcaattcca 117781 gctgctgttc atgtgctatt cacatttatc ctcacaatcc ccctgtttct tagactaatc 117841 ccaacttacc ttgttacacc cattttacac acaagtaaac tgaggctcag agtgcgtaag 117901 taacatgctt aaaagttacc cagccagcac cagtgggatg caaacccagg cagtctggct 117961 gtagagggtg cattgttaac ctttcatttt catgccaacc cttcatcaaa gtcatgcatt 118021 cctttttttt tttttttttt ttttttttga gacggggtct tgctctgtca cccagggtca 118081 agagaaatgg catgatcttg gctcactgaa acccctgcct ccaggttcaa gcaattctcc 118141 tgcctcagcc tccctagtag ctgggattac aggcgcccgc gctactcccg gctaattttt 118201 gtgtttttag tagagacggg gtttcaccat gttggccagc ctggtcttga actcctgacc 118261 tcaggtgatc cacccacctc ggcctcccaa agtgctgaga ttacaggcgt gagccattgc 118321 gcccagccta aagtcatgcg tttgatagag ctgcatttca ccctcttaaa aggaaactgg 118381 tgtggccact ttacaggtca tgagcaggac tcaggtgtcc tccaggtcac acggatggta 118441 agaaagaagc agggctggat ttcatctcag atgatctggt ttccagtcat gctgttttcc 118501 tatgcttgtt acacccattt tacacacaag tatgtatgta tgagggacct cttaggtccc 118561 tcatacgcca attcagaggg cagggtttcg ggacaagagg cacacagcaa attgtacaag 118621 tcctggatat ttggcaagtg tgccacagtt tgcaaagcat gtttgcaact aacagctcct 118681 ctatgctgcc cacagtgcca tccggattca tcttcaggct ttgtggaccc agcaggccct 118741 ggcctagccc tcttctcctg cctcacttcc cttcaggtcc cctgagctcc aggccaattg 118801 ggctaccagc tctttcttgt acatgtgccc caagttccca cctctgaact ttttcttaga 118861 atgcctctcc ttccatctca ctctgtacat atcttcctca tccttcagag tctagttcaa 118921 atgcctcctc taacggcaag cctatctgga ttgcttcaac tggaagagaa aggggaaaga 118981 gagaaggaaa ggaataggca tttcctgaat gtttactctg taccaggcat gttatgtaac 119041 ctcacaacaa ctctgtgata tggtttgttt ctgccccatt ttataggcag aaattgaggg 119101 gaaatgactc gctggagctc acatagctag gaaggatgct tctggactcc tcatctagtg 119161 ctcattctcc tgctctagtc aggaaccttg tggggctgat gagggccaat attccacctc 119221 tctccttctt cgtggctgat ccctgggggt ggtgatctgg tcctttccct ccctgctaaa 119281 gccagagctg ctgacggctt gccattggct cttcagccag ggctggcttt tgtttctgtg 119341 gcccagttgc tgagtcgggg ctttgtctcc caggggcccc tcttccctcc ctgcctataa 119401 gagcccgacc tcggctggtc tcagatctga catgttccct gggcctatct cggtaagtcc 119461 caggtttgga ggaggggtgg gatcagagtt ctgagtgagc attctgggag gcgaggaacc 119521 caggtcccca gtcctaggga tgtcactgtg tcatcctgaa tgccatcctc atcctaacca 119581 aggatgagcc ccaaccatca agaccagcac ctgcttccct catttaaaaa caaattattc 119641 ctgtagccct attgcctaaa tttccaccat cagagctacc tctttgagct aggtgctgtc 119701 tttggaaggg ctctcttatg ctgagcattt cagcactcgc tggggtgatg cacccaccca 119761 ccagcagcca caaaaatttt cccataacaa gcagataaga gaatacagag tggggctcag 119821 agtcaatgag acccagcaag ggtgagtttt gagatgggga taggatggaa gatggcattg 119881 gtgacagatg gaaaggacag gagagggaca ggccaaagcc atgcattgcc cctagcccag 119941 tcactcctgg actccctatg tggatcctga ccaggtccaa gccagccatc tgcataaaga 120001 ggagagcaga gggtcagggt ggaagattat aatgttcttc tcttctgcag gaaggggcca 120061 caatgaccct gcaatgcaca aagtcagcgg gaccctggaa ggtaggaaga ggcatgggga 120121 gggggtgttc agggggtatg gggagggttc aggtccccat gaatcctagg aggggggcat 120181 tataaatgcc tatttcacag aggtgcaatc aaggctcaga gttgagaagg gacttgtatg 120241 tcacatacag ggaggttgaa tgtgaatttg ggtcttctgg ttcccaaagt tgacctgtta 120301 cactctttgg cagaggttgc agtctttggg catgatttct ttgaccatct ctgtgtgtgt 120361 ttgtttgttt tataaagctt ttggatttat catcagcctt taaaagcctg gaagaaccac 120421 ataaaagtcc agatctagtg attgtcttga gaaattggaa ggtctggcct gcattcccac 120481 ttggcaacca ttggtggcac tgggaagagg ccacccattt acctggcacc tgtgctgtct 120541 agtgtctcac atttatcact gctcttgcct tcctggctcc tggaggaatc tgagtttgca 120601 atccctgctt tacctgccag atcctggggc gagcccccaa cctctcaccc ttcacgggac 120661 tctgatgcgg atctccacct tttttttttc ctggcacaga tggtggtgtg ggatgaggac 120721 ggcttccagg gccggcggca cgagttcacg gccgagtgcc ccagcgtgct ggagcttggc 120781 ttcgagactg tgcgatcttt gaaagtgctg agtggagcgt gagtctaggg ggacactgag 120841 ttggggtaga gggtggacag gaagggacct agagacgggt gctaggactt ttagatattc 120901 taggtcccct ctccctaggc tcttactgtt gtgccctcct gaagtactga ggagtgtgca 120961 ggactgccat gtaagattat gcaggttgcg cactgcccaa cagtaggagg gtgccattta 121021 cacagaccct gcccactgga tgaggcagtt ctgcaggaga tccttagaat ccagtgttgg 121081 atctaaaaat gtccctccca gccgtaaatt gaaagccaac atcacccgcc taaagtagaa 121141 ggtaactgta aaaataaaca taatgtttta atgctattaa tttttagcta aatagtcttg 121201 ctgctaagca tgtggcctga tcattttttg taaaaaaaaa aaaaattaaa aaacaaaaaa 121261 gagagtgaga gagagagatt agtgagacac agagaggtgt taaggacaca ctaataacac 121321 accaagactt tctagaccag tgtcattcag tggacctgtc tgcaatgatg gagatgttct 121381 acatctacac catccatttc agagccactg gccagatgtg accattgagg atttgaaagg 121441 tagccagcat gagtagagaa ctgaatcttg tttaatctta atttatttaa gtttaaattt 121501 aattagctac atatggccag tggctaccat attagataga aggttctaga aggttggaaa 121561 gcaggtagaa aggagatgtg gccagtagct accatattgg atagaaggct ctagaagttt 121621 ggaaagtagt tggaaagatg tggccagtag ctaccatatt ggatagaagt ttctagaagg 121681 caggaaagta aatagaaagg aggtgtggcc agtagctacc atactggata aaagttctag 121741 aagtttggaa agtagatcga aaggagatgt ggccagtagc ttccatattg gatagaaggt 121801 tctagaaggt tggaaagtag ctagaaaggc aatgtggcca gtagctacca tattggatag 121861 aaggttctag aaagttggaa agtagataga caggagatgt ggccagtagc taccatattg 121921 gatagaaggt tgtagaagtt tggaaagtag ataggagatt tggagtcact taatgctaca 121981 cctgtgagtg ccacctgagg tcatcttggg tgatttttgc acatctcatg ctttagtaaa 122041 tactctttta gcttctgatt tgtgcaggtg gggaatgggg gtcagatggg aacagaggct 122101 ttctaggggt atgtccagca ggctgacaga ctcagccagt aagcagacaa tgagcttgga 122161 tagattcaga cagtttatta cttatacaga cagtaaaggc aaggtgcagc ctcctgtgtc 122221 ccttgtccta ctggacgaca ccaaaacaaa gggggctgaa tgacaggcat taggaatggt 122281 aggaaccccg tggctgaaga gctgcttcca tgcttcaacg atagcttgct gctgtcgcct 122341 gtggggaggc gaggtggaaa gccccacatc tcagagtggc cagagaggca gatgaggggc 122401 tgcctcgtgg cagcctccca caagagaggg aggaaaatga gagatggcca tgcggtcgct 122461 ccttgcaaga ctgcccgtct cccacaccct ggcaggatca cggggaattc caccaaggtg 122521 agatcagtct gtgatcagct ttgcctcctt ggcctctgtg gacacggacg gaggagcaag 122581 gttgtcaggg cacctgtgtg aagcagtttc cctacctatg ggacacacag caagtgaggt 122641 ctagagctgg gttcctcaac tctggcgcta ctgatgtttc gggctggatc atcctttgca 122701 gggaaggctg gcctgtgcat tatgggatgt ttagtggtat ccctggcctc tacccgatag 122761 atggccttcg cagtccctcc cctagtcgtg acaaccaaaa atgtctccag ccatcgtcaa 122821 gtgtttcctg ggggcacagt cacccctgaa tggttgtgac tgtgaccgtt ctagacccaa 122881 ttgctggtct agaatgcagg gtgaggggga cgcttacctc ctgcacactc taccctctgt 122941 ctgcaggtgg gtgggctttg agcatgctgg cttccaaggg cagcagtaca ttctggaacg 123001 aggcgaatat ccaagctggg atgcctgggg cggcaacacg gcctaccccg ccgagaggct 123061 cacctccttc cggcctgcgg cctgtgctgt aagttctacc actgctgcat cccggggagg 123121 cccaagcccc tcatgtgggc acttcggaat caaaggttcc agagttgaaa ttctcatgtc 123181 gccacttcaa gctgtgtgac aagggctgtt cagtctcttt cctctccaag cctcggtttc 123241 ttcatcttga aatggggcaa tagtaccaat gttcttggga agaagaacgt ataacatctg 123301 accagagcct gccttaaact ctctgctcac agcaaccctg gttgtgatag gagcccctgc 123361 cagttttgtg cccttttttt tttgtgattc agggaccctg agagaatctg ggtaccaggg 123421 atggtagggc caagtggaga ggaaaggttt catttcttgc tcctaattca gcatcagtgc 123481 tcctgatctt acattttact gctgtttttg gagtgtgggt ggaggtgggc attattattc 123541 ccatctcacc aatgaaaaca ttgaggctca gaaaggtgta ttttaggctg ggcatggtgg 123601 ctcacacctg taatcccagc actttgggag gccaaggcgg gtagatcact tgaggccagg 123661 agttcaagat cagcttggcc aacatggtga aaccctgtct ctactgaaaa tacaaaaaaa 123721 ttagttgagt gtggtggcac atgcctgtag tcccagctac tcaggaggct gaggcaggag 123781 aatcacttga acccgggaga cagaggttgc agtgagccaa gatcgcacca ctgcattcca 123841 gcctgggcgc cagagcaaga ctctgtctca aaacaaaaca aaacaaaaaa acaaaaacaa 123901 acaaacaaac acaaaaacaa caaagtgtgt tttatttttc aattcttatt gatttaaaat 123961 gacataagtc accatattta ccatccaaaa ttctgaagga aaatcctgct atggtcaggg 124021 ttgagtatcc cttacctgaa atgcttggga ccagaagtgt tttggatttt gaatattttc 124081 cccagatttt gcaatatttg taatacttac agttgagcag cctgaatcca aaaatccaaa 124141 atctgaaata ctgcagtgag cattctttga gtgtcatgtc gtcactcaaa aagttttaga 124201 ttttggagca ttttggattt cggattttta ggttagggat gttcaaccta tacatttgtg 124261 tctctatgta tatacacatc caaatcaaca tctacatcta tatctgtata atctatatat 124321 cttatctata ttgcacacat aattttgtca taaaattgga atcataatga atctaaggct 124381 tggaaactag cttttctcac tgagcaaatg tgtcgcaatc atcagtccag gtgattaaat 124441 ctcacttcca tcgtttttca tagctcccta gtattctgtt ttgtgaataa gtccatttgt 124501 tcatttattc tgcaaacagt ctagatattg tgctaatata gggacccgtg gcgggctgca 124561 acagaaacaa tccctcattc ccagcagttt gcaactcctg atttcattta gtcctcaaaa 124621 gaaccctgaa aaatgggtag ggttagtttc attttataga tacaggaaca gaggctcaga 124681 aatcctgtct aacctcagtc agttggtaag tggcagctgc ctcctagcag gaaggatttg 124741 agatgatatc tctacacctc ctttctttcc ttcttttttc ttttcttttc ttttcctttc 124801 ccctccctcc ctccctccct ccctcctttc tttctttctt tctttctttc tttctttctt 124861 tctttctttc tttctttctt tctttctttc tttttctttc tttctttctt tccttctttc 124921 tttttctttc tttctttctt tctctttctt tccttttctt tctttctttc tttctttctt 124981 tctttctttc tctttctttc cttttctttc tttctttcct tctttcattc tctgtctttc 125041 tttttctttc tttttctctt tctttctttc tctttctctc tctctccttc cttccttcct 125101 tctctctctc tctttctttc tttctttctt ttgatggagt ttttgctttt gttgtccagg 125161 ctggagtgca atgccacaat ctcggctcac tgcaacctct gcctcccagg ttcaagcgat 125221 tctcctgcct cagtctccca agtagctgag attacaggca tgtgccacca cgcttggcta 125281 attttttgta tttagtagag acagggtttc accatgttcg tcgggctggt cttgaactcc 125341 tgacctcagg tgatccaccc tcctcggcct cccaaaatgc tgggattata ggcgtgagcc 125401 accgcgccgg gcctctacac cctttttctt aaccacagtg taatgcctca gcttaccact 125461 ggatccccaa ttttggacat aaaagttgac tccttttttt gtttgtttgt ttttgaaaaa 125521 actgttatga tgaacattct tgcagcctat ttattccatg attattttct ttgtggaaat 125581 ttctggaagg gcaaatggca aggtttctgg tacctgttgc cttattgccc ttccaaaagg 125641 tttgcaagga aggctgattt gggagacaag ggcagggagt gtggaggcca ggagaggctc 125701 ctgggtttcc aactgggagc ctgctggtct gactgtgttc cctgttcctg tagaaccacc 125761 gtgactcgag gctgacaatc ttcgagcaag agaacttcct gggcaagaaa ggagagctga 125821 gcgatgacta tccttccctc caggccatgg gatgggaagg caatgaagta gggtccttcc 125881 acgtccactc tggggcgtaa gtgtattcaa ggctctacct ggcaggggag gggctactgg 125941 gaggggtagg tgtacctcct gtgaaaactg tgtgctgggg acttttgtgc tctgatgccc 126001 acattccaga ggagaacact gaggcacagg gaggtatcag gcagaatttg agggactgaa 126061 acccagatca aactggtttg agcccagaag atatttatca gcccctgtaa ttagcttcaa 126121 gcatagctgg atccaggtac tacagcaggt atctctctcc ccccacaacc cccaacctgt 126181 tgccctgcct gcctctgtgt gactatattc ttagacaagc tcttcctgtg tgaaggcaaa 126241 ggtggcggct tcatcgttac atccagccct agggaaagac agtgtccagg ctcagtgatt 126301 ctattaaagg ttccaagaaa ggtgctcatt ggtccagctg ggtcctgtgc tcaactctga 126361 accaatcatc atgcccatag gaatggggtg ttctgattgg ctaggtttgg tcatgtggtc 126421 cattgctaga ttcagcctca ttttctcacc ttggctgaga gtgagaggcc acaggctctt 126481 tggctagaag aggggagaat gaatgcccct agccataaaa cacaggtgtg gctccagcaa 126541 cctgctcaaa gtccatcagc aggtaggtaa caatccagct tcaaactcag cactgactgg 126601 ccctttcctt tccacccagg aggtggcagg tgaaggctaa ggtgtgagat tcttatttag 126661 ggaggacatc agaaccactt tctgaggtgc cccttggctc ctggtaggca gggggcaaag 126721 atgggaggta gggtttttca tctttttttt ttttcttttt ttggctttag ttttctcagg 126781 aaccaccaaa gaatcatgtt gttttagagt aagagggacc ccttgtgatt cttgagttca 126841 cttcctcttt tacagaaagg gaaatggagg cttggaaaac cagacaccaa tccttgtcct 126901 gaacagagta gaaatcaatg agttgggcca gacacggtgg ctcacgcctg taatcccagc 126961 cctttgggag gctgaggtgg gcggatcacc tgaggtcagg agttcgagac cagcctggcc 127021 aacatggtga aagcccgtct ctactaaaaa tacaaaaaat tagctgggtg tcgtggctat 127081 gcctctagtc ccagccactc ggatgaggca gaattacttg aacccaggag gcagaggttg 127141 cagtgagccg aggtcatgcc acggcactcc agtctgggca acagagtgag actctgtctc 127201 aaaaaaaaaa aaaaaaaaaa aatcaatgaa ttgaaatgat agcttttttt cttggaagtg 127261 cctctggata aggaaggagg gagaaagggg aaagaacaag acttcccatt ccttcacttg 127321 ttcacttatt cagcaaatat ttaggaaccc cttcttcctt gctagacttt tatatattta 127381 tgcaggtggt gataacgatg aacaaaacat acacgttcct acactcagga gcttgcctgc 127441 tggcagggga ggctgatatt ttacaaataa tttgaaaact atcgaattac atgagtgcta 127501 caaaggtata ataagggact gtggcctgat ttggggaacg agtcagattg ccctcataat 127561 taagatgagc catgaaggac aaatacaact tgtctaggaa aagaaaggct gggatggtgc 127621 ttctggcaaa gggaatggca tgatcaaagg ccctgtggta ggggagagag catggcacat 127681 tgtgagcact gaagaaaggc caggatggct ggaattgtgt gcgtgtgcat agatcccttt 127741 gcccctgtgt tgatcaccat gctggtgtct ctgtagagta actgtctgcc ttttctcttt 127801 tccagctggg tttgctccca gtttccgggc taccgaggat ttcagtatgt gctggaatgc 127861 gatcaccatt ccggtgacta caaacatttc cgggagtggg gctctcatgc cccgaccttc 127921 caggtgcaga gcatccgcag gatccagcag tgaacagggg tgcggcacgg aggagcgcat 127981 gcgtgcttat ctgcaatgga ggcgctctgg aggctgtggt gtgttctctc cttctgcctc 128041 cccctgtaac ctgtgtgaac ccagcaccca tgtgaactgg tccgtgcaca gtcagcacaa 128101 aaaactcaaa cgaataaaaa agagaaagtc tggtattagt tgtgcttttc aattttatta 128161 atcttttcaa agaaccggtt tttagccttg ttttctctat cattattcca ctgattttta 128221 ctcttaactt cagtatttcc tttcttctat ttacttcagg cctactttgt tcttagtttt 128281 tctagctttt ttttaaggca gacttctggg tcattaattt taggcctctc tttcccccac 128341 ccagtacggt catttagcta taaatttccc tccaagtatg gctttagctg cttcccacaa 128401 atattaattt gatgtgtttt cattgtcatt cagtttgata ttttctagtt tcacttgtgg 128461 tttttttttt ttcttttgac ctatgggcta tttaaaggga tgttatttaa attccaaata 128521 ctttaggatt ttctaggtgt ttattgttat tggtgtgttt ttaaagacca atgtggtcgg 128581 agaacatatt ccttaagatt taagtcttct gcaatgaatt aagacttatt ttatggccca 128641 gcacatggtt tttgtatttt aattttattt tttgtttgtt tgtttgtttt gtttttaatg 128701 agtcaaggtt tcactcgtca cccaggctgg agtgcagtgg tgcaatcttg tctcactgca 128761 gcctcgacct cccaggttca agcaattctg ccttagcccc acaagcagct gagattacag 128821 gtgtgtgcca ccacacctgg taatttttgt atttggtagt gacagcgttc gccatgttgc 128881 ccaagcttgt cttgaactcc tgagctcagg acatctgcct gcctcagcct accaaagtgc 128941 caggattaca ggcctgagcc actgctcctg gccacatggt ctatctttga aaggcttggt 129001 agtacacttg aaaactattt atattctgca gttattgggt acagtgttct gtaaatgtca 129061 gatcaaattt gttaacagtg ttgatgcgat ttttctatac ccttgctgat tttttttttt 129121 ttttttgaga caggatctca ttctgtcacc caggcaggag tataatggca caatcatagt 129181 tcaccgcagc ttgtttttaa aaaatcaatt ttagtgaatt acatagaaag aatcattgat 129241 tatttggatt tatctgtttc ttaccttagt tttgtctatt tttgcttcct gatttttttt 129301 ttccttcagg aattgtgatg ctctgttgtt aggtgtacat atatttatga ttgctatgtc 129361 ttcttgataa attgttccta gcttgtggtt ttcaaagcca cgcatggaga gtcagttctc 129421 tcccctagca aaacttcttt tcacgcgcgt ccgtgtgaag agaccaccaa acaggctttg 129481 tgtgagcaat atggctgttt atttcacctg ggtgcaggcg ggctgagtcc aaaaagagag 129541 tcagcgaagg gagatagggg tggggccatt ttataggatt tgggaaggta atggaaaatt 129601 acagtcaaag ggggttgttc tctggtgggc aggggtggat ctcacaaagt actttctcaa 129661 gggtggggag aattacaaag aaccttctta agggtggggg agactacaaa gtaccttctt 129721 aagggtgggg gagattacaa agtacattga tcagttaggg tggggcagga acaaatcaca 129781 atggtggaat gtcatcagtt aaggctgttt ttacttcttt tgtggatctt cagttacttc 129841 aggccatctg gatgtatacg tgcaagtcac aggggatgcg atggcttggg ctcagaggcc 129901 tgacacttct catacctttt ttctctcttc ccaattctct ctctcacaca cactaccctg 129961 gagtttctca tttaacaccc tccatgctcc aaagaaagcc tataaaacaa agaaacccct 130021 gtttcacttg aagactaagt ggcaggattg gtctcttcta tctctctttt tctcctcccc 130081 tccttacttt ctcctggttc tgcattttat aggttgtcta aacctctctc atgagccctc 130141 tagctcttgt ttcaggctct cttggcccat cttcctctgc ccacaaatgt ccctgatcag 130201 tctgataggg acaggaaggg gtggggacca gaacattctg ctggaaaaga taacattatc 130261 tggctcagga aggggagggg aatgtttagg aacatttgcg cctcagtaaa ttttaactcc 130321 tcattgtatc gtaccattta ttcaacagat catttgcctg ggacatacaa tggtaaacat 130381 gacacctaaa gttttttctc tcactcagtt taatggggaa tgaatgcgga aaagtacaga 130441 gctgtttgca aatattctat gataggggct gtgggggcgg gaagcatggg gttggggggc 130501 ggacccttct tgtgcctacc tctactcaga ccgtcccaga acatctgacc actgcttgac 130561 accttgatgt gaacgtctaa tacacatctc aaacaaaacc ggaccaaaca taactgaagc 130621 tacacttttc aagttgttca ggctaaatgt ctggaatgat ccataaccct tctctttctc 130681 tcacacctca cattctaatc cattggcaat tcctactagc tctactcttt tttttttttt 130741 ttagatggca tcttgctttg tccccaggct ggagtgcagt ggcgcgatct cagctcactg 130801 caacctctgc ctcccaggtt caagtgattc tcctgcctca gccccccaag tagctaggat 130861 tacaggcatg tgcatcacac ccagctaatt tttgcgtttt ttgtagagac agggtttcac 130921 catgttgacc aggctggtct cgaactcctg acctcaagtg atttgcctgc ctcggcctcc 130981 caaagtgctg agattatagg catgagccac ggcacccagc cactagctct actcttaata 131041 tgtattagaa tccaacccac tcatcatgcc cactgctgcc attctggtcc cagccaccct 131101 cttctctctc cttctcatag tccctctgct tttgtctttg cccatgagtg atctgctctc 131161 tgcccagcat tcagagtggt cctgctagaa acgtaaggca gatcttgtca cactcctgct 131221 taaaatcctc ccatgctccc caccatgcca agaaaacaac aataaaaaaa cccagtatta 131281 ttcccagtcc tgtgaagctt gtcatcatct gcatctcccc actctcccca caacctattt 131341 tgatctcatg tccctgtatt ggttgttctt taaagatgcc aggtggctaa catctgta // LOCUS HS46KDA 2434 bp RNA PRI 11-MAR-1997 DEFINITION H.sapiens mRNA for 46 kDa coxsackievirus and adenovirus receptor (CAR) protein. ACCESSION Y07593 NID g1881446 KEYWORDS 46 kDa receptor protein; coxsackie and adenovirus receptor protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2434) AUTHORS Bergelson,J.M., Cunningham,J.A., Droguett,G., Kurt-Jones,E.A., Krithivas,A., Hong,J.S., Horwitz,M.S., Crowell,R.L. and Finberg,R.W. TITLE Isolation of a common receptor for Coxsackie B viruses and adenoviruses 2 and 5 JOURNAL Science 275 (5304), 1320-1323 (1997) MEDLINE 97190109 REFERENCE 2 (bases 1 to 2434) AUTHORS Bergelson,J.M. TITLE Direct Submission JOURNAL Submitted (20-AUG-1996) J.M. Bergelson, Dana Farber Cancer Institute, Lab of Infectious Disease, 44 Binney Street, Boston Ma 02115, MA, 02115, USA FEATURES Location/Qualifiers source 1..2434 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /cell_type="cervical carcinoma" gene 60..1157 /gene="CAR" CDS 60..1157 /gene="CAR" /codon_start=1 /product="coxsackie and adenovirus receptor protein" /db_xref="PID:e284081" /db_xref="PID:g1881447" /translation="MALLLCFVLLCGVVDFARSLSITTPEEMIEKAKGETAYLPCKFT LSPEDQGPLDIEWLISPADNQKVDQVIILYSGDKIYDDYYPDLKGRVHFTSNDLKSGD ASINVTNLQLSDIGTYQCKVKKAPGVANKKIHLVVLVKPSGARCYVDGSEEIGSDFKI KCEPKEGSLPLQYEWQKLSDSQKMPTSWLAEMTSSVISVKNASSEYSGTYSCTVRNRV GSDQCLLRLNVVPPSNKAGLIAGAIIGTLLALALIGLIIFCCRKKRREEKYEKEVHHD IREDVPPPKSRTSTARSYIGSNHSSLGSMSPSNMEGYSKTQYNQVPSEDFERTPQSPT LPPAKVAAPNLSRMGAIPVMIPAQSKDGSIV" BASE COUNT 743 a 443 c 487 g 761 t ORIGIN 1 gaattcccag gagcgagagc cgcctacctg cagccgccgc ccacggcacg gcagccacca 61 tggcgctcct gctgtgcttc gtgctcctgt gcggagtagt ggatttcgcc agaagtttga 121 gtatcactac tcctgaagag atgattgaaa aagccaaagg ggaaactgcc tatctgccgt 181 gcaaatttac gcttagtccc gaagaccagg gaccgctgga catcgagtgg ctgatatcac 241 cagctgataa tcagaaggtg gatcaagtga ttattttata ttctggagac aaaatttatg 301 atgactacta tccagatctg aaaggccgag tacattttac gagtaatgat ctcaaatctg 361 gtgatgcatc aataaatgta acgaatttac aactgtcaga tattggcaca tatcagtgca 421 aagtgaaaaa agctcctggt gttgcaaata agaagattca tctggtagtt cttgttaagc 481 cttcaggtgc gagatgttac gttgatggat ctgaagaaat tggaagtgac tttaagataa 541 aatgtgaacc aaaagaaggt tcacttccat tacagtatga gtggcaaaaa ttgtctgact 601 cacagaaaat gcccacttca tggttagcag aaatgacttc atctgttata tctgtaaaaa 661 atgcctcttc tgagtactct gggacataca gctgtacagt cagaaacaga gtgggctctg 721 atcagtgcct gttgcgtcta aacgttgtcc ctccttcaaa taaagctgga ctaattgcag 781 gagccattat aggaactttg cttgctctag cgctcattgg tcttatcatc ttttgctgtc 841 gtaaaaagcg cagagaagaa aaatatgaaa aggaagttca tcacgatatc agggaagatg 901 tgccacctcc aaagagccgt acgtccactg ccagaagcta catcggcagt aatcattcat 961 ccctggggtc catgtctcct tccaacatgg aaggatattc caagactcag tataaccaag 1021 taccaagtga agactttgaa cgcactcctc agagtccgac tctcccacct gctaaggtag 1081 ctgcccctaa tctaagtcga atgggtgcga ttcctgtgat gattccagca cagagcaagg 1141 atgggtctat agtatagagc ctccatatgt ctcatctgtg ctctccgtgt tcctttcctt 1201 tttttgatat atgaaaacct attctggtct aaattgtgtt actagcctca aaatacatca 1261 aaaaataagt taatcaggaa ctgtacggaa tatattttta aaaatttttg tttggttata 1321 tcgaaatagt tacaggcact aaagttagta aagaaaagtt taccatctga aaaagctgga 1381 ttttctttaa gaggttgatt ataaagtttt ctaaatttat cagtacctaa gtaagatgta 1441 gcgctttgaa tatgaaatca taggtgaaga catgggtgaa cttacttgca taccaagttg 1501 atacttgaat aaccatctga aagtggtact tgatcatttt taccattatt tttaggatgt 1561 gtatttcatt tatttatggc ccaccagtct cccccaaatt agtacagaaa tatccatgac 1621 aaaattactt acgtatgttt gtacttggtt ttacagctcc tttgaaaact ctgtgtttgg 1681 aatatctcta aaaacataga aaacactaca gtggtttaga aattactaat tttacttcta 1741 agtcattcat aaaccttgtc tatgaaatga cttcttaaat atttagttga tagactgcta 1801 caggtaatag ggacttagca agctctttta tatgctaaag gagcatctat cagattaagt 1861 tagaacattt gctgtcagcc acatattgag atgacactag gtgcaatagc agggatagat 1921 tttgttggtg agtagtctca tgccttgaga tctgtggtgg tcttcaaaat ggtggccagc 1981 cagatcaagg atgtagtatc tcatagttcc caggtgatat ttttcttatt agaaaaatat 2041 tataactcat ttgttgtttg acacttatag attgaaattt cctaatttat tctaaatttt 2101 aagtggttct ttggttccag tgctttatgt tgttgttgtt tttggatggt gttacatatt 2161 atatgttcta gaaacatgta atcctaaatt taccctcttg aatataatcc ctggatgata 2221 ttttttatca taaatgcaga ataatcaaat acattttaag caagttaagt gtcctccatc 2281 aattctgtat tccagacttg ggaggatgta cagttgctgt tgtgtgatca aacatgtctc 2341 tgtgtagttc cagcaaatca agctgagctt tgaaaaagtt tgtcttagtt ttgtgaaggt 2401 gatttattct tagaaaaaaa aaaaaaaaaa aaaa // LOCUS HS560B9 99074 bp DNA PRI 23-JAN-1998 DEFINITION Human DNA sequence from PAC 560B9 on chromosome 1q24-1q25. Contains profilin-like pseudogene, 60S ribosomal protein L4 pseudogene RNA binding protein, ESTs, GSS. ACCESSION Z98751 NID g2814366 KEYWORDS 1q24-1q25. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 99074) AUTHORS Wray,P. and Patel,D. TITLE Direct Submission JOURNAL Submitted (22-JAN-1998) sanger.ac.uk/HGP/Chr1/) Sanger Centre, Hinxton, Cambridgeshire, CB10 1SA, UK. E-mail enquires: humquery@sanger.ac.uk Clone requests: clonerequest@sanger.ac.uk COMMENT IMPORTANT: This sequence is the entire insert of clone 560B9. During sequence assembly data are compared from overlapping clones. Where differences are found these are annotated as variations together with a note of the overlapping clone name. Note that the variations annotated may not be found in the sequence submission corresponding to the overlapping clone as we submit sequences with only a small overlap. This sequence was generated from part of bacterial clone contigs of human chromosome 1, constructed by the Sanger Centre chromosome 1 mapping group. Further information can be found at http://www.sanger.ac.uk/HGP/Chr1/ This sequence has been finished according to sequence map criteria as follows. An attempt is made to resolve all sequencing problems, such as compressions and repeats, but not necessarily within known annotated human repeat sequence elements (e.g. Alu). Where the sequence is ambiguous, there is an annotation using the 'unsure' feature key. The true left end of clone 560B9 is at 1 in this sequence. The true right end of clone 454G6 is at 8384. The true right end of clone 560B9 is at 99074. 560B9 is from the library RPCI4 constructed at the Roswell Park Cancer Institute by the group of Pieter de Jong. For further details see http://bacpac.med.buffalo.edu/. FEATURES Location/Qualifiers source 1..99074 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1q24-1q25" /clone="560B9" /clone_lib="RPCI4" repeat_region 1..67 /note="AluJ repeat: matches 234..300 of consensus; incomplete repeat" repeat_region 270..575 /note="AluJo repeat: matches 301..2 of consensus" repeat_region 883..1324 /note="L1ME2 repeat: matches 902..465 of consensus" repeat_region 1328..1455 /note="MLT1B repeat: matches 387..260 of consensus" prim_transcript <1444..>2293 /note="match: multiple ESTs; match: AA158962 AA149656 AA496671 AA165699 W96880; match: AA186868 AA045232 AA242109 AA272144; match: AA292502 AA312970 AA151629 AA441836 AA293075; match: AA609290 W56174 AA069077 W77468 AA486685; match: AA081001 AA102360 N73205 AA404312 AA050910; match: H72378 AA404711 AA603126 AA600850 AA410195; match: N53683 R74533 D58599 AA614570 AA293603; match: AA158343 AA362229 AA316744 AA607409; match: AA454672 AA523775 W49571 W49572 W93555; match: AA293487 AA186929 AA599836 N20266 AA135204; match: N41671 AA522611 H72828 AA312317 AA464003; match: AA027630 T57311 AA305264 AA160077 F18789; match: AA410880 AA360770 AA501449 AA151566; match: AA053476 T47435 N73974 AA464642 AA443916; match: AA551088 AA311986 AA058808 AA399146; match: AA188358 AA551088 AA311986 AA058808; match: AA399146 AA188358 AA551088 AA311986; match: AA058808 AA399146 AA188358 D51345 D54417; match: AA403006 AA444013 AA365271 AA278070; match: AA394143 AA456119 AA441783 AA356329; match: W46608 AA243668" unsure 1599..1647 gene complement(1704..2107) /gene="dJ560B9.1" CDS complement(<1704..>2107) /gene="dJ560B9.1" /note="similar to profilin; match: SWISS-PROT; P07737; P35080; match: EMBL; J03191; match: GDB; 120278; PFN1; 120279; PFN2" /codon_start=1 /pseudo /product="dJ560B9.1" /db_xref="PID:e1246374" repeat_region 2434..2733 /note="AluY repeat: matches 2..301 of consensus" repeat_region 2734..2901 /note="MLT1C repeat: matches 167..1 of consensus" repeat_region 2905..3350 /note="L1MC3 repeat: matches 461..13 of consensus" repeat_region 3364..3445 /note="L1 repeat: matches 5262..5181 of consensus" repeat_region 4349..4649 /note="AluSx repeat: matches 3..301 of consensus" repeat_region 7081..7248 /note="AluSg repeat: matches 298..131 of consensus; incomplete repeat" repeat_region 7250..7545 /note="AluSx repeat: matches 6..302 of consensus" repeat_region 7695..7891 /note="MER20 repeat: matches 218..1 of consensus" repeat_region 8323..8623 /note="AluSx repeat: matches 1..301 of consensus" repeat_region 8759..9058 /note="AluY repeat: matches 1..300 of consensus" repeat_region 9687..9991 /note="AluSq repeat: matches 303..1 of consensus" repeat_region 10131..10424 /note="AluJb repeat: matches 301..1 of consensus" repeat_region 11344..11462 /note="MER45 repeat: matches 178..59 of consensus" repeat_region 11894..12192 /note="AluSx repeat: matches 298..1 of consensus" repeat_region 12371..12671 /note="AluSx repeat: matches 301..1 of consensus" repeat_region 12980..13712 /note="L1MB3 repeat: matches 931..191 of consensus" repeat_region 13801..14105 /note="AluSx repeat: matches 301..2 of consensus" repeat_region 14107..14164 /note="L1MB7 repeat: matches 199..142 of consensus" prim_transcript <14237..>15671 /note="match: multiple ESTs; match: AA484295 AA554927 AA313899 AA131106; match: AA134014 AA232503 AA306214 AA181142; match: AA165105 AA181143 R06906 AA081193; match: AA608720 AA291853 W05289 AA187447 F22752; match: AA315050 R01266 AA165035 AA075360 W28520; match: AA173056 AA205863 AA094318 AA085561; match: AA057546 AA307678 AA484638 AA078991; match: AA311262 AA162133 AA224822 AA525081; match: W61900 AA322833 AA126094 AA515760; match: AA595123 AA075616 AA622448 AA558351; match: AA533268 AA133868 AA209199 AA563783; match: AA588237 AA424968 AA496943 AA276489; match: AA486668 H22358 AA312244 AA181253; match: AA143750 AA071823 AA604471 AA071268; match: AA604557 AA147078 D55355 AA114132; match: N32663 AA602543 H87377 AA328037 AA205817; match: AA135186 N88364 N85691 AA486124 AA134854; match: C14120 AA595141 AA326258 AA157762 T59393; match: AA468383 AA599900 AA149748 AA079597; match: AA580979 AA120221 T57142 AA134229 AA162085; match: Z20174 AA223726 AA442657 AA206795 AA484833; match: AA098870 W38925 T19884 F21514 F01193; match: AA584945 AA565676 H42861 L44529 AA307242; match: AA210944 Z20182 N36088 AA605155 AA602249; match: AA209238 W85105 AA313648 AA126854 AA192855; match: AA578318 AA127589 F15204 AA199725 AA228987; match: AA593868 AA111414 AA081023 N83700 AA314062; match: AA313811 AA405262 AA143713 AA506342; match: AA157071 AA228355 N29928 AA188163; match: AA374383 AA149618" gene complement(14342..15627) /gene="RPL4" CDS complement(14342..15627) /gene="RPL4" /note="60S ribosomal protein L4 pseudogene; match: EMBL; X73974; L20868; match: PIR; S37197; S39803; match: SWISS-PROT; P36578; P39029" /codon_start=1 /pseudo /product="dJ560B9.2" /db_xref="PID:e1246375" repeat_region 15679..15754 /note="MER45 repeat: matches 79..2 of consensus" repeat_region 16253..16554 /note="AluJo repeat: matches 1..302 of consensus" repeat_region 16583..16755 /note="MLT2_internal repeat: matches 596..420 of consensus" repeat_region 17708..18010 /note="AluSx repeat: matches 1..302 of consensus" repeat_region 18550..18851 /note="AluJb repeat: matches 302..1 of consensus" repeat_region 18918..19227 /note="AluY repeat: matches 301..1 of consensus" repeat_region 20034..20924 /note="L1PA2 repeat: matches 891..1 of consensus" repeat_region 20775..24333 /note="L1 repeat: matches 5390..1838 of consensus" repeat_region 24342..25882 /note="SVA repeat: matches 1..1371 of consensus" repeat_region 25883..26838 /note="L1 repeat: matches 1849..896 of consensus" repeat_region 26834..27607 /note="L1 repeat: matches 772..4 of consensus" repeat_region 28964..29149 /note="AluJo repeat: matches 121..299 of consensus; incomplete repeat" repeat_region 29159..29289 /note="FLAM_C repeat: matches 2..132 of consensus" repeat_region 29330..29626 /note="AluY repeat: matches 301..2 of consensus" repeat_region 29720..29900 /note="AluJo repeat: matches 119..302 of consensus; incomplete repeat" repeat_region 30064..30095 /note="16 copies of 2 mer 84 % conserved" repeat_region 30646..30944 /note="AluJo repeat: matches 1..301 of consensus" prim_transcript <31345..>33192 /note="match: multiple ESTs; match: R45399 AA416569 Z44493 T78817 AA469097; match: F10934 F05174 AA299279 F13519 H18821; match: R25060 H28707 R18720 H28837 R42084 R21673; match: R20944 AA417234 T16554 W05625 R42728 Z38212; match: R46563 N36489 AA455922 AA456391 T16866" repeat_region 31927..31997 /note="MIR2 repeat: matches 143..74 of consensus" prim_transcript 33325..33719 /note="match: 3' EST AA400295 clone 742691" prim_transcript 34209..>34443 /note="match: 3' EST N52608 clone 283903" prim_transcript <34516..>35724 /note="match: multiple ESTs; match: C23305 AA576581 AA592746 D52028 H06263; match: AA387399 D51871 D51955 H56724 N50682 R70623; match: AA118332 H67970 AA401519 AA513327 AA034844; match: AA516632 AA424329 AA498204 AA209655 R68563; match: F11025 R89094 AA117985 AA495734 D52153; match: D51743 AA495797 D61763 W97983 AA061021; match: H42697 AA491221 N51629 AA570541 N66537; match: AA283660 AA448191 D52020 T91746 AA281666" repeat_region 34615..34664 /note="25 copies of 2 mer 96 % conserved" repeat_region 38468..38695 /note="L1ME1 repeat: matches 906..682 of consensus" repeat_region 39207..39545 /note="AluSx repeat: matches 1..301 of consensus" repeat_region 39773..40081 /note="AluSp repeat: matches 6..303 of consensus" repeat_region 40695..40753 /note="MIR2 repeat: matches 142..83 of consensus" repeat_region 41418..41486 /note="MIR2 repeat: matches 146..77 of consensus" repeat_region 42998..43336 /note="MER7A repeat: matches 335..4 of consensus" repeat_region 43960..44065 /note="MIR2 repeat: matches 146..31 of consensus" repeat_region 44192..44336 /note="MIR2 repeat: matches 2..145 of consensus" repeat_region 44630..44671 /note="5S repeat: matches 42..1 of consensus" repeat_region 44740..45034 /note="AluJb repeat: matches 1..300 of consensus" repeat_region 45679..45971 /note="AluJb repeat: matches 1..295 of consensus" repeat_region 46605..46849 /note="MIR repeat: matches 4..255 of consensus" repeat_region 46916..47209 /note="AluJo repeat: matches 1..302 of consensus" repeat_region 48808..48938 /note="FLAM_C repeat: matches 128..2 of consensus" repeat_region 49169..49330 /note="MLT1E repeat: matches 405..568 of consensus" repeat_region 51605..51713 /note="L1 repeat: matches 3160..3272 of consensus" repeat_region 51745..51860 /note="MER42B repeat: matches 1179..1298 of consensus" repeat_region 51860..52669 /note="L1 repeat: matches 3494..4309 of consensus" repeat_region 52756..53856 /note="L1 repeat: matches 4293..5390 of consensus" repeat_region 53707..54114 /note="L1PA3 repeat: matches 1..409 of consensus" repeat_region 54104..55029 /note="MER11A repeat: matches 1100..59 of consensus" repeat_region 55171..55663 /note="L1PA2 repeat: matches 398..891 of consensus" repeat_region 55668..56072 /note="L1 repeat: matches 4388..4788 of consensus" repeat_region 56109..56490 /note="L1MB4 repeat: matches 927..534 of consensus" repeat_region 56505..56623 /note="AluSq repeat: matches 3..116 of consensus; incomplete repeat" repeat_region 56885..57176 /note="AluSx repeat: matches 295..1 of consensus" repeat_region 57246..57538 /note="AluSg repeat: matches 289..1 of consensus" repeat_region 57747..58136 /note="L1 repeat: matches 4871..5268 of consensus" repeat_region 58139..58448 /note="AluJo repeat: matches 1..301 of consensus" repeat_region 58476..59193 /note="L1ME1 repeat: matches 60..827 of consensus" repeat_region 59211..59445 /note="MER44C repeat: matches 724..501 of consensus" repeat_region 59551..59621 /note="MER44A repeat: matches 71..1 of consensus" repeat_region 59623..59703 /note="L1MA9 repeat: matches 974..1055 of consensus" repeat_region 60428..60506 /note="MER46 repeat: matches 1..86 of consensus" repeat_region 60698..60782 /note="MER46 repeat: matches 145..234 of consensus" repeat_region 60795..60847 /note="MLT1D repeat: matches 6..58 of consensus" repeat_region 60848..61016 /note="AluSg repeat: matches 299..131 of consensus; incomplete repeat" repeat_region 61019..61172 /note="MLT1D repeat: matches 44..207 of consensus" repeat_region 61131..61428 /note="MLT1D repeat: matches 198..505 of consensus" repeat_region 62112..62243 /note="AluY repeat: matches 1..132 of consensus; incomplete repeat" repeat_region 62278..62381 /note="MIR2 repeat: matches 142..37 of consensus" repeat_region 62395..62609 /note="L1PA4 repeat: matches 679..893 of consensus" repeat_region 63429..63749 /note="AluJo repeat: matches 2..302 of consensus" repeat_region 64117..64269 /note="L1PB1 repeat: matches 1..170 of consensus" repeat_region 64365..64658 /note="AluJb repeat: matches 1..292 of consensus" repeat_region 64826..65198 /note="L1MD2 repeat: matches 180..573 of consensus" repeat_region 65203..65476 /note="AluSx repeat: matches 29..302 of consensus; incomplete repeat" repeat_region 65477..65816 /note="L1MB8 repeat: matches 569..920 of consensus" repeat_region 67079..67375 /note="AluSx repeat: matches 297..1 of consensus" repeat_region 67836..67881 /note="23 copies of 2 mer 83 % conserved" repeat_region 67884..68182 /note="AluJb repeat: matches 1..298 of consensus" repeat_region 68390..68453 /note="MIR2 repeat: matches 82..145 of consensus" repeat_region 69178..69454 /note="AluJo repeat: matches 287..3 of consensus" repeat_region 70220..70928 /note="MER21B repeat: matches 788..35 of consensus" repeat_region 71185..71289 /note="MIR repeat: matches 258..153 of consensus" repeat_region 74411..74485 /note="MER5B repeat: matches 149..73 of consensus" repeat_region 74713..74874 /note="FRAM repeat: matches 162..1 of consensus" repeat_region 76038..76307 /note="AluSx repeat: matches 25..298 of consensus; incomplete repeat" misc_feature complement(76097..76533) /note="match: GSS B46079" repeat_region 77064..77097 /note="17 copies of 2 mer 100 % conserved" repeat_region 77128..77482 /note="MLT1B repeat: matches 1..390 of consensus" prim_transcript 77960..78338 /note="match: EST AA576332 clone IMAGE:951915" repeat_region 78381..78640 /note="MER42c repeat: matches 1450..1190 of consensus" repeat_region 79268..79544 /note="AluSx repeat: matches 1..281 of consensus; incomplete repeat" repeat_region 79912..80461 /note="MER7B repeat: matches 1205..619 of consensus" repeat_region 80467..80765 /note="AluJb repeat: matches 3..302 of consensus" repeat_region 80774..81367 /note="MER7B repeat: matches 622..1 of consensus" repeat_region 81690..81774 /note="L1MA3 repeat: matches 637..563 of consensus" repeat_region 81775..82084 /note="AluSq repeat: matches 303..2 of consensus" repeat_region 82258..82557 /note="AluSc repeat: matches 299..1 of consensus" repeat_region 83663..83965 /note="AluSx repeat: matches 302..1 of consensus" repeat_region 84208..84383 /note="AluJb repeat: matches 298..120 of consensus; incomplete repeat" repeat_region 86135..86297 /note="MIR repeat: matches 88..258 of consensus" repeat_region 90436..90735 /note="AluY repeat: matches 300..1 of consensus" repeat_region 90888..91186 /note="AluSg repeat: matches 299..1 of consensus" repeat_region 91839..92126 /note="AluY repeat: matches 299..2 of consensus" prim_transcript <93180..>94365 /note="match: multiple ESTs; match: T71966 R68298 AA496677 N21471 D83870; match: T72106 AA514035 T27233 AA305641 N56148; match: W47005 AA143724 AA307902 T30629 AA143762; match: AA211694 AA473631 AA480923 AA091196; match: AA593820 AA460548 T78803 AA219439 AA369721; match: W32657 AA313689 AA473718 AA093873 R62689; match: AA491638 AA480866 C06307 H42711 AA554897; match: AA356897 AA001622 H20596 T09300 AA003840; match: R36350 AA093322 AA177031 T35504 AA076252; match: AA192462 AA450457 W37253 T55643 AA085427; match: AA484019 AA177057" gene 93263..93721 /gene="dJ560B9.3" CDS 93263..93721 /gene="dJ560B9.3" /note="match: U28686 RNA binding protein" /codon_start=1 /evidence=not_experimental /product="dJ560B9.3" /db_xref="PID:e1246376" /db_xref="PID:g2814367" /translation="MSTKEGKIFVGGLNINTDEQVLEDDFSSFGPVSEVVIVKETQWS RGFGFITITNPDAMRAMNRESLDGHQIRVDHAGKSAGEPEEVTLGPMGVVTATLEVVG TRTMGVAGMTVDLEGIDMDMDSPETIMSEARVVMTATQKEITEAIMTTEM" repeat_region 95588..95696 /note="L1MA4A repeat: matches 493..602 of consensus" repeat_region 95868..96073 /note="L1MA8 repeat: matches 558..365 of consensus" repeat_region 96060..96391 /note="L1MA6 repeat: matches 582..927 of consensus" repeat_region 96392..96688 /note="AluSx repeat: matches 1..302 of consensus" repeat_region 96696..96830 /note="L1MA6 repeat: matches 911..1042 of consensus" repeat_region 96904..97144 /note="L1PB2 repeat: matches 900..656 of consensus" repeat_region 97036..97186 /note="L1PA9 repeat: matches 763..611 of consensus" repeat_region 97939..98239 /note="AluSx repeat: matches 1..302 of consensus" BASE COUNT 30163 a 19378 c 20027 g 29506 t ORIGIN 1 gatccggcca ctgcactcta acttgggtta caaattaaga ccctgtctct gaaaaaaaaa 61 aaaaaaagta tagcaatggg atacattcta ttaagaaatg gatcactttt taaattaata 121 attttatatt aatgtatgat atctctattg ttgtttaaca aaaataatga ttccattttt 181 atagattaac acctaagctt taaagagatt gattgaagca ttattattgc aaaacactgc 241 agggaaaccc acaggtgttt gtttgtttgt ttgtttgttt gtttgtttga gacagagtct 301 cactctgtcg cctaggctgg agtgcagtgg tgggatcatg gctcattgca gcctcgacat 361 cctgagctca agtgatcctc ccactttagc atcccaagta gttgggacta caggtgtgtg 421 ccaccatacc tggctgattt tgttttttaa agttttagta gggacaaggt ctctctctgt 481 tgcccaagct ggtcttgaac tcctgagctc aagcaattct tccgcctttg cttcccaaag 541 tgctgggatt acagatgtga cccactgaac ctagcaaccc acagttttca acaacagggt 601 attggttgaa gaaactgtag catacccata tcctatgaaa taaatagacg taaaatttca 661 gaataatatt gttatttaca atattttgtt aaatgaaata agtaaattat aaaactagat 721 atgcaatgcc accctacact tttaaatata tattatctat ataaaagcta gatataaata 781 tagcaaaatg ttcatagtga ttatatttag ataatgagaa tatatgatat tttggcttcc 841 atctttgtat tttttgtaat ataattttta ggagtcaggt ttattgaagg gttatttaca 901 tgtagtaaaa ttcacccgtt ttaggtgtat gtgttttgac taatatgtac aataatgtat 961 ccactaccgt aatcaaaata cagaatattt cctttgtccc taaaagttcc tttgtgcctt 1021 tttgtagcca attccttctc cctagtcacc ggcccttggc aaccactgat cttattttag 1081 tcttgacagt tttgcctttt ctggaatgtc atataaataa aaacatatag actgtagcct 1141 tttgtttttg gcttatatca gttggcatga gtgcttttga gagtcagcca tgttgttgta 1201 tttacaagta gttcgtttct ttgtattaat gagccgtatt tcacagcatg gatgtaccat 1261 aatatgttat ccttatcctt tcatgggtta gtggacattg ggttgtcttc aattttggac 1321 tacttacatc agcttgctag tgctgccata actaagtacc acaaactggg tagcttaaac 1381 aacagaaata tactgtccaa cagttccaaa ggctacaaat ccaaaatcaa ggttccagca 1441 aggtttgttc cctttttttt ttttttttgt agaatctttt ttattcagaa aaaataaaac 1501 aatcctccca aaaaagtttt acaaccacac agaggagggg tatgggtagg ggaaggtgtc 1561 tggccatcag ccctgtaccc cagcccatgt ggttttggca gcaataaggg gtgtggggta 1621 atggccccca aaataaaatg gtgtatgggg agggaaggga tacaaagctg tggggagcgg 1681 tgaagggcaa gggacagacg aggtcagtac tgggaacgcc gaaggtggga ggccatttca 1741 taacatttct tgttgatcaa accaccgtgg acaccttctt tgcccatcag caggactagc 1801 gtcttgtcag tcttggtgac agtgacattg aaggtggggg ctccaccggt actcttggta 1861 tacgaagatc catgcaaaat tccccatcct gcagcagtga gtcccggatc accgaacatt 1921 tctggccccc aagtgtcagc ccattcacaa aaaaacttga cctgtctttg ccagccagga 1981 caccaacctc agctggcgcg atgttggcga aggttttccc ggggacggcg gcccagatgg 2041 agggcgagtc cttgtagccc acgatggcca cgtcctgaca ggtcccgtcc gccataaggt 2101 tgtcgatgtc ggtgtccacc cggccatggt catgatgttg tcaatgtagg cgtccacccg 2161 gccatggcgc tgctgctggg gcagcgggct gggctcgggc tgcctgggct ggcgggcggt 2221 gggaagcgga gagctcgggg cacgcgctgc cgtctggacc gtggctctgc tcgctgtgca 2281 gcagccttgt accactacct gtttgttcct tctgagggtc atgagggaag gatctcttcc 2341 cgttctgtct ctttggctat ctttatgttt acatagcatg ctccctgtat ctgcgcctgt 2401 gttcacattt ctccttttaa aaagactcca gttgccgagc tcagtggctc acgcctgtaa 2461 tcccagcact ttgggaggcc gagatgggcg gattatgagg tcaggagatc gagcccatcc 2521 tggctaacgc ggtgaaatcc cgtctctact aaaaatacaa aaaattagcc gagcgtggtg 2581 gcgggcgcct gtagtcccag ctactcggga ggctgaggca ggggaatggc gtgaacccgg 2641 gaggcggagc ttgcagtgag ccgagatggc accactgcac tccaggctgg gggacagagc 2701 gagactccat ctcaaaaaaa aaaaaaaaaa aaaaagactc cggtcatact ggattaaagt 2761 cccactctac tccaatatgg tttcatgtta attaattata tccgtaacca ccctatttcc 2821 aaataaggtc acattctgag gtactggggt taggacttca acatattaat tgggggacag 2881 ggggcaattc aacccataac aaatagtaaa gctgctgtaa gcgtttgtgt ataagtcttt 2941 gtatagtcat atattttcat ttctcttggg taaatacata ggaatagaat tgctggatca 3001 tatggtaatt gtatgtttaa ctttataaga aactgccaaa ctgttttcca ggatggctgt 3061 actatttcat attcctacca gcagtgtata aaagttcctg ttgctctaca tctttgtcaa 3121 catttggtat tgtcaggttt taaaaattgt agtcattctg acaagtgggt ggtagtatct 3181 ccctgaggct ttaatttcac atccctgaag actaatgatg ttatttatca tctgtatatt 3241 ttcactgaca aagtgaaact ttcaacaaag tgatactgta cacatctttt gcccatttta 3301 aaaattagat tactttcttt ttactgagtc atgagactca ctacatatat atgtcttata 3361 tatattgtat attctggata cgagtccttt ggcaaacatg tgatttgtaa atattttcac 3421 tcagtctgta gcttgtcttt tcatttcctt ttttttttcc tttctaagaa aaatagagaa 3481 tcttttcatt ttcttaagaa tgtcttgaaa agcagaaaat tttcattttg ataaagtcaa 3541 ttttatcttc ttttaaaatt tttgttcatg ctttttgtgt cttatctaag aaatttgcct 3601 aagtcaacat cacagagatt tttctcttgt cttcttctaa acattttatg attttacgtt 3661 tcacatttag acctatggtt cattttgtgt tgttttatat gtggtatgga gtattttttc 3721 ttaaaatttc aacaatataa catttttggt tctggaaaga tgacatgatg tgatttatca 3781 tctgtatctt tcctttcata aagtgtctgt acacatcttt tgtatgtgta gcaccagata 3841 tactagataa aataaaacaa aaactacttt taatgcaaag cttggtttat aaaaaaaaac 3901 aaggaaattt catgtgcaaa aaatgagaag ggatctgaaa gcagtattga gtatatggac 3961 caatacaggt agctgcctgt attttttcgt gcgggcaagg atcggggtat aaagtttcta 4021 ctgacctaag gacttggagt aaacaacatt gtgagcacag gagaggagtc ttgggttcac 4081 ttaatggttg ctgattgtgg ctgagactca atataaagta ggaacccctg agggggtacg 4141 ctttcagtga aagaatggca gaaggaacac aaccatttac caacttaaga aaatgaaaag 4201 gaagcttgag tctctcctga gaggtctctt gaaaaactga aactgccttg ctccttgcac 4261 agatttgagg attgaattga taacacctga ggggtccaga aacctccatt gtttaaaaga 4321 aataatcatt taaaaattga tacctggccc aggcatggtc actcatacct gtaatcccag 4381 ccctttggga ggccaaggtg ggtggatcac ccgatgtcag gagttcgaga ccagcctggc 4441 caacatggtg aaaccccatc tctactaaaa atacaaaatt agcagggtgt ggtggcagat 4501 gcctgtggtc ccagttactc aggaggctga gacaggagaa tcacttgaac cccggaggcg 4561 gaaactgcag tgagctgaga tttgcaccac tgcactccag cctgggccag acagactgag 4621 actctatctc aaaaaaaaaa ataaaaaaat aaaaaaaatt gatacctgat aatccccttt 4681 gaatgcttgg cagaatttat gtaaaacagg ggagcaattt ctattgagat tcccacagga 4741 aaaaataccc ctattaaaga aaaattactg ataaaaatgt ataaaacaag gcagaatcca 4801 ttaggcacac catagatgtc ttcttagctt gtgtgtttca cacagcaagg gcacatgcag 4861 gatgatgata acagagccct tgcagtgtat atatgttgtg gtggccccag aaaccattta 4921 ccaccagcaa gaaccagcag gcattgcaac agggggatta gaacccaaga acttcataat 4981 attgctgcaa caataatatc aaagagaact caaaacagac atggaaaaat aattgaagac 5041 ttgacataaa tacttaaaat cctaagaaaa taatatgaca ttatgaaaaa aaaagaacag 5101 gcagatttga aatttttgaa aaagataaaa tatgttgtca agattgcacc acagagaact 5161 aaagagagaa aatatgacag aaaagaaggt gcttggggga tgattgacat gctcttttaa 5221 gatgtggctc ttcagcagtt tttttgatag aaagaggcta ttacagggca agaagttctt 5281 tagctttctc cattttctga gctgataaag aaaattacac aaagaaaaat cttaaggaat 5341 ctgaacttta gatattatat atcaagataa aacctcaaga aagggtattt tattagagaa 5401 ttagaattaa tagcaagatg acaaactgca gaatataatt gttatgaagg cagatatggt 5461 tggaacagac agaactgttt aggcatctga taaactccaa gtggaagaag tttttaaaca 5521 tctttatgga aaaattggag aaataagaat taaatggtga tacttaatgc ttgagtttca 5581 tggaaaaaga gagcctagat tgtatctggg gtgaatgtat ttctttaagt ctttctaaaa 5641 gtatttattt atattaagca aagtcgttgt ggggaaagtt acattcatat ttaaacttca 5701 ttgttagatc atacttcgga ttatttcaat cacatctgga gtaagtttac aactgggggc 5761 tgtggccaaa tgcagggatc caatatttac ttggtaatat catctgcaat caaaaagcag 5821 aatgccaaga aggaaatcat cagtctcttc agagctgact tatacttcca attaatacag 5881 tacagcaagt atctcatcaa aatttccttt caaatataat atgaactact ggtgaatgga 5941 caaaagcagg aagcactgaa gactagagag aggagaatgc cttcagcaag agacaatgtg 6001 tttacaaaaa caggcctgga ctgctttaga aagtcatcac caatggcctt cagcctttct 6061 tataataaaa taaggaagga agaaatcact tcaaatttta aaagtagatt agttagaatt 6121 ttagctgcat acttctggga aagtgatttt tctctccttc taccctcttc cattaagtag 6181 ggcatactgt tcagaattca aaacctgctg gccaagtttc tagggtaaac aagactttgt 6241 ggctctagaa tgagccctgt tttgaaagct gaatgatttt ttgggagaag agagaggaag 6301 aggagttctt aaataaaagg aatatggact ttaaaaataa acagtgaagt ttgaaatgcc 6361 tgaagtttgt tcctggcacc agtatgtctc caggcaactt taggcacata taatactgag 6421 ccaacttggc tgagcctaga actggtggtg gacctgaaaa cagctcagtg tgaaagccat 6481 caagaactgg gatttgggct gtgtttacag agattggtct tagaagacaa tgtaagcttt 6541 tggggaaagc agcataacat aaacttttgg ggacaacaag ctttttagga acgctaggaa 6601 acaatggaca ctacttgaag gaaagacata gtatggggaa aatgctccat tgaataaatt 6661 agatagaaga atataaggtt aagtatcaga aatgaaacaa aactaagtaa atggttggct 6721 tttcattgtt gttggctgat ctttgtctga aatagaggat tttaaacttt tgttttaaat 6781 ctaccttgac aatgttgaag aaacctcttg actcaagatg ggaaataaaa gtttcacctt 6841 acaggataat aaacaggttt agtttaagga tcgccagaaa gattgaaagt taaagggaaa 6901 aagttggaaa acaatgatat tttcccagtt tccttataaa agaagccatt gttcactgag 6961 ctagtaacat ttttttcatt aagaaaatta tatttttata aggctaacag ccctactttt 7021 ttgctcttcc ctttgtggag ataaaaagtg ctttgacttg tagttgtttt atgttttttg 7081 tttgtttgtt tgttttttga gacagagttt cattcttttg cccaggctgg attgaagtgg 7141 cgccatatca gctcactgca acctctgtcc cctgggttca agcgattctc ctgccttagc 7201 ctcctgagta gctgggatta gaggtgtgag ctaccatgcc tagccagtag gcatggtggc 7261 agtgtctata atcccagcac tttgggaggc caaggtgggc agatcacctg aggtcaggag 7321 ttcaagacca gcctggccaa catggtgaaa cactgtctct actaaaaata taaagattag 7381 ccaggtgtag tggtgcatgc ctgtgatccc agttactcag gaggctgaga cagcagaatt 7441 gtttgaactt gggaggcgga ggctacagtg agttgagatt atgccacttc accccagcct 7501 gggcaacaga gtgagacttc taatcaaaaa aaaaaaaaaa aagaaagaaa aaaaagaaat 7561 atagttctca tgactgcagc aaggtgtgaa aattgtatta aaaactgtat gatttgatgt 7621 aaaagagtta atggcaatct aaatttttct tagtggtgca tttttttcta atccctataa 7681 aaattagctt atctcagggt ttatcaccct cagcactctt aacattttgg gccagatatt 7741 tctttgaagt cgggggctgt tctgtgcatt gaaggatgtt tagggcctcc cctaactaga 7801 gcatcccctc tagctatgag aaccaaaaat gtttccagat aatgacagat gttctctaaa 7861 gggcaaaatc atctccggtt gtgaaccact ggtttatttt aatagtgtta agatacgcta 7921 cttaggattc actttctaat gacagtagca ataaagtaag cttgtttagt taatatcctc 7981 aacaatagtg tgcttgtgac aaagtaattg gagttgagat aatgcatata aagaacctag 8041 actgtgcctg gcaccactcc agtacttaaa aatattagtg atgataatga tggctctgaa 8101 ggtgatggaa agggggaagg aaggagagga gggacagaga agaggaggag aaaggaaatg 8161 agtgagagga tgagcacagg catatattat actctctttg cttggaaatg tgtagtcagc 8221 taattctgat tctcacttgg caccagatca gagctggtga gcagagagga ggcagaggag 8281 aggggaacag tcaggtgggg ggtaaaagaa ttggatgggt ttggccgggc gcagtggctc 8341 actcctgtaa ttccagtact ttgggaggct gaggcaggtg gatcacctga ggtcaggagt 8401 tcaagaccag tctggccaac atggtgaaac tccgtctcta ctaaaaatac aaaaattagc 8461 caggcatggt ggcacacgcc tgtaatccca gctacttggg aggctgaggc aggagaattg 8521 cttgaacctg ggaggcggag gttgcagtga gccaagatca caccactgca gtccagcctg 8581 ggtgacagag caagattctg tctcagaaaa aaaaaaaaaa aaagaattgg atgggttttt 8641 cattcaaaat attgtgtttt cttgttacaa ttatctctag gacttaactt ccttacacgc 8701 aaggcatttt ctggtacaaa ttaatatcga cttggcttta gaaaaataca cacccattgg 8761 ccggacgcag tggctcatac ctgtaatccc agcactttgg gaggccaaga caagtggatc 8821 acgaggtcag gagatcgaga ccattctggc taatatggtg aaaccccgtc tctactaaaa 8881 atacaaaaaa ttagccaggt gtggtggcgg gcgcctgtag tcccagctac tcaggaggct 8941 gaagcaggag aatggcgtga acccgggagg cggagcttgc agtgagccga gatcgcacca 9001 ctgcactcca gcctgggcga cagagcaaga ctctgtctca aaaaaaagaa aagaaaaata 9061 cacacccatc aaacaagtag gatttatttc cagggcagat tagagaaagt agatgttaca 9121 ttacttgatt cattaaggca ttaaacttcc tcatattaca tttatattta cttaagttca 9181 cactcatttt gtgtatagat cccataaatg tttaatgcat gtgacgcact aataaatgct 9241 aattgcatgc actggtatta actttactgc agtgaatact agcaagggtc ttgggaggta 9301 aatgttctac acatttaagg ggaggatacg agactcagaa agataagcag ttgcccagga 9361 taactctcaa gttattctga cacgtgagca caggtctttc ttccctcagg ctgtggccct 9421 ttctgccacc ccacggctgc tcttcataaa ctcatatgcc tacagctgaa cgtgggcaaa 9481 ccatgatgct cctgagaaat ggcagatggc ttattttctt ctccgcatac cttttttctc 9541 tttcttgtct cttttcctcc atctccctct tccattcctt aggcctgtca gtaacaccct 9601 gctgctcgct tatgctgctg aagggagcca gagttaaaga cgttgttgtg gatggcattc 9661 caccaccatt caggtagttt gtttgtttgt ttgtttgttt gtttttgaga cagagttttg 9721 ctcttgtcac ccaggctgga gtgcaatggc acaatctcgg cttactgcca cctccgcctc 9781 ctgggttcaa gtgattctcc tgcctcagcc tcctgaataa ctgggaatac aggcacccac 9841 aaccacacct ggctaaattt tgtattttta aagtagagac agggtttcac catgttggtc 9901 aggctggtct cgaattcctg acctctggtg atccacccgc ctcgggctcc caaagtgctg 9961 agattacagg tgtgagccat catgcccagc caaatagttc taatagagat ttcttattgg 10021 ttttgtacca ttgcagagag caagaagaac cctcaggggt gtcaatagaa gcctactgta 10081 gtgcccgcct gatatcagcc agtgggtata tgtgagtaca gctatactag ttttgtttgt 10141 ttgtttgttt agatggaggt ctcactctgt tgcccaagct agagtgcagt ggtgtgatca 10201 tgactcactg cagccttgac ctcctgtgct caagtgatcc tcctgcctca gcctcctaag 10261 tagctgggac tacaggcagg ctttagtgtc tggctagtta ttttttgtac agacaggtgt 10321 cactaagtta cccaggctag tcttgaactc ctgacttcaa gtgatcctcc tgcctcagct 10381 tctcagaatt ctgggattat aggagtgagc caccatgcct ggtcagctat actagtttta 10441 aattattgaa ctacttcttt ctctttggta gacagttgct ttcctgagcc cctgagtgct 10501 cctaccctga gccctgggcc cagtcctccc aactccttgt cgtacagtac caattaaaaa 10561 atattttgag tatcaccctg ctttgggcta gtgcttttaa tgcttttagc taataactaa 10621 gttctataaa aaacatattt cattgagctt gcaggtcaca ctaaggaggg tagaatatta 10681 agaggacaaa tttgacaatc ataaggagta tatggttagt aaaaatggga agaaatttta 10741 gagaagtggg atgaaaatat agcagcaatc tctctgcacc ttgacgttga tgataatcat 10801 caatattcct catctttgtt ctaactagaa aggtgagaga aatgagaaag aggcctgaaa 10861 gggaaggaga gagggcctgg catggtatag atgtaccaga aatatttgta actgaataaa 10921 tagttgaata catgactgga tgacagtaga agtgccacca cagaatattt gtttctgagc 10981 actgaataag atagtcagtc aatgcaacaa gtgtaaagca cctgctctgt gttaggtttg 11041 ggctaggata caaagtggaa tcaaacatgg tcttaccaaa gactattaat taatttagaa 11101 gaaaaatata aggttgagaa atgagaaatg tatgcagcaa ggaattgatg ggccagagga 11161 agttctggta ttaaaaagaa aaagaagtta acatgatact ggtattagta gaagtaaaac 11221 caccaagctt tatgagatcc ttctcccact ctacccagcc ttactaaact gacactcagt 11281 atgggcccca ccttttaaga caatgttgag aatgcgtaaa tgctccaaga gagtagccat 11341 aagcagggct gacttcatgg gcatacaaca agcgcagtcc cagcaggcca ctctcagaag 11401 ggcactgcac ttggtggtta aaagctgtgt catcttgaaa ttcttaatat tttatctttg 11461 aatttgtgtt ctgtgagtga agtccaatgg gacagtggag catgcgcgta agcagaggag 11521 atacatgcgt gctggtttcc ctcatgcctg cagatccatt tgcaaaacat tcaggattgc 11581 taccctgctt ccatgacaca caagaagtgt gtctacttgc ttgtacttgc tttgagtctt 11641 gcccacctcc tatgtatgtg ggtccaccag aattctgtgc tcatgtctca cacctcatat 11701 cagcagatgc tcatactgtc atgtgctcca catgtatcaa catgtgccca gaccaacacg 11761 tctacaatag cagaggtgct gatagcccca agaagccatc ctttccatta gaactggaac 11821 ttgtttccaa tgcagaaaga aggtaatggt attcagagaa ataagaatga cctaggaacc 11881 ctatcatata cagttttttt tttttttttg agacagagcg ttgctctgtt acccaggctt 11941 gagtgcagtg gcgtgatctc ggatcactgc aacctctgcc tttcaggttt aagggattct 12001 cttgcctcag cctcctgagt agctaagatt acaggcatgt gccaccatat tcagctaatt 12061 tttgtatttt ttagtagaga cggggtttag ccatgttggc caggctggtc tcaaactcct 12121 ggcctcaggt gatccaccca cctcgccctc ccaaagtgct ggggttacag gcatgagcca 12181 ctaagcccgg cctctatcat atacattttt acttgcatta tttcctgtat gacccaatca 12241 cttatgttaa aaatgttgac atagaaggaa aggggaaata ggtccatcca tagttgccat 12301 attataagga ataagttctt tccttctttc ttccttcctt ccttccttcc ttccttcctt 12361 ccttccttcc ttccttcctt tctttcttta agacacagtc tctctctgtt gaccaggctg 12421 gagtacagtg gtatgatctt ggctcgctgc aacctctgcc tcccaggttc aagcaattct 12481 cctgcctcag cctgacaagt agctgggact acaggcatgt gccaccaggc ccagctaatt 12541 tttgtatttt tggtagagac agggttttgc catgttggcc aggctggtct caagttcctg 12601 acctcaggtg atctgcctgc cttgaccttt caaagtgctg ggattacagg tgggagacac 12661 tgcacccagc caggaataag ttctttctta ttcatcagtg atccaaaggt agagtgttgg 12721 tagaatgtga acatatcaag aaataaaata aaaagtttta tgcagtatta tcattgttct 12781 gataagaatg aaacaattta tatttatgag ctctaaaatg caaattgtga aattttagtg 12841 attccacata taagataaaa gctctcgtaa tttctattta aaactggcat tgcataaaat 12901 aagatgaatg gtaaaattca ttctaataat taaatttaaa ctttaaaatt tagaatactt 12961 ctattacact taaaaaacat tatttttaaa tgtggtaaaa tacatataac ataaaattta 13021 ctatcttaac catttttaag tgtacagttt ggtgacatta agtacattca cattgttgtg 13081 caatgatcac caccatctaa ctccagaatt tttttatctt gcaaaactga aattctgtac 13141 ctattaaata ttaactcctc attctcccct ctctccagcc cctgccaacc accattctat 13201 tttgactact gaatttgact actctagata cctcatagaa gtggaatcgt aaaatatttg 13261 tccttttgtg actggcgtat tctactttat ataatatcct caaaattcat ccatctttta 13321 gaattgtcag aattcccttc ctttttaaag ctgcgtaata ttccattgta tgtacattcc 13381 acattttgct atccattcat ctgtcagtgg acactgggtt gcttctactc cttggttatt 13441 gtgaatggtg ctgctatgaa catgagtata gaagtatctg ttcgagcctc tgctttcaat 13501 tattttgggt atatacccag agtggaattg caggatcata tggtacttct atttttagtt 13561 ttttgagcaa ctgtcacact gttttctcta aagtctatac caatttacat tcccaacaac 13621 agtgcacaag ggttccaatt ttcccacatc ctcgccaaca cttatgttct gttttttttt 13681 tatagtatcc attctaataa gtgtgaggtg gttcccttcc cttcccctcc ccctccacct 13741 tccctccccc tcccctcccc ctcccctccc tttcctttct tttcctttcc tttcctttcc 13801 tttcctttcc tttctttttg acagtcttac tctgttgccc aggctggcgt gatctcggct 13861 cactgcaaac tccacttcct gggttcaagc ctcagcctca tgagtagccg attctcctgc 13921 cttggcctca tgagtagctg ggactacagg ggcgtacccc tacacctggc taatttttgt 13981 atttttagta gagatgggtt ttcaccatgt tggcaaggct ggtctccatc tcctgacctc 14041 aagtgatcag cctgcctcag cctctcaaag tgctgggatt acaggcatga gtcaccatgc 14101 ccggcatgag gtagtatttc attgtgattt tgattttgat ttccctaata attattgatg 14161 ttgaccctat tatgctttgt aattaataaa aacattttta aaggaaaatg tcttatatgt 14221 tggtacttta acatcatttt ttccatgctc tttttttctc actgcctctt tgatcaggtc 14281 tttattcaaa agaagctgtc caaaatgatt tgacctttat ggaataatca aatttaagag 14341 tttatgcagc aggcttcttt cctctgtagt aggtttcttt tctgctggct tcttttcagg 14401 gactggtttc ttggtagctg ctgccttttt tcccaccaga ggtttcttct gcttcttaac 14461 accaacagca atcttctttc ctttcttacc taccacaggc ttcttgcctg caaccccctt 14521 ctcatccgat ttggcttcta gtgcttcagc tgctgctgct gccttatcca cccagagctt 14581 gtgattcctg gcctggcgaa gaatggtgtt ctgacacatg catggtcttt gcatgtaggt 14641 ttagcttcaa catgattctc aggtttttca gtgggttctt ctttaggact ctgcgatgaa 14701 tcttgttgcg tggtgctcaa agggctcttt ggatttctgg gcttttcaag attctgctaa 14761 gatctgtatt gatcatctta tggatgggaa gattgtagtt actcttgaga gaagaagctt 14821 tacgccaagt gccattaaat tcatctaact tctggaaagc acttttggtc caaatgcaga 14881 aaagtcccac atgcccacca ggagcaagct tcaaaatgtt cagtttgctt acattaagca 14941 gagtaattcc agggatgttt ctgaaggcct tgatgatacc attatcctca ttatagatga 15001 tgtaggggcc tgtgcactgg atacaatgac ggtttctcat tttgcccttg ccagctctca 15061 ttcgctgaga ggcatagtcc tttttgatat cattccaggc tttaaatttc ttaagaagcg 15121 aaacagcctc cttggtcttc ttgtaccctt caactttatc ttcgactacc aaaggaagtt 15181 caggaacttc ctcaatacta tgacctttag acatgaccag tgctgataag gctgaggcag 15241 ccagggcaga acagacggca tatcattttt gggttgtgtt cactctatga tcctaacggc 15301 atcaggtttt ggttggtgcc aacattcggc ctccacaaca catgtttcca gaagcaccct 15361 ggccagaatg gtgagtccca taacaccttg aactctggga atttgagcca cagctctgcc 15421 ggtaccccaa gattcagcac tggtctgacg acctgctaat tcactgacag catagggctg 15481 tctgttgttt ttgtgcaagt tggtgtgaac aaagttcaca atatctggtc gaataggagc 15541 cttgaataca gcaggcaaag tgacattttt gccagatgac tccccacttt cagagtacac 15601 cgatattagt aggcgagcac atgccatggt ggagaggaga gagccacgct cctctcacca 15661 tggctgttgc cacaggcaaa gttctctgtg cttttaaaaa tgtgccccac attttaattt 15721 tgcgttgggt cctggaaatt atgtagccag ccttacatat gggaaagaag gttaatgaaa 15781 aaaatggact gggtagtaac ttaataatct gagtgtatga tacttgttta tcacctgttt 15841 cccatctcta ttgactcagt gaaagtctga gagatttatt tagaaaggaa aactgcctta 15901 gtgtctgcag cttcccgtaa agaatctgat cacttgttcc actttggatt acagcctcgc 15961 ctctgtcaca catcacctgt gtaagtgccg ggcaaatcat taaacttctc tcttgagcct 16021 tgattccaga ggggttatgg gtctggtggc caggtaggta gtagagggca gatcctgaga 16081 tgggccagca tcctcctgga agttacttgc cttgttttga ggctactgaa tagctttcaa 16141 gcaagcaaga gtgaaggggg aaatggggtt ttatgtttca acaaggggga ctgggccact 16201 tctgcaggtc cagggagttc tggggaatct gagcattcaa gatcctcaag taggctgagt 16261 gcagtgactc atgtctgtaa tctcagcact ttgggaagct gaggcagaag gattgtttga 16321 gcccaggagt ttgagatcag cctaagtaac atagcaagac ctcatctcaa taaaaaacaa 16381 aacaattagc tgggcatggt ggcatgtgcc tgtagacctg actactcagg aggctgaggc 16441 aggaggattg cttgaacaca ggatgttgag gctgcagtga gctgtgatgg tgccactgca 16501 ttccggccta ggcaacagag tgaaatcctg tctcaaaaaa aaaaaaaaaa aaaaggaaaa 16561 gaaagaccct taggggcgac tacccagata tcccatccca agtctctagg ccctgttctt 16621 tccctaacaa gatcctgact ttggcatagg aaacttgctg aggccgaaaa ttcatccttc 16681 tttgaaattc tgccacctta aaactaattc tgggcctggt cttcaactct ctctgctctt 16741 cagcttcagg aaataaaggt ctctttaact actctccatt agagaattag agatctctga 16801 ttctcatatc ttgctttaaa ttcacaattg attgccctgc atctgtcaca tctgtcgttt 16861 tctttttttc caagctaaga atggagttta gcagcagccg ctccatgtca aaatccttaa 16921 aatgaccgta tctcctgttc ctctcaaatg ccagagatgg catgactttc acattcccct 16981 ctacttgtat ccctctcact tcaccagagg tgaacacttt agcaatgaat ggcactgcta 17041 ctgcatgtca gagactactg atatcagtac ttaccccacc taccaccagt gatgtgatcc 17101 ttattgctct ctcttcagtg ggtgatctaa ttccagaatc ccatccttag aacctgtttt 17161 ctgggaccaa tactggtact aaccctcttc agttgggctc tttagaaaca gttggctaca 17221 gggattcagg tacatgtgat ttattgaagg agtgttcaac tatctaagag actaagagat 17281 gcaagacaat gaagggaaag ggccaagcaa aaatggtctc aggtaaagtc tagccttggc 17341 cagattcaag ggtgtgtgtg gcatctgaag catatatcac aacatggagt tgtcccacct 17401 tcagataagg ggttcagcct ttttgacccc aatatcagtc taccattggc tccaaggtac 17461 ctccaggaat gggacataat tcagaggtga ggtggctccc attggctgac agcagttctc 17521 cagagaaggg ggcaaatgtg agccattggc agccaacgtt aacaacatcc acaataatag 17581 ctggggaatg gaatctgggc atcagatgca ctttctaccc tcctttcagg atggtgaccg 17641 gtgtggctta caccgctagg attccttgcc ctttgttttt caatggggtt caaaaatagg 17701 ggagtctggc tgggtgtggt ggctcatgcc tgttatccca gcactttggg agggcgaggt 17761 gggatcactt gaggtcaaga gtttgagact gggctggcca gccaatatgt tgaaactcca 17821 tctctactaa aaatacaaaa attagcctgg tgtggtggca cgtgcctgta attccagcta 17881 ctcaggaggc tgaagtggga gaatcgcttg aacctgggag gcggagtttg cagtgagcca 17941 agattgcacc actgcactcc agcctgagcg acagagtgag actttgtctc aaaaaaaaaa 18001 aaaaaaaaaa aaaagcggag tcctagcatc aggtcacagg gaataaaaca acaactaggc 18061 cagggtattt attcccctgg catcctccct gtaagatcac cttaggctgg ctgcatcccc 18121 tggccaaaag tcaaggtagc cctctgcatg tgactttgtc cttccaggtt ttagtaacct 18181 ctgccttccc tcaacccttc aaggtgtact atcctttgta gtttctttaa gccctactac 18241 ctttataaat ttacctttat taaattctcc ttgaattatt ttgagtgtgc tgtgatgtaa 18301 aacagaatgc taccctggca gcaactgatg agatcaagga aggtaagcag ttcagttttc 18361 actgtaagtg caatgagact tcatcaaagg gtttcaagta gaagaattat ctgtttttga 18421 actctcaatt ttgatttctc ccttagctgc cagtataatg taaattttcc agtaattttt 18481 atcaagaaaa gagacaagga cgttatgttt tctgaactct catattttta agaatattgt 18541 gtttgtggtt ttcatttttt ttctttttaa agacagggtc ttgctctgtc acccaggctg 18601 gaatgcagtg gtgccatcat ggcttactgc aaccttgacc tccagagctc aaatgattct 18661 cccacctcag cctcctgagt agctgggact ccaggcatgc gccaccatgc ttggctactt 18721 tttttagttt ttgtagagac aggttctcac tgtgttgccc aggctggtct caaactcctg 18781 ggctcaagtg atcctcctgc ttcagcctcc taaagtattg gaattacggg tgagagccac 18841 tgagcccagc ctggtttgca ttctttttaa aaaattttcc aatggtccat agaatcctca 18901 atggttttca ttcttctttt tttttttttt tttttttgag atggagtctg gctctgtcgc 18961 ccaggctgga gtgcagtggc acgatatcgg ctcactgcaa gctccacctc ccgggttcac 19021 gccattctcc ttcctcagcc tcctgagtag ctgggactac aggtgccctc caccacacct 19081 ggctaatttt tttttttttt tgtattttta gtagagatgg ggtttcaccg tgttagccag 19141 gatggtctcg atctcctgac ctcgtgatcc gtctgctttg gcctcccaaa gtgctgggat 19201 tacaggcttg agtcaccgtg cctggcctgg ttttcattct taagtgacaa ctggtttggt 19261 ataaaaattc ttgagtcaca gctgtatcct ctcaaaaatc tacagatgct gttttagtat 19321 tctataattc agtggcagaa aagatggagg caagcttaat ttttgcttcc ttgtaggtaa 19381 cacatttttc aggtctcaat gcttatagaa tttacccctt taatcttact attcatatac 19441 tctactacaa ggtaacttaa tgtgtctttt gataattttt ttttctcctg aaggacttta 19501 ttaaaactta agttttcatt aaagaataca tcaaagaata atgtttctga tgattgctct 19561 gtttcactct ttttggattt cttccacagg aacactatgt aattcttagt aggccttttt 19621 ttttggtctc ctagatctaa catcacactg tccatttaca gttttgatgt ttttcctctg 19681 tgttcagtgg cagcttcttg aacctctctt ctgcatcact gattttcttc aatcaattct 19741 actcctcact gcttcagtga agggcttaat gtttgtgttg caatttgttt cctggcaaat 19801 tattttaact tgctcccctt ttggctgacc taactgtata cataacacat ttaattgtac 19861 ttcttttctt gccttttctc tctttatgac tttctgaatt ccaaaagcat cggtttcatc 19921 ctgttgcaga tgctgaggtg ctcaaaatca atttagttct tgtggtacat catttttttt 19981 ttttgtacac acgcatctct ttcttttttt tttttccttt tttttcttta ttattattac 20041 actttaagtt ttagggtaca tgtgcacaat gtgcaggtta gctacatatg tatacatgtg 20101 ccatgctggt gcgctgcacc cactaactcg tcatctagca ttagatatat ctcccaatgc 20161 tatccctccc ctctccccca accccacaac agtcccttga gtgtgatgtt ccctttcctg 20221 tgtccatgtg ttctcactgt tcaattccca cctatgagtg agaatatgcg gtgtttggtt 20281 ttttgttctt gcgatagttt actgagaatg atgatttcca atttcatcca tgtccctaca 20341 aaggacatga actcatcatt ttttatggct gcatagtatt ccatggtgta tatgtgccac 20401 attttcttaa tccaatctat cattgttgga catttgggtt ggttccaagt ctttgctatt 20461 gtgaataatg ctgcaataaa catacatgtc catgtgtctt tatagcagca tgatttatag 20521 tcctttgggt atatacccag taatgggatg gctgggtcaa atagtatttc tagttctaga 20581 tccctgagga attgccacac tgacttccac aatggttgaa ctagtttaca gtcccaccaa 20641 cagtgtaaaa gtgttccaat ttctccacat cctctccagc acctgttgtt tcctgacttt 20701 ttaatgattg ccattctaaa tggtgtgaga tggtatctca ttgtggtttt gatttgcatt 20761 tctctgatgg ccagtgatgg tgaggatttt ttcatgtgtt ttttggctgc ataaatgtct 20821 tcttttgaga agtgtctgtt catgtccttt gcccactttt tgatggggtt gtttgttttt 20881 ttcttgtaaa tttgtttgag ttcattgtag attctggata ttagcccttt gtcagatgag 20941 taggttgcga aaattttctc ccattttgta ggttgcctgt tcactctgat ggtagtttct 21001 tttgctgtgc agaagctctt tagtttaatt aggtcccatc tgtcagtttt ggctttagtt 21061 gtcattgctt ttggtgtttt agacatgaag tccttgtcca tgcctatgtc ctgaatggta 21121 atgcctaggt tttcttctag ggtttttatg gttttaggtc taacatttaa gtctttaatc 21181 catcttaaat tgatttttgt ataaggtgta aggaagggat catttcagct ttctacatat 21241 ggctagccag ttttcccagc accatttatt aaatagggaa tcctttcccc attgcttgtt 21301 tttgtcaggt ttgtcaaaga tcagatagtt gtagatatgc ggcgttattt ctgagggctc 21361 tgttctgttc cattgatcta tatgtctgtt ttggtaccag taccatgctg ttttggttac 21421 tgtagccttg tagtatagtt tgaagtcagg tagtgtgatg cctccagctt tgttcttttg 21481 gcttaggatt gacttggcga tgcgggctct tttttggttc catatgaact ttaaagtagt 21541 tttttccaat tctgtgaaga aagtctttgg tagcttgatg gggatggcat taaatctgta 21601 aattaccttg ggcagtatgg ccattttcac gatattgatt cttcctaccc atgagcatgg 21661 aatgttcttc catttgtttg tatcctcttt tatttccttg agcagtggtt tgtagttctc 21721 cttgaagagg tccttcacat cccttgtaag ttggattcct aggtatttta ttctctttga 21781 agcaattgtg aatgggagtt cactcatgat ttggctctct gtttgtctgt tgttggtgta 21841 taggaatgct tgtgattttt gtacattgat tttgtatcct gagactttgc tgaagttgct 21901 tatcagctta aggagatttt gggctgagac aatggggttt tctagatata caatcatgtc 21961 atctgcaaac agggacaatt tgacttcctc ttttcctaat tgaataccct ttatttcctt 22021 ctcctgccta attgccctgg ccagaacttc caacactatg ttgaatagga gtggtgagag 22081 agggcatccc tgtcttgtgc cagttttcaa agggaatgct tccagttttt gcccattcag 22141 tatgatattg gctgtgggtt tgccatagat agctcttatt attttgagat acgtcccgtc 22201 aatacctaat ttattgagag tttttagctt gcagggttgt tgaattttgt caaaggcctt 22261 ttctgcatct cttgagataa tcatgtggtt tttgtctttg gttctgttta tatgctggat 22321 tatatttatt gatttgcata tattgaacca gccttgcatc ccagggatga agcccacttg 22381 atcatggtgg ataagctttt tgatgtgctg ctggattcag tttgccagta ttttattgag 22441 gatttttgca tcaatgttca tcaaggatat tggtctaaag ttctcttttt tggttgtgtc 22501 tttgcccggc tttggtatca ggatgatgct agcctcataa aatgagttag ggaggattcc 22561 ctgtttttct attgattgga atagtttcag aaggaatggt accagttcct ccttgtacct 22621 ctggtagaat tcggctgtga atccatctgg tcctggactc tttttggttg gtaagctatt 22681 gattattgcc acaatttcag atcctgttat tggtctattc agagattcaa cttcttcctg 22741 gtttagtctt gggagagtgt atgtgtcgag gaatttatcc atttcttcta gattttctag 22801 tttatttgcg tagaggtgtt tgtagtattc tgtgatggta gtttgtattt ctgtgggatc 22861 ggtggtgata tcccctttat cattttttat tgcatctatt tcattcttct gtcttttttt 22921 ctttgttagt cttgctagcg gtctatcaat tttgttgatc ctttcaaaaa accagctcct 22981 ggattcatta attttttgaa gggttttttg tgtctctatt tccttcagtt ctactctgat 23041 tttagttatt tcttgccttc tgctagcttt tgaatgtgtt tgctcttgct tttctagttc 23101 ttttaattgt gatgttaggg tgtcaatttt ggatctttcc tgctttctct tgtgggcatt 23161 tagcgctata aatttccctc tacacactgc tttgaatgag tcccagagat tctggtatgt 23221 tgtgtttttg tcctcattgg tttcaaagaa catctttatt tctgccttca tttcgttatg 23281 tacccagtag tcattcagga gcagattgtt cagtttccat gtagttgagt ggttctgagt 23341 gagattctta atcctgagtt ctagtttgat tgcactgtgg tctgagacat agtttgttat 23401 aatttctgtt cttttacatt tgctgaggag agctttactt ccaagtatgt ggtcaatttt 23461 ggaataggtg tggtgtggtg ctgaaaaaaa tgtatattct gttgatttgg ggtggacagt 23521 tctgtagatg tctattaggt ctgcttggtg cggagcagag ttcaattcct gggtatcctt 23581 gttgactttc tgtcttgttg atatgtctaa tgttgacagt ggggtgttaa agtctcctat 23641 tattaatgtg tgggagtcta agtctctttg taggtcactc aggacttgct ttatgaaact 23701 gggtgctcct gtattgggtg catatatatt taggatagtt agctcttcct gttgaattga 23761 tccctttacc attatgtaat ggccttcttt gtctcttttg atctttgttg gtttaaagtc 23821 tgttttatca gagactagga ttgcaacccc tgccgttttt tgttttccat ttgcttggta 23881 gatcttcctc catccgttta ttttaagcct atgtgtgtct ctgcatgtga gatgggtttc 23941 ccgaatacag cacactgatg ggtcttgact cttcatccaa tttgccagtc tgtgtctttt 24001 aattggagca tttagtccat ttacatttaa agttaatatt gttatgtgtg aatttgatcc 24061 tgtcattatg atgttagctg gttattttgc ttgttagttg atgcagtttc ttcctagtct 24121 tgatggtctt tacattttgg catgattttg cagtggctgg taccggttgt tcctttccat 24181 gtttagcgct tccttcagga gctcttttag ggcaggcctg gttgtgacaa aatctctcag 24241 catttgcttg tctgtaaagt attttatttc tccttcactt atgaagctta gtttggctgg 24301 atatgaaatt ctgggttgaa aattcttttc tttctttttt ttttttaatt gatcattctt 24361 gggtgtttct cgcagagggg gatttggcag ggtcatagga caatagtgga aggaaggtca 24421 gcagataaac aagtgaacaa aggtctctgg ttttcctagg cagaggaccc tggggccttc 24481 cgcagtgttt gtgtccctgg gtacttgaga ttagggagtg gtgatgactc ttaacgagca 24541 tgctgccttc aagcatctgt ttaacaaagc acatcttgca ctgcccttaa tccatttaac 24601 cctgagtgga cacagcacat gtttcagaga gcaccgggtt gggggtaagg tcatagatca 24661 acagcatccc aaggcagaag aatttttctt agtacagaac aaaatggagt ctcctatgtc 24721 tacttctttc tacacagaca cagcaacaat ctgatttctc tatcttttcc ccccatttcc 24781 cccttttcta ttcaacaaaa ccgccatcgt catcatggcc cattctcaat gagctgttgg 24841 gtacacctcc cagatggggt ggtggccggg cagaggggct cctcacttcc cagaaggggc 24901 ggccgggcag aggcaccccc cacctcccgg acagggcggc ggcagggcag aggctggccc 24961 ccacctccct cccggactgg gcggctggcc aggcgggggc tgacccccca cctccctccc 25021 ggatggggcg gctggccggg cgggggctga ccccccacct ccctcccgga cggggcggct 25081 ggccgggcgg gggctgaccc cccacctccc tcccggacgg ggcggctgcc aagcagaaac 25141 cctcccccct tctcagacgg ggcagctgcc aagcggaggg gctcctccct tctcagacgg 25201 ggcggccggg caaaaaccct cctcccctcc cagacggggt cgcggccggg cagaggtgct 25261 cctcacatcc cagacggggc agcagggcag aggcgctccc cacatctcag acgatgggcg 25321 gccggacaga gatgctcctc acttcctaga cgggatggcg gccgggaaga ggcgctcctc 25381 acttcccaga ctgggcagcc gggcagaggg gctcctcaca tcccagacga tgggtggcca 25441 ggcagagacg ctcctcactt cccagacggg gtggcggccg ggcagaggct gcaatctcgg 25501 cactttggga ggccaaggca ggtggctggg aggcggaggt tgcagcgagc tgagatcatg 25561 ccactgcact ccagcctggg caccattgag cactgagtga acgagactcc atctgcaatc 25621 ccggcacctc gggaggccga ggctggcaga tcactcctgg ttaggagctg gagaccagcc 25681 cggccaacac agcgaaaccc cgtctccacc aaaaaaatat gaaaaccagt caggcattga 25741 ggcaggagaa tcaggcaggg aggttgcagt gagcagagat ggcggcagta cagtccagct 25801 tcggctcggc atcagaggga gaccgtggaa agagagggag agggagaccg tggggagagg 25861 gagagggaga gggagagcga aaattccttt ctttaagaat gttgaatatt ggcccccact 25921 ctcttctggc ttgtagagtt tctgccgaga gatccgccgt tagtctgatg ggcttccctt 25981 tgtgggtaac ccgacctttc tctctggctg cccttaacat tttttccttc atttcaactt 26041 tggtgaatct gacaattatg tgtcttggag ttgctcctct cgaggagtat ctttgtggca 26101 ttctctgtat ttcctgaatc tgaatgttgg cctgccttgc tagattgggg aagttctcct 26161 ggataatatc ttgcagagtg ttttccaact tggttccatt ctccccgtca ctttcaggta 26221 caccagtcag atgtagattt ggtcttttca catagtccca tatttcttgg aggctttgct 26281 cgtttctttt tattcttttt tctctaaact tcccttctcg cttcatttca ttcacttcat 26341 cttccatcgc tgataccctt tcttccagtt gaccgcatag gctcctgagg cttccgcatt 26401 cttcacgtag ttcttgagcc ttggttttca gctccatcag ctcctttaag cacttctctg 26461 tattggttat tctagttata cattcttcta aatttttttc aaagttttca acttatttgc 26521 ctttggtttg aatttcctcc cgtagctcag agtaatttga tcgtctgaag ccttcttctc 26581 tcagctcgtc aaagtcattc tccatccagc tttgttcctt tgctggtgag gaactgcatt 26641 cctttggagg aggagaggcg ctctgctttt taagagtttc cagtttttct gctctgtttt 26701 ttccccatct ttgtggtttt atctactttt ggtctttgat gatggtgatg tacagttggg 26761 tttttggtgt ggatgtcctt tctgtttatt agttttcctt ctaacagaca ggaccctcag 26821 ctgcaggtct gttggagtac ccggccgtgt gaggtgtcag tctgcccctg ctggggggtg 26881 cctcccagtt aggctgctcg ggggtcaggg gtcagggacc cacttgagga ggcagtctgc 26941 cctttctcag atctccagct gcgtaccggg agaaccactg ctctcttcaa agctgtcaga 27001 cagggacatt taagtctgca gaggttactg ctttttgttt gtctgtgccc tgcccccaga 27061 ggtggagcct acagaggcag gcaggcctcc ttgagctgtg gtgggctcca cccagttcga 27121 gcttcccggc tgctttgttt acctaagcca gcctgggcaa tggcaggcgc ccctccccca 27181 gcctcgctgc cgccttgcag tttgatctca gactgctgtg ctagcaatca gtgagactcc 27241 gtgggcatag gaccctccga gccatgtgcg ggatataatc tcctggtgcc ccatttttta 27301 agcccgtcgg aaaagcacag tattcgggtg ggatgacccg attttccagg tgccatctgt 27361 cacccgtttc tttgactagg aaagggaact ccctgacccc ttgcgcttcc caagtgaggc 27421 aatgcctcgc cctgcttcgg ctcgtgcacg gtgcgcgcac ccactgacct gcacccattg 27481 tctggcactc cctagtgaga tgaacccggt acctcagatg gaaatgcaga aatcacccgt 27541 cttctgcgtc gctcacactg ggagctatag actggagctg ttcctattcg gccatcttgg 27601 ctcctccctc ctggtacatc attttcaaaa ctaccatctt ccttcacatc ttcaacataa 27661 ttaatacatt cagtaattat gtgagcactc actgtgacaa ttgctgttgt agtgagtggt 27721 ttgggatata tcagagaaaa ttctggaatg ttagaatgct ctggaaaaac agagggtaaa 27781 gggattggat ggtgtactcg gggagatttt gagaaagtca ttcaagagaa tgactttcat 27841 acagtggttg aggaggagga agggcttttc tggcaggaga aacagcaagt ccaaggccta 27901 tggcaggagc acacctggtg tgtctgagga acagcaaaga ggctggtgtg gctgcagcac 27961 agcaaggaag gcaaagacaa gggcagaggt cagcaaggtg acttggggat gcatcacgag 28021 accctgtagg ccacggaaag aattttggct ttcactccaa ggaagatggg aagccactga 28081 agagttttca gcaaagcaat ggcatgatct gattacattt tgaaagaatt tttcactatt 28141 ctgttgagaa cagactctgg gaagaggggg aatggtagaa gcctgagacc agttagaaag 28201 ctactgcagt attccaggta agagatgatg gtaggagagg accaggtggt cacgatggag 28261 gcagtgtgaa ctggtcagat tctaaagata tcttaaaggt agcaccagag tatttcctga 28321 tgaattagag gtgagttgag agagaagtta tggagtcagt atgactctac agtttttggc 28381 ctaattggaa ggattcagtt gccatcaact gagatgggac agaatgcatg tggaggaggt 28441 ctgtgaaagc agactggaat gcgtttggga catgctcaac ttgaaataat gaagacgtca 28501 aaaagagtac cagatgactc caaattcaag ggaagtcaag tgggagacaa caacttgaga 28561 tttgtcagca caaagatgtg atttcaaatg acaagattag gtaagatttc taagagaatg 28621 tgtgtaaaca ctggaccaag agctgaatcc tggatactcc aatattaaga gttgaggaga 28681 acaggaggaa caaccataag ggtaagaaga aacaaagccc tggaagcatg tatgaaggag 28741 aagagagtaa tcaactgcat caaatgctgc tgttgcacag agatcaggac tgacaatgga 28801 ccactggatt tagcaataca taggtcactg ggcacctccc aagagtaagg ctggtggagt 28861 atggaggcaa aatcttgact aaagtgttga ggagaataaa aaaaaaaaat ggaaatagca 28921 agtatagtga aatctgtcaa ggactattac tataaagagg agtaaagata cggagtggag 28981 ccaggcgtgg tggcatgcac ttgtaatcct agctacaagt gagactgaga caggagacag 29041 gaggattgct tgagcccagg agttcaaggc tgcagtgagc tatgattgtg ccactgtact 29101 ccagcttggg tgacagagtg agaccctgtc tcaaaaacaa aacaaaaaat cctctgcagc 29161 caggcatggt agtttactcc tgtaatccca gcactttggg aggccaaggt gggaggacca 29221 cttgagccca ggagttcaag accagcctgg gcaacatagt gagaacccgt ctctaaaaac 29281 aaacaaaaac ccagagccca tgatttcagc acccatgttc tttttttttt tttttttttt 29341 tttttttttg aggcgaagtc ttgctgtcac ccatgctgga gtgcagtggc gcaatctcag 29401 ctcactgcaa gctccacctc ccgggttaca ccattctcct gcctcagcct ccagagtagc 29461 tgggactaca ggcgcctgcc accacgcccg gctaattttt tgtattttta gtagagatgg 29521 ggtttcactg cgttagccag gatggtctca atctcctgac cttgtgatcc gcccaccttg 29581 gcctcccaaa gtgctgggat tacagacgtg agccaccgtg cccggcacat ccatgttatt 29641 tctactatac aactcatata ctattctgcc tctctaaatt ctgccacagg aaagtatatg 29701 tcaggaagag gacaattatc aaaaatgtat agcaatgagg gctggtcatg ttggctcatg 29761 cctgtaatcc catctactcg ggaggctgag gcaggaggat ggcttgggag ttcaaggctt 29821 cagtgggcta tgattgtgcc actgcactcc agcttacatg acggagcaag accttgcctc 29881 taaaaaaaaa aaaaaaaaaa gatggatgtg tagatgtgtg tgtatataga tatacgcata 29941 tatatacata tattttatat acacatatac atatatttta tatatataca tatttacata 30001 tattttatat atatacatat ttacatatat tttatatata tacacacata tatacatata 30061 ttttatatat atatatacac atacacatgt atatagtgga atttggcggg ggggagtgga 30121 gaaatgacat ggagtttttt taaggaagac agagttagta tgttgctggg aatgatcagg 30181 aggatgttct cttttctgtg tttttacaat gttactcata ggtccatgtg catcttttta 30241 gacatcctca gatgagatct ggcccaatta ctcaataaat agggtaagtg cactatcact 30301 gactgctcag ctgtaacttt gcttactagt cagaatccct tcttacactg acagctgaaa 30361 agcaggctga tagtcactgc tccacagcag tttttagtgt tcgatgggca cacactaagt 30421 atgactctgt tggatttagg tagtatctga tcttcctgcc cggttctctc atctgttgaa 30481 tggtggattg tattaaacac catcattctc agagtgccct taccaggttg cttcctgtct 30541 ccatttctaa caacagcact ttatttgtga agactatctc agggtacaac gtgcctccaa 30601 tgtcatctcc cctcccatgg ctttttgtag tattaagaca ataaaggcct ggtatggtgg 30661 ttcacacctg taatcgtagc actttgggac gctgaggcag gatagcatga gtctaggaat 30721 ttgagaccag tctgggtaac acagtaagac cccgtctata aaaaaaattt aaaaattagc 30781 tgggcatggt ggtacatgcc tgtagtccca gccacttggg agaactgagg tggaaggatt 30841 gcttgagccc agcagtctga gactgcagtg agttgtgatc gcaccactgc acttcagcct 30901 gggagacaga acaagatcct gtctcagaaa caaaaacaaa aagacacagt ttagtgtgtg 30961 ggtggaggga ggctagagag aatgaacatt actctggttg cctaaagcac caggcagctg 31021 agggacagga tcaggttttc tgattgggac atgtaagttt aagagaaaac tgcctgcctt 31081 cctctcctac agacactgtg gactgaggta aagacattgc tttactgagt taaaaatcac 31141 taagaaattc ttcttagatt ttgtcagtat caggcataat ttttggtgaa agaatgcctt 31201 ttctttttct tatttttcct ccatgcactt tattgggaaa ataggataga ttaaatcagg 31261 ggtgattatc cagatgttat tgttaaataa gctatgcttt gaagttaagg caacaggtaa 31321 cacatattga ataagccaaa gtgcttaatt tttgttttct atgaaatagt ttatttcatt 31381 gttcactatg gaggggctac aaatacactt ctgatattaa cacaaaggtc taaactttgc 31441 aacactggta aaaatggtta tgaggaattg agaagctagg gaacagggca catgacattt 31501 acatttgctc atctcccact atacatttca aaacatcaaa ggcacttacc aaagtgatta 31561 aataagggag tgtggaattt aaaataatga catataccca agacacaaaa taaaggactt 31621 gaattttctt atagtttgat ttctaacatt ttactgacta aaatctctgc taattatgta 31681 atctactttg tggctttaaa atattggata ctttgatttt aagactactt taaatgaaaa 31741 gaacctaatg gaataccata aagccagagt ttttccacac ccaattacag tagaagagca 31801 cccctatatc caggcaaaag caaaaggccc cactcttcag tcactcatat tgcttctcca 31861 gagagcactg tggagtcacc tctcactgat atactaacat gtggcatctt tgtggcttca 31921 taggcacatt cattcaacac acatttatct agcatctgcc aaggttcaca gcactgtgct 31981 gggtgctggg gagacagaaa catggtgtgg tgcttacttt aacaagctta catttatccc 32041 ctgtaattgg gttggtcatg ctttcttttc aattcctact gatttgttcc aagaagcttc 32101 ttggaataaa acagaagcca gaataaatgc atttgtattc ccatccctaa ttctacttgg 32161 tttccagaag aaaaaaaaaa tcagaataag actatctcaa agtattaaag aggatctaaa 32221 ccctacatca atctaaagca tttagacaaa tgcattttta gacaaaagca tttagacaaa 32281 tttcatttgt ctacaacatg gaaaaaacaa gttaagaagt agagcacctt tcagccctta 32341 ctacattcta aaacaaactt gtaaatgctt tcaaactgac ttatgctctt actctcaact 32401 cacagaaaaa tggagtccaa tcacatttcc atgtgaaaac acaaaattag gttaacagaa 32461 cagtcatcct tgacatagta atactctgac aatacaatta aggggacatc ttctaatggc 32521 atttgttctt aacatggaaa aacaatttct tgcatatttt ctaatcatag tgtatataca 32581 gatgtaaata cagagtcaat gtgtgcaatg atgctgacaa ggcatcaagc atgttttaga 32641 gtataccggc aatgagagat ttcaattact ttatattcct gccttctcag tctgtacaca 32701 tattatcatt tacaggatag tcacatacta tacagaattg aaagatgcca atttttgcta 32761 tcaaaagtgc tgaagttata atttcctaat acagactatt gaaatgcaca gaaagactac 32821 aataggcaat ctatagcagg gctaacaaag tatttctata tctcatgtct attaattacc 32881 cttcctccat cttatataat attccatctt cagtaatgaa cttttatcta attaagaaat 32941 cccccagttt ataaaatctg agggcaattt ccagtgaact ttgtcaggat agaggaacca 33001 aatgaccaca ggtattactg agttataata agaaggagta accaattaat cagtattata 33061 acaacaaaat tgctgcaagg caataggctt ctgtaattaa acataaatca caaaggtatg 33121 gattcaggca tcaaaatagt gacgtgattt cactgagtat caactagtat ttgcaaagct 33181 acttgaattt ttctgttgat cttcacttgt tgttaaataa tgatagtttt gatggtggta 33241 aaacttcatt tttcaagtgc aggatgatct gagctgaatg agaaaaaaag taattaaggg 33301 agggaataaa ggtaggcctg gcatcagttt gaacaatata tttttttctt ctgtttttga 33361 tcgtaagagg gcttatatct ccaaggttac cacaataaag tcataaccat gccagccatt 33421 ttaaggattt caggattaat caagaccaag aaagatgaaa tgatgaatta caaaccaagc 33481 catcactact acaaagttcc atgagtgata attctgttac ccatatccca ggcagcaata 33541 tgtttaaatt acaaagacag tgacatatat attgtcaaaa aattttactg ttacttatta 33601 tttaaattta aaaattttat aattttacat tttgtaatta tttatttaaa agtggcatat 33661 tttattaaaa ggaacactgg aaatatggat actttttgag aagtaatctt catcttcact 33721 tgtagatatt atggagaaac cagatttcta catcaactaa tggaaaatta accacttatg 33781 agcctagttc caattcaaat aaaggacata catacaaatg ttaggactga ttctgtccat 33841 tcagttaaaa tatttccatt cctgaattat tagctactga aattctaaaa atgctcattt 33901 ttctgggtat cttaacttaa aagatttatt agctgttcag atgctgttta tattaattga 33961 catttacaaa gagcatagag accaaagaca atatacacta tagtatgtaa gacctttatc 34021 tccttctaaa tgctctaagc aaactataaa atttagtaat tctatttaga tcttcttaga 34081 taacttgcag gtgtcttgag tgtctcagtc tgctttctct cttcaaattt tttccattct 34141 aaacagcatt ttttgcctcc tagtaataat cctgtagaat tttaaaaaca gacaaaaaaa 34201 caccaaaccc agattctatg ctttattcat tttgcattaa gatgaccaaa gaaataaaac 34261 ctaaataaaa taaacataaa atcatttttc cctatttaaa tactgataat ttctattaac 34321 cctattttta atttagggag ttttatattc atattgtcaa gatatccagt gcacaagttt 34381 ctgcagaatc actaatttta ttataaagct tcctctcccc acaccttagg ggacaatgta 34441 ttttcactct gacttgttta aaagacacaa ggaggttggc atattttaaa gttaacacat 34501 tttttcatta taaaaacagg cttaggttaa tggtcactct ctagcatttg gttaccgact 34561 gactgattat catatgtgtg cgtgtgtgtg tgtgtgtgtg tgtgtgtttg tgtgtatata 34621 tatatatata tatatatata tatatatata tatatataca cataggttcc tgactttcta 34681 ctactcatta ctgttggctg atgcaaaatt tgattgtgca acatacatct cctcaaggat 34741 gaacaggaaa tgcaaagata taaaattcaa agtattaata gatggttttg ctactttttg 34801 tacttttggt aaatgagata gttggaagtt agaggctatt aactaaaata aaattaagga 34861 gtaagtaagt tacctgtaaa tttcatttac tcacattagg actgactggt ctctctttat 34921 tggagtttaa cagaaaacag actggaccat gactggcagc tatatattgg tcattttcaa 34981 atactccaat gacactcttc aataacttta ttatctccag ttttatgatg ccataccatg 35041 ttttttcaca ctgtttaaaa atttccattt ccataatcca atgccaaaaa aattactaca 35101 cattctctta cacagtcaaa aatatattta ctggaatgtt tttagttata cctgagggca 35161 acaacattta caacatttag cttttgttac acctataaat gtggaaattg cttgcaaaac 35221 aactgtaata tgaaaggaac tatatttctg caaatattga cagaatggtt atgcatctct 35281 ttcagcatat aatttggtac tctacacagg cttcctgacc cccagcatga cccgctccct 35341 aatgcaataa tgcaagtcct tatgccatta aaatattaat atgaaatatt ataatcagtg 35401 tttatataca caaataatga actggcaagt atctagtttc aaattagatc cttgaaatat 35461 ggaaaaatag aggctagttc tctttgcctt aaacataatt attaaaagaa cattttgtta 35521 gaaagggaga agataattgg acattctcaa cgttccaatt tgaagtgata cttgcctctt 35581 agtttcttga aaaagaagtt ttgaaagtta tatacacata ggtttcattt aaattatgca 35641 gcaatctttt attactgtcc cagatcttgt ttaatgaaga tctctgtcat caaatcaagt 35701 acggtatttc atgactataa gaactgtaag agaaaacatg aaaaaagcaa tatatcaaca 35761 tcagatatat atgctatgtt tgaagcaata ttcctaaatg tagttgtgtc tacttttaga 35821 aatgggtaat gaggattgac tgaactttta tctgaggtaa taaataatca atgggaaagg 35881 agaagcaaaa aatatcataa aacaacctaa atgtctaaca ttggcatctt aaataaataa 35941 tggcatagcc atagcttgta tctgctgtta aaatattact aaaaattttg ataacctggg 36001 ataatagtca caatatattt attagaaagt acatataata tagtcttagt tttttaaaaa 36061 acatgcttgt atattaatag gaaaagagaa gtatatataa tagttatatt agtagctatc 36121 tctactaatt gtaagattat agttgaattt ttcttctttt tgttattaat atttcacatt 36181 ttccaattgc tcactatata tacatgattg attcttggtt tctgaggtaa ttattttcta 36241 taaggtcact acaaacacta aattagtgaa tatggaacca ctgcttctag ggaaaatata 36301 gggttaagtt cctgtgagtc tctggctact acatttttgt caatcaatgc ataaccttgg 36361 tttatatgtg tttctgttta aagacacttt atttaatatg tattattgat tcagtaacac 36421 tgtgctcaag gccaacagca ctataattca tgcctgaaca aaagtttgcc taatgtgtat 36481 tttttccatg aagacacatc acagctctct tgtactgagc aacactagac agcacttcag 36541 caccacgtgt ggggaccatt ttaaacagtg aaatcaccaa caaaaagcac aacaatgcag 36601 aaaatgtggt actaaataga ctacaaaaag aatgtttatg ttatgagtgc tgaaataaaa 36661 aggcagaaat ggctttgttt aacctcagct agggatgtgt gggttggggg acttacaatt 36721 ttttgttact ctgaaaatgt ctgtaaatga ctgtgaaagc accatgagta ctgattttgg 36781 ggttacaaat aaattttacc tggtaactgg atttgcaaac acggaatcta tgaataatga 36841 ggactgactg tatttctttg taacaaaaca tttttttctg tttaaaaata taacagcata 36901 atataatttt tcaaatgatt atttttagat actttccaaa gcgtcataaa acctctctgt 36961 aaacataggc agaggccttg aagaaaaaga gatatttaaa atgatctgat ttttaaaaga 37021 acattaaggg aaaaaggacg ggaaggagtc atgctttaat gatggaaatt atgctcaact 37081 acagaacaaa acaaaaacca aacaaaaaaa aatcatctgg agaacgtatt tggatttata 37141 cttccaaaac aatattacaa aatgcaaact gccattcctg agagttaata ataaatagaa 37201 aatgagtatt ttagacctta gtgtaatagg aaccagattt ggccttgaac aaatcactta 37261 gccctggatt tgggggtgag ggttgggggc agaacaggag ttccctttga gaatctgaca 37321 aaagatatag accctctctc cataaaaatg tacatataca tattcacata aaattttaca 37381 tattacttaa gagcattttc agagatcaat agattctaag agctcctgag acaggtcaga 37441 agtcattatt caatcctgac ttcagacata ctaggtcaac tgaaaataaa agggaattaa 37501 aatgaactcc aaaataaatg ataccctagg aagtaataat ccaataaata cttactgata 37561 atcactagca aaaggatagc agcaaccaaa gccatgatgg cttttatcta caacacacaa 37621 aagggcatat atattaatat ttgacttgtt aataaagtca agactatttg ttgttctatt 37681 tactgcatca tactaaatca tctcatcacc ttaaggtttc ttaactacaa aactgacaca 37741 gaatgctctt aggcaaagtt gcttattaca tgaggctatt cataaagaat ctaaaatata 37801 ttcctcaaaa tggagataaa aatgcaaaga agaaagtaag gggagaaaaa actaacagaa 37861 ttctcatata ctttgaaaaa aagtgctata aactgctacc ttcgatgtat tgaaccattt 37921 gaattgcatg gcctatgtta ttcatcatat cttgacactt gaacagagtt gctaaaatac 37981 tgtacatttt ctctggtctc acattagtaa taagcccatc aaatcataga ggttcagttt 38041 atatgataaa gtactttaag ccttttttat taatctgaag catttctaaa acaatttgga 38101 atttctacgg tagtttttca actttacctg aaatgtacat taatttgccc tgttaaacgt 38161 gtgtttggct ttacttaaga agatttgaat gaaaacatca atatcaattt taaaacacct 38221 caatgaatag agttttcttc cttgggtttc taacactggg cagaaaatgt ttacagctcc 38281 aaatggcagt attgtattca tgcaagtcca aagctctctt attccaaata acatgaacaa 38341 aaatctgttt gtcattaaac atataagagt ttcacagccc ccagatatat taagagaggt 38401 gttagcgatt ctctttcccc acctatggat aattttatta actcatttat ttttaaagcc 38461 aggtatattg agatacactt tacaaagaat aaaacttatc ctcttaaaat acttttgata 38521 ggttttgaca aatatatcca gctgttaata aacatgacca caatcaaata taggacattt 38581 ccaccaacac ccttaacaag tttcctcatg acctgttgtg agtctccaac tgcacgacat 38641 cccccttagc cactggcaac tcctgatctg cgttctgccc ttatatcctt tttcttaaaa 38701 cccttctgaa attaccatta tggaagtcaa tctagaaaat ggcaggcaca gggaggctgc 38761 cccagatgaa cagtagcaca atgaagtatt aatggggcac agtaaaactg aaggaagcta 38821 agcattagaa aataataaat gaagaatact cgctactgta aatggtgttt gttacattat 38881 ggaacaatga ttttcattac atggtcgatg ctattcatgg tccccagagg ttctcaaggt 38941 gatgactatt taaaaatatg actcaatgtt tttcctcctt aaaagcacgc tattcctctg 39001 acccaaaccc cataaagtac aaagcaaaca tttttagcaa tggttaaaaa aaaagctagg 39061 caaaatgtac attatgacca tctttttgtg gatacatgaa tagggtacat gagtttggga 39121 gagagaaggg gtgagtggct catggacttt ttgtttttct aaattagccc acagaataga 39181 aaatttctat agatatagaa gctttaggca aagtgtggtg gctcatgcct gtaatccagc 39241 actttgggag gctgaggtgg gcagattgct cgaggccaga agttcaagac cagcctggcc 39301 aacatagtga aaccccgtct ctactaaaaa tacaaaaatt agccaggtgt ggtggtgcat 39361 gcctgtaatt ccagctactc gggagactaa ggcaggagaa tcgcttgaac ctgggctact 39421 taggaggctg aggcatgaga atcacttgaa cctaggaggc agaggttgca gtgagccaag 39481 attgtgccac tgcactccag cctcagagac acagcaagag tgccacaaaa aaaaaaaaaa 39541 aaagacttta aagaaaataa atttattctt taaagaataa ataacttaaa atagaataga 39601 aaatttaaat tctgaacaag gggagaaatg aaacatattt cttccctcag ttccacttct 39661 gaaagtagga cgattcaata ggtagacaaa taaaaataaa taaatccaag gtaactcttg 39721 gaaaaaatcc tcagcatatg gctaagtcag gtttagaaag ttattgctgg ctggtgcagt 39781 ggctcatgcc tgtaatccca gcactttggg aggccaaggc aggtggatca cctgaggtca 39841 ggagttcaag accagcctga ccaacatgga gaaaccccgt ctctactaaa aaaaaaaaaa 39901 acaaaaaaaa caaaattagc tgggcatggt ggcacatgtc tgtaattcca gctactcagg 39961 aggctgaggc agaagaatcg cttgaaccct ggaggcagag gttgcggtga tattgtggtg 40021 agccgagatc agcctggaca acaagagcaa aactctatct caaagaaaaa aaaaaaaaaa 40081 aaaaggagag ttattgctaa acttgaaatt gaaaatgaga gactaaaaac agtgagttca 40141 tgtctaaatt ttaaacaaag ccaaaaatta taactagaat caggagaaaa aaaaaaaaga 40201 tttgctcata taacgtgaaa aacattttct gttaatgatt agtagccaca gtggatttaa 40261 gaagtaaatg tatttctata caatgttttg caggagtata tattagtatg tagtgattcc 40321 aatgtttacc ttgttaaaaa tggtaaaaat ttcttttact ttttattttt ctggaataat 40381 gagggaagtg gagttttttc ttaactccat ttgtcttaac agattcggat atgtaagctt 40441 ttatttttta ccctttcaac cgaaagtcat atttttctct agctcctact tttgatatct 40501 ctacataatg ctatagaggt tgctccaaaa aagtattaat agtaaatttc aagaaaaaat 40561 gtatatttgt tttaagttaa taagtttaga ggtagtcaaa tattaataaa atgtggatta 40621 ctttcattaa aaaaatctat gaaaacatcc ttttaggcat actgatttat ttagttatcc 40681 ctatcccaaa agcaattaag ttaacaagta tttactgaat ttctaccatg tctcaggaca 40741 gttataggcc ctgttctcac agaggttatg ctctagaggc ccttttttct tcatgtgttt 40801 gaataccact tcagatgctg accagtctat ccagtagctt ctgatttact tactttgcat 40861 ccacgccacc acatttgcct tcgaagttgt ttggatctgt tgctaaaagc tgttgcatta 40921 tccgataagc tttctatatc acatagagga tggagagaag gattaaacga cataacagga 40981 ttcttttatc tagtcacaca aagataagaa aatattaatt tctacttctg caaatttatt 41041 cccttttaag catgaaaaca gcattctccc ttttttccac ccacaaactc tggcattcta 41101 cattattaaa aaaatgaaaa caaaaagcaa aaaacagctt gcagatcagt ttgctcactt 41161 cttttttatt ataaaattaa gaaaagtctt tttagggcct taatgcttat aaataaactt 41221 aggaaatcta gtaagaaaaa tactctttaa aaattcaaat acttggcaca cctctagaaa 41281 aaaaattaca tgttgagaag ttagcttctt agaagatatc aaaagtgact gattttaaag 41341 gaaaacaata tacattttaa atgtgactaa gcgaggtgtt tttttttttt ctaaagtctg 41401 tcaaagatta atcaggtatt cagccattca ccaaacaata tttgagtggc aagtacgtgc 41461 caggtatagg tataggtata ggtataggtt ataggtgtaa tgactgaaag taaagaaagg 41521 tgactaagtt accaaataaa acataaaata aaaatagcag ataagtttga aacaagtgtt 41581 gaaaagtgag taggtcttta ggccctgaga tcatgagaaa cacaaagttc atcaatatca 41641 cagggcaaaa gaaaaaacag ctctactctt cactcttcaa caatgaaatt ttaaattctt 41701 cattctccac tgaacaatta aaaaataatg aaacaataag atgactcaaa gtaaaggagt 41761 taaaattaaa ccattaaaat catatactta agaaccaaat gccaatttcc taagccaaga 41821 tttgtaaacg ttggctagac aagtagaaga caaaaactag caaacagatt agtaatatca 41881 agaaatacat gaagatctgt acctgatttg tcctgtagtt catctagtct ctcccctctc 41941 tcaattacct ttgtaatatt ttcttgcatg acatcaataa cttcatccac ttgattctga 42001 acactagttt aaaaaagata cacaattatt tatccttctt ttgagaatgc ttacaatttt 42061 taaaactttc attagacata ataaagaata gcaaattctc agagaattat tcaactataa 42121 aatttaaatt taagactttg tagcaatgtt cttccatttg tgggcttcag tactcagata 42181 atgaaagaca atatgatata aattaaagta atatttaaag atagcttact tttagccctg 42241 cttctgactt gtaactcata agcaagttaa atgtactgtc agcatgcccg ttaagtacag 42301 aatttagtta aacagcttca ctaaaaataa tcagtttttc ttttcaacag gctattgagg 42361 cagcacatag acaccgctgt agcctacttt tgccatatat ttcatacatg aaggctaaca 42421 aaggaggtac cttaacatac tttttaacat accttttgct caggtaaggt aaaggcttat 42481 gaaacatacc tagtttccaa aatggaaatg tagggcatat tctaagagat actgatgctc 42541 aatgggtaga attcttcact gctcatcaga tttttcttaa acaggatata agacaactgg 42601 ggacacctgg gaatattttg aagtacccac caggaagcaa aaaggaaaaa attgttttct 42661 gtttttcgat tgtttgttca tcctttatat tagatgatgg gggcctgttt taaccaaatt 42721 tcaggctgtt caataaagac ttaacctaaa cttaaaattg taataactgt cctcagctat 42781 ctactctcat tcaggagtta cctccagagt atacaaacat ggacatgttt aattatttta 42841 agtgtcttat tctcctctgt tgctctaacc acaaaaatta ttttaaatta ctttacgaaa 42901 taagtcacaa caatgaaata tttaaaagga gaaataaagc aaatccatgg tcctcatact 42961 aaacactgtt taaagtgcat tggattacta gcgactacag acatgtactg cataacattt 43021 tggttaatga caaagcactt ttacaacact ggtcccataa gttataatac catattttta 43081 ctgcatgttt tctacgttta gatatgttta gacatacaga tatttaccat tgtgttataa 43141 ttacctacag tattcagtgc agtcacatgc tgtacgggtt tgtagcttag gagctataag 43201 ccataccata tagcctaggt gtgtagcaga ctacaccatg taggtttgtg taagtatact 43261 actctatgat gttcacacaa tgaaactgct taatgatgca tttctccaaa catagcctgt 43321 cattaagggt ccatgatggt tataaacatt aaatgtatat atatttaaat agatgtagat 43381 atgcaattgt gtttctactt ggaaaaggaa gaagtgtgta gggggcttta aaaatttttt 43441 acaatacaca aatatggtct ccgccctcta ccccacacaa ctccccaaac taatgtagta 43501 agtgaggggt gaggcacagt catgtttaat tttgaaagag ttctatcctt tccgatcact 43561 gagctagttc ttcaaaaagt ttgctataaa aacaaaatga aggcccactt agacataggt 43621 ggattcagaa gtactctcag attcaagtag caattactaa gcttgatgac agtcacttcc 43681 ctcccatgtt aaaagcatta tgtacaccca ctgattaccc acacataaac ttcaagattt 43741 ttcacagcat cccagagttc tgtgcttccc acacattatt ccactggagg atataagact 43801 gaagctcaac aaagttgggc atgcagaatt ctcttctgcc caaacaaatc tgaaagaacc 43861 cctaaatctc taagacatct tgttgcaact cctaatcccc taagtagaga gttaaaggtg 43921 ttcatcctat ttaagaacaa tggcacaaat acacaactaa tcaatttatt cagcaaataa 43981 tgagaaccta ctatgtactg ttttaggtac tgaggaaata acattgaaca aaacagatga 44041 taatcccagc ccttatggaa catattatgt tgaatgcact gtattaacct tatagctgtg 44101 aaacagcatg ctttacccca ttactctatt ttatttatag catgtaagca aatataaagt 44161 aatagtaatt gtacctattt acttatttac atgcttattt tctgcctctc cccactagaa 44221 ggtaagctca atgaatgcag tgaccttgcc tattttgttc acttttgtaa gtccagtacc 44281 tatagtggtg catggggcag agtggccatt caataaatat ttgctaaata aatgaacagt 44341 ttgagtataa atcctataaa gcatatccag ctacttgagg aggagatata aagcagtgac 44401 tgtagagccc tccatttaac tacaaacaaa agcaaagcta ctgttcatca aggatcaatt 44461 ttgaaattcc cagccagctt tttttttttg tcatatgcaa gtgacgataa ttaatggctt 44521 aaagtagggt gctatagcag gaagggaatg aataaaatgt actaaggctc ctatacacga 44581 tcagattata gctattatgg atattaatat atgtatacta tacgtactct tagacgagat 44641 cgggcgcatt cagggtggta tggccgtaga cattatatat acttttaaaa tgaaaacaaa 44701 aagctaataa aataattatg gtcccagaat ctctttaaag gccaggcctg gtggttcatg 44761 cctgtaatct caacactttg ggaggccaag gtggaaggat cccctgagcc tgggagtttg 44821 agaccaggct aggcaacaga gtgagccctt gtctttacaa aaaaatttaa aaataagctg 44881 gttatggtgg catgtgcctg tggtcctagc cacttgggag gctgcggtgg gagatttcct 44941 tgagcccagg aggttaaggt gcagtgaact gtgattgcac cactgcactt cagccttagc 45001 aatagagatc tgtctcaaag gaaaaaaaac ataacccttt aaggataaga ttccatggat 45061 ataggcagat taattgggac agaaccaata aaattctaga ttcttactag agtatcataa 45121 acctacaagt attccctcaa caacatggca cacatgtaat aaatcactat tgataatgga 45181 aaccgcaaat attttaattt ttctgctaca ctattagtga gttattgctg ctttgtcctt 45241 cacacattaa ttatctgctg gtgtaacact ctgaatccag ctactatgta accttcactc 45301 aatgatgtaa aatctcatga ttgtttttgt gtttcaggat gtttccctac ccagaacaca 45361 atttttgtag aagaaagaaa atgatctgct tatttacttt aatatggaga tagtttggtt 45421 tacaaagcaa gtgataacat acttccccat gatcttttac attttcacaa tagttatagt 45481 cctttgattt ctatttgtat ttcacattgg tcttgtcccc attgttatac tataaatgcc 45541 ttaaaaacag agactaaccc acttattctt taatttccca ggtcaagtga aatgtggctg 45601 gtacaaagta agcatttaac aaatattttc aaatctgaat taaatatatc attcattcaa 45661 aaaacattaa aaagttatgg ctaggtgtgg tggctcaggg ctgtaatccc agcactttgg 45721 gaggctgagg caggaggatc gcttgagccc aggagtttga gaccagactc gccaacatgg 45781 caaaaccctg tctctacaaa aaatacaaaa attagccgag catggtggtg tgcacctgta 45841 gtctcagctg ctcaggaagc tgaggtggga ggatcctcct ccgggaggtg gaggctacag 45901 tgagccaaga ctataccact gcactccagc ctgggtgaca agagtgagaa cctgtgtcta 45961 agaaaaaaaa agttattatt atgctattca tcaaggttat catcatgaag gttatgagcc 46021 tggtatttag aaagattata ctccagtagg gaaaactgat atccatatac taccatgtgt 46081 taagggatat aatagaagga agcccatagt atcagtcatg tagccttgat gaattcagcc 46141 tggaccagag atgacatcaa gacccctgtg ctgagttttg aggagtcaag agttaatcag 46201 gcagagtaaa agaggcagtg gatggtgaga aagttctagg ataaggaaat atatgtcaga 46261 ggaggtaata ttttaagaac taggagatta actggatgtg ggtgaaggaa ttgggaaaag 46321 agtagacaac tcctagattt ataatttgag caattaactg gtagtagacg atggctatta 46381 actaaaataa aaaattaaga acgcaaagga aaagctagtt tctgaaggaa gggtggagat 46441 ttttatatca tggccagtta aaggctattc atataagata ataaaagaca tgagagagaa 46501 cacaatgagg aaaagacatt tccaatactg ttttaaatcc tcttagagtc tgacggcacc 46561 tgtatttctg aaatggctct attctaatta ccaacagaga agctgtatgg tataatgtag 46621 agcccctggg ttctgtattc agacaaacct gggtttgcaa tacaacttta tcccttacta 46681 cgtatgtgac tttggaattc accttggaag cctgttttag cctaacattt tctcaccttt 46741 aaaatgcagc tgataatctc tactttactt gtgagaatta aatgagatag tgttattcaa 46801 acacttagca tagtgcccaa cacatagcac atacttgata aatgtcagtg cccttcgatt 46861 aagaaacaaa ttcatgctct gcaatacgca ccccttattt tataaagaaa acactggcca 46921 ggtgcagtgg ctcacacctg caatcctagc attctgggag actgaagtgg gaggatccct 46981 tgagtccagg agtttgagac cagtctgggc aacataggga aaccccgtct ctataagaaa 47041 ttagccaagc atggtggcac gtgcctgaag tttccagtta ctcaggagct gaggtggaag 47101 gatcaattga gcccaggatt ttgaggctgc agtgagctgt gactgtgcca ctgcactcta 47161 gcctgggtga tggagcaaaa ccctgtttca aaaaacaaac aaagcaaaaa tccctgaaag 47221 aaaatattta ttttcatagt taatatcccc atcttctggt cttttaaggt actgctgaca 47281 aaagaagttt taaagttaag ctgtattttt gcaactacat atatattgct gttacaccat 47341 ggaacattcc ctggggtgtc ggggagggaa tctcataaaa gttaagctgg tctgaccaag 47401 actataattg ggagattccc tactcctaac acagaatgtc tacaattcac tacagtggca 47461 tttataatgc tattgtttaa tttaaaggca aaggaaatgt aatcctgcta aaagttcctg 47521 tctgcttata taaatgaaac cttaacttct ctactctgga atgctgactc cattcctttg 47581 gagctggtat ttccaggtgg tccatcctca taatttatgt ttgaacaaac tctctttaaa 47641 aacaaaacaa aatttaaaaa gcaaaggaaa cattccaaag agaagcataa attttatgta 47701 gacatatgaa tatttaaaaa ttgtcctcaa aaagagtctt tcctaagaag tgatgtcaag 47761 tattttttgt aggattaatc tgttatcaac actaataaat gtggctaatg aagttcaaag 47821 aaagcaatga tgttgctaat ctatgcagta ggaaacactc tgtattgaaa tgttataaat 47881 taagtggtga taacagctta agtaaagaaa ctgaccagtg ctagcaattt tgggtactaa 47941 aataacgcca gggatttgtc attgggtaag ttgtgtggca agttgtgctc ctccatcttg 48001 ggcacactgg tgagtcatgc cctgcttaga taagcattaa tcaaggttta atgcatgctt 48061 gatgaacgac cttattctaa ctaaatccag caagctttca gctctagagt aaaaaacgct 48121 atacttaaca atcccttact attctaaaat tctgtgatca ccaaagttat tgtgatatta 48181 atagtgtaat acataaatat acaaattcat gtgtgtatgt ttggcattca ttaacataga 48241 catatattct ttaatgccta ctctgtacaa ctggtggtgg tttggggatt cagctggaaa 48301 atcatactca aaagacccca atcggcaaca tcccttacta ggttcactct tcagaaggat 48361 atcagacagg aaacagagca tactggcagg acaaaaaata ggtgtgtcat cagtggaggc 48421 ctcttctcca gtggcagctt aggcaaaaga gaacatgtga gtggataaag aagtcattct 48481 ggctcagtgc aggagaatgg ttcagacatt acacattggc atccacatat gccacacagg 48541 cttgaactac agctaatttg agtcctggta gaatgcttgt gatctccctg gaacatgaga 48601 gaatgagtgg ctcatgaggt cttcttccat ccacctccct gtttccctta tgagactatc 48661 ataaggcagt ttcttatcac ctcccaggtt ctgatgcggt ataaactttc tgcaggctct 48721 atgtgcagac tataaaaacc acagcctaga aatagctcct ttgcaataga gtatcccaca 48781 atttgtactg cagccattga cctccccttt tttttttaag agaagggggt ttcactttgt 48841 tgctcaggct ggtctcaaaa cttttggctt caaacaatcc tcctttgaag gagaatccca 48901 aagtgctggg attacaggcg tgaaacactg cacccagctg tatcattatt tttggtgtgc 48961 aggacaatga ccctgagagc tactaacttt ggctactctt ccttttacta tataagtaat 49021 aaattgcatc taaaagtggc ttgttgtatc tttagtcaaa taagagactt gaaacagtac 49081 agattctctc ttctactggc ctcaaagaag caccatgaca tctatagctg caggaaataa 49141 attctgataa cttcctgagt tggtcatacc tggtgatacc ctgattgcca tcttgttaaa 49201 cctgagcaaa aggctcacct aagaagtgcc tgaactcctg acccaaggaa actgtgagat 49261 gataaatgtg tgttgtttta agctgctaag tttatggtaa tttgttacat ggcaatagaa 49321 aactaataca cccagtctag tacctgtaat catttattgg ttctttggtt ccatatctca 49381 aaatccatta gttagaatta ggctaagtat ggaaatattt cttcacacca acagctacct 49441 gagtaaagcc caagtatctc accaataact agatacttca ttatagaacc cttcccacaa 49501 tgtgccatct atacccaggc ttagggtagt gactattatc cctccttagg tcttgaacct 49561 gggccatgca gttcacattc cacagttagt ctttacagct tggtcagttt ccagagcctt 49621 ccctcaagtg actcactgct ctgtctattc cttctcccct ccagtcagga ctcatatctt 49681 tccgatatca gacacattga ttacagcata ttatactcat ctatcaccaa ataatatata 49741 cctcaaaggt tttaaaccac aaattagcca taaaacttag ttcaatatat gggttatgat 49801 aaataagaga tatgaaaaaa ttctggaagt agataaaaac atgcagtaca tcaacaaatt 49861 aaagctatta tattaccttt gttatcattc tgaaaatagg aatcaagcta cagatttaaa 49921 accaaaatag accttcagac tatgctaaat aaatggattt gacataaggt aatcaagtca 49981 taaaaaaacc aacaatcctg catttcctta gctataattc tgagaaaata aaatgaatta 50041 ggtttttata acccagcagt tatatttagt cttcatcagg ggtcaacaac tttttctgta 50101 aaggaccaga cactcagcag atattttaga ttttgtgggc cataaaatgt gacacagctt 50161 tgtcacaact actcaacttt gccattgtag cttgaaagta cctatcgata atatgaaaac 50221 aaatgggcat ggctatgttc caataaaact ttatttacaa aaacaggctg cagccagatt 50281 tggctcacag gctgcagttt gccaatctct agtctacaca gaaatatgat tatcatttaa 50341 acaaatgaac aaaacttact gcttaatttt atcatttcta ggtccaaatc ttggtccaga 50401 tggtcccctt ctgaaaacaa gtacatacca agtacatatt agtagtgaca ggattaaaac 50461 aaaacaaaaa aaagatagaa agtatacaca aatacaaaca atcaagctga actctaaaca 50521 tggatcagtt atttatcata agggaccaca ttttactgat actggtgaaa ctgaataata 50581 ggtaatcatg tctttacttc attggcactt attactagcc acaattaagc taattcttct 50641 gcatgttaaa actttcagag aaattagggt ttatattatc ttatataaca aacttacaca 50701 gaatttgttc tgatcattaa actctgaaat atcctaacta cttaactgct agtgaatatt 50761 aatatcttta tggtcaactg tagaactaaa agcgcaattc atcttgaatt tagactataa 50821 tgaaatgagg aaactacaaa gtgaactgcc gattgacaaa gatggatata gagaaaaatt 50881 cattgaagac actaacaatt ttaagtatgt ttttgattat tcatcttaaa cttatcaaca 50941 tagcaaaagg gaaatgcaag atattaagaa aattttaaca ttactcagat tattatctcc 51001 ttattgtagt tgagtgtata tcacacctta accaatccat tttaatattc ttgaaatgaa 51061 aacatttggt tatttaaagt aatcatgatt cttccggaga tacagaataa ataagacata 51121 aactacaaaa ctcaaaagaa accaggttct tagatttcta gacatggcaa tttctaaagg 51181 aggtgaaaaa atacagcaat tgataaaaaa attttggaaa aaaaaactgt ttactgaaac 51241 ttatttaaca taatgtataa gtgtgaatat atcatcttgc cttagttcct ctaagctttt 51301 tccttaaagt cttttaagct ccaactaagc ctcagaatac tggaacagta ttccattcca 51361 aagtgatgtt aaacattgaa tttcatagag aaaactaata ttctgaacca gaggccattc 51421 ctaaaaacta ggcaagaact tacaagagct ggtcaaccga aaaacaaaga aaagcagtca 51481 ttaaatgagg aaactataaa gtgaaataga atctgaaaag tactgaatta aaatgatccc 51541 cactacaggt taaaaaaaaa gcaggagcta tgtctgtttc atctatcaag taacatgaga 51601 cttcaaaata cttatagaaa aacacaaaat gaaaagctgg ttctttgaaa attaaaactt 51661 gataaacttc tagccagact gattaggaaa aaaagaaaga aggcataaat tagtaacagg 51721 aatggaagag tgaacatcac tacagacaaa tgtaccatgg ttatataaga tttattaaca 51781 gaagctaagt gaagcgtaga taggaactct ctatactatc tatgcaaccc ttctgtgggt 51841 ctaaaattat ttttaaaaaa tcccccaaag aaaattccag gactagattt gttcagttgt 51901 caattctacc aaacatttaa tgaagaaaaa ataagaattc tatgcaaact attctggaaa 51961 actaaaaaag acagaacttt cccaactcat tatccagggc cagtatgaac ctaccacgaa 52021 aaccagataa agacacaaca agaaaaagac ataactatca accaaccaat acctctaatg 52081 aacatagatg caaaaattct taacaaactc ttaacaaatt gattctaaca atgttttaaa 52141 agaataatac atgatgacca agtagggttc atatcacaaa tgcaacgacg gtttaaaatt 52201 tcaaaatcaa cataattcaa cacattcaca gaccaaaaaa aatgatcatc tcaatagata 52261 caaaataaat gcttgacaaa atccaacgtc agctaatgat aaaaactctt cacaaactag 52321 gaataggagg aaacttcttc aacctgataa atctacacac aacaacagta acaacatgca 52381 atgaaaatga aacataatat gaaagatgaa tgcttcccct aaaggagatc aggaacaaag 52441 caaggatatt ccctctcacc acctctactc aatattgtgc tggatgttct aatcagagca 52501 ataagctaat aaaaaataaa aggcaaacat agacgaggag gaaataaaac tgtctttatt 52561 tgcagaaagc ataattgttc tatgttgaaa atccagtagg atatataaaa acactactaa 52621 aactaagtgg gtttggcaag attgtagaat acaaggttaa tttacaaaat gatatttata 52681 ctagcaataa gtatgacatt ttaaaatttt gaaacaccat ctgaaatgtc atttttagta 52741 gcatcagaaa ttatgaaaat caatgtgcaa aaatcacaag cattcctata caccaataac 52801 agacaaacag aaagccaaat catgagtgaa ctcccattca caattgcttc aaagagaata 52861 aaatacctag gcatccaact tacaatggat gtgaaggacc tcttcaagga gaactacaaa 52921 ccactgctca acgaaataaa agaggacaca aacaaatgga agaacattcc atgctcatgg 52981 ataggaagaa tcaatattgt gaaaatggcc atactgccca aggtaattta tagattcagt 53041 gccatcccca tcaagctacc aatgactttc ttcacagaat tggaaaaaac tactttaaag 53101 ttcatatgga accaaaaaag agcccacatt gccaagacaa tcctaagcca aaagaacaaa 53161 gttggaggcg tcacgctacc tgacttcaaa ctatactaca aggctacagt aaccaaaaca 53221 gcatggtact ggtaccaaaa cagagatata gaccaatgga acagaatagt gccctcagaa 53281 ataataccag acatctacaa ccatctgatc tttgacaaac ctgacaaaaa caagaaatag 53341 ggaaaggaat ccctatttaa taaatggtgc tgggaaaact ggctagccat atgtagaaag 53401 ctgaaactgg atcccttcct tacaccttat acaaaaatta attcaagatg gattaaagac 53461 ttaaatgtta aacctaaaac cataaaaacc ctagaagaaa acctaggcaa tatcattcag 53521 gacataggca tgggcaagga tttcatgaat aaaacaccaa aagcaatgtc aacaaaagcc 53581 aaaattgaca aatgggatct cattaaacta aagagcttct gcacagcaaa agaaactacc 53641 atcagagtga acaggcaacc tacagaatgg gagaaaattt ttacaatcta cccatctgac 53701 aaagggctag tatccagaat ctacaaagaa cgtaaagaaa tttacaagaa aaaatcaaac 53761 aaccccatca aaaagtgggc gaaggatatg aacagacact tctcaaaaga agacatttat 53821 gcagccaaca gacacatgaa aaaatgctca tcatcactga ccatcagaga aatgcaaatc 53881 aaaaccacaa taagatacca tctcacacca gttagaatgg cgatcattaa aaagtcagga 53941 aacaacaggt gctggagagg atgtggagaa acaggaacgc ttttacactg ttgatgggag 54001 tgtaaactag ttcaaccatt gtggaagtca gtgtgcgatt cctcagggat ctagaactag 54061 aaataccatt tgacccagcc atcccattac tgggcatata cccaaaggat tatatcaagg 54121 gaaccacccc cgataattca acataggacc ttttctattt tccctaagtg tcggctggtc 54181 tgagtaataa agggaaagac tacaaaagag agaaatttta aaactgtgtg tctgggggag 54241 acatcacgtg tcggcaggtt ctgtgatgcc cccctacgcc gcaaaaccag caagattttg 54301 ttgtgatttt caaaggggag ggagtgtact aatagggtgt ggattataga gatcacatgc 54361 ttcacaaggc aataaaatat cacaaggcaa atgggggcag agcaagatca cacgactggg 54421 gcaaaattaa aattgctaat gaagtttcgg gcacgcactg tcattgataa catcttatca 54481 ggagacaggg tttgagagca gacaactggt ctgactaaaa tttactgggc aggaatttcc 54541 tcgttctaat aggcctggga gaccagggct tatttcatcc cttatctgca acatataaga 54601 cacacattcc cagagcagcc attttagaaa cctcccctag gaatgcattc tctttctcag 54661 gactgttcct tgctgagaaa aagaattcag tgatatttct cctatttgct tttgtaagaa 54721 gagaaatatg gctctgttct gcccagcttt caggcagtca gacctaatgg ttatctccct 54781 tgttccctga acatcgctgt tatcctgttc ttttttcaag gtgaccagat ttcatattgt 54841 ttaaacacac atgctttaca aacaatttgt gcagttaatg caatcatcac agggtcctga 54901 ggcgacatac atcctcagct tacgaagatg acgggattaa gagattaaag taaagacagg 54961 cacaggaaat cacaagagta ttgactgggg aagtgataaa tgtccatgaa atcttcacaa 55021 tttatgttca gagattgcaa taaagacagg tgtaagaaat tataaaagta ttaaattggg 55081 gaaccaataa atgtccatga aatcttcaca atttatgctc ttctgccatg gcttcagcca 55141 gtccctccat tcggggtcgc tgacttccca caacggatta taaatcatgc tgctataaag 55201 acacatgcac acatatgttt attgtggcac tattcacaat agcaaagact tggaaccaac 55261 ccaaatgtcc atcagtgaca gactggatta agaaaatgtg gcacatatac accatggaat 55321 actacgcagc cataaaaaag gatgagttca tgtcctttgt agggacatgg atgaagctgg 55381 aaaccatcat tctgagcaaa ctatcgcaaa gacagacaac caaacaccgc atgttctcac 55441 tcataggtgg gaactgaaca atgagaacac ttggacacag ggtggggaac atcacacacc 55501 gggtcctgtc atggggtggg gggagggagg agggatagca ttaggagata tatctaatgt 55561 aaatgatgag ttaactggtg cagcacacca acatggcaca tgcatacata tgtaacaaac 55621 ctgcacactg tgcacatgta ccctagaact taaagtataa taataaaaaa taaaagaaaa 55681 tacttaagga tcaatttgac aaagaaaaaa atacaacatc tggacactga aaactacaaa 55741 acactgctga gagaaattaa gtacccaaat aaataaaaag atatactatg ctcatgggtc 55801 agaaaaccaa atattgttaa gatgttaatt atccccaaat tcatctatag gttcaatata 55861 atctcaacca aaattctagc aggcttgctg gaaactgaca agatgattct aaaatttata 55921 tgaaaatgct tagagcctcc aatagccaaa agaactttca aaaggcagtg caaagttgaa 55981 ggatatacac tacactacct cacaacttac tataaagtcg tagtactcag gatagtgatt 56041 actatggtat tgatgtaaga aagacacata gagtccagaa atagagccac acatatatag 56101 tcaataaatt tttattgtgg taaatatata taacatatat ttactatttt aaccgttttt 56161 aagcatacag ttctgtggca ttaagtaatt cacttaatga actgtcatca ccattcatct 56221 ctagaacttt ttcatctttc aaaactgaaa ctctgtcccc taacttgtca ttcccttttc 56281 ctcaaaagcc ctggcaaaca ccattctatt ttgtgtttct atgaaattga ctactctagg 56341 tacctcatat aagtggaatc atgcaatatt tttcctttta tgactggctt cttttactta 56401 tcataaagtc ttcaaggttc atccatgttt tagcatatgt cagaatttcc ttccaaactt 56461 tttaaggctg aataaattcc atttatgtat tcaatagtga gttcctgggt gtagaggctt 56521 gtgctattaa ttctagctac tctggtaggg gacctgaggc aggagaatca cttgaggtca 56581 ggagttcaag accagcctgg ccaacataat gagaccctgt ctcagtcaat caatcaatca 56641 gcaaacaatt aaaaaaaatg agttcctttt tgaggggtag ctattgctct agtctaaggc 56701 ttttcaccac aagtagcatt gtaggcaatt cactttgtga ctctatagca atcttgtcac 56761 ccatagaaaa ttatctcccc acattatctg acattctgcc aaaaatgata ctattttctg 56821 ttactggatt tgctgctatc tatcctgtaa gctctttctt cactgcttct cgggtttcca 56881 ttaatttttt ttttttttag atggagtctt tctctgtcgc ccaggctaga atgcagtggc 56941 gtgatcttgg ctcactgcaa cctccgcctc ccaggttcaa gtgtttctcc tgccttaacc 57001 ccctgagttg ctgggattac aggtgcacac caccaggact ggctaatttt tgtattttca 57061 gcagagacgt ttcactgtgt tggccaggct ggtctcgaac tcctgacctc aggtgatcca 57121 cctgcctcgg tctcccaaag tgctgggatt acaggcgtga gccactgtgc ctggccaggt 57181 ttccgttaat ttttataaaa tgttttcact tggaaaaaaa cccagcctca acaaaaaaag 57241 tcacattttt ttttgggggg ggcgaagtat cactctgttg cccaagctgg agtgcagtgg 57301 tgcaatcttg gctcactgca acctccaact cctgggttca agcgattctc ctgcctcagt 57361 ctcccaagta gctgggtcta taggcatgtg ccaccatgcc tggctaattt ttgtattttt 57421 agtggagacg gggtttcact atgttggcca ggctggtctt gaactcctga ccttgtgatc 57481 cgcccgcctc ggcctcccaa agaggtggga ttacaggcgt gagccactgc acccagccaa 57541 gtcacagtat tttaacaagg taagaggtaa atgtttcctc tcaggcatat aaccttctac 57601 atcaatcccc agagaaaact cccacaacaa tgttgtcctt cctgacttgc caaatataaa 57661 cctagacctg tctgagagct ctttctaatc ataactacaa aggattgatc aggtcaacta 57721 attttttatg agataccaag gtaattcagt ggggaaagga tgatcttttt atcaaattac 57781 aatgaaacag ttggatggcc atattccccc agtttcatcc ttacctcaaa tatatacaaa 57841 aattaactta aaaagggatt gtaggcctaa atataaaagt caaaactata aaactcctag 57901 aagaaagtac aggtaaaaat cttagtgacc ttgagtttgg gaaagatttc ttatatatga 57961 aaaaaatagc aaaaacaata ggaaaaaaaa ttcaataaac tgatgccatc aaaatcaaaa 58021 ttttaatctt caaaagatat tattacaaaa tgaaagggtt aagttataga tttgtagtaa 58081 atattttcag aacacatatc tgacagaaaa cttgtatctt gaatatataa aaactcttgg 58141 ccaggcacag tggctcatcc ttgtaatccc agcactttgg aaggctgagg tgggaggatc 58201 agttgaggcc aggagtttga gaccagcctg ggaaacaaag tgagacctca tctctgcaaa 58261 atataaataa atataataat tagttgggtg tgatggcaca cacccgtagt cccagctact 58321 tgggaggctg aggcgagaag atcatttgag cccagtagtt tgaagctgtg gtgagccaag 58381 actgcgccag gacactctac caacctgggc aaaagtaata ccttgtctca aaaaaaaaaa 58441 aaaaaaaagc caaaaaaaca caactcaatg agaaaaattt taaaatgggc aagatttgaa 58501 cacatacttc accaaagaag gtaagaggac gaaaaataag cacactacta gatgttcaac 58561 atcatatgtc attaaggaaa tgcaaattaa aaccacaaca agataccact acacatccac 58621 tagaatggct agaatgaaag aggctgaaaa agactgacca tatgtaatga cgaggatatg 58681 gagcaactgg aattctcata caatgctggt gacaatataa aatggtgcaa ccactttgga 58741 aaacaattgt aaacttacac caaccattct actcctaggt atttttattt ccaagagaaa 58801 taatagcata tatccacaca aaaacctgca cacaaatgtt tataacagct ttatatgtaa 58861 aagccaaaaa gtagaaacaa cccaaaaatc cacacataag tgaatgcata aacgaactgc 58921 agcatcatac aatggaatac tactcagaaa attaaaaaac ccaactatta acacatggat 58981 gaatctcaaa aaccatgctg aataaagcca ggttttaaaa agggggcata ctgcatgatt 59041 tcatttataa aaaattttag aaatacaaac tacagtgaca gaaagcagag cactggttgc 59101 ctagggactg ggagagcaat ggattatcaa gggttatgag gaaacttgca gggcgacaga 59161 tggttcatta tttgattttg gtgatgatgt cacaggtaca cggtacagta agctctctct 59221 cacccatggt tttgctttcc acagtttcag ttacccatgg tcaactgcgg tctgaaaata 59281 ttaaatggaa aattccagaa ataaacaatt cataagtttt aaattgtgca ccattctgag 59341 tagcatgatg aaacctcaca ttgccccgct ttgtcccacc tgaaatgtga atcacccctt 59401 tgtcctgcat atccacattg tatatactac ccagcccttt agttatcaac atcttcttgg 59461 cttcatgatc caaggtcacc tgaaatagat gattctcctt ctgacatatc atcagaggtc 59521 aatagtacct aacgttatgt cacaacgtgc tactctctgt gttttcacat atccactggg 59581 ggatcttgga aggcatctct cacgtataag ggagactact gcatatctgt caaaatgtat 59641 acagtatatt ttagatatat gcatgtattg tatgtcaatt acaccctaat aaagctgtaa 59701 aaaaaaactt acagaaaaaa gtcctcttct tcatctgaat catcttccaa aagatttctc 59761 tgtcaaaaaa agaaaaagac ttgagttttc agattcagag ggcttataaa atgtcacaga 59821 ggagtaatga aataagtcct atactagtga aatttcttaa ctccaggtgt ccataaagag 59881 ggaaaagtaa ttacatatcc ttcactggaa gctaaaagat gtgggacaac atttatagac 59941 tttgcaaaga aaatgctcat aattatagaa tcctataact gactaaaaaa ggtagaaagg 60001 tcactgaagc tggtatggga gtgaggaact gcaggtagtc aggagggagc aagtcccctc 60061 tccttccttc tgactgccag tgtctgtcta ctggcagaac ttagggaacc aggtggaaag 60121 aagaaataca gtttgtggat tccaactgca gcatcagaga agagtataga gggagtttgg 60181 agctgaggac aagggcttaa aaatgggcat gttctacttt ttagctactc tacccacaaa 60241 cagcccccta aacaaatttg aacttccaaa caaaagcaac tttagacttt tataacatac 60301 aacaattttc cttaaaatga ggacagtctc accttctccc caaaaggggt gggtataaag 60361 tccaaaaggt gactaaagaa gagcaaaacc cagacctttt cagacacaca agaactcaaa 60421 cagtatacaa gttaagaatc ccttatccta aatgcttggg accagaagta tttcaaattt 60481 tggattttag aatatttgca tatatataag gaaacatctt ggggatggga ccgaagtctc 60541 aacacaaaat tcattatgct tcataaagac cttgagtttc aaatatagct tatatacata 60601 gcctaagggt aatttcacac aatatttttt aatacctttg tgcatgaaac aaagttttga 60661 ctgcgttttg actgcaacat gaggtcaggt gtggaatttt ccattgtggc atcatgctgg 60721 tacttaaaaa agtttcagat ttgggagcat tttggatttt cagattaggg atgctcaaac 60781 tgtaccactt gctataggca gaataatagc tccccaaaga tgttcatttc ctaatcacaa 60841 aaacctgttt tttgtttgtt tggtttgaga tggagtcttg ctctgctgca caggctggag 60901 tgcagtggtg caatctcggc tcactgcaac ctccacctcc cgggttcaag tgattctccc 60961 gccttagcct cctaagtagc tgggactaca ggcgcatgcc accacacctg gctaatacag 61021 tcaccaaaac ctgtgaatgt gttaccttgc atggtaaaag ggactttgca gatgtgatca 61081 cggatcttga gaagaaaaga tcattatgaa ttattcatat gggccccatg cagagtgagg 61141 gagagagaga gagacagaca gacagagaca gacagacatt acactaatgg ttttgaagac 61201 acaggaaggg gtatgagcct aaaaaggtaa acagcctcta ggagctagga aaagcaagaa 61261 aacagtttct tccctaaagc ctccagaaag aacatagccc tactggcacc ttgattttag 61321 ccctataagc tcattttaga cttctggtca ccagaaccat aagacaataa atttgtgttg 61381 ttttaagtgt gtggtaattt gtagcagtag caatgggaaa ctaatacact acccatgtac 61441 ttttcctgag gaaattcctc aaggaattac accagccaaa aagaaaatga aatcaaaaca 61501 aagaccttaa gactgagatg gatgatacaa tcaatgaatc taatacaatg tgtggtcaaa 61561 tctaaatagt tgatgataat gtgactgtaa acaataagtt aaagcttttg aaaaaaatac 61621 atattatata gaaaagctta aaaagaatct aaaacttttc caacaaagcc cagaagctgg 61681 gagtgggaga gtgaagtgaa gaagtgaaag cattgccaat atgctcatct tgatgaggag 61741 tggaatggag gaagggggaa caggactaga attaaaaagt aaaacaatgg aagggcctta 61801 gtttttggta tgttgacaga gaacacagat taacactacc attaaaatat aaagttagta 61861 acaaaagaga gagggaagga gggggaaggg agtaaagtat ctaaactgtg gtttagatat 61921 ggtggttctg ggggcaagga gggagatgag taaaaataat caatctagca taagggaagg 61981 cataagggaa tctaccataa gaaaaggcat aagggaaaag gaaaatataa gaaatacaat 62041 gaatatcact atatctatgt ataatatatg acatatacta aagagtaaaa aaatgaaaga 62101 aggatgtatt cggctgggcg cagtggctca cgcctgtaat cccagcactt tgggaggccc 62161 aggcgggtgg atcacgaggt taggagatcg agaccatcct ggctaacatg gtgaaactct 62221 gtctctacta aaaatacaaa aaaacaaaaa aagaagaaga agaaggatgt attcaatatt 62281 catttaacaa gtatttttgt gtatctttta cgtgcttcta tgtgcactgt tctaagtact 62341 aagcagcggt tttgaacaaa acaaatctga gctctcacag aaaaacagga ttgcaattga 62401 acaatgagaa cacatggaca caggaagggg aacatcacac accggggcct gttgtggggt 62461 ggggagaagg gggagtgata gcattaggag atatacctaa tgttaaatga caagttaatg 62521 ggtgcagcac accaacatgg cacatgtata catatgtaac taacctgcac gttgtgcaca 62581 cgtaccctaa aacttaaagt ataataataa taataaaaaa aacaggattg caacaaatgc 62641 tatgcagaga attaaaacag tgatgggaca gaaagtgatg gggtgactac atgagattag 62701 gctgtcataa aaaatcactc tgaagaggtg aaatttaagc tgagatccaa ataacaaaga 62761 gccagccaaa gatcagggtg aagagcattc attgctgcag gactagatgg ctggcacaga 62821 aaaagtccaa gtgtattagt tattcacaat aaatgtgact ggtttgaatg tccctattta 62881 ataacaaaaa ctatatgtga ttagggttca aaaaaattct agccaatatc tgtttgcaag 62941 aaattcacct aaaacaaata ccaaataaag gatggggaaa agcaggggta gaaaagaaac 63001 atcagcaaat tctaacaaaa agaatgttac agttagcaat ataaagagga attttaggac 63061 aaagtattaa acaatcaaat aagatgattt tacaataata aaatatacaa tccatcaaga 63121 agatttaata gcaacgaatg tttaaatacc taataatata gcttaaagta tcttacagaa 63181 gctgcgcaaa ttatgagaaa cagttacatc aaaatcatag ggaggattta acataccctt 63241 cagagatgga gaagccaaac acacaaaaaa atctagaact ttgaaactag acagcaaatt 63301 ttattttatt ttcctttatg ggtaagccac aaaggacatt taataaattc caaaaaatag 63361 aaattacaca agtcatattc tctgaccata atgcgatgga aaaaggaagt tttaaaatac 63421 tcttctcagc tgggtgcagt ggctcacacc tgtaatccca gcactttggg aggccaaagt 63481 ggggggatca cttgagccca ggagttcaag accagcactg gcaacatagc aagaccgtgt 63541 ctctatggga aaaaaaaaaa aaaaaaaaaa atttaattag ccaggcatgg tggctcatgc 63601 ctgtagtcct agctattcag caggctgata tgggaggatc ttttaagccc aggaatttga 63661 ggttagagtg agctatgatc atgccactgc actccaacct gggtgacaaa aggagaccct 63721 gtagcaaaaa acctcacaaa aaacaaaaaa catccttctc attaactttt atgttaaatg 63781 taaataaagg tgattagact attttgaaat gaaagacact gggaagatca caggtcaaaa 63841 tctgaagagt atggccacag cagtacttaa aggaaaatgg gaccttagac atattttgtt 63901 atgaaatcac aaagattgaa aatatataaa tagttaactc ttaagatggg taagttattc 63961 ctaaacaaca cacaactcca gaagtctgaa agaaaaagat taacatattt gattatataa 64021 aaatttaaaa cttgaacata agacatcaaa aacaaggtta aaaaacaagc aacagactgg 64081 gaaaaaacat ctgcagtatc ctttaagaca aaagacctga tgtctgtatt atataaaaag 64141 ctttgacaaa ctagtaagaa aaagagacat tatagaaaaa caaagaatag aaatagataa 64201 ttcccaatag aagaaatggc caatgaccac atgaaatgtt caaaaccaat taatattcaa 64261 ggaaatgcat taggtcattt tttaaataaa tatttcttga gtaatattct agctaacaag 64321 taacagacaa gcattacatt tataatataa tgtatttggt ttatggccgg gcatggtggc 64381 tcacgcctgt aaccctggca ctttgggaag atgaggaggg aggatcactt gagccccgga 64441 gttcgaaacc agcctgggca atacggtgaa accccatctc tacaaaaaaa aaaaaaaaaa 64501 aaaaaaaaat taggcgtggt ggtgtatgcc tgtaatacca gctactctga aggctacttg 64561 gggggattgc ttgagcccag gaggaggctg cagaaccctg atcgtgccac tgcattccag 64621 cctgggtgac aaagccagac cctgtctcaa aataataatg ataatagtaa tagtaataat 64681 ttatttggtt tagaggataa tgatactaaa gatatgttta ctataattgt acactgagaa 64741 acacacatga aaaccaatgc tttttcatat attaattgaa taaacacaag tgctagggat 64801 acagtgtcaa aaaagaataa taggacacaa tgagataata tttcataccc actaggcaga 64861 taatattaag tgttggtgag gatgtggaga aattagaatc atcatacatt gccggtgaga 64921 atgtaaaaat tgtacaacca ctttggaaaa cagctgggca gttcttcaga aggctgaaca 64981 gagagttact acaatactca gcaattctac ctttaggtat acaccaaaga gaaataaaaa 65041 catgctcaaa caaaaactgg tatgctaatg ctcatagcag tagtatttaa aatagccaaa 65101 agtagaaaaa aacccaaaca tctatgaact gataaatggg caaaccaaat gtgttatatc 65161 aatataaggg aatattatta gcaataaaaa ggaatgaaga tctcccagca ctttgggagg 65221 ccgaggtggg tgaatcacct gaggtcagga gttcgagact agcctggcca acatggtgga 65281 accccatctc tactaaaaat acaaaaatta gctgggcgtg gtggtgtacg cctgtaatcc 65341 cagctacttg ggaggctgag gcacgaaaat cacttgaacc caggaggtgg aggttgcagt 65401 gagccgagat cgcaccattg gactccagcc tgggcaacag agggagactc cgtctcaaaa 65461 aaaaaaaaaa aaaaaaatga agcactgata tgcactataa cgttgaggaa tcttaaaaac 65521 attatgctaa gtcaaagaag ccagtcacaa aaggctatat gttatatgat tccatttaca 65581 gaaaacgtcc agaatagaca aatctatagg agtgcagttg cttagaggct gaggggataa 65641 agaaaatggg agatgactgc aaaagggcac agcatttctt tttgggctaa tgaaaatatc 65701 ctaaaattga ttgtggtgat tgttgcacaa ctctgtgaat ataataaaac tcactgaatt 65761 gtacacttta agtgggtgaa ttgtatggta cgtgactttt atctcaataa agctattacc 65821 aaaaaagtat aataggaaag aggaacaaaa cttttaaaag attttaaaag taatttatat 65881 attattctaa ccaatcagtt atcaaacttc ataatataat aaacatcaaa aaggagttct 65941 cttgaaacag tcaattttat tcagttctgg tgtctaatat actaataatc aatgaagaat 66001 ataaggagaa agtgatacaa ttacagtctc agcagtggaa tttattgaga caatacattt 66061 taaatttaaa aatgtattca taaagcttat ttatttgtat tatttttaat ttaaggttta 66121 ctattgtttt tcttgaagta atttttcctg cccttgttat tagggtgggc tttatgtact 66181 atttaatctg atttccacac atattattaa actggtattt aatatctaca aataataatt 66241 ttgtcactat tttctcagga ttatgaccaa catgccatga aactgtgatc ctagtatatc 66301 acgcctggac ttgattttta aaaagcaatc atattcaagt gactgtgtcc ttaacagtga 66361 ttccattaag tgaacaacaa atttacgttc agcagtgatt gtgagaaaga aacacagggg 66421 aacttaggaa atataaatag tttcagttta aaatcatgta aaaataaaac tataaaatta 66481 atgggatttg catcaattaa aaattattaa ataatatact cttgcatact atacattgaa 66541 tttggctaca accttctctg ctttacatac atttttcaaa attgacatag attgtatgaa 66601 ttacaaacaa gttccgagag gtgattttac tacatacatt tccatacttc agtcttcatc 66661 ttacacaaag gcagatgtga cttaggggct ctctgtccag gactgaggta tttacctgac 66721 atatctcaga tatcagtcaa tcaactgtca gcatttattg atcatcttcc aggatattaa 66781 tgctttgcaa gcatctctgg gtaagtgact aaagatttcc ttggggcaat aggaaagagg 66841 ctcagattct tctaaccgat gatttaaaaa atatttcagc atgctttgtg cacgctgatc 66901 actgaatatt catcaggagc tttcagtcag cttttactgg atgcatatca agtctgtgtt 66961 gttttacact gggttccttt ataggttcac aattggtatc atttacttta acaaattgac 67021 atttattctc tgaatgggtt tagtatttcg ccaaaccttc tgctttgtaa agacatgctt 67081 tttttttctt tctgagatgg aatttcactc tgtcacccaa gctggagtgc aatggcacaa 67141 tctcggctca ctgcaacctc tgcctcccag attcaagcaa ttctcctgcc tcagcctccc 67201 aggtagctgg gattacaggt gtgcaccacc gcacctggct aatttttgta cttctgctag 67261 agatggggtt tcactatgtt ggccagactg gtcttgaatt cctgacctca agtgatctgc 67321 ccgcctcggc ctcccaaagt gctgggatta caggcgcgag ccaccacgcc tggccaagac 67381 atgcttttga tagtaaatct tcagtgtatg gaattccagc acaggtatat caaattttct 67441 gaatcacttt ttcttggagg tttatcttga aagatattac attaaagaaa gagaccctgg 67501 attatccctg agtgatttct caatccctga tcatcctctg ccttctctcg gccagagcct 67561 tccgggttca cccagacttg gtatatttca aggcatttct caattccttg catggggaac 67621 tagctcaagc agacactaag agtttcacca tggcatggag gaaaagatag gtgaccaaaa 67681 tggttgaaac tgcaccaaat gctagtaagc agttgaaatt atggcaaaga aggagtaact 67741 ggatttggca acatggaaat cattggtgac tccttaaagt ttattagtaa aagcttgcta 67801 gcaattaatc tattggtgga ttctatttcc ccattaaaaa acaaaacaaa acaaaacaaa 67861 acaaaacaaa acaaaacaaa acaggccgag tgcagtgtca aatgtctgta atcccagcac 67921 tttgggaggc caaggcaggt ggattacttg agctcaggaa tttgagacca gcctgggcaa 67981 cacggcaaaa ccccatcttt acgagaaaaa caaaaactag ctgggcatga tatcatgcac 68041 ctgtagtctt tgctactcag gaggctgggg tgggaggata gcttgggccc aaaaggtgga 68101 ggctgcagtg agctgtgatt gcaccactgc acttcagcct caatgacaga gcgagaccct 68161 ggtctcaaaa aataaaaaca aaccatcaga aacacacaaa accgcaaaag ctgtacaaag 68221 ggttagaata aggggaagaa aatgtttcct gtgttgtact aaaagaaact tatgactaaa 68281 agtcataaga tggtggacag agtggataat atgaagttct gaggttgtcc aaaaagagca 68341 tgtaaaacaa agcagactaa aagctttgca agggcacaga ctatgtcttc caatttttag 68401 cccagtgcct agcacatagg aggtggtcaa tacatatttg ttgaatgaat caagacaaga 68461 aggaaaggaa ggagggcttc agaatggtgc attaaaaaaa taagaacgtg aattatttcc 68521 cccaataaat atgtattgag tacttattct aaaagtaatc tgcctacaac attcatataa 68581 aatgttgtct tttaaatgcg gcataatttc tagatttata tatatttgca tagaatgttc 68641 tggaataaac tagaaattga agcattatat aaaagatact ctggaacata agataattac 68701 aggtctgggg acctttattt gcaattccaa aatctaaaag cttctgaaaa tcaaaaagtt 68761 ttggaggaaa atttgacctg aactgatgtt agctagttta tttacacact ttatttacat 68821 agttagttag atataactaa atatagactt cactgagaaa tattaatatt tttggaggca 68881 ctggtccagg ccccaaaaat gctgttacat aatatgttct taactttcta aaatctgaaa 68941 aattccaaat acattcaaag ttctgaataa gatattgtgt atttgtataa aactcccttt 69001 caaaaactag aaagaagcta tgagaataat gatttgatca ataatggaag tagaattcag 69061 ttgtcaccaa tctaagtaac atgtttttaa atacatttgc tatgaagaaa acaattagga 69121 ttagaatata gtcataagct tttgattttt ctgttgttat tattattagt taattaattt 69181 ttgagacagg gtcttgctct gttgctcagg ctggagtgca gatcacagtt caccacagcc 69241 ttcatctccc aggctcaggt gatcctcctg ccttagcctc ttgaatagct ggaaccatag 69301 gcatgtgcta ccatgcccca ttaatttttt aaaaattatt tatagagaca gggtctccct 69361 gggttgtcca agacagtctt gaacttctgg gctcaagcta ttctgcttca gcgtcccaag 69421 tgcggcgatt gcaggcatga gcaaccacgc ccggatggtt tttcttctat tattcacaaa 69481 gggaatgaaa acatattttg aatcttttgt ttttaaagct taagattccg aacttaccct 69541 ttcacttttc acagaacctg tgacatcatc atcattgagg tggcgcttaa acttgggagg 69601 catatttttt acttgcaaag tatcttttgc ttctcaacag gatagtcacc accttaaaga 69661 gaaaataaat ttcatcagat ttgagacttt ggggcataag ctaatgtact taaaacagtg 69721 tacatgaaac cctgtatgac agggactccc tcatttttct gacttcaatt cctgccctct 69781 cacttgttcc actccagtga ctggggcctc catgctagac ctcaacacct aagaaatagt 69841 taaaattatg agaaaatgga tgtaattgcc ctaagagaac atgtgtagga tatgagtgat 69901 ttaagatttt tataaaatga aagagcagag agtagaggga ggacgtctca ctgagacatt 69961 aaatgcttta caatttaaag ttggagtcag aagacaggag aacatgccta tttctctcac 70021 ttttagttgg ttgcttaagc aatgattcag tttcctcagg tgtaaaatac tgattttcag 70081 gcttctgcag agttgggacg gtcaaattgg ataatacatg caaaaatatt ctgcaaacta 70141 taatgtccca ttggtactcc atgaaaaggg gtaagtttgg taagtactgg tgtgatatcg 70201 tgatttacaa aatatactca tttggtctta atccccattt cctgacatat ccttggaatc 70261 gctgcagtga tgtgtcttgt gtatgctaat aagatgactg gtagcttagc ccccagatag 70321 attccagatg gaggctggtc acagaaaaac ctaggcatga ttagaagact gggactttac 70381 agccccaccc cctagcttcc tggaagggga ggagggctca aggttgaact gatttctaat 70441 agccaatgat gtaatcaatc atgcctacta aggaagactc cataaaaccc aaaaaggaca 70501 gggttcagag agcttcccaa aagctgaaca agtagagagt tcctggacgg tggcacacct 70561 ggagagggca cagaagctct gtaccccttg ccctatgcac ctcttccatc tggctgttga 70621 tctctatcct ttataataaa ccagtaaaca aagtgtttct cttaattctg taagccactc 70681 tagcaaataa actgaacaca aggaagggat tgtgggaact ctgatttata gccagtcagt 70741 tggaaacaca ggtaaaacaa cctggaactt gtgattggca cctgaagtgg ggggaatctt 70801 gtaggattaa gccctcaacc tgcaggatct gatgctatct ctaggtatac agtatcagaa 70861 ttgaattaga ggacacccag ctggagaatc tgctgcagaa tttcctcctt gcttgtgagg 70921 agaaatcttt tgggggtctg tgttgagaga gaataggaga aactcaattt ttttctatca 70981 aaacaactgg attaaacaaa gctagttttc tttgctacag aatgactcac atcctgttat 71041 atgctaatat gcattgtgac tttccaggaa aatgcttgtg tatgcaacat actaaaattt 71101 ttttttgcga aaaagttctt ctatgcaaat aatgaatttt cagtattttt gactagatgg 71161 tagatgaata cccatcaatt caggatagct cagaaatact gaatgtttta ctatgtgcta 71221 ggcacggtgc tcagtgatac acatgcattt tcttttactc tttatgatgc ttctttaaag 71281 taggcaccaa gaaactagga gaaaactaaa gtttagtaac gataacgtta gtcactacat 71341 tcattgaaac agcagagata caggcctaaa ttaatcaatg gctgtgcatt atcagtttta 71401 atcatttgtt caattagttg cccatcaggt ttacagaatt ctgttctaga aagactcact 71461 tcctgctaac aacccaatag tcctgcttcc ttttccataa gaagaaacac ttagaccttt 71521 aaattaggta tagttagaca tcagtttgtt tgttaatgat atacacttaa aaaagaataa 71581 tcccaggatc tggcttcacg aaccaaataa aagtaatgtt ttaagtcagc tgcttgtcaa 71641 aaccaaagga cacacttttc tttgataaga aatgcttcca aatcagaaat tattcacttg 71701 taacaaaaac caaattcttg ctgcctgagt aacagctctt gagttcttgg ttccacaata 71761 tttctgtcac atacatctgg cccttttatt cctcatactt tgatgtcaga atacttcatt 71821 catgcagaaa tgtaacagaa tgcatggaaa aacagatgac aaggcaaatt actgaagaac 71881 ttgtgctttt aactattatc tgtatttaga ttttaacccc tgcaatgagg ctgcctcaac 71941 atcccaaagc aaacaacggg cagacttata taatgccttt ataagacata cagtaaaact 72001 acaatcttaa ggatgagcaa acaccatatt caggtctctt aatcccaaga aaccatgtgc 72061 aacaaatagc ccatcagcaa agcagtaggc agattaatta gctaccttga tgaactttga 72121 tgttatttgc aactgattat tagtaactga tatgtgtatc aaacaactag tagaaataaa 72181 tcatgctcag aaagtagact aggatatagt ctgcctaaga tttctcctac aaaaaaagac 72241 atgtgaattc agctgtttgt gactaggtat acatgtaaat aagagagcca tcatgtactt 72301 ccacattgag gatagaccac cactatacaa ttcactcaaa tgagtcacat caccgtatgc 72361 ttcttataca tcagcatata cttaaatgtc tatgtgtgcg tttttgtaag tagtacgtta 72421 tctagtcaat agctttgaat ctgggcaaag gctctccttt tcccagcctc cattctcttt 72481 ctgattcgtc ttccttgtct gtcatcagcc ttcttgcttg atgcagtttc acagaaataa 72541 agtgacttgg atggacacag accggagata ttatttgcaa aacctgtcaa gtgtaacgat 72601 tcacccccac cttccggtca ttataaacac agcagggttg gcgctctttc ccccgggccc 72661 tttaaataca caatttcagg cagctccgtt caccccccac acccaatctg tcacaccagg 72721 ccacgtccca cgatatatct gctcctaacg ctggcccttc ctgccccatt gtaccccctg 72781 gaattcctca ctacatccca ctcttaacaa accttcccca tccatggtac tgttctcttc 72841 ccatcacccc agagtacccc tcaattccaa ttcttctctc cctcaactgc tcctcgtccg 72901 catacccctg tcctcgtctt ccttcctcac ggcacccccg gccccaacct tttcccactc 72961 cggggtccct tcctagatcc ccgaggcccg ccgaccagcc cgactaggac gtcggggggc 73021 ggcaaggcag gggcgtcttc ttcgtttcct tcctcctcac accagcccct cccgctgcga 73081 acaggccagc gccagtacct gaggctcccg gggaactccc ggcagctctc ccgggctggc 73141 ggtgctgggc gggcggggga aggagtagga agccgaggta agaagctggg agggggagac 73201 agttggtgcg gacccggcgt cggccgcagc gccgcagcag cgcctccgct catcgagccg 73261 atcggcgctc ggcggcctgg cgccaggcgg agggggcggg tccagggcac ggagataatc 73321 gaggaagtcg atccgccgag ccctattcac ctaaaagagc ccgaacccac acgcagttta 73381 gatgcatctc tccaaattta tgctgtgctt taggggagga acaaccactg gttgtccacg 73441 ttccggtcgt gatccggcgt gcgtggaggg cgattagggg aggagtggga aacttccgag 73501 aaccgctcgg gactgggcgc cgcgccggag tgcgtggctg gggtgatgag gcgttagaaa 73561 gaaggtggtg cttggtggca catccgtccc tctggctgac tacgttgagt ggcaagtgga 73621 gagtggctcg cagctcgagg acctgcttcg ggttcgtctg gggccttttc tgaagaaagg 73681 agatctgttt gatcccctgt ttacagaatt agagctagac ttggaaagag atgttaaggc 73741 caggctcctc cctttagata ggaggaatat ctggttcaga gaggagattg acttaaagtt 73801 aacacggcta gtttagatag atacgggacg agggctcggt gttcttttcc cggtatccct 73861 tagttgtaac ctgaaagcct tcaggaaatc gccctcgcca ggccattccg ccactctgcc 73921 cattccttcc tcactccctc cccctccgcc caatttggta tcatgtttag atctccacgc 73981 ccgcagtagc tgtctaccag aaacaattcc taattttgtt tttccgttct cttatttgtt 74041 ttattatccg ttcctcccac gccccagctg gattatttta ctcattgaat catattcact 74101 acacatcttt taaaattaat acagtatcta tgggtccttt gttgatacag tattttaact 74161 gtctgaaata ctgtgccagg cactttctaa agacataaaa ataagagggg gtgttcaaga 74221 acaagagtag acaagaaaac agcaggagga aagatctgcg gacactaaag gaaaggaaag 74281 aatagacacc ctcttaccaa aaagttgttc ttcttttgtt tgatgtttat ttttatgtaa 74341 aaatgtacta acgaatttga aagatgccat gactaaggca acgatgctcc actctgattc 74401 acttggagtt gaatcacttg gaaagctttt aaaactacgt aggtgcccac gtgctaccta 74461 gattctgatt taaatgttct gagatttaga aacagccgtt gagttgtcag ttgaaggttc 74521 tgaaagctta atgctaaact cagttgcctg tgtttattca gtccaagttg ctgtccaagt 74581 cagcaacttg gactgaaacg gcccacttag aagagaagcc tctcatggtt tgtatccctg 74641 tctaggtgtg atagtgggtc tacccccaga ctccacaggt aaccctgaca ttattttttc 74701 ttctacagct gatctttttt ttttttttga gatgaagtct cacttcttcc cccaggctgg 74761 agtgcagtga cacaatcata gcgcacagca gctctgaact cctgagctaa agtgatcttc 74821 ctgcctctgc ctcccaagta gctgggatta caggcatgag cagctgtgcc cagcacttac 74881 agttgatttt atgtctgtat atttgcatgt tatgggctat tttctggctt tggtgcctta 74941 gccaattaga gtttagctag ttatagcctc atctcttaga caacttaagg tgtggtcagt 75001 gagctctaac aggcacgtca ttgcatagct tttacagata atccattctt tgggtttttt 75061 tgttttttaa acataattct ttagtcctgc cagtcatttt ggaatgatcg cttaggagag 75121 gggtaatgca agccccaccc ttaatcacct gaacaaagtc cctttgacta accccaatat 75181 ttgtgaaact tagaaaatgt cattttacag ttaacatgaa tcaaatatac cagtaagcct 75241 aaaagattat aatagcactg agcttagagt cagaactcca agcaacttga gttacttatt 75301 tcctccattt tctttttagt gagcttcagt tattcagatg ttagtcctct aattttcttt 75361 tccccttttt caacatgaca ccttcatcct taagagggcc tgctgtcctc ctagctagag 75421 aatcagcgtt ttgctgtctt tcaggactaa atctctgggt ttttgatggg atagcaaagg 75481 gaatctcagc atctgattca ttcttaaaca gactttatac cattccgcct gtatttatag 75541 ctcacatcat gctctcttca gctttacgtg gtgtttccaa ttcctgagcc ttatacaggg 75601 tttcactgtg tgtatcaggt tgcttcatgg cctgcttggg attcctactt ttcaagtctg 75661 ctaaggcaat tgcatcttga ttcatctccc ttctagcttt caatattttg ttgctaaatg 75721 ttcaacatta cttaaaatgt aaaagtgaac aaggaaatac catttttcat ctatcagtga 75781 ctgagaaaga aggagttgag cagaagatct gcaatccagt ctctcccttc gtagggaaat 75841 actctcagga aagcaaggtt aaagcaagca aaaaagcatg tgatttttat aaaatataaa 75901 gtatatacta attacagacc tcctggcaaa caaactaatg gcatttgcta gcaagagagt 75961 ccatatttac ggaattcctc taacaggatg tcccagggct gggattgttt atggtaaaca 76021 cattaagaat cgatgtggta attccagcac tttgggaggc tgaggcaggt ggatcacttg 76081 aggtcaggag ttaccagcct ggccaacatg gcaaaacccc gtctctacca aaactacaaa 76141 aattagctga gtgtggtggt gtacacttgt aacaccagct actcaggagg ctgaggcaga 76201 agaatagctt gcactggaga tgcggaggtt gcagtgagca gagatcgcgc caccgccctc 76261 catcctggtc aacagaatga gactccctct cacaaaacaa aaaagaatca atgcggattt 76321 caggggtatg attaactgga aaataccgcg gaagttatgt aacgtagcat aaaatggaag 76381 tgttttgtaa ctgatttggc tccttctgtg taagtggggt gactgtacac ttgatttaga 76441 aaccacatct actgtcctgg gccaggatgc atccaaagca agaagaaagg acctggggat 76501 tactggtatc cttgaaacta ttagtaaagc ttgatactga gtaaaatctc tgatttgagc 76561 ctgatttgac agtttgatct tttgtgttag ctcaatagca aaaataaaaa aaatttgata 76621 ctacaatatt gggtgtgtgt atgtgtggta tgggagcagc gtgggaggaa caagcattct 76681 taaacagtaa gagtataaaa attggtgcat ttttaaaaat ttcatggtaa ttatatgtat 76741 caaaatttta tatgaagttc tttttgatct aataatttta cttctaaaat ttatcccatg 76801 ggtatacagg ttgtatataa aagtattcta taaaaatatc tgttacagga tttttgtaac 76861 agagtgaaaa aaaaaactct aaatgttcag caataggaaa ttgctgaaat gagtcatagt 76921 ctagccattc aatagaatag tatgcagcca ttaaaatgaa ctagatagat ctatatgtgc 76981 tgacatggaa tacctctatg atgctataag gtgagaaaag taaggtgcaa aatagtgtgt 77041 agactctact tccatttgag catcacacac acacacacac acacacacac acacacacca 77101 ggaggactgt gaatctggca tggaaagtgt tatggactga attatgtccc ctcaaaattc 77161 atatgttgaa gtcttaactt ccagtgtgac tagatttgga gatagggcct ttaggacaat 77221 aattaaggtt aaatgaaatc aaaaaggcaa aaccctaatc ctgtagggct agtgtctgag 77281 tgcatgggca cagagaagag gccatatgag aacacagaag atggccattt gcagtccaga 77341 aagagaggcc acaccataag ccaaccctga tgactccttg atcctggact tttagcctct 77401 aggactgtga gaaaataaat ttctgtagtt aagccaccta gtccatggta tttcattata 77461 gcagccctag aagactaata cagaagggaa actttcttga tgtgtagtgt tttgtactat 77521 tggaatatct ttaccatgta tatcagtttt tcaattaaaa caatggtaga ggcactctct 77581 tcccctaaaa ttgggaccac tggaccagtt aatttaaaaa gtcagttaca gtggtacata 77641 tttaaaaatg gcaaatatat actttagtca ttttattttg actcaaatgt atatgttcac 77701 tcaccagtct tggtagtagg cccaagacct tttatctgag catgaatttg ctggttggaa 77761 cacctgcttt gtctttgtta caagttgcat tgtttacagt ttacagtttc attttcttca 77821 aagtggaata agaagttaaa gacacctttc aagcaatttg aatgcccagt cagtctagat 77881 ttttgtacaa ttttttctcc aaagttttac aggtttgtct tacctgtgcc tagttctata 77941 gtaatttttg tttgttgttc atgtttatca atttagtttt tatagaaaat aactttctat 78001 ttgtatttca ttagtttaca tatttaaata tattctataa gtataatagg cataaatact 78061 ttggagtaca aatgcaaatt gactcttgta agcatgtaac tcaattagtt tggagaagaa 78121 gtatacagat atcatgtagt taaccataag cttttcatgc ttagagtgcc ttggtgttat 78181 ataaatatga tattagttta ttttagtttt gattttgaat gcttttaaat cctctatcca 78241 gattcattaa taacgctttt ctaatgctga cctcaactcc taaatggatt tcatgacagt 78301 gtgaataaaa acaaaatagt tcagccctct ttttgtctcc aggtagatga agaaacagag 78361 tgccagagag tttacgtggt ttgttaacat ttcactaagt ttgctttatt acttctcttt 78421 aaatttatat ctcccctcgg catgcattat ttgaaaatta gttatagcta tcatgcctct 78481 ttacccctaa atacttcatt agatatctcc taagaacaag aacattgtct tcataactgt 78541 aattcagttg tcaaattcag aaaatttaac attgatacaa tattattttc tatcaaagca 78601 tccatatcaa agtttgccag ccgtctcagt tatgtccttt gcaatgtgag tatagtctac 78661 agagaaaata gtaatgagta acaaagcaag aggaatttag tttttaaaga taagttcaac 78721 tatagtttaa ttatttttaa tatatataat aatatgggaa atgccctgat gccttcctca 78781 tgaaacttta gatgtgacat tataagatgg tctcagggag gtgctttgtt tctcttctaa 78841 ctgttcttgt ttattgaggt agtgacatgt tgcaagaatt ttatcttgga gaaattacct 78901 cccacaaaga tggcaacatt agaagaacct gcataataat atgctctaat tgtgttacaa 78961 atttatttta ctactctgaa agttaaaaat ggttaccatt tattttcttg gtttcatgcc 79021 tgttttctta agaactatgc ttaatgatta gcatatgttt ctgcacaaat cctctctgat 79081 atcaagtgtc attttcctca ttttatcatt tagacctcaa ggttctgaga tgacgtgtga 79141 tttgtccaag gttagagaat ggtacaactg agaacactag aaaatatcgc agagcaataa 79201 gcttaaattg agataaatat tttctctaaa tgaatcacat gcttttccct tataatagag 79261 attaggcggc tgggcgcggt ggctcgtgcc tgtaatccca gcactttggg aggccgaggc 79321 gggcggattg cctgaggtca ggagtttgag accagcctgg ctaacacagt gaaactccat 79381 ctctactaaa aatatgaaaa aattagcccg gcgtggtggc acacacctgt agtcccagct 79441 actcgggagg ctgaggcagg agaatcgctt caacctgaga ggtagaggtt gcagtaagcc 79501 aaaatcgtga cactgcacct gggcgacaga gggagactcc atctttagac ggagaatcca 79561 tcaagcataa tgtcttgtta tgatcgcact tgattcaaga ggctttagtc tgcttaaggt 79621 aaaatacaac tgtaatttct tgctgaagat ggtaatattt atcttgagtg aagactgtac 79681 tttcagggta caacaaacta taacatagaa ctgtgttata gttctataac aaactataga 79741 aatgatctat agtttgttaa tcttgggtgc tatatcaaag tgaatcaatt tgtccatcag 79801 atatgtgtga caggtattct gttaaagtct ggggatacaa aaatggcaaa cagtgttact 79861 gtcctcttaa aaacttacaa tcttgcacag aaaaacaaga ctaggcaact acagccatgt 79921 gccacagaag gacgtttcag tcaacgaagg actgcatata tataacatta taattgatct 79981 gaaaaattcc acctaacgat gttgttgctg ttgtgcaatg gattactcat gagtttgtgg 80041 tgatgttggt gtgaacaaac ctgcactgcc aatcttacag aagtgtagca tatacaatta 80101 tgtacagtct gtgatacttg atcataaatg actcttacca gtttatttat tatactatat 80161 atcattattt tagagtatat tccttctact tatacaaaaa gtttactgtc aaacagcctc 80221 aggcaggtcc tttgggaagt attccagaag aaaacgttgt taaattagtg tgtgttaaaa 80281 tagtgtgtgt tattgttcct gaagaccttc cagtgggaca agatgtggag gaaaacagca 80341 atattggtga tcctgaccct gtgtagacct agactaatgg gtgtgactgt gcattagttt 80401 ttaacaaaaa agtttagaaa gtaaaaatat atacaataaa aaatttttaa aatagtaaaa 80461 aatggtccag gtgtggtggc tcatgcctgt aatcacagca ctttgggagg ccgaggcagg 80521 tggattgctt gagcccagga gtttgagact agcctgggga acacagtgag accccatctc 80581 tgcaaaaaat ttaaaaagta gccaggcatg gtggcacaca cctgtagttc cagctactca 80641 ggaggctgag gtaggaggat tgcttgatcc tgggaggtcg agtctgcagt gggccctgat 80701 tgtgctgctg ccctctagcc tggtggcaga gcaagactct gcctcaaaaa aaaaaaaaaa 80761 aaaaaaaaaa agtaaaagct tatagagtga aaatagaaaa tatttttgta cagctataca 80821 atgtatttgt gttttaagct aagtgttatt atgagtcaaa aagtttaaaa gtttatagag 80881 tgaaaaatta tagtaagcta aggttaattt attggagaaa aagttttaaa aaatatattt 80941 agtgtagcct aagtatacag tgtttatcaa gtatacagtg gtgtacagta atgtcctagg 81001 tcttcaggtt cactcaccac tcactcactg acttgccaga gcaacttcca gtcctgcaaa 81061 cttcattcat gaagtgccat atacaagtgt atcatcttta aaccttttat accatatttt 81121 tattgtgcct tttctatgtg tatctaaaca aatacttgtt atttacaatt gcctacagta 81181 ttcaatacag taacatgcta gacaggcttg tagcctagga gcaataggct ataccatata 81241 gcctagatgt gtagtaggct ataccatcta ggcttgcgtg cgtacactct gatgtttaca 81301 tgaggactaa atcgcctaac aatccatttc acaagacata tgcctgttgt caagtgacac 81361 atgactgtat tgaaaactga agtaaattac agagtcaaat gaacaatgga gtatatatat 81421 aatttataca tatacacagg tacatgaata ccttagcaaa taaaattgtc taaggtgatc 81481 tgtaatatta aattgttgtt taacaataaa ttaagatcac aaatggaaac aaatagggat 81541 gctgctatac ctcaggaata gcaataggag gaagccacga ctattcatag gcataaacag 81601 tcacaagaag ggtcaataga aaaaggcaag aaccatagtt gtaggagatg ggctgtccaa 81661 tagaggaatg cagacattgc taacccagcc ctgacctatt tctcttccca gaaacctgac 81721 ctccgggttt atccatgttg ttacaaatgg caaatgacag gattccatcc tttttttttt 81781 tttttttttt ttttgagaca gagtttcagt ctgttgccca ggctagagtg ccatggtgtg 81841 atctcggctc actgcatcct cccccaactg gccccacccc ctgatttcaa gcaattcttc 81901 tgcctcaacc tcccaagtag ctggggctac aggcaccaat caccacacct gactaacttt 81961 gtatttcaaa attttgtatt tcatcatgtt ggacaggctg gtctcaaact cctgacctca 82021 gatggtctgc ctgcctctgc ctcccaaagt gctgggacta cagacacctg ccaccacaac 82081 cggctaattt tgtgtattta gtagagatgg ggttttacca tgttggacgg gctggtctca 82141 aactcctgac ctcatgtgat ccgcccacct tggcctccca aagtgctggg attacaggca 82201 tgagccacca cacccagtcg atttcataat ttttaatggc tgaatagtat ttcctttttt 82261 tttttttttt tttttttgag accgagtctc accctgtcac caggctggag tgcagtggtg 82321 caatcttggc tcactgcaac ctctgcctcc caggttcaag tgattctcct gcctcagcct 82381 cccgagtagc tgggactaca ggtgcgtgcc accatgtcca gctaattttt tgtattttta 82441 gtagagatgg ggtttcacca tgttagctag gatggtctcg atctcttgac cttgtgatcc 82501 acccgcctcg gcctcccaaa gtagtggcat tacagacgtg agccaccgca cgcagccttc 82561 tttccttttc tcttagaaat gaacatgctt taacatttac ttatgagtca tggctcacat 82621 gtatagcaag aaagggtttg gtgaacaggg aaattctcag aataggatga acacaaaaag 82681 gaccataatg atgaagtaca ggcagcacac tccatctcag aacagtcacc agccaccaag 82741 gaattccctg atcatctgaa tgggactgaa aacaatccag tgattcccca gaggctcctg 82801 ctcattgact cttttccagt agatgacacg cagaaaaata gctaaaaatg gaggcccaaa 82861 gtaactagcc attgactctg cgtaatggaa gagtgaccca tttggagata tctgtaccaa 82921 tgaaacccaa acaatgtggt tcagagttat gaggatggta acctttaggc ctaggtgccc 82981 agtctatccc actcaatgta ggaactgtga agcagagggg attccatgtg aaggggagaa 83041 tagcagtttc tctcttggcc aattgggaag gctgtacaaa tcagggctga tcatccctgg 83101 cctccccctg agaaacatgg gcagcagctt caggtggcca cagagaatgc aggcagcctt 83161 cttaaaatac ttctccatgc ttgataggaa gtgccgggca ataacacagt accaaaatta 83221 cttcatcctg tcttttctgc atgtaaattg atttgttccc ctgtactgtt ctaaggagcc 83281 cagcacaggc aatacagcac acctactcag ctacagattt ccagtttttg tttctgctta 83341 taatagtttt ggtattttcc tgcaaagttc taaaaataca cttaaaactt ttcctttggg 83401 actcgcaatt tcaaaaaaat aaaatttttg gttcattaca aaatcatttt acttgcctgt 83461 gctacccaac tcttatttct cattattatt ttccttttgt ctagtggtta cgtttgtaac 83521 ttttaaagtg ctttctcaag ctctatttct gggtctatca tgtttatgta tctatgtact 83581 taatattcac aaggtcccat aacaattgga atccttacgt gttttttaca caacccactc 83641 ccaaattctg tcaactgtaa cttttttttt cttttttttt ttgagatgga gtcttgctct 83701 ctcatccagg ctggagtgca gtggcgtgat cttggctcac tgcaaccttt gcctcccggg 83761 ttcaagtgat tctcctgcct cagcctccca agtagctggg actacaggtg catgccacca 83821 ctcccggcta atttttgtac tttttaatag agatgaggtt ttgccacgct ggtcaggctg 83881 gtctcaaact cttcacctca ggtgatccac ctgccttggg ctcccaaagt gctgggatta 83941 caggcatgag ccatcacacc tagccaactc taacattgtt tttaaatggc taattttcat 84001 agagtttaaa tggttctctg taattgtaat taagttgtcc atactttgtt tctatattga 84061 tttaatagaa tttttgctat taagattgtg taaatttctt tctccataga tccaagacat 84121 acatgtgagt gaccacagtg aataaaaatc acatctggga agttccaagg taatattttt 84181 gggtatcatt ttttaaaata aatactattt tttttttttt tttgagacag ggtttctttc 84241 tgttgcccag gttaagtgca gtggtgcaat catgactcac tgcagcctca acctcctgca 84301 ctcaagtgat cctgccacct tggcctccca aagtgctggg attacaggtg tgagccacca 84361 tgcctggcca agtatcattt tttaatgttt ataatctctt atattctcaa agtaatgttc 84421 atcgtaatta actttatact tagcccttgt tttattgcac agatttattt ttttttagaa 84481 tttgtaattg ccttttaaaa tttgttgttg atataactaa catacaattt ctttttgaaa 84541 tttttaaaac ttttttctct taaaaaattt tttttgtatt ttcttttatt tttttctttt 84601 tcttcagggt aacagttatt aataacatac aatttcacct tctcactaac attgctaata 84661 ataggttcta aagtttctta atcatcgcat ttcttctact aaatccactt ttatttttta 84721 aaaagggaaa gcactctttc agggcttctt gaaattcttt tgtaacttga atcaggtact 84781 atttatttct gctaaaaatc tgtcatgata tttctcttta ctggaccgct ggactaggtc 84841 cataatggta ttcatgacct tctctaaaat aactactgct actttaagtc agtttctaaa 84901 gttcacagat catagtgtat ttttttaaga tcattgggaa ttatttattt ccagaattca 84961 tgattgcaga cattgtgtgg tcagcaaatg gtcacaaagg aagtacactg tatatattcg 85021 acctggaagg aaaaacaccg ataagacagt gaattctttg gcttcctcat ccaggccatg 85081 catgtcagtg cttctgtccc cttgttggtc agtagctcag tgtctaggca gatgaagttc 85141 ttagaagggt gctggtcagc cagagatcac agatatgcaa aagttggctt taaaaaaaat 85201 tcagtgtccc ataattggat tctccactgc tgcaaactgt gtgccactat gctatttgcc 85261 tatgcctgtc ctaggctgct cttctaaatt tcctttgtca gtggctctaa gtctctgtcc 85321 caccaacacg gggcttcctt gatttgtgag tggaatttgc aagtgtaaat gtgtcctgta 85381 ctgtttgttc ctgaggcatg gtttcaagta aaatggcatt ttacagcctg cattaatggg 85441 aaattacttt ctattagaat tctgaataat gtgcacataa tgtttataaa gtgaaggaaa 85501 gatgtcaagc atttcattaa caaatgaaag caaagctcac gagactctgc cagaagtgaa 85561 ttccaagaat actttcttca tcatttctcc tttcagctca caaatctgta ggttggtgat 85621 tcagcctggg cagctcttca ggtctcagct aggctcactc acgggactgg ggaacagctg 85681 gctatcagct ggtgcctcct aggtctggag gttggctggg gtgacttagt tctgctccaa 85741 atgtctccct ccagcaggct agccctggca tgttctcaca gtgaaggcag agcttccact 85801 ggagtacgtg tctccgggta ctctttttcc ccaagctccc aacataggct tctttagcgt 85861 gttcccaagt ccacttcaca atttctccac aagattccag tcattttaca tgaagtagag 85921 agacagtctt gcagaccctc ttcttggcca taggcagtca ccagagtaag gaagggaaga 85981 ggcgaggaag gcaacagggc caactcctct tggtaaagca caaacagtaa ctggaaagta 86041 ccaaccacag agaaggcagg agagcaactt cagggtgaac atcccaggct ctagagaaaa 86101 gcagatgctg attccagtcc tagcttctcc actcaccatg ggcgttattt attccgcagc 86161 ctcagattcc ttggcggtga aatgggataa tgctactaac tactgcctta aggcagtagt 86221 agtttgtgaa gatcaaatac aaacatggtg cttggcacag cagctgttgc aagtgaactc 86281 aaggaaatga tggttatact aggacttagg tgggggaata ctatgggctt taaaaggaat 86341 ccctagcctg tggcttacac tgctggcatt aggccaagga cagagataaa catttcccct 86401 gacttctttt cactttcaga agtccacagg caacttgccc cagtctcctg tccccaacac 86461 cactacccca cctcaaatcc tggacctggg cactatatta tacccacagc tgtggagctc 86521 ttagttcccc aggctggttc atacaagggg gaaccaggga ctcggccaag aggtacccct 86581 gctaggtcaa acgtctgtgg cccctgggca gtgagtgctc tggagtccac tggtccctgc 86641 ctggggcttg tgctgggagg cagaggtgtc aatgatccca gctattggcc tggcggccag 86701 cctggggcaa ggagcgttcc agccaagtgt caccatcacc attgttctga gaggcctggg 86761 ccggccccac agatggcctc ttagctcctc tcctggtgcc ctgggctcag gcagagagca 86821 tggtgcccaa gcacatagcc tcatagtgaa ttaatgaaac agcaacaaag cagataaatt 86881 gtcccactgg tcctttacat aggttctttc caacagcttt ggagagtgac ggtgtctcct 86941 catcttcatc agcacaagat gtcattcaca ttgaacagtt ttgacagtag gtacaacagg 87001 aatgggttta gtgaacagag ggattcccag gctagccagg atggataaaa gaatgaaaga 87061 gacaaaaatg gtaaagctca gatagtgcac tccacttatt tatttaggac agtcaccaaa 87121 catcaagcag ttctctgttc tataggtaaa ctctgtgatc ctatgaataa ggctgaccac 87181 aagttcagtg agaccctgga gcccactgtt gattgactct tttacagtag atggcaagga 87241 ggactgtaat ctataattag agtctcaagg taatgaacaa tttattctat gtaacagaag 87301 agtggcccat ttggagatat ctgtactaat gggacccata tgatgcaatg agagttatca 87361 ggatgagaaa taactttcta gctgtaagga gctccatctc tctttctctc attctgatgt 87421 ctgctcaggg accttggtgt agaggttcac actgaagaga gtgctggtga tgtgaaggtg 87481 gtcagggagc aggagagtcc tccttatcac ttatcctctt cagaatccac ttactttccc 87541 ctgaagttga tgatatattg ccagcatctt cttttcagat ctgtagaatg tgcagaatga 87601 ggtcactgtg cacatggtcc ttaggggaag ctgaagcatc atcctcatca ctgctgagat 87661 tctccagaaa ccactcaaga ttacatgaag ttgacgtatg ggaaccatct tcttccttat 87721 ccattgattt gccagatgtt atgctgagat ctttcaggag cagctgaaga gtcttcatcc 87781 ttatagctac cttcataaat gatgtccaga ggagaattgt attccatgtt cttgaggctg 87841 tcttggtctt ccatggaacc ctctccaatg ccctgtaata aggaagctgc ttcagaatgt 87901 cctcaacaga aagattttca tttgcttttc agtccacgct gaccacagat gaggcctcct 87961 tctctgacct gtcacttgga aaatccccta ccaggtcttc agcagaagct tcaatgaatt 88021 gctccccagt gtgtactctg ttatgtggcc agattgtttt gacccgctgc ttggggcagg 88081 cagcatatct tctggaccac tgaagatatg tttctcccag atgcgactat ctgtggctag 88141 tgtcacaggg agtagaggaa tttaccaaga cagttgtagg taaagaaagg caggtttgtt 88201 aaagaaagta tgaaaatgtg ttgcaagatt gcaatgggca gcacagcaga gaaggggctg 88261 tctgcaaaga ggcagaggtg gaagggaagt tttatagggt cgtactggat ggggttatgt 88321 gcagaacgag gtcattgtgt ggaatgaggt catctctcag aacaattatt tattattctt 88381 ccccacctgg agcccttccc cacctgaggc cccttcctca tttttgctta cttatcagga 88441 atccacattc ccccatgaca gaatagcaat gacaaatctt tggcattagg gtagaggtct 88501 catctccaac tgcttcctgc tgaccagggg catagagttg acccaaccta tggttttggt 88561 ctgtcaggag accctgtgag ccattgtcct gggctgtagg acctaggata ttggatcatg 88621 ctgaaggaaa gcattcatcg gagaagggga gcatgtctag gcaaaactgg cctccagctg 88681 agtgggagac tcctaaaggt agcaggtatc attgaggaga taccaatatc ctgttcatag 88741 caccatttga aggtgaaact actgaattct agaagataaa attttgtttc ttgagccagt 88801 taccaaaaag gcaaagaaaa cccttctgta gtatgactgt ttttccttat tggaagccca 88861 tttagataac ttggaagttg cacttgatga aaaagtgttt gaatttaatt tgacacagca 88921 tagggagcca attttagaaa gactattata tcttaattac atatagtgtt ccttttatga 88981 atttcacaaa ttttctcatg atttacacag accatctatg atgtgcccag actttctgac 89041 ttgtcctaaa catccctcat tcttaaacaa gcagtcattt tactttagga caagaattta 89101 tggtacaaaa tcctttctcg tgcaaaatct tcataacctt cattaccaaa aatacctctt 89161 tacctttata actttcttta tgtttctctt atttcctggt tccttttatc ttgttttata 89221 tataaccatt atcctttgaa ttagacaaaa gtcattttcc tttttttagg agtttatggt 89281 ttatactatg tgttgctgtg tgagttctgt ggaagggaag caaatgagga ggttatttat 89341 atattgtaga agttgttcca tcaagagatt actcagttag ctttcttgct agggcatgtc 89401 tgaataagta tgggctattt taaacccctg agataagact atctaggttg aagttatttg 89461 ttaaagattt aggtagcttt cccaggagaa atggggctac tagaaggaaa ggtgaattca 89521 gaggctgggc aaatattaag caggcacaca tcttggaaag tatgtttttg ccccaaagga 89581 gtgtggggta tttagatatt accagggcat ggagggaaaa tggtaactga tcccttacat 89641 agtataaagg ggtgtgactt tttcttttgg agggagaggg tgccgcttgt tcccattacc 89701 caacaggatt tggaggagag ttgtttagag aaggagatta gtacagagtc actataagtc 89761 atttagccaa aatgataagt ccaaaatttt taaaaggcaa aaatctttac tcgctgatag 89821 aggggagact tagctttcaa aacaggctgc aataaagaca gcatgaggac aactgaatct 89881 gtctcctttt tcttccctcc tttttctgtc attcatttaa aaggaaaaca aaattctttc 89941 atttttgttt atattacatg aaaatcttgt ctaaaagaga aagccaaatt gcatctttgt 90001 actagttttc tattaatgtt aaatccaatt cttagcaaaa ccttataaac aaatctgtcc 90061 aatctttatt agcttgacca tagggtaaga tttccataaa cctcttataa ccctttatcg 90121 ttttctgtta aacagcagat taattatcta agaaaactat tatctggaca cacgggccca 90181 gattctggac ctgcattagt gtgcttttat ttcaattttc aacctatggg aaaactgaat 90241 aattcccttc aaatcttagc cagctttttt atacccacag acctttttac aagatcagcc 90301 ctccacaact tactatctaa cttgcttaaa ccttcagttt tattccatta ctcctgtagg 90361 ttaggccaat ccttaaaacc tctgagcaag acaaaattac attcccttta acaaaagcca 90421 tatttccatg ccttcttttt tttttttttt ttttttgatg gagtcgctct ctgtcgccca 90481 ggctggaatg cagcggcgca atctcggctc actgcaagct ctgccttcag gcttcacgcc 90541 attctcctgc ctcagcctcc caagtagctg ggactacagg cgcctgccac cacgcctggc 90601 taattttttg tatttttagt agagacgggg tttcgccatg ttaaccagaa tggtctagat 90661 ctcctgacct cgtgatccac ccacctcggc atcccaaagt gctgggatta caggtgtgag 90721 ccaccatgcc cggcctattt ccatgcctcc ttataacctc ccaccaaaaa catattctac 90781 tttccttcta tacttttcat gtaaactgtt tctccagcag tctcaagtac atgttacaat 90841 gttaactctc agcaactttt atttttggtg aaaaacctga tgagtaattt tttttttttt 90901 ttttttgaga cggagtctag ctctgtcatc caggctggag tgcgatggca caatctcggc 90961 tcactgcaac ctccacctcc ttggtccaag caattctcct gcctaggccc ccaaagtagc 91021 tgggattaca ggtgcccgcc accacgccca gttaattttt gtatttttag tagagacagg 91081 gtttcactgt gttaaccagg ctggtctcaa actcctaacc tcgtgatccc cctacctcag 91141 cctcccaaag tgctgggatt acaagcatga gccacggtgc ctggccctca tgagtaattt 91201 taaccatgta taagattgca gaactcagga caagctgcag ataatgtctg actctttcca 91261 gactagccag gggaccaggc taacaccata tgtccccagg ccttacctag aatctaatgg 91321 ctataaaaca aacaagtcaa ttattaaaag tcatagaagc agtttatggc cttaaagcat 91381 ctgacaaaca gtatctgact gcctaattta gtttaagtgt gtgaactttg aagacatctt 91441 ttatttacct taccaataat ctttaaactg gctttttaaa agagtcgtat tagagtcacg 91501 tgacctaaaa ggcattaaat ttttattttt ctgacaatat gtttaagtgc ttatttttct 91561 ttaagccaat tagaattctt ttatataaac acataaaaat acacagacag aaatggagaa 91621 ttaagacaaa attctatcaa ctgagaagtt tttagagcaa aagcaggggc tttaaaaacc 91681 aatatctgta cacatgcaga ccaaatataa gctttaagtt gatttctaac tattaccaga 91741 tttcagtcag gacaaatggc tagtatttct ggcttttgaa tcttttacta aaggtaattt 91801 ataacatgat attagtaagc cttaactaag aagaagcctt tttttttttt ttttttgaga 91861 cggagtgtcg ctctgtcgcc caggctggag tgcagtggcg ctatctgggc tcactgcaac 91921 ctccgcctcc caggttcacg ccattctcct gccttagcct ccccagcagc tgggactaca 91981 ggcgcccgcc accgcgcccg gctaaatttt tttgtatttt tagtagagac ggggtttcat 92041 cgtggtctcg atctcctgac ctcttgatcc gcccgcctcg gcctcccaaa gtgctgggat 92101 tacaggcgtg agccaccgcg cccggcgcag aagaagactt aaacacagac acatgagctg 92161 tctccaaaga gacggtcagt gatttacaag atctagaatt gccccaaagg taattcagag 92221 aaaagaaaat ttcaagacag gaaatcagaa gctgttcatg gaggggaaaa gaatcaataa 92281 agggcaaaaa taccacaagt attaaaccac aaaggactca cattgaaaga caagaattga 92341 acgcattgtg agagggcaaa gccttagcca cttatctaca gcacaaggtg actgctgttt 92401 tttccagaag gagtttaaag cattttcaag cttgcaaagg attttaactg ctcaagataa 92461 ctttctgaga ctagccatgt tgttgttatg catccttctt ttaatttaac ctttttcttt 92521 tattggtcta cccagttcca atagtgaccc aatctaaaag gctgttaaca tccagatagt 92581 aattttccag gtttttacca tataagcaaa aggttatttc cagaaagggg tagaggaggt 92641 gtctccatga cagacagctt ttgaactcaa aagggaaatt tataatttta cttgctgcct 92701 ccagagttgc ccttggcttt gttttattga taatgatgtt tgatttggaa gccagctgga 92761 gcagagaccc ctaccccagc agaaggccat caggggtgtc tgatgggatt ctgtcctggg 92821 ggcccttcag cccttagggc agtcccattt ccagtggcca agcttgtggc cgagggggca 92881 aactgtgtaa ggcttttccc catttgtccc attgggacag tttgcctttc agtggcctgg 92941 ccttctgcac tgatggcagt tacctggagg atattctgag ggcaacccag actgggctgg 93001 gggctcataa agcagccagc agttgagcct gccttttctc cttgtgtttt ctgtgttttt 93061 ctttcccgtt agccctgtcc tttttattct gctctcagtt ataaaagact gaggaggcta 93121 atttgagaat ttcctgcaaa ggggccatgc tgttaatata tgacacaaga aaattagatg 93181 tgccctattt tttctgcacc tctgagctta ctgtctgtct ggctttttta agttttaatt 93241 tccaggactt gaactcgctg ccatgtccac taaagaagga aagatcttcg tgggagggct 93301 caacattaac accgacgagc aggtgctgga agacgacttc agcagcttcg ggcctgtctc 93361 tgaggtggtc attgtcaagg agactcagtg gtccaggggt tttggtttca tcaccatcac 93421 caacccagac gccatgagag ccatgaacag agagtccctg gatggtcacc agatccgcgt 93481 ggatcatgca ggcaagtctg ctggggaacc agaggaggtg actttggggc ccatgggcgt 93541 ggtcacagct actctagagg tggtgggaac caggactatg ggagtggcag gtatgacagt 93601 tgacctggag ggtatagata tggatatgga cagtccagag actataatgt cagaagccag 93661 ggtagttatg actgctactc agaaggaaat tacagaagca attatgacaa ctgaaatgtg 93721 acatgtgcac ataatataca caaggaatag ctcttctgat ccaggattgt ccttccaaat 93781 ggctgtattt ataaaggttt ttggagctgt actgaaacat cttattttat agtatatcaa 93841 cctcttgttt ttaaattgag ctcccaaggt agctagttaa agatctttta gacagctcca 93901 tctttgttta aaattttttc tcctatttaa agacaaatta tggaacattt gtagggtcgg 93961 agtatttttc tttttgccag ttttttagtt tgagctgtca ggattattgg atctagcaat 94021 aattggttct gacatttgac cagactggtt tttgaaaatt agtgtgtatc cagggacatt 94081 taaaaaacct gtacacagtg tttattgtgg ttaggaagca atttccaaat atacctagaa 94141 gaaatctgca tcaagaacat gattatctag gggttttctc taattcagat aaccaaactg 94201 attatataga agagccgctt taaaatgttt gcaaatgtct ttttgtaata ctggaagaaa 94261 aaatattgtt ttgtctcata tagtgcttag gatgtccttc acagagctta ttaaaaagtt 94321 gaaaccagaa aaaaaaatta ggtgtttctt tttgagagtc tgagagttaa atttgtctta 94381 atgttttagg atgtagccca gaggtgagtc tgaataaata gatggggttt gtccatgttg 94441 gaactggaaa acaatgagcc actggagggt aatgtctgct ggggacacga accacacttt 94501 ctttaggggc atcctgctga ggaaaagggt tccacttaca ttgcagaggg atacaagtac 94561 actttctaaa ggggcatccc acccccatta gaaaggacct ttcagcgctg gcaccttatg 94621 ctgggtgatc agcccaggca tgaggaaaaa gggtaaagag agaaagacag cctgctgcca 94681 gccctggaga aggaacagga atggggacac tcactgctct gaggccactt gagatcacct 94741 gatttgggaa catccagagc aggatgtctg gctgtcttca tgggagaatt tagaatttag 94801 agtgagaaag agggagtctg agttccccaa aacatgtgtg ccaattgtgc cacacaaagg 94861 gattggggac ttttaaccag aaaggatagg agagggtctt cctcccttct gggtaaggca 94921 gccaaagtct ttcaccctct ggccttcagg ctacaccaag gtgtgggcct ggccagttgt 94981 gatcaattgc cagagggatg ctagagtttg tctgctggaa ggcaggaaag gaaagcaaac 95041 tctgaactct tacctgatca ggcagtggtg gtcagacatc ttttcacagg gactttctgg 95101 tctgccgagg agtggccccg gctggggacc ttcagctgtc tcggtgcttg gatgctgtca 95161 ccaagaggtc agagttggag gacaggaaag ggagtgagta ggggaagagg tccccgaaca 95221 gtccctgtac gggccaccaa aacatcacag gtgtgactat ctggggctgg gctggtgtcg 95281 caggtggtaa aggaatttac caagatagtt gtaggtaaag aaaggcagat ttattagaga 95341 aaatatgaaa atacattgca agattgcaac aggtagcaca gcagagaagg gctgtctgca 95401 aaaaggcagg ggctggaggg aagttttata gggttgtgct agagggggct gcacatagag 95461 gtcatctctc agaacagttg ttcattgttc ttccccacct ggggcccttc cccacccgag 95521 gccccttcct cattgttgct tacttatcag gactccacaa cattttcata ttcttgaggc 95581 aagattctgt ccatgaagag aggaaaggat aaagaaaatg cagtatatat acataatgga 95641 atgctcttca gccttaaaaa ttcatgaaat catgtcatta cagcaacatg tatggacgag 95701 gtaggaaaaa tgccccagag aatccacacc aaagtttcct cctcagcatc tgcgcccaca 95761 gacaggagct gtttcttccc cgggatgttg ccccaggtcc tcctgaagct gccatggtgc 95821 tccctcagga gggtcctgaa ggaactggag ctcttctcct cctcacaggc tgaatagtat 95881 ttctttgtgt ataagtacta cattttcttt atccattcat ctgttgatgg acacttaggt 95941 tgcttctata cgctgttgtg aataatgctg cttgtgaata atgctgcaat aaacatggga 96001 gcacagatgt ctcttgcaca tactgatttc atttcatttg gatatatacc cagtagtgga 96061 tttgtggcaa catggatgaa cctggaggac actatgctaa gtggaataag ccagatacag 96121 aaagacaaac atgatttcaa tcatgtatgg aatctaaaaa agtttatctc acagaagtag 96181 agaatagaat agtggttacc agaggctgag gaggggaaag ggaatggggg cataggaagc 96241 aattagtcaa aaagtacaaa gtgacagtta gaaggagtaa gttctggtgt tctagtcaca 96301 gtagggtgac catggttaac aatattgtat atttcaaaaa agctagaaga gagaattttg 96361 aatgttctca ccacaaagaa atgataaatg tggccgggcg cggcggctca cggctgtaat 96421 cccagccctt tgggaggcta aagtgggcag atcacttgag gtcaggagtt caagaccagc 96481 ctggccaaca tggtgaaacc ccatctctac taaaaataca aaaattagta gtggtggtgg 96541 gcacctgtaa tcccagctac ttgggaggct gaggcatgat catcacatga acctggaggc 96601 agcagttgca gtgagctgag atggggccac tgcactccag catgggcaac agagtgacac 96661 tccatctcaa aaaaaaagaa aaagagaaaa gaaagaaaga aatgataaat gaggtgatgg 96721 gtatgctaat atcctgattt gatttgttga tttttacaca acgtatacat ttatcgaaac 96781 atcacactgt gccccataaa tatgtaaaat tattatgtgt caattaaaaa cattttaaaa 96841 tgtctgccct ttatttttta aattatatat aatacatata tatataaagc tataacatat 96901 atatatttca acaccttttg ggatacaagt ggttttctgt tatatggatg aattatatag 96961 tggtgaaatc tgaggtttta gtgcacctgt caccagagta gtatacattg tactcaatac 97021 gtagtttttt tattcctcac tccccttcca ctttccctgc ttttgagtct ccgatgtcca 97081 ttataccact ctgtaggcct ttgcgtaccc atagcttagt tcccacttat aagtgagaat 97141 atactgccct tttctttaca tgtactttct ttggagagat tgttggaatt ctcagagcaa 97201 gagaatcctc agcttcagtt atcatctcta tgaagaggaa tctaagaaag agtgtatgta 97261 aggaaaaagg caaaagctag aggaatgtct gtatttagca gtgggaagat gaagagcagt 97321 caatgggaaa tggtcagagt ggtagagaga aaaacaatga agataaggca gtataaaaga 97381 agatgaagaa agagtagtca ttgcatcaaa aacttcagag aaaccgggaa gggtgagaac 97441 aaagactaga aaaagctatt agatttgttg actcacatgc ttttattttt cttcacgaag 97501 tatttcagta acattatgga agcaaaagcc atattataaa agtttatttg ttagtgaaat 97561 tacagattac agattccttt tccaagaaat ttaattgtaa gggaaaagac acagtacatt 97621 gcagcttgaa gtggtactgg gggcaggtac catcagaaat acatcagaaa taaccaactt 97681 atatttacta atggagaaga agaggccact gggagagtag gattgaaaat gcaaattgag 97741 gaagaaataa tggtggggtg ataccctgca aatgtgggac aatcagtgtt tgagactata 97801 ggagatgagt tctagcaaga tgattagagg atcctttctt ggggaaaaga gggaaggaag 97861 taaaagagat tttgaaaata gagaagttaa gaaaactcct aaggtagctt caaaatattc 97921 agtaaaaaag gcaagcaagg ccgggcatgg tggctcacgc ctgtaatccc agcactttgg 97981 gaggctaagg agggtgggtc acctgagttc aagagttcca ggccagcctg gccaacatgg 98041 caaaaccccg tctctactaa aaatataaaa ttagctgggt gtggtggcag gcgcctgtaa 98101 tcccagctac tcaggaggct gaggcaggag aattgcttga accagggagg cagagattgc 98161 agtgagccaa gattgcacca ttgcactcca gcctgggaga cagagcaaga ctccatctga 98221 aaaaaaaaaa aaaaaaaaaa aaaaaggcaa gcaaggtcaa gagtcaaagg agcagtgttg 98281 aagttgggag ctttaagaaa aaggagtttt gaaacagtta tcttagcaga taaatatgca 98341 tggtaaatgg taaacaagcg gctgagcagc tatgaagatc caggctagaa cagtgatttt 98401 taactactcc tttaaatcaa ggactgtctt agaccgtttt ttgtcctgta tgatactttg 98461 tatcgaacaa gcacacacta aatattgttg aattgatgct aattttttaa agtttgcttt 98521 tgaatgagag gaaaagacaa gagaaaaaaa gaaaacaagg aaagtgaaag caggagagtg 98581 tttcagcttt cccagggcat gggaatctca gaagcacttt gcttgtttaa tgacctttac 98641 caagtcacct tttcaatact ggcttcattt cccaggcaaa acaggggaaa taatctctgc 98701 ctccttattg tcttccaaga atatcataag ctcttagtaa aataatattt gcatactctt 98761 ggttgcctta tagacaagaa aatatttagt ttcctcattt ttatagattt gcttctcttc 98821 tctaggctgt ttattattat agattggtta gcaaaagttt ggcctgttga cattgatttc 98881 tagacaatag aaggggaaat aagaatatga gcaatgaaaa tttgtgacaa aaataaaaca 98941 gaaaaatgaa aggagaggat gtttgttact ttcaacctac cagtcaatat cattgtcttt 99001 aaatatctct tggaatttca tgcaattatt ctacctcatc agtacacttg aagttgcatt 99061 ttaatttcat gatc // LOCUS HS5HT2BSR 1790 bp RNA PRI 02-MAY-1994 DEFINITION H.sapiens mRNA for 5-HT2B serotonin receptor. ACCESSION X77307 NID g475197 KEYWORDS 5-HT2B serotonin receptor; serotonin receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1790) AUTHORS Schmuck,K., Ullmer,C., Engels,P. and Lubbert,H. TITLE Cloning and functional characterization of the human 5-HT2B serotonin receptor JOURNAL FEBS Lett. 342 (1), 85-90 (1994) MEDLINE 94192809 REFERENCE 2 (bases 1 to 1790) AUTHORS Luebbert,H. TITLE Direct Submission JOURNAL Submitted (14-JAN-1994) H. Luebbert, Preclinical Research 386-226, Sandoz Pharma Basel, CH-4002 Basel, SWITZERLAND FEATURES Location/Qualifiers source 1..1790 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="SH-SY5Y cells" CDS 56..1501 /codon_start=1 /product="5-HT2B serotonin receptor" /db_xref="PID:g475198" /db_xref="SWISS-PROT:P41595" /translation="MALSYRVSELQSTIPEHILQSTFVHVISSNWSGLQTESIPEEMK QIVEEQGNKLHWAALLILMVIIPTIGGNTLVILAVSLEKKLQYATNYFLMSLAVADLL VGLFVMPIALLTIMFEAMWPLPLVLCPAWLFLDVLFSTASIMHLCAISVDRYIAIKKP IQANQYNSRATAFIKITVVWLISIGIAIPVPIKGIETDVDNPNNITCVLTKERFGDFM LFGSLAAFFTPLAIMIVTYFLTIHALQKKAYLVKNKPPQRLTWLTVSTVFQRDETPCS SPEKVAMLDGSRKDKALPNSGDETLMRRTSTIGKKSVQTISNEQRASKVLGIVFFLFL LMWCPFFITNITLVLCDSCNQTTLQMLLEIFVWIGYVSSGVNPLVYTLFNKTFRDAFG RYITCNYRATKSVKTLRKRSSKIYFRNPMAENSKFFKKHGIRNGINPAMYQSPMRLRS STIQSSSIILLDTLLLTENEGDKTEEQVSYV" BASE COUNT 545 a 377 c 341 g 527 t ORIGIN 1 tactaaccat gctgaccact gttcggaacg ggattgaatc acagaaaaac agcaaatggc 61 tctctcttac agagtgtctg aacttcaaag cacaattcct gagcacattt tgcagagcac 121 ctttgttcac gttatctctt ctaactggtc tggattacag acagaatcaa taccagagga 181 aatgaaacag attgttgagg aacagggaaa taaactgcac tgggcagctc ttctgatact 241 catggtgata atacccacaa ttggtggaaa tacccttgtt attctggctg tttcactgga 301 gaagaagctg cagtatgcta ctaattactt tctaatgtcc ttggcggtgg ctgatttgct 361 ggttggattg tttgtgatgc caattgccct cttgacaata atgtttgagg ctatgtggcc 421 cctcccactt gttctatgtc ctgcctggtt atttcttgac gttctctttt caaccgcatc 481 catcatgcat ctctgtgcca tttcagtgga tcgttacata gccatcaaaa agccaatcca 541 ggccaatcaa tataactcac gggctacagc attcatcaag attacagtgg tgtggttaat 601 ttcaataggc attgccattc cagtccctat taaagggata gagactgatg tggacaaccc 661 aaacaatatc acttgtgtgc tgacaaagga acgttttggc gatttcatgc tctttggctc 721 actggctgcc ttcttcacac ctcttgcaat tatgattgtc acctactttc tcactatcca 781 tgctttacag aagaaggctt acttagtcaa aaacaagcca cctcaacgcc taacatggtt 841 gactgtgtct acagttttcc aaagggatga aacaccttgc tcgtcaccgg aaaaggtggc 901 aatgctggat ggttctcgaa aggacaaggc tctgcccaac tcaggtgatg aaacacttat 961 gcgaagaaca tccacaattg ggaaaaagtc agtgcagacc atttccaacg aacagagagc 1021 ctcaaaggtc ctagggattg tgtttttcct ctttttgctt atgtggtgtc ccttctttat 1081 tacaaatata actttagttt tatgtgattc ctgtaaccaa actactctcc aaatgctcct 1141 ggagatattt gtgtggatag gctatgtttc ctcaggagtg aatcctttgg tctacaccct 1201 cttcaataag acatttcggg atgcatttgg ccgatatatc acctgcaatt accgggccac 1261 aaagtcagta aaaactctca gaaaacgctc cagtaagatc tacttccgga atccaatggc 1321 agagaactct aagtttttca agaaacatgg aattcgaaat gggattaacc ctgccatgta 1381 ccagagtcca atgaggctcc gaagttcaac cattcagtct tcatcaatca ttctactaga 1441 tacgcttctc ctcactgaaa atgaaggtga caaaactgaa gagcaagtta gttatgtata 1501 gcagaactgg cagttgtcat caaacataat gatgagtaag atgatgaatg agatgtaaat 1561 gtgcccagaa tatattatat aaagaatttt atgtcatata tcaaatcatc tctttaacct 1621 aagatgtaag tattaagaat atctaatttt cctaatttgg acaagattat tccatgagga 1681 aaataatttt atatagctac aaatgaaaac aatccagcac tctggttaaa ttttaaggta 1741 ttcgaatgaa ataaagtcaa atcaataaat ttcaggcttt aaaaaaaaaa // LOCUS HS5HT4AR 1336 bp RNA PRI 02-DEC-1997 DEFINITION H.sapiens mRNA for serotonin receptor 5-HT4. ACCESSION Y08756 NID g2661732 KEYWORDS 5-HT4 receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1336) AUTHORS Blondel,O., Vandecastle,G., Gastineau,M., Leclerc,C., Dahmoune,Y., Langlois,M. and Fischmeister,R. TITLE Molecular and functional characterization of a 5-HT4 receptor from human atrium JOURNAL Unpublished REFERENCE 2 (bases 1 to 1336) AUTHORS Blondel,O. TITLE Direct Submission JOURNAL Submitted (10-OCT-1996) O. Blondel, Faculte De Pharmacie, Inserm U-446, 5, Rue Jb Clement, 92296, FRANCE REMARK revised by author 11-APR-1997 FEATURES Location/Qualifiers source 1..1336 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" mRNA 1..1336 /product="serotonin 5-HT receptor" CDS 19..1182 /codon_start=1 /product="serotonin 5-HT receptor" /db_xref="PID:e311752" /db_xref="PID:g2661733" /translation="MDKLDANVSSEEGFGSVEKVVLLTFLSTVILMAILGNLLVMVAV CWDRQLRKIKTNYFIVSLAFADLLVSVLVMPFGAIELVQDIWIYGEVFCLVRTSLDVL LTTASIFHLCCISLDRYYAICCQPLVYRNKMTPLRIALMLGGCWVIPTFISFLPIMQG WNNIGIIDLIEKRKFNQNSNSTYCVFMVNKPYAITCSVVAFYIPFLLMVLAYYRIYVT AKEHAHQIQMLQRAGASSESRPQSADQHSTHRMRTETKAAKTLCIIMGCFCLCWAPFF VTNIVDPFIDYTVPGQVWTAFLWLGYINSGLNPFLYAFLNKSFRRAFLIILCCDDERY RRPSILGQTVPCSTTTINGSTHVLRYTVLHRGHHQELEKLPIHNDPESLESCF" BASE COUNT 297 a 343 c 321 g 375 t ORIGIN 1 cggtgcttat ttcctgtaat ggacaaactt gatgctaatg tgagttctga ggagggtttc 61 gggtcagtgg agaaggtggt gctgctcacg tttctctcga cggttatcct gatggccatc 121 ttggggaacc tgctggtgat ggtggctgtg tgctgggaca ggcagctcag gaaaataaaa 181 acaaattatt tcattgtatc tcttgctttt gcggatctgc tggtttcggt gctggtgatg 241 ccctttggtg ccattgagct ggttcaagac atctggattt atggggaggt gttttgtctt 301 gttcggacat ctctggacgt cctgctcaca acggcatcga tttttcacct gtgctgcatt 361 tctctggata ggtattacgc catctgctgc cagcctttgg tctataggaa caagatgacc 421 cctctgcgca tcgcattaat gctgggaggc tgctgggtca tccccacgtt tatttctttt 481 ctccctataa tgcaaggctg gaataacatt ggcataattg atttgataga aaagaggaag 541 ttcaaccaga actctaactc tacgtactgt gtcttcatgg tcaacaagcc ctacgccatc 601 acctgctctg tggtggcctt ctacatccca tttctcctca tggtgctggc ctattaccgc 661 atctatgtca cagctaagga gcatgcccat cagatccaga tgttacaacg ggcaggagcc 721 tcctccgaga gcaggcctca gtcggcagac cagcatagca ctcatcgcat gaggacagag 781 accaaagcag ccaagaccct gtgcatcatc atgggttgct tctgcctctg ctgggcacca 841 ttctttgtca ccaatattgt ggatcctttc atagactaca ctgtccctgg gcaggtgtgg 901 actgctttcc tctggctcgg ctatatcaat tccgggttga acccttttct ctacgccttc 961 ttgaataagt cttttagacg tgccttcctc atcatcctct gctgtgatga tgagcgctac 1021 cgaagacctt ccattctggg ccagactgtc ccttgttcaa ccacaaccat taatggatcc 1081 acacatgtac taaggtacac cgttctgcac aggggacatc atcaggaact cgagaaactg 1141 cccatacaca atgacccaga atccctggaa tcatgcttct gattgaggac atggctcaca 1201 acttagccat tcattcgcat tcatgtttgc atgaacaggt caccctggca tcacttctga 1261 acctcatcac caccagtgag gcatcaggta gtaggctgag agcccagagg aggtacatgg 1321 tggacagtgt tggccg // LOCUS HS5NUASE 3547 bp RNA PRI 05-JUN-1991 DEFINITION Human placental cDNA coding for 5'nucleotidase (EC 3.1.3.5). ACCESSION X55740 NID g23896 KEYWORDS 5'-nucleotidase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3547) AUTHORS Ikehara,Y. TITLE Direct Submission JOURNAL Submitted (23-OCT-1990) Ikehara Y., Dept. of Biochem., School of Medicine,Fukuoka University, 7-45-1 Nanakuma, Jonan-ku, Fukuoka 814-01, Japan REFERENCE 2 (bases 1 to 3547) AUTHORS Misumi,Y., Ogata,S., Ohkubo,K., Hirose,S. and Ikehara,Y. TITLE Primary structure of human placental 5'-nucleotidase and identification of the glycolipid anchor in the mature form JOURNAL Eur. J. Biochem. 191 (3), 563-569 (1990) MEDLINE 90361037 COMMENT Data kindly reviewed (11-FEB-1991) by Ikehara Y. FEATURES Location/Qualifiers source 1..3547 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placental" /clone_lib="cDNA in lambda gt11" mRNA <1..3547 sig_peptide 50..127 CDS 50..1774 /EC_number="3.1.3.5" /codon_start=1 /product="5'-nucleotidase" /db_xref="PID:g23897" /db_xref="SWISS-PROT:P21589" /translation="MCPRAARAPATLLLALGAVLWPAAGAWELTILHTNDVHSRLEQT SEDSSKCVNASRCMGGVARLFTKVQQIRRAEPNVLLLDAGDQYQGTIWFTVYKGAEVA HFMNALRYDAMALGNHEFDNGVEGLIEPLLKEAKFPILSANIKAKGPLASQISGLYLP YKVLPVGDEVVGIVGYTSKETPFLSNPGTNLVFEDEITALQPEVDKLKTLNVNKIIAL GHSGFEMDKLIAQKVRGVDVVVGGHSNTFLYTGNPPSKEVPAGKYPFIVTSDDGRKVP VVQAYAFGKYLGYLKIEFDERGNVISSHGNPILLNSSIPEDPSIKADINKWRIKLDNY STQELGKTIVYLDGSSQSCRFRECNMGNLICDAMINNNLRHTDEMFWNHVSMCILNGG GIRSPIDERNNGTITWENLAAVLPFGGTFDLVQLKGSTLKKAFEHSVHRYGQSTGEFL QVGGIHVVYDLSRKPGDRVVKLDVLCTKCRVPSYDPLKMDEVYKVILPNFLANGGDGF QMIKDELLRHDSGDQDINVVSTYISKMKVIYPAVEGRIKFSTGSHCHGSFSLIFLSLW AVIFVLYQ" mat_peptide 128..1771 /EC_number="3.1.3.5" /product="5'-nucleotidase" polyA_signal 2901..2906 polyA_signal 3533..3538 BASE COUNT 1067 a 753 c 751 g 976 t ORIGIN 1 gcactcgccc ggctcgcccg ctttcgcacc cagttcacgc gccacagcta tgtgtccccg 61 agccgcgcgg gcgcccgcga cgctactcct cgccctgggc gcggtgctgt ggcctgcggc 121 tggcgcctgg gagcttacga ttttgcacac caacgacgtg cacagccggc tggagcagac 181 cagcgaggac tccagcaagt gcgtcaacgc cagccgctgc atgggtggcg tggctcggct 241 cttcaccaag gttcagcaga tccgccgcgc cgaacccaac gtgctgctgc tggacgccgg 301 cgaccagtac cagggcacta tctggttcac cgtgtacaag ggcgccgagg tggcgcactt 361 catgaacgcc ctgcgctacg atgccatggc actgggaaat catgaatttg ataatggtgt 421 ggaaggactg atcgagccac tcctcaaaga ggccaaattt ccaattctga gtgcaaacat 481 taaagcaaag gggccactag catctcaaat atcaggactt tatttgccat ataaagttct 541 tcctgttggt gatgaagttg tgggaatcgt tggatacact tccaaagaaa ccccttttct 601 ctcaaatcca gggacaaatt tagtgtttga agatgaaatc actgcattac aacctgaagt 661 agataagtta aaaactctaa atgtgaacaa aattattgca ctgggacatt cgggttttga 721 aatggataaa ctcatcgctc agaaagtgag gggtgtggac gtcgtggtgg gaggacactc 781 caacacattt ctttacacag gcaatccacc ttccaaagag gtgcctgctg ggaagtaccc 841 attcatagtc acttctgatg atgggcggaa ggttcctgta gtccaggcct atgcttttgg 901 caaataccta ggctatctga agatcgagtt tgatgaaaga ggaaacgtca tctcttccca 961 tggaaatccc attcttctaa acagcagcat tcctgaagat ccaagcataa aagcagacat 1021 taacaaatgg aggataaaat tggataatta ttctacccag gaattaggga aaacaattgt 1081 ctatctggat ggctcctctc aatcatgccg ctttagagaa tgcaacatgg gcaacctgat 1141 ttgtgatgca atgattaaca acaacctgag acacacggat gaaatgttct ggaaccacgt 1201 atccatgtgc attttaaatg gaggtggtat ccggtcgccc attgatgaac gcaacaatgg 1261 cacaattacc tgggagaacc tggctgctgt attgcccttt ggaggcacat ttgacctagt 1321 ccagttaaaa ggttccaccc tgaagaaggc ctttgagcat agcgtgcacc gctacggcca 1381 gtccactgga gagttcctgc aggtgggcgg aatccatgtg gtgtatgatc tttcccgaaa 1441 acctggagac agagtagtca aattagatgt tctttgcacc aagtgtcgag tgcccagtta 1501 tgaccctctc aaaatggacg aggtatataa ggtgatcctc ccaaacttcc tggccaatgg 1561 tggagatggg ttccagatga taaaagatga attattaaga catgactctg gtgaccaaga 1621 tatcaacgtg gtttctacat atatctccaa aatgaaagta atttatccag cagttgaagg 1681 tcggatcaag ttttccacag gaagtcactg ccatggaagc ttttctttaa tatttctttc 1741 actttgggca gtgatctttg ttttatacca atagccaaaa attctccttg cctttaatgt 1801 gtgaaactgc attttttcaa gtgagattca aatctgcctt ttaggacctg gctttgtgac 1861 agcaaaaacc atctttacag gctcctagaa gctgaaggtt agagcattat aaaatgaaga 1921 gacagacatg attactcagg gtcagcaacc tagtgagtta gaaaaaaaat taacataggg 1981 ccctataagg agaaagccaa ctatgttaag tttacgtgtc caaattttaa tgaaatttta 2041 ctaacaattt taaaccatat ttttcttctt catatccatt tctaatccat caaacagctt 2101 atgtttacat aaaattttat cattcacaag gaagttttaa gcacactgtc tcatttgata 2161 tccacaactt atttttggta ggaaagagag atgtttttcc cacctgtcag atgaaaaaac 2221 tgaagctcaa aaagggttga cttgaccata cagctaatgc tgacagatcc aagacctaga 2281 cctaggtctt ttgaactcaa gtccagcatt ctcaactata tcaagttact gttcagaata 2341 cttaatatct cctctcttca taattatcaa tagccccaag ctcatggatg acaaatctct 2401 gctttatttc ttgtctctat tttttcactt tatagctcct gttataatag caagtttaat 2461 ggtataaaca caggatacca tcctctcttg caacacccat gtgcctttga tgagtcaggt 2521 agcaagctgt agtagataat gagaaaggcc agaggctgca aagacagtca aaggacacga 2581 gagaaaggaa ggggaagaac aggactccag gactgtttta tattatagaa aagcaagagc 2641 taaagagcat ttacacatgt taaacagata cttgttaagc atagtgcctg acacacggca 2701 ttagctgtta ttttatgaga ttccatcagc tctgcctctg tcctctttct tctaacatga 2761 aggtatcatg agaagagaac cttctaacat aagctgtaat tctaaacctg cacttgtccc 2821 tctccagcaa gaggctagca ctgaattcat tctactcata ctacacaccc agttatggaa 2881 tgtccagagt tctcgaagaa aataaatgac tttaggaaga ggtatacatt ttttaagtcg 2941 ctctgcctcc aaatctgaac agtcactgta aatcattctt aagcccagat atgagaactt 3001 ctgctggaaa gtgggaccct ctgagtgggt ggtcagaaaa tacccatgct gatgaaatga 3061 cctatgccca aagaacaaat acttaacgtg ggagtggaac cacatgagcc tgctcagctc 3121 tgcataagta attcaagaaa tgggaggctt caccttaaaa acagtgtgca aatggcagct 3181 agaggttttg ataggaagta tgtttgtttc ttagtgttta caaatattaa gtactcttga 3241 tacaaaatat acttttaaac ttcataacct ttttataaaa gttgttgcag caaaataata 3301 gcctcggttc tatgcatata tggattgcta taaaaaatgt caataagatt gtacaaggaa 3361 aattagagaa agtcacattt agggtttatt ttttacactt ggccagtaaa atagggtaaa 3421 tcctattaga aattttttaa agaacttttt ttaagtttcc taaatctgtg tgtgtattgt 3481 gaagtggtat aagaaatgac tttgaaccac tttgcaattg tagattccca acaataaaat 3541 tgaagat // LOCUS HS5T4OA 2053 bp RNA PRI 15-APR-1994 DEFINITION H.sapiens 5T4 gene for 5T4 Oncofetal antigen. ACCESSION Z29083 NID g435654 KEYWORDS 5T4 gene; oncofetal antigen. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2053) AUTHORS Myers,K.A., Rahi-Saund,V., Davison,M.D., Young,J.A., Cheater,A.J. and Stern,P.L. TITLE Isolation of a cDNA encoding 5T4 oncofetal trophoblast glycoprotein. An antigen associated with metastasis contains leucine-rich repeats JOURNAL J. Biol. Chem. 269 (12), 9319-9324 (1994) MEDLINE 94179356 REFERENCE 2 (bases 1 to 2053) AUTHORS Myers,K.A. TITLE Direct Submission JOURNAL Submitted (16-DEC-1993) Myers K. A., Paterson Institute for Cancer Research, Immunology, Wilmslow Road, Manchester, UK, M20 9BX FEATURES Location/Qualifiers source 1..2053 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="lambda gt11 library of J. Milan" /sex="Female" misc_RNA 62..372 /note="This region contains four conserved cysteine residues found in many other LRR containing proins." /citation=[1] /function="Unknown" /label=N-flank /product="LRR N-terminal flank" CDS 85..1347 /citation=[1] /codon_start=1 /evidence=experimental /product="5T4 oncofetal antigen" /db_xref="PID:g435655" /translation="MPGGCSRGPAAGDGRLRLARLALVLLGWVSSSSPTSSASSFSSS APFLASAVSAQPPLPDQCPALCECSEAARTVKCVNRNLTEVPTDLPAYVRNLFLTGNQ LAVLPAGAFARRPPLAELAALNLSGSRLDEVRAGAFEHLPSLRQLDLSHNPLADLSPF AFSGSNASVSAPSPLVELILNHIVPPEDERQNRSFEGMVVAALLAGRALQGLRRLELA SNHFLYLPRDVLAQLPSLRHLDLSNNSLVSLTYVSFRNLTHLESLHLEDNALKVLHNG TLAELQGLPHIRVFLDNNPWVCDCHMADMVTWLKETEVVQGKDRLTCAYPEKMRNRVL LELNSADLDCDPILPPSLQTSYVFLGIVLALIGAIFLLVLYLNRKGIKKWMHNIRDAC RDHMEGYHYRYEINADPRLTNLSSNSDV" sig_peptide 130..171 /note="Putative - based on hydrophobicity of the amino acids." /citation=[1] misc_RNA 373..966 /note="This region contains seven repeats interupted by a unique central domain of hydrophilic amino acids." /citation=[1] /function="Unknown" /label=LRRs /product="Leucine rich repeat region" misc_RNA 966..1119 /note="This region contains four conserved cysteine residues found in many other LRR containing Proteins." /citation=[1] /function="Unknown" /label=C-flank /product="LRR C-terminal flank" misc_RNA 1153..1215 /standard_name="transmembrane region" /note="Putative - based on the hydrophobicity of the amino acids. The protein is known to be membrane bound." /citation=[1] /function="Anchorage of the protein to the cell membrane" /product="transmembrane peptide" BASE COUNT 461 a 602 c 499 g 491 t ORIGIN 1 ccggctcgcg ccctccgggc ccagcctccc gagccttcgg agcgggcgcc gtcccagccc 61 agctccgggg aaacgcgagc cgcgatgcct ggggggtgct cccggggccc cgccgccggg 121 gacgggcgtc tgcggctggc gcgactagcg ctggtactcc tgggctgggt ctcctcgtct 181 tctcccacct cctcggcatc ctccttctcc tcctcggcgc cgttcctggc ttccgccgtg 241 tccgcccagc ccccgctgcc ggaccagtgc cccgcgctgt gcgagtgctc cgaggcagcg 301 cgcacagtca agtgcgttaa ccgcaatctg accgaggtgc ccacggacct gcccgcctac 361 gtgcgcaacc tcttccttac cggcaaccag ctggccgtgc tccctgccgg cgccttcgcc 421 cgccggccgc cgctggcgga gctggccgcg ctcaacctca gcggcagccg cctggacgag 481 gtgcgcgcgg gcgccttcga gcatctgccc agcctgcgcc agctcgacct cagccacaac 541 ccactggccg acctcagtcc cttcgctttc tcgggcagca atgccagcgt ctcggccccc 601 agtccccttg tggaactgat cctgaaccac atcgtgcccc ctgaagatga gcggcagaac 661 cggagcttcg agggcatggt ggtggcggcc ctgctggcgg gccgtgcact gcaggggctc 721 cgccgcttgg agctggccag caaccacttc ctttacctgc cgcgggatgt gctggcccaa 781 ctgcccagcc tcaggcacct ggacttaagt aataattcgc tggtgagcct gacctacgtg 841 tccttccgca acctgacaca tctagaaagc ctccacctgg aggacaatgc cctcaaggtc 901 cttcacaatg gcaccctggc tgagttgcaa ggtctacccc acattagggt tttcctggac 961 aacaatccct gggtctgcga ctgccacatg gcagacatgg tgacctggct caaggaaaca 1021 gaggtagtgc agggcaaaga ccggctcacc tgtgcatatc cggaaaaaat gaggaatcgg 1081 gtcctcttgg aactcaacag tgctgacctg gactgtgacc cgattcttcc cccatccctg 1141 caaacctctt atgtcttcct gggtattgtt ttagccctga taggcgctat tttcctcctg 1201 gttttgtatt tgaaccgcaa ggggataaaa aagtggatgc ataacatcag agatgcctgc 1261 agggatcaca tggaagggta tcattacaga tatgaaatca atgcggaccc cagattaaca 1321 aacctcagtt ctaactcgga tgtctgagaa atattagagg acagaccaag gacaactctg 1381 catgagatgt agacttaagc tttatcccta ctaggcttgc tccactttca tcctccacta 1441 tagatacaac ggactttgac taaaagcagt gaaggggatt tgcttccttg ttatgtaaag 1501 tttctcggtg tgttctgtta atgtaagacg atgaacagtt gtgtatagtg ttttaccctc 1561 ttctttttct tggaactcct caacacgtat ggagggattt ttcaggtttc agcatgaaca 1621 tgggcttctt gctgtctgtc tctctctcag tacagttcaa ggtgtagcaa gtgtacccac 1681 acagatagca ttcaacaaaa gctgcctcaa ctttttcgag aaaaatactt tattcataaa 1741 tatcagtttt attctcatgt acctaagttg tggagaaaat aattgcatcc tataaactgc 1801 ctgcagacgt tagcaggctc ttcaaaataa ctccatggtg cacaggagca cctgcatcca 1861 agagcatgct tacattttac tgttctgcat attacaaaaa ataacttgca acttcataac 1921 ttctttgaca aagtaaatta cttttttgat tgcagtttat atgaaaatgt actgattttt 1981 ttttaataaa ctgcatcgag atccaaccga ctgaattgtt aaaaaaaaaa aaaaataaag 2041 attcttaaaa gaa // LOCUS HS63KDAP 4219 bp RNA PRI 30-JUN-1993 DEFINITION H.sapiens 63 kDa protein kinase related to rat ERK3. ACCESSION X59727 S38873 NID g23902 KEYWORDS protein kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2354) AUTHORS Gonzalez,F.A., Raden,D.L., Rigby,M.R. and Davis,R.J. TITLE Heterogeneous expression of four MAP kinase isoforms in human tissues JOURNAL FEBS Lett. 304 (2-3), 170-178 (1992) MEDLINE 92316223 REFERENCE 2 (bases 1 to 4219) AUTHORS Gonzalez,F.A. TITLE Direct Submission JOURNAL Submitted (27-JAN-1992) Fernando A Gonzalez, Biochemistry and Molecular Biology, University of Massachusetts Medical School, 373 Plantation St., Worcester, MA, 01605, USA FEATURES Location/Qualifiers source 1..4219 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /clone_lib="Lambda Zap II" /clone="pBluescript II" CDS 538..2211 /codon_start=1 /product="63kDa protein kinase" /db_xref="PID:g23903" /db_xref="SWISS-PROT:P31152" /translation="MAEKGDCIASVYGYDLGGRFVDFQPLGFGVNGLVLSAVDSRACR KVAVKKIALSDARSMKHALREIKIIRRLDHDNIVKVYEVLGPKGTDLQGELFKFSVAY IVQEYMETDLARLLEQGTLAEEHAKLFMYQLLRGLKYIHSANVLHRDLKPANIFISTE DLVLKIGDFGLARIVDQHYSHKGYLSEGLVTKWYRSPRLLLSPNNYTKAIDMWAAGCI LAEMLTGRMLFAGAHELEQMQLILETIPVIREEDKDELLRVMPSFVSSTWEVKRPLRK LLPEVNSEAIDFLEKILTFNPMDRLTAEMGLQHPYMSPYSCPEDEPTSQHPFRIEDEI DDIVLMAANQSQLSNWDTCSSRYPVSLSSDLEWRPDRCQDASEVQRDPRGFGALAEDV QVDPRKDSHSSSERFLEQSHSSMERAFEADYGRSCDYKVGSPSYLDKLLWRDNKPHHY SEPKLILDLSHWKQAAGAPPTATGLADTGAREDEPASLFLEIAQWVKSTQGAQSTPAR PPTTPSAACLPRPPPPGPGGRRRQPPVRPGRVHLPRPEALHQARGPAGQ" BASE COUNT 1005 a 1208 c 1108 g 898 t ORIGIN 1 cccccctcga ggctcgacgg tatcgataag cttgatatcg aattccgagc tttggagcat 61 cttaaggagc tcagctcagt aaacaaactc ttgcatttca gccagaaaga gcctcttgta 121 acaagtattc aaaggggaga gtttctgcat cttttacttt gcagtccact atggtagaaa 181 acttgacatt ccatagataa tgatactggg ttttctttcc aagatccgac gtttaaaaga 241 aatatgagcc attctaagct ttaagaaggg ttcaggaaac acaggaatta gtagacagcg 301 ctcccaatgc aggttaagac gacagcctgc gcccccaact agcacagctc agcgagcatg 361 accatatgcc attctcgtct ccagagagct ggtggcagtg acctcactag gagaaaacac 421 atccctcagc cgtgggactt gacagaatga ggtgcgcgag ggaggccgct agccgagact 481 tggcctttcc tgactgcccc tgtgttacct gggcagctcc agatcactga gcccacaatg 541 gctgagaagg gtgactgcat cgccagtgtc tatgggtatg acctcggtgg gcgctttgtt 601 gacttccaac ccctgggctt cggtgtcaat ggtttggtgc tgtcggccgt ggacagccgg 661 gcctgccgga aggtcgctgt gaagaagatt gccctgagcg atgcccgcag catgaagcac 721 gcgctccgag agatcaagat cattcggcgc ctggaccacg acaacatcgt caaagtgtac 781 gaggtgctcg gtcccaaggg cactgacctg cagggtgagc tgttcaagtt cagcgtggcg 841 tacatcgtcc aggagtacat ggagaccgac ctggcacgcc tgctggagca gggcacgctg 901 gcagaagagc atgccaagct gttcatgtac cagctgctcc gcgggctcaa gtacatccac 961 tccgccaacg tgctgcacag ggacctgaag cccgccaaca tcttcatcag cacagaggac 1021 ctcgtgctca agattgggga tttcgggttg gcaaggatcg ttgatcagca ttactcccac 1081 aagggttatc tgtcagaagg gttggtaaca aagtggtacc gttccccacg actgctcctt 1141 tcccccaata actacaccaa agccatcgac atgtgggccg ccggctgcat cctggctgag 1201 atgcttacgg ggagaatgct ctttgctggg gcccatgagc tggagcagat gcaactcatc 1261 ctggagacca tccctgtaat ccgggaggaa gacaaggacg agctgctcag ggtgatgcct 1321 tcctttgtca gcagcacctg ggaggtgaag aggcctctgc gcaagctgct ccctgaagtg 1381 aacagtgaag ccatcgactt tctggagaag atcctgacct ttaaccccat ggatcgccta 1441 acagctgaga tggggctgca acacccctac atgagcccat actcgtgccc tgaggacgag 1501 cccacctcac aacacccctt ccgcattgag gatgagatcg acgacatcgt gctgatggcc 1561 gctaaccaga gccagctgtc caactgggac acgtgcagtt ccaggtaccc tgtgagcctg 1621 tcgtcggacc tggagtggcg gcctgaccgg tgccaggacg ccagcgaggt acagcgcgac 1681 ccgcgcgggt tcggcgcact ggctgaggac gtgcaggtgg acccgcgcaa ggactcgcac 1741 agcagctccg agcgcttcct agagcagtcg cactcgtcca tggagcgcgc cttcgaggcc 1801 gactacgggc gctcctgcga ctacaaggtg gggtcgccgt cctacctgga caagctgctg 1861 tggcgcgaca acaagccgca ccactactcg gagcccaagc tcatcctgga cctgtcgcac 1921 tggaagcagg cggccggcgc gccccccacg gccacggggc tggcggacac gggggcgcgc 1981 gaggacgagc cggccagcct cttcctggag atcgcgcagt gggtcaagag cacgcagggc 2041 gcccagagca cgccagcccg cccgccgacg accccgagcg ccgcttgtct gcctcgcccc 2101 ccgccgcccg gccccggtgg acggcggcgc cagcccccag ttcgacctgg acgtgttcat 2161 ctcccgcgcc ctgaagctct gcaccaagcc cgaggacctg ccggacaata aactgggcga 2221 cctcaatggt gcgtgcatcc ccgagcaccc tggcgacctc gtgcagaccg aggccttctc 2281 caaagaaagg tggtgagggc ggaggggccg ctccaggccc cacagagcag gagaccccca 2341 gagaaagccg gggctggcag gaggcggccg cctccgccct ctctgctgcc ttggggttgg 2401 cagaacacgt gaaggatccg aggagcgaga ggaatgtcca tttcttaaac tgccttaata 2461 actagccttt aacctgtggg agcgggtttg aactggaccc tggcttaggg gtgactcatt 2521 tctacgaaag ggagaccaca tgtgtgcaca gggaagaacg ctttagacac gagtctgcgg 2581 ccactggtgc agatcggaga atctgcagag gtagctcgaa accatctgcc caactagcct 2641 caactgacag ctgaggaaag caattagcca gagaggcaga gacactcgct taagatcaca 2701 ggcttagtgt gaggacgagc ttgaaatccc agtctcctgg cccccaggca gggtctgtca 2761 ccatagaatg tcttcctcta ctggggtcgt tctggctttt tgttagaaac ttggtctgag 2821 atgttcttcc cctgtccatt accattcgat gttcttttgt tcagagcaat gtttcttgta 2881 ttctgaaact ggaaactgaa ccagtttgcc tttctcctag tcaccaagca tactttcctg 2941 gctccccaag tacttaaatg ttctcatctg tcgcacccct gtatttgcct cacccctgca 3001 tggtcggaaa tcttcgtttc aggtcagaac agcctggggt ctgtgggtaa aatcagccct 3061 tctcccaggc ctgtgcacac accccctcag cactccctat gcactttcct gacacgcaaa 3121 gacacagccc tctttcccca ctgggcgtcc taccccagtg aggttgaagg caccaattcc 3181 aagaatccct ccaacctccc tgccagcact cccccttcac cccacacccg gcccccccac 3241 cctaaccaca gcgcctctcc agacctacct cggaccaaat gttctctaca tgaactgctc 3301 atttggagga cagcagtgag gtcctgccat agagcaaatg tgttaggaga gaaggtttca 3361 catgggaccc aacatccttc atcaatactt tcctgagttt gatcatccat ttagccttga 3421 caaacagcag accctacaga gatgtgttgg agagcacgtc gtgaccttgg gggcaaggaa 3481 tccagaaagg taggaagata tgaaaagaga ggtgtcaaca gcaagggctc ttaggggtca 3541 ggcaccagca tggagacctc atgacaaagg agggactcaa agcagcaatg cccctcatag 3601 tgtaggctaa ggtgagtttg gtgcatgcaa actgtgtgtc acccacagag catggggtaa 3661 tggtgtgtag acacaggcct ctgcagaagc gtggggtggg gacactgaca gccctatctg 3721 gtcccaggac attctaccat ttctgccact ggtgttcagc tccttctctt cccccaacac 3781 tcccaaagat acccacagaa gtccagccag tttccaggta gagatccacc attggtcttg 3841 ggctgcgttc accctcacac cacacgcctt aaatctaatc agcaaactat aatttgtcgt 3901 taaacctgca acacattaga aacttatatt taaaaacaga attaactcac atgaccaact 3961 tttaaatgga aaatatgtaa ataggaagtg tttgggtttt gttttttctt taagaaaaag 4021 aaatgtacac cactcctcat gtgccatttt gtcctcagag ggcgcttact ttttggtaaa 4081 gaacaagctg ctgccttgac caggagttca tatataactg ttattacaga ggaattgtta 4141 taactactaa tgtttttaaa aaatttatta aacattatta aacttgatca ggtcaggcca 4201 aataaagttt tattggaac // LOCUS HS7B2 1152 bp RNA PRI 23-MAR-1995 DEFINITION Human mRNA for polypeptide 7B2. ACCESSION Y00757 NID g23910 KEYWORDS 7B2 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1152) AUTHORS Martens,G.J.M. TITLE Direct Submission JOURNAL Submitted (20-JUN-1988) Martens G.J.M., Univ. of Nijmegen, Dept. of Animal Physiology, Toernooiveld, 6525 ED Nijmegen, The Netherlands REFERENCE 2 (bases 1 to 1152) AUTHORS Martens,G.J. TITLE Cloning and sequence analysis of human pituitary cDNA encoding the novel polypeptide 7B2 JOURNAL FEBS Lett. 234 (1), 160-164 (1988) MEDLINE 88271601 FEATURES Location/Qualifiers source 1..1152 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="pituitar gland" /clone="p-lambda-H6-7" sig_peptide 29..106 /note="signal peptide" CDS 29..664 /note="polypeptide 7B2 precursor" /codon_start=1 /db_xref="PID:g23911" /db_xref="SWISS-PROT:P05408" /translation="MVSRMVSTMLSGLLFWLASGWTPAFAYSPRTPDRVSEADIQRLL HGVMEQLGIARPRVEYPAHQAMNLVGPQSIEGGAHEGLQHLGPFGNIPNIVAELTGDN IPKDFSEDQGYPDPPNPCPVGKTDDGCLENTPDTAEFSREFQLHQHLFDPEHDYPGLG KWNKKLLYEKMKGGERRKRRSVNPYLQGQRLDNVVAKKSVPHFSDEDKDPE" mat_peptide 107..661 /note="mature peptide" BASE COUNT 322 a 241 c 282 g 307 t ORIGIN 1 cgctcctcgg gctgcccctc ggttgacaat ggtctccagg atggtctcta ccatgctatc 61 tggcctactg ttttggctgg catctggatg gactccagca tttgcttaca gcccccggac 121 ccctgaccgg gtctcagaag cagatatcca gaggctgctt catggtgtta tggagcaatt 181 gggcattgcc aggccccgag tggaatatcc agctcaccag gccatgaatc ttgtgggccc 241 ccagagcatt gaaggtggag ctcatgaagg acttcagcat ttgggtcctt ttggcaacat 301 ccccaacatc gtggcagagt tgactggaga caacattcct aaggacttta gtgaggatca 361 ggggtaccca gaccctccaa atccctgtcc tgttggaaaa acagatgatg gatgtctaga 421 aaacacccct gacactgcag agttcagtcg agagttccag ttgcaccagc atctctttga 481 tccggaacat gactatccag gcttgggcaa gtggaacaag aaactccttt acgagaagat 541 gaagggagga gagagacgaa agcggaggag tgtcaatcca tatctacaag gacagagact 601 ggataatgtt gttgcaaaga agtctgtccc ccatttttca gatgaggata aggatccaga 661 gtaaagagaa gatgctagac gaaaacccac attacctgtt aggcctcagc atggcttatg 721 tgcacgtgta aatggagtcc ctgtgaatga cagcatgttt cttacataga taattatgga 781 tacaaagcag ctgtatgtag atagtgtatt gtcttcacac cgatgattct gctttttgct 841 aaattagaat aagagctttt ttgtttcttg ggtttttaaa atgtgaatct gcaatgatca 901 taaaaattaa aatgtgaatg tcaacaataa aaagcaagac tatgaaaggc tcagatttct 961 tgcagtttaa aatggtgtct gaggttgtac tattttggcc aagtctgtag aaagctgtca 1021 tttgattttg attatgtagt tcatccagcc cttgggcatt gttatacacc agtaaagaag 1081 gctgtactca agaggaggag ctgacacatt tcacttggct gcgtcttaat aaacatgaat 1141 gcaagcattg gc // LOCUS HSA1ATR3 517 bp RNA PRI 12-SEP-1993 DEFINITION Human macrophage alpha1-antitrypsin (alpha1-AT) mRNA 5'-end (L17). ACCESSION X05826 NID g24440 KEYWORDS alpha-1-antitrypsin; unidentified reading frame. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 517) AUTHORS Perlino,E., Cortese,R. and Ciliberto,G. TITLE The human alpha 1-antitrypsin gene is transcribed from two different promoters in macrophages and hepatocytes JOURNAL EMBO J. 6 (9), 2767-2771 (1987) MEDLINE 88054975 FEATURES Location/Qualifiers source 1..517 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 255..347 /note="ORF (AA 1-30)" /codon_start=1 /db_xref="PID:g24441" /translation="MVDLATSGTATKDSAVRAEGQLSGTLPETV" CDS 404..514 /note="ORF (AA 1-36)" /codon_start=1 /db_xref="PID:g24442" /translation="MTPFASPVAPLDPLLKYGRGQGPVSSASGTTTDLGQ" BASE COUNT 118 a 152 c 149 g 98 t ORIGIN 1 gaattccact gcctccacgc agcaaccctc agagtcctga gctgaaccaa gaaggaggag 61 ggggtcgggc ctccgaggaa ggcctagctg ctgctgctgc caggaattcc aggttggagg 121 ggcggcaacc tccgccagcc tcaggccact ctcctgtgcc tgccagaaga gacagagctt 181 gaggagagct tgaggagagc aggaaagggc ggcagtaagt cttcagcatc aggcattttg 241 gggtgactca gtaaatggta gatcttgcta ccagtggaac agccactaag gattctgcag 301 tgagagcaga gggccagcta agtggtactc tcccagagac tgtctgactc acgccacccc 361 tccaccttgg acacaggacg ctgtggtttc tgagccaggt acaatgactc ctttcgcctc 421 ccccgttgcc cctctggatc cactgcttaa atacggacga ggacagggcc ctgtctcctc 481 agcttcaggc accaccactg acctgggaca gtgaatc // LOCUS HSABCCT 6246 bp RNA PRI 27-AUG-1996 DEFINITION H.sapiens mRNA for ABC-C transporter. ACCESSION X97187 NID g1514529 KEYWORDS ABC transporter. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6246) AUTHORS Klugbauer,N. and Hofmann,F. TITLE Primary structure of a novel ABC transporter with a chromosomal localization on the band encoding the multidrug resistance-associated protein JOURNAL FEBS Lett. 391 (1-2), 61-65 (1996) MEDLINE 96326608 REFERENCE 2 (bases 1 to 6246) AUTHORS Klugbauer,N. TITLE Direct Submission JOURNAL Submitted (09-APR-1996) N. Klugbauer, Inst. f. Pharmakologie u. Toxikologie, Technische Universitaet Muenchen, Biedersteiner Str. 29, D-80802 Muenchen, 80802, FRG FEATURES Location/Qualifiers source 1..6246 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="medullary thyroid carcinoma" /chromosome="16" /map="p13.3" /clone_lib="pcDNA2" CDS 348..5462 /codon_start=1 /product="ABC-C transporter" /db_xref="PID:e243436" /db_xref="PID:g1514530" /translation="MAVLRQLALLLWKNYTLQKRKVLVTVLELFLPLLFSGILIWLRL KIQSENVPNATIYPGQSIQELPLFFTFPPPGDTWELAYIPSHSDAAKTVTETVRRALV INMRVRGFPSEKDFEDYIRYDNCSSSVLAAVVFEHPFNHSKEPLPLAVKYHLRFSYTR RNYMWTQTGSFFLKETEGWHTTSLFPLFPNPGPREPTSPDGGEPGYIREGFLAVQHAV DRAIMEYHADAATRQLFQRLTVTIKRFPYPPFIADPFLVAIQYQLPLLLLLSFTYTAL TIARAVVQEKERRLKEYMRMMGLSSWLHWSAWFLLFFLFLLIAASFMTLLFCVKVKPN VAVLSRSDPSLVLAFLLCFAISTISFSFMVSTFFSKANMAAAFGGFLYFFTYIPYFFV APRYNWMTLSQKLCSCLLSNVAMAMGAQLIGKFEAKGMGIQWRDLLSPVNVDDDFCFG QVLGMLLLDSVLYGLVTWYMEAVFPGQFGVPQPWYFFIMPSYWCGKPRAVAGKEEEDS DPEKALRNEYFEAEPEDLVAGIKIKHLSKVFRVGNKDRAAVRDLNLNLYEGQITVLLG HNGAGKTTTLSMLTGLFPPTSGRAYISGYEISQDMVQIRKSLGLCPQHDILFDNLTVA EHLYFYAQLKGLSRQKCPEEVKQMLHIIGLEDKWNSRSRFLSGGMRRKLSIGIALIAG SKVLILDEPTSGMDAISRRAIWDLLQRQKSDRTIVLTTHFMDEADLLGDRIAIMAKGE LQCCGSSLFLKQKYGAGYHMTLVKEPHCNPEDISQLVHHHVPNATLESSAGAELSFIL PRESTHRFEGLFAKLEKKQKELGIASFGASITTMEEVFLRVGKLVDSSMDIQAIQLPA LQYQHERRASDWAVDSNLCGAMDPSDGIGALIEEERTAVKLNTGLALHCQQFWAMFLK KAAYSWREWKMVAAQVLVPLTCVTLALLAINYSSELFDDPMLRLTLGEYGRTVVPFSV PGTSQLGQQLSEHLKDALQAEGQEPREVLGDLEEFLIFRASVEGGGFNERCLVAASFR DVGERTVVNALFNNQAYHSPATALAVVDNLLFKLLCGPHASIVVSNFPQPRSALQAAK DQFNEGRKGFDIALNLLFAMAFLASTFSILAVSERAVQAKHVQFVSGVHVASFWLSAL LWDLISFLIPSLLLLVVFKAFDVRAFTRDGHMADTLLLLLLYGWAIIPLMYLMNFFFL GAATAYTRLTIFNILSGIATFLMVTIMRIPAVKLEELSKTLDHVFLVLPNHCLGMAVS SFYENYETRRYCTSSEVAAHYCKKYNIQYQENFYAWSAPGVGRFVASMAASGCAYLIL LFLIETNLLQRLRGILCALRRRRTLTELYTRMPVLPEDQDVADERTRILAPSPDSLLH TPLIIKELSKVYEQRVPLLAVDRLSLAVQKGECFGLLGFNGAGKTTTFKMLTGEESLT SGDAFVGGHRISSDVGKVRQRIGYCPQFDALLDHMTGREMLVMYARLRGIPERHIGAC VENTLRGLLLEPHANKLVRTYSGGNKRKLSTGIALIGEPAVIFLDEPSTGMDPVARRL LWDTVARARESGKAIIITSHSMEECEALCTRLAIMVQGQFKCLGSPQHLKSKFGSGYS LRAKVQSEGQQEALEEFKAFVDLTFPGSVLEDEHQGMVHYHLPGRDLSWAKVFGILEK AKEKYGVDDYSVSQISLEQVFLSFAHLQPPTAEEGR" BASE COUNT 1249 a 1941 c 1790 g 1266 t ORIGIN 1 ctcttctggg acccagccat gagtgtggag ctgagcaact gaacctgaaa ctcttccact 61 gtgagtcaag gaggcttttc cgcacatgaa ggacgctgag cgggaaggac tcctctctgc 121 ctgcagttgt agcgagtgga ccagcaccag gggctctcta gactgcccct cctccatcgc 181 cttccctgcc tctccaggac agagcagcca cgtctgcaca cctcgccctc tttacactca 241 gttttcagag cacgtttctc ctatttcctg cgggttgcag cgcctacttg aacttactca 301 gaccacctac ttctctagca gcactgggcg tccctttcag caagacgatg gctgtgctca 361 ggcagctggc gctcctcctc tggaagaact acaccctgca gaagcggaag gtcctggtga 421 cggtcctgga actcttcctg ccattgctgt tttctgggat cctcatctgg ctccgcttga 481 agattcagtc ggaaaatgtg cccaacgcca ccatctaccc gggccagtcc atccaggagc 541 tgcctctgtt cttcaccttc cctccgccag gagacacctg ggagcttgcc tacatccctt 601 ctcacagtga cgctgccaag accgtcactg agacagtgcg cagggcactt gtgatcaaca 661 tgcgagtgcg cggctttccc tccgagaagg actttgagga ctacattagg tacgacaact 721 gctcgtccag cgtgctggcc gccgtggtct tcgagcaccc cttcaaccac agcaaggagc 781 ccctgccgct ggcggtgaaa tatcacctac ggttcagtta cacacggaga aattacatgt 841 ggacccaaac aggctccttt ttcctgaaag agacagaagg ctggcacact acttcccttt 901 tcccgctttt cccaaaccca ggaccaaggg aacctacatc ccctgatggc ggagaacctg 961 ggtacatccg ggaaggcttc ctggccgtgc agcatgctgt ggaccgggcc atcatggagt 1021 accatgccga tgccgccaca cgccagctgt tccagagact gacggtgacc atcaagaggt 1081 tcccgtaccc gccgttcatc gcagacccct tcctcgtggc catccagtac cagctgcccc 1141 tgctgctgct gctcagcttc acctacaccg cgctcaccat tgcccgtgct gtcgtgcagg 1201 agaaggaaag gaggctgaag gagtacatgc gcatgatggg gctcagcagc tggctgcact 1261 ggagtgcctg gttcctcttg ttcttcctct tcctcctcat cgccgcctcc ttcatgaccc 1321 tgctcttctg tgtcaaggtg aagccaaatg tagccgtgct gtcccgcagc gacccctccc 1381 tggtgctcgc cttcctgctg tgcttcgcca tctctaccat ctccttcagc ttcatggtca 1441 gcaccttctt cagcaaagcc aacatggcag cagccttcgg aggcttcctc tacttcttca 1501 cctacatccc ctacttcttc gtggcccctc ggtacaactg gatgactctg agccagaagc 1561 tctgctcctg cctcctgtct aatgtcgcca tggcaatggg agcccagctc attgggaaat 1621 ttgaggcgaa aggcatgggc atccagtggc gagacctcct gagtcccgtc aacgtggacg 1681 acgacttctg cttcgggcag gtgctgggga tgctgctgct ggactctgtg ctctatggcc 1741 tggtgacctg gtacatggag gccgtcttcc cagggcagtt cggcgtgcct cagccctggt 1801 acttcttcat catgccctcc tattggtgtg ggaagccaag ggcggttgca gggaaggagg 1861 aagaagacag tgaccccgag aaagcactca gaaacgagta ctttgaagcc gagccagagg 1921 acctggtggc ggggatcaag atcaagcacc tgtccaaggt gttcagggtg ggaaataagg 1981 acagggcggc cgtcagagac ctgaacctca acctgtacga gggacagatc accgtcctgc 2041 tgggccacaa cggtgccggg aagaccacca ccctctccat gctcacaggt ctctttcccc 2101 ccaccagtgg acgggcatac atcagcgggt atgaaatttc ccaggacatg gttcagatcc 2161 ggaagagcct gggcctgtgc ccgcagcacg acatcctgtt tgacaacttg acagtcgcag 2221 agcaccttta tttctacgcc cagctgaagg gcctgtcacg tcagaagtgc cctgaagaag 2281 tcaagcagat gctgcacatc atcggcctgg aggacaagtg gaactcacgg agccgcttcc 2341 tgagcggggg catgaggcgc aagctctcca tcggcatcgc cctcatcgca ggctccaagg 2401 tgctgatact ggacgagccc acctcgggca tggacgccat ctccaggagg gccatctggg 2461 atcttcttca gcggcagaaa agtgaccgca ccatcgtgct gaccacccac ttcatggacg 2521 aggctgacct gctgggagac cgcatcgcca tcatggccaa gggggagctg cagtgctgcg 2581 ggtcctcgct gttcctcaag cagaaatacg gtgccggcta tcacatgacg ctggtgaagg 2641 agccgcactg caacccggaa gacatctccc agctggtcca ccaccacgtg cccaacgcca 2701 cgctggagag cagcgctggg gccgagctgt ctttcatcct tcccagagag agcacgcaca 2761 ggtttgaagg tctctttgct aaactggaga agaagcagaa agagctgggc attgccagct 2821 ttggggcatc catcaccacc atggaggaag tcttccttcg ggtcgggaag ctggtggaca 2881 gcagtatgga catccaggcc atccagctcc ctgccctgca gtaccagcac gagaggcgcg 2941 ccagcgactg ggctgtggac agcaacctct gtggggccat ggacccctcc gacggcattg 3001 gagccctcat cgaggaggag cgcaccgctg tcaagctcaa cactgggctc gccctgcact 3061 gccagcaatt ctgggccatg ttcctgaaga aggccgcata cagctggcgc gagtggaaaa 3121 tggtggcggc acaggtcctg gtgcctctga cctgcgtcac cctggccctc ctggccatca 3181 actactcctc ggagctcttc gacgacccca tgctgaggct gaccttgggc gagtacggca 3241 gaaccgtcgt gcccttctca gttcccggga cctcccagct gggtcagcag ctgtcagagc 3301 atctgaaaga cgcactgcag gctgagggac aggagccccg cgaggtgctc ggtgacctgg 3361 aggagttctt gatcttcagg gcttctgtgg aggggggcgg ctttaatgag cggtgccttg 3421 tggcagcgtc cttcagagat gtgggagagc gcacggtcgt caacgccttg ttcaacaacc 3481 aggcgtacca ctctccagcc actgccctgg ccgtcgtgga caaccttctg ttcaagctgc 3541 tgtgcgggcc tcacgcctcc attgtggtct ccaacttccc ccagccccgg agcgccctgc 3601 aggctgccaa ggaccagttt aacgagggcc ggaagggatt cgacattgcc ctcaacctgc 3661 tcttcgccat ggcattcttg gccagcacgt tctccatcct ggcggtcagc gagagggccg 3721 tgcaggccaa gcatgtgcag tttgtgagtg gagtccacgt ggccagtttc tggctctctg 3781 ctctgctgtg ggacctcatc tccttcctca tccccagtct gctgctgctg gtggtgttta 3841 aggccttcga cgtgcgtgcc ttcacgcggg acggccacat ggctgacacc ctgctgctgc 3901 tcctgctcta cggctgggcc atcatccccc tcatgtacct gatgaacttc ttcttcttgg 3961 gggcggccac tgcctacacg aggctgacca tcttcaacat cctgtcaggc atcgccacct 4021 tcctgatggt caccatcatg cgcatcccag ctgtaaaact ggaagaactt tccaaaaccc 4081 tggatcacgt gttcctggtg ctgcccaacc actgtctggg gatggcagtc agcagtttct 4141 acgagaacta cgagacgcgg aggtactgca cctcctccga ggtcgccgcc cactactgca 4201 agaaatataa catccagtac caggagaact tctatgcctg gagcgccccg ggggtcggcc 4261 ggtttgtggc ctccatggcc gcctcagggt gcgcctacct catcctgctc ttcctcatcg 4321 agaccaacct gcttcagaga ctcaggggca tcctctgcgc cctccggagg aggcggacac 4381 tgacagaatt atacacccgg atgcctgtgc ttcctgagga ccaagatgta gcggacgaga 4441 ggacccgcat cctggccccc agcccggact ccctgctcca cacacctctg attatcaagg 4501 agctctccaa ggtgtacgag cagcgggtgc ccctcctggc cgtggacagg ctctccctcg 4561 cggtgcagaa aggggagtgc ttcggcctgc tgggcttcaa tggagccggg aagaccacga 4621 ctttcaaaat gctgaccggg gaggagagcc tcacttctgg ggatgccttt gtcgggggtc 4681 acagaatcag ctctgatgtc ggaaaggtgc ggcagcggat cggctactgc ccgcagtttg 4741 atgccttgct ggaccacatg acaggccggg agatgctggt catgtacgct cggctccggg 4801 gcatccctga gcgccacatc ggggcctgcg tggagaacac tctgcggggc ctgctgctgg 4861 agccacatgc caacaagctg gtcaggacgt acagtggtgg taacaagcgg aagctgagca 4921 ccggcatcgc cctgatcgga gagcctgctg tcatcttcct ggacgagccg tccactggca 4981 tggaccccgt ggcccggcgc ctgctttggg acaccgtggc acgagcccga gagtctggca 5041 aggccatcat catcacctcc cacagcatgg aggagtgtga ggccctgtgc acccggctgg 5101 ccatcatggt gcaggggcag ttcaagtgcc tgggcagccc ccagcacctc aagagcaagt 5161 tcggcagcgg ctactccctg cgggccaagg tgcagagtga agggcaacag gaggcgctgg 5221 aggagttcaa ggccttcgtg gacctgacct ttccaggcag cgtcctggaa gatgagcacc 5281 aaggcatggt ccattaccac ctgccgggcc gtgacctcag ctgggcgaag gttttcggta 5341 ttctggagaa agccaaggaa aagtacggcg tggacgacta ctccgtgagc cagatctcgc 5401 tggaacaggt cttcctgagc ttcgcccacc tgcagccgcc caccgcagag gaggggcgat 5461 gaggggtggc ggctgtctcg ccatcaggca gggacaggac gggcaagcag ggcccatctt 5521 acatcctctc tctccaagtt tatctcatcc tttattttta atcacttttt tctatgatgg 5581 atatgaaaaa ttcaaggcag tatgcacaga atggacgagt gcagcccagc cctcatgccc 5641 aggatcagca tgcgcatctc catgtctgca tactctggag ttcactttcc cagagctggg 5701 gcaggccggg cagtctgcgg gcaagctccg gggtctctgg gtggagagct gacccaggaa 5761 gggctgcagc tgagctgggg gttgaatttc tccaggcact ccctggagag aggacccagt 5821 gacttgtcca agtttacaca cgacactaat ctcccctggg gaggaagcgg gaagccagcc 5881 aggttgaact gtagcgaggc ccccaggccg ccaggaatgg accatgcaga tcactgtcag 5941 tggagggaag ctgctgactg tgattaggtg ctggggtctt agcgtccagc gcagcccggg 6001 gcatcctgga ggctctgctc cttagggcat ggtagtcacc gcgaagccgg gcaccgtccc 6061 acagcatctc ctagaagcag ccggcacagg agggaaggtg gccaggctcg aagcagtctc 6121 tgtttccagc actgcaccct caggaagtcg cccgccccag gacacgcagg gaccacccta 6181 agggctgggt ggctgtctca aggacacatt gaatacgttg tgaccatcca gaaaataaat 6241 gctgag // LOCUS HSABHGENE 1953 bp RNA PRI 28-MAR-1996 DEFINITION H.sapiens mRNA for alkB protein homolog. ACCESSION X91992 NID g1237209 KEYWORDS ABH gene; alkB homolog protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1953) AUTHORS Wei,Y.F., Carter,K.C., Wang,R.P. and Shell,B.K. TITLE Molecular cloning and functional analysis of a human cDNA encoding an Escherichia coli AlkB homolog, a protein involved in DNA alkylation damage repair JOURNAL Nucleic Acids Res. 24 (5), 931-937 (1996) MEDLINE 96174661 REFERENCE 2 (bases 1 to 1953) AUTHORS Wei,Y.F. TITLE Direct Submission JOURNAL Submitted (05-OCT-1995) Y.F. Wei, Human Genome Sciences, 9620 Medical Center Drive, Suite 300, Rockville, MD 20850, USA FEATURES Location/Qualifiers source 1..1953 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="synovial sarcoma" /chromosome="14" /map="q24" gene 224..1123 /gene="ABH" CDS 224..1123 /gene="ABH" /codon_start=1 /product="alkB homolog protein" /db_xref="PID:e205285" /db_xref="PID:g1237210" /translation="MHIEQVFSPSASGKPMDSKAILGLFLSQTPSSQVTSGHWVKQCL KLYSQKPNVCNLDTHMSKEETQDLWEQSKEFLRYKEATKRRPRSLLEKLRWVTVGYHY NWDSKKYSADHYTPFPSDLGFLSEQVAAACGFEDFRAEAGILNYYRLDSTLGIHVDRS ELDHSKPLLSFSFGQSAIFLLGGLQRDEAPPPMFMHSGDIMIMSGFSRLLNHAVPRVL PNPEGEGLPHCLEAPLPAVLPRDSMVEPCSMEDWQVCASYLKTARVNMTVRQVLATDQ NFPLEPIEDEKKRHQYRRFLPSG" BASE COUNT 532 a 476 c 438 g 507 t ORIGIN 1 gggaagatgg cagcggccgt gggctctgtg gcgactctgg cgactgagcc cggggaggac 61 gcctttcgga aacttttccg cttctaccgt cagagccggg cccgggaccg cagacctgga 121 aggggtcatc gacttctcgg cggcccacgc agcccgtgca agggtcctgg tgcccaaaag 181 gtgatcaaat ctcagctaaa tgtgtcttct gtcagtgagc agaatgcata tagagcaggt 241 cttcagcccg tcagcaagtg gcaagcctat ggactcaaag gctatcctgg gtttattttt 301 atcccaaacc ccttcctccc aggttaccag tggacactgg gtgaaacagt gccttaagtt 361 atattcccag aaacctaatg tatgtaacct ggacacacac atgtctaaag aagagaccca 421 agatctgtgg gaacagagca aagagttcct gaggtataaa gaagcgacta aacggagacc 481 ccgaagttta ctggagaaac tgcgttgggt gaccgtaggc taccattata actgggacag 541 taagaaatac tcagcagatc attacacacc tttcccttct gacctgggtt tcctctcaga 601 gcaagtagcc gctgcctgtg gatttgagga tttccgagct gaagcaggga tcctgaatta 661 ctaccgcctg gactccacac tgggaatcca cgtagacaga tctgagctag atcactccaa 721 acccttgctg tcattcagct ttggacagtc cgccatcttt ctcctgggtg gtcttcaaag 781 ggatgaggcc cccccgccca tgtttatgca cagtggtgac atcatgataa tgtcgggttt 841 cagccgcctc ttgaaccacg cagtccctcg tgtccttcca aatccagaag gggaaggcct 901 gcctcactgc ctagaggcac ctctccctgc tgtcctcccg agagattcaa tggtagagcc 961 ttgttctatg gaggactggc aggtgtgtgc cagctacttg aagaccgctc gtgttaacat 1021 gactgtccga caggtcctgg ccacagacca gaatttccct ctagaaccca tcgaggatga 1081 aaaaaagaga catcagtaca gaaggtttct gccatctgga tgaccagaat agcgaagtaa 1141 aacgggccag gataaaccct gacagctgag acttggagat cccatccttt ttactcaggc 1201 acctgcttac cgtaaatgat catgttattg tgtattgccg tggacttcag cacccagaca 1261 agccaaaaac agagacaggg aagaactcat tgttgatcac actgttgcct tggaacccac 1321 gcagaagtaa actcatccac tttgctcaga gaagtgtttg acatggtctg ttcctagtta 1381 catgttggct gtaatgtatg ttgagaagtc agtccaagga ggtatgttct tccacaacag 1441 ccttctcagc ctctgctatt tcctttgagg aaggtagaag tgagtttcca tgtttgcaga 1501 gtatttaaat acctcagatt ttattaatga gaaatacagt acccctccct ccactccatc 1561 tggtaattta tggtaaaatt gtggttctgt gaaccagcta ttagtctcat cttcttaact 1621 ccctcaggca tcatcaaatt ctttgatctt ctcttccacc tctctggctc tcatggaaga 1681 atcctttaca catgaaaaca atggaactgg aaaatcttgt cttttagaaa agaaattaat 1741 cacaactatc tctcttgcct aaaagataaa tataggtaaa cccaaggaaa ggggaattta 1801 gtttctctac atgtcatttc ggtctccaaa ctccctgttg gctttttaat gcaattttaa 1861 ttgttggaat aaaaaagtcc caagggtgtt ttgttactgt tttctccatg aataaactca 1921 cttgatttta aaaaaaaaaa aaaaaaaaaa aaa // LOCUS HSAC000099 40677 bp ms-DNA PRI 23-JAN-1997 DEFINITION Cosmid g0771a003, complete sequence. ACCESSION AC000099 NID g1764159 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 40677) AUTHORS Iadonato,S.P., Yu,J., Wong,G.K.-S., Magness,C.L., Green,E.D., Green,P. and Olson,M.V. TITLE Large-scale MCD Mapping and Sequencing of Human Chromosome 7 JOURNAL Unpublished (1996) REFERENCE 2 (bases 1 to 40677) AUTHORS Iadonato,S.P., Yu,J., Wong,G.K.-S., Magness,C.L., Green,E.D., Green,P. and Olson,M.V. TITLE Direct Submission JOURNAL Submitted (06-JAN-1997) Human Genome Center, University of Washington, Box 352145, Seattle, WA 98195, USA REFERENCE 3 (bases 1 to 40677) AUTHORS Iadonato,S.P., Yu,J., Wong,G.K.-S., Magness,C.L., Green,E.D., Green,P. and Olson,M.V. TITLE Direct Submission JOURNAL Submitted (23-JAN-1997) Human Genome Center, University of Washington, Box 352145, Seattle, WA 98195, USA COMMENT Verification: This sequence has been verified by Multiple Complete Digest Mapping. Comparison of the experimentally derived map digest fragments with sequence-predicted fragments is given below. There are no significant remaining descrepancies between the experimental and predicted values. Uniquely ordered fragment groups are separated by dashed lines. EcoRI HindIII NsiI Map Seq Map Seq Map Seq -------- -------- -------- -------- -------- -------- 2191.75 2156.00 4897.20 4774.00 1832.70 1826.00 -------- -------- -------- -------- -------- 1256.20 1255.00 5450.76 5398.00 2043.29 2023.00 -------- -------- -------- -------- -------- 5997.67 5912.00 961.14 949.00 5961.48 5893.00 -------- -------- -------- -------- -------- -------- 9341.29 9114.00 6419.38 6348.00 3734.22 3670.00 -------- -------- -------- -------- -------- -------- 955.47 945.00 4595.28 4465.00 1534.81 1557.00 -------- -------- -------- 1693.77 1694.00 1246.67 1242.00 4478.23 4394.00 -------- -------- -------- -------- 1908.81 1904.00 4596.14 4502.00 858.09 850.00 -------- -------- -------- -------- -------- 1907.98 1899.00 2115.74 2094.00 4841.65 4754.00 -------- -------- -------- -------- -------- -------- 7571.42 7368.00 5426.74 5318.00 4923.00 4820.00 -------- -------- -------- -------- -------- -------- 2179.00 2181.00 1245.76 1240.00 -------- -------- -------- -------- 957.40 957.00 -------- --------. This entry has been annotated with sequence quality estimates computed by the Phrap assembly program. These values are not generally visible from the Genbank flat file format but are available as part of the ASN.1 file. FEATURES Location/Qualifiers source 1..40677 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" /map="7q31.3" /clone="NCHGR:yWSS771" /sub_clone="UWGC:g0771a003" /cell_line="GM10791" /clone_lib="E. Green Chromosome 7 YAC Resource" repeat_region 409..491 /rpt_family="MIR" repeat_region 1952..1978 /rpt_family="(CGG)n" repeat_region complement(6175..6302) /rpt_family="MIR" repeat_region 6472..6624 /rpt_family="MIR" repeat_region complement(7101..7202) /rpt_family="MIR" CDS 11631..12185 /codon_start=1 /evidence=not_experimental /product="putative Metabotropic Glutamate Receptor 8" /db_xref="PID:g1764160" /translation="MVCEGKRSASCPCFFLLTAKFYWILTMMQRTHSQEYAHSIRVDG DIILGGLFPVHAKGERGVPCGELKKEKGIHRLEAMLYAIDQINKDPDLLSNITLGVRI LDTCSRDTYALEQSLTFVQALIEKDASDVKCANGDPPIFTKPDKISGVIGAAASSVSI MVANILRLFKVGKGIVFLTIVKEQ" repeat_region 13101..13403 /rpt_family="AluY" repeat_region 16496..16540 /rpt_family="(CA)n" repeat_region 17903..17999 /rpt_family="MIR2" repeat_region 18146..18519 /rpt_family="THE1B" repeat_region 18637..18806 /rpt_family="L1MA7" repeat_region 19013..19180 /rpt_family="MER5B" repeat_region complement(19885..19928) /rpt_family="L1MC1" repeat_region complement(20045..20239) /rpt_family="MER5A" repeat_region 21586..21698 /rpt_family="MIR" repeat_region 21706..21997 /rpt_family="AluSg" repeat_region 22008..22118 /rpt_family="MIR" repeat_region 22286..22585 /rpt_family="AluJb" repeat_region complement(22587..22622) /rpt_family="(GA)n" repeat_region 22622..22651 /rpt_family="(CA)n" repeat_region 23951..24302 /rpt_family="MLT1A1" repeat_region 26481..26708 /rpt_family="MIR" STS 40560..40676 /standard_name="HUMSWS999" /note="Genbank Accession: G00157" /db_xref="dbSTS:157" BASE COUNT 11182 a 8233 c 9188 g 12074 t ORIGIN 1 accctcttca gagctgcact agacattcac agggaattga ggttccctat gcagaaaggt 61 tttcagacag aggacccagc ccattttctc tctccaaatc ctcctccctc cctcttttgt 121 ggttgcagct ataaaataat gattcaacaa tcaaagagcc tctcatgaga tgggagaagg 181 aagggaacga aaggggaaga ggtgccctgg gctgaaaacc tgtagatttt ttctaagtga 241 caaagggcaa gggtcctttt ttgaaattag gagagagaca gctgtttcct caatcgaaac 301 ttcgcacatt acaaccagcc ttgggaaaag ccaacaccac cagggaggaa gtgctatggg 361 gttttgaagc cagacactga tttctggctg taatactttc cagccgtgtc tctaagcctc 421 ggcttctgcc tccatctgta gagtgggctt aataattttg accatctcac agagttgatg 481 tgaaaattaa aataacccga acagaacagg cgatatatac aggaattaat gaaagattcc 541 tgacataatg cttataagtt catgatgctt aggaagataa aattcattgt agagatgccg 601 aacgggggtc tagagcttgc cttttcgggg tctctcccgt ccttctccat cctccttatc 661 tcctgggctg ccccctctcc ccccgcgccc cgctctccct ggctcgcccg ggcctagggc 721 gccgcctgca gttgcgcgcg gccgcctcta gatggaactt tctcaccaac gcaaggcccg 781 gccggagcag ctacccggga gctgggcggc gaggggctac tttctctcat tccggcgggt 841 gcaggatcgg ggggcctggg cagtaactag tggggaggag cgctggactg tgcacgtcga 901 gcccggcagg tttccgcatg cggcatgcga agggaatccc gaggtttccc tgcagaaccc 961 gagcacggct cccccggagt ttcctgcccc gcgtctgcgg ctccttgaat catctcaata 1021 aaatgaccgt cccggtagcc acccatgccc cttcctgcgc agtgcccgca gcggaccgcg 1081 ctgtgtggta cctcgagccc tggggactct gttgcacgcg tccctcagac cctcgggggc 1141 ggggagtggg gagacaatcg ccagagccgc ggggcgggac aaatggcgga accgccgcgc 1201 ggcgccaggc aaactttgca agggaaccgc gcggcttgcc ggctctactt taagcattcc 1261 cgaaaagaaa gcatgtggcg ggacacttgt catctaccat gtgttattct cggtgacgct 1321 ttctggagct gtgttcaccg gggacccggg ctcgcgggtg ctgcccgctg tgctcgggcg 1381 ggcgtcgcgc ctccccgcgc cggtcccggg ctcgccaggc agccggagcc gctgggctcc 1441 tccacaacca tattccttct tctaccgctc ccgcttcttc ccaccctctc actctgtagt 1501 tgggtcctcc cctttttctt gggggcgggg aaggggggat gatttttaaa aatcagaact 1561 attgacattt ctggtctcct cgtcgcttca ggctgaagag cggaggggga tccgcgggcc 1621 gagggtcccc ctccctgccc gcgccagggc cgctgggtga caccgaaatc cagaggctcc 1681 cgcccctcgg gggttcctcc tcccgcttcc cgaggtgact ggttggcgcg aagcgattgg 1741 cgatcccggg cgcgatcctg gccgcggctc cccgcgccgc gccgggtgaa tggccgcggg 1801 cggaggatcg ggaggcgccg ggcgcagacc aatcgcggcc gccggtggga gtatttgtta 1861 ttcacatgga agagacttgg cgcctgctag gccagctcag ccccctcagc ccagagatca 1921 gccacaagtg cggccgctgt gctcgcctca cgcggcggcg gcggcggcgg cggcggcgct 1981 gacatggagc tgcgggcccc cggcgggctt cctcaccgcg ccctctgcgg ggagcagggt 2041 aagactcgcc gcccggcagc agaaagcggc tccgaggaaa gcaagtgccg aaccacggga 2101 caaaaagccc ctggccccca aacttcccca aaacgccctg cttgttggaa aaggagaatc 2161 cccccgtttt attccccttt ccgttttctt ctctaaaacc tttcagcgga cagccaggtg 2221 caccattcct tctcatatca ctctagtgat ttgtttttcc tgagcacaag tagatggccc 2281 tgcatgtccg tagtctgtga gagaaatagc gggcggaggt gacgggaatg gggggagaag 2341 gagagtgggc accagggggc gaggggctgg ccgggcattg gagcttgatt gggtgtctgt 2401 tgattttacg ttgcaaagaa caaggcagcc acccctcttg tctgcccatg attacattac 2461 agtaaagagg ttcagaaaag agggagactt tacccggaac tcccaaggct tggagatgtg 2521 atcccccaaa agtttagtcg gcgaatggga gactgaaggg tgacgggaag ggggcaggcg 2581 cgctcggcgc tctgactggg cgtgcggcgg caggatttta aagcgctctg cctcggatcg 2641 tctgccctgg gtgacctccc ggacctgccc tggtggaatc cggacttgcc ccgccgagat 2701 gacgaggtac cttcgtttcg cgaacctaac aggagggatt tctgaagtta gcggcttccg 2761 ggatgagtcg gggtaacccc cgccctctga gcgtgggctg gatggatgtc taaccaggat 2821 ctagggagga tgggggtcgc gggggaagcc gggcgcttcg gggtcctcga cacccacatc 2881 cccgcacggc gcacgctctg cctcgagttt gcgttctccg agttagcacg gcgagcttgg 2941 gatttccgtg cccctctttt gtggctttcg agaaacgccg gagttccgtg tcctgctggg 3001 ggttgggggg atggagaaaa ggtctcgggg atcttgttta aaaagcaaag gcaaacgcac 3061 cccaacagct cctcggtagc tcccgcggct ggcgcggccc cagtttcggg cggccccgcg 3121 cggggcgtgg gcgcgggcgc tcgggcggcc aggtgcggcg ggggcggagg ctgcatcccc 3181 agcgcgccgg aatgccccgg ggcgcgggaa aggcaaaagc agcctggcgc tctccccacg 3241 gccccagtct ctatctaccc agaacccggc ttcaacagcc caaggaactg agaagaaaaa 3301 ggaacctgcg ctgggaaaca ggggctcctg gcgggctgag tggtcgctgt tatggtagag 3361 atagaaacgg agaggtagag atggaccgct cagccatgcc aagcgcgaag cccatccgcc 3421 cagccagaaa gctcggatat cccttgagct tcgtacagtt ggtgaaaaaa taacagaatc 3481 accaaaaaag aaaaaaaaat tcccttcgtc tgacacgcgc tcccttggtg caaatgaaat 3541 gcggagggac acgcgagtca cccagttgcg caaaaccgca cccccgcggt tccctgggga 3601 attacttccc ccgtggccca aactgtgaag cctggtgtca gagagtgtct gcgcacccaa 3661 gatagcgggg gtgaaaacag accccctcgc cggttgcaga aacccttaag tcccctaagc 3721 aagtcccacc ccgggaggga gtgccttatt tgccatttgg gggatccttt cccacaaacc 3781 tgaaggtgcg ggagtctctg gaacacccaa gaatacacta gagtctgggc tactaccgcg 3841 attgaataca aatcctgcca actgacagaa ctggaaaaca gggttggtgg tggagataca 3901 agaaaaatca tgaacgggag aaaagcattc ctatggtagc ttgtattttg cttgaaagaa 3961 aaatagagaa aaagagccac cagcaggtag atcgtggata aacagcttat tgcacttgcc 4021 tgtaatttcc actttccgtt atgaaattgt ttgtcaataa ttgcaaatgg ctacagaaac 4081 cactttagtt taccttggtc acttgcttgt gaataatacc gtgaaagatc tctgtaggag 4141 gcagaatcag ctgtgtgcaa ttcattctct ggttcttgaa aacgtgtggc ttgataattt 4201 tgaaaagaaa agcttcctcc gtcagggaaa atgaaccaag gttttttaaa atgcaatttt 4261 tccattttag agtcaactac aggttgtacc atgtctaaga tagagaattc ttgcttttga 4321 ataagggatg tggggagaaa tagatgacgg gaggctttgt aggagacaat tgacccggaa 4381 taaagaagaa gatactgaga acacagctgt atactgttcc ttaggctcaa tttgttctgt 4441 ttggattgtg ctagctagtt ctgccctgtt atgcaccgtt cgaaagttat ggctgcttga 4501 ataattctaa aaacatctac atccaaaagg aatagagaaa aggaagagct gagctccagt 4561 gagtgcagga gaaagttaga ggtctggaga gatgtgctct tttcaaagtt ggctgagcag 4621 gttgaagctg ctgcggggct tgtttaattg gtatttatgc agcaaataac ccgtctgaca 4681 ttcacattgg caattgcgtg ccaatctttg tgtatgaatg ttccattgtc ttcagggcac 4741 gatgtacagc acactacaca catgtttttt tccttccaag tttccttcct accacctttc 4801 tcaggtagtt agaatgccca tcataaagat gaccttgctt tagttggggg gagtggagtt 4861 gctctgtccc agtgtcttaa aagaagatgc tgcttgaatt tctctcttcc tggacccaca 4921 gccaagacct ggccacatct ccatgtctgt caagttggct ggcttgcgga ggttcaggaa 4981 acttgctgtt ttcttgatag actctgactg ggcatcccca gactcacaaa tgtggttcat 5041 tcaataggga cggccacttc atgttgttcc caggagaggc tatgagctga gagttggctg 5101 gcttattgtt tccatttcaa gagagggttt tctttctgct aggaattgga atttagattt 5161 gggctttctc tcttttttct ttgattttca tttctacttg gtaatttgac aaattgcgac 5221 aaactcaccc actcactggg agggacgctg agcataatat tgtctcatcc caggacaccc 5281 tttctcagtg gtgggcaatg ctgttggaat gatttgagag gtactgttga gacgctcctg 5341 actcctagct ctctgctctg gcttgttgca ttgctagcct taaccattat taatggtgat 5401 atttctttgg taaataaatt atggctctcc taggaatcat tttgagaatg cagatgttcc 5461 atttggggat gacagtaatg catcttcagg ctttgatacc tgctaatgaa tccactcact 5521 ccctggggag ggccaaggta atgaaagaac aaaggcactg agggaaggaa gagacctagt 5581 tctggggagg tcagcgcagg gcagggcagg gcaggaggtg aacagaagta ggaatatcta 5641 ggagagccct cagaggagta tagattgttg gacacatgtg ttgccaattt catacttttc 5701 caggcagggt tatgcctaaa gctagaaagc tgctggctat ttattggctg acttttccag 5761 tggtcatgac ccagcggtag aagcattata ttgtacctac aacttaggaa ggtttccggt 5821 tgtcttgttt tcacaacagg agattatctt gtcaccactt tcttcagaat agcgatttag 5881 atgctcagtg tccatgacct ttagttttcc aggtcaggca tggtagttcc attgacttct 5941 tcttgcgtcc tttccagttt tccttcaagc tctgtgctct ccatataggc ttgccagggt 6001 tgactttttt tggacctgtg accgttgtta aggtctgact agacccaagc gtgatgagaa 6061 agtggttttc ttagatcttc taagtacaat acaccagtaa catattttag ttctaggaca 6121 gattgttaga ctaatagtat ccctgtgcat tcttatagca cttggcagtt tataaagtac 6181 tttgacacac attatattgt tggatttcac aaactgggag atgggggaaa atcattcccc 6241 attttagaaa ccaagaaact gagatctggt taagtgactt gcccagtgct atacagtgag 6301 tactggggga agtaagacta gaatcagtgt ctcctaggac tctagaacat ctcactccca 6361 gtcctcatta tcaaatcagg gttagtttga gatccagcat ggcctcctgc ccccatgggg 6421 catggagtaa gggaaggaac ataatggcag tcctgcattt gaattcttgc ctgagacttt 6481 agacacagtc atataatctt tctgttttca ggttccttag atcctgaatg gaggctgtga 6541 taccatcctc atagttgtat tttagaggga aatgaaataa tgtgtctgca gctccaggca 6601 cactgcaggc aatcagtaag tgttgatgtc acccactttc tctcttctta tctgatacat 6661 attataaaaa gtagcttctc attactttaa atatggtctt tttgtgcaat agctgtgtta 6721 ttggaaaaat aaaggcacaa attgaattct cctaaatcag attttttttt aaatgcctgg 6781 gatttagcta tttaaaggaa acctgtggtg aattcttctg taatgcaaat aattactttc 6841 caatttatct gtttagtaag tcgagcttct cctgtatcag attttttaaa atatgaggct 6901 ggcttgtgta gaagagcttt cacaatcctc tcactgatga aaaaaagtca ctcaacttgc 6961 aaacaaaatc agttattgtg ttcagttggc aaagccttgc tctgatgttt gagcataatc 7021 tcatattccc ttattattaa gtaatagaat ctcagaatag gaagctggaa gacaccttag 7081 agatcacgtg gtccatctca ctcgtttcac agatgaggaa atggaagcac ataggggtga 7141 ggggacatgt tcagtgttac cagccagtga ctgtcagaca aagatccgat tccaagcgtc 7201 tgttgccaag ggagagctct gtctgcccca tcctcactct gccccaccaa gaatccatag 7261 acatgcacac atattgcagc atgggagatg atacagaaat acgatgcatt ttcaaaacaa 7321 ccaaaagagc tgcaagtgtg agcagtcata tgaattgatg agaatattca gcttaacctt 7381 tttgttaaag agctgaatgg aatgcagcat aggaatgctg ctttggcctt tttatgagaa 7441 aaaaattaca attcttggat aatggtattc ctcagtgagg attttaccct atcatatttt 7501 aatcatacct gctaaatgcc caaggaagat gtgatggcag acccagagag accctgcatg 7561 cgttggtggg tgtgttccac ctgacatcac cttgacatca acctggccag gagacctcat 7621 ttggagcaga agtgtttgga caaaccgcat tggatcttct cagccctttg agagattcag 7681 aaatcacctc cctggaaatg cccccagatt gtcccttcct aattaatatg ttagtgttct 7741 tccagagcta agcattgaag atgtgcacct ttctgaaaga ttgcagtgga ggggcagggc 7801 cattcccttt ccttgtgctg ccacagtgag tccctgcagc actggggtct ttctccaagg 7861 acaggaccca ccttctggga cagatgactt tggtggcact gccacttatc agctcacgct 7921 ctcaggggtt ggggccatat tcaatccctc tgagccttct tttctcattt atacaagagc 7981 taatgcctgg tgtatgaaaa tttatggaga tactgtgtta gatacttgac actggtacag 8041 agaaggtgtg tagaaaataa agcagaattc tgggctctgg ccagtgagga ggtatccttc 8101 tggcctttta ggaaacagag cacatttagg gggatacagg ctcggggtcc agccagggtg 8161 cctgcttcca ggagtcttag gccaggggac tgcccatgga gggcccaatg atgtatgatg 8221 tgccctgtaa gtaagggtgg gctggcgggg gagtgcacag gaagggctat gagggttcat 8281 ggttaagcct ccccaaaggg ttccacggcc tctccctgtg ggcccagaca tagtgctgag 8341 aggatttgtc cccctctgag attggtttca ggagtccaga cctcacgcct ccttgaaaca 8401 gtccagcaca gtccccgctt ctgccacatg tcagaggcag gtcagtttga gagaggagct 8461 gtttatactt taaatcaaac acaaaacata caatagaggt atttggatct atagctctga 8521 aagattgtga tttgtttaaa actcacagac tccactagga tgagcttgga ttaaagctca 8581 caagaattgt acaagagcaa gataagtaat ttaagattat acaatgaaac ctaactgatg 8641 tagtttctca caaactttca aatatatttt tccttaggtt ttttaaaagg ggaaactaaa 8701 ccttctggca gtggagtctg atttttatga caatgtatca gattttgaaa taaatggaag 8761 atttaaatat tttataagta gctctgattt tattttttaa agtgacaaac gaaattcagt 8821 aaaagctcac caactgggca aatacagaaa gagttgctct gctccagtag aaaatgaatt 8881 acagactaag ttagcaacaa taactcctgt ttaatgagcc agatactatg ctttccatgt 8941 gtgctgtgtg gcaggttcta tcacctttac agacaagaaa agcaagcttc agagaggtta 9001 catgcaatta attctgtgtt ccagagacgg aaactccagc aaaaattcaa gtgagctgtc 9061 caataaatga aactgaaatg aaaccaaaac tgaacctcta gttttccatt tcagtgcatt 9121 cagccactaa acaaaactga ttcctgcctc actgctattc attccccaat ctttttttaa 9181 aaattaaagc tccaaacaaa gctatgagat gccaaagaga gtaaacaata ctgcaattac 9241 caataatttt aaaagcgact caattttatt actttctgga ctgtacagca agtggaacta 9301 ggttaatgcc acatggatag gtaaggatgc attgtgatga acagatgggc tgtttgtatg 9361 ttcgtctata attgaaagcc atttattctc ttaagtgttt taagtacttg ggggatggct 9421 tggtgtaaag aggaaaggtg gtcccaagac ccatccactc cccctagaca tttctccttc 9481 accttattta ttcatttagt ctgtactctg tcagaacaag ggtcggctca gccacgtgga 9541 attggtctca aattctcaac tagtgatcag agttaatgag gttttactat catgtgactg 9601 gttttccttg tgaaactgtt aaacacaatc ttacttctct taatgggtgg ccggtgtaat 9661 ttatggccta tcacttaaat ggtttcatcc agaaaatcag gttgactttc tgagtgtttc 9721 cttttgtgga atgcctgagg tcaggctgtt attgaatcac agattttctg agggaaaagc 9781 agctgctagg agaatccagt gtgagggact aaacagtgaa gtggtgagaa aaatcagaaa 9841 gtattttggg ggcagcacag aggaaatcta aggcttaatg tgtgcccgga ggcagggaat 9901 acttgtctta ggctgcacag aactatctca gccgagtggg ccattggctg gctaggctgc 9961 caggtgtttg ctcagaaata tggggcagtg ggctgctatg agccctcttt tctaatctgc 10021 aagtgcaaaa aatgcagctt tctttgaagg tgtttgggca tattttcagt tatcccccag 10081 ggaagcacca tctttccttt cggggcaaga aggataaaaa gaaaatttta tgaactggaa 10141 cttgaacaag tgaatggaaa atacaagcag atgttggttg gctagggtac aggagaactg 10201 tccaaatttc aggtttcaaa cccatatata aataggttag aaatgagcat tagattaacc 10261 atcctgccaa agtaaattgg tgctgtttat tctcttaatt agctgtgcat tacaaatctc 10321 agattccctg tgttaagtca gttctcattt tcagtcacat ctcttcaaaa gtcgtttgaa 10381 aaaaatctta cggaattaag ggtgaaggcc gtaggaaaaa ggctttgtct gtagatttgc 10441 gtgtagcctt tttcacttca agttatttct agtcttgatg aacttttccc cactgagtca 10501 ggcaaaatta ggcagctctt caattgattg acaactctga tttgctccta gtgattctgg 10561 gatcaaggca tgtttccctc agaactgtta atatacccat tgctcccaca ccgacaaatt 10621 aattggaaac cctgacaatg gtaataggaa tatcttcaca tgtcacctta cctgtgtgtt 10681 tcattttcaa gtctttgttt tcttaccaga gaataagaca atttatatga aatagctgac 10741 tttctatgtt aatttttttt cacaagctac aaaacttaca cattttttag ctctttgggt 10801 ggatttcttt ctctctttca ctttgtaatt gaataatcac tttttcctac tgggtaattt 10861 gaggcactaa aaaaaaagtc attaacagat gactggcttg ttgaatacat cacagaaaat 10921 aatatttatt atgattttgt tgacaaaatt agaatgctaa attcagatag ctgcaaaaaa 10981 aaaaaacaga tttgaaaaga aatggtgcca tcagttgcag gatttttacc ttcatggaaa 11041 taaggtttat tgaaataaga tgataaataa aagtgctttc tggtttggga gaatcaacct 11101 acttcctgat actgttctgt ttctcttgct ccaaaaatat gatcaatctc cctgtttaga 11161 tgcagtgcag ccaggtgttc agaatcacag ggggacaaaa agaattagtt tcttactgaa 11221 taggggattt tttttttaag aggctggcgt aaacagggct gcaaaatgtg aaacatgtag 11281 acccactgaa ctgaaccccc ttttttcccc tcttcccagg aataattctg ctacaaggct 11341 gatttcaagg acatgaattg ttgacctcat cccaacatca gaacctcaga tgttctaatt 11401 tttgcaccat tccaggcaag ttgatcttat aaggaaataa aattgaacct taggggtctg 11461 atggaaattc actgtgacat tcaaatcaag aaaacttgct aatgcccaca gagccttttc 11521 cccatgggcc ctgatggtag cctccagaag gtgcagcctc aggtggtgcc ctttcttctg 11581 tggcaagaat aaactttggg tcttggattg caataccacc tgtggagaaa atggtatgcg 11641 agggaaagcg atcagcctct tgcccttgtt tcttcctctt gaccgccaag ttctactgga 11701 tcctcacaat gatgcaaaga actcacagcc aggagtatgc ccattccata cgggtggatg 11761 gggacattat tttggggggt ctcttccctg tccacgcaaa gggagagaga ggggtgcctt 11821 gtggggagct gaagaaggaa aaggggattc acagactgga ggccatgctt tatgcaattg 11881 accagattaa caaggaccct gatctccttt ccaacatcac tctgggtgtc cgcatcctcg 11941 acacgtgctc tagggacacc tatgctttgg agcagtctct aacattcgtg caggcattaa 12001 tagagaaaga tgcttcggat gtgaagtgtg ctaatggaga tccacccatt ttcaccaagc 12061 ccgacaagat ttctggcgtc ataggtgctg cagcaagctc cgtgtccatc atggttgcta 12121 acattttaag actttttaag gtaggtaaag gcattgtctt tctgaccatt gtgaaagaac 12181 aatgaaatgc tatgactcct actgcaggtt taagaagaaa gtgaagatgg tgaaggtgct 12241 catagaccct tttggtgtta ccaggttcct caaatgggac attctatatg gggcattcta 12301 catgtattgg tttagttcca ttacttacaa ttaactggat taattttgac tctttttttt 12361 ttttcctgaa caaagagaaa ggtgaaaagt tctgtgatat attgggtcct tgtcatcttg 12421 ggataacctc tttgttaggc tcattttaaa ttaaacaagc atttttgaag agaataatga 12481 actcagcgca tttatcatgg aaaacatttc tatggtattt gagctattta atttactgag 12541 tgaaacttct ttaaggtggg atgctctgtt atgacaaaag tttgttcatg aagaaacagt 12601 atttattact aatgtgtaga tgggagtgac actctgggag atgagatggt cataaaattg 12661 tcaggcttat tataactaca atagagcaga accttattaa ttcagactgc tctgatacag 12721 acttggaata attatatata attcttattg cattaaaagt tcaaatttag caggataaat 12781 tagacttttt gatttaggtt tggtgatcta ttacccatat gttgcattta catattgaat 12841 aaaattggaa atatggaata atatatgtaa aaatactttg aaaagtataa agtgcagtaa 12901 aaataaagtt attgttaact caccattttc attaactttc tcattttatt tattaataac 12961 caatatgtag tcttttgtaa tagacataat ttgtttaatt gaaaatactt tcactctggt 13021 ttcttactca gggggacatg aatttttcaa cttaatctga attaatgagc ctttgtttta 13081 tattgtgaaa aatggttcct ggctgggtgc agtggctcac gcctgtaatc ccagcacttt 13141 gggaggccaa ggcgggcgga tcatgaggtc aggagattga gaccatcctg gctaacacgg 13201 tgaaaccccg tctctactaa aaagatacaa aaaattagcc cggcatagtg gcacacgctt 13261 gtagtcccag ctacttggga ggctgaggga ggagaatcgc ttgaacctgg gaggtggagg 13321 ttgcgatgag ccgagatcat gccactgcac tccagcctgg gcaacagagc aagactctgt 13381 ctcaaaaaaa aaaaaaagaa aaaaaaaagg tcctatgcac attagattta gaaatattaa 13441 aagaatatgg agaagttgga cagatgacag gtattgtgac aaatgtgaac aaagtgatgg 13501 gaaatgtcac acaaaagatg aaaggaattg aatcaaagga tttctggctt ggcagatatc 13561 tcagaggcac atctagtctg gcccactcat ggcactggaa tcctctttgt aacaccccct 13621 tgagtggctg tgcagcctct gtatggattt ccctagttta gaagagaaaa tggtatgagg 13681 caacttaatg gctattttca gatagatgaa attttctcaa ataaggagga tgctaagcag 13741 aagctctctg cgtctaagga gtactgaaga ggcacaaaca ggcttcaggg tgtgctggag 13801 acatttgagc aggacattaa gaagcgcctc ttgacagtga gaaggacaca attaaaacat 13861 gtagccagag ttgctctagt tctccatttc tggaaacctt caagggaaac taggcagttg 13921 ttaaccattt ctcttagtaa gagggactcc ttgcttagag ggaagatgtg gaaccagaat 13981 tctccaggct tcctcccaca tcttgggatc tagtattcct cccagctacc ccggttctgt 14041 gccatatggc attttctgct ctgggttggt gtccagctca gacatttctg taactcttct 14101 ctctgatttg aactccctcc tcacctgctc caaggcatct cttttctatt tttaacttag 14161 caactgtggt tcaaaaacat cacttttaga ggagaggaaa agtttcctaa gaatcttctc 14221 cttttggttt ccctctcctt ataaaccaat tcgggttcct gtcaatacca ttgtaattcc 14281 tgtgaatcct ggctgtttcc tgtagcctgg tgcccagcca aaagtgggca aagatttgag 14341 gagtgtggtg gaggagataa ggtattcata tgaggttttc caagcttgtt tgcatttttt 14401 tttttttttt gccatagaat cagccaatct gcaaaatggg aataataaga ttgtgttaga 14461 ttgttgggta tatttaatga gaaaatgcag gaaggcttcc atcaggacct ggtcatgtcc 14521 tttggttttt tgggctattg aaggaaactc caggccacct gaaggcaaga gggtgttttg 14581 tttttatgca gcggggatct ggcttctctc ttctctattc ttccctcact gactggttag 14641 ttctcttaat ttcttgggta attctggagg gaccccatct gactgggtga gagggttact 14701 actgactctc tgggcagagt ccaatgcccc aggccatctc ttgagacacc caccaacaca 14761 tgaaagggca gctcttgcta cccttaactg aagtgcaggg agtgggtggg tcacttatcg 14821 ccaggagggg gtgctgttat ggtagatgtc atgactaaca tgatcggttc actggtccag 14881 agtgcgcgtt taacctacag aagcaaatct agttattaac caagttattg gacttgaaaa 14941 tcgtctgtaa ttcaaaagtc cgtcagtgag atgtcagtgc ctagtccatc ttacaggtga 15001 ttctggaacc ataagaatgt gaattagata atagcaactt atgctacttt agtttgattt 15061 ttatgtgact ttcaggaagc actccatttg tagtattaat tataaagcaa ttgggataga 15121 aaattgtcta ctctatgtat ggatgacttc tgtttataca tatatgctag gatggacatg 15181 aaatattaga atagttatca gaggaatgat tagcagttaa tgcattggag attggcatat 15241 agctcctttt tgaaaaataa tacttttctt aaatactaat tttccaaagg ttatgacttg 15301 agtaaaagta agttttagga cagttttctc aagcttaggc cctttcaggg tagcaaaaga 15361 cccacagaat ctaaaaatat gcccagaaat taatgtattg ctagttattc ccagataatg 15421 gagttaaaga aggtagagca tctccactga agatatgcaa tgaaccttga tcataaggaa 15481 tacaaacagg gtcttaattc atgaacttac tcactgtcct actcaacaat aaacataaat 15541 tggggtttta agaatgtgat tgcgatgtgt ttaaaatttt atggaaaata tttgggccaa 15601 atttggaagt gatccctaaa tgtctcagtt tccttttttt ggacaacttt aggaaggcca 15661 tgtattttac ctgcagtatc ttcctgaact aaaatgtgaa ggagacaaca tcttccatgc 15721 agccccattt accactttct tcagtcttgc tcctcctgtg tcagcctctc cagtccttgt 15781 attcctaagt gttctcctaa tcaaaacctg atgcaaggga tctaggtgtt gtcagctatc 15841 tctatgttag tatgtctgtg tttaagtcag aggagcattt tatttccttc aggatcacta 15901 attgtggaat taaatgggat ttgtctttgc aaaggagatg ctggcatcag ttggggaggt 15961 tcatttagcc ccaacagaga ctgcacttag tggatagcca cctcctctgc tggaatacag 16021 gtggccctgc tcatctgacc tagagggaag tggtgtcatc ctatctttgt ttcttttctt 16081 atccaagctc agaagcaaat gaaaaaaatg ggcctcttta actttagaag agggttgaat 16141 aagggcctgg agctggacac aacctgggaa cagaagatgc catgtggcgt agatccatag 16201 gatactgaat gctttactag attagtgata ctcaacctcg agtgtacggt ggggttcata 16261 ctgctgttgc ctcacccaca aggggtgtgg gctgcattat taaaactacc tgggaggctt 16321 ttcgaatggc aggatccagt gaatgatgtg tgcctcccaa gcccctctct gggaactatt 16381 tttattgaga tgagtgtctc atatccctca ggtgtgttgg gagtggaaac aaatttgaga 16441 gttacagaac aagaatttaa tgatctgaac tttttagatg cactagtgta ccctaacaca 16501 cacacacaca cacacacaca cacacacatg cacacacaca tcgtatatca gctaacatgg 16561 ggcagagcag acctaggtga ttgcttcact catataacat gctaatggct ttagataaca 16621 gaagtatcaa ttaggataat gtgacatgta acggtagaat agcagaccca gaacagctcc 16681 atcctataca gaggtataaa cgacacctca ggggcttttg tactgcagac aacccagtga 16741 aaagaggggt ccagctcccc ataagagatt gtgtagtttt gaaagaattt aaaaaggaga 16801 gggtctttgg aaatgtttga gaagcccata aatttgtgtt ttataaagag ctcaaatttt 16861 ttctcctttc tagaacattt cctctgctct cctttaaatt aaagcaaatt gattactcca 16921 aacactaatc atatgtctta ctttgagcaa agattgaata ttctaagtta aaaatagcac 16981 attgatgctg gaggtaactc acattaatgt ccacaaagat gcactgggtt gtaagaggac 17041 actgtcaaat ataattcctc tctcttttgt ttttctagtt aactttacct ggattgttaa 17101 ttttctaccc ttgaaggata ggggttagat agtctgtagt gtatccttcc caaagcaatc 17161 aaatgagact tttaaataga aatccatttg tgggtgggct agataatttc accctatgtg 17221 gaagtgtgct gattataaat ggctaaaatt gatcttcatg tgaagtcaat gaaatactga 17281 gaagcagctt gtttccctgg cagtcttcgc ctaccctgct atgtttatga agatatattt 17341 tcttccaaga gactccacca attgtagcaa ccacttctat aaagccaagc atttttttga 17401 gttatttatt gcatacaatg ggttttatta gcttcaaaac tttgaaaagg taaatctgcc 17461 atatccgggc tcatctagag atgaactcat aaatgagtgc agtcagtaaa ctcagtagga 17521 ggtaaatttc atgttcaatc tcatttgatg gcaatagata atggaatatt gggagatgga 17581 cttcttcagt caactctttg gtactgtagc agggtttcat ttaactttga aaagagctca 17641 ctctccttag gtgagagagg actggtaagt agcacttaaa ctcccagcta cagagatctt 17701 tcaaaatgga gacttggtaa tctctttccc cttcttaaaa ctctttaatg gctagggttt 17761 tacccaggat gaaggtccac ttccactttg gggatccttt tctcactttc cttgactaga 17821 ttgtctcttt gttacaagta acaagcagct tgtgcttcct ctccttaatg tctctctcac 17881 tgggttgtag ttgtttacct acttgtctgc atcccccact agactgtaag caccatgtgg 17941 gcaggggcat gtctatctag tttgttgctg tattcctagc acctagcatg ggcctgtcag 18001 agaaataatg aatgaatgaa taaatcatat ctctctactc tctccctcat cccaatctca 18061 atgttggaaa tatgtgttga atagatagga aagaagtata attttgttta tttttcttgt 18121 atactccttt agaaaaagga gggcaatatg tttaggcttt gtgtccccac ccaaatctca 18181 tcttgaaatg taatccctat aatccccaaa ggtcaaagga gagaccaggt ggaggtaatt 18241 gaatcatggg ggtggttttc tccatgctgt tctcgtgata gtgagtgagt tctcacgaga 18301 tctgatggtt ttataagggg ctcttcccac tttgtttggc acgctccttc ctgccacctt 18361 gtgaagaagg tttcttgctt ctcctttgcc ttctgccata attatgagtt tcctgaggcc 18421 tccccagcca tgctgaactg tgagtcaatt aaacctcttt cgtttataaa ttacccagta 18481 tctggcagtt ccttatagca gtgtgaaagc agactaatag ggagaagcgg agaaagaagg 18541 aaacatgatt atattttgag agttgctatt ttcatgtgct ttctcttttt cctacaaatt 18601 tccatttaaa tgtattttta acattaagaa attgcgtgta tatttcaaaa ttgttgaaag 18661 agtagatctt aaatgttttc accatacacc aaaaaaaatg tgagatgatg gatttgttaa 18721 ttagccaatt taatcacccc atattgtaaa cataaaccat cacattatgc cccataaata 18781 catataaatt gtcaatcaaa aataaagtaa aataagcaaa atatatgaac acattctcat 18841 tttggaaggt ttaagcaata catagtatga aaggaaagtc cacttccaaa tgcataaatc 18901 cgttaataac acatgcacac atgattttat taacagatat gtgatttatg actttctttt 18961 tcacttaata tgtttgggaa aattttccat gttggttaat acagatgtac tgcagtaatt 19021 cttgatcctg gctgcatgtt agcattgcca gcatagcttt taaaaattat ggatgcccaa 19081 accctagcca ggaccagtta aattggttgg tgtagagccc tgcattggta tgttttaaaa 19141 gatcaccagg tgattctgat gtttggtcaa gaatgagaac tgcaggactt tattttttta 19201 ataattgctt ggtttttaat catatggatg gaccatactt tattttacaa atctctgata 19261 aacaagttac ttccctttgc tcttcaaatt tgttttttga aaaacagttg aaacaatgct 19321 agagcctcat ccgaatacta ttagctctag aatgaatttt ggtggtgtag ggccctttct 19381 tggaccaaca aaggggccag atgagtgaat tgctgaaagc atcttgctgt aagacagggt 19441 cacaggacac aacattccaa gtcatgtctc ttcgaaacat gcaataactt agaatcagca 19501 tggtggtcat atgctcagac agttcaggga gcaccatgac tcttgggcgg agcacatttt 19561 aaacttggtg aatttaaatt tcttttcttt cccctagcct taaatggcct gaaatcctga 19621 agtgaatacc gaagacatct taagagcgtg cttttagtga taaatgccta tgtttggggt 19681 gttttaaata atgaggttgt ttgaaaagta gtgagaagtc tttccataag aatggtttgt 19741 gactggatgt tatatatttt ggtttacaaa attgtctgaa aacctttcta gaataagtca 19801 gggtctatgt aaataaatct gcataaggaa agtactgtgc tctttctacc cattcaccct 19861 agagactgca ggtttttaaa ttttatggac ttcacttcag agcagtttta ggttcacagc 19921 acaattgaag actgcagcgg ttttaacaca ggtgtcttcc tgggctcttc tcatagctgt 19981 ttgcaaccaa aggtaatgcc ttcctaggaa gacttaaaat ctgaaggctg tcctgtgctc 20041 agtccaatgg ttatcacact gtagcatgcg agaaatcacc cagagtgctt gttgaaacac 20101 gcgttcctga gcccacctgc agagtttctg attcagtggg tccggggagg cccggggtgg 20161 ggtctgagaa ctgccatttc taatggttcc caggtgatgc tgatgctgct cttttgcaca 20221 tcacactttg gaagcactgc tccatacaaa ttgcttgctg gctgacttag caagtcaaag 20281 tgtgcaaaac aaaaatgaga tttaacagtg ttctgtctgg cagcagcttt tttcttttga 20341 cacacacctg gttggatttc tgttcatatt ggtcataatc acagcagctg ttgtgtggat 20401 ttatttttaa ttgtgttcat ttttgagttg tgttccttat taagtaatgc atagctcatg 20461 aataatgatc agtgcaatgg tcacttaaac acaatttctt aaagtagggt caggcagatt 20521 accattttac acatttttac tacctgagac agtagagcag actgatttaa gaaacttaga 20581 ttgtaagata tacattaatt gggaattgag aacagtaaaa gatgaaagag gctatgattt 20641 gtctcagcta ccagcaattt gtcttcagaa accatgacag ctacctcctg atttccatta 20701 tgcaatgaga ttgcaaataa cattccacag cagtcttaat caatagaggg tcacaagagg 20761 cccttgcttc cccacagtgg gaaaagggct tatttaccca tttaattgat ataacacact 20821 ttctggcaaa agagacatct cttttagctg taattggggg cttttcctat ccccaaacac 20881 tgtcctcaca acctgcgtct acacatctaa tctttatgta tcttgaatac ataaggaaca 20941 ccaggctggc aaggtgtagg ggtctaggca tagcattgtg cacagccact gttatctgtt 21001 tctagaatga ggtcagtctc tgctgtatta cttcattagg ctcagcaagc taatagacat 21061 gtgttctcaa tgttggccat tagtctccct ctaagacgct tcaaggctcc ctctgggatt 21121 agggaggctg gtttccttgg tgtaatacag ccctggctta gtcccagctt cagggacacc 21181 atggatctgt ctctgtatct cctgcagtgc cactgaggct tctttgggac atgtggatgg 21241 tgacttattc ttgctctagt gtcttactct tgcttcctct cacatggatt gagagtgacc 21301 tctagctttg actagtgctt gtactatttc tttatgttgt ggcaatttta tttgttagtt 21361 ttcagagtga tttatcgtct catctcagag tgtttgaact tatattattt ctagaacaaa 21421 taccatgtca tactgatgta aatttcttag atatttgcga cttatagata ttatgtttgt 21481 tattaggaga tgtctgttca aattatagat tattttaacg tatttatcta gtggtgctga 21541 tactcgtgag taaaatagat taatatgaat tatatgaaat aggttgggct ctgtagtcag 21601 actgcccgag ttcaaacttt gacctaatgc ttgttaactg tgttacctta agcaagataa 21661 tttatctctc taattcttaa gctttcttat ctgttaaaca acccaggctg ggtgcggtgg 21721 ctcacacctg taatctcagc actttgggag gccgaggcag gtggatcaca aggtcaggag 21781 ttcaagacaa gcctgacaaa catggtgaaa ccccgtctct aaatacaaaa ttagctgggc 21841 atggtggcat gtgcttgtaa tcccagctac tcaggaggct gaggcaggag aatcacttga 21901 acctgggaga cagaggttgc agtgagccaa gatcgcgcca ctgcactcca gcctgggtaa 21961 cagcaagact ctgtctcaaa aacaaaacaa aaacaaaaaa ccccagttaa taatagtacc 22021 tttttcataa ggtttttaat gaagacaagt aagatatagt tcagagtcct aagcaaaatc 22081 cctgctccat aggaggcacc ataaatgttt gttatgatga tgataatgag gaggaagata 22141 tttacatctg ctattaagag gatattcatt ttgagtggta cctctcctcc agtttctcag 22201 ccattcagac tgacttgccc tattttgtga acaaaggcag attagcatta aaggtatagt 22261 cattggtcaa gaaacagaac tgatgggcca ggtgtggttg cacacacttg taatctcagc 22321 actgtgggag gccaaggtgg atggattact tgagcccagg agttcaagac cagcttgggc 22381 atctgagacc agcctgagca acatgacaag atcctgtatc tataaaaaat acaagaatta 22441 gctgggtgtg gtagcatgcc cttgtagttc cagctactct ggaggctgag gtgggaggat 22501 tacttgggcc tgggtggttg aggctacagt gagccatgat catgccactg cactccagcc 22561 tgggaaacag agtgagaccc tgtctgtctg tctgtctctc tctctctctc tctctctctc 22621 tcacacacac acacacacac acacacacac aaagaagaaa cagaactgat ggcaggtcta 22681 gattggctct ggcatagggc aagcctatct agttttcttg aaggccaaac ttgagactag 22741 gacttcatca gctcatgttc cagccaagtg agctaagaag gtggtgtact gctatccttc 22801 caaatccatg tgttgggtgg taagagtcag ttttattcaa aaaatggaag acccagagaa 22861 acattagttt tctatttttt taaataaagt tgcagtaaat tttaatatcg tttcagcgcc 22921 aaaagaaaaa tacccttagg attcaattgt ttaaaaaaaa aagaaaaaga aaaaaatcac 22981 caattctggc ttacagaggc cacaggtttg atggtggagt tgtggttgag catgaacagc 23041 tgaataaccc caggggtggg tggcagccct caagcctgtc ctttgggttg gaattccctt 23101 tcttttccat gtgtggtaag gacagaaaca caggcctgca ccagatgaag aatgttacct 23161 tttgttgagg gaataatttt tttttctgcc agagatggct gacatgacac taattcctat 23221 catctctgat acaaatgtca gaaatgaaag cactaaaagg cttcagaagt acttgttcta 23281 gaagtcacat tcattttagg acaatcagaa aaacagtgga actaaatgta ttctgattca 23341 cactatgacc tatcttgtgc cacctgttta aaatgtgact aattcagtca aacatatcaa 23401 ttatgcccta gtaaattctc atttcaatca aatatggata atttgctcaa atgtaaaagg 23461 acttcacagg gaaagaaatc agtgcagagt ttaatgtggt ctagtcctat cctagcattg 23521 tgaggcagtg ctgaacagtg cctggtatag cttctaaacc cagtaagtgc taaatctttt 23581 ttttttcccc catcaagaga agtagaatag tgtaagagat gctagcattt aacccaccta 23641 ttctccttct gtaacttagt aatacaggta ggcaggtgat attcaatcaa cccaccaaca 23701 ggtctatgca gcatcccata ggtatatgcc gcatgttcga cattgtagtg gataatatga 23761 aagaaaagaa gactctaagc catcaatgtt acttgtactg tagttgaggg gaaatgacca 23821 aaatcctgta aaataaccac gagcaatata aaaccgtgta aaaagggcta cagtgtggtc 23881 ctggggttta aaagagaggc tgacttcatc caaggaagct gcagttatca gccaaggttt 23941 ctcaatcagc tctgttatag gctgctgcaa cagaatacct tagactgggt ggcttaaaca 24001 atagcctttt tttcccctca catttctgga agctggaatt ctgagatcag ggtaccagct 24061 tggtgggttc ttggtgaagg ccctctccct ggtttataga tgctgtctcc ttgttgcatc 24121 ctcacatggt gctgaaagag ataagaaaag ctctctcctg tctctccttg taagggcgct 24181 tacaagggct ccacgcttat gagttaatta cctcttaaag atgccatctt taaataccat 24241 cacaatgagg gttaggattt caacatgaat tctggtgggg gatacacatt cattctatag 24301 cacaaggctt catgcagtaa tatgggattg ggctgggctg ggcttaaagg aatggatagg 24361 agtgtgcaaa ctgagaggcc gggagaggag gaagagtctt ccaagactgg gaatcctgtg 24421 aacttttacc tgatgccagc tcatgttcag ctatgtgggg gtggaagtat gtggaaacct 24481 ctttccctct cacagcatct ctctgctatg acaaaacttc aaaggattat tttaggagtc 24541 tctaggagga ttaaaaatgg ggaaggttgg agtgctgtgt tcccaacttc cattggacac 24601 aaaactgggg gtcccagtaa aaacccaagc cagtgagcac agagcaccca gagggcacaa 24661 tgctgtaccc tcttaatcag cattgtagaa cagctaaaca ggggtgaatt gctgttgaat 24721 ttaaggaaag tgttatcaaa cctaggagaa tccctccccc caaccctttc cccatccgtt 24781 gcactctggg aatccatgta actcaggata aaacattttg ctttattttt catttgccac 24841 atgcatatct tgactgttta cacggttatc attcaatgta gatacacaat taattgctaa 24901 gaaagaaatt aaatgcattg tcaatgaatt tattttattt acttcattcc aataagaaaa 24961 gccaccattc ttgaggggct cacggttctt ttccattttc ctaacccttg agtatacagg 25021 gagtaatcat acctacactg tctcagtttc aaaaggtggt cgtagatctc agcaaggtaa 25081 aaatttgaga tttgacaata ttccaagtgt aaagggtttt gatttggaaa taaattctgt 25141 attgaaaatg gcataaatga ttatttttct tctttatgct gcagcacttt aacttgtaat 25201 ttctaccctg gttaacccca cctatttaaa tgagagctca atgtgagatc ttttgttgcc 25261 tttgaaacca ctggatagaa ttttaaatcc ttggatattt ttcctctgcc tgagaaagga 25321 agatacagga aatttcattt tttagtttgg atagctttat ttcatttata ttcatgtagc 25381 ccaggcatct agtgagtttt gctggtgtat aaattaatat ttacaggtaa aaccttgctg 25441 aaaataaaat agattaagtg gagaatgttt ttcccacagc tttcgtctct aacaaaaatg 25501 aaaaaaccca gcaccactcc tttaccgaat gccccatctg tcagctcctt gctgctctga 25561 gccaacatcc tctcgggatt gtttggggcc aagggtctgc atgtggagca gggtgggttt 25621 agcacatgct gggagaatag ggattttaat taagacttat atttaccctt aaatatttct 25681 gtgaattatg gagaactcaa attaaaacct gtatctttgg actgaagtta cagtaaaatt 25741 tcagcctctg ctgatcagtg gtatgcatag tatgtgataa attttttcag atattaaatc 25801 actgatctaa aatattgtag tggtatagga ttgatgaata aatattattt taacacctta 25861 tttttaaaaa atgttatgat gtttaaatta atagcgttaa tatttcattc cagtaaagtg 25921 gtacaagttt agattttgaa gtaaaatatc tgcatgataa gaattccctc aaagaatcta 25981 tgcccatcta gaaaatggtt cttatgaaag aagacagttt tgcattgtac tttgttttaa 26041 acaacctaga atgaagttct agtagtagag ttccatggtt ttcaaataaa atggataata 26101 ccctatgatt catggatatt ggacatttta aagatgtttt aggaagcttg gcactaaata 26161 gttttttttt taagtgatta tttttcctaa ttctccaaat gcctggaatc atgaaacctt 26221 agaattggaa gagacttttg aggttttctt ctatagcctc tcatccaaac caaagtctat 26281 tcaagtaaga cccttgatag gtggtgattc aatgctgtct ttgttcttcg atttgagtta 26341 aacatctctc tgctcaattc tcctgtcttg tgcatctctt tttcatgaga cagtccttca 26401 ggtgtttgaa gaccccgttc ttcctgagcc ttcagttctc caggctaaac atcacctgct 26461 ccctgtaaca gaaacagtgc aagaggtaat gtttattgac agcttcccat atgccaggca 26521 ctatgctaaa catgtaacac gcctgatctt attggatcct tacaacaatt ctgtcagata 26581 ggtatggttt tcagatgagg aactgaggct tatctaaggt tacatggaaa gtaagtgggc 26641 atgtgatatt caaacctatg tgtctgatgg actcaaaagt ccatgctttc aaccattgct 26701 ttttactgcc ttttagatgg tatgttttca aattatttct gtatggtttc ctttctctaa 26761 acacagtcct gctgttagtt taaccccatt ctaagttaaa ataatttttc cttgctattc 26821 taaaagactg cctttgagca ggaaaattaa gccttttaat gcagaagact tcagctctgt 26881 aagcctcata ctctccgtat atcgctggct agaatggcag atgggctggc cagcacaccc 26941 atggtgggca gtggcctcag tacaaggatg gagtctgact gcctgtgacc ccatgtatct 27001 tccctgctgc cggaatgatc acatggcttc cagaaacgtt gtgttccttt agcagcattg 27061 atgggagctt cctattgttc cagatgtttt ctgaggagga tactggataa gagtttgtat 27121 tattacagat ggtcagagca gtttatgtat caggaagttt tccatggaag gaagaatata 27181 tccaaaattc tattttgaag ctgatggcag gatgtggtga acttatttgg atgttaggga 27241 aacaccgtaa acaccaaaat aaatacttgt ggtttgcgct tactcctttc ccaaactaaa 27301 ataggacctg tgacagcatt tcctgaacac atgagatcaa gctctagatc aaatgatgag 27361 ttgataaaat gaacatattt atattaagct tttcatttgc ttctcgtttc catgtctaga 27421 atctagagtc tagaatggtt ggatggatgt attaggacca gggtgtgtat gatattattt 27481 atgtccgaag aacaaactac tcattttcac ctctgtgggg agagactgga gttctaacca 27541 tgtctcctgt accttggcag aggaagagta ttgggttctt tcagccactt ttttgggggt 27601 ggacttggtc ggcagcccga cttatccatc cttgtcccag tttcagagcc tcccagtggc 27661 gctgctgagt tttctcccat cactacagag tccgcctcct tcctctgtta acccttgcga 27721 ttcctttccg aggctccttt aacctaaact taatgtattt gactctacct ttactttgaa 27781 aacacttttg atctttttta gagttcctct aagttttcct attagataaa aacagctttt 27841 tatacctttt gtttccatgg taatgaattc caggactgct ttagaaaacc actagctttg 27901 tgtgccagtg gaaagttgca gcattatgta ctaatacaat aaacatcagc ccacccgtag 27961 gaggcgtgca agccggtaga aacactgtag caccacagtc atgaaatgat gtgagcatat 28021 tattttttac tgcaaaaagc agaatttgga acatgagccc tgcattgcag gcaaatggtt 28081 tgggagctct agcatgaatt ctagtgttca ctaatcagct caagacagaa agagccaaat 28141 taacttgagg cgttaatttt caaatttaac gttatattga aaccatggaa ccccatttcc 28201 aatttttttg caaccttcct gagtttttac aagtgccagt tctgctctta ggaagtccta 28261 ggtgggagag gcttccttgg gcaatttact ttttggtgta aggaagattg tttccaaaaa 28321 attcagggaa atcttcctgg ttttgtctgc aaatatacct cttaagaatc tcaggtacag 28381 atcagaactc aacagaaaaa tctattagag ttggctaaag acagggcaaa cataaaaagt 28441 agatttcctg ggaaaaaatc gaaatatttt tataaaaggc tataggtttt tgaatacgtg 28501 caaggagaag gaataattgg aatctgtttt ttgtcaaccc tgactctgcc ctctacagcc 28561 ttctctttgt tatatcatgg gctgggggag tgagaagcaa gagaaaaggc agaaagaaca 28621 tgagaggttg gaaagaaagg ttagaggtgt gtgctaagtt gattgtgcta atttgaaaag 28681 agacaatgaa tcaaagctcc ttttttccta ttgtagattt taatatttaa agttaccagt 28741 gccactactg attatattaa gatgctttta tgttgatctg attaaaatta catttgaaat 28801 aatgaatgac atgtaatgta caaataccct tatatttgga ataatgaatg acgtgtacaa 28861 ataccttccc agatatatgt attttttaaa gtgtaaacat tatcaacaag tgtaatgaaa 28921 gtttttagaa gcattcttct aggaggtaca aagttaaggg tctagaaaca cagcagcaca 28981 cggtttacta caagactgtg gggctatctt cttgcctgga aggattcagg actcctgtcc 29041 ctaataaaat taaagggagg ttctctggaa cttagtatct gttttgtctt agtatttcta 29101 agtcagaaaa tgcctattat aaaatatcta tataatatat agtatatata ttataaatat 29161 ataatctatc tattataaaa tagatattat atctatctat aggtagagtt taccaacgct 29221 gactgatttg tacccagctc atttaagcgt ttggggcact gctccaaatt gttgcgtttc 29281 tttcttcccc ctgacctagc tgcatcagat cattctcaaa gacaccttag ttccattttt 29341 tttttttttc cagcagatag gttgcacaca aagcatagct tttggctttt tgtaaacaga 29401 gtgcaaatgt actgttctcc cccaggaggg aaactccacc atgctgaagg tgagctgaga 29461 ccctgagcct tgacaaggag ccactttggg tgtgctggca ggtcagcctg cgggctttga 29521 gaagagcctt ccataagcac cagggcacac attcaccaat gcaaggcaaa gctctgtgga 29581 aaagcctggt gggttcatcc ctgtcccacc ctagatgtac ccccagaaca gcagaaatga 29641 ctgtgctccc caccttgtga aaagggtcta gttttaaagt gaccctggca gaaggagggg 29701 cagatgcctc cattcctcct cttcagtttc tccttggacc ttttcctgct gttctgtggg 29761 ttctttgagc attttataga atccaatttt gatttatcta tattgcttta gagtgtatct 29821 tttgacatgg ctttttaaac atgtagacat acatgattta tcattgccta ctattattgt 29881 cattttatta ggtccagcga agtatagaaa cctagacatg ggccaggcac aggggctcac 29941 acctgcaatc ccagcacttt aggaggccaa ggcgggcaga ttgcttgagt tcaggaattc 30001 agaccagcct aggcaacatg gcaagacctt gttgctacaa aaaagtaaaa gaaaaagaaa 30061 attacctccc tttacccttc caattatatg attgtcttat ttcttttgca taccttgaga 30121 actgcattag acagtgttat gattttttaa tcatcaactg aatctagaaa actcaagagg 30181 agaagcaaag caaactcgat tgtatttata tatatttttg ccatgttttt ttcttccttc 30241 ctgatgttct gagacattct ttcttttatc ttttcctttt tgtttagaga acttccttta 30301 gtcattattt tagggcaggt ctcctggtga caaatcctct tagtttttct tcatctgaga 30361 atgtcatccc tgaagaatat tttctctgga tatagaattt tggattgata gttctttctt 30421 tcagcactta aaaaatgttg cactgcttcc tactggcctc cttggtttat tttgcgaaat 30481 cctctgtcat tcaaattgtt cttcccctat ggcttcatgc atcatttctc tctggctact 30541 gtcaaggctt tttcctttgt ctttttgtgg gactgtgatg catcttggtg tggatttctt 30601 tgggtttatc ccatttggtg ctcagattct tgattctgtc ttttgccaac tttggaaagt 30661 ttttagccat tatttcttca aatgcttttc tagccctacc ctctttttcc tctccttctg 30721 ggagtcagat gacctacctt taacatcttt tttttgtagt tcaacacgtc tctgaggctt 30781 agttcatttt tttccccagc ctaatttatt ttgattgttt atgggtaatt gttttatctg 30841 taagttcaca gatttttttt ggcctctgtt ctctttattc tgctattgaa cccatccatt 30901 gagttttttt tttttttttt acatttcatt tataatatat ttcagtccca aagtttctac 30961 ttggttcttc tatcttctgt ttctttgctg atacttctca ttactttgct agttctttgt 31021 agttttttgt ttcaagaaca tttatagttg ctctttgaaa cccatatcca tctggctgcc 31081 aggaggagga gtcctgttat ggttctccat gtgacctcca ctcccacggt tggaatggct 31141 tcattactgc tgagtggtgg cgggtaagac ctgattctcc accagcctcc tctgacacct 31201 ttgcagccac attgccacct agcagagatg aaaggccagc ctccacactc agcctctcag 31261 aaactatcct ggtggaggtg ttggggctcc tcactactgt ctaggttccc cactaggtct 31321 ttgcttgtat gggtgggtcc acagttttgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 31381 gtgtgtgtgt gcgtgtgtgt gtgtggtgtg tttcgagtag agtggttatt atctaaaagt 31441 tttcattttt ttcttcaatc taaaagaagc taggctgccc ctttcctggt tctttggaca 31501 aagagaggag ctttttttgc ttatttgttt ttggtctatg tgtattggtg tttccaggtt 31561 gctggctttt caggtataga tgaggaaata catgagggaa aaagaaaatt caggaactcg 31621 ttgccatgtc attccctggg tcctgaggtc cttaacccac ttggcagctc cccacctttc 31681 ggtgtcttct tatgcttgtt ttagatctaa tgctcagttt ttagctagac ttatgggaga 31741 aacagggaag agaacattta ctctgccttc ctgtcaaagc ccctcttgat ccttacattt 31801 gtctacacca gatgctcttc ttcttgttcc tctctcaagg tcagagaaca gggtgtactt 31861 ttgagaacgt ccattaattc ttgtacgaag cttagcttgg aacaggaact gctgcagggt 31921 agaaccagtc tcaagcactg gagaacagga ggtagaggtg gtgaagtggc ccagtcactt 31981 gctttgtggg tagacagacc tgggttcaaa tcctgtttct accatgtatt tgttgcttaa 32041 ccttgatcaa gttatttaac ttctctgacc agtattgtcc ccacctgttg caaggattag 32101 aagggacata tttctggtgc ctggaacata ttgggcacta tacatgaata gccattattg 32161 ttacaaagca ggttataagt tctccggata ttttgtcaat aatctgatct ctgctgtgaa 32221 agaaaccagt gcaaactcat tttctaggat gctgttttat gtaaacaagt catagttaaa 32281 taattactcc agagtgacaa acctttttaa ttttgtatcc agataagaac atcagttaga 32341 acatgtagca agaacagtga ttcctctgag aaagtagggt agagtatctt caaaagcaat 32401 gggtggcttg ttggtgattt atgttttgag acagggtatt accctgtcac ccaggctgga 32461 gtgcagtaac gtgatctcag ctccccacag tctccgcctc ccggttccaa gggattctca 32521 tgcctcagcc tcctgaatag ttggggccat aggtatgtgc caccacgccc agctactttt 32581 tatatatttt ttgcaaagat gggattttgc cacattggcc aggctggcct tgaactcctg 32641 gcctcaagtg atccacctgc cttggcctcc caaagtgctg ggattactgg catgagccac 32701 tgcatgttgg tgatgttaat gctgtaatga aaaccctcat tctctgttta agagatttca 32761 ccatttaaga agcaaattaa gggacactgt gggggcattt ctctgctccc taaggagcac 32821 cgaagactaa atattttagg gagtgttatg ctgctgatga cctcagcttg atgattcact 32881 gacttcagag tttctttcag atcctaacta caagtcaggt gagagcttga gagggcagga 32941 gggagggact tggtgcttct ctcaggagga agagtggact tctaggctgg aagggacaag 33001 ggatattcct gactgaggga ctggattgaa tgttgatggg ctctggataa agattgtcat 33061 ggaggccatt tgagcactgt ggggcctgga ccagtatact tgcttggaat gctcagagag 33121 aaccccagtg tcaggttaga ttccctagaa gcagagctga atataagaga atcttagcaa 33181 aatgagttat taacgagcat tctcaggaga aaggatgcac tttcaggtga ggcccattct 33241 gagcctgatc ctgtggatgc tctgaagcat caatcatagc agggtgtgtt cttccttgag 33301 gcaggaaagt aggaattttg tatctttgca tcaaccagtc ttgagctacg ggcaacctgg 33361 tcctaagtga taattcccgg gcatacttgg gcaaggtggc tatatcaccc aagagtggtt 33421 ctctcgagaa gattagaggt gtgaggttag cagtagcact ggtgacaggt ggaaaaggaa 33481 tactgggcag ggctcaacgg ggtctacagc cccaggcttc atcggtggta ccatgtgaaa 33541 accactggcc tgggtaacag gaggcttggc ttgtactcct taattttctt tatgaccttg 33601 agcaagtcat tcagcctctc tgccttacat gtaaatgaca agattggcat aggtagactt 33661 ttaaggcctg tttgactcag gtgagttgat tgagaacact tgatttagta agagcagggc 33721 ttcctggcct atggttggac aataggaacc ttcaagaaga gcaaaacagg gaagtatgaa 33781 gttactgctg catggctagt ggcccaggca ggtgactttg gggacttgga agtgaagagg 33841 cttccttacc ttgggggcta gggaccagaa ttatacctgc atgggccaag ggtggaaggg 33901 ctgcatggct tggtgaccaa ggggacaaga aggaagacca tgtgaaggac tgagcctaga 33961 aagcagattc tgggaggagc caagcttgcc tcagtggtta caatgggatt tactctggta 34021 agaagtttga ctaaatatgt tcagcggctg gggcacagtg gctgacacct gtaatcccag 34081 ctacttggga ggccaagata gggggagtac ttgagcccag gagtttgaga ccagcctggg 34141 caacatactg agacattgtt tctacaattt ttttttttta aaattagctg gacatggtgg 34201 catgcaccta tagtcccagc tacttgggag actgagatgg gaggatcatt tgggcctggg 34261 aggtagaggc tatagtgagc catgattgca ccactgcact ccagcttggg caacagagta 34321 agaccctgtc tcaaaataaa taaataaaac attaaaagaa tgtgttcagc atctgaactt 34381 cagctttttg tattctgtca acattttttc ttgattttca ttccattctt cattagatgg 34441 aaggcttatt aaagatcctt tttttgtaat ttggtaatgt tattagtaat attttaagat 34501 aattccttga cttgatagtt cttttactgc agcttttgcc ctggtcagtg gaggaggcat 34561 atgatattgt attttcatat atgtccatgt gctggaactg gacagctctg ggggctagtg 34621 agaaaagata aagttgggaa gaccttagcg atgtcagtcc tggagctgtt cgaagttagt 34681 ggagctcctc cctctactgc agacagtttc cttctctagg atagggtttc tgatgagaat 34741 atttcattgt tctctaacac aagtagtcaa tgttttgctc agtttctctt gctgaaataa 34801 tggtatagat tcagtaggtt ggggcggtag gaagattcca gtttggtgca gcaaaaggaa 34861 agtcttacca aggagtgagg tgcttgtttg ctggctcctg caaagttcag gtattgctga 34921 gtgggtagat cttttaagag gaggagtctg tttgctccca ggtcagcaga tccgtggtgg 34981 tctgctgggg cttgtgcccc tcagatgcct tgtctcccct gggtggagct tttggggtgg 35041 gtatagtggt tcatgggtac atcgctgaat ttgcataagg ctagcatggc aatggactct 35101 caaaatatct gtggtcataa ataggaactt acttaaaatt ttaggcctgg gcaggaacat 35161 cacttgaggc caggagtttg agaacagtct gggcagcatt gtgagacccc cacctctaca 35221 aaaaaataat tagctgggtt tggtggtgca ttcctgtagt tcagttgatc aggaggttga 35281 ggtgggagga ccactggaac aggatttcga atctacagtg agctaggatc atgccactgt 35341 actccagcct ggatgacaga gtgagaccct gtctcaaaaa taaaagaata aaatattatg 35401 cattactcta aaactagttt tatttacaac gaaaaagtaa ttttaagtac ctccttagaa 35461 gatttagtaa aaatatgacc tatggaataa tttctgtcat gaaagacaaa agttctttac 35521 tatatttcta tcttgaggta actattgctg tatttcagtc ttgaggtaac agctgacatt 35581 ctcaggatgt ggcttccttc cttggctgag atgtcacctg gacagataca gatatcacca 35641 tttgctacta atgatgattg atgttgtaaa aaaaaaaagt tcttctatgt caagagggca 35701 cttggaaagc aggccataaa gagcagttct caccatcgga aagcatttgt tatcactcat 35761 gcttttatgt ggcaaacaca ttgcagttct tggcataatg agaggcaggt ttcaggagtt 35821 actgctgcaa ccaactctta cctgccaaac tataaagaca aactattttg aggattaaag 35881 gatgggatcc ctttgtgctt tgtggaagca gagctgaaag gttcataatg acagacagaa 35941 ccattaggag gcatgaaaga aaaagaagca gcaaggctga aagtgttgtg tgctagaaag 36001 gagttgatgc cctttgaaca ttcaagttca aataatgtgc catactttat ctccaaagac 36061 aacactactg acccactgtc agtagaccat gggctgtggg ttgctttcca gcagagagga 36121 tggattttct tcagttccgt ctactcatgc ttgacagaga tttcctttgg gacaagtacc 36181 acgccccaag gagcccaagc ctggagaaag ggaggctgga tcatcatggc tgacgctcac 36241 tatgcaccag gttcccttct cagcatttta ctcagttgtc cttcacaacc aagcacccta 36301 tgaagtagca atgcttatta tcctcacctt acattgagga aactgaggct gaaggaggca 36361 aaataacttg cccaaggcca cacagctagt aaatggtaca gctgggattt gaactcatca 36421 agtccgatgc tagagctgaa tgaggctctt aagcattgta ccctgctgta gtgtagttag 36481 cagttggaaa catggatgct ggagtccaaa gaacatggat tcaaatcctc attccccagc 36541 tggttgctat gtgaacctgg gcaagtgatt caacactgtt ggggcaagaa aggaagtcag 36601 aagcctggga caaatcacga cacataagaa gcactcaatg catgctggcc actgttagtg 36661 ttttggttat tgttttgttt ttatcctgga acagagcgaa gctaagaggt agtggttctc 36721 tccttgaggg tgaataagaa agaattaact gggcattacc aatgttgaca atgtggattt 36781 ctgggcccca ctcatgagag atgctaattc agtggctctg gggggttgga tggtaaatgg 36841 ttggacccag tcacctgcat tctaaagaag ggctttcagt gatatggagc ggtggacttc 36901 agactggcca gaggcggcat gtgcaaggct ttggaattta ccaaaattca gagcttaatt 36961 ttataattga tttgtttagc tataactgag tgacagagtg gaacactgat attctgtaga 37021 ctatctctta atcattaaaa aacaaaaacc agcacaaaag aaatcaccaa aaggaatcaa 37081 tatgttttac agttccagta ctggccaatt cagaatgtgt gctggagcaa acacgggagt 37141 atctaacaga cctgcagaga tggaagggat atgagagacc tacgagccac atcctggttt 37201 tatagctgct gaaggtgggg gcccccaggg gtagaatgac tctcttagaa accatatgat 37261 tagttaatgg gaaagtctga aacaggactt tgatatttgg ccacctggac atctctggac 37321 aagaggccaa aactggaacc ctttgggaag ttctgattcc ctgaattcag ggtgatagca 37381 tgaagtaagg aggaatctca cagctgcttg ttgctgcatg tgctttttct gtctggatgc 37441 ttttttccac catgcctgcc agtaacttcc agcttttctt tagggtcttt ggtaagctgc 37501 cttttcaagg acagtttccc tgcctgtgtc cttcacttgc tcccaacaga gttcattatg 37561 tcccctttca tcactcctga acctaacggc attttttgca cttagttgat gttttttgtt 37621 actgtcacct ttcctctgat gcacttaggt agtacaagta gagccaccct cctattttgc 37681 atgcttgcag acatcactga ccatttatcc atttatgtat tcatccatat gtataccaca 37741 gagcacccat tacatatgcc aggtacattt agagaatata gagaaaagta aagaataaat 37801 gagaaaatat acaaataact tgataattta agatactgat atggactgtg aagaaaattt 37861 aaaaagggtg atgtgacaga taaatgatat ggtgggagag atccttttta cagggtcttt 37921 agggaagggc cctctcaaag tgtgtctcat ttaagctgag actcatgatg agaagtggga 37981 cccatgtaaa gaactaagaa tattctaggt gggaggaaga atatgtgcaa aagagctaaa 38041 gtgagaatga actaggcacc tttgaagagc agaaggaagc tagtatggat agaatggagt 38101 gaacaagagg aagactggga gacaggcagg gccacatctg ggaggacctc cttgtaggcc 38161 acagagtttg gattacaacc tgggagtaat gggaagctat tagaggtgct tcaacaggga 38221 aatggcatag tggggtttgt attttaacag atgactcact agtttagact tgtcctcagc 38281 ttcttctttt tttttttttg tttttttttt tttttttgag acggagtctc gctctgtcgc 38341 ccaggctgga gtgcagtggc gcgatcttgg ctcactgcaa cctccgcctg ctgggttcaa 38401 gaaattctcc tgcctcagct tcccaagtag ctgggattac aggtgcatgc caccatgccc 38461 ggctaatttt ttgtattttt cgtagagaca gggtttcact gtgctggctg gaatggtctg 38521 aatctcctga cctcgtgatc ctcccgcctc ggcctcccaa agtgctgaga ttacaggcgt 38581 gagccaccat gcctggcctc agattcttaa tacaatgctc caggcaatct ctacttattg 38641 atcaatttgc acttaaggca agaaaatttt taggcattct agctaaactc tttgagggcc 38701 aagttcctgt tttaatcact tttgtttgcc tagcactttg taccttgcaa agagcagatt 38761 cttggtgaat atttattgta tgaatgaatg gtttctgcat tatttgatcc tttggcataa 38821 caaagtggca aagctctctc tgctaggtga ctttttagca agtccactga atagtaggtg 38881 gtagcatatt gtgaaaatgg attgattgac acctcacctc acctctgctc atgttaaaca 38941 ttttttttag ttctagaaaa ttcagttgta attaagctgg gaggtaacag agtaggacct 39001 ctccaaagcc ctgaaatgac tgcctcgcta gcttttaagc atcagataga gaaggcctgt 39061 tgaacttctg cttctagtat tatgaagaat agatatttgg agaaactctc ccaattacag 39121 acactgagaa atgctgaatg taatataaaa aacaaccttt taaatatgta gctaagctca 39181 caaaaattga atgtaagtct ctacagaggc aaaaatggag agagagctga caccagggtg 39241 gtgagtgagg atctaagata tgggctgcct tgggagtgtg tgcagataac agaaactgaa 39301 agcttcagtt ttaattgcca tacaacaggg atgagtaggg aggggtgcat tgtggaatgg 39361 gagatgagga cttgggcgct gttacaaggt caacggaact gattaagagt ctgcaaaagc 39421 caggaatctt gaagatctac accctcagtg aaaagatgga ctagaaaaat ccatttggca 39481 aagaaaaact gctatgagaa attgttctgg ccttggctgg gggtggggaa atgtctctgc 39541 tgagaattct gaactcatga aggtttaggg cttaaactca cattatccac atatgccgtc 39601 caggaaacct caagcaaaag aattgttaag ataattccag tttgagagag actcaggatt 39661 ccagcagaat cggaagagaa tcagctttgt aggaatgttc ccacagctgg ggctctttag 39721 aattctcaca gattaaaatc aaccacatat gacttcacaa tgaaaagttt ccagttacac 39781 agagtaaaca agccatcagt ggaaacaaca aacagtagaa ttagacttcc caagaacttg 39841 aatttaagaa ttatcagtca gaatataaat gaaatgtcct aaaatgtttc tagaaataaa 39901 agatggattt gaaaacatga gcaaaatgct attaaaaatg tctaggcaga atttgaaaag 39961 gaaacaaaat gaaatttttg aaataattat attcaaaact gacaactcaa aaaggaaaaa 40021 aaaggtttta acatctgata tggtttgact ctgtgtccct gcccaaatct catctcgaat 40081 tgtaattccc cacctgtcag gggagggacc tggtaggagg taattggatc atgggggcag 40141 attttcccca tgctgttctg tgatagtgag atctcatgag atctgatgtt taaaagtgcg 40201 gcacttcctt caacacgctc cttctcctgc cgccatgtaa gacgtgactt gcttctcttt 40261 cgccttctgc catgattgta agtttcctca ggcctcccca gccacttgga actgtgagtc 40321 aattaaacct cttttcttta taaattaccc tctctcaggt cgttctttat agcagtgtga 40381 aaacagacta atacaacatc caaatatttt tctgaaaatc actgattaga gtttgagttt 40441 gttatactgt taaattatat acaaagttta aaaatcataa ggatatacca tgatgaattt 40501 ccataaatta atgtactatg taaccagtat tcagatcaag gaacagaaca ttaccagcat 40561 ctcataaacc ccctttctgt tcccttcatg taatctttta ctccaaggtc aactagtatt 40621 actccaaggt caaccagtat ccaaaaatag gggtgtgact attatgcaag tatcctg // LOCUS HSAC000115 95855 bp DNA PRI 31-JAN-1997 DEFINITION Human BAC clone GS188P18, complete sequence. ACCESSION AC000115 NID g1809230 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 95855) AUTHORS Tin-Wollam,A, Graves,T and Ozersky,P. TITLE The sequence of H. sapiens BAC clone GS188P18 JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 95855) AUTHORS Waterston,R. TITLE Direct Submission JOURNAL Submitted (31-JAN-1997) COMMENT Genome Sequencing Center Department of Genetics, Washington University St. Louis, MO 63108, USA http://genome.wustl.edu/gsc e-mail: sapiens@watson.wustl.edu NOTICE: This sequence may not represent the entire insert of this clone. It may be shorter because we only sequence overlapping sections once, or longer because we provide a small overlap between neighboring submissions. This sequence was finished as follows unless otherwise noted: all regions were double stranded or sequenced with an alternate chemistry; an attempt was made to resolve all sequencing problems, such as compressions and repeats; all regions were covered by sequence from more than one subclone; and the assembly was confirmed by restriction digest. SOURCE INFORMATION: This clone is from Genome Systems first BAC library. Cell line: lymphoblastoid Haplotypes: two VECTOR: pBELO Selection: chloramphenicol NEIGHBORING SEQUENCE INFORMATION: The actual start of this clone is at base position 1 of H_GS188P18; actual end is at 95855 of H_GS188P18 The location of this clone is unknown. FEATURES Location/Qualifiers source 1..95855 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="GS188P18" /clone_lib="GSBAC1" repeat_region 382..675 /rpt_family="ALU" repeat_region complement(987..1093) /rpt_family="L1" repeat_region 1365..1658 /rpt_family="ALU" repeat_region 2878..3169 /rpt_family="ALU" repeat_region complement(6747..7040) /rpt_family="ALU" repeat_region 7948..8236 /rpt_family="ALU" repeat_region 8254..8423 /rpt_family="L1" repeat_region complement(8723..8842) /rpt_family="L1" repeat_region complement(8911..9189) /rpt_family="ALU" repeat_region complement(9201..9418) /rpt_family="L1" repeat_region complement(10059..10122) /rpt_family="L1" misc_feature 10808..10937 /note="match to Human EST R73306 (NID:g847338)" misc_feature 10912..10934 /note="match to Human EST R65857 (NID:g838495). Defines an intron in the 5' UTR of GS188P18.1a." repeat_region complement(13000..13356) /rpt_family="L1" gene 14642..35159 /gene="WUGSC:H_GS188P18.1a" CDS join(14642..14855,27209..27628,29388..29441,35071..35159) /gene="WUGSC:H_GS188P18.1a" /note="coded for by human cDNAs R76043 (NID:g850725), R65857 (NID:g838495) and H12868 (NID:g877688)" /codon_start=1 /db_xref="PID:g1809231" /translation="MEEMKKTAIRLPKGKQKPIKTEWNSRCVLFTYFQGDISSVVDEH FSRALSNIKSPQELTPSSQSEGVMLKNDDSMSPNQWRYSSPWTKPQPEVPVTNRAANC NLHVPGPMAVNQFSPSLARRASVRPGELWHFSSLAGTSSLEPGYSHPFPARHLVPEPQ PDGKREPLLSLLQQDRCLARPQESAARENGNPGQIAGSTGLLFNLPPGSVHYKKLYVS RGSASTSLPNETLSELETPGKYSLTPPNHWGHPHRYLQHL" exon 14644..14856 /gene="WUGSC:H_GS188P18.1a" /note="GRAIL prediction, score = 82" /evidence=not_experimental repeat_region 15709..16003 /rpt_family="ALU" repeat_region complement(16190..16485) /rpt_family="ALU" repeat_region complement(20761..20785) /rpt_family="L1" repeat_region 22119..22393 /rpt_family="ALU" exon complement(23191..23226) /gene="WUGSC:H_GS188P18.1a" /note="GRAIL prediction, score = 92" /evidence=not_experimental repeat_region complement(23899..23930) /rpt_family="L1" repeat_region complement(26629..26920) /rpt_family="ALU" gene 29366..35159 /gene="WUGSC:H_GS188P18.1b" CDS join(29366..29441,35071..35159) /gene="WUGSC:H_GS188P18.1b" /note="coded for by human cDNA N49626 (NID:g1190792)" /codon_start=1 /db_xref="PID:g1809232" /translation="MFSSSFLDKKLYVSRGSASTSLPNETLSELETPGKYSLTPPNHW GHPHRYLQHL" repeat_region 30420..30550 /rpt_family="ALU" exon complement(31760..31873) /gene="WUGSC:H_GS188P18.1b" /note="GRAIL prediction, score = 92" /evidence=not_experimental repeat_region complement(31938..32231) /rpt_family="ALU" repeat_region 32449..32475 /rpt_family="L1" repeat_region complement(32531..32579) /rpt_family="L1" misc_feature 34470..34807 /gene="WUGSC:H_GS188P18.1b" /note="match to Human EST R28570 (NID:g784705)" misc_feature 34810..35216 /note="match to Human EST H53783 (NID:g993930)" repeat_region complement(37269..39221) /rpt_family="L1" repeat_region 38390..38798 /rpt_family="L1" repeat_region 39227..39509 /rpt_family="ALU" repeat_region 39511..39530 /rpt_family="L1" repeat_region complement(39523..40942) /rpt_family="L1" repeat_region complement(41413..42180) /rpt_family="L1" repeat_region 42559..42811 /rpt_family="MER" repeat_region complement(44448..45071) /rpt_family="L1" repeat_region 47263..47304 /rpt_family="L1" repeat_region complement(47672..47705) /rpt_family="L1" repeat_region complement(48128..48410) /rpt_family="ALU" repeat_region complement(49194..49489) /rpt_family="ALU" repeat_region complement(50054..50327) /rpt_family="ALU" repeat_region 51463..51561 /rpt_family="L1" repeat_region complement(51903..52332) /rpt_family="L1" repeat_region 51969..52344 /rpt_family="L1" repeat_region 52479..52596 /rpt_family="L1" repeat_region complement(56376..56433) /rpt_family="L1" repeat_region complement(57251..57376) /rpt_family="ALU" repeat_region complement(58296..58953) /rpt_family="L1" repeat_region 61435..61862 /rpt_family="MER" repeat_region complement(62197..62488) /rpt_family="ALU" repeat_region complement(62510..62957) /rpt_family="MER" repeat_region complement(67124..67415) /rpt_family="ALU" repeat_region complement(67661..67967) /rpt_family="ALU" exon complement(68835..68951) /note="GRAIL prediction, score = 80" /evidence=not_experimental repeat_region 69097..69116 /rpt_family="L1" repeat_region 69562..69617 /rpt_family="ALU" repeat_region 69628..69920 /rpt_family="ALU" repeat_region 70079..70376 /rpt_family="ALU" repeat_region 70413..70459 /rpt_family="ALU" repeat_region 70475..70621 /rpt_family="ALU" repeat_region complement(70878..70992) /rpt_family="L1" repeat_region complement(71013..71137) /rpt_family="L1" repeat_region complement(71138..71324) /rpt_family="ALU" repeat_region complement(71326..71405) /rpt_family="L1" repeat_region complement(71406..71697) /rpt_family="ALU" repeat_region complement(71713..72717) /rpt_family="L1" repeat_region complement(73186..73479) /rpt_family="L1" repeat_region complement(75741..75855) /rpt_family="ALU" repeat_region complement(75929..76382) /rpt_family="THE" exon 77792..77890 /note="GRAIL prediction, score = 92" /evidence=not_experimental repeat_region complement(78381..78398) /rpt_family="L1" repeat_region 80857..81210 /rpt_family="L1" repeat_region 85077..85117 /rpt_family="L1" repeat_region 85348..85480 /rpt_family="ALU" repeat_region 85986..86115 /rpt_family="MER" repeat_region 86152..86331 /rpt_family="MER" repeat_region complement(86893..87059) /rpt_family="ALU" repeat_region 89195..89351 /rpt_family="L1" repeat_region complement(89331..89378) /rpt_family="L1" repeat_region complement(89385..89706) /rpt_family="ALU" repeat_region 89707..89897 /rpt_family="L1" repeat_region 90427..90717 /rpt_family="ALU" repeat_region complement(91176..91467) /rpt_family="ALU" repeat_region complement(91707..91724) /rpt_family="L1" repeat_region complement(91746..92021) /rpt_family="ALU" repeat_region 93468..93775 /rpt_family="ALU" repeat_region 94838..95150 /rpt_family="ALU" exon 95214..95252 /note="GRAIL prediction, score = 93" /evidence=not_experimental repeat_region complement(95480..95623) /rpt_family="ALU" BASE COUNT 27189 a 19755 c 19527 g 29384 t ORIGIN 1 aagcttcata agaaaggcaa atgagctgca aagatgcaca tatgttggac aagccaagaa 61 caggaggaga ggaggctctt atttgttcac tttggtggca gcttaagcta aattgatcat 121 gtggcgatca acgttgattg ttagcaaaac taatgaacac atctgctgtg ctaccaacca 181 tcacctcccg aaatttcctc ttgtcccttt tattattatt atttgttgta ggaacactta 241 acatctactc tcttagtgga tagagcatga gctaaatttt ctcagcacac actcacaggt 301 aactttgtag atgttaatag ataatgttaa ttagtttgat tgtggtgatc atgtctacct 361 atatcaaaac accaagttgt tggccgggcg cggtggctca cgcctgtaat cccagcactt 421 tgtgaggccg aggcaggcgg atcacctgag gtcaggagtt cgagaccagc ctggccaaca 481 tggtgaaatc ccgtctctac ttaaaataca aaaaattagc tgggtgtggt ggtacgcacc 541 tgtaatccca gctactcggg aggctgaggc aggagaattg cttgaacctg ggaggcggag 601 attgcagtga gccgagatca cgccattgca ctccagactg ggggacaaga gcgagacttc 661 atctcaaaaa aaagcaaaaa aacaaaacaa aacaaaaaac aacaacaaca aaacaccaag 721 ttgtacacct tttaaaaaaa atcttatttt aacagagata gcctattact ttcaacacat 781 cacgaagagc cacaccaatc tctcaatacc aaaaactaaa gggagatgtc ctccctaaac 841 aaggaaataa tagatttgat gttctaaatt agaaaatgaa aatgaaacaa aacttcagag 901 gactccatct tcttctccta gccattacca actcagggtt ataaatatcc tctttaaata 961 aaactcaaaa ctgcaacttt aaaaaaagat ctaccctttt agcagatttt aagtatacaa 1021 tacagtatca ttagctatgg gcactatgct gtataataga tctccagaac ttatttatct 1081 tacataactg aaattgtcta ctctttaatc attaatgctc aggggttctt aaccaaagat 1141 ttgccttaat ctctcttgag tgtgcagctt ggtatttgcc cattattatg aggagtgaag 1201 attatgaaat actgatttta agtaatgcag tcttaatgga gatgttagtt ttaccaatta 1261 actatgacgt aaaaagattt ggatgatgcc cggctatttc tgtactgcaa atttagattc 1321 tgtgaatctc aattgattta taaaggagga aagaaggggg gatagccggg cacggtggct 1381 catgcctgta atcccagcac tttggaaggc cgaggcgggc ggatcacgag gtcaggagat 1441 tgagaccatc ctggctaaca cagtgaaacc ccgtctctac taaaaataca aaaaaattag 1501 ccgggcgtgg tggtgggcgc ctgtagtccc gcggctactc gggaggctga ggcaggagaa 1561 tggcgtgaac ccaggaggct gagcttgcta gtgagccgag atcgcgccac tgcactccag 1621 cctgggcaac agggcgagac tccatctcaa aaaaaaaaaa aaaaaaagaa agaaagaata 1681 aaagaaaggg ggacacatgt gttttatttc ctaccataaa atataataga tgattaataa 1741 acatttcaga aaaataacca ctgttccctt cctcttattc cagaaaacct ttgcttattt 1801 aatagcttaa gggtctaccc atcctatgaa gacacagttt tactgtccct tgctatagac 1861 tgatattctt ttgttcggtt tagttaagtt tagtttttaa attaagcaaa gaccagtagc 1921 attattagtg tctttggagg tttgcacaga tgaacactgt tcatgctgaa acatcctagt 1981 tgacttgcag ttatcttatt acaaattaga atttttttat taaatgcaag tttctcttaa 2041 aatatatttt aatatattta aagaaatggt atttctttaa attagatttc ataaagtggg 2101 tttgccgcat cataactcaa aatttatgta tttatgtaaa atttatgtaa acaaacttta 2161 aattatttta aattttgata gattttgcca aattgccctc cagaaatgtt gaattaattt 2221 acactcacaa gagtatgaga gtgttctatt cccatatttt tgttaatact aggtattact 2281 gaccttctaa gcttttgcca atctaatagt tgaaaatggt atcttatttt aatttgttct 2341 ttcctgataa gaaagttaag tattgtttta aatgtttgtt gactctgtat ttcttccttt 2401 ttgatttggt tgttcatatc ttttgctcag ttttttgatt tggttgttca tatcttttgc 2461 tcagtttcct gttggcttat ttgccttttt taatgattca tagaagtttt ttatattatg 2521 gatagaaacc atatggtact gcaagtagat tctctcagtc tgctctttgt cttgtaactt 2581 tatattggtt ttaggtacac agaagctcac cttgtgtcaa ctggtaaatc tttctcttta 2641 tgactcctga gcttcatgtc agagttggaa aggccttccc cacttcaata ttacatttat 2701 tgtgatatat ttcccataca tacatttcta tatattattt tagcagttta atagtttaat 2761 tttttatgtt tacatcttta accttatagt atttcctttc atgcgaagta tgagaaacaa 2821 atttgggtgt ggagaattaa gagctcagtt ttagacatat taaatttgag atgcctgggc 2881 tgggcgcagt ggctcactcc tgtaatccca gcactttggg aagccaaggc gggcggatcg 2941 cctgaggttg ggagttcgag accagcctga ccaacgtgga gaaaccccgt ctctactaaa 3001 aatacaaaat tagctgggcg tggtggcgca tgcctgtaat cccagctact cgggaggctg 3061 aggcaggaga atcgcttgaa cccgggaggc agaggttgcg gtgagcccag atcgcgccat 3121 tgcactccag cctgggcaaa aagagcgaaa ctccgtctca aaaaaaaaaa aaaaaaaaaa 3181 aaaaaaaaaa aaacagtttg agatgcctat tagaggagac gtctgggctg gagatgtaaa 3241 tttgtgagcc atagttacat agaagatact taaaactttg ggctcggatt gtcaaaagag 3301 agattacaga atcagaggag agatcccaga accaatcaat ccctgaggta atctgagatt 3361 tagggtttga acaaaggaga ctgagaagaa agctagtgag gtcagagaaa aacagtacaa 3421 ggtggtgcca taaagtgaat aagagaatgt ttgaagaaga aagaaaactg atcaactctg 3481 ttgaatgcct ctaagagatt gagtaggata aagacaaaga agtaggaatg gatttagtaa 3541 cagtgagatg attggtgact ttgacaaaag tagatacact ggaaaatgga agtttaatgg 3601 agtgggttta ggagtgaatg gaagtgagaa gtagagatag tgaccaaaca tggacaagtt 3661 caagaagttt tgctacaaag gggatcataa aaatgaaaat agagatgtgg gaggggtgta 3721 agaagcggga gatacatggg catatttgaa tgaagatggg aatgatccaa tagagatgga 3781 aaaattgagg ggggagagaa aggggataat caagttttaa gataggcctt aaacctgatg 3841 cttttcccca atgaatagcc agttgtctcg atatcatcat ttgattaatc catctgttcc 3901 tcggtaattt gaaacctcta aaaatatgaa attcccatat atatttgtac tctatattct 3961 gttccactga tcttccatga cttggtcttc caaactgcta agacagtttt caagttattt 4021 ccattgctat ttatctattt tattgagtta ttttatttca gcaattatgt tttcaatttg 4081 taagaactct gtcttattct ctgattactt ctttttcata gctcaccaat tttaccctat 4141 gaatgcaata ttatctttga catctctcaa gatgctaatt aggatttttc aaacatattt 4201 gcttctgtgt tccacattca cttttccctc agaagtccac tgttttgcct tgtcatctta 4261 gtgtttctgt atcatgctgt cggtttttct taaatgtctg gtgactggcc attgatacag 4321 aggtctagat tgctatattt tggtaactgg tttggggtcc tcagcagtta tggaggttac 4381 cattcacctg acaggcctcc tctctgaatg tgacagagat gtgcaggtga atggttgggg 4441 gtgagccaac aggagggctc ctatttatga aatgtggatt ggataggcac atttgtgttg 4501 cagtttgggt agctagtagc ttagaggttg ggaaagcctc ttattgggtg tatactgctt 4561 tagaggactt aactgctaga cctagatggc ttttgtctcc atatccttat gatggcctct 4621 aaattacctg ggcagtccca cttcactctc caaccacaaa gtccctggta ggctccctct 4681 ctatgcaaaa gaagagcttt atttagaaag ctattctcac tggctatagc caggggtgaa 4741 tactacagca agacactgtg cacagagttg gaacatggcc cagaagcttt ggcagacctt 4801 cgactcactt ctctgctttc tgctccatga ctcactctag cttctagcct ctgcaaactc 4861 tgagcttttt ctggtgcttc atttggaaga tacatgatca tttctccaat atgcttctcc 4921 tatttctatg ttagcagggc acgggttcca cattataatt ttcagacatt tcctactatt 4981 tcagtgaggt cacataacat agcacagacc ctagaggcca gactgcctgg cttggacaag 5041 ctatgtaact tctctgtggc tctgtttact caatttttaa aaatgagata ataatagtac 5101 ctgccttata gaactattta aatgctttaa tacatgtcaa ttgcttagtg tattacctag 5161 cccactgtat ttttctggtc tgctggtgat accttttctg gctttccagc actgctctag 5221 attattttct tcttattttt atatttattt cggcatttta gtgagagttt ggaacgaaca 5281 tgagattaaa tacagtggct tcatttatct ccttaaactg gaagcaagca cttgtatttg 5341 aaaacataat ggaatgaaca ttctaagtta gttttctaac attttggact ttaccctctc 5401 ttgatttact tagagtagct gtaagaaacc ttagagatca tctaacagag caccctcact 5461 taaaaaaatg ggcaaaggaa tgagaaataa cttccccaag gttgaaaatc tatttaggaa 5521 aagatctcag actagagtca agatctcttg cctcctagtg caatttccct accactaaaa 5581 ataagtaaat aaataatggc taacacattt gccaagatta gaaactccat tatatctgcg 5641 tgcgtcctat gtctttcctc ctcttagaac agtgcctcct tctattgaaa gccaatcctt 5701 ccacctgaac tttgatccca tcctctccac ctgctttctc cagaatttta tattattagt 5761 aacactttat ttctgtggta tattcaacct cttgctctaa ctggatcctt tccattggcc 5821 tttaatgatg tcaataccta tccctttaag aaaaagaaac aaaaaactca tttagcccgc 5881 cgaatcccat ccccagattc tcttctgtct ctccactctt ccacctccaa actcctacaa 5941 agtactctcc atattgtcag taccagctct tcctctccac ttctctcatc ccacttcaat 6001 ctgcctttgg cctcaacact ccagcaaaga cttctcacta aggtcactta tgacctccat 6061 gctgctaaat ccaacagaca tttttattag actgttctac ttctcagcag catttggtac 6121 aggtgatcag ttcctccatc ttcaaataca ttcttcttta ggctttagga ttctagcatc 6181 cctctggttt ttttcctacc tccttgccca ttctttataa gtaatgtttg ctggctcatc 6241 acttccatgt ggccagtaaa tgttgcagtt acacaaagct tggtcttatc cctgtttatg 6301 tttcactgtc tgctctctct gggtaagtta tagcccaagg ctttaacagc catctgtata 6361 cggctggccc ctaaatgtat atctctaact cagatccctc tcctgaataa ctgacttctg 6421 ttatcactca aaatctacat gctcaaaatg gaagtcttta ttccctctcc taaacatgac 6481 actcttccag tgtttcatat ctccgtgaaa ggcagactca tcaatcaact tatgtaaccc 6541 ccaaacctag ggattatcct gatacccccc cttccttatg tcaaattcag taccaaatgt 6601 gtccattttc tcttttcttt tattattttc ttttcttttt ccttccttct ttccttcctt 6661 ccttccttcc ttcctttctc cttccctccc tccctctccc tccctctctc tctctctctc 6721 tctctctctc tctctttctt tcttcctttt tttgagatag ggtatgactt tgtcacccag 6781 gctagagtgc agtggcgtga tcatggctca ctacagcctc aaactccagg gctcaagcaa 6841 tcctcccacc tcagtctcct gagtagctgg gattacaggc atgagccacc atactcagct 6901 aattttttaa ttttatattt tgtagagatt gaaatctcac tatgttgccc aggctggtct 6961 caaactcctt atctgaagcc attctcctgc cttggcctcc caaagtgctg ggattacagg 7021 tgtgagccac tgcatccagc taattttatt ttctaaatgt ctcttgaatc catccacttc 7081 tctctacttc caactccttt tccccaacct ccagatgcag ctactgtcgt ctcttcccta 7141 gacaatgaga acagttctct aactggccta ctcacattca ctgactcctt caatcttttc 7201 tgcctactgc agccagactg gtgtttttca aaatgcacat ctgattattt cattcccact 7261 cccaccgctt ccccacactc ctcagcacat atatctttaa acccctttaa aggcttcata 7321 ttattattag gaataagaca aaactcttca acatggcctg caaagccctg cttgctttgg 7381 ctcctgcctg ccttcccagc tcttctccca ctggttccct ctcattgcct ccactccaat 7441 cacatagctt cctctgctcc tgtcactcgg gtttgccacg cccctcccat tacactcttc 7501 cctcccttct ttgcttggct aactccttgt cttatcagat ctgagctcaa gcattacttc 7561 cacagagagg gtaccctgtc ttccctgatg aagtcaggga tccttatcac aggctcttct 7621 tttaccaggc attgagtcct caccgttgca gttgtacatt tacttgtgtg aatactggga 7681 taatttttct cccactgcta cttctcatgt gtccattcaa caaatattta ctgagtcctc 7741 actacttacc aggcattatt ttggaaaggg tgacaaaccg taaacaaata aatgtgtaat 7801 atgataccaa gaggtgaaaa gtgctatgaa gaaacacgaa gtggagcaac aggccagaga 7861 ctttgaaagg ctgctacttt ggataggata gttagagaag gggagctaaa cactgagtac 7921 acatggacac aaagaaggcg acaacacgcc gggcatggtg gctcacgcct gtaatcccag 7981 cactttggga ggctgaggca ggccgatcac gaggtcagga gttcaagacc agcctagcca 8041 atatggtgga atcttgtctc tactaaaaat acaaaaatta gctgggcgtg gtggcacaca 8101 cctgtagtcc cagctactca ggaggctgag gcagaggaat cgcttgaacc caggaggcgg 8161 aggttgcagt gagccgagat catgccactg cactccagcc tgggtgacag agcaagactc 8221 tgtctcaaaa aaaaataaat aaataaaaaa taaaaaaaaa gaaggggaca acagacaccg 8281 ggacctactt gaaggtggaa ggagggagag gatagaaaaa ctacctatca ggtactctgc 8341 tgattacctg ggtggtgaaa taatctgtac accaaacccc catgacacac agtttaccta 8401 tataacaaac ctgcacatgt acctaaagta aaagttttaa aagaaaggat aattaaagaa 8461 ggaagacatg actgagcaca gatttgaatg gagtgagaga actcatcaca gggatgcctg 8521 gggggtgggc attctgggaa gagcaaatgg tgagggtaaa gactagaggg tgtttagcct 8581 gttcaaggag gagccaggaa gtcagcgaag gggagtggag tgaaggggag agtggtaaaa 8641 gataagagca gagatggagg cagggaccgg gtcagggagg gccttgtagg ctatgctgtg 8701 ggaagagaag caactagaag cttttttaat tgatacataa tatttgtatg tatttatggt 8761 gtatatgtga tattttgcta catgtataga ctgtgtaatg atcaagtcag gatatttggg 8821 atgtcaatca ctttgagcat ttttgtttgc tttgttttgt tttgtgtgtg tgtgtgtgtg 8881 tgtgtatgtg tgtgtgtgtg tgtgtgtgtg tttgagacag gatcttgctc tgtcacccag 8941 gctggagtgc agtggcatga tctaggctca ctgcagcctt gatcacctga actcaagcga 9001 tcctcaccct ccccaatagc tgggacaaca ggcgtgtgcc accacaccca gctaattttt 9061 tttattattt gtagaaacga ggttttgcca tgttgcccag gcttgtctca aactcctggg 9121 ctcatgcgat cctcccacct cagcctccca aagtgctgga attataggcg ggagccactg 9181 cacccagcct ccaccttgag tatttaggtg ttgggaacat ttcaagtctt ctcttctagc 9241 tattttgaaa tacataatac attgttgcta actatagaca ctctattctg ctaccaaaca 9301 gtagaaggtt tgtttgaata tagaagaata ccttctattt aactgtagtt tagagcctct 9361 gttcttcttc ctgctcccac ccacacaccc ttcccagtgt ttggtatcta ttctctaccc 9421 ccatgagaca aacttttttc attataatat tttaaacaag gagtgactgc attccatgtt 9481 ctaaaatggt cactatggct gtcttgggat gaatggttta gaaaggggta aaggtggagg 9541 tcaggagtcc agttaggagg ctgtggtggt ggtggtcctg gggaaaaatg atgagggctt 9601 ggactacagt gaatgttgga gaggtagtga gaaagcacag attggaaagc atctggaagg 9661 cagaactttc aggatatgtg gatgtgttcc tccaacccct ctggtgtgtt tccttctcac 9721 ttacaggaaa agccaaaagc tgtagcgtgg ccaatgaggt cctatgttac ctggtcctca 9781 tcttcttttc ccctcactcc ctctcttcta accacactgg cctgcctttc tgtgaacaga 9841 ttaagcacta tctcatcttt tgaatttgat gttccctcag gtatcaggaa agcaggtgtc 9901 catgtggttt ttccccctca gggttctgct gaagtgtcac cttatcagaa aatccttttt 9961 tggccaccct ctataaatta gcacctagcc taccactccc tgttcttcta ccctgcttaa 10021 tatctcttta tagtacttat caccatctgg cattgtagat ataatgtaca tatgtgtata 10081 tatacacaaa aacatataac atttgtatgt atttattcca tgaggacagg gatttggtat 10141 gttttgttca ctatagaaca caacctccag aacagtgcct ggcacacagt tggcattcaa 10201 taaatatttg ttgaattaat attgatgaag tagacaaaga gaggagcaaa caatggttcc 10261 aggatttggg gcctgaataa tggtgccact tactgaggta atattgaggg gtttttaaaa 10321 tgaggattag gatcccctcc tccaaacctc ctgccctgaa ttgctttgcg tattgcagat 10381 tgagagaagt cctttctccc tctttggcca atggaagcct gtgacaagtc cagactccta 10441 ggggtctgag ctctcttttt cttcctatgg agagagaggg aaaaactaga gataaaccaa 10501 actgactctc agttcctgag actgatgtga catcggcttg atctaaaccc caaagctgat 10561 gttttatttg ttgttgactc tgtgtatggt tttggacaaa caattggcct cccagtttaa 10621 gagacaatct cacaaaatgc tttttttaaa ccaaaaagaa agaaatgaaa aaaaaaaaaa 10681 aagagctacc atgttcctca ttcctgcagg tgcccaacca ggttgacttg ttggagtaca 10741 ttttctgcat tcttaataga tagctagcag gaaacgcctt gtacagtccc agctagaggg 10801 gcggaaagta acaaggaggt gggggtacaa atcctcagct cctgcttccg caagcactaa 10861 cctgctctga agtgagccag gcagctctgg ccatcttttc ccagccacag aatcaggtga 10921 tggtccagaa ttaagaggta agagaatctg gggatttttt tggggggaga agcaaagaaa 10981 cttaactggg aagaaatggg ggtagtcatc aaatactttg ctcaatgaga tacaatctgg 11041 tccccttggg gatttcgttc caagtgattg gtttgacaaa ctaacccctc tgacttccca 11101 ccctacccca gactgctagg taccattttg gagaagtgcg gtaagtgcgt ccatgcaggc 11161 agattgtggt agagttctgg ccaaaatgct ggtttgggat tctgtgctca aatatttctt 11221 tctttctttc tttctttctt tctttctttc tttctttctt tctttctttc tttctttctt 11281 tctttttctt tctttcttct ttctttcttt cattttttgt tctttctttt ttttttttca 11341 gaagtgtggg caatatttga gggagcactt tctgtctttg tatcttgtga gatggggaag 11401 ttgcaaactg cctgttgttt agtaattatg ccgttctggc ggtaccactg gtaaaacttg 11461 gccttggggt tttgtctact acgagaggct tttgctataa gccaggcact gatatggaaa 11521 catcttctct gggagacagc caccctttct ttagcctcac tgacatcccc ttctatccag 11581 ctttgcagca tctctttctc ctaggcgaga cacaccccag caattaattc atgaatgaag 11641 agacctctta tggagggctc atttgaggat gaagctcctc atgtttctcc agaggtagag 11701 tgagatgcag ttcttcatag gactaagcca atgacagttc tcagaatttt aatattcctc 11761 aaatacagtt tctctccact ttggtccaag tgcagcccag ggagttgctt gtgagccgat 11821 ccagagaagt ggcctatggc tttccttttg attgtgtgcg gtgtggttgg ggaggaaaag 11881 aagggcatag gtggtactta ttcagtgaat actgaattta ttattcagga tgcccaattg 11941 cccacggact gagctctcca aggtgaagag gctgaaacac cacctctgac tttctccatc 12001 tctaatattt cctgactgta gccagttggc ctggggcatc tgtggattct acccaaggac 12061 actaggcaca tgcttcaggg ccctaatgtc ctcagggtcc taccagctcc ttcccttgtg 12121 gcttcaaggg gagttagcaa agagagattg caagctgtgt caagcctgaa ccttgagcag 12181 tggccactgg aaattctgag aatgagaggg tgagtcctcc aaagttgttc actggccctt 12241 ccagcagcag acagaggctt caaggagaag ccctgaggga gcagacagag ttgggagttg 12301 ctgcctgcgt tatctgaagg aaaccagatg gctggtttag ggttctggct tgaggccccc 12361 tagggattag gagatagatg gagagactga ggcccataga ggagaaggga ctattagacc 12421 caatgggtag tcaataaggg atggtaaagg taggacagaa gttaacgcca gaaagaccta 12481 ggttgaaatc ccagctctgc cactccccag ttgtgtagcc taggcagtct atttcacctt 12541 ctttaagacc caaattgctc atctgtaaaa tggggataac aatacctaat ttgaggatta 12601 tgaagattgg agaagatagc tgtgtagttt ctggcacaca gtaggtgttc tgtaaactgc 12661 agctttaaaa aagtactgcc tccatgcttc tcattgcttt ataattcata acaccttggt 12721 ttttatttct gtcccttcaa gtatcagaat agatcatgag ctttttctct aaataaagga 12781 ataatttacc tacagtgaaa tgcataggtc ttacgtgtaa aatttgatga gttttaacaa 12841 atttatccat tcatgtagta accatctccc tcatcaagat gtagaacctt tccaacaccc 12901 ctgagggctc cctcatgccc tttcctagtc aatctctcaa tctctaccct acaggcaacc 12961 acagttctga atttcatcac cgcagattat ttgtgcttgt ttcatgtatg tgaaattgta 13021 caatatgtgg cctttagtgt ctggattctt tcactcaaca tgtttttgag attcatacat 13081 gctgttgtgt agattggtag tttgtttgtt tgttcttttg ctgagtaaca ttccattgta 13141 taaatatgtc acagctaatt tatttattct cctggtgata aacatgtggg tactttccat 13201 ttttggttac tatgaagaaa gctgctgtga acattcaagt acatgtcttt gtgtggacat 13261 agtttttatt tctcttggct atgtgtacct agaagtgaga ttgctgggtc acaagataag 13321 aatatgttta actttataag aaattaccag tttttcaaag catttgttca catgtttttg 13381 gaaagtaagg gtgcaatgga ttcgtttttg agctagtacc tggttacgct tagcatgact 13441 gtcaaccaga ggaaacattg gttacttaac acctacatta tgctaggtgc taggaataaa 13501 gaaatgaata tgaaatgctg ttctcaagga tctcataaac tagagagaaa gatagtctat 13561 taaactctag tcattttcat taaattgaca agaatgaata ggtcaaaccc atctttagcc 13621 agagcacatg gcaagtcaga ttcggtgatc aaaggtaaag aagcatcctg acaggctact 13681 tccctagtaa aggaaggcca cagcatgtct tgagggccca cggtgtgcca ggacctggtc 13741 tgaatgcaaa tagccagcct ctgcccacaa ggaactctca aggtgggata tgatgatacc 13801 aatgatggaa cccaagttta gggcagttca atacacagct attgagcatc tactctattc 13861 caggcactgt tctagataca aaagatgtgc tggggaacag agctgacaat cttacaatct 13921 ttcaggaggt ttccaccaat ggtgacaaca ttggtataaa aatgtttttt tgtacatctg 13981 cattttgttc agaatatatc tgttggagtc cttagagatg gtgttgtttg taattgctag 14041 tttctaagat gggatgcggg taggtgtagg acacagtctt acatagatta aggcgtcagg 14101 ggagtaggag aatttgatgt gtttactcct ctgcgtgcga gtggtcaaga tggaaagcaa 14161 tgcctactat tgaaattgtc aagcaccgac tggattgcct ggtcaagctc tccatgggag 14221 cccatggtag ttctgggcca accttggctc attcgtttag gagaaaggca cttccccagg 14281 aggcaagtta ctctgctatt tggcctcaat gtcctcccac ctccccattt cctgatgctc 14341 tctctagtca ggttgcctga tcattctcat ttgtaagaga aggccccttc tcagcacttc 14401 atggagattt gtcatgttgt cccttgaaga ctgattagac aaggcttctc aacagcatcg 14461 tcattgacct ttaggagcta tgtggctccc tggggcccca gtgagcacgt gtacctggtg 14521 ggagcaaatt gcttgccccc aaggacactg ctcacccaaa ccacagccaa gccacttggt 14581 aaatagtgcc agtgacattt tgcctttggt ccacagctgt cacctgtgtc attcactcac 14641 aatggaagaa atgaagaaga ctgccatccg gctgcccaaa ggcaaacaga agcctataaa 14701 gacggaatgg aattcccggt gtgtcctttt cacctacttc caaggggaca tcagcagcgt 14761 agtggatgaa cacttctcca gagctctgag caatatcaag agcccccagg aattgacccc 14821 ctcgagtcag agtgaaggtg tgatgctgaa aaacggtgag catgtgggga gggagggagc 14881 tgggattccc tatcaaggct gccacggata caccagttct ttgatgtact tgacacactt 14941 ggacacttat atgtttgcta aaggcaagtt gctctgacta gtgagaagcc acaccccttc 15001 ttagaagttt aatgtcaaaa tggggctgcc atctactgct ccaaccttac ctccccagca 15061 ctgtcctcca gaaactggct tccttctgtc cgtcatgttc gctttgattt cttcctggcc 15121 tttgcacaaa ctgttccttg tctgcaaatc tcttccctcc cttctcctaa ttaacccaca 15181 ttcatcccaa gatcacttct ttagggcagc tttccctgac cacccacacc cccaagcatt 15241 caaatatcct attacagatt ctcctaccgt tctttaactc tccttcacag cccttgtcac 15301 agttgcagtt tttacacttg tataattatt tgactattgc atactcctcc attaaatgaa 15361 agttctgtga gggcaaggac cattgctgat tttgcttaat attgtatttc tagcatctag 15421 aaacagtgac tgtgacatag gggacaacca acaaggattt gttgaatgag taaatgaaca 15481 tacaaatatg caaatttgtg agcactgtat ctaggtacaa agaaccttga aatttccata 15541 aaagactaaa acataaatta gaatcttgct gaataagcca ttagagcaga agtatgttat 15601 gatagttttt aaatgttgat aatatttaca ttttatataa tttatttcca taaatggaaa 15661 gttcccagtt gtagagaggt ggttttgaaa agttttaaaa tcaaagctgg ctgggtgtga 15721 tggttcacac ctgtaattcc agcactttga gaggccaagg caagaggatc acttgaggtc 15781 aggagtttga gaccaacctg gccaacatag tgagaccctg tgtctacaaa aaaatttaaa 15841 gaaattattc aggtgtggtg gctcatacct atatctctgg ctactaagga gggtgatgca 15901 gaaggatcac ttgagcctgg gaggttgagg ctgcagtggg ccatgattgc accactgtac 15961 tccagcctga gtggcaaagc aagaccctgt ctcaaaagaa aaaaaaaaag taaagctact 16021 ggaaaaagaa aaaaagaagt ttaatgtaaa aatgggtgct gtttaatcaa atcagttcac 16081 ctattttatt atttcctcaa tcatgttaaa tgctgggttt actgctcagc acacaatgaa 16141 actccacaac tatttactgt aatcattatc agtatttttc ttcatcttct gttttttaga 16201 gacagggtct cactctgttg ctcaggctgg aatgtggtgg cgcgatcata actcactgca 16261 gccacgaatt cctgggctca tgaaattttc ccgccttagc ctcgcaagta gctagaatta 16321 caggtatgcg ccaccacacc cagctaattt aatttttttt tttttttgta gagacaggat 16381 ctcgctatgt tgcccaggca ggtcttagat tcctgacctc gagcgatcct cccacttcag 16441 ccttccaaag tgctggtatt acaggtgtga accaccacac ccagctagta ttattatcac 16501 tattattatt actataggcc tgatggggca ggaacaccaa agcatctatg aaaagaaaac 16561 gcatgtccca tgtaggtgct cccagcctct tctcaactaa tagtgcatgg tttggtgagc 16621 aagggtaggg gccagaggga agtagatgaa tagtcttgcc acatcccttt cccacccttc 16681 acagggccag gtgtgtctat ttgctaattg ttaagaaata aacagtcgta catctcaaaa 16741 cagttacttc aagataacta agtcaaaatg tgcacatcaa aatatccagg tttgaaggaa 16801 gccttaaaca tcacactagc aggggctgta gtatgacttt cacggaccct ggacactttt 16861 gtcttcatgg acctccttcc tccacaaaaa cattaaaagt tacattttat gactgcattg 16921 gtataaagat gaatataatc caagctgagt tcatgtttta aatatattca gtatcactgt 16981 catattcatt ttttcttctg attttaaaac atttttgtgg gcccctaaaa ctattaatgg 17041 gtcctaggca ctgtgcctcc tgtgcctaat ggagaagtca ccctcgccag ctggatgtca 17101 ctcaatatgg ctccctttac ctttttggaa agtgagcaaa tatatcaggg tctggtggat 17161 ttcaagagct ctttattggt gttggtgagt gcacatgttg aacgacttgc tcccactgga 17221 agacgtacgt acctccaagt aggggctttt gtgtaaagtg gacaagtggg gcctctgcct 17281 ccaggcaaag agctcccctg tttccagatg ccatcattct ggctttgctt tttgggactg 17341 gagagggagt tgatgtgtat ctgatgtatt cacctttgca gaatttactt ctctcttaga 17401 atttactttt gcagtcaatc aatgtggttt tctcagagaa tcttgtgggt taaataaata 17461 tgtggaaggg catggggtag cagcagatac taaaggaaat gatgtagagc ttgctgttaa 17521 aatgggcagg agggggtggg tggggagtgg ggaggtgatg tctaagagtg tgctgaataa 17581 tcacaacaga agcaggaatg accatggtct tgaatcattt gggggattga catgtggtaa 17641 gagctgagaa atgtggtgtg tgggattcag aagatctaac tgactccttg gaagtgtcaa 17701 aatctcgggc agttaataca cagacaagta tactgacttc ggctatttga ctctgattta 17761 tttctgaaca cattggcttg gattcattat aatcattccc ttgtaagcat catcaagcca 17821 tggtctcact ttttctattc tacttgcctg gtaaaaccca agctgggata aaaccaactc 17881 ttggcctatt gccccatttc tctgcttcat tttgcagcaa aactcaaaat tggtatttat 17941 gtgtactctg tcctttcctc tcatgctctc taaactccat tccactcatg ctttcaacct 18001 catcactcct tgaaatcgct cttatcaagg ttacccagtg acctccactt tccaaatcca 18061 atggtcccta tctgtcctca tctgatttga cctatcataa catttgctgc agaagattgc 18121 tctctcctcg tgaaatacct acttcgcttg cagtcagccc ataacattat gctttcttgg 18181 ttttcctcct gcctccctgg ttactatttc ttctcagtct cctttgctat atgtccttct 18241 ctctccaact tctaaacatt tcagggcccc agtactcagg tcccagacaa ctcctttctc 18301 tgtctacact tactgtcttg gtcttaaatg ccatctatat gttgagcaat tccaaatgca 18361 tatctgcaaa ccagaatgga tttcagacgt gcctgtctaa ctgcctactt gatcttcact 18421 taacatgcct aataggaaac tcaaacaaaa tagctcccaa actgaataaa ctcttaattt 18481 ccccctcaac ctgctctgcc cacagtcttt cccatctcaa taaatgggag ctgcaccctt 18541 ccaattcctt gaagtcatgc ttaacacctc tctgtctcac accccacatc caatcctttt 18601 ggaaattgtg taggttctat cttcaaaata tccaagatct gaccgcatct tctcacctcc 18661 gttgtcacct gggtcctaga agctatcatc accccccgct gaattgctgc agtcacctcc 18721 taattggtct cattgcttct tcccttcctc tccttcagtc tattcttagg agggcagtca 18781 gaatggtagc attaaaatga aagtcacaca ctagcactct tttgtttgaa accctctgac 18841 agcttcccat gtcacttgga atcatatcca aagtctataa tggggcccta cctcatctgg 18901 gcccccgttg tcttactgat ctcatatctt accactttct gcctcacccg ctccagtcta 18961 ggctccttgg tattcataaa ctatgccaag gacaatcctg cctcaggcct gccctttgca 19021 catggtctct cctatccctg gaatgctctt tccccagata cctgcatggc gtcacctttc 19081 tccattcaag tctccacctg gatgtcacca cttcaaacag tccttgtctg agcaacctat 19141 ttaaaaaatg aaagccccca ccctctcacg tctgtcttct taccctgctt catttttctt 19201 cttagcattt gtcaccaccg ggcacattat atgttgtatt tctttctaga ctcctctctc 19261 cccttaccct ctgagctccg tggcagtagg gactcaatct gttaactgct gctttctaag 19321 tacctaccac aatgcctgac acatggaggc actcaataca tacggcttga acgaatgaaa 19381 ttatgatttt gatcacaagg tgttgaccct acagatgggg gtcaccgtag agagggagta 19441 cggtgtggtg tacagcttta gaaccactca gcatggggcc aagccagctc catcacttac 19501 taacagtgtg atctccagac atatcacttc tcctctctgt gcctcagttt cctcatctct 19561 aaaatggaat aatattgaat tataggactg tgcttgtgag gagtcaatga gataatgcat 19621 gtcaagtggc acatagtaag tgcaatgtag tccaagcttg agttgagttt ttgaacagct 19681 aagatagatg tgaggacctg tgttcacaac accaggttca cagagcagca ggcatctccc 19741 ataagcctcc aggttttcag ggggttctgt ttcatggctg ccatagagtt tcctgtgcct 19801 tctgatttca cctcgacctt ctgtgaaacc cctgcctgct gagggaagtt gggcatggcc 19861 ctgttccttg cccaagccag ccttcacagg ccagatagag aggtggcttg gttctccatc 19921 ttccttcttg gttgttgcat tttccctata agaaccccac aaatcctttg actgtgaaga 19981 tgttgtctcc ctcacagagt gtggtgcggg ctaggcaaat atttgtccgc tgattgagca 20041 aatgaaggaa aaactgcctc cctcaaagcc tctcttatca ccactaggag tcaagtttct 20101 cataagctct gctgcctcag gggagggccc catgaaaggc ttccagtccc acagaggttc 20161 tcactggagg tgttatatga catacatatg gctttgaaaa tggtgggaaa tctttcaagt 20221 ggcctttcat tactgactgt tagtatcctc tcaaacccta cagaccagca ccaggccgtc 20281 cagcccagct aggccattcc aaggagaaca ggctcccatt catgggctcc catggccgac 20341 tcgcccacat tctaagggct ctctgttgag ccaaatatgc aagtcccaaa cacagagacc 20401 cgtgtggcta ggaaatgccg tcaaatttcc tggcttctct tccccaatca gtgttgtgtt 20461 tttatttact tatgtataat ttatactggc tgtacttcag aaaagaaaat acaagttgaa 20521 ggtatctttg aattatctcc cttttaaaag gacattggga attttagaat cagtaactag 20581 agaagcttca gtaactatag tttgtagggg gcatggtcct ggtacgtatg ccaccctgtg 20641 ttcttcttgt cctgtcccca gccccagtcc tgaggccggc ctcaggcctg gcaacattgc 20701 tctagggctg ctgagagaag gtgagagggt atatacgtgg gcctttagag gtctcaaagg 20761 atccagcctc cttctctctc ccctcccatc tgccataatc agtaagggta gggaagaaat 20821 gctggctttt tttaaaaaaa aaaagtgtaa atatctatgt gagatatagt catgctagtt 20881 gtttatggcc cttgtaaata ctgacagttt cctgtcttct tttcccgtaa ggaagaggaa 20941 tctatctgca caaacatata tggagcttcc agcgaatacc agacaccaca ctaaaattag 21001 tgcattttac ttcagtccac gaaatccact gtgatttata aagttcaaag acaggcaaat 21061 ctaacctatg gtgtcagaag acaggatatg gttacgtttt gggggtgttg tgactgagag 21121 gggtacggtg tgcttctggg gtggtgatca ggttctgtta ctcgagctgg gtgctcatta 21181 cccagtttat gaaaattccc ccagctgcac acttccattt gtgcgttttt ctgcatgttg 21241 gttgtgcttc acaaaactat tttgcaatgg atattattat ccttatttta cagagcagga 21301 aactgcagcc cagagaaggt aagtgcctag agatgggtca cacagccagt gagtgataaa 21361 gattgaaacc caagtccatc gatctccaga gccagtctct tgaatacccc attgtcttgc 21421 ctttgtacac tctgacaatg ccccagcaag aggatttaat acctataagg actatcgcta 21481 ataatgtgta taatggctaa cactttttga acactttctg tgtgccaggc cctttgctag 21541 ggtctgtcac gtggattatc tcactgaatc ttcacaccaa ctctactggt taactctatg 21601 tccattttcc tgatgaggaa actgaggctc agacagtcaa gtgacttgcc caagatctct 21661 cagctaatat ggggagtcta tagtatttgc cggcagatgg aattgatttc ggatcttagc 21721 cactcagcat ttcgaatact agctcatatt ctgtgtttga tctctgtttt cagttgggaa 21781 aaggaatctg gatattcatc caaagactag gtaccccagg gaggcagctg tgaaagaggg 21841 gaaggaactg agaggccccc ctggtgccct ggaacagcac aatctcctct aagctctccc 21901 tccatcccca gtgatatttt gacctcagct aatgcaaaca atatgctttt tgtacttggg 21961 ggtgaatgga tgaagtgcaa gtgccaccag caaagagggc agagagggaa agtcaacgcg 22021 tgcagatcat gcaggccaag gttggacttt cacagtgcaa agtgtgcatg tgttcagagg 22081 gttcttgatt tcaacttcat cataaaatga gatttgtagg ccaggcatgg tggctcattc 22141 ctgtaatcgc agcactttgg gagggccagg cattcaaaac cagactgggc aacatagaga 22201 gaccctgtct ctacaaaaat atttaaaaat tgtccaggtg tggtggctca cacctgtatt 22261 tccagctact caggaggctg aggctgaggc gggaggattg cttgagcctg gggaaactga 22321 ggctacagtg agctatgatc ttgtcagtac actccagcct gggagacagt gagaccctgt 22381 caaaaaaaaa aaatcccaca gaatttgcat taaaagatgc agaaagaggg agaagagaag 22441 gaaggaaaga agaaaggata gatggaagga aatcaggcaa tcatggatta gtcgttcaca 22501 gctgtgactc gttttttggg agcttcacgt gtcaagtggt ttcttacgtt tttgcttaaa 22561 acataatttt tactatttac aaacataaaa tatactcctt tttggaaatt tagaaaacac 22621 agataagcat tcaggaaaca aaaaccacat gtacatccat ggacatgtac taatttttac 22681 aacgtgacat cattttgcac agaatgggtt gtaacctgct tttttaaaaa attcatcaat 22741 gtcatatcca cttctccatg ccattaaatg ttctactaca acatcatttt aatggcttaa 22801 gagtatcatt ttatagatgt accaaattta ttaaaatttt tggatattta ggttgttgct 22861 aatctttcgc tatattatac aaatcacttt gaacatcctt gtgggtacat ttttacatgc 22921 atcattgact atatccttaa aataaattgc tacaagtgga atgcttgggt caaagagtac 22981 acacaaattt tagggttttg atatttaccg acaaatttct tcctagaaat gttatgccaa 23041 tctgccagcc ctccaaccca aggagcatgt aaaagtgccc acctcactgc attctttgca 23101 atattgggga ggcatagttg gtaagagcat aaggtttgga atcagacacc tctaccactt 23161 gctagccgtg tgtccttaga caagtcactt taaccattct gggtctcgat tttctcatct 23221 gggttgctgt gaggatcaca tgagatcatg tatgcagagt gctaagtaaa gtgcctggcc 23281 catagtgagt actcaataaa caaaaactac tatatattta caaattgttc ctgcccacct 23341 ctgatgtgtt attccctggg aatggccaca gactgagcaa agacttctga aatccctctt 23401 ttataaccag tatctactcc agcctccaaa ccagccctga agtgctgctg ctgcctctgc 23461 tgctgttatg ggccccttgg ctcagtcctt tctgcccctc ccttctccag cactaggtga 23521 ctgttggcag ggagggttct taggcctcat gcattaatga ccatcttgct ttttctcacg 23581 ttgattgtgc tgagtgttgc aaaccagtaa agaaacccag agtggctttt taaactgaaa 23641 tgatgaataa taacaacagc aacagctacc acttattgag tgccaattgt gtaccaggca 23701 ctgtgctaaa ccttttacat ggatggtctc attcaatctc ataacagtct cagaggtggg 23761 aggggacttg ctcagtgtca catagctagg aagtggcaga tctgggattg gattccaagc 23821 ctctctgact tcaaagtctg tgcttttaac cactttcatc ctgctctctt cgatttagaa 23881 aggttcagtg gggaagggtg tgtatgtgtg tgtgtgtgca tgtttgtgtt agtgagtgtg 23941 tacatgtttg caacggtgca tatgtatgtg tgtttgtgtg taaataagtg tgtgcactca 24001 ttagaataat actttttcag ggccactatg tgctaggcac tgtgcaagtt cttggggata 24061 taatggtgaa caaaagagaa atgatctcag ctctcgccaa gcttactgaa cgataaagaa 24121 gaccaatagt gaatgaatga atgtaagaat taagaaataa ataaattaca gtattactgc 24181 catgaaggaa ataaacaggg tgcagtcaca gaaagtaaca gggtcacaaa ctttagattg 24241 gaagttgagg gaaggccttt ctgaaagggt gacatttaag tttgtatgtt gaaaaactaa 24301 tctggaatag gaagtcaggg cctccatgca aggataaagt ggagggaagg tactctggaa 24361 tggcggcaaa ctcataagtg cctgaagatg ttttctagag tgttctgggg ctcccaaaag 24421 tgtatatggg tggaaatgtg tggcgcctct gggtatgcat ggaaagcagg gggaatggcc 24481 tgccagtttc accgaaaaag ggcagtgcgg gaaatggaga ccagctctct ctggacaggt 24541 tttgtctcag gaccaacagc atcaggaatt ctagagggat gaatgggact aagaggtagg 24601 gtctagatgc ccagggttga agttagtgga gtggaagcca tttgtactcc tgctgaaaga 24661 aaaatcctag cctgccccat gtatgcctgc atcagggtgt ttcttacctg ctctccactc 24721 attcatgtgg tcaggggctt gagaaggcag caaattctgc cctggggctc cctctctcac 24781 ttctagaggt aactaatagg gggcacttgg atgaaaaaaa tgtcaccagg atcactttta 24841 gtagcagcag cagccttgag ccagtgcagg tttgtagttt aagaggtctt tacccatcca 24901 cagggtaggt gggagtggct gctggagcag ttggcccaag aggcctgaaa aagagaggcc 24961 cactcaaatc cagggtctcc actccatggg ccactgactt ccaagcctaa tctggaatga 25021 tgtgtttccc caggccaccc aaggcccgct gttggatcac tgtgaatttg gggttaggac 25081 accctgtgcg tgcccataag ctcccatcca aactgtgctc tggggcacct ggtgaccata 25141 taggactttg ggctgtggca caccaaacca cttcccttaa catttcagag aattctagaa 25201 aggaagcagt taagagaaat attcaaaaca ttctggggct ctgctttcca catttcaagc 25261 attcagcccc taagtcatat gtatgagatc tgtaaggcct gttcacatgg aaactgccta 25321 gttgtcaggg gctacgcttt gtttctaatt tgtgcttagg acatgcagta ccctcattca 25381 atatgaaaca cagactgtga ttacaccaaa ttagcttaat cactctcttt gtgcactgaa 25441 ttgcttctgg cctggagata tcaattttca gaagtagcat tcaccctcac caaaacaaat 25501 cttaatatat tgtcttgcct ttggtttcat aagaagggcc aaccatcccc catcactcac 25561 tgtgaatggt gtgagatgtt ggggagaatg tctttgcctt gagggcttgg acttctcttc 25621 ctatcctttg tgtcacactg agtgacggca ctccagtaac actgggcagg cctttcagaa 25681 ttttttatct tgtacacaat ccagtcaaac caaacattca agagcgctag gagaggtttg 25741 tatgggaaag aggaccagtg tcccagacat cctctccagc cggctaatct tcctttgtct 25801 ccacacccac cctctctcca ttatcttgga tagatgatag gctccaggct ttaagaactg 25861 tgtggaagaa ctctttccct cctcccttct tgcctgagtg ggtgttgaga atggggaaaa 25921 cattcatcct tgtcctaagc tatcctggac atttgagatc tcagattcag cggctatcat 25981 aggcacaata agaatagtgt ctatgggacc aaaggacact ccatccctct taaccaatcc 26041 acatgaactt cacaagaaga aactaagtca cctcttgacc ctcagaattc tgcttttgca 26101 aaatggggtc caagcaacta cttccctccc cattacttat cgttccagct cacttccctt 26161 cttggctaat ctgtttttgc taccttggtt cagtctctta acatccctgt gcctcatatt 26221 actctggaaa atgtggtttt agctcctaga gttgtgtgag tgaccagcac aggtaagtgt 26281 taagcccttt ttaaagttct gtcgagtccc cctatttggt gtatttaaaa caccagctgt 26341 aaaataaagc tagatgatag cagcaaccaa acacaaatca agcacattta cttccagaaa 26401 agcatcctta aggagctggt tggatccggg agcaaaggta tctatacaga gatgctcatt 26461 gtaatgctgt tagcataata gaaaaaaatg gaaataacca agcagtgcac aaatatgaca 26521 tttgttaatt actgccttac catggactac ttactgtaca gcactttaaa aagctggtgt 26581 ttggctctcc cccttccctt cctttccctt cccttccttt ctttttcttt ttttctttct 26641 taccgagtct cactctgttg cccaggctgg agtgaagtgg agcaatctcg gcctactgca 26701 acatccgcct cctgggcaca agcaactctc ctgccttagc atcccgagta gctgggacta 26761 caggtgtgca ccaacacacc cagctaattt ttgtattttt agtagagatg tggtttcaca 26821 atattggcca ggctggtctc gaactcctgg cttcaagtga tctgcccaac tcggcctccc 26881 aagaagctgg gattacaggc atgagccact gtgcctggcc tattcatttc tataggaggc 26941 ttttcatgat attttaatga agcaaaaatg gcttgaataa caatatatgt tgtgaaaaca 27001 atgcatacat gaatagaaac caatcctaga aacagtacac ctacggagga caatggctgt 27061 ttctggatat tatcattagg atgattttat tttagctttt ctgtatttcc ttcaagttaa 27121 aaaaaataga gtatgtatta aaaaaaaaac tacatccatt ttgaaaacaa taaacaccta 27181 acatgtcttc taaatctttc cttctcagat gatagcatgt ctccaaatca gtggcgttac 27241 tcgtctccat ggacaaagcc acaaccagaa gtacctgtca caaaccgtgc cgccaactgc 27301 aacttgcatg tgcctggtcc catggctgtg aatcagttct caccgtccct ggctaggagg 27361 gcctctgttc ggcctgggga gctgtggcat ttctcctccc tggcgggcac cagctcctta 27421 gagcctggct actctcatcc cttccccgct cggcacctgg ttccagagcc ccagcctgat 27481 gggaaacgtg agcctctcct aagtctcctc cagcaagaca gatgcctagc ccgtcctcag 27541 gaatctgccg ccagggagaa tggcaaccct ggccagatag ctggaagcac agggttgctc 27601 ttcaacctgc ctcccggctc agttcactgt aaggaaatgc ctacagatac ttggaggggt 27661 gcctctgatt ggttgtggtt gagagtgagt tggcatgaga tgaggtgcca agactaattg 27721 ctgccactct ggacataggg ctgctgaggg aagggatgtt ctgcttaaat gttccataca 27781 tcttcactag gcttcattga cattcacagt ttaatagcta aagggttaca tagatagcca 27841 gccagatgga cttggcccga agtactcacc ctagcctgtc aacttggcct taccttagaa 27901 aaatgatggg ctgctgcgta atatcgcagg gtcctgtgtt tacctcattg ggagcaatta 27961 aataagccat ttgcagaaca gagctctgtg acgtacctac aggctcaaac tgtagcctct 28021 aaacaggttg gcaacttgca gtgatgtcag gctccaggca aacaacaagg gcaatctata 28081 ccaaccatct gctccacaca ttaaaattca aagcgactgt aggaggcagc cccattgctg 28141 cttggagacc taacagtgcc tgactgtcct tggcttgctc ctgcactctg ggatcctcct 28201 gcttggtttc ccttgggttc ctgctccctg accaccaccc ctcccctgaa ctaaaaacac 28261 ccttaaggct tactcacctc ttcattcact cattcaccca gcaaatattt aatttgcgcc 28321 taccagatgc aaaatatcca tcagaaaaca aggctccctt ctttgtaaaa ggacggggaa 28381 gaattggcat taattatact tccaccaaaa aatcaaccta ctctgttgca agatttgcct 28441 ttaatatgaa acacttataa atatatattt tatagtttca tggaaccaag tctgcatcct 28501 ctgccattct gtactctgga cgaattttct tttgcagctt ccttgtccat acatagtgtg 28561 gttctagtgt tccatgcctg aaagaaatta ctagaagcca tctttacact gaactaagca 28621 tatagtaaac aaaaggcctg ttcagcatag cctacctaag tcaggcactg tgttctattc 28681 taatgtggaa aacaaatttc ccattaaaat accactctgt catttccaga atttgattag 28741 gaatgtactt aaggagtaaa tcatagaagt gtcatgtttg ggaaacctgt gaaatttcct 28801 acccctctac tctttttcaa aagtcctttc ctactggatt cacagggaaa attaggccaa 28861 ctgtgcttgt atatatgcct gtgcatacac acagcaaagc cctgtttcga cagtaggtta 28921 gaacacagat ggctgaaatc atcaccttcc aaacttagca actctttaca gttcaagtcc 28981 aattcagcaa atatttagtg agcatctact gtaagcaagg cacatcatgg tgaaactaat 29041 gcaggcattg tcctattcct gaaatctcaa gacacattca caaacaaatg gttatcacca 29101 aggtcttcat gctctactca tgttgacatg agttgtatta attggtgact ggaagtccag 29161 gatctgttga ggaagtcagt gacccttaat caggaacact gccttggaag gtggtggacc 29221 tttaaaacag aagcttctca gtttttgtag catctgatat gagagaatat gctagatatt 29281 cataaactta gggccaggca atgtggggcc cctggaatgc tactgggcac tctctaacct 29341 agtcctagaa atttcagttc caataatgtt ttcttcttct tttctagata agaaactata 29401 tgtatctcgt ggatctgcca gtaccagcct tccaaatgaa agtaggtatc tgggccagcc 29461 tttgatggtg tgtgtctgta tcctgagaac gaacttgaga aataaggctt ctgtctacta 29521 ctggagacac tggtagtata aaacccagag tctccagtaa tggacgggag ccttatttct 29581 atcactcagt gttttcatag atagaatttg tttcatttta gcctcaaaaa agagtatatg 29641 cacttcctat cttgtaaaat tttattttgc tatagaagaa actctcctta gtttcttttt 29701 tcttttcttt ctttcttttt tttttaacct gagtttgact taactgaagt agccaaggtg 29761 actcagaaaa attaagaaat tcatgaggat gaggattagc ttgctatgtc ccagatgttc 29821 agttttcagc ctagtattct tttatttatt tctttttttt taacctgagc tggacttaac 29881 tgaaatagcc aaggtgaccc agaaaaatta agaaattcat gaggatgagg attagcttgc 29941 tgtgtcccag atgttcagtt ttcagcctag tacatagggc caccccagga ctctgcagtc 30001 agtgaaaatg tcaactgagg catgaggcca ggactctgag ctccacactc aatttgaaag 30061 tggtgtcagt gggaacttac aaaagaggac aatctgggtg gaatgacagg aggatcttca 30121 tgacctgggt tcagggtcaa ttgccagctt tattatttac ttgcttcagg aacctggtca 30181 catttcttaa ccacttcatg ccccagttgc ttcatttttg tcaaataagc ataaaatagt 30241 gcccacctca ttgggtgagg tgtgaggagt aaatgaaata atgtatacca atgcttagaa 30301 taatggctgg cacattactc taagcgttgg catacattaa accatcaaga attctttttg 30361 gagggcatcc tgtgttcccc aagctgtgct ctgtgtgctg tgtttaagag aggactcaag 30421 ctgggtgtgg tgcctcacac atgtaatccc agcactttgt gaggccaagg cgggaggaac 30481 acttgagccc aggagttaga gactagcctg ggcaacatag caagaccctc atctccataa 30541 aaaataaaaa tagagaggat ccaaaggaag agccactacc acagcccctg atctcaggca 30601 tttgacagtc tggtgtgaag taggggtgag ggcaatctat agaccccact agaatgtcaa 30661 ccccatgagg actgagactt tgagaggtat accactgtat ccctagcacc ttgcacagtg 30721 gtatacaaat ttgttgaata aatgaatgag tgaatgctgg aaacaataaa agagcattta 30781 acaaggaaac agtaagaaaa tgtagtaatg tagtaaaaat tattccaatg tagtaaaaat 30841 tattctgttt gatgccttcg tgtgagagac aaacaaaaca aaaaccaacc ctggtgatta 30901 attggccatg cctccatgag gagattcatt tagagctggt tcaaaagcct tagcatagcc 30961 aaggcaggca tcacataagt ccctgcccaa ggaaatgagt tctgctggcc ttctaactta 31021 tgaaaggcaa ggattgtggt gtttcctgag tacttgttaa tagtttctac atttgtttat 31081 tgtgagtctc agaattactg ataacttgat gcttaaaatt gttcatagac taaaactaac 31141 agcaacaaca acatgtattg ggtgttgact atgagccaca taccacatac tccatcttgc 31201 ttagtcccca ccatgacctt gtgaggtgta tgtgaatatg tcctaggtgc tttactcaac 31261 cgatattcag taaatagtga ttgaatgaat gaatatcccc attttataga tgaagaaact 31321 ggatctcaaa aaggtgaaag gacttgcctt aaatcacaca gcagcaagtg gcagatgcag 31381 tattatctcc ttgctctcac agtgtcataa aattatcaga aacaattcca agttaaatca 31441 agtatacaac agtgaaaatt agtcactggt gagattattt taccacacat gagttcactg 31501 tctatctggg accagggcac tatcctaaaa taaaggaagg gggatgcaga aggctcagaa 31561 agcaagcaaa agagcaaaga aaaatatacc tttgggaaag aaggtggggg atgttctttc 31621 cttgcagaga gtagttgagg aaaggtggct atgccaacag gcccaagtct gcctttctaa 31681 cactgtggcc ctgggcagag ggaggtaaca tggagctgag aggaagagaa gggacttcct 31741 agggctccaa gtaaaaggac aatggaaccc agcaaagccc atgctcttaa gcatgacacc 31801 agtctcccaa tcttgcctga tggtacgaac cacctgggga gcttgttcac tgcccagatc 31861 ctgctccccc acttcctggg ggagtagatt tggcatgaga ctcagttgcc tcttcatgtt 31921 ttattctttt tttttttttt tttttttgag atgaagtttt gctcttgttg cccaggctgg 31981 agtgcaatgg cgcaatctcg gctcactgca acctccgcct cctgggttca agtgattctc 32041 ctgcctcagc ctcccaagta gctgggatta caggcacccg ccacaatgcc tggctaattt 32101 tttgtatttt tagtagagat ggggtttccc catgttgatc aggctggtct tgaactcctg 32161 acctcaggtg atcagcccgc cttggcctcc caaagtgctg ggattacagg cgtgagccac 32221 tgctcctggc ctcatgtttt attctttata taatttatat taacctcatt taaaagtgaa 32281 ctacatgaac ctttcagtgg aaatttaggt tgttctgaaa gatttcctat gtgaggctcc 32341 agaaaaatat ttgtcaagga agggaggcag agtctcctcc tgcccctccg atatggccct 32401 agtcccctca tgtgggctgt catgcatcta tgaagcaggg gcatgtggag cagaagagtc 32461 atcagagcca aaagggtagt cagagagacc ttggagaaag acattctgga ggttagtctc 32521 acaaatgttg ttcaaaagtt ttagtgaatt tgattttatc ttttggtcgg tagaaataga 32581 catactggtt atatattgct tcaatggcta caaataataa taacagttga tattgctaat 32641 tgccagatac aggactataa tatacattgg agatataaca gtgaataaaa ttgtcttggt 32701 ccctgccctc acagagttaa tagtttcgtg gggaagacat gcccaccccc aaagggaaaa 32761 tacaaaatat ttgcaagtta tgataagtgg agtgaggaaa acacatgagg tgttgaaata 32821 gggaaaaatg ggggccaact ttagataaag tgttcaaagg aggcctttct gaggaagtga 32881 catgtgatca gggacctgaa tgaagtgaga gagccagctc tgggaagaat tagggcagag 32941 atggagggag gtggaggaaa ttgtccctgc aaaagcccga gatgtgtgtg aagaatggaa 33001 aaaaaaaaaa gaccagggtg acaggagtgg agtaagaaag gagagagtgt ggtacaagag 33061 aagtctggag aggtaggcag gggccaggtg gtgcaggggc cttatgggtc acattgggag 33121 ttgagatttc atattctaag cgacattgtg agccatggaa ggattttacg tgtgatccaa 33181 cttgcatctg ataagatcac aaaagctgtt ctgtagagga cacattggag aaagggaaga 33241 atgaaaacaa agaggcaaga agcatgagag gatgatagcc tggattaggt ggcaacagca 33301 gagacagggg tatttgagat ctatgtagga gggttagctc ttaggacttg gtaatggatt 33361 aaataggaat ggcaaagaaa tgggaagaat ccaggttatc tcctaggttt catgtttcaa 33421 caaatggata ggagaatata actgttttgg ccatgtcaat tctgaggtgc ctataagaca 33481 tccaagtgga tatacctttt aaacagttgt atatgtaagt gaagaaagtc tgggtaaaca 33541 tttgggatgt atattagtcc tgtacataag tggtatttaa ggccattgga tgtgatcacc 33601 tagtcagagg acgtagacat agaagagagg agagctgcag aaagaggcct ggagggtgtc 33661 aatatttaga gactggggac atgaggaaca aacggaggag actgaaaagg tgtagccagt 33721 gtgataagag aagaaccagg gcagtgtggt acataacaga agctaaaaga ggtgaatgtt 33781 tcaaaacaga aggagtgatc aactgtgtta aaggctgtgg agagatcaaa taagagaagg 33841 tcaaataaga agaaattcct attggatgtg gcatcagaga tttttgtaga cgtgttaaga 33901 gcagttttca ttatgtgatg tggacaggag ccagattgaa gtgggctgaa gagtgaacca 33961 gagatgagaa agtggagata gtaaattcga ggagcttctt agagaaattt aacattgaaa 34021 tgataaagag aaaaaagatg ttagctaaag agattaatgg ggtttgagaa ggtctttttt 34081 taagaggcaa atattagagc ctctttgcat gataatggga atgatccaat ataggaggat 34141 gaaggtaact tgaggggtga agtcattgag aagacagaag ggacatgtag aagcattgag 34201 ctttcataag aacaggggct gttctgccat tgaaagtata gaaaagaaag ctaataagga 34261 cacagatgca aattgatttt caggcttgct tgtgaggaga ctaaaaagta ctaatctaat 34321 gatttctatt ttctcaatca actgtggagc aagacagcaa gagtgtggga tatttaaaga 34381 gagtgagaaa gtatgaagaa gtcttcttgg gcattgggaa agcaaagcct gatagtagac 34441 aaacataagt aggatttctg ggtggtgatc tgaagtctga gctcatgaat ttcaagtgcg 34501 aactctcagt tgggttgtgc aattttcatc agtgatgtta gcttagtgta atgtgcagag 34561 aaggcaaatg gggacgctta tctaggggtg aggttgtggc aagtgagtat gaaggataga 34621 gaataaggaa aaggagtcaa gggtgggcaa gacccattga gaataggttg aggctgtggg 34681 tttcctacat tttcaaagtt tgaaaagaag ctgaaaagtc tcagtctggt cctgatgaat 34741 gacatacttt cacttgaaaa ctttggtttc tcaatcctaa caaagtgacc cccaaatgca 34801 attgttgaga aatctgataa cttactcatt agaaattgag taaactgccc gtcacaagtt 34861 gaataaattc tgttgtggct gcaatagaat ttactctgta tccagagcat ttcttggatg 34921 cagtttcatg agtttctgtc ttcccttcct ctgaggttga ggactctggt atctgacatc 34981 ccacaggtgt tcatctggac cctggaggag acttacagcc ttccataggg gcctaactgg 35041 ctttcattaa ccatgtaact tctcctttag ctctttcaga gttagagaca cctgggaaat 35101 actcacttac accaccaaac cactggggcc acccacatcg atacctgcag catctttagt 35161 caagttggag gagaaagaca acacttggtc taagacacgg cagcaagaca tccctgcata 35221 ttgttccaga taaaaatgaa agctgctcac acccacttgc ctccccaatc tgttaaacag 35281 cttcgtgtct agtatgagct cagtacttgc cctgtgaaaa tcccagaagc ccccgctgtc 35341 aatgttcccc atccacaccc tgcttgctcc tgtgtaacag ctcagatgat gaataataat 35401 aaaactgtac ttttttggat ggtgctatgc cgggttctaa tttctttcac atccatcatt 35461 tcattcaaat tttccagcaa ccctaatatt ggcattatta tcttcattta atatatgtgg 35521 acactgaggc ttagagaagc gaaatgacct gcctaagatc acatgaccag tatcagggcc 35581 aacactgaaa tccaggcctc tgaactcagt ggtcttttct caccactgga atacttaata 35641 tcaggatatt aagttaatgt caggatggct cttagatatt ttcccctccc atctgctatg 35701 taaataaaga gcacaactct tctatgagtg gattggcgat cttggcatta atacagtaat 35761 tctcctgtct caagaacagt ttttccagtg tattacatat gtttcctaga ttcagtaaga 35821 ggaaaaaagg cttaaggagt acattttctt tcaaagttat ttactttcaa aatccatgtt 35881 cttctttagg tgatggttgg aaaaattcag ggtggaggct gattctagag cacttgtagc 35941 acataattat agaagtgtgt gtttatcttg gctgactcta gaaagattca tctgtgttag 36001 aagataatat ataccattag aagtgaactc ttaacatgca gtttagggaa aagaagaaat 36061 gattccaata gaattggcat attatgtagg ttggatgatc taataacatt tccctgcaaa 36121 cccctggaag ctctctcttg aataaaactg aaggcagtaa taatttttct actttaataa 36181 agaggaaaga aaagactccc tgttcctaag agacaaacgc tcgctcctct gacataagca 36241 ggtaaattcc tgctcacatc aactgatact aatctcagac tcagatttct ggcttttcat 36301 atctatcttg tttttccttt cccaccttac ttccaaagag ctaaggggaa aattatttct 36361 ttgtatcact ttggcatttt actcttgcaa ttaaaaaaat actctctccc cttgcaaaca 36421 aggtgtatac caccctttac acacaaaggt ggggtaattg acctctgcgt gttgtccgtg 36481 tgacttcatc tacatgctag ttaataatct tgtgagatgc taacttgggc agagagtagg 36541 ggaggtggga tttttttttc caccaggtca atcaaggtct tgaaactgtg gaaagtcagg 36601 gccccatagg actagagcct gtaaacctca ttcacctctt tctctatagt tttgagaaat 36661 ttacaagtgg cccgactggc tttgacttag cagctgcctc tgccctccac atcacccacc 36721 catcacaccc ccaacacgca cccattcctg actttacaat ctggaaaaat tgaaattgct 36781 tcaaatggac tggacactca catactcaaa tgatgcctgt ctctgaaaaa gcaaaaaggt 36841 ggcaattccc tttttctgtt agcaaagatt atcctttaaa aatctcagct aagactgttg 36901 tttgcagatc caaatgagac ttgtgtttaa gaacatttcc attctaaaaa tcagtgatca 36961 tgttcaacac tgaaaataaa atcactcctc cacaaataaa tgctaaatag aatatgttat 37021 ttttttctat gtaaacaatg aacactgacc tttccctaat tatttcccta gcttaacata 37081 attaattgaa aataattttg taaaccattt gtgtatacaa tggtttcttt caattaattt 37141 aatgtgtaag attttaaaat agcatgttag tatgaatttt tatagattag tgattataga 37201 aatgaactag gctatagcat tctcataagg attacagttt tatcactatt tttattttta 37261 ttttttgtgg ttacataatg gatgtatata tttatgggtt gcatgagata ttttgataca 37321 ggcatgccat gtataataat aacgttagga taaatggggt atccatcacc tcaagcattt 37381 atcctttgtg ttacaaacaa tcctcttaca ctattttagt tattttaaat tatttttgac 37441 tatagtcacc ttgttgtgct agcaaatact aggtctttct cattctttct aactatttgt 37501 ttgtatccat taaccatccc cattccctcc cacccaccac caactctcca ctatccttcc 37561 caggctctgg taaccatcct tctattctct gtctccatga gttcaattgt tttgattttt 37621 aaattccaca aataagtgag aacataagat gtatgtcttt ctgtgcctgg cttatttcac 37681 ttaacataat aatctcgagt tccacccata ttgctgtaaa tgacaggatc tcactctttt 37741 tatggccgaa tagtactcta ttgtgtatat atacacattt tctttatcca ttcatctgtt 37801 gatggacact taggttgctt ccaaatcttg actattgtga ctagtgctgt aataaacatg 37861 ggagtgcaga tatctcatcg atagactgat tccttttctt tggggtatat acccagaagt 37921 gggaatgctg gattttatgg tagctccatt tttagtttat tgaggaacca ccaaactgtt 37981 atccatagcg attgtactaa tttacattcc tgccagcagt gtacgaaggt tcccttttct 38041 ccacatcctc accagcattt gttactacct gactttttga taaaagtcat tttaactgga 38101 gtgagatgat atctcattgt agttttgatt tgcatttctc tgatgatcaa tattgagcac 38161 cttttcatat acctgtttgc catttgtatt tcttcttttg agaaatgtct attcaggtct 38221 tttgcccatt tttaatcagc ttattagatt ttttcctata cagttgtttg aactccttat 38281 atattctggt tattaatccc ttgtcagatg ggtggtttgg aaacattttc tcccattctg 38341 tgtgttgccg cttcactttg ttgattgttt cctttgttgt gcagatgctt tttaatttaa 38401 tatgatcata tttgtccatt tttgctttgg ttgactgtgc ttatggggta ttacttaaga 38461 aatctttgca caatccaatg tcccaaaaag tttccccaat gttttcttta gtagttccat 38521 agtttgaggt cttagattta agtctttaat ctattttgat ttgctttttg tatatggttt 38581 tgcggggtct agtttcattc ttttgcatat ggatatccag ttttcccagc accatttatt 38641 gaagagactg tcctttcctc agtgtaagtt cttggcacct ttgtaaaaaa tgagttcact 38701 gtagatgtag ggattatctc tgggttctct agtctgtacc cttggtctat gtgtctgttt 38761 ttatctcagt atcatgccat ttttttgtta ctatatctct gtagtataat ttaaagtcag 38821 gtaatgtaat ttttccagtt ttgttctttt tgcttaggat agctttggct attctggatc 38881 ttttgtggtt ccatataaat tttaggattg gcttttctag ttctgtgaaa aatgtcctta 38941 gtattttgat agggattgca ttgaatctgt agattgcttt gggtaatatg gactttttaa 39001 caatattgat tcttccaatc cataaacatg gaatatcttt ccattttttt gtgtgtcttc 39061 ttcaacttct tgcatcagta ttttatagct ttcattggag acatctttca attctttggt 39121 taattcctag gtattttatt ttatttgtac ctattgtaaa tgggattact ttcttgattt 39181 ctttttcaaa ttgtttgctg ttggcaaatt gtttgctgtt gctactggcc gggcgcggtg 39241 gctcacgcct gtaatcccag cattttggga ggcccaggcg ggtggatcac gaggtcagga 39301 gatcgagacc acggtgaaac cccgtctcta ctaagaatac aaaaaaaaaa ttagccgggc 39361 gcggtggcgg gtgcctgtag tcccagctac tccggaggct gaggcaggag aatggcgtga 39421 acccgggagg cggagcttgc agtgagcgga gatcgcgcca ctgcactcca gcctgggcga 39481 cagagcgaga ctccgtctca aaaaaaaaaa aaaaaagaaa aaaagaaatg ctactgattt 39541 ttgtatgttg attttatatc ctacaacttt actaaatttg cttatcagtt ttaatagttt 39601 tttggtggag tctttaggtt tttccaaata caagaccata ccatctgcaa acaaagatag 39661 tttgactttt ttctttccaa tttggatgtt cgttacttct ttctcttatt tgattgctct 39721 agctaggact tgcagtacca tgttgaataa cactggtgaa aatggacatt cttgtcatgt 39781 tccttcttag aggaaaggct ttcagttttt ccccattcgg tattatacta tttacggatc 39841 tgtcatatat gatgttatta tgtcaaggta tgttccttct atacccaact ttttaggggt 39901 tttaccatga agagatgtta aatttcatca aatgctttct cagcatttga aatgattata 39961 tggtttttgt ccttcattct gttgatatga gatattgcac taattgattt gcatgtattg 40021 aaccattctt gcatctctgg gacaaatccc acttgctcat gatgaatgat tttctaatgt 40081 gttgttgaat ttggtttgat agtattttgt ttaggatttt tgcatcaaca ttcatcagtg 40141 atattggcct gtagttttct tttttgttgt gtctttgtct ggttttggta tcagggttaa 40201 tactgatctc atagaataag tttgaaagta ttccatcctc ttctattgga atagtttgag 40261 tagaattggt attagttctt ctttaaatgt ttggtcaaat ttagcagtga agccatcagg 40321 ttccaggctt ttctttgctg ggagactttt tattatggct tcaatcccag tacttgctat 40381 tggtctgctc aggttttgga tttcttcctg gttcaatctt agtaggttgt atgtgtctag 40441 gaatttatac atttcttcta gattttccta tttaatgata gatagttgct tatagtagcc 40501 actaatgatc ctttgaattt ctgcagtatc atttgtaatg tctccttgtt catgtctgat 40561 tttatttatt taggtcttat ttcttttttt cttagtctgg ctaagggttt aacaattttg 40621 tttacctttt caaaaagttg actttttgtt tctttggtct tttgtattat tttctcaatt 40681 tcaaattcct ttatttctgc tctgatcttt attgtttatt ttcttctact aattttgggt 40741 ttggtttgct cttgcttttc tagttcttta agatgcatct ttaggttatt tgaagttttt 40801 ccttttttga cgtaggcact tacagctata agctttcctc ttagtactgc ttttactgga 40861 tctcattggt tttgttattt tgtgtttcca tatcatttgt ttcaagaatt ttaacatttc 40921 cttcttaagt tcttcatttg ccattttgtt atttgttttc tggctgtttt gtggctttct 40981 ctcctttctt tccttccttc ctgtcttcct ttccatgaaa ggtgattttc tctggtggta 41041 tgatttagtt tcttgctttg attttttgtg tatctattgt atgttttttg atttgaggtt 41101 accatgaggc ttgcaaatgg tatcttatga cccattattt taagctaata acaacaccgt 41161 ttgcataaac acacaagcaa aacgaaaact aataaaaact ctacaactta actttgtccc 41221 ctctgccccc tgacctttaa actttttgtt gtttctattt atatcttatt gtatggtcta 41281 tgttttcaaa atttgtttta gttattattt ttgtttggtt cattttttag tctttctact 41341 taagctaaga gtagtttgca caccacaatt acagtgttac catattctgt gtttttctgt 41401 gtacttacta ttaccagtga gttttgtacc ttcagacttc ttattgctca ttaatgtctt 41461 ttactttctg gttgaaatac tccctttagc atttcttgca ggacaggtct ggtgttgatg 41521 aaattcctca gcttttgttt gtctgggaaa ttatttctcc ttcatgtttg aaggatattt 41581 tcactgaata tcctattcta gggtaaaggt tttttccttc agcactttaa atatgtcata 41641 ctactctctc ctggcctgtg aagtttccac tgaaaagtct gctgccacac atattggagc 41701 tccattgcat gttattttct tattttctct tgctgctttt aggatcctct ctttatcctt 41761 tacctttggg agtttgatta ctaaatgcct tgaggtagtt ttctttgtgt taaatctgct 41821 tgatgttcta caatcttctt gtatttgaat atcattatct ttcttgaggt ttgggaagtt 41881 ctttgttatt atccctttaa ataagctttc taccccatct ccctttctac ctcctcttta 41941 aggccaataa ctcttagatt agcccttttg gggctatttt ctagaacttg taggtatgct 42001 ttattgtttt ttattctttt ttcttttctg attgtgtatt ttcaaaaagc ctgtcttcaa 42061 gtgcattaat tctttcttct gattgatcaa ttctgctatt aaaatattct gatgcattct 42121 tcagtatgcc aattgcagtt ttccgctcca gaatttctgc ttgattcttt ttcattattt 42181 tagtctcttt gttaatttat ctgatagaat tctgaattcc ttctctgtgt tatcttgaat 42241 ttctttgagc ttcctcaaaa cagctatttt tgaattctct gtctgaaaga tcacatatct 42301 ctctgtttct gtaggattga tccgtggcac cttatttagt ttatttagtg aggtcatgtt 42361 ttcctggatg gtcctggtac ttgtagatat tcacctgtgt ctgggcattg aagagttagg 42421 cacttattgt agtcttcaca gtatggactt gtttatatcc atccatcttg ggaaggcttc 42481 ccaggtatta taccacaatt ggtattgtgt tgtgatcttg gtgttgtgat ctaggccata 42541 tctgcgttag gggcaactca agaccagtaa cactgtgatt cttgcagact catagaggta 42601 tcaccttgat gaccttggat aaaatccaga agaatcgtct ggattaccag gcaaagacta 42661 ttgttctctt tccttacttt ctctcaaaca aatggagtct ctctggttct gagctggctg 42721 aggctggagg tggaatgaca caagaacccc tgtggccacc accattggga ctgcactggt 42781 tcagacctgt agccagtact gcactgggtc ttgcccaagg cctgctgtaa ctactacctg 42841 gctactgcct atgttcactc aaagccctgg ggctctacag tcagcaggta gagaagttag 42901 ccaggcttgt atccttccct tcaaggtggt gagtttccct aggcccttgg caggtccaga 42961 ggtgccatct gggaacccgg gacttgaatc acaaacctta aaagtctccc tagtgttcta 43021 ttgtactgtg gctaagctgg tactcacact atgagatgta gtccttccca ctcttccctc 43081 ccctttctac atgcaaagga gccacacccc atggctgcca ccagcacagg cccatgggga 43141 gtactgccag gttaccaccg atgtcctctt aaggcccaag ggctcttcag tcatcttgtg 43201 gtaaatgctt cctggcctgg gactcactct tctggacaac gggctctcct ctggcccagg 43261 gcaggtccag caatgccacc caagagctaa ggcatagaat tagggacccc aagagcccac 43321 ttggtgctct atgcccttgt ggctgagctg ttacctaagg ttcaggacag agtctcctat 43381 acttttccct ctgcttttct gaagcagaag gagtcttacc acatagccac catagcttgg 43441 aatgtgctga gtctcacttg aggccagcta gtttcagaat gtcacccaag gccatggcat 43501 cctacctggg tattgctgct ggttttcagg gcccaagggc tttttagtca gcaggtgatt 43561 ggtcctgcca ggactgggtc cttcccttca aggcagtggg ttctcttctg gcccagggtg 43621 tgtctagaaa tgctgtctgg gagctagggc ctgaaaagag ggcctcatga tgctgagtag 43681 tgctctatcc tgctgtggct gagttggtat ccaaaatgca agacaaagcc ctctttactc 43741 ttccctctcc tctcaagtgg aaggaacgtg tctcttatgg agccatgggc tatacagcct 43801 ggggttagga gaggagtaat gccagcacta ccttagccac cccaggtggt gtttcggtag 43861 atcatgtgtc cttcacatcc actggctttg agcccagttc agcactagga gttgcagtcc 43921 ttttggccta ggctaccttt caagtttatt tagggttcca gagcccttga gcctatggtt 43981 gggaggcttg ccagaactca agttccagtt gctgggatag gtgattgccc tccagctagg 44041 gctggttaaa tataccctcc atgggcagtt gtcagttgac ttcagtctgg ttttgctttc 44101 tgctgtgaca gggcagcaca gagttcaatc caatatctca caatccctgt gctcttcctc 44161 tcccaagtgc acagattctc tctccacccc acacagctgc tccatgggga tggggaaggg 44221 gtagcatcag cgattcaaga ctgtcttccc taccctcttc cagtgcctct ttcagcgata 44281 tgaagttaaa accaggtacg gtgagtgctc gtttgatttt tggttcttat gaagatttgt 44341 atgtgtgtgt atatagatag ttgttaaatt ggtgtccttg cagggggtgg gaatgattgg 44401 tggagccttc tattcagcca tctttctcca ccctctatca ctatttgttt tctgttttta 44461 tttagattca atgggtacat gtccagattt gttaaatggg tatattgcat gattttgagg 44521 cttgggcctc taatgattcc attgctcaag tagtgaacat agcacctgat aggtagtttt 44581 tcaacccttg ttcctctccc tcctttccct tctttggaat actcagtttt tattgttcct 44641 acctttgtgt ctgtgtgtag ccaacattta gttcccagtt ataagtaaga acatgccata 44701 tttggttttc tgttcctgat aatgggataa tggtttccag ctgaattcat gttgctgcat 44761 aggacatgat ttcattactt tttatggctg tgtagtattc catggtttat atgtaccata 44821 ttttctttat ccaatccact gttgatgggt acctggattg atttcatgtc tttggtattg 44881 tgaatagtgt ttggatgaac atatgggtgc atgtgtcttt ttggtagaac gatttatttt 44941 cctttgggta tatatccagt aatgggattg ctgggtcgaa tggtagttca acccttagtt 45001 cttagagaaa tctccaaact gctctccaca gtggctgacc taatttacat tcccacaaac 45061 agtaaagaca tacacaaaca gtgtatgtat ttatatattt ataaattgtc tcaaagttct 45121 actcaaatac gtttagttat attattcttc attttaaaga ttctgtatgc ttaggagtaa 45181 gtcaatgagc ctatttatta tatttacagc agttatactt accttttagt tgaaagctgt 45241 gaccttccac taatctttaa agatgagaag ccttttccta atgctactga gaacctggac 45301 tgagagttag gacagctggg ttctggctct ggctcttcca ctagctagct gtgtcatctt 45361 gaatcagtca cagcccctct tccctgtttc tccaagtctc agtttctgta tgtataaaat 45421 aagttgattc ctaaggtcat tttcaactat gtgagcctgt attttgaaat aaatgttgta 45481 gatatgagaa aacccatcaa cagttgccat attgtgtctc gaagaagaag acagggtctg 45541 gataactgag gagcagaaga gagaactgtg gcagccattg agggcgctgc caggaactcc 45601 cttcaagaaa gaattcgcca ttcagttgca agaagtgtcg ttagttgagt acttccacct 45661 attagcacct taggatctgt ttcagctatg ctgtttgtgg gctttcccag ccaatggttg 45721 agcattatat gggtactagg gcctggccat ttctgcccca atgtgggact gctccaaggg 45781 gcaatatttg ttccagaacc cctactaatt tggctgaact tttgtcagat ctgcatcttg 45841 gtctgagtct ctccctgctc aattcttctt cttccttttt tcttttgcag gtgttactta 45901 ccaggcccca taaacccctc acacctttaa ctcctcagct cctgctttct ggaggacctg 45961 ctgatacaag gatatagcag cactttaaga gaaaatataa agggagaaat cccaagactt 46021 attgaataga tggtgattag actagtctgg cttaagcaga gtatttgggt tgatgaacag 46081 ttgaaaatta agctggcaag atagattaag ctcaaattga tgggttgaat gacagtgtca 46141 cttaggaggc agagttaagg ttggtgccag attggagccc tgggccagcc ctagacacat 46201 ctttaatcac ctaccaagat catcttccag accagcaagt gactgtctcc atatcaagag 46261 atggttactg ttgcagacct atgtggtcta tatcagtggt tccccaaatc cccactagac 46321 acatcagtat cacctgggga gcttgttgaa aataaatgcc ccctgactct tccctagact 46381 actgaatcag catctctgtg accaccccct actcctgcca ttagtcacat gtgcaaccag 46441 ttgtatcagt tttctaacct gctctaacaa attaccacaa ccttggtggc ttaaagcaac 46501 acatatttat tatcttacaa ttctggagat cagaaatcct cctacatgaa tctcactgag 46561 ctaaaatcaa ggtgacctca aggctgcatt tcttctggat cctctaggga ggatttgttt 46621 cattaccttt tccagcttct agaggctgcc cacattcctt gtctcatggc ctccggcctc 46681 catcttcaaa gccagaaacg tgcatctcat tcttccataa tcacacctcc ctaggaatca 46741 cttattttgc cttcctcacc cattttttaa aaacccttgt aattacattg ggccccagaa 46801 gatccttctg ttttaaagtc agcaacctta attccatttg cactcttggt tcatcttgcc 46861 atgtaacata acatattcac gggttttggg gggattagga tatagaaatc atttgggcta 46921 ttctgcctac catatcaggt ttggaaacca gcaatctatg tggtataaat gtttcttccc 46981 aaggctatca tatctgaaga catttgctca gccaatctgc ataaatgtat tgacaacctt 47041 ttcctttaaa tgaagtgact actcttaaaa gtacacatgc cctttgagct gacaacttca 47101 cctctacaga tgtatctaaa gaaaataaat ttgacaaatg cacaaagatt taaatacaca 47161 gatgttcaac actgcagtgt ttataatagt gaaaaatcaa gaacaacgtg aatgtctaac 47221 agcagggcat tcattaaaca aactatgttt catccctctg atataatata cagccattaa 47281 aaatgatgct agagatctgt atttattcat taaaatgcag attaagtata aaaagttata 47341 tacacatata ctgcacacgc acatgcactt taagctcatt gtaaaaatat atatatataa 47401 tttgaatagg tataattatt tattacagca ttgttagtag cagcaaaaaa acactgtaaa 47461 tgaccccaat ttttatcaaa aataaattgt ctaaataaat tatgatacat ctatacaact 47521 gaacactatg gagcaggtaa aaaaaatcag gtagatttgt atacactgat atggaaaaac 47581 atctaagata tatcattaat tgagaaaagc aagttatgaa agtctattta atattactgc 47641 atttgtgttt ttttttaaag ggagatgtta atgtgtgtgt gtttgtgtgt gtgtggagtt 47701 gttgttttgt gcaatgctgc acaattccaa gggagaccat tcacattcta tcacatgata 47761 gtctgtgtgg caaagacttt atgaaacaca gggtgtatgt atgtatatgc atgcatgtac 47821 atatgtttat gtgtgtatac atacacacca tatgtgtata cagtatatgt gtataggaaa 47881 ctcatagcta attccaaaaa atctgttcac agtggtttcc tttggggaat gtgattgggg 47941 agactgtgtg tgtgtatgta tgtgtgtgtt tgtgtggtgg tattggtcag ggaaatttta 48001 ctttcacatt atatccttct gtgctgttta aaatttttta caatcctgtg tcactttcat 48061 gagaatatat atatatattt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 48121 gtgtgtgtgt gacagagtct ttctctgtcg cccaggctcg agtgaagtgg cacgatctcg 48181 gcttactgca acctccatct cctgggttca agtgattctc atctaagcct ctgcagtagt 48241 tgggattaca ggcgcccgcc accacacctg gctaattttg tgtagtttta gtagagacag 48301 ggtttcgtca tattgcccag gctggtctcg aactcctaag ctcaggcaat ccacccacct 48361 cagccaccca aagtgttagg attaacaggt gggaaccact gcacccagac gagaaaatca 48421 tttaaatatg tatcttattc agaaaatgtt tagaaaaatg agcagtggct atctctaagt 48481 agttgtaata attgtgaggc agtaaaatat gctccgttct ctccacgttt ctctgataac 48541 ccagaccact tcatttttcc tgtcctgttc cttttgtaac actggaagag aatcatacac 48601 tcctgggggc agctacagcc tcaagagatt atctaggcca ggctgctgcc ttcgtacaga 48661 taaattattt aaatgatcca ggaaaaatca ttgcctatca ggaaaatgtc aataactttc 48721 tttggaaaat actaaccaaa tactaattac ctgtattcag tgattatttt ctttcttgat 48781 tagttataac ttttgtagaa ataaacttga attaaaaaaa tttattgagt gtccattatg 48841 aagttatttt cacatacatt gtcacaattt aaacctcaca gtgaccctgt gcaattggct 48901 gatgaggaaa ctgaggccca ggcctcaagg agatgtgtat atactgctaa actggggaga 48961 gaagatggaa ccaacattta tttagcacct actatatact agttacagta taaaacttac 49021 ttcgttttct cttaacagca tatagaagtt attgttccca ccttgctgat gaggaaaccg 49081 aggctgtgag aggctcagta acttccccca aggtcacaca gcttgaaact tggcagagcc 49141 agaattccaa gccagctctt tcctccaccc cctaccccca ttttttgttt tgttttgttt 49201 ttttagacag gttcttgctc tgcaacccca ggttggagtg cagtggtgca atcatggctc 49261 actgcagcct caatctcctg ggctcaagcg atcctcctgc ctcagcctcc caagtagctg 49321 gtactacagg tgcatgctac catgcttggc tattttttaa aaaatttttt gtagagacaa 49381 ggtttcacta tattgcacag gctcgtctag aactcctggg ctcaagcaat cctcctgcct 49441 ctgcctccca aaatgttggg attacagaca tgagccactg tgcctggccc agctcttcct 49501 tttctgagaa aagtgttctc cccattggct tgatgggata tctctgcaat tttcccctgg 49561 ggggccttga atccccttga tataatcctg caatcttaaa aaaatcaatt tttaaatttc 49621 acagattaca gaatcctgta tttgaggatc tcaaaacttc tgctcctaag ccttgtgagc 49681 aatgatggtt acaaaaataa actcaactcg gagattttag ttaacaaaag ataaaagcat 49741 tgctatttca aaatcatttt tttgtccctg aggtggtggg cagggaaaaa gacagagctg 49801 tttatttttg tggtaattac tgctttgggt aaaaatactg acccgtatag gaatgtctgt 49861 gagtgatccc ttgatataac catcagagta gggaaaggaa aatgaggagc tgctgtttaa 49921 taaatataga gtttcagttt tgcaagatga aaacgttctg gaggtctgtg ttgcataata 49981 atgtgaatat ccttaacact acttaacttg acacttaaaa atagttaaga ttgtaaaatg 50041 tatttttttt attttttttt ttaggaacag ggtcttctct gtcacccagg ctagagtgct 50101 gtgatgtgat catggcttac tacagccttg aacccctggg ctcaagccat cctcctgcct 50161 cagcctcctg agctgctgag actacaggca tgggttacca catctggcta attttttagt 50221 tttctgtaga gatggggtct cactatgttg cccagactgg tctcaaattc ctggactcaa 50281 gtgaacctcc acctcggctt cccatagtga tggaattaca ggcatgattt tttttttaaa 50341 gactagtcaa gtgcagtagt gagaagggat gggtggaagt agaacaagga gttcgatctg 50401 caattgtgaa caatcaattg agataaccca ctaccttcgg acgagccaag atggcaaatt 50461 ttatgttatg tgtttgttac ctcaatttaa aattaaaaac aaaatatatt agagatttct 50521 gggaactttt aaacaatgtc caagaaagag aaatatgaga aagaggggtg ataagcatgt 50581 tattttcttc attttaacag tctaagttta ttgcttgccc ttacccttgt aataaatgaa 50641 gatctagaat cttcaaattt cagacctgaa tctatgtcaa ggtgaggaat caagacacag 50701 agctcatggt tatagcgaag agtaagggag ctgaccattt ctttgcacct ataatctata 50761 tgtgaaacaa tccacaatga cctgagccct gaaggattct cttaaagaat tattaactta 50821 aaaaatattt acatgtcagc tatattttgc cataccacta gatggtacaa aaacctagtt 50881 ttctggtaat gattcactca tgttgtgaag gcaatggtag agtatttaat gtgaaatact 50941 ctgaatatgt tctttgtgat gttgaaaagt aaaacatgtt ggagtaacac ctggaaaaag 51001 tagcattacg ctctcatttt ctgttagttt gcttccagct tgctaatgac agaaaaggca 51061 catttgggaa gatagtttat acatgtcttt taaatctccc aaccctttga ctgggatttt 51121 gtatctcaga tctagttttg atggtgatat gtctactctg tctggcttaa tctcagtaac 51181 ttctagaatt gttaaaatta tggtagaaac atctccaatt ttaggtcaca agtgcttgat 51241 actttgagaa tatattggct tttctgaaaa gagaaatgct ttgaagttcc aaaagaggat 51301 gtagcagtat tttgtttaga taaagatcaa aaacaggtta aacaaatccg tgtgttaaac 51361 atcaaggttc taaatgaaat aaaactagaa ttttaaagta aaaactaaaa agacccttca 51421 atctggaagt tttaaaaact cttgagtgaa aggggggaaa atcagaaatt atagattttc 51481 tgaaaaataa tggtaatgaa aacactacac atcagaattt atgggattta aagcagtggc 51541 cagaggaaaa ttcatagccc tggacatcta caacaattaa aggaataaaa acaaatgaat 51601 taaattctca actcaaaaaa aaaaaactgg aaaaagaaca aggcaaacca aaagagagta 51661 cagcaaagga cataaccaag ataaaaataa aagtggaggt ttcctttgga gggagtgggc 51721 agtggcttga aaggaggcct tggggaggag tgcctcggcg gtgctggtaa tgctctatat 51781 cctgagctgg ctgttgatta cacaattgtt ttcactttct cacaatccac tgagctggaa 51841 ttcactctct ctagagaaag aaagtttaaa agggcacaaa tattaagtgg ccatctcagt 51901 gaatatatat atgtataata tataatatta tgtgtatagt atataatatt atatattata 51961 taatatatta tatacatata catattatat gtatatatat acacatatat acatgtaaat 52021 atatacatat atacatgtat atatatacat attatatata aatatataat atgtataata 52081 tataatatat aatatatttt ttaaaatttt tttaacagag aagaccgttt ctaaaaaaat 52141 taaaaaatta tacattatat attatatatt ataaatatta ataatatata ttatatatta 52201 taatatatta tataataaat atatattaca ttatatattt tatataattt tatattatat 52261 atttatatat tatatattat ttatattata ttatatgttt tatatatatt atatatattc 52321 tctctatata tattctctat atatagagag ggagggaggg agggagatat ttacattttt 52381 atgggtacat agtaagcata tgtatttatg gggtatgtaa gatggggtac ataagatatt 52441 ttgatatagg catgcaatgt gaaataagca catcatggag aatggcatgg tactggcata 52501 aaaacagaca catagaccaa tggaacagaa tagagaaccc agaaacaaat ccacacacct 52561 acagggaact cattttcaac aaaggtgcca agaacatacg ctgtgacctt ttaaacttca 52621 ttgaagaatt tccccaaaaa gagaaactgt aattttatta ttctgtttcc cccagtacat 52681 tgttctccct ctgatgctta gcacagtatc tttcacagag caggcactca ataaatgtgg 52741 aattgccttt aatggcttta tatgtttcac cattgaacct gccattattg gaatgtgagg 52801 ccatagataa tgagagtaat agtaatggtt taaacacata cagcacttta taattcaaag 52861 ctctgaatcc ctgactttca tgtaagtgga acaggttgcc tggatccatt aattatgtgg 52921 accagatgta catgtggagg tctcctgacc tggaaatggt ggctattcca gtgataacga 52981 caaagtaggg taacaacata ctatactcat ctaggttgac catccatcct ggtttgcatg 53041 ggactacccc gcttttagca ttaaaagtcc catgtcccag gaaattcctc gttcccaggc 53101 aaaccaggac agctggtcac attatatcac tttatgtgca tgatctcatt agccttcaca 53161 ataaccttat cagggaggga ctattattat ttttatttta ctgatgagaa actggaggct 53221 cagaggggtg aagtcacctg tgtaaggtca tgcagctaga aagtagtgga gtctcgactt 53281 tagccctctt tggactgaat ccaaagtcaa cattctttcc actcttctgc agctgcagag 53341 agccaaaaga gatagacgac ggcagtcagt ggggactgaa ttactgaagt tatcctattg 53401 gaaaggacat tcttcaactt gtccagagaa tgacagagag attcaagacc agctgggaga 53461 agtgcaaatt gggatagctc ccttggccca gagccaaaag ctatggttta gtgcagtcat 53521 tctggacaag gggttgatta aaaccttcaa atatgaacaa atgttgcaga gttctcccta 53581 ggtagagaag caaatcccaa tgacctatta aattcattag caaagagcta aattccctga 53641 ggacttggag cagctcctct gagtacagtt tctagataac ttaccaagca gggtggccaa 53701 ctccacccag agtggaaatt cagaatcaca gttacggttt gagataattg atgtcagttt 53761 tttccaaatg acccacaaag actgttttct gatctttcaa gttcattaag tccaggttcc 53821 tattatagac ccaaagattc aggaggaaat gcagggttgg aagctccagg atggaacttt 53881 caggaagcac ctggacagaa atagcgaaag atgaacatga taacaagtaa acaaagggca 53941 cagaagcagc aatgccttcc caaaccagcg aaagagcata gaaaattata ttcaagagct 54001 gccctggtgg gatattctga ggtgtgcaat aagcacatca tttgaacctc attagtttcc 54061 cagaagcaca gctgggaata tctcaactct ccattaatgc aatttagcgt gtcctaccta 54121 agtgctctgc cccctggcag gtagagcaaa tctcagtatt aacatgcgtg ttcatatcac 54181 tgacattaag tttcatttca ccctcaaaag ccaaaggacg atcagtcttc gatattctgt 54241 tagctaatcc cataacatat aatcatgcag aaggtttcag aaattcaggt gtgctttcct 54301 gcttataagc caacacactt tgtgacacaa ataataacaa ataatcaaca gatattttaa 54361 ccaggcagaa gtccatccaa tggattattc agccttggtt attattctct gttacctgaa 54421 gctgatggag ttggtaattc tgaataaact gaagagcacc attgtcaccc acttcctaaa 54481 ggaaagtacc agctttcaga aggacaatgc ccatgtctga gccatctctg tattttatca 54541 tgcctagcac agcaacctag ttcagtaaat tttttcccca aaaaataaat gaatgaaaat 54601 agcattgtgt cactgaccaa ttgatattac tcttggccaa caaataaaac aaatgaactt 54661 tacagattgt ttttggacct tctcaagact ggaatagcta acctcctctg atatcccatg 54721 acgccatggg cttcccctat ctccttgaca ccaagtcaca gcacaggtca ctttgtgttt 54781 atcccaggtg actatttctt tgtctgtctt ccccactaga ctatgagcta ggtctgccgt 54841 gatcatcact gcatccccaa tgctatgttt gtgaatgact ggttgcatga aatattaatt 54901 ctactctatg ttccagaagc actagtttga ggctaaggaa taatcaattt accactcttt 54961 ccccaggtgc ccttgctcat ccaactgtgt gaccttgggt aaacagctca gcttcctggg 55021 agccagtgtc agttagagta gtgggctcgg ggatggacct gcaagggggt gcagggcatt 55081 ccctgatact agggctctga ggagggacct aaggctggag gcaagagtag attgctcctc 55141 ctgaacaaag ttgaaagggg gaggcttggt tttctggtga taaggtatag agcctttggc 55201 ttcagataga tgaaggctta aggtatatct ccagcactta tgactaactt tggacaatta 55261 accccatccc tctgagcttt atttctttat cacggctcat tttggcatct taccctaacc 55321 attgtgtatc ctcaaggtga attcataatt ttgtgtgcat gcgtatttgg aattttaaaa 55381 taatatttga acaataaagg ctaccattta ctgagcactc aacatgtggt gggctctggg 55441 ctaagtgcct tagatacatc atctcattat atcagcaaaa taacccttgg aggcaagaat 55501 tagtgtgcta tttgacagat aaagaaatta attcttggag tggtttagta agtggtccag 55561 cccatgcagc tgggtaagtt gaagaacagg gatgcaagcc tggttttgtc tgactccaaa 55621 atctcatcct cttaactggt acactaactg ctgtaattaa aagatattaa aacggaaaca 55681 tgtaaaagcc aaagtacaaa gtaaataaat agaagagctt ttgctttctg ttgaagggcc 55741 attagttcca ggcccattct gatgtcaaat ggaaaaaata ataaggttgg tccatcagtt 55801 agcagaaaca aacagaccac ttttccagta tgctggtcct ttctccccat tactagctaa 55861 aatgattacc atccactcaa ccactttaat gactcctgac tcagagcttt catttggctc 55921 tctttttacc cataagatat caaaatatta tgtagctaaa aagttctctc tgaaagaact 55981 ttgcaaactt tttttttaaa gcagaagtca tcagttatca gtcttctaaa gccaggaatt 56041 ctttcaaaat tctataagat cttttgtcaa atgaatttag ctctgaatga aggaaatact 56101 aagcgccctt cacactgtcc ttccccggag agtttcagac ctcatggacg ctgtgtaaca 56161 ttgggaatga ctttggtgac taaaactggt cctgttgccc tagtttcagg acaaaatttg 56221 caaactgctt aggtgtcagg ggaagcacat tttagtcaaa ccaagcacaa tttccatcct 56281 gttagagaca agccaattac tgtgccaggt tagagaagaa aggagacagg ccacaggcac 56341 agaggcttgc atcgtctatg cacaggtcta tgctttgggg gctctgctgc tctctgctgg 56401 ccacaggctg ttgctgtagt cgccgaggaa gtgattcatt caaagtgctg cttctggggt 56461 ttctcctact attgtcacta ccactaaata gtacctttta ttgagtgccc tttcggtgcc 56521 aggttctgag ctaggtgcca cgttctgagc taggtgccac ggagtcagag atgacaagtt 56581 ttggtcctca tccttggcaa gctcagtcta gtggggaaga tagattctca aacagagaat 56641 tataataaag tgcagcaaat gccgaaagag agtgtgagaa aaatccttgc tttccctcaa 56701 tcagcagtac tgagttccta ctctgtgtca actgtctagt gaggttctgg gaaaataagc 56761 acagaacttt ataggtgtag ctgtccactc agtacatgtc tcatctcatt tcatcatcac 56821 aacaaatccg tgaggtttac agggcagata gtaatagccc tattatatag attaggaagc 56881 tgagactcag agaggttgca tgagttgcca aatgtaacac agctaggtgc taggttatag 56941 aggcttggac tcgatggact tcaaattcac tctctttact gctgcaccac actgcttggt 57001 gctgcttact gagaggttgg tagggtaatg gttaagacta agtgctccta gagtcaggct 57061 gcctgggtcc actctgactg taccactctc ttgcatcagg caaattactt agcttctctg 57121 tgcctcattt tctccatctt ttaaatgaga agaataacag tacttaactt acgagattat 57181 tttgaggacc aaataagtaa atgcttatgt cagtcatata ataaacaata aatgtcagct 57241 cttttattat tattattata aagatggggg tctcactatg ctgcccaggc tggccttgaa 57301 ctactgggct caaactaccc tcccacctca gcctcccaaa atgctgggat tacaggcatt 57361 agccactgca cccagcggtc agctgttaat attactgtca acacgtgagg cagaacaact 57421 tggggcaagt aggaagaaga gatgtagatc aaggaaggtg ctaagtgctg ctaaagtaac 57481 atctgattag ttattcttgc ttgattaaca ggaaaaaccc ttccagctct ctatacagag 57541 tccactctga tagctcattt ttccagtcct ccagggacag cccctcttct ccttgtctcc 57601 aagagtaccc gcacaagtaa tggggctaca gaaagtggaa gaactaattc tgccctggat 57661 aggtgacctc tgcctttgta gggttggggg cagaataaaa ctgtcagaag gaccaggatt 57721 tgtaaaagcg tctattgtag aggatttata attgaatgag tcatcttatc catatcttta 57781 aaacagggcc agtatgggtt agtactagaa atatgagcag agacttaatt atatagtgat 57841 atttaaattg gatcttcctt ctatgtcaat actctgggat gccatggcaa tccccttctc 57901 ccacttggct gcttttcctc aattcgctct gacagctcct cttttggctc ctccatgagc 57961 tcttctacca ccccttaaat gctagtgttc cccagggatc tttgcttggc ctccagtcct 58021 cacacactac actctttttc tatgtaattg catcatttcc cacaacataa atgaccatct 58081 atttccaagt tgataactcc aaaatctctg tctcctgcca agacatctgc ccaagctcca 58141 acctcataat ctacctgtct gctagatatc agcacctaga ggccccacag gcaacagtcc 58201 ctgccaaatg tgtttcttca cttgtgttct ctacattatt ttctgacatc agcattaacc 58261 tagccagtca atgtaggcac ttggaagtca ttctttttta ttgagatatc acataccata 58321 atatttgcct ttttaaactg tacaattcag tgttttttag taaatttaca aagtttctca 58381 accagtccca caatcttgtt ttaggacatt tcatcacacc cgaaaaagtt ctgcttgtcc 58441 attaacagtc actccccatc gtcccctctc ccagtctctg gaaacaatga atctgtgttc 58501 tgtctccttg gatttgccta ttctgtgtat ttcacataaa tgcaattgta caatttatgg 58561 tattttgtga tggcttcttt cacttagcat aatgttttca tggttcatct atgttgtagc 58621 atgtatcagt ccttcattcc tttatgtggc tgaaaaatat tccattgcat agatttatca 58681 cattttcttt atttatcagg caactgacat ttgagctgtt tccacttttg ggctactatg 58741 aataatgctg ctatgaacat ccacatacaa gtttttgtgt ggacatatgt tttacttagt 58801 tttagagttg ctgagtatat acccaagatt agaattgctg ggtcatttgg taactgtacg 58861 tttaagcttt tgaggaactg ccaaactatt tttcaaagtg agcacaccat tttacaaccc 58921 taccagcaat gtatgatggt tcaaatttct gtaggagtca ttcttgattc ctccctctct 58981 ctcagtccac acagatagtc aatcacccaa gtcttgccaa tttaacaacc taaacctctc 59041 tcagtactgc aactgctgct cttggtcagt tctacatcat ctttcacctg gactattgta 59101 atagccttca agctggttct tgcctctggc tactcatcca cctgtaccag catgagcttt 59161 ctaaaaacaa gccccaaaga atgtcaattc cctgtttgaa accctttaat gcagtgactc 59221 tcaaccctgc ctgcacagta gaatcacctg gacagatttc agatatcctg atgccagggc 59281 tgcaccccaa gctgattaaa tcagaatatc caggtttggc atccagacat cagtattgct 59341 aaagctcccg aagagatttc aatatgtagc caaaaacaga aacattgctt tagtgactcc 59401 ctactgccca taagatgaag ttcaagctcc ctagcgtgac ccacgggtcc ttcatcatct 59461 gactcctgcc cacctattta tctgcatctt gtacccctcc ctgcatcaca cgtaataccc 59521 cagctgtact gatctgcatg tggcctccag cagaaacata tctttcttcc ctgtcttctc 59581 tgcctggaat gtccttctca gctctctttg cctcgggagc tccctcctac tcattcctta 59641 ggattcaaca tctgtagaaa ctcttccttt atcatgtcct ttttgtcctg tgtactcctt 59701 cagcgctatt ggaattgttt gtcttcccca gaagactggg aaccacccca tcccccaccc 59761 acagtttagc acctgaaatt tatcatgccc tctaagtaag agcttttggg taagtgaatg 59821 aattaattaa tgtccttgaa agagttgcaa cacagcattt tgttgtgtac catcttaaca 59881 atcagaccct gaaggcatct gggccagagt agacaaggac ctcaaccatg aagaaatggc 59941 ccggtgtacc caaccccacc ataccagcac acagcttctc tagccagggg aatctcaggt 60001 gccacagaaa ttggacctca accagagaag ttggctcttg actagaaaac actgaagatg 60061 aaataaattc ggacctaaca tgaagtggat agcttagcag acccttcctg taaaattcga 60121 aaactcctgc ccaagagaag agggaccact gcctgcgctt gaataaagga ctcaagtata 60181 ggcaaatcga agcaccttcc ttccctgagc ccattccaat gctcatctga tctctggagc 60241 tttcatgttg tacacctctc caccctatct taaccttgaa tcccaggtac actctgcctg 60301 ctctgacaac tctctaccta cctacctgcc tacctgactc tcatacctta gctccacttt 60361 gatcacctgg tagcaatgga ttgtccaact acaccaagtg tcaagcctac aggctctcag 60421 gaacccatct ctgagaaagg catcagcttc ccagccaact gcctgcccaa tgtccggccc 60481 cagacttggc cccttgatga ggtattctcc ttacatatat ctcaatacta aaatgactta 60541 ccaacactgc tatagactaa atgtttgtgt ccctccaaat tcacatgtta aaacgtaatc 60601 tccaatgtga tggtatttag gatgtggggc atttgggagg tgaataggtc atgaaggtga 60661 aggcctcata aatggaatta gtgctcttat aaaagaggcc ccagagagct catttgactc 60721 ttctgtcatg tgagaacaca gagaacacca ccatgtatga actggaaagt gggccctcac 60781 tagacactga atctgttggc atattgatct tgagcttcct aggcttcaaa actggaagaa 60841 ataaatttct gttgtttata agccacccag tctatagaat tctgttatag cagtccaaat 60901 ggactaagat gaacactaac actaattgcc catattgcat gccagctgta agagaaagca 60961 ccatttgagg aaacacaaaa ctaaacacac attagagtcc tttttacaac tctgctttgt 61021 tgctgtttaa ttcatggttc tcaaccttga ctgcacattg aaatactgtg tgtttcgaat 61081 gtttcgaaaa tactgacagc tataagtagg taaatcaatc aatcgaatta cctaattttg 61141 aaagtaaata aataaaaata ccaatttggg gtctcacacc caaatattct gatttaatta 61201 gttcagacgg cagcccagac agcaaaagtt ttataagctc tccaggtgat tctaatatgc 61261 aggcaggatt gagacccact tgcccaaaag ctacagtcat gtgccccata acgatgtttt 61321 gctcaatcac aggaccacat gtacagtggt ggtcctataa gattacaatg gaattaaaaa 61381 tttcctattg cctagtggca ttgtagtcat agtaacgttg cagccctaac accataatgc 61441 aatgcaatgc attactcatg tttgtgacac tagtgttaac aagctattgc tctgccagtc 61501 atataaaaga ctagcaatac aactatgtat agaacatata cttgataatg acaataaaca 61561 actatgttac tggtttagat atttactata ccataccttt tatcattatt ttcaagcgga 61621 ctccttctac ttattaaaaa aaacagttaa gtgtaaaaca agctcaggca ggtccttcag 61681 gaagtattcc agaagaaaaa attgttagcg taggagatga cagcttcgtg aatgttactg 61741 tccctaaaga ccttccagag gggaacaaga tgtggagatg gaagaccatg atactgatga 61801 tcctgaccct gtgtaggcct aggctaatgt gtgtgtttgg tcttagagtc taacaaaaaa 61861 gtctcaaaac tgcgcttgct ttggcagccc atgtactaaa actggagtga tacagagaat 61921 agtaacatgg ctcctgtgca agaatgacac acaaattgtg atgcgttcca ttaaaaaaaa 61981 ctaaaaataa gtaaaaataa taaaaagtaa aagttttaaa gtagataagc ttagagaata 62041 aggatataca tgatgaaaat atttttgtac agctgtacaa tttgtgttta agtgttatta 62101 caaaacagta aaacaggtaa aaaaatttaa ggtttataaa gtgaaaaaat tatagtaagc 62161 taaggttaat tagttattga agaaagaaaa aaattatgta ttttttgaga tggagtcttg 62221 ctctgtcacc caaactggag tgcaatgatg aaatcatggc tcattgtagc cttgaactcc 62281 tggtctcaag tggtcctccc acctcagcct ctcaagttgc tgggactaca ggtgtgtgcc 62341 tccacacaca gctaattttt aaattttttg tagagatgat gtattgccat gttgcccaag 62401 ctggtctgaa actcctgaac ttaagcgatc catccacctc agcctcccaa agtgttggga 62461 ttacaggagt gagccaccat gtccagccaa gaaaaaatat tttttataaa tgtagtgtag 62521 cctaagtgta cagtgtttgt aaagtctaca gtagcatgga gtgatgtcct agaccttcac 62581 atttactcac cacccactca ctgactcaac cagagcaact tctactattg caagctccat 62641 tcgtggtaag cgccttatac aggtgtataa ttttttaaat cttttatact gtatttttac 62701 tgtgcctttt ctatatttag atatgttcga tacataaata cttaccattg tgttcccatt 62761 gcctacagta ttcaacatag caacatgctg ttcaggttcg tagcctagga gcaatagtcc 62821 acaccatata gcctgggtgt gtagtaggct ataccaccta tgtttgtgta agtgcactct 62881 gtgctgtgca cacaacaaaa tcacctaatg acacattttt cagaaggtac cctcatcatt 62941 atgcaacaca tgactgtgct tcttagactc caaacatact tcccctgact tggttctagg 63001 gagtcaacat tgtcaatgtt acaacttcgt cttagaatgg tttagtgctc cccccacctt 63061 cttcttatac ccaggatcat aaaggcttct tcaagggcat tttgttgggc acggagctga 63121 ccacctccat gtgtttcctt cccttcaggc cttacagagc ttattctcat tattccacct 63181 ctatttgcaa cggagcctga aacctatcct tatttgcagt ttgccattta aaaatcatcc 63241 cttctggctt tactttttgc cctgttcttt ctgtgtccaa actgaaagtc cgaagtgaga 63301 gaaactcatg ctacagagtg actcgctggg gtttggtgtc aaatcttaaa acgcccactt 63361 tccaatcaag ggtgactaaa ttatgaagta gcaccgcccc caccccctcc tttttagctc 63421 ttcacaatgc ttgctatcac ctattttata attagcagta gattagcatg aaaaagatct 63481 tccttcctga attaggcact gcaaatatat tttctaactt ttgacctaaa ttgcagtctg 63541 atttaaagtt ttgtttgttt gtttaccagt tttctagaga aataagaaaa aaaaataaag 63601 ttttgtttgt ttgcttgctt gcttgctttt tttattcact caacaaatcc ttagcacatt 63661 ttgaacaaag cagcttgaaa gcagagttag gctatttctg tacacaaata gaagagcaac 63721 taagcttgtg ttttcaccag tggtcttttg cagaagataa gggtcaatag ctatgaggga 63781 ccaaggctca agtttcactt aagggaaact ggattcactt ctccaagtga tactgaaagc 63841 ttgtattcca gggttaagat ttaccaaatc agggactctc ttgttttcag ggatagtgca 63901 gattctacat atcgggtttt tagggtgcac atgatgaata gtttggaaac gtttctgcac 63961 cagtttatca ctggcaagac agaggtacca cttgcagtcc tcttcctgag aaggttcctc 64021 ccacaaggca tcccacttcc ttttcaaatt ttacttattt tgctcatcag caataaaatg 64081 aaaatgtgtc tttcgctctc tataatcatc tccccaactc agcaacttaa ggtgcctctg 64141 tcactttgat ttattcaaat gatttgaaat gtcaacttat tgaagatctg cagaggacca 64201 tgggtaatgt cacccactgc cctgaaaagc caacgaattt taactctctg aacggatgtc 64261 agacaagggc aaagctgagc ccatggaatc actttttgcc atgcttctca atggagcaat 64321 tcaaggtgac ttcgtcatgc ccattgagga agccacatct gccagtttgt atggcatcat 64381 attggcaaaa aggaaaaatg gtcactgaga ggagactttt ccccacaagt atgatttctt 64441 agagagccat ggcattattt tggtgcttaa tgaagtgaaa cctttttatc ccagctacag 64501 tccggcctta ctggattggc aacagcccca gcaaactgac aatttaaatg agggtgccac 64561 tgcttgagga ctttatgtga ggcctcatta attagtctat gaggatggtc taacacggtg 64621 cttggcacag aggtagatag cactcgtggc atattaacca ttgctatgat tatgaaaagt 64681 tcatttctag agagttccac cagtagttaa gaggtaggct tagggatttg acctctaaca 64741 cactgtgact attcctgtca ttctatcctg ctggtggctg gcctggtgct ctagggtagt 64801 caaaaccatt ttaagaaacc gcagagaaat gttaaattct cctgttgtgc tggaattcag 64861 tgaatgccaa tggatcttaa aataattggg tcagtgtctg aggcttagaa aggcaagaat 64921 tctcttttgc cacatgggat tttgtgtgaa tttggcatta atatcagacc tgtgctatct 64981 tggcttgctt ctttaggaag taagctggtg ctaacccctc aaggaaagga taaaacactt 65041 agcagagcct tgtttgtttc tcagtaggga gtggcacact agcttaacat gtaggaagtt 65101 ggcaataaag tactatgtat gacggagatc tgacctcgtt cttagctgag gttggtatgg 65161 cttcctctag agcatttgcc aagtggcctt tgtggttaaa cagcacactg aactttctct 65221 taactccttt taaagtgagg acaaaaggat ggatatgcct ggctgccgcc aatgaacatt 65281 acacagtgcc tcctctgtgt taggtgttgg aggtgccatg agcagcacag tcccagtccc 65341 agttctcaag ttcacagtct agtcagagat acaagcagtg atgcactgga gcccattgct 65401 actgattcat gagagttgat agatgtttag aaatttttgc cagtcagtcg ttaaacacac 65461 acttattaaa aataaaattc tataagcctg caattaaaat acatttcaaa caaaggtaag 65521 aaacactcaa aatgtatcac ttcctaatta ttttactgcc attgcttatt atatatgctc 65581 ttgaggtgat ctatagctat tgtagcagtc tgatggaaat gctgtataat gctgtactac 65641 tgggcatttc tacccaaatc aatgtgtagc ctcacgtcgg tagcttgaaa ttggtcatgg 65701 tgggaatatt tacacagtgg aaattggcaa tcactataaa cttggaatgt tttctttttt 65761 cctttcataa aatcagttat taaacattta ttagcacacc tggagacatt ctagcaaata 65821 gattacagta aatccagtgg gccccatggc tgtgggatta tagcccatgg taggagaaca 65881 tgggaaagtg ttgtgaaagg cttcacacag gaggcaggtg cttgaagcaa agctctgaaa 65941 gataattagg agtgaaggcc cagaggcaag atagggcaga attgatcgaa tgaaaaagat 66001 gtgggttttg ctctcagggg ctctcagtta aaaggaaaca ggagaaaggg agtgaagggt 66061 caagtgctct gtggggtcta ctgggactgc tgtaggggca cagaggaggg cacagggctg 66121 gtgggggtag agtaaggctt cctggaggag gtgatgtggc atcttaccac ggagatcaaa 66181 agagaactag gagtagccac cgttcctgga atggaggtag aaaagtaatg tcttcttata 66241 tattaaaata tattgccaat acaatcagca tagaagagta atcaatgttt taaaaatgta 66301 ttctcaagaa atgcttcact cttgagattg tgcaatcctt gaaatattgc cttgtgagtg 66361 gggatcttag tttctagtgc agcctcccac ctatcacggg gctttgctta taatagccct 66421 ggctgagggc tttgaagttc tgcctgaata ctacaggtgt tgggggagaa gggaattgag 66481 tgcatttgtt aaattctgaa ctaaatggga ataaagacct catttgggga gataattatt 66541 aaatcctata ttcagaagtc tggtgtttgc taagcccttc agtttcatgg acaaagcata 66601 aacaatattc ccttttcaac accacaaatt aacttttcca agaaaaattg gctgtgagat 66661 aaggatagat tatattctgg tatgtgtatg aggaatgggg actttgaact gctctcaagt 66721 ccctaactac agaaaatcta agaaatgatg ttgtctcact ttggtaaaat aacatctggg 66781 gaaaatacct actttctctt tgcatttagt gcctcattaa aaaaaaaaaa aacaaaacta 66841 aaggtcagat tagcaaccta aaaatgaaac tagaatggag gtgaggaaag aagtagtgta 66901 acattttacc ctgaggctga gaagagggag cagatctagt tgtgcaagca ctcagagtct 66961 gaaacgtagg cccttagtgt tacaaacagg atgaatgcct tggcaaataa actcttgaaa 67021 agaaacctgt gctaagattt cccagagact ctgagctttc tttaatcctt tacatccttc 67081 tttttgttac aaaaataaat ttatttattt atttatttat ttatttattt atttggacag 67141 ggtcttgctc tgtcgcccag gttggagtgc agtggtgaat cacagcacac tgcagccttg 67201 acctccaggg cttaagcaat cctcccacct cagcctcctg agtagctgag accacggaca 67261 cacaccacca tgcctggcta attttgtcat ttttttgtag agacagagtt ctctccatgt 67321 tgcccaagct ggtctcaaac tcctaggctc aagaaatcct cccacctcag gctcccaaag 67381 cgctgggata acaggcatga gccactgcat ccggcaaatt cttcttaact ttcttgtttg 67441 caagaaatag aaacccattc agctagttca gataaaggat gggtgggaaa atgaaattaa 67501 ttcagaaata ttaactgaca ctcactatgt gccgggttct gagctaggag tccctggtaa 67561 gcagtgacta agatggacac agtccctgtc ctcacagagc gtaccatcta ggaggaaaaa 67621 aagcactaaa ctaacaatta cacagtttgg gtgttgttgt tgttgtttct gagacagggt 67681 cttgctctgt tacccagtct gaagtgcagt agcgcaatct tgggtcattg caacctccgc 67741 cactggggtt caagcaatcc tcttgcctca gcctcctgag tagctggaat tacaggcatg 67801 tgccaccaca ccacactaat ttttgtttct gttttgtttt gttttttagt aaagatgggg 67861 tttcaccatg ttggccaggc tcgtctcaaa ctcctgactt gaaatgatcc gcccacctca 67921 gtctcccaaa gttctgggat tacaggcatg agccactacg cccagcctat ttgttattta 67981 attataagca cgatgtgcac tctgagggaa aggagtgagg ggtcgtgaga gcccatggag 68041 ctccaaccta atagggctgg atgaaaggcc agaggaggct tcactggaga agtgatattt 68101 gagaaaactc aggagcatac ctttagctgg gatgaaggaa gcattatctg gtacccttgc 68161 ttttctttct aattccctta tcaggcaata cgttacaaaa tcagcatgaa atttgataaa 68221 ttaaaactca gaaacaatgt agaaaaaaaa gcctcccgat ctattttata ctcgcaggct 68281 ccttccacca atatgctcaa ttattttccc tgaacagatg ttcccagacc tttcactcac 68341 ccaggagttt atttcttgtt tattcctgac aaaattgctg tttagcataa gatttaaaat 68401 gtgtgcattt acagcgctac atccagaatg caccccagaa aggaattttc accattaaat 68461 gtacaactta gtcagggtta tgatgactaa aatgttgtgg gaatgctgat tatttaaaca 68521 tgacattatt aagttttaat tcatctttgt aaacaattct ttatttttat actgctgcag 68581 aattaatacc actgttgact tgtgtttcag tctttacata tcgcattccc tgtgcttttc 68641 acaggtctac tgtgtttcca cagttacttc ccatctgaga agctggagag gaggcatttg 68701 ccttgaccct ccacccacct acagtcaggc aaacatgcct ctgactaata gaaactgcct 68761 gaaggtgatg acttggtaca tccagcagac actgaactgc tggacagaat catatttata 68821 gctttcttac aactatcttt tgcttctcac acagggtagc tactggatac aggaaggtgc 68881 gtcttctttt ctcacaccca tttctctcct ttgggtttat gactccacca ttgtagacat 68941 acttgtctgg cctaagaaag caaactgacc aaagagttgc gcaaaactca gcagattcca 69001 ttttgtaaac cttctccagt tggaattgca taaaagagcc cttctccttt aaatttcact 69061 ttgaactcta ccttggctga accaaaatat cagagggaaa ctcagtcaaa accacagtgt 69121 agaccccaaa taagtaaaga tttacttgaa acctgaagat tttctcaaaa gcttttgaga 69181 aggtgcagat agaaaggcca acctcacttg gatttatgat aaggagtgaa atctgaaacc 69241 atgggttcta ggcactggct accactagag gtaggtggta aaatagcaat aataataata 69301 ataattactg ctgctactac tacctcttgt cttaagcctt tgctatatgt cagtagatat 69361 tgtgctaagc attgtttatc tcaatggctt atctcagcaa ccctctgggg tgggtgtgat 69421 atgacatctc tttacagatg agaaaataag gcttgttcga attctgccaa ttatacttgc 69481 aagcatgtat gtgtagctct atgcaatttt atcacatgtg aagatttgtg taatcacaac 69541 cacaatcaag atactcaact gcagcctggg caaaatagtg agacctcgtc tctaaaaaaa 69601 aaaaaaaaaa aaaaaataca aaaaagaggc ctggcgcagt ggctcatgct tggaatccca 69661 gcactttggg aggccaaggt gggtggattg cctgaggtca ggagttcaag accagcctgg 69721 ccaacatggt gaaaccctgt ctctactaaa aaatacaaaa attagccagg tgtggtggtg 69781 catgcctgta atcccagcta cttgggaggc tgaggcagga gaatcgcttg cacctgggaa 69841 gtggagcttg cagtgagccg agattgcgcc accgcactcc agcctgggtg acagagcaag 69901 agtccgtctc caaagaaaaa aaataaagaa aaataaagaa agaaaaagaa agagagagag 69961 agagaaagga aggaaggaag gaaggaagga aggaaggaag gaagaaagaa agaaagaaag 70021 aaagaaagaa agaaagaaag aaagaaagaa agaaagaaag aaagaaagaa aaagaaaagg 70081 ccgggcgcag tggctcacgc ctgtaatccc agcattttgg gaggctgagg cgggcggatc 70141 acctgaggtc aggagtttga gaccatcatg gccaacgtgg tgaaaccccg tctctactaa 70201 acatacaaaa aaaaaattat ccgggcatgg tggtacacgc ctgtagtccc agctactcgg 70261 gaggccgagg caggagagtt gcttgaactc aggaggcaga ggttgcagtg agccaagatt 70321 gcaccattgt actccagccg gggtgacaag agcaaaactc caactcaaaa agaaaaaaag 70381 aaaaaagaaa aataaaacaa aaaaagagcc aagctcacac ctgtaatccc agcaccttgg 70441 aaggctgaag caggacgatt gtaattacag acggggtggc ttgtgcctgt actcacagaa 70501 ccttggaaaa ctgaggcagg aggatggctt gagcccagga gtttgaggct gcagtgagct 70561 atgatagcac cattgcactc cagcctgtga gacacagcga gaccctgtct ctaaaaaaac 70621 taaaataaat aaataaaata aaagatactt aactgtacca ttgccacaaa actctgttat 70681 tatcctttta cggccacatt cggcccctct cccttatccc tggtaaccat taatctgttc 70741 tctatcataa tagttttgcc attctgagaa tattatataa atgaactgat agactacata 70801 aacttttgag attgcctttt tttttcccct cagcataatt gccttgagat tcatccaagt 70861 tgttgcatat tgtattgttt gtattattga gtggcatttc acggtatgga catagaacaa 70921 tttctttatt gaagtttatt cctattgaag gacatttggg tatttttcca gtttttggct 70981 attacaaata aagacaagtt tctgcatgca aatttatgac cataaagttg tttgtagtat 71041 ttcttattat ccttttaatg gctgcagagg ttgctgtgat atctttgttt tattcctcat 71101 attggtgatt tttgtttttc tgggtttttt ttttttgtat ttttttaggg acaggatctc 71161 actatgttgc ccaggctaga gtgcaagtgg ctattcacag tcacaatcat agtgtactac 71221 agcctcatac tcctgggctg aagcaatcct cctgcctcag cctcctgagt acctgggact 71281 acagatgtat gccactgtgc ctggtctgat attggtgatt tttatcttct ttcgttttac 71341 ctttgtcaaa tttgctagag gtttatcaac tttatcaagt tatttcaaag aaccagcttt 71401 tttttttttt tttttgagac ggagtttcgc tcctgttgcc caggctggag tgcagtggca 71461 caatctcagc tcaccacaac ctccacctcc tgggttcaag cgattctcct gcctcagcct 71521 cccgaatagc tgggattaca ggcatgcacc accacaccca gctaattttg tatttttagt 71581 agaggcaggg tttctacatg ttggtcaggc tggtcgcaaa ctcccaacct cagttgatcc 71641 gcccccctcg gcctcccaaa gggctgggat tacagacatg aaccactgcg cccgacctga 71701 accagctttt tgtttcatca gtttttctaa tgattctctg ttttctatgc cactgaaaac 71761 aatggtattg actttctctt ctttttctag gctcttaaga tgtgagccta aattattgat 71821 ttgagacttt tgcccttttc caatgtaagc attcagtgtt ataaatttcc ttcttggcac 71881 tgttttagct gatttccata tattttgata tgttgtgttt tcattttatt cagtcctatt 71941 tatttttaaa tttcctttga gactgctttt ttgactcatg tattattagg catgctgttt 72001 agtttccaag tgtttagata tttccctttt atttttctgt tattgatttc tagtttgatt 72061 ccatctggcc aaataacaca ttgtatataa ttccattatt ttaatttgtt aaattttttt 72121 tgtaacccag ggtatagtta tcttggtgaa tgttccgtag gaacttgaaa aaaaaaatgt 72181 gtattctgct gtgtttgggc ggaatgttct atatatgtca attagatcct actggttaat 72241 tgtgttaagc tatatccttg atgatttttt gtctagcaat tttatcaatt gctaagaatg 72301 gattgttgaa gtccccgagt attattgtag atttgtctat ttcgccattc agctctacag 72361 gttttgctca tgtattttgc agctccattc tttggtggtc ttctcagtgg tttgaccctt 72421 tatcattatg taatatctct ctttgtctct gataattttc tttgctctga agtctacttt 72481 atctgatatt aatattgcca cttctttctt ttgattaatg cttgtagagc atctttttct 72541 atccctttac tttcaaacta tcctgttata tttgaagtgc attgcttatg aacagcatat 72601 aattgagtca catatttaaa atctattctg ccaatctctt ttaatttgtg aatttagacc 72661 atttacattt aaagtaatta tggaatttct agagcttaag actgtcattt taatatttat 72721 tttatgttca ttctctgttt cttgctcctc tgatattttt tcttgccttc ctgtaggcta 72781 cctgaacaac ttttagagtt ccattgtatt tatttataac gcttttgatt atactacttc 72841 atatagtttt cttaatggct gctctaagtc ttactataca catacataaa ttatcacagc 72901 ctaatggtat tgacatttta ccaattgaga gaagtatgga aacattaatt ccatttaggt 72961 ccctttaatc tccccacttt tcaaatacaa ctgtcttagg tattttctcc aaaatcattg 73021 agctccacgt cagatagtgt aacaacactt ccttcaacta taaaatatga ttaaagaaac 73081 tcatgaagta aagaatagtc cattacactt ctatttttac caattccatt attcttcctt 73141 cctttctgaa ggtccatatc tccttttgtt aacattttct tgttctttag agaacttcct 73201 ttagccatta tttcagggta ggtttgatat tgacaaattc tcttagtttt ccttcacttg 73261 aaaatatctt gatttccccc ttcattctta aagaatggtt tcactggata cagaaattgt 73321 aattgacata tcttttcttt cattacctaa aaaatgtgcc atttccttct ggcctccatg 73381 atgtcatttg agaaaccctc tgtgattcaa actggtctcc tataggtaag gcatggttct 73441 ctctggctgc tttcaagatt tttttttctt cagttttcag atgtttaatt atgctgtatc 73501 tgactctcta cctcctactc catccttgca aatttatgtg gtgtggattc atattggtgt 73561 gaatttcctt gggtttatcc taacagagat tttctcagct tcttgaatct ttaggtttat 73621 gtcttttgac aactttgggg aattttcatc cattattact tcaaatactt tttcagtccc 73681 actctgtctc tcatgcacct ctcataatat taatattttt atttgtttca agagaacttg 73741 taaattcaac agaatttgta atagatcatt gaagcatttt tatgacatct gctttaaaat 73801 ccttgtcaga taattccagt gcgtggttca tctcagtatt ggtgtcactt ggttgtgttt 73861 ttttctggta tttggtatga caggtaatgt tttgttttgt ttcactttat acaatacttt 73921 tatctccttt tgtttttagt tgacatataa taattgtaca aatttatgag atacagagtg 73981 atattttaat acatgtatac aatgtgtaaa ggtcagatta tacactgtac attttgcttt 74041 tttatgttag aagacttgat tttatttaaa tcttctattt tagcagttag tttctcttct 74101 taggtttagt atgcaagttc tgacctacat atgtgggctt tagttccaat gaaagtttag 74161 tttgctgagc ctttggaagc catcatccta tgggcatctg ctctgggttt aggtcatctc 74221 tcaggattac aaaaacccct ggaaaaaaaa ctgattgtag tgtttcaagc tgctgctatg 74281 aagtcccaag ggtttctatt agaccagaat tttccaagac cgtagagacg aagctcaaaa 74341 ccactcttct gctactacag ctgccagcat tttggtaaat tatgttgact aaagaggatg 74401 tcttcacaat gtttgatgct gatgttttta tctcttctac ttaaaattga tctgtgttgc 74461 aaattatacc ctcctaaaga gtatgcatac aacttttttt tactaacatt ttcttacaga 74521 gtttgctctg aagcagtgtt ttctctatcc acttcatgct atgatgctag ttcttctgga 74581 atatttattt taaacagata catttcattc taaccaagaa ttttggagga taatctttct 74641 gattgactta aactaagaat tttaaaggtt agtctttctg atagactaaa gggagtaata 74701 ctatataact tttccatacc attctcattt tttaaaaatt actttgcttt ccatcataaa 74761 caaaatggtg ggtgcagttt tagataggat agtcagggaa gaccatttca aaagaaggta 74821 gcatttgagc tgagatgata caataagaat ctagctatat aaatatctgg gggaatagcc 74881 ttaaaggcaa aggaaacaga aatgcaaagg ctctagcagg aacatgttag gtgtgcttga 74941 ggaacaacaa ggaaatcagt gtggctgtag cagcatgagt aagggagcaa ctagtagaaa 75001 atgagatgag agaatggcag gagccagtcc tatagggctt tataggccac aatcaggatt 75061 ttagatttta ttctaaggta ttccacttag atagacagca agacattgga ggattttgaa 75121 aaaaaaaatt gaaagacatg atctgattta cattttagaa agatcactct ggctgtgata 75181 tagagaatgg attggaatgg ggccaaaaga aaaattagaa tagtgaggac actatttcag 75241 tggtccaagc aaggaatgac tagtctcttt ggaccaggct taaactagga ggagaatggt 75301 agaggtagtg agaggtggtt aaattttgat atacttttta gatatagccc attggaattt 75361 ttcagggttt atatggaagg gatgaagaaa aagttagcaa tcaagaatgt cctaggtttt 75421 tgcttgagca actattggag tgtgattcca ttttctgaga tggagaagcc tgtgggggga 75481 acagttttgc agggatatgg taggaggtag ggagtcaaat gttctgctct ggctatatta 75541 atttgagatg tctattagac gtctaaagga aatgattaag caggaggttg catatatgaa 75601 catggaactc aaataagggg tctagattga agattaaatc tttagagcca tcagcataca 75661 tagataatag ataaagccat aggattggat gaaataattt agagactata ggcaaatgag 75721 agggcccaag gttaaactct agatagggtt ttgccatgtt gtccaggctg gtctcaaact 75781 cctgggctca agtaatccac ccaagtcagc ctcctaaagt gctgagacta caggcgtgag 75841 ccaccacgcc cgaccatgat ctctcttcct cactggagta taagcttcat aagaacagaa 75901 tgcagatttt ctttgttcac tgcttatatt agtttgctac ggctgtcata acacaatacc 75961 acagcctggg tggcctaaac aacagaaatg tattgtttca ctattctgga gtctagaagt 76021 ctgagatcaa ggtgtcagca gggttggttc cttcttgaga ctctgaggaa gaatccgttc 76081 cgtgcctctc tcctagcttc tggtggtttg ttggtgatcc tcagcatttc tcagcttgta 76141 tatgcatcac cttgatctct gcctttgtct ttacatggtg ttctgtgtgt atgtatctct 76201 gcatttaagt tttttctttt tataagaaca tctgtcatat tgggtgaggg ctcaccctaa 76261 tgacctaatt ttaacttgat tatatcttta aggatcctat ttccaaataa ggtcacattc 76321 tgaaatactg tgatctagga catcaacata tctttttgcg ggacacaatt caacacacaa 76381 cactgctctg tctccagcat ctaacacaga ccctgcctcc gggcccctaa ttcccctagt 76441 gactccttga tgaatgaata tatgaatgga ctttagagct tctgtgaaat ggtaagcaat 76501 gttgtgaagg atgtgtgtgt gtatgtgtgt gtgcgtgtat gtgtgtgcat gtatctctct 76561 ggagagagag agagcccctc tagcctttca tcagattctc agtgatccat gatcctagaa 76621 gtgccaagac ccactagctt ataatgtaaa caccacatct ttgctgtgat atagaggccc 76681 ttcattatat agccctggtt tactctgcag cctggccttt taaactacac acacctgtct 76741 ccagtcacac tgaatcacct gtggttccct tcatggacca ttctcttgtt cacctctgag 76801 ccttttcata ttctggtatc acctctgcct agtctaggat tcattttcaa tcatctggca 76861 gattctcaat catctttcaa aaataagctc cagggtttgc ccctcaagga gaatttttct 76921 gacttctctg gagggtgata agcattttca cttctgggaa caatgttcac acatccacta 76981 aagcaactgc catagacaga tgggtctgtt tcttccactg gacactgaga tcctggaaga 77041 tagagaccaa ttcttactca cctctgcatc cctaggctag aagggtgtcg tcagtcacaa 77101 aacaagcctt taagagtgaa tgaaaaaaaa agagtggctg ccctggggaa gacagattct 77161 aacttaaaga atagcctagt tccaacaaca aaatctgttt cacagtgagc taaatagtgt 77221 gcagcttctg gacaccacat gactactcat gatggggata atctagaaag agcatatgca 77281 ttggatggaa ggtcgagctg catctggccc ccacgacctt ttccacatct gagtatttct 77341 gattccgcga cagcccgacc ttcctttttt ttttcttgtg actggaacct ggcttctgaa 77401 ctggagggtg tgtcaccctc gaactcccgc tgactaggcc aaggttagaa aaggtacttt 77461 ccagaagttc accatcccag gcagactcaa ggtgcctggc tttgccttct cagcatctct 77521 gacttcccat ctgaaaactg actaatactt gctaattgta tgttctaaga tcctttgatg 77581 aaaatgccag agtatgtaag tatcaagtat tgttatgaat aagaaaaaag gaatgaagtt 77641 tctgttgctt actccttgaa cgacattata gaaaatggat gtcttttgcc aaaccacagg 77701 gcatagctgc attgtagaaa agaaactggt aaaaatgtat tcaacccctc catcctttgg 77761 attcaacatt taaatatctg cttttctagg accctcctga ttacaaaaat gccaaggagt 77821 cttttggtgt cattgtggat tgtgctcagc agtcttccat gaagccctgc aatgaaacct 77881 cactgcagtg gtgatctagc ctggtgttct gtctacctag agcagcactc cccttctggg 77941 aaatgctttt ttctccactc caaccatgca tttgaaggat gcctgctatg ttcttctgga 78001 actctgttgc acacagtgac tggtaaagac atgggcacct gatccaagct gtgtcaatta 78061 aaatccttcc agggactttg aatacatggg catagaaaga gtgaatagaa agtcctctct 78121 ggaagcaact ctggggcagg ggggtatgag tacagaactg gccagctacc atgcatcagc 78181 ggtgtgggag aagactgaga gaatgaaact gacactcatg cagaggtggg agacagaaaa 78241 aagagggagg aagggtggga aaggagagag agagtgaaag agaaatagac agacagagag 78301 aaatcctgtt agttactcaa gtcctttgtt ctagttgttc ctgccttccc atggctcggt 78361 tacatgatca ccaataagcc tctttttttt tttttttttt tttttttttt tttttttttt 78421 tttgcctaag caaaattcca cctggctttc agtcactgga aaccacaaca atcctgacaa 78481 atatacacac agaacatgct attagctcca gaaagaaatg aagagcctta cagtctccaa 78541 gtattttcag ctcaagacag tttttctatc aaaatgactt ttctcaagaa tgtccactct 78601 tcctcccatg gcaatgacct aggaagaaaa ggaaagagaa tgggttgaca ttcctaagta 78661 cttgtgcatg aaacgggctt tacagatcat ctggtcaaac ttccattgta cagatgggga 78721 atggagccca gggatgatgt gcaactttgg gagcacatca tcccttcccc tccctgaact 78781 agaatcgagc tctggctcta ttactaactt tgtactaata atgtttgtac taatacaaaa 78841 atttgttact aagaacaaat attattagta atattactaa taattattac ttattactaa 78901 taagttagta atagagccag aactcaattc tagttctccc aatttattct gactcactag 78961 tttcctacag gcctccaatc tgaaattctg taaatgaaat tgcaatgcaa ttttttaaaa 79021 caccaaactt tattttattc ctggaataca catcactccc cacccccact cattctattt 79081 cttcaagttc atgaatatgt aaaatgtcaa gaattttaag ttatatatct gctggagggt 79141 tactgtggct gtgacactta aaattttagt tctgcaagct attttgattc atatccaatg 79201 cttgactctg acatgaaggt ctaatgaaaa atcaggcctc tggccttttt gttttcatct 79261 tgtagaacta ctgctacatt tggcctttta aaaatgatgt gaccagcttt cattttgaca 79321 attgcatcct caagacttcc agaaactgat gaaattgtca ttgctgtgaa attatcctgc 79381 aatcattgtg aatatatata tatagtgaat atatatatat ataatctttg tgaatatata 79441 tatatatata tgataagtta taatgtgact catgtccact catgtcattg tcaggccctt 79501 taatatttct tcagaaggaa atgtgtgcaa gtaagatatt attgcttatt atcaatcagt 79561 ccattgatta atcaaccaat caatcattga gtaccaagta ggtgcctagc actattctag 79621 ttgtttcagg aacaaaagaa atgtaaggcc ctgagatatt taaaagtata tgaatgcaga 79681 tatatttccc atcttaaaca tgcagcagct tccccatggt ggcttacttt tgttacaata 79741 ggaagttata tggtagagtg gaaagaacat ggactttgga gtcaggctgt cctgagtttg 79801 aatgtgtatt actaaaactg tgcaaacctg gagaagtcac agcccaccac tgcccctcac 79861 tctcctcatt tgttaagttg gaataataat aataatagca aaagtatcta cctttatcta 79921 cctttcaggg ttgttctgat aattaaatgt tatcatttac ctggcataga taggtgccca 79981 ataaatgcta actagtattg taatgaatat gatatatgtt taggtccagt tgttcaagag 80041 agtatggtaa ctcaggaagg tttagcaagc ttagtaggca gcagtatggt gtaggaatcc 80101 agagcaaggg ctccaaaatc cgacagaatt tgagttgaat cccagctgta ccactttaac 80161 ttgttgggtg accttggaca gattgctaaa cttctcaagt ctcagtcttt tcatctctca 80221 agtggggata acggaatcta ccacagagaa ctgttggcac aacttaatga gatgatgact 80281 ggaaagaact agtgtaccac ctggcacaaa acactatttg gtcacgtgtg aagctgttaa 80341 catctgtgct ataaaacgcc atggtcactt tataaattac tgttattgag acatcctttt 80401 gctttccagt attcacaggt aagaaaaact gagccattct ttatttccat ctgtttttgt 80461 catttaaaaa tagcattcat agcagagacc taaggattac ttttaagaag cacttaaaga 80521 ttttcaagat agtgcatcaa attgcaggaa gtctatatta tttttcagag tagagttgta 80581 gatcttcagc agttactcct ccagatacta atgtactaag taatctactg tttgaggggt 80641 tttcagagaa aaggtacttt gtaaatatat aaaattgctt acactgaaat aaaataaatc 80701 ctttaggtac tctgagatta tgtacactaa tattttatat atacgtccaa ctgctaatat 80761 gattctaaag gaaaggaaag aaaatacaag atcaagagtg cacttttcaa atgcattttg 80821 tagaaaacaa agactaaaaa tttttactgt attaaagcac tgcattaaga attctcaagt 80881 tcatgtcctt tgtagggaca tggatgaaat tggaaatcat cattctcagt aaactatcgc 80941 aagaacaaaa aaccaaacac tgcatattct cactcatagg tgggaattga acaatgagaa 81001 cacatggaca caggaagggg aatatcacac tctggggact gttgtggggt gggaggaggc 81061 gggagggata gcattgggag atatacctaa tgctagatga cgagttagtg ggtgcagcgc 81121 accagcatgg cacatgtata catatgtaac taacctgcac aatgtgcaca tgtaccctaa 81181 aacttaaagt ataataaaaa aaaaagaatt ctcaagttaa gtttctgaaa ataaacactt 81241 tttttgaaat ggaataaaaa gattgcactg gaagcagagt catggaggaa acagaagaaa 81301 gctaacattc attgaccatc agtcatatta cttgatctca tttaaaccat gcaacaatcc 81361 catgaggtgg gtggtattat ccccatttaa cagaagagaa aactgaggcc cggagaggtg 81421 aagtaacttg tctgagagca cacagcaagt ccgttatcaa accgaggcct atttgactcc 81481 aagctccatg ttcattccag atctatccct tgtgctgaca ccaaaaatct gagtactcct 81541 gaatatgtca tttaccttat tcatgtcttg tatctgttca gtggcatttg gctgcaagta 81601 gtagagtaac tcgctaacac cagtttacac cataaataca tctaattttc tcacataata 81661 aaaagtttag aaagatgtgg ttacagggaa gggaccacag gtcagatccc cagttctatc 81721 tttccactat gctgccttcc atgggcagga gatgtctctc ttagtctaaa atgtacttag 81781 cagctccaag catcacacta tcacctgaca acctccaaaa ataggaagtg aatgtaggga 81841 aaatacttta tatatgcctc tctctttgat cagagaggga aatctttccc agaagtcccc 81901 cagtgaactt ccccttgttt cattgaccca acctaggcca cacagacacc tataaatacg 81961 gcaagaggaa tgggatgacc atgcttggct tggaccagtg atgataatga ctagggacac 82021 cttccttgag atcaagagct cttgcctaga aagcctgaca caaacagggt tctgctatca 82081 agtaggacat aaagacagga aggccaatga gcaggcaccg aacagtgttt gccccagcca 82141 tccatttctc tatcaacaag taaacacgtt gctctgggcc acctcttagt gagattgagg 82201 agaagactct agctgcagtt aaaaaacaaa gcaaagcaaa agcctgagac atataaatac 82261 acataatcaa cacatgcagt gttagctcaa actcccaaat agcactcctt ccccatctct 82321 caaactacac ttgtcttttt gtaaattctt ttctccatca aagctgtcct tgacattcct 82381 tttaaactaa tgtttataga tggaggttct aggcatggga ggagatagat aggtccttgt 82441 ccctctctgg tctccctcct ctgtccacct caccattgta acatctgatc tcaatcactc 82501 tggattgatg aaattatttt gtcccatgct attcatagaa agaaataaga tacaggggaa 82561 atgatagtct gctgcagtaa gataacagat tcagaatatt tttttctatt tcatttctaa 82621 gataattata tttaacctca aatctatcac aatcagaaat ttttcgacca cattttgtat 82681 tgtgctaaaa aaaatcaaca gtagtcatat tttttaaagc agctgtcaga tttaatcttg 82741 taacagttta ttgactcgaa acaccttaaa taatggtttg gggatctgta ctgtagcaag 82801 aatcttcatt tcttattttg gattcaatat tacaacatca gccttctacc ttaaggaaat 82861 gtagaaaccg aatgcttttc aaaatgtttc ataacctttc taaaattgca gcactttatt 82921 aaaatggcat ttaaactgtt tttaaaatgt accttggatt cttgataaca ctattatttt 82981 gataacacat tgatttattt tcctatctgt ataggaaatg ctgtgggttt ttttccacaa 83041 atttttccaa aagacttatg gcaggccatc tttatctctc cagtgcagtg actcaatcat 83101 catttcaact ttttcatctt ttcttttaaa atgcttattg aatcagggaa aaaattgagc 83161 acctaactct tctgtacaag tggcaactcc atttatctcc tcaaaaaaac aaaaatgaag 83221 tggagaaaaa gtaattatct ttggaagaca atcagtactc actgttgact gccaatctag 83281 aatacctcac ctcaacttgg tctcaacagc cttgggggag gaaaggcaaa aaggcactca 83341 aattatcaga actaattgcc tcctatttgg ccttgatttt gccttcattt atgcatttag 83401 tcatctgctc actcactaaa tgtgcccctc ttatgtgcca ggcactgact gttcaggcac 83461 taaggagata acaatgcact agacaaagat ctctgccctc aaagagctta tgcacactgt 83521 gtattgggaa aaactgacat cattcaatga aaacacagag agtaaaagca ttaaaaaact 83581 ttaaagtgat cccagagtgg ttcaacatcc ttaatggatt gatgaaatct ctgggaccta 83641 gaggacctaa agtcatacag ctggtccatg gggctggata cagaacctat atctcccaaa 83701 taatcattgc taatgtttac taagtgcact gtgctttaca tgtataggtt gaaccataca 83761 aaattgctgt aatttgattg gttttcacct acagaaatga gaaattcaga aagttcagcc 83821 tatttttatc acattaaatg ctcacaattc tttaaagtca ataatacatt atctccaatt 83881 gacatgtgga gaaaattagg cagagagagg ttaagtgaca tatccatttg atatgactgg 83941 gttttgaact tggatctttt tgatgctagg gccagagctt ttggctctat actatgctgc 84001 ctcctgagtt agtggctgtt actttccact gtgctgtctc tcaccccagg aatcatacaa 84061 tcagtgaaac atttaaaagc agaaatctga agaagagcaa atgagtgtcc atataatgga 84121 tttcattaga acagagcaaa gccattgatt tcatcttcca aatggaaatc tccaaaaaga 84181 gactggtgtt tctggtaaga agtacatcaa aggcctttgt cctaggtgaa ggaggggtaa 84241 tgagaaatca aaatcatcat gatggaaatt tgaactcctt cgatggtctg gagcaccacc 84301 ttctttctgt tcaggattat ttgtttagtt tgcttcatga caatagaaaa gcctctcccc 84361 ttatgaatct caaagagaac gctgaatgac atgagggcaa ccgtcaaaat ccttaggttt 84421 gcttgggttc tggtgtagag gcaaaattta gttgcttacc agtaattttt ttttagaact 84481 aagcatgtct aattggttct tttcaaattt accaatcaga tgaatcactt tacttcccca 84541 gggggccaaa gaagagaaga acttactatt aattcctggg ggaaaaaaat gcactgacaa 84601 atatttcaaa tccatatcat gcctgttaac agacaagtta ttggggtagg agtcatgaag 84661 aaattagcca aagacaagaa ggccttcaga agatgtgttt gttgttgatg atttgtaatt 84721 gtttatttca ttgaatgaaa agatttttct ttcttcgaca caaatctgta tagcctgtcc 84781 ccgttctgca caccctaccc agcctgccct gtcaagcaac tagcaaaact caaaaacgag 84841 ccccaggcct cactcagatt gacctgcagt gtcccctgaa gtacaggcat tataatgaac 84901 ataatgatgg cagttaaggg aagagtgtgg cttgttccat tccttttgat tagacataaa 84961 acagaaacat atgtttttac tacccttctc atctttggca gtatttttct gtcatatgtt 85021 gtatagggac aatgtagctt catgaacaac tttttcttac aaaatgtgag cttgcaaaac 85081 aatgcaaatg tctatcgata gagaaatgga taagtaagtt acagtgcatc cttcaacaga 85141 tgcgtataca tcctgtgaaa agcaggaagt agatctgtat atatccacat gggaagatct 85201 ttgaaatatg atattaagtg aaaaaatggg acacagaaca atatgtgtat tacaatccca 85261 tttgcgttaa aaaaaaagga aagaaaacta aagatacata tatatgagca aataaagtct 85321 ataagaaaac acaagaagct attgataggc caggtgtggt ggctcacgcc agtaatccca 85381 acactttggg aggcccaggc gggcagatcc ctcgagctta ggagttcgag accagtctga 85441 gcaacatggc aaaaacccat ctctacaaaa tacaaaaaaa aagaagaaga agtattgaca 85501 gtgggtgtga cttctgggtg gcgtaagggg acagaggtgt tttcatttgg tcctgtactg 85561 tttgaggagg tgtgtttgag gaggtgtttt gtttgtttgt tgtttgttca ggattttgtt 85621 ttgttttgtt ttgttttgtt ttgttttgca atgactatga tacttttgta aatttaaaag 85681 acattttaaa gagttaatga taaaatctgt atttataata atatgcaagt ggaaaaaaga 85741 gaaagagagc tgaagtgagc cagaaatttg tatttttaag aagtatccca gacaattctg 85801 atgcaggtgc tccaagaatt atacatatat tttaaattta tatttcactt aaaactttta 85861 acattttgac acgtttcttc tatctctctg gatacatgtg gtgtgtgtgt gtgtaaattc 85921 tggaacactg aaaggttgca gacatttgtc atttgttgac agaaagagtc aaactctgta 85981 aaatatttga agagatttat tctgagccaa atatgagtga ccgtggcctg tgacacatcc 86041 ttcaggaggt cctgagaaca tctgcccagg gtggttgggg aacagcttag ttttatacat 86101 tttagagagg catgagacat caatcaaata tttaagaaat atattggttt ggtccaggaa 86161 ggtgggacaa ctcaaagtag ggcagggggt ggaggggggc gttctaggct acaggtgaat 86221 ctaaacattt tctggttgac aattaattgg ttgagtttgt ctaaagacct gggattcata 86281 gaaagggaat gttcaggtta aggtaaagat tgtggagacc aaagttcttt tcaagtctta 86341 tagtggctgc ccttagagac aacagatgac aaatgtttcc tgttcagatc ttaatctctt 86401 ttggattggg aggatctgga agaaaaatat ctagctatgt taatagagat tctttacaga 86461 tgtaaatttt cccccacaaa gaatggcttt gcagggccat ttcaaaatat ggcaaagaaa 86521 cttgtttcgg ggtaaaatat ttttattttc ttccttgtct cgtaatgtta tgccagagtc 86581 aggttggaaa gtcacaatat atagggttaa ataaaaccca tctgatgaga atttattatt 86641 tgtaggccat gactccccag gccccttaga taggaatttg ggtaagatta aaaaatcaga 86701 gtttagtcct catattttaa cagcacttca accctaaata cttcagcatg tatcttctaa 86761 gaaccataag cacaatgtca ctatcatacg aagacattta acatcaaatg agtaatgtta 86821 aaaacagcca aaattcatat ttctcaaatt gtctcacaat accttcaata aattgtttac 86881 tgtgattttt tttttttttt ttgaaactgg gtctccctct gtcacccagg ctggagtgca 86941 gtggcacgag ctcggctcac tgcaacttcc gcctcctggg ttcaagagat tctcctgcct 87001 cagcttcctg agtagctggg attacaggca cgagccacca tatcaggcta aatattttga 87061 gtgagtgatt gactgtatca tatatagtta tataaacaca tagaagaaga tatgtaacca 87121 cactgaacta gtcatagtgg gtacctctgg ggagtggaat tatggaagga gaactttcac 87181 tttttacttc ataggcttct ctactagctg aaagtttata gcaaacatat ctcatctttg 87241 taattagaaa aataccaaaa tagggattat taaaagtaaa agagcttgaa agagccaaga 87301 agctgcttag cagaacatat caggcagcag gtggcaggat tgcctcaact tctgcttccc 87361 tctgccaggc tagggtgcac cagagcccaa gtacatgctg gcgtggcacc ggcagaatgc 87421 caggaagagt ggtgtgaata tggggcaaag ggctgcaatt aacatatagc gccgtgtggc 87481 ccctggccgt cagtggctgc ggacttactt cacatttgta aaatcttctt tgtggcttcc 87541 tgcatgcttt gattcatggg attctacggc attgccctgc ctttgagaac cccaaggaga 87601 ttcctacagc taacttccgg acagtcaacc aggcattcaa atttaccagt gtcagttggc 87661 tctagaatga cctctgcttc aggttgctaa gaatttccca ggtactggct gaggagtttg 87721 gggttattgt tttgaagcct tcataggttc atgactgtag tctgtgtaca tctctataaa 87781 tgcatatttg aagaagaaaa atctagaaaa atgtgttcat tagacacttt tccctcaccc 87841 atttcctcaa ggttcttctc attctataac agctgccctc accagaggca tttctaagca 87901 ggatggtttt gtccttgcca acataagcca atggttagat gaacgatacc gtgatatgta 87961 gacgaggcag aaggcagaag ggcaaggcag tgctcttgct gccagccaag tctgactatc 88021 ttttgtgcca ctctgcttcc aggatctgct gcaaagctca ccttgggtta agattttgct 88081 gtaagatccc cttccctgcc aaaacctgcc acactgagac atgcacacgt gtgcacacac 88141 atgttgatgc acacacacac atctagctct ggccaggaag ttcataagaa gcctttgccc 88201 ctctttattt tacccaggca cctcttcctt cacccactcc atgctgccag ctggcctcag 88261 tgctgcttgg ccaatgtcca tcatcctgct gaacttccac tgttgcaact tgcctccctc 88321 tgcactccaa actcctgaat ctatacccat tctgcatagg cttcagtcag ggctcctttg 88381 gactcaagga acagagaccc attcgagctg gttcaacaac aagctgtggg tgttggtggg 88441 gcaggaaggt ggggcatagg tcattgtaag gagaccaaga tctcatggta ctcccaagac 88501 agccaccata gtgggacctg gaacaggtag caaggcagct acaatttctg gttctctctc 88561 cagaggccca tgatatttct cttggtgtct ctgctcctca cagtatacct gctccattct 88621 cctctttcta ctgactgatc tcctcagaac ttttgtccat acatggcccc tcacggttga 88681 ccgagctcca tgtctgtatt atagcctttc agtttcagta tattagtctt tcagactccc 88741 agactcttgg tcttccaata ccataccccc aagagagaaa tccgatttcc caagctccag 88801 accacagagg ttctacatag agcatgaacc tgcctaaaag ggggtcatca tcagatcaga 88861 tgatcaccaa tcctgctgca atcagacaag actgggagcg ggggcagttt ttatagacat 88921 ttagcgtccc aggagtgatg agcagatccc ctatgggtgg gcagggaaat tataatcatc 88981 tctagtacag taggcctgga aactggtctg gtgaaaatct agacccaagg gctctgtcta 89041 cctaccagta tttgaaaatc ctgcgtgata ttcatgtgtc ccatctagtc ttagaaactc 89101 ccagaatttt cagctcctaa gccccatcag gatgatgcct ttcctagccc ctgttcactg 89161 gtccccattg tttccattta agtttctttg ttgaaacaat atgcagtcta ttcatacaat 89221 ggaatattat tcagtcataa aaaggaaaga agtactgata catgctacag catggatgaa 89281 ccttgaaaat attatgctat gcaaaagcag caagacgtaa aataccacat actatattat 89341 ttattttatt tatttattta tttgcttatt tatttattta tttatttatt tattgagatg 89401 aaatttcact ctgttactca agctggagtg cagtggcaca atctcagctc actgcaacct 89461 ccacccttta ggttcaagcg agtctcatgc ctcaccctcc caagtagctg ggactacagg 89521 tgcatgacac tacgcctggc tggctggcta atttttgtat tttgtgtata tatatatata 89581 tatattagta aggacggggt ttcactatgt tggccaggtt ggtcttgaac ttctgacctc 89641 aagtgatcca cccacctcag cctcccaaag tgctgagatt agaggcatga accactgttc 89701 ctatacatat gatttcattt atatgaaagt ccagaataga gaaacataaa gagacaggaa 89761 gaagattagc agttgcttaa gctgggaggc ttggcgatag aatgagatag ccagaggata 89821 cacagtttct tttgaggtca tggaaatttt ctaaagttgg ctgtggtgat gattgcacag 89881 atctgtgtgt atataaaaat tcacatctct caaaatttca gaggcagaac tgcaaaaaga 89941 ctttcctagg actactgaga aatgtttaag taaaagaaaa atattagaca ctctctagct 90001 tccaataagc tttgcctggg cataagataa ctcatgaaat agaaagaggt acaaatccca 90061 gcaaaagata gtcaagcatc gtagtgaaat tccatgccag agctcacgat tctcccacac 90121 tagtaggaag aatgacagtg taaggggata tatgcatgga tgatggcatc agattttgcc 90181 aaggagttgg tggagcctaa cagagcaatt gcttttgcct gcgggaagta tcatatttag 90241 gtcaaaacag ccagtgttag gcaaggggtg gcaggaaggg cataactatt ttggaaggca 90301 gagcatggct caacttgttt tactgactga gtgattttcc atgtacagtg gtcatccctt 90361 atctgcagtt tcgctttccg cggtttccat tacctgaggt acagtacaat aagatattct 90421 gagacagcca gacacggtgg ctcatgcctg tcatcccagc attttgggag gctgaggcgg 90481 gtggatctcc tgaggtcagg agtttgaaac tagcctggcc aacatggtga aatcccgtct 90541 ctactaaaaa tacaaaaatt agccagacgt ggtggcacat gcctgtaatc ccagctactc 90601 gggaggctga ggcaggagaa tagctggaac ctgggagtcg gaggttgtag tgacccaaga 90661 tcgtgccact gcactccagc ctgggtgaca aagcgagaca ccgtcaaaaa aaaaaaaaca 90721 acaagcaaac aaaaaaagat attctgagag agaaagatca cattcatata acttttatta 90781 cagtatattg ttataattgt tctattctct tgttagttat tgttgttagt ctcttactat 90841 gcctaattta taaattaaac tttatcatag gtatgtatat atagaaaaaa acatagtgta 90901 tatggtactg tctgcagatt caggcatcca ccaggagtct tggaacgtat cccccatgga 90961 taaggcggag ggacactact gcaacagcag catctgtaga gcttcattca ggatagctat 91021 tatctcatag catttataaa ggtaagcaga gcagtgtttc tacaactaat agcataaaag 91081 aaaaacaggg aaaagacctg tacagagggg gtgtgtcatg caagaaaaaa aaactagagg 91141 gcaaactttt tcaagagtga ctcccaggaa gccttattta tttatgagat ggagactcgc 91201 tctgtcgccc aagctggagt gcagtgacac aatctcgact cactgcaact tccgcctacc 91261 aggttcaagc aattatcctg cctcagcctc ccaagtagct gagattacag gcgcccacca 91321 ccatgcctgg ctaatttttg tatttttagt agtgacgggg gtttcaccat gttggtcagg 91381 ctggtctcaa acgcctgacc acaggtgatc tgcctgcctc ggcctccgaa agtgctagga 91441 ttacaggcat gagccccagc gcctggcaag cctaatttat attactcagc tctcagttta 91501 aaagttcggc atgtctgtta ggagttttcg aatttattca gccattaaat actttaaaat 91561 tataatctat tgcgaacaca ccagcaggta cccatggata gcacatgcag atagcagaaa 91621 gcaaaacata gttttcacaa ttagttgttg cttacaaata tgaccaggat gtgattcata 91681 tgctgattta gtctttatta cagaattgtg tgtgtgtgtg tgttgggggg cgggggggga 91741 ggggaagggt ctcactctgt cacccaggct ggagtgcaga ggtgcaatct ccgctactgc 91801 aacctctgcc tcctgggttc aaccaattct cctgcctcag cctccaaagt agctgggatt 91861 acaggtgtgc gccaccatgc cctgctaatt tttgtatttt tgatagagac agggtttcac 91921 cacgttggcc aggctggtgt cgaactcctg gcctcaagga atctgcctgc tttggcctcc 91981 caaagtgctg ggattacagg catgagccac tgtgtctggc caagaatttt taaatagaga 92041 aattaaatca agttaagtgt atcaaaacat gatgttgagt accttaaaca tacacaataa 92101 aaattaataa taaatcttgc ctacttccat tgggcaactt cttgttgcac ctaaaaataa 92161 gaaattagat cactttaata ctaaaaactg caaaatataa caagatagag caatatatat 92221 ttacattttt atatgaaata tgacacagaa cacaaaacat ttacacacga gaacagtgaa 92281 ctgttccttc tcaaaagata tcccataacg cccatttaat attaagaatg aagttccatt 92341 atccttatgt caggtatagt attacgttta ttcctcagtt tgctttttct tgttttacaa 92401 aggtctgcgt agcaaaagga agaacacaaa atgcatattt tgcagcattt ccaactctta 92461 gattagactg ctcccaaact catgaattat ttcataagct gttgaaatag ctgcttcttt 92521 ccaattttta gaaagtgggg ctgggatggg ccatccctac ttagaagtat tctggttttt 92581 tgtttgtgtg atacagacac agtgtgagag acataaatct ctgcttctgg tgttaaagtt 92641 ccttctttcc gtcacaggct catggagaat gaacagatga aaaacacaca tttaatagac 92701 aagacacaat gcattatatg acaaatacat gtaactggaa tgatagccac tgcctgtccc 92761 ctcatctctc tatgtcaggg ttgctaggac agaaaaatcc tcattcagcc tccattccca 92821 gagattgcag gagagagcca gtccttggaa tttgacctgg cctccaaaat aaagaaaatg 92881 ctttccattt cttctgctag tttttggggt tggtgactct gggataggcc tactttttat 92941 gcctctggaa tgctcttctc ccttgcccct ttgggcacag tctgaaaggg gaagcagagc 93001 tcgagagggc tagctgggca ggttttgtga agctttggaa aaaatttctc tcctttccag 93061 tatgaatcag actggaccat ccttgataac taatggggac ccatgagagg tattaatcca 93121 ggagaagcag atcatctcct aagggtatga gctctcgagt caggctgcct aggttcaaat 93181 ctaccattta ctaactgtgt gaccttgagc aatcttgagg ttactagctg tgtaagctcc 93241 ctgagcctca attccatcat ctgtaaaatg ttgataaaaa tagtccttac atgatggagt 93301 cattgtgaca attaaattgc tacatctcat tgattctaag gcatcaccaa ttataagata 93361 aaccattatt ttatgtacca ctaagaaaga aaataacaag ttgctcctta aactctatat 93421 gccactgatt ttgagacaca tcctggtttc agaaatgtta aaacgtgggc caggcgtggt 93481 gactcacacc tgtaatgcca gcactttggg aggacgaggc aggcggatca cttgaggtta 93541 ggagttcgag accagcctgg ccaacatggt gaaactctgt ctctactaat aatacaaaaa 93601 ttattttgtg gtgtggtgtg gtgggtgtgg tggtgtgtgc ctgtaatccc agctacctgg 93661 gaggctgagc catgagaatc gcttgaactt ggaaggtgga ggttgccatg agccgagatc 93721 gtgccactgc actccagcct gagagacaga gtgagactcc ctctcaaaaa aaaaaagtta 93781 aaatgtgaaa tagatgcatt tatttcagaa tcaatgagat acagtagtta atactgatca 93841 agtacttaga gcagtgcctg gcatatagca agtgctcaaa ggttgctatg atgcagtgtg 93901 atttccctaa agcaagcttg tccaacccat ggcctacagg ccgcatgtgg cccaggacag 93961 ctttgaatgt ggcccaacgc aaattcataa actttcttaa aacattatga gattttgttt 94021 cataattttt ttaaaagttt atcagctatc gttagtgtta gtgtatttta tgtatggccc 94081 aagacaactt ttcttcctcc aatgtggaag atgggacact cctgccctaa aggaaggtag 94141 taattcccac cttgaaaatg gaggccttat tgagaaggag ggccccacag ccttcaacag 94201 gggagagcaa ggctaggaac tgggcatgct cacaaaggac cttggtgtaa ggataaattc 94261 tttcatgttt ttctgttgca caacacaaag gcaaaagcca ttttgcacgt acagtgttgc 94321 acccttccct ccctcattgt aattctctta gttaccctta ctcccaaccc ttcggtacag 94381 tccagtgctc ctacagaact ctgtggcggg cttgtgggga ttactagagt gagcagtaag 94441 ggaagaaggt cctgcaccac atgtgatgtg gggctgaggt ctaggcaggg gagtggcagc 94501 acatcaggaa aggccaccgt tcaggagatc catgtggtga caagcacagc tcagaggtga 94561 gcagtggcct gcaaggaaaa atcactgtga gtggcctgga gggatctaaa ctcagagaag 94621 tggctgccca gtgaccatct gactgaggag taagtgtgag aggacaaaat ctctttcccc 94681 aagattttag gggaaaagtg tccattgggg tggcagaagt gacacgtcct taggtcacat 94741 agatgataac attataagac aatatgcaca atctagacca gcagtcatta aatcagatca 94801 ctatctttat tttctcactt ttaaaaagca tgtttcagcc aggcatggag gcatgatcat 94861 gcctgtaatc ccagcacttt gggaggccaa ggcgggtgga tcacttgagg tcaggaattc 94921 gagaccagcc tggccaacat ggtgcaaccc tgtctctact aaaaatacga aaaaaaaaaa 94981 ccaaaaaaaa aaacagctgg gtggggtggc atgtgcctgt actcccaact actcaggagg 95041 ctgatgcatg agaattgctt gaactcaaga ggcagaggtt gcagtgagcc gagatcatgc 95101 cactgcactc cagcctgggc aacagaacga gactctgtta caaataaata aataaacaca 95161 tttttttcac atcttatcat tactgaaatc agactgtata attatgatga cagtggtggt 95221 gatgatgatg atgatgatga tgataatagt taacacttag gtacaactga atatatgcca 95281 ggcactattc taagtgcgtt atctgtatta gctcatttac aaccctataa gtatagttat 95341 tatactattt ttgacagttg gggaaactga ggcacagaaa tgttaagttc tccgaggtat 95401 cacaccccca ctccctcaca tacccacaca cacccttatt aggtggcaga ggtagacaga 95461 gtcaatgacc ctaaccatga cacccagcta atttttgtat ttttagtaga gacagggttt 95521 tgccatattg gccaggttgg tctcaaactc ctggcctgaa gtgatccacc cacctctgcc 95581 tcccaaagtg ctgggattac aggtgtgagc caccgtgcct ggcttcaaaa gctgttttga 95641 aaccaattat gcctcttcag atacccgatg gtatcttata attgaggaaa tccaaatagt 95701 ggttaaatgg tacctatggt ccaattccag gcccaccttc aagcattgtg gaactctgag 95761 atacagtata gtgtggtggc aaataacaca ggtggtggcc tctctctagt ctgtgttcaa 95821 gttccagcta tgccacttac aagctgatta agctt // LOCUS HSAC002073 128978 bp DNA PRI 12-MAY-1997 DEFINITION Human PAC clone DJ515N1 from 22q11.2-q22, complete sequence. ACCESSION AC002073 NID g2078469 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 128978) AUTHORS Du,Z, Scheet,P and Harper,M. TITLE The sequence of H. sapiens PAC clone DJ515N1 JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 128978) AUTHORS Waterston,R. TITLE Direct Submission JOURNAL Submitted (12-MAY-1997) COMMENT SUBMITTED BY: Genome Sequencing Center Department of Genetics Washington University St. Louis MO 63108, USA http://genome.wustl.edu/gsc mailto:sapiens@watson.wustl.edu NOTICE: This sequence may not represent the entire insert of this clone. It may be shorter because we only sequence overlapping clone sections once, or longer because we provide a small overlap between neighboring data submissions. This sequence was finished as follows unless otherwise noted: all regions were double stranded or sequenced with an alternate chemistry; an attempt was made to resolve all sequencing problems, such as compressions and repeats; all regions were covered by sequence from more than one subclone; and the assembly was confirmed by restriction digest. MAPPING INFORMATION: This sequence was generated from part of bacterial clone contigs of human chromosome 22, constructed by the Sanger Centre chromosome 22 mapping group. Further information can be found at http://www.sanger.ac.uk/HGP/Chr22/ SOURCE INFORMATION: This clone was derived from human PAC library RPCI-3 prepared by Pieter de Jong and coworkers at Roswell Park Cancer Institute, using the method described by Ioannou et al., Nature Genetics 6:84-9 (1994). The library is from one male donor. For further details, see http://bacpac.med.buffalo.edu/ The clone is available from Genome Systems, Inc. (http://www.genomesystems.com). VECTOR: pCYPAC2 NEIGHBORING SEQUENCE INFORMATION: The clone sequenced to the left is H_DJ400N23; the clone sequenced to the right is H_DJ412A9. Actual start of this clone is at base position 1 of H_DJ515N1. This clone contains STS WI-12936 (NID:g1344756) and A006I21 (NID:g1341182). FEATURES Location/Qualifiers source 1..128978 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="22" /clone="DJ515N1" /clone_lib="RPCI-3" /map="22q11.2-q22" repeat_region 7..131 /rpt_family="L1" repeat_region 223..526 /rpt_family="ALU" repeat_region 528..944 /rpt_family="L1" repeat_region 989..1266 /rpt_family="ALU" repeat_region 1485..1576 /rpt_family="L1" repeat_region complement(1599..1626) /rpt_family="MER" repeat_region 1620..1939 /rpt_family="MER" repeat_region complement(1959..2201) /rpt_family="MER" repeat_region complement(2813..3101) /rpt_family="ALU" repeat_region complement(3383..3639) /rpt_family="ALU" repeat_region 5560..5648 /rpt_family="ALU" repeat_region complement(6859..7148) /rpt_family="ALU" repeat_region 8165..8447 /rpt_family="ALU" repeat_region complement(8453..8742) /rpt_family="ALU" repeat_region 9147..9438 /rpt_family="ALU" repeat_region 11744..12034 /rpt_family="ALU" repeat_region complement(12037..12313) /rpt_family="ALU" repeat_region 12643..12925 /rpt_family="ALU" repeat_region 13532..13693 /rpt_family="ALU" repeat_region 13889..14009 /rpt_family="ALU" misc_feature 14446..14613 /note="match to human EST AA122029 (NID:g1678048) zm25f10.r1" misc_feature 14452..14613 /note="match to human EST D31562 (NID:g644442)" gene 14543..23803 /gene="WUGSC:DJ515N1.2" CDS join(14543..14612,15591..15707,15803..15922,17188..17388, 17494..17572,23599..23803) /gene="WUGSC:DJ515N1.2" /note="Putative gene. Genscan predictions confirmed by EST splicing.; coded for by human cDNAs AA122029 (NID:g1678048), D31562 (NID:g644442), AA158721 (NID:g1733515), R59640 (NID:g830335) and F13082 (NID:g709111)" /codon_start=1 /db_xref="PID:g2078470" /translation="MLLAWVQAFLVSNMLLAEAYGSGGCFWDNGHLYREDQTSPAPGL RCLNWLDAQSGLASAPVSGAGNHSYCRNPDEDPRGPWCYVSGEAGVPEKRPCEDLRCP ETTSQALPAFTTEIQEASEGPGADEVQVFAPANALPARSEAAAVQPVIGISQRVRMNS KEKKDLGTLGYVLGITMMVIIIAIGAGIILGYSYKRGKDLKEQHDQKVCEREMQRITL PLSAFTNPTCEIVDEKTVVVHTSQTPVDPQEGTTPLMGQAGTPGA" misc_feature 14544..14613 /gene="WUGSC:DJ515N1.2" /note="match to human EST AA158721 (NID:g1733515) zo78g11.r1" misc_feature 15589..15641 /gene="WUGSC:DJ515N1.2" /note="match to human EST D31562 (NID:g644442)" misc_feature 15589..15708 /gene="WUGSC:DJ515N1.2" /note="match to human EST AA158721 (NID:g1733515) zo78g11.r1" misc_feature 15589..15706 /gene="WUGSC:DJ515N1.2" /note="match to human EST AA122029 (NID:g1678048) zm25f10.r1" repeat_region complement(16489..16778) /rpt_family="ALU" repeat_region complement(18503..18758) /rpt_family="ALU" repeat_region 19258..19548 /rpt_family="ALU" repeat_region 19569..19858 /rpt_family="ALU" repeat_region complement(20735..20996) /rpt_family="ALU" repeat_region 22684..22975 /rpt_family="ALU" repeat_region 23163..23289 /rpt_family="ALU" misc_feature 23942..24240 /note="match to human EST Z44818 (NID:g573984)" repeat_region complement(25449..25740) /rpt_family="ALU" gene complement(28446..58170) /gene="LIMK-2" gene complement(28446..94478) /gene="LIMK-2" CDS complement(join(28446..28590,31579..31736,33380..33435, 34183..34357,35686..35751,38687..38743,38980..39111, 39817..39903,40755..40941,44096..44292,44648..44753, 46810..46998,47657..47766,48461..48596,81068..81167, 94463..94478)) /gene="LIMK-2" /note="DJ515N1.1b; match to D45906 (NID:g1136921); coded for by human cDNA D45906 (NID:g1805593)" /codon_start=1 /product="Lim Kinase" /db_xref="PID:g2078472" /translation="MSALAGEDVWRCPGCGDHIAPSQIWYRTVNETWHGSCFRCSECQ DSLTNWYYEKDGKLYCPKDYWGKFGEFCHGCSLLMTGPFMVAGEFKYHPECFACMSCK VIIEDGDAYALVQHATLYCGKCHNEVVLAPMFERLSTESVQEQLPYSVTLISMPATTE GRRGFSVSVESACSNYATTVQVKEVNRMHISPNNRNAIHPGDRILEINGTPVRTLRVE EVEDAISQTSQTLQLLIEHDPVSQRLDQLRLEARLAPHMQNAGHPHALSTLDTKENLE GTLRRRSLRRSNSISKSPGPSSPKEPLLFSRDISRSESLRCSSSYSQQIFRPCDLIHG EVLGKGFFGQAIKVTHKATGKVMVMKELIRCDEETQKTFLTEVKVMRSLDHPNVLKFI GVLYKDKKLNLLTEYIEGGTLKDFLRSMDPFPWQQKVRFAKGIASGMAYLHSMCIIHR DLNSHNCLIKLDKTVVVADFGLSRLIVEERKRAPMEKATTKKRTLRKNDRKKRYTVVG NPYWMAPEMLNGKSYDETVDIFSFGIVLCEIIGQVYADPDCLPRTLDFGLNVKLFWEK FVPTDCPPAFFPLAAICCRLEPESRPAFSKLEDSFEALSLYLGELGIPLPAELEELDH TVSMQYGLTRDSPP" CDS complement(join(28446..28590,29751..30098,31579..31736, 33380..33435,34183..34357,35686..35751,38687..38743, 38980..39111,39817..39903,40755..40941,44096..44292, 44648..44753,46810..46998,47657..47766,48461..48596, 58118..58170)) /gene="LIMK-2" /note="DJ515N1.1a; match to D45906 (NID:g1136290) and D85527 (NID:g1754512). Alternative splicing of D85527 to D45906.; coded for by human cDNAs D85527 (NID:g1754512) and D45906 (NID:g1805593)" /codon_start=1 /product="Lim kinase" /db_xref="PID:g2078471" /translation="MGSYLSVPAYFTSRDLFRCSECQDSLTNWYYEKDGKLYCPKDYW GKFGEFCHGCSLLMTGPFMVAGEFKYHPECFACMSCKVIIEDGDAYALVQHATLYCGK CHNEVVLAPMFERLSTESVQEQLPYSVTLISMPATTEGRRGFSVSVESACSNYATTVQ VKEVNRMHISPNNRNAIHPGDRILEINGTPVRTLRVEEVEDAISQTSQTLQLLIEHDP VSQRLDQLRLEARLAPHMQNAGHPHALSTLDTKENLEGTLRRRSLRRSNSISKSPGPS SPKEPLLFSRDISRSESLRCSSSYSQQIFRPCDLIHGEVLGKGFFGQAIKVTHKATGK VMVMKELIRCDEETQKTFLTEVKVMRSLDHPNVLKFIGVLYKDKKLNLLTEYIEGGTL KDFLRSMDPFPWQQKVRFAKGIASGMAYLHSMCIIHRDLNSHNCLIKLDKTVVVADFG LSRLIVEERKRAPMEKATTKKRTLRKNDRKKRYTVVGNPYWMAPEMLNGKSYDETVDI FSFGIVLCEIIGQVYADPDCLPRTLDFGLNVKLFWEKFVPTDCPPAFFPLAAICCRLE PESRAPPGAAGEGPGCADDEGPVRRQGKVTIKYDPKELRKHLNLEEWILEQLTRLYDC QEEEISELEIDVDELLDMESDDAWASRVKELLVDCYKPTEAFISGLLDKIRAMQKLST PQKKPAFSKLEDSFEALSLYLGELGIPLPAELEELDHTVSMQYGLTRDSPP" misc_feature complement(29756..30043) /gene="LIMK-2" /note="Similar to Human PNG protein 2208307A (PID:g1588291). May encode an alternatively spliced exon of LIMK-2." repeat_region complement(30647..30933) /rpt_family="ALU" repeat_region 32187..32478 /rpt_family="ALU" repeat_region 32889..33181 /rpt_family="ALU" repeat_region 34513..34818 /rpt_family="ALU" repeat_region complement(37104..37403) /rpt_family="ALU" repeat_region 37461..37772 /rpt_family="ALU" repeat_region complement(41340..42246) /rpt_family="ALU" repeat_region complement(42888..43171) /rpt_family="ALU" repeat_region complement(43189..43329) /rpt_family="ALU" misc_feature 44939..45109 /gene="LIMK-2" /note="match to human EST N72431 (NID:g1229535) yv40e12.r1" repeat_region complement(45110..45706) /rpt_family="ALU" repeat_region complement(45737..45853) /rpt_family="ALU" repeat_region complement(47939..47972) /rpt_family="L1" repeat_region 49697..49839 /rpt_family="ALU" repeat_region 50249..50538 /rpt_family="ALU" repeat_region 50591..50880 /rpt_family="ALU" repeat_region complement(51918..52210) /rpt_family="ALU" repeat_region complement(54337..54625) /rpt_family="ALU" repeat_region complement(60119..60281) /rpt_family="ALU" repeat_region complement(60514..60670) /rpt_family="ALU" repeat_region complement(60719..61055) /rpt_family="ALU" repeat_region 61149..61274 /rpt_family="ALU" repeat_region 61286..61578 /rpt_family="ALU" repeat_region 63666..63956 /rpt_family="ALU" repeat_region 66166..66454 /rpt_family="ALU" repeat_region complement(68763..69064) /rpt_family="ALU" repeat_region complement(69323..69605) /rpt_family="ALU" repeat_region complement(69617..69905) /rpt_family="ALU" repeat_region complement(69961..70254) /rpt_family="ALU" repeat_region complement(70269..70560) /rpt_family="ALU" repeat_region complement(70583..70874) /rpt_family="ALU" misc_feature complement(71091..71269) /gene="LIMK-2" /note="similar to human EST T93730 (NID:g726903) ye10a04.r1" repeat_region complement(72158..72446) /rpt_family="ALU" repeat_region 72577..72866 /rpt_family="ALU" repeat_region 73990..74268 /rpt_family="ALU" repeat_region 74773..75059 /rpt_family="ALU" repeat_region complement(77930..78236) /rpt_family="ALU" repeat_region complement(78459..78752) /rpt_family="ALU" repeat_region 79978..80269 /rpt_family="ALU" repeat_region 80385..80674 /rpt_family="ALU" repeat_region 81690..81731 /rpt_family="MER" repeat_region 81761..82044 /rpt_family="MER" repeat_region 84251..84497 /rpt_family="ALU" repeat_region 84517..84806 /rpt_family="ALU" repeat_region 85105..85394 /rpt_family="ALU" repeat_region complement(85913..86206) /rpt_family="ALU" repeat_region 87017..87306 /rpt_family="ALU" repeat_region complement(89481..89766) /rpt_family="ALU" repeat_region complement(91071..91124) /rpt_family="ALU" repeat_region 91791..92068 /rpt_family="ALU" repeat_region 92294..92418 /rpt_family="ALU" repeat_region 92771..93063 /rpt_family="ALU" repeat_region complement(93148..93264) /rpt_family="ALU" repeat_region 97434..97902 /rpt_family="ALU" repeat_region complement(98055..98345) /rpt_family="ALU" repeat_region 98588..98739 /rpt_family="ALU" repeat_region complement(98743..99014) /rpt_family="ALU" repeat_region complement(99591..99744) /rpt_family="ALU" misc_feature 99878..101694 /note="Similar to Human mRNA L48211 (NID:g1160612), but note confirmed extra nucleotide at position 101424 in genomic sequence." misc_feature 101358..101570 /note="Similar to L48211 (PID:g1160613), but note confirmed frameshift at position 101422 in genomic sequence." misc_feature 101875..102189 /note="match to human EST AA081336 (NID:g1623141) zn33d12.s1" misc_feature complement(101875..102071) /note="match to human EST R35811 (NID:g792712) yg66e09.r1" misc_feature complement(101897..102389) /note="match to human EST AA158198 (NID:g1733009) zo55e10.r1" misc_feature complement(102371..102404) /note="match to human EST AA023950 (NID:g1488836) mh93d10.r1" repeat_region complement(102535..102824) /rpt_family="ALU" repeat_region complement(102836..103133) /rpt_family="ALU" repeat_region 103974..104266 /rpt_family="ALU" misc_feature complement(105272..105390) /note="match to human EST AA023950 (NID:g1488836) mh93d10.r1" repeat_region complement(106521..106812) /rpt_family="ALU" repeat_region complement(107778..108074) /rpt_family="ALU" repeat_region 108189..108480 /rpt_family="ALU" repeat_region 109006..109312 /rpt_family="ALU" repeat_region 111636..111926 /rpt_family="ALU" repeat_region complement(112150..112439) /rpt_family="ALU" repeat_region 112690..112984 /rpt_family="ALU" repeat_region complement(113023..113058) /rpt_family="L1" repeat_region complement(113084..113375) /rpt_family="ALU" repeat_region complement(115050..115338) /rpt_family="ALU" repeat_region 115763..115922 /rpt_family="MER" repeat_region 115935..115973 /rpt_family="MER" repeat_region 115985..116228 /rpt_family="ALU" repeat_region 116238..116292 /rpt_family="ALU" repeat_region 116301..116588 /rpt_family="ALU" repeat_region 116996..117272 /rpt_family="ALU" repeat_region 117291..117586 /rpt_family="ALU" repeat_region complement(117752..118048) /rpt_family="ALU" repeat_region complement(118062..118222) /rpt_family="ALU" repeat_region complement(118231..118342) /rpt_family="ALU" repeat_region complement(120277..120567) /rpt_family="ALU" repeat_region complement(122057..122370) /rpt_family="ALU" misc_feature 122099..122976 /note="CpG_island (%GC=70.6, o/e=0.70, #CpGs=65)" repeat_region 123666..123956 /rpt_family="ALU" repeat_region 124288..124768 /rpt_family="ALU" repeat_region complement(124871..125343) /rpt_family="L1" repeat_region complement(125536..125782) /rpt_family="ALU" repeat_region 126197..126358 /rpt_family="ALU" repeat_region complement(126414..126706) /rpt_family="ALU" BASE COUNT 35134 a 31020 c 29358 g 33466 t ORIGIN 1 gatcctactt cgcactcaca aatatggcca taataaaaaa ggaaaaaaat caacaacatc 61 cttgttggca aggatgtgga aaaattagaa tcctcataca ttgctggtgg gatgtaaact 121 ggtacagcca ccagtacctg actacagcca taccaccacg aacatgtcca atcttgtcta 181 aactttgatg tggaaaaact gatacggaaa gtagtttgat cagccgggca cagtggctca 241 cgtctgcaat ctcagaactt tggggagctg aggggggtgg atcacctgag gttgggagat 301 gaagaccagc ctggccaaca tggcaaaacc tgtctctact aaaaatacaa atttttgtat 361 ttttagtaga gccgggtgtg gtggctcaca cctgtaatcg cagctgctcg tgaggctgag 421 gcaggagagt catttgaacc caggaggcgg agattgcagt gaaccaagat cacatcactg 481 cactccagcc tgggtgacag agcgagactc catctcaaaa aaaaaaaaaa aaagaaaaag 541 aaaagaaaat agtttgatgg ttcctcgaaa agttaaacac agaataacca gcaatgccac 601 tcctaggtat atactcaaaa gaaatgaaaa caggtatcaa acaaaaacct gtacatgaat 661 attcatagta gcattattca taatagccaa aaggtggaaa caactcaaat gtccaccaac 721 tgatgactag ataaacaaaa tatggtacat ccatacacag gaatattatt cagccataaa 781 aagaaatgct gatacatgct acaacttaga tgaaccttca aaacatcatg cttagtagaa 841 gaagccagac acaaaaggtt gcatatagta tgattccatt tacataaaat gtctggaata 901 ggcatatcca tagaaataca aagagatcag tatttacttg gaacacttgg gacaataaca 961 aataaaataa taaaaatttt aaaaaagagt ccaggcgcag tggctcacgc ctgtaatccc 1021 agcactttgg gaggccaagg tgggtggatc ataaggtcag gagatcgaga ccgtggtgaa 1081 accccgtctc tactaaaaat acaaaaaatt agccgggcac gctggcgggc gcctgtagtc 1141 ccagctactc gggaggctga ggcaggagaa tggcgtgaac ctgggaggcg gagcttgcag 1201 tgagccgaga tcatgccact gcactccagc ctggctgaca gagcgagact ccgtctcaaa 1261 ataataataa caataataat aataataata ataataatta ttattattat tatttaaaaa 1321 agaaggagat cagaggttgc cagagggtga ggagaaagga aaatgaagag tggctgccaa 1381 caggtactag gtttctccca tctgagtact aaccagactc aaccctgctt agcttctgag 1441 atcagacaag attgcacatg ttcagggtgg tatggctgta gacaggtacc agcgtttctt 1501 tttggggtga tgaagtgttc tggaattaga tagcggtgat ggttacacaa ccttgtgaat 1561 atacagacat gagccacata atgatgtttc agtcaacaga ccgcatatat gacagtggtc 1621 ctataacaca gtggtcccca acctttttgg caccagggac tggtttcatg gaagacaatt 1681 tttccatgga ctgggggtga tggggggatg gtttcaggat gaaactgttc cacctcagat 1741 catcaggcat tagttagatt cttggaagga gtgcgcaacc tagatcccta gcatgtgcag 1801 ttcacaatag ggttcacact cctattagaa tctaatgtca ccactgatct gacaagaggc 1861 agagctcaga tggtaatgtt tgctcacctg ctgctcacct cctgctgtgc tactgggttc 1921 ctaactggcc acagccctgg ggtttgggga tccccgctat aagattataa taccatattt 1981 ttactgtacc ttttctatgt ttagatacac taatacttat cattgtgtta aagttgccta 2041 tagtatttag tacagttaga tgctgtacag gtttgtagcc tgagatcagt aggttacacc 2101 atatagccta gatatgtagt aggctgtccc gtctaggttt gtgtaattat actctatgat 2161 gtttgcatga cgcatttctc agaacctatc cttgttgtta agtgacgcac aggtactaac 2221 aactactgaa ctgtatgctg taaaatgatg aatttcatga tatgtgaatt gtatctcaat 2281 agaaaagagc atttataaag tttttgttgc taaatttatt tactttcggt aaattttctt 2341 ttaaaagatg tctatgtttt actgtcttcg tggtgctggg acataactga catctgaagc 2401 cttgaatgcc tctctttgca aactcatttg ttactactca gttaccatct cacctccaat 2461 acccaactgc ataccaaacc tctagccatt actagaacac atactctgag ctctgccata 2521 ctttagcgcc tgcacacatg ctgtttcttc tgctcagaaa cttttgtgcc ttctgtccat 2581 cccatccaag cctgtttttc aatgcccaac acaaatgtca ctggtaccct gaaacttcaa 2641 ctcctccttg ccccatctca gaaaatgtaa ctttctgtat tcttctctct gtatattcct 2701 ctaaagtggt atttcttata ctataccata atgacctggt cccatatttg tcctcttctt 2761 ccctcttcat cccagactgg gaccacaagg aggtagaact ttttctgttt tttttttttt 2821 ttgagacaga gtcttgctct gtcgccaggc tggagtgcaa tggcacgatc tcggctcact 2881 gcaacctctg cctcccgggt tcaagtgatt ctcctacctc agcctcccca gtagctggga 2941 tttcaggcgc acatcaccat gcccagctaa tttttgtatt tttagtagag acgggatttc 3001 accatgttga ccgggatggt ctcgatctct tgtcctcgtg atccgcccgt ctcagcctcc 3061 caaagtgttg ggattacggc catgagccac tgcgcccggc cggaggtaca actttcttag 3121 tcaatcttga gcaccagtgt agtgcttaga atcgggtagt gagagaagct aacatttact 3181 gaatgcctaa gaagtccagg gcacataaca gtcttcacaa ccatcctgtg aagtacgtat 3241 taccctatcc attttacaga taaagaaact tgagtttcaa gaaaggtata cctaaggtca 3301 ttcaacttgt taaatgatgg aatggaggag ttgagaccag attggacttc tgcctctttt 3361 tttttttttt ttttttttga gacaggctgg agtgcagtgg tgtgatctca gctcactgca 3421 agctctgcct cccaggttca cgccattctc ctgcctcagc ctcccaagta gctgggacta 3481 caggcgcctg ccaccacgcc cggctaattt tttctacttt tagtagagac ggggtttcac 3541 cgtgttagcc aggatggtct caatctcctg acctcgtgat ctgcccacct cggcctccca 3601 aagtgctggg attacaggcg tgagcgaccg cgcccagccg aacttctgcc tcttaaatcc 3661 agggttctcc ctgtcagtac agtgaggtgg taactagcaa aagctatgag atatgactgc 3721 ctgggtacat atcccagctc tttcacttat ctttgtggct ttacgcaaat tacttaacct 3781 ctttatgatt gtttcttcat ttgtaaaagg aagataataa cagtgcctat atatagggtt 3841 tttatgaaga ataaatgaga tagtatatat aaagcactta gaacagtatc tggcacatac 3901 taggtgctca ataaatgtta gcgatgacat ttattactgt cccacattga gctggtgttt 3961 gttaaacact ggatgactgg ataaaatgta catctttcat gcagagattg ctcgattatc 4021 catttcactg attcaggatc agagtctctt aagaatccag agaggggatg ttgaagccct 4081 aagaagtaaa aacactaaaa tgataacatt ttgacctttt gaaaactctt tgaagtggca 4141 atggccacaa aattgagcgg aaaccaaaca tattcagccc catttccaga taaattggat 4201 ttgctaaacc actcagaata acaagagcaa gaattagatg ttcagctaag ctctagcttc 4261 tgatcctaga tggtggcaat aacaacgata gtattaatga taatagcaga aactcttaat 4321 atgtgtgagg cattcttcta aatgctttac atatattaat ttaatgttca tcggagcaga 4381 ttctcctact gtatccaaga gatcagaaat tcagagcaaa aagcagattg ctgcctgata 4441 gcagatccac tctaggaatg gggagatttt ctcataaatc taatgactgg tgaaagtgac 4501 aagaggttag tccactcagt gttggaaagg acaagatcct agaaagtgga aatgctccac 4561 tattgcttta gggggaacag cctggctaca actgggcaga ctttctatta ctaacatcag 4621 gataataggg aagctcctgc caatgctgca tagaactaag aacacttagc agtgatggct 4681 gacttccaac tgagccatta aggtaagtgt agaagaaagc tccaggtaga aatatctttg 4741 ggaactcaga gaacacactg agtgtttgaa gctggtactt aatctattat tgaactcaga 4801 gtttgccatt ctggggccaa tttgagtcca cttatgacac ctcccacagc actgctcatt 4861 tctgttaact gggcttcaca gagggtgaga agggcagcaa tcagatcata taattgggcc 4921 tcattgtaat aaaatgttga tttctgttgt ccttggactt ccttccagcc acaatcactc 4981 tatgatcttg tctgagattc tttttccttt ctctctccat tttgctgact gtaatggaat 5041 aatagcagaa gctctcaatc tgtatgagga atatatatga agaatgtgta acattcactg 5101 tgtttcctaa tagatgaatt atctaaagga cacctcttaa gaagagatga gatggagcca 5161 actgtcagaa aaaaaaatgg gtgaagaaag tttcaaggag aagactccac agagcagaat 5221 gggaaagatg gccaggagaa aagtgagata ataaaaggaa aagcagtccc tgacacagaa 5281 ggtactcaac agctatttgc tgactgaaag aataaacact aataagagaa tgaaatacca 5341 ggattataaa gagggaacaa agatattttc ccatctactt tgttgttacc ccacccagga 5401 tatatttagg ttggtgcaaa agtaattgcg gtttttgaat cgttaataga ttaacatcta 5461 atttatgtat tgtttacata catacaaata caaactacaa ccatatatga tacatttaaa 5521 gtaatacgct aggcacagtg gtgcatatct atagtcccag ctgaggtggg aagactgctt 5581 gagcatagga gttcaagtcc ccttgggcaa catagcaaga ccctgtctct aaacaaagaa 5641 aacaaaaaaa gtgaagtaat taatgccaaa cattgcaatc agggtttggt gcacttgaaa 5701 tacacaaatg agtgttttta catgctgaag acacagcaac caacaactgg ggctaaagtg 5761 ctgcattcct aaagttaacc cacagagcct atcatgccat gccaaagttc actgctccgg 5821 ctccaaaccg gggactttcc tttcctattt cctctcccta acagcttaga gctactggac 5881 tccatccttg aactcctctg tttactaaaa atgttcttta taagctatga aatctggatt 5941 tgaggtctgg tttcttactt caactttgca actcatttcc caccccgagt tcacacaatc 6001 tcaaaatgtt catgaatttt atggctagac ctaaagcttc tttgtgcctt cctgccagcc 6061 tgcctttgtt ttggtacctt ctagcctatc ctcactcact tgagttttta ctctaatttt 6121 ctgtccagtc ttcccaccac tagtccccta tcaaattgct tgctggacat ttctatatac 6181 atagcttgct tgtccttacg tcaaattcaa aactaccaag agcaaatcca atgtgtgttc 6241 ccacatcagc tcccctttct cacctccctg gtatggttac tgaccaatta gaatgaatgt 6301 ccaaccactc agaatgccac aggcaatagg ccaaatcagt gtactgggtc caatgtcagc 6361 cctgtccata atccttctac ccagctaaac ttggaggaat tgtctgttct cacagtctcc 6421 cttcttcatc tcttattcct tccataatcc atacattttt agccctcctt ccttatcata 6481 aaaattcctc ctgacaaggt caccaatggc cttcttgttg ctaaaaccaa ttaattcttt 6541 ttagctttta tttcacttga tcatccttgg taacatttaa caatgctgac cattccctcc 6601 tccctgaaac actctcctta gcttctctga cattatactc ttctggttgc aatccagtca 6661 cctctggtgg tgccttctca gcctccccca aacctgcttt tcctcaggtt tcttatctca 6721 gtgaaagtca ccaccatccc ctaagctcct aaaccagaaa ccgaattcct tattcttctt 6781 atttgcttct aaccttcctt cagtcatgga atcctgctta ttcttagttt gcttctcttc 6841 actgcctaca tttttttttt ttttttttga gatggagtct cgctctgtcg cccaggctgg 6901 agtgcagtgg catgatctca gctcacggca acctccgcct ccctggttca agtgattcgt 6961 ctgtctcagc ctcccgagta gctgggacta caggcacgtg ccaccaggcc cggctaattt 7021 ttgtattttt agtagagaca gggtttcacc atattggcca ggctggtctc aaactcctga 7081 cctcgtgatc cgcccgcctc ggcctcccaa agtgctggga ttacagacgt gagccactgt 7141 gaccagcctt cactgcctac atttttaaag cacttaaata tgtgctaggc aggtgtgttc 7201 taagtgattt atcctcacaa taagcccata aggtaaattt tattatctcc aattttcaga 7261 ttaggtaaca gaggcaaaga gtggttaaat aacttgcata aagtcacata actactaagt 7321 ggcagagcag gattcaaacc caggtggcct gactcaattc tggatccagg tcactatcct 7381 atctcactga aatctctgca acagcctcct tggtttccag ccttccatca tggatggctt 7441 ccctgtagcc acagtgatct ttgttaaaac agaagtctga ctcttgctac tcttttgcat 7501 gaaaatcttt actggctctt tattgccttc agaataaagt ctacactcaa tatgacattc 7561 agggacctcc ataacctagc cagaaagtgt atacttcttt ggcttcatgt ttttggtttt 7621 tggccccatt ttaaccacga ccctaaaagc tccactagca caatgatagc ctatacttcc 7681 aggtttctgc actgtttgat gtgccaagaa cactctttcc caccttcaca ttcgacgact 7741 tctgataagc cttttctgat ttcctaagac cagggtatat gcctctcctg tgagctccca 7801 cagctcccag ttctaattct cactgtagca tttatttcac tgtgttatat gattctacta 7861 cccatctact aatattaata tctgctttat cacaagattg tgagcttcct gagaatggga 7921 actgtgtagg attcactgtt gtatcctcag tactcactgc tgcctacctg gtgcacggta 7981 ggcaatattc atttaatgaa taaaacttta ttgtctatat cacactcttt aacacttact 8041 tatataatgc cctatattga ttttattttc atatgtgtta gcctgatctt tataaccaga 8101 atgtcagttt cttgaaagat agcaattata cccagtactt cttttataaa tttatttact 8161 cttaggctgg gtgcagtggc tcacgcctat aatcgccaaa ttttgggagg ctgaggcagg 8221 aggatcactt gagtcgagga gtttgaggtc agcctgggta tagtgagacc tcatctctat 8281 aaaaaataag caagattagc tgggcatggt ggcatgcacc tgtggtccca gctattcagg 8341 agagctgagg tgggacggct tgagctccag agcctgaggc tacagtaagc catgatcatg 8401 ccactccagc ctcggtgaca aagcgagatc ctgtctcata aaagaaaaaa aaattatctt 8461 tagagatggg gtctcactct gtcccccagg atggagtgca gtgtcatgat catagctcac 8521 tgcagccttg aactcttggg ctcaagagat cctcccacct cagcctccca agtagcgggg 8581 actacaggtc tgtgctgcca tgcccagctg attcttttat ttgcagagat gggggtctca 8641 ctatgttgcc caggctggtc ttgaacttct gacctcaagc aatactcctg cctcagcctc 8701 ccaaaatgtt ggaattatag gcaaaaccca ccatgcctgg cctataccta gtacttacaa 8761 gttaggaggt actcacaaaa attcctgata acagctaaca tttattcagg gatcaatgaa 8821 tgtcaacaat ccaatgattt tggtaatttt tctctctctc tctgcctccc tccatatgag 8881 gaaactgagc catagtttag tagcttgcct aagatgacat agctaggaag caacagagct 8941 gggatttgaa tcctcatgtt ggtgcctgca ttcttaacca ctatgataac aggctaagaa 9001 agcttaaaat tatagatgct aagtaactta cctaagatct ttttaaacat ttaaaaagtt 9061 ttttttttcc tttgttagta ggcaaggaac taagatcata gtgcagtgct aagactagaa 9121 cccaagtatt ttcaaaggct acgctcggct gggcgtggtg gctcatgcct ataataccag 9181 cactttggga ggctgaggtg ggtggatcac gaggtcagga gtttgagatc agcctggcca 9241 acatggtgaa accccatctc tactaaaaat acaaaaaata gctgggcgtg gtggcgggtg 9301 cctgtaatcc cagttattcg ggaggctgag gcaagagaat ctcgcttgaa cctgggaggc 9361 ggaggttgca gtgagccaag atcgcaccac tgcactccag tctgggcgac agagcgagac 9421 tctgtctcaa aaacaaaaac aaaaacaaaa aaaaacccca aagcctacgc tcataaatta 9481 tttggtatac tgcttctcct actttaaaag tcagtactaa gctcatactg ggtgttaata 9541 cattttaaaa ctgcaaatcc agtttactga aattggagaa tatcagatcc ccttccaaga 9601 gaggcataat tgtagcttac tgggtgaatt caccttttat attgacagtc tagccttatg 9661 aaccaattaa gtttgtagtt tctgcattct tcctgtgaca ctgaagactg ttttcatcaa 9721 tgtgaaaacc atgttatcta agctaaatca aaatccactc tcattcagaa tttattcttc 9781 cttttatttc tcaacaccag tgacatgttc ccaagttctc tgcctttgtg gctccttagt 9841 ttccagtgtg gaatgtctta tttctgaaag caacctgtga ttgacaagcc ttcttggcta 9901 ttctgtagtt taccctaaaa aaaatctgtt ctctctgtcg gctcattaga ctgcttttcc 9961 ttacatggta ctctgctttt cccctttatt tcaagcattt attaaaacta aactatgtac 10021 caagcataca caaaatggat ataacctaac tctttgaagg ctacacttca aagaaacagc 10081 cttggtgttt acatattttg actagaatat tatttcccta acaagctgca taaaaccatt 10141 cctgctcagt ttgtaattat acaccaaaat attgagcaca atttcagtag ctacttctac 10201 agcacatcat ataaagtgaa ttgctggagc caactcatta ggtcttattc caaagatttt 10261 taccaacact ttggccctct tttgctgaca aaaaaggatt cttaagcctt ttaacagctt 10321 tataacttcc ttattagagt tcactcacta acatatcctg ttaacaaaat gaagtgccca 10381 tccttcttgg gaaacactct ggagaatatt tcctgatttc atcgcacagc tttgtctctg 10441 gattagtaat catctaatac gaaactgagg atatgagtat tatgtttcac aatgttgtct 10501 tcaaggatat aaacaactca aaatttaagt atttgagtag cactgtatca cattattatt 10561 gccgtgatat tttcagaaga aatctgaagt ctgcgctaac tacaggccat aaaatatggc 10621 catcttgccc caataaatgg agcaaagttc aaatctggat catcagaaaa agacaagagg 10681 cacatttcta aaccaaatct gattgccttg aggaaggaag gtactgaacc tgttagagta 10741 catctctgaa ggactgagaa tcaagtcctt cattccctcg ccccctaaaa aaatccaatc 10801 aatcatgctg ataaatgtct tagttacagg ggagaaaaaa tagacatgac taagctgaaa 10861 aaaatcttct ggctttgtat cttttctagg cagtgctgtt cctgataaga cacatgacac 10921 aaaatatggt tttgctgagt atcaaggtaa gttagggaaa atagtatttt ttaaatagga 10981 aaattagatg acaacagcaa caggtaaaga agccttgggc ttctgaggta gaaatactag 11041 catccctaca taggcatggt agataatctg aaactagtac aagtgctggc ctgttatagc 11101 ctgtctctac cgtaatcaga gggaaaagta ttacaatcca gcaaaagtag cccttcaaaa 11161 tctaaggagg agcatttcat cagatatcaa taaatgaacc ctcgctgcat acaaaggcaa 11221 tacattacgt gctgagtgac caacttcaaa gaaccttctg ctgttctgaa atttaaaact 11281 actctcattt cctcaagcct gtattttatg taatgaaaca aaatgcacat catatgtctc 11341 ctgctgcact gtaagagtaa acataggacc cctcccacca aaaattgtgc aaagagccct 11401 tccccttcag ctttttgcta tagaaatatc ctactacttc cctgaaaata gtatatgctt 11461 tctttggcaa gtggaatatc accctgttac tctctccctt actctgcttc actttttttc 11521 gcataacgct taagattacc taatatttta catgtttatc atgtctctcc caattagaat 11581 ataagcttca tgagggcaga gatttcaaat attttgttca cagatgtatt cccagtatct 11641 acaactatat ttagcatgta gcagattctc aaaaatattt gtaaaattaa taaaatttaa 11701 aaatccctat tatattggaa ttgttaaatt aaaaaaaaat tttggccagg cgcagtggct 11761 cacacctata atcccagcac cttaggaggc caagacaggc ggattacgag gtcaggagat 11821 cgagaccatc ctggctaaca cagtgaaacc ccgtctctac taaacaaaat acaaaaaatt 11881 agccgggcgt ggtggcgggc gcctgtagtc ccagctactc gggaggctga ggcagaatgg 11941 cgtgaaccca ggaggcggag cttgcagcga gccgagatgg cgccactgca ttccagcctg 12001 ggcaacagag caagactctg tctcaaaaaa aataaatttt ttttttaaga cagggtctct 12061 ctttgttgcc caagctggag tgcagtggca tgatcacagg tcactgcagc ctccaactcc 12121 tgaactcaaa tgatcctccc acctcagtag gtgggactac agatgagtgc caccacacct 12181 acctattttt ttttttttgt agagatgagg gtcttattat gttgcccagg ctggtctcaa 12241 actcctgggc tcatcctccc accttggcct cccaagtgct gggatttaca agtgtgagcc 12301 actatacctg gccaaatttt aaaatgctac cgtatctgtg tatgaatatg tgtgaataaa 12361 tttacagact ctagggtgta attcacaggg tagaccaagt caagagtcag aggatcagaa 12421 gcttgggtgg ggagatagga gagtgattgc aaaaacaaag caatctatct tattaaatgg 12481 acgaaaggga tcaaacatta tcagtaaact attttaaaga gtgaattaaa tgtatctctc 12541 tatcctttct atgagtgtag tgaattcaga ttcacctcag attagttgaa ttgacactag 12601 ttttcttccc tgataaagct tttaaaattt atttagtaaa gtggccaggc atggtggctc 12661 acacctgtaa tcccagaact ttgggaggcc aaggtgggtg gatcacctga ggtcaggagt 12721 ttgagaccag cctggccaac atggtgaaac cccatctcta ctaaaaaaat acaaaaaaaa 12781 atttagccag gcgtggtggc gcacaccttt aatcccagct acttgggagg ctgaggcagg 12841 agaattgctt gaacctggga ggtggaggtt gcagtaagcc gagatggtgc cactgcactc 12901 cagcctggga gacagagact ctgtcccccc cacccaaaaa aaaattagtt gttttcctcc 12961 ttaaattaca gcctgcaaga ttgaaacaga gtttccccat ttctattttg ggatctgact 13021 gagtcacaag gtagtcttga cgaatcactc tgctctatct ctcagtttac tgaggctgcc 13081 tgggaacttg agctgtcagc caaacaggaa gaagggccct tctgcctact ccaaaaggcc 13141 tcagtcaaat ggatatggtg agatttaatg atggatataa attacagcaa atggatttcc 13201 tatagggtga gggaaaaaaa aaaaagcaaa cattttactt aaattcctct atacaattta 13261 gggctggcag aaagggtaga ctattacaat gtagctgagc aaatttttcc atgtttgcat 13321 aacatgtgtt ttcaactaaa gatactcctt ctatagtcct actatataca acaccccaaa 13381 aaacctcaaa tgtaagaaaa agcaatgtga cagatttgac cctggccaag ttacactgtt 13441 tttttttttt tttttttttt tttttttttt taagtgtcag tgttcataaa ggcccttttt 13501 ctttttcaag gatgggtata aagtgttact cggccgaacg cggtggctca cacctgtaat 13561 tccaacactt tgggaggccg aggcaggtgg attacgaggt caggagttca agaccagtct 13621 ggccaacaca gtgaaacccc cgtctctatt aaaaatacaa aaaattagct gggttgtggt 13681 ggtgtgcccc tgtcacaagg tagtcttgac gaatcactct gctctctctc tcagtttact 13741 gaggctgcct gggagctgtc agccaaacag gaagaagggc ccttctgcct aactccaaaa 13801 ggcctaagtc aaatggatat ggtgagattt aatgatggat ataaattaca gcaaatggat 13861 ttcctataga gtgagggaaa aaaaaaatcc agctactccg gagactgagg cagaattgcg 13921 agaacccggc aggtggttgc agtgagccga gattgtgcca ttgcactcca gcccgggcga 13981 cagtgtgaaa ctctgtctca aaaagaaaag aaaagaaaaa aaaaaaaact attactccac 14041 tccccaccag aaactcttat tttttccccc ctatgtggtc atgaattcct cctgggtgca 14101 tagctcccta agttgttctg tgttttagtg cttataattt cccacatccc atgatgcccc 14161 ttgatagaat tattttttct gtactcacac aaatggaaac agaagagatg aaaaaccctg 14221 gggatgcggc ctcgcaggtc tcaaactgat cggctgctaa gcacaacctg catatctctt 14281 ggctgagttc ctttcttggc tcagttagat tttgcatgac ctaggaggct taggacccag 14341 ggggcgcctt tcagctgaaa aacagctcgc gctgcagcaa gctagctggg aagctcccag 14401 ttctaaagag aggctgttta ccagaacagc ataacaaggg caggtctgac tgcaaggctg 14461 ggactgggag gcagagccgc cgccaagggg gcctcggtta aacactggtc gttcaatcac 14521 ctgcaagacg aaggaggcaa ggatgctgtt ggcctgggta caagcattcc tcgtcagcaa 14581 catgctccta gcagaagcct atggatctgg aggtgagatt acaatctttc ctttccgcag 14641 caacttcatg aaacagaagg ggtaaagccc cctatttccc aaacagggcc aggatgacag 14701 gaaacgaagc gtttagccgg tttctgttgt cccgtgaaag cttttccacc cctcctttcc 14761 ttgtcagggg cttgcagctc catctttgcc attaagaaca aaggcaagaa gccactttct 14821 ctctgccttt ccttgctcat gagtggatgg agcttggggg ttcatctccc ctactcccca 14881 agtcctgttc tttctaagcc ctttgcatgg tgtggtcctc cttccccagc ctccttctgt 14941 ggttctcctg cagaggggag tgggagcttg gagtctcatc cagtcagcga tgaggccggt 15001 ttcctccgtt tgaggctgtc ttgaatcttt tgcctgacgg agtcaaggct cctggagaat 15061 tctcctggga gggagtattt gccttttaaa cgggcagaag gtggccgtat ccgtgaagga 15121 ggaaacaccg gagtttatgt aacgtggcca ttacaacaca gggacaaatg tcacgtctgg 15181 gcgccccccg tcccaaaact cccagctcct caaagggacc ctaagccagg cctcctcctc 15241 tccatctcct cctccaaagt ggtcacaagt tcaagagtaa caacgctttg cacttctgca 15301 gcgctttcca ggtttcaaaa ataaataaaa cctaacagtc tctgagcgtt gactggggga 15361 caggcacagg ctgggggctt cacaccctga ggtgtgcgtg gagttggggg gggggggtgc 15421 gtggggcgtg ggggtgttgc cacgagagga gagggaggcc cagcagggcc cggggtcccc 15481 taaccaggct gaggttcagc ccgggtggtc ccggctcccc tgtccgccac gctgggcgtc 15541 ttccgttcgc tgcccgctat tctgtgtccg ggggcttctt ttcctcacag gctgtttctg 15601 ggacaacggc cacctgtacc gggaggacca gacctccccc gcgccgggcc tccgctgcct 15661 caactggctg gacgcgcaga gcgggctggc ctcggccccc gtgtcgggta agtgtcctcc 15721 gggggacggc cccggggctg gcgggcggcg gccgcgttcc cggtgccagc cccggcacac 15781 atttccttct ctcctctctt aggggccggc aatcacagtt actgccgaaa cccggacgag 15841 gacccgcgcg ggccctggtg ctacgtcagt ggcgaggccg gcgtccctga gaaacggcct 15901 tgcgaggacc tgcgctgtcc aggtacctgc ccgggacccc gggaatcccc gctcctggcg 15961 ctgacaggaa gcggctgaga cactcgtggc gcacgtgcgg ccgcgcgatg ccggccggcc 16021 ccgctccgcc tccgcgccgt ccgattggcc agggctgtct ccaggccgcg gggctcaaag 16081 cctattggcc gccgccgcca ctcgtatgag ctcatccgcc gcgagcgctt gtaaacaact 16141 gcgcggggcg aggctggggt gaggctaggt ctctccgccg cgcgtcctga ggagtggctt 16201 ttggcccctc agcgtttatt gaccgtttgc ggagcgtctg cagtgtgcta gacccacact 16261 gctcacaacc ccagctcccc aacctcccct gacacacacc gcgggaagaa gtcgtttgag 16321 gcatctgagc ttcagagaag ttaagcgatt taccaaaggc aatggcaggg gacagatttg 16381 aattcacgtt tcttagattc cagagcgcag gctttttgct cccaaagcag caagagagtt 16441 ttctaggttt aaatgggata atcaaaataa agcatttatt ttttattttt tattttttga 16501 gacggagtct cgttctgtca cccaggctgg agtgcagtgg cgcaatttcg gctcactgca 16561 acttccgcct tccgcgttac agtgattctc ctgcctcagc cgcttgagta gctgggataa 16621 caggcgtgca ccaccacgcc cagctaattt ttgtattttt agtagagacg gggtttcacc 16681 atgttggccg ggctgctctc taactcctga cctcaggtga tccgaccgtc tggcctccca 16741 aagtgctggg attacaggcg tgagccaccg ctcccggcac aaagtaaaac attttaccta 16801 aggactggca cagagcgtac gcttagatgt gagctgctca aaggaagcat gggttgtttc 16861 ctgaagtaga agtgggggga atcccctgca gaagaacgca accgagaggg gccttgggaa 16921 atagcaagga tttggacaga taaagacgtg ggaggtagga gagaagccct tccctacggc 16981 cttcctctct catttgcctt gtccctgaga tggttcagcc ctctcaggcc tgatcttacc 17041 tctggggcag aatttgactt tgggttgtgt gtggggggaa attcttaggt gacccacctt 17101 catctaccct aacactcagg ccaggttcag ctgtggaatt cactcagggc agtccctgat 17161 gagagctgtt tcccacctca tctccagaga ccacctccca ggccctgcca gccttcacga 17221 cagaaatcca ggaagcgtct gaagggccag gtgcagatga ggtgcaggtg ttcgctcctg 17281 ccaacgccct gcccgctcgg agtgaggcgg cagctgtgca gccagtgatt gggatcagcc 17341 agcgggtgcg gatgaactcc aaggagaaaa aggacctggg aactctgggt atgacggtcc 17401 cccacccctg ccctcgttgg gattcatcaa gagatgtcat ttgctgattg tctagggtgt 17461 ggctaatggg accttgtgtc ctatccttgg caggctacgt gctgggcatt accatgatgg 17521 tgatcatcat tgccatcgga gctggcatca tcttgggcta ctcctacaag aggtcagtag 17581 cttctcttct gggccctctt aggaggaggg gaggaaggta cacaaagtca aactttgtgg 17641 ctttcttacc caaaaggaag caagatgttg aaaatcagct caagatgtag agacttggtc 17701 ctagtcttta tctgctgacc ttggacaaac cctttccttt ggtgccttta cacctcactt 17761 aaaagccttg tgagggatgt tcttgttccg cattttacaa atgaagaagc ctctgtcaga 17821 gacttgctca aagtcacata gcctaggagt ggcagaacct ggattcagac ccagctcttt 17881 tgatagcaaa ttccctgctg tctgatccca catgcctggc attgaagtga gtggcactgc 17941 agaaatcagg cttgacactg attgcaggga ccttaagacc aaattctttc tgtgagttat 18001 aaaattctgc tactagaaaa ggtttccagc cactgttgca gttgaggctt aaatttcctg 18061 agctataagc agtgggatct cctctgctag ggaagaaaaa agtgagttgg agaacagaaa 18121 gtctgtttgc cactcttcac cacgtgtgag aaagccaaag aggctacttg tgaaactcaa 18181 ggacatgacc tgtggctggg acagcaggag aagaatgctt attttgaaag gtcccacgag 18241 atgtccgtca gtgactggtg catgatggta agccagactg gcattcatct ttacagcact 18301 ggctgattgt ggaaaattat ttgccattct tgggcaaaat cgggggattt gacttctagc 18361 cctgggtgtc tgtaatattc tcctctgtgg tctgagaaaa cttcctgtga ggaaagtgac 18421 acagtagcct gcatttttcc ttcctttcct ccctccctcc ctcccttcct tccttcccct 18481 ttctttcctt cctttctttg attttttttt ttcagggtct cgctctattg cctaggcttg 18541 agtgcagtgg tgcgatcacg gctcactgca gcctcaacct cccagggcgc aagtaattac 18601 ccccacccac ccccactgag tagctgggac tacaggcatg tcccaccatg cctggctgat 18661 ttttaatttt tttatagaga cagggtctca ctgtgttgcc caggctagtc tcaaactcct 18721 gggctcaagc agtcctccta cctcagcctc ccaaagtgta taggcatgaa ccaccatgcc 18781 tggcatgcag cctgcatttt ccagtgatga gggccatgtc tgagcccaca gcctcagtgg 18841 atcacccagc agggcatctt taaaggcctt gtagtcaacc tcctcattac agatgaaaaa 18901 ggggaagttt gggtataata ataacagcag tagcaaacat gtaccgtatg ccagacactg 18961 cttgaagcac atgctttaac tcataatcac ccttatgagg taaatgctgt catctccaat 19021 ttacagatga ggaaactgag gcaagagggg attatatacg ttacctgagg ttacacacct 19081 agcatttggt ggagagaagt cctttaacta gctatctctg tgaccctggg caaattgctt 19141 caatttccta ctccacatca tggtgtggag ctatggcttt tgactactta gctcagggtg 19201 tggcacatag taggtggtca gtaaatgatg gctgtttttc ttttctataa aatgtgtggc 19261 tgggcgtggt ggctcacacc tgtaatccta gcactttggg aggccgaggc gggcagatca 19321 tgaggtcagg agttcgagac cagcctggcc aacatagtga aatcctgtct ctactaaaaa 19381 tacaaaaaat tagccaggcg tggtggcggg cacttgtaat cccggctact tgggaggctg 19441 aggcaggaga atcgcttgaa cctgggaggc tgaggttgca gtgagccgag attgtgccac 19501 tgcactccag cctgggcgac agagccatac tccgtctcaa aaaaaaaaaa aaaaaaatgt 19561 aggtaatgga ccaggcgtag tggctcacac ctgtaatcct agcactttgg gaggcagagg 19621 tgggtggatc gcttgagccc aagagttcga gaccagcatg ggcaacatgg caaaaccctg 19681 tctctacaaa aaaatacaaa aattagccag gcatggtggt gggcgcctgt agtcctagtt 19741 actaaggagg ctgagatggg aggatcacct gagcctggga agtcgctatg gtgagccatg 19801 attgtgccac tgcactccag cctgggtgac agagtgagat gctgtctcaa aaaaaaaaaa 19861 aaaaaaaaaa aagtgggtaa tggactagta ctctctgatt tcttatgatg caggaagcag 19921 aggtccagag agggaaagta ttgacttgag aggacatcct gacgcccagt ctgaggttgt 19981 ttcccacaat gggatgcttg acatgcaggt ccagggacac ctgcttgttc atttcagagt 20041 taatgacata gtggccctgg ccttacaatt cataagcaga cctaagccac attggctgag 20101 tttagtcctt gaactatctc ctttttcctc agccacccaa cagcatttgg ctgtcctgca 20161 atcctgtgac agtttccagg ccttccaggt gcttgaactg actaaacatt cttaattctg 20221 agctcagttc tggtgtccag tggaagtctt ttttcttcta gatttgttcc tctaggctgt 20281 cacgagactg ccctgaggcc taaaatttat tttcctgtaa actcagggca aggcacccag 20341 tcatcagact ccccgtcact aacgaccaga tcaccccttt cttcagcaga actgctctga 20401 gaacagactg tacaacagga aaccgcatgt gggcttaaaa acaagcttgc agccttcatc 20461 tcttcctaaa gtcagagagc agagtgtcct gaaaagagct gtaggttgag tgtgggttca 20521 ggtgccctct gagccttggt atcttgtctg ttaaatgggg atatatcctc cctccacatc 20581 tacccatcta agattataag tattaaatgt aaacagaaac aatagcacac caatcaggta 20641 ataatcacaa tagcagccaa catttatttc atgctttctt aataagtaag gccctactgt 20701 atgctcttcc ctgattatct ttttttctta attttttttt ttttgagaca gagtctcgct 20761 gtgtcggctc actgcaaccg ccgcctcctg ggttcaagtg attctcctgc ctcagcctcc 20821 cgagtagctg ggattagagg cacccgccac catgtcctgc taatttttgt atttttagta 20881 gagacggggt ttcaccatgt tggccaggct ggtcttgaac ttctaacctc aggcaatcca 20941 cccgcctcgg cctcccaaag tgttgggatt ataggcatga gccactgcac ccagcctcct 21001 gattaccttg actaagcctc acaaccacac taggagcact gttagatttt acagtgagga 21061 gctgagacct agcagagtta agtacaatgg ccaggctaca cagctagtaa tgtatgtctg 21121 aggagtatta gtatctccag caagtccagt gaggccaagt agcctcatgt aaatttgctt 21181 tgaattgcaa aggaaagtag gaaaaataga aatttaaaaa aaggctgggt ttaaagtcag 21241 aagaaacaag atctagttct ggaactttca cctattctgg atgtgacact ggacaagtta 21301 tttaccttct ctggacctca atttcttcat ctgtactgta agggggttag atatggtgat 21361 actcccaact ctatttccta aaaagacaat attgccggtg ctaagactgg gttcaaatcc 21421 cagctctgcc atttgcttga acaaatagtg taacttctct gcatctgcct tcctatctat 21481 gaaacagatc aaaatagata ccacataggt tgtgaggatg aaattggaca gtatagttta 21541 ggcatacaat tggtgttcta tagcctggga gaggaagttg gaggcagcat acagtagggc 21601 cttacttatt aagaaagcag gaaataaatg ttggctgctc ttgtgattat tacctgtagc 21661 tgccttggga acttttcctg agcagcagtg tggcacagca ctgcctctga ctgggaaaat 21721 agcagcggct cagagtaata attgaagacg acctgtgtag tctgaatgag gtctgtggtg 21781 cttccatact ctgggagagg aagttggagg taggcagcag cttcctctac acctgctctc 21841 tctggccagc tcagctccca gtcccaccct ggccttggag caagcagtca gggtaggtac 21901 tgtaagcctt gagctctgcc aaaggccatg aggcttgcct aatgggatgg agcttttccc 21961 tggctgggat gggtgggcag gcaccaggct tcctcttagg gagggagggc tgtaaatggg 22021 ggtgggtggg agctcagcct ggccgggtct ttgtcctggt agtctaggcc acgaggttca 22081 aatggccagt ctgtgactga ggagctaagt gtactgtctg cactagcttc agagggaggt 22141 cttattttct gataaaaagg ggagggaaaa actgtaagtc aacaccactc cactccctta 22201 ggagaggaac taagtcagtg aacaagcctt ttgttctttc ttggctgcca cacaacaccc 22261 aggctaacca ccccctaccc ccaagcagtg agaaggggct aggctgcctg atggtcagtg 22321 tagcaggcct agtggcctct caaaggtcac ccaagggagc tgggagcagc cactgcatct 22381 ctaggactca caggcaccat taacagcagg cacatgggat tgagtgtgct tccaggcttc 22441 caaatggatg agactaatcc agactagtgc cagcatctgc tttctttacc cttgactgct 22501 cttccagccc tgctctgcct cattaataca aattgatttt tttcccctaa agacttgtat 22561 atgatgcagc ctggccatta gcagcagctc agctggccac aggtaaagga ggttgctgag 22621 gaaagcaagg cgcggaagtt aaactcaaaa gcctcagttt cttcatctat aaaacaggga 22681 ttaggctggg tgtggtggct cacgcctgta atcccagcac tttgggatgc tgaggcgggt 22741 ggatcacctg agatcgggag ttcgagacca gcctgaccac catggagaaa ccccatctct 22801 actaaaaata caaaatactc tgggcgtggt ggcgcatgcc tgtaatcctg gctacttggg 22861 aggctgaggc aggagaatcg cttgaacctg agaggcagag gttgcagtga gccgagattg 22921 tgccattgta ctccagcctg ggcaaaaagc acgaaactcc atctccaaaa aaaaaagcct 22981 gtagctgaca tttagtagat gcttattaaa agtaataagc atctctggga ggctgtctca 23041 cagcaggatg atcaggtcag ttttctggac aactttatgg gttaggtccg aagttcagat 23101 agttctctgg gtctctgggg ctgaaagaac aaaactgtgt cttgctttaa gagtctctat 23161 ctggccaggt gcagtggctc acgcctgtaa tcccagtact ttggggggtg ggaggatcgc 23221 ttgagcccag tagtttgaga ccagcctagg caacatgttg agaccctgtc tcaaccataa 23281 aaaaaaaaaa aaagtctctg tcctttactt tattttcatt gagccccggt gaagctgtat 23341 gaagtccata ctgctgtcac cctattttac agatggagaa cctaagacca gaggtgcagg 23401 agcttcttgt tgagtttaac gccgtcttgc agaatggata tcctagaatc ctggacctga 23461 gtttgggtcc tggctcagcc actgatttgt tgtgacaatg gcaagctgct gctcttgctg 23521 agcctcaggg atgctatcta atgagcaaag ctatgcagga ggcatagcct gaatgtttgt 23581 ttgttttttt cctgccaggg ggaaggattt gaaagaacag catgatcaga aagtatgtga 23641 gagggagatg cagcgaatca ctctgccctt gtctgccttc accaacccca cctgtgagat 23701 tgtggatgag aagactgtcg tggtccacac cagccagact ccagttgacc ctcaggaggg 23761 caccaccccc cttatgggcc aggccgggac tcctggggcc tgagcccccc cagtgggcag 23821 gagcccatgc agacactggt gcaggacagc ccaccctcct acagctagga ggaactacca 23881 ctttgtgttc tggttaaaac cctaccactc ccccgctttt ttggcgaatc ctagtaagag 23941 tgacagaagc aggtggccct gtgggctgag ggtaaggctg ggtagggtcc taacagtgct 24001 ccttgtccat cccttggagc agattttgtc tgtggatgga gacagtggca gctcccacag 24061 tgatgctgct gctaagggct tccaaacatt gcctgcaccc ctggaactga accagggata 24121 gacggggagc tcccccaggc tcctctgtgc tttactaaga tggcctcagt ctccactgtg 24181 ggcttgagtg gcatacactg ttattcatgg ttaaggtaaa gcaggtcaag ggatggcatt 24241 gaaaaaatat atttagtttt taaaatattt gggatggaac tccctactga cctctgagaa 24301 ctggaaacga gtttgtacag aagtcagaac tttgggttgg gaatgagatc taggttgtgg 24361 ctgctggtat gcttcagctt gctggcaatg atgtgccttg acaaccgtgg gccaggcctg 24421 ggcccaggga ctcttcctgt ttcataagga aaggaagaat tgcactgagc attccactta 24481 ggaagaggat agagaaggat ctgctccgcc tttggccaca ggagcagagg cagacctggg 24541 atgccccagt ttctcttcag ggatggatag tgacctgtct tcattttgca caggtaagag 24601 agtagttagc taacctatgg gaattatact gtggggcctt gtgagctgct tctaagaggc 24661 taacctggaa actaagctca gaggcaaggt aataaagcac ttcagggctt gctccccaag 24721 tgggcctgat ttagcaggtg gtcctgcggg cgtccaggtc agcaccttcc tgtagggcac 24781 tggggctagg gtcacagccc ctaactcata aagcaatcaa agaaccatta gaaagggctc 24841 attaagcctt ttggacacag gaccccagag aggaaaaagt gacttgccca aggtcgtaag 24901 caagctactg gcatggcaag agcccagctt cctgacggag cgcaacattt ctccactgca 24961 ctgtgctagc agctcagcag ggcctctaac ctgtgatgtc acactcaaga ggccttggca 25021 gctcctagcc atagagcttc ctttccagaa cccttccact gcccaatgtg gagacgggtt 25081 agtggggctt tctatggagc catctgcttt ggggacctag acctcaggtg gtctcttggt 25141 gttagtgatg ctggagaaga gaatattact ggtttctact tttctataaa ggcatttctc 25201 tatatacatg ttttatatac ctcattctga cacctgcata tagtgtggga aattgctctg 25261 catttgactt aattaaaaaa aaaaaaagac tccacattgc caagtttttg aggggtaaca 25321 ggaaccctcc gtgtaagttg agaagctttg ggttaccttg tcaacaccta tgtggcaagc 25381 tctgggcctt ttcttcattc atcctcgccc tgttactgga aagtgacaac tgcagcctgt 25441 gtccagattt ttttttttga gatggagttt ggctcttgtt gcacaggctg gagtgcaatg 25501 gcatgatctc gcctcactgc aacctctgcc ttgtgggttc aagcgattct cctgcctcag 25561 cctccagagt agctgggatt acaggtatgt gccaccacac ctggctaatt tttgtatttt 25621 tagtagagac ggggtttcac catgttggtc aggctgttct caaactcttg atctcaggtg 25681 atccacctgc tgaggcctcc caaagtgctg agattacagg ctagagccac cgtgcccggc 25741 ttgtgtgtcc agattatcat catcaggaat ctttcaggtt gaagcaggaa agacaggtgt 25801 gcaggcgtcc caacaaggtg gttggtcatt gcaaatcctc tccccttcag caattgtttc 25861 caaggtggag gtagttaaaa tgaatatcta gacagccctt gttaaaggct ggagtgcggc 25921 aggggttggg tggtgggtga cgggggtggt gctgcagcag ccaaaggggg aataactgca 25981 gaggaggaag ggtcactggg ctgcctgcag cagccctggg ctgaggaagt tccctgcaag 26041 gtgactacta tgactccaga agaaaagata ccaacaggag tgtctcctgg acttttagca 26101 agggatttca gccacaggtt atcagcctca ggaacaccca gatgacctct gcccatgccc 26161 agccattgct ctgtggttct gccagcaatg cgtactccag atcatcagtt tttttcggga 26221 accatattct gtccttaagt tggattcttc aagctttgag agtgcctggg agaaaacacc 26281 tttgggttta cagatttata ttgtgcccaa ggccccacta accagatagc attgtccttc 26341 ccaatggtca ctcttctacg acatccccca catgcagaag caagaagaaa ctggtcttct 26401 cctctcaagc acttgtaaag aaacacaagg ggtggggtga cttgcattca tttcttccca 26461 ttgcaaaggt ggcacccagc actgaggaaa gcaagccagg atacccaggc tatggaagta 26521 ttcatataga gaggatgaaa gggtagggct cctaaggaat gactcagctg gcatgtgaga 26581 gccaggccag gaaaaggtca tagtcagtca cacctgctgc aaggaagggg ctgcctcagg 26641 tgggatgtac atattctcta gagcagatgg gtgtctgctt gagggtttta gacttgggtg 26701 catgaaaact ctctgctatg gggagtttgt gtgggctagg ccagaaggtg gctagcagac 26761 cctctacatt tatgtccctg agccaatcat gttttcactc tccatattgc cagttatgtt 26821 aactgcttaa tttatttagt ctgatcaatg gcccagttca ggcccaccac tcctacactt 26881 gctggggaca gccaggaaac aaccagccac acaatgctga actgtgctgc tgctgggagt 26941 ccagggctca gccctaaagc aagcttgcaa acttcacaca taagtacagt ctatatagca 27001 agtaaactct gaccagagat gacatctggt cccacaactc atcaggtcta tgtacaatat 27061 ttcacatacc acccaataga taagataata ttaacagcaa ccactctcct ttatcaattc 27121 cccctgctcc aatacaacca ccacacattg cattaatacc ccaaacccat tcccaattta 27181 ttaaatatgg tgcaagctca tagacactta gaagaggcaa atctagttgt gatgaagagt 27241 tcctagagct ctgggagcca agatggaggt tttccagtac ctgcacatgt ggctcaggag 27301 gatgctgccc aggagctaat gagttgggag agcaaacatg ggaggtagaa gtcagatggc 27361 ccagctcagg gagctatctc tctcagcatc tcagctttga gactctgcca ccacctcttc 27421 ccagcccaag ctgctgccta aaccaggcat gttgaagggt gagcagtggt tgccatgaag 27481 ccaagaccaa gagattgctg agactcccac tcccctccct cagactctag gcctgtgaca 27541 agccacactg tcctccagaa cccatcgagc tttaggcaaa atgttttagg catctgacta 27601 aggagcccac ccgagtatga gtaacagaag ccaagatctg agctttctag agggcagggc 27661 ctctttctag tcccccagcc tcttcctttg cttgtgactg tttgtttcag acagaaagga 27721 tgttgtccgt aagttctcag ccagtctcca gtattaagtt tgaaaacatg aggtggaggc 27781 ttcctctcac attgatgtgg tccctgggct ggctagagcc agtgagcagg taaacagaag 27841 ccagcccttt ctctaactcc tggcctgttc caccacatat taagggactc ttcaaaacct 27901 actccctcaa ccttgctcca ggaaggacag gatctggagt agaaaggggg agcatcactg 27961 actttttcac accctttgag ccatcagtct ttctttcgtc cacacacttg aagtcctcct 28021 gcacccagct ggactagtga cttcagagtt cacaggcagg ttcaccctgg gatcctgccc 28081 agcaggtcca ggtcacgctg attagtgtaa gcaagagcct cccacaatcc actgccaggg 28141 agttcctggg tggattcgag taacagattc ctcctggccc aaaggccagg ggtttttttc 28201 ttcttcacag ctttcaggca agtattggat ttacagacag taactaggcc ccagaacctg 28261 tggagataca tttccctggt gctgcgccca cttgcctcct ggggaggtaa taggaatggc 28321 ttgttctgct tctaaacatt ccgccaatcc acaggaagcc cggacggccc tgctcacagc 28381 aggaatgggg cacagagggg caatgctggc tgtagaacac ccccctgcag ggggctgggc 28441 cagggctagg gaggtgagtc ccgggtcagg ccgtactgca tgctcacagt gtggtccaac 28501 tcctccagct ctgcaggcag cgggatgccc agctccccca ggtacaggga gagggcctca 28561 aaggagtcct ccaatttcga gaatgctggt ctagaaggaa aagaggtaaa ttagaggcag 28621 gtggaacatg agcagggcca tggggtgggt ggaagggatc taggctcagc tacttcctgg 28681 ttgtataacg tcaggcaaac tccttcacct agtagtgcct ctgtgagaaa tgggcataat 28741 gttaccggta cctcatatat ttgctataaa gattaggtaa cactataaca aggctggact 28801 agctggagtt aatacttaaa aacaatagga gaaaaacttc cctcccatat cttagagaaa 28861 atgcagttat caaaggtgga atcggaaaca ccaggctcct agtgccacgg aaatggcttg 28921 gctgccccgg aagcctaaga cagctcaggc ttactcctcc tcctccatgt ttatgcttaa 28981 atgtcatgtc acctcagtga ggcctgctct gacgaccata tttaaaattg caactggtac 29041 cctgaacaat cgtctttact tgtcttgact ctttttccat agcatctgcc acattctaat 29101 attcaatata gtttacttat tgtatctact gtatgataca gtagatacca tgctagaatc 29161 tatgttctat gagggtaagc acctttattt tgttcgctgc tgtaacccaa gtgtttagaa 29221 ctgtacctag cacacagtag gcattcaata ctgtatatat atatttttta ataaattaaa 29281 gatatttaga atgacaagag gtattagaaa acaaatactg tgtagcttca caatttctcc 29341 tcaagctttt acactggtgt tactgaagga ctgcggtcct tggcctataa aggaacaagg 29401 tatagattct cccaccagaa aatttcaact ctaatgtgga ccgctggaaa tcattttttt 29461 tttttttttt gagctgcacc aaaggcgtac cctacctcgc tttattaaag ggcccgtgcc 29521 gcagagtcaa tataaaaaca caaagtccca tcagtttaat aacaataaaa aaatccaaaa 29581 gtggaaaact gagggggcag gggaagagac ccctgggcca ggggcacgag gagccctgct 29641 catggaacca ggcctggccg cagggtcccc cggtattgct gttgctacga ggtcgggggg 29701 tagcgattgt cctatgggag ccaccgttcg cctgggtcgg ggaccctcac ttcttctggg 29761 gtgtgctcag cttctgcatg gcccggatct tgtccagcag gccagagatg aaggcctctg 29821 tgggtttgta acagtcaacc agcagctcct tgaccctgga agcccaggca tcgtcactct 29881 ccatgtccag gagctcatcc acgtcaatct ctagttctga gatctcctct tcctggcagt 29941 cgtagaggcg cgtgagctgc tccaggatcc actcctctag gttgaggtgc ttccgtagct 30001 ccttggggtc atacttgatg gtgaccttcc cttggcgcct cactgggccc tcatcatccg 30061 cgcagcccgg gccctctcct gcggccccgg ggggggctct gaaagtagat gcgtggtcct 30121 tggccgccac tgcctggcct gggggccggg gccggggccg ccaacgccgc gccccccgcg 30181 gtgccgctgt ccgccacggc ggcctccggg gccacgtgcg cgcgtggggc ccctccttgt 30241 gccgccgcct ccgtgacgcc cgccggctgg ctcaggttag ctcgccggct ccgcggcgag 30301 ggcggcggcg ggggcgcccg ggggcagctg cagcatgcgg agcccccggg cctggcctgg 30361 tcgcgccccg ccccctcccc gccgatccgc ccgccctttg tcccctgcgg ccgccgccgg 30421 ggctgccgcc gccgaagcgc tttctctatc cggaaaccat ttttaaataa agattatgtc 30481 ccaactggtt atcctgggag ggtggagttg ggggtgagga gaatgttgag ggatatattg 30541 gttaagggag attacagaag acgtatttcc tgtgctataa actttggtaa tgtttgaact 30601 tgtttttaaa agaaacatga tttttgatgg caggaaaaga accccatttt tgagacagag 30661 tttcactctt gttgccgagg ctggagtgca atggcgtgat atcggctcac tgcaacctcc 30721 gcccctgggt tcaagcaatt ctcctgcctc agcctcccaa gtagctgggg ttacaggcac 30781 ctgccaccat gcctggctaa gttttgtatt tttagtagag atggggtttc accatgttgg 30841 ccaggctggt ctcgaactcc agacctcagg tgatccacca gcctcagcct cccaaagtgc 30901 tgggattaca ggtgtgagcc atcgcgccca gccagattct cgttttttaa ccacacactt 30961 gaaaagacct agggtaactt tggtaataat gaaattatta cctgagaaga ataaaggtaa 31021 ttttgaaaat acttcctcaa aggagaaatt agctaaatgc ataaaactgc tcactgtagt 31081 taaataccta ttgtaaagag aaactggaaa aacctgaaca tctacattac aatatgttca 31141 gacagtctct caatagaatt tcatgttgtc gacaaaaatg ataaacagga agactggata 31201 gtaaacacaa gggaaatgca gggcaaaata cagaactatt tactataaaa aaaagtatat 31261 gtgtagatat ggacagggga atatcgaaag atgaatacac ttactgtatt caaatggttt 31321 gggtgaattt ccttctttta aactttcttt taatggctgt gtttctgtgt cctatccaga 31381 ttacttgctg ctatgtcttt acctctctct tctccaacct gtacatatgt gttgtcagat 31441 ggctcagggg ttgacagcca cattcttgac ctcaaggctt ctctcacccc tgtgcttgca 31501 gagggctcag gggtggcctt agacagaggc aaacgtccca ggaccctgtg agctgggaga 31561 aaaaggcagg ataccaacct gctctcaggc tccagtctgc agcagatggc ggccagcggg 31621 aagaaggccg ggggacaatc tgtgggaaca aacttctccc agaaaagctt cacgttgagg 31681 ccaaagtcca gtgttcgggg aaggcagtca ggatctgcat acacctgccc aatgatctgt 31741 agggtaagaa gacagctgtt tacaaaggac tacagaggcc acagatactt ggcaagggaa 31801 agggatgctg ggatgtggac aggaagaatg gcagaagagg aggtggtaaa ataggttaga 31861 gagctctccc caatccccaa tcatctctcc aaatcttact aatccagacc tgtatccact 31921 gagagctggg ttactgaaga ggactaagaa ggggcttcac tgcagaagcc aagtgttcca 31981 aatcagcatc tgctcctagc ttccactcat cctaccctgc cttgtctgtg tctccttgtg 32041 gctgtgtgtg tgtgccagag ttaaggctct atgaggtgct ggatctgatt catgtctgaa 32101 accctctggc accagcccag tgctctgaac agaggtgttc aattaatatt aaagaaaaca 32161 attttttaaa acaaatcttc cttcatggcc aggcacggta gctcatgcct gtaatctcag 32221 cactttggga ggctgaagcg ggtggatcac ctgaggtcag gcatttgaga ccagcctggc 32281 caatatggcg aaaccctgtc tctactgaaa atacaaaaat tagccaggcg tggtggcgca 32341 cacctgtagt cccagctact tgggaggctg aggcaggagg attgcttgaa cctgggaggc 32401 ggaggctgca gtgagccaag attgcacctc tgcactccag cctgggtgac agagtgaaac 32461 tccatctcaa aaacaaaaac aaaaacaaaa cattccttca ttatcctaac catcccttgt 32521 ctctcctctc ctctacccct cacaactttg taggaattac agaaggcttt gaatgtgtgt 32581 gtgtttgagt cgctgagata tgctgtcacc tgactaaact cctaaaggta ggaatctttc 32641 caagattcct cactctgtcc acgtggtacc tggcacaggg ctgacacttc ctagagtggc 32701 tacttctact tgaagggaac tgagaagggc tacatttggc ccaagagcta gctagacagc 32761 cacatgaaca gtctcttgcc cttccttctg gaattctgag tctcatctgc cccttatcga 32821 ccatgtgcaa gccactgccc tcttccaagc ctcagtttcc tcaactaaaa aaaaaaaaaa 32881 aaaaacctgg ccgggcgagg tagctcacgc ctattaattc cagcactttt agaggctgag 32941 gcaggcggat cacctgaggt ccggagttca agaccagcct ggccaacatg gtgaaaccct 33001 gtctctacta aaaatacaaa aattagccgg gcatggtggt gtgtgccggt agtcccagct 33061 acttgggagg ctgaggcagg agaatcactt gaacccagaa ggcagaggtt gcagtgagcc 33121 cagattgtgc cactgcactc cagcctgggc aacgaagtga ggctccatct aaaacaaaaa 33181 aaccctgcca gtccttctct taagaccaat gaaacacgaa cgtggcaaag atgtgcagat 33241 gcgttataaa ccataaaacc ctacacccat gttgatttta gtcaagctag tcatccctag 33301 gaggagatgc cccagttccg agggaaggca gagctgctag gcctgctgcc tcgggcatgg 33361 ccttggtgcc agagctcacc tcacagagaa cgatcccaaa ggagaagata tccaccgtct 33421 catcatagct ctttccttgg ggaacacagg agagcacact gttaagttta catcccttac 33481 ctgctctgca ctgggtctag ggtcaatcta agccacagct aagaactagg gctagaacag 33541 gttctgtagg gacaccccac aggctgtcct gcgcagacac cacctgaaaa tagtccttcc 33601 tcagctgtag gtgaactgcc ttggagagcc cttataacat aacactgcct aggatgaaac 33661 tgcacgttaa agtggaaaag aaccagacta gggttcatct agcctctgcc attaggcagc 33721 tgtatgctcc taggcaattc actgaacttc tctgggcctc agactccaaa atggattgta 33781 ttcagaccat ttctgaggcc tcacaaccct gatttcaaga atcccaacca gcatctctgg 33841 ttggtttgag cattatccac cagagggcac cacacttcag ttctcaccat ggagagagaa 33901 ggctaaagct gggacaatag cctgagctct tagcgatccc tgaggcacaa atgccacagt 33961 cagaactcac cttatccccc atgatgggac actgatcagt tcccagcccc ctgagctcag 34021 gagctaaatc gtgaggcccc acctggcaag ggaaaaatat cctcagctct gggtatccgg 34081 tcctacgctt cacaggcctc ccctggaatt cccagggctc tgatgcaagg gcagcatctg 34141 tcctccctct gcgggtgtcc cctccagggc ttcaggactc accgttcagc atctcagggg 34201 ccatccagta ggggtttccc accaccgtgt agcgcttctt gcggtcgttc ttgcgcaagg 34261 tgcgtttctt ggtggtggcc ttctccatgg gggccctttt cctctcttcc actatgagcc 34321 gtgacagccc aaagtctgcc accaccacag tcttgtcctg gtaggataaa agacaggtca 34381 ggacttcatg gggatgttct caaacctggc cctggggagg aagcataggc aagagaagcc 34441 actgagaagg ggagaaaagg tgtatagcaa gggagaatca gagcagaaag ctctatcaaa 34501 tagacacact caggccaggc acggtggctt ccacctgtaa tcccagaact ttgggaggcc 34561 aaggcgggtg aatcacctga ggtcaggagt tcaagaccag cctggccaac atggcaaaac 34621 cccgtctcta ctaaaaatac aaaaaaaaaa aaaaaaaaat tagctgggcg tggtggcaca 34681 cgcctgtagt cctagctact tgggaggctg aagcagaaga atcactagaa cccgggaggt 34741 agaggctgca gtaagccaag attgcaccac tgcactccag cctgggaaac agaatgagac 34801 tccatctaaa aaacaaaaca aaacaaaaca aaaaaacaga cacacttggc atttgtgttt 34861 ctgagggtct ggtatctgtg cacagcagac tcccagccca cagtgggttc atggcagagc 34921 cactggagcc ctggaggtga ggctcaggga cttggagagt ggaactcaga ggtgggacag 34981 ggggttgctt ttgtttttcc ctgtcctgtc ttattcctaa aatgatttca ggcagtggct 35041 agaaccagga taagacagaa aaaggcaagg tggaggcagt agatggggag acttggggga 35101 gacaagctct ttgaatagct acagggcagt catgtgggaa aggaatgaag attgcgtggt 35161 gtgggtctag agggcagagc cagggccagc cggtagacac ccaggaagcc agatagcaag 35221 tcagtaagac aaacaacatt gtaaccacca cagctgaggg gcagtgggag gaatgctggt 35281 ctcaaactgg tccctgtaaa tttactagct cggtggcctc aggctgaggc ttcaggtgcc 35341 accactgtaa aatggggtca gcaatgaact gttttacaag actactatgg ggcaaaagca 35401 ggtaaaatga cagggttttc tacatatagt caacaaactt ctgaagtccc ttctaatcaa 35461 attatttcaa aggttgctaa tgagagaagg tgaaattatg tctgtgtgtg tctcggagct 35521 cccagaggac agagaagggc attggaagaa gcaggaaggg aaagatagga gctcctaggc 35581 aacacctctg ccctctagcc agaggggaca agtcagcctc ctttgtgaca aggaagccag 35641 gaaggatagg accctggagg ccaggcccag agcagtggga cataccaact tgatgaggca 35701 gttgtgcgag ttcagatccc ggtggatgat gcacatagag tgcaaatagg cctggaacag 35761 aagcatgagc tgaggctaca gggccagcaa atcaactaga ccctgaatct ggaattgggc 35821 gccatagatg ctctcctacc aggccatgcc cacccagctt cagaacccaa cctggacaac 35881 agggccaggg cctcctatgg cctctccctt gtcccaatac caacaggaag cctggacttc 35941 ctctgcttcc ctgctgccat ctcctgccca tcctttggtt caccaatcac tagcacagtg 36001 gctggccaaa gccctcacag ctctctccta atgtccccaa cttcagaaaa ccctggccgt 36061 tgcagcaccc cctccacctc tacccatttt acttagatcc aggctgctcc ctcctcccca 36121 ctctcccaat ctctggcttg agccactttg gatccttcct gcctgcaaca gagaagtgac 36181 ttcttgctcc agctagctct gtctttcaga cattcacctc ctctaagact ccaccttccc 36241 cactgtctaa actttaagct ctcccccttc ctaccaaaat gtttgtcaac aaatttgttt 36301 gttatgtctc ctctccaggc ctccaccact gactcttacc actctcagga gcaacttcct 36361 ttctccttcc cttgcaaatg tagaaggagc cactccttgc tctccagccc cctgcaatac 36421 tactttcatc ccttaggctc ctggagctgc tcacacaccc agggagctct tcctgaatcg 36481 ctatagctca tgacctgcct gcctgcctgc ctgagggagt gctcctcaaa gactgtccac 36541 aactttccta tgctacctga tcctgagact cttgtgtgtc tgtttttttc ttccagcctt 36601 ttcagtgtca gccctgctca aagccatctc ctcttcacaa gctgagacac ccccaccagc 36661 tacatggatg cctcttctgt cccacagctc cagctgcccc ttctctgcag attaggtgac 36721 tagcagcctc tccctaagcc agtctcatgg agccagtttc ctgcccagaa gatgctgata 36781 ggttctgcta gcacagaaca gcagggatca atgctagctc ttcccaaacc agttgctcaa 36841 caatcatggc tatcccccat cccccaaatg tctggtaagt gacacctcaa gccactcgta 36901 actcctcctc ctcctcctcc tgttccctca ttgccaacca gtctctccat tctgactcca 36961 cccggcctcc caggctcctg tccccacagc cactacctta attcagaaat ttgaaatgtt 37021 tcattagccc cctatgcagg gctccattcc ctggaccctc cgcattccac cccaatgtac 37081 taaccatatc agtctctctt tttttttttt tttgagatga agtctcactc tgtcacccag 37141 gctggaatgc agtggtgtga tcttggctca ctgcaacttc tgcctcccag gttcaagcta 37201 ttctcctgcc tcggcctcct gagtagctgg gactacaggc gtgcaccacc acgcccagct 37261 aatttttgca tttttttctt tagtagaaac agggtttcac cagttggcca ggctggtctc 37321 aaactcctga cctcaagtga tccactcacc tcggcctccc aaaaaagtgc tgggattaca 37381 ggcatgagcc actgtgccag cccagactca ttttggaaaa accacagctc ttaagctttc 37441 ttgcttaaaa atcttttcct gccgggcatg gtggctcaca actgtaatcc caacactttg 37501 ggagggcgag gcaggaggaa tgcttgagcc caggagtttg agatcagcct gggcaacaaa 37561 gtgagaacct gtctctacca aaaaaaaaaa aaaaaaaaaa gaaaaaaaaa ttcaaaaaat 37621 taggtgtggt ggtacacacc tgtagtccag ctacttggga ggctaaggca ggaggattgc 37681 ttgagcccag gaggtcgaag ctgtagtgag ccatgatcgt gccactgcac tccagtctgg 37741 gcaacagagc aatatccttt ctcaaaacaa caacaacaaa aaaacctcct tcctgattca 37801 agccccagaa acaggaaaaa aaaacaaaaa aacaaacaaa caaacaaaca aaaaaaccaa 37861 aaagccagtc aaccaaaaaa ctcttttaca tttgtctcct atagtccaga ctccatagct 37921 taaaacacag cgaccttgat gtacccccaa actgttccag actactacta ttttcatgga 37981 cccccatgtg aagggcagtg gctaaagtct caggcttggc gaccagactg attagcccag 38041 gatcccagct ctgtcccttt ccagtccagc ttatctggaa accaaggttg tcatgaggaa 38101 taaacatgca catgcatgtt aatccctcag cccagtgcct agcatataga gggcactcag 38161 tacttgtcag ctgttattat ttctatgctt ggaatcattt gtttccaggt gaaatagatg 38221 attttctcct ctttcctccc aggtctttgc tcctgtcact tcaatcctgg tgcctctctg 38281 ccaataagaa ctttatccaa ttctgatcct tcagggctcc gctagaactc atttcccctt 38341 cattgatttc ctaggcagtg aggtgctcta gttggtctaa gggccctcag cacctgaata 38401 cctgtgatgg caagcccagg ccatctcagg ctacagtttg cctctccagt ttgccttgtc 38461 cgtgccctgt ttgcctttag agatgtgtaa ttctttgaaa acagggctgt gaactgagaa 38521 cagttggctg gggtcagttt cagtgacaca gacaaactca atgaatagct gctgacccag 38581 cccacacaag ttgcatagga aggggtctct accttccagt gaagcccaca attctcacac 38641 ctctccctac tctcgccctg ctggcaggtt tgttggtggg actcaccatt ccggaggcga 38701 ttcctttggc aaacctgacc ttctgctgcc aggggaacgg atcctgcaga atggggaatg 38761 attttcagag gctgttgagc cctgtcccac tcgtggtcca ggccccatgg gtcaatgtgg 38821 tctcccagtg aagcctccct gtgtatgctc agtcccaccc agcatggtcc tttggaggca 38881 aagcagggcc cctctgccct ttaagctcct agggtagtga tagcataggt gtctgacaac 38941 ccaccaaggc tcctggagac tatggggtgg tgtgctcacc atactgcgca gaaagtcctt 39001 cagtgtgccc ccctcaatgt actctgtcag gaggttcagc ttcttatcct tgtacagcac 39061 accaatgaac ttgagcacat tggggtggtc caggctgcgc atcactttca cctgggggtg 39121 ggggaagcca aggaaagctg gtggttaact tctcactgtc ttccttacct cttccctcca 39181 aggaatcagc ttgacaagtg ttaccacctg tatctcctac ctcctatctt gtgatatgac 39241 agggcactaa gatcaggatg agggaagacc accaccatat agatggagat ggcttcatag 39301 gccttaccct tggaactgca ttgggagaac tggggtggga gtggccaagt ggcttggagg 39361 gcaataccca gaccacaggt gtccccaact taagttactc catgaagtaa cttgggaggg 39421 gtagtgaaaa ctgtaaatgc ccaaggagtg tgagtgctca gggagctccc ctccaggctc 39481 ccttagcagg gcgttcaggt gaggattata acattgagtt gtactaaatt ggagtgtttt 39541 catgtccatc atcctcataa cctgagagct cttctgaacc ctgtatcttt tgttactgca 39601 tcccttgtac cctgtttgtg ttgcttttag taagtgttca ttgaacaaat aattacgctg 39661 taatttagag ggagtgggat attcctctag ggctcttttc tgcttagtct ttaagacagt 39721 aattccaaac tgtattttct cttgaagcca ttatttgtaa ggtcttctct cttccaatgg 39781 tgacaccaac ctcccgggcc ccctccatct tcttacctca gtcagaaaag ttttctgggt 39841 ctcctcatca catcgaatta actctttcat gaccatcact ttgcccgtgg ctttgtgtgt 39901 cacctacagg gagggtgact gttaatggtg ctgagaccag aggaagttct ggagaactgt 39961 ttgagaaaat ctatgcagct ggccatccca tcccttggag ggcacactga gccgggtcag 40021 aatgacagag ttcatgtgtg gaagatgcag cgtggagcaa catgggactc acacggaagc 40081 aaaccctctt tcctggagag ctgggcatgc acagaggaca ggcaagaacc cagaaaaggg 40141 aagtggggaa gaaggcagga ggggatgctg ccaagccaag caggggctct ggtcatttgg 40201 catagaagtg agtctcctga gtgaacagaa gtctgggggc aggaaggtca gtgccaaaga 40261 catgcaccag ggagctgggt ggatcggggg tgagctctcc agccctgctg cccccaagct 40321 gagacacacg tcttacctga ccccccctcc cagttctcag cagaagagga ggaggaggag 40381 agaaagagag aggttagtcc ctgagactgt tcaccaggga ggcaggagcc attcccctgt 40441 gtgctccttg tagcatgccc ggtactgagg tctgcaggca cctaacaaac acccactgac 40501 tggaggcagg agggccacag gcacagctca acccaacctg gatctccagg tccacaagct 40561 cactgtaaca gcaacattga ccttaggtgg cctgggtggg ggtgaaggat acctctcaca 40621 ccaacaccca aagccaggaa ggaggcctgg catcctgcat ggtgtggggt ggggccaagt 40681 gatgagaaat ccccgaaaga cagtgacaga gggactgggg gcagaagagc aaagcaattg 40741 ttgcctgcgc tcaccttgat agcctgccca aagaagccct tccccaggac ctccccatgg 40801 attaggtcac agggccggaa gatctgctgt gaatagctgc tggaacaacg aagggattct 40861 gagcggctga tgtcacggct gaacagcagg ggctcctttg gggagctggg gccaggggac 40921 ttggagatac tgttactgcg cctgaagatg aggaaagacg ggagtaggga tgtgtagtcc 40981 tcagagaccg gccaagaaac cctggaggga ggaaacttag gggcacctgt tcattccaat 41041 gtctatagtt tagatgttat tattggggaa caggggctct tacatcagat gtatcatttt 41101 ggggttgctg acatcagtca tttatttaag catagtaaaa acacattcaa aatgggaact 41161 ctgcatacat cacacataga acaattaact agattctact agctgataat gctcagtatg 41221 tgacatatat gacaacactg tagatggtaa ttcaccatct cccatatctg cttgatttaa 41281 gtactaactg gtttagagtg atagttatac tagtatccaa aaaatgagca aatgggattt 41341 tttttttttg agacagagtc tcgctctgtc aaccaggctg gagtgcagtg gtgtgatctt 41401 ggctcactgc aacctctgtc tccctagttc acctgagtct catgcctccg cctcctgagt 41461 aactgggact acaggcatgt gccaccaggc ccggctagtt attgttatta ttattatttt 41521 tgagacagag tctcgctctg tcgcccaggc tgaagtgcaa tggcacgatc tcggctcact 41581 gcgacctccg ccttccaggt tcaagcgatt ctcctgcctc agcctcccaa gtagctggga 41641 gtataggtgc acgccaccac gcccggctaa tttttttttt tttttttttt tttttgatgg 41701 agtctcactc tgtcgcccag gctggagtgc aacggcggga tctccactca ctgcaacctc 41761 tgcctcccag gttcaagaga ttctcctgcc tcagcctccc aagtagctgg gattacaggc 41821 gtgtgccacc acaaccgaat aatttttgta tttttttagt agagatgggg tttcaccatg 41881 ttggctaggt tggtctcgaa ctcctgatct cgcgatccat ctgtctcggc ctccctaagt 41941 gctgggatta caggcgtgag ccaccacacc cggccctaat ttttgtattt ttagtagaga 42001 tggggtttca ccatattggc caggctagtc tcaaactcct gaccttgtga ttcgcccacc 42061 tcggcctccc aaagtgctgg gattataggc gtgagccact gcgcccagcc taattattgt 42121 atttttagta gagatagggt ttcaccatgt tggccaggct ggtctcaaac tcctgacctc 42181 aagtgatttg cccgcctcgg cctcccaatg tgctgggatt acaggcgtga gccaccgtgc 42241 ctggccagca aatgggattt tggatgatat tttgcacatc tttgtacctg tccatgtttc 42301 ctgaatttta agatgataaa cacgtatgat ttttacagca ataaagtcac tatgtttaca 42361 ttttcaaaag cggtcgataa tagtcatggt caatgactgt tattttcact tactgaagaa 42421 ggaacaagaa ggttgaagtc cacagcactg caggtgaaca gccctgcgaa ctttctctct 42481 ggggtatgct cctgctttac ttgggctacc cagagaggac tgacgctgga tggctcttgg 42541 cttgtcccga cctacaggtt tacttaccct tgtccttttg ctctgcctcc tgctgaccat 42601 gcccaccaga gcacacaagg ctagctgccc ccctcctctg catggtggct ctacagatac 42661 ctggccactc ccaaatcttc ccttttccag gccatggatt tctagctctt ccctgccctc 42721 tgttgccctg cctcccacta atttcatcat actgaccaag gccaagaggc aatgggagtg 42781 agggcagcac tttaatgaga gggcactgtt tccaacacat gccagctgag agggctctaa 42841 cttactcagt ttgtttggtg aggagagtgg ataaaaaaaa acaaactttt ttttttggag 42901 acaaggtctc actctgttgc ccaagctgta gtacagtggt gtgatcaagg ctcactgcag 42961 tcttgaactc ctgggctcag gcaatcctcc tcctcagcct cttgagtagc tgggattaca 43021 ggcatgtgcc accatgccca gttaagtttt ttattctttg tagagacagg aatgtggccc 43081 aggctgctcc tgaactcctg gcctcaagca atcctccggc cttggcctcc caaagtgctg 43141 ggattacagg tgtgagccac agtgccctgc ccatctaatt aaaaacaatt attattatta 43201 tttttttttt gtagagacag tgtctcacta tgttgctcag gctggtctca aactcctgtg 43261 ttcaagccat cctccgacct aggcctccca gagtgttggg attacaggca tgagccacgg 43321 tgcctggcca agaacctgat tcttagtgat actaagaatg tgtcacgttt actgaacact 43381 gtccctatgg cagcactata caactttatg tccttttcca catttaatcc ttacataatt 43441 ggtgagacag ccacctttat cttaccatct tatgtcacag atgcagaaac aggctcagag 43501 agatgaagtg atttgtgaag ggaggagctc agcctgaagc cccagtaggg tatcagtatt 43561 tctgcctggg agccagtgct cttgacttct tttctgtaag tctccctact tggggcttcc 43621 actcaaggca acttctcctt ccccatggac ctggcagaat ctggggtttg ggggttgttg 43681 gtccctcaag tctaaaaaaa acgttcattc atcttttgct ttattgtttt caaagtcttt 43741 ctacacaatt tattcaatcc tatctgtgcc cctactccag gctctgagct ccatgctttc 43801 tccataggcc atgttgccct tcagggttta gcatcactgc ctcctgtcca gcttaggagt 43861 tcagtggaca gagactgact ctcaggtttc taaggaaacc tggcagacag ctgggtccgc 43921 acaggctctg gctgtaaact gtgatcccca caggggcagc tgggtcagca caatcctcat 43981 agtctgtccc tggccagctg gttcctttcc taactctgca ggcttctgaa agccagctca 44041 gcttcatccg agagacagac ataggacaga acagagccag ggtgggaggt ggcaccttag 44101 ggaacgtctc ctcagtgtcc cctccagatt ctccttggtg tccagggtgc tgagggcgtg 44161 ggggtgtccg gcattctgca tgtgaggagc gagccgggcc tccagccgca gctggtccag 44221 gcgttgggag acggggtcat gttcaatcaa cagctgaagt gtctggctcg tctggctaat 44281 tgcatcctcc acctgtggcc aagagcagta taaagctccc acaggctgcc ccatcccagg 44341 ccacccatgg ccaacccaga tgactcaggc acagtgtggc cagtgctcaa cttgtctaat 44401 ggtggccttt agctgccagt ggcctcacag tgccctctgc ctggcatcct gtcagctctc 44461 ccgtccatca accatccctg ctaccctggg aacctgttac gctgaaatag gtcaccagtg 44521 gtcacaggct caatgctaag acaaagagat gggctggggg taggaaaagg taaaggctac 44581 agcctgattt ctcagaggat ctgttccatg tcccaccctc acaagacaga ttagacacac 44641 actctacctc ctccactcga agtgtgcgga cgggggtccc attgatctcc aggatgcggt 44701 ccccagggtg gatggcgttt cgattgttgg gactgatgtg catccggttg accctgggcc 44761 agacaagagt tgaggcaggg taaagatgtc ctaaaaacct gcttgaccaa ttctgcttgt 44821 gcagggaagg tttaccaggt cccaggccac cagtggcccc taaggggccc aggataccca 44881 cacagaaagg caacaccttt tccaggatgc cccatctcag tgaagggaat tgccactcac 44941 cctgtggcct aagccagaaa cttgaacatc atcttcccaa tctggagttc cctcatctct 45001 cctactccca tctctaagaa ctctgccttc tttccatcat tgtagttatt cacatctctt 45061 acctggatca cggtaatagc ttctattttt tttttaaact ttgtttttaa tttttatttg 45121 agatggcatc ttactctgtc atccaggctg gagtgcagtg gcatgatctg ggctcactgc 45181 aacctccacc tcccaggctc aagcaattct catgcctcag cctcctgagt aactgggatt 45241 acaggcgcgc accaccatac ccagctaatt tttttgtact ttttgtagag atggggtttt 45301 gccatgttgg ccaggctaat cccaaactct tttttttttt tgagatggag tctcgctctg 45361 tcgctcaggc tggagtgcag tggcgtgatc tctgctcaca gcaagctccg cctcctaggt 45421 tcatgccatt ctcctgcctc agcctcccga gtagctggga ctacaggcgt ccgccaccat 45481 gcccggctaa tttttgtact tttagtagag acggggtttc attgtgttag ccaggatggt 45541 cttgatctcc tgaccttgtg atccgcccac ctcggcctcc caaagtgctg ggattacagg 45601 cgtgagccac tgcgcctggc ctaatcccaa actcttgacc tcaagcaatc tgcccacctt 45661 ggcctcccaa agtgctggca ttacaggcgt gagccactat gtctacgtct ggacttttct 45721 tttaattaag aaaaacagag acggggtctt gctatgttgg ccaggttggt cttcaactcc 45781 tggcctcaag caatcctccc acctcagcct cccaaagtgc tgagattaca ggcatgaacc 45841 accatgccca gcctacaata gcttcctaac tagtctctat acctttaggc tttcccctct 45901 ccaatccaac ctccatacca gtgatccttc taaaatgtta acttggtcac aacaccttgc 45961 tgcttaaact ctcaatggat ctgctggatt ttcataatcc aggccctcta tacttaacag 46021 ccccttttct ggccacactc aacctacatc agacatgatt aagactccta cttttggaga 46081 aatgcttctc cctctgccct gaacacaatt ccctctcctc ttcatctaat aaggttgtag 46141 aacagacctt agctcctcct gacttcccaa actctagtat ctttcacctg ctgtgcaagc 46201 actagcctag tgtccaacac atggtagata cacaatcaat atttacatgg taagcggcca 46261 gaggtacgac attactggta ttagatagac acagactctt tctccatctg gtctctggcc 46321 tgaatctagg atagggtctt aacttctttt gggccatgga ccacattttc tccattcatt 46381 ctctcgactg acaaaataat aatcaatcaa ctcctactat atgccaggta ctcttccaag 46441 ccctgggcat gcagcactga ataagacagc aagtttcttc tggaatttac attctagtgg 46501 aggggataga agataaacaa ctaatacaca taatcaagcc agatgatttc aaagagcaaa 46561 catgacctga aacctgacac ctgaaaggtg agtgattctt gccttaagta tcgggtagga 46621 agtagcctgg gcagagagaa cagtaagtgc caaggcccag gtacggttac agcatagtgg 46681 ggtccaggag aaagatatga gatgagaata aagggagttc tgcatgagac tttggaggtc 46741 taagccctct gaggcccatt tacagactct gctcaagaac ccctgctgaa gggttctcaa 46801 aatacttact ctttcacttg cacagtggtg gcgtagttgg agcaggcact ctccacggac 46861 acggagaagc cccgcctgcc ttcagtggtg gccggcatgg agatgagcgt gacagagtag 46921 ggcagctgct cctgaacaga ctctgtggag agtctctcaa acatgggtgc cagcaccacc 46981 tcattgtggc acttcccact ggggagaggg cacagggggg aataagatgc tagagtccca 47041 cacagaagcc ccccagcaac cactgcacca tgtggcttta agcaacagta cccatagtct 47101 cactaccacc ctctttgtca tgattctagc tggctgctca gaatgaggag cctgggttct 47161 cagggtgggc tgggactggg ggtgctgagc tagatctgca gcaagtggta tatttggaaa 47221 aacactggcc tgggttgggg tgagatggtc tgggttctct tcactctacc tttgactcac 47281 aatgagaacc ttgcccttgt actattctat tccatgggcc ccaggctgac ttcttttatc 47341 aaccaaggtg gttggacagg gtgagctcta aggggtcttt gactttaatg ttctgggatc 47401 gtccttgaag agtgcctccc actgggagga ttgcagggca gagtgaagac attttcaagg 47461 ttggtaagga aaaagggtag agacatgtgt aaatcattct gcttacaagt taaagagctg 47521 gttaagagca ctgctgtggt aagaatccaa tgctctgact accctaacac tccttctgcc 47581 caactcaccc tgccactgtc cctccccgcc agccactctt atatgggaga ggatagacaa 47641 aggaccacta tcttaccagt agagggtggc atgctgcacc agtgcatatg catccccatc 47701 ctcaatgatc accttgcagc tcatacaggc aaagcactct gggtggtact tgaactcccc 47761 agccacctgt gagcaggtgg ggagaagggt gtacaaaggg aggaaggaag tgagttatca 47821 ctgttgctgg agtgaggaca ggggtgggtg aatgacctcc aagcaaggcc cccgtcaaaa 47881 tccactcttt cactctctgc cctagagctt ccaaaggaag tgtcagattg gtacctcctg 47941 tcatttctga gactgcacaa aactgttttg gtctcagtta tgagaagaaa tccccagaag 48001 taatcatccc cactaaaaaa gatcagttct tccaatacag ggttggcaag tctgtcacta 48061 tgcttaaagt cgtttgtcca gagcaggtat aggaggtagc ccctctctct gccagcctat 48121 tccagagtct ccttttctga ttccctaaag acaacctaag tatatcctcc tctcagcctg 48181 taacactctc cttggaggtc cttatccaac tatattgtgt ctcctctctg cctgcctctc 48241 ctcccactcc caacgttcaa ctattaagtc tttgagggaa ggatcttacc tttcacttag 48301 taggtgctca attgatacct gctagttgaa tgaatgaata aataaaggga gatgccctgg 48361 ctgatggaaa gactaaaaga ttaacaggcc acagggaact ggaagcactg tcaatggact 48421 ctgaagacca agaggggcag atatgaaggg attcactcac cataaaaggc cctgtcatca 48481 gcagggagca cccatgacag aactccccaa acttccccca gtagtccttg gggcagtaga 48541 gcttcccatc cttctcatag taccagttgg tgagggaatc ctggcattct gaacacctga 48601 aaggcaagac aagaactgaa attctcctgg atccctggaa ccaggccaca taagctgtcc 48661 ctggttccta ggtatgacaa caaaatggac agtcctcttt gatccaaagc caagatgggg 48721 caaagttaca gccttgtgct atctccccaa gggaagctgg ttcagactga acaaggcccc 48781 agaaaccagc tctgtgttgc ctgcctcagt ggacagacta gtgtttcata acctctttgg 48841 gtcttggggc cctctgagaa tttgataaaa gccaagaagc ctcccctaga aaaatgaata 48901 tataaagaag atgtgtatat aatttaggtg ggggacaggg tgtatatata aaggattctc 48961 tgaaacctat ttatggcacc agttaaggat ctctggtctt gacagatact tcaatgaaca 49021 aaagcagttt aatcatctcc actcttcaaa actgacccat caccaacagc gttaccaatg 49081 taatgttctt gccaaaaata cgttaagtat aaaaaaacta atcaggcaaa tctagaatgt 49141 gggttattct ataagaataa ccggtcttga ctcttctaaa aagtcatggg gagaagaaaa 49201 ggcacagatt ttaaaaaccc tagagacaca gcaaccaaat acaatgcctc aaacttggat 49261 tcaaatttag attaaaaaag gaaacagcca tgaacatttt ttggaacaac tgaggaaata 49321 tggatttgga ctataatatt atattgtaaa attgccgtga attttcttag atgtaatgat 49381 atctgactta tataggagaa ttcttagcag aaacatattg aactgttcaa gggtaaaatc 49441 aagatgcctt taagttactc tgaaatagtt cagcaaaaat aaataaataa aagtatacag 49501 atagagggag agagatatat acaaagtggc ataacattaa caaatcatga atctatgtga 49561 aaggtatgta ggtgttcttt gaactacatt ttcaactttt ctatatattt gaattttttt 49621 tcaaaataaa atattgggga agaaacccaa cattctatta gcactgtgta agatgtacat 49681 gttagatcca tagctcgctg ggcggggtgg ctgatgcctc taatcccagc actttgggag 49741 gctgaggtgg gaggatcact tgagcccagg agttcaagac cagcctgggc aacatagtaa 49801 gaccccgtct ctaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaatcta tagctccctt 49861 ctcccctaat gacctggcta tgaagctcca cctctcaaaa gtctggtgtg ttctcaggac 49921 ttacctgcac agacaacctc ttcctccctt ctgagcttcc caacacacac agctaagagc 49981 tccctggcct gctgagccca tgccatctct ttccttctag ctccttcttt tccctattct 50041 tgttgtcaag gccttgcacc ttccacagtg gccacattct tgcttccaag aggtttgaac 50101 attgcttggt ttgaatcttg atcctgccac ttactagtta catcatcttg ggcaagatat 50161 tttacttctc tggcctagtt tcctcatcta taaaatgaac tgctgtgaaa attaatggaa 50221 ataatatgta ttaaaaatat agagagcagc tgggcacagt ggcttatgct tataatccca 50281 gcactttggg aggccgaggc gggtggatca cttgaagtca ggagttcgag accagcctgg 50341 ccaacatggt gaaacctggt ctactaaaaa tacaaaaaat tagctgggcg tggtggcatg 50401 cacctataat cccagctact cgggaggctg aggcaggaga atcactggaa cccaggaggt 50461 ggaggttgca gtgagtcgag atcacgccac tgcactccag cctgggtgac agagtcagac 50521 tctgtctcaa aaaacattat agctagctag ctagctagct agagatagct tatataaaag 50581 tcctaatata ggcagggcgc ggtggctcac acctgtaatc ccagcacttt gggaggctga 50641 ggcgggcaga tcacaaggtc aggagttcaa gaccagcctg gccaacatgg tgaaaccctg 50701 tctctactaa aaatacaaaa attagccggg cgtggtggcg tgcactggta gtcccagcta 50761 ctcaggaggc tgaggcagga gaatcgcttg aacctgggaa gcggaggttg tggtgaccca 50821 aaattgcacc actgcactcc agcctggaca acagagcaag attccgtctc aaaaaaaaaa 50881 aaaaaaaaaa aaaaaaaaaa aaaaaaaaga cccaatatag tagaaactca gtatgtgtta 50941 acttctatta aagatgatga caatgtaatg gcagctcact ttatcagcca cttactgtca 51001 ctagctggac aacttctctt agtagctttc tctcattgaa aataactaat agcatcctat 51061 tactacaaag agccaatgtg aagaggaaat aagctaatgt atgagagaaa gccttttgcc 51121 aggcaagtgc attattataa cttgctaagc actgcagata taaaagataa atacacaggg 51181 caccctactt cacctttcat gagcttatga agagttgagg aagagaaagt ctaaataaaa 51241 ctcaacaatg caaggtgaga ggccacagtg agttctacta cttctgagca ctgcgggagg 51301 tagagaagag gggcaggaat gagttgaagc agctgagcta gggaagtcta gagaagcaag 51361 tccctgagaa tccctttatc tttctggagt gcccaattcc tcctgggtgc tcaacaaaca 51421 ctgatggcta tttttaccct atagcttttc ctacttgtct tatggaaggg gaacttcagg 51481 gtcttaaggg ctgggtatgg catgaagagt atggctgtga agtgtctcaa aactgcacag 51541 tgaggttgga ggcagagggg aagactgttc ttggtatgtg gctcacactc ctctcaatat 51601 aagatacatg tgggatgagc tctaaatgag gaggagagag atatgggttc aagttcttgg 51661 gtgctaaatt gccttggaca agcccctttc cctctctggg ttgtttcccc acctgtgaaa 51721 caagaggaca aacaataccc tgggtcccct taactctgtg catatctgtg tttttcccta 51781 gatgatctga ggttctaatc ttagcaccac catttaccac ctaagtgatc ttgggctagc 51841 cccttcactt ctcatttgtg aaatggcaat gtcatcacct actttcaagg tagcagtgag 51901 gattttttct tttttctttt tttttttgag atggaatttt gctcttatcg cccaggctgg 51961 agtgcagtgg cacaatctcg gctcactgca acctctgctt cccgagttca agtgattttc 52021 ccacctcagc ctcctgagta gctgggatta ctggcgcatg ccaccacacc cggctaattt 52081 ttctattttt agtagagacg gggtttcccc atgttggtga ggctggtctc gaactcccga 52141 cctcaagtga tctgcccgcc ttggcctccc gaagtgctgc gattacaggt gtgagccact 52201 atgcctggcc tgcagtgagg attaaatgtg gttctgaaga actatggggt gcccaacaga 52261 gggctggcag taggagcagg ggcttccttc ttgcctcctg ccttctgact tagaagttcc 52321 tgggcaacca tctttcttct tcactagact gttagctccc tgaggacaaa aacagcatcc 52381 cacttactct tggtagactc agcaccaagc aaaggacctg gaagagcaaa gccttaacaa 52441 atatttccaa actgaaaatg gtaggagtgg caagaaccag tagcccaggt aggggagcac 52501 aagggttcca tggctcctgg tcaccaggtc cttcttttct tctgtgtgca gacttatttg 52561 cacccaaatg caggcctggg ctctgttacg cccttgcaat ggtatgtgcg ccgcagggca 52621 tgggagcacc agaacacacc ctcctttcca ttgccaatat cctgcttttg tttgctgcaa 52681 gtagctctgg ccaggcccag gagagggagc agggagacaa acaagtacag gcatgcagga 52741 gacagccagg ccagcctgct gggaagaaca cctggctgga gagcaacgtc accagtccac 52801 atatctgtgt caaggaatgc ctttaaaggg cacagccagc tatgtggcca aagtcactgc 52861 cacacatgtc tctgaagaac tggaaggtga acagctgaca ggagggctgg gtatggatcc 52921 ctaagctgtg ttttgccaca gaacacaact gggttctcag accacttaag aagttgccat 52981 ggtagacttc agagttctag cttagctatc aattaacagc tactatttat tgaatgccag 53041 ccataatcca ggtcctatat gaggtacctt atttatatta tctctaatcc tgacaaccct 53101 acggtagcag gattatccct gttttacaga agactgaaaa gcagagttaa agtcaaatca 53161 ttctgcaaag ctgcccaata cataattgat ggagctggaa tgtgaagcta tgtgtgcttg 53221 gctgcaaagc cacatccact gtactatgtc accggagtga gacagcacat ctcaacagct 53281 gagcctcagg attctcacct gtgaaaataa ggaagctgga gccctgacag ctcaagaccc 53341 tttgagtgtt aacacttcgt gatgcttctc ctttagccca agtactgaat gttaaatagc 53401 acaattctcc tccttcatat acttcacagt gttcaattcc ccagcagtct ctgttctttt 53461 aaacaaagaa cccagaacac actttggaac tgtaaggtgc tactcgaaca tgtcccacct 53521 cagatgctta gaggccatgg acatcatacg ttcttcccaa agccaaggct gtttttctac 53581 ctttcaattt ttggaagttt caagtgaaag tcaaaatctg ccctggctgt aggtagaaga 53641 gtacccagct gggaggtggt aaagctgagg tctgaaccag tcatcccact gtcctgagga 53701 gccagagtga attgagctga aatcctcagg ttaggggaga tcacctagag gcctacatac 53761 gctgaggcta tgcctaagga caacctccaa aactcatctt ccaaaacatt actgaggggt 53821 tctgagggct tgggcagata caaagtgcat ggaaccctga tggtcagttc agtgggtcaa 53881 cccagatcca cacagagacc ttttgagtca gcagagaaca tatgagtttt ttaaaaaaga 53941 taatcagttg agattttctc tttcaagacc tggatccata ttccttctct gccatttacc 54001 atctgtgtca cccaaggcca gttacttcat gactctgagc ctcagtttcc tcacttgtac 54061 attgctaata atatcttatg gggttatgag aattaaatta tgtacagatg tcttgcatga 54121 tgcctggctc acagaaggta gccagtaagt gttcatttct ttccctcccc aatttgagac 54181 caccaagggc aaatgcagac agaccccttt ttctcctatg cctagcaata tggacctgct 54241 tctcagccca gaatgaaaca gatttctgta agtgaaaaag aaaggtccta aaatgacctt 54301 tatttatcta tccaaataga aatcaacttt tttttttttt ttttaagaga cagagtctta 54361 ctctgtcacc caggctggag tgcagtggca caatcattgc tcactgtagt ctgacctccc 54421 aggctcaaga gatcctcctg cctcagcctc ctgagtagct gggactacag gtgcacacca 54481 ccatgcctgg ctaatttttt attttttgta gagacagggt ctccctatgt tgcccaggct 54541 ggtcttaaac tcctggactc aggtgatcct ccagccttgg cctcccaaag tgctgggatt 54601 acaggaagga gccaccaagc ccagctgaaa tcaattgatt ctaatagatt tctaataatt 54661 aggaaatagt tactgtgcat ctaggatgtg taagaacaag caggcaaggt gctgtgctga 54721 gagaagccaa gtgagataag atagaggccc agccctcact tacatccaaa ttcttcagac 54781 aaaaatgttt acaggtaaat tctcaacctt atggggtaaa agaaccagtc attattgcag 54841 tggatttaca catgttaaag cataagccct cccatgtcct gaggaaggcc tggctggatg 54901 gtcatccctt aacctcccag tgaacagcca gttgtagaga agcttctttc agcccctagc 54961 ctctgaattg agtgggctga aggagagaat gggaagctcc ctccaccaca gcctgagggc 55021 acctggagag caccacaact ggggatgcca acttttcccc agggcccaga agggcagata 55081 tgacaggggc ctagagagac tctgttccat aagagctcag tcaaaaatcc ctttattact 55141 cctccttcct gcagatccta tagactgaaa cagtctgcca attaagctgt aattattatg 55201 gaatatgtta ttgttgtttt aaaaaattaa ccaaacacat aatgaagcct acgaacagca 55261 agaaataagt gttgctagag tgcactgatt cagttccaac caaaaaccag gaagactgta 55321 gtttcaagta ttttgtgtag ttgtactgca gttaaagaca gggcctccat tatttggcaa 55381 ataatggaaa agagcttgtg tgcctctgag gagccacata agttgcagag ctgggaacca 55441 ttcacctgat ccaagtcctt catttttcag ataaggaaaa gagtcccaga ggacctaagg 55501 cacctgccaa aggttactgt gaagctgggc agagttatgc ataatttttt ccctttaaac 55561 atggattgct ttggaacaca aaggttcatt tgttttcttc aagagataat gtgtaaaccc 55621 aagctcatta aatcagtatt ggacattaag tgaaattcta taaaaagaaa aatgctttaa 55681 agatgtccga caaaagccac tccctaattt acctcttttg taaattccaa tttgaggtat 55741 tttcttactt ggaagcacag tggtaattgc agctcacttc tcacgctagt gtggtaggaa 55801 agacagtgca ttctctgctg agtatgtatg gcctgtcagg caggggtgta tgggacacag 55861 aagttgtatt cttcattccc ttgctccctc ttcctccctc acatgtcacg cctgatcatg 55921 tccccatgga atctggttgc gtcactttcc ttgcaaccac agccccagaa ctcccataga 55981 ctttgtactc ccccagaggc ctagggtcaa gaacttaatt tgtttcacag aatccagcaa 56041 ggaccctgtt caggagccca agaagtctag cttctacttt ctgccaagga tgaaactgag 56101 gcccaaagag gagcccttga ggatatccta taataaaaga gggaggggct aatggcagaa 56161 acagctttct gaccaggtag tgatgtggta acggatacct gggagagcag agggtctggg 56221 tggtccctcc ctggggggaa agagccctgt gtggcagcta tctcttcaga agtgacttca 56281 ggaatgtggc ctcttatttg gggaagggaa gaatgtctac gaaagtcagt cagtaccccc 56341 cagactctag ctgggcaaac ggagcttggg ggtaggggca agaagatcag gttctccaga 56401 aagcctggga cagaacaaga caaacggcag gtggacttgg gtggcattag tggtctggcc 56461 cttaaaggtt caggttgaga gttcactatc tgctctagct tctcctcttc ctgagaagtt 56521 gtgtcctccc tggcttccct gcctctccaa gggtaaaagg ggtatagttc ctggctccta 56581 aaggaacaac ccttctccta agctgccttc ccaggattaa gagaaattaa tgaggacctg 56641 aaatgtggct acttcaacac ttggcatctt aattccacat gtaaaagctt tctcctaaac 56701 tcaagatcaa tttccattgg ctctttagaa aacaatctca gccacacctt caacaagtca 56761 ggcaaactat cccttttttt actctctgct aggagaggaa aacaaacctg agaatttcct 56821 ctccactagg agaagaaaac aaacctgcag ccccctgaag gatgagagaa agtcaggaag 56881 acagaaaaag ctgcccagtg aggctccata tgttaactct gttgaaactg gttgcctcat 56941 ctgtaaaatg gggatatata atctctttag gctgttgaga gattaaagga taaagtgtct 57001 agcagggcgc ttggtagagt cagcattgag gaccagttcc cttccccctt ccccctgtcc 57061 agacacttcc cagcaatcta ccagggtccg gtcattccac aaaaccacat ggtgcactgc 57121 ttttctgccc agggcctggc cacctgacta gtatcccggc agcaagatct gggcactgga 57181 cctagcttag agaagccaga agggaaaagt cgaccaggac cactggtgag ggggaggcag 57241 cctttttgga acaacaggct gctcccaaag aatgaaaaca aaccccaccc tagcctggaa 57301 tggctccatt cagaagggac ccagagggcc agaaaccccc tggggagttt ctatgggagg 57361 tggagtgggg caatgagcaa agaaagccag gttagaggta gagtttccac atctccctct 57421 gagaagcccc acagagagag gctcgtggag gtacttaggc ctagggactg aggaggtgtg 57481 ggggctggcc cagcagctca ggaacactct aaagcccatt gctccagtct tagaagcact 57541 cgcccaggtc tttccccagc aactacattc acctgggagc tcagaagccc ttcaaggcca 57601 gaggcaaatg tgcctgccat taagagaaca aacgtttata aggtgcctac tgtgtaccag 57661 gctacccaca cccaggactt gccagtttcc acagtttggc aaaacctgcc catcctgctt 57721 gttgggacaa attcctgggc aggcagcatg gcccgcaggt aagacatgtg agagctggaa 57781 acagcactct atttgggcta ttccccccgg aagcagtaac atctacaatg ggcaggtcag 57841 tggcatcttt tggaggctcc ttcaggtttg tctgggatag aaggggagct ggagctccct 57901 aagaagttcc ttctgggtat atccctcccc tcaccacttc caggacataa tgggggtacc 57961 tcacaaggcc gattctcctt gggtataacc tccgctggct tctgcccggc agaaccgatg 58021 gtaaaggact aagtgagaag gagatttagg gagaccaccc tcctctcagg accaggggcc 58081 aggagagatg gagaggggaa ctcggagacc aactcaccga aacaggtctc tggaggtgaa 58141 gtaagccggg actgacaagt aactccccat cttcttccca caccagatag acaggctcag 58201 ctgcaaatcc gtctgtgact gcctcaggcc ggctgtagat ggccaccatg acaggcccgg 58261 ggctcccggc acctctggct gcaggcaggc tgccctgagc tcacctctcc ctccggagca 58321 gctgcgggag cctgaggact ccaggccact tggcccccag cagattccca caggggctca 58381 tggctgggat ggctcctccc tagttctcag aagctcaggc ggcccctctg gactccgaga 58441 aaacgacctc tctgtggagg ctacacacac actggcattc ttgagcaggg gccggtgact 58501 agctaagaaa gaagcgctgc tgtcaagtta gccttcttct tcagtcccac ccctgaatga 58561 gggagggagg gaggaagagg aggggctccc gcagaaccga ggtgtgtggt tggagcagct 58621 gttaggaaac agaccttgca ggcaaggggt ggagctgggc tggatcgagg ctgcaggcag 58681 gcgctgcaga caataactgg gaccaattga gcgtccatcc tgagctcagg gaagggcaag 58741 gcaggataga gtgtgaacat atctggcccc agatgtcctc aagcagagga gctgcctaca 58801 ggacagtaat tctgcttgga ggtggaaaaa aaaaaaaagc agaggatgcc aggtccagga 58861 agagccacac cagacggggg gacctctggt aggagagaca ggtccagaga gccatcctag 58921 ctctgagctg ccatctgttc tgtcctggca ctagggctcc cattcattcc ctccctctgc 58981 ctgtcctggg gacagcctca agagctgacc aatgggaggt gccacttcca ctccagggca 59041 gggctgtgtt gcaaataagg aaacaaggcc cagagtatta acctacacct gatgtgtctc 59101 catgtggcaa acactatatg cactctttta tttctctgag acaggaatta ttcccactct 59161 acagacaaga aaagtgaggt acagagagat taggtaactc actcaaggtc aaaaagggag 59221 tcagtggcag ggctggaatt taaatgtagg ctcatgtgac tctagagctg gagctctccc 59281 actaccaggc ttcattctgg ctctttaagg agaaaaatct tttccaaagc tctctgcaaa 59341 taggatggag cccaggttac tctcattcac tcactattca ctgagtgttc acatgcagaa 59401 caccatccat tggcctcagt ggacagtgga gcttcttcta cttgtcctgt gataaccatg 59461 cttttcaaac tgaaaaatgg ggcatgaggg agaaaaaggc ctggggtgcc aaaagggatt 59521 gataagccta ggtatctgga atgggagccc acctgaacag cccccagttg tgacgatgac 59581 caaaccagag cctaccatgt agaaagccct tgtctccctg cctgcaggat tgtaaaggaa 59641 aaagcaaggc tgctggagga aaccagaaca ctcagatctc actactggat ttggggcagt 59701 gacatcagct ctcccttacc aggcacacac cgctcaccat tgtaaagagc cagaagtcac 59761 cgtgcagctt ctatctctta ctgtgaacca tggcaggaaa gtctgataaa gatggtagat 59821 ggggttacaa gggagccttt tcccattcca ccaactggac acagggacaa caatgcatgt 59881 gataagagca catgggtttg aatactgtta agaagctacc atttattgag cagttaatat 59941 cttctaggcc ctgtgttaag tgcttagcat ttattatcca tcatgtttaa attgccagta 60001 aaaggaggaa acgtcatcag cctcatttaa cagattagga aataggttct gaggggttat 60061 gtcactattc tcaagacaaa tagctgataa gtggtgaggg tgtttttttt ttttgttttt 60121 ttttttttga gtcagggtct ctctctgtca ctcaggctgg agtgcggtga tgccatcatg 60181 gctcactaca atcttgacct cgaccttctg gactcaagtg atcctctcac ctcagcctcc 60241 agagtagctg ggactacaga tgcgtgccat cacacccagc tgtgaggctg agtttcaaac 60301 ctggtacata tggctctaga gcacaggtca aggcactttt tctgtaaagg gccagagagc 60361 aaatatctta ggctctgcag gccatacaga ctgtcgtggc cactcaattg tgtcattata 60421 gggcaaaagc agccatatat aaaacataaa ctaagggaca tggctatgtt ccaacagata 60481 tttacttatg gaaagatttt tttctttttt ttttttttct tttgagattc agtctcacac 60541 tgttgccctg gctaaagtgc agtggtataa acatggttca ttgtagtctc aaccacctaa 60601 gctcaagtga tcctcctacc taagcctcca gtgcatcctg ggaccacacg tgctggccac 60661 catacccagc gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgcgtgcgt gcgtgtgtcg 60721 gagtctcgct ctgttgccca ggctgcagtg tagtgtcgcg atcttggctc actgcaacct 60781 ccgcctccca ggttcaagca attctcttgt ctcagcctcc caagtagctg ggactacagg 60841 cgcatgccac catgctcggc taatttttgt gtttttagta cagacagggt ttcaccatat 60901 tggtcaggct ggtcttgaac tcctgacctc aggtgatcca cttgcctcgg cctcccaaag 60961 tgctgagatt ataggtgtga gccactgcgc ccggccctgg cttatttttt tttatttttt 61021 gtagagacag ggtctcactt tgttgcccag gctgggacac tgatatttga gtttcatgta 61081 atttccacat gtcacaaagc attcttttgc tttttttccc ctaaccatta aaaatcacag 61141 gtcatatagg ccaggcatag tggctcatgc ctataatccc agcattcgag aggccaagga 61201 gggaggactg cttgaagcca gaagtttgag accagcctgg gtaacaaacc aagaccctat 61261 ctctactaaa aaaatcagtc aatcaggcca gtcatggtgg ctcacgcctg taatcccagc 61321 actttggaag gctgaggcgg gcagatcacc tgaggttgga agttcgagac tagcctgacc 61381 aacatggaga aaccctgtct ctactaaaaa tacaaaaatt agccaggcgt ggtggcacat 61441 gcctgtaatc ccagctactc gagaggctga gacaggagaa tcgcttgaac ccaggaggca 61501 gaggttgcgg tgagctgagt tcgtgccatt gcattccagc ctgggcaaca agagtgaaac 61561 tctgtctcaa aaagaaaaaa aaaatcaatc aatcacaggc cgtattaaaa taggcaatgg 61621 gccatagctt gccggcctct gttctagaac cacagagctc aaccactctg tttactacct 61681 ctaatagtga agaacagctc ttggtgtttg tgcagagctc agccatatct atctatctat 61741 accacgaaga caccacactc tagccccaga ggcagccaga agaaggaagc actgaacagg 61801 tgctcaccct gtattaaaca ttgcttgcag cactacttgt atgaatgaat tttaaccccc 61861 actattccac agtgctgtga tttttctctt tacagataaa gcaactgagg tacagggagg 61921 ttaaatcact tgcctgaggt cactcagctg taaatgtgag aagtgaagcc aggatttgaa 61981 cccagactag acattataga caacgttcta ccatgctata ccaaagctca gcctgggccc 62041 agagccagac tgtctgggtt tgaatgcgac aatgcactgc atgtaagccc ttggcacaaa 62101 gtctgggaca cattaaaggt tgataaatgt tagccattaa tatcattaag actattgtta 62161 caactactgt gggctagcta tgtggccttt agcaagtcac tctctctgtc tgggccttgg 62221 ttaggtctct aaatagctcc cagcatcaac atttttcacc cctatgaggt tgcagagggc 62281 tctctatttt ccttgggtct caagttgagt ttaaactctt gagcaggagg tcaggatcct 62341 agccgagggg acacacaagg gccacagagg aggtgaggtg ggctggaagt ggggaaataa 62401 catacgggat actgaaaggg ctgcccagct ggagagctgg gagcaatctt tctcccagcc 62461 acacagacaa ctatttctca ccaaaaatct attatgttcc cctgtaattc aatcaggcaa 62521 gaagggggcc ccaaattaag tctgggacta gcaaattatc ttttaattgt accctctgat 62581 ttccatgtga tgaacacagt actaatggaa gaatatgcaa gccacaaaga gactgaggcc 62641 agggagccta tgagcataac tcctcttgac atttcacaca gctctgctgt tccagcggcg 62701 acgggctcag tgagtcggtg tctaggaatg tgagcaggag gctgcaggcg ctctcctctc 62761 cccctactcc tcttacaaag gacttggtgg gtgggtgggg aataagtgag aggacatctc 62821 aatgccatga aaactgcctc cccctggaag aggaacagcc ttctccccat gtcctgagtt 62881 gaagcagaag cccctcccag gtggaatcca tttgacctcc aaaaattcag acttcaggag 62941 gaacctttga aggattgatc accagatgtg ataagctgct ggctgctcag acccaagtgt 63001 gtggttcctc agagccaact tcaaagggcc caagccaccc atcactggga gatgaaatgt 63061 cacactggag gatatatttc accctcccac ccttggaaac tggtcctgga tagaaaactg 63121 ctcttctagg tcctcagagc agctggcaga acattcatca gttatggctg ccagaggcaa 63181 ctgcaggaat cctttgactg gcaattatag tagggataaa atggcaggag aggtgttaat 63241 ctccagcaca tttcataagt tggaaacctg gcacggagct gaacaaactg actccttaga 63301 accagctgag gcacaggtct tcaaagcgtc tgtgacaggc tggagaacaa tagccttcat 63361 aaccggcagc cccaagtgaa tgttcaggca aggtgtggta agtgaatgaa tcaccccaac 63421 cttttgtcgg aagaaagctc tggtcaaaca attcaaaacc caagaaactg tttgtgtgaa 63481 gccctctatt ggctgggagg actccaatca aggaagagaa tttttgagga ctccgaagcc 63541 tccaaatact gaaaggtaag cattaccacc cccatctact aaccatcttt cgtgccccac 63601 tcctcccctc ttccaaatca ctttctacat tatagcagca gtcatccttc taaaacaagg 63661 gtcccgggca ggcatggtgg ctcatgcctg taatcccagc actttgggag gctgaggcag 63721 gcagatcact tgaggtcaag tgttcgtgac caacctggcc aacatggaga aactccatct 63781 ctgctaaaaa tacaaaatta gctggatgtg gtggtgtgtg cccgtaatcc cagctactgc 63841 ggaggctgag gcaagagaat cgcttgaacc cgggaggcgg aggttgcggt gagccaagat 63901 catgccattg cacaccagcc taggtgacag agccagactc cgtctcaata aataaataaa 63961 taaaatgtcc cagcaattcc cagcttcaca gggttatgag cattaaatga gaacatatgt 64021 gaacaagctc acttcagtat ctggctcatg gaactcagtc agtgttttac agagtgagag 64081 tttagtatct gctactcctg taatgtgatg tcatgttcac aaacaccagg taaaagacaa 64141 tatttggcat ctatcttagg tatcatggaa agagaaaaga atttggcatc tggatagaac 64201 tttagagatt tagtctaatc ctttgtttta cagttcatga acttgaaact aggaatggag 64261 aattggaagc ccaagacata tgaggatacc aagtcacatg aaatgtgttc agattttcag 64321 actgggagaa ttatttccca atatattgaa aaaagaaaat atgttggcac agtgatggaa 64381 ccattgttag taactttgga gaaactggag aatgggtgca atatctccaa agtgtaatgt 64441 tccatatcta ctatatacca ggcgctgggg atgtattcat gaaataaaaa tggagatagc 64501 taactggccc agttttcaaa aaggaggaaa aacatactgg aaaaaggaag ccttatgatc 64561 cttttgcttt gattccttca cataaattct aggcagatga tggaataagt ggacagtaag 64621 cacttagaaa aggagctgat catcaggagt cagcagtagt gaaatagtag tgactagtta 64681 gacttgactt catgttttca ctgggttact catcatgaga ccttatcttg gatttccaca 64741 ggcatttgat aaactctttc ctgatatcca gacagtaaga atgtgggcag gaaatctaga 64801 aaaatcagat acaagaactc aacataccca gcaatttgag gctctaaact gtagaggcag 64861 gattcctgaa taccttgcta gagctcacat tgaggaaagg gcatgggatt tgagctcaga 64921 aaacttttct tttaatttca gtgctgccct tcccagcagc ataaacttgg ataaattatt 64981 taacatctct gggcttccct tgtcctgctt gtgctatatt cctcaagagg ttgttgcagg 65041 aactgaacta gatgatgagt acaaaagccc tctgtaaact gaagatgcat tataaacatt 65101 agttattatc actgttctag tgctatagtt gctcatcatg ggggtcctga atcataggtt 65161 tctttactag gcatctgtgg acacctggta tgggccaggc ctatgctggg cactgacagt 65221 acctagtgcc tgccttcacg tagtttgatc tggtgataga cagatatggc cacagggtcc 65281 ttcctgggtg tgatgtgctt ggagacacat tgaaggaacc cttcccagag gagaatgaat 65341 gctgtgtcag agcagggcct ggaaaaagga ggagaagaca gcacctgcag gagatgtagg 65401 aggctgtaaa aggagttaag agagcaaagg tttgggggac tcccagggag ccttatgtgg 65461 tgagtcatca atggagtaca gcaggaggtg accagagccg ttcctgaggg aactaaggtt 65521 cacagagaag gttgctgttt aaggtcacag gtctaggaac tgacaaagct ggagtttgac 65581 ccaagtcttt ggactccagg tccaacatac taaaactgcc tcaaataact attttattct 65641 tgccatttga tttacagtaa tatttctgaa agtgtaacta aaggaatagc agcattggac 65701 cctctggtca gcttgttgaa aatcaagatt cctgagtccc acttcagata tacagaatca 65761 gaatcgctga gactggggct aagaatctgc attttaaaca agccccagtg tttctgatcc 65821 acactaacaa ttgaaaacca atgcttttct ttttcctaag tgcttttcca cttgatttca 65881 ttttattctc acaataaccc tcagaggcag gcaggataga tcactatcat taccttgtta 65941 caaacgagaa aatgaatgta cagatagctt gttcatcttc ttatcctgaa cgtagcttaa 66001 taaataaatg tttggtgatt gacattttgt taagccttgg ctaagtagag actttcacct 66061 tcctggtgcc actcttaccc atttgtaacc ccacagtagc tgcctctctc tttccttggt 66121 gacgcggcag tggtgctcac gttatggtat aaaaaggtga ttctaggcca ggcgcggtga 66181 ctcacacctg taatcccagc actttgggag gctgaagcag gcagatcaca aagtcaggag 66241 atcgagacca tcctggccaa cacggtgaaa ccccgtctct actaaaacac ataaaattag 66301 ccaggagtgg gggtgtgcgc ctgtagtaac agctcctcgg gaggctgagg caggggaatc 66361 acttgaaccc ggaaggtgga gattgcagtg agctgagatt gtgccactgc actccagcct 66421 ggcgacacag cgagactcca actcaaaaaa aaaaaaaaag tgattctaga agatagcgtc 66481 attccatgaa agtgctatct ctggaactaa aacaaatttc aaacaaacaa acgtgaggga 66541 tgacattagc caacttcaga aacggtgctt ctgcaagagt cagtgctact gatggcagtg 66601 gagcaccctc tgatctcagc aaaatctcta tagggcatca tcaggaagtc atgaccagaa 66661 tacagctgac acacaaaaga aaacaacctt aactcaagtc tggcatgaga tgctgagggt 66721 tttttttgta atcgtaagca gtatggagaa agaatttgtt agcacacttc cagcactgcc 66781 catgggagaa gggactctag cttttggggg aaaaatgaaa aacacggcaa aataagcaca 66841 gcaggttgct acagttgcag aaggcatgcc attccttgag gccctggttt tgatggttta 66901 agacttaggg atgacattta cagactgagg tcttagtcta cataagtttg agcttatctt 66961 atcacctcaa aagataagct cttgcctaaa agaatattgt agagcgaaga gatggtaatg 67021 gtgcatactg gatggacttt attcaagtgt ccaaagagtc gttcaacaaa tacatcattc 67081 acccagcaaa cactctgagc acccactaca ttctaggcac tttgctaagc tctgggatac 67141 aaagacaaac tagatacggc tccacacttg gagaatttca cactctaaga gtgatgctag 67201 atgtgtaaac aaatactgaa atggatataa ctgaggtatg aagagaataa cgaagagaat 67261 aacaaagaga gaagaagtct ctgattggtg gggagaggta tgggtggatt tggcgggggg 67321 ctcctccaca gaagatggga tggtattggc agactctaat aggctgattg ccaggcagta 67381 acaggcattc tggcctgaag gaacaacatg tgcaaatgca ctgaggcata aagagtatgg 67441 tatgctcata taacgctgag caggtctgta agctctagtg catggcacat atgggcagca 67501 attagacagg aggccagaat ggtggaagag agtaggggca gagcactaaa gaccaagtct 67561 gctaggctaa agagttgcct acaggcagtg aagagctatc agaggtattt aagcaagcag 67621 tgatttttaa cagaacgcaa agctgaaatg ccaaagagaa tgctgatctt tagttgccag 67681 tttggctctc tgaatgggta taaatgccta aagtctacag ttggccgtgt atatgcccat 67741 gtgaaatttg ctggccaact agtgttcact aacagcagag tatgcctttg tcctggggcc 67801 acatctctac tgccttgtca ttttaacagc tgatatttgt ctatatgggc tgtaagcaca 67861 taagcatgga attgcaaccc cttatagata atcatccact gttgctccat gatactaatt 67921 agaaaattga atgtctaaca ttgagtaaca aatgctgtgt tgacagggac atccttctcc 67981 aggctgcact gggaaaaatg cttcggatat ctcaagaaat ctctaggcaa ttcaaggaaa 68041 aaagccctcg tctggcaggg aacacctgaa aaatcaccct ggttaacaga aaatctgcgt 68101 gtgatcttag gcaagccatt aaatctctct gggcttcaac ttcttcctct gtaaaaggaa 68161 ggagttaggt gagcaaaaaa ttccataccc attacaggtc aatgcttaaa gaagcaacga 68221 ggaaggaggt gtgatcaaag gaaggttgac tggtggacca cccagagaac tcttcaaggc 68281 tgatgttttc ttggtgaatg accctaatct aggcctagct tttggccttt caggtttttg 68341 accttcctaa tgtccaaaga ggtgtagaca ttctgtgatg gttctgacct aaccatttat 68401 caaaccctct gtacacttat gaattgacta ttaaaatcag aacactcata tcccctatac 68461 aggtactact tccagaccct tcctgttatt aagtatccca ggaagctgaa ttaggcatca 68521 gaaggttgct ctgtgagcat ggacttaaaa taaagggaaa tgagtagcca ctgaaagaaa 68581 tttcatcctg gcagtggtga gtcagccaaa ctagcccagg ttgccacaaa ggaccctatt 68641 ctggtgacca aatctagatt cagcatcttt atctgaatac agtactagtt ttgcttcact 68701 aacttttcac tttctgtttc attgctggcc acttatttat tatattggtt ttggtttttt 68761 tttttttttt ttgagacagt gtctccctct gtcacccagg ctgaagtgca gtggcacaaa 68821 ctcggctcac tacaacctcc acctcctggg ttcaagcaat tctcctgcct cggcctccca 68881 agtagctggg attacaggga cctgccatca tgcctggcta atttttgtac tttttttttt 68941 ttttagcaga gacagggttt caccatgttg gccaggctag tctcgaactc ctgacctcaa 69001 atgatctgcc cacctcggcc tcccaaagtg ctgggattac aggtgtgagc cactgcgctc 69061 ggccttatat tggtttttaa actaagtttt aattgaaaaa aataactact tctttagata 69121 tgtgcgacaa tcaccaattc acacatatta ttcagccctg gcagacattt gcctctctag 69181 taactaataa caaagaacaa acattcatga gtgctctgtg aatagtgttc ccagtgctac 69241 acacactttt attcacagta atactgtgag ggtaggttgt aaggaggtag gtagtaagtt 69301 tgtatctcca tttcttcttt tttttttttt ttgagacgga atctcactct actggagggc 69361 agtggcgcga tctcagcaca ctgcaacctc catctcccag gttcaagcta ttctcctgcc 69421 ttagtctcct aagtagctgg gattacaggt gcctgccatc gtgcctggct aatttttgta 69481 attttagtag agacaaggtt tcaccatgtt ggccaggctg gtctggaact cctgacctca 69541 ggtgatccgc ccgcctcgac ctcccaaagt gctgggatta caggcgcgag ccaccgcgcc 69601 caacctccat tttttttttt ttttttgaga cagagtcttg ctctgtcgcc caggctggac 69661 tgcagtggcg cgatctcagc tcactgcaag ctccgcctcc cgggttcacg ccattctcct 69721 gcctcagcct cctgagtagc agggactaca ggtgcctgcc accatgcctg gctaattttg 69781 tgtattttta gtagagacgc ggtttcactg tgttagccag gatggtctcg atctcctgac 69841 cttgtgatcc gcccccctcg gcctcccaaa gtgctgggat tacatgcgtg agccaccgtg 69901 cctgctggcc tccatatttt tttattttat tttattttat tttattttat tttatgttta 69961 tttatttatt gagacagagt ctcgctctgt ctcccaggct ggagtgcagt gacatgatgt 70021 tggcccactg caacctccgc ctcctgggtt caagcgattc tcgtgcctca gcctccaagt 70081 gagtagctgg gactacagga gcatgccacc atgcctggct aatttttgta ttttcagttg 70141 agatggggtt tcaccatgtt ggccaggcta gtctcgatct cctgacctca agtgatccac 70201 ctgccttggc ctcccaaagt gctgggatta ctggcgtgag caaacgcgcc tggctcgttt 70261 cttttttttt ttttttttga gatagggtct tgctctgtca tccaggctgg catgtagtgg 70321 tgcagtcatg gttcactgca gcctcaacct cctgggctca agcaatcctc tcgcctcagc 70381 ctcctgagta gctgggacca caggcatgag ccaccatgcc tggctaaatt tttgattttt 70441 tgtggggatg aggtcttgct ctgttaccca gactgatctt gaaactcctg agttcaagca 70501 atcctcccgc ctaggcctcc caaagtgctg atattataag tgtaagccct gcacccagcc 70561 tgtatctcca gtttgttttt tttttttttt tttagactga gtctagctct tgtcacccag 70621 gctggagtgc agtggtgcaa tcctggctca ctacaacctc tgccttccag gttcaagtgt 70681 ttctcccgcc tcaggctcct gggtagctgg gattacaggc acccgccatc atgcctggct 70741 aatttttgta tttttgtaga gacagggttt caccatgttg gccaggctgg tcttgaactt 70801 ctcacctcag gtgatctgcc cacctcggcc tcccaaagtg ctgggattat aggcatgagc 70861 cactacgcct ggcctctcca ttttctagat gagaaaaatg aagcacacaa agggcaagta 70921 acttgcccaa agtacacaga taataaaagg cagaaccagg atttgaaact aagcaatctg 70981 gttctagagt gtgctcttaa ctacaacaca atactgcttt cattacttgt ggatctaatt 71041 agttcaactc acagaggaca aacttcattt ccaacttctt gggggaactg ccaacctatc 71101 caccccactc cccatcataa atcacctata agtaaaagat actatgcaag tattccattt 71161 tttccactct atattgaata tatatcttta gttaatgcca tctttcaaat atacaattta 71221 agaaaacaaa atgaaacaaa acaaacttac gattggaatt gaaatttttc agatttcagg 71281 gcaattcata atcagtttgg tggctagagt agctgaactc attcttcctc tcttgccctc 71341 ctctccggag gtacctaggg cttgttcctg cctggattgt agctacagag aagtgcagct 71401 caagacctgc tgctggaaag aaggactcct ggggtcagct tctctggctt ctactcacta 71461 tgggatagct cctctgaggg caaaagacaa agctttaact tatcaccctg acacttgcag 71521 gcaattttct aaaacaaccc aatgtccgga actgattccg tcttgccatt tacagtaact 71581 gcagccacgg tggctgctat aggaaggatg tggggaaggg gaaggggaag ggcagggaaa 71641 agcctgactt acatcatatc tcttggccac ctctagcaat tcattcaaca aatatccctc 71701 ctatgccaac aatatgctca gtgctgacac aatgaggggc aaaaacccca cagtccatga 71761 ccaaattgtt ctattagagg tcttaagata tcattaagga gatgatcaca tgaataaatg 71821 tgagattaaa actacttttt taaaaaaagc ctcaaaggaa aggtctatga aaacataaaa 71881 cagggaccta acctattttt cagggtgagg aattacttct atatttctct gagccatcca 71941 ggtaaagatc taaacgataa gcaggagtta agtgaacaga gtaagggggc attctaggca 72001 gagggagcag aatgaacaaa gcccttgtca gaaggagggc ctgtccagaa gagaggtgag 72061 gctgaagtgc agggagcagg aggttgtgga gatgtggaga gaggccacat tacggagggc 72121 tctactgtag gccaaattaa ggaatttttt tttttttttt tttttttgag atggagtctt 72181 actctgtcac caggctggag tgcagtggcg tgatctgggc tcactgcaac ctctgcctcc 72241 caggttcaag tgattctcct gcctcagcct cccaagtagc tgggactaca ggcacacgcc 72301 accacgacca gctaattttt gtatttttag tagagatggg gtttcaccat gttggccagg 72361 atggtctcaa tctcttgact tcatgatctg cctgcctcgg cctcccaaag tgctgagatg 72421 ataggtgtga gccaccgcgc ctggccaagt taaggatttt ggtctttatc caagaacact 72481 gggaagttgt ggaagtatac aaggtaggag agacaacaca gagtgacaca gtcaagtttt 72541 tttttgtttt tttttttttt aagatcactc cttccagcca ggtgcagtgg ctcacacctg 72601 taatcccagc actttgggag gccgaggcag gtggatcact tgaggtcagg agttcgagac 72661 tagcttggcc aacatggcaa aaccccatct ctactaaaaa tataaaaatt agctgggtat 72721 gggggccggt gccttaatcc cagctacttg ggaggctgag gcaggagaat tgcttaaacc 72781 cgggaggcag aagttgcagt gagccgaaat tgtgccactg cactccagcc tggggaacag 72841 agtgagactc tgtctaaaac aaaacaaaac aaacaacaaa aatcactcct tccaactgct 72901 ctatagagaa cagattaggg ggtgcagagt ggctgtggga gatgaggcta ttgcagacat 72961 ccaggtgaga tgatggtagc ttggagcagg gtggtggtgg gagaaaagga aagaagtaaa 73021 cctaccaaaa tataatttag agataaaact gataggcttg gatatgcagg atgagagaga 73081 acaggtatca aggattccta ggtttctggc ttaggcactg ggtggatggt ggtgcctctc 73141 aatgacatac atgtttcagt ggaagaggag ccagttttag ggggaagagt gaaagtccag 73201 caaagtctag agaagagagt ctgaagtgcc tggaagacat tcaaatggac aagccaggtt 73261 gatagtacaa catatgggtc tagaactcag tggaaaggtt taggctggag atactgattt 73321 tagggtcatc agtgtgtgga ggacaatgga aggaagtcat gagtgtgtgt aagactgcct 73381 aaggagaaaa cagcacgcct gaaggaatgg taacatccca gagctggaca aagaaggata 73441 agcaaacagg tattaccaga gaggtaagag aaaaccagga gagtgtggaa tgtcatgaaa 73501 gccaagggaa gagaacattt caagactgac agatactaat aataaatgct aacatctgtt 73561 aagtcccagt catggccaga cattctgctc aggactttga atacatgatt gaatttaatg 73621 ttaacaactt tttgaggtgg gagccatttc ccttttcata gacaaggaaa ctgtggcaaa 73681 gaatggttaa gcagctgact catggtcagg tggctggtaa gtgaagctgg catttaagct 73741 caagcagcct gactccgaac tttatccact tttctctaat acaaccctta gtgcagtttg 73801 ggaggtcata taagatgaag cctgaagagt ggtcggtaga cagactgatg aagctgttat 73861 tcggaatatt actaaaggaa tgttgtttca ggggagaaac agaatgaaag ccaggctgga 73921 gcccatggtt aggaatgctc catgacattc tctatttcca gctgtcttag ggaaccagtt 73981 aagagaccag gccaggtgcc gtggctcata cctgtaatcc caacactttg tgaggcccag 74041 gtgggaggat ctcgagccca ggagtttgag accagcctgg gcaacagagc aagaccctgt 74101 ctctacaaaa aattaaaaaa aaattagctg ggcatggtag tgtgtgcctg tagtcccagc 74161 tactcaggag gattgtttaa gcctgggaga ttgaggctat agtaagctat catcatgcca 74221 ctgcactcca gcctgggcaa tagagcaaga cctatctcaa aaataaataa gctaattaat 74281 gaattaatta agaggccagg ctagtatgcc aactccgtat cccccacagt gcctaatttg 74341 atgactttta tatatagcaa atgacgcttg atatatttgg atagttaata taagatgcac 74401 aatgttgcca tgtgctgctg ctaatccttc acattaaacc attctaatgt tttcagagtc 74461 cttgatcatc acaactactc catgagatgg atttctttta tgacatttta cagataagaa 74521 actgaggctc acaggggcta aggaacttgc ccacagttac acagttagcc tagctcaact 74581 aagactcaaa tcagattttc tgtctccaag tctactgttt tttttttttt tttctcctgg 74641 gagtcctagt ggaaagcaga ctctcttggg agccagggag acctgagttc aaatatcatc 74701 tgtgtgaccc tggacaagtt actccaaacc tttctgggcc ttcctgtttt agtataaaat 74761 gggggttgca ctggccaggc acagtggctt acttccgtaa tcccaagcac tttgggaggc 74821 tggggcagga ggatcgctgg agttcaagac cagcctgggc aacatagtat taataagacc 74881 ttgtttctac ttaaaataaa aaaaaatggc aaggtgtggt ggcacgcctg tagtccaagc 74941 tgcttgcaag gctgaggtgg gaggattgct tgagctcagg aggttgacac tgcagtgaac 75001 tatgattgta ccagtgcact gtagcctgga caacagagca agaccctgtc tcaaaaaaag 75061 gtgggggttg cagggttatt atgaagattg aatcagataa tctaagtgag gttcttaata 75121 caacgtttaa ttaatatcag ttatctatct ctggcctcat ctctctccat ttttctacct 75181 ctgcctccta tagaagctgt gaaacatctg tgggtcctta gttctctctt ctcaaagtaa 75241 agagcaggat taaggcacat tcaagctcat atgctgctgc tacatcctag gagtaccact 75301 ctgtggtcag gctggagtct cttgtagggg cagaggaaga accagatgtc tgcccttcct 75361 tcctctgtca ggcaggcccc agccccctgt ccaatagcca gccttaatga tctacgggct 75421 ggctgtttcc taaaagataa actggacaaa tgagtaaaca ccttctggag aaggccacag 75481 ccagaatcaa aggtttaggt tagtaagtgt ggcagcaaca tcaacagcat aacagcagat 75541 ggcagggttg ctccactgac tctacatcag gagcctgaga cccagaagaa agttaacagc 75601 acttttgttg acattataga gtccatgaag cctttaaact aacacccttc aatgttatct 75661 gacactacca gtgtgtggca ggcagggtga gggctcttta ccccatatta tataagagga 75721 aactgagcct cagtgaggtt aagaaatatg cttgcattca agaatgccaa gagactcttg 75781 caggcaagga gtaagaatgc ctgagttcaa atctcagcta tgctacctat tagctctgtg 75841 actttaaaca agtaacctcc atttccctat tctcgaagta ggtaagacta gtactacatt 75901 ttattaagtc attgtaagga ttaaatgtgc tatgcatgtc gtatgctcag gagggtgtct 75961 ggcacagagt actcaataag tggtacctat tactgtaatt atgattctac caggcttgct 76021 tagatgggga tgaggggaga gaaaatgaat gtggctttca gcatcacaga aagtaccctg 76081 ctgtcagcca acatattaca acacagtctc tggttgctac gtgtgttatt tcttcactgg 76141 ccttcctgga gtcactaaat ttagcggaaa tggtctgttg cctgatgcag tgaccacagt 76201 cttctatctg cattagccct ttcccagagt ataggtatta gtaaccgctt ttcaaaagtg 76261 aaagctcaaa tagtcttatg tttgcaatcc tgagaaagga agtcctgaaa gttcccagga 76321 gccttaagag cggtggattt tccttctgat aatgctgcta ggaaccaggt caagcaatag 76381 agacccacgc tggggccatc aagactaata ggggtagctt ttcttccctc ctcctaagcc 76441 aaacaccttt gatcagggtg aacggacaat catcaaaaaa gttgtcaagg tcctctggct 76501 tgctaacatg ccaagaggcc aggtcaccat tgatagctcc acactttaac ctcagctaca 76561 aatgacatgc tctctggcta ctctcctggc ctctcctaag gagagagagc caggtccaga 76621 gaacatcaga aaggattgcc ccagatacag aattaatggc actgctcaag gaaaggaaag 76681 agaggtaact cagagtttta aaaaaggcct actgggctgg tccaagtaca atagtgttta 76741 caagtaattg atcactacca gttaccgatt tctttgttcc ttttccactc ccacaatttc 76801 acttcactag cctaaaaaaa aaaaaaaaaa gaaagaaaga aagaaagaaa aaaaagaaaa 76861 gaaaaaaagg aaaaaaaaga ggcctgttgg gaaggcttgg gaggactagt cagccatatg 76921 tccaggagat ccaaaagatt ctggccagct gcctaatcgc accagcacaa ttactgacag 76981 agctactttc agttttgtca ctggcctcca ttgcacagtc tgaggttcca gctttaagta 77041 gcttcctgtt gtcctgggca cctgagggtc tgtcggaagg ctaacagagg caggattatg 77101 gatgtggctg cagacagaag aatggggtaa tgtaagggag agatgagggg aggagtttag 77161 gatggacccc agcaatttct ccattcaata tccagcccca ccctgtgact aggaaaacaa 77221 aggtactggg ggaaggggag aagaaaagtc atttccttta ttgtaactgt agcccagctg 77281 ggtaagagca aggctggaag ggggccttga agaatgaatt tctctgagta gcacagaggg 77341 agagtcctcc tcccagcagg cctgcactac tcaggtactc taaattcatc agagaaacac 77401 tggcaagctg gcgtgggaga gatcttacaa gaccattcag gttgatgccc catgttcctc 77461 aaaacccagg ttatctctgg ggtcagacaa atacaatcct tcctagccct gaagattttc 77521 ctccaactgg ggcaaaagtc aaattagatc caaaagacag tcactctccc cagggcaaat 77581 taccaggctc tatctgaggg cgccgtgaca tccagtttga atttaccagg ggaaagctca 77641 gaagtcagga gatgtgctcc ttcagcacag ctggattaga gtctctgtcc tcaaggtaaa 77701 gctgcaacta ggagacagaa cctagatact ctacttcctc cacgggaatg atatttggct 77761 tgattgtgtc agggtagttc aaagttgccc ttaatccctg gggctccaca tcagatcacc 77821 aatgaatgtg taataagaga aatgtataca tataggtgta tatagacaca cactcacaac 77881 ataatataaa gtgacagtta ctgattatta agggtttttt ttttcatttg ttttgttttg 77941 agacaggggt ttcactctgt cccccagctg gatggagtac agtggcatga tcacagctca 78001 ctacagcctt gatatccctg ggctcaggta atcatcgctc ctcagcttcc caagtagctg 78061 ggactacagg tgtgtgccat cacgcctggc taatattttt gtattttttg tagaggtgga 78121 gttttgccat gttggccagg ctggtctcaa actcctgggc tcatgggctc aagcgatcct 78181 tctgcctcgg cctcccagag tgctaggatt acaggcataa gccaccacgt ccggccctat 78241 taagagtttt ctatatgtca gatgctatgc tagccacttt atataggtta tcacatttaa 78301 tcctccctaa caaccatgtg gcatgagtaa tattcttatc ctcattttac agattggtgc 78361 ctgaagctca gggaagtcac ttaaggctag ttaataaatt gcaaagccag gatttaaaac 78421 catagtctca ttccatagcc tgtactcttg gtggtggttg tttttttgcg atggagtttc 78481 cctcttgttg cccaggctag actgtaatgg cacaatctcg gctcactgca acttctgctt 78541 cccggattca agcgattctc ctgcttcagc ttcccaagta gctgggatta caggcatgtg 78601 ccaccatgcc aagctaattt ttttctgtat ttagtagaga tggggtttca ccatgttggt 78661 caggctggtc tcgaattcct gacctcaggt gatccacctg ccttggcctc ccaaagtgct 78721 gggattacag gcgtgagcca ccacgcctgg cctgcctgta ctcttatcta ccatgctaac 78781 gtgtctactt cctctttatg tatttgtgta catatgttat ctttctgtct ctatctcaca 78841 cacacacaca cacacacaca cacacgcacg cacagagcta acaagctagc atctgactag 78901 acctacaaat gacccaagag ttcagtgcta gggccagctc ttgtcatcca tggcctgagc 78961 aagtcttttt ggtttcaggc ctcatctgtg atatgaatgg gttataccat atagtgggtg 79021 gtttcttcta gttctgaagt tttttagctc taagagaatc cagaaattcc aaagattaac 79081 ttgctctacc ctgtcctgct tattttggat agaaaatatt catacacata ggaaaacagc 79141 aagaatatga acatatgggt tacatgccaa ggcgagaccc aatctgtgaa ctcattgggt 79201 tttgacgtta cttattagtc ccagctataa aatgaaagga ccagccaaag gctttaacta 79261 taaggtccct tacagctctg acattctctg ctcctgtcat actaaatgga acagaaaaga 79321 cagaatcaac ttccctggga gtaaaagggg caagaaccaa ggcaataaga tccacagaaa 79381 agacttgaga gctctgagtt atagctgaaa ggaaaggata ataatgatgg ttgctttcca 79441 actcttccag aggccagacg tctccaggcc aatcccctat atggttccca gggaagaggc 79501 aagtgtttaa aggtcaactc aattacttgg tataagcaca gggatgtttt acaacaagag 79561 ctatgtgcag gagctggaat ctacagcaca aggtgcctgc ctagtctatc tcattctcag 79621 gatataggca cagccctacc acttcagacc ttgggggcca ggcatgccca tccagcctct 79681 ggagtggact gctcttcttt ccaccaacaa atttcaacta ttttacgttg tgaagaatat 79741 aaagacaagt tcccaccctt gagaattcca atccgatggg aggaaggaat aaccattcac 79801 tgggtgtcta ccatgtactg gatacactgc atatattatt tcattcaatc tcaacaagat 79861 tattaccatc cattttccag atgaagaaac tgagcctcag acagcaggaa caaatctgat 79921 ttgaagtctg tataattttt aagaaactga gctgcctaca aaaataatgt agctataggc 79981 agggcgcggt ggctcatgcc tgtaatccca acattttggg aggccgaggc aggcggatca 80041 cctgaggtca ggagtttgag accagcctgg ccaacaaggt gaaaccccgc ctctactaaa 80101 aatacaaaaa ttagctgggc gtggtggcag gtgcctggag tcccacctac taggggggct 80161 gaagcaggag aattgcttga actcgggggg cagaggttgc agtgagccaa gatcacatca 80221 ctgtacacca gcctgggcga cagagcgaga ctctgtctca ataataataa caataataat 80281 aataatgtag ctataataaa gctagtaaca aaaagtagga acctagggag gcaacctgtg 80341 gtgaggaaga gctggctctg ccaaccacac taaaaacaac tcttggctgg gcacagtggt 80401 tcacgcctgt gattccagca ctttgggagg tcgaggtggg tgcatcattt gaggtcagga 80461 gttcaagacc agcctgacca acatgttgaa acccccgtct ccactaaaat acaaaaatta 80521 gctgcgcatg gtggcaggtg cctgtcatct cagctactca ggaagctaag gcagaaaaat 80581 cacttgaacc cagcaggcag agattgcagt gagccgagat tgcaccactg cactccagcc 80641 tgggcaacag caagaatccc tctcataaaa taaaataaaa taaaataaaa taaaataaaa 80701 taaaataaga atacaaacaa ctcttgaagt acctgcataa tctgtagtgc agccacagga 80761 aagtcactct tcctgggcct cagtttcctc acccataaag tgagggatat gggacaaccc 80821 tggaaagggc tcctctcatc cccccaccct gcctccagct ggcccattta tggctcttcc 80881 aatttaacct gggttcaggg gggccacact tgaggtttgg tcatgagctc ttccttggta 80941 cattcctttc cccaccctaa gattgaaatt ctcaactcag aagcttgttt ctgtgttttc 81001 catcagaaca tgaaatagtg cttggcccat agtacactgg taaagatggg aggataggcc 81061 cacctaccgg aagcaagagc cgtgccaggt ttcgttgaca gtcctgtacc atatctggct 81121 tggagcaatg tggtccccac agcctggaca cctccagaca tcttcacctg cacacaaaac 81181 caagaggttg agaggcccaa agtagcagga ggcaaaggga caggattcct accatgtaga 81241 ataagactga gttaacagca tctcatttga aagggttagg ccaatgctct gctagatgca 81301 cagtttctgc ctccacaaga tctctggtaa gcacttgtcc aacctctgct catctctcca 81361 gggacaagga actcatggcc tcaaaagtag tctcattcca ttttctaaaa atggttctga 81421 ttatcctttt ttctactgaa acaaaacttg cctttttcta gaactttcag cctttgctta 81481 tagctctgag aagaagaaac gaacacaatc tccttctttt cagtgtcagc ccttcctatg 81541 actattccca gccaacagcc agaaacttct ctctgggtta aacatgttac tgccttttcc 81601 tattcctcaa aagccctgtt ttttcagatt tttccatcac ctaatccaag agctcattta 81661 ctcaaccaac attgaataaa ttggtcctct agatcagcag tcctcaacct ttttggcacc 81721 aggggccaat ttttccacgg accctaggct gggggttggg gggcgggggg atggttttgg 81781 tatcaaactg ttccaactca gatcatcagc cattagtgag attctcataa ggagcatgca 81841 acccagatcc cttgcatgtg cagttcacaa tagggttcac actcctatga gaatctaatg 81901 ccaccactga tctgacagga ggcagagctc aggcagcaat gctcactggc ccgccgctca 81961 cctcctgctg tgcagcctgg ttcctaacag gctacagacc cgtaccagtt ggcggtccag 82021 gggctgggga cccctgttct agatacaaag ctctttgtat gttggaggct gttagattac 82081 ataatctctg ctccttaaga agcaaatcca ctggaaaaga actgctcttt catgttgttg 82141 atttttcact tggatgttat ttcagagata gaaccaagta ttattttagt tcagaaaaag 82201 aaaagtaatg ggtataaaag gggaagaaca aatatttaat gaatgcctcc tatatgccaa 82261 gaactctgct aggtgctttc atagacctaa tctcatttaa ttctcaaaat aatcctgtga 82321 cacaggtatt attatattca cttaatggat aggaaaatta aaacttgagg cagggggcag 82381 ggttgactag ctaagtggtg gatctagcat tcatactcag gtctggctca ttctaaagct 82441 tctgctcttt ccattctacc atgtcaactc ttgttcactc caatgagggg aaccaggaaa 82501 gtctttgtag aggaagctga cacttgagct gagccttgca agatagggaa gacttaccta 82561 tcactggctg agaaaggagg gtgaagggtt gtgtgagctc agagaccagc atagacaaaa 82621 gcacagagtc ggtccatgtt cctccaaaaa gagaggcctc aaacaaaata caatactcct 82681 ggtatggctc agatgtcaac ttcttgaaat aaagaggaca gctgagatca agctgtggta 82741 cctgactcag ccctagaaag caggacaagg ttgttacaag actcccttgt gcgctctgtg 82801 tatgcagctt tgttctctct gggtaactac agactgccac aggcacagtg acagtcaccg 82861 aaccctgttt cacaggcaca ggggccatga atagacaaag cacagctcct ggcacttcaa 82921 gattgctcaa catcattctg gagcagccaa tgtacacagt tgtccccctc gttccccaga 82981 gaagagttaa acaagctggc aagaggcctt aaaacaccac ctcaacccac ccattccact 83041 cttctcatgc ctcagtaata ggatatatat atgcccatca ctcattcttc ctgggtctaa 83101 aatcctcaga ctccttgtca agaggggcct ttttggtttt agggtgacag tggtcacaga 83161 gtatgccatc tctctcacaa actccaaggc agaaagtctg agagactcac tcaaaccaat 83221 cccaggttgg gttacggaca gtggacaaga caatgaatta gtgagagatt ttaatgaaca 83281 caaacatcca aatgaatctt ccaagttttg gaaatatttt tggaactctt acatgtaaga 83341 gctttcaggg tcagtttatg agctgccatg ccccaccccc agcacatctc attattttag 83401 agccacacct tgtttcttgt ttttactttt taacccaaaa ttaagtctgt aacctgacca 83461 cccacttact caacaattca atctcttggc tctgaatgat ttctatttcc aaaattcaaa 83521 caggatcagg actttccacc atgaggatat tcaagataac ttgttattgc cataccctcc 83581 agagcttgcc ctgctttctc cagtcttgag aaacccaaag acattcaatc aacacatact 83641 ttagtacctg ctttgggtta agcatgaaag acccgaaatc aagaccctct actctctcct 83701 ctcatatcca cctggtttca ctttccaaat gtctctctag cccttccctg tccttcaccc 83761 cagccctact tcagattttt ggcttctttt ccctatctgc tgtactcagc tggctcatcc 83821 acactgcaag ctatctcagc tctgggttag actccccagt gtccacggaa caaaggctgg 83881 gctactcggt atgacacccc tgttctggat cccattccat cttacttctt agaagaagtg 83941 cacttttagt tatctttaca gcacccctct ccctcactgg ctcctcctta tcttttaaat 84001 aagttcaagt cactgccatc aaaaaaacct ctggtactct cgcttcagca gcacatacat 84061 taaaattgga ctgagacaca gaaggttagc atggcccctg cgcaaggatg acatgcaaac 84121 tcgttaagtt ttaaaatcta atttagtttt aaaatctaat tttaaaaaaa accaaaccac 84181 tggatccttt tctcctctag ctaccatcct tgtttctccc cttcacagcc acatgtatga 84241 aaaagttgtc ttgggaggcc aaggtgggtg gatcacttga ggtcaggagt ttgagaccag 84301 cctggccaac atggtaaaac cccacctcta ctaaaactac aaaaattggc tgggcatggt 84361 ggtgcacgcc tgtagtccca gctactcagg aagctgaggc aggagaatcg cttgaaccca 84421 ggaggtggag gttgcagtaa tctcgccact gcactcacgc ctgggcaaca gagcgagact 84481 ctgtctcaaa aaaaaaaaaa aaaaaaaaaa aaaagaggct gggcacggtg gctcatgcct 84541 gtaatcccag tactttggga ggctgaggcg ggtggatcac ctgaggtcgg gaatttgaga 84601 ccagccatga ccaacatgga gaaaccacat ctctaccaaa aatacaaaat tagccaggca 84661 tggtggtgca tgcctgtaat cccagctact caggaggctg aggcaggaga atggcttgaa 84721 cccgggaggc agaggttgca gtgagctgag atctcgccat tgcactccag cctgggcaag 84781 agtgaaactc caactcaaaa aaaaagaaaa aagaaaaaaa gttgtctaca catcgctgcc 84841 ctcatttcct cacttcccac tttctgcata atgcactgca atctggcttc cacccccacc 84901 atttctctga aactgtctca ccaaggtcaa cagtaacagt aactgcttat tgctaaattc 84961 aatggacccg cttcagtcat tctttcagca aatatttatt gcatgccaac tatgcatctg 85021 acactgtgtt ggggaaagag gagcaaacaa gagagagatt aaacacaaag caaatattaa 85081 acaataatat aattatttat tgagggccgg gcgcagtggc tcacgcctgt aatcccagca 85141 ctttgggagg ctgaggcagg tggatcacaa ggtcaggagt tcgagaccag cctggccaat 85201 atggtgaaac cttgtctcta ctacaaatac aaaaattagc caggcatggt ggtgggcgcc 85261 tgtagtccca gctactcagg aggctgaggc aagagaattg cttgaacctg gtaggcggag 85321 gttgcagtga gccaagatca ttccactgca ttccagcttg ggcgacagag cgagactctg 85381 tctcaaaaaa taataattat tattattatt tatttataat tggggcaagt ggtacgaaga 85441 tatcaagttc ataactagga aaagggcggg cggccaaagt attccttgag gaagtaattt 85501 ccaagctgag ctctaaagga tgaatacaaa ttaacggggc aaaaaagggg ggaaggaagg 85561 ctaacactgg agagggggga catgtgcaaa ggccctgggg caggaacaaa aacttctcac 85621 acatcattaa agttggcttt tgctacctcc aaagggctag caaaccatgt ctggcccctg 85681 gtctgctaca atccttcaat aaacagttac tgaaagagca aatggtcctt acaatgtggt 85741 cccaaaggac ccttctctct caccattcct tacaaaagaa tgctctagtt acattttgac 85801 ctattttcta ttcctgccac ttttattcat gtacttgcat cctgacattc acacctccag 85861 ttattctcta tgtataagct tctctatctt ttcccttaat caaaaaaaaa attttttttt 85921 ttgaaacaga gttttgcttt tgttgcccag gctggagtgc aatggcacaa tctcggctca 85981 ccacaacctc cgcctcccag gttcaagtga ttctcctgcc tcagcctccc gagtatagct 86041 gggattgtgg gcatgcgcca cgacgcccgg ctaattttgt atttttagta gagacggagt 86101 ttctccatgt tggtcaggct ggtctcaaac tcccaacctc aggtggtctg cccacctcgg 86161 cctcccaaag tgctgggatt acaggcgtga gccactgctc ccggcccctt aatcaaattt 86221 ctatcctttc tttaaagttc cctaaaatca ggcagtgttt gtttcatttt gtctatacct 86281 attttacata atttgttatt taattacagt tcactgcata tgtttctctc tccttcagtc 86341 cctctctctc cactctcctc ccccacaccc cattcagaga actaaggtgt aagaactgta 86401 tctttattat gttcttatct ccagggccta accttgggcc tggtgcaaaa cagacattcc 86461 tttgtgtggt gaacccaact gaaaatttgg atccaactgc cagctataaa agcaatttcc 86521 aaagataagt tccaaaggtc agcacagatg attccactgg aataactgtt ctggctccca 86581 gattacctat gagtagtctc atgtggataa acctggcctc agtttacctg ttatgcattc 86641 acactgcttt tctgttagga agtacctgag caaagatatt cacccaaagc acacagcctc 86701 ctacgcccag accctcttca agagaacact cttgacttct ttagctctct tacagtctta 86761 aaaattcatt gatctattct cttaccacct ttattctagg aaatgtaaaa gaaatgtaat 86821 actgtaatgc aaagtcttag ggatttctgt gaagcctagt ggaggatcaa tccaaagtcc 86881 agtcagcctg ggaagttctt tcttaagaaa atatctcatt gtaccaggct gttcattctc 86941 tccctctata gtgtttacct ctcatatttg cagacttatc aacagtggct actgattgtt 87001 aagagtccag aaaagagccg ggcgtggtgg ctcacgcctg taatcccagc actttgggag 87061 gccgaggcgg gtggatcacg aggtcaagag atcaaggcca tcctggctaa cacggtgaaa 87121 ccctatctct actaaaaaat acaaaaaatt agccgggcat ggtgatgggc gcctgtagtc 87181 ccagctactc gggaggctga agcaggagaa tggcatgaac ccgggaggtg gagcttgcag 87241 tgagccgaga tcgtgccact gcgctccagc ttgggcgaca gagccagact ctgtctcaaa 87301 aaaataaaat aaaataaaaa aaagaagagt ccagaaaagg agcttgattt aagtaaatgt 87361 tcacagtacc ttcaaacaat aaaggaccct tattaggtgt ttaataaaca gaaaactgtg 87421 ttttcagtta ttattggaat ttgttgcttt taataagtgc aactgtgcaa cgtaaggtga 87481 gcactgcaac aaatgtctca ttataggggt tctcaccttt aagacctttt caatcttggt 87541 ataaaataca ttcactttac attcacttta tgaatgcttc tctctcccaa cttaattagc 87601 aatgtggccc tcaaacaaat gtgaatccac tggggcaggc gccaaaaaaa aaagaaggaa 87661 aatagacttg cctaaccacc acaaagctac tgaaagatgg tccaggaggg attcaaaccc 87721 atgctcagag actaccatgc cagtcatatt tttaataccc tatgcagcct cagacaagtt 87781 gagacttgct caagaacaca cagctattca gtggaagggt tggcctggta tcccagctca 87841 gagccaatgc agtccccaca atgttcacct taaatttggt ctaagagctt ggtttctgca 87901 aatgagtcct ttttaggctt caaaatggtg agaaatcctc aagcactgag agctcatgag 87961 actctaatgc ctagcttaaa tcaacatcct atcctttcat ttaaaaaaat ctttgaagct 88021 tctcacgttc caccagacaa cgagaaccag gaacaaaact cagccaattc agtgcagcat 88081 aaaacaagta tggtaacccc tgggcccctt gggaactggc gggtcaggtg tataaatccc 88141 acttgttctg ggacaaagca gaaagcagaa gggcagggca gcagctgctg gggaaaagcc 88201 aagagaaaaa gatcacattt taatgaagtg actatcaaaa agactaacag taaggattaa 88261 catgcctcta gatactctcc ctgtaaccca aagggttttc ttcctcacta gcttaccaaa 88321 ttttaccaaa aggagtgtct caggaaaaaa aaagattaga tgatttttaa aagagctttc 88381 gagcttctct aaaccttgga ttgttctttt gtaaaaggag ggatcaagtg atttcttcca 88441 gctctgagag actgacacca aaccaagggc ctctcctttt tcaggttaag gcacagacac 88501 aggtatctca gaaagtttcc ttcttgctgt tattctcttc ttcccttctt tggtttattt 88561 ggaaaaattt gtgaggggta aatgtatgtg tgtttgaaac ctgaatctgt cactgtttgc 88621 ctctgtgtcc ttgggcaaat tatttgacct ctctgagctt gtctcctaat ctttaaaaag 88681 aggataatac cacctgcttt acaggattac caagtggatt aatagaaata atgtaataga 88741 tgtgaaaata tctcccatgg tgccaggcac ttagttggac cgatttgtga ctaacttctc 88801 ttccttcctg cgtgtgtatc taggaagttg gctggggaaa aggttaagct gaatacagaa 88861 acttctaaaa tctgaaggga aggttttcaa aacaaactca gcaaactttt ttctatgtgc 88921 ccatgtacaa ggcacagagt aggccaggaa ttactggggc ccttggtgtg tactcatggg 88981 gtgtgaagga catggatata cacagaaata acatgtcaga gagggacaaa ccaggtgcta 89041 caggcaaggc aaactcagta cattaacaaa atacactgca gtgacaaatg agacagatgg 89101 ttcttatgtc tttcaggcca cccagactgc aacttccaac cagaagttca ctcaactgtg 89161 aatgagtagg cattttctta tctatggagg gcctataaaa atatctttat cagagaactg 89221 gttaaccaat tgatatattc tcatacagca gttctctgcc agttgtcaat gatagaggaa 89281 aaaaactact ttatgccaga caatgtccta agcactacgc aagtatgagc taactgaata 89341 caaagatccc atgagatcag gactagtctt aaagaactat tattgttctc attttacgga 89401 agaggaaact gaaatactca gctaataaat gtttgagctg ggattagacc acaggtctag 89461 tctccaaagc ctgtcttttt ttttttttct gagacggagt ctcgctctgt cacccaggct 89521 ggagtgcggt ggtgcaacct cggctcactg caacctctgc ctcccaggtt caagcaattc 89581 tctgcctcag cctcccgagt agctgggatt acaggcacct gcccccatgc ccggctaatt 89641 tttgtatttt agtagagatg gggtttcacc atcttggcca gactggtttg aactcctgac 89701 cttgtgatcc acccacctca gcctcccaaa gtgctgggat tacaggcgtg agccactgca 89761 cctggctcaa agcctgtctt taactacatt ttagactttc tttcctggac aaaacaaatt 89821 acaggcagaa agacatgata aaaaagaatc ccagaaagaa tgttcaagga acactttaat 89881 gaaaccaggc attggtttca tgttctagtt actattcaat gtaaattaat aggtaaaata 89941 gtctaattct ttctactata taaaaataga taaaggatga gaaagaaggt agacttgtaa 90001 gagactgcta cagtttcagg tcctctcatt gaaatgagat gtgtgtgatg gtgggcccta 90061 tcagacccgc tcaaggttac actaacaaga aggcagccct gttctgggaa tcagctcagc 90121 ctccagaaga tgcagaaaat ccaccatgaa tgtggaaagc ctaatacagc cccttgatca 90181 tgacggattt ttatttagtt ttgaaatctt gtggtcattt tggaccctac cctgctccat 90241 cacatatcca ggaccatcca agtcaccctg gcctgcctat gtcaggaggc atcttttatt 90301 ggttggtgcc ttcttcattg ctaccactaa gacctcatct cagggcccac tatacccacc 90361 tccagattgg tggccctgtg cccacatcat gagtcttcct ttctctaaac ctacggtgcc 90421 tactgtagac tagaaacttc caaaatcctc agcctggcat taaaccctct agtccggttg 90481 cacccacatt ctcagctgta ccttccactt cctatgaatc ttccacctcc caatccttgc 90541 ctctaccaca cgtatttgct accctttatg aagacctgaa acttgcctac ttttgtctcc 90601 tactcccaaa gtttcttccc tactaagata tcatcttccc caccttcttt aagcctattt 90661 aagatttctc aagtcttacc cacttcaaga agtattccct aaccaaaact gatggccacc 90721 ctggataccc tgctattctt taaacatgcc aaacacattc cctgacaact cacttcttct 90781 atccccttac cctacttcac ttctcttcaa tggatttatc cctgaatatg atgttcatac 90841 atttacttgc tgtctatctc tccccaccag aacatacact acatgaggga agggatttgg 90901 tctatcttgt tccttgctgt atccctaggg gccaaaagag tttagcacat agcaggaact 90961 caatatatat ttgtggaata aatgtttgaa gagaattctc cctcatcttt gaaattctat 91021 aatgttatca tacgcaagcc ttataacagc ttgtcttatc aatttttttc ttttttttaa 91081 gagatgggga tcttgctatg tagcccaggc tggagtgcgg tggcttttca caggcgcaat 91141 cccactactg atcagcatgg cagttttgtt ttttgcctta attaccttaa ctagacagta 91201 agtttctcag aggcacaaac tatatataac tcatttccac aaatacttat taagggctta 91261 ccatgtgcca gaccctgtac gtcttaagca gcctggaata ctggaaagaa cactgcagaa 91321 gtcttacctt ggttctggct ctaccactag ttgagtgacc ctggacaact ccgttttttt 91381 tccctgagcc tcagtttctt cataggtaaa acaaggagac tatacattat ctcaaaactc 91441 tctcctggct ctaacacctg ttaaaattta taagaggtaa aacttggtcc tgcccagtgc 91501 cacagacatc acaggcactc agtaaatatt tgatttgtta tcgattctcc ggttacacga 91561 ggaatctgag gtgtctcaga acatggtttg aatggaataa caagtgaaag tttaagtccc 91621 acttccttct cagactccat agtcaacttc attctgcttt ttgtgctaca gagaggagga 91681 agcactaaac ttgaagttgg aaaatatgaa ctctgaccac cactgctcat ttagctatgg 91741 aaccttgtca tttctgggcc tgtctcctca tctataaaaa agccatactt ggctgggcat 91801 ggtggctcac gcctataatc ctagcatttt gggagtccaa ggcgggtgga ctgcctgagc 91861 tcaggagttt gagaccagcc tgggcaacat ggtaagaccc cgtctctaat aaaaatacaa 91921 aaaattagct gggcgtggta gcacttgcct gtagtcccag ctactcagga agctgaggca 91981 gaagattcac ttgaacctgg gaggtggagg gtacagtgag tctggatcgc gccactgtac 92041 tccagcatgg gcaacagagc aagactcttc ctccccaccc cctacaaaaa aatcaaaaaa 92101 ccatattgac ctcccagctc caaggctcaa aggcactaca tgccttccta ccctagggcc 92161 tttttatgct gtttcctctg cctggaatgc tctgtgagct ttttttgcca ggctaacttt 92221 tacttatctc caggggtcag cataaacttg catttttaag aaaagtattc ccttcaaaaa 92281 aaaaaaaagt agaggatggg catggtggtt catgcctgta atcccagcac tttgggaggt 92341 tgagacagga ggatggcttg aaaccaccgt gggcaacata gtgagaccct gtctctacca 92401 aaacagaaaa aaaagtagag aaaaccctca cccctggtgc actcttacag cacttatctc 92461 aattgtttac cttaactatt atagcattta ttgcaattgt tacaattatt tattgtccat 92521 ctctaggatt attaagcacc acaaagccaa gaactatgtc accactgtac tcagcatttg 92581 gctaagtata tgacacagaa gaatccaaca aatattcttt gaatgaaaga aagaatgcat 92641 aaactaatgc atctagaggc tttatgcctc tagactattg tactgtgaaa cctttcacaa 92701 ttttctaaga aacagggttc aagacaggca acctatatta cacagctgtt agaaaaagtt 92761 actggaaatc ggccgggtgc ggtggctcac gcctgtaatc ccagcacttt gggaggccga 92821 agcgggcgga tcacaaggtc aggagttcaa gaccagcctg gccaatatgg tgaaacccca 92881 tctctactaa atataccaaa aaaattagcc gggcgtggtg gcgggcccct gttgtcccag 92941 ctactcggga ggctgaggta ggagaatcgc ttgaacccgg gaggcagagg ttgcaatgag 93001 ccaagattgt gccactgcac cccagcctgg gcgacagagc aagactccat ctaagaaaaa 93061 aaaaaaaaag ttactggaaa tcattcgagt tgaaggactc tgtaagctta aaaaaaatct 93121 tttttaagag acaaggtttt gtttttaaga gatgaagtct cgctatattg cccaagctgg 93181 tctcaaactc ctggactcaa gcgatcctcc tgccctgccc tcccaaagtg ctgggattac 93241 aggcatgagc caccacgccc agcctaaaaa aatttgtttt aggcatttcc acccctagct 93301 ctctccagaa gtctgttatc accagtgaag agaaaatccc atgctccctt gctgaggaag 93361 tgacccactg gtagcccagt ccttgcattc aagatgaagg aagtgactgt ccacagacgg 93421 aaagccagga agcagggggc agatagccag cccccctcca cctcaatacc ctcttgccgc 93481 ctctccaaag cttctggacg ctgaccctgg tggacccaga ggtgggccac ccacacaggt 93541 ggctgggaaa cataggagac agaaggttcc tactgctggg ggaaaagatg gggaaggcag 93601 gaagctgcag ggtggtgtgt gaggtggcgg ggaggtgact gcaggtgcaa ctaacccaac 93661 tctgaagcta cccttcaccc ttagacaact cttccttcca ggaggtgttt ggccgaatcc 93721 ctctagtctt tagtctagag aggccaaggg agcttccaca ggaggactca gaaccccagg 93781 tggcctactt tcgccacctg tagcagagat ctttgggccc caaaaagaat ggtccgatct 93841 ctccttaccc ttagtcctgt cctaaacctc aaggggctcc atactccctt attttctgca 93901 aagagctccc agccccgggg gactcccaga caccaagatg cccgcgagaa gccttgggtc 93961 ttggcaggga agtcacatcc tccatttccc tcagaaaacc cagaacccgg ggtaactctc 94021 ctaattgtcc tgaaccctag aggatcctgt actcctcaaa ttccacagag cttccactat 94081 aagagaaccc catgcttttc aacttcctta aaaagggtta atttgggggt ctgcggccgc 94141 caggtccctc caaaaagctg gattcctggc agacccagaa tctcttcctc cccgggacct 94201 caactcgcgg gaagcctccc aacccgcctg cccctcggag ccctctgtct ccaggatgcc 94261 ctgcctcccc attcccggag tcccaccctc caaatgcctg gggagcgctg tgcccacccc 94321 gcctcccagc ttttctgagg agctcggacc cacgaggacc caaccagccc gaaacccccg 94381 agagacagcc ggtttccctc aggcgccatg gaaccggact tcggagataa cagcgggaca 94441 gggcggagca gcccttcctt acccgccagc gcggacatgg tcccgggagc gcggaaatgg 94501 ggaggaggag gcgggggaca cagcagctcc cctcagttcc ctacaactcc cctcagctcc 94561 tgccgccgcc gcctcaggcg cgggaagacc acagccccag gcgcgaacgg gatgcaccgc 94621 ctacggcgcc ctgggaggac caaaccacac ctggcacagg ggactgcccg ccgacgcccc 94681 gccccgccgc tctcaggcct cgctcccgag acgcggcccc gccccctggc tctggctcgg 94741 gctctgcagc ggccctggcg ccccctagtg cgagcggcgt cccctcgagg tgctcccgcc 94801 ggtaaggttg attctgcggg ggccggaagt ctcaacacgg atgacagtcc gcacccatct 94861 agtcgccctc atttacagat agggaaagtt aaggatcaga ggctaagttc gactacattg 94921 aaatggcgtc gctcattagg aggagagccg ggatgggagc ccaggcctgc taactgccag 94981 ttcagagctc tttccagtag tgtttggaat caggaatatt acaatggagg agccaggtct 95041 atcccacaaa cagctttgag gatggacttt tcttagaagc ttctagctca gagaaaccac 95101 ttctctgcgc cactatgagt aagctgacag taaataatct ctaagtcaaa gacagttgga 95161 ggcgaccaac cagcattcca ggagcagatc ctcagccagg ccaattaggt ttagcgcagg 95221 acgctggagc ttgtgcgccg cattggaaga cgagcactcc cctgcaggaa catgggccta 95281 ccattcccag gtcttccaac ctttcaaaac atgccaaaaa tccagatatt taggtgaaat 95341 ttgccacttt taaaaaatgt tgatagcatc aaaaagttct taaaatgcca tgtagaccaa 95401 atgaaacaca cctggccact agatttgccc tgtgggctgc cagtttacaa tcttctatga 95461 ggaggaggga ttacaggact aagagttagg acagctgact gggttatggc agctgctagc 95521 tggctgagta gctttgagaa ccacttccgc tcattaggtt tggaaggaga ttggggtcct 95581 ctgtgtgatt aagatattaa tacatcatca gccaggccag ctggtccctg atgacccacc 95641 ctcccatcag attctctcac gctgggtctt tcacaattgt catttcctta catcagattt 95701 agatcacagg aaacatgtcc aaatgaccag aaagaggatg gagcccaggg aaatggattc 95761 tgtcccctcc acccccaccc ctgaccaaag acaaagttca acccagctgc ctctggtccc 95821 agcagttggg taagcaaagg caaaggcaaa gacaaaggcg agcacaattc tcttccttcc 95881 tgctctaatt caaaccggat gccggtcaga tttttctctt ctgggtgtgg caggtggctc 95941 acgatgtcct gagtgtgcac aaagtgctgg ggaggagggc tctgtcccat ttccgttaat 96001 tcctgggaac tcttttcagt actcagtttt gtagtttctg aggaaatcat tatagtaaac 96061 acaagattgt gtattgaatt aaagtgacag gaatccagag actctcctga caagctgcct 96121 catggctagc aaacagggga cagggacttt ccccatcctt cctacagaaa gatgtaacac 96181 tgtattttcc caggcaacaa agcactggca tcctgggaac caaaacctcc tgggtgtctt 96241 ccctctccct ccgttccaca ccaaggcacc caggtctatc tcttctccct cctcattctc 96301 tctcacattc cacttcctcc tacctccagc ctcatggtct tcaccagcag ggagctgggg 96361 aatagccaag tcccccaagt tcctcccagg gaagttcagg cgactctgag aaacacaagt 96421 tgaaaatgta atggaaggga atagctttct atagaaatgc tatagccaag agatatattt 96481 ttaatatgga cccaaagaac attccagaaa tcacttgtga actcaagggg gacttctgga 96541 gggtagagaa agcctgggca gggagtagac caactgagtt tcaatagcac ccaggttacc 96601 tgtccacagt ttgtttcact ggccatagct tggtcagtcc ctagggccag atcctgccag 96661 ccgcctcaca gtcaactctc tgagcctagg atcctgaaac ctggaccttg aaaaacaact 96721 caaggttctg ccctcttgaa ggtactggtc cactggtacc caaccagact tgcccagtgt 96781 cacttccact tgcttcaaac tgagccagtc tctgccctcc ctgaatcatc ctacccaaaa 96841 ctgactgtgt taatcttcca aaagtatgtt tctggacctg tcatctctct gttcaaaacc 96901 cttcaacagc accccactgc ctacaaaata aaagccaaac cagtcctcat gagctgacta 96961 catctcaccc tgcggacttg tctccctatt cccttcactc caggactctg ctttttcctg 97021 agcacatcct gccttgcgcc acctgccatt cctacaacaa ccagctctcc aggttctctg 97081 taattttgtg ccccactctg cccagagtgc agtcttgttt ctcaactcta aatgatatag 97141 tggcatctgt aacatcacaa agtaggacta ccagacatta agtgcctcca ggtaagacgc 97201 accaagtata tactatcacc tatgacgtag tcttacaaaa caaaaataat aagcttgaat 97261 ctggtcaagc ttctagatct aacaaccaac ttacaggaaa tacatggaga agaggagctt 97321 aataacattt ccatgatgca aatagcaaaa atgatgctgt gggaaatgca acaggacaaa 97381 taacctagtt tctttaacag ataaactgca gagtagaaaa aatttcatag acaggccagg 97441 caaggtggct cacccctgta atcctagcac tttgggaggc tgaggcgggt ggatcatgag 97501 gtcaggagct tgagaccatc ctggctgaca tggtgaaacc ccgtctgtac taaaaataca 97561 aaaaaaatta gccgggcgtg gtggcgggca cctgtagtct cagctacttg ggaggctgag 97621 gcaggagaat tgtgtgaacc caggaggtgg agcttgcagt tagccgagat cccgccactg 97681 cacttcagcc tgggcaacag agcgagactc tgtcacacaa acacacacac aaaaaaacaa 97741 aaaattagcc aggcatgatg gcgggcgcct gtagtcccag ctaccaagga ggctgaggca 97801 tgagaatcac ttgaacccgg gagacggggg ctgtagtgag ctgggatcgc atcactgcac 97861 tccagtctgg gcggcaagtg aaactctgtc tcaaaaaaaa aaaaaaaaaa aattcagaga 97921 caaataatag caatgtatga aactgatttg gatctcaatt gaaactacac aagttatgag 97981 acaactaggg aaatctgaac aatgtgattg gatatgatga ttttaagaaa ttttttaaag 98041 gtattttgct tttttttttt ttttgagatg gagtcttgct ctgtcgcccc gactggagtg 98101 cactggtgcg atctcggctc actgcaacct ctgcctcccg ggttcaagcg attctcctgc 98161 ctcagtctcc cgagttgctg agactacagg cgcctaccgc tacgcctggc taattttttg 98221 tattttagta gagacggggt ttcactatgt tgcccaggtt ggtctcaaac tcctgagctc 98281 tggcaatccg cccgcctcac cctcccaaag tgctaggatc acaggtgtgt gccatcgtgc 98341 ccggcatatt ttgcttgttt ttaaaatact tgagatgcat atcgaaatac ttaataaatg 98401 acatgactgg ggtttgtttc cgaatagttc agacagtggg agaggaacag cgtgcgggag 98461 gtatagatgc aacaagactg atctgagttg atagttactg gagttgggtt atggctacat 98521 ggaggcttta ttgtcctgtt ctactactgt tttttttttt ttttccccat cgaaaacaaa 98581 aacctaaggc cgggcatggt ggcttaaacc tgtaattcca acactttggg aggctgaggt 98641 gggagaacct actgagaccg ggagttccag gctgcagtaa gctgtgatca caccactgca 98701 ctctagccag ggcacccgag taagaccctg tttctaaaag ggtctgttgc cctggctgga 98761 atgcactagc atgatcacaa ctcactgcag ccttgacctc ccaggctcag gtgatcttcc 98821 cacctcagcc tccctcccgg gtagctggga ccacaagcat gcaccactgt gcctggctaa 98881 tattttttat ttttgtagag atggagtctc cctgtgttgc ccaggctggt ctcaaactct 98941 tggcctcaag cgatcctccc acctcagcct cacaaagtgc tggggtttgt aggtgtaagc 99001 caccatgcct ggcctaaaat taaaatcttt caaaaaaaaa aaaaaaacct ggtcttttct 99061 cattgggctt atgagctcct gaagttccct cacctttgca ttccccactg tatcttggta 99121 ctattcaatt tgataactgt tagatcaaaa tctaagagga agaaaaaaag ctcttttcag 99181 aggaattact tgtagctctg gcatgttggg aaggctgagg ctgggccaga tttgtaacct 99241 cagcaacatc ttccatctca cactacaccc tgctactaag gaaccttagg gagggacaca 99301 ccttctgctc gagagagagg acttggctga cagcccacct gccctgttac taaacagatg 99361 gagagacgct gacagtgagt aacctgattg ttctaaaggc tctgataagt gacaagctaa 99421 gctctattgt ggcagcttgt cacttacgaa gctcaggaat gccatgtact tgtcaggact 99481 ggggctgcct gtcagacagg ttttatattg ggcaaggctg tgtatgaagc caggcctcca 99541 gcctacattt atcatctcta attatcaata gtcgtttaaa gactttcttt cctttttttt 99601 aagagacagt ctcgctttgt cacccaggct ggagttcagt ggcatgatca tagctcaatg 99661 cagccttgaa actcagggct caagcaatcc tcttgcctca gcctccagag taactaggac 99721 tacaggcata agtcatgccc agcttctttg aagactttcg ttctcagtgt gacattacgg 99781 aggatgcaca ggagcctcca ggagggcata ccagtgtgac tactccccac ccactcgccc 99841 tccacaacac agacaagaag tccaagctca tagtaatgta aaaccatttg tttaattcta 99901 aatcaaatca ctttcacaac agtgaaaatt agtgactggt taaggtgtgc cactgtacat 99961 atcatcattt tctgactggg gtcaggacct ggtcctagtc cacaagggtg gcaggaggag 100021 ggtggaggct aagaacacag aaaacacaca aaagaaagga aagctgcctt ggcagaagga 100081 tgaggtggtg agcttgccga gggatggtgg gaagggggct ccctgttggg gccgagccag 100141 gagtcccaag tcagctctcc tgccttactt agctcctggc agagggtgag tggggaccta 100201 cgaggttcaa aatcaaatgg catttggcca gcctggcttt actaacaggt tcccagagtg 100261 cctctgttgg ctgagctctc ctgggctcac tccatttcat tgaagagtcc aaatgattca 100321 ttttcctacc cacaactttt cattattctt ctggaaaccc atttctgttg agtccatctg 100381 acttaagtcc tctctccctc cactagttgg ggccactgca ctgagggggg tcccaccaat 100441 tctctctaga gaagagacac tccagaggcc cctgcaactt tgcggatttc cagaaggtga 100501 taaaaagagc actcttgagt gggtgcccag gaatgtttaa aatctatcag gcacactata 100561 aagctggtgg tttcttccta ccaagtggat tcggcatatg aaccacctac tcaatacttt 100621 atattttgtc tgtttaaaca ctgaactctg gtgttgacag gtacaaagga gaagagatgg 100681 ggactgtgaa gaggggaggg cttccctcat cttcctcaag atctttgttt ccataaacta 100741 tgcagtcata attgagaaaa agcaatagat ggggcttcct accatttgtt ggttattgct 100801 ggggttagcc aggagcagtg tggatggcaa agtaggagag aggcccagag gaaagcccat 100861 ctccctccag ctttggggtc tccagaaaga ggctggattt ctgggatgaa gcctagaagg 100921 cagagcaaga actgttccac caggtgaaca gtcctacctg cttggtacca tagtccctca 100981 ataagattca gaggaagaag cttatgaaac tgaaaatcaa atcaaggtat tgggaagaat 101041 aatttcccct cgattccaca ggagggaaga ccacacaata tcattgtgct ggggctcccc 101101 aaggccctgc cacctggctt tacaaatcat caggggttgc ctgcttggca gtcacatgct 101161 tccctggttt tagcacacat acaaggagtt ttcagggaac tctatcaagc cataccaaaa 101221 tcagggtcac atgtgggttt cccctttcct tgcctcttca taaaagacaa cttggcttct 101281 gaggatggtg gtcttttgca tgcagttggg ctgacctgac aaagccccca gtttcctgtg 101341 gcaggttctg ggagaggatg cattcaagct tctgcagcct aggggacagg gctgcttgtt 101401 cagttattac tgcctcggag ctccaaatcc caccaaagtc ctgactccag gtctttccta 101461 atgcacagta gtcagtctca gcttcggcag tattctcggc tgtatgttct ctggcagaga 101521 gaggcagatg aacatagttt tagggagaaa gctgatggga aacctgtgag ttaagccaca 101581 tgtctcacca ggaataattt atgccaggaa accaggaagt cattcaagtt gttctctgag 101641 gccaaagaca ctgagcacag cccagagcca ataaaagatc tttgagtctc tggtgaattc 101701 acgaagtgac cccagcttta gctactgcaa ttatgatttt tatgggacag caatttcttg 101761 catctctaca gaggaagaag agggggagtg ggaggggaag gaaagagaac agagcggcac 101821 tgggatttga aaggggaacc tctctatctg aggagccccc actggcttca gaagcaactt 101881 accaaggggt atttaaagac atgaaaattt ccagaaatac catttggtgc atccctttgt 101941 ttctgtaata ttaaactcag gtgaaattat actctgacag tttctctctt tctgcctctt 102001 ccctctgcag agtcaggacc tgcagaactg gctgaaacaa gatttcatgg tgtcacccat 102061 gagagatgac tcaatgccaa ggcctgaagt tatagagtgt ttacagcggt ggcgatattc 102121 aggggtcatc gccaactggt ctcgagttcc aaagctctga tgaagaaaca agactccttg 102181 atgtgttact gatcccactg attccaggag tcaagattag ccaggaagcc aaacaccagg 102241 agttggggtg gcacgtcacc agtccagagc cctgccacgg atgtaggcag gagcccagca 102301 ttaggcaatc aggagccaga acatgatcac cagggccaca aataggaaga ggcgtgacag 102361 gaactgctcg tccacatact ggggtgtccc agggacagct ggagagacag aaaggagaaa 102421 acaggctatt agaacactgc agctatgcac actgcccact caggccagaa gttgacagca 102481 tcaatcttta tctctttaca acatcagaat cctacacaca aaaagatttc tttttttttt 102541 ttttgaaatg tagtctcgct ctgttgccca ggctagagtg cagtggcacg atctcagctc 102601 actgcaacct ccgcttccca ggttcaagcg attctcctgc ctcagcctcc taagtagctg 102661 ggattacagg tgtgcgccac caagcctggc taatttttgt atttttagta gagacggggt 102721 ttcaccatgt tggtcaggct ggtctcgaac tcctgacctt gtgatccgcc tgcctcagcc 102781 tccaaaagtg ctgggattac aggcatgagc caccatgcct ggcctttttt tttttttttc 102841 tttttgaggt ggagttttgc tcttgttgcc caggctggag tgcaatggca cgatctcggc 102901 tcactgcaac ctccacctcc taggttcaag cgattctgct gcctcagcct cctgagtagc 102961 tgggattaca ggcacgcacc accacacccg gctaattttt tttttgtatt tttagtagga 103021 acagggtttc accatgttag ccaggctggt ctcaaactcc tgacctcaga tgatccgccc 103081 acctcagcct cccaaagcgt tggcattaca ggcatgagcc atcgcgccca gccaaaaaaa 103141 atttctaaac cccaaataat atcaggcaaa atgaaactat gtggaacaat aacaataagg 103201 acaagacacg gtccaattta tcataaaagg tttggcctgc acatgcaaat aaaccaggtg 103261 attaacagga aactcttgca ggaaataatg aaagcaagcc tactatttgt taagtatctt 103321 ctatctgcta agtgttttgt acatgttatt atttggtgga tattattatt tccattttga 103381 gtaggaagca gctgaaacat aggtttgagg tgacctgttc aagattgaaa ggctcagaaa 103441 ttagtggcag agctctgcag cctctccaca cactgcctct caagggcccc acagggattg 103501 ccttcctttt ccagaagaga actaggatcc agacaggcaa aatgacttgt ttttcaaggt 103561 agaggcagaa ctggggtgag aacctaagtc tccactcctc tctgctttag gaaaagctac 103621 catttagagc tccctgtcct gactttgggg gaaggaggat aaaagaatct gtgtccagga 103681 tgggcaaatt gtaccaagca tgacaaggga aaactgtgac tgtggcaggg ggataaagta 103741 atccagagtc tagcctggct gtggaactct gcttagctgg ggaatccaag gttagccttc 103801 gaccaggtta gaaggatgca agctatcagg caggttcttg ctggcttcac aggaatttgc 103861 cctttcttca gggcagagta caccctatct tttggggtat acttcctctg aaaaacaaat 103921 gtgaacagag gataaacctt caaggcagat aaagtaagaa agggcatccc cagggccggg 103981 cgcggtggct cacgcctgta atcccagcac attgggaggc ccaggcgggc ggatcacgag 104041 gtcaggagat cgagaccatc ctggctaaca cagtgaaacc ctgtctctac aaaaaaatac 104101 aaaaaaatta gccaggcgtg gtggcaggcg cctgtagtcc cagctactcg ggaggctgag 104161 gcaggagaat ggcgtaaacc tgggaggtgg aacttgcagt gagccgagat tgtgccactg 104221 cactccagcc tgggggacag agcgagactc catctcaaaa aaaaaaaaaa aaaaaaaaag 104281 gcattcccaa gatatcttgg tgctgagatt cctcagctag atgctctgat ggattcacga 104341 agacagacag gcaggcaggc aggcagtcaa gaaagactta tagatcacct gctgtgtacc 104401 aaggaccagg gtggggatga gaataactgg gatgacccag ttaccatctt tagggcataa 104461 gctcttggcg tgaaccacac aatggtcact tctgtaccat atgctgtgtc cagtgcacat 104521 gaagctccaa caagtgcagg cagagctcaa ccgatcatgg catctctgca ggctccacag 104581 taaggatctt ggcttgggtg tcagactgtg ataagagttc aaggcaccca cctcatagag 104641 gtggcttgca cctatagagc aagggctcta taccgaatgc ggatggcact ttcaagcagt 104701 aggagggaga actgcagcag cagttggtag acctaggcat ggttcccaca gggctgaggc 104761 tgcttctgac cagcctcatg ttgtgtcctg aaggaagcta gacagagatg ccagagaatg 104821 gagatgcctt cactccttat gggtcatctc acatttccag gcccccaaat ggtcaaagtg 104881 ggaagtaaat gtgcagaact aggcttttcc tcagctgagt atcacttaca gttttccatt 104941 ttatagacat ggggtatgtc ttggtaacat tactgagcat tttctgatct gttaaacgca 105001 gctaatcaca cacctgttcc atagggtggt tgtgatgatc ccctgagctt ggaatggaaa 105061 gtgctatata tatgcatgcc taatagttgg taagtgctca gcattgttac catgtctgta 105121 agtgatctaa aaacagctta agatattttg aaaaggctga ttatccatgt agtgacaatt 105181 atctgtaact tatatactgc tgaattccaa ggagctttca agcaatatta ctaagatatc 105241 ttcctaattt ctcaaggaaa tagggtctta cctggaggag gccgcccatc atttatatta 105301 aatgctgtgg caaatatccc aaagggaaat gccccaattc caaaagacat ctggaagcca 105361 ccatctccaa atccaaatcc ttgaaatccc tgtgtggaag aaaggcaatt aagggcagga 105421 gaatcaggca ggtttctcct gtgcctgtca aaactcaaca tacctcacgt tgtcttgcat 105481 gtggcacctg cctcttctcc ctaccttgtg ccacatgttt cgaaggtggg ggacagcttt 105541 tcctcccata tataccctga agtagaggaa ctctggcaag ccctctgccc ttttagcctt 105601 agcttccagc taagaatcta tgaatgggcc aaaaaggact ggctctgacc cagtgtgaaa 105661 cactctgctg acataagtca gcactcacag ggcttgagga tattttccat agtcccacat 105721 tagctagtaa caccttctac atgctttata ggagcaatat tgacagccac ccctgtgccc 105781 aagggaacat caaacaagct acacattatc tttaatagca aaaatgttga aactctaatt 105841 agccaacaat ggaagtcatg atcaaaatct tgatacatgc attcaaagaa ccattaccct 105901 gacttaagcg gtaacactga acaactcttg tgaccaagtt atcagacaaa attgtttgaa 105961 cactgtggtc acaactacat gacaatatgt aggtatatgg acaagttacg gattaaaatt 106021 tcttttaata ctgttacata gaatttttac attttaaaac actataatgt tttctggtta 106081 taaaagtaat atatacttgc tataagaaac tcacatatta cagaaataat tgacagagtc 106141 agagatcttc tattcacctt cccacttcgc agccactcca ctttacctct gaaataacca 106201 ctatcagcag tttggtatat caccctcagt tttctttcca tgtgcatgtt ctacactgca 106261 tcatctttca catgaaaaga ttcagcagga acaatactgt gaataaataa ctcccttttt 106321 taaagcgtga gctctgtgct aggcaccatg ttaagcacta ggacatataa atcatgttta 106381 atttttacat tggcttcatc tgcatttcac agatgaccaa actgaggctt ggagaggtga 106441 aataacttga ccaaggagta cagtttagat ctaaaactag gtcaatttga tctgaaaaca 106501 catgcttttc tttttctttt cttttttttt gagacagggt ctcgctctgt caccaagcct 106561 ggagtgcaat ggtgcaatca tggctcactg tagactcaac ctcccaggct caagcaatgc 106621 tcccatctca gcctcttgag tagctgggac cacaggtgta cccctccatg ctttgctaat 106681 ttttaaattt tttgtacaga tagggtctca ctttgttgct caggctggtc tcaaactcct 106741 gggctcaagt gatcctccca cctcagcctc ccaaagtgct gggattagag gccagagcca 106801 ctgtgcccgg ccttcatgct cttaattact gtgcgcatct ttcaaatatt tttttctctt 106861 cccctgcacc cctgatgaga ggtttagaac aaatgatttt aaaaataata aaaagcagac 106921 agactattgg gtcaaatgct cttcacaaag agctcaatgg ccttcattta tgagtttttt 106981 gcaagtcaca aaaatagctg ctctccatca cacactgatc agcctggcat ctagttccaa 107041 acttcttgcc agatggcatc acctataaag cctaacagct gacttaatga gagggcaaac 107101 ctaaagggag ggaaaacaaa gagtttccat ctcctcaata caaaatgcct cctactcagt 107161 tcatggttga atctaatggc tcctgggact gaggctgcct tacttgggaa ggtgcaattg 107221 tttcatcagg aaccacagat tcattcagtg caatcctgag gactaaccgt tctgcttccc 107281 ttctagcaac atctttatga tctcaccagt ccatacattt ggtgggatta tgttcctggg 107341 ccagagtatg caaaaaaaca tcgctagtag ctgcaggtgc ttataaatgc tagaggtatg 107401 cactttggtc aacaaggcca gggcctctgg cttctggggg cctctgattc ctgcattagt 107461 tgctaatgct ctgtaggaaa ccaaaaccta ggaatgttaa agtggtgtca cctggtggtc 107521 aagagcagag acttcaatac cagccagatc tgggtttcag cactgtacgt gtggccctgg 107581 acagatcact ttacctatct gggcctcagt ttcctcctcc gttaaatgag ttgatgtgag 107641 gcttaaatga gatcatgcat ataaagctct taacatatag ttatttccat atagtaacca 107701 ttcaacaagt gttcactgct agtattatta ttaagggtaa tccggagaag agaaccttaa 107761 tttctttctt tctttttttt tttttttgag acagagtctt gctctgtcgc caaggctgga 107821 gtgcagtggt gtgatctcag ctcactgcaa cctctgcctc ccgagttcaa gtgattctcc 107881 tgtttcagcc tcccgagtag ctgggactac aggcacatgc caccacaccc ggctaatttt 107941 tatgttttta agaaagcaga gacagggttt caccatattg gtcaggctgg tctcgaattc 108001 ctgacctcag gtgatccacc cgcctcagcc tcccaaagtg ctgagattac aggtgtgagc 108061 caccacgccc agccgagaac cttaatttct ttcaagatcc ctgactcaat aatcctactc 108121 ctggaaatcc taataaaatg ccattattca aactcaaaaa aaaaaaaaaa aaaaaaaaaa 108181 aagaaagtgg cagtgcgcgg tggctcacgc ctgtaatccc agcactttgg gaggccaagg 108241 agggcggatc atctgagctt gggagttcga gaccagcctg accaacatgg agaaacctcg 108301 tctctactaa aaatacaaaa attagctggg catggtggcc catgcctata atcccagcta 108361 ctctggaggc tgaggcagaa gaattgttta aacccgggag gcagaggctg tggtgagccg 108421 agatcgcact actacactcc agttgggcaa caagagtgaa acttcatctc aaaaaaaaaa 108481 aaaaaaaaaa aagtgtgcaa agatggttat tgtagcatta tttaaacagc aaaatgggga 108541 aacaactaaa aggtccttcg atgaggaaaa cccaaattac aatgaaccaa aactcaatgg 108601 gttcccagca atagcatgga aaaataaaaa ataataaaaa actcaatagg ataaatggca 108661 gcctaaataa taaagaggac acttcagagc atcatggaaa acctgccaaa attaaatttt 108721 aaagcaaaca gaatggtgat ggctacataa aaatacatat gcacagccta cagatgaaga 108781 gaacatgagg atatgagaat atacttgttc tgggctcttg gcaatggtgg gtaaatattt 108841 tgcttttaaa atttctttaa cattttaaca gaattttttt aagagtaaca tttgatgtaa 108901 atatgcaaaa aatgtaaatg tttttgttcc tagccactag accactagag acaaatgtgc 108961 aatttttttt tagccagtaa tcttacttct aagaatttat cctaagccag gtacagtggt 109021 tcacgcctgt aatcctaaca ctttgggagg ctgaggtggg aggattgctt gagcccagga 109081 gttagagacc agcctgggca acatagtgag accttgtttc tacccaaaca aaacaaaaca 109141 aaaaaaagaa agacaaaaaa aaactcaaaa aactagctgg gcatggtgcc gtgcacctgt 109201 ggtcccagct acatgggagg ctgaggtaga ctgagcccag gaggtcgagg ctacagtgaa 109261 ctgtgatcac accactgcac tccagcctgg gctacagagt gagaccctgt cttatcctaa 109321 taaaataata atgaacatgc tcacaaattt atctgtaagt tcatcaggat tttttttaac 109381 ttggaaaaag aaagccgcta atgtctaaca aaggattgaa ttatgttaaa tccacacaac 109441 taaatgctaa acagctagtt agtaaaatat tgctggcaat gtaaaatctc aagataatac 109501 taaaagactc ccactcacac aaagaaggta ctaagtatct gtagaacaaa tgaaatttta 109561 agagagggtt acaaaacagg agccacatat tgtaaaaaac aaaaaacaaa aactgtaaaa 109621 agtatacaca catctgtaaa atacacgcac agatttgtaa aatttgtgtg tatatataat 109681 tatagaaaaa tggaaaagaa aatagaacca aaatgtttcc agtggttacc tctggattgg 109741 tgtggttata agtgattttt ttccccctct tttcctttct gtattttaca agttttcctt 109801 tacttttata atcaaggaga aaaattgtaa ggctaagaag gaaacttcta aagccatcag 109861 ctcatttgta gaagcttctc ctagaatgtt cctcacccct ctattctccg gctctggcct 109921 ctgtccttga ggacgaggag gggtcttctc tctgaaacag aaatggtgta aaataagtct 109981 attacaaatt gctttgagtt gctccctgtg aggacctgga gctctttacc aacagggtca 110041 gggccatagc ctctttcttg tccaggacat gaacttatgc cagattaatt ctccaaatac 110101 gaactcttca ttctacctgc aactgttctt cccacgtgta tagagaacca gggagaaaat 110161 caactcgtta tgagatattc accagattgc actacagcct actcctctta cagagatcct 110221 gactcagatc ggttaggaat catggccaga tgccttccct gggcctgtta cagataacag 110281 aaatcccaaa tccatactgg aacagagggt gggttgtcag cagtaagggc tgaggttcat 110341 ggtttctctg accttcccct attaacttca gctctatctc ctactattct tgggcaaata 110401 cccaatgcta ttatttttga gtctcacccc ctcaaatcac tcaacccatg aatactcaat 110461 tgagggctta gttatacaat aagcattatg caggatgctg taatactgat gacagaccca 110521 acccttgaag tgttcatggt ttcaaggggg ataaagccag tcatatgaca ctgtggtggg 110581 aactgggtga gagcaactaa tgtgaagaag acagaccact tcaaggaaga gggaacttta 110641 gggctgctca taagagatga gtagaagtta gaggagcagg tgggggtaga ggggtagctt 110701 gagcaggact ggggagggaa gaaggcagaa gagagagcaa atacacaggc tgataaatgt 110761 gaaagagcaa gtccctttaa gagaaagtct aatatgtcgg gtacacagca tatgctaggg 110821 tagctggata aaggtaagag attcttctgg agagcttggg gatgccagac catgaagggc 110881 tctcaatact tgtgtttgtt gaatgctgga cccatgcaca tctccatttc tagaaatccc 110941 aaactccaca aagtcaccca gatcctgttt cccctcccgc catgctttat acatcactga 111001 tgcagcaacc ctgctccctg gagcctctcc tgctacatat cctgcttccc aggttgcaga 111061 aatgctcttt cacagggctc tgcacacttg tgattgtgat gtctgttcac tgctgggtcc 111121 tagcaagtgg caacttggat cccctctctt ccctcccctc ctagctgact ctcatctgga 111181 tgagactgct tcgaaaccaa gagatcatct agtactaagt aaagggtggg gttaggggat 111241 gctgaatggg agtgaattca catccaaggg gagaagtggt tgggagaggc atcctgagtc 111301 ctcacctggg gtcctgttgc ccagtgctgc cccttccata gagggggatg accttgtctc 111361 ggctgatgcc agctttgcaa acaggacaca cctgtctgtt aggtctggtc tccaaccact 111421 gcaaacagga gagaaaaatg catgtggatg gcccaagacc caccagagtg atcagccaga 111481 gccaccagaa caacacaggc cagaggcctc aaactgggaa acaccagcat gagcagggaa 111541 catctcaagg aacctgactc ccattccttc tcctaccatt cagcaaatac ctaccagtgg 111601 taaaatcaaa attgccttta aatatcatca taattggccg ggtgcggtgc tcatgcctgt 111661 aatcccagcg ctttgggagg cggaggtggg tggatcacct gaggtcagga gttcaagaga 111721 agcctggcca acaaggtgaa accccgtctc tactaaaaat acaaaaatta gccagacatg 111781 gtggcgcatg cctgtagttc cagctactcg ggaggctgag ggaggagaat cgcttgaacc 111841 cgggaggcgg aggttgcagt gagccgagat tgcaccactg cattccagcc tgggcaacag 111901 agcaagactc tgtctcaaaa aaaagaaaaa aaatcatcac aattaggctt cttcatgctt 111961 tgtttctctc tagggcctcc tagcaacaca gggggaggga aactgcaagt ctggggctgt 112021 caagtctgac atgcccagcc tcaatttagg ctgctggtgt aagtacagca aactttaaat 112081 ccccaccttc ccagtaggag caactgtatt ttcctgtcag tcatattctt ctttctgtga 112141 tttctcattt ttttttcccc agacagagtc ttgctctgtc acccaggctg gagtgtagtg 112201 gcatgatctc aggtcactgc aacccctgcc tcctgggttc aagcaattct cctgtctcag 112261 ccttccaagt agctgggatt acaggcgccc accactgcgc ccagctaatt tttgtatttt 112321 tagtagagac agggtttcac catgttggcc aggctggtct caaactcctg acctcgtgat 112381 cttcctccct tggtctgcca aagtgctggg atcacaggca tgagccaccg cgcccggcca 112441 atttctcatt tttatagcat tttcccaaca actgtgtatc ccaggcatca gaatcaggag 112501 ataagctgat cattttctgg tgacttacca catgctaaag gataggtttt gctcattatc 112561 taaatttgct taatgctttt tgtgtacttt cattgatttt ttgcattcac ctatttcttt 112621 caatataaaa aaaatcccgt tttgggtttc tcaggccctt tcactggtta cattaaaatt 112681 ttttgtttag gccaggcgcg gtggctcacg cttgtaatcc cagcactttg ggaagctgag 112741 gcgggtggat cacctgaggt cagcagttca agaccagcgt ggggaacatg gtgaaactcc 112801 gttgtctcta ctaaaaagac aaaaaaacta gccgggtgtg gtggtgtgca cctgtaatcc 112861 cagctaccca ggaggctgag gcacaagaat cgcttgaacc cgggaggcgg aggttccagt 112921 gagccgagat ctcaccattg cactctagcc tgggcgacag aatgagactg tctcaaaaaa 112981 aaaaaaaaat taattttctt aaaaaaacaa gtgataaaaa atgtttttgg tttgtttttg 113041 tttgtttttt gcttgttttt gatttttttt tcggtttttt ggtttttttt ctcgaaacag 113101 agtctcacta tgtcacccag gctggagtgc agtggtgcaa tctcagctca ctgcaacctc 113161 tgcctcccag attcaagcga ttctcgtgcc tcagcctccc tagtagctcg gactataggc 113221 gtgcaccacc acgcccagct aatttttata ttttttgtag agacggggtt ttgccatatt 113281 ggccaggctg ttctcaaact ccttgcctca agtgatccac ctgccttggc ctccctacgt 113341 gctggaatta taggtgtgag ccaccacacc cggccaaaat gtgtttttat agacattcta 113401 taaacatgaa tctaggactt cagggctgag gagatggcag cactgaatga ttttatttac 113461 tactcttact gttcatagct ccaaaactta atgtgttatg tgttcttcat aaactcatag 113521 tcaatatgta ctaccaggtc ccgctgggga cagaatgtgc agccctgttc ccaccttctt 113581 tctcccatcc agattctgaa gctttagcag tctggaactt ctgtcctccc ccacattccc 113641 aggccaccta cgagtgtggt actctactgc tgacgaccag ggagactttc agcaggctgt 113701 gattattcac agccacttgt gaatgccagg cccagctctg tctcaccaag taggtgctaa 113761 agctaggaaa atagactttt ctctcctcac aaaagctctt tgttgacacc aaaaaaattt 113821 tccccttcct aatattttca taacctttaa gggctggggg actgttgata tgataggagg 113881 cactgtttta atagaagcag aatctcaaaa ataaaaagga cctaactaac atactatcaa 113941 tccaatccaa tttataaatg agtaactgag atccaaaaag gccctctgtg gcagaattag 114001 gaccatcccg atttcatcaa gtgtctgtca gtggacaggt atgtaaagaa ggctgtgggg 114061 gtaagtaagc aaatgagccc acaagcaggg cagattgcag cttctgaaag cagtctcctg 114121 agactagggc tctcttgtgc aaccagagct gatgaaagac aaagtaaaat gaaatgcatc 114181 ttacctgatg taaacacggc caactgccag agtccccagg gcaaccagtc atcaaggaga 114241 aggagaaaag gggaaaggaa gaaaaatgtt ttcattagca gaaacacttg atgggtggag 114301 tggtagtagc agaatgagaa aacaggagta aactatggag cactgcatgc ccattgaggt 114361 tctgtaggga ttactggtat cggaaattaa aaaaaaactg gggtaaaaat cacagaaaat 114421 aaataacgca atggagataa gaccccagcc tctgcccctg agatctcagt atatgtgaaa 114481 tccccagggc tcagaccaca cagacccagc ttctctacaa atcaccctgt gagaggttac 114541 aacatctgaa ggaatccaat aaaagctaag gacaccctgt gcaaaaaatt gatgggggca 114601 ttcacataca catgtactac acactatttt acactcaatt tcagtgggtt catagtctcc 114661 tcgggctcac aagggatgtg tggactctca ggtagtccag atccagttta acaaaatctt 114721 caggctgtcc tcatccttga gaactctcct agcaggaagc agcttaaaag ttcaggtcta 114781 ctctaaaagc atctttaagt aatagaaagg atggggatgc tacttgacag aaagggttac 114841 cttcagctta gagagctgct gtccaatctc tctccatttt caaatgagga aaccaaagct 114901 cagaaacata aaatgaactg catcaggatt aaaaaaaaaa gtctcctgac ttctagttca 114961 ttgccctttc tataatctaa caccacatac aaaaaaaaaa tgatatctac agaattgaaa 115021 gaaaagtatt caactccact tttttttttt tttttttttg agacgaagtt ttgttcttgt 115081 tgcccaagct ggagtgcagc tcactcagct cactgcaacc tccacctccc aggttcaagc 115141 gattctcctg tctcagcctc ctgagtagct gggattatag gtgtgtgcca ccacgcccag 115201 ctaatttttt gtatttttag cagaaacggg gtttcaccgt gttagtcagg ctggtctcaa 115261 actcctgacc tcaggtgatc cgcccacctt ggcctcccaa agtgctggga ttaaaggtgt 115321 gagccaccgt gtctggccca catttttaat ttctttctct cgtttattta gttaacacat 115381 atctactgtg tacctactat gtaggccagg acagtggaaa atgcaggcat ggcttctgta 115441 cctcacatac caagtatctt tagctgagta cactttgaac tcaaaggata gttccttttc 115501 caattcctga accacagttt agcttcacaa accaaataca caatcttcct ggccataaca 115561 tctgactcca acattaatct tgcaatgtca ttctaaactg agtggagaat aaccccaaat 115621 catggcagat ctccactggc aagttccctt aaagactcta ggggagaagt ggaggtgaat 115681 gaatgaccct caggggaaag tcaggcttta taacaacgac cattcctccc atttattaag 115741 caacgatagt catgtatcac tgaacgacag ggataattct gagaaatgtg tcattaggca 115801 atttcatcat tgtgagcatc acagagtgta cttacacaaa cctagctgaa atagcttact 115861 acacacctag gctacatagt atagcctatt gctcctaagc tacaaacctg aatagcatgt 115921 gagtatcata ggtaactgta acagagtggt aaatatttgt gtatttaaac atacctaaca 115981 tagagccaag gtgtggtggc tcatgcctgt aatcccagca ctttgggagc tgaggtgggc 116041 agattatttg aggtcaggag ttcgagacca gcctggccaa catggtgaaa ccccatctct 116101 attaaaaata caaaaactag ccatgtgtgg tggcacatgc ctgtaatccc agctacatag 116161 gaggctgaag cacgagaatc acttgaacct gggaggtgga ggttgcagtg agctgagatc 116221 gtgctactca atctggagtg ctactgcact ccagcatggg tgacgcagtg agactccatc 116281 tcaaaaaaaa ccaaaaacca ggctgggcgt ggtggctcac gcctgtaatc ccagcacttt 116341 ggaaggctga ggcggttgga tcacctgaga tcaggagttc aagaccagcc tggccaacat 116401 ggtaaaaccc cgtctctact aaaaatacaa aaattagtca ggtgtggtgg cacgtatgtg 116461 taatcccagc ttctcgggag gctgaagcag gagaatcgtt tgaacctggg aggcagaggt 116521 tgcagtgagc caagatcacg ccactgcact ccagcctggg caacagagtg agactcctca 116581 aaaaaacaac aaacaaacaa acaaaaaatc caacatacct aacacagaaa aggtaacgca 116641 ctgctctata acattttaaa atcactagat tataggaact tttcaggtcc atcatcatct 116701 tatggaaccg ctgttggata tgtaatccat tgttgaccaa aatgtcatta tgtactgcat 116761 gactgtactg tgtgccacgc tgtggctgaa taccttttca tttcttaatg tcactggacg 116821 gtcacattaa cctatgaggc agtaccatca tctccatctt acagatgagg aaaccaaagt 116881 acagggaagc tagcagattt gctcagggat gcaccatcca gactggaagc atctgtttca 116941 ttttagagtc tgtgttcttt ccaatggcat tctgccttta aaaaaaaaat tgatagcagg 117001 catggtggct cacaccagta atcccagcac tatgggaggc cgaggcgggc agataacgag 117061 gtcagaagat cgagaccatc ctggccaaca tggtgaaaac ctctctctac taaaaataca 117121 aaaattagct gggtgcacct gtaatcccag ctacccggga ggctgaggca ggagaattgc 117181 ttgaacttgg gaggcagaga ttgcagtgag ccgagattgt gccactgcac tccagcctgg 117241 tgacaaagca agacttcgtc tctaaaaaaa aaaaaaaaaa tttgacacag ggccaggcat 117301 ggtggctcac gcctgtaatc ctagcacttt gggaggtcgg ggtgggcaga tcacttgagg 117361 tgaggagttt gaaaccagct tggccagcat ggcgaagccc catctctact aaaaatacaa 117421 aaagaaaatt agccgggcgt ggtggtgcat tcctgcagtc ccagctactt gggaggctga 117481 gacaggagaa ccacttgagc ccaggaggca gaggttgcaa gtgagccaag actgcaccac 117541 tgtactccag cctgggcaag agagcaaggc tgtctccaca aaaaaaaaaa aaaaaaaaaa 117601 aaaaaagata tacaacagct gtacatttaa atatatgtat agttaatata tacataaaat 117661 acataattta taatcttaaa atttataatt ataaaattta ataaatacat acacaaacac 117721 aaacacacac acacacacac acacacacac attttttttt tgagacagag tcttgctcca 117781 tcacccaggc tggagtgcag tggcgcgatc tcagctcgct gcaagctctg cctcccgggt 117841 tcacgccatt ctcctgcctc agcctcccca gcagctggga ctacaggtgc actccaccac 117901 acccggctaa tttttttttt tttttttttt tagtagaaac gggtttcacc gtgttagcca 117961 ggatggtctc gatctcctga ccttgtgatc cgcccacctt ggcctcccaa agtgctggga 118021 ttacaggcgt gagccaccgc gcccagccca gaatatatat atatttttaa gagacagggt 118081 ctctctctgt cgcctagact ggagtgcagt gatacaatca tagctcactg tagcctcgaa 118141 ttcctgggct caagccttcc tcccacctca gtctcccgag tagctaggac tacaggtacc 118201 gccaccatgc ctgactagtt ttaaacatag tggggcctca ctatgttgcc cagactcatc 118261 ttgcactcct gggctcaagt gatcctccca ccatggcctc tcaaagtgtt tggattacag 118321 gtgtgagcca ctgagcccag ccataatagc tgtacacatt ttttatgtta tatgtgataa 118381 tttgatacat tcacataatc aaatcagggg tatcctgtct ttgaattctt aattttgtca 118441 atcatttcaa gtcctggttt tacgataccc tgaagtatcc tgaatttcta ggagaatgtg 118501 gatcacaaca gcttttacac aagatggtaa ttagggcaag agcctcaaga acaaaaatct 118561 ctgctccatt ctatttattc accaataaat tgatgtttta cgctgttgcc aacttaaaat 118621 cccttaaata ttgtttaaac ggccttttct tcacacgatt tctctgctca aaaatattca 118681 ttacctctat gtccctttag tcgaatctgg gtttgataat ctggctcaac actactgtaa 118741 ccattatttt tcctaacact cccagaccct catccagagt gctgtgagag gtatacagta 118801 gcctcaaggg tctcagatga gtccaacagt gcatgtgcct gtccctagaa actccctgcc 118861 ttggagtgac acacacaccc ttcgccttta cctagcattt tttgatcgac ctttcaaggt 118921 tcagctcaaa atctcaagac aaaaaaaaac acacacacac acacaaaaaa ccaaaatccg 118981 accttcacct agcacctagg ttctcagagc cagaaagggc tcctgtaacc gtcaaggctg 119041 gaacagacac cgaccttgac acttcatttt gatgagtgta tctatctata tataaacata 119101 aaatatagta acaattatag tcatagctaa cacatagtat ttactgcaag ctatgtgctc 119161 ttctaagcct tttacatatt ttgagtcatt taatgcttgc aaacgaccct gtaaaatgag 119221 gcagaatgtg tgtaagtacc ttgtccaagg tcaaacagga aggtggcaga gtcaagcttc 119281 aaaaggagga agcccagttc cacagtcctt gcttataacc attatgctat actgcctcta 119341 ggctcagcac agcaagtagc aggtagtaag tgctcactaa atggtaagta ttattacttt 119401 agagatgggg agaacgagct tagagtggag acaggatttg cccaaggtca tccagctcat 119461 tggtagcaga gctgggatgg gaactcaggt ctcttgtccc ttgagcctga gtgtattccc 119521 aaagtgagca ggaagcattc tgctccaaac cacccagggc cacaggcttt ccaaacttgg 119581 caatgtgctc tctggggtgg aagtaggggt actgaccaga agaggtggcc acacaggctg 119641 atgacggcat ccttggctgt gtccaagcag atgttgcact cgaaagtgct gtcctgccct 119701 ccgctctcgc cagcgccatt gctgctccca ctgggccccc ctgcactgga gttctcagga 119761 gatgcagagg ccgagggccc cttgcttgcc atccttggct gtcagcgaag ctctgagtga 119821 agactttccc cagggaaatc ctaaaaggca gagaactctt gactgtccat ttctgcagcc 119881 ctctcccccc agcttccatg ccaacattcc tgtcttctct aggctccaga gctgattatg 119941 actgaatccc tgagcagact tcccaaagcg ctacagaaga caagagttgt agggcactac 120001 ccaaagggca gtgacccaga aacttctcat gaatatggtc cagaatatat cacacagata 120061 cgcaaactcc accaaataca gagcagcctc cagatggttc tccacaggaa ccaggacctt 120121 gtctcacttg tctttagatt cccccatgac aatcaaccca ggtgacaaat acatgctaca 120181 ttaaatggaa tgttcaggct gtccagcctc tatattgatg ggtgaatctg ctgaggcagg 120241 aaggttgtga ttgtcatatg ccagaacata tatatatatt tttaagagat ggagtttcac 120301 tctgtcgccc aggctggagt gcagtggcac aatcttggct cactgcaacc tctgcctccc 120361 aggttcaagt gattctcctg cctcagcctc ccgagtagct gggattacag gtgcctgcca 120421 ccacacctgg ctaatttttg tatttttagt agagatgggg tttcatcatg ttggccaggc 120481 tggtcttgaa ctcctgacct caggtgatcc acctgcctca gtttcccaaa gtgctaggat 120541 tacaggtgtg agccactgtg cccagcccat acgccagaat attaagaaat tcctgtaact 120601 atgaaaggat aggaagctaa taaggaccag aaagatcatc tggtccaagg tcttccatat 120661 tgcaatcttt ttcaatcctc cttctgccac attggaatac tattcacctt attttatcta 120721 aatatatcaa ctttttagcc tcagtacatt taaagaagag gtttatgtca ctgtaataaa 120781 tggaaaacca gtaatactta ccataagtaa aaaggttaat ataaaagtaa caaatgaaaa 120841 gtttaccctc ataccacctg acatcacctg gcacactcat actatgtttt ggacaatctt 120901 gatctggtcc aatttcttta ggaaactgca gttaccttgg ctagctgact aatgtggggc 120961 tgagactggg actcatggac accctgtttg caccgtacca ctacatacta aactggccac 121021 taaatgctgg tataaacaga ccactgttga ccaccacctc caagtcaaag aacccagcca 121081 ggagttgaac aagaaagagg ataaaaccaa caccaaggga tacccaggaa ttctggatgc 121141 acattaacaa tctgtcagga gccttatctg gccaccctca tcacctcaga aattccaagt 121201 aggaaatgtt cgctggctac cacagtcaga gaagtggagg cctggatact ggactcagcc 121261 tctagcttgc taggtggttc tgggaacctc cctttccctc tactgatcat aattttctca 121321 tctgtaaaat gaggcactgg actccctggt gtcctgggtg gccccctcca gctcttagac 121381 tgtgcctctc caggtcatct taacctactg ttacctccaa aacaagcggc cccttcacaa 121441 atttaggact taccactgga acaccagaag gtcaccctga tgtcagcttc cagagtatga 121501 gaagacaagg accaaaaaag ttacagcctt ctcagaccca gaaagcttcc tgccttcaca 121561 cggtggactc agagctctgg cctcatgagt atccctcacc acgataccag ctcagtgact 121621 tgagcagaga gaggaggcca gctcaccact ttttcactca gttacatcat tttacataat 121681 gtcccagaag ggtagctgga tgctggtggc accagtttcg tgcgctggcc tcggctccaa 121741 gaccttctct ctgaccttcc tcttcttcat attacatgtg ctctcttcat cttaggataa 121801 aacccacttt tgagggccac attactatct ttcctcatta ttccagctca acaatcaatc 121861 ctaagtaccc aacctatgtg tgtttctttt taaaaaaaaa aaaaaatctt aaaaaattaa 121921 aaattgctct ccctccccct ccccctccct ctcccctttg cacggtcctc gtctcccctt 121981 tgcacggtct ccctctgatg ccgagctgag gctggacagt actgccgcca tcttggctca 122041 ctgcagcctc cctgcctgat tctcctgcct cagcctgccg agtgcctggg attgcaggcg 122101 cgcgccgcca cacctgactg gttttcatat tttttggtgg agacggggtt ttgccgtgtt 122161 ggccgggctg gtctccagct cctgaccgcc agtggtctgc cagcctcggc ctcccgaggt 122221 gccaggattg cagacggagt ctcgctcact cagtgctcaa tgttgcccag gctggagtgc 122281 agtggcgtga tctcggatcg ctacaacctc cacctcccag ccgcctgcct tggcctccca 122341 aagtgccgag attgcagcct ctgcccggcc gccaccccat ctaggaagtg aggagcgtct 122401 ctgcctggcc gcccatcgtc tgggatgtga ggagcccctc tgcccagccg cccagtctgg 122461 gaagtgagga gcgcctctcc ccggcggtca tcccgtctag gaagtgagga gagtctctcc 122521 ctggccgccc atcctctggg atgtggggag cgcctctgcc ccgccacccc gtctgagatg 122581 tgaggagtgc ctctgccagg ccgcgacccc gtctgggaac tgaggagtgt ctctgcccca 122641 ccgccacccg tctgggaggt gaggagcgtc tctgacctgc caccctgtct gagaagtgag 122701 gagcccctct gcccggcagc cgccccgtcc gggaagtgag gagcgtctct gcccggcagc 122761 tgccccgtcc aggaggtggt gggcagcccc cgcccggcca gccgccccgt ccgggaaggg 122821 aggggcagcc cccgaccggc cagccgcccc gtccgggagg tggggggggg cagcccccgc 122881 ccggccagct gccccgtccg ggagctcggg gcagcccccg cctggacagc tgccccgtcc 122941 gggaggtggg agcccctctg cccggccgcc accccgtctg ggaggtgtac ccaacagctc 123001 attgagaacg ggccatgatg acaatagcgg ttttgtcgaa tagaaaaggg ggaaatgtgg 123061 ggaaaagaaa gagagatcag attgttattg tgtctgtgta gaaagaagta gacatagcag 123121 actccatttt gttctgtact aggaaaaatt cttctgcctt gggatgctgt taatctataa 123181 ccttaccccc aacccctgct ctctgaaaca tgtgctgtgt ccactaaggg ttaaatggat 123241 taagggcggt gcaagatgtg ctttgttaaa cagatgcttg aaggcagcat actcgttaag 123301 agtcatcacc actccctaat ctcaactacc cagggacaca aacactgcgg aaggcggcag 123361 ggccctctgc ctaggaaaac cagagacctt tgttcacatg tttatctgct gaccttccct 123421 ccactattgt cctatgaccc tgccaaatcc ccctctccga gaaacaccca agaatgatca 123481 ataaatacta aaattaaaaa aaaaaaaatt aaaaattatg catatagatt ttggaaagac 123541 aaccataaag tcagatcagt cacatcagcc tcctgatgtg atgcaacaga agtacacagc 123601 accaccaatg aaacatgttt gcgttaaaac tcagacactg atcaagcatc taaaatgaat 123661 taccagccag gcatggtggc tcacgcctgt aatcccagca ctttgggagg ccgaggcagg 123721 tggatacctg agatcaggag ttcaagacaa gcctggccaa catggtgaaa ccctgtctct 123781 actaaaaaca caaaaattag ctgggcatgg tggcacacac ctgtaatccc agctacttga 123841 gaagctgagg catgagaatc gcttgaaccc gggaggcgga ggttgcagtg aactgagatt 123901 gtgtcatcgc actccaacct gggtgacaag agagagactc catctcaaaa ataataataa 123961 taataataat aataataaat aaaataaatt acctatttac agaaaatatg gaggataggt 124021 gcaagtatta aacgactcca tgaagatgtc atcagccaaa tccagaatgt gggactttct 124081 acaggacaaa caataaagtt ctacaaaaaa taaatagcat ggacaggatt ggggttatag 124141 attgagatac gagatattcc aaccacttgt agtatgtggg tcttattgga acaagattca 124201 aacaaaccaa ctgtacaaag acacttgtga catgattgac aaaaactgtt tagcctaggt 124261 attactcgat atgaaggaat tattgttggc tgggcatggt ggctcatgtc tataattcca 124321 gcactttggg aggccaaggt aggtggatcg cttgagctca ggaggtggag accagcctgg 124381 gcaacatggt gaaaccctat ctctaccaaa acaaaataaa acaaaacaaa aattagctgg 124441 gtgtggtggt gcatgtctat ggtcccagct actccagagg ctgaggcagg agaatcgctt 124501 aagcttgcga ggcggaagtt gcagtgagcc aagattatac cactgcattc cagcctgagt 124561 gacatagtga ctctgtctca aaaaaaataa ataaataaaa ataaaaaatt agctgggtgt 124621 ggtggtacac acctgtagcc ccagacagtc aggaggctga ggtgggagga tcacttgggc 124681 ctgggaggtg gaggctgcag tgagccgtga ttgtgctgct gcactccagc ctgggtgaca 124741 gagtggaaac cctgtctcaa aaaaaaaaaa aaaaaaagga attactgtta attttgttgg 124801 gtataataat gatattgtgt tttttttttt aaacaaaacc tgatctgcta gagacacatt 124861 tttttaaata tatatatatt tttattatac tttaagttct agggtacatg tgcacaacgt 124921 gcaggtttgt tacatatgta tacatgtgct atgttggtgt gctgcaccca ttaactcgtc 124981 atttacatta ggtatacctc ctaatgctat cccttccccc tccccccacc ccacaacagg 125041 cccaggtgtg tgatgttccc cttcctgtgt ccaagtgttc tcgttgttca tttcccacct 125101 atgagtgaga acatgaggtg tttggttttt tgtccttgcg atagtttgct gagaatgatg 125161 gtttccagct tcatccacgt ccctacaaag gacatgaact catcattttt tatggctgca 125221 tagtattcca tggtgtatat gtgccacatt ttcttaatcc agtctaccat tgttggacat 125281 ctggcttggt tccaagtctt tactattgtg agtagtcccg caataaacat acgtgtgcat 125341 gtagagacac atatttttga agcaatacaa tgtttgggat ttgctttaaa atactccagg 125401 ggaaaaaaat aaaaatggag agaaaagatg aaacaagaac aaggaagttg gttgttgaag 125461 ctgggtgatg aatggaggtt cactgtaata gtctgtgtgt gtgtttgaaa attacaataa 125521 aaggtatttt ttttttcttt tttttgagat ggtctcgctc tgttgcccag gctggagtgc 125581 agtggtgcaa tcacagctca ctgcaagctc aacctcctgg gctcaagcaa tcccctgcct 125641 cagcctccca agcagctagg actacaggtg cacaccacca tgctcagcta atttttaaat 125701 tattttgtag aaacaaggtc tatgttgccc agggtagtct caaactcctg ggctcaagca 125761 attctcctgc ctcagcctcc cagtgagcca ccatgcctag ccccaataaa aggtattttt 125821 taaaaagatt aagttgtaga agagtatgta tataaactta tttgggtaaa aactaagtat 125881 gtatatgagt atgtatgtgt gtggatccat acacagacag acaggcagac acacacacac 125941 acacacacac acacacacac acacacacac agagagagag agaaaatgcc tagaaaatac 126001 acattaaacc atcaacagtt atttctggga ttataagggc taggagacat tgaattatta 126061 tattcttacc ctgtttcaat gttttataac aaataagtat tactttcaca agaacatttt 126121 taaaaaggaa atcataaaat gtgaagtatt ttacatatgt tacaataatg aatataggta 126181 ttagaaaatg gctttaggcc aggcatggtg gctcatgcct gtaaccctag tactttggga 126241 ggccaaggta ggaggattac ttatgcccag gagtttgagg ctgcagagag ctatgaatgt 126301 accactgcac tccagcctgg gcaattagag cgagactctg attgtattaa aaaaaaaaaa 126361 aaagtcttta acaatgtgga agaacatact caactgtatt actttttttt cttctttttt 126421 tttgagacaa gtctcagtct gtcacccaag ctggaatgca gtggtgtgat ctcggctcac 126481 tgcaacctcc acctcccggg ttcaagcaat tctccctgcc tcagcttcct gagtagctgg 126541 gcttataggt gcccaccacc acgcccagct aatttttgta ctttttagta gagacagggt 126601 ttcgccatgt tggccaggct gctcttgaac tcctgacctc aggtgatcca cccacctcag 126661 ccttccaaag tgctgggatt acacgtgtga gccactgcac ccggccttta catgtattac 126721 ttttgaaatc agtcaaaaga ataataatac ataaaatttc ctacccccca cccccccgcc 126781 atgttctggt ttccacccca tgttctggtt ttccccaatc ctgaaagttc tgagttcaag 126841 tcctggttcc atcactgact gtccatatgg ccacggtaga tctcttgatg atttgaatct 126901 cttttctcac ctgaaaagtg agagtatcat gcctcctaca tggctactga ggtggcctta 126961 tgaaataaca cagtgacctg gagggccctc caaggcagga cagggctgtt gctgttgggg 127021 aggcacacat ttcataccat gtaacctggt ccactgccac cccacttcgt ggacccaaca 127081 aagctgtgca tggcccatac tcctacacac aggcatgagg ggacaaccag gatcaaggta 127141 ggtgttggag taggaatggt taagggacag taacaggtca gctcagagtt tgcagggagt 127201 ggcagccttg tgccaggcca ggcactgaga caggcctgca tgtcccaact gtaggcaggc 127261 catatcactc tccttgtaaa tcttagtaga caccacatgg tccccaccca ggatctacag 127321 cctcatccta gcaggcactc aaatatttgc tgagtgagga tgggcacgca tgactgactt 127381 tggtcaactc ccagctggct ggtcaccaac ttttctttgt aacatattct gggccccaca 127441 gctgcatact catccctgga gccacctctt ctcaccttct cctacaccta tgcacaggaa 127501 tctcagtaca ccaaggtggc atggcagaaa tcagcagctt cccctgggga ttgtctactg 127561 agcacctaca gaaccatatg ggtctaccag gactgggaca gtggcccaaa acatagaggt 127621 tcctgatcgg ctcaggaact aggcagaaag agaatcaggt cctgagcata cctatatggc 127681 catcctgcta aaatttctca tggtccctgt agaagctata gactttcacc taaccacaga 127741 gcaaccttgt tgtggctagt tctgtgcctc tgtccctacc ctgaccattc ttttctcctg 127801 accagaatcc actctctagt tttcaaagca tttcacatct atcgctgcat tttatcctca 127861 tgccatttta cagatgagga cactgggcca gggacattag aggactgcct aaggccagga 127921 caccacgcaa tgccaaattc agacttccta acacaatatt cagcactctc ttcattctta 127981 tgtttttcta ttaacatgct gctttacaaa attaaatacc tggtttttgt cttccctcaa 128041 atttatcgtg caatgttgta acttttgtgc tgtttgaaat gagtacagag taaattaatc 128101 actcatccac ttattcattc aaccaaccag cagttttcca atgccaatta catttgaggt 128161 ctggtgctac caactgtgga ggagttacaa agaggaataa gaccaagttg aggtcttgga 128221 gtctagtaca ggagctgaag gcacatagat taattttaag aattctaaac aaggggctcc 128281 atcaaactca cctgtttttc ctttcgttgt tgtttttgcc tgtttttctt aaacaaagca 128341 tcctctcacc ccacgaaaga tctggttccc agatcccttc agagatttaa ataggatctg 128401 gacctagatc aggcctaaga atctgtatgc ttttaatgta gccctcactt tctaccctag 128461 atgattctct aattaacaga gttttggaaa cccagctcta gaagagggta cagctctatc 128521 aatagcacaa agcaatacca acaaaataca gtattttaga aggacagaaa cctcagggcc 128581 tatccagcta caatccacag ggactacaga gaggtcagat aaacttgggg ggtaagggac 128641 aaatctgctg aattcagatt aatcagcatt tagtccaact ggaacacagc tgatcatttt 128701 aagaaataag ttcacaccta gttttaaaaa acagtcctaa ctttgagatg tccagggcaa 128761 cacatgtggt gctgggacta acgcggattt gcaatactta atgatcctgt aggaacacag 128821 agcctggcgg ctgaagttga ctgacccagt atcttcaata tattaccttc ccagagctgg 128881 cctttggtct acttggtctg gcccacctca caggaattga gcaagaaaga gaacacgaga 128941 gttgaatatc cctgcaggaa ccaccaggga acttgatc // LOCUS HSAC002086 112686 bp DNA PRI 13-MAY-1997 DEFINITION Human PAC clone DJ525N14 from Xq23, complete sequence. ACCESSION AC002086 NID g2085785 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 112686) AUTHORS Tin-Wollam,A, Graves,T and Biewald,T. TITLE The sequence of H. sapiens PAC clone DJ525N14 JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 112686) AUTHORS Waterston,R. TITLE Direct Submission JOURNAL Submitted (13-MAY-1997) COMMENT SUBMITTED BY: Genome Sequencing Center Department of Genetics Washington University St. Louis MO 63108, USA http://genome.wustl.edu/gsc mailto:sapiens@watson.wustl.edu NOTICE: This sequence may not represent the entire insert of this clone. It may be shorter because we only sequence overlapping clone sections once, or longer because we provide a small overlap between neighboring data submissions. This sequence was finished as follows unless otherwise noted: all regions were double stranded or sequenced with an alternate chemistry; an attempt was made to resolve all sequencing problems, such as compressions and repeats; all regions were covered by sequence from more than one subclone; and the assembly was confirmed by restriction digest. MAPPING INFORMATION: This sequence was generated from part of bacterial clone contigs of human chromosome X, constructed by David Bentley's chromosome X mapping group at the Sanger Centre. Further information can be found at http://www.sanger.ac.uk/HGP/ChrX/ SOURCE INFORMATION: This clone was derived from human PAC library RPCI-3 prepared by Pieter de Jong and coworkers at Roswell Park Cancer Institute, using the method described by Ioannou et al., Nature Genetics 6:84-9 (1994). The library is from one male donor. For further details, see http://bacpac.med.buffalo.edu/ The clone is available from Genome Systems, Inc. (http://www.genomesystems.com). VECTOR: pCYPAC2 NEIGHBORING SEQUENCE INFORMATION: The actual start of this clone is at base position 1 of H_DJ525N14; actual end is at 112686 of H_DJ525N14. The orientation of this clone is unknown. This clone contains STS AFMb331yc9 (NID:g1235110). FEATURES Location/Qualifiers source 1..112686 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /clone="DJ525N14" /clone_lib="RPCI-3" /map="Xq23" repeat_region 523..710 /rpt_family="ALU" repeat_region 1001..1081 /rpt_family="ALU" repeat_region complement(2902..3185) /rpt_family="ALU" misc_feature 3378..3418 /note="similar to human EST R69179 (NID:g842696) yi39b07.r1" misc_feature complement(3680..4065) /note="match to human EST T77594 (NID:g694797) yd73h12.r1" misc_feature 6471..6793 /note="match to human EST R69179 (NID:g842696) yi39b07.r1" misc_feature complement(6739..7041) /note="match to human EST R69071 (NID:g842588) yi39b07.s1" repeat_region 9255..9302 /rpt_family="ALU" repeat_region complement(12295..12496) /rpt_family="ALU" repeat_region complement(15224..15591) /rpt_family="L1" repeat_region complement(16974..17109) /rpt_family="ALU" repeat_region 17365..17631 /rpt_family="ALU" repeat_region complement(17661..17957) /rpt_family="ALU" repeat_region complement(20930..21221) /rpt_family="ALU" repeat_region 22978..23019 /rpt_family="L1" repeat_region 23615..23989 /rpt_family="ALU" repeat_region 24666..24947 /rpt_family="ALU" repeat_region 25334..25430 /rpt_family="ALU" repeat_region 26796..32933 /rpt_family="L1" repeat_region complement(31523..31940) /rpt_family="L1" repeat_region complement(35528..35802) /rpt_family="ALU" repeat_region complement(36166..36848) /rpt_family="LTR" repeat_region 36166..36849 /rpt_family="LTR" repeat_region 41112..41401 /rpt_family="ALU" repeat_region 41537..41616 /rpt_family="L1" repeat_region 41861..42159 /rpt_family="ALU" misc_feature 44658..44934 /note="match to human EST N23587 (NID:g1137737) yv99a09.s1" misc_feature 44658..44920 /note="match to human EST H83437 (NID:g1062108) yv83e03.s1" misc_feature complement(44956..45131) /note="match to human EST N23586 (NID:g1137736) yv99a09.r1" misc_feature complement(45478..45729) /note="match to human EST H83545 (NID:g1062216) yv83e03.r1" repeat_region 51930..52220 /rpt_family="ALU" misc_feature 52820..55727 /note="Elongation factor 1-alpha pseudogene" repeat_region 53066..53332 /rpt_family="ALU" repeat_region 53994..54029 /rpt_family="L1" repeat_region complement(55080..55370) /rpt_family="ALU" repeat_region 57186..57919 /rpt_family="L1" repeat_region 57998..58409 /rpt_family="L1" repeat_region 58422..58524 /rpt_family="L1" repeat_region 58525..58798 /rpt_family="ALU" repeat_region 58799..60287 /rpt_family="L1" repeat_region 60331..60776 /rpt_family="L1" repeat_region 60778..60903 /rpt_family="ALU" repeat_region 60904..61251 /rpt_family="L1" repeat_region complement(60905..61068) /rpt_family="L1" repeat_region 61272..62268 /rpt_family="L1" repeat_region 63265..63555 /rpt_family="ALU" repeat_region 63885..64052 /rpt_family="ALU" repeat_region 64062..64353 /rpt_family="ALU" repeat_region 64638..64679 /rpt_family="L1" repeat_region 68204..68329 /rpt_family="ALU" repeat_region 68462..68731 /rpt_family="ALU" repeat_region 68819..69221 /rpt_family="MER" repeat_region 69246..69441 /rpt_family="L1" repeat_region 69444..69728 /rpt_family="ALU" repeat_region complement(69847..70141) /rpt_family="ALU" repeat_region complement(70727..71293) /rpt_family="MER" repeat_region complement(71814..71880) /rpt_family="MER" repeat_region 74190..74487 /rpt_family="ALU" repeat_region complement(76351..76650) /rpt_family="ALU" repeat_region complement(78351..78759) /rpt_family="MER" repeat_region 82086..82239 /rpt_family="ALU" repeat_region 83720..83996 /rpt_family="ALU" repeat_region 84463..84753 /rpt_family="ALU" repeat_region 85074..85130 /rpt_family="L1" repeat_region complement(88509..88576) /rpt_family="L1" repeat_region complement(88920..89211) /rpt_family="ALU" repeat_region complement(91388..91427) /rpt_family="ALU" repeat_region 93294..93320 /rpt_family="L1" gene 96380..98398 /gene="WUGSC:H_DJ525N14.1" CDS 96380..98398 /gene="WUGSC:H_DJ525N14.1" /note="similar to zinc finger 5 protein from Gallus gallus, U51640 (PID:g1399185)" /codon_start=1 /evidence=not_experimental /db_xref="PID:g2085786" /translation="MESRKLISATDIQYSGSLLNSLNEQRGHGLFCDVTVIVEDRKFR AHKNILSASSTYFHQLFSVAGQVVELSFIRAEIFAEILNYIYSSKIVRVRSDLLDELI KSGQLLGVKFIAELGVPLSQVKSISGTAQDGNTEPLPPDSGDKNLVIQKSKDEAQDNG ATIMPIITESFSLSAEDYEMKKIIVTDSDDDDDDVIFCSEILPTKETLPSNNTVAQVQ SNPGPVAISDVAPSASNNSPPLTNITPTQKLPTPVNQATLSQTQGSEKLLVSSAPTHL TPNIILLNQTPLSTPPNVSSSLPNHMPSSINLLVQNQQTPNSAILTGNKANEEEEEEI IDDDDDTISSSPDSAVSNTSLVPQADTSQNTSFDGSLIQKMQIPTLLQEPLSNSLKIS DIITRNTNDPGVGSKHLMEGQKIITLDTATEIEGLSTGCKVYANIGEDTYDIVIPVKD DPDEGEARLENEIPKTSGSEMANKRMKVKHDDHYELIVDGRVYYICIVCKRSYVCLTS LRRHFNIHSWEKKYPCRYCEKVFPLAEYRTKHEIHHTGERRYQCLACGKSFINYQFMS SHIKSVHSQDPSGDSKLYRLHPCRSLQIRQYAYLSDRSSTIPAMKDDGIGYKVDTGKE PPVGTTTSTQNKPMTWEDIFIQQENDSIFKQNVTDGSTEFEFIIPESY" misc_feature 100489..100812 /note="match to human EST R31842 (NID:g787685) yh69d10.r1" misc_feature complement(100904..101361) /note="match to human EST D80048 (NID:g1177925)" misc_feature complement(100974..101338) /note="match to human EST R31806 (NID:g787649) yh69h10.s1" misc_feature complement(101047..101361) /note="match to human EST R31792 (NID:g787635) yh69d10.s1" misc_feature complement(101258..101362) /note="match to human EST T61211 (NID:g664248) yb84c08.s1" misc_feature 101614..101789 /note="match to human EST R39187 (NID:g796643) yc89b12.s1" misc_feature 101615..101822 /note="match to human EST (NID:g683307)" misc_feature 101615..102038 /note="match to human EST AA131343 (NID:g1692841) zo08h04.s1" misc_feature complement(103124..103580) /note="match to human EST AA131443 (NID:g1692930) zo08h04.r1" misc_feature complement(103788..103995) /note="match to human EST T75299 (NID:g692061) yc89b12.r1" repeat_region 104972..105263 /rpt_family="ALU" repeat_region complement(107849..108138) /rpt_family="ALU" repeat_region 108827..109127 /rpt_family="ALU" misc_feature 109916..110040 /note="similar to human EST AA018543 (NID:g1481798) ze50c10.r1" misc_feature complement(111208..111355) /note="similar to human EST T75299 (NID:g692061) yc89b12.r1" BASE COUNT 33780 a 23505 c 23725 g 31676 t ORIGIN 1 gatcagtcgt tggggtgact ggaacccaca gtacatgtag gggcccacaa actagtttaa 61 tttcttatat aaaataaggt tgatgaaaag ggtcattgtg caagcaaaaa tagaatgagt 121 cactcaaacg atggaggata ggttctatgg ttaaaaccta ctctctatat tcaggacatt 181 aacaagtcct gaagtatcta caaaacactt cttaaaggaa gaatcttaga gcactcaatc 241 tgtcctggct gtgattattt gacaataaat aaaatgtatt tattcctaaa caagagctca 301 gaaagatatc tgtagaaaag ggcaggagta gaggaaaaga gggctatgca gcagcaggaa 361 agaagggaaa tcactgcagg tccacagcac cagctcagca cagaaggtaa gtgcgcattt 421 gtaatgggtc tgtagccaga gacatagcag aagctgcgct atccacagtc acaaccaaag 481 gcaaaggaag gatgccctgc aattttagaa atcttggctg cagctgggaa cggtggctca 541 cgcctgtaat cccggcactt tgggaggccg aggtgggtgg atcacttggg gccagcagtt 601 ccagaccagc ccagcaaaca tggtgaaacc ctgtctctac taaaaataca ataattaccc 661 aggtgtgctg gtgcgcacct gtagttccag gtactcggga ggctgaggca ctcctgagta 721 cctgcacaat acatgcaaac acccacccca tcctactccc cggccccccc gccctactcc 781 ccccctccac cctacgcccc cccgccctac tccctccccc gccctactcc ccccaccccg 841 tcctactgcc cccactccaa cacgtgctct cccgtgccca aggcaaggcc aagcccctga 901 ggcatgcgca cctcagcagg cccaacccac agcaaatagt gggaagagga aaggcaagag 961 aggaggtctc taagtggata cactgttact gaatctaggt acccagaaga tggaggttgt 1021 agtgagccca gatagtgcca ctgcattcca gagacagagc gagactgtct caaaaaaaaa 1081 aaaaaaaaaa aaaaaagaaa gaaagaaaga aaagaaaagc aatctcggat gctctaagca 1141 tcattgtgca tgtttccacc cttgcacttt gatgtggatt cattactaag aattaatagt 1201 ggactttctc aaggagtgtg ctgcattttt acatagtgca tttgcgcagc ccacaagctt 1261 tgcaaatcca ggcccaagct cctgctcctg gagacctagg aagcatgggg ctagtgcaaa 1321 atctccctta tcgccataaa atgggcttgc acaattcagc acagcagctt cccccaagtt 1381 ccacactcac gcatgcctgt ctacaggaca acgcatgcaa gcacgcaccc catcctactc 1441 cccctcccac gccctactcc cccctccccc acaccctact acaccactta cccctccccc 1501 cccacaccac caacgcgtgc tctccctcat ccgggcaagg ccaagcccct gacgcatgcg 1561 cacctcagca ggcccaaccc acagcaaata gcgggaagca gaaaagcaag agaggaggtc 1621 tctaagtgga tacactgttg ctgagtctag acaccagaag aacgttgcag gcggcgactc 1681 acagttctag cactgcctag gagagcgtgg tggccccagc tcagaatctg cagaagtgca 1741 cagctccatc cacaccactc agggtatgga gcctccggac cagtgtagcc agtatatgac 1801 cagcttgctc agccctgcag tcgacgacga gaaagaacta cagggtcagt accttgatgg 1861 gacacatgct tttgcgaact caggaaacca aggttctctg ggagaggcga gtagactgga 1921 tgatacccct gcactaccac tgctctcaga gtgtagcccc tcaccacctc cttccaacac 1981 cctcaggacc aatgccttga tctttctctt ggggtttcta cttttccaga tatgaatgct 2041 atggtgctgt cgcttactga agaggtcaaa gaggaggaag aggatgcaca gcctgagcct 2101 gagcaaggca cagcagcagg agaaaagtta aagtcggcag gagcccaagg cggagaagaa 2161 aaagatggcg gcggagaaga aaaagatggc ggcggcgccg gagttcctgg ccacctatgg 2221 gaaggagacc tcgagggcac cagcggcagc gatggcaacg ttgaggacag cgaccagagc 2281 gagaaggaac ctgggcagca gtattcgcgc ccacagggcg ccgtcggggg gctggagcct 2341 ggcaacgcgc agcagcccaa cgtccacgcc ttcaccccat tgcagctgca ggagctggag 2401 cgcattttcc aacgcgagca gttccccagt gagttcctgc ggtaagccca ttgctctggt 2461 tggcgcgcgg tttgcaggga ggcggcgttt ggctttcccg cagtccctct cctaccctct 2521 cccctcctga accaaaaccc atctggggcc tggtgttgct gccgtcccct ccccgcagac 2581 ccctggcacc tagtgggttc tgtagtgggg ctatgcctat taggcatcat gcagaattta 2641 aatgaaccaa gttgggcaac tttgggctga ggtcagttat gataaataac tcctatccca 2701 ggcgaggcag aaataaagat gaggagatta aggttctgta cagcaagtgc agggtcgcat 2761 tctgacctta tttaaaattc tgagaagtcc gtcgttcgtc tgggtttcct ttggtgttaa 2821 tttttctaag tttcaaatag taagttagaa tgtcatttat attgattaac gatttttttt 2881 gttatggggg ggttattttt ttattttttg gagaaggagt ctcgctggga cacccaggct 2941 ggagtgcaat ggtgcgatct cggctcactg caacctctgc ctcccaggtt caagagatta 3001 tcctacctca gcgtcccaag tagctgggat tacaggcgcc agccaccacg cccggctaat 3061 tattgaattt ttagtagaga cggggtttca ccatattggc caggctggtc tcgaactcct 3121 gacctcaagt gatcctcctg cctcggcctc ccaaagtgct ggcattacag gcgtgagcca 3181 ccgccctgat tttttttttt ttggcatctc ctttatttgt ggcatgagag aaatgttcct 3241 aatgtgaggc tcagctgggt gttacagaac agcctactgg gtgtggggag ttggtagaat 3301 aaaaaaatta aacacaagaa tgaaacgaca cccacaactc caaatgctga acacgtgtgg 3361 tttcttccat agaaggaggc tggcaagaag catgaatgtg actgaactcg cagtgcaggt 3421 cagtaaaccg aaaaagcaat cgggcagggg agccattcta aaacccgctt cagggcttgg 3481 acacactttg acccagacat tgccatcttg gtgtttttgt ggcctttttc atgtataggc 3541 aatgaggtct gaactttggg atctttgtgg ctggaaaatg cagtaaggaa agctcagctt 3601 gtggaaattt cccattacca gagggaacat gacaatccac ggaaaaaaaa atgagtactg 3661 aactgtttcc tttattcctt gtgtataaaa tatattatac acttgaataa attacaaata 3721 tataaatata tgtattccaa attacaaata tatataatat ataacagtat aaattaaata 3781 tagcataata tataagattg aattcctagc actggtatcc tttaggtagc acttccatgt 3841 ggaggcatcc tggggttttc tgacatgggg attattcgaa ctaatgctcc aaagagctcc 3901 attaaactaa actatttaat atatttaaaa ccaagcataa actctttggt taagaattta 3961 taaatgttta ggcattgggg taaaggaata attcccaacc agaacgtatg tttccttaag 4021 ctgagactga tggcggggga tagcataaag ttcaagcgcc tggccctcct actattgttt 4081 ttaagtattt ggtgaagcct tgtagagagt ggcagctcta aattttaatt ttctggaata 4141 ctttgatatg gcaggctgaa aacattgctg acatagttgg ttttgcatta tggtagagat 4201 agcgaaaagg cttgcaacct ttgcaagttt atctagaaac caaaacaaac tacaagatca 4261 atttttcata aatttccacc aacctaagga tggaagtgtt aggcaagaat tgtatacagt 4321 cacactcatg gcttggggaa gctgccccca ttcaactttt gaagctacca aatgcatcag 4381 tgaatatttt cttgatcaat ttctagcatt taagagattt tttgtttctt ccagtttgtt 4441 gtactaacag gaagagatgt gttttgtgga atgtgttgat agtgggggat ggctgtcagg 4501 tcaggctaga ctcttacctg tggtaatttg aaataccttt ttatttattt gctcaggctg 4561 tgataataaa tttgtaggtc agtggatcca aaggagtatt tgcaaaaaag gaagaaaaga 4621 aagaaaaatc gggatgacct taatagcttt tctgcagagc tcatgtgtgt gagtgcagtg 4681 agggaactat gctgcggttc tgtcaatggt ggcatgcaga aaagctgcct gacaaaacgg 4741 ggcactgagt gttggcatcc tgttgccctc cattttctca ggcaggcaat gccaatttgg 4801 gggagatgaa atattgacgt tttggtgaat tgcctttccc ctccctgggt ttacatagca 4861 ttcactcaat gcagttctga aggcaaaagg accactctca ctcagcaaat aaggaaaaag 4921 gcatgatgtt ctcatctatt gccaaacatt tcagcatgaa tatctccaaa aatttttaat 4981 agatgtttcc cccctctgtt ttgtaaagat agtaagggct ggtttatgaa accgatacgg 5041 agactaggga cctaatgttg tcccatagca catttggccc acttttggca ttgcttatgt 5101 ttatgaacct ttgaccacag gtacattgga attgtctcta aacaagcact attacggtag 5161 gcttaaaatt tagactatgt attcatgagg gagaaggact ccaaatagta agatgggaga 5221 aggaaatggg gcatcaaaca gggagaagag tccctttgtc ccataaaatt tgcagtttgg 5281 gaacttccag aaggatcaaa atacaatgga aatgtcatta actagaagct tgggggatag 5341 tgttgtttaa ttgacaggtt ttattttttg gtcttttaac aattggcagg tgtttgggga 5401 gtagtgtttt ttgatacctc ctccacatta ctactgaacg tgttcagagt tgcctatttg 5461 gacactcccg ctttgggaat gctgatgaga ttgggatctt ctgctcgagt agtgaggcta 5521 gatggctgaa ggcgtcccat ggaactgggc aaccaagtct ctgtcccata tcccaggact 5581 atagggagaa caaagtgagc tgaccttgtc cttagcccag gaagatttgg agggagggat 5641 gagttggagc gtgcagaatt agcaatgctt cctttctctc aaactctcaa ctgggcaaag 5701 ctttttgaga atttttgtga atttggacaa catttgtgtc ctccttccac aaagcttcaa 5761 gctgaagtgt gaagagtgtc cagggcttca gataccctca tttcccagca agacaatcca 5821 caaactccct gggcccccta taacgagact tgccttgcaa tataccatct ttttaattct 5881 caaggagttt cagaagatga atgacccctt agaatttctt gaaggaaaga aaacttgttt 5941 taaaatacaa tgtactctta ccgatttttt actttacact gaagatgggc ttttcaaaac 6001 gggggcctca tcttcttggt atgactttaa aaggcacccc cgcctttttt tccaagctca 6061 gcttccccca cgaccagggt gacagtggcc tagtaagtag gagttggtgg gcagctgggt 6121 agctgatggt caagtatctg caagcagaca tgccacacat agcaatacca gcagggatcg 6181 agtcccactc atcagctaca cctgcaccca agagtggaaa aagtaatatt acagataaaa 6241 ggagagagac aaagaaataa gccaagccag agcgtacgta gcacatactt ggagctcact 6301 aggggaaagg tgtggtggtc acctttctaa gatattttag tagtaaggaa gtagcttgtc 6361 ttgatacacc cgaaaaggtt aaggacatag agagttgctt gttgtgtttt tttccatgat 6421 gaaattaaat gctcagtata ctaaacctgt ttcttttttc ctctgattga agatttggtt 6481 tgagaataga agagccaaat ggaggagaca tcagagggca ttaatggcaa gaaacatgct 6541 gcccttcatg gcagtgggcc agcctgtcat ggtaaccgca gctgaggcca taacggcacc 6601 cttgttcatc agcgggatga gagatgatta cttctgggac cacagccatt ccagcagcct 6661 gtgtttcccc atgccaccct ttcctcctcc gtccttgccc cttccactca tgcttcttcc 6721 acctatgcca cccgctggcc aggctgaatt tggcccattc ccttttgtta tcgtgccttc 6781 tttcacattc cccaatgtct aagggatagc ctctgtgcca ctttttgcca gagtgtcttt 6841 gagccagatt catattttgc atagcacccc atcaaaagta gttcatcaaa tgtctattaa 6901 acgttttaaa gaaaagtaca tcattgaccc atttttaggg cacttgtaaa aatgtttcta 6961 taaatatgtg aagggtatgt acatttgttt tgtgtgtcac atggggtcag taagttctca 7021 ataaaaattg ttaagaaatg ccattcaaac cgaatgtcac ggactctcgt ctcatgcagt 7081 aatcttgagt caccatctgg gggctgtgca tgtcacagaa ttttcccatg tgcccatggc 7141 aggcttacac cgacccagac attctcttcc acccccaccc ccaacacccc atactcatct 7201 cctctctgcc cccattgata ccaggtacat gtgcaggatg gggtggtgtg agtccctaat 7261 gagagaatag tgtgtttcaa gttactgcca gggtggaaaa cagcaaaacc aaaatggtat 7321 cgggaaggaa gccatgccca ctcttaaaaa ttcattaagt tttttcttgc tatggaagac 7381 ttctttaaag taattttctt gtaaaagtgt aagtgtaaat aatgccatga aattatacac 7441 ttatttaatg gctaaaatgg caaattttta tatggtttac cacaacaaaa gaaaagaaaa 7501 aatataccaa aaagttttta aaaagtgatc atgaacattg tggtcatcct gtgaagtgat 7561 tgcaatgcct gcagataagg agtgatggtt acaacagtat tttctctgaa aattatttga 7621 tggccagttt cataatgatt atgttttcag cctgaaggaa aattccattt tcttgggtga 7681 gcatgaactt tctgtcaggc tgtctgctgc ttttcattcc ccacttctct cttcacaatt 7741 gtgggtgtca agcttcagcc acgcaattgc atttggtgtg agggttttgc aaggagaagg 7801 aggtttattc atacccctga agccacaagc cttggtggga ataaggaaag tccatgaatt 7861 cactatgcat taatgcatgc ctcttggcca agtggattct tttttcttct ccgattgaga 7921 ttttcctttt tttttttttt tttttttttt gctcttgttc tcattgtatt gtgctttgta 7981 taattattta cagtaagtcg cgcatgtcag tgtacattcc gtctggaaat tgtttccatt 8041 tggtacattt tgtgccagtc ggtctattcc tgctcattat tttgtttttt ctacattcag 8101 actgaaacat ttggtagcct agagatactc agaaataggc aaagaaaggt aaaaggggag 8161 gaggggagat tgaaatcata cttccattat ccctccccgt ggttatcaga ttcataaaat 8221 ttttacacca tcaaaagaca tttttagcca tatatgtttc ctttatgtag aaaatgagat 8281 ctcctatact cagttggcat ttgttttcat gtatctgata agtacctgtt agaattatag 8341 gtcaagatgt tgtctagctc tggtaaccta gatcctgaac ttcaaagcag agttcttgga 8401 ccttaagaaa ttcaaattcg tagaattata gttaggatgt catttactga ccttgaacaa 8461 cacattgttt aggtagcata caatgggact agttcaaggg gctgggtgga gaacaagtta 8521 caattttttc tgatggagaa cattactgga gacagtaccc tttaaaactt tcattccctt 8581 tttattagca gctaacgtct tgtatgcctt tactatgtga caagcaataa aataatgtcc 8641 ttggatgatt ttgtttgatg ctcaaatgaa tcctaatagt gaggttctag catcttcatt 8701 gaaagaagag gaaacttatc catggtgcaa agctaatcag aggaacagga tctgaagcca 8761 aatttgtctg cctccagagc ccatgctttg tcaagtttaa ttcagcggtg gtgcactggg 8821 atgccacagt tagggcaatc tggcttccaa tcccagatgc accacttccc agctgtttcc 8881 ccctgggcat tttacttaag ctctgtagta tcattattat tctctataag atgataatat 8941 tatatcacag aattgatgag agggataaat aacatatgta agtggccttc cacaatgccc 9001 agccagagta agcacacaga aatgttttct actattattc tatattatct cctatcatca 9061 tattattata acctgttgtt ccacctttcc tatgtggtat aacagaaaga gatctgatta 9121 aaaataaggt ggtctgggtt cttgctgcag ctctacgctg tctttgttac cttgaatgag 9181 tcgcttaacc tactatatct ctgctgtctc atgtgttagg ccattcttgt tttgctataa 9241 ggaaatgcct gaggctgagt aatttataaa gaaaagaggt ttagttggct cacagttctg 9301 caggcttaac aggaagtgtg atgctggcat ctgcttctgg tgaggacctc aggaagctta 9361 cagtcatggt ggaagacaga ctgggagcag gcatctcaca tggtgagagc aggagcgaga 9421 gagtgggtgt gggaggtgcc acacactttt aaacaaccag atcttgtgat aactcgctca 9481 ctattgcaag gacagcatca agccatgagg tttccacccc catgacccaa acacgtccca 9541 acaggcctca cctccaacat tggggaaaat aaggattaca tttcaacatg agatttgggc 9601 agagacaaat atcccaacca tatcatctca cctatggaat catgacagta atagcatgac 9661 tcatgtattt ggagtaggtt tagtggtaaa agtttatcag gcatatttca aaaaatcaga 9721 aaaaatagga atgaatgcct tatgaaaaaa atcatttcac gcattgtttt ctagagtgtg 9781 gggagtacca ttcatgttta tatgcatgct cgaccccatt ttcccagatc aacaattact 9841 gtttgtcctt tcttgacaga gaaaaggaat caaatgaagt gaataatatg gagaggaatt 9901 ttttaaagtg agcagagtgg atatttgtaa aggcttttcc atgttttcgc tatggtagtg 9961 ataactgtgg ctgtctgtat aaaaagggtg tatcatttaa acatgccctg aaatggttaa 10021 tgatgaaatt atgtgatgtc tcgatgtgct tccaaataat cccaggttgg gtgcagggat 10081 gttggagaaa tcagcaaggc tactgatgaa acaagactgg atattaataa ttgttgaatg 10141 tgggtggtgg ctatatgagt gttcatctta tgtttgtttt atatattttg gacatttcct 10201 ataattttta aaaaaatagt aagggtcgaa aacctagctg ctaacagggc cgggaaggta 10261 acataaatga agaaaacaag cagagtgggg actggtaaac tgggaaaggc tatgtctccc 10321 ccaaagggaa tggccactac tctgcttcag cctaattttt ctggtcagga ctttggggtc 10381 cagatctggc agattttcca gtttgtaaag attggccaga aataagtgtt tacatgaaac 10441 cttctgatcc ttagatctga gcaatcaatt caaatatgtt ttatttaaaa ttacactggt 10501 catccgaaac agatctgcag accagatata ggatgcagct tccaatttgc aacctgtact 10561 tgagaaggta actttaagca aattcaaaac cgtggagcaa tagaatatga ttgaaagcaa 10621 ctctaaagcc agcaatccta ggatgccatc attccacctt attttttttc tcattataag 10681 tcctgttctc atgtgataaa agaaaaactt cagccaaatt aaagttaaag gagtttaatt 10741 gagcaatgga cgatttgcaa atcgggcagc ccccagaatg acagcagatt cacagagact 10801 ccagtgcagc catgtggtgg gagagttata gacaaaaaag ggaaactaca tacagaaatc 10861 agaagtgaga tagagaatgg ctggattggt tacagctcga cgtttgcctt atttgacatt 10921 ttcattttac caataatttt taaaactctc tttatttccc aaaaattact taagacacat 10981 gaactaaaag gcattacact ttttactttt ctgacaaaat atgttattta agcttttatt 11041 atttttaaac caattagtga aagctctttt atatataaac atcaaacaca taatacatat 11101 aaatccatag acagaagtta aaggactcat tttccaagcc aggaattgaa cgctgaaccc 11161 aggctgccat tgtgaagaga aagcatggcc acatggttac aaggtcaagc tcccaaggac 11221 atacaagaca agagggaaac cttatccagt tttttttttg ttttgttttt tgttttttcc 11281 agggacctgc agcaaagctt ataagtgatc agtttgcttg gctgtcttga acagcgggct 11341 tacaggtgtc ctaagcctgt attctatcct aaggtacccc tcattaatga cagaaaatat 11401 ggaaagacac acaaagcata ccaaatttgg tacagcttaa gactagcctt acaagtgctt 11461 tttctgatta atttaaactt tacaggagag taacagtgat ttttaccatt tattcaacct 11521 gtttgcacag agagagagag agagaggcca ggagtctgac tggtaagaaa ttgttacctg 11581 tttgccagca tgccaggctt ctgtgttccg ttgccctaag tggccctagt gacccacctc 11641 gctgcaccat agacctgggg ggccaagcca caacacaaag gaaaattatc tttttctgtt 11701 tgggccagag taaaatatgt gtgacgaaac atagacatca gctactctgc ttagcaccca 11761 atattaaact ggcaaggctt aaatttgtcc ttagatggct cccgtcatct ttaatccaac 11821 ttctgactag gaatttcaac acgtccctgg gcaagatggt caccctgagt aatagaaaag 11881 ataagaaagg gaaaggagag agagaaaagc attgcctgtg gcagggtggg gaaggtgaag 11941 agctcacaga ggccagagaa ggacccactc attcattgca gtgacactga aaatgaaaag 12001 ttcaggggac cacttgccag tagtgaaggg atcttttcca gcagttccat cagctgtcag 12061 gattcccctt ctggggagga aaaagctccc catctcccac ggtcctgcac atgcctaatc 12121 ctgtcaccca tagccgtcag caaaaagtgt aagaccgatt aatccaaaga gaatagcact 12181 taacattcca tagtgccaaa cccgtcctta gccaaaaggg attttaccaa gagccctcat 12241 ttttaaatgt atttcaatgt gttgttgttc atttggaacg ttccactgta agtacaagcg 12301 attctactgc ctcagcctcc cgagtagctg ggactacagg tgcatgccac cacgcccagc 12361 taattttttg tatttttagt agagatgggg tttcactgtg tttagccagg atggtctcgg 12421 tctcctgacc ttgtgattca cctgcctcga cctcccaaag tactgggatt acaggcttga 12481 gccaccgcac ctggcctctt gttctcttat tttctcccga acatacagaa tcgctctctc 12541 tgttctgagc caactaaagc tgggggtgga gtgacatagg tattaacacc cctgtggtca 12601 ccaccactat gactgtgctg ggtcagacct gaagccagca cagcactggg tctcacccaa 12661 ggcctgctgt aaccactccc tggctactgt ctctatgttt gcttaagacc ctgggctcta 12721 caatgagcag gtggcaaagc cagccaggcc tgtgtccttc cctccaggtc agtgagttcc 12781 cccaggcccc aggtgggtct agaagtatca tccaggagtc agggactaga gtcaaaaacc 12841 ttctaagtct acctggtgtt ctattgtatt gcagctgagc tggcactgaa accataagac 12901 ctagtccttc ctgcttttcc ctcccctttc caaaggcaaa ggagcctcac tccacagcca 12961 ctgccacccc tggccacaag gagcactgac agactaccat caatgttccg ttaaggccca 13021 aggtctctta agttagcttg tggtgaatgc tgcctggtct gggactcatc cttcagggca 13081 gtgggctccc ctctggcctg gggcaggtcc agaaatgctg tcaagtcctg gaatctggaa 13141 cctcaagaac ctgcttggtg ctctagcccc cgtggtggtg ttggtacctg aagccagcaa 13201 gtctcagagg ctcatgatgg ccctcaatgt agtgcctgtg cattgctgtt ggttattcag 13261 ggcccaaggg ctcttcagtt agcgggtgat gaatgctggc aggatggggt cctttccttc 13321 aaggcagtgg gttcccttct ggcccacagt ttgtctagaa atgtcatttg ggaactaggg 13381 ctggaacagg ggcctgttga ctctgactgg agccctatct tgctgtggct gagttggtat 13441 ccaagatgca agacaaagtc ctcccaactc ttccctctcc tctcctcaag cagaaggaag 13501 ggctctcttt tagagccaca agctgtgcat cctggggtta ggggaaaagt gatgccagca 13561 ttcccttggc tgtcctagct agtgtctcag catgttgtgt tccccgcccc gccaccccca 13621 atccactgtg tctgggccta gttcatccct aggacttacc taaaagttgc agtccttatg 13681 gcctaggctg cctttcaagt ttacttagag actgacagca ctttggccct tcatggtgag 13741 gtttgcaggt actcaagttc agactgctgg gatcagtgat tcccctctgg ctagggctgg 13801 tttaaatgct ccctctgtga gtgggcatca actgagtttg atctggtttt cctttctgct 13861 gtaacaggac attgctgagt gcaatgcttc acaattgaat tgtgaagcaa aagaactgtg 13921 ttctctcttc cccagtgccc agaaacactc tccacaccat gccatggctg ccagggtggg 13981 gaaggggtag cattggtgat tcaggattgt tttctatatc tcttcagtgc ctctttcagt 14041 gatacgaaat taaaaccagg tactgtaagt gctcacctga tttttggttt ttaagaaggt 14101 gtttttttct gtgtaggtag ttgttaactt ggtgtccttg cacagagtgg caggaggacg 14161 atcagcggag ctttctattc tgccatcttc gtctgcctcc tttcctagcc tcatttcccc 14221 ctgcttcctc ctctgccccc tgggtcctga gcacatgggt cttgagaaca caccaagctc 14281 cttcccacct tggggccttt gcactggctg ttctctctgc ctggaatgct cttcctacag 14341 gtttttgcat ggctgcctcc tttactgcat tcatgattct gctcaaatgc cctctctaag 14401 cacctgagct aaaatcccta accccgcagt ctctctactt ttttgtttgg ttattggctc 14461 tcttgtttaa tctctgtctt cctaccagag ggcaggattc agaatttaaa cagtgcccta 14521 gcacataatg agagctcagt tatcttctgt taaagataaa aatgaagcca cactttaggt 14581 gtacccgaag gccaaccact cataaccagg taaccaaaat ttaattcttc ccaatttccc 14641 caaaacactg tctctaatca taaatatcaa acataacctt tacatctttg tcagcgtgat 14701 tcagtgaaat taaaccaatc agctataggc aaatcagttt aaacagctct gtttacctta 14761 aaaagaatga taacgtaaaa cagccaacca caaaaaaaaa tcaaaatatt ctcctttatg 14821 ctttataaag tgtgctatga ctgccataag gtgagctttc taccactttg tttgaagtct 14881 cctgggtcat gagctgtact ttctcttact gtataacaat aaactttaaa atgtttccta 14941 acttgatctg attctcattt tgacacttcc aataatcttg agaaatgctc ctgagcactt 15001 tgggtctggt attcttagag agggcaaaca ttctcatttt tcaaagggga agttgaggcc 15061 cagagagaga cagtgactgc ctgaagtcat aaagttccag cactctcttt ccttactccc 15121 ttgttctcct ggtcttgggg acatcagaca tctttaaaca tttggtctca ccatagaaac 15181 tttgagatcc agtgcattag ggttccctag agggacagaa caaataggat atagatatag 15241 atatatagat atatatgtag gagatataga tatatatatg catatgtagg atatatatat 15301 gtatatgtag gatatatata tgtatatata tacagaactt atatatatat atacagaata 15361 tatatatata tacagaacta ataggatata tatatatcct ccatatatat atatcctcca 15421 tatatcctcc atatatatat atcctccata tatatatatc ctccatatat atatatcctc 15481 catatatata tatcctccat atatatatcc tccatatata tatatatcct ccatatatat 15541 atatatatat cctccatata tatatatata tatcctccat gtatatatat atggagttaa 15601 ttaagcatta acttacatga tcacgaggtc ctacagtagg ctgtctgcaa gcttgaggag 15661 caaagagatc cggtctgagt ctcaaaactg aagaacttgg gagtccgatg ttcaagggca 15721 ggaagtgtcc agcatgggag aaagaggtag gctgggaggc taggccagtc tctccttttc 15781 acatttttct gcctgcttca tattcactgg cagctgatta gattgtgccc accagattaa 15841 gggtggattt gccctgtgca gcccactgac tcaaatgtta atctcttttg gcaacaccct 15901 cagagaacac ctaggatcaa tattttgtat ccttctatct aatcaagttg acactcagta 15961 ttaactgtca tacccagagt ggctgttact tcccaaaaat attcagtctg taattgataa 16021 agtggggttt ttatcctggt atgtcagact tgtggctgag actttctgac acttgtgagg 16081 ctggaaggga aagaggcagt ggataaagtg ggccttggct tgttctcata ggttgcaatt 16141 aacatctgaa gcacttgctt tcaacctgaa gaacttcatt atttcttgtg aggtggatct 16201 actaacaaca aatcctccca tttttattta actgggaatg tctttgcctt catttctgaa 16261 aggtagcttt gctggatgta ggattcttgg ttgatagttt tttctttaag cattttgagt 16321 atttaatctt actgcctcct ggccttcatt gtttctgctg agaagtcaac taccaatctt 16381 actggggtaa gtgatgagac atttttctct tgccactttc aagattttct ccttgatctt 16441 agccggtttt actatgatgt aatctgtttg tggattcttt acatatatcc ttcttagaat 16501 tcactgagct tcctgagtgt gtaggttatt gtttttaaaa taaatttggg aaattttctg 16561 ccattatttc tttgactatt ttttctgctc ctctctatct cttcctttca tctcctctct 16621 atctcttcct ttcttctagt agatcccact taggtttgtg tggctgatgg tgtcctacat 16681 ttgtctgaag ctctatttat ttttcttcac tctttcttct ctctagtctt catcttgcat 16741 aatcactatc aatcctgttt caaatctgct aattctttct tctgccagtt taaatgtgct 16801 attgaacctg ctagtgattt tttcatgtca gttattgtac ttttcagatt cagaatttcc 16861 atttggttct tttaaaataa tttctatctc tttaatgata ttttctactg gatgcaacat 16921 tgttatcata cctttatttt ctgaatcatg ctttctttta gttcagtaaa catattattt 16981 tttttctctg tagagatgga ggtcttgcta tgctgaccag gctggtctca aactcctggc 17041 ctcaggcaat cctcccgctt cagctttcca aagtgctggg attacaggta tgagccacca 17101 cacctggcct gcaaacatat ttataatggg tattttgaag tctttctctg ttaaatccat 17161 catgtggttg ctctcacagg cagttggtgt tgtctgcttt ttttctgctg tatggggcat 17221 actttcctgt ttctttgcag gtctgtattt gtctgttctc acattgttat aaggaaatag 17281 ccgagagtgg gtaatttata aaggaaagag gtttaattga ctcacagttc agcatggctg 17341 gggaggcctc aggaaacata caatcatggc agaaggtgaa ggggaagcaa ggcaccttct 17401 tcacaaggca gcaggaagga gaagtgcaag caggagaaat gctagacgct tataaaacca 17461 tcagatcatg tgggactcac tcactatcac gaggacagca tgggggaaac cgcccccatg 17521 atccaattac ctccacctgg tcccatcctt gacacatggg gattatgggg attaagggga 17581 ttacaattca agatgagatt ttgggtggag acatagacaa accatatcac atgcctcata 17641 aactttttgc tggaaattga aatcatttct gagacatggt cccactctgt cacccaggct 17701 ggagtgcagt ggtgcaatca tggctcactg cagcctcgac ctcctaggtt caagtgatcc 17761 ttccacctca tcctcctgag tagctgggac tatcagcgtg caccattcca cctggctaat 17821 tttttttatt acttttgtag agatgggggg tctctctgta ttgcccaggc tgttcttgat 17881 ctcctgggct caagtgattc ttccaccttg gcctctcaaa gtgctgggat tacaggtgtg 17941 aagcattatg cttggccaca aaatttttga taatatgttg tagcaactgc ccccaaccct 18001 ggcctctggg acttgttatt tgctggtata ttttttttag tgattggctg gattatttta 18061 atgaagctta ttccttctcc tcatagtctt aagcctctga tgttgctctt caggaagaca 18121 tgactttggg tgtgtacacc gtcactctag gatggcagtg gtgttagtag ggctctctat 18181 ctttccctta ccatgcccaa ctattaaact ccactaattg cctgctgatc attctattgt 18241 tttcagcaat gctctaggac ataaattgtt ctacaaacta attcaattaa attgtggttt 18301 atttgaagga atagtttttg aggtccacgt ctgatatttg ttctgatacc aggagtgctc 18361 ttaccagcca tcttatttcc tggttttctc ctgaaaacta tccagcttac aggccatgct 18421 ttatcttcat tagatccaca aatctcaact gccttgtatg acaacgtcca ctgttcttga 18481 gagcactctt agctttgaac tttacactct gttgcaaatg aagtcaatcc ctttgggaag 18541 agattaggag ctacctgttt tgtagcctgt tcctcctcca aggcaaaatc tttgagcaag 18601 agctctagag acaaggtggg gacggtggca agcttctgcc tgaatggcaa cccctttcta 18661 tgggctgagg ccttggcaga gtggggtgca gcagcctaag gtgctctcgg cttgcctctt 18721 cagcatgtaa ccaccacctc atgagcgagg caaggaaaac ttgagccgcg gtttcctcag 18781 tgtgttgcat ctaaggtaga gcctccatta aataactggg ggacagagtc ggcagattaa 18841 atgagccacc atcactcagc tgtactcacc tgatacttag cctcagaaac aagtagctgg 18901 tgtcagtatg aatgatgctg aagtgctgct cctcttggga agaaagtcct ctggctggga 18961 gccagagggg agagggagcc ctatgctctt gtctgcagca gtctgaagta gaatctctgc 19021 cttactgatc tgggagggga aaaggaagaa gctgtagtga ttcaaatacc acagacttgc 19081 ttttcttatt gaatttttgt aggttctctt agaaagacgt ttcttcattt gctgttttcc 19141 cttaggacca tttccagggg ctttaagttg ttgttttaaa aataatattt accactttca 19201 cttgggagtg catcagcaga gttcctcaag ctgtcatgct ggaagttgaa ctccgtgttt 19261 ggtacttttt tcatttgcac agtgatcttg ttttttctat gcctagacaa ggttatgaag 19321 tggtattttt tttctcactt cgtcaagatc acagagtgat tttgtctggc agtagtgctc 19381 tgtgagactg tttatgttca acaggaaaac atcaagacct agttgtgcgt accaggccag 19441 tgctggctgt cagggactgc gatatggttt ggatttgtgt ccctacccaa atctgatgtc 19501 gaattggagg aggggcctga tgggaggtga ttatatcatg ggggaggatt tccccattgc 19561 tgttctcatg acagtgagtg agttctcata atatctgatg gcttaaaagt gtgtggcact 19621 ttccccctta attttttgct ctcctgtcac catgaaaaga tatgccttgc ttccccttca 19681 ccttttgcca tgattgtaag tttcctgaag cctcccagtc atgcttcccg ttaagcctgt 19741 ggaactgtga gtgaattaaa cctttttttc tttataaatt acccagtctc aggtagttct 19801 ttatagtatg agaacggact agtacagact gcttatcacc ttctcacagt gtagaacaat 19861 cacttcgtaa ttattctgtc taaggggcaa ggaagcttgg gcatttatcc accaactccc 19921 aataatcatt ggttgagggc tgctcctggg ggcatttatt ccccagcctt ccctgttcag 19981 gcagaggggc ttcagacccc agaggaagca tcaagctgtt gcaaattggg ccaagtacat 20041 atggctgaga tctagtagga cacagacaac atctgctata gatgacctgc caaagatcct 20101 gaaagggcca ggatttgagc tcagtgttat ctgatgacca tgtcttagcc accacttgat 20161 ttcgtgaatg gagaggtttg aaaagaacag cagaactttg ctagaggggt tgtcaagatt 20221 acttgcctga acagtttctc atgtatatat tacacagact gttggagata cctgctctgt 20281 tcccagctcc tggctagacg atggggtcac aaagatgacc caggcatgtt ttctgctctt 20341 atggtacaat ctgtctgttg agggtgggat atctgaaggg acaagcccac acttcgtccc 20401 cagtcccacc tgtgttggct agggagtcct tcctggagga tgtgttgcgt aaacaatgga 20461 gagaatttag ggaacatcga gcctactctt ttcactttac agatgaagga acacaggccc 20521 acagcaaggg agtggtctgc aaaagagaac atatagactg caccctggag aaggccagct 20581 tctaagtggc catgcccttc cttagctccc cagtcaccca gatacaggaa gtgctcacta 20641 tatcagattc agctcctcag cacttaccaa gtccatcctg cccagaatgc tttccttcaa 20701 aggcctgcag gctcaaggct ctgtcttaag cctgagcctt tttacctcta aagaaagtaa 20761 tcaacctcct ccaaaggcat tcagtgtcct ggtatgggag aatatgtgtg tataccaggg 20821 tctgggaatt ttgagaaacc atcttaaggt ttggtgagtt gctttttctc cttatgtgaa 20881 tttatacagt cctctatgga gttctggagg gaaaatgaac caaacaaaat attttttttg 20941 agacagggtc tcactctatc tcttaggctg gggtgcagtg gcacaatctt ggctcactgt 21001 agccttgacc tcacaggctc aggtgatcct tccacctcag cctcctgagt agctcggact 21061 acaggcatgc agcaccacac tcacctagtt tctgtattct ttgtagagat aggattttgc 21121 catgttgccc aggttgatct caaactcctg ggctcaagtg atcctcttgc ctctgcctcc 21181 caaagtgctg ggattacagg tgtgtgctac tgcacccagc cgaagatttt ttttaaagtg 21241 aaaacaaagt atgtttttat ctatttccaa actatagcat gaatttcccc aagaaacttt 21301 tagcagatgc tctcctccct tttgcaaata tactaaaact attttatgaa aacagtattg 21361 ggaatatgga cccaaaacac ggttatcctg atgtagagat ttggccaatg gtacatttga 21421 attccactaa aatgaactct gttatactga cctgcttttg actatatcct ttatacagca 21481 ctgtagaggg ctttctgcag gcccagagca atggagtggc ctacgtaaga gaacacacat 21541 agctcagctt agaagaggcc agccgctgaa tggccatgcc ctcccctggg ccagagatca 21601 ttcaattaag agaaacacac accagcccaa gttcagctct tcagcaccta ccacattatc 21661 tagcccagaa ctatttcact taaagggttg aagattcaag tcccttcccc aatcatggca 21721 ctgggccatt ttacctctag gaatagtaaa caacctcccc taaaggcatt cagtatcttg 21781 gtgtccattc ttacaggccc ttgagggagt gatacaactt gtaccctcct cacaactctc 21841 atttctccca atcttcttgc aaatctgtgc ctgaagctgg aagcacaaca catgaacaat 21901 aactagcttg gtgcttggtg ttgaattaac ttttttaaga aagcaggcta acaggttgga 21961 aattgttctg tcaatatttt agaatctagt aatacaaaag accccttcta aaagcacatt 22021 ctgttgtgtt acacaagcaa ggggacaaag gtgatttcag aggtagcatt caggtgtgag 22081 catcataatt gatacatttg gacccctcta aagcacaggc cacagacaaa actagggcct 22141 ctatgagaac tgcagagttg gattatgatt ttttatttca agccgaagaa tcttggttta 22201 ggcaaaataa tggagggaag gcagagaaca aggaagagtc tctgtccatt ccctttgaac 22261 gagagaaaca aaaaaattaa acaaaggatg ttttaaaatc aaaaggtgaa agagtctgtg 22321 gagacatagg aggaaagaaa tccatggcaa tggctgagcc tggagctgtt gttgaggtgt 22381 ggggagcttg acgataaaca gcggctagca gctaggtact gcctctggaa gatagccagt 22441 ggggcacaag gaggtagctc tggccagctg ggctttacca taaaaaaaaa aaggaaaaat 22501 agaactctgg tccctatata aatcagtccc taacatctcc ttccacagcc ctcatcacga 22561 ggccaagcag taagaccaat ttctctttac ctctgatccc tatcctccct ttaactcatc 22621 tggtcacatt aaggggtttc agtgggattg aaattattta taaggaagaa aggaaagtaa 22681 agactgagtg gaggtttcac aatctgattt tccctggccc aggtccccaa ccccagcacc 22741 tcaaggagca caaaatatga ttctattcat tttaagcaca aagaatttcc aaggatttct 22801 cagaaattcc acaggcctac ccgccacatc aaaaaggtca actccaaaag tgagtccgac 22861 agaaatattg taagatgcta aatcgtctct tctatttatt taacatacag acttagattt 22921 aggagcctct tcttggctca acccgttagg tgaacacatt ctatagtatc aaaattttgc 22981 ataactctcc actaagtaaa tgtttagtcc attcattctg aactggaagg aaggagacta 23041 gctgccagct gtgatcctag tcaacatgca ggatgtgatg aaatgaggaa gtcttcttca 23101 tggtcatcga gttactgtcc aagagtagca accacagcca acgacagaga tcaaatttca 23161 ttcatctcac tctgctcctt ctcaccattc tttcccctat gaaccatagg aaagactcaa 23221 tccataggca aaataaaaat gatactctga ataactcttt taacaccctc ctgatattta 23281 ggcaccatgc ccaaactgtc ttgttccctc tcacaattgg tagcattcta atattgtaca 23341 tcaatatctc agttaaatag aataacacta ggagggaact tgaattaatc atgattgcat 23401 acttattgca ggaaaagata aataaccaag aaagccactt ttcctctgct gaatcttttt 23461 ctttgcttag ctgttttttc catatgtgtt gctttatata actatttaga gaaagtctca 23521 gggatcagca cacattcaat ctaatagcct aagtgttttt cagctaacgc catggcaata 23581 aagtttgctc tttggagtga tggctcttag acactgtatt agtctgttct cacactgcta 23641 tgaagaaatg cccaagcccg ggtaatttat aaaggaagga ggtttaattg actcactgtt 23701 ctgcatggct ggggaggcct caggaaactt acaatcgtgg cagaaggcaa aggagaagca 23761 ggcactttct tcacagggca gcaagatgga ggaagtgcaa gcaggggaaa tgccagatgt 23821 ttataaaacc atcagatctt gcgagactca ttcacaatca tgaggacgga atgggggaaa 23881 ccacccccgt gatccagtta cctccacctg gtcctgccct tgacacattg ggattatggg 23941 gattacaatt caagataaga ttttggatgg ggacacagcc aaaccatatt agacactaca 24001 aactcacagt gaatctctgg aatataccat gctcatttct gactatcttt gcttgtgctg 24061 ttcctatact cctttcctca tctagccaaa tcattcatgt ctgttttcct ctcacatttt 24121 ctgcctcctt tcacccatct tcttcttttt ccttctgttt ctcaacctca cctgcccaat 24181 tgtctcccct tccagacccc ctgtagacct gaagtctcaa tcgctaatta agaggcaaca 24241 gtcatgactt atggcctctc actgaaacat aactgttttg tgggtgtaat tcttatgttc 24301 cctacctgca tggtaagcac ttgcattttc agactgggtc tcctactctt cttgtttcct 24361 tcacggaagc tagcagtggt gctttatgtg taatttttaa aagggcaatg attatcgtaa 24421 cagctaaaat ttttggaagg cttaatgtgt gacaagtact gtgtgtgaat taagcttatt 24481 gctatttcac aaatgaagaa ataaagacag agagagatta aataatctgt ccaaatcaca 24541 gctgacaagt ggctataatg gaattcattg accaaacatc ccattctgtt cagtagactc 24601 aaatgtgagg ggaaaataga catacataat ctcacatttt ttttcaacag aaaataagga 24661 aataagctgt gcacagtggc tcatgcctgt aatcccagca ctttgggagg ctgaggtggg 24721 gggattgctt gaggccaaga gtttgagacc agcctgggca acatagtgag accccatctc 24781 taggaaaaaa atagctgggt gtggtggcat gcacctgtat gtagtcccag ctctttggga 24841 ggctgagata ggaggattgc ttgagcccag gagttcaaag ctgcagtgag ctatgatcag 24901 accgctgcac tccagcctgg gtaacagagc aggaccccat ctctaaatta gttaattaaa 24961 ataaaaagaa aatgaggaaa taaacaatga catgggattc ttgttcaatc tcatttcttt 25021 aaaacagtcc attaaccaga ctaaatctaa cccttgggta ttatgctgaa ttaaggattc 25081 attttaatat ataagtacac aatatttctg gatacaaatt aaacagaata gactgctggg 25141 tatgtactag atatttttct ttcattcttc tatttgctca ttagattccc ttttctgtgc 25201 tgccattttt ttctttataa tcattaaccc atccatttat tgtctgttag gaagtaaggc 25261 aacatatatg caatgaatat aataagaaag gaaaactgaa aaaatgttgt tttagaaaat 25321 aacgttaata gtgtcacttg aacccagaag ttcgaggctg taataagcta tgatcgcgct 25381 cctgcactcc agcatgggca acagagtgag atcctctctc ttaaaaaaaa aaaaaaagtt 25441 aactgtggat taagacatct ttttcaaaag ataaagccat tttcctttgc ccgggtgaat 25501 gctaagcagt ttgtgaactc acagtatcac tgggatttgg aagtctcacg tgttaaaaag 25561 taagacatga tcaaagcttg aatacagcca taggtctgct tgtttgaatc ccatcattcc 25621 aagatataca tattttgttt agtgtttcag tatattgata tggtttatgt aacagtatat 25681 atacaactac tttgtggagt atttcctgca atttcatgta ctcgcattca gaagcaatca 25741 gaagccattc ttcttttttc aaaacaagga aaacaatatt tcttccatca ttctacatat 25801 atatttggca cctgttaatg tcatatatgc caacagctac ttcagacatc tctttttgat 25861 tgactagtat ttcatttttc agtcgtcagt cttaaagatt aatgttttgt ctttctaata 25921 aatatgtagt taagtgaatt ccatacttat ttttagatcg accgtgtgtt ttcttgctta 25981 tctttctgtg ttaattgaaa ttctgttaaa cctagaaaat tacttggagt atgtgttcag 26041 atgacgtggg gcaggcaagc ccccagactg gggcttagcc tgggagagtt cttggctttg 26101 cccaggaaag aattcaaggg tgacttggtg gtattaaaca gcaatctttt acttaactgg 26161 agctgttcct tgtgaagcag ggctaactca ttgacatagt gcccagagcc acatttgtgg 26221 gctgttggca attgtattta tatccactta tggtccccta cccaccttaa atttggagag 26281 cctcatgtaa aatcagaaac aagcagggga ctagattccc ttattgtcct ctttatgccc 26341 atattgtttc ataattataa aatgttagac aggaacatag ggtccttgga tggaaggata 26401 ccacataaat atattcagat agatgaaagg cagaactctt tgttacttac agctccaaat 26461 gagagaaggc tgccaggtag agccacacct gaagttgtaa cgcaggatag agctatacct 26521 gtaggaggca gtccaggtat ggcaagggag gtttcatggg ctccctgtgg atttgttaat 26581 ttgaaactta ggcaaaaggg ctgtccctag ttgtctggta cctatccctg tggtgattag 26641 ggcaggtaca tagttgccag gaatgtgaga gccccataag ggaaatggtt gagatgtgga 26701 tttaatcagc tgctcaagaa gaggaactga ctagcctcta gccagggcct caaaattggg 26761 tcaagatggc actgaagaaa acaaaaccca caactggtgg agccaagatg gccgaatagg 26821 aacagctcca gtctacagct cccagcgtga gcgacgcaga agatgggtga tttctgcatt 26881 cccaactgag gtactggttt catctcactg gagagtgtca gacagtgggt acaggacagt 26941 gggtgcagcg caccgagtat gagccaaagc agggtgaggc atcgtcgcct cacccgggaa 27001 gcgcaagggg tcggggaatt ccctttccta gtcaaagaaa ggggtgacag atggcacctg 27061 gaaaatcagg tcactcccac cctaatactg cgcttttcca atggtcttag caaacagcac 27121 accaggagat tatgtcccac gcctgactcg gagggtccta cgcccacgga gcctcgctca 27181 ttgctagcac agcagtctga gatcaaactg caaggcggca gcgaggctgg gggaggggcg 27241 cctgccattg ctgaggctcg agtaggtaaa caaagcggcc aggaagctcg aactgggtgg 27301 atcccaccgc agctcaagga ggcccacctg cctctgtaga ctccacctct gggggcaggg 27361 catagccaaa cagaaggcag cagaaacctc tgcagactta aatgtcgctg tctgacagct 27421 ttgaagagag tagtggttct cccagcatgc agcttgagat cggagaacgg acagactgcc 27481 tcctcaagtg ggtccctgac ctccgagtag cctaactggg aggcaccccc cagtaggggc 27541 agactgacac ctcacacggc cgggtactcc tctgagatga aacttccaga ggaattatca 27601 ggcagcaaca ttttctgttc accaatatcc gctgttctgg agcctctgct gctgataccc 27661 aggcaaacag ggtctggagt ggacatccag caaactccaa aagacctgca gctgagggtc 27721 ctgactgtta gaaggaaaac caacaaacag aaaggacatc cacaccaaaa ccccatctgt 27781 acgtcaccat catcaaagac caaaggtaga taaaaccaca aagatgggga aaaaacagag 27841 cagaaaaact ggaaactcta aaaatcagag cgcctctcct cctccaaagg aacgcagctc 27901 ctcaccagca atggaacaaa gctggatgga gaatgacttt gatgacttga gagaagaagg 27961 cttcagacaa tcaaactact ctgagctaaa ggaggaagtt cgaacccatg gcaaagaagt 28021 taaaaacctt gaaaaaaaat tagacgaatg gctaactaga ataaccaatg cagagaagtc 28081 cttaaaggac ctgatggagc tgaaaaccac agcccgagaa ctacatgatg aatgcacaag 28141 cctcagtagc tgattccatc aactggaaga aagggtatca gtgacggaag atcaaatgaa 28201 tgaaatgaag cgagaagaga agtttagaga aaaaagaata aaaagaaatg aacaaagcct 28261 ccaagaaata tgggactatg tgaaaagacc aaatctacac ctgattggtg tacctgaaag 28321 tgacagggag aatggaacca agttggaaaa cactctgcag gatattatcc aagagaactt 28381 ccccaatcta gtaaggcagg ccaacattca aattcaggaa atacagagaa cgccacaaag 28441 atactcctcg agaagagcaa ctccaagaca cataattgtc agattcacca aagttgaaat 28501 gaaggaaaaa atgttaaggg cagccagaga gaaaggtcgg gttacccaca aagggaagcc 28561 catcagacta acagctgatc tcttggcaga aactctacaa gccagaagag agtgggggcc 28621 aatattcaac attcttaaag aaaagaattt tcaacccaga atttcatatc cagccaaact 28681 aagcttcata ggtgaaggag aaataaaatc ctttacagac aagcaaaatg ctgagagatt 28741 ttgtcaccac caggcctgcc ctaaaagagc tcctgaagga agcactaaac atggaaagga 28801 aaaaccagta ccagccactg caaaaacatg ccaaattgta aagaccatcg aggctaggaa 28861 gaaactgcat caactaatga gcaaaataac cagctaacat cataatgaca ggatcaaatt 28921 cacacataac aatattaacc ttaaatgtaa atgggctaaa tgctccaatt aaaagacaca 28981 gactggcaaa ttggataaag agcacaccca tcagtgtgct gtattcagga aacccatctc 29041 acgtgcagag acacacatag gctcaaaata aagggatgga ggaagatcta ccaagcaaat 29101 ggaaaacaaa aaaaggcagg ggttgcaacc ctattctctg ataaaacaga ctttaaacca 29161 acaaagatca aaagagacaa agaaggccat tacataatgg taaagggatc aattcaacaa 29221 gagagctaac tatcctaaat atatatgcac ccaatacggg agcacccaga ttcataaagc 29281 aagtccttag agaccgacaa agagacttag actccaagac tttaacaccc cactgtcaac 29341 attagacaga tcaacgagac agaaagttaa caaggatacc caggaattga actcagctct 29401 gcaccaagcg gacctaatag acatctacag aactctccac cccaaatcaa cagaatatac 29461 attcttttca gcaccacacc acacctattc caaaattgac cacatagttg gaagtaaagc 29521 actcctcagc aaatgtaaaa gaacagaaat tataacaaac tgtctctcag accacagcgc 29581 aatcaaacta gaactcagga ttaagaaact cactcgaaac cactcaacta catggaaact 29641 gaacaacctg cttctgaatg actactgggt acataatgaa atgaaggcag aaataaagat 29701 gttctttgaa accaacgaga acaaagacaa aacataccag aatctctggg acacattcaa 29761 aacagtgtgt agagggaaat ttatagcact aaatgcccac aagagaaagc aggaaagatc 29821 taaaattgac accctaacat cacaattaaa aggacttgag aagcaagagc aaacacattc 29881 aaaagctagc agaaggcaag aaataactaa gatcagagca gaactgaagg aaatagagac 29941 acaaaaaacc cttcaaaaaa tcaacgaatc caggagctgg ttttttgaaa agatcaacaa 30001 aattgataga ccgctagcaa gactaataaa gaataaaaga gaggagaatc aaatagacgc 30061 aataaaaaat gataaagggg atatcaccac tgatcccaca gaaatacaaa ctgccatcag 30121 cgaatactat aaacacctct acgcaaataa actggaaaat ctagaagaaa tggataaatt 30181 cctcgacacc tacaccctcc caagactaaa ccaggaagaa gttgaatctc tgaatagacc 30241 aataacaggc tctgaaattg aggcaataat taatagctta ccaaccaaaa aaagtccagg 30301 accagatgga ttcatggccg aattctacca gagggacaag gaggagctgg taccattcct 30361 tctgaaactg ttccaatcaa tagaaaaaga gggaatcctc cctaactcat tttatgaggc 30421 cagcatcatg ctgataccaa agcctggcag agacacaaca aaaaaagaga attttagacc 30481 aatatccctg atgaacatcg atgcaaaaat cctcaataaa atactggcaa accgaatcca 30541 gcagcacatc aaaaagctta tccaccatga tcaagtgggc ttcatccctg ggatgcaagg 30601 ctggttcaac ttacgcaaat cactaaacgt aatccagcat ataaacagaa ccaatgacaa 30661 aaaccacatg attatctcaa tagatgcaga aaaggccttt gacaaaattc aacagccctt 30721 catgctaaaa actctcaata aattaggtat tgatgggatg tatctcaaaa taataagagc 30781 tatctatgac agacccacag ccaatatcat actgaatggg caaaaactgg aagcattccc 30841 tttgaaaact ggcacaagac agggatgccc tctctcacca ctcctattca atatagtgtt 30901 ggaagttctg gccagggcaa tcaggcagga gaaggaaata aagggtattc aattaggaaa 30961 agaggaagtc aaattgtccc tgtttgcaga tgacatgact gtatatttag aaaacgccat 31021 cgtctcagcc ccaaatctcc ttaagctgat aggcaacttc agcaaagtct caggatacaa 31081 aatcaatgtg caaaaatcac aagcattctt atacactaat aacagacaaa cagagagcca 31141 aatcatgagt gaactcccat tcagaattgc ttcaaagaga ataaaatacc taggaatcca 31201 acttacaagg gatgtgaagg acctcttcaa gcagaactac aaaccactgc tcaatgaaat 31261 aaaagaggat acaaacaaat ggaagaacat tccatgctca tgggtaggaa gaatcaatat 31321 cgtgaaaatg gccatactgc ccaaggtaat ttatagattc aatgccatcc ccatcaagct 31381 actaatgact ttcttcacag aattggaaaa aactacttta aagttcatat ggaaccaaaa 31441 aagagcttgc attgccaagt caatcctaag ccaaaagaac aaagctggag gcatcatgct 31501 acctgacttc aaactatact acaaggctac agtaatcaaa acagcatggt actggtacca 31561 aaacagagat atagaccaat ggaacagaac agagtcctca gaaataatgc cgcttatcta 31621 caactatctg atctttgaca aacctgacaa aaacaagaaa tggggaaagg attccctatt 31681 taataaatgg tgctgggaaa actggcttgc catatgtaga aagctgaaac tggatccctt 31741 ccttacgcct tatataaaaa ttaattcaag atggattaaa gacttaaatg ttagacctaa 31801 aaccataaaa accctagaag aaaaccaggc aataccattc aggacatagg catggtcaag 31861 gacttcatgt ctaaaacacc aaaagcaatg gcaacgaaag ccaaaattga caaatgggat 31921 ctaattaaac taaagagctt ctgcacagca aaagaaacta ccatcagagt gaacaggcaa 31981 cctacagaat gggagaaaat ttttgcaatc tactcatctg acaaagggct aatatccaga 32041 atctacaatg aactcaaaca aatttacaag aaaaaaacaa cccatcaaaa agtgggcaaa 32101 ggatatgaac agacacttct tgaaagaaga catttatgca gccaaaagac acatgaaaaa 32161 atgctcatca tccctggcca tcagagaaat gcaaatgaaa accacaacga gataccatct 32221 cacaccagtt agaatggcga tcattaaaaa gtcaggaaac aacaggtgct ggagaggatg 32281 tggagaaata ggaacagttt tacactgttg gtgagactgt aaactagttc aaccagtgtg 32341 gaagacagtg tggcgattcc tcagggatct agaactagaa ataccatttg acccagccat 32401 cccattactg ggtgtataca caaaggattg taagtcatgc tgctataaag atacatgcac 32461 acgtatgttt attgtggcac tattcacaat agcaaagact tggaaccaac ccaaatgtcc 32521 aacaatgata gactggatta agaaaacgtg gcacatatac accatggaat actatgcagc 32581 cataaaaaat gatgagttca cgtcctttgt agggacatgg ataaagctgg aaaccatcat 32641 tctcagcaaa ctatcacaag gccaaaaacc aaacaccgca tattctcact cataggtggg 32701 aattgaacaa tgagaacaca tggacacagg aaggggaaca tcacacaccg gggcctgttg 32761 tggggtgggg ggagggggga gggatagcat taggagatat accgaatatt aaatgacgag 32821 ttaatgggtg cagcacacca acatggcaca tgtatacata tgtaacaaac ctgcatgttg 32881 tgcacatgta ccctagaact taaagtacaa taaaaaaaaa aaaagaaaaa agcccccaca 32941 attgtattac acacacacac acctcttcac attggtggga ggtcccctag gactggggca 33001 gggctttgtc agcatacgtg aaatagcttc catgattccc tttgctttgg gtggttgcac 33061 tgtggtggca gcagtctggg agccaagaca acagagatgt ttccactctc agagagagcc 33121 ccgttctctc aggaaggcaa agtttccagt gtcccactca gtggccatga gggtaggtct 33181 ggagcatggg gttgggatgg gggtgggggg acactcatac gcttgtcctt agttcattga 33241 ctggaatggt cttgggcgtc agctggatgg cttcagctaa gttcctgttg ctgtagctgt 33301 gggttgggga tgggagagag ggtcatctac tgggggaaaa cagtctcctg aaactgccca 33361 ttccagactg gggaaagacc gaccgatggg agaggatatg tctgaaggca cagaacatta 33421 tagaactggc tctccatagc ccttatgcca tgcccattca ctcagagtaa attgtagtaa 33481 actctgcaaa aacaacaaag tgttttgtaa atcagttaat ttcttaagct tgcatgaact 33541 agtcattggc aacttgattg ctccaagaat ggctcacttg ttcacttggt cattctgtga 33601 cttggatttt ggcaaaatgg ctcttgatga attaaccagg agtcaaaatc atgaccgcca 33661 acaccaagca attaaggtga agtctttgtt catgggaaga ccctctttag aactttctcc 33721 atggagccct gtgttccaac gggagaagga aaatactagg attttatcct tacatatctg 33781 tggggacaaa tcattcttga tcctcccaat gaccttttct agctcttttt ttatctgtac 33841 cttcctaatg agatgctagg taggcagaat gggcagaaag agctgggagc atggtcaagt 33901 gtggggtctt ggtgatgaat ggggagaacc actgtaggta cttttttttg tagacagtcc 33961 ccattcctaa caaactctga tacatgaatg atcagatcct tgtttggagt accagcagca 34021 gtagctgaag aagcctcaga agtctttaag cagctcttac ccagggttcc cagaaagaat 34081 gcttatgaat gctgttatag caacaatttc aaccaatggg aagtaaagcc caaatatttc 34141 tgcccgtttc agttcggtca aggaggagga aatgggagag agtgagggaa ggaggaacat 34201 caaatttatc tgacctcaat aacaggacca gtctgtagta tccatactca gctccttccc 34261 agcaagagat gaactcatag ggctgtgcaa ggaggaagag tgacaataag ccttacagct 34321 ggtatgtatt cggaggttct tatgtgccag ttactgtgta aatactcggt atatgttgtc 34381 atttactcct cacaacagat attagtgttt tcatcttaaa tttcaggaaa cagacacaat 34441 gctcattttc ccaaggctca aactcagagg tgtctgtgtt ttacttggct attcctaaat 34501 tctgagatca gggcctccct tgtgggccta aaggttactc ttgtctttta tgaaagagag 34561 agacagtata ttatagtatg ctaactataa tggttatcag cagaggcaga agcaatttac 34621 atattgtttc tttttcttca atttccttgc ctataaaatg gggataacaa gcttactgaa 34681 caggattttt atgaggattg aatcagatat ttcattgaaa gactgagagc agaacctgcc 34741 acattgtaat ctttctttac aaggtttaga tattatttct gttattaaat attttgagag 34801 aggattgtaa ccacctgatg ggttcttcct gcctgctgca caaatgaaga ccatggcatg 34861 gtagtaaata aaagaattta attgatgcaa ggctggccac gccacatggg agatagaatt 34921 gttactcaaa tcaatcttct tgagcattta ggggtagggt tttccaaaga tagttttagg 34981 gagggagtta gggtggctaa gcaatgggtg cttgctgctg attggttggg ggtgcaatca 35041 taagggtgca ggaatggtcc ttctgcccac tgaatatctt ctgggtgggg ccacaggagt 35101 ggctggcagg tccaggtgaa gccattggtg tcagacagac aaacaaatct gaaaagatat 35161 ctcaaaaggc cagtcttagg ttctacaata gtaatgttat ctggaggagt aattggggaa 35221 gtagcatatc ttgtgacctc cagaacaata gctggcaatc gtttatgtct acaccttagc 35281 agaatttagg ctcctctatc cccctagcct ggtggtctct cattagcttt acaaaggcag 35341 ttgaattttg ggaaagggct attatcattt aaagtataaa ctaaatgtct ctccaagtta 35401 gcttggccta acccaggaat aattaggggc agcttgaagg ccaaaggcaa gatgggactt 35461 tggcatgatc agatctcttt cactgctata attttctcag tgttgtaatt tttgcaaaag 35521 ccgtttcagc ctcactctgt cacccaggct ggagtgcagt ggtgcaatca cagctcactg 35581 cagcctcaac ctcccaggct caagtactcc tcctgcctta gcctcccaag tagctgggac 35641 cacaggcaca caccaccaca cccagctaac ttttttatta tgtgtggaga ctagttcttc 35701 ctatgctgcc caggctggtc tcaaactcct gacctcaagt gatcctcctg cctcggcctc 35761 ccaaagtgct gggattacag gtgtgagcaa ccatgttcag tctactatta ttttaatagt 35821 agtgtaaatg gtgttgtttt aaatgtcctt aattagtgaa ataaaaagat aataacctat 35881 gtcaggaccc aattttatta actttattag tccctgacat ctccaagtat cagaacccct 35941 gcctttaacc actatgcttt gtatgtggag tagtttccca ccttaagaca attggtctta 36001 attgcaatac agaaagggaa gtgcttcgtt aaatgtcctc tgtatccagg aagaaatact 36061 aagcttgctg cctggaaaac actacacaaa tctaggaccc atatgtaaac atcatgctgc 36121 tgagatggca tggaggggtc agggcactgg ggtgggagga ggaagtgtaa ctgcccaagg 36181 ggtttacctt gcccactgcc taggcagagc cgatttatga agacagggga attgcaatag 36241 agaaagagta attcatgcag aaccggctgt gtgggagact ggagttttat tatgacttaa 36301 atcagtcttc tcgagcattc agggagcaga gtttttaagg ataacttggt gggtgggggg 36361 aagccagtga gccaggagtg ctgattggcc ggagatgaaa tcatagcgag tcgaagctgt 36421 cttcttgtgc tgagtcagtt cctgggtggc ggccacaaga tcagatgagc cagtttatcc 36481 atttgggtgg tgccagctga tccatcaagt gcagggttta caaaatatct caaccactga 36541 tcttaggagc agtttaggaa gggttagaat cttgtagcct ccagctgcat gactcctaaa 36601 ctataatttc taatcttgtg gccaatgttg gtcctacaaa ggcaatctag ttcccaggca 36661 agaaggaagt ctgctttggg aaaggctatt accatctttg ttcaaactat aaactaagtt 36721 tttctccaag gttagttggg cctacgccca ggaatgaaca aggacagctt ggaggttaga 36781 agcaaaatgg agttggttaa gttaaatctc tttcactgtc ttagtcataa ttttgcaaag 36841 gcggtttcag aagtatggga gtcagtgtgg ctaaatagga ggaacagaga ctctcaatgc 36901 agacagaggg tttcaagtca cagatctgcc acttactgga tttgtgactt taggcaggtt 36961 gttttagctt tctgagcctc atctgtacaa tggaattttc acagtaatta gttcataggg 37021 ctgaaaagaa gatgaaatga agagattcaa gtagaacact cagaacaagg cctgccatgc 37081 tgtaggtgtt taataaacat tatcttccac agtgatgatg attaagggac taaatttcat 37141 gcacgtccat gtgaagagac caccaaacag gctttgtgtg agcaacgggg ctgtttattt 37201 tacctgggtg caggcgggct gagtccaaaa agagagtcag tgaagggaga taggggtggg 37261 tggggctgtt ttataggatt tgggtaggta gtgaaaaatt acaatcaaag tgggtttttc 37321 tcttatgggc aggggcaggg gccacaaggt gctcagtggg ggaggtttgg agccaggtga 37381 aggaatttca caaggttaat tgctcagtta aggtggggca ggaacaaatc acaatggtgg 37441 aatgtcatca gttaaggcag gaaccggcca ttttcacttc ttttgtgatt cttcacttgc 37501 ttcaggccat ctggatgtat acgtgcaggt cacaggggtt acgatggctt agcttgggct 37561 cagaggcctg atactaacca taacagatag aatttaaaaa gcagaggaac caaaaacagc 37621 ccaccactct cctgagctgg gagaatttgc ccagaattcc agggataggt gtgtcctaag 37681 cttccgctag gtagggtcaa tgcccagcag ccaagccaga aggaagattt gggcagacac 37741 tgtcattggt gaacacgtaa gctccagagc tagaatacta ctgatgatgg tgactggctt 37801 tggttcttcc gtctggtctg acacccagaa agccgatcag actcaccgct atgaaccttc 37861 tagtgccaca gcaggccaga aaccacagcc ctgcctcccc ttctcctgag ttcccggtgt 37921 gggtgtggta ggggcacggg tggattcagt atttctgtaa tggaaactag aaacacgtag 37981 ccccacagta gctgcattcc aaaggggcag agatctgact tcctttgttt catgctttta 38041 gagacaatga cagcaagacc tctcacagga agtgctagga ccactggaga gagtgaattc 38101 aactcttggc caaaaaccta tgggtgatct ttcaggcctc gccataggca attacaaaag 38161 ctttacctgg cagggatcat ggggggacgt gccctcccca catcagacct gatgcacagc 38221 cgtgcattgc agtctgtgaa tggagttgca gtcaaggact cagacgtccc gaactgatca 38281 tgctctcaca catgtcggca ttcctagacc agatgcctcc atttggagtc cgccttatca 38341 ggagcctttg gcagaggcct ttctcaagca cctgagctaa gacctcttgt tgccatggtg 38401 ggagtggggg atgcctaaaa atattttttt cttcaagctg actcagtact cagtactttt 38461 ctactcctag cccttccctg tcacctcctc ctgcccctgg gtccataaaa tggcaggagc 38521 ctcttgttca gggctccctt aacagtgaga tgatgaagcc ctgcatccgt cagacccctg 38581 ttggagccac cccatgtgga aaaatggaat aatggaggag tcggtacttt ttctggttca 38641 gtcccttgct tatactatca aaggcagtta ttaaagtctg actgttactt ttattttgac 38701 tcgtcttaat cgactactct gacacccagc agctcagctc tctctagctc agctgagctc 38761 ttgaaaaatt ggaatggcaa gctgggtagt tgattatgca caatgtcgtt tctgaagagc 38821 aaagagcctg aaaatctcgt ggggcaattc actaatgatg ggtgtaagca gaccttgcag 38881 ggcactcctg gaggatgcta atggattatt tgccagagtc agaggatctg accgggcact 38941 ctgccagtga gtcctgggaa gtgctagcag acctcacgtg gcactttacc agccatgcca 39001 gcaggtctca ggaattgcta ttgtaagtca cagggtatac ttggaggatg atagaaaatt 39061 gtttataaaa gcttttggtt ccaaggtaac catcggtcct cctgtttatc cccatcacct 39121 cggctgagat aacctagtgg atcaaataag agatccagca gaggggctct aagaagccac 39181 agaaacaatt ccctggacac cgttctctga tgatggggcc agaagtcgtt ggagaccctg 39241 taaactgacc tgtaccccaa aactattcct agatctaagc agttccctgg ggctgacgga 39301 caacatgcca tgcagcatgt tactgagact catgcttcta ataaaaataa tgtagagctc 39361 tttagaatca ctttgtccag tgatctcact agatagggtc aatgcccagc agccaagcca 39421 gaaggaagac ttgcttaggg gaagaaacca cagagacttg gctagtttgc ttccattacg 39481 aaaaaggtaa tcttgttaaa taaacttcat aaaggaaagt ggctccttga ggggtacatc 39541 actacagacc ccctcctcca tagtgccttg atagtggcca ctttctctga ggaggaggag 39601 caaggtgggg agatgagagt gtatgtatac caacgaccat ctctttcctc agctgggtac 39661 tatttggtct ctgacagatt tggtccacac ataatgagta cctagtcact gaaaaacatg 39721 tctcaaccac cttcgcagag actatggagg tcctgcagcc ctcctggtgg tctcttgggt 39781 ctactacgag ggagtgggaa cggtagatga gaagattaac cagacttaga tttacgggtt 39841 tgttcatgct gctcctgctt gttgtcaagc ccaattaagc cacctcctga ggtcaagtcc 39901 atcaatctgc agtgtgatcc agatcttaac tcagaaattc tcaggcacag aagaaatggc 39961 ttccgataaa atagcaaaat ttgacccctc ccctccaaaa agggagcata aaaatagtac 40021 ttgagtgccc tcttctggac tgtctgggca agggccttcc caaagagatc tctgagtctg 40081 attactcaat aatggaacca cgaaagcaga ttgatgggct ccccactaaa aaattcaggg 40141 ccaagtacca ggccatgggg atttgataag tggtcccgct agagtctgca cagtgatggc 40201 cttccacctc tcctggtgga tttggagaca aatcagagat cctgagcaac ttttgctctg 40261 acactcagct ttcaaatata aacaagatta ggggttacaa acaaaaatcc aaagatttta 40321 ggggttataa acaagatcct tctgtactca ggggacccag aggccctatg ccaaggttag 40381 agttcaactg aacagtggag aggaattgca cttcttaggc cttttagata ccagtacaca 40441 ggtgactgtg atccccacaa acccttgcat caagatgagg agaaaacaga tcgtgttctc 40501 tgactttggg ctcctactaa cctccgttgt catggccccc actactgaat gaaatattgg 40561 cagtgatgtg ttttccatgt gcagccccac ttgcctttgc ctccaagccc tgacaggaat 40621 tgcaatagtt atggccatct tagtgagaaa tatagaagac catggaccca tctgatgacc 40681 aacacctagt actgttgttt cccaaaaaac aatgtcagct gcctagggga gaaaaagaat 40741 aattgccatc attgctgagc ttaaggaagc caaaatgttt catgagactg tttctccatt 40801 caatagcccc atctggcctg tgcacaagac cttggtttct tggagactta ccactgattt 40861 taggcagctc aatgcgtaat accacctttg gcaccagcag tacctgacat tgtgactatt 40921 acagaggccc aaaacagaga gaacttggta tgcagcaatt gatattgcaa aggcatatca 40981 aattgtagtc cccagtgttg gagatggggc ctggtgagag gcgattagat cataggggca 41041 gatttcccct ctagttttgt tctaatgata gtgagtaatc atgacatctg gctgtttaaa 41101 agtgtagcat cggctgggca tggtggctca cacctgtaat cccagcactt tgggaggccg 41161 aagcgggcgg atcatgaggt cagaagttcg agaacagcct gaccaacatg gtgaaaccct 41221 gtctctacta aaaatacaaa aattagccgg gcgtggtggt gtgcgcctgt aatcccagct 41281 actcaggagg ctgaggcagg agaattgctt gaacccggga ggcagaggtt gcagtgagcc 41341 aagatggtgc cactgcactc cagcctggac gacagagtga gactccgtct aaaaaaaaaa 41401 aaaaaaagaa agtgtagcac ctctcctctc ttcctcctgc tctggctgtg tgaagatgta 41461 cctgcttccc cttccaccat aattgtaagt ttcctgaaac atccaagcca tgcttcctct 41521 acagcctgcg tacctgagtc aattaaacct cttttcttta taaattaccc agtctcaggt 41581 atttctttat agcagtgcaa gaaaaaacta atacagagca atttcttaaa actactcctc 41641 cctgaatcct cactctcaca gccagagacc tagttgccat ctgcaatgga cttaattgtg 41701 ccctgcctca aattcctatg ttgaagttgt aatccccaat gtgactgcat ttgtagattc 41761 ggcttttagg tgataatcat taaggttaaa agaggtcata gggtagggtc ctaatccgat 41821 aggattggtt accttataac ctcataagaa gaggaagatt gaccaggcgt ggtggctcat 41881 gcctgtaatc ccagcacttt gggaggctga ggcgggtgga tcacctgagg tcaggagttc 41941 gagaccagcc tggccaacca acatggtgaa accctgtctc tactaaaaat acaaaaacta 42001 gctgggcatg gaggcacacc cctgtaatct cagctactca ggaggctgag gtaggagaat 42061 cacttgaacc tgggaggcag aggttgcatt agtaagccga gatcacgcca ctgaactcca 42121 gcctgggaga cagagcaaga ctccatctca aaaaaaaaaa aaaaaaaaga agaagaagag 42181 gaagaggaag attctctctc tctctcccct tccctctctc cctcccccct ctttctgtct 42241 actggaatgt gaaaggaaaa gccagtgagt acacagagag aaggcagcca actgcaagca 42301 aggaaggtga ctctcaccaa aacctgagct cactatcatc ctgaccttag acttccagcc 42361 tctagaactg tgagaaaata aatttgtgtt gtttaagcca gccagtctat gatattttgt 42421 tgtagcagcc caaacagact aagacaccat ccatgatacc ttcttctttc tcagccccac 42481 atatattatg tcaccatatc ctgtgtgccc ttccttcaaa atatgctctg aatctggtta 42541 cttaccactt ccactgttgc cacccataac caaactgcta tcatcagctc ttccctaaac 42601 acctgcaata gctccttact aggctcccgc actcacgcct gattccctcc catctgttat 42661 cttcatatta aaaataatac agtggctttc cactataccc taggatgtga tttaaaattc 42721 ttagcatggc cttcaaagct ttgcctaatc tatctcctgc ttaactcttc aaactactgg 42781 agcctctctt ctcctcactc ctcaccatct agtcacatgg cccctttggg aatatatgaa 42841 aacttcccag cttggaatca ctgcccatta tctttcacct ggctcattct tgtttatcct 42901 tcaggttcag ctttcatgtc ccttctttgg agtggccttc catgacctac atcctaaatt 42961 agaagtcttt gtattctttc tagttttagc atgttcttta cattgacatg cataccacca 43021 gttttgtatg tattggttta ttgtctggct ccccagttgg gtgaccaatt tcatgagagt 43081 aaggaaagca tctgccttgt tcactgatgt tattaccagc acccagccta gggttcaaca 43141 ggagtaggtg ctcagttaac gtgcgttgag tgagtgaata aatgaataga gtctatgagg 43201 ttggtgtttt tagttcagag tagtttggtg attcacttgc tgccacacag ctagcaagca 43261 gacataaatg ggcaagatcc cagaatcata gttccaaggt gaatgcataa tcaagagact 43321 gccagggggt gaaaaatgga agggggcatg gagtgtgaga caggtgaaaa gattccagac 43381 cattgattct agggagatct tcccccaggg tgtgtttttt tggggggtac atattcaata 43441 ttcacatttg ttgaaaaata gcttctgatt tcataataaa tatatattgc tgaaagacta 43501 ttccggttgg taaggataac agaaaatgca gcgtttcttc gctgagggca gatttttttc 43561 agtttattga aaggaggcaa gttaaaaacg tgactaggaa actgcacgaa agctttctgc 43621 cattcccctg attggagcta aggcctgtgg ggtgcaggag acaccaaggt ttgggtcgtg 43681 ccgggactat gctcatatag tggagagtag gtcttggtga cacagcaacc cacaggctgc 43741 acattgccac ccacccctca ccccagcaaa ctgcacatca ttccgcccct accccacaga 43801 ctccccttca caattcccta atctgtatat tacctctcag gctccacaca tgccccacgc 43861 cagcccccct ctccaagctc cactttaccc cacattacct cctagaatca catcacccca 43921 caaatggtga gaaatggcca tttgtgaaaa gcgagtttgg aaggtgctgg tcaggagagc 43981 cgggtgcgaa gccagaaccc ttgggcagag gcagcatgag ggtcaactcg acagcagaag 44041 gcgcctgaga gtctctccgc aaaggccatg caaggacgcg cgctcctgta agtacagcct 44101 ccatgcccgg gttaattcct agacaggtag tgagccaatc aagccagcca cccactttct 44161 aaaaaacaaa aaatctactc tttctccttg ccctcctctg ctcagcatcg ttcactatgc 44221 ctcgaatatt ttaaatggcc ctatggatat ctgtaaaaca aaggctagta gttatcatta 44281 ttattatcat taatcagata tataggtaaa aggtgaccct aggaacaaag ccaagaatct 44341 agagtaagga tggctaatgg ggattgtgtt tgaatagttt gcttttgttt cttattttgc 44401 ttgggttgtg acgtgtgttg cttcaacata tggcttaaaa aaaaaaaagt ctgttaatgt 44461 attggtcaca aagtgtcttt ctgggaaaag gcacttattt tcccagagtg atttgacaca 44521 tttctggtga aatgcattaa cattaacgtt cctttttttt tccccaagtt ttggacatat 44581 gggtttaatt acctgctaga ttagcagatg gaaatatgtg gtaaatggat gtaaacaagg 44641 tgcataaact taggagagag gcagaaattt ttaaacttta ggtttagaca ttcactagat 44701 tggacaattt ttaacattta aaataaaact ttttgccaca aagaaacaat tgtattaaaa 44761 ctagtttcca gaactgccca aatgactttt taaacatgat ttaaatgtca agctttcttt 44821 ggcaaatcta tccaaaatac taattttcct tctagaactt ggtttataat ttatatatta 44881 gagtgccctt aattgatgtt cattgggata ccttcctctg agccctaatg gtttctatct 44941 tccctttaaa atgattaagt atcttttaat taagttgtca ttcgcaaaga tattctggag 45001 caaaaatcag gaattaaaca tctatgaatt cttcttcact tctgttatag tgtagcaatt 45061 tatcttctaa cccatagtag tctatcaatt gagtaaggct tagtttcttg ttctcttcag 45121 acacagctga ctgcaactca taaagcaaca gctggctaca gctcctccac ttctttatta 45181 acaactgtaa tggagacaga ttctttgatc atacagtttg atgagtttta gcctccaaag 45241 aaggtcttct ttctcctgaa cctgcttcac caatttggct ttttcatcgt ttaatctttg 45301 tttagaaaga ttctcattca caaactcatt ttggagagca ctgcatgacc cagagtcaaa 45361 tgactgagat tcacagacat tgattcttca aagtattttt caaatttata ttttcttcaa 45421 attcactgtc tatgtgttta aaactttctt gaaattccaa acagttttcc tctgtggaag 45481 atgctggttt ctctgaaaag gtctgatcgt tttcttcatt ctctacttta agaagtttca 45541 ccacattata acaggaacta aatgaagatc ttgctttcct taatctttct ctaagtgttg 45601 cactcatagg ttgtgttcgt gaactatttg tatagggaga tgatggattc acagaggtct 45661 gaggagtgct aggtaaaacc acagctgagt ctgatggact ttccatcttg aaaatgaaat 45721 cttggttttc cttcaactcc acagcagact cccgaggaag tccaggtctc aggcatcccc 45781 cttcccctca gcagggtacc tccctccgcc atccccagag agagcaaaaa gccaactttc 45841 ccttcttaaa ggactcacgg gaagggaata atggaaataa taacttcaca tgctgaagaa 45901 aacagataaa gtgtgaagaa accatttatc aacatttaaa cctccacaca ttatttgttc 45961 tttttttaaa agaactttac taagaatcag attcattaag tcaaaggatg ttagaacaca 46021 aaggaacctg caaaattaaa tcgcgttctt attatataaa acaggagacc cagagagggt 46081 aagtgacttg tccaaagtca cagagcactt gaatgcagtc agggctgatt tccagtctgc 46141 aactcaaaag agaagcagtg tttctcagtg aaagaacagt ggtgtgagtt ttaaagattt 46201 tattctattc tttattacat gattattata ggaaaaattg ggacacattt taaattacct 46261 agaatcctac cacttcacaa tgatcattgt tagtattttt gctgtataat tttccagact 46321 ttttaatgta tatgaatact ggtaaatgtg tttactaaaa ttggatcaca ctgtctgtac 46381 atctctgtca tataactcct aacagatgca aaactagtcc actgaataga ttaacttcat 46441 cagtttaacc agtcccttac tgatagatag atagataagc tttttcgaat gttaaaccct 46501 tataaaagat gctgcactaa ccattcctgt tcagacatct tttgctgaag tgatttctac 46561 ctactgatca gtgtgaattt tgtagggatg caatgatgta gactaggttt gaactctggc 46621 tctagaactt actgtgtgtc tttacacaaa tcacaacttt tctgcactct agtttcttca 46681 tcttcaagtg ggaataatca tactgatagc ctagaggtgt taaaaggatt aaatgaaata 46741 atacatgtaa aatagttggt taatacctag catgtagcaa atcctaaata atattagtta 46801 ttattattac cacaatgctg aagaaaactg gctcttcaaa ataatgccca tgattttctg 46861 ttactcttta aaagagacta gccagaattt tacaaagaag gttttagatg caaaacttca 46921 gtagctaaat taatatttcc ttttttctaa cttaatctat cgagaagaaa agtaataaat 46981 ttaaatcatg tgtggtctgt tgagctagtt actatgttct atctatataa cttaatgcca 47041 gtcaaaattt aaaaggcata gcttttcatc aaacattctt gtcttagtct gttttctgtt 47101 gcctataata gaatatctaa aactgcataa tttataaaga aaaggaattt atttcatatg 47161 gttatggagg ctaaaaagcc caaggtcaag ggactgcatc tggtgaggac cctctagctg 47221 gtggggattc tctgcaaact tcctagacag cacagagcat cacagggtga gggggctgag 47281 catcctagct tgggtctctc ttcctcttct tatgaagcca ccagtcccac tcccatgata 47341 acccattaat ctattgaccc attaatccat gatggattag tcgattcatg agggcagagc 47401 ctttatgacc catcacctct taaaagcccc aagaccccac ctctcaatac cgccacattg 47461 gagattaaat ttcaacatga gttttggagg gggcaaatat tcaaaccata ctcatttgta 47521 agtaaaaaat caactgtttt gtgataaagt tgctcacctt tctttcccct ttggaaaatg 47581 ctcaaagcat ggcttacccc ataaaatcag cagaatatct ttacagtcat ctatccagcc 47641 tcccggtgcc ttcaagctat tctccactgc ttttctgttt tatccatggg cccccagcta 47701 tttcatattc ctcagagggg tgggaaagag cacagaagct gaagttttca cctctgagac 47761 tttgctttta aactgaaaag atatccattg tcacactggg attcatttct atctgaagct 47821 ttggaaacac atcacgactt ttaccaaagc ctatacagtg aatctaattt atgtgtcacc 47881 tttatttcac caatagttgc tttaaaaata agggaacact cacccccttt gaaaagctgg 47941 gtggaaggtt cagtgttgta tggcttatct cggaagtcag gggtcagtga ccaaaattca 48001 cggtgaccag aagcattcca aaatgctgcc agtaactcca tgcttactgt ttcttcctct 48061 tctgctctac tgaacattgt ggacatagac actcaggatt gttggggtgg atgatactgg 48121 ttaaatgagt ccttcttgga cacagctttg gttgaatgtg agcaccaaac actgagaagt 48181 ccttgtctat aactgatgtg cagaatgacc catacgtttt ttggtttggg attctttttt 48241 tttttctcta ctgtgtgtta tccttttctc taatagtggc cagtcacatt ctaccagtct 48301 gtatttgtct cacggttgtt gaagtggctt tgccaactta gctaagctag aaacatagtt 48361 tcctggaatt gccttcccaa tgtggttctg agttagagtg gaccaggaga aaaatttggg 48421 cagggcttgg gaagtgggag tgaagcaata gatactatat tctgaagttt ggtaggacag 48481 gcactattac cactgccata cactggcact gatcaactgg cacacctcgt tggtgtgggg 48541 cagcaacagg acctgcagca actccaactc tgacaagatc tccttcttca acttctccaa 48601 gtcctaagtc aggtttgtgt gcacttccac gactatggat gccagctctt tctttaggtt 48661 ggaggcaata agagacagat gcaggttcca gtaagtcttc atgggattta gtttgtccta 48721 gggattcaag ttaatccttg caggttccag tttgtccttg ctttccccca caccacatac 48781 acctcttctt cccaactgtg ggccctgaat atctacagtg acctcaggct gctataagtg 48841 cagagacaat agccttctat agacttcttc ctctgtgtaa ggtacatgga tgttccttgg 48901 tcaaggaata ggccaaggtg gatatccagg cctgcatgac tcagtgagtt tggcatgcag 48961 gcacacacct ccacttgtta tataacctgt ttgtgtaagt ttatacttcg ctctaagcca 49021 ctattgtctg taaaaggtat aactgccctg ctaacgttct acaggggctc ttgggactct 49081 ggttagctca acatggctta acatggtggg cacgctggtg cccagagaaa gaaagagaga 49141 gagccaaagc tgtccgtctt gcagatggac aggagggagc cagcacacag cttggcttgc 49201 tcatgcccag agagaaaaag agttaagccg ctgaccctga aggcaaggaa gagccagatg 49261 cacagctgtt tgtgggagcc actggctcaa gcagctgcga cagggcaaac agtgtgagag 49321 agctagtgtg agaaagctgt taataaaagc tgctgctgaa taaaaccaca ttcacctgcc 49381 ttcagccccg agtgttcttt ctgctcatcc accaactccc tctggacttc agcatgggct 49441 ggacccggac cccgggacct gacaactggt gatgagaatg ggatgaggtg agttggcccc 49501 agtccctgag ggctcccagg ttggctttgt ggccacagca tgggctgtga tacccggtgg 49561 cagctgtgct gcgaagatgg gctccagtgg aaacatggga ggcagtgggt gggtctcctg 49621 tgagtatgga gaaggcactg aagcacctgg aagtgcacag cactgagaag aagcatgcct 49681 ttgctggcag agtcagatga gcgtttgtaa ctgtgctgtg ggaagttcat gcccagtcct 49741 tgtggaactc agtgcagtga ggaggaagaa cctccatggc aggcttgccc agtgatccac 49801 cacaaaatag atcatgagca gctactgggc cccacgagta ggcccagaga ccccctgctg 49861 tggtggagca cccttccttt ggtgcctgtg tccctgctga gttgagggag ttaagcaagt 49921 aatgtcggca gtcatgcaca gacttagtgc aagccatttg ggagaaggac cttgctgcgc 49981 aacccagtcc tgctcgagca ttccagttca aagagtacct gctgcagttg gtggaagtgt 50041 aaagcctctt ctgtttgata agagaactgg ccgagatgcc cagcttggag ggcactggac 50101 aacgggagcc acatttggac ttcgcaatcc actggtccct gaacctggat aagtttccag 50161 gcagagctgc atttattgac aactatgaag actggtcagt gaaagtgaaa cctgtatctc 50221 tgcatcttgg catcgactgc ttggctctct gcttatgtgc tgtgtatgtc tctccatcca 50281 tacctgaaga cattctgggg tagatgtttt gcatggcttg gcagctgtgc catctatcac 50341 agacttaata gaccacttga caatggaact tggcagtgcc actgtgtggt ggacttggct 50401 aatgcattct tgtcaatcga caatgctcca gagagccagg aagagtttgc cttcatggga 50461 cagcgacaat ggactttcac agtgttgctg cagggctact gtatagcccc accatatgtt 50521 gtggttttgt taataatgtt atgttaacct ctgattctct tgcaggttta aaagtggcag 50581 tgtccctctt gcctgggatt ggggtgatga ggctgagaca gcctttctgg gtagtaaacc 50641 agggtgctca tttacactga atgtgcatgt aaccacagat aattttggct agggcctata 50701 gcagtgcatg gagcacttgg aagcaccagt gggctgttag tcccaactgt ggaagggagc 50761 tgagctccag tgcttactaa tagagaagca gttattaata gtgggatggg tgcattcatg 50821 ggtaatgacc ccctggacag aggcagcaca gacatcaact ttagcgaagt ggggtaccga 50881 cttgaaacag tgaagtatgc taagtaagta caaatccctt agcagcaaag ttgcaagagg 50941 tcttgggacc tgtagtccta atgcaagata aggccatggg gcctgaggca ctcctaaacc 51001 ctgagacttc atcattagga agggcatccc ctcattcctg atagggcatg gcacacagct 51061 aggtctagct ggggtgctac tgctgcctgg actgctggtg cggtccagcc tagtactaac 51121 accatatggt ttgaaaccag gtgtgggcaa agcagctaat aagctaaact cagggcagtg 51181 tgaatggtaa tcaccaagga tgtgacaacc tatggtaatc tgcgccaata gctgggcagt 51241 ttatcaaagc ttatgtatgt aacgggcttg agtgcccaaa agcttgtgta tgtattgggc 51301 ctgcatgccc aaagcttgtg tgtcaggctt atgtgtcaag cctgtgtata tatcaggcct 51361 gcgtgcccaa agtttatatg tcaggcctgt gtgcaaaact tgagtatcaa acctgtgtgc 51421 ccaagaccta agtctccctc agcctagggg gtggagtgta aggtacatgg atgtgctttg 51481 gtcaaggaat aagccgaggt ggatatccag gcctgcatga ctcagtgagt ttggcatgca 51541 ggtgcacacc tctgcttgtt atataacctg tttgtgtaag ttcacacttg gctctatact 51601 tggtataaag gtatatttgc cctgctaatg ctgtacaggg ctgttggggc tcagctcggc 51661 tcaacatggg ttaacatggt agtgggtgcg ctggtgccca gagaaagaga gagagccaaa 51721 actgtcagtc ttgcagatgg acaagaggga gccaggacac agcttggctt gctcatgccc 51781 agagagaaaa agagttaagc tgctgaccct gaaggcaagg gagaactggc tgcacagctg 51841 tgtgtgggag ccgctggctc aagcagccaa gacagggtgg ccagtgtgag agagccagtg 51901 tgagaaagct gttcataaaa gctgctgctg gccgggcgca gtggctcacg cctgtaatct 51961 cagcactttg ggaggccgag gccagcggat cacgaggtca ggagatcgag accatcccgg 52021 ctaacaaggt gaaaccccgt ctctactaaa aatacagaaa attagctggg cgaggtggca 52081 tgcgcctgta gtcccagcta ctcgggaggc tgaggcagga gaatggtgtg aacctgggag 52141 gcggagcttg cagtgagccg agatcgcgct actgcactcc agcctgggag acagaccgag 52201 actccgtctc aaaaaaaaaa aataaataaa taaaataaaa taaaataaag ctgctgctga 52261 ataaaaccat attcacctgc ctacagcccc ccgaatgttc tttctgctca tccacccact 52321 cctccggact tcagcatggg ttggacccgg accccaggac ctgacaccct gctcccacaa 52381 ttgcatatgg tctgattcct gtaatcatcc cttattgctt ttcactctgc ttctctgatc 52441 aaactctgca actatgttca agtgattctt attttacatc caaatttgca aatcagacac 52501 aggattatag acattacaaa aaaattcaat tttgaaaaaa aaaggaagag atttgtatat 52561 gacacttaag taacccatct attaccctcg ctttcttttt gatgtcaaac tcctttaaat 52621 ggcaccccaa gtttattgca aaagtatttt tcttcctccc attttctctg cagttccttg 52681 aatatgactc ctgcttttaa aatgctattt taattcatca gaaacttata aaacatcaaa 52741 tccagatctt agtccacttt atttctttgt actcttaaca ctgttgacta ctctcttttt 52801 tccctcatta aatttggttt caatgggtct caaatttctg tgacagattt ttggtcaagt 52861 tgtttccact aaaaagtgct gattttaaaa attaaataac ttaaaactac cagatgccaa 52921 aaaaaaaaaa gttcacaaaa cattctcctt tccttccaaa ggttttacaa tgcattgtta 52981 tcattaacca gtcttttacg actaaactta agtggccagt tgaaacaaac agttctgaga 53041 cccttccacc actgattaag actcaggcca ggcaccgtgg ctcatgcctg tactcccaac 53101 attttggaag gctaaggtgg gtagatcact tgagcccagg agtttgaaac cagcctgggc 53161 aatataatga gacctagcca ggcatggcgg cacatgtctg tagttccagc tacttgggag 53221 tctaaggcag gaggattgct tgaacccagg aggcagaggt cgcagtgagc agtgattgtg 53281 ccactacact ccagccaggg caacagagtg agaacctgtc tcaaaacaaa aaacaaaaaa 53341 caaaaaacaa aaacaccaga caacaacaac aacaacaaaa aagactgggg tggcatgtat 53401 tagggataat attcatttag ctttctgagc tttctggaca gacttggtga ccttgccagc 53461 tctagccgcc ttcttgtcct ctgaacccat ggcaactgtc tgtctcagcg aatttggcat 53521 gcaggtgcac acctccactt gttatataac ctatataacc tgttgtataa gttcatactt 53581 ggttctatcc ttggtataaa ggtatatctg cccgcaaaat gtcccagaag agaatagtca 53641 ggatagtcag aatattcaga gaagctctca acacacatgg gcttgttagg aactatatca 53701 gtcatggcag caccaccaga tttcaagaat ttagggccat gttctagctt cttaccagaa 53761 tggtcacctt ttctttctgc tcagaacact tgtaaggaat ttgagatgtg tgacaatcta 53821 gtgcaggggc atagccagca ctgatttggc ctagatggtt caggataatc acctgagcag 53881 tgaagccaga tgcttccatt ggtgggtcat ttttgctgtc accagcaaca ttgccatgat 53941 gaacatcttt ggcagatgcg ttcttttttt ttttcttaga aagaatgtaa tgcatgtttt 54001 taatcagaac aacagcaata acaaaagcta agtatggata tgccaatgta gtgttgaatc 54061 cagcaatgga cacaaataaa gagcaaatgc ttgacagaga cagctgtaaa taatctatgt 54121 acctcttaca tccccccact tcataaaaaa gacattctct ttggaaatat ggttacaaat 54181 gtaaatcact gatttcttgc aagaccacga tgattctgta ataaatatca aattgttcca 54241 tttcagattt gtaaaaggat tctttcagat atgaagactt gggagcaact gtgaagctgt 54301 ctttattcag atgctttgaa atataaagtg gtgatcttaa accaccaaga actgtaaggc 54361 atgctaactg atgtttataa caacttagcc ttcttttcct ttatatccga ttttattaag 54421 tgggcaaaca ccttgttcac attgaggccc acattgtccc caggaagaga ttcactcaaa 54481 gcttcatggt gcatttcaat agactttact tcaattctaa cattgactgg agcaaagctg 54541 accaccatgc caggtttgag aacaccagtc tccgctcggc ccacaggtac agtaccaata 54601 ccaacaatct tgtagacatc ctggagacac agacacaagg gctcataagt tggatgagtt 54661 aatggtagaa taaagtccag agcctcaagc agcgtggttc cactggcatt gctatcttta 54721 taggtgactt tccattccta gaaccaaggc atgttaccac ttggctccag catgttgtca 54781 ccatgccaac cagaaattgg cacaaatgtt actgtgtcag ggttgtagcc aattttctta 54841 atgtaagtgc tgatttcctt gtatctcatc tggtggtagg gtgactaagt ggaatccatt 54901 ttgttaacac caacaagtag ttgtttcaca cccagtgtgt aagccagaag ggcatgctcc 54961 tgggtctgcc cattcttgga gatgccaact tcaaattcac caacaccagg agcaatgatc 55021 aagagagcac agtcagcttg agatgtgcct gtgctcgttt tttttttgtt tgtttgtttt 55081 gtttttttga ggcggggtca tgctctgtcc cccaggccag agtgcagtgg tacaatcttg 55141 gctcactgca acctccacct cccgggttcc ggtgattctc ctgcctcaac ctcccgagta 55201 gctgggatta cagacgcatg ccaccacgcc cagctaattt ttgtattttc agtaaagatg 55261 gggtttcacc atgttggcca ggctggtctc gaactcttga cctcaagtga tccgcccgcc 55321 tcggcctccc aaagtgctgg gattacaggc gtgagccact gtgctcggcc tgtaatagtg 55381 tttttgatga ggactctgta tcctggggca tcaatgataa tcatgtagta cttgctggtc 55441 tcaaatttcc acagggaaat atcagtggtg ataccattca gctttcagtt tatccaagac 55501 tcaggcatac ttgaaggagc cctttcctat ctcagcagcc ttcttctcaa tttttcagtg 55561 gttcttttgt tgatgccact gcatttgtag atcagatggc cagtagtagt ggacttgcct 55621 gaatctacat gtccaatgac gacaatattg atatgagttt ttcctttccc attttggttt 55681 ttaggggtgg ttttcaagac aacctgtgtt ggcagcaaac ctgttgcaga aaagctactt 55741 tctcttttct gaatttctct cttcccttgg tttctatgac aagcactctt tggaattttc 55801 tcctatctct ccaactgctg ctccctgtct tagtatgctc acttctcttc tacccaatcg 55861 tcaaggtgta ggagttggtt gctcaaagtc cagttcttgg cactcttgaa ttctcactct 55921 accctctcta cctttatgac cccctccact ctaaaggatt caaccaccat cccttcacac 55981 tcatgattta tagatctatg cttttatctc agctttctct cctgaatgcc aaggtcatga 56041 ttcttatagt ccatttagtt gcagatgact ctcaaattca acaagcacaa actgacctat 56101 ttatttacca tgacatgttc catgtcccct aacttggata atggcattac aattcttcag 56161 tcactcgttc tcaaaacttc agagttgggg agtgatgtca ccaaagatgg agtagaagca 56221 atctggattc actccccctg cccactgaaa accaaaaaca actatccggt gccacaatta 56281 tcaccagcaa tatcccagaa ctcaaaatca aagctgtgac aatccctgag gccacagaga 56341 agtaaaaaac tatgagctga aagtgagaga aatggacttc tctatccaca atgcccctcc 56401 cccaaactac tagacaccac atggaaaaat cctccgagac tcatggtttc tacattggaa 56461 aaagtgagat caaggtggaa agccagcttc cccatcatct tgggttccta tgcaagaaag 56521 ctgttcctgc ctcaactcac aggaagcatc acaagtgcct ccacctaagg acaggtggag 56581 acaaaccttg gaggtggagc tgcatggccc agcaccagaa actcgggggg ctgctctcca 56641 ctctagaaaa aggggacacc aaatcagaga ggtggtttag cagcaccaca ctgtaggagg 56701 caccctgcag ggaacctctg ggcatgaacc cgtagccagc ctccccacac agctgaggag 56761 tcccctttgg aatctccccc tgtctgtaat gggcagcact tggagagcta cagaaaacct 56821 gtgtttaggg cgccatctag tgctcaaaga aggcagcaat ctagggctaa gggaatcaac 56881 gggcaaatcg cacagaacct ctaaacacac aaaacaactc agacaggaaa gacttgaaaa 56941 aataaccaat tctttaatgc gaagacatag atctacatac ataagaaata acagcaaaca 57001 gtgaataatg acctccccaa atggacaaag caaggaacca ctgatcaacc ccagtgagag 57061 agcaacatgt gagctctcag atagcaaagt cacaggagtc agatcaagaa ttcaaaatag 57121 cagttttgaa gatactcagc aatctccaag ttaatacaga aacgcaactt agaaatttat 57181 cagagaaatt taatgaagaa atggaaataa gaaaaaatca aacagatacc tggaactgag 57241 aaatacattg gctgaactga acaactcatc acagaggtat caacagcaga actgatcaag 57301 cagaagaaag aaatagtgag ctcaaagata gtctatttga aaatacacag tcagagaaga 57361 aaaaaagaaa aaagaaggaa aaggaatgaa gaatatctac aagaactaga aaataacttc 57421 aaaagagcaa atctgagtca ttgaccctca agaggaaatt gataaagaaa agggggtcat 57481 tagcttattc aaagaaataa cagaaaactt tccaaaccta gagaaagata taaatatcca 57541 gatacaggaa agtcaaagat taccaaacag attaaaccca aataggacta taccaaaata 57601 tgtattaagc tctcagaggt caaggacaaa gacaagattc taaaagcagc aagagaaata 57661 aaagcaaata acacacaaag gactttgatt tgtctggcaa cagactctca gcagaaacga 57721 tacaggacag aagagagtga gacgacatat tcaaagtgct caaggcaaaa actgagaata 57781 atatacccaa caaagctatc cttcaaacat gtaggagaga taaagacttt ctaagacaaa 57841 caaaaggtga atgaattcat catcaccaga cctgtcttgc aagaaaatgc tagagagagt 57901 tcttcaatct gaaagaaaag gatacgaatg tgcaataata aagtcattta ggctacgcat 57961 agaggcccac acctgtaatc ctagtgcttt ggggggcact aggcacagaa agacaaacat 58021 cgtattttct tacttatttg tgggatctaa agatcaaaac agttgaactc atggacatag 58081 tagaaggttg attaccagag gctgggaagg gtagtggggg ataggtaggg gagttgggga 58141 tgattaatgg gtaccaaaaa atagttagaa tgaatgacac ctactatttg atagcacaac 58201 agggtgtcta tagtcaataa taatttaatt gtacatttta aaataactaa aatagtataa 58261 ttggattgtt tgtaatacaa agaatgatca cttgaggggg tggatacccc attctccatg 58321 atgtgattat tatacattgc atgcctgtat ccacacatct catgtaccca taaatatata 58381 tacctactat atacccacaa gaattaaaaa taaaattttt ttaaaaagta gaaagacttc 58441 aaataaataa cccaatgatg cacctccagg aactagaaaa gcaagaacaa atcaaaccct 58501 aaattagtag aaagaatgta acaaggctgg gtgtggtgac acctgtaatc acagcacttt 58561 gggagactga ggcaggatga ttgcccgaca cccacctggg caacaaagtg agactgtgtc 58621 tacaaaaaaa gtttgaaaat tagccaggaa tgatggcatg tgcctatagt cccagctact 58681 taggcagctg aggcaggagg aatgtttaag cccaggtggt tgaggctgca gtgagccatg 58741 atcgcaccac tgcactccag cctggggagc agagcaagat tctgtcagaa agaaaaagaa 58801 aagaaaagag aaaaagaaaa taataaagat cagcacagac ataaatgaaa tcgagattat 58861 aaaaaaatac aaaagatcaa tgaaacaagt agttttccaa agagataaac aagcaaaact 58921 gacaaatctt taggtagacc aagaaaaaag agagaagacc caaagaaata aaatctgaaa 58981 tgaaaaggag acattaaaac taataccaca gaaatatcac agaaatataa aggatcatga 59041 gagaccatta tgaccaacta tataccaaca aactggaaaa cctataagaa atggttaaat 59101 tcttggacac atacaaacta taagaaatag aaagcctaaa cagatcagta acaagtaatg 59161 agatcaaagc agtaataaaa agtctctgat caaagaaaag cccaggagca gatggcttcc 59221 acaatgaatt ctaccaaaca ttttaagaat ataaactcta ttcaaattat tccccaaaaa 59281 attgaacagt agggaatact tccaagttca ttgtatgtgg ccagcattac cctgacagga 59341 aaacaagaca aaagatacaa caacaataac aacaacaaca acaacaaaac tacaggccaa 59401 tattcctgat gaacatagat gtaaaaattc ccaataaaat actagcaaac caagttcaac 59461 atcacattaa aaagatcatt caccatgatc aagtgggatt tgccacaagg atgtaaaatg 59521 gttcaccata cacaaatcaa taaatgtgtt atatcacatt aacaaaacta aggacatttt 59581 tgtccatgat gatttcaata gatgctgaaa aaagcattca gtaaaattca gtatcttttc 59641 atgacaaaaa ctcaacaaac tgggtataga aggaacatat ctcaaaacca taaaggccat 59701 atatgacaaa cccacagcta acatcatgct gaatggggaa aaattggtag tctttctggg 59761 gaaaaattgg cagtctttcc tctaagaaat gaaagaagat aaggatgacc aattttacta 59821 tttttattaa acatagtcct gaaagtccta gccaaagtaa ttgggcaaga gaaagaccta 59881 aagggcatcc aatttggaaa gaatgaaatt aaattatcct tgttcattga tgacatgatc 59941 ttatatctag aaaaacctaa agactctacc aaaaaattat tagaactgat aatcaaattc 60001 agtaaagttg caagatacat aatcaacata caaaaatcag tagcatttgt atatgctaac 60061 agcaaacaat tggaaaaata aatcaaaaaa gcaatcccat ttataattgc tacaaaaaaa 60121 tacctaggaa taaatttaac cacagaagtg aaagatctct tacaaagaaa actatgaaac 60181 actgatgaaa taaattgaag aggatacaaa aaaatagata tctcatgctt atggattgga 60241 agaattaata ctgttaaaat atccatacta tccaaagtga tctacagtgg tctcttcact 60301 ctattgattg tttcccttgc tacccataca atatgataca aaatgatcta cagattgcaa 60361 tccctatcaa agtgccaaca acattcttga tggaaataga aaaaacaatc ctaaaattca 60421 tatgaaacca caaaagaccc caaatagcca cagcaatcct gagaaaaaaa gaacaaagct 60481 ggaggcatca tactacctga cttcaaaata tgctataaag ccatagtaac cccaaacagc 60541 atagtactgg cataaacgca ggcacataga ccaatggaac agaataaaga acacagaaat 60601 aaatccacac atttacagcc aactcatttt tgacaaagat gccaagaaca tatattggga 60661 aagaacagtt tcttcaataa atgatgctgg ggaaactgga taaccatatg cagaagaata 60721 aaactagacc cctgactctc actatataca aaaatcaaat caaaatggat taaagaaagc 60781 caggcatggt ggtgtgtgcc tgtagtccca ctacttgaga ggctgaggca gaaggactga 60841 ttgcttgagc ccagaagttc aagtccagcc tgggcaacat gatgagaccc cctatctctt 60901 aaataaagac ttaaatgtaa gacctgaaac tataaaacta ctggaagaaa atgctgagga 60961 aatgcttcaa gacattggtc ttggcagaga tattttgtgt aagacctcta aagcacaggc 61021 aacaaaagca aaaatagaca aatggcatta catccaggta aaaagcttct gcacagcaag 61081 ggaaacaatc aatagagtga agagacaacc tatggaatga gagaaaatat tttcaaacca 61141 tatattaaat aaagggtgaa tatgcaaaat agaaaaggaa ctcaaataac tcaacagcaa 61201 aaaataataa tctgattaaa aatgggcaag tgatctaaat agacatttct ttttctttta 61261 gatttgccgt taacaatgaa tatatatttc ttaaaagaag acatacaagt agccaacaga 61321 tatatgaaaa atgctcaaca tcactaatca gggaaatgca aatcaaaacc acaatgagat 61381 atcatctcac cccaatttaa atggctatta tattcaacct gagaaaactg ataaaaatgg 61441 ctattataag acaaaaataa caattgctgg taaggacgca gagaaaggag aactctcata 61501 cactgcagat ggaaatttaa attagtacag ccattaatga aaatagtatg aagtttcctc 61561 aaaaaaacaa aaaaactacc atatgatcca acaatcccac tgttgtgtat atattcacaa 61621 gaaaggaaat caatatattg aagagatatc tgtactctca tatttattgc agcactatgc 61681 ataatagcca agatatggat tcaacctaag tgtccagcaa gaaatgaata aagaaaatgt 61741 ggtatatata ccacagtgaa atattattca gccataaaaa tgaaataaat actttcattt 61801 gcagaaacat ggatagaact gaagggcatt gtgttatgtg aaataaacca ggcacagaaa 61861 gacaaatatt gcatgttctc actcatatat gaaaacgaac aaacaaacaa aaaaaggacc 61921 tcatggaggt atagaataaa atagtgatta ccagaggctg tgaagggtag agggagggag 61981 gatgaagaga agttggttaa tgggtacaaa actacagtta gaagcaaaaa gttctagtgt 62041 ttgatagcac agtagagtga ctatagttaa caataattta ttgtatatct caaaatagct 62101 acaagagaag atttggaatg ttttcaacac aagtaaataa taaatatttg aggtgatgga 62161 taccccaatg acccttattt gatcattata cattgtatac atgtatcaaa atatatgtac 62221 ccccagaaat atatgtacaa ccattatata ttaattaaaa acttgagtta ttcaatactt 62281 ttccctctaa tcagactcct aactttagct tccacctctt tttctacccc ctaaagcttg 62341 tgtttccttc ccaaaacatg aattggctta tgtaaattct ttgttcaaaa tatctcctgg 62401 ttcccagtgg catatagaat acatatctgc cagttttgta cagcatagcc acattttgat 62461 ttcttccctt tcaactgtat gtcctactat actccaatgt ccctttcatc acacacacag 62521 tctgtggatc tccctcagca ctcccccatc ttctggatat atctggatgc tcctggccca 62581 ggggcttggc tcagaccttt tctgtttgag ttatgtacta tattgaatgc ctaaccatac 62641 tccctatcca gtttcagttt tcactcctag ctgatgactt ggataagagt ctcttgagtt 62701 agaaatagtt tttatcaata acatgggagt aacaatgcat ataatagcaa tagctcacat 62761 gaatcaagct cttaattact gtgccaggca ctgttaatga caaaaatgat aaatgtcctc 62821 caggataaag ggacctaaga gacagtttga tcaaatgtcc ttacaccctc aaagagggtt 62881 tcctgatata acaggtccag cctctgtcta cttgcccttt tctattccaa cttgtcagga 62941 gaagaggggc cataccctgt taagtccagt gagtagaatc acttcattca tcaaggcggg 63001 acgtggcttg tttatgcatg aggtagttat tccactctgg tttaatttct tttaaaatgg 63061 ggatgatgac aattgtggtt gttgtacaga taaaaactca tgtacagtgt caagcatagg 63121 acccatggtt aagagctatt aatattcaga atattaacac atcacagggt acctgtaaaa 63181 cacttcttaa agaactacag tgccttcaac ctagcctggc tcttatttat ttgacacttc 63241 agtggggaac aagaattcaa ttttggccag gtgtggtggc tcacgcctgt aatcctagca 63301 ctttgggagg ccgaggcagg tggatcactt gaggtcaaga ttcaagacca gcctggccaa 63361 caaggcaaaa ccacgtctct actaaaaata taaaaattag ctgggcatgg tggtgcacat 63421 ctgtagtccc agctattcag gaggcttagg cacaagaatt gctctagccc aggaagtgaa 63481 ggctgcagtg agctgagatc gtgccactgc actccagcca aggtgacaga atgagaccct 63541 gtctcaaaaa aaaaaaaaaa agaattcatt tcaggaagat agttctgaga ggcgaaagaa 63601 gaggagaagc agggaggagt gggaggatga ggtagcagca gtacccacag tgatgcagga 63661 gctcctggat gggaacctgg agggcagtgg tgtttttatt gcctcttcca tgattagaaa 63721 ataagttttg aactttgggg gctttaagga gtggaaatga gaaaagtaaa tacagaggct 63781 gttagccctt cttaagggag aatactactg tcttctgtgt atcttcatgt tgggactcag 63841 tctgagctgg aagtattcaa attaatgcac tgaggagctg gattatacaa aaattagctg 63901 ggcatggtgg catgcacctg taatcccagc tacttgggag gctaaggcag gagaattgct 63961 tgaacatggg agtcagaggt tatagtgagc tgagatcacg ccactgcact ccagcatggg 64021 cgacagagca agactccgtc tctaaataaa taaataaata gggccaggag cggtggctca 64081 tgcctgtaat cccagcactt tgggaggccg agcaggcgga tcacgaggtc aggagataga 64141 gaccatcctg gctaacacgg tgaaaccctg tctctactaa aaatacaaaa taattagctg 64201 ggcgtggtga caagcaccta tagtcccagc tactcgggag ttctgaggca ggagaatggc 64261 gtgaacctgg gaggcagagc ttgcagtgag cggagatcgc gacactgcac tccagcctgg 64321 gcgacagagc aagactccct ctcaaaaata aataaataaa taaataaata gggttcaatt 64381 gtacttaact atttgaagta tttaaaacca agcataacct ctgagctaaa atgagtgatt 64441 ttgtaaatat ttaggcagtg gtgtgaggaa gtaattcacc cccaaaacac actaatgtac 64501 atttagttat aatgagactg aagcagggag atgaaggcca agtccttctg ctgttgttct 64561 taagcctttg gtaaagcaac atagggagtg gcatctttag atttgtctaa aaaaaacaaa 64621 gaacagttac aaataaaatt ccatgcacat gtatcccaga acttaaagta atatatatat 64681 atggcaaaaa accacaaaaa caaataaaat tccagacaga aagacgatgc aaatattaga 64741 gaattataca caaagtatgc cttgcagctg ccacacaaac tcaacattat gaaactatca 64801 aatgcacaaa tgaacactct cttgaggaat ttctaatatc aaaggaattg ttttcttccc 64861 ttttggtggt acaattatga agaggtgggt tttgcagaat ttgtttagtg gtggtagggt 64921 ggctgtcaat tcagggttga ttcaaacctc tgttaatttg aaatactctt ccattttctc 64981 cagtagtgaa gacttatttg taggtcatga gatcaaaagg tgactacctg agaccgtgga 65041 aaacctaaag ttgtgtcaaa gaactattgc cagggaaaaa acatcaagat gagtttattg 65101 ttagttttgc tgcagagaga gagagagaga gtgtgtgtgt gcatgcagta atggaaacat 65161 gtgcagatct gatggtgggt tgtccaaaac aagcagcccc tgagtgctgt catcctaaca 65221 ccataggtag ggctcacagt ccgtcttggc tgactcccac agtcaggttg ggaggatgaa 65281 agaaacctcc actatctcag gaggcatgga aagctcagag gaaacgaaag acaaggtttg 65341 atgagctgct tttctggttt tcatagtggt ctcttaatag aattctgaaa aggaaaagga 65401 gccacaagaa aaaagagaaa gaaaatgttc tcatctattt ccaaattcta aggcatgacc 65461 ttcccccaag aaactcttgg tagatgtttt ctccctttct cttatttgca aatataatag 65521 aactgtttta tgtaatcagt actgggaata gggaactaat attggcccat aaaacagcct 65581 actttgggga tcggctattt agggagattt gataatagat ttgaacatgc ccaaaatgag 65641 ctctattaca ctagccttct ttagactata ttcatgcaga gaaaggatcc taaatcctaa 65701 aataggagag tgaaagtgag gagtcagaca gggagaaaag gcctgtagtg tcacaatttg 65761 cagattggga acttccaaaa ggattaaata caatgttaac tccattaact aggatgctga 65821 gtagacgtaa agcccacctg acatggttgt tttgttctgt tttttagtag ctgccaggca 65881 tttaagaaaa agccttttct gtgacccccc acacccacat tgatgttgac cttgcattaa 65941 aatgactagt tctgttatgg gatctttgga gtgtcaattt tctggccaga aacctctgtg 66001 gctggtgaca tctttgccct agttcttgtc ctgtgtccag gaagaatgag gtatgcgaca 66061 agtgaagagt gaacaagaca aagaggcgcc tgagttcttg tcctgcatcc aggaagaatg 66121 aggtatgcga acaagtgaag ggtggacaag gaggaagagg agctttatta agtgttagaa 66181 cagctcagag gagaaccaca gtgagtagct cctctctgta agcaggtaat cccatctctc 66241 catcctctgc cctgctctgg ctgagcccag aacttctagg gacctcagag gggaggaagt 66301 gtgtgctgat tgttccatgg gcagccatgg gtgggcccag aaaaggcacc acaggtcctc 66361 actccagtcg tcaggactgg ccgcctggcc tgaaggtggg gtcttactga ggacctgccc 66421 tcttccaccc aggagcctgt ctgcctccca cagcagtcca tggcgcccag gctgcttgca 66481 ccaaggggca cttgcaggcc agtgctgagc tgccctcagc cccccttagc ttcctctctc 66541 atgttcgtgg gcactcaaat tctggagggg gccaaggcag cagggcactg gcatgtcagc 66601 actgccctga gcctacacac atcctccggg ctgtgacagc acctccggct cggccccaac 66661 tcctctctga gatgagatca gagcaggtgc ctgagggaaa agagaacagg cagtgggagc 66721 agacacacct gagcctgtgg tggtggggct tcctggcccc caaggatgca ggctgcagag 66781 atgcccgtgt cctgcacctg ggagggtagc tacagatgca cctgaggagc tcctgcccca 66841 ccaacttgga aggggtgggg ctcccacttg tccctggctc ctgcccgctc tgtggagcca 66901 gaggcctggg tctgcagcca cggattgggc agttatagtt gcactcagga ggacagggat 66961 cctgcctgct cctggctccc tcgaagagca cagggaggct cagatctgca gcccagtagg 67021 gtcggggctc ctgcctgatc catggagctg gaggcccagg tctatagctg cagtttaggc 67081 gactgcagtg gcacccggga agctccctcc ccaacctgga aggggtgggg ctcccaccag 67141 ctccatggaa tgtgcagccc cagccatgcc tccctgctgc agccagcatg attgcagcag 67201 ccgctgccat cagttctact ttcctgcctt agaaattggt gatggtatag caggaatttt 67261 aaggaatcag agagactgat ggggttcagg gggatattta ttaattattt aggtgcacca 67321 gcccagttgg attaacatcc aaaggactga gccccgaaca aagagttaag ttacctttta 67381 agcatttcat ggggtcgggg gagatctgtg cagggggaag cgagaaacaa aggcagttat 67441 tcaattgaga catgcattac ttcatttctt actttttaag gaaaaacatg ttttgtgaat 67501 tgagtttatc tgtctagtga ccttgcagct gcacagctag ggaatcagag tcttcacaat 67561 gcctgggaag ggaggagaga taaggctcac tagccacaga aaaataggca gttagttttt 67621 aaaggactct agctctttct ctttctcaga gggaattggg ttttcttacg tacaactgaa 67681 tttctgctta catactcttt aatttctttt aattcctgtt ccaatgggaa tgagatctcc 67741 tagttttgga ctggggctag cagggcctcg ggtgcaccac cacttggcaa catcagcctc 67801 tgtccagaat cctgaacctg taagctagaa cagggttagc tgacagtgtc ctgaacccaa 67861 gaaggttggg agggtggggt tggattggag tatgcaggat tgacaatggt atacctcttc 67921 tccaaactac tggctgggca gaaatcttga gttctaatga atctgaacct agttaggcta 67981 acctttcctt agcacctttc cttagcagta aaggctaaaa tatcaagtca gtatccaggg 68041 cttcagcaac tcttgtgtcc cagtaagaca agccacaaat acctttggcc ccattatcag 68101 gcctccattg ccataaaatc tttttaactc tcaagaggtt acagagaatg aattactcct 68161 taatcatact tgaagcaaac cttctgtcta aaaatccaac gtagccaggc atagtgactg 68221 gagcctgtga tcccagacac ttaggaggct gaggtgaaag aattgcttga gaccaggagt 68281 ttgacaccag cctgagcaac atagtgactc ccatctctaa ataaataaat agacagatag 68341 ataagtaaat acatagacaa attgatagaa tgatagatag ataagtgagt aataaataga 68401 ttaggtagag atagatagcc agctaggtaa ctagccaacc agctgtctca aaaaaaaaaa 68461 tggctggctg tggtggttta tgtctctaat cccagcactt tgggaggctg aggcaagagg 68521 ctctcttgag tccaggagtt caagacgttt gggcagcata gtgacaccct gtctctaaaa 68581 aaaattaaaa attagctgtg tgtggtggtg tgcacctgta gttccagcta ccctggaggc 68641 tgaggtagga gatcaaggct gcagtgagcc atgactgtac cactgcactg cagcctggac 68701 gacagagcaa gatcctgtgt caagaaaaaa aaaaaaagaa acggtttttt ccctctactt 68761 tctcacagtt aatacaatac tcagcacaaa acacttttga catcagattc tccagtgaac 68821 accaactggg tgtcttatga tttaattcaa ttctgacact atctacctgg agatagtgtc 68881 agatcctatg ggttaacggc tcagtcccat gagattgtgg cccccatttc agatgccagt 68941 cacaagcact agattgtcac ttatacttct gactccctgg gtataaattg cagggtccta 69001 caaccccctc ctcaggttca attaatttgt tagagtggct cacaagactc agaaacactt 69061 acttacattt actagtttat tataaaggat acagaataac agcaaaatgg aaaagatgca 69121 tagggcaagc tatcaggagg agggtgcaga gcttctatac tctctctagg tgtataacct 69181 tccaggcacc tctgtgtgtt cagcaatctg gaagctcatc aaatctcatt gttcaagagt 69241 ttttatagtt ctggtgttct atggcacagt agggtgacta tagtaaacaa caaggtgttg 69301 tagatctcaa aatagctaga agagaggatt tatatgttcc tatcacaaag aaataataca 69361 tgtttgaggt gttggatatg ctaattactc tgatcatttg atcattacac aatgtataca 69421 tgtatcaaaa caaaatcaca ttaggctgcg tgcagtggct catacctgta gtcctgacac 69481 tttgtgagac caaggcagga ggatcgcttg aggccaggag tttgagacaa gcctaggcat 69541 catagtgata ccccatctct acaaaaaaat tttaaaaatt agttgggcat ggtggtgtgt 69601 acctgtagtc ctaactactt agaaggctgg ggcaggagaa tcccttgagc tcaggagtcc 69661 aaggctgctg tgagttataa tcaagcgact atactccagt gagggcaaca gagtggagac 69721 cctatctccc aaacacacac acacacacac acacacacac acacacacac acacaccaca 69781 ttatacccca taaatgtgta caaaatcatg tcaatgaaaa atacttcatt tttatgtttt 69841 attattaata tttttagaga cagggtatca ctctgtgacc caggcttcag tgcagtgacc 69901 tgatcatagt tcactgcagc tgtgaactcc tggggtcaag caatcctccc acctcagcct 69961 cctgagtagc taggactacg ggtatgcacc atcatgcatg gataattaaa aaaaattttt 70021 tttgtagaga tggggtcttg ctaggttgac caggctgaac ttgaaatcct gccctcaagc 70081 aatacttcta ccttagcctc ccaaagtgct ggaattacag gcatgagcca ctatgcctag 70141 aaaaaaaatt tattaaaaag agtttttata gaactcatcc cttagccctg ttttcccgcc 70201 tttcctggat gttggtgggt gaggctgaaa gttccaactt gccaatcctc taatcctcta 70261 ttacctgttt tttcaggtga ctggccctat cctgggctac ctgggagtca cactctgagt 70321 tatatcatta gcataaactc agatgtaatc caaagggaca cattataaat aagaagataa 70381 tactatttct caggaaattt caagggtttt aggagctctg tgacgggaac cagggacaaa 70441 gacctaatat atttcttata atatcatagg actatttccc tcaatatgac tttaaaaagg 70501 ttcagttttt atgatgctca gcttccccca tgaccagggt gacactattt gtagtaaaga 70561 gggacagaac agctgacagc tgatagtcag ttgtcaggac actaatattt ctttattctg 70621 ttaccaagca tagcattaac agaaaggaca cagtgcaatt cattagctag gtttgcatcc 70681 aagacttgga cgaaaatgaa agagtattaa aagcgggtag aaactttgaa aggaaaataa 70741 atctcggaac ccccaaatca ctaagccaaa gggaaaaatc aagctggaaa ctgtatcgga 70801 caaacctgcc tctcattcta ttcctaaata agatagttac aaagatttgt ttaaaagcta 70861 catacccctg tcacaatttg cccataagga aaatcctcat ggacaaagga cagacagaac 70921 tcaaagtcat ccctctgctg ctgtgcaaca aatgcatatc tgattgcttc ctttgcccta 70981 ttgtttcact aagccagact aaggcataag tgactattcc cgtaaattgt gtattcagtg 71041 aaaggctaat cagaaactca aaagaatgca accatttgtc tcttatttac ctatgacctg 71101 gatgccccct ccctgcttcg agttgtccca cctttctgga ccaaaccaat gtatatctta 71161 catatattaa tcgatgtctc atatctccct aaaacgtata aaaccaagct gtatcccaac 71221 cactttgacc acatgttgtc aggacatacc gaggctgtgt cacaggcgca tccttaacct 71281 tggtggaagg aacatggctg cactgaagcc aggcagacat aggctgaggt aaacatcctg 71341 catgactcaa caagtttggt gtacaggcac ataaatccac gtgttatata atcacagaca 71401 tgtagccata acatgggaag gctcatcact cagctcagag ccactattgt ctgtaaaagg 71461 tacaactacc ctgttaatgc tgtacaggtg tgcccagaga aagagagagc cagagctgtc 71521 tgtcttgcag atggacagcg ggaagccagg acacagtttg gcttgcttgc acccagaggg 71581 aaagagttaa gctgctgacc ctgaagggag agctggccgt gtggctgcgc gtgggagcag 71641 gtacagcagt tagccaagac agagacagac agtgtaagag agctgctgaa taaaaccatc 71701 tttcacctgc ctacaaccca ccccctcccc tgccctgagt gttctttcag ctatctgcca 71761 cccatccacc cactcccttc tgacctcagc ataggctgga acctgaccct ggacctgaca 71821 cttggcaaaa taaattttct aaattgattg agactcatct cagatacttt ttggtttaca 71881 aattggtgac cacagaggga ctcagtggag gtggccctga cctttggcaa atctcctgtt 71941 ggtgcttggt accagcttca ggtatcttta tagctcaaac caacaggacc atttcctgag 72001 gcctgggagc tcccctccct tcagagaatc cctgatctcc caaaatttgg ttgagatcta 72061 aagtatattt tgctgtacaa ctccttttct ggagttttac ttgcttccaa caaggcaggc 72121 aagttttcct gcttccatga cgatggaagg caggtaactc ctttctggag tttgagcttg 72181 cttccaacaa ggaaggcaag tttgagtttc ttcccgcttc taggatggta gagagcagtc 72241 ttcagtctga gacccatttc taggtaaata actgaattgg gttttttttt gtgtgtcttg 72301 gaaattctcc ttaataacta aaggttaaaa ttgacaacca gctggtctta atttctcctt 72361 accattagaa cactcagcga tcatattgtt ggggtttttt gttattgttt cagtcgttct 72421 cccatcggat ttgaccaact ctacccaatt tggtcaaatc tgaatgagaa ttccaaatca 72481 tggggaaaaa ggcctctgaa ttggctaaaa ttccttgcag ctgaaaacaa aacaaaaaca 72541 tatgtttagt ttctgtgtct gcttcctgtc ttttctttca ccctattcct ccttcccctt 72601 tgccattgct gaccaagaaa aaaatctaga gaaggcttct aatgagtcaa acccctcaaa 72661 gaactcaaaa tgaaggcact attcatgcct ctttggagtg ttctgttttc tttgtggagt 72721 ttcaagagtc atgggcagat tctttgtagg tctaaagctc tgctctcctg tattgcctta 72781 cctgacctct ttggtttttg gcagtactag agattacctc gtactgtaag aggatttgac 72841 tttggcatgt gtaatggcaa atgagagcta caaagttaag agtgggtgag gacagtttac 72901 aggaagtggt ctcagctgag tatttttcct cctaggaaat tgtttaggat cataattcta 72961 gttcagaggt tcattctaaa gggtcttctt tgttgccttt cctcctgaaa ttaatctcaa 73021 ttggcttgtc tgcacatttg catgaggaac tgaactgttt tcatagacaa atgagagact 73081 gagtttcctc agctccgaaa tgaaagggca ttttgctcct cccagctgaa aggcacccct 73141 gggtgatgag tgggagcttt cttttctacc tgctttaagt ctgctgttac ttttctactg 73201 aaataaaatt cactgtttgc atccaaccat ttctttttgt tattgtttgc aaactggtga 73261 gtttgtatta ctatctcatg gccagagttc tgaattaaaa gctataggat ctttgtatga 73321 gtgtgcatat gtgtgtttat gtgtacatgc atgcatttta ttatgtgttt ttggtcacaa 73381 ggtaccaaat tggcttaaag ttaagtagta ctcataaatt aaataagccc aaatgctttt 73441 caagttcatg tgacttaaaa tatttaataa gctagcttta caattattgg taaaataata 73501 ttagaaatgt cttaagaatt gtcagcatac atttttgttt gcatttatga atcaagagat 73561 ttcatactta tccctgccaa atattataag gtgtcaaaat ttggcataat ggttaaacta 73621 taaacccagc ccaaaacaga atgatatttg cttgtgtaat tcttgataaa taagatgtta 73681 atactgtttt aatgtaaaca gctaaatttt ggattattta gtaagataac catatattta 73741 atcttaaggt tcttacttag gtaaacacct gaaattcaca ggctataaaa cgcttcacag 73801 ggaaataatt ttaaatgatg actatcacag ttttcataaa taatctaggt aaacaattaa 73861 attaggtaaa tataatggga tatttataga caaatttgtc ataatttaga atctaaagtt 73921 atattaaact agatatttca ttaaatgggt attttccaat aaaaaatata tataatatag 73981 taggaaaaca ttctttctaa aaaaaggaag tgtgctctta tttaaatgtg aactactttt 74041 gtctaactaa aagcttattt aaaggttatg tgtaaaacaa ggtaaaagaa actaggaaat 74101 aagagagatg taaggaaagc tatagaaata aagaggtatt tttggttaaa aaaaagctta 74161 aagaaaaata attttatata agaatattag gccaggtaca gtagctcatg cctgtaatcc 74221 cagcactttg ggaggctgag gtgggtggat tgcttgagcc caggagttca agaccagcct 74281 aggcaaaagt gcgaaatccc atctccacaa aaaaatacaa aaattaggca ggcatggtgg 74341 tgcatgcctg cagtcccagc tacttgggta ggctgaggca ggaggattgg atcgcctgag 74401 cctgggaggt ggaggctgca atgagccatg atggcgccac tgcacaccag cctgggcaac 74461 agacaagact ctgtctcaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 74521 agaatcatat atggtaaatt tttttctcct agaataaaat gactggttgt ttaagaaaga 74581 gagatgttca ggacaaacca gaaagtccaa gcgtgtcatg aatggtatgt ataagtcata 74641 agatttatgg aaaaaaactt ttatatgatc taattggcta taattaaagg gaaatgattt 74701 ataatggtct ttctagagat tgggttttga tattaaaaaa agacatatac taaagaattg 74761 gtttgaacaa tgaaattttc ttaaggtatt gctttactct taaaattaca agacatttta 74821 attctttaat acaaagttca actttgtgtc tcactgtttt cagctttctc tcccctttta 74881 aaaggcctga aataataact ctatcattca actcattttc agctcctgta agtttttttt 74941 tcccttcagg ttctaatttt tgtagcctga tgctaaaaat gttttattgt aaaggtctaa 75001 agaaaacgtt tccttccaac ataatattct gtgctcttgg ctctaaattg ttctatgaac 75061 cagaaaattt gcatttatga cccaggaaac actctttcta tgtctaacta attcaagtac 75121 ccttttcatt agttttgagt tgcaggttat ctaaatggac tccccatagg gaacagcagt 75181 catactgcag atcttttctt ttgcctttgg gtaactggcc taataaacag attttatgct 75241 ttatcaaaat aattcctgtc attactaagt attggtttgc ttggaaataa tactgagatt 75301 aaaaaaaatt taattgaggg tattacatcc atgaaacttt ctgtatgtgc ttttaaagtc 75361 cttgtgctat taagttacgg ggctttgact cctgggtcta aaaagggtct aaaaaggacc 75421 aagtcctgct aaatcttaaa cactaacagc cgttaaagcc tcatcttcgg agctggtaga 75481 agatgtcaat caaaataaac tgcatgcatg agacacaggc cagaaattaa agctattcaa 75541 ctcctcaagg cccagggact atcatgagag aggtggatgt gtgagattat aagggccgat 75601 tttgaccatt aatgtcaaag gcacactgat gtaagaccag catatgggtc cctgtgttag 75661 attaacaagg ttttcttgaa gcattcaccc actccttaat aaaaggttat aaaggttata 75721 aaaaagactt aggaaaatta tatcttatag tcaagatgat taaaatttta taggtttata 75781 aaattttgaa aaacaaattt aattgggcct catgccatct ttattaggac ttattgtttt 75841 ggaaattaag tctcctgtct caaagaataa aagtttttaa cttttttaaa aaatcgagtt 75901 attactttgg ctaaatgaat gacttatttt acaatgacct gtgatcctat tttgtgatat 75961 catgtgtttt aaactcttta tgtttgacaa acttttcaaa atcaaatttt aacttcagac 76021 ctcattaatt ttttgatatt agtctcctga agtccaaaag agacatcttt ggcttatttg 76081 atataataaa accatacaga agtattgtta aatatgaaag tgtttaactt tctttggatt 76141 tatataaatg tgttattagt atgtgttcca gaattatatg aaattcctgt gattctgatg 76201 tgtcttatag catgttatcg gtggtaattg tgattattat gttaaattgt tgtatgccat 76261 agaagtaacc aaatttcctt gtcaattgtc tctttaacta tgactgttct aagatttttt 76321 catccacagt tattttacct tcatcttttt tttttttttt gagagggagt ctcactctgt 76381 tgcccaggct ggagtgcagt ggcacgatct cggctcactg caagctccac ctcccgggtt 76441 cacgccattc tcctgcctca gcctcctgag tagctgggac tacaggcgcc caccaccacg 76501 cccggctaat tttttttttt ttttgtattt ttagtagaga cagggtttca ccatgttagc 76561 caggatggtc ttgatttcct gaccttgtga tccacctgcc tcggcctccc aaagtgctgg 76621 gattacagac gtgagccacc gcgcctggcc actttcatcc ttttcaaaag gtggttttat 76681 aatcagcata ggactctgac aggtgctctt gaatgcacac ttttgataac tttggacatt 76741 gtgacactag aatagaggaa aaacctccaa ggctcccatg gagagctgaa atgtttatga 76801 ttatcaagca gaacaggagt taactacata gactgaacta atagaagact gaaataatta 76861 tgacttttgc tcaaaatgtt gctcatcctt tgtttttcag agccaagaaa acttttcttt 76921 tgagctattt acagctttta acacttaagt atactcctat aaacaaaatt tagtgcatat 76981 ttctctctac ctgatctctc caaaatttgg aaactagttg catgtatact taacttatag 77041 caacatagtt agttgcataa gtgcaataag aatctgtttt cttttgtaac aggatacaaa 77101 tggaaaaaac tggttatttt accaaggtat tgacaggaat aacatacttt cagatatagt 77161 ctcctttaag aaatcaaagt tgacttacag ggccaaaaaa agccccttgt aaaagctatc 77221 ctcatacctt atctacacag tccctgtaca gtctcctaac acgtggtaag taaagaatgc 77281 cactttctga caggcccagg agacccaagt tttctgggga ccttgaggtg aggaattcac 77341 ccaattaata caagtatttg caggcacagg ctgggcttaa ggcattaaag ttgaatctga 77401 gattccttat agaataaagt tccagcaaag ccaattttaa aaaaagagaa gactatatgg 77461 caaataatta tttttgctga ctttatgcaa atactgcagc cataagacta aaacttattt 77521 tgccaatgaa tttgtcctat gatttgtctt tagtgaaaac gggactggag agagaaaaaa 77581 atatgtttcc aaataaacta tagcatacct gttagattct agtttgccta gtgtttttca 77641 atttttatta ttttctatag tttagactga attctaattt tttcctggct acaagtctcc 77701 aaaataatgt tttcaaattg tccttctttc ctttcccttc ttccccattt ttcctcattt 77761 aaaatcacta aaaattaagc tgtgctttct tcaagccctg caaactgaag ctagacaact 77821 tcagaagaaa ataacagcaa cctatttaca tacatcaacc actttcataa ctgcctactc 77881 atgcatggac ttcagagtaa tatggcctat atagattttc caggattgtt cttgtttgtt 77941 gttgcctttc tcccttcctc cccgttttct cttcatagga catgaaactt cacaacctgc 78001 taaacatgag ctttcctaat aacatgggac ctaatcatgt aggaataaag catcctagcc 78061 atgagagatc agacaaacct aagaccaaag gactcatttt cttctaaaat gctttctctg 78121 aaggattttt aaaagggggg aaatgtaaaa aaaaaaaaaa aaaaaaaatc tcgcgacccc 78181 aaaatcacta agccaaaggg aaaagtcaag ctgggaaaag tcaagctggg caggacaaac 78241 ctgcctcctg ttctattcct aaataagata actacaaaga ttttttacaa agctatatac 78301 ctccctcaca atttgctcac aaggaaattc cttgtgaaca gacagaattt aaagtcatcc 78361 ctctcctcac gtgagacaaa tgcctatctg attgctttct ttgccctatt gtttgactaa 78421 gccagactaa gggataagtg aatattcctg taaatcgtgt attcagtgaa agactaatca 78481 gaaactcaaa agaatgcaac catttgtctc tacctatgac ctggaggccc cctccccact 78541 tcgagttgtc ccgccttttc ggaccaaacc aatgtgtatc ttacatatat tgatcggtct 78601 catgtctccc taaaaggtat aaaaccaagc tgtgccccaa ccaccttggg cacatgtcgt 78661 gaggacctcc tgaggctgcg tcacaggtgc atacttaacc ttggcaagat aaattttcta 78721 aattgattga gacctgtctc agatactttt tggtttacaa aacaaagcca gaggcctagt 78781 ccagagcttg gcaaatagtc cacgctcact aagggtgagc tgaagggaag gcttgcagtg 78841 ttaactttcc taagatattt tagtaataac ataatagtct ctcccactgt ggtttaaaat 78901 ttcaatagca catagatttg catgttatat ttttctatta agatatgaac tattaaatat 78961 actgaaccaa ttctccttct ccccctgaag gtttggttta agattaggaa gatcaagtag 79021 agaagatatc agaaggcttt agattttaga tatatgtcac ccattgccat cgatctgtca 79081 tcataatctt ggatgggctc tataatgcca tccttgttct ggagcccagt tggagttggg 79141 ttcttgtagt gccactgcag cctagaatat ccaggcttcc catgctgtcc ttgcctcctc 79201 tgtctatgcc ttgctttgcc ctgactttgt ttggcctggt ataggcccct gtcatcagca 79261 ttcatttgta gtctccattt tcttaggtct gttttttaca aatttttcca aaatgttttt 79321 ctgccagatt caaattctgc atggcacatc ctcaagtgca gttcaccaaa tgtctgttaa 79381 atgaactaaa taaacaaaaa agtcagtgca ccatttttta taaggcacat tttttattta 79441 aaaaaagttg tgaattcctt gtaaacattg gttcccagga cccctgcaga taccaaaatc 79501 catgcatact caagccccac acttgaacct gcagaatcca agtatacaaa aagttggcct 79561 tacctatctg caggttttgc atcccacgaa tactgtattt ttgatctgca tttggttgca 79621 ggtgtgaaac ctgtggatat ggggggccaa ttctatttat tgaaaaaaaa tccacatata 79681 agtggacata tgcagctcta acctgtgttg ttcaagggtc aactgtatat ctgttttgtg 79741 tgccaaataa ggtatggata agaatttcaa aggaaaagtc atttaaacag aattacaagg 79801 aagcctcatt ttatgagctg acattcctca atcaccaata cgccctgtac ggctcacata 79861 catacatatg caaaatgggg gtggggaaag tccctagtaa aagaacagtg tgtttcaaga 79921 cagggcccat ctccatttat gctgaagagc ttagacacaa ccctgtagtc taaaggaatc 79981 ttccgaagaa gctactcata gggctttatg ggaaggtggc agccaaacat ctatctagta 80041 tattgcaaat tctgttcact atcctccctt tctgtggagg acatagggaa ataggaatca 80101 ttctggccca aagttcctct gcatcaaatg aaacacaaga ggagactcta ccaaagccag 80161 aatgaggcag ttctcccttt gctcacccat ccccagccca gaaggcatat tgttgcatat 80221 tccccagctc agaagctgac aatcatgcat tcaaatgtat tttgtctcat tcaacctcct 80281 tgaaaggacc ctggttcctg atgataaaaa cacaaaaagg gaaccttttg tttagtgctg 80341 ctagaagcaa gagggaggag gtagttccct gattaatatt ctagattaat tgatcatcaa 80401 aaagaagcag tcagaccttg gctttcttta tgctgtcttc ctcatgtgaa actgctaggc 80461 tagaaaacaa caaaaattta aaaatggagt gtgatgagag ttgaggagtt attgttaatt 80521 tttttaggca tgataatggt attgtagtta tgtttttaga aaaagaacgt gtatctgata 80581 tggtttggct ctgtccccac ccaaatctca tcttgtatta taatccccac atgtggaggg 80641 aggggtctgg tgggaggtaa ttggatggat gggtttcccc catgctgttg tcatgatact 80701 gagtgagttt tcatgagaac tggttatctg taaagtgtct ggtgcttccc ccttctcact 80761 ctctcttctg ctcccatgtc agatgtgcct tgcttcccct ttggcttccg ccatgattgt 80821 aaatttcctg aggcctccca agccatacag aactatgaat caatacctct tttctttata 80881 aattactcag tctcaagtat gtctttatgg tgtgtgaaaa cagactaata aagaaaacag 80941 tcttttagag aaacatgctg aattgtttat aaatgacatt atacctggga ttggtttcaa 81001 attaattagg gggcaatggc aatttcccac aagttgttaa ttgttcaagc tgggtgctac 81061 gtgcctgagg gatcattata ccattctatg tatgtttata cacatttgac cttttccata 81121 ataaaaatta aaggaaaaaa tctggccatc tgcatttctt ttcctttttc ttagctgtct 81181 ttgctctcat ttttgtgtct tcatctgcaa attaaaacac attataattt ttaaaaagtt 81241 acctctaaaa cctcaatact tttgagtaaa taaatatgct tttgctgttt ctcatttgtt 81301 gttacagaga aatttattca ttcaaacatt tattactatt tgtcaagaac tgtgctaggc 81361 actggggata tatatattat taaagtcaca ttaggtatta tatgatataa gaaattaatg 81421 ttaatttttt cataagttaa tggtattatg gttaagtagg aaaatatcat tttaaagaga 81481 tacacagaaa tattttgtag tgaaatgtga tgacatttgt aatttacttt gaatacctta 81541 gtgaaaaatc catatttgaa gcaaacatgt caaaatgcta acaaatgtta aacctgagta 81601 ctaaggtata tagtattaat tatactattc cctttacttt tccgaacatt tgaagatttc 81661 catgataaaa agggaaagaa aactatgtgt gtgttggagt ggggaaggag aatgctttct 81721 aggagctcag tctaatggag gagacaaata ggtaaagaga ttgatacaat ccagagtgac 81781 ataaggaagc ttgaggtgct ctggacacac agaggagtct caaaatcaga actgatttga 81841 ggaggggggt tgaggaatgg ttgaccaaga aaggtggctc tgagagtagt tatttgatta 81901 caaaatccat tgttgcaatg agtaagccaa gcttgttgga agaagagact ggtattgtag 81961 gcagagggcc tctctaccaa gcaatagagt caagagaaag aacatggaat actgaggaga 82021 cagaaagcag ttctagccta ggcaacacag catgacctca tctctacaat tttttttttt 82081 tttaatagct gggcatggtt gcgcgtgcct gtggtcacag ctactcggaa ggctgaggca 82141 ggagaatcac tggaaccctg gaggtggagg ttgcagtgag cagagatcga gccactgccc 82201 ttcagcctgg gtgacagagg gaggccctga aaaaaaaaag aaagaaaagg aaaaagaaag 82261 aaagaagaag gcaggcaggc aggcaggcac gaaagaaaag aaaagaaaga gaagagaaga 82321 gaagaaagaa agaaggaaag aaagaaagga agaaagaaaa aaagttttca ggccacagag 82381 tatggtgcaa caaaaagtga cagggtatat gtgattagag aaggatcagt cagggtctga 82441 actttaccct gagggctatg gagaactact gaaggatttt gagtatggca ctgacatgct 82501 tagatgtgtg ctttaagaaa gtttggtttt ccagtggcaa ggaagtaaag tgacattggg 82561 gaccaggaaa ctggggcaat atcagaggaa gcagagcctt tgtggaatat gccaaccaag 82621 agaaagcacc ctctggtcaa tcattagaca ggagagggcg taaaaaaggt ggctttttgt 82681 gatgcactac cccagatggg gtgccttcca taaattggca gccagtctaa agggagtgcc 82741 tttttgaaaa ctgtacagca gtgcccactg ggcacgagaa ttacacaaat gagcccttcc 82801 actgggctga tgcagccctg taaggcttac tttggtacag cccaatcccc tgacttggtc 82861 tggtgccatg tgctttgtac aactatgcac aaccaatctg gagtcagatt gcttaggttt 82921 gaatcccaac tgtatgacct tggggacagt acttataatc tgctctgtct ccatttcctc 82981 atctataaga tgtacccaac agttatttat ggattagatg gtaataatga atataaagca 83041 cttatgggga atgcctggca caaggttagc aagcgttcaa aaactcttca gtattattag 83101 tgttttcagg aaccttcttc taaaataagt tttaagaaat aaaactagtt ttgaagcaag 83161 gcttgcagaa tgctgttggc tttcttacca ctggagtctg tttcagaatc agagtcactg 83221 tcgctatctt cagaatactt cttgtgtttt ctttttgata aatttttctt tcttcctttc 83281 ttggaccttt cacttgaatg actagactac tatttttctg taaaaatata aataaaaaag 83341 taacttctaa aaaagtattt ctgattatca attttattgc atttgaaata ctgaaaagga 83401 atagatgata ggcaatttga ataaaagtaa agtctaatgt ttaactattt ataaactttt 83461 tctaacttct caagaaaggt ctgtatctta aaaatacttt ctgatgtaga aactgaagta 83521 gtgcatttct ttggctcttc attctttact tcaatatgtt cagcagagct gaatttaaag 83581 aaaatggtta taagaattag aaaagtgatg aaatatggac catagagtat aaaaacacag 83641 acttaaaact tttcagcaac aatgacaatt gtccaagaac cattaaaaac aagcattgga 83701 cttctttcat aagaatacag ttcatgtctg taatcccggc actttgggaa gccaaggtgg 83761 gaggatcact tgagccccag agttcaagac cagcctaggc aaaatagcaa gaccccatct 83821 atacaaaaaa tttaaaacat tagccaggta tggtggtgct ttcttgtagt cccagttact 83881 caggaggctg agatgggagg atcacttgag cctgggatgt caaggctgca gtgagccatg 83941 atcatgccat tgcactccag cctgggtgac agagcaagac tgcctaaaaa taaaataaaa 84001 taaaaaataa ataaatacag ttcaagggtc tgccaccttc ataataataa aataacttac 84061 atttgcacaa tgctttacag attacaaagc actactcaaa ctgaaagcag catggaggat 84121 ttatattttc agaagaaatc acaggctaaa aatagagaaa acagaatcag aaggaaagat 84181 gaaagaaact gtatataaaa agtctgaaga agatatcata attttattct aagcatgtaa 84241 aaaacaaatg gtagagcagc aactgttctg tctcctcaga gcaagaagat cctaagaagc 84301 tacaatctaa agagatttgg gggacacatg ttctcaggac ctgaggctat gtcacggtaa 84361 aaaattgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtat ttgttttgtt 84421 gcagagaatt cttaaaataa gtattattaa agttctacag taggcttggc gtggtggctc 84481 atacttataa tctcagcact ttgggaggcc aaagcacaag aacagcttga gcccaggaat 84541 tcaagaccag cctgggcaac atagtgagac ccttctgcac aaaaaattta aaaattagct 84601 gggtgtagtg gtgtgcacct gtattcccag ctactcgaga ggctggggca ggaggatcac 84661 ttgagctggg aaggtcaagg ctgcagtgag ccatgatcgt gccactgcac tctagcctgg 84721 gtgacagagt gagaccccgc atcagaaaga aagagaaaga aagcaaataa atcaataaat 84781 aaagttctac aatactttag aaagggactt tctatattat ataatctttc cctttgaaga 84841 tcacatctgc aataatttag gtattgccta gaaaattatt tttagggttc cttctgactc 84901 taattctatg agttcaggtc ctttaagcat ttaccaaggg tatctataag aatacacttc 84961 agataataat gttttatcac cttgtcacaa attacaggtt tacctgtgaa acacattgcc 85021 tcaactcaac ctaaacacag aaatagagga gagggaagaa gaaacttatg agcaacacca 85081 aagagaaaag aaataagcaa aaaaaaaaaa gtactaatat ttcacatagt attggataca 85141 ggaaaattaa ctaaaccgaa ctacactggc atctgcattt tttaaaaatc gctgtgtaat 85201 actattgtct ggaatgcaat ctgctcctgg gttgaagaaa aaaatcaaaa tataaatgta 85261 ttatctgttt taacactgaa ctcctctatt aataggaaat tagcacccca tgaatacaat 85321 ttaaagaaaa caattagcat gctaaaatgc actagcaaca tgttttaatt cttaggtatg 85381 ttagactccc tttagttcct ttttaaaaaa gagtacctta cctatctgtt ctttggagga 85441 cactaatttt ttattgtatt ataatgtttg tcatttattg ttttaatttt tgctgttact 85501 cataagctga aaaaaataaa ctaagagagg ttcaatttcc tcaaagttta gctgcttacg 85561 tctataaaaa tccttataaa ataaatgttt gatttcattg tttgaataaa tattttttac 85621 ttctttattc agggaaaagg tattataagt atacagagtt agtctcagag ggtttcccat 85681 ttcaagagca ggtaatcaaa aagtaaacat acgtctaaca gttcttggaa tatgagtata 85741 gggaccaagg atgcctgcca gactataaag acacacagtc tttgtggtgt gaggcacaaa 85801 attaaggccc aatattgtgt actaccttga caattgggga aaccaggagg gctcccaatg 85861 gccttactgc gaattctcct ctccattctg ctcctgcaaa taaggtcccc tagccaaatg 85921 accctccttc tcaaagagac cagacgcagt tccagcttat ctctgggtag cagattttag 85981 tacattgtca gcttgtggaa ttattcaaac aagccaatca tatccttcca agggaaagca 86041 ggtattacct caatttcttg ataccacaaa gtctacctcc cacagcccct gattattcac 86101 tgtcttacca acagcaacct cggtgtggcc cttcgtggca cgctgtgttc tctcccattc 86161 cagctgtgaa catatgtgat taataaactg ctgttgatct catctgtcct gtgttcagct 86221 actccataac acaaagggtg ggaatccttc cctcatcaac atggtgaata gcagacaatt 86281 ataacactcc ctgctgctac ctggttttat ctgtgcctta agataaatta tttatcttct 86341 atgtgcacaa atgcaaattt tacccaagtt gttcttatat gttttggtta ggattctaaa 86401 ataagaagtt ttttgattga cacaaactgg gaggaccttt ccatcccaaa ggcaggagcg 86461 ctcctcagag gacaggtgca gtagtttcac tcacctctcc cacatgggca aaaaatgtgt 86521 atctgtaacc taactcagga ctgtttttac ttcctctgct cccaggcatg aaactctaat 86581 aaccaacacc tgtgaaatgc ttattacgtt caaggcacta catcattaca ataaccctaa 86641 aaacgtcagt actatttgta tccccctttt acaggtgaaa aaataatgaa gcttggagag 86701 gttaccaagg tcacataggt attaagtagc aaagcaaggc ttcaaactca gaaagtctca 86761 atccagaggc tgtccttctc atcactatgc tttattacat cccactagtg gaagaaaaag 86821 actcatcaag acatctccca cctgttggaa tttcaaccta cccctcaata cctgtaatgt 86881 tttaaaattt ctcatttttt aaccgtagtg catgtaaggc actcagtata gtacctgaca 86941 cattcaataa acatttgata tgagtactta cacagcaata attataatta ttttccttct 87001 caaaccatct acttcccact ccaatattca ctctcattag aggaaaactc atctcactct 87061 cctctcattc ttcaaggtca agattgagac cacttaagac cttcctgaag gtagccacat 87121 ctgcagttta gcacatcact ttgcctacat tcatccagtt atcttgtctc agaggaaggt 87181 atttctcctt tccaatttaa acccttaatc tcaacccttt ccaactccct acaaggaatt 87241 tccacttagt tagcctgtta ggatctcaaa ccgatcattc actcaacaca caacactttc 87301 agtagattga gcccctatta agtgccaggc actgcaaatt ctgtaatgaa taaaataggg 87361 gtcctccttt ccacgagctc agtctagtgg agaataaaaa tatgttcacg aacgataaca 87421 caaaacaaat aaatgctata cgtaggtctg tgacccttag ccgtgtctgc tccgagatca 87481 gaggcgaaag agggtggagg gagaccgggg caccctacaa aggggcccaa ggatgagtga 87541 gaatgtgaga gaggagaggg gcgaacaagg agctgtcgtg gatagattgc agaaaaggat 87601 gaaaatgggc gcggtgaagg tagaaggagc gaaatgcaaa tggtcagaag cgggctgaaa 87661 gatgaagcgg actcgctgaa ggcaagtggg ctccttagag cgaaggtgca gccatgatct 87721 cactcacctc tgccacggat tcacctccct ctcctcgtcc cgtaggagcg tgagtagttg 87781 ccgcaatagg cgcaagagca ggcagaagcg aaagtgatgg tccacggcgc agtcagccgc 87841 tcttgagaat gcgaccacaa ctgggagcgg cagggctggt ttcgggagcc ttggctgagc 87901 ccattccatt ccccagtcag agaacaagag ttccagtgag agttgcgccc cagcggggaa 87961 cgggcggctt tgctgggctt ggggctcttg gacaaactgc aacacattcc cccgcccgga 88021 gcccgaggcc tccctctcgg cactgcagaa ccggacaccg gagctactgc agcctcgaaa 88081 acgggacatt ctcaggagcc tcagcagtat ccgttgcctt tgctccaaaa acccagcccg 88141 cgcaaccgct gggagcaaag gaaaccatgg gaactattgc gagtacgttg gcaaaacatt 88201 tccccaggtc tgtgtggcct ccggccaagg tgaagaggtg tgtgtcttgg tgtaaataaa 88261 ccgctactgc aaccgatttc gctcatcctt tggttccgca ttgaggctcg ccagggtcta 88321 aatccgcaca gagactaggt cttccttagg cttctacttg aaataactga agggcaggac 88381 gagagtccac tggagacatt tcacagaatg gccatatcaa caacatagac aattctgttc 88441 gtgtacgggt gctttccgtg aatgtgtgta ttacatgttc tataatattg ggcgcactct 88501 gtgtcgtgtg tgtgtgtata aaatatgtac atttggttat gtgtgtatat atgtacatat 88561 ataataaata ggactggctc aaacagattc acatattttt tcggagtcta tttgaattaa 88621 aagaaaaatc tgagatggga agaaaaggga caaagtattc aaagtatttc gtacacaact 88681 gaaagtgttt atttcagtgt tgtcagattt attaatagca aaaagaaata tacagtaggc 88741 cacttatctt tgaatttcaa ataaagaata atgttttagt ataagtatgt tccatacaat 88801 aatacagtag aaagtgacta ctttctatta aagatgaggt gatagccacg gttgtacttc 88861 ttcctctcct actacttcta aaatttgttt cttatattct ttttttgttt gcttgttttg 88921 ttttgttttg agatggagtc tcactctgtc acccaggctg gaatgcaatg gcgcgatctc 88981 agctcactgc aacctccacc tcctgggttc aagtgattct cctgcctcag cctcccaagt 89041 agctgggatt ataggtgccc gccaccacac tcggctaatt tgtgtatttt tagtagagac 89101 ggggtttggt cacgttggcc aggctggtct cgaattcctg acctcaagtg atcctcccac 89161 ctcggcctcc caaagtgctg ggattacagg catgagccac cgtgcccggc cttgttcctt 89221 atattctaag gcttttcgct tgctgtttga gaaagggaag tgggagcagc ccctgatatc 89281 cacaaactga cctcgcactc acaattaggc atgttcttct cttgttgaac ataaactcac 89341 agaacaccaa cccaagacaa gatcactggg agcctgatca agtgagacaa aacaagacca 89401 cttcataagc ttgtctaagc acagacaaaa acaagctcac tgtgacaccc agaaaacacc 89461 aaacacccct tcttggccaa gataggtcac tgctactttg ccagttaatg tacagcttta 89521 tcctccctct agtctgccct ccctataaat aagatttatt gaaatgtgca atcatacaat 89581 tgcccctgct ttctgatgta tctgagacgg aactcctgat tcctagacct tcccccaaat 89641 tgtccaaatc ctataatagg ttcattctaa caacttctta ttgaaacaca tcatagttac 89701 ctatagtgtg tgttgtccct ccctggagta agagtaataa actcaatctg tcaactatgg 89761 gtatgtttct ggtagtcttt ggttgaaggg caatgacaca ttccactgcc taaaatgttc 89821 ttccccagat atttatatgg ccctttccct tagttcattc cattctttgg ccatacatca 89881 tcttttcaga gaggctgtgc ttgaccacac aatttacaat aacaccctca tcattctcta 89941 tccctttgct ctatttttct tcatagcact tactactttc tgtttttgta ttatatattt 90001 atattatata tatacaaatg ttgattgtat gtctcttcca ttataatgta agttccatga 90061 agaccaatat ctcaacttga tgataattat atccacagtg cctacaacac tgcctacaac 90121 actgcctggc acagagtaaa tactaaatga tatatgttta atatgtattg aaaaaatgaa 90181 tgaagtttac agtgtcgggc tttataagtt tcattgtcac atagtcataa tcaatctgtt 90241 gtttggactt agtcttccac ttaaatagat tggcttactg caaaatctcc attcttaatt 90301 ctttatttgg actcatttct tatttgattt gacttcatat attgccaagt aattttcaca 90361 aatgactcat aggtagtata ttccctcaat tttgttcaag tttggaataa gctgtctgtt 90421 gctttttttg gtagattgta ttattgttag caattatcga cgactccttt tccagtagga 90481 gagttatata tccctatcca attgactttg gaaagtttta agaaaaaagg gcaataggca 90541 gtattagaca ggtcacttaa agagagattt gttgggatta agcagaaaag gggatagtgg 90601 tgagaaagca acatatattt ggcacttact gggttccaga cactcattcg ttcagcatta 90661 agcaaccatc tctatctaca atttgttggg ccaagggatg cttcacaaat aagttagtac 90721 ttgagttggg tttttatgga aaaccaggca cttgccagga actaaccaga caacaggtta 90781 agcgaggtgt ggctatgaag ggaatgacta ttccaagctc agggctccag gctcagactt 90841 gaaggcagta aattgcatgg catattctgg aaaccgtgag cagtttgaca ctatgcatag 90901 ttgattcatt taatcctcaa agtgactctg tgggaaggta ttattatccc tattttacag 90961 atgaagaatc aggcataaga tcaaagagct aggaagtggc agtatcagaa ttccaaccca 91021 ctcacatctt ctgcttaaga ctttgtttct tccactctac cttactaggg tttctgcaga 91081 tcccatggag tgaccttacc aaatctgtaa ccataagcat aacatatagg aagtgctcta 91141 agactgaacg agtgtcactt tatcttttgt catcaattaa tatatattaa gcacccacaa 91201 gattgttggc attttaacac aggagaggac agttaccgcc ctcatggaat tcacaatcaa 91261 ggtcaaaagg caagcaaacg ctgacaatac ggctataaaa cctaactcat cccattttca 91321 gccaataaaa cctttctaga atgtgaaaca attgatacct ccaaatccat ttgatatagg 91381 ttgttctctg acctcaggtg atcctcccac ctcggcctcc caaagtgagt tgttgttgtt 91441 tgaaatagag ttcaaactac tgggttgttt gaagtagggt taagccctcc agattaaaaa 91501 caaaagtaaa agtgaaatga ctagttgaga aacagaaaat taattaaact ttccagtgat 91561 cctgtcccaa agacaattat taaggccact caggcttctt ggtctgttct gctttcctac 91621 aaataaaatt cctttcctga ccattggacc ccctttcatt ataggtagct tttaaacttg 91681 tttagttttt gcttggttag ggggtgagag atacagcaag aagggaagag aaggtttact 91741 gtgctttgaa tgaacaggtt tcatcatatg ttagttggta ctccaaactt ttaaaatact 91801 ttcaggaaag tgacaatggc atggcaattc tgttgccagc ctcagtgatg taaatgctgg 91861 tacaaaataa attgcattgt gccttccagt ttactgagag ctgaccaaca aacatgatat 91921 ctagagtggc atctattgcc actgtgatct gtatttcaaa gatatgtctc attttagttt 91981 tcatcaatta agccactgaa agataaatag atcctaaact ttcaagatat actttgttct 92041 aattactaag attttttttt ttactactaa gttttgggga ggataactca gaaaaatggt 92101 tattaagtac tcagttgcca ggaaaaatac taatctgcct tagtttattt tacaacagca 92161 tacttctatt tgccatgaaa tatctttata gtaaacaatg aaacttttca taatgctaag 92221 tcctaatcaa aaatcaataa tgcaataatt cccacatagt ttgttttccc acaatacaaa 92281 atcttggaga agagcttaag gaattgcata tttccatgtg aaatacagct atattatacc 92341 actttgatgt tttattatac catgtggctt tactttattg aggaggatga agaatagatt 92401 agggataggt agatgagatg atgacatcca tgtctgattt cctgtcagag aaaattagat 92461 atagttacat ctgttttcag tctaatatcc tgatggacat tttcagcaag attttttagg 92521 aaagaagcac aggaattcca ctgcaaggta ttaataggta caaatttaag attttggaaa 92581 tgaatgcagt attatcctat atatatcttt ttaaaaagat aatgaaggta gtgaatggcc 92641 ttaattatgg cagaaaaaaa gttactattt ggttccttag ttgaatacga tcaataacca 92701 gcctaaaagg gaaacatttt gctctccaaa cacccatcct gttgcttcat ttattatggt 92761 actcagttgg aagggtatta ccagaggatt ttaaaaaatt atgtaactca caatgtattt 92821 tttcggtcat taaacaatca ctgaacacat actatgtgtc agacactaaa gccctctgaa 92881 gatgaaatgt tatggaaata gtaataataa aaaattaaaa tagtaacaat aaaaaattaa 92941 aaatgcaagt gttaatatta ttaatctttc aagatacagg tcaaatgaca gcattcaatt 93001 tgctattctc ttctctctgc ttctacacct tgtacaaata tttttgtcca tgtttgtgta 93061 cctacctact gagctcccta aagaggtaga gacctttggg ttattcattt ttgtattccc 93121 gagagcctaa catagggttt ttgctacaga tcaagcaggg ggtcagtaaa tgctagttgg 93181 aatgagggaa ttaacatagt actgcaccat ttagtattca attaagaaga tagttgtcca 93241 aacactaaat ttctttctta aaagtagtta ctgtaatgta gttgctctct tgttatccca 93301 gaacttgaat taccataaga agttgtgaat ttctatttga ttgtatgtat tttgattata 93361 tttcaagatg gtttatcatc tagcaaacag tctaatagag cagattcaaa cttgcaaaat 93421 aaatgaaacc tgtatagtct ctaaaagttt gtatgcccta gaagttcaca ttgaatttcg 93481 atgaaaacga ctcgcttccc ttccatccaa ttgcttgcga ctgtcttacc cgatctttcc 93541 ctttcctttt ttcccacccg ttagccctcc ggaaagagcc gaacacacaa gagcttccca 93601 gtcttcctcc gcccccttgc ggaaagaacc gaaggcagag cacggcgccg aagtggagcc 93661 gcctcaagct cgggcccttc cggaccaccc cggcctgcgc tcggaagagg agggcgcctt 93721 tggcttcagc gcttcgcccc cgcgctgtgc cctctgtcgg cggcgtgggg cagctgtagc 93781 agcgttggcg gcaggaggcg gcggccgcgt cgacgtcgac ccagactgga gcgacgttta 93841 aagaaggggc agaatcgctg gggagtgcgg cttcttcttg ttgggggact cccagccttc 93901 cgcgcgtccg gaggaggaga agcggcggcg ccgggaagca ggtgaggacc cggccccaat 93961 cagggagggg gcgaaggagc gcgcttgcct tctccgtccc tgggccgcgc ctgcgtttgc 94021 attcgcctca cttgaaccag gaagcggcag agtcggaggc tcagctcctc cggctctttc 94081 tttgtgtggg ccggggagcg aaggagggga cgagccgcga gggccgcggc gcggtcccct 94141 tcgccgttag gggccgaggg gctgccggtg ccctgggtag gccccggggc ttggggaatc 94201 catcacagag acccacttgg cttttccctc gcccctctcg ctgctttttt gtgcctcttt 94261 tactccccta gcccctactt tgatttagag cttttctgcc aggcccatcc tccccttccg 94321 tccccttccc ggggcacaac aatgccgcct gcttcgcttc atcccccccc caacacccca 94381 ccctgccgtt cgccaccttc tactttccct ggtaccccat attcctcccg ccctcactct 94441 gctgtttgta cctccgtctg tatttgcaag aagcttgctt tgcacgtgaa ttggggttaa 94501 aaacctcggt tgcagctggc agggtccata tggggaagag ggaggtggag gaagggagat 94561 gaggttcagc tgccgcaggg agccgcgtgt cggtttgttt cactccctcg cggatggctt 94621 ttatctcttt ccaccgtcac cagctcctcc gaactggccc tcggtcaagg gtttcacagt 94681 gatgtggaat ggcttttcca cagtcgaatc gcatatcttc ggagtgggtt cagaccgact 94741 tgtgctgtct ctggggccac cctgagtggg aggaggggga cacataaacg aaacgaaacc 94801 gagtcccagt gtcacctgga cacgtacatt tgacgcattc ccatattgga agaacccggg 94861 agcaaatgac aaaaattgcg tgccgttttc agaaaacctg tctttccact taatatcgac 94921 ttctgtcgaa atgcccgtat ttcggaagcc ctggtttttg gcaaactgtc caaaggtcac 94981 cctttctgat gatgaatgcc tttatggtaa cctggaattc cttggtactg ttaacatgca 95041 acaccttcta ataatgtttt aaatcttgat ttttagactt tttgcctgct gtttgttctt 95101 aaggtatcat tttgaaaaat ttagaaggta tttgagacag aactgccctc cacctgtata 95161 taacttaagc attgtgacat cgacaccctt ttggtaactt caaagggaga gaatattaat 95221 aaaaatattt cttcactaaa agtacttgtc aactaattgt taatatataa tatccatata 95281 ttaaaatata tccatatata atatgtatgg atattatata tttttaatat atgggttttt 95341 ttaaactctg gcctctttgt tactggaaat atcttctgca aagtgcaaaa aatgatgttt 95401 tgctagtgct acacaacaat gcattctact gacatcatcc cctttttcta gagtagtcta 95461 ttccaataag ttaatgtttt cattttcact aactcatttg gtgtggaaaa aattctcata 95521 ccaatgcata tgattcttta ccaacaaaaa taaaatgttt cgtgttttct tggaagtggg 95581 catttagaaa tggtaaactt tccttttcct ttgtcatatt ttttcatgat agtgtctgtt 95641 gacttcactt ggcactgttt aacattccac ctctggaaat tgtaaagatg gagagatgag 95701 gggaagggac aaggggatga agaaagagga aagggtaaga aagatgttta atagtttatc 95761 aagatgaggc accccaattt aaataacaga aaaataatgg gacgttggga ggtaataatt 95821 gtcagtgttt ataaatgtca ttggctccag ggtgtagatt ctgcatatat tcttaaagat 95881 tagttaaaat taagagtatg tatttcagac acagtattgc acgcgctgag cttttctgac 95941 atttcaatca tttacaggcg aaataacttc tgtgtgttca tgatgatttt ccttaatttt 96001 gacagttaaa agtcaactct tacctcttct tggagagaaa acccaaaatt gggactaaga 96061 aaattaattc taatctttaa aaatcactta gtggaggtgc atgtatctaa tattcaaaaa 96121 aaggtgacta tggagattac tgttcaaatc tctgtaacaa tgtaaaactt gctcaactca 96181 ggagcaagcc atgaaattgg acacttgttc caaaagccaa cctgtatgaa caatttctgt 96241 aaaagccaaa aaattatgct gaactttggt taaaacttga ataaactatt taatgatgct 96301 actgcttaaa ttctaaataa gtacttttgt tttttctctc taatcctctc ccatcccctc 96361 ctctctttct cttaaaggca tggagagtag aaaactgatt tctgctacag acattcagta 96421 ctctggcagt ctgctgaact ccttgaatga gcaacgtggc catggactct tctgtgatgt 96481 taccgttatt gtggaagacc gaaaattccg ggctcacaag aatattcttt cagcttctag 96541 tacctacttc catcagctct tctctgttgc tgggcaagtt gttgaactga gctttataag 96601 agcagagatc tttgcagaaa ttctcaatta tatctatagt tctaaaattg ttcgtgttag 96661 atcagatttg cttgatgagt taattaaatc agggcagtta ttaggagtga aatttatagc 96721 agagcttggt gtcccattgt cacaggttaa aagcatctca ggtacagcgc aggatggtaa 96781 tactgagcct ttacctcctg attctggtga caagaacctt gtaatacaga aatcaaaaga 96841 tgaagcccaa gataatgggg ctactataat gcctattata acagagtctt tttcattatc 96901 tgccgaagat tatgaaatga aaaagatcat tgttaccgat tctgatgatg atgatgatga 96961 tgtcattttt tgctccgaga ttctgcccac aaaggagact ttgccgagta ataacacagt 97021 ggcacaggtc caatctaacc caggccctgt tgctatttca gatgttgcac ctagtgctag 97081 caataactcg ccccctttaa caaatatcac acctactcag aaacttccta ctcctgtgaa 97141 tcaggcaact ttgagccaaa cacaaggaag tgaaaaattg ttggtatctt cagctccaac 97201 acatctgact cccaatatta ttttgttaaa tcagacacca ctttctacac caccaaatgt 97261 cagttcttca cttccaaatc atatgccctc ttcaatcaat ttacttgtgc agaatcagca 97321 gacaccaaac agtgctattt taacaggaaa caaggccaat gaagaggagg aggaggaaat 97381 aatagatgat gatgatgaca ctattagctc cagtcctgac tcggccgtca gtaatacatc 97441 tttggtccca caggctgata cctcccaaaa taccagtttt gatggatcat taatacagaa 97501 gatgcagatt cctacacttc ttcaagaacc actttccaat tccttaaaaa tttcagatat 97561 aattactaga aatactaatg atccaggcgt aggatcaaaa catctaatgg agggtcagaa 97621 gatcattact ttagatacag ctactgaaat tgaaggctta tcgactggtt gcaaggttta 97681 tgcaaatatc ggtgaagata cttatgatat agtgatccct gtcaaagatg accctgatga 97741 aggggaggcc agacttgaga atgaaatacc aaaaacgtct ggcagcgaga tggcaaacaa 97801 acgtatgaaa gtaaaacatg atgatcacta tgagttaata gtagatggaa gggtctatta 97861 tatctgtatt gtatgcaaaa ggtcatatgt ctgtctgaca agcttgcgga gacattttaa 97921 cattcattct tgggagaaga agtatccgtg ccgttactgt gagaaggtat ttcctcttgc 97981 agaatatcgc acaaagcatg aaattcatca cacaggggag cgaaggtatc agtgtttggc 98041 ctgtggcaaa tctttcatca actatcagtt tatgtcttca catataaagt cagttcatag 98101 tcaagatcct tctggggact caaagcttta tcgtttacat ccatgcaggt ctttacaaat 98161 cagacaatat gcatatcttt ccgatagatc aagcactatt cctgcaatga aggatgatgg 98221 tattgggtat aaggttgaca ctggaaaaga acctccagta gggaccacta catctactca 98281 gaacaagcca atgacctggg aagatatttt tattcagcag gaaaatgatt caatttttaa 98341 acaaaatgta acagatggca gtactgagtt tgaatttata ataccagagt cttactaaac 98401 tcctttgaaa tactagaaag ttttgttttg gatgatgggg caggggtttc agaagatctg 98461 taaaacaaat taaggtgcga acaagttaat ttgatctgcc acattatctg aaggaagtgt 98521 agtgggattt ttgttgataa tttttagaag caaattttcc tgaaagtttt gagtagaggt 98581 gagaccccct ccccaagtat ctgtttatat agttagtttt cagctcattt aaaagaggca 98641 aaaattaaaa gcttggagag atagtttcct gaatagaatt tgaagcagtc tgaatgttct 98701 ttgaaaataa ctggagttat tagcataccc tagtacatct tacagctttc cccttccatg 98761 ttagcacttt actgctgaat tctcaatttt cttaacattg agacaataaa tgtgtgtttt 98821 gtcttgtata tggcataaag agtaaataag ttttagagtt gttctggaaa atgtcagaat 98881 aagtcagtac ttgggttgtg taatctgcta gtccaagcga acagcaacct cctgctaccc 98941 tccctctatg aaaatagcca tgcagacaag tctctcatct gaagaacaaa ttagatttag 99001 ctaattagaa ttaatcctgg ctttcattgc catagtctgt aaaagacttt ggtggctaga 99061 ccactttata ccttcgcagt gtggtctctg ggggcaaaaa actaatgaaa acaatctctg 99121 taatggcaga taggaggaga tgaaaagttc tgttgcatgg atttttaatt ctctggctac 99181 cacatagtag agaatggaat gaagatttcc ttttggcttc ttaaggttaa aaatattccc 99241 atgaacatga aaattttcaa attttgaatc tgaaagccac caaatgtatc tttatgtata 99301 aatccttgta aatgatagat tccatgggtg agactttaca tattttgggt gggaggctac 99361 tggcatatat ttttaaatgt tcatattgcg tagaatctcc actaggaagt ctttatttga 99421 aatagttgaa tcagtgatct agtattttcc tttcggcaag atttgttagg tttttacccc 99481 ttctaaaata agttttattc catctgcaaa ttgctgcaat attatagtaa tcagaaacta 99541 cataaggaat gttatatagg cttgtcagtt cccatttttc ttgacaacaa taaataccac 99601 ttttaaaaat gacacatatt taaacactta gaaaataaag ttaacactta ctgaagtgct 99661 agtactaaac tgtgctagta ctaaaagaaa acaggttgga acatacatat agcctagcat 99721 ttataacaga attgttgaac gtctgtaaat gatttttttt ttttttgcaa aggaaaaaat 99781 tgatactgga aaagattgtt gtgcatagtt attagtcatt tgtaaccttg cttaagtatt 99841 tcttagtcca acatagatat tttctttctc ctgaccatgt attttaaaat atagtctatt 99901 tcttgacttt gaacttaaag ctttaatcat aatttctcat gtatacatcg ttcttctgat 99961 ggtaagctgg atttgaaggt agtggtttca gtgtttctta agttggtagc tgagggtatc 100021 aggcatcagt tcatgcaata atacaagaaa aaaaatcctt tgcttgccaa gaggtagagt 100081 gatgtgcatt tatctgtttt ctgttctgta agtctagacc ttcaaaccat ttgtaaacta 100141 acccctggga aatttgaaat tacctgataa cttaagactc tgtgatctct ggaatcacca 100201 tatgtttctt ttttgtgtag atattaataa cattactctt tgactatagt gtgcactctg 100261 aaatgtactc agtgaaaatt tgttttgagt ttcattaatg ctatttcacc agttagacat 100321 aattacttct accgatgtga atgatacgga tgccggcaga gcttccagat ctttcagact 100381 caactgctag gtcaattagt ttgtcataat aaaacttggc agattctaca agtctattat 100441 gacaaaccag gaactaattc tataatggaa aactatccat tctgaataat aggtatgtaa 100501 ttatttgctg ctgctgctgt gctctgtaaa ttctgaatat gacatttaaa ctctgtgcct 100561 actaaaggta tcttctggag tttttgggag gagagaaact ggaaaattaa attgtatttt 100621 tgccagaaga ctcttacttg catgtgtctc agggtcttca gtttttctat aagtttccat 100681 atccaaagtt cagaattcat gtgaaatact tctttggggc aaaagtcctt cattcctggt 100741 atttattgga ttggaaatct gtagcaagat gctgtttaaa attaccatat tgttttttta 100801 tcttatactt agctctctgg ctattgaact tccttttctt gtttgaagtt agcttcaaat 100861 ttgctcctat gctaaattac ctgtaaatat tctggatagg aactacttga aatagtaatt 100921 tgttaaaaga tatgacaaaa tgaaaatgct taaactacag aaatttaaaa atgccataac 100981 aatcttgcga gactaacttt aaaatatact ttaaatgatt attatgattt tggtggtaac 101041 gatcccccac acacaaccac tatgaagaaa taatgccgca tttttccccc attgtaccaa 101101 aaagataaaa aaatggtaaa cactgatcaa ggtattttgt attgtcaagg catgcatatt 101161 ctaaagaatt aaatgctaac ttaacagcac tggctttctg gctggtcaac tatatgaaac 101221 cttgttcatt cctccgagta ctgtaatgtt cacacttgta caatcttccc tgtcatgact 101281 ttaagttcta cttttcatta accatggcct gatattagtt cttagagctt cttgtggcaa 101341 aaataaaatg atttaattct gatgtttgag tgcgtgtttt acaagattgt ctttcagaaa 101401 ttatatgggt ttttatattg tttttcaatt tttatagcag gagactgggg ctgtatttct 101461 gatgacagca cggcaaaatt tcccactagg ttttatatgt tggttaaaaa tgtccccttt 101521 tatcttgaac cctaatagtc aaaagtgagt cagctgctgt cagttgcttt agagcgtttg 101581 ctgtactttt atagctacct tgacacaagt ggataacgag gggaggatag ttttattcat 101641 ttgaaacatc acaaaagcag tctgagtttt caacatggca gtgatacaga ttttaagcaa 101701 catccagttt atacagtttc tggattaata ttttaatgtt tcatgcccag ttcagtactt 101761 taaagcagaa ttaggaataa agcagtaaat attacaaaat gcagtcaaga tacatattgg 101821 aaatacaaat ccattcatta cagcaaatgt tttgcaaaag agtaaagcag caaatattaa 101881 tttttctaag gtaacatgca ctttgtataa ttcaatgtaa taaaaaagct gctcaaatta 101941 agtgttacaa aatctatact gtttcaggtt attttgtaaa gtaagaaaaa tacatagaaa 102001 tgagccatag aatcttaaca ctttaagaaa tgtgataaat gtaggataaa ctgtgtaatg 102061 gtgccttaaa aaattaaata tggacattca ctaaagccaa cggtcaaatg taaatggcaa 102121 aaaaattcat tgagtattac aagttgatat ttgtttagtt agtcagttgg gtattagtct 102181 ctttttgaaa tccagtagaa ttttaatatg ctcagaactg taaaaaatca tagtaatttt 102241 gtataacata aaaggattat agtttttcgg attcaaaatc tggataagaa cattcatgat 102301 gctgaaagtt aaactgaata tcttgcaaac atttactgtt aatgagagag tgcacttctt 102361 gtgccttgat tcttgatggc taagtgtcta caaggtagat gggagcacca gcatttgcgc 102421 tacctctaat ccatcaagtt gggaggtagc aaaattggcc ggaaaatttt ccagtgtgtg 102481 ttctgtgttt ggagaactgt ctaattaggt accctcctgt agcccctgta tttttacgtt 102541 ataaagtata ggattatcaa tcttccttta aatcaattct caggttaaca taagatcaca 102601 atgaaggtgt tcattctgaa gtgaaggcat gggtgcaaaa gctttaaact catcaggttg 102661 atggtttaaa gctatgtgta attaaaagaa ctgtacaaat acctaccacg gtgtgccaga 102721 tggattttaa atctgccgta acacttaaat atttgtcaag ttggctacgt ttgaggtttt 102781 gcagacttga agcggcattg taacactttc ataatcttga atgctgtcat gactggtctc 102841 aacagacaaa aggaccgata agaccagtgg cattaaaata cagatgactc ttttggggtg 102901 gggtgcagga tacgtgcgga ttgctggggt taagcacaat atttgaagat taaatagtca 102961 caaagattag taacaattca tatcacgacc caagaaccta atttataaca tttttaaact 103021 ctgaattaca accataagat ttatatctcc tgtagcaaat gtattttgta ataatgcaac 103081 atgtagtaga accactgtct cctaagtgat ctacttaaaa catctcacat gttgctgtgt 103141 atttcagtgt ttccggaaca atacatcctg ttccccacta ctgaagatgc aagaatattg 103201 cacttttccc tttaggaggt accaataaca aaagctgagc tgagtgatca caacagccat 103261 ttttacaata ctcacagaga aggaaggagt aagagatacg ggcagtcttt ttcaacatcc 103321 agacacaagc aaacaaaatc ttagccagaa ctctctgtat tggaactact gtgtagcccc 103381 atcagctggg aaattgtcct tagggactga ctttaggggt agggaaagat agggcctata 103441 aaccaggtgc ttcaacctag gtgctggggt ggcaatctgc caagggcccg agggcaacac 103501 tcctttacat aaagtgaaaa agatggcaac cacacttctg ctgaatacag aggtggagca 103561 tagggtaatg caagtccaaa caagcttctt gctgggctgg ctccccagtc ttgcttagag 103621 aatgtgagct gcccttctgt gctgcgttca gcctgaccac ccctgctctg cactaataat 103681 gaataagctt cgtccctatc tttccctagc ctggcaggcc attgtaaaac tttatgcata 103741 agagtcacac tttaaatttt gtaccctaca cacttccact gaagcagaaa tacaaaagcc 103801 acaataatct caatagccag caggcattcc tctttaggga ctgtaaggtg gtggcttttc 103861 aaaaggtgga tagtagggtg gagagtaacg gggcggtgca cttgaggacc acatgtagct 103921 gggtgaggga gaggcagact ggggctcatc agaaagtcca gagggagggg aggatggaaa 103981 gacaccggaa tgctgtgtga taaaaagaaa acgttagttt aggcagtgag tttgcaattc 104041 cctgaacagt cattactatt taaaaatgaa tatgggccaa agcttttttt ctatttggaa 104101 gaacagagag ctcccaaacc tcacagagtt aatgagcctg gaacaggtgg ttctctcagt 104161 cactcaggct acaagccacc tgcagaaata gggaatcctc ctctgcaacc ccttgcagcc 104221 cacatccaat caattgccaa atcctgttgc ttctacttct gtgttgtctc ttgaatccat 104281 cactaccttt tttacattta taactcaagc ctccaatacc tctcccttaa gctatggcaa 104341 caatgtccca actagtcttc aaaaggtaat ttttggcttt gagaacattt aatgatatta 104401 agccagtgat ccctgggaaa gtctttgcta aaagagtata tcacagagag gtagtacagt 104461 ggtcaggagc ccctctcata gacaagaaac taaggtcaga gaggtgaaat agcttgctcg 104521 gatctggagt cttctgactc caagtccaac attttctact ccaccttaac tgctgccccc 104581 tcatccccaa gtgcagagta ggttggagag caaagagtaa caaaggcatg gaggtgagaa 104641 tgaggagcaa tatgtcaggt tgcagagaca ctgagtattc agttttactt tgtgttgtct 104701 tactttggga aaggaatgga aatgaatttt ttaaaatatg cagtgaattg gcaggagtca 104761 tttgcccctt tctctccctg gcccacgtat atccttgact ctagtgattt ccctgctgaa 104821 ggagaggtgg cctgttgtgt ttgttgctgt atcctccaga gctcagaaca gtacttgcca 104881 agtaataggt gctcaataaa tcttgaatga gtgaatacaa cttcaggagc atagtagatg 104941 aacttggcat gcttttaaaa aataaaatct aggccgggcg cggtggctaa cgcctgtaat 105001 cccagcactg taggaggctg aggtgggtgg agcacctgag gtcaggagtt tgagaccagc 105061 ctggccaaca tggcgaaacc ccatctctac taaaaataca aaaattagct gggagtggtg 105121 gtgggtgccc gtaatcccgg ctactcagga ggctgagtca cgagaatcgc ttgaacctgg 105181 gaggcagagg ttccagtgag ctcagatcac gccactgcac tccagcttgg gtgacagagt 105241 gagactccgt ctcgaaaata aataaataaa taaataaaac ctatgtccag atttctaagc 105301 cttgcttgag acacagaaaa atcttcctta ccacataatg ttaagactga gaagagatgc 105361 tagctgccat catcccctgc ctcccaaatc ccatccatta tttcactcca aagaaatggc 105421 agggtcgtag aacctggaga gcccttaaag gcaaaacaac ctgaggcagt ggactcaaga 105481 caatctagat tccttcttaa aaactcaatt ttcctcctcc ctccaagaag ataccctttg 105541 ttaaaaacat cggtatagtg atttcctgat tttccatctt gcacccttca aactctccct 105601 tcctcctcct ctccctccct cctgttctgt tgctttcctc aactaagaac tctctcctct 105661 taaccagaga ggggaacaaa gacccaccag ggcaaagaat ctctgtgcct gggcatatgg 105721 cactcactgc ctctctaaat cccactacag agatgacatg agaggcagca ggccagtttt 105781 aggtgttccc agccacagaa gggagaggca agtttcctga agtgaagcta aaggcagttc 105841 tgtcctgaga ggcattaaaa ggcttctgtg ggcagcagct gctgtgggtg gcagttgtac 105901 actttgataa taaaaggggg tggtacaacg gcccttgggt tttcacttct tagttttgtg 105961 tttgagaagt atggggggag ggggtcatga ttctcacttc cctttgagga tgctgggagt 106021 cttaaggata aagctgaagg gttcctggca gagctaggga gctaactggc tgcagtccac 106081 acacccttgg ggtttagcgg gggaaaaggc ccatagtacc agtgcatagt ttcctagcaa 106141 caggcaggca ccttgggaaa gatctggctt tgtcgcagcc agggagttgg cctggtgtat 106201 tctggttcct gcctgagagt gtgactttaa taatgctgtg ggcagctcag gagaacagaa 106261 ctttgaatcc ttgtgcacca ccaagcacga agagcatcta aatagcctac tgctgggcct 106321 agtcagctcc agcttcctga gcggcttctt agaggctggg gtggccaaca ggctgatggt 106381 gcttgaagag gagcaccacc ctttcaggga cacagatggc tttcctcttc caccttagcc 106441 tgcctcctgc ctgcctgatg gctgacccac acccttcaag atcaatgcgc atttccgtca 106501 gggcatgctt ctctccagga gccttgtggg acagtgactt cacaaagagc ccaggaactg 106561 gctggtgata attactagtt aaagacactg tgtcccaagg gtaaaatccc agcatctgtc 106621 ggtattgtgc attccctctc tgttctgccc ctggaagcaa aggctaggga gatggagaga 106681 agagtgggta aaagaaagag gaaaaaaaaa aggatgagtc aggaacaagg gaaaggaggg 106741 gacgaaagga aagacttgac agcggggtgg cgggcgggtg gcggggggaa gaaaaagttg 106801 agtgatagca acaacagcct tctaaggctc tcagcaacct gtgccataaa ctgtaccctc 106861 ctaggatcaa agcaatcctt ttaataaatg ggcccaattt ataatgagta acttttagtg 106921 gctagggtga aacctagtgt accctcgtga ttaaataaag aggaaaaata gaggcacttt 106981 tgaattgcat aggttgagac tttcgtttta agcttaaact tggctggtca ccaaatggat 107041 aggaaaggga ccattccatg gttagagtaa agtaggggta ctggcccagt atgccagact 107101 gggaaatatt gctgggaacc ttgaggggtt tataaatagc tgggtttaat ttcccagcag 107161 aagtgaaaac tgatgctttg taccctacag attttggact acttggctaa agctgaaggg 107221 gtggtgggga aatagtgcaa aggacaagag aaatgagtgt tggacagatg agcgaggagg 107281 caaaggtagt ctcagcttat atccttcctg ctttggagta tgaggagttc ctgagaagca 107341 aggacaccct ggagcccagt gtttgaaaga atccagtttc tctaagctca gcactcaaaa 107401 cttcctcagg gccagctgat cagacccaaa tccttgctca tgtctgacca caagaagtgt 107461 ttcccaagct tcctcactgg acagctaccc aataagggta tgactttgag gggttaaaac 107521 aaaacattat aagtcttttt tcctcatcct cctattagcc tcagggtcaa aaggaagttg 107581 aaatcaccaa acccaaagtc acagaaatag tcctacttgg aaataaccaa aatgggaagt 107641 cattcagaag aaagtctaga aaggtccatt ttctggctgt aacttaggtc attttacaat 107701 cagaagagca tgagggagct ctggcctttt gggggtgggt ctgtgcttca ggtggtttta 107761 ttgtcagatc tggagtgcat ttgctgactc ctttttattt atctgggaat ataagcagac 107821 gtttcctgcc ctcttgactt tccttttttt ttttttttta tacagcgctt ctctcttgtt 107881 gcccaggctg gagtgcaatg gcgtgatctc ggctcaccac aacctccacc tcctgggttc 107941 aagtgattct cctgcctcag cctcctgagt agctgtgatt acaggcatgc accaccatgc 108001 ctagctaatt ttgtattttt agtagagatg gggtttctcc gtgttggtca ggctggtctc 108061 gaactcccga cctcaggtga tcgcccgcct tggcctccca aaatgctggg attacaggcg 108121 tgagccaccg cgcctggctg actttcttgt gtaaagaatg gtattttcgc ctatgaaact 108181 ataagggaat gaaacttggt gagcaaaaga aagctaaact aaaatactaa aatgaaacca 108241 tttaagttct agagagaaat cacttaaaat cttaccccct caagacttag gctatacaat 108301 tataaagtct tcaaacagga aatactaaag acatcttgag cctagggaaa gaagtgggag 108361 acatataaaa agtgtccaag cagcccctga attcccctct gaggaatgca tgaaggaagg 108421 gaatcctttt tcagtgggga tcaagagtaa agagaccctt tctgaggacc atgagtccag 108481 tagaaagtat aaatatccag tagaaagtat aaatatccca aaattaactg gggaaagagc 108541 tgtgctccgc ttagattcac aacttcaaaa atcccccaac ccaatctctg gaatcagtca 108601 gccctagata taaatcctta ctctttcact taccacctct gtgactttgg tgaagtcact 108661 taatctctcc atgattcagt catcttatct taaaaatggc aattgcaaga gtacttctct 108721 taaagaggta aatgattgaa tgatgatata tgtaaagcac ttagcacaat gcttagtaca 108781 tagtaagtac tcagtgaata gtagctgtta ttaaaaagta agggtgggcc aggcacggtg 108841 gctcatgcct gtaatcccag cactttggga ggccgaggcg ggtgaatcac gaggtcagga 108901 gtttgagacc agcctggcca acatggtgaa accctgtctc tacaaaagat acaaaaaaaa 108961 aaaaaaaaat agccgggcat ggtggcacgc acctgtaatc ccagctactc gggaggctga 109021 ggcaggagaa tcgcttgacc ccaggaggtg gaggttgcag tgagccaaga ttgtgccatt 109081 gcactccagc ctgagcgaca gggcaagact ctgtctcaaa aaaaaaaaaa aaggagaggc 109141 ttctaacaat ccccaaagtc tctgctacaa gcccatagta taggtttgga ctaggtagct 109201 aaatctggat cccttttctg ggttaggtat gctgtttcca tagcattctg ggtaatgttt 109261 tctttacttt tttgtcccca ctactgtccc ctgacccccc acccccaagt ccctagacta 109321 acgcagggac tgtctcattt aatttctatt tccacagctc ctggctcaat gcgtggcaca 109381 tggaacacag agtaaacatt gattgaactg aaaccaacaa tccaagtcac cgttggcatt 109441 ctcaatcttg ctttcatttc tggggtagga tttgaatgag gaatgatggc tccataccca 109501 aaagagaggt ttcctatctt ataatttgtg gtaaccatca ccagatgtat tattgtccct 109561 acttaaaggg ttcattataa agtgcaaatt cctgaaatca atgtaattta ctttgatata 109621 aaagtagagg aggcctcttg ggctagaccc atgtgtggct cctctattta aggatgactt 109681 tttattatag ggaccccaag agaatgatga ccttccttgt tttagcttca acatccctgc 109741 attctcttga agaattcctt gttgcctcgg tcactattga aattatctat gcccttcttg 109801 aacatatttt tgttttccag tctataccac ctatttgggt aacaagttac ctgagttgtc 109861 tacctgtgta cacagtaggc tttagaacga attttatcat tatattttct aagctttaac 109921 ccatttatgc ctagtgtccc attattggaa cgctaagctt gtgggagtta tttatatcct 109981 cctgctcaag gtcatcgcca aggtctgatt tttcacaaaa aaatttgcaa cctctggcat 110041 caatgggtta atggatacct ttttgaacta gtactatatt ttataaaagt gaacaagtac 110101 atgtgcacac tatatgtatt attattgcat tatggagcca cagacctctc tgaacggtat 110161 ctaaaccaac cacaacaaca ctttattcaa cttaatcaga tttttaagtt tcagttattt 110221 tgccaaactg aagttgaaat taggttgccc aagggctttg tattgccaaa ctggctggtt 110281 agttggaagt acccccagat agttgtactt ttatgtcccc atggaacatt catggtctct 110341 tgatcaatag aagcaaattt attatgagtt tcccttcctt aaaaggtatt tcctttggtc 110401 tctaaagttt gaattaacta gattttagtg taaccttgct cttctatggt caagggttta 110461 taatcgttat acatacctaa aggaccattt cacttgtttg cccaaagtgt ctagttcagt 110521 gttctgcatg caatgaattc cttttaagta agtgcctttt ttgggtctgg ttttctacta 110581 gtagtcacag ttcaaaagga aaaggggaaa gtgcaaattt gttaacacta tttctgactt 110641 acgtgataac agggcaaaag ggggaggggt gttataaaaa tactctgtgt attgctctgt 110701 gtagtcctag ttaggaggcc tggagaaaaa gaatgggact tcttcctcat tacaggtttg 110761 attcaaaccc ttgtgtggct cagaatgccc ttgtgttaag gagcaggatt aggcctctcc 110821 ccatatcagg ggaatggaaa aagccactgt tgcacagggg tccatggtag tgtactaatg 110881 agctgccatt tcccaactag cttatcttct ccttggtgta gtttaaggca atttgggata 110941 ccttgatcct acctccctct ctcccttttt cttccatctc tctgtttctt ctcttgcgca 111001 gcttccaccc gcccatctct gccatcgttg cctgcctaca tagtccttaa gctggcaacc 111061 acaggctgct ggccattagg gagatgcatg gctacctcat acatatgcag acaatggaaa 111121 ggttccataa ttgaatttag ggagagaaca agagagcatg agtgagaaca agggtaatac 111181 agagttcttt tttctcccaa aacatacctg aaagtcataa gcagaatatg gtggcaggtg 111241 aggagggcta tggtagtagg tattgtagga tgccacttgg ggatgagcat agtaagaaac 111301 tgttggatgg gtattttcaa cagaacagtt cagtgctggg agagttgggt tctgtagaaa 111361 aacacaaatt agaaacaaca taaacagtca tcactaggag aatggttaaa taaacttagt 111421 acatctatac tatggaatac catgcagcta ttgataaaca atgatgtgga cctatttaga 111481 gaaggttgtc caggatatat ttttatgttt taaaagccat tatgttacat taatataaca 111541 tgatcccaat ttatgtttta aaaaaagacc cccaaaatta tacatgtatg tatgtttgta 111601 aatgcaaaga aattaacctg gaaggatgca tagcaaacta ttaactgtgg tttatctctg 111661 gggagcagag tagaattagg aggtatatag ggattcttaa ctttctacta tatatgtcaa 111721 aatgtttaca actaatttgt attgtgtcat tttccccaaa ctttttaact tgaaaaattt 111781 aaacctatag aaaagttgaa atcatttgtg tgatttttaa aaacaagaat taaaaagaaa 111841 aaataagact caaaatgcaa aacatcaaat agagaaggcc gaaccagcta gtgttcttgc 111901 tgtgacagag tgtgttcaat cacaaagaat ttgtaaaagt tgtaaggatt cttagagatc 111961 ccaaccttgc taagggggtc tcctgaagac atgaagggta ttgatttctc atcagccctg 112021 tagagtagag gcaaaatcct agcagtgcac agaagccatg aaagaccaag aaaaatatat 112081 tgatcccccc accttttttt tttttgcaaa taaaacgatg cagacataat tttatatcct 112141 tttcacaaca tggtggtgtg tgtagccctt caagggttaa agctaacctt gtccagaatg 112201 gaggccctgc tctcatcaga gccaaagcac tgctaaagga agttaccact actcattagt 112261 tcccaagtag gactgtgctt tgtgccacat ggatcaaggt gtggggtaaa tgatttttcc 112321 atcttctatg tcattgatac ctcccccatt catgtgtata attgagagct gtctgaaaaa 112381 tcaatgtgtc ttccttttgg aacatagcta cagcacatta acagagaagc agaagcttcc 112441 tctcaagtga tgaaaatgtg taggcaggct ttggcagtca caaaggcaac catgttaggg 112501 tgttggcagg gggaaactcc cttccagcaa gtttcaacac agttctgttg ctctcttcct 112561 tggagtgggt tactgtttac atctctagga ggtttgaaga gctacaaacc tcagtcataa 112621 tacagctcac agacagccag ataaatgttt gtcagcctct ttaatgtgtg aaagagggct 112681 gagatc // LOCUS HSAC002115 103574 bp DNA PRI 13-MAY-1997 DEFINITION Human DNA from overlapping chromosome 19 cosmids R31396, F25451, and R31076 containing COX6B and UPKA, genomic sequence, complete sequence. ACCESSION AC002115 NID g2098573 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 103574) AUTHORS Lamerdin,J.E., McCready,P.M., Adamson,A.W., Burkhart-Schultz,K., Garcia,E., Kyle,A., Ramirez,M., Stilwagen,S., Garnes,J., Danganan,L., Bruce,R., Quan,G., Montgomery,M., Ow,D., Kobayashi,A., Olsen,A.O. and Carrano,A.V. TITLE Sequence analysis of a 1 Mb region in human 19q13.1 JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 103574) AUTHORS Lamerdin,J.E. TITLE Direct Submission JOURNAL Submitted (13-MAY-1997) Human Genome Center, Lawrence Livermore National Laboratory, 7000 East Ave., Livermore, CA 94551, USA COMMENT R31396 from 1- 36,162; F25451 from 25,661-55,793; R31076 from 66,237- 103,574. FEATURES Location/Qualifiers source 1..103574 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="R31396-F25451-R31076" /chromosome="19" /map="19q13.1 between D19S208 and CAPNS" /map="Overlaps CH19F14121 to the left and CH19R28052 to the right" /cell_type="fibroblast" /map="orientation is centromere to telomere" /note="cosmid libraries constructed at LLNL from flow-sorted chromosomes from hybrids UV5HL9-5B and 5HL2-B, which carry chromosome 19 as their only human chromosome" repeat_region complement(187..466) /rpt_family="L1" misc_feature 719..842 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: good, score: 57.000" repeat_region complement(957..1296) /rpt_family="MER41" repeat_region 1289..1590 /rpt_family="ALU" repeat_region 1907..2112 /rpt_family="L1" repeat_region complement(2199..2280) /rpt_family="ALU" repeat_region complement(2310..2392) /rpt_family="ALU" repeat_region 2377..2663 /rpt_family="ALU" repeat_region 3065..3341 /rpt_family="ALU" repeat_region 3429..3705 /rpt_family="ALU" repeat_region <3755..4205 /note="BLASTX similarity to (283..429); match: 0.47, score: 5.7e-29; database searched: nr; hypothetical L1 protein (third intron of gene TS)- human >prf||1510254A L1 repetitive element ORF [Homo sapiens]" /rpt_family="L1" repeat_region complement(4223..4455) /rpt_family="ALU" repeat_region complement(4517..4584) /rpt_family="ALU" repeat_region complement(4822..4931) /rpt_family="MIR" repeat_region 5127..5417 /rpt_family="ALU" repeat_region complement(5730..6009) /rpt_family="ALU" repeat_region complement(7534..7834) /rpt_family="ALU" repeat_region complement(8629..8915) /rpt_family="ALU" misc_feature 9070..9183 /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: excellent, score: 94.000" misc_feature 10193..10276 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: good, score: 56.000" repeat_region 11501..11753 /rpt_family="ALU" repeat_region 11801..12092 /rpt_family="ALU" repeat_region complement(12570..12861) /rpt_family="ALU" repeat_region 12958..13233 /rpt_family="ALU" repeat_region complement(13377..13672) /rpt_family="ALU" repeat_region 13899..14179 /rpt_family="ALU" misc_feature 14234..14310 /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: good, score: 56.000" repeat_region 14660..14961 /rpt_family="ALU" repeat_region 14946..15122 /rpt_family="L1" repeat_region 15173..15445 /rpt_family="ALU" repeat_region 15584..15619 /rpt_family="ALU" repeat_region complement(15626..15915) /rpt_family="ALU" repeat_region complement(15950..16258) /rpt_family="ALU" repeat_region 16665..17493 /rpt_family="ALU" repeat_region 17507..18084 /rpt_family="ALU" repeat_region complement(18269..18375) /rpt_family="MER21" repeat_region 19017..19084 /rpt_family="ALU" repeat_region 19245..19346 /rpt_family="ALU" repeat_region complement(19351..19626) /rpt_family="ALU" repeat_region 19697..19833 /rpt_family="ALU" repeat_region 20425..20701 /rpt_family="ALU" misc_feature complement(20717..20970) /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: marginal, score: 42.000" repeat_region 21268..21858 /rpt_family="ALU" repeat_region 21922..22503 /rpt_family="ALU" misc_feature 22690..22803 /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: good, score: 74.000" repeat_region complement(22986..23280) /rpt_family="ALU" misc_feature 23377..23497 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: good, score: 57.000" repeat_region complement(23687..23971) /rpt_family="ALU" repeat_region 24001..24169 /rpt_family="MER21" repeat_region 24305..24583 /rpt_family="ALU" repeat_region 25093..25377 /rpt_family="ALU" repeat_region complement(25389..25668) /rpt_family="MER31" misc_feature 25459..25516 /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: excellent, score: 79.000" repeat_region complement(25784..26071) /rpt_family="ALU" repeat_region complement(26118..26558) /rpt_family="ALU" repeat_region 26645..27013 /rpt_family="LTR7" repeat_region 27474..27765 /rpt_family="ALU" repeat_region complement(28025..28395) /rpt_family="THE1" repeat_region complement(28597..28876) /rpt_family="ALU" repeat_region complement(29166..29667) /rpt_family="MER9" repeat_region complement(29979..30276) /rpt_family="ALU" misc_feature 30918..31090 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: excellent, score: 95.000" repeat_region 31106..31666 /rpt_family="ALU" misc_feature complement(31848..31968) /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: marginal, score: 45.000" misc_feature complement(32692..32734) /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: excellent, score: 94.000" repeat_region complement(32823..33089) /rpt_family="ALU" repeat_region 33296..33518 /rpt_family="MER1" repeat_region 33532..33829 /rpt_family="ALU" repeat_region 33885..33983 /rpt_family="MER1" misc_feature complement(34062..34170) /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: excellent, score: 97.000" repeat_region 34376..34477 /rpt_family="ALU" repeat_region 35483..35637 /rpt_family="ALU" misc_feature 35917..35949 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: good, score: 66.000" misc_feature 36009..36266 /note="BLASTN similarity to H61649 (1..258); match: 1, score: 1.0e-102; database searched: est; yr23g02.r1 Homo sapiens cDNA clone 206162 5' similar to contains Alu repetitive element" repeat_region 36286..36646 /rpt_family="ALU" misc_feature 37091..37196 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: excellent, score: 100.000" misc_feature 37276..37435 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: excellent, score: 99.000" misc_feature 37912..37980 /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: marginal, score: 49.000" repeat_region 38093..38379 /rpt_family="ALU" repeat_region 38338..38529 /rpt_family="MIR" repeat_region complement(38555..38859) /rpt_family="ALU" misc_feature 39173..39242 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: excellent, score: 100.000" misc_feature 39336..39418 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: excellent, score: 96.000" misc_feature 40120..40244 /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: excellent, score: 100.000" misc_feature 40196..40244 /note="BLASTN similarity to R59127 (1..49); match: 1, score: 7.7e-133; database searched: est; yg96b12.r1 Homo sapiens cDNA clone 41238 5'." misc_feature 40197..40244 /note="BLASTN similarity to R20299 (1..48); match: 1, score: 2.2e-129; database searched: est; yg20g08.r1 Homo sapiens cDNA clone 32820 5'." misc_feature 40375..40514 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: good, score: 72.000" misc_feature 40447..40517 /note="BLASTN similarity to R20299 (45..115); match: 0.98, score: 2.2e-129; database searched: est; yg20g08.r1 Homo sapiens cDNA clone 32820 5'." misc_feature 40447..40517 /note="BLASTN similarity to R59127 (46..116); match: 0.98, score: 7.7e-133; database searched: est; yg96b12.r1 Homo sapiens cDNA clone 41238 5'." misc_feature 40603..40755 /note="BLASTN similarity to R59127 (99..251); match: 0.92, score: 7.7e-133; database searched: est; yg96b12.r1 Homo sapiens cDNA clone 41238 5'." misc_feature 40603..40755 /note="BLASTN similarity to R20299 (98..250); match: 0.92, score: 2.2e-129; database searched: est; yg20g08.r1 Homo sapiens cDNA clone 32820 5'." misc_feature 40618..40748 /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: excellent, score: 100.000" misc_feature 40934..41032 /note="BLASTN similarity to R20299 (242..340); match: 0.96, score: 2.2e-129; database searched: est; yg20g08.r1 Homo sapiens cDNA clone 32820 5'." misc_feature 40934..41065 /note="BLASTN similarity to R59127 (243..374); match: 0.93, score: 7.7e-133; database searched: est; yg96b12.r1 Homo sapiens cDNA clone 41238 5'." misc_feature 40936..41098 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: excellent, score: 100.000" misc_feature 41033..41078 /note="BLASTN similarity to R20299 (345..390); match: 0.71, score: 2.2e-129; database searched: est; yg20g08.r1 Homo sapiens cDNA clone 32820 5'." repeat_region 41203..41327 /rpt_family="ALU" misc_feature 41473..41553 /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: excellent, score: 96.000" misc_feature 41662..41807 /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: excellent, score: 100.000" misc_feature 42061..42172 /note="predicted exon, program: grail2exons_human_1.3, frame: 1, quality: excellent, score: 100.000" misc_feature 42244..42376 /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: excellent, score: 88.000" repeat_region complement(43398..43617) /rpt_family="ALU" misc_feature 44650..44782 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: excellent, score: 100.000" misc_feature 44709..44789 /note="BLASTN similarity to R48878 (1..81); match: 0.97, score: 4.4e-135; database searched: est; yj69b08.r1 Homo sapiens cDNA clone 153975 5'." misc_feature 44922..45017 /note="BLASTN similarity to R48878 (72..167); match: 0.97, score: 4.4e-135; database searched: est; yj69b08.r1 Homo sapiens cDNA clone 153975 5'." misc_feature 44925..45032 /note="predicted exon, program: grail2exons_human_1.3, frame: 0, quality: good, score: 60.000" misc_feature 45017..45132 /note="BLASTN similarity to R48878 (165..280); match: 0.98, score: 4.4e-135; database searched: est; yj69b08.r1 Homo sapiens cDNA clone 153975 5'." misc_feature 45070..45128 /note="BLASTN similarity to T83250 (1..59); match: 1, score: 5.0e-106; database searched: est; yd40d12.r1 Homo sapiens cDNA clone 110711 5'." misc_feature 45129..45295 /note="BLASTN similarity to T83250 (59..225); match: 1, score: 5.0e-106; database searched: est; yd40d12.r1 Homo sapiens cDNA clone 110711 5'." misc_feature 45294..45369 /note="BLASTN similarity to T83250 (225..300); match: 0.94, score: 5.0e-106; database searched: est; yd40d12.r1 Homo sapiens cDNA clone 110711 5'." misc_feature complement(45307..45343) /note="BLASTN similarity to H44410 (302..338); match: 0.94, score: 2.8e-104; database searched: est; yo74d04.s1 Homo sapiens cDNA clone 183655 3'." misc_feature complement(45315..45335) /note="BLASTN similarity to R43640 (347..367); match: 1, score: 7.8e-93; database searched: est; yg20g08.s1 Homo sapiens cDNA clone 32820 3'." misc_feature complement(45315..45335) /note="BLASTN similarity to R59128 (340..360); match: 1, score: 9.0e-122; database searched: est; yg96b12.s1 Homo sapiens cDNA clone 41238 3'." misc_feature complement(45343..45659) /note="BLASTN similarity to T24109 (3..319); match: 1, score: 6.4e-124; database searched: est; seq2297 Homo sapiens cDNA clone Cot250Ft-b4HB3MA-22 3'." misc_feature complement(45359..45408) /note="BLASTN similarity to R59128 (261..310); match: 0.98, score: 9.0e-122; database searched: est; yg96b12.s1 Homo sapiens cDNA clone 41238 3'." misc_feature complement(45359..45408) /note="BLASTN similarity to H44410 (233..282); match: 0.96, score: 2.8e-104; database searched: est; yo74d04.s1 Homo sapiens cDNA clone 183655 3'." misc_feature complement(45385..45408) /note="BLASTN similarity to R43640 (265..288); match: 0.91, score: 7.8e-93; database searched: est; yg20g08.s1 Homo sapiens cDNA clone 32820 3'." misc_feature complement(45392..45658) /note="BLASTN similarity to Z40323 (1..267); match: 0.98, score: 9.1e-101; database searched: est; H. sapiens partial cDNA sequence" misc_feature complement(45397..45658) /note="BLASTN similarity to F04518 (1..262); match: 0.99, score: 3.6e-100; database searched: est; H. sapiens partial cDNA sequence" misc_feature complement(45408..45659) /note="BLASTN similarity to R59128 (9..260); match: 1, score: 9.0e-122; database searched: est; yg96b12.s1 Homo sapiens cDNA clone 41238 3'." misc_feature complement(45422..45481) /note="BLASTN similarity to H44410 (158..217); match: 1, score: 2.8e-104; database searched: est; yo74d04.s1 Homo sapiens cDNA clone 183655 3'." misc_feature complement(45428..45481) /note="BLASTN similarity to R43640 (188..241); match: 0.94, score: 7.8e-93; database searched: est; yg20g08.s1 Homo sapiens cDNA clone 32820 3'." misc_feature complement(45479..45659) /note="BLASTN similarity to R43640 (9..189); match: 1, score: 7.8e-93; database searched: est; yg20g08.s1 Homo sapiens cDNA clone 32820 3'." misc_feature complement(45479..45637) /note="BLASTN similarity to H44410 (1..159); match: 1, score: 2.8e-104; database searched: est; yo74d04.s1 Homo sapiens cDNA clone 183655 3'." repeat_region complement(46154..46448) /rpt_family="ALU" repeat_region complement(46516..46784) /rpt_family="ALU" misc_feature 47026..47190 /note="BLASTN similarity to D20017 (148..312); match: 0.96, score: 1.0e-101; database searched: est; Human HL60 3'directed MboI cDNA, HUMGS00988, clone pm1728." misc_feature 47177..47264 /note="BLASTN similarity to D20017 (300..387); match: 0.97, score: 1.0e-101; database searched: est; Human HL60 3'directed MboI cDNA, HUMGS00988, clone pm1728." repeat_region 47254..47699 /rpt_family=">MER42" misc_feature 47276..47297 /note="BLASTN similarity to D20017 (401..422); match: 0.95, score: 1.0e-101; database searched: est; Human HL60 3'directed MboI cDNA, HUMGS00988, clone pm1728." misc_feature 47318..47354 /note="BLASTN similarity to D20017 (446..482); match: 0.86, score: 1.0e-101; database searched: est; Human HL60 3'directed MboI cDNA, HUMGS00988, clone pm1728." misc_feature 47577..47682 /note="predicted exon, program: grail2exons_human_1.3, frame: 2, quality: excellent, score: 81.000" repeat_region complement(47894..48182) /rpt_family="ALU" repeat_region complement(48828..49110) /rpt_family="ALU" repeat_region complement(49341..49624) /rpt_family="ALU" repeat_region 49673..49739 /rpt_family="ALU" repeat_region 49860..50148 /rpt_family="ALU" repeat_region 50242..50348 /rpt_family="LTR7" mRNA join(51108..51330,53189..53245,53380..53454,54985..55050, 55126..55301,55736..56068,56305..56422,59207..59401, 59491..59720) /note="virtual mRNA encoding ORF similar to ssRNA binding proteins, and yeast polyadenylate binding protein; assembled by splicing the following human ESTs: R06347, T46973, R37738, D19646, R43630, R06289, T15852, R44171; mouse ESTs: AA238796, AA240377, AA238796, AA241898; and rat ESTs: H32865, H34929, H33621" /product="F25451_2" repeat_region 52093..52376 /rpt_family="ALU" repeat_region 52549..52822 /rpt_family="ALU" CDS join(53436..53454,54985..55050,55126..55301,55736..56068, 56305..56422,59207..59401,59491..59603) /note="hypothetical 36.5 kDa protein most similar to ssRNA binding proteins; BLASTX similarity to (Y07952) ssRNA-binding protein [Dictyostelium discoideum] (52%) within RNP domains; and to (Z70043) hypothetical 24.4 kD protein C22E12.02 in chromosome I [Schizosaccharomyces pombe]" /codon_start=1 /product="F25451_2" /db_xref="PID:g2098575" /translation="MFLRRAAVAPQRAPILRPAFVPHVLQRADSALSSAAAGPRPMAL RPPHQALVGPPLPGPPGPPMMLPPMARAPGPPLGSMAALRPPLEEPAAPRELGLGLGL GLKEKEEAVVAAAAGLEEASAAVAVGAGGAPAGPAVIGPSLPLALAMPLPEPEPLPLP LEVVRGLLPPLRIPELLSLRPRPRPPRPEPPPGLMALEVPEPLGEDKKKGKPEKLKRC IRTAAGSSWEDPSLLEWDADDFRIFCGDLGNEVNDDILARAFSRFPSFLKAKVIRDKR TGKTKGYGFVSFKDPSDYVRAMREMNGKYVGSRPIKLRKSMWKDRNLDVVRKKQKEKK KLGLR" repeat_region complement(56706..56861) /rpt_family="ALU" repeat_region 57091..57380 /rpt_family="ALU" repeat_region complement(57566..57864) /rpt_family="ALU" repeat_region complement(58208..58497) /rpt_family="ALU" repeat_region 58908..58997 /rpt_family="MIR" repeat_region 60688..60975 /rpt_family="ALU" repeat_region 62376..62788 /rpt_family="ALU" repeat_region complement(62945..63248) /rpt_family="ALU" repeat_region complement(64179..64416) /rpt_family="MSR1" CDS join(64510..64579,64667..64747,65014..65094,65323..65802, 66284..66396,66701..66901) /note="hypothetical 36.6 kDa protein most similar to ets- related proteins; Most similar to mouse ETS-related protein 71; high conservation within ETS- DNA binding domains and at NH2-terminus" /codon_start=1 /product="F25451_3" /db_xref="PID:g2098576" /translation="MDLWNWDEASPQEVPPGNKLAGLGAKLGFCFPDLALQGDTPTAT AETCWKGTSSSLASFPQLDWGSALLHPEVPWGAEPDSQALPWSGDWTDMACTAWDSWS GASQTLGPAPLGPGPIPAAGSEGAAGQNCVPVAGEATSWSRAQAAGSNTSWDCSVGPD GDTYWGSGLGGEPRTDCTISWGGPAGPDCTTSWNPGLHAGGTTSLKRYQSSALTVCSE PSPQSDRASLARCPKTNHRGPIQLWQFLLELLHDGARSSCIRWTGNSREFQLCDPKEV ARLWGERKRKPGMNYEKLSRGLRYYYRRDIVRKSGGRKYTYRFGGRVPSLAYPDCAGG GRGAETQ" repeat_region complement(64756..64924) /rpt_family="MSR1" repeat_region 68042..68162 /rpt_family="ALU" repeat_region complement(68288..68566) /rpt_family="ALU" repeat_region 69019..69618 /rpt_family="ALU" repeat_region 69714..69995 /rpt_family="ALU" misc_feature 70359..70453 /note="BLASTN similarity to H25656 (1..95); match: 1, score: 1.0e-29; database searched: est; yl54a01.r1 Homo sapiens cDNA clone 162024 5' similar to gb:X13923 CYTOCHROME C OXIDASE POLYPEPTIDE VIB (HUMAN)" misc_feature 70359..70453 /note="BLASTN similarity to H45127 (1..95); match: 1, score: 9.8e-30; database searched: est; yo66e07.r1 Homo sapiens cDNA clone 182916 5' similar to gb:X13923 CYTOCHROME C OXIDASE POLYPEPTIDE VIB (HUMAN)" misc_feature 70359..70453 /note="BLASTN similarity to H44727 (1..95); match: 1, score: 9.4e-30; database searched: est; yp24f04.r1 Homo sapiens cDNA clone 188383 5' similar to gb:X13923 CYTOCHROME C OXIDASE POLYPEPTIDE VIB (HUMAN)" repeat_region complement(70626..70712) /rpt_family="MIR" repeat_region complement(71365..71949) /rpt_family="ALU" repeat_region complement(72055..72362) /rpt_family="ALU" repeat_region 72479..72768 /rpt_family="ALU" mRNA 73274..80914 /gene="COX6B" /note="BLASTN similarities to human ESTS: H19677, AA248670, AA230258, AA248033, D53765, T60622, R10623, T57953, and mouse EST: AA244369" /product="COX6B" gene 73274..80914 /note="cytochrome oxidase 6B" /gene="COX6B" CDS join(73293..73398,76620..76720,80643..80696) /gene="COX6B" /note="human cytochrome oxidase subunit VIb" /codon_start=1 /product="COXG" /db_xref="PID:g2098574" /translation="MAEDMETKIKNYKTAPFDSRFPNQNQTRNCWQNYLDFHRCQKAM TAKGGDISVCEWYQRVYQSLCPTSWVTDWDEQRAEGTFPGKI" repeat_region complement(73624..73904) /rpt_family="ALU" repeat_region complement(74082..74374) /rpt_family="ALU" repeat_region complement(74485..74822) /rpt_family="ALU" repeat_region complement(74870..76053) /rpt_family="ALU" repeat_region 76301..76571 /rpt_family="ALU" repeat_region complement(77068..77376) /rpt_family="ALU" repeat_region complement(77938..78153) /rpt_family="ALU" repeat_region complement(78530..78819) /rpt_family="ALU" repeat_region complement(78839..78991) /rpt_family="MIR" repeat_region complement(79043..79565) /rpt_family="MER1" repeat_region complement(79703..80113) /rpt_family="ALU" repeat_region complement(80954..81265) /rpt_family="ALU" repeat_region complement(81463..81578) /rpt_family="ALU" repeat_region 81634..81936 /rpt_family="ALU" repeat_region 82211..82524 /rpt_family="ALU" repeat_region 82653..82947 /rpt_family="ALU" repeat_region complement(83055..83341) /rpt_family="ALU" repeat_region complement(83825..84106) /rpt_family="ALU" repeat_region complement(84234..84532) /rpt_family="ALU" repeat_region complement(84587..84881) /rpt_family="ALU" repeat_region complement(85330..85648) /rpt_family="ALU" repeat_region 85719..85807 /rpt_family="ALU" repeat_region 86518..86695 /rpt_family="ALU" repeat_region complement(86807..87059) /rpt_family="ALU" repeat_region complement(87955..88212) /rpt_family="ALU" CDS join(88862..88945,90503..90703,95281..95355,95487..95594, 97889..98068,99861..99944,100024..100068) /note="human homolog of bovine uroplakin 1A; Predicted human homolog of bovine uroplakin IA (P38572);two proteins are 93% identical, 98% similar. Transmembrane glycoprotein expressed in bladder epithelium" /codon_start=1 /product="UPKA" /db_xref="PID:g2098577" /translation="MASAAAAEAEKGSPVVVGLLVVGNIIILLSGLSLFAETIWVTAD QYRVYPLMGVSGKDDVFAGAWIAIFCGFSFFMVASFGVGAALCRRRSMVLTYLVLMLI VYIFECASCITSYTHRDYMVSNPSLITKQMLTFYSADTDQGQELTRLWDRVMIEQECC GTSGPMDWVNFTSAFRAATPEVVFPWPPLCCRRTGNFIPLNEEGCRLGHMDYLFTKGC FEHIGHAIDSYTWGISWFGFAILMWTLPVMLIAMYFYTML" repeat_region 89067..89153 /rpt_family="MIR" repeat_region complement(89391..90126) /rpt_family="ALU" repeat_region complement(90899..91011) /rpt_family="ALU" repeat_region complement(91147..91345) /rpt_family="MIR" repeat_region 91443..91727 /rpt_family="ALU" repeat_region complement(92811..93086) /rpt_family="ALU" repeat_region 93118..93450 /rpt_family="ALU" repeat_region complement(93508..93810) /rpt_family="ALU" repeat_region complement(94350..94598) /rpt_family="ALU" repeat_region complement(94648..95053) /rpt_family="ALU" misc_feature 95193..95509 /note="BLASTN similarity (1..317); match: 0.99, score: 2.2e-126; database searched: CpG; bases 69 to 385 (SL to QR)" repeat_region complement(95784..95865) /rpt_family="MIR" repeat_region complement(95961..96248) /rpt_family="ALU" repeat_region complement(96296..96576) /rpt_family="ALU" repeat_region complement(96609..96701) /rpt_family="MIR" repeat_region 96820..97108 /rpt_family="ALU" repeat_region complement(97337..97628) /rpt_family="ALU" repeat_region complement(98245..98532) /rpt_family="ALU" repeat_region 98666..99595 /rpt_family="ALU" repeat_region 100595..100880 /rpt_family="ALU" repeat_region 101415..101695 /rpt_family="ALU" repeat_region 101992..102711 /rpt_family="ALU" repeat_region complement(103015..103295) /rpt_family="ALU" repeat_region complement(103441..103574) /rpt_family="ALU" BASE COUNT 25789 a 26515 c 25620 g 25650 t ORIGIN 1 gatctcacat ccagattgaa tataagtata atgggcagat ttatcactat taggaggcta 61 ctgtacatta ggcccaggct ggcccctgag ggccacatgg taactaaggc agacaagacc 121 ataactcaga tagaataata gtctccaatc ccatccaggt tgctgtgaat gccattaatt 181 cattcctttt catggctgag tagtattcca ccatatatat atgtaaaatc agtttcttta 241 tccactcatt cattgatggg catctgggtt ggctccacat ttttgcaatt gcaaattgtg 301 ctgctataaa catgtgtgta caagtatctt tttcacataa tgacttcttt tcctttgagt 361 agatacccag tagtgggatt gctggatcaa atggtagttc tacttttagt tctttaagga 421 atctccacac tgttttccat agttatacta gtttacattc ccaccagcaa tcagggaaat 481 gcaaatcaaa accacaatgc gataccacct tattcctgca agaatggcca taatcaaaaa 541 ctcaaaaaat aatagatatt ggcatggatg gagtgaacag ggaacacttt ccttgagttt 601 taactccaga ctggctcctt tgctgttttt ctggtggggc tcttaccaca tttcccacag 661 ccattggctg tgtgctcctg catacttgga tgctcctctc ctgcctgggc cccaccagga 721 cccctgagat ccaggagccg tggcttttgg gggctggtag agaagaaaca gggagctacg 781 tgaaggccaa tggcagaaac caagagagtt tcaagaaagc aagaaagaga aaccagctct 841 gaaaatagca aaaggaacaa cgggaaaaaa taaaatagtt tcatgtaacc taaaaacaga 901 cattgagtct tcgaatcaaa ttgaaagcac ttgctagcgt cagaggcatg tgaaccagag 961 caactccatc ttgaatgggg ctgagtaaaa tgaggctgac gtctactggg ctgcattccc 1021 agacagctaa ggcattctaa gtcacacgat gagataggag gttggcacaa aatacagatc 1081 ataaagactt tgctgataaa acaggttaca gtaaagaagc cagctaaaac ccaccaaaaa 1141 caaaaagtgg cgatgagagt aacctgtggt catccccact gctcattata cactaattag 1201 aatgcattag catgctaaga gacactccca ccagcgccat ggcagcttac aaatgccatg 1261 gcaacatcag gaagttaccc tatatggtct aaaaagagaa gacatgaggc caggcacggt 1321 ggctcacgcc tgtaatccca gcactttgga agaccaaggc aggcagatag ggtcacctga 1381 ggtcaggaat tccagaccag cctgggcaac atggtgaaac cctgtctcta ctaaaaacag 1441 gaaaaattaa ctgggcatgg tggcacgcac ctgtaatccc agctactcgg gaggctgagg 1501 caggagaatc tcttaaaccc gggaagcaga ggttgcagtg agccaagatt gtgccactgc 1561 actccagcct aggcaacaga gcgagactcc atctcgaaaa tttaaaaata aataaataaa 1621 taaaggggaa tatcacaagc aacgtatggc aacaaattag acagcttaga tgacagctaa 1681 aaattcttag atgaattacc aaaaccaact caagaagaaa tagaaaatct gaatagatgt 1741 ataacaaata aaaagagatt tgtaatttaa aatactcccg caaagaaaag ctcaggccta 1801 gatggtttca ttggtgaatt tcacaaaaca ttaaagcata aataatatta attctctaca 1861 atctgttcca aaatatggag aaagggagca ctttcccagc ttattatatg aggtcagtat 1921 taccctaata tcaaagccag acagtggcat ctcaagacaa ctagagacca atctctttca 1981 taaatatgat ataaaagtca ttggtaaaat atcagtaagc ccaatacagc aatatataaa 2041 aagggttata caccatgatc aagtgggatt taccccagaa atgcaagatt ggtttaactt 2101 ctgaaaatca attaatgtaa cacaccatag taaaagaata aattttattc atttatttta 2161 aagatggggt ctcactctgt cacccaggtg agattgcagc tcactgcagc ctcaaactcc 2221 tgggctcaag caatcctcct gccttggcct cccagagtgt tgggattaca gatgtgagcc 2281 gttgcacctg cccctgtaaa agaagacact tttttttttt ttgagatgga gtctcactct 2341 gtcgcccagg ctggagtgaa agactaattt ccgggtgggc gcggtggctc acgcctgtaa 2401 tcccagcact ttgggaggcc aaggcaggtg gatcacgagg tcaggagatc gagaccatcc 2461 tggctaacac ggtgaaaccc cgtctctact aaaaaataca aaaaaatcag ccaggcgtgg 2521 tggcgggtgc ctgtagtccc agctactcgg gaggctgagg caggagaatg gcatgaaccc 2581 aggaggtgga gcttgcagtg agccgagatc atgccactgc actccagcct gggcgacaga 2641 gcaagactct gtctcaaaaa aaataaataa ataaaagaaa agaaagacta aattcttcct 2701 ggctaagata gggaacaaaa caagaatgtt cattctcacc acttttattc aacagtatag 2761 tggaggtcct agccagtaca attagtcaat aaaaagaaat aaaaggcatc tgggttggaa 2821 aggaagaaga aaaaactgtt ttgacccaca gttgacttaa atttgcttgt agaaaactct 2881 aatgaatcca caaataacta ctagaagaaa tgagtccagc aatgctgcac aagatcaata 2941 tccaaaaatc aattgtatgt caataagcta ccaatgaaca atcaaaaaat gaaattaata 3001 aaatgaattc attcccaata ggatcaaaaa gaataataaa ttcaggctgg gcgtggtgac 3061 ttatgcctgt aatcccagca ctttgttagg ccgaggcggg cagatcatct gaagtcagga 3121 cttcaagacg agtttgggca acatggtgaa accctgtctc tactaaaaaa aaatacaaaa 3181 atcagctggg catggtggcg ggtgcctgta atcctagcta cttgggaggc tgaggcagga 3241 gaattgcttg aacctgggaa gtgaaggttg cagtgagcca cgatcgcacc actgcactcc 3301 agcttgggta acagagagag gctatctcaa aaaaaaaaaa agaagaaatt caattaaaaa 3361 aagtaataaa tttaacaaaa atacaagagt tgtacaataa aaactataaa tcttggctgg 3421 cggcgtgatg gctcacgcct gtaatcctag cactttggga ggccgaggtg ggcggatcat 3481 gaggtcagga gatcgagacc atcctggcta acacggtgaa accccgtctc tactaaaaat 3541 acaaaaaaat tagccgggcg tagtggcagg cgcctgtagt cccagctact cgggaggctg 3601 aggcaggaga atggcatgaa cctgggaggc ggagcttgca gtgagccgag atcacgccgc 3661 tgcactccag cctgggcaat agagtgagac tctgcctcaa aaaaacaaac aaacaagcaa 3721 acaaacaaac tataaatcat tcctttaaaa atttaaagat ctaaataaat ggagtggcat 3781 cccatgttca tggtttggaa aactcaatgt tagcaaaatg gcagtactcc cctaattgac 3841 ctaaagagtc aatgcaatct ttatcgaaac ctcaactggc ttgttttttt ttttttgcag 3901 aatttgacaa gttgatccta aaagtcatgt ggaaatgaaa aggatgcaga ataacctagc 3961 aatgttgaaa aagtaaaata aagttggagg acttccagtt accaatttca aaatttattc 4021 ctgattacaa agctatagga ataaagacag tgtggtactg gcataaggat ggacgtatag 4081 ctcaatgaag caaaattgaa agtccagaaa taactttcat atttatggtc aagtgatttt 4141 tgacaaaagt ggcaaaacca ttcaatgtga aaaggagatc caacaaatgg tgctagaaca 4201 attggatctc tctctctctc tctttttttt tttttttctt tttggagatg gagcctcgct 4261 ctgtcaccca ggctggagtg cagtggcatg atctcagctc actgcaacct ctgcctcccg 4321 ggctcaagca attctcctgc ctcagcatcc caagtagctt ggggctacag gtaaccacca 4381 ccaagcccag ctaatttttg tatttttagt agagacaggt ttcaccatgt tggccaggcc 4441 aatcttgaac tcctggccat tattttcaaa tgattctcac atgctctgaa ccccactcat 4501 tcatcattat tatctctttt tttttcttga gacaaggtct ccctcttgtc caggctggag 4561 tagaggggtg cgaccatggc tcaccacagc ctcaaattcc tgagcaaaag caaacctccc 4621 atcacaccct cccaagcagc tgagaataca gaagcctgcc ctcagagcct gatatatata 4681 tatatctgtc taaaatatat acattttttt tctagataca tggtcccact gtattgatcg 4741 ggctaggttt cactatcctg gtctcatgtg atcctctccc cttagccatt atcaccttat 4801 agagtgcaca ttttgttcca tgcactgttc aataccactt acatacatta atttaatcct 4861 cacaatgaaa caagagagaa ataattatta ttctcatttt aaagatgaga aaactggtac 4921 acagagaggt tgcacaaggt gatattgcta gttagtggca tggtggggat ccaaacctag 4981 gtagtctagt tccaaggtgt gtttattcag cccatgttta ctgagcacct actactatgt 5041 gttgggctct gctctaggtg ttagagctat agacagtata aaagaactac ctcgccccca 5101 ttaagacatc tctcctggct gggcatggtg gctcattcct gtaatcctag cactttggga 5161 ggccaaggtg ggtggatcac ttgaggtcag gagttcaaca ccagcctggc caacgtggtg 5221 aaaccccata tctactaaaa atacaaaaat aagccaagca tggtggtggg cacctgtaat 5281 cccagctact cgggaggctc aggcaggaaa atcacttaga acccaggagg tggaggtttc 5341 agtgagccaa ggttgcacca ttgcactccg ccctgggcaa caagagcaaa actccatctc 5401 aagaaaaaaa aaaaaaaaag acatctctcc catatctctc tcttattgag acatggtctc 5461 tctcaccttt cccaccaaaa atttttttta aaaaaacaaa accctttcat cactcagccc 5521 cctcaagtta tggatctggt ttttggtttt taattccaga gacaaaaaag ggggcagaac 5581 atagacccat gctaccatct tgacagagtc tccatttcag ttccatacgc tcaagtccct 5641 cctatcttta aaacaaacaa gcaaaatcat cattctcacc cttttgcatc taccctagtc 5701 ctgcctttcc tggctcttca taggcaaact tttttttttt tgaaacggag tctcactctg 5761 tcacccaggc tggagtgcag tgacacaatc ttggttcact gcaaactcca cctcccgggt 5821 tcaagcaatt ttctgcctca gcttcccaag tagctgggat tacaggcacc caccaccaca 5881 tctggctaaa ttttgtattt ttagtagaga cagggtttca ccatcttggc caggcttgtc 5941 ttgaactcct gacctcgtga tccacccacc ttggcctccc aaagtgctgg gattacaggc 6001 atgagccact gcacccagcc ataggcaaaa ttcttgaagc aaatgtatac actgtctcca 6061 attccacacc tcacactcaa tccatggctc tctgaaacca ggtgttggct tacaacattc 6121 cagaaactgg aacccatgtt cttctcagct gtggcactgc tctccttact ctttcgcctc 6181 caagctcttg gaagaaatgt ctacattgtc tatactaccc ctttcttgct cactgcctgg 6241 atccctgcaa tatggccact gcttctgctg atcctctgat actgccctta ccaaaggagc 6301 taattcattc ccgccttcag agctggagta catgtctcag ggcttgacac acttgtccat 6361 gtacttgttt ctaagcctgt cttagtttgt ttgttccact atccaaggga cttgccagca 6421 accaatacac atgcttcatc tcacccaacc ttcacttctg tcttccccca cctcggcgcc 6481 ccatgttctg ttgtcctgga acatccactt ggtcctttga aaggctctga atcaatttcc 6541 tccatgaaga atatcctgac ttctggattg catgtgttgt cactgttctt ctaactactt 6601 ctaggtccag gactggtctt aaatacccag gttgtgtctt ggactagcca tactaaggtc 6661 tacatttgtg gactgcacag tgtcacctcc cacccacccc caacttcctg aaaagtggct 6721 cctcaaaccc tctactcccc tagaaccacc agctaaagga agcatcagct aaaaccccac 6781 agtattagca ttcattaaaa gctcctacct gaatcagcaa ttactatggt atttgaaaat 6841 gttctctaat ggtgcataac aaattcccca caggtattaa ctcacagttc tgtaggtcag 6901 aatggctgct tctctgctca gaatctacca aagctaaaat taaggtgtca gctgggctgg 6961 gttctcatgt ggagctcagg atcctcttcc aaatgaatgt gattacagca ggattcacat 7021 ccttgctgtt gcagtactaa ggaccccgtt tccttgctgg ctgtcagctg ggagccattc 7081 tcagccagag gctgcctgga ttcctcccta cgtggccctg gtgacatctt catgccagca 7141 atggagacct ttgcttgcct ccaatcctct cacacctgaa acctcttttg ccaggaagac 7201 tcagtccttt tcaagaactc acctcattag gtgaggcccc cagaggatat tcttcctaac 7261 ttaaagttaa ctgatttcag accctaccta cgtctgcaag acgccttccc agcagcatac 7321 agactagtac ttgactgagt gactggagaa ggagtgtgta caccagggag caggaatctt 7381 cgggtcatct cagagttctg cctaccacat tggggatttc ctgttccatc gttccattag 7441 gttgtgttct actttaagaa agatcttccc ccttttcccc ttctttctct attctttcct 7501 tttttctttc tgtcagcata aaccaaggaa ttcttttttt tttttttttt tgtcattttt 7561 tcttgagatg gagtctcact ctgtcaccca ggctggagtg tgcagtgtga tctcgaatca 7621 ctgcaacctc cgcctcccgg gttcaaacga ttctcctgcc tcatcctccc tagtagctgg 7681 gattacgggc gcctgccacc acacccggtt aatttttgta tttttagtag agacggggtt 7741 tcaccacgtt ggtcaggcta gtctcgaact cctgatcgca ggtaatccgc ccgctttggc 7801 ctcccaaagt gttgggatta caggcatgag ccactgcgcc cagccggaat tcatttttaa 7861 ttcaagactt tacaatccac ttctgacact actcattttc ctcccctcag taaatctctt 7921 actcttactg ggcgtgtcta gtttctctct tccacaactc ctgggagacc ctcctgacat 7981 cgcccatccg cgcaggctgc agagagagat cagtgggagg aggaggagcc gagagccacc 8041 acttaggctt ccaaccaatc cctaccaggg tgggcggagc ccacttcctg attggctgca 8101 ctctctgtgt cttggggtgg gcgagaacgg cggggccacg ccccctaact aggcagccaa 8161 tcaggacgtg gggtgctggt ttctccattg cgaagcttca gcctttgatg gtttgggtcc 8221 tgggagtctg gttagtacaa ggggaagcct agtgggtctg gcgctccgtt ttcaagacac 8281 tcggagtccg tctttgaggg gaaaggtcat ggccctgaaa ccaccttctg ccacccagcc 8341 tgctcccaac gcgccagcta ccccagacgc cccccctacc acaggtgatc caggtgcttc 8401 agctgcccca ggttctccca ctaccacagg tggtccaggt gccccagctg aggtgcccca 8461 ggagccgcaa gagcctacac agacaccaga ggaactagca ttttacgccc caaactacct 8521 atgtttgacc atctttgcta tacttttatt ccctccattt ggattggcag ctttgtactt 8581 ctcttatgag gtaaaataat gcccaagtca tagcccccgt cctttttttt tttttttttt 8641 ctttttgaga cccagtctca ctctgtcccc caggctggag tgcagtggtg caatctcggc 8701 tcactgcaat ctctgccttc ccggttccag caattctcct gcctcagcct cctgagtagc 8761 tgggattaca gacatgagcc acagtgcccg gctcattttt gtatttttag tagagacggg 8821 gtttcaccat gctggccagg ctggtctcga gcttctgacc ttgtgatcca cccgccttgg 8881 cctcccaaag tgctggcatt acaggcgtga gccactgcac ccagcctacc ctgtcctcca 8941 tttggcccct gcatctgctc ctatcctggg acctgcatag gggctttccc tcagtgactg 9001 caggtctcca accgttctcc ccagggttcc tggacacaga aacccacatc tatgcttcca 9061 cccctacaga ctatgaaggc caaccagaac agtgaatggg aagaggctta catcaactca 9121 ggccgaactg gttggttcgg tgcattcgtg gtaatgattg gcttaggcat catttatggc 9181 ttggtcctat attaatgaag tctaggcata gcaacccagg ggtccgctcc ccaaccgagc 9241 cactcaccaa acactaacca gccaagtcag ccatcaggca agaaattcaa ccgctgagac 9301 atctagccaa gaaatcctct aaccaaggaa ctcaacattc aagaaaccca ccaacccagg 9361 aacacacaga ccacctgcct gtgctacttc aactcactca gctccagagc tgtcttcgtt 9421 caccctaggg gtccacactt catgccctag atccctttag ctccttgaca ctgggctgtt 9481 ctttcagagg aattttaaac caaaaaactg gaatggaatg ttctggtttc ctgtggtgtg 9541 tgtgtatgtg tgtgtgtcaa catgcccatg attgttttct gggtcagtgg tcagtatctg 9601 tgtttccctg gagctgtttc ctagactgtg aaaatcagaa tgtcccaatg gggaagccca 9661 cagacataga attcaggcag atgtcagtta aaaacttacc tctgacactg aaaaactgta 9721 tagccctgaa cagatacttt tcttgagcat agttcctttg tctctaaagc aggcataatt 9781 gccaatgtgg ggatgatatt tagaaatctg aactgatgtt tattctctag gggtcttctc 9841 atttgagctg ggattggaga tgtctagtgt ctcagagcag caataagaaa acagaaacct 9901 cttccagctt ctgacatcca aatgtcaagc tcttaggaga agaatggaaa gtcctcaaga 9961 aatgcaaata gctttggcag aatagctgat gaagaccacc tctcccccct ccagaaaggc 10021 attggttccc cattcatgga aaagggaatg tagagagaga ttagacaata gtacatccat 10081 aaggttcctg gaatctgcat ctgaggaaga ggggcgtcag agaccccagc tgttatctat 10141 aatccctcct cacagataag gcctagagag gctccaagtt ctcaaagaag atatgctaca 10201 tggtgtagac atttacaaag aaatgacctc agacagccaa ggaggatacc ccatggccca 10261 gcctagtgag aagaaggtgg gttccatagg aatcccaagt gccaggtgga gacagaactg 10321 tagaggaaaa ctagggataa gcagggagcc atcgtgcatg aatcaaaaca gagatctaga 10381 ctgacaccat gggccacaga caccccgggg ctgcctctga tgcccaagct tcatggacaa 10441 ccccaagttc agaaccagat gaatatttag cacaatattt agcagaacta agtttccacc 10501 accatggaga atggggattc tcagtagaaa gtcaatggag ctgctgggca agagactgct 10561 cactgtcccc cagagcccat cgcctctttg ttctgggaac attgcctggc cacgtctcag 10621 aggttcccct ctgcagccat gaaatgtgct ttggtcctgt ggaatgtgag cagaactgac 10681 atgtaccaca aacaaacctg gcttttcaat gtctcctggg acactcctct aaactcctcc 10741 tcccccggcc aacggggcaa gggataccca tgtgatgatt tagaaggcca catgttgtag 10801 acgtggcaaa gcccccatta gcctgggtcc ctgaaccatt ctaggaacag cctcccttcc 10861 cccaactgcc tcatgtggtc aagcagaaaa tagactacca tgttgacaaa tgacatgctt 10921 ggcctgtttc cacagcccaa tccacccaaa ccagtgcaga tgcaggagaa tagagttcta 10981 tttcttgcac atctgggatt acagcttcaa gttcataccc aacagagaca tccatcttgt 11041 tgggtggtct ataggaaata ataatagaag gaaacaacaa aggccctagt ccaccttaat 11101 tgtaattgtt gtgatgctga gtttgtctcc tctgtgggct tgggtgtctg ggcagatctc 11161 atgtgtttgt gggacaatgc tctgggttat ggggatgcct gcagatagga tgatgcctaa 11221 ttgtctgtca tgttagcaga tctagagggt tagccatccc tgtccctctc cagatccagc 11281 ccaggctggg cctgtcacat tcagctgcaa agccatgtca ggttaggatg gtccatgtca 11341 acatcagcag gcagacccaa gtgtcacctt ctgctacagg ccttggaaag ccagccagcc 11401 tcctccactt cagctcccag acttcatcca acctgaattt cactctcggg tatccaactg 11461 gtatttagca ccaggtaaac acaaaacgag gccaggcaca gtggctcgtg cctgtaattt 11521 caatgctttg ggaggctgag agaggaggat ggcttgagcc caggactttg agaccagcct 11581 aggcagcaag tcgatagata actcttttct acaaaatttt tcaaaattag ccaggcgtgg 11641 tggcacacgc ctgtagtctc agttactcag gagggtgagg tgagaggatc actttagccc 11701 aggattttga ggctgcagta agtcatagtg atactactgc acttcagcct gggggacaca 11761 gctagacttt gtctccaaaa taaataaatg taaaaaataa ggccgggtgc agtggctcac 11821 acctgtaatc ctagcacttt gggagactga ggtgggagga tcacctgagg ccaggagttc 11881 gagaccagcc tggccaacat agtgaaaccc tgtctctata aaaatacaaa aattagccac 11941 tgtggtggtg cgtgcctgta atcccagcta ctcgggaggc tgaggcagga taatctcttg 12001 aacccggggg tggaggttgc agtgagcaga gattgcacca cttcactcca gcctgggcga 12061 aggagtgaga ctcaacctca aaaaaaaaaa aatcttatct gactttgatt gcattacaat 12121 caatttagtt atcagttttg ggagatttag catctttatg ataagccttc ctattcataa 12181 atatgatata tcaaatagga cttgtacatc tttagcttta ttttatgtat tataagcaat 12241 ccttttttcc attacataat ctaaattatt attggggtag aagaaaggta ttgagtttcc 12301 aatgttgata tcttgtgtgg caaacttact gaactttctt ggtgtaataa ttcatcattt 12361 tacacactta cattgtctat gctaaatttg aattatttct taacaatttt aatacctcat 12421 tatatatgta tttttttagt tcctgggcta aacctttcag caaaatatga attagcatca 12481 taatcttgtt tccatccttg tatgggttag ttctattgtt ttactattaa gtcaggcagt 12541 tgatgtagat ttctgataaa aatttcatct tttttttttt tttttctgaa gcaaactctg 12601 gctctgtcac ccaggttgga gtgcagcggc acaatctcag ctcactgcaa tctccgcctc 12661 ccaggttcaa gcgattctag tgcctcagcc tctatagtag ctgggaccac aggcacgcac 12721 caccactcgg ggctaatttt ttgtattttt agtagagatg ggatttctct atgttggcca 12781 ggctggtctt gaacccctgg ccttaaatta tctgcctgcc tcagcctctc aaagtggtga 12841 aattacatgc atgagccacc gtgaccagcc aaaatttcat caattttgtt atgttttctt 12901 ctattcctgg cttgccaaga ttttttaaat tagaatttta tattgaggct aggcacagtg 12961 gctcatgcct gtaattccag tactttggga ggccaaggaa ggagagtcat ttgaggccag 13021 gaatttgaga ccagcctggg caacatagct agatctcgtc tctactgaaa aaagaaaaaa 13081 aaattagcta ggcatggtac ctgtattctc agctactcag gaggctgaga tgggaggata 13141 acttgaaccc aggagttaga ggcagcagtg agccatgatc acacccttgt actccagcct 13201 gggcaacaca gcgagaccct gtctcaaaaa aaatttgtgt tgaatcatat taaatacttt 13261 tcagcatcta ttaacatgat catacgagat tttaattcaa tcctttaaca aagttattca 13321 caatgaacca ttcttatact cttgggaaaa ccttactggg tcacaatcta ttattctttt 13381 tttttttttt tagagacaga gtcttgctct gtcgcccaga ctggagtgca gtggcacgat 13441 cttggctcac tgtaagctcc gcctcccagg ttcacgccat tctcctgcct cagcctcccg 13501 agtagctggg actacaggcg cccaccacca cacccggcta attttttgta tttttagcag 13561 agacggggtt tcatcgtgtt agccagatgg tcttgatctc ctgacctcgt gatccgcccg 13621 cctcagcctc ccaaagtgct gggattatag gcgtgagcca ccatgcccgg cctctattat 13681 tcttttaata tgtgatggat atggtttgct tatgccttat ttgggtccta catagttttc 13741 atctgggggc tgttctcagt cctttgcatt ggggcagtgc tggcaatgca gaatgggttg 13801 agaagcattc aatgtgtttc tgtgttctgg aaacatttct atgtcatggg aattagctgc 13861 ttcttaaaga ttggagtggg ccggacatgg tgcctcatgc ctgtaatcct agtactttgg 13921 gaggctgagg cgggtggatc acctgaggtc aggagttcga gaccagcctg gccaatatgg 13981 tgaaaccccc atctctacta ataatacaaa aattagctgg gcatggtggc acacaactgt 14041 aatcccagct gctcgggagg ctgaggcagg agaatcactt gaacccggga agcggaggtt 14101 gcagtaagcc aagatcacgc cattgcactg cagcctgggt gacaagagcg aaactccatc 14161 tccaaaaaaa aaaaaaaaag ttggagtgaa catgcctgca aaatcttctg ggcctgtaat 14221 ctttttcgtg tagcagattt tgagtacttt gtcttcacta tgatggtgaa agcagcaact 14281 ctcggagtca agtacaagcc actgttatga gcagcttcgg agtattgatt cttttggtct 14341 ccacactaac cctatagcct gcattcttac agactgagga taaagcacag ggagctggaa 14401 agacacagtg caggtctcac agcgaattca ctgaggggcc aggattggga accaggaaat 14461 ctgccaccaa aagccgtgtt cttaagcact aggtggacaa cactgcctcc catcgaccgt 14521 tagtagcaat tcaatctttt tacctgctgg agtcacttgt taatcactta cctttctctt 14581 aacaaatcat ttttattttc gacatagaag ctaacttgaa ggagcgccca ctggcccaga 14641 gtggtacaac ttaaacatca aaaatattaa taacggccag gcgcggtggc tcactcctgt 14701 aatcccagga ctttgggagg ctgaggcagg taaatcaatt gaggtcagga gttcgagacc 14761 agcccggcca acatggtgaa acctcgtctc tactaaaaat ataaaaatta gccagccgtg 14821 gtgacagtca cctgtaatcc cagctactcg ggaggctgat gcaggagaat cgcttagagc 14881 tcgggagtcg gaggttgtag tgagccgaga tggcgccatt gcactccagc ctgggcaaca 14941 agagcaaaac tccgtctcaa ataataataa taataataat agtaataaca ataatagatt 15001 taaacacacc aaatatatat catatttaat gatgaaggat ggaatacttt ccctataaga 15061 tcaacaacaa ggaaaggatg tctgctcttg acactcctat ttgacatagt gctggaagtt 15121 ctagtcactg caagaaaaaa acgcaaacag ggcccggcgc agtggctgac gtctgtaatc 15181 ctaacacttt gggaggcagg aggatcactt gagactagga tgtcaagacc agcctaggca 15241 acacagcatg accccatctc tacaaaaaat attaaaaaat tagccaagca tggtggcaca 15301 tgcctgtagt cctacctgct cagggggctg aggcagggag atcacttgag cccaggactt 15361 tgagactgca gtgagctatg atcacaccaa cactgcactc cagcctgagg aatagagtga 15421 gactctgtct caaaaaagaa aaaaaggcat ttagattgga aaggaagaaa taaaactgta 15481 tctgcaggtg acatgagata tgtgtgtgtg tgtgtgtctg tgtgtgtagt tgatgcaaaa 15541 ataattgcgg tttttgccat tgaaaggaat gactgctggg catggtggct cacgcctgta 15601 atcccagcac tttgggaggg tttttttttt tttttttttt tttgagatgg agtctcactc 15661 tgtcacccaa gctggagtgc agtggtgtgg tctcggctca ctgccagctc cgcctcccgg 15721 gttcacgcca ttctcctgtc tcagcctccc gagtagccgg gactacaggc acccgccacc 15781 acgcccagct aaatttttgt atttttagta gagacggggt ttcaccttgt tagccaggat 15841 gggctcgatc tcctgacctc gggatctgcc ctcctcaacc tcccaaagtg ctgggattac 15901 aggcatgagc caccgtgcct ggcgtttttc ttttcgtttt ttttttttct tttttttttt 15961 tttttggaga tagagtctca ctctgtcacc caggctggag tgcagtggtg tgatcttggc 16021 tcactgcaac ttccacctcc tgggttcaag cgattctcct gcctcagtct ccctagtagc 16081 tgggattaca ggcgcccacc accatgccca gctaattttt gtatttttag tagagacggg 16141 gtctcaccat gttggccagg ctggtcttga actcctgacc ttaggtgatc cacctacctc 16201 agcctctcaa agtgctggga ttacaggcgt gagccacagc cctggacgga tgtattttca 16261 atacatgtct ttataatgca tgtgaaagct aacgcataca tgggagacaa tgaagggacc 16321 aatatggtgg aagttttcca tattccactt gaagttgcaa aatattcaat ctaggtagac 16381 tatagaatgt ttaagtgtgt ttaccataat ccgtagagca atcacttaaa acttgcaaag 16441 atatatagtg aaaatcacaa taaataaatt aaaatggagt actaaaatag tatttgaata 16501 atccaaaaga aagaaggaaa agggaaaaca aggaacaaaa cacagaaaag tggaaataaa 16561 aaacaaataa tagtagacct aaagcaaaac ataccaataa tcatatttaa ggcacatgat 16621 ctaaacacac aagttataaa atattttcat aatgcactta aaaaaaaaat agccagagcc 16681 gggcgcgttg gctcacacct gtaatcccag cactttggga ggccaaggca ggcagatcac 16741 gaggtcagga gaccatcgtg gccaacaagg tgaaaccctg tctctactaa aaatacaaaa 16801 attagctggg cgtggtggcg gatgcctgca atcccagcta ctagggaggc tgaggcagga 16861 gaatcgcttg aaccagggag tcagaggttg cagtgagccg agatcgcacc actgcactcc 16921 agcctggcga cagagggaga ctccatctca aaaaaaaaaa atagccaggt tcaaaggctc 16981 atgcctgtaa tatcagcact tcgggaagcc gaggtgggtg gatcacttga gcccaggagt 17041 ttgagatcag cctggaaaac atggtgaaac tccgtctcta ctaaaaatac aaagaagtta 17101 gctgggtgtg gtggcacacg cctatagtcc cagctacttg ggagactgag gcaggaggat 17161 aacctgagcc caggaagtcg aggctgcagt gagccatcat cacatcactg cactccagcc 17221 tgggtgacag agtgaggccc tgtctcaaat aaatacatac ataaataact tgagcccagg 17281 agttcaagat cagcctgggc aacatagtga gaccccatct ctgcaaaaaa tttcaaaatt 17341 aacctggcat ggtggcttgc acctgtagcc ccagctactc agaagtttgg ggcaggaggt 17401 tgtttgagcc caggaggtca aggctgcagt gagccgtgtt cacaccacta cactccagac 17461 tagcaacaga acaaaacctt gtcttaaaaa aaatagaagt aggccaggcg cggtggctca 17521 ctcctgcaat cccagcattt tgggaggcca agatgggctg atcacctgag gtcaggagtt 17581 cgagaccagc ctcactaaca tggtgaaacc cagtctctac taagcataca aaaattagcc 17641 aggcatagtg gcaggtgcct gtaatcccag ctacttggga ggctgaggca ggagaatcac 17701 ttgaatgcag gaggcggagg ttgcagtgag ccgagatcac gccattgcac tccagcctga 17761 acaacagagt gagactctgt ctcaaaaaat aaaaaaatta aaaacttaaa aaatagaagt 17821 aggccgggcg taatgcctgt aatcctagaa ctttaggagg ccaaggcagg cggatcactt 17881 aaggtcaggc gttcgagacc agcctagtca acatggcaga accccgtctc tactaaaaat 17941 acaaaaaatt agccaggcat ggtggtgcgc acctgtagtc ccagctactc aggtggctga 18001 ggcaggagaa tcacttgaac ccaggaggcg gaggttgcag tgagccaaga tcacatcacc 18061 acactccagc ctggaagaca gagcaagaag aaagaaggaa aagaaagaag agaaggaaaa 18121 ggaagaaaag agggaaggga agggaaaggg aaagggaaag gggaagggga agggaagggg 18181 aaagggaaag ggaggggagg ggaagggaag ggggaaagaa tgaaggaaag aaaggaaagg 18241 aaaggaagga agaaaagaaa tgattgtttg cttggtgttg gaggggcttg agggggtggg 18301 gaaaaagggg ggaacccaca catttgatca aagaactctt ctgtgttgac tgttgtggta 18361 tgagagcaga gaaaatgcat tgtgtttttc ctgaaacaca agactccgag gcctacatca 18421 ctcttgcccc aagtagtgct tttctctcaa gatgctcagg aggaatcttt aaacaaagat 18481 gatcaaggaa atgaggctta tttataggtc tgttttgaag gtggaaccag ggattcctca 18541 attggccaat cacttgagag agtagaagga gctgtcaccc catcaaacca atcagaagcc 18601 aggtgttggt tcccacccta tgacatacca cctccaattg gccagtagaa gtagagagac 18661 aggctaggag gcggggcctg cctggccgcc atggtaatcc cctgtggttg gtgatcaagg 18721 aagagcatag tgccagacct aggtgccctc ctgggaatgt tccaggaggg caggagtagg 18781 aggaggagtg ttagagtaga ggggaaatga tgagagcaga aagggtatca taaaggaacc 18841 tcaggggtga tataagtgga cacagacata tgtcacaagg gcacgggttt agaacaggaa 18901 ggctgttgta cgtggttgga gagggagttc aaagagtcag aaagctggtt tcagaagttg 18961 agaggtgtta gagggagaca gggcaggact ttgtaaggtg acaaagaagt agagactgaa 19021 accctgtctc tactacaaat acaaaaatta gccgggcgtg gtggcgggcg cctatagtcc 19081 cagcaagagg agtgaatagg tggtcaaata aggtcttaga ggagctaaag gggcagggag 19141 cttatgggcg aacaggaaag gtttacagca ggaccccgca ggattcatag ggacgaagta 19201 aaggatatta taaaaggaac agtgagagag ctaacagaag ttttactttg gggggcagag 19261 gtgggaggat cacctgagcc caggaattca agaccagcat gggcaacatt gtagcaaaac 19321 cccatctcta ccaaaagaaa aaaatttttt tttttttttt ttttttttga gagagtctcg 19381 ctgtgtcacc caggctggag tgcagtggca ggatcttggc tcacttcaac ctccacctcc 19441 cgagttctgc ctcagcctcc caagtagctg ggattacagg tgcgtgccac cacgcccggc 19501 taatttttgt atttttagta gagacggggt ttcactatgt tagccaaggt ggcctcaaac 19561 tcctgacctc aggtgatccg cccacctcgg cctcccaaag tattgggatt acaggcgtca 19621 gccaccacac ccagcaaaac aaaaacaaaa caaaacaaaa ttttagttag ctgggcctgg 19681 tggtgtgcac ctgtagtccc agccactcag gaagctgagg caggaggatc acttgagccc 19741 aggagtttga ggctgcactg agctatgatt gtgcctttgc actcctacag cctgggagag 19801 aagaccctgt ctctgaaaaa aaaaaaaaaa aaaaaaaaaa aacagaagac aaagaagttt 19861 tacagggtga gagttttata agggaacagt gaatttataa tgaaggtttt atacccaaac 19921 acaggtccag tcactccacg cttgcagagt ccaattaaca agagcaagtt ctggtagaaa 19981 gaaggtgact ttattccaga gctcaggtga ggggaagagg tacaggttcc tgccttaagg 20041 gtattgcttc agctttcagg acagaaagca gggactttta aaggggggct tgatgtgaat 20101 gacataaagg tggggggcaa ggaggtgtgg ggtctatgtg acatgctttg atgtcttatc 20161 tatcaggtgt tctagctgtc accatcgtga gaagagaaat tgaccattgt ctcaaggcaa 20221 tctgatggga gagaattctg ggggtgcctg atttgtttca aggttcagtc cctggaactt 20281 ctaagtaaac atattgttag ataagcttgc catgtaggga ggttacagtt gcattcctaa 20341 agagttaagt aggagatggg gggaaaagaa aaaggaaaaa aatccttttt tttctttaaa 20401 aatggggtac tctggccagg cacagtggct caaacctgta atcccagcac tttgggaagc 20461 tgaggcaggc agatcacaag gtcggaagtt caagaccagc ctggccaaca cagtgaaacc 20521 ccgtctctac taaaaataca aaaattagcc gggcgtggtg gcaggtgcct gtagtcccag 20581 ctacttggga ggctgaggca ggagaatcgc ttcaacccac aaggcggagg ttgcagctga 20641 gctcacgcca ctgcactcca gcctgggcga cacagcaaga ctttgtctca aaaaaaaaaa 20701 atgtggtact caattacagt ttcccactgt caaattccat tccatttcta tgggatttgg 20761 atgccacatt catgctggct acttcctgct gaaagagggc attgtgatta gtcatcagaa 20821 tggaaccgat ccacttggag ttggaaatat ttatgagtag ttggaccaat atggtgtaag 20881 gtctggaaag tctctgggga agtctgtctt gcattcccat atagaaggtg aaacagcaga 20941 aaaatccaca gcagagaagt atgtctatca ttgcaactag ggccaaagtt gtaaatagct 21001 tctttccagg tgagaacagt cctcacctgc aacaggatgc caaccattga cctggtgata 21061 atgtggggtc agacatagct ttgattttta ggtgcatgtc tctcagggct attgagacat 21121 ttactgaatt atctgagatg tgtacacagc atttagtctt aattgtcagg cacattccca 21181 gctgtcaatt gtacctggag ctcctgtgga gatatgatga cagggtggtt aaacacatat 21241 atttaacagg ttacaagagg agctatgggc cgggcgtggt ggctcacgcc tgtaatccca 21301 gcactttggg aggccgaggt gggaggatca ctcaaggcca ggagttcaag actagcctgg 21361 ccaactacca gaccgatgaa ctcagtctct actaaaaata caaaaattag cgggcatagt 21421 ggtgcacacc tgtaatcccg gctcctcaag tggctgaggc ataagaattg cctgagccca 21481 ggaggcagag gctgcagtga ggcgagatcg tgccactgcc ctccagcctg ggtgacatag 21541 tgagactttg tctcaaaaat aaaaaataag taaataagaa agggctgggt gtggtggctc 21601 atgcctgtaa tcccagcact ttgggaggcc aaggcaggtg ggtcacctga ggtcaggagt 21661 tcaagaccag cctggccaac atggtgaaac cccaactcta ctaaaaatac aaaaattagc 21721 tgggcatggt agcacatgcc tgtaatccca actactcagg aggctgaggc acgagaactg 21781 cttgaaccca ggaggcagag gttgcagtga gctgagatca caccattgta ctccagcctg 21841 gagaaaagac tgagactctg tctcagtaaa taaataagag cgatacagag gccaggcatg 21901 gtggttcaca cctataatcc tagcactttg aaagatcaag gcagaaggat catttaaggc 21961 caggagttca cgacaagcct gggcaacata atgagattcc atctctacaa caacaacaaa 22021 aatcaaaaat tagctgggtg tggtggtttg taccagctac gtgggaggct gaggcaggag 22081 gattgcttga gcccaggagt tcgaggctgc agtgaactgt gatcatgcca gtgcacttca 22141 gcctgagtga gagagcaaga ccttgtctca aaaaataaaa gggatttgcc gggcgcggtg 22201 gctcacgcct gtaatcccag cactttggga ggctgagaca ggtggatcac gaggtcagga 22261 gattgagacc atcctggcta acacggtgaa accccgtctc tactaaaaat acaaaaaatt 22321 agccaggcgt ggtagcgggc gcctgtagtc ccagctactc gggaggctga ggcaggagaa 22381 tggcatgaac ctgggaggca gagcttgcag tgagccaaga tcgcaccact gcactctagc 22441 ctggatgaca gagcgagatt ccgtctcaaa aaaataaata aataaaataa aaaataaaaa 22501 aaagggatag agagaggtta gaggagacat ggtgggggtt agagaaagtc aatgagaatt 22561 ttaaatgggg tcaatgaaga tcatatagct gccaaggaga gatgtgttag ggtaagtagg 22621 gagagagtga gaaaggattg ttagagttca gaagccatag tgagacataa ccaactctct 22681 gccccccagg tgtttgaact gatgtctgat gaggatgaat ccagcgacta cctctgcctg 22741 tccatcctgg gcctcttctg ttgccttccc ctagccatcc cagccgtgat cttttcttgc 22801 ctggtgggca cctgcatcat gccaatcctt caccctccct ctccatcttg cccctgaccc 22861 tgactcaact tcttccttgt tctcccttct gaactaggag cctgagcaac accagcacct 22921 cctgtgggtt cagaccccca ccctgagtcc ttacagttcc acagcccacc ctcacccaac 22981 aactgttttt tttttttttg aaacagagtc tcgctctgtc acccaagctg gagtgcagtg 23041 gtgagatctc agctcaccgc aacctccatc tcgcaggttc aaacaattat cctgcctcag 23101 cttcctgagt agctgggatt agaggcgccc accaacacgc ctggctaatt ttcgtatttt 23161 tggtggagct gggtttcacc atgttggcca ggctggtctt gaacacctga cctcaggtga 23221 tctgcccacc tcagcttccc aaagtgctgg gattacaggc atgagccacc atgcccggcc 23281 acgcaataac tcttttatac tcatctttcc ctgttccctc tctcctaaaa ctcagacaaa 23341 gtttgtattg tagacaaaga actacaataa atccagtgac tatgagctgg cagccaagac 23401 ctccaaacaa gcctactact gggccatcgc gagcatcact gtgggaatct taggtaccat 23461 cttgtacacc tacctgatat acttacttag attgtaaact gcttcccagc tcttgaacaa 23521 accaccaaat atacaccaca gtgcaattta ccttggctct aagcatctac ttgggctgaa 23581 tggaacacct gctcctcaat gtctggagtg ccaagctatc taagattaga aaaacaaaat 23641 gcaaagtgtt cttcctattc ttacatgccc ctcaaagtaa cacttctttt tttttttttt 23701 tttttgacag agtcttcctc tgtccaggct ggagtgcagt gccacgatgt cggctcactg 23761 caacctctgc ccccaaggtt taagtgattc ttctgcctca gcctcctgag tagctgggat 23821 tacaggtgcg taccaccatg cccagctaat tttgtatttt tagtagagac ggggtttcac 23881 cgtgttggcc aggctggtct caaactcctg acctcaggtg atccactcac cttagcttcc 23941 caaagtgctg ggattacagg tgtgagccac cataccaggc ctcagcataa cacttctaac 24001 accaaatgtt agagccaggg gcaggggtat tttccccacc taccaaccaa ccagattctc 24061 cagcagacac caactgggtg tcctgtaatt taattcaatt ctgacactat ctacctggag 24121 atggcatcag atcccacagg ttgaaggctc agttccacaa gactgccccc aaatttggat 24181 gtcagccaca agtggtaagt tgtcacctag acttgtgact gacaagctat aaatcaggtg 24241 ttcccatgac ccccctcctc aagcctgatt aatttgttcc agctggccag gtgcagtagc 24301 tcaggcctgt aatcccagca ctttgggcgg ctgaggcagg aagatcgctt gaagccagga 24361 gtttgagacc agcctggcca acatggcaaa accccgtctc tactaaaaat acaaaaatta 24421 gacaggtgtg atggcatacg cctataatcc cagctactca ggaggctgag gcaggagaat 24481 cgcttgaacc caggaggtgg aggttgcaga gagctgagat catgccactg cactccagcc 24541 tgggcaacac agcgagactc cgtcaaaaaa aaaaaaaaca aaacactatt atcactccag 24601 cgattccaag ggttttagga gctctgggcc acaaaatggg acctagacca aatatatatg 24661 tcacagtatc acacaagtgt tgccagcctg ggtcagcagc ttgaagggaa ggaagtggta 24721 gtgccagcag cccactcgac cagaactcaa gaccaactcc aggaccctgc tatggcgtgg 24781 gggacatggg agagagagga ggaccccgtt gctcgaggac agcctgccca gattcatgta 24841 ttcacatcgc aagtctagtt ctgcctacca agggcgttgg ctgtgctacc tgactctgga 24901 gacgtagcaa tgactacttc aaattcaact caaccccctc agaacatata aattggggtg 24961 agagacagac atgccaccag acagtgacaa tcggagtggt ccaatatata atggaagaaa 25021 acttgggaag ggttactaaa gaaacacttg tttaatgaca ctcgttacag acagtaaggc 25081 gggccaggca cagtggctca tgcctgtaat ctcagcactt tgggaggccg aggcgggcgg 25141 atcacttgag cttgggagtt caagaccagc ctggccaaca tggtgaaacc ttgtctctac 25201 taaaaataca aaattagcca ggcgtggtgg tgtgcacctg taatcccagc tactcaggag 25261 gctgaggcat gagaattgct tgaacctggg aggcagaggt tgcagtggtt cgagattgtg 25321 ccactgcact ccaacctgtg ggacagagca aaactgactc catctcaaaa agaaaaagac 25381 agtaaggcag actttattca ggaccattgc aatatgcaga gggaatactg agacaggggc 25441 ctgcactgga ggagaaagat ggggctcaac actaaataca gcatgagcaa gtgggaagtc 25501 ctacccaagg agcagggtag gggtcagtgg atggaaaatt agtaagagaa aacattgagg 25561 gtaaggggga ttctgactaa accaacctaa taggattcta gttgaagaca ggccagggtg 25621 accagacatc accaggggga tggtggagga tgaggaacct gatcagatgc cgaggtgtat 25681 cagatacaga ggatgagggt ttctagctaa actgacgtag aagggttctt ttgctacaac 25741 tggattttac aaagaagtgc acagaaggag aaggttcaga tgcttttttt tttttttttg 25801 agatggagtc tcactctgtc acccaggctg gaatgcagtg gtgcaatctt ggctcactac 25861 agcttccgcc tcccgtattc aagccatttt cctgcctcag cctcctgagt agctaggatt 25921 acaggtgctc accacctcaa ccagctaatt ttttatagtt ttaatagaga tggggtttca 25981 ccatgttggt gaggctggtc ttgaactcct gacttcgtga tatgcccgcc tcggcctctc 26041 aaagtgctgg aattataggc gtgagccacc gtgcccagcc tttctttttt tttttttttt 26101 ttttttcttt tcttttcttt tttttttttt tttgagacgg ggtctcgctc tctcactcag 26161 gctggagtac agtggcatga tctcggctca ctgcaacctc tgcctcccag gttcaagtga 26221 ttctcctgcc tccacctcct gagtagctgg gattacatgc gcctgccgcc acgcctggct 26281 aattttttta tttttagtag agatgggatt tcatcatgtt ggccaggctg gtcttaaaac 26341 tcctgacccc aggtgatcca cctgccttgg cctcccaaag tgtaggaatt acaggcgtga 26401 gccaccgcgc ctggccgccc ggctaatttt tatattttta gtagagctgg ggttctccat 26461 gttggccagg ctggtctcaa actcctgaga tcaggtgatc cacctgcctc agcctcccaa 26521 agtcctggga ttacaggcat gagccactgg gcccggccca ggttcagaag cctgactaaa 26581 gtttggccaa gcaaagaatc cttgtcagga ggctgtaaat aagagacctg tgagaaataa 26641 aaattgtcag gtgtctgagc ccaagcctgc atgtatacat ccagatggcc tgaggcaact 26701 gaagaaccac aaaaacaagt gaaaatggcc agctcctgcc ttaactgatg acattacctt 26761 gtgacattcc ttctcctgga caataagtct ccggagctcc ccaccgagca ccttgtgacc 26821 cccgcccctg cctgcaagag aacaaccccc tttaactgta attttccact acctacccaa 26881 atcctataaa actgcctcac tcctatctcc ctttgctgac tcctttttcg gactcagtcc 26941 gcctgcaccc aggtgattaa aaagctttta ctgctcacac aaagcctgtt tgatggtctc 27001 ttcacacgga cgcacgtgac aaaaataaaa tcctaagtcc gccaactggc tgaatgaaca 27061 ctgtcttggc caaggggcct acagagaaac actaaaagct gagttcctag ccgtgactga 27121 acgggtggtc gggcacacct cattctacct cctcccttgc tgactgccat gaagctttct 27181 tccctaaggg ttaaacagaa accagcccta ttgaaagact cattcactgc taatttcaac 27241 caatggcctg aaatacatgg ctgctcctcc cttttgtggt ttcaacataa caactgacca 27301 gcattccttc ctgatcagag gccacctaac cacggggtgt ggctctggcc agtctacaga 27361 ggctgcacac aaagggcctt tgtgtcctct gcatcacctt ttgatgtata gggcctaatt 27421 gtaatacatt taagtgttaa gtctccactc caaagtgaac atggggcctg gcacggtggc 27481 tcacgcctgt aatcccagca ctttgggagg ccaaggtggg cggatcacct gaggtcagga 27541 gtttgagacc agcctggcca acatggtgaa accctgtctc tactaaaaat aaaaaaatta 27601 gccaggcatg gtgatgggtg cctgtaatcc cagctactca tgaggctgag gcaggagaat 27661 tgcttgaacc ctgggaaaga gaggttgcag tgagctgaga tcgcaccact acactacagc 27721 ctgggtgaca gagtaagact ccatctcaaa aaaaaaaaaa aaaaaaaaaa aaagtgaata 27781 tgggatgtta ctacacatgt gctgtacctt cccttcatga atattcatag ctcctcctat 27841 aacctgttga atatgtatac tgagccaagc cattcagcgt taactctgac cttatccttc 27901 ccttactgga ggtgcctgct ctcagcttct accggaggct acacttctcg gcctgtgaga 27961 tggcccggct gcaggctgca actctttatg agaaataaag ctctcctttc caaatttgta 28021 aacctgtatt agtctgttct cacgctgctg aaaagaactg cccaagactg ggtaatttat 28081 aaaggaaaga ggtttaattg actcacagtt ccgcagggtt ggggaggcct cagggaactt 28141 aaaatcatgg tggaaaagaa agcaaaaacg tccttcctca gatggcagca ggaaggagaa 28201 gtgccaagca aagagggaaa agccccttat aaaaccatca gatctcgtga gaactcacta 28261 tctcaagaac agcagcatgg gggtaaccac ccccatgatt cgattacctc ccactaggtc 28321 cctcccacaa cacatgggga tttggggaac tacaattcaa gatgagattt gggtggggac 28381 agccaaacca tatcaaaacc tcatgattct tcagttgaca tctaatctga gctaccatgg 28441 tctatcaaaa aaccacagct cttgagggga tctcaatggc tgttaacaaa ctggttagag 28501 ttctggtcag gttctgattg tagagagcct ctgtaacaga tatggattgc ctccttcccc 28561 ccaccctcct tgttattttt atttcatttt attaaatttt tttttgagac agggtcacac 28621 tatgtcaccc agactggaat gcagtggcac aatctcagct cactgcaact tctggctccc 28681 aggttcaagt gattctcatg cctcagcctc ctgagtagct aggactacag gtgcatgcca 28741 ccatgcccag caaattttta tattttttgt agagacaggg ttttgccatg ctgcccagac 28801 tggtctcaaa ctcctgagct caaccaatct gtctgccttg gcctcccaaa gtgctgggat 28861 tataggcgtg agccactgtt cccagccatg atttttattt aacaaatact cattatttaa 28921 caaatacttt ctgagcacct actgtgtccc aggcattgtc ttgggaacga ggaacacaac 28981 ggagcaagag aagtaaggcc aacttcttct ctgatggagc ttacattcta gtgggaaaga 29041 aaactataaa caagtatacc caaaaataaa caagataatt acagggaatg gtaaatgcta 29101 tagagaaaaa taaaacagaa taagaagcta aagtggtgcc aggactggtc cagcaactat 29161 tatggtgttg ggagcaagcc ccccaaaata tggccataaa ctgaccccaa gactggccac 29221 aaacaaaatc tccacagcac tgtgacatgt tcataatggc cctaacgccc aagctggaag 29281 gttgtgggtt tacgggaatg agggcaagga acacctggcc cgcccagagt ggaaaaccac 29341 ttaaaggcat tcttaagcca caaacaatag catgagcgat gtgtgtctta aggacgtgtt 29401 cctgctgcag ttaactagcc caacctattc ctttaattcg gcccatccct tcgtttccca 29461 taagggatac ttttaattta atatctatag aaacaatgct aatgactggt ttgctgttac 29521 taaatatgtg ggtaaatctc tgttcggggc tctcagctct gaaggctgtg agacccctga 29581 tttcccactt cacacctcta tatttctgtg tgtgtgtgtt taattcctct agcaccagtg 29641 ggttagggtc tccccaaccg agctggtgtc tgcaatggtg tgccaccaga taagtacgga 29701 aattgccagc aagtgtttag aaacacagca atttgacata tccatgacat ccaagtatag 29761 gatcaatgga ttttcttgtt gaacagggta gagaccaaaa ggattggtcc atgacaggtt 29821 ggaaacatca aacaaatgat ccttctccat agttacaagt agtttgagat acactgagct 29881 agtgggcaat gatgtctaca aggtatatta tttagagtca tgctagctgc tataacaaat 29941 aaaccccacc atttcagtag ctcacaaaat aaaagttctt tttttttttt tttttgagac 30001 ggagtttcac ttttgttgcc caggctggag tgcaatggca cgatctcagc tcacaacaac 30061 ctctgccccc tgggttcaag cgattctcct gccccagcct cccaagtagc tgagattaca 30121 ggcatgcacc accacacccg gctgattttt ttgtattttt agtagagacg gggtttctcc 30181 atgttggtca ggctagtctc aaactcctaa cctcaggtga tccacctgcc tcagcctccc 30241 aaagtgctgg gattacaggc atgagccatc acgcccagcc agaagctcat ttcttactgt 30301 tcctttgagc acattccagc atcagggtct ttccacttgc tgttcccact ttgggaactc 30361 cattttcaca gttttcaagt agctctcaca ctttggctct cacgtctcag ctcaaagaac 30421 atcttagaaa ggccttctgt ggcagagcca gcaggatgtt tattgactgc ctagaccagg 30481 actacatttc ccagaggccc ttgcagctag ttgaagccat aagacaagtt atgttcaatg 30541 gactgagagc agaagtgatg tatgtcactt tagggtgaag actttcaaga agcaggtgcg 30601 agttccttat tctctcttct gcctggattc aaaagactct gagcccatag aagatggttg 30661 aatcacctta tggaagaagc ctggatccct gaattactat gtggatcagg agaaatgtct 30721 gatgggatgt tctatgagta agaaatgaga agactcttct caaacgtgga aggagctgct 30781 gaaaccaaaa tcacaagtcc tagagaagct tcttattatc tctcagatgg tggatgggtt 30841 ccaaaaagca tcctgaaatt atccagaaaa acagaaaaga gatttcagat aaagcaaagg 30901 gagagagatg ctgacagaag aagaccaaga ggccacaggg gctctgctcc ttcaggatct 30961 aagaaagcag gaaggcacca gtacctccca ccaaggccaa cagaacagat accatgagaa 31021 aggaaccaag aaaaggacaa atggtgacat gtctccaaaa caaggagaca ttgagcataa 31081 gaagcagaaa gtgaggccag gtgcagtggc tcacacctgt aatcccagca ctttgggagg 31141 ctgaggcaag tggatcactt gaggtcagga gtttgagacc agcctggcca acatggtgaa 31201 accccgtctc tactaaaaat acaaaaatta gccgggtatg atggtgcatg cctgtaatcc 31261 tagctacttg ggaggctgag gcagagaatt gcttgaaccc aagaggtaga ggttgcagtg 31321 agccgagatt gtgccactgc actccagcct gggcgacaga gggagactct gtcacaaaaa 31381 aaataaataa ataaaagcag aaagtggcca ggtataatgg ctcatgcctg taatctcagt 31441 gctttgggag gccaaggtga gaggactgct tgaggccagg agtttgagac cagcctgggc 31501 cacatagcga gaccccgtct ctacaaaaaa ttaaaactga aaaaaatcta gctgggagaa 31561 ttggctcaag tcccagctac taaggaggct gaggcaggag gatcacttaa gcccaggagt 31621 tccagcatgt gatgagccgt gatcgggcca tggctccagc ctgggcaaca gagtgaggtc 31681 ccatctcaca aacaaacaaa caaacaaacg agaaagccca ggtagcaatg agaatgagac 31741 ttgcccagac aggcaccccg ggtgactgct cctgggggtg caggccatcc attgtccttt 31801 ggtggtgtct cagtccaaag gcaattgtga ccaaatcaca gggacagatc aagtcttctc 31861 aaagcagaga ctggtgaaac tttccaaatc atggcggtag ggcacgcctt gcatgttact 31921 gagaacttcc tcctctctct tgactttgca gtctgagacc cggtcatatc tacatgtgtg 31981 tttggggcag gatgggcaga gacatagcag gaaaggaagt cacaggggtg tgtcctaccc 32041 cagccaggaa gcaggcagca gggaccgctg tggggcatgc ctggaggttc ttcaccatct 32101 gctgagcacc agcataacaa tgggacattg ggactgtcca cctgcactgt gctgttacca 32161 ctaaccagac aactggcttt cgtgggattc accctgggtc cctggggcag aaagcaccct 32221 aagtgctcag ggccttcctc tgaaggggtc actttttcct tcattgtctt gggtgatgta 32281 acaagctgtg ccagactgga ggaagtgaga ccaggtgcct gcagtgagct actgagacgc 32341 aggtgacagt cacagcagtt gcattatcag ggagtttcct atggtcatct gaacttcgag 32401 ggaggttttc tggggccggg cattccttcc tcattcccat ttacctaaat tggcccatcc 32461 ttgtctggta ggtcagcacc cagcagaggc ttctccccat gcacagcact gtgaggttca 32521 ggcacctggg cacagtaacc acagtgattc ctggaagctg gtgggcatgc tatcctgggg 32581 ccatctgaac tgggaaaagc tgtgtgagtt cttgggggcc agcttgtagc ctttttggct 32641 ggggccaggt tgaaggtctt cctgggaacc agagtcaaga gcggccactt acattaagcg 32701 gataaagccc acgactttct tgagctgctt gacagctgta ccaaaaaggc acaaacccat 32761 tgaggatatt atgaaacatg aatacagatg attattttat tttggtttgg ttttgttttg 32821 gctttgagac agggtcaccc aggctggagt gcagtggtgc catcatagct cactgaagcc 32881 tcaacctccc aggctcaact gatcctcctg tttcaacccc ccaagtagat gggactacaa 32941 gcgtgcacca ccacgcctgg ctaatttttt ttaattttta atagagacaa ggttttgctg 33001 tggtgcccag gtgggtctca aattcctagc ctcaaacatt cctcccacct caaccgcccg 33061 aagtgctggg attacaggca tgagccacca cgcccagcca ttgccttata attttttact 33121 gatgtcagat attgttaatt ttacagtgtt ggatgctgga ttttttttaa tctctataaa 33181 tattcttgaa ttttgttctg gaatgcagct aaataatagc ttgatccttt cgggccttgc 33241 ttttaagcat ggttggttag tcagcatcag agaagtattt catctagggc aggggcaggg 33301 gtctccaaac cccaagccgt gaaccaatac aagtccatgg cctgttagaa accaggctgc 33361 acagcaggag gtgagtggcg tatgggtgag catcactgcc tgagctccac ctcctgccag 33421 atcaggagcg gcattagatt cttacaggag cgcaaaccct attgtgaact gtgcatacga 33481 aggatctaga ttgcgcactc cttatgagaa tctaatgctt aggcggggca tggtggctca 33541 cacctgtaat gctagcactt tgggaggctg agatgggctg atcacttaag gtcaggagtt 33601 caagaccagc ctggccaaca tggtgaaacc ccgtctctac taaaaataca aaaattagct 33661 gggcatggta gcgggcgcct gtaatcccag ctactcagga ggctgaggca gtagaatcgc 33721 ttgaaccccg gaggcaaaga ttgcgctgag ccgagatcgg gccactgcac tccagcctgg 33781 gtgaaagagg gagacttcac ctcaaaaaaa aaaacaggga agaaaaaaaa aaagagagag 33841 agagacagag aatataatgc ctcacgacct gaagtggaac agtttcatcc cgaaaccatc 33901 cactgctccc accctccatc ccccagactg tggaaaaatt gtcttccatg aaaccagtcc 33961 ctgatgctca aaaggttggg gactgctggt ctagcgcaga cctctcacac ctgctgtgtc 34021 ccaaattgac ccacccctgc tgcttatttt taccacccct acctggtggt gcgtctgtct 34081 ccccactcaa ctgtgagctc cgtacaatag ggaccagggc tgttgtggtc accatggtgt 34141 ccccagcatc acccagcaca gggtcagctc cctgagttta tggaatggta agcacgaatg 34201 aatgggaaaa cgcatgccag ggaaatgtta gggtcatgct ctctgcccct accctccact 34261 cagacatgag acacctgttt ctgaaatcaa agagcagtaa gtccaggatc tcattgcctg 34321 gttccagtcc tcctaggacg caagtcccat tttgtcaaga aaataaaggg ggtgagcgtg 34381 gtggctcacg cctgtaatcc caacattttg ggaggccgag gcgggaggat cgcttgagcc 34441 caggagttca agagcccaga agctgagact agcctgggca acaagtgaga tcccatctct 34501 attaaaataa aagtgtttag ttaaaagaag aaaaagaaaa taaatggaag catcaaggac 34561 tgttttgcct ctttcgggtt agctggtagt gagcgggaga acgagggaag gcgggcgggg 34621 ctgcagtctc tccacgccct tttcatttcc actcaaggag acctacctgg gaagagtgag 34681 cgctgcaaga cccagcctga ccgcgctaag attactatgc agcgcccgag cccgctgtct 34741 gcccggcaac cgatgacgtc actgttggcg cgcccgtgac gtcagaggcg ggcgccacac 34801 ttgaagaggc tgagggaggc ggtgtcgccg ccgcggcgct gtcatggagc tagcgcagga 34861 agcgcgggaa ctgggttgct gggcggtcga agagatgggg gtgcccgtgg cggcccgggc 34921 cccggaatcg acgctgcgca ggtgaggacc gctcctggag tgaggacacc ccaccctgga 34981 gattgagtct ccgggagtga gaactccacc cctcattgag gactcgactc cccccaattc 35041 cccatcgaga gtgacccgac ttactgaggg ctccctaccc tgggaggctc cccagagtga 35101 ggtccctagc actggtgagg atcctccacg agttagctaa gacctcgctc cttatggaaa 35161 taggatctcc agagtgaggc tgtccccgca gcacctcccc tccctccctt aacctaacca 35221 agaaggacac tcaggagtta gccgtggcca ccccatcggg gattgaagct gtaggagtga 35281 tgccccacgt ctggtcatga gatggcactt cctggcagta catcccccac ctccacctct 35341 ggatgtgact gggataaccc actcccagga gaaagggagc acccgcctct cccgcgctcc 35401 cccaccaccc gccttcccag ggaatgagaa taacaccaca cttagtgcaa acatccagtc 35461 cctacaaaaa aaatgtttta ccaattagcc gggtgtagtg gtgcctgcct gtagtcctag 35521 ctactctgga ggctgaggtg ggaggatcgc atgagcccag gaggttgagg ctgcaaggag 35581 ctatgatcgt accactgcga cagagcgaga ctgtctctta aaaaaaaaaa aaaaaaaaag 35641 gcaaaaaaaa gtttgggcct tccctcctca agaagtgaga actccctagt aaaaactccc 35701 taccaccatc accctaggaa tgaggatccc acaccctgtg tgctgcccca ggcatatgct 35761 tacacactgt ccccacaggc tgtgtctggg ccagggggct gacatctggg cctacatctt 35821 gcagcatgtg cacagtcaga ggtaagctgg gctagagcag gggagggggc acaggtgagg 35881 cccatagtaa ctcctctttc tgcctctgca ctgtaggact gtcaagaaga tccggggaaa 35941 cctactctgg taactgcttc ttaaagctat cccctcctgt cttccccact cccatttagc 36001 cctgtcccca ccacagcatc acacctattg ccctcccccc tagaagcccc actcatccgc 36061 tgactctggc ctcgttttcc taggtatggc caccaggaca gtccacaggt gagaagcata 36121 tgctacaaga ttctctttca tccgaaaaat gacttgtaga tcttctgctc taagcccaac 36181 caggtgctgg gtgatgctgg ggacaatggg tcacaagaca gccttgatct ccatcctcat 36241 agggcttaaa gtccagtgga aacagcatta gctcagccag gtgcagtggc tcacacctgt 36301 aatcccagca ctttgggagg ctcacctgag gcctcatctc aggtgatgag atcacctgag 36361 ggctccagga agcgatcagt actgggacaa agatgtggtt tgggaggcgg atcacctgag 36421 gtcaggagtt cgagaccagc ctggccaaca cggtgaagcc ctgtctctaa taaaaataaa 36481 aaattagcca ggcgtggtgg cacacgcctg taatcctagc tacttgggag gctgaggcag 36541 gagaatcgct tgaacctggg aggcagaggt tgcagtgagc tgagatcgca ctactgcact 36601 ccagcctggg caatagagcg agactctatc tcaaaaaaag aaaaaataaa taaagagcat 36661 taattagctg ggcgcagaga ggtgggacct ggatatgtgg catcaaagct ggtgagaaga 36721 gatgatggga caggtatggg agagatgctg aggccagtgt gggacccaat gggtgcaagg 36781 ggctcatatg acacctaagg ggaggcttcc aggaggcctt gggacttcta ggtgcagggc 36841 tccgggaaga gatcagtact gggacaaaga tgtggttgtc accagcacag acaggaaaca 36901 ggctggaagt gtgggtaaga caatcctagg aggaaggagg gagggagcca ataccctaat 36961 gcttctgtgt cagacacaga aactgctgag caaaaaccct ctgccaccag ttgaggagcc 37021 tggtaccccc aaaagacagg ccaggctgag ccccgatcag accatctgac cctgaatctg 37081 atcctcccag gtccgtcgga agttagagct ggaagctgct gtgacccgcc tgcgggcaga 37141 aatccaggaa ctcgaccaga gcctggagct gatggagcga gacactgagg ctcagggtga 37201 gccatggggc cagaagatgg gcaccagaat gtggggcggg aaaggtgctg atggccaccc 37261 ctgttcctcc cacagacacg gccatggagc aggcacgtca gcacactcaa gacacccagc 37321 gtcgagctct cctcctccgg gcccaagctg gggccatgcg aagacagcag catacgctcc 37381 gagatcccat gcagcggctg cagaatcaac tgaggcgcct gcaggacatg gagaggtggg 37441 cctcagggac ccaccctcac caggctgggt ggccagacat tctcttagag acctcagggc 37501 tctgccctga tcccttcttg ggttccagaa ttctggaacc attaggatgg aatgtccatg 37561 aggccaggag gttttgtcat ctcagtcacc tgtattttca gtgcctagaa tgataccacc 37621 tgtataccta gaacaccagc cctggggaaa cagccacaaa cagagaacat gtccctgccc 37681 tcctggagct tatgctgtga cacagagaag acagatagta aagtaggaac caaacagacc 37741 ctatcatttc tggtagagat gagtgctgtg aagaaaagcc tatcatagaa gggagaggga 37801 gatgctgccc tagcgagggt ggtcagcgaa ggcctctgag gaggggacat tgagttggaa 37861 gaggaatttc tgtggagaag gaatagcaag agcagaggtg gtgtgctcag catgcaggaa 37921 gagcagcccg gaggccaatg tcagctcaag tttctcaggc aagaccgaga acctgaggag 37981 gtgaggtcga aggagtggac agagaactgt ggatgatgtg ttgcgatttt accctgacac 38041 tggttcatgc cctggcttgg ccacttaaaa ttgggtgacc taaggctggg catggtggct 38101 cacacctgta atcccagcac tttgggaggc cgaggcaggc ggattacttg aggtcaggag 38161 ttcgaaacca gcctggccaa catagtgaaa ccccgtccct actaaaaaat acaaaaatta 38221 gtcgagtgtg gtggcacgtg cctgtagtcc cagctactca ggaagctgag gcaggagaat 38281 cacttgaacc caggaaaagg aggttgcagt gagcctggat tgcaccactg cactccagcc 38341 tgggtgacaa gagcaaaact ctgtttaaaa aaaaaaaaat tgggtgacct tgggcaaggt 38401 atttaacttc tttctatgtg cctcagattc taagatctgt ggaatgggga taatattaat 38461 accaagctta tagggtaggc atgatcgtta ggtggaataa atcatgtgaa gtgcttagag 38521 aagtgcctgt taaccattgt tattctgggt ttggtttttt tgttttgttt tgttttgttt 38581 ttttgagaca gagtctggct ctgttgccca ggctggagtg cagtggcatg atcttggctc 38641 actgcaacct ctgcctctgg ggttcaagcg attctcctgc ctcagcctcc ccagtagctg 38701 ggattatagg cgcatgccac tgtgctcgac taatttttgt atttttagta gagacggggt 38761 ttcaccatgt tgtccaggct ggccttgaac tcctgacctc aagtgatcca cctacctcag 38821 cctcccaaag cgctggggta caggcatgag ccaccacgct gggcccattg ttattctgaa 38881 tgcgctgggg aaacatcaca ttattttgga cagatctgag cagggacatg atcttcctac 38941 agttttaaca gaatcactga cttctgggtg gcaaacagtc tcgtggagca aaagcagtgg 39001 ggaggctaat gcaatattcc agctgggagg tgacagtggc ctaatctgga gatggctccc 39061 gagatctagg cttagtgatt cctctgaccg gccttgctgg cagtgagagg agatggggca 39121 cagacatctg ttctcccagg gaccccagcc ccagctgcac ttctccctcc aggaaagcca 39181 aagtagatgt gacctttgga tccctcacgt cggcagctct gggcctggag cccgtggtcc 39241 tggtaagagt cctggtcatg ggagaagggg cagggagcct ggagtcaggc agatgctagg 39301 gtccccctca ggtcccttcc tcctcttctc cctagcgtga tgtccgaaca gcctgcaccc 39361 tccgggccca gttcctgcag aacctcctgc ttccccaggc caagaggggc agcctcccgt 39421 gagtgtccct agcccccaga gaccctgtaa agaccctggg ttttaagtgg gggtctctta 39481 tttattaatt cctcagacac tgagcttctg ggctacgttc tgtagaaggg gacttccagg 39541 ctgggaatag aaggagacag tctctcatca caaacagtga cagccaggtg gtcagggctg 39601 ttgcgggagt gaggaggcct gagaggctct gggcacccag aggcagtgcc tttcccagct 39661 aagaaggcag ggaggggaga ggttctctgg tgaggaggct tagctggggt gaagggtcca 39721 gagagtgtcc cagatgaaca acccatgtaa aggccccaaa gccagcaggt tggtataatt 39781 ggggaactca agtctaacag gcatagggaa gatgacagcc cctaccttat agggtggtgg 39841 tgataactag aggataattg acgtgtcttg taacgttctc tgtccgctta cccaacccca 39901 cagaacccct catgatgacc actttggcac ttcgtaccag cagtggctga gctcagtgga 39961 ggtgagaggg cagggctctc aggaaagcca caccctcccg ctccttgatc aaaacttagg 40021 gtgccaggac ctaactcttg atgacagccc tattcccatg gggtctcagc tcacagccct 40081 gccccgatcc tgccctgact tcaccctccc tccccctaga cgctgctgac aaaccacccc 40141 ccaggccacg tcctggctgc cttggagcac ctggctgcag agcgggaggc agagattcgg 40201 tccctgtgca gtggggatgg gcttggcgac acagagatat ccaggtgtgg ggcagggtat 40261 tctcacagtt ggggaaggcc atgtgagtgg gtgagctggg acagcctcag gacccctggc 40321 ccagccacct gctgatgggg tagaactgaa acccactgcc tggtggggtg gcaggactga 40381 tactgtccac cctcgagtac caaggcaagg ccgcagcacg gctccctcag cagctcttcc 40441 cccaccccag accccaggcc ccggaccagt cagactccag ccagaccctg ccgtccatgg 40501 ttcatctcat ccaggtgacc ccagggctcg cctccccacc acattccaag acttccttaa 40561 tcccctcacc catggatcat gccctgggtg taccctgatt ctgtaccccg cttgcaggag 40621 ggctggcgga ctgtgggtgt gctggtctcc cagcggagca ccctcctgaa ggagcggcaa 40681 gtcttgaccc agcgcctcca gggcctggtg gaggaggtgg agagacgcgt cctgggatcc 40741 agtgagaggt ggggggctct ccgagaagag gtctctaggc catctcgctt ctgagtgtct 40801 cagcccattg gactttttca gcctctgtgg ctcctatctc tgcctcctct ccctgcagtg 40861 tctctccctg gtttatggtc ccttgtgtgg ctcagtcacc cttctctcct ttgctctcct 40921 ccacttgccc tgcaggcagg tgctgatact ggggcttcgg cgctgttgcc tgtggacgga 40981 gctcaaggcc ctgcacgatc agagccagga gctgcaggat gcagctgggc atcggcagct 41041 cctgctgagg gagctacagg ccaaacagca gcggatcctg cactggcgcc agctggtggt 41101 gagaggctag gcccagggcc ttgtggaggg ccagagggga ggcttgaaaa ctatggctaa 41161 gaactgggcc tgagccctgt gtggtagcac atgcctgtgg ttccagcact ttgggaggct 41221 gaggtgggag gatctgagcc caggagttca aggctgaagt gaggtatgat cacaccactg 41281 cactccagcc tgggtgacag agcaagatgc tatcacttaa aaaaaaagaa tgctgtcact 41341 taaaaattaa gaactgggcc tcctaggcaa gagcccaggt gagactccca ggcagaagca 41401 ggacctaagc agagaggcac actgaggctg ggagctggga ggagtgggtc tcagggtatc 41461 ctggttcctc aggaggagac ccaggaacag gtccgcctgc tcatcaaggg aaactcggcc 41521 agcaagaccc gcctgtgccg gagcccgggg gaggtgagat gggagttggg gtgaggtctg 41581 ggtagggctg gatctgccag gatggggctg ggtggacttg tgatgttgtg atgccccacc 41641 cacccctccc cttccccaca ggtgctagct ctggtccagc gaaaggtggt ccctacattt 41701 gaggcagtgg caccacagag ccgggagctg ctgcgctgtc tggaggagga agtccggcat 41761 ttgccccaca ttctgttggg cacgctgctg cggcacaggc cgggagagtg agactggggc 41821 tgccccaccc ctgccctgca tcccctgcca tgggtgactg tcactccctg aacctcctca 41881 aaaaagataa ccccagaccc ccagtgttcc caggctgcca gacctcaccc ttgggtcagc 41941 agccctggtc ttggcatcct agtccccaga cccaccttga agttcctggg cccccagagc 42001 ctccagaacc caaggcctcc tctcagtccc cacccaaccc agacttctcc ccgcccgtag 42061 gttgaagccc ctgcccacgg tcctcccatc catccaccag ctgcaccccg cgtccccaag 42121 gggctccagc ttcatagcgc tgagccacaa gctggggctg cctccaggga aggtgagtgc 42181 ccgtctcctg tgacttgtct ccccagcccc gccccaggtg accgtccttc cctcctcctc 42241 caggcctcgg agctgctcct gccggcggct gcctctcttc gccaggacct tctgctcctg 42301 caggaccagc ggagcctctg gtgctgggat ctactccaca tgaagaccag cctgccgcca 42361 ggccttccca cccagggtag gtctgccctg ccaccccatc cagcctcccc gcccattccc 42421 actaccagca gcctccacga gtccttacca gcaaccccgt tgtttacgca gtgcccattg 42481 tgtgctaagg gctttaaaca ttctcccttg tttcatctgc acggcagtcc cataaggtaa 42541 gtgctgctgt tctctccatt ctacagagaa gggaactgag acgcagaggt tagtaggttg 42601 cccaaatggc aaagctggaa tttgaatcca ggtggtctga aatgggtctg tgctcccaac 42661 cactaagcta cactgccgtc gactttccac gtgttcattc atcaccgtca catagtatac 42721 ctcagcagtt caaagcctgg gaatcctctc tgccgtctct tcctagctgt gtggcctccc 42781 cgtgccttag cttcctcctc agtggagtaa gcatgaagca atatcttacc gaggtgctgg 42841 gcagattgag gcaggggacc cacacagggc actgaacact gcctgctatg cagtaagcac 42901 cctgtgtctc caatgtgatt agcatcattg tctgtgtaca cacacagtct cttggagtca 42961 tgacgatgag gaaagggact tttaagaacc ccagtttaca gatgaggaaa ctgaaggtta 43021 gagaagtcat tcattcctgg attcactcac caaatactgg gtgtctacca ggtgccagcc 43081 cagatggctt gatggggagt gaggagaaag gtccccaccc caggggtttc cccagatgga 43141 agtaaagagt aaagaatgaa aggataaata catggaagag gagaattgat gagaaatgaa 43201 ccggcagtgc cagcagcact ggaggaagag actgttcaag cctctgagtt tggggctccc 43261 tgccctccac ttcttcacaa ggggagggaa cagtgtgcag gagcaatgag agcagcccca 43321 gaccccacac ggcctctgcc cacctcagcc tctgcctacc acgccctctg cccacctcat 43381 gtgttttttg tttttgcttt tgtttttttg agacaaggtc tcattctgtc actcaggctg 43441 gagtgcagtg gtgccatcct ggcttactgc agcctcaacc tcctgggctc aagccatcct 43501 cctgcctcag cctcccgagt agctgggacc acaggcatgc accccacacc cagctaattt 43561 tatatttctt ataatcatgg gcatcttgaa ctgtacacat ctatcatgga actcctggtt 43621 ctctctactc caaaaaacaa ttttgttcct cttcttggtt tccttccttg ttaaattatc 43681 tcaccctcct catccaagtg ctgcacccag aaacccagga gtcatcctga atttctccat 43741 caccctaatc cccccaccct caccagcttc agcctctcag caagccctaa gggtcccatc 43801 cccagagttc tcccatcggt cccctgcccc agtaattcca cctgacagca gacggagctg 43861 aacaaagagc cagtgtcgtg atcaaggtca cagggcaggg atgggatggg gtcaggattc 43921 aggcccagct tgtctgatca cgctgctcgt ctgcacttgc tgagctgggc ctgttctgtg 43981 cacatggaga ggtaggccca gtcccagtcc cagaggaaag caccgctagg gtgaagctag 44041 agcaggcgga ggaggaagcc ttggagcgag agtgttggag gggcctccag gaggaaagat 44101 actctctact gtgggccctg ggaatcggcc aggctttgcc acatcatgaa actcaaatac 44161 ctgctgaagg gcttggactc tgccttgggg gcactggaga accttggagg gttctagaca 44221 ggagggagag catcaaattt aggcttcagg aagatcactg tggtgtagtg cggagggtga 44281 accagagcag gagactgagg cctggagcag aggagaggaa gaaacgctca ggagactgag 44341 cggacaggcc atggagattg atgggctctc acggggaggg aggaaggcat ccagaaggag 44401 gcctggatgg ggctgggggt ggtggactgc ccttgagatg aggatctggg aggagggaca 44461 ggtcctcatc tcaagggcag tccctcagaa ggggacttcc aggctgggaa gagaaggaga 44521 cagactcatc attggtttgc gggagctgct gaggtcaaga ctaggggact ggagaattgg 44581 ggactcattg ctgtaaggta ccagcgagcc agaggactca aagctgcctc ctggctttct 44641 caccctcaga gctgctgcag atccaggcat cccaggaaaa acagcagaaa gagaacctgg 44701 ggcaggctct gaagaggctg gagaagctac tgaaacaggc actggagcga atccctgagc 44761 tgcaggggat cgtgggggac tggtgagagg ggaacgtgcc gcggatggga aacagaaaag 44821 tcaggactca agctgggggc ctgatgagaa ggagcaggcc atggttttgc agggagggtc 44881 tggagggtca gtcctttcct cgttgacgcc atgccattcc ccaggtggga gcagccaggc 44941 caggccgccc tctctgagga gctctgccag ggcctgtccc tgccccagtg gcggctgcgc 45001 tgggttcagg cccagggggc cctgcagaag ctgtgcagct gaagagaggg ttcaaacgga 45061 agccgagaac ttgacactgt tcaccccaac acctcacctc ccccaggaca tttggaagaa 45121 agcagcgcca ggattcctcg gcagtcgtcc ccacccgcac ctgcagtccc ctcatgtgct 45181 gttctgctgc cccactcagc tcctggaccc tgtcctttca tcccgctaaa gcacccccta 45241 aaaccccttc atcactttca ttctcagcaa aaagtaattg agcacctcct ctaggcgctg 45301 gggagtccac actgaacaaa agaaacagaa aaccctgtct tccagcagtt gagttctagg 45361 gcagggagac agagtttaca agataaggaa aatatatatg tagtatgctg caagttaact 45421 gctgtgtggg aaatccagca gggggtggga tgtgtgattt gaattgaggg ccacactgcc 45481 caggtcgtgc tccgtcaagg ggtgagcagg agcaacaggg gtggctgagt aagggcttgc 45541 agctggaggc aacagcacat gcaaaggccc tgagccagga tgtgctgcaa taagggccac 45601 tgagggggac agtgtaggtt gggggtgagg aatgtattaa aagatgagat tgccttctag 45661 tttgtatagt ttcttttttt acatgtaaat cagccatttg gaatttatcc tagcacatgg 45721 atccaatttt gtcttatttt acatatatcc tagtagcact tactgaaatt cccagtcaga 45781 gtggcagttg cctctgaggg aaggtgggtg gcagggatca actgaggagg agacacagaa 45841 atgtcctagc tgatgggaat gtgctgtgtc aagagcagcg gcggtgacgc aggtgtagag 45901 atttgtcaga gctcatgtag ttgtagtctt aaggcctgtg catttcactg cgtgggttac 45961 tcaatttata aagcaaagcc cttacacttt tgagatgcca cctttttatg cactaaattt 46021 ccatgtttaa ttgggttata tctggcttct ctccagacct gctttgtatt taactctcca 46081 tgtgtcattc ctacagtagc ttaatagaaa ggcttgtgac acattgtcat atctgttctt 46141 gttttctttt gttttttttt tttttttttt tgagatgcag tctcattctg ttgcccaagt 46201 tggagtgcag tggcatgatc tcagctcact gcaacctccg cctcccaggt tcaagccatt 46261 ctcatgcctc agcttcccca gtagctagga ttacaggtac gcaccaccac gcccagctca 46321 tttttgtact tttagtaggg acggggtttc atcatgttgg ccagactggt ctcgaactcc 46381 tgacctcaag tgatctgcct gcctcggcct cccaaagtgc tgggattaca ggcatgagcc 46441 accgcgcctg accacattgc catatctgaa atggctggat tctcttaggt gcttttcttt 46501 ttctttcttt ctttcttttt ttgagacaga atctggctgt ttcccaggct gaagtgcagt 46561 ggcatgaact cagctcactg caacctctgc ctcctggatt caaacaattc tcatgtctca 46621 gcctcctgaa tagctgggat tacaggcacc caccaccata tctggctaat atttgtattt 46681 tttgtagagt cggagcttta ccatgttggt caggctgctc tccaactcct ggcctcaagt 46741 gattctatcg cctctgcctc ccaaatcttc acctgctggg attagtttcc cctgtctcag 46801 taaatttttt ttcccaaact aagtgaacat tagatttaat ttggctggcc ccagaagaaa 46861 aacatgcaca gatggtagtt ttactggtgt tgcagtaagt tttcaaattt ggaagtttgg 46921 cagctttatg atcttgaacc ttgctgtcca tctctcacac aactcctctc ttggcctcca 46981 gagtcccttt actctcccag ctttcctcct gcccctggct gctgtttccc agcctttcca 47041 cattggcttt cccatgtagt ccttagtgtg tctgctcctc ctctctctcc tcatcacagt 47101 tcccagcccc caccttcaac tgaactcaac aaaatcttca acttcataca gtagtcacat 47161 tgttagtaat aacactggca tttttatttt gataaaatag accgtttaaa tttttgagat 47221 tctaccttat attttttgaa ttatatacta aagcaaataa gtagtgatgt aatgtcattg 47281 gggaccaaga tttttagtgt aaatgaaaaa agataacaaa tataaactca agtaaacacc 47341 ctgtagtctt tcacatgaat tggaaatatc agaataaagt catgattttt ctctttctaa 47401 aaaatatgta tttccctagt taagtcccct gaaaaggtct aaatataatg acaccccagt 47461 aacactgagc agcactagta tacagactgt ggtctctata agtgccattt ctgactaaga 47521 actagggact tttgtgaaaa cagctgattc caggtctggg gcaagaaatg tgcaaaatgt 47581 gtccaggata ttttgtgcca ggaagcaaag agtcctccaa gactactgtg gccaagtcaa 47641 aggaactcag gagccagtct gaagggctgt cactcaccaa aggtgggaca tgttgagcag 47701 caaaaagaac cacagtggtt tgaagcacat caaatatgcc tgaatgtgta agttcatgtt 47761 gatactcata tacctcattg aaatcacctt tgaagagtac taggaaccca ctcattattt 47821 tgaaaaattt tttatatata aaataatcca gcatgttctc tatttttctt atatgtgaac 47881 aagtaaccaa cttttttttt tttttttttt tgagacaggg tcttgtgctg tcacccaggc 47941 tagagtgcaa tggtgagatt ttggctcact gcaacctccg cctcctgggt tcaagtgatt 48001 ctcgtgcttc agcctcctga gtagctggga ttacaggtgc gcgccaccat gccaggctaa 48061 tttttgtatt tttattagag gcggggtttt gccatgttgg ccaggctggt ctcaaacttc 48121 tgacttcagg taatctgccc gcctcagcct cccaaagtgc tgggattata ggtgtgagcc 48181 actgtgccca gcctaagtaa ccaaatttct gatgaagagg ttatttatgg aagaatcaca 48241 actaataact tcagagaaaa tactagaatt caaaaaccac tatttgctat taattaatct 48301 agtgatggat acaggtaatg atgatcaagg gctactaaaa ctattagggg aaaggttgat 48361 tgggcttctt aaaataggta gaccaggatg acaccactga tcaatcctga tgccaaagga 48421 aggcaaccag acatcttgtg cttctgatgg gatccaatag gatacaaggg atatgatacc 48481 atacaatatt atctatgaag tgtgctttta ccaaattaaa cctaaatctc atctagcctt 48541 agctctcact acaagcatat gggaaagggg cagaaggcag tcagaaacaa gtgaaacaaa 48601 aaggatgtaa ttagttaaat tcaaaatgtg tgacacccta taggataaat gatccagttt 48661 cttcaataaa gagaaggcaa aagggggaat ggagaaagaa agaagaaagg gagggaggga 48721 agaagggagg aaggaaggaa agaaagaaag atggcaggca ggcaggtatt catttgtact 48781 gattcatact gcagcttcaa gaaaccaagg cagccaataa aattaaattt tctttttttt 48841 ttttttgaca cggagtctcg cactgtcact ggagtgcagt ggcacaatct tggctcactg 48901 caacctccgc ctcccgggtt caagcgattc tcctgcctca gcctcccaag tagctcggat 48961 tacaggcgct cgccaccacg cccagctaat gtttgtattt ttagtagaga cggggtttca 49021 ccacgttagc caggctggtc ttgaactcct gacctcgtga ttcacccgcc ttggcctccc 49081 aaagtgctgg gattacaggt gtgagccacc tcgcctggcc tctaagtttt tcttgttggt 49141 aaacatctga gcaagcaaaa atttaagttt tgacaggaaa aaaaaaagac aatcaaatgc 49201 aatatttgga ccttgttttg atcctgattc tactatcgta taaagacatt tttgagacaa 49261 ggaaaactga acacggtctg ggtgttagat gatactaaga aattaatttg atggtattca 49321 taatggcata gttaatggta tttttttttt cttgacactg agtttcactc tatcgcccag 49381 gctggagtgc agtgacgcga tctcggctca ctgcaacctc tgtctcccgg gttcaaacaa 49441 ttctcctgcc tcagcctccc gagtagctgg aaccacaggc gcacgccacc acggccggct 49501 aattttcgta tttttagtag agacagggtt tcaccatgtt ggccaggcgg gtttcaaacc 49561 cctgacctca ggtgatccgc ccacctcgac ctcccagagt gctgggatta caggtgtgag 49621 ccactgagcc cagcgttaat ggtatttttt aaaaatagtt cttatcggcc gggccgagat 49681 cgcgccactg cactccagcc tgggcgacag agcgagactc cgtctcaaaa aaaaaaaaat 49741 agttcttatc tgttaaggag acataatggc gcatttatga ctgagttgat gtctggaatt 49801 tgctttaaca tactccagca aaaaaatata aaaataagaa aagtataagg ccatgcgcag 49861 tggctcatgc ctgtaatccc agcactctgg gaggccgagg cgagtggatc acttgagatc 49921 agaagtttgc taccagcctg gccaacatgg tgaaaccccg cctctactaa aaatacaaaa 49981 attagccggg tgtggtagta cacacctgta atcccagcta ctagggaagc tgaggcagga 50041 gaatcccttg agtccaggag gtggaggttg cagtgagccg aggtcacact actgcaatcc 50101 agcctgggcg acagagcgag actccgtctc aaaaaaaaaa aaaaaaaaaa gaagaaggta 50161 ctttgtaaca ttctccccgc ccttgagaat gtactttata agcctatccc aaacctgtaa 50221 gaactaatga taatcccacc accctttgct gactcctttt tcggactcag cccgcctgca 50281 cccaggtgaa ataaacagcc ttgttgctca cacaaagcct gcttggtggt ctcttcacac 50341 ggacgcgcgt gacaaagaag tatagatcaa gtgaaatcca gaagtgttga atattggtag 50401 cgtttgagat gtacttaaga gtgcgttata ctattcccct taatgtgcat gtttgaaaat 50461 actcagaata aaacacttgt gagggctacg ggaaaaaaaa aagggtcagc tgtggcccaa 50521 gaaaggacag ggaccaagcc cacggacgtg ctcctctgag ttgcactagt ctctgggcct 50581 tggctcccac cctgaacagc tctcatcttt aaatgtcttc agctgtcctc ctccaggaag 50641 ccgggtcagg cgcttcctcc cgggcgtctc cggtcttcct ggaaaccaca ggtcgggtgt 50701 caacctactc caggataggg gccccgaggg gccagccagg cctcccagcc ttgcccaacc 50761 ctggggttga cacggacttt agcgtctttt taaacttccg gcgcgggctc tgcggtggca 50821 caggagggtc ctccaaactt tcctgcgatc tcaaaggtcc agattccaca ttggggatgg 50881 gcccaactga ctttgcttcg ctctcattag ccggtggtcc tccaggaaag cggggccgcc 50941 tctccgctgt gctctcatag gcccaggttc ttgcgttcgt gtgtccttct ctcactctga 51001 ttggcctata ttccccctga gggtgtggcc aaatctcatt ggctctcgcg cccgaatgta 51061 tgacgtcatg cgccagcgcc cgtcgctttt gctggacgtc atcctcggga gcccacccgg 51121 acgaaggggg agagtagaca gcagaaccag cggcggcggc taagcagaga ctgtagtagc 51181 ggcgacagcg acgacggcag cgatggctgg ggcggggcca gccccgggac tcccgggtgc 51241 aggaggaccc gtggtcccgg gtcctggcgc tggcatcccg ggcaaaagcg gcgaggaacg 51301 cttgaaggaa atggaggcgg agatggccct gtaaggcccg cgacagagag tgataaaagt 51361 cgtcccagag ccagagccac cgaatctcgg agcctccagg cctcgatagg ccggagatac 51421 ggctctaaca tggggaagtg ggaccttcgg gattgtgggg gcggggatag gggagaggtt 51481 ggcaggggtg agggtcccct cgcgatatac ccacagttcc aaaattggta cgaggtcctg 51541 agcgctgatg tctgttctac tcctccaggt ttgagcagga agttctgggg gctccagtac 51601 ctggaatccc aactgctgtg cctgcggtgc ccactgtccc cacggtcccc acagtagaag 51661 cgatgcaggt cccagcggct cctgtgatcc gcccaattat cgcgaccaac acataccagc 51721 aggtacggct gagctaaggc atgtaagtca gggaacacaa tgcctcgaac agagtaggtg 51781 tttgtgtaca gagagctgtt gacgttttgg cttgttgact aattagcatg acaaatattt 51841 tatatgccag gcactgagaa cacagtaatg actgagagag acatttctgc cctgtgggag 51901 cttatattct agaacatgtt gtccagtaga gctttctgca atgacgagac tgttctgata 51961 tatttaaaat ttatatttaa tagttcgtta ttctcattag tcacatttca agtgctcagt 52021 agccaactat atatgactag tagagaaaga gagagaaatt attaagatga aaataggctg 52081 ggcgctgggc gtggtggctc acacctgtaa tcccagcact ttgggagacc aaggcgggca 52141 gatcacgagg tcaagagatc gagtccatcc tggccaacat ggtgaaaccc catctctact 52201 aaaattacaa aaattagctg ggtgtggtgg cacgtgcctg tagtcccagc tacttgggag 52261 gctgaggcag gagaatcgct tgaacccagg aagcggaggt tgcagtgagc tgagattgcg 52321 ccactgcact ggagcctggt gacagagcga gactccgtct caaaaaaaaa aaaaaagccc 52381 tgaggcaggt gcatacctgt tgtgttgttg tgttcaagga acagcaagga gaccagtgac 52441 agtggagtga gcaaagagga aagtgggagg agataaggac agagaggtga ggccaaatct 52501 tgtagagcct tatggccatt ttaagaaccc tggtggccag gcacggttgc tcacgcctgt 52561 aatcccagca ctttgggaag ccaaggcggg tagatcacaa ggtcaggagt tcaagaccaa 52621 cctggccaac atggtgcaac cccgtctcta ttaaaaatac aaaaattagg cgggcgtgtc 52681 ggcgggcgcc tataatccca gctactccgg aggctgaggc aggaatatca cttgaaccca 52741 ggaggcggag gttgcagtga gccaagctcg tgccactgca ctccagcctg gacaatagag 52801 cgagactccg tctcaaaaaa aagactgtaa gggacaaggg cagaagccag gagatggagg 52861 aaggggctag tacagtaatc tagacaggag ttaatggtgg cttggatgga ggttgtgcat 52921 gggagatggg gaggacaggt tggacacagt ttcagtgggt gctaagtgct gtggggaaga 52981 tgaaataagg tggtgagaga agttgctgct cagagagccc tagtgtgatc caaaaagcct 53041 ccccgagaag tggcatttta gctaagacct gagatagcca gccacataca gagcacaaag 53101 catgaagctc ttggtctctt ggggtgagct ggccgcagca gatgggaatg ctgagtcagg 53161 cccctcaccc tgacctcttc ctcgacaggt ccagcagact ctggaggccc gagcagctgc 53221 tgcagccaca gtagttcctc ccatggtggg tggccctcct tttgtaggcc ctggtaagta 53281 aagagtagca aggtgagggg gttgggcaat caggcagggt ggtaaaaagg tgtgcacagg 53341 actctcctcc caccatcctt gcctctcccc tcccaacagt tggctttggc cctggtgatc 53401 ggagtcacct ggacagccca gaggctcgag aagccatgtt cctgcggcgg gcaggtgagt 53461 ctggggccag ggacccccac atttaatcag gcactcttgt ttacatgact tcagcctctt 53521 gcctaaatcc ttgctctaaa ttaccctttg cctttgatcc caacttgatg ttgtcatcct 53581 aaagcacgaa cctgatcatg tcctttctga tgatgtcaat tttgttttaa cctgatcatt 53641 gcaaaccaac cctcctgctg tcagtcctaa ttcccagtta ccacagcact taacacctca 53701 taacaagcta tataatttct tatttactat ggctatcatc ttttgcatcc tctaaaataa 53761 aagctccatg aggggcagaa gtctcagtat ctgccacata ggaagcatta tgtactgtct 53821 gttgatggga tgaacggact gttgagacaa gcgccaggct tttcccctgg cagtcacaac 53881 cctacatgac cttgcttctt gcagggagag atccccagca gcccgaacat ttctgggatg 53941 tgccagcagt gcttttgtta aggtcccttc tctccctgct cactgtcacc cagctgtaat 54001 cttccagcca tccttccccc ccaggccatt ttctacattc caccttttgt tcttcaggta 54061 acctttactc acactgtcaa tttgtaggac attacttcct tatatgtatg aaaaggatgc 54121 gtgttcttac ttggatgtaa aattctggta taatctcggt taaaccaaac gtgctactta 54181 tgttacctaa atcctgttat acttattaat ctgatgatct ctgagaagtt gtaatgtctc 54241 ttacttttta gtatggtttt gatcaaaatg caaatctgac ctacttcccc tgcccagctt 54301 ccaaccctgc tgtggctccc cagtgccctc aggaggacac ccatggctca gccaagaatg 54361 ctgtgcttta agaacaaaaa agctctactg gggtgcctgc tggctgctca gcccttctca 54421 tgacactctc taagccccac tacttctatt acagttacac taaactgaaa agggaaaggt 54481 tttgcttata aatattcttc ctgccaataa tattcccact caccccttgc ctaacacttc 54541 attgcccctt gggtctcagc tcacaggtcc ctttctccag gaagccttcc ctgggcctgg 54601 ctgggtcaga gggctcccct ctcttagcac tgatcccact gcctgggctt ccctgcttcc 54661 agcgctggtc cccactggct ctccttctct agtgatgtct ccattccctc ctcctctgga 54721 ccaggagctc cctgagaacc ctgggtctgt tggtgtgtcc ccagcattgc ctgccacagg 54781 gctgagccag aggagcttct cagtggtaaa ttttcatgga atgaaccagg ggagaggccc 54841 ccacatctcc aggagggctc cccagactct tctccaggat ttcttgatgc tgagagtgca 54901 cacacacaca cgcatgcaca cacacacctt gtcccaagtg cccctgtccc ccatgacaca 54961 cacccccatc tctctctccc acagctgtgg ccccccagag ggcccctatc ctgcgtccag 55021 ccttcgtccc ccacgtgcta cagagagcag gtgaggggcc agggtcatca tccctgccac 55081 ataactcccc ccaggccctg gaactcccca ctaacaccct gacagattcc gctctctcct 55141 ctgcagcagc cggcccccgc cctatggccc tacggccccc tcaccaggcc ctcgtgggcc 55201 cccctctgcc tgggccccct ggaccaccca tgatgctgcc accaatggct cgggctccag 55261 ggcccccgct gggctccatg gctgcactga ggccccctct ggtgagtgtg aacagggaac 55321 taacggtcag atgtgcggtg gacggggaga ccatccatcc tggcccaatg ggctatgtct 55381 gagttggcct ctctcctgac tttctgtttc tctaccttct cactctgcct ttgtctctgt 55441 ctctttctgc atctttctgt ggcaaaatct gttctctcct tttgtgaagg gagtacagag 55501 agaaacatgc tattggaata tttctctttt gattgttact gatgttcttg tctgtggctg 55561 tctctctctg cctgtgtttc tccttctctg tgtttgtctg gctggagact ctctgcttgg 55621 gtgtctcttg gctgctctct ggcccccagg ccctttcttt gttggatgga atggagatga 55681 cagtaaagtc tgagcctgtg agccggcccc cctcatgctc tcctcttacc cacaggaaga 55741 gccagcagca ccccgagagc tgggcctagg cctggggttg ggcctgaaag agaaggaaga 55801 ggcagtggtg gcggcggcgg ctgggctgga ggaggctagc gcggctgtgg ccgtgggggc 55861 aggaggtgcc ccagctggcc ctgcagtcat tgggcccagc ctgccgctgg ccctggccat 55921 gccattgccc gagcctgagc ccctgcccct cccgttggag gtcgtccgcg gcctcctgcc 55981 cccgctgcgc attcctgaac tcctgtccct gcgtcctcgg ccccggcccc ctcggccaga 56041 gccaccccca ggcctcatgg ctcttgaggt aagcagggag cctagcggtg aagggacaga 56101 agggacgggg ggcagacagg agcccaggaa cttccacaca gacagaccgg acagcaggag 56161 gacctggggg agggaggcgg acacatgtgc cccacatcca gaagcacatc cagcccttac 56221 tgtggtggca ggagggccgg ggaccaaagg acagtgggag accccaatac caaccccttt 56281 gcacctctcc cctttcctct gcaggtccca gagcccctgg gtgaagacaa gaagaagggg 56341 aagccagaga aattgaaacg gtgcattcgc acagcggcag ggagcagctg ggaggacccc 56401 agcctgctgg agtgggatgc aggtaagctg ctgaagctcg aggtcaggcg tggctcggcc 56461 aaggtcagac caggctgctg tccttgggct gcagactggg agggtgaacc ctctcagtag 56521 ggtgggcgct gttgttagcc caccttgggg aggggaggct atggctcagg ggctggttct 56581 tcacctagcc atgggccgga gcttggtgga ggtgacgggg acatggggct ttgtagcctg 56641 ttttactcat gaggtgcctg gcttcagaga gcattctttt tttttcccaa ctcttaggat 56701 aaaaattttt tttttttttt tttatgagac agagtaccac tctgtcgccc aggctgcagt 56761 gcagtggcgc aatcttggct cactgcagcc tccgcccccc aggttcaagt gattctcctg 56821 cctcagcctc ccaaagtgct gggattacag gcatgagcca ctgtacctgg ccgggaaaaa 56881 ttttaaacac agatggcaca gtaaacaccc gtctacccac cacctagatt ctgcaattaa 56941 cattttgctg tgttcacttt tttcctcttt gtgtatgtat ctggtttctt tgtttttagc 57001 tcagtcattt gaagcaagtt ataagtgaat taatcacttc taaatatctc agtatttatc 57061 tcctaagaaa agagatattt tggccaggca cggtggctca cacctgtaat cccagcactt 57121 tgggaggcgg aggcaggtgg atcacttgag gtcaggagtt caagaccagc ctggccaaca 57181 tggtgaaatc ccgtgtctct aaaaatacaa aaaaattagc caggcttggt gggaggcacc 57241 tataatccca gctactcggt tggctgagac aggagtatcg cctgaaccca ggaggcagag 57301 gttgcagtga gccgagatca ggccactgca ctctagcctg gcaacagaac aagactccat 57361 ctcaaaaaaa aaaaaaaaaa gaaaagagat actttttgca tatcataata ctattatcaa 57421 gtctaagaac attaactaat tcggtatctt cttttcagtt tatattcgta tttccctagg 57481 tgtcccaaaa ctctgttaga actattaatg tttttgccct gaaccagcat gcatgcagtt 57541 ttatgtcctt caattatatc tttttttttt tctttttttt ttttgaaacg gagtcttgct 57601 ccatcgccca ggctggagtg cagtggcgca atctcggcag ctcactgcaa cctccacctg 57661 ccagggtcaa gtaattctcc tgcctcagcc tcccgaatag ctgggactac aggcgcgcac 57721 gaccatgctt ggctaatttt tggtattttt agtagagacg gggtttcacc atgctggcca 57781 ggctggtctc gaactcctga ccttgtgatc cgcctgcctc agccttccaa agtgctggga 57841 ttacaggcat aagccaccac gcccagcctc gattatatct taataagaac ggtctctttc 57901 ctgcctcttt gttttgtaca tattttttga agtgtccagg gcagttgtca tgtatgatgt 57961 cccaccttct gagtttgtcc ggttgtttgc tcctggcatt gcctaacttg tgcttagagt 58021 gtcccatatc acagacgcca tggggcagcc gtgcaatggc tgaggaagga gatggacccc 58081 tcctgcttca gtgtttgact tccaccccct ctcctgggac cgtctacctg gcccagagaa 58141 gaccctagct ctgtgacctt gggcaagtca cttagcctgg ttttgttttg tttcttcttt 58201 tgtttttttt ttttttcttt tttttgagac agagtctcgc actgtcaccc aggttggagt 58261 gcagtggtgt aatcttggct cactgcagcc tccgcctcct gggttgaagc aattctccca 58321 cctcagcctt ccgagtagct gggattacag gcacccacca ccacacccag ctaatttttg 58381 tattgtttag tagagatgga gtttcaccat gttggtcagg ctggtcttga actcttgagc 58441 tcaagtcatc cacctgcctc agcctcccaa agtgctggga tcacaggcgt gagccactgt 58501 gcttggctca cttagccttt taagcctcat attcctcatc ccctcccttg tggggacagg 58561 gattcattgg gacagagcat gttcagtgga agcacaggcc caactcagag gcagcatgag 58621 aatgtcagct gggtgcagcc atggcggagg agcaggcagc atgtccaggt ctcaaggggg 58681 agttagggca ggccttgcca aggaagcagg gctgaggaaa gatgaagccc atgttcctga 58741 ggtgccatga cgggcagcca gtgggtgaca ggggtccagg ggaccaaggg acaccaaggc 58801 tcagaggaaa ggccagggct agggcagagt taagggtgtt actacaggaa tgagtgaggt 58861 ggccctgggg tggggagggc agtgcagttc aaggttaaaa gtaaggactg gagccagatg 58921 cctgagctct tgtcttagct ctgccgcttc ctgcttgtgt cactgtgggc agtgacttca 58981 cctctctgtg cctcagtgta aaagcatggt tgggagtgcc agaaatggaa acacccctcc 59041 aggtgccagc agagctcctg gcgcagggtc agtaggtgtt gaccattatt actaagaggt 59101 ccctaggcag agactagata cctccgaaaa ggtgggatcg ttcagacaag gtagacactg 59161 ggcaaggcct ctgcatcctc tgatgtcatc tcttccccat ccccagatga cttccggatc 59221 ttctgtgggg atctgggcaa tgaggtgaac gatgacatct tggcacgcgc cttcagccgc 59281 ttcccatcct tccttaaggc caaggtgatc cgtgacaagc gcacaggcaa gaccaagggc 59341 tacggcttcg tcagcttcaa ggaccccagc gactacgtgc gcgccatgcg tgagatgaat 59401 ggtgggtgcg gcctcccctg ggaactgcag gcgcggcagg cgctggccta agcctgaccc 59461 gagcctgcct tgaaccctct gtctccacag ggaagtatgt gggctcgcgc cccatcaagc 59521 ttcgcaagag catgtggaag gaccggaatc tggacgtggt ccgcaagaag cagaaggaaa 59581 agaagaagct gggcctgaga tagggtctgt ggccaggcac ccgctcccac ctggccgggc 59641 gctggctcct ccctcagttc tctttggaaa acccccagct gtccacccat cccctgcccc 59701 aaaaccagtt tcaataaatt tacgttcatt tccacccctg gctgggcatg gaagcacgct 59761 gtggggagca gggcttggct tccccacagc tggaaatggc ggggagcctt gagagtttac 59821 ccaggcaaac acccaggcgg gggcttctga gcccttcctg cgcacccatt ccctcttcaa 59881 acctcttcca cacatgctga gtgctgctta ttccagagga gaaagggggg ccaatggcaa 59941 gggtgaagcc aagctgaggc ccatccccgt gtcctggcta cttcctgtgt tctcagcacc 60001 aaggtaggag atctaggccc ttcccttctg gagctcacag gcttctaggc cctcttccat 60061 ccagcagact ctcaccaagg gtctgaacag tgcttgagga cacaatggtg actaaaagtc 60121 ctgggccctg caccgttaga gctcaaggtc cagtgcagaa aataggcccg gtgccctgac 60181 gataagagcc cagagtggtc agggcaggat gggggcgctg acctgtgggt gagggggccc 60241 ttcctgaagg aaccgtccac ccccaagtct cctctggggc tttccagctt ctccagtctg 60301 ccatgcccag ccagctcacc atgctctaag tctcggtggg cacctctatc tctgtctctg 60361 tgggtacccc tccatttctt gtctctaagc ttctatatcc ctgttggtct ctgttctctc 60421 ccttggagct gggaaatgaa aaaggaggag agacccacag atagagagaa cagagacttg 60481 gggtctgttc ccctctcttc ccagctccaa gctggcctct gcctgtctga agaggtgtct 60541 gtgcagggcc ttgaatgcca ggctgagggc ttaagcttta cgcagaaggc agcagcaagc 60601 cccaaaaagc tgagaaaatt caagcagcag caaggtcctg ggcagcacag atgtgcgctg 60661 tagaaagggc cctgtgaccg gctgggtgcg gtggcgcacg cctgtaatcc cagcactttg 60721 ggaggcaggc agatcacttg aggtcacgag ttcgagacca gcctggccaa catagtgaaa 60781 ccctgtgcct actgaaaaat acaaaaatta gttgggcgtg gtggcgggca catgtagtcc 60841 cagctacttg ggagggctga ggcaggagaa tcgcttgaac ctggtaaaag gaggttgcag 60901 tgagctgaga tcgcgccact gcactccagc cttagtgaca gagtgagatt ctgtctcaaa 60961 aaaaaaaaaa aaaaaacgcc ctgcgaccac cactgtggct ggaagtctag aaggctgggg 61021 cagggactgg gccagaggag ggagatggca ggtttgagta tggggaggag tggagggtct 61081 cccagggaca cagccatcgg aaccctccat taggagggag caggttgggc aggactttgt 61141 ggggcataag tcgggggatg gacaatacca aggcctctca gggaccgagg tcccccagga 61201 gaggtgtcga atttctcctg ttcctgtctt ccccgcctta ctgccccctc tctgacgtca 61261 cattttccag tctggtcaga gggtgagagc ctccctggca ggaagtgccc tttagccccc 61321 tgccacctgg gccagaccac ccagccccag gatcctcaaa gccccttccc cccatcacta 61381 gtcttttcta cccctgaccc cccttgtact gacagcaagc atagggtcac gggaggagat 61441 gggaagagag gatggctcca gaaaggacgc ttccaaatgc atgtgattct cacttccaag 61501 tgggagcaga acagcaggta cacgtggaca tgcagagtgg ggtaacaggc actgcagact 61561 ccaaaaggtg ggagcgtggg ggaagctgaa cagttaccag ttgggtacgt tgctcaccat 61621 tcgggtgcac taaaggccca ggccagtata ctggatatac tgacatatat ccagtaagga 61681 atctgcactt gtacccccta aatgtttatc ttttttttta atgcacacaa gtcttcctga 61741 gaacaagaag agacacacac acatagaatg tggcaagcag aggtagagat ggcgctgtga 61801 gcaggaggct cagaggcaga gatgagaacc atcagagact gtgcagtggg aagggggacg 61861 gggagaaaac caagagacag atggagacgg acaaggagat gagataggaa cactcaaaga 61921 cagatgggga gggacagaga tgtcagaaga gaccaggtga caaagcctca ctggagtcag 61981 gcccgtggca acagagaccc agctttagat atgccaacct gaagaaagag ggcagagaga 62041 catggagaga cagacaagag gccagcttca aggatgggaa gccaggactc ccttcaggcc 62101 tccgcgctgg acggatagag ggaccagtga tgctccaggg ctctgtggct taccccacac 62161 tgattccaat cccagtccca ctcccactcc atgggacccc cggtttactt tactgtaaaa 62221 tggaaggagg agcaatgcca cccgctctcc agaacggcaa gtagttaatg atcaagaggc 62281 tgggctgggg actggaagat gccgatatgc aaaagttatt tttaatggta aatattacat 62341 acttattaaa atacagcagg atggctgggc acggtggtgg ctcatgcttc taattacagc 62401 actaaggcta aggcaggtgg atagcttgag gccaggagtt tgagaccagc ctggccaaca 62461 tggcaaaacc ccatctctac taaaaatgca aaaattagcc taggcatggt ggctcacgcc 62521 tgtaatccca gcactttggg agaccgaggc aagtggatca cttgaggtca ggagttcgag 62581 accagcctgg ccaacatggc aaaaccccat ctctactaaa aatgcaaaaa attagccggg 62641 catggtggtg cttgcctgta atcccagcta cttggaaggc tgaggcacca gaattgcttg 62701 aacccgggag gcagaggttg cagtgagctg agatcgcacc actgcactcc agcctgggcg 62761 acagagtgag actccatctc gaaaaatatt ttatatatat ataggacacc cagttattga 62821 attttacgtg aacaatttta tagtttaggt atgtcctaaa tatttcatgt atgtatgtat 62881 atgttacatg tatagctgtg gttttttgtt gttttttgct gaattatttg agcaagttac 62941 agactttttt tttgttgttt ctttttttga gatgaagttt cgctcttgtt ccccaggctg 63001 gagtgcaatg gcatgatctt ggctcactgc aacctccatc tccctggttc aagcaattct 63061 cctgcctcag cctcccgagt agctgggatt acaagcaccc gccaccacgc ctggctaatt 63121 ttttgtattt ttagtagaga cagggtttca ccatgttgga caggctgatc tagaactcct 63181 gacctcagat gatccacccg cctcagcctc ccaaagtgct gggattacag gcgtgaatca 63241 ccgcgcccaa ccacaagtta cagacattat aacatttcat ctctgagttc attttctaaa 63301 tgatatcttt cctcctgaat accattatca aatctaagaa aagaattcta tgatatttca 63361 tattcagttc atactccagt ttccccaatt gtcctaaact attacagcgc ggaagggcat 63421 ggggtctgga gaaggagagc agtgggagga tgcctgcctt accttccccc ttcatcaaat 63481 gagagtatgt attgtacttg cctcatagaa ggattcaatg aaggggtatg gcacagttgg 63541 tttcacacaa tgagccctca tttaatgttg gccgttattg ctactattgt tattgttgaa 63601 attgttgatg tcaatattgc tatgattggc actcctggga agcagcccca ggacgccctc 63661 cctactgggc ctggtggagg attgggtggg ccttcactcc tgctccacgc ccccgcagtt 63721 actctgccga ttgtgacgtc agctgacgct gggggcgggt gggggaatct ggccggaaat 63781 ccctcttcct gttgcagata agcccagctt agcccagctg accccagacc ctctcccctc 63841 actcccccca tgtcgcagga tcgagaccct gaggcagaca gcccgttcac caagcccccc 63901 gccccgcccc catcaccccg taaacttctc ccagcctccg ccctgccctc acccagcccg 63961 ctgttcccca agcctcgctc caagcccacg ccacccctgc agcagggcag ccccagaggc 64021 cagcacctat ccccgaggct ggggtcgagg ctcggccccg cccctgcctc tgcaacttga 64081 gcctggctgc gacccctgct ctgacgtctc ggaaaattcc cccttgccca ggcccttggg 64141 ggagggggtg catggtatga aatggggctg agacccccgg ctgggggcag aggaacccgc 64201 cagaggtgag cgatgaactg aggactagat gcctgggtgt ctgggttagg aaggacctgg 64261 gggactagac tcccaagaag ccgggggcct ggactcctgg gtctaacaga ggaagagagc 64321 tggggtccct cactcccagg accaagattt taggctcctg gggaaggagg gagcggaggc 64381 ctggactcct ggctctgagg gaagctaggg ctggggccca gactccaggg cctccaagtg 64441 tcaccagctc acccattgcc atctggactt ttcccgaccc agaacattca gaaggccttc 64501 atcgcatcca tggacctgtg gaactgggat gaggcatccc cacaggaagt gcctccaggg 64561 aacaagctgg cagggcttgg taggctgccg aggctgccac aacgtgtgtg gggagggtgt 64621 ccaggtgggg cctctgctga ccctaacccc ttatcgcctg cagaaggagc caaattaggc 64681 ttctgtttcc ctgatctggc actccaaggg gacacgccga cagcgacagc agagacatgc 64741 tggaaaggtg gctgcgggct gggaccccta agtgctggag aagaagcggg gaggctggga 64801 tcctagggca aagggaggag gggggcgtgc ctaggttcct gggactgggt ggggaggggc 64861 cgcgtgcttg acccctgagg gtgaaggaaa agggggcgcg gggtgctgaa atacgggctg 64921 gggggccata actcccagtc cctgacaagt agagactaga gagtgggtag ttgaggggtc 64981 tctttcattg ctcacagtcc tccctaaact caggtacaag ctcatccctg gcaagcttcc 65041 cacagctgga ctggggctcc gcgttactgc acccagaagt tccatggggg gcgggtgagt 65101 gtggggagag gcggtgggag gtggggactg gggtcccgag gcaccggggc tagaggtgta 65161 gactccctga tctttgagga ctgagaacac ctgcgccctc aaggtggcat gacctggatc 65221 cgggtcagcc gggccccaag tgccagggtt gagagcttag accctagagt ttttgagggg 65281 gcacctgggc tcccctcact cgggatccgt tactcctcac agagcccgac tctcaggctc 65341 ttccgtggtc cggggactgg acagacatgg cgtgcacagc ctgggactct tggagcggcg 65401 cctcgcagac cctgggcccc gcccctctcg gcccgggccc catccccgcc gccggctccg 65461 aaggcgccgc gggccagaac tgcgtccccg tggcgggaga ggccacctcg tggtcgcgcg 65521 cccaggccgc cgggagcaac accagctggg actgttctgt ggggcccgac ggcgatacct 65581 actggggcag tggcctgggc ggggagccgc gcacggactg taccatttcg tggggcgggc 65641 ccgcgggccc ggactgtacc acctcctgga acccggggct gcatgcgggt ggcaccacct 65701 ctttgaagcg gtaccagagc tcagctctca ccgtttgctc cgaaccgagc ccgcagtcgg 65761 accgtgccag tttggctcga tgccccaaaa ctaaccaccg aggtgagagg gccgcaaaga 65821 ctgcggggag ggcgaagctg gagtcctgag ccgggaccca ggcacctaag ggggcggggc 65881 ccgggagact gacagtgagg gggcgggggc ttagggacca ggggctcgaa ggaggggccg 65941 gtggcccgca ctccaggtcc ttggggagga gagggctaag aaactggtag tcttataggg 66001 accaagggga tgaggaccca ggctcctgga ttatataaaa cgaaagcgat aaaggcccag 66061 attcctgggt ctccgagatg gggaggccaa actcctaaat ctctgagact ggggccctgg 66121 acgcttgagt ctccaaggct gactgttgga tctcagagaa gggggggcgg atccccttct 66181 cgggtcctgg gtcccgagtt gggaggaccc ggacctctag atcattgaag tggtgtgatc 66241 tagggccggg aagactgagt gtgcccctcc cttcatcccg caggtcccat tcagctgtgg 66301 cagttcctcc tggagctgct ccacgacggg gcgcgtagca gctgcatccg ttggactggc 66361 aacagccgcg agttccagct gtgcgacccc aaagaggtgg ggcagctccc ctgcccagcc 66421 aaatccgccc cgtctcttct agttcaattt agctccgccc aagggctagg ttcaaccgcg 66481 tagccctcgg ccccgccgct ccccggccca ctcgaggccc cgcccaaccc ttctcaaacc 66541 caatctcccg cctgtactcc tgcctcaacc aacccagtct ccaccgggct ctgcgaggcc 66601 tcgcccaggt ctgcactgca caccgccccc aggcccggcc ctccccacta tcgccaagcc 66661 ccgccccttc ccactccgac cgagcgggcc tctgtcctag gtggctcggc tgtggggcga 66721 gcgcaagaga aagccgggca tgaattacga gaagctgagc cggggccttc gctactacta 66781 tcgccgcgac atcgtgcgca agagcggggg gcgaaagtac acgtaccgct tcgggggccg 66841 cgtgcccagc ctagcctatc cggactgtgc gggaggcgga cggggagcag agacacaata 66901 aaaattcccg gtcaaacctc ttcgcgcgtg ctcctctgca gcattcttcc cgctgatact 66961 gactacagca gctagggagc ctaagtgtgc agctccacat gttggaattt ggcctcccca 67021 cctcggagac ccaggcgtct gggccttcat ctctttcctc tcacatgatc cagaggcctg 67081 gatccttgct cgctaaaaga ctgcagtgtc ttagtcctct tctccctgaa gaccctgaag 67141 tgcgggccac cagccccaag tcccttaagc tcccaggtcc ctaagccccg ccccacgctg 67201 tcagtttccc cggtcccgcc cccttaggcc ccgcctctcg actccgcctt cactccctcc 67261 gcaatcttcg cggacctccc tcctcggtgg ttagttttgg gatgaggaac cacccacctc 67321 cagtgctgcg caatgatcaa aggagaccca gactcaggga gcctgccatg tgcccgccct 67381 gtgatgagag aggcaaagaa actaaaccca aagcccatag ggcctactgt gtccctgccc 67441 tgtgtcggga tttgtaaagg aatcagaccc aatcatgaga gtatatctgc tgtgtcccct 67501 tgaagagaag actcagaccc tgaaaagggc ttactgtgct gtgtgtcctc ccagtgcaag 67561 ggatcatcaa agaattaaaa ctagtgcttg caccccaaca ccactcactc tttccccact 67621 ccaagctgga ggacacagga ctcagaccta cactgtaaag agagcctacg ccattctgag 67681 agccactgag cccctaaaga gagcctgctg tctgtgagcg atgcggggga tacagcaggg 67741 accaagacag ccctggaccc tccccttaag gagctcacag tccagtggga agacagaccc 67801 atcatcagag tgttttgacc tgggatgaga agactccagg ggactgaggg aggccaaaag 67861 gagagacctg atctagccca ggaggagaag ggtcagggga gactccctgg aggaggagac 67921 agctgatttg agaagtcagg taggcaaaga atgtttcagg cagaaggaac aacaagatca 67981 aaggcccaga ggcagacgat ctcataccac tgctcaaaac tatcagtgtg ggccaggcat 68041 agtggctcac acctgtaatc tcagcatttt ggaaggctga ggcaggagga tgccttgagg 68101 ccaggagttc cagaccagcc tgagcaacat aggggagacc cccatctcta tttaaaaaat 68161 taaaaataaa cgtggctaca gcagcaacag ggaccaattc ataaaggatc tgtgggccgt 68221 ggtgaggagc ctggactttg tctccagggc tcctggggag ccatcaaggg ttttgtttgt 68281 ttgtttgttt tttgagatgg agtctcactc tgtcactctg gctggagtgc agtgatgcaa 68341 tttcagctca ctgcaacctc cgactcccag gttcaagcaa ttctcctgcc tcagcctccc 68401 aaatagctgg gattacaggt gcacaccacc acgcacagct aatttttgta tttttagtaa 68461 agatggggtt gcaccgtgtt ggtcaggctg gtcttgaact cctgacctca ggtgatccgc 68521 ctgccttggc cacccaaagt gctgggatta taggagtgag ccaccgtgcc tggcccccat 68581 caaggggttt taagcaggga aagggatttg gattttagga agatctcatt gcaggggatg 68641 aatacatgca tgtaaacagg taattccaca aataatcaca atgttgatgg acacagaatg 68701 tgtaagaaga ggtccctgac ccaattccat ctggacactg caccccagat ggtatgtgag 68761 aggctttaga ggccagatgg gggaggaagg aaagcagttc tctttcatag ctccctctcc 68821 tagtccctag atgcttcctt catttccaca agaaattctt gcattcagca aatatttata 68881 gaccacttcc tgtgtgccag gcactgcttt agatacaatg gtgaatgaat aaggcagaca 68941 aaaattcttc ctttatgata attcttttct gttgagggag gcaaaaagag gtaaacataa 69001 tgtaaaagaa cagatatagc cgggcacggt ggctcacgcc tgtaatccca gcactttgga 69061 aggccaaggc gggcagatca cctgaggtca aaagttcaag accagcctga ccaacatgga 69121 caaaccgcat ctctactaaa aatacaaaat tagccaggtg tggtggcgca tgcctgtaat 69181 cccagctact ggggaggctg aggcagaaga atcgctctag ctcgggaggt ggaggttgca 69241 gtgagccaag atcacaccac tgcactccag cctgggcaac aagagcaaaa ctctgtctga 69301 aaaaaaaaaa aaaaaaaaaa aaaaaaaagc caaattggcc aggtagggca gctcacgcct 69361 gtaatcccag cattttggga ggccgaggtg ggcagatcac ctgaggtcag gagttcaaga 69421 ccaacctgac caacatggtg aaatcccatc tctaccaaaa ataccaaaat tagccgggag 69481 tggtggcgtg cgcttgtaat cgcacctact cgggaggctg aggcaggaga atcgcttgaa 69541 ccgagtagac gcaggttgca gtgagccaag ttcgtgccac tgcactccag cctgggcgac 69601 agagtgagac tccgtctctc tctctatata tatattttaa gggtagggat ttttgtgttt 69661 tgtgcagcgt gctttctcaa cccgtaaagt gcctgatggc tggatacggt gggtcacgcc 69721 ggtaatccta acgctttgga aggccgaggc gggaggatcg cttagagacc aggagttcga 69781 gtccagcctg ggtaacacag caagacccca tctctataaa atgcatatat atatccagga 69841 gtagcggtgc gcatctgtaa tcccagctac tcgggaggat gaggcgggag aatcgcttga 69901 acccaggagg cggaggttgc agtgagctga gatcgcgcta ctgcactcca gcctgggcca 69961 tagagtaaga ccctgtctca aaaatttaaa aaaaagaaaa aaaaaattta aagaaaagaa 70021 atacctgaca cagaaaaagc gttcaataaa tatttcttgc atgaataaaa gtactttcgc 70081 tgggacattt ttctccacac ctaccctcca gccagccccg ggatctataa cactctctcc 70141 gagtgaagac acagaggtga tgagatcggg taacccacgg gcgcccgagc cccagccgcg 70201 caagctcacg attgggccag aagtgaggat gaacaagcat ttagcccaat agaaagtcgt 70261 agttctcttg ctgggcgtgg cttgaatgac ttcagtggcc tcctcctggg agggagctga 70321 agccgctcgc aagactcccg tagtccccac ctctctcagc ttccggctgg tagtagttcc 70381 gcttcctgtc cgactgtggt gtctttgctg agggtcacat tgagctgcag gttgaatccg 70441 gggtgccttt aggtgagtgt ggagggttct gtaacctggg accccagtct agcgggctga 70501 ggagccggac cccagcttcc ctgagaacgg gttgaggctt ccggctggcg gcgtccggcc 70561 tccctggacc ccggggccct gttctgttca tcgtaagagc taacaatgat acggttcttc 70621 cttgatgcca ggcaacatcc caaacaactt taattaattt atttaactct cccaccagcc 70681 ctatgagata gggcctgcga ttatcccctt ttgtcagtaa acaggctcag agataccaag 70741 cgacctagct gaacctacac agcgaccagg atttggccca ggagtttgtc ctccagggct 70801 cttcatggct aattcacagg cgtggcctct cgcggcttgc ctgggccgca gcctggctcc 70861 cgggtgtcca gtccatcctc ccttttggac gttggagggg ccgcagcgtc atctccgatg 70921 caccctcatc ctcagatacc gggtagagat ctcgccggta acgatccttc ctactgcgtc 70981 tgtgccccag cagctggtgt tttacacccc acttccccac cgaaagactg ttttagtgct 71041 ccagaccccc accaccacct tgcgtcgctt gaatttgaca ctgctctgca agtctctgta 71101 tagccccacg actcctagga ttccctggaa tatagatact tttctctcag gctcccgacg 71161 aacactgtca cttcttcaga gacattatat taaaggtgtt taaccctggg tccgcagagt 71221 tggtagaatt caaggaatgc tagtacttga attggggtgg ggggtggaat tgcatctgtg 71281 tttttactaa ccttttgagg cagcaacctc ggtagtacta gcagcagcgg tgattttgtt 71341 agcaatgaaa atcatagaca ttcctttttt ttgtttttct ccttttttga gacagggtca 71401 ctgtgtcacc caggctggcg tgcagtggag aaatcttggc tcactgcaac ctctgcttcc 71461 ctgtgcaagc gatcctgcct cagtagctgg gaccacaggc acgcgccacc acacctggct 71521 aattttaatt tttttttttt tttttgagac aagatcttgc tgtatcgccc aggctggagt 71581 acaatggcac aatctcggct cactgcaacc tccgcctctt gggttcaagt gattctcctg 71641 ccccagcctc cctggtagct gggattacag gtgcacgcca ccatgcccag ctaatttttt 71701 gtatttttag tagagatggg gtttcaccat attggccaag ctggtcttga actcctgacc 71761 tcaggtgatc cacccacctc ggcttcccaa agtgctggga ttacaggcgt gagccacctc 71821 ttccggccaa ttttaatatt ttttttgtag agatgagatg gggtttcacc atgttgccca 71881 ggctggtctt gaactcctgg gctcaagcga tcctcccacc tcagcattcc aaagtgctgg 71941 gattacaggt gtgaaccatt gtgctccacc caaaatcaca ggtattctta taccacatta 72001 taacagcaaa tgtcacaaaa tactatattc attcatgatt acttcaaaat tgtatttttt 72061 tttttttttg agacagtgtt tttgctctgt tgcccaggtt ggagtgcact ggcacaatct 72121 cggctcacta caacctccgc ctcctgggtt taagcgattc ttttgcctca acatccagag 72181 tagctgggat tataggcacc tgccaccaca cccagctgat ttttatattt ttagtagagc 72241 cagggtttca ccatgttggc caggctggtc ttgaactcct gacctcaggt gatccacccc 72301 cctggcctcc caaagtgcta ggattacagg cgtgagccac cgcacccggc gaaaattgta 72361 ttccttaatg aatcctcagc tggatataac tgcttttctt tgtgatgctg tgcaggtttt 72421 tttcacttca tgtactcatg gagcctatgc aatttttaaa tacatttaaa catattcagg 72481 ccgggtttgg tgtctccgtt ctgtaatctc aacattttgg gaggccaagc cgggcagatc 72541 atttgaggtc agaagttgga gagcagtctg accaacatgg taaaaccttg tctaaaatac 72601 aaaaaaatta gcagggcatg gtggcaggcg cctgtaatct cagctgttcg ggaggctgag 72661 gcaacagaat tgcttgaatc ttggaggcgg agtttgcagt gagcctaggt cgtaccagtg 72721 agcctgggtg acagagcaag actctgtctt aaaaaaagaa aaaaaaaaac attcagcgag 72781 aggtttctgg gattcatcag actgctgagg agtttgtggc accaaaaaga ttaagaatct 72841 ttgcagtgta gattatgcca acaacggagg cagcccacca gttattttta tgccgtttaa 72901 ccgtaggaag tcacttagac accctgttcc ttcctcgtct gggcaatggt gatagtaata 72961 gttcctacct tttagagtgg ctataggagt aaagtgtgta gaacagcgcc tggcatgtag 73021 taagcactca gtgagtgtta gctcatgtta tttggggacc aggcttgatt ttgctgagga 73081 gttgtacccc aaccctgccg cgacctcgac actcagatgt gtggattcgc cctgcccctc 73141 atctctgtgt atgttgagag gcaggaactt gttcacaccc agcccattgt tccgtgcctg 73201 ccctcttcca ttgatcctgg gtagtctggc ttgctcaggg cccctggggc ccctgctgac 73261 acccactcct ttcgcctcca ggattcagca ccatggcgga agacatggag accaaaatca 73321 agaactacaa gaccgcccct tttgacagcc gcttccccaa ccagaaccag actagaaact 73381 gctggcagaa ctacctgggt aagcaggacc tttccctggc cacatacctc gagtcactca 73441 ccgcttgcct cttcctaggg gacccatcca ccccagcctc ctccctttta ttctgaacat 73501 ccttactctg gaaggcccat gcctctcatc acctgctgtt caggtccagg ctagggttac 73561 aaactttagt tcctgcaatg attttttttt taatttaaat tttttattat tattatggtt 73621 tggttttgtt tttttccaga gatggggtct cactttgtca cctaggctgg agtgcagtgg 73681 tgcaatcata gctcactgca gcctcaacct cctgggctca agaggtcccc ctacctcagc 73741 ctctcgagta gctgggacca caggcccaca ctactgtact tggctaattt ttttgtagag 73801 aagaggtctc agtttgttgc ccaggctggt ctcaaactct tggcttcatg cgctcttcct 73861 gcctcagctt ctcaaagtgt tgggattata ggtgtgagcc accgtgcctg gcccctacaa 73921 tgattttttt ttaagctgta tgtgtgtgcg tactgtaaca tgtacagaaa gcacataaca 73981 aacatgtaca gctaaatgaa taattctaaa gtgaacccct tgtgcaacca taacctggag 74041 agaaaagcag acattgccca cactctcatc tccctgcctt cttttttttt tttttgagat 74101 ggagttttgc tcttgttgcc caggctggag tgcaatggtg caatcttggc tcactgcaac 74161 ctccgcctcc caggttcaag caattctcct gcctcagtcc cccaagtagc tgggattaca 74221 ggcacctgcc accacaccca gctaattttt tgtattttta gtagagacgg ggtttctcca 74281 tgttggtcag gctggtctca aactcctgac ctcaggtgat ccacccgcct cggcctccca 74341 aaatgctggg attacaggca tgagccaccg cgcctggcct ccccaccttc tttatgtcct 74401 gtctggaaca cagcaccccg ctctccccag tagagaagtg acgccactaa ctgggggtta 74461 tcacatcttt gtttttggtt ttggtttttg tttttgagac tgagtttttt gttttcgttg 74521 cccaggctgg agtgcaatgt cacgatctcg gctcactgca acctcttctg cctgctgggt 74581 tcaagcaatt ctcctgtctc agcctcacta gtagctggga ttacaggcac ccgccaccat 74641 gcccagctaa ttttttgtat tttttttttt ttttagtaaa gacaaggttt caccatgttg 74701 atcaggctga tcttgaactc ctgacctcag gtgatccacc cacctcagcc tcccaaagtg 74761 ctggaattac aggcgtgagc cactgtgccc ggcctcacat ctttgttttt taccaccatg 74821 tttattattt tattttattt tatatatgtt atgttgttat gttattgatg gagtctgttt 74881 cttgttgccc aagttggagt gcaatggcac gatctcagct cattgcaacc tctgcctccc 74941 gggttcaagt gattctcctg cctcagcctc ctgagtagct gggaatacag gcgctcgcca 75001 ccacacccag ctaatttttt tttttttttt tttttgagac ggagtctcgc tttgtcgccc 75061 aggttggagt gcagtggtgc aatctcgact cactgcaacc tccgcctccc agcttcaagc 75121 acttctctgc ctcagcctgc cgagtagctg ggattacagt cgcctgccac catgcctggc 75181 taattttttt tttttgtatt tttagtagag acagggtttt accatcttgg ccaggctggt 75241 cttaaactcc tgacctcgtg atccaccccc ctcagcctcc caaagtgctg agattacagg 75301 tgtaagccac catgcctggc caattttatt tatttatttg agacagagtc tcgctctgtt 75361 gcccaggctg gagtgcagtg gtacaacctt ggctcactgc aacctccgcc tcccgggttt 75421 aagcaattct cctgcctcag cttcctgagt agctgggact acaggcgcgt gccaccatgc 75481 ccggctaatt ttttgtgttt ttagtagaga caggatttca ccatgttggc caggctggtc 75541 tcgatctcct gacctcatga tctgcctgcc tcggcctccc aaagtgctga gattacaggc 75601 gtgagccacc gtgcctggcc aattttttgt attttttttt tttttttgag atggagtctc 75661 gctctgtcgc ccaggctgga gtgcagtggc gtgatctcgg ctccctgcaa gctccgcctc 75721 ctgggttcac accattctcc tgcctcagcc tcccgagtag ctgggactac aggtgcccgc 75781 caccgcgccc agctaatttt ttgtattttt agtagagatg gggtttcacc gtggtctcta 75841 tctcctgacc tcgtgatctg cctgcctcgg cctcccaaag tgctgggatt acaggcatga 75901 gccactaggt ccagccactt ttttgtattt ttaatagaga cagggcttca ccatgttgtc 75961 caggctggtc tcgaactcct accctcaagt gatccacctg ccttagcttc ccaaagtgct 76021 gggattacag gcctgagcca ccatgcccgg ctaccaccgt gtttagattc acctctttgt 76081 gaactttata catactgtgt gtattctttt gtgtttggat tttttcattc agtaccttct 76141 cagatccata catgctgtgg catatagctt gctcattttc attgctgtcc agttttctac 76201 aatttattct atgcctgtac acttgtgtga tttccatttt ggagttctta caaatggtgt 76261 ggctgtaaac atctttataa atgtaatttt gctgggtgca gtggctcacg cctgtaatcc 76321 tagcactttg ggaggccgag gtgggcagat caagaccagc ctggccaaca tggcgaaacc 76381 ctgtctctac taaaaataca aaaattagct gggcatggtg gcaggtgcct gtaatcccag 76441 ctacttgaga ggctgaggca ggagaatcac ttgaacccag gaggaggtag aggttgcagt 76501 gagctgagat cgcaccactg cactccagcc tgagtgacag attgagaccc tacctcaaaa 76561 aaaaaaaaaa atgtgttcca actcttgatc tgggctgact tgaacccctt tcttcacaga 76621 cttccaccgc tgtcagaagg caatgaccgc taaaggaggc gatatctctg tgtgcgaatg 76681 gtaccagcgt gtgtaccagt ccctctgccc cacatcctgg gtatgtgcct cctgccaggg 76741 cccttgggat gctggggtgg ggtcttagca gaggggagtg tggtggcttg gtgggagctc 76801 atctgtgagg ggcagaggga ggacagggca ccacactgtc ccaggactca gtgcctttcc 76861 tcccgcctag aattacctcc ctgtctcctt ctgctgatgc ctctcaacca gtcagggctc 76921 tcagctgagg cgaccagggt gctttgctag atgtagctca ctcatgcact ctccaaatac 76981 acactcagca ctgggctccc atccctgtgt agctcacaat tctgtgcagt cctgtccttt 77041 tttattagga acattttttt ttcttttttt ttttcttttt ttttttttag acaagggtct 77101 tgctgtgttg ccaggctgga aaggctggag tgcagtggcg cgatcttgcc tcactgcaac 77161 ctccgcctcc tgggttcaag cgattcccct gcctcagcct tccaagtagc tgggactaca 77221 ggtgtgcacc accatgccca gctagttttt tgtattttaa tagaaacggg gtttcaccat 77281 gttagccagg atagtctcaa tctcctgact tcatggtcca cctgcctcgg cctcccagag 77341 tgctgggatt acaggcgtga accaccgtgc ccggccctat tatgaacatt ttcaaagcag 77401 agataataat tcattgacct caaatacatc catcacccaa agtgaatagt tatcaggatt 77461 tatccacggt ttcttcatct accccttttc gttttcttct ttcctttgct gaagtattct 77521 aaagcaagtc ccagacacgt catttcaccc ctgcctactt cagtgtggac ttctaaataa 77581 tgacacagta agacagtcct ttcagccctg atctgaccaa atctgtcgtc aatatttaat 77641 agctctttgc tatgcccctg ccctttgttt aaaaaaaaaa aaaagttttt aaagacaagg 77701 tctcatgatg ttgcccaggc tagactcgaa ctcctgggct caagtgttcc tcctgtctca 77761 gcctcctgag tagcttggac tacaggcatg tgccactgtg cctgtctctc tctctctctc 77821 tctctctctc tctctctttg tcgccctctc gctctctctt gctctctttc tctctcgctc 77881 tctctttctc actctttctc tctttctttc ctttctttct tttctttctt gagccttgct 77941 ctgttgccca gactggagtg cagtagcatg atctcagctc actgcagcct cagcctcctg 78001 aatagctggg actacaggca tgagccacca cacctggccc aatttttgta ttttttttgt 78061 agaaatgggg tttcgccttg ttgcccagac tggtctcgaa ctcctgagct caaagcagtc 78121 tgcctgcctc agcctcccaa agcgctggga ttataggcat gagctaccgc acccagctga 78181 atctttcaaa ataaaaaaaa ttacccttta acttggcaat tccacatcta gggatgctac 78241 aaaaatcttc atgtacaaag atgcccattg cagagttgtt cgtaatggag aatatttgga 78301 aataacccaa gttaaaacat tgatcttgat agtctttatg gagactatgg tgatctacat 78361 gctagcacac agaaagaatg ctaagacaca ggaaaagtaa gttgcagaac aatctatata 78421 gtaggagtca atttgcaagg atggagtttt tatggtgtgg tatgtgtatg taaataaagt 78481 aggcacatac cgctggcact tattatgggc cagctatgtt gttagacctt tttttttttt 78541 tttttttgag atggagtctt gctctgtctc ccaggctgga gtgcagtggt gcaatctcag 78601 ctcactgcaa cctccgcctt ccaggttcag gcgattcccc tgcctcagcc tcccgagtgt 78661 ctgggattac aggcgcctgc caccatgccc agctgatttt tgtattttta gtagagacag 78721 agtttcacca tgttggccag gctggtctca aactcctgac ctcaggtgat gcacctgctt 78781 cggcctccca aagttctggg attaccggcg tgagccacca cgcccagcca tgttagcact 78841 ttacatcaca taaggtcaac atgctaacac aaccgtatga ggtaggtact attgttagta 78901 tcctcatttt acaaaattgg aaactgaggc ccagaaaggt tgagtaactt gcccaaagtc 78961 acacagctag gaagtgatgg agcagggatt cagattcagc ctctctgtta tccatgcttt 79021 catccactgt tcgttcttgc accagtggtc ctcacccttt ttggcaccag ggatgagctt 79081 catggaagac agtttttcca cagacagggt tggggatggt gtaaggatga gtccagcata 79141 ttacatttat tgtccacttt atttctgtta ttattacatt atataataaa ataattatac 79201 aactcaccat actatagaat cagtgggagc cctgagcttg ttttcctgca actagatggt 79261 cccatctggg ggtgatggga gatagtgaca gatcatcagg cattagattc tcataaggat 79321 catgcagcct agatccctca catgcgcagt tcacaatagg gttcgcactc ctaagagaat 79381 ctaatgctgc cgctgatctg acaggaggtg gaactcaggc agtaatgcga gcattaggga 79441 gtgcttgtaa atatagttga agcttgtctc gctcacctgc cactcacttc cagctgtggg 79501 gcccagttcc taacaggcca cagaccagta cctggtaggc actggttggg gctcggggac 79561 ccctgcctta tagcataaaa tgtgagcgtg tatgtatata aaaacgttta tgcttgtaac 79621 tgcttgtgag gtccacaaag agtagagtgt tactttcagg gagtgttaac ttgacatagg 79681 ccttatatgg gccttttcat tttttttttt tttttttttt tagagacagt cttgctctgt 79741 caggctggag tgcggtgtgc agtggtagca tcatagctca ctgcagcctc gaactcctgg 79801 gttcaagcag ttctcccacc tcagcctccc tagtagctgt gactacaggt gcatgccacc 79861 atgcctggct aagttttgta ttttttgtag agacagggtt ttgccacatt caccaggctg 79921 gtctcaaact cctgggctca agcgatcctc ccacctttgc ctcccaaagt gctggaatta 79981 taggcgtgag ccaccatgcc cggcttattt tttatttttt gtagagatgg gggtcttgcc 80041 gtgttgtcca ggttggtctc aaactcctga gctcaagcag tcctcccaca tcggccccgc 80101 aaagtgctgg gatgacaggt gcgagtcacc acacccagcc tccatcattg tttatactga 80161 gcatcgcctg tggtcagtgg tgggattgca ctgtgtattg tttggcacct tttttgcact 80221 tgactatgta tcttgaagat ccatccattc atgccagaac ataaatcagc ctcgtttttt 80281 ttcacagctg cctgctattc catctcccct gtttggacaa acagcatagt gttgacttga 80341 ggttttcaaa gcagggccca cccttggagc tctgtacagt gacacatctc cccatcgaca 80401 gctctgcagt gggcctgggg ttttcctgcc cattactctc ctcccagcat gtcagcttgt 80461 tcagaacagt gatttctgtc ttttggggtc actgctgatt ccccggcctc tagaatagag 80521 gttggcacac agcaggtacc tgtggatatt ggttgaacaa acaggtgggc aaagtgagga 80581 agataagaag tccatccgtt cagtttcccc actgcggagg gaataacact gtctttccac 80641 aggtcacaga ctgggatgag caacgggctg aaggcacgtt tcccgggaag atctgaactg 80701 gctgcatctc cctttcctct gtcctccatc cttctcccag gatggtgaag ggggacctgg 80761 tacccagtga tccccacccc aggatcctaa atcatgactt acctgctaat aaaaactcat 80821 tggaaaagtg agactatgcg tgtgaacggg ccaggcagtg gcaagaagct gggctggaat 80881 cagtgccctg accagggagg ggcttgaatg agtattttct ctgttttgtt tttatttatt 80941 taatttaatt taattttttt tttttttatg atggagtctc gttcttcacg caggctggag 81001 tgcaatggca cgatctctgc tcactgcaac ctccacctcc agtgttcaag cgattctcct 81061 gcctcagcct cccgactagc tgggactaca ggcacacccc accacacctg gctaattttt 81121 gtatttttag tagagatggg gttttaccat gtcggccagg ctggtctcaa actcctgacc 81181 tcagatgatc cacctgcctc ggcctcccaa agtgctggga ttacaggcat gagccaccat 81241 gcctagcgtt ttttaactta attttattga accttggcca ccctctctgt gccagtcaat 81301 gagtcaaata gttgattata cagaataacc taagttccta tgtgtgcaac caccctacat 81361 aattttaaag caaattatat agattataat ataatataaa atacataaag attatatttt 81421 atccagaaat aagaaactat atgtatctct tggagataca attttttttt tttttttttt 81481 gagacagagt ctcaatctgt tgcccaggct ggagtgcagt ggtgtcatct cagctctctg 81541 caacctctgc cttctgggtt caaactattc tcctgcctgg agatacctat ttttttaaca 81601 taattattta cacctaagaa attagttctt aatggccggg cacagtggct catgcctgta 81661 atcccagcac tctgggaggc tgaggtgggc agatcacctg aggtcaggag ttcaagatca 81721 gcctgggcaa catggcgaaa cctcgtccct actaaaagta cagaattagc caggcatggt 81781 agcacatgcc tgtaatccca gctactcggg agtctgaggc aggagaatcg cgtgaacctg 81841 ggaggcagag gatgtggtga gccgagatca tgccattgca ctccagcctg ggcaacaaga 81901 gtaaatctcc atctcaccaa aaaaaaaaaa aaaaaaaaaa aagagttctt aatatctaat 81961 aagcaatctc ctgcacaatg tgtaacttct ccccgtgttg ggacctgctc tttcatgctg 82021 atcatgatgg tagttggaag tcatttaagt aggccaagtt gagagctgtt cttcatggtg 82081 cccccaaaag aattgccaca caataacagc ccagaatatc cccaacagac tcgtcaacag 82141 ggacctagct agcatgacca gccggaacac tcttatggct gaattggtgt gtcctcaaaa 82201 cctttcaaat ggccggggtg tggtggctca cacctgtaac cctagcactt tgggaggccc 82261 aggcgagtgg gtcacgaggt cagggattca agaccagcct ggccaagctg gtaaaacccc 82321 atctcttcta aaaatacaaa aaaaactata aaaatacaaa aattagccag gtgtggtggt 82381 gcgtgcccct aatcccagct actggggagg ctgaggcagg agaatcactt gaacctggga 82441 ggtggaggtt gcagtgagct gagatcacac cactgtactc cagcctgggc gacagagcga 82501 gactccatcc ccaaaaaaca aaaacaaaac ttcaaatggt tacaaacttg tctgatgcat 82561 tcacatgagc agggatgatt cttgtatgga gaaaaatggc cccagagata gatcttgtga 82621 aatagctgct catgaagaaa agtggctgta ggggccgggc gcggtggctc acgcctgtaa 82681 tgccagcact ttgggagacc gaggcaggca gataacctga ggtcgggagt tccagaccag 82741 cctgaccaac atggagaaac cccgtctcta ctaaaaatac aaaatgagcc aggcgtggtg 82801 gtgcatgcct gtaatcccag ctactctgga tgctgaggca ggagaattgc ttgaacccag 82861 gaggtggagg ttgctgtgac ccgagattgt gccattgtac tccagcctag gcaacaagag 82921 cgaaactccg tctcaaaaaa aaaaaaagtg gctgtaaaca ggaccttagt gcacacacag 82981 ggagcttgag ggatcgtggg aggttatggc gtgggttgga agtagcaaag aacttaaaca 83041 ccatgaaaaa aaaatttttt ttttgaaatg gagtcttgct ctgttgtcca gactggagtg 83101 cagtgatgtg atctctgctc actgcaacct ccacctcccg gttcaagcaa ttctgcctca 83161 gccttccaaa acgctgggat tacaggcgct tgccaccaca cccagctaat tttttgtatt 83221 tttagtagcg atttagagag tagtttcacc gtgttgacca ggctggtctt gaactccgtg 83281 acctcaggtt aaccatccac ctcggcctcc caaagtgctg ggattacagg catgagccac 83341 cacacccagc caacaccatg aaaatttcaa atgaaacagg acggccagtg aaaagtcaag 83401 gtgctggact ccttgagtcc agctgcctga ttctgcccag ggtttgccac cagtgccagt 83461 gtttcctgta cattccttta gagatagctc atgcatctag aagtatttgt aaatatattg 83521 atctgattgg atatggtgcc cttttgctca cacagcccct gccacactgg cctcttcagt 83581 gcacaggaac tgtgagcaaa tgtatttcga cacaccaggc tggctccaac cccggatcct 83641 ttacacttgc tgtttgctcg acctggaatg tttggccacc agctgtccac aagctcaccc 83701 ctcactgcct tcaggtcttc atttagcgag gccttctctg accactcagc cttcaccttc 83761 agctcctcca tctttgtgcc ccttcactcc ttttgtttct ttctcttttt tcaattttta 83821 tttatttttt tagtttttgg agatggagtc tctctgtcac caggctggag tgcagtggcg 83881 caatctcagc tcactgcaac ctcagactcc ctggttcaag cgattcttct gcctcagcct 83941 cccaagtagc tgggattaca ggcacgcgcc accacgccca gctaattttt gtatttttag 84001 tagagacggg gtttcaccat gttggccagg atggtctcca tctcctgact ttgtgatcca 84061 cccagctcgg cttcccaaag tgctgggatt acaggtgtga gccaccacac ctggccttgt 84121 ttcttaagca cttaccgcta acatgccaaa tgttatcctt catcattgca ccactggaat 84181 gtaatttcca tgcgggcaga tttggaaata tatatttaat tccccttttc agcttttttt 84241 ttttttttga gataaggtct tactctgtca cctaggctgg catgtagtgg cgtaatctca 84301 gtttactgca gcctccgccg cccagactca agccatcctc ctacctcagc ctcccaagta 84361 gctaggacca caggcataga ccaccatact cagctagttt tttgtttgtt tgtttgtttt 84421 tgatagagac ggggtttcac catgttgccc aggctggtct caactcctga gctcaggtga 84481 tctgcccgcc ttagcctccc aaagtgctgg gattacaggc ctgagccacc gcacgccagc 84541 ctgcattttt tccaaatgtt ctaatctttc tcattttatt attttttttt ttttaatttt 84601 ttttttgaca caagacctca ctctgtcacc cagcctggag tgcagtggcc tgatcacagc 84661 ttactgcagc cttgaactcc caggctcaag agatcctccc accccagcct ctcaagtaga 84721 tgggactaca ggcatatgcc accacaccca gctaattttt gtattctttt gtagagatgg 84781 agtttgccca tattgcccag gctgatctcc aactcctggg ctcaagtgat cctcccgctt 84841 cagcctccca aggtgctggg attacaggca tgagccacca cacccagccc agctgttaat 84901 cttattgatg attaattata cgtgatgggt cacttgtcag ttgctgcttt caaggttctc 84961 tttttatcgt tcaataattt gattatgata tgtctggtga tggatatctt tgagtttatc 85021 ctatttgaag attattgagc ttcttggctc tagattgatg gtttttttca tcaaattttg 85081 ggagttttca gccattcttt cttcatatat tcttttttac ccctttcttt ttgtcctctc 85141 cttctgggat tcccattatt tgatgatgtc tcacaggtct ctgaagttct gttcacttat 85201 cttcgtgctt ttttctttct gttccttaga cttgatcatc tcaactgatc tatactcacg 85261 ttcaccgatc ttttcttctg cctactcaaa tctgttgttg aaccctctat tcctctattg 85321 aattttttct tttttttttt ttttgagacg gagtctcgct ctgtcgccca agctggagtg 85381 cagtggcgcg atctcggctc accgcaacct ccacttcccg agttcaagtg attatcctgc 85441 ctcagcctcc tgagtagctg gtactacagg cccataccac catgcctgat taattttgtg 85501 tatttttagt agagacaggg tttcaccatg ttagccagga tggcctcgat ctcctgacct 85561 cgtgatctgt ccacctcggc ctcccaaagt gctgggatta caggcatgag ccaccacgcc 85621 tggcccattt gaattttttc attttagtta ctgtactttt caagtcccaa gtttctgttt 85681 agttctttta aaataatttc tgcttttagg ccaggcatgg tggctcacac ctgtaatccc 85741 agcactttgg gaggccaaag tgggaggttc acttgaggtc aggagttcaa gactagcctg 85801 gccaacactc catctcaaaa aaaaagacag tattctttag cattagttcc gctaatgatt 85861 ggacaaagat ttccttagcc tccaaaaacc aataatctcc tagcctttgc tgaggcatgg 85921 tgggccatgc tttcgacatt cagccaggca attgacaact ctgccttcgc cactacttct 85981 cccaggaagg cacacatcct caaggtcagc ccaatgtaag aagactgggt tctcatctct 86041 ggtctttctt gagcctgcat atactcctgg gcatgcacac aaccctatgc atgcatgtga 86101 ccttctagat ttccaagaat attttggagt ttttcaaagc cttctcattc cccagcttta 86161 tcttttaaac tgtttcatta gcctattatt ggcctcaact gttattcatt gcttaggcag 86221 ctgcctagtt aaacaattgc atgcaattgt gtttcacaaa tgcccctggg gaaataattt 86281 tgacgatgta tgagctctaa gttaggtcaa atacagacaa cattgcaagt aggggcttcc 86341 agggaaccat aagacgaatc aaataattct gactttctgg gagtgaggct tttatggagt 86401 ttcatcctct tctgccccgt ttggtgtatg ccaggctgct agtttccact gagaatgctg 86461 gcagttaata ttcaaggcta ccatggagct ggaatggggg aatggaacta ggccaattaa 86521 aaatgtcaca aagtataaaa attagccggg cgtggtggcg ggcgcctgta atcctagcta 86581 ctcgggaggc tgaggcagga gaattgcttg aacctgggag gcagaggttg cagtgaactg 86641 agatcgtgcc actgcattcc agcctgggcg acagtgagac tctgtctcaa gaaaacaaaa 86701 ataaaaataa attttaaaaa agtcataaag ttcactgctc ttactaagat tcagctgctt 86761 ttcttgaata aacactttac agatttttgt tgttgttgtg gagacagagt ctcactctgt 86821 cacccaggct ggagagcaat ggcacgacct cagctcactg caacctctgc cttcgaggtt 86881 caagcgatta gctcctgcct cagcctcctg agtagctggg attacaggca cttgccacca 86941 cgactggcta attttgtatt tttagtagag atagggtttc accatgttag tcagcctgac 87001 ctcaggtgat ccccccaccc tggcctccca aagtgctggg attacaggcg tgagccacca 87061 cgactggcca aaaaaactat tttaaaatct aaagtgggtt gactaaaacc aaattaacca 87121 ttttaatcgg aaaacatctc atgtatctta ttaccaaacc agtgccttga cgatgggatt 87181 ttcagtgtca tttgccaaat tcgcaggcac cgcagtgggg agccactgct cccacagcag 87241 aggcaataca cggcctagcc ttggggttga gaagttctgg ggacggtgtc gggggccctg 87301 ggccaccatt tttgcatgga gccagccctg cgctgcgagg gcgtggcatc ggagagaccg 87361 tggcctcgtg ggcagcatcg gcggcccctg gggacacgcg ctggctcagt ccccacctcc 87421 gccagggaag cggtctgatg atacccattt cgcagatggg caaatccaca agccaagtcg 87481 tgccgcccag aagaggcaga accgggattt gaacctaggt ttctcattct gcacctgaag 87541 aaataaatgt ctggtccatg cgtgtgttgg ggggcgacgg ggacttttct tcaaccaagc 87601 ccggtgtgcc ccctctcatt ttctagcacg ttcttctccc ccatctttgc acttatctca 87661 gactcccctc acacacctat cctgggtgcc atcctttccc tcagcccacc gcccatgtag 87721 tcagcctcct aagcgcccct tccacctgtc ccctcatctc cccactgctc tgcctcggtc 87781 cagccctggc cttggcaggg gtccctgcct gggcctcccc acgggctccc ggcctcacag 87841 aggagcagcc acctgtgcaa ggtcacacag caggagagac acttctggga ttcaggaccc 87901 acctgtgggt ccgaagtgga ttttttgttt gtttggtttt ggagacaggg cctggctctg 87961 tcactcaagc tggagtgcaa tggcacgatc ttggctcact gcaacctctg tccccaggtt 88021 cagccatcct cccacctcag cctcccaagt agctgggacc acaggcatga gcctggctaa 88081 ttatttttgt attttttata gaaacggggt ttcactatgt cacccagtct ggtctcaaac 88141 tactgagctc aagcgatccg cctacctcag tttcccaaag tgctgggatt acaggcctga 88201 gctaccacgc cccgccctga tttggatctt gctcctattc ccaggaaggc ccatgtccta 88261 gcaagagaga caccaggtgg tgcgcacgcc cctccacctc caactcctcc accgccaaca 88321 aagcctggga cccagaggcc atgggtccta cacaatctgg tatttcaggt ctataaatca 88381 tgggggtggc agggagccca tcaaagctgt tcatccagcc tggtctgccg gctcagcctc 88441 ttgtccaccc cactgccgtg ccccaaaaca cacacacaca cacacacacc ctccctcctt 88501 cctgggagac tagcccaagg aggggcctgg gcgtatatag ggagccacgg tgggtggggc 88561 tggccagaga ggctgcagac agagaaggtg aggagggggc cctgggagtc tgggtatggc 88621 atgaggggtc tgaggctcag ggaggggcag tgccaagttc aggccagtga ctgccccacc 88681 ggagcccggg tgtcctcttg tgcaaaatgc ctccccacca gggctgtggg tacatcagta 88741 ccagcctggg ccccgcagag agctggagca acccctcgaa gccagggtgt ggctcagaga 88801 tggggggtcc ggctggccct ttacgctcct ggcctcatcc ctgctcatgt gtcccaggat 88861 gatggcgtct gcggcagcag cggaggccga gaagggatct ccagttgtgg tgggcctgct 88921 agttgtgggc aatatcatta ttctggtgag actcgcccca gtgctgggag ccgagtggtt 88981 gtgtggtggg gaggggacag tcctgagctc ggggtggggg atgagggtcc agaactgtct 89041 ctcgggcaga ctcagtggtc agcacagcct gggttgaaat cccagctctg cctctcccca 89101 gctctgtgac cttagcaagc gactgtccct ctctgtgcct tctttcctca tctatagact 89161 gtgtcctcat tgtgtgggct gttgtcatgt ggattaatcc atgtgtggcc ctcagctcag 89221 agaaagtgcc cgaccaatgc tggctatcac catcagcatc actatggata taaagtgaca 89281 gcctggtgca caggaggcca caagaaggaa aaacaacagt gtctttcaat taacattggg 89341 ttttggtttt gtgttttttg ttttgttttg ttttgttttg ttttgttttg ttttgttttt 89401 gagacggagt ctcactctgt tgcccaggct ggagtgcagt ggctgatctc agctaactgc 89461 aacctctaac tcccgggctc aagtgattct cctgcctcag cctccagaat agctgggatt 89521 acaggcgcac accaccacgc ccggctaatt tttttttttt cagacagatt ctccctttgt 89581 cacccaggct gaagtgcagt ggcgagatct cggctcactg catcctccac ctcctgggct 89641 ccagcaattc tcctgcctca gcttcccgag cagctgggat tacaggcaca tgctaccaca 89701 cccggctact ttttgtattt ttagaagaga tggggtttca ccatgttggc caggctggtc 89761 ttgaactcct gacctgaggt gatctgccca cctcagcctc ccaaagtgtt gggattacag 89821 gtgtaagcca ccgcgactgg ccgtgtgttt gttttagaga cagggtctca ctctgttgac 89881 caggctgcag tgcagtggtg tgatgcttgc tcactgcagc ctccaactcc tgggctcaac 89941 tgatcctcct gcctcagcct cccaagtagc tgggactaca ggtgcacatc accacatcta 90001 gctaattttt ttattttttg tagagacggg gtctcactat tgcccaggct agtctcaaac 90061 tcctgggctc aagcaatcct cccgcctcgg cctcccacaa tgctgggact gcaggtgtga 90121 gccaccttgc ctggcccatt tgtaatgttt atggccatgg ccatgttccc ctcctcgggg 90181 aataacagca ctgtcaaggg agggcagatg gaccaagcat ctcctctgtg ccagtccact 90241 gctccagcct ttccacgtgg catccccttt tatccccgaa gaaactggca gctggggcgt 90301 gggggtcggt tgtctggctg ggctgtctct tctctcctgg cctctgctca ccctcatcgc 90361 tgctcatgga aggcgaggtc tttgcccact ttactgagag ggcagtggag gctccaaatg 90421 gggagggaac ttgctcgtgg agatgcctgg tgccagggct gctggccccg agcagacctt 90481 cctaacccac cgctctgtcc agctgtcagg cctgtccctg tttgctgaga ccatatgggt 90541 gacagccgac cagtaccgtg tatacccact gatgggagtc tcaggcaagg atgacgtctt 90601 cgctggtgcc tggattgcca tcttctgcgg cttctccttc ttcatggtag ccagttttgg 90661 tgtgggtgcc gcactctgcc gccgccggtc catggtcctc acggtgagac tccaggggtt 90721 gggggatggg gacactgaaa acggagttca gcatcactga gtcatagtag caggtgcagc 90781 cccagccact agtgtgagtt ccccatggtg tcacactcag ggatcccagg catcatctca 90841 ttcagtcttc ccacagccca ataacccagg cttgagtagc agagctggga ttttttcttt 90901 tttttttttt tggtaaagat gggggtctca ctatgttgcc caggctggtc tcaaactcct 90961 ggctcaaatg attctcctgc ctcagcctcc caaagtgctg ggattacagg cttgagcccc 91021 tctacctgca cccagtgcag ggctagaatt tgaacccctg ctagctgctt tccagagtcc 91081 atgctcatga actctgccaa taatagcaca gctaccattt aataatggtt tactacaagc 91141 cacgagctgt tctaaatggt ttactggatt agttcattta ttcctcatag tccaccaaga 91201 gatactgtca tcatcatgac cctgtttgat gaagattgaa actgaggctc agagaagtga 91261 agtcacttat ccagggtcac acattgagac aatggcagag ccagtttcaa cctgagcaga 91321 tcggctcttg aggctgtgct cttaatctct gcccacaaga ataacccata cttcccaagt 91381 gtctctcatg cacctgacac tagggtaagc agtttagaaa atgagcttcg tggccaggcg 91441 cagtggctca tgcctgtaat cccagcacgt ggggaggctg aggcggacag atcatttgag 91501 gtcgggagtc caagaccaga ctggccaaca tggcgaaatc ccatctctaa aaaaaaaaaa 91561 aaaaaaaaaa ttagccacgc atggtggcag gcacctgtaa tccccgctac tcaggaggct 91621 ggggcggaag aatcactaga acccgggagg cagaggttgc agtgagccaa gatcacacca 91681 ctgcattcca gcctggatga caaagcgaga ctccatctca aaaaaaagga aagaaaatga 91741 gcttcctcaa tcttgccaga agcccttcac agctagtgct tttcaaccca tttaccagat 91801 gaggaaactg aggctcaaag aggctaacag cctctcccag gtgaccagga gagcagggat 91861 tccaattcac ggccccaatg acacagagac agcactgtgt ttcctcgaca gcagagggtt 91921 tttcagattc ccagaagcag ccactcattt gatgaacaaa tgcttatgag caggagctac 91981 atgctgagcc ctacgctggg cactggggct ctgtgctgag cgaaacataa aagcccttga 92041 cccagggaag tggtgctcag aggaggaaga cagtgaacag gaggggaggt caggagccga 92101 ggagggcgag ggaagacctc actgccaggg aaagcaaaga cctagaggca ctgggagcga 92161 ggcaggtgtg taactggagg aagagccttc caggtggagg tggcagcaag ggcaaaggct 92221 ctgaggccag agcgggccag gagtccaaag cagctggagc tgagtgagga ggaggttggg 92281 gggtgagagg acatgaggca gacagatggg gaggagaaca gtgaggaatg gggatgggaa 92341 agaagcagga gggaaggcag gtgttgtgga ctttggcttt atcctgagta agatggaccc 92401 atagaccccc tgctcccctc ccctcccttc ccctcccctt ccctcccctc ccctctcctt 92461 ccttccttcc ttcgtttttc tacacaggca ggtgttgtgg actttggctt tatcctgagt 92521 aagatggagc catggactcc tcccctcccc tcccctcccc tcgcctcctc tcccctcccc 92581 tcccctcctt tcccctcccc tcccctcctc tcccctcccc acctccctcc ttccttcctt 92641 catttttcta gacaggcagg tattgtggac tttggcttta tcctgggtac gatggagcca 92701 tggacttcct tcccttcctt ctttccctcc ctccctccct ccctccctcc cttccttcct 92761 tccttccttt ctttgtttct ctagacaggg tctcactctg ttgcccagtt tggagtgcag 92821 tggcacaatc atagctcatt gccacctcga actcttcatc tcaaatgatc ctcccacctc 92881 agcctcccaa tgagctggga ctacaagtgt gcatcaccat gtgcagctaa tttttgtatt 92941 ttttgtagag atggagtctc actatggtgc ccaggctggt ctccaactcc tggcctcaag 93001 tgagcctccc acctcagcct cccaaagtgt tggaattaca ggcatgagcc accacacctg 93061 ccctcttatt ttcttatcct gtatttaatt atctggcata acaatttaac caaagcccta 93121 aaaagtcacc caactagttt taaaaattgt gtctgggccg ggctcggtgg ctcacgcctg 93181 taatcccagc agtttgggag gccgaggcag gtggatcaca aggtcaggag atcgagacct 93241 tcctggctaa cacggtgaaa tcccgtcttt actaaaaaac aaaaaattaa ccaggcgtgg 93301 tggcgggcgc ctgtagtccc agctactcgg gaggctgagg caggagaatg gcgtgaaccc 93361 gggaggcgca gcttgcagtg agcagagatc gcgccactgc actccagcct gggcgacaga 93421 gcaagactct gtctcaaaaa aaaaaaaaaa ttgtgtctgt ttccccatta gtttgtcatt 93481 aactaagaca ttttctcccc aaatttcttt tttttttttt ttttgagacg gagtcttgct 93541 cagttgccca ggctggagtg cagtggtgcg atctcggctc actgcaagct ccgcctcccg 93601 ggttcacgcc attctcctgc ctcagcctcc cgagtagctg ggactacagg cacctgccac 93661 cacgcctggt taattttttt ttttgtattt ttagtagaga tggggtttcg ccgtgttagc 93721 caggatggtc tcgatctcct gacctcgtga tccgcccacc tcggcttccc aaagtgctga 93781 gattacaggc atgagccacc gtgcccggcc cccaaatttc ttacaggaaa tttctaaagt 93841 agactagagt ccgcagtatt tgagccctgc atacctctca cctgatttca acaacatcag 93901 ctgcagactg cccgacattg tctcctctat cccccttctc tcatactttt tcttctgggt 93961 attttgaagc acattctaga tgccaagtca cttgcctgta aatactggag cacacatttc 94021 tctgagtgat aagcacattg ttttaaagca gtcaggaggt tgtgcaatgt catccatctc 94081 ccagcagcag agttcacaca tccacacgca acctcctgct gctttataga tgaaggtgcc 94141 aggtgaggac atgaaaagaa tttgcattaa acatcaactg acaagctttt ttttttctcc 94201 tgccctctgt gccgttggca caacccacca aattcaatca agttcttcta tatcttccaa 94261 tacccagtct gtgtcaaatg tctctgatta tctgagaaca tcgttttatg ttgtttgttt 94321 tattttaaga gacagggtct cactctgttg cccaggctgg agtgtaatgg cacaatctcg 94381 gctcattgca acctctgcct cctgggttca agctattctc ctatctcatc ctcctgagta 94441 gctgggatta caggcaccca tcaccatgcc cagctaattt ttgtattttt gcatttttag 94501 tagatggggt ttcaccatat tggtcaggct ggtcttgaac tcctgacctc aggtgatcca 94561 cccgtctcaa cctcccaaag tgttgggatt acaggcgtaa gccactgcac ccacccttgt 94621 ttgttttgtt ttaagagaca gggttttgct ctgttgccca ggcaggaatg cagtggtgcg 94681 atcatagctc gctgaggcct ccaactactg ggctcaaacg atcctccaac ctcagcctcc 94741 caaatagcta ggactaacac taccactagt agtaactatg cttaactaat ttttttattt 94801 ttattttttg tagcgacggg gtctcactgt gttgcccagg ctgatcttga actccagggt 94861 tcaagcaatc ttcccatctc agcctcccaa agtgctggga ttacaggtgt gagccaccaa 94921 tgcccagccc taatttttgt atttttagta gagaaggggt ttcaccatgt tggccaggct 94981 ggtctcgaac tcctgactca ggtgatcgcc ccctcggcct cccaaagtgc tgggattata 95041 ggcgtgagcc accacgcctc gcctgtctca aaaataaatt aattaaatta aatatttcaa 95101 aatacagaaa agttggaaga caatacagcg aacacctgaa cacccaccac atattataga 95161 tgctgctagt cacactgccc caaacttttt taaaacttgc atttccccag tgatgcttcc 95221 ctgacggggt gtggcttcac gggctctgcc cgacgggccg tcctccctct gtctccccag 95281 tacctggtgc tcatgctcat cgtctacatc ttcgagtgcg cctcctgcat cacgtcctac 95341 acccaccgtg actacgtgag cccggcgagg cctggggagg ggcctgacct gcatgggagg 95401 ggcgagggtc ccggcgcggc ggggcggagg tcgtcctctc tccccaccca gccctgccgg 95461 cccttgttct tccctgttct cctcagatgg tgtccaaccc atccctgatc accaagcaga 95521 tgctgacctt ctacagcgcg gacaccgacc agggccagga gctgacccgc ctctgggacc 95581 gcgtcatgat tgaggtgggc ggggtggacc gggtgctggg agggccctgg gctccgtcca 95641 cagaggccga tagagctccc gatgtgccag cttttagagg agtgagtgcc ttaaagacat 95701 ccgctcatga gatctctagg agagcagggc tgcggtcatc accgttctgc ggatgagtaa 95761 acaggcccag aaagggaaag tcccttgccc gaggccacac agtcggtaag tggtgggagc 95821 tgggattgga acccggcagc ctagctcctg acctccctgc tcttagccta cacatactac 95881 ctcaacagaa atgttgataa taacaatgat agtaacagca ccattcactg agctaggttt 95941 cacactgttc aaaggacttt tatttttata ttttgagatg gggtcttgct ctgttgccca 96001 ggctggagtg atcatagctc actgcaacct ccaactcctg ggctcaagtg atcctcccgc 96061 ctcagcttcc cgagtagctg ggaatacagg tgcgccccac cacacccggc taagtttttt 96121 aaattatctt ttgtggagat gggggtcttg ctctattgcc caggctggtc tggaactcct 96181 gggctcagga ttggagcgat tctcccacct ccgcctctca aagtgccgag atcacaggca 96241 tgagccacga gtgcctggcc tcaaagcaca tttttttttt tctttttatc tttttttttt 96301 tttttttttt tttgagacag agtcttgctc tgtcgcccag gctggaatgc agtggcgcga 96361 tctcggctca ctgcaagctc cgcctcccgg gttcacgcta ttctcctgcc tcagcctccc 96421 gaatagctgg gactacaggc gcctgccacc gcgcccagct gattttttgt atttttagta 96481 gagatgggtg ttagccagga tggtcttgat ctcctgacct tgtgatccac ctgcctcggc 96541 ctcccaaagt gctgggatta caggcttgag ccaccgtgcc cagcctttca aagcactttt 96601 aaagcttttt taatcctcac gaccactcca taagggagat cggcctttta gtgatgagga 96661 gaccaaagca agaggttgaa tatcttgcat gaggtcacac acctggtagg tcacagaact 96721 ggaaggattc aaactcagac ttccaggttt gaagagatac agtcttcctc aaatccaaca 96781 tgatattgca ctggaaaagg aagaaattgg ctgggtgcag tggctcaaac ctataatccc 96841 aacactttgg gaggcaaagg cgggcagatc acttgagttc aggggttcaa gaccatcctg 96901 gccaacatgg tgaaaccctg tctctacaaa aaatacaaaa attagtcagg tgtagtagtg 96961 tgcgcctgtg attccagcta cttgggaggc tgaggcagga gaatcacttg aacccaggag 97021 ggggaggttg cagtgagcca agatggcgcc actgtactcc agcctggaca acagagggag 97081 actccgtctc aaaaaaaaaa aaaaaaaaga agaagaaatt gaggactcaa ggaaaactga 97141 ggcgtcaatg tggcccaagt cctctagcta gccaggattt gaacccatgt ttgaagggtg 97201 cccaagccta tgttttcctc attcaggagc cacaaattaa acacctggat tccttaccaa 97261 ggccaggaaa agccagaaat ctgccctttt tgtgttattt tatcagcata gttgttgata 97321 aaaatatttt attttatttt attttttgag atggagtttc attcttgatt cccaggctgg 97381 agtgcagtgg cacgatcttg gctcactgca gtcgccacct cccggattca agcaattctc 97441 ctgccccagc ctctcaagta gctgggatta caggcgccca ccaccacacc cggcaaattt 97501 tgtattttta gtagagacgg ggtttcatca tgttggtcag gctggtctcg aactcctgac 97561 ctcaggtgat ccatccacct cggcctccca gagagctggg attacaggcg tgagacaccg 97621 tgcccggcaa aaatatttta aaataagtca gaggtgcagg tttaatccaa atcccccatc 97681 agcctcctgg tcctgggaca aatattgtct gcaccgtgct cacacacaca gacacccacc 97741 ggacacactc gcttggaggc ccaggacatg gggaagttgg gcccagctgt gtcaaaggcc 97801 aatcctgccc ctccttgctg tgtgacctca ggcaagcaac tgtcctctct gagcccgagc 97861 ctgcctgacc ccctgctttt cactctagca agaatgctgt ggcacatctg gtcccatgga 97921 ctgggtgaac ttcacgtcag ccttccgggc ggccactccg gaggtggtgt tcccctggcc 97981 cccactgtgc tgtcgccgga cgggaaactt catccccctc aacgaggagg gctgccgcct 98041 ggggcacatg gactacctgt tcaccaaggt gtggccgtct gccctgctcc atctgtctat 98101 ccatctgtct gtcttcctct ggtctctgct ccctctctga ttctctcttt ctccctccct 98161 gtgtctgtcc gtctagcttt ttccctctcc tcttcccctc tctgattttc ttgttttctt 98221 tttctttctt tctttctttc tttctttttt ttttttcaag accgagtcct gctctgtcac 98281 ccaggctgga gtacagtggc acaatctcag ctcactgcag cgtctgcctc ccgggttcaa 98341 gggattctcc cacctcagct tccagagcag ctgggattac aggtgcgcac caccctcttg 98401 gctagttttt gtatttttag cagagacggg gtttgaccat gttgtccagg ctggtctcga 98461 actcctgagc tcaagtgatc cacccacctc agcctcccaa agtgctcaga ttacaggcgt 98521 gagccaccac gctgggtccc ctctctgatt ttctagtgct cccccaactc tctctcaagt 98581 cagcaggatg cacccagggt ctgtctgaac cagacctggc acctacaact ttactcaggt 98641 tgtccgagaa taagagcaat aatagggccg ggcgcgatgg ctcatgcctg taatctcagc 98701 actttgggag gctgaggagg gcagatcact tgaggtcaag agttcgagac cagcctgacc 98761 aacatggcga aaccccatct ctactaaaaa tgcaaaaatt agcccagagc agtagcacgt 98821 gcctgtaatc ccagctactc gggaggctga ggtaggagaa tcacttgaac tctggaggtg 98881 aaggttgcag tgagccgaga ttgtgccact gcaccccagc ctgggcgaca gagtgaaact 98941 ccatctcaaa caaaaaaaaa caaaacaaaa catcaaacaa aacaatagag ctaggcacgg 99001 gggctcacga ctgtaatccc agtactttgg gaggccaagg caggagcatc acttgaggtc 99061 agaagctcaa gaccagcctg ggcaacatat caagaccccg tctctacaaa aaagtacaga 99121 aaattagcca ggcgtggtgg tgcgcgcctg taatcccagc tactggggag gctaaggcag 99181 gaggattgtt tgagtctagg aggtcaaggc tgcagagagc tgtgatggtg ccactgcact 99241 ctagcctagg caagagactt aaaaaaggca agagacttta aaaaaaaaaa ggccgggcgc 99301 gcacagtggc tcatgcctgt aatcccagca ctttgggaga ccgagatggg aggatcacaa 99361 gatcaggagt tcgagaccag cctgaccaac atggtgaaac cccgtctcta ctaaaaatac 99421 aaaaattagc cgggcgtagt ggtgcacacc tgtaatccca gctactcagg aggctgaggc 99481 aggagaattg cttgaacgcg ggaggtaaag gtttcagtga gccgagatca tgccactgca 99541 ctgtagcctg ggtgacagag ggagactgcg agactgtctc caaaaaaaaa aaaaagccag 99601 ctgtaagatc aaaatcaccc tcagattgtc atagccatca gcagggcggc acctacttgt 99661 gcagggcctg ggttcaacat cagctccctg gggccccgaa accacccgag gaggtggaca 99721 gtgccatggt ccccattgca caagtgagaa aaccaatgct cagggaaggt ggtgactttc 99781 ccaagggcgc acagtgactc tcgctttcag cggctaccct catgggcgtc ttctcaaact 99841 tcccccatcc ccctgcccag ggctgcttcg aacacatcgg ccacgccatc gacagctaca 99901 cgtggggtat ctcgtggttt gggtttgcca tcctgatgtg gacggtgaga ggcggggagc 99961 ccacaggctg ggtgggctgg gggtgggggg cggagtgccc tcatctcgct gcctcctcgc 100021 cagctcccgg tcatgctgat agccatgtat ttctacacca tgctctgagg gacaggaggg 100081 gaaggcaaca tacacacccc ggactcctcc gcatcctcct cctgcttcct ccgctgggcc 100141 tggatggctg cctcacctct cacctcccaa cgtccctagc ccttacgtcc ttccacttcc 100201 aagatctttt tccaggttcc tgagccctac tgtgtctcag gtgtgccctg aaaccccagg 100261 gcttgtgtgc acatatcctt agcccatctt tcaagggacc tctccatgat cccacctccc 100321 attcacagat acctctcttg tagctctctg acctcctcct tcatggcagg catcgccatt 100381 cttgctgaac cgtttgtgat tgccatttga gctctggaag cctctattgc catgagagtt 100441 ctgtcacggt cactttactg tccccatcat cacccagcac ggggctaagc atatactaga 100501 tagtcaataa ataaataaat aatgaatgaa tgaatgagtt tttttttttc tgggtcacgg 100561 gagagtgaac aagaactgtt ctaggccagg cgcagtggct cacgcctgta atcccagcac 100621 tttgggaggc cgaggcgggc agatcacctg aggtcaggag ttcaagacca gcctggccaa 100681 aatggtgaga cccccatctc tacaaaaatg caaaaatcag ccgggtatgg tggcgcacat 100741 ctgtaatccc agccactccg gaggctgagg cagaagaatc acttgaaccc gggaggtgga 100801 ggttgcagtg agctgagatc gcaccattgt actccagcct cggcaacaaa tcgaggccct 100861 gtctccaaaa acaaaaaaaa ctattctgat ggaagtggtg gaaacccaac tgtcactgac 100921 tgaagtgtaa aatggaattt atcagctcac gtaactgagt caacggggct ggcttcaggc 100981 atgtctggat ccagatgttc aatctaagtt tccaggaaac ctgcttcatt ccctgacagg 101041 atttctctgg caactccaga cttaatgtct ttataactaa gcaatgcaac agaaggaaag 101101 agcctctctc ccaagggctg tagccaaggt ccagagttgt agccaaggtc cagagttgaa 101161 tctcactgtc tcggcttagg ttctaggccc ttcgttaggt tgggaaaggg agggcacctg 101221 gcttggtaat cctgctaagg atgcatgcag tggtggtgtt ggaaggggag ttgtttatca 101281 aaggaaaatc cagatacagg gctgctgagg cttataggtg ataaaaatcc ttctcctagg 101341 gatatagctt ccagcaagtt ctctaagggg tccatgacag caaaccatta cgaattgaat 101401 tatggctggg cactgtggct cacacctgta atcccagcac ttgctgaggt gggcagatca 101461 cttgaggcca ggaacttgag accagcctgg ccaaaatggc gaaaccctat ctctattaaa 101521 aatacaaaaa agtacctggg catggtggtg tgcacctgta gacccagcta ctcaggaggc 101581 tgaagcacaa gaatcgcttg aacccggaag gtggaggttg cagtgagcca agattgcacc 101641 actgcactcc agcctgggag acagaacaag attccgtctc taaaaaaaaa aaaaagaacc 101701 aaattatcac tctttgttct ccctgaagcc acgactaggc cttcccaaga gggcctgggg 101761 gcttccttcc ccatcaaata atttcttggg gttcccattt actccagtgg tctgcagata 101821 gtctttgatt gctgaattca gatttcattt tttaaaaata tgtaatggat aaagaggtca 101881 ttatttatta gagtgaaaaa tcggaaacta cccaacaacc atcaccaaag ggatgggttg 101941 tgtaaatcat gttatgccct tggactgaaa tgctctgtgg ctgttgtgtg tgcgcctgtg 102001 gtcccagcta ctgggaaggc ttaggcggga ggaccgcttg agccctggag ttcgaggttg 102061 cagtaagcta tgttcacacc tatgagtagc cactgcactc cagcctgggc gacacagcaa 102121 aattccatct ctaaataata ataataatag gctgagcacg gtggctcatg tctgtaatcc 102181 cagcactttg ggatgtccag gtgggaggat cgcttgagcc ccagagttta agatcagcct 102241 gggcaacata gcaagacctt gtctctataa aaaattttaa aaagcaggac gcagtggctc 102301 aggcctgtaa tcccagcact ttgggaggct gaggtgggag gaccgcttga gctctcgaga 102361 ccagcctggg caacatggca aaaccctgtc tctacaaaaa atgcaaaaat tatccaggtg 102421 tggtgatgcg tgcctgtggt cccagacact ggggaggctg agatggaagg atggcttgag 102481 cccaggaggc agaggttgca gtgagccgag atcgcaccac tgcactccag cctgggtgat 102541 agagccagac tctgtctcaa aaaaaaaaaa aattaaaaaa attatctggg tgtagtagtg 102601 tgtgcctgtg gtcctagctt cttaggaggc tgagatggaa ggactgtctg agcacaggag 102661 gtcgaggctg cagtgctcca caccaccaca ctccagcctg gggaacagag caagaacctg 102721 cttcaaataa ataaatatat atatatataa ttataataat aaatactttt taaaagagat 102781 agctgtatgt gagttgacta taaaaagatc tcataataac tggaaagccc atatgaggat 102841 gttcattagt aaaataaatg actatcagca gggcactggg taaattagct acgctatatc 102901 catcctctgc aataccacaa agaggtaaaa ataaaaagaa caagaactct gctttaacat 102961 ggaaaacgcg ctaagaaata catagtagga aaaattgaaa actatataca gtactttttt 103021 tttttttgag atggagtttc actcttttgc ccaggctaga gtgaagtggt gcgatcttgg 103081 ctcactgcaa cctctgcccc ctgtgttcaa gcgattctct tgcctcaggc tctgagtagc 103141 taagattata ggcacctgcc accatgcccg gctaattttt gtatttttac tggagacagg 103201 cttcgccatg ttggccaggc tggtctggaa ctcccgacct caggtgatcc actcacctca 103261 gtctcccaaa atgctaggat tacaggtgtg agccatcaca cctggcccat atagtactct 103321 tattcctgca gggttgatca aacggagctt tagtcttttt cttccttaaa aaaatgagtt 103381 aatgttttta taataatgtt tagaagaata gaggaaaaac tttgttgttt tgcagaacac 103441 ttttttgttg ttgttgttgt tgacggagtc tcgctctgtt gaccaggctg aagttcagtg 103501 gcacaatccc ggctcactgc aacctccgcc tcctgggctc aagcaatcct cccctctcag 103561 cctcctgagt agct // LOCUS HSACENT 2846 bp RNA PRI 17-FEB-1997 DEFINITION H.sapiens mRNA for alpha-centractin. ACCESSION X82206 NID g563882 KEYWORDS centractin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2846) AUTHORS Clark,S.W., Staub,O., Clark,I.B., Holzbaur,E.L., Paschal,B.M., Vallee,R.B. and Meyer,D.I. TITLE Beta-centractin: characterization and distribution of a new member of the centractin family of actin-related proteins JOURNAL Mol. Biol. Cell 5 (12), 1301-1310 (1994) MEDLINE 95210749 REFERENCE 2 (bases 1 to 2846) AUTHORS Clark,S.W. TITLE Direct Submission JOURNAL Submitted (12-OCT-1994) S.W. Clark, University of California, Dept of Biological Chemistry, UCLA School of Medicine, 10833 Le Conte Ave., Los Angeles CA 90024-1737, USA REFERENCE 3 (bases 1 to 2846) AUTHORS Lees-Miller,J.P., Helfman,D.M. and Schroer,T.A. TITLE A vertebrate actin-related protein is a component of a multi-subunit complex involved in microtubule based vesicle motility JOURNAL Nature (1992) In press COMMENT Shows homology to Z14978 from position 19..1682. FEATURES Location/Qualifiers source 1..2846 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="testicular" /clone_lib="Clontech HL1010b" /clone="FC1519" misc_feature 1..9 /note="EcoRI linker" CDS 67..1197 /codon_start=1 /product="alpha-centractin" /db_xref="PID:g563883" /db_xref="SWISS-PROT:P42024" /translation="MESYDVIANQPVVIDNGSGVIKAGFAGDQIPKYCFPNYVGRPKH VRVMAGALEGDIFIGPKAEEHRGLLSIRYPMEHGIVKDWNDMERIWQYVYSKDQLQTF SEEHPVLLTEAPLNPRKNRERAAEVFFETFNVPALFISMQAVLSLYATGRTTGVVLDS GDGVTHAVPIYEGFAMPHSIMRIDIAGRDVSRFLRLYLRKEGYDFHSSSEFEIVKAIK ERACYLSINPQKDETLETEKAQYYLPDGSTIEIGPSRFRAPELLFRPDLIGEESEGIH EVLVFAIQKSDMDLRRTLFSNIVLSGGSTLFKGFGDRLLSEVKKLAPKDVKIRISAPQ ERLYSTWIGGSILASLDTFKKMWVSKKEYEEDGARSIHRKTF" conflict 1262..1265 /citation=[3] /replace="ctta" conflict 1318..1320 /citation=[3] /replace="gtg" conflict 1378..1380 /citation=[3] /replace="ctg" conflict 1386..1387 /citation=[3] /replace="gtgat" conflict 1469..1472 /citation=[3] /replace="gcgc" conflict 1477..1478 /citation=[3] /replace="gctggctgtgggggac" conflict 1541..1543 /citation=[3] /replace="aca" conflict 1574..1575 /citation=[3] /replace="gcc" conflict 1596..1598 /citation=[3] /replace="ag" conflict 1674..1677 /citation=[3] /replace="gg" polyA_signal 2800..2805 misc_feature 2838..2846 /note="EcoRI linker" BASE COUNT 636 a 790 c 768 g 652 t ORIGIN 1 gaattcgggg cgctacggcg gacccggctg ggcagttcct tccccagaag gagagattcc 61 tctgccatgg agtcctacga tgtgatcgcc aaccagcctg tcgtgatcga caacggatcc 121 ggtgtgatta aagctggttt tgctggtgat cagatcccca aatactgctt tccaaactat 181 gtgggccgac ccaagcacgt tcgtgtcatg gcaggagccc ttgaaggcga catcttcatt 241 ggccccaaag ctgaggagca ccgagggctg ctttcaatcc gctatcccat ggagcatggc 301 atcgtcaagg attggaacga catggaacgc atttggcaat atgtctattc taaggaccag 361 ctgcagactt tctcagagga gcatcctgtg ctcctgactg aggcgccttt aaacccacga 421 aaaaaccggg aacgagctgc cgaagttttc ttcgagacct tcaatgtgcc cgctcttttc 481 atctccatgc aagctgtact cagcctttac gctacaggca ggaccacagg ggtggtgctg 541 gattctgggg atggagtcac ccatgctgtg cccatctatg agggctttgc catgccccac 601 tccatcatgc gcatcgacat cgcgggccgg gacgtctctc gcttcctgcg cctctacctg 661 cgtaaggagg gctacgactt ccactcatcc tctgagtttg agattgtcaa ggccataaaa 721 gaaagagcct gttacctatc cataaacccc caaaaggatg agacgctaga gacagagaaa 781 gctcagtact acctgcctga tggcagcacc attgagattg gtccttcccg attccgggcc 841 cctgagttgc tcttcaggcc agatttgatt ggagaggaga gtgaaggcat ccacgaggtc 901 ctggtgttcg ccattcagaa gtcagacatg gacctgcggc gcacgctttt ctctaacatt 961 gtcctctcag gaggctctac cctgttcaaa ggttttggtg acaggctcct gagtgaagtg 1021 aagaaactag ctccaaaaga tgtgaagatc aggatatctg cacctcagga gagactgtat 1081 tccacgtgga ttgggggctc catccttgcc tccctggaca cctttaagaa gatgtgggtc 1141 tccaaaaagg aatatgagga agacggtgcc cgatccatcc acagaaaaac cttctaatgt 1201 cgggacatca tcttcacctc tctctgaagt taactccact ttaaaactcg ctttcttgag 1261 tcggagtgtt tgcgaggaac tgcctgtgtg tgagtgcgtg tgtggatatg agtgtgtgcg 1321 cacatgcgag tgccgtgtgg ccctgggacc ctgggcccag aaaggacgat gaactacccg 1381 cagtggtgat gcctgaggcc tggggttgac cactaactgg ctcctgacag ggaagagcgc 1441 tggcagaggc tgtgctccct cctcaggtgg cctctggctg gctgtggggg actccgttta 1501 ctaccacagg gagacagagg gaggtaagcc atcccccggg agaccttgct gctgaccatc 1561 ctaggctggg ctggcccacc ctcaccccca cccccagggt gccctgaggc cccaggcagc 1621 tgctgcctcc actatcgatg cctcctgact gcacactgag gactgggact ggggttgagt 1681 tctgtctggt tttgttgcca ttttggtttg ggaggctgga aaagcacccc aagaagctat 1741 tacagagact ggagtcagga gagagcagga ggccctcatg ttcaccaggg aacaggacca 1801 caccggccac tgaaggaggg caggagcagt cctccctctg aatggctgca gagttaatgt 1861 tcccagccca gtcccctttc gggggccttg ggagagttta aggcacctgc tggttccagg 1921 acctcgcttt ccatctgttc ttgttgcaat gccatcttca aaccgtttta tttattgaag 1981 tgtttgttca gttaggggct ggagagaggg agcttgctgc ctcctgcctt gctacactaa 2041 tgtttacagc acctaagctt agcctccagg gccccacctc tcccagctga tggtgagctg 2101 acagtgtcca caggttccag gaccatttga gattggaagc tacactcaaa gacactccca 2161 ccaggctctt tctccctttt cctcttctca ctgccctgga atcaacaggc tggttgctgg 2221 ttagattttc tgaaacagga ggtaaaattt ttctttggca gaggccccta agcaagggag 2281 gggtgttgga gagccagtgc ccttaagact ggagaaagct gcaatttacc aagttgcctt 2341 ttgccactgt agctgaccag gggactaggt tgtagaggtg ggaaggcccc tctgggctga 2401 tcttgtgcca ttcttgacct tggacctgct tggttaagga gggagtgggc cagaccagag 2461 tgccaggagc taatggagcc aggcctgaca cctaggagtg gtccaaagcc ttcagcctag 2521 atggtgcaaa gctggggcca gcctgtcttc accggcaccc tcacctgtga caccaagacc 2581 caccccaatc cagacttcac acagtattct cccccacgcc gtctatgacc aaaggcccct 2641 gccaggtgtg ggtccacagc agcaggtatg tgtgaaagca acgtagcgcc ccgcggactg 2701 cagtgcgctt aaccaactca cctcccttct cttagcccaa gcctgtccct cgcacagcct 2761 cgcacaaacc acattgcctg gtggggccca gtgtactgaa ataaagtcgt tccgatagac 2821 acgtcaaaaa aaaaaaaccc gaattc // LOCUS HSACETR 2457 bp RNA PRI 25-OCT-1994 DEFINITION H.sapiens mRNA for acetylcholine receptor (epsilon subunit). ACCESSION X66403 NID g560152 KEYWORDS acetylcholine receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2457) AUTHORS Beeson,D.M.W. TITLE Direct Submission JOURNAL Submitted (25-MAY-1992) D.M.W. Beeson, Institute of Molecular Medicine, Neurosciences group, Inst. of Mol. Med., John Radcliffe Hospital, Headington, Oxford OX3 9DU, UK REMARK revised by [3] MAT REFERENCE 2 (bases 1 to 2457) AUTHORS Beeson,D., Brydson,M., Betty,M., Jeremiah,S., Povey,S., Vincent,A. and Newsom-Davis,J. TITLE Primary structure of the human muscle acetylcholine receptor. cDNA cloning of the gamma and epsilon subunits JOURNAL Eur. J. Biochem. 215 (2), 229-238 (1993) MEDLINE 93345508 REFERENCE 3 (bases 1 to 2457) AUTHORS Beeson,D.M.W. TITLE Direct Submission JOURNAL Submitted (25-OCT-1994) D.M.W. Beeson, Institute of Molecular Medicine, Neurosciences group, Inst. of Mol. Med., John Radcliffe Hospital, Headington, Oxford OX3 9DU, UK FEATURES Location/Qualifiers source 1..2457 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="muscle" /cell_type="fibroblast" /clone_lib="lambda gt11 muscle cDNA" /clone="RP3L, LMP8, RG1, RA6" /chromosome="17 CHRNE" sig_peptide 12..72 /note="acetylcholine receptor epsilon subunit-precursor" CDS 12..1493 /codon_start=1 /product="acetylcholine receptor epsilon subunit CHRNE" /db_xref="PID:g560153" /db_xref="SWISS-PROT:Q04844" /translation="MARAPLGVLLLLGLLGRGVGKNEELRLYHHLFNNYDPGSRPVRE PEDTVTISLKVTLTNLISLNEKEETLTTSVWIGIDWQDYRLNYSKDDFGGIETLRVPS ELVWLPEIVLENNIDGQFGVAYDANVLVYEGGSVTWLPPAIYRSVCAVEVTYFPFDWQ NCSLIFRSQTYNAEEVEFTFAVDNDGKTINKIDIDTEAYTENGEWAIDFCPGVIRRHH GGATDGPGETDVIYSLIIRRKPLFYVINIIVPCVLISGLVLLAYFLPAQAGGQKCTVS INVLLAQTVFLFLIAQKIPETSLSVPLLGRFLIFVMVVATLIVMNCVIVLNVSQRTPT THAMSPRLRHVLLELLPRLLGSPPPPEAPRAASPPRRASSVGLLLRAEELILKKPRSE LVFEGQRHRQGTWTAAFCQSLGAAAPEVRCCVDAVNFVAESTRDQEATGEEVSDWVRM GNALDNICFWAALVLFSVGSSLIFLGAYFNRVPDLPYAPCIQP" mat_peptide 73..1490 /product="acetylcholine receptor epsilon subunit CHRNE" BASE COUNT 477 a 751 c 703 g 526 t ORIGIN 1 cacgcagcag gatggcaagg gctccgcttg gggtcctgct cctcttgggg cttctcggca 61 ggggtgtggg gaagaacgag gaactgcgtc tttatcacca tctcttcaac aactatgacc 121 caggaagccg gccagtgcgg gagcctgagg atactgtcac catcagcctc aaggtcaccc 181 tgacgaatct catctcactg aatgaaaaag aggagactct caccactagc gtctggattg 241 gaatcgattg gcaggattac cgactcaact acagcaagga cgactttggg ggtatagaaa 301 ccctgcgagt cccttcagaa ctcgtgtggc tgccagagat tgtgctggaa aacaatattg 361 atggccagtt cggagtggcc tacgacgcca acgtgctcgt ctacgagggc ggctccgtga 421 cgtggctgcc tccggccatc taccgcagcg tctgcgcagt ggaggtcacc tacttcccct 481 tcgattggca gaactgttcg cttattttcc gctctcagac gtacaatgcc gaagaggtgg 541 agttcacttt tgccgtagac aacgacggca agaccatcaa caagatcgac atcgacacag 601 aggcctatac tgagaacggc gagtgggcca tcgacttctg cccgggggtg atccgccgcc 661 accacggtgg cgccaccgac ggcccagggg agactgacgt catctactcg ctcatcatcc 721 gccggaagcc gctcttctac gtcattaaca tcatcgtgcc ctgtgtgctc atctcgggcc 781 tggtgctgct cgcctacttc ctgccggcgc aggccggcgg ccagaaatgc acggtctcca 841 tcaacgtcct gctcgcccag accgtcttct tgttcctcat tgcccagaaa atcccagaga 901 cttctctgag cgtgccgctc ctgggcaggt tccttatttt cgtcatggtg gtcgccacgc 961 tcattgtcat gaattgcgtc atcgtgctca acgtgtccca gcggacgccc accacccacg 1021 ccatgtcccc gcggctgcgc cacgttctcc tggagctgct gccgcgcctc ctgggctccc 1081 cgccgccgcc cgaggccccc cgggccgcct cgcccccaag gcgggcgtcg tcggtgggct 1141 tattgctccg cgcggaggag ctgatactga aaaagccacg gagcgagctc gtgtttgagg 1201 ggcagaggca ccggcagggg acctggacgg ctgccttctg ccagagcctg ggcgccgccg 1261 cccccgaggt ccgctgctgt gtggatgccg tgaacttcgt ggccgagagc acgagagatc 1321 aggaggccac cggcgaggaa gtgtccgact gggtgcgcat ggggaatgcc cttgacaaca 1381 tctgcttctg ggccgctctg gtgctcttca gcgtgggctc cagcctcatc ttcctcgggg 1441 cctacttcaa ccgagtgcct gatctcccct acgcgccgtg tatccagcct tagctcgcac 1501 cgacttcaat ttcccaccca tctccagtag gaaattgatt ttgaaaaagt aggctgccgc 1561 caccacggca ttatgatccc ttccccctgc tgatcaatct gcagtttgtg aacttcacaa 1621 gaatggtgtg tgcccgttcc ctggcgtgtg taggcctggc cgcagtccag gggtcagcag 1681 gaggaaaggg ttcacatagg ctctcaggtg ccagtcttcc agaaagcaag gactgccctt 1741 cattcagcct tgctgacctc ccagcctttc taaggctcag ccccacggga ctctggtggc 1801 tgccagcttg tgagctatct atctatattc atttcatagc caaacaggag acccctttgc 1861 aggacttgca cacagggagg ctgtagccag gaaaccctct tcttccctgg tctggctctg 1921 ctggagcggg tgggaaccaa acaccttcag tgctggtggc cctcaggccc acaggtttaa 1981 ggctgaggct gccctgaccc ttccacagtc atttcttcta ggttttcttg gcccagcact 2041 gcccatccca ccccatgagg ctcactcatt gcagatccca gcccaccctg cccctttctt 2101 ccccaccctg gaggctctct ctgcctagtc tacagtactg acagaaagca aggacatgcg 2161 gcctgcatgg tgggagctgg ttgaattgtc tttattaaca aacaggatat ccaaggccac 2221 tacattgagg aggggggagg ggggagggag gagaagggtt acttgctgct cacactatat 2281 acagatgcaa gcaaggggcg tggagagtga gggctccctg ctccctccct ccaccgggga 2341 agggcatggg ctagaagagg agaggggggt cgggaatggg gggaatgttt tggctgcggg 2401 gtcccccctc cattccctgg agtttggggg aaggggaatc attaaagtgc tttcaga // LOCUS HSACHRA 1667 bp RNA PRI 23-MAR-1995 DEFINITION Human mRNA for muscle acetylcholine receptor alpha-subunit. ACCESSION Y00762 NID g28308 KEYWORDS acetylcholine receptor alpha. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1667) AUTHORS Schoepfer,R. TITLE Direct Submission JOURNAL Submitted (03-FEB-1988) to the EMBL/GenBank/DDBJ databases REFERENCE 2 (bases 1 to 1667) AUTHORS Schoepfer,R., Luther,M. and Lindstrom,J. TITLE The human medulloblastoma cell line TE671 expresses a muscle-like acetylcholine receptor. Cloning of the alpha-subunit cDNA JOURNAL FEBS Lett. 226 (2), 235-240 (1988) MEDLINE 88112190 FEATURES Location/Qualifiers source 1..1667 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="medulloblastoma TE761" /clone_lib="1TE" /clone="pTE1.1" CDS 49..1422 /note="precursor (AA -20 to 437)" /codon_start=1 /db_xref="PID:g28309" /db_xref="SWISS-PROT:P02708" /translation="MEPWPLLLLFSLCSAGLVLGSEHETRLVAKLFKDYSSVVRPVED HRQVVEVTVGLQLIQLINVDEVNQIVTTNVRLKQQWVDYNLKWNPDDYGGVKKIHIPS EKIWRPDLVLYNNADGDFAIVKFTKVLLQYTGHITWTPPAIFKSYCEIIVTHFPFDEQ NCSMKLGTWTYDGSVVAINPESDQPDLSNFMESGEWVIKESRGWKHSVTYSCCPDTPY LDITYHFVMQRLPLYFIVNVIIPCLLFSFLTGLVFYLPTDSGEKMTLSISVLLSLTVF LLVIVELIPSTSSAVPLIGKYMLFTMVFVIASIIITVIVINTHHRSPSTHVMPNWVRK VFIDTIPNIMFFSTMKRPSREKQDKKIFTEDIDISDISGKPGPPPMGFHSPLIKHPEV KSAIEGIKYIAETMKSDQESNNAAAEWKYVAMVMDHILLGVFMLVCIIGTLAVFAGRL IELNQQG" sig_peptide 49..108 /note="signal peptide (AA -20 to -1)" mat_peptide 109..1419 /note="mature alpha-chain (AA 1-437)" misc_feature 529..531 /note="pot. N-glycosylation site" BASE COUNT 419 a 465 c 371 g 412 t ORIGIN 1 aagcacaggc caccactctg ccctggtcca cacaagctcc ggtagcccat ggagccctgg 61 cctctcctcc tgctctttag cctttgctca gctggcctcg tcctgggctc cgaacatgag 121 acccgtctgg tggcaaagct atttaaagac tacagcagcg tggtgcggcc agtggaagac 181 caccgccagg tcgtggaggt caccgtgggc ctgcagctga tacagctcat caatgtggat 241 gaagtaaatc agatcgtgac aaccaatgtg cgtctgaaac agcaatgggt ggattacaac 301 ctaaaatgga atccagatga ctatggcggt gtgaaaaaaa ttcacattcc ttcagaaaag 361 atctggcgcc cagaccttgt tctctataac aatgcagatg gtgactttgc tattgtcaag 421 ttcaccaaag tgctcctgca gtacactggc cacatcacgt ggacacctcc agccatcttt 481 aaaagctact gtgagatcat cgtcacccac tttccctttg atgaacagaa ctgcagcatg 541 aagctgggca cctggaccta cgacggctct gtcgtggcca tcaacccgga aagcgaccag 601 ccagacctga gcaacttcat ggagagcggg gagtgggtga tcaaggagtc ccggggctgg 661 aagcactccg tgacctattc ctgctgcccc gacaccccct acctggacat cacctaccac 721 ttcgtcatgc agcgcctgcc cctctacttc atcgtcaacg tcatcatccc ctgcctgctc 781 ttctccttct taactggcct ggtattctac ctgcccacag actcagggga gaagatgact 841 ctgagcatct ctgtcttact gtctttgact gtgttccttc tggtcatcgt ggagctgatc 901 ccctccacgt ccagtgctgt gcccttgatt ggaaaataca tgctgttcac catggtgttc 961 gtcattgcct ccatcatcat cactgtcatc gtcatcaaca cacaccaccg ctcacccagc 1021 acccatgtca tgcccaactg ggtgcggaag gtttttatcg acactatccc aaatatcatg 1081 tttttctcca caatgaaaag accatccaga gaaaagcaag acaaaaagat ttttacagaa 1141 gacattgata tctctgacat ttctggaaag ccagggcctc cacccatggg cttccactct 1201 cccctgatca aacaccccga ggtgaaaagt gccatcgagg gcatcaagta catcgcagag 1261 accatgaagt cagaccagga gtctaacaat gcggcggcag agtggaagta cgttgcaatg 1321 gtgatggacc acatactcct cggagtcttc atgcttgttt gcatcatcgg aaccctagcc 1381 gtgtttgcag gtcgactcat tgaattaaat cagcaaggat gagcagaaaa tgagctgagc 1441 ttagctctgc cctggaacct accagagcag agaagggcag gagaggaaga tttgtctact 1501 tgctccactc gcacttatca aacgtgttat attccatact tattattgat gataagattt 1561 acctttatgt aagtttatgg ccttgaagtg ttttcatatt gcttctccct ttagttctgc 1621 tgtctccctg aagagtgaac cctctttagt aaatgaaact aatcact // LOCUS HSACHRB 1650 bp RNA PRI 25-OCT-1994 DEFINITION Human mRNA for muscle acetylcholine receptor beta-subunit. ACCESSION X14830 NID g560154 KEYWORDS acetylcholine receptor; acetylcholine receptor beta; glycoprotein; transmembrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1650) AUTHORS Beeson,D.M.W. TITLE Direct Submission JOURNAL Submitted (20-MAR-1989) Beeson D.M.W., Neurosciences Group IMM OXFORD, Institute of Molecular Medicine, John Radcliffe Hospital, Oxford 0X3 9DX, ENGLAND REMARK revised by [3] MAT REFERENCE 2 (bases 1 to 1650) AUTHORS Beeson,D., Brydson,M. and Newsom-Davis,J. TITLE Nucleotide sequence of human muscle acetylcholine receptor beta-subunit JOURNAL Nucleic Acids Res. 17 (11), 4391 (1989) MEDLINE 89296503 REFERENCE 3 (bases 1 to 1650) AUTHORS Beeson,D.M.W. TITLE Direct Submission JOURNAL Submitted (25-OCT-1994) Beeson D.M.W., Neurosciences Group IMM OXFORD, Institute of Molecular Medicine, John Radcliffe Hospital, Oxford 0X3 9DX, ENGLAND FEATURES Location/Qualifiers source 1..1650 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda gt10." sig_peptide 17..85 /note="signal peptide (AA -23 to -1)" CDS 17..1522 /note="acetylcholine receptor beta-subunit preprotein" /codon_start=1 /db_xref="PID:g560155" /db_xref="SWISS-PROT:P11230" /translation="MTPGALLMLLGALGPPLAPGVRGSEAEGRLREKLFSGYDSSVRP AREVGDRVRVSVGLILAQLISLNEKDEEMSTKVYLDLEWTDYRLSWDPAEHDGIDSLR ITAESVWLPDVVLLNNNDGNFDVALDISVVVSSDGSVRWQPPGIYRSSCSIQVTYFPF DWQNCTMVFSSYSYDSSEVSLQTGLGPDGQGHQEIHIHEGTFIENGQWENIHKPSRLI QPPGDPRGGREGQRQEVIFYLIIRRKPLFYLVNVIAPCILITLLAIFVFYLPPDAGEK MGLSIFALLTLTVFLLLLADKVPETSLSVPIIIKYLMFTMVLVTFSVILSVVVLNLHH RSPHTHQMPLWVRQIFIHKLPLYLRLKRPKPERDLMPEPPHCSSPGSGWGRGTDEYFI RKPPSDFLFPKPNRFQPELSAPDLRRFIDGPNRAVALLPELREVVSSISYIARQLQEQ EDHDALKEDWQFVAMVVDRLFLWTFIIFTSVGTLVIFLDATYHLPPPDPFP" mat_peptide 86..1519 /note="mat. acetylcholine receptor beta- subunit (AA 1-478)" BASE COUNT 331 a 502 c 440 g 377 t ORIGIN 1 agcgagccgc cacggtatga ccccaggggc tctgctgatg ctgctggggg cgctggggcc 61 gccgctcgcc ccaggcgtcc gcggctcgga ggcggagggt cgactccggg agaaactttt 121 ctctggctat gatagctccg tgcggccagc gcgggaggtg ggagaccgtg tcagggtcag 181 cgttggtctc atcctggcgc aactcatcag cctgaacgag aaggatgaag agatgagcac 241 aaaggtgtac ttagacctgg agtggactga ctacaggctg agctgggacc ctgcggagca 301 cgacggcatc gattcgctcc gcatcacggc ggaatccgtg tggctccctg acgtggtgct 361 actgaacaac aatgatggga attttgacgt ggctctggac attagcgtcg tggtgtcctc 421 cgacggctcc gtgcgttggc aacccccggg catctatcgc agcagctgca gcatccaggt 481 cacctacttc cccttcgact ggcagaattg cactatggtg ttcagctcct acagctacga 541 cagctcggag gtcagcctgc agacaggcct gggtcctgac gggcaagggc atcaggaaat 601 ccacattcat gaagggactt tcattgagaa tggccagtgg gagaatatcc acaagccctc 661 tcggctaatc cagcctccag gcgatcctag gggagggagg gaaggacagc gccaggaagt 721 catcttctac ctcatcatcc gccgcaagcc tctcttctac ctggtcaacg tcattgcccc 781 atgcatcctc atcactcttc tggccatctt cgtcttctac ctgccaccag atgcaggaga 841 gaagatgggg ctctcaatct ttgccctgct gacccttact gtgttcctgc tgctgctggc 901 tgacaaagta cctgagacct cactatcagt acccattatt atcaagtacc tcatgtttac 961 catggtcctc gtcaccttct cagtcatcct tagtgtcgtg gttctcaacc tgcaccaccg 1021 ctcaccccac acccaccaaa tgcccctttg ggtccgtcag atcttcattc acaaacttcc 1081 gctgtacctg cgtctaaaaa ggcccaaacc cgagagagac ctgatgccgg agccccctca 1141 ctgttcttct ccaggaagtg gctggggtcg gggaacagat gaatatttca tccggaagcc 1201 gccaagtgat tttctcttcc ccaaacccaa taggttccag cctgaactgt ctgcccctga 1261 tctgcggcga tttatcgatg gtccaaaccg ggctgtggcc ctgcttccgg agctacggga 1321 ggtcgtctcc tctatcagct acatcgctcg acagctgcag gaacaggagg accacgatgc 1381 gctgaaggag gactggcagt ttgtggccat ggtagtggac cgcctcttcc tgtggacttt 1441 catcatcttc accagcgttg ggaccctagt catcttcctg gacgccacgt accacttgcc 1501 ccctccagac ccctttcctt gaagactgga gggttgagac caggccccct gccagttgaa 1561 gtgagagagt ttggtgatac tgtcaagccc tatccttctc tgcctcttaa ctccttcacg 1621 aggaatctgg gcctcttatt tcgttctggg // LOCUS HSACHRG 1741 bp RNA PRI 28-JUL-1993 DEFINITION H.sapiens mRNA for acetylcholine receptor delta subunit. ACCESSION X55019 X53091 X53516 NID g297401 KEYWORDS acetylcholine receptor delta subunit. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1741) AUTHORS Luther,M.A., Schoepfer,R., Whiting,P., Casey,B., Blatt,Y., Montal,M.S., Montal,M. and Linstrom,J. TITLE A muscle acetylcholine receptor is expressed in the human cerebellar medulloblastoma cell line TE671 JOURNAL J. Neurosci. 9 (3), 1082-1096 (1989) MEDLINE 89177471 REFERENCE 2 (bases 1 to 1741) AUTHORS Schoepfer,R. TITLE Direct Submission JOURNAL Submitted (16-MAY-1990) Schoepfer R., The Salk Institute for Biological Studies, 10010 N. Torret Pines Road, La Jolla, CA 92037, USA FEATURES Location/Qualifiers source 1..1741 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="medulloblastoma" /cell_line="TE671" sig_peptide 5..67 /gene="AChR" CDS 5..1558 /gene="AChR" /codon_start=1 /product="acetylcholine receptor delta subunit" /db_xref="PID:g297402" /db_xref="SWISS-PROT:Q07001" /translation="MEGPVLTLGLLAALAVCGSWGLNEEERLIRHLFQEKGYNKELRP VAHKEESVDVALALTLSNLISLKEVEETLTTNVWIEHGWTDNRLKWNAEEFGNISVLR LPPDMVWLPEIVLENNNDGSFQISYSCNVLVYHYGFVYWLPPAIFRSSCPISVTYFPF DWQNCSLKFSSLKYTAKEITLSLKQDAKENRTYPVEWIIIDPEGFTENGEWEIVHRPA RVNVDPRAPLDSPSRQDITFYLIIRRKPLFYIINILVPCVLISFMVNLVFYLPADSGE KTSVAISVLLAQSVFLLLISKRLPATSMAIPLIGKFLLFGMVLVTMVVVICVIVLNIH FRTPSTHVLSEGVKKLFLETLPELLHMSRPAEDGPSPGALVRRSSSLGYISKAEEYFL LKSRSDLMFEKQSERHGLARRLTTARRPPASSEQAQQELFNELKPAVDGANFIVNHMR DQNNYNEEKDSWNRVARTVDRLCLFVVTPVMVVGTAWIFLQGVYNQPPPQPFPGDPYS YNVQDKRFI" gene 5..1558 /gene="AChR" mat_peptide 68..1555 /gene="AChR" /product="acetylcholine receptor delta subunit" BASE COUNT 370 a 549 c 479 g 343 t ORIGIN 1 tgggatggag gggccagtgc tgacactggg gctgctggct gccctggcgg tgtgtggcag 61 ctgggggctg aacgaggagg agcggctgat ccggcacctg tttcaagaga agggctacaa 121 caaggagctc cggcccgtgg cacacaaaga ggagagtgtg gacgttgccc tggccctcac 181 actctccaac ctcatctccc tgaaagaagt tgaggagacc ctcactacca atgtgtggat 241 agagcacggc tggacagaca accggctgaa gtggaatgct gaagaatttg gaaacatcag 301 tgtcctgcgc ctccccccgg acatggtgtg gctcccagag attgtgctgg agaacaacaa 361 tgacggctcc ttccagatct cctactcctg caacgtgctt gtctaccact acggcttcgt 421 gtactggctg ccacctgcca tcttccgctc ctcctgcccc atctctgtca cctatttccc 481 cttcgactgg cagaactgct ccctcaagtt cagttccctc aagtatacgg ccaaagagat 541 caccctgagc ctgaaacagg atgccaagga gaaccgcacc taccccgtgg agtggatcat 601 cattgatcct gaaggcttca cagagaacgg ggagtgggag atagtccacc ggccggccag 661 ggtcaacgtg gaccccagag cccctctgga cagccccagc cgccaggaca tcaccttcta 721 cctcatcatc cgccgcaagc ccctcttcta catcatcaac atcctggtgc cctgcgtgct 781 catctccttc atggtcaacc tggtcttcta cctaccggct gacagtggtg agaagacatc 841 agtggccatc tcggtgctcc tggctcagtc tgtcttcctg ctgctcatct ccaagcgtct 901 gcctgccaca tccatggcca tcccccttat cggcaagttc ctgctcttcg gcatggtgct 961 ggtcaccatg gttgtggtga tctgtgtcat cgtgctcaac atccacttcc gaacacccag 1021 cacccatgtg ctgtctgagg gggtcaagaa gctcttcctg gagaccctgc cggagctcct 1081 gcacatgtcc cgcccagcag aggatggacc cagccctggg gccctggtgc ggaggagcag 1141 ctccctggga tacatctcca aggccgagga gtacttcctg ctcaagtccc gcagtgacct 1201 catgttcgag aagcagtcag agcggcatgg gctggccagg cgcctcacca ctgcacgccg 1261 gcccccagca agctctgagc aggcccagca ggaactcttc aatgagctga agccagctgt 1321 ggatggggca aacttcattg ttaaccacat gagggaccag aacaattaca atgaggagaa 1381 agacagctgg aaccgagtgg cccgcacagt ggaccgcctc tgcctgtttg tggtgacgcc 1441 tgtcatggtg gtgggcacag cctggatctt cctgcagggc gtttacaacc agccaccacc 1501 ccagcctttt cctggggacc cctactccta caacgtgcag gacaagcgct tcatctaggg 1561 tgggcctgtt ggggagccag gagacagcag ggtctgagag aggagccaca gtccctaatg 1621 acacccactc ctagccctga ggctcgtgcc cctcagactg gggaagagtc caaggaaggg 1681 agggagcagc cactcctcaa tgctcaatgg ctcccctgaa atcaagacag gggccacccg 1741 a // LOCUS HSACRAP 1653 bp RNA PRI 23-SEP-1996 DEFINITION H.sapiens gene for 43kD acetylcholine receptor-associated protein (Rapsyn). ACCESSION Z33905 NID g512484 KEYWORDS 43kD protein; Acetylcholine receptor-associated protei. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1653) AUTHORS Buckel,A., Beeson,D., James,M. and Vincent,A. TITLE Cloning of cDNA encoding human rapsyn and mapping of the RAPSN gene locus to chromosome 11p11.2-p11.1 JOURNAL Genomics 35 (3), 613-616 (1996) MEDLINE 97001170 REFERENCE 2 (bases 1 to 1653) AUTHORS Beeson,D.M. TITLE Direct Submission JOURNAL Submitted (27-MAY-1994) Beeson D. M., Institute of molecular medicine, Neurosciences group, John Radcliffe Hospital, Headington, Oxford, Oxon, U.K., OX3 9DU FEATURES Location/Qualifiers source 1..1653 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HuRap1" /dev_stage="Adult" /tissue_type="muscle, leg" /clone_lib="Human muscle cDNA in lamda gt10" mRNA 1..1653 CDS 208..1446 /function="Structural muscle protein" /codon_start=1 /product="43kD Acetylcholine receptor-associated protein (Rapsyn)" /db_xref="PID:g512485" /translation="MGQDQTKQQIEKGLQLYQSNQTEKALQVWTKVLEKSSDLMGRFR VLGCLVTAHSEMGRYKEMLKFAVVQIDTARELEDADFLLESYLNLARSNEKLCEFHKT ISYCKTCLGVPGTRAGAQLGGQVSLSMGNAFLGLSVFQKALESFEKALRYAHNNDDTM LECRVCCSLGSFYAQVKDYEKALFFPCKAAELVNNYGKGWSLKYQAMSQYHMAVAYRL LGRLGSAMECCEESMKIALQHGDRPLQALCLLCFADIHRSRGDLETAFPRYDSAMSIM TEIGNRLGQVQALLGVAKCWVARKALDKALDAIERAQDLAEEVGNKLSQLKLHCLSES IYRSKGLQRELRAHVVRFHECVEETELYCALCGESIGEKNSRLQALPCSHIFHLRCLQ NNGTRSCPNCRRSSMKPGFV" BASE COUNT 319 a 513 c 521 g 300 t ORIGIN 1 cccaactggc agcgacacgt agggacgggc tgaaccagct ttcttcccag ggtggcgcct 61 gctctccatc caggccccat tccggctccc acccgactgc tgcttttgtt cccacgtttc 121 ggggggcagc tggcactgtg attcctgccc catgagtgcc tagtggccag gagccaccag 181 ggatcacccc acgtgggctt ggggaggatg gggcaggacc agaccaagca gcagatcgag 241 aaggggctcc agctgtacca gtccaaccag acagagaagg cattgcaggt gtggacaaag 301 gtgctggaga agagctcgga cctcatgggg cgcttccgcg tgctgggctg cctggtcaca 361 gcccactcgg agatgggccg ctacaaggag atgctgaagt tcgccgtggt ccagatcgac 421 acggcccggg agctggagga tgccgacttc ctcctggaga gctacctgaa cctggcacgc 481 agcaacgaga agctgtgcga gtttcacaag accatctcct actgcaagac ctgccttggc 541 gtgccaggta ccagggcagg tgcccagctc ggaggccagg tcagcctgag catgggcaat 601 gccttcctgg gcctcagcgt cttccagaag gccctggaga gcttcgagaa ggccctgcgc 661 tacgcgcaca acaatgatga caccatgctc gagtgccgcg tgtgctgcag cctgggcagc 721 ttctatgccc aggtcaagga ctacgagaaa gccctgttct tcccctgcaa ggcggcagag 781 cttgtcaaca actatggcaa aggctggagc ctgaagtacc aggccatgag ccagtaccac 841 atggccgtgg cctatcgcct gctgggccgc ctgggcagtg ccatggagtg ttgtgaggag 901 tctatgaaga tcgccctgca gcacggcgac cggccactgc aggcgctctg cctgctctgc 961 ttcgctgaca tccaccggag ccgtggggac ctggagacag ccttccccag gtacgactcc 1021 gccatgagca tcatgaccga gatcggaaac cgcctggggc aggtgcaggc gctgctgggt 1081 gtggccaagt gctgggtggc caggaaggcg ctggacaagg ctctggatgc catcgagaga 1141 gcccaggatc tggccgagga ggtggggaac aagctgagcc agctcaagct gcactgtctg 1201 agcgagagca tttaccgcag caaagggctg cagcgggaac tgcgcgcgca cgttgtgagg 1261 ttccacgagt gcgtggagga gacggagctc tactgcgcgc tgtgcggcga gtccataggc 1321 gagaagaaca gccggctgca ggccctaccc tgctcccaca tcttccacct caggtgcctg 1381 cagaacaacg ggacccggag ctgtcccaac tgccgccgct cctccatgaa gcctggcttt 1441 gtatgactcc tggcaggagg gcgtgggctt cctcctgggc gactcctgct ctttctccac 1501 tgcaccgagt ggccattact cctgggcagc tgccaggtct gcctcaccat acgcaacgct 1561 tggggccccg agggcgtgct cccctggccc agctcccctg ccctgcctct ttgtactttg 1621 ctctttatag aaaaataaac tgtttgtacc tgg // LOCUS HSACROS 1388 bp RNA PRI 23-MAR-1995 DEFINITION Human mRNA for acrosin (EC 3.4.21.10). ACCESSION Y00970 NID g28325 KEYWORDS acrosin; serine protease. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1388) AUTHORS Baba,T. TITLE Direct Submission JOURNAL Submitted (13-FEB-1989) Tadashi Baba, Institiute of Applied Biochemistry, University of Tsukuba, Tsukuba City, Ibaraki 305, Japan REFERENCE 2 (bases 1 to 1388) AUTHORS Baba,T., Watanabe,K., Kashiwabara,S. and Arai,Y. TITLE Primary structure of human proacrosin deduced from its cDNA sequence JOURNAL FEBS Lett. 244 (2), 296-300 (1989) MEDLINE 89153568 FEATURES Location/Qualifiers source 1..1388 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="testis" /clone_lib="lambda gt11" /clone="H4" sig_peptide 17..73 /note="signal peptide (AA -19 to -1)" CDS 17..1282 /note="preproacrosin (AA -19 to 402)" /codon_start=1 /db_xref="PID:g28326" /db_xref="SWISS-PROT:P10323" /translation="MVEMLPTAILLVLAVSVVAKDNATCDGPCGLRFRQNPQGGVRIV GGKAAQHGAWPWMVSLQIFTYNSHRYHTCGGSLLNSRWVLTAAHCFVGKNNVHDWRLV FGAKEITYGNNKPVKAPVQERYVEKIIIHEKYNSATEGNDIALVEITPPISCGRFIGP GCLPHLKAGLPRGSQSCWVAGWGYIEEKAPRPSSILMEARVDLIDLDLCNSTQWYNGR VQPTNVCAGYPVGKIDTCQGDSGGPLMCKDSKESAYVVVGITSWGVGCARAKRPGIYT ATWPYLNWIASKIGSNALRMIQSATPPPPTTRPPPIRPPFSHPISAHLPWYFQPPPRP LPPRPPAAQPRPPPSPPPPPPPPASPLPPPPPPPPPTPSSTTKLPQGLSFAKRLQQLI EVLKGKTYSDGKNHYDMETTELPELTSTS" mat_peptide 74..1279 /note="mature proacrosin (AA 1 to 402)" misc_feature 1343..1348 /note="pot. polyA signal" misc_feature 1347..1352 /note="pot. polyA signal" misc_feature 1351..1356 /note="pot. polyA signal" BASE COUNT 347 a 432 c 334 g 275 t ORIGIN 1 caggcagtgc aggagtatgg ttgagatgct accaactgcc attctgctgg tcttggcagt 61 gtccgtggtt gctaaagata acgccacgtg tgatggcccc tgtgggttac ggttcaggca 121 aaacccacag ggtggtgtcc gcatcgtcgg cgggaaggct gcacagcatg gggcctggcc 181 ctggatggtc agcctccaga tcttcacgta caacagccac aggtaccaca catgtggagg 241 cagcttgctg aattcacgat gggtgctcac tgctgctcac tgcttcgtcg gcaaaaataa 301 tgtgcatgac tggagactgg ttttcggagc aaaggaaatt acatatggga acaataaacc 361 agtaaaggcg cctgtgcaag agagatatgt ggagaaaatc atcattcatg aaaaatacaa 421 ctctgcgaca gagggaaatg acattgccct cgtggagatc acccctccca tttcgtgtgg 481 gcgcttcatt gggccgggct gcctgcccca cttgaaggca ggcctcccca gaggctccca 541 gagctgctgg gtggccggct ggggatatat agaagagaaa gcccccaggc catcatctat 601 actgatggag gcacgtgtgg atctcatcga cctggacttg tgtaactcga cccagtggta 661 caatgggcgc gttcagccaa ccaatgtgtg cgcggggtat cctgtaggca agatcgacac 721 ctgccaggga gacagcggcg ggcctctcat gtgcaaagac agcaaggaaa gcgcctatgt 781 ggtcgtggga atcacaagct ggggggtagg ctgtgcccgt gccaagcgcc ccggaatcta 841 cacggccacc tggccttatc tgaactggat cgcctccaag attggttcta acgctttgcg 901 tatgattcaa tcggccaccc ctccaccgcc caccactcga ccgcccccga ttcgaccccc 961 cttctcccac cctatctctg ctcaccttcc ttggtatttc caaccgcccc ctcgaccact 1021 tccaccccga ccaccggcag cccagccccg acccccacct tcacccccgc ccccaccccc 1081 acctccagcc tcacctttac ccccaccccc acccccaccc ccacctacac cctcatctac 1141 cacaaaactt ccccaaggac tttcttttgc caagcgccta cagcagctca tagaggtctt 1201 gaaggggaag acctattccg acggaaagaa ccattatgac atggagacca cagagctccc 1261 agaactgacc tcgacctcct gatctgacct ggttctcaac agacccagtg agcccttcac 1321 tcctgagaaa aaggaaagat gaaataaata aataaacata tatatataga tataaaaaaa 1381 aaaaaaaa // LOCUS HSACTNBC 2272 bp RNA PRI 13-FEB-1995 DEFINITION H.sapiens mRNA for activin beta-C chain. ACCESSION X82540 NID g669154 KEYWORDS activin beta-C chain; activin beta-C gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2272) AUTHORS H tten,G., Neidhardt,H., Schneider,C. and Pohl,J. TITLE Cloning of a new member of the TGF-beta family: a putative new activin beta C chain JOURNAL Biochem. Biophys. Res. Commun. 206 (2), 608-613 (1995) MEDLINE 95126961 REFERENCE 2 (bases 1 to 2272) AUTHORS Hoetten,G. TITLE Direct Submission JOURNAL Submitted (07-NOV-1994) G. Hoetten, Biopharm GmbH, Czernyring 22, D-69115 Heidelberg, FRG FEATURES Location/Qualifiers source 1..2272 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /clone_lib="liver cDNA (Clontech)" gene 128..1186 /gene="activin beta-C" CDS 128..1186 /gene="activin beta-C" /codon_start=1 /product="activin beta-C chain" /db_xref="PID:g669155" /translation="MTSSLLLAFLLLAPTTVATPRAGGQCPACGGPTLELESQRELLL DLAKRSILDKLHLTQRPTLNRPVSRAALRTALQHLHGVPQGALLEDNREQECEIISFA ETGLSTINQTRLDFHFSSDRTAGDREVQQASLMFFVQLPSNTTWTLKVRVLVLGPHNT NLTLATQYLLEVDASGWHQLPLGPEAQAACSQGHLTLELVLEGQVAQSSVILGGAAHR PFVAARVRVGGKHQIHRRGIDCQGGSRMCCRQEFFVDFREIGWHDWIIQPEGYAMNFC IGQCPLHIAGMPGIAASFHTAVLNLLKANTAAGTTGGGSCCVPTARRPLSLLYYDRDS NIVKTDIPDMVVEACGCS" BASE COUNT 510 a 664 c 512 g 586 t ORIGIN 1 caaggagcca tgccagctgg acacacactt cttccagggc ctctggcagc caggacagag 61 ttgagaccac agctgttgag accctgagcc ctgagtctgt attgctcaag aagggccttc 121 cccagcaatg acctcctcat tgcttctggc ctttctcctc ctggctccaa ccacagtggc 181 cactcccaga gctggcggtc agtgtccagc atgtgggggg cccaccttgg aactggagag 241 ccagcgggag ctgcttcttg atctggccaa gagaagcatc ttggacaagc tgcacctcac 301 ccagcgccca acactgaacc gccctgtgtc cagagctgct ttgaggactg cactgcagca 361 cctccacggg gtcccacagg gggcacttct agaggacaac agggaacagg aatgtgaaat 421 catcagcttt gctgagacag gcctctccac catcaaccag actcgtcttg attttcactt 481 ctcctctgat agaactgctg gtgacaggga ggtccagcag gccagtctca tgttctttgt 541 gcagctccct tccaatacca cttggacctt gaaagtgaga gtccttgtgc tgggtccaca 601 taataccaac ctcaccttgg ctactcagta cctgctggag gtggatgcca gtggctggca 661 tcaactcccc ctagggcctg aagctcaagc tgcctgcagc caggggcacc tgaccctgga 721 gctggtactt gaaggccagg tagcccagag ctcagtcatc ctgggtggag ctgcccatag 781 gccttttgtg gcagcccggg tgagagttgg gggcaaacac cagattcacc gacgaggcat 841 cgactgccaa ggagggtcca ggatgtgctg tcgacaagag ttttttgtgg acttccgtga 901 gattggctgg cacgactgga tcatccagcc tgagggctac gccatgaact tctgcatagg 961 gcagtgccca ctacacatag caggcatgcc tggtattgct gcctcctttc acactgcagt 1021 gctcaatctt ctcaaggcca acacagctgc aggcaccact ggagggggct catgctgtgt 1081 acccacggcc cggcgccccc tgtctctgct ctattatgac agggacagca acattgtcaa 1141 gactgacata cctgacatgg tagtagaggc ctgtgggtgc agttagtcta tgtgtggtat 1201 gggcagccca aggttgcatg ggaaaacacg cccctacaga agtgcacttc cttgagagga 1261 gggaatgacc tcattctctg tccagaatgt ggactccctc ttcctgagca tcttatggaa 1321 attaccccac ctttgacttg aagaaacctt catctaaagc aagtcactgt gccatcttcc 1381 tgaccactac cctctttcct agggcatagt ccatcccgct agtccatccc gctagcccca 1441 ctccagggac tcagacccat ctccaaccat gagcaatgcc atctggttcc caggcaaaga 1501 cacccttagc tcacctttaa tagaccccat aacccactat gccttcctgt cctttctact 1561 caatggtccc cactccaaga tgagttgaca caaccccttc ccccaatttt tgtggatctc 1621 cagagaggcc cttctttgga ttcaccaaag tttagatcac tgctgcccaa aatagaggct 1681 tacctacccc cctctttgtt gtgagcccct gtccttctta gttgtccagg tgaactacta 1741 aagctctctt tgcatacctt catccatttt ttgtccttct ctgcctttct ctatgccctt 1801 aaggggtgac ttgcctgagc tctatcacct gagctcccct gccctctggc ttcctgctga 1861 ggtcagggca tttcttatcc ctgttccctc tctgtctagg tgtcatggtt ctgtgtaact 1921 gtggctattc tgtgtcccta cactacctgg ctaccccctt ccatggcccc agctctgcct 1981 acattctgat tttttttttt tttttttttt tgaaaagtta aaaattcctt aattttttat 2041 tcctggtacc actaccacaa tttacagggc aatatacctg atgtaatgaa aagaaaaaga 2101 aaaagacaaa gctacaacag ataaaagacc tcaggaatgt acatctaatt gacactacat 2161 tgcattaatc aatagctgca ctttttgcaa actgtggcta tgacagtcct gaacaagaag 2221 ggtttcctgt ttaagctgca gtaacttttc tgactatgga tcatcgttcc tt // LOCUS HSADDA 3936 bp RNA PRI 31-DEC-1992 DEFINITION Human mRNA for erythrocyte adducin alpha subunit. ACCESSION X58141 NID g28381 KEYWORDS adducin; membrane skeleton protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3936) AUTHORS Joshi,R. TITLE Direct Submission JOURNAL Submitted (04-MAR-1991) R. Joshi, Duke University Medical Center, Dept of Biochemistry, Howard Hughes Medical Institute, P O Box 3892 DUMC, Durham NC 27710, USA REFERENCE 2 (bases 1 to 3936) AUTHORS Joshi,R., Gilligan,D.M., Otto,E., McLaughlin,T. and Bennett,V. TITLE Primary structure and domain organization of human alpha and beta adducin JOURNAL J. Cell Biol. 115 (3), 665-675 (1991) MEDLINE 92011907 FEATURES Location/Qualifiers source 1..3936 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="reticulocytes & K562 (erythroleukemic) cells" /cell_line="reticulocytes" /clone_lib="lambda gt11 human reticulocyte & K562 cDNA" /clone="K10, R107, R8 & R301" mRNA 1..3936 /note="erythrocyte alpha adducin" /evidence=experimental CDS 155..2368 /codon_start=1 /product="erythrocyte alpha adducin" /db_xref="PID:g28382" /db_xref="SWISS-PROT:P35611" /translation="MNGDSRAAVVTSPPPTTAPHKERYFDRVDENNPEYLRERNMAPD LRQDFNMMEQKKRVSMILQSPAFCEELESMIQEQFKKGKNPTGLLALQQIADFMTTNV PNVYPAAPQGGMAALNMSLGMVTPVNDLRGSDSIAYDKGEKLLRCKLAAFYRLADLFG WSQLIYNHITTRVNSEQEHFLIVPFGLLYSEVTASSLVKINLQGDIVDRGSTNLGVNQ AGFTLHSAIYAARPDVKCVVHIHTPAGAAVSAMKCGLLPISPEALSLGEVAYHDYHGI LVDEEEKVLIQKNLGPKSKVLILRNHGLVSVGESVEEAFYYIHNLVVACEIQVRTLAS AGGPDNLVLLNPEKYKAKSRSPGSPVGEGTGSPPKWQIGEQEFEALMRMLDNLGYRTG YPYRYPALREKSKKYSDVEVPASVTGYSFASDGDSGTCSPLRHSFQKQQREKTRWLNS GRGDEASEEGQNGSSPKSKTKWTKEDGHRTSTSAVPNLFVPLNTNPKEVQEMRNKIRE QNLQDIKTAGPQSQVLCGVVMDRSLVQGELVTASKAIIEKEYQPHVIVSTTGPNPFTT LTDRELEEYRREVERKQKGCEENLDEAREQKEKSPPDQPAVPHPPPSTPIKLEEDLVP EPTTGDDSDAATFKPTLPDLSPDEPSEALGFPMLEKEEEAHRPPSPTEAPTEASPEPA PDPAPVAEEAAPSAVEEGAAADPGSDGSPGKSPSKKKKKFRTPSFLKKSKKKSDS" BASE COUNT 955 a 1061 c 1049 g 871 t ORIGIN 1 gggaccggcg ctcagctggc ggcgcgctcg ccggccgagg tgggatcccg aggcctctcc 61 agtccgccga gggcgcacca ccggcccgtc tcgcccgccg cgccggggag gtggagcacg 121 agcgcacgtg ttaggaacct agaaagattg tacaatgaat ggtgattctc gtgctgcggt 181 ggtgacctca ccacccccga ccacagcccc tcacaaggag aggtacttcg accgagtaga 241 tgagaacaac ccagagtact tgagggagag gaacatggca ccagaccttc gccaggactt 301 caacatgatg gagcaaaaga agagggtgtc catgattctg caaagccctg ctttctgtga 361 agaattggaa tcaatgatac aggagcaatt taagaagggg aagaacccca caggcctatt 421 ggcattacag cagattgcag attttatgac cacgaatgta ccaaatgtct acccagcagc 481 tccgcaagga gggatggctg ccttaaacat gagtcttggt atggtgactc ctgtgaacga 541 tcttagagga tctgattcta ttgcgtatga caaaggagag aagttattac ggtgtaaatt 601 ggcagcgttt tatagactag cagatctctt tgggtggtct cagcttatct acaatcatat 661 cacaaccaga gtgaactccg agcaggaaca cttcctcatt gtcccttttg ggcttcttta 721 cagtgaagtg actgcatcca gtttggttaa gatcaatcta caaggagata tagtagatcg 781 tggaagcact aatctgggag tgaatcaggc cggcttcacc ttacactctg caatttatgc 841 tgcacgcccg gacgtgaagt gcgtcgtgca cattcacacc ccagcagggg ctgcggtctc 901 tgcaatgaaa tgtggcctct tgccaatctc cccggaggcg ctttcccttg gagaagtggc 961 ttatcatgac taccatggca ttctggttga tgaagaggaa aaagttttga ttcagaaaaa 1021 tctggggcct aaaagcaagg ttcttattct ccggaaccat gggctcgtgt cagttggaga 1081 gagcgttgag gaggccttct attacatcca taaccttgtg gttgcctgtg agatccaggt 1141 tcgaactctg gccagtgcag gaggaccaga caacttagtc ctgctgaatc ctgagaagta 1201 caaagccaag tcccgttccc cagggtctcc ggtaggggaa ggcactggat cgcctcccaa 1261 gtggcagatt ggtgagcagg aatttgaagc cctcatgcgg atgctcgata atctgggcta 1321 cagaactggc tacccttatc gataccctgc tctgagagag aagtctaaaa aatacagcga 1381 tgtggaggtt cctgctagtg tcacaggtta ctcctttgct agtgacggtg attcgggcac 1441 ttgctcccca ctcagacaca gttttcagaa gcagcagcgg gagaagacaa gatggctgaa 1501 ctctggccgg ggcgacgaag cttccgagga agggcagaat ggaagcagtc ccaagtcgaa 1561 gactaagtgg actaaagagg atggacatag aacttccacc tctgctgtcc ctaacctgtt 1621 tgttccattg aacactaacc caaaagaggt ccaggagatg aggaacaaga tccgagagca 1681 gaatttacag gacattaaga cggctggccc tcagtcccag gttttgtgtg gtgtagtgat 1741 ggacaggagc ctcgtccagg gagagctggt gacggcctcc aaggccatca ttgaaaagga 1801 gtaccagccc cacgtcattg tgagcaccac gggccccaac cccttcacca cactcacaga 1861 ccgtgagctg gaggagtacc gcagggaggt ggagaggaag cagaagggct gtgaagagaa 1921 tctggacgag gctagagaac agaaagaaaa gagtcctcca gaccagcctg cggtccccca 1981 cccgcctccc agcactccca tcaagctgga ggaagacctt gtgccggagc cgactactgg 2041 agatgacagt gatgctgcca cctttaagcc aactctcccc gatctgtccc ctgatgaacc 2101 ttcagaagca ctcggcttcc caatgttaga gaaggaggag gaagcccata gacccccaag 2161 ccccactgag gcccctactg aggccagccc cgagccagcc ccagacccag ccccggtggc 2221 tgaagaggct gccccctcag ctgtcgagga gggggccgcc gcggaccctg gcagcgatgg 2281 gtctccaggc aagtccccgt ccaaaaagaa gaagaagttc cgtaccccgt cctttctgaa 2341 gaagagcaag aagaagagtg actcctgaaa gccctgcgct aacactgtcc tgtccggagc 2401 gaccctggct ctgccagcgt ccccggccac gtctgtgctc tgtccttgtg taatggaatg 2461 caaaaaagcc aagccctccg cctagaggtc ccctcacgtg accagccccg tgtagccccg 2521 ggctgaccca gtgtgtgctc agcagcccca ccccaccctg ccccttgtcc tctcagagcc 2581 tcagcttctg ggggagacat gctctcccca caggggggag gcactaagtc atggtcctgg 2641 ctggaaggta ctgaaggctt ctgcagcttt ggctgcacgt caccctcctg agcctcacct 2701 ttcctgccgt ccctcctgtt gtgaaatcac cacattctgt ctctgcttgg cttcccctcc 2761 accctaaagt ctcaggtgac ggactcagac tcctggcttc atgtggcatt ctctctgctc 2821 agtgatctca cttaaatcta tatacaaagc cttggtcccg tgaaaacact cgtgtgccca 2881 ccagcggcct tgaagaggca ggtctgggcc agatgctggg caggaaaccc cagcggcaga 2941 tgggcctgtg tgcacccaac gtgatgctat gcatgtctga ccgacgatcc ctcgaccaga 3001 atcagattca ggagctcagt ttctttttca cttgggtctc tggattcctg tcatagggaa 3061 ggtatatcag gaggggaaga ggcctttcta gaattttctt tgagcaggtt tacaatttag 3121 cttacatttt tcgactgtga acgtgaatag gctgcttttt gctttcttct ttccagaccc 3181 cacagtagag cacttttcac ttatttgggg gaggcttcag gggactgttc tcaccttaac 3241 tcagccagaa agatgcccta gttgtgatca aaggtaactc gaggtggagg gtagccctgg 3301 ggcccctcga catcaccgtc attgatggag cctgaaccgt gtgctcctcg gcagatgctg 3361 ttgttgttac ttccctccaa gaggctggaa aagggctcag agctgctgag caggaaccgg 3421 agggtgaccc atttcaggag gtgccggtac cagcctgact aggtacaggc aagcttgtgt 3481 gggcccaaca ggcccttggt agagctggtg ccagatgtgg gctcagatcc tgggcatgat 3541 gggccgagcc acctcggatc ccactgattg gccagccgag cgagaaccag gctgctgcat 3601 ggcactgacc gccgcttcca gcttcctctg agccgcaggg cctgctacgc gggcaagcgt 3661 gctgcctctc ttctgtgtcg ttttgttgcc aaggcagaat gaaaagtcct taaccgtgga 3721 ctcttccttt atcccctcct ttaccccaca tatgcaatga cttttaattt tcacttttgt 3781 agtttaatcc tttgtattac aacatgaaat atagttgcat atatggacac cgacttggga 3841 ggacaggtcc tgaatgtcct ttctccagtg taacatgttt tactcacaaa taaaattctt 3901 tcagcaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa // LOCUS HSADE2H1 1426 bp RNA PRI 12-FEB-1992 DEFINITION H.sapiens ADE2H1 mRNA showing homologies to SAICAR synthetase and AIR carboxylase of the purine pathway (EC 6.3.2.6, EC 4.1.1.21). ACCESSION X53793 NID g28383 KEYWORDS purine biosynthetic pathway. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1426) AUTHORS Minet,M. TITLE Direct Submission JOURNAL Submitted (11-JUL-1990) Minet M., CNRS, Avenue de la Terrasse, 91190 Gif Sur Yvette, France REFERENCE 2 (bases 1 to 1426) AUTHORS Minet,M. and Lacroute,F. TITLE Cloning and sequencing of a human cDNA coding for a multifunctional polypeptide of the purine pathway by complementation of the ade2-101 mutant in Saccharomyces cerevisiae JOURNAL Curr. Genet. 18 (4), 287-291 (1990) MEDLINE 91070616 FEATURES Location/Qualifiers source 1..1426 /organism="Homo sapiens" /db_xref="taxon:9606" gene 25..1302 /gene="ADE2H1" CDS 25..1302 /gene="ADE2H1" /note="5' half of the product is homologues to Bacillus subtiis SAICAR synthetase, 3' half corresponds to the catalytic subunit of AIR carboxylase" /codon_start=1 /db_xref="PID:g28384" /db_xref="SWISS-PROT:P22234" /translation="MATAEVLNIGKKLYEGKTKEVYELLDSPGKVLLQSKDQITAGNA ARKNHLEGKAAISNKITSCIFQLLQEAGIKTAFTRKCGETAFIAPQCEMIPIEWVCRR IATGSFLKRNPGVKEGYKFYPPKVELFFKDDANNDPQWSEEQLIAAKFCFAGLLIGQT EVDIMSHATQAIFEILEKSWLPQNCTLVDMKIEFGVDVTTKEIVLADVIDNDSWRLWP SGDRSQQKDKQSYRDLKEVTPEGLQMVKKNFEWVAERVELLLKSESQCRVVVLMGSTS DLGHCEKIKKACGNFGIPCELRVTSAHKGPDETLRIKAEYEGDGIPTVFVAVAGRSNG LGPVMSGNTAYPVISCPPLTPDWGVQDVWSSLRLPSGLGCSTVLSPEGSAQFAAQIFG LSNHLVWSKLRASILNTWISLKQADKKIRECNL" BASE COUNT 460 a 250 c 339 g 377 t ORIGIN 1 gcagccctca gcccacttag gataatggcg acagctgagg tactgaacat tggtaaaaaa 61 ttatatgagg gtaaaacaaa agaagtctac gaattgttag acagtccagg aaaagtcctc 121 ctgcagtcca aggaccagat tacagcagga aatgcagcta gaaaaaacca cctggaagga 181 aaagctgcaa tctcaaataa aatcaccagt tgtatttttc agttattaca ggaagcaggt 241 attaaaactg ccttcaccag aaaatgtggg gagacagctt tcattgcacc gcagtgtgaa 301 atgattccaa ttgaatgggt ttgcagaaga atagcaactg gttcttttct caaaagaaat 361 cctggtgtca aggaaggata taagttttac ccacctaaag tggagttgtt tttcaaggat 421 gatgccaata atgacccaca gtggtctgag gaacagctga ttgctgcaaa attttgcttt 481 gctggacttc ttataggcca gactgaagtg gatatcatga gtcatgctac acaggctata 541 tttgaaatac tggagaaatc ctggttgccc cagaattgta cactggttga tatgaagatt 601 gaatttggtg ttgatgtaac caccaaagaa attgttcttg ctgatgttat tgacaatgat 661 tcctggagac tctggccatc aggagatcga agccaacaga aagacaaaca gtcttatcgg 721 gacctcaaag aagtaactcc tgaagggctc caaatggtaa agaaaaactt tgagtgggtt 781 gcagagagag tagagttgct tttgaaatca gaaagtcagt gcagggttgt agtgttgatg 841 ggctctactt ctgatcttgg tcactgtgaa aaaatcaaga aggcctgtgg aaattttggc 901 attccatgtg aacttcgagt aacatctgcg cataaaggac cagatgaaac tctgaggatt 961 aaagctgagt atgaagggga tggcattcct actgtatttg tggcagtggc aggcagaagt 1021 aatggtttgg gaccagtgat gtctgggaac actgcatatc cagttatcag ctgtcctccc 1081 ctcacaccag actggggagt tcaggatgtg tggtcttctc ttcgactacc cagtggtctt 1141 ggctgttcaa ccgtactttc tccagaagga tcagctcaat ttgctgctca gatatttggg 1201 ttaagcaacc atttggtatg gagcaaactg cgagcaagca ttttgaacac atggatttcc 1261 ttgaagcagg ctgacaagaa aatcagagaa tgtaatttat aagaaagaat gccattgaat 1321 tttttagggg aaaaactaca aatttctaat ttagctgaag gaaaatcaag caagatgaaa 1381 aggtaatttt aaattagaga acacaaataa aatgtattag tgaaca // LOCUS HSADENCYR 6005 bp RNA PRI 07-DEC-1994 DEFINITION H.sapiens mRNA for adenylyl cyclase. ACCESSION Z35309 NID g516262 KEYWORDS adenylyl cyclase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6005) AUTHORS Defer,N., Marinx,O., Stengel,D., Danisova,A., Iourgenko,V., Matsuoka,I., Caput,D. and Hanoune,J. TITLE Molecular cloning of the human type VIII adenylyl cyclase JOURNAL FEBS Lett. 351 (1), 109-113 (1994) MEDLINE 94357261 REFERENCE 2 (bases 1 to 6005) AUTHORS Defer,N. TITLE Direct Submission JOURNAL Submitted (21-JUL-1994) Nicole Defer, Inserm U99, Hopital Henri Mondor, CRETEIL F-94010, France FEATURES Location/Qualifiers source 1..6005 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="newborn brain stem" /clone_lib="cDNA library" CDS 2095..5850 /EC_number="4.6.1.1" /codon_start=1 /product="adenylyl cyclase" /db_xref="PID:g516263" /db_xref="SWISS-PROT:P40145" /translation="MELSDVRCLTGSEELYTIHPTPPAGDGRSASRPQRLLWQTAVRH ITEQRFIHGHRGGSGSGSGGSGKASDPAGGGPNHHAPQLSGDSALPLYSLGPGERAHS TCGTKVFPERSGSGSASGSGGGGDLGFLHLDCAPSNSDFFLNGGYSYRGVIFPTLRNS FKSRDLERLYQRYFLGQRRKSEVVMNVLDVLTKLTLLVLHLSLASAPMDPLKGILLGF FTGIEVVICALVVVRKDTTSHTYLQYSGVVTWVAMTTQILAAGLGYGLLGDGIGYVLF TLFATYSMLPLPLTWAILAGLGTSLLQVILQVVIPRLAVISINQVVAQAVLFMCMNTA GIFISYLSDRAQRQAFLETRRCVEARLRLETENQRQERLVLSVLPRFVVLEMINDMTN VEDEHLQHQFHRIYIHRYENVSILFADVKGFTNLSTTLSAQELVRMLNELFARFDRLA HEHHCLRIKILGDCYYCVSGLPEPRQDHAHCCVEMGLSMIKTIRYVRSRTKHDVDMRI GIHSGSVLCGVLGLRKWQFDVWSWDVDIANKLESGGIPGRIHISKATLDCLNGDYNVE EGHGKERNEFLRKHNIETYLIKQPEDSLLSLPEDIVKESVSSSDRRNSGATFTEGSWS PELPFDNIVGKQNTLAALTRNSINLLPNHLAQALHVQSGPEEINKRIEHTIDLRSGDK LRREHIKPFSLMFKDSSLEHKYSQMRDEVFKSNLVCAFIVLLFITAIQSLLPSSRVMP MTIQFSILIMLHSALVLITTAEDYKCLPLILRKTCCWINETYLARNVIIFASILINFL GAILNILWCDFDKSIPLKNLTFNSSAVFTDICSYPEYFVFTGVLAMVTCAVFLRLNSV LKLAVLLIMIAIYALLTETVYAGLFLRYDNLNHSGEDFLGTKEVSLLLMAMFLLAVFY HGQQLEYTARLDFLWRVQAKEEINEMKELREHNENMLRNILPSHVARHFLEKDRDNEE LYSQSYDAVGVMFASIPGFADFYSQTEMNNQGVECLRLLNEIIADFDELLGEDRFQDI EKIKTIGSTYMAVSGLSPEKQQCEDKWGHLCALADFSLALTESIQEINKHSFNNFELR IGISHGSVVAGVIGAKKPQYDIWGKTVNLASRMDSTGVSGRIQVPEETYLILKDQGFA FDYRGEIYVKGISEQEGKIKTYFLLGRVQPNPFILPPRRLPGQYSLAAVVLGLVQSLN RQRQKQLLNENNNTGIIKGHYNRRTLLSPSGTEPGAQAEGTDKSDLP" polyA_signal 5901..5906 BASE COUNT 1342 a 1760 c 1614 g 1289 t ORIGIN 1 gactggctgc agccgcagtc ttggtggagg aggtggtgac caccaccgct cctccacctg 61 catccggctg cgggactgcg gcggccgcgt ttgccctgca gaccctgcac cccgggacgc 121 ggctcatctg tcattagcac cggcactaag ctcccaccgc tcagcgactt ggtccgccgc 181 aagctccgcc gcaggctttg tccgctagcg ctcggctgag tctgggcggg cgggaaacct 241 gggctagggc gaggcggggc ccctggacat gcctttctcc acgtccgcct cctcgaccct 301 attgtaagcg gagaaactta gtgtgcgagg caggcagggg acgtccccat tttgatgtcc 361 cgtaccctcc acccccttcg gatcgaggta gtaaagaggc tcctgtagga aactgactgc 421 ctctatgatt gcggcctctt gggggatttt gcgtttagcc cgaaagttgg ctttgccaaa 481 agacgcacgg gtaggaaggg cgaaaaggaa accctgtatt ccgtcgcgct gggctctccg 541 agtccgtgcg caaagcggcc tacgagtcct ggctccgcac ctgcagagga caagagccaa 601 tgcctaaaaa agaacagcgg aggaaccggc tggcgcggcc agctggaacg ctggatcgca 661 gtgcgcccag ggaaggccgg gggcgcccgc cggccctagc cctcagtggt cctctcccac 721 gccggcccgc gcgtgcctct gcctacaaga cctggggcgt cctggccaga tctggatggc 781 aggtccctcc gccacccccg gcccggttcc gggggcgtgg cttggcgcgg gggcgggttt 841 aagtcaccgc gggtgtctga ccactctgac aggtctccaa atttctccca gtcgcctggc 901 gcccgcggtg cgtttcagag ctccagggtg gcacgcggcg gcgcttccct agatccagag 961 gcgtctctgt cgacttccac gcggcctcgg gcctcccttc tcctccaaac cttcgcctca 1021 tccgccaagc ttcggttctc agcctcagat atccgcaccg gcggcttctc ttcgttctcg 1081 gagctcttgg ctttggagcc ctcaccactt tttccttccc ccgcctctcc tgatcctcct 1141 agctccgagc caaatggact ccaaagaacg aaataaaggg gatgagaact gtgtgctgcg 1201 acccttcgaa agcacagctg aaagcgttga cctcgtctta tagatcaggc tgggaccctg 1261 gggcgagagt ccccacaccc cctccgggag ggatgcttct ggccagagcc agcgctgcgc 1321 tgtcagtcct tgctcccgaa ctaggaaaga gcctaggagg gagcctcagc ataccccttc 1381 ctccaaatta actatttggt gaattgttag cgccgaggct accacctctc caaccctgtc 1441 gcggggcgcg ccgccacctc accgtgaccc cctcctcctc cttctttgcc accgccccca 1501 gctccgcccc tgctccccat cccggcgcaa tggagttctc cgaagggcga tgattccagc 1561 cacatctgct aacttcgcac ccatcgctgc cgccggtcac cgccggccag gccccctgca 1621 gccgcggagc agtgggcgtc caaagcccag tgcagcagcc aggacccgcc cgacgcgcag 1681 cagaagcacg gcgcccaggc gcttaggcgt ctcttggaga gcaaaggctg cgccaaaacg 1741 ctgagcctag aatcaaccaa ggagcctgag cccaggaagg ggctgcgtgg ctcacagcgc 1801 tgcggctcct gaggacaaat agccactgcc gctgcgtacc caagctgcgc cggctggcgg 1861 gagagcagca cgcaaggacg ccgaggtccg ccgcgatctt ccaggtgccc tttgcccctg 1921 ggcacagtat gacccgacct acagggagcc ctagcgcagg gctcctgcaa cgggtcagcc 1981 taggataaaa aggatccttg ccaagctcct accaggccgc ctttgagtct ttaggaaccc 2041 ctcctccggc tgcctcccca aggttctggg cctccttccc tgcggcccag agccatggag 2101 ctctccgatg tgcgctgcct tacaggcagc gaggaactct acaccatcca cccgacgccc 2161 ccggccggcg acggcaggag cgcctcccgg ccgcagcggc tgctgtggca gacggcggtg 2221 cgacacatca cggagcagcg cttcattcac gggcaccggg gaggcagcgg cagcgggagt 2281 ggaggctcgg gcaaagcctc ggaccctgcg ggcggcggcc ccaaccacca cgcgccgcag 2341 ctgtcaggcg actcggcgct gcccctctac tcgctgggcc cgggagagcg agcgcacagc 2401 acctgcggca ccaaagtctt cccggaacgc agcgggagcg gcagtgccag cggcagcgga 2461 ggcgggggcg acctgggctt cctgcacctt gactgtgccc ctagcaactc ggatttcttt 2521 cttaatgggg gctatagcta ccgaggggtc attttcccca ccctgcgcaa ctccttcaaa 2581 tctcgggatt tggaacgcct ctaccagcgc tatttcttgg gccaaaggcg caaatcggaa 2641 gtggtgatga acgtgctgga cgtgctgacc aaactcactc tcttggtcct acacttgagc 2701 ctggcctcgg cccccatgga cccgctcaag ggcatcctgc tgggcttctt caccggcatt 2761 gaggtagtga tctgcgccct ggtggtggtc aggaaggaca ccacctccca cacgtacctg 2821 cagtacagcg gcgtggtcac ctgggtggcc atgaccaccc agatcctggc agcaggcctc 2881 ggctacgggc tcctgggcga cggcataggc tacgtgctct tcacgctctt cgccacctac 2941 agtatgctgc cgctgccgct cacctgggcc atcctggccg gcctgggcac ctcgctgctg 3001 caggtcatcc tccaagtggt cataccccgg ctggcggtca tttccatcaa ccaggttgtg 3061 gcccaggcag tgctattcat gtgtatgaac acagctggaa tcttcatcag ttacctgtca 3121 gaccgggccc agcgccaagc tttcctggag actcggaggt gtgtggaggc caggctgcgc 3181 ctggagacag agaaccaaag acaggagcgg ctcgtgcttt ctgtgctccc ccggtttgtt 3241 gtcctggaaa tgatcaacga catgaccaat gtggaagatg agcacctgca gcaccagttc 3301 catcggatct acatccatcg ctatgagaac gtcagtattc tttttgcaga tgttaaagga 3361 tttaccaacc tctccacgac cttgtctgct caggagctgg tcaggatgct caacgagctc 3421 tttgccagat ttgatcgact ggcccatgag catcactgcc ttcgtattaa aatcctgggg 3481 gactgctact actgcgtgtc tggacttcct gagccccgcc aggaccatgc ccactgctgt 3541 gttgaaatgg gtctcagcat gatcaaaacc atcaggtatg tgcggtcaag gacaaaacac 3601 gatgttgaca tgaggattgg aatccactcc ggctcggtgc tgtgcggtgt tttgggacta 3661 cggaagtggc agtttgatgt ctggtcttgg gatgtggata ttgcaaacaa actcgaatct 3721 ggaggaatcc ccgggaggat tcacatttcc aaagccacgc tggactgtct caacggtgac 3781 tataacgtgg aagagggcca tggtaaagag aggaatgaat tcctgaggaa gcataatatc 3841 gaaacttact taattaagca gcctgaggac agtctgctgt ccttgcctga agatatcgtc 3901 aaggagtcag tgagctcctc agaccggaga aacagtgggg ccacattcac tgaaggatcc 3961 tggagccctg aactgccctt tgataatatc gtggggaaac agaatactct ggctgcccta 4021 acaagaaatt caataaatct gcttccaaac catcttgcac aagctttgca tgtccagtct 4081 gggcctgagg aaattaacaa gagaatagaa cataccatcg acttgcggag tggcgataaa 4141 ttgagaagag agcatatcaa gccattctca ctgatgttta aagactccag cctggagcac 4201 aagtattctc aaatgaggga tgaagtgttc aagtcaaact tggtctgtgc atttatcgtt 4261 cttctattta tcacggcaat acaaagtttg cttccttctt caagagtgat gccaatgacc 4321 atccagttct ccattctgat tatgctgcac tcggctctgg tcctcatcac cacagcagag 4381 gattataaat gtttgcccct catcctccgg aaaacttgct gttggattaa tgagacctat 4441 ttggcccgga acgtcatcat ctttgcatcc attttgatta atttcctggg tgccatctta 4501 aatatcctgt ggtgtgattt tgacaagtcg atacccttga agaacctgac tttcaattcc 4561 tcagctgtgt ttacagatat ctgctcctac ccagagtact ttgtcttcac gggggtgttg 4621 gccatggtga cctgtgcagt tttcctccgg ctgaactccg tcctgaagct ggcagtgctg 4681 ctgatcatga ttgccatcta tgccctgctc actgagaccg tctacgcagg cctctttctg 4741 cgttatgaca acctcaacca cagtggagaa gatttcctgg ggaccaagga ggtatcactg 4801 ctactgatgg ccatgttcct cctggctgtg ttctaccatg gacagcagct ggagtacaca 4861 gcccgcctgg acttcctttg gcgagtacag gccaaagagg agatcaatga gatgaaggag 4921 ctgagggaac acaatgagaa catgctccgg aatatcttac ccagccatgt ggcccgccat 4981 ttcctagaga aggaccgaga caatgaggag ctgtattctc aatcctatga tgctgttggg 5041 gtgatgtttg cctccatccc aggatttgcg gacttttact ctcagactga aatgaataac 5101 cagggagtgg aatgcctgcg cttgctcaat gagatcattg ctgacttcga tgagttgctt 5161 ggtgaagacc gatttcaaga cattgaaaag attaagacca ttggcagcac ctacatggcc 5221 gtgtcaggcc tgtcacctga aaaacagcaa tgtgaagaca agtggggaca tttgtgtgct 5281 ctggctgact tctcactcgc cctgacagaa agcatacagg agatcaacaa gcattcattc 5341 aacaattttg aactccggat tggcatcagc cacggctcag tggtagctgg cgttatcggc 5401 gctaagaaac cacagtatga catttggggc aaaactgtga acctggcaag ccgaatggac 5461 agcacggggg ttagtggccg gatccaagtc ccagaggaga cctatctcat cctgaaggac 5521 cagggctttg cctttgatta ccgaggggag atctatgtga agggtatcag tgaacaggaa 5581 ggaaaaatca aaacgtactt tcttctggga agagtccaac ccaacccatt catcttgccc 5641 ccaagaagac tgcctgggca gtactccctg gccgcggttg tcctgggact tgtccagtcc 5701 ctcaataggc aaaggcagaa gcagctactc aatgagaaca acaacacagg aatcatcaag 5761 ggtcattaca accggcggac tttgttgtca cccagcggca cagagcctgg agcccaggct 5821 gaaggcaccg acaaatctga tttgccataa aagcattttc tttctgtttt tttttttttt 5881 tgtatttctt ttatatataa aataaatata ctaataaaaa ggtttaattt tttttagaac 5941 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaacccc aaaaaaaaaa 6001 aaaaa // LOCUS HSADOLB 1611 bp RNA PRI 29-OCT-1994 DEFINITION Human mRNA for aldolase B (EC 4.1.2.13). ACCESSION X01098 K01177 NID g28419 KEYWORDS aldolase; aldolase B. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1611) AUTHORS Paolella,G., Santamaria,R., Izzo,P., Costanzo,P. and Salvatore,F. TITLE Isolation and nucleotide sequence of a full-length cDNA coding for aldolase B from human liver JOURNAL Nucleic Acids Res. 12 (19), 7401-7410 (1984) MEDLINE 85037920 REFERENCE 2 (bases 777 to 1348) AUTHORS Besmond,C., Dreyfus,J.C., Gregori,C., Frain,M., Zakin,M.M., Sala Trepat,J. and Kahn,A. TITLE Nucleotide sequence of a cDNA clone for human aldolase B JOURNAL Biochem. Biophys. Res. Commun. 117 (2), 601-609 (1983) MEDLINE 84104270 COMMENT Data kindly reviewed (13-FEB-1986) by C. Besmond. FEATURES Location/Qualifiers source 1..1611 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 67..1161 /EC_number="4.1.2.13" /codon_start=1 /product="aldolase B" /db_xref="PID:g28420" /db_xref="SWISS-PROT:P05062" /translation="MAHRFPALTQEQKKELSEIAQSIVANGKGILAADESVGTMGNRL QRIKVENTEENRRQFREILFSVDSSINQSIGGVILFHETLYQKDSQGKLFRNILKEKG IVVGIKLDQGGAPLAGTNKETTIQGLDGLSERCAQYKKDGVDFGKWRAVLRIADQCPS SLAIQENANALARYASICQQNGLVPIVEPEVIPDGDHDLEHCQYVTEKVLAAVYKALN DHHVYLEGTLLKPNMVTAGHACTKKYTPEQVAMATVTALHRTVPAAVPGICFLSGGMS EEDATLNLNAINLCPLPKPWKLSFSYGRALQASALAAWGGKAANKEATQEAFMKRAMA NCQAAKGQYVHTGSSGAASTQSLFTACYTY" misc_feature 1310..1315 /note="pot. polyadenylation signal" polyA_signal 1574..1579 /note="polyadenylation signal" polyA_site 1595 /note="polyadenylation site" BASE COUNT 465 a 413 c 382 g 351 t ORIGIN 1 agctgctgcc tcacccacag cttttgcata tctaggagga ctcttctctc ccaaactacc 61 tgtcacatgg cccaccgatt tccagccctc acccaggagc agaagaagga gctctcagaa 121 attgcccaga gcattgttgc caatggaaag gggatcctgg ctgcagatga atctgtaggt 181 accatgggga accgcctgca gaggatcaag gtggaaaaca ctgaagagaa ccgccggcag 241 ttccgagaaa tcctcttctc tgtggacagt tccatcaacc agagcatcgg gggtgtgatc 301 cttttccacg agaccctcta ccagaaggac agccagggaa agctgttcag aaacatcctc 361 aaggaaaagg ggatcgtggt gggaatcaag ttagaccaag gaggtgctcc tcttgcagga 421 acaaacaaag aaaccaccat tcaagggctt gatggcctct cagagcgctg tgctcagtac 481 aagaaagatg gtgttgactt tgggaagtgg cgtgctgtgc tgaggattgc cgaccagtgt 541 ccatccagcc tcgctatcca ggaaaacgcc aacgccctgg ctcgttacgc cagcatctgt 601 cagcagaatg gactggtacc tattgttgaa ccagaggtaa ttcctgatgg agaccatgac 661 ctggaacact gccagtatgt tactgagaag gtcctggctg ctgtctacaa ggccctaaat 721 gaccatcatg tttacctgga gggcaccctg ctaaagccca acatggtgac tgctggacat 781 gcctgcacca agaagtatac tccagaacaa gtagctatgg ccaccgtaac agctctccac 841 cgtactgttc ctgcagctgt tcctggcatc tgctttttgt ctggtggcat gagtgaagag 901 gatgccactc tcaacctcaa tgctatcaac ctttgccctc taccaaagcc ctggaaacta 961 agtttctctt atggacgggc cctgcaggcc agtgcactgg ctgcctgggg tggcaaggct 1021 gcaaacaagg aggcaaccca ggaggctttt atgaagcggg ccatggctaa ctgccaggcg 1081 gccaaaggac agtatgttca cacgggttct tctggggctg cttccaccca gtcgctcttc 1141 acagcctgct atacctacta gggtccaatg cccgccagcc tagctccagt gcttctagta 1201 ggagggctga aagggagcaa cttttcctcc aatcctggaa attcgacaca attagatttg 1261 aactgctgga aatacaacac atgttaaatc ttaagtacaa gggggaaaaa ataaatcagt 1321 tattgaaaca taaaaatgaa taccaaggac ctgatcaaat ttcacacagc gagtttcctt 1381 gcaacacttt cagctcccca tgctccagaa tacccaccca agaaaataac cgggatctaa 1441 aacaataatc ggctcctcat ccaaagaaca actgctgatt gaaacacctc attagctgat 1501 gtagagaagt gcatcttatg aaacagtctt agcagtggta ggttgggaag gagatagctg 1561 caaccaaaaa agaaataaat attctataaa ccttcaaaaa aaaaaaaaaa a // LOCUS HSADTG 2634 bp RNA PRI 08-JAN-1998 DEFINITION H.sapiens mRNA for gamma-adaptin. ACCESSION Y12226 NID g2765189 KEYWORDS ADTG gene; gamma-adaptin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2634) AUTHORS Peyrard,M., Lagerkrantz,S., Parveneh,S., Fransson,I., Sahlen,S. and Dumanski,J.P. TITLE Cloning, expression pattern and chromosomal assignment to 16q23 of the human gamma-adaptin gene JOURNAL Unpublished REFERENCE 2 (bases 1 to 2634) AUTHORS Peyrard,M. TITLE Direct Submission JOURNAL Submitted (02-APR-1997) M. Peyrard, Molecular Medicin, CMM, L8:00 Building Karolinska Hospital, Stockholm, S-171 76, SWEDEN FEATURES Location/Qualifiers source 1..2634 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" /map="q23" gene 28..2505 /gene="ADTG" CDS 28..2505 /gene="ADTG" /codon_start=1 /product="gamma-adaptin" /db_xref="PID:e321949" /db_xref="PID:g2765190" /translation="MPAPIRLRELIRTIRTARTQAEEREMIQKECAAIRSSFREEDNT YRCRNVAKLLYMHMLGYPAHFGQLECLKLIASQKFTDKRIGYLGAMLLLDERQDVHLL MTNCIKNDLNHSTQFVQGLALCTLGCMGSSEMCRDLAGEVEKLLKTSNSYLRKKAALC AVHVIRKVPELMEMFLPATKNLLNEKNHGVLHTSVVLLTEMCERSPDMLAHFRKNEKL VPQLVRILKNLIMSGYSPEHDVSGISDPFLQVRILRLLRILGRNDDDSSEAMNDILAQ VATNTETSKNVGNAILYETVLTIMDIKSESGLRVLAINILGRFLLNNDKNIRYVALTS LLKTVQTDHNAVQRHRSTIVDCLKDLDVSIKRRAMELSFALVNGNNIRGMMKELLYFL DSCEPEFKADCASGIFLAAEKYAPSKRWHIDTIMRVLTTAGSYVRDDAVPNLIQLITN SVEMHAYTVQRLYKAILGDYSQQPLVQVAAWCIGEYGDLLVSGQCEEEEPIQVTEDEV LDILESVLISNMSTSVTRGYALTAIMKLSTRFTCTVNRIKKVVSIYGSSIDVELQQRA VEYNALFKKYDHMRSALLERMPVMEKVTTNGPTEIVQTNGETEPAPLETKPPPSGPQP TSQANDLLDLLGGNDITPVIPTAPTSKPSSAGGELLDLLGDINLTGAPAAAPAPASVP QISQPPFLLDGLSSQPLFNDIAAGIPSITAYSKNGLKIEFTFERSNTNPSVTVITIQA SNSTELDMTDFVFQAAVPKTFQLQLLSPSSSIVPAFNTGTITQVIKVLNPQKQQLRMR IKLTYNHKGSAMQDLAEVNNFPPQSWQ" BASE COUNT 772 a 581 c 593 g 688 t ORIGIN 1 ggtttcattc gaggtttcgg gccgaggatg ccagccccca tcagattgcg ggagctgatc 61 cggaccatcc ggacagcccg aacccaagct gaagaacgag aaatgatcca gaaagaatgt 121 gctgcaatcc ggtcatcttt tagagaagaa gacaatacat accgatgtcg gaatgtggca 181 aaattactgt atatgcacat gctgggctac cctgctcact ttggacagtt ggagtgcctc 241 aagcttattg cctctcaaaa atttacagac aaacgcattg gctatttagg ggcaatgctg 301 ctgttagatg aaagacaaga tgtccatctt ctcatgacca actgtatcaa gaatgatctt 361 aatcatagca cgcaattcgt acaggggtta gcactttgta ccctcggctg catgggctcc 421 tcagagatgt gcagagatct tgcaggagag gtagagaagc tcctgaaaac ctccaactct 481 tacttaagaa aaaaggcagc actgtgtgct gttcatgtca tcaggaaagt tcctgaactt 541 atggagatgt ttttaccagc aacaaaaaat ttattgaatg agaagaacca tggtgtcctc 601 cacacatctg tagtcctcct cacagaaatg tgtgagcgaa gcccagacat gcttgcgcat 661 ttcagaaaga atgaaaagct tgtgccccaa ttagttcgta ttttaaagaa cctcatcatg 721 tccggatatt caccagaaca tgatgtttct ggtatcagtg accccttttt gcaggtacga 781 attttgcggt tattaagaat tttaggacga aatgatgatg attcaagtga agctatgaat 841 gatatattag cacaggttgc cactaatact gagactagta aaaatgtagg aaatgctatt 901 ctttatgaaa cggttttgac tatcatggat attaagtcag agagtggatt gcgagtccta 961 gccataaata tcctgggtcg tttcttattg aacaatgaca agaatattag atatgtggct 1021 ctgacatctt tgttgaagac tgtacagaca gatcataatg cagtacagag gcacagaagc 1081 acaattgtgg actgtcttaa agatttggat gtctcaataa aacggcgtgc aatggaattg 1141 agttttgccc tggtaaatgg gaataatatc cgaggcatga tgaaagaatt actttatttt 1201 ctggattcgt gtgagccaga atttaaagca gactgtgcat ctggaatctt tcttgctgca 1261 gaaaagtatg caccttccaa acgatggcat atagacacaa ttatgcgtgt tttgacaacg 1321 gcaggaagtt atgttcgtga tgatgcagtc cccaatttaa tccagttaat aactaatagt 1381 gtggagatgc atgcctatac tgtccagcgc ctgtacaaag caattcttgg tgattattct 1441 caacaacctt tggtacaagt ggctgcatgg tgtataggtg aatatggtga tcttcttgta 1501 tctggccagt gtgaagagga agagcctatt caggtaacag aggatgaagt gttggatatt 1561 ttagaaagtg tcctaatctc taatatgtcc acctctgtga cacgaggtta tgccctcact 1621 gccattatga agctttccac tcgattcact tgtactgtaa accgaattaa gaaagtggtt 1681 tccatctacg gaagcagcat tgatgtggaa ctccagcaga gggcagtaga atataatgca 1741 cttttcaaga aatatgacca catgaggtct gccctacttg agagaatgcc tgtcatggaa 1801 aaagtgacca caaatggccc tactgagatt gtgcagacaa atggagagac agaaccagct 1861 ccactagaga ccaaaccgcc accctctggg ccacagccca ccagccaggc caatgattta 1921 ttggatttgt tgggaggaaa tgacataaca cctgttattc caactgcgcc tacaagcaaa 1981 ccatcttctg ctggtggaga acttcttgat ttgctgggag acatcaacct tacaggtgct 2041 ccagctgctg ctcctgcccc tgcctcagtc ccacagatat cccagccccc cttcttgttg 2101 gatgggcttt catcacagcc tctcttcaat gatattgctg caggcatccc ctccatcaca 2161 gcatacagta agaatggctt gaagatagaa ttcacctttg aacggtcaaa taccaacccc 2221 agtgtaacag tgataacgat acaggcctcc aacagcacag agctagatat gacggacttt 2281 gttttccaag ctgcagtacc aaagacattc cagctgcagc tcttgtctcc tagcagcagc 2341 attgtcccag catttaacac ggggaccatc acacaagtca ttaaagttct gaaccctcag 2401 aagcaacagc tgcgaatgcg gatcaagctt acatataatc acaagggctc agcaatgcaa 2461 gatctagcag aggtgaacaa ctttccccct cagtcctggc aatgagggtt tggcaccatt 2521 ctcattcttt atcccactca atcaaaggaa ctctgggaag gaggttgtga ttgctggcaa 2581 gtccccccca actgtaccac gggcatgagg agctgaagag aactgctgag gggt // LOCUS HSAF000148 7334 bp mRNA PRI 21-OCT-1997 DEFINITION Homo sapiens ATP-binding cassette transporter (ABCR) mRNA, complete cds. ACCESSION AF000148 NID g2547311 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7334) AUTHORS Azarian,S.M. and Travis,G.H. TITLE The photoreceptor rim protein is an ABC transporter encoded by the gene for recessive Stargardt's disease (ABCR) JOURNAL FEBS Lett. 409 (2), 247-252 (1997) MEDLINE 97345663 REFERENCE 2 (bases 1 to 7334) AUTHORS Azarian,S.M. and Travis,G.H. TITLE Direct Submission JOURNAL Submitted (16-APR-1997) Psychiatry, UT Southwestern Medical Center, 5323 Harry Hines Boulevard, Dallas, TX 75235-9111, USA FEATURES Location/Qualifiers source 1..7334 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1p13-p21" gene <1..>7334 /gene="ABCR" CDS 99..6920 /gene="ABCR" /note="rim protein; RmP" /codon_start=1 /product="ATP-binding cassette transporter" /db_xref="PID:g2547312" /translation="MGFVRQIQLLLWKNWTLRKRQKIRFVVELVWPLSLFLVLIWLRN ANPLYSHHECHFPNKAMPSAGMLPWLQGIFCNVNNPCFQSPTPGESPGIVSNYNNSIL ARVYRDFQELLMNAPESQHLGRIWTELHILSQFMDTLRTHPERIAGRGIRIRDILKDE ETLTLFLIKNIGLSDSVVYLLINSQVRPEQFAHGVPDLALKDIACSEALLERFIIFSQ RRGAKTVRYALCSLSQGTLQWIEDTLYANVDFFKLFRVLPTLLDSRSQGINLRSWGGI LSDMSPRIQEFIHRPSMQDLLWVTRPLMQNGGPETFTKLMGILSDLLCGYPEGGGSRV LSFNWYEDNNYKAFLGIDSTRKDPIYSYDRRTTSFCNALIQSLESNPLTKIAWRAAKP LLMGKILYTPDSPAARRILKNANSTFEELEHVRKLVKAWEEVGPQIWYFFDNSTQMNM IRDTLGNPTVKDFLNRQLGEEGITAEAILNFLYKGPRESQADDMANFDWRDIFNITDR TLRLVNQYLECLVLDKFESYNDETQLTQRALSLLEENMFWAGVVFPDMYPWTSSLPPH VKYKIRMDIDVVEKTNKIKDRYWDSGPRADPVEDFRYIWGGFAYLQDMVEQGITRSQV QAEAPVGIYLQQMPYPCFVDDSFMIILNRCFPIFMVLAWIYSVSMTVKSIVLEKELRL KETLKNQGVSNAVIWCTWFLDSFSIMSMSIFLLTIFIMHVRILHYSDPFILFLFLLAF STATIMLCFLLSTFFSKASLAAACSGVIYFTLYLPHILCFAWQDRMTAELKKAVSLLS PVAFGFGTEYLVRFEEQGLGLQWSNIGNSPTEGDEFSFLLSMQMMLLDAAVYGLLAWY LDQVFPGDYGTPLPWYFLLQESYWLGGEGCSTREERALEKTEPLTEETEDPEHPEGIH DSFFEREHPGWVPGVCVKNLVKIFEPSGRPAVDRLNITFYENQITAFLGHNGAGKTTT LSILTGLLPPTSGVLLVGGRDIETSLDAVRQSLGMCPQHNILFHHLTVAEHMLFYAQL KGKSQEEAQLEMEAMLEDTGLHHKRNEEAQDLSGGMQRKLSVAIAFVGDAKVVILDEP TSGVDPYSRRSIWDLLLKYRSGRTIIMSTHHMDEADLLGDRIAIIAQGRLYCSGTPLF LKNCFGTGLYLTLVRKMKNIQSQRKGSEGTCSCSSKGFSTTCPAHVDDLTPEQVLDGD VNELMDVVLHHVPEAKLVECIGQELIFLLPNKNFKHRAYASLFRELEETLADLGLSSF GISDTPLEEIFLKVTEDSDSGPLFAGGAQQKRENVNPRHPCLGPREKAGQTPQDSNVC SPGAPAAHPEGQPPPEPECPGPQLNTGTQLVLQHVQALLVKRFQHTIRSHKDFLAQIV LPATFVFLALMLSIVIPPFGEYPALTLHPWIYGQQYTFFSMDEPGSEQFTVLADVLLN KPGFGNRCLKEGWLPEYPCGNSTPWKTPSVSPNITQLFQKQKWTQVNPSPSCRCSTRE KLTMLPECPEGAGGLPPPQRTQRSTEILQDLTDRNISDFLVKTYPALIRSSLKSKFWV NEQRYGGISIGGKLPVVPITGEALVGFLSDLGRIMNVSGGPITREASKEIPDFLKHLE TEDNIKVWFNNKGWHALVSFLNVAHNAILRASLPKDRSPEEYGITVISQPLNLTKEQL SEITVLTTSVDAVVAICVIFSMSFVPASFVLYLIQERVNKSKHLQFISGVSPTTYWVT NFLWDIVNYSVSAGLVVGIFIGFQKKAYTSPENLPALVALLLLYGWAVIPMMYPASFL FDVPSTAYVALSCANLFIGINSSAITFILELFENNRTLLRFNAVLRKLLIVFPHFCLG RGLIDLALSQAVTDVYARFGEEHSANPFHWDLIGKNLFAMVVEGVVYFLLTLLVQRHF FLSQWIAEPTKEPIVDEDDDVAEERHRIITGGNKTDILRLHELTKIYLGTSSPAVDRL CVGVRPGECFGLLGVNGAGKTTTFKMLTGDNTVTSGDATVAGKSILTNISEVHQNMGY CPQFDAIDELLTGREHLYLYARLRGVPAEEIEKVANWSIKSLGLTVYADCLAGTYSGG NKRKLSTAIALIGCPPLVLLDEPTTGMDPQARRMLWNVIVSIIREGRAVVLTSHSMEE CEALCTRLAIMVKGAFRCMGTIQHLKSKFGDGYIVTMKIKSPKDDLLPDLNPVEQFFQ GNFPGSVQRERHYNMLQFQVSSSSLARIFQLLLSHKDSLLIEEYSVTQTTLDQVFVNF AKQQTESHDLPLHPRAAGASRQAQD" BASE COUNT 1797 a 1974 c 1857 g 1704 t 2 others ORIGIN 1 ggcactaggg agccagaggc gctcttaacg gcgtttatgt cctttgctgt ctgaggggcc 61 tcagctctga ccaatctggt cttcgtgtgg tcattagcat gggcttcgtg agacagatac 121 agcttttgct ctggaagaac tggaccctgc ggaaaaggca aaagattcgc tttgtggtgg 181 aactcgtstg gcctttatct ttatttctgg tcttgatctg gttaaggaat gccaacccgc 241 tctacagcca tcatgaatgc catttcccca acaaggcgat gccctcagca ggaatgctgc 301 cgtggctcca ggggatcttc tgcaatgtga acaatccytg ttttcaaagc cccaccccag 361 gagaatctcc tggaattgtg tcaaactata acaactccat cttggcaagg gtatatcgag 421 attttcaaga actcctcatg aatgcaccag agagccagca ccttggccgt atttggacag 481 agctacacat cttgtcccaa ttcatggaca ccctccggac tcacccggag agaattgcag 541 gaagaggaat acgaataagg gatatcttga aagatgaaga aacactgaca ctatttctca 601 ttaaaaacat cggcctgtct gactcagtgg tctaccttct gatcaactct caagtccgtc 661 cagagcagtt cgctcatgga gtcccggacc tggcgctgaa ggacatcgcc tgcagcgagg 721 ccctcctgga gcgcttcatc atcttcagcc agagacgcgg ggcaaagacg gtgcgctatg 781 ccctgtgctc cctctcccag ggcaccctac agtggataga agacactctg tatgccaacg 841 tggacttctt caagctcttc cgtgtgcttc ccacactcct agacagccgt tctcaaggta 901 tcaatctgag atcttgggga ggaatattat ctgatatgtc accaagaatt caagagttta 961 tccatcggcc gagtatgcag gacttgctgt gggtgaccag gcccctcatg cagaatggtg 1021 gtccagagac ctttacaaag ctgatgggca tcctgtctga ccttctgtgt ggctaccccg 1081 agggaggtgg ctctcgggtg ctctccttca actggtatga agacaataac tataaggcct 1141 ttctggggat tgactccaca aggaaggatc ctatctattc ttatgacaga agaacaacat 1201 ccttttgtaa tgcattgatc cagagcctgg agtcaaatcc tttaaccaaa atcgcttgga 1261 gggcggcaaa gcctttgctg atgggaaaaa tcctgtacac tcctgattca cctgcagcac 1321 gaaggatact gaagaatgcc aactcaactt ttgaagaact ggaacacgtt aggaagttgg 1381 tcaaagcctg ggaagaagta gggccccaga tctggtactt ctttgacaac agcacacaga 1441 tgaacatgat cagagatacc ctggggaacc caacagtaaa agactttttg aataggcagc 1501 ttggtgaaga aggtattact gctgaagcca tcctaaactt cctctacaag ggccctcggg 1561 aaagccaggc tgacgacatg gccaacttcg actggaggga catatttaac atcactgatc 1621 gcaccctccg cctggtcaat caatacctgg agtgcttggt cctggataag tttgaaagct 1681 acaatgatga aactcagctc acccaacgtg ccctctctct actggaggaa aacatgttct 1741 gggccggagt ggtattccct gacatgtatc cctggaccag ctctctacca ccccacgtga 1801 agtataagat ccgaatggac atagacgtgg tggagaaaac caataagatt aaagacaggt 1861 attgggattc tggtcccaga gctgatcccg tggaagattt ccggtacatc tggggcgggt 1921 ttgcctatct gcaggacatg gttgaacagg ggatcacaag gagccaggtg caggcggagg 1981 ctccagttgg aatctacctc cagcagatgc cctacccctg cttcgtggac gattctttca 2041 tgatcatcct gaaccgctgt ttccctatct tcatggtgct ggcatggatc tactctgtct 2101 ccatgactgt gaagagcatc gtcttggaga aggagttgcg actgaaggag accttgaaaa 2161 atcagggtgt ctccaatgca gtgatttggt gtacctggtt cctggacagc ttctccatca 2221 tgtcgatgag catcttcctc ctgacgatat tcatcatgca tgtaagaatc ctacattaca 2281 gcgacccatt catcctcttc ctgttcttgt tggctttctc cactgccacc atcatgctgt 2341 gctttctgct cagcaccttc ttctccaagg ccagtctggc agcagcctgt agtggtgtca 2401 tctatttcac cctctacctg ccacacatcc tgtgcttcgc ctggcaggac cgcatgaccg 2461 ctgagctgaa gaaggctgtg agcttactgt ctccggtggc atttggattt ggcactgagt 2521 acctggttcg ctttgaagag caaggcctgg ggctgcagtg gagcaacatc gggaacagtc 2581 ccacggaagg ggacgaattc agcttcctgc tgtccatgca gatgatgctc cttgatgctg 2641 ctgtctatgg cttactcgct tggtaccttg atcaggtgtt tccaggagac tatggaaccc 2701 cacttccttg gtactttctt ctacaagagt cgtattggct tggcggtgaa gggtgttcaa 2761 ccagagaaga aagagccctg gaaaagaccg agcccctaac agaggaaacg gaggatccag 2821 agcacccaga aggaatacac gactccttct ttgaacgtga gcatccaggg tgggttcctg 2881 gggtatgcgt gaagaatctg gtaaagattt ttgagccctc cggccggcca gctgtggacc 2941 gtctgaacat caccttctac gagaaccaga tcaccgcatt cctgggccac aatggagctg 3001 ggaaaaccac caccttgtcc atcctgacgg gtctgttgcc accaacctct ggggttttgc 3061 tcgttggggg aagggacatt gaaaccagcc tggatgcagt ccggcagagc cttggcatgt 3121 gtccacagca caacatcctg ttccaccacc tcacggtggc tgagcacatg ctgttctatg 3181 cccagctgaa aggaaagtcc caggaggagg cccagctgga gatggaagcc atgttggagg 3241 acacaggcct ccaccacaag cggaatgaag aggctcagga cctatcaggt ggcatgcaga 3301 gaaagctgtc ggttgccatt gcctttgtgg gagatgccaa ggtggtgatt ctggacgaac 3361 ccacctctgg ggtggaccct tactcgagac gctcaatctg ggatctgctc ctgaagtatc 3421 gctcaggcag aaccatcatc atgtccactc accacatgga cgaggccgac ctccttgggg 3481 accgcattgc catcattgcc cagggaaggc tctactgctc aggcacccca ctcttcctga 3541 agaactgctt tggcacaggc ttgtacttaa ccttggtgcg caagatgaaa aacatccaga 3601 gccaaaggaa aggcagtgag gggacctgca gctgctcgtc taagggtttc tccaccacgt 3661 gtccagccca cgtcgatgac ctaactccag aacaagtcct ggatggggat gtaaatgagc 3721 tgatggatgt agttctccac catgttccag aggcaaagct ggtggagtgc attggtcaag 3781 aacttatctt ccttcttcca aataagaact tcaagcacag agcatatgcc agccttttca 3841 gagagctgga ggagacgctg gctgaccttg gtctcagcag ttttggaatt tctgacactc 3901 ccctggaaga gatttttctg aaggtcacgg aggattctga ttcaggacct ctgtttgcgg 3961 gtggcgctca gcagaaaaga gaaaacgtca acccccgaca cccctgcttg ggtcccagag 4021 agaaggctgg acagacaccc caggactcca atgtctgctc cccaggggcg ccggctgctc 4081 acccagaggg ccagcctccc ccagagccag agtgcccagg cccgcagctc aacacgggga 4141 cacagctggt cctccagcat gtgcaggcgc tgctggtcaa gagattccaa cacaccatcc 4201 gcagccacaa ggacttcctg gcgcagatcg tgctcccggc tacctttgtg tttttggctc 4261 tgatgctttc tattgttatc cctccttttg gcgaataccc cgctttgacc cttcacccct 4321 ggatatatgg gcagcagtac accttcttca gcatggatga accaggcagt gagcagttca 4381 cggtacttgc agacgtcctc ctgaataagc caggctttgg caaccgctgc ctgaaggaag 4441 ggtggcttcc ggagtacccc tgtggcaact caacaccctg gaagactcct tctgtgtccc 4501 caaacatcac ccagctgttc cagaagcaga aatggacaca ggtcaaccct tcaccatcct 4561 gcaggtgcag caccagggag aagctcacca tgctgccaga gtgccccgag ggtgccgggg 4621 gcctcccgcc cccccagaga acacagcgca gcacggaaat tctacaagac ctgacggaca 4681 ggaacatctc cgacttcttg gtaaaaacgt atcctgctct tataagaagc agcttaaaga 4741 gcaaattctg ggtcaatgaa cagaggtatg gaggaatttc cattggagga aagctcccag 4801 tcgtccccat cacgggggaa gcacttgttg ggtttttaag cgaccttggc cggatcatga 4861 atgtgagcgg gggccctatc actagagagg cctctaaaga aatacctgat ttccttaaac 4921 atctagaaac tgaagacaac attaaggtgt ggtttaataa caaaggctgg catgccctgg 4981 tcagctttct caatgtggcc cacaacgcca tcttacgggc cagcctgcct aaggacagga 5041 gccccgagga gtatggaatc accgtcatta gccaacccct gaacctgacc aaggagcagc 5101 tctcagagat tacagtgctg accacttcag tggatgctgt ggttgccatc tgtgtgattt 5161 tctccatgtc cttcgtccca gccagctttg tcctttattt gatccaggag cgggtgaaca 5221 aatccaagca cctccagttt atcagtggag tgagccccac cacctactgg gtgaccaact 5281 tcctctggga catcgtgaat tattccgtga gtgctgggct ggtggtgggc atcttcatcg 5341 ggtttcagaa gaaagcctac acttctccag aaaaccttcc tgcccttgtg gcactgctcc 5401 tgctgtatgg atgggcggtc attcccatga tgtacccagc atccttcctg tttgatgtcc 5461 ccagcacagc ctatgtggct ttatcttgtg ctaatctgtt catcggcatc aacagcagtg 5521 ctattacctt catcttggaa ttatttgaga ataaccggac gctgctcagg ttcaacgccg 5581 tgctgaggaa gctgctcatt gtcttccccc acttctgcct gggccggggc ctcattgacc 5641 ttgcactgag ccaggctgtg acagatgtct atgcccggtt tggtgaggag cactctgcaa 5701 atccgttcca ctgggacctg attgggaaga acctgtttgc catggtggtg gaaggggtgg 5761 tgtacttcct cctgaccctg ctggtccagc gccacttctt cctctcccaa tggattgccg 5821 agcccactaa ggagcccatt gttgatgaag atgatgatgt ggctgaagaa agacatagaa 5881 ttattactgg tggaaataaa actgacatct taaggctaca tgaactaacc aagatttatc 5941 tgggcacctc cagcccagca gtggacaggc tgtgtgtcgg agttcgccct ggagagtgct 6001 ttggcctcct gggagtgaat ggtgccggca aaacaaccac attcaagatg ctcactgggg 6061 acaacacagt gacctcaggg gatgccaccg tagcaggcaa gagtatttta accaatattt 6121 ctgaagtcca tcaaaatatg ggctactgtc ctcagtttga tgcaatcgat gagctgctca 6181 caggacgaga acatctttac ctttatgccc ggcttcgagg tgtaccagca gaagaaatcg 6241 aaaaggttgc aaactggagt attaagagcc tgggcctgac tgtctacgcc gactgcctgg 6301 ctggcacgta cagtgggggc aacaagcgga aactctccac agccatcgca ctcattggct 6361 gcccaccgct ggtgctgctg gatgagccca ccacagggat ggacccccag gcacgccgca 6421 tgctgtggaa cgtcatcgtg agcatcatca gagaagggag ggctgtggtc ctcacatccc 6481 acagcatgga agaatgtgag gcactgtgta cccggctggc catcatggta aagggcgcct 6541 ttcgatgtat gggcaccatt cagcatctca agtccaaatt tggagatggc tatatcgtca 6601 caatgaagat caaatccccg aaggacgacc tgcttcctga cctgaaccct gtggagcagt 6661 tcttccaggg gaacttccca ggcagtgtgc agagggagag gcactacaac atgctccagt 6721 tccaggtctc ctcctcctcc ctggcgagga tcttccagct cctcctctcc cacaaggaca 6781 gcctgctcat cgaggagtac tcagtcacac agaccacact ggaccaggtg tttgtaaatt 6841 ttgctaaaca gcagactgaa agtcatgacc tccctctgca ccctcgagct gctggagcca 6901 gtcgacaagc ccaggactga tctttcacac cgttcgttcc tgcagccaga aaggaactct 6961 gggcagctgg aggcgcagga gcctgtgccc atatggtcat ccaaatggac tggccagcgt 7021 aaatgacccc actgcagcag aaaacaaaca cacgaggagc atgcagcgaa ttcagaaaga 7081 ggtctttcag aaggaaaccg aaactgactt gctcacctgg aacacctgat ggtgaaacca 7141 aacaaataca aaatccttct ccagacccca gaactagaaa ccccgggcca tcccactagc 7201 agctttggcc tccatattgc tctcatttca agcagatctg cttttctgca tgtttgtctg 7261 tgtgtctgcg ttgtgtgtga ttttcatgga aaaataaaat gcaaatgcac tcatcacaaa 7321 aaaaaaaaaa aaaa // LOCUS HSAF000152 4833 bp mRNA PRI 02-OCT-1997 DEFINITION Homo sapiens OS-4 protein (OS-4) mRNA, complete cds. ACCESSION AF000152 NID g2454301 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4833) AUTHORS Su,Y.A., Trent,J.M., Guan,X.Y. and Meltzer,P.S. TITLE Direct isolation of genes encoded within a homogeneously staining region by chromosome microdissection JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91 (19), 9121-9125 (1994) MEDLINE 94377504 REFERENCE 2 (bases 1 to 4833) AUTHORS Su,Y.A., Lee,M.M., Hutter,C.M. and Meltzer,P.S. TITLE Characterization of a highly conserved gene (OS4) co-amplified with CDK4 in human sarcomas JOURNAL Oncogene 15, 1290-1294 (1997) REFERENCE 3 (bases 1 to 4833) AUTHORS Su,Y.A. and Meltzer,P.S. TITLE Direct Submission JOURNAL Submitted (16-APR-1997) Laboratory of Cancer Genetics, NHGRI/NIH, Room 4A52, 49 Convert Dr., Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..4833 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="12" /map="12q13-q15" /cell_type="osteosarcoma" /cell_line="Osa-Cl" gene 1..4833 /gene="OS-4" CDS 306..1157 /gene="OS-4" /codon_start=1 /product="OS-4 protein" /db_xref="PID:g2454302" /translation="MEHGSIITQARREDALVLTKQGLVSKSSPKKPRGRNIFKALFCC FRAQHVGQSSSSTELAAYKEEANTIAKSDLLQCLQYQFYQIPGTCLLPEVTEEDQGRI CVVIDLDETLVHSSFKPINNADFIVPIEIEGTTHQVYVLKRPYVDEFLRRMGELFECV LFTASLAKYADPVTDLLDRCGVFRARLFRESCVFHQGCYVKDLSRLGRDLRKTLILDN SPASYIFHPENAVPVQSWFDDMADTELLNLIPIFEELSGAEDVYTSLGAAAGPLACPA SKRRPSQ" BASE COUNT 1051 a 1303 c 1240 g 1236 t 3 others ORIGIN 1 gccatttcct cctcttgttt tcactccgga ttctccatgt tggacccaaa ctgaggagcc 61 cggagctgcc gctgggggat cggggccggg ggcacccggg ggagccgctg cccgggccgc 121 ccgccctttg tacaggccgc ctcccttccc ggtccgggga ggaaacgaga ggggggatgt 181 gaacagctgt ggaagtcgga gtctcgggag ccggagcggg cccccgccca ggccccccag 241 cccagcccag cccgcgcgcc cgcccgtcct cccgtccagc cagcccgggc ccgcgggatt 301 gttagatgga acacggctcc atcatcaccc aggcgcggag ggaagacgcc ctggtgctca 361 ccaagcaagg cctggtctcc aagtcctctc ctaagaagcc tcgtggacgt aacatcttca 421 aggccctttt ctgctgtttt cgcgcccagc atgttggcca gtcaagttcc tccactgagc 481 tcgctgcgta taaggaggaa gcaaacacca ttgctaagtc ggatctgctc cagtgtctcc 541 agtaccagtt ctaccagatc ccagggacct gcctgctccc agaggtgaca gaggaagatc 601 aaggaaggat ctgtgtggtc attgacctcg atgaaaccct tgtgcatagc tcctttaagc 661 caatcaacaa tgctgacttc atagtgccta tagagattga ggggaccact caccaggtgt 721 atgtgctcaa gaggccttat gtggatgagt tcctgagacg catgggggaa ctctttgaat 781 gtgttctctt cactgccagc ctggccaagt atgccgaccc tgtgacagac ctgctggacc 841 ggtgtggggt gttccgggcc cgcctattcc gtgagtcttg cgtgttccac cagggctgct 901 acgtcaagga cctcagccgc ctggggaggg acctgagaaa gaccctcatc ctggacaact 961 cgcctgcttc ttacatattc caccccgaga atgcagtgcc tgtgcagtcc tggtttgatg 1021 acatggcaga cactgagttg ctgaacctga tcccaatctt tgaggagctg agcggagcag 1081 aggacgtcta caccagcctt ggggcagctg cgggcccctt agcctgccct gcttccaagc 1141 gacggccatc ccagtagggg actttcccac actgtgcctt tacgatcagc gtgacagagt 1201 agaagctgga gtgcctcacc acacggcccg gaaacagcgg gaagtaactg gaaagagctt 1261 taggacagct tagatgccga gtgggcgaat gccagaccaa tgatacccag agctacctgc 1321 cgccaacttg ttgagatgtg tgtttgactg tgagagagtg tgtgtttgtg tgtgtgtttt 1381 gccatgaact gtggccccag tgtatagtgt ttcagtgggg gagaagctga aagaccaaga 1441 ctcttcccaa gttagcttgt ctcctctcct gtcaccctaa gagccactga gttgtgtagg 1501 gatgaaract attgaagact ccattgccaa accatggcct ttcctcagtg ttgtaaggcc 1561 tatgccaagg ataaaggaag ggtatgcctt tgggtactcc aggcatacac ctttctgaaa 1621 tccttctcca gccagctgct gcagacaaaa gatcacattt ctgggaagat gagaacttgt 1681 ttccagacca gcatccagtg gccatcaggt cttgtggccc aaaggctatg cttgcctccg 1741 gctgagtgcc tgggataggc cttttctatg tctccccaag gctggggtgc tgagcctgcc 1801 ttcctcacca cctagccata gtctcaaacc tgtggggaag gaggttttct ccctgcccgg 1861 gaagaggaca gataactgat ttccgttctt ttgactgtgt tttaaaattc tctttctaaa 1921 cacagagtgt tgggcctggt ttgtttctga caaagttaca gtcctgggcc tgtaatgaat 1981 gtcggcggcg ctggggttgc agggaaaaga caaatcctca aagcgtggac gtgtgtcccc 2041 atggcttgtg gatcagctaa gctcgggatc atttccataa gtctgctttt cagggattct 2101 ctgctggtgc tggtgcaagg acttctgttc caaaggctgg gaaaaactaa gctgtcccag 2161 cccctcccat ttcttgggca gggctctttt cctgttgtgt cttcccccag ggcctgtcct 2221 gtaccgagct ctgtctgttc cagcctacat ccttcctggg tgttgctttt cctcttaagg 2281 gcctcagaac tcttgctctt cctggggtga gggggaatga gtgttcttga catgtgacag 2341 cctaatgcgc atgctttctg cctctggtaa caggagtgag tgagcccctc agacctgcac 2401 tctgggtgtc tcctgcttac aaaggttctt aatagtgaat gctttaaaat taaagtcatc 2461 acgaaatgga agttttccca gggtggaaaa taagaggaag tgctgctgta attgggagca 2521 caaggggcct cccaaaaagg agccccacct cagcatcact gccttaatcg tggcctccct 2581 ggggtgggtg gggttctctc ctccctccct ccctcctcct ggggtgggag ggcgctcctg 2641 ttcccatctc tgtgttccct ggaggcaggt atcacaaagc atttgtgaat tgctttaggt 2701 gcagggacac cacccactca ggactcttcc ccatcatccc ttccattgcc acaccctaga 2761 tccagcctca ggaactaaca agttktgaga aaagcaggtg gtagagcagc agcttcgtgc 2821 tctcagcggt ggctggctgg catttttctc tagcgttgtg gtgccacctt cccttcttgt 2881 cccaaggtta taaggccttg tctttctctt tggaatcata aagtggaaca gagtccccag 2941 aactcatgtg ghcatttccg acagcatcac tccccggtgc ctatggggtc ccggtgtacc 3001 taaagggaga aggaccccat gtgctagcca gaaatatact gtctcttgaa ggaaagcagg 3061 agctcagact cttagagcca gctgtggctt cggacccaag gcctgaccta ggctgctatc 3121 ctaatattgg aggaggggcc tctcttccaa gccccaccct aagggttagc ccttggacaa 3181 atcttgtgcc gtctaggccc agccaggctt ttctgactaa ataagcaata agaggctcta 3241 agctgactga gttgcaagga ccctttccgc cctcccttgg atctccatgt ttctccagat 3301 ggcggaagag catgtgccac cccctttcct aacagacttg tccaagtgct tggcgtggga 3361 cccatgacca aagcccagga tggcttggtg ggagtgtccc tgctgcatct gcatgaagcc 3421 cctgcttttt aggcctcact cccatcagaa ccctgcctgc ccacctgcaa ctccccccca 3481 acaatgccat tcccacttgc cccagagaag ctactcggcc aaacctagcc agggtctgtt 3541 cttgtggacc agagccagcc tagtcattat ttgctgtcgg gtttccagtt tcaccgtgtg 3601 ttagggtgag ggatgattgt aaaatttgct cctcaaagga atcaggccag actcaatttt 3661 gggagggcaa gacagggagg aggccgcttc atcccagact ctcttctagg gcttcccacc 3721 atcagcccct cccacttgag actggtcttt gggaggcaat aggccaccat gcctggtcag 3781 caccaattca agccatgcca ggaatctgcc tacctgccag gttcagttct tttaaggtgc 3841 ctcttcaggg acacagtgtg tctctctgat tgggcttcta aatcaaaagc ctgatgttcg 3901 tgtccctctc atagggggag ctttggacac aggaccagtt tggaaaaggg tcaggtaagg 3961 gtttccactc tgcacattgt agagggaaca ctctgtaggc ccatgggtcc cttactagag 4021 aggttgagtg aatttgcctt cagttaacat gggaccttct gtttagcttc ctcttgcttc 4081 ccaaagattt taagcatttt gtaaatgtat aaactcacct ctggtaacag tggcccagac 4141 gctgctttgt gctaaaagca tgggaaatgt aaaggcagtc tttctctggg aaatggatgc 4201 tattctattc tgctgcccct acctgttcct gaggcctcat ttagaaagaa aatcccctca 4261 gaaggctgtc tggcacccag tgtcctagcc aggccaagta tatgagaaag gtaagtccat 4321 tttccccttc aggtcctcag tggattactt aaccactgct gtccctcggt ccctttttcc 4381 taaacgggtt tagttctgtc ttttttctcc ttttttctaa atgctggtaa atatttacat 4441 tcagccaggg aagaggaggc cagaggtcgg gccagctgcc ccattctttt aacgttgtag 4501 ggcctgccca tggagcggac cctcctcttt gggcctcgtg agcttttttg cttatcatgt 4561 tccatttcgt gccgctttcc cccttcaaga tgccatttgg agggtagggg atctgcttcc 4621 cactgtgact gggctatggg attctgacta ccttgcttac agattcatgg tttgataaat 4681 ttgttgtatt ccaaaacttg aaatgcagga cgccattaag tgtctgttta tatttttgga 4741 atatttgtat tacttacaat taattaataa aagtgggttt aaaaaacctt tccaggaaaa 4801 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa // LOCUS HSAF000177 894 bp mRNA PRI 02-JUL-1997 DEFINITION Homo sapiens Sm-like protein CaSm (CaSm) mRNA, complete cds. ACCESSION AF000177 NID g2232056 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 894) AUTHORS Schweinfest,C.W., Graber,M.W., Chapman,J.M., Papas,T.S., Baron,P.L. and Watson,D.K. TITLE CaSm: an Sm-like protein that contributes to the transformed state in cancer cells JOURNAL Cancer Res. 57 (1997) In press REFERENCE 2 (bases 1 to 894) AUTHORS Schweinfest,C.W., Graber,M.W., Chapman,J.M., Papas,T.S., Baron,P.L. and Watson,D.K. TITLE Direct Submission JOURNAL Submitted (17-APR-1997) Hollings Cancer Center, Medical University of South Carolina, 171 Ashley Avenue, Charleston, SC 29425, USA FEATURES Location/Qualifiers source 1..894 /organism="Homo sapiens" /db_xref="taxon:9606" gene 165..566 /gene="CaSm" CDS 165..566 /gene="CaSm" /note="Sm-like protein; encodes Sm motifs; overexpressed in pancreatic cancer" /codon_start=1 /product="CaSm" /db_xref="PID:g2232057" /translation="MNYMPGTASLIEDIDKKHLVLLRDGRTLIGFLRSIDQFANLVLH QTVERIHVGKKYGDIPRGIFVVRGENVVLLGEIDLEKESDTPLQQVSIEEILEEQRVE QQTKLEAEKLKVQALKDRGLSIPRADTLDEY" BASE COUNT 250 a 185 c 222 g 237 t ORIGIN 1 cttccggcag gccccgccgg cggctgaaag ccggggcaga agtgctggtc tcggtcggga 61 ttccgggctt ggtcccaccg aggcggcgac tgcggtagga gggaactggt tttggacgcg 121 ctggcgtccc gccgctgtgc attgcagcat tatttcagtt caaaatgaac tatatgcctg 181 gcaccgccag cctcatcgag gacattgaca aaaagcactt ggttctgctt cgagatggaa 241 ggacacttat aggcttttta agaagcattg atcaatttgc aaacttagtg ctacatcaga 301 ctgtggagcg tattcatgtg ggcaaaaaat acggtgatat tcctcgaggg atttttgtgg 361 tcagaggaga aaatgtggtc ctactaggag aaatagactt ggaaaaggag agtgacacac 421 ccctccagca agtatccatt gaagaaattc tagaagaaca aagggtggaa cagcagacca 481 agctggaagc agagaagttg aaagtgcagg ccctgaagga ccgaggtctt tccattcctc 541 gagcagatac tcttgatgag tactaatctt ttgcccagag gctgttggct cttgaagagt 601 aggggctgtc actgagtgaa agtgacatcc tggccacctc acgcatttga tcacagactg 661 tagagttttg aaaagtcact tttattttta attattttac atatgcaaca tgaagaaatc 721 gtgtaggtgg gttttttttt taaataacaa aatcactgtt taaagaaaca gtggcataga 781 ctccttcaca catcactgtg gcaccagcaa ctacttcttt atattgttct tcatatccca 841 aattagagtt tacagggaca gtcttcattt acttgtaaat aaaatatgaa tctc // LOCUS HSAF000381 1180 bp mRNA PRI 28-OCT-1997 DEFINITION Homo sapiens non-functional folate binding protein mRNA, complete cds. ACCESSION AF000381 NID g2565195 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1180) AUTHORS Verma,R.S. and Elwood,P.C. TITLE Identification and charcterization of homologous cDNA to KB folate receptor from human salivary gland JOURNAL Unpublished REFERENCE 2 (bases 1 to 1180) AUTHORS Verma,R.S. and Elwood,P.C. TITLE Direct Submission JOURNAL Submitted (18-APR-1997) Medicine Branch, National Cancer Institute/NIH, 9000 Rockville Pike, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..1180 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="salivary gland" 5'UTR <1..23 CDS 24..788 /codon_start=1 /evidence=experimental /product="non-functional folate binding protein" /db_xref="PID:g2565196" /translation="MASVPKTNKIEPRSYSIIPSCSIRRLGPALNTPIFQSKRNGPRG HSAYSIEGRQRQGAGRAVVPRADRPPAPKIQLRAFYLQQLYYTLLELELPRLLAPDLP SNGSSLKDLKWTHSNYRASKESCIVIFVTTSPGREWVICAPAAFLGCGSLQAPSPESE PSFPVTRGHHGRHGDYHRKLIGQTFEWVVVRRHGGRAIGPRLSRVTKAAGARPPAGAG EGLRVGFDLINAPIPPAKGVSARRHVLALELPQLSK" 3'UTR 789..1180 BASE COUNT 256 a 354 c 322 g 248 t ORIGIN 1 ggcccccggc gtccctctta atcatggcct cagttccgaa aaccaacaaa atagaaccgc 61 ggtcctattc cattattcct agctgcagta tcaggcggct cgggcctgct ttgaacactc 121 caatttttca aagtaaacgc aacgggcccc gcggacactc agcttacagc atcgaggggc 181 gccagaggca aggggcggga cgggcggtgg tccctcgcgc ggaccgcccg cccgctccca 241 agatccaact acgagctttt tacctgcagc aactttacta tacgctattg gagctggaat 301 taccgcggct gctggcacca gacttgccct ccaatggctc ctcgttaaag gatttaaagt 361 ggactcattc caattacagg gcctcgaaag agtcctgtat tgttattttc gtcactacct 421 ccccgggtcg ggagtgggta atttgcgcgc ctgctgcctt ccttggatgt ggtagcctcc 481 aggctccctc tccggaatct gaaccctcat tccccgtcac ccgtggtcac catggtcggc 541 acggcgacta ccatcgaaag ttgatagggc agacgttcga atgggtcgtc gtccgccgcc 601 acggggggcg tgcgatcggc ccgaggttat ctagagtcac caaagccgcc ggcgcccgcc 661 ccccggccgg ggccggagag gggctgaggg ttggttttga tctgataaat gcaccgatcc 721 cccccgcgaa gggggtcagc gcccgtcggc atgtattagc tctagaatta ccacagttat 781 ccaagtagga gaggagcgag cgaccaaagg aaccataact gatttaatga gccattcgca 841 gtttcactgt accggccgtg cgtacttaga catgcatggc ttaatctttg agacaagcat 901 atgctactgg caggatcaac caggacacct gcctctacga gtgctccccc aacttggggc 961 cctggatcca gcaggtggat cagagctggc gcaaagagcg ggtactgaac gtgcccctgt 1021 gcaaagagga ctgtgagcaa tggtgggaag attgtcgcac ctcctacacc tgcaagagca 1081 actggcacaa gggctgcaac tggacttcag ggtttaacaa gtgcgcagtg ggagctgcct 1141 gccaaccttt ccatttctac ttccccacac ccattgcccg // LOCUS HSAF000545 1020 bp DNA PRI 18-MAY-1997 DEFINITION Homo sapiens putative purinergic receptor P2Y10 gene, complete cds. ACCESSION AF000545 NID g2104786 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1020) AUTHORS Bohm,S.K. TITLE Putative purinergic receptor related to P2Y5 and P2Y9 is localized on the X chromosome JOURNAL Unpublished REFERENCE 2 (bases 1 to 1020) AUTHORS Bohm,S.K. TITLE Direct Submission JOURNAL Submitted (20-APR-1997) Dept. of Surgery, University of California, San Francisco, 521 Parnassus Ave., San Francisco, CA 94143, USA FEATURES Location/Qualifiers source 1..1020 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /clone="J333E231" CDS 1..1020 /note="G-protein coupled receptor" /codon_start=1 /product="putative purinergic receptor P2Y10" /db_xref="PID:g2104787" /translation="MANLDKYTETFKMGSNSTSTAEIYCNVTNVKFQYSLYATTYILI FIPGLLANSAALWVLCRFISKKNKAIIFMINLSVADLAHVLSLPLRIYYYISHHWPFQ RALCLLCFYLKYLNMYASICFLTCISLQRCFFLLKPFRARDWKRRYDVGISAAIWIVV GTACLPFPILRSTDLNNNKSCFADLGYKQMNAVALVGMITVAELAGFVIPVIIIAWCT WKTTISLRQPPMAFQGISERQKALRMVFMCAAVFFICFTPYHINFIFYTMVKETIISS CPVVRIALYFHPFCLCLASLCCLLDPILYYFMASEFRDQLSRHGSSVTRSRLMSKESG SSMIG" BASE COUNT 240 a 259 c 210 g 311 t ORIGIN 1 atggctaacc ttgacaaata cactgaaaca ttcaagatgg gtagcaacag taccagcact 61 gctgagattt actgtaatgt cactaatgtg aaatttcaat actccctcta tgcaaccacc 121 tatatcctca tattcattcc tggtcttctg gctaacagtg cagccttgtg ggttctgtgc 181 cgcttcatca gcaagaaaaa taaagccatc attttcatga tcaacctctc tgtggctgac 241 cttgctcatg tattatcttt acccctccgg atttactatt acatcagcca ccactggcct 301 ttccagagag ccctttgcct gctctgcttc tacctgaagt atctcaacat gtatgccagc 361 atttgtttcc tgacgtgcat cagtcttcaa aggtgctttt ttctcctcaa gcccttcagg 421 gccagagact ggaagcgtag gtacgatgtg ggcatcagtg ctgccatctg gatcgttgtg 481 gggactgcct gtttgccatt tcccatcctg agaagcacag acttaaacaa caacaagtcc 541 tgctttgctg atcttggata caagcaaatg aatgcagttg cgttggtcgg gatgattaca 601 gttgctgagc ttgcaggatt tgtgatccca gtgatcatca tcgcatggtg tacctggaaa 661 actactatat ccttgagaca gccaccaatg gctttccaag ggatcagtga gaggcagaaa 721 gcactgcgga tggtgttcat gtgtgctgca gtcttcttca tctgcttcac tccctatcat 781 attaacttta ttttttacac catggtaaag gaaaccatca ttagcagttg tcccgttgtc 841 cgaatcgcac tgtatttcca ccctttttgc ctgtgccttg caagtctctg ctgccttttg 901 gatccaattc tttattactt tatggcttca gagtttcgtg accaactatc ccgccatggc 961 agttctgtga cccgctcccg cctcatgagc aaggagagtg gttcatcaat gattggctaa // LOCUS HSAF000546 1118 bp mRNA PRI 02-JUL-1997 DEFINITION Homo sapiens purinergic receptor P2Y5 mRNA, complete cds. ACCESSION AF000546 NID g2232068 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1118) AUTHORS Bohm,S.K., Trumpp,A., Khitin,L.M., Kong,W., Payan,D.G. and Bunnett,N.W. TITLE The human purinergic receptor P2Y5 is encoded in intron 17 of the retinoblastoma gene JOURNAL Unpublished REFERENCE 2 (bases 1 to 1118) AUTHORS Bohm,S.K. TITLE Direct Submission JOURNAL Submitted (21-APR-1997) Dept. of Surgery, University of California, San Francisco, 521 Parnassus Ave., San Francisco, CA 94143-0660, USA FEATURES Location/Qualifiers source 1..1118 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="13" /map="13q14" /cell_type="white blood cell" CDS 45..1079 /note="G-protein coupled receptor" /codon_start=1 /product="purinergic receptor P2Y5" /db_xref="PID:g2232069" /translation="MVSVNSSHCFYNDSFKYTLYGCMFSMVFVLGLVSNCVAIYIFIC VLKVRNETTTYMINLAMSDLLFVFTLPFRIFYFTTRNWPFGDLLCKISVMLFYTNMYG SILFLTCISVDRFLAIVYPFKSKTLRTKRNAKIVCTGVWLTVIGGSAPAVFVQSTHSQ GNNASEACFENFPEATWKTYLSRIVIFIEIVGFFIPLILNVTCSSMVLKTLTKPVTLS RSKINKTKVLKMIFVHLIIFCFCFVPYNINLILYSLVRTQTFVNCSVVAAVRTMYPIT LCIAVSNCCFDPIVYYFTSDTIQNSIKMKNWSVRRSDFRFSEVHGAENFIQHNLQTLK SKIFDNESAA" BASE COUNT 333 a 208 c 198 g 379 t ORIGIN 1 ctgatgaaag tgcttccaaa ctgaaaattg gacgtgcctt tacgatggta agcgttaaca 61 gctcccactg cttctataat gactccttta agtacacttt gtatgggtgc atgttcagca 121 tggtgtttgt gcttgggtta gtatccaatt gtgttgccat atacattttc atctgcgtcc 181 tcaaagtccg aaatgaaact acaacttaca tgattaactt ggcaatgtca gacttgcttt 241 ttgtttttac tttacccttc aggatttttt acttcacaac acggaattgg ccatttggag 301 atttactttg taagatttct gtgatgctgt tttataccaa catgtacgga agcattctgt 361 tcttaacctg tattagtgta gatcgatttc tggcaattgt ctacccattt aagtcaaaga 421 ctctaagaac caaaagaaat gcaaagattg tttgcactgg cgtgtggtta actgtgatcg 481 gaggaagtgc acccgccgtt tttgttcagt ctacccactc tcagggtaac aatgcctcag 541 aagcctgctt tgaaaatttt ccagaagcca catggaaaac atatctctca aggattgtaa 601 ttttcatcga aatagtggga ttttttattc ctctaatttt aaatgtaact tgttctagta 661 tggtgctaaa aactttaacc aaaccagtta cattaagtag aagcaaaata aacaaaacta 721 aggttttaaa aatgattttt gtacatttga tcatattctg tttctgtttt gttccttaca 781 atatcaatct tattttatat tctcttgtga gaacacaaac atttgttaat tgctcagtag 841 tggcagcagt aaggacaatg tacccaatca ctctctgtat tgctgtttcc aactgttgtt 901 ttgaccctat agtttactac tttacatcgg acacaattca gaattcaata aaaatgaaaa 961 actggtctgt caggagaagt gacttcagat tctctgaagt tcatggtgca gagaatttta 1021 ttcagcataa cctacagacc ttaaaaagta agatatttga caatgaatct gctgcctgaa 1081 ataaaaccat taggactcac tgggacagaa ctttcaag // LOCUS HSAF000974 1755 bp mRNA PRI 02-JUL-1997 DEFINITION Human zyxin related protein ZRP-1 mRNA, complete cds. ACCESSION AF000974 NID g2232135 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1755) AUTHORS Murthy,K.K., Shen,S.-H. and Banville,D. TITLE ZRP-1, a zyxin-related protein identified by interaction trap using the second PDZ domain of the cytosolic protein tyrosine phosphatase hPTP1E as bait JOURNAL Unpublished REFERENCE 2 (bases 1 to 1755) AUTHORS Banville,D. TITLE Direct Submission JOURNAL Submitted (22-APR-1997) Biotechnology Research Institute, 6100 Royalmount, Montreal, Quebec H4P 2R2, Canada FEATURES Location/Qualifiers source 1..1755 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 160..1590 /note="zyxin related protein; Lim domains containing protein" /codon_start=1 /product="ZRP-1" /db_xref="PID:g2232136" /translation="MSGPTWLPPKQPEPARAPQGRAIPRGTPGPPPAHGAALQPHPRV NFCPLPSEQCYQAPGGPEDRGPAWVGSHGVLQHTQGLPADRGGLRPGSLDAEIDLLST TLAKLNGGRGHASRRPDRQAYEPPPPPAYRTGSLKPNPASPLPASPYGGPTPASYTTA STPAGPAFPVQVKVAQPVRGCGPPRRGASQASGPLPGPHFPLPGRGEVWGPGYRSQRE PGPGAKEEAAGVSGPAGRGRGGEHGPQVPLSQPPEDELDRLTKKLVHDMNHPPSGEYF GQCGGCGEDVVGDGAGVVALDRVFHVGCFVCSTCRAQLRGQHFYAVERRAYCEGCYVA TLEKCATCSQPILDRILRAMGKAYHPGCFTCVVCHRGLDGIPFTVDATSQIHCIEDFH RKFAPRCSVCGGAIMPEPGQEETVRIVALDRSFHIGCYKCEECGLLLSSEGECQGCYP LDGHILCKACSAWRIQELSATVTTDC" BASE COUNT 347 a 539 c 547 g 322 t ORIGIN 1 cgcccgggca ggtcccaaaa ttagggggga agaggaaaaa aaaaagccag aaaaagtttt 61 cttttctgga gtcccaaacg aggtgcggga cggaagaggg ggtgaaggcc agaggctcgg 121 ggcttcaaga ccgctgtctg gagtccccct ttccaggcca tgtcggggcc cacctggctg 181 cccccgaagc agccggagcc cgccagagcc cctcagggga gggcgatccc ccgcggcacc 241 ccggggccac caccggccca cggagcagca ctccagcccc accccagggt caatttttgc 301 ccccttccat ctgagcagtg ttaccaggcc ccagggggac cggaggatcg ggggccggcg 361 tgggtggggt cccatggagt actccagcac acgcaggggc tccctgcaga cagggggggc 421 cttcgccctg gaagcctgga cgccgagata gacttgctga gcaccacgct ggccaaactg 481 aatgggggtc ggggtcatgc gtcacggcga ccagaccgac aggcatatga gcccccgcca 541 cctcctgcct accgcacggg ctccctgaag ccaaatccag cctcgccgct cccagcgtct 601 ccctatgggg gccccactcc agcctcttac actaccgcca gcaccccggc tggcccagcc 661 ttccccgtgc aagtgaaggt ggcacagcca gtgaggggct gcggcccacc caggcgggga 721 gcctctcagg cttctgggcc cctcccgggc ccccactttc ctctcccagg ccgaggtgaa 781 gtctgggggc ctggctatag gagccagaga gagccagggc caggggccaa agaggaagct 841 gctggggtct ctggccctgc aggaagagga agaggaggcg agcacgggcc ccaggtgccc 901 ctgagccagc ctccagagga tgagctggat aggctgacga agaagctggt tcacgacatg 961 aaccacccgc ccagcgggga gtactttggc cagtgtggtg gctgcggaga agatgtggtt 1021 ggggatgggg ctggggttgt ggcccttgat cgcgtctttc acgtgggctg ctttgtatgt 1081 tctacatgcc gggcccagct tcgcggccag catttctacg ccgtggagag gagggcatat 1141 tgcgagggct gctacgtggc caccctggag aaatgtgcca cgtgctccca gcccatcctg 1201 gaccggatcc tgcgggctat ggggaaggcc taccaccctg gctgcttcac ctgcgtggtg 1261 tgtcaccgcg gcctcgacgg catccccttc acagtggatg ctacgagcca gatccactgt 1321 attgaggact ttcacaggaa gtttgcccca agatgctcag tgtgcggtgg ggccataatg 1381 cctgagccag gtcaggagga gactgtgaga attgttgctc tggatcgaag ttttcacatt 1441 ggctgttaca agtgcgagga gtgtgggctg ctgctctcct ctgagggcga gtgtcagggc 1501 tgctacccgc tggatgggca catcttgtgc aaggcctgca gcgcctggcg catccaggag 1561 ctctcagcca ccgtcaccac tgactgctga gtcttcctag aagtacctgc tgggttctca 1621 gttccagttc ccatcctttg attgatcact ctccctgaca tccacctgta tgactttgtc 1681 accaaatgct gtcttctctt tctccaatca agaaataata atccctcgag tttacaaaaa 1741 aaaaaaaaaa aaaaa // LOCUS HSAF000979 552 bp mRNA PRI 01-NOV-1997 DEFINITION Homo sapiens testis-specific Basic Protein Y 1 (BPY1) mRNA, complete cds. ACCESSION AF000979 NID g2580543 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 552) AUTHORS Lahn,B.T. and Page,D.C. TITLE Functional coherence of the human Y chromosome JOURNAL Science 278 (5338), 675-680 (1997) MEDLINE 98022381 REFERENCE 2 (bases 1 to 552) AUTHORS Lahn,B.T. and Page,D.C. TITLE Direct Submission JOURNAL Submitted (23-APR-1997) Whitehead Institute, 9 Cambridge Center, Cambridge, MA 02142, USA FEATURES Location/Qualifiers source 1..552 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="Y" gene 1..552 /gene="BPY1" CDS 73..450 /gene="BPY1" /note="small basic protein" /codon_start=1 /product="testis-specific Basic Protein Y 1" /db_xref="PID:g2580544" /translation="MSPKPRASGPPAKAKETGKRKSSSQPSPSGPKKKTTKVAEKGEA VRGGRRGKKGAATKMAAVTAPEAESGPAAPGPSDQPSQELPQHELPPEEPVSEGTQHD PLSQESELEEPLSKGRPSTPLSP" BASE COUNT 147 a 154 c 185 g 66 t ORIGIN 1 gagaggggta tacacaggga ggccaggcag cctggagtta gtcgaccgtt gcgagacgtt 61 gagctgcggc agatgagtcc aaagccgaga gcctcgggac ctccggccaa ggccaaggag 121 acaggaaaga ggaagtcctc ctctcagccg agccccagtg gcccgaagaa gaagactacc 181 aaggtggccg agaagggaga agcagttcgt ggagggagac gcgggaagaa aggggctgcg 241 acaaagatgg cggccgtgac ggcacctgag gcggagagcg ggccagcggc acccggcccc 301 agcgaccagc ccagccagga gctccctcag cacgagctgc cgccggagga gccagtgagc 361 gaggggaccc agcacgaccc cctgagtcag gagagcgagc tggaggaacc actgagtaag 421 gggcgcccat ctactcccct atctccctga gcagcaacta agtttaggcc cagctgccag 481 acctcagaga tctcaccagc agggtgcttc ccatgttgat gacaataaaa tgaatgtgtt 541 gcaaaaaaaa aa // LOCUS HSAF000980 1212 bp mRNA PRI 01-NOV-1997 DEFINITION Homo sapiens testis-specific Basic Protein Y 2 (BPY2) mRNA, complete cds. ACCESSION AF000980 NID g2580545 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1212) AUTHORS Lahn,B.T. and Page,D.C. TITLE Functional coherence of the human Y chromosome JOURNAL Science 278 (5338), 675-680 (1997) MEDLINE 98022381 REFERENCE 2 (bases 1 to 1212) AUTHORS Lahn,B.T. and Page,D.C. TITLE Direct Submission JOURNAL Submitted (23-APR-1997) Whitehead Institute, 9 Cambridge Center, Cambridge, MA 02142, USA FEATURES Location/Qualifiers source 1..1212 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="Y" gene 1..1212 /gene="BPY2" CDS 333..653 /gene="BPY2" /codon_start=1 /product="testis-specific Basic Protein Y 2" /db_xref="PID:g2580546" /translation="MMTLVPRARTRAGQDHYSHPCPRFSQVLLTEGIMTYCLTKNLSD VNILHRLLKNGNVRNTLLQSKVGLLTYYVKLYPGEVTLLTRPSIQMRLCCITGSVSKP RSQK" BASE COUNT 379 a 275 c 245 g 313 t ORIGIN 1 aatatctcag gacccaggac catgtgatat gggcccaaca cctggatgat gttactcttc 61 tgcctaggtc atgcgtaaag agggaattag ggcatattgc ttggcccagt cccgtaatga 121 tatgactctc ctgcttgtgc cagagccaca gaagtgtgct tggtgacata atctttgagg 181 ctgtcacatc accaagatta tattgtatca ctggaccagc ataaagctga cacttctgac 241 tatgcccagc cttcaaataa tactacactg tataattggc tcaacaccca ggtgatattg 301 ttccatttac ctgagaccag ataaaaagcc taatgatgac gcttgtcccc agagccagga 361 cacgtgcagg acaggatcat tactctcatc cctgccccag attttcacag gtgctgctta 421 cagagggcat catgacatat tgcttgacaa agaacctaag tgatgttaat attctgcata 481 ggttgctaaa aaatgggaat gtgagaaata ccttgcttca gtccaaagtg ggcttgctga 541 catattatgt gaaactgtac ccgggtgaag tgactcttct gactaggccc agcatacaaa 601 tgagattatg ctgtatcact ggctcagtgt cgaagcccag atcacagaag taattgtgcc 661 atatgtggaa caagcagcta agcaatagat aacatccatc gtggctctgc cttcaaaggg 721 aaattttaca tatgtcactg ggaccatcac ccagatgatg tcctgcccac taaaagaatt 781 gtgacataac gctgactgca aaaactgggt aatgcaactc tcctctttat tctggagtct 841 gccaaaacaa gggattatca catattgcgg agtccagcac ccaggtaaaa ttttgtcata 901 tacccagctt cagataccat gcaatgatac aactatcata cctggaccca aagaggagag 961 atattttgat tctcattgcc attcttatgg ccacaagcaa agtaatggtt ctcatagtgg 1021 tataaagttc acacagtatt atgacactcc cagcgtatca tagaaaatgt gagtagtaca 1081 atgagtgtta taacagggaa cagcaaacca atgctattgt gattattgga ttcacaccca 1141 gctgacgcga ctatcattct ctcacaagaa cagaacctgc aaataaagta ctaaatctca 1201 ccaaaaaaaa aa // LOCUS HSAF000981 2184 bp mRNA PRI 01-NOV-1997 DEFINITION Homo sapiens testis-specific ChromoDomain Y isoform a (CDY) mRNA, complete cds. ACCESSION AF000981 NID g2580547 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2184) AUTHORS Lahn,B.T. and Page,D.C. TITLE Functional coherence of the human Y chromosome JOURNAL Science 278 (5338), 675-680 (1997) MEDLINE 98022381 REFERENCE 2 (bases 1 to 2184) AUTHORS Lahn,B.T. and Page,D.C. TITLE Direct Submission JOURNAL Submitted (23-APR-1997) Whitehead Institute, 9 Cambridge Center, Cambridge, MA 02142, USA FEATURES Location/Qualifiers source 1..2184 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="Y" gene 1..2184 /gene="CDY" CDS 282..1946 /gene="CDY" /codon_start=1 /product="testis-specific ChromoDomain Y isoform a" /db_xref="PID:g2580548" /translation="MASQEFEVEAIVDKRQDKNGNTQYLVRWRGYDKQDDTWEPEQHL MNCEKCVHDFNRRQTEKQKKLTWTTTSRIFSNNARRRTSRSTKANYSKNSPKTPVTDK HHRSKNRKLFAASKNVRRKAASILSDTKNMEIINSTIETLAPDSPFDHKTVSGFQKLE KLNPIAADQQDTVVFKVTEGKLLRDPLSRPGAEQTGIQNKTQIHPLMSQMSGSVTASM ATGSATRKGIVVLIDPLAANGTTDMHTSVPRVKGGQRNITDDSRDQPFIKKMHFTIRL TESASTYRDIVVKKEDGFTQIVLSTRSTEKNALNTEVIKEIVNALNSAAADDSKLVLF SAAGSVFCCGLDFGYFVKHLRNNRNTASLEMVDTIKNFVNTFIQFKKPIVVSVNGPAI GLGASILPLCDLVWANEKAWFQTPYTTFGQSPDGCSSITFPIMMGKASANEMLIAGRK LTAREACAKGLVSQVFLTGTFTQEVMIQIKELASYNPIVLEECKALVRCNIKLELEQA NERECEVLRKIWSSARGIESMLKIPLLGYKAAFPPRKTQNDQRWCP" BASE COUNT 713 a 456 c 485 g 530 t ORIGIN 1 ctgtggattt agctactctc acctgaggct actgagcaag ttgtcatgca ccatgagaca 61 aagcccaagc tgtcccacca ggcagtaagt atggagaggt tcaggcacat ggcatagctg 121 ctatttcgca caattttcac tacaccagtg gtgacaaaat agaagaggtt catccataca 181 cagaacctgg tgaagagctg gaggcagaaa gaagtgtcta tgtggagacg caactgaaac 241 aaaggtggca cagcaactgt tccaatcccg tgtctttcct catggcttcc caggagtttg 301 aggttgaagc tattgttgac aaaagacagg ataaaaatgg gaatacacag tatttggttc 361 ggtggagagg ttatgacaaa caggatgaca cttgggaacc agagcagcac ctcatgaact 421 gtgaaaaatg tgtacatgat tttaatagac gacagactga aaaacagaaa aaactgacat 481 ggactacaac cagtagaatt ttttcaaaca atgccagaag aagaacttcc agatctacaa 541 aagcaaacta ttctaagaac tctcctaaaa cgccagtgac tgataaacac cacaggtcca 601 aaaaccgcaa gttatttgct gccagcaaga acgttaggag aaaggcagct tcaattctct 661 ccgacacaaa gaatatggag ataataaatt caactattga gacccttgca cctgacagcc 721 cctttgacca caaaactgtg agtggctttc agaaacttga gaaactgaac cctattgcag 781 cagatcagca ggacacggtg gtcttcaagg tgacagaagg gaaactcctc cgggaccctt 841 tgtcacgtcc tggtgcagaa cagactggaa tacagaacaa gactcagata cacccactaa 901 tgtcgcagat gtctggctca gttactgctt ctatggccac aggttcagct acccgaaagg 961 gtatagtggt attaatagac ccattagcag ccaatgggac aacagacatg catacctcag 1021 ttccaagagt gaaaggtggg caaagaaata ttactgatga cagcagagac cagcctttta 1081 tcaagaagat gcacttcacc ataaggctaa cagaaagtgc cagcacatac agagacattg 1141 tagtgaagaa agaggatgga ttcacccaga tagtgctatc aactagatcg acagaaaaaa 1201 atgcactgaa tacagaagta attaaagaaa tagttaatgc tctgaatagc gctgctgcag 1261 atgacagcaa gctcgtgctg ttcagtgcag ctggaagtgt cttttgctgc ggtcttgatt 1321 ttgggtactt tgtgaagcac ttaaggaata acagaaacac agcaagcctt gaaatggtgg 1381 acaccatcaa gaactttgtg aatactttta ttcaatttaa aaagcctatt gttgtatcag 1441 tcaatggccc tgcgattgga ctaggtgcat ccatcctgcc tctttgtgat ctcgtgtggg 1501 ctaatgaaaa ggcttggttc caaacccctt atacgacctt tggacagagt ccagatggct 1561 gttcttctat tacattcccc ataatgatgg gtaaagcatc tgccaatgaa atgttaattg 1621 ctgggcgaaa gctgacagca agggaggcat gcgccaaagg cctggtctct caggtatttt 1681 tgactggaac tttcacccaa gaggttatga ttcaaattaa ggagcttgcc tcatacaatc 1741 caattgtact ggaagaatgt aaggccctcg ttcgctgtaa tattaagttg gagttggaac 1801 aggccaatga gagagagtgt gaggtgctga ggaagatctg gagctcagcc cgagggatag 1861 aatccatgtt aaaaatacct ctgttgggat ataaagcagc cttccctccc agaaagacac 1921 agaatgatca gagatggtgc ccttgacttt atagtggcac aaacgcttca gagacacaca 1981 attataagag acttatcttt tagcataaat acttatggct caaaatccac tgacgatcat 2041 tctcctaaac tgaacacatg actagaattg gtggtgagat atcgcttgat tttcttttcc 2101 tttataaatg tctagttctt acccagttaa caaaagaaaa ctttatcgct ctaaagtaaa 2161 acttgttaca ccacaaaaaa aaaa // LOCUS HSAF000982 5322 bp mRNA PRI 01-NOV-1997 DEFINITION Homo sapiens dead box, X isoform (DBX) mRNA, alternative transcript 2, complete cds. ACCESSION AF000982 NID g2580549 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5322) AUTHORS Lahn,B.T. and Page,D.C. TITLE Functional coherence of the human Y chromosome JOURNAL Science 278 (5338), 675-680 (1997) MEDLINE 98022381 REFERENCE 2 (bases 1 to 5322) AUTHORS Lahn,B.T. and Page,D.C. TITLE Direct Submission JOURNAL Submitted (23-APR-1997) Whitehead Institute, 9 Cambridge Center, Cambridge, MA 02142, USA FEATURES Location/Qualifiers source 1..5322 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" gene 1..5322 /note="X/Y homologous gene" /gene="DBX" CDS 857..2845 /gene="DBX" /note="alternative transcript 2" /codon_start=1 /product="dead box, X isoform" /db_xref="PID:g2580550" /translation="MSHVAVENALGLDQQFAGLDLNSSDNQSGGSTASKGRYIPPHLR NREATRGFYDKDSSGWSSSKDKDAYSSFGSRSDSRGKSSFFSDRGSGSRGRFDDRGRS DYDGIGSRGDRSGFGKFERGGNSRWCDKSDEDDWSKPLPPSERLEQELFSGGNTGINF EKYDDIPVEATGNNCPPHIESFSDVEMGEIIMGNIELTRYTRPTPVQKHAIPIIKEKR DLMACAQTGSGKTAAFLLPILSQIYSDGPGEALRAMKENGRYGRRKQYPISLVLAPTR ELAVQIYEEARKFSYRSRVRPCVVYGGADIGQQIRDLERGCHLLVATPGRLVDMMERG KIGLDFCKYLVLDEADRMLDMGFEPQIRRIVEQDTMPPKGVRHTMMFSATFPKEIQML ARDFLDEYIFLAVGRVGSTSENITQKVVWVEESDKRSFLLDLLNATGKDSLTLVFVET KKGADSLEDFLYHEGYACTSIHGDRSQRDREEALHQFRSGKSPILVATAVAARGLDIS NVKHVINFDLPSDIEEYVHRIGRTGRVGNLGLATSFFNERNINITKDLLDLLVEAKQE VPSWLENMAYEHHYKGSSRGRSKSSRFSGGFGARDYRQSSGASSSSFSSSRASSSRSG GGGHGSSRGFGGGGYGGFYNSDGYGGNYNSQGVDWWGN" BASE COUNT 1532 a 1011 c 1274 g 1505 t ORIGIN 1 ctttcccctt actccgctcc cctcttttcc ctccctctcc tccccttccc tctgttctct 61 cctcctcttc ccctcccctc ccccgtccgg ggcactctat attcaagcca ccgtttcctg 121 cttcacaaaa tggccaccgc acgcgacacc tacggtcacg tggcctgccg ccctctcagt 181 ttcgggaatc tgcctagctc ccactaaggg gaggctaccc gcggaagagc gagggcagat 241 tagaccggag aaatcccacc acatctccaa gcccgggaac tgagagagga agaagagtga 301 aggccagtgt taggaaaaaa aaaaacaaaa acaaaaaaaa cgaaaaacga aagctgagtg 361 catagagttg gaaaggggag cgaatgcgta aggttggaaa ggggggcgaa gaggcctagg 421 ttaacatttt caggcgtctt agccggtgga aagcgggaga cgcaagttct cgcgagatct 481 cgagaactcc gaggctgaga ctagggtttt agcggagagc acgggaagtg tagctcgaga 541 gaactgggac agcatttcgc accctaagct ccaaggcagg actgctaggg gcgacaggac 601 taagtaggaa atcccttgag cttagacctg agggagcgcg cagtagccgg gcagaagtcg 661 ccgcgacagg gaattgcggt gtgagaggga gggcacacgt tgtacgtgct gacgtagccg 721 gctttccagc gggtatatta gatccgtggc cgcgcggtgc gctccagagc cgcagttctc 781 ccgtgagagg gccttcgcgg tggaacaaac actcgcttag cagcggaaga ctccgagttc 841 tcggtactct tcagggatga gtcatgtggc agtggaaaat gcgctcgggc tggaccagca 901 gtttgctggc ctagacctga actcttcaga taatcagagt ggaggaagta cagccagcaa 961 agggcgctat attcctcctc atttaaggaa ccgagaagct actagaggtt tctacgataa 1021 agacagttca gggtggagtt ctagcaaaga taaggatgcg tatagcagtt ttggatctcg 1081 tagtgattca agagggaagt ctagcttctt cagtgatcgt ggaagtggat caaggggaag 1141 gtttgatgat cgtggacgga gtgattacga tggcattggc agccgtggtg acagaagtgg 1201 ctttggcaaa tttgaacgtg gtggaaacag tcgctggtgt gacaaatcag atgaagatga 1261 ttggtcaaaa ccactcccac caagtgaacg cttggaacag gaactctttt ctggaggcaa 1321 cactgggatt aattttgaga aatacgatga cattccagtt gaggcaacag gcaacaactg 1381 tcctccacat attgaaagtt tcagtgatgt tgagatggga gaaattatca tgggaaacat 1441 tgagcttact cgttatactc gcccaactcc agtgcaaaag catgctattc ctattatcaa 1501 agagaaaaga gacttgatgg cttgtgccca aacagggtct ggaaaaactg cagcatttct 1561 gttgcccatc ttgagtcaga tttattcaga tggtccaggc gaggctttga gggccatgaa 1621 ggaaaatgga aggtatgggc gccgcaaaca atacccaatc tccttggtat tagcaccaac 1681 gagagagttg gcagtacaga tctacgaaga agccagaaaa ttttcatacc gatctagagt 1741 tcgtccttgc gtggtttatg gtggtgccga tattggtcag cagattcgag acttggaacg 1801 tggatgccat ttgttagtag ccactccagg acgtctagtg gatatgatgg aaagaggaaa 1861 gattggatta gacttttgca aatacttggt gttagatgaa gctgatcgga tgttggatat 1921 ggggtttgag cctcagattc gtagaatagt cgaacaagat actatgcctc caaagggtgt 1981 ccgccacact atgatgttta gtgctacttt tcctaaggaa atacagatgc tggctcgtga 2041 tttcttagat gaatatatct tcttggctgt aggaagagtt ggctctacct ctgaaaacat 2101 cacacagaaa gtagtttggg tggaagaatc agacaaacgg tcatttctgc ttgacctcct 2161 aaatgcaaca ggcaaggatt cactgacctt agtgtttgtg gagaccaaaa agggtgcaga 2221 ttctctggag gatttcttat accatgaagg atacgcatgt accagcatcc atggagaccg 2281 ttctcagagg gatagagaag aggcccttca ccagttccgc tcaggaaaaa gcccaatttt 2341 agtggctaca gcagtagcag caagaggact ggacatttca aatgtgaaac atgttatcaa 2401 ttttgacttg ccaagtgata ttgaagaata tgtacatcgt attggtcgta cgggacgtgt 2461 aggaaacctt ggcctggcaa cctcattctt taacgagagg aacataaata ttactaagga 2521 tttgttggat cttcttgttg aagctaaaca agaagtgccg tcttggttag aaaacatggc 2581 ttatgaacac cactacaagg gtagcagtcg tggacgttct aagagtagca gatttagtgg 2641 agggtttggt gccagagact accgacaaag tagcggtgcc agcagttcca gcttcagcag 2701 cagccgcgca agcagcagcc gcagtggcgg aggtggccac ggtagcagca gaggatttgg 2761 tggaggtggc tatggaggct tttacaacag tgatggatat ggaggaaatt ataactccca 2821 gggggttgac tggtggggta actgagcctg ctttgcagta ggtcaccctg ccaaacaagc 2881 taatatggaa accacatgta acttagccag actatacctt gtgtagtttc aagaactcgc 2941 agtacattac cagctgtgat tctccactga aatttttttt ttaagggagc tcaaggtcac 3001 aagaagaaat gaaaggaaca atcagcagcc ctgttcagaa ggtggtttga agacttcatt 3061 gctgtagttt ggattaactc ccctcccgcc tacccccatc ccaaactgca tttataattt 3121 tgtgactgag gatcatttgt ttgttaatgt actgtgcctt taactataga caacttttta 3181 ttttgatgtc ctgttggctc agtaatgctc aagatatcaa ttgttttgac aaaataaatt 3241 tactgaactt gggctaaaat caaaccttgg cacacaggtg tgatacaact taacaggaat 3301 catcgattca tccataaata atataaggaa aaacttatgc ggtagcctgc attagggctt 3361 tttgatactt gcagattggg ggaaaacaac aaatgtcttg aagcatatta atggaattag 3421 tttctaatgt ggcaaactgt attaagttaa agttctgatt tgctcactct atcctggata 3481 ggtatttaga acctgatagt ctttaagcca ttccagtcat gatgaggtga tgtatgaata 3541 catgcataca ttcaaagcac tgttttcaaa gttaatgcaa gtaaatacag caattcctct 3601 ttcaacgttt aggcagatca ttaattatga gctagccaaa tgtgggcata ctattacagg 3661 gaaagtttaa aggtctgata acttgaaaat aggtttttag gagaattcat ctacttagac 3721 tttttaagtg cctgccataa atgaaattga aatggtagaa tggctgacca cagcaatgac 3781 cagccctcat tagggccctg gatgattttt ggtctaataa cgcatgctag tgttgatgtt 3841 ttttggtcag agggtatgaa caggaagaat taaatgcagc aggctttatt ttaaatgccg 3901 attcacatta ctctgttcaa gctgcgttga gatgttaaac tggcttacta tagacttcgt 3961 aaaaatggct ccagaaaagt aacaaactga aatctttgag atcacacagg ttggaaatat 4021 gtacataact gcacaaggtg tcaattctgc tctacagtgc agttttagtc agttttagtt 4081 gcataggttt ccattgtatt tatagtctgt ttatgctaaa tctggccaaa gatgaacatt 4141 gtccaccact aaaatgcctc tgccactttg aattctgtgc taattttgtg gccagaatgc 4201 ggtgatcaaa acgctccatc tttttacagt ggcataggaa gacggcaaaa atttcctaaa 4261 gtgcaataga ttttcaagtg tattgtgcct tgttctaaaa cttttattaa gtaggtgcac 4321 ttgacagtat tgaggtcatt tgttatggtg ctatttcaat tagtctaggt ttaggccctt 4381 gtacattttg cccataactt tttacaaagt acttctttta ttgcacattc agagaatttt 4441 atatatatgt cttgtgtgcg tgtccttaaa cttccaatct tactttgtct cttggagatt 4501 gttgaacgca gcttgtctag gaaggggatg ggactagatt ctaaaattta tttgggacca 4561 tgggaatgat agttgggaag aaaactattt gcacacgaca gatttctaga tactttttgc 4621 tgctagcttt atgtaatatt tattgaacat tttgacaaat atttattttt gtaagcctaa 4681 aagtgattct ttgaaagttt aaagaaactt gaccaaaaga cagtacaaaa acactggcac 4741 ttgaatgttg aatgtcaccg tatgcgtgaa attatatatt tcggggtagt gtgagctttt 4801 aatgtttaag tcatattaaa ctcttaagtc aaattaagca gacccggcgt tggcagtgta 4861 gccataactt tctgatgtta gtaaaaacaa aattggcgac ttgaaattaa attatgccaa 4921 ggttttgata cacttgtctt aagatattaa tgaaacactt caaaacactg atgtgaagtg 4981 tccagattct cagatgtttg ttgtgtggat tttgtttagt tgtgtgtttt tttttttttc 5041 agtgaatgtc tggcacattg caatcctcaa acatgtggtt atctttgttg tattggcata 5101 atcagtgact tgtacattca gcaatagcat ttgagcaagt tttatcagca agcaatattt 5161 tcagttaata aggtttcaaa aatcatgtaa ggatttaaac ttgctgaatg taaagattga 5221 acctcaagtc actgtagctt tagtaattgc ttattgtatt agtttagatg ctagcactgc 5281 atgtgctgtg catattctga ttttattaaa ataaaaaaaa aa // LOCUS HSAF000986 10091 bp mRNA PRI 01-NOV-1997 DEFINITION Homo sapiens Drosophila fat facets related Y protein (DFFRY) mRNA, complete cds. ACCESSION AF000986 NID g2580557 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 10091) AUTHORS Lahn,B.T. and Page,D.C. TITLE Functional coherence of the human Y chromosome JOURNAL Science 278 (5338), 675-680 (1997) MEDLINE 98022381 REFERENCE 2 (bases 1 to 10091) AUTHORS Lahn,B.T. and Page,D.C. TITLE Direct Submission JOURNAL Submitted (23-APR-1997) Whitehead Institute, 9 Cambridge Center, Cambridge, MA 02142, USA FEATURES Location/Qualifiers source 1..10091 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="Y" gene 1..10091 /note="X/Y homologous gene" /gene="DFFRY" CDS 1665..9332 /gene="DFFRY" /codon_start=1 /product="Drosophila fat facets related Y protein" /db_xref="PID:g2580558" /translation="MTAITHGSPVGGNDSQGQVLDGQSQHLFQQNQTSSPDSSNENSV ATPPPEEQGQGDAPPQHEDEEPAFPHTELANLDDMINRPRWVVPVLPKGELEVLLEAA IDLSVKGLDVKSEACQRFFRDGLTISFTKILMDEAVSGWKFEIHRCIINNTHRLVELC VAKLSQDWFPLLELLAMALNPHCKFHIYNGTRPCELISSNAQLPEDELFARSSDPRSP KGWLVDLINKFGTLNGFQILHDRFFNGSALNIQIIAALIKPFGQCYEFLSQHTLKKYF IPVIEMVPHLLENLTDEELKKEAKNEAKNDALSMIIKSLKNLASRISGQDETIKNLEI FRLKMILRLLQISSFNGKMNALNEINKVISSVSYYTHRHSNPEEEEWLTAERMAEWIQ QNNILSIVLQDSLHQPQYVEKLEKILRFVIKEKALTLQDLDNIWAAQAGKHEAIVKNV HDLLAKLAWDFSPGQLDHLFDCFKASWTNASKKQREKLLELIRRLAEDDKDGVMAHKV LNLLWNLAQSDDVPVYIMDLALSAHIKILDYSCAQDRDAQKIQWIDHFIEELRTNDKW VIPALKQIREICSLFGEASQNLSQTQRSPHIFYRHDLINQLQQNHALVTLVAENLATY MNSIRLYAGDHEDYDPQTVRLGSRYSHVQEVQERLNFLRFLVKDGQLWLCAPQAKQIW KCLAENAVYLCDREACFKWYSKLMGDEPDLDPDINKDFFESNVLQLDPSLLTENGMKC FERFFKAVNCRERKLIAKRRSYMMDDLELIGLDYLWRVVIQSSDEIANRAIDLLKEIY TNLGPRLKANQVVIHEDFIQSCFDRLKASYDTLCVFDGDKNSINCARQEAIRMVRVLT VIKEYINECDSDYHKERMILPMSRAFRGKHLSLIVRFPNQGRQVDELDIWSHTNDTIG SVRRCIVNRIKANVAHKKIELFVGGELIDSEDDRKLIGQLNLKDKSLITAKLTQINFN MPSSPDSSSDSSTASPGNHRNHYNDGPNLEVESCLPGVIMSVHPRYISFLWQVADLGS NLNMPPLRDGARVLMKLMPPDRTAVEKLRAVCLDHAKLGEGKLSPPLDSLFFGPSASQ VLYLTEVVYALLMPAGVPLTDGSSDFQVHFLKSGGLPLVLSMLIRNNFLPNTDMETRR GAYLNALKIAKLLLTAIGYGHVRAVAEACQPVVDGTDPITQINQVTHDQAVVLQSALQ SIPNPSSECVLRNESILLAQEISNEASRYMPDICVIRAIQKIIWASACGALGLVFSPN EEITKIYQMTTNGSNKLEVEDEQVCCEALEVMTLCFALLPTALDALSKEKAWQTFIID LLLHCPSKTVRQLAQEQFFLMCTRCCMGHRPLLFFITLLFTILGSTAREKGKYSGDYF TLLRHLLNYAYNGNINIPNAEVLLVSEIDWLKRIRDNVKNTGETGVEEPILEGHLGVT KELLAFQTSEKKYHFGCEKGGANLIKELIDDFIFPASKVYLQYLRSGELPAEQAIPVC SSPVTINAGFELLVALAIGCVRNLKQIVDCLTEMYYMGTAITTCEALTEWEYLPPVGP RPPKGFVGLKNAGATCYMNSVIQQLYMIPSIRNSILAIEGTGSDLHDDMFGDEKQDSE SNVDPRDDVFGYPHQFEDKPALSKTEDRKEYNIGVLRHLQVIFGHLAASQLQYYVPRG FWKQFRLWGEPVNLREQHDALEFFNSLVDSLDEALKALGHPAILSKVLGGSFADQKIC QGCPHRYECEESFTTLNVDIRNHQNLLDSLEQYIKGDLLEGANAYHCEKCDKKVDTVK RLLIKKLPRVLAIQLKRFDYDWERECAIKFNDYFEFPRELDMGPYTVAGVANLERDNV NSENELIEQKEQSDNETAGGTKYRLVGVLVHSGQASGGHYYSYIIQRNGKDDQTDHWY KFDDGDVTECKMDDDEEMKNQCFGGEYMGEVFDHMMKRMSYRRQKRWWNAYILFYEQM DMIDEDDEMIRYISELTIARPHQIIMSPAIERSVRKQNVKFMHNRLQYSLEYFQFVKK LLTCNGVYLNPAPGQDYLLPEAEEITMISIQLAARFLFTTGFHTKKIVRGPASDWYDA LCVLLRHSKNVGFWFTHNVLFNVSNRFSEYLLECPSAEVRGAFAKLIVFIAHFSLQDG SCPSPFASPGPSSQACDNLSLSDHLLRATLNLLRREVSEHGHHLQQYFNLFVMYANLG VAEKTQLLKLNVPATFMLVSLDEGPGPPIKYQYAELGKLYSVVSQLIRCCNVSSTMQS SINGNPPLPNPFGDLNLSQPIMPIQQNVLDILFVRTSYVKKIIEDCSNSEDTIKLLRF CSWENPQFSSTVLSELLWQVAYSYTYELRPYLDLLFQILLIEDSWQTHRIHNALKGIP DDRDGLFDTIQRSKNHYQKRAYQCIKCMVALFSSCPVAYQILQGNGDLKRKWTWAVEW LGDELERRPYTGNPQYSYNNWSPPVQSNETANGYFLERSHSARMTLAKACELCPEEEP DDQDAPDEHEPSPSEDAPLYPHSPASQYQQNNHVHGQPYTGPAAHHLNNPQKTGQRTQ ENYEGNEEVSSPQMKDQ" BASE COUNT 3208 a 1797 c 2103 g 2983 t ORIGIN 1 gaagtgacat gttggcatgg gcccaattct gctggtcctt tagtatacaa aaaaaataaa 61 ggtttaccag tatgtcacta catgcagatt tatggattgt acagaaaatt ggtgattccc 121 aaatttcact gtgcatcaaa ataatcgatg gaactttaaa gactaaagat ttctagaccc 181 caccccaggc ccgatgattg agaatatcta gaggggaccc aagaatccat atatttaagt 241 gccccaccca caacaatgac ctttaagcag gtagtttgca tttgggaacc actgctacag 301 gttactagtg ggacaaccag ttaggagcat aagtttgaac attttacagt ttgtcacctg 361 tgatagctta tcacctgtga tataaccaga aatccaatta agattgctat ctctctgtaa 421 tctgtttgca atttaggtgt taattttttt gaaagttcag aaaaaagtag acaaaacaga 481 aaagaaatca agtacaacta cataatgaca aaaaacgtat tacacttgta ttaaacttca 541 aaactggaga ataaaggtgc aatataacat gaaaataatt aaatgctaag tgaaataata 601 tcaaatgtag ttgaccctga agaaaatgca gtagtgaggg atccctaacc tgtgggccct 661 ccaggaatta ctgttgaatg gtcttgagaa tccactggaa aagaccaagc attgttacct 721 gaataattga actttgttta tttctccata tttttgcagt ggtaattcca ttataaaacc 781 taatgaaaca atgtttttat agatggtgtg gaaagacttt tctgggctca gaggtgaaac 841 tgacccttgt gtatcagcag catttctgac tgactgagag agtgtagtga ttaacagagt 901 tgtgatgtta gttaagaaac ttagatttgc cattgtagct tttctaccaa ttagcagatt 961 gtttaactca ctgaaattgt aaagtggtag acgtggactt agtcattact gggcagctta 1021 tgaattgtat tcatttactc atgatgtaaa aatggttagt ctccactttt aaggctctag 1081 ttctagtggc taaataggta cttatttata cagtatgata actgctgtat taaaatacat 1141 gtctcaaatg tggaatagta gaagaggtga agaaaatcat agtttgaggt agaatactgt 1201 ttgctggtct taaaaactgt ggtattttgg tgattccata aattaggtca gatacttcca 1261 ctggagggaa acagtttaaa ggatatatgt gatactatta atagaatgag gaagacacac 1321 cagatattta ggagggaatt agcgagcttg aaactaagag ctggtttgaa tgagactggg 1381 tcataagtga tttcaagtac cagattaagg cactgagatt ttatttttaa gcactgaagt 1441 cagatttttt ccttttaaaa gaaaggattc atgatgaaat ctgctttttg ttttgcagag 1501 agcttggaga taattctggt ggctgtgtgg agtatgtgtt ggaggtatta aattttcaca 1561 gtatatataa ggcagcaatt gataggcctt tcacagattc ttctgataac tacataaaga 1621 gacaaaaaaa agaaaaaaga gcaaagatct gtgctgtgtc aagtatgaca gccatcactc 1681 atggctctcc agtaggaggg aacgacagcc agggccaggt tcttgatggc cagtctcagc 1741 atctcttcca acagaaccag acttcatcac ctgattcttc caatgagaat tccgtagcaa 1801 ctcctcctcc agaggaacaa gggcaaggtg atgccccacc acagcatgaa gatgaagagc 1861 ctgcatttcc acatactgag ctggcaaacc tggatgacat gatcaacagg cctcgatggg 1921 tggttcctgt tttgccaaaa ggggaattag aagtgctttt agaagctgct attgatctta 1981 gtgtaaaagg ccttgatgtt aaaagtgaag catgccaacg tttttttcga gatggactaa 2041 caatatcttt cactaaaatt cttatggatg aggctgtgag tggctggaag tttgaaattc 2101 atagatgtat tattaacaat actcatcgcc tagtggagct ttgtgtggcc aagttgtccc 2161 aagattggtt tccacttcta gaacttctcg ccatggcctt aaatcctcac tgcaagtttc 2221 atatctacaa tggtacacgt ccgtgtgaat taatttcctc aaatgctcag ttgcctgaag 2281 atgaattatt tgctcgttct tcagatcctc gatcaccaaa aggttggcta gtggatctca 2341 tcaataaatt tggcacatta aatgggttcc agattttgca tgatcgtttt tttaatggat 2401 cagcattaaa tattcaaata attgcagctc ttattaaacc atttggacaa tgctatgagt 2461 ttctcagtca acatacactg aaaaagtact tcattccagt tatagaaatg gttccacatt 2521 tattggaaaa cttaactgat gaagaactga aaaaggaggc aaagaatgaa gccaaaaatg 2581 atgccctttc aatgattatt aaatctttga agaacttagc ttcaagaatt tcaggacaag 2641 atgagactat aaaaaatttg gaaattttta ggttaaagat gatactcaga ttgttgcaaa 2701 tttcctcttt taatggaaag atgaatgcac tgaatgaaat aaataaggtt atatctagtg 2761 tatcatatta tactcatcgg catagtaatc ctgaggagga agaatggctg acagctgagc 2821 gaatggcaga atggatacag caaaataata tcttatccat agtcttgcaa gacagtcttc 2881 atcaaccaca atatgtagaa aagctagaga aaattcttcg ttttgtgatt aaagaaaagg 2941 ctcttacatt acaggacctt gataatatct gggcagcaca ggcaggaaaa catgaagcca 3001 ttgtgaagaa tgtacatgat ctgctagcaa agttggcttg ggatttttct cctggacaac 3061 ttgatcatct ttttgattgc tttaaggcaa gttggacaaa tgcaagtaaa aagcaacgtg 3121 aaaagctcct tgagttgata cgccgtcttg cagaagatga taaagatggt gtgatggcac 3181 acaaagtgtt gaaccttctt tggaacctgg ctcagagtga tgatgtgcct gtatacatca 3241 tggaccttgc tcttagtgcc cacataaaaa tactagatta tagttgtgcc caggatcgag 3301 atgcacagaa gatccagtgg atagatcact ttatagaaga acttcgcaca aatgacaagt 3361 gggtaattcc tgctctgaaa caaataagag aaatttgtag tttgtttggt gaagcatctc 3421 aaaatttgag tcaaactcag cgaagtcccc acatatttta tcgccatgat ttaatcaacc 3481 agcttcaaca aaatcatgct ttagttactt tggtagcaga aaaccttgca acctacatga 3541 atagcatcag attgtatgct ggagatcatg aagactatga tccacaaaca gtgaggcttg 3601 gaagtcgata cagtcatgtt caagaagttc aagaacgact aaacttcctt agatttttag 3661 tgaaggatgg ccaactgtgg ctctgtgctc ctcaggcaaa acaaatatgg aagtgcttag 3721 cagaaaatgc agtttatctt tgtgatcgtg aagcctgttt taagtggtat tccaagttaa 3781 tgggggatga accagacttg gatcctgata ttaataagga cttctttgaa agtaatgtac 3841 ttcagcttga tccttccctt ttaactgaaa atggaatgaa atgctttgaa agatttttca 3901 aagctgtcaa ttgtcgagaa aggaaactaa tagcaaaaag aagatcctat atgatggatg 3961 atttggaatt aattggacta gactaccttt ggagggttgt gattcagagt agtgacgaga 4021 ttgctaacag agctatagat cttcttaaag agatatacac aaaccttggc ccaagattaa 4081 aagccaatca ggtggttatc catgaagact tcattcagtc ttgctttgat cgtttaaaag 4141 catcatatga tacactgtgt gtttttgatg gtgacaaaaa cagcattaat tgtgcaagac 4201 aagaagccat tcgaatggtt agagtattaa ctgttataaa agagtacatt aatgaatgtg 4261 acagtgatta tcacaaggaa agaatgattc tacctatgtc gagagcattt cgtggcaaac 4321 acctctctct tatagttcgg tttccaaacc agggcagaca ggttgatgag ttggatatat 4381 ggtctcatac gaatgacaca attggttcag tacggcgatg tattgttaat cgtattaaag 4441 ccaatgtagc ccacaaaaaa attgaacttt ttgtgggtgg tgagctgata gattctgaag 4501 atgacagaaa gctaattgga caattaaact taaaagataa atctctaatt acagccaaac 4561 ttacacaaat aaatttcaat atgccatcaa gtcctgatag ctcttccgat tcctcaactg 4621 catctcctgg aaaccaccgt aatcattaca atgatggtcc caatctagag gtggaaagtt 4681 gtttgcctgg ggtgataatg tcagtgcatc ccagatacat ctctttcctt tggcaagttg 4741 cagacttagg tagcaacctg aatatgccac ctcttagaga tggagcaaga gtacttatga 4801 aacttatgcc accagataga acagctgtag aaaaattacg agctgtttgt ttggaccatg 4861 caaaacttgg agaaggcaaa cttagtccac cccttgactc tcttttcttt ggtccttctg 4921 cctcccaagt tctataccta acagaggtag tttatgcctt gttaatgcct gctggtgtgc 4981 ctctaactga tgggtcctct gactttcaag ttcacttctt gaaaagtggt ggcttacctc 5041 ttgtactgag tatgctaata agaaataact tcttgccaaa tacagatatg gaaactcgaa 5101 ggggtgctta tttaaatgct cttaaaatag ccaaactgtt gttaactgcg attggctatg 5161 gccatgttcg agctgtagca gaagcttgtc agccagttgt agatggtaca gaccccataa 5221 cacagattaa ccaagttact catgatcaag cagtggtgct acaaagtgcc cttcagagca 5281 ttcctaatcc ctcatccgag tgcgtactta gaaatgagtc catacttctt gctcaggaaa 5341 tatctaatga ggcttcaaga tatatgcctg atatttgtgt aattagggct atacagaaaa 5401 ttatctgggc atcagcatgt ggggcattag gactagtttt tagcccaaat gaagaaataa 5461 ctaaaattta tcagatgacc accaatggaa gcaataagct ggaggtggaa gatgaacaag 5521 tttgctgtga agcactggaa gtgatgacct tatgttttgc tttacttcca acagcgttgg 5581 atgcacttag taaagaaaaa gcctggcaga ccttcatcat tgacttatta ttgcactgtc 5641 caagcaaaac tgttcgtcag ttggcacagg agcagttctt tttaatgtgc accagatgtt 5701 gcatgggaca caggcctctg cttttcttca ttactttact ctttaccata ctggggagca 5761 cagcaagaga gaagggtaaa tattcaggtg attatttcac acttttacgg caccttctca 5821 attatgctta caatggcaat attaacatac ccaatgctga agttcttctt gtcagtgaaa 5881 ttgattggct caaaaggatt agggataatg ttaaaaacac aggtgaaaca ggtgtcgaag 5941 agccaatact ggaaggccac cttggggtaa caaaagagtt attggccttt caaacttctg 6001 agaaaaagta tcactttggt tgtgaaaaag gaggtgctaa tctcattaaa gaattaattg 6061 atgatttcat ctttcccgca tccaaagttt acctgcagta tttaagaagt ggagaactac 6121 cagctgagca ggctattcca gtctgtagtt cacccgttac catcaatgcc ggttttgagc 6181 tacttgtagc attagctatt ggctgtgtga ggaatctcaa acagatagta gactgtttga 6241 ctgaaatgta ttacatgggc acagcaatta ctacttgtga agcacttact gagtgggaat 6301 atctgccccc tgttggaccc cgcccaccaa aaggatttgt gggactcaaa aatgctggtg 6361 ctacgtgtta catgaactct gtgatccagc agctatacat gattccttct atcaggaaca 6421 gtattcttgc aattgaaggc acaggtagtg atttacacga tgatatgttc ggggatgaga 6481 agcaggacag tgagagtaat gttgatcccc gagatgatgt atttggatat cctcatcaat 6541 ttgaagacaa gccagcatta agtaagacag aagataggaa agagtataat attggtgtcc 6601 taagacacct tcaggtcatc tttggtcatt tagctgcttc ccaactacaa tactatgtac 6661 ccagaggatt ttggaaacag ttcaggcttt ggggtgaacc tgttaatctc cgtgaacaac 6721 atgatgcctt agagtttttt aattctttgg tggatagttt agatgaagct ttaaaagctt 6781 taggacaccc ggctatacta agtaaagtcc taggaggctc ctttgctgat cagaagatct 6841 gccaaggctg cccacatagg tatgaatgtg aagaatcttt tacaactttg aatgtggata 6901 ttagaaatca tcaaaatctt cttgactctt tggaacagta tatcaaagga gatttattgg 6961 aaggtgcaaa tgcatatcat tgtgaaaaat gtgataaaaa ggttgacaca gtaaagcgcc 7021 tgctaattaa aaaattgcct cgggttcttg ctatccaact caaacgattt gactatgact 7081 gggaaagaga atgtgcaatt aaattcaatg attattttga atttcctcga gagctggata 7141 tgggacctta cacagtagca ggtgttgcaa acctggaaag ggataatgta aactcagaaa 7201 atgagttgat tgaacagaaa gagcagtctg acaatgaaac tgcaggaggc acaaagtaca 7261 gacttgtagg agtgcttgta cacagtggtc aagcaagcgg tgggcattat tattcttaca 7321 tcattcaaag gaatggtaaa gatgatcaga cagatcactg gtataaattt gatgatggag 7381 atgtaacaga atgcaaaatg gatgatgatg aagaaatgaa aaatcagtgt tttggtggag 7441 agtacatggg agaagtattt gatcacatga tgaagcgcat gtcatatagg cgacagaaga 7501 ggtggtggaa tgcttacata cttttttatg aacaaatgga tatgatagat gaagatgatg 7561 agatgataag atacatatca gagctaacta ttgcaagacc ccatcagatc attatgtcac 7621 cagccattga gagaagtgta cggaaacaaa atgtgaaatt tatgcataac cgattgcaat 7681 atagtttaga gtattttcag tttgtgaaaa aactgcttac atgtaatggt gtttatttaa 7741 accctgctcc agggcaggat tatttgttgc ctgaagcaga agaaattact atgattagta 7801 ttcagcttgc tgctagattc ctctttacca ctggatttca caccaagaaa atagttcgtg 7861 gtcctgccag tgactggtat gatgcactgt gcgttcttct ccgtcacagc aaaaatgtag 7921 gtttttggtt tactcataat gtccttttta atgtatcaaa tcgcttctct gaataccttc 7981 tggagtgccc tagtgcagaa gtgaggggtg catttgcaaa acttatagtg tttattgcac 8041 acttttcctt gcaagatggg tcttgtcctt ctccttttgc atctccagga ccttctagtc 8101 aggcatgtga taacttgagc ttgagtgacc acttactaag agccacacta aatctcttga 8161 gaagggaagt ttcagagcat ggacatcatt tacagcaata ttttaatttg tttgtaatgt 8221 atgccaattt aggtgtggca gaaaaaacac agcttctgaa attgaatgta cctgctacct 8281 ttatgcttgt gtctttagac gaaggaccag gtcctccaat caaatatcag tatgctgaat 8341 taggcaagtt atattcagta gtgtctcagc tgattcgttg ttgcaatgtg tcatcaacaa 8401 tgcagtcttc aatcaatggt aatccccctc tccccaatcc tttcggtgac cttaatttat 8461 cacagcctat aatgccaatt cagcagaatg tgttagacat tttatttgtg agaacaagtt 8521 atgtgaagaa aattattgaa gactgcagta actcagagga taccatcaaa ttacttcgct 8581 tttgctcttg ggagaatcct cagttctcat ctactgtcct cagcgaactt ctctggcagg 8641 ttgcatattc atatacctat gaacttcggc catatttaga tctacttttc caaattttac 8701 tgattgagga ctcctggcag actcacagaa ttcataatgc acttaaagga attccagatg 8761 acagagatgg gctgttcgat acaatacagc gctcgaagaa tcactatcaa aaacgagcat 8821 atcagtgcat aaaatgtatg gtagctctat ttagcagttg tcctgttgct taccagatct 8881 tacagggtaa cggagatctt aaaagaaaat ggacctgggc agtggaatgg ctaggagatg 8941 aacttgaaag aagaccatat actggcaatc ctcagtatag ttacaacaat tggtctcctc 9001 cagtacaaag caatgaaaca gcaaatggtt atttcttaga aagatcacat agtgctagga 9061 tgacacttgc aaaagcttgt gaactctgtc cagaagagga gccagatgac caggatgccc 9121 cagatgagca tgagccctct ccatcagaag atgccccatt atatcctcat tcacctgcct 9181 ctcagtatca acagaataat catgtacatg gacagccata tacaggacca gcagcacatc 9241 acttgaacaa ccctcagaaa acaggccaac gaacacaaga aaattatgaa ggcaatgaag 9301 aagtatcctc acctcagatg aaggatcagt gaaaagcaat aattaactgc ttcctttatg 9361 actatgcact aaggtcttat agtccaaact ttctctgtgt ctggctagta ttgaaaacta 9421 gataaactgc tccaaaccaa catggagtaa agagcatatt cactggttta tttgcagtaa 9481 tttgcaattt gtcagtgtat aagacacatg cagggtgaag tgtacagagt tttgtaacaa 9541 atgactggtc ctaatctgta aatgagaaag gtatatatac tatgttaatg tctgactgtt 9601 aattcttaag caagaaactt tttttgatga aaacaagtca gatctacaca gtcacacaat 9661 tattttttgt tgtgttcact acattgtgca attgatattg cctgctttga gcagtttggt 9721 caacttacca acttcccccc aaaaaaggga acataaaaga gcccatcttt gtcagtttac 9781 accaatagtt tcttgttaat ccttctttcc tggatatata aggctggtgg taacttttga 9841 attatatggt tgatgtggaa aattggcagt gtaacatttc tagatacttt tcattacctt 9901 tttattctgg tatataggct aaccacttta aagctattct tatgctgtaa cagttagcat 9961 ggcttcacac tgtttgtgta gccaagagga cagaattaca tgaatgacag tgcccagagt 10021 gacagctgta tattgctcag agcttttatt tcttatacct agaataaata taaaatgggg 10081 gaaaaaaaaa a // LOCUS HSAF000988 1248 bp mRNA PRI 01-NOV-1997 DEFINITION Homo sapiens testis-specific PTP-BL Related Y protein (PRY) mRNA, complete cds. ACCESSION AF000988 NID g2580561 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1248) AUTHORS Lahn,B.T. and Page,D.C. TITLE Functional coherence of the human Y chromosome JOURNAL Science 278 (5338), 675-680 (1997) MEDLINE 98022381 REFERENCE 2 (bases 1 to 1248) AUTHORS Lahn,B.T. and Page,D.C. TITLE Direct Submission JOURNAL Submitted (23-APR-1997) Whitehead Institute, 9 Cambridge Center, Cambridge, MA 02142, USA FEATURES Location/Qualifiers source 1..1248 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="Y" gene 1..1248 /gene="PRY" CDS 183..926 /gene="PRY" /codon_start=1 /product="testis-specific PTP-BL Related Y protein" /db_xref="PID:g2580562" /translation="MNKMGLNNPKKNHSRTMGATGLGFLLPWKQDNLNGTDCQGCNIL YFSETTGSMCSELSLNRGLEARRKKDLKDSFLWRYGKVGCISLPLREMTAWINPPQIS EIFQGYHQRVHGADALSLQTNSLRSRLSSQCLGQSFLLRTLERAVVSGHLGTSVATFM KKTKPTSSQDPPKSGRGFGTPAVGSTMRIKPPSLLDMSRSGRCYKSPGATTRVRIKTS PQDPPRRVHGIETSGGQVRKRHPVCSTQN" BASE COUNT 349 a 310 c 311 g 278 t ORIGIN 1 aagaagagga gcacaccaca ccagaaacag acatcttgca gtgtttcact gtctcaacct 61 tatctgcaca gtccgaggtc agtctgagag agcttctgag agacccagga tgaagggatg 121 cagtgaggtc aagagcccaa ccttctttca ctgacaccca cctctaagga ctcagaagag 181 acatgaataa aatgggcctc aacaatccca agaagaacca ctcaaggaca atgggagcca 241 ctgggcttgg cttcctactt ccctggaaac aagacaattt gaatggcact gactgccagg 301 gatgcaatat tttatacttc tctgagacta cggggagcat gtgttctgaa ctttccctga 361 acagaggtct tgaggccaga aggaagaagg atcttaaaga ctcatttctc tggagatatg 421 ggaaggttgg ctgtatctca cttccacttc gtgagatgac cgcctggatt aacccacccc 481 aaatttcaga gattttccaa ggctaccacc agagggtgca cggagctgat gcactgagcc 541 tgcaaaccaa ctctctgaga agcaggttat cttcacagtg cctcggacag agcttccttc 601 tcaggacact cgagagagcc gtggtttcag ggcacttggg gacatctgtg gccacgttca 661 tgaagaagac taagcctact tcatctcagg acccgcccaa gagtggccgc ggctttggga 721 cacctgcggt cgggtccacc atgaggataa aacctccttc tcttctggac atgtccagga 781 gtggccgttg ctacaagtca cctggtgcta cgaccagggt gagaataaag acgtctcctc 841 aggaccctcc caggagagta catggcattg agacatctgg cggccaagtg aggaaaagac 901 accctgtctg cagcacccag aactgaggag gggcactgcc ctgggcctta cttcccagcc 961 ctggcctcca attctgacct tacaaaagtg tcccttgagt gaggcagtga ccacgcattg 1021 tcacagctac caaagtgtgg tttgcagatg atctgggctt gtttctggca gagattctgg 1081 tacagagaaa ggagaggcgt tgagtggaac cacgatgggc tgaggccagg ggagacatca 1141 caacctccaa caacactttt tttcatgctt taataaatca tttttcttag agaactaaag 1201 tagttgaaac aatatagaaa cattttttaa gtaggcataa aaaaaaaa // LOCUS HSAF000992 4856 bp mRNA PRI 01-NOV-1997 DEFINITION Homo sapiens ubiquitous TPR motif, X isoform (UTX) mRNA, alternative transcript 1, complete cds. ACCESSION AF000992 NID g2580569 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4856) AUTHORS Lahn,B.T. and Page,D.C. TITLE Functional coherence of the human Y chromosome JOURNAL Science 278 (5338), 675-680 (1997) MEDLINE 98022381 REFERENCE 2 (bases 1 to 4856) AUTHORS Lahn,B.T. and Page,D.C. TITLE Direct Submission JOURNAL Submitted (23-APR-1997) Whitehead Institute, 9 Cambridge Center, Cambridge, MA 02142, USA FEATURES Location/Qualifiers source 1..4856 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" gene 1..4856 /note="X/Y homologous gene" /gene="UTX" CDS 27..4232 /gene="UTX" /note="alternative transcript 1" /codon_start=1 /product="ubiquitous TPR motif, X isoform" /db_xref="PID:g2580570" /translation="MKSCGVSLATAAAAAAAFGDEEKKMAAGKASGESEEASPSLTAE EREALGGLDSRLFGFVRFHEDGARTKALLGKAVRCYESLILKAEGKVESDFFCQLGHF NLLLEDYPKALSAYQRYYSLQSDYWKNAAFLYGLGLVYFHYNAFQWAIKAFQEVLYVD PSFCRAKEIHLRVGLMFKVNTDYESSLKHFQLALVDCNPCTLSNAEIQFHIAHLYETQ RKYHSAKEAYEQLLQTENLSAQVKATVLQQLGWMHHTVDLLGDKATKESYAIQYLQKS LEADPNSGQSWYFLGRCYSSIGKVQDAFISYRQSIDKSEASADTWCSIGVLYQQQNQP MDALQAYICAVQLDHGHAAAWMDLGTLYESCNQPQDAIKCYLNATRSKSCSNTSALAA RIKYLQAQLCNLPQGSLQNKTKLLPSIEEAWSLPIPAELTSRQGAMNTAQQNTSDNWS GGHAVSHPPVQQQAHSWCLTPQKLQHLEQLRANRNNLNPAQKLMLEQLESQFVLMQQH QMRPTGVAQVRSTGIPNGPTADSSLPTNSVSGQQPQLALTRVPSVSQPGVRPACPGQP LANGPFSAGHVPCSTSRTRGSTDTILIGNNHITGNGSNGNVPYLQRNALTLPHNRTNL TSSAKEPWKNQLSNSTQGLHKGQSSHSAGPNGERPLSSTGPSQHLQAAGSGIQNQNGH PTLPSNSVTQGAALNHLSSHTATSGGQQGITLTKESKPSGNILTVPETSRHTGETPNS TASVEGLPNHVHQMTADAVCSPSHGDSKSPGLLSSDNPQLSALLMGKANNNVGTGTCD KVNNIHPAVHTKTDNSVASSPSSAISTATPSPKSTEQTTTNSVTSLNSPHSGLHTING EGMEESQSPMKTDLLLVNHKPSPQIIPSMSVSIYPSSAEVLKACRNLGKNGLSNSSIL LDKCPPPRPPSSPYPPLPKDKLNPPTPSIYLENKRDAFFPPLHQFCTNPNNPVTVIRG LAGALKLDLGLFSTKTLVEANNEHMVEVRTQLLQPADENWDPTGTKKIWHCESNRSHT TIAKYAQYQASSFQESLREENEKRSHHKDHSDSESTSSDNSGRRRKGPFKTIKFGTNI DLSDDKKWKLQLHELTKLPAFVRVVSAGNLLSHVGHTILGMNTVQLYMKVPGSRTPGH QENNNFCSVNINIGPGDCEWFVVPEGYWGVLNDFCEKNNLNFLMGSWWPNLEDLYEAN VPVYRFIQRPGDLVWINAGTVHWVQAIGWCNNIAWNVGPLTACQYKLAVERYEWNKLQ SVKSIVPMVHLSWNMARNIKVSDPKLFEMIKYCLLRTLKQCQTLREALIAAGKEIIWH GRTKEEPAHYCSICEVEVFDLLFVTNESNSRKTYIVHCQDCARKTSGNLENFVVLEQY KMEDLMQVYDQFTLAPPLPSASS" BASE COUNT 1452 a 1081 c 1032 g 1291 t ORIGIN 1 aaagcaaaag aattcgctgc gtttccatga aatcctgcgg agtgtcgctc gctaccgccg 61 ccgctgccgc cgccgctttc ggtgatgagg aaaagaaaat ggcggcggga aaagcgagcg 121 gcgagagcga ggaggcgtcc cccagcctga cagccgagga gagggaggcg ctcggcggac 181 tggacagccg cctctttggg ttcgtgagat ttcatgaaga tggcgccagg acgaaggccc 241 tactgggcaa ggctgttcgc tgctatgaat ctctaatctt aaaagctgaa ggaaaagtgg 301 agtctgattt cttttgtcaa ttaggtcact tcaacctctt attggaagat tatccaaaag 361 cattatctgc ataccagagg tactacagtt tacagtctga ctactggaag aatgctgcct 421 ttttatatgg tcttggtttg gtctacttcc attataatgc atttcagtgg gcaattaaag 481 catttcagga ggtgctttat gttgatccca gcttttgtcg agccaaggaa attcatttac 541 gagttgggct tatgttcaaa gtgaacacag actatgagtc tagtttaaag cattttcagt 601 tagctttggt tgactgtaat ccctgcactt tgtccaatgc tgaaattcaa tttcacattg 661 cccacttata tgaaacccag aggaaatatc attctgcaaa agaagcttat gaacaacttt 721 tgcagacaga gaatctttct gcacaagtaa aagcaactgt cttacaacag ttaggttgga 781 tgcatcacac tgtagatctc ctgggagata aagccaccaa ggaaagctat gctattcagt 841 atctccaaaa gtccttggaa gcagatccta attctggcca gtcctggtat ttcctcggaa 901 ggtgctattc aagtattggg aaagttcagg atgcctttat atcttacagg cagtctattg 961 ataaatcaga agcaagtgca gatacatggt gttcaatagg tgtgctatat cagcagcaaa 1021 atcagcccat ggatgcttta caggcctata tttgtgctgt acaattggac catggccatg 1081 ctgcagcctg gatggaccta ggcactctct atgaatcctg caaccagcct caggatgcca 1141 ttaaatgcta cttaaatgca actagaagca aaagttgtag taatacctct gcacttgcag 1201 cacgaattaa gtatttacag gctcagttgt gtaaccttcc acaaggtagt ctacagaata 1261 aaactaaatt acttcctagt attgaggagg cgtggagcct accaattccc gcagagctta 1321 cctccaggca gggtgccatg aacacagcac agcagaatac ttctgacaat tggagtggtg 1381 gacatgctgt gtcacatcct ccagtacagc aacaagctca ttcatggtgt ttgacaccac 1441 agaaattaca gcatttggaa cagctccgcg caaatagaaa taatttaaat ccagcacaga 1501 aactgatgct ggaacagctg gaaagtcagt ttgtcttaat gcaacaacac caaatgagac 1561 caacaggagt tgcacaggta cgatctactg gaattcctaa tgggccaaca gctgactcat 1621 cactgcctac aaactcagtc tctggccagc agccacagct tgctctgacc agagtgccta 1681 gcgtctctca gcctggagtc cgtcctgcct gccctgggca gcctttggcc aatggaccct 1741 tttctgcagg ccatgttccc tgtagcacat caagaacgcg gggaagtaca gacactattt 1801 tgataggcaa taatcatata acaggaaatg gaagtaatgg aaacgtgcct tacctgcagc 1861 gaaacgcact cactctacct cataaccgca caaacctgac cagcagcgca aaggagccgt 1921 ggaaaaacca actatctaac tccactcagg ggcttcacaa aggtcagagt tcacattcgg 1981 caggtcctaa tggtgaacga cctctctctt ccactgggcc ttcccagcat ctccaggcag 2041 ctggctctgg tattcagaat cagaacggac atcccaccct gcctagcaat tcagtaacac 2101 agggggctgc tctcaatcac ctctcctctc acactgctac ctcaggtgga caacaaggca 2161 ttaccttaac caaagagagc aagccttcag gaaacatatt gacggtgcct gaaacaagca 2221 ggcacactgg agagacacct aacagcactg ccagtgtcga gggacttcct aatcatgtcc 2281 atcagatgac ggcagatgct gtttgcagtc ctagccatgg agattctaag tcaccaggtt 2341 tactaagttc agacaatcct cagctctctg ccttgttgat gggaaaagcc aataacaatg 2401 tgggtactgg aacctgtgac aaagtcaata acatccaccc agctgttcat acaaagactg 2461 ataactctgt tgcctcttca ccatcttcag ccatttcaac agcaacacct tctccaaaat 2521 ccactgagca gacaaccaca aacagtgtta ccagccttaa cagccctcac agtgggctac 2581 acacaattaa tggagaaggg atggaagaat ctcagagccc catgaaaaca gatctgcttc 2641 tggttaacca caaacctagt ccacagatca taccatcaat gtctgtgtcc atatacccca 2701 gctcagcaga agttctgaag gcatgcagga atctaggtaa aaatggctta tctaacagta 2761 gcattttgtt ggataaatgt ccacctccaa gaccaccatc ttcaccatac cctcccttgc 2821 caaaggacaa gttgaatcca cctacaccta gtatttactt ggaaaataaa cgtgatgctt 2881 tctttcctcc attacatcaa ttttgtacaa atccgaacaa ccctgttaca gtaatacgtg 2941 gccttgctgg agctcttaag ttagacctgg gacttttctc tactaaaact ttggtggaag 3001 ctaacaatga acatatggta gaagtgagga cacagttgtt gcagccagca gatgaaaact 3061 gggatcccac tggaacaaag aaaatctggc attgtgaaag taatagatct catactacaa 3121 ttgctaaata tgcacagtac caggcctcct cattccagga atcattgaga gaagaaaatg 3181 aaaaaagaag tcatcataaa gaccactcag atagtgaatc tacatcgtca gataattctg 3241 ggaggaggag gaaaggaccc tttaaaacca taaagtttgg gaccaatatt gacctatctg 3301 atgacaaaaa gtggaagttg cagctacatg agctgactaa acttcctgct tttgtgcgtg 3361 tcgtatcagc aggaaatctt ctaagccatg ttggtcatac catattgggc atgaacacag 3421 ttcaactata catgaaagtt ccagggagca gaacaccagg tcatcaggaa aataacaact 3481 tctgttcagt taacataaat attggcccag gtgactgtga atggtttgtt gttcctgaag 3541 gttactgggg tgttttgaat gacttctgtg aaaaaaataa tttgaatttc ctaatgggtt 3601 cttggtggcc caatcttgaa gatctttatg aagcaaatgt tccagtgtat aggtttattc 3661 agcgacctgg agatttggtc tggataaatg caggcactgt tcattgggtt caggctattg 3721 gctggtgcaa caacattgct tggaatgttg gtccacttac agcctgccag tataaattgg 3781 cagtggaacg gtacgaatgg aacaaattgc aaagtgtgaa gtcaatagta cccatggttc 3841 atctttcctg gaatatggca cgaaatatca aggtctcaga tccaaagctt tttgaaatga 3901 ttaagtattg tcttctaaga actctgaagc aatgtcagac attgagggaa gctctcattg 3961 ctgcaggaaa agagattata tggcatgggc ggacaaaaga agaaccagct cattactgta 4021 gcatttgtga agtggaggtt tttgatctgc tttttgtcac taatgagagt aattcacgaa 4081 agacctacat agtacattgc caagattgtg cacgaaaaac aagcggaaac ttggaaaact 4141 ttgtggtgct agaacagtac aaaatggagg acctgatgca agtctatgac caatttacat 4201 tagctcctcc attaccatcc gcctcatctt gatattgttc catggacatt aaatgagacc 4261 ttttctgcta ttcaggaaat aacccagttc tgcaccactg gtttttgtag ctatctcgta 4321 aggctgctgg ctgaaaactg tgtctatgca accttccaag tgcggagtgt caaccaactg 4381 gacgggagag agtactgctc ctactccagg actctcacaa agctgatgag ctgtacttca 4441 gaaaaaaata ataatttcca tgttttgtat atatctgaca aaactggcaa catcttacag 4501 actactgact tgaagacaac ctcttttata tttctctatt tctgggctga tgaatttgtt 4561 ttcatctgtc ttttccccct tcagaatttt ccttggaaaa aaaatactag cctagctggt 4621 catttctttg taaggtagtt agcaatttta agtctttctt tggtcaactt ttttttaatg 4681 tgaaaagtta ggtaagacac ttttttactg cttttatgtt tttctgtctt gttttgagac 4741 catgatggtt acacttttgg ttcctaaata aaatttaaaa aattaacagc caagtcacaa 4801 aggtaatgga ttgcacatag actaaggaat aaacttcaga tttgtgaaaa aaaaaa // LOCUS HSAF000997 1588 bp mRNA PRI 01-NOV-1997 DEFINITION Homo sapiens testis-specific XK Related Y (XKRY) mRNA, complete cds. ACCESSION AF000997 NID g2580579 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1588) AUTHORS Lahn,B.T. and Page,D.C. TITLE Functional coherence of the human Y chromosome JOURNAL Science 278 (5338), 675-680 (1997) MEDLINE 98022381 REFERENCE 2 (bases 1 to 1588) AUTHORS Lahn,B.T. and Page,D.C. TITLE Direct Submission JOURNAL Submitted (23-APR-1997) Whitehead Institute, 9 Cambridge Center, Cambridge, MA 02142, USA FEATURES Location/Qualifiers source 1..1588 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="Y" gene 1..1588 /gene="XKRY" CDS 664..1143 /gene="XKRY" /codon_start=1 /product="testis-specific XK Related Y" /db_xref="PID:g2580580" /translation="MFIFNSIADDIFPLISCVGAIHCNILAIRTGNDFAAIKLQVIKL IYLMIWHSLVIISPVVTLAFFPASLKQGSLHFLLIIYFVLLLTPWLEFSKSGTHLPSN TKIIPAWWVSMDAYLNHASICCHQFSCLSAVKLQLSNEELIRDTRWDIQSYTTDFSF" BASE COUNT 504 a 284 c 258 g 542 t ORIGIN 1 attaaaaact tctgataaaa ttacctaagt acacacaaac aaaaacatgc ccacacaaat 61 cacttaattt ctaaaacttt taatttttct gcttctctag taccttgtat tccatcacac 121 agcaaaatct ggcagctcca cttccagaat ttacttgaac tccacagctt atttccgatt 181 tcctgttatc accagagtct aaaacacagt ttatattgca ttcacctcct attttacacc 241 gtaatttcct actttacact ctaactttat ataaaaaaga aactaccttt tcaagatcta 301 attcacgcaa ttttatttgt tcttaattga gacttctttc taggtgctgt cacaccttgt 361 aacgtcagat acaaatgtct ctatccaatt tcatgagttc cagttatttt attttaaggg 421 aatgtgtata tacatttata aatttgtgta tgtgtgtatt cacttattct ttattttata 481 tgttttgcat gcatatattc actaaatccc tgataataga aagataacaa atcttttttt 541 ttctttcttt tttgtatgta aattattttc cgaaggaggt gggttgggag aaatatatct 601 taacttggca agtttaaaag agaaagtggc cattactaat gaaaattatt ctctagcatt 661 ttcatgttta tctttaatag cattgctgat gacatattcc ctcttatcag ttgtgtaggt 721 gccattcact gcaatatact ggccatccgc actggcaacg actttgctgc cattaagcta 781 caggtgataa aattgatcta tctcatgata tggcattcgt tggtgattat ctcacctgta 841 gtgactctgg cattcttccc tgcatctctg aaacagggga gcttacactt tctattaatc 901 atatattttg tattattgtt gacaccatgg ctggagtttt cgaaaagtgg aactcatctt 961 cctagcaaca caaaaataat tccagcatgg tgggtaagta tggatgctta tcttaatcat 1021 gctagtatat gctgccatca attctcctgc ttgtcagcag tgaaactgca gctgtcaaat 1081 gaggaattga taagagacac gaggtgggac atacaatcct acactacaga tttcagtttt 1141 tagaaaatgt gataataata ttgatattta gtttctttgg agggaacgtt ttaccgaagt 1201 gttgtgactc aataattgcc gtgtagttca tcaaaaccta catattagcc tttggcttta 1261 agctccgctt ctgtcagtat ttgcaaccaa ggtggtcggg caaagtattg ccaggagata 1321 ctgaaaatca tccagaagca ctgtgatatt gtgtaagcat ctggagaaaa ttcagttaaa 1381 agaataaaag taagcagctg aggaattact atcactcatg gagaagggta ggatattttc 1441 aataagtgag tatgcaatat ccatatatac tttcacagaa caaagagtaa agaggctgag 1501 tgtgacttta taaagatact catgaaaaat ataaacaaca aaaccttgga agtagtttct 1561 aataaaattg atttttctaa aaaaaaaa // LOCUS HSAF001433 2440 bp mRNA PRI 17-OCT-1997 DEFINITION Human requiem (HREQ) mRNA, complete cds. ACCESSION AF001433 NID g2529704 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2440) AUTHORS Guru,S.C., Agarwal,S.K., Manickam,P., Olufemi,S.-E., Crabtree,J.S., Weisemann,J.M., Kester,M., Kim,Y.S., Emmert-Buck,M.R., Liotta,L.A., Spiegel,A.M., Boguski,M., Roe,B.A., Collins,F.S., Burns,A.L., Marx,S.J. and Chandrasekharappa,S.C. TITLE A transcript map for the 2.8-Mb region containing the multiple endocrine neoplasia type 1 locus JOURNAL Genome Res. 7 (7), 725-735 (1997) MEDLINE 97397562 REFERENCE 2 (bases 1 to 2440) AUTHORS Chandrasekharappa,S.C. TITLE Direct Submission JOURNAL Submitted (29-APR-1997) LGT, NHGRI, Bldg 49, Room 3A76, Bethesda, MD 20894, USA FEATURES Location/Qualifiers source 1..2440 /organism="Homo sapiens" /db_xref="taxon:9606" /map="11q13" /chromosome="11" gene 1..2440 /gene="HREQ" CDS 42..1217 /gene="HREQ" /note="neuroD4; ubi-d4" /codon_start=1 /product="requiem" /db_xref="PID:g2529705" /translation="MAAVVENVVKLLGEQYYKDAMEQCHNYNARLCAERSVRLPFLDS QTGVAQSNCYIWMEKRHRGPGLASGQLYSYPARRWRKKRRAHPPEDPRLSFPSIKPDT DQTLKKEGLISQDGSSLEALLRTDPLEKRGAPDPRVDDDSLGEFPVTNSRARKRILEP DDFLDDLDDEDYEEDTPKRRGKGKSKGKGVGSARKKLDASILEDRDKPYACDICGKRY KNRPGLSYHYAHSHLAEEEGEDKEDSQPPTPVSQRSEEQKSKKGPDGLALPNNYCDFC LGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNICG TSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSCHLCLDLLKEKASIYQNQNSS" BASE COUNT 521 a 705 c 622 g 591 t 1 others ORIGIN 1 ggataaccgt attaccgcct ttggagtgag tgataccgaa gatggcggct gtggtggaga 61 atgtagtgaa gctccttggg gagcagtact acaaagatgc catggagcag tgccacaatt 121 acaatgctcg cctctgtgct gagcgcagcg tgcgcctgcc tttcttggac tcacagaccg 181 gagtagccca gagcaattgt tacatctgga tggaaaagcg acaccggggt ccaggattgg 241 cctccggaca gctgtactcc taccctgccc ggcgctggcg gaaaaagcgg cgagcccatc 301 cccctgagga tccacgactt tccttcccat ctattaagcc agacacagac cagaccctga 361 agaaggaggg gctgatctct caggatggca gtagtttaga ggctctgttg cgcactgacc 421 ccctggagaa gcgaggtgcc ccggatcccc gagttgatga tgacagcctg ggcgagtttc 481 ctgtgaccaa cagtcgagcg cgaaagcgga tcctagaacc agatgacttc ctggatgacc 541 tcgatgatga agactatgaa gaagatactc ccaagcgtcg gggaaagggg aaatccaagg 601 gtaagggtgt gggcagtgcc cgtaagaagc tggatgcttc catcctggag gaccgggata 661 agccctatgc ctgtgacatt tgtggaaaac gttacaagaa ccgaccaggc ctcagttacc 721 actatgccca ctcccacttg gctgaggagg agggcgagga caaggaagac tctcaaccac 781 ccactcctgt ttcccagagg tctgaggagc agaaatccaa aaagggtcct gatggattgg 841 ccttgcccaa caactactgt gacttctgcc tgggggactc aaagattaac aagaagacgg 901 gacaacccga ggagctggtg tcctgttctg actgtggccg ctcagggcat ccatcttgcc 961 tccaatttac ccccgtgatg atggcggcag tgaagacata ccgctggcag tgcatcgagt 1021 gcaaatgttg caatatctgc ggcacctccg agaatgacga ccagttgctc ttctgtgatg 1081 actgcgatcg tggctaccac atgtactgtc tcaccccgtc catgtctgag ccccctgaag 1141 gaagttggag ctgccacctg tgtctggacc tgttgaaaga gaaagcttcc atctaccaga 1201 accagaactc ctcttgatgt ggccacccac ctgctccccg acatatctaa ggctgtttct 1261 ctcctccact tcatatttca tacccatctt tcccttcttc ctcctctcct tcacaaatcc 1321 agagaacctt ggggtggttg tgccakcctg cctttggcag ctgcaagctg aggtggcagc 1381 tctgaccacc tctggcccca ggccctcagg gagaaaggag caacacactg cccctaggcg 1441 tgcgtgtggc ccagtttctc tctgctctcc attaagtgca ttcactctgc ttgccttggg 1501 cccagcccct ggtgatcaca gggttcaaac agtgtcctcc tagaaagagt gggagagcag 1561 ctcacttctc tgtgttctgc ctcccctctg gtctccagag ttttcctgtc ctctagaggc 1621 aagccaggcc agggagctgg gagcgagcaa gctgaggcca cgtccacaag gagcttttca 1681 tgcccctgtg ccgcatagcc tcacctcttt cctccagagt ggctctctgc ggccctgtgt 1741 tcctgctaca gagtgttctt ttctggagtc aggatgttct cggtcaccct cctggttctg 1801 ccctgtccca ttccacccca ccccaggggg aacagtagct tcaccttgtt attcccattg 1861 ctctcctggc tcactcttac ggtcggtctc cagtgactga agcattcccc acccttggaa 1921 tttctcatct tctgcctccc ttcctactcc ttttggtttt gtggggagag gggaaggatc 1981 agggggccag gccagcagct cgggggccac aaggagatgg ataatgtgcc tgttttttaa 2041 cacaacaaaa aagcctacct ccaaaatccc ctttttgttc ttcctggacc tgggcattca 2101 gcctcctgct cttaactgaa ttgggagcct ctgccacctg ccccgtgtat cctggctctc 2161 agctcatggg gaagccacat agacatccct ttcttccctt gcacgctcgc tagcagctgg 2221 taaggtcttc acaccctgat tcctcaagtt ttctgcttag tggcactgac attaagtagt 2281 ggggggacag tccatgccag gacaccctgg agtagccttc ccccttggcc gtgggcaggc 2341 cctaactcac tgtcgctttg gagttgaggt gtcttttttt tttctttctt tagttcctgt 2401 attctaaaca ttagtaaaaa taaatgtttt tacacagaaa // LOCUS HSAF001435 1889 bp mRNA PRI 17-OCT-1997 DEFINITION Human clone iota unknown protein mRNA, complete cds. ACCESSION AF001435 NID g2529708 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1889) AUTHORS Guru,S.C., Agarwal,S.K., Manickam,P., Olufemi,S.-E., Crabtree,J.S., Weisemann,J.M., Kester,M., Kim,Y.S., Emmert-Buck,M.R., Liotta,L.A., Spiegel,A.M., Boguski,M., Roe,B.A., Collins,F.S., Burns,A.L., Marx,S.J. and Chandrasekharappa,S.C. TITLE A transcript map for the 2.8-Mb region containing the multiple endocrine neoplasia type 1 locus JOURNAL Genome Res. 7 (7), 725-735 (1997) MEDLINE 97397562 REFERENCE 2 (bases 1 to 1889) AUTHORS Chandrasekharappa,S.C. TITLE Direct Submission JOURNAL Submitted (29-APR-1997) LGT, NHGRI, Bldg 49, Room 3A76, Bethesda, MD 20894, USA FEATURES Location/Qualifiers source 1..1889 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11q13" /clone="iota" CDS 57..515 /codon_start=1 /evidence=not_experimental /product="unknown" /db_xref="PID:g2529709" /translation="MSRQAKDDFLRHYTVSDPRTHPKGYTEYKVTAQFISKKDPEDVK EVVVWKRYSDFRKLHGDLAYTHRNLFRRLEEFPAFPRAQVFGRFEASVIEERRKGAED LLRFTVHIPALNNSPQLKEFFRGGEVTRPLEVSRDLHILPPPLIPTPPPG" BASE COUNT 434 a 564 c 495 g 393 t 3 others ORIGIN 1 gcacgaggcg gcgaggaggt ggaggccggc gctccgctcc gctccagctc ggtttcatgt 61 cccgccaggc gaaggatgac ttcctgcggc actacacagt gtcggacccc aggactcacc 121 ccaagggcta caccgagtac aaagtaaccg cgcagttcat ctcaaagaag gacccagagg 181 atgtcaaaga ggtggtggtc tggaagcggt acagcgactt ccgcaagctg catggagacc 241 tggcctacac ccaccgcaac ctcttccgcc gcctcgagga gttccccgct ttcccccggg 301 cccaggtgtt tggccggttt gaagcctcag tgatcgagga gcggcgaaag ggggcagagg 361 acctgcttcg cttcactgtg cacatacctg cgctcaacaa cagcccccag ctcaaggagt 421 tcttccgggg tggggaggtg acccgaccct tggaggtgtc cagggaccta cacatcctgc 481 caccccctct gatccccacc ccgccccctg gatgaccccc ggctatccca actgctccct 541 gcagaaaagg aggggcctcg aggaaattgg aggtgccagt ggacccccca ccatccagcc 601 ctgcccagga rgccctggat ctcctcttta actgtgagag caccgaggaa gcatctggtt 661 cccctgcccg aggccccctc accgaggctg aacttgccct cttcgacccc ttctccaagg 721 aagaaggcgc agcccccagc cccacccatg tggctgagct ggcaacgatg gaggtggagt 781 ctgcaaggct ggaccaggaa ccctgggagc caggagggca ggaggaggaa cawgatgggg 841 aaggagggcc cacccctgcc tacctaagcc aggccacaga gctcatcacc caggccctgc 901 gggatgagaa ggcaggcgct tacgctgctg cactccaggg ctatcgagac ggcgtgcacg 961 tcttgcttca gggagtcccc agtgacccgt tgcctgcccg ccaggaaggt gtgaagaaga 1021 aggcagctga gtacctgaag cgggcagagg agatcctgcg cctgcacctg tctcaactcc 1081 caccctaaca gggagtgggc cattccctgg gactctcact cctgcactgc cagccccttt 1141 tcctctcccc agggcctggc cctacctcct ggtcttgtaa ttacaggagc catttctgta 1201 ggtaactgga ccaagaatga gaaaaataat gaattcttag ctccctgatt acacctgcca 1261 ccttggaatc caggactcac atttctgacc ctgcctgtct ttttggggtt tttttgagtt 1321 ggagtctcgc tgtgtcgccc agactggagt gcagtggtgg gatcacggct cattgcaacc 1381 tccacctccc aggttcaagc agttttcctg tytcagcctc cccagtagct gagattgcag 1441 gcacatgcca ccacgcccag ctaatatttt gtatttttca gtagggacgg ggttacacca 1501 tgttggccag gctggtctcg aactcctgac ctcaagtgat ccacccgcct cagtctccca 1561 aagtgctgag attacaggca tgagtcacta cgcccggccc atgtctgtct gtcttgatgt 1621 gtgagcagca gctgtggtca ttaaaccatt agtttacccc tctagaactg gggtctgcaa 1681 actcccacct gcagccaaat ctggcccacc tcctttttaa tgtaagggct gtgagagtgg 1741 tttttacttt ttttaatgat taaaaaaatc aaaataatat tctgtgacaa tgacaggtga 1801 aatttatatg tgacaagtga aaattatatg aaatttaaga gtccataaat aaaatttgtt 1861 ggaacacaaa aaaaaaaaaa aaaaaaaaa // LOCUS HSAF001436 1976 bp mRNA PRI 17-OCT-1997 DEFINITION Human clone zeta unknown protein mRNA, complete cds. ACCESSION AF001436 NID g2529710 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1976) AUTHORS Guru,S.C., Agarwal,S.K., Manickam,P., Olufemi,S.-E., Crabtree,J.S., Weisemann,J.M., Kester,M., Kim,Y.S., Emmert-Buck,M.R., Liotta,L.A., Spiegel,A.M., Boguski,M., Roe,B.A., Collins,F.S., Burns,A.L., Marx,S.J. and Chandrasekharappa,S.C. TITLE A transcript map for the 2.8-Mb region containing the multiple endocrine neoplasia type 1 locus JOURNAL Genome Res. 7 (7), 725-735 (1997) MEDLINE 97397562 REFERENCE 2 (bases 1 to 1976) AUTHORS Chandrasekharappa,S.C. TITLE Direct Submission JOURNAL Submitted (29-APR-1997) LGT, NHGRI, Bldg 49, Room 3A76, Bethesda, MD 20894, USA FEATURES Location/Qualifiers source 1..1976 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11q13" /clone="zeta" /note="transcripts on northern blots are 2 and 2.2 kb" CDS 451..1083 /codon_start=1 /evidence=not_experimental /product="unknown" /db_xref="PID:g2529711" /translation="MSTKVPIYLKRGSRKGKKEKLRDLLSSDMISPPLGDFRHTIHIG SGGGSDMFGDISFLQGKFHLLPGTMVEGPEEDGTFDLPFQFTRTATVCGRELPDGPSP LLKNAISLPVIGGPQALTLPTAQAPPKPPRLHLETPQPSPQEGGSVDIWRIPETGSPN SGLTPESGAEEPFLSNASSLLSLHVDLGPSILDDVLQIMDQDLDSMQIPT" BASE COUNT 358 a 641 c 564 g 413 t ORIGIN 1 cactctgtaa gttcaccgcc ggtcgggtcc ggccgcagcg ctgtccagct cctgagacct 61 tgctgtccgc cggtctgccg tctgcgcgcc tcacgctcct cagccctgga ccggggacaa 121 gtaaccctcg gtgacaagac caaagtgcac tgctgcccac acagttccta cctttctggc 181 ttcaattctt cagaagagtt tgccgtcctt tggggagaac gtgatttttg ttatctcagc 241 ccactgactt cattgatctc taatcttttt taattccttg ggccaacttt gttcgtgccc 301 ccacactgta gccagaagcc cgttggcgag ctctggcacc tgcaaaccac cccgtggaac 361 gagtgtttcc tctggctgag ggttggagag gaggtgtggt ctcagcaggc ggcccgtagc 421 ctcacagcca ggcctggtgg tgaggtcacc atgtccacca aggtgcccat ctatctgaag 481 cgtggcagtc gcaagggcaa gaaggagaag cttcgggacc tgctgtcctc ggacatgatc 541 agcccaccgc tgggggactt ccgccacacc attcatattg gcagtggcgg cggcagtgac 601 atgtttggcg acatctcctt cctgcagggc aagttccacc tcctgccggg gaccatggtg 661 gaggggcctg aagaagatgg caccttcgac ctccccttcc agttcacccg caccgccacc 721 gtgtgtgggc gggagctccc ggacggccca tcccctctgc tcaagaacgc catctccctc 781 ccggttatcg gtggacccca ggctctcacc ctgcccacag cccaggctcc acccaagccc 841 cctcgcctgc acctggagac ccctcagcct tccccacagg agggagggag tgtggacatc 901 tggaggattc cagagactgg ctcccccaac agtggactga ccccggagtc aggggccgag 961 gagcccttcc tgtccaatgc cagctccctg ctgtccctgc acgtggacct ggggccttcc 1021 atcctggatg atgtcctgca gatcatggat caggacctgg acagcatgca gatccccaca 1081 taggacacga ggctgcctag gctggggtcc caggtggggc ccagccagga ggtggggtgt 1141 ggacccggcc ctggcggcgg agtcagggtc ccaagatccc acctgtatgg tcgctggcca 1201 gtgattctcc ttctgagccg tgtttcccct ctccctccct ctccacgtgg gcagggcagg 1261 ccccatcgct ttcctctgat aaccacatgg acacatcctg aagtcagccc aggcgccctg 1321 agcatcttgg ggcacctgga ccccatcaca atactccttc ttccttcagg tccctgggtg 1381 aaggctttgc tgaaaccgac cccccttttc acgtcccttc tgcctctgcc ccgttggatg 1441 ccctgactgg gggcagggga agagacaggg cacagctggc cacagggctc agccactgag 1501 caggctgttc cgggcctttg gctttgcatc ctggacgggg agtgtcctgt cagggaccag 1561 atgtgtcctg cctcatccct agctccaatc ccttccccac gtgaccgggg attctggttg 1621 caataaaaca tgctgctgct ggtggcggag ctccctgtcc ctttgcccca ggtttcctcc 1681 cggaggcaga cagtctccca gagctgaggg cttgcctctg gagaccccag ccccagaggg 1741 ctttgtggag gacaggcctt gccctcaaga atgtcgtacc tgacgctgag cctgtcatga 1801 gaatgcaaca ggagcaaacc aagtgttgct gtgacattga ttcagatgtt tggcaagagg 1861 tggctgagca ctggggtggg cttggcactg tgccaagcct ggggccaatc cctgcccagt 1921 cagctggggt ctggtggggg acacccaaga ataaaagaat aaccacaaag tgtgca // LOCUS HSAF001550 173882 bp DNA PRI 22-AUG-1997 DEFINITION Homo sapiens chromosome 16 BAC clone CIT987SK-334D11 complete sequence. ACCESSION AF001550 NID g2335061 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 173882) AUTHORS Adams,M.D., Loftus,B.J., Zhou,L., Phillips,C., Kerlavage,A.R., Kim,U.J. and Venter,J.C. TITLE Chromosome 16 BAC clone CIT987SK -334D11 complete sequence JOURNAL Unpublished REFERENCE 2 (bases 1 to 173882) AUTHORS Adams,M.D. TITLE Direct Submission JOURNAL Submitted (29-APR-1997) The Institute for Genomic Research, 9712 Medical Center Dr., Rockville, MD 20850, USA REFERENCE 3 (bases 1 to 173882) AUTHORS Adams,M.D. TITLE Direct Submission JOURNAL Submitted (19-AUG-1997) The Institute for Genomic Research, 9712 Medical Center Dr., Rockville, MD 20850, USA REFERENCE 4 (bases 1 to 173882) AUTHORS Adams,M.D. TITLE Direct Submission JOURNAL Submitted (22-AUG-1997) The Institute for Genomic Research, 9712 Medical Center Dr., Rockville, MD 20850, USA COMMENT BAC clone CIT987SK-334D11 is located in chromosome 16. Genes were identified by a combination of five methods: XGRAIL (available by anonymous ftp from arthur.epm.ornl.gov), Genefinder (available by anonymous ftp from colin@u.washington.edu), GENSCAN (available using the e-mail server at genscan@gnomic.stanford.edu), searches of the EST database at TIGR (http://www.tigr.org/tdb/hcd/hcd.html) and searches against a peptide database. Repeats were identified using RepeatMasker (Smit, A. and Green, P. unpublished, http://ftp.genome.washington.edu/rm/RepeatMasker.html). FEATURES Location/Qualifiers source 1..173882 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" /map="16p12" /clone="CIT987SK-334D11" repeat_region 3..317 /rpt_family="L1MB5" repeat_region complement(374..401) /rpt_family="AluSq" repeat_region 488..795 /rpt_family="AluJb" repeat_region 876..1167 /rpt_family="AluSp" repeat_region complement(1200..1438) /rpt_family="MIR" repeat_region complement(1942..2073) /rpt_family="AluJb" repeat_region complement(2097..2403) /rpt_family="AluSx" repeat_region 2827..3160 /rpt_family="AluSx" repeat_region 3174..3306 /rpt_family="FLAM_C" repeat_region 3320..3371 /rpt_family="MER5A" repeat_region 3548..3678 /rpt_family="AluJo" repeat_region 4269..4566 /rpt_family="AluSg" repeat_region complement(4837..5137) /rpt_family="AluSq" repeat_region 5382..5683 /rpt_family="AluSx" repeat_region 5821..5857 /rpt_family="MER5A" repeat_region 5946..6242 /rpt_family="AluSq" repeat_region 6249..6376 /rpt_family="MIR" repeat_region complement(6745..6819) /rpt_family="AluJo" repeat_region complement(6902..7181) /rpt_family="AluJb" repeat_region complement(7192..7498) /rpt_family="AluSq" repeat_region complement(7841..7894) /rpt_family="MER5B" repeat_region 8357..8563 /rpt_family="AluSx" repeat_region 8949..9025 /rpt_family="MIR" repeat_region 9183..9484 /rpt_family="AluSx" repeat_region 9499..9787 /rpt_family="AluSp" repeat_region 9789..9920 /rpt_family="(TAAA)n" repeat_region 9789..9827 /rpt_family="(CATA)n" repeat_region complement(10069..10314) /rpt_family="AluJb" repeat_region 10621..10992 /rpt_family="MLT1A1" repeat_region 11548..12077 /rpt_family="MLT1F" repeat_region 12083..12128 /rpt_family="MIR2" repeat_region 12250..12545 /rpt_family="AluSx" repeat_region 13688..13928 /rpt_family="AluSq" repeat_region complement(13931..14160) /rpt_family="L1MA7" repeat_region complement(14174..14473) /rpt_family="AluSx" repeat_region complement(14639..14927) /rpt_family="AluSx" repeat_region complement(15831..15984) /rpt_family="AT_rich" repeat_region complement(15850..15981) /rpt_family="AluSx" repeat_region 16363..16660 /rpt_family="AluSx" repeat_region complement(16966..17099) /rpt_family="MER5A" repeat_region 17358..17660 /rpt_family="AluSp" repeat_region 18490..18690 /rpt_family="MIR" repeat_region complement(18736..19008) /rpt_family="AluY" repeat_region 19367..19665 /rpt_family="AluSq" repeat_region 19829..20875 /rpt_family="pTR5" repeat_region 20500..21241 /rpt_family="pTR5" repeat_region 21536..21831 /rpt_family="AluSx" repeat_region complement(21954..22130) /rpt_family="MIR" repeat_region complement(22362..22662) /rpt_family="AluSx" repeat_region 22756..23055 /rpt_family="AluSg" repeat_region complement(23371..23580) /rpt_family="MLT1A1" repeat_region complement(23581..23635) /rpt_family="(TAAAA)n" repeat_region complement(23637..23931) /rpt_family="AluJo" repeat_region complement(23932..24040) /rpt_family="MLT1A1" repeat_region complement(23932..24069) /rpt_family="MLT1A2" repeat_region 24406..24436 /rpt_family="(CA)n" repeat_region complement(24634..24672) /rpt_family="MIR2" repeat_region complement(24750..25041) /rpt_family="AluSx" repeat_region 25255..25458 /rpt_family="MER20" repeat_region 25845..25875 /rpt_family="MIR" repeat_region 26391..26439 /rpt_family="(CA)n" repeat_region complement(27654..27688) /rpt_family="AT_rich" mRNA join(28595..28690,37862..38030,41435..41573,46900..46981) /gene="334D11.1" gene 28595..46981 /gene="334D11.1" CDS join(28595..28690,37862..38030,41435..41573,46900..46981) /gene="334D11.1" /codon_start=1 /product="unknown protein" /db_xref="PID:g2098822" /translation="MAKEEPQSISRDLQELQKKLSLLIDSFQNNSKVVAFMKSPVGQY LDSHPFLAFTLLVFIVMSAVPVGFFLLIVVLTTLAALLGVIILEGLVISVGGFSLLCI LCGLSFVSLAMSGMMIASYVVVSSLISCWFSPRPLTQQNTSCDFLPAMKSAEFEGLYQ E" repeat_region complement(29193..29287) /rpt_family="MER33" repeat_region 29288..29580 /rpt_family="AluSx" repeat_region complement(29581..29787) /rpt_family="MER33" repeat_region 30776..31074 /rpt_family="AluSx" repeat_region complement(31173..31217) /rpt_family="MIR2" repeat_region complement(31279..31413) /rpt_family="AluSq" repeat_region 31597..31653 /rpt_family="MIR" repeat_region complement(31654..31961) /rpt_family="AluJo" repeat_region 31967..32066 /rpt_family="MIR" repeat_region 32313..32417 /rpt_family="MIR" repeat_region 32591..32732 /rpt_family="AluSx" repeat_region 32734..32846 /rpt_family="AluSg" repeat_region complement(33175..33269) /rpt_family="MER33" repeat_region complement(33270..33573) /rpt_family="AluSx" repeat_region complement(33575..33688) /rpt_family="MER33" repeat_region complement(33847..33935) /rpt_family="MER3" repeat_region complement(35105..35134) /rpt_family="POLY_A" repeat_region 35154..35451 /rpt_family="AluSq" repeat_region complement(35458..35733) /rpt_family="AluSx" repeat_region 35913..36057 /rpt_family="(TAA)n" repeat_region 36067..36107 /rpt_family="AT_rich" repeat_region 36117..36249 /rpt_family="(TA)n" repeat_region complement(36250..36372) /rpt_family="(TAAA)n" repeat_region complement(36286..36422) /rpt_family="(TA)n" repeat_region 36404..36541 /rpt_family="(TAA)n" repeat_region complement(36731..37033) /rpt_family="AluJo" repeat_region complement(37341..37375) /rpt_family="AluSp" repeat_region complement(37376..37459) /rpt_family="AluSg" repeat_region complement(37460..37727) /rpt_family="AluSx" repeat_region complement(38106..38228) /rpt_family="FLAM_C" repeat_region 38266..38510 /rpt_family="MIR" repeat_region complement(38526..38590) /rpt_family="MIR2" repeat_region 38905..38967 /rpt_family="(TGAA)n" repeat_region complement(39166..39461) /rpt_family="AluSx" repeat_region 39508..39588 /rpt_family="FLAM_A" repeat_region 39591..39891 /rpt_family="AluSq" repeat_region 39916..40073 /rpt_family="AluJo" repeat_region complement(40075..40217) /rpt_family="MER5A" repeat_region 40857..41155 /rpt_family="AluSx" repeat_region complement(41613..41916) /rpt_family="AluJo" repeat_region complement(42653..42721) /rpt_family="MIR" repeat_region complement(43338..43458) /rpt_family="AluJo" repeat_region 43809..44082 /rpt_family="AluJo" repeat_region 44091..44184 /rpt_family="(TAAAA)n" repeat_region 44190..44343 /rpt_family="AluJo" repeat_region 44344..44374 /rpt_family="AT_rich" repeat_region complement(45045..45342) /rpt_family="AluSx" repeat_region 46202..46437 /rpt_family="THE1B" repeat_region 46499..46642 /rpt_family="MIR" repeat_region 46726..46770 /rpt_family="MIR" STS 47124..47378 /db_xref="dbSTS:G06024" repeat_region complement(47655..47964) /rpt_family="AluSq" repeat_region complement(48802..49085) /rpt_family="MLT1F" repeat_region complement(48804..49017) /rpt_family="MLT1E" repeat_region complement(49090..49262) /rpt_family="MLT1E" repeat_region 49623..49838 /rpt_family="MER1B" repeat_region 49699..50096 /rpt_family="MER1A" repeat_region 50275..50569 /rpt_family="AluSx" repeat_region 50707..50853 /rpt_family="MER34" repeat_region 50955..51103 /rpt_family="L1PA7" repeat_region 51100..51533 /rpt_family="L1PA13" repeat_region 52225..52583 /rpt_family="THE1C" repeat_region 52610..52718 /rpt_family="MER34" repeat_region 53060..53353 /rpt_family="AluSx" repeat_region 53624..53916 /rpt_family="AluJb" repeat_region complement(54209..54503) /rpt_family="AluJb" repeat_region complement(54567..54659) /rpt_family="MIR" repeat_region 54679..54974 /rpt_family="AluSx" repeat_region 55031..55052 /rpt_family="GC_rich" repeat_region 55068..55100 /rpt_family="MER20" repeat_region complement(55101..55365) /rpt_family="AluJb" repeat_region 55366..55531 /rpt_family="MER20" repeat_region 56414..56676 /rpt_family="AluY" repeat_region 56677..56855 /rpt_family="AluSp" repeat_region complement(56859..56954) /rpt_family="MER5A" repeat_region complement(56977..57091) /rpt_family="MIR" repeat_region complement(58214..58471) /rpt_family="MLT1B" repeat_region complement(58472..58783) /rpt_family="AluSx" repeat_region complement(58786..58879) /rpt_family="MLT1B" repeat_region complement(59217..59427) /rpt_family="MLT1A1" repeat_region complement(59298..59478) /rpt_family="MLT1B" repeat_region complement(60090..60386) /rpt_family="AluSc" repeat_region complement(60725..61209) /rpt_family="MLT1D" repeat_region complement(61273..61572) /rpt_family="AluSx" repeat_region complement(61842..61998) /rpt_family="MIR" repeat_region 62577..62631 /rpt_family="MIR" repeat_region complement(62680..62839) /rpt_family="MLT1D" repeat_region complement(62897..63192) /rpt_family="AluSq" repeat_region complement(63204..63329) /rpt_family="MLT1D" repeat_region 63702..63942 /rpt_family="AluSq" mRNA complement(join(64877..65047,65191..65274,66204..66287, 66995..67091,67168..67303,68794..68983,69131..69255, 69357..69448,69529..69716,70550..70676,71455..71636, 71733..71829,72845..73009,73156..73200,74263..74415, 77056..77150,77534..77617,78732..78820,78911..78972)) /gene="334D11.2" /product="zona pellucida ZP2" gene complement(64877..78972) /gene="334D11.2" CDS complement(join(64905..65047,65191..65274,66204..66287, 66995..67091,67168..67303,68794..68983,69131..69255, 69357..69448,69529..69716,70550..70676,71455..71636, 71733..71829,72845..73009,73156..73200,74263..74415, 77056..77150,77534..77617,78732..78820,78911..78972)) /gene="334D11.2" /codon_start=1 /product="zona pellucida ZP2" /db_xref="PID:g2098823" /translation="MACRQRGGSWSPSGWFNAGWSTYRSISLFFALVTSGNSIDVSQL VNPAFPGTVTCDEREITVEFPSSPGTKKWHASVVDPLGLDMPNCTYILDPEKLTLRAT YDNCTRRVHGGHQMTIRVMNNSAALRHGAVMYQFFCPAMQVEETQGLSASTICQKDFM SFSLPRVFSGLADDSKGTKVQMGWSIEVGDGARAKTLTLPEAMKEGFSLLIDNHRMTF HVPFNATGVTHYVQGNSHLYMVSLKLTFISPGQKVIFSSQAICAPDPVTCNATHMTLT IPEFPGKLKSVSFENQNIDVSQLHDNGIDLEATNGMKLHFSKTLLKTKLSEKCLLHQF YLASLKLTFLLRPETVSMVIYPECLCESPVSIVTGELCTQDGFMDVEVYSYQTQPALD LGTLRVGNSSCQPVFEAQSQGLVRFHIPLNGCGTRYKFEDDKVVYENEIHALWTDFPP SKISRDSEFRMTVKCSYSRNDMLLNINVESLTPPVASVKLGPFTLILQSYPDNSYQQP YGENEYPLVRFLRQPIYMEVRVLNRDDPNIKLVLDDCWATSTMDPDSFPQWNVVVDGC AYDLDNYQTTFHPVGSSVTHPDHYQRFDMKAFAFVSEAHVLSSLVYFHCSALICNRLS PDSPLCSVTCPVSSRHRRATGATEAEKMTVSLPGPILLLSDDSSFRGVGSSDLKASGS SGEKSRSETGEEVGSRGAMDTKGHKTAGDVGSKAVAAVAAFAGVVATLGFIYYLYEKR TVSNH" repeat_region 65538..65836 /rpt_family="AluSq" repeat_region 66458..66763 /rpt_family="AluSq" repeat_region 67581..67878 /rpt_family="AluSq" repeat_region 68174..68223 /rpt_family="MIR" repeat_region 68186..68231 /rpt_family="MIR2" repeat_region 68342..68642 /rpt_family="AluSx" repeat_region complement(72421..72701) /rpt_family="AluSx" repeat_region complement(73443..73721) /rpt_family="AluSp" repeat_region 73867..73973 /rpt_family="MIR2" repeat_region complement(74003..74128) /rpt_family="MIR" repeat_region 74929..75050 /rpt_family="AluJo" repeat_region 75062..75098 /rpt_family="(CA)n" repeat_region 75099..75298 /rpt_family="AluJb" repeat_region 75316..75356 /rpt_family="(CAAAA)n" repeat_region 75368..75649 /rpt_family="AluJb" repeat_region 75879..75965 /rpt_family="MER5A" repeat_region complement(76403..76601) /rpt_family="MER20" repeat_region complement(76616..76893) /rpt_family="AluSq" repeat_region complement(78000..78298) /rpt_family="AluSx" repeat_region 78302..78322 /rpt_family="AT_rich" repeat_region complement(79355..79646) /rpt_family="AluSx" repeat_region 80425..80468 /rpt_family="MADE1" repeat_region complement(80485..80769) /rpt_family="AluSq" repeat_region 80771..80822 /rpt_family="MADE1" repeat_region 80916..81041 /rpt_family="MER5A" repeat_region 81870..82120 /rpt_family="L1PA9" repeat_region 82144..82440 /rpt_family="AluSq" repeat_region complement(83108..83406) /rpt_family="AluY" repeat_region complement(83407..83515) /rpt_family="AluY" repeat_region 83564..83694 /rpt_family="FLAM_A" repeat_region complement(84409..84720) /rpt_family="AluSq" repeat_region 85005..85306 /rpt_family="AluSx" repeat_region complement(85910..86209) /rpt_family="AluSq" repeat_region 86210..86311 /rpt_family="LTR9" repeat_region 86414..86536 /rpt_family="FLAM_C" repeat_region complement(86554..86845) /rpt_family="AluSx" repeat_region 86848..86876 /rpt_family="AT_rich" repeat_region complement(87043..87091) /rpt_family="MIR" repeat_region complement(87598..87893) /rpt_family="AluSx" repeat_region complement(87894..88844) /rpt_family="L1" repeat_region complement(88642..88922) /rpt_family="MER25" repeat_region 89036..89082 /rpt_family="(TA)n" repeat_region complement(89212..90173) /rpt_family="LTR5" repeat_region 90760..91064 /rpt_family="AluSp" repeat_region complement(91928..92226) /rpt_family="AluSq" repeat_region 92600..92687 /rpt_family="MLT1B" repeat_region 92692..93045 /rpt_family="THE1C" repeat_region 93041..93385 /rpt_family="MLT1B" repeat_region complement(93420..93444) /rpt_family="AT_rich" repeat_region complement(93608..93866) /rpt_family="MIR" repeat_region complement(94699..94917) /rpt_family="AluSp" repeat_region 97198..97223 /rpt_family="AT_rich" repeat_region complement(97734..98023) /rpt_family="AluSx" repeat_region complement(98800..99099) /rpt_family="AluSc" repeat_region complement(99181..99223) /rpt_family="POLY_A" repeat_region 99914..100785 /rpt_family="L1MB8" repeat_region 101793..101813 /rpt_family="AT_rich" repeat_region 101907..101946 /rpt_family="(CA)n" repeat_region complement(102170..102245) /rpt_family="MIR2" repeat_region complement(102285..102571) /rpt_family="AluJo" repeat_region complement(103426..103826) /rpt_family="MSTA" repeat_region complement(104004..104135) /rpt_family="AluJo" repeat_region complement(104447..104745) /rpt_family="AluSx" repeat_region complement(104800..105197) /rpt_family="MLT1A1" repeat_region 105305..105614 /rpt_family="AluSx" repeat_region complement(105618..105768) /rpt_family="L1ME3A" repeat_region complement(105853..105935) /rpt_family="(TA)n" repeat_region 106017..106043 /rpt_family="AT_rich" repeat_region 106486..106728 /rpt_family="L1PA4" repeat_region 106730..107021 /rpt_family="AluSx" repeat_region 107023..107290 /rpt_family="L1PA13" repeat_region 107276..107406 /rpt_family="L1PA13" repeat_region complement(107718..107932) /rpt_family="AluSq" repeat_region complement(108189..108213) /rpt_family="AT_rich" repeat_region complement(108218..108518) /rpt_family="AluSx" repeat_region complement(108521..108785) /rpt_family="L1PA16" repeat_region complement(108840..109136) /rpt_family="AluSx" repeat_region complement(109144..109337) /rpt_family="L1MA8" repeat_region 109342..109475 /rpt_family="AluJo" repeat_region complement(109487..109620) /rpt_family="L1MA8" repeat_region complement(109623..109910) /rpt_family="AluSc" repeat_region complement(109918..110131) /rpt_family="L1MA9" repeat_region complement(109935..110504) /rpt_family="L1MB8" repeat_region complement(110995..111297) /rpt_family="AluY" repeat_region 111510..111859 /rpt_family="L1PA8" repeat_region complement(112288..112341) /rpt_family="(GAAAA)n" repeat_region complement(112899..113189) /rpt_family="AluSx" repeat_region complement(113307..113435) /rpt_family="MIR" repeat_region 113453..113531 /rpt_family="LTR10" repeat_region complement(114491..114772) /rpt_family="AluSg" repeat_region complement(114778..114846) /rpt_family="MER5A" repeat_region complement(115200..115260) /rpt_family="MIR2" repeat_region complement(115628..115747) /rpt_family="AluJo" repeat_region complement(115748..115856) /rpt_family="AluJo" repeat_region 115873..116163 /rpt_family="AluSx" repeat_region complement(116164..116200) /rpt_family="AT_rich" repeat_region 116628..116919 /rpt_family="AluJb" repeat_region complement(118525..118830) /rpt_family="AluJo" repeat_region 119204..119349 /rpt_family="MLT1B" repeat_region complement(119358..119657) /rpt_family="AluSx" repeat_region 119658..119912 /rpt_family="MLT1B" repeat_region 120179..120467 /rpt_family="AluSx" repeat_region 121819..122028 /rpt_family="MER20" repeat_region complement(122323..122624) /rpt_family="AluSp" repeat_region complement(123633..123921) /rpt_family="AluSx" repeat_region 124572..124609 /rpt_family="MIR" repeat_region complement(124852..124991) /rpt_family="AluJo" repeat_region 125402..125453 /rpt_family="L1ME3" repeat_region 125464..125765 /rpt_family="AluSq" STS complement(125512..125729) /db_xref="dbSTS:G03524" repeat_region 125767..125838 /rpt_family="L1MB7" repeat_region 125767..125821 /rpt_family="L1MB6" mRNA complement(join(125943..126270,128679..128763, 129462..129583,134979..135162,137215..137316, 142957..143019,144856..145009,145507..145706)) /gene="334D11.3" /product="mu-crystallin" gene complement(125943..145706) /gene="334D11.3" STS 125958..126107 /db_xref="dbSTS:G23020" STS 126058..126182 /db_xref="dbSTS:G11235" CDS complement(join(126206..126270,128679..128763, 129462..129583,134979..135162,137215..137316, 142957..143019,144856..145009,145507..145676)) /gene="334D11.3" /codon_start=1 /product="mu-crystallin" /db_xref="PID:g2098824" /translation="MSRVPAFLSAAEVEEHLRSSSLLIPPLETALANFSSGPEGGVMQ PVRTVVPVTKHRGYLGVMPAYSAAEDALTTKLVTFYEDRGITSVVPSHQATVLLFEPS NGTLLAVMDGNVITAKRTAAVSAIATKFLKPPSSEVLCILGAGVQAYSHYEIFTEQFS FKEVRIWNRTKENAEKFADTVQGEVRVCSSVQEAVAGADVIITVTLATEPILFGEWVK PGAHINAVGASRPDWRELDDELMKEAVLYVDSQEAALKESGDVLLSGAEIFAELGEVI KGVKPAHCEKTTVFKSLGMAVEDTVAAKLIYDSWSSGK" repeat_region complement(126520..126832) /rpt_family="AluSx" repeat_region complement(127731..128024) /rpt_family="AluSx" repeat_region complement(128414..128539) /rpt_family="MIR" repeat_region 128861..128889 /rpt_family="AT_rich" repeat_region 129761..130062 /rpt_family="AluSx" repeat_region complement(130465..130544) /rpt_family="AluJo" repeat_region 130545..130845 /rpt_family="AluSx" repeat_region complement(130846..130935) /rpt_family="AluJo" repeat_region 132269..132483 /rpt_family="MLT1E" repeat_region 132965..133105 /rpt_family="MIR2" repeat_region 133293..133347 /rpt_family="MIR" repeat_region 133367..133667 /rpt_family="AluSx" repeat_region 133680..133797 /rpt_family="MIR" repeat_region 134144..134447 /rpt_family="AluSx" repeat_region complement(134524..134822) /rpt_family="AluSx" repeat_region 135469..135594 /rpt_family="MIR" repeat_region 136289..136588 /rpt_family="AluSx" repeat_region complement(136589..136629) /rpt_family="MADE1" repeat_region complement(137533..137669) /rpt_family="MIR" repeat_region 137671..137693 /rpt_family="AT_rich" repeat_region complement(137707..138008) /rpt_family="AluSx" repeat_region complement(138497..138720) /rpt_family="MER20" repeat_region complement(139289..139587) /rpt_family="AluSx" repeat_region complement(140077..140392) /rpt_family="AluSq" repeat_region 140421..140541 /rpt_family="MIR2" repeat_region complement(141930..141986) /rpt_family="(TAAA)n" repeat_region complement(142036..142330) /rpt_family="AluSx" repeat_region 142379..142463 /rpt_family="MIR2" repeat_region 142611..142703 /rpt_family="MIR" repeat_region 143126..143326 /rpt_family="MIR" repeat_region complement(143539..143573) /rpt_family="MER5B" repeat_region complement(143651..143692) /rpt_family="MIR" repeat_region 143936..144085 /rpt_family="MIR" repeat_region 144167..144403 /rpt_family="MIR" repeat_region 145163..145381 /rpt_family="MIR" repeat_region complement(145417..145511) /rpt_family="(GGA)n" repeat_region complement(146577..146822) /rpt_family="MIR" repeat_region 147320..147343 /rpt_family="(GA)n" repeat_region 147525..147797 /rpt_family="AluSx" repeat_region 147822..148031 /rpt_family="MER4B" repeat_region 148252..148529 /rpt_family="AluSx" repeat_region complement(148530..148560) /rpt_family="(GA)n" repeat_region complement(148560..148685) /rpt_family="(TA)n" repeat_region complement(148688..148954) /rpt_family="AluSx" repeat_region 149405..149769 /rpt_family="THE1B" repeat_region complement(150290..150389) /rpt_family="(TAA)n" repeat_region complement(150853..150901) /rpt_family="AT_rich" repeat_region 151282..151309 /rpt_family="GC_rich" repeat_region complement(151724..152590) /rpt_family="L1MB6" repeat_region complement(151998..152594) /rpt_family="L1PA12" repeat_region complement(153646..153951) /rpt_family="AluSx" repeat_region 154234..154534 /rpt_family="AluY" repeat_region complement(154781..155451) /rpt_family="MER21B" repeat_region complement(155514..155581) /rpt_family="MER21B" repeat_region complement(155630..157034) /rpt_family="L1" repeat_region complement(157143..157275) /rpt_family="LTR12" repeat_region 157297..157970 /rpt_family="pTR5" repeat_region complement(159102..159281) /rpt_family="L1" repeat_region complement(159102..159238) /rpt_family="L1MD2" repeat_region 159437..159636 /rpt_family="L1MB6" repeat_region 159762..160025 /rpt_family="L1" repeat_region 160131..160208 /rpt_family="(TAGA)n" repeat_region 160355..160484 /rpt_family="(TAGA)n" repeat_region complement(161769..162119) /rpt_family="L1PA8" repeat_region complement(162120..162414) /rpt_family="AluY" repeat_region complement(162419..162974) /rpt_family="L1PA10" repeat_region 163076..163374 /rpt_family="AluSx" repeat_region complement(163513..163661) /rpt_family="AluSq" repeat_region 164163..164451 /rpt_family="AluY" repeat_region 164537..164834 /rpt_family="AluSq" repeat_region 165692..166005 /rpt_family="L1ME3" repeat_region complement(166735..167034) /rpt_family="AluSx" repeat_region 167938..167976 /rpt_family="POLY_A" repeat_region 169496..169526 /rpt_family="(CA)n" repeat_region 169571..169691 /rpt_family="MIR" repeat_region complement(170320..170670) /rpt_family="THE1B" repeat_region 170721..170773 /rpt_family="MER4A2" repeat_region complement(171896..171955) /rpt_family="(CAAAA)n" repeat_region 172201..172483 /rpt_family="AluSq" repeat_region complement(172669..172895) /rpt_family="L1ME2" repeat_region complement(173006..173156) /rpt_family="AluSp" repeat_region complement(173426..173767) /rpt_family="L1" BASE COUNT 48141 a 37952 c 38065 g 49724 t ORIGIN 1 aagcttcagt gtggatgaag actgacaata ttgttcattg aaagacacca gacacaaaag 61 atcatatatt gcatgattcc atttctataa aatgtccaga atatgcaaat ccatagagac 121 agaaaatcac ttagtggttg ggaggaaatg ggggaaggtg agaatgcgga gtgactgcta 181 atgggtttct ttttggggta atgataatac tcttgaatta gacagtggta aggattaaac 241 aaacttgtga atatattaaa aacttgcatt gtgtactttc aaagggtgaa ttttatggaa 301 tgtggatatc tcactttttt ttaaaatgaa agaatgaaaa aaggtggctt ctgtctcatc 361 tggactccta caattttttt ttttttgaga cggagtttcg caaatttaat tattgataaa 421 tatacctgag agatcactgc ttgccagccc tgtactggca ctgagttaca taaaggaata 481 agaccgtccg ggcgtggtgg cttacacctg taatcccaga actttggggt gccgatgtgg 541 gtggattgct tgagcttagg agtttgagac cagtctggac aacatggtga aactccgtct 601 ctatcaaaaa atacaaaaaa ttagctgggt gtggtggtgt gggtctgtgg tcccaactac 661 tcagctaact tgggagactg aggtgggagg attgcttcag cttgggagga ggaggttgca 721 gtgaatcgag attatgccac tgcactccag cctgggtgac agggtgagat tcgtctcaaa 781 aaaaaaaaaa aaaaaggaat aaaacccaag cccgcacaca cttcataggc taaagggaga 841 caaataagta aatgttctat tataaccaag ggtgagctgg gtgcgatggc ccatgtctct 901 aatcccagca ctttgggagg ctgaggctgg cagatcacct gagatcgaga gttcgagacc 961 agcctgacca acatggagaa accccatctt gactaaagat gcaaaattag ccaggcatgg 1021 tggcgcatgc ctgtaatccc agctactcgg gaggttgaga caggagaatc gcttgaaccc 1081 gggaggcgga ggctgcagtg agccaagatc gcgccattgc actccagcct gggtaacaag 1141 agccaaactc tgtctcaaaa acaaaaactg agggtgatgt tgtaatggag gtaatgccaa 1201 attaacaatt actggtgctt aagttatatt agatactgtt ctaagtgctt tggatgtagt 1261 aagttattta ttcctcttat caatcctatg aagtggaaac tcttattaac attatcgcaa 1321 atttagaggt aaaggaagca aggcacagaa aagtgatgcg accacatata tgcctgtaaa 1381 gcagcagagc taggagttag acctgagcag gctggctcca agtctgtgca attaagcagg 1441 atattccagt gcttagagac aggcacaggg tgccctgggg acacaaagga gggccacgta 1501 agcctgcaag gaaaggtgtc acttataatt cacatgcata tttggcctag attttttttt 1561 ctttcatttc tctatgctgt cagggctacc cccactgtgc acagcagaat aggcttgctt 1621 acacgtggat gtggtcaaga aagggaatga aaaaaaaagg gagggacttc ttactctgca 1681 aaccacagtt ctttcagcct gagcatcatg gggttcaccg tgtgcagatg ctcctcgttc 1741 cacttcttgg cgctcctgta gacactgtgc cagggcacag gggcccggat cactctttga 1801 ggaaacaagc gggggatgct ctcaataaag agccgttttc tctccattgg gtccatgagg 1861 atgtaatcaa ctacaaccgg aaagggcgaa cacctcatta ttaagctcta agggttgtgc 1921 tcttagcttc aactcattta gttttgtttt taatatagag acggggtctc actaagttgg 1981 ccaggctggt ctcaaactcc tgggctcaag tgatcctccc acctcagcct gccaaagtgc 2041 tgggattaca ggcatgagtt gctatgcctg gccttagagt caactctttt tatttattta 2101 tttatttatt ttttgagatg gagtctcgct ctgtcgccca ggctcgattg caatggttca 2161 atctcagctc actgcaacct ccacctccgc ctcctaggtt caagccattc tcctgcctca 2221 gcctcctgag tagctgggat tacaggtggc tgccaccaca cctgcctaat tttttgtatt 2281 ttcagtagat acagggtttt gccatgttgg ccaggttggt cttgaaatcc tgacctcagg 2341 tgatccaccc gcctcggcct cccaaagtgc tgcgattata ggtgtgagct agcacacctg 2401 gccttagctt caactcttag aatataaatg agcctctgtc catttgcaag gtcatgggta 2461 atcccctggg tataattcaa ctggtgaagc ttcactcttg gtgtttgttt tctttctgag 2521 ggtattttac agaaaggagg agtgtgactt cacggtgtgg agcacttaga atctggggtt 2581 gagccctggt aggatgtcta gtatcaggca ggacaatgaa ataggatttg ctgataactc 2641 attagtcact gattgatttc tcacaccaaa atagatgcac acaaaaaaac ccaccatgga 2701 tgataccatc cattatattt acaaatagaa gtttaaagac caaatgtttt ccatactaag 2761 aaaggttggt agaaaagctt catagaggat gtaacatttg tgctgggtca taaagaacaa 2821 gtagcaggct gggtgcggtg gctcacacct gtaatcccag cactttgaga ggctgaggtg 2881 ggtagatcac ttgaggtcat gaattcgaga ccagactggc caacatggcg aaaccttgtc 2941 tctactaaaa atacaaaaat tagctgggag tggtggtgcg tgcctgtagt cccagctact 3001 caggaggctg aggcaggaga attgcgtgaa cccaggggtc cgcccaggta ggtcttgaaa 3061 tcctgaccaa cctctgcagt tgcagtgagc agagatcaca ccactgcact tcagcctggg 3121 cgacagagca agactgtctc aaaaaagaaa aaaaaaagaa gtgtagaatc tcagccaggt 3181 gtgatggctc atgtctataa tcccagcact ttgggaggcc gaggcaggag gatctcttga 3241 gtccaggatg ttcaagacaa gcctaggcaa catagggaga ccttgtcgct aaaaaaaaaa 3301 aaaaaaaatt gttttaaact tacaaatgta gaatctcagg ccccaggttc agatctactg 3361 aatcgcaatc tgcattttaa gatctgcagg ttacagggat gcacgtgaat atgtgaagtg 3421 ctggtgtaga catcagtgat ggtaggggga tgcaggactt tgaattagcg gttaaatgat 3481 gttttgggta tttccagtgg aatcagggtg gcaaatgaca aagtgagaag aaagaagatg 3541 ccttccaggc caggcatggt gcctcatgcc tgtaatccta gcactttggg aggctaagga 3601 gggaggattg cttgagccca ggagttccag accagcctga gcaacaaagc aagaccttgt 3661 ctctacaaaa ataaaaataa aaaagaaggt gccctctaca tagaagctcc ccagccccct 3721 ttatgcccac cagccttacc gatgcttttc atgaggctac agtaatagtc attctccttc 3781 tcctgcacga ggaccaccat caggggctcc aggaagggac tcgtcagcag cgtgttagaa 3841 atcagctttg aaatccgaac catcacttca ccctcctcag gggcaatcat gtctttgcga 3901 attccattgg tcagatagta atagtatctc ttccataaac aaaccaccag cgagaaaatg 3961 gtcatgattg gatcatgggt tcaactgtcc tgcccatcag ctgccaaggg accccattca 4021 tacagcattc gtctgcaccc ttcacttgcc ataaccatgc attctgggcc tgtcgcttca 4081 cactcatggc aagaagatca ctggcaaatt ttccccagga cttcagagga ttcaaccaaa 4141 gccagagaga tgtcagtggg gtataaggaa ggatttgtca aataccaccc agataggaac 4201 agtggagtga gtacagagtc cagggatgga ggtggcatat ctggtgacct ttaagaagag 4261 gaataacagg ccaggtgtgg tggcttacac ctgtaatctc agcattttgg gaggcaaagg 4321 tgggagaatt gcaaggtcag gggttcgaga ccagcctggc caatatggtg aaaccccatc 4381 tctactacaa aatacaaaaa ttagccgggc gtggtggcgc acacctgtag tcccatctgc 4441 tcaggaggct gaggcaggaa aattgctgga acccaggagg tggaggttgc agtgagccaa 4501 ggtcatgcca ctgcactcca gcctgggtga ccaatcgaga ctctgtctca aaaacaaaca 4561 aaaaaacatt agattttcct gatccagagc tagaatttac cccccggaag ctttcactca 4621 ctggtcctga ttctgccttc taaggtagct cagacaaaca gggttggatt tctgtctaaa 4681 aaaaaaaaaa aaaaagtaaa ggaatgacaa gctcctaaaa tggcctcgaa tcacataccc 4741 tcctgaacag caatgatctc ctacaaaaca ctttgatccc atggcttcta gctgtcagtg 4801 gggaaaaata accccaaagt ccaagatcta ttgttgttct tttttttttt tttttgacac 4861 agaatttcac ttttgtcgct caggctggag tgcagtggca caatctcagc tcactgcaac 4921 ctctgcctcc cgggttcaag caattctcct gcctcagcct cccaagtagc tgggattaca 4981 ggtgtgcacc accacagcca gctaattttt gtatttttag tggagacggg gtttcaccat 5041 gttggccagg ctggtctcaa actcctgacc tcaggtgatc cacccacctt ggcctcccaa 5101 agtgctggaa ttataggcat gagccaccac acccagctga tagttcttct tacccaagca 5161 agggcttttg agtttcaaaa gcaacataaa cacacaaatg aaacaaaaat ggccagtgga 5221 acctagtgac ttcaggaaat tttgtatatt tagggtttgc ctctgagctg ctggatggct 5281 gagtggctct gggaatgact aatgtcagat agaggagaag cacagagtgt gaggaaaggg 5341 cttgaagtct gtttttaagt ttcagatcag aagaggttgc tggccaggca cggtggctca 5401 tgcctgtaat cccagcactt tgggaggcca aggcgggtgg atcacctgaa gtcaggagtt 5461 tgagaccagc ctgaccaaca tggtgaaacc ctatctctac taaaaataca aaaatcagcc 5521 aggtggggtg gtgcatacct gtaatcccag ctactcagga ggctgagaca gcagaattgc 5581 ttgaacccag gaggtggaga ttgcagtggg gcaagatcac gccacagcat tcaaacctgg 5641 gtgacagagc gagattctgt ctcaaaaaaa aaaaagaaaa gaaaaagagg ttgctgctgc 5701 cagcctcttt gttactctcg cctctctcca acgttagtaa aagcagctgc taacagggga 5761 gtattgcaaa gcttttctga ttttaagaac tgcctgggtg cagggtgggg gtaaaaagtg 5821 cttgttaaaa tgcagattcc caggcccctc ctcaggcagt tagggtgttt accaaggacc 5881 ccaggtgacc cttatcttca ggcaagtttc agaaatgctg cacagctctt aagaacacag 5941 agactggctg ggtgcggtgg ctcacgccat aatcccagca ctttaggagg ctgaggtggg 6001 tggatcattc aaggtcagga gttcgagacc agcttgacct acatagtgaa accccatctc 6061 tactaaaata caaaaattag ctgggcatgg tggcagttgc ctgtaatccc agctacttgg 6121 gaggctgagg caggagaatc acttgaaccc aggaggcaga ggttgcagtg agccgagatc 6181 gtgccactgc actctagcct gggtaacaga gcaagactcc ctctaataaa aaaaaaaaac 6241 aacacagaga ctttggtatc agccagaccg gctttcagtc ttagttttgt ctcctaccct 6301 gtagccctaa gcacattgct taaacagtct atactccact ttccgcaact ggacaataga 6361 gataatcata gaacctgggt ctcaagtggg gtcaagtagc tatagagggt ttagcaggga 6421 attttgccaa cagcacatag tcactgtgag tttttattaa tcattttatt atacacccta 6481 aacccacaag acatttgctc taaagtttta ttgttctttt aagacgggtt ggaaactcca 6541 acatctatct agagccaggc aggtaatgtt aataagtgaa agttgtctag acctaagaaa 6601 ataataagga atggtgggaa ctatgacaaa ctgaagagtg tatttctcat ctaaaggagg 6661 tagctactgc ttagcaccaa atattttagt tttctcaaga gaagttggaa aatgtagctt 6721 gagactttcc tcatgctttt tttttttttt tttttttttt ttttgagata tggtcttgct 6781 ctgttgccca ggctggagtg cagtggcatg atcatagctg ggatgcctga ggtgtcagag 6841 aaggggacca agggataggt tgtcttgtcc tagaaagggt tgtgaatccc tccccctacc 6901 ctttttttga gacaaggtct cactctgctg cccagactgg aatgtagtgg cacaatcata 6961 gctcactgca gcctcagctt cccagggaca agtgatcctc ccacctcagc ctcccaagta 7021 gctgggacca cagatgcatg gtaccatacc cagctaattt ttgtagagat tgggtttctc 7081 catgttaccc aggctggtct tgagatcctg ggctcaagtg atcctcccca ttcctcctcc 7141 taaagtgctg ggattacagg catgagtcac catgcccagc cacctcattc tttttttttt 7201 tttttttttt tgagatagag cctcgctctg ttgcccatgc cgaagtgcag tggctcaatc 7261 ttggctcact gcaacctcca cctcccaggt tcaagtgatt ctcctgcctc agcctcccga 7321 gtagctgaaa ttacaggcgc ctgccaccat gcctggctaa ttttgttttt gtatttttag 7381 tagagacggg gtttcagcat gttgaccagg ctggttttga actcctgacc tcaagtgatc 7441 cacctgcctc ggcctcccaa agtgctggga ttacaggtgt gagccaccat gcccagccac 7501 ctcattctta aaacacatct acaggccaaa ttcagcctat gatagagatc ccatataaaa 7561 tatgggatgc tttgggatat acttatacta aaaagttatt cattacctaa aatttgaatt 7621 taactatgtg tcccattttt tgtcttgcaa aatctggcac ccccagccta tgggccacta 7681 tttttgcaac ttatattcta aaaggactca aggtatcaat ttccatcact tactagggca 7741 gagaacatgt acacacactc tctctccttc tctctcactc gcacaagcac acaaacatgc 7801 gccctcttca gtagcctgtc ttgggtctag actttggtgt ttttcaaaca tttcccaggt 7861 gattctgctg ttctccctga atttagaatc actgagatgc cgagagtaac cctcagcaaa 7921 ccgagggaaa gggggcaaca ggaacgtacc tccaggtccg attcagatgg cttcttttct 7981 ttactttcca tttccatctc ctgctgtaac atgacatcga gctgctgttc tggactcatt 8041 ggtctgcttc ctgggaacgt caaagatgtc tttacctcct tcttcatggg tgagaagatt 8101 gacgtcttga aggccaactt gatcctgaaa agtagacaca gctcagccct tggtgtttgg 8161 ttctccagag aggtgctgtg gtgttgccta ctttccctag cctgatgagt gtggttctgc 8221 aaatagaagc ctctctcctc tgtgtcatct aatggcatgt gtgtatactt ccaagatgac 8281 cctaatgact gaacacctca taggtatttt agcatggttt tgttcagttg gtttattttt 8341 gtctacaatc tcaaaaagcc tggccaacat ggtgaaacca cgtctctact aaaaatacaa 8401 aaatcagcca ggcctggtgg catgcacctg aaatcccagc tactcaggag gctaaggaag 8461 gagagtcgct tgaacccggg aggcagatgt tgcagcaagc tgagatcatg ccactgcact 8521 ccagcccggg caacagagca agactccatc tcaaaaaaaa aaattttttt tttgattaaa 8581 gtgattcgtg gataagaata tgattatctg ggaatgcttt gaagccaaga agcagagtgt 8641 ggctttagtg agaccacatc cttctgttca gcccaccgtc ctgccttctc tggtgcccac 8701 agtgatatct tacctagtgg aatcctcttt gtttctggag gaaaacttcg tttttttcct 8761 catgggctct gatgagcttc tatttcctgg aagaatggag aagaaagaca tcagtgatgg 8821 gcaatggcct tgggccattc tccgtccaca ggggcatgac tctcggaaca gtccaggagc 8881 acactaaggg gagcggacat ggtacggtgt gacgtggcag gagatgtggg ctcaggtctc 8941 atgtgcatca ctgattacct gtgtgaccct gaagaactgg cttcatcact ccggacctca 9001 gcttcctcaa ctgtaaagtg gggatgtgcg tatcttccat tcgttcatac atttggcaag 9061 aattaataac tgcttgtttc atgtactctc caatattgta tgctctattt tctaaacacc 9121 tgttatatta ctgtttaata tttctcttta agtagacctg ggcatttaaa cttagtacat 9181 ttggccaggc acggtggctc atgcctgtaa tcccagtact ttgggaggct aaggcaggca 9241 gatcacctga ggtcaggagt tcacgaccag cctggccaac atggcgaaac cccatctcta 9301 ctgaaaaata caaaaattag ccgggcattg cggcgagcgc ctgtagaccc agctacttgg 9361 gaggctgagg cagggagaat tgcttgaacc caggaggtgg aggttgcagt gagccgagat 9421 cgtgccactg cactccatac tgggcgacag agtgagactc cgtctcaaaa ataaataaat 9481 aaaataaaaa ataaaatagg ccggacacgg tggctcatgc ctgtaatccc agcactttgg 9541 gaggccgaga caggcagatc acgaggtcag gagttggaga ccagcctgac caacatggtg 9601 aaatccccca tctctactaa aaatacaaaa attagccagg catggtggca cacgcctgta 9661 atcccagcta ctcaggaggc tgaggcagaa gaatcgcttg aaccctgggg acggaggttg 9721 cagtgagcca agatagcacc attgcactcc agcctgggca atgagagcga aactccgtct 9781 caaaaaatat acatacatac atacatgcat acatacatac ataaataaaa taaacttagt 9841 acatttattt ttaaaagaaa ttttatgatt atcacaaata aaaaataagt acatatttat 9901 tgcaagtaga aaaaccaata tcatttgcaa taaatagaag gtagctacaa ttaattcaat 9961 aagtgatata aaaactatat tattcaactt taattagata ttacttccta aagtgtctat 10021 ctttgaactt ctcaaaatag tccacatata acaacaatac aaaaatcctt tttttttttt 10081 tttttgagac agggtctcac tctgctgcct gtgctggagt gcagtggcat gatttcagct 10141 cactgcaacc tcaaccaccc aggctcaggt gatcctccca cctcagtctc ctgagtagct 10201 aggagtacag gcgcatgcca ccacacctgg ctaattttcc tatcttttgt agagacgggg 10261 ttttgtcatg ttacccaggc tggtctccaa ctcctggggg caagcaaccc tctcaataaa 10321 cacaacaatc aaggtacaga ttcacaaaat ggaaataata cagcaataac aatattgctc 10381 caaaataaat ttttaagatc ctccaagtac aaatgtattt taatgaataa taagcagaaa 10441 tcatctcaag caaggacatg aataacagac accagtgccg ctctgaagaa agcatcttag 10501 tactttatat tcaaagactc tggcttgcct gagaattttg gggacaatga aacctgggaa 10561 gcaatagcac tagttcttca cagatattaa taattttgga ctaagaaaat tattaaagac 10621 tgctacagac tgaaagctaa tgtcttgtca aaattagtat gttgaacctt aatccccagc 10681 atgatggtat ttggaggtgg gactcttggg aggtgactag gtcatgaggg ctccatctta 10741 tgaatgagat tgtgccttta taaaaaaaac agcaaaaaga tttctcaccc cttctcccat 10801 gtgagcatgc agcaagaaga tagagaagac agatgtctat gaatcaggaa gaggaccttc 10861 atcatagact gaatctgttg gtaccttgat ctcggacttt acagactcca gaaccacaag 10921 atatgaattt ctgtagttta taagccactc agtttatgat tctttgttat agcagcccaa 10981 aaggaactaa gatgatgacc atgaaattac caattagagc agcaaactta aggttctgtg 11041 tatcatgatc ttgaacattt gggcattaat tatttgctac atgtccttaa agcaggaaca 11101 tgcaaaagca gcatcctaga gactgtgggt gtgtatgtaa tacacaaata acactggtaa 11161 gaaccacttg ggaaaaagaa atcctccaag tgatattgaa agggctatga caataagaat 11221 ggaatcagga gttggaaaag ggaggtagaa aaccataagt aatcagaaaa tttgaaatgg 11281 aaagatagtc atttaaacaa aaaacactca ttagacatga aaaactcaag actgaacaga 11341 gttgaagaga aaattagtaa attggaagat gtaattaaga aactcaccca gacgtagcac 11401 agagaaaaca atgaaaaata taaaaatgag tttgagacag agagcagatc aagagtgcca 11461 acatacatcc aacaaaaatt actaaaggat atccttatgc caatatacct atgcttggtg 11521 ggaattaaat taatacatac aaggtgctgt ggtaagcagc ttccaagaca gcccccaatg 11581 atccccatct cctggtaaca cccctttgtg gtgtaatcct ctccccttga gtatgggcta 11641 gaactagtga ctcctaacaa atagaatgtg gcaaaagtga caagatgtca cttccaagat 11701 taagttccaa aaaagcctat gggttctctc ttgctggctc tctctctttc tctccccctc 11761 cctcgctcta agggaagcca gaagccatgt tgtgagttgt tctaagggag agacccgtgt 11821 ggcaagaaac tggtgtctat ggccaacagc cagcaaggac ctgatgacat gagtgagcct 11881 ggaagaggat ccttccctag ttgcctttag atgatggtgg ccttggtgta tgccttgatt 11941 gtacgctagt gaggggccct gagccagcac tacaaattca gccatgccca cgttcctgcc 12001 ccactgaaat tgtaagataa caaatactgg ttcttttaag ccacagaatt ttggaataac 12061 ttatcatgca aacaataaca caagtacata gaacagtgcc tggtatatgt aaatggtcta 12121 taaatgttag ccgtttgcaa tcatttcctc catagataag gaggagagtg aggctttgca 12181 aagtctccaa agttagcttc agcaaaaggg gagccaggat cagaatccag gagtaaaaat 12241 ttcactccag gccaggcaca gtggctcatg cctgtaatcc cagcactttc ggaggttgag 12301 gcaggtggat cacctgaggt caggagtttg agaccagcct ggccaacata gtgaaacccc 12361 atctctacta aaaatacaaa aaaaaatagc caggtgtggc ctgtagtccc agctactcag 12421 gaggctgagg caggagaatc acttgaatcc gggaggcgga ggttgcaatg agccacgatc 12481 gtgccatggc actccagcct gggcgacaga gtgagactcc atctaaaata aataaataaa 12541 taaaaatttc ctccaaagcc ccattctgca agggtgtgac actgccacaa gtacctgatg 12601 atggcagccc ctgaccggtc ctgtccctag ggacactgat ggtggccggt tggtagacct 12661 tcagcagatc tttcagcttc aggtcctggg ccatcaagga gtagttgttg gcgatggaat 12721 cactgggtcc acggtggtga tgctgttctt tgaagggtgc agccaaggtc catgacgtgc 12781 gttgcatcaa gggcgggtaa aagctgtgtg acatgacagt ctacgaggta agaagtgcag 12841 agtgagacac aatgaggaac atctctcgat gagcctgagc tctgctcaca ctgaggctca 12901 caagagttcc aagtgcctta tgctatgttt ctgccttctg cacattcagt ggaaagattg 12961 tctgaacaga caccaactgt acttctctga ccattgtaag catgaattct gatgtcaggg 13021 ataatagctt gcattcacca gtaattttga ggcaggtaat tgacatctct ggtctttatt 13081 ccttatggac atgactacaa ggtacaagca attaccctca ttttacaatg aaacacgcag 13141 aagttcagag agtttaaata acacagcaga tcagtggacg gccagattct aattcacctt 13201 tggacaatgg catagctctc tctcccacac ttctgcagcc tcttctgggg gcaaaagcga 13261 ccaaagctac taatcctact tgtcaggtct gtttgactct ccaccctcat tgagtacctg 13321 ccaaatacgg agagaagtgg gggatgcact tcacctcctc accctcaaca gaagagctgg 13381 cagccaggtg tgcatacctg atagagtcca gacggttcct cattagcaga agcaggcaga 13441 ggaggcagct ctggctgccc ctgggagtgg ctcatgtgat gtatggagtc acttttggcg 13501 atctgtggga catgggaaca aagtcaccct tcaaaaatta gggaagattt gcaagcctct 13561 gagttggtta ggactaaaga ggtccccagg aaaagtttta ggtccagagt tctggaccta 13621 aaacaaaaca ctcctctcat tcccatccat tatgtatgtt tcatttattt aaaaattaac 13681 agataagggc tgggtgcggt agctcacccc ggtaatccca ccagtttgag aggccaggga 13741 gggtggatca cctgaggtca ggagttcgag accagcctga ccaatatggt gaaaccctgt 13801 ctctactaaa aatacaaaaa ttagccaggc atggtggcgt gtgcctgtag tctcagctac 13861 ttgggaggct gagacaggag agttgcttga acctgggacg cggaggttgc ggtgagctga 13921 gattgcactc tgagatgtat tgaagtgtat atacattata gagtgactaa atctaattat 13981 catatgcgtt accttgcatg gttattatct ttgcagtgag aacacatcta ctctcatagc 14041 gttgttcaag aatacaatat attgttaatt agtcatcatg ctgtacaaca gatctcttga 14101 acttattcca tctaactgaa attttatatc ctatgaccat ctccccaacc cttccactcc 14161 tgtttctttt ttctttcttt cttttttttt ttgagatgga gtcttgctct gtcacccagg 14221 ctggagtgca gtggctctat ctcagctcaa tgcaacctct gcctccgggg gtcaagtgat 14281 tctcctccct cagcctccca agtagctggg attacaggtg actgccacca tgcccagcta 14341 attttatatt tttagtagag acggggtttc atcatgttgg ccaggctggt cttgaactcc 14401 tggcctcaag tgatctgccc tcctcggcct cccaaagttc tgggattaca ggcatgagcc 14461 accacgcccg gcccctgctt cattttttaa tcattcagtg tctcctcctc ggattccctt 14521 ctatgagcaa atttgaatat ttttcccccg tggaagagcc caaattctga ggacatcccc 14581 agtaccctac cctcaggtat gattaaagga ggtgacttta aaaacattga ctcttgcatt 14641 tttttttttt tgagatagtc tctcactctg tcacccaggc tagagtgcaa tggtacgatc 14701 ttagctcact gcaactgctg cttcccaggt tcaagcaatc ctcctgcctc agcctcccaa 14761 gaagctggat gggtgcccac caccatgcct ggctaatttt catattttta gtagagacgg 14821 ggtttcacca cattggtcag gctggtcttg aactcctgac ctcaggtgat ccacctgctt 14881 cgaccttcca aagtgctcgg attacaggca taagccactc caccaggtgg actcttgcat 14941 tttcacatgt gcaaaattcc aaagagaaat tcactcaggc agtggcactg aattttctgt 15001 gtatttaaca cacacataag caactcttta tgattcctgc gaaaagaaaa agtgttattt 15061 caacaatgtg ggtgttgagt actccattcc aaccagtgag tgcatactgt ggagtgagcc 15121 tcaggtaagg gtcaacttgt ggctcccaca atcactctca gattccaagt gaccctccac 15181 ctctcctgag cagagaagaa agtgactaca ttattgaatg aaatcacatg tgtttttgtc 15241 ttaagactta gcccctatat gcaaaacaac cacttcacct gggactagtg atggagaaac 15301 tgcaacttgt tccaacacta gaagaaaagc tgttggcctg gacttattga gaggtgctgc 15361 cgttcagcac tttaaaggtg gtttgaggga ttttcaaggg caattttgac tctctgtgat 15421 tttcacctta ggcattgact ttggacttaa ctcctatata agatgagcca acatcgtatc 15481 agtgtgattt atttgggtgt gcaaattatg acatgcaaga tcactgatga gggtgggatg 15541 gggcaggggg catatgacta ttagctgtcc ctctgccccc aaccccttaa agatgactca 15601 aattaaccat cattatctct cccggagcac ctacctcatt aaataacaac agaaaaacaa 15661 caaccgcagc aatactatct accattattc caaacgatta tgtgccaggt actgtacaac 15721 ctcatgagat ttttttcata cccgcattct agctgaagaa aatgagggtg agaagatagg 15781 taacacatcc tagccatgta attggccggc ttgacttcta agtcagtgac ttatttatta 15841 ttattattat tatttatttt ttttttttga gtccaagttt tcgctctgtc gcccaggctg 15901 gagtgcagtg acacaatctt ggctcactac aactgcaacc tccaccccct aggttcgagc 15961 gattctcctg cctcagcttc cttactgcat catgcaaact gacagggacg ctgttctcgg 16021 gataatgaag aaagcaaacc gtgagcattt tgcagaacac aatggaaaga gggagaacct 16081 ctgtagaaac aaattaaact aggctatgtc ataggcttgg actaaaacaa atacacaaac 16141 agtcataagt acgctgatag cagaaagcat cccagtctgt cgatcaggaa cacatggggc 16201 acacacaaca tgagcagatc taggcatcaa ttgcagaatg gggctaaaag agttttcctg 16261 gcaataaatg gctaagaatc ctctgtggca ccttcaagag atcgagattc tcataagaac 16321 tagctggttg tgcaaaattg tttgttaaaa atatgaatgc ttggccagcc gtggtagcta 16381 atgtctataa tcccagcact ttgggaggcc gaggcgggtg gatcacttga ggccaggagt 16441 tcgagaccag cctggccaac atggagaaat cccgtctcta ctaaaaatac aaaaattagc 16501 caggcggtgg cgcacacctg tagttccagc tactcaggag actgaggcag gagaatcgct 16561 tgaacccaga aagcagacgc ttcagtgagt gaagatcacg ccactgcact ccagcctgga 16621 taacagagac aaactgtctc aaaaaaaaaa aaagaaagaa agaaaaagaa agaaagaaat 16681 taatgcttaa atcttctctt tggtaaattt caaggagctt tcacttcaga gcatattggc 16741 ttagttatta tctacaaatc cactttagtt ttacaaagtt tctatttaaa atgctcctct 16801 gggcacttcg ctgtatggca acctggtgat tcaacacaga tacaacagaa attctggaac 16861 ttagattggg tagaactgtc actttgggca gcttattgct gattacacat gttgtatgta 16921 tgttcttcat ggaggccatt cccaaatgct cgtctgagga cgtgctacac cagaatcact 16981 ctggtcttag ataaaaatat agattccagg gccccactcc aaatcttctg gatcggaatc 17041 taggagtggg ggaccaagac tctatgtttt taagatgctc cccagaatat tctaatgttc 17101 agccacagtc aaaaactgat ggttggaata tatggtctaa tgggcttgga agggaaggaa 17161 agggaagaac gggtgtctat tcccggattt ttcacagcaa aacttgaaga tttccaaaag 17221 tctaattcag caaaaatgat atttcaaagc agacaacaaa aaaatgtgat gaatacactt 17281 gttcagtcaa gggcatctaa catgccaatt ggccctcttc ccacatattc ccaacacatt 17341 aaaaatacag gtaaatcggc cagacacggt ggctcacacc tgtaatccca gcactttggg 17401 aggctgaggc aggcggatct cctgagatcg ggagttcaag accagcctga ccaacatgga 17461 taaaccctgt ctccactaaa aatacaaaat tagccaggtg tggtggcaca tgcctgtaat 17521 cccagctact caggaggctg aggcaggaga atcacttgaa cccaggaggc ggaggttgca 17581 gtgagccgag atcacaccat tgcactccag cctgggcaac aaagagcaaa actctgtctc 17641 aaaataaaaa ataaaaaaaa aaaatacagg taaaacaaga aacgtcttaa aattcacacc 17701 taccttctga gatgagcaat ccatgtcacc catggtgtcc actctaattg tagctatttt 17761 tttaaaatga ataaagagct ttcagcttct ccagggatgt gcctctccta tccagttggt 17821 ctgaaattca cacttcctta tagcataatg gggatgacac caacaatgga actttccaga 17881 tgttctgaaa gctctgttca tgaggttgcc tctctgaggt catagacttt ggcccgctga 17941 ggagttccct ggtaaaaaag gagagaaagg gaagctagag ataaggggac agcactgggt 18001 ttctgctgcg cttttggctc tggttcctag tcccccagcg gtggccacgc ttcctttccc 18061 ttgagtgctt ctctcttgga aattttccag aaacacactg agttcactgc agaggaaaga 18121 ctgaagcaga gttacacatt aatacagaga agagggagac tcacccctcc aaacaacatc 18181 ctggcagccc gaactgttct ccctcctcct ccctggactc ttaacactgg cagtgcttgt 18241 atgcaggcgc tgggcacagt aaccatgtct cagcgtctgt tgctaaggga aggaggggcc 18301 tctgcggtgc ctacatacaa gactgggcat gacctagctc tgtgagctgg tgtctccctt 18361 tgctggaggc ccaggtgggc agcaggaagt gttggtgaca gaagggaaca tgggaaggtg 18421 tttagcaatg ttccaggaat aacagaggca acagcaataa aagacctttg gagacgtcga 18481 aaagcacaga gtatggtggt taagttcttg ggatctacaa ttggacaggt catacttcag 18541 atcctactct tctcattaat tagctgtgtg gcttggcaag ttgtttaatc tctccaagtg 18601 tcctttccgt ctgtaaaatg gcaataatgt gagtccctac ttcatagggt tgctttgaag 18661 ggttaacggg ataacgtgta cagagcgatt tgcatatagt agaaggataa ctgatatttt 18721 cccagttttt ccccgttttt tttttttttt ttgagacaga gtcttgctct gttgcccagg 18781 ctgcagtgca gtggcgtgat ctcagctcac tgcaagctct gcctcccggg ttcacaccat 18841 tctcctgcct cagcctccca ccagtgcgcc cagctaattt tttgtatttt tagtagagac 18901 ggggtttcac cgtgttagcc aggatggtct caatctcctg acctcgtgat ccgcccacct 18961 cggcctccca aagtgtggga ttacaggtgt gagccactgc gcccagccca gtttttcccc 19021 attttaacac taagtcctgc atcctgggag caatctcagt cctgggcaaa ctgggatagt 19081 ttgatcaccc aacacagtca atgctctata aatgttgtgg ctaacatcaa tattacgata 19141 aacacagaaa tacatgtgca catactcaaa agtaacaata ttagctgccg cccgaatatt 19201 aaaagcacat gttccatggt cactgtgaag ttctctcaaa accaagttag acttaatgag 19261 ttttttgggg gtgccatgct tgcatatctt tttttttcat tcaatttccc cccaaacacc 19321 acgatattcc cattttaagg ataaggaatc aaaactgaga gatatgggcc aggtatggtg 19381 gttcatgcct gtaatcccag cactctgaaa ggctgaggtg gatggatcac ttgaggtcag 19441 gagtttgaga ccagcttggc caacatagtg aaaccccatc tctactaaaa atacaaaatt 19501 agccgggcat ggtggtagtc gcctgtaatc ccagctactt ggggggctga ggcaggagaa 19561 tcgcttgaac ctgggaggga ggttgcagtg agcacagatc atgccactgc accccagcct 19621 gggcaacaag agcgaaactc catctcaaaa aaaaagaaaa acaaacaaaa aaactgagag 19681 atacataact ttcctaagtc gacttgactc ataagtagca ctgatacaat tggaacctgt 19741 cctgatgacc ccgagtccac tgctctttcc ataacctacc tgtccagaac acaaacattc 19801 ccgagttcct atgaaaagca ttgcatagtg agaggtgaca gcgtgctggc agtcctcaca 19861 gccctcgctc gctctcggcg cctcctctgc ctgggctccc actttggcgg cacttgagga 19921 gcccttcagc ccaccgctgc actgtgagag ctcctttctg ggctgggcaa ggccgaagcc 19981 ggctccctca gcttgcaggc aggtgtggag ggagaggcgc gagcgggaac ccgggctgcg 20041 cacggcgctt gcgggccagc tggagttccg ggtgggcgtg ggcttggcgg gccccgcact 20101 gggagcagcc agctggccct gccggccccg ggcaatgagg ggcttagcac cccggccagc 20161 ggctgcggag ggtgtactgg gtcccccagc agtgctagcc caccggcgct gcgctccatt 20221 tctcaccggg ccttagctgc cttcccgcgg ggcagggctc gggacttgca gcccgccatg 20281 cctgagcctc ccaccccctc cgtgggctcc tgtgctgccc gagcctcccc gacgagcgcc 20341 accccttgct ccacggcgcc cagtcccacc gaccacccaa gggctgagga gtgcgggccc 20401 acggcgcggg actggcaggc agctccacct gcagccccgg tgcaggagcc actgggtgaa 20461 gccagctggg ctcctgagtc tgatggggac gtggagaacc tttatggcta gctcagggat 20521 tgtaaataca ccaatcagca ccctgagtct aactcagggt ttgtgaatgc accaatcgac 20581 actctgtatc tagctgctct ggtgggacct tggagaacct ttatgtctag cttggggatt 20641 gtaaatacac caatcagcac tctgtatcta gctcaaggtt tgtaaacaca ccaatcagca 20701 ccctgtgtct agctcagggt ctgtgaatgc accaatccac actctgtatc tagctattct 20761 ggtggggcct tggagaacct ttgtgtggac actctgtgtc tagctaatct ggtggggact 20821 tggagaacct ttgtgtctag ctcagggatt gtaaacgcac caatcagcac cctgtcaaaa 20881 cagaccacta ggctctacca atcagcagga tgtgggtggg gccagataag agcataaaag 20941 caggctgccc gggccagcag tggcaagccg cttgatcccc ttctacaacg tggaaggttt 21001 gttctttcac tgtttgggtt catgccgcct taatagctgt aacactcacg gtgaaggtct 21061 gcagcttcac tcctgcgcca gcgagaccag aaacccacca gaaggaagaa actccgaaca 21121 catccgagca tcagaaggaa caaactccag acgcgctacc ttaagagctg taacactcac 21181 cgcgagggcc cacggcttca ttcttgaagt cagtgagacc aagaacctac gaattccaga 21241 cacaatagaa ccaagtactt ccattttccc cctttttcta ctccttttcc tcaaagttgg 21301 gctttccatc ctttcaaaat aaccctgagc ttttccatct cttatattat gctctttatc 21361 acaaagtata catacggttt caaatagcat ttggaataat tgagacagtg ggagagctgg 21421 agactctggt ttagacggaa atgggaagat tatcaggtct cctatactgg gtgtgaaacc 21481 tcatgtccca ccaaaaacag caactgtaat ccaatgaagt gaaaaattag aatggggcca 21541 ggcgcggtgg cccatgccta taatcccagc agctggggag gccaaggcga gtggatcgcc 21601 tgaggtcagg ggttcaagac caacctggcc aacatggtga aaccccgtct ctactaaaaa 21661 tacaaaaaat tagccgggcg tggtggcagg caccagtaat cccagcttct tgggttgctg 21721 aggcaggaga attgcatgaa cacaggaggt ggaggttgca gtaagctgag atcacaccat 21781 tgcactccag cctgtgtgac acagcgagac tctgtctcaa aaaaaaaaaa attagaatgg 21841 tcctgatgtg gagctaatgg accctatggc ctgtgtgtga gtaggaggta tgctaccatc 21901 tgaaatttag tgtgttttaa gtctactagt gtttaaaact atcaaaacaa tactaatggc 21961 tttcgcttat tgagcatata ctatgccctg ggactgctgc taggcactta gtagacatca 22021 tgtctttgaa tcttctcaac aacctagaat ataaatatca gtattttcac aagttttatg 22081 attgaggaaa ctgatacatt aaactaacct gcccatagtc acacagctag gtcatcagga 22141 aagcttgaac ttggttatga aatctgttcc atggcttcaa ccgaaaacct acatgtaatc 22201 tgaggaaagg tggaaaataa agtatctgcc aaagagagtg aaagtgactt cccccaggcc 22261 acctcacctg taccagaatt gtttgttgtc tgacaacttt gtgttggata attcttaaga 22321 aaggatatag gcttcaagtt tctcaatctt ctttttcttt ctttcttttt tttttttttt 22381 tgagatggaa tcttgctctg tggcccaggc tggagtgcag tggtatgatc tcggctcact 22441 gtaacctcca cctcctgggt tcaagtgatt ctcttgcctc agcctcccca gtaactggga 22501 ttacaggcgt gtgctaacac acctggctta tttttgtatt tttagtagag atggggtttt 22561 gccatgttgg ccagcctggt ctcgaacccc tgacctcaag tgatctgccc acctccgcct 22621 cccaaagtgc tgggattcag gcatgagcca ccatgcccgg ccttgtttct caattgggct 22681 gccagacaaa acagaaggtg cccagttaaa ttagagtttc agtcaaacaa tttttcaggt 22741 aaaagaatgt tttagggccg ggagcagtgg ctcatgcctg taatcccaac actttggaag 22801 gccgaggcag gcggatcaca aagtcaggag tttgagacca gcctgaccaa catggtgaaa 22861 ccctgtctct actaaaaata caaaaattag ctgggcgtgg cggcacatgc ctgtaatccc 22921 agctactccg gaggctttgg caggagaatc atttgaaccc aggaggtgga ggttgcagta 22981 agccgagata gcgccactgc actccagcct gggagacaga gcaagactcc gtctcaaaaa 23041 aaaaaaaaaa aaaaaaaagt tttagtataa gcatgtccca ttcaatacat gggctgtact 23101 tgtacttgta ctaaaaagtt acccattatt tacctgaaat tcagatttaa ttgagtgtcg 23161 tgtgttttta tttgccaaat ctgccaacat ggggagcaga ggtcagcaag gttccaagtt 23221 gcccctgcct cttgttatca agagtttagg gtcttaatct gttgctaaaa aaatttcaca 23281 catttcttat tcagaggctg gtattcctgg cagctcattc ctgaggccta aaatagttgt 23341 ttaagcagag caacagtttc aatccttgga ggctactata gcaaagtacc aaagtctggg 23401 tggcttataa acaagagaca tttatttctc actgctctga aggctggaag ttcacaatca 23461 gggtgccagg gtggtctggt tccagtgaag gcccccttcc atgttgcagg ctgccaactt 23521 ctcattgtat cctcacatgg tggaaagagg gtaagagagc tctctgggag ttattcttat 23581 tttattttat tttattttat tttattttat ttattttatt ttattttatt ttattatttt 23641 attttatttt attttaatga gccagggtct cactttgttg cccaggctag agtgcagtgg 23701 tgcaatctca gctcactgca gcctccactg cctgggctca agagatcctc ctgcctcagc 23761 ctcccatgta gctgggacta caggcatgtg ccactatgcc cggctaactt ttgtattttt 23821 agtagagaag aggtttcgcc atgttgccca ggctggtctc gaactcctag actcaagcaa 23881 tccacccacc ttggcctccc aaagtggtgg gattataggt gtgagctact gggatctctt 23941 ttataagggc actaatccca tttaggaggg cttcaccttc atgacctaat cacctcccaa 24001 aaaccccatg cctaatccca tcacattggg ggttaggatt ttagaggaac agacattcag 24061 tccattgcag caagtaattg ggaaagcata ggagaggctg cgcgtggatt tttttaaccc 24121 aaccactgca tgttccttga gggccaagtc ctgatgctat ggagagatgc taggatccca 24181 gccctggagg aacacatgat gtgatggagg aagcatctat cacacaaaga caagcaaaac 24241 tctgctggcc cagccaaaaa tgccagaggt ggaaggaatt tgaagattca cagagacgcc 24301 cctagatggc actcttaaac aaaaatccca gtggaaaccc taactcagct ttccaaagca 24361 acttccattt agggaatgat gattggaagg ctgtaatcta aggaaacaca cacacacaca 24421 cacacacaca cacacaatct tttcctgggt ggggaaccca gcattttgca gctgaacaaa 24481 gtgcatgacg ccagacaaga ttggctaaaa cttaagaatt tagggattct tgtctgtcct 24541 atccccacca ttagctcttc tttctgctat tgtggaatta cagcaggatt ctgtggttta 24601 tcatatcatt gtcgtgatca tcatcataaa tcatcatccc aagcaaatat ttttgagccc 24661 ctactctgtg cctcaatagg tgctaagttc tcgacagcca cagccacact attgcaccag 24721 tcctcactac aaccctgtga ttgcttcttt tttttttttt tttttttttg agacagggtc 24781 tcactctgtt gcccaggctg gagtgcagtg tcttcactgc aacttctact tcccaggttc 24841 aagcgatact tctgcctcag cctcccgagt agctgggatt acaggtgcgc accaccatgc 24901 ctggctagtt tttatatttt tagtagagat gaggtttcac catgttggcc aggctggtct 24961 ggaactcctg gcctcaagtg atctgcccac ctcagcctcc caaagtgctg ggattatagg 25021 tgtgagccac cgcaccctgc cagctttttg ttatctaaag acttattttc caggttgcta 25081 tggccacagg caaagaggag aggtcggtca acacaggtcc ctgtctggaa agtggctgag 25141 aactgtgcaa cagcagacca atggttgggc ttatccctct ctctaattta tggttgaagc 25201 catggttcag agaggttact tagtcacaca gtaagttgcc atcatgctct atagcagtgg 25261 ttctcacctc gggacgatgg ggccttccag gggacatttg gcaatatttt gagacacaac 25321 tggaggtggg ggggggcggt tcttgtaact ggcatctaat gggtagaggc taggatattg 25381 ctaaacattc tacaatccac agctcagaaa cagaattatt ccgccccaaa tgtcaatagc 25441 atactattca gaaaccttac tctatagggc agatttgggg ccaggggaaa caggggctcc 25501 tgctccgact tggatccatg caaaccgagg aatttggatt gatttatctt gtctctctca 25561 gcctggactt tttcatgtgt aaaatggaca tccaatccat ctgaaaggat ctcactggct 25621 tatgacattc gcatttgagc agtcatggtt aaaaacttct agccagggtg atacttcatc 25681 acgtgtgtgt cttgagcgcc agagacgttc cagagcagct ggttttaggc atacaggagg 25741 ggaaaccctg acacccttgg aaaactcatt ctcggctgcc agtctgcctg tcttgaaggt 25801 ggtctgagct ttctcactcc aagccagtcg tgtgggaagg gacactctct gagcctcagt 25861 ttcctcagct gtcaacctct cagggcttgt tctagtggct ccgaaagcca gtgggagcga 25921 gcgcgctttg caaagccttg ggcgccgggc cttggcgcgc ctttgccgga tccttccggc 25981 cacgggcggg gcgagcccag aggagaagag ccgaggggcg gtgcccgggc cagggggcgg 26041 aggcaggcgg ctctggctcc ctctcgggac gctctttcct tcttcctctt gttcctcctc 26101 ctgcctctct tcgcttcgcc tgcaaacgcg gtgggggctg ctcggcggtc aggagcaggt 26161 gagagctcgg agcttggggg tgggggtccg ggtgccccgg gtggctagac ccctctcgcc 26221 gggcggggac agaggaggag cgcgggggcg acggcgtgag cgcgctatcg gggtcccacg 26281 cccgtagctt tctgccccgg ggttagtgta aggggcgtct cctgcttatg agtcgccggg 26341 acggggctgg gagcgcacga ggagacacga ttcctcccca cccctaaatg cacactcacc 26401 ccgccacccc ccaacacaca cacacataca cacacacact accactagat taggtaaaat 26461 ggtagctagc accaccacac cccgagactg tgcccctttc tggggaacct ggggaagacg 26521 tcgaggcaaa cctccctagg attattgaaa acgggcgcct tcctcaccac ccagcccctc 26581 ttctccaccg aggtcccctt ccccatcgct gcagagagat ctgcaagtat caaagggggc 26641 ttctttaccc ccaaattaat aactaagtac tcgactcccc tctgagttcc cattattcag 26701 tctctgtgcc ttctgggacg cggaccccct ctccacctgc atttcactcc cttcctcctc 26761 tccttgggtc tccctggctt ttgaacgctg aaaggctggg cccggatggg gaggggcggc 26821 cagtgtgagc tcgaggcgcc ctgtagctcc catcccccaa ccagagatgt gacccctcct 26881 cccagcttga actttcgagg tagccgtttt cccctggagg ttcgggggtg ggagggccag 26941 gagatttatt tggagccttg gaaggggaaa gggttagctg gtgttggcct ggcccagtcc 27001 cccaggcgtg aagaaggcgg ggcactcctt ggccgcgccg ggttctcgct ttcccttctc 27061 gggagacagc tccgtgcctg gggaggcccc tccgaggagt gagtgaccag ggggtctggg 27121 aattggtggc caggtagacg catttgaagg ctgtggccga tggcattcgc cccacgcact 27181 ctctcatcgg actccccaaa ccggtctggg tggcacttga ggctcccagg ccaaaggaga 27241 cctgtaaggg gaactccgct tgtttctgct tgctgcagag atgagattat gggaactgag 27301 tgctgggaac atcccaagtt caagacctac gccagaggca gcttcaaaga gagaatgtgg 27361 ggtggtggga ttagtcccct cctcaacttt cccctccaaa gggtttcttg ttttacaaaa 27421 gggaagtagc agaggatgtc tagctgggat gattaaaatg tgctttaatt tgcatactag 27481 accctgtaca gcttagtagt agataaaatt accccctata tgataaataa atgtgaggat 27541 ctctcaatta attttggcca cgtggacaca atcttttaag catttgtact gtaggtcagg 27601 tcagtttaaa gggagaaatg gaaaagatat ctctgtgtcc agccagttct gggaaatata 27661 aaataagaat gatttttttt ttatatttga aaactcaaca agggttaaaa aatatatcta 27721 atttaaccaa ttgcctttat taaaaataag taaaaattgt atgagattgg gacacccctc 27781 aaaattccag gcagtttagt tagagtataa ttactgtggt gaggtctgcc cccgaggtgc 27841 agttctcctt agcagggata gacttaccca caagtaagta gatgatgatg tatggaatga 27901 tggaattggg agccaacctg tacagtaata aagaagagaa acaggtcaag gctactgttg 27961 tacaaggaaa gctttctagg gagaataaaa ggcaatgaga tttgaaatag tgtgttcagc 28021 tttcttgctt tctgtatatt ttcagcattt tgtcaagcta aaaaataaac ataagctgtc 28081 agagataata ctattaacat tttatcatct ttttttaaaa gtatcataca catcttttca 28141 catcactaaa cagtcacact atcattttaa tgactgaatt tatcattatt taaccaattt 28201 tctcctaatg gaaatacagt tagctacttt ccaatgtttt gctagtcatt atatgttata 28261 aacagggatg caatgcacag ccttttagta cattttgaga tacctgttgg attatttcca 28321 tcatgttcct ggaatataaa ttcctggatt gaagtgtgtg caatctgaaa ttgttgtgaa 28381 tattgccagg ttaccctccg tctgcatgcc caccatcaag gtatgaggat ggtagaagct 28441 ctcgtcgaac cagatggatg aagaccacta acggcttttg tttcctctgg taacagcaag 28501 agacagagcg acatgagaga ttggaccgcg ggctgcactg gagaatttac tggtaggata 28561 attcatccct aaagagattg aagtgagctt cagaatggca aaagaggagc cccagagtat 28621 ctcaagggac ttgcaggaac tgcagaagaa gctgtctctg ctgatagact ccttccagaa 28681 taactcaaag gtcagtttcc aatcactatg tataatggaa aatcccaata taacactagc 28741 ataaacaaga tagaaagtta tttctttctc cagtagaagg aatttggagg taagtcaaga 28801 atctttctgg cttaccgccc tgccatgtct atgctgtgac cttcaaactc cttgtctgag 28861 atgactgcaa gagcatcagc cattgtatcc atctttcagg caacaggatt tgaggagggg 28921 ggaagagaga agtgcaagcc acctacaatt taaggagact tctgggagtc gcacacaata 28981 cttccaatta catcccttaa gccaacattg agtcatttgc cactaatggc aaatgcaatt 29041 ctttagctgg ataaagaagt ctgtgcaaca gatatcatcc tttttttggt tttgtttggt 29101 tttgttgtga attttgccct atctgttacc ctgaaaatga gagacaaaag cctacttcaa 29161 gatttgagtt cattgttcca ctaggaatcc aggtccgata gaaatatgat gtgagtcacg 29221 tatgtaattt agaatttttc tagtagccac atgtaagaag taaaaagaaa taggtaaaat 29281 taattttggc tgggcatagt ggctcccgcc tgtgatccta gcactttggg aggctgaggt 29341 gggtggatca cttgagatca gaagttcgag atcagcctgg cgaacatggt aaaaccccat 29401 ctctactaaa attaccaaaa tgagcaggtg tggtggtaag gcctgtagtc ccagctactc 29461 gggaggctga ggcaggagaa tcacttgaac ccaggaggcg gaggttgcag tgagctgaga 29521 ttgtgccacc gcactccagc ctgggcgaca gagcgagatt ccgtctccaa aaaaaaaaaa 29581 ttatttttaa taatgttttt attcaaccta agatatcaaa aatatttcaa catgtaatca 29641 tttatacaaa attattaatg aaaaaagatt cttgaaacat tgtaccttct tttttgaact 29701 ccctcttcaa aaccccccgt gcgcatattt tacacttcta gcacatttca attcgagtta 29761 gccacttttt aggtattcag tggccactca gtggcaatga gacttcagat agcgttatgt 29821 tttcttgcat atttatcctt attgctttct tctatatgtc tttcagcatt ttgtcaggga 29881 aaagaaaaca gctttcagag ataatactgt taacattttg tatgtcattc tagacctttt 29941 ttctatgcaa tatacctata aaaatacata tgtatatgta acaaaatgaa ctattttgtg 30001 gtgtattttt ttttgtattg gatagcacag cgggtctact cgaagagatg cttgtttgca 30061 atctacaaca ataaattctg gacttgctcc agaatttatt tggcactctt tgtggattta 30121 ccaagctatt tttaaattgg acaagcagct ttctcgccag ctggcgggga ctcttgttat 30181 taaaagttag ctcagtaaga agggtcctcc tgcctgttcc acatttacag ctgattctca 30241 ttaaccatgg tagttacgtt ctataaagtt actgcagaca ctgaattagc acataccaaa 30301 ccatcgctcc taggggaaat acaggattca gttgctgtga gcctcctgtt cacaacattt 30361 ttgtcagctg atcagtacat aacctcgttt tatgtgtatt tctgtttaaa gacatgttaa 30421 tatatattgt tgatcattaa cgttgaactc agggccaaca acagtactat gactcatgtc 30481 ttaaacaaag ctaatcacac acatgtattt tctgcataag gcacatcaca gccttctcat 30541 ggttaggaac gctagagaac acttcagcac tacacttggg gaccattttc aacagtaaaa 30601 tcaccaaact aaagcataaa cgtggcacta aatagaccac agtgacactt gtttaaagga 30661 tgaaacagga aggctcggtg tcaccttgct tgacctcagc tgggaatgta cgtggcaggt 30721 gactccactt tttgctgttg tggacgtgtc cactaatgac ctcaaaagca ctgcaggcca 30781 ggcgtggtag ctcacacctg taatcctagt actttgggag gccgaggcgg gtggatccct 30841 tgagccagaa gttcaagacc agcctggcca acatgatgaa accccttctc tacgaaaaat 30901 acaaaaatta gctgggtgtg gtggtgtgcg cctgtaatcc tagcctcttg ggaggctgag 30961 gcacgagaat cgcttgaacc caggaggtga aggttgaagg gagccaagat cacaccactg 31021 tactccagcc tgggtgacag agcaagactc tatctcagac aaaacaaaac aaaacaaaaa 31081 gcactgccag tattcatttt gaggttacaa ataaatttca gtgaggaggc aaatttacaa 31141 atatggaatc atgagtaata aggattgact gcatatttct tggcactgac tgtatgccaa 31201 gcactgcact aggcattcac cctggccctg tctgaactta gttaatagtc acgttcacct 31261 gccttttttg tttgtttgtt tttgtatttt tagtagagac ggggtttctc accattttgg 31321 ccacgctgat ctccaactcc tgacctcagg caatccaccc acttcggcct cccaaagtgc 31381 tgggattata ggcgtgagcc accgcacctg gccaccagtg gttgtaaatg gtcctgaccc 31441 ttcgtctttc tcctgttagg aagagataat gaatcattga ttgttttttg cattctgata 31501 ccagggaatt tgcctggctc ctgaatcatc caagtttcag tcttcatgtt ctcaaatact 31561 ggccactttg tccctagttt agaagccaac cgcctttgtc tgaccttggg caagtcactt 31621 aggctcactg aatttccatc tcctttttta aaatttttta tttatttttt attgagacag 31681 agtctcactc tgtcttccag gctggagtat agtggtgtga tcatggctta ctgtagcctc 31741 gaacttctgg tctcaagcaa cacctccagc ctcagcctcc tgagtagctg ggactacagg 31801 tgcacaccag catactcagc taatgtttta ttttattttt tggagagaca gcatctttca 31861 ctgtgttgcc caggctggtc tcaaactctt ggcctcaagc agtcctcctg ccttggcctc 31921 ctgaagtgct gggattacag aagtgagcca ccatgcccag ctgccatctc ctcttttcta 31981 aaatgagcaa tgtcaggctc agaggtttgt tatgaagatt atctgagatg ctgtttgaag 32041 ctttagcaca gagggtggcg cataatgttg tcagtctgtc tcagcctggc cttcctgggc 32101 tctgcagtgc tgcactgtgc ttacagagcc tagattgggt aacctgtgtc tacctccagg 32161 ctggtgatgg aaaatcacag tgcatgttac tgaataataa cctctcccac tttccatctt 32221 tccattgcat ccccctctgt tgccatgaga atgaaatgtt aattgctacg ttctaccccc 32281 ctgagtttaa atagtgttag gtcgttcttt tttacctacc tcgtagcatt gtaggaagaa 32341 ataaatgagg aaataatacg agaggtgctt aaagcagtgc ccagcacctg agtgtttaat 32401 gaatgttctc tgttattatt aggtgtggtt gaatcgtgtc ctcccctcct gggacagact 32461 ggctggagct cagctgtact tcctgggaga agacaatttc tgggtttaat catctgaatc 32521 ttggtctctt tgcatatcta aatgactttc tctttgatta tagagcccag actcggaaaa 32581 gactaagaat ggcggggcat ggtggctgat gcttataatc ctagcacttc gggaggccga 32641 ggcgggcaga tcacttgagg tcaggagttc gagactagcc tggccaacat ggtgaaaccc 32701 catctctact aaaaaatata aatattagct ggaaatcgcc tgaacctggg aagcggaagt 32761 tgcagtgagc cgagatcatg ccattgcact ctagcctggg caatagagtg agattccatc 32821 tcaaaaaaac aaaaggctaa gaaaaaggtg gattccttaa ggactgaagt ggaacagtct 32881 tgcctgtttc cttttatatc cctgtgccat acagtcccta agtcctgtgt attccttctt 32941 ttgtcattcc actttcatac ccatctaccc ctcccccttc tcttccctct ctgccatcag 33001 cctggacttg accacagtat atcttgcctg atttcctgca aatctttcta cctgcctctt 33061 gcacatgatt gctaggttca tatttcttaa atcattcctt taatcatgtc atagttctgc 33121 tcaaatgaca gcagtttgtt ccctgttttt tttttcccag atgaaattta aatgagtgct 33181 gtctagcaga aatattatgt gagccacata tataatttta agttttttaa taattacatt 33241 aaaaacaaaa agaaacaaat gagattaatt ttatttattt atttatcttg agatggagtc 33301 tcgctctgtc gcccaggctg gagtacagtg gcgtgatctc ggctcactgc aacctctgac 33361 tcccgggttc aagtgattct cccgcctcag cctcccaagt agctaggatt acaggcacgc 33421 accactctgc ctgactaatt tttttgtatt tttactagag acggggtttc accatgttga 33481 ccaggctggt cttaagctcc agatcttaag taatctgccc acctcagcct cccaaagtgc 33541 tgggattaca ggtgtgagcc accgtgccca gccagagatt aattttaata atatattctc 33601 aatttaacac aatatatcca aaacactata atttcaacag gtaatataaa aaagtattaa 33661 tgagatattt tgcatttttt tttcttacac ttaaagcaca tctcaattct gatgccaaat 33721 tttcattgga aatatttaat ctctattcag attttataaa acttacaatt cagcgaaata 33781 gattcacata cccaacctat tggaaacata cttaaaattt ttccaacaac tgaatcaagt 33841 ctcagcttta aaatttaaat taaattaaat tggaaatcca ggccatcagt cacaggagga 33901 gcatttcaag tgctgaatgg cgcaggtggc ctgtgatgga cagcccgggg ataagctagc 33961 aagcccagtg tcctggtgtc ccagcctcac aagttccttc cgctgcacag cccttccctc 34021 gctggcgcag catactcacg ggtgtactgc tgcctccctt tcctggagag cccatctgct 34081 ctccccactg tggttctttg gattgtaaag atcaggtgac acagggagca tcaaataagg 34141 atgcatgtgg attcgaactg gaaagaagtc aggacctgag acagcgcaag ggaccagccc 34201 acattctgtg tggtgactcc cacggctcct ccgcctctct ttctctttca gctgccccaa 34261 cacagcagga tctcactgga ctctgatgat ggagtgtcca ggctgggcag tgctggctcc 34321 aaggtgattc ctgctacagt catctctggc tagagaagcg gaatagcaag ctctttggtg 34381 gcttttgtca gaatggtggg gtagcaggag gcagtgaggt cagcgggcat tatgccgtcc 34441 tgtggcctgg cagggagtga aattggccat gcctcgagag taatccttag gagaccacgc 34501 tgaagcccaa gaccaatggc tgtggcttct tggcatggcc tcttttattt aagcagggtg 34561 cagcttgcac tttagttcct tcagcatgaa ccacaaaccc ctcctgcctt ccaactcgcc 34621 ctgaaaatcc cactgtttaa accacttggg taaactcttc ttcctaggag gcacatgtgt 34681 tttggggacc tgaattatgt cttcttttca tctgttggcc tgcaaggccg tagggatgag 34741 gctcaggaag gaccagttat gatggaagga ggttctggaa taatggttcc tttcaagtga 34801 acaggatcta gaataccagt actgaggaga gagaaggcgt ggagcctaag gaggagaggg 34861 ggaaacccat ctttgcggat ttgcctatgg ccagctctgc gttaaagcag tctccctcca 34921 ttgagtcaga tcgaaaattc ctttttgccc ggtgaaggaa aataaactat taacacaaca 34981 taaacttgca ttagtcacag gctgtggcca actatggtaa ccaggttgat gacataaaag 35041 catttttact tgatgtggtg gcagctgttg caagcattgg acattctgag tccaagacct 35101 taggtttgtt tttttttttt tttttttttt ttttgaaaag gagacttgcg gtggggcgcg 35161 gtatcccacg cctgtaatct cagcactttg ggaggccaag gcgggtggat cgcctgaggt 35221 caggaattcg agaccagcct ggacaatata gtgaaaccct gtctctacta aaaatacaaa 35281 acttagctgg gcgtggtggg ggcgcccgta atcccagcta ctcgggaggc tgaggcggga 35341 gaatggcttg aacccgggag gcggaggttg cagtgagccg agatcttgcc attgcactcc 35401 agcctgggcg acaagagcaa aactccatct caaaaaaaaa aaaagaaaga aagaaaagaa 35461 aaggagactt gctctgtcat ccaggctgga gtgcaatggc gccatcttgg ctcactgcaa 35521 cctccgcctt ccaggctcaa acgattctcg tgcctcagcc tccctagtag ctgggactac 35581 cagcacgcgc caccatgcct ggctgatttt tgtatgttta aaagagacag ggtttcacca 35641 tgttagccag gctgttcttg aactcctgac ctcaggtgac ccgcctgcct ccacctccca 35701 aagtgctggg attacaggta tgagccacca tgctggccag gccttagctt ttaatgctga 35761 gattttgcca cttggaggct gtgcgactct agacaagtga ttagctttta atgctgagct 35821 tagcttttaa tagacaagtg aatagacaag tgattagctt ttaatgctta ctaatagaca 35881 agtgattagc ttttaatgct taataatgct tttaataata atgcttttaa taattataat 35941 tataataaat aatatataaa taatataata catattataa ttatatacat atataaatat 36001 aatatattta tataatatat aataatgcta tatgaatatt gataactaat attagtataa 36061 tatatctaat ataattatac aatattataa ttatgatttt tatattacat tacaattata 36121 tttttatata taattatata tatattttta atattatatt atatttttat tttatatatt 36181 ataattataa atatttttat atttatattt ttataaatat aattatattt ttatataaaa 36241 ttaaatataa tttaatttta tataattata aatattttta tattttatat ttataattat 36301 aatttatatt atatatttat atttatatat ttatatttat ttatatttat atatttaaat 36361 atttatattt atagtatata taatataata tattatatta tatataatta taaacatata 36421 taattattat attataatta taattataat gcttttaata gtaataatac ttttaataat 36481 aatagacaag tgattagctt ttaatgctaa tcactttaat aatgctttta ataataataa 36541 tgcttttaat gcttaataat agacaagtga ttagctttta atgctgagat tttgcaactt 36601 ggaggctgtg cgactctaga caagtgactt agttcattgc ttctcaaact ttaatgtgta 36661 tgcacgacac ctggggaatc ttgtttaaaa ggcagtctga ttcatcaggg cctgagattc 36721 tgcatttcct ttttttttcc cctttttctg gagacaagat ctccctctgt tgcccaggct 36781 ggaatgcagt ggtgcaatcc atagttcact gcaacctcaa actcctgggc tccagtgatc 36841 ctcccacctc agcctcccaa gtagctagga ctgtgggtat gtgctgctat gccaagctag 36901 tttttaaatt ttttgtagag atggaatctt gctgtgttgt ccaggctggt ctcaaactcc 36961 tggcctcaag caatcttcct gcctcgactt cccataatgc tgggattaca gatgtgagcc 37021 accacgccca gccagatttt gtatttctaa caaactccta gatgatccaa gtagctaagc 37081 ctatttgtgt ctgagtggct tcaactatga gaacctcagc tgtaaagtag gaggcagatg 37141 gtatgtacct tacatcaatg cacctttatg ccgaagaggt cttctcaaga gtgtgtttgg 37201 caggagagcc atgactgggt ctttcctcct ttcacaaggc ttgaccttgg gtggggacct 37261 cagacttgcc acctctccac agctgcctaa gatagctgcc tgggtgactc atgtcttggg 37321 agaagtcttg ttgtagtgac tttttttttt tttttgagac ggagtttcgc tcttgttatt 37381 cttattatca tttttgagat ggagtcttgc tctgtctccc aggctggcgt gcaatggtgt 37441 gaccttggct cactgcaacc tctgcctccc aggctggagt gcaatggtgc aaccttggct 37501 cactgcaacc tctgtctccc aggttcaagc aattctcctg cctcagcctc cccagtagct 37561 aggattacag gcacacacca ccgtgcctgg ctaattttgt atttttagta gagacggggg 37621 tttcatcatg ttggtcaggc tggtctcaag ctcccaacct caggtgatcc gctcgccttg 37681 gcctcccaaa gtgttgggat tacagacatg agccactgca cccggcctgt agtgacttgt 37741 taatgcataa gccttctggc tttacgactt ctgccttgta attcccacag cttgggcaga 37801 ttcattcaac ctggggttcc gatgtgttac ttatccctgg ctctttgtcc tttgcctaca 37861 ggtggtggcc tttatgaagt ctccagtggg tcagtacttg gacagccatc cgtttctggc 37921 cttcaccttg ctggtgttca ttgtcatgtc ggccgttcct gttggattct tcctgctcat 37981 cgtggtgctt accaccctgg ctgctctgct gggggtcata atattggaag gtagcctgtt 38041 ccgtcattca cccctttaga aaataattca attgggagaa aaatactttt ggggccattt 38101 aaaaattttt ttaaggggcg gggtcttgtt atgttgccca ggctggtctc aaactcctgg 38161 gctcaagtga tcctcccacg tcggcctgcc aaagtgctgg gattatggcc atgggctacc 38221 atgcctggtt gtctgccatt cttgtcaaaa tcacatcaag caaggcaact tatcagagta 38281 ggtaaaacat agactctgga gcctgcattc aaatcccacc tccaccactg tgatttgggg 38341 caagtggttt acctctctga gccttaaatt cttaatatgt aaaatgcaga tataataata 38401 caatctcaaa aggttgttgt gaagattgaa acttatggct ccataaagtc cttagcacaa 38461 tacacaatac atgtcacaga gtagcttctc aatgaatgct agctattatt attaagcaga 38521 gaggccattc attccacagt gagttattgg gcacctactg catgccagat gcctcgctag 38581 gtactggaga ccaccttcat ccttcctcct gaagttccac taccaacagt caacttctcc 38641 ctgtattcct aagttaaaat tactgggaaa gcaaatctga tttgctcact ttgacttttc 38701 aaatcagatc acacacccca ggcaagtggc caaagagatg gtggccagtg ggtcaggtga 38761 tcacttctgg tccattccgt ggcttcccag agctaagagc acccagggct gtttatttgc 38821 cccagggaca aggcagttta cacttgaagg agaaagtagg catggtcagt ggatgagcca 38881 aatacagcct ggcagagtgg tttttgagta aatgttgata gatagatggg tgaatgcatg 38941 aatgaatgga tggaggaatg aatgagtacc ataagaaagg aaatttatag agaaaggaga 39001 gagccattac attttggtgg taccatgcaa agctttacag cagacaagac tttaaggatg 39061 cataacaact ttgcctcagg gtgggcaagg aagggccttt tggacagggg aacagcatgt 39121 gcaaagcctc tagaatagtg gttcttgaac ctgagtgtat ttaaatcttt tttttttttc 39181 tagatggaat cttactctgt cgcccagcct ggaatacagt ggcatgatct caactcactg 39241 caacctctgc cacccaggtt caagaaattc tcctgcctca gcctcccaag tagttgggat 39301 tacaggcatg ccccaccatg cccgactaat ttttgtattt ttggtagaga tagggtttcg 39361 ccatgttggc caggatggtc ttgagctccc gacctcaagt gatctgccca cctcggcctc 39421 ccaaagtgct gggattacag gtgtgagcca ctgcacccag ctccaagtgt gtataaatct 39481 tacttccttt atcaaaatac agactttgga tcttttgaga ccaggagttc aagaccactc 39541 tgggcaacat agcgagattc cctccatctc tttaaaaaaa aaaaaaaaga ggccggacac 39601 agtggctcat gcctgtaatc ccaacactgg gaggctgagg agggcagatc actggaggtc 39661 aggagttcgc gaccagcctg gccaacatgg tgaaacccca tctctactaa aaatacaaaa 39721 aattagccgg gcatggtggt gggcacctgt atcccagcta cttgggaggc tgagacagta 39781 gaatcacttg agcctgggag gtggaggttg cagtgagcca agattgcacc attgcattcc 39841 accctgggtg ataagagtga gactctgtct caaaaaaaag aaaagaaaag aaaaaagaaa 39901 gaaagaaaga aaaaagtggt gtgtgtctgt agtcccagct actcagaagg ctgaggtgac 39961 gggatcgctt gagcccaggc agttgaggct gtagtgagct atgatcatgc cactgcactc 40021 cagcctgggc aacagagcaa gactctgtct cttaaaaaaa gaaaaaagag aaagaaaatg 40081 cagacttcca ggcctcatcc agagagtgtt gttttagtag gtctggagtg gggtttagaa 40141 atctgaattt ttcacgagct tccttgatgc acctgattct gatgcaggtg gttggcacca 40201 cactttgaaa aacgctgatc taaaattttg cgaatcttta tgaacaggca gcacacctgt 40261 tgatgttctc ttccttccag ccttttcagg gtccctccct gaaaccatca ctgtggctgt 40321 gggaggagca ctgggctaac tggccactca caaagcaaag ggaaggatgt cagccataca 40381 atacttgcca tgttacttgt tcttcatgat cttttccttc taccttttat tcttttcccc 40441 cccacaggcc ccacattatg aaaaaatgta ttttccttgc acgtattaga tggtaattca 40501 ttaatattcc tattcagagt gggaatggta tattaaaatg gcatataaaa tgctccataa 40561 aaatgaccaa tcagacctta agattaaaat gtgatggttt acaggctgga ttttcaagga 40621 tgccatcaaa agctgcattt agattttgct ttagcctgca tgacctttat acctatagct 40681 tgtaggtaag agtaaataat ggcatttatt atgcactttg agacattaga gataatttac 40741 ctcccccagt actgccttgg ctgactcctc gaggggatcc ctttggaagt ccatctctga 40801 ccttgagggt aagggaatct caaactccag cacttatggg caagacaaaa tgagcaggct 40861 ggttgtggtg gctcacgcct gtaatcgcag caccttggga ggccaaggca ggtggatcac 40921 ttgaggtcag gagttcaaga ccagcctggc caacatagtg aaaccccatc tcctctaaaa 40981 atataaaaat tagctgggcg tagtggcagg cacctgtaat ctcagctcgt caggaggctg 41041 aagccgggaa tcacttgaac ctgggaggcg gaggttgcag tgagctgaga ttgcgccact 41101 gcactccagc ctgggaggca gagcaagact gtctcaaaaa agaaaaaaaa aaagacaaaa 41161 tgagcaaagc agcccatgtg tgatgcagtt tgggaattgg ggggactgga ggcaacggga 41221 gggccagcca gttgttgcca tggtttgatg cccactcagg tacccatatc gcctcctttt 41281 tttaagtgat gctagaaatc tgggtttttg tgggaaatct tcttacttta ataatctgag 41341 ttagcggttg ttattttaaa cagactgacc cagggtctgc cagttttcaa cctgtttgag 41401 atgaaccctt ctcctaacca cgttctcctc ctaggattgg tcatctctgt gggtggcttc 41461 tcactgctct gcatcctctg tggtttgagc ttcgtatcac tcgccatgtc ggggatgatg 41521 atagcatctt atgtagtggt ctccagcctc atcagctgct ggttttctcc caggtaaata 41581 catgtccatg aaataattta tttttttaat ctttctactt ttttgttttt ttgagacaag 41641 gcctcactct gtcactcagg ctggagtgca gtggcatgaa catagctcgc tacagcctca 41701 acctccaagg ctcaagcagt tctcctgctt cagcctccca agtagctggg gctacaggtg 41761 catgccatca tgcccggcta attttaatat tttttatttt ttgtagagac agggtcttgc 41821 tatgttgccc aggctgttct ctaattcctg ggctcaagca atcctcctgc ctccacctcc 41881 caaagtgcag ggactacagg catgagccac cacaccttgc tgagtaattt aatctttgcc 41941 acttgagggg attttcacaa ataaaaccat aatcctagtt gatgtccaga aggaaacttg 42001 taagacccag gtgtttctca tagttcccca tgtatgccag gcatcaggtg ggaatttacc 42061 atggaatttt aaacaaagca attgctgtca ttttccatca cgtatacaca tttcattgtc 42121 atagcaactc tatcagtatt attgattcct tttaaaagat gtccatttaa tttacaaatg 42181 tcaaattagg tggttttctt tggcggggtg gggcagagag gcacaggaga aaaatgaccc 42241 acgagacatg tctgtcaccc aaagcggtac agtatgggag ccgtgaatct tctgtcccca 42301 aagtctgtca atcccctctg ctcttgagaa tgtgagtttc tcaccaactg agtgtgagcc 42361 tcatgtttga atatatagat ttttgttcct acaatatggc tcttcaatgc cagctacttc 42421 atctggggcc ttgagtgaaa gaacagcgat ctacttaaat gctggaggga aaagccagct 42481 atgttggcat tactgagaga ctctgtgttt gtctacaatg tgccacccac ctaagggatt 42541 ttcaaaggag atttatttta atgctttatt tttgtccttt gtgttgattt tagaatctaa 42601 gtcccaagtt catcatactt catttaagct cctgctaaat tgatgatagt gctactgagc 42661 acttgctctg tgccagggac tctcttaaac actttccagg ccttatgtca tgtaatcctt 42721 aacaatttag cccctttcta ttataaggaa aattaagcca agtcccttaa gctgagagta 42781 gcacagaagg gacttaattc caggtgagtt aattccaaag ccctcttaat caagatacaa 42841 agtgactctc taaaacagga gttggcaatt tctggaaagg gccagacagg aagtatttag 42901 taagtaatat agtctctact ggaatttctg gaaagggcca gatagtaagt atttagtaaa 42961 taatatagtc tctactgcag ttagtcaact ctgttgccgt agcatgaagg cagcacagac 43021 caggaagtaa atgcatgggc atggctgtgt tccagtaaaa ctttatttac aaagacaggc 43081 agcttgctgg atttggccaa ccattgctct ataagaactg atatcctgag ttttgttttt 43141 ctacaaaaca gcccttgaac tccttaccaa aagtttatcc tgttattatt cctatgttga 43201 aaaataaagg gtgaagtggg tataatataa aattgctcaa ggatggctct tataggattg 43261 cttaattgaa tcaaccctgg agtcataatc accttttcaa aaatcgcttt gatcactctt 43321 agtcttttgt ttgttcattt gtattttact tataaaatgg ggtcttgctg tgttgcccag 43381 gccattctca aactcctggc ttcaagcaat cctcccccgt tggcctctca atgaactgga 43441 attataggcg taaggcacta actcactccc agtcttaaaa gatgtaaaag gagaaatatt 43501 ggcctatttt gcatattgag ggtgatgctt ctgaaataaa attcctccct gcctctgaac 43561 gctgctgtaa cagggtgagt tcttaaagat tataccacag gaagtagcag aatctgactc 43621 cctacttatc aatgaagaaa ttgaaaaaca aatccaggat ttgtacttga tctccttatc 43681 ttcccctatt gctgccaacc aattgtttcc acgtgggcag aggttcttaa gctttttgtt 43741 ctcaagtccc attttacctt cttaaaaatt attgaggcat ctgaagagat gttatttatg 43801 tggttaattg tattaatagc actttgggag gccaaggagg gaagatcact tgaggccagg 43861 ggtttaagac caacctaggc aacatggcaa gaccacatct ctacaaaaaa tacaaaaatt 43921 agccaactgt ggtggtgcgt gcctgtagtc ccagctactc aggaggctga gatgggagga 43981 tcatttgggc ccaggagttc aaggctgcag tgagctgtga tcacaccatt gcactccagc 44041 ctgggcaaca gagcaagacc ctgtctcaaa aaacaaatga aatgagctga aatataaaat 44101 aaataactaa catgacataa aactatatac aataaaataa attaaaataa acataacata 44161 aattacatta aatcaaatta aatattagct ggtgcatgct tgtggtccca gctactcaag 44221 aggctgagga gggaggatgg cctgagcctg ggaggttggg ggtgcagtga gctatgattg 44281 taccactgca ctccagcctg ggtgacagag ctagactctg tctggaaaaa aaaaaaaaaa 44341 gaatattaat aattatttct ttttaaaaaa ttttcttcct cattgtttct tgttctgaaa 44401 taattattct atgagaaatc atagcataaa ttttaaaaat atctattcat ttaaaaggaa 44461 caatattacc tgttgatata aataacaaat tttaatgaaa agtaactata ttttccaaaa 44521 caaaatgaaa tttgtgagtg gtattatttt acattttaca gatctctttc atatctgcct 44581 taacagttgt cagccggctt ctcgtttgtg tggctgcatt cattctgttg tgctatgttg 44641 tcctgtttgg agtatttgaa gaagacctag catcactaac atgcggttgg gagagggagg 44701 caaagctttt ttaaacgatt gtggatattc tatgatagtc caccaaaact ccgtaagtga 44761 tagtttctta aaggttagct gaaatgtgaa atttttctac tctattttat tgaaatcttc 44821 aggtctgtgt cacactttaa attgatcttt atcatgcatg attttctaac tttacccatt 44881 ggtcaatggg agaatatcaa ttcattgagt catgcagatc ttccaaatat attttgaaaa 44941 attgcactgt tcccttgtga gagaatgtga gtgaaaaagg catataatag gtcatgttat 45001 tgtgaacatg gttttcatct gagaaccctt gcattagcga attctttttt tttttttttt 45061 ttttgagacg agtctcactc tcacccaggc tggagtgcag tggcttgatc ttggttcaca 45121 caacctctcc cttctgggtt caagcagttc ttctgtctca gcctcccgag tagctgggat 45181 tataggtgtg caccaccatg catggctaat ttttgtgttt ttagtagagg tggggtttca 45241 ccatgttggc caggctggtc ttgaactcct gagctcaagt gatccaccca cctcagcctc 45301 ccaagatgct gggattatag gtgtaagcca ctgtacccag cctgcattag tgaattcttt 45361 ataactcaaa aaaagataac tccctcatta aatgatcgaa agcatggaat ggggggaaga 45421 ggagctatat aatttatttc atgatttaat ggaattttgg ccctaaaaca taaaagataa 45481 tacagcttaa atttttaaaa aagaaaagaa aaaggaaaca aaactgtaag tttcccttac 45541 atatgaaacc caactaaaca atatttagat caaatacact gacctgcctc ctatgtagat 45601 cagctagcct ctgaggggca aaaccacact gcttggttat ggagactgca agaccttcca 45661 gggcctgcag ctcaggagaa gtaggggccc tgggcagctg tgtttatttg cagagccacc 45721 aaactcacta ccccatatca taatcttcca aaccagcctg agcataacta gcttccttac 45781 atctttacta agaaaaggaa aggcttcttt cttagttttt tgttatcagg gaggaatggt 45841 ttaggtggca aataatgaaa aaaaccaaag tagatactag tttaaacaca cagggaactt 45901 ttttttattg cttaatgaaa aactggagac agcccagggc tggtgcagcc agtgtggtat 45961 caacatacca ggcacctttt gccttcctgt gttgctatcc ttagcccatt ggtttcctac 46021 cttatggcca caagatagct gctgcagctc cagatctacc agctaccttc aaggcagcag 46081 aaaaggagaa aaaaatggag tctgaccctt ttaaaaatgt gaataaaata tttcccagga 46141 cctctcacca ctatacttca aaagacttca ttgactagaa ctttgtcaga tggccgctgg 46201 ttgctgttct cctgattgta ggtctcacga gatctgatgg ttttataaag ggcatttccc 46261 ctgcacatgc tctcttgcct gccgccatgt aagaagtgcc tctgatcttc ctttgccttc 46321 caccatgatt gtgaggcctc cccagccatg tggaattgtg agtccatgaa acctcttttt 46381 ctttataaat tacccagtct caggtcttta ttagcaacat aagaacggac taatacaatc 46441 ctcccccgaa ctcctttcca tggaaggtgt caagaaacta attgaaatca gaagatatgg 46501 tctgaatccc cgccctgcca tgttttcagc tgtgtggtct tgaacgggct gcttgacttc 46561 tctgatttca gtttttgtcc attggtaaaa tagacaaaat ggctactctt aatctaccat 46621 tctcactggg ttgttgtgag gacacagata attaagaaaa acataataaa tatccaaatt 46681 agaaaatgga aaaggggccg taaccctact cctaacctgg tcattttaac ctcctgtgcc 46741 ctcagtttct tcatctgtat aatggacata ggcctggtgt ggttgcaaga agcagctaaa 46801 aatcaggaaa aagaacatca tgtattcagc tatgcacact tccaacgttg ctctttactg 46861 aggccctaga gctaacgatc tcttcttgtt atccgacagg ccactgacac agcaaaacac 46921 cagttgtgac tttctgccag ccatgaagtc tgcagaattc gaggggcttt accaggaatg 46981 agtgactgct cagaggccgg gcttcttttc aagtactgct ggatcatact caccccttgg 47041 gattatggct tagaagaagg gggctgggta ggcaagcact ccttggctgt gtcctctcgc 47101 tttttcactt acttgtagga tcccgcagca gccaatttag ggattgtgtt gttcttgtgt 47161 tggttcccat gtaaagaagc agcagagaaa tgcgatggtt caacagctcg ccctgcccca 47221 agtatgcaga cttgaccttg gcggggtcct gggcttctag agtctagccc gttgacccca 47281 aagcctcagg gcttatgtcc aacggtccca ttgggagcaa cagtgattgt ttatagtttt 47341 ttgttttcag aaatgagggc agctgttaat tttttcactc atgtgaaaca aaatgaaaaa 47401 aaaaatccaa aacagaacaa gcccttctgt ggtatttgct ttttatcaga aagaacagca 47461 acatttggcc tctccaagtt ggaaaataga gtccaaatgt aactttgtgg ccaagagtat 47521 tttcaaaaaa gtctaaaggt gtagtttcca ctgtaactgt tgagtactgt taagtactgt 47581 taatctttta catatttgca cttggtactg ttgtattgtt ctaaagagct tgtcctttaa 47641 gtaatttctt tctttttttt tttttttttt ttttgtgacg gagtttccct cttgttgccc 47701 aggctggagt gcaatggtgc agtctcggct cactgcaacc tccaactccc aggttcaagt 47761 gagtctcgtg cctcagcctc ccaagtagct gggattacag gcacccgcca ccgtgcccgg 47821 ctagtttttt tttgtatttt ttttagtaca gatggggttt caccatgttg gtcaggctag 47881 tctcaaactc ctgacctcag gtgctccacc tgccttggcc tcccaaagtg ctgggattac 47941 aggcgtgagc cactgtgccc agcctaagcc atttcttaaa ataaaaatgc taaaggacta 48001 gtaagtaaaa ataaaacttc ctatgggatt tcccagtgga attactgagt ggtttatttg 48061 cgtggcattt cattaaatat ttatttgggt gggctttttt tctttactct ttttcccttc 48121 tgctccatac atcagacctg cctattcctc tgtcgatttc atgactggtg ctgggtaata 48181 gggatttttt tagttgctcg caatagacac ccaattccac attactttaa aaagaggggt 48241 ggtagggaga aggattagtt tcactcaacc aaaaagccaa ggggtagatc ttgaggtgct 48301 gctgattcca ggggctcagg tggtgtcatt aagcatctct cacccctcat ctctcagtat 48361 agctttcctc agtgtgagct gcttttgcag gcaggcccgc cacccctttc ggaggtccca 48421 ggagctccat cctgggcaag aaaactgctt ttttccaaca gtatatacaa atattctggg 48481 tttggctctg attggataga cttgggtcat gtgagcgacc ccagaccaat catggtggct 48541 aggaggatgg gatatgcaga tgggacaggg ctgggatggt ggccattcaa gaaacagggg 48601 aggcgataga tcagcccact attaatataa ctattaatat taacagttac tgtttattgg 48661 atgtttcctt cattcataca actgtctgat attatctctc tatgctatct acccagctgg 48721 gttctgcagc tttaataatg aagaatgcaa tggtctccct tctgaagttt tctttctaat 48781 gcaagagtga gtgagcaagc atatctgttg ctatgtaaca gtttaccagc aactattggc 48841 ttaaaagaac aggcatttct tatctcacat tcttcatgtg ttaggggtcc aggcatggct 48901 tggctagatc ttttgtaagc ctgtgatcac agtgtcatca gggctgtgtt ctcatttggg 48961 gtctgactgg ggaaggacca cattccaagc tcacatggtt gttggcagca tccagctggc 49021 aacacgactt aagccctcca ttccttgctg catattcccc tccccatgac agttcagtgt 49081 ggtcaatggc ttcttcaaag ccatcaaggg agagcctctc agcaagatag acgttacatt 49141 cttagggaac gtaatcatgc atgcacaatc atgaacactg gatcacttta tctgtgttct 49201 gctggttaga aacgagtccc agaaaccacc acccactctc atgaggaggg gatcacccaa 49261 gggcaggtaa accacagagg ggaccatggg ctctgcctta agagtcctta ctagggtcct 49321 tagtagtcca ccttgctaca acaagtaaac aaaatattag accaactccg ctgaagataa 49381 gtaaaagggt ggtaagtaat tgggagagca tgctttagcc acagcaatca gagatttctt 49441 ctatttgaac caaaacatga gttgtcacca tgccaagcac acagggaggg tgttcctagc 49501 agagggcaca gtatcccaaa ggcccagaga tgggaagaag cttaatgtgt ttaagggaaa 49561 aaaatgatgt cagcatggct gggcctcatc atgcaggacc tcatggagtg agtttacacc 49621 aggggtcccc aaccagtacc ggtccttggc ctgttaggaa ccaggccata cagcaggagg 49681 tgagcagcag gtgagcaagc attgctgcct gagcttcacc tcctgtcaga tcagcagtgg 49741 cattagattc tcataggaac atgaaccctc atgtgaactg cacatgcaag ggatctaggt 49801 tgcatgatcc ttatgagaat ctaatgcctg atgatctgtc actgtctccc gtcaccccca 49861 gttgggattg tctagttgta ggaaaacaag ctcagggctc ccactgattc tacattatgg 49921 taagttgtgt aattatttca ttatgtatta caatgtaatc ataatagaaa taagctgcac 49981 gataaatgga atgcacttga atcatcctga aaccaccccc ctacccacgc ctcatccatg 50041 gaaaaattgt cttccacaaa acttgtccct ggtgccaaaa aggttgagga ccactggttt 50101 atgccaaaga gcaaaggaaa gacaatgaag aatcttacat gggagagtga cgtgatcaga 50161 cttgggtttt taaagatcac tccatggcca atggatttaa agcaggaagg ctagttggga 50221 ggtctatgga ggcccaagag cagagaggat aattgtgact taaaacttac actaggctga 50281 gcacagtggc tcatgcctgt aatcccagca ctttgggcgg ccaaggatca cttggggtca 50341 ggagttcaag accagcctgg ccaacatggt gaaaccccgt ctctactaaa aatacaaaaa 50401 ttagctgggt gtgatggcgg gcacctgtaa ttccagctac tcaggaggct gaagcaggca 50461 aatcacttga acctgggagg tggagattgc agtgatctga gatcgcacca ctgtactcca 50521 gcctaggcga cagagcgaga ctctctctca aaaaaaaaaa aaaaaaaaaa aaattagggc 50581 aaatgtaatg gagaccttcc tcattttgag ggccagcagc acttggtgat gatggagaag 50641 gaaaggggga cacaaggtgg cacctaggtt tttagcttga gccagtagtg gatagtgaca 50701 ctattgtagg agtgaaggac aagcttcctc tctaccctct gaaggatccc tgaaatgaac 50761 tggcaataga cagatgaaca ggagaaaagg catattcaaa tttattaaca tgcacatgaa 50821 cacaggagtt ccaaaaatat gagactcaaa gaaaggccaa atgattgaaa cttacataga 50881 gggagatggg ggaaatgtag gcaacttgaa ggattgtaaa tgatttttag gggaattgaa 50941 tggacccaaa gagcaaacca caatgagata ccatctgaaa ccagtcagaa tggctattac 51001 taaaaagtca aaaaataaca gatgctggcg agtttgtgga gaaaagggaa tgcttacaca 51061 ttgctggtgg gaaagtaaat tagttcagtc actgtggaaa gcactatcac aatagcaaaa 51121 atggaatcaa aaaatgtccg tcaatggtgg ataaagaaaa tgtggtgcca tatacatcac 51181 agaatactac acagttataa aaacgaatga gattgtgtcc tttgcagcaa catggataca 51241 gctagaagcc attatcctaa gtgaactaat gcaggaacag aaaaccaaac accatatgtt 51301 ctcacatata agtgggagct aaacattggg tacatatgga aataaagatg ggaccaacag 51361 acattggggc ctacatggag gtggagggtg ggaggagagt gaagataaaa aaaactattg 51421 ggtactatgc ttattacctg aatgataatc tgtacaccaa acccctgtga catgcaattt 51481 acccatgtaa caaacctcca catgtactcc tggaactaaa atgaaagtta aaaaaaaaaa 51541 gcagacaata gtttgtaaat tattttcttt agaaactgaa tgggacaagt tatgggaagg 51601 tgaggggcag aactgcactg caaacaaagg tagtcttatt atgcagatta agtcttttag 51661 gtaatctctc agaattactc tcggcaaaat agatgaaaag cccgagcatg gtgacaactt 51721 ttagtctctt ctgcagtggt taattttttc tggttatttt ggatgctaat ccaccacgtt 51781 gacttctgat taaccccagt cccatgaatg cctcctgatt cctacttcac tgttcctagt 51841 gtaagaacat gttgaccttg atgtcatcac acaaattata ggctatgata cattcagcat 51901 tcttgcctgt cctgaagggt tgcctttaat tggcttgctg gagcaagcat accctttccc 51961 tatggtatat gcgtagccct ggtgtgggga gtaacagtgc agagagctac ctgtcctgcc 52021 accacctaag accacacttg tgtctgtaag ttccttcaat gaatcaccca aagtcaacac 52081 aatggatctg tctgcctcct tctttggttc ctcggctccc tcagcatttg ggaatcactt 52141 tgcatattca cagaaattaa gacaattgca tttcttttgg aagaaatttt cctcggtcag 52201 ataagggaac ttccagagaa agcctgatat ggttaggctg cgtccccacc caaatctcat 52261 cttgaattat agttcccata atcaccatgt gttgtgggtg ggaccaggtg gatataattg 52321 aatcatgggg ggtggtttct cgcatcctgt tctcctgata gtgagtcagt tctcatgaga 52381 tttgatggtt ttacaaggga cttccccctt cactgggcac tcattctctc ctgccgccct 52441 atgaagaggt gccttccgcc ttgattgtaa gtttcctgag gcctccccag ccatgtggaa 52501 ctgtgagtca gttaaacctc ttcctttata aactacccag tgccagatat ttcttcacag 52561 cagcctgaga acagactaat acagagcccc tccctgcact caagatggca gaaacaagag 52621 aaggttagaa aatccttggt tctgaagcag tttctaaggc ctttcatttt cctttaactc 52681 agaagtgctc tgcatgccaa agtaacagac tttggggtat ccttgtctgt gtctcaatgg 52741 gggctgtgtg aaaattcaac ccctgctttg ctcatattaa gtcagatatg cctcttggat 52801 actaagagga tgtgtcaagt agacagctgg cttagagggt acgtttgggc tggggagtat 52861 ttttggtgct gagagcttta cacagccttc atttgggatg ctctcccaga gagtagccag 52921 gagatcattt catagcatag ttctgatcac ataaacacct tactccaacc tctgccaagg 52981 gcaaagtgct ttcccacagc acttagaagt gctaactctt tcccctcatg gatcaatcca 53041 tgatcaaaga ccaatcccag ctgggcacaa tgatttacac ctgtaatcat agcactttgg 53101 gaggccgagg caggtggatc acctgaggtc aggagtttga gacctggcca acacagtgaa 53161 tccccgtctc tgctaaaaat acaaaaatta gctgggcatg gtggcacatg cctataatcc 53221 cagctacccg gaaggctgag gcaggagaat cgctggaacc tgggaggcga aggctacact 53281 gagccaagat cacgccactg cactccagcc tgggtgacag agggagactc cctctcaaaa 53341 aaaaaaaaaa agatcaatcc aagcttctgc ttcctgctac ttctttatat gccatcggcc 53401 tctggatatg agagagaaaa ctaaagttca aaggactcac ccaattcagc taagaaatgt 53461 gacatctaaa tcaaggctgt gcagtgaggt tattcattca ggttctgttc tttacatggt 53521 ctgggcctaa ccttcaccga tcaaatgaag tgaaataagg tgcagaaccc tagcacagtg 53581 gctgacacca aacaagccca tcaagaataa ttgtcaagga cgaggctcag tggctcgtgc 53641 ctgtcatctc agcactttgg gaggccaagg cgagagaatc acttcagccc aggagttcaa 53701 gagcagcctg ggcaacataa tgaaacctcg cctctacaaa aatgaaaaaa ttagctgggg 53761 atagtgttac atgcctgtgg tcccagctac tgagggactt aggcaggagg atcacttgag 53821 cccaagaggt caaggctgca gtgagctacg atggcaccac tgcactccac actggggaac 53881 aatgagacct tgtctccaaa aaaaaaaaca aagaaaagtt gtcaaaatgc atggctacat 53941 gagtcattta aaactgagtg aaacaggaag atgtaaaaga cctgttgtta accaagctgt 54001 ggaggaccgt taaggacttc gggctaaggt ccttacaaag atatccttag ccttttagac 54061 acacagctca atgtgtggcc gattggaaag gtaattaatc cccagagcat aaaaacgatg 54121 acttttttac agttataatg ttaatttatc agtatgtcca ggtttcttgt ttgtttattt 54181 ttgagcattc gccatgtgct ggatgctatt tttttttttt ttgagacagg gtctcattct 54241 gtcacccaag ctggagtgca gtgacatgat catagctcac tgcagcctcg acctcccagg 54301 ctcagataat cctcccacct tagcctcccg agtagctggg accactggtg cccaccatca 54361 tactcagcta atttttgtat tttttgtaga gacagggttt caccatgttg cccaggttgg 54421 ttttgagctc ctgagctcaa gtgatcctcc cgcctcagcc tcctaaagtg ctaggattac 54481 aggcgtgagc cactgcgcct ggctgcgggg ggctatttta agcccaatac acagcaatac 54541 atgtttccag tgcgtcctca ctacctccca ttttagagat aaggaaaccg aaagattgac 54601 aagttaaatt gcctaagggc ccatagaatt agaagtggga ttctaactgc agtttgtcta 54661 actgcaaaag acatcctagg ctgggtgcag tggctcatgc ctgtaatccc agcactttgg 54721 gagactgagg cgggtggatc acctgaggtc aggagttcaa gaccagcctg gccaacatag 54781 tgaaaccctg tctccactaa aaatacaaat attgttagct ggacatgctg gcgggcaccc 54841 gtaatcccag ctactcggga ggttgaggca agagggttcg cttgaaccca ggaggcaaag 54901 gttgcagtga gtcaagatca tgccattgca ctccggcctg gatgacaaag tgagaccccg 54961 tctcaaaaat aaaactaact aactaaataa ataaatctag atcagcagtt ctcagttggg 55021 agtgattgtt ccccgccccc gccccccgcc ccaaccctca accagcaccc ccaggggaca 55081 tttggcaatg cctggaggca ttttttttgt ttgagacagg gtcttgcttg gtcgcccaaa 55141 ctggagtgca gtgtcaacca cagctctctt gacctcccaa gctcaaatga tgctcctgtc 55201 tcagcctcct gcatagcagg aactacagat gtgtggcgcc atgcctagct tttttttttt 55261 tttttttttt ttttcagtag agatggggtc tcactgtgtt gccagggctg gtatcaaact 55321 cctgggctca agtgattctc ctgccttggc gtcccaaagt gctagaggca ttttttgtta 55381 tcacgatatg cgtgcctgtg ttgggtgctg ctggcatgta atgagtaggg gccaaggatg 55441 ctgctcaaca ttttaccaag cccaggacag gcctccacaa ccaagaatta tctgggccca 55501 aatgtcaaca gtgccagtct tgacaaaccc taacctcaac aaataataag aattaagaga 55561 attaggccaa gcaagagcac agctggtgga gagggggtca aaattgacca ggtctgcccc 55621 agagtgacca gtggcaatcg tggcatcatg gaaacacaca agttcacagt tgtgaaatgg 55681 ttcaggttgt caaagagttt tcgtatcagt taccatcctt acctttcata acagtccaga 55741 gacgttattc tgctctcatt cccatcccct actcttcagc tgatggaaga ggaaagcagg 55801 gttaatggat gagtcacctg tggaagttct cagagtcagc aaatgactaa gccacagaac 55861 ccggttatcg gcctcctagc ccacgtgcag atcctttcct cccccttctg ctgccttttt 55921 tgttaaagca gaagtgacaa cgacgaaacg ttcggaagat ggactacagt gtcccatggc 55981 agttgggagt cagccatcca gggggtccat acactgcaga gggcatgctc cgtgtatgga 56041 cctccaggct ggagaggaca tgctccgtgt atgggcctcc agcagcacac attgtttgtt 56101 catgtttcca aaactcagga ggagactcaa gccccagcca catgctctag gagtctctat 56161 gcccctgtta ggagctcccg tgctggagtt tcaatcccca gctggaaacc ttagaaaaac 56221 catcagaagg aaacgcaact gtttcctaat gaaagcgttt catctctgag ccagtagagg 56281 gagcgcacat ccaggccatc cgaagctggc tcagccgaga cttcaatagc accgagggct 56341 ggaagaagca ttcaacgcgc gatccttgaa cttgaatgag cctatgcata ggagcctgtt 56401 aaaagccaaa ttcgcgggcg gatcacgagg tcaggagatc cagaccatcc tggctaacac 56461 agtgaaaccc cgtctctact aacaatacaa aaaaaaaaaa ttagccgggt gtggtggcgg 56521 gcgcctgtag taccagctac acgggaggct gaggcaggag aatggcgtga acccgggagg 56581 cggagcttgt agtgagctga gatcgcgcca ctgcactcca gcctgggcga caggtgacag 56641 agcaagactc cgtctcaaaa gaaaaaaaaa aaaaaaaaaa aggccagatt ctggccgggt 56701 gcggtggctc acacttgtaa tcccagctac tcaggaggct gaggcaggag acttgcttga 56761 accggggagg cggaggttgc agtgagccaa gattgcgcca ttgcactcca gcctgggtga 56821 cgagagtgag actctgtcaa taaataaata aataaggcag attctgagtc tgtaggtctg 56881 catcaggacc tgagttcctg cattgcttat cagccaccag atgaagcctt ccaccggcca 56941 caccttgagt agcaaggtaa tagataactt ctcccaccct ctcccataga tggtggacct 57001 gaggcccagg gagtttagac tgcttactag aggccaaaca gaaagactgc agtgctgggg 57061 ctagaaccca tgactcctac agctggtgct cacttcatag catcttccct gcccctctga 57121 ccctgaaaca ggattcaggg aagaggatcc ctaagtgtgg gtgaggagta agagactata 57181 ctatctgctt tgaatcaagc ttctttatac tctatttttg gtcaagttac ttcccttgca 57241 gaagtaccaa tagaaattca aagatgaaca caagcagact ccaattcttt tgggcctgag 57301 ggagccaaag taaaatagat ttgttagtga agtccctctt atttttggtt agtacaaatt 57361 tggcaagtca tccaaagaaa agcttacata taacttggta tgtttttatg aatgcaaatt 57421 atagccatag actttctcct aagaatgaag tttcttagct tgaatcagtt tagatgaaaa 57481 atgcctccct taaaaaataa gagctcctaa actctcacta aacatgaaat gcagaggcaa 57541 gtggttctag ggttgatttg gggaacacaa tgaagttatc aaggatttag gctttccatt 57601 ttttcgctcc atcgttctca gggtgtcttt agggttattt cctcatagtt acaacatggc 57661 tgccatagct ccgggaatca catcacataa tcatatccaa aagcaggtag aaaaggggtt 57721 agctgcacat accttttttt aaataaggaa aaagaagccc cctctcaact tccccttaaa 57781 tctcattgaa cagaactgta ttacatgctc acctctaaac agaagggaaa tgggacgtta 57841 tcttggctat ctcatgccaa gcaggattgt cccccagggt gtgctgtgca aacaaagtcc 57901 aggttctctt agcaaggaag aaggggatga ggtgggaagg aaaaagaaag ggcaagaaaa 57961 tcccttttaa ttaagcccta cactatctgc cccagtacac ttgtgttttc gctgtattta 58021 ttcacaaagt aagggtcttc atgtgctgtg tgagcaaagg tcaggaccaa ggaggattta 58081 gttaaatctg gaatctcagt caggggttca tactcttgag ataaggaaaa gtaaccaatg 58141 ctgcctaaca gcccagcacc cacactgcag tccctagact tagcagatgg aacttgctat 58201 aatgaggacc atatgtattc atttgctaag gctgccatag aaagtaccac aaactgggtg 58261 gcttacacaa cagacattta ttttctcaca atcctggaag ctgggagtct aagatcaaga 58321 tgccaagcag gcttggtttc tccttggcct gtagagagct gtcttctccc tgtgtcttct 58381 gcctgtgtct gaatttcctc ttcttataag gatagcaatc ctattggact agggctcatc 58441 ctattaactc atgttaatgc aattacctct tttttttttt tttttttttt ttaggcagag 58501 tctcactctg ttgcccagac tggggtgcag tggcgtgatc ttggctcact gcaacctcca 58561 tctcccaggt tcaagtgatt ctcctgcctc agcctcctga gtagctggga ctacaggcct 58621 gcaccaccat gcccggctag tttttttttt tttttttttt tttagtagag acagggtttc 58681 accacgttgg ccaggctggt ctcgaactcc tgacctcagg tgatccactc acctcagcct 58741 cccaaagtgc tgggattaca ggcgtgagct gctgcactta gccacaatta tctctttaaa 58801 gacccttcct ccagccatat tttgagattc tgatgttggg acttcaacat atcgatttgc 58861 gggtcacaca attcagtcct ccaattcagc gagcatactt ctatttcctc tgcaagtctg 58921 atgtgcacct tgctttaaaa gagatacgat tatcatgctt ctcatcctaa ttttcaaagg 58981 gctttctatc atgtgtgaat ctcaggttcc agtgaatgcc ttcttcgtgt ctgtggatgt 59041 catcatatga gggtattttt tgacctgttt ctgtgacgta ttcatagagt tcctgtaatc 59101 gaacagtcat ggatcagcat tcagaatata taaagaaacc ctaaaaatca ataggaaaag 59161 gacttttaaa atctagtata aattaaacaa aggctataaa cagacaattc agaggaaggg 59221 actgcttcct gctgtgcaga tggtcatctt gcggtatcct cacatggccg agagcagaga 59281 gaggaagctc actctccttt ctcttccttt aagggcactt atccccttta tgagggcacc 59341 atcctcatga cttaattaat tcccaaagcc ccacctccaa atatatcaca ctgggggtaa 59401 ggacttcaaa aacatataca ctgggggtaa gaacttcatg tgaacttgtt gtggggggac 59461 acaattgagc ccacaacacc gagaataagc ctgagcacac accttgtgcc acaggctgca 59521 tgttctacct ttccgcaacc cctgtgctgc ccacgatgag acagcttccc actcaacaca 59581 tcatcacttt caaaccagaa gcaatgttgt gctgtggatt ttctggttat tgacacagtg 59641 gtgggcacta tcagaggtaa ttcacatctt tgcatccaat gggagagaaa ctaaattccc 59701 tgttgactta aaagggagaa actgaggcag aattaaaagt agagagtttt ttggggccaa 59761 gtttgaggat tgcaaccccg gagcatggat tcaagttgac atgagttata gactcctatt 59821 agcagcattt acaagtggat ttctaaaggc aaaaaagggg ggacagggag tgagctgatg 59881 cacagttctt tgtcaggaat tctcattgtt tttcagaaat aacattgatg tgtgattggc 59941 tatacattgt taagctatag ggtgtggtta tatgtctggt gcagcattat taggttaact 60001 taaagctact tgtggcaata gtaagcagtt tcaagagatg gatacttagc tcaagggagg 60061 agtaggatgt gatgactgtc tcatcttttt tccttttttt ttttcttttg agatggagtc 60121 tcactctgtt gccaggctgg agtgcagtga catgatcttg gctcactgca acctctgcct 60181 cccaggttca agcaattctc ctgtctcagc ctcctgagta gctgggacta caggtgtgcg 60241 ccaccatgcc cagctaattt ttgtattttt agtagagaca gggtttcacc atgttgtcag 60301 gatggtctcg atctcttgac ctcgtgatct gcccaccttg gcctcccgaa gtgttgggat 60361 tacaggtgtg acccacggca cccagctgac tatctcatct tactgtctct ggccctgata 60421 gtttagaagg acttatattt ctcagatcaa agttcttttc ttttctcacc cctgtctcag 60481 gacctgtatt ctcagcaatg cctcttcttt caactacgat acatcactgt ttggaaatgg 60541 aactagttgc aacttaagtg ccaagttcct gaagctataa actgctcccc atgcttttag 60601 atcctatgtg caagcaacag tgtgtgtttg taatgtgttg tctgtgatga agtgtggtag 60661 gctctgcctt cactagcaca gtgatttcta tccagggata tgtgtcagag tcatctgagg 60721 agcatggatt agttttcttt cactgcttag acaaattatt acaagtttag tgggttaaaa 60781 caacatagat ttattctgct acaattctga aggtcagaag tttgaaatca ctttcactgg 60841 actaaagcca aggtgtcggt ggagctggtt ccttctggag gatctagggg aggatccatt 60901 tccttgcatt ttccagcctc tagtgttccc ctgtatccca tggcttatag ccctttctcc 60961 atctcaaagt gcatcactcc aacctctgct tccatcattg tactgctttc tccccatagt 61021 caaaactccc tcttacaagc acacatgcag ttatatgtat ggcctacctg gataatttag 61081 ataatttccc catctcaccg ttttacctta attatgactg caaagtccct tttgccatat 61141 tagggaaaat tcacaagtcc cagaggttag cacgtgaata tctttggggc catgatttag 61201 cctcctatag agcctaaaca tgcttagact ccactcccta aagattctag tttagtagat 61261 ggtgtctaag tttttgtttt ttgttttgtt ttgaaatgga gtctacctct gtcgtccagg 61321 ctggagtgca gtggcataat cttggctaac tacaacttcc acctcctggg ttcgtgattc 61381 tcctgcctca acctcccaaa tagctgggat tacaggcacc tgccaccatg cccacctaat 61441 tttttgtatt tttagtagaa acggtgtttc accatgttga ccaggctggt ctcaaactcc 61501 tgacctcagg tgatccgcct gcctcagcct cccaaagtgc tgggattaca ggagtgagcc 61561 accatgccca gctgatagtg tctagttttg taggtggtgt ctccctaaag attttagttt 61621 ggtagatggt gtctaggcca gaggtctgta caatcaggtg tacattctac cacatatgca 61681 cttgattaca tatgccttgg tatatagcac acagacttat tcaagtcaca gttattgtcc 61741 ataatgttat ctgagttttt atttagagtt agaacagtgc tattttcttg atgttgagat 61801 actccaatgc tgactgggca gtgcctacat caatacaacg gttatcttgt tgaaccctca 61861 cacgcttcca aattaggttg tgttcccttt ttacagatgc ggaagcaggc tcagagattg 61921 ttctatctgc ccgaggccac acagccaggg agacgtagat cccgaaccca aacccaactc 61981 ttgctctaga gcctgcgcat cccacagaac ttcctaatgc agaacagggc aggggaaagt 62041 agtacattac attacaggtg ccttcccagg gcacacaagt ctgttttcaa ctcctgcaac 62101 ctccaccatc aatcagaact gctcaaacta ttagaacccc aaagctccat cattctactc 62161 ttgacacctg tatgacaatt acagcccatc actacgtgat aaaggagatt gcaagcacac 62221 taacgtggtt gaagcagcag tctctcagga tttaagctcg actctcttcc cacacagttc 62281 ctgtctgagg aggctgcctc ccccatatat agtaggatgc ctactttcag aaggcaagaa 62341 cacgacttta gtcaactgct cccaggctac catggcaaca agtgttcatc ccggcagctg 62401 ctccacacca gcttggcttt ccactctcaa agttctgctt tagcattttt ggcaaacaca 62461 tcagaaccaa gtcacgctgt gggctgcctt gagacccagg ggactttgat tcccaatatg 62521 actctataca agtggaccaa cttgcagctg agaggctcta gctctgaaat caaggggctg 62581 tgacatcctg gacaagcttc tttagggctc aagttgctca tctgtaaaat gtgcactttt 62641 aagccatgtt tttcaagttg tgggtagtga ctcagtattt taatttccta ttgttgctgt 62701 aacaagtttg tttcatgccg ttaagctatc tcacaattct ggaggtcaga agtctaaagt 62761 ggtacagcag agctgtactc tttctagagg ctctatggga agatattttc ttgttcccct 62821 actagaggcg gcctgtgttt tctgtgactc tctatcattc caatttctgc ttccattgat 62881 acatcctatt ctgaccttgt ttttattttg agacagagtt tcactcttct tgctcaggct 62941 ggagtgcaat gtcgcaatct cggctcactg caacctccaa ctcccgagtt caagcaattc 63001 tcctgcctca gcctcccatg tatctgggat tacaggcgcc cgccaccatg cccggctaat 63061 tttttgtatt tttagtagag atggtgtttc accatgttgg ccaggctggt ctcgcactcc 63121 tgacctcagg tgatccacat gcctcagcct cccaaagtgc tgggatcacc ggcatgagcc 63181 accgtgccct gctgaccttc ttgttttata gggacacttg tgattgggcc caccgggatg 63241 atcttaccat gtcaagaccc ttagtcacat ctgcagagtc catgttgcca tgtaaaacta 63301 atattcacag gctctaggga ttaggatgta attcctagac atgggtcggg ggtaggtgtc 63361 attcagggtc aggaagtcat ttaagtggcc atttttaaaa atgccattaa aaagcatggt 63421 acgtgagatt tagttatgat atacaatgca acttgataaa gtcacacaga attttgaatc 63481 taaaactatt tagcagaatg ccttggatcc ttaaaagaag gtccaaagaa ggcttttgat 63541 gagacaattt gagcttttga ccctattact ctagaaatgc cagaccctct acataaggta 63601 ggtgtggttc tttccctaca tcagagtctt atctgtaaga cctaaccaaa ggatttagaa 63661 agctaccttt taagtctgtg tggcagaatt ctagaacagg tgccgggcac ggtgtctcat 63721 gcctgtaatc ccaacacttt gggaggccga ggcaggcaga tgacctgagg tcaggagttt 63781 gagacaagcc tggccaacat ggtgaaacac catctctact aaaaatacaa aaattagctg 63841 ggcttggtgg caggtgcctg caatcccagc tactgaggag gctgaggcag gagagtcact 63901 tgaacccagg aggcgaaggt tgcaataagc cgagattgca cctatagaaa accacaagat 63961 gaccagatag acatcacgat attcaacctc aaggttagat ttttgaaaaa cccaagagta 64021 taatgactga gctgacattt aaatccagcc actagaagct ttcatcactt attcttcctg 64081 ccatacgtag atttgaaata tttccttgca aaaattttag actcaagttt tctagaatac 64141 aagtgaattt ccatatatct tgaaggtctc taagagctgc agtataaagc agacactagt 64201 catgttgatg acataattac ttagaataag actggctacc ctcaggacga aggcctcagg 64261 agacggtcaa gatcgaatgc aggtaatgga aaactcactc taggctcagt tgaatggtgg 64321 agtctgattc ttggaatgtt ataactgagc cttaggactc tggcacttta gtattccttt 64381 ccctaccaaa gaaagattgc ttgtctttag ttataagtgg gaaggtaact aagtcctttt 64441 aagctcagga aagttgtaag cttgagtgta acttgagtta ggaattacct tgaagtaagt 64501 tcttatatta ctttcttagg aacatctggt ttcccttccc ttaaaagaag gggggttgga 64561 tttatgccag tgatcatggg tagcttgctg gcttcttatc ctgtgtaaga gcttgtagac 64621 tcaagtgtac ccccagagct gggctatagc taaagcccta ctgttgactg ggagtgtcag 64681 aggactcatg tattggggtt agaggatcag aaggggccaa atgacaggaa acaattatac 64741 catgaacttc cctatttgta tctatataca aatcctgccc agaggattgt ggctaggtac 64801 tgcctctaca ctgaggaaca aaagaaatga tcccttgtca tctaatctta ataagaatgt 64861 ggaaggcaag aggcttattt tgactgcttt atttagaagc ccatttagtg atttgacaca 64921 gtccttttct cgtacaggta gtagatgaag cctagagttg ccaccacacc tgcaaaggca 64981 gccacagcag ccacagcttt ggaaccaaca tctccagcag tcttgtgccc tttggtgtcc 65041 atagcaccta caaaggaagc agacatttga gtcttaagta tttttaaata agggtaagtg 65101 gaagccttac ataaggctcc ccagagaagg gggtaaaact aagccttcaa caggcttatt 65161 tcagaagttt gcaacattga ataaacttac ctcgtgagcc aacctcctcc cctgtttcac 65221 tcctactctt ctccccactg ctcccacttg cttttagatc agatgagccg acacctggga 65281 agaacctgag agtttagtac aacatcttca tgctagtcga tgtggttagc tgtcccctga 65341 gggtgtgatc ccacaggttg ttcattaagg ctatctcagg taagggtatg tgttaacagg 65401 cacaaactaa tacacactta agtttggcca tagcatgtta agatgcacat gctctgacct 65461 aatcttgcta gaaacatatc ttaaatcttt gactactata caagaaattc aagaggattt 65521 tttttttttt tttttgtggc cgggtgtggt ggctcacacc tgtaatccca gcactttggg 65581 aggctgaggc aggcggatca cttgaggtca ggagtttgag accagcctgg ccaacatggt 65641 aaaacctttt ctctactaaa aataataaaa ttagctgggc acagtggcag gtgcctgtaa 65701 tccctgatac tcgggaggct gaggcaggag aatcacttga acctgggagg cagaggttgc 65761 agtgagccaa gatcacacca ttgtactcca gcctgggtga caagaacaaa actttatctc 65821 aaaaagaaaa aaagaaggat tctagtatgg actgaggaca aaactagaaa taaacctata 65881 gatatccatg aatgaggatc atgactaagg tagagccacg tggaatccta agggttcttt 65941 gtacagatat aaggtagata agacagatta gagaccaagt tctagatcag taggtagaag 66001 ctatgatcct acttgtgttg aagtgatttc tagaagggta cacaggagac ttaatagaag 66061 cctacttctg ggaagagact tagtctttta ttcttctatt gtttagactc accacgggta 66121 catagtatat gtaagcactg accgcttggc ataaaaagct aagaacttga atccaggaag 66181 tactccaggc tttaggtcca tacctctgaa tgaggagtca tctgacaaca ggagaatggg 66241 tcctgggagg ctgactgtca ttttctctgc ttcagtggcc cctgtggctg gagacagatg 66301 atcaagtttg tgtttggcct ctggaatagt ggttcgagag gtgtgtcaat ggaatgctta 66361 gaagtagaag ctctctggag tattttgttc atgtggtttg tcttgccttt gcaggcgagg 66421 ttatgccctt tactgtagaa gtgggatgga ttcttggggt tgggtgtggt ggctcatgcc 66481 tataatccca gcacttttgg gaggccgagg taggtggatc acctgaggtc aggagattga 66541 gaccagcctg gcctggccaa catggtgaaa ccctgtctct actaaagata taaaagttag 66601 ctgggtatgt tgggggggtt cctataatcc cagctacttg ggaggctcag gcaggagaat 66661 catttccctt gaaccctgga ggtggaatga gccaagatca caccatctta ctccagcctg 66721 ggcaacaaga gcaaaactgt ctcaaaaaaa aaaaaaaaaa aaaaaaaaag gatagattct 66781 taaagaactc atcgtctata gaactagaca agaaactgtt catcttagac ctgtcacatg 66841 agtaaggtag actggaccaa gttttgactg ggccaacaag agctctgggc agtaaataaa 66901 gtatgagtta taccagtcct taagatcttc tattctaaaa agttgatact tgctttgaat 66961 tagactcaca gaaacaggaa tttgaagttt ttacctcgcc tgtgcctaga ggacacaggg 67021 caggtcacag aacacagtgg ggagtcaggg gagagtcgat tacagattaa ggcactgcag 67081 tggaagtaga cctggagaca gaagagagtt acatggaatc gatgcactaa catttaacct 67141 gtccctggtg acttctgaat cacttaccag gctagagagc acgtgggctt ctgatacaaa 67201 ggcaaaagcc ttcatgtcaa acctctgata gtgatcagga tgggtcacag aggagccgac 67261 tggatggaag gtggtctggt agttgtccag gtcatatgca cagctaggag gtatgcacca 67321 gggagggatg tcagtcatat gcatcttgga gtcaaattca atgtctgccc agatttagtc 67381 atacgtaata tcttcagaac tattgccata tttaccttct accagcacct ttgttttact 67441 cagcatttct tacctgacct ataataccta tgaaatcata agttgtatag actagttata 67501 ttttaactat tttagcagta cctgtttgcc aactacatga gaagtacaaa ttatatcatt 67561 tcagaagaca ctggtttctt ggccaggtgc agtagctcac acctgtaatc ccagaactgt 67621 gggaggctga ggtgggcaga tcacctgagg tcaggagtct aagaccagcc tggccaacat 67681 ggtgaaaccc cgtttctact aaagatataa aaattagctg ggcatggtgg caggcacctg 67741 tagtcccagc tacttgggag gctaaggcag gtgaatcgct tgaatctggg aggcggaggt 67801 tgcagtgagc tgagattgtg ccattgcact ccagcctggg ggacaagagt gagacctcgt 67861 ctcaaaaaca aaacagaagg cttagtaact gagctaccac agcttcaaga aagcaaaggt 67921 gcagttttag ccaagtacac actattccat aagaatgggt gagatccagt tgtaggacag 67981 gacacggcag gagagctgta aggttgaaca cctatgtgta caagtgtgtg gctgcaacct 68041 cagagcatca gtgctcagat tctagagtca gatggatatt ctaatacaag ctccattgtt 68101 taagtattgt tttgggctgg ttatttaggc agcaagctgg ggtttcttca tgaaagacga 68161 agactagtac aactagtgga tgtggaatac atagcactgt gcctggcata tggtaggtgt 68221 tcagatggat agaggctaga gaccagtctt tccagatgat caacaccata gtagagtcag 68281 actttctggg gctttggaac tcattcctta agattcaaga ggtgattgga aggcagcatg 68341 tggccaagtg tggtggttca cacctgtaat cccagtactt tgggaggcca aggtaggtgg 68401 atcacctgaa gtcaggagtt cgagaccagc ctggccaaca tggcaaaacc ctgtctctac 68461 taagatagca aaaattagct gggcttggtg gtgtgcccct gtaatctcag ctacccagga 68521 ggctgaggta ggagaattgc ttgaacccag gaggcagggg ctacagtgag ctaagatcat 68581 gccgcagcac cccaacctgg gtgacagaat gagaccccat ctcaaaacaa aaaaaaaaaa 68641 aacagaaggc agaagagtcc caattagaaa acaaggacta aatctggatc tcacatgact 68701 gaatgctgct ctaaagccca agagggttta ttcccattac tggcaagttt tgatggatta 68761 gctgggtaac ctgatagtac agggagtacc tacccatcca cgacaacgtt ccactggggg 68821 aaagagtctg gatccatggt ggacgtcgcc cagcagtcat ctaagaccag cttgatgttg 68881 gggtcatccc tgtttaggac tctcacttcc atgtaaattg gttggcggag gaatctcact 68941 agagggtact cgttttcccc ataaggttgt tggtaggaat tatctgaaat tgaatggtat 69001 tcaagtagtt aatggctgca acacagattc ataggtcatc actgctccat tagtctggca 69061 gtatcaggat gttcatttta gggagaagtt tgagatcatt taagcaattt cacagactgc 69121 aatctcttac ctgggtagct ttgcaggatc aaggtaaatg gacccaactt cactgaggcc 69181 actggaggag taaggctttc aacgttgatg tttagtagca tgtcattcct gctataagaa 69241 cacttcactg tcattctgta agagtttgga gggaaggtag gtacagaaaa tttcaggtta 69301 aaaaattgct tgagtttaaa ctagagctta agagtcaaat agcaatttta tctcacctga 69361 actcactgtc tctagatatt ttgcttggag gaaaatccgt ccagagagca tgtatttcgt 69421 tttcatagac gactttatca tcttcgaact taaatgtcag agaaagtggg ttagcagcca 69481 gttatgtcaa agatgagact taacctcgtg ccatctgcca gtactaacct tatatctcgt 69541 tccacatcca ttcaggggta tgtggaaccg taccagcccc tgagactgag cctcaaagac 69601 aggctggcag gatgagtttc ccaccctcag agtacccagg tcaagagctg gttgtgtttg 69661 gtagctgtag acctcgacgt ccataaaccc atcctgggtg cacagctccc ctgtaactag 69721 acagcggtga aagtttagag aaaataagtt tgtctgccct gtgatactgt catctcccaa 69781 cctgatatgc tacgagggag aaattctaat ttagcaggag gcatcagaat ctgtcttgaa 69841 gtctttatga gggaaccttg atcagcagct tgaattacaa agggagctga agttagttga 69901 ttgttctatg tgagagtagg acttttcatc caaaggaggg accagtagat gtaggaagaa 69961 gtgctatatg gtataagacc agtgagaatc ctgatgaagg gcaggagttg actcaaggtt 70021 aaacttaatg tggctttaag tacagcgcct atgttattgt atcctttctt tgtatactca 70081 caaatcctga gtagtttggg gagaaactga ggcaggagct ggttttccta tttgatctga 70141 cctgtgcaga tcacattaaa cagaagcaca tggtggaggc aatgctaaac aagagatatt 70201 cactctattt aagaatttgt aggaattttg gggtgggggg gtacttaaac ttaattctaa 70261 ttagaagttt gaatattctt acctagatgt caggttcagg ctcctaagcc aaaatatcta 70321 gattgtttaa ccatggagca gttgccataa aacctaccta tgtagggatc ttaaaaatat 70381 catacttagg accccatcgt agcagctaca ctgtttctaa tctgactagt ctttagtttc 70441 ttcagtctat aagcaataga tcagaatctg accttctata catttgccca catgtacaag 70501 agccaaggga gaaaaaaggt agaacatgat tttaataaaa ggtctttacc tatagaaacg 70561 ggtgactcac agagacactc aggatagatc accatggata ctgtctctgg ccgaaggaga 70621 aaggtcagct tgagtgaagc taagtagaac tgatggagta ggcatttttc agataactga 70681 aatgagaaaa tgtcaattag cagccacagt ccgttggggc ccggaaccgt cacagggagg 70741 caggaagcaa gtccactaaa tacttgcagg acacactcca gacttatctg ggctcccttt 70801 aaaccaaaac aagagagacc ctatcacttc ctcagatctg gatgtatccc caggcagatt 70861 tattgcttta aactcatgcc ctggagtacc cagcactgta ctctttgtac gaggcactaa 70921 agcttgaatt agcagatcaa attccaaaga atagtggaat cctctgccaa gccaggtctg 70981 agtcaaaagg actctacaga tgttataaaa tggtattgca taatatgtac ccccaaccta 71041 agattattct ctggtatccc tggcagttct agtgaagact aacctcaaca gcaacagtac 71101 gggacccttt tgtttagatc aaggtgtcct gttcgttgac caggacagat tcatcattag 71161 gtatttgaca agtagttgct actactcagt tggggcctta gacagtttta agacttagaa 71221 agacttagac gatttttcat gacagaagtt aaagaggttt tgtgttagag aatgaacttt 71281 agttacttga tatttcactt atacaagatg atcaaatgag gccttttgga gtttgaatct 71341 gttatgttgg caattagagc tcacgggcaa gttatcagcc gtatgaggac tacccacaag 71401 agtatacttc aagtcaggac ccggtggagt tcagtggttg acaattgaac atactttcgt 71461 tttgagcaga gttttgctga aatgcaattt catgccattt gttgcttcta gatcaattcc 71521 attgtcatgc agctggctca catcaatgtt ctggttttca aagctcacag acttaagctt 71581 cccaggaaac tctggtatgg tgagagtcat gtgtgtggca ttgcaggtca caggatctag 71641 aaggaatgac aacagaatgc ccattaccag ccttgattaa gcttctagga caatgtctcc 71701 cacactgagg ttgcagctat ctgggacctt acctggtgca caaatagctt gtgaagagaa 71761 gatcaccttc tgtccaggag atataaatgt aagcttcaga gacaccatgt agagatgact 71821 gttaccttgc tagggggaga aataacagtg atcttcaaaa acattggtta cttgaatctc 71881 taaccagtaa gaatcatctt ggcaagtata tatccaagta atgggcacta ctttgacaca 71941 agtaggattt aagcatttgt ctaatgacta atttgtctaa tctggaggta tattagtttt 72001 aaccttgttc taacatctgg aagaatgttg tctagatcac ctgtagccaa tgacttctag 72061 gcagataaga tctgtcctta ccccatatga ctattttaaa gtatcagcag ccagacaatg 72121 ttttcaagcc tcaatccatt tgtgttctta ctgggagata ggactgttct tgaagtggtt 72181 ggcatttgag cacttacatc tcaggacctg tcctaagtac tttaggagat taattcattt 72241 agtgcagtgt tcagactatt ggcctggaaa gcaccaatac ttgctataaa acaatatagg 72301 acaatgacaa cccaaacaga accccttaac ccataaagct ttcaggaata tgcttgttct 72361 gtgttgtact atagatttaa gactatggga aagagactag attgaggtaa ggattatgta 72421 tttttgagaa atggtctcgc tctgtcgccc aggctggagt gcagtggtac aatctcggct 72481 cactgcaact tctacctcct gggttcaagc gattctccca cctcagcctc ctgagtagtt 72541 gggattatag gcatgtacca tctcacccag ttaagttttt gtgtttttag tagagacagg 72601 gtttcactaa acagacaggg ttggccgggc tggtcttgaa ctcctgacct caagcgatct 72661 gcccgcttca gccttccaaa gtgctgggat tacaggtgtg aatacccagc gaagtgtttg 72721 gcacttaata ggcattaaaa tgggttgtat tataaattaa tgtgaagaca tggttgagat 72781 catgcttcaa tttaggaata ctggcctgtg ctttggtggt acaaagccag acttccacac 72841 ctaccacata gtgagtcact ccagtggcat tgaatggcac atggaaggtc atcctgtggt 72901 tgtcaatcaa gaggctgaag ccttccttca tggcctctgg cagggtcaga gttttggctc 72961 ttgcaccatc accaacctca atgctccatc ccatctgaac tttggtcccc tgaagtcaaa 73021 aagctttgtt tagagggctg gtgctccctt aaccaagtct ctggtagtct aacaatttat 73081 caggttaaag gtaatatcac ctttaacata gaattaaagg ttattaaggt ctgattttta 73141 aaatttgctt cataccttac tgtcgtcagc caagccagag aagacccgtg gcaaggaaaa 73201 ctggaagaaa agaattgtga tgtaagactt tgatttggag gtagatgttt ccgaattcat 73261 gccagaaatt gctgtgtggt tgtcatacca aaaggcctgc tgtgggatgc agtggaagaa 73321 gcacatgaac cctgactgtg cccttagatg tcattatact tctttgaact tctgatccca 73381 agaggcagct ggaagatcta agctcctttc cagttgtaag ttgggacaca tttctccccc 73441 caagatagag tttcgctcat cgcccaggct ggagtgcaat ggtgcaatct cggctccctg 73501 caacctccgt ctcctgggtt caagcgattc tcctgcccca gcctcccaag tagctgggat 73561 tacaggaatg tactaccata cccggctagt tttttgtatt tagtagagat gggttttcac 73621 catgttggtc aggctggtct caaactcctg acctcaggtg atccacctac ctcagcctcc 73681 caaattgctg ggattacagg catgagccac catgcctgac caggacaatg ttcttaaacg 73741 tttgcccttt caccaatgtc catagaaatc aggtactatg taaaagccat gcagttctat 73801 cattgttatt gagttgcttt tatgtcagtg ttatttttag ttgtccattg ttgacattag 73861 ccatgttaga ctctaaaccc atgaagatgt gactgggtcg ttttgttcac ttattgtatc 73921 ttagcatcta gggtagggcc tggaacatgg catgtattcc atttgtcaga tgattagtgt 73981 gagtgcaact accaataaaa ccctattgcc atgcatgatg tgccaagtac catgtatata 74041 ccctcagtaa attcttagag ccaccctgag gtaagcatta ttgtcccatc actttggact 74101 tggaaacaaa gcctcagggt aaagttaaac ctcatcacac agaaaattag caagtacagg 74161 attaaatcat ggtctgattt ctaagccccg ccctggttag atcatcatca tattccccct 74221 gcagtagcca tataccccga gcagtcagcc cgtttcactc acagacatga aatccttctg 74281 gcagattgta gatgctgaaa gcccctgggt ctcttctact tgcatagctg gacagaagaa 74341 ctgatacatg acagctccgt gtcttaaggc agcactgttg ttcatgactc tgatggtcat 74401 ctggtgtcca ccatgctgtg tacagatagc acagtgggaa cagagtaagt actgtacccc 74461 catgggattc tagaatacca tctgacccat ctgagtggga aaacccttaa agttactaat 74521 accacaagtg ctggaaggta cctcattccc gtaccaggag actagctagc ccattgtaat 74581 ctgttagtga agactcatgc tgataggcca gttttagcat caggaaatgg agttcagttt 74641 ctatttctct tatgtgtacc ctatgtctca gtaccatggt cctggctgaa tggggttggt 74701 atggggcttc tacccaggtt atctgaatac ttgttaagca cctaagtagc tgccattgtg 74761 ctgggtagag ctacccttgc tattacccct gaccttagga aactcagtct aaaagcaggg 74821 gtcagccaac ctttttccat tatgcattag caactgtttt agactcatgt gagccacatg 74881 aaatccatag catatttcct taaaagaact ttaaaaagat aaaagccagc tgggcatggt 74941 ggctcattgc ctgtaactcc agcactttgg gaggctgagg cagactgact gcttgagctc 75001 aggagttcaa gaccagcctg tgcaacatgg tgagagactc catctctata tatatattaa 75061 acacacacac acacacacac acacacacac acacacacac acacaccaca aaagttagct 75121 gggtgtggtg gtatatgcct gtggtctcag ctactgggca ggctgaggtg gtactgagac 75181 atagggtagg ggattgcttg agtccgggag gtagaggttg cagtgagccg agattatgcc 75241 actgcatgcc agcctggaca acagcaagac cctgtctcaa gaagattaag aagataaaac 75301 ccattcttag tttgaaaacc aaaccaaaac aaaaacaaac aaaaacacaa aaaaaaccca 75361 tgaactaggc acagtggttc atgcctataa tcccagtgct ttggaaggct gaggtgggag 75421 gatcccttga ggccagaagt ttgaggccag cctggacaac atggtgaaat ccccattttg 75481 acaagacaaa gattagacag gcatagtgtt gcatgcctgt agccccaact agttgggagg 75541 ctgagacggg agaatccctt gagcccagca gttctaggct gcagtgagcc atgattgcag 75601 ccactacttt ccagcctggg ctacagagca agcctgcctc agaaagagac catggactgc 75661 gtttggccat ccacaggtta tggcttgcag acccttagtc taagggaatg cgactccaag 75721 acctctaatg ttgagaaggt ctgttttaga aggaaattat caggtttttc atgaaggacc 75781 tatgattact tcattgattg atcacagagg gctttagggg accagatgac ttacagttca 75841 agcctgttta cagagtaggg tctaagactg tatctgtaca aagcttagac tgctgaccac 75901 ctgcatcgta atcagatggt atagactgtt agatatgcag attcccagaa ctcaccccag 75961 gcctatgtcc aactagtcaa acccaaactt tccaaattac aacttgctag cttctgctta 76021 gacaggttgc ttttattgct cctaatattc ctgattgggc aatactgttg cttggcaata 76081 gaaaaaccct tactgcttag ctgctgtttg ccttttaaca cctgtttcta taggtcagat 76141 ttctatttgc tagcaccaaa atagccttcc atctcatgtt ggggttcaca gatcaaaata 76201 gctcgtgttt catcacttgt tgtatccagg ttctgcacca ggtcttaatc tgtagaacat 76261 cattagtctt cattaccccc atggagcagg caccattact ctcatcttct aatgggcaaa 76321 caggcttgaa agggctcact tggtggaaat tacacctgat tagtggtaga gccagggctc 76381 aagggctcaa gtacttgaat gccagggttt ctcatcctca acactgacat ctcagtcctg 76441 gtaattctgt gttatgggga ggagacttgg gtaatgttgg ttgtctgcca gcatctttgg 76501 cttgtatcca ttagatgcca gtagtagctc ttctaccttc agttgtgata accaaagatg 76561 tttctagatg ttgtcaaatg tcccctggac agattccccc caccctcctc cagacagttt 76621 tgctcttgtt gcccaggctg gagggcaatg gagcaatctc agctcactgc aaacctctgt 76681 cccctgggtt taagcgattc tcctgcctca gcctcacaag tagctgggat tacaggtgtg 76741 tgctaccaca cccagctagt tttttgtatt tttagtaggg atggggtttc accatgttgg 76801 ccaggctggt ctcaaacttc tgacctcagg tgatccaccc gtctcagcct cctaaagtgc 76861 agagattata ggcgtgagcc atcacacctg gcctagacag attataatag aaaagaagag 76921 ccacacacga gacaccaccc ccttggttga gagccactgc tttactccaa agagaaagtt 76981 cttagccaag tttactgcca gtaactctta ggttactacg tacttcccca agaactgctc 77041 aaagcaatga cttaccactc tcctggtaca gttatcatag gtagccctca gggtgagctt 77101 ttctgggtcc aggatgtaag tgcagttcgg catgtcgaga ccaagaggat ctgccaaggc 77161 cagagcaggt tagacaggat ggctgagtac atttcagtga cagtccaaag ctatagagtc 77221 aagaccacca gtccatttct aaggtacttc agtcccttca cttccgcccc aactcagtga 77281 tagaacttga gctttgagat ggtatgactt gcagccagag gcttcccaga gatgctggca 77341 taagacacac tttcgaagac agacaggtct tccatgctgg gtggcttcat gcaaatcagg 77401 tatccccatg gcatcatggg agttggatgc cgttgctgca ggagtttggg tactctgctc 77461 cccaaacagg attgacctaa gccaaaggct gtctgctaat caggactatg aggagataac 77521 acacgttacc tacccaccac agatgcatgc catttcttgg tgccaggact gcttgggaac 77581 tccactgtta tttccctttc atcgcaagtg acagtgccta aggagcaaag gaagcatttg 77641 ggggctttga gtcatgagtg atgcaactcc tacaaaagtg tatagactcc tctcagatgg 77701 gagggattcc agatgaggga ggcaagaatc gtcccctacc ctgggagagc tcacaaacaa 77761 gtgaccaaga caacaataca atatgccaag aactgtaagg aaaatatcag gtacccaggg 77821 cttttgaagt gttccagtaa gaagcccagg ttagggttga caagcaggtc tgtaggagcc 77881 aggtcaacac cttgacagtg gagcagatgg aggcaggccc gaaaaactga caaaagcttt 77941 gtcaacaggt tttgcaaatt gatgttaaac aaagtcctgg gtaagtccca ggacttttat 78001 ttatttattt atttattttg agacggagtc tctctgtcac ccaggctgga gtgcagtggc 78061 acgatctcaa cccacctcaa cctccgcctc ccaggttcaa gcgattctcc tgcctcagcc 78121 tcccaagtag ctgggattac atgtgcccac caccacaccc agctagtttt tgtatttttg 78181 gtagagatgg ggtttcacca cattggccag gctggtctca aactcctgag ctcaggtgat 78241 ctgcccgcct tggcttccca aagttctggg attacaggcg tgagccccgg cgcctggctg 78301 catttttttt ttttttttta aagatcaagt tttgcccatc atctgctact ctgcagcctc 78361 tggctcaggt gaacgtcaag tgagcagctc cagttttgct tgttatgtcg atgttttcct 78421 taagcatgag tttgtgtttg aagattgctg ctcctaaaga tgagggctta gtgtggtggt 78481 ttgctgttct gtggatacca taaagcttga ggcagtgagg gattggaagg cagtgcaagt 78541 gtttgagttt tcatttagca ggtatagtga gtctacaggt tagcagaaga aagatccagt 78601 aacctgacca aggtaataaa gggggtgaat ggaggttgtc tgagccaggc ctgtgtttct 78661 tggccagcta ggccttccag ctgcaacctg ttttctctgc taagaagact tctgtcaccc 78721 aagagacata cctggaaagg caggatttac caactgagaa acatctatgg agttccctga 78781 agtcacaagg gcgaagaaga gagaaatcga cctgtaggtg ctggaaagag acagggagat 78841 agtcaaggaa gaatggactt taacaaacaa agctaccata ccccctccct ccacttccag 78901 aattactcac ctccagcctg cattgaacca gcctgaggga ctccaagagc ctcctctctg 78961 cctgcacgcc atagcagaag acactaccag atcaaccagg tagagggtag gctgctctgt 79021 gtttttatag caggaagcca gccgcatcca caccctctcc ccattggggg cacctgaatc 79081 ttgtcccatt ctcccagctg aaacggaagg attggggaag gaagtggcct ctgattcctg 79141 acagcttgcg ccgggtattc tgccagctgc cctgtgggcc aggcatgaaa tcagagtttg 79201 ctgaaatcag ctccaggtga gtgaattggt gttttcccta taactatggg ggaaattagg 79261 atcttttggg caaattattt acaactgcaa tgtagagcca aagagaatca tgggctttat 79321 tgagaaaaaa acaagtcttg ttcacaccac tacatttttt tttttttttt gagattcagt 79381 ttcactctgt cacccaggct ggagtgcact ggtgcaatct cggctcactg caacctctgc 79441 ctcctgggtt caagcaattc tcctgcctca gcctcctgag tagctgggat tacaggtgca 79501 cgccaccaca cctggctaat ttttgtattt tgagtaaaga ttgggtttca ccgtgttggt 79561 cagggtggtc tcgaactcct gacctaggtg atccacgtgc ctcggcctcc caaagtgctg 79621 ggattacagg cgtgagccat tgcacctacc ctaaaacctt ttaagtaaca tctagtcttc 79681 tggctactgc agaagttacc ctgataatgc acaggaagtg tctttaaaaa aatgaatgaa 79741 gccttgttga cccccaagtg taaaagctta agcttaccgc aaagatacaa tttcagaagg 79801 atacagagaa aaggcaaaag tcactcataa tagtgccact gctaatattt aggtgagtaa 79861 ctttagacat gtttctacaa acaccacaca caattttttt aaaacactgg gaggaaagtt 79921 taaatactgt tttgtaatct gctactgtca ctttataact attaaacata aagctctctg 79981 ttaccatttt agggtggcat aatattcaat ttgagataaa tctgtgttta tataatttcc 80041 ttaattatga gcataaggtg gcttgtaatt tttcatcatt ataagcagta acatgatgag 80101 cattctgata catatccttg ggtaaccctc aagataaatt cctcaaagta aaacttctgg 80161 gtaaaaggga gtagtgatct aataagaatg ttatattgtt gtaaagcact tggaagtttc 80221 ccagtgggga aactgaggca aggcaacttg ccccaacaat gtcagggacc aggtggagaa 80281 ctcagatttc ctgaggcctg atggttgtgt tttttcaaca tctctcacac tctttgagat 80341 aactagatgt ctgcagccca tgttccaaaa taggtggata taaggcttag actccattca 80401 agtggtgggt gacacttgca attattaggc tggtgcaaaa gtgattgcag tttttgccgc 80461 tacttttatg tatgtatgta tgtatttgag atggagtctt gctctgttgc ccaggtggga 80521 gtgcagtggc gcaatcttgg ctcactgcaa cctctgcctc ctggattcaa gcgattctcc 80581 tgcctcagcc tcccgggtag ctgggattac aggcgcccac caccaagccc agctaatttt 80641 tgtatttttg gtagagacag ggtttcacca tgttggccag gctggtctcg aactcctggc 80701 ctcaagtgat ccaaccgtgt cagcctccca aagtgctggg attacaggcg tgagccactg 80761 tgcccggccc tttgccacta attttaatgg caaaaactgc aattattttt gcatcatcct 80821 aaactgtatt gattcatgtg ttgcctttga ggtgtaacct gaggcatctg aaataacatt 80881 ggagattaac tgtttccctg acttccttga tcaaaaggat cacatggagc tctcattaat 80941 atcacagatt ccagggtccc acccaaaagc tgctgaatta gactctccag aagatgtgcc 81001 tgggaaatat ttacttaacg agcaccacag gtgattctta taatcaggaa aacctggaaa 81061 atacttggag ggagcgaggc gagaaaatct gtggtggcca gtggttaacc tccttacggg 81121 aattgagtag attacatccc agcggaattg ggaagaagga gcaaaataca gataagtttt 81181 tggggataga gatggtaaaa ggatttcatt ttatagccaa agcatgatag caatagtttc 81241 tccttctatg catatttttc tgaaaagtgg caccttttct acaaatataa tttattgcca 81301 caatggttaa gactatgata agacattaga ggaaaaagga gcatagacgt gagactcaga 81361 gccctattga gaaggtggtc agagaagtga tgggttaatg tgcttattca gcctccctca 81421 gaccccacct cccttcccac cgcgctactc actcccctcg agatggtgtc tttctccaga 81481 gcgggaagtt tttgccttgg aaagggcagc tttggtcaga gcccccaaat ttggggtttc 81541 taaaaggtta aggggcagca cagagacccc aagcaggaga tgtggtgaga aagcccgctg 81601 tgggctggtg aagctggtat gtgcaccaga gattaatttt gtttgtctca cctaacccag 81661 acctgtggtg tgctggggac agccaccgaa caaatggctc tgggcccagg agcatccatg 81721 tctaggattc tgctccggaa aagcaggaag ctaccatctg ttcagcaact gactttgaac 81781 acaggaagaa atagctgctc cgtaaaacag tgtctcctat caggaggtag gaaaaactaa 81841 agttattaca aaaatgattc ttaacattct gttctcactt ataagtggga gctgaacaat 81901 gagaacacat ggacacaggg aggggaacat cacacactgg gtcctggtga caggggttgg 81961 ggggtggggg cagggagagc atcaagataa atagctaatg catgcggggc tcaataccta 82021 ggtgatgggt tgataggtgc agcaaaccac catggcacac gtttacctat gtaacaaagc 82081 tgcatgtcct gtacatgtat cctggaaatt aaaattaaat ttaaaaaaat attgcgctaa 82141 ataggcccgg tgcagtggct cacgcctgta atcccagcac tttgggaggc tgaggcgggt 82201 ggatcaccag aggtcaggag tttgagacca gcctagccaa catggtgaaa ccctgtctct 82261 accaaaaata ttagctgggc atggcggcac acacctgtaa atcctgctac tcgggaggct 82321 gaggcaggag aatcgcttga acccaggagg cagaggttgc agtgagccaa gattgtgcca 82381 ctgcactcca gcctgggcaa caagagtgaa actctgtctc aaaaaaaaaa aaaaaaaaaa 82441 ggcagtaaac aaacagcaga cctcgttcaa atatatgtgt tccacactct tttcaatgaa 82501 cctcttaaag tttgctgatt tattattaca aggtgttcct tctcctcttc tctgccctgg 82561 gttctatgta tcatgaaacc atttctctct tgctaatcac aaggcatttc ttgacggggt 82621 tctcatacat aaaagaaaga ctatagtttt gttgttgcct catttgggca gatgtcttcc 82681 tttctgaagt gtggttctta taaatgacag tttataattc ttaaattctg aatctgctgc 82741 ccttgttttc cacaacaatt ataaaagtat ttataaaaga caacctaaat atcaccatct 82801 agagatggtt ggataaacta tggaatactc ttcagctact gcaaatgatt ctatgagttc 82861 gttcagatct gtgagtactg acatgaaaga tgtctgcaat aaaactggaa tttcagggct 82921 atatatctac tatgctttca tgttttgctt taataggaaa atgaaaaacc tgtcgaccta 82981 aaggaaggag aagagaatat agttttaaag agtttacttg aggcaaagtg tggacagctg 83041 cccaggaaac acttccaagt tgccttggtg agttctccac ttttgtcaca agtgggtgtt 83101 ttttttgttt tgttttttgt tttttttaga cagagtcttg ctctgtcgcc cagactggag 83161 tgcagtcgca cgatctcagc tcactgcaag ctccgcctcc caggttcacg ctattctcct 83221 gcctcagcct cccgagtagc tgggactaca ggcgcccacc accacgccca gctaattttt 83281 tgtattttta gtagagatgg ggtttcaccg tgttagccag gatggtcttg atctcctgaa 83341 ctcatgatcc acctgcctca gcctcccaaa gtgctgggat tacaggcgcc caccaccatt 83401 cctggctaat tttttgtatt tttagtagag acggggtttc accatgttag ccagaatggt 83461 ctcctaacct cgtgatccgc ccaccttggc ctcccaaagt gctgggatta gtggctgata 83521 cagtgtttct tgactcattg atttacagaa ataacactga ttagctgggt acagtggctc 83581 acacctgtag tcacagtgac tcaggaggct gaggtgggag gatagcttga gcccaggact 83641 tttaaggcag tctgggcagc atagcaagat cccatctcaa aaaaaaaaaa aaaacactga 83701 ttagtgattg gctatacatt gttgaactat agggtatgat taatggcgtc cagcatatgg 83761 tatgagttat gatgtccagt gtatggcata gttaggttaa ttttatggct acttggtgtc 83821 agtctagagc ccacatatca agtagctcca agagataatt actgagcccg agggagtgag 83881 gtgtgactgc tgtcacgttt caatgcctct ctgggcctga taatttaaag ggacttgcat 83941 tcttcagata aaaagtttct tttctttctc aaacttatgt gtatatttat actttatgtg 84001 tctgcatgtg ggtctataca tatgtatttg gattacaagt caagaaagtt gtcccagaga 84061 atcgacagca tcatccagga aaggagaggg ttgccagggt ccggggagtc cacgtggctg 84121 tgctgggcat ttgcgaccat tgtctattct gacttgttcg tgcttttgct gtatgggttt 84181 taaatacatt gaatatcatc accgggaata ggattaagga gatgggaggg ttggagcaca 84241 gaaaacaatt ccccaaatgt gcctgggcat gccaagtgct ttggaaaatt aaaaggcttc 84301 agaaataagc ctcagaatca aggtctctct atccttgcct tgttcccttt ccacccccaa 84361 agtacaggaa ggaactctct ctggggaaaa aaaaaagaaa gctccttctt tttttttttt 84421 ttttttttgg cagggctttg gtcttgttgc gcaggctgga gtgcaatggc atgatctggg 84481 ctcactgcaa cctccacctc ccaggttcaa gtgattctcc tgcctcagcc tctcaagtag 84541 ctgggattac aggtgcacgc cagcaaccat ggctaattat tttttttttc ttgtattttt 84601 agtagagaca gggtttcacc atgacggtca ggctggtctc aaactcctga cctcaaatga 84661 tccacctgcc ttggcctccc aaagtgctga gattacaggc atgagccact gggcccagcc 84721 aaggcagctt cttaacagaa gaaacactat tgccttctat cccctccctg aaatctcatt 84781 atctatagca ggaaaggaga ctaaggaatg taaccacaca tggacagact tttccacaag 84841 ataatgtcag cctctgaagc tcagtcaaat tccaaagata attatgtaca agttaatttc 84901 tctctccctg gtctgttcat tctccctgat aatcattatt gcccctcaag agaattgtct 84961 acagtcccca tctcctccct cccctatgaa aaactgtata tacagccagg cacggtggct 85021 cacacctgta atcccagcac tttgggaagc cgaggcaggt ggatcacctg aggtcaggag 85081 tttgagacca gcctggccaa catggcgaaa ccctgtctct actaaaaata taaaaaatta 85141 gccaggcatg gtggtgtgtg cctgtaatcc cagctacgca ggagtctgag gcaggagaat 85201 cgcttgaacc cggcgggcag aggctgtagt gagccgagat tgcaccacta cactccagcc 85261 tgggcaacag agcaagactc catctcaaaa aaaaaaaaaa aagaaaaccc accgtatata 85321 tgcatctgtg ccccactgag ggtttagggg caattactct gtgattccca ccttcccctg 85381 cacattaata attttgtatg aatttttctc ttattcatct gcctttggtc agtttatttt 85441 cagtgaactt tcagagtgca aaggtgggag ttttcttcct tcagccctta aagaataagt 85501 aactcctgcc tttctgactt tgagggagaa gaaaaggaaa aatcatttgg gataaacagg 85561 ctgcacctgc acacagataa gcaactttgt ctaattagcg agctcctagg aaaaagtttc 85621 ctcccctttt cagacatatc catggtggga acttacacag ggaggagggg ggcttaccta 85681 aaacaaaccc gcagttatca aaacaagaga cgcaggcttt gtgcttgcct agatacatgc 85741 ccacagttgc gtaagatacc gggagttgca cagacagctt tactgatgag aagttactca 85801 aactgctaga gatgagagag gagtttctta aaaaagcttt tgaattcagc tgtaacctgg 85861 caatccactt ggactcccct ctctgctgca gagaattttt ttctttcact tattactatt 85921 cttttttgag acggagtttc gctcttgttg cccaggctgg agtgcaatgg tgtgatctcg 85981 ggtcactgca aactctgcct cctggattca agagattctc ctgcctcagc ctcccaagta 86041 actgggatta caggcatgcc ccaccacact cggctaattt tgtattttta gtagagatgg 86101 ggtttcatca tattggccag actggtctca aattcctgac ctcaggtgat ccacccacct 86161 cggcctctca aactggtggg attagaggtg tgagccacca cacctggcct ctttccctca 86221 ttaaactttc actccaaccc acctttgtgt ccatgttcct taattttcta ggaggtagga 86281 caaagaaccc agggtactag ttcagacaat gagaaagtct acattaaggt gcattggtga 86341 ggttccaaca actttattcc ctctgttttg gttgaattgt tttagtgata aaataaccaa 86401 ttaaataatt atagactggg catggtggct cacgcctata atcccagcac tttgggaggc 86461 tgaggctaga agataacttg aggccaggaa ttcaaaacca ccttcgacag catggcagga 86521 ccctgtctct taaaaattta tttagttagt tagttttagt tttttgagac agagtctcac 86581 tctgtcaccc atgctggagt ccagtggtac catctgtgct ccctgcaacc tctgcctccc 86641 aggttcaagc tattcttatg actcagcctc cccagcagct gggactacag gcgtgtgcca 86701 ccacacctgg ctactttttg tatttttagt agagatgggg tttcatcatg ttggccaggc 86761 tggtcctgaa ctcttgacct caggtgatcc atccgccttg gcctcccaaa gtgctgggat 86821 tacaagtgtg agccaccgtg cctggacaaa aaaaaatttt ttttaataat aattttgact 86881 gatttctgag tgagctacct aagttacaga gtctaaggct caaggatgaa aactcagtag 86941 tataaaggta aaaaggcgaa atgaatggat ccggaacttg aggctttgaa acaaagtcca 87001 actccctatg gtaaaatgaa gatattaaga ggtttaatat ctaaagaggt ttagtgactt 87061 gctccaggtc atttaccaag ttagtagcag atagagctcc agtgagaaac caggtctcct 87121 gatgcctctc cctgcccttt ctaattcatt tcaattttct ctctctcgct gtgtgtgtgt 87181 gtctgtgcat gcgggggtgt tttaaaactc tgtgcttctc ttgcagttca cttgtctttt 87241 aaaagtacaa gatgacatta attagaattt tagcagtata agataaataa tataaaaact 87301 tctcccagtt atgctaatgc agctcaaaat agctttcaaa tatcaaatat gaatttaact 87361 tggaaaaata agatagacta tgtcaatgag aatgttctcg aacatcatat ggtagttcct 87421 ttaccaatct aatatacaat gttaatgttg ttttcccctt gggtccttta gacagatcca 87481 agttgaatga gccttctgac agtaataaat tcagaaagaa ctaggatgaa agctttatcg 87541 atctataaat ataaaggtag ggcggcatca atatatggaa gacagaaggc ccagaagtct 87601 tttttttttt tgagatggag tcttgctctg tcgcccaggc tggagtgcaa tggcgtgatc 87661 tcggctcatg gcaacctccg tcttccaggt tcaagagatt ctccggtctc agcctcctga 87721 gtatctggga ttataggcat gtgccaccac gcccagctaa tttttgtact tttagtagag 87781 atggggtttc accatgttgg ccaggctggt ctcgaactcc tgacctcaag tgatctgccc 87841 cctccagcct cccaaagtgc tgggattata ggcatgagcc accacacctt gcctttttgt 87901 cttttttaac tgttgttgct ttaacgtctg ttttgactga tacaagaata gctattcctg 87961 ctcacttttg atgtccgttt gcatggaatg tcctttttca cccctttact ttaagtttat 88021 gtgagtcctt acatgttaga tgagtctctt gaagacagca gatacttggt tgctgaattc 88081 ttattcattc tgccattcta tatcttttaa gtagagcatt tagtccattt acattcagtg 88141 ttagtattga gaagtgaagt actattctat tcgttgtgtt agtttttgcc tgaatacctt 88201 gtgttttttt tttcattgtg tttttgtttt ataagtcctg tgagacaatg ccttaaggag 88261 gttctatttt ggtatatttt gaagatttgt ttcaagattt tgagctactt ttagcagttc 88321 ttgtagtgct ggcttggtag tggcaaattc tctcagcatt tgtttgaaaa aaaactgtgt 88381 ttttccttca tttatgaagc atagtttcac tggataccaa attcttggct gataattatt 88441 ttgtttaaga aggctaaaga caggacccca atcccttcta gcctgtaggg tttctgctga 88501 gaaatctgct gttaatctga tagtttttcc tttataggtt acctgatgct tttgcctcac 88561 agctcttaag attctttcct tcatcttgaa tttgatcatt atgtgcctac atgataatct 88621 ttttacggtg aatttcctgg gtgttctttg agcttcttgt atttggatgt ctagatcttt 88681 aacaaggcca aggaagtttt cctcaattat tccctcaaat atctttttca aacttttaga 88741 tttctcttct tcctcaggaa tgccaattat tcttaggttt ggtcatttaa catactccca 88801 aacttcttgg agactttgtt catttttaaa atccgttttt ctttgtcttt gttggattgg 88861 gttaactcaa aagccttgtc ttcgagctct gcagttttct tctacttgtt tgattcttgt 88921 tggagaatac ctggatactc atttggttgt ctttatccta gatgggtatg tgggtttgcc 88981 tatgctgagg ttgtttcaag aaccaagaga agtcaatatt gtaagagatt agtggtatgt 89041 gtatgtgtat gtttatgcat gcatatagat acatatatat gtttttctag agtagcttgc 89101 atataaaggg gcagtgatat agcaagaaat aagctgctgt ttttaactat ggcaggactc 89161 tgcttgttca gagagtgtgt aagatgccca cacagcaatg acactatggg atgaaggggt 89221 gccctgcccc tccaaacctg tgggtgtttt cttgtcgggt gggatgagag actaagaaaa 89281 gaaagagaca cagagacaaa gtatagagaa agaaaagtgg gcccagggga tcgtcgctca 89341 gcatacggag gaccatgccg gcaccagtct ctgagttccc ttagtattta ttgatcatta 89401 tctctaccat ctcggaaagg gggatgtggc aggacaatag ggtaatagtg gggagaggtt 89461 cagcaggaaa acatgtgaac aaatgtctct gtgtcataaa caaagttaga aaaggtgctg 89521 tgccttgatg tgcacataca gaaacatatc tggtgcatta aagagcagta ttaccgccag 89581 catgtctcac ctccagcctt aaggcagttt tctcctatct cagtagatgg aacatacaat 89641 ccggttttac actgagacat tctattgccc agggacgagc aggagacagg tgcctttctc 89701 ttatctcaac tgcaaagagg ccttcctctt ttactaatcc ttctcagcac agacccttta 89761 cgggtgttgg gctgggggac agtcaggtct ttcccttccc gcgaggccat atttcagact 89821 atcacatggg gagaaaactt ggacaatacc tggctttcct aggcagaggt ccctgcggcc 89881 ttccgcagtg ttttgtgtcc ctgggtactt gagattaggg agtggtgatg acttttaaca 89941 agcatactgc cttcaagcat ttgtttaaca aagcacatcc tgcatagccc taaatccatt 90001 aaaccttgag tcaacacagc atgtttctgg gagcacaggg ttgggggtag ggttacagat 90061 taacagcatc tcaaggcaga agaatttttc ttagtacaga acaaaatgga gtctcttatg 90121 tctacttctt tctacataga cacagtaaca gtctgatctc tcttttctcc acaatgggac 90181 caggataaag tgttactaaa acacatagca gctaattttg ctctgcgtgg ccacttctgt 90241 atccttggct aacctgggaa attatcctgc aagtgtgtgt gtgttctttc tactgatcag 90301 taattcaact gcaattgggc aatatgcaag tttgcttaag tcttataacc ctgtgagtgc 90361 aggtgtaaat atccccatat ccccaagaga acagcctttt taatgcttga gaattggcca 90421 gacatgctgg aaatggggac tagaagagct ttcacacagg cgcataatat gtatttggtg 90481 actgattggc acagaaggaa atgagggagg gttaggatgt aactgatgtg agaggaaatg 90541 ataggacatt gtcacactca aaaggccatt cgacatggtt actggagcaa ggcaaatact 90601 gtacatgagt gaagtcggtt gggtgcaagg aaatggcttt agtgtgatga gaatggcaag 90661 tgactaacta ttgccctcag ggtcaggatt acagctctgg gcagctgggg gtgatgtgga 90721 ctgttgagtc caggctgcat ctaaagaaat aactgctttg gctgggcaca gcggctcaca 90781 cctgtaatcc cagcactttg ggaggccaag gtgggtggat cccctgaggt tgggagttcg 90841 agactaacct gaccaacccg gagaaacctc ctgtctacta aaaatacaaa aaaattagcc 90901 gggcatggtg gcgcatgcct gtaatcccag gtactctgga ggctgaggca ggagaatcac 90961 ttgaacccag gaggcggagg ttgcagtgag ctgagatcgc gccattgcac tccagcctgg 91021 gcaacaagag caaaactgca tcttaaaaaa aaagaaaaaa gaaaaaaaag aactgcttcc 91081 cagctcaggc agaggtaatt gtcatgatct tcagtttctc atctggaagc tgaaaatcca 91141 gattccattc aatttcctca attttaatgt tgatatttaa tgttaaagat attttcaaaa 91201 tcctgtgcct gccaaacaag acatatctct ggccatattt attcccatcc accagcagcc 91261 tgtaacttcc aaaaccatgg agagaccatg catatagtgg ctgagctgct gctcaccctg 91321 tgggcgtggg gcaggaggcg aacctgcagc tcctcccact ccgtccagat ctcccccttc 91381 agcttctctg aatcctaggc caggtgcgtt tgcagctttg tctatggtga aaggtggcag 91441 cttatcctgt agacacctgt gtcgtcgagg ttgggggtgg tgagagacag gcgtgggttt 91501 cgttttattc tctggggttc caatttgtct ctgcagattc taatttgttc tttttcttta 91561 cttcctctcc atctttactt cccaattgct ggccctacgg actttgggtc taacatctgg 91621 tgcagaagaa atagccttac tagacagctt acctggcttc caggtataag cttaatctct 91681 gcaataaacc ctttattcta tatcactcag aatggttctg cttctctgat gaaacctgac 91741 tgatacagaa tttggttgct acctccttat atgagaggca aggaacctgg attgcagata 91801 gggacaataa ctctaggtcc tacagttact tagtgacagt cctgggaatt gaatccaggg 91861 ttcccctcat ctctgctttt tactttttcc cctgtgttta aagaagttga ttctcacctt 91921 tgtataatct ttttttattt ttttgcgaca gagtttcact cttgttgccc aggctggagt 91981 acagtggcac aaccttggct cactgcaacc tctgtctcct gggttcaagt gattctcctg 92041 cctcagcctc caagtagcta ggactgcagg tgcgtgccac cacacccagc taatttttgt 92101 atgtttagta gagatggggt ttcaccatgt tggccaggct ggtcttgacc ccctgacctc 92161 aggtgataca tccacctcgg cctctcaaag agctgggatt acaggtgtga gccaccatgc 92221 ctggccacct ttgtataatc tttaaagcat cttttccata aaatgtcccc caccattcaa 92281 ataacagctg agtcctgcta agtctccttt gatcatagct ctgtaagcca ctccttcctc 92341 atttggtcct ctgcaggttt tgtaagccac atgtacatcg ttcacgtttc atccagagtt 92401 ctgggaaaga gtaggtggtt cccagaaaca atgtgagttc atgagtcttt ctaagaagaa 92461 gggaagaaag tttttgtcaa ggccacaatg tatagacctt gtgctaagca ctgatttata 92521 aataatccag ataggagtaa gaaaatgaaa tgaagtctcc ttgaggtgaa ttttcctcca 92581 gttgatagta gtagaaagct attatgggct gaaaatatat ctcccccacc attcatgttg 92641 aggtcctaat tcccagcacc tcagaatgtt actgtatttg gagatagcct ttgatacggt 92701 ttggctctgt gtccccatcc aaatctcatc tcaaattgta attctcacgt gctgagggag 92761 ggaggtgatt gaatcatgga ggtgatttcc cccatgctgt tctagtgata gtgagtttgt 92821 tcctatgaga tttgatggtt ttataagtgt ttggaagttc ctccttcatt ctctctccta 92881 ccactgtgtg tggaagatgc ttgcttcccc tttgcctttg ccaggattgg gtttcctgaa 92941 gcctccccag ccatgttgga actgtgagtc aattaaacct ctttccttga taaattaccc 93001 agtctcgggt atttcattat agccgtgtga aaacagacta atacagcttt tacagaggtg 93061 atgatgttaa atgaagttat tagggtgtgc tctaatcaat atgaccatct tataaggacc 93121 aagtgtcctt ataagaagag aaaatttgga cacagacatg tgaagaggaa gatgacatga 93181 agacacaagg agaaaatggc catttgcaag tctttttcca ggagagaggc ctggaacagg 93241 tcctttcctc acagccctca aaaggaacca accacgctca caccttgatt tcagatttct 93301 ggctttcaga actgtaagaa aataaatttg tttaagccac tcagtctgtg gtactttgtt 93361 atggaagtcc tagtaaacta ttacaaaacc ccagattgaa attcaattct gtgtgactgt 93421 aaaatatata ttatttttat tataccatga taaggggtag gacaaaaaaa aaaaaacaaa 93481 actaggtctc ggctgttgtt attgagaagg aaaatgcaga gaaaagagaa gttttaagaa 93541 aatgctgcat ttatatagat tgacttaaag aaaggagaaa ctttctttct cctagtgcta 93601 gtactactat taatttactt taattaagtg ctgactacac accataaact gtacttagta 93661 ttttacaaat attcagtaat tacttgcaac aaccctgtga atgattctct tttaaagaga 93721 aaagtactgt ctttatttta ctgatgaaga aactaaagct tagggagaag taatttgccc 93781 aacttaatgg gaagatccaa tctggggctt aatatctgtc agtctctgac ttcagacccc 93841 atgttctcaa tcattattct actctggctt tatttcaaca caatgcttgg agtttacatt 93901 tgcttgtctt tccaatccaa ccatcagctt tgtaagagtt gaacggtttt gatggtattt 93961 gatgatctgc taatggatgc tataacacac ctcccttgtg gtaaattcta ccatctttga 94021 aacaaaatgg cacctaacat aggtttttcc aatgattgtc gttcctgcag tttttctaga 94081 acttgagtta tagaggggac ctgttggtag aagaattaca tggattgaaa ttttcaagtc 94141 acagaaccaa taattaggaa gaaggaaaga cataagaaaa aagggaaatt tgttcagcta 94201 acaaagccag tttaatcaag tgtagaattt aaattcctag cccctgctta gactatccag 94261 tattcaaatt tgaaagaatt tgtttttgga agcctttata aaactcttca ctgttacaat 94321 gtatgccatc tgggatttag catttataaa atgtgcagtt aagagatgcc aagagattcc 94381 tcatgcaaac cagatggtac aataacagct ctgtctttag ttttctgcca gaaaaatagg 94441 cctttgaatg gcttgaaact tatatgaata aattaagctc aaccaatact tcagggacat 94501 tacaatttca ataggcaact tatccactta aatattttta taaatgtttt catttagaaa 94561 agaagaaaca aaaaggcaga caatgattct ttttgattgt taaagaatac aaacatgata 94621 caatgctttt aaggaaaaca tcaatgactt tcaatgtggg aaagaaagtt tctccttatt 94681 ttgacagtgt tatatgtgtt tgtttattta ttttttgaga cggagttttg ctcttgtcgc 94741 ccaggctgga gtgcaatggc acaatctcgg cttgctgcaa cctctgcctc ccaggttcaa 94801 gtgattcttc tgcctcagcc tcccaagtag ctgggactac aggcatatgc caccacgccc 94861 agctaatttt tgtattttta gtagacacgg ggtttcacca tgttggccag gctagtcgat 94921 gaatgtgttt aagttagaag tgaagttgtt gaaatttctt ccgaatagga gaaatccata 94981 tcaaataggt tgaacagttg tatcattgtg ctaaagagaa aagaatacac atgtaagtga 95041 cattttctgg ataatatatt tgagcacatc tttaccttaa cagatagtct ctcataaacc 95101 caagggcagt gcatacagca gtgtgttact gatggcttct tttgtccttg cctggttcct 95161 gcaagttttc tatggaacaa caggactttt gtattactct ctaatttccc ttactttgtt 95221 actgttttta ttttcattaa aattcataat aagatcctca ttagatgaag taatcttgtt 95281 aaaagcagtg ttctactctg atctttttcc ttcttatgta aaaaaaaaaa atgtgcttcc 95341 ctaccataca agtgaatttc ttctctaatt agacatgaat agctttagag atcaaacaca 95401 ggctgctttt gaaaaccaga tctcttagga cacaggggca atctcattac tgaaaatatg 95461 tgtttctggg ttgcacaagc cattgaattt ctttcccaat tagcagcaaa tggtttttca 95521 gaacaatgag gaagaagttt ggtttgtact ttgcaagtgt tgtttttttc ttggagttag 95581 actggaagta tcaactacca tcatctctgt cccttctttt cctctccctg agcaaaggaa 95641 cacacatgac tctgttttat agatagtcag agttccaggt aaattcttca tccaaatgct 95701 acttcctctg ggccaggatt ggactccatg gtagaaggaa cacctattta agactaccta 95761 gttgtcagct actgtcagct gttatgatgt taggggagac tgcaaaatca taaacacagc 95821 aatgagaagc tgacaatgtc cttgctctgg ggggaggagg ggaaccattc atgagagaat 95881 ctctccaagt gctgccccta gaagcagatt aaccatgttt tctctgggtc tttggcaaaa 95941 aagaagtggt gtctgtggag aagggcttct tatcattaaa taaccaactg cttggggttc 96001 cagataatag gttgaggttc tagagtttta ggacagtgca ggaggaaata agacctgcct 96061 gtctaaacct aagtgtcagt cagagtcaag ggaacacaga ggtcatggac aggatgtttc 96121 tctaagttga gacatactag tagtgcagga gtatggggag atatgaggag catggcgagg 96181 ctccaacagg aaggttgatg agagctttga cttggggtca ctgccactca ggtccgagcc 96241 tcaaagtctc tccagtgtgt gtctggcaaa tattcttctc tggccttcca agagtccagc 96301 tgtatgctga tggaacacgc tcaaatacta atttccacaa gatcagtaca gtcaagaaaa 96361 tccacaactg cccccacaat taccatgact cctcacaagg aattaatgtc tacattatat 96421 gcaaattcgt gttagctctt tttattaatt tctgccctta ccatctccca ccaattttgt 96481 ttcctttggg ctttgaggaa ttcaaaaatg attactgggc taaaacttcc tctgtctgtt 96541 ccctaagcca caatcacact acttctagtt tgtaaaaaca tgctttatgg gttgcaagaa 96601 attttatttc ttcatcttta ctaaagttag ttgatgctga tgataatgat gatatgatca 96661 ctagcaacat cccccagttt tatccacaag gatatggaga ctcaaataga tggagtaact 96721 tttcctggat tatatgaacg ctaggtcttc aaccctcctt cccaacctca ccttgaaggg 96781 ataggataag gaggatcttc aagttcagaa aaatgtttta aatccttatc tctgccccca 96841 cttccccacc ccccgctgcc cctgccaccc cattcttctc ccaaggagga aattttccca 96901 agggcgtgcc agaggatgcc aggatattta ccatttaaga aagtttggtt atttcgccat 96961 ttcacaggag actttaagtt ctagtcctaa ggactgctgg aactgagaaa ttctttctct 97021 ttcttgtctt ctatctccaa tccttcttcc ccatctctat aattattctg tcctctttac 97081 cctttgcctc catcctgtga aacttaatca cgcaaattgc atcattcttg ttatacccaa 97141 ctaaatcagg gttgagaggc tgggtggtgg agttggggga gaaagcactc cgagtacaaa 97201 aaataaataa ataaataaaa attgctccaa gaacgtaatt ctcagcaaga ctgactgctg 97261 gaactgcctg ctgtaacccg ggagcagttt tatttacagc tgctgagata acttgctgca 97321 actctaggac aaattttgcc cactaatcag agcttgccag ctccccaaac ccttactagt 97381 gccaatgaac tttcttgaag agcaatatgt aacatttctc ttgttaataa aacctctaaa 97441 cttctctttt cttcttcaga catactgaag accacctggt ctatgtgtgt atcccaaatt 97501 gcaagtcttt ttccaaataa aacattaaat ttagagattc agctctaaat ttttattttg 97561 actttgacaa tcctcatcct aatttggctt atgaccatcc tagtggccac tattcttctc 97621 tttctcttcc tcttcctcat ccttatcttc tgaaagcaga ccctttggct ggagtcttag 97681 gattctacta caaataatac atctttgaga agttctctat cggttatatg tatttttatt 97741 ttttattttt tttgcactct gtcacccagg ctggagtgca gtggtgtgat cttggctcac 97801 tgcaacctca gcctcccagg ctcaatcaat tctcctgcct cagcctcatg agtagctggg 97861 cttacaggtg tgcaccacca tgcccagata atttttgtat tttagtagag atggggtttt 97921 gccatgttgg ccaagtgggt cttcaactcc cgacctcaag tgattcaccc acctttgcct 97981 cccaaagtgc taggattata ggcgtgagcc acctcgcctg acctgttttt aaatttcaaa 98041 gtgcattaat gaatgcagtt gtatctaaat taaagtggct catgcaaaac ttgatgaaga 98101 accttaccat tttctcacag tattcataat catgtttgga tatgtcactc tctctgggct 98161 aaaaacaatg tttaaaaaaa gctaaataat gctgccctcc aggctgcttc tactgctctt 98221 cacctagaag ccagaaaaag gaacaaacag tgacctttgg agctcattat tattgtgaag 98281 aaactgtaga tgaaagcttt gtactttttc tgagtgggta aataatttat cttggaataa 98341 aaacttagac aaaaagagat aagaggaata tacagttaca ggcttcaaaa atagatattt 98401 ctcacatcag tttagtggac aattatctaa atatattaaa gattgtgtaa caatatgggg 98461 atatatttat gatggattaa ggagtggaaa agatgtaatg taaaattata tgtatttgat 98521 gaaaacaatg taaaaaatac accacaggag aacatatgca tagttgatga gttatgtaag 98581 gatggcggtt tctaatttag ttgcagtgtc ttcagagaac acggtttgca tgaaccctgt 98641 tatttgttat ttgtcgagaa ctgcactgct tctattcagt ggtcaggttt tataaatatt 98701 ctgtgtgctt agaaaaaaca tgcatttact attgggcaca gggtcctttc tccatgtgac 98761 tgttagatca agcttgttaa ttgtgttcaa atctgttttt cttctttttt tttttttttg 98821 agatggagtc tcactctgtc accaggctgg agtgcaatgg tgcgatctca gctcactgca 98881 acctccgcct cccgggttta ggcgactctc ctgcctcagc ctcccgagta gctgggacta 98941 caggcacatg ccaccatgcc cagctaattt ttgtattttt agtagagacg tggtttcacc 99001 atattggcca ggatggtctc aatctcttga cctcgtgatc tgcaccccct cagcctctca 99061 aagtgctagg attacaggtg tgagccactg tgcccggccc aaatatttta tagctatacc 99121 aatctttagt ctgatataga acattatgat gatgtctgaa atcttccctt atgattgtgg 99181 ttttgttttt ttttttctgt ttaatttttg ctttcccttt tttgcagctg tgttatttaa 99241 tttatacaag tttaaaacga tcatagtttc ttgattaatt gttcctttca ttatcatgta 99301 ttatctctct ttatgcctag taaatttttt tggcttaagc tatatttttc cttgctacca 99361 atacgggtgc tcttcaactt acaatgcaaa accaccataa gttgatataa taactcagtt 99421 gtgagctgag aagcatacta aatacctgtc atttttgtgc catcataaag ttgaaaaatt 99481 aggtcaaact ctcataagtt gcagaccatc tatgtgggta caccaatgtt ttgttagtat 99541 ttgcttgcta taacttttcc attttttgct ttcaatcttt ctgcaatttt atattttaga 99601 tatgtctatt ataaacaaga catggctgag tcttccacat cataaattta gtccattttc 99661 ttttgttttt attacagatc tggaaatttt ttggtcttat attgtactcc ttatagtcat 99721 ttttctatac tttttctcct tcccttgagt tattgataat atgtttaagt tattgatttg 99781 agatcttttt tctttctcat atatgcataa aaaactacaa attttgccct aagcttttct 99841 ttaactgcat cctataaatc ttgatatgtg gtattttcat tttcattctg ttccaaatat 99901 gtctgatata ggacttgtat ccagagtata taaataattc ttataattta ttaataagac 99961 aaccacttaa aaattgggca aaaagatttg aatagaagat gtaagaatgg ctaatgggca 100021 catgaaaaga tgctcaatat cttagtcatt aggaaattga aaattaaagc cataataaga 100081 ccacttcata tccactagga tgaccataaa caaaaagata ggcaataata aggctggcaa 100141 ggttgtggag aaattgaaac cctcatacgt tattggtgga aatgtgaagt agtacagcca 100201 ctttggaaaa cagtctggca acttctgaaa tttaaatgta aattgccatg caacccagca 100261 gtttcactcc ttgatatata cgcaagacac atgaaaatat gtgtccacac agagacttgt 100321 acatcagtgc tcatagcagc attattcatc attactgaaa aatggaaaca acccaaatat 100381 tcatcatatt catgaaacat tgaatggatg aagaacatgt ggtatgttag cacagtggaa 100441 tattattcag cagtgaaaag aaagaaacca ttacatgcta cacatgacaa acctcaaaaa 100501 agtatgtgaa ttgaaacaag ctatacacta aagactatat attgtaggat tccatttgtc 100561 tgaaatgccc gtaaaaaaca aatctataga gacagaaagc agactgtggt ttcctgaggt 100621 tgggagtgga aatgggattg acagcaaatg gcatgaggga tctttaaggg ctgatggaaa 100681 ctggattatg ataatgggca tgcaactcta taaatgtcct aaaaatcatt gaattgtatg 100741 catgaaatgg gtaaatttaa tgatgttata cctcaataaa gctgttcttt ttggaaaaaa 100801 cagctagtga actagagtag ctctgcattt tattaagaaa ttatttctta catcctgtgc 100861 ctagatggcc aatattacac tcaagaaact aatggacata gaaaaattga atgtacctcc 100921 aaacactcaa gatgaataca gcagtcttca atagtcaacg tcatggccat tccagagaga 100981 aaaggtccaa gttcagtgcc tcgaaccttg ttgccttccg tgaaaggaag cagttgaaat 101041 gagctttgcc cctggtgaca ctcttccttg tcactaatta accagctcta taggaaggtt 101101 gctccgccaa caccttgctc tgcctggaga gacatctggc caagttctgg tgagcaggaa 101161 aaatgtctac tcgttaccac caagctgcta gtgatagtta cctggaactt ctaaaagagg 101221 ctaccaagcg agatctaaat ctttcggatg aagacggcat gactcctact ctcttggcag 101281 cctaccatgg gaacttggaa gccctagaga taatctgcag tagagggtaa gttcaacccg 101341 atggtttctg ttggaaacag tgttcatggt aggtttttgc agacagcagc aagaggcagg 101401 gacgtcaata caaatacttt atcctctttc tagattgtct aaatgattgg atttttagag 101461 aaagagagaa aactatgcct agagatgctc tcttttctca ctgaggactg gcactatagg 101521 agtggctcat cccttagcca caatcagagg agagaccctg aactacggca ttatctgcat 101581 ggggaagagg gaagatattt taattctatg cctttaaaaa tgcttttcat gaaatttaaa 101641 tagttttctc ttttcaaaga ggccattgtt ttacttggtt ttaagttgcc aaccttggtt 101701 ggaacaaatg acatggtttt agtttttttt ttatctatgc catcaaaatc tcacagtctt 101761 ttattaactt tcaatcccag tgcaagctca tgattttttt attttttatt tttgacacag 101821 ggtcttgctc ttttgaatgg agttgctggc tctacaagga ctctttagcc tatcttaact 101881 ctatcacttc tttggctagt ggatagatac acacacacac acacacacac acacacacac 101941 acacacccca tctctttgcc cagattttaa tggaggagta agctagttcc tctgcttgtt 102001 ctgcctgtta tcagagggct aggttgtgtt ggcaaccgta acccaagagg ggtaaagaaa 102061 gagaaaagga caaggtaaat aaaggagcca agtgaatcca attgaatacc catttattca 102121 gtatttgtta tgtgccagcc acttatctct taagataaga gatattttgc tagccactgg 102181 gcacactgtg atcaacaaga cagccacagt ccctgtcata gtgagcagat ttgagagggg 102241 aagacaagac atctatgata tttgtgtgtg tgtgtgtgtg catgtttgct ctgagacagg 102301 gtctcactct gtcactcagg ctggagtgca gtgtcaagat catagctgac tgcagccttg 102361 agccccaggc tcaagtggtc ctcccatctg agcctcccga gtagctggga caagaggtgt 102421 gcaccaccat gcccagcaat ttttgtactt tttgtagtga tggggttttg ccatgttgcc 102481 caggatggtc ttcaacttat gagctcaagc gatctgacag cctcggtctc ccaaagtgct 102541 gggattacag ctgtgagcca ctgtgcctgg ctcatctatg atatttgtta caaaaaggga 102601 aatagagggc actaaatagg catctacctc aatctcaggg gtgagggaag gcttccctaa 102661 ggggaaagca tttaagctgg gattaggaaa atgagaacaa gttggctaga cgaaagagtg 102721 ttttccctaa gatggaaaac tgctctagtt ctgagaacta gagaagctga gtatggctgg 102781 atgtggttca agtaggagtg gggtcaggac aagtataaga gataagacca gatcacaaag 102841 agccttgtca actctggtaa ggactttatc ctaagggccc aggaagccat gtcttgggta 102901 gggaagctgc cagagtagat ttatgtttac aaaactcact caggtagatg gataacagtc 102961 ggagagaatg tttggggact gttttagtgg tcgagatgag ttggactagg gtagtgacat 103021 tgggaatggg aagatatggt tggatttgag tggcagtttg gatgtagaat caataatgat 103081 tgactggatg tgggcagata aaggaaaagg agggttaagg tcccaggttt tggatgttga 103141 caccctttat taaaataggg gacatttagg aagtgtcaga agaacccagg attgggagag 103201 ttcctgagtt gaagattgaa tgagaaagtg agtgaagcag atcttcccca tctctcttgt 103261 ctgaataaga ctttcttttc tcagacacat cgatccccat cagccagcct tctccagtca 103321 tctcatctct tctcctactc agcctagggt tccccagctt tctccatcta ctgaaagtct 103381 acctcattcc acaaactgca agatgtcagt tactttaaaa tcaaagtatt agcctattct 103441 tgcattgcta taaggaaatg cctgggccta ggtaatttat aaagaaaaga ggtttcactg 103501 gctcatagtt ctacgggctg tacaggaagt atagtagctt ctgcttttgg ggagccttca 103561 ggaagcttcc aatcatggca gaaggaaaag gcatctcaca tggcagaagc aagagcaaga 103621 gagcaagggg ggaggtgcta gaaaccagat ctaatgagag ctcactatca tgaggacagt 103681 accaagggga atggtgctaa accattcatg agaaattcgc ccccacgatc cagtcacctc 103741 ccaccagtcc ctacctccaa cattgcggat tacaatttga tttgaggttt gttccccacc 103801 aaatctcaca gatccaaact gtatcaacca gtgagtttgt cctcctagaa gagggtttgg 103861 gggttctggg gttcttctca gcagagagct cttgttccat ttgaggacat tttgaatagt 103921 aatgcttagg tgttctctct ctttctctcc tttgaaagcc tgaaatattc tcttacaata 103981 actcttctgc tggtctttat ttatttttta tttttactag agacagggcc tcgctttgtt 104041 gcccaggctg gtctcgaact cctgagctca aataatcctc ctgcctcagc ctcccaaagt 104101 gttgagatta caggtgtgag ccacagcact cagcctggtt ctgctagtct ttaattacaa 104161 aagtaacatg agctgaaccc agaaaattaa accatacaga cgcacacaaa acatgaagaa 104221 aaaatgtcct ataatgccat gctactaaga tgtatatagt tctctctata tatactatag 104281 tgcatacaaa tacattttaa tattcttcta taaatttcag ggactttgta ggactctagt 104341 caacaaatac atttcttagc caacccctct tttggacaca tagtttgttt tccctagttt 104401 gaatagggct atgatgacta tccttgtaca taaatctttg ctgacattgt cttttgtgtt 104461 tttttgagat ggagtctctc tgtcactcag gctggagtgc agtagcgtga tcttggctca 104521 ctgcaacctc cacctctcgg gtttgagcaa ttttcctgcc tcagcctcct aagtagctgg 104581 gtctacaggc gtgcactacc acgccgggct aatttttgta tttttagtag ggacagggtt 104641 tcaccatgtt ggccaggctg gtctcgaact cctgacctca ggtgatttgc ccacctcggc 104701 ctcccaaagt gttgggatta caggtgtgag ccaccgggcc tggccacatt ttcttgagaa 104761 taaattctcc aaagtgaaat ttctggatca atttgcattt gtgttagtcc atttgtactg 104821 ctataacaaa ataccttaga ttaggtaatt tataaaaacc agaaatttat ttcttacagc 104881 tctggcagca gagacatcca agatcaaggc accagcagga ctggtgtctg atgacagccc 104941 tgtctccgct tccaagatgg tgccttgttg ttgcatccta tggctgtgtc ctcacatgga 105001 ggaagggaag gaaggggcaa aaaaaaaaaa aaggcaaatt ccctgtgtca agtcctttta 105061 taagggtgct aaccccattc atgagggtag agtcttcatg actcaatcgt ctcctaaagg 105121 ccacactgct taacgctgtt acattgacga ttaagtttca acatgaattc tggaggagac 105181 acattcaaac catagcagca tatgtgcaat ttaaatagtt ttgtgtgtgt gtaaatggac 105241 atgctgtcct tcagaaagtg agtacccatt ttggccaaca tgataggtcc aaaatgatat 105301 attaggctgg gcgtggtggc tcactcctgt aatcccagca ctttgggaag ccaaggcggg 105361 cagatcactt gaggccagga gttcgaggcc agcctggcca acatggtgaa accccatctc 105421 tactaaaaat acaaaattag catggtgtgg tggtgcacgc ctataatccc agctactcag 105481 gaggctgagg cacaaattgc tcctgaatcc cttgaatccg ggtggtggag gttgcagtga 105541 gccgagatca ggcaactgca ctccagcctg ggcaacagag tgagactctg tctcaaaaaa 105601 caaaacaaaa caaaacaaaa tgatacacca gtgtggcttt aatgttcttt tctttggttg 105661 ccgatgaggt tgaaaattct ttttatgctt aaaaagaatt ggccattgga ttgccacttt 105721 tgtgaactgt gtatttataa ctttattccc catttatatt ggggttttca accatttctc 105781 cttgattgat aaaaacactt ttgaaagtta acaatatcaa ctctttgtca tataggttac 105841 aactgtttta caatacacaa tttatatttt tgacaacaga tttttaaata taaatgtaaa 105901 cataaatata tatatatata tacgtataag tctatcaatc tttgttgtgt gattctaact 105961 ttgaggtcat gcctcaatac tgactgatcc tttagtgcta tttttagttt catttgttat 106021 tttaaatact ttatataatt taagatctga attatgatct aactctgcat tctccaagtg 106081 gctactcatt tattccaaca caattagttg ttatagtcta tctctttctc cactaagttg 106141 acatggtaac atcatttctt tcaatgtaat ataaatatcc agtcccccca atttgcttat 106201 attaatattt attaaacact ggctatgggt gcaaatattc ttctaacctt gtttttccta 106261 gtccttcctt ctcaactcct ttcttctcac gggatctagt acatctcaga tttgtttcat 106321 tgccctcaaa agcatctggg atctgctgtg ttgagcaggc ctgatgtagg ctaaaaaaaa 106381 gaacagaagt cttgtgaggt gcctttttga ccatagctaa tcctgggaac cagtgtgagg 106441 tcaaatttcc aattagattt aaggaagctg aaaagggaag aaaattacca tctcacatca 106501 gtcagaatgg ctattataac atcaaaaaat aacagatgct ggtgaggttg aggagaaaag 106561 cgaacactta tacactgttg gtgggagtgt aaattagttc aaccattgtg gaaagcagtg 106621 tgacaattgc tccaagagct aaaagcagaa ctaccattcg agccagcaat ctcattactg 106681 ggtatatacc caaaggaata tgaatccttc cacaataaag atacatgcct ggatcatgcc 106741 tgtaatccta gcactttggg aggccgaggc gggtgggtca tttgaggtca ggagttcaag 106801 accagcctga tcaacatggt gaaacctcat ctctactaaa agaaaaacac aaaaaattaa 106861 ctgggcgtgg tggcacacac ctgtaattcc agctcctcag gaggctgaga cacaaaaatt 106921 gcttgaaccc gcaaggagaa ggttgcagtg agccaagatc acgccactgc actccagcct 106981 ggggaacagc aagactccat ctcaaaaaac aaacaaacaa acaaacacat gcatgtgagt 107041 gttcattgca gcactgtcac aatagcaaag acatggaatc agcctaaatg tccatcaatg 107101 acagattgga taaaggaaat gtggtacata tacagcatgg aatagtatgc agtcgaaaaa 107161 gaatgagatc acatattttg tgggaacatg ggtggagatg gaggccatta ttcttagcga 107221 gctaacgcag gaacagaaaa ccaaacacca catattctta ctgataagtg ggagctaaat 107281 gatgagaaca tgttggctta gtgcctgggt gacaaaatta tctacaacaa acactcgtga 107341 tatgagttta cctatataac aaacctgcac atgtactcct gaacctaaaa ttaaaaaaaa 107401 ttgaaaattg aatttagaag ctaatcaact attgaaatac tgaagtccat ttcagtctga 107461 tgatttaaga gtttatctgt cattacctga tagacaacga atcctatagc aaccatttat 107521 attaaaactc tggctttgtt agaaaactat tttcctttgt ttttaaaatt tgaagatgga 107581 aaacaatcat tacatgtatt ttcctgcact aggaagcata atggcccaat tacaaaaaaa 107641 aattcatttt acctttaaag tgtatgaaat ttcttggttt tattttttaa aatggaaact 107701 agactttttc ccccagattc tttttttttt gtgatggagt ctcgcttctg tcgcccatcc 107761 tggagtgcaa tgatgggatc ttggctcact gcaacctccg ccacctaggt tcgagtgatt 107821 ctcctgcctc agcctctgga atagctgaga ctacaggtgc ctgccaccac accccgctaa 107881 tttttgtatt tttagtagag acaaggtttc accatgttgg ccaggctggt ctgattctta 107941 aagaaataca tacttattgt aatatttcac atcatattga atagaattga taaagtaaaa 108001 attataccaa ttcctacagt tatcaacatt tgggtacata tccttccaga ctttttcctt 108061 tgagtacatg caaacacatg tacattctct ccctttaaaa gacaatcata taaactatac 108121 tgaaaccttc taattattta acaatatatc atggaccatt tgcacttaaa agttgagatc 108181 cacgccacta ttttattttt tttttaataa tttcaacttt tttttttttg tgagagacag 108241 agtcttgttc tgtcacctag gctggagtgc agtggcacaa tcttggctca ctgcatcctc 108301 cacctcccag gtttgagtga ttctcctgcc tcagcttccc aagcagctgg aactacaggt 108361 atgcgccacc atgcccagct aattttggta ttttttagta gagacagggt ttcactatat 108421 gttggccagg ccggtctcga actcctgact tcaggcgatc cacctgcctt ggcctcccaa 108481 attgttggga ttacaggtgt gagtcactgc accacgccag tttcaacttt tagatttaga 108541 gggtacgtgt gcaggtttgc tgcatgggta cactgcatgg tgctggtgtt tggggtatga 108601 ctgatcccat cacccaggta gtgagcatgg taccaaatag tttttcaacc gttgccctcc 108661 tctcatagtc ccccgtgtct gttgttgcta tctttacgtc tgtgagtacc caatgtttag 108721 ctctcactca taagtaagaa tatgcggtag tgggttttcc attcctgtgt taatttgcta 108781 gggatgggat aatgcatgcc actatcttta atgacttcat ggtattctgt tttatggtgt 108841 ttgtttgttt gtttttgaga tggactctcg ctctgtcatc caggctggag tgcagtggtg 108901 cgatctcagt tcactgcacc ctcagcctcc caggttcaag caattctcct gcctgtctcc 108961 agagtagctg gggttacagg cacatgctac cacccccact taatttttgt agttttagta 109021 gagatgaggt ttcaccatgt tggccgggct ggtctcaaac tcctggcctc aagtgatcaa 109081 cccaccttgg cctcccaaag tgctaggatt ataggcgtga gctactgtgc ctggccagtt 109141 tcattatttt tattaattaa caaaaattat atatatttat ggtgtacaac gtgattttta 109201 aaaatatgta tacattgtgg aattaagtca agctaattga ctatgtgtta tcttctatac 109261 tttttgtgat gagaacatta aaatctactt ggtaggcaat tttcaaatat acagtgtatt 109321 gttattaact atagccattg agctgagcat gatggctcat gcctgtaatc ccagctcttt 109381 gggaggttga ggcaggagga ttgcttgagg ccaggagttc gagactagcc tgggcaacat 109441 agcaagaccc catctctata aaaaataaat aactaatttt taaaaaaact atagtcatca 109501 tgtgttacaa tagatgtctt gtacttattc ctcctgtctc actgaaattt tgtatccttt 109561 gaccaactcc ccagtctgcc tctgcctagt taccaccatt ctactttctg cttctaggag 109621 acttttcttt tagacggagt ctcactctgt caccaggctg gagtgaagtg gcacgatctt 109681 ggctcactgc aacctccgcc tcccggattc aagcaattct cctgcctcag ccctcccaag 109741 tagctgggac tacaggcacg tgccaccaca cccagctaat ttttgtattt tcagtagaga 109801 tggggtttca ccatggtggc caggatggtt tcaatctctt gacctcgtga tctgcccacc 109861 tcagcctccc gaagtgctgg gattacaggc atgagccact gcgcccagcc tctaggagac 109921 ttttttagat tccacattta agtgagatca tgtggtattc gtctttctgt gcctgtctta 109981 tttcacttaa catactgtcc tccaggctca tccatgttgt cgaaaatgac aggatccgtt 110041 tgtttcttaa gactggatag tattccattg tgtggagcca ccatgttttc tttatccatt 110101 cttccactga tggacacaga ttaatgccat aatatatgta atcagttgac tattcatgtt 110161 gcttaatttg aacccactct ttttcctgtt ataagtaagt ttgggaggaa cattcctgct 110221 cattcatctt tgcacaaaga tcttatattt ccttaggaca aattcctaaa tgtagaattg 110281 tagattcaaa agagtgcctc attttaaagg ctttttgtgt gtattgtgcc aggaaatttg 110341 aaccactttc tgcttcttcc agcagtattt ctttccccat actcctgcca aaaccaagta 110401 ttactattct ttttaatcgt tgcctatctg attggcaaaa ggcagagtat tgtttcaatt 110461 tgcatttctt tgactattag tgagtccaaa tatttctttt tatgcagaaa aacgtttttg 110521 aaagaatagt tgagaacttc acaaaaaatc aacttcagga ggaaaaattg gaatgtaaca 110581 gatgctcagg gtaatttaag caaattgcct acttagtgac agtagtgcta agccagcttt 110641 aaaaactatt gcccaaatgc actgatctac caaaaatcaa gggaaaaatg ggttatttta 110701 aaatctcttc agtgagtttc tcttttgggg aaaagggaaa tagaaaaaaa caacactaga 110761 ctgagcttag cccacgaaaa ccgtagagca tcaatgttga actcagatta tcaaacgaca 110821 ggactagttt tccacccagg cagttgtatg caatggttct atcaagaaat gtgaacaatg 110881 acaagtatgg aatttaaaag gactccatgc aacttgtctg tagaaaatat ttccatttaa 110941 attgtacatc aatttggaaa tgtgtctgta agtagaagtt ttgcaagttt tttgttttgt 111001 tttgttttgt ttttgagacg gagtcttgct ctgttgcctg ggctggagtg cagtggtgtg 111061 atctcagctc actgcaagct ccgcctcctg ggttcacgcc attctcctgc ctcagcctcc 111121 cgagtagctg ggactacagg tacacaccac catgcctggc taattttttt tgtattttta 111181 gtagagacgg ggtttcacca tgttagccag gatggtcttg atcttctgac ctcgtgatcc 111241 acccaccttg gcctcccaaa gtgctgggat tacagacata agccactgtg cccagccaaa 111301 gttttgcaag ttttaatagc ttaaagccac taggaaaacc catgaattgt cttactgaca 111361 gtttcatctg taaatctgga attttgcatt tgaaattttc tcaaaagaag gcccaccttt 111421 ggggtcatct caggtgtact gttcttcctt ctgtcttgaa agcatcattt aagattcagt 111481 gaaatgggtg cagctgtaaa aaaaaaaaaa aaaaaaaatg agatcatgtc ctttgcaggg 111541 acatggatga agctggaagc catcattctc agcaaactaa cacaggaaca gaaaaccaaa 111601 cactgcatgt tctcactcat aaatgggagt tgaacaatga gaagacatgg acacagggag 111661 gggaacatca cacaccgggg cctgttgggg ggtcgggggc aagaggaggg agagcattag 111721 gacaaatacc tactgcatga agggcttaaa acctagatga caggttgaca ggtgcagcaa 111781 accaccatgg cacatgtata cctaggtaac agacctgtat gttctgcaca tgtatcccag 111841 aacttaaagc aaaataaaaa ttcagtgaaa tgagaaaatt ccaccagtct atagtcttaa 111901 gcatatacat ttaacccgga caccttagtg cccttctgca tcccccctgc agctcactcc 111961 tcccaataat tttgttttca acttatccaa tccaaggata acttcaggca aatttaatca 112021 gtatgtttct ctttctgttg atagtaacgt cttgggtgtt ttacaatccc tctttgatgt 112081 ctaggatact taaggagtct tttgcatcta gtgtttattt aaaataaaaa gtgaaaccac 112141 atgtttaggg gggagacaca gcggtccctt aagatgcggg tctgtgccaa ggttgaaggg 112201 tctcctttca tcccaaacaa ttaagtcagg ctgactgacc agagccttct ggaaagtggc 112261 ttcccagcaa aactgtgtta cttaatattt gctttcctta gatttgattt cattttcttt 112321 ccttttccac tctttcccct tacccccaac tccacacact ctggtcacgc tgatctatac 112381 tcctggttcc ccaaataggc cttgtaatat cagccatctg tgcctctgct cacactgttc 112441 tctctgctca gaaggaggtt accagttgcc agcttatggg ctggagctgg tccacaaatg 112501 tgttttattt ggactttact atatttttct tttaaatttg agccaacact taaaaatcac 112561 agaatttcgc ttgaaattct ggatttgtgg cttcgtttga aaaatgggag ctctggcaat 112621 attgtttccg ctttgattca aggtagctga gctgtggtcc tttctaggtg gagtatgcat 112681 ttaactgtga ctttgttgaa agagcaaact tggccccatt cagtcactta catgacttgc 112741 tggacttctt tgagttgggt tggagacttc tgccgggaca tgtggctgac acaaaggaga 112801 aagagacaca ggctttgccc tctggggcat attatccaat taaggagaca aggcataaat 112861 atttgtcaga ttaaaactgg aagtgattga ttgattgatt ttttttttat ttttttgaga 112921 cagagtctca ctttgtcacc caggctggag tgcggcggcg caatctcggc tcactgcaat 112981 ctccacctcc caggttctcc tgcctcagtc tcctgagtag ctgggactac aggcgcgcac 113041 caccatgcct ggctaatttt tgtactttta gtagagatgg ggtttcacca tgttgaatag 113101 gctggtctca aactcctggc ctccagtgat ccaccagcct cggcctccca aagtgctggg 113161 attataggca tgagccacct cacctggcct ataagtgatt tttaaaaacc cacaagcatt 113221 attgtgctgt aggaatttac tagttgaata caaggaaaaa taaatacatg aaattaagga 113281 ttatgagttt gggggcaagg gcaaggataa taagtaatgt ttattcaata tttactatgt 113341 gccaagcagt tttccaatgc tttatgtaga ttaatttatt taattcacat aactctatga 113401 ataggtaata gtattatctt caatttacag atgagtgtta gaaatgtttg tcttttggtg 113461 ttgcaaagaa atagtatttg aacataaatt taattttttt agtaaggcta tttttatttt 113521 tcgtagaaag ggtatatttg ttagtagttt tgttatgaga gtatattgaa caaaggagac 113581 agggttattt ataacttgat gtgtttaatg ttgtgtttgg tttttattgg ctggaatggg 113641 actttacatt ttgtatttgt cttgattggt tagtaactta gaacttttta aaagaggcaa 113701 aggcagagga gaacaaagga aggagaaagt aatttgtgga atgttgagaa aggtaaaaac 113761 acttttaaat aaggaagagg aacaggctat gacttaatgt ttgtttggac tactataagt 113821 atgttagggt aaatatttaa gctaaattgt gggagttaag agtataaagt atattgattt 113881 ttttattatg gctagtagat atttaagaat gttagtacag gtttttgaat aaattttgtt 113941 tttaagagaa gttattattt atttttaatt agatggggag gaaagttttt gaagaggaaa 114001 ctttatttta ctttttacat gaagaaacca agtcctagaa aaattaaact tgaggccatg 114061 tagctggtaa aatggaaaag cccagttgtg ttaatctagg atgtattgta ttggagatga 114121 tttgactggg agaccagcct gatccaaatc atgctgcctg ttacccttgg gcaaatcaat 114181 taagctctct gagtcttatt ttattcctgc agtaatgatt cttgcctgcc ttctctaaga 114241 gggagattgt gatggttgaa ggcaaagaat gtgaaagtgc ttctttgtaa ctaacacaca 114301 ccaaagagcc agggattccc gtggagtaac tgcaaatctg atttatcaaa ttataggacc 114361 ctggtcggat cacactaacc agtgaagttc ttttcaatct gtagtgagcc tacgaatcac 114421 ctgcggatct tgctaaagtg tagattctga ttcaataggt ctgggtgaga gtcttctgta 114481 ctttttttgg tttgcttttt tttttttttg gagagagtct cactctgtca cccaggctgg 114541 agtgcagtgg tgtgatctcg gctcactgca acctccgcct ccggggttta agcaattctc 114601 tgcctcagcc tcctgagaaa ctgggattac agacatgtgc caccacgccc ggctaatttt 114661 tgtattttta gtagagacag ggtttcacaa tgttgaccag gctgatcttg aactcctgac 114721 cttgtgatcc acccacctcg acctcccaaa gtgctgggat tacaggcatg agagtcttct 114781 gaacttctaa caaactcaca ggtgatgcat ctgcttctgg tctggagacc caacgttgag 114841 cagccaggaa ctagtagaat ggtcgtggcg cactaacagg tcatttgcaa gggtgcagat 114901 cttctcttct gcttctgtct gccctctact tctcgggaat gttaagagga cttctcagac 114961 tcactgtcct ctttccatgg ggccaggctg gccagagttt agtgagatcc tgattttgga 115021 gtctagccta gggcaaagga aggacccctt agtatctgta ttatcagagt tctcagcatt 115081 ggcgcatggc tctggccatc attaaaaata gttactcgac tttttatctg ttcctgtttg 115141 taactcaagt ccttgtagga gaggagggtt tgctgccttg ggcaggattc cagtaggttt 115201 tcattcattc cacaaatact ggagcaccta ctatgtccct gttctgtgcc aggcattagg 115261 tttatggcct taataccatc taatagtatc cccaattcca ccttaaactc ttgtagaatg 115321 acacacttat atgaacctgc agtcatggaa gtaagttgag gaataaatag agatgtaatg 115381 cccttttaat gaataatgtt tgtaacagac actgtttgct acctctccaa catccttttc 115441 tttcttcttt cctgctgatg gggactgagt ttggttcagt tatttatttc cccacatgtg 115501 gctcaattcg ggagggtagc agtagcatgc tggaaccagc ttgtcagagt tatttccaaa 115561 ttttagaaag tttgacagct aaatatagcc attctttctt actttttatt tttattttaa 115621 aaaaaaattt aatggaggca ggtcttgtta tgtttcccag gctggtcttg aatgcctggc 115681 ctcaagcaaa tctcccacct cagcctccca gtgctgggat tacaggcacg agccactgag 115741 cctagccttt tttttttttt tttttaaggc agggtcttgc tgtgttgccc aggctgtaga 115801 cagtggtgca atcataggtt actgtaagcc tgagcttctg agaacatagc cattcttaaa 115861 aataaaatta caggctgggc gtggtggctc acaactgtaa tcccagcact ttgggaggct 115921 gaggcgggtg gatcacctga ggtcaggagt ttgagaccag cctgaccaac acggtgaaac 115981 tccatctcta ctaaaaatac aaaaattagc tgggtgcagt ggtgggtgcc tgcagtccca 116041 gctactcggg aggctgaggc aggagaatcg cttgaacctg ggaggtggag gttgcagtca 116101 agccgagatt gcaccactgc actccagcct ggacgacaga gcaagacttc atctcaaaaa 116161 aaattatata aacttataat ttaacaaatt atattaaaaa gataacacga aagtctcact 116221 ttctacttat tttaatacat ttcactctta cctgtctctt aagatgattc atgtccatta 116281 tatctacatg ggggaaatag ttgataatgg tacgctactg tgtatctctt ccaaactctg 116341 tccagtgatg ttacattggt agcttgaaat ttgccatggt ggatgtattt acaccacaga 116401 aattggcaca tgctacaaat taggattttg atgttattat tttcttgata agccattgaa 116461 caatcctatc cctagtttca gggcaaaatc taattgatct aagttaaaca caatagtatg 116521 tttcccatct cttaggcgtg taacctagtg ctggccaatg agacaagaaa gttgtttggc 116581 tggagaattt cttagtaagt ttttctcact tctaaaaagg agacacagcc aggcacggtg 116641 gcttacacct gtaatcccaa caatttggga cgccaaggca agcagatggc ttgaggccag 116701 gagtttgcga ccagcctggg caacatggtg aaaccctgtc tctacaaaaa attagctagg 116761 tgtcatggtg catgcctgta gtctcagcta ctcaggaggc tgagcaggga gaatcacctg 116821 agcctgggag gtcaaggctg ccaatgagct gtgatcacac cattccactc cagcctggga 116881 gatagagtga gaacctgtct caaaacaaaa caaaggaaac aaacaaaaag gagacacacg 116941 gtagagatgg gctctgtttc tggccatgtg gttttaagat gggatggttc taagacagtt 117001 ggacttgcag ccagtatctt ataacaatgt caagtcagga tgaacatgtt tgtattaacg 117061 tgccttttgt tctttatttt tgttttgatt tttttgggcc tgcgttcaga gaaaaaaagt 117121 ccaacatgta ttttttttct ctctctctct tctagagggg accctgatag gtgtgacatc 117181 tggggaaaca ctcctctaca ttttgcagcc tccaatggcc atgcccactg cgtctcattc 117241 ctggtcaact ttggtgccaa catctttgcc ctggataatg acttacagac tccactggat 117301 gctgctgcca gcagggagca gaatgaatgt gttgctctcc tggacaaggc tgccactgca 117361 cagaacatca tgaaccccaa gaaggtcacc aggctgaagg agcaggctca gaagaatgcc 117421 aggaggcaga tcaaagagtg tgagaggctc caggagaagc accaaaataa gatggcccac 117481 acctacagca aggaggaatc cgggactctc tcttcttcca agggtacctt ctccagatca 117541 tccccttcaa atgcttctgc tcctggcaca ttcgggtcac tatctaaggg cattaaagac 117601 actttcaaga tcaagttcaa gaagaacaaa gatacagcag aacaggtggg gaaggaaggc 117661 agaagtgggc agaggaacgt gatggaagtg ttcagagagg aagaggaaga ctcgttctca 117721 ggggacttca aagagaagct ccagttgtca gcagaggagg acggcagtgt gcaccatgaa 117781 tccattctca atcgtccagg tctaggaagt attgttttta gaaggaacag gatatcgagt 117841 cctgaagaca tctcagatag caagagagag tttggtttta aactgcccag tgaattgctt 117901 caaagacaag gagcatcaga ggctgatgag ggtgcagctg atgaagaggg agaggaaaac 117961 ggcctcaaag atgatctgcc gtgggatgac gatgaagtgg agtgggagga agatgtggtc 118021 gatgccacgc ccctggaagt gttcttgctg tctcagcacc tggaagaatt cctgcctatc 118081 ttcaagagag agcagattga tctagaagct ctgctgctct gctctgatga ggaccttcag 118141 agcatacaaa tgcagctggg tcccaggaag aaagttctga atgctatcaa caggaggaag 118201 caggtgcttc aacagcctgg gcagctggtc gacaccagcc tgtgatggag agttttggcc 118261 tggagcattg gggtgatgct gtggcccgct ggcagcactc caggcggcac cccctcttta 118321 cccaatgcca gaccactggg aatggattct agggcatcgg aaatgcctac ctgagagaga 118381 gacccaaact ttactctggg aggtaggcta tgcccatcca aataaatctc catgagaaac 118441 ttgaggagac ttcataacaa gaatctggca tttctcttca gttatcttat atgtacatat 118501 aattgttttt gtggttgttt tgttttgttt tgttttgttt tttggagatg aaggtctcag 118561 tctcttaccc aggctagggt gcagtggtat gatcatagtt cactgtattc tcaacctcct 118621 gggctcaaat gatctcctcc cacctcagcc tcccaagtag ctgagactac aggttcacac 118681 cccccacacc tggcttattt tgtatgtttt agtagaggtg gggtcttgcc acattgccca 118741 ggctggtctc aaactcctgg cctcaagcaa tcctcccacc tcagcctcct aaagcactgg 118801 gattacaggt gtgaaccacc gtacccagcc tatctttttg atacttttga ataaagaaag 118861 ggtcatatgc atgacaggaa aatgaaagaa acttccttta cttttctatc tctggattta 118921 aaattataat ctcatcacat tatcctgctg cttgctttcc gatctgtgta acctgggaat 118981 tccaattctt tttctctcct gagatctatg actttgccta gtggtagaga ctagagttct 119041 ttcctggcct gcggcttgat gcccaactta aatgcatcta accctttaac aaatgtgtac 119101 atgtttacaa gtaatggaaa tgcgtctata atactcctgc ctgagaatag agacagagtg 119161 gtggtgggga gagtgaagaa agagatagaa tacaggtggt acctgttgtg gactgaattg 119221 cgtcaaattc atatgttgga gctctaaccc ctaatgtgac tgtaattgga aataagacct 119281 ttaaagaagt gattaaggtt aaatgaagtc ataagaatgc aaccctaatc ctgtaggact 119341 ggtgtccttt ttttcccttt tttttttttt ttttgagatg gagccttgct ctgtcactca 119401 tgctggagtg cagtggcgtg atctcagctc actgcaaccc ccgcctccca ggttcgagca 119461 cttttcatgc ctcagcctcc tgagtagctg ggattacagg cgtgcaccac aacgcctggc 119521 taagtttttg tatttttagt agaggcgggg tttcaccatg ttggccaggc tggtctcaaa 119581 ctcctgacct caggtgatcc acctgcctcg gcctcccaga gtgctgggat tacaggcatg 119641 agccactgca cctggcctag gactggtgtc ctaagaagag gaagagacac ttaggtggaa 119701 ggcacacaga gaggccacgt gaggacacag tgagaaggtg gccgtctgca agccgaggag 119761 ggggcctcag gagaaaccaa ccctgcaatc accttgatct tgggctttca gcccccaaag 119821 gtgtgagaaa ataaacttcg gttgataaac tgctgttgtt gaagccatcc agtctgtggc 119881 attttgttat ggcagtccta gcagaataat actgtcctaa gtaagagggt tggggaggag 119941 accaagaaaa atacagaaaa aaagtctgtc caactgcaat tgatgagttt tgtaagggta 120001 aacacctagt gaaacttaag gggaaaaaaa actaagttct ttggagggaa gatttgattg 120061 tcaaaggaaa tttcacattt tcatgcttat tatgtacaca tggtttattt actgttgtct 120121 gtcaccattg ccgcatatct gaatatgtgt aggttccacg atagaaactg acaacacttg 120181 gctcatgcct gtaatcccag cactttggga ggcccaggca ggcagatcac ctgaggtcag 120241 gagttcaaga ccagcctggc caacatggcg gaaacccgtc tctactaaaa atacaaaaat 120301 tagccaggtg tggtggcgtg tgcctgtaat cccagctact tgggaggctg aggcaagaga 120361 attgcttgaa cccaaaaggt ggaggttgca gtgagttgag attgcaccac tgcactctag 120421 cccaggcggt aagagagact ccatctcaaa aaaaaaaaaa aaaaaaaaga aaaaagaaac 120481 tgacaacact gctgctgaca ttttttcaat ggcaatccca aattcaaact gaaaccccac 120541 tgaagagcta agcttcatta gatctctaca ggctgactta catcaagtgg aatttactgt 120601 tgattctggg tataatacag aaacagctgt ttatcttcag cttgctttct gatgcacatc 120661 tgtttgggtt acttcaagag gcatcatgga ggattcaagt ttagggagga tacagatgct 120721 caaatctgat gaacaattgg cttattcttc ctcaatgaat atattcagaa agcttgttag 120781 ccattgaata aacacgttgc attaggggat gattgtttac aaatacctta tcttgtggaa 120841 taaactgaag ttgtgctttc cttcattaac gtgcacagaa gcagttggca aatagaaagt 120901 gctcaaaaaa gtttgtagtc tgtgcctgcc actattatat ttcatcatca tagatgccag 120961 aatcccctct tcacctattg cccggaatct gcctaatgag atatttgctc caccccctgc 121021 tgtaactaca tcatagcaag agagccatca ctcaggttgg tagaaatgaa cttgaacatt 121081 gactctcatc cctttttagg agatatttgc cactaggggt ctcatcaggt acctctatgc 121141 tcttcttagg ggttcttttc ttttatgatt gacagaatta ataggtgagc gttctccaaa 121201 gttctcagca tcaaggatat agtctcttcg tcacagagta ttgttatata ataaaaaaag 121261 gtttatacca tgttataaga tgttatcaat gttctaagaa ttccagatct ttcgaactaa 121321 ataccaacta caggggctaa gagaagatgc cagagcaata caaaggcctt gataagagac 121381 aaagacaaga ggattatgag ccaggatgag cttcaagtac tcactgatct cagacctcca 121441 tctcttaaac tagatgctta gagtagctca atccaatatc tggtcacccc taccacctcc 121501 tggcccctct tttccccctc tatgtagctt gacataaaag tcaacaagag ggaaaagcag 121561 ctgggagcat ttcaccgaag aggagagttt gagaagttat agtgtggtaa atgtgtattg 121621 tgcttatatt tcaggccatt gtgtagtaca ggtcagttat atgctcagaa ttcctaagtc 121681 caagttacca gcccagacca ttgctttggg caccaaatac ttaagcccaa atctaatctg 121741 acatctcctc tcacatgtct cacaagtacc ccatactcag tgtgtccagg actgaaccca 121801 ttttcttcca ggttgtcagt tctcaacttg gatgattttg ccctctgtct cctagaggac 121861 atgtggcaat atctagagac attttttgat tgccatgata ggggaacagt ggtgcaactg 121921 acatccagtg ggtagaaaca atgggtgcta aacatcctag aatgtacagg acagcccacc 121981 accacaatga attatcgaga cccaaatagt gatcgtgctc agtttcagct gcattaaaac 122041 atgcagctct aaatcttcat ttttcataaa tggaaaattt aacccaatac ttcttttcat 122101 cacccaattg gtcaccaggc cctttcaact tttcaatatg tctcagagct aaatcctctt 122161 tgatgtttgc aagatgacct attggttcct gcccccaacc atcactcact ggattgctgt 122221 aagtatcctt atctgtcttt aattcattct ccacaaggct ggttgagtca cgtaatcttt 122281 ttaatatgta agtctcttgt cactttctac tttactgttg cttttttttt tttttttttt 122341 ttaagacgca gttttgctct tgttgcccag gctggagtgc aatagcacga tctcagctca 122401 ccgcaacctc agcctcctgg gttcaagtga ttctcctgcc tcagcctccc aagtagctgg 122461 aattacaggc atgtgccacc aagcctggct aatgttgtat ttttagtaga gacagggttt 122521 ctccatgttg gtcaggctgg tctcgaactc ccgacctcag gtgatccacc tgcctcagcc 122581 tcccaaagtg ctgggattac aggcgtgagc cactgggctc agccaccatt gcttttcaaa 122641 ctctcaccac cacacagaag gcctgcatga ttaaagagcc ctattctctc ctccaaacct 122701 ctggtcccat ctctgaccac tcctctccct gtcccctaat ttcagctata ctagatttct 122761 tgtggcttcc tgaatacatg tctctcatat gcctgtgcct tagcatacca tcacccttct 122821 gcctggaaaa gtcttcctcc atatagccat cctcactggg agacataccc tctctctttg 122881 cccctagtct gggttgagta gcttcctatc tgtgcaagct ctctttgttt gcctctccag 122941 aacttctctc tactattttc caacctactt tgtgtcccag gtggttgggc tctggactgc 123001 atcaaagggc tcccttgccc ttgggatcct ggttgggtct ggccaatggg agttactaac 123061 aagataccag aaggtgggag aaagaaagat tgaggcattt attctacctg tttcttcttt 123121 tcagggcagc agctgccttc atctaagctc acctctccca tctggcagcc ctctgctata 123181 gctctagttg tttctgggtt tcactaactg ctctttcaag cttagcagtg gtagtgattt 123241 cctactattg ctaaccctgg ggtgcttctt cctaaacagt ctctttattc tcttcaatca 123301 caccttttga gtgtggcatc tacagacagt tccccactta caatcgttca acttaagatt 123361 ttttgacttt acgatggtgt gaaaatgata cgcattcagt agaaactggt aagatattct 123421 ctcaggatgc tgggcagagg cagcgagcca cagcttccaa tcagccatgt gatcaccagg 123481 ataatcaatt gacagtgaac agtgtaccgt gttaccagat gattttgccc aaatgtaggc 123541 aaatgtaagt gttctgagca catttaaggt aggcaaggct aagctatcat gtttggtagg 123601 tgttttctat gcatttttga cacatttcca actttttttg agacagggtc ttgctctgtc 123661 aaccaggctg gagtgcactg gcacaatctc ggctcactgc agcctctgcc tcctgggttc 123721 aagtgattct cctgcctcag cctcctgagt agctgggact acaggtgtgc accaccacac 123781 ccagctaagt tttgtatttt tagtatagac ggggtttcac catattgccc aggctggtct 123841 cggactcctg acctcaggtg atccgcctgc cttggcctcc caaagtgctg ggattacagg 123901 catgacccac catgttcggc cacatatttc caacttttga tgaatttatt gggacacgac 123961 tgcatcataa gtcaaggaac atctgcattt cctgcaggga ccctgaacca tgttcctgta 124021 acaccaaatt attgtaattt ttttttttta acatctgtct tctcttcgag gctgtgttct 124081 ctggggctga ggactacttt tctgttgtat cttagtaagc tagcacaggg ttatgatagg 124141 ggctcttgcg aaaagaatga aagaaaagat atatgaatat tctcatcatt ccatgctctt 124201 ctctatatat cctttagggt agtgaccatc aggattcctc agaaagactc catagtcact 124261 tgagatatgc agaatcagat catgtgggta atgtgggtat tttgtttcag cacaaagacc 124321 cctgtccaga cagtctggca aatatgataa acttttggct aaaccatctg gagcatgagt 124381 ccttcccagg aggagtatgg tcaggaaaac agagccatgc tttgctgctg acttatctga 124441 aaaccagcct tggagtagca aaaagcgaag gggtggggtg aggctagggt cggagtggga 124501 tttggacagg gcctttggca ggacaaggac aacacggttc tgctagtgat tcagggggcc 124561 tggagaaagt atctctgcac ctcagtttcc ccatccgtaa gttgaagatc gactagctga 124621 tccttaatgg cccttttaac atggatactg tagtggacat ctttagattt tgcctttcaa 124681 ctacccagtc ctctttttgt taagagacac cttccctcca tcagctcatg tgggcagatg 124741 cgcagatgac acttctgact tcaagggtgg gcatgtgatc caggcctgga taaccagagc 124801 atctgatccc cctggacaac gtgatggttc agggataggt acaaggcttg agcctggcca 124861 gggtattcca tctctttaaa agatggggtc ttgctatgtt gcccaggctg gtctcgaact 124921 cctgggctca agcaatcctc ctgcttcagc ctcccaaaat gctgggatta caggcatgaa 124981 ccacctcacc ccgtctattc catgtctttc agccacaatg ttaggctcag gaatagacat 125041 gtgacctaga tggtccagta actctcagtt ctggaacatc taactgtaaa ggaagagaat 125101 atttatgctc agttgctaaa gaggacagaa tacaattcag gagctgctga ccaatgtcct 125161 gccactacca gggagaactt gataatggag ccaacgggga agaagaccag gccctggtga 125221 cagttggagc ccctgaatac agctgtagct gaaaccagtt acctgtgtgt tcctaattac 125281 gagagccaat aaatctcatt ctccattcat tcagcttaag gtagtttgag ttgtagtctc 125341 taccacttat gaccaaagga gccctgatct atacagaaat tccatttttc ctctaagcat 125401 taaggagtca ggcacaaaag actacatata gtagaatttc acttgtaaga aatgaccagg 125461 cttggccggg agccgtggct cacgcctgta atcccagcac tttgggaggc cgaggcgggc 125521 agatcacctg aggtcaggag ttcaagacca gtctggccaa catggtgaaa ccccgtctct 125581 actaaaaata caaaaattag ctgagtgtgg tggcgtggca cctgtaatct cagctactcg 125641 tgaggctaag gcaggagaat cacttgaacc caggaggcag aggttgcagt gagccgagat 125701 catgccatcg cactccagcc tgggcaacaa gagtgaaact ccatccaaaa aaaaaaaaaa 125761 aaaaacacta ctgaattgta tagtttaaac tagtggatct tagagtatgt gagttgtaac 125821 tactaaataa atacataaaa aagaagtttc attatagcca gtgtagctcc cccagggagt 125881 aaatgggagc atctgatccc tctcagtttc cttgtgatag taatagaagt taccgaagca 125941 aacactgcac aattggaatg tgctttattt cagggaaata taaagggaaa tgaatgctat 126001 tataacttgg tagaacagaa gaaatggcta cctagctttg ctttccaact acaaacataa 126061 atgaggatct cagcatttaa ggtaaaacat gataagcaca aaaggagagt tcactgggga 126121 ctggactccc tcatttactc tagaaattat gagaaccagc agcaatattc ctcaagcatc 126181 catctcaaca tcaagttcct ttgttttatt taccagatga ccaggaatca tagatgagtt 126241 tggctgcaac tgtgtcttcc actgccattc ctagaataga cagaaatttg gttcgccttt 126301 ggtcaaaaca acttttcttg aaacaaccca ggccccatgg ctggaagttt cctgatacat 126361 gtccatgttg ccaatgccta ttggaataac agggactgat acccagagat gagctcaggc 126421 ttcaattgtc tgcaaagggg cagagaatat gatagtaaga aacacccacc aacaaattgt 126481 gcatctttct aaatctagct cagggctgag caaatttttt tttttaattt tttttttttt 126541 cgagatggag tctcgctctg ttgcccaggc tgaagtgcag tggcgcgatt ttggctcact 126601 gcaaactctg cctcccagga tcaagtgatt ctcctgcctc agcctcctga gtagctggga 126661 ttacaggtgc atgccaccat gcctggctaa tttttttttt ttttgtattt ttagtagaga 126721 cagggtttca ccatgttgac caggctagtt tcaaactcct gacctcaggc gatccgccca 126781 cctcggcctc ccaaagtgct gggcttacag gcatgagcca ccacgcctgg cctgaacaaa 126841 cttttcaaaa agggtcagac agtaaatcct ttaggctttg tgggccacag ggtctctatt 126901 acaattattc aactctgcct ttagagtaga aaacacagcc aacaggtaaa caaaagagcc 126961 tggctgtggg ctgtgttcca gtgaaattat tgatgggact gaagcttgaa ttttgtatca 127021 ttttcagaca tcatgaaata ttttctcccc ccattcattt gaaaatctaa aaaccatttc 127081 tagctcatag gccacacaaa agcaggcagc agagcaggtc tggccccagg actgtagttt 127141 tctgacgctt ggtccagttt gaatgctgca tcctttgata tctccccttt gtcttgtaat 127201 tttggtagat accttaacct ctgcacccag cacagataat taggtgcttg atacgcatct 127261 gttagatgaa acaatgcacc aataaactga tccccaaaag taagaaagca aaaattgcca 127321 aaaaggacag aacaagtttc tgcagcattt tggctcctta agtaagggct ggctgacctc 127381 tgatctgcca tgacctttga agttttcaga caaaactgga tccatgtgat gtctttgaac 127441 caggctccag ataaaattgt attgcctttg tcttccagtg catgagagat tttaaagaca 127501 tttaccaacc tgcaaggcat ggatacttaa ccaatggcat gggctctggg tggccagagc 127561 gctcctgatg ctctgccagg catttactct tttgctgctt gatcacgaag tatggaaggt 127621 catgagaggg ggacagaggg cactctcgag gggctcagga tagcaactgt ttaccaacca 127681 gtttgggagt atttcagtgt tttaggtgta tcagctctgt tcccatgtac tttatttatt 127741 tattttttga gatggagtct tgctctgttg cccaggctgg agtgcacgat gtcagctcac 127801 tacaacctcc accttccggg ttcaagtgat tctcctgcct caacctccca agtagctggg 127861 attgcagatg tgcaccacca cgcctggcta atttttgtat ttttagtaga gatggagttt 127921 caccatgttg gccaggctgg tctcaaactc cttagctcaa gtgatctgcc caccttggcc 127981 tcccaaagtg ttgggattac aggcgtgagc caccgagctc agcctgttcc catgtacttc 128041 tgtttccttc aatctcatag ggtggaagag gacctggggc aagggacaga ctggccactg 128101 ggagcccacc ctttgttggg cttctataag gtaccccata caactcaggt ggtagggggc 128161 tccccatgga agccctcctc aaagcagcaa gtgggagagt agggaaggag ggaatgaaca 128221 gggcttaggc tatactttcc atctgactct ttcaggttgg gtctctgcca tttcccttaa 128281 ggttcctgtc ttttaacatc tttgtcccct gccccctcca cctctgaagg gtaccttctg 128341 aatttcatca agatctgcac cgaactgatt cattttatat gcaaatccca cctaatcctc 128401 aatggtaaca ggtggtgagg aaactgaggc ctacagaggt cagatcactt gcctaaggac 128461 acctgccttt gagtggcggg gaagggactc tgacccagga ctgctcagtt ccagagttca 128521 tgggcttagc actaggctaa accatgtaaa taattgacct gaatgatgga gcacaatgat 128581 tcagctgcag agtctggtga ctcattgctc tttccaccaa tgggtgaacc tgccttgtgc 128641 gctagttgga tttgtgccca gggagcaatg gatcttaccc aaagacttga acacggtggt 128701 cttctcacag tgggctggtt tcactccctt aatcacttct cccagctcag caaagatctc 128761 ggcctaggaa acaaacatac gctgacccag gcatgaagct aaagtctgtg cagggttggc 128821 ttttgaagcc agcccaggga attcctgaat tggataagag ataaatttga ttaaaaaaaa 128881 aaaaaaaaag agagagatcc tttgtacact gaatgatatt agggaggagg ctggccccga 128941 aatccttttg gggtggcatt tgcttttgtg ataccataaa cacagagatt cgagttggtg 129001 ctcagggagc tgtcctgcca agatcattgt gtcagtgagg aaatagacat gctgttatct 129061 gaacatgcat gccagcagct cccttcttgt tttatgctga ttataaccgt aaaatgaggc 129121 cagaaggtgg gcaacacagt cttatcatgt tggccacatt gggaaggttt ccaggctgca 129181 agaagcttcc aatcccactt ctttagagga cagcccagca aaaaaataaa atccactgag 129241 cttctccttc caggaacaca catgtcacca cccataatgc ccatattttt ctggaatgga 129301 attaaagagg gttgcaaaat gtttaggagc tccttcccct acaaggaggt ggggtggctg 129361 aggtctggag cagaccaaac tggaagggaa gttaatgaaa ccctgggatc caggtgaggc 129421 cagccaggtg gcagccctga ctggggtcag aagggcctca cccctgacag caggacatct 129481 ccagactcct tcagggcagc ctcctgggaa tccacgtaca gcacagcttc tttcatgagc 129541 tcatcatcca gttctctcca gtcaggtctg ctggctccaa cagctaagag acagcaaaac 129601 agaccttaag cccaggatta gtgccaaagg ctgctaagaa gcccaaagca caggagagag 129661 gtgaacaaag gactcgtgaa cacactcaga ttcaaaatac cccctcattc cccaggccat 129721 gctctcccaa tcagtaggac gaaatactta ttaagaaata ggccaggtgt ggtggctcac 129781 acctgtaatc ccagcacttt gggaggctga ggcaggcaga tcacttgaag ccaggagttt 129841 gagactagcc tggccaacat ggtgaaaccc tgtgtctact aaaaatacaa aaaattatcc 129901 aggcgtggtg gctcacacct gtagtctttg ctactctgga ggctgaggca tgagaattgc 129961 ttaaacctgg gaggcggagg ttgcagtgag ccaagatcat gccactgcac tccagcctgg 130021 gcaacagagt gaggctccat ctcaaaaaaa caaacaaaca aacaaaaaaa actacctacc 130081 attttgcctt tctattctat atcccacctc attgctaaaa gattttgaga cgacatagca 130141 ttcatttaca gggtgtctac tctctgtggg atgttttgca cgtactattt atggttctcg 130201 caagtacatt tcaaagtctg gttctcactt aacagatgca cgaatggagg ctcaaagaag 130261 tgaaggtgtt tgccaaggtt aaggatctga tgggtgaatc caggtctgag agagcaccta 130321 ggcgctcaca cctcttctgc ttaacaggaa caccttcctc acaaaaatcc cgaaacagag 130381 acatcacttg gtgcttattc acacctcact ctcctcttga tctccatcat tcccattcct 130441 caccctcaac cagttctttt aatatctatt tttttctttt tggagagagg gtctcactct 130501 gttgcccagg ctggagtgta gtggtgcaat cagaactcac cacaggctgg gtgcagtggc 130561 tcatgcctgt aatcctaaca ctctggaagg ctgaggtggg tggttcactt gaggtcagga 130621 gttcaagacc agcctggcca acatggtgaa accctgtctc tactgaaata caaaaattat 130681 ctgggcatgg tagcacacgc ctgtaatccc agctacctgg gaggctgagg caggagaatc 130741 tcttgaacct gggagttgga ggttgcagcg agccaagact gcgccactgt gctccaaacc 130801 aggcgacaga gtgagactcc atctcaaaaa acaaacaaac aaaaactcac cacagcctca 130861 aactcctgag ctcaagtgat cctcccgcct cagcctccca aactgttggg accacagacg 130921 cctgccacta tgcccctttc cctctcctca cccccggttc tctcttaact catgacaaaa 130981 gaaaaaaaac aacaacaaca aaaactcagt gtcagggtca tggagaagtc tcgtcagaaa 131041 gaaaactggc tccgtcctgg ccctgaactg tcacatcctc tatggagtcc aggaccagaa 131101 tgatgaatcc aaggtgtcac gccccctggc accaggacac aggagaatgc agaggctgcc 131161 accaaatgcc tgcaaggcca gggaggtaat gaaaatgaac aaagctggtc agagggactg 131221 gattagatgt ggtggagact gaggggcggg gaagccagca ggcccttccg aaggggacag 131281 tcattgttca gctccagcta atcattacca cgtgtgaatg tagatccact gttaccagat 131341 atgccccagt tgttttctta gaagagaaac aggaaatcta tttgcatatg tcactccttg 131401 agttttagat tactctatgg tccaaaccaa acacatctgc tggctggatc cagcccctgg 131461 ctggctagtc tttgaaaact gctataaagc cttatacaaa tataagacag catttatatg 131521 ccttcttcca ctgccgcccc gttttctccc tttgttccct cccgttctgt tctccctcat 131581 tctaatttcc aaagagagtt gtgtaaaaat gtccatcttg aaggttgttt gtgcggagag 131641 ttctatagaa caaaagcact agcttaagga agcttgtaaa taaatcataa gctggtggga 131701 acaaaacccc cgtttctcag gccactcctt ctcactggac tatttcccct gcacaaccac 131761 tgcttatgag ccgggcgagt tctcaaaggg ttgtgtgagg tcactcaggc ggatggaggt 131821 gggccaggca gtaaagagag agaagcctga ctcaaacaag gctctggctt ggttgtgatg 131881 gggctaaatg agtgagggtc gcagacaggc ttgacagggt gtgggggcag gaggagggag 131941 gcatggggca gggtggaact gggcctgtcc ctccgctgag ctgtaaacta agtaaaatct 132001 gttgttggta tttccctgtt gtctgcgctt gtgacacatt tttcttaaat gtatgtgttg 132061 cctctgtgaa cacataaacc aattttgtcc ctgggccagg aaaacagaag agatgccttg 132121 actcaaaatc aagtcagccc ttcctggaga atgattccaa gtgagagagg gatccaacat 132181 aagggagatt gtctgtggct ggatgtgagg atggagagtc catatggtgg gaaatttggg 132241 tcatctctgg gagctgagaa tggccccaag gaaaggggct ccccccatcc tccagtgtaa 132301 ggaactgaat cctgccacac catgggagcc tggaagagga cccccgagca tgagatgaga 132361 ttgcagcccg gatgttgccc aatttctttt ttgtgagact ttgcacagag gatccagcta 132421 cccacaccca gtctcctgaa ccacgggaac tgtgagatga taataatgtg ttgttttaag 132481 ctgaaaaaaa aatcaagtcc atctttctga agatgcagcc ttggaaaccc atttcaccat 132541 gaactggttc cttttgattt cccaggtcct tccagaaatg gtcttattgt ccataaactc 132601 agccctgact tacctcttac ccattccctg gacccagtct gaccaccagt cccatgcagc 132661 ctgagacttt caaattactt tgatccattc ctcaaacacg ccatgttcgc ttccacactc 132721 aaggcctttg catctgctgt gccctctacc ctgaatgcct ttccttcagc tcttggcatg 132781 gctgctcttc ctcctcattc aggtcacagc tcaaaaaaca catcttagag agcttctcag 132841 ataaccctat tgaaaacacc tctctctgct tctaccttcc acctccacca ccagcctcta 132901 tcacccaggg ccttgttctt tccggtctca tctttagcca gaacgatcta gttcctctat 132961 gtgtttgttt gttgtctatc tctcctcact gaaattaacc accatgaaac cagtgacctt 133021 gactgcctgg ttattgtggc atccctggca tctagaatag tgcctggcac acggtagaga 133081 ttcagtcagc atgtatgatt gaatgcgtca tcatttgact tggctctcgc cggctcatct 133141 gcccacgtgt gcaaagctct tagcttctga gttcacacca tgtgcttgtc actaaaacac 133201 agggcaccaa gaccatcttt aaatcctcta aaatttgcct gctctgctca ccagaatcca 133261 gtcttcctgt tggaaggcaa tcatgagtat catttaaagc ttggactctg aggatagagt 133321 gcctccgttc aaatcccagc tctgccatca agaacctctg attttgggct tggcacagtg 133381 gctcacgcct gtaatcccag cactttggga ggctgaggcg ggtggatcat ttgaggtcag 133441 gagttcgaga ccagcctgga caatgtggtg aaaccccatc tctactaaaa atacaaaaaa 133501 attagctggc atggtggcgt gtgtctgtaa tctcagctac tcggcaggct gaggcaggag 133561 agtcacttaa catgggaggt ggaggttgca gtgagctgag atcatgccac tgcactccag 133621 cctgggtgac agagtgagac tccatctcaa acaaacaaac aaacaaacaa acaaaaactt 133681 gtgactttgg aaaagttata taatttctcc atgccttggt tcatccccct ataaaatgtt 133741 tatgataata gttcctgcct cataagttgg ttataagtat taaatgacat tatacataga 133801 ataatgtcat ttaatataca aagaattata caaagaattc atgacagcac ctgcagtagt 133861 attagatgta gaccagctct tgctataaca cccagggcca tctgaaggtc tacagaggcc 133921 ccaggcaccc tcatgtctag agccccctct cctgcccctc atcctatcac acatattaaa 133981 aagcatatac atgtcacata aaaaattaca gtacaaaaga aatttaaagg tttataattc 134041 ttagtgttaa gaaaaagtga ctcatttggc ctttctgatt taaatttcat cacattttag 134101 aattgagctg tgaattacca tttgccataa ttttacatgt tatggtcagg cgtagtggct 134161 catacttgta atcagtcgca gcactttggg aggctgaagt gggtagagtg cttgaggcca 134221 ggagttcgag accagcctgg ccaacatggt gaaaccttgt ctctaccaaa aattacaaaa 134281 aattagctgg gcatggtggc acatgcctgt agtcccagct actcgggagg ctgaagtggg 134341 agaatcgctt gaacccagga ggcagaggtt gcagtgagct gagatctcgc cactgcactc 134401 cagcctgagc gacagagtga gaccctgtcc caaaaaaaga aaaaaaactt acattttata 134461 gaacattaaa tattctttag tttcaggttt gtaatttccc ttttacattt cttggagcaa 134521 atattgtttg tttgtttttt aagacagagt ctctctctgt tgcccaggct aaagtgcagt 134581 ggcacaatct tggctcactg caacctctgc ctcctgggtt caagccattc tcatgcctca 134641 gcctcatgag tagctgggac tacaggtgca agccaccatg cctggctaac ttttgtattt 134701 ttagtagaga cggggttttg ccatgttgac caggttagtc ttaaactcct ggcctcaagc 134761 aatccacctg cctcagtctc ccaaagtgct gggattacag gcatgagcca ctgcacccag 134821 ccaaatattg tttttttcaa tctcacatgt tttcaaggcc cttaaatagc ttgggcattc 134881 tgcatgaata aactggctct gcaaatgtag tcgactggtc ccgtctagag aggagcctgg 134941 ccaccaccca tcatgcctta tggagccact ctacttacca ttgatgtgag cccctggctt 135001 cacccattca ccaaacaaaa tgggctctgt tgccagggtg actgtgatga tcacatctgc 135061 acctgccaca gcctcctgga ccgaagaaca gacccgtacc tctccttgca ctgtgtctgc 135121 aaacttctct gcattttctt tggtgcggtt ccatatcctc accttcattg ggagtaacaa 135181 gaaggatatt ggcgccattt tttggctgct tatgttacac tgagcccagg gatgcggagg 135241 gaaagttgaa ctactctggc ataggggtca gtggttatct aaatcactcc taaatcattc 135301 cccatggaga atggacattt gatcaggggt acagcattaa cttagacacc tagccaggta 135361 tttgaaaatc agcgttctct accactttcc ccttttaaaa tgagtggaag tatttcactt 135421 ctaggccaag aggcacagcc tcagtaactt catagctatt atatattaat agcataatga 135481 ttagcagagc aagttctgga gccaggtagg cttggaacca atctccactc tgctatttcc 135541 tgaaattgct tcatgtcatt cagccttagt ttccacatct gtaaaatgga aatataactg 135601 gcaaagtctg tgtggaactc acactttatg tctgagaccc cgtaaaggag ttgtaatggg 135661 tgggagattc atatgaagga ttatttatga taatgttttt catggaataa cttgatcggt 135721 gaagaatact taggaaattc aagatcagac gagtttgaag gtaaatggca tagccaatga 135781 aaaggccaag aatggtagct acggctggaa tcctgaaagg tgctggggtt ggagggtggg 135841 gcagcggcct caccaggagc tggggacttg ccgacttgct tgacaggttt caatgtttgg 135901 gtcaagggct tgtcacccgt caactgaaaa taaggcctac agaatttctc ataaggaaat 135961 ataatacata cggaaacaaa ctttatgtct aagcaagttc accaagcaag gtgaaaaaca 136021 ggacattttg cccggccgga agtgcatggt taacctcttc acccttcttg atagaagact 136081 atgcaaattt tatgcaaaaa ttagataacc tgaaaagtga tgttagaaca ttcagtaaga 136141 gaggatataa aatggtgcat atctaacgtg actaaagcta tgttataaaa agctgaagag 136201 ggaagatcag aaaacattaa ctgtctttga aaacagatga aagggcattt ttcttttgtt 136261 ttcttccttg tattaaaagt aatggcaact gggcgcagtg gttcacgcct gtaatcccag 136321 aactttggga ggacaaggca ggtggatcac ctgaggtcag aagttccaga ctagcctggc 136381 caacatggtg aaaccccatc tctactaaaa atacaaaaat tagccaggtg tggtagtggg 136441 cgcctataat ccaagctact tgggaggctg aggcccaaga atcacttgaa cccaggagat 136501 ggaggttgca gtgagccgag attatgccac tgcactccag cctgggcgac agcgcgagac 136561 tccatttcaa aaaaaaaaaa aaaaaaaaaa gtcatggcca aaactgcaat tacttttgca 136621 ctcacctaat agtctatatt tctcaaggca gtggttttat tttcatcatt aaatcatact 136681 ttaaaactgc tccatccctt tcatacacac acataaaata acaatagcat caaaaattcc 136741 cagggggaag catgagacgc agtgggggga aggtggacca gtatggagac tgcaaaagag 136801 agaggctgtg gcctcctggt ctccaggtct catcccaccc agtttcttga tccataacct 136861 tgaatactta aaaactctgg attctcctaa gaaaaacata atagacagga gctaaaagga 136921 caactgaatg tatacaagag atataataga atcttttaac cgagtaagct tgctgtatcc 136981 tcagacaaat gccctctgaa ctgaggactg gaaacaggta atcagatctt cccctgactc 137041 ttatcctcca tcccctccat cagggcttct gggaagtcat gaagatacaa gaatggtcta 137101 gatggggctc acaacttaat tatttcagaa taacctgtgg tgcagtgaag cagtatagac 137161 aggtcaagga ccaccccctt ccctcttctc tcccaccccc acccctggac ttacctcctt 137221 aaaggagaac tgctctgtga agatctcata atggctgtag gcctggaccc cagccccaag 137281 gatgcacagc acttcactgc tgggaggttt cagaaactat atgagagaaa tgaagtggca 137341 aaggtcaagg gagaccctag cagacgggta aaaaactaag atggtttgag tatcaggtga 137401 cttctgccct ctcttctgcc tccctcaaga gcttgcaggc aatcaagaga atttcctttt 137461 gtgtctagtt tcccccgtat caagtgggaa aagaaggaag aaggcttcac tgaacagcta 137521 atgtatgcca atgctttgcc cccatgatct catttagtct agacattgac cctacgagtt 137581 ggggactcct actatcttct tcttacagat gaagagactg aggttgccct ggtaacagaa 137641 aggtgaactt gctctgggtc ttacaactac tttttaaaat tatttattta tttctttttc 137701 tttctttctt tcttttcttt ttttttgaga tggagtctcg ctctgtcgcc taggctggag 137761 tgcagtggca cgatcttggc tcattgcaac ctctgcctcc tgggttccag tgattatcct 137821 acctcagccc cccgtggagc tgggattata ggtgtgcgcc acaacaccca gctaattttt 137881 gtattttcga tagagatggg gtttcaccat gttggccagg ctggtctcaa actcctgacc 137941 tcaggtgatc tgtgcacctc ggcctcccaa agtactggga ttacagacgt gagccaccgc 138001 acctagccag gtcttacaac tattgaaggg cacagtgtgg ctctaaagtc acgtcttcca 138061 tactccaaag cccatgtcct ttcttcatat ctcaggtaca tgctcagcca tacccacttc 138121 tcagagctct gagctgcttc taagagctgc ctggattctg tgagcccttc cttcccaggg 138181 ttctagctat attcactggg ccaactctgt tgatcttttc actgcctctg tcttcctcta 138241 aagacgaatg tagaattagc atttgtggca tctcctgagg gtgtctgtgt ttgagagcag 138301 gattctgtga tcaactcacg tccagtgcca cctctccaat gaagcctttc tgacctcctt 138361 gatagaatgc accaggccat agcacttatt tgtttgtaga ttttcttcta caggtcttta 138421 agccccatga ggacagaggt agtcttattt gcacagaact ggtacatacc tactatgacc 138481 tacaagaggt agcactgggc ttctcaacct cggtactact aacatttggg ctggacaatt 138541 ctttattggg gaggggcacc aacctgtgta ttataggatg tgcaacagta tctctggttt 138601 ctatccatta gatgttagta gcattttcca ccccttcaag ttgtgggaac taaaaacgtc 138661 tgcacactcc taaatgtctc tggagcaggt ggcaaactca tgctcagtgg agaaccactg 138721 gcatagagga aagacatcac ttggccctcg acaccatatg tattaaataa aaccctcttc 138781 tgccttttct ctattccatt tagtaaactt tgagttccct gtggtctgtt cttttataaa 138841 atgctactca ctggaatttt ggagtataat atttaaaaag tgaaaagtct tcttaactac 138901 taattactag caaaacagca tctctgaaat attaggaatt aattttatct cctgcagcta 138961 ccgaaggaga gataccattc tgtgccataa aatatagttt aacgtctctg atttgaaaag 139021 gtttcaaaat gataatatcc acaagactag aaatcttatg ttgctgtctg ataaggataa 139081 ttagattttt ttccttaatt gttaccattt tcttctgttc tcagttaata tgaatctatt 139141 cttgaaagta caggtgagct tcagaaattg ttacagtatc ccataaaaat aaccttcaac 139201 ttcctaaact aaatgtgtaa ttctaattgg ttgtcttact tgataatttg ttgccctggt 139261 aatagtgtga taattgatat ttttaaaatt tttttttttt tccgagacag tctcgctttg 139321 tcgcccaggc tggaatgcag cggtgtgatc tcagctcact gcaacctctg cttcctgggt 139381 tcaagcgatt ctcatgcctc agcctcccga gtagctggga ttacaggtat gtgccaccat 139441 acttggctaa tttttttttt tttttttggt aaagatgggg tttcgccatg ttggccaggc 139501 tggtctcgaa ctcctgacct caagtgatcc acctgcctcg gcctccaaaa gtgctgggat 139561 tacaggtgta agccactgca cccagccgat ttttaatcat atgtttataa gatttattaa 139621 acagcattag tggcacttca atcttttctc ctaagcatgc aggttattaa gtacacgtat 139681 atgaatctgc tgggtttctc atttaaagag tctgaaagtc tgtttcattt gcatttgcca 139741 aggccttgca tggtctggcc ctcctatctc taaagctgtc tctcatacca ccatgccatg 139801 ttcccctgca cttcctgtct tccagccata ctggcttcct tgcagcttcc caagctcatg 139861 tacttccttc aaccacaggg cctttgcatg aattaattcc ttgccatacg atgctaggct 139921 gctttgtccc ctctccccct cacaagtaac tgattatcct tcaatctcag ctcaaatatc 139981 ccctacagag acacagccac aggctagatc aagacatact gctttacact cttacagcac 140041 ttgtcagcgt tgtagtttca tatttattgg tgtgaatttt tttttttttt ttggagactg 140101 agtctcactc gttgtccagg ctggagtgca gtggtgtgat cttggctcac tgtaacctcc 140161 gcctccagga ttcaagcaat tcccttgcct cagcctccca agtagctggg attacaggca 140221 cctgccacca tgcccagcta attttttttt tttttttttt tttttttttt ttagtacaga 140281 tggggttttt ccatgttggt caggctggtc ttgaactcct gacctcaggt aatctgcctg 140341 cctcagcctc ccaaagtgct gggattgcag gcgtgagcca ctgcgcccag cctattgatg 140401 tgatctttgg actatatctt gcctccccca ctggaccctg tactacatga gggcaagtac 140461 tgtatctggt tttgcttacc ccatgctgag cgcagaacct tgcatttggt ggacactcaa 140521 atatttgttg aataaatgta tgttagtgct aaaccctaat atatactatc accttgagaa 140581 aacatatcag aggcttcaat gaagcttatg tgagacccaa gcacaaaaaa tgtattatgt 140641 tcatcagacc ctagagctcc gtatacttat tctccccaac aagagcccta aatagaagag 140701 aaatgatgga caagaaaata ttctactctc aaggcaacaa gtaaaataga ccctttgttc 140761 cttattgaaa cagttcatct gagcccctgg aaataaagct gtgaagtttc agcctaaact 140821 ttgaagtgtc taaactttga agtgtcaacc caaggcttct tgattcttct gggagcacag 140881 atttttatta atgcaacaaa gcagtgttat tgcaggctgg gtctacaaaa gtgtggtttc 140941 cagacctggt tgatcctcac tctgtgattt tgattcagta ggcatgggaa tttgagtttt 141001 taaaaacttc cctagctaat gctgctgcat agcttgcttc tgttggaaac ttcttatctg 141061 ggacaagacc atgtggcctc aaaaggaccc gcaggaacct ggttgtatga cttcaggaaa 141121 cccaagtttc tattctttta tgcgtaaaat gaggccatta atactttctc tgccttcctc 141181 acagaaactc gtagtccaga caaggggctg gaaatgagtt tgaaaaagac taaagctctt 141241 tgcaaatgac agtaatacaa gattcattct gttcccaaat taccagcaaa catagcaagg 141301 agtccttgat tacttcagtt gagtcaggtc tggatataac taaaaaccag ttcattgagg 141361 tgggtgctgt tggtctactc ccttcgcata caattaccag attaatcctc ccattcttcc 141421 ctccccagca ggtatctaga aaacctcgac atgcaacctg aacacctggt ctttgctgct 141481 gcaactttcc ctgggttacc acgtttcctt tataataact taggaaaacc agtgtcccaa 141541 gaccttgtac ttccacattt aatgtgaaag actccagact ccaatcgttc ccacagacac 141601 cctgtttcct aagcctcatc ctcttaactc taacttttaa aaatgctgtt tttcttatag 141661 gatgccttga cattgcttgg gaacatcatc acccccattg cttactgttc cctctgcctg 141721 gagtgcccct ccgtgttttt gcatggttcg ctaactcact gctttcagct ttgactcaaa 141781 tgtcaccttc tcagggaggc ctttcttgga taatgtaaaa tgtgacagct tctccaatac 141841 tctcacttct gctttatctt cacctaacat gttatgtggc tttttggatt attgttggtt 141901 ttatttttta gcaaattcta agcacgctat taatcattta ttaatatgta ctaattattt 141961 aattacatat tatttattac tatttaacac attattaaac tcaacagatg tttttgtttg 142021 ttttgttttg tttccttttt gttttgtttt ttaagacact cactcttttg cccaggctag 142081 agtgcagtgg ctcgatctgg gctcattgca acctctgcct cctgagttca aacgtttctc 142141 ctgcctcact ctcccgggta gctgagatta taggcgccca ccaccacgac cagctaattt 142201 ttgtgttttt acaagagacg aggtttcacc atgttgacca ggctggtctc caactcctga 142261 gctcaagtga tacgcctgcc tcagcctccc aaaatgttgg gattacaggc gtgagccact 142321 gtacccagcc tgtgttttcc tttttattgg tattgtctac ctcccctgtg tgtgcccctc 142381 tgtttttctc actgccattt ccctaacatt tagaacagtg cctgacacat gatagaagcc 142441 cagtcaatac ttgtcgaatg aatacaggga tctggcttgg atatgttcag ttcactcagc 142501 ctcttgtgta gtactagctt tggtctatgg gacaattata ctgtcttacc aggggcaatg 142561 tggtacaggg acaagaagcc agtcctggaa gccagagagg tcaagtatta aattcttact 142621 ctagcactta ttagctgtgt gatcctgggc aattcacaga agctctctga gccatattcc 142681 ttgacctgga aggtggagag aatttctcat ttcgattctt caagactgat gggaagattg 142741 agaattatgc acataaggta cttgaaatct ggtcatgcac actaattggt agctaataat 142801 ttcgtgaatt actgatatat ctgacatctg gagttccagc tatgtccaac aggaaaccaa 142861 aataagttcc ccacttctcc tttagtctct caagattgag acaccagacc ccacttgaca 142921 gctcacctta atggtacagc cagccattca tcttaccttg gtggcaatgg cagaaactgc 142981 agctgttctc tttgcagtta tgacatttcc atccatgacc ttggaggaaa agagagacag 143041 tgagcaaggg gaacccctgt ccactgcata aattagaaga aacagttcta aaatattagc 143101 ttaggaagtt ttatagcatc ctacaaacag catgggtttg gcgtcagacc tggatccgat 143161 tcctccactt acgagatatg caactttgaa caaacacttt aatctttaag cactggtttc 143221 ctcacctgta aaaaagataa tgccagccat tcagagggtt attgtgagga ttaaatgcac 143281 tggcatatct ggaaacatct agcacatggt gagccctcaa gaactgccca ttttcttttc 143341 ttgccctagg attctcaaaa caccaaatta agaaggaact agagagagaa ggttgccaat 143401 tgacatctgg acaacggcca tgaaactctt agtggctctc cgttcctgct gcacatgtgg 143461 aaagcttcgt gaagatgctc agataccacc cttagattaa cccagtctgt atctcttcca 143521 ggagtgagac ccagacagcc caggtgattc cgatacacag cgaaggttga aaattatgcc 143581 tcgctagtaa aaaccatcct ctgcactcct ctccctgtta ttcatctcag ctgatcatca 143641 aaaaataggc atcatcaacc tcattttaca gatgaggaag tgaatcacga agtcactgag 143701 cagattcagc tgtcatctag tttcaaagcc ttgggttcta cccccctagg gaaccgctgt 143761 tggcaggtag tatgaatgcc atctcggttg gatgagtgta agttggggaa agctagatat 143821 tctcaaagga gtgttaaacg gccagattat ttggcgatcc agaaattcag gggaaaagga 143881 gggacattgt aagacccagc ttggagcgca cagaccttaa gagttaactg tcacacctgg 143941 gagtgattct tgagtctttc actaaccaga tgtgtgacct caagcaaagt gactctacta 144001 acctgagcct caaggacctc aacagaaatt ggggggaagg ggagttaata aaaacaccca 144061 tagaggaatt ttgcagaatt gaatgtgatc agcgtctcct tcttggcaca taagtgccca 144121 atatgtagca gttaaaatgg cggtggtctt agaagagtct ggacctatgg accgtggact 144181 tagactgagc agctctgaag tccaaccatg ctaattgttg gttacataaa cctcggtaag 144241 ttgaacatag ttgctctgac tcggtttcct catctataaa gtgggagtaa tcatattacc 144301 catatcatag aatagtttta agtattaaga ggattaataa atgtgaaatg tttagaaaag 144361 tgcctgatgc attataaaaa gtactccata aacacaagct actgctacct tttgaaacta 144421 tgattagata ttaaaacagg tcacaccgta tacccctctg aagactccca gggattatag 144481 aacacccctt cttttccatg tcgacaagga ttcagtaagt aagtcttccc ttggctaatg 144541 ggttcattca agctgcgtat ttggggcaac tgagtatagg acaaggacat atgcaagaat 144601 gacaacaaga ttattatttc cacactgtca catgcttgag gcagcagtgg gcgctggtgc 144661 cgtgttatgc atcctcaggc agtcctagaa atagatacat ggacacagat aaccttcact 144721 gttgctggta tccagtcact tgcagagggg cacgcgtagt cacaatcaag actccccctg 144781 ctgcggagaa cttacttttg agcttcaatc tgggcccagg ggccccattc caccccggga 144841 caggaagttg ctcaccgcca gcagggtgcc attgctgggc tcaaagagta gcacagtagc 144901 ctggtgggaa gggacgaccg aggtgatgcc gcggtcctcg tagaaggtga ccaacttggt 144961 ggtcagtgca tcctctgcag cactgtaggc gggcatgacc cccaggtagc tacaaggaga 145021 gagggagcag cttcagcccc ttcccatcaa tgctggggtc tcttagaaag tatccatata 145081 ctttcctttt tctttccttc ctcaccaccc cactggccct cttccagccc ccgcatccct 145141 ctgcccttcc cttagacact atgcacagta tgttcagaag tctggctctg gatcagaaaa 145201 cctttccagc tctgccactt agtagctgtg taagcttagg taagtcactt aacctctctg 145261 agtctcagtt ttccaccaat caaatggtgg tatcaatacc tccctaggtg gagagtgtgc 145321 tgaaataaac aaggtaacac ctgtaaaacg cccacttagc acaccgggta gcgcctatta 145381 aaagtggcag ctgttagcaa cggttaggca agccgtctct tcccttctcc cacccctcct 145441 cttcctgctc ctcctttccc cgctttccag tttctcctgc ccctgacccc gaccctcctc 145501 actcaccccc tgtgcttggt caccggcacc acggtgcgca cgggctgcat gacccctcct 145561 tcgggaccgc tggagaagtt ggccagggcc gtctctagag gcgggatgag gaggctggag 145621 ctgcggaggt gttcctccac ctcggccgcg ctcaggaacg ctggtacccg gctcatctcg 145681 ccacctgtgc cttctaacct cagtctccgc accgcgatcc acgtaccctc ggctgtgccc 145741 tttatgagcg cgcccgctgc tctgtggagc cgcctgctgg tcacagccca gcctccgggg 145801 gcggagcaac cccgccccgc cccgccagcc ctctcctccc ccttcgccca cctcgacccc 145861 taagggtggg cgagatgggt cccttccagc aacggattcc ccaagatgga catcgcggtg 145921 gcgtgctggg agtcaatatt gtttggtcca ctttctcagt tcctgtaatc agctgtagcc 145981 acattccttg acttttgctt cctacataac ccctccctat taactcgctt tttgctgctg 146041 tctaggtcac cgtgcctggg aaagaatgga cagggaaaac gctaacgggg ctcctttatt 146101 tattaaatta ctgcctactg ctaggagata aaaggagacg gaggcttatt taattttttt 146161 ttccttttaa ttctgtcccc gtctagagtc tcttatccta gtcttcctcc cccctgtacc 146221 cccagatgtc tggcctttga ttgctaagaa cgcacagttt tctcacaggc taactaacct 146281 aacaaacctt cgcctttaaa ggggggaagt ttgcagggca aggtgtaatc ttttccatct 146341 cacaaaaagc caaattgcag gatcttttct tccaggagtt tttacctgag atggggaaat 146401 aaaatcggaa aggaatgatg taaagcttaa ggcaataaat cctagtcatc aaatcagtga 146461 taggaaaagt aagttaattc aaagaaaaga gagatgattt tgacctggag tggttagata 146521 agggcttcat gaaagagtgg ggagaaacca aggtctgaaa taataatagt gacgacaaca 146581 atgataacat caggttattg atctttacta tgtaccagat actgcaccat gcactttatg 146641 tatattatct catttaatta tacaacaacc tctaaggtag ggaggtgttg ttattagttt 146701 tactgtccat tggaggctct gagaagggaa gaaatttagt tcaaggtcat ccaccaagtc 146761 agcagaggag ctggctttgc ccatggttgg acttcagaat tatattccta atcattaagc 146821 tactttggca gaaagagagg aagagggcat tacaggccaa gtaaagccct gaggtggaag 146881 gcatgttgaa tagctctggg aacagtgagt tgccagatgt gttaggctgg aataaggtgc 146941 tgaggtctag gagcatcaga agataaaact gagaacgatt cttggagtca ggcagctgag 147001 aatggtgatt cctaggaaag aaattacatg ctccaagaga gtaagcattt agaaagatgg 147061 gtcttctgcc aatgcaaagg atggaacaga tgagcgagag cagagcctgg gagaaggtca 147121 agaggcagga ctgctgaagt tgtgctttgt ggtattttgg ttggaagcta catagggcat 147181 tttgtaggaa atggtcacat catgataaaa gacagtggat gagtaagtgt caaaatcagt 147241 accaatgaaa tgagaaattg tacataatgt gtttagcaac agtgcttggc acagagtagc 147301 catgtaataa gtgaacactg agagagaaag agagagagag agaagacact ggtggttggg 147361 aaaagctgat ttcaatgaca caattttgtt tcttcggttc accatcccta gaaatttatg 147421 cccagagccg ctgggcttat agaaatgttt ctggcccatc acgtgtgctg ttagcattaa 147481 tattacatca catttattgg gtttagaaag cacagacata tagccctggc atgatagttc 147541 acgcctgtaa tcctagcact ttgggaggcc caagtgggtg gattacctga ggtgaggagt 147601 tcgagaccag cctggccaac atggtaaaat cttctctcta ctaaaaatac aaaaattagc 147661 tgggtgtggt ggcgggtgcc tgtaatccca gctacccagg aggctgaggc aggagaattg 147721 ctggaacccg ggagatggag gctgcaatga gccaagattg caccactgca ctccagcctg 147781 tgtgacacag taagactttt ttggcccttt gtttttgtga atgtgaacta aaaataaaat 147841 cctatgcccc ccacctactg aatgaacccc ctctgggcca agcagacccc agaaaaacct 147901 taaagactta gtttccagcc atgatgggat ggcaaatcag atatgcctca ttacatcctc 147961 tcccttttag agtttagaca caactgagca gccttaatgt taatacagag atcacaagac 148021 tgacagaaca gactctttgt ggcaatacga taacaaatta taaacaggat ctaaggccat 148081 gccaggaagg gttaagtcac gcacccttac acttaaagaa aaaagctatg ttctaactgc 148141 cacagagttt ttgttttctc cagcagctaa acaagcactg gcttgaagat aagcaatatt 148201 tttgtttttg ttttccccag ccctgctcag agaagatagg gaacatggct tgggcacagt 148261 ggctcacacc tgtaatccca gcactttggg aggccaaggc gggcggatca tgtgaggtca 148321 ggagtttgag accagcctgg ccaacgtgat gaaaccccat ctctacaaaa agtacaaaaa 148381 ttagccaggt gtggtagcat gtgcctgtaa tcccagctac tcaggaggct gaggcaggag 148441 aattgcttga accagggagg tggaggttgc agtgagccga gatcgtgcca ctgcactcca 148501 gcctggacaa cagagtgatt ctctctctct ctcactctct ctctctctct ctctctctct 148561 gtatatatat atatacacaa acatatatgt atatacatat atacgtatct atgtatctat 148621 atagatatgt atatgtatct atgtatctat atagatatgt atatatgtgc atatatatat 148681 atataaattt tttttttgaa acaaagtccc tctctgtcac ccagactgga gtgcagtggc 148741 atgatctcaa gtgattctcc tgcctcagcc tctcaggtag ctgggattac aggtgcatgc 148801 caccaccccc ggctaatttt tgtactttca gtagagatgg ggttttgtca tgttggccag 148861 gctggtctcg gtctcgaact cctgacctga gatgatccac ccgccttggc ctcccaaagt 148921 gctgggatta caggcatgag ccaccgtgcc cggctgaaga tatgcaatat taaaacaatt 148981 acaactcatc caaatcacag atattgacta accgacccct tgttccacca gccataacta 149041 cagcttgatt ggacaagaga ctgatttcag taactttctc ctgataagac taccgactat 149101 aggatggttc tggctgactt acagaggttg cacacttgca tgccttcatg tcctgaaaag 149161 accttttgac gtatcaggct gaattgtaat acatttaaat gctgcaacca ccccaaaatg 149221 aacatgagtt gtatgtaaca tgcatgtttg ttaaatacac atgcatcagg accactttca 149281 tgaatattca tagcttcttc tctcacctgt tgaatatgta tatttagcca acctgttgag 149341 cataaagctc ctaccccaac cctcctcctt caaagtgcct gtctctgatc ttggccaagg 149401 catgtgatat ggtttggctc catgtccaca cccaaatctc ttcttatagc tcccataatt 149461 cccacatctt gtgggaggga cccagtggga gatgactgaa tcatggaggc gggtctttcc 149521 cctgcggttc tgatgatagt gaatgagtct cacgagatct gatggtttta aaaatgggag 149581 ttgccctgca caagctctct ctttgctatt gccctccatg taagatgtga cttgttcctc 149641 cttgccttcc actatgattg tgaggcctcc ccagccatgt ggaactgtaa atccaataag 149701 cctctttctt ctgtaaattg cccagtctca ggcatgtgtt tatagacagc atgaaaacgg 149761 aataatacag cgtgcttccc agcataggat cccaccttgc aggctgtaac ctcgtataag 149821 aaagtctctt cttgtctaaa tttataaatt gtgtggtttt ttttttaaag ttaacaagga 149881 gaaaaaaagc aaaaccaaat actcatcaac taaaagagat atgaaataag aaagccagat 149941 atttatttgc ttccatagta aattcaggaa tttagccccg attaaataga aatcttattt 150001 ctgactccca cattactaga aagtttatta atcttcttta agtcatgatt ataatagtaa 150061 tttcattatt ttaacaatga caaaataaga aacctcaata ataggaattc actccagtga 150121 atcctgtaga aacaggacaa ctctgcaaat catctcaaat catcttcctg ataaagcttt 150181 ggttcacatc acccaggtcc tcagtaaagt agcactgaga aaactagtct tggttgcatc 150241 acttcttttg agaccagagg cacagagaac taattgtccc aagcctccaa taataagtat 150301 tattaataag tagcatatta aatattaggt gttattgagg tttattttat tattattaaa 150361 ataatgaaat tattattata attatgactt aacagaaaga agattaataa actttttagt 150421 aatgcagggg tcagtattga gaaagctggt gtaaatctct tttccccgcc cttccctgag 150481 accttgtcct gggacaagcc tgggggccat gtaagctcct gtgcctgagc ctctggaagt 150541 tccctgctaa cccctaacag accctgcgga ggaggtggtt acttacatcc ccctagactt 150601 ctgccactac cacagtgaag atggcagcag ggtagagctg taccatagtg aggtactcac 150661 ctatggcaca aaatgtacgg gattcactca gtagtcatga taaatattaa actaaaacat 150721 ctaaatgcaa tatttaaaat aaaaatcaat gccaaaaaat ccacagtgaa cagagtacaa 150781 aaattttcaa caaagacggc tcaggattac tgacttttcc ttttgctgta gatgccaata 150841 cagctcctca cgaaatttta tctttacatt ttattttaaa atgtaaaaat attgtataaa 150901 agtctttatt ttgactattg agttggcttc accttgaatg ctgcagccaa ggcgcgtgtc 150961 tcattcccct tatccctaac ctctgtcgtc ctagagttgg gggactgagc ggtgggccag 151021 gacaagcaac ggatgaagta tctcggaagc cggagaaggg tggtctgggg ctcttgggag 151081 catgcgcggg gtggggcggt ggggggactg aggtggcgct cccggctcct ccgcgcgccc 151141 ccggcacctt ccgcgcgtcc ccggctcccc acactccgtc cccggctccc cacactccca 151201 ccccggtctg ccccgggcgc ccccggttcc ccacctcctg gtgtgcctct ggcgggtccg 151261 cgcaggatca ggctacagac tcgcccgcgg gcggctgcgc ggcgggccgt tggggagggt 151321 gttgggatga ggagccccaa ccagatggac gcgcaccccg cagcggcgga ggcggcggcg 151381 aggcttgagc agtgagtgct cgggagcccc ggccagccct tcctccttct gcctgcgtcc 151441 tctgcgctgg ccgctgctcc agacgccagg gggcccaccc gctgcccgtg gtcggcgcgc 151501 ggagctcgaa agcgcgcgcc tctaacaaat gaaaacccat agggacttag ggttgaattt 151561 taactttttt ttttttttgg cttattgttt agtgctagat agaagggttt gagcggtgta 151621 gtcaattttg gtgctagtct gacttgaggc aacgtgacgt gacctacgcg tctcatcagt 151681 aatgcagaaa taacatttta tccacttcat cgggcttttc tttttctttt tcagaaaatt 151741 gaggtaaaat tcaaataaca taaatttcac cttttttacc cttcaaagtg cacagttccc 151801 tgacttttag tatattcacg atgttgtgcg accaacacta ccatctaatc ccattttcat 151861 caccccccaa ataaaccctg tactcattca gttacatccc attctcctct ccccactgcc 151921 tctcccaatt gccaatctac ttttttgttt gtggatttgt aggttctaga catttctttc 151981 tttctttctt tctttttgtc tccagctgca tctacattgc tgcagaggac ataattccat 152041 tcatttttat ggctgtatag tattccatag tgtatatgta ccacgttttc tttatccaat 152101 ccaccattga taggcaccta ggttgattct atgtctttgc tattgtgaat gatgctgcaa 152161 tgaacataca gacgcatgta tcttttagta gaatgattta ttttcctttg cgtatatacc 152221 tagtaatggg aatgccgggt caaatggtag ttctaagtta tttgagaaat ccccaaactg 152281 ctttccacag tttctgaaca aatttacatt cccaccaaca gtgtataagt gttctctttt 152341 ctccacagcc tcaccagcat ctgttttttg actttttaat aattgccatt ctgactgatg 152401 tgagatgata ttccattgtg gtttttattt gcatttccct gatgactggt gatattgagc 152461 attttttcat atattttttg gttactcgta tgtcttcttt tgagaaatgt ttgttccttt 152521 tgcccatgtt ttatttgggt tatttgtttt tggcttgttg aattgtttca gttccttgta 152581 gttttgtata ttagaccctt gtcagatgca tagttgcaaa ataaacttgg ggcttatgat 152641 acttcgcctg attatttaca taaaacacag tgggcatagt gaatggcttt taaaagttgg 152701 ctttgctggg acttttatac taaattttgg attagacttt taaaagtctt gaggctagga 152761 agccaaacca agtatttgct tggctgtatc tgtaatacct gtatgaattg ggttaatttc 152821 tctcttcttg agttcccaaa atatactgag gctcgtggcc ctgccagaaa gtgacattct 152881 ttacttagtg caagcacaga aaccatacaa gggaactgtg tagacaagga accatgccag 152941 actttccaaa gggcttttta tcagcactat aaaattgtaa agctaatctc aattcctcca 153001 agcagtctgg tcatctctga aaatatgcca ttccagccaa agccttgata aaatagccag 153061 tgtatctaat tatgtcctgt tataaaataa aacagatcct tattgaactt atgcaaataa 153121 ctattttgcc ataaattaag aatactcaca gtttccaaat ttgggagaaa tccagtagtg 153181 agaaaggcaa atgcttcaaa tttgcccaca aaggtatatt tacccaattt ttgtaagcta 153241 tgaatagctc aaaagaaaaa aggtttatta actctggaag caaaacataa aaagaatcag 153301 caatgtttca agcaaaaaag ttattaaaaa tcatctttgt cctctatcag tttagtccca 153361 tgtagctaat tattattcca cttgatgttg ggttagcaac cctcatgaat gcatcaggtt 153421 ttttattaga gctctggaag tttttgccca ctccaatggt gtgatctcca aagttatcag 153481 aaacctggat tcaagagtac ttgtcatagt tctttccatg aagttcctta agaagaagcc 153541 aattttggac tgtagctgat tataaaccac tttttgagaa gaatcaaaat aaaacaataa 153601 ttgagaatga caaatctctt agaatagaca tagttaaaga cagaattttt tttttttttt 153661 gagacagagt ctcactctat cgcccaggct ggaatgcaat ggcgtgatct cagctcactg 153721 gaacctccac ctcctgggtt caagcaattc tcttgcctca gcctcccaag tagctgggat 153781 tacaggcgcc ggccaccacg cccagctaat tttttttttt tttttttgta tttttagtag 153841 aggattttgc catgttggcc agcctggtct tgaactcctg acctcaggtg acccacccgc 153901 ctcggcctcc caaattgctg gaattacagg catgagccac cgcacccagc ctaaagacat 153961 aattgacaag gatatttgat tatttctgtg gcatacaatt taacatcatt gtaatgatta 154021 ccgataacat ataccaagac atatcagaat tgtaggaatt tcttacaatt ttggaacata 154081 ctttaataac acttttatgt aaatatgact caaagaaagt caagcaccat ttcttatttg 154141 ccagtgtttc ctatataatt ttaacatatt aaataagcct actatgtctc tcttggacat 154201 ctaggagttc cttttggaag atacttaatt ttaggccggg cgcagtggct cactcctgta 154261 atcccagcac tttgggaggc caaggcgggt ggatcatgag gtcaggagat ggagaccatc 154321 ctggctaaca cggtgaaacc ccatctctac taaaaataca aaaaattagc tgggcgtggt 154381 ggtgggcgcc tgtagtccca gctactaggg aggctgaggc aggagaatgg cgtgaaccca 154441 ggaggcggag tttgcagtga gccaagatca cgccactgca ctccagcctg ggcgacagag 154501 cgagactccg tctcaaaaaa aaaaaaaagg aaaatactta attttagaat ttgaaatttg 154561 atttttggaa gtatgtcaaa tactaaaggc ttaaaacact tcatcaaaat agaatcactg 154621 gtcactgtaa aataatagtc attcatttac tcaaagtgat aattcaaaga tttcaaatag 154681 aaaagccttt actctttgtt agagaggaaa ttgttttcca aacaaccata agacctattt 154741 tttggtagag tttgagaagg tatgatataa cagaaatatg tatttggtct tcatcccttg 154801 ttcctggcac agagctccca cagtccttgg aatttctgga atgaatggaa tatcttttct 154861 tattcaaaac ttgaccatac gtgagtttat gctaataaag tgaccccagg aaggccctta 154921 gatagcttga ggataggtgc tggttgccag aggaatcaac catgtgatga ggattgaaac 154981 tttcagccca catcctcacc cctgatctcc aggagggaag gagacaggag attgagttca 155041 gtcaccaaag gccattgctt taaccaatca tgattacata atgaagtctt gatacaaact 155101 cttgaacaat gagatctgga gagcttctgg gttggtgaac ccatcggtgt gctgggaggg 155161 tggcacactg gagagggcac agaagctctg tgtcccatgc cccttgccct gtgcacttct 155221 tcatttgcct gttcttttgt aataaactgt aattgtaaat gtagcacttt ccttagttct 155281 gtgagttgtt tttagcacat tatcaaacca gaggaacaat tgtgggagtc ctaacatttg 155341 tagttatcca ggcagaagtg caggtaccct gtattcccta tttgtggctg tgtctaaagc 155401 gggtgcaatc ttgtgggatt ggttctttaa cccatggagt ctgtgctaac ttataagctt 155461 aagcaacaaa tgagtcaaat aaaaaatcac aagggaaatt cgaaaatact ttgtagtgct 155521 agaattgaat tcaattgtag gacacccagt tggggtgtca gagaatagga caattggttg 155581 tttatttaaa aaccaaacag gaggaccaat gtaaattatt ctttagatgt ttggtggaat 155641 tcaccattga agccatctta tcccaacctt ttctttgctg agaggttttt agttactaat 155701 taaatatctt attataggtc tattcagata ttttacttgt tcttgagtta gttttgtagt 155761 ttgtgtgttt ctagaaatgt gtccttttca tctaggttat gaattgtgtc cttttcatct 155821 aggttatcaa ttgccctaca tttgttcata gtattctctt gtaatgtttt ttaatttttg 155881 taagattggt agtaatgtct tctcttagtt tcctgatttt aataatttga ggcttctttc 155941 ttattttggt cagtcaatat aaaggtcttt gatttgtgtt gattctttaa agggattgta 156001 tattttccct ggcctgttgt ttttacattc tctatttatc tctgttttag tctttattat 156061 ttcctgcctt ccaccggctt tgggtttggt ttgctcttct ttttccagtt tttaaggtgg 156121 aagactagat tatttatttg aaatagtttt aaatgtaaga ataaatgtga aaaaactaaa 156181 ctgcttttct tataagtttt ggtataccat atttttgttt tcattcatgt caaagtattt 156241 tctaatttcc cttatgattt tttatctgac tcatttgttg cttaagtttg tattgtctaa 156301 tttacacata tttgtgtatt ttcagatttc cttctgttac tgatttctaa ttttgttcca 156361 ttgcagtcat aataagatgc ttcatattat ttcagttttt tataatttat taagacttgt 156421 tttgtggcct aattaatggt ctgtcctgga taattttttt atgtacactt gagaaaaaaa 156481 tgtgtatttt gttattgttg ggtagagtat tttatatata cctttaggtc aggtggttta 156541 ttttgttgtt catatcttat atttgtttga tcaatgtagt tgttctataa ttatttaaag 156601 tggggtattg acatctccaa ctactattgt tgaaagccct taaatctgtc agtttttgct 156661 tcatatattt tagggctgtg ttgttaggac catatatgtt tgttactgtt gtatcttctt 156721 catgaattaa cccatttaat agtatataat gtccttcatc tcttataacc gtttttatct 156781 taaagcctat tttgtctgac attagtagaa tcattccagc tctcttttgg ttaccgtttg 156841 cattgaatat atttttctgt cttttaaact tttatttgtg cctttaaatt taaattgaat 156901 ctgttgtagg tagcatatgg ttggatcatg cttgttaaaa tccattctgc caatctctat 156961 cctttaattg gagcagtaat ctatttatat ttaatgtaat cactgatagg gaaagacata 157021 cttctaccat tttgtcataa gaatgtcata agaatttaca tattaggtgc aaacttaccc 157081 tctacctcct cactcaggtg acaccctctc ttctgtcaaa gtgagagttg aagccagctg 157141 ggctgctggc acaggtgccc ggcttttatt cccttatttg gccctgccca catcctgctg 157201 attggcccat tttacagagc gctgattggc ccatttacag agtgccgatt ggtccatttt 157261 acagagtact gattgccagc tgggcttctg ggtggagtgg ggacttggag aacttttgtg 157321 tctagctaaa ggattgtaaa cacaccaatc agcactctgt aaaattgcac caatcagcac 157381 tctgtgtcta gctaaaggat tgtaaatgga ccaatcagca ctctgtaaaa tggaccaatc 157441 agcactctgt aaaatggacc aatcagcact ctgtaaaatg gaccaatcag tgttctgtaa 157501 aatggaccaa tcagcaggat gtgggtgggg ccaaataagg gaataaaagc tggccaccag 157561 agccagcagt ggcaatgtcg gatccccttc catgctgtgg gaactttttt cttttgctgt 157621 tcacaataag tcttgctgct gctcactgtt tgggtccaca ctacctttgt gaactgtaac 157681 actaaccacg aaggttgtag gcttcattcc ttaagtcagt gagaccacga acccaccggg 157741 agggacaaac aacctggatg tgccaccttt aagagctgta acactcacta caaaggtctg 157801 cagcttctct cctgaggtca gcaagaccac aaacccactg gaaggaagaa actcgggaca 157861 catctgaaca tctgaaggaa caactctgga cacaccatct ttaagaattg taacactcac 157921 cgcaagggtc cgtggcttct ttcttgaagt cagcgagacc aagaacccac gggagggaac 157981 caattctggc cacaaaagca tcttacaaat atgctttttg ctcagactta taatttcatc 158041 tttttttcca gtaatttgaa ctcagtcttg aagtcttaca gtgacagttc agagccttac 158101 ctcttaaaca tgtttcttat ttactcatat atattcataa actgttatac atatttaatt 158161 tagtatttag aggaatctca gcatcaagtt tgtttctctg aggatgtatg tcctttcaac 158221 tgatgatata ttccttatcc tctttccact gttgagtcct aggctacatt tcctatcaca 158281 tctttgcttt ttcattggtt atataaaata aacggtatga gaaataatca gcattggttt 158341 cttgttcgca tgagatgaca ggaagacaga tgatatggac atcagctttc ccaaaaggat 158401 tttccaaagc tgtaattatc aacacaaact atgtttcaag aaatcagggg tagtatactt 158461 tatgctaaaa acggagggtt tgtgagaata atgttacagc ttttaatttt ctctgtttaa 158521 ttttatcaac ttgggagact tttgttatcc tgggacagtt catttcatat aaagcatgct 158581 aacattatct ttttcattgc agattggaat aatactttta aaaactttct gcttgattca 158641 ttcctgctgg tacccattta taaccttggg atgagcatca gcaaatacta tgatatccaa 158701 gaggtgaata tgttgtgttc ctgttgaaaa gtattcctaa ctccataaat tacagtacag 158761 aaagtgagta tctgaggctc tacaaatctc atcagagcta ctggtagtca attacttatt 158821 acatgtgaat aattctacct acattactgt gaatcttgac aataactcaa gttatatgtt 158881 atcctcattt tgaaaaaact aaaccttaca caatgtaaaa acttgctcaa caccacaaaa 158941 tttgtacaag atagcactaa gattcaaagc ttatgtgcat aatgcatttt ctctgtggga 159001 aagaatctct gtgattagta tctatattgt attatacatt aatctcaata gaggggaaag 159061 cacttgaaaa aatttaacac ccttttatga tgaaacttct caacatcttt gcatgtgctt 159121 atttgccaca tgtgtatcat ctttggtgaa gtgaccggtg aacttttctt tcatttattg 159181 ttgaattggt cttttttctt actattgggt tttgagaatt ctttgtacgt tctgaatatc 159241 agccctttgt gtagtaagcc atttgcaaat attttgtccc agactatggc cttttatttt 159301 tttcaaagca tcttttaaga accaaacgtt tttaattgtg gtgaagttaa atttatcatt 159361 ttttagtgaa ttgtgctttt agtgctatat ctaaaaaact ttgcccaata cagagtcaca 159421 aaagcttgat tggatgtctt cttctagaag ttttacttaa gagaaatgaa agcatatgaa 159481 catagaaaca cttgtatgca aacattcata aatagcttta attataaagg ccaaaaattg 159541 aaaataaccc aatgtccatc aacatgtgaa tgggtaatca aactgtgaat tattcataca 159601 atggaatacc ctcagcaata aaaatgaatg atctaccaaa aaagagactt ttcagcaaag 159661 gaggactaga atgaaacatc ttcactttga gtaaggaaat ctagaaatat ctattgctaa 159721 tatcattttc aaggcaatac cattgtggta gcatgaaata cataaaacat ttagggaaaa 159781 ttttaaaaaa gatgtgcagt acctgtacac tgaagactga aaacattgct gagaaaaatt 159841 aaagatgact tgagataatg gagaaatata ccttattcat ggatcagaag actcaatatt 159901 tttaaattgt caattgtccc caaattgatt tatacgctca aaacaatctc aatcaacatc 159961 ctagaggctt tttatttttg gtagaaattt agctgattct aaaatttata tggaaattca 160021 aaagaacttc attatccaaa aatatatatt ttcttaattg caagattttg tgacaactca 160081 ttttaaaact tattataaaa ctacagtaat taagacaatg tggtattaac ataagtagag 160141 acatagatga cagatggatg gatggataga tagatagata gaataagata cagagttcaa 160201 aaatagattc acacatgtat ggccaattga ttttggattc aaaagaactt tattatccca 160261 aactatactt ttttaattgc aagactttgc accaacttat ttgaaacctt attataaaac 160321 tacgataatt aagacagtgt ggtattagca tgagtagaga cagagaagat aaatgcatgg 160381 atggatggat ggatggattg atggatgaac ggatggatgg atgaatgaat agatggatag 160441 gtaagtagat agaagataga tagatagata gatagataga tagaataaga tacagagtcc 160501 aaaaatagat tcatacatgt atagtcagtt gatttcagat gaggtgccaa agtaattcaa 160561 atgagaaagg atagtcattt caattagtgg tcatggccct aacctgtaaa agcttgtgat 160621 tatttgatac ctagcaaaca gacaaataaa atccctataa cctcttaact ctatcccagt 160681 aggaagaagg ttgtgaaaac tgtgcattct aaaagacagc gtagattgaa acacccactt 160741 tggatgcacc aagggaaaag caaatctgag tgctacagga gacagtgaac aagggaatct 160801 ctcatagaga gcaggtgagg attctgcaga aaagagaata cggaacacag tggcaacagc 160861 ggtagatcca tcatgagctg ggaaatactg aaaaaaatct ctgagaacta gacctagtca 160921 atcaaaatcc tggttatagg gggaaataaa taaagaggct tgaaaatgtg caaagctata 160981 aatgttaatc ttgagagata ttaatttgga ggttagagag aagcaactta gaaaatagaa 161041 gtgcccttta gaattagggt agcaaagaga aataaggaca tgtccagcat aatttaactt 161101 catacagata ttagcaagaa tcacagaatt gagtcgcttc attcccttag aaaaagttct 161161 atttgttaaa gaaactgcac ttcactgtat taatagaaga agatgaatat gtactaagaa 161221 gcttacaaac taatcaaaat attaccattc atccattaaa tcacttgtta ttgttggttc 161281 agataatcaa tttaatatgc cataagaagg aaacagaaga atatcaaaat acttcaattg 161341 atgaaaattc ctgtccctca cctgccaaag gaaactaacc acaaagcaga gggaaatagt 161401 aactaagtat caatatacgg gaaactggag aacaatttgt cagcaaaaag gtagaatacg 161461 cataacttct cattaccatc tcacaaaagc ttcaaaaatt agcagaaact gtctgaacca 161521 attttgtcag gactatggaa aacaatcaaa agtttatagc aaccaaatga atactgaacc 161581 aataaaaaag tcacttcaaa acagtggata gttttgtgat gattttacac gcccttgccc 161641 ctctccctcc ctggcacagg agtggtcgtg gtcttgaagc aggagtagtc tgcagtccca 161701 gttttggacc gttttccctg gctctggagg gtgcggagca gaatttattc gcaaattatt 161761 attatttttt taaatttact ttaagttccg ggatacatgt gcagaatgtg caggtttatt 161821 acataggtat acatgtgcca tggtggtttg ctgcacctat caacctatca tctaggtttt 161881 aagccccatg tgcattaggt atttgtccta atgctctccc tctccttccc cgccaccccc 161941 tgacaggtct cagtgtctga tgttcccctc cctgtgccca tgtgttctca ctgttcaact 162001 cccacttatg agtgagaaca tgcagtgttc ggttttctgt tcctgtgtta gtttgctgag 162061 aatgttggtt ttcagcttca tccatgtccc tgcaaaggac atgatctcat tccttttttt 162121 ttttttttct tttttttttg agacagtctt gctctgttgc ccatgctgga gtacagtggc 162181 gcgatctcga ctcactgcaa cctccacctc ctgggttcac accattcttc tgcctcagcc 162241 tcccaagtag ctgggactac aggcacacac caccacgccc agctaatttt tttgtatttt 162301 tagtagagac agggtttcac catgttggcc aggatggtct cgatctcctg acctcgcaat 162361 ccacccacct cagcctccca aagtgctgag attataggcg tgaaccacca caccgtgctg 162421 atttcattct ttttatggct gcatagtatt ccatggtata tatgcaccac attttcttta 162481 tccagtctat cactgatgga catttgggtt ggttccacgt ctttgctatt gcaaatagtg 162541 ctgcgataaa catacatgtg catgtgtctt tacagtagaa tgatgtgtat tcctttgggt 162601 gtatgcccag taatgagata gcagggtcaa atggcatttc tggttctaga tccttgagga 162661 atcaccacac tgtcttccac agtggttgaa cgaatttaaa ttctcaccaa cagcgtaaaa 162721 gtgttttttc ctatttctcc acagccttgc cagcatctgt ggtttcttga ctttttaata 162781 atctccattc tgactggcat gaaatggtat ctcattatgg ttttgatttg catttctcta 162841 atgatcagtg atgttgagct tttttttatg tgtttgttgg cttcataaat gtcttctttt 162901 tagaagtgtc tgtcatatcc cttgctcact ttttgatggg gttttggaat aggtagttct 162961 ttttatgtca gttttgtatg ttgtgctttt caaggaattc gtccatttca cttaaattgt 163021 taactatagt aatacaatgg tataaagtta tttgtaatat caaaaaagaa aattaggcca 163081 gatgtggtgg ttcacgtcta taatctcagc actttgggag gccgaggtgg gcagatcacc 163141 tgaggtcagg agtttgagac cagcctggcc aacatggtga aactctgtct ttactaaaaa 163201 tacaaaatta gctgggtgtg gtggcacatg cctatagtcc cagctacttg agaggctgag 163261 ggaggagaat cgcttgaacc caggagatgg aggctgcagt gagccgagat catgccactg 163321 cacttcagcc tgggcaagac agatcaagac tccgtctaaa aaaaaaaaaa gaaagttatt 163381 ttgttgcacc ccatctccta tatatgaaac tatttccaaa accataagaa aattcatctc 163441 actaaaacat gaacaaatta aaagattata attgaactcc atagaaaaat atcataagta 163501 aaagaccatt ttccaccatg tccagctaat ttttgtattt ttagtagaga cagggtttca 163561 ccatattggt caggctggtc tccaactctt gacctcgagt gatccacccg ccttggcctc 163621 ccaaagtgtt gggattaaac gtgtgagtca ctgcacccag ctggagtttt ttaagtaaac 163681 atttaagaaa cacttacttt tcaagagcta atagtgtgga aagtgtaagg gaaatggaga 163741 ttggaaaaga gaaaaataga aaaacaatca aatggaaata tagatgagaa cttgtcaggg 163801 agaaataagg caatatcaat tcaaaacaat tatgaaagca gtccacaaag ataaaaagca 163861 aaaccacaga acaaatacta aaacctatca ttcaagaaaa tttcctgaaa taaaagaaca 163921 caaatctact cattgaaagg gcaaccagta tacctgaaaa aactgacccc taatcactaa 163981 caccaaggaa tattctaata aactacttca ctttaataat aaaatggggc agggactttt 164041 atatatgtag acaaaaaaga caagcatttt ataatagaaa aagtcagatc gagtgcagat 164101 tttgtaacaa gaaggtctta tgccaggaga gcaagtcaga tgtttaagat actcactgag 164161 gccgggggct cacgcctgta atcccagcac tttgggaggc tgaggcgggc ggatcatgag 164221 gtcaggagat cgagaccatc ctggctaaca cggtgaaacc ccgtctctac taaaaataca 164281 aaaattagcc gggcatggtg gcagacactt gtagtcctag ctactcagga ggctgaggca 164341 ggagaatggt gtgaacctgg gaggtggagc ttgcagtgag ccgagatcgc gccaccgact 164401 ccagcctggg cgacagagca agactccgtc tctaaaaata aaaatgaaaa ataaataaag 164461 atattcaatg aaagaaaaac acaagcaagg tttttacctc cagtcgaagt gaccttcaag 164521 attaaaagtc acaaaaggcc aggcgcggtg gctcacgcct gtaatcccag cactttggga 164581 ggccaaggca ggtgatcacc tgaggtcagg agttcgagac cagcccggtc aacatggtga 164641 aaccccatct ctactaaaaa tacgaaaaat tagccgggca tggtggcaca tgcatgtaat 164701 ccaagctact cgggaggctg aggcatgacc attgcttgat ccgggaggca gaggttacag 164761 agaactgaga tctcaccatc gcactccagc ctgggcaaca agagcaaaac tccatctaaa 164821 aaaaagagaa aagagtcaca aagactcatg aacacaagga atgctgtttc tattagctct 164881 tcctgaagaa tatacaaatg aaaaaaattc ccattaacta aaaattgatt ggcagagctt 164941 gtgtgtaaga actaatggtg ggcattgact atatgtgcct ctaaaatttg ggctaaatga 165001 aaggtatctg tgtgataaaa tataacataa ttgatacata ttttaaaaat gcataaatgt 165061 ttctgtatca tgaatataac aataaataga accaaccaga gtggggatat gtgccatgtg 165121 agtgtcttat gcatgtatgt agatatgtgt atttccattg acatgtaatt acagaacagt 165181 gtgggagaac ttctatagaa catattggtc ctctcagtaa acatagaaag gcttaattct 165241 ctgactagat gaaaaaaata tttttcagat tggctaacaa gcaaaatcca actctgcggt 165301 atgtaagaaa cacaactaaa gcaaactgta tcacagaggt taaaaataaa agaatgatct 165361 ttatgttctc atgctagacc aatacttggc ctttagcaat gcattaaaaa atgtttaaat 165421 cttcttacca attgatatat cattttcagt gtctgccttg tataagtaga tgcttgtcct 165481 gttgctccct gcaggcacct gtctctccca tgatgctgag ttaggtgttt gtccagtatt 165541 tgcccaatca gagaactgat aggttaaata aaagctatta atttgcagtt tgtccagcct 165601 ttccttatgt gaagagtgaa agggatattc accagttctg tgtacctcca agggaaagct 165661 aaagttggct taaccaattt tacaaaaagc aatgacccaa aaattctacc tttaggtata 165721 tacccgagat acttgcatcc acttctccca agagatacgt aaaataatat tcaaataagt 165781 tatattcaca aaagtcaaac actaaaaaca acccgcatgt ctctattagt aaacaaatgg 165841 gaaatataca acattcatat accagattac tgtgcagaaa ggaaaatgat gaattataat 165901 tacatgcaac atgagtgaag tttccaaaca taatattgag tgaagcagaa agaaaggaat 165961 gaatactgga tgattccatt catataaagt tcagaagtag aaaaattgaa atcacaattt 166021 taggggtgca tgtgtagata aaatgataaa aagggaactg ttagggaatt gctcaaataa 166081 caataaaagc aatggatact ttttatagag gacagaagct gggaaatagc ccatgggaat 166141 gtatgggtta ttattacttg acctggatag ggtggttttg tgtgtatttg ctttaaataa 166201 ttcattaagc tcgatgtttg aaatatctca tatttatttg tgtgatcatg aaggcaatac 166261 atgctttaag actaacatta agaaaaatcc cacactgtta caaacatgtg ttttttcaaa 166321 catttttaat cccagttaat ttcatttgct ataatctaca aatatgcaat aaaagtgtaa 166381 atacaaatgt ttattctaaa ttaggaatag acaccaccct ctttcctcct actccaccag 166441 tggatcatta ctcacctgta taagaaacta gaacaagtgg cctgactttg gtgatttata 166501 tgtgatttcc tgtttcacac gtgaggctat tcttactatg attattctta ttatgattaa 166561 tccttattgt tattaatctt gttcttattg tgattattcc tgttgtcctc cccaccaaac 166621 acactgtacc tgcccacaca ctgccgggag gaaaacatcc tggttcagat tctcagcact 166681 gcgtttagga ataaagtctc gtagcaacat taggctttta gttttttttt ttaatttttt 166741 ttttactttt tgaaacagag tctcactctg tcacccaggt tggagttgag tggcgtgatc 166801 tcagctcact gcaacctctg cttcttgggt tgaagcgatt ctcctgtctc agcctcccga 166861 gtagcttgca ttacaggagt gcaccaccac tgctcggcta atttttctat ttttagtaga 166921 gacagggttt cgccatgttg gccaggctgg tcccaaactc ctgacctcag gtgatccgct 166981 cgccttggcc tcccgaagtg ctgggattac aggcgtgagc cactgcacct gaccacatta 167041 gacttttaga aggctcaggt gttttacatt gggcaggtga actggctaat atttgtaccc 167101 tcaatcattg ctatgaactc cacataataa gactagttca tgaggaataa atctgccgaa 167161 accattaact atatcatgga taccaaagag attgggttag caggtcatat tgccttttgt 167221 attaacctcc tctatccacc ccttcaagta tatagttaag taaatgtagc aatattttgg 167281 tgatttcaaa ttaagtgtgt aagattattc accggtccta ttatcctttc ttgagtagtg 167341 tcttaaaaca caagaaccaa gggatcctta actgctacat tcccagcaag gactggggaa 167401 aggtgatggt agaaatcttg cccttgtaac attgcatgtt aagtaagaga gcactaatgg 167461 tggggagtat ttgggattcg ggcatttgga atgggagcct ggcattgttg ggaggagcag 167521 cagcgttagg ttactttcca aacaccccta gagtttaatt gccttgattt cttcccctaa 167581 atccgactat tagtccatgt ccatcaagag atggcctcct ctcactttct caattctctg 167641 ttcttccgcc ttctcacctc acccttctgt gccttttgtc ctccaacata cacttgatag 167701 gagtggggga gtggaggaga ggctcatctg agttgggggc tgaaggggag attgcaatgg 167761 taactgtgtt tttattgctg ttgttaaagg tgtttggcat tttggagcaa gctaaagagc 167821 aatttgattt agaagactat tctgtcagat cacactggaa caagtcttcc tgacctttgc 167881 taacccagag aaatcatcca gtgatgatga aaacgaggtg ccatgagatt ccctcttaaa 167941 aaaaaaaaaa aaacagaaaa agaaaaaaaa aagaaatgcc ctacacgtga gtcccaataa 168001 actcatctac tcatcaagct ggactggtct gagtcattct ttggtctgtt ggctcctttc 168061 ccagtttggg gcggaggatg ttctatacag tcctggattt ttccccaacc aggaagagaa 168121 ctccttgtcc tgcagtgttc ctcttgcgcc ctctcctgac aaagcttaac ttcggagctg 168181 gagcaagtcg tatccggcat tgcagggaag agtgaatttg gcctgtgatc caatcagaag 168241 ctgcgattct aaacaggaag ccacactgga tgcgagaatt ggggtggggc gcgccaagag 168301 gagcaagcat tatagaacgt ggggagcatg agaaatacac ggaggtggaa acgccggagt 168361 ggctggcggg taaaggcagc gggcgcagat gaagcgggct gggcgtccca cgcgcagaac 168421 cgtcccggac agaagccgca gggctgcgct ggctggaaaa aggaacgcga gtacagcgcg 168481 cgtggcgcgg ggtctgctcc aggacggaat cttttgggtg gcccgcatga ggggtttgca 168541 ggaccccggg cctttgggaa gttatctgct aaactccagt agaccctgag gagcagcggc 168601 tcatgaatct tcttaaactt ctgtcatcag cggctgggcc agctgaaggt gaccatggca 168661 cacgagggag agagaagccc gcgagaggcg gagaaatgtg gggtcgtcca ggagggtcga 168721 caaggcaaag aacctgaaga cgacccaaaa gggtacctag gtggggccct ttcagggact 168781 tggggcatag ggtagggcgc atgggacgag gtgggtgagc gcaagggacg agatgggtgt 168841 gcgcatggga cgaggtgggt gggagcatga gacgaggtgg gtggggcggt gggtgaggtc 168901 tccgccccca gacgggctgg cgaggaagca gggaagaagt aacgttgggc tggtgaggca 168961 acaggtgggg cgcactggag ctgcgggata ataggtggaa caaactgggg actacactcg 169021 tgggcgcacg tgcggaagag ggactgagga ggttcttgcg tgctcccctc gagcaccgcc 169081 gacagcttca cccgcacctc ctgccttccg caccgctgac tcctaccgct ccgcgcactg 169141 cgcgccccca gccctagtgc agccagctcc cggccgggtc cgcgcgaggg ccaggctgcc 169201 aacctgcccg cgggcggctg gtggttgggg agggcgttgg gaggaggagt cccgccgggt 169261 ggacgcgcgc ccttcagcgg cggaggcgga ggcggcagcg gcgaggcctg agggtgcgtg 169321 ctagggagtc ctggcgcgtc cttcttctgc cggcgtcccc ctgcgcttgc agctgctccc 169381 cgacgcccgg gaggcccacc cgctagccgc gatgggcgcg cagagcccaa aagggcgacc 169441 cccaaacaaa actcacgcat ataaatcctc cagagacttc ctcatccccc cgccaacaca 169501 cacacaccca cacacacata cacacaatca aatcaaatag aagcactgtt aggaatttta 169561 tattggctat gcaaagtgct tgaacgcact gactttggag gcaaatgcct tgtccctagt 169621 cctgactagt ccagaattta tttgctgtga cctatggcaa attgcttggt ttctctaagc 169681 ctccatttcc taaggcttat atctgaaaat aactataaga ggaaaggatg ctattgtatc 169741 caactcagag ggcagtcctg atacttacat tgcgagaaca tgttaagtgt tccataaatg 169801 gcagaagggg ctgtggaagt tcagtgatta taggtagttt aatttgtctt agttttcctt 169861 tgtgaaagca taaattacac cagagaaagt cacatatgta aggaatactt tatgcaaagc 169921 tgttgtccta gggaagagag accagaactc agtctaaact aactctgctg aaacaaagtg 169981 gggcagggtt tttaagctct aggatgagag tagaaaggtg ctggagggct gtcggaagca 170041 tactgagttg tttgctgagt ttacaagtgt ttcctccgtg attaggccag ctgtgtttgc 170101 taattggctc tcagggaagt taggctccta ccctcccaca gaaactggga gataggggga 170161 ccgtcttcct tgatgattgc atttcaaagg aatggctccc aggtccttga caaagatgat 170221 tctggtttgt aaaactagta agaggctatt aaaaagagtt acagatatct caagacagag 170281 aaagaattta cagtgaaaag ctttctaaag aaaatgctct gtattaatct gttttcacac 170341 tgctgataaa gacataccca agactgggca atttacaaaa gaaagaggtt tattggactt 170401 acagttccac gtggctgagg aggcctcaca atcacggcag aaggtgaaag gtatatctca 170461 catagtggca gacaagaaag cttgtgcagg gaaactcccc atttgaaaac catcagttct 170521 tatgagactt actcactatc atgagaagag cacgggaaca acccatcccc atgattcagt 170581 catcactcac agggtccctc ccataacacg tgggaattat gggagctaca agatgagatt 170641 tgggtgtgga cacagagtca aaccatatca tgctctaaga aaagaaagtt ttgggcctag 170701 agtcaggaaa aagccttttc aagtcaagct gagaacttta aggtggtctt ggctacctca 170761 ttttatgcaa agtaggcaga aaatttaaaa gaaaagaaac atatttaagt acaattaaga 170821 aaattacagt gtttcagtta aagcagttca gtgcttttta gaaaaggagc tgaaattaaa 170881 tagtaagtat aaatttaatt ctaataattc agggctatga accatgtaat tgcttaattt 170941 ccaataacta tttgttcttt tgcgactaca gtttggcagg ttgccctaat aattttcagg 171001 cattttaggt tcatgctcct tccaaccctc cttttccctt acaagggttg catccaagtt 171061 atgagggata agggtggggg gtgtttcata tttaggacca ggatgttgat gtgtacttat 171121 tcctggccag gctgttttgg agtcaggact cttgcgctat aagatccccc aaatctattt 171181 tgggaatagt gaaagggatg ggatgatgct gaggagccgg gcttcggcat ttccttattg 171241 tgcattattt ccaatccgtt gttttaccct ggaaagggtc aactgaaaga actggccatt 171301 gtctttgtag gtattaagta ttgctctgtc ttccccgaga ccaaaaaata aaagattgaa 171361 tgagggatgg tactgaattg aaaacttatg gttaatattc caaaacatta cccactaaaa 171421 gagcaagaga aaaaagttag tgggagagtg tttcagatga ttccttcttc acaggtttac 171481 ctgttccccc acctgaagag ttaattagaa aaacaataca ggtacattac ataaaattca 171541 aactctatag acatacaatt tgtaaaatgt gtaagtcccc cattttccac ctcctgagga 171601 catactatca acagatcttc taccattttt cttttctgta catgtacttc ttttctctac 171661 atatatattt ttccaaacaa tgtgtttata tcatgtataa tattttgcag tttacttttt 171721 ctaccttagt agtatattat cctctcatga cagtgtatct atatctatcc cttctttgta 171781 acagaggaac aggattttat tgaaggtatt ggcatgattt atgtaatcat tctttctgtt 171841 ttcatttgat cttactgttt tccagtgtta tcatgaacat ctttctgcct gaatctttgt 171901 tttgctatgg ttttgcttta tgaactttgt ttctaggttt gtttgctttt tttttaaata 171961 gcctttaaga cttggatctg ctgcataaat tcctttaaag tttgcagaca tgtattgagt 172021 gtattgatca gctgcactaa ggagtaagca cctgatgtga agcccccttc catgaaaaat 172081 cctttaagaa tggcatagtt gtgaagttgg aaggctgtat gaatcacaaa ttgattttac 172141 cccatcacca catcagccaa acacctgctc tgccattttg attaagattc gtcatgtatt 172201 tcatgccagt aatcccagca ctttgggaag ccaaggcagt tggatcacct gaggtcagga 172261 gtttgagacc agcctcaaca acatggttaa accccatctc tactaaaaac acaaaaaatt 172321 agccaggcgt gatggcacgc acctgtaatc ccagctactt gggaggctga ggcaggacaa 172381 tcgcttgaac ccgggaggtg aaggttgcag tgagacgaga tcgcgccatt gcactccagc 172441 ctgggcaaca agagcaaaac tccatctcaa aaaaaaaaaa aaattaatca tgtattaatt 172501 gagatttgca gggccgtgtg ctttatataa aacaatgcac aatgcactaa gctccatggt 172561 gtttgagaga caaaggagaa caagatacca tccccaccac tgggtagttt aagattattg 172621 tctgtatcca gtatatagca attgtaatga tttctaaaat gtttttctat tgagatacag 172681 tttacatgaa atgaagtgct cagattttga gtgacagttt gatcagtttc aaagaatgtg 172741 cacacctaat caagacacag aacatttctg taacaccaga aaattccctc ttgccctttc 172801 ctaatcaatc acaaacccct cccctataac cattgatgtc atttttgcct atacctttat 172861 atgaatggaa tcatgcagaa gatactattt tgggtatctt actttgtctt ccttgtaccc 172921 ctttgtgaaa tcctcccatg cctagacctc gacaactgct ggtctgtttt ctgcccctgt 172981 agttttgtat gaattcttta cgtattctgt ttgcccaggc tggagtgcag tggtgtgatc 173041 tcagctcact gcaacctctg cctcccaggt tcaagcaatc ctcccacccc agcctcctga 173101 gtagctaagg ttacaggcat gtgccaccac acctggctaa ttttttttgt attttttatc 173161 agatatgtat tgtgaataaa tacctattgt gaataaaatt ctttattaga tatgtattat 173221 gaatattttc tttctccact tgcatttctg tttccttagc agtatcttta caagagtaca 173281 tttttttttt tgtattttaa tggagtttaa tttaccaatt tttcctttat ggctcatacc 173341 ttgggtatgc aaagtaagaa atccctaccc ccttgaaggt aacagagatt ttctcctaag 173401 tttcttcatt ttattcattc ctcctgtttt cttttaaatt tgtactttta gcttccttca 173461 tttaggtctg tgacccatct cgagttaatt tcttgttcct agtatgaggt aaagttggtg 173521 ctattgtttt tcccccataa ggatattcag ttattccagc actgtttttt ttaaagactt 173581 ctctttgttc ccttggtact tttgtgggaa aaaaaaattg accatcttag tgtgggtcta 173641 tttctggact ccgttcagtt tcattgatgt atgtatctgt gtcttcacca ataccacaca 173701 gttttaatta cagtaatttt atatatcttg aaatcagata gtataagacc tctagttttg 173761 ttcttttcta aagttgtttt tgactattcc agcactagaa tttctacaca gttgcttatt 173821 ggaactttga tgggattgta tttattttca taaaggctca cctacaatgt gttgtaaagc 173881 tt // LOCUS HSAF001903 1719 bp mRNA PRI 12-MAY-1997 DEFINITION Human 3-hydroxyacyl-CoA dehydrogenase, isoform 2 mRNA, complete cds. ACCESSION AF001903 NID g2078328 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1719) AUTHORS Samuel,S.J. and Jung,C.Y. TITLE Cloning of a novel isoform of 3-hydroxyacyl-CoA dehydrogenase JOURNAL Unpublished REFERENCE 2 (bases 1 to 1719) AUTHORS Shi,Y., Samuel,S.J., Lee,W., Yu,C.H., Zhang,W., Lachaal,M. and Jung,C.Y. TITLE Direct Submission JOURNAL Submitted (18-NOV-1996) Biophysics Laboratory, Veterans' Administration Medical Center, 3495, Bailey Ave., Buffalo, NY 14215, USA FEATURES Location/Qualifiers source 1..1719 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="skeletal muscle" CDS 43..1215 /codon_start=1 /product="3-hydroxyacyl-CoA dehydrogenase, isoform 2" /db_xref="PID:g2078329" /translation="MGRAGLEAPPPPCGVTGTPGARGLQGRVGPRPQSLAFRGCLPRA SSLPGSPRCRRRCHTMAFVTRQFMRSVSSSSTASASAKKIIVKHVTVIGGGLMGAGIA QVAAATGHTVVLVDQTEDILAKSKKGIEESLRKVAKKKFAENPKAGDEFVEKTLSTIA TSTDAASVVHSTDLVVEAIVENLKVKNELFKRLDKFAAEHTIFASNTSSLQITSIANA TTRQDRFAGLHFFNPVPVMKLVEVIKTPMTSQKTFESLVDFSKALGKHPVSCKDTPGF IVNRLLVPYLMEAIRLYERDFQTCGDSNSGLGFSLKGDASKEDIDTAMKLGAGYPMGP FELLDYVGLDTTKFIVDGWHEMDAENPLHQPSPSLNKLVAENKFGKKTGEGFYKYK" BASE COUNT 425 a 445 c 432 g 417 t ORIGIN 1 cgtgtatacc cgctcaacgc tgggacgtta cagccagggc caatgggcag agcgggactc 61 gaggccccgc ccccgccttg tggcgtcacg gggacgccgg gggcgcgcgg gctgcagggc 121 cgcgtaggtc cccgccccca gagtctggct ttccgcggct gcctgcctcg cgcgtcttcc 181 ctgcccgggt ctcctcgctg tcgccgccgc tgccacacca tggccttcgt caccaggcag 241 ttcatgcgtt ccgtgtcctc ctcgtccacc gcctcggcct cggccaagaa gataatcgtc 301 aagcacgtga cggtcatcgg cggcgggctg atgggcgccg gcattgccca ggttgctgca 361 gcaactggtc acacagtagt gttggtagac cagacagagg acatcctggc aaaatccaaa 421 aagggaattg aggaaagcct taggaaagtg gcaaagaaga agtttgcaga aaaccctaag 481 gccggcgatg aatttgtgga gaagaccctg agcaccatag cgaccagcac ggatgcagcc 541 tccgttgtcc acagcacaga cttggtggtg gaagccatcg tggagaatct gaaggtgaaa 601 aacgagctct tcaaaaggct ggacaagttt gctgctgaac atacaatctt tgccagcaac 661 acttcctcct tgcagattac aagcatagct aatgccacca ccagacaaga ccgattcgct 721 ggcctccatt tcttcaaccc agtgcctgtc atgaaacttg tggaggtcat taaaacacca 781 atgaccagcc agaagacatt tgaatctttg gtagacttta gcaaagccct aggaaagcat 841 cctgtttctt gcaaggacac tcctgggttt attgtgaacc gcctcctggt tccatacctc 901 atggaagcaa tcaggctgta tgaacgagac ttccaaacgt gtggtgattc taactcgggt 961 ttgggctttt ctttaaaagg tgacgcatcc aaagaagaca ttgacactgc tatgaaatta 1021 ggagccggtt accccatggg cccatttgag cttctagatt atgtcggact ggatactacg 1081 aagttcatcg tggatgggtg gcatgaaatg gatgcagaga acccattaca tcagcccagc 1141 ccatccttaa ataagctggt agcagagaac aagttcggca agaagactgg agaaggattt 1201 tacaaataca agtgatgtgc agcttctccg gctctgagaa gaacacctga gagcgctttc 1261 cagccagtgc cccgagtgcc tgtgggaatg ctctttggtc agacattccc tcacacagta 1321 cagtttaata aatgtgcatt ttgattgtaa tctatcgaag tgattattac accagttaca 1381 gcagtaatag attctccatt aagaaataat tccctttttt agtctgttca tttctgtgta 1441 ttttctaaac agctttacac ccttggtgcc ttggagcaaa catgtttttt gaaccttgtc 1501 atttttgtga agaattgcct agattccttc tctcatcaac gggaaagtac ttcctctgag 1561 agtgcgagtg caccatgctc actgttgctg cgtgggagag tcacaagcca ctggcaagca 1621 agtggtatag tctgtgaagc actgcagcga gcagcacctg gatcttgcct ttataagaac 1681 attttactac ctgcagcttt gagtcttgcc ctacatttt // LOCUS HSAFX 3171 bp RNA PRI 05-JUN-1997 DEFINITION H.sapiens mRNA for AFX protein. ACCESSION X93996 NID g1418758 KEYWORDS AFX gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3171) AUTHORS Borkhardt,A., Repp,R., Haas,O.A., Leis,T., Harbott,J., Kreuder,J., Hammermann,J., Henn,T. and Lampert,F. TITLE Cloning and characterization of AFX, the gene that fuses to MLL in acute leukemias with a t(X;11)(q13;q23) JOURNAL Oncogene 14 (2), 195-202 (1997) MEDLINE 97163401 REFERENCE 2 (bases 1 to 3171) AUTHORS Borkhardt,A. TITLE Direct Submission JOURNAL Submitted (04-DEC-1995) A. Borkhardt, Universitaets-Kinderlinik, Feulgenstr. 12, Giessen, D-35392, FRG FEATURES Location/Qualifiers source 1..3171 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /chromosome="X" mRNA 1..3171 gene 183..1688 /gene="AFX" CDS 183..1688 /gene="AFX" /codon_start=1 /db_xref="PID:e218401" /db_xref="PID:g1418759" /db_xref="SWISS-PROT:P98177" /translation="MRIQPQKAAAIIDLDPDFEPQSRPRSCTWPLPRPEIANQPSEPP EVEPDLGEKVHTEGRSEPILLPSRLSEPAGGPQPGILGAVTGPRKGGSRRNAWGNQSY AEFISQAIESAPEKRLTLAQIYEWMVRTVPYFKDKGDSNSSAGWKNSIRHNLSLHSKF IKVHNEATGKSSWWMLNPEGGKSGKAPRRRAASMDSSSKLLRGRSKAPKKKPSVLPAP PEGATPTSPVGHFAKWSGSPCSRNREEADMWTTFRPRSSSNASSVSTRLSPLRPESEV LAEEIPASVSSYAGGVPPTLNEGLELLDGLNLTSSHSLLSRSGLSGFSLQHPGVTGPL HTYSSSLFSPAEGPLSAGEGCFSSSQALEALLTSDTPPPPADVLMTQVDPILSQAPTL LLLGGLPSSSKLATGVGLCPKPLEARGPSSLVPTLSMIAPPPVMASAPIPKALGTPVL TPPTEAASQDRMPQDLDLDMYMENLECDMDNIISDLMDEGEGLDFNFEPDP" BASE COUNT 742 a 855 c 909 g 665 t ORIGIN 1 gggacagctt agggactatc gtcctgggac tagggggaag ttcgcgactt tctgaagact 61 ggcaggaatg tgcctcctgg ccctcgatgc ttcccccctg aggggaggca tcgtgaggga 121 ctgtggcagg cttcactgaa cgctgagccg gggaggtcca actccacgta tggatccggg 181 gaatgagaat tcagccacag aaggccgccg cgatcataga cctagatccc gacttcgaac 241 cccagagccg tccccgctcc tgcacctggc cccttccccg accagagatc gctaaccagc 301 cgtccgagcc gcccgaggtg gagccagatc tgggggaaaa ggtacacacg gaggggcgct 361 cagagccgat cctgttgccc tctcggctct cagagccggc cgggggcccc cagcccggaa 421 tcctgggggc tgtaacaggt cctcggaagg gaggctcccg ccggaatgcc tggggaaatc 481 agtcatatgc agaattcatc agccaggcca ttgaaagcgc cccggagaag cgactgacac 541 ttgcccagat ttacgagtgg atggtccgta ctgtacccta cttcaaggac aagggtgaca 601 gcaacagctc agcaggatgg aagaactcga tccgccacaa cctgtccctg cacagcaagt 661 tcatcaaggt tcacaacgag gccaccggca aaagctcttg gtggatgctg aaccctgagg 721 gaggcaagag cggcaaagcc ccccgccgcc gggccgcctc catggatagc agcagcaagc 781 tgctccgggg ccgcagtaaa gcccccaaga agaaaccatc tgtgctgcca gctccacccg 841 aaggtgccac tccaacgagc cctgtcggcc actttgccaa gtggtcaggc agcccttgct 901 ctcgaaaccg tgaagaagcc gatatgtgga ccaccttccg tccacgaagc agttcaaatg 961 ccagcagtgt cagcacccgg ctgtccccct tgaggccaga gtctgaggtg ctggcggagg 1021 aaataccagc ttcagtcagc agttatgcag ggggtgtccc tcccaccctc aatgaaggtc 1081 tagagctgtt agatgggctc aatctcacct cttcccattc cctgctatct cggagtggtc 1141 tctctggctt ctctttgcag catcctgggg ttaccggccc cttacacacc tacagcagct 1201 cccttttcag cccagcagag gggcccctgt cagcaggaga agggtgcttc tccagctccc 1261 aggctctgga ggccctgctc acctctgata cgccaccacc ccctgctgac gtcctcatga 1321 cccaggtaga tcccattctg tcccaggctc cgactcttct gttgctgggg gggcttcctt 1381 cctccagtaa gctggccacg ggcgtcggcc tgtgtcccaa gcccctagag gctcgaggcc 1441 ccagcagtct ggttcccacc ctttctatga tagcaccacc tccagtcatg gcaagtgccc 1501 ccatccccaa ggctctgggg actcctgtgc tcacaccccc tactgaagct gcaagccaag 1561 acagaatgcc tcaggatcta gatcttgata tgtatatgga gaacctggag tgtgacatgg 1621 ataacatcat cagtgacctc atggatgagg gcgagggact ggacttcaac tttgagccag 1681 atccctgagt catgcctgga agctttgtcc cctgcttcag atgtggagcc aggcgtgttc 1741 atatctactc tttacccttg agccctcccc aggaatttgg gaccctgctt tagagctagg 1801 gtggggtctg gtcacacaca ggtgttgaag aaattataaa gataaagctg ccccatctgg 1861 ggacgatatg gggagggaga tgggagggga aaggggagag ggtttttctc actgtgccaa 1921 ttagggggta aggccccctc tcaggagcca tcatcggctt tccccattcc tacccactta 1981 ggctttgtag caagatgagc aatgctgttg gaaatgtgaa gtcaccagtg gccttacccc 2041 tgcctttggg agcaggattt ttttgtagag agtcttatct gagctgagcc aggctagctg 2101 gagcctggga tttctatgca gtggcccctt aggccagtga tgtgcggtgg gtgggctgtt 2161 taggggatct ggaagggcca aggtctgagc actggagtgg ctcgccaggc caaatcaccc 2221 ttagaaggct gcagataaca gaaaggcttt ttataaactt ttaaagaaat ataaacacaa 2281 atatagagat tttttaacca tggcagggtg ctagtggtgg gcagaatgct tttttttctt 2341 tctgaaggct ttgtgatagt gacatgatac aaacactaca gacaataaat attaggagac 2401 acagggaagt ggggagaggt ggggagtaat agtaaacaca gggaagagct cccctacgga 2461 ccaggtatag agaaaggtct atgcagaaat aggttagagt ttccctaaca aaaaagctaa 2521 cccaggtccc ctcattcctt caacttgtgc ctgggagtgt gtggtgttag ggtgcagcca 2581 cactcttcta tgacccagca tgggttagtg ctatggtggg agagtacatt gaaggcctgg 2641 aattagcttg gggccaggga agggactggg aggggagaga agagaaggag ggaaggattt 2701 aggatggtaa agttaggtac agagacctcc ctgttcaagg cccctgacag ctgtccctgc 2761 ccttcttccc cttccctgac tgcaggggtt atgtggaagt gtgtgtggca gcaggcagcg 2821 gggaggggag gaacagggaa gggggagctg gggagcttgg ctgagggtct gggaaatgag 2881 cagggatggg gggggatgtg gatcaggttt actagcacct gccagggagg ccatctgggg 2941 ctccttctcc accccagccc ccaaagcagc ccttccccca gtgccctttg catcgtcccc 3001 tcccccaccc ctgctgtggg ttcccatcat ttcctgtgtc agcgcctggc ctacccagat 3061 tgtatcatgt gctagattgg agtggggaag tgtgtcaaat caataaatga ataaattcaa 3121 taaatgccta taaccagcag aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa a // LOCUS HSAGLO1 575 bp RNA PRI 23-DEC-1994 DEFINITION Human messenger RNA for alpha globin. ACCESSION V00493 NID g28557 KEYWORDS alpha-globin; complementary DNA; globin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 575) AUTHORS Wilson,J.T., Wilson,L.B., Reddy,V.B., Cavallesco,C., Ghosh,P.K., deRiel,J.K., Forget,B.G. and Weissman,S.M. TITLE Nucleotide sequence of the coding portion of human alpha globin messenger RNA JOURNAL J. Biol. Chem. 255 (7), 2807-2815 (1980) MEDLINE 80137531 FEATURES Location/Qualifiers source 1..575 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA 1..575 /note="messenger RNA" misc_RNA 1 /note="capped by m7G-ppp" CDS 38..466 /note="reading frame alpha-globin" /codon_start=1 /db_xref="PID:g28558" /db_xref="SWISS-PROT:P01922" /translation="MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYF PHFDLSHGSAQVKGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLL SHCLLVTLAAHLPAEFTPAVHASLDKFLASVSTVLTSKYR" polyA_site 575 /note="polyA addition site" BASE COUNT 101 a 211 c 158 g 105 t ORIGIN 1 actcttctgg tccccacaga ctcagagaga acccaccatg gtgctgtctc ctgccgacaa 61 gaccaacgtc aaggccgcct ggggcaaggt tggcgcgcac gctggcgagt atggtgcgga 121 ggccctggag aggatgttcc tgtccttccc caccaccaag acctacttcc cgcacttcga 181 cctgagccac ggctctgccc aggttaaggg ccacggcaag aaggtggccg acgcgctgac 241 caacgccgtg gcgcacgtgg acgacatgcc caacgcgctg tccgccctga gcgacctgca 301 cgcgcacaag cttcgggtgg acccggtcaa cttcaagctc ctaagccact gcctgctggt 361 gaccctggcc gcccacctcc ccgccgagtt cacccctgcg gtgcacgcct ccctggacaa 421 gttcctggct tctgtgagca ccgtgctgac ctccaaatac cgttaagctg gagcctcggt 481 agcagttcct cctgccagat gggcctccca acgggccctc ctcccctcct tgcaccggcc 541 cttcctggtc tttgaataaa gtctgagtgg gcggc // LOCUS HSAGLUCIE 2890 bp RNA PRI 26-AUG-1997 DEFINITION H.sapiens mRNA for processing a-glucosidase I. ACCESSION X87237 NID g2344809 KEYWORDS a-glucosidase I. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2890) AUTHORS Kalz-Fuller,B., Bieberich,E. and Bause,E. TITLE Cloning and expression of glucosidase I from human hippocampus JOURNAL Eur. J. Biochem. 231 (2), 344-351 (1995) MEDLINE 95361857 REFERENCE 2 (bases 1 to 2890) AUTHORS Bause,E. TITLE Direct Submission JOURNAL Submitted (17-MAY-1995) E. Bause, Institut fuer Physiologische Chemie, Chemie der Universitaet Bonn, Nussallee 11, D- 53115 Bonn, FRG REMARK Revised by [3] REFERENCE 3 (bases 1 to 2890) AUTHORS Kalz-Fuller,B. TITLE Direct Submission JOURNAL Submitted (26-AUG-1997) B. Kalz-Fuller, Institut fuer Physiologische Chemie, Chemie der Universitaet Bonn, Nussallee 11, D- 53115 Bonn, FRG FEATURES Location/Qualifiers source 1..2890 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /chromosome="2" /map="p12-p13" /clone_lib="HL3023a (lgt10, clontech)" /tissue_type="brain, hippocampus" CDS 133..2643 /codon_start=1 /product="a-glucosidase I" /db_xref="PID:e337821" /db_xref="PID:g2344810" /translation="MARGERRRRAVPAEGVRTAERAARGGPGRRDGRGGGPRSTAGGV ALAVVVLSLALGMSGRWVLAWYRARRAVTLHSAPPVLPADSSSPAVAPDLFWGTYRPH VYFGMKTRSPKPLLTGLMWAQQGTTPGTPKLRHTCEQGDGVGPYGWEFHDGLSFGRQH IQDGALRLTTEFVKRPGGQHGGDWSWRVTVEPQDSGTSALPLVSLFFYVVTDGKEVLL PEVGAKGQLKFISGHTSQLGNFRFTLLPPTSPGDTAPKYGSYNVFWTSNPGLPLLTEM VKSRLNSWFQHRPPGASPERYLGLPGSLKWEDRGPSGQGQGQFLIQQVTLKIPFSIEF VFESGSAQAGGNQALPRLAGSLLTQALESHAEGFRERFEKTFQLKEKGLSSGEQALGQ AALSGLLGGIGYFYGQGLVLPDIGVEGSEQKVDPALFPPVPLFTAVPSRSFFPRGFLW DEGFHQLVVQRWDPSLTREALGHWLGLLNADGWIGREQILGDEARARVPPEFLVQRAV HANPPTLLLPVAHMLEVGDPDDLAFLRKALPRLHAWFSWLHQSQAGPLPLSYRWRGRD PALPTLLNPKTLPSGLDDYPRASHPSVTERHLDLRCWVGLGARVLTRLAEHLGEAEVA AELGPLAASLEAAESLDELHWAPELGVFADFGNHTKAVQLKPRPPQGLVRVVGRPQPQ LQYVDALGYVSLFPLLLRLLDPTSSRLGPLLDILADSRHLWSPFGLRSLAASSSFYGQ RNSEHDPPYWRGAVWLNVNYLALGALHHYGHLEGPHQARAAKLHGELRANVVGNVWRQ YQATGFLWEQYSDRDGRGMCRPFHGWTSLVLLAMAEDY" BASE COUNT 564 a 859 c 886 g 581 t ORIGIN 1 ggcgctggct ggcaggtgtc gctaaccgga cggtggtcgc cagggcgaga ggcgggagcc 61 ggagaggtga ggcaggaccc gggctccact gccgcctctc cgagctcttg tgacgcggac 121 ctcagtgcca ggatggctcg gggcgagcgg cggcgccgcg cagtgccggc agagggagtg 181 cggacagccg agagggcggc tcggggaggc cccgggcgac gggacggccg gggcggcggg 241 ccgcgtagca cggctggagg agtggctctg gccgtcgtgg tcctgtcttt ggccctgggt 301 atgtcggggc gctgggtgct ggcgtggtac cgtgcgcggc gggcggtcac gctgcactcc 361 gcgcctcctg tgttgcctgc cgactcctcc agccccgccg tggccccgga cctcttctgg 421 ggaacctacc gccctcacgt ctacttcggc atgaagaccc gcagcccgaa acccctcctc 481 accggactga tgtgggcgca gcagggcacc accccgggga ctcctaagct caggcacacg 541 tgtgagcagg gggacggtgt gggtccctat ggctgggagt tccacgacgg cctctccttc 601 gggcgccaac acatccagga tggggcctta aggctcacca ctgagttcgt caagaggcct 661 gggggtcagc acggagggga ctggagctgg agagtgactg tagagcctca ggactcaggt 721 acttctgccc tccctttggt ctccctgttc ttctatgtgg tgacagatgg caaggaagtc 781 ctactaccag aggttggggc caaggggcag ttgaagttta tcagtgggca caccagtcaa 841 cttggtaact tccgctttac acttttgcca ccaaccagtc caggggatac agcccccaag 901 tatggcagct acaatgtctt ctggacctcc aacccaggac tgcccctgct gacagagatg 961 gtaaagagtc gcctaaatag ctggtttcag catcggcccc caggggcctc ccctgaacgc 1021 tacctcggct tgccaggatc cctgaagtgg gaggacagag gtccaagtgg gcaagggcag 1081 gggcagttct tgatacagca ggtgaccctg aaaattccat tttccataga gtttgtgttt 1141 gaatcaggca gtgcccaggc aggaggaaat caagccctgc caagactggc aggcagtcta 1201 ctgacccagg ccctggagag ccatgctgaa ggctttagag agcgctttga gaagaccttc 1261 cagctgaagg agaagggcct gagctctggc gagcaggctt tgggtcaggc tgccctcagc 1321 ggcctccttg gtggaattgg ctacttctac ggacaagggc tggtattgcc agacatcggg 1381 gtggaagggt ctgagcagaa ggtggaccca gccctctttc cacccgtacc tctttttaca 1441 gcagtgccct cccggtcatt cttcccacga ggcttccttt gggatgaagg ctttcaccag 1501 ctggtggttc agcggtggga tccctccctc acccgggaag cccttggcca ctggctgggg 1561 ctgctaaatg ctgatggctg gattgggagg gagcagatac tgggggatga ggcccgagcc 1621 cgggtgcctc cagaattcct agtacaacga gcagtccacg ccaacccccc aaccctactt 1681 ttgcctgtag cccatatgct agaggttggt gaccctgacg acttggcttt cctccgaaag 1741 gccttgcccc gcctgcatgc ctggttttcc tggctccatc agagccaggc aggcccactg 1801 ccactatctt accgctggcg gggacgggac cctgccttac caaccttact gaaccccaag 1861 accctaccct ctgggctgga tgactacccc cgggcttcac acccttcagt aaccgagcgg 1921 cacctggacc tgcgatgttg ggtgggactg ggtgcccgtg tgctgacgcg gctggcagag 1981 catctgggtg aggctgaggt agctgctgag ctgggcccac tggctgcctc actggaggca 2041 gcagagagcc tggatgagct gcactgggcc ccagagctag gagtctttgc agactttggg 2101 aaccacacaa aagcagtaca gctgaagccc aggccccctc aggggctcgt tcgggtggtg 2161 ggtcggcccc aacctcaact gcagtatgta gatgctcttg gctatgtcag tctttttccc 2221 ttgctgctgc gactgctgga ccccacctca tcccgccttg ggcccctgct ggacattcta 2281 gccgacagcc gccatctctg gagccccttt ggtttacgct cccttgcagc ctccagctcc 2341 ttttatggcc agcgcaattc agagcatgat cccccctact ggcggggtgc tgtgtggctc 2401 aatgtcaact acctggcttt gggagcactc caccactatg ggcatctgga gggtcctcac 2461 caggctcggg ctgccaaact ccacggtgag ctccgtgcca acgtggtagg caatgtatgg 2521 cgccagtacc aggccacagg ctttctttgg gagcagtaca gtgaccgcga tgggcgaggc 2581 atgtgccgcc ctttccacgg ctggaccagc cttgtcttac tggccatggc tgaagactac 2641 tgaagggagg gagaggaggg gagccaagac actcatgcca ctctggctct gaaggacaag 2701 ggacaaaggc ttctggcttt tgcccccagc cccttggata ccagtaattc aaaccttcct 2761 cattcattct caggtgtctc cttgctgtca tcccacatag ccctggggtg aatgtgaatc 2821 cagagtctat ttttctaaat aaattggaaa aaacaaaaaa aaaaaaaaaa aaaaaaaaaa 2881 aaaaaaaaaa // LOCUS HSAICL 759 bp RNA PRI 11-MAR-1997 DEFINITION H.sapiens mRNA for AICL (activation-induced C-type lectin). ACCESSION X96719 NID g1632815 KEYWORDS activation antigen; C-type lectin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 759) AUTHORS Hamann,J., Montgomery,K.T., Lau,S., Kucherlapati,R. and van Lier,R.A. TITLE AICL: a new activation-induced antigen encoded by the human NK gene complex JOURNAL Immunogenetics 45 (5), 295-300 (1997) MEDLINE 97190245 REFERENCE 2 (bases 1 to 759) AUTHORS Hamann,J. TITLE Direct Submission JOURNAL Submitted (21-MAR-1996) J. Hamann, Central Lab. Netherlands Red Cross Blood Transfusion Service, Dept. KVI, Plesmanlaan 125, NL-1066 Amsterdam, NETHERLANDS FEATURES Location/Qualifiers source 1..759 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="PBMC: peripheral blood mononuclear cells" /clone_lib="cDNA library from PMA-activated PBMC, vector pCDM8" /map="12p" gene 133..582 /gene="AICL" CDS 133..582 /gene="AICL" /note="activation induced" /codon_start=1 /product="C-Type lectin" /db_xref="PID:e239126" /db_xref="PID:g1632816" /translation="MMTKHKKCFIIVGVLITTNIITLIVKLTRDSQSLCPYDWIGFQN KCYYFSKEEGDWNSSKYNCSTQHADLTIIDNIEEMNFLRRYKCSSDHWIGLKMAKNRT GQWVHGATFTKSFGMRGSEGCAYLSDDGAATARCYTERKWICRKRIH" BASE COUNT 296 a 113 c 144 g 206 t ORIGIN 1 ctgtgctgta aaaacaagag taacattttt atattaaagt taaataaagt tacaactttg 61 aagagagttt ctgcaagaca tgacacaaag ctgctagcag aaaatcaaaa cgctgattaa 121 aagaagcacg gtatgatgac caaacataaa aagtgtttta taattgttgg tgttttaata 181 acaactaata ttattactct gatagttaaa ctaactcgag attctcagag tttatgcccc 241 tatgattgga ttggtttcca aaacaaatgc tattatttct ctaaagaaga aggagattgg 301 aattcaagta aatacaactg ttccactcaa catgccgacc taactataat tgacaacata 361 gaagaaatga attttcttag gcggtataaa tgcagttctg atcactggat tggactgaag 421 atggcaaaaa atcgaacagg acaatgggta catggagcta catttaccaa atcgtttggc 481 atgagaggga gtgaaggatg tgcctacctc agcgatgatg gtgcagcaac agctagatgt 541 tacaccgaaa gaaaatggat ttgcaggaaa agaatacact aagttaatgt ctaagataat 601 ggggaaaata gaaaataaca ttattaagtg taaaaccagc aaagtacttt tttaattaaa 661 caaagttcga gttttgtacc tgtctggtta attctgctta cgtgtcaggc tacacataaa 721 agccacttca aagattggca aaaaaaaaaa aaaaaaaaa // LOCUS HSAJ03144 627 bp mRNA PRI 22-JAN-1998 DEFINITION Homo sapiens mRNA for metalloproteinase. ACCESSION AJ003144 NID g2808654 KEYWORDS metalloproteinase; MMP-20 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 627) AUTHORS Bernot,A., Heilig,R., Clepet,C., Smaoui,N., Delpech,M., da Silva,C., Devaud,C., Petit,J.L., Chiannilkulchai,N., Fizames,C., Samson,D., Cruaud,C., Caloustian,C., Gyapay,G. and Weissenbach,J. TITLE A transcriptional map of the FMF region JOURNAL Unpublished REFERENCE 2 (bases 1 to 627) AUTHORS Bernot,A. TITLE Direct Submission JOURNAL Submitted (07-JAN-1998) GENOSCOPE - Centre National de Sequencage, 2 rue Gaston Cremieux, EVRY BP191, FRANCE FEATURES Location/Qualifiers source 1..627 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" /map="p13.3" gene 14..565 /gene="MMP20" CDS 14..565 /gene="MMP20" /codon_start=1 /product="metalloproteinase" /db_xref="PID:e1245446" /db_xref="PID:g2808655" /translation="MDPGTVATMRKPRCSLPDVLGVAGLVRRRRRYALSGSVWKKRTL TWRVRSFPQSSQLSQETVRVLMSYALMAWGMESGLTFHEVDSPQGQEPDILIDFARAF HQDSYPFDGLGGTLAHAFFPGEHPISGDTHFDDEETWTFGSKASQQLEQELAGGSPVD EELGFSRGWRVNPLGPGSPERLS" BASE COUNT 107 a 189 c 219 g 112 t ORIGIN 1 ggagaccggc cgcatggacc cagggacagt ggccaccatg cgtaagcccc gctgctccct 61 gcctgacgtg ctgggggtgg cggggctggt caggcggcgt cgccggtacg ctctgagcgg 121 cagcgtgtgg aagaagcgaa ccctgacatg gagggtacgt tccttccccc agagctccca 181 gctgagccag gagaccgtgc gggtcctcat gagctatgcc ctgatggcct ggggcatgga 241 gtcaggcctc acatttcatg aggtggattc cccccagggc caggagcccg acatcctcat 301 cgactttgcc cgcgccttcc accaggacag ctaccccttc gacgggttgg ggggcaccct 361 agcccatgcc ttcttccctg gggagcaccc catctccggg gacactcact ttgacgatga 421 ggagacctgg acttttgggt caaaagcctc tcagcagctg gagcaggagc tggcaggcgg 481 ctcaccggtt gatgaggagc tgggcttcag ccggggctgg cgtgtgaatc ctctgggtcc 541 tggcagtcct gagcgcctga gctgaataca gagggaagag gctgggagca aggccgggtg 601 ctggggccgg caggctgtgt tctgaga // LOCUS HSAJ03147 239566 bp DNA PRI 22-JAN-1998 DEFINITION Homo sapiens complete genomic sequence between D16S3070 and D16S3275, containing Familial Mediterranean Fever gene disease. ACCESSION AJ003147 NID g2808656 KEYWORDS HUMNK4 gene; mareno gene; marenostrin; metalloproteinase; mmp20 gene; olfactory receptor; zinc finger protein; znfmf gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 239566) AUTHORS Bernot,A., Heilig,R., Clepet,C., Smaoui,N., Delpech,M., da Silva,C., Devaud,C., Petit,J.L., Chiannilkulchai,N., Fizames,C., Samson,D., Cruaud,C., Caloustian,C., Gyapay,G. and Weissenbach,J. TITLE A transcriptional map of the FMF region JOURNAL Unpublished REFERENCE 2 (bases 1 to 239566) AUTHORS Bernot,A. TITLE Direct Submission JOURNAL Submitted (07-JAN-1998) GENOSCOPE - Centre National de Sequencage, 2 rue Gaston Cremieux, EVRY BP191, FRANCE FEATURES Location/Qualifiers source 1..239566 /organism="Homo sapiens" /db_xref="taxon:9606" source 1..36906 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" /map="p13.3" /clone="YAC 16g7" /sub_clone="bc2" STS 2959..3265 /standard_name="HSA353YH1" mRNA join(6759..6775,9222..9357,9467..9758,15042..16530) /gene="mmp20" gene 6759..16530 /gene="mmp20" exon 6759..6775 /gene="mmp20" /number=1 CDS join(6772..6775,9222..9357,9467..9758,15042..15161) /gene="mmp20" /codon_start=1 /product="metalloproteinase" /db_xref="PID:e1246030" /db_xref="PID:g2808657" /translation="MDPGTVATMRKPRCSLPDVLGVAGLVRRRRRYALSGSVWKKRTL TWRVRSFPQSSQLSQETVRVLMSYALMAWGMESGLTFHEVDSPQGQEPDILIDFARAF HQDSYPFDGLGGTLAHAFFPGEHPISGDTHFDDEETWTFGSKASQQLEQELAGGSPVD EELGFSRGWRVNPLGPGSPERLS" intron 6776..9221 /gene="mmp20" /number=1 exon 9222..9357 /gene="mmp20" /number=2 intron 9358..9466 /gene="mmp20" /number=2 exon 9467..9758 /gene="mmp20" /number=3 intron 9759..15041 /gene="mmp20" /number=3 mRNA complement(14113..16530) /gene="mmp20" /note="transcriptional unit overlapping metalloproteinase; expression only detectable in mRNAs from leucocytes (and not from liver)" exon 15042..16530 /gene="mmp20" /number=4 mRNA 25020..28566 /gene="HUMNK4" gene 25020..28566 /gene="HUMNK4" STS complement(26351..26612) /gene="HUMNK4" /standard_name="HSB070YG5" source 31077..67877 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" /map="p13.3" /clone="YAC 16g7" /sub_clone="bd6" source 46940..239566 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" /map="p13.3" /clone="YAC 633d12" /sub_clone="aa6" variation 53459 /note="aa6" /replace="a" variation 53486 /note="aa6" /replace="c" variation 57934 /note="aa6" /replace="g" variation 58731 /note="aa6" /replace="c" variation 61447 /note="aa6" /replace="c" variation 62401..62404 /note="aa6" /replace="ac" source 66755..108346 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" /map="p13.3" /clone="YAC 26fe7" /sub_clone="30c3" source 90616..137018 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" /map="p13.3" /clone="YAC 26fe7" /sub_clone="30e1" source 126806..170318 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" /map="p13.3" /clone="YAC 26fe7" /sub_clone="30g4" STS complement(161861..161970) /standard_name="CHLC.ATA41E08.P34246" gene 163476..164414 /gene="hsolfmf" CDS 163476..164414 /gene="hsolfmf" /codon_start=1 /number=1 /product="olfactory receptor" /db_xref="PID:e1246031" /db_xref="PID:g2808658" /translation="MSGTNQSSVSEFLLLGLSRQPQQQHLLFVFFLSMYLATVLGNLL IILSVSIDSCLHTPMYFFLSNLSFVDICFSFTTVPKMLANHILETQTISFCGCLTQMY FVFMFVDMDNFLLAVMAYDHFVAVCHPLHYTAKMTHQLCALLVAGLWVVANLNVLLHT LLMAPLSFCADNAITHFFCDVTPLLKLSCSDTHLNEVIILSEGALVMITPFLCILASY MHITCTVLKVPSTKGRWKAFSTCGSHLAVVLLFYSTIIAVYFNPLSSHSAEKDTMATV LYTVVTPMLNPFIYSLRNRYLKGALKKVVGRVVFSV" source 173301..217867 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" /map="p13.3" /clone="YAC 26fe7" /sub_clone="30f9" gene 174780..175778 /gene="olfmf2" CDS 174780..175778 /gene="olfmf2" /codon_start=1 /number=1 /pseudo /product="olfactory receptor pseudogene" /db_xref="PID:e1246032" source 179596..222837 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" /map="p13.3" /clone="YAC 26fe7" /sub_clone="30e10" exon complement(183121..183842) /gene="znfmf" /number=5 gene complement(183121..194589) /gene="znfmf" CDS complement(join(183121..183842,191650..191773, 192062..192150,192735..192984)) /gene="znfmf" /codon_start=1 /product="zinc finger protein" /db_xref="PID:e1246033" /db_xref="PID:g2808659" /translation="MMAAKVVPMPPKPKQSFILRVPPDSKLGQDLLRDATNGPKTIHQ LVLEHFLTFLPKPSLVQPSQKVKETLVIMKDVSSSLQNRVHPRPLVKLLPKGVQKEQE TVSLYLKANPELVVFEDLNVFHCQEECVSLDPTQQLTSEKEDDSSVGEMMLLAVNGSN PEGEDPEREPVENEDYREKSSDDDEMDSSLVSQQPPDNQEKERLNTSIPQKRKMRNLL VTIENDTPLEELSKYVDISIIALTRNRRTRRWYTCPLCGKQFNESSYLISHQRTHTGE KPYDCNHCGKSFNHKTNLNKHERIHTGEKPYSCSQCGKNFRQNSHRSRHEGIHIREKI FKCPECGKTFPKNEEFVLHLQSHEAERPYGCKKCGRRFGRLSNCTRHEKTHSACKTRK QK" intron complement(183843..191649) /gene="znfmf" /number=4 exon complement(191650..191773) /gene="znfmf" /number=4 intron complement(191774..192061) /gene="znfmf" /number=3 exon complement(192062..192150) /gene="znfmf" /number=3 intron complement(192151..192734) /gene="znfmf" /number=2 exon complement(192735..193065) /gene="znfmf" /number=2 intron complement(193066..194340) /gene="znfmf" /number=1 exon complement(194341..194589) /gene="znfmf" /number=1 exon complement(202369..202923) /gene="mareno" /number=9 gene complement(202369..215816) /gene="mareno" CDS complement(join(202369..202923,203089..203121, 203483..203515,203702..203817,205754..205776, 206245..206475,208138..208233,208660..209009, 213387..214019,215540..215816)) /gene="mareno" /codon_start=1 /product="marenostrin" /db_xref="PID:e1246034" /db_xref="PID:g2808660" /translation="MAKTPSDHLLSTLEELVPYDFEKFKFKLQNTSVQKEHSRIPRSQ IQRARPVKMATLLVTYYGEEYAVQLTLQVLRAINQRLLAEELHRAAIQEYSTQENGTD DSAASSSLGENKPRSLKTPDHPEGNEGNGPRPYGGGAASLRCSQPEAGRGLSRKPLSK RREKASEGLDAQGKPRTRSPALPGGRSPGPCRALEGGQAEVRLRRNASSAGRLQGLAG GAPGQKECRPFEVYLPSGKMRPRSLEVTISTGEKAPANPEILLTLEEKTAANLDSATE PRARPTPDGGASADLKEGPGNPEHSVTGRPPDTAASPRCHAQEGDPVDGTCVRDSCSF PEAVSGHPQASGSRSPGCPRCQDSHERKSPGSLSPQPLPQCKRHLKQVQLLFCEDHDE PICLICSLSQEHQGHRVRPIEEVALEHKKKIQKQLEHLKKLRKSGEEQRSYGEEKAVS FLKQTEALKQRVQRKLEQVYYFLEQQEHFFVASLEDVGQMVGQIRKAYDTRVSQDIAL LDALIGELEAKECQSEWELLQDIGDILHRAKTVPVPEKWTTPQEIKQKIQLLHQKSEF VEKSTKYFSETLRSEMEMFNVPELIGAQAHAVNVILDAETAYPNLIFSDDLKSVRLGN KWERLPDGPQRFDSCIIVLGSPSFLSGRRYWEVEVGDKTAWILGACKTSISRKGNMTL SPENGYWVVIMMKENEYQASSVPPTRLLIKEPPKRVGIFVDYRVGSISFYNVTARSHI YTFASCSFSGPLQPIFSPGTRDGGKNTAPLTICPVGGQGPD" exon complement(203089..203121) /gene="mareno" /number=10 intron complement(203122..203482) /gene="mareno" /number=8 exon complement(203483..203515) /gene="mareno" /number=8 intron complement(203516..203701) /gene="mareno" /number=7 exon complement(203702..203817) /gene="mareno" /number=7 intron complement(203820..205753) /gene="mareno" /number=6 exon complement(205754..205776) /gene="mareno" /number=6 intron complement(205777..206244) /gene="mareno" /number=5 exon complement(206245..206475) /gene="mareno" /number=5 intron complement(206476..208137) /gene="mareno" /number=4 exon complement(208138..208233) /gene="mareno" /number=4 intron complement(208234..208659) /gene="mareno" /number=3 exon complement(208660..209009) /gene="mareno" /number=3 intron complement(209101..213386) /gene="mareno" /number=2 exon complement(213387..214019) /gene="mareno" /number=2 intron complement(214020..215539) /gene="mareno" /number=1 exon complement(215540..215816) /gene="mareno" /number=1 STS 225956..226349 /standard_name="s374H9_1" STS 239373..239555 /standard_name="AFMef34" BASE COUNT 58567 a 60586 c 62233 g 58178 t 2 others ORIGIN 1 gatcgagaca cggtgaaccc cgtctctact aaaaatacaa aaaattagct gggcgcagtg 61 gcaggcgcct gtagtcccag cactttggga gaccgaggca ggtggatcac gaggtcagga 121 gattgagacc atcctggcta acatggtgaa accccgtctc tactaaaaat acaaaaaatt 181 acccgtgtgt ggtggcgggc gcctgtagtc ccagctactc aggaggctga ggcatgagaa 241 tcgcttgaac ctgggagacg gaggttgcag tgagccaaga ctgcaccatt gcactccagc 301 ctgggagaca gggtgagact ccatctcaaa acaaaaacaa aaacaaaaga caaaacaaaa 361 caaaatacaa aaattagccg ggcgtggtgg cacgcgcctg taatcacaac tacttgagaa 421 gctgaggcag gagaatcagt tgaacccggg cagcagaggt tgaagtgagc tgagatcgca 481 ccactgcact ccaacctggg tgacagagca agactccatt tcaaaaaaga aaaaaaagaa 541 gggaaggatg atcacccagc ctcacctgct ctccagccca aacaagccag cttccagggt 601 gcttagtttg gtatgccctc cccgcctttc catccacccg ctgagccctg gggggttctg 661 agtcttggtt gggagttgag gaggggtctc atccctgggg aagccttgtc agacctcaca 721 atgccgtggg agtctatgct aggcatccag aggccagcag aaccccactc tcctcttgcc 781 cttccccaga agcagtctgg tagaaataac ttgggtccag acccaggtgc ctccaggctt 841 ttatgacgta tgctaagtcc atgaacagta ggtagtattg tttggagcat ttaaaacctt 901 tatatataat atttcacact gaattcgttg ttctgcaact tgtttttttg atgtatgctt 961 atgtttccaa gtgttttctc ccacttatta aaaaaaaaaa aaatcagggc aggcacggtg 1021 gctcatgcct gtaatcccag cactttggga ggccgaggag gtgggtggat catggggtca 1081 ggagttcaag accagcctgg ccaagacggt gaaacgccct ctctactaaa aatacaaaaa 1141 attagctggg cgtggtggca gatgcctgta atcccagcta cttgggaggc cgagtcagga 1201 gaatcgcttg aacctgggag acagaggttg cagtgtgctg agaatgtgcc actgcactcc 1261 agcctgggtg acagaacaag agtctgtctc agaaaaaaaa aaaaaatcag ctttggctga 1321 gcacagtggc tcacgcctgt aatcccaaca ctttgggagg ctgagacggg cagataactt 1381 gaggccagaa gtttgagacc agcctgggca acatggcaaa accccgtttc tacaaacaat 1441 acaaaaatta gcaggccata gtgctgtgtg cctatagtcc cagctactca cagggctaag 1501 gcaggaggat cactttagtc caggaagtgg aggctgcagt gagctgagat tgggccactg 1561 tactccagcc tgggtgacag agcaaggccc tatctcaaaa aaaaaaaaaa aaaaaaaaaa 1621 attcagcttt attgaggtag gtattagctt cctagggctg atgtatgtaa caaaacatcc 1681 accagcacgg gaaacagagc aagaccccat ctctacaaac ataaagaata attagtggca 1741 catgcctgca gtcccggctg cttgggaggc tgaggtggga ggatcccttg agcccaggag 1801 gtcaaggctg cagtgagcca tgattgcatc actgcactcc agcctggatg agagagcgag 1861 accctgtctc aaacaaagaa agagaaccaa aaaaaccaca aactgggttt acacaacaga 1921 aaaattattg tattacagtt ctggaggcca gaagcctgaa gtgagggttg gttccttctg 1981 gaggctctga gggtgaatct actccatgcc tctccctggg ccatggtgac tatagatgct 2041 ccttggtgtt ctgtgtcttg tggccacatc agtccagtca ctgcctctgc tttcacgtga 2101 cctgcccctc taggtcttct tagtctatga ctgaaatttc cctcttttat aaagatgcca 2161 gccattgaat ttagggccca ccctaaatct aggatgatct catcttgaga tctttagctt 2221 atgtctgcaa agacccattt ttcttttttc ttttctttct ttgagacaga gtctcgctct 2281 gtcgcccagg ctggagggca gtggcacgat ctcagctcac agtaacctct gcctcccggg 2341 ttcaagcaat tctcctgcct cagcttccca agtagccggg accacaggtg cgagccacca 2401 cacccagcta atttttgtat ttttagtaca gatggggttt caccatgttt gccagactgg 2461 tctcgatctc ctgacctcgt gatctgcccg cctcggcctc ccaaagtgct gggattacag 2521 gcgtgagcca ccgcgcccgg ccagcttatg tatttttaag acagggtctc actctgtcac 2581 ttggctggaa tgcagtggtg cagtaccagc tcactgctgc ctcactgtcc ctggcacagg 2641 tgatcctccc gcctcagtct accaagtagc tggcactata ggtgtgcacc accactccca 2701 gctaatttgt gtattttttg tagagacagg gtctcactat attgtccagg ttggtttcga 2761 acgcctgggc tcaagtgatc tgcctgtgtt ggcctcccaa agtgctggga ttacaggcat 2821 gaaccaccac gcctggccta aagcgtaggc tttaatttat actaggtaat aaagtgcctg 2881 actccctttt tgatgtttta cagctgacag ctttaaagcc ccgtgcctac ctcttctcct 2941 tgtgcccaca tgtgggcaag ctgatctgaa ggccctggtg ccctctccct cactctatag 3001 agaaatttaa attaggaaag tcctggctgg ttggaaggga ccctcatccc actttatccc 3061 ctaaccataa taaaagcccc tccttgaatt acttgaacac gggaggtgga ggttgcagtg 3121 agcccagact gtgccactgc actccagcct gggcgacaga gtgagactcc attaaacaca 3181 cacacacaca cacacacaca cacacacaca caaaacaaca acaacaaaaa aaccctcctt 3241 gctctcttaa accactttca gagctgcctg ggagcccact ctgctctccc agaaagcctc 3301 atcgttgagt aatcaaactc ttcttattct cttggtgtgt tcgtggcctt attggtctct 3361 aatatgaacc aaatttggca catagggtaa atcacaaaac atgtttctat ctgcagaggg 3421 tgaatttctt tttatttatt tattttattt ctatatgaac aaatgctgcc tcccctgcag 3481 agcatccatt tctttctttc cgtttttggt agaaatgcgg tctcagtatg ttgcccaggc 3541 tgggtctccc tcaaattcca ggcctcaaag gatcctcctg ccttggcctc ccaaagtgct 3601 gggattacag gtgtgagtca ccacacacct ggctagagca cgaatttcat ctgggtcaca 3661 cctggccaat acttggcatt gtcagacttt aaaattttgc caatctgatg acaaacatgg 3721 gatctgttat ggagtaaatt gtgccctgtc aatattctta tgttgaccat tacaggaatg 3781 gtggctcacg cctgtaatcc tagcactttg ggagcctgag gcaggagaaa tggttgagcc 3841 taggagttca agaaagccct gagcaacaca gggaggcccc gtctctacca aaagaaaaga 3901 aaaagaaaaa aaaaaaagaa agaaagaaaa aacatgggca tggtggcttg ggccagttgt 3961 cccagctact tgacaggctg agacaggagg attgcttgag cctaggagtt ccaggctgcg 4021 gtgagctatg atcatgctac tgctgcactc cagcctggga gacagagcga gaccctggct 4081 caaaacaaac aaacaaacaa aaaaaccaaa accaaccaaa caaacaaaaa aaacaaacca 4141 gagagagaag aagaatttgt atgttaaagc cctaaccccc agtacctcag aatgtgatta 4201 tagttagaga tagagctgat aaaaaagtga ctaacttaaa atgaggcttt tagggtggcc 4261 ccgatgattg acttcttaca agaggtcaat gaccaggcgc ggtggctcac gcctgtaatc 4321 ccagaacttt gggaggccga ggccggcaga tcacgaggtc aggagatgga gaccatcctg 4381 gctaacacgg tgaaaccccg tctctactaa aatacaaaaa aattagccgg gcgtggtggc 4441 gggcgcctgt agtcccagct actcgggagg ctgaggcagg agaatggcgt gaaccaggga 4501 ggcggagttt gcagtgagcc gagatcgcgc cactgcactc cagcctgggc gacagagcga 4561 gactccgtct caaaaaaagc aaaacaaaac aaaacaaaac aaaacaaaag aggacgatac 4621 ttctggggtg tgcaagcgca gggggacggc cgtgtgggga aacagcaaga ggacgcccat 4681 ctgcctgccc aggaggaggc atccggagaa actagcccct caggcacctt gatcttggcc 4741 ttccagtttc cagaactgtg agaacgtaaa cttctggttg agcctcccag tccttggtgt 4801 tttgttctgg cggccccagc gagcgggcgc gggatcgcag tgtttgcctt tgcatctcca 4861 gctccgcatc ctgctcgctg tttattttgt gagcagctcc ggcggctgcc cctgaactgc 4921 tccgagactg agcagtttaa ctcacactcg cccacccgcc ctccccagga agcctcccgc 4981 ccgcttcctc cagttccttc cccacctgtc cggcctccta gcccggtgct ctcggcgcgg 5041 gagggggcgg ctcgcagacc cccaggtctc ggcttcccgg atctcccccg cctactcact 5101 ccccggtccc cggctcccgg cactggcagg cgctgtggct tccgcgcagt ggctgcagca 5161 tttaaatgtc cggcacttag ggagaggggt gcagtcgcgg agcagccgct gctcccggca 5221 tctagagggg cccaaacccc cgcgtgccca ggccgagccc cgcgcgcagc ccacgtgggc 5281 tgagggactg agtattggcc ctccggcagc tgcaaccgcg cacagctgga gctggcgggg 5341 cggagggcgg gccgggaccc cgcggggaag gaggcttcct tcccatcctc ctgctgcccg 5401 gaaatccccc cggctctcag acaacctgcc ccacggggtc ctggggggaa ggagaggggc 5461 gggatacagg gatggggagg gctggctgag ccgaaggccc gggccgccct cccccaggcc 5521 gcgcggcttc gagaaagcag aagcgagccc tgccccgggg agtcccacag gagcggggat 5581 ctgggaagag aaacttgcgg acgagggagt tagagccgtg gacccccggc ccccgcccgg 5641 gcgtcctgcg tcggaggagg gtctctgggt cgggcagggg ccacctcttg gcccgcccct 5701 tgtcgctgga cgtccgaaga ttgggccatt tcccttgccg cgcccttacc gcatatcggg 5761 gtgcgcccgg cccggcccgg cccgccccac ccagccctcc gctcgcgccc ggagaggagg 5821 ggccgctggc gcagcgcccc gggaccccga gaggccgccg cggcacatcc agacctccgc 5881 cgctcccgcg ccctctcaac catcctggga ttcccgggcc cacccgaccc agcggcgcga 5941 ccctggccct ccgggaccct ccgctgactc caccgcgcac ttcccgggac ccccacacac 6001 atcccagccc tccggccgat ccctccctac tcggtgccgg gtgccccccg ccctctccag 6061 gcccggatct cctcccccag gtccccgggt cggccccagc caggccccct tcgaaccccg 6121 ccggcggccc gggctggggc gcaccatgcg gctgcggctc cggcttctgg cgctgctgct 6181 tctgctgctg gcaccgcccg cgcgcgcccc gaagccctcg gcgcaggacg tgagcctggg 6241 cgtggtgagc gcggggtccg caggctcctg gggtctgcag agagattggg agagggaagc 6301 tgggcccgga ctcctgggtc cggaggaggc aggggccaga ttccaatatc caaagagtga 6361 ctgaggatgg ggtctgtgct cccggcttct tgggtctgtc gggaagtttg gggctgggat 6421 ttggagctct ggaaaggagc tgaggtgcgg agctgaagtc cagacaagac tggcaaaccc 6481 gggactgaag cggggaccta tggaggggag caggcaggca tccggggctg gggccctggg 6541 ctctgggtgc ctgggagggg cagggctgcc aggatggtgg tgggcagaga gagcccatac 6601 ttttcaccca gcccgcttca catgccccct ccgatgccct aggactggct gactcgctat 6661 ggttacctgc cgccacccca ccgtgcccag gcccagctgc agagccctga gaagttgcgc 6721 gatgccatca aagtcatgca gaggttcgcg gggctgccgg agaccggccg catgggtagg 6781 tggcccccac ccctacccag ccctgcctct gcacccagcc tgaccaccgc ccaacagcct 6841 ttagacctca gtgtgctcct ggaacacgga gctgtgaaga tgctgatctc aggccccaaa 6901 cccagagggc ctcaggcgtt catctttcca taagcattta tcaagaacct ggtggctcat 6961 gccggtaatc ccagggtttt gggaggcgga gactcaagga ttgcttcagg ccaggagttt 7021 gagaccagct tggccaacaa agtgagaccc ccccactctc caaggatttt tttttttttt 7081 tttgagacgg agtatccctc tttttgccca ggctggagtg caatggtacc atctcggctc 7141 accgcaacct ctgcctccca ggttccagtg attctcctgc ctcagcctcc tgagtagctg 7201 ggattacagg cacgcaccat cacaccaggc taattttgta tttttagtag agacggggtt 7261 tctccatgct gatcaggctg gtttcgaact cctggcctca ggtaacccgc cggccttggc 7321 ctcccaaagt cctgggatta caggtgtgag ccaccgtgcc cagtcctcta aggatttttt 7381 taaatttagc tgagtgtagt ggcataggcc tgtggtacgc tgagatggga aaattgctta 7441 agcccagaag ttcaaggctg cagtgagcta tgattgcacc gttgcactcc aacctgggca 7501 acagatggag acccagtctc taaccaaata tactttgagc atctcttaga tgccacgccc 7561 tttcagctcc aatggtatag tgataaataa agtaggtaag gttcctgctc tcatggagct 7621 aactttctcg agggagagag agagagacag atcaagaaca taaatatgaa agataatttc 7681 agatggcggt gagtgtttag aaaaaaataa aatgctattg gcattgaggg gctggtctgg 7741 ttagggcagg tccctctgaa aagtgtcata tgagctgaga ccacaaagag gaggaaccag 7801 ccatgggaag atgtggaaga agagtgtccc gggcaagtgc aaagacccca cggcacgcag 7861 aagcttgtgt gcctggagca aaaggagcat tgaagagctc tggcgagagg cctcagccaa 7921 gggccctgaa aggtgattag ggttgattcc aagggtgatt agcaaccacc taagtgttct 7981 cagaagggaa gcaattcaga acttggtatg acttgcaatt tttttttttt tttttttaga 8041 cagagtctca ctctgtggcc caggctgcag tacagtggcg caatctcggc tcattgcaat 8101 ctctgcctcc cgggttcaag tgattctcct gcctcagcct cctgagtagc tgggactaca 8161 ggcatccgcc accatgccca gctaattttt ttgtatgttt agtagagaca gggttttgcc 8221 atgttggcca agcaggtctc gaactcttag cctcaagcga tcagcccacc ttggcctccc 8281 acagtgctgg gattacaggc atgagccacc gtgcccggct ttgacttgca gttttaagct 8341 tcctctgggt cctgatggag aatggaaggc aggagccctg agaccaggca gaagacggct 8401 gcggagtcca cacgagggat ggtggcagaa ggatgggtgg catatgtggc cagaagtagt 8461 tggattcggg gtgtatattt tggaggcagg gccgaggaac tcttctgtgg ggttaagtgg 8521 gcaggaagag ggaatttgaa ggatctggct cactacaagg aaaggtaggc cctgtgtctg 8581 tcctggtgac tcacaaacag cctagaacag tgtctggcat ctatatgtcc ttcatcacta 8641 tttgccgcct caacaaatga ggcgcagttt gaccttaatt tggagatgag gctactgggg 8701 atggagccca gctgggaaag gggaccccag agagacctaa cccaggaggg ggtggcttgc 8761 cacaggaggc cagagaccca gccggcctgg gcactttgga cctaatctgg gtttgaggag 8821 aacttggcag tgatggtgtg aggcctgcac agctcccagg gggttccagc ctcaaagggt 8881 cctcactgca tacccagggc tcctttctcc agagtccgcc caccagggcc cagggaagct 8941 ggatgcaaaa tcctggcctg agcctctgag catcacctgc atgcagccag tttgtcccag 9001 gccaggtgac gccactccaa agggcagtag gaatggcttg gtaggagctg gcttccagag 9061 acagactgcc tgggttcaaa cctgggctgt ggggccttcg agggcacatt tcttgagccc 9121 tctgagcctc agttttccca tgtagaaagt tgactggtgc tatgattaag gcaggctctc 9181 ctattgtgga cacacccccc accgccaaat gtctcccgca gacccaggga cagtggccac 9241 catgcgtaag ccccgctgct ccctgcctga cgtgctgggg gtggcggggc tggtcaggcg 9301 gcgtcgccgg tacgctctga gcggcagcgt gtggaagaag cgaaccctga catggaggta 9361 ggtcgtgggg cccacccgca ccctggccct gcctgctggg ctccggcttt gaatggctgc 9421 ctgctccctc cacggccacc cttacacctc actcccctct ccccagggta cgttccttcc 9481 cccagagctc ccagctgagc caggagaccg tgcgggtcct catgagctat gccctgatgg 9541 cctggggcat ggagtcaggc ctcacatttc atgaggtgga ttccccccag ggccaggagc 9601 ccgacatcct catcgacttt gcccgcgcct tccaccagga cagctacccc ttcgacgggt 9661 tggggggcac cctagcccat gccttcttcc ctggggagca ccccatctcc ggggacactc 9721 actttgacga tgaggagacc tggacttttg ggtcaaaagg taaaatctcc tctcttatga 9781 gagatcctct tgccaggtct ggtcttaaga ataagttttc tgttgtgtgt tttgttttgt 9841 ttaagaaaca aggtcttgct ctgttgctca ggctagagtg cagtggtgca atcatagcta 9901 actgcaacca cgaacttctg gctcaagcaa tcctcccacc tcagactcca gagtagctgg 9961 gatcactggt gcacaccacc atacctggct attttttttg tttgtttttt ttactttttg 10021 tagaaacagg gtctcattat cttgctcagg ctggagtgca gttgcacgat cacggctcac 10081 tgcagccttc acctcctggg ctcaagtgat cctccctcct cagccaccac gtagccagga 10141 ccacagatgt gcaccaccac acctggctag ttttaaaact ttttgtagag atggggcttc 10201 actatgtcac cagggctggt cttgaactcc tgggctcaag tgatcctccc atcttggtct 10261 ctcaaagcat tgggattata agcatgagcc accactcctg gccaataatt aggtttaaat 10321 tatatttact gatggaggct ttatcagctg ggaggtctcc agagaaacaa aaccaacagg 10381 atgtgtgtgt gtgtgcacgt gtgtgtggga tgtgtgtgtg tgtgtatgtg tgtggggggg 10441 gtgtgtgtgt gcgtgtgtgt aaagagattt attttaagga attggctcac gggattgtag 10501 gggatagcaa gtccaaaatt tgtacggcag gctggcaact caggatttca gtgacagtct 10561 ggaggcagaa ttccttcctc tccaggaaac cttagcttta gctctcaagg ccttgaactg 10621 attggctctg cccccacccc acccccagcc acactatgga gcatgatcta cttgacttaa 10681 ggtcaactga ttgtagatgt taattgcatc tacaaagacc ttcaccatgg catctagact 10741 ggtgtttcac caaccagctg ggcacgacag ccgggccaaa ctgacacaca caatgaacca 10801 tcacagaggc ctagcaggtg cctagctcca agccagagct gggggcttgc agagacgaaa 10861 acaaatcttt acctggcagc tgagtcagaa gagtgactgg caatgacaac acccacggtg 10921 acacagctcc acagaatccc agcagaggaa gtcctcaagc ctgcaggggc tcagagatgt 10981 tggcccaaag ctcttgggca aggggaggct tcctagaaga ggtgacttca ttccctcaat 11041 atatatatga atataggccg ggcgtgattg ctttcatctg taatcctagc acttcgggaa 11101 gcacaggtgg gtggatcacc tgaggtcagg agtttgagac cagcctgacc aacatgttga 11161 aaccccgtct ctattaaaaa tacaaaaaat tagccgggtg tggtggtgca agcctgtaat 11221 cccagctact ggagaggctg aggcaggaga atcagttgaa tcctggaggc agaggttgca 11281 gtgagccgag atcgtgccac tgcactccag cctgggcaac aagagcaaaa ctccatctca 11341 aatatatata tatatatatg aatatccaca catacacact gagtgcctac tagcacaaga 11401 caagacatac agagttccgc tctccccagg tacacagggt tgagatccaa tgggaaagca 11461 gtgagccagc tgagtcaggg gtaagaggag ccagtgacca gagaaatgac ccgggacata 11521 tgaggccaga agccaaaact atggcatcag tagggagact gcagaaggcg gcaggtcctg 11581 gatcaccaag gaccttgaaa gcatgtggaa gactttgatc aatgggggaa tcattaaggg 11641 gttttgcatt ccggaactat catcagcttc agcggggggc agggggtagg atagaggcag 11701 ggaggcagga agttctcgca gggtccaggc aagagatggt ggagtgtggt cctggggaag 11761 ggggcagtgg acggatctga gggaggttca gttaggatgg gcaaggattg agatctggct 11821 ggatcctggt gacaggggag atgaggctgg ggcccttgga ggatggcgtc ctggtgctgt 11881 taatgcaggt gtgagagtgg gaggaggagg agcaggctgg cagaaggcag gcacaagtgc 11941 aggtctgggc gggagtctgc agcgctctgg gctggccaca ccggtggggc aagggaggtg 12001 gaggatcctg ccctagagct tggtctcctc tgtctgaggt caggttcccc agaggtgggg 12061 ctccagaggg aaattcttgt gcaagcagtt tctcaggggc catgtgctga ggggaatggg 12121 gagcaggaac agcctagaag ctgagtgagg atgtggtctc agctggagag gaggctgatc 12181 gcatgggagc tctgcagcag ggatggcaat gccggggagg cggtcccgcc tgaggccagg 12241 cccagctctt tgcagcccca tattaactac attggctgct ggggagcctg agtgacgggg 12301 gaagggggca gccagggagg gtgcctaggc cctgtggagg gcttggtaag gcagaggctg 12361 gggaggctgt gtgcctcaac attcacatat gggcttcggc tgggcgtggt ggcacatgct 12421 tgtagtccca gctactcagg aggctaaggc aggaggattg cttgagtcag ggaggttgag 12481 actgcaatga gctgtgttca caccactgca ctgcggcctg ggtgacaaag tgagaccctg 12541 tcacacacac agaaaaagag aatgtgggct tcaaagctga attcaagtaa gtcagggttc 12601 ttccacttgt aatgaacaac ttccttaacc tcattgtctc aggttcctcg cctgagaaat 12661 gcgaggagaa cggcgcctgc tggtggcgtg ggcttgccga tgaaatgagc cattcaccca 12721 tgcctctcag cacaggacct gccttgggcc gcatagtgtg cattgtcagc tgctgtcggt 12781 gctgggctag cggccgcgtt gagacccagc agtcctcggg ggttggcgag cttctgggag 12841 gaatgggtgg aacgtgactg tcaggtacag agctacagag caccccagca tgatggtgta 12901 tggtggcagg ggagaacgac tcaacagaga tggagaaaga ctaagcccaa aggtgatcaa 12961 caacaagagt gaaggaggaa gtggtcagct gtggctgggg caaaacaagc ccagtaagtt 13021 acattggcct gtgtttggtc atccagggac aaagtgacct ttttagtgcc tgtaggccca 13081 gcgcgttggg gaagaattta gaatgcagtg gggataggtg tgagaggagc gggggaagtg 13141 aagattgagt tgagaccact ggctggagaa cttgaccggg aagggagggc agagagcagc 13201 atggggtgag gggagggaat ggatccaacg agcgtcttag gatgagggag ttgtgagcgc 13261 tcaacaccat tacaaagaag ggatcgggga ttgatagact gacagatgca tgcacagagg 13321 caggtatgga tgcatgcaca gaggcaggga tggatggacg gacaaatgca tgcatagagg 13381 caaggatgga tggatggaca gatgcatgca cagagtcagg gatggatgaa tggacagatg 13441 catgcacaga ggcagggatg aatggacgga cagatgcatg cacagaggca gggatggatg 13501 gacggacaga tgcatgcaca gaggcgggga tggatggaca gatgcatgca cagaggtggg 13561 gatggatgga tggacagatg catacacaga ggcagggatg aatggacaga tgcatgcaca 13621 gaggcaggga tggatggatg gacagatgca tgcacagagg cagggatgaa tggacagatg 13681 catgcacaga ggcagggatg gatggatgga cagatgcatg cacagaggca gggatgaatg 13741 gacagatgca tgcacagagg cagggatgga tggacagatg catgcacaga ggcggggatg 13801 gatggatgga cagatgcatg cacagaggca gggatgaatg gacagatgca tgcacagagg 13861 cagggatgga tgtacggaca gatgcatgca cagaggtggg gatggatgga cagatgcatg 13921 cacagaggcg gggatggatg gacagatgca tgcacagagg cagggatgga tggatgcaca 13981 gatgcatgca cagaggcggg gatggatgga cagatgcatg cacagaggcg gggatggatg 14041 gacggacgga tgcatggcca gaggcagaga tggatgggtg ggtagattgg acggacagat 14101 gatggtaggt tgttcttgtc ctggaggagg tgatccaggc tgagtcctca cactgggcag 14161 agcaagaggc aggaggtgag catggggaga tgcagctgtg tgccccagtg caccggaagt 14221 tacagatgac atcgatgtcc tgtcctgatg aaacaggaaa cacatttgtc atttgagagt 14281 tagtggctgg ggctgggtgg aagcttgagg ggctgtgggc ctggaatggg gctgagggga 14341 gcgtgcagga gctgcctgcc aaagcctgga gcaaggatga caggaggtgc cggggcccct 14401 gtggactgcc ctctgggaat cttgtggggc tgggtctgca cagctgtggt tgtgtaactt 14461 cccccagatg catggcgtga caagggtgca ggaaggggtg cagggcccgg aagcccagta 14521 tggccaagga gagacaaggc agggaagaag gtgagagatg aaagatatcg ccctgattga 14581 aacaccaaac tacccccagt ttctcagaaa aaggactcag agcaacaaga gaacctacag 14641 tcgacaggga aattcgtcag tgcggatgtg atttgagcaa gaccattgaa gcccctgctg 14701 aaaggcaggc tggcagagcc agggtacctg gccatctacc ttgaagtctc tgtgcctcag 14761 tttcctcctc tgtaaaatgg gcataggaga caggtttggt gagaagtgaa cgaaaacgtg 14821 tttgctaccc ctcttaggtc tcacgtttca tgcttagacg ctcagcgagt ggctttaggg 14881 accgagagaa atttgctttt cagttcacca ggtcgttacc tatgaacctg ccctgaacag 14941 tgctgacctt agttagagat ggccatgaaa ctcatccgca tggggtaact ggggcctccc 15001 tgtggtgctg cggggctgtg tctctgtctt ggtctcttca ggcctctcag cagctggagc 15061 aggagctggc aggcggctca ccggttgatg aggagctggg cttcagccgg ggctggcgtg 15121 tgaatcctct gggtcctggc agtcctgagc gcctgagctg aatacagagg gaagaggctg 15181 ggagcaaggc cgggtgctgg ggccggcagg ctgtgttctg agagtgcctg ctagaggagc 15241 tctgtgttcc caaggagatg gaggaagacc tggggtgggg gtggtacagg gtagggagag 15301 gaggggcaga ggctgtgaac agggagatgg gggagggggg atggaggact caagggatgt 15361 aagggatatg gcttgaggag ggatgggggc aggaatgagg gaaggtcttg ggtcaaggac 15421 atggacctac atccccagca gatgggggtg gcaaggggga gacgggtggg ggtcgctgac 15481 tatagactgt agctgggagg aggggatgct cccagggctg gggctgcctg aagagaatgg 15541 cacgcacctt gtaggctcct gccctcaccc caccgccctc ttaagtttct tttattttct 15601 ttttcttttt ttttcttttt actttttttg agacggggcc ttattctgtc ctccaggctg 15661 gagtgcaatg acatggtcat gcgtcactgc agcctccacc tcccgggctc cagcgattct 15721 cctgcctccc tgcctcagcc tgggacctga gtagctggga ccacaggcat gcgccaccac 15781 tcctggccaa tttttaaaat ttttttgtag agatggggat ctccttatgt tgtccaggcc 15841 agtcttgaac tcttgggctc aagcgatcct cccacctggg actccaaaag tgctgggttt 15901 acaggcatga gccaccacac ccagctcctc cccctcaaac ttctgtgcac aaagtgctcc 15961 cttcccagag gaggggcccc atcggtgtgt aaggtggcct attcctctgt gtgttctctg 16021 gatcttttca gccctgtggt ccagtgtcca tcacagccat gctgactgag tgactggaga 16081 cagggatgat ggagagttca tgaagggctg ggcagagagc tggggccacc tctggaggtg 16141 tcctgctgtt cctgttggcc ccagctgcac tcgcagggcc ttcctgtggc ccctttcccc 16201 aagcaggggc ggccgcagct ctcacccact ttctcctgca gacggcgagg ggaccgacct 16261 gtttgccgtg gctgtccatg agtttggcca cgccctgggc ctgggccact cctcagcccc 16321 caactccatt atgaggccct tctaccaggg tccggtgggc gaccctgaca agtaccgcct 16381 gtctcaggat gaccgcgatg gcctgcagca actctatggt agggggagag ggacctgccg 16441 cgaaaccatc attgccccat ccagtgtcct ctgcagcaag gccaggggac gcacacctgc 16501 ctgactcttt cctcacaggg aaggcgcccc aaaccccata tgacaagccc acaaggaaac 16561 ccctggctcc tccgccccag cccccggcct cgcccacaca caggtgagtc ccccaccaac 16621 tcggagacct tgggtgacca gctgcccagc ctcagtgtcc tctgagatgg ggatggtggg 16681 ggtccctgcc ttggagaaaa caaacccccc tctctactca cctctccttt cctccccagc 16741 ccatccttcc ccatccctga tcgatgtgag ggcaattttg acgccatcgc caacatccga 16801 ggggaaactt tcttcttcaa aggtgagtca tttcacttgg cctcatatat gttggtttcc 16861 tgcccacttc cagtgaccca ctggggctgt gggcttaccc tggaagcgga acttttttct 16921 tctttgagac agggtcttgc tttgttgcct gggccgcagt gcagtggtgt gatcatggct 16981 cactgcagcc tcaaaatact gggctcaagc gatcctccca cctcagcctc cctggtagct 17041 gggaccacag gcacatgcca ccacgccttg ctagtaattt attttattat tttgtagaga 17101 tggagtctca ctatattgcc caggctgggc tcaatctcct ggctcaggtg attctccggt 17161 gtcagcctcc caatgtgctg gcattacagg tctgagccac caggcaaggc cctggcactt 17221 ttagcgctaa aaagggaaaa gtctcaggcg ggccaggatg ggctggtcac cctagatcca 17281 ttgcgccctt gatttccaga tgggacccct ccccacccag ccacacaccc tgggggagga 17341 gacctcctgc tgtctcatgc cttcctgcga gaccctttgt cccctgcagg cccctggttc 17401 tggcgcctcc agccctccgg gacagctggt gtccccgcgg acccgcacgg ctgcaccgct 17461 tctgggaggg gctgcccgcc caagtgaggg tggtgcaggc cgcctatgct cggcaccgag 17521 acggccgaat cctcctcttt agcggtgagt ggggccggcg gcggggcgcg ctggggccgg 17581 cgcggggagc ccacccctga cctcccggcc tccaccctgc agggccccag ttctgggtgt 17641 tccaggaccg gcagctggag ggcggggcgc ggccgctcac ggagctgggg ctgcccccgg 17701 gagaggaggt ggacgccgtg ttctcgtggc cacagaacgg gaagacctac ctggtccgcg 17761 gccggcagta ctggcgctac gacgaggcgg cggcgcgccc ggaccccggc taccctcgcg 17821 acctgagcct ctgggaaggc gcgcccccct cccctgacga tgtcaccgtc agcaacgcag 17881 gtggggagcg cggtgacctg cgggttactg ggcctggggg tggggagagg gatgtgggga 17941 atggggacat ggaggccacc ctgcggggat gggggtcctt gggcatcagg gagcggcggg 18001 gcggggaggg accgggactc aagctctgct cctccaggtg acacctactc ttcaagggcg 18061 cccactactg cgttccccaa gaacagcatc aagaccgagc cggacgcccc ccagcccatg 18121 gggcccaact ggctggactg ccccgccccg agctctggtc cccgcgcccc caggcccccc 18181 aaagcgaccc ccgtgtccga aacctgcgat tgtcagtgcg agctcaacca ggccgcagga 18241 cgttggcctg ctcccatccc gctgctcctc ttgcccctgc tggtgggggg tgtagcctcc 18301 cgctgatggg gggagccatc cagaccgaac agcgccctcc acggccgagt cccccgccgc 18361 tggacctggt cgggggttgt gaggcgctgc ggaggcccct tgtctgttcc cacggacggg 18421 ggctcgggcg cggactaagc aggggggatc tcccgcgcag gggcggcggc ggcggggacc 18481 ggtcgcctgg cgctgggctc agtctcctca gggtctgaga ccccggcgct gccaccggaa 18541 cccgccttca ggggcgcacg cgcgctggga ccatgcgtcg gtcgtcgccc ccgtcgttcc 18601 ctcccggctg ccgccagggg gcggtcggac cccgcctccc gagcccgggg aggggcgggg 18661 aggacaaggg gcgggcccgc ggcctcaccc ggagggacgg cagccccggt cgcgcgctgg 18721 ccccgcagga ccttcctttt ccaggaagag ccagcttttc tcggagcgca gtcctgggac 18781 tctccgcagc cccgccccgc ctggccactg cgtctggcat tcctgggtcg ttagaggaca 18841 ggcctgactg cgaagctgtg ccttgcccct ctcccacccg cagtttctca ccccgttctg 18901 ctcccacaag gcccccctac agtcactgcc acactggtgg ggacctggga cccagacccg 18961 gaaccagccc agatatcacc cctgaggacc catgcgccac gtcctgggtg gtggaatcag 19021 tggctggagg gacgaccctt gctctccagg ctgttaacct tttccgttgc tcccccgcca 19081 cccacctcct cctccccagg ccacccagct tgggcacctc cctgggccca gaactgcctt 19141 ccattcaatg gggaaccctt ctatccccaa gaaccccttc cctgcttgca ccctggagag 19201 aacagcttga ctcccatcaa ctcaacgctg gtggaaagac agggaccgaa ccctggctca 19261 ggcctggtca ttgcctcctc agcactccct cctgggaggc cttagctcta gagtgagggg 19321 tgggtggaac ctgggggcac ctcgttcacc ctgtccccac tccccacagt tttaggatct 19381 aaatgattgc ctctggaact attcttctag actatcccac atcagaatca ctgggaaatt 19441 taagtttgca gatcccacac tcaccctgaa tcctcactca gggtggggtc aggaatctgc 19501 attttaacta gtcgcgggga ttgtgggggg cagtagctgg ctgtttcgtg gcatttctgt 19561 ggctctgcag tgttcctcca ccccaggacc aatatgttca ggccacaccg atggcctgaa 19621 ccccatgggt agagtcactt aggggccact tcctaagttg ctgtccagcc tcagtgaccc 19681 cctagtgctt cctggagctg aggctgtggg cggctgtccc agcaaccatg cgaggggttg 19741 ccccagttgc tcatacaaac agatcagcat gaggacagaa ggcaggagac tttggtcagt 19801 tacctgggaa ttctgggctg ccaggaaacg atttgggcct ctgtcagttt cttttccatg 19861 tatgaggagg gggaaatttg tatattagaa acttattcat cccactcagg acaataaaaa 19921 cgaatgtaca aaaagccctt ccattcttct gagcattgat ggacctgggg agaagtgtgg 19981 tggcaggaac tggactgacc tgaatctccc tgcttcgcag aaggaatcca tggttgatgg 20041 ggaatctaag aaggaagagt gggcccgggc gtggtggctc atgccttaat cccagcattt 20101 tgcgaggccc aggcagacgg atcatttgag gtcaggaatt tgagaccagc ctggccaaca 20161 ctgaaatccc atctccacta aaaatagaaa aaaaattagc caggcatggt ggtgcatgtc 20221 tgttatccca gctgctaagg aggctgggac aggataatca cttgaacccg ggagagggag 20281 gttgcagtga gccaagatct cacgattgca ctccagcctg gccaaaagag cgagactccg 20341 tctcaaaaat aaaaataaaa ataataataa taagaagaaa gaagagtggg aggacgaagt 20401 agagaggaat cgaaaagttg gttatggggc acccggagga cggacggcca ctgtggaatt 20461 gagactgcct catggctcca tgtagacttc cagagccttt aaaattttat gactgctaca 20521 caaaaatgtc atgtcaaagt gattattttt attttttaag tggtatcgtt taaaaatggc 20581 atggtctcat aatcaaaatg taaaactttt gtgcttcaaa ggacatcatc cagaaagttg 20641 gaaagaggct gggcacagtg gctcacgcct ataatccgaa tagtttggga ggccaaagtg 20701 ggaggatggc ttgatcgcag ggtttcaaga ccagcctggg caacatagtg agaccttgtc 20761 tctacaaaat aatcaaaaaa ttagccgggc atggtggcaa cccgtgtctg tattcccagc 20821 tacttggaag gcgaagagga gagaatcacc tgaagccggg aagtcttggc aacaatgagc 20881 catgatcacg ccactgcact ccaccctggg caacaggtgt gtctcaaaaa aataaaaata 20941 aaaataaaat tggctgggca cggttgctta tgcctgtaat cccgcatttc gggaggctga 21001 tgcagacgtt tgagaggcag atgtcacttg agtttgagag ttttagacca actggccaac 21061 atggtgaaac cctgtctcta ctaaaaatac aaaaattagc caggcgtggt ggcgcatgcc 21121 tgtaatccca gctacttgag aggctgaggc aggaaaatcg cttgaaccca gaagttggag 21181 gttgcagtga gccgagatcg tgcctctgca caccagcctg ggagaccaag cgagattgca 21241 tcaaataaaa aaaaaaaaga aacacaaaaa ttgtcacatg gtaccatttc agcgcactaa 21301 gatggctaga attaaaaaga cacagagcat aacaaatgta tctggggatg tggagaaaca 21361 ggaacccccg ggcactgctg ctgggaatgt aaaatgacag acaatttgaa acatagtctg 21421 gctgttcctc agaaggataa aaacacattt gccatgtaac ccagctattc ttctccttca 21481 tacgtatcca agagaagtga aaacataagt ctgcacagaa acttttgcgt gaatgttcac 21541 agcagcgtta ttcagaatag caaaaaaggg gaaagaaccc caacttcctt tgaggggtga 21601 atggagaaac caaatgtggt ctatccacac agtagaatat cattcagcca gaagaaggaa 21661 tgaaggactg atccgtgtta caacacagat gaacctcaag aacagcatgt tccgtgaaag 21721 aagctagaca cagaagatca cgtagtgtat aattctactt ataagaaatc tctgggagag 21781 atgcatccat tgagagagaa agtagataag cccagacgca gtggctcatg ccagtaatct 21841 cagcattttg ggaggccaag gcgggcagat cacttgaggt caggagttcg agactccttg 21901 gccaacatgg tgaaatccca tctctcctaa aaatacagaa attaggccag gctgagtggc 21961 agaacaagac tccacctcaa aaaaaaaaaa aaaaaaaatt gtcacgtgat accatttcag 22021 cccattggga tggctggaat tgaaaagaca cagagaataa caagtattgc tgaggttgta 22081 gagacacggg aaccctcgca cactgctgct gggaatgtaa aatgacagac attttgaaaa 22141 atagtctggc tgttcctcaa aaggataaaa acacatttac catgtaaccc agctattctt 22201 ctgcttcata catatgaagt gaaaacatac gtctgcacag aaacttgtac gtgaatgttc 22261 atagcagcat tattcggaat tagccaaaaa gtggaaagaa cccaaatgtc cttcgagggg 22321 tgaatggaga aaccaaatgt ggtctatcca cacagtagaa tattattcag ccagaaaaag 22381 gaatgaagga atgatgcgtg ttacaacacg gatgaacctc aagaatagca tgttcagtga 22441 aagaagctag acacagaaga tcacctagtg tataattcta cttataggaa atgtctggga 22501 gagaggcatc cattgagaga gaaagtagat gaggccagac acagtggctc atgcctgtaa 22561 tcccagcact ttgggaggcc aaggcgggca gatcacttga ggtcaggagt tcgagactcc 22621 ttcatcaaca tggtgaaacc ccgtctctac taaaaataca aaaattagct gggtgtggtg 22681 gcacgtgcct gtaatcccag ctactcggga ggctgagtca ggagaattgc ttgaacccag 22741 gaggcagagg ttgcagtgag ccgagatcgt gccactgccc ttcagcctag acaacagagc 22801 gagactctaa gaaaaaaaaa gaaaaagaaa aagtagatga gtggttatcc agggaaatga 22861 ggactggcag tgatgtctca gaagggcaga atttctctta gaggtgataa aaatgttcta 22921 aaatgggttg tggtgatagc tgcacagttc tgtgctttga attgtacagc tcaatgagtg 22981 aattgcattg cctgtaaatt gcatcaccat aaagctataa aaacagggta catacagtct 23041 gggtggatcc caagatgtct tcagatggtg gcctttgaga gtcatgaggg gctagggttg 23101 gagctgaagc ctaaagcatg acttttagct ggtgcaggag tggcttcctt ctgtctccca 23161 ctacaattca gaggatactc agggaaagat ctggacatcc tgtatgtgga aaggcggttg 23221 cgcacagtgg aatggactcc taagtggcag gcaggtgaca accgttgtct acttcaccca 23281 agggaagatt tctccatgga gcgtgtttta taggttctgt ctctccgagt cccctttgag 23341 aatataatgg gatacattgc atccatgctg aattcctaag ccccaggacc tcagaatatg 23401 cccttatttg gaaacagggt ctttatggag gtacttaagg tacaatgagg tcactggggt 23461 ggatcctact tctttttttt ttttttccga ggggaatttt cattgttgtc ccccagactg 23521 gagtgcagtg gcaccatctc agctcactgc aacattcgcc tcccgggttc aagtgattct 23581 cctgcctcag cctcccgagt agctgggaat acaggtgccg gccaccacgc aaggctaatt 23641 tttgtatttt tagtagagac aaggtttcac catgttggcc aggctggtct agaactcctg 23701 accttaggtg atctgcccgc ctcggcctcc caaagtgttg ggattacagg cgtgacccac 23761 tgcgcccggc ctagggtgga ccctatttca atatgactgg tgtcctttgg aaaggggaaa 23821 gggggacagt cacacccagg cagaacgtga tgaagatgaa gatggccatc tacaagggca 23881 ggagaaacct gaacagaatc ccagctccgg gccctcagaa ggaccccacg ctgcccacat 23941 tgaccttgga cctccagcct gcagatcgtg agggaagaga cgtcttcgac ttagggcccc 24001 ttgtcgtggt acttccttag tttggcccca ggaaaccatc ccaaaggcaa gggcgtggtt 24061 gtgctcagct gggggaaggg ggctgggggc cgtgaggagg aggtgggagg cccagccagg 24121 ctggagggtc agaacccgtg gagctagaag agcccgtagg ggagccccaa gattgctgag 24181 accagtgacc ttcggcccca gatggccttg ccttggccca gaagggtcag aaggacctgg 24241 tcagccaagc tcagacagcc ggcaggatgc cttccaccct gcagagggtc ctatcttgtc 24301 ccacaggtag atctacatca ccactagcca cccctccaac gtgcacaggc ccctgccctc 24361 acggcgcccc tcttaggtcc ggcagttcct gcctccttct gatccagaag tttctctggc 24421 ctctggagcc ggggcacacc tcatgcaagg acagggtcca aattcctttg tccttggatc 24481 ccacttggct gacgtcacct tcctgtactc agggagtttc cccagccagc tgtcccgagt 24541 ctggactttc cctctgcccc tccccactct caggctggtg gggtggggaa agcagcccat 24601 tcctgggctc agagactccc accccagctc agagggagca ggggcccagc cagggacgga 24661 ccctcattcc tcccagggac cccagacctc tgtctctctc gggtaagtct ccatctctgt 24721 ctgtctctgt ctctgtctct gtctctgtct gtttttcacg cactcagcaa ggcctcctgc 24781 cctgagagag gctccgccca ctacccccca ctttccccat aaaaccagct gagtatttgt 24841 gccaggaaga ctgcgtgcag aaggtgactg tctcagtgga gctgggtcat ctcaggtggg 24901 gagttggggt ccccgaaggt gaggaccctc tggggaggag ggtgcttctc tgagacactt 24961 tcttttcctc acacctgttc ctcgccagca ggccttggct ccttgaactt ttggccgcca 25021 tgtgcttccc gaaggtgagt gagaggctgc gtgtgctttt gtgggcatgt ctgaaaacag 25081 accgtaaggg tgcgggtgcc ctcagtattt cccgaggtgc ctgtgtgtca gggctcagtc 25141 aggggcaccc agcggcagga ggatagtgat ggggtgagag tgtcagtgga ggcgctggag 25201 gtcatatgtg tcgggggcgc tggagaacgg caggggtgtg gatgagaggg agcacctgtc 25261 ccaggagccc ttcacagccc ggaaagcccg gggcaggggt ggggcagggc tctgctggaa 25321 acgactcgga gaatgcttct ctcagaggcc ggctcagctg ggtgggccca agagcaaggc 25381 ctgtgtgggt cctggtgtct cttcctcctt tcctgggttc cctccgacct cccatcctct 25441 accactgccc caccgcaaat gctaggccca ccacaccctc cagggagctc ttcggcctgt 25501 gacaataggg gtttccatga tgtggcctgg ctcaggttca ggacagtgac ccggaggaca 25561 catggctccc gcatgtcggc acggtgctgc tttcaccctg gttcctggga aatcaggcta 25621 gcgggatggg accatcgctg cctgaaagtg tgcagacagc tgccctgccc agaatatgtc 25681 cccaggccct gcgcactctg tgggtgactg tcaccactct atagtggggg aaaccaggca 25741 tgtcaccccc gagactaggc ccttgacgtg ggggctcagc ggggattctg tggggtgcct 25801 ctggcctctg tggatgcagc cacgtgtctg caggcaggaa tggcccggga cctgtgggtc 25861 tgcatgttgg cagtcgggaa gagtggcagg ttgtagggtg gacctacctg gcaccccaaa 25921 tattaatcag ctcatcagag aggaatggct gctgttacct tctcaattgt catgtcccta 25981 aacatttttt ccttggccaa ctctcacctg ggaacatagt ggttgtggga aacccagctg 26041 agccagcctg ctccaggaca gtgtccatcc tcccgtgtgt gtacatgggg gggtgtgtgt 26101 gtgcagggag gacaccccgg cccacgcagg ccctgctctt gtgaggaggg gtcacctagg 26161 cccatgctgg ccctgctctt gtgaggaggg gtcacctagg cccacgcagg ccctgctctt 26221 gtgaggaggg gtcacctagg cccacgcagg ccctgctctt gtgaggaggg gtcacctagg 26281 cccacgcagg ccctgctctt gtgaggaggg gtcacctagg cccacgctgg ccctgctctt 26341 gggcctgccc agctgagccg gctcctgaga gaagcgcttt ctgagtcgtt tcgaggacag 26401 ccctggccgg tctttccagg ctgtgagggg ctcctgggac tgctgtctcc tcttatcctg 26461 tacctctgcc atgtgtctct gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 26521 ataaattatc ctggaggaaa ggttaaggtg acacatggag acagagtgtc accgttattt 26581 ccgcaggtcc tctctgatga catgaagaag ctgaaggccc gaatggtaat gctcctccct 26641 acttctgctc aggggttggg ggcctgggtc tcagcgtgtg acactgagga cactgtggga 26701 cacctgggac cctggaggga caaggatccg gccctttggt gccaactctg cctctcttca 26761 cagcaccagg ccatagaaag attttatgat aaaatgcaaa atgcagaatc aggacgtgga 26821 caggtgggtg gatttcccct caggcaccag gtcacatgtc cccgccccca ggcactccac 26881 cctgtgtggg gctcagggtg agaaggatga agagggaccc acaggctccc tcacccctta 26941 ccgtgggcaa atgcttgcac ctgggtggca gtgagtgggc gggtggggga tctggacgcc 27001 cggggagact gagggaggca tccaagcccc agggctcctt gaggaaacaa caggggtgcc 27061 agacgtggcc cgggcccctg gctgggccca gttcggggtg tgtgggagct gaggactcac 27121 tgggcttgag gactgactga tgtggggtgc agaggaggct tgggcctgga accgagtgct 27181 ttgttcctaa caggtgatgt cgagcctggc agagctggag gtgagccgtg gcctccccct 27241 ccaccaagct tagtccctgg gtcttaggct ccacaggaca ctgggtctgg gccccgggtc 27301 cccttgggaa tcacctggac cagtgggggc cacagtggga agggggcagg caggagcagc 27361 atgaaccccc tgtgccctcc tctccccagg acgacttcaa agagggctac ctggagacag 27421 tggcggctta ttatgaggag cagcacccag tgagtatgac acacccatct gggcaccttg 27481 ccttccttca cctctgccct gtcttttctt tctttctttc tttttgttta tttgagacag 27541 agtctcgctc tgtcgcccag gctggagtgc agtggcatga tcttggctca ctgcaacctc 27601 caaatctcgg gtttaagtga ttctcctgcc tcagcctgac aagtagttgg gactacaggc 27661 acccgccacc actccaggct gatttttttt gtgtgtgttt ttagtagaga ccaggtttca 27721 ccatgtttgc caggctggtc ttgacctcct aaccttgtgt tccgtctgcc ttggcctccc 27781 aaagtgctga gattacaggc atgagccacc gggcccagcc aacccctgcc ctgtcttgat 27841 gtggtgtggg cagggtgtgc ccagcccctg agcttggggt ggagggctgg gagtgacagc 27901 ctagctggga cctgcccatg gcctcactcc tcacacagtg gcacagccct caaggcacga 27961 tgagggccct gacctggtga ccaagcagac acacccatcc tgtcactgcc atggaggtga 28021 atgcagagga gggggactct gggaaaagtc cctcttgccc acggggctgt ggttgggaaa 28081 ccaacacctg tgggcctccg tctcccaggg tcaggaaaag gctgagaggc ctgggtgtgg 28141 ccagggcctg gggctgacac ccccacctac agaccctgaa tggtgctccc attccacagg 28201 agctcactcc tctacttgaa aaagaaagag atggattacg gtgccgaggc aacagatccc 28261 ctgtcccgga tgttgaggat cccgcaaccg aggagcctgg ggagagcttt tgtgacaagg 28321 tcatgagatg gttccaggcc atgctgcagc ggctgcagac ctggtggcac ggggttctgg 28381 cctgggtgaa ggagaaggtg gtggccctgg tccatgcagt gcaggccctc tggaaacagt 28441 tccagagttt ctgctgctct ctgtcagagc tcttcatgtc ctctttccag tcctacggag 28501 ccccacgggg ggacaaggag gagctgacac cccagaagtg ctctgaaccc caatcctcaa 28561 aatgaagata ctgacaccac ctttgccctc cccgtcaccg cgcacccacc ctgacccctc 28621 cctcagctgt cctgtgcccc gccctctccc gcacactcag tccccctgcc tggcgttcct 28681 gccgcagctc tgacctggtg ctgtcgccct ggcatcttaa taaaacctgc ttatacttcc 28741 ctggcagggg agataccatg atcgcggagg tgggtttccc agggcaaggc tgatctgttg 28801 ccgtattagt ccgttttcac acagctataa agaatgcctg agactgggtg atgtataaag 28861 aaaagaagtt taactgactc acagttccac atggctgggg aagcctgagg aagcttacaa 28921 tcatggggga aggcggaaga gaagcaaggc acgtcctaca tggcagcagg agaagcaggg 28981 tggggggaac taccaaacac ttttgtccgc gtgtttgtgt gtttttttta tttgagatgg 29041 agtttcgctc ttgtcaccca ggctggagtg caatggggtg atctcggctc actgcaacct 29101 ctgtctctct ggttcaagtg attctcctgc ctcagcctgc tgagtagctg gggttacagg 29161 catgcgccac cataccaggc taattttgta tttttagtac agatgcggct tcaccatgtt 29221 ggccaggctg gtcttgaact cctgacctca ggtgatccgc ccacctcggc ctcccaaagt 29281 gctgggatta caggcatgag ccaccacacc cggctgccaa acacttttaa cccaccagat 29341 cttgtgagaa ctcactcact tacaccagaa gcgcatgggg gaaaccgccc ccatgatgca 29401 atcatctcca gcccggcccc tccctcgaca tgtggggatt acagtttgag atgattgggg 29461 taaggacaca gagaaaatca tctcattctg cccctggacc ctcccaaacc tcatgtcctt 29521 ctcacatttc aaaactcaat catgccttcc caacagtccc ccaaagtcta aactcattcc 29581 aacatcaacc caaaaggcca atccaaagtc tcacctgaga caaggcaagt ctcttctgcc 29641 tataagcctg taaattcaaa agcaagttag ttacttccaa gacacattgt gggtacaggc 29701 attgggtaaa tgttcccttt ccaaattgga gaaattggcc aaaaaagggg gctacaggcc 29761 tcaggcaaga cccaaatcca ccagggcagt cattaaatct taaagctccc aagcgatctc 29821 tttggctcca tgtctcatgt ccagggctca ctgatgcaag gggtgggttc ccagggcctg 29881 gggcagctcc gcccctgtgg ctctgcaggg tgcagccccc gcagctgctt tcatgggctg 29941 gtgctgagtg cctgcggctt ttccaggcac acagagcaag cgcttggtgg atctaccatt 30001 ctggggtctg gaagagagtg gtcttctcac agctccacta ggcagtgccc cagtagagag 30061 tctgtgtggg ggctccaacc ccacatttcc tctctgcatt gccccagtag ccgttctcca 30121 tgacaacctg tgcgtggtca tccaggcatt tccatccatc ccctgaaatc taggcggagg 30181 cccccaaagc ccagctcttg tcttctgcag acctgcaggc ccaacaccat gtggaagctg 30241 ccaaggcttg ggacttgcac cttctggagc gacggcctga gctgtacctt ggtcttcttt 30301 agccacagct ggagctggag tggctggggc acagggtgcc atgtcccgag gctgcacaga 30361 gcagcggggc cctaggccta gcttgaatta atccccagga aatgggtttt cttttctacc 30421 acatggtcag gctgcaaatt ttccatcttt tatgctctgc ttccctttta agtataagtt 30481 ctaattccaa agcatctctt tgtgagtgca tgtaaccgta tgctttcaag aaaagacagg 30541 tgacttgctg aatgctttgg tgcttagaaa tttctactgc tggcggagtg cagtggctca 30601 tgtctgtaat cccagcactt tgggaggccg aggccggcag atcacgaagt caggatgtcg 30661 agaccatccc ggccaacatg gtgaaacact atctggacta aaaatgcaaa aattagctag 30721 gcatggtggc acacgcctgt aatcccagct acttgggagg ttgaggcagg agaaccactt 30781 gaaccaggga gtccaaggtt acagtgggcc aagattgcgc cactgtactc cagcctggtg 30841 aaggaccaag actccatctc aaacaaacaa acaaaaaaaa caagggacag gttcttgcta 30901 tgttgctgag gctgcagtgc agtggtgcaa tcataggtca ctgctatgga tggatggaca 30961 gatgcgtgca cagaggcagg gatggatgga cagatgcatg cacaaacacg ctgagcgcct 31021 actagcacaa gacaagacat acagggtttg tgctctcccc aggtacacag ggttgagatc 31081 caatgggaaa gcagtgagcc agctgagtca ggggtacgag gagacactga gcagagaaat 31141 aacccgggac atctgaggcc agaagccaaa actatggcat cagtagggag actgcagaag 31201 gcggcaggtc ctggatcacc aaggaccttg aaagcatgtg gaagactttg atcaatgggg 31261 gaatcattaa ggcattttgc attccggaac tatcatcagc ttcagcaggg ggcagggggt 31321 aggatagagg cagggaggca ggaagttctc gcagggtcca ggcaagagat ggtggagtgt 31381 ggtcctgggg aagggggcag tggacggatc tgagggaggt tcagttagga tgggcaagga 31441 ttgagatctg gctggatcct ggtgacaggg gagatgaggc tggggccctt ggaggatggt 31501 gtcctggtgc tgttaatgca ggtgtgagag tgggaggagg aggagcaggc tggcagaagg 31561 caggcatgag tacaggtctg ggcgggagtc tgcagcgctc tgggctggcc acacgggtgg 31621 ggcaagggag gtggaggatc ctgccctaga gcttggtctc ctctgtctga ggtcaggttc 31681 cccagaggtg gggctccaga gggaaattct tgtgcaagca gtttctcagg ggccatgtgc 31741 tgaggggaat ggggagcagg aacagcctag aagctgagtg aggatgtggt ctcagctgga 31801 gaggaggctg atcgcatggg agctctgcag cagggatggc aatgccgggg aggccgtccc 31861 gcctgaggcc aggcccagct ctttgcagcc ccatattaac tacattggct gctggggagc 31921 ctgagtgacg ggggaagggg gcagccaggg agggtgccta ggccctgtgg agggcttggt 31981 aaggcagagg ctggggaggc tgtgtgcctc aacattcaca tatgggcttc ggatgggcgt 32041 ggtggcacat gcttgtagtc ccagctactc aggaggctaa ggcaggagga ttgcttgagt 32101 cagggaggtt gagactgcaa tgagctgtgt tcacaccact gcactgcggc ctgggtgaca 32161 aagtgagacc ctgtcacaca cacagaaaaa gagaatgtgg gcttcaaagt tgaattcaag 32221 taagtcaggg ttcttccact tgtaatgaac aacttcctta acctcattgt ctcaggttcc 32281 tcccctgaga aatgcgagga gaacggcgcc tgctggtggc gggggcttgc cgatgaaatg 32341 agccattcac ccatgcctct cagcacaggg cctgccctgg gccgcatact gtgcattgtc 32401 agctgctgtc ggagctgggc tagtggccgc cttgagagcc agcagtccac gggggttggc 32461 gagcttctgg gaggaaaggg tggaacgtga ctgtcaggta cagagctaca gagaacccca 32521 gcatgaagga gtaggtggca ggggagaagg actcaacaga gatggagaaa gactaagccc 32581 aaaggtgatc aacaacaaga gtgaaggagg aagtggtcag ctatggctgg ggcaaaacaa 32641 gcccagtaag ttacattggc ctgtgtttgg tcatccaggg acaaagtgac ctttttagtg 32701 ccggtaggcc cagtgggttg gggaagaatt taggcggcag tgggcttagg catgagagga 32761 gcggaggaag agaagattga gttgagacca ctggctggag aacttgacca ggaagggagg 32821 gcagagagcg gcgtggagtc gggggaggga ataggtccaa tgagcgtctt aggatgaagg 32881 agttgtgagc actcagcaca ggataagctg atacaaagaa agggatggag gatggataga 32941 ctgacagatg catgcacaga ggcagggttg gatagcatga cagatgcatg cacagaggca 33001 gggatggata gatggacaga ggcatgcaca gagagaggga tggatagatg gacagatgca 33061 tgcacagagg cagggatgga tggatggaca gatgcatgca cagaggcagg aatggagaga 33121 cagatgagtg caaacaggta gggatggatg gacaggtgcc tacacagagg cagggatgaa 33181 tggatagatg gatgcacagg ggcagggatg gatggatgga cagatgcatg cacagaggca 33241 gggatgtatg gacggatgca tgggcagagg caaggatgaa tggatggata gatgcatgca 33301 cagaggcagg gatggacaga cagatgaatg cacagaggca gggatgtaca gaaggatgca 33361 tgcacagagg cagggatgga cggagggaca gatgcaggca cagaggcagg gatggatgga 33421 cagacagatg catacacaga ggcagggata gatggacaga tactggcaca gaggcaggga 33481 tggatggaca gatgcaggca cagaggcagg gacagacaga cagatgcatg cacagaggcg 33541 tgtatgaatg gttggaagga tgcatgcaca gaggcagtga tggatgggtg gggagattgg 33601 atggatagat gccggtaggt cgttcttgtc ctggaggagg tgatccaggc taagtcctca 33661 caccgggagg ggcaggaggc gagcatgggg agatacagct gtgtgtccca gtgcactgga 33721 agttacagat gacatcgatg tcctgtcctg atgaaacagg aaacacaatt gtcatttgaa 33781 agttagggct gagggccggg cgcggtggct cacacctgta atcccagcat tttgggaggc 33841 tgaggcgggg gggtcacctg aggtcaggag ttcgagacca gcctggccaa catggtgaaa 33901 tcccgtccct actaaaaatg caaaaattag ctgggtgtgt tggtggggcg cctgtaatcc 33961 cagatactca ggaggctgag gcaggagaat cacttgaacc tgggaggcag aggttgcagt 34021 gagctgagat cacgccattg cactccagcc tgggtgacag aggaagacac cgtctcaaaa 34081 aaaaaaaaaa aaagagacag aaaagaaagt tgggggccgg ggctggatga aacttgaggg 34141 gctgtgggcc tggcatgggg ctgaggggag tgtgcaggag ctgcctgcca aagcctgggg 34201 caaggatgat aggaggtgct ggggcccctg tggactgccc tctgagaatc tcgtgaggct 34261 gggtctgcac agctgtggtt gtgtaacttc ccccagatgc atggtgtgac aagggtgcag 34321 gaaggggtgc agggcccgga agctcagtgt ggttaaagac atgcaaggca ggatagaagg 34381 tgagagatga atgatatggc cgtgatcaaa acaccaaact tcagtttctc agaaaaagga 34441 ctcagagcaa caagagaacc tacagtcaac aaggacattc atcagcgcac atgtgatttg 34501 agcaagacca ttgaagcccc tgctgaacgg caggctggca gagccagggt acctggccat 34561 ctaccttgaa gtctctgtgc ctcagtttcc tcctctgtaa aatggacata ggggacaggt 34621 ttggtgagaa gtgaatgaaa acgtgtttgc tacccctctt aggactcatg tttcatgctt 34681 agatgctcag ctagtggttt tagggaccga gagaaatttg cttttcagtt caccaggtcg 34741 ttacctatga acctgccctg agcagtgctg accttagtta gagatggcca tgaaactcat 34801 ccgcatgggg taattggggc ctccctgtgg tgctgcaggg ctatgtctct gtcttggtct 34861 cttcaggcct ctcagcagct ggagcaggag ctggcaggca gctcaccgat ggataaggag 34921 ctgggctttg gccagggctg acatgggaag cgtctcggtc ctgccggtcc tggcgcctga 34981 gctgaatccg gagggaagag gctgctgtgg tgggaacagg ctgggagcca ggcctggcgc 35041 tggggccggc aggctgtgtt ctgagagtgc ctgctagagg agctctgtgt tcccaaggag 35101 atggaggaag acctggggtg ggggtggtac agggtaggga gaggaggggc agaggctgtg 35161 gacagggaga tgggggaggg gggatggagg actcaaggga tgtaagggat atggcttgag 35221 gagggatggg ggcaggaatg agggaaggtc ttgggtcaag gacatggacc tacatcccca 35281 gcagatgggg gagatgggcg ggggttgctg accatagact ggagctggga ggagggtacg 35341 ctgccagggc tggggctgcc agcagagaat ggcacgcacc ttataggctc ctgcccctgc 35401 cccactgccc cctcaagttc aagtttcttt ttcttttctt tctttttttg agacggggcc 35461 ttgttctgtt gtccaggctg gagttcagtg acatggtcat gcgtcactgc agcctcgacc 35521 tccccagctc aagcaagcct cctgcctccc tgcctcagct tggaacctga gtagctggga 35581 ccacaggcgt gcaccaccat tcctggctat tttttaattt ttttgtagag aagggggttc 35641 tccttatgtt gcccaggctg gtcttgagct cctgggctca agtgaaactc ctacctgagc 35701 ctcccaaagt gctaggatta caggagtgag ccaccacacc cagctccttc ccctcaaact 35761 tctgtgcaaa aagtgctccc ttcccagagg agggtcccca tctgtgtgta aggtggcccc 35821 ttcctctatg tgttctctgg gtcgcttcag ccctgtggcc cagtgtccag cacagccatg 35881 ctgagtgggt gactggagaa agggatcgtg gaggattggg gcaggggggt gctgggcaga 35941 ggcggctggg gtcacctctg gaaggtgtcc tgctgttcct gttggctgca gctgcacttg 36001 cagggccttc ctgtgccccc ctttcctggt gcaggggcac cctctgcttg ctgagaccag 36061 ctcggtcagg gagatcttaa accaggggcg ctagaggaat taaagacaca cacacagaaa 36121 ttcagaggtg cgaagtggga aatcaggggt ctgacagcct tcagagctga gagccccaac 36181 agaggtttac ccacatattt attaatagta agtcagtcat tagcattgtt tctatagata 36241 atagattaac taaaagtatc ccttatggga aatgaaggga tgggccgaag taaagtggtg 36301 ggtctggtta gttatctgca gcaggagtgt gtctttaagg cacagatcac tcaggctatt 36361 gtttgtggtt taagaacgcc tttaagcggt tttccgccct gggtgggcca ggtgtccctt 36421 gccctcattc cggtaaaccc acaaccttcc agcgtgggcg tcacggccat catgaacgtg 36481 tcacagtgct gcagattttg tttatagcca gttttggggc cagtttatgg ccagatcttg 36541 gggggcctgt tcccaacatg tccccccttc tttgatttgc aactcgataa aagcaaaggc 36601 agctttgtca ctgtgagtta cttctcgcag gagttaggat ccacatctgc agactataca 36661 aagacaaaca acacagatta aaagcacaat cacatcattg aaattacaga gcttccaagt 36721 atttttatcc attttaatgg gttactagct gcaaatctgt ctgcagttcc tttaagcact 36781 ccagttgctg gtattaaggt caggtgtgcc tgggatgctt taaatatttt ttcttttaat 36841 tttactgtat ccaaaaagct tgtagagtgt ccttctagat gctttattct ttccccaatt 36901 ttgatcttat taagagctat taatagtttc tacaaatcct taatgtttag ctcctacagc 36961 gggccttatc gtttgaggtt gaggtgccac tataccagca tggttccaga taataggaac 37021 ttttgccata cttcttattg tttctaccat ctgaccgttt tgttcagacc agctgcacat 37081 agtgtggccg tggcacgcag gctgagaggt gcaatttaag ctaaacatcc ccttagggga 37141 ccaattaaca gtgattccat aggaatcgtt gcgcagcacc tctgcctgtt ctgcaatgca 37201 atcttcctaa acaagtacgt tcattttttc tggccaggtt caattttgtt tacaaatagg 37261 tttttaaggg cggtatgcct caattatagg agcagattat ggtaaatact gagatcagaa 37321 agcatgtgta actgtgtcat agagtgatta catctaggca ttattgccag ccaaggttga 37381 taaatatgcc caataagtat aattgttctg tgtgtcagcc cttgttgaag gaatactcac 37441 agcagtggtg ataactgcta tcatagctac catgaaatta ctcattgtga ctggttgtcc 37501 cgctttcctc aggttttctt ctgccatctg tgacagcttc ttgatctgtc cccaggtggg 37561 tggctgtgtt cgacaggtgt tgctcgtgac agttggggtc ctcctcagcg tcagtctcaa 37621 catggctgca actgggccgt cctcgggatc ctcccggagt ctcttcctca gcatctggct 37681 catgataaga tttcaggtgt cttgatggta tccagatcag ctgttgattt tggcctggag 37741 aaacacaagc ataacttcta ccccaagtta ttattttacc tgtttcccaa ctttttgtta 37801 tcagatctct ccaccaaacc agttgttctg cttctgtctt tgcagctggt ttctggaatg 37861 ctgttcagct gctgatgaca tctggccttt gggcaggttc aaaaaatcta aagttaataa 37921 tgctagattc aattgtgtat ggactgtccc ataatcccta tctctccccc ttttttgttt 37981 ttgtcatcag ttgtttatct gtatgaaata gtaactgagc atttttaatt aactgtgtgg 38041 aatgaaccac atatgaagaa tcagaaatca cattaatagg ctatcaaaag cagtcaatac 38101 ctcaattaca gctacaaact ttgcttttta taaggcatct ggaaaactac tttttgagcc 38161 agaataagaa gctttaccat tactaaaccc atctgtaaaa caatgaaaac acttagcagg 38221 ctgcaggttg ttgactgcag gaatggtaaa tgcaaaccat tcacagtctt gcttagctaa 38281 ggggatagta aagaaacagt cttttaaatc tatgactatt aaaggccaac tttttggaat 38341 tatagcagga gaaggcaatc ctggctgtaa tgctcccata ggttgtgtaa ctgaattaat 38401 ggctcttaag tcggttaaca ttttctattt acctgatttt tttttaaatt atgaaaactg 38461 gagaattcca aggggaaaat gttggcgcta tgtgcccatt ttctaattgt tcagtaacta 38521 atttctctaa agcctccagt ttctatttac ttagcggcca tttttctatc caaattggct 38581 tttctgttaa ccgttttaaa ggtgtaggtt ctggaggctt aacagtggcc accatcaaaa 38641 atgatatcct aagccttggc gggaacttgt ctttccaatt gaagtggttc tttcaaacct 38701 tgcaaatttt ttttctagtc acctaccagg gacatgcccc atttcatgca tgatatgttg 38761 actttgaggg ctatataatt gttctggagt tagaacttgt gctccccatt gttgtaataa 38821 atctcttctc cataaattta taggcacaga agttataatt ggttgaatat tcccaggttg 38881 tccatcgggc ccctcacaat gcaaaatata actactttga tatacttcag gggctttacc 38941 aactccaact atgttaaatt gagcgggttg acttggccat gcggatggcc agtgctgtag 39001 agaaatgatt gaaatgtctg ctcctgtatc taccaaacct ttacatttct ttccctgaat 39061 agttatttca caggtaggac gtttatcagc aatttgattg acccaataag ccactttgcc 39121 ttgtttattt gtgcttccaa atcctcctgt tcctttagtt ccacttttcc ccattcccac 39181 atacggcaca atcaggagct gtgatacaca ttctcctggc tctgctttcc agggaacaga 39241 agtagatata acagtttgaa tttccccatc gtgatccgaa tcaatgactc ctgtatgtat 39301 ttgtcccctt ttaaacttaa actagacctt cctaaaagta atcctattcc ccctctggca 39361 cgggtccaca gactcctgtt gagacctttc gcgggtgttc ccctggcaga aagctcacgg 39421 ctttcgtgca gcatcaatct actgcggcac taccggctgc ggcgggggac agacattgta 39481 cgggggtgag ggaagggcct cagctggaaa tgccctgatt tagaaccggg cccgggacgg 39541 gcacctcatg gcatttcccg aaattgggtt tccagcttta tcaaacttag agtgacactg 39601 attagcccaa tgttttcctt ttttacattt tggacatatt tcaggctcag cagttttctt 39661 tgatattttc tacattcttt tttagtatga atatcttgtt taaattcttt gagtaattta 39721 aaagggaaag gctcaaatgt agctataata tttccctgtt gatctggggg gtgtattcta 39781 acagggaact gtcaagcctc taaatcaccc cctcgtctag cttgctgaat tcctgcctga 39841 atagaactaa gagaggtcgc tcgaggtgct gctcagtcac tggagcaact acttttcgcc 39901 cagtatcctc cggaaaagaa agatctggag ggtcaggccg ccctttttct tcaaaataat 39961 aagtaggggg tgcagaaggg tagggatgaa cgtctccctc ctttgccgct ttagctttag 40021 ctggcaaaca aacctgctct gtaacctctt ctgttacttc gttatactct cctccctcct 40081 cattagcttg aaaaggttcc aaggtggaag gaaccagacc ccaaaccaat cccattgtta 40141 ccctgatgct tccgagctcc ccttcttact caccatgggg attgctttaa gaatactcgg 40201 gtgtcctcca gctccttcca cattctccaa ccatcgctcc ggcgaccctt cgacctggat 40261 tcgagcctcg gtcggggaga ccctaaccca gcggcgctag aggaattaaa gacacacaca 40321 cagaagtaca gaggtgtgaa gtgggaaatc aggggtctca caaccttcag agctgagagc 40381 cccaacagag gtttacccaa gtatttatta acagcaagtc agtcattagc attgtttcta 40441 tagatattcc attaattaaa agtatccctt atgtgaagcg aagggacggg tcgaaataaa 40501 ggggtggctc tggctagtga actgcagcag gagcatgtcc ttaaggcaca gatcgctcag 40561 gctattgttt gtggtttaag aacgccttta agcggttttc cattctgggt aggccaggtg 40621 ttccttgccc tcattccggt aaacccacaa ccttccagtg tgggcgtcat ggccatcatg 40681 aatgtgtcac agtgctgcag agactttgtt tatagccagg aaaccaacac ctgtgggcct 40741 ccgtctccca gggtcaggaa aaggctgaga ggccggggtg tggccagggc ctggggctga 40801 cacccccacc tacagaccct gaatggtgct cccattccac aggagctcac tcctctactt 40861 gaaaaagaaa gagatggatt acggtgccga ggcaacagat cccctgtccc ggatgttgag 40921 gatcccgcaa ccgaggagcc tggggagagc ttttgtgaca aggtcatgag atggttccag 40981 gccatgctgc agcggctgca gacctggtgg cacggggttc tggcctgggt gaaggagaag 41041 gtggtggccc tggtccatgc agtgcaggcc ctctggaaac agttccagag tttctgctgc 41101 tctttgtcag agctcttcat gtcctctttc cagtcctacg gagccacacg gggggcaagg 41161 aggagctgac accccagaag tgctctgaac cccaatcctc aaaatgaaga tactgacacc 41221 acctttgccc tcccagtcac cgcgcaccca ccctgacccc tccctcagct gtcctgtgcc 41281 ccgccctctc ccgcacactc agtccccctg cctggcattc ctgccgcagc tctgacctgg 41341 cgctgtcgcc ctggcatctt aataaaacct gcttatactt ccctggcagg ggagatacca 41401 tgatcacgga ggtgggtttc acaggacaag gctgatctgt tgctgtatta gtccattttc 41461 acacagctat aaagaatgcc tgagggctgg gcgcagtggc tcacgcttgt aatcccagca 41521 ctttgggagg ccaaggcggg aggattacga ggtcaggaga tcgagaccat cctggctaac 41581 aggtgaagcc ctgtctctac taaaaataca aaaaattagc cgggcgtggt ggcgggagcc 41641 tgtagtccca gctactccgg aggctgaggc aggagaatgg cgtgaaccca ggtggcggag 41701 cttgcagtga gccgagatcg tgccactgca ctccagcctg ggagacagca agactccatc 41761 aaaaaaaaaa aaaaaaaaaa aaaaacaaag aatgcctgag actgggtaat ttataaacaa 41821 aagaaattta attgactcaa agttccacat ggctggggag gcctgaggaa acttgcaatc 41881 atgggggaag gctaaagaga aacaacgcac gtcctacatg gcagcaggag aagcaaggtt 41941 gggggaactg ctatacatct tttttttttt tttttttgag atggagttgt gcttttgtcg 42001 cccaggctgg agtgtaatgg ggtgatctcg gctcactgca acttctgcct ccctggttca 42061 agcgattctc ctgcctcagc ctgctgagta gctgagatta caggcatgca ccaccatgcc 42121 aggctaattt tgtgttttta gtacagatgg ggtttcacca tgttggccag gctggtctta 42181 aactcctgac ctcaggtgat ccgcccacct cggcctccca aagtgctggg attacaggca 42241 tgagccacca cacccagctg ccaaacactt ttaaaccatc agatcttgtg agaactcact 42301 cacaccagaa gcgcatgggg gaaactgccc ccatgatcca gtcacctcca accgggcccc 42361 tccctcgaca tgtggggatt acagtttgag atgatttggg tgaggacaca gagaaaacca 42421 cctcattctg cccctggacc ctcccaaacc tcatgtcctt ctcacatttc aaaactcaat 42481 catgccttcc caacagtccc ccaaagtcta aactcattcc aacatcaacc caaaaggcca 42541 atccaaagtc tcacctgaga caaggcaagt ctcttctgcc tataagcctg taaattcaag 42601 ttagttactt ccaagacaca ttgggggcac aggcattggg taaatgttcc ctttccaaaa 42661 tggagaaatt ggccaaaaaa gggggctaca ggcctcaggc aagacccaaa tccaccaggg 42721 cagtcgttaa atctgaaagc tcccaagcga tccctttggc tccatgtctc acgtccaggg 42781 ctcgctgatg caaggggtgg gttcccaggg cctcgggcag ctctgcccct gtggctctgc 42841 agggtgcagc ccccgcagct gctttcatgg gctggtgctg agtgcctgcg gcttttccag 42901 gcacagcgta caaactcttg gtggatctac cattctgggg tctggaggag agtggtcttc 42961 ttctcacagc tccactaggc agtgccccag tagagagcct gtgtggaggc tccaacctca 43021 catttcccct ctgcattgtc ctagtagccg ttctccatga cagccctgcc cctgacgctg 43081 acctctgcgt ggtcgtccag gcatttcata ggtccactga aatctaggcg gaggccccca 43141 aagcccagct cttgtcttct gcagacctgc aggcccaaca ccacgtggaa gctgccaagg 43201 cttgggactt gcacctcctg aagcgacggc ctgagctgta ccttggcctt ctttagccac 43261 agctggagct ggagtggctg gggcacaggg cgatgtccca aggctgcaca gagcagcaag 43321 gccctgggcc tagcttgaat ttctccccag aaaaaatttt tctttctacc acatggtcag 43381 gctgcaaatt ttccaatctt ttatgctctg cttccctttt aaatgtaagt tctaatttca 43441 aaccatttct ttgtgagtgc atataaccat acgcttttag gaaaagacag gtcacctatt 43501 gaatgctttg ctgcttagaa atttctactg ctggcctggt gcagtggctc acacctgtaa 43561 tcccagcact ttaggaggct gaggcggtca gatcacgagg tcaggagatt gagatcatcc 43621 tggccaacat ggtgaaaccc cgtctctact aaaaatacaa aaattagctg ggtatggtgg 43681 cgtgtcctat aatcccagct acttaggagg ctgaggcaag agaatcactt gaaccaggga 43741 gtcagaggtt gcagttagcc gaaatcaaga tcatgccact tcactccatc ctggtgacac 43801 agagagacac cctctcaaaa aaaaaaaaaa aaaagaaaag gaaagaaatt tctactgcag 43861 gacatgctaa gtcatctgtc tcaagttcaa agttccacag ggcaggggca aaattccttc 43921 agtctctttg ctaaagcata gcaagggtct cctttactcc aattcctagc aagtttctta 43981 tctccatctg agaccacctc agcctggact tgattatgca tatcactatc agcattttgg 44041 tgaaagccat tcaacaagtc tctaggaagt tccagacttt cccatatctt cctatcttct 44101 tctgagtcct ccaaactgtt ctaccctctt cctgttaccc agttccaagg ttgcttccac 44161 attttcaggt tatctttata gcagtgcccc acgcctgata ctaattttct ctattagtcc 44221 gttttcacac tgctgtaaaa aatacctgag actgggtaat ttataaagaa aaaaggatta 44281 attgactcac agttccaaaa aactggggag gcctcaggaa acttacaatc ttggtgaaag 44341 gcaaagggaa agcaaggcac gtcttacatg gcggcagggg agagagagag ggagagagaa 44401 agaaagcagg gaactgccaa acacttttaa aacattagat ctcaggctgg gtgtggtggc 44461 tcacacctat aatcccagca ctttgggagg ccaaggtagc acaggaatta aaagaaatta 44521 aaaaatgtgt aagcaaaaac tcagctgtat gtaagaaaaa accaattccc cctgaggaag 44581 aaaaagagct aaagtccttt aaaaattgac tgcctgtttt tctgtggcta gtgagcctta 44641 tctctccctt tcccaggcat cgtgaagacc ctgtttctct agctgtgcag ctgcaaggtc 44701 actaaacaga taatctcaag tcataacaca agttgttcct taaaaagtaa gaaataatgt 44761 aatgcatgtc ttgactgaat aactatcttt gtttctcgct tctgtaatat gcttccccca 44821 gcacaaatct ccccccatcc cacaaaatgc ttaaaaggta accggactct ttgttcaagc 44881 ctcagtcctt tggatgtgaa tcccactggg tcagtgcacc taaataatta aataattcca 44941 ccttaacccc tcggtctctc tgattcctta attatcccac tgcagaggtg ggtggatcac 45001 ctgaggtcag gagttcgaga ccagcttggc caacatggtg aggccccgtc tctactaaaa 45061 atacaaaaat taaccgggca tggtggcgtg cacctgtaat ccccgctact agaggggctg 45121 aggcaggaat tgcttgaatc caggaggcgg aggttgtggt gagcagagat tgcgcgactg 45181 ctctctagct tgggcaacaa gaatgaaact ccgtctcaaa aaaacaaaac aaaacaaaac 45241 aaacaaacaa gaaaaacatc agacctcgtg agaactcact cagtttcacc agaacagcat 45301 ggtggaaacc acccccatga tccaatcact tcctaccagg tccttccctt gacatgtgga 45361 gattacaatt ccagatgaga tttgagtgaa atagagccaa accgtctcaa ttgcaccccg 45421 gatgtgctga cccctgtgat ttccccaagt gtgggacact cgcctgcata atttgtggta 45481 gtgggggact gcattcatac cttcccctga aaacaataaa ataaaataaa ataaaagttg 45541 cttaaataga atcaggtgcc tgtctccagg cttctctgac aggcgggaac agggaggcgg 45601 ggggcccaat agtgacagca gagaccgtgg ctctgatgga cctacccaca ttccagatgt 45661 ggaaaagcaa ggccaggccc ctgactttct tattgataat gtgcatgggt ctccatgccc 45721 tggtgggttg gttgaggact agaggatttc agcctggcgt gtccttgatt acagagccgg 45781 aagcacagaa ccgctccctg atcgccaccc ctgggccccc ggcctgaggg gactttggcc 45841 ctgcatcttt cttgtcatga ctccctcttc tgtcctcatc ccccgggcag ggctggggcc 45901 gcaaaatctc cttggttttc tcccaagatc aagggcccca ggggctgggg cagagcggac 45961 tcaggagtcc cccactaggc cctggtttca ggggagaggc aagaaggcag aggtcaaatc 46021 tctgggcctg aggcctcagc accctgcagt tgcagagcct agggcggcca ggaggaacag 46081 ctgggagggc acttctcacc atttctgagg ccacgctcaa cgggcctggc agaggctccc 46141 gtcttcctcc acttcgtgcc tccgctgact cacacagacc accctccccc attcagggga 46201 ggcttggcag ccagcagcag ccggtgagat aaagatgggg ctgggggccc tgtcctgttc 46261 tgttccgtcc tttcctgggg ttccatgact cctgcctttg ggctgagcag gaaggaggaa 46321 ggggaaatgg gccacagcct ggggaggagc aaacacccac aggggaggct ccctgagcag 46381 ctctcctagg gctgagccca cggtctggga gggacactgg ggtgtgctcg agggtgtggg 46441 gggcttgggg accaggaaca ttgctggggg tcagtatgcc cctccctggt ctgcccgttt 46501 ggagcattgg atgcactctt gagttttgga gcacgcactc cttggcgcag cctagtgtcc 46561 agcaggggaa acagctggac agcgaccccg acagtcacct ttggtggccc tggtcaccct 46621 gaggggccct gcttgcttcc agcccctgtc ccagcagctc ctgcaggact gagggaggtg 46681 accagtggga aggaatgatg tcaccaggct ctctgggccc tgctgccccc agccttcctg 46741 aggaaggagg cagctggatc cggacttctc tctgcagagc ctcaaagagc cccgcaggct 46801 ttcctccttc gcatcctgca gaaacctctg atgtcccttt atcctggaag gactgtctct 46861 ggcctgagac cccagctctg aggggccgcc catggctcga ccctggtccc gcctctcctt 46921 accccgggcc acatgctggg atctaccctc tgccactgct caagttcttt tttgttttgt 46981 tttgtttttc tgagacagag tttcactctg tccccagggg ctctgtcacc cccagcctgg 47041 acagcagtct ccgccttcta gctgccgtcc ctggccttgc ctctctcaat ctatttccat 47101 gcagtagcca aagctgaagt gattttatta ctagaaaaga aatccctccc tgcctgagcc 47161 ctctgcaggt ctcccattgc acatgagccc agggctggcg tgtcagccct cagtcattgg 47221 gaactgttgt gtccccctcc gggacagatg ctgtccagct gcatctgggg ctcctccctt 47281 ccccaggcct ggcttcctga tactccctct gccagggccg cccctcccac atccttccca 47341 ggccaccttc cccttattcc cctcaataac ccagcacctt ccctggcaca cagtaggcac 47401 tcaatacatc aggagggctt cccgtgcttc aaatgtggcc tctcccaact cctcaccctc 47461 catctctcag ttgactgatc cctcctcctt gaaatgtcct ttcctggggt ccacaagagg 47521 aatttctcct gcaacatctg gtgacatagt ctcgctctgt tgcccaggct ggagtgcaat 47581 ggcatgatct aggctcactg caacctctgc ctcccgtgtt caaggaattc tcctgtctca 47641 gtctcccagg atgctgggac tacaggcaga caccaccacg cccggctaat tttggtattt 47701 tttgtagaga caggggtttc accatgtcgg tcaggctggt cttgaactcc tgacctcagg 47761 tgatccaccc gcctcggcct ctcaaagtgc tgggagcatg ggcaccatgc ccagacaact 47821 taaggagtct tatgtgacaa gtgagtactt accgcacagc acagatcaag ggtagactcc 47881 agaggagaaa acattccagg gtcagattcc ttggctctcc ccacagctgg ctgtgaaggc 47941 tgacatctcc cccttctccc tcattctggg tggcgacact ctccccccac ctgctcccag 48001 tcagcctgct ccagaaggtg cagctgtgtc cctgtgcccc agagggaaag ctgacagttc 48061 gaattttggt gggcttgtaa agaaataaag aggggccatt ttggagaagg ccattttact 48121 tcgggcgttt taattacata gctgaggcca gaaagcaatg cctcggccag ggaaggacag 48181 ctgtgaaagt ggaaggagag ggaagtgggg tgtggtggga agggacattc ctgcaggcct 48241 ccgctgtgga ggaccccggg taggtggcgc agggcgcggc tggcggaagg cgggccaagc 48301 tagtacagcg tctcgcgggc gtgggtgcgc aggtggcgca gcaggtggga gttgcggctg 48361 aagctgcggc cacactgcgt gcaggagtag ggcctggcgc ccgtgtgcac cagcaggtgt 48421 cgcagcagat tgcagctgcg gctgaagctc ttcccgcact ccacgcactc ctgcgggggc 48481 tcggcctgct cctgcccggg ctccgcatgg gtagccaggt gccgccgcag atgcgcgttg 48541 cgccggaagc tgcgaccgca cgtctgacag ctgtagggcc gctcgcccgt gtggctgcgg 48601 cgatggcggg ccaggttgga gctattgcgg aaacggtggc cgcaggtgtc gcaggcgtgg 48661 ggcttctccc ctgtgtggat gcgctggtgg cgcgccaggt gggcgctctg gctgaagccc 48721 tcaccgcact cgctgcagcg gcagggcttc tcgcccgtgt ggctgcgctg gtggcgggcc 48781 agatcctggg tctggccgaa actcttcccg cactgggtgc agtggtgggg ccgagggcca 48841 ccgtgggtca gcaggtggcg ggcaaggctg gcccggcgca caaagcgctt cccgcactgc 48901 ggacaggcgt agggcttctc gcccgtgtgc acccgttggt ggctgaccag ctgcgagctc 48961 tgcgtgaagc tgcggccgca agcctggcag gagaagggcc gctcgcccgt gtgcaccctc 49021 cggtgggcca ccaggtgctc gctgcgccgg aaggccttgc cgcagtcgct gcacacgaag 49081 ggcctccggt cggagtcccg gcggcggctg ccggagcctt cggaggaccg gcggtccttg 49141 tccctggcgt ggatccgcag gtggcgcttg aggctggagc ggcgctggaa gctctggccg 49201 cagtgggagc acaggacatc ggtcagtggc ggcgcttcgg ccttactctc aggagcgcag 49261 ggcggcttct ggtcctgggc gtgcgccagc aggtgctgca caaggctggc gcggcgctgg 49321 aagccgcggc cgcactctgc gcacaggaag gcgggttcgg aggagtgggt cagcaggtgc 49381 ttgctcaggt gcgagctctg gcggaagcgg tggccgcaca ggtggcaggc gtgcggccgc 49441 tcgtccgtgt gagtgcgcat gtgcagcttg agaatggagc tgcggccgaa gctcttcccg 49501 cagcaaaggc acaggaagga gcgcccagcc gggtgcgagc gcagctggtg cgccttcagg 49561 cgagacagct gcgggaagct caccccgcag tccgcgcaga tgaactgcaa ctccggattg 49621 ggctcgggga cgccctcagc cactttgacc tccaaaattt cactgctgcc aagaagggac 49681 ccatcttcac cagggccgct agccgcagcg ccccgtccga gcgactgggc gcaaggctct 49741 cccggcaccc caggactatc tgcctgggac tctgctaaga tgggagtggg ccaggcagcc 49801 cctttgggct cttcttgttt aaattcctcc ttgtctgggg ttctgcttcc gaatcctggg 49861 aaacaagaca aaacagggac ggtcaggcct attcccaggg ccactatcaa tcaccagaac 49921 ctgattggcc tggaagtgga aagaaaccca ggatcctgca aagctggtag gtaaaaggcc 49981 gggcacagtg gttcacgcct gtaatttcag cactttggga gcccgaggcg ggcggatcac 50041 ttgaagtcag gagtgagttc aagaccaccc tggccaacat ggtgaagccc cgtctctaca 50101 aaaaaaaaaa aaaaaaacca aaacaaaaat tagctgggtg tagtggcgca tgcctgtggt 50161 cccagctact cgggaggctg aggcaggaga atcgattgaa cccgggaggc ggaggttgca 50221 gtgagcggag atggttccac tgcactccag cctaggcaac acagtgagac tctgtctgaa 50281 aacaaacaaa caaacaaaaa aacctggtag gttggtagag agggtggagg tcaaacaagg 50341 gtgccctttt atttgacctc tctacaaaaa gttagctggg cgtggtggca cacacctata 50401 gtcccagcta ctcgggaggc tgaggtggga ggattacctg agcccaggag gtcccggctg 50461 cagtgagcca tgattgtgcc acagtctggg cgagtaacat cctgtttcaa aaaacaaaaa 50521 ggccaaaaag tcaggtaggg tctggagctt caggcaaaga aagaggtgtt ggccggtcac 50581 ggtggctcat gcctgtaatc ctagtacttt gggaggctga ggtgggagga tcacttgagc 50641 ccaggagtgg agactagcct gggcaacata gcgagattcc atctctttaa aaaagacaag 50701 aaaagaaaga ggcatcagtg gtcacttctc cagggaaggg ttgcttaccc agggggtgtg 50761 caggccacgc cttattctct ggcacatcct caaaggtcag gcactcctgt atcaagagca 50821 gaaacccttg gtgagtgagg aacatgcctc aggaacctgt ggggactgaa atgcttccta 50881 ggactgaatc agggaaggta tggatttgct aaaactacag ggtagagaag ttcctggggc 50941 ttgaggacac cctggggtgc ggcccagctc accagcacag ccgccagctc ctgatctcgg 51001 gaactctcct ctggccatgg tgaggggccc tggggacctg acggcattgg ggaagagggg 51061 aggaagtgct gttagaaggc agccctgcct gccatatggc tgcaaggaca cacaccgcat 51121 catccgggcc ctgtgaagac accttggcgc tctcctgtcc tggtcacctg gggaggcccc 51181 acttctcacc ccaccccaga catccaggcc cctccgcgac acccagtcct tggcctccta 51241 gggcacaagc tcctcactgc tttcttgcag ggcctggaat gtcttctggg gccccgggct 51301 cagcggctgc tttgaacttg ggggaagcct ccactgtccc ggctcagcag gctgggcagc 51361 ccttggctgg ggtcggggag gctcatccga tgggcccagc actgaaggct cttccgcagg 51421 caattccttc ttggggctgt ggctggggac ctgggaagca cacccctttt cctccaaggt 51481 gacgtctgca cggggacaac tcttgccagc attacaacta aaatcctgtt caaaacacat 51541 tcaagttaag tgactcccgg ggtggcaacc tgtgtcctga ctccgagggg acatgggaca 51601 gaaacgaggt tagagtcagc caggggagga gttaggaggt gaagaaggaa tggtcaggtg 51661 ggggaaagtg aagtttccag agattcctca ctggggggat caagtcccca tcatccgcat 51721 gctgctcagc ccgctgcccc accctctcac cagcggcccc gcgtggctgg gctcccggtg 51781 gatgccctcg agcagcagca ccacctcctc cccatccctg agcggctgcc cctgcaggcg 51841 gcccaggagg tgcggaggca gcacactcag gaactgctcc agcaccagca gctccaggat 51901 ctgtttcttg gtgtgcagag ccggccgcag ccagtggccg cagagctccc ggagccggct 51961 cagggacgcc cgtggcccca tgtcctcctg atactggaag catctgaaca gctggtgagc 52021 cacctcgggc ctcagcctgg actctggtcg cctggggtcc tctgggctga cagcctcctc 52081 ctcctccagc ttgacttccc ccagctgctc ctgctccacg gcagctggga ctgattctcc 52141 aagcatcctt cactgcgggg atgcctccct aacgccagcc ccgctcttgg gtctctctcc 52201 tctcctccca cacctgcgaa ggtgaacacc ctgggttagg atctgctgcg aggaagctgg 52261 ccccatacct gggtcatcca tgttaaaaga cacccaatac gcagacccag aggaatcgct 52321 tttctcactg ggtaatggct cacgagggca ggacagcccc cggtctcagg gttaccaaca 52381 aagcaaaggc cagagaaaac tgacagttct gtattatcat ccctccttaa agcatttaga 52441 gatttacatc acttggccgg gcacagtggc tcacacctgt aatccgagca ctttgggagg 52501 ccgaggcggg tggatcacga ggtcaggata tcgagaccat cctggctaac gcggtgaaat 52561 cccgtctcta ctaaaaatac aaaacaatta gccaggctaa tttagtccca gctgctgggg 52621 aggctgaggc atgagaatgg tgtgaacctg ggaggcggag cttacagtga gccgagatcg 52681 cgccactgca ctccagcctg ggtgacagag cgagactcag atcaaaaaaa aaaaaaaaag 52741 aaatttgcat cacttttccc aagtgattaa agtgaccatc cctgaaagag aaacaccctg 52801 cccttccctt cttcctgatg tagccactgg gagcacatgg caccatcaag atggatcctg 52861 gccaaaaaag tttgtttttt aaagcttttt tttttctttt tttttttgag atggagtctt 52921 gctatatcac ccagactgga gtgcagtggc atgatctcag ctctctgcag cctccacctc 52981 ccaggttcaa gcgattctca ttcctcagcc tgggtttaca gtagctggga ttacaggcct 53041 gtggcagcac gcccagctaa tttttgcatt tttagtagac acaagagttt caccatgttg 53101 gctgggctgg tcttgaactc ctgacctcaa gtgatcagcc cacctcagtc tcccaaattg 53161 ctgagattaa ggcgtgaacc actgcaccaa gcctgttttt taaattttta atttaattaa 53221 gtaatttatt tatttttttg agacagggtc tcactctgtc acgcaggctg cagtgcagtg 53281 gcgcaatcac ggctcactgc agccttgatc tgctgggctc aagtgatcct cccgcgtcag 53341 cctcccgagt agctgggact acaggtgcat atcaccatgc tgggttaatt ttatttattt 53401 tttgtagaca cgagttctca ttatgttgcc caggctggtc tcaaactcct gggctcaagt 53461 gatccacctg cctcagcctc tcaaagtgct ggaattacag gcgtgagcca ctgtgcctag 53521 cccaaaaata cctaatctca atccaaccag cctttattta tttatttatt tgagatggag 53581 tttcactctt gttgcccagg atggagtgca atggcaagat ctcggctcac cgcaacctcc 53641 gcctcgcggg ttccggtgat tctcgtccgt cagcttcctg agtagctggg attacaagca 53701 cgcgccaccg catgccgcta attttgtatt tttagtagag atggggtttc accatgttgg 53761 ccaggctgct ctcaaactct taacctcagg tgatccgccc gcttcggtct cccaaagtac 53821 tgggattaca ggcatgaacc accccgcccg acctaatggt ggcctttaga tccaacttcc 53881 aatgtttagg gccttgcagg gaatatggaa caagtcagac catctctcaa agaaattggc 53941 ctgagttact cagtgtcata aaaagtaagt gcctgtagga ggcgctggtg gacaggagcc 54001 caaagagtcc tcacaactaa gtgggtgaac ctagattgga tcctcattca tttttcttaa 54061 aggctttaaa agacatcctg gggccagtcg gaaaatggtg agtatggtgg ggatactaga 54121 tgataacagt atgactaccc atttctgaga tgaaagtcta gtcttgtaat atataggaaa 54181 tggctgagtt cttagaagac gctgaagagt ttggagaagg ggcctgatat atgagataca 54241 tatatgaatg ttcatttcac tgctctttca agttctctgg atgttttcga attttcaaaa 54301 tgaaaggtta gcggggctgg ctcacgcctg ttatcccagc actttgggag gctgaggcag 54361 gaggatcact tgagcctcag gagttcaaga ccagccctgg gcaacatggc acgacccccg 54421 tctctaaaaa aattaaaaat gtagggcagg gtgtggtggc tcacacctgt aatcccagca 54481 ctttgggagg ctgacgcagg cagatcacaa ggtcaggaga tcgagaccat cctgtctaac 54541 atggtgaaac cccgtctcta ctaaaaatac aaaaaattag ccacgggtgg tggcgggcac 54601 ctgtagtccc agctactcgg gaggctgagg caggagagtg gcgtgaaccc cggaggcgga 54661 gcttgcagtg agccgagatc atgccactgc actccagcct gggcgacagc gagactccgt 54721 ctcaaaaaaa aaaaaaaaat taaaaattta gccagttgct gggggcactg agtggctgac 54781 gcctgtaatc ccagcacttt gggaggccga gggtggatca ccaggtcagg agatcgagac 54841 catcctggtt aacacggtga aaccccgtct ctaataaaaa taaaaaaaat cagtcgggcg 54901 cagtggcggg tgcctgtggt cccagctact caggaggctg aggcaggaga atggcgtgaa 54961 acagggaggt agagcttgca atgagccaag atcgtgccac tgcactccag cctgggcgac 55021 agagccagac tccatctcaa aaaaaaaaaa aaaaaaattt agccagtcat ggttgtgtat 55081 acctgtggtc ctagctacct gggaggctga ggtgggaaga ttgcttgatc ccaggaggtt 55141 gaggctgcag tgcagtgagc tgtgactgca acattgcact ccagcctagg cgacagtgcc 55201 agcccgtctc aaaaaataaa aaatataaaa taagtcatct atagccacca ggaggcactg 55261 acatcctccc ccaacctcct cccaatctag ggctgtgggg cttccccttc ctggtctggt 55321 agcccagtac tacccatggg gaacctccga aatttcggcc ccacccacag gggactcagt 55381 tcagccctgc tcaactaatt gaggctggat gatttaagct tttctgagca gagcagagca 55441 ggctcatatt tttgtttgag tctcttttac attttatttc aggcaggggt cattttatag 55501 acatggaaat ccaggctcct gccctctgct ttctgccctt cctgttctct agcctcaggg 55561 ctacaagtga acctctccta gagcctccca ccctccactc ccagttggca ccccaggtag 55621 ggggatgtga agaaagcctg aagcaccact ccactcacct gaatggcaca gggcactgtc 55681 cagcatgcag tggctggcac acagtaggca tccatgagca cgagctatta ctgttctcat 55741 tgctgctgat tacaacacca agcaccgcag caaacagtga ctcacagttc tgttgggccc 55801 atcactgacc cattctacag ccgaggacac taaggctcag caaaaaggta acccagtggc 55861 cgagtgcagt ggctcaggcc tgtaatccca gcagtgtggg tggccgaggt ggatggatca 55921 cctgaggtca ggagttcgag accagcctga tcaatatggt gaaacccttg tctctactaa 55981 aattacaaaa attagccgga cgtgatggca tgcgcctgta gtcccagcta ctcaggaggc 56041 tgagagagga gaattgcttg aacctgggag gcggaggttg cagtgagacg agatcgtgcc 56101 actgcactcc agcctgggcg acagaattag actctgcctc aaaaaaaaaa aaaaaaaaat 56161 tatccgggca tggtggcatg catctgtggt cccagctgtt cgggagactg aggcaggaga 56221 atctcttgaa cccaggaggc agaggttgca gtgagctgag attgcatcac tacactccag 56281 cttgggcaac agaatgagaa tctgtctcaa aaaaaaaaaa aagtaactca gccttgggca 56341 gcctctccct ggtaacgatg gcctagttga gagccttgga cagtttgctg taccccctta 56401 atatgagcct gagggttccc tgaacctcct caggtggggt ggtgttgtgg tacagaatgg 56461 ggggatttta agcttagccg ctgagaagga ggggaggtga ttctagaaca tgacccatct 56521 ctcccaacct ccctgcaaca cacagtatct tccaagcccc tcacccctac cccggctccc 56581 ctgcatctcc caacctcagc cacctgggga aggactctga caaaatggaa cgggaagttc 56641 cctgccccag tcccatccca tccaggtgtt gaggggcttc ctagcgttaa ccctcaagct 56701 cccagtttgg gaggtggatg gggacagggt ggggccagat aagcgaagac aaagtgagaa 56761 atccagattt caacccctgc atgtgtgaga gtgtaaccac ccccgagccg gaactacgct 56821 ttcctgccct ctggcatcca ctccaggcag cccccatcct cccaccccat ctctccccac 56881 tagggtgatg gagtgcctca atcacagatg agcgtatttc tctgcctggc ctgggagtag 56941 gagctgtgtc tagttcaatg agtgttcatc cagcacttga agttgcttca gtgtctgtta 57001 gcagaaatta gggaatggca gtgattctgg ccatctcgaa ggaaagcatg taacaacgtc 57061 taggacagaa aaagtctgtg ataaatgatc agaaaaatgg tgttgtgagg ccggatgcgg 57121 tggctcatgc ttataatccc agcattttgg gtggccaagg cagaaggatt gcttgaacct 57181 ggggggtcaa agctgcagtg agccatgttc gcgccactgc acaccagcct ggataaaagg 57241 cgaagaccct gtctcaaaaa aaaaaaaaaa aaaaagaaag aaagaaagaa aagaaaagaa 57301 aagaaaaaaa aagagtgggg gagggggaag aaaatgggat tacgtttcac tggtttttag 57361 tagcatgcac aaaataacaa gaaaactaat cactgctcat ccattgccgt gcaggagcag 57421 cgggcaggta ctgagtgtct ggcgattaag ctcatttcat cccagtccag cccacgcacg 57481 aaacaatcac ttctcccttt gccaagtgca aagaccaagc tttgcacttg gtgacatgct 57541 cagggtcaca ccaggtggag ccaaggtttg ctttaggctg agagtgtagg tctagaacac 57601 agtgccaaag ctctgaaaca gaaaaggaac tgatctccat cctgctctgt gccattttcc 57661 tagccttagc cgaaatatca cagcatgaaa cagaaaaaaa atgaagattc caaatttcag 57721 ctctcagggc aagccacaca gggaggccgg aatctccagg tgagatctct ccaaaaggtt 57781 tcaaagaggt ggaattcgag acaagttggc ttcagagaat ctctccctcc ccccagatct 57841 ccaggacaaa agctctgtta atatgcacct ctgttggggg tgggctgggt agggaggcag 57901 gcagaggctc tacagccacc atccaacatt tgaccagctt aatttggagc ccggggtgga 57961 ggtgctgcca cgagacccac ccaggtcacc tgagagccca gagttgggga aggcggcaaa 58021 ggtggggtcc tcagccggga acactccctt tctccctgga cacagttggc agctgcctga 58081 tttgttgttt gttggagggg cctccctccg cgtgctgggc tgagtggaca aggcccatca 58141 gtccacctgg gagcacgtga agccagcctt cttcaatgtc acaggaagtt ctgcgatctc 58201 ccgcgtcccc aacgccccca tcgcgacagg tccaccaagc tggctggcac gaggctcccc 58261 agcagaccct ctacatcctc ccaagactct tccagccccc accccagcac caaggcctgg 58321 agcctcccct gctctctcca agctgctgcg agctcctagc acccgtcctg gcagcctgca 58381 ccccgacctc cctccaactc accactcgga gtccccacta cacaggcaga gatgctgggg 58441 ggttttctga ggactccagg gccaaagcca gcctccgcca gcctggggaa gggctctggg 58501 aaaatggaac aggaagttcc ttgccccagt cccatcccat ccaggtgatg aggggcttcc 58561 tagagttaac ccttaagctc ccagtttggg agcgggatgg gaagagggcg gggccagtga 58621 catgaagaca aagtgagaaa tccagatttc aatcccagcc tgtgtgagag tgtaaccacc 58681 tcccaaatca agactggcct ggtgcccctt gcaggtctgc ccgtctttct tttgggctaa 58741 gagctgccat ttttaaatgc ccacagtgtg ctggcttctt tgctaggctt ttcattcacg 58801 tgactactct ttgcaatggc ctcctgtgtg ccaggcactg ttacaaagca ggggctaagg 58861 acacagagag aaactgaagg aatgaggccc tgctctcagg ggcttacatt ctagggcaaa 58921 ggagcagcga cggtgctagg aaaaaatgga agaggaggat gtgatggggc agagtttgct 58981 ccatctacac ggtcacctaa agtcctccca agtcctgcca ggtggctttt gtcccaactt 59041 taccgatgac gaagcaaggg ctccccggga gggaagtgac ctaaccaaca gcgtttggcc 59101 ccaaactaat tccatccctg tgcttaaggt gccttcttgg gttgtgccct tggctgcagc 59161 tggtgccttc ctgagcgttc tgagcccccg gggagccagg accacctctc aggtgttggc 59221 cgttcccccg gcctagaccc ctcggtccgt tggggccagg accctcctga ctggctcatc 59281 tgacgggtct ttgttttcct gcacttaaca cctgagctca tcctgttcct acttcagttg 59341 tctagtgctc ccttcgtacg acggtctttc tccttggtct gctctgcatt ccccgcgtcc 59401 cgcgcctcgc tctcccagtc gcgggctcct cccactccat agtttgttca ggggcggggc 59461 cagaaaaccc cgcccccagg gaagtgggcc tgaaggccaa ggggcggggc ggggttctgt 59521 agggacggag ggggcgtgtc cggggcgggg cggggtgctg tagcgccgga gcgggtgtgg 59581 ccttggcggg accaggcagc cattacagat ctcggaagcg agaggcgtgg ggcctgtctc 59641 atcatgcccg gttcattttt tttttttttt tttttttttg tagagatggg gtttcatcat 59701 tttggccagg ctagtcttga actcctgaac tcaggtgatc cacctgcctt ggcctcccaa 59761 agtgctggga ttacaggcat gagccaccgc actcagcctc tttaatcacc tgaggtcagg 59821 agttcaagac tagcctggac aacatggtga aaccccatct ctactaaaaa tttttgtttc 59881 ttttaaaatt gtaattttcc caattccact tattttattt tattagagac aaggtcttgc 59941 catatctccc aggctggact caaactcctg ggttaaagtg atcctcccat ctcggtctac 60001 caaggagctg ggactacagc tgggactcca tgccatcgag cgttctttcc ctctcgttgc 60061 tgaatgattt tcctcattac atcagttatt cattctgcta tgggtagcca gtgggcagtt 60121 ttcagtttgg ggctatctct agtggttctc atatgtctct ggtgaccata tatgggacag 60181 cagtctgaag taggtgaaga gcagagtgag aggttaggtt tatctttggg aggtaaggac 60241 tgcacgtgtg tactccgagg tctgaggcca ccgaggaagc ctcagggact ggcctccgtg 60301 tagagctgct ctcactgccc tgggcattgg ggacaaggcc aggacaagca ggaccccgtg 60361 cgtgtttcct cgaatggctt atggtcgctt cagaaagaca ctgggtgttt gttggcgttc 60421 gggctgccag ggccatcgaa tccggttgtc tgccagggtg tgcaggggga gtggcgcttt 60481 tcaggtgtag gaacagctct ggaaataaac gagggatgga aaaaggagac agactaggcc 60541 ctccacagct cagctgcatg tgtccttggc gaagtcactt ttcatttgaa aaccatcact 60601 gcctcctctg taatgtgtga acgatgatcc cacctcccag atgtattgca ggagagggtt 60661 gtagaagaga aggtttgcaa aagcactgag acctgaaaga taataaacgt gtatgaatcc 60721 gtacctgacg ctagggggcg ttaaagagag tgtggttccc tactgggttg gggtcaactg 60781 tgcggagtgg gacggggaac ctttggagta gtgtccttca agagaagtgt ggtgccaccc 60841 acagccgcaa tttaaaattt tctggtagcc accattaaag aggtaaaagg aaacagatgc 60901 gattaatttt cttttttttt tttttttgag acggagtttc gctcttgttg cccaggctgg 60961 agtgcaatgg cccgatctcg gctcaccaca acctccacct cctgggttca aacaattctc 61021 ctacctcagc ctcccgagta gctgggatta caggcgtgca ctaccacacc cggctaattt 61081 tgtattttta gtagagaagg ggtttctcca tgttggtcag gctggtcgcg aactcccaac 61141 ctcagttgat ccgcccgcct cggcctccca aagcgccggg actacaggca tgagccaccg 61201 cgcccggcca gcgattaatt ttaataatac gatttagtta actcgttata tccaaaatat 61261 tatttcaaca aatacagaaa tttgctaatg agaggctgca cattcttttt tctcataaaa 61321 gtcttcaaaa tccagtgtgc attttacagc gagagcacgt ctgagctccg actcgcccca 61381 tttaaagcac tcaatagcct tttgtggctt tggggcggtt ctgggacctc agagttgtgg 61441 tgtttcttgc ggggaaagac tattgtggct gctggagggc ggaagacgaa taacaatagg 61501 cggagtgcag ggcactgggt tgccttagga cctcgccttt cccctggaga acacagctgc 61561 ttcctgaaat gatcaggaac ctaaaacctc tgcagggaag aaggggagat gggaagtgag 61621 gaaggagccc cgggggacac cggggaagtg agtaaatgga gccggggcag cttgatttgc 61681 agcaaattta tgattttaaa ggcccggcaa ggtcaggcac cgtggatcac tcttgtaatc 61741 ccagcacttt gagaggccaa ggcaggcgga tgacctgagg ccaggagatc aagaccagcc 61801 tggccaacat ggtgaaaccc ccgtctctac taaaaataca aaaattagct gggcgtggtt 61861 gcgggcgcct gtagtcccag ctactcggga ggctgaggca ggagaattgc ttgaacccag 61921 gaggcggaga ttacagtgag ctgagatcgc accactgcac tccagcctgg tgacacagcg 61981 agactccgtc tcaagaaaaa aaaaaaaaaa aaaaaaaaag gtctacccag aaggtaggtc 62041 actcccagtc agcagaattt ttgttgattt gggatttggg agggtgagca cttagaacca 62101 cagtccaggc cagtgcagtg gctcacacct ctaatcccag cactttggga ggctgaggtg 62161 gaaagatctc ttgagcctag gagtttgaga ccagcctgag caacacagtg agatcccgtc 62221 tctacaaaaa attttaaaat tagctgcaca tggtggtgcg tgcctgtagt cccagctact 62281 caggaggctg agtcaggagg atcacttgag cacaggtggt tgaggctgca gtgagccatg 62341 atcccaccac tgcactccag tctggaagac agagccagac cctgtctcaa aaacaaacac 62401 acacacacac acacacaaac aaacaaaaaa aagaaccatg ttccagctgc aaagactgaa 62461 cagtgctagg aaagtatttt ggtccctata caggccacag aaaaagagag tgttgaacct 62521 acatccatcg tttgtcctgg gggtggtccc agcatggctt gttcaggacc gtcttggccc 62581 tgctctgaga cccctataag gagtgaggcc cttctaacta gttatgcatc ttagttttct 62641 tactgcaagg tggtgccaaa gcatatgatt ctgaataact cccagcataa gccaggtgca 62701 atggctcaca tctgtagtcc cagctccatg gcaggctgag gcattggctc acttgagccc 62761 aggagttgga ggccagactg agcaacatat caagactctg tttctaaaaa gaaaaaacaa 62821 acatacaaca taaaatattt gcatgtcagt tccttctcaa aggaccttga actttctgct 62881 ttgtgattct ctctgaaagc tctattgcca gtagcaatct tcaacgtgat tgcttaaaga 62941 gtggcaatac atcggttttt agtatttaca tattattgat tccagcaaga cctgcagaga 63001 gttccctgtt tggggcattc ccctttgcaa gtgcagaatc gtgtactgca gggaaagcca 63061 gggcactggg cacccccaag acataataga gtttccacag cgccaactgc ttgtaggatg 63121 ctcatttggg caattctgct attacttaga aatatactat taagaaggaa gtagggctgg 63181 gtacagtggc ccacacctgt aatcccagca cttggagagg cggaaaaatc gcttgagccc 63241 aggagtttga gaccagcctg ggcaacatag tgagaccttg tcactacaga aatttttgaa 63301 aaattagctg ggtggccggg cgcagtggct cacgcctgta atcccatcag tttgggaggc 63361 cgaggcgggt ggatcacctg aggtcaggag ttcaagacca gcctggccaa cctggtgaaa 63421 ccccatctct actaaaaata caaaaattag ctgggcgtgg tggtgtgtgc ctgtaatccc 63481 agctacttgg ggggctgagg cagtagaatc acttgaacct gggaggcaga ggttgcagtg 63541 agccgagatt gcactactgc actccagcct gggtgacaga gtgagactcc atttcaaaat 63601 aataataata agaagaataa aataaaaata aataaatgaa aacatgcccc atcattaact 63661 tttttttgtt ttgttcagca ttcctgtcct tccgggcact actagattct ccaggctcat 63721 ctctcacatt tcttaccaga gtcctagaat tggccattta tcgaaggagc cctggttctc 63781 atccttaatt ggtttcttta aaaaaaaaaa aaaaaaagta taaagcgttt acaaaaacag 63841 aattttattt atttagagac ggagttttgc tcttgttgcc caggctggag tgcaatggcg 63901 tgatcttggc tcactgcaac ctctgcctcc cggattcaag caattctcct gcctcatcct 63961 ccggagtagc tgggattaca ggcatgtgcc acgacgcctg gctaattttg tatttttttt 64021 ttttaataga gatggggttt ctccacgttg gtcaggctgg tctcaaactc ctaaacttag 64081 atgatctgcc cgccttggcc tcccaaagtg ctgggattac aggcatgagc caccgtgccc 64141 ggccacaaaa acacaatttt taaaagaacg agatggaagc atggaagttc tacatttttg 64201 acacccaata gagtgagaag aatgtgatag ggattcgtgg gaaagaatgg agaggccagg 64261 ggtggtggct cacgcctgta actacaggca tgcacctgta gttccagcta ttggggaggc 64321 caaggctcga gaatcacttg aaaccaggag gcagaggttg cagtaggcca agctcgcacc 64381 actgctctcc agcctgggcg acagagtgag actctgtctc aaaacaaaca aacaaacaaa 64441 tacccatctc tactaataca aaaattagcg ggcatgttgg catttgcctg tagtcccagc 64501 tactggtggg cttaggcaga aggatcacct gagcctggga agtcaaggct gccatgagct 64561 gtgatcatgc cactgcactt cagcctgtct caaaacaaac aaacaggccg gacgtggtgg 64621 ctcacgcttg taatcccagc attttgggag gccaaggtgg ccagatcacg aggtcaggag 64681 atcgagacca tcctggccaa catggtgaaa tcacatctct tctaaaaata caaaaattat 64741 ccgggtgtgg tggcgcgtgc ctgcagtccc agctactcga gaggctgagg caggagaatc 64801 gcttgaacct gggaggcgga ggctgcagtg agctgagatt gggcactaca ctccagcctg 64861 ggcgacagat caagactctg tctggaaaac aaaagaaaac aaaacaaaag aagcacctgg 64921 tgttgttgcc tagggatgct cttcctgtcc tcccggggtt tgttcaagac gggaataggc 64981 tgtttttccc aactgtgaga ggcaggctgc ccaggctgag atttgtgagc caggtttagg 65041 ccagaagcct ggggctgagg acactcctgc gggagcgtga ggatagcctc tcttcccgcc 65101 ctatttgcaa gctcagatgt ctgttcattc actagaaagc caccaaggct ccatggtagt 65161 agctgttggg gtggttcctg ttactgcaag gcaagggtgt caagacgtgg gtggtgagca 65221 gaagcctggt taaggaaatt gtaacagact catacaagag aatgctacct agtttaaaaa 65281 aaaaaaaaaa gagcaaggca acgatgcatt ttttttctat ttttctcctt ttctctatgt 65341 tttttatttt gaacaatctc taagatatat gtcacccaca cagagggggc aagacccagg 65401 tagaccatat agtatagtcc cattttttaa acaaaaccaa acacccaaag caggggagtg 65461 tgaggccgac cctcggccgg ggaggctgga cccgggcgcc cgggcggggg ttggcgcctc 65521 atggatcggg atcctcgccc gggagggtgg actctccctt cattaagacc cccaggtccc 65581 aggcatggag gaggcctcgg cgcttccccg tggaggcctt cgaccaggca gcctcagttt 65641 ccccacgggc gaaagccgac tcgagggtgg ggtgccctcg ccccgcctgc tcctcacggc 65701 gattgccccc gcgccccggg ctggacaggt ggggcgccgc gaaaaggccc ggcaagccgg 65761 aagttgcaat ggggcagccg gacgcaaaac gcggggacac cggccgggat ccgccaagct 65821 agggtgagct gccgatcggg gacgcgaacg ggagcgcgca ggtgagggcg gcggggcggg 65881 gctggggcgg gggcggggcc actgaggctg cggccaatca ccggcgcctc ccgtgggcgg 65941 ggcggggccg ttgcttgggc gacaggctcc agcgtctgcg ttctgtgaac ctagccgctg 66001 ggctgtgcgg acaggttcgc gtggggctcc tgggcctccc gcagccgcgt aggtttcggg 66061 aaccgggtcg cctccctgcc tccggattct cacgcgctgg tgtccggtgg ccctgggaaa 66121 tcttttcttt taacaatgtg tattccttgg gtcctaccta tctaacactt ctgctggttt 66181 ctttttcttc tcttttagag ttcaatattt gctctgctct tttttgttgt ttgtttgttt 66241 gagacggagt ctcacactgt cgccctggct ggagtgcggt ggcgcgatct cggctcactg 66301 caacctccgc ctcccgggtt caagcgattc tcctgcctca gcctcccaag tagctaggat 66361 tgcaggcgcc cgccagaacg cccagctaat ttttgtattt ttagtagaga tggagtttca 66421 ctatgttggc caggctggtc tcaaactcct gacctcgtga tccgcccgcc gcggcctccc 66481 aaagtgctgg gattacaggt gtgagccacc gcgccgacct gctctggtct ttattattct 66541 tatttttctt taccttgctc agttcccttg ttttccccta tcgtcgtctc cttccttcct 66601 tcctaaggct ggctacatct ccctgtctcc aacctggctt agccacctta attagcacca 66661 attattttgt tcttacttca gatgggtctt tttttcccta gtttttgtta ttccttaagt 66721 cttgacttaa ctgtcaggtc gatctctgag agcagatcat gatttattta ttgaaggagg 66781 agtcctctcc ttactcaggg tcagtgaagc cgccagctac aggagttggc atttattatt 66841 taccatgctc gaagctctgc taagcggctt tgtgcattat agtcaattct ttctttcctt 66901 ccattttttt tttttttttt gagacggagt ttagctcttg ttgttcaggc tggagtgcaa 66961 tgaatggcac catctcggct caccacaacc tccgtctcct gggttcaagc aattctcctg 67021 cctcagcctc ccgagtaact gggattacag gcgtgcacca ccatgcccgg ctaatttttg 67081 tatttttagt agagacgcgg tttctccatt ttggtcaggc tggtctcaaa ctcctgacct 67141 caggtgatcc gcccgcctcg gcctcccaaa gtgctgggat tacaggcgtg agccactgtg 67201 cctggcccca attatagtca attctttttt tttttttttt ttttttttga gatggagttt 67261 cactcttgtt gcccaggctg gagtgcaatg gcgtgatctc ggctcaccgc aacctctgcc 67321 tcccaggttc aagcgattct cctgcctcag cctccctagt agctgggatt acaggcatgt 67381 gccaccacgc ccggctaatt ttgtattttt agtagagatg gggtttctcc atgttggtca 67441 ggctggtctc gaactcccta cctcaggtga tccgccagcc ttgacctccc aaagtgctgg 67501 gattacaggc atgagccacc gcgcccggcc tatagtcagt tactgagagg taagtacgtc 67561 cctacacaga tgacctgtga gttgaatgca gacagcaggt gtgttttgct gacttgtatg 67621 gaattctttt tttttttttt tttttttttt ttgaggcgga gtctggctcc gccccccagg 67681 ctggagtgca gtgacggtat ctcggctcac tgcaagctcc ccttcccggg ttcacgccat 67741 tctcctgcct catcctccgg agtagctggg aatacaggcg cccgccacca cgcccggcta 67801 attttttttg tatttttagt agagacgggg tttcacagta ttagccagga tggtctctat 67861 ctcctgacct cgtgatccgc ccgtctccgc ctcccaaagt gctgggatta caggcgtgag 67921 ccaccgcgcc cggctggaat tcttaaattt taaaactttg tcatcaattt aaaagtcagg 67981 aggtcacaga ggaatcgaat tgttttttgg cttctcttca aggatttctc aaaggcctct 68041 gaagtccctt ctccctcccg tcttactccc aagccattgg tcccgtatgt aagctggtgg 68101 ctggaggcat ttaagttttt tgtttttaaa tagagttggg ggacgggcgc antggctcaa 68161 gcctgtaatc ccagcacttt gggaggccga ggcgggtgga tcacgaggtc aagagatcga 68221 gaccaggagt ttgagaccag cctggccaac atagtgaaac tgcatctcta ctaaacatac 68281 aaaaattagc tgggcgttgt ggcgggcgcc tgtattccca gctactcagg aggctgagcc 68341 aggagaatgg cgtgaatccg ggaggcggag cttgcagtga gctgagattg tgccactgca 68401 ctccagcctg ggcgacagag caagactccg tctgaaaaca aataaataaa taaatagagt 68461 tggggtctca ttatgttgct gaggctggtc tggaactcct gggctcaagc catcagctca 68521 ccttggcctc ccaaagtgct gggattacag gcacgaggca ctgcacccag ccccgtttaa 68581 gtttttaatg tctgctctac ttactgactc accctgccct gacgcctttc ggtgctcttt 68641 taggggttgc tgttgccaca tggcgactgt ggtcaccgtc tctcctggac ctgcctagat 68701 ccaaaagcca gccctggaag gaacacctct cattctcaag aagaagttaa tgtctgcctt 68761 acagtagatg tccaataaat cttctttgaa caattaaaca gatgctcaga cttcatttga 68821 cctaatttag gtgatcctat ggagggatcc aaacctttgg actcccccag tatccctgaa 68881 gtggcctctt ttgctggtag agaaactttt gcccaagagt cgggacacat cgacacatcc 68941 acttttgggg gggcttggtg cactcctggg acagtgtggg gtgaaagggt ccctttcagg 69001 agtgaattcc ctgaccctgg acactcagct ctcctcctgg ttggcagagt gaagagtgtc 69061 atgatttcca ggacccctcc ccgcttcagc tttcagttca tcgctcactc tggcttgaaa 69121 acagtactga cttgttaagt tgcatgtctc tcttcgcctt gaggcttaaa acacctagaa 69181 gccgggtgtg gtggctcgca cctgtggtcc cagctactga gcaggctgag gcgggagact 69241 tgcttgaagc caggagtccg agaccagcct gggtgatata gcgagaccac atctctaaaa 69301 aatcaaaaac aaaaacagaa cacctggcca ggcacagtgg ctcagcctat aatcccagca 69361 ctttgggagg cagaggtggg tggattgctt gaacccagga gttcgagacc agcctggcca 69421 acatgatgaa accctgtctc tactaaacat acaaaaatta gctggccgcg gtggcacact 69481 cctgtaatcc cagctacctg ggaggctgag acagaagaat cgtttgaacc caggaggtgg 69541 gggttggaat gagccgaaat ggtgccattg cacttcagcc tggatgacag agcgagactc 69601 tgtctcagaa aaacaaaaac aaaacaaaac aacaacaaaa tcccaaaacc agaacacctg 69661 gaaaacacca gacacaaatc ccaggagtaa tttttctttc atcaaattga caatagcatc 69721 aatcagaatc ccaggactcc taaaacatgt ttaactccaa aagcagagtt gcaaactcaa 69781 atgctacagg ggccaagcag gcactgtggc cacctagaaa gtggaccctg aaggtgggga 69841 aggtgggtcc agcatcggta aatgttctaa ttttttttaa agaatcaatt ttataggctg 69901 ggcgcggtgg ctcacgcctg taatcccagc attttaggag gctgaggtgg gcggatcacc 69961 tgaggtcagg agttccagac cagcctgacc gacatggaga aaccccatct ctactaaaaa 70021 cacaaaatta gccaggtgtg gtggcacatg cctgtaatcc caggtaatca ggaggctgag 70081 gcaggagaat cgcttgaacc caggaggcgg aggttggggt gagctgagat tgcgccattg 70141 cactccagca tgggaagaag agcgaaactc cgtctaaaaa aataaaagaa tcagttttat 70201 gagtgtcaat tttttaatgt tagaaactaa gttaaaaaaa aaacaaaatt agctgggcgc 70261 ggtggctcac gcctgtaatc ccagcacttt gggaggccaa ggcaggtgga tcactcaagg 70321 tcaggagttc gagaccagac tagccaacac ggtgaaaccc cgtctctact gaaaatacaa 70381 aaattagccc ggcatggtga tgggcacctg taatctcaaa tacttgggaa gctgaggcag 70441 gagaatcact tgaaccaggg aggtggaggt tgcagtgagc caagaccacg ccattgcact 70501 ccagtctggg ccacagagtg agacttcatt tcaaaaaaaa aaaaaaaaaa attaaaccct 70561 ctgggcatga tggctcacac atgtaatccc agcactttgg gaggtgggag gattgcttga 70621 gatcaggagt tcaagaccat cctgggtaac aaagcaagac ccccatctct acaaaaaaat 70681 ttaaaaaatt agccaggcat ggtggcatgc gcctgtggcc ccagctactt gggaggttga 70741 ggtgaattgc ttgagtccag gaggttgagg gtacagtgag ctatgatcgt gccactgcac 70801 ttcagcctgg gtaacagagc aagaccctgt ctcaaataag taaataaata aacaaacaaa 70861 cccttaaaag cccacacaac atagcacgag ctggtacaga ggcagctgcg cccatcttac 70921 tgtgtgaagc tgtgcacctc aggggattct ggtgagccaa agacagtggc atggggacgt 70981 ttcctaggga aacttcctgg aatctgattc caggactggg cccctggccc ttttcctgac 71041 acccagcccc catgcctgcc caagagaaat ggttccttcc cccagaaagt aaagcacagt 71101 ctctgtcctc tctatttgag gccttacttc tcagggtcta aagggaaatg gtgcctgaga 71161 ggcagaatct gcagctgcca ggcgtgtttg agaacttaga accccattta tagctggatc 71221 acaccctcta tccccaagct gaaatgggcc tggaagaatc ggattaacca acttcttccc 71281 aaggtgtggg gcatccggat tggggggagt tcccagcctg tttcccaagt tcactcttag 71341 ggagagagtt ctgtgccagt cttgggggga cccagagggc ccttgactag gagtccctaa 71401 atgaattcag ggtccctcag gagacaataa ccttcagaaa ccccgagtgc taaaataaat 71461 ggtcctgccc tctccaccct gcaaaatgcc cttctttcta cctgtgagga caactgcagt 71521 gctcgctttg aaatttcaaa taacacaatc tttccacaaa atgcatcttc catttttctg 71581 ttttggtggc tctgccttgc tggcatcatg aacgcgtcct gggtaaagcc atcctcatga 71641 gctggcattg cggctctggg agtcacctcc cggccaagag caggtgcaac ccaggctcga 71701 gactccacac ccggaaccct caaaaccggg aacgctgtag ggctgcggct gcatggggaa 71761 tgggatctgg gagggacttc ctgtcttccc tactcccagt ctccacccaa ctcccccgcc 71821 cgccccgtgc aggctgtgga gactcccttc ccgggggagg gggcccccac tgccgcaggt 71881 gccccctctg cctccacccc ggtgagaagg gggtgctggg gagggcatct ggatcccgag 71941 accggcgcag atgatcagct tgcaggaggg gatggggtca aaggtggaag gcttcaagct 72001 ggcaggagag aagggggcca ggaccgggtt tgggaaggag aaggacattg tctgccagga 72061 ccgagagcag gatgggcatc ccaggcctgg ggcaggtggt tgcagctgca ggggggctgc 72121 tgagagcggg caggttgatt tctgaattcc tggtgggcat gggagtcagg tagggacaca 72181 cggtgtatgt cctggggtcg ggggacagca ggtcctggat taccctggcc ttgacgggcg 72241 gggccccacc agcacctgtt ctgcaggatc tctgcagaac tttctgctcc tgatgacagg 72301 aaatagcgat tccaatttag cctcggcttt ccctgagccg gttacactga gatgcgaggc 72361 tcttgcttca gcaataaagg ggacaataca aaatagcatt ctttcaaaaa taggacccag 72421 aaaacatcac agagagtggg accatcggat aggatctggg agactttctt tccgattggg 72481 accagggagc gctggtactc gtggggattt tcagggctct ttgggggtgt ccaggaagcg 72541 agggctggtt tctgtctgct cttctaggct cctggccaga agctggaggg ggcttggcca 72601 aaaagaggga gaaacaactc agctgttctt tctagctctg aaatagaaaa tgtctgcaga 72661 cggcggaggc atccaggaca cccaggacaa ggagacaccc ccggaggtac agatggggct 72721 ggctgaggga ggtgtgcggt agaagaggct ggtgcggagg agattttcaa ggcacagagt 72781 ctggacccct ggaagagtta gactcactgg ggtggggaga tcaaagagaa ggtggcatgc 72841 tgggtagtac cctctgtctg gaatttacag cataacagtt ttgctttgct ggtgaattct 72901 ctagccagtg gccacaggag atggaagagg tctgggtgga ggtcctatca gtgaggtttt 72961 ctttctgagc agcgctcaga gcacacaggg aaacctgttc tccttccccg tgcttctcca 73021 cctggaacca cagcttctca tctctgtggg tctgagctcc cggtcctttc tccttcctgt 73081 acctgctacc cgtgtgtgtg tgcgtgcgtg gttgagtgtg tgtgtgtaat tactatttgt 73141 gggggacctg ggtctccggg cttggagttg gaggagtatg agtcagcgac aggccagggg 73201 ccagtccctg gcctctggga accagatgtg tcatctgtaa aatgcagacc acatgcctgc 73261 catttccacc tgtctcgcag ggtcactggg aggctctaag gataataggc atgagacgcc 73321 atcatctatg gttttcatgc catagaagca caaagaattg ttagcatggt tcagtttccc 73381 atccacttcc cgtgtgcctt gaacaccttg ggcatgtcag tagagcccca catgttggca 73441 ttctctgctg gcactggctt ctgtctggtg cgcacggcgg gggggattaa tgaccagtgc 73501 caccctgtgt ttatccagga ctttcctgag ctggcagctc cttctccatc attatctgcg 73561 tgaacctcat gaccattctg tgcaggggca gagcagctgc gtcctgaagg ctgcctgatt 73621 ggggaggggt gtcgctgggt ctggcaccca ggcactctga ttcctggcct gaggttcttt 73681 cccttcaaga cccaaacaca gacagggcgt ggaggaagga gtgtgagacc ccagcagaag 73741 tccggccagt tcctgtgctg tggatgaggc cccaccaggg ccctccctga ctttcccaaa 73801 gcttttgagc tatgtggtca cggggcattg tctcatcccc agccatggtg tgttagtctc 73861 ccgtggctgc tgtaacaagt ttccactctc tggggggctg aaaacaacca gaattggctc 73921 tctcacagtc ctggaggcct gcagtccaaa atctaggtgt cagcagggtc acactccttt 73981 tgaaagctct agactaagag gaccctccct tacctcttcc aactccttgg gggctcctgg 74041 tgtccttggt gtgggcatcc ctccagtctc tgcttccttc tctgtgtctc tgtgtcctct 74101 cctattctta taaggacata agtcattgtt agattttagg ctcactctaa attcacgatg 74161 atttaatctc aagatcttta acttacctgt ataaagaccc ttttccccaa taaggtccca 74221 ttctgaggtc cagggaggac acggattctt agaggacact attcaagcta ctacacttgg 74281 tttcttcata cactaataga cagcagtcta cactgctgag gtcagtgtga ggctggagct 74341 gagaaggggt agctgccctt gggcaccacg cccaccggca ctgtggaggc ggcttggtga 74401 atattcctcc tcttgccgcc cgtccttgct ggggtgagct ggatgaatgc agcaaggaca 74461 gtccttcaag ctgcgtcttg catgttggtt tccgacgctg ccggagcgca ctgccatgtg 74521 gccctgggtg tctctcccac tcatctgggt gctgatgggg ctgtcctttc taggttccag 74581 atcgtggaca tcctcatcag gaaatgcctt ctaagctggg ggaggcggta ccttcagggg 74641 acactcagga gtcactgcac attaagatgg agcccgaaga gccacactcc gagggggcat 74701 cgcaggagga tggggctcaa ggtgcctggg gctgggcacc cctaagtcac ggctctaagg 74761 agaaagctct cttcctgcct ggcggaggta ggagagggac ggggaagagg cgctttccca 74821 gggaggcagc tgtggggagg tggaggtttg gcccaggtcc aggtggggct gagggtctct 74881 atgccagcag gggacgctat ccccccatgg cccacagccc cctggcagcg tcagagccat 74941 ccgaccctgt ccttgggtcg ggccaggggt ggagtttgag gcagctgaag tagcagcagg 75001 tccagcagga tgagctgatc accgtgagga ctctctcctc ccacagccct cccctccccc 75061 cggatccccg tgctttcccg agaggggagg accagagacc ggcagatggc tgcagcgctc 75121 ctcactgcct ggtcccaggt gagtggccct tccccggccc ctgcatggta ctcagccctt 75181 cctgcatctg ctggttctgt gtgaaagcca ggaccccgct ggccccactt gcagccaagc 75241 ctgaagcctg gggctcttgc tagtgttgct ggaattgtcc aggctcaagg cccatggtca 75301 ggcctgggcc aaagctggag gagaagcagc aggaattaga gacacagggc ctctcgctgg 75361 tagccgttgg gaaccgtcct ccgaggtctc agctgaggct gggggctgca tggacatcct 75421 aagtcttaac tgtgatccct gaaaatacct cttggtcttt gttctttctc aactccccag 75481 aaggccttgg gtggctcacc tgcttcccga tcagatggaa gtggcaggcc tcactcgaca 75541 cagtgggttg ggagtccggg gtctcaccac acatctcatg ccatggtgtc cctccaccgt 75601 gtccacgtca tgtcctggag agtggatgtt tcacattgtt tcagatgcca gtgactttcg 75661 aggatgtggc cttgtacctc tcccgggagg agtggggacg gctggaccac acgcagcaga 75721 acttctacag ggatgtcctg cagaagaaaa atgggctgtc actgggtaag cactcgcctg 75781 gaggggggac tggggtgtta gggggaggtc ctgctctgcc gtctcctctt tcctcccttc 75841 cctccctctc tctcccaccc ttctccttgt cccttgagac agcactgttc tatggaaact 75901 tctgtaatga tggaaatgtt ctgtgtgtgc taggacggga gccacaagcc atgtgtggct 75961 actgagcact tgaaatgtga gtagagccct tgagttttta attttacttg gttttgtttt 76021 gttttgtttt gagacggagt ctcgctctgt cacccaggct ggaacacagt ggcacgatct 76081 cggctcactg caagctccgc ctcccgggtt catgccattc tcctgcctca gcctcccaag 76141 tagctgggac tacaggtgcc caccaccacg cccggctaat tttttgtatt tttagtagag 76201 atggggtttc accgtgttag ccaggatggt ctcgatctcc tgaactcgtg atccacctgc 76261 cttggcctcc caaagtgctg ggattacagg cgtgagccac cgcgcccggc caattttact 76321 tggttttaat cagctcaaat gagagtttaa atagccgcat atggctagca gctactgtac 76381 tggacgctgc cccaggaagt ccgcttttcc tgggttcatc cgcagggctc actcctatag 76441 ctccatcttc ctttccaagc cccaacttca gaggtgtctg agcagcaagc ttggcttcgg 76501 ggcccgaggg ctgcttgccc tggagagaga tggcatgaac tgaactcccc aggccccgca 76561 gacccttggg tgtcccatct gctttgagga gcaggtgctt gtctccgagg ggtcttggtt 76621 ccctttgttt cttggatgtt aagtatttaa gtcccagagc tttttttttt tttttttttt 76681 tttttttgaa acacagcctt gctctgtcac ccaggctgga gtgcagtggc accatcacga 76741 ctcatgcagc ctcgacttcc tgggttcaag tgatcctcct gccacagcct cctgagtagc 76801 tggggctaca ggtaccacca tgcccagcta tttttttctt tttttctttt tttggtagaa 76861 atgggatctc gctatgttgc ccaggttggt cttgagctcc tgggctcaag tcatcctcct 76921 gcttttgcct cccaaagtgt tgggattata ggcgtgagcc actgtgccca gccatcccag 76981 agcttttgac tcctcaattc cagactttag ttttaatacg gaaaaatcag tattcattta 77041 ttcatttatt cattcattca gttgacgctt agcttttttt tttttttgag acggagtctt 77101 gctctgtcac ccaggctgga gtgtagtggc gtgatctcag ctcactgcaa cctctgcctc 77161 ctgggttcaa gtgattctgc tgcttcagcc tcccaaatag ctggggctac aggtgcccac 77221 caccatgccc ggcttttttt tttttttttt tttttttgta tttttagtag agatggggtt 77281 tcaccatgtt ggccaggatg gtctcgatct cctgacctcg tgatccgccc acctcggcct 77341 cccaaagtgc tggaggcatg agccactgcg cctggccaac acccagctag cacacttgct 77401 agtccaggca gaatccagga ctcagcattc caggtggtgg catcatggtg gtcccaaggc 77461 cccgctggct ggcaccaggg atggtcaccc ctcaattagc tcaggctgct caccgctctt 77521 atggttcccc agtgcctggg ccagcaagcc caggcaagtc ctgagggatc agaaggatct 77581 caacacccat tttcagcacc acggcaggcc tcgcagatgc ctcaagccac ccggttggtc 77641 caacacattc ctttcctccc acgggtggtg gaagccaggg ccccctctgc tctcttactg 77701 acttgaccaa aattcctggg gccacacccc cccaagactt gacaaatggg atgaccttag 77761 tcgttggaga tcctggccct cctgtcccag gtcctcccac ttggagggcc actacccttg 77821 cccctccagg tgtgggtgta ggagcgggat acccgtcccg tgggcgttga tgtagggtgt 77881 tgcccactcc cgtcccttaa actccgttgc tctttcaggg aaagaggaga caggcctagt 77941 cagctttggc agccgccagg cagggggcgg acccccgacc cagggggcct tcttgaaact 78001 gctttatttt accccctgtt tccagagcct caccccctac ttgggcccat cagtttgggg 78061 gagatgctgt ctgcagctgg gcagcgccag aaggcctgtg acagcacagc atctctttct 78121 gggcaggctt tcccttcagc aggcctttct gggcccctca agcgcacggc aagggtgagg 78181 cctcgggctc cagccggcag gcaggagatg agaaggagtg gagaggcgcg tgcacaggtg 78241 agggacgggc gcgcgccttt gtctgcggga gtggggcgca gacgcaggcc ttctggctgt 78301 tgtctgtggg agtggggcag ggccaccaga ccccctcctg ggcggtttgc gcccagggcg 78361 gcccttgcgc tttctggtct ctgagaatcc agcccccacc cttagctctg ccttctccac 78421 cctcctgggg tccttaggga agctcgtccc tagctggaaa cgactttctt tttccaggag 78481 ccgtcgaggt ggggcagagg gtgcagacct catccgtggc agcccttggg aatgtgaagc 78541 ccttcagaac cagggcaggg agagtccagt ggggcgtccc gcagtgcgcg caggaagcag 78601 cctgcggccg gagctcaggg ccggccaaag actccgggca gccggctgag ccagatcgca 78661 ccccggatgc agctccgcca gaccccagtc ccacggagcc ccaggagtac cgcgtcccgg 78721 agaagcccaa cgaggaggag aagggcgccc cggagagtgg cgaggagggc ctggcccctg 78781 acagtgaggt gggcaggaag agctaccggt gcgagcagtg cggcaagggc ttcagctggc 78841 actcgcacct ggtgacgcac cggcgcacgc acacgggcga gaagccctac gcctgcactg 78901 actgcgggaa gcgcttcggc cgcagctcgc acctcatcca gcaccagatc atccacacgg 78961 gcgagaagcc ctacacctgc cccgcctgcc ggaagagctt cagccaccac tccacgctga 79021 ttcagcacca gcgcatccac accggagaga agccctacgt gtgcgaccgc tgcgccaagc 79081 gcttcacccg ccgctcggac ttggtcaccc accagggcac ccacacgggc gccaagccgc 79141 acaagtgccc catctgcgcc aagtgcttca cgcagagctc ggcgctagtc acccaccagc 79201 gcacccacac tggggtcaag ccctatccgt gccccgagtg cggcaagtgc ttcagccagc 79261 gttccaacct catcgcgcac aaccgcacac acacaggcga gaagccctac cactgcctcg 79321 actgcggcaa gagcttcagc cacagctcgc acctcaccgc gcaccagcgc acccaccgtg 79381 gcgtgcggcc ctacgcctgc ccgttgtgtg gcaagagctt cagccggcgc tccaacctgc 79441 accggcacga gaagatccac accaccgggc ccaaggccct ggccatgctg atgctggggg 79501 cggcggcggc gggggctctg gccacacccc cacccgctcc cacctaggag gccaggaaag 79561 ggggagcggg gcgcccaggg ccactggaac agccccactg gagtcaaggc tccgagggag 79621 gagagagggg ctcgggaagg gagctggggc ggtgagggca tggggtgagg catggcgatg 79681 ggggagggcg agggcgagaa agggcaggca ctctgcgaat taaaggcctt ggacttgaag 79741 cgcccgccta cacagctttg tctcctggtg ccctggcgct gattccccga gcgtggggga 79801 gctcctgggc taatcccctg tcctcattga ggcatccccg catcaccact cttggcttgg 79861 gtctccacca gggttggggt cccttttgcc aaaggcccat tccaaagcgt tgcacacatt 79921 ggcaagcaaa ggcttcaagt acagagaggt ttccagggct gaagccagtc agccgctcct 79981 ggacctgggg accccttgcc ggccgtctgc tgctgacgcc acttgtccca agggcactgc 80041 ttactgagcc cgcactctcc cttggctctt ctctttggaa gccaggagca ggcagaactg 80101 gtccggaaat tcctgcctgc cccatctccc actcaggcat cggtggctgg gttttgtgac 80161 tggcctttgc catttgctcc caccatggcc tgcctgtggt cctgcgaggg ctcctcactc 80221 accaggggcc actttccttc ttggcccatg ctgcagctgc tgtcacctgc ctctgctatg 80281 tctcacctcc ttggcctcct ggtatatgtc ctatataatt cgccctgctc gccccttcct 80341 ggctctgagc tttgatttac tcctggtctt tccgctggcg tttgtttttg tttttgcgac 80401 agagtcttgc tctgtcgccc aggctggagt gcagtggcct gatctccact cacggcagcc 80461 tccacctcct gggttcaagt gattctgctg cctcagcctc ctgagtagct gggactacag 80521 gcgcacgcca cgatgcctgg ctaatttttg tatttttagt agagacaggg tttcaccatg 80581 ttagccagga tggtgttgat ctcctgacct tgtgatccac ccacctcggc ctctcaaagt 80641 gctgggatta cacacctaag ccactgcgcc tggccccctg tttttgtttg ttttttgttt 80701 tgttttttga ggcagagtct tgctctgttg cccaggctgg agtgcagagg tgcgatcttg 80761 gctcactgca acctccagct cctgggtccc agcgattttc ctgcctcagt gtcccgcgta 80821 gctgggacta caggtgtgtg ccacatgcct ggctaatttt ttgtattttt agtggagaca 80881 gggtttcacc gtgttggcca ggctggtatt gaactcctga ccccaggtga tccgcccact 80941 tcggcttccc aaagtgctgg gattacaggc gtgagccacc gtgcctgacc tctttccctg 81001 gttttaacct ctgtagctcc caactcccat tctctgcttt tttccctggg tggggatggg 81061 ctcatcctgc ctatcatccc atatttgtgg cccagctccc gagataactt tcagctctga 81121 caccaactga ccacgaggat gtctgcacca agaaggccct tgagacccca cctgtctttg 81181 tggattatgg ggccaccagg gtggccacag ccctggttgg ctcccagatc ctgaggaaaa 81241 gagtggcttg ctgaggacgg tagactgaga agagtaatca tagctgccag ttgtcaagca 81301 gctgctcttc caggcacagg gccaagcact gtagataccg ctctctacaa caatgtaagt 81361 gtgcattgtt agcccaggtt tttttttgtt ttttgttttt tttttttttt gtctttttaa 81421 aaatagagac ggggtcttgc tatgttgccc aggctggtct ggaactcctg ggctcaagtg 81481 atccactcac ctcagcttcc caagtagctg ggactactgg tgtacactac taggccccat 81541 gcagggctat ctaaaaagag gaggacaagg ccaggcgcag cgactcacac ctgtaatctc 81601 agcactttgg gaagcctagg tgggtggatc acttgaggcc aggagtttaa gaccagcctg 81661 gccaacatgg tgaaaccctg tctctactaa aagtacaaaa attagctggg catggtggca 81721 ggcgcctgta gttccagcga cttgggacgc tgaggcagga gaattgcttg aagccaggag 81781 gtggaggtca cagtgagctg agatcttgcc actgcactcc agcctgaatg atagagcaag 81841 actctgtgtc aaaaaaaaaa aaaaaaaaaa aaaaagagag agagtaggac aatagacctg 81901 accttcagga atgttagact cagctagaga tgtgggtggg gtgggatgta atcaacggga 81961 caccagggag ctgctcacta gcggaagccc aggcaggact tgatggtatc caggaggtct 82021 ctggaatcta ggctagggat gctggcagga ttggttgaat taaatgatta tttccaaggc 82081 cagaaaaaaa gctgtttcct cccttcagtt tgtcttcatg gtcccaatca ctcagccaga 82141 aataggattg ctctgcttgg ttcagtgtga gggggtgagg gagactcatt tgtaaagcag 82201 ggttgctctc aaaagagctc attttctaag taaggagaga gacagaggtg ggagagtttc 82261 taaaatgacc ctggaagccg tcatcacttg gcaagagagc tgagcagagt actcacttgc 82321 acagtgagtg ctgcttgctt taatttcagt tccttaggct ccgtggccag cgtgccaggg 82381 accagatgtc aggacctgag tgattcagag gtctgtcaag gcttggtgtg gtggcttaca 82441 cctgtgatct gagcacttag ggaggctgag gcaggaggat ggcttgaagt caggaatttg 82501 agaccagctt gggcaacaca gcaagaccct cgtctctaaa tagttaaaaa aaattagcca 82561 agcatggtga tgcgtgcctg tagtaccagc tacttgggag gctagggtgg gaggatcgcc 82621 tgagcctggg aggttgaggc tgcagtgaac tgtgatcgca ccactgccct ccagcctggg 82681 caacagagca agaccctgtc tccaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaagaccgt 82741 tgagtgatct agggagtcat ttaaaaactt cactctggcc tccagggagc ccaaacagca 82801 cacagtttcc tggtgaaacg tttccccttt cgtggcagag tgcaggagca cgggttccaa 82861 gggccccgtg gagggtgcct gcagatggcc attccttatg gctggcccac ttctccctct 82921 agggatccct ccggaggccc cactgtggga tagatctgga aagagtatct taagagtgga 82981 acttggcagt ttcggaaata ttcgacaagc acaaataaga cagggaggag taggggctgt 83041 gccagggaag gacctggaac gccaggctga agggtgctca tagtttggta ggtgatgggc 83101 gtcactgaga ggtaggaggc tggagggaag aagccaggta agggtcttgg tgtctccctg 83161 ggtaaaagat gaagaggttc agcggtttcc ctggcttcct cggggcttct cggccactgg 83221 agaactccac cagatcatca ctgatgcggt gtcaggagca cggtgagggc ggagagcgga 83281 gaggcaggat gtatgcatcc tggcttccca cctgcttccc tgcggcccca ggttcctgtt 83341 gcaagcgctc agggatcggc tgtcggcgct gcccctctgg gccgggggcg tcggcctcag 83401 cgccgtccta gcctaggaca ccgtctccca gttcccgggc ccggccgtgt agcgaagccc 83461 agagcctgca gggtcccatg gacccagcct ctcgctcggc ggccccgtcg ccggcatgac 83521 tgaatccagg aggccacgcc ccccttccct tagccattca ggcatcgggg cggggagaaa 83581 ggcggctcca gcagaaccag ccaatcacgg ctgaagtttg tgcccggccc ttttcccctc 83641 accgccacgc cccaagccgc acggttgccc cggcaactgc tccaggatcc tgtctttgaa 83701 gcatttcgtc ttacccttcg ctgggcgttc gttgccgaac gaagcgaaga ccacggggct 83761 ggaccttcgc tgcagacctc agtcctcaca gcccgccccg cccagccact gacttcctgg 83821 gtctatcacc gagatgccga gcctcctgca gtcctagaga gcatacggcg cagggccagg 83881 gccacttccg gcccagagga ctccctgtca atttgctgca atggctcctg caccagggtt 83941 gggtgggttc gctgcactgg ccgcggagct agagcatctt ccgagctccc gagcaggggc 84001 tgacggtgct gccagctgtc gtgcggtacg gccggcagag agcggccagg aaggagaccg 84061 tgccctcttc cgggggagca gaagaaacgc gcgggctgtg cgaggcggct cgcggctctg 84121 gagtgcctga ggggcagagg gcggcaaacg ttaactgtag gggctttgct gcgttcctca 84181 cacattcgtt tttttgtttg tttttcggag atggattcgt gctctgttgc ccaggctgga 84241 gtgtagtggc gtgatctcag ctcacggcaa tctccacctc ctggttcaag cgattcttcc 84301 gcctcagcct cccgagtagc tgggattaca ggcgcctgcc atcatgcccg gctaattttt 84361 gtatttttag tagagatggg gtttcactat gttggtcagg ctggtcttga actcctaacc 84421 tcgtgatccg cccgcttcag cctcccgaag tgttgggatt acaggcgtga gccaccgcgc 84481 ccagccaatt tttgtatttt taatggggtt tcaccatgtt ggccaggccg gtctcgaact 84541 cctgacctca agcccgcctc ggtctcccaa agtgccggga ttgcaggtgt gagccaccac 84601 gcccggcctc actagttcat ttttcaaata caaaatgaat gcctgttaat ttagcaaagg 84661 tcctttgcta aatgttgagt tgtaagtgag gaagattcca gacaagaccc ttgccctcat 84721 ggggttccag tgtggtgggg gaaactgaca gacacataca tggcacacac tttaacttca 84781 agtagttaca aatgccagaa agaaatgaac tgggtaaaac aggagaggac ggtcggagct 84841 tcattgtcag ggaagacccc tccaaggagg ggacatttaa gctgagactc aaagggtgag 84901 aagcggtttt acaaggatct ggagtacgaa cctttcaagc aaagggaaca ggataaacat 84961 gaggaggaga ctggtgccca gtagctgaaa tgacccctgc cccgaacctt ccccgccccc 85021 accccaaccc ccccatgtgg aggggagggg agtgaggaga caggctggag agctgcgact 85081 tgtcaggggg cagtggcttg aggcttctcc ctggtgtcct gtgaagacag tccagccttc 85141 ctcatcgcac gctcctctgg cagcctgaga catggggtca ggcgggcttc ctgcctggtg 85201 taaaggcagc acccaatgaa tggcctgcac ctcagtaact taagacagca atttttatgt 85261 tctcaactaa caaagcccta ggctagtcat ggagatgaac acagatctca cagagcaacc 85321 acagagctct tcaaggattc aggtgatcct gtcattaatg cctttctcag agacttagtg 85381 aagccagtgt ctactggtaa aagatagctg gggatggggt gggtacacag gtcatgcatg 85441 ataaggccac tcaacatttc catctcccta aggaaaagga gcctttggtt tccttctttc 85501 ctgtcatgcc acttcagcca tctcaacctt gttaactgaa ggccaccaat gagtccaaat 85561 ttcagccaac caatggagta aactgctaaa aagccaccaa cagccacaca cgctaacatt 85621 aaatttgaag tctactaaat gcacctatat cattgaattt gatttctcat tttgtgtgtg 85681 tgtgtgacag agtcttgttc tgtcacgcag gctgaagtgc agtggcatga tcacagttca 85741 ctgcagcctt gacctctcag gctcaagcga ccctcccacc tcagctttct gagtagctgg 85801 gactataggt gctcaccaca acaccccgct aatttttgta ttttatgttt caactatgtt 85861 gcccaaactg ataacaaact cctgggctca agtgatgatc ctccagcctc tgcctcctaa 85921 agtgctggga ctacaggcat gagctactgc acccggtcta atctcatctc tttttatatt 85981 ctgtccactt tggtatgcta tccttagaat ctactcttgc atctatcccc caggtttctg 86041 ctagtttata tcagcaaagt gaattatttt agtgtataag ctatttcctc ctgggtcata 86101 acgtgtatct ttcccctctc tctctgtctc ttgcacacgc ttgtgcagcc tctctctcca 86161 cacacacaca tatatagatt gtgttgagat ttgaccctag ttgttatgag tttactggca 86221 atggagcgca attgtggtgg gttttcagga gaatggacat ccccttttag gtcttctgcc 86281 cctgggaagg ttatcctaag gttttcaagc agagaaaggc tattctattc agatactgag 86341 ggcaaactac ttctactggt aaaggaggct cagagtgacc taaggaatca agactgtcaa 86401 tttcatctgg gtccaaacat atatccccac tttgaatttc aggatcacct tctgtttttt 86461 cgttttgttt tgtttttttg atagagtctc gctctgtcgc ccagtctgga agtgcagtgg 86521 cgccatctcg gccccctgca acctcacctc ccaggttcaa gcgattctcc tgcctcagcc 86581 tcccgagtaa ctgggattac aggactgcac cactgtgcct ggccaggatc accttcttta 86641 ccaatcagcg cagtaacctt cacgtgagat ttgttgggcc tgggaattta gctgtcatta 86701 ttattcagcg actctacagt taagttttgt gtttgatttt cagtgacagt attgtggcta 86761 taagagctgc ttctaagcta tcacagaagt tctcaggttc tcagaccatg acttgaactg 86821 agctaaggga cttgggcttg tcattttctc actgtaaatg ctcaagtgca ctaaaaataa 86881 tgtatcccat accatggttg tataatcatc attatacatc tgtggacacc ataatggctg 86941 agtgcagcag ctacttgttc ccaccaggta catgctttca ttggcacttc atcatcacct 87001 gcaggtggct gcttgattaa ttgtgatatc gctgcaagcc gtggataacc agcaccctgt 87061 gatggttaat ttcatgttaa actatactgg gccaaggcat gcccagacag ctgcttaaac 87121 attctttctg ggtgtgtctg tgagggcttc tcaagaagag attaacgttt gaattggctg 87181 agtgagtaaa gcagacggcc ctcaccacgg gtgggcatca tccatccgta tggaattaaa 87241 cagaacagca aggcagagga aggctgaatt ctctctgtgc ctgagtactt gagacctggg 87301 tactgggaca ttgatctcct ggtcttgtgc tcctgattct caggccttga gtcttgaatt 87361 ggaatctaca ccattggcct ttggctctca ggccttaaaa ttacaaaact gttaggaggc 87421 ggaggcaggc ggatagcttg agcccagggg agttcgagat cagcctagga catacaggga 87481 gaggccatct ctatataaaa atatatatat agaacaaatt agctgggtat ggtggtgtcc 87541 acctgtagtc ccagctgttc aggaggctga gatgggaagc ttgcttgagc tggggaagtc 87601 aaggctgcag tgagctgtga ttgtgccact gcagtccagc ctgggcaaca gagggagacc 87661 ccatctgaaa aaaaaatgct gcaccactga ccttgggtct ccagcttgca gatagcagat 87721 tgtggggctt ctcgactgcc ataatcatgt gaggcaatac cttagtgtgt gtgtgtttat 87781 atatatacat atacacacac ataaaagctc tctctctctc tctctatata tatatatata 87841 cacacacata cacatgtata tatagattct gttattctgt ttctggagaa ccctaataca 87901 tatcctattt ctcattgtca aggagctcag tctgcactca agcccaaata cccaaatccg 87961 tccccatatc ccttctttgt gcatctatct cttgagacca ctccttgtac taattttgta 88021 tcaagcaagg tccaatcagg agaagggaag cacacagtaa tttgaactgg gaaaatttaa 88081 tatatagaat taagctataa agggcttaga ctgggcgctc tggctaacgt ctgtaatccc 88141 agcactttgg gaggctgaga caggtggatc acctgaggtc gagagtttga gaccaacctg 88201 accaacatgg agaaaccccg tcactactaa aaatacaaaa ttagctgggc gtggtggcac 88261 atgcctgtaa tcccacctac ttgtgaggct gaggcaggag aatcacttga actcaggagg 88321 cggaggttgc agtgagccac tgcactgcct gggcaacaga gcgagactcc gtctcaaaaa 88381 aaaacaaaaa acaaaaaggc attttcaaga ggttggctgg taagaagtaa agagaactct 88441 gattctctgg tcacagcaca tagcagcagc cataccttct ggcctgagtt ccagcactca 88501 agcaagaggt ttcctggtcc ctgggcttag atccagacct tgaagagggc acaacggcag 88561 tttcactgaa tggcagagaa gtggctgtgg taccacatgg gcagaatttg ctggaaatcc 88621 atcctcaagg gtagaaaaag ctgctcacag ggaagtgtct catgagaaac gcgctgctac 88681 agaactacct gaagagggtg gcagagaaag ctgctaggtg ccgctggttg aggttcactg 88741 aggcggcggc actggagacg ctttgcaggc tgcgggcgcc tgctgtcaca ggagctgccc 88801 atgctgcagg agctgggtgc tggcgaagcc ctgtgtgctg cagaagccag actcggaaga 88861 agtcacccgt catgccggag cctagagaga acacaccaga agcaggaagg gaaactcgtc 88921 tcctgcaatg tctctccagc gccctctact gacaaagctt aagatcatgc cagctggtaa 88981 aggaaaaact aaagggccag aaccatttca cagagcaggc aacgcaggag ctgggacgta 89041 atcaataact agcacaggtt tatattatga ttcgtctgta gcggcctgcc aaaggttcca 89101 tataacatcc ctgcggcagg aactgacact tcaggcccca ccgaatcagc tagataacgg 89161 cccacattgc agagcagcca atcaaaaccg gaacatattt agatcatttg gtaaacggga 89221 cactcaagca gtcctatagg gaggcctatg tggggaggaa ctgaggtctc ctgccaagag 89281 ccacgtgagt catccgcctt ctaagtgggt cctccagtgc cagaggactg gggtcctcgt 89341 tggtgtggtg tctgcagtat catgagagat cttgagccag aactgcctag cttagcagct 89401 cgcatatttc tttttttttt tttttttttt gagacggagt cttgctgctg cccaggctga 89461 agtgcagtgc catgatctca gctcactata acctccacct cccaggttca ctttactttc 89521 ttcattattt atagactata ttttcttttt tgtgagacgg agtattgctc tgtcatccag 89581 actggagtgc agtggcgtgt tctcagctca ctgcaacctc cgcatcctgg gttcacgcca 89641 ttctcctgcc tcagcctgcc gagtagctgg gacaaggtgc ctgccaccac gtccggctta 89701 ttttttgtat ttttagtaga gctggggttt caccgtgtta gccaggatgg ttacaggcgt 89761 gagccaccgc acctggccac tttagactgt attttctagc gagaaaaaac agtgggccgg 89821 tcccgatggc tcatgcctgt aatcttaaca ctttaggaga ccaaggcggg tagatcactt 89881 gaggtcagga gtttgagacc agcttgggca acatggcaaa acctcgtctc tactaaaaat 89941 acaaaaatta gccgggcatg gtggcacaca cctgtaatca cagctacttg ggaggctgag 90001 gcagcagaat tgcctgtaat cccagcactt tgggaggcca aggcgggtgg atcatgaggt 90061 caggagtttc agaccagcct agcgaacatg gtgaaacccc atctctacta aaaatacaaa 90121 aaattagcca gatgtggtgg tgtgctcctg taatcccagt tacttgggag gctgaggcag 90181 gagaactgct tcaatgtggg aggtggaggt tgcagtgagc caagatggcg ccactgcact 90241 ccagcctggg ctacagagca agactcggtc tcaaaaaata aataaataaa taaacaaata 90301 accgggcgcg gtggctcacg cctgtaatcc cagcactttg ggaggctgaa gcgggcagat 90361 catgaggtca ggagagcgag gccatcctgg ttaacacagt gaaatctcta ctaaaaatat 90421 aaaaaagagg ccgaggcagg tggatcacga ggtcaggaga tagagaccat gctggctaac 90481 atggtgaaac cccgtctcta ctaaaaatac aaaaaattag ccgggcatgg tggtgggctc 90541 ctgtagtccc agctactcgg gaggctgagg caagagaatg gcatgaactc gggaggcgga 90601 gtttgcagtg agctgagatc gcgccactgc actccagcct gggcaacaga gcaagactcc 90661 atttcaaaaa aaaaaaaaaa aaaattagcc agccgtggtg gcaggtgcct gtagtcccag 90721 ctactcagga ggctgaggca ggagaatggc atgaactcgg gaggcggagc ttgcagtgaa 90781 ccgagatcgc gccactggac tccagcctgg gcgacagagc gagactccat ctaaagtaag 90841 aaagaaaaag aacacgtgaa aaattatcag aaggaacaag aaagtgcaat ccggtagtca 90901 atttcaatat aattttatgc aaatttgcca tataccagca atgctcaata gaaaatacag 90961 ttcatgtgcc acttaccaat gtatggcctc tcagcccaaa cacatccttt tcgttctgct 91021 ttgtgatact gagctggatc acatctaagc tttgggtgag atccagtgtg atacccagct 91081 ggaccctgta tacagttcct tgctagccag cttgatgttg ttcttcgcta atacagggcg 91141 atggatgaac actgtcaggt cagagcagga gggaggggct gtcttctcca ctgtggccag 91201 cggagggcag gagaggtaac cagcggcctt cagttccact gtcctcacct tggtccggct 91261 cctgcccttt ccactgtccg ctagctgtga gttctcgggg cacccactcc ctcttctgag 91321 gtccagtctc caccttgaat tgaaagggga agggctcttt cgtgtttcca agtttccctc 91381 ctttttactt cctcagtcct aagggcacaa gttccttcct gcagttgcta ttctgtaact 91441 cttagaaatc tctcttacca gttgggtagt taaccatctt tacaactgac caattatttt 91501 tatcaaattt tctcttcaaa ataatggtgt ggccgggcac aggtgctcac gcctgtaatc 91561 ccagtacttt gggaggccga aatgggtgga tcacctgagg tcaggagttc gagaccagcc 91621 tggccaacat ggcaaaaccc cgtctctact aaaaatacaa aaaattagct gggcgtggtg 91681 gcgggtgcct gtaatcccag ctacttggga ggctgaggca ggagaattgc ttgaacccag 91741 gaggcagagg tttgcaatga gccgagatcg tgccattgca ctccagcctg ggcaacagag 91801 tgagactctg tcaaaaaaag agaatattcc aatgaaaata acaacagaca attcacaaaa 91861 ggataaatag aaataatact ggaaaaaaag aaaacatatc cagcttctct agaaatcaag 91921 gaaatggtaa ctatattcat ggaatttctc ccattgggct gttgcacaga atgagcgcaa 91981 gacgtaaaag acttagctca atgcctggca tataaacact ctatacatgg taactatcat 92041 gatttaaaag tttaatagga tgggtgcagt ggttctcacc tgtaatccca gcactttgga 92101 aagctgaggc aggcggatcc cttgaggtct ggagttcaag accagcctgg ctaacctggt 92161 gaaaccccat ctctactaaa aatacaaaaa ttagcctggt gtggtggcgt gcacctgtaa 92221 tcccagctac tagggaggct gagatgggag gatcgcttga acctgggagg tggaggttgc 92281 agtgagccat gaaattgcac taccgcacca aaaaaaaaaa aaaaaaattt aattactgtg 92341 gaccttatgg ggacgctaca tcaagctgtt tcccaattga attggaatgc cacaacaacg 92401 gctgaacact gtaaatgtcc aaatctggag gaacagagga aggtaaacat ttgattctca 92461 agtggaagtt caaagcatta cattctttct ggaggacagt ttggaaacat gtacacaaag 92521 taatctggca attccatttc tagtttttct cagggaaatg ttcaagcaaa tgaataaagg 92581 tgtacgtaga agcatatctg caaatacaga aagatggaaa aaatctaaat gtccaagggt 92641 gggggactta aaatatgaaa attccgccag gcacggtggc tcacgtctgt aatctcagca 92701 ctttgggagg ccgaggcggg cagatcacct gaggtcagga gttcgagacc agcctgacca 92761 acacggagaa accccgtctc tactaaaaat acaaaattag ccgggcgtgg tgacacatgc 92821 ctgtaatcct agctactcgg gaggctgagg cagaagaatc gcttgaatcc aggaggcgga 92881 ggttgcagtg agccaagatc gcgccactgc agtccagcct gggtgacaga gcgagactct 92941 gtctcaaaaa aaaaaaaaaa aaaaaaaaaa atatatatat atatatatat acacacatat 93001 ataatatata ttcacaggca tataactgtc tgaacagtaa aactgtaaga acacactaaa 93061 acatgaatag tacttattcc tggatggtcg tattagaaat gttttttacc ttcattcatt 93121 aaacaaatat ttactaagca tctaccatgt gccaggcaca attctaggtg cttgtcatag 93181 ggcattaaaa agtgtctgcc ctcatgaaca cttatcgttg agtcgtggga aacacgtata 93241 tgtcaggtag tgataaacat tagggagaaa agttaacgcc gagcacaggt tgtttacaga 93301 gctggggaag gcctccttga gggggtgaca tatgagcaga atgagggagc aagccttgtg 93361 gctatcacga gaaagctatc tttatggttt tccaattaat tttctaaatt ttctcttcca 93421 agcatgcaac acctacataa ttaagacaaa acaaaccaaa aaaactcttt ccatttatga 93481 aagagatctg gagcctgaaa tcatctccgg gtcgaggcgt ggagagagtg ggagaggatc 93541 cgagggtatg gaaaggaccc cgagcccccc tgctctagat ccctccgggc cgtgggaccc 93601 ctgacgccca gcagcaactc gtggacaaag cagctggtga gagaacgcgg gactccggcc 93661 aggtcagtgt cccgcttgcc gcgtcctgct aaggcgaccc gaagacagag gatcgccgca 93721 ggaggctccg cgccccttac ccgagggcac ttcccagcaa gctcgacgcc tctccaggga 93781 attgggagga gcgggaccca agaggctaga aaagagtctc cccggcccga gctccacggc 93841 ccgcggcctt tcggccctca cctactagtt ggcaaagaaa gaccgtggca gaaggaagag 93901 gccgctccag cgcatccctt cgggcggagc agaagctcca cttgtggcta ggccatgtgc 93961 cctctccgtt cctcgggcca gccttggctt cccctttttg tcgctcaagg gccagggtcg 94021 caaggaaatg tgaaaaaaaa gcggagacaa aagagaacga ggtaaatcta gatgcagagc 94081 ctcgcctttc ctcccgggtg caggtgtatg acgcgtatgg agtttctgtc ttctaatagg 94141 cagcctgaga attcttcctg ttcattggcc atcggtcatg gagaaggcgg gctcagtgga 94201 cgaacggctt tctgggagct ggggaatgct tgcgtagcgt agtttcccag cgagccccgc 94261 gaggacttcc ggcgccggga gctcgcggcg gaagtgggat ctcctgggcc gtagtgggcg 94321 ttgtgtgttt cgggggcggg ggcgggggcg ggggccgggg cggggacggg gcctctggcc 94381 gcctggctcc aacatcaagc accgggctcc gagtggccgg gatcagcgcc ccgaggcaga 94441 ggccggaggg cgcgcgcact gctaggaagt gctggtcccc cgcgccgctc tgccagcttg 94501 gtcccccggc agacgccccc tgtacgatcg ccgctcgccc cgcgggcgag gctgcggtgg 94561 acagcgcggg gctccggctg gctcgccttc ccgcctgccg tgtcctgctg agcgaccctg 94621 gtgagtcctg gccctcttcg aggaaagtct tcttcgaagt caccagagga tgagaatggt 94681 gcgcgcacct ttcaggggtc ttgtgaggag cagaataaat aaagtgctta ggatggacta 94741 aagaattgga aacaacccga atgcatcact cgagtattga ttaagaaact agagcatatc 94801 cataccgcgg aagctcatag tgccgctaat gtctgtgcag taaggtctca tttatgtaga 94861 aacacatcaa gcaccttttc tccccgctat ggatgacttt tttagtaaaa gcacaggaaa 94921 aggtctggga cgtcatttgt cagactgtta agtcgggggg tacaatggca ttgaggaagg 94981 gagagtttta cagatttcta tattgtttga atcttccctt ttttattaag atggtttgta 95041 tttttttcat tttcttttta cttttctttt ctttcttttt tttttttttt ttgagatggt 95101 ctcactctgt tgcccaggct ggagtacagt ggtgcgatca tagctcattg caacctcaaa 95161 ctcctggact caagcaatct ttcctcctca gcctccccag tagctgggac tacaggagcc 95221 ccacctggct ttttttgttt tttggtacag atggcgtctc acattgtctt cccaaagggg 95281 tcttgaaccc ctggcctcaa gcaatcatcc tgtgtctgtc tcccaaaatg ctgggattac 95341 aggcataagc caccaagccc ggccagtttg tatttttatt gttgaaacaa aaaatcgatt 95401 actgaaaatt gcataataca cttttcagta gagagtaaca atgtaaggta taaatgtgtc 95461 atacatttat atgtacctgt ggataaatgg ctttggatgg tgtctgtaag cccttctgtg 95521 acacctcagg gccagatgtg ttttggaatt cagaggtttt caaatttgag aaagataata 95581 tagtgcacag acgatatatc aagcacatct ccagagggat ctggggcaac actaaataat 95641 gaaacatatc agttaaaatt ttttaattag gccgggcgca gtggctcacg cctgtaatcc 95701 cagcacttta ggaggccgag gcgggtggat cacaaggtca ggagtttgag accatcctgg 95761 ccaacatggt gaaacctcgt ctctactaaa aatacaaaaa attagctgga cgtggtagct 95821 ggtgcctgta gtcccagcta cttgggaggc tgaagcagga gaatcgcttg aacctggcag 95881 gcagaggttg cagtgaactg agattgcgcc actgcactcc agcctgggcg acagagtgag 95941 actctctctc agtaaaaaaa aaaaaaaaga tatttaattg atgtataatg tgcaatacag 96001 aaaaacacac gagtgggtat agctcaatga aattttataa actgaagata tatgttatta 96061 atatttccac cataattttg aacatttaca tttagtggaa taaattgaga ttataaatat 96121 gcttatatca ggtcaggtgg tgctgccaaa tgaatgacat tgggttttgc tgtcaaagag 96181 ttatgaaaca cccgggtttt ggatttctgg ctttgggatg agagcttgtg ggcctgtgtt 96241 aataggagag acagtcacat gagcaaggca ttacagtaaa gcctagaaaa ggcttcaggg 96301 ccatgaaagg atggggactg gatggctctt cagcctgggg tttgctcgtg tattccacat 96361 ccttcctcct cagtcctgcc attttgctct ttccccagga gtacacatcc agatgccagc 96421 ccagctacca caggggatcc ctctgggaga ctgaaagtac aggttctggg gcccaggttg 96481 aagccgacca accctgagcc tcaggccagg ggaatggcag cccccttgga ggcccaggac 96541 caggcccctg gggagggaga agggcttctg attgtgaaag tggaagattc ctcctgggaa 96601 caggaatctg cccagcatga ggatggcagg gattccgaag cctgccgcca gcgcttccgg 96661 caattctgct acggggatgt gcatgggcct catgaggcct tcagccagct ctgggagctc 96721 tgctgccgct ggctgcggcc cgagctgcgt accaaggagc agatcctgga gctgctggtg 96781 ctggagcagt tcctgacagt gctgccaggg gagatccagg gctgggtgcg tgagcagcac 96841 ccgggaagcg gtgaggaggc tgtcgccttg gtggaggacc tacagaagca gccagtgaaa 96901 gcctggcgac aggtgagggg cccttccaca tccaggggca cctggatggt atctgagctc 96961 gagagaagtg ggtaacctgc agagataagt ctcccagagg ctcacagggt aggggagaca 97021 cagagccccc aggagcaggc acacaagcac agtgtgccat gggtgccttt ctgaaagatg 97081 cgatccaaag taaatgtgat ttattttttg tggctttgga aggcagcaat agggctgaag 97141 tgcgaatgtt ccagacagga ggttttagct ccatgcaggg tctgctgatg tttatttgca 97201 ttttgctcta ggcctggctc tggggacaca agggtgtgca tagagccagc tccagagtgg 97261 ggcgagaagg tggagctgtc cggggcctct gctaccttgt gggatgttgg gtccttcaca 97321 ttggatgcat ctggggaggt tctaatgagt ggctggggga tggggttctg cctttggcag 97381 gagggtcccc agctaccccc ttgctaagtg gctgtgattc tggcctttca gggctggttc 97441 ttgcctcgtg aatgatcggg gagcatcctt gggcacacag cattacctgg tatcacattc 97501 tcctccggac ttttcctggg gcaccatatt ggtctccttc ccacacccca gcatcctgag 97561 ggtcatgtcc tgtgtctcct tcccagggag taaactctgt ctcccccggt ctggtctcgg 97621 gctgatgagc atgttgtggt tcctgcacag gatgtgccct cggaggaggc ggaacccgag 97681 gctgcaggcc ggggatccca ggccacgggg cctcccccga cggtgggggc acggaggcgg 97741 ccgtctgttc cccaggagca gcacagccat agcggtgagt aagcctccgt tcttgtggac 97801 agtcgagtgg ctgggcaggg acctagcttt gtcaccggcg ttgccctaag ggtcacaggc 97861 aggacagctc cctctgtgaa gtccagggcg tgtgtgcatg cgcacaggct ggggaggcca 97921 taggcgtcgg tgtcaagcct gggctggcct ttctaaggct ccattcttct ccttcagccc 97981 agcctcctgc tcttcttaaa gagggtcgtc ccggagagac gacggacacc tgctttgtct 98041 ctggggtcca tgtgagtcac cagtcccttt gtcttcttta aggcacttgg ccctgttgag 98101 tttgtaaaat gggacttgct gtcccatcag gcctctttca tctgacccat cctgtccccg 98161 ccagtgctgc tgggaggcct gagccgggtc ttctcacccc attccaggga cctgtggcat 98221 tgggagacat cccattctat ttctcccggg aagaatgggg caccctggac cctgctcagc 98281 gggatctctt ctgggacata aagcgggaga actcccggaa caccaccctg ggtaagcacc 98341 cagggccttt gggtccaggc tggccgcccc cgattctgct ggaacttcag tcttgtttcc 98401 caccccatcc ttagctggtt ccaaagcagg ctctccctag gtcttgccag gagcctgagt 98461 aactcctttc ttggctgatg atcagttttt gtgcgtttcc acatgcagca tgggacggcg 98521 ccggcgctgc ccagccctgc agttgctcta agggcaactt ctcctttgag tctcacaacc 98581 tagtgcatgg aagtattgtc cccattttac acataaggga cctgagactc aggtcagtgg 98641 atgctgaagg gacacagctg ggacttggat caggcatagt tgtgaggcca ccttgggctg 98701 ctaggagcag gggtgctgac ggggaacccc agctgctcac cagcctgggc ccctgctcgt 98761 cagaactgca cttaccaggt tcctctccat gccaggcccc ttctcagcac catcagggat 98821 catcttgttc agtcacactc ccaggaggat gggctgcgac ctctgtccag atctgtgttg 98881 agtttggaga actagagcct gcgtggtgaa gggagccaca ctagctagac cagtttggct 98941 cctcagtttc cgactgtgac ggttgggaaa aattttcttt cttttttttt tttttgagat 99001 ggagtcttgc tctgtcgcca ggccggaatg cagtggtgtg atctcagctc actgaaactt 99061 ccacctcccg ggttcaagca attctcctgc ctcagcctga gtagctggga ttacaggcat 99121 gagccaccat gcccggctaa tttttttgta tttttaatag agacgagttt tcaccatgtt 99181 ggtcaggctg gtctcgaact cctgacttca tgatccgcct gcctcagcct cccaaagtgc 99241 tgggattaca ggcgtgagcc accgcgcccg gcccttgaaa agtttcagaa ttactataaa 99301 tctgttctgc tgtggagctt gatatctggg gttcagagtg ggacattgga tcccagtgtg 99361 gcctgcaggg cacagatggc ttagggggct ggcccatgca ggcgggatca gaggcttatt 99421 cagactgctg ctctgccgaa ttttttcatc atccctgatt tatttgggtt tttgtttgtt 99481 tttgagacaa agtctcgctc ttggagtgca atggcatgat cccggcttac tgcaacctcc 99541 gcttcatggg ttcaagtgat tcttctacct cagcctcctg agtagctggg attacagatg 99601 tgcgccacca tgccctgctg atttttgcat tttttaatag agacggggtt tcaccatgtt 99661 ggccaggctg gtctcgaact cctaacctca ggtgatcagc tcgcctcagc ctcccaaaat 99721 gttgggatta caggcttgag tcactgcgcc tggccaatca tccctgtttt atagatgagg 99781 aacctgagaa ttcagctgtg tgaacccagg gctttctgaa cctggaggcc agggagcttt 99841 ccccagcctt gtttcttcct cacctcagct ctggccccag aaccgcgtgg gactgaagag 99901 gtcgcctcct tccccttgca ggttttgggc tcaaaggcca aagtgagaag tccctgctgc 99961 aggagatggt gccggtggtg ccaggccaga caggcagcga cgtgactgtg tcctggagcc 100021 ccgaggaggc tgaggcctgg gagagcgaga accggccgag ggcggccctg ggcccagtgg 100081 tgggcgcgcg acgggggcgg ccacccactc gccggcgcca gttccgggac ctggcagccg 100141 agaagccgca cagctgcggg cagtgtggaa agcgcttccg ctggggctcg gacctggcgc 100201 ggcaccagcg cacgcacacg ggcgagaagc cacacaagtg ccctgagtgc gacaagagct 100261 tccgcagctc ctcggacctg gtgcgccacc aaggcgtgca cacgggcgag aagcccttct 100321 cctgttccga gtgcggcaag agcttcagcc gcagcgccta cctggccgac caccagcgca 100381 tacacacggg cgagaagcct ttcggctgca gcgactgcgg caagagcttc tcgctgcgct 100441 cctacctgct ggaccatcgg cgtgtgcaca ccggtgagcg gcccttcggc tgcggagagt 100501 gcgacaagag cttcaagcag cgcgcgcacc tcatcgcgca tcagagcctg cacgccaaga 100561 tggcccagcc cgtggggtga gcagctggct tggccggaaa cccgggggag gcccagccac 100621 ggcacatcct gctttgttca ccactgggac tctccttcca tctgtggcca cctcccgggc 100681 tgtccgagga ccccaggtac ctcacactcg gaactcgcct gccctgcttg gctctgaaga 100741 cctgcccagc gctcaaaggg aacggaagcc ttcccctccc gcccccgatc ttgtcctctt 100801 tcccccttct gcgcctagcg ttcctcttcc cctctagttt cctggagccc caacacattc 100861 ctggcaggga cagcagggtg gcaaggactc aggtctaggt cccttcccag aagcccccga 100921 gcctcatttg actgtgtggc tctttggccc ccaccctgtg gggtgggtcc atgggtcagg 100981 cctctgccct accaacctgt gcctttcagt gggcgtggag gactggcctt ggccccccag 101041 ggggctgctg gactttggga gagacagccc acacctgtgg gaccgcgggt cttagtcacg 101101 gcggcagggg ctttctggcc ccctcccact cccgtttcca ggccatgacc actctgccct 101161 gtcctggcca tacggactcg gcctgccttt gccctcggcc tacttgccct agcatgaggc 101221 tctgagagcc acctgcccac caatctggtg aggataatgg tggctccagc gacaggaggc 101281 caaccctgga gaccaagaac agggcgcctg gctgccatct tttcctccag aggtggggct 101341 gcaccagact cagcactagc actccatcag cactagcacc tcactccatc agcactagca 101401 cctcactcca tcggccccgg caccctgctc catcggcact ggcgccctgc tccatcggca 101461 ctaatgctcc actcggcgcc ccactccatc ggccccgctc catcggcact aatgccccac 101521 tcggcgcccc actccatcag cactaatgct ccactccatt ggcactaacg ccccaactcc 101581 agcggcacta atgacccgct cctttgacat tggtgcccca ctccatcagc actaacgccc 101641 tgctccatcg gcactggtgt cccactccat tgtcactaac gtccggctcc atcggcacta 101701 ccaccccgct ccatcatcac tatgtccagc tccgtcggca ctaccaccct gctccatcat 101761 cactacgtcc agctccaacg gcactggtgc cccattccat cggcactaac gccccgctcc 101821 accggcacca gtgcctcgct ccattggcac caacgcccag ctccaccggt actggctccc 101881 tgctccatcg gcactaacgc cctgcttcat tggcactttg ctgctgcctc ctgagcactg 101941 ccttccatga acagggacag accagaggcc ctgcaaggac tccccctcag acctccaaag 102001 ggcaacagaa gagtattaat aaacgtgaaa acttacctcc aggctgttct gttctttcgg 102061 tagctggagg gtggggatac ctggaatata tgccccttgc cctggccttg tgtcctggtg 102121 gcacatgctg ctgcaccgtg accaggcgga gctgcgtcct tggcgtccag gtcggctggg 102181 ccccaatggg cagccgtgtc ccctgggact cttgtccccc tgtgtcagct agtgcctgcc 102241 ctgctctggg agcaccgtcg gtgctccagc agaacgggga gctgagcagg actcagggtc 102301 ttcaccaggc cactcctctc ccacaactgt gaaaggaaga agggcccata gacgctgctt 102361 ggtcagtggt gggaccctct tgctgcgtgt tggggatctg gaactcaagg ctgccccagg 102421 ctgagtccct tggaggaggg gctgacagcg ggcttgccct tcccaccagc ccctgacatc 102481 caacttctcc ctaccaggag ccccaggccg ggacccacca agggctggtg ggtgcgtggt 102541 caaaactagt gatcatccag gcacgacccc tccttccacc catgctggtt ccttctcttt 102601 cacctgctag tggcctcagg gcccctggct gaggggaggc gggggccggt ggacaagacg 102661 ctctggtccc ttggtaacag gtgtgcatgt gccagccctc tctgccccta ggcctttctt 102721 agccaacctt ccatgcacac atcatcctgc ggtcactgtc atggtctccg tttcccgtaa 102781 gaagcaagca aggtctcgct gtggacaagg ctgattccag ttggtgattt ttacctctgg 102841 gggcctggtg ctgtgtgcat gcgtgtggcg ggtggccggg atggtacaag gacttcttgg 102901 tggcggtggc tggctccaaa tggagacact cgggtgtaac gagatcagat aagcacgtag 102961 gccaggagct aagaatagga ttgagcacag gttcctggct cgggtagggg catgtcagtg 103021 ccagcaaaga agtcaccagc aggagccaga gaatgaagcc caggctggga gagcggggcc 103081 aggacccagg gttccaggga tggggcagag tccagctcag ctcataagca gagctgatag 103141 tactcgttag ggtagagggc aagggttagg acccagggca ctgagagccc agggccagga 103201 ggtgagctcc gagcttggtg cttggaggaa gccccgtctc gccctatggc cttcctggtg 103261 gctgggaccc tgcggctggc cacgcagatg gcagacactc gagacagcct ggggagaaag 103321 gtagcaggag gcaccgccta gcccagctgc tcctgcctcc ctgctgagcc ccagcccagc 103381 ccagcccagc ccagctctgg ccttgcctcc ccaggggaag tatgacgtcc agggtccaag 103441 ggcagccctg atgctcagca gccctggggt ggcggccgct gtagtcactg ccctggagga 103501 cgtgttccag gccctgggct ttgagagctg cgagaggagg gaggtcccgg tccaggtgag 103561 cctctgcctc ttacatccac cctcaggccc agcaccgacc ccttccccac tctcccctga 103621 cccagaatct cctccccacc caaaacctcc ccggactcag gtcggtctca ctcttgcccc 103681 tagggcttcc tcgaggaact ggcttggttc caggagcagc tggatgccca cgggcgccct 103741 gtggggtgtg ccttagtggc cttgatgccc ccagagggca gctgaggcag ccacagcagc 103801 tggtccggga gctgagcggc tgccgggccc tgcggggctg ccccaaagtc ttcctgctgc 103861 tctcaagtgg tcctgggtgt gagtgacctg ggtcaggatc caggagctgg gcagggaccc 103921 aggggcagag cctcgggcct cactgcaggc caacatgctg cttctctacc cagcctccct 103981 ggagcccgga gccttccttg ctggcctgag agagctgtgt ggccgctctc ctcactggtc 104041 cctggtgcag ctgctgacga aggtggggac gctggagggg gaggcccagg gaagcggggc 104101 tggtcctgct gtctccgctg gttctgctgt gcccccctaa gccagttaca gttagattca 104161 ttcacttgta tcctccctcc gattcattct acaaacctcc tttattttct acagaagtgg 104221 tctcagcccc tcaccctcaa ctcgggaccc ctgacaaccc actcaggatc cccgacactc 104281 gactcctccc agctccatgc cttggcccaa gcagttccct ctgcctgcta tgccttcctg 104341 ttgtcccggg agccctgcac agtcctctgc aggctgccag cgatgccctt ccttgcatcc 104401 acgcttgcac ttcaggctgc ccctttctac accaagcagt ggctgtccgc cctgttagat 104461 cgcaggaccc tggaggctgg gactgggaat gcttcaagtc tggtctccaa ggctgaaggt 104521 ctcactgtgt agttgggtgg cccctgggca gccctgggga tgagggggcc ccagctctcc 104581 tccacagatc gcccagcagg gtagtgccca ccccgtgagt cttcccgaca ggcccagccc 104641 cgctctaggg aacatcccca cctctctgcg cactgatccc ggggcagggg ttaatggtgt 104701 ggccactgtc cactcgaggc catccatgcc cagggaagtc cataaccttg agcctgaact 104761 ggagcctggg ccctgcttga ggccatcctg cctgtctgtc ctttgctggc ctcctcctgt 104821 ctgcattctg gcaggtgacc ctgaagagac tatagaagcc acaagcctgg ctgagtcttt 104881 tctcggcccc atccccagct cttccgcagg gtggctgaag agtccgcagg gggcacctgc 104941 tgccccgtcc ttcggagctc cttgaggggg gcactgtgcc tgggaggcgt ggagccctgg 105001 aggcctgagg tgaggggggc agggcaggga tccaatcaca tggccacagt ttccagtggg 105061 acagaagctt agggggggcc ccggccgggg aggccaggag ctggagtcct ctcaggccac 105121 tttagatgtg tcttaaccct ctctgcagcc ggcccccggt cccagcacac agtatgacct 105181 gtccaaggcc agggctgccc tcctcctggc tgtgatccaa ggccggcctg gggcccagca 105241 tgacgtggag gcgctggggg gcctgtgctg ggccctgggc tttgagacca ccgtgagaac 105301 ggaccctaca gcccaggtga ggggaagccg agaacttcca ctggtgctct gaaggaagac 105361 cacccctccc tagaaacctg gggcctctct ccatcactgg caggaagtgc accacaagtc 105421 taacctctgt gctccctgtt gcctgcatct ggcccgtcac ttccctgcct ctggaagcct 105481 ggtctccagg gtccccgagg ccttcctcac tggctggttc tctggccccc ccgccccctc 105541 cccagctcaa agctttagct ccaagtcttg gtttccctct tggctcccag cagcccactc 105601 cactctcctc acacctctca acttcttggt gcggcttccc cacgagggca ggaggagaac 105661 tggctccagg aagctgggtc tctatgtcac ctctaaagag gccatgccaa ggccttgcag 105721 gagggagtta gaaaagggct tctggccggg tgcagtggct cacgcctgta atcccaacgc 105781 tttaggaggc tgagacgggt ggatcacttg agattaggag tttgagacca gcctgactac 105841 catggtgaaa cctcgtctct actatataga caaaattagt agggcatggt ggtgcgtgcc 105901 tgtaatccca gctacttggg aggctgaggc aggagaatgg cttgaaccca ggaggccgag 105961 gttgcagtga gctgagattg tgccactgca ctccagcctg ggcgacagag caagactctg 106021 tctcaaaaaa caaaaacagg ccgggcacag tggctcacgc ctgtaatccc agcactttgg 106081 gaggccgaga caagcggatc acgacgtcag gagatcgaga ccattctggc taacacagcg 106141 aaacctcatc tctactaaaa atacaaaaaa ttagccgggt gtggtggcat gtacctgtat 106201 tcccagctac tcaggaggct gaggcaggag aatcgcttga acccaggagg cagagatagc 106261 agtgagccga gatcgcgcca ctgcactcca gcctgggtga cagagcaaga ctccttccca 106321 aaaaaaaaaa aaaaaaaaaa gagggcttcc tccctggacc tgtgagtggc aggcggtggg 106381 aggccaggtg gggcagggtc tgggaaacct tgtcagcctc acagagggca gccagtggct 106441 ggggaggcgg tggctttggc ccaggcctca acattgttcc caccccaggc tttccaggag 106501 gagctggccc agttccggga gcaactggac acctgcaggg gccctgtgag ctgtgccctt 106561 gtggccctga tggcccatgg gggaccacgg ggtcagctgc tgggggctga cgggcaagag 106621 gtgcagcccg aggcactcat gcaggagctg agccgctgcc aggtgctgca gggccgcccc 106681 aagatcttcc tgttgcaggc ctgccgtggg ggtgagcggc ccggcctcct actgccctca 106741 ctttcctcgg ccaagcttca gcccccggga ctcactgtct accttctcca gggagcccgg 106801 gtacctgccc ttccctgccc cctctcctgt cctctcctag aggtcaagtc cacgaccttg 106861 aacccttaac tctcaacacc tgtcattcag cgctctgaat gtctccagtc tggcaagcct 106921 gccctggagc tctggagttg ggttctcacc ttgaccccac attcacanta gacccctgag 106981 cacccccagg tatccccgga gtgagactat ctgcctctcc ccaccctctt caggaaacag 107041 ggatgctggt gtggggccca cagctctccc ctggtactgg agctggctgc gggcacctcc 107101 atctgtcccc tcccatgcag atgtcctgca gatctacgct gaggcccaag gtgggttctg 107161 ccttccttcc agggcctggg cttgggcagg gctggttgtg gggaccgtcc agagagcatc 107221 tccagggctc taagctgggg tatggctgcc acctgcatcc tctgtttgcc aagacaatgg 107281 gaagaaaaaa aatcttccta aaccgcaagg gcctttggga agtgggagct tcttcccctg 107341 ttggagcctg gcaagaaccc tggagtcggt aaggtcaaat agcttttctg aggtcacagc 107401 tgttaagtgg ctgggccgag ctttgaactt ccgtctgtca ttcctgccct gcactctttc 107461 cacctccctg ggctgccctt aagccacaga tggggagctc ccggggctga tggagttcca 107521 cgatgttgat cactggaatt gattcctctt gcaggcagct cctgcagggg cacccctcca 107581 gggagctctg accaagcaga catcctgacg gtctactcag ccgcagaggg taaggagatg 107641 ggtcatcggg agcctgtggt tacacagggc ccagcttcct ggcctaagat ctggagtagc 107701 cttaggggca gctagggctt agggttgggg cacagagatg ccagcccagc tgtggtccag 107761 ccatgttccc tacatgggtt aggatgtgta gtaacagcag tgatggtgag tgctgtgagg 107821 cgcagctgtg ccaaccactg tgtgtgcagg tttcctctag gctgtgagtt ccacgaggcc 107881 agggtagacc tgcctgcccc aaggccctgc tcagtgtctg gcacatagta ggtgcacagt 107941 aaatgtttgt tcagtagtga atctctccat aggctcacct ctgcaaatac ctagcaacaa 108001 cttgttccag atgcaagaag tccctgctcc ctgccctgtt ctcttgcctg attcctgggt 108061 cctgcctcct tgtaccccac tttccaccaa caataggacc cctgggattg gaaggcagag 108121 ggttggggcc ttggtctgat gctctggccc tgatcccctg acataggtta tgtggcctat 108181 cgcgatgaca agggctcaga ctttatccag acactggtgg aggtcctcag agccaacccc 108241 gggagagacc ttctggagct gctgactgag gtgtgttggg gggttccagg gtgacaagtg 108301 gcaaggagct gggtttgccc ttctccccag ccctggtatt ctgatcacct cctatgaact 108361 ccattggcga aggagggatc ctctgccctc aatactacaa gataaccaaa cgcaggatgg 108421 ccgacgctgc acagatgcca tccactgcag ttcttagtca cacgtactgc agtcgggtgg 108481 ggaggaggac actgcatgtc atgcggggcc acctgggctt gtgctcagag cagggtgaac 108541 ctgcagggcc agtgggaagc tggctttgta gtgacaagag agtgagatgc cccctggttc 108601 ccacgggcag atgtggttgg ttggtttgaa tatttccaag gcctgtcagg gggctgaagt 108661 ccattagggt gacgaccagg tggggtgcag ctggtctgct gagaggggac ctagggggtg 108721 ggagcctgtc ctgctgggtg gggacatgtc tggccagaac agaggaattc accgttaggc 108781 ctctggagct ctgcgagcct caaagatgtc caggcagtcc ttgaaatttt aggccttgca 108841 atgtaccaag ccaggggctc tcccctgggg caggctgagc cccggagggc tgtaaccccg 108901 ggcgcaggct gggtgtggtc ctcaggtcaa caggcgggtg tgcgagcagg aggtgctggg 108961 ccccgactgc gatgaactcc gcaaggcctg cctggagatc cgcagctcgc tccggcgccg 109021 gctctgcctc caggcctgag ggtgcggcgg ccacgggggc gctgctgaga cggtggccag 109081 atcccagcgc cattcttgcc tccatccacc ccccatcccc ccggtttcct catctgagag 109141 cgaggcgtgg cagcgtgggg gtggccgtgc aataaatatc tgccgtgaac agtgcctgtt 109201 ctcaaggacg tgtgaaataa gtgagataat gtgtgtaaag cgcctggcac tgcatatgga 109261 cgcaatagtg tccatgggaa ccgagttggg gcctgtgccc agcttccggg acttaggaac 109321 ccaacatggg ctggactggg ctgggctggg ttggcagttc ggttatggga gcgtccaggg 109381 ggagaggggt cccaggaaga ggacgcatat ccctctccca gacatgtccc tgtgtatgcg 109441 cgccccacct ccaccccgcg ctgggccata gaaactcagg agacacggcc tccgctttcc 109501 aggcggccaa aataaagtct ggggccaaaa gcctctgaaa cacacctttg caggttcacc 109561 gtgatggagc aagcacaggg gctgggcaag cagcttgcag cccctgaccc aaagcggcgg 109621 gaggcacatg ggggaaagcg ctcccaaacg gcgtctcgga ggggcctgct gcggggaggg 109681 tggggtaggg cgggctcgcc agggctcgaa ctcacagtcc cgggcccgcg gcggcggttt 109741 ccacccagca gcctcagcgg cccggcgcgg tggcccagct cggagtcccg cccaccggga 109801 gcctcggaag gaccggcctc ccctcactct aagcatgcgc gttaaagcga cagcataggc 109861 tatagaaagc tttatgagaa tagctgggag ccatacagcc gcagggccgc gtggcctaat 109921 ggataaggcg tctgattccg gatcagaaga ttgagggttc gagtcccttc gtggtcgttg 109981 ccatgttaac gttttcttcc agctccactt aaaatttctt cacctgggac tgcaccctgg 110041 gcgactacga cactcgaact gagtgcgccg gtcgctctcc cctcaccccc gtaggaacac 110101 tgggacctcg ccctgcgcct ccagtcgcga agctaaggac ccagggccag aatagcagcg 110161 agggcgactt agagaacgcc ggcagggccc tggcggactg cggagcttga agccagagaa 110221 cgcaggtcct gggatctggt aggggattga cttcacaaat taacgttcag gtgatggaaa 110281 ccgtgacaga cgcggggctg ggattgcttg ctgtgaggag ttgccccaag ggatcaaagg 110341 acacctgtag ttttttgttt ttgtttttgt tttttttttg agacggagtt tcgctcttgt 110401 tgcctaggct ggagtgcaat gtcttgatct ccgctcactg caacctccgc ctcccgggtt 110461 caaacgattc tcctgcctca gccttccgag tagctgggat tacaggcatg cgccaccagg 110521 cccggctaat ttttgtattt ttagtagggt ggggtggtgg gcgggggtgg gttctccatg 110581 ttggccaggc tggtctcgaa ctcccgacct caggtgatcg tccagcctcg gtctcccaaa 110641 gtgctgggct tacaggcgtg agccaccgcg cccggcgaca tgtgtagttt aattgattat 110701 tgtgatgaaa aactttgtat caactgatca tggggtaaca aagtaaaaaa gaaaccgatt 110761 atagtggtga tgagccgggc gcggtggctc acgcctctaa tcccagcact ttgggaggtc 110821 gaggcgggcg aatcatgagg tcgggagttc cagaccagcc tgactcacat ggtaaaaccc 110881 cgtttctact aaaaatacaa aaattagcgg gacgtggtgg cgcgcgcctg tactcccagc 110941 tactcaggag gctgaggcag gagaatcgct tgaacccggg aggccggagg ctgcagtgag 111001 ccaagatcgc gccactgcac tccagcctgg gcgacagagc gagactccgt ctcaaaatag 111061 aaaataaata aaataaaata aaataaaata aaataaaata aaataaaata aaattcaaag 111121 agtatctcca ggattgagtt aatatttcgc cgaggggaaa aaaaatgtct accagcggaa 111181 cccgagttta gtccgggtgc cgagcagcgc ctcttggggt atccttccac ggggtggggt 111241 atccttccac ggggtgaagg ctgcggagag tcgcagctgc aggcatggcc tccggtcggc 111301 ggacgctggt gacctggcgt cgtgcagggt gtcactttct cctttaattt tttttgatgt 111361 ttcagtggtt tacgaaggta tttgtttttg gtagagacgg ggcgctcact attttgccca 111421 tgatggtctc gaactcctgg actgaagcgt ttactaggta ttaaaggaaa agaacgtcat 111481 taaaatacag attttttaaa aaaggaaaat aacacacgtg ttggggcagg aaagtaaaca 111541 cccaacgtgg cgatcctgag attaacacct agccagctgg ggatgtagag ggcggatttt 111601 ggagggtcat tattaagagg cttccgacca ctgggcggca cccttggtct aacggacaat 111661 gcgggggagg aggacgaaga ccgacacttc cagaagcggc gggctcggga tgaggagcca 111721 gttgctgagc tggcatcgcg tgccccttcc catacactct acccatcgca agtgcatcat 111781 tcgctactaa ttacccctag acttgggttg tctcctgagt ggagacgacg gcttgtgaag 111841 gagttgatcc aggcacctct cgcaccctaa gcgaagatca tacccctaga ccaaagagcg 111901 gagccaaaaa tctgacgttc ggtgtctttc aggtccctct gcacgtgcgc cgtctctttg 111961 gggtcccgag gatgctcagg agcggacgag atctgaaccg aagcttggat tgcagtggag 112021 gggcgggaga aaggccaggg taggacgggc aggctgtgca ggaaccaccg cggagtgatg 112081 gagaaactgg tctaagacaa gcgacagcgt tttgttatcc gccccggtgg cctaatggat 112141 aaggcattgg cctcctaagc cagggattgt gggttcgagt cccacccggg gtaaagaaag 112201 gccgaatttt agtgttcctt atcgggcaga agagttagaa tgcggtatac tccattggag 112261 gtgcggagtt tccgaagggt tactaaaaag gctctaaatc agaaacctct acctgttttc 112321 aaggaagacg agcacaattt cacgttacag aaaaaaggag ccgaatcttt tacaggatgt 112381 taagtaccat ccatttaact tgatagaaag gaatgattgt ttatttttcc aggtttgtta 112441 attctattat ggttacaaat attttttaaa tccttttttt tataaacaca attatttaca 112501 gattaaatta caggacaact ggtttgtttt aaaaatcata aaactgaggc caggcgcgct 112561 ggcttaaact tgtaatccca gcctgtaatc tccgtttcaa aaaaaaaaaa aaagaaatca 112621 taagatggat ggcggtatag atgaaactac ttatccatga gtagataatg ggggacatga 112681 ggggacattg cactattttc tctcttttaa tacatatttg aaatagtcca tagtattctt 112741 tttttaaatc catgcaatat atgatgccat cttcaaggta aaaatttccc gaactacagg 112801 acatggtgta aagtaaaatc agaagccgct ctgatgggga ccactgctgg agttttgtca 112861 taaatatgta taaaaacatg gttggctttt gcctttaaaa ccaataattg gcccgggcgc 112921 agtggctccc gcctgtaatc ccagcacttt gggaggccga ggcgggcgga tcacgatcac 112981 gcaccagcct gaccaacacg gtgaaacccc gtctctacta aaaatacaaa aattagccgg 113041 gcgtgggggc acgcgcctgt aatcccagct cctcaggaga ctgaggcagg agaatcactt 113101 gaacccggga ggtggaggtt gcagtgagcg gagatcgcgc cactgcactc cagcctgggt 113161 gacagagact ccgtctcaaa ataaataaat aaataaataa aactcataat tgggactctt 113221 ctgtttttcc agctctggcc tggcttctcc cagttcccca aacatcctgg accccatcat 113281 ctccactgaa atcagggacc tcacctcaat ggcattacct tcctcacctc aatggcgtta 113341 ccttcctcac ctcaatgggg ttaccttcct cacctcaatg gcgttacctt ccacacctca 113401 atggggttac cttcctcacc tcaatggcat taccttcctc acctcaatgg ggttaccttc 113461 ctcacctcaa tggcattacc ttccacacct caatggggtt accttcctca cctcaatggc 113521 attaccttcc acacctcaat ggggttacct tcctcacctc aatggggtta ccttccacat 113581 ctcaatggca ttaccttcct cacctcaatg gcattacctt cctcacctca atggggttac 113641 cttccacacc tcaatggggt taccttccac acctcagtgg cgttaccttc cacacctcaa 113701 tggcattacc ttccaccatc ataggccacc tgtagagaat ggccaccaga gagctactta 113761 aggaagctga tgcttcagtg atggattttt tcgggaggcc ttgtgaaagg gagtgtcctg 113821 gcctggtgcg gtagctcaca cctgtaatcc cagcactttg ggaggccgag gcaggcggat 113881 cacgaggtca ggagttctag agcagccagg ccaatatggt gacaccccat ctctactaaa 113941 aatacagaaa ttagccgggt gtggtggcgc tcgcctgtag tcccagctac ttgggaggct 114001 gaggcagaag aatcgcttga acccgggagg cagaggttgc agtgagctga ggtcgtgcca 114061 ctgcactcca gcctgggcga cagagtaaga ctccgtctca aaaaaaagag aaaagagaaa 114121 aagaaagaaa gaaagaaagg gagtgtcctc tggtcttctc agtggtatgc acatcatgcg 114181 tgaaagatgc actccaacat tgcttggtgt ctttaggttt ctttctctac agtttgcctg 114241 agacgttctg gcagcccggt ctcattaaac ataggccacc actgagttca attttcagcc 114301 aaccaaccaa gaaacctggt agggccatgg tcagctgcat gagataacat attaactata 114361 ggctctttgg gatatgcacc cagatggata aattcaccag atacaaaatc atatcccatc 114421 ctccttatga tggcttaaag tagccaaact tgttttgttt ctgttttgtt ttgtcatgta 114481 ggctgttttc tccaaggtca acctttgtag ttcaagattc tcaattttga gtccaatctg 114541 atgtcaccac ttctgcctaa tcaatgcagt atctttcaca ccaaagattt cctaaggctg 114601 tgaattcaat ttacgttgcc attctgcaac atgcctagtt aaatgacatg tttgattttt 114661 agcaatgtca gccttgtggc tttaaggtta ttgtatggtt gccataaatt gtccctgcgt 114721 ctctgaactt gacttagtct aaaagtttag agacctgatc ttgtcatctt ctttctgtaa 114781 gcaaaccagg gccttcagaa gacttctttc cacgccatga actaacgaaa gtctgtgtgg 114841 tcacctgtgc tgcaacaatc aggcacagca gccactggtc acccagggca cttgcttcag 114901 cctgcaactc ctcataatca accaggccac gcattgcctg atcacgatgc tgttgtatac 114961 tgtgaattat gtcatctcac gttccattgg taaggacctc agcactgcat tgaagccaaa 115021 atttgttacc aaaccaatgt ccaggtagca gtcaggaaac catcagctat gttttttttg 115081 tttttgtttt tgtttttttg tttttttgag acagggtctc actcagtcac ccaggctgga 115141 gtgcagtggt gtgatcactg ctcactgcag cctcgacctc ccaggttcaa gtgatcctcc 115201 ccaatcagcc tcctgagtag cttgtactgc aggcacatgc tacttatttt ttgtagagac 115261 ggggtcttgc tatgttgccc aggctggtct gaaactcctg ggctcaagca atattcccac 115321 cttggcctcc caaagtgctg ttcttatagg catgagccac cacacctccc attagctatt 115381 ttcacagaat atttaatatg acagttatta gagaggtata aatacttgtt ctggtttttg 115441 gagatagggt ctcactgtgt tgcccaggct gcagtggcac gatcatgggt cactgcaccc 115501 gcaaattcct gggctcaagt gatgttcctg ccttggcctc cctaaatgct gggattacag 115561 gcctgagtca tggtacctag cctagaacca gattataagg gtgaaattct aaatttgact 115621 cccctgctgg ctggctggct ggctggctgg ctagctggca atgaggacca cgtccctgat 115681 ggtaggtgag agtccctggt gtgctgtagc gtgcaattat aagaattttt gccagtgtta 115741 aactggggtg aaatagaagg agaattcctc ttaaggcaaa tattatgcag agagggtccc 115801 aaatgctcag tttactgaat atggaacgtg tatgcaaatt ttatacatac ataattttta 115861 tttctttttt ttttttcttt tttttttgag acgtagtctc actctgttgc tcaggctgga 115921 gtgcagtggc acgatctcgg ctcactgcaa gctctgcctt ccgggttcac gctattctcc 115981 tgtctcagcc tccctagtag ctgggactac aggcgcccac caccatgccc ggctaatttt 116041 ttctcttttt tagtagagac ggggtttcac catgttagcc aggatggtct cgatctcttg 116101 acctcgtgat ccgcccgcct cgacctccca aagtgctggg attacaggcg tgagccacca 116161 cgcccggcca atttcttttt attattattt tgagacggag tcttactgtg ttgtccaggc 116221 tggagtgcac tggcacagtc tcggcttact gtaacctcta cctcccgggt tcaagtgatt 116281 ctcctgcctc agtctctgga atagctggga ctacaggcgc acaccaccaa gcctggcgaa 116341 aatttgtatt tttttttaag ggccgggggt ctcaccatgt tgcctaggct ggtctcaaac 116401 tcctagggtc aaagtgagag gattataggc atgaaccacg acgctcggcc tctttttttt 116461 atgaaacaag acaaagggga tcctgaagcc agttggttgg tggagcgttt tgcgtttgcc 116521 tgaccaacaa ttcccgattt gcaaggtata cggagtcact tccacgtttt cttatttagt 116581 ttactttctt ccaagttgca agtctaatcc atccgacaaa atctcacgcc caacgtgggg 116641 ctcgaaccca cgaccctgag attaagagtc tcatgctcta ccgactgagc tagccgggcg 116701 gctatgagga aaatggtttc ccacacctct gttggagctc tcccgaactt ccctattcta 116761 tataaaattg gtggggatga gtcctggatc cgagacatga gacatgctac atcctgagcg 116821 ctagagccct aagctcggcg cggaccgagg acgccgccag gcccgtgcgg tctgccgtcc 116881 ttcgcgggtg tgtgtcatat ccgccgaccc tcgtgggggt gtatcctgtc tgccgaccct 116941 cgtgggcgtg tgtcccgtct gccgtccctc gcgggcgtgt gtcccgtctg cggtcccttg 117001 tgggggtgtg tcctgtctgc cgaccctcgt gggggtgtgt cctgtctgcc gtccctcgcg 117061 ggtgtgtgta ctgtctgccg tccctcgcgg gtgtgtgtcc tgtccgccgt ccctcgcggg 117121 tgtgtgtcct ctctgctgtc cctcgagggt gtgtgtttcc tgtctgccgt cccttgcggt 117181 gtgtgtcccc tgtctgccgt ccctcatggg tgtgtgcggc acctgcagct cccgggagcg 117241 ggaacgtcgg ggacaagagc tgaagggacg aaggacactg gagatggcca gggggaaact 117301 cggcgtgact ctctcttagg gctgtgtgag ggttttcaaa ccctcatccc tgtttagggc 117361 cggcagcgag gcctcaccac agggacaagc ccggtagacc tttctcctcc ttaggctccc 117421 agtaaaatgc tgggtccctt gccagtggaa aacggaagaa aagtttggag cgtttctgga 117481 gccgcccgtt tgggaagaaa tgaactctga ccccagctcc ggcccgtgta tcggaggtcc 117541 cagggctgcg agggtcgcct gggcagagct caggccaaga aggagctcgg ctgaggacag 117601 gagcccgggg tctgtgtggg agacgggatc ccttcctagg atgcccgggc tggcctgctg 117661 ggaggcggcg gggtctcctc ctcagggtcc tgcactcggg tttcacatcg ggtgaaattt 117721 ctgggctggg gctcagcacg cgagaccgct gcctcacgcg ggccacacag accacctccc 117781 cgcgggccgc agcccctggt caccggccct cctctcggtg ctggggccgc ctggagccgc 117841 tgggagccga gtggccctcg gggaggcggc gccgcggtgc ccagggtgcg gggtccgcct 117901 ctgtgcgcct gagggcgggg gtcgtacagg agctcttccc ggggctggtc cccgaggcgc 117961 tgcacctcct accagcttcc tgtgatcgaa accgaaacct ccacctgcgc aaccctccca 118021 gatgggcttg aggttggggt gtggccggga cccgggcggg gaggggtagg ggtggagtga 118081 aggcggggtc tgggggggtg cgcgtggacg gggagagtgg aaccgcgggt gtggacctcc 118141 aagtatagac gtcagagaac attcgtcgtc ctggctcgtt ggtctagggg tatgattctc 118201 gctttgggtg cgagaggtcc cgggttcaaa tcccggacga gcccgctttt tttttttttt 118261 ctgaacacca tcattttctc gcggtgaaga ttgaggagct tcacggagtc tcggccgcag 118321 ggaacttggt ctctccgtgg cagctgcagg cgccggggcc gatccgactt cccgcttccg 118381 tccggcttgc ggcccctcgc cttgcccggc tacaggaagg actcgcctgc gcctcctgac 118441 tggagccttc cacctgggag ttctgagtgc gcctaaggcc tggcgctgag gcagtggacc 118501 ccagatccgg gacccgaaag gagggcaaac acctccaccg ccgcgtctgc ctctgcggac 118561 gtggccctag atcccgggtg gccatggaac ttggtctggg cgtgtagttt tcctttgagt 118621 gggtcggttc taagtttaaa tttccgacga gcccagggtt tcgaacttcc ttcctttttt 118681 ccttcgtttc tctttctttt cttttctttc tttctttctt tttctctttc tttctttctt 118741 tctttctttc tttctttctt tctttctttc tttctctctc tctctctctc ttgctttctt 118801 tctttctttt tctcttttat tttccttttt cttctttttt tgttttttgt tttttttgac 118861 agtgtctcct tctgttgccc aggctggagt gctggagtgc agtggtgcga tctcggctca 118921 ctgcaacctc aacctcccaa actcaaggga tcctcccacc tccacctccc aagtagctgg 118981 gactacaggt gctcaccacc acgccaggct aagtgtgtgt gtgtgtgtgt gtgtgtgtgt 119041 gtgtgtgtgt gtgtgtgttt gtgtgtgtaa aaccgaggtc tcgctgtgtt ctggaggctg 119101 gtctccaact tgcactcaag caatcttcct acttcggcct cccctcacaa agtgctgcaa 119161 ttacaggcca gaaccaccat gctcagctgg ttccttttcc ttccaggtcc tgaggtcgca 119221 gcggcgcgtc cctccggtac atggcgggga agaggtgcgg ttccgggagc ctgcgacgcc 119281 tctggggagt ccaagcccgg ctcctagccc tttgcccgag gacggaggag gacctggggg 119341 gcttccccta gctgatggtc tcagttctcc aggggcggcg gagacagcag gaagcccaaa 119401 acccgagctg ggtttcctgc ttggagtcgc agggtgtggc aatgttgaag ttcgccgtgc 119461 ccctgctccc tatgggaatt agaatccggt ccaggacaag catcgggaaa gggggacagt 119521 ttgtgaccga ggaaactagg actgatccag aacagcgcag gcagagacgc gctgggctca 119581 tgaagcgcga caaaggcccc gacccctctt ccgcttggct cgttggtcta ggggtgtggt 119641 tctcgcttag ggaccacagg gacaagcccg ggagacccaa gaggtcccgg gttcaaatcc 119701 cggacgagcc cacactttaa gaacccagca ggggctgggc gcggtcgctc acgcctgtaa 119761 tcccagcact ttgggaggcc gaagcagtgg atcacttgag gtcaggagtt caagaccaac 119821 ctgggcaaca cggtgaaacc ccgtctcttt actaaaaata caaaaattag ctgggcatgg 119881 tggtgcacgc ctgtagtccc agctactcag gaggctgagg caggggaatc gcttgaacct 119941 gggaggcgaa ggttgcagtg agccgagatt gtgccactgc gctccagcct gggcgacaga 120001 gcgagactct gtctcaaaaa aaaaaaaaaa aaaaaaaaga acccagcagt ttgggaggcc 120061 gaggcgggcg gatcacgagg tcaggagatt gagaccatcc tggctaacac ggtgaaaccc 120121 cgtctctact aaaaataaaa aaaattagcc gggtgtggtg gtgggcgcct gtagtcccag 120181 ctactcgcga ggctgaggca ggagaatgac gtgaaccagg gaggcggagc ttgcagtgag 120241 ccgagatcgc gccactgcac tccagcctgg gcgacagagc gagactcagt ctcaaaaaaa 120301 aaaaaaaaaa aaaaaaaatc aatgtttgtg tctgtggctt gtccttgggg cacatgcgtc 120361 gtgggacaca ggctgtttaa ccagggaatt tttttctggg gcagtcaggg tggatcttgt 120421 cctgagggac tcctccgagt gactggagtg gtggttgcat cccggaatga gtgaggcttg 120481 tgacctggag ctcccagaca aaacctactc actcaatgga gcgtgtcccg cataaggtga 120541 gctttttacg actgcagatt tgctctcctg ccgaggcaag gctgatagca gtggctcact 120601 tcgttgacta aactaaggaa aactgtgggt ggctttgtct ggtaactggg ttcacatccc 120661 tcataggact gattctaaag ttgccttgct cagagagaaa tgaaacgggg ccgggatcca 120721 gtgtagagag agctttggct ggctggcggc gcgcaaggcc ggactgcaca ggtggcaggg 120781 gcgtggttgt caaataatca aaatgatcag gcacttcagg tccccgggaa agcgggcaca 120841 gaggtggcca cccctgttcc cctcctggca ccatgatggt caggaccgcc gggagcccat 120901 gcccacctcc ctcagctccc gtggtgcgcc cgggggaaat gtggttggaa atcgcggcgg 120961 ccgggcatgg tggctcacgc atgtaatccc agcactttgg gaggccgaag tgggaatcgc 121021 ttgagcccaa ggagttcaag cccagcctgg acaacgtagt gaaaccccac ctctgcaaaa 121081 caattttaaa agttagcagg gcgtggtggt gcgcgcctgt gatccaagct actcctgtgt 121141 ctgaggcagg aggatcgctt gagcctgtct caaataaaca aagggcgggg cgcggtggct 121201 cactcctgta atcctagcac tttgggaggc cgaggcgggt ggatcacttg aggtcaggag 121261 ttcgagacca gcctggccaa gatggtgaaa ccccatttct actaaaaata caaaaattag 121321 ccaggcgcgg tggcgggcgc ctgtaatccc agctactcgg gaggctgagg caggagaatc 121381 gcttgaaccc gggaggtgcc cctggcgaca gggcaagact ctgtctcaaa agcaaacaaa 121441 caaataaaaa acagaaacgt acatgggaat gggggaggtt gtgtgcctgt gggggcagag 121501 gaaaatttgc aactctgcac ttcccaccca atttttctga gaacctaaaa ctgctcttaa 121561 gctgtttttg tgtgtgtgtg gttttttttt tgtttttttg tttgtttgtt ttttctgaaa 121621 tggagtctgg ctctgtcgcc caggctggag tgcagaggcg agatctcgga tcactgcaac 121681 ctctgcctcc tgggttcaag caattctcta cctcagcttc ctgagtatct gggattacag 121741 gcgcctgcca ccatgcccag ctattttttt tctttttttg tttttttttg tatttttagt 121801 agagacgggg tttcaccatc ttggccaggc tggtcttgaa ctcctgacct cgtgatccac 121861 ctgccttgtc ctcctaaagt gctgggatta caggtgtgag ccactgtatc tggcctgctt 121921 tcgtttgtaa aggccatatt tttcttctct tgggtctcca ataactgtga tgttagatct 121981 tttgttaaag tcccacaaac acccgaggtt ctgttatttt ttttaagttt attgtctctc 122041 caggtaattt ctattgttct gtcttcaaat ttgaggagtc tcctcttttc ttccattctg 122101 ctgttgccca ttcattaaag ctccttcctt tggttttgta tttctagggt gtaaactttc 122161 cgtttagtta ttctttatat tttctactcc tttcctgaga cttcatattt tttcatttgc 122221 atgaagcagg agttttagct gaagcatgtt tatggtggct ggttcaccac ctttgtcatc 122281 ccgtcagata gtccaacccc ggtatcagtg tccgttgtcc gttgattgac attgttttgt 122341 tgttgttgtt gttgttgttt tgtttttttt ttgagacgga gttgcccagg ctggagtgca 122401 gtggtgccat ctcggctcac tacaacctcc gcctctgggg ttcaagtgat tctcctgcct 122461 cagcctcccg agtagctggg attacaggcg ggtgccacca cgcctggcta attttttgta 122521 tttttagtag agacggagtt tcatcatgtt ggccaggctg gtctccagct cctcacctca 122581 ggtgatccac ccgcctcagc ctcccaaagt gctgggatta caggcgtgag cccccgcgcc 122641 tgcgcgcccg gcctgattga cattttcaag ctgggatttt ccaggctttg agaatgacca 122701 ggtctgcgat gacactccct ggctgggagc aggagggact cctgagactg ctccccttgt 122761 ggacaaagag gaagttgggc cgatggccga tgacggccag gccgcgaggc tgctgctggg 122821 ccctgggtca caggtcgagg tgtcaggcct ctgagcctaa agctcaacca ttataacccc 122881 tgtgacctgc acatatacgt gcagatggcc tgcaggagcc aagaagtctg aagaagccaa 122941 aaaaaccaca aagaagtata acagccggtt cctgccttaa gtgattaacc aacattacaa 123001 cattctacca ctgtgacttg tccctgccct accttggctg atcaatcgac tttgtgacat 123061 tcttcttttg gacaataaat cttatgacct ccctaccaca taccttgtga ccccctcctc 123121 tgctaacaat agataaccac cttttactgt aattttccat tacctaccca actcctacaa 123181 agcaacccct tccccatctc ccttcgctga ctcctttttc ggactcagtc cgcctgcacc 123241 caggtgatta aaaaagtttt actgcttaca caaagcctgt ttggtggtct cttctcacag 123301 atgtgcttga cagaggccgc cctgcagttc ctgttccgat ctaaggaagg gaaggggagg 123361 cgaggaagaa ctgggtccca agacaagagg gtccatttcg tgaggcacac actcaggaca 123421 gaccctaccc cgctgctcta ctctttcgct tcagtccctc cttcccggcc tccgctggcc 123481 ccctgacccc gcactgatca ttaatgcaga agcagcgcag accagcagcc gggatcgccg 123541 gtaatcccag cactttggga ggcggaggcg ggcggatcac ctgaggtcgg cagttctgag 123601 gtcggcagtt cgagaccagc ctgaccaaca tggagaaacc tctgtcatga gccgcttgcc 123661 tccaccgtgc atgcagatct gcaggggagt gggaagaggc aggttctttg ggtcatttga 123721 aacggtttgg agatatgcag aggctgggct caatattaag taggaattgg gatttatatt 123781 ggcagcgaag cgtggaccca ggccccttcg gatccgcact tgggccccag agagaggttg 123841 ttcctttcct ccccgcctcc cactcaggac cacccatggc taatggtccc tgactacaaa 123901 ttccacctga ctggaggctc gggtttgagg ttacacaact cgctgagaag gggaggatag 123961 gggctcccag atgtccggag accccggcag ggactgtgca cccagctgtc tgaatgccag 124021 aggccggctc cgcgggtagc catccctgca ccacctcctc gggactagag tcaccacagc 124081 ctttgttttt ttgtttttgt ttttgttttt gcagcgcctg ttttgaggat tctgagtatt 124141 agggtgggaa gcaagtatta ttacatcagc ctggctagct cagtcggcaa agcatgagac 124201 tcttaatctc agggtcgtgg gctcgagctc catgttgggc ggagctttta ccattcttgg 124261 tacacagaat tcagcatcat cgcaggtgac ccagtttccg aagcttggcc cctaactgga 124321 aggtaaagtt tcactgagtc ccaactcaat ctgtgtcccc atcagaggtt tcaaggcctc 124381 ctcccaaatt catcaatctg accactggac tgctgtgtta tctctcacag ccggtgtccg 124441 agaaccagag tctcttaggg aaaaggaata ggagatttgc accccaggaa gttctgtctg 124501 ctcaccctag gctggggtag agaacctgga tccctttacg ttagcacaag acccaaagtt 124561 tccttccctc ccccaggaga ctcaatccca gggctggagg tggatgtgct gcttccaagt 124621 tcagccaagg gagaagggcc ctcccaccca gttcagctga gactgtgggt caatgtaggg 124681 ccagccctac acggtctgtg gggttttctc cccatgtatg tagatgagag accatagaaa 124741 taaagacaca agacaaagag ataaaagaaa agacagttgg gccccggggg gaccaccacc 124801 accaagacgc ggagaccggt agtggccccg aatgccaggc tgccctgtta tttattggat 124861 acaaggcaag ggggcagggt aaggagtgtg agccatctcc agtgataggt gaggtcacgt 124921 gggtcacatg tccactggac ggggcctttc cctgtatggc acccgaggag gagagagaga 124981 gaggaaagac agcttacgcc attatttctg cattttagag gcttttaata ctttcactaa 125041 ttctgctact gctatctaga aggcagagcc aggtgtacag gatggaacat gaaagtggac 125101 caggagcgtg actgctgaag cacagcatca cagggagacg gttaggcctc cagacctgac 125161 taatgtcagg ccctccacag gagatggtgg agtagagtct tctctctaaa ctcccccggg 125221 gaaagggaga ctccctttcc cggtctgcta aatagcgggt gcttttcctt ggcactgacg 125281 ctatcactag accacggtct gcttggtaac ttccgtcttc ccagacgctg gcgttaccga 125341 tagaccaagg agccctctgg tggccctgtc tgggcataac agaaggctca cgcttgtctt 125401 ctggtcactt ctcaccatgt tcctccagct cctatctctg tatggcctgg tttttcatag 125461 gttatgattg tagagcgagg attattataa tattggaata aagagtaatt actacaaact 125521 aatgattaat catacttata tatagtcata tctatgatct atatctagta taactcttgt 125581 tattttatat attttattat actggaacag ctcgtgccct cggtctcttg cctcggcacc 125641 tgggtggctt gccgcccact ggtcaacatg gagagaccac ctcccacctg ctccacaatc 125701 ccccggctaa ctcaggctct tgggagagac accagggccc cagaaaccag tatgacccag 125761 ccattactta tctctctcag tgtttggatg gtggcccttg ttattaggag accaggagtg 125821 cagccaccca gtccctccaa aacatgccta acaggttttt tggaggcgaa ggcaaccctg 125881 tgtaggatgt taatccacca taaagatttc tcattaatcc cacttctggg acagcctcta 125941 agtattccag ttcatctatt gtttcttgtg taagactaac cataaattct gcccttaggt 126001 caaaacaacc ttgctgctat tgcatttacc ataaatcttg cccttaagca agtgccttaa 126061 acattccttg tgaagcacat acaccctttc cctgtggtat gtagaccctg ggtctgggcg 126121 gtaaaggcat ggagacccac tatcttgtct ccctgccacg agagacccag aagtggattc 126181 tgttcataag tccctaataa atgtttcttt cctaagaaag atgatttgtc agtgtttttc 126241 ttcagcctgg caccttcctt ggactctggg tgtaggtttg cattatggtt gtaaccagcc 126301 aaggtgtgtc tttggaatgg gagtcaaacc ttgatctctg ggctttagag tggctttgga 126361 cacatttccc agtgggggtg ctgaggcagt ctgggggccc tgaggaaggc agatcagagg 126421 ctgggacaaa aggctacagg cacctcacca gcgcaggtcc caataatgca gcaccatcgc 126481 ctcagcctct ggtcagagtt cgaatcccac ctcgaggatg ttggctcttg gggccttata 126541 accatgacct tgtgtcacag cgatacatgg ggaatggggt tggggtcact tgtatccttg 126601 gcctgtgagg gttgcagagc caggggtgga ttgcggactg gtgcccactc ttctgtccga 126661 agctatcaat cctggacatt tttcagggag gggatagggc tatcctgtga ttgtcattga 126721 gtctttttag aaagattgga gctatcaggc cgggcgtggt agctcacgcc tgtaatccca 126781 gcactttggg aggccgaggc gggcggatca cgaggtcagg agatcaagac catcctggct 126841 aacacgggga aaccccgtct ctactaaaaa tacaaaaaat tagccgggcg tggcggcgtg 126901 cgcctgtagt cccagctgct ggggaggctg aggcaggaga atggcgtgaa cccaggaggc 126961 ggagcttgca gtgagctgag atcacaccac tgcactccag cctgggtgac agagcaagac 127021 tccgtctcaa aaaaataaaa ataaatagaa ataaaaaaaa aaaagaaaga ctggagctat 127081 caaataaagg ctgaactagg tatattccaa gagtacaagg ttggcctaaa ttcaagaaaa 127141 ttatcaaaat aattcatcat attaacagaa taaaaaagaa gctatgtggt gatcttaatg 127201 tttggaaggg cgcttgatgc aattcacacc tgcattaatc gtttgtttgt tttagaggca 127261 gaatcttgct ctgtcaccca ggctggagtg cagtggtgca atctccactc actgcaacct 127321 ccacctgccg ggttcaagca attctctctg cctcagcctc tcgagtagct ggagtagcta 127381 ggattacagg tgcccaccac accaggctaa tttttgtatt tttagtggag ttggggtttc 127441 actatgttgg ccaggcttct ctcgaactcc tgacctcagg tgagcctcgg cctcccaaaa 127501 tgctgggatt acaggcatgg gccaccgtga ccagccagaa tcttactttg ttacccaggc 127561 tgcagcatgg tggtgagaac acagctcact gcagcctcaa ccttctcagc tcaagcaata 127621 ctcccacctc atgcaccagc taattttttt gtgtgttttc tgtagagatg gggcctcact 127681 atgttgctta ggctggtctc aaactcctga gctcaaggga tcgtcttgtg agacattgcg 127741 tccagccaat tattttcctt ctttgcttca actcccatca gttaccagtt cctggactgg 127801 aatgctgttc ttcctcctct ttcttgactg gccccgtatt ttctctcctg tctcagctta 127861 aatgtcactc ctagactcat tgttcccatg tcaggaactc atctatatat gacaacaccc 127921 ttatgacagc ctgtaatttt tgaaagtcta ttcattttat tttatttttc caagacaggg 127981 tcttgctttg ttgcccaggc tggagtgcag tggtgtgatc gtagctaact gcagtctcag 128041 cctcctggac tcaggaattc tcctacctca gatttccaag tagctgaaac aacagacttg 128101 tgccaccaca cccagctaat tttatttatt tatttagatg aagtcttatt ctgtcaccta 128161 ggctggaatg cagtgacgtg aactcaactc attgcaacct ccacctcccg gattcaagcg 128221 attctcttgc ctcagcctgc cgagtagctg ggattacagg cgtgtgccac cacgccagac 128281 tattttttgt atttttagta gagacagggt ttcaccatgc tggccaggct ggtctttaag 128341 tcctgacctc tagtgatcca cccgcctcag cctcccaaag tcctggggtt acacgcatga 128401 gccaccgtag ccggcctgat tgatttattt aaatttagct accgggtctt tctatgttgc 128461 ccaggatggt cttgagctcc tgggatcaag tgatcctccc tcctcggcct cccaaaatgc 128521 tggaattaca gatgtgcatc attgggccag cctatttact ttttatttta aaacaattat 128581 agattcacag taagtggtca aaaagaaaaa aaaaagtaca gaaaactctt gtgtagcatt 128641 cacccagtat cccctaagac ttccataact gtggctcagc atcaatgcca gaaaatgaac 128701 actggctgtt tttcaaaggt atttttcttg gttattgtca atcttcactt tgaactgtgt 128761 gctctttatt ttgtgcctac tattgtgccc accaaaaaat tagatgcatg ctttaacaca 128821 cgacgcctgt gatctttgag aataaaaaat ttgactccat gtagataaat ctagacctga 128881 cttgttcttt ttcattttat tttattttat tttagtttat ttgagacgga gtcttgctct 128941 tgtcgcccag cttggagtgc agtgacgtga tctcggctca ctgcaacctc cgcctcccgg 129001 gttcaaatga ttctcctgcc tcagcttacc aaatatctgg gattgcaggt gcatgcagcc 129061 aagcccagct aattttcgta tttttagtgg agacggagtt ttaccatgtt ggctaggttg 129121 gtctcattcc cgacctcaaa tgatctgccc gcctaggcct cccaaagtgc tgggattaca 129181 ggcatgagcc accctgccca gcttgacttg ttcttttttg tttgtttgtt tgtttttgag 129241 acgaagtctt gctctgtcac caggctggag tgcagtggcc ccatctcagc tcactgcaac 129301 caccacctcc ggattcaagc gattcccctg cttcagcctc ccgagtagct gagactacag 129361 gcatgcacca ccatgcctgg ctaatttttt gtatttttag tagagatggg ggtttcacca 129421 tgttgggcag gctggtcttg aactcctgac ctcgtgatcc gcccgcctcg gcctcccaaa 129481 gtgctgggat tacaggcgtg agctaccgcg cccggccgac ttgttctttt ttaaagaatt 129541 aaaatatggt ctgggtcttt tactttggaa gaggggtgtt aacttcagaa ctgtcctagg 129601 ttcagttcag atccccgcga gggaagggga ttaaagagag aaggaagttg agcctctgag 129661 cccattgcgc cccggaatct gggaggcggc gtgaccaggt gcccgggttc catccgctgg 129721 ggctgggcag cccccctgcc cccgcccagt acagaggctc ctgccggggg aggcgtccat 129781 cgcgttcgct tacttctcta gaccaacaag cgagggacgc ggagactgag ttggctgcgg 129841 ccaccgggtc cccggacacc gtcttcattc actcccaccc tcagtcaagc acctgtaccc 129901 gcagtccttt cctaaaagaa gaaaaacggc cggtcgcgcg ctgtccagag tactcagtcc 129961 tggatccagg aggtggctgg cggtgagctg atttccgggt tcttaaaccc gggcccgggg 130021 cccaaggccc cgcctggtgg ggtctcatcc tgcagtttga gaaaccgaag aaggtaggag 130081 agggaggaag cgccggcttt gcctgtgaaa gattctttca ttcgctccat tcttgttctt 130141 gcgcgctgga tttaccaggc taacaaacaa cagctgatca cggcaggctg ggctcctctg 130201 ggactcgcac cggggcctct cgcacccaaa gcgagaatca tacctctaga ccaacaggcc 130261 tcggtggcgg gaagcaccat ctttgctcct ccactccctc acccccctcc cgccccgcgc 130321 cttatctttc tggaacccaa gacttcgtat tctccgctct ccataaacgt cgatgccagc 130381 tctaggagcg tggccgggag cccacgatcg cggcagcgag aagcccaacc agaggggagg 130441 agggagtccc cggagtggga cagcgctcag gtctggggtg gggttgatct agggcaaaat 130501 agggcgggcg gcctgtggcg atgggcagga ccctctcgcc caccacacgc ggaactcgct 130561 ggatcctctc cacatccagg ccgcgtccac tggctttcca acctgctcag gtccttaaag 130621 aaggattcaa aaggtgttgc ggtattggcc caacaggatt tgaccctgag gcccactctc 130681 accctaatca taaccgcaaa accacagcgc ctggagagag agtgagagag aaacagaaac 130741 ggagcgagtg tgtgtcgctg tgatgccctt tgccgtcgct gctcatcccc agtgacctcc 130801 tggaactttt tcagctcttg ctgacagagg aagacacggg ggcagagcta acgtctgagt 130861 cagggcagag gcgctgggct ccatccgagg gaggctatgg gggcgcctct gggatggagc 130921 caaccaccgg cgcagtcgga tgaaaggtgg gctggccgct cagccatcct cctagggcaa 130981 ggccttgggt atgagtcagc agccccaggt gtgagcgcag acccggtaac cccggcgcag 131041 gggaaaatac agcggggagc cccaggctgc aggcctgacc ctgagcatcc cctaccaagc 131101 ccagtgtgga tgggctctgt ctccaagggg ctggttcacc agggtctccc cgcagcgacc 131161 ccagaattct gccaatcact tggggacggc gatgagctct atccacttcg gaatcagccg 131221 atttgtgccg gattggtggc aggtgtctga aatgtcagcg gaaatacacg cacgggaggc 131281 tcgttggtct aggggtatga ttctcgcttc gggtgcgaga ggtcccgggt tcaaatcccg 131341 gacgagccct agaagtggtt acttttccct tgtcatttta gagaatatag agctagaaaa 131401 tcggggaccg agcctgaagc ctcaactacc agtcgcgttg ctccgcttca ggtcggtcca 131461 ggtctgtgcc tcagctacaa gggacaagga tgctcctgag gctggcctgg tcggcacttg 131521 cctcagcctc gagggagtcc cgcgcccttc tccttcccaa ccccagacca gagaagctgt 131581 accctctgca gcccgggtcg ctcagctcca catgggctcc gggtatggtg gaggccggtg 131641 gttgggttct gagtgatcga gtagtgtgca cggtctgggc gggccctgga gagctactcg 131701 ttcctcgacc tcccctcccc gccctagaaa cccacatctc tgcaggccaa ggcggagtca 131761 cagatgaagc tcgttgagag caggtcaaag ctgcctgacc cgatggcccc ctgctgcgct 131821 agccaggaag gtgcccagga gccacatatg gctcttaaac acttgaaata tggctggtcc 131881 ccttgagatg tgctgctgtg agtacaaaat acacatcggg attttgaaga cttcgtacca 131941 aaaaaataag atatctcatt ctctcattac acatggaaat tagatttcgt gtctacagta 132001 ttgaataata tacacacata tgaaaaacat atgttttata tctaaacata tatattttat 132061 atgtgtttat atattatatg tagtatatat gtatcatata attatattaa taatatgatt 132121 atatcataat aatataattg tattatataa ttatactttt ctttgagaag gtgtcgctct 132181 gttgccaggc tggagtgcat tggcaccatc tcggctcact gcaacctgtg cctccagggt 132241 tcaagagatt ctctttcctc agcttctgag tagctgggag tataggtggg tgccaccacg 132301 cctggctaaa tttttttttt ttgagacgga ctctcactgt gtctcccagg ctggagtgca 132361 gcagtgcgat cttggttggc tcactgtaac ctccgcctcc tgggttcaag tgattctcct 132421 gcctcagcct cctgagtagc cgggattaca ggcgcccgcc accacacctg gctaatgttt 132481 tgtactttta gcagagacag ggtttcacca cattgaccag gctggtctca aacttctgac 132541 ctcgtgatct gcccacctcg gcctcccaaa gtgctgggat tacaggcgtg agccaccgcg 132601 cccagctatt ttattattat tattattatt tgagacagag tccctgtcac ccaggctgga 132661 gtgcaatggt gtgatcttgg ctcactgaaa cctctgcctc ccgggttcaa gtgattcacg 132721 tgcctcaggc tcccgagtag ctgggactac agggactcaa cagcactccc ggctaatttt 132781 ttgtattttt agtagagaca gggtttcacc atgttggcca ggctggtctc gaacttctga 132841 cctcaggtga tccgccagcc tcggcctccc aaagtgctgg gattacacgt atgagccgct 132901 gctcccggcc tcggacagtt cttttagggg tagagggaaa gatgacagga ccaggctccc 132961 catccctcat ctgcacccca caaacatgga cccacaaaat tgaggctctc aaaggcccca 133021 gcttacacat ctgtaatgtg aggataatgg caattaaaag attaattcga gtattaaaat 133081 ttcttcatgt tttggtaaag catgtagaac agtgcttgaa gacttccaca ttaactcttg 133141 gctgcccagc ctatacattt cagacttgcc cgatcacatg tgccaattca tttaaataaa 133201 tctagtaaaa atagcaacac atccatgcac gtgtacacat aagcgtcatc cgtctcaagt 133261 atctcttcta atggttttgt ttcttctttc cagagcctag ctgatacagc aaatcctcca 133321 ggtgtttggt atggtgattc accagcccaa gaagaagtaa aaaaacggaa agcccaggtc 133381 gaagccccag tgctggggct gaacgtgagg ggtcacccca tacacactcc cctctcgctg 133441 ctggaaaaac ctgctgaggt tagagctgcc ctcagcccct cccacttttt cttttctttt 133501 ttaaactgct cgtttcctgt ccggatagcc ttcctccggg cggcaaggcc cgaccaccac 133561 ggcagcataa cagcctgccc cagtctggct agctaggtgg gtcgagatga gactccactc 133621 cccccacccc ttactctcag gcatgtgact tgagccctct caggcatctc ccttaaacct 133681 tggacgtagg gtttgagggg cgccccctcc tggctgaact ctggggtcct ggaagaaata 133741 ggacctcagc gtggagtgag aacagaagag ccaggaaaaa cagagtccac ggagcgcctg 133801 aagccgccgc ccgcacagcc ctgccaagga ggagactcga acctggggag atgcaagtgc 133861 tgagggctca gggggggatt cttttttttt tttttaattt ttattttttc tgagacaggg 133921 tcttgccctg ttgccgaagc tggagtgcga tggtgcaatc atagctcact gcagcctcag 133981 cttcccgagt aactgggact ccaggcgcgc gccaccacgc ccgtctaatg tttaaatttt 134041 tattttttgt agagacgggt gtctcgctat attgcccagg ctggtcccca acccctgggc 134101 tcaagcgatc ctaccccctc ggccttccaa agtactggga tcacaggcgt gagccaccga 134161 acctggctag ggaattcttc ttcagccgaa cagcagcggc aaaggctaag ggtcctagaa 134221 aggaacggga aaccgtcacc tgcccaggtg ggacgcgagg caaggaaccc agcgcaggag 134281 gccgctgggt cacggaggtt tcctgtctgc ttcccgccgt cactgccggc cgcctcccac 134341 cttctcccgg ccccctgcaa cccagcccac cccagcccac ctccccctgg tggggacacg 134401 gttttctttc cggggtgcct cgcagttcct gcagtccctc cctcttcccc acccccaccc 134461 ccatcctttg ggcaggagtc acaacaattt cttgctccag tttggcagct gcggctctgt 134521 gccacaccct ccaacggggc tggatgccca gggtctgctg cctccatggg atctggctgg 134581 gctcgaagtc ggggctcagc aaaggtattt cgtgtgaacg aggaatccac gggtcagacc 134641 cgcttccaca gccctgagtc ggggcgggtg ccagggccag tcaccctgac actcaggagg 134701 tgccggcctg tggggcgtcc gcgggtgggc ctctgacgca gagaaagacc cagatccagc 134761 tacctcgggg gcctcctgct gcgcctcgcc tttggggcag gcctgagggc aacgcttact 134821 gcggagcaac tgtgttctac agtgtagtcc cgacactgaa gactcctggt cctggactct 134881 gctgtaatct agaaatccac taaggtaacg ttggcgtgtc gcccggctag ctcagtcggt 134941 agagcatgag actcttaatc tcagggtcgt gggttcgagc cccacgttgg gcgattcctt 135001 tttactgcga gcctcacccg caaggatagc attgtgagtt tctgttccaa ctgggtcaat 135061 tttcttctct cagccatctg cccctggcga tgcaagattt ggtgaaagca gatgcctgct 135121 catcccagga aaccatcctg gagtggggtt tcactactcc tgacctcagg tgatctacct 135181 gactcagcct cccaaagtgc tgggattaaa gacgtgagcc accgcgcctg ccctgaattt 135241 cctcttcttg taaggactcc agtcatattg gattaaggcc catcctaata acctcatttt 135301 gtcttaatta cctctttaaa tacccagtct ccaaatacag ccacattctg agggactgag 135361 agttagaact tcaacatatg gatggtggac agtcacaact cagctcatta aagtgaacat 135421 tgtgtacatg gtttagtatg aacatatgtc ttaaatttcc ttgagcatag gagtggaatt 135481 ccacttagga atggaatttt ggggtcattt ggtagttctg tgtttaacat tttgaggcta 135541 ttattcacag tggctggatt attttatatt cccaccagcc tgactgaggg ctcgtgtcct 135601 agaaaagtag taatagattt acatagatat tttgcatcaa aaaatattta ttctgagatt 135661 atacttaaaa accaggaaac aggctgggcg cggtggctca cgcctgtaat cccagtactt 135721 tgggaagtca aggtgggtgg atcacgaggt cgggagttcg agaccagcct ggcatggtaa 135781 aacttcgtct gtaccaaaac cacaaaaatc agctgggcgt gatggtgcgc gcctgtagtc 135841 ccagctactg gagaggctga ggcagaaaaa tcgcttgaac ccggaaggtg aaggttgcag 135901 tgagccgaga tcgcgcgcca ctgcacagca gcctgggcaa cagagcgaga ctccgtctca 135961 aaaaacaaaa caaaacacaa caaaaaacaa atggaaacaa tctacttgcc ccaaaatagg 136021 gcatagatta aatgaaatta tcataaatcc aaaatggagg ataatgtcat aaaaaataat 136081 gttatataat gatgatccca tgtgtgtaaa actctagaaa atgcaaaata atacagtgac 136141 agaaagcaga gagtgacgga ctgccaaaac ttaccaaatt gtacactttt tttttttttt 136201 aaatggagtc tcactctgtt gcccaagctg gagtgcagtg gcgcgatctc ggctcactgc 136261 agcctctgcc tcccagggtt aagcgattct cctgcctcag cctcccgtgt agctgggaac 136321 aggcacgtgc catcacaccg agctactttt tgtattttta gtagagacga ggtttcacca 136381 tgttggccag gctggtctcc aacttgttag atatgagttc caaatttctt ttcaaagatt 136441 caatatgtca gtatgttcaa ttctttacct tctactttta aacttaactt cctcataaag 136501 caaccttttt caattaccta ctccaccctg actctttcga ttacctgctc tgtaataacc 136561 atttttcccg ccaaaccatt cgctccgtca ctctctttaa attatccaat ggcaattagt 136621 ttagcctgtg cggtctaacc ctagccaata ggggaatgac acagcagcag gggccacgtg 136681 cgaaagggat aagaacccct tcccctccct tgtccaagtg tgctctcacc attgctccat 136741 ctgtaagggc gcacccttct atagaagtac cttgccttgc tgagaattaa aaacaaaatt 136801 ttatattttc gggctatttc ttttgcggca ccgaaacttt gtatataaca aactcctgac 136861 ctcaagtgat tcgcccacct ccgtctcctg aagtgttggg attacaggtg tgaaccaccg 136921 tgccgggcct acaaccggca attcttgatc cttcacaccg actctcgctg tcctccaaga 136981 tttttcctct ttacctatag agtctccagg actcgatccc caatcccttg caagccctgc 137041 agggaaactg gagggcactg gcttccccac ctttactccc cacagttgct cctcaggacc 137101 acaggaacgg aagggaccaa gcccctaacc tagaatgtta gtgattttag aaggaaaaaa 137161 ctttgggctt attcaggact agaacctgag atctcttacg ccttatgtga aaagcatgcc 137221 tgaatcaggc ttgacggcac attcctataa tcccagaact tgggtagtct gagaagggag 137281 gattacttga gcccaggagt tcaagaccag gctgggcaac caagagggac cccatctcta 137341 ctattgctac tgctactaca aaaaataaat tgccaagaat tgtggctcag cctgtagtct 137401 cagctacctg agaggctgaa gtggaaggac tactcgagcc cagaaggtcg aggctacagt 137461 aagctgtgac tgcatcactg cactccagcc tgggtgacaa agtgagactc tatctcaaaa 137521 tttcagaaaa agcaaaatag aaaagcaggc ctactgatca gtgcctcttt cttttccttc 137581 cttccttcct tccttccttc cttccttcct tccttccttt cttgaaacgc agtctcgttc 137641 ttgtcaccca ggctggagtg caatggcatg atcttgcttc actgcaacct ctgcctcctg 137701 ggttcaagcg attctcctgc ctcagcctcc caagtagctg gtattacagg cgcctgccac 137761 catgcctgcc taatttttgt atttttagtg gagacggggt ttcaccatgt tgcccaggct 137821 ggtctcgaac tcctgatctc agacgatcca cccacctcgg cctcccaaag tgctgggatt 137881 acagccttga gccactgtgc ccggcccagt gccactttcc tcagtgggct ccaagtgacc 137941 ttacagcttg acctatctca gaaggccagg tctggatttt gagtggtgtt ccttgcagca 138001 gcattaagga gcacacccga ggagttgtgc caatgtattt ttcattataa actatcctaa 138061 ataggctgag tgaggtggct cacacctgta atcccagcgc tttgggagac caaggcaggc 138121 agatcacctg gaggtcgaga gtttgcgacc aacctgacca acacggagaa actgcatctg 138181 tactaaaaat acaaaattag ccgggcgtgg tggtgcatgc ctgtaatccc agctacttgg 138241 gaggtgaagg caggaaaatc gcttcaaccc gggaggcgga ggttgtggtg agccaggaag 138301 gtgctattgc actccagcct gggcaacaag agcaaaactc catctcaaaa aaaaaaaaaa 138361 aaaaaaaaaa actattctga taaaccggcc gggcgcggtg gctctcgcct gtaatcccag 138421 cactttagga ggcagaggca ggcagatcac ctgaggccag gagttcgaga ccagcctggc 138481 caacatggtg aaaccccatg tctactaaaa acatacaaaa attagccggc cgtggtggcg 138541 cgttcctgca atcatagcta ctcaagggag gctgagcagg agaattcctt gaacccggga 138601 ggcagaggtt gcagtgagcg gatatcccat caccacactc cagcctgcgt gacagagcga 138661 gactcatctc aatcagtaaa tcaatcagtc aataaacctc actgccccag gaatgccgac 138721 accaagtcaa gcagaccttc tcttccctgc cgtccgcttc cctgacctgc tggggttggg 138781 agggcacatt ctgtggcagg ttgactgggt ttccatgttt ccatcccagc tcctcaattt 138841 tgtctgaaaa tgcaataata agagtatagt agtgaagatt aaacgagttt agataagtaa 138901 agagcttaca agagtgttag gcatggtgag cctaataagc attagctacg tctactgcta 138961 atgttgaaca tcctaatctc tgccccctgg gtcaccgtca ccagcctcac aactctgcaa 139021 gcaggtacca gcagcaccac gctggccgta caagcagaga tccagtgaaa tcttgtatgc 139081 acccagaggg tccgggtgga gggaggttgg gtaactcagg agaccacagc tgctccctcc 139141 ggctgcactg ggcacttctg ggaggatccg gcacccacct ccctgggctg gaggcctcta 139201 atcagagagg aaggcaggaa agccagacac tgggactgtc aaactagtga acattgggat 139261 ctctatcccc ctctacttga tgggacctga gtcaaactct acctgacatg accccctggg 139321 gatcagcatc tgggcagtgt tgctggacaa gggtggagcc aggcataagg acgatgggag 139381 ataagcagag aaaggttttg taggtggctc cttgtccagg ctatgattgt tgcttagggt 139441 acaagagagt ccaatcctgt acaagctaag gttctcagac tttcttctgt tttaatgagg 139501 acagccctgt tgggatccca gaaaaggaaa accctagtgg gggtcttccc cagggcctct 139561 gctatcaagg agcgaggaat tactgttctc ctggtcttcc aggtaggaac cgcatccatg 139621 gggcaggagc caggatggct ttctggaata agcaggggag ggcataacca gacattggtc 139681 actttaaatc ttctgcttga gtttcttctt tcaccaagtc ttgcattgcc aggggcagaa 139741 agctgggaga aaaagatgta ccttcatggt acagcaaaag gcacgcccaa cgtgcggctc 139801 gaacccacga ccctgagatt aagagtctca tgctctatcg actgagctag ccgggcttcc 139861 ttataggatc ttttccatat ctacaaatgg gccgagtggc ctaagggtgg tggaaacaaa 139921 gggaggtaat gaatcacccc gggaaaccag ggcgcgcttg gggtccccca caacagactc 139981 agttcgccac cgacgtcccc gctagtgcag gatttccctc accaccccgc aatctcccat 140041 aaatgggggt gacctgccag agtctccttc gtcaccgcgc cggcatatat ctgtgttctc 140101 ccagtccgtg ccctcctctg aaaaggcctt ctttcagaaa gagaaaaaac attctcagac 140161 tgcgactgca cgcacttaaa gagtcaatga caaccttggc aacccttttt tttttttttt 140221 tttttttttt gaggccactt gagttacagg gacaggagat tctgctgtgt atgggccatc 140281 cttgcagctg cagacagaaa gcccaaagct cactgggact cacagaagcc ctcgcaggtg 140341 ctgctccgca ctgtaaccta ctgtggcgtg catctagttg ttccctttcc attaccgtgt 140401 gcttctatat ctagaactat ttggttcttt tttcaatcaa gaaggttgga ggtcctgatg 140461 tcaaagatat tttctttatt gcctttcatt tctttagaca cagtagggac agttgtatta 140521 taagtcggat acactttgca tctgaaatat ttgcaatatt gtttctgttc ctgttaatga 140581 gttgttgctc ctggtacctt ctttccttgt taatttttag ggtgcaatgt ccattttcct 140641 tgcaaatttc tttgtgtagg aaattctgtg agatctggaa tacaaatgta tttgtccaca 140701 cacacacaaa attattgttt gcttctgcag gggctcaagg gcataaccag acattggtca 140761 ctttaaatct gcttgagttt ctttccactt ggtcctcaag cgctcaaaat tggattagtc 140821 acgactcttg gttcccggca tgttctcaaa cccagcatcc caaacaaaaa ggaagactct 140881 gccaatagat tctgtcctca gagtagatgt gaggaggccc cagtcctggt gggcgctggg 140941 actcaggacg cagatacccg ggttcctggg ttcagtctgt accccttcca gaactaggat 141001 aggggaggag aggagcccca gaaggctaca cgcaggtgag gagtgtcggg atgtggggag 141061 ggagaggact ctagacccca gatacaaacg cccatagccc atctcctctg ctcaagtgcc 141121 cggctttctc tccttaaggc tctagccgcc ggagccaggg acactgggtg ggaatcccag 141181 tgtagagctt ttaaggtgtc tgacagtgaa gactcggctc ttactgtata taggaaggac 141241 ttatctcttc ttcagtgcct cgaatgaatt cctactgaca aaagtagctc ataaataaaa 141301 ataataataa aagcagaaaa caaaaatagc aaaatatcaa acaaaaatat tagggaatag 141361 ggctctgacg acagtggtga gaaaactcca aataaatttt aaatgtggca gtgggtggtg 141421 tcagacagct ttagatcttt gtacaaatta acttcgaagt ctgagcgcgg tggctgacgc 141481 ctgtaatcct agcactttgg gaggccgagg cgggctgatc acctgaagta aggagttgga 141541 gaccagcctg gccaacatgg cgaaaccctg tctctactaa aaaaaaagta caaaaattag 141601 ccagacatgg tggcacgtgc ctgtagtccc agctacctgg gagactgagg caggagaatt 141661 gcttgaaccc aggaggcgga ggttgcagtg aggtgagata gcaccactgc actccagccg 141721 gggcgacaga gactctgtct caaaacaaac aaacaaacta acttcgaagg actcgggctg 141781 cgctgacctg tcagagctgt ctggatctct gatcttcatc taaagtagga aagggtagaa 141841 accgatattt tggaccaaaa accgggctcg tccgggattt gaacccggga cctctcgcac 141901 cctaagcgag aatcataccc ctagaccaac gagccacacg ccccgtagcc tccggcttgt 141961 ctttgtggcc ttcgtggagt cctcgtggcc agaggtgggg acccagcaga gcagagctag 142021 gaggagactc cgatttcctg aagtctggaa gagacaagtt caagcgcggg tggacgcgga 142081 gggagccctg agcccccgct tctccctgcg ccgcacaaaa tggggagcat cccctgtgcc 142141 cccacaatcc ggggcccccc gggccctcgc ccgcctccgc ctctcagccc tgggctgggg 142201 atccgcgcgt cacgccgcgt cgcgtagtga ggagctccag ctccggcggg acatgggaac 142261 ctctcaagag ctggcagcgg ctgcgcggag ggtccgcgaa ccgcgaccac tggacgcccg 142321 ccttggaaac gctgcagccc ggcggcgtgg tgcagatggg tggtcagtag aggcctcgcc 142381 agtgcaccag gaaccagcca ccaggccccc tacctcgacg cgccccacgc ttctcccgct 142441 ccggtcctct cttctccccc agggaagcca aaacccagaa ggtgagcttc ccacgtggaa 142501 cctcaattca agaggctgac actgagcgtc gcttcgctcc accagacccg cggcgcccgc 142561 cggagtcagc gatgtacccc cttccctttc actgggggaa ctggcggaac ctccggtcac 142621 caggacgtgg ggaagcagag gccgcttggg gcagcagagg ccgcgtccag gtgtcggttc 142681 cggggcacct tttactgtca ctctgtcctt ctctgtgtcc aaatttcacc cggagcccga 142741 gaattcggca ggtctggact gaaaacgtct ccccatcccc tccggatcca agcctcgaat 142801 ttcctggctc agtgggaagg cctcggggtc tgggcgtgga gacggggccc gagaccccca 142861 ccccgccagg gcccactaaa gtgctgccag ctgagccccg gccttcgccc ctccttcgtc 142921 tctccctcat tcccctccct ctgacacccc gccacataca ttactccccg caataccatg 142981 caattttaag ctagggtagg gctcacctga gactggagcc gccataccgt cgaaccctac 143041 gaaaaattta tgcccgtaaa taaacaagcc cggcgcattc ttccttattc taaataactt 143101 cttcctgctg cacggggtcg tctcctttaa ccacggctac cgcagttggt agtgctggat 143161 cccatttcag ggtgatggga tgctggatcc catttcaggg tgataggacc gaccgcaagg 143221 cgaacgattc aggcccgtcc gtctgcgtcc cgggaccccg ggaggtctcc actgacgccc 143281 tgccagcagc aggctccctc cgccttccta gtctcctgtc cttgctgaaa atgcgaaatg 143341 ggaaaaaaga gaaggaaaag ggggctcgtc cgggatttga acccgggacc tctcgcaccc 143401 aaagcgagaa tcatacccct agaccaacga gccgccgctt cccctctgtt ttgttgttgt 143461 tgttgttgtt ttttgacacg tccttccgcc cctgacgcag agcgagactc tctccatcgc 143521 cccaatagct cagtccttag cgcgcgcacc tgcgaatctg ggcttttggg ttcgcggcca 143581 ccctcagatt tgcgaggcag gctttactgc tgctggataa accaaccagg ttatccgttg 143641 aggttatgag ctggtggttt caatgtcgtc ctgatatgaa atataagtgc atttgcctat 143701 cactaaggta aaaggatttc caggcaggat agtgatgtca gaaatcacaa aggaaatgta 143761 cagattttac ccaagttttt aaaacggcaa atgacaatcc aaagaaaata caatctgaat 143821 gacaaatcaa gggcttatta ccgggcttca tagttttgtt ttgttttgtt ttgttttgag 143881 acacagtttc gctcttgttg cccaggctgg agtgcagtgg cacaatctcg gctcactgca 143941 acctccgcct cctgggttca agcaattctc ctgcctcagc ctcccgagta gctggaacta 144001 caggcttgag tcaccacgcc cagctgattt ttgtattttt agtagagacg ggggtctcac 144061 catgttggcc aggctggttt tgaactcctg acctcaagtg atctacccac ctcggcctcc 144121 caaagtgctg ggattacagg tgtgagtttg agaccagcct gggcaacata gtgagacctc 144181 gttcgtacaa aaaaataaaa ataaaaaaat tagtcgacaa gacatggcgg ctcactactg 144241 taatcccagc actttgggag gccgaggtgg gcagataacc tgaggtcagg agttccagag 144301 cagcctggcc aatatggtga aaccccatct ctacaaaaat atgaaaatca gccagacctg 144361 gtggcaaaca gctgtaatcc cagctactca ggagactgag gcaggagaat tgcttgaacc 144421 tgagaggcgg aggttgcagt gagccaagat catgccactg cattccagcc taggcgacag 144481 agcaaaactc cgtctcaaaa aaaaaaaaaa aaaaaaaaag ttagttgagt gtggtggcaa 144541 tcacccatag tcccagctac tcaggaggct gaagtaggag gatctcttga gccctggagg 144601 tggaggcttg cagtgagcca tgttcatgcc actgtcctcc agcctgggta acaaaccctg 144661 tctcaaaatt ataataataa taataatttg gatcttggat agaggcatac acaaagggaa 144721 gatggtatga aggcccaggg agaagatggc tgtccccaag ccaaggaagg aggtctcagc 144781 accaaccaac cctgcccata cttagtctca gactccagcc tccagaatag tgagaaaata 144841 aatttctgtt atttcatcca ctcagcctat ggtactgtta tggcgaccct agcaaactaa 144901 tacagtatta ttcaataata ctaaatatta ttctgtcata attaggaatg aattattgac 144961 atatgcaaca aggatgaacc ttgaatacat tatgttaagt gagagaagcc agacatgaaa 145021 gactgctatt gtataattcc atttatatca aatgtagaga acaggcaaat gcgtggagac 145081 agaaagcata ctagtggttg ccagaagatt ggaggaggga ggaacaggga ggactgctaa 145141 tgggtactgc atctgttttg ggctgatgaa aacattctag aactagatag tagtgactgt 145201 tgcacaatca catgaatata taaaaaccac tcaattacac actttagtga attttatggt 145261 atgtgtagta tatctcaatt atcaatatac tgagtcaaag catatgccaa aattggagaa 145321 ttcagacata tagctattcc ccactggctt ctgtggtaag gctatgagat aaactaggcc 145381 tagaatggaa ttgatgcagc gggcagttca gaaagtgggg aaagaaactt tgctagtgag 145441 gtgagtggtt tttcatgttt attggccatc tgtgtgtgtg tgtgtatgtg tgtgtgtgtt 145501 tgctttcctt tttctgtgaa ttacctgttg gtgtgcaaag acagctgttt acgtggttct 145561 ctagccagac tttccagggc acttcctggc tgtgtgaact tgaactaatt ccttaacctt 145621 tccagggctc cagttgtgtc atctgtagac atgaagataa tggtgtactc ctcgtgcatc 145681 tgctgaaaac aacatgaggc tgggtgcaat ggctcacgcc tgtaatccca gcactttggg 145741 aggctgaggc aggcggatca cctgaggtca ggagtttgac accagcctga ccaaaacaga 145801 gaaaccccgt ctctactaaa aatacaacaa ttagcctggc atggtggcac acgcctgtaa 145861 tcccaactac ttgggaggct gaggcaggag aatctcttga acccaggagg cagaggttgc 145921 ggtgagccaa gatcacccca ttgcactcca gcccggggaa aaagagcaaa actccgtctc 145981 aaaaaaagaa agaaagaaaa cagcatgagt tagttcatgg aaaccagcag agtctagtcc 146041 ctattcagta actgacgaga atgagcagtt ggactctatt ctgtttcctt tttttttttc 146101 tttctttctt tttttttaga cagggtctca ccctgtcagg ctgaagggca gtggcgcagt 146161 ctgggctcac tctaaactcc atctcccggg ttcaagcgat tctcgtgcct cagcttcctg 146221 agtagctggg actacaggca cgcgccacca taccaggctg atttttgtat ttttagtaga 146281 gacagcggtc tggcatgttg gccaggctgg tttcgcattc ctggcttcaa gtgatcaccc 146341 gccttggcct cccaaagtgc tgggaatacc ggcgggaggt accgcgccca gcctccttca 146401 cttttctgtg acttccagag cagacaatac ataagacacc gggagagatg cctgcaggtg 146461 agtagacagg gttctccttg gcttcctttg agtctcaagg aagtcactga agcagcgggc 146521 ccagctgcct tttacgcctc acgccagcct ccagacacac ccattgcacc atgtactgct 146581 acttggcaaa ctttgtcacg gaaatccaat gtttccagtt tctgggtccc gaaactcttc 146641 ccatgcaagc tctaggacaa tctctttttc tccagtctcc acttcaaggc tcagttgttt 146701 tccacctatg gcccttaggg ccagcccgag gctacacttg ccagccagag aggctgcggc 146761 aagcagcgtt tatgctccat agtcccaggt gaggaagacg aagccttttt tgtcatagat 146821 cttcatttct tttcttaaat gtcttttaaa aatatgccta tgtaattttt taaaggattc 146881 gaaatctatg tagcccagta ataagccctt agtttgtaac ttcgattgta ttttctttgg 146941 attatcattt gccgttttaa aaccttggat aaaatctgtg tatttccttt atgatttctg 147001 acatctctat cctgcctgga aatcctttta ccttagtaat aggcaaatgc acctatatat 147061 cataccagca cgacattgaa accaccagct cataacctca gcgggtaacc ttgttggttt 147121 acccagcagg agtaaagcct gcctcggaaa tctgagggtg gccgcgaacc cacaagtcaa 147181 gattcgcagg tgcgcgcgct aaggactgag ctattggggc gatggagaga gtctcgctct 147241 gcgtcaggga cggaaggacg tgtcccccac cagaaaaagg aaaaaaaaaa aaaaaaaaag 147301 aaaaaagaaa aaaggaagcg gcggctcgtt ggtctagggg tatgattctc gctttgggtg 147361 cgagaggtcc cgggttcaaa tcccggacga gccccctttt ccttctcttt tttcccattt 147421 cgcattttca gcaaggacag gagactagga aggcggaggg aggctgctat tggcagggcc 147481 tcagtggaga cctcccgggg tcccgggacg cagacggacg ggcctgaatc gctcgccttg 147541 gtgaaatggg atccagcgct gccaactgcg gcaactgcgg ttaaaggaga cgaccccgtg 147601 cagcaggagg aggttattta gaataaggaa gaatgcgccg ggctcattgg tcgctacagg 147661 ggcataaatt ttttctagga ttccagagta tggctgctcc agtcccaggt gagccctcct 147721 ccgcttaaaa ttccagttta ttcggagaag gggagaatac cggaagcggg ggagaaggga 147781 ggggagtgag tgagaggagg gaggcgggga agaccgggca tccgcgggca gcacttcagt 147841 cggccctggc ggggtggggg tctcggccct tgtccccacc cccagacccc ggggccttcc 147901 cactgagagc gagggaattc gcggctttga tctggagggg acagggagcc ccttttagcc 147961 cagatctgct ggattctcgg gagtcaggtg aaatctggat acagggaagg gcagagtgac 148021 agtaaagggt gccccggaac cgatgcccga cgcctgctgc cccacggcct tcattcttgc 148081 tcctcctcac gtcctgggga ctggaggtcc cgccagctcc ccccagtgaa gggcaagggg 148141 gcgcatcgca gaatcgcccc tggtgcgcag cgtagttggt ggcctaagcc gtcttctccg 148201 agagctcagc catcctgggc acttgaggcc aggcaggcgc tgcggggctg gttcagtcga 148261 tggagaaaag agactctcag cgtcagcctt tagggttcga ggcccacgtg ggaaactcgc 148321 tttctgggtt ttggcttccc tggggagaag ggaggaccgg agccggagaa gcgtgaggtg 148381 cggcgaggta gggtgcctgg tagctggtgt ctcgtgcact ccgtcaggcc tttattgaac 148441 acccattggt gccacgctga gggactgcag cgtttccaag gcgagcttgc agtgatcgag 148501 gctcgcagac cctcagcgca gccgctgcct gctattggga ggttcccatg tcctgcggaa 148561 gctggagctc ctcactgcgc gcctcggaga cgccgaatcc ccaactcaag gccgagtggc 148621 ggtggcgggc gaggacccag ggcccttctg gattgtggta gcgccagggc tgctccccat 148681 tttgtgcgtc gcggaggagg ggccggggct ctgggcttcc gccacaccca tcctcacttg 148741 aaattgcccc ttccaggcgg gtctgaaata agattctact cactgggtcc tgctgggtcc 148801 gccccagccg cccccaggat gctaggacgc ccacaaagat aaccttaatg cgtaaatcgt 148861 gtggctcgtt ggtctagggg tatgattctc gcttagggtg cgagaggtcc cgggttcaaa 148921 tcccggacga gcccggcttt tggtgcaggg taaaagtcgt ttcctgctct tttttagatt 148981 cggctcgact tacagacctc agcgcaggac gtggacgccc tgcaaaggta atttgtacaa 149041 aggtctcaag ctatgttgac gccgcctact atcccactac ctcttcttaa aattcatcta 149101 gagtcctgtc accagtgttg ccagagccct tttgacaagc tatttttttt gtcgtttgtt 149161 tggtttttgg tggtttttgt tttgttttgt tttgtttttt gttacttctg gcagtgcaaa 149221 ctcgaggcac cgaagaagag acaagtccct atatacaata agagccgaat cttcacagtc 149281 agacgcctta aaaggtccag cctgggactc ccacccacta tccccggttc tagcagtcgc 149341 agtcttaatg agagagtgag ccaggcactc gagcagtgga gataggctat gggtgtttgc 149401 gtctggggtc cagagacctc tccctcccca catcccgtca ctcctcacct gcgtgtagcc 149461 tcttggggct cctctgctcc cctgtcctgg ttttggaagg agtacggact gaaccctgga 149521 acccgggtac cccgtcctga gtcccagcgc ccaccaggac tggggcctcc tctggtctac 149581 tctgaggaca gattctattg gcagggactt catttttgtt tgggacgctg ggtttgagaa 149641 cgtgctatga atcaagaatc aaagaaattt acaaggaaaa tggacccagc acaaggaggc 149701 agggaggcag ggctcacgcc tgcaatccca gcactttggg aggctgaggc aggttaatca 149761 cctgatgtca ggagattgag accatcctgg ccaacatggt aaaaccctgt ctctactgaa 149821 aatacaaaaa ttagtcggtc atggtggcgg gcgcctgtaa tcccaggtac taggaaggat 149881 gaggcaggag aatcgcttga accccggaga tgggggttgc agtgagctga aatggcgcca 149941 gtgcactcca gcctgggtga caaggtaaga ctctatctaa aaaaaaaaaa aaaaaaagac 150001 cgcaaaacaa ggtatcatga acaacaagtc atcaacagta ggcaacagaa acaaaattgc 150061 aaacattttg gatgcagact attgggaagt ctgtaataca attatcctta ggaaaggcaa 150121 taaagaaaat atctttgaca acgggaccta tgtaaaatgc tctcctttga accttctaga 150181 ttgaaaaaac aaccaaatag ttctagatat aaaagcacac agtaatggaa acggaacagc 150241 tacatgcacg tcacagtagg ttacagtgcg gagcgacacc tgcgaggact tctgtgagct 150301 cccagtgagc ttcagacatt ctgtctgcag ctccaaagat gggttaattt gcccgataca 150361 cagcagaact ctcctgttcc tgtaattcga gtggcctcca aaaagaggtt gtcgttgact 150421 cttttaggtg cgtgcagtgt gggtatgttt tttctctttc tgaaaggagg acatttcagc 150481 ggacggcaga gaatacagat ttatgccagc gccctgcgga aggagcctct ggcgggtcat 150541 ctccatttat gggagatcgc agagcggtga gcaaagtctt gcaccagcgg ggatgtcggg 150601 gtggaactga acctgttgta ggggaccccc agtgtgccct ggtttctggg gctgattcat 150661 tacgtcactt agtttctact attctcatgc caccccgcca attactagac ataaaacgtt 150721 cccagggaag cccggctagc tcagtcggta gagcatggga ctcttaatct cagggtcgtg 150781 ggttcgagcc ccacgttggg cgacttgttt tcttccttgc taccttggag ttctggaaac 150841 agccagcatt taatgaacac taccaggtgc cacatcgcgc tctgtgatgg tatcccctca 150901 atgattgaat tttggtctcc gcgcgctccc agcagcaagg attgtcagtg aaatgaccaa 150961 aggcagcacc gggacgggct ggttgccagc caggggcaga ggggcggggg cgggtgagaa 151021 ctgaggggcc acaggatggg tgtgaggggt gcaggggcct ctccccactt tgtgcggtgc 151081 ggagaggggc cggggcgctg gcccacccac tgagtctgca atcccagtct cctgtcactt 151141 tgctttcgca ggcttgagct ttatcgctcc aagtactcca cggctgccac aaagaaaagt 151201 aaagggcctg ctgctgtggc tcgttggtct aggggtatga ttctcgctta ggatgcgaga 151261 ggtcccgggt tcaaatcccg gacgagcccc tcttcttatt tttgagacag agtctcgctc 151321 ggtcgcccag gctggagtgc agtggcgcta tctcaactcg gttgaagctc cttctcctgg 151381 gttcacgcca ttctcctgcc tcagcctcct gagtagctgg gactacaggt gcccgccacc 151441 acgcccgatt aattttttgt atttttagta gagatggggt ttcaccgtgt tagccaggat 151501 ggtcttgatc tgacctcgtg atccgcccgc ctcggcctcc caaagtgcgg ggatgacagg 151561 cgtgagccac tgcgcccggc caacagccat gcactttcaa gggctcaagt ccagggttta 151621 ctccagtgag actcaacccc tcggtcctgg atttctctgc ttcaatttat aaaatataaa 151681 gttactgggg gaatttagaa tatggattgt atattagatc gccttgtcgc atgagtgtct 151741 gaggtcccgg aagcctcatt ggtgggatga gtatcttgag cagctccaac actgctgaat 151801 ttttaccact aggtaaatga cagtcacagc ggattccagg ggttgtgtcc acgatcccgg 151861 ggggcgtcgg ctttgagcag cctcaagggg agggttttgt ctggctcctt gcttcctctc 151921 gcgccgagcc tggaaaagcg aggtgcaggg tagaatctcc cgggccgcct ccgtgtcccg 151981 gacgttggcc cagctctgag tcgcggcgct cggccctggg gtgcggcccg ggaggctgct 152041 aagagggcgc tgacttgggc ttgcagttga gcttttggcg gtccggtggg gtgtctgact 152101 cgcgccgtct gcaatcttcc tcccgggtac tgtgccccgc gggattaaaa aaaaagagtt 152161 tttaaaaatt ttttatcttt tttgaaatgg cgtcttcctc tgtcgcccag gctggaatgc 152221 agtggagcga tctcggctca ctgcagcctc cgcctcccag gttcaagcga ttttcctgtc 152281 tcagccttcc gagtagttgg aggagccacc acacccggct aatttttgta tttttagtag 152341 agacggggtt tcgccatgtt ggccaggctg gcctcgaact cctgacctca agtgatccac 152401 cgccttggcc tcccaaagtg ctgagattac tggcgcgagc ctccacgccc ggcccctgca 152461 ggatcttttg tcatgccctg gaccccagga ggacctgacc tttattaaac acagtggggc 152521 tgtcaagggc ctcgcgtcaa ggctcagcgc ttcaccgacg cacccagccc gagaccctcc 152581 gttgctctct gactccgaag gaaaatctag ttccttcggg cgcctgggac tcctttctgg 152641 aggatcagac gagtcgggct ccgcgaagcc catgcgggct ggaggatcgg aaaccacgcg 152701 ggagaggata gcgccggtgg cgcgaggacg cagactgcag agctctacgg ggaatgggag 152761 ttttctctcg ttcactatgg cgtccccggc gacccgaatg gaggctgctg cgttgcaaga 152821 actgggctaa gactattttt tttgagacag gatctcactc ccgtcgccca ggctggagtg 152881 cagtggcgcc atctcggctc cctgcagcct ccacttccta ggctcaagcg atcctccagc 152941 ctcagcctcc caagtagatg ggactacagg cgagcgcaac cacaaccggc taattttaaa 153001 atttttgtag agacggggtc tcgctatgtt gctcaagctg gcatacttcc ttgaataaat 153061 gcccaaacca attctagggt cattttctag ccacccgcga taagttattc tgtcttttaa 153121 tattctcagt agggcacgtg gtaactgccc cagtggcctg atggataagg tactggcctc 153181 ctaagccagg gattgtgggt tcgagttcca cctggggtaa gacaacaccg accgtaggtg 153241 acttggggta aggtatcacg ccttttaaaa aggacaaagc atggagacac ataagtagaa 153301 aaactttgca agaataagaa gaaacggcat gaatttggaa gaaaaacagg gcattaaaca 153361 agacacacaa tctcctctct ctctaggatg agaagagtga agattttact tacctctctc 153421 cacatcaaag ggacaagaaa aagcaatgta cagcttttgt atatatcaaa gtgggaaaat 153481 gttggaggct ggttttctgt tttggtttgt ttgtttggtt tggtttgtct ttctgtttta 153541 gaaagcaccc cttgtataca cacctctttg gcccacacgt tccatttcat agaaataaga 153601 gtaccatgta tgtacaagga tgttttattg cagtaatttt gttcaaaaac gggaagggag 153661 atgaccatga ataggggatg aattgaatac agtttgtttc attaatgttg ctcaacctca 153721 gaaaagaacg ttcaatctat cctggttgac ttggatatcc cccacgggag agatgtatga 153781 cactttacaa aacaagcagc gactctgagg ccccagatgt ttgtgtatga tcagtgccaa 153841 aggctcttgg cactcgggcc agaggagagg aggggagaaa aaagattcta gtaaaatgga 153901 aaaatatgca gatcaaaagt aaacaggaac aggagaaaac agaggtctct tttggaattt 153961 aaggggggca cttgcggaat tggaagtgag gtgggcagat gatcatgtgg acttaatgtg 154021 gtttatgatc gtgagtgatt taggtctcgc tatgttgttc aggctggtct ggaactcctg 154081 tgctcaagtt atcttcctgc ctcggcctcc caaaatgctg ggattacagg cgtgagccac 154141 cgcccttctt ctgtgacttc tgatagtgac tcctgggcat gaaagacctc gagggtggag 154201 tcttggcatc ggtccaggat ttggggtcca gcaagtgctg tccagattgg cgtgtcactc 154261 agtgagcact ctggtggcgg aaggccccgt gtcccccgga gcacaatgcc agccgtgctc 154321 cgggaagcgc atcccggccc aggagggtgc ttcttgctca gagcccgggt cctggaccca 154381 atctcagacc ctgcgcctcc ggcgctctcc ccccaaccgt tccttctctt cctgccgcta 154441 cggaaacagg agagaatcct ccctaaaaag acaggtaact cattgtgcta attgcgttta 154501 ctattaaaaa aagaaaagaa aagaaaacag aaaaaaaggc aaaaaacaaa acaacaaaca 154561 aaccaaaaca gctaccgcga cttgtgtgcg ggttaaacga cgcactagct tcaaatgcgg 154621 gcgtccatcc cagtgagaaa ggaaacaggg cccggcgcgg tggctcagag aggcaaggcc 154681 gggagattac ctgaggtcag gagttcgaga ccagcctggc taacatggtg aaaccctgtc 154741 tctactaaaa atacaaaaat attaaccggg cgtggtgggg gtggcctgta atcccagcta 154801 ctcgggaggc tgaggcaaga gaatcgctgg agcccgggag ttggaggttg cagtgagcag 154861 agatcacacc attgtattac aagcccgggg gacagagtgg aactcaaaaa aaaaaaaaaa 154921 aaagaagtga agaccagcaa ctacctctga tacagaatcc accctctttc tttttctgtt 154981 ttgaaagtcc attttgttaa ctgctatatc cccagtgcct ggcacacaga ggtgctcaat 155041 acatatttga gagcatgaat tactttattg ggcaatgtgg cccccagtac atctgggcac 155101 accttgggac tgagaacata gggtgggatg taccctccca cctgggtgcg cagccccacc 155161 aggtgggttt ggtcactgag cccctgcctg gggcttccag gaccccagag tctctgtgcc 155221 acccgccagc aggacgcgtg ggtcacgtct ctcctggtgg actttctctt ctcctgggaa 155281 gggcaggacc cagcgagcag ttggtggcac tgcctgattc atcctcacgt tcatgtcaag 155341 tggtgagcac acactccatt atcagaaaac cttgcagtta tgcccagcta gctcagccgg 155401 tagagcacaa gactcttaat ctcagggtcg tgggtttgag ccctgtgttg agcacatgtt 155461 tccttttcct ttgcagccct aggctgcaga gaaggttgag acatggagta ctctcaggtc 155521 tttgcccagt gaaagttttt ggatgaaggt tagtaacgtt tagcaagact taggagatgc 155581 taatcggggc cattaaagtt ttaaagtaaa aatattatta ttattatttt gagatggggt 155641 ctccctctgt tgcccaggct ggagggcagt ggtacgatta cagctcactg catccttgac 155701 ctcccaggct gaggatatct tcccacctca gcctcctaag tagctggcac cacaggcacg 155761 cactacgaca cccagctaat ttttaaaatt atttttgtag agatgggatc tcactattgt 155821 tgcccaggct ggtcttgaac tcctgggctc aagagatccc cctgcgttgg cctcccaaag 155881 tgctgggatt acagatgtaa gccaccacga ctggcggtga aattaatctt tttttttttt 155941 tttttttgaa atggagtttc gctgttgttg cccaggctgg agtgcaatgg cgtcctgcaa 156001 cctccgcatc ctgggttcaa gcaagtctcc agcctcacct cagcctccca agtagctggg 156061 attactggcg ccaccaccat gcccagctat tttgttgttg ttgttgtttt gttttgtttc 156121 tttgagacag agtctcactc tgttgaccag gctggagtgc agtggtgcaa tctcagcttc 156181 ctgcaacctc cgcctgccgg cttcaagtga ttctcctgcc tcagtccccc aagtagctgg 156241 gattacagga gtgcgccact gcacctaatt tttgtatttt tttcagtaga gacagagttt 156301 caccatgttg gccaggctag tctcgaactc ctgacctcag gtgatccgct ctcctcagcc 156361 tcccacagtg ctgagattac aggcatgagc catcacacct ggcatttttt tttttttttg 156421 tatttttagt agagacgggg tttcaccatg ttggccaggc tggtctgaaa ctcttgacct 156481 caggtggtcc acctgcctcg gcctcccaaa gtgctgggat tacaggtgtg agccactgtg 156541 cccagcccaa aactaatctt aataatacat tttatttgaa caaatgtttt atgtcacaag 156601 taatcaatat atacattatc aatgagatag gttatgtcgt tgttggtttt tttttctctc 156661 attctaaacc ttcaaactct ggcatgcttt cctctaaggc acatctcaac ttgcacaggc 156721 cacatgtcaa gggctccaga gccacctgct gctgggggcc agttcctcag acagcgcagt 156781 cctacatctg actacaagct taaatgtggc cacatgtcaa gggctccaga gccacctgct 156841 gctgggggcc agttcctcag acagcgcagt cctacatctg actacaagct taaatgggct 156901 ggggagagag ggagagggag agggagaggg agaaggagag ggagagggag aaggagaggg 156961 agagggagaa ggagagggag agggagaagg agaaagagag ggagaaggag ggagagggag 157021 aaggagaggg agagggagaa ggagagggag aaggagaggg agagggagaa ggagagggag 157081 aaggagaggg agagggagaa ggagagggag agggagaagg agagggagag ggagaaggag 157141 aaagagaggg agaaggaggg agagggagaa ggagagggag agggagaagg agagggagaa 157201 ggagagggag agggagaagg agagggagaa ggagagggag agggagaagg agagggagaa 157261 ggagagggag agggagaagg agagggagag ggagaaggag agggagaggg agaaggagag 157321 ggagagggag aaggagaggg agaaggagag ggagagggag aaggagggag agagagggag 157381 agagagggag agagagagac agagagagag attataagga attggctcac gcagatggag 157441 gctaacaggt ccccaagtat gcagggtgag ttggcaaact ggagacccag cagagctgat 157501 ggtgtggttc taatccggag gctagcaggc aggaggccca ggaagagcgg gtgttttcat 157561 gggattctga aggcacaaaa tagctggtgt gccatctggc tgaggcaagt aggctgacat 157621 gtttcccctg ttggtggtgg ggatgacaca tgcagagaga ggacaggctt aaccccttaa 157681 gcaggggaaa tgcctgatga acatgggtgt tttaggggaa tcaacacagt tctgtacaga 157741 tatgcaagcg gggacaggga acaggtagct ctcctgggcc tggtctatgt tggacctgat 157801 ggtggcagct ggcaggagat gaaacaaaac aacagtgaga ctttgtactt caggtgaaag 157861 ggtagtggcg aggtcctgcc tggtagaaac catgaggtct tctgaccctt tttgtttacc 157921 cttaagaaat cacagcggtg gacgcggtgg ctcacgcctg gaatcccagc attttgagag 157981 gcgaggcagg aggatcactt aagcccaagc gttcaagatc agcttgagca atatggcaaa 158041 accccggctc tacaaaaaat acaaaaatta gctgggcttg gaggctgtga tgggaggatc 158101 gcttgagccc aggagcttga ggctgcagtg agctgtgttc actcctcggc actccagctt 158161 aggtgacaga gcaagatctt gtcacaaaaa acgaaaacaa aaacaaaatc aaaacccatt 158221 aaagaaggtg aggatggtgg ctcacgccta taatcccagc actttgagag gccagggcag 158281 gaggatcact tgagctcagg tgttccagac cagcctgggc cagaaggcaa gatcctgtct 158341 ctacaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aattacctga acatagtggc 158401 atgtgtctgt agtcccagct cctcaggagg ctgagacagg agaatcacct gagcctggaa 158461 ggttgagact tcagtgagct gtgttcatac cactgcattc cagcctaggc aacagaggga 158521 gaccctacct gaaataataa taattattat gatgatgaaa ataaaaatat ataataaaag 158581 agattaaata gctgggcaca gtgtctcatg cctgtaatcc cagtatttca ggaggctgag 158641 gtaagaggat tgcttgagcc caggagttca agaccagtcg agaccagcct gggcaacata 158701 atgagacatc atctttacaa aaaatttaaa aattagccag gtgtgttgat ggtctcacta 158761 tgttgcccag gctggtctca aactcctgag cttaagcggg cctcccacct tggcctccca 158821 aagtgctagt gttgaccatt gatatgatac cttggtttat gtattcttgg tttaaaagaa 158881 tttaaacaag agacacacag caaaacaaat gcagcataga gtaacttctt gcaaaagaaa 158941 aaggatattt tgaaagttat ggctggttgc agtggctcac acttgtaatc tctcagcact 159001 ttgagaggcc aaggcgggcg gatcgcctga ggtcaggagt ttgagaccag cctgaccaac 159061 atggagaaac cccgtctcta ctaaaaatac aaaattagct gggcattgtg gtgcatgcct 159121 ggcaatccca gctactcggg aggctgaggc aggagaatcg cttgaaccca ggaggcggag 159181 gttgtggtga accgagatca tgccactgca ctgcacccca ggcaacaaga acaaaactcc 159241 gtctcaaaaa aaaaaaagaa gaagaacaaa gttaggtgca gaatagacag tacaccctga 159301 gagagaggga actgagggca ggccgcttgt aagaatgaga cagcaaagat gcacgaggga 159361 gactcccttt atgggagcct taaatggtta ttcataagga gatgagagag gtgttactag 159421 taagcctgtt ctgggtggtc ttcttctcag tgcacaagtg ctgtagctgt gaatgcttgt 159481 tcataggttg cacgtctcgt tagcatctta aatctccacc cagggatatg ttttttacta 159541 ttaaaatgag gaaaaaggct gggcatggtg gcgcatgcct gtaatcccac tttcggaggc 159601 cgagaggggc ggatcacttg agctcaggaa ttcgagacca gcctggccaa tatggtgaaa 159661 ctctgtctct actaaaaata caaaaattag ccggacatgg cagtgcatgc ctgtaatccc 159721 aactactcgg gaggctgagg caggagaatg gcgtggaccc gggaggcagt gagccaagat 159781 tgcgccactg cgctctagcc tgggcgacag agcaagacgt catctcaaaa caaacaaaca 159841 aacaaaaaac gaataaacaa acaaagaacc aggaaaaggt caatttgaag gcaggtaaga 159901 tcagaatgca catgctctac agaagggaaa gtacctactg aagatagctt tgctttaatg 159961 acctcaatta caaggtgaat gctgaggctt actgtgtgga ctgtgtggtc accacagttg 160021 ctgtgtccca agaacatggt cacattcttg actacctatc ctgcctcact aggattacag 160081 gtgtgaccca ccatgcctga ccaatggcat taattacatt gacaatgtat gcaataatca 160141 ccattatctg tttccaaaac gttttcgttt caaacaaatt ctgtaaccat taagcaataa 160201 ctccccacat ccccttcccc agctgttaat aacctttaat ctacttcctg tctctatgga 160261 tttgcctact ctagacattt catctaagta gaataataca ttatgtgttc ttttgtatct 160321 ggcttctttc acttagcata atgtctcagg gttcatccac attgtagtgt gtgtcagtgc 160381 ttcattcctt ttttatggat gaatgatatt ccatcataag gatctaccac attttgttta 160441 tctattcatc tgttgatgga tacttgtgtt attttcacat tttggccatt gtgactaatg 160501 tacaataatt gttgtacaaa tatctgtttg agtccctgtt ttcaattctt ttgggtacaa 160561 acctaggaat ggaattgctg gattgtatgg taattgtatg tttagctcct gggggaacaa 160621 cccaactgtt ttccccagca gctgcatcat tttaccttct caccagcaat gtagagctct 160681 agtttctcta catccttgcc aacacttgtc atttttgtct gcttgtttaa ttattgccaa 160741 atacgtggag gaaaagggga attcctatcc tcctccattt agtcctggtg acaatgtcca 160801 tcactggatt cccgcatgga atggattgag ctgatgatcc agaatgatga gagtactctg 160861 ttcttccatg cagaatggtt tgttcaggtt tggacacatg acccacgaag ggatgatcac 160921 aaccttacct ggagttgagg taggaagcag ggctcggaca ctggaccaaa ttgaggatta 160981 gctaaaagag atggggcgga agcagctttc cataagacat acccaccagt gacacccagg 161041 cgtcactgcc cctttccatg gcaatgactc agtgacccaa acgttactac cccttcccta 161101 gaaatttctg cataaactgc accttattct acatgtactt aaaaatgggt ataaatatgg 161161 ctgcaaaact ccctagagct gttactctca gcacactgcc tatgggggag ccctcctctg 161221 caggagcagt caggcagctg taacactgcc tgctgcttta gtaaagctgt tttcttctac 161281 cactggctcg cccttgtatt ctttcctggg caaagccaag aacccttgca gactaggcct 161341 cactttgggg ctcgcttgtc ctgcatcagt acaatgttat taataaataa acattggtag 161401 agggaagttt catgtgccct gtggctgcta gtctggaagg atgtgaattt ggagccactg 161461 ttggacatct tgctgccaca tgaggagagg taatctgcag aatgaagctg agcagaagcc 161521 aatgcagtag tgagatagag aaatacaggg tcttgatgtt atcattcagc tccctgaatt 161581 tggtctggcc tgaaacacaa aggctttgca agaagacacc ttggattttg taatgacatt 161641 aacctgtcat ttctccctag gctgatatga gttggttttc tatcgcttgc aaataatgtc 161701 tttgctaatt cagtgtccat tctcctcctc ctccccagat ctttcgtaat tagctttatg 161761 ggttgtttgt tgttcttgtt gtttgtttgt ttttaagacg aattctcgct ctgttgccca 161821 gcctggtggg catggtagtg cacgcctgta gtaccttgcc tgggcaacaa gaatgaaatt 161881 ccatctcaaa ataataataa taataataat aataataata ataataataa taataacaat 161941 ttaatatctt cagccttcac ctagtggaca tggtcttagc ccagtgcatt gatgaggtta 162001 agagtcacga gatatttaac ataaattacc tagtttagtt attacaaccc tacaaggtga 162061 atattatttc tccactctat agatgatggg agctggtgat tcagtggagg ttactagctt 162121 ttctaaggct atacaactgg taatggtgga tctggaaatt gagtgctggt ctgtgaaaat 162181 ccaaaggtta tcttcttaat cccagtacag cttcacaggt atctcccaac aagatcagca 162241 tcccagctgg gtgggggata aaatacaggg tcgtgagttt cctctagcag aaggattgtt 162301 gccaatgatt accccttaga ttgcatcact ttcagtgttg ccaagtgctg aggcaggaat 162361 tatgtatgac aaagctttat cgtcagtgtc ggacacaaaa gagccaagga ggggaaggga 162421 caattactct ctgcttttct caatgaggct ctcagggaat atgaccctag tggccatgga 162481 ggctcctgtg aagagcttgc cctgtgattc ttccaaacat ccttcctgca tcaagggctg 162541 agtctgtgca gcccagctcc tgattcatga gatactcagc ctcaaagctc cagtgacgag 162601 tcaccctccc tcagaccaat aacctctggc caggtctgaa aaagggaagc caacccagga 162661 cttcaggaga cagtgaggac tgctgctctt tctgggtaag agtcagatca ttggccaggt 162721 gtggtggctc acgcctgtaa tcccagcacc ttgggaggcc aaggcgggcg gatcacgagg 162781 tcaagagata gagaccatca tggccaacat ggtgaaaccc cgtctctact aaaaatacaa 162841 aaattagctg ggcgtggtgg cacgtgccta taatcccagc tacttgggag gctgaagcag 162901 gagaatcact tgaactaggg agctgaaggt tgcagcgagc caagatctcg ccactgcact 162961 ctagcctggc gacagagcga gactccgttt aaaaaaagaa ataaaaagag tcacatcagt 163021 gggacttcac tgataactgc ttgtcaggac ttaaagcctc agagggaccc tccttccttt 163081 caattatgtg gagccagaaa aacagtcctg agctcagcag cagtcccttg caatcaccca 163141 gtcaggacca gcctctgtcc cgtctgtctg acatcctgaa ctgtggggaa cacaaactga 163201 ctgagtttct gcccagacct ctgggttgtt ccattttaat tttacaaatt ggactcaggg 163261 tgctccgtgg atatagccaa gaaggcctgg ctcagacttc tgtaaacctg agccctgaac 163321 atagcagtag cagccccagc agggttgaca atattatcat ttacatcctg caagcaaaat 163381 tctactgcca ttcctgtgcc ccatcttaac cagcattttc caagcttttg catttcgtct 163441 tctttgtctg cctctgtgtc caggatccca ggcccatgag cgggacaaac cagtcgagtg 163501 tctccgagtt cctcctcctg ggactctcca ggcagcccca gcagcagcat ctcctctttg 163561 tgttcttcct cagcatgtac ctggccactg tcctggggaa cctgctcatc atcctgtccg 163621 taagcataga ctcctgcctg cacaccccca tgtacttctt cctcagcaac ctgtcttttg 163681 tggacatctg cttctccttc accaccgtcc ccaagatgct ggccaatcac atactcgaga 163741 ctcagaccat ctccttctgt ggctgtctca cacagatgta tttcgttttc atgttcgtgg 163801 acatggacaa tttcctccta gctgtgatgg cctatgacca ctttgtcgcc gtgtgccacc 163861 ccttacatta cacagcaaag atgacccatc agctctgtgc cctgctggtt gctggattat 163921 gggtggttgc caacctgaat gtccttctgc acaccctgct gatggctcca ctctcattct 163981 gtgcagacaa tgccatcact cacttcttct gcgatgtgac tcccctactg aaactctcct 164041 gctcagacac acacctcaat gaggtcataa tccttagtga gggtgccctg gtcatgatca 164101 ccccatttct ttgcatcctg gcttcttata tgcacatcac ctgcactgtc ctgaaggtcc 164161 catccacaaa gggaaggtgg aaagccttct ccacctgtgg ttctcacctg gctgtggttc 164221 tcctcttcta cagcaccatc attgctgtgt attttaaccc tctgtcctcc cactcagctg 164281 agaaagacac tatggctact gtgttgtata cagtagtgac tcccatgcta aaccctttca 164341 tctacagcct gaggaacagg tacttgaaag gggctctgaa aaaagtagtt ggcagggtgg 164401 tgttttctgt ctgatgaaat aatcaagact gaatctcatt cccaaggaaa tttatttttc 164461 accaattgag tttaatgcag tagttgtttc attaaatgat gttcttgcta gtgacacact 164521 tagtaattat actaagttaa actattaatt ataatttttt ttgagacagg gtcatgctct 164581 gtcacccagg ctggagtgca gtgccgtgat cttggctcac tgcaccctcc atctcccagg 164641 ctcaagtgat cctcctgcct cagcctcctg agtagctggg accacaggtg tgtgccacat 164701 gcctagctaa attttttttt ttttttttga gacggagtct ctctatgtcg ccaggctgga 164761 tgctgttgcg ggaagtcagg gaccctgaac agagggacca gctggagctg tgtcagagga 164821 acataaattg tgaagatttc attttaatat ggacatgtat cggttcccaa aattaatact 164881 tttataattt cttacgcctg tctttactgc aatctctgaa cataagctgt gaagatttca 164941 cggacattta tcagttcccg aaattaacac ttataatttc tcatgcctgt ctttacttta 165001 atctcttaat cctgttatct tcgtaagctg acgatgtacg tcacctcagg accactgtga 165061 taattctacc taactataca aattgattgt aaaacatgtg tatttgaaca atatgaaatc 165121 agtgcacctt gaaaaagaac agaataacag tgattttagg gaacaaggga agataatcat 165181 aaggtctgac tatctgtgga gttgggcaga atggagccat atttttcttc ttgcagagag 165241 cctataaatg gacgtgcaag tagggatatc actgaattct tttcctagca aggaatgtta 165301 ataattaaga ccttgggaga ggaatgcact cctcggggga ggtctataaa tggctgctct 165361 gggagagtct gtcttatgca gttgagataa ggactgaaat atgccctggt ctcctgcagt 165421 accctcaggc ttattagtgt ggggaaaaaa ccccaccctg gtgaatttaa ggtcagacag 165481 attctctgct cttgaaccct gttttctgtt gtttaagatg tttatcaaga caatacgtgc 165541 acagctgaac atagaccctt atctggaggt tttgattttg tcctttgcct tgtgatctct 165601 attggcttca gaggcatgtg atctttgttc tcctttttgc cctttgacac ctgtgatctc 165661 tgtgacctac tccctgttcg tacaccccca ccccttttaa agtccttaat aaaaacctcc 165721 tggttttgcg gctcaggtgg gtcctaccaa tatgggatgt cacccccaga ggcctagctg 165781 taaaattcct ctctttgtac tctttctctt tatttctcag ctggccgaca cttagggaaa 165841 atagaaagaa cctatgttga aatattgggg gtgggttccc ccagtagagt gcagtggagc 165901 catctcggct tcctgcaacc tccacctccc agattcaagc tattctcctg cctcaaccac 165961 ccgagtaact gggactacag gtgtgcacca cctgtgcagc ccgggtgaag actcactggc 166021 ctcatcctcc ttttatgtgt cctctgcaat ggtcaagcat aaaggctggg tcacaaaaat 166081 gctcaggcaa aagaacagca gagtgaggct gggtgtggtg gctcatgcca gtaatcccag 166141 cagtttggga ggctgaggca ggtggatcac ctgaggtcag gactttaaga ccagcctggc 166201 caacatggtg aaaccctatc tctactaaaa atacaaaaat tagctgggcg tggtggcagg 166261 cgcctacaat cacagatact cagggggatg aggcaggaga attgcttaaa cttgggaggt 166321 ggaggttgca gtgagccaag atcaggccat tgcattccag cctgggtgaa agagcaaggt 166381 tctgtcaaaa aagggagggg aggggagggg aggggagggg agggaagaaa gaaagagcgg 166441 agtgaaatgt gagtaccaca acgagtaaca agcatttcag atgaacctgg ggccctggtc 166501 cgcttgcagc ccaggcggtg cagaggtgaa gcgacgctga accttgggaa ccggctagga 166561 gcaccctgat tggctgcctg acttaggggg cagggtcagc cagagccagg tctttgacat 166621 cagggaaaca aatcgacatt tggtcttccc tccatccaga ccaactgcct ctgtctctcc 166681 tctcaagtca gctcccaccc acaccttcaa gaaccaagtc aatgccacct cctcctccgt 166741 gaagcctccg gattcacgct tccggctctc agagctctgt ttttctttgg tttatcattc 166801 cctctccaca agggctttcc gcagactacg agacctacag acccgaatac acacatgact 166861 cactttgtat ctcggtgtcc agcactgctg ctagaccaaa ggtgatcatc aactttaact 166921 gaaaggaaaa aatataggtg aatgcatatg catattgcca tacattttaa tttggagaat 166981 gttactttag aataagaaag aaatagcaga gggaaatgtg agtactacag caagtaacaa 167041 gcatttcaga tgaaccttgg gccctggccc gcttgtagcc caggtggtgc agaggtgaag 167101 caacgctgaa ctgtgggaac tggccaggag caccctgatt ggctgcctga cttagggggc 167161 ggggtcaaat gacaaaaaat cattttgtta tttgaaagct agtgtctacg tatgcattat 167221 ggtacaccat aaaccataaa gaagaaaatt aaagtaacct gtatccttgc agctcaaagg 167281 taatcgcagt taggcacttg gaatattgtt ttacaaaagt tggctgcaat cgatgcatat 167341 aattttgagt tctgattttc cacttcttac ttcccatgtc agtggagctc ttagaaacgt 167401 gatttttttt tttttttttg agatggaatc gcgttctgtc accaggctgg agtgcagtgg 167461 cacgaccttg actcactgca acttctgcct cctggattca agcaattctc ctgcctcagc 167521 ctcccaagta gctgggacaa cagacacgca ccgccacgcc cagctacttt ttgtattttt 167581 agtagagacg gggtttcacc atgttggcca ggactgtctc catctgctga cctcgtgatc 167641 cacctgcatc ggcctcccaa agtgctggga ttacaggcat gagccaccat gctcagccca 167701 gaaatgtgat tttttttttt ttttttttga gatggagtct cgctctgtcg cccaggctga 167761 agtgcagtgg catgatttcc gctcactgca agctccacct ccggcttcac accattctcc 167821 tgcctcagcc tccggagtag ctggggctat aggcgcctgc caccacaccc agctaatttt 167881 ttgtattttt agtaaagatg gggtttcacc atgttaggca ggatggtctc aaactcctga 167941 ccttgtgatc cgcccgtctc agcctcccaa agtgctggga ttacaggcat gagccactgc 168001 gcctgacccc agaaatttga ttttaatggc tgcataatat ttcatcacag ggacgtgata 168061 taattcaccc tttccccaac tctgaacaat ttcaagtgtt tctgtaccat aacaaacact 168121 gctgcacact tccttgtgtg ggaatcattg tttgtgttta ttattgccac aggtagcctg 168181 actgatctat ttcttcagct tggagtaaaa atgtcacctt tcctttgtta ttcataattt 168241 ctgcctaacc ataccctgcc attactacca gggggtccag cacctccctg tgggtgtcaa 168301 aaagcacccc aggagttcac tcagaggagt cagtcaggtg gagagaggga gagactgggg 168361 agctagaagc atccactggc acgatcctgt gagttacggc acacagtgca gtccctgccc 168421 ttcgatgcct ggtcagctga cagcctgtgt gctgcaacat agatgcctgt ggcccatccc 168481 tttgggggcc acactgggac ccacctaagg ttcagatcac agccatctgt gttcaaactc 168541 atgttattct ccactaaaat ctgactctgg gccgggtatg gtggctccca cctgtaatca 168601 caacaccttt aaagtccaag gtgggaggat catttgagcc caggagtttg agaccagcct 168661 gggcaatata gtgagactcc atctctacaa aaaattaaaa aattaactgg gcatggtggc 168721 tcatacctgt ggtcccagct acttgggagg ctgaggtggg aggatcactt gagcccagga 168781 agttgaggct gcagtgagct gcgatcgtgc cactgcactc cagcctcggg aacaaagcat 168841 gatcctgtct ccaaaaaaaa aaaaaaaaat ctgcctccgg atggacaccc actccctggc 168901 cctggggctt ccagtcccac tgtgtcaaac caggtttccc caggaacacc cagagcagat 168961 ccaggcttat tcctttgaga ttccctctgg gtcctaggga tctttttatg tggaaatgac 169021 ttcaatatat ttttttctca tgatatagct gtagctgtaa aatataatcc aaatagatct 169081 atatgaagat tataatgttt gaattctgac tgcttcttat tatttattta gttgatttgc 169141 aaagtctaat aattaacatt tcaattattt tctttctata tgttaatagc ttttaccttt 169201 ttttctttac agaatgattg cagtaagctt ttactttttt ttttttaacg tccttttttt 169261 ttctgcgggg gggatgaagt ctcagtctgt tgcccaggct ggagtgcagc agcacgatct 169321 cagctcactg taacctctgc ctcctgggtt caagtgattt tcctgcctca acctcttgag 169381 tggctgggat tacagacatc caccaccatg cctggctaat ttttgtattt ttagtagaga 169441 tggggtttca ccatgttggc caggctggtc ttgaactcct gacctcaagt gatctgccta 169501 ccttggcatc ccaaagtgct ggaattacag gcatgagcca ccatgccggg tcagctttac 169561 tttttgattt gatctttggt tatggaggtg atgtggtttg gctgtttgtc tccaccaaat 169621 ctcatgttga aatgtgattc tcagtgttgg aggtggcgcc tggcaggtgg tgtttaggtc 169681 atgggggtgg gtccctcatg aatggcttgg ttccccccac acagtaataa gttaccatga 169741 gctctgattg ttagagcctg ggagcttccc cttctccctt ttattccctc tctcttcatg 169801 tgacacacct gtttccactt caccttctgc catgattgga aacttcctga ggcctcacca 169861 gaagtaggtg cccacaccat gcttcttgta cagcctgcag aatcatgacc caaaaaaact 169921 tttctttata aattacccag agtcaggtat ttctttatag cagcgaaaac ggactaacac 169981 agaaggcctg gaggctggtg aatgttatcc attcattata aattatatta aataccttct 170041 agaaatagaa tgatctttgt cctcttcaat tcttgtttct gcttaaagca tactttggtt 170101 aatactaata ttactcattc tgctttcttg tgtttgctca ttctctcaga ctttctgaaa 170161 tagttgactt ggatgaaggg ctccttccct ctgtatcaag atcttccttt tcaaagcttt 170221 cagtatgtga gaaaaaatta gggcaggcaa ggtggctcac gcctgtaatc ccagcacttt 170281 gggaggccta ggctggtgga tcacgacgtc aggagatcga gaccatcctg gctaacacgg 170341 tgaaaccccg tctctactaa aaatacaaaa aaattatcca ggcatggtgg tgggcacctg 170401 cagtcccagt tacttgggag gctgaggcag gagaatcact tgaacctggg aggtggaggc 170461 tgcagtgagc caagatcacg ctgctgcact ccagcctgga tgacagagtg agactctgtc 170521 taaaaaaaat taaaaaaaat aaaaaaaaaa tagagtgtaa tctaacacct agaaagaaca 170581 gtctacaggc cgggcatggt ggctcacgcc tgtaatccca gcccttgggg aggctgaagt 170641 aggcgcctgt aatcccagcc tttggggagg ccgaggtagg cggatcacct gaggtcagga 170701 gttcgagacc agactgacca acatggtgaa accgcatctc tactaaaaat acaaaaaaaa 170761 atctgggcgt ggtggtgggt tcttgtaata ccagctactc aggaacctgc ggtgggagga 170821 tcccttgagc ctgggaagtg gaggttgcag tgagtcgaga ttgtgttact gcactacagc 170881 ctgggcgaca gtaagactct gtctcaaaaa aaaaaaaagt gattctgttt ttcagtttgt 170941 cttttgtcta tctcacacac ttttgtctct gctcttccac gtatattttt atctactaat 171001 tttcaccttt gaatgtccct cttttgaaga tgggtgagtg gggcttccag ttttgtaagg 171061 gatacttgcg ttatgttagg atccagccta acattttcag gagggtgtgt tttggggaag 171121 aggtgtgcgt attaatacca caagccagag gatgactcta gtggacattt gtcagacttt 171181 gtggcttcca agcatctggg cccacttcca aagtttgtag agtcccctaa tttatggatg 171241 ttgttgggaa gagagcccac ctcccactat agaaataagt acaccagaaa cttgcttctg 171301 agtgtctctt tcagctagaa tgagagcaag tgacaggctc tctgcccatc agatatatct 171361 gccctgcatt tgacacagag aaggggagac aaggaggaac ttgctctgtc agtttgtagg 171421 cagccattgt agggacatgg attcctggag cgtgacgaca gtaatgctag gggtagcagc 171481 gaatgtctgt gagaagtaca tcagaaatgc aagctgcagc atctagtgct tggtggcagc 171541 agcactggtg tcctcactag ctggcttgga gtcatgattt ggggcactgt taacagatga 171601 atttttgttg ttgttgtagt tttgttttgt tttgtttttg tttttgtaga aacggggtct 171661 cgctgtgttg cccagggtga tgttgaactc ctggcctcaa gcaatcctcc tgtcttggcc 171721 tcccaaagcg ctgggacttc aggcatgaga caccacactc agccatagac tcgtttgtta 171781 gttctcctaa gaaacagagc caacaaaata tattgctaaa gggtgggtgg ggtgggtggg 171841 aaggtaagat tttaaactct aaggaattgg cacatgtgat tgtggaggct tggcaagttc 171901 aaagtatgca gggtggacca gaaggctgta gacctagcga agagctgatg ttgttgcagc 171961 ttgagtccaa aggcagtggg ttgacttttt ctattgtggc cttcaactga ttggataagg 172021 cccacccaca ttatggaggc taatctggtg tactcaagtt ctattgattt aaatgttaat 172081 ctcatcttaa aaatactccc caaaaagaaa aataagaaag aaagaaagaa agaaggaaag 172141 aaagaggaaa gaaaagagaa agaaagaaag agagagaaag aaagaaagaa agaggaaaga 172201 aaagaaagag aaagaaagag agagagagaa agaaagaaag aaagaaagaa agaaagaaag 172261 aaagaaagaa agaaagaaaa gaaaagaaaa gaaagagaag aaagagggct gggcgcggtg 172321 gctcacgcct gtaatcccag cactttgaga ggccgaggtg ggtgggtcac gaggtcaaga 172381 aatcaagacc atcctgggca aaatggtgaa accctgtctt tactaaaaat acaaaaaatt 172441 agctgggcgt ggtggcgcgt gcctgtagtc ccagctactc gggaggctga ggcaggaaaa 172501 tcactcgaac ctgggaggtg gaggttgcag tgagccgaga ttgcgccact gcactccagc 172561 ctggcgacag agcgagactc cgtctcaaaa aaaaaaaaaa aagaaagaaa gaaagaggcg 172621 ccaggcgggg tggttcacgc ctgtaatccc accactttgg gaggctgagg tcaagagatc 172681 gagaccatta tggccaacaa tgtgaaaccc tgtctctact aaaaatacaa aaattagctg 172741 ggcatggtgg tacgtgcctg tagtcccagc tactcgggag gctgaggcag gagaatctct 172801 tgaacccggg aggtggaggt tgcagtgagt tgagatcaaa ccactgaact ccagcctgat 172861 gacagagtga aactccatct caaaaaaaaa aaaaaaaaaa aaaagagaat gggacagaga 172921 gaaggtgatt ggatttatac attgtagcca agcatgtagt gatagctttg ctcaatcctg 172981 gttccaaata ctctggttca accagctatg gtgaagggga ggtggtagaa acacgactcc 173041 tggggctcca ccacattgtg cctatgcaga ttaggaagct ctatcccaag aaaagggcaa 173101 accatgtgag ttgcagggac atccccaaag atgtccatcc taggggtcta gagttcataa 173161 atcacggcat cctggattct cctggagaaa acctccctta caatcagact acccttcatc 173221 tcaaacattt tctttttcct tttttttttt tttttgagac agagtcttgc tctattgccc 173281 aggctggagt gcagtggcac gatctcggtt cactgcaaca tccacctcct gggttcaagt 173341 aattattgtg cctcaccctc ccaagtagct aggactacag gcacatgcca ccatgcctgg 173401 ctaatttttg tatttttagt agagactggg tttcaccatg atggccaggc tggtctcgaa 173461 cccctgacct taggtgatct gcccaccttg gcttcccaaa cttttgggtt acaggcgtga 173521 gccactgtgc ctgaccccaa acattttcac gaattaagcc catctcagta atgtgctcgt 173581 aacattccct cctgtaaaat ggaaaacacg aagcatcact aatgtcttaa gatgaccagg 173641 cagaggaaag caagggctac acagaaaaca cggaagagcc ccatatctca acaaaggaag 173701 tgatactgca aaggatttca tgacagaatt tccacacctg tgggcacagg agcagatcac 173761 aaggtgagga ggtttgtggt tccaaggaat cttgtctgcg atttatctgt atcaggatgg 173821 cttcatttct aatatctaca agttttggtc caagagtttt atctaaacgt tagataatat 173881 tgaatgtcgt cgttgtttgg tccaaagggg ccaaaattag gtgggacaat tatgttcttc 173941 ttgtccctaa tgaggctctc agggaatctg gtcccagtgg ccaaaggagg ttcctacaaa 174001 tgcctgctct gtgattgccc caaacattta cttacacaga ggactaagac catgagcccc 174061 tactcctcac acactcagga cccgcctgtg tttccaagac attccaactc ccacagtaag 174121 tagaagcatt gaccagttat gtagaaatga ggaacaactg agagtgacag caccaacctt 174181 ctccaggaaa ggtcgcagag gtgaacaatt aagtatgtgt tggctcagat gtgaagttct 174241 tttaggaatc ttctttgcta catcatccag atacaggaag taatgagtaa tgtacagaga 174301 agtgaagtac aggagtcctg aatccatcag ttgtcacttg agattaaccc agtaatcatc 174361 agtcattcct ccagtctctt tatgagctgc taactagttg ggtggggaac tgaatgtacc 174421 agagacacac acattctcag agtcaggcta tgtgtacgtg tctgtgtgtt ttgttttctt 174481 ttctttttta atagagatgg ggtcttggta tgttgtccag gctggtcttg aactcctggc 174541 ctcaagccat cttccctctt tggccttcca aagttctagg attacaggca tgagccacca 174601 tgcccagcct gtctgtgttt tcaacgttaa atatgcaaaa taagaatcag tgagtcgtct 174661 aggaaaagca tcgtaactga gtgttggaaa ttggacgtat gggtgaggag accacatcct 174721 gtttttgcaa gtttgtgaat tgatttgcaa acgtggttct tcctggagcc tcatcacatc 174781 ttaaccaccc atgtgtatgt ttctgaattc actgtcttct atgcagctgg gtccagacat 174841 atgagaggga caaaccagtg agtgtctccg agttcctcct cttgggactc tccaggcagc 174901 cccagcagca gcatctcctc tttgtgttct tcctcagcat gtacctggcc actgtcctgg 174961 ggaacctgct catcatcctg gccataagca tagactcccg cctgcacacc cccatgtact 175021 tcttcctcag caacatgtcc tttgtggaca actgcttctc caccaccgtc cccaagatgc 175081 tggccaatca catactcagg actcaaacca tctccttctc tggctgtctc atgcagatgt 175141 attttatcag tgagcttgct gacatggaca atttcctcct ggctgtgatg gcctatgacc 175201 gctttgtcgc cgtgtgccgc cccttacatt acacagcaaa gatgacccat cagctctgtg 175261 ccctgctggt cactggatca tgggtggttg ccaactcgaa tgctctgctg cacaccctgc 175321 tgatggctcg actctcattc tgtgcagaca acaccatccc ccacatcttc tgcgatgtga 175381 ctcccctcct gaaactctcc tgttcagaca cacacctcag tgaagtgatg attcttactg 175441 aggctgccct agtcacgatc accccatttc tttgcctcct ggcttcctat atgcacatca 175501 cctgcgttgt cctgagggtc ccatccacaa agggaagatg gaaagccttc tccacctgtg 175561 gctcccacct ggctgtggtt ctcctcttct atggcaccat catgtctcca tatttcagaa 175621 cttcatcctc ccactcagct cagagagata tagcagctgc tgtgaggttc acagtggtga 175681 ctcccgtgat gaatcctttg atctacagcc tgaggaacaa ggacataaaa ggggctcttg 175741 taaaagtggt tgctgtgaaa tttttttctg ttcaataatg gtataggctt aagaaagtcc 175801 tagaaggagc taatttctga gataatcgtt tattttttct actgtgtgaa acttagcatt 175861 gtttgtttgt ttgtttgttt tgagacggag tctctgtcac ccaggctgga gtggagtcgt 175921 gcgatttcgg ctcactgtaa cctttgcctc ttgggttcaa gataatctcc tgcctcagcc 175981 tcctgagtag atgagattac agatgtgtgc caccacaatt tttttttttt ttgtattttt 176041 agtagagacg gggtttcacc atgttggtca ggctggtctc gagctcctga cctcaaatga 176101 tccacctgcc ttggcctctc taagtgctgg gattacagat gtgagccacc gcacctggcc 176161 agcatttgtt ttcataatag aaacatctgg tatctatttt gggagaacaa aaccctgcat 176221 atagactctt taggttaaag atggaaagag agatccttta attaaatgac ccgacaattc 176281 caattgtcaa tgccctcctg ccaaaaccta gaaggaacac acctgtagtt caataagttg 176341 gatttactaa ttatataaca agggagaata cacagcatca gcatcgggaa tagtgaggtg 176401 tctcaataga agagtcctaa aaaggacttc tgcttgtgta atgttggtga ggaaatagga 176461 atgagtctgt gctctggagt agatgccatt acagaataga gataattctg aatgagtatc 176521 tttttttttt tttttttttt ttttgagaca gagtctcact cagtcgccca ggctggagtg 176581 cagtagctcg atctccgctc actgcaagct ccgccttctg ggttcacgcc attctcctgc 176641 ctcagcctcc tgagtagctg ggactacagg cgcccgccac catgcccggc taattttttt 176701 gtattttttt tagtagagag ggggtttcac cgtgttagcc aggatggtct cgatttcctg 176761 acctggtgat ccgcccgcct cagcctccca aagtgctggg attacaggcg tgagccacca 176821 cgcccggcca tgaatgagta tcttaatata ttttatctag aaggaaagaa gaggccaaag 176881 ctgtgattgc caaagaaata gcagtcactc atatcaacca ccataggggg atgtttggtg 176941 atttttgtgg ctatgaccat gttcctgttt ttgtgttgag acatcattac agaaaggtct 177001 tgctttgttt tgctctagca cagtcagtgt ggccttatct gataccgatg ttctgtgaaa 177061 ttctctatgt tgatcaggag aacacaaaaa ccttgctgtg agggccaggc caactcctgt 177121 cagggttgtt tactctttct cacaacagag accttgacat gaacacatct gatggaaaga 177181 tcagacattt gttgggacag gaaggggagg attgatttta ttttatttta tttgagacgg 177241 agtctcgctc tcttgcccag gctggagtgc agtggcgaga tctcggctca ccgcaacctc 177301 tgtctaccag gttgaagtga ttctcctgat tcagcctcct gagtagatgg gattacaggt 177361 gcgtgtcacc acgccaggct aatttttgtt atttttagta gagatgggtt ttcactgtgt 177421 tagccaggat ggtctcgatc tcctgacctc gagatctgcc agcctcggcc tcccaaagtg 177481 ctgggattac aggcgtgagc caccgtgcac ggctgaaggg aggatttatt tagcgttcca 177541 gaaagcccta attctgccac tcatttgagc tatttttatt ttcttatcta acctttatgt 177601 atcacacatt acagcaggaa tatgggtaag ttaaacaaga aaaatatctt ccacagtccc 177661 aaatatccag ccatatcaat caactacatc tattttttta cactctgttc tgtgtcaatg 177721 cacatatata tttttattta gatactgatt tatatcctgg tttttcacct ttcattttat 177781 aataaagctt ttccaaatca ctacatagtc tccacaattt tatcttaata ctttatatgt 177841 ctccatcaag ttctctagca aaagactctt actaataatt ctatttctaa attaaaacat 177901 aaaggaaatt tactatttaa gagcgtgatt tgaattttat ttgtaggcca ggcgcagtgg 177961 ctcacgcctg tcatcccagc actttaggag gctgaggtgg gcggattacc tgaggtcagg 178021 agtttcagac cagcctggcc aatatggtaa aaccccgtct ctactaaaaa tacaaaaatt 178081 agccgggtgt ggtggcgtgg tgcctataat cccagctact cgggaggctg aggcaagaga 178141 attgcttgac cctgggaggc agaggttgca gtgagctgag atcacgccac tgcactccag 178201 cctgggtgac agaccgagac tgtctcaaaa aataaataaa taaataaaaa gaaagcgatt 178261 gtagaaaata acaaaactga ccaattatga aactggttat tttttccttg tctttgacat 178321 tcagcatttc tttctttttt cttttttttt tttttttttt ttttgagaag gagtctcgct 178381 ctgtcgccca ggctagagtg cagtcttgcg atctcggctc actgcaagct ccgcctcctg 178441 ggttcacgct attctcctgc ctcaggctcc caagctgctg ggaatgcagg cgccggccac 178501 cacgcccggc taattttttt tttttttttt gtattactga gacagggttt cactgtgtta 178561 gctacgatgg tctcgatttc ctgactttgt gatccgcccg cctcaggctc ccaagctgct 178621 gggaatgcag gcgtgagcca ccacgcctgg ctaacattca gcatttttac tatgatgcat 178681 ctgtttgtgg gtctctttgt gtttatctta cttgaagttt actaagcttc ctgtctgtat 178741 agattattat gttttaataa atttggggcc gggcgcggtg gctcaagcct gtaatcccag 178801 cactttggga ggccgaggcg ggtggatcac gaggtcagga gatcaagacc atcctggcta 178861 acacggtgaa accccatctc cactaaaaat acaaaaaaat tagctgggcg tgatggtggg 178921 cgcctgtagt cccagctact cgggaggctg aagcaggaga atggcgtgaa ctcgggaggt 178981 ggaggttgca gtgagccaag aacgtgtcac tgcactccag cctgaccgac agagtgagac 179041 tccgtctcaa aaaataaaat taattaatta attaattagg gaaattttta gccattattt 179101 ttccaaaatt ttttcctctc ctttctctct tcttctggta ctcccattgt gtgtatttgg 179161 tgcacttaat gatgtccaca tttctttgaa gttatattca cttgtctttt tttttttttt 179221 ttgagatgga gtcttgctct gtcacccagg ctggatctcc cctcactgtg ggttcaagag 179281 attctcctgc ctcagcctcc caagtagctg ggactacagg cactctcaca ctgtcatgct 179341 ggagtcaacc tcccatttac tctgattttt ttttttaaag agatgagccc agcgctttgg 179401 aaggccaagg cagggggatc acttgggccc aggagtttga ggccatcatg gacaacatag 179461 caagaccgcg tttctagaaa aataaaaata aaaaaattag ctgggtgagg tggcatgtcc 179521 ctgtagtccc agatacttgg gagagttagg cggaaggatc tcttgagtct acgagttcag 179581 ggctgtagtg agctatgatc acagctctgt actccagtct gcgcaacaga gtgagaccct 179641 gtttcttaaa aaaaaaaaaa aaaaaaaaaa aaaaaaagat gtggtctcag taagttgctc 179701 aggctggtct ctaactcctg gattcaagga atcctcccac ctcagcttcc aaagtagctg 179761 ggactacacg cacatgccac catgccgtct tgataatgtt ttttaaattt taaaaataat 179821 tgttttttga aatggtatct cactctatag cccaggctag agtgcagtgg cataatctcg 179881 gctcactgca actccacttc ccgggttcaa gcaatcctcc ttcctcagcc tcctgagtag 179941 ctgggactac agatgcctgc caccgcaact ggctactttt tgtattatta gtagagatgg 180001 ggtttcacca tgttggccag gctggccttg aactcctgac ctcaggtgat ccacctgcct 180061 tggcctccca aagtgctggg attataggcg ttgagccact gcactcaacc aaatcttttt 180121 tttttttttt ttcgtagaca gggtctcact atgttgccca ggctggtctt gaactcctgg 180181 cctcaagcca tcctcctgat tcggcttccc aaagtgctgg gattacaggt gtgagccact 180241 gcgcctggct tattttctgt agttactact ggaattggat ctttggtctc ttacattttc 180301 caactctcgc tctaccgcta tattgggaaa ctgttcactg ttgtttactt tttcaactta 180361 actggtgatc aggctcctta ggtaacagaa cttactcatc caacaacttc ccaggggctc 180421 tcgtgtttgg gggcggcccc tggtcctggt cggggcgctc ctctcattcc cacccagcct 180481 cactattgcg gaaggaaacc ctgcggccca tgtcctgccc tctgggacac tgggtgctga 180541 ttgacggcaa ccctgaccgt gggcacctcc ccagtccagg agatggtgtg ggttgtggtg 180601 gcttcaccca cctacactta aaaaaacaaa aagctactca ggaagctgag gcaggaggat 180661 tgtttgagcc taggaggtgg aggttgcagt gagccaagat cgtcccactc cagccttagc 180721 aacagagcca gtctcaacta aagaaaaaag tgtctcaggt accgtgccat ggcttcatga 180781 gaactgatgt aacccccgac tctgggcagg gctgccaaag agtgagagag tggagactct 180841 ctctcaccac tcacttcctg gcagctactt ttctggatgc atgggcaggt cccctacaca 180901 tgagatggtg tgggttgtgg tggcttcacc cacctacact taaacgaaaa gctactcaag 180961 aggttgaggc aggaagatcg cttgagccta gcaggcggga ggttgcagcg agccaagatc 181021 accccactcc agccttatca acagagccag tctcaaagaa aaaattaact gtctcgggta 181081 ctgtgccatg gctgcatgag gactgataaa atacccgact ctgggcaggg gtaccaaaga 181141 gtgagtggag actctctctc accactcact tcctggcagc tacttttctg ggtgcacagg 181201 caggtccact gcacatggta acagcctgcc gactctggaa acacagttca ggatcccagt 181261 cctagtgcac tggtcagact ttcaaataag cgaataacgt tgtgggcagc ctgttttgtt 181321 cgctttgctg gctttacagg gtacttcctg ttggcctgaa atacaatgtt ggtaaagaca 181381 gtttttctgt ggcctttctc tgttgtggag gatttgttgt ttttaaatta cgattcagga 181441 ctgaactgta tcatagaatg tccttctttc agaatgcttt gtgccaaaac aagaaacctc 181501 agaattacat gttaataaca acaaacaaaa caaaaaagac caataggcag aaggcaagaa 181561 acaaaaacta ttctttgtag atggtgtcat tttaccaaaa gttaacagaa ctaaaccaac 181621 aaactattga aaattaacaa cagtgagatg actatacaaa tatataatta aaaataactt 181681 tcccatgtaa aagcaataat caactataaa aatataatgg aaaaaactac aataacaaaa 181741 ttgtgtaaaa caagtgaaga aaattagaaa aattttcaca ggaatataat actctaataa 181801 atagaaaggc atgtttctgg atgggaaaca tacactataa tttttcccaa atagcttata 181861 ggatttcttt attccaatca aaacaccaat gtataacttt tggaaaccta aagcaagcaa 181921 aactgaagaa tccctaaatc gcaccacacg atcagtccgt aaaggcaccg ctggccttgc 181981 ttctaggcag aggacaggaa ggtatggata tcaagaggct gaagacaatt aatacccact 182041 acaatgaaga cacctggaaa cctgtctctt gagaggcaat gtgagctaac cactggagac 182101 gtaatttcta atcatacaac actttgcctt gtgaggtagt gaccacccca ttgccaaagg 182161 tctgatgaca gcatagaggg gacctaagca tatggtaagg ttattgatga gacatggatc 182221 caagccaact ttttcctgcc tccaggttga cccttcctct ctgggtcttg gggtttcctt 182281 cgtagcacat gagatctggg ctcaacagag ggacaagatt ctgaagagct tttctagtgg 182341 tggggctgga tggccctgcc tgagtaatcc aaacttcttt tagcaaggga gaagcatgag 182401 acggctgctg gagagatcaa cagcagaaca aaatacaata agaacagtag acacctaaat 182461 tatgttgtga aaaggaatct gtaaaagtca gttttatcac aaattgtaaa tattattgaa 182521 attgattgca aatttagatc acatacaaat gagagtctga cattcaactg ttttcctata 182581 ttccaaagta aacaattcct ttcaacactc aagacttaaa caggtattct tagagggtta 182641 tatgaattgc tatcagaagc tgttggctaa caagccagta atttggttct ttcaccagaa 182701 cacagttcca gataagcatc tttgcactat ttctcaagta tgaatcccca tgtgggggga 182761 aaacggatat actttcaata gacacaagtc actctttgcc ttccaagtaa gcagactcca 182821 gattcatctt caaagtgttg ggaaagggga tctgtgacct gtacattatc atataacttc 182881 aaaaaggaaa gctccttagt ccaaaaagcc tagatgctga ggtatagccc ttgaaatgtt 182941 ttcttccctg tgaattttct agcaatttga ggttttagct aagatgggca tttatccaat 183001 tttggcaata atgagatttt tacacattta taccttttta ggcagcttgg gaattcagaa 183061 ctacttatga aagctctcag gttgaggcag caccatcaga cccagaaagg gttcccagta 183121 ttacttctgc tttcgggtct tacaggctga gtgggttttc tcatgccggg tacagtttga 183181 cagccgacca aatcttctcc cacatttttt gcaaccatat ggtctctcag cctcatgact 183241 ctgcagatga agcacaaact cctcattctt tgggaaggtt ttcccacatt ctggacactt 183301 aaatatcttc tcccttatat ggattccttc atgacgactc cgatgagaat tctgacggaa 183361 gttttttcca cactgagaac aggaataagg tttctctcct gtatgaattc gctcatgttt 183421 attgaggttt gttttatgat tgaagctttt cccacagtga ttacagtcat agggtttttc 183481 tccagtgtgg gtcctctggt gggaaatgag gtaagaactt tcattaaact gtttcccaca 183541 cagtggacaa gtgtaccatc tccttgtcct ccgatttcga gtaagggcaa taatactgat 183601 gtctacatat tttgagagtt cctctagagg agtatcattc tcaatggtaa ctaacagatt 183661 tctcattttc cttttttgtg gaatggatgt atttagtcgt tccttttcct ggttatcggg 183721 aggctgctga gagaccaagg aagaatccat ttcatcatca tctgaagact tttctctata 183781 atcttcattt tctacaggtt ccctctcagg atcttcacct tcaggattac tgccattgac 183841 tgctgaaaca aagagagaat ttaacaactt ctgagagata tcacaacacc aggcaggaaa 183901 aaacaagtca ggtttcttat gccacagcca ccttgaacgt caatctgggg agtggcggat 183961 actgccgtcg cactcctttt ccatacttat tgagggtggg atacataact gataaatctt 184021 tccatctacc acaatgcaag cacactttaa agaaataaaa tgagtgggat ttcactgaaa 184081 gacttcaaat gtttgttctc ttcataggtg ttaattactc gacaagttca tattcacctt 184141 atactctgta gagcactcag gtaaggacta gggatgaaga ctatacatct atacagggaa 184201 ccactcatca tcttgaagga gtctggtaac tgaatagtaa gacgaaccac ttgtaggttt 184261 tcactatatt aaatattatc acactattat agcactaaat aatatcataa tgtcttacaa 184321 tgagaaaatt gaaacaattc caataagcaa aattaaagag ggcacagcat attcatacaa 184381 aagtttatta ttggccaggc gtggtggctc acacctgtaa tcccagcact ttgggaggtc 184441 aaggcgagca gatcacccaa ggctgggagt tcaagaccag cctgaccatc gtggagaaac 184501 cctgtctcta ctaaaaatac aaaataagct gggcatggtg gcacatgcct gtaatctcag 184561 ctactaggga ggctgaggca ggagaatcac ttgaaaccgg aaggcagagg ttgcagtgag 184621 ccgagatcgc gccattgcac tccagcctgg gcaacaagag cgaatctgtc tcaaaacaga 184681 aaaaagttta ttatttggca attaaagaga ttactacaac taatgaagct ttgggattag 184741 aagtcagtag tgtttctctt tgtgtaggaa aggaagtagt gagtgggtac agggcacaaa 184801 agggattttt gggatgctaa gaaaagtcca ttttgggtgg tgcttagtgt gtacattttg 184861 tgaaaacttg agctatatat ttacaattgg taggcttttt ctatatttcc attatacttc 184921 agaaaaaagt cagtgtaagc agattataaa gactgggtaa caaagtgaaa aagaaccatg 184981 ctgtggatat acttgtgtga caagggaccc ctcaactcag gcctactgag agtcaagggt 185041 tcaaatcaac caccaagaca tgggggctga tcctgaagct ggagttggaa tctctgctgg 185101 attcagcagt ttcacaccaa tagatttttt tttatttcac tttttttttt ttttggcaga 185161 gatagggtct tgctatgttg cctaggctgg ttttgcactc gtgtactcaa gcaatcttcc 185221 caaagtgctg gggattacag gtgtaagccc ccacacccag ccttaattat tttttctttc 185281 tttttttttt tttttttttg agatggagtc tcgctttgtc gcccaggttg gagtgcaatg 185341 gcaccatctc agttcactgc aagctccgcc tcctgggttc actccattct cctacctgag 185401 cctccccagt agctgggact acagacgtcc accaccgcgc ccagctaatt ttttgtattt 185461 atagtagaga gagggtttca tcgtgttagc caggatggtc ttgatgatct cctgacctcg 185521 tgatccaccc gcctaggcct cccaaagtgc tgggattaca ggcatgagcc actgcgcccg 185581 acctaattat tctttttaat cactccctta ggataatgaa atgtcaattt ttaccatagc 185641 tcttgacatg aactaccatt ctgctttcaa aaacagctct actaatttca tgtgaatatt 185701 aactttacct aatcttgtta gcattaaatt ttaaaatatt taataggtat aaaatagttt 185761 ctcattgtag ttttactttg gattttcatt atgcaaaata atctactggg ttaaggcttt 185821 tataaatgta tatctttgtc aatgatttct tgtctaaata atttttaact caactcttcc 185881 tctttaacta cacttggact gcttccaacc ttgtgttttt acaaacaata ccacagtgaa 185941 taatcttctt tgtacatcat ttcgaatgtg ttctgttgct atatctgtag gtcagatttc 186001 cagaattgaa ctcctggaac aaaggataag atgggtagtt acagatgcaa ttacccttat 186061 acagggacta taccatttta tatgcccacc agcaatatac ctgtttctca acaacctcac 186121 catcaatgtg tttttatttc tataaatatg aaatacgcaa aatgttaatt cactagaggc 186181 ttaatttgca cttctttttg ctcctgtcac gcaggctgga gtgcagtggc gtgatctcgg 186241 ctcactgcaa cctccacctc ctgggttcaa gtgattctcc ttcctcagcc tcccgagtag 186301 ctgggattat aggcatgccc caccacatcc agctaatttt tgtattttta gtaaaaatgg 186361 ggttttgcca tgttggccag gctggtctca aaactcctga cctcaggtga tccacctgcc 186421 tcggcctccc aaagtgctgg gattacaggc gtgagccacc atgccccagc cttttgtacg 186481 tatttgaatc tgtcaatctt ttctttcaag gcttctgggc ttttgaatct taattacaaa 186541 aaccttccta tcccatggtt ataaaggaat cactcatatt ttctaaatgt gctttttata 186601 tttgttttaa acatttaaaa tctttgatcc atgtggactt tttcttggta tgtaaagtaa 186661 ggtactaatt tatcaagtgg ctgtcagtcg tcctaacacc ttttatcaaa aagtccccct 186721 ttatggttag gctttgtgtc ctcgcccaaa tctcatctta aattgtaatc ccacatgtca 186781 aggagacatc aggtgaaggt aactgagtca tgggcggggg ggccttcccc catgctattc 186841 tcatgatagt gagttctcat gagatctgat ggttttataa ggggctcttc cccctttact 186901 cgtacttctc cctcctgctg ccttgtgaag atgatgcctt gcttcctctt gcctttgcca 186961 tgattttaag tttctggagg cctccccagc catgctgaac tgtgagtcaa ttaaacctgt 187021 ttcctttata aattaccaag tctcggacaa ttctttatac cagtgtgaaa atggacgaat 187081 acaccaactt tatcacacac taaactccca tgtgtatttg aggctatttc gggactttct 187141 gagatcacag tcttgttcca agggaaaaat gctattggta ttgttgaaaa aaaacacatc 187201 aaattaataa atcaacaaga ggagaaatga cctttttatg atattgtctc cctagccaaa 187261 aacatagtat tttttttttt tgctttcgtt gaaatctact cttgtccttt agccttgttt 187321 taaagtttcc tcataaattt gcacatttct tgagtttatt cctaagtatt taaactttca 187381 tgtttctatt ctaatggggc cttccactgt atcttctaaa tgaccccatt tatatatgtg 187441 aaagccaatg atttctgttt attcacttta tttcccacta ccttagtgaa tggtctcatt 187501 ttttcccatg agagtactta agacaaataa atgttcttat ttttaagttt tctgattact 187561 ctgttctacc tataaggaag tatctgttaa ctattttgtt aaatgttctt ataaaggaga 187621 ggcttcaatt ttgtcaagta tcttttcagt aactatacaa gtaaacaaat gatttttctc 187681 ctttatatat aagaaatatt aatagatttg tattatttct gtttttttaa ttattttttt 187741 tgagatggag tcttgctctg tcccctaggc tggagtacaa tgatgtgatc tcagctcact 187801 gcaacctctg cctcccgggt tcaagtgatt ctcctacctc agcctccaga gtaactggga 187861 ttacaagcac gcaccaccac acccagctaa cttttgtata tttttttttt agtagagatg 187921 gggttttgct atgttggaca ggctggtctc aaactcctga cctcaggtat cacccgcctt 187981 tgcctcccaa agtgctggga ttacagccgt gaaccactgt gcccagcatg gggactggat 188041 ttcaacacga gatttggagg ggacaaatat ccaaactata tcagcaacct tgcgtcaaag 188101 gggtaaaaac aaattaaaat tgcatatttt ttctagaaaa cattatacag accttcatat 188161 tcataggttc tgcatctgag gattcaacca accaaggatt gaaaatattt gagagaaaaa 188221 aaaaggatgg ttttgtctgt acatattcag tttttttgtc attccctaaa caatttagta 188281 taacaactat ttatgtagca tttactttgt attaggtatt ataattgatc tagaaataaa 188341 atatatggta ggatatgtgt ggagtatatg taaatgctac accattttat ataatgaatt 188401 tgagcatcca tggattttgg tatctgagtg ggggtcctgg aaccaatccc ccatgaatac 188461 caaggacaag tgtatatcag aatttatggt ataacctaaa gtgatgctca taagagagct 188521 catggctgta aaaacatgta ttaatttaga aaaatgagaa taataaagcc tctgattcaa 188581 agttagaaaa acaataggat aaatttaaaa gaatgaaggt attaataaat accaattaga 188641 aagtaatgag ttaaaaatag aaaaaaacta gtaagattta agcaaaaata aaccaatttc 188701 catagagaaa attgaaatta tcaaaaaaag actagccacc tacccaagaa aagcaccagc 188761 ctcagacagt tccacagaaa attcttctaa acttttaaaa attaaataat tcaaatgcta 188821 ggaaaatttc cagagtatag aaaaagaagg ttgggaggcc aaggcgggca gatcacaagg 188881 tcaggagatt gagaccatcc tggctaacat ggtgaaaccc tgtctctact aaaaatacaa 188941 aaaatcagcc ggacaaggtg gcaggcacct gtagtcccag ctactcggga ggctgaggca 189001 ggagaatggc atgaaccgca gggggcggag actgcaggaa gccaagatcg caccactgca 189061 ctccagccta ggtgacagcg agactccgtc tcaaaaaaaa aaaagaagga aacttccaaa 189121 ttcaaacaag agcgaaactc tgtctcaaaa aaataaataa ataaataaag acagagaaat 189181 ctgagcccca gtgatggagg tggggcctag tcggaggtgt ctgcatcatg tgggtagatc 189241 gctcatgaat ggcttgatgc cacctcaaaa taataagtga tttctcactc ttagttcact 189301 aacaatgtgg ctgtttaaaa gagcctggca ccttcctact ctctttcttc ctctcttgcc 189361 atgtgaagtc tgctctcctt caaccatgag tagtttcttg aagccctcac cagaagcaga 189421 cgctgacgcc atgcttcttg tacagcctac agaaaactgt gagccaaata aacctctttt 189481 ctttataaat tacccagcct caggtattct tttatagcag cacaaaatgt actaagacag 189541 ttgcttaaga gttttaaatt ttccaggtgg aaggaacttt aatttactga tctttattat 189601 taatttctag ttgtactgca ttgtgacaga tcctacttta tttctactcc ttggaatctg 189661 agtctttgtt tgttgcctag tatttagtca attttcttca atcttgaaaa ggtgtattct 189721 cctatcacag cacacagtag ctataataag aattatctta ctgaaaatct gtttaagtgt 189781 tctacgtttt tccttatttt tacgtggttc acttggccaa cacacacatt ctctgtttcc 189841 ctaggttatc aaggactgtt ctttgattcc tgccaccaca atgttcaatc cgttatagtt 189901 tttcctttgc aaaaattggt taaatgttac tgtttcaatt atcatcacct ataaattgat 189961 ggctccccaa tccctatatc ctcacatctg aactccaatc cccaatttcc aaatatctgg 190021 aggataatct cattttattt cccatcatgg attcaatctc atccaaaatc aattccacct 190081 tcttcactac tctgccccgg gtccaaatct tacctcagtc ttgcgatagc tcattatacc 190141 tataaaaatt tcacactgct tatcgtttga tgaatgactt gtcttacaaa ccaaattgga 190201 agattacaaa aggcaagaat ataacatctt cgtaacctga tgaatacaat tccagaaccc 190261 aatcctgcct gaggaaaaaa ctagtacatt tatgaaatgc tggtaaatga ctaagcttaa 190321 gccaccagct tcagggtacg ggtggaggaa gtggggaaca gagggtgagg attgggtctt 190381 aggggccctg aagtgcatag tcgaagtgat cttggaggca atatatccac taaccagtca 190441 cccaaaataa atgccagctc tgcccaagac tcaggaaaaa ctcagtgtta ataaatctga 190501 tctttttatc cttttctcta aaataaagaa gatatcacat agagcaacct gcgggaaatt 190561 aactacactc attacatacc aatacttctg accatacggt ataattacag ctaagagttt 190621 gacagtcata atggatttct gtcaccccag gcaacttcta aaacctattc tctgtacaac 190681 ttcaccctca gaaacaaaaa gccctgaagg caaagtaaaa agtgtccggc actcagtaaa 190741 gacatttgtg cgggcctctg cctctagggt tgcaacattt ctagagaaaa ggtaaagcat 190801 ctctaagcaa attatagaga ggctgctgct tgatcagttc atacacagca ctgggggaca 190861 cttgtcaact attactggga aatgaggaca aataaagaat gccccattca ccaaccatac 190921 acggaattca ttgacaaaat tatgagaaat gcatctagga gacagtacag ccctgggcca 190981 ttactgcaaa ttctgatgtc agattccctg agtccgaatc cctgttcccc tacttacaac 191041 tgtgtgaact ttagcaaatt ttgtaacccc tctctgcctc agttctgtca cctgtaaggt 191101 ggcagtaatt tctataccac tcagtattac tctgagaagt acatgagaga ataaacacat 191161 gaaaagcatt tggactagtg cctagcatgt aacagcatat aagtgctcat taagtgttaa 191221 acattacttt caccaatagt ggaatatatt ttcacttagg ctagtagtct tcacatctat 191281 tgaacatcaa aaaggggagg gggattacaa gagttagtca tgtgttcaat gaaaacattc 191341 aaatatacct gttgagaaca gagttcagaa aatgggttca gaacagtttc tatatccaca 191401 gtttctatgt ctgcaggttt tccctacctc tcccctcact gcccccaaga tacagggaaa 191461 aatggccacc cacccacagc tggagttctg tggcaaggct ctgctacatt gaataaaaca 191521 aaagtaaaat aaaatgtgtg tgtgcacata ggcttaccag gtgtgaatcg tcccccaaag 191581 ctaaagagat gaaggacatg cttttcccaa agcaaacaac ctgaacacac aaaggctgtg 191641 atttcttacc cagtaacatc atttccccga cactgctgtc atcttccttc tctgacgtga 191701 gttgttgagt aggatccaag ctcacacatt cttcctggca gtgaaataca ttcaaatcct 191761 caaagaccac cagctcctga aagagcaaga ggcccctttc atcttagtaa ctgaggttca 191821 ctgacagccc tactggtaag aaaagctgac acacaaagat caagggaagg gatagcaaga 191881 gggacctata aatcccacag aggtgcagca aagacaggac agccaggctg agaagggctc 191941 aaggaaaaag cagaagaaaa acacccaaga aggccaactc ctaagaatgt gtttttacac 192001 acaggttggc caagcacgga aggtgatgga ttcaggcagt aaatgggaaa accaaaccca 192061 cctcagggtt agctttcaaa tacagagaca ctgtctcttg ttccttttgg actcctttgg 192121 gcagaagctt caccaaggga cgaggatgca ctggggaaag aggaaggttt agttagtgaa 192181 aaactcagtg actctaacac agagactccc cataccgcga ggccaggctg tcctcattcc 192241 tttgacaggt aactcatggc cttttctgct cctgtgagcc cttatagagt tcaagccata 192301 aactctaggg tcttgaacct atggcatctc cctgttatga aaagaatgac ttctctaagc 192361 acctttcaga atttcccata gagtattact aaaggagagc ttagggcaac gtaaagtgtg 192421 aagatgccca gaaaaagcac aaaatttcta aaattccaca tcaacctaac tcaataattc 192481 atctgattta tccttaaaag atatttccct cagaatgttt gggtaaaact gacactacag 192541 aaagatgcac actgtttatt ctgtggcctt actcacaccc agcacttgag aagaacttcc 192601 tctttccaac actagaattg actccttttc caatatacac catcctaact tgaatttata 192661 actaacacag aaacctatct tagactccag tacctcctcc tcctcttcta gtgaaacaag 192721 catgtgcttc tcacctctgt tctgaaggct tgagctcaca tctttcataa taaccaaggt 192781 ctccttgact ttctgactgg gctggaccag gcttggcttg ggcaagaagg tgaggaagtg 192841 ctccagcact agctggtgga tggtcttggg cccgttagtg gcatctcgaa gtaggtcttg 192901 gcccagcttg gagtctggcg gaactctcag tataaaggac tgctttggct ttgggggcat 192961 aggaaccact tttgcagcca tcatgccttg caaccacaca ccacactcgt ttcgcggggc 193021 ctccagagcc acctcttact agaggaaatc tgccagagag ccaagctgta gacagagaaa 193081 ccagggatta cccaaaagac caggcacggc attactgcac tccaatatgt ggcatggctg 193141 gtgaggctac atgagatcta aagaaaacga cagctgggat agggaaatca taactgaaac 193201 gcagtatttg aacaagatat gcttaggaag atgtgaaagg aagatcctga gaatgaaaaa 193261 cagagactgt gcaaacctca agtccaaagg gaagggagta agggtggagc ggagaaggcc 193321 aaggtccagc ctcctgagaa atacaaagtg tggccaggta ccggtggctc acacctgtaa 193381 tcccagcact ttggaaggcc ctggcgggtg gatcgcttga gtccaggagt tcaagaccag 193441 cctgggcaac acagcaagac accgactcca taaaaaaaaa aaaaattagc tgggcgtggc 193501 gacacaagcc tgtggtccca gctactgggg agctgggagg attgcttgtg cccaggaggt 193561 cgaggctgca gtgaactgtg atcgcgccac tgcactgcag tctgagcgac aaagcaagac 193621 cctgactcta aaaaaagaaa ggaaaagaga aaggaaagaa ggaggaaggg aatcagggaa 193681 gagaaagaaa ataaaaggaa aaagaaagaa aagacagaaa gaaagacggt gtgtaaaacc 193741 caccaggaat aggttaaatc aggcttggag aaagagcaat gggctagaag acaggaaatc 193801 tggggtcaat accgaggaca tctgcctaaa agcaggttgg tcactgaatc tggccatact 193861 gtacccctcg aaccgaagct ccctccggtg ccctttgggc ggggaggcgg ttggtgactc 193921 tcccggggag cagatgcaag gccgaggagg tgtccacaca ccgcctgccg actcctctcc 193981 gccgtcaaag ctctgctgag agcggcaggc gacatcccac taaggaccgc cgggccaggc 194041 tcactctggg gcctcttccg ctggtcaagg aacaccttta ccgtaaagct cagcgtgcgc 194101 ccctgcctga ggcgctcacc aggctcccta cccggccttg ctccctcagc aacggacacg 194161 ctccgctccc cagaggcggc ctcagcctgg ttcccgccct cacggagccc ctcacctctc 194221 ggggcctctg cagcccctga gcgtttgctg gggacggctc agagactcag gctccgggag 194281 agatagaaaa actaggcgcg agcggtcgag ccctcccctc gcccttccga gtgccctcac 194341 aggtcgccgg cgactattcg ttcgcgccgc cgccagttga ggagaacggc agggactcgg 194401 tgccttctgg gaagtcgggc gctctgcggc tgtgacgtca caaccggtgc cttgtttccg 194461 gtgcagaagc ctggtctccc cgttcggagc cggcagtctg cgcttgagac gttaagactt 194521 gagacaggcc agaggagctc tcagggccgg agggaggcca ggacggctgt agcctctctg 194581 tggttctgcc tggaagacgg aaggcaggtg gttggctcta gtcatccacg acgggctggc 194641 acctctccag ctgcggccag tctaacccca gggcctgctg ggaaatgtag ttcgaatgca 194701 aacaaccaat ggacgaccgt caggcgcggc ggttggggcg gggcaggccc ccccaccgcc 194761 ccaccccccg cccacccagc gcccgcgctc cccccacccc ccaccccgcc acccccctac 194821 cccgccactc ccccaccgcc ctactctccc caccccccac ccccctactc tccccacccc 194881 ccacccccca ccccgccacc ccccacggcg ccacccccca acccccaccc ccctactctc 194941 cccacccccc tactctcccc acccccctac tctccccacc ccctactctc cccacccccc 195001 tactctcccc acccccccac cccctactct ccccaccccc ctactctccc caccccccta 195061 ctctccccac ccccccaccc cctaccctcc ccaccccccc acccccctac cctccccacc 195121 cccctaccct ccccaccccc ctaccctccc caccccccta ccccgaggct taaaggaagc 195181 aagctatctc tccgaccgga aaatcaagac gcctccgcgg tttccgcctt ttactgcggt 195241 tctccagtaa aaagactgcg gaggcggaca gggtgtggcc gccatgggac tccgcccccg 195301 ctctggtgac tcccataggt tagagatggg gcaccgacat tgcccactcc agcttgctaa 195361 cttctacact gctgtccgca ccggccgtct tgtttttaaa gacatcttaa ttgcccttct 195421 cctactgtct cggtttcctt tctcaatttc agcttcctac ggaggccgaa cagagttttg 195481 tgtttgtgcg ttgtatccac actcggttcg ttccccagtg acgcactgat cagggttcgg 195541 acaacttgtg ggcagggatg gccttatttc tcgatgtaat accagcctcc agaaagttga 195601 caaaagtaaa aatataaatg aatgcctctt cttaaagtta attatacctg actaatggct 195661 gacgatggct taaaatatgt tacaaagaaa cgagtatgaa aactgtgtgg caccaagaca 195721 ttctagaaaa ataaggaata tctgcctttt cagatcaaaa tttactaaaa agcctccgtt 195781 aatcaaagca ttatggtttg aattatgagg caaaacagat aactgagatg tcttgaagta 195841 aaatataaac ctgtgtgatt gtatttttta agaacttttt cttgactctg ttaacaaaca 195901 agaaatagac tggaagataa ttgtaaagat tagctacaca aaagattata cattcctaag 195961 acaaaatgaa cccctgccaa ctgacaaaaa gcaaaggcaa tgaggagctc ttcccgagag 196021 gaggaaaact aattggtcat taacatttaa aaagatgctt aaccacacta ctagtcaaga 196081 ttatcacttt taaagggaac cacatttttc acctatcaaa caatgaaggt ggagaaatag 196141 gcactccttt cgctggtagt gtggtgtgtg aagtcatctt gtaatatata ttaaaattaa 196201 aagtacaagt atgtaccctt tgaccttcac aatctcatct gtgaatctga tgtctacaaa 196261 tttcttgtag aattctttgt agtgacaaaa actagaaatg acatgcatgt tcatacgtaa 196321 ggaaaggagt aaattggtgc attcatactc tggaaattac ccagctatga aaaagaatgg 196381 attggaacta tatgtacacc tggagggatg gccataaaat attacatgaa aaagtttcag 196441 agctacatat attatgcata attatatttt tggaacaaac aaaacacctt gtgtatgttt 196501 gtgttgaggg atacacaata gattattaaa atggttactc atgggatggc caggcgcagt 196561 ggctcatgcc tgtaatccca gcactttggg aagccagagg tgggctgatc actggaggtt 196621 aggagttcaa gaccagcctg gccaacatgg ggaaacccca tctgtactaa aaatacaaaa 196681 attagccggg cgtggtggcg ggtgcctgta atctcatcta cttgggagga ggctgaagca 196741 caagaattgc ttgaacccag gaggtggagg tggacgttac agtgagccaa gatcgcacca 196801 ctgcactcca gcctgagtga caggcacacc agcctgggcg acagagtagt aagactctgt 196861 ctctaaataa atacgtaaaa taaaaataaa atggttactc agagggaggg aatggaaatg 196921 gagaaaaggg agagattaca cttttttttt attgtgtcac ttcatttaat gagcacgtca 196981 atttggtaac taaaaataat aaaaccttaa aaaaaaatag aacaataggc cgggcatggt 197041 ggctcatccc agcacttttg gaggctgagg tgggcagatc atctgaggtc aggagttcga 197101 gagcagccta acatagtgaa acccgtctct actaaaaaat acaaaattag ccgggcgtgg 197161 tgatgcatgc ctataatccc agctactcag gaggctgaga caggagaatc atttgaacct 197221 gggaggcaga ggttgcagtg agctgagatc atgccactgc actccagcct gggtgacaga 197281 gcaagactcc atcttgtggg ggcggaaaaa aaagaaaaaa gaacgataaa gattttttaa 197341 aacaatttga tggaaaaaaa gaacgataaa gattttttaa aacaatttga tggaaaaaaa 197401 agaacgaaaa agatttttta aaacaatttg atggaatcct gtttttagaa tctaatccgt 197461 aaaataaatg acataaatgg atttgaaatg ataccataaa ttaagggtgt aaacacaata 197521 tgtatctcta taaatgcctt ttaacatgaa aagatggcta ttgtcacata caattaaaga 197581 aatgaaattt aaaataataa ttacaaaaat aatattcagt attctaaaag caaattaata 197641 caattattta agagggcaat atgacacttc ctattttaag aagtaaactg ttcataaact 197701 ttgtaatggt caattctcaa ctcattttaa agaaaaactc acgcctgtaa tcccagcact 197761 ttgggaggcc gaggtgggcg gatcatgagg tcagaagatc aagactatcc tggccaacat 197821 ggtgaaaccc cgcctctact aaaaatacaa aaattagctg ggtgtagagg tacacgcctg 197881 taatcccagc tacttgggag gctgaggcag gagaatagct tgaaccaggg agttggaggt 197941 tgcagtgagc caagattctg ccactgcact ccagctcggg tgacagaagg agactccatc 198001 tccaaaaaaa aaaaaaaaaa ggttacatca tgatatgttt atacagtgga agccaacaga 198061 accattgaaa agagatactt taagtgaaaa ggcaagttat aaagcagtat gtataataag 198121 tcgttttaaa aataaaatgt taggcttttt ttttttgaga tggagtcttg ctctgttgcc 198181 caggctggag tgcagtggcg tggtctcggc tcactgcaac ctccatctcc caggttcaag 198241 caattcttct gcctcagcct cctgagtagc agggactacc ggcgcccgcc accacaccca 198301 gctaattttt gtatttttac tagagacggg ggtttcacca tgttggccag gatggtctcg 198361 atctcttgac ctcgtgatct gcccgcctcg gcctcccaaa gtgttgggat tacaggtgtg 198421 agccaccgcc cccagcctag gctttttttt tttttttttt aactggagga atatacaata 198481 acaagagtta acagttgctt tctagaaatg ttggccgggg gggagggaaa gagagttaat 198541 attggttgtc tctaaggagt aggattagaa ggaatttgtc actttttatg ttttataaat 198601 ccatacattt tcaaataaca agtgttcctt ttgtagttga aaaaaacaac tatttgaaag 198661 gtgagttaat ttagtatctg atcatctagg agtaaaacta agtgaagaaa gaatttccta 198721 tttaaaagat cttgaatgca aatcaaaact ataatgagat accacttcac acacactaga 198781 attactataa taaaaaatac cataaaaatg ttattggtga ggatgtggag aaaatgaaac 198841 cctcctatat tgctggtggt ccagctactt tggaaaggac ctaaaatggt ccagctactt 198901 tggaaaacag tttggcagtt tctttttttt tttttttttt ttttgagacg gagtctcgct 198961 ctgtcgccca ggctggagtg cagtggcggg atctcggctc actgcaagct ccgcctcccg 199021 ggttcacgcc attctcctgc ctcagcctcg tcagtagctg gaactacagg cgcccaccac 199081 cacgcccggc taattttttt ttgtattctt agtagaaacg gggtttcacc gtgttagcca 199141 ggatggtctc gatctcctga ccttgtgatc cgcctgcctc ggcctcccaa agtgttggga 199201 ttacaggcgt gagccaccgc acccggccgg cagtttctta aaatagttac catgtgaaca 199261 agcagttcaa tttcaaggta taaactcaag agaaatggaa atgtgtgttc acacaataac 199321 gtgttcacac tagcctaaaa gtggaaacaa cccaatgtcc tgtaagtgat cagtggatga 199381 acaagaagtg atatttacat gtaatcatat aatggattat tcagtgataa aacaggatga 199441 aagtagtgac acatgctgca acgtggatgg gccatgaaaa cacactaagc aaaagaatca 199501 gtcgcaaaaa acacatatcg tatgattcca cttaacatca tatgtccaga ataggcaaat 199561 ccatagagac agaaagtagc ctagtggttt acatgctctg gggaatgggg gagaatgggg 199621 agtgactgct aatacatatg ataaaatgtt ctggtattag tggtgatatt tatacacaca 199681 atgcatatac taagaaacac tgaattttcc actttaaaag gatgcatttt atggtatgga 199741 tttagagctc cataaaataa ccactaggca gagaaatttc atcagacaga ttcaaatgtg 199801 tactacctaa aacatttatg tttttgtttt ccttttttaa tttttttatt ttttgagaca 199861 gggtgttgtt ctgttgccga ggttggagtg cagtggcaca atctcggctt actgcagcct 199921 ctgcctcaaa ggctcaagca attctcctgc ctcagcttcc ggagcagctg ggactacagg 199981 tgctcaccac catgcctggc taatttttgt attttctggt agagacgagg tttcgccatg 200041 ttgcccaggc tggtctcgaa ccctggatgc aagcgatctg cccacctccg gctcccaaag 200101 tgttgggatt gcaggcgtga gccaccgcac ccagcctaaa acatgtatga agaagtacag 200161 caaaacacga agttcaatat gtagattcca aatagcaagt attcaaagga actgtacaat 200221 ttccatagaa tgaagcaaaa agcagaaatt cacatggaaa aattcacagc ctcattaaca 200281 ggtaatgaaa taaaacagaa gtaaggcctc attacatccc tattaaatca gcaaaaaaat 200341 tttaaatggc agtcattgct ttgggcatga aatattttaa aaatctttgc aaaggatggt 200401 atggcaaccc gtaggaagca ttatcagata tttcacattc aggaaccatt ttaggaataa 200461 cacaggagaa aaaaatatgt atatggtgag gtggtggaaa atatacacaa agatattcat 200521 gaaaagctta taacaaggga gaatcgggaa taagacatga cccaccgcag ggaacagcta 200581 agccgtaggt catgtgaact gtcctgggat gtggattact cttatagaat aaaactcgtg 200641 gaggaaagcc cagcaagttt acctgctctc atcatagcca tggagtatct gagtctaatc 200701 tacactctag tagtgaagac agaggagttg gcataggagt ttggaattta atcttcattt 200761 gatttttttc ttcttacatc cactttttgg agacagggac tcactctgtt gcccaggctg 200821 gattgccgta gtgcagtctc agttcactgc tgcagcctcg atctcctgcg ctcaagccat 200881 cctcccaccc ccatccctat ggctaatttt tgtatttttt gtagagacgg gttttcgcca 200941 cattgcccag ctggtctcaa tccctgggct caggcgatcc acccgccttt gcctcccaaa 201001 atgctgagat tacagatgtg agtcaccatg cccggcccct ttacatctct ttaaatgaag 201061 gacaaatgca cgggatgtgt gtgaagtaga acctaatatt ccacaacccg cagattttcc 201121 catacaaacc aagacagaat aatctgacac ttggaataca tgccaaatgt ttttccatac 201181 agtagtccca tctgcttcct gaaatgcagt cgggaaatgg gattccacag tgattgagtg 201241 acaaatgcaa atgaccagat tttctaattt ttttattcat gaggcccagt caattctctt 201301 aagaatcatg gcccggcatg gtggctcatg cctgtaatcc ctgcactttg ggaggctgag 201361 gtgggtgcat cacctgaggt caggagtttg agaccggcct ggccaacatg atgaaacctc 201421 atgtctacta aaaatgcaaa aaattagccg ggcatggtgg cgggcgcctg taatcccagc 201481 tacttcggag gctgaggcag gagaatctct tgaacctggg aggcggaggt tgcgagccaa 201541 gattgcacca ctgcactcca gcctgggtga caagagcaaa actttgtctc aaaaaaaaaa 201601 aaaaaaaaga atcataggcc gggcacagtg gctcatgcct gtaatcccag cactttggga 201661 ggctgaggcg ggtggatcac ctgaggtcag gagttcgaga ccagcctggc taacatggtg 201721 aaacccccgt ctctactaaa aatacaagaa aaccagcctg gtgtggtggc acgcacctgt 201781 aatcccagct gctcaggagg ctgaggcagg agaatcgctt gaacctggga ggtggaggtt 201841 gcagtgagct gagatcgtgc cactgcactc cagcctgggc cacagagcaa gatttggtct 201901 caaaaaaaaa aaaaaaaaat catagttcat taatccaggt acacaaatac ttctatccca 201961 atccagtctg cttgcgtttc tctaggcttc ccatatcctc aagacagaag cagagagaac 202021 aaggggacat atatccttgc cgtggggttc tgagggaaaa tgagggccaa atcttctgtt 202081 caggagcacc tgagagtgcc acccaccagg ggcggattat gcaacgactc cgtacttcct 202141 cctctgaaat ccatggtgtg tcatcagtac atgtcttcac ccggattgac taacatgttc 202201 gttcctaact tacctctgct ataatcgggt aggctccgtg ggcacagtaa ctattttgtt 202261 atttttgcat ttcccatagc agctagcacc tagtcggcat tccgtgacta ttgagtgtga 202321 atgcaagata caaggccaga agcaggaaga gagatgcagt gttgggcatt cagtcaggcc 202381 cctgaccacc cactggacag atagtcagag gagctgtgtt cttccctcca tcacgtgtcc 202441 cagggctgaa gataggttga aggggcccag agaaagagca gctggcgaat gtatagatgt 202501 gggatctggc tgtcacattg taaaaggaga tgcttccaac tctgtagtcc acgaagatgc 202561 ccacacgctt gggaggctcc tttattagca ggcgggtcgg gggaacgctg gacgcctggt 202621 actcattttc cttcatcatt atcaccaccc agtagccatt ctctggcgac agagtcatgt 202681 tccctttcct gcttatggat gtcttgcagg ctcccaggat ccatgctgtc ttgtctccaa 202741 cctccacctc ccagtaacgg cggccagaga ggaaactcgg agagcccaga acaatgatac 202801 agctgtcaaa tctttgcggg ccatcaggca gcctctccca cttgtttcca agtctaacac 202861 tcttcagatc atcagagaag atgaggttgg ggtaagcggt ttctgcatcc agaatcacat 202921 taactgcaaa gaaaatttga atacctaggt aggggtccat gggcaacatc cctacagggt 202981 tctccccacc tgcaggaaac agggacaggg tagttcttct ggaacgtggt aggggagagc 203041 acagggatcc agcaggccag ggccacttgc cttgatctgg gcacttacca gcatgtgcct 203101 gagcgccaat cagctccgga actacggaga aaaatcagat agggaaaaaa atcctgagca 203161 ttagcattaa gaggggctaa attacactgt cctgacaaca aggaaagaca aggaaccccc 203221 tgcttagggc cctgcatgct atgttgggta taatcccgtt cctcccagga acccaggcca 203281 tgttggggac tgtggtctaa tgagtcaact cagtcaatga gtcacgcaca gagcctgggg 203341 ggcctgccat gacctttctc ctacctttgc tccaggtgtt tggttttttt ttttgtcttc 203401 taaatagggc ccctcaagtc aacagcacaa gggaacactg caacaacccc aggccagcac 203461 acaccattac cgctggactc accattgaac atttccattt ctgaacgcag ggtttctaaa 203521 atgtgggaaa gggagcagag agaagctgga gttaggtccc tcagccaggg acagatggag 203581 gagaggttga aggcaggtca gcaagaccag gggaagagga gggaagtgag gggctctggg 203641 ctatgtggat cttagggagg aagtgagcat gcacctccaa tcttctccca agcccatcta 203701 cctgagaagt actttgtgct cttctccaca aactctgact tctggtggag gagttggatc 203761 ttttgtttta tctcttgagg agtggtccac ttttcaggga caggcactgt cttagcccta 203821 gagacaaaag actgttgacc agagaaggcc ggagcgaggg ggtggcccaa gtacccgtga 203881 gctggaaatg aactacattc tccacagggc acacctagcc cagcctgagc tacacaaaga 203941 aaagtcttcc ctggattcag aggtctaaat gctggtcctc aactttggcc acagactgtc 204001 atgtactggg gtgctttaaa gacatggata tcaaaaccac aaggagtttt cagaatggac 204061 ttaaaatgac attgatatta tctactagaa ttatctatgt caaattgatt atacacccct 204121 ttctttctat tgaaataaaa agataataaa tgaaaaaaaa acacacacaa caaaatatcc 204181 cttcacatcc attagaatgg ctattataaa aacagcaaca acagaaaaaa atagaccagg 204241 tatggtggct catgcctgta atcctagcac ttcaggaggc cggggcagtg ggatcgcttg 204301 agcccaggag ttcaagtcca gcttgggcaa catggcgaaa ccctgtctct attaaaaata 204361 caaaaaagta gccggaagtg gtagcacatg cctgtagtcc cagctactca ggatgccagc 204421 tactcagaat cacccgagcc caggaagctg agcctgcagt gagctgtgac tgtgccactg 204481 ctctccagcc ttagtgatgg gagtgagacc ttgtctcaag aagaaaagaa aggaaggaag 204541 gaagaaagga aggaagggag ggagggaggg aaggaaggaa agagaaatgg ctgggcacgg 204601 tggctcacgc ctgcaatccc agcgctttgg gaggccaagg tgggtggatc acttgaggtc 204661 aggagtttga gaccagcctg gctaatatgg tgaaaccctg tctaccaaaa aaattagctg 204721 ggcatggtgg cacgctcctg tagtcccagc tactagggag gctgaggtgg gagaatccct 204781 tgaacctggg aggcagaggt tgtagtgagc caagatcgcg ccactgcact ccagcctgga 204841 tgacagagca agacgctgtc ttcaaaaaaa aagagagaga gagagagtga aacagaagaa 204901 aataagtgat ggcaaagatg tggagaaact ggagcccctg tgtgctgctg gtgggaatgt 204961 gaagcgatgc agcccctgtg agaattagga tggtggttcc tcaaaaaaaa ttaaacacag 205021 gataaccaca tggtcccgca gttccacttc tgggtaggta cccaaaataa caaagcaggg 205081 tctcaagcag atatttgtac acccttgttc atagcaacat tattcacaaa ccaaaagtta 205141 gaagaagtcc cgatatccat gaatggataa gcaaaatgta tgaacacaca ctggactatg 205201 attcagccat aaaaagaaaa taaattctga tacacgttac aacacggatg aagcccgaag 205261 acattatgct aagtgaagaa gccagacaca aaaggacacc gggattttgt aaagctccca 205321 ggtgatttta aggcttagcc tcttcctgtc ccttccctcc cctccccttc ctgctgggct 205381 tagcccagct ccactccaaa caagcttccc aaggccacca gtaggcctga aggggcagct 205441 ttcatcagat gaactgcacg ggacacaatg cactcggaag gtttcttagg gagcctccat 205501 cttgtttcag caggatgggc tgaattcacc cttcccagtc tgcagctcca tacccggcca 205561 gcctccctgc tgaccagatg cccttctccc tatcaaatcc agagaggctt tgggaagctg 205621 cagaaaaaag gaggagtctg gaatcacaga ccccggggtt gggaacatct ccctcccagg 205681 tcccttctgg gactgtctcc cccatatgct ttctgcaaga caccccaggg tacaccacag 205741 gacctcgctg tacctgtgca agatgtctcc aatgtcctag gagaaaaaag aaggaaactg 205801 tcggttacca ggctcctagt ggctgggctg actcctggcc tctacttctg ggctcctgga 205861 cttttagctc cccgctgcac ttaccagggc tccctgccct agggtgtcag tggaagtgga 205921 gcacctttct tcaagtctaa ttctaaccac gggaattcag gcatcccagg tggcgaggcc 205981 tgcaatggaa cggccctcaa gccctgctga tcccttctgg gaaatggcag agatcccctg 206041 gataccaggc caccctgggc tcctgagaga cctcatcaaa ggggtctggg cacaagaggc 206101 actgtgggtc accaagacca agtcctatcc taggccttag gggcttcacc cacttgttcc 206161 agcacggctg atggcagagc tgggagcctg aggcatcctg ataggcacag gggacccaag 206221 aaagccgggc ccaggcacac ccacctgcag aagttcccat tctgactggc actccttggc 206281 ctccagttcc ccaatcagcg catcgagcag ggcgatgtcc tgggatacgc gggtgtcata 206341 tgccttcctg atctgcccaa ccatctggcc cacgtcctcc agtgaggcca caaagaaatg 206401 ctcttgctgc tccaggaagt agtacacctg ctccagcttc ctctgcaccc gctgcttcag 206461 cgcttcagtt tgtttctggg gagcagagga cagggaggta tgggggtctg tgctgtgggt 206521 ggacgtggat gtccaggaac ccccagaagc ccacctcctg gaggtggata agaggtgggc 206581 ttatgctgcc tcttcaaggc ctagattcca gagcaggagg cgatattgtc tggaaccttc 206641 cagaaaagac tgagcatggc tggggctagc tctgggaaaa tgtccttaaa cctggtgtta 206701 aggtttaact gtgtcccttc caaaatttac acgttgaagc cctaaccccc agtactcagc 206761 atgtgatctt atttggagat aagagttctt gcagatgtaa ttagttaaga tgtggtccta 206821 ctgggattgg gtgggtccct aatccaatat aaatgttgtc ttataaaatg gggaaatttg 206881 ggctgggtgc attggctcat gtctgtaatc ccagcacttt gggaggctga gttgggcaga 206941 tcacttgaga tcaggagttc gagaccagcc tggccaacat ggggaaactc tgtctttact 207001 aaaaatacaa aaattagcca ggagtggtgg catgtgccta cagtcccagc aactcgggag 207061 gctgaggcag gagaattgct tcaactgggg aggcgaaggt ttgcagtgag ccgagattgc 207121 accactgcac cccagcctgg acgacggagt gagattccat ctcaaaaaaa aaaaaaaaaa 207181 aaaaccctaa aaaacaaaaa aaccagccca ggcacgatgg ctcacacctg taatcccagc 207241 actttgggag gccgaggcag gtggatcacc tgaggtcagg agtttgagac cagcctggcc 207301 aacacggtga aatcccgtct ctactaaaaa tacaaaaatt agccaggcgc ggtggcaggc 207361 gcctgtaatt ccagccacta aggaggctga ggcaggagaa ttgcttgaac ctcggaggtg 207421 gaggttggag caagccaaga tcgtgccact gtactccagc ctgggtgaca gagcgagact 207481 ccatcttgga aacaaaacaa aaaaacagag agtaaattat gaagttaaaa gtgttcttgg 207541 ccgggcgcag tggctcacat ctgtaatccc agtactttgg gaggccaagg cgggcggatc 207601 acgagttcag gagttcgaga acagcctgac caacatggtg aaaccccatc tctactaaaa 207661 ctacaaaaat tagccaggca tggtggcact cacctataat cccagctact caggaggctg 207721 aggcaggaga atcgcttgaa cccgggaggc agaggttgca gtgagccaag atcacaccac 207781 tgcactacaa cctgggtgac agagcgagac tccgtcacaa aaaaagaaag tggcgttctc 207841 ctcccttttc ctcaatccca tctttctgca gtagtcaccg gcataggttg ctctctacca 207901 tcttctggtg agtatgagaa agcatgacag gatcctcaga ggtgacccct gcctgctggg 207961 ggtggtctcc ctctacaggg atgagcttac ccttggctgc tggttaccct ctgtcccctg 208021 agaggaggtg gccatccctg ctgccctgtg aaccacagca gaatctcggg gacccctgct 208081 cactcttccc accttcctcc cagggacgga tgggccatca gccacctctg accttaccag 208141 aaagctcact gccttctcct ccccatagga tcgctgctcc tcccctgatt ttctcagctt 208201 cttcagatgc tccagctgct tctgaatttt cttctggaaa aacagcactt gttgaaaagc 208261 ttgaatttgg ctcctgccat ctttagctgg tgccaactct ggaggggaac atctccttct 208321 ggtagcaagg ctgagggtgc tgctgccagc ggcatgggga ggggtgctca ggaaagagga 208381 gggaggaatg gctgaattgc caaaggcagc agagctgaga ggcacttcct ctgagcctct 208441 ggattcagcc atacctgaag ccagctcccc ctggatctct gagcaacata aacctgtagc 208501 tcccctcttt gctggactgg tttatattgt gttcttgcca tttccagccc aagaatgctg 208561 gttaatgcac caacaaccca gagttgttgg gaaaatgaag taaggcccag tgtgtccaag 208621 tgcctggcag agaagagccc acaggcaggg agtgcctacc ttgtgttcca gggcgacctc 208681 ctcaatgggg cgcacccggt ggccttggtg ctcctgactc agactgcaga tgaggcagat 208741 gggctcatcg tgatcctcac agaagagcag ctggacctgc ttcaggtggc gcttacactg 208801 tggcaggggc tgggggctta ggcttcccgg gctcttcctt tcatgggagt cctggcaccg 208861 ggggcagcca ggtgagcggc tgcctgaggc ctgggggtgc ccagaaactg cctcggggaa 208921 gctgcaggaa tcacgcacac aggtaccgtc aactgggtct ccttcctggg cgtggcagcg 208981 gggactcgca gccgtgtctg gtggccttcc tggggacatg cagtggaaaa accccctgaa 209041 tggcaaaccc aagttgctta cacagaggta tcacaaagca cagcggacac agcccttgcc 209101 tgagatgtgc gagttctcag ttagactctg cccacttcct agcttgtcct cccccgactt 209161 ccatccagtg aggaagcaaa caggccactg ggcctgagag ctaagggcca gacctcagac 209221 attcctgagc cttggaattc ttgggcattt atcctaaaga aagacccatg gccttgtgct 209281 gttcctcctg cacagacata gatctagcag gtgatgccaa ggaaagagac attctgacca 209341 gtgttccttt gccacttgaa cccatccctg gctgcccata acgtgagtgg aggcttccag 209401 gggatggtga cttgtgtacc tcgctcctag ctggatccca gcccctagca gagtacctgg 209461 caaatgcaag catttggaaa atgagtattg aatgaaggaa taaatagacc actgctcaca 209521 tatgagttca ataaccatta aatgaaagca aagaaggggc tgggtggtgg ctcatgccta 209581 taatcccagt gctttgggag gccaaggtgg gaggactgct taagctcagg agtttgagac 209641 cagcctgggc aacatggcaa gagcccacct ctacaaaaaa ttaaaaaatt agccaggtgt 209701 ggtggtgcac acctctaatc ccagctactt gggaggctga gacaggagga ttgcttgaac 209761 ctaggaggtt gatgctgcac cactgcactc cagcctgggc aacagagtaa gatgccatct 209821 taaaagaaaa agcaaagaag gggggcatta tagggctaaa gactggaact catcatgaag 209881 acctaacagt tatgaatagc tctactccaa gactctaatt gcaacttcat aagcagacat 209941 gccacgagat acaagaagaa atagaaacac aaggccgagc gcagtggctc acgcctgtaa 210001 tcccagcact ttgggaggcc gaggcgggtg ggtcatgagg tcaggagttc aagaccagcc 210061 tggccaagat ggtgaaaccc tgtctctact aaaaatacaa aaattagccg gacgtggtgg 210121 tgggcacctg taatcccagc tactcgggag gctgaggcag gagaattgct tgaacccggg 210181 aggcggagtt tgcagtgagc tgaagttgcg ccactgcact ccagtctggg tgaccaagag 210241 agactccatc tcaaaaaaaa aaaaaaaaaa aaaaagaaat agaaacacac tcaatccttt 210301 agcccctgct acctcctccc acctcacagt gtcccagaaa ttcacagcac atctgtaaat 210361 gagactgcta gtagtcacca gaaattattc tccctccttc tctcgtgata gaattaattg 210421 tccacgtttc cctgtttccc tgcctcccat tgccccccaa atgccctgca tttctcagcc 210481 tcctttgcag ttaggtgtgg ctatttgaca ataatatgac caaagggatg taaatggagg 210541 tgtcacaagc agcttctaga aacttctaga gctttcccga gggatactat tcccctctgt 210601 tcctccatct tgctgcctgg catgaggacg tcaccttctg agagcagaag cttgaggaca 210661 aacttggtgg gacagagcaa cagcaggacc ccagggtcct ggcccacgga tctgcctggg 210721 accgcctgtg tatcatttat gtgaaggagg cacaaacttc ttttctgttg aggttactgt 210781 tatttgggtg tttctattat tcacaaccaa acataatttt aatgactacc tcagcagtag 210841 atagttagga ctttagccaa taggcttttc tttctgctga gagaaaggca aatggtagaa 210901 tctcaacccc atgatgatgc attgctgtga ggttattgtg agagggagta tttttaacca 210961 gtcttcccca gattttctac ctggtgtcca taaacccctt ggattctaaa gatggtcttc 211021 agagtgtcca agaactacct gaaatttgat gtgacatttg tgaagctgtg catatatgca 211081 tttttcaggg tctaagcctg catcagatta acgaagcggc cctagaagac acccaggaat 211141 cccaatgata taaatacagt ggggacctgg tgcagtggtg cacacctgta atcccagcag 211201 tttcggaggc aaagatggga ggattgtttg agatcaggag ttcgagacca acctgggcaa 211261 catagcaaga ccccatatct aaaaaaaaaa aaaaattaaa aaaaatacag tagggataaa 211321 agaaaattag gggggaaata cttcagtact tttcaggtca ctattcctaa ggatgctagg 211381 ttttccagat cccagggtcc tttctcactg gaagacccag ggacttctgg atttgaagac 211441 tcatgcagca taccacatcc tttcccctcc caggtctcag gggtccctgg actcccaatt 211501 cttttttttt tttttttttg agacagagtt ttgctcttgt cgcccaggct agagtgcagt 211561 ggcatgatct tggctcactg caacctctgc ctcctcggtt caagcaattc tcctgcctcg 211621 aagccttctg agtagctggg attatggaca cccgccacca cgcccagcaa atttagtatt 211681 tttagtagag atggcgtttc accatgttgg caggctggtc tcgaactcct cacctcaggt 211741 gatccgcccc cattggcctc ccaaagtgct gggattacag gtatgagcca ccgcacccgg 211801 cctggacccc caattctatt tctcctcctt gagtgctgtg atttctcctc tccagccaac 211861 aatgcattgg aaagtgtgtt ccataggctg ggtgtagtga ctcacacctg aaatcctagc 211921 acttttagag gctaaggcgg gggatcgctt gagcccaaga gttcaaaacc agcctgggca 211981 acatagacag tcctcgtaac tacaaaaaat acaaaaaaaa aaaaaaaaat taaccaggtg 212041 tggtggtgag cacctatagt ctcagctact tggaaggctg aggtgggagg atcgcttgag 212101 cctaggaggt tgaggctgca gtaaactgtg ctcacactac tgcactccag catgggagac 212161 agactgagac tctgtctcaa aaaaaaaaaa aaaaaaaaaa aaggaaagtg tgtttctgaa 212221 tgctgccccc accacgagca tggaattgtc acctggatag cctcggtacc ctctaaactg 212281 gtggcatttt tcttttacag ttttttttta gtctctttac ctatagggac agctgcttaa 212341 aaaaatccag atggcctgga gcctcctgat cccttgccaa aaaccagagg aagttaagat 212401 cagagcaaaa ccagtgcaaa ctggaagagg tgacttccag ttaccttaag atcatttaca 212461 cattgttata aggctaaagg tccctcccct aaaagaagat ggccagggtt ttgtgtccat 212521 gcgatgtagg aagaagcatg ctggggacag cgcctgcaca tatggaaccc tgccctgggc 212581 ctgcttacct atctcccttc ccctccctgg actctaaaat cgccctgcct cccatccctg 212641 gcaagcagac gcctgcagag gtgagctccc cttctccatt ctttggccac tgataacacc 212701 tgattgcctt tttcaatcgg acattctttc tttctttttt tttcccccca aaatgtttaa 212761 gagacagggt cttgctctgt tgcccaggcc ggagtgcagt ggcacatcct cagctcactg 212821 cagcctggaa ctccctggct caagcaatcc tcctgcctca gcttcctgag tggctagaat 212881 tacaggcgtg tgccaccata cccagctaat ttttttttta gttttgtaga agtggagtct 212941 ggctacgttg ctcaggctgg tgtgcagtgg catgatctca gctcactgta gcctggaact 213001 ccgtctgctc aagtgaaccc gagcagctgg gactacaggc gcgtgccact attgtatttt 213061 gtagagatgg ggtctcactg tgttgcctag gctagtatgg aactcctggg gtcaagagat 213121 cctcccgtgc aggcctctca aagctctggg attacaggcg agagctacca cgccctgccc 213181 aacctttggg tttttatttt tttaatttcg tttatagaga tggcgggggt ctcactacat 213241 tcaccaggct ggtctcaaag tcttggcctc cagcaatcct cccgccctgg cctcccaaag 213301 cgctgggatt acaggcatga gctatcgtgc ccggccagcc attctttctc tgcagccgat 213361 ataaagtagg aaagaacaca atttaccggt gaccgaatgt tctggatttc cagggccttc 213421 cttcaggtcc gcagatgccc ctccatccgg agtgggcctt gcccggggtt ctgttgccga 213481 gtccagattc gcagctgtct tttcctctag agtcaggaga atttctggat ttgcgggcgc 213541 cttctcccct gtagaaatgg tgacctcaag gcttctaggt cgcatctttc ccgagggcag 213601 gtacacttcg aagggcctgc actccttctg ccccggggcg ccccccgcca gcccctgcag 213661 cctccccgcg gagctggcgt ttctgcgcag ccggacctcg gcctggcccc cctctagcgc 213721 cctgcagggg ccggggcttc tcccgcccgg cagggccggg ctccgggtcc gaggcttgcc 213781 ctgcgcgtcc aggccctccg aggccttctc tctgcgtttg ctcaggggct tcctcgacag 213841 ccccctcccg gcctcgggct ggctgcaccg caggctggca gctccgcccc cgtacggccg 213901 agggccgttc ccctcgttcc cctcggggtg gtctggagtc ttcaggctcc tgggcttgtt 213961 ctcccccagg gagctggacg ctgcggaatc atctgtgccg ttttcttgtg tggaatattc 214021 tggaaggaca accagatgca aaatgatgaa gctgtcccac gtttagggcc caagattcag 214081 ggcagaggag agagaatccc cttggatatt aaagtttaga aattgaggaa aacaacgggc 214141 cgggcgcggt ggctcatgcc tgtattcccg gcactctggg aggccgaggc gggaggatca 214201 cctgagatca ggagttcaag accaacaggg tgaaaccccg tctctactaa aaatacaaaa 214261 ataagctgag cgtggtggcg catgcctgta atcccagctt ctccagaggc caaggtggga 214321 ggatcatttg aggtcaggag ttcgagacca gcctggccaa catggcaaaa ccccgttcct 214381 actaaaaata agaaactcag ccagacgcgg tggtgcgcac ctgtagtccc agctactcgg 214441 gaggctgagg catgagaatc gcttgagccc gggatgtgga ggttgcagtg agctgagatc 214501 gggccactgc actccagcct gagtgacaga gcaagactcc atctcaaaaa caaaagttca 214561 ggacttccta gactagacaa tggcagtcaa ttaccattca gtaagaacat gcttgatgtt 214621 gctatgtatt tctctctctc ttattttatt ttttgagaca ggatcttgct ctgttgccct 214681 gatgtgatct cagctcactg caacctctgc ctcccaggct caagcaatcc tctcacctca 214741 gcctgttttc tgttttttca atagctgaaa atacaggtgt gtgctaccac gcctggctca 214801 tttttttgta tttttggtag agacagggtt tcaccatgtt gcccaagctg gtctcaaact 214861 cctgagctct agcaatccac ccaccttagc ctcccaaagt gctgagatta tttataagtg 214921 tgagccacca tgcctagcct atatttctca aatagtactt ttcaaacgtt ttaagcagga 214981 caactgtttc aaataagatc ttacacaggt cagcaaaacc cagattatac gctttaaaaa 215041 taaaaataag ccgggcacgg tggcctatgt ctgaaatccc agcactttgg gaggctgagg 215101 cgggaggatc acttgaggtc aggagttcga gaccagcctg gccaacatgg tgaacagcac 215161 tgagaagatt ctataccaac cccgttcata tgattgcata gcaatccctt tttgctaacc 215221 tagagatgtt tgtctgacat gagtattaat cataaatgta gtgaagaagt ctcaaagaac 215281 aggtgttcca gctcctgcct ttcgtgaccc tgacctgctt cttaaaaaac ccattaggag 215341 cctgaaggaa gtttactaac ccagttccaa aggccagagt gaagaaaggg acgttcctga 215401 actaaagtca tctggatttt ggtagacctg agactcccaa tccccaggtc agagtgagct 215461 gctctgagct cctggtcccc tttcccacaa agcagccagc actcagcact ggatgaggag 215521 gaggcctggg cccgcttacc ctgaatggct gccctgtgga gctcctcggc cagcaggcgc 215581 tggttgatgg cccgcaggac ctgcagggtg agctgcacgg cgtactcttc cccatagtag 215641 gtgaccagca gagtggccat cttcaccggc ctggctctct ggatctggct ccgggggatc 215701 ctggagtgct ccttctgcac actggtgttc tgcagcttga acttgaactt ctcgaagtca 215761 tagggcacca gctcctccag ggtggacagc agatggtcac taggggtctt agccatggtg 215821 ctgagcagga gaggctcgag ccagctgtct ggcttctggt aggaaaagaa gcctctgtcc 215881 ttggtgagca agaaaaggca ggttgtgaaa tagcggaaaa ggcacaggaa atgctctgtg 215941 tcttggtggg aactgagact aagggtaagg gtgtgtccat gccggctgtg ttctcttctt 216001 acaggttcag aggattttcc agctgggagg gctccatgcc ttggcagaca tgtgccccca 216061 cacccctccc aaaccctgga cctcgcctcc acactagacc acagacagat gggcaagtct 216121 gcaagggaag gtctgggatt ggattccaga caccccctcc aatcttcctt cctgccagga 216181 tctggggctg gcagagcggg tgagtgggaa cagagaggac tatgctgagg gcccgccctg 216241 ctggtcaact gttctcctcc agactcgggg tttcagggac cttccagagc caatggcaca 216301 ggacggggga aggggtgggc actgaataca gtcctgtgag ctttgccttt cctggcccaa 216361 gaaggcaagt gaggccagaa gggcactgcc aggagataac tcttgccatg ccatcacccc 216421 tgttgatccg gttccagttg actggttctt ccagccacat gaaggacttg aaaagcaatg 216481 cctggaccac cagcctgact catctgttcc cagagctgac tcagtcctcg ctccctggag 216541 ccagaatgct cttccgtgca cattcattga acattctaga cagtgtgcca agatgctgga 216601 gctacacggg tgaaaaggca cgggtgtcct tcagctcata tgtggtgcag ggccccagag 216661 gtggctttga agctaacaga agttacatgt tgggggcagg gcacagtggc taacgcctgt 216721 aatcccagca ctttgggagg ccgaggtggg tggatcacct caggttggga gtacaagacc 216781 agcctggcca acacagtaaa accccatctc tactaaaaat acaaaaatta gccaggcatg 216841 gtggcacgca cctgtactcc cagctactca ggaggctgag gcaggagaat cacttgaaca 216901 tgggaggcag atgttgcagt gagccgaggt tgcgccattg tactccacct tgggcgacag 216961 agcgggactt tgtctcaaaa aaaataaata aatgaaaaat tacatgtagg gtgaactgac 217021 tcctcactta cctacttctg accgttgcat accctaattt ttttattttt agatagggtc 217081 ttgctcagtc actcaggctg gagtgcagta gtgggatctt ggctcactgc aacttctgcc 217141 tcctgggctt gagcaaccct ccaacctcag cctcccaagt agctgagact acaggtgcat 217201 tccaccacat ctggccaatt tttgtatttt ttatagagac agggtctcac tatgttgcgc 217261 aggctggtct tgaactcctg atctcaagcg atccactcgc tttggcctcc caaagtgctg 217321 ggattacagg tgtgagccac tgcgcccggc cagaagtttt gttaataatg caatttaccc 217381 ttcaccagca cttcatcggc accagccacc actagcttta taaacacttc ctcctagaat 217441 ctctgcagca ccctgcaggg ctgcgtttat catccccctg tgcagccaag aatctgtagc 217501 ttagtgactt gcctagagcc aagatccaaa cgtcaaacca cttcacaaca tcgtctcctg 217561 catgacaata atttgtgatt ttactgttaa tgactaccca tggatgtcac atttccagct 217621 gaccactgag gtccaggatg ggtcacttcc acaggtcagt cacaacattg cgacgcctag 217681 ctgaacaccc agctctgggt gtgagctcct ctgtgtgtgc ccagcctggg attcaaggct 217741 ttgatgagtt ctggttggtt gaatttaact tactgtccag ctggtccacc tgtttcctat 217801 ttcacagagt atttggtgtt tttgcattgt tttcatagct gacaacaaac atgcagtttg 217861 ctagatcttt ctgagcccaa acccaacatc ttccctaaaa aaagggtgaa ttgggctggg 217921 tgtagtggct cacacctgca atcccagccc ttgggaggct gagacaggag gattacttaa 217981 gcccaagagt ttgagattag cctgggcaac ataaggagac cctgcctcta caaaaaatag 218041 aaaaaatatt agctgggtgt ggtggcacac actggtggtc cctgctgcta ctcaggaggc 218101 tgaggtggga agattgcttg agcctaggtt caggctgcaa tgagctgtga ttgcaccact 218161 gctctccagc ctgggtgaca cagcaagaca ccctgtctca aaaaaaaaaa aaaaaaaaaa 218221 aaaaaaaagt ggatgaaatg cacattgatg gtgcagacag cagatttttg gaatttttag 218281 gctgggccag aagaagggtg ttgaatgggg tatcactggg gccagcaccc gtccctgact 218341 ctggacggtg gcaacccagc acctcccagg cagtctccca ctaccagatg gagaaaggat 218401 gagactgtgc agtgacgctg tgggtgctgt gtgatcccat ctcgtctggt cagaaaactc 218461 ctcggagctg ataccaaaac cccatgacag aattgggcag ggaatgatac tttctggtct 218521 aaagcaggat attttgcaat tgcggtgaca cgttttgcaa cagaggctgc tctgaggcag 218581 gacttctgga aggcagtgtt ttgatgccac tttggacttt ctagggcaag gcaggctcac 218641 aaagcttttg tttggagggg ttccagtaat ttctcacagt gaaaatagaa ttattgccct 218701 tctaacgtca tttaagtgtt taataatcaa agtggaatgg tcatgggctg tcttgtgtgc 218761 ccctgccatt ggatgtgtga ccttatttgg ggtgacatgg caaatctaaa cccaacaaag 218821 ctgctgcaaa gggtctactc tcggctatta ctcctgggaa atgtgcggct cagacaccac 218881 gaccttcact ccactcctgg tcaccagggg tttcactttc tatggcattt acttctgcac 218941 attttgggat ttaaaaaaat gaagattgta ttactttcgt aataagcaaa aaagatagtt 219001 ttaaaaagtc tttcagacca ggtgtggtgg cgcacgcctg taatttcaac actttggaag 219061 gccgagacag gtggatcact tgaggtcagg agttccagac cagcctggcc aacacggcaa 219121 aacctcgtct ctaatgaaaa tacaaaaatt agccaggtga ggtggcccgc gcctgtaatc 219181 ccaggtactc cggaggctga ggtagaactg cttgaacctg ggaggtggag gtttcagtga 219241 gtcgagattg catcactgca ctccaggttg ggcaacagag caagattccg tatttaaaaa 219301 aaaaaaaagt ctttcacaat tagcttaact tctaagggag tatcagtccc gtgcccctaa 219361 accatcatga taaaggaggt atatagtatt tacttattta tttacttgat ttattttgag 219421 acagagtctc actcttatca cctgggctgg agtgcagtgg caagatcatg gcccactgca 219481 acctccgcct cccgggttca agcgattctc ctgcctctgc ctcccgagta gctggcatta 219541 caggtgcccg ccattacacc cagctaattt tttgtatttt tagtagagac ggggtttcac 219601 catgttggcc aggctggtct tgaactcctg accttgtgat tcgcccgcct cagcctccca 219661 aagttctggg attacaggcg tgcgccaccg tgaccagcca actatttttt catagatgat 219721 gaaatgtagg cttacaggta gggtgaccaa cacattttgt ctaggagcaa aacccaattt 219781 gcccagtctt agcactgaaa gcctagttct gggaacgacc acagtccggg gaaattggga 219841 caattgccca ccctgtttag atgggttaat tgacatactg aacacatgtc tagctaggaa 219901 gaggcaccac agggccattc atttactctt ggagcacctg ctgtatgcca gatgctattc 219961 taggtactac tgatacagaa ctgaacagaa aaaaacttct tgcagtgctc acattctttt 220021 ttttctgata taaatgccct tcagaagagg ccacattctt tttttttttt cgagatggaa 220081 tttcactcag tcgcccaggc tggagggcag tagcgcaatc tcgccactgc aacctctgcc 220141 tccagtgttt gagcgatcct cctgcctcag cgtcccaagt agttgggatt ataggtaccg 220201 acaaccacgc ctggctaatt tttgtatttt tagtagagac ggggtttcac catgttggcc 220261 aggctggtct taaactcccg gcctcaagtg atttacccac ctcggcctcc caaagtgctg 220321 ggattacagg cgtgagctgc tgcgcctggc tcagaagatc ccgcattttt ttttttttga 220381 cggagtctag ctatattgcc caggctggag tgcagtggtg ctatctcagc tcactgcaag 220441 ctccacctcc cgggttcacg ccatcctcct gcctcagcct cccgagtagc tggaactaca 220501 ggcgcctgcc accacgcccg gctaattttt gtatttttag tagagacggg gtttcaccgt 220561 gttagccagg atggtctcga tctcctgacc ttgtgatctg cccacctcgg cctcccaaag 220621 tgctgggatt acaggcatga gccactgcgc ccagccaaga gcccacattc ttataggaaa 220681 gtgtgaattg atataaacaa acaaatacac aatgtatcag ggagtgataa gttgggggaa 220741 aaaacagcag aataataggg ttagtgagtg caggggattg gtttcatatt tttttatttt 220801 tatttattta tttagagaca gagtctcact ctgtcaccca ggctggagtg cagtggtgct 220861 atcttggctc actgcaactt ctgcctcccg ggttcaagca attctcctgc ctcagcctcc 220921 caagtagttg ggattacagg ctcacaccac catgccaggc taatttttgt atttttagta 220981 aagacggggt ttcaccatgt tggccaggcc agtctcaaac aactgacctc aagtgatcca 221041 accgccttgg cctcccaaag tgctgtgatt acaggcatga gccaccgtgc cccctggttt 221101 catattttaa tagggcactt gaagaaggct tttgtgcttg aacagagacc caaaggaaga 221161 ggagtaagcc aggtattcat tttgaggatg agcagtcttt gcctatttac agagtgcaaa 221221 aggggtaact ttgtgcttag tgtttgagga agagtggtgg ctggagcaga ggcacggagg 221281 aggagtggat ggaggggtta gagagggagc tggagtcccc tatcaagtag cagggctccc 221341 aggacacagc aaggacttgg ctcttactct gagatgaggg gacactggaa agtttgagca 221401 gagcagtgac ataattggag ttacgcttta aaagaaacat tctgggccgg gcgcggtggc 221461 tcacacatat aatccagcac tttgggaagc caaggaaggt ggatcacctg aggtcagaag 221521 ttcgagataa gcctggccaa catggtgaaa gcccgtctct actgaaaata caaaaatttg 221581 ctgagtgtgg tggtgatgtg cgcgtagtcc cagctactct ggaggctgag gtgaatcacc 221641 tgaacctggg aggcggaggt tgcagtgacc tgtgatcgca ccactgtact ctggcctggg 221701 tgaaagagtg agactgtttc aaaaaaagaa agaaaagaaa agagaacact ctggcttcca 221761 taagagacaa gagatgagaa aggtggaaac agagaggaga aagcaatcta ttgtcacaat 221821 gcaggaaagc gactgcaggg ccctgaacca ctgtggaggc agaagggtca gattctgggt 221881 gattctgaaa gcagggctga catgataagg gtagactgga tactggcggg tgtgaagatg 221941 catctaagat acctgtttct ggccagattt tacaaagggc ccttgttgct accttaagga 222001 gttagtgttg aatcctgcag cagctaagag cttctgcagg atctccctct caacttggac 222061 caacccacct gccgatccca gcccagatca atgagatctg ctgcccagtc ccctacgcag 222121 ataacactaa ttgtaactga tctttctgag ggcacctcct gagcagcttg gaaacggagg 222181 gagaggagag gaagccagct tcccaaagtg gggtgtggca ggtggcctta accctgctga 222241 agggctgcag tcatggggct ctgaatagat ccccacacca ctcaccaatg agcttcattt 222301 ggaaggcggt aacgttaaac aagagggttt gtaaaaaata aaggcctaca cacataattt 222361 tttctgatta ttaaatctag gtggagagta cgtgagtatt catgttactc tctcaacttt 222421 cttgtgctta aaaatcttaa aataaaaact ttaatacgct ctgtcctacc aatatttata 222481 ggaacctgac tagcagccag gctttggaca cgcaatcccc ggccaccaaa gggctgacag 222541 gctagtgaaa ggggcagaaa aataacaaaa ctgtgtctgg aaggcagcta ctcccttttt 222601 ttgtttgttt gttttttgag acgcagtctc actctgtcgc ccatgctgga gtgcagtggc 222661 gcgatcttag ctcactacat gtgcctccgg ggttcaagcg attctcctgc ctcagcctcc 222721 tgagtagctg ggactacagg cgcgcgccac cacgcccggc taatttttgt attttttagt 222781 agagacaggg tttcaccata ttggtcaggc tggtctcgaa ctcctgacct cgcgatccga 222841 ccgcctcggc ctcccaaagt ggtgggatta caggcgtgag ccaccgcgcc cggccctttg 222901 ttttcttacc aactaagcag cagcactgac actgtttcct ttgaggttcc gtttgctgaa 222961 tcctgccctc taaggtgctc cgcccttcca ctccccgccc cgtgctgctg tcattcccac 223021 gcctcaccct gttttgcaac ttcttgttta tgagcctgga agccacgaag tggtgggaaa 223081 tacgagattc aaatatctgg tggccgtgtg acgtgagtaa attactcgat ttttctcagt 223141 ttccatttcc tatcaaatgg gttgttctgg gaattcgggg aagtaacagt gccgggaacc 223201 gcgggaggaa ccgccagctg cgtcctcgga cgtccccaaa gcccagggcg gcgactggca 223261 cgactgtcag gggcgcgtct gttaagagga cagggggtcc cgccccggca cagccgtctc 223321 ctccaggacc cctcccgccg acgccccacg gaccccacgc ccgagcggag accggcgcga 223381 gtccggggtc tccggtccgg cagcccctcc ctggcccggc gccccaaagg gaagcggcct 223441 gggggaggag acgtgtggaa agagcggcag aagataaaga ggaagtagcg gtagagaata 223501 aaaaggaact gtccgcagtg tgcccggacg cggggaggcg ctggggtcaa gcgagacccg 223561 acctgcacgc aattcccgcc ggggtcccgg cctgcctgcg ggagggaaag gacccagggc 223621 ggcttctgcc aaaagtgcgg cttctgccga aagtgcgtct tctgccgagg ggcccacata 223681 aggcgcgaca gacactccag cccgaccccg acgccccgcc ctcacgcatg cacatgcgcg 223741 gaacactccc agaaacacat ctcccagagc gccccgggag cacgcgagcc aattggaggg 223801 cggattagcg ggggcccacg tctcccagag gtctccgagt cgcggctctg ttggtctgat 223861 tggcagccgc aacagcctat agctgctttt cgccggaggg gccacgcgcc gtttgccggg 223921 actgagccgc tgttgtcgct ggtatcccgg gagcagcgcc ggcaagtgga gtcgtcgtat 223981 tccgggcggc ccgcggccca cggggatggg acgtcccggg gctgtgctga ccccaaaacc 224041 cttccacact ttattagcct gtcgcccttc tttcatgttt acagcaaata ttttccgagt 224101 gcctaatgtg tcccacttac tgtgccacgt gcgagggtgc acgtgtcagg caagatcgct 224161 gttccaagga ggctgagaac aagtaaatgg taattaccag ttatgaccaa agctgtgaaa 224221 ggaacagata catcagccta acagcagcac tgccgttttc tcccttataa tcagcagagg 224281 cgggctagag caaggattct caaactctgc cccctcagaa ccctagagta cggttgctga 224341 aacttctgtt ttttcctttt gaacaaatct agtaacatgc tgtgagccag agaggtaatg 224401 gtttggctca aagaaggcac atgcatctgg tttctttctt cttttttgag acggagtctc 224461 gctctgtcgt ccaggctgga gtgcagtggc gccatctccg ctcactgcaa gctccgcctc 224521 gcgagttcac gccagtctcc tgcctcagcc ttccgagtag ctgggattac aggcgcccgc 224581 caccacgccc ggctaatttt cacgcccggc taattttttg tatttttagt agagacgggg 224641 tttcaccatg ttagccagga tggtctcgct ctcctgacct cgtgatccgc ccgcttcggc 224701 ctcccaaagt gctgggatta caggcgtgag ccaccgcgcc cggcccaggt ttcttatttc 224761 aggagcttca taggcataga cctccctcct tccctcctac cccctccccc ctctcccaac 224821 taatgccaga ccggccacta ctttttaact gttttacata tttaaccttc aagaaagatt 224881 tcctttgatc caagtgtatt gcagcccaca aaaaagcttg gaaacccagc tagatggtct 224941 gtttttctag ttcctacatt ggatttaagt cctacccatc gagactactg tcttcaggta 225001 agcatatcga ggctgtattc accttcatgg tgttctcagg ggcactttct tacacaaatg 225061 atggtaatca tcctagatca tttccctgcc tgccacactt gccagttagt tagaaactgt 225121 aaacccctta aatgtctgtc accagggcac tggctgaata gctgagtaag ttttggtgaa 225181 tccattcctt ggactacact gcatccactg aaaacagtgc ggtgaatttc tgtgtgctaa 225241 tgtgcaaaga tggtttatgt attattcagt ggaaaaagca catttgcact acagtgcaga 225301 tagcatgatc ccatgtttgt aaataagaga aaaaaatgtt aaaatttaag tatctggagg 225361 gaaatgcaag aaataaatat gactaccttt caagagtaga attgggggta gaagttggcc 225421 ttttccctct atgtcatcta tacgttttat ttttttattt tattattatt tttttgagac 225481 ggagtctcac gctgctgccc aggctggagt gcagtggcat gatctgggct tactgcgacc 225541 tccgcctccc gggttcaagc aattttctgt ctcagcctcc tgagtagctg ggattacaag 225601 cgcacgccac cacgcccggc taatttttgt atttttagtt agagatgggg tttcaccatg 225661 ttggtcaggc tggtctcgaa cttctgacct tgtgttctgc ccgcctcagc ctcccaaagt 225721 gctcagatta caggcgtgaa ccagtatgcc cggcgttcaa tgagcatttt taaagtttta 225781 tgactcatga aacatagtcc acaaatatat gctcattgtg aaatagtaac ggaaaataaa 225841 atatggcaag ctaaatgtta gaggagaaat ttcttatttt gctccattac ttttacagca 225901 gaaaaattaa aaaatttatt agctgagact ttgtttcact ggggtgcaga agattaaaca 225961 aatgattttt aaaagcagac tctgcccata gagaaaaaat attcgaggtt tccaaagtag 226021 aataagttta aattagcaag tcaaataagg aaccttgagc cagtaccgaa ggaaacacgt 226081 cccagaccta tctccaaaca cacacacaca cacacacaca cacacacact gcaggcataa 226141 catttatgca gacagactgt tctttagggc ataattgtct catgtctcaa ttcctctgaa 226201 gctccagagc agaaactgaa attttaaaaa tgcagttgag ataaactgaa atgactgaca 226261 ggccatttgc caagaaaagc tttggttctt tggaacaaag acagtgccca tatcccagcc 226321 aacatcaggt ggtatttctc tttctgtcct gtagagggca tcttcatgtg tgggcaccag 226381 ggctgcggct ggaaccacag tggacacccc acgctccaag gacaatcctc tgcagaaacc 226441 agcagctgga aacaattctt aatgctgcag gaagtagtta ggagaaaacg actgtgtaag 226501 tcagagctaa aactggagtt caaaaaccac agaaaatatt tccaaggctg tctgtgaagt 226561 ggggatgtca aaataccaaa aattacagct ttggaactgc agtggatgac ctttaattga 226621 tgcagtttat aggtgctaag tattgtgtgt gctttacgtt gcctcatgtg agtgcttcat 226681 agcagggtgt gacgctggcc aagtgatata cctaccctct ttgtccctgt ttctatcaac 226741 tgtaaaatgg gggttctaga gttgttgtga agattaaatg agctaataca cacaaagcac 226801 caagaacaat gctggataca aggtaatagg tgtttcttcc tagacttcac tatcctcatg 226861 tgaggtgggt atgtttattt atttatttgg ggacagggtc tcgctctatc acccagactg 226921 gagtgcagtg ttgcaatcat ggcttagctc cctacagctt tttaaattta attatttatt 226981 attattattt ttttgagatg gagtctcact ctgtcaccca ggctggagtg cagtggcacg 227041 atctctgctt actgcaagct gggcctcccg gattcacacc attctcctgc ctcagcctcc 227101 tgagtagctg ggactacagg cgcctgccac cacgcccagc atatttttgt atttttagta 227161 gaggcggggt ttcaccgtgt tagtcaggat ggtctcgatc tcctgacctt gtgatctgcc 227221 cacctcggcc tcccaaagtg ctgggattac aggcgtgagc caccgtgcct ggcctcagct 227281 ccctacagct ttgacctccc cagctcaagt aatgtcccac ttcagcctcc cgagtagctg 227341 acactacagg aatgcaccat catgtatacc tgtggtccta gctacttggg aagctgaggt 227401 gggaagattg cccctgaaat aagactctag atacaggtcc ctttctctga aatttttgaa 227461 acaaaacgcc taattcaaaa aaagttattt ttggtagaga cggggtctca ctatgtttcc 227521 caggctgatc tcaaacttgg gttcaagcca tcctccctca ttggcctccc aaagtgttgg 227581 gattataggc aggagccact gtgcctgtcc aaggtgggtg ttgatatatt caatttaaca 227641 cagaaaacag gaagtttgta cacactgact tgcccaaggc tacacaacta caaagagtca 227701 ggtagccaac attcagtctc aaggtctgtg tctgcagagc tcaaggcctt tcctttggct 227761 ttgctgtctc ttggcatctg gtagcttagc agtcatcaag ccaaccccgt catgttacag 227821 aggaggacac aggcttccag gggaactgac tgacttgagt tctacacatc agtgcagagc 227881 tgggactaga attctgtcca cctaactgta tttggaagaa agttaagtgc tattgactac 227941 acctaatatc tgaaagataa agggagaggc agaagcaaga atgaataatg tggctgggcg 228001 cggtggctca cgcctttaat cccagcactt tgagaggccg aggcaggtgg atcacgagat 228061 caggagttca agaccagcct gaccaacatg gtgaaactcc atttctacta aaaatacaaa 228121 aattagccgg gcatggtagc gtgcgcctgt aaccccagct actaagtagg ctgaggcggg 228181 aggatcgatt gaagccggga ggtggaggtt gcagtgagcc gagatcgcac cattgcactc 228241 cagcctgggc aacagagcga gactctgtca aaaagaaaaa aaaaagaatg aataattttt 228301 aaagcaggct gtgaagaact aaaaacatga ggacacggaa attcagaaac acccatttca 228361 atatgattcc tttcggataa cttgaagccc aaacaggatg gaaaatcaca ggccaaagtc 228421 acctgaagga aggctgtcac ccatgacgga caaccgggat gttgctctcc attcatccca 228481 acggcagcaa agtctgtcct ggaacatgga atggtttctt tatacagaca actaagactg 228541 gctccaagag aagagttgct accactaacc cctacatgct gccttgaaag agttggtggt 228601 aggtgttgga aaattaatct gagtaattag gaaacaaaat ttgaaacaaa gaatcccatg 228661 cacaatagta gcaaaaccca taaccagaca tgataagaaa tttgcaagac ctataagaag 228721 aaaagtatat aggattacag aattttaaaa gtcccagatc attgaagaaa tacaccacag 228781 tcgtggatgg aagagttaat attacaaata tatcatgcat tcctaaatta atcaacaaat 228841 tataagcaaa cccaattgaa attgtaacga gatttttttt ttacattttt aaaaaaatgt 228901 gatcttgcta tgttgcctga gctggtctca aacttctaag ctcatgcgat cttcctgcct 228961 tggcctctca aaatgctggc attacaagtg tgagccacca tgcccagcca taacaagact 229021 ttcttttttt tttcagactg ggtctcactg ttgcccaggc tggagtgcag tggcgtgatc 229081 tcaggtcact gcaacctctg cctcccaggt tcaagtgatt ctcctgcctc agcctcccga 229141 gtagctggga ttacaggtgc gccccaccat gcccagctaa tttttgtatt tttagtagag 229201 atgaggtttc gccatgttgg ccaggctgtt ctgcccacct caccctccca aagtgctggg 229261 attacaggca tcagccaccg caccaggctg caagatgttt aagaatgggc tttgacaagt 229321 tgatttttaa aattcatagc caaaaaaatt gtgaaaaaaa tacagtggtg tgagatgtat 229381 tgtaacagat ataacaacaa gctgtgaaaa tacagtaaat tagaacagta atatgttttt 229441 acaagaatgg atataaggta aatggaaaca agattccgta agacgtgtgt atatgcaaaa 229501 gatggtattt aaaacccaag gggaaaaggt ggcccaactt ataaatagtt ttgggactgt 229561 tgacagttga gctggaaaac atagacttct gcctgacata tataaaaaat aaactccata 229621 tatatatata tatatatata tatatatata tatatttttt tttttttttg agacggaatg 229681 tcgctcttgt ttcccaggct ggagtgcaat ggtgtgatct gggctcattg caacctctgc 229741 ctcccaggtt caagtgattc tcctgcctca gcctcccaag tagctgggat tacaggcatg 229801 tgccaccaca cccggctaat tttttgtatt tttagtggag atgggttttc accatgttgc 229861 ccagactggt ctcgaactcc tgacctcgag tgatccaccc gcctcggcct cccaaaatgc 229921 tgggattaca accatgagcc accgcacctg gcccagatga tttaaagaaa cacgaagtgt 229981 aatatctgta aatttgttta aatttcgcag tgaggaagat ctttcttttt ttttattttt 230041 tattttttta ttgatcattc ttgggtgttt ctcgcagagg gggacttggc agggccatag 230101 gacaacagtg gagggaaggt cagcagacaa acaagtgaac aaaggtctct ggttttccta 230161 ggcagaggac cctgcggcct tctgcagtgt ttgtgtcctt gggtacttga gattagggag 230221 tggtgatgac tcttaacgag catgctgcct tcaagcatct gtttaacaaa gcacatcttg 230281 cactgccctt aatccattta accctgagtg gacacagcac atgtttcaga gagcacaggg 230341 ttgggggtaa ggtcacagat caacagtatc ccaaggcaga agaatttttc ttagtacaga 230401 acaaaatgaa aagtctccca tgtctacttt tttctacaca gacacagcaa ccatctgatt 230461 tctcaatctt ttccccacct ttcccccttt tctattccac aaaaccgcca ttgtcatcat 230521 ggccccttct caatgagctg ttgggtaccc ctcccagacg gggtggtggc tgggcaaagg 230581 ggctcctcac ttcccagaag gggcggccgg gcaaaggcgg ccccccacca cccggacggg 230641 gcggttgggc gggcggaggc accccccacc tccctcccgg acggggcggc tggccgggca 230701 ggggctgacc ccccacctct ctcccggacg gggcggctgg ccgggcaggg gctgaccccc 230761 cacctccctc ccggacgggg cggctggccg ggcgggggct gacagatctt tctaaccaag 230821 ataaaaaaac tgaaaaagtt aaaaaattca ggtaaataga aaaattagaa aacttcagta 230881 tgacaaaaac agccatgaat atggttaaaa ggctggatgc agtggctcat gcctgtaatc 230941 cctgcacttt gggaggccaa ggtgggagga ttgcttgagc ctaggggttt gagtctccac 231001 aatgtaggca gaccccatct ctaccctccc tcacaaaatt aactggccat ggtggtgtgc 231061 gcctgtggtc ccagctactc aagaggctaa gttggaaaga tcgcttgagc ctgggaggtc 231121 aaggctgcag tgagctgtga ttgcaccact gcactccagt ctgggtgatg gagcaaaacc 231181 ctgtcttaaa aaataaaaat aaaaaaggtg acagagaaaa tatttgtaca tacgataata 231241 cctggaatgt taaagcacta taattttttt aatgctcaaa ggcttaaata ggcaactcac 231301 ggccgggcgt ggtggctcac acctgtaatc ccagcacttt gggaggctga ggcgggtgga 231361 tcacgaggtc aggagatcga gaccatcctg gctaacacag tgaaaccctg tctctacaga 231421 ctacaaaaaa ttagccaggc atggtggcac gtgcctgtag tctcagctac tcgcgaggct 231481 gaggcagaag aattgcttga acccaggagg cggagattgc agtgagctga gatcctgaca 231541 cgcactacag cctgggcaat agactgagac tccatctcaa aaaaaaaaaa aaaaaaggca 231601 actcagagaa gaaatacaaa gtaccacgtg aaaagatgcc caatctccaa gtaatcagaa 231661 acatgacagg caacaagggg atattatttt tttttaaata gcggcagagt cttgttatgt 231721 tacccaggct ggtttagaac tcctgggctc aagcagtcct cctgccttgg ccttccaaaa 231781 gtgctgggat tacaggccct aagccaccac acccggctga ggggacataa tttttcaccc 231841 actatattat cacaaattaa tatgattgcc acagccaact ttaatgagca tgcagacata 231901 gaagcacttt cacacctcag gttacgagtt aaatttgtga agcttttttt tttttttttt 231961 tttttttgtg atggagtctc actcttttac ccaggctgga atgcagtgat gcaatctcgg 232021 ctccctgcaa cctccacctc tgaggttcaa gcgattctcc cacgtcagcc tcctgagtag 232081 ctgggattac aggcacccgc caccatgccc ggctaatttt tgtattttta gtagagacgg 232141 ggtttcaccg cgttggccag gctggtgtca aacccctgac tttaggtgat ccgcccgcct 232201 cggcctccca aagtgctagg attacaggtg tcagccactg ctcctggcct gtgaagctgt 232261 tttggaaggt gatttgctgc aagttttcaa aattaaaaat gcacaagcct aggccaggca 232321 cggtggctca tgcctgtaat cacagtactt tgggaggccg aggtgagcag atcacttcag 232381 cccaggagtt tcagaccagc ctgaccaaca aagtgagacc tcgttgctac aagaaataaa 232441 aaattgggtg gcacatgcct gtggtcccag ctactctgaa gggtgaagtg ggaggactgc 232501 ttgagcccag gaggtcaagg atgcagtgag ccaagattat gccactgcac tccagcctgg 232561 gtgacagagg gagaacatat ctcaaaaaaa aaaaaaaaat gcacaaggct tttgcttcag 232621 tgattccatt tctaagacta tatccaatgt acccaaatat gcacgtctga gatgttcatt 232681 ggagtattgg ttgttaaagg aaacagctgg agacaatcta aatggggaga cagtcagtac 232741 agctgtggta tatccatagt atgcatgctc tacagttgtt aagagtggca atggcctatt 232801 catctggacc tagatagatg tcaaaggcac actgctaata tttaaaaaga agatctggaa 232861 aatatgtata gcatggtgta tacatataat tagcccatat aggtacatga agttcctcag 232921 gagactggga ggatgcacag ccaggtaaac ctgtagagca tggagacatt aggtgggagg 232981 tgatggaaaa gtccctcagt tttttttttt tttgagacag tctcgctctg ttgcccaggc 233041 tggagtgcag tggcatgatc tcagctcact gcaaccttca cttcctaggt tcaagcgatt 233101 ctcctacctc aacctaccaa gtagctggga ttacaggcgc ccgccaccac gctcagctaa 233161 ttttggtagt tttagtagag atggggtttc accatgttgg ccaggctggt cttgaactcc 233221 tgacctcaag tgatcctccc cgccttggcc tcccgaagtg ctgggattac aggtgtgaac 233281 cactgtgccc agaccttcat tttgtttgta tacatctgta ttcattgaat cttctacaag 233341 gtttatttat tcatgtatgc ctattgttcc ggttatctat tgttgcatga aaaacacccc 233401 aaaacttaat agtgtaaaac agcagcccct ttattatgct cacaacttgg tggtcagatt 233461 tcaagcaaga cccagaggca caactcatct cgttccacag tggctggagc ttcagctgag 233521 gtggctcaaa tggctggaga acacttccct gcagcagaac ctacaaccat cttcctcccc 233581 tttctggaaa aggtttgttt ctgaggctgg agtcagagtc ttcaaaccaa cagccagaga 233641 taacattgct aattatgccc ctttccttgg tctttatttg aaatgaaacg gttctgttac 233701 cggtagaggg tcttgatgca aattgtccag gttcttggca ttttgaacaa agaactggac 233761 aaaacgcaca gcaaagcaag gtaagaatga agcagcaaaa gcagagattt attgaaagta 233821 catgccacag tgtgggagcc ggccaagcag cagcttaaac accccctaga ggtttcccat 233881 tggccacttg gtgttcaccc catgtaaatg aaccaacctc agtcagtctg attggttgca 233941 accaatcaga ggctgaagtg aagttacaaa gttacactcc tgtgcaaacg tctgattgca 234001 aaaagcagcc agtcagaggt accttcaatt tcccatctgc ccctcagaaa aggtgggcat 234061 tggcaaaggg agtagcctct ggtccttttg ttacttatgc gtggaaagtt ggggtttttc 234121 tttcgattta gttctaggaa gtcaaggtga actggccttc agttccctgc ctccagaccc 234181 tattctgcct cagttcaaat acacaaaacc taacaaatac ctatatgacc accttccaca 234241 actaatcaat ggcagtgtct attagtttga ctttgtttct attttacaag aagcaaaagc 234301 acctctggac tctccactag agagctatgt gcagccactc cagcatggtg ctctcagggt 234361 aaccagactt cttacactag ctccaagtgc aagttctcct agcaaacaag atagaggttg 234421 caaggccttt tatgacctca ccctggaggt cacaccacat ctgtatttca ttgtatttca 234481 ttggtagcag tcacaagctg cccagataca aggcagaggg acagatgccc cacttcttga 234541 tgggatgagt ggcaaaaaaa aatctgtgac catgtttaaa acccagaaga tttggctgtt 234601 agagatttgt gaagttttta aaagttaaga attattaaaa caaaaataaa acccaggaga 234661 caaggtagtg gaagtgtgtc cctcccattc actgaccaag gtgaggggac tctagtcctt 234721 tgctgctgac cttcagcctc ccttcctctt aaaaagaggt cctagttggg gagcgatggc 234781 tgaagagtta ccataaaata ccaagtagga gcgaggtgcg gtggctcacg tctttaatcc 234841 cagcagtttg ggaggacaag gtgggaggat cgcatgagcc caggagttca agacaaacct 234901 gggcaacata gcaaggctct gtctctacaa aaaaggaaaa gagtacaaag tagggccatt 234961 tagtatggtt tcatctttat taaaagtctt aagttttaga tttttgaagt tatcaccaat 235021 aaaatatcca gttgtggtct tgtgttgagt gaaaatgtta tttaaatgca gatatttaaa 235081 atatactaca atttttgtgc tattttaatt ttctaggagc cttaaaaaat gtgtggctac 235141 tgaaactgtg ccccaaagag ttaaagaaat cagtaactaa cagaaattct tgagtctgca 235201 ggatggcaga taagaaacaa ctcgctggcc aggtgcagtg gctcacacct gtaatcccag 235261 cactttgagg gattgaccca ctgaagtggg tggattacat gaggtcagga gttcgaggcc 235321 tggcaggttg gtgggcacct gtagtcccag ctgcttggga ggctgaggca ggaaaatcac 235381 ttgaacccgg gaatcggaag ttgcagtgag ccgagatcag gctattgcac tccagcctgg 235441 gcgttgcagc gaaactccta ctcaaaaaaa aaaaaaaaaa aaaaaaaaaa aaacaacttg 235501 ctgaaatgct gaagaaatgc tgaaactccc tccctataag ataaaagaag aacaagcaga 235561 aatctgttgg aaccaataac gctgactgga gtctgcacag aatgagcttg ctgacatcac 235621 agcctgaatt ttcaacacag gtttcatgct gtccctgaat ttgcatgaga cccattaagt 235681 agcttgaaga ggtaactgcg aaagcccaag gactttccac atctcccctt tccttccact 235741 aataatcacc tactaatctc agaatccacc gcctgaacct tttctaataa aaataactgc 235801 cttaaagcca acacggggag acagccttga gcttgatatt cctgtctcct tgtgagtcga 235861 cttgcaatac aaagcttttc ttttttcaga accacagtgt catagtatta gcttctaggg 235921 tatcaggctg gaatgcactg gtgcgattat ggctcactgt agcctcaatc tcctggactc 235981 atgcagtcct cccacctcag cctcccaagc agctaggacc acaggcactt gccaccacgc 236041 ctggctaatt tttgtttttt ttgtagaaat gggatctcac tatgttgtcc aggctggtcc 236101 cgtgctcttg ggctcaagtg atccttctgc ctatgcctcc cgaagtgcta ggaaaagcct 236161 cccaaactgg aaaatatggg atggcttccc agttgttaaa acaccctgat ttgggagctg 236221 attgcttata aacactaaga aaaagctaag agctgagttc gccttcccat tctgttctca 236281 ggtttctctc cttatagaga ggttctgatc atctacaagc tttgctaaat ctggggtgtt 236341 tctctactga tttcctaaat gtttgtagtc ttttttgtaa aattgatgat tcaattttat 236401 tctcctctga agtaacaata gatgcagagt ctgacatttc ttcagctttg aagaatctgc 236461 attttctctc ctgatctcaa aacctgtaag aaaaagaaat tgcagatatc accctgtcca 236521 aggagaaaga ggggggaaaa taagagttgt attaaagtag tagagagaaa acagaaactg 236581 ctaaatagaa tcccaaagcc ttttattcca agtaaggctg aatttttctt ttctttcttc 236641 ccgtttcttt ccagtgaaca ctcacctctc ctggagtcaa gtcattgttt ctttcctttg 236701 gaatcctgct attcctgcac ccatggctct tctctaggct ccagtgggaa gtacaggtca 236761 gggtttggga ctagatatac tggtcataag aaatacaggg aacatgatag tctgggtcac 236821 actgattttt cctgagttca agccgatttc caggacagat ggcctctcca tatttatttc 236881 cttctcaatt aggattattt cacctggagc ctctaacagg cagctggctt tttttgagac 236941 agggtcttgc agtgtctccc aggctggact gcagtggcac aatctcaact cactgcagtc 237001 ttgacctccc aggctcaagg gatcctccca ccttagcctc tggagtagct gggaccacag 237061 gcatgtgcta ccatgccctg ataattaaaa aaaaaaactc tgtagagaga agggctctct 237121 ctgttgccca ggctggtctt gaactcctgg ctcaagcaat cctcttgcct cggcctccca 237181 aagtgctagg attacaggtg tgagccactg tgcccagcat aacaggcagt ttttcaacat 237241 tagaacaaaa tctaggagta cgaaaggcat actggaaaaa taataaacaa acattagaac 237301 aaaaacaaaa tagggggaag atccttttgt gctcaattca aagctattct cctccaatct 237361 ctgggaagag gagtgaatta attctgctag tcaatgcact gatttagttt tacctcattt 237421 tttgaatatt ttgataataa aattatctaa gaaccaaaat atcaagttaa tcttcctaaa 237481 attctgacta tattaaaact gagaactatt ctcttattga atccaagaga agagagaggt 237541 aaatgtggta gttaaactat gtcctggatt aatttcagtt aagagacttt caaataagaa 237601 ttagtatctg aactggtttg gttggcaacc agcaaaaaca atctttacat gtttgtttta 237661 gctgaagcct agaagtagat tgtcttgata ttaaccaaaa caaatatcta agagtgtttc 237721 catcaatcca gacaattagt aaaatggtct ccagtgtggt ctcccctgtg tcctgccatg 237781 cctcccctca ctgacaaaag tttgggttgc accctctttc caagttcttt cctctccctc 237841 ctcatcttga cagtttactg gtctgaaaaa aaaaataggt tggtaaatag tgatctaata 237901 tgcaaataga ggaatatgtc taaatattaa cattatcaaa caatattagt aagacagttt 237961 ggttaatgtt agtccaaagg aaaatttaaa ctccagaaca tttaatcaaa atataagtgg 238021 cgtaaaatgt ctcattttat cttttcatga ggaaaggtaa tatattccat attaataaaa 238081 ataaaatgca tttcaaatga aatgccattt aaatttgcta ttttgtttat gattggattt 238141 caaagctagg agtaggatca aagagcatgt ggtacaggta aaagaaaacc ctaaatccag 238201 cgctttggga ggccgaggcg ggtggatcac ctgaggtcag gagttcaaga ccagcctggc 238261 cgacatggca aaactccatc tctactaaaa atacaaaaat tagccaggca tggtggcgtg 238321 tgcctgtaat cccagctacc cgggaggctg aggcaggaga atcgctggaa cttgggcaga 238381 ggctgcagtg agccttgact gcgccactgt actccagcct gggcgacaga gcaaggctct 238441 gtctcagaaa aaaagaaaac cctaagtcat agatcaaagg aatgctgtag tcaagtaaac 238501 gaggggagag accaaagctg gggggagtgg caagtagcaa aggcttagag tggcagaagc 238561 tcccggaggc tgaatgtacc ctaagatgaa agacaggaaa agcatcgatt taaaatgacg 238621 gctatgttgc ccaggctggt ctcaaaactc ctgggctgaa gcaatcctcc tgcctcagca 238681 tccctaatag gtggggctac agttgcatgt caccacaacc agttaatttt taaaattttt 238741 tttctgtaga gacgaggtct tgccatgttg cccaggctgg tctcaaaatc ctggcctcaa 238801 gtgattcttc cacctcagcc tcccaaagtg ctgggattac aggcgtgagc cacagtacgt 238861 ggcctagtct tgaaagtctt aacactgtag atttcaccaa atgtcacttt gaaaaccagg 238921 cacagataac tggccccttg agtacaaagg aatctgtaag agctagttag ggaatgaaag 238981 aagtcccaga gacagagtct agtacccatg agacctaaaa ccaggagtgg taagggttct 239041 ttgtcgaaga ttcaaattta tttttctctg caacagtcac gaagtcacct gcagctgagg 239101 atgagaaaac acagttaagt cctcacttaa catcactaat acggtaagtc ctcacttaac 239161 atcattgata gggtcttgga aactgggact taaagggaaa agacatataa tgaaactaat 239221 ttatttttgt catcaacctt ataacaaaat gtgctttcac ttaagataac agtttccaac 239281 ctatctagga cattgaggac ttacagtgtt aagaaagtag aatctgtact ggtttcaaag 239341 gcgcttactt ctcttcacta cttacttttc accaaagccc taaagtagca gttcttagtc 239401 tttttggcgg tttttagaac ttgatgaaag ctatgaaccc acttactaga acatacagat 239461 acacacacac acacacacac acacacacac acacacacac acacacatcc aattttgcat 239521 acagtatcaa aaggtttaca aaaatctcca aacccatcca tggatc // LOCUS HSAJ1589 1722 bp RNA PRI 04-NOV-1997 DEFINITION Homo sapiens mRNA for fork head protein. ACCESSION AJ001589 NID g2597916 KEYWORDS AF6q21 gene; fork head protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1722) AUTHORS Bernard,O.A. TITLE Direct Submission JOURNAL Submitted (17-SEP-1997) Bernard O.A., u301, Inserm, 27, rue J Dodu, Paris/75010, FRANCE REFERENCE 2 (bases 1 to 1722) AUTHORS Hillion,J., Leconiat,M., Jonveaux,P., Berger,R. and Bernard,O. TITLE AF6q21, a novel partner of the MLL gene in t(6;11)(q21;q23), defines a forkhead transcriptional factor subfamily JOURNAL Unpublished FEATURES Location/Qualifiers source 1..1722 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /cell_line="k562" /cell_type="hematopoietic" /map="q21" gene 80..1225 /gene="AF6q21" CDS 80..1225 /gene="AF6q21" /codon_start=1 /product="fork head protein" /db_xref="PID:e1169810" /db_xref="PID:g2597917" /translation="MAEAPASPAPLSPLEVELDPEFEPQSRPRSCTWPLQRPELQASP AKPSGETAADSMIPEEEDDEDDEDGGGRAGSAMAIGGGGGSGTLGSGLLLEDSARVLA PGGQDPGSGPATAAGGLSGGTQALLQPQQPLPPPQPGAAGGSGQPRKCSSRRNWGKPV YSDLITRAIESSPDKRLTLSQIYEWMVRCVPYFKDKGDSNSSAGWKNSIRHNLSLHSR FMRVQNEGTGKSSWWIINLMGEERKTPRRRAVTMDNSNKYTKSRGRAAKKAALQTAPE SADDSPSQLSKWAWQPHVNAAVMSWMRGRTSVHAPILTPAQSVAACRPSWQVTELDEV QDDDAPLSAHALQHVSQPVTFSKQACTVELRRLTEMAGTMNLNDGAD" BASE COUNT 357 a 571 c 519 g 275 t ORIGIN 1 cgtccctccc ccgatgcacc ccgccccggc gcgagaggag agcgcgagag ccccagccgc 61 gggcgggcgg gcggcgaaga tggcagaggc accggcttcc ccggccccgc tctctccgct 121 cgaagtggag ctggacccgg agttcgagcc ccagagccgt ccgcgatcct gtacgtggcc 181 cctgcaaagg ccggagctcc aagcgagccc tgccaagccc tcgggggaga cggccgccga 241 ctccatgatc cccgaggagg aggacgatga agacgacgag gacggcgggg gacgggccgg 301 ctcggccatg gcgatcggcg gcggcggcgg gagcggcacg ctgggctccg ggctgctcct 361 tgaggactcg gcccgggtgc tggcacccgg agggcaagac cccgggtctg ggccagccac 421 cgcggcgggc gggctgagcg ggggtacaca ggcgctgctg cagcctcagc aaccgctgcc 481 accgccgcag ccgggggcgg ctgggggctc cgggcagccg aggaaatgtt cgtcgcggcg 541 gaactggggg aaacctgtct actcggacct gatcacccgc gccatcgaga gctccccgga 601 caaacggctc actctgtccc agatctacga gtggatggtg cgttgcgtgc cctacttcaa 661 ggataagggc gacagcaaca gctctgccgg ctggaagaac tccatccggc acaacctgtc 721 actgcatagt cgattcatgc gggtccagaa tgagggaact ggcaagagct cttggtggat 781 catcaacctg atgggggaag agcggaaaac gccccggcgg cgggctgtca ccatggacaa 841 tagcaacaag tataccaaga gccgtggccg cgcagccaag aaggcagccc tgcagacagc 901 ccccgaatca gctgacgaca gtccctccca gctctccaag tgggcctggc agccccacgt 961 caacgcagca gtgatgagct ggatgcgtgg acggacttcc gttcacgcac caattctaac 1021 gccagcacag tcagtggccg cctgtcgccc atcatggcaa gtcacagagt tggatgaagt 1081 ccaggacgat gatgcgcctc tctctgccca tgctctacag cacgtcagcc agcctgtcac 1141 cttcagtaag caagcgtgca cggtggaact gcgacggctg actgaaatgg caggcaccat 1201 gaatctgaat gatggggctg actgaaaacc tcatggacga cctgctggaa acatcacgtc 1261 ccgccatccc agccatcgcc actggggact catgcagcga gtctagcttc ccgtatacca 1321 ccaaggctcg ggcctgagct ccccaaccag ctcctttaac agcacgtgtt tggaccttca 1381 tctctgaact ccctacgcca ggtctccatg cagaccatcc aagagaaaag cagctacctt 1441 ctcttccatg tcacactatg gtaaccagac atccagacct gctcaattcg gactcactta 1501 gccaacagcg atgtcatgat gacacagttc gaccccttga gtgtctcagg ccagcaccgc 1561 tgtgtctgcc cagtcccgcc ggaacgtgac tgtcgcaatg atccgatgat gtcctttgct 1621 gcccagccta accagggaag tttggtcaat cagaacttgc tccaccacca gcaccaaacc 1681 cagggcgctc ttggtggcag ccgtgccttg tcgaattcgc cc // LOCUS HSAJ1612 839 bp RNA PRI 15-SEP-1997 DEFINITION Homo sapiens mRNA for L-3-phosphoserine-phosphatase homologue. ACCESSION AJ001612 NID g2407908 KEYWORDS CO9 gene; phosphoserine phosphatase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 839) AUTHORS Planitzer,S.A. TITLE Direct Submission JOURNAL Submitted (12-SEP-1997) Planitzer S.A., TR-BY1, Boehringer Mannheim GmbH, Nonnenwald 2, 82377 Penzberg, GERMANY REFERENCE 2 (bases 1 to 839) AUTHORS Planitzer,S.A., Machl,A.W., Rueckels,M. and Kubbies,M. TITLE Identification of a cDNA encoding a novel protein partially homologue to human L-3-phosphoserine phosphatase predominantly expressed in Fanconi anemia fibroblasts JOURNAL Unpublished FEATURES Location/Qualifiers source 1..839 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="diploid fibroblasts" /chromosome="7" /map="q11.2" /dev_stage="fetal" gene 162..810 /gene="CO9" CDS 162..380 /gene="CO9" /codon_start=1 /product="L-3-phosphoserine-phosphatase homologue" /db_xref="PID:e347847" /db_xref="PID:g2407909" /translation="MASASCSPGGALASPEPGRKILPRMISHSELRKLFYSADAVCFD VDSTVISEEGIGCFHWIWRKCDQATSQG" polyA_signal 805..810 /gene="CO9" BASE COUNT 230 a 174 c 194 g 241 t ORIGIN 1 aagccacagg ctccctggct ggcgtcagct aaagtggctg ttgggtgtcc gcaggcttct 61 gcctggccgc cgccgcctat aagctaccag gaggagcttt acgacttccc gtcctgcggg 121 aagtggcggg cacgatcgca aggtagcgca gaagcttctc aatggccagc gccagctgca 181 gccccggcgg cgcactcgcc tcacctgagc ctgggaggaa aattcttcca aggatgatct 241 cccactcaga gctgaggaag cttttctact cagcagatgc tgtgtgtttt gatgttgaca 301 gcacggtcat cagtgaagaa ggaatcggat gctttcattg gatttggagg aaatgtgatc 361 aggcaacaag tcaaggataa cgccaaatgg tatatcactg attttgtaga gctgctggga 421 gaaccggaag aataacatcc attgtcatac agctccaaac aacttcagat gaatttttac 481 aagttacaca gattgatact gtttgcttac aattgcctat tacaacttgc tataaaaagt 541 tggtacagat gatctgcact gtcaagtaaa ctacagttag gaatcctcaa agattggttt 601 gtttgttttt aactgtagtt ccagtattat atgatcacta tcgatttcct ggagagtttt 661 gtaatctgaa ttctttatgt atattcctag ctatatttca tacaaagtgt tttaagagtg 721 gagagtcaat taaacacctt tactcttagg aatatagatt cggcagcctt cagtgaatat 781 tggttttttt ccctttggta tgtcaataaa agtttatcca tgtgtcagaa aaaaaaaaa // LOCUS HSAJ2030 1874 bp RNA PRI 27-OCT-1997 DEFINITION Homo sapiens mRNA for putative progesterone binding protein. ACCESSION AJ002030 NID g2570006 KEYWORDS progesterone binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1874) AUTHORS Gerdes,D. TITLE Direct Submission JOURNAL Submitted (20-OCT-1997) Gerdes D., Institute of Clinical Pharmacology Mannheim, University of Heidelberg, Theodor Kutzer Ufer 1, Mannheim, 68167, GERMANY REFERENCE 2 (bases 1 to 1874) AUTHORS Gerdes,D. TITLE Cloning and tissue expression of two putative steroid membrane receptors JOURNAL Unpublished FEATURES Location/Qualifiers source 1..1874 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /clone="dg6" /clone_lib="pCMVsport" CDS 7..678 /note="putative" /codon_start=1 /product="progresterone binding protein" /db_xref="PID:e1154367" /db_xref="PID:g2570007" /translation="MAAGDGDVKLGTLGSGSESSNDGGSESPGDAGAAAEGGGWAAAA LALLTGGGEMLLNVALVALVLLGAYRLWVRWGRRGLGAGAGAGEESPATSLPRMKKRD FSLEQLRQYDGSRNPRILLAVNGKVFDVTKGSKFYGPAGPYGIFAGRDASRGLATFCL DKDALRDEYDDLSDLNAVQMESVREWEMQFKEKYDYVGRLLKPGEEPSEYTDEEDTKD HNKQD" BASE COUNT 542 a 371 c 459 g 502 t ORIGIN 1 ccggtgatgg cggctggtga tggggacgtg aagctaggca ccctggggag tggcagcgag 61 agcagcaacg acggcggcag cgagagtcca ggcgacgcgg gagcggcagc ggaaggggga 121 ggctgggcgg cggcggcgtt ggcgcttctg acggggggcg gggaaatgct gctgaacgtg 181 gcgctggtgg ctctggtgct gctgggggcc taccggctgt gggtgcgctg ggggcggcgg 241 ggtctggggg ccggggccgg ggcgggcgag gagagccccg ccacctctct gcctcgcatg 301 aagaagcggg acttcagctt ggagcagctg cgccagtacg acggctcccg caacccgcgc 361 atcctgctcg cggtcaatgg gaaagtcttc gacgtgacca aaggcagcaa gttctacggc 421 ccggcgggtc catatggaat atttgctggt agggatgcct ccagaggact ggccacattt 481 tgcctagata aagatgcact tagagatgaa tatgatgatc tctcagattt gaatgcagta 541 caaatggaga gtgttcgaga atgggaaatg cagtttaaag aaaaatatga ttatgtaggc 601 agactcctaa aaccaggaga agaaccatca gaatatacag atgaagaaga taccaaggat 661 cacaataaac aggattgaac tttgtaaaca accaaagtca ggggccttca gaactgcaat 721 tcttactccc tttcacagac tgtccggagt ctttgggttt gattcacctg ctgcgaaaaa 781 cattcaacaa attgtgtaca agataaatta atctcactat gaagatttga ataactagac 841 attatttatg ctgccaaact catttgttgc agttgtttgt aatgtctagt ggggcttcat 901 catcctgaaa agaaggagac agggattttt ttaaagagca agaaagtcac aatattactt 961 ctttccttcc ttttttcctt ctttcctttc ttctttctct ttctttcttt ttaaaatata 1021 ttgaagacaa ccagatatgt atttgctact caagtgtaca gatctcctca agaaacatca 1081 agggactcct gtgtcacata ctgtgttttt attttaacat gggtgaggga ggcgacctga 1141 tcaggggagg tgggggtaca catcaatttg agttgttcag gctactgaaa cattaaaatg 1201 tgaattccca aacttttctt tttggctttg tcagggaaaa gaaaaatatc tttataaaga 1261 aatctttgga aattaggaga aggaatttca ggtgggttta agtcagagct agttccccaa 1321 cagaaagatc atttgaaacc agtttttatc ccttctcttt ccttcccttt ccctaaatca 1381 aatcaatatt aattgtgcct tatttcactt aacatagact tgaattattt ttagggaaag 1441 cccctataat gaattcagaa atcactacaa gcagcattaa gactgaagtt ggaatattct 1501 gttgaccata aaaccttgat atcattctgt gtatatagaa tgtaaaagga atattacagt 1561 gttaactgcc atatatgtaa tatacacaaa ctcaattagc attgtaatgg ccaaatgcat 1621 tcccccatgc ttttctgttt tcaaaaaaat tgaaaaacaa atcaactctt atccccaaca 1681 gctgcctaat tttaggagtc tgaccctcca catctcactg gtgtgggtgc atggggctgt 1741 ggagtgggtg tcagtatgga tgtgtctgaa tgtgtgaggc cttggaaggg actctttctg 1801 cagatactgt aaatacaagt accattttaa taaagcatgt acaataaacc aaaaaaaaaa 1861 aaaaaaaaaa aaaa // LOCUS HSAJ2078 768 bp RNA PRI 16-DEC-1997 DEFINITION Homo sapiens mRNA for syntaxin 6, complete CDS. ACCESSION AJ002078 NID g2695736 KEYWORDS syntaxin 6. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 768) AUTHORS Nabokina,S., Lazo,P.A. and Mollinedo,F. TITLE Identification of variant isoforms and pattern of expression of syntaxin genes in human neutrophils JOURNAL Unpublished REFERENCE 2 (bases 1 to 768) AUTHORS Mollinedo,F. TITLE Direct Submission JOURNAL Submitted (16-OCT-1997) Mollinedo F., Inst. de Biologia y Genetica Molecular, Facultad Medicina, CSIC-Universidad de Valladolid, C/Ramon y Cajal, E-47005, SPAIN FEATURES Location/Qualifiers source 1..768 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="neutrophils" CDS 1..768 /codon_start=1 /product="syntaxin 6" /db_xref="PID:e1216708" /db_xref="PID:g2695737" /translation="MSMEDPFFVVKGEVQKAVNTAQGLFQRWTELLQDPSTATREEID WTTNELRNNLRSIEWDLEDLDETISIVEANPRKFNLDATELSIRKAFITSTRQVVRDM KDQMSTSSVQALAERKNRQALLGDSGSQNWSTGTTDKYGRLDRELQRANSHFIEEQQA QQQLIVEQQDEQLELVSGSIGVLKNMSQRIGGELEEQAVMLEDFSHELESTQSRLDNV MKKLAKVSHMTSDRRQWCAIAILFAVLLVVLILFLVL" BASE COUNT 228 a 167 c 211 g 162 t ORIGIN 1 atgtccatgg aggacccctt ctttgtggtg aaaggagagg tacagaaagc agtcaacact 61 gcccagggat tgtttcagag atggacagag ctcctccagg acccctccac agcaacaagg 121 gaagaaatcg actggaccac caacgagctg agaaataacc tccggagcat agagtgggat 181 ctagaggacc ttgatgaaac catcagcata gttgaagcaa atcctagaaa atttaacctt 241 gatgcaactg aattgagtat aagaaaagcc ttcattacaa gtactcggca agttgtcagg 301 gacatgaaag atcagatgtc aacttcatct gtgcaggcat tagctgaaag aaaaaataga 361 caggcactgc tgggagacag tggcagccag aactggagca ctggaacaac agataaatat 421 gggcgtctgg accgagagct ccagagagcc aattctcatt tcattgagga gcagcaggca 481 cagcagcagt tgatcgtgga acagcaggat gagcagttgg agctggtctc tggcagcatc 541 ggggtgctga agaacatgtc ccagcgcatc ggaggggagc tggaggaaca ggcagttatg 601 ttggaagatt tctctcacga attggagagc actcagtccc ggctggacaa tgtgatgaag 661 aaacttgcaa aagtatctca tatgaccagt gatcggcgcc aatggtgtgc catagccatc 721 ctctttgcag tcctgttggt tgtgctcatc ctcttcctag tgctgtga // LOCUS HSAJ2425 2095 bp RNA PRI 11-DEC-1997 DEFINITION Homo sapiens mRNA for p65 protein. ACCESSION AJ002425 NID g2570012 KEYWORDS oncofetal protein; p65 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2095) AUTHORS Hanausek,M., Szemraj,J., Adams,A.K. and Walaszek,Z. TITLE The oncofetal protein p65: a new member of the steroid/thyroid receptor superfamily JOURNAL Cancer Detect. Prev. 20 (2), 94-102 (1996) MEDLINE 96253399 REFERENCE 2 (bases 1 to 2095) AUTHORS Hanausek,M.E. TITLE Direct Submission JOURNAL Submitted (23-OCT-1997) Hanausek M.E., Carcinogenesis, University of Texas M.D. Anderson Cancer Center, P.O. Box 389, Smithville, Texas, 78957, USA FEATURES Location/Qualifiers source 1..2095 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="MCF-7" /cell_type="adenocarcinoma" /tissue_type="breast" gene 61..2063 /gene="p65" CDS 61..1905 /gene="p65" /codon_start=1 /product="protein" /db_xref="PID:e1215340" /db_xref="PID:g2687718" /translation="MGPENESPVGGEMWGRVCVFSARQKRPRCSTLNSLLVARTCYNM LALVDWIEQDFKSTWRKAHVSLRGFLDFELGRRKEVAGAFLGGDTRPDPKKPRGGSKK NVEVYDDDVGSQAADSPGKRMAPKGTFRDKDKFEGLFKLGALVAKKALTPAFSCSPNR GSPLHAHYGDEILYKVESGPVNICEGGKRGVGIHPPDNYGDTLDENLGLPQKIVIKVK PQTEEANTWLRQDLKNHNSAKEAGGSDEIKTFVTGCKKDGHSGRKNMTTHDRNSKKWQ RVNLSLMASLQLDSRGGRAGPRRGARRLCLVCEDYASCSNTCVWSCEAYKVFFRRSQS FTDPAIFTNDCNISKNRSKSCPACLLRCLHPSINEIRKDKRAALKVRDNVGEEVDMTG PSWTCLKLLFSDGEKVIPRLGHELPGIKGGRQAKQQSHRGSPIPKNRKGWPPGHVLSN DGGAGGRVWKKKSCKPIRREGPKWWDRLNESTPLFWGSRANKSLGKGGTRGRIFIKHP HLFKFAADPQDKHWLAEQHHMRATGGKMAYLLIEEDIGQHHGQGFPVMLLKISHIRHM VGGVAHCLYDMKEKKFVLPSWKVEKLGKYVETLRTEKEHRAAEASPQT" polyA_signal 2057..2063 /gene="p65" /evidence=experimental BASE COUNT 574 a 481 c 607 g 433 t ORIGIN 1 ggggcttatt aacggtggta tctggtagag cccccataaa gtaccaccgt ggggaagaca 61 atgggacccg aaaacgagag ccccgtgggg ggggagatgt gggggagagt gtgtgtgttc 121 agcgctagac aaaagagacc gcgatgcagc actttaaata gtcttctcgt tgctagaaca 181 tgctataaca tgctggcgct ggtggattgg atcgagcagg atttcaagag cacttggcga 241 aaggctcacg tttccctccg cgggttcctc gatttcgagc tggggagacg aaaagaggtt 301 gccggggctt ttctgggtgg ggataccaga cctgacccca aaaagccgcg gggggggtcc 361 aaaaagaacg tggaggtgta tgatgatgac gtaggctctc aggctgcgga cagccccggg 421 aaacgcatgg ccccgaaagg gacatttaga gataaggaca aatttgaagg gctgtttaag 481 ctcggggcgc tggtggcaaa aaaagccttg accccagcat tttcttgttc ccccaacagg 541 gggtcgcctc tacacgccca ttatggggat gaaatcctct acaaggttga atccgggccc 601 gtcaacattt gcgagggggg caaaagaggc gtggggatcc accccccaga taactacggc 661 gatacccttg atgaaaatct tggccttccc caaaaaattg tgattaaggt gaagccccaa 721 accgaggaag ccaacacttg gttaaggcag gatctgaaaa atcataacag tgcaaaggag 781 gccgggggct ccgacgagat taaaaccttt gtgacaggat gtaaaaaaga tgggcatagt 841 gggcgtaaaa atatgaccac acatgacaga aattcaaaaa agtggcaacg ggtaaacttg 901 tcccttatgg cctccctcca gctagattct aggggcggac gcgcggggcc ccggcgcgga 961 gcgcggcgcc tgtgcctggt gtgtgaggac tatgccagct gttcaaacac ctgtgtctgg 1021 tcctgtgaag cctacaaggt cttctttcgc cgaagtcaaa gtttcacaga tccagccatt 1081 ttcacaaacg attgcaacat ctctaagaat aggtctaagt cttgcccagc ttgcctcctc 1141 cgttgcctgc accctagcat taatgagatc cgaaaagaca agcgagcagc gctgaaagtg 1201 cgagacaacg ttggtgaaga ggtggatatg accggtccta gctggacctg cctgaagcta 1261 cttttttcag atggggaaaa agtgataccc agattgggcc atgaactccc agggatcaag 1321 gggggccggc aggcaaaaca gcagtcccac cgaggaagcc ccatccccaa aaacaggaaa 1381 ggttggcccc ccggacatgt cctgtcaaat gacggcggag ctggtggcag ggtatggaaa 1441 aaaaaatcct gtaaaccaat tcgccgagaa ggccccaagt ggtgggatcg gctgaatgaa 1501 tctacacctt tgttttgggg gtctcgagcc aacaagagtt tagggaaggg aggcaccagg 1561 gggaggattt tcatcaagca cccacacctc tttaagtttg cagcagatcc tcaggacaag 1621 cactggctgg ctgagcagca tcatatgcgg gcaacaggag gaaagatggc gtaccttctc 1681 attgaggaag acatcgggca gcatcatggc caggggttcc cagttatgct tctcaagatt 1741 agccatatta ggcacatggt tgggggagtg gctcattgct tgtacgacat gaaagaaaag 1801 aagtttgttc tgccatcctg gaaggttgag aagttgggga aatacgtgga gacactacgg 1861 acagaaaaag agcatcgtgc tgctgaagca agtccccaga cctgactttc ccggcccggc 1921 tgaggccatc atggggatgc ggtctagttg gctcttagca gcatcaagct gtacatgagc 1981 tagtttgtag tgactcactg cagagccccc cagactggct tgtggttctg tttctaaagt 2041 tattggaata agaagcaata atacaagttt gtaatttaaa aaaaaaaaaa aaaaa // LOCUS HSAJ97 3859 bp DNA PRI 08-JAN-1998 DEFINITION Homo sapiens mRNA for EYA1B gene. ACCESSION AJ000097 NID g2661374 KEYWORDS EYA1B gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3859) AUTHORS Abdelak,S., Kalatzis,V., Heilig,R., Compain,S., Samson,D., Vincent,C., Levi-Acobas,F., Cruaud,C., Le Merrer,M., Mathieu,M., Koenig,R., Vigneron,J., Weissenbach,J., Petit,C. and Weil,D. TITLE Clustering of mutations responsible for Branchio-Oto-Renal (BOR) syndrome in the eyes absent homologous region (eyaHR) of EYA1 JOURNAL Hum. Mol. Genet. 6, 2247-2255 (1997) REFERENCE 2 (bases 1 to 3859) AUTHORS Abdekhak,S., Kalatzis,V., Heilig,R., Compain,S., Samson,D., Vincent,C., Weil,D., Cruaud,C., Sahly,I., Leibovici,M., Bitner-Glindzicz,M., Francis,M., Lacomde,D., Vigneron,J., Charachon,R., Boven,K., Bedbeder,P., Van Regemorter,N., Weissenbach,J. and Petit,C. TITLE A human homologue of the Drosophila eyes absent gene underlies branchio-oto-renal (BOR) syndrome and identifies a novel gene family JOURNAL Nature Genet. 15 (2), 157-164 (1997) MEDLINE 97172972 REFERENCE 3 (bases 1 to 3859) AUTHORS Abdelhak,S. TITLE Direct Submission JOURNAL Submitted (03-SEP-1997) Abdelhak S., Unite de Genetique des Deficits Sensoriels, Institut Pasteur, 25 rue du Docteur Roux, 75015 Paris, FRANCE COMMENT Related sequences Y10260, AJ000098. FEATURES Location/Qualifiers source 1..3859 /organism="Homo sapiens" /db_xref="taxon:9606" allele 1..124 /gene="EYA1C" /citation=[2] /replace="aaaccaataaggttaggacaagagaatagctgtggtttgcgttgcaaa a accaaaaaaaaaaaaaaaaaaaaaaagaaagccccgaggctccatgggcagacctaca a ggctgcgcaaacaaatcgagggatgagattctgctgtttctttgtctagggttctcag a tgctatctgccgctgctgtttggtggggaaggagcgctgggcgcaaagctgttaccaa a cagaacggtgggagctgatggctccgagtttggggcgaggtagaaactctccagtgcc a cttccgactttaagccttcctgttgccgtccactgtggcgggtttcttcctggggaac a cgttttcgctcagtcgctcggcagcccgagcctgcggcagcggccaggcgcctgcccc c tgcgccgagctttcccctgcagaggcgctccactcccagaagcgccgcggctgcacca g agcgcctgagagcccccgcgcgtacccatccaggagcaaaactatgtcaggaatggag g tttgctaacccagaaaattcgaaggaacacattaaactggtggatgcagcagatgtaa g cgctg" gene 1..124 /gene="EYA1C" allele 171..300 /gene="EYA1A" /citation=[2] /replace="ca" gene 171..300 /gene="EYA1A" gene 178..1956 /gene="EYA1B" CDS 178..1956 /gene="EYA1B" /codon_start=1 /db_xref="PID:e1198519" /db_xref="PID:g2661375" /translation="MEMQDLTSPHSRLSGSSESPSGPKLGNSHINSNSMTPNGTEVKT EPMSSSETASTTADGSLNNFSGSAIGSSSFSPRPTHQFSPPQIYPSNRPYPHILPTPS SQTMAAYGQTQFTTGMQQATAYATYPQPGQPYGISSYGALWAGIKTEGGLSQSQSPGQ TGFLSYGTSFSTPQPGQAPYSYQMQGSSFTTSSGIYTGNNSLTNSSGFNSSQQDYPSY PSFGQGQYAQYYNSSPYPAHYMTSSNTSPTTPSTNATYQLQEPPSGITSQAVTDPTAE YSTIHSPSTPIKDSDSDRLRRGSDGKSRGRGRRNNNPSPPPDSDLERVFIWDLDETII VFHSLLTGSYANRYGRDPPTSVSLGLRMEEMIFNLADTHLFFNDLEECDQVHIDDVSS DDNGQDLSTYNFGTDGFPAAATSANLCLATGVRGGVDWMRKLAFRYRRVKEIYNTYKN NVGGLLGPAKREAWLQLRAEIEALTDSWLTLALKALSLIHSRTNCVNILVTTTQLIPA LAKVLLYGLGIVFPIENIYSATKIGKESCFERIIQRFGRKVVYVVIGDGVEEEQGAKK HAMPFWRISSHSDLMALHHALELEYL" BASE COUNT 1153 a 842 c 758 g 1106 t ORIGIN 1 tctccttttt ctcttttggt taaaagaggg cattgtcgtt ctcagccatg tgctctgtat 61 aattaagagc tgacactgaa gcagagtaac aacatcttct aattttttta cccctgatca 121 caggtgcaaa catctcaagc cagttcagat gttgctgttt cctcaagttg caggtctatg 181 gaaatgcagg atctaaccag cccgcatagc cgtctgagtg gtagtagtga atcccccagt 241 ggccccaaac tcggtaactc tcatataaat agtaattcca tgactcccaa tggcaccgaa 301 gttaaaacag agccaatgag cagcagtgaa acagcttcaa cgacagccga cgggtcttta 361 aacaatttct caggttcagc aattgggagc agtagtttca gcccacgacc aactcaccag 421 ttctctccac cacagattta cccttccaac agaccatacc cacatattct ccctacccct 481 tcctcacaaa ctatggctgc atatgggcaa acacagttta ccacaggaat gcaacaagct 541 acagcctatg ccacgtaccc acagccagga cagccgtacg gcatttcctc atatggtgca 601 ttgtgggcag gcatcaagac tgaaggtgga ttgtcacagt ctcagtcacc tggacagaca 661 ggatttctca gctatggcac aagcttcagt acccctcaac ctggacaggc accatacagc 721 taccagatgc aaggtagcag ttttacaaca tcatcaggaa tatatacagg aaataattca 781 ctcacaaatt cctctggatt taatagttca cagcaggact atccgtctta tcccagtttt 841 ggccagggtc agtacgcaca gtattataac agctcaccgt atccagcaca ttatatgacc 901 agcagcaaca ccagcccaac gacaccatcc accaatgcca cttaccagct tcaagaaccg 961 ccatctggca tcaccagcca agcagttaca gatcccacag cagagtacag cacaatccac 1021 agcccatcaa cacccattaa agattcagat tctgatcgat tgcgtcgagg ttcagatggg 1081 aaatcacgtg gacggggccg aagaaacaat aatccttcac ctcccccaga ttctgatctt 1141 gagagagtgt tcatctggga cttggatgag acaatcattg ttttccactc cttgcttact 1201 gggtcctacg ccaacagata tgggagggat ccacccactt cagtttccct tggactgcga 1261 atggaagaaa tgattttcaa cttggcagac acacatttat tttttaatga cttagaagaa 1321 tgtgaccaag tccatataga tgatgtttct tcagatgata acggacagga cctaagcaca 1381 tataactttg gaacagatgg ctttcctgct gcagcaacca gtgctaactt atgtttggca 1441 actggtgtac ggggcggtgt ggactggatg agaaagttgg ccttccgcta cagacgggta 1501 aaagagattt acaacaccta caaaaataat gttggaggtc tgcttggtcc agctaagagg 1561 gaagcctggc tgcagttgag ggccgaaatt gaagccctga ccgactcctg gttgacactg 1621 gccctgaaag cactctcgct cattcactcc cggacaaact gtgtgaatat tttagtaaca 1681 actactcagc tcatcccagc attggcgaaa gtcctgctgt atgggttagg aattgtattt 1741 ccaatagaaa atatttacag tgcaactaaa ataggaaaag aaagctgttt tgagagaata 1801 attcaaaggt ttggaagaaa agtggtgtat gttgttatag gagatggtgt agaagaagaa 1861 caaggagcaa aaaagcacgc gatgcccttc tggaggatct ccagccactc ggacctcatg 1921 gccctgcacc atgccttgga actggagtac ctgtaacagc gctcggcact ttgacagcgc 1981 acagctgctc tgtgaccagg gacagatcca gcaggcccca gtctcgcatc agcgccggcc 2041 tccagaactt agcaatttcc gcctggtgat gcgcagttgc tgtcagtctt gacctctgcc 2101 tttgtggtga atggaggacc acgtctattt catcagaaca gctgttgact ctagtactgt 2161 gaatccagtg aaaataagcc atgagaatgt tttagcacag cgttatgtgt ctgccacatt 2221 aactacacgg ttcaaacctg tgaagaaagg acctgcaaac gcttcagttg ttagcatttt 2281 caatgtgata taaacagctt ctccaataca gcaaacctaa ttgcacaaca gagactgaaa 2341 tgtgtttcct gaataccagt ggaggaattt tcttgtaaag aaggtttact ttttggtgtc 2401 tcatacccag ggtaatctgt acatctctac ttatttatga acagactttt tttaaaaaga 2461 taaaaaaaca gctttattga ggtataattc acccaccaga cttttttaaa catcaaataa 2521 ttgaggagac aatagcatta gaaataagtg attaaaggcc tctgcctcac aacatggcaa 2581 gtacagtact ttgaatttta gcacattgca taatagtttt aagtatgtct aatttaaacg 2641 tataatatgt acatcactga gacaatcatg tacagaaaga atttttggtg taaatttgta 2701 ataatggata attcttttac atattgttta gggaaatgat attgaaaggt agcaatgcct 2761 ggatagtgaa gcatgaggca gcacgtgcac aaattcatgt gccgtgcctt atctgagttt 2821 tcggtataaa tatgtagata atggattttt ttttagataa tgttgtcaag accaaaagca 2881 tggatgtcaa gtgtcagtaa ggattttgtt tctaaaattt tttcctgcat cagttcttct 2941 gagggccttg atgaaataac acagcagttt cttaaacaat ttgaaacaaa atgagctctc 3001 ctaccacctc actttttcat ttccacacta atgtattata tgtaactact tggaaaaaat 3061 aattattcaa atgcttcttc ccacaaagaa tatagatgat agtagatata ttttattaat 3121 aaaatggttc atgaatcgga gactaacaaa gttttcatgt gctcagaatt attaattatc 3181 gtgtctgcat tttctttcga taaaggaaga cacacgatgc taatccggaa atcagcaaac 3241 tttgcattac tccctatgtg cgtattttct ctttcttcct gtcaccctga ggaaggttca 3301 ttgccattgt catcaccatg gaaacaacgt tcctctccac ctgcattatg tactacatga 3361 caggcatcaa tctggggaaa taataaaatt atcacctttg tcagaccata agagtttctc 3421 caaaagtggt cagtttggct gggcaatatt ttctctcatc taacaaacac aatccattgt 3481 catgaaatta cccttaggat gagtcttctt taatcaatca tatattgggc ggaaaaaaca 3541 ccagctttga cccgaagtag ttgaagagct acttcattct tttctgaagt tgtgtgttgc 3601 tgctagaaat agtcatttgt gaattatcca aattgtttaa attcacaatt gaattagttt 3661 tttcttcctt ttggcttgaa gcaaacagtt gaccattttt aaccttttca ttttatgttt 3721 ttgtactctg cagactgaaa agacaaagtt tatcttggcc ttactgtata aaggtatgct 3781 gtgtccaccg ttgtgtacag aatttttctt cattaatttt gtgtttaagt taataaaatt 3841 tatttgtgat gtactgtaa // LOCUS HSAK3 1707 bp RNA PRI 18-JAN-1995 DEFINITION Human AK3 mRNA for adenylate kinase 3. ACCESSION X60673 S41502 NID g28576 KEYWORDS adenylate kinase 3; AK3 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1707) AUTHORS Xu,G. TITLE Direct Submission JOURNAL Submitted (12-JUL-1991) G. Xu, University of Utah, Eccles Institution of Human Genetics, Bldg 533 Suite 6450, Salt Lake City UT 84112, USA REFERENCE 2 (bases 1 to 1707) AUTHORS Xu,G., O'Connell,P., Stevens,J. and White,R. TITLE Characterization of human adenylate kinase 3 (AK3) cDNA and mapping of the AK3 pseudogene to an intron of the NF1 gene JOURNAL Genomics 13 (3), 537-542 (1992) MEDLINE 92347846 COMMENT See also X60674. FEATURES Location/Qualifiers source 1..1707 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /clone_lib="stratagene #935203" mRNA 1..1692 /gene="AK3" /evidence=experimental gene 1..1692 /gene="AK3" CDS 184..855 /gene="AK3" /EC_number="2.7.4.10" /codon_start=1 /product="nucleoside-triphosphate--adenylate kinase" /db_xref="PID:g28577" /db_xref="SWISS-PROT:P27144" /translation="MASKLLRAVILGPPGSGKGTVCQRIAQNFGLQHLSSGHFLRENI KASTEVGEMAKQYIEKSLLVPDHVITRLMMSELENRRGQHWLLDGFPRTLGQAEALDK ICEVDLVISLNIPFETLKDRLSRRWIHPPSGRVYNLDFNPPHVHGIDDVTGEPLVQQE DDKPEAVAARLRQYKDVAKPVIELYKSRGVLHQFSGTETNKIWPYVYTLFSNKITPIQ SKEAY" misc_binding 217..243 /gene="AK3" /bound_moiety="GTP" BASE COUNT 429 a 396 c 422 g 460 t ORIGIN 1 cggcgctggg ctgaggggag gggttgtctt aaaagtctct ccttccccct gtaggggcgg 61 ccggcgagtc ccagtgagag cggagggtgc cagaggtagg gggccgagaa acaaagttcc 121 cggggcttcc tccggggccg cggtcggggc tgcgcgtttg accgcccccc tcctcgcgaa 181 gcaatggctt ccaaactcct gcgcgcggtc atcctcgggc cgcccggctc gggcaagggc 241 accgtgtgcc agaggatcgc ccagaacttt ggtctccagc atctctccag cggccacttc 301 ttgcgggaga acatcaaggc cagcaccgaa gttggtgaga tggcaaagca gtatatagag 361 aaaagtcttt tggttccaga ccatgtgatc acacgcctaa tgatgtccga gttggagaac 421 aggcgtggac agcactggct ccttgatggt tttcctagga cattaggaca agccgaagcc 481 ctggacaaaa tctgtgaagt ggatctagtg atcagtttga atattccatt tgaaacactt 541 aaagatcgtc tcagccgccg ttggattcac cctcctagcg gaagggtata taacctggac 601 ttcaatccac ctcatgtaca tggtattgat gacgtcactg gtgaaccgtt agtccagcag 661 gaggatgata aacccgaagc agttgctgcc aggctaagac agtacaaaga cgtggcaaag 721 ccagtcattg aattatacaa gagccgagga gtgctccacc aattttccgg aacggagacg 781 aacaaaatct ggccctacgt ttacacactt ttctcaaaca agatcacacc tattcagtcc 841 aaagaagcat attgaccctg cccaatggaa gaaccaggaa gatgtggtca ttcattcaat 901 agtgtgtgta gtattggtgc tgtgtccaaa ttagaagcta gctgaggtag cttgcagcat 961 cttttctagt tgaaatggtg aactgatagg aaaacaaatg agtagaaaga gttcatgaag 1021 aggccctcct ctgcctttca aaaggctggt cacctacaca tgtttaaggt gtctctgcac 1081 atgtctcaag cccatcacaa gaaagcaagt acagtgtgga tttcaaatgg tgtgtaactt 1141 cagctccagc tggtttttga cagctgttgc tgtggtaata tttttgacat gtgatggtga 1201 tagtctctgg ttctccccat ccccacaaag gctgttgaac cacagcacca ggaagcctga 1261 gaatgaatcc tgagggctct agcccaggct ttgtcccagg ctttctggtg tgtgccctcc 1321 tggtaacagt gaaattgaag ctacttactc atagtggttg tttctctggt cttgagtgac 1381 tgtgtccaca gttcattttt ttccggtagg aataactcct tttctacatc cacgctccat 1441 agagtctctc cttttcagac atcctgggat gaaagaattt ggcttttttt tttctttttt 1501 ttttggacat ctgttttcac tcttaggctt ttaaacaata gttattgctt ttatccctct 1561 cagattctaa taactgagag cgatggggct atattgaatc tctgtatgca ctgagaactg 1621 agctatgaag agaatcttat taaactgctg gtctgacttt atggattgac actgttcctt 1681 tcttttattg tgaaaaaaaa aaaaaaa // LOCUS HSALDHI1 1989 bp RNA PRI 12-SEP-1993 DEFINITION Human RNA for mitochondrial aldehyde dehydrogenase I ALDH I (EC 1.2.1.3). ACCESSION X05409 NID g28605 KEYWORDS aldehyde dehydrogenase; aldehyde dehydrogenase I. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1989) AUTHORS Braun,T., Bober,E., Singh,S., Agarwal,D.P. and Goedde,H.W. TITLE Evidence for a signal peptide at the amino-terminal end of human mitochondrial aldehyde dehydrogenase JOURNAL FEBS Lett. 215 (2), 233-236 (1987) MEDLINE 87219091 REMARK Erratum:[FEBS Lett 1988 Jun 20;233(2):440]] REFERENCE 2 (bases 1 to 1989) AUTHORS Schurr,A. and Rigor,B.M. TITLE Corrigenda and Errata JOURNAL FEBS Lett. 233, 440-441 (1988) COMMENT see y00109 for further ALDHI cDNA sequence. FEATURES Location/Qualifiers source 1..1989 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="22nd week of gestation" /tissue_type="fetal muscle (abortion)" /clone_lib="lambda gt11" /clone="(lambda)cALI23" CDS 37..1587 /note="aldehyde dehydrogenase 1 (AA 1-516)" /codon_start=1 /db_xref="PID:g28606" /db_xref="SWISS-PROT:P05091" /translation="MLRAAAARAPPGRRLLSAAATQAVPAPNQQPEVFCNQIFINNEW HDAVSRKTFPTVNPSTGEVICQVAEGDKEDVDKAREGRPGAFQLGSPWRRMDASHSGR LLNRLADLIERDRTYLAALETLDNGKPYVISYLVDLDMVLKCLRYYAGWADKYHGKTI PIDGDFFSYTRHEPVGVCGQIIPWNFPLLMQAWKLGPALATGNVVVMKVAEQTPLTAL YVANLIKEAGFPPGVVNIVPGFGPTAGAAIASHEDVDKVAFTGSTEIGRVIQVAAGSS NLKRVTLELGGKSPNIIMSDADMDWAVEQAHFALFFNQGQCCCAGSRTFVQEDIYDEF VVRSVARAKSRVVGNPFDSKTEQGPQVDETQFKKILGYINTGKQEGAKLLCGGGIAAD RGYFIQPTVFGDVQDGMTIAKEEIFGPVMQILKFKTIEEVVGRANNSTYGLAAAVFTK DLDKANYLSQALQAGTVWVNCYDVFGAQSPFGGYKMSGSGRELGEYGLQAYTEVKTVT VKVPQKNS" polyA_site 1989 /note="polyA site" BASE COUNT 466 a 521 c 565 g 437 t ORIGIN 1 gctctcggtc cgctcgctgt ccgctagccc gctgcgatgt tgcgcgctgc cgccgctcgg 61 gccccgcctg gccgccgcct cttgtcagcc gccgccaccc aggccgtgcc tgcccccaac 121 cagcagcccg aggtcttctg caaccagatt ttcataaaca atgaatggca cgatgccgtc 181 agcaggaaaa cattccccac cgtcaatccg tccactggag aggtcatctg tcaggtagct 241 gaaggggaca aggaagatgt ggacaaggca cgtgaaggcc gcccgggcgc cttccagctg 301 ggctcacctt ggcgccgcat ggacgcatca cacagcggcc ggctgctgaa ccgcctggcc 361 gatctgatcg agcgggaccg gacctacctg gcggccttgg agaccctgga caatggcaag 421 ccctatgtca tctcctacct ggtggatttg gacatggtcc tcaaatgtct ccggtattat 481 gccggctggg ctgataagta ccacgggaaa accatcccca ttgacggaga cttcttcagc 541 tacacacgcc atgaacctgt gggggtgtgc gggcagatca ttccgtggaa tttcccgctc 601 ctgatgcaag catggaagct gggcccagcc ttggcaactg gaaacgtggt tgtgatgaag 661 gtagctgagc agacacccct caccgccctc tatgtggcca acctgatcaa ggaggctggc 721 tttccccctg gtgtggtcaa cattgtgcct ggatttggcc ccacggctgg ggccgccatt 781 gcctcccatg aggatgtgga caaagtggca ttcacaggct ccactgagat tggccgcgta 841 atccaggttg ctgctgggag cagcaacctc aagagagtga ccttggagct gggggggaag 901 agccccaaca tcatcatgtc agatgccgat atggattggg ccgtggaaca ggcccacttc 961 gccctgttct tcaaccaggg ccagtgctgc tgtgccggct cccggacctt cgtgcaggag 1021 gacatctatg atgagtttgt ggtgcggagc gttgcccggg ccaagtctcg ggtggtcggg 1081 aacccctttg atagcaagac cgagcagggg ccgcaggtgg atgaaactca gtttaagaag 1141 atcctcggct acatcaacac ggggaagcaa gagggggcga agctgctgtg tggtgggggc 1201 attgctgctg accgtggtta cttcatccag cccactgtgt ttggagatgt gcaggatggc 1261 atgaccatcg ccaaggagga gatcttcggg ccagtgatgc agatcctgaa gttcaagacc 1321 atagaggagg ttgttgggag agccaacaat tccacgtacg ggctggccgc agctgtcttc 1381 acaaaggatt tggacaaggc caattacctg tcccaggccc tccaggcggg cactgtgtgg 1441 gtcaactgct atgatgtgtt tggagcccag tcaccctttg gtggctacaa gatgtcgggg 1501 agtggccggg agttgggcga gtacgggctg caggcataca ctgaagtgaa aactgtcaca 1561 gtcaaagtgc ctcagaagaa ctcataagaa tcatgcaagc ttcctccctc agccattgat 1621 ggaaagttca gcaagatcag caacaaaacc aagaaaaatg atccttgcgt gctgaatatc 1681 tgaaaagaga aatttttcct acaaaatctc ttgggtcaag aaagttctag aatttgaatt 1741 gataaacatg gtgggttggc tgagggtaag agtatatgag gaacctttta aacgacaaca 1801 atactgctag ctttcaggat gatttttaaa aaatagattc aaatgtgtta tcctctctct 1861 gaaacgcttc ctataactcg agtttatagg ggaagaaaaa gctattgttt acaattatat 1921 caccattaag gcaactgcta caccctgctt tgtattctgg gctaagattc attaaaaact 1981 agctgctct // LOCUS HSALDHPSY 2074 bp mRNA PRI 05-FEB-1998 DEFINITION H.sapiens mRNA for alkyl-dihydroxyacetonephosphate synthase precursor. ACCESSION Y09443 NID g1922284 KEYWORDS alkyl-dihydroxyacetonephosphate synthase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2074) AUTHORS de Vet,E.C., van den Broek,B.T. and van den Bosch,H. TITLE Nucleotide sequence of human alkyl-dihydroxyacetonephosphate synthase cDNA reveals the presence of a peroxisomal targeting signal 2 JOURNAL Biochim. Biophys. Acta 1346 (1), 25-29 (1997) MEDLINE 97330864 REFERENCE 2 (bases 1 to 2074) AUTHORS de Vet,E.C.J.M. TITLE Direct Submission JOURNAL Submitted (14-NOV-1996) E.C.J.M. de Vet, Institute for Biomembranes, Biochemistry of Lipids, Padualaan 8, 3584 CH Utrecht, NETHERLANDS FEATURES Location/Qualifiers source 1..2074 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="liver" CDS 16..1992 /codon_start=1 /product="alkyl-dihydroxyacetonephosphate synthase precursor" /db_xref="PID:e294102" /db_xref="PID:g1922285" /db_xref="SWISS-PROT:O00116" /translation="MAEAAAAAGGTGLGAGASYGSAADRDRDPDPDRAGRRLRVLSGH LLGRPREALSTNECKARRAASAATAAPTATPAAQESGTIPKKRQEVMKWNGWGYNDSK FIFNKKGQIELTGKRYPLSGMGLPTFKEWIQNTLGVNVEHKTTSKASLNPSDTPPSVV NEDFLHDLKETNISYSQEADDRVFRAHGHCLHEIFLLREGMFERIPDIVLWPTCHDDV VKIVNLACKYNLCIIPIGGGTSVSYGLMCPADETRTIISLDTSQMNRILWVDENNLTA HVEAGITGQELERQLKESGYCTGHEPDSLEFSTVGGWVSTRASGMKKNIYGNIEDLVV HIKMVTPRGIIEKSCQGPRMSTGPDIHHFIMGSEGTLGVITEATIKIRPVPEYQKYGS VAFPNFEQGVACLREIAKQRCAPASIRLMDNKQFQFGHALKPQVSSIFTSFLDGLKKF YITKFKGFDPNQLSVATLLFEGDREKVLQHEKQVYDIAAKFGGLAAGEDNGQRGYLLT YVIAYIRDLALEYYVLGESFETSAPWDRVVDLCRNVKERITRECKEKGVQFAPFSTCR VTQTYDAGACIYFYFAFNYRGISDPLTVFEQTEAAAREEILANGGSLSHHHGVGKLRK QWLKESISDVGFGMLKSVKEYVDPNNIFGNRNLL" BASE COUNT 615 a 392 c 515 g 552 t ORIGIN 1 cacaaggcgg tagccatggc ggaggcggcg gctgcagcgg gtgggactgg cttgggcgcg 61 ggcgcgagct acgggtctgc agcggaccgg gaccgggacc cggacccgga ccgcgccggg 121 cggaggctgc gggttctctc tggccatctg ctgggccggc cccgggaggc tctgagtacc 181 aatgagtgca aagcgcggag agccgcgtcg gcggccacgg cagcgcccac ggccactccc 241 gccgcgcagg agtcgggcac catcccaaag aagcggcaag aagttatgaa atggaatgga 301 tggggatata atgattctaa attcatcttc aataagaagg gccaaattga attgactggg 361 aaaaggtacc ctcttagtgg catgggttta ccaacattta aagaatggat ccaaaatacc 421 cttggagtaa atgtggagca taaaactacc tctaaagcat ccttaaatcc tagtgataca 481 cctccttctg ttgtaaatga agattttctt catgacctta aagaaactaa tatttcatat 541 tcacaagagg cagatgatcg agtatttaga gctcatggtc attgtcttca tgagatattt 601 ttgctcaggg aaggaatgtt tgagcgaatt cctgatatag ttttatggcc aacatgccat 661 gatgatgtag ttaagattgt gaatctagct tgcaaatata atctttgtat cataccaatt 721 ggtggaggaa caagtgtttc atatggcctg atgtgtcctg cagatgagac aagaacaatt 781 atttctttgg acacttcaca aatgaatcga attctctggg ttgatgagaa caatttgaca 841 gctcatgtgg aggctggcat aacaggacaa gagttggaaa gacagcttaa agaaagtggt 901 tattgtacag gtcatgaacc agattccctg gagttcagta ctgtaggagg atgggtatct 961 actcgcgcat caggcatgaa gaagaatatc tatggcaata tcgaggacct ggtggttcat 1021 ataaaaatgg taacacctag aggtataata gaaaaaagct gtcaaggacc tcgtatgtca 1081 acaggccctg atatccatca cttcatcatg ggatctgaag gaactcttgg tgtaataaca 1141 gaagctacaa taaaaatcag accagtccct gaataccaaa agtatggctc agtagctttc 1201 cctaattttg aacaaggagt agcctgttta agagaaattg caaaacagag atgtgctccg 1261 gcatctattc gcctcatgga caacaagcag tttcagtttg gtcatgctct taaacctcag 1321 gtttcctcta tctttacatc atttttggac ggattaaaaa agttttatat tacaaagttt 1381 aaaggatttg acccaaatca gctaagtgta gccacattac tgtttgaggg ggatcgtgag 1441 aaggttcttc aacatgaaaa acaagtgtat gatattgctg caaaatttgg tgggttggca 1501 gctggagaag ataatggaca gagaggttat ttgctgacct atgttattgc atacattcga 1561 gacttggctt tggaatacta tgtattagga gaatcttttg agacttctgc tccttgggac 1621 agggtggtag atctctgtag aaatgtaaaa gaaagaataa caagggaatg caaagagaag 1681 ggtgttcagt ttgctccttt ttctacatgc agggtgacgc agacttacga tgcaggtgct 1741 tgtatctact tctattttgc ctttaactac aggggaatta gtgacccact gaccgtattt 1801 gaacaaactg aggcagctgc tagagaagaa atccttgcta atggagggag cctgtcacat 1861 caccatggag tgggcaagtt acggaagcaa tggctaaagg aaagtatctc tgatgtcggc 1921 tttgggatgc tgaagtctgt caaggaatat gtggacccca ataacatctt tggaaacaga 1981 aaccttttat aaatccatta gtaccattac aaaaaaatgt caattttttt tttaagtttt 2041 caactgtggt tatactagta atcaaatata tcat // LOCUS HSALDR 3547 bp RNA PRI 01-NOV-1997 DEFINITION Homo sapiens mRNA for adrenoleukodystrophy related protein (ALDR). ACCESSION AJ000327 NID g2584766 KEYWORDS adrenoleukodystrophy related protein; ALDR gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3547) AUTHORS Holzinger,A. TITLE Direct Submission JOURNAL Submitted (07-JUL-1997) Holzinger A., Department of Clinical Chemistry and Metabolism, LMU Dr. v. Hauner Children's Hospital, Lindwurmstrasse 4, Munich, 80337, GERMANY REMARK Revised by [3] REFERENCE 2 (bases 1 to 3547) AUTHORS Holzinger,A., Kammerer,S., Berger,J. and Roscher,A.A. TITLE cDNA cloning and mRNA expression of the human adrenoleukodystrophy related protein (ALDRP), a peroxisomal ABC transporter JOURNAL Biochem. Biophys. Res. Commun. 239 (1), 261-264 (1997) MEDLINE 98005117 REFERENCE 3 (bases 1 to 3547) AUTHORS Holzinger,A. TITLE Direct Submission JOURNAL Submitted (31-OCT-1997) Holzinger A., Department of Clinical Chemistry and Metabolism, LMU Dr. v. Hauner Children's Hospital, Lindwurmstrasse 4, Munich, 80337, GERMANY FEATURES Location/Qualifiers source 1..3547 /organism="Homo sapiens" /db_xref="taxon:9606" source 1..2395 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="12" gene 136..2358 /gene="ALDR" CDS 136..2358 /gene="ALDR" /function="peroxisomal ABC-transporter" /codon_start=1 /product="adrenoleukodystrophy related protein" /db_xref="PID:e1169561" /db_xref="PID:g2584767" /translation="MTHMLNAAADRVKWTRSSAAKRAACLVAAAYALKTLYPIIGKRL KQSGHGKKKAAAYPAAENTEILHCTETICEKPSPGVNADFFKQLLELRKILFPKLVTT ETGWLCLHSVALISRTFLSIYVAGLDGKIVKSIVEKKPRTFIIKLIKWLMIAIPATFV NSAIRYLECKLALAFRTRLVDHAYETYFTNQTYYKVINMDGRLANPDQSLTEDIMMFS QSVAHLYSNLTKPILDVMLTSYTLIQTATSRGASPIGPTLLAGLVVYATAKVLKACSP KFGKLVAEEAHRKGYLRYVHSRIIANVEEIAFYRGHKVEMKQLQKSYKALADQMNLIL SKRLWYIMIEQFLMKYVWSSSGLIMVAIPIITATGFADGEDGQKQVMVSERTEAFTTA RNLLASGADAIERIMSSYKEVTELAGYTARVYNMFWVFDEVKRGIYKRTAVIQESESH SKNGAKVELPLSDTLAIKGKVIDVDHGIICENVPIITPAGEVVASRLNFKVEEGMHLL ITGPNGCGKSSLFRILSGLWPVYEGVLYKPPPQHMFYIPQRPYMSLGSLRDQVIYPDS VDDMHDKGYTDQDLERILHNVHLYHIVQREGGWDAVMDWKDVLSGGEKQRMGMARMFY HKPKYALLDECTSAVSIDVEGKIFQAAKGAGISLLSITHRPSLWKYHTHLLQFDGEGG WRFEQLDTAIRLTLSEEKQKLESQLAGIPKMQQRLNELCKILGEDSVLKTIKNEDETS " polyA_site 2361 BASE COUNT 1179 a 623 c 717 g 1027 t 1 others ORIGIN 1 aaaacacaac agtggaagag aaacgctgca tactatggga cgctgtagga ctttctaaaa 61 catttgctgg ggatttctgt gaagcatgat cttttaaacg aattcttttg gaagccggtt 121 tgggtaactg ggaaaatgac acatatgcta aatgcagcag ctgatcgagt gaaatggacc 181 agatcgagtg ctgctaagag ggctgcctgc ctggtggctg cggcatatgc tctgaaaacc 241 ctctatccca tcattggcaa gcgtttaaag caatctggcc acgggaagaa aaaagcagca 301 gcttaccctg ctgcagagaa cacagaaata ctgcattgca ccgagaccat ttgtgaaaaa 361 ccttcgcctg gagtgaatgc agatttcttc aaacagctac tagaacttcg gaaaattttg 421 tttccaaaac ttgtgaccac tgaaacaggg tggctctgcc tgcactcagt ggctctaatc 481 tcaagaacct ttctttctat ctatgtggct ggtctggatg gaaaaatcgt gaaaagcatt 541 gtggaaaaga agcctcggac tttcatcatc aaattaatca agtggcttat gattgccatc 601 cctgctacct tcgtcaacag tgcaataagg tacctggaat gcaaattggc tttggccttc 661 agaactcgcc tagtagacca cgcctatgaa acctatttta caaatcagac ttattataaa 721 gtgatcaata tggatgggag gctggcaaac cctgaccaat ctcttacgga ggatattatg 781 atgttctccc aatctgtggc tcacttgtat tccaatctga ccaaacctat tttagatgta 841 atgctgacct cctatacact cattcaaact gctacatcca gaggagcaag cccaattggg 901 cccaccctac tagcaggact tgtggtgtat gccactgcta aagtgttaaa agcctgttct 961 cccaaatttg gcaaactggt ggcagaggaa gcacatagaa aaggctattt gcggtatgtg 1021 cactcgagaa ttatagccaa tgtagaagaa attgcctttt acagaggaca taaggtagaa 1081 atgaaacaac ttcagaaaag ttacaaagct ttagcagatc agatgaacct cattttatcc 1141 aaacgtttgt ggtacatcat gatagaacag ttcctgatga agtatgtttg gagcagcagt 1201 ggactaatta tggtggctat acctattatc actgcaactg gctttgcaga tggtgaggat 1261 ggccaaaagc aagttatggt tagtgaacgg acagaagcct ttaccactgc tcgaaattta 1321 ctggcctctg gagctgatgc tattgaaagg attatgtctt catacaaaga ggtcactgaa 1381 ttagcaggct acactgctcg agtgtacaat atgttttggg tctttgatga agtaaaaaga 1441 ggcatttata agagaactgc tgtcattcaa gaatctgaaa gccatagcaa gaatggagct 1501 aaggtagaat tacctctcag tgacacattg gcaattaaag gaaaagttat tgatgtggat 1561 cacggaatta tttgtgaaaa tgttcccata attacaccag caggagaagt ggtggcttcc 1621 aggctaaact tcaaagtaga agaaggaatg catcttttga taactggtcc caatggttgt 1681 gggaaaagtt ctctcttcag aattctaagt gggctctggc ctgtgtatga aggagtcctc 1741 tataaaccac ctcctcaaca tatgttttat attccacaaa ggccatatat gtctcttgga 1801 agtcttcggg atcaagtcat ttaccctgat tcagtggatg atatgcatga taaaggttat 1861 acagaccaag atctggaacg tatcctacac aatgtccatc tctatcacat agttcaaaga 1921 gaaggaggat gggatgctgt tatggactgg aaagatgtcc tgtcaggagg ggaaaagcaa 1981 agaatgggca tggctcgtat gttttatcat aaaccaaaat atgccttgct ggatgaatgt 2041 accagtgctg tcagcattga tgtcgaagga aagatatttc aggctgcaaa aggggctgga 2101 atttccttac tgtctataac acacagacct tctctttgga aataccacac acatttatta 2161 cagtttgatg gtgaaggagg ttggcgcttt gaacaattgg atactgctat ccgtttgaca 2221 ttgagtgaag aaaaacaaaa gctagaatct cagctagctg gaattcccaa aatgcagcag 2281 agactcaatg aactatgtaa aattttggga gaagactcag tgctgaaaac aattaaaaat 2341 gaagatgaga catcttaatt tgttttgaca tattttaaaa agttaattat tagataaagg 2401 ctcaaagaca ttctgttata ctgcatgaag tatgttaagc taagcacaga gaaaaaaagg 2461 cagcaagaca tgttttataa gattttagca ttaaggaagt atatgatctg acttttcaga 2521 agaaaataaa caaatgcatt atgtaaggtc agtcattatg acttatacta attcctagtg 2581 aaggcctaat gcacttgtaa aacaggattt tctaggtgaa ttcctgatga ataccagatt 2641 tactatgtat atgtggtgtg tctgaagttc ttaacaaaca tgggcaatat tctggaaatg 2701 aaacaagtta taactgagca ccatttgggt tgataccaag tgcataagat tcaaactttg 2761 agtgacattt agtccattta tggttgatat taggtttaat acctagaatt caaattgatt 2821 attgctagtg gccaactaaa cctgtacaaa atagctgaca gttttataac taatttcaat 2881 ataaaaattg ttttaatggc atttgttgaa agaaaaaagc atggctaaaa tgtatcaaat 2941 gccntatttt taaattttgg actttaagca tcttaatgag ggcatataac aaattaattt 3001 tagtacaatc ttaaatattt ttaataaatc ctttcatttt aaaaagagaa ttgccaatac 3061 agaaaaggag tatccaaaca atgtctcaac ctgataattt ccttagcaga attacctatt 3121 gcaacttctg ttcagaaata cacagcttgt ttttttgccc aaggatgagt ctacatttta 3181 agaactgcaa tggtataaag gaacttaagg attctgagaa tcatagtaat aacatacatt 3241 ggaatagtac tttataattt acaatcccca tttacatcat ttcaccttaa tgttgaggac 3301 aatgttttga aacaaatact atttttccta ctttgctttt gagaaaattg acactcagac 3361 ttgccctaat catgcacttt acttaaggaa agatcgagaa atcaaatgaa gttctcctga 3421 ctctctggtt tagtgctctt ttgttattat cctttaaatc aaactgggct ataatagcaa 3481 taaaagttag acgaagtgta gaaaataaaa taaatttcat aatgttaaaa aaaaaaaaaa 3541 aaaaaaa // LOCUS HSALK3A 2932 bp RNA PRI 29-SEP-1993 DEFINITION H.sapiens ALK-3 mRNA. ACCESSION Z22535 NID g402186 KEYWORDS ALK-3 gene; cell surface receptor; serine threonine kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2932) AUTHORS ten Dijke,P.P., Ichijo,H.H., Franzen,P.P., Schulz,P.P., Saras,J.J., Toyoshima,H.H., Heldin,C.C. and Miyazono,K.K. TITLE Activin receptor-like kinases; Anovel subclass of cell surface receptors with predicted serine/threonine kinase activity JOURNAL Unpublished REFERENCE 2 (bases 1 to 2932) AUTHORS ten Dijke,P.P. TITLE Direct Submission JOURNAL Submitted (06-APR-1993) Peter P ten Dijke, Ludwig Institute for Cancer Research, Uppsala, branch, Biomedical Center, Husargatan 3, Uppsala, S-751 24, Sweden REFERENCE 3 (bases 1 to 2932) AUTHORS ten Dijke,P., Ichijo,H., Franzen,P., Schulz,P., Saras,J., Toyoshima,H., Heldin,C.H. and Miyazono,K. TITLE Activin receptor-like kinases: a novel subclass of cell-surface receptors with predicted serine/threonine kinase activity JOURNAL Oncogene 8 (10), 2879-2887 (1993) MEDLINE 93390967 FEATURES Location/Qualifiers source 1..2932 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="ONF5" /cell_line="AG 1518 human foreskin fibroblasts" /clone_lib="lambda ZAP II cDNA library, random primed" CDS 310..1908 /codon_start=1 /product="ALK-3" /db_xref="PID:g402187" /db_xref="SWISS-PROT:P36894" /translation="MTQLYIYIRLLGAYLFIISRVQGQNLDSMLHGTGMKSDSDQKKS ENGVTLAPEDTLPFLKCYCSGHCPDDAINNTCITNGHCFAIIEEDDQGETTLASGCMK YEGSDFQCKDSPKAQLRRTIECCRTNLCNQYLQPTLPPVVIGPFFDGSIRWLVLLISM AVCIIAMIIFSSCFCYKHYCKSISSRRRYNRDLEQDEAFIPVGESLKDLIDQSQSSGS GSGLPLLVQRTIAKQIQMVRQVGKGRYGEVWMGKWRGEKVAVKVFFTTEEASWFRETE IYQTVLMRHENILGFIAADIKGTGSWTQLYLITDYHENGSLYDFLKCATLDTRALLKL AYSAACGLCHLHTEIYGTQGKPAIAHRDLKSKNILIKKNGSCCIADLGLAVKFNSDTN EVDVPLNTRVGTKRYMAPEVLDESLNKNHFQPYIMADIYSFGLIIWEMARRCITGGIV EEYQLPYYNMVPSDPSYEDMREVVCVKRLRPIVSNRWNSDECLRAVLKLMSECWAHNP ASRLTALRIKKTLAKMVESQDVKI" sig_peptide 310..378 misc_feature 379..765 /note="encoding extracellular domain" misc_feature 490..699 /note="encoding cysteine-rich domain" misc_feature 766..837 /note="encoding predicted transmembrane domain" misc_feature 838..1905 /note="encoding intracellular domain" misc_feature 1015..1890 /note="encoding predicted serine/threonine kinase domain" BASE COUNT 849 a 575 c 643 g 865 t ORIGIN 1 gctccgcgcc gagggctgga ggatgcgttc cctggggtcc ggacttatga aaatatgcat 61 cagtttaata ctgtcttgga attcatgaga tggaagcata ggtcaaagct gtttggagaa 121 aatcagaagt acagttttat ctagccacat cttggaggag tcgtaagaaa gcagtgggag 181 ttgaagtcat tgtcaagtgc ttgcgatctt ttacaagaaa atctcactga atgatagtca 241 tttaaattgg tgaagtagca agaccaatta ttaaaggtga cagtacacag gaaacattac 301 aattgaacaa tgactcagct atacatttac atcagattat tgggagccta tttgttcatc 361 atttctcgtg ttcaaggaca gaatctggat agtatgcttc atggcactgg gatgaaatca 421 gactccgacc agaaaaagtc agaaaatgga gtaaccttag caccagagga taccttgcct 481 tttttaaagt gctattgctc agggcactgt ccagatgatg ctattaataa cacatgcata 541 actaatggac attgctttgc catcatagaa gaagatgacc agggagaaac cacattagct 601 tcagggtgta tgaaatatga aggatctgat tttcagtgca aagattctcc aaaagcccag 661 ctacgccgga caatagaatg ttgtcggacc aatttatgta accagtattt gcaacccaca 721 ctgccccctg ttgtcatagg tccgtttttt gatggcagca ttcgatggct ggttttgctc 781 atttctatgg ctgtctgcat aattgctatg atcatcttct ccagctgctt ttgttacaaa 841 cattattgca agagcatctc aagcagacgt cgttacaatc gtgatttgga acaggatgaa 901 gcatttattc cagttggaga atcactaaaa gaccttattg accagtcaca aagttctggt 961 agtgggtctg gactaccttt attggttcag cgaactattg ccaaacagat tcagatggtc 1021 cggcaagttg gtaaaggccg atatggagaa gtatggatgg gcaaatggcg tggcgaaaaa 1081 gtggcggtga aagtattctt taccactgaa gaagccagct ggtttcgaga aacagaaatc 1141 taccaaactg tgctaatgcg ccatgaaaac atacttggtt tcatagcggc agacattaaa 1201 ggtacaggtt cctggactca gctctatttg attactgatt accatgaaaa tggatctctc 1261 tatgacttcc tgaaatgtgc tacactggac accagagccc tgcttaaatt ggcttattca 1321 gctgcctgtg gtctgtgcca cctgcacaca gaaatttatg gcacccaagg aaagcccgca 1381 attgctcatc gagacctaaa gagcaaaaac atcctcatca agaaaaatgg gagttgctgc 1441 attgctgacc tgggccttgc tgttaaattc aacagtgaca caaatgaagt tgatgtgccc 1501 ttgaatacca gggtgggcac caaacgctac atggctcccg aagtgctgga cgaaagcctg 1561 aacaaaaacc acttccagcc ctacatcatg gctgacatct acagcttcgg cctaatcatt 1621 tgggagatgg ctcgtcgttg tatcacagga gggatcgtgg aagaatacca attgccatat 1681 tacaacatgg taccgagtga tccgtcatac gaagatatgc gtgaggttgt gtgtgtcaaa 1741 cgtttgcggc caattgtgtc taatcggtgg aacagtgatg aatgtctacg agcagttttg 1801 aagctaatgt cagaatgctg ggcccacaat ccagcctcca gactcacagc attgagaatt 1861 aagaagacgc ttgccaagat ggttgaatcc caagatgtaa aaatctgatg gttaaaccat 1921 cggaggagaa actctagact gcaagaactg tttttaccca tggcatgggt ggaattagag 1981 tggaataagg atgttaactt ggttctcaga ctctttcttc actacgtgtt cacaggctgc 2041 taatattaaa cctttcagta ctcttattag gatacaagct gggaacttct aaacacttca 2101 ttctttatat atggacagct ttattttaaa tgtggttttt gatgcctttt tttaagtggg 2161 tttttatgaa ctgcatcaag acttcaatcc tgattagtgt ctccagtcaa gctctgggta 2221 ctgaattgcc tgttcataaa acggtgcttt ctgtgaaagc cttaagaaga taaatgagcg 2281 cagcagagat ggagaaatag actttgcctt ttacctgaga cattcagttc gtttgtattc 2341 tacctttgta aaacagccta tagatgatga tgtgtttggg atactgctta ttttatgata 2401 gtttgtcctg tgtccttagt gatgtgtgtg tgtctccatg cacatgcacg ccgggattcc 2461 tctgctgcca tttgaattag aagaaaataa tttatatgca tgcacaggaa gatattggtg 2521 gccggtggtt ttgtgcttta aaaatgcaat atctgaccaa gattcgccaa tctcatacaa 2581 gccatttact ttgcaagtga gatagcttcc ccaccagctt tattttttaa catgaaagct 2641 gatgccaagg ccaaaagaag tttaaagcat ctgtaaattt ggactgtttt ccttcaacca 2701 ccattttttt tgtggttatt atttttgtca cggaaagcat cctctccaaa gttggagctt 2761 ctattgccat gaaccatgct tacaaagaaa gcacttctta ttgaagtgaa ttcctgcatt 2821 tgatagcaat gtaagtgcct ataaccatgt tctatattct ttattctcag taacttttaa 2881 aagggaagtt atttatattt tgtgtataat gtgctttatt tgcaaatcac cc // LOCUS HSALPEND 749 bp RNA PRI 08-JAN-1998 DEFINITION H.sapiens mRNA for alpha endosulfine. ACCESSION X99906 NID g2764973 KEYWORDS alpha endosulfine; sulfonylurea receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 749) AUTHORS Heron,L. JOURNAL Unpublished REFERENCE 2 (bases 1 to 749) AUTHORS Bataille,D. TITLE Direct Submission JOURNAL Submitted (08-AUG-1996) D. Bataille, Inserm U 376, Endocrinologie des Peptides, 371, Rue du Doyen Gaston Giraud, Montpellier Cedex 05, 34294, FRANCE FEATURES Location/Qualifiers source 1..749 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="brain" CDS 126..491 /function="endogenous lignad for sulfonylurea receptor" /codon_start=1 /product="alpha endosulfine" /db_xref="PID:e284090" /db_xref="PID:g2764974" /translation="MSQKQEEENPAEETGEEKQDTQEKEGILPERAEEAKLKAKYPSL GQKPGGSDFLMKRLQKGQKYFDSGDYNMAKAKMKNKQLPSAGPDKNLVTGDHIPTPQD LPQRKSSLVTSKLAGGQVE" BASE COUNT 184 a 224 c 200 g 141 t ORIGIN 1 cggtgactga gcaaccctag tgacaggagc cgaagcagca gcgcaggttg tccccgtttc 61 ccctccccct tcccttctcc ggttgccttc ccgggcccct tacactccac agtcccggtc 121 ccgccatgtc ccagaaacaa gaagaagaga accctgcgga ggagaccggc gaggagaagc 181 aggacacgca ggagaaagaa ggtattctgc ctgagagagc tgaagaggca aagctaaagg 241 ccaaataccc aagcctagga caaaagcctg gaggctccga cttcctcatg aagagactcc 301 agaaagggca aaagtacttt gactcaggag actacaacat ggccaaagcc aagatgaaga 361 ataagcagct gccaagtgca ggaccagaca agaacctggt gactggtgat cacatcccca 421 ccccacagga tctgccccag agaaagtcct cgctcgtcac cagcaagctt gcgggtggcc 481 aagttgaatg atgctgcccg gggctctgcc agatcctgag acgcttcccc tccctgcccc 541 acccgggtcc tgtgctggct cctgcccctt cctgcttttg cagccagggg tcaggaggtg 601 gctcgggtgt gggctggaga ggcagaagcc ctttcctgtt ggtgtcccag cacatggacc 661 ccttgggctg agcaccaaga ccttgaacct tttttgtttt accttttttc caaataacag 721 ttgggagaaa tatcaatgaa attctgccg // LOCUS HSALPHA4 1321 bp mRNA PRI 22-JAN-1998 DEFINITION H.sapiens mRNA for alpha 4 protein. ACCESSION Y08915 NID g1877201 KEYWORDS alpha 4 protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1321) AUTHORS Onda,M., Inui,S., Maeda,K., Suzuki,M., Takahashi,E. and Sakaguchi,N. TITLE Expression and chromosomal localization of the human alpha 4/IGBP1 gene, the structure of which is closely related to the yeast TAP42 protein of the rapamycin-sensitive signal transduction pathway JOURNAL Genomics 46 (3), 373-378 (1997) MEDLINE 98110572 REFERENCE 2 (bases 1 to 1321) AUTHORS Inui,S. TITLE Direct Submission JOURNAL Submitted (10-OCT-1996) S. Inui, Kumamoto University, School of Medicine, Immunology, 2-2-1, Honjo, Kumamoto, 860, JAPAN FEATURES Location/Qualifiers source 1..1321 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="RPM18866" /cell_line="IM9" /cell_type="B lymphocyte" CDS 9..1028 /codon_start=1 /product="alpha 4 protein" /db_xref="PID:e293920" /db_xref="PID:g1877202" /translation="MAAEDELQLPRLPELFETGRQLLDEVEVATEPAGSRIVQEKVFK GLDLLEKAAEMLSQLDLFSRNEDLEEIASTDLKYLLVPAFQGALTMKQVNPSKRLDHL QRAREHFINYLTQCHCYHVAEFELPKTMNNSAENHTANSSMAYPSLVAMASQRQAKIQ RYKQKKELEHRLSAMKSAVESGQADDERVREYYLLHLQRWIDISLEEIESIDQEIKIL RERDSSREASTSNSSRQERPPVKPFILTRNMAQAKVFGAGYPSLPTMTVSDWYEQHRK YGALPDQGIAKAAPEEFRKAAQQQEEQEEKEEEDDEQTLHRAREWDDWKDTHPRGYGN RQNMG" BASE COUNT 388 a 305 c 330 g 298 t ORIGIN 1 cccccaagat ggctgctgag gacgagttac agctgccgcg gctccccgag ctgttcgaaa 61 ctggtagaca gttactggac gaagtagaag tggcgactga acccgccggt tcccggatag 121 tccaggagaa ggtgttcaag ggcttggacc tccttgagaa ggctgccgaa atgttatcgc 181 agctcgactt gttcagccga aatgaagatt tggaagagat tgcttccacc gacctgaagt 241 accttttggt gccagcgttt caaggagccc tcaccatgaa acaagtcaac cccagcaagc 301 gtctagatca tttgcagcgg gctcgagaac actttataaa ctacttaact cagtgccatt 361 gctatcatgt ggcagagttt gagctgccca aaaccatgaa caactctgct gaaaatcaca 421 ctgccaattc ctccatggct tatcctagtc tcgttgctat ggcatctcaa agacaggcta 481 aaatacagag atacaagcag aagaaggagt tggagcatag gttgtctgca atgaaatctg 541 ctgtggaaag tggtcaagca gatgatgagc gtgttcgtga atattatctt cttcaccttc 601 agaggtggat tgatatcagc ttagaagaga ttgagagcat tgaccaggaa ataaagatcc 661 tgagagaaag agactcttca agagaggcat caacttctaa ctcatctcgc caggagaggc 721 ctccagtgaa acccttcatt ctcactcgga acatggctca agccaaagta tttggagctg 781 gttatccaag tctgccaact atgacggtga gtgactggta tgagcaacat cggaaatatg 841 gagcattacc ggatcaggga atagccaagg cagcaccaga ggaattcaga aaagcagctc 901 agcaacagga agaacaagaa gaaaaggagg aagaggatga tgaacaaaca ctccacagag 961 cccgggagtg ggatgactgg aaggacaccc atcctagggg ctatgggaac cgacagaaca 1021 tgggctgatc ttcccacaac accacaggac tgcagggtgc acaactccct gccaaggaaa 1081 accatgcagt cctcccctcc ctggtctcct gcttcagctc tgtacaacga gggcaaagat 1141 gctaaatctt gctttgcatt cagtaaagtg tcaagtgatt aagtgtgtat ttgtacccta 1201 gatgatatga accagcagtc ttgttttggc atcatcctca tcatgttgta ttccagcttc 1261 ttaagtggaa ggaaaagagt gctgagaaat ggctctgtat aatctatggc tatccgaatt 1321 c // LOCUS HSAMINPEP 3747 bp RNA PRI 03-DEC-1996 DEFINITION H.sapiens mRNA for aminopeptidase. ACCESSION Y07701 NID g1657267 KEYWORDS aminopeptidase; puromycin; puromycin-sensitive. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3747) AUTHORS Tobler,A.R., Constam,D.B., Schmitt-Graeff,A., Malipiero,U., Schlapbach,R. and Fontana,A. TITLE Cloning of the human puromycin-sensitive aminopeptidase and evidence for expression in neurons JOURNAL Unpublished REFERENCE 2 (bases 1 to 3747) AUTHORS Tobler,A.R. TITLE Direct Submission JOURNAL Submitted (28-AUG-1996) A.R. Tobler, University Hospital Zurich, Dept. Of Internal Medicine, Section For Clinical Immunology, Moussonstrasse 13, Zurich, CH-8044, SWITZERLAND FEATURES Location/Qualifiers source 1..3747 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="ZAP II fetal brain library, Statagene Cat no. 936206" /dev_stage="17 week gestation" CDS 405..3032 /note="puromycin-sensitive" /codon_start=1 /evidence=experimental /product="aminopeptidase" /db_xref="PID:e267526" /db_xref="PID:g1657268" /translation="MPEKRPFERLPADVSPINYSLCLKPDLLDFTFEGKLEAAAQVRQ ATNQIVMNCADIDIITASYAPEGDEEIHATGFNYQNEDEKVTLSFPSTLQTGTGTLKI DFVGELNDKMKGFYRSKYTTPSGEVRYAAVTQFEATDPRRAFPCWDEPAIKATFDISL VVPKDRVALSNMNVIDRKPYPDDENLVEVKFARTPVMSTYLVAFVVGEYDFVETRSKD GVCVRVYTPVGKAEQGKFALEVAAKTLPFYKDYFNVPYPLPKIDLIAIADFAAGAMEN WGLVTYRETALLIDPKNSCSSSRQWVALVVGHELAHQWFGNLVTMEWWTHLWLNEGFA SWIEYLCVDHCFPEYDIWTQFVSADYTRAQELDALDNSHPIEVSVGHPSEVDEIFDAI SYSKGASVIRMLHDYIGDKDFKKGMNMYLTKFQQKNAATEDLWESLENASGKPIAAVM NTWTKQMGFPLIYVEAEQVEDDRLLRLSQKKFCAGGSYVGEDCPQWMVPITISTSEDP NQAKLKILMDKPEMNVVLKNVKPDQWVKLNLGTVGFYRTQYSSAMLESLLPGIRDLSL PPVDRLGLQNDLFSLARAGIISTVEVLKVMEAFVNEPNYTVWSDLSCNLGILSTLLSH TDFYEEIQEFVKDVFSPIGERLGWDPKPGEGHLDALLRGLVLGKLGKAGHKATLEEAR RRFKDHVEGKQILSADLRSPVYLTVLKHGDGTTLDIMLKLHKQADMQEEKNRIERVLG ATLLPDLIQKVLTFALSEEVRPQDTVSVIGGVAGGSKHGRKAAWKFIKDNWEELYNRY QGGFLISRLIKLSVEGFAVDKMAGEVKAFFESHPAPSAERTIQQCCENILLNAAWLKR DAESIHQYLLQRKASPPTV" BASE COUNT 1019 a 822 c 882 g 1024 t ORIGIN 1 gaattccccg atgtacttct gcattaaatt ccttgctttc agaatcataa ccagggaatc 61 gtttggtact gtgagggata gacaagccct catttctgta atccctgcta agtgaggtca 121 aggctgacat ctaagatgtt ctgtggcgga acctggcccc catcaggggc aacctttaat 181 ttggattctg gaagactaaa actttgagat ctctcccccg ccccccaggc tcccccggta 241 gctctcctcc ggcggtggcc cgcgctcggt ggatgtggct ggcagctgcc gccccctccc 301 tcgctcgccg cctgcttttc ctcggccctc cgcctcctcc cctcctcctt ctcgtcttca 361 gccgctcctc tcgccgccgc ctccacagcc tgggcctcgc cgcgatgccg gagaagaggc 421 ccttcgagcg gctgcctgcc gatgtctccc ccatcaacta cagcctttgc ctcaagcccg 481 acttgctgga cttcaccttc gagggcaagc tggaggccgc cgcccaggtg aggcaggcga 541 ctaatcagat tgtgatgaat tgtgctgata ttgatattat tacagcttca tatgcaccag 601 aaggagatga agaaatacat gctacaggat ttaactatca gaatgaagat gaaaaagtca 661 ccttgtcttt ccctagtact ctgcaaacag gtacgggaac cttaaagata gattttgttg 721 gagagctgaa tgacaaaatg aaaggtttct atagaagtaa atatactacc ccttctggag 781 aggtgcgcta tgctgctgta acacagtttg aggctactga tccgcgaagg gcttttcctt 841 gctgggatga gcctgctatc aaagcaactt ttgatatctc attggttgtt cctaaagaca 901 gagtagcttt atcaaacatg aatgtaattg accggaaacc ataccctgat gatgaaaatt 961 tagtggaagt gaagtttgcc cgcacacctg ttatgtctac atatctggtg gcatttgttg 1021 tgggtgaata tgactttgta gaaacaaggt caaaagatgg tgtgtgtgtc cgtgtttaca 1081 ctcctgttgg caaagcagag caaggaaaat ttgcgttaga ggttgctgct aaaaccttgc 1141 ctttttataa ggactacttc aatgttcctt atcctctacc taaaattgat ctcattgcta 1201 ttgcagactt tgcagctggt gccatggaga actggggcct tgttacttat agggagactg 1261 cattgcttat tgatccaaaa aattcctgtt cttcatcccg ccagtgggtt gctctggttg 1321 tgggacatga actcgcccat caatggtttg gaaatcttgt tactatggaa tggtggactc 1381 atctttggtt aaatgaaggt tttgcatcct ggattgaata tctgtgtgta gaccactgct 1441 tcccagagta tgatatttgg actcagtttg tttctgctga ttacacccgt gcccaggagc 1501 ttgacgcctt agataacagc catcctattg aagtcagtgt gggccatcca tctgaggttg 1561 atgagatatt tgatgctata tcatatagca aaggtgcatc tgtcatccga atgctgcatg 1621 actacattgg ggataaggac tttaagaaag gaatgaacat gtatttaacc aagttccaac 1681 aaaagaatgc tgccacagag gatctctggg aaagtttaga aaatgctagt ggtaaaccta 1741 tagcagctgt gatgaatacc tggaccaaac aaatgggatt tcccctcatt tatgtggaag 1801 ctgaacaggt agaagatgac agattattga ggttgtccca aaagaagttc tgtgctggtg 1861 ggtcatatgt tggtgaagat tgtccccagt ggatggtccc tatcacaatc tctactagtg 1921 aagaccccaa ccaggccaaa ctaaaaattc taatggacaa gccagagatg aatgtggttt 1981 tgaaaaatgt caaaccagac caatgggtga agttaaactt aggaacagtt gggttttatc 2041 ggacccagta cagctctgcc atgctggaaa gtttattacc aggcattcgt gacctttctc 2101 tgccccctgt ggatcgactt ggattacaga atgacctctt ctccttggct cgagctggaa 2161 tcattagcac tgtagaggtt ctaaaagtca tggaggcttt tgtgaatgag cccaattata 2221 ctgtatggag cgacctgagc tgtaacctgg ggattctctc aactctcttg tcccacacag 2281 acttctatga ggaaatccag gagtttgtga aagatgtctt ttcacctata ggggagagac 2341 tgggctggga ccccaaacct ggagaaggtc atctcgatgc actcctgagg ggcttggttc 2401 tgggaaaact aggaaaagca ggacataagg caacgttaga agaagcccgt cgtcggttta 2461 aggaccacgt ggaaggaaaa cagattctct ccgctgatct gaggagtcct gtctatctga 2521 ctgttttgaa gcatggtgat ggcactactt tagatattat gttaaaactt cataaacaag 2581 cagatatgca agaagagaaa aaccgaatcg aaagagtcct tggcgctact cttttgcctg 2641 acctgattca aaaagtcctc acgtttgcac tttcagaaga ggtacgtcca caggacactg 2701 tatcggtaat tggtggagta gctggaggca gcaagcatgg taggaaagct gcttggaaat 2761 tcataaagga caactgggaa gaactttata accgatacca gggaggattc ttaatatcca 2821 gactaataaa gctatcagtt gagggatttg cagttgataa aatggctgga gaggttaagg 2881 ctttcttcga gagtcaccca gctccttcag ctgagcgtac catccagcag tgttgtgaaa 2941 atattctgct gaatgctgcc tggctaaagc gagatgctga gagcatccac cagtacctcc 3001 ttcagcggaa ggcctcacca cccacagtgt gaatcctgag gttgcgccat tggcggttct 3061 gctcgttcgc tgcagggata aggtggagct accgaacagc tgattcatat gccaagaatt 3121 tggagtcttc tttcaaacca gtgggggttg gacaatgaat gtagttaact ggttcctgct 3181 cacactccag aattaaattc tattgaaaaa ggaaaatcag caattcagca aaaaataaat 3241 aaaaaataaa aatgtaaata tgatagtaat aaaatagagc ataacgaaac tgtgaaactt 3301 tctgaagcct tgtcagtggt taaaagtatt taacactcta ctgttaatga cagatgttct 3361 gtttttataa cctaccaaaa ggaaactaga ggcttcttgg tgaagagcat ttttgtgaag 3421 tgggttctgc aaggagccta taaagccaag ggtggtgtcc atttctggga atggttaaac 3481 acaaaaggct gatagctggt atcacatagt tggagtcagt gcataattcc aagtggcttt 3541 tttttttttt ggcacgggga ctgatcagga agatatattc ctgcataact caatctgaac 3601 caaggattgt agtttagttt tcctccttgc cttcccttct gtgtgaccga ccccttggcc 3661 aaaaaaaaaa caaaaagcaa aaaacaaaaa cctaccctgt tctggttttt ttcctccctt 3721 tagttccacc cccaaccccc ggaattc // LOCUS HSANAC 797 bp RNA PRI 14-JAN-1997 DEFINITION H.sapiens alpha NAC mRNA. ACCESSION X80909 NID g556641 KEYWORDS nac gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 797) AUTHORS Sakai,H., Chew,C., Wang,S., Wiedmann,B., Geromanos,S., Tempst,P. and Wiedmann,M. TITLE Nascent polypeptide-associate complex (NAC): A novel type of polypeptide binding protein JOURNAL Unpublished REFERENCE 2 (bases 1 to 797) AUTHORS Sakai,H. TITLE Direct Submission JOURNAL Submitted (11-AUG-1994) H. Sakai, Cellular Biochemistry & Biophyscis, Program, Memorial Sloan-Kettering Cancer Center, 1275 York Avenue, New York NY 10021, USA REFERENCE 3 (bases 1 to 797) AUTHORS Yotov,W.V. and St-Arnaud,R. TITLE Mapping of the human gene for the alpha-NAC/1.9.2 (NACA/1.9.2) transcriptional coactivator to Chromosome 12q23-24.1 JOURNAL Mamm. Genome 7 (2), 163-164 (1996) MEDLINE 96432474 FEATURES Location/Qualifiers source 1..797 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="brain" /clone_lib="Stratagene human fetal brain cDNA library" gene 26..673 /gene="alpha NAC" CDS 26..673 /gene="alpha NAC" /codon_start=1 /product="Nascent polypeptide associated complex alpha subunit" /db_xref="PID:g556642" /translation="MPGEATETVPATEQELPQPQAETGSGTESDSDESVPELEEQDST QATTQQAQLAAAAEIDEEPVSKAKQSRSEKKARKAMSKLGLRQVTGVTRVTIRKSKNI LFVITKPDVYKSPASDTYIVFGEAKIEDLSQQAQLAAAEKFKVQGEAVSNIQENTQTP TVQEESEEEEVDETGVEVKDIELVMSQANVSRAKAVRALKNNSNDIVNAIMELTM" BASE COUNT 280 a 160 c 188 g 169 t ORIGIN 1 cttggttccg cgttccctgc acaaaatgcc cggcgaagcc acagaaaccg tccctgctac 61 agagcaggag ttgccgcagc cccaggctga gacagggtct ggaacagaat ctgacagtga 121 tgaatcagta ccagagcttg aagaacagga ttccacccag gcaaccacac aacaagccca 181 gctggcggca gcagctgaaa ttgatgaaga accagtcagt aaagcaaaac agagtcggag 241 tgaaaagaag gcacggaagg ctatgtccaa actgggtctt cggcaggtta caggagttac 301 tagagtcact atccggaaat ctaagaatat actctttgtc atcacaaaac cagatgtcta 361 caagagccct gcttcagata cttacatagt ttttggggaa gccaagatcg aagatttatc 421 ccagcaagca caactagcag ctgctgagaa attcaaagtt caaggtgaag ctgtctcaaa 481 cattcaagaa aacacacaga ctccaactgt acaagaggag agtgaagagg aagaggtcga 541 tgaaacaggt gtagaagtta aggacattga attggtcatg tcacaagcaa atgtgtcgag 601 agcaaaggca gtccgagccc tgaagaacaa cagtaatgat attgtaaatg cgattatgga 661 attaacaatg taaccatatg gaagcaactt tttttggtgt ctcaaaggag taactgcagc 721 ttggtttgaa atttgtactg tttctatcat aaataaagtt atggcttctt gttggaaaaa 781 aaaaaaaaaa aaaaaaa // LOCUS HSAP17 825 bp RNA PRI 30-APR-1996 DEFINITION H.sapiens mRNS for clathrin-associated protein. ACCESSION X97074 NID g1296606 KEYWORDS AP17 gene; clathrin-associated protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 825) AUTHORS Winterpacht,A., Endele,S. and Zabel,B. TITLE Human AP17, a small chain of the clathrin-associated protein complex:cDNA cloning and chromosomal localization JOURNAL Unpublished REFERENCE 2 (bases 1 to 825) AUTHORS Endele,S.U. TITLE Direct Submission JOURNAL Submitted (12-FEB-1996) S.U. Endele, University of Mainz, Childrens Hospital, Langenbeckstrasse 1, D-55101 Mainz, FRG FEATURES Location/Qualifiers source 1..825 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="19" /map="q13.2-qter" /tissue_type="kidney" /dev_stage="adult" gene 115..543 /gene="AP17" CDS 115..543 /gene="AP17" /codon_start=1 /product="clathrin-associated protein" /db_xref="PID:e235833" /db_xref="PID:g1296607" /translation="MIRFILIQNRAGKTRLAKWYMQFDDDEKQKLIEEVHAVVTVRDA KHTNFVEFRNFKIIYRRYAGLYFCICVDVNDNKLAYLEGIHNFVEVLNEYFHNVCELD LVFNFYKVYTVVDEMFLAGEIRETSQTKVLKQLLMLQSLE" BASE COUNT 171 a 242 c 238 g 174 t ORIGIN 1 aagcttgata gcaagttcag cctggttaag tccaagctga attccgtgca ccctgagccg 61 gagctgccca gtcgccgcgg gaccggggcc gctggggtct ggacgggggt cgccatgatc 121 cgctttatcc tcatccagaa ccgggcaggc aagacgcgcc tggccaagtg gtacatgcag 181 tttgatgatg atgagaaaca gaagctgatc gaggaggtgc atgccgtggt caccgtccga 241 gacgccaaac acaccaactt tgtggagttc cggaacttta agatcattta ccgccgctat 301 gctggcctct acttctgcat ctgtgtggat gtcaatgaca acaaactggc ttacctggag 361 ggcattcaca acttcgtgga ggtcttaaac gaatatttcc acaatgtctg tgaactggac 421 ctggtgttca acttctacaa ggtttacacg gtcgtggacg agatgttcct ggctggcgaa 481 atccgagaga ccagccagac gaaggtgctg aaacagctgc tgatgctaca gtccctggag 541 tgagggcagg cgagcacccc accccggccc cggcccctcc tggaatcgcc tgctcgcttc 601 cccttcccag gcccgtggcc aacccagcag tccttccctc aactgcctag gaggaaggga 661 cccagctggg tctgggccac aagggaggag acttcacccc acttcctctg ggccctggct 721 gtgggcagag gccaccgtgt gtgtcccgag taaccgtgcc gttgtcgtgt gattccataa 781 gcgtctgtgc gtggagtccc caataaacct gtggtcctgc ctggc // LOCUS HSAP2BETA 1391 bp RNA PRI 07-OCT-1996 DEFINITION H.sapiens mRNA for AP-2 beta transcription factor. ACCESSION X95694 NID g1495416 KEYWORDS AP-2 beta gene; transcription factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1391) AUTHORS Williamson,J.A., Bosher,J.M., Skinner,A., Sheer,D., Williams,T. and Hurst,H.C. TITLE Chromosomal mapping of the human and mouse homologues of two new members of the AP-2 family of transcription factors JOURNAL Genomics 35 (1), 262-264 (1996) MEDLINE 96299769 REFERENCE 2 (bases 1 to 1391) AUTHORS Hurst,H.C. TITLE Direct Submission JOURNAL Submitted (15-FEB-1996) H.C.Hurst, ICRF Oncology Unit, Gene Transcription Lab, Hammersmith Hospital, Du Cane Road, London W12 0NN, UK REMARK Revised by author 16-APR-96 FEATURES Location/Qualifiers source 1..1391 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="ZR75-1" /cell_type="epithelial" /chromosome="6" /map="p12" /dev_stage="adult" /tissue_type="breast tumour" gene 33..1382 /gene="AP-2 beta" CDS 33..1382 /gene="AP-2 beta" /function="transcription factor" /codon_start=1 /db_xref="PID:e223420" /db_xref="PID:g1495417" /translation="MLWKLVENVKYEDIYEDRHDGVPSHSSRLSQLGSVSQGPYSSAP PLSHTPSSDFQPPYFPPPYQPLPYHQSQDPYSHVNDPYSLNPLHQPQQHPWGQRQRQE VGSEAGSLLPQPRAALPQLSGLDPRRDYHSVRRPDVLLHSAHHGLDAGMGDSLSLHGL GHPGMEDVQSVEDANNSGMNLLDQSVIKKVPVPPKSVTSLMMNKDGFLGGMSVNTGEV FCSVPGRLSLLSSTSKYKVTVGEVQRRLSPPECLNASLLGGVLRRAKSKNGGRSLRER LEKIGLNLPAGRRKAANVTLLTSLVEGEAVHLARDFGYICETEFPAKAVSEYLNRQHT DPSDLHSRKNMLLATKQLCKEFTDLLAQDRTPIGNSRPSPILEPGIQSCLTHFSLITH GFGAPAICAALTALQNYLTEALKGMDKMFLNNTTTNRHTSGEGPGSKTGDKEEKHRK" BASE COUNT 325 a 445 c 362 g 259 t ORIGIN 1 tgcactcacc tcctagagac caggctgcca tcatgctctg gaagcttgtg gagaatgtca 61 agtacgaaga tatctatgag gaccggcacg atggtgtccc gagccacagc tcgcggctct 121 cccagctggg ctcggtgtcc caaggaccct actcgagcgc cccgccgctg tcccacaccc 181 cgtcgtcgga cttccagccg ccctacttcc caccccccta ccagccgctc ccctaccacc 241 agagccagga cccctactcc cacgtcaacg acccctactc cctgaaccca ctgcaccagc 301 cccagcaaca tccctggggg caacggcagc ggcaagaagt gggttcggaa gccggctctc 361 tcctgcccca gcctcgggcc gccttgcccc agctctcggg ccttgacccc cggagggact 421 accactcggt ccgccggccg gacgtgctgc tgcattcggc gcaccacggc ctggacgcgg 481 gcatgggtga cagcctctcg ctgcacggcc tcggccatcc cggaatggaa gacgtccagt 541 cagttgaaga tgccaataac agcggcatga atctattgga ccagtctgtc attaaaaaag 601 ttccagttcc tcccaaatcg gtgacttctc taatgatgaa taaagacggc ttcctgggag 661 gcatgtctgt caacaccggc gaggtgtttt gctccgtccc aggccgtttg tctctgctca 721 gttcaacttc gaagtacaaa gtaactgtgg gagaagttca gagacggctg tcgccccctg 781 aatgcctcaa tgcatctctc ctcggcggag tcctcagaag agccaaatcg aaaaatgggg 841 ggagatcttt gcgagaaagg ctagaaaaaa tcggtttgaa tttacccgcg ggcaggcgca 901 aagcagcaaa tgtcacgtta ctcacctccc tggtggaagg agaagctgtt cacttagcta 961 gggattttgg gtacatttgc gaaacggagt ttcccgccaa agccgtctct gagtatttga 1021 accggcagca cacagacccg agtgacctgc actcccgaaa gaatatgctg ttggccacca 1081 agcaactttg taaagaattt acggatctac tggcgcagga ccggacaccg atagggaaca 1141 gccgacccag ccccatcctg gagccgggga tccagagctg cctcacgcac ttcagcctca 1201 tcacgcacgg cttcggcgcc ccggccattt gcgccgcgct cacggccctg cagaactatc 1261 tcaccgaggc gctcaaaggc atggacaaga tgttcttgaa caacaccacc actaacaggc 1321 acacgtctgg ggaaggccca ggtagtaaaa ctggcgacaa ggaggagaaa cacaggaaat 1381 gaaaaatttt t // LOCUS HSAPHER 582 bp RNA PRI 10-FEB-1997 DEFINITION H.sapiens mRNA for acylphosphatase, erythrocyte (CT) isoenzyme. ACCESSION X84194 NID g1816490 KEYWORDS acylphosphatase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 582) AUTHORS Fiaschi,T., Raugei,G., Marzocchini,R., Chiarugi,P., Cirri,P. and Ramponi,G. TITLE Cloning and expression of the cDNA coding for the erythrocyte isoenzyme of human acylphosphatase JOURNAL FEBS Lett. 367 (2), 145-148 (1995) MEDLINE 95317414 REFERENCE 2 (bases 1 to 582) AUTHORS Raugei,G. TITLE Direct Submission JOURNAL Submitted (27-JAN-1995) G. Raugei, Universita di Firenze, Dipt Scienze Biochimiche, Viale Morgagni 50, 50134 Firenze, ITALY REMARK revised by [3] REFERENCE 3 (bases 1 to 582) AUTHORS Raugei,G. TITLE Direct Submission JOURNAL Submitted (03-FEB-1997) G. Raugei, Universita di Firenze, Dipt Scienze Biochimiche, Viale Morgagni 50, 50134 Firenze, ITALY FEATURES Location/Qualifiers source 1..582 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="lambdaGT11" CDS 69..368 /EC_number="3.6.1.7" /note="erythrocyte (CD) isoenzyme" /codon_start=1 /product="acylphosphatase" /db_xref="PID:e300603" /db_xref="PID:g1834464" /translation="MAEGNTLISVDYEIFGKVQGVFFRKHTQAEGKKLGLVGWVQNTD RGTVQGQLQGPISKVRHMQEWLETRGSPKSHIDKANFNNEKVILKLDYSDFQIVK" BASE COUNT 186 a 96 c 134 g 165 t 1 others ORIGIN 1 ctactcgccg agttccctgt acgtgctgtg tccgatgacc tgcagcgtgg aagacaagag 61 gtttgagcat ggcagaggga aacaccctga tatcagtgga ttatgaaatt tttgggaagg 121 tgcaaggggt gtttttccgt aagcatactc aggctgaggg taaaaagctg ggattggtag 181 gctgggtcca gaacactgac cggggcacag tgcaaggaca attgcaaggt ccaatctcca 241 aggtgcgtca tatgcaggaa tggcttgaaa caagaggaag tcctaaatca cacatcgaca 301 aagcaaactt caacaatgaa aaagtcatct tgaagttgga ttactcagac ttccaaattg 361 taaaataatg gcctgaattt aagttttcta agataaactc agtggtttgg tttttattat 421 taatagagat agaactattg tgtgttaata ttagcattag tcaataagtt attttaatgt 481 cagatttttg aatgttatat atattacctg tatgatggaa ggattaccac tgtacacaaa 541 tctaatcaat aaaaacgtta gaaccttctg cttagagtac an // LOCUS HSAPHMU 772 bp RNA PRI 08-JAN-1998 DEFINITION H.sapiens mRNA for acylphosphatase, muscle type (MT) isoenzyme. ACCESSION X84195 NID g1816492 KEYWORDS acylphosphatase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 239 to 475) AUTHORS Fiaschi,T., Raugei,G., Marzocchini,R., Chiarugi,P., Cirri,P. and Ramponi,G. TITLE Cloning and expression of the cDNA coding for the erythrocyte isoenzyme of human acylphosphatase JOURNAL FEBS Lett. 367 (2), 145-148 (1995) MEDLINE 95317414 REFERENCE 2 (bases 1 to 772) AUTHORS Fiaschi,T., Marzocchini,R., Raugei,G., Veggi,D., Chiarugi,P. and Ramponi,G. TITLE The 5'-untranslated region of the human muscle acylphosphatase mRNA has an inhibitory effect on protein expression JOURNAL FEBS Lett. 417 (1), 130-134 (1997) MEDLINE 98055468 REFERENCE 3 (bases 1 to 772) AUTHORS Raugei,G. TITLE Direct Submission JOURNAL Submitted (27-JAN-1995) G. Raugei, Universita di Firenze, Dipt Scienze Biochimiche, Viale Morgagni 50, 50134 Firenze, ITALY REMARK revised by [3] REFERENCE 4 (bases 1 to 772) AUTHORS Raugei,G. TITLE Direct Submission JOURNAL Submitted (03-FEB-1997) G. Raugei, Universita di Firenze, Dipt Scienze Biochimiche, Viale Morgagni 50, 50134 Firenze, ITALY FEATURES Location/Qualifiers source 1..772 /organism="Homo sapiens" /db_xref="taxon:9606" source 1..240 /note="RACE-PCR" /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="erythroleukemia K562" CDS 176..475 /EC_number="3.6.1.7" /note="muscle type (MT) isoenzyme" /codon_start=1 /product="acylphosphatase" /db_xref="PID:e300055" /db_xref="PID:g1816493" /translation="MSTAQSLKSVDYEVFGRVQGVCFRMYTEDEARKIGVVGWVKNTS KGTVTGQVQGPEDKVNSMKSWLSKVGSPSSRIDRTNFSNEKTISKLEYSNFSIRY" source 241..772 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="heart" /clone_lib="lambdaGT11" BASE COUNT 227 a 180 c 177 g 188 t ORIGIN 1 ccaggcccgc agtctcattt gccgcttccg acgcgtgacc cggcgcgcta gcgtccggac 61 cggtgacagg cgcggggtgc gcaagcagtc ccatgtgtcc cctccctctc gcagccgccg 121 cagtcgctgc gccccgagcc cctctccggc tcctcaacag aggctcgccg ccgccatgtc 181 taccgcccag tcactcaaat ccgtggacta cgaggtgttc ggaagagtgc agggtgtttg 241 cttcagaatg tatacagaag atgaagctag gaaaatagga gtggttggct gggtgaagaa 301 tacaagcaaa ggcaccgtga caggccaagt gcaggggcca gaagacaaag tcaattccat 361 gaagtcctgg ctgagcaagg ttggaagccc tagttctcgc attgaccgca caaacttttc 421 taatgaaaaa accatctcta agcttgaata ctctaatttt agtattagat actaatagaa 481 gagaaattgt aacacactga accatagata ctgtatgctt aagactatgt atacagataa 541 gtagcagagt aggtgaaagg aactttctgt tctgaaagct aagcagctgt acgtctacta 601 aaatgtctga cactgaaata attttactca actatttttc aacaagcaaa tatagtatct 661 aagataatgt cattacaaat attagtgtga cttattactg ctcatgactt taatttcaat 721 gaacattaca gcatatatat gttattggcg gagacatcaa ataaagttaa cg // LOCUS HSAPHOL 2339 bp RNA PRI 22-MAR-1995 DEFINITION Human mRNA for liver-type alkaline phosphatase (EC 3.1.3.1). ACCESSION X14174 NID g28737 KEYWORDS alkaline phosphatase; glycoprotein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2339) AUTHORS Kishi,F. TITLE Direct Submission JOURNAL Submitted (18-JAN-1989) Kishi F., Department of Pediatrics, Yamaguch University School of Medicine, Ube, Yamaguch, 755 Japa REFERENCE 2 (bases 1 to 2339) AUTHORS Kishi,F., Matsuura,S. and Kajii,T. TITLE Nucleotide sequence of the human liver-type alkaline phosphatase cDNA JOURNAL Nucleic Acids Res. 17 (5), 2129 (1989) MEDLINE 89183624 COMMENT The sequence overlaps with that reported by Mitchell et. al. in Proc. Natl. Acad. Sci. USA 83:7182-7186(1986) M14168. Data kindly reviewed (05-Apr-1989) by Kishi F. FEATURES Location/Qualifiers source 1..2339 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /map="p34-p36.1" sig_peptide 401..451 /note="signal peptide (AA -17 to -1)" CDS 401..1975 /note="alkaline phosphatase precursor (AA -17 to 507)" /codon_start=1 /db_xref="PID:g28738" /db_xref="SWISS-PROT:P05186" /translation="MISPFLVLAIGTCLTNSLVPEKEKDPKYWRDQAQETLKYALELQ KLNTNVAKNVIMFLGDGMGVSTVTAARILKGQLHHNPGEETRLEMDKFPFVALSKTYN TKAQVPDSAGTATAYLCGVKANEGTVGVSAATERSRCNTTQGNEVTSILRWAKDAGKS VGIVTTTRVNHATPSAAYAHSADRDWYSDNEMPPEALSQGCKDIAYQLMHNIRDIDVI MGGGRKYMYPKNKTDVEYESDEKARGTRLDGLDLVDTWKSFKPRHKHSHFIWNRTELL TLDPHNVDYLLGLFEPGDMQYELNRNNVTDPSLSEMVVVAIQILRKNPKGFFLLVEGG RIDHGHHEGKAKQALHEAVEMDRAIGQAGSLTSSEDTLTVVTADHSHVFTFGGYTPRG NSIFGLAPMLSDTDKKPFTAILYGNGPGYKVVGGERENVSMVDYAHNNYQAQSAVPLR HETHGGEDVAVFSKGPMAHLLHGVHEQNYVPHVMAYAACIGANLGHCAPASSAGSLAA GPLLLALALYPLSVLF" mat_peptide 452..1972 /note="mature alkaline phosphatase (AA 1-507)" BASE COUNT 558 a 713 c 645 g 423 t ORIGIN 1 aaaaaaataa tggcattatt tgggccactt ggaaaacccg gtggtattcc atgaagaaaa 61 ccactatgaa gataatccca ttcaccagac tcacaattgg agaaggacag caacaccacc 121 tggggggagc caaacaggct ggagacagaa actcccggtg tggcagctga gatggcccag 181 gaaagaacta tattaccttc aaaaagagag gtacatgcga tgtttgaggt ggcatgaagc 241 tcagtggtgt tatattggaa tgagtgagtg accatcctgg agccttcctg aaagaggatt 301 ggaacatcag ttaacatctg accactgcca gcgcaccccc tcccacccac gtcgattgca 361 tctctgggct ccagggataa agcaggtctt ggggtgcacc atgatttcac cattcttagt 421 actggccatt ggcacctgcc ttactaactc cttagtgcca gagaaagaga aagaccccaa 481 gtactggcga gaccaagcgc aagagacact gaaatatgcc ctggagcttc agaagctcaa 541 caccaacgtg gctaagaatg tcatcatgtt cctgggagat gggatgggtg tctccacagt 601 gacggctgcc cgcatcctca agggtcagct ccaccacaac cctggggagg agaccaggct 661 ggagatggac aagttcccct tcgtggccct ctccaagacg tacaacacca aagcccaggt 721 ccctgacagc gccggcaccg ccaccgccta cctgtgtggg gtgaaggcca atgagggcac 781 cgtgggggta agcgcagcca ctgagcgttc ccggtgcaac accacccagg ggaacgaggt 841 cacctccatc ctgcgctggg ccaaggacgc tgggaaatct gtgggcattg tgaccaccac 901 gagagtgaac catgccaccc ccagcgccgc ctacgcccac tcggctgacc gggactggta 961 ctcagacaac gagatgcccc ctgaggcctt gagccagggc tgtaaggaca tcgcctacca 1021 gctcatgcat aacatcaggg acattgacgt gatcatgggg ggtggccgga aatacatgta 1081 ccccaagaat aaaactgatg tggagtatga gagtgacgag aaagccaggg gcacgaggct 1141 ggacggcctg gacctcgttg acacctggaa gagcttcaaa ccgagacaca agcactccca 1201 cttcatctgg aaccgcacgg aactcctgac ccttgacccc cacaatgtgg actacctatt 1261 gggtctcttc gagccggggg acatgcagta cgagctgaac aggaacaacg tgacggaccc 1321 gtcactctcc gagatggtgg tggtggccat ccagatcctg cggaagaacc ccaaaggctt 1381 cttcttgctg gtggaaggag gcagaattga ccacgggcac catgaaggaa aagccaagca 1441 ggccctgcat gaggcggtgg agatggaccg ggccatcggg caggcaggca gcttgacctc 1501 ctcggaagac actctgaccg tggtcactgc ggaccattcc cacgtcttca catttggtgg 1561 atacaccccc cgtggcaact ctatctttgg tctggccccc atgctgagtg acacagacaa 1621 gaagcccttc actgccatcc tgtatggcaa tgggcctggc tacaaggtgg tgggcggtga 1681 acgagagaat gtctccatgg tggactatgc tcacaacaac taccaggcgc agtctgctgt 1741 gcccctgcgc cacgagaccc acggcgggga ggacgtggcc gtcttctcca agggccccat 1801 ggcgcacctg ctgcacggcg tccacgagca gaactacgtc ccccacgtga tggcgtatgc 1861 agcctgcatc ggggccaacc tcggccactg tgctcctgcc agctcggcag gcagccttgc 1921 tgcaggcccc ctgctgctcg cgctggccct ctaccccctg agcgtcctgt tctgagggcc 1981 cagggcccgg gcacccacaa gcccgtgaca gatgccaact tcccacacgg cagccccccc 2041 ctcaaggggc agggaggtgg gggcctcctc agcctctgca actgcaagaa aggggaccca 2101 ggaaaccaaa gtctgccgcc cacctcgctc ccctctggaa tcttccccaa gggccaaacc 2161 cacttctggc ctccagcctt tgctccctcc ccgctgccct ttggccaaca gggtagattt 2221 ctcttgggca ggcagagagt acagactgca gacattctca aagcctctta tttttctagc 2281 gaacgtattt ctccagaccc agaggccctg aagcctccgt ggaacattct ggatctgac // LOCUS HSAPR1 1559 bp RNA PRI 05-DEC-1995 DEFINITION H.sapiens mRNA for ARP1 protein. ACCESSION X91504 NID g1103581 KEYWORDS arp1 gene; GTPase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1559) AUTHORS Joost,H. TITLE Direct Submission JOURNAL Submitted (14-SEP-1995) H. Joost, Inst.f.Pharmakologie und Toxikologie, der RWTH, Wendlingweg 2, D- 52057 Aachen, FRG REFERENCE 2 (bases 1 to 1559) AUTHORS Schurmann,A., Massmann,S. and Joost,H.G. TITLE ARP is a plasma membrane-associated Ras-related GTPase with remote similarity to the family of ADP-ribosylation factors JOURNAL J. Biol. Chem. 270 (51), 30657-30663 (1995) MEDLINE 96107227 FEATURES Location/Qualifiers source 1..1559 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="brain" /clone_lib="lambda zap" gene 12..617 /gene="arp1" CDS 12..617 /gene="arp1" /codon_start=1 /product="GTPase" /db_xref="PID:e198926" /db_xref="PID:g1103582" /translation="MYTLLSGLYKYMFQKDEYCILILGLDNAGKTTFLEQSKTRFNKN YKGMSLSKITTTVGLNIGTVDVGKARLMFWDLGGQEELQSLWDKYYAECHGVIYVIDS TDEERLAESKQAFEKVVTSEALCGVPVLVLANKQDVETCLSIPDIKTAFSDCTSKIGR RDCLTQACSALTGKGVREGIEWMVKCVVRNVHRPPRQRDIT" BASE COUNT 291 a 425 c 529 g 314 t ORIGIN 1 cgcagggcag gatgtacacg ctgctgtcgg gcttgtacaa gtacatgttt cagaaggacg 61 agtactgcat cctgatcctg ggcctggaca atgctgggaa gacgaccttc ctggagcagt 121 cgaaaacccg atttaacaag aactacaagg ggatgagtct atccaaaatc accaccaccg 181 tgggcctaaa catcggcact gtggatgtgg gaaaggctcg gctcatgttc tgggacttag 241 gagggcagga agagctgcag tctttgtggg acaagtatta tgcggagtgt cacggcgtca 301 tctacgtcat tgactccacc gacgaggaga ggctggctga gtccaagcag gcgtttgaga 361 aggtggtgac cagcgaggcg ctgtgcggtg tccccgtctt ggtgctggcc aacaagcagg 421 atgtggagac gtgcctctca atccctgaca tcaagacggc cttcagcgac tgcaccagca 481 agatcggcag gcgcgattgc ctgacccagg cctgctcggc cctcacaggc aaaggggtgc 541 gcgagggcat cgagtggatg gtgaagtgtg tcgtgcggaa tgtgcaccgg ccgccgcggc 601 agagggacat cacgtagcgg cagccgcgct gccgtcggga cggctggtcc cctggtgctg 661 gaggagtggc ctcctgttgg ctcccatgct gctgatctgg ggggtgggtt tgctttgctt 721 tggggttctt ctatttactt tgttttctcg aagacaaact ttcctctatg tctggaaaag 781 cgtaggcatc cggaggcttt ggaggggagt ctggcagccc ggctggccca ggccctgcag 841 cggcagcctt tccacagggc gcagcggcgg cctttcgagg ccctttctgg ggggtctgag 901 ggagacctgg ttgggaattg gggctccagt gctcaggctg gcttgggctg catgaggaca 961 gccctgtggg accctcggga gaccccgtgg ctgtctccgc cccatcgagg aggaggcccg 1021 tcagccatgg ctgccatctg gcttctgccc tgtgaccccg tgaccccgga agtggtctgg 1081 ggctgatctt gccttgagga agacccaggc catgttccca aaggccagcg ggggccctgg 1141 attgtgatgc agcctcggga cagggctgag gcctgcgggg gaagacctat accccacgcc 1201 tgggcctggc ttcacctcac cctaatcccc cgggagggag ctgactgatg caaaaagctg 1261 agggggcctg ctgggagtgg ctgtttttat gccccagccc cgcaagttgg ggagtgtttg 1321 tgggggtcca gagccctccc ccagccagga gagaacctcc cggaggggtt ctctgtgggg 1381 ccctgtgtcc cctgctcggg agtaaggctg gtcctggggt cctccctgca cggaccccac 1441 tgggcctgcc gagtgctgtg ttcttcctca gtctggctgt gggcaggagc ggcctgccca 1501 gtgtcaccca gggtgagtgc aaaataaaga cggcgagtgt gaaaaaaaaa aaaaaaaaa // LOCUS HSAPRIL 1371 bp RNA PRI 03-APR-1997 DEFINITION H.sapiens mRNA for APRIL protein. ACCESSION Y07969 NID g1552325 KEYWORDS acidic protein; APRIL protein; Leucine rich. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1371) AUTHORS Mencinger,M. TITLE Direct Submission JOURNAL Submitted (11-SEP-1996) M. Mencinger, University Hospital Of Lund, Department Of Clinical Genetics, Lund, S-221 85, SWEDEN REFERENCE 2 (bases 1 to 1371) AUTHORS Mencinger,M., Panagopoulos,I., Contreras,J.A., Mitelman,F. and Aman,P. TITLE Characterization and chromosomal mapping of a novel gene APRIL, encoding acidic protein rich in leucines JOURNAL Unpublished FEATURES Location/Qualifiers source 1..1371 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="pancreas" /chromosome="15" /map="q25" CDS 231..980 /note="acidic protein rich in leucines" /codon_start=1 /product="APRIL" /db_xref="PID:e266645" /db_xref="PID:g1552326" /translation="MKRRIHLELRNRTPAAVRELVLDNCKSNDGKIEGLTAEFVNLEF LSLINVGLISVSNLPKLPKLKKLELSENRIFGGLDMLAEKLPNLTHLNLSGNKLKDIS TLEPLKKLECLKSLDLFNCEVTNLNDYRESVFKLLPQLTYLDGYDREDQEAPDSDAEV DGVDEEEEDEEGEDEEDEDDEDGEEEEFDEEDDEDEDVEGDEDDDEVSEEEEEFGLDE EDEDEDEDEEEEEGGKGEKRKRETDDEGEDD" polyA_signal 1341..1346 polyA_site 1358 BASE COUNT 428 a 280 c 364 g 299 t ORIGIN 1 tctggttcgg cccacctctg aaggttccag aatcgatagt ggagtaactt ggctccgggg 61 gctccgctcg cctgcccaca cgccgcccgc cacccaggac cgcgccgccg gcctccgccg 121 ctagcaaacc cttccgacgg ccctcgctgc gcaagccggg acgcctctcc cccctccgcc 181 cccgccgcgg aaagttaagt ttgaagaggg gggaagaggg gaacatggac atgaagagga 241 ggatccacct ggagctgagg aaccggaccc cggcagctgt tcgagaactt gtcttggaca 301 attgcaaatc aaatgatgga aaaattgagg gcttaacagc tgaatttgtg aacttagagt 361 tcctcagttt aataaatgta ggcttgatct cagtttcaaa tctccccaag ctgcctaaat 421 tgaaaaagct tgaactcagt gaaaatagaa tctttggagg tctggacatg ttagctgaaa 481 aacttccaaa tctcacacat ctaaacttaa gtggaaataa actgaaagat atcagcacct 541 tggaaccttt gaaaaagtta gaatgtctga aaagcctgga cctctttaac tgtgaggtta 601 ccaacctgaa tgactaccga gagagtgtct tcaagctcct gccccagctt acctacttgg 661 atggctatga ccgagaggac caggaagcac ctgactcaga tgccgaggtg gatggtgtgg 721 atgaagagga ggaggacgaa gaaggagaag atgaggaaga cgaggacgat gaggatggtg 781 aagaagagga gtttgatgaa gaagatgatg aagatgaaga tgtagaaggg gatgaggacg 841 acgatgaagt cagtgaggag gaagaagaat ttggacttga tgaagaagat gaagatgagg 901 atgaggatga agaggaggaa gaaggtggga aaggtgaaaa gaggaagaga gaaacagatg 961 atgaaggaga agatgattaa gaccccagat gacctgcaga aacagaactg ttcagtattg 1021 gttggactgc tcatggattt tgtagctgtt taaaaaaaaa aaaggtagct gtgatacaaa 1081 ccccaggaca cccacccacc caaagagcca aagaatagtt cctgtgacat tccgccttcc 1141 ttccatgtag tccctcttgg taatctacca ccaagcttgt ggacttcacc ccaacaaaat 1201 tgtaagcgtt gttaggtttt tgtgtaagat tcttgctgta gcgtggatag ctgtgattgg 1261 tgagtcaacc gtctgtggct accagttaca ctgagattgt aacagcattt ttactttctg 1321 tacaacaaaa aagctttgta aataaaatct taacatttaa aaaaaaaaaa a // LOCUS HSAPXL 7445 bp RNA PRI 10-DEC-1995 DEFINITION H.sapiens APXL mRNA. ACCESSION X83543 NID g790999 KEYWORDS APXL gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7445) AUTHORS Schiaffino,M.V., Bassi,M.T., Rugarli,E.I., Renieri,A., Galli,L. and Ballabio,A. TITLE Cloning of a human homologue of the Xenopus laevis APX gene from the ocular albinism type 1 critical region JOURNAL Hum. Mol. Genet. 4 (3), 373-382 (1995) MEDLINE 95315933 REFERENCE 2 (bases 1 to 7445) AUTHORS Schiaffino,V.M. TITLE Direct Submission JOURNAL Submitted (19-DEC-1994) V.M. Schiaffino, T.I.G.E.M., via Olgettina 58, 20132 Milano, ITALY FEATURES Location/Qualifiers source 1..7445 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="retina" /chromosome="22" /map="Xp22.2-22.3" gene 91..4941 /gene="APXL" CDS 91..4941 /gene="APXL" /codon_start=1 /db_xref="PID:g1181628" /translation="MEGAEPRARPERLAEAETRAADGGRLVEVQLSGGAPWGFTLKGG REHGEPLVITKIEEGSKAAAVDKLLAGDEIVGINDIGLSGFRQEAICLVKGSHKTLKL VVKRRSELGWRPHSWHATKFSDSHPELAASPFTSTSGCPSWSGRHHASSSSHDLSSSW EQTNLQRTLDHFSSLGSVDSLDHPSSRLSVAKSNSSIDHLGSHSKRDSAYGSFSTSSS TPDHTLSKADTSSAENILYTVGLWEAPRQGGRQAQAAGDPQGSEEKLSCFPPRVPGDS GKGPRPEYNAEPKLAAPGRSNFGPVWYVPDKKKAPSSPPPPPPPLRSDSFAATKSHEK AQGPVFSEAAAAQHFTALAQAQPRGDRRPELTDRPWRSAHPGSLGKGSGGPGCPQEAH ADGSWPPSKDGASSRLQASLSSSDVRFPQSPHSGRHPPLYSDHSPLCADSLGQEPGAA SFQNDSPPQVRGLSSCDQKLGSGWQGPRPCVQGDLQAAQLWAGCWPSDTALGALESLP PPTVGQSPRHHLPQPEGPPDARETGRCYPLDKGAEGCSAGAQEPPRASRAEKASQRLA ASITWADGESSRICPQETPLLHSLTQEGKRRPESSPEDSATRPPPFDAHVGKPTRRSD RFATTLRNEIQMHRAKLQKSRSTVALTAAGEAEDGTGRWRAGLGGGTQEGPLAGTYKD HLKEAQARVLRATSFKRRDLDPNPGDLYPESLEHRMGDPDTVPHFWEAGLAQPPSSTS GGPHPPRIGGRRRFTAEQKLKSYSEPEKMNEVGLTRGYSPHQHPRTSEDTVGTFADRW KFFEETSKPVPQRPAQKQALHGIPRDKPERPRTAGRTCEGTEPWSRTTSLGDSLNAHS AAEKAGTSDLPRRLGTFAEYQASWKEQRKPLEARSSGRCHSADDILDVSLDPQERPQH VHGRSRSSPSTDHYKQEASVELRRQAGDPGEPREELPSAVRAEEGQSTPRQADAQCRE GSPGSQQHPPSQKAPNPPTFSELSHCRGAPELPREGRGRAGTLPRDYRYSEESTPADL GPRAQSPGSPLHARGQDSWPVSSALLSKRPAPQRPPPPKREPRRYRATDGAPADAPVG VLGRPFPTPSPASLDVYVARLSLSHSPSVFSSAQPQDTPKATVCERGSQHVSGDASRP LPEALLPPKQQHLRLQTATMETSRSPSPQFAPQKLTDKPPLLIQDEDSTRIERVMDNN TTVKMVPIKIVHSESQPEKESRQSLACPAEPPALPHGLEKDQIKTLSTSEQFYSRFCL YTRQGAEPEAPHRAQPAEPQPLGTQVPPEKDRCTSPPGLSYMKAKEKTVEDLKSEELA REIVGKDKSLADILDPSVKIKTTMDLMEGIFPKDEHLLEEAQQRRKLLPKIPSPRSTE ERKEEPSVPAAVSLATNSTYYSTSAPKAELLIKMKDLQEQQEHEEDSGSDLDHDLSVK KQELIESISRKLQVLREARESLLEDVQANTVLGAEVEAIVKGVCKPSEFDKFRMFIGD LDKVVNLLLSLSGRLARVENALNNLDDGASPGDRQSLLEKQRVLIQQHEDAKELKENL DRRERIVFDILANYLSEESLADYEHFVKMKSALIIEQRELEDKIHLGEEQLKCLLDSL QPERGK" BASE COUNT 1705 a 2198 c 2087 g 1455 t ORIGIN 1 tctgcggcgc tcggagcctc ccttgcgatc ccacggccgg gactgcccgg agtgcatggg 61 cgcgggccag ggacgctgag cggtcgcgcc atggagggcg ccgagccccg cgcgcggccc 121 gagcgcctgg ccgaggccga gacgcgggcg gcggacggcg ggcgcctggt ggaggtgcag 181 ctgagcggcg gcgccccgtg gggcttcacc ctgaagggcg gccgcgagca cggcgagccg 241 ctggtcatca ccaagattga agagggcagt aaagccgcgg cggtcgacaa gttactggct 301 ggagatgaga tcgtcggcat caatgacatt ggtctctcag ggtttagaca ggaagcgatt 361 tgcctggtga aggggtccca taagaccctg aagctggtcg tcaaaaggag gagcgagctg 421 ggctggaggc ctcactcctg gcatgccacc aagttctctg acagccaccc cgagctagcg 481 gcctccccgt tcacctccac cagcggctgt ccttcctggt ccggccgaca ccacgcgagt 541 tcttcctccc acgacctgtc cagttcctgg gagcagacga acctacagcg caccttagat 601 cacttcagct ccttggggag cgttgacagc ctggaccacc cctccagtcg cctctcggtg 661 gccaagtcca acagcagcat cgaccacctg ggcagccaca gcaagcgcga ctcggcctac 721 ggctccttct ccaccagctc tagcactcct gaccacacct tgtccaaagc cgacacgtcc 781 tccgcagaga acatcctcta cactgtgggc ctctgggagg ctcccaggca gggtggccgg 841 caggcccagg ccgcaggcga ccctcagggc tcggaggaga agctcagttg tttcccgccc 901 agggtccccg gtgacagcgg caaaggcccc aggccagagt acaatgccga gcccaagctg 961 gctgcccctg ggaggtccaa ttttgggcca gtctggtatg ttcccgataa gaagaaagca 1021 ccatcatccc cacctcctcc ccctccccct ctccgcagtg acagctttgc tgccaccaag 1081 agccacgaga aggcccaggg ccctgtgttc tcagaggcgg ctgcggcaca gcactttacg 1141 gccctggccc aggctcagcc tcgtggtgac cggagaccag agctcaccga tcggccttgg 1201 aggtcagcac acccggggag cctcgggaag ggatcgggag gcccgggctg cccacaggag 1261 gcccacgcag acggcagctg gccgccctcc aaggatggag cttccagtag gctgcaggcc 1321 tctctgtcca gctcagatgt gcgcttccct cagtctcctc atagcggccg acaccctccc 1381 ctatacagcg accacagccc cctctgtgct gacagccttg ggcaggagcc aggggctgcc 1441 agcttccaga acgacagccc tcctcaggtg agggggctca gcagctgtga ccagaagctg 1501 gggagcggct ggcagggtcc ccggccctgt gtgcagggag acctgcaagc agcacagctc 1561 tgggcgggat gctggccttc tgacacagcc cttggagccc tcgagagtct tcccccaccc 1621 acggtgggcc agagcccacg ccatcaccta cctcagcctg agggtcctcc ggatgcccgc 1681 gagacaggac ggtgttaccc gctggacaaa ggggccgagg gctgctccgc gggagcccag 1741 gagcctccca gggccagccg tgcagaaaaa gccagccaga ggctggcagc cagcatcacg 1801 tgggcagatg gggagagcag caggatctgc ccgcaggaga cgcccctgtt gcactccctg 1861 acccaggagg ggaagcgccg gcctgagagc agtccagagg acagcgccac cagaccgcca 1921 ccgttcgacg cccacgtggg caagcccacc cgaagaagcg accgctttgc caccaccctg 1981 cggaatgaga tccagatgca tagagccaag ctgcagaaga gccggagcac agtggctctg 2041 actgcagcag gggaggcgga ggatggcacc ggccgctgga gggccgggtt gggaggtggc 2101 acccaggaag gacccctcgc tggcacctat aaagaccacc tgaaagaggc ccaagcccgg 2161 gtcctgaggg ccacgtcctt caagcgccgc gacttggacc ccaacccagg agacctatac 2221 ccggagtcac tggaacaccg gatgggggat ccagacactg tcccccactt ctgggaggca 2281 ggcctggccc agccaccctc atctacaagt ggcgggcccc acccgccccg catcggaggc 2341 cggagacggt tcacagctga gcagaaattg aagtcctact cggaacctga gaagatgaac 2401 gaggtgggcc tcacgagggg ctacagtcct caccagcacc ccaggacatc tgaggatact 2461 gtgggcacgt ttgctgacag gtggaagttt tttgaggaaa cgagcaaacc tgttccccag 2521 aggcctgccc agaagcaagc tcttcacgga atcccgagag acaagccaga gaggccgcgg 2581 acagcgggcc gcacatgtga gggcacggag ccctggtcgc gcaccacctc ccttggggac 2641 agcctcaacg ctcacagcgc agcggagaag gcagggactt cagacctgcc gcggaggctc 2701 ggcacctttg cagagtatca ggcctcttgg aaggaacaga ggaaacctct ggaggccagg 2761 agctctgggc gctgccactc agcggatgac atcctggatg tgagcctgga cccacaggag 2821 aggccgcagc acgttcatgg gaggtcccgg tcttcaccgt ccacagacca ctacaagcag 2881 gaagcttctg tcgaactgcg aaggcaggca ggggaccccg gcgagcccag agaagagctt 2941 ccctccgcag tccgggccga ggagggacag tccacgccga gacaagcaga tgcccagtgt 3001 cgggaaggca gcccaggatc acagcagcac ccaccgagtc agaaggcacc gaacccaccc 3061 acattctctg aactatctca ctgccgggga gccccagagc tgccccggga gggccggggc 3121 cgagcgggaa ccctacctcg agattataga tactcggagg agagcacccc agcagacttg 3181 ggaccccgag cccagagccc tggctcaccc ctgcatgctc gaggacaaga ctcgtggcca 3241 gtgagctcag ccctgctctc caagaggcca gccccacaga ggccaccgcc acccaagcgc 3301 gagcccagga gatacagggc cacagacggc gcacctgctg acgcccccgt gggcgtcctc 3361 ggcaggccct tcccaacgcc atcccctgcg tccctggatg tgtatgtggc ccgcctgtcc 3421 ctctcccaca gcccctctgt gttcagcagt gcccagcccc aggacacccc gaaggccact 3481 gtctgtgagc gtggaagcca gcatgtgagc ggggacgcat cacgtcctct gccagaagca 3541 ctgctccctc ccaagcagca gcacctgcgc ctgcagacgg ccaccatgga gacctcgcgc 3601 tccccctcgc cccagttcgc cccccagaaa ctgacggaca aacctcccct gctcatccag 3661 gatgaggatt caaccagaat tgagcgggtg atggacaaca acaccacggt gaagatggtg 3721 cccatcaaga tcgtgcactc ggagagccag ccagagaagg agagccgcca gagcctggca 3781 tgccccgccg agccacctgc cctgccccac gggctggaga aagaccagat caagacgctg 3841 agcacatctg agcagttcta ctcgcgcttc tgtctgtaca cgcggcaggg tgctgagccc 3901 gaggccccac atagggccca gccggctgag ccccagcccc tgggcaccca ggtgcccccc 3961 gagaaagacc gctgcacctc ccctccaggg ctcagctaca tgaaggccaa agagaagact 4021 gtggaagacc tgaagtcgga ggagctggcc agggagatcg tggggaagga taagtccctg 4081 gccgacatcc tggatcccag tgtgaagatc aaaaccacta tggacttgat ggaaggcatc 4141 ttccccaaag acgagcacct cctggaagaa gcccagcaac ggaggaagct gctccccaaa 4201 atcccctctc ctagaagcac agaggagagg aaagaggagc ccagcgtgcc tgcggccgtg 4261 tccctggcca ccaattctac ctactacagc acgtcggccc ccaaggcgga gctgctgatc 4321 aagatgaagg acctgcagga gcagcaggag cacgaagagg attcgggaag cgacttggac 4381 cacgacctgt cggtgaagaa gcaggagctc atcgagagca tcagccgcaa gctgcaggtg 4441 ctccgggagg cccgcgagag cctgctggag gacgtgcagg ccaacaccgt gctgggggcc 4501 gaggtggagg ccatcgtgaa aggcgtctgc aagcccagcg agtttgacaa gttccggatg 4561 ttcattggag acctggacaa agtggtgaac ctcctgctgt cgctgtcagg ccgcctggcc 4621 cgggtggaga atgccctcaa taatttggac gacggcgctt ctcccggtga tcggcaatca 4681 ctgcttgaga agcagagagt cctgatccag cagcacgagg acgccaagga gctcaaggag 4741 aacctggacc gccgcgagcg catcgtcttt gacattttgg ccaactatct gagcgaggag 4801 agcctcgcgg actatgagca cttcgtgaag atgaagtcgg ccctcatcat cgagcagcgg 4861 gagctggaag ataaaatcca ccttggtgaa gagcagctga agtgcttatt ggacagcctt 4921 cagcccgaaa ggggcaaata agagaccagt ccccggtgga ggaggggcac ggggcctccg 4981 agctccagct ccgttcccaa ggatactcgt gaagacccca tctgtgttca tggcctggaa 5041 agagacttct cccatagcaa agaggctgtt ataaaagcaa taacttttgt gtttgtgtgg 5101 gatgatttat ttaatttttt agtttcccct ttgattgctg agagccattt tcctttacac 5161 ataactacac ctgacaccag gctctgctgg atgtgagttt ccactgcatg ggctgtgggc 5221 tgggcctgtg gtgcctgccg agtggtcact gtcagtggga aacccgttgt tcctcccgtc 5281 ttcagatgct gagccaactg cttggacagc agccagcgcg tcatgacgtg catgagaggg 5341 ggaccctggt gctcatcttc tcttgtcatt catccaggca tgggctgcca ggttttgtcc 5401 ctgctcgttc aacagtgtga gcatttgtct ctgttatcta atgatgttct ctgacccagc 5461 agaaatcatc atcatgatga tgataattta ttaacttttt ggaagggtga atagtttcct 5521 aatggttaaa aaccaactgt gaaaggaacc acctgtgtgg ttgggttcac tcattctcag 5581 attaaattgc cacttaaaga aataacgtgc atgctttaaa aaacacagtc acgcaccaag 5641 caggcaaata gctttagtcc ttctcacctc acatcacagt tgttctgcaa agtaaaattt 5701 tttggttaag agcgtgtcca gtagtaatgt gcttgttagc tgtttctcaa gaccaacaga 5761 agattttttc agttactttc cccccatgta ttttgtatgc atatgattgt ccgtgataat 5821 tggctacttt tccattgttt cctccttaaa tcgtttagca tggcatgagg gccacattcc 5881 atggacggga agaccccttc ctcttcagag gtcccgtgga ctacacagct cctgagcttg 5941 atctttttct gccatgaagt ttaaagattc tatgcccatt tccttgattg aaatggcagg 6001 attctaaaga gagcctggtt tgttaaaaga aaacactgtc atgctgtcag ttcccaattg 6061 acaagtcaca gactgggaga aaatatttgc aaatcgtgta tctgacaaaa ggtttgtgtc 6121 caggatgtac aaagaactct caaaccggat agtaagaaaa caaacagccc aagtgaaaag 6181 caggcaaaag acttgaatag acacttcacc aaagagcata cacgcgtggc aaacaagcac 6241 acgaaaagac gttcagccgc cgatggcttg gttataattt ataacttact tatttttatc 6301 taataattgt agattcagtg tatttcttca aaaaatgttt aattaaatgc atgttaatgg 6361 tgagtgaatc ccttgggtga cttcgtgttt aggtcgtatt agggcatttg ttggatcaac 6421 ggatcatttt aaccctgact tccccttatt cccataaaag aagttttcca gtggaatgga 6481 gatttcattt tgtcagcagc agtgaccaca gccttaccaa agcagacgcg tgcgcgtgca 6541 cagatgcaca cacacagatg tcttaaaaga ctagaatcca cacttcctga gccagagggg 6601 ccgtgttgac ggtaatgcat tctctataga gccaagtcca aactggcaag ctcaatgatg 6661 caggcaataa accgcctttt tggcagccta ccaatgccaa aaggataaat gtctttccaa 6721 aagtgtgtat tcctgttaaa ttaagctctt gctaacttga aaaatccctg ttctgccagc 6781 gaagcttcct cctcctctcc agctggtagt cacttgcgtg aatgctggtc agtctgaaaa 6841 ggtgaagctg gctgtgcact tacccccatc tttctccctc ggggagacga cccaaggaat 6901 ttcagagtat tttgtttggc agagctttta cctgttattc tttgccctca aatacagtat 6961 tgtggtcatt ttgatgatat gtgtgtaaaa tgtgaataat ccaattggtg tctgtactca 7021 gccttttgat gtctttttag gactttctct tctacacagc aatacgtcgt gctcgagtat 7081 ccttgtagca aagcacatag agccagctgt cctgtcagtt cccctgtttg cctctgaaac 7141 gtctggttag tggggaccca aagattctag tgagtcaaca tccataactc tgtatctagt 7201 tgtattattc atagaaaatc aatctggtgc taatggttgg ccctggtgtt gttgggtggc 7261 agctgctcct tcgccctctt gtagtgtggc tgtggagggc tctgcctatg gggggtggcc 7321 tgtggcttgt atccttcagt ccaccacagc aaatgtgtgt agatttcatg ctcgacactt 7381 accactcacc tatcaacaga tcatcctgct tgactgtaac aaaataaata gtgtctcttc 7441 aagtg // LOCUS HSARAPROT 1936 bp RNA PRI 27-OCT-1997 DEFINITION H.sapiens mRNA for anthracycline resistance associated protein. ACCESSION X95715 NID g1279456 KEYWORDS anthracycline resistance associated protein; ara gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1936) AUTHORS Longhurst,T.J., O'Neill,G.M., Harvie,R.M. and Davey,R.A. TITLE The anthracycline resistance-associated (ara) gene, a novel gene associated with multidrug resistance in a human leukaemia cell line JOURNAL Br. J. Cancer 74 (9), 1331-1335 (1996) MEDLINE 97069542 REFERENCE 2 (bases 1 to 1936) AUTHORS Davey,R. TITLE Direct Submission JOURNAL Submitted (16-FEB-1996) R. Davey, Royal North Shore Hospital, Bill Walsh Cancer Research Laboratories, St Leonards, NSW, 2065, AUSTRALIA FEATURES Location/Qualifiers source 1..1936 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="CCRF-CEM/E1000" gene 116..1477 /gene="ara" CDS 116..1477 /gene="ara" /codon_start=1 /product="anthracycline resistance associated protein" /db_xref="PID:e223411" /db_xref="PID:g1279457" /translation="MALRGFCSRWLRPALAIGLFASMAAVLLGGARASRLLFQRLLWD VVRSPISFFERTPIGHLLNRFSKETDTVDVDIPDKLRSLLMYAFGLLEVSLVVEWPTP LPLWPSCHCFSSTLGFRWLAANVELLGNGLVFAAATCAVLSKAHLSAGLVGFSVSAAL QVTQTLQWVVRNWTDLENSIVSVERMQDYAWTPKEAPWRLPTCAAQPPWPQGGQIEFR DFGLRYRPELPLAVQGVSFKIHAGEKVGIVGRTGAGKSSLASGLLRLQEAAEGGIWID GVPIAHVGVHTLRSRISIIPQDPILFPGSLRMNLDLLQEHSDEAIWAALETVQLKALV ACLPGQLQYKCADRGEDLSVGQKQLLCLARALLRKTQILILDEATAAVDPGTELQMQA MLGSWFAQCTVLLIAHRLRSVMDCARVLVMDKGQVAESGSPAQLLAQKGLFYRLAQES GLV" BASE COUNT 369 a 605 c 585 g 377 t ORIGIN 1 agcgctagcg cagcagccgg gcccgatcac ccgccgcccg gtgcccgccg ccgcccgcgc 61 cagcaaccgg gcccgatcac ccgccgcccg gtgcccgccg ccgcccggca ccggcatggc 121 gctccggggc ttctgcagcc gatggctccg acccgctctg gccattgggc tgtttgcctc 181 catggctgcg gtgctcctag gtggggcccg ggcatccagg ttgctcttcc agaggctcct 241 gtgggatgtg gtgcgatctc ccatcagctt ctttgagcgg acacccattg gtcacctgct 301 aaaccgcttc tccaaggaga cagacacggt tgacgtggac attccagaca aactccggtc 361 cctgctgatg tacgcctttg gactcctgga ggtcagcctg gtggtggagt ggcctacccc 421 actgccactg tggccatcct gccactgttt ctcctctacg ctgggtttca ggtggcttgc 481 ggccaatgtg gagctcctgg ggaatggcct ggtgtttgca gctgccacgt gtgctgtgct 541 gagcaaagcc cacctcagtg ctggcctcgt gggcttctct gtctctgctg ccctccaggt 601 gacccagaca ctgcagtggg ttgttcgcaa ctggacagac ctagagaaca gcatcgtgtc 661 agtggagcgg atgcaggact atgcctggac gcccaaggag gctccctgga ggctgcccac 721 atgtgcagct caacccccct ggcctcaggg cgggcagatc gagttccggg actttgggct 781 aagataccga cctgagctcc cgctggctgt gcagggcgtg tccttcaaga tccacgcagg 841 agagaaggtg ggcatcgttg gcaggaccgg ggcagggaag tcctccctgg ccagtgggct 901 gctgcggctc caggaggcag ctgagggtgg gatctggatc gacggggtcc ccattgccca 961 cgtgggcgtg cacacactgc gctccaggat cagcatcatc ccccaggacc ccatcctgtt 1021 ccctggctct ctgcggatga acctcgacct gctgcaggag cactcggacg aggctatctg 1081 ggcagccctg gagacggtgc agctcaaagc cttggtggcc tgcctgcccg gccagctgca 1141 gtacaagtgt gctgaccgag gcgaggacct gagcgtgggc cagaaacagc tcctgtgtct 1201 ggcacgtgcc cttctccgga agacccagat cctcatcctg gacgaggcta ctgctgccgt 1261 ggaccctggg acggagctgc agatgcaggc catgctcggg agctggtttg cacagtgcac 1321 tgtgctgctc attgcccacc gcctgcgctc cgtgatggac tgtgcccggg ttctggtcat 1381 ggacaagggg caggtggcag agagcggcag cccggcccag ctgctggccc agaagggcct 1441 gttttacaga ctggcccagg agtcaggcct ggtctgagcc aggaccctca accgtacccc 1501 agttggacca gcccgcacag cctgcagtgc tggagatgga agtgacccgt tggtcatcga 1561 tagctccaca cgatattgag tctagacctg tgtttggtct ctgggaggga aaatggcaga 1621 gaaagtggcc aattatcaca gagcatcaga gccggaagga cctagcaata cacaggtctg 1681 ccggcaggcc catctcgccc tgtccaccct gcagccaatg tcaacagcga ctctcagccc 1741 cgctgtactc tggactcacc tgggggcctc aagcacatgc ccaggctccc ggctagccct 1801 taaatcagaa tctctgaggc tgggaactgc catgctgtgt gtacttttta caaattaaca 1861 cttttatttt gggataatcc cagactcaca tgcagttaaa gaaacaataa tataaaaaaa 1921 aaaaaaaaaa aaaaaa // LOCUS HSARCP2 1998 bp RNA PRI 02-MAY-1995 DEFINITION H.sapiens mRNA (clone pPH2) for archain. ACCESSION X81197 NID g773572 KEYWORDS open reading frame. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1998) AUTHORS Radice,P., Pensotti,V., Jones,C., Perry,H., Pierotti,M.A. and Tunnacliffe,A. TITLE The human archain gene, ARCN1, has highly conserved homologs in rice and Drosophila JOURNAL Genomics 26 (1), 101-106 (1995) MEDLINE 95301274 REFERENCE 2 (bases 1 to 1998) AUTHORS Tunnacliffe,A. TITLE Direct Submission JOURNAL Submitted (22-AUG-1994) A. Tunnacliffe, Quadrant Res Foundation, Maris Lane, Trumpington, Cambridge CB2 2SY, UK FEATURES Location/Qualifiers source 1..1998 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pPH2" /chromosome="11" /map="11q23" CDS 118..1653 /codon_start=1 /product="archain" /db_xref="PID:g773573" /db_xref="SWISS-PROT:P48444" /translation="MVLLAAAVCTKAGKAIVSRQFVEMTRTRIEGLLAAFPKLMNTGK QHTFVETESVRYVYQPMEKLYMVLITTKNSNILEDLETLRLFSRVIPEYCRALEENEI SEHCFDLIFAFDEIVALGYRENVNLAQIRTFTEMDSHEEKVFRAVRETQEREAKAEMR RKAKELQQARRDAERQGKKAPGFGGFGSSAVSGGSTAAMITETIIETDKPKVAPAPAR PSGPSKALKLGAKGKEVDNFVDKLKSEGETIMSSSMGKRTSEATKMHAPPINMESVHM KIEEKITLTCGRDGGLQNMELHGMIMLRISDDKYGRIRLHVENEDKKGVQLQTHPNVD KKLFTAESLIGLKNPEKSFPVNSDVGVLKWRLQTTEESFIPLTINCWPSESGNGCDVN IEYELQEDNLELNDVVITIPLPSGVGAPVIGEIDGEYRHDSRRNTLEWCLPVIDAKNK SGSLEFSIAGQPNDFFPVQVSFVSKKNYCNIQVTKVTQVDGNSPVRFSTETTFLVDKY EIL" BASE COUNT 612 a 429 c 480 g 477 t ORIGIN 1 cgggcggttc ctgtcaaggg ggcagcaggt ccagagctgc tggtgctccc gttccccaga 61 ccctacccct atccccagtg gagccggagt gcggcgcgcc ccaccaccgc cctcaccatg 121 gtgctgttgg cagcagcggt ctgcacaaaa gcaggaaagg ctattgtttc tcgacagttt 181 gtggaaatga cccgaactcg gattgagggc ttattagcag cttttccaaa gctcatgaac 241 actggaaaac aacatacgtt tgttgaaaca gagagtgtaa gatatgtcta ccagcctatg 301 gagaaactgt atatggtact gatcactacc aaaaacagca acattttaga agatttggag 361 accctaaggc tcttctcaag agtgatccct gaatattgcc gagccttaga agagaatgaa 421 atatctgagc actgttttga tttgattttt gcttttgatg aaattgtcgc actgggatac 481 cgggagaatg ttaacttggc acagatcaga accttcacag aaatggattc tcatgaggag 541 aaggtgttca gagccgtcag agagactcaa gaacgtgaag ctaaggctga gatgcgtcgt 601 aaagcaaagg aattacaaca ggcccgaaga gatgcagaga gacagggcaa aaaagcacca 661 ggatttggcg gatttggcag ctctgcagta tctggaggca gcacagctgc catgatcaca 721 gagaccatca ttgaaactga taaaccaaaa gtggcacctg caccagccag gccttcaggc 781 cccagcaagg ctttaaaact tggagccaaa ggaaaggaag tagataactt tgtggacaaa 841 ttaaaatctg aaggtgaaac catcatgtcc tctagtatgg gcaagcgtac ttctgaagca 901 accaaaatgc atgctccacc cattaatatg gaaagtgtac atatgaagat tgaagaaaag 961 ataacattaa cctgtggacg agacggagga ttacagaata tggagttgca tggcatgatc 1021 atgcttagga tctcagatga caagtatggc cgaattcgtc ttcatgtgga aaatgaagat 1081 aagaaagggg tgcagctaca gacccatcca aatgtggata aaaaactttt cactgcagag 1141 tctctaattg gcctgaagaa tccagagaag tcatttccag tcaacagtga cgtaggggtg 1201 ctaaagtgga gactacaaac cacagaggaa tcttttattc cactgacaat taattgctgg 1261 ccctcggaga gtggaaatgg ctgtgatgtc aacatagaat atgagctaca agaagataat 1321 ttagaactga atgatgtggt tatcaccatc ccactcccgt ctggtgtcgg cgcgcctgtt 1381 atcggtgaga tcgatgggga gtatcgacat gacagtcgac gaaataccct ggagtggtgc 1441 ctgcctgtga ttgatgccaa aaataagagt ggcagcctgg agtttagcat tgctgggcag 1501 cccaatgact tcttccctgt tcaagtttcc tttgtctcca agaaaaatta ctgtaacata 1561 caggttacca aagtgaccca ggtagatgga aacagccccg tcaggttttc cacagagacc 1621 actttcctag tggataagta tgaaatcctg taataccaag aagagggagc tgaaaaggaa 1681 aattttcaga ttaataaaga agacgccaat gatggctgaa gagtttttcc cagatttaca 1741 agccactgga gacccctttt ttctgataca atgcacgatt ctctgcgcgc aaggaccctc 1801 gactcacccc catgtttcag tgtcacagag acattctttg ataaggaaat ggcacaaaca 1861 taaagggaaa ggctgctaat tttctttggc agattgtatt ggccagcagg aaagcaagct 1921 ctccagagaa tgcccccagt taaatacctc ctctaccttt acctaagttg ctcctttatt 1981 tttattttat aataataa // LOCUS HSARNO 1200 bp RNA PRI 06-FEB-1997 DEFINITION H.sapiens mRNA for Arno protein. ACCESSION X99753 NID g1834465 KEYWORDS ARF exchange factor; Arno gene; Arno protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1200) AUTHORS Chardin,P., Paris,S., Antonny,B., Robineau,S., Beraud-Dufour,S., Jackson,C.L. and Chabre,M. TITLE A human exchange factor for ARF contains Sec7- and pleckstrin-homology domains JOURNAL Nature 384 (6608), 481-484 (1996) MEDLINE 97100951 REFERENCE 2 (bases 1 to 1200) AUTHORS Chardin,P. TITLE Direct Submission JOURNAL Submitted (02-AUG-1996) P. Chardin, CNRS UPR 411, Institut de Pharmacologie, 660 Route des Lucioles, F- 06560 Valbonne, FRANCE COMMENT Related sequence: H52019. FEATURES Location/Qualifiers source 1..1200 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /germline /tissue_type="brain" /clone_lib="Soares N2b4HB55Y" gene 1..1200 /gene="Arno" CDS 1..1200 /gene="Arno" /codon_start=1 /product="Arno protein (ARF exchange factor)" /db_xref="PID:e276477" /db_xref="PID:g1834466" /translation="MEDGVYEPPDLTPEERMELENIRRRKQELLVEIQRLREELSEAM SEVEGLEANEGSKTLQRNRKMAMGRKKFNMDPKKGIQFLVENELLQNTPEEIARFLYK GEGLNKTAIGDYLGEREELNLAVLHAFVDLHEFTDLNLVQALRQFLWSFRLPGEAQKI DRMMEAFAQRYCLCNPGVFQSTDTCYVLSFAVIMLNTSLHNPNVRDKPGLERFVAMNR GINEGGDLPEELLRNLYDSIRNEPFKIPEDDGNDLTHTFFNPDREGWLLKLGGRVKTW KRRWFILTDNCLYYFEYTTDKEPRGIIPLENLSIREVDDPRKPNCFELYIPNNKGQLI KACKTEADGRVVEGNHMVYRISAPTQEEKDEWIKSIQAAVSVDPFYEMLAARKKRISV KKKQEQP" BASE COUNT 297 a 319 c 382 g 202 t ORIGIN 1 atggaggacg gcgtttatga acccccagac ctgactccgg aggagcggat ggagctggag 61 aacatccggc ggcggaagca ggagctgctg gtggagattc agcgcctgcg ggaggagctc 121 agtgaagcca tgagcgaggt ggaggggctg gaggccaatg agggcagtaa gaccttgcaa 181 cggaaccgga agatggcaat gggcaggaag aagttcaaca tggaccccaa gaaggggatc 241 cagttcttgg tggagaatga actgctgcag aacacacccg aggagatcgc ccgcttcctg 301 tacaagggcg aggggctgaa caagacagcc atcggggact acctggggga gagggaagaa 361 ctgaacctgg cagtgctcca tgcttttgtg gatctgcatg agttcaccga cctcaatctg 421 gtgcaggccc tcaggcagtt tctatggagc tttcgcctac ccggagaggc ccagaaaatt 481 gaccggatga tggaggcctt cgcccagcga tactgcctgt gcaaccctgg ggttttccag 541 tccacagaca cgtgctatgt gctgtccttc gccgtcatca tgctcaacac cagtctccac 601 aatcccaatg tccgggacaa gccgggcctg gagcgctttg tggccatgaa ccggggcatc 661 aacgagggcg gggacctgcc tgaggagctg ctcaggaacc tgtacgacag catccgaaat 721 gagcccttca agattcctga ggatgacggg aatgacctga cccacacctt cttcaacccg 781 gaccgggagg gctggctcct gaagctgggg ggccgggtga aaacgtggaa gcggcgctgg 841 tttatcctca cagacaactg cctctactac tttgagtaca ccacggacaa ggagccccga 901 ggaatcatcc ccctggagaa tctgagcatc cgagaggtgg acgacccccg gaaaccgaac 961 tgctttgaac tttacatccc caacaacaag gggcagctca tcaaagcctg caaaactgag 1021 gcggacggcc gagtggtgga gggaaaccac atggtgtacc ggatctcggc ccccacgcag 1081 gaggagaagg acgagtggat caagtccatc caggcggctg tgagtgtgga ccccttctat 1141 gagatgctgg cagcgagaaa gaagcggatt tcagtcaaga agaagcagga gcagccctga // LOCUS HSARSD 1945 bp RNA PRI 07-JUL-1995 DEFINITION H.sapiens ARSD mRNA. ACCESSION X83572 NID g791001 KEYWORDS ARSD gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1945) AUTHORS Franco,B., Meroni,G., Parenti,G., Levilliers,J., Bernard,L., Gebbia,M., Cox,L., Maroteaux,P., Sheffield,L., Rappold,G.A. et,al. TITLE A cluster of sulfatase genes on Xp22.3: mutations in chondrodysplasia punctata (CDPX) and implications for warfarin embryopathy JOURNAL Cell 81 (1), 15-25 (1995) MEDLINE 95236447 REFERENCE 2 (bases 1 to 1945) AUTHORS Franco,B. TITLE Direct Submission JOURNAL Submitted (16-DEC-1994) B. Franco, T.I.G.E.M., via Olgettina 58, 20132 Milano, ITALY REFERENCE 3 (bases 1 to 1945) AUTHORS Franco,B. TITLE Direct Submission JOURNAL Submitted (04-MAY-1995) B. Franco, T.I.G.E.M., via Olgettina 58, 20132 Milano, ITALY FEATURES Location/Qualifiers source 1..1945 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="kidney" /clone_lib="adult kidney cDNA library" /chromosome="22" /map="Xp22.3" gene 77..1858 /gene="ARSD" CDS 77..1858 /gene="ARSD" /codon_start=1 /db_xref="PID:g791002" /translation="MRSAARRGRAAPAARDSLPVLLFLCLLLKTCEPKTANAFKPNIL LIMADDLGTGDLGCYGNNTLRTPNIDQLAEEGVRLTQHLAAAPLCTPSRAAFLTGRHS FRSGMDASNGYRALQWNAGSGGLPENETTFARILQQHGYATGLIGKWHQGVNCASRGD HCHHPLNHGFDYFYGMPFTLTNDCDPGRPPEVDAALRAQLWGYTQFLALGILTLAAGQ TCGFFSVSARAVTGMAGVGCLFFISWYSSFGFVRRWNCILMRNHDVTEQPMVLEKTAS LMLKEAVSYIERHKHGPFLLFLSLLHVHIPLVTTSAFLGKSQHGLYGDNVEEMDWLIG KVLNAIEDNGLKNSTFTYFTSDHGGHLEARDGHSQLGGWNGIYKGGKGMGGWEGGIRV PGIFHWPGVLPAGRVIGEPTSLMDVFPTVVQLVGGEVPQDRVIDGHSLVPLLQGAEAR SAHEFLFHYCGQHLHAARWHQKDSGSVWKVHYTTPQFHPEERGLLTAEASAHAEWGGV THHRPPLLFDLSRDPSEARPLTPDSEPLYHAVIARVGAAVSEHRQTLSPVPQQFSMSN ILWKPWLQPCCGHFPFCSCHEDGDGTP" BASE COUNT 408 a 554 c 577 g 406 t ORIGIN 1 ggaagccttg gcactagcgg cgcccgggcg cggagtgcgc agggcaaggt cctgcgctct 61 gggccagcgc tcggccatgc gatccgccgc gcggagggga cgcgccgcgc ccgccgccag 121 ggactctttg ccggtgctac tgtttttatg cttgcttctg aagacgtgtg aacctaaaac 181 tgcaaatgcc tttaaaccaa atatcctact gatcatggcg gatgatctag gcactgggga 241 tctcggttgc tacgggaaca atacactgag aacgccgaat attgaccagc ttgcagagga 301 aggtgtgagg ctcactcagc acctggcggc cgccccgctc tgcaccccaa gccgagctgc 361 attcctcaca gggagacatt ccttcagatc aggcatggac gccagcaatg gataccgggc 421 ccttcagtgg aacgcaggct caggtggact ccctgagaac gaaaccactt ttgcaagaat 481 cttgcagcag catggctatg caaccggcct cataggaaaa tggcaccagg gtgtgaattg 541 tgcatcccgc ggggatcact gccaccaccc cctgaaccac ggatttgact atttctacgg 601 catgcccttc acgctcacaa acgactgtga cccaggcagg ccccccgaag tggacgccgc 661 cctgagggcg cagctctggg gttacaccca gttcctggcg ctggggattc tcaccctggc 721 tgccggccag acctgcggtt tcttctctgt ctccgcgaga gcagtcaccg gcatggccgg 781 cgtgggctgc ctgtttttca tctcttggta ctcctccttc gggtttgtgc gacgctggaa 841 ctgtatcctg atgagaaacc atgacgtcac ggagcaaccc atggttctgg agaaaacagc 901 gagtcttatg ctaaaggaag ctgtttccta tattgaaaga cacaagcatg ggccatttct 961 cctcttcctt tctttgctgc atgtgcacat tccccttgtg accacgagtg cattcctggg 1021 gaaaagtcag catggcttat atggtgataa tgtggaggag atggactggc tcataggtaa 1081 ggttcttaat gccatcgaag acaatggttt aaagaactca acattcacgt atttcacctc 1141 tgaccatgga ggacatttag aggcaagaga tggacacagc cagttagggg gatggaacgg 1201 aatttacaaa ggtgggaagg gcatgggagg atgggaaggt gggatccgag tgcccgggat 1261 cttccactgg ccgggggtgc tcccggccgg ccgagtgatt ggagagccca cgagcctgat 1321 ggacgtgttc cctactgtgg tccagctggt gggtggcgag gtgccccagg acagggtgat 1381 tgatggccac agcctggtac ccttgctgca gggagctgag gcacgctcgg cacatgagtt 1441 cctgtttcat tactgtgggc agcatcttca cgcagcacgc tggcaccaga aggacagtgg 1501 aagcgtctgg aaggttcatt acacgacccc gcagttccac cccgaggagc ggggcctgct 1561 aacggccgag gcgtctgccc atgctgaatg gggaggcgtg acccatcaca gacccccttt 1621 gctctttgac ctctccaggg acccctccga ggcacggccc ctgacccccg actccgagcc 1681 cctgtaccac gccgtgatag caagggtagg tgccgcggtg tcggagcatc ggcagaccct 1741 gagtcctgtg ccccagcagt tttccatgag caacatcctg tggaagccgt ggctgcagcc 1801 gtgctgcgga catttcccgt tctgttcatg ccacgaggat ggggatggca ccccctgaat 1861 gccaggactg tgagagagga tccaggagag cctgactgcg ttgcaaacaa aattctccaa 1921 gcttggttct atcttcagtc cggaa // LOCUS HSARSE 1858 bp RNA PRI 07-JUL-1995 DEFINITION H.sapiens ARSE mRNA. ACCESSION X83573 NID g791003 KEYWORDS ARSE gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1858) AUTHORS Franco,B., Meroni,G., Parenti,G., Levilliers,J., Bernard,L., Gebbia,M., Cox,L., Maroteaux,P., Sheffield,L., Rappold,G.A. et,al. TITLE A cluster of sulfatase genes on Xp22.3: mutations in chondrodysplasia punctata (CDPX) and implications for warfarin embryopathy JOURNAL Cell 81 (1), 15-25 (1995) MEDLINE 95236447 REFERENCE 2 (bases 1 to 1858) AUTHORS Franco,B. TITLE Direct Submission JOURNAL Submitted (16-DEC-1994) B. Franco, T.I.G.E.M., via Olgettina 58, 20132 Milano, ITALY FEATURES Location/Qualifiers source 1..1858 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="kidney" /clone_lib="adult kidney cDNA library" /chromosome="22" /map="Xp22.3" gene 68..1837 /gene="ARSE" CDS 68..1837 /gene="ARSE" /codon_start=1 /db_xref="PID:g791004" /translation="MLHLHHSCLCFRSWLPAMLAVLLSLAPSASSDISASRPNILLLM ADDLGIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRS GMVSSIGYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCH HPLHHGFEHFYGMPFSLMGDCARWELSEKRVNLEQKLNFLFQVLALVALTLVAGKLTH LIPVSWMPVIWSALSAVLLLASSYFVGALIVHADCFLMRNHTITEQPMCFQRTTPLIL QEVASFLKRNKHGPFLLFVSFLHVHIPLITMENFLGKSLHGLYGDNVEEMDWMVGRIL DTLDVEGLSNSTLIYFTSDHGGSLENQLGNTQYGGWNGIYKGGKGMGGWEGGIRVPGI FRWPGVLPAGRVIGEPTSLMDVFPTVVRLAGGEVPQDRVIDGQDLLPLLLGTAQHSDH EFLMHYCERFLHAARWHQRDRGTMWKVHFVTPVFQPEGAGACYGRKVCPCFGEKVVHH DPPLLFDLSRDPSETHILTPASEPVFYQVMERVQQAVWEHQRTLSPVPLQLDRLGNIW RPWLQPCCGPFPLCWCLREDDPQ" BASE COUNT 406 a 515 c 506 g 431 t ORIGIN 1 ccttcctctt cttgatcggg gattcaggaa ggagcccagg agcagaggaa gtagagagag 61 agacaacatg ttacatctgc accattcttg tttgtgtttc aggagctggc tgccagcgat 121 gctcgctgta ctgctaagtt tggcaccatc agcttccagc gacatttccg cctcccgacc 181 gaacatcctt cttctgatgg cggacgacct tggcattggg gacattggct gctatggcaa 241 caacaccatg aggactccga atattgaccg ccttgcagag gacggcgtga agctgaccca 301 acacatctct gccgcatctt tgtgcacccc aagcagagcc gccttcctca cgggcagata 361 ccctgtgcga tcagggatgg tttccagcat tggttaccgt gttcttcagt ggaccggagc 421 atctggaggt cttccaacaa atgagacaac ttttgcaaaa atactgaaag agaaaggcta 481 tgccactgga ctcattggaa aatggcatct gggtctcaac tgtgagtcag ccagtgatca 541 ttgccaccac cctctccatc atggctttga gcatttctac ggaatgcctt tctccttgat 601 gggtgattgc gcccgctggg aactctcaga gaagcgtgtc aacctggaac aaaaactcaa 661 cttcctcttc caagtcctgg ccttggttgc cctcacactg gtagcaggga agctcacaca 721 cctgataccc gtctcgtgga tgccggtcat ctggtcagcc ctttcggccg tcctcctcct 781 cgcaagctcc tattttgtgg gtgctctgat tgtccatgcc gattgctttc tgatgagaaa 841 ccacaccatc acggagcagc ccatgtgctt ccaaagaacg acacccctta ttctgcagga 901 ggttgcgtcc tttctcaaaa ggaataagca tgggcctttc ctcctctttg tttcctttct 961 acacgttcac atccctctta tcactatgga gaacttcctc gggaagagtc tccacgggct 1021 gtatggggac aacgtagagg agatggactg gatggtagga cggatccttg acactttgga 1081 cgtggagggt ttgagcaaca gcaccctcat ttattttacg tcggatcacg gcggttccct 1141 agagaatcaa cttggaaaca cccagtatgg tggctggaat ggaatttata aaggtgggaa 1201 gggcatggga ggatgggaag gtgggatccg cgtgcccggg atcttccgct ggcccggggt 1261 gctcccggcc ggccgagtga ttggcgagcc cacgagtctg atggacgtgt tccccaccgt 1321 ggtccggctg gcgggcggcg aggtgcccca ggacagagtg attgacggcc aagaccttct 1381 gcccttgctc ctggggacag cccaacactc agaccacgag ttcctgatgc attattgtga 1441 gaggtttctg cacgcagcca ggtggcatca acgggacaga ggaacaatgt ggaaagtcca 1501 ctttgtgacg cctgtgttcc agccagaggg agccggtgcc tgctatggaa gaaaggtctg 1561 cccgtgcttt ggggaaaaag tagtccacca cgatccacct ttgctctttg acctctcaag 1621 agacccttct gagacccaca tcctcacacc agcctcagag cccgtgttct atcaggtgat 1681 ggaacgagtc cagcaggcgg tgtgggaaca ccagcggaca ctcagcccag ttcctctgca 1741 gctggacagg ctgggcaaca tctggagacc gtggctgcag ccctgctgtg gcccgttccc 1801 cctctgctgg tgccttaggg aagatgaccc acaataaatg tctgcagtga aaagctgg // LOCUS HSARSF 1996 bp RNA PRI 14-OCT-1997 DEFINITION H.sapiens mRNA for arylsulphatase. ACCESSION X97868 NID g2576304 KEYWORDS arsf gene; arylsulphatase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1996) AUTHORS Puca,A.A., Zollo,M., Repetto,M., Andolfi,G., Guffanti,A., Simon,G., Ballabio,A. and Franco,B. TITLE Identification by shotgun sequencing, genomic organization, and functional analysis of a fourth arylsulfatase gene (ARSF) from the Xp22.3 region JOURNAL Genomics 42 (2), 192-199 (1997) MEDLINE 97336043 REFERENCE 2 (bases 1 to 1996) AUTHORS Franco,B. TITLE Direct Submission JOURNAL Submitted (15-MAY-1996) B. Franco, T.I.G.E.M., via Olgettina 58, 20132 Milano, ITALY FEATURES Location/Qualifiers source 1..1996 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="foetus" /tissue_type="brain" /chromosome="X" /map="p22.3" gene 71..1846 /gene="arsf" CDS 71..1846 /gene="arsf" /codon_start=1 /product="arylsulphatase" /db_xref="PID:e245884" /db_xref="PID:g2576305" /translation="MRPRRPLVFMSLVCALLNTWPGHTGCMTTRPNIVLIMVDDLGIG DLGCYGNDTMRTPHIDRLAREGVRLTQHISAASLCSPSRSAFLTGRYPIRSGMVSSGN RRVIQNLAVPAGLPLNETTLAALLKKQGYSTGLIGKWHQGLNCDSRSDQCHHPYNYGF DYYYGMPFTLVDSCWPDPSRNTELAFESQLWLCVQLVAIAILTLTFGKLSGWVSVPWL LIFSMILFIFLLGYAWFSSHTSPLYWDCLLMRGHEITEQPMKAERAGSIMVKEAISFL ERHSKETFLLFFSFLHVHTPLPTTDDFTGTSKHGLYGDNVEEMDSMVGKILDAIDDFG LRNNTLVYFTSDHGGHLEARRGHAQLGGWNGIYKGGKGMGGWEGGIRVPGIVRWPGKV PAGRLIKEPTSLMDILPTVASVSGGSLPQDRVIDGRDLMPLLQGNVRHSEHEFLFHYC GSYLHAVRWIPKDDSGSVWKAHYVTPVFQPPASGGCYVTSLCRCFGEQVTYHNPPLLF DLSRDPSESTPLTPATEPLYDFVIKKVANALKEHQETIVPVTYQLSELNQGRTWLKPC CGVFPFCLCDKEEEVSQPRGPNEKR" BASE COUNT 481 a 521 c 508 g 486 t ORIGIN 1 gggttctgct cctagacatt agagagataa tacggctgat agacaacaag aaggtattcc 61 aagctgcaca atgaggccca ggagaccgtt ggtcttcatg tctttggtgt gtgcactctt 121 gaacacatgg ccagggcaca cagggtgcat gacgacaagg cctaatattg tcctaatcat 181 ggttgatgac ctgggtattg gagatctggg ctgctacggc aatgacacca tgaggacgcc 241 tcacatcgac cgccttgcca gggaaggcgt gcgactgact cagcacatct ctgccgcctc 301 cctctgcagc ccaagccggt ccgcgttctt gacgggaaga taccccatcc gatcaggtat 361 ggtttctagt ggtaatagac gtgtcatcca aaatcttgca gtccccgcag gcctccctct 421 taatgagaca acacttgcag ccttgctaaa gaagcaagga tacagcacgg ggcttatagg 481 caaatggcac caaggcttga actgcgactc ccgaagtgac cagtgccacc atccatataa 541 ttatgggttt gactactact atggcatgcc gttcactctc gttgacagct gctggccgga 601 cccctctcgt aacacggaat tagcctttga gagtcagctc tggctctgtg tgcagctagt 661 tgccattgcc atcctcaccc taacctttgg gaagctgagc ggctgggtct ctgttccctg 721 gctcctgatc ttctccatga ttctgtttat tttcctcttg ggctatgctt ggttctccag 781 ccacacgtcc cctttatact gggactgcct cctcatgcgg gggcacgaga tcacggagca 841 gcccatgaag gctgaacgag ctggatccat tatggtgaag gaagcgattt cctttttaga 901 aaggcacagt aaggaaactt tccttctctt tttctccttt cttcacgtgc acacacctct 961 ccccaccacg gacgatttca ctggcaccag caagcatggc ttgtatgggg ataatgtgga 1021 agagatggac tccatggtgg gcaagattct tgatgctatc gatgattttg gcctaaggaa 1081 caacaccctt gtctacttta catcagatca cggagggcat ttggaagcta ggcgagggca 1141 tgcccaactt ggtggatgga atggaatata caaaggtgga aaaggcatgg ggggctggga 1201 aggtggaatc cgcgtcccag gaattgtccg atggcctgga aaggtaccag ctggacggtt 1261 gattaaggaa cctacaagtt taatggatat tttaccaact gtcgcatcag tgtcaggagg 1321 aagtctccct caggacaggg tcattgacgg ccgagacctc atgcccttgc tgcagggcaa 1381 cgtcaggcac tcggagcatg aatttctttt ccactactgt ggctcctacc tgcacgccgt 1441 gcggtggatc cccaaggacg acagtgggtc agtttggaag gctcactatg tgaccccggt 1501 attccagcca ccagcttctg gtggctgcta tgtcacctca ttatgcagat gtttcggaga 1561 acaggttacc taccacaacc cccctctgct cttcgatctc tccagggacc cctcagagtc 1621 cacacccctg acacctgcca cagagcccct ctatgatttt gtgattaaaa aggtggccaa 1681 cgccctgaag gaacaccagg aaaccatcgt gcctgtgacc taccaactct cagaactgaa 1741 tcagggcagg acgtggctga agccttgctg tggggtgttc ccattttgtc tgtgtgacaa 1801 ggaagaggaa gtctctcagc ctcggggtcc taacgagaag agataattac aatcaggcta 1861 ccagaggaag cctttggtcc taacgagaag agataattac aatcaggcta ccaaaggaag 1921 cactaacttt ggtgctttca agttggcaag gagtgcattt aatagtcaat aaattcatct 1981 accattccag attatt // LOCUS HSART3 1104 bp DNA PRI 26-FEB-1997 DEFINITION H.sapiens ART3 gene. ACCESSION X95827 NID g1495418 KEYWORDS mono-ADP-ribosyltransferase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1104) AUTHORS Koch-Nolte,F., Haag,F., Braren,R., Kuhl,M., Hoovers,J., Balasubramanian,S., Bazan,F. and Thiele,H.G. TITLE Two novel human members of an emerging mammalian gene family related to mono-ADP-ribosylating bacterial toxins JOURNAL Genomics 39 (3), 370-376 (1997) MEDLINE 97224466 REFERENCE 2 (bases 1 to 1104) AUTHORS Koch-Nolte,F. TITLE Direct Submission JOURNAL Submitted (23-FEB-1996) F. Koch-Nolte, University Hospital, Dept. of Immunology, Martinistr. 52, 20246 Hamburg, FRG REFERENCE 3 (bases 1 to 1104) AUTHORS Koch-Nolte,F., Braren,R., Haag,F., Khl,M. and Thiele,H.G. TITLE Molecular characterization of the gene for murine skeletal and cardiac muscle ecto mono(ADPribosyl)transferase Art2 JOURNAL Unpublished FEATURES Location/Qualifiers source 1..1104 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="P1 Genome systems" /chromosome="4" gene 1..1104 /gene="ART3" CDS 1..1104 /gene="ART3" /EC_number="2.4.2.31" /note="expressed in testis" /codon_start=1 /product="mono-ADP-ribosyltransferase" /db_xref="PID:e223921" /db_xref="PID:g1495419" /translation="MKTGHFEIVTMLLATMILVDIFQVKAEVLDMADNAFDDEYLKCT DRMEIKYVPQLLKEEKASHQQLDTVWENAKAKWAARKTQIFLPMNFKDNHGIALMAYI SEAQEQTPFYHLFSEAVKMAGQSREDYIYGFQFKAFHFYLTRALQLLRKPCEASSKTV VYRTSQGTSFTFGGLNQARFGHFTLAYSAKPQAANDQLTVLSIYTCLGVDIENFLDKE SERITLIPLNEVFQVSQEGAGNNLILQSINKTCSHYECAFLGGLKTENCIENLEYFQP IYVYNPGEKNQKLEDHSEKNWKLEDHGEKNQKLEDHGVKILEPTQIPAPGPVPVPGPK CHPSASSGKLLLPQFGMVIILISVSAINLFVAL" BASE COUNT 342 a 239 c 233 g 290 t ORIGIN 1 atgaagacgg gacattttga aatagtcacc atgctgctgg caaccatgat tctagtggac 61 attttccagg tgaaggctga agtgttagac atggcagata atgcatttga tgatgaatac 121 ctgaaatgta cggacaggat ggaaattaaa tacgttcccc aactgctaaa ggaggaaaaa 181 gcaagccacc agcaattaga tactgtgtgg gaaaatgcaa aagccaaatg ggcagcccga 241 aagactcaaa tctttctccc tatgaatttt aaggataacc atggaatagc cctgatggca 301 tatatttccg aagctcaaga gcaaactccc ttttaccatc tgttcagtga agctgtgaag 361 atggctggcc aatctcgaga agattatatc tatggcttcc agttcaaagc tttccacttt 421 tacctcacaa gagccctgca gttgctgaga aaaccttgtg aggccagttc caaaactgtg 481 gtatatagaa caagccaggg cacttcattt acatttggag ggctaaacca agccaggttt 541 ggccatttta ccttggcata ttcagccaaa cctcaggctg ctaatgacca gctcactgtg 601 ttatccatct acacatgcct tggagttgac attgaaaatt ttcttgataa agaaagtgaa 661 agaattactt taatacctct gaatgaggtt tttcaagtgt cacaggaggg ggctggcaat 721 aaccttatcc ttcaaagcat aaacaagacc tgcagccatt atgagtgtgc atttctaggt 781 ggactaaaaa ccgaaaactg tattgagaac ctagaatatt ttcaacccat ctatgtctac 841 aaccctggtg agaaaaacca gaagcttgaa gaccatagtg agaaaaactg gaagcttgaa 901 gaccatggtg agaaaaacca gaagcttgaa gaccatggtg tgaaaatcct tgaacccacc 961 caaatacctg ctccaggtcc agttcctgtt ccaggtccca aatgccatcc ttctgcatcc 1021 tcgggcaaac tgctgcttcc acagtttggg atggtcatca ttttaatcag tgtttctgct 1081 ataaatctct ttgttgctct gtag // LOCUS HSASD 1560 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for argininosuccinate synthetase. ACCESSION X01630 NID g28871 KEYWORDS synthetase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1560) AUTHORS Bock,H.G., Su,T.S., O'Brien,W.E. and Beaudet,A.L. TITLE Sequence for human argininosuccinate synthetase cDNA JOURNAL Nucleic Acids Res. 11 (18), 6505-6512 (1983) MEDLINE 84015388 FEATURES Location/Qualifiers source 1..1560 /organism="Homo sapiens" /db_xref="taxon:9606" misc_feature 40..48 /note="three tandem arginin codons" CDS 76..1314 /note="argininosuccinate synthetase (aa 1-412)" /codon_start=1 /db_xref="PID:g28872" /db_xref="SWISS-PROT:P00966" /translation="MSSKGSVVLAYSGGLDTSCILVWLKEQGYDVIAYLANIGQKEDF EEARKKALKLGAKKVFIEDVSREFVEEFIWPAIQSSALYEDRYLLGTSLARPCIARKQ VEIAQREGAKYVSHGATGKGNDQVRFELSCYSLAPQIKVIAPWRMPEFYNRFKGRNDL MEYAKQHGIPIPVTPKNPWSMDENLMHISYEAGILENPKNQAPPGLYTKTQDPAKAPN TPDILEIEFKKGVPVKVTNVKDGTTHQTSLELFMYLNEVAGKHGVGRIDIVENRFIGM KSRGIYETPAGTILYHAHLDIEAFTMDREVRKIKQGLGLKFAELVYTGLRPSPECEFV RHCIAKSQERVEGKVQVSVLKGQVYILGRESPLSLYNEELVSMNVQGDYEPTDATGFI NINSLRLKEYHRLQSKVTAK" variation 759..760 /note="pot. additional A in pAS1" variation 1320 /note="U is C in variant pAS2" variation 1431 /note="G is A in variant pAS1" misc_feature 1526..1532 /note="put. polyadenylation signal" misc_feature 1537..1544 /note="pot. polyadenylation signal" polyA_site 1547 /note="polyadenylation site" variation 1555 /note="A is U in variant pAS2" BASE COUNT 393 a 431 c 434 g 302 t ORIGIN 1 cgagcccgag tggttcactg cactgtgaaa acagattcca gacgccggga actcacgcct 61 ccaatcccag acgctatgtc cagcaaaggc tccgtggttc tggcctacag tggcggcctg 121 gacacctcgt gcatcctcgt gtggctgaag gaacaaggct atgacgtcat tgcctatctg 181 gccaacattg gccagaagga agacttcgag gaagccagga agaaggcact gaagcttggg 241 gccaaaaagg tgttcattga ggatgtcagc agggagtttg tggaggagtt catctggccg 301 gccatccagt ccagcgcact gtatgaggac cgctacctcc tgggcacctc tcttgccagg 361 ccctgcatcg cccgcaaaca agtggaaatc gcccagcggg agggggccaa gtatgtgtcc 421 cacggcgcca caggaaaggg gaacgatcag gtccggtttg agctcagctg ctactcactg 481 gccccccaga taaaggtcat tgctccctgg aggatgcctg aattctacaa ccggttcaag 541 ggccgcaatg acctgatgga gtacgcaaag caacacggga ttcccatccc ggtcactccc 601 aagaacccgt ggagcatgga tgagaacctc atgcacatca gctacgaggc tggaatcctg 661 gagaacccca agaaccaagc gcctccaggt ctctacacga agacccagga cccagccaaa 721 gcccccaaca cccctgacat tctcgagatc gagttcaaaa aaggggtccc tgtgaaggtg 781 accaacgtca aggatggcac cacccaccag acctccttgg agctcttcat gtacctgaac 841 gaagtcgcgg gcaagcatgg cgtgggccgt attgacatcg tggagaaccg cttcattgga 901 atgaagtccc gaggtatcta cgagacccca gcaggcacca tcctttacca tgctcattta 961 gacatcgagg ccttcaccat ggaccgggaa gtgcgcaaaa tcaaacaagg cctgggcttg 1021 aaatttgctg agctggtgta taccggttta cggcctagcc ctgagtgtga atttgtccgc 1081 cactgcatcg ccaagtccca ggagcgagtg gaagggaaag tgcaggtgtc cgtcctcaag 1141 ggccaggtgt acatcctcgg ccgggagtcc ccactgtctc tctacaatga ggagctggtg 1201 agcatgaacg tgcagggtga ttatgagcca actgatgcca ccgggttcat caacatcaat 1261 tccctcaggc tgaaggaata tcatcgtctc cagagcaagg tcactgccaa atagacccgt 1321 gtacaatgag gagctggggc ctcctcaatt tgcagatccc ccaagtacag gcgctaattg 1381 ttgtgataat ttgtaattgt gacttgttct ccccggctgg cagcgtagtg gggctgccag 1441 gccccagctt tgttccctgg tccccctgaa gcctgcaaac gttgtcatcg aagggaaggg 1501 tggggggcag ctgcggtggg gagctataaa aatgacaatt aaaagagaaa aaaaaaaaaa // LOCUS HSASUCL 1692 bp RNA PRI 18-SEP-1996 DEFINITION H.sapiens mRNA for adenylosuccinate lyase. ACCESSION X65867 NID g28903 KEYWORDS adenylosuccinate lyase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1692) AUTHORS Fon,E.A. TITLE Direct Submission JOURNAL Submitted (27-APR-1992) E.A. Fon, Montreal General Hospital Research Institute, 1650 Cedar Avenue, Montreal, Quebec H3G 1A4, CANADA REFERENCE 2 (bases 1 to 1692) AUTHORS Fon,E.A., Demczuk,S., Delattre,O., Thomas,G. and Rouleau,G.A. TITLE Mapping of the human adenylosuccinate lyase (ADSL) gene to chromosome 22q13.1-->q13.2 JOURNAL Cytogenet. Cell Genet. 64 (3-4), 201-203 (1993) MEDLINE 94007960 COMMENT See also Stone R.L., Nature Genetics 1:59-63(1992). FEATURES Location/Qualifiers source 1..1692 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="brain" /clone_lib="lambda zapII" /clone="Z9.2" /chromosome="22" CDS 3..1457 /EC_number="4.3.2.2" /note="ORF-1" /codon_start=1 /product="adenylosuccinate lyase" /db_xref="PID:g28904" /db_xref="SWISS-PROT:P30566" /translation="MAAGGDHGSPDSYRSPLASRYASPEMCFVFSDRYKFRTWRQLWL WLAEAEQTLGLPITDEQIQEMKSNLENIDFKMAAEEEKRLRHDVMAHVHTFGHCCPKA AGIIHLGATSCYVGDNTDLIILRNALDLLLPKLARVISRLADFAKERASLPTLGFTHF QPAQLTTVGKRCCLWIQDLCMDLQNLKRVRDDLRFRGVKGTTGTQASFLQLFEGDDHK VEQLDKMVTEKAGFKRAFIITGQTYTRKVDIEVLSVLASLGASVHKICTDIRLLANLK EMEEPFEKQQIGSSAMPYKRNPMRSERCCSLARHLMTLVMDPLQTASVQWFERTLDDS ANRRICLAEAFLTADTILNTLQNISEGLVVYPKVIERRIRQELPFMATENIIMAMVKA GGSRQDCHEKIRVLSQQAASVVKQEGGDNDLIERIQVDAYFSPIHSQLDHLLDPSSFT GRASQQVQRFLEEEVYPLLKPYESVMKVKAELCL" CDS 78..1457 /EC_number="4.3.2.2" /note="ORF-2" /codon_start=1 /product="adenylosuccinate lyase" /db_xref="PID:g28905" /db_xref="SWISS-PROT:P30566" /translation="MCFVFSDRYKFRTWRQLWLWLAEAEQTLGLPITDEQIQEMKSNL ENIDFKMAAEEEKRLRHDVMAHVHTFGHCCPKAAGIIHLGATSCYVGDNTDLIILRNA LDLLLPKLARVISRLADFAKERASLPTLGFTHFQPAQLTTVGKRCCLWIQDLCMDLQN LKRVRDDLRFRGVKGTTGTQASFLQLFEGDDHKVEQLDKMVTEKAGFKRAFIITGQTY TRKVDIEVLSVLASLGASVHKICTDIRLLANLKEMEEPFEKQQIGSSAMPYKRNPMRS ERCCSLARHLMTLVMDPLQTASVQWFERTLDDSANRRICLAEAFLTADTILNTLQNIS EGLVVYPKVIERRIRQELPFMATENIIMAMVKAGGSRQDCHEKIRVLSQQAASVVKQE GGDNDLIERIQVDAYFSPIHSQLDHLLDPSSFTGRASQQVQRFLEEEVYPLLKPYESV MKVKAELCL" BASE COUNT 449 a 390 c 422 g 431 t ORIGIN 1 ccatggcggc tggaggcgat catggttcgc ccgacagcta ccgctcacct cttgcctccc 61 gctatgccag cccggagatg tgcttcgtgt ttagcgacag gtataaattc cggacatggc 121 ggcagctgtg gctgtggctg gcggaggccg agcagacatt gggtttgcct atcacagatg 181 aacaaatcca ggagatgaaa tcaaacctgg agaacataga cttcaagatg gcagctgagg 241 aagagaaacg tttacgacat gatgtgatgg ctcacgtgca cacatttggc cactgctgtc 301 caaaagctgc aggcattatt caccttggtg ctacttcttg ctatgttgga gacaatactg 361 acttgattat tcttagaaat gcacttgacc tgcttttgcc aaagcttgcc agagtgatct 421 ctcggcttgc cgactttgct aaggaacgag ccagtctacc cacattaggt ttcacacatt 481 tccagcctgc acagctgacc acagttggga aacgttgctg tctttggatt caggatcttt 541 gcatggatct ccagaacttg aagcgtgtcc gagatgacct gcgcttccgg ggagtaaagg 601 gtaccactgg cactcaggcc agtttcctgc agctctttga gggagatgac cataaggtag 661 agcagcttga caagatggtg acagaaaagg caggatttaa gagagctttc atcatcacag 721 ggcagacata tacacgaaaa gtggatattg aagtactgtc tgtgctggct agcttggggg 781 catcagtgca caagatttgc accgacatac gcctcctggc aaacctcaag gagatggagg 841 aaccctttga aaaacagcag attggctcaa gtgcgatgcc atataagcgg aatcccatgc 901 gttcagaacg ttgctgcagt cttgcccgcc acctgatgac ccttgtcatg gacccgctac 961 agacagcatc tgtccagtgg tttgaacgca cactggatga tagtgccaac cgacggatct 1021 gtttggccga ggcatttctt accgcagata ctatattgaa tacgctgcag aacatttctg 1081 aaggattggt cgtgtacccc aaagtaattg aacggcgcat tcggcaagag ctgcctttca 1141 tggccacaga gaacatcatc atggccatgg tcaaagctgg aggtagccgc caggattgcc 1201 atgagaaaat cagagtgctt tctcagcagg cagcttctgt ggttaagcag gaagggggtg 1261 acaatgacct catagagcgt atccaggttg atgcctactt cagtcccatt cactcccagt 1321 tggatcattt actggatcct tcttctttca ctggtcgtgc ctcccagcag gtgcagagat 1381 tcttagaaga ggaggtgtat cccctgttaa aaccatatga aagcgtgatg aaggtgaaag 1441 cagaattatg tctgtagagt tggaagagaa ttaaacgaaa atcattgtta attgctgagg 1501 catgaaaatt gtgttactat aacgccttat tttacctcga gaattgttac cttaaattag 1561 tacagcactt tcttcttccc atggtgcttt cctgtttctc agtctcacat ttctcaacaa 1621 ggcaaaaaca aagagcgttg aagttgactc tgctcttgca tagtaaatgt agttcatact 1681 tgaaaaaaaa aa // LOCUS HSATFA 2758 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for ATF-a transcription factor. ACCESSION X52943 NID g28912 KEYWORDS DNA-binding protein; leucine zipper; transcription factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2758) AUTHORS Kedinger,C. TITLE Direct Submission JOURNAL Submitted (03-MAY-1990) Kedinger C., LGME/CNRS-U184/INSERM-Universite'Louis Pasteur, 11 rue Humann, 67085 Strasbourg Cedex, FRANCE, LGME/CNRS-U184/INSERM, Universite'Louis Pasteur, 11 rue Humann, 67085 STRASBOURG CEDEX, FRANCE REFERENCE 2 (bases 1 to 2758) AUTHORS Gaire,M., Chatton,B. and Kedinger,C. TITLE Isolation and characterization of two novel, closely related ATF cDNA clones from HeLa cells JOURNAL Nucleic Acids Res. 18 (12), 3467-3473 (1990) MEDLINE 90301459 COMMENT Data kindly reviewed (09-JUL-1990) by Kedinger C. FEATURES Location/Qualifiers source 1..2758 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /clone_lib="lambda gt11" /clone="ATF-a" CDS 104..1555 /note="ATF-a protein (AA 1-483)" /codon_start=1 /db_xref="PID:g28913" /db_xref="SWISS-PROT:P17544" /translation="MGDDRPFVCNAPGCGQRFTNEDHLAVHKHKHEMTLKFGPARTDS VIIADQTPTPTRFLKNCEEVGLFNELASSFEHEFKKAADEDEKKAAAGPLDMSLPSTP DIKIKEEEPVEVDSSPPDSPASSPCSPPLKEKEVTPKPVLISTPTPTIVRPGSLPLHL GYDPLHPTLPSPTSVITQAPPSNRQMGSPTGSLPLVMHLANGQTMPVLPGPPVQMPSV ISLARPVSMVPNIPGIPGPPVNSSGSISPSGHPIPSEAKMRLKATLTHQVSSINGGCG MVVGTASTMVTARPEQSQILIQHPDAPSPAQPQVSPAQPTPSTGGRRRRTVDEDPDER RQRFLERNRAAASRCRQKRKLWVSSLEKKAEELTSQNIQLSNEVTLLRNEVAQLKQLL LAHKDCPVTALQKKTQGYLESPKESSEPTGSPAPVIQHSSATAPSNGLSVRSAAEAVA TSVLTQMASQRTELSMPIQSHVIMTPQSQSAGR" misc_feature 443..505 /note="alternative exon (absent in clone ATF-a delta)" BASE COUNT 647 a 813 c 612 g 686 t ORIGIN 1 gaattcaggg gggaaaggag gagctggaga cagattgtag gaccgagcgc gggcaggcgg 61 gaggcaacgg agctaccagc cgctcctctc tgctatatga aatatgggag acgacagacc 121 gtttgtgtgc aatgccccgg gctgtggaca gagatttaca aacgaggacc acctggcagt 181 tcataaacac aagcatgaga tgacattgaa atttggccca gcccgaactg actcagtcat 241 cattgcagat caaacgccta ctccaactag attcctgaag aactgtgagg aggtgggact 301 cttcaatgaa ctagctagct cctttgaaca tgaattcaag aaagctgcag atgaggatga 361 gaaaaaggct gctgctgggc cccttgacat gtctctgcct tccacaccag acatcaaaat 421 caaagaagaa gagccagtgg aggtagactc atccccacct gatagccctg cctctagtcc 481 ctgttcccca ccactgaagg agaaggaggt taccccaaag cctgttctga tctctacccc 541 cacacccacc attgtacgtc ctggctccct gcctctccac ttgggctatg atccacttca 601 tccaaccctt ccctccccaa cctctgtcat cacacaggct ccaccatcca acaggcaaat 661 ggggtctccc actggctccc tccctcttgt catgcatctt gctaatggac agaccatgcc 721 tgtgttgcca gggcctccag tacagatgcc gtctgttata tcgctggcca gacctgtgtc 781 catggtgccc aacattcctg gtatccctgg cccaccagtt aacagtagtg gctccatttc 841 tccctctggc caccctatac catcagaagc caagatgaga ctgaaagcca ccctaactca 901 ccaagtctcc tcaatcaatg gtggttgtgg aatggtggtg ggtactgcca gcaccatggt 961 gacagcccgc ccagagcaga gccagattct catccagcac cctgatgccc catcccctgc 1021 ccagccacag gtctcaccag ctcagcccac ccctagtact ggggggcgac ggcggcgcac 1081 agtagatgaa gatccagatg agcgacggca gcgctttctg gagcgcaacc gggctgcagc 1141 ctcccgctgc cgccaaaagc gaaagctgtg ggtgtcctcc ctagagaaga aggccgaaga 1201 actcacttct cagaacattc agctgagtaa tgaagtcaca ttactacgca atgaggtggc 1261 ccagttgaaa cagctactgt tagctcataa agactgccca gtcactgcac tacagaaaaa 1321 gactcaaggc tatttagaaa gccccaagga aagctcagag ccaacgggtt ctccagcccc 1381 tgtgattcag cacagctcag caacagcccc tagcaatggc ctcagtgttc gctctgcagc 1441 tgaagctgtg gccacctcgg tcctcactca gatggccagc caaaggacag aactgagcat 1501 gccgatacaa tcgcatgtaa tcatgacccc acagtcccag tctgcgggca gatgatgcct 1561 cctctggtgg agaggtcctc agccagagct cgcccgccca agagccaaga gagatgtcat 1621 cttatcccct cccatctgcc ttggacatgg caatatgggt ggggggaatt gtgtgttgtg 1681 agtaatggac agatatgggg ctttttagcc atgtggtcat gtacgtttga cacctgtgct 1741 gggggcctcc cagccccgga gcctcataga tcatccccca gggtggggtt tctgatgcat 1801 ccacagccaa aggcctttcc agatggtccg aacaatcacc cctgccccat gcacgtgtat 1861 ctgtctcttt taacactaac ccttggcact tctaggtttg gggtttctca gttccctctc 1921 ttcccactaa gccgtggctg ttaccttttt gttccttacc agcatgccac ttcctgccag 1981 atagagggag gctagatttg atgacatatt acccgctgac ccagccccag ccaccccatg 2041 gactcaggac tcaggtctcc catagctgct tatttacccg tttccttttt cctcttcctt 2101 aattctagtt aaaacattca cttacggaga atgtaaggtt acctttgatg tatgaagtag 2161 cagtattttc cagtttttcc ataagcttta ttccttctct ctacctgcca ccaaaaagaa 2221 agaaaccctc aagagttcta attctttttc atctatttgc acagctattt gggtacctca 2281 ttttgtccct tgactttagt gtagtggtgg ggtctttccc accaccccac taagccttgg 2341 caacctcatt ttgtggcctt attattaaag attgcttggt agtgaaaagg gaagggccca 2401 gaatctgact ggctgattct gtctcctccc ctccctcctt ctttagttta accatcaggg 2461 cagacaggaa caaagtattt caaagcttgt tccatcaagt tagggattaa tcgattcctt 2521 tatgttttat aagagtttac cggccaaagc atattgtcat ctttggtcat ttttctgctc 2581 ctgcttctct ctcttcattg gggtatcaac attaccgccc ctgtctctct tcattcccct 2641 atcaacatta ccgcccctgt ctcttctgta actacttcag actcagccct cctggaggct 2701 gtcatgtagg ggactctcga ggttcctttg ttctacccca tctctccccc ccgaattc // LOCUS HSATPCITL 4297 bp RNA PRI 19-MAR-1992 DEFINITION H.sapiens mRNA for ATP-citrate lyase. ACCESSION X64330 NID g28934 KEYWORDS acetyl coA production; ATP citrate-lyase; cholesterol biosynthesis. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4297) AUTHORS Elshourbagy,N.A. TITLE Direct Submission JOURNAL Submitted (31-JAN-1992) N.A. Elshourbagy, Smithkline Beecham, P.O.Box 1539, King of Prussia, PA 19406-0939, USA REFERENCE 2 (bases 1 to 4297) AUTHORS Elshourbagy,N.A., Near,J.C., Kmetz,P.J., Wells,T.N., Groot,P.H., Saxty,B.A., Hughes,S.A., Franklin,M. and Gloger,I.S. TITLE Cloning and expression of a human ATP-citrate lyase cDNA JOURNAL Eur. J. Biochem. 204 (2), 491-499 (1992) MEDLINE 92174902 COMMENT Related sequence: J05210. FEATURES Location/Qualifiers source 1..4297 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult male" /tissue_type="liver" /clone_lib="lambda zap cDNA library" 5'UTR 1..84 gene 85..3402 /gene="ATP-citrate lyase" CDS 85..3402 /gene="ATP-citrate lyase" /EC_number="4.1.3.8" /codon_start=1 /product="ATP-citrate (pro-S-)-lyase" /db_xref="PID:g28935" /translation="MSAKAISEQTGKELLYKFICTTSAIQNRFKYARVTPDTDWARLL QDHPWLLSQNLVVKPDQLIKRRGKLGLVGVDLTLDGVKSWLKPRLGQEATVGKATGFL KNFLIEPFAPHSQAEEFYVCIYATREGDYVLFHHEGGVDVGDVDAKAQKLLVGVDEKL NPEDIKKHLLVHAPDDKKEILASFISGLFNFYEDLYFTYLEINPLVVTKDGVYVLDLA AKVDATADYICKVKWGDIEFPPPFGRVAYPEEAYIADLDAKSGASLKLTLLNPKGRIW TMVAGGGASVVYSDTICDLGGVNELANYGEYSGAPSEQQTYDYAKTILSLMTREKHPD GKILIIGGSIANFTNVAATFKGIVRAIRDYQGPLKEHEVTIFVRRGGPNYQEGLRVMG EVGKTTGIPIHVFGTETHMTAIVGMAWAPAIPNQPPTAAHTANFLLNAQRETSTPAPS RTASFYESMVDEVRADEVAPAKKAKPAMPQDSVPSPRSLQGKSTTLFSRHTKAIVWGM QTRAVQGMLDFDYVCSRDEPSVAAMVYPFTGDHKQKFYWGHKEILIPVFKNMADAMRK HPEVDVLINFASLRSAYDSTMETMNYAQIRTIAIIAEGIPEALTRKLIKKADQKGVTI IGPATVGGIKPGCFKIGNTGGMLDNILASKLYPQAAVAYVSRSGGMSNELNNIISRTT DGVYEGVAIGGDRYPGSTFMDHVLRYQDTPGVKMIVVLGEIGGTEEYKISRGIKEGRL TKPIVCWCIGTCATMFSSEVQFGHAGACANQASETAVAKNQALKEAGVFVPRSFDELG EIIQSVYEDLVANGVIVPAQEVPPPTVPMDYSWARELGLIRKPASFMTSICDERGQEL IYAGMPITEVFKEEMGIGGALGLLWFQKRLPKYSCQFIEMCLMVTADHGPAVSGAHNT IICARTAVELVSSLTSGLLTIGDRFGGALDAAAKMFSKAFDSGIIPMEFVNKMKKEGK LIMGIGHRVKSINNPDMRVQILKDYVRQHFPATPLLDYALEVEKITTSKKPNLILNVD GLIGVAFVDMLRNCGSFTREEADEYIDIGALNGIFVLGRSMGFIGHYLDQKRLKQGLY RHPWDDISYVLPEHMSM" 3'UTR 3403..4297 BASE COUNT 1048 a 1105 c 1132 g 1012 t ORIGIN 1 cccggatttt gcggggttcg tcgggcctgt ggaagaagcc ccgccacgga cttcggcaga 61 ggtagagcag gtctctctgc agccatgtcg gccaaggcaa tttcagagca gacgggcaaa 121 gaactccttt acaagttcat ctgtaccacc tcagccatcc agaatcggtt caagtatgct 181 cgggtcactc ctgacacaga ctgggcccgc ttgctgcagg accacccctg gctgctcagc 241 cagaacttgg tagtcaagcc agaccagctg atcaaacgtc gtggaaaact tggtctcgtt 301 ggggtcgacc tcactctgga tggggtcaag tcctggctga agccacggct gggacaggaa 361 gccacagttg gcaaggccac aggcttcctc aagaactttc tgatcgagcc cttcgccccc 421 cacagtcagg ctgaggagtt ctatgtctgc atctatgcca cccgagaagg ggactacgtc 481 ctgttccacc acgagggggg tgtggacgtg ggtgatgtgg acgccaaggc ccagaagctg 541 cttgttggcg tggatgagaa actgaatcct gaggacatca aaaaacacct gttggtccac 601 gcccctgacg acaagaaaga aattctggcc agttttatct ccggcctctt caatttctac 661 gaggacttgt acttcaccta cctcgagatc aatccccttg tagtgaccaa agatggagtc 721 tatgtccttg acttggcggc caaggtggac gccactgccg actacatctg caaagtgaag 781 tggggtgaca tcgagttccc tccccccttc gggcgggtgg catatccaga ggaagcctac 841 attgcagacc tcgatgccaa aagtggggca agcctgaagc tgaccttgct gaaccccaaa 901 gggaggatct ggaccatggt ggccgggggt ggcgcctctg tcgtgtacag cgataccatc 961 tgtgatctag ggggtgtcaa cgagctggca aactatgggg agtactcagg cgcccccagc 1021 gagcagcaga cctatgacta tgccaagact atcctctccc tcatgacccg agagaagcac 1081 ccagatggca agatcctcat cattggaggc agcatcgcaa acttcaccaa cgtggctgcc 1141 acgttcaagg gcatcgtgag agcaattcga gattaccagg gccccctgaa ggagcacgaa 1201 gtcacaatct ttgtccgaag aggtggcccc aactatcagg agggcttacg ggtgatggga 1261 gaagtcggga agaccactgg gatccccatc catgtctttg gcacagagac tcacatgacg 1321 gccattgtgg gcatggcctg ggcaccggcc atccccaacc agccacccac agcggcccac 1381 actgcaaact ttctcctcaa cgcccagcgg gagacatcga ctccagcccc cagcaggaca 1441 gcatcttttt atgagtccat ggtcgatgag gtcagggccg atgaggtggc gcctgcaaag 1501 aaggccaagc ctgccatgcc acaagattca gtcccaagtc caagatccct gcaaggaaag 1561 agcaccaccc tcttcagccg ccacaccaag gccattgtgt ggggcatgca gacccgggcc 1621 gtgcaaggca tgctggactt tgactatgtc tgctcccgag acgagccctc agtggctgcc 1681 atggtctatc ctttcactgg ggaccacaag cagaagtttt actgggggca caaagagatc 1741 ctgatccctg tcttcaagaa catggctgat gccatgagga agcacccgga ggtagatgtg 1801 ctcatcaact ttgcctctct ccgctctgcc tatgacagca ccatggagac catgaactat 1861 gcccagatcc ggaccatcgc catcatagct gaaggcatcc ctgaggccct cacgagaaag 1921 ctgatcaaga aggcggacca gaagggagtg accatcatcg gacctgccac tgttggaggc 1981 atcaagcctg ggtgctttaa gattggcaac acaggtggga tgctggacaa catcctggcc 2041 tccaaactgt acccccaggc agctgtggcc tatgtctcac gttccggagg catgtccaac 2101 gagctcaaca atatcatctc tcggaccacg gatggcgtct atgagggcgt ggccattggt 2161 ggggacaggt acccgggctc cacattcatg gatcatgtgt tacgctatca ggacactcca 2221 ggagtcaaaa tgattgtggt tcttggagag attgggggca ctgaggaata taagatttcc 2281 cggggcatca aggagggccg cctcactaag cccatcgtct gctggtgcat cgggacgtgt 2341 gccaccatgt tctcctctga ggtccagttt ggccatgctg gagcttgtgc caaccaggct 2401 tctgaaactg cagtagccaa gaaccaggct ttgaaggaag caggagtgtt tgtgccccgg 2461 agctttgatg agcttggaga gatcatccag tctgtatacg aagatctcgt ggccaatgga 2521 gtcattgtac ctgcccagga ggtgccgccc ccaaccgtgc ccatggacta ctcctgggcc 2581 agggagcttg gtttgatccg caaacctgcc tcgttcatga ccagcatctg cgatgagcga 2641 ggacaggagc tcatctacgc gggcatgccc atcactgagg tcttcaagga agagatgggc 2701 attggcgggg ccctcggcct cctctggttc cagaaaaggt tgcctaagta ctcttgccag 2761 ttcattgaga tgtgtctgat ggtgacagct gatcacgggc cagccgtctc tggagcccac 2821 aacaccatca tttgtgcgcg caccgcggtg gagctggtct ccagcctcac ctcggggctg 2881 ctcaccatcg gggatcggtt tgggggtgcc ttggatgcag cagccaagat gttcagtaaa 2941 gcctttgaca gtggcattat ccccatggag tttgtgaaca agatgaagaa ggaagggaag 3001 ctgatcatgg gcattggtca ccgagtgaag tcgataaaca acccagacat gcgagtgcag 3061 atcctcaaag attacgtcag gcagcacttc cctgccactc ctctgctcga ttatgcactg 3121 gaagtagaga agattaccac ctcgaagaag ccaaatctta tcctgaatgt agatggtctc 3181 atcggagtcg catttgtaga catgcttaga aactgtgggt cctttactcg ggaggaagct 3241 gatgaatata ttgacattgg agccctcaat ggcatctttg tgctgggaag gagtatgggg 3301 ttcattggac actatcttga tcagaagagg ctgaagcagg ggctgtatcg tcatccgtgg 3361 gatgatattt catatgttct tccggaacac atgagcatgt aacagagcca ggaaccctac 3421 tgcagtaaac tgaagacaag aactcttccc ccaagaaaaa gtgtcagaca gctggcagtg 3481 gagcctgctt tatttagcag gggcctggaa tgtaaacagc cactggggta caggcaccga 3541 agaccaacat ccacaggcta acaccccttc agtccacaca aagaagcttc atattttttt 3601 tataagcata gaaataaaaa ccaagccaat atttgtgact ttgctctgct acctgctgta 3661 tttattatat ggaagcatct aagtactgtc aggatggggt cttcctcatt gtagggcgtt 3721 aggatgttgc tttctttttc cattagttaa acattttttt ctcctttgga ggaagggaat 3781 gaaacattta tggcctcaag atactataca tttaaagcac cccaatgtct ctcttttttt 3841 ttttttttac ttcccttgct tcttccttat ataacatgaa gaacattgta ttaatctgat 3901 ttttaaagac tttttgtatg ttacgtgtta agggcttgtt tggtatccca ctgaaatgtt 3961 ctgtgttgca gaccagagtc tgtttatgtc agggggaggg ggccattgca tccttagcca 4021 ttgtcacaaa atatgtggag tagtaactta atatgtaaag ttgtaacata catacattta 4081 aaatggaaat gcagaaagct gtgaaatgtc ttgtgtctta tgttctctgt atttatgcag 4141 ctgatttgtc tgtctgtaac tgaagtgtgg gtccaaggac tcctaactac tttgcatctg 4201 taatccacaa agattctggg cagctgccac ctcagtctct tctctgtatt atcatagtct 4261 ggtttaaata aactatatag taacaggaat tcctgca // LOCUS HSATPSYN 1134 bp RNA PRI 30-JUN-1994 DEFINITION H.sapiens mRNA for H+-ATP synthase subunit b. ACCESSION X60221 NID g509290 KEYWORDS H+-ATP synthase; subunit. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1134) AUTHORS Higuti,T. TITLE Direct Submission JOURNAL Submitted (25-JUN-1991) T. Higuti, Paharmaceutical Sciences, The Univ of Tokushima, Shomachi, Tokushima 770, JAPAN REFERENCE 2 (bases 1 to 1134) AUTHORS Higuti,T., Tsurumi,C., Osaka,F., Kawamura,Y., Tsujita,H., Yoshihara,Y., Tani,I., Tanaka,K. and Ichihara,A. TITLE Molecular cloning of cDNA for the import precursor of human subunit B of H(+)-ATP synthase in mitochondria JOURNAL Biochem. Biophys. Res. Commun. 178 (3), 1014-1020 (1991) MEDLINE 91337033 FEATURES Location/Qualifiers source 1..1134 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 33..803 /codon_start=1 /product="H+-ATP synthase subunit b" /db_xref="PID:g509291" /db_xref="SWISS-PROT:P24539" /translation="MLSRVVLSAAATAAPSLKNAAFLGPGVLQATRTFHTGQPHLVPV PPLPEYGGKVRYGLIPEEFFQFLYPKTGVTGPYVLGTGLVLYALSKEIYVISAETFTA LSVLGVMVYGIKKYGPFVADFADKLNEQKLAQLEEAKQASIQHIQNAIDTEKSQQALV QKRHYLFDVQRNNIAMALEVTYRERLYRVYKEVKNRLDYHISVQNMMRRKEQEHMINW VEKHVVQSISTQQEKETIAKCIADLKLLAKKAQAQPVM" transit_peptide 33..158 /note="H+-ATP synthase" mat_peptide 159..800 /product="H+-ATP synthase subunit b" polyA_signal 1115..1120 BASE COUNT 335 a 242 c 245 g 312 t ORIGIN 1 cgctaagatt gctacctgga ctttcgttga ccatgctgtc ccgggtggta ctttccgccg 61 ccgccacagc ggccccctct ctgaagaatg cagccttcct aggtccaggg gtattgcagg 121 caacaaggac ctttcataca gggcagccac accttgtccc tgtaccacct cttcctgaat 181 acggaggaaa agttcgttat ggactgatcc ctgaggaatt cttccagttt ctttatccta 241 aaactggtgt aacaggaccc tatgtactcg gaactgggct tgtcttgtac gctttatcca 301 aagaaatata tgtgattagc gcagagacct tcactgccct atcagtacta ggtgtaatgg 361 tctatggaat taaaaaatat ggtccctttg ttgcagactt tgctgataaa ctcaatgagc 421 aaaaacttgc ccaactagaa gaggcgaagc aggcttccat ccaacacatc cagaatgcaa 481 ttgatacgga gaagtcacaa caggcactgg ttcagaagcg ccattacctt tttgatgtgc 541 aaaggaataa cattgctatg gctttggaag ttacttaccg ggaacgactg tatagagtat 601 ataaggaagt aaagaatcgc ctggactatc atatatctgt gcagaacatg atgcgtcgaa 661 aggaacaaga acacatgata aattgggtgg agaagcacgt ggtgcaaagc atctccacac 721 agcaggaaaa ggagacaatt gccaagtgca ttgcggacct aaagctgctg gcaaagaagg 781 ctcaagcaca gccagttatg taaatgtatc tatcccaatt gagtcagcta gaaacagttg 841 actgactaaa tggaaactag tctatttgac aaagtctttc tgtgttggtg tctactgaag 901 ttatagttta cccttcctaa aaatgaaaag tttgtttcat atagtgagag aacgaaatct 961 ctatcggcca gtcagatgtt tctcatcctt cttgctctgc ctttgagttg ttccgtgatc 1021 attctgaata agcatttgcc tttataaaaa cttgctgcct gactaaagat taacaggtta 1081 tagtttaaat ttgtaattaa ttctaccatc ttgcaataaa gtgacaattg aatg // LOCUS HSATPSYNT 772 bp RNA PRI 08-JAN-1996 DEFINITION H.sapiens mRNA for ATP synthase. ACCESSION X83218 NID g1008079 KEYWORDS ATP synthase; chromosome 21. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 772) AUTHORS Chen,H., Morris,M.A., Rossier,C., Blouin,J.L. and Antonarakis,S.E. TITLE Cloning of the cDNA for the human ATP synthase OSCP subunit (ATP5O) by exon trapping and mapping to chromosome 21q22.1-q22.2 JOURNAL Genomics 28 (3), 470-476 (1995) MEDLINE 96039258 REFERENCE 2 (bases 1 to 772) AUTHORS Antonarakis,S.E. TITLE Direct Submission JOURNAL Submitted (05-DEC-1994) S.E. Antonarakis, Div. of Med. Genetics, Univ. and Cantonal Hospital, CMU, 1 Rue Michel-Servet, 1211 Geneva, SWITZERLAND FEATURES Location/Qualifiers source 1..772 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain and muscle" /chromosome="21" /map="21Q22.1-22.2" sig_peptide 37..105 /gene="ATPO" /note="mitochondrial import" gene 37..678 /gene="ATPO" CDS 37..678 /gene="ATPO" /codon_start=1 /product="ATP synthase, oligomycin sensitivity conferring protein" /db_xref="PID:g1008080" /db_xref="SWISS-PROT:P48047" /translation="MAAPAVSGLSRQVRCFSTSVVRPFAKLVRPPVQVYGIEGRYATA LYSAASKQNKLEQVEKELLRVAQILKEPKVAASVLNPYVKRSIKVKSLNDITAKERFS PLTTNLINLLAENGRLSNTQGVVSAFSTMMSVHRGEVPCTVTSASPLEEATLSELKTV LKSFLSQGQVLKLEAKTDPSILGGMIVRIGEKYVDMSVKTKIQKLGRAMREIV" BASE COUNT 232 a 170 c 181 g 189 t ORIGIN 1 aagcttggca cgaggcctac aaccgcccgg gagaagatgg ctgccccagc agtgtccggg 61 ctctcccggc aggtgcgatg cttcagtacc tctgtggtca gaccatttgc caagcttgtg 121 aggcctcctg ttcaggtata cggtattgaa ggtcgctatg ccacagctct ttattctgct 181 gcatcaaaac agaataagct ggagcaagta gaaaaggagt tgttgagagt agcacaaatc 241 ctgaaggaac ccaaagtggc tgcttctgtt ttgaatccct atgtgaagcg ttccattaaa 301 gtgaaaagcc taaatgacat cacagcaaaa gagaggttct ctcccctcac taccaacctg 361 atcaatttgc ttgctgaaaa tggtcgatta agcaataccc aaggagtcgt ttctgccttt 421 tctaccatga tgagtgtcca tcgcggagag gtaccttgca cagtgacctc tgcatctcct 481 ttagaagaag ccacactctc tgaattaaaa actgtcctca agagcttcct aagtcaaggc 541 caagtattga aattggaggc taagactgat ccgtcaatct tgggtggaat gattgtgcgc 601 attggcgaga aatatgttga catgtctgtc aagaccaaga ttcagaagct gggcagggct 661 atgcgggaga ttgtctaaaa gtgttggttt tctgccatca gtgaaaattc ttaaacttgg 721 agcaacaata aaaagcttcc agaacagatc aaaaaaaaaa aaaaaaaaaa aa // LOCUS HSAUHMR 1548 bp RNA PRI 18-APR-1995 DEFINITION H.sapiens AUH mRNA. ACCESSION X79888 NID g780240 KEYWORDS AU-binding protein; AUH gene; enoyl-CoA hydratase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1548) AUTHORS Nakagawa,J., Waldner,H., Meyer-Monard,S., Hofsteenge,J., Jeno,P. and Moroni,C. TITLE AUH, a gene encoding an AU-specific RNA binding protein with intrinsic enoyl-CoA hydratase activity JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (6), 2051-2055 (1995) MEDLINE 95199290 REFERENCE 2 (bases 1 to 1548) AUTHORS Nakagawa,J. TITLE Direct Submission JOURNAL Submitted (28-JUN-1994) J. Nakagawa, Inst fuer Medizinische Mikrobiologie, Petersplatz 10, 4003 Basel, SWITZERLAND FEATURES Location/Qualifiers source 1..1548 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /cell_line="IMR32 human neuroblastoma" /clone_lib="lambda Uni-Zap" gene 5..1024 /gene="AUH" CDS 5..1024 /gene="AUH" /codon_start=1 /product="AU-binding protein/Enoyl-CoA hydratase" /db_xref="PID:g780241" /translation="MAAAVAAAPGALGSLHAGGARLVAACSAWLCPGLRLPGSLAGRR AGPAIWAQGWVPAAGGPAPKRGYSSEMKTEDELRVRHLEEENRGIVVLGINRAYGKNS LSKNLIKMLSKAVDALKSDKKVRTIIIRSEVPGIFCAGADLKERAKMSSSEVGPFVSK IRAVINDIANLPVPTIAAIDGLALGGGLELALACDIRVAASSAKMGLVETKLAIIPGG GGTQRLPRAIGMSLAKELIFSARVLDGKEAKAVGLISHVLEQNQEGDAAYRKALDLAR EFLPQGPVAMRVAKLAINQGMEVDLVTGLAIEEACYAQTIPTKDRLEGLLAFKEKRPP RYKGE" BASE COUNT 467 a 298 c 394 g 389 t ORIGIN 1 caacatggcg gccgcggtgg cggcggcacc tggggccttg ggatccctgc atgctggcgg 61 cgcccgcctg gtggccgctt gcagtgcgtg gctctgcccg gggttgaggc tgcccggctc 121 gttggcaggc cggcgagcgg gcccggcgat ctgggcccag ggctgggtac ctgcggccgg 181 gggtcccgcc ccgaaaaggg gctacagctc tgagatgaag acggaggacg agctgcgggt 241 gcggcacctg gaggaggaga accgaggaat tgtggtgctt ggaataaaca gagcttatgg 301 caaaaattca ctcagtaaaa atcttataaa aatgctatca aaagctgtgg atgctttgaa 361 atctgataag aaagtacgga ccataataat caggagtgaa gtcccaggga tattctgtgc 421 tggtgctgac cttaaggaaa gagccaaaat gagttccagt gaagttggtc cttttgtctc 481 caaaataaga gcagtgatta acgatattgc taatcttcca gtgccaacaa ttgcagcaat 541 agatggactc gctttaggtg gtggtcttga actggcttta gcctgtgata tacgagtagc 601 agcttcctct gcaaaaatgg gcctggttga aacaaaattg gcgattattc ctggtggagg 661 ggggacacag cgattgccac gcgccattgg aatgtccctg gccaaggagc tcatattctc 721 tgcgcgagtc ctcgatggca aagaagccaa agcagtgggc ttaatcagcc acgttctgga 781 acagaaccag gagggagacg cggcctacag gaaggccttg gacctggcga gagagttttt 841 acctcaggga cctgttgcaa tgagagtggc aaaattagca attaatcaag ggatggaggt 901 cgatttagta acagggttag ccatagaaga agcttgttat gctcagacca ttccaacaaa 961 agacagactt gaaggtcttc ttgcttttaa agagaaaagg ccccctcgct ataaaggaga 1021 ataaaaggaa cagaaattct taagatgcca atgtaataaa tgtacttcct ggaagtgtct 1081 ttcggatcca ctatatgcct cagcacatgg aaccttaatg accaaagtga agagcagatt 1141 attcatacgg tgtaataagc gtctggaatg gacccatccg tgtacttcat tcaaatgtgt 1201 aaatgtcata ttcattcaga tttataaagc tagtagtgta tagtcagaaa cagaatcaaa 1261 gttagatata catttttaaa tatttactgc atatgaggct ttctgttaat tttttaatgt 1321 gaataattta tatattgcac attctaggaa taatattgat tgtatgtcta ctgtgctgca 1381 ttaagaaaat aaaatttcta tataccaaaa atgtgaagtt ataccaaata aagtttctaa 1441 gtgattaatg catacgaaca gctacatata catatatcta aacctgaaaa atgaattgat 1501 attctgagtg aaaactacct aatataaata aaattagtga aaagaaaa // LOCUS HSAUTAN64 3881 bp RNA PRI 25-JUL-1991 DEFINITION Human mRNA for a 64 Kd autoantigen expressed in thyroid and extra-ocular muscle. ACCESSION X54162 NID g28968 KEYWORDS 64 Kd autoantigen; autoantigen. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3881) AUTHORS Dong,Q.H. TITLE Direct Submission JOURNAL Submitted (30-JUL-1990) Dong Q.H., Institute of Interdisciplinary Research, School of Medicine, Universite Libre de Bruxelles, Campus Erasme, 808 Route de Lennik, 1070 Brussels, Belgium REFERENCE 2 (bases 1 to 3881) AUTHORS Dong,Q., Ludgate,M. and Vassart,G. TITLE Cloning and sequencing of a novel 64-kDa autoantigen recognized by patients with autoimmune thyroid disease JOURNAL J. Clin. Endocrinol. Metab. 72 (6), 1375-1381 (1991) MEDLINE 91225220 FEATURES Location/Qualifiers source 1..3881 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="thyroid" /clone_lib="lambda gt11" /clone="D1" mRNA 1..3881 /evidence=experimental CDS 213..1931 /codon_start=1 /product="64 Kd autoantigen" /db_xref="PID:g28969" /db_xref="SWISS-PROT:P29536" /translation="MEELEKELDVVDPDGSVPVGLRQRNQTEKQSTGVYNREAMLNFC EKETKKLMQREMSMDESKQVETKTDAKNGQERGRDASKKALGPRRNSDLGKEPKRGGL KKSFSRDRDEAGGKSGEKPKEEKIIRGIDKGRVRAAVDKKEAGKDGRGEERAVATKKE EEKKGGDRNTGLSRDKDKKREEMKEVAKKEDDEKVKGERRNTDTRKEGEKMKRAGGNT DMKKEDEKVKRGTGNTDTKKDDEKVKKNEPLHEKEAKDDSKTKTPEKQTPSGPTKPSE GPAKVEEEAAPSIFDEPLERVKNNDPEMTEVNVNNSDCITNEILVRFTEALEFNTVVK LFALANTRADDHVAFAIAIMLKANKTITSLNLDSNHITGKGILAIFRALLQNNTLTEL RFHNQRHICGGKTEMEIAKLLKENTSLLKLGYHFELAGPRMTVTNLLSRNMDKQRQKR LQEQRQAQEAKGEKKDLLEVPKAGAVAKGSPKPSPQPSPKPSPKNSPKKGGAPAAPPP PPPPLAPPLIMENLKNSLSPATQRKMGDKVLPAQEKNSRDQLLAAIRSSNLKQLKKVE VPKLLQ" BASE COUNT 1065 a 1030 c 1084 g 702 t ORIGIN 1 gctgaagtgt tcgaccagca ggaggttttc tcctcagccc actcgctgca tccagatcag 61 ctcaccccgc gccctttcct gcccaccagg actctgatag cccctggcag ccacagccca 121 ttttgccaag atgtctagag tagccaaata tcgccggcag tgagtgaaga ccccgacatc 181 gacagcctgc tgggaccctg tctcccgagg agatggagga gctggagaag gagctggacg 241 tggtggaccc agacgggagt gttcccgtgg ggctgcggca gagaaaccag acggagaaac 301 agtccacggg tgtgtacaac cgggaggcca tgctcaactt ctgtgaaaag gagaccaaga 361 aacttatgca gagggagatg tccatggatg aaagcaagca agtggagacc aagacagatg 421 ccaagaatgg acaggaaagg ggcagagatg ccagcaaaaa agccctgggc cccagacgga 481 actcagatct ggggaaggag ccaaagaggg gtggtttaaa gaaaagcttc tctagagaca 541 gagatgaagc tggtggcaag agtggcgaga agcccaagga ggagaagatc atccggggca 601 ttgacaaggg ccgggtcagg gctgcagtgg ataagaagga ggcagggaag gatgggagag 661 gagaggagag ggcagtggcc accaagaagg aagaggagaa gaaagggggt gacaggaaca 721 caggcttgag cagggacaag gataaaaaga gagaggagat gaaggaggtg gccaagaaag 781 aggatgatga gaaggtaaaa ggggagcgta ggaacacaga caccagaaaa gagggtgaga 841 agatgaaaag agcaggtggg aacacagaca tgaaaaagga ggatgagaag gtaaaaagag 901 gaactgggaa cacagacacc aaaaaggacg atgaaaaagt caagaagaat gaacccttac 961 atgaaaagga agccaaggat gacagcaaga ccaaaacacc cgagaaacag acgcccagtg 1021 gccccaccaa gccctctgaa ggaccggcca aggtggagga ggaggcagct cccagcatat 1081 ttgatgagcc tctggagaga gtgaagaaca atgaccccga gatgactgag gtgaacgtca 1141 acaactcaga ctgcatcaca aatgagatct tggtccggtt tactgaggct ctggagttca 1201 acactgtggt taagctgttc gccttggcca acacgcgagc cgatgaccac gtggcctttg 1261 ccattgccat catgctcaag gccaacaaga ccatcaccag cctcaacctg gactccaacc 1321 acatcacagg caaaggcatc ctggccatct tccgggccct cctccagaac aacacgctga 1381 ccgagctccg cttccacaac cagcgacaca tctgtggagg caagacggag atggagatcg 1441 ccaagctgct gaaggagaat acctccctgc tcaagctggg ctaccatttt gagctggccg 1501 ggccccgaat gactgtcacc aatctgctca gccgcaacat ggacaagcag agacaaaagc 1561 ggctgcagga gcaaaggcag gcacaggaag ccaagggaga gaagaaggat ctgctggagg 1621 tacccaaggc cggggccgtg gctaagggct ccccaaaacc ttcacctcaa ccatctccaa 1681 agccctctcc aaagaactca cccaaaaaag ggggtgctcc agctgcccca ccaccccctc 1741 cccctccctt ggctccaccc cttatcatgg agaacctgaa gaattcactc tcaccagcta 1801 cccagaggaa gatgggagac aaagtcctcc ctgcccagga gaagaactcc cgtgaccagc 1861 tattggctgc catccgctcc agcaacctca agcagctcaa gaaggtggaa gtgcccaaac 1921 tgcttcagta ggaccaggct gccaggcacc atctgccaat gccatgactg ctcaggcctc 1981 acctcccagg gctacacaga ccctgcccac cccatccctg gctgacctgc tgtggatgtc 2041 cctattctgc catgggagcg tccaggcctg ggtcacgctc aaggaaggat gccttatctc 2101 ttctcacttt ccttttcttg tctctgaggc tctccaaatt ttgctttagt acatggagct 2161 caggtttctg gacaagaaga gtccttttag cacatcactg agaagatggc actgtccagg 2221 gcccatgtag ctggcaagct gcaaaaggcc tgtgatccag gaaagatgtc ccacagggac 2281 cacatccacc ccagccccac tgccctccag ggccaggatt caggcctctg aggagcccac 2341 ggggcaaagc tgctgggcca gtggcactct gtgtgggaaa atggcagaaa gatggagagg 2401 catgggggcc caaaggggag cgtggggagg ggctgaggat accccaaagt ccaggctaat 2461 tagaggatgt ggcaggggca gtggcctgga tgcacagtgc ctgatgggag taggctccag 2521 acaggaggag tgggacagac agcagctgga cttgaaggtt tgatgccaaa gcagacattt 2581 tcctcacacc cacctgctgc tgtatgaata gctgtgtatc tgtttttcca taagattttg 2641 ataatatata caaaccttta gctgtgaatg gctgtgcccc acctgttgtc ctgaactgtg 2701 agtcctgatc ctaaccctgg gctccctgga ggactctaga agctcaggtt ccctgccaca 2761 ctatttgagt tggccaagaa ataaattcac atcctcagaa agtgcagcat ggaggaaaat 2821 ctgaactcta agcagaagac tctccactga cctggttgtc caggtctaga aggccaggcc 2881 tctactaggt ctgctcctga accagtcctg ctgcctggag tcagtagcca gagttgttct 2941 caggggtgct ggggcagagt ggagcccagg gtgctgggat ggctatatta ggcatgttca 3001 gggatgctca ttccatgact ctgcctaacc atgggctcag ggccaggtcc tcacagcagt 3061 cacaggccca ggaaggcggc aggcagagaa gtggagtgac tatttggaga atagcaccca 3121 tatctgtgtg ccctagggct cagaggggcc tcatcttccc cagccctccc cacctgctca 3181 ccaattccac ttcctgcccc aactgcagga atgctgacaa tgctgccatg cccaccatcg 3241 ggtgtaggtg aaaggcatct ttctgaattt cattctcttg aaggtgctgc caccccttgg 3301 cactgtggaa ctgccacctt gggtctgtgt cacttgtagg tttctctgcc tccaggttgc 3361 ctcaacagca ggaggcacag cagtttcacc atctttgagg tgagggtggg gtgccccagc 3421 taggaagcaa gatcgctgtg ctaggtctga ccaaaaccag agggcagtct agtcctgggg 3481 gtaaagccct cagatcccag ggtacactct tctccattcc ctccacccac ttgcctgtca 3541 ccccagtcac ctaagcaatc actgggccca gaggagagga gacagacaca cactggctcc 3601 tggacctaaa gggtatgagc tggagctaag gccagctaga gcttccactg tcagccctca 3661 ctgtcagccc cactgcaccc ccctgtgcct gctgggcact gggcactagc tagatgcttt 3721 aggttgcttc agctgatcct tcaactctgt gaggtggata ccaatattct attttgcaga 3781 tagaatttgg cccagagagg ttaactaata tatccatgat cacacagcta ataaaagtca 3841 gagctcagga aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa a // LOCUS HSB2BRC14 1231 bp RNA PRI 08-MAR-1996 DEFINITION H.sapiens mRNA for B2-bradykinin receptor, C14 allele. ACCESSION X86164 NID g1220155 KEYWORDS B2 bradykinin receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1231) AUTHORS Braun,A. TITLE Direct Submission JOURNAL Submitted (10-APR-1995) A. Braun, University of Munich, Dept of Pediatrics, Lindwurmstr. 4, 80337 Munich, FRG REFERENCE 2 (bases 1 to 1231) AUTHORS Braun,A., Kammerer,S., Bohme,E., Muller,B. and Roscher,A.A. TITLE Identification of polymorphic sites of the human bradykinin B2 receptor gene JOURNAL Biochem. Biophys. Res. Commun. 211 (1), 234-240 (1995) MEDLINE 95298028 COMMENT Related sequences: M88714 and Ma, Genomics 23; 362-369,1994. FEATURES Location/Qualifiers source 1..1231 /organism="Homo sapiens" /db_xref="taxon:9606" /map="14q32" gene 1..1176 /gene="B-2 bradykinin receptor gene" CDS 1..1176 /gene="B-2 bradykinin receptor gene" /note="coding region allele C14" /codon_start=1 /db_xref="PID:e146227" /db_xref="PID:g1220156" /transl_except=(pos:40..42,aa:Arg) /translation="MFSPWKISMFLSVREDSVPTTASFSADMLNVTLQGPTLNGTFAQ SKCPQVEWLGWLNTIQPPFLWVLFVLATLENIFVLSVFCLHKSSCTVAEIYLGNLAAA DLILACGLPFWAITISNNFDWLFGETLCRVVNAIISMNLYSSICFLMLVSIDRYLALV KTMSMGRMRGVRWAKLYSLVIWGCTLLLSSPMLVFRTMKEYSDEGHNVTACVISYPSL IWEVFTNMLLNVVGFLLPLSVITFCTMQIMQVLRNNEMQKFKEIQTERRATVLVLVVL LLFIICWLPFQISTFLDTLHRLGILSSCQDERIIDVITQIASFMAYSNSCLNPLVYVI VGKRFRKKSWEVYQGVCQKGGCRSEPIQMENSMGTLRTSISVERQIHKLQDWAGSRQ" BASE COUNT 250 a 366 c 342 g 273 t ORIGIN 1 atgttctctc cctggaagat atcaatgttt ctgtctgttt gtgaggactc cgtgcccacc 61 acggcctctt tcagcgccga catgctcaat gtcaccttgc aagggcccac tcttaacggg 121 acctttgccc agagcaaatg cccccaagtg gagtggctgg gctggctcaa caccatccag 181 ccccccttcc tctgggtgct gttcgtgctg gccaccctag agaacatctt tgtcctcagc 241 gtcttctgcc tgcacaagag cagctgcacg gtggcagaga tctacctggg gaacctggcc 301 gcagcagacc tgatcctggc ctgcgggctg cccttctggg ccatcaccat ctccaacaac 361 ttcgactggc tctttgggga gacgctctgc cgcgtggtga atgccattat ctccatgaac 421 ctgtacagca gcatctgttt cctgatgctg gtgagcatcg accgctacct ggccctggtg 481 aaaaccatgt ccatgggccg gatgcgcggc gtgcgctggg ccaagctcta cagcttggtg 541 atctgggggt gtacgctgct cctgagctca cccatgctgg tgttccggac catgaaggag 601 tacagcgatg agggccacaa cgtcaccgct tgtgtcatca gctacccatc cctcatctgg 661 gaagtgttca ccaacatgct cctgaatgtc gtgggcttcc tgctgcccct gagtgtcatc 721 accttctgca cgatgcagat catgcaggtg ctgcggaaca acgagatgca gaagttcaag 781 gagatccaga cggagaggag ggccacggtg ctagtcctgg ttgtgctgct gctattcatc 841 atctgctggc tgcccttcca gatcagcacc ttcctggata cgctgcatcg cctcggcatc 901 ctctccagct gccaggacga gcgcatcatc gatgtaatca cacagatcgc ctccttcatg 961 gcctacagca acagctgcct caacccactg gtgtacgtga tcgtgggcaa gcgcttccga 1021 aagaagtctt gggaggtgta ccagggagtg tgccagaaag ggggctgcag gtcagaaccc 1081 attcagatgg agaactccat gggcacactg cggacctcca tctccgtgga acgccagatt 1141 cacaaactgc aggactgggc agggagcaga cagtgagcaa acgccagcag ggctgctgtg 1201 aatttgtgta aggattgagg gacagttgct t // LOCUS HSB4BMR 2675 bp RNA PRI 24-JUL-1996 DEFINITION H.sapiens mRNA for B4B. ACCESSION Z50751 NID g1460067 KEYWORDS B4B gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2675) AUTHORS Ruegg,C.L., Wu,H.Y., Fagnoni,F.F., Engleman,E.G. and Laus,R. TITLE B4B, a novel growth-arrest gene, is expressed by a subset of progenitor/pre-B lymphocytes negative for cytoplasmic mu-chain JOURNAL J. Immunol. 157 (1), 72-80 (1996) MEDLINE 96264675 REFERENCE 2 (bases 1 to 2675) AUTHORS Ruegg,C.L. TITLE Direct Submission JOURNAL Submitted (11-AUG-1995) Curtis L. Ruegg, Molecular Immunology, Activated Cell Therapy, Inc., 291 North Bernardo Avenue, Mountain View, CA, 94043, USA FEATURES Location/Qualifiers source 1..2675 /organism="Homo sapiens" /isolate="pooled anonymous blood donors" /db_xref="taxon:9606" /clone="1-12" /dev_stage="adult" /tissue_type="blood" /cell_type="B-cell precursor" /clone_lib="IDC phage library" /chromosome="20q12-13.1" 5'UTR 1..122 gene 123..596 /gene="B4B" CDS 123..596 /gene="B4B" /codon_start=1 /db_xref="PID:e194946" /db_xref="PID:g1460068" /translation="MLVLLAGIFVVHIATVIMLFVSTIANVWLVSNTVDASVGLWKNC TNISCSDSLSYASEDALKTVQAFMILSIIFCVIALLVFVFQLFTMEKGNRFFLSGATT LVCWLCILVGVSIYTSHYANRDGTQYHHGYSYILGWICFCFSFIIGVLYLVLRKK" BASE COUNT 699 a 584 c 627 g 765 t ORIGIN 1 ccgcatactt ccagaagagc ggaccagggc tgctgccagc acctgccact cagagcgcct 61 ctgtcgctgg gacccttcag aactctcttt gctcacaagt taccaaaaaa aaaagagcca 121 acatgttggt attgctggct ggtatctttg tggtccacat cgctactgtt attatgctat 181 ttgttagcac cattgccaat gtctggttgg tttccaatac ggtagatgca tcagtaggtc 241 tttggaaaaa ctgtaccaac attagctgca gtgacagcct gtcatatgcc agtgaagatg 301 ccctcaagac agtgcaggcc ttcatgattc tctctatcat cttctgtgtc attgccctcc 361 tggtcttcgt gttccagctc ttcaccatgg agaagggaaa ccggttcttc ctctcagggg 421 ccaccacact ggtgtgctgg ctgtgcattc ttgtgggggt gtccatctac actagtcatt 481 atgcgaatcg tgatggaacg cagtatcacc acggctattc ctacatcctg ggctggatct 541 gcttctgctt cagcttcatc atcggcgttc tctatctggt cctgagaaag aaataaggcc 601 ggacgagttc atggggatct ggggggtggg gaggaggaag ccgttgaatc tgggagggaa 661 gtggaggttg ctgtacagga aaaaccgaga taggggaggg gggaggggga agcaaagggg 721 ggaggtcaaa tcccaaacca ttactgaggg gattctctac tgccaagccc ctgccctggg 781 gagaaagtag ttggctagta ctttgatgct cccttgatgg ggtccagaga gcctccctgc 841 agccaccaga cttggcctcc agctgttctt agtgacacac actgtctggg gccccatcag 901 ctgccacaac accagcccca cttctgggtc atgcactgag gtccacagac ctactgcact 961 gagttaaaat agcggtacaa gttctggcaa gagcagatac tgtctttgtg ctgaatacgc 1021 taagcctgga agccatcctg cccttctgac ccaaagcaaa acatcacatt ccagtctgaa 1081 gtgcctactg gggggctttg gcctgtgagc cattgtccct ctttggaaca gatatttagc 1141 tctgtggaat tcagtgacaa aatgggagga ggaaagagag tttgtaaggt catgctggtg 1201 ggttagctaa accaagaagg agaccttttc acaatggaaa acctggggga tggtcagagc 1261 ccagtcgaga cctcacacac ggctgtccct catggagacc tcatgccatg gtctttgcta 1321 ggcctcttgc tgaaagccaa ggcagctctt ctggagtttc tctaaagtca ctagtgaaca 1381 attcggtggt aaaagtacca cacaaactat gggatccaag gggcagtctt gcaacagtgc 1441 catgttaggg ttatgttttt aggattcccc tcaatgcagt cagtgtttct tttaagtata 1501 caacaggaga gagatggaca tggctcattg tagcacaatc ctattactct tcctctaaca 1561 tttttgagga agttttgtct aattatcaat attgaggatc agggctccta ggctcagtgg 1621 tagctctggc ttagacacca cctggagtga tcacctcttg gggaccctgc ctatcccact 1681 tcacaggtga ggcatggcaa ttctggaagc tgattaaaac acacataaac caaaaccaaa 1741 caacaggccc ttgggtgaaa ggtgctatat aattgtgaag tattaagcct accgtatttc 1801 agccatgata agaacagagt gcctgcattc ccaggaaaat acgaaaatcc catgagataa 1861 ataaaaatat aggtgatggg cagatctttt ctttaaaata aaaaagcaaa aactcttgtg 1921 gtacctagtc agatggtaga cgagctgtct gctgccgcag gagcacctct atacaggact 1981 tagaagtagt atgttattcc tggttaagca ggcattgctt tgccctggag cagctatttt 2041 aagccatctc agattctgtc taaaggggtt ttttgggaag acgttttctt tatcgccctg 2101 agaagatcta ccccagggag aatctgagac atcttgccta cttttcttta ttagctttct 2161 cctcattcat ttcttttata cctttccttt ttggggagtt gttatgccat gatttttggt 2221 atttatgtaa aaggattatt actaattcta tttctctatg tttattctag ttaaggaaat 2281 gttgagggca agccaccaaa ttacctaggc tgaggttaga gagattggcc agcaaaaact 2341 gtgggaagat gaactttgtc attatgattt cattatcaca tgattataga aggctgtctt 2401 agtgcaaaaa acatacttac atttcagaca tatccaaagg gaatactcac attttgttaa 2461 gaagttgaac tatgactgga gtaaaccatg tatcccctta tcttttactt tttttctgtg 2521 acatttatgt ctcatgtaat ttgcattact ctggtggatt gttctagtac tgtattgggc 2581 ttcttcgtta atagattatt tcatatacta taattgtaaa tattttgata caaatgttta 2641 taactctagg gatataaaaa cagattctga ttccc // LOCUS HSB73 1182 bp RNA PRI 08-JAN-1997 DEFINITION H.sapiens mRNA for put. B7,3 molecule of CD80-CD60 protein family. ACCESSION Y07827 NID g1770367 KEYWORDS major histocompatibility complex. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1182) AUTHORS Henry,J., Ribouchon,M.T., Depetris,D., Mattei,M.G., Offer,C., Tazi-Ahnini,R. and Pantarotti,P. TITLE Cloning, structural analysis and mapping of B30 and B7 family members, to the MHC and other chromosomal regions. Toward the identification of the ancestral major histocompatability complex JOURNAL Unpublished REFERENCE 2 (bases 1 to 1182) AUTHORS Pontarotti,P. TITLE Direct Submission JOURNAL Submitted (06-SEP-1996) P. Pontarotti, Unite 119 INSERM, 27 bd.Lei Roure, 13009 Marseille, FRANCE FEATURES Location/Qualifiers source 1..1182 /organism="Homo sapiens" /db_xref="taxon:9606" /map="6p23" /chromosome="6" CDS 97..1149 /note="cDNA corresponding to the full length of EST clone in T86577; expresssion:antigen presenting cells" /codon_start=1 /product="put. B7,3 molecule of CD80-CD86 family" /db_xref="PID:e283126" /db_xref="PID:g1770368" /translation="MASFLAFLLLNFRVCLLLLQLLMPHSAQFSVLGPSGPILAMVGE DADLPCHLFPTMSAETMELKWVSSSLRQVVNVYADGKEVEDRQSAPYRGRTSILRDGI TAGKAAFRIHNVTGSDRWKYLCYFQDGDFYEKALVELKVAALGSDLHVDVKGYKDGGI HLECRSTGWYPQPQIQWSNNKGENIPTVEAPVVADGVGLYAVAASVIMRGSSGEGVSC TIRNSLLGLEKTASISIARPFFRSAQRWIAALAGTLPVLLLLLGGAGYFLWQQQEEKK TQFRKKKREQELREMAWSTMKQEQSTRVKLLEELRWRSIQYASRGERHSAYNEWKKAL FKPGEEMLQMRLHFVK" BASE COUNT 322 a 263 c 335 g 262 t ORIGIN 1 ttcggcacga gagaactatt aactgccttt cttctgtggg ctgtgatttt cagaggggaa 61 tgctaaagta tctcctgata tgcagcatga atgaaaatgg caagtttcct ggccttcctt 121 ctgctcaact ttcgtgtctg cctccttttg cttcagctgc tcatgcctca ctcagctcag 181 ttttctgtgc ttggaccctc tgggcccatc ctggccatgg tgggtgaaga cgctgatctg 241 ccctgtcacc tgttcccgac catgagtgca gagaccatgg agctgaagtg ggtaagttcc 301 agcctaaggc aggtggtgaa cgtgtatgca gatggaaagg aagtggaaga caggcagagt 361 gcaccgtatc gagggagaac ttcgattctg cgggatggca tcactgcagg gaaggctgct 421 ttccgaatac acaacgtcac aggctctgac aggtggaagt acctgtgtta tttccaagat 481 ggtgacttct atgaaaaagc cctggtggag ctgaaggttg cagcactggg ttctgatctt 541 cacgttgatg tgaagggtta caaggatgga gggatccatc tggagtgcag gtccactggc 601 tggtaccccc aaccccaaat acagtggagc aacaacaagg gagagaacat cccgactgtg 661 gaagcacctg tggttgcaga cggagtgggc ctgtatgcag tagcagcatc tgtgatcatg 721 agaggcagct ctggggaggg tgtatcctgt accatcagaa attccctcct cggcctggaa 781 aagacagcca gcatttccat cgcaagaccc ttcttcagga gcgcccagag gtggatcgcc 841 gccctggcag ggaccctgcc tgtcttgctg ctgctccttg ggggagccgg ttacttcctg 901 tggcaacagc aggaggaaaa aaagactcag ttcagaaaga aaaagagaga gcaagagttg 961 agagaaatgg catggagcac aatgaagcaa gaacaaagca caagagtgaa gctcctggag 1021 gaactcagat ggagaagtat ccagtatgca tctcggggag agagacattc agcctataat 1081 gaatggaaaa aggccctctt caagcctggt gaggaaatgc ttcagatgag gctccacttt 1141 gttaaataaa atggatgaat gaaaaaaaaa aaaaaaaaaa aa // LOCUS HSBADD 2597 bp RNA PRI 31-DEC-1992 DEFINITION Human mRNA for beta adducin. ACCESSION X58199 NID g29368 KEYWORDS beta adducin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2597) AUTHORS Gilligan,D.M. TITLE Direct Submission JOURNAL Submitted (08-MAR-1991) D.M. Gilligan, Duke University Medical Center, Howard Hughes Medical Institute, Box 3892, Durham NC 27710, USA REFERENCE 2 (bases 1 to 2597) AUTHORS Joshi,R., Gilligan,D.M., Otto,E., McLaughlin,T. and Bennett,V. TITLE Primary structure and domain organization of human alpha and beta adducin JOURNAL J. Cell Biol. 115 (3), 665-675 (1991) MEDLINE 92011907 FEATURES Location/Qualifiers source 1..2597 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda gt11" /clone="K11, K3" mRNA 1..2597 /note="beta adducin" /evidence=experimental CDS 323..2503 /codon_start=1 /product="beta adducin" /db_xref="PID:g29369" /db_xref="SWISS-PROT:P35612" /translation="MSEETVPEAASPPPPQGQPYFDRFSEDDPEYMRLRNRAADLRQD FNLMEQKKRVTMILQSPSFREELEGLIQEQMKKGNNSSNIWALRQIADFMASTSHAVF PTSSMNVSMMTPINDLHTADSLNLAKGERLMRCKISSVYRLLDLYGWAQLSDTYVTLR VSKEQDHFLISPKGVSCSEVTASSLIKVNILGEVVEKGSSCFPVDTTGFCLHSAIYAA RPDVRCIIHLHTPATAAVSAMKWGLLPVSHNALLVGDMAYYDFNGEMEQEADRINLQK CLGPTCKILVLRNHGVVALGDTVEEAFYKIFHLQAACEIQVSALSSAGGVENLILLEQ EKHRPHEVGSVQWAGSTFGPMQKSRLGEHEFEALMRMLDNLGYRTGYTYRHPFVQEKT KHKSEVEIPATVTAFVFEEDGAPVPALRQHAQKQQKEKTRWLNTPNTYLRVNVADEVQ RSMGSPRPKTTWMKADEVEKSSSGMPIRIENPNQFVPLYTDPQEVLEMRNKIREQNRQ DVKSAGPQSQLLASVIAEKSRSPSTESQLMSKGDEDTKDDSEETVPNPFSQLTDQELE EYKKEVERKKLELDGEKETAPEEPGSPAKSAPASPVQSPAKEAETKSPLVSPSKSLEE GTKKTETSKAATTEPETTQPEGVVVNGREEEQTAEEILSKGLSQMTTSADTDVDTSKD KTESVTSGPMSPEGSPSKSPSKKKKKFRTPSFLKKSKKKEKVES" BASE COUNT 674 a 749 c 734 g 440 t ORIGIN 1 gaatgtctgc acagccgctt tccacacaga catcataaca aaaaatttcc accaaacccc 61 ctccccccgc ttctggccac agcacttaaa cacatctctg ccaaacccat aaaataacaa 121 aaccaacccg cagtggccga ccggagatag ctaagatgcc gcgcaggagt ttccacctgg 181 atgtttgagg ttgtgtagat gtggccggca cccttgagag tggagctagg gggtgcagac 241 tgagcagtga acagaaggag ccttggacag ggctgggcca gcctcccgag ttccaggagc 301 gaattgcaaa cccaccggga aaatgagcga agagacggtc cccgaggctg cctcgccgcc 361 gcccccgcag gggcagcctt actttgaccg cttctcagag gacgaccccg agtacatgcg 421 ccttcgcaac cgggcggcgg acctgcggca ggacttcaac ctgatggagc agaagaagcg 481 cgtcaccatg atcctgcaga gtccctcttt cagggaggag ctggaaggcc tcatccagga 541 gcagatgaag aaggggaaca actcctccaa catctgggcc ctgcgacaga tcgcggactt 601 catggccagc acctcccacg cagtcttccc gacatcttcc atgaatgtct ccatgatgac 661 gcctatcaat gacctccaca cagctgactc cctgaacctg gccaaagggg agcggctcat 721 gcggtgcaag atcagcagtg tctaccgact cctggacctc tatggctggg cccagctgag 781 tgacacctat gtcacgttga gagtcagcaa ggagcaggac cacttcctga tcagccctaa 841 gggagtttct tgcagtgaag tcacagcgtc cagcctgatc aaggtgaaca ttctgggaga 901 ggtggtggag aagggcagca gctgcttccc agtggacacc acaggcttct gtctgcactc 961 ggccatctat gcagcgaggc ccgacgtgcg ctgcatcatc cacctgcaca caccggccac 1021 agcagcggtg tcggccatga agtggggcct cctgcctgtc tcccacaatg ccctgctggt 1081 gggggacatg gcctattatg acttcaatgg ggaaatggag caggaagccg atcggatcaa 1141 cctgcagaag tgccttggac ccacctgcaa gatcctggtg ctaagaaacc atggagtggt 1201 tgctctgggt gacacggtag aggaggcatt ttacaagatc ttccacctgc aggctgcatg 1261 tgagatacag gtgtcggctc tgtccagtgc cgggggagtg gagaacctca tcctcctgga 1321 gcaggagaag caccggcccc atgaggtggg ctccgtgcag tgggccggga gcacctttgg 1381 gcctatgcag aagagtcggc tgggggagca tgagtttgag gccctcatga ggatgctgga 1441 caacctgggc tacagaacag gttacacgta tcgccacccc tttgttcaag agaaaaccaa 1501 acacaaaagt gaggtggaga ttccagccac ggtcacagcc ttcgtgtttg aggaggacgg 1561 tgccccggtg cccgccctgc gacagcatgc ccagaagcag cagaaggaga agacccgctg 1621 gctcaatacg cccaacacct acctgcgggt caatgtggcc gatgaggtcc agaggagcat 1681 gggcagcccc cgacccaaga ccacgtggat gaaggctgac gaggtggaga aatccagcag 1741 tggcatgccg attcgcatcg aaaacccaaa ccaatttgtg cctctctata ctgaccccca 1801 ggaagtactg gagatgagga acaagattcg agaacaaaac cgacaagatg tgaagtcagc 1861 ggggcctcag tcccagctcc tggcgagcgt cattgccgag aagagccgaa gcccgtctac 1921 agagagccag ctgatgtcca agggagacga ggataccaaa gacgattcag aggagacggt 1981 gcccaacccc ttcagccaac tcactgacca ggagttggag gagtacaaga aagaggtgga 2041 gaggaagaaa ctagaacttg atggagagaa agaaactgcc ccagaagagc ctggctcacc 2101 tgcaaagtct gcacctgctt ctccagtgca gagcccagcg aaggaggcag agacaaagag 2161 ccctttagtc tctccttcca agtctttaga ggaaggtact aagaagacag aaacaagcaa 2221 agccgccacc acagagcccg aaacaaccca gccggaaggg gtggtggtca acgggaggga 2281 ggaggagcag acggcagagg aaatcctcag caaaggcctg agccagatga ccaccagtgc 2341 tgacacggat gttgatacct ctaaggacaa aaccgagtcg gtcaccagcg gccccatgtc 2401 cccagagggc tcaccttcca agtctccctc aaagaagaaa aagaaattcc gaaccccctc 2461 cttcctgaaa aagagcaaaa agaaggagaa agtggagtcc tgattcatga cacccttggg 2521 ctccctcctg cctcctctct ctcctcccct tcccttctcc catctctgtc cctgcaagca 2581 cagggctaag gagggat // LOCUS HSBAT1MR 1658 bp RNA PRI 08-JUN-1995 DEFINITION H.sapiens BAT1 mRNA for nuclear RNA helicase (DEAD family). ACCESSION Z37166 NID g587145 KEYWORDS DEAD box protein; nuclear RNA helicase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1658) AUTHORS Peelman,L., Chardon,P., Nunes,M., Renard,C., Geffrotin,C., Vaiman,M., Van Zeveren,A., Coppieters,W., Van de Weghe,A., Bouquet,Y., Choy,W., Strominger,J. and Spies,T. TITLE The BAT1 gene in the MHC encodes an evolutionarily conserved putative nuclear RNA helicase of the DEAD family JOURNAL Genomics 26 (2), 210-218 (1995) MEDLINE 95324911 REFERENCE 2 (bases 1 to 1658) AUTHORS Peelman,L. TITLE Direct Submission JOURNAL Submitted (13-SEP-1994) Peelman L., Univeristy of Ghent, Heidestraat 19, Merelbeke, Belgium REFERENCE 3 (bases 1 to 1658) AUTHORS Peelman,L., Chardon,P., Nunes,M., Renard,C., Van de Weghe,A., Coppieters,W., Van Zeveren,A., Bouquet,Y. and Vaiman,M. TITLE The porcine BAT1 is a new member of the DEAD box protein family JOURNAL Unpublished FEATURES Location/Qualifiers source 1..1658 /organism="Homo sapiens" /db_xref="taxon:9606" 5'UTR 1..167 mat_peptide 169..1452 /gene="BAT1" /product="nuclear RNA helicase (DEAD family)" gene 169..1455 /gene="BAT1" CDS 169..1455 /gene="BAT1" /codon_start=1 /product="nuclear RNA helicase (DEAD family)" /db_xref="PID:g587146" /translation="MAENDVDNELLDYEDDEVETAAGGDGAEAPAKKDVKGSYVSIHS SGFRDFLLKPELLRAIVDCGFEHPSEVQHECIPQAILGMDVLCQAKSGMGKTAVFVLA TLQQLEPVTGQVSVLVMCHTRELAFQISKEYERFSKYMPNVKVAVFFGGLSIKKDEEV LKKNCPHIVVGTPGRILALARNKSLNLKHIKHFILDECDKMLEQLDMRRDVQEIFRMT PHEKQVMMFSATLSKEIRPVCRKFMQDPMEIFVDDETKLTLHGLQQYYVKLKDNEKNR KLFDLLDVLEFNQVVIFVKSVQRCIALAQLLVEQNFPAIAIHRGMPQEERLSRYQQFK DFQRRILVATNLFGRGMDIERVNIAFNYDMPEDSDTYLHRVARAGRFGTKGLAITFVS DENDAKILNDVQDRFEVNISELPDEIDISSYIEQTR" 3'UTR 1453..1658 polyA_site 1628..1633 BASE COUNT 400 a 414 c 419 g 425 t ORIGIN 1 ctaaaggctg ccgccatacg cgctctccct gtttagctct tctgttagaa atagtatctt 61 tgttttcctt tgctgttcct caatccccta ctcttcaccc cttgttttca cctattttgc 121 gagaacccat ccagatcccc cttcccttct tcccctgccg gcccagttat ggcagagaac 181 gatgtggaca atgagctctt ggactatgaa gatgatgagg tggagacagc agctggggga 241 gatggggctg aggcccctgc caagaaggat gtcaagggct cctatgtctc catccacagc 301 tctggctttc gtgacttcct gctcaagcca gagttgctcc gggccattgt cgactgtggc 361 tttgagcatc cgtcagaagt ccagcatgag tgcatccctc aggccattct gggaatggat 421 gtcctgtgcc aggccaagtc gggcatggga aagacagcag tgtttgtctt ggccacactg 481 caacagctgg agccagttac tgggcaggtg tctgtactgg tgatgtgtca cactcgggag 541 ttggcttttc agatcagcaa ggaatatgag cgcttctcta aatacatgcc caatgtcaag 601 gttgctgttt tttttggtgg tctgtctatc aagaaggatg aagaggtgct gaagaagaac 661 tgcccgcata tcgtcgtggg gactccaggc cgtatcctag ccctggctcg aaataagagc 721 ctcaacctca aacacattaa acactttatt ttggatgaat gtgataagat gcttgaacag 781 ctcgacatgc gtcgggatgt ccaggaaatt tttcgcatga ccccccacga gaagcaggtc 841 atgatgttca gtgctacctt gagcaaagag atccgtccag tctgccgcaa gttcatgcaa 901 gatccaatgg agatcttcgt ggatgatgag acgaagttga cgctgcatgg gttgcagcag 961 tactacgtga aactgaagga caacgagaag aaccggaagc tctttgacct tctggatgtc 1021 cttgagttca accaggtggt gatctttgtg aagtctgtgc agcggtgcat tgccttggcc 1081 cagctactag tggagcagaa cttcccagcc attgccatcc accgtgggat gccccaggag 1141 gagaggcttt ctcggtatca gcagtttaaa gattttcaac gacgaattct tgtggctacc 1201 aacctatttg gccgaggcat ggacatcgag cgggtgaaca ttgcttttaa ttatgacatg 1261 cctgaggatt ctgacaccta cctgcatcgg gtggccagag caggccggtt tggcaccaag 1321 ggcttggcta tcacatttgt gtccgatgag aatgatgcca agatcctcaa tgatgtgcag 1381 gatcgctttg aggtcaatat tagtgagctg cctgatgaga tagacatctc ctcctacatt 1441 gaacagacac ggtagaagac tcgcccattt tggaatgtga ccgtctgtcc ttcaggagag 1501 gacaccaggg tggggtgaag gagacactac tgcccccacc cctgacagcc cccaccccat 1561 ggcttccatc ttttgcatca ccaccactcc tgaaccccca tttctgattt gtcagaattt 1621 tttttttaac aaaactaaaa atgaaaaaaa aaaaaaaa // LOCUS HSBBC1 942 bp RNA PRI 24-NOV-1993 DEFINITION H.sapiens BBC1 mRNA. ACCESSION X64707 NID g29382 KEYWORDS BBC1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 942) AUTHORS Helps,N.R. TITLE Direct Submission JOURNAL Submitted (27-FEB-1992) N.R. Helps, ICI/University Joint Lab, University of Leicester, University Road, Leicester LE1 7RH, UK REFERENCE 2 (bases 1 to 942) AUTHORS Adams,S.M., Helps,N.R., Sharp,M.G., Brammar,W.J., Walker,R.A. and Varley,J.M. TITLE Isolation and characterization of a novel gene with differential expression in benign and malignant human breast tumours JOURNAL Hum. Mol. Genet. 1 (2), 91-96 (1992) MEDLINE 93244791 FEATURES Location/Qualifiers source 1..942 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="cDNA" /clone="C328-10" /chromosome="16" gene 52..687 /gene="BBC1" CDS 52..687 /gene="BBC1" /codon_start=1 /db_xref="PID:g29383" /db_xref="SWISS-PROT:P26373" /translation="MAPSRNGMVLKPHFHKDWQRRVATWFNQPARKIRRRKARQAKAR RIAPRPASGPIRPIVRCPTVRYHTKVRAGRGFSLEELRVAGIHKKVARTIGISVDPRR RNKSTESLQTNVQRLKEYRSKLILFPRKPSAPKKGDSSAEELKLATQLTGPVMPVRNV YKKEKARVITEEEKNFKAFASLRMARANARLFGIRAKRAKEAAEQDVEKKK" misc_feature 297^298 /gene="BBC1" /note="splice site" /evidence=experimental polyA_signal 683..688 polyA_site 707 /evidence=experimental polyA_site 711 /evidence=experimental polyA_site 723 /evidence=experimental polyA_site 942 /evidence=experimental BASE COUNT 201 a 272 c 289 g 180 t ORIGIN 1 ctttccgctc ggctgttttc ctgcgcagga gccgcagggc cgtaggcagc catggcgccc 61 agccggaatg gcatggtctt gaagccccac ttccacaagg actggcagcg gcgcgtggcc 121 acgtggttca accagccggc ccgtaagatc cgcagacgta aggcccggca agccaaggcg 181 cgccgcatcg ccccgcgccc cgcgtcgggt cccatccggc ccatcgtgcg ctgccccacg 241 gttcggtacc acacgaaggt gcgcgccggc cgcggcttca gcctggagga gctcagggtg 301 gccggcattc acaagaaggt ggcccggacc atcggcattt ctgtggatcc gaggaggcgg 361 aacaagtcca cggagtccct gcagaccaac gtgcagcggc tgaaggagta ccgctccaaa 421 ctcatcctct tccccaggaa gccctcggcc cccaagaagg gagacagttc tgctgaagaa 481 ctgaaactgg ccacccagct gaccggaccg gtcatgcccg tccggaacgt ctataagaag 541 gagaaagctc gagtcatcac tgaggaagag aagaatttca aagccttcgc tagtctccgt 601 atggcccgtg ccaacgcccg gctcttcggc atacgggcaa aaagagccaa ggaagccgca 661 gaacaggatg ttgaaaagaa aaaataaagc cctcctgggg acttggaatc agtcgggcag 721 tcatgctggg tctccacgtg gtgtgtttcg tgggaacaac tgggcctggg atggggcttc 781 actgctgtga cttcctcctg ccaggggatt tggggctttc ttgaaagaca gtccaagccc 841 tggataatgc tttactttct gtgttgaagc actgttggtt gtttggttag tgactgatgt 901 aaaacggttt tcttgtgggg aggttacaga ggctgacttc ag // LOCUS HSBCDECAS 1743 bp RNA PRI 30-JUN-1993 DEFINITION H.sapiens mRNA for branched chain decarboxylase alpha subunit. ACCESSION Z14093 S49270 NID g29390 KEYWORDS branched chain decarboxylase alpha subunit. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1743) AUTHORS McKean,M.C., Winkeler,K.A. and Danner,D.J. TITLE Nucleotide sequence of the 5' end including the initiation codon of cDNA for the E1 alpha subunit of the human branched chain alpha-ketoacid dehydrogenase complex JOURNAL Biochim. Biophys. Acta 1171 (1), 109-112 (1992) MEDLINE 93041997 REFERENCE 2 (bases 1 to 1743) AUTHORS Danner,D.J. TITLE Direct Submission JOURNAL Submitted (14-JUL-1992) Dean J Danner, Pediatrics/Medical Genetics, Emory University, 2040, Ridgewood Dr., Atlanta, Georgia, 30322, USA FEATURES Location/Qualifiers source 1..1743 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="Lymphoid tumor" /cell_type="B-lymphocyte" /cell_line="Raji" /clone="pKW5'alpha" /sex="Male" CDS 14..1351 /codon_start=1 /product="branched chain decarboxylase alpha subunit" /db_xref="PID:g29391" /db_xref="SWISS-PROT:P12694" /translation="MAVAIAAARVWRLNRGLSQAALLLLRQPGARGLARSHPPRQQQQ FSSLDDKPQFPGASAEFIDKLEFIQPNVISGIPIYRVMDRQGQIINPSEDPHLPKEKV LKLYKSMTLLNTMDRILYESQRQGRISFYMTNYGEEGTHVGSAAALDNTDLVFGQYRE AGVLMYRDYPLELFMAQCYGNISDLGKGRQMPVHYGCKERHFVTISSPLATQIPQAVG AAYAAKRANANRVVICYFGEGAASEGDAHAGFNFAATLECPIIFFCRNNGYAISTPTS EQYRGDGIAARGPGYGIMSIRVDGNDVFAVYNATKEARRRAVAENQPFLIEAMTYRIG HHSTSDDSSAYRSVDEVNYWDKQDHPISRLRHYLLSQGWWDEEQEKAWRKQSRRKVME AFEQAERKPKPNPNLLFSDVYQEMPAQLRKQQESLARHLQTYGEHYPLDHFDK" BASE COUNT 363 a 551 c 506 g 323 t ORIGIN 1 ttggttagcc aagatggcgg tagcgatcgc tgcagcgagg gtctggcggc taaaccgtgg 61 tttgagccag gctgccctcc tgctgctgcg gcagcctggg gctcggggac tggctagatc 121 tcaccccccc aggcagcagc agcagttttc atctctggat gacaagcccc agttcccagg 181 ggcctcggcg gagtttatag ataagttgga attcatccag cccaacgtca tctctggaat 241 ccccatctac cgcgtcatgg accggcaagg ccagatcatc aaccccagcg aggaccccca 301 cctgccgaag gagaaggtgc tgaagctcta caagagcatg acactgctta acaccatgga 361 ccgcatcctc tatgagtctc agcggcaggg ccggatctcc ttctacatga ccaactatgg 421 tgaggagggc acgcacgtgg ggagtgccgc cgccctggac aacacggacc tggtgtttgg 481 ccagtaccgg gaggcaggtg tgctgatgta tcgggactac cccctggaac tattcatggc 541 ccagtgctat ggcaacatca gtgacttggg caaggggcgc cagatgcctg tccactacgg 601 ctgcaaggaa cgccacttcg tcactatctc ctctccactg gccacgcaga tccctcaggc 661 ggtgggggcg gcgtacgcag ccaagcgggc caatgccaac agggtcgtca tctgttactt 721 cggcgagggg gcagccagtg agggggacgc ccatgccggc ttcaacttcg ctgccacact 781 tgagtgcccc atcatcttct tctgccggaa caatggctac gccatctcca cgcccacctc 841 tgagcagtat cgcggcgatg gcattgcagc acgaggcccc gggtatggca tcatgtcaat 901 ccgcgtggat ggtaatgatg tgtttgccgt atacaacgcc acaaaggagg cccgacggcg 961 ggctgtggca gagaaccagc cctttctcat cgaggccatg acctacagga tcgggcacca 1021 cagcaccagt gacgacagtt cagcgtaccg ctcggtggat gaggtcaatt actgggataa 1081 acaggaccac cccatctccc ggctgcggca ctatctgctg agccaaggct ggtgggatga 1141 ggagcaggag aaggcctgga ggaagcagtc ccgcaggaag gtgatggagg cctttgagca 1201 ggccgagcgg aagcccaaac ccaaccccaa cctgctcttc tcagacgtgt atcaggagat 1261 gcccgcccag ctccgcaagc agcaggagtc tctggcccgc cacctgcaga cctacgggga 1321 gcactaccca ctggatcact tcgataagtg agacctgctc agcccacccc cacccatcct 1381 cagctacccc gagaggtagc cccactctaa ggggcgcagg gggacctgac agcacaccac 1441 tgtcttcccc agtcagctcc ctctaaaata ctcagcggcc agggcggctg ccactcttca 1501 cccctgctcc tcccgtgtta cattctcagg ggacagcatc tgcagcagtt gctgaggctc 1561 cgtcagcccc ctcttcacct gttgttacag tgccttctcc caggggctgg gtgatgggca 1621 cattcaggac tagaagcccc tctgggcatg gggtggacat ggcaggtcag cctgtggaac 1681 ttgcgcaggt gcgactggcc agcagaggtc acgaataaac tgcatctctg cgcctggctc 1741 tct // LOCUS HSBCENT 2089 bp RNA PRI 26-NOV-1997 DEFINITION H.sapiens mRNA for beta-centractin (PC3). ACCESSION X82207 NID g563885 KEYWORDS centractin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Adams,M.D., Dubnick,M., Kerlavage,A.R., Moreno,R., Kelley,J.M., Utterback,T.R., Nagle,J.W., Fields,C. and Venter,J.C. TITLE Sequence identification of 2,375 human brain genes JOURNAL Nature 355 (6361), 632-634 (1992) MEDLINE 92168112 REFERENCE 2 (bases 1 to 2089) AUTHORS Clark,S.W., Staub,O., Clark,I.B., Holzbaur,E.L., Paschal,B.M., Vallee,R.B. and Meyer,D.I. TITLE Beta-centractin: characterization and distribution of a new member of the centractin family of actin-related proteins JOURNAL Mol. Biol. Cell 5 (12), 1301-1310 (1994) MEDLINE 95210749 REFERENCE 3 (bases 1 to 2089) AUTHORS Clark,S.W. TITLE Direct Submission JOURNAL Submitted (12-OCT-1994) S.W. Clark, University of California, Dept of Biological Chemistry, UCLA School of Medicine, 10833 Le Conte Ave., Los Angeles CA 90024-1737, USA COMMENT Homologous to M79070 at positions 147..536. FEATURES Location/Qualifiers source 1..2089 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="testicular" /clone_lib="Clontech HL1010b" /clone="PC3" misc_feature 1..9 /note="EcoRI linker" misc_feature 10..26 /note="Non-RNA-derived cDNA" CDS 57..1187 /codon_start=1 /product="beta-centracetin" /db_xref="PID:e1188589" /db_xref="PID:g563886" /translation="MESYDIIANQPVVIDNGSGVIKAGFAGDQIPKYCFPNYVGRPKH MRVMAGALEGDLFIGPKAEEHRGLLTIRYPMEHGVVRDWNDMERIWQYVYSKDQLQTF SEEHPVLLTEAPLNPSKNREKAAEVFFETFNVPALFISMQAVLSLYATGRTTGVVLDS GDGVTHAVPIYEGFAMPHSIMRVDIAGRDVSRYLRLLLRKEGVDFHTSAEFEVVRTIK ERACYLSINPQKDEALETEKVQYTLPDGSTLDVGPARFRAPELLFQPDLVGDESEGLH EVVAFAIHKSDMDLRRTLFANIVLSGGSTLFKGFGDRLLSEVKKLAPKDIKIKISAPQ ERLYSTWIGGSILASLDTFKKMWVSKKEYEEDGSRAIHRKTF" conflict 498 /citation=[1] /replace="t" misc_feature 2081..2089 /note="EcoRI linker" misc_feature 2087..2088 /note="Non-RNA-derived cDNA" BASE COUNT 439 a 589 c 620 g 441 t ORIGIN 1 gaattcgggg gtgcctcctg cagcccgcct gctgggcagg ccggcgcggc ccggccatgg 61 agtcctacga catcatcgcc aaccagcctg tggtcatcga caacggttcg ggggtgatta 121 aagctggctt tgcaggagac cagattccca aatactgttt cccaaactat gtcgggcggc 181 cgaagcacat gcgggtgatg gctggagccc tggaggggga cctcttcatc ggaccaaaag 241 cagaggagca ccgggggctg ctgaccatcc gctaccccat ggagcacggc gtggtgcgag 301 actggaacga catggaacgc atctggcagt acgtctactc caaggatcag ctgcagacct 361 tctcggagga gcatcctgtg ctcctcacgg aggccccgct caacccgagt aagaaccggg 421 agaaggcggc agaggtgttc tttgagacct tcaacgtgcc ggccctgttc atctccatgc 481 aggctgtgct cagtctgtac gcaacaggac gcacgacagg agtggttcta gactcagggg 541 acggggtcac tcatgctgtg cccatctatg agggctttgc catgcctcac tccatcatgc 601 gggtggacat tgccggccgc gacgtctccc gctacctccg actcctgctg cgcaaggaag 661 gggttgactt ccatacctcg gctgagtttg aggttgtccg gacaatcaaa gagcgagcgt 721 gctacctgtc catcaaccca cagaaggatg aggctctgga gacggagaag gtgcagtaca 781 cgttgccaga cggcagcacg cttgatgtgg ggcctgcacg attccgggcc cccgagctgc 841 tgttccagcc ggaccttgtc ggggatgaga gtgaggggct ccatgaggtg gtggccttcg 901 ccatacacaa gtccgacatg gacctgcgcc ggacgctgtt cgccaacatc gtgctctcag 961 gtggctcaac gcttttcaaa ggcttcggag accgattact cagtgaagtg aagaagcttg 1021 ccccaaagga tatcaaaatc aagatctcag ccccgcagga acggctgtac tccacatgga 1081 ttggcggctc catcctggcc tcgctggaca cttttaagaa gatgtgggtg tccaaaaagg 1141 agtatgaaga ggatggctcc cgtgctattc atcgcaaaac tttctagtgc ccaaggaggg 1201 cggggcatgt tgggagaggg ggagggaggg gagacagagc ctttaaccct ttttggtctt 1261 ggctcgtata ctaggcttag ggtcccctgc atgccctgaa cccctgggtg ggtggcacag 1321 cagtgccccc ctgcagcctt cccctctaca caggacatgc acacacaagt aacattgagc 1381 tgcatggaca ggagccttga gctggcgtgt gggaattgag cgccatgtca ggctgttgtg 1441 ggtatccccc tggcagggcc agctaggcct gtggttcccc tgctccgact ctcagggctg 1501 cctccctgag ctccagggcc agaatgcctg gatgcctggg tagccagttt ggggagtggg 1561 ctgcaagggg cagccagcag cgcccactgg tgtgtcactg catccattgc cacctcctgt 1621 tcgtgacctg acagggtgac acagcccctt tcacactctg tcctcctatc ttcctgggta 1681 gatgccctgg tgtagggctg agtactgaat ggtcttccat ccccagcaag ggggtgcagc 1741 ccagggtcag gcccttcaga gccagggcta gaggatgcac ggtggctaga gccagctgca 1801 ctatcctttt cagagcactt catccacttg ctcctccctc taccctcggc accctgggtg 1861 ggaaagggtt gatgctcatc atttattgag gggaagccac ttaataagga gtcagaccta 1921 aaagggggtg ggggacattt tcttacctca cccaagaaag aggtcgtcac ttttgctgtg 1981 gccagggccc cacctccctc tctcagatat gtacaataat ttaacacggt tgcctgaaaa 2041 aaacttttgt aaatcattat agtaataatt atggacaagg cccgaattc // LOCUS HSBCL7A 4522 bp RNA PRI 17-JAN-1997 DEFINITION H.sapiens mRNA for BCL7A protein. ACCESSION X89984 NID g929614 KEYWORDS alternative splicing; bcl7a gene; CGG-repeat; TG repeat. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4522) AUTHORS Zani,V.J., Asou,N., Jadayel,D., Heward,J.M., Shipley,J., Nacheva,E., Takasuki,K., Catovsky,D. and Dyer,M.J. TITLE Molecular cloning of complex chromosomal translocation t(8;14;12)(q24.1;q32.3;q24.1) in a Burkitt lymphoma cell line defines a new gene (BCL7A) with homology to caldesmon JOURNAL Blood 87 (8), 3124-3134 (1996) MEDLINE 96184769 REFERENCE 2 (bases 1 to 4522) AUTHORS Dyer,M.J.S. TITLE Direct Submission JOURNAL Submitted (25-JUL-1995) M.J.S. Dyer, Institute of Cancer Research, Academic Haematology, 15 Cotswold Road, Sutton, Surrey. SM2 5NG, UK FEATURES Location/Qualifiers source 1..4522 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /dev_stage="fetus" /clone_lib="Stratagene fetal brain" /map="12q24.13" /chromosome="12" repeat_region 691..710 /rpt_unit=CGG repeat_region 778..819 /rpt_unit=TG gene 954..1649 /gene="BCL7A" CDS 954..1649 /gene="BCL7A" /codon_start=1 /db_xref="PID:g929615" /translation="MSGRSVRAETRSRAKDDIKRVMAAIEKVRKWEKKWVTVGDTSLR IYKWVPVTEPKVDDKNKNKKKGKDEKCGSEVTTPENSSSPGMMDMHDDNSNQSSIADA SPIKQENSSNSSPAPEPNSAVPSDGTEAKVDEAQADGKEHPGAEDASDEQNSQSSMEH SMNSSEKVDRQPSGDSGLAAETSAISQVPRSRSQRGSQIGREPIGLSGDLEGVPPSKK MKLEASQQNSEEM" exon 1515..1578 /gene="BCL7A" /note="alternatively spliced exon" polyA_signal 3366..3370 /note="putative" polyA_signal 4503..4508 /note="putative" BASE COUNT 980 a 1285 c 1245 g 1009 t 3 others ORIGIN 1 cgggtctcag tctggtactg aatgcaggaa tggcttaagg tgaaatcgtg gtcctctggt 61 gaactcagcg aagaccccct cgccttgttt atgacaagag aacttctggg ggcgggagga 121 agagtccctg ttacgatgct gatcatcatt gagcttttgc tgagaaaact ctttagtact 181 aaggtcgaga gtctctgtgg tctgcctggc accagcacct tcctacaacc ctagttttcc 241 aaaaggaaaa gcctggggcc aggcgacgtc ctagctcgca ttatgacagg gccgcgggcc 301 accagagatg cgcgatgccc aactctttcc aagagcacct cgcgtcccga accggtgcct 361 tcaactcgga gaagtcaaga gacccgcaag aaacttgcac gactgcaccc gccgccgcgc 421 tgctgggggc tgggcagggg cagctgggct ggctcccggg gagacgcgac ccccccgcgc 481 cccgcagacc ggctgtctcc catggacccc tcggcacctg cagctccgag gaagggtcag 541 cgcgcgtgtc cgcaccccgc ccccaccgcg cgcccagagc cgggggtcgc cgtggccctt 601 cgccacctcc ccaggagggc gtccggggcg tcccctctgg gcgacgcgag gcccgggggn 661 ctgcggtggc cgcggcgcgg agccnnactc cggcggcggc ggcggcgcgg ctccccctgc 721 tctgtgcagc tgccgcccgg gcttgcgctg ggccaggcgc gcggcggccc cgggctttgt 781 gtgtgtgtat gtgtgtgtgt gtgtgtgtgt gtgtgtgtga gagtgtgtgc gtgtgagagt 841 gcgagtgtct gtgcgcgagt gagtgagcgg cgggcgggcg cgagtgtggc cggccggagc 901 gcgagcatga cccggcgggc gcgctcccca gcctccgtct ccccgccgga accatgtcgg 961 gcaggtcggt tcgagccgag acgaggagcc gggccaaaga tgatatcaag agggtcatgg 1021 cggcgatcga gaaagtgcgc aaatgggaga agaaatgggt gaccgttggt gacacatccc 1081 tacgaatcta caaatgggtc cctgtgacgg agcccaaggt tgatgacaaa aacaagaata 1141 agaaaaaagg caaggacgag aagtgtggct cagaggtgac cactccggag aacagttcct 1201 ccccagggat gatggacatg catgacgata acagcaacca gagctccatc gcagatgcct 1261 cccccatcaa acaggagaac agcagcaact ccagccccgc tccagagccc aactcggctg 1321 tgcccagcga cggcaccgag gccaaggtgg atgaggccca ggctgatggg aaggagcacc 1381 caggagctga agatgcttct gatgagcaga attcacagtc ctctatggaa cattcgatga 1441 acagctcaga gaaagtagat cggcagccgt ctggagactc gggtctggcc gcagagacgt 1501 ctgcaatctc tcaggtacct cgctcgaggt ctcagagggg cagccagatc ggccgggagc 1561 ccattgggtt gtcgggggat ctggaaggag tgccaccctc taaaaagatg aaactggagg 1621 cctctcaaca aaactccgaa gagatgtaga cgatgcttta agcctccgat aactgttcca 1681 tggaaggtac atcagcaatt aattctagag caactttgcc ccagcgattc ctcttgggtg 1741 cgaacagaac tactaacgtt tcaagtttac caagtgcaaa tccaagaaga cccagacggc 1801 gtcacttctc agacactgaa gaactctgct gtgaagcaaa acactcaaac ctttaaggga 1861 ctgtccttgg ggaggcaggc ggggctgaca gctcaggagt gtctgcacac tgtctcggaa 1921 gccaggattc catttgtgtt gctgctgtat ttccccccac ttctctatgt aacgatataa 1981 gctatcggag ggtggtaccg atcaggaacg ctttttggcg gggctttcca ctgttcaacc 2041 gattccttcc gctttctttt tttgtgcctt gtgcccttga ggtgacctct ggcatgtatc 2101 ctggtggttc ttacatcccc ctctgcaaag tgccctcttg gtttggttcg ggcggcggct 2161 gccaccctac tcaccgctct cctccctgcc ccaggacttc atcggagcag gcagggtgga 2221 gcgaaggagc tccttagccc acctggtttg caggtgcagg gggaccttag gcacgcccca 2281 agcaccaggc accagggccc aaggacgcgc aggtgttggg gcacagtccc caagggctcg 2341 gccccttgga tcaggctggg cactcgctgt gctctcccct ccttggggcg tttaggactg 2401 ggcgtctcca agcccaccat ggcccagatg gacgtgcaaa gcccttggaa ttttctggca 2461 cttcctctct attgccccca ccaccaccac ccccatcact gctttctccc agacctccga 2521 atacgaaatg gcttctctgg ctgactgcaa ggctgtctcc ttaaggcact gagtgggccg 2581 gggaggctgg gagccggcgg caggattagc tggtgctgaa ctttctctca taggacgtcg 2641 cttggatttc aaatccacgg tcacctgctg ccctttgcct cccccgacgc cccagcctgt 2701 gccccggaga ggcaggatcg cagtggtcag aatccacgtg ctttcctatt ctcaggctgt 2761 tctgactctg agccaacagc tggaccgtgt ctcatcccca gaacatgccg tctgtccacc 2821 ggggagtggc cttgatggcc ggcctcgaag gccacaaaca aggcgtcgag gaattggaaa 2881 gatttgcaca ccctccagaa aggagagacg caatctcccc tccctcccat cccccacctt 2941 cgctggaaca gcttcctctc actgaacgga gacgccccct tggacgaact gcctaatcgt 3001 ttggttctga ggcctggttt gctcttaatt aatatatgaa ctcctcagac cttaaacctt 3061 ttcctaagct ttctttactg cactggagtt ctgactccct ttgagttgtg tgttactggg 3121 ggtggggtgg ggtcatgggt tttgttgttt ttgggggcta attggtgcat attcaggtac 3181 cacctttgac gtgtggctct ttctcctgac catcatggga agtgtctgct ggattccatt 3241 ttctaagagt ttctgagggt gaggctctta tttttttttt aagggatcct gtctatttcc 3301 tgcacttcga gaagaatcaa aatgttcctg aatttcaaat acctcatgca aaatgtgtct 3361 cctgaaataa gggaaaaaaa aaaaaaacca caactttgaa aatcttaatg ttgaagttag 3421 caatgccgaa aggtttctgt cttaaaaaaa aaaaatcctt gtacttatca attttgcccc 3481 ttaggcagtc agttttgttg agaactgtgt cctgcatcct ggcgcagaac ctcctgatgc 3541 ggttcctctc cacgcatctc gaggcggcgt tacctccaga ttccgtagag ttagagtcac 3601 atttttcttt gcagcgaaac tccgtcttgg tgagagatga atttggatat ttatttcctt 3661 ctctgttttt gggaaacgag aggctacaac caagacagct gaaggagaat gaaacacaca 3721 tccacagaaa cagagaggcg taggtggccc tgccgttgac cgcagcctct ctggacaggc 3781 aaggggagtt ggcgcaggtg aggactcaga cgacgtccac cgtcccaagg ctgtcactag 3841 tatttctctg aagtgcctga aggtaggaat gggccggcga ttgggaccag ctgggcccca 3901 ccacggccac gccaggcaaa gcgccagcag ccctgcactc cacgctggcc aagaaggcct 3961 tccacgcaga atgacaagac tgcaaaaatc cgatgtgctt ccttccctgg cgcagtcgct 4021 cctcgagccg ctgcccccca cccaccctgc acccctcgcc ctccccccca ccacagaatc 4081 taagaccttt cagcttcgag ccagggggcg ggggatcccg agcaaaagcc ttccatggac 4141 atcaggcccc gtggcctcaa gggctcccag ggcaaaccta attcccccaa aacgtgaagt 4201 cggggaagct gcggctacac attccacaaa gtgctggcac ttacacccac aacccggaag 4261 gctgtggacc gattcctcta gggtggtgac ctcccattag caaacggtgt catggtttgg 4321 aatgttcatt atcgccaaga acctggttag aggcataaag accttttttc accgttacct 4381 aattttttcc cctttcaaga attttttttt tttttggtgt gttgtacagc agtataattt 4441 ttcacttatt tattcctaca gtagatatgg tttgtacaat gtacaattgt ttcatttcag 4501 aaaataaaaa tttcaaatca tg // LOCUS HSBCL7C 722 bp mRNA PRI 03-FEB-1998 DEFINITION Homo sapiens mRNA for BCL7C protein. ACCESSION AJ223980 NID g2832831 KEYWORDS BCL7C gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 722) AUTHORS Osborne,L.R., Jadayel,D.M., Coignet,L.J., Zani,V.J., Tsui,L.C., Scherer,S.W. and Dyer,M.J. TITLE The BCL7 gene family: deletion of BCL7B in Williams syndrome JOURNAL Unpublished REFERENCE 2 (bases 1 to 722) AUTHORS Dyer,M.J.S. TITLE Direct Submission JOURNAL Submitted (02-FEB-1998) Dyer M.J.S., Academic Haematology and Cytogenetics, Institute of Cancer Research, Haddow Laboratories, Sutton, Surrey, SM2 5NG, UK FEATURES Location/Qualifiers source 1..722 /organism="Homo sapiens" /plasmid="Plasmid pBS KS+" /db_xref="taxon:9606" /chromosome="16" /map="p11" /tissue_type="skeletal muscle" gene 1..654 /gene="BCL7C" CDS 1..654 /gene="BCL7C" /codon_start=1 /db_xref="PID:e1249849" /db_xref="PID:g2832832" /translation="MAGRTVRAETRSRAKDDIKKVMATIEKVRRWEKRWVTVGDTSLR IFKWVPVVDPQEEERRRAGGGAERSRGRERRGRGASPRGGGPLILLDLNDENSNQSFH SEGSLPKGTEPSPGGTPQPSRPVSPAGPPEGVPEEAQPPRLGQERDPGGITAGSTDEP PMLTKEEPVPELLEAEAPEAYPVFEPVPPVPEAAQGDTEDSGGAPPLKRICPNAPDP" BASE COUNT 147 a 224 c 238 g 113 t ORIGIN 1 atggccggcc ggactgtacg ggccgagacc cggagccggg ccaaggatga catcaagaag 61 gtgatggcga ccatcgagaa ggtccggaga tgggagaagc gatgggtgac tgtgggcgac 121 acttcccttc gtatcttcaa gtgggtgcca gtggtggatc cccaggagga ggagcgaagg 181 cgggcaggtg gcggggcaga gagatcccgt ggccgggaac gtcggggcag gggcgccagt 241 ccccgagggg gtggccctct catcctgctg gatcttaatg atgagaacag caaccagagt 301 ttccattcgg aaggttccct gccaaagggc acagagccca gtcctggggg caccccccag 361 cccagccgcc ctgtgtcacc tgccggaccc ccagaagggg tccctgagga ggctcagccc 421 ccacggctgg gccaagagag agatcccggg ggcataactg ctggcagcac cgacgaaccc 481 ccaatgctga ccaaggagga gcctgttcca gaactgctgg aagctgaggc ccccgaagct 541 taccctgtct ttgagccagt gccacctgtc cctgaggcag cccagggtga cacagaggac 601 tcggggggtg cccccccact caagcgcatc tgcccaaatg cccctgaccc ctgagaagcc 661 ggcctgcctg tcctgttgcc ccaggggccc ctttggcttt ttacaaataa agaccctttt 721 gt // LOCUS HSBCLXL 926 bp RNA PRI 26-JUL-1994 DEFINITION H.sapiens bcl-xL mRNA. ACCESSION Z23115 L20121 NID g510900 KEYWORDS bcl-xL gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 926) AUTHORS Thompson,C.B. TITLE Direct Submission JOURNAL Submitted (22-JUN-1993) Craig B Thompson, Howard Hughes Medical Institute, University of Chicago, 5841 South Maryland, Chicago, IL, 60637, USA REFERENCE 2 (bases 1 to 926) AUTHORS Boise,L.H., Gonzalez-Garcia,M., Postema,C.E., Ding,L., Lindsten,T., Turka,L.A., Mao,X., Nunez,G. and Thompson,C.B. TITLE bcl-x, a bcl-2-related gene that functions as a dominant regulator of apoptotic cell death JOURNAL Cell 74 (4), 597-608 (1993) MEDLINE 93364977 FEATURES Location/Qualifiers source 1..926 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="bcl-xL" /dev_stage="Adult" /cell_type="brain" gene 135..836 /gene="bcl-xL" CDS 135..836 /gene="bcl-xL" /codon_start=1 /db_xref="PID:g510901" /db_xref="SWISS-PROT:Q07817" /translation="MSQSNRELVVDFLSYKLSQKGYSWSQFSDVEENRTEAPEGTESE METPSAINGNPSWHLADSPAVNGATAHSSSLDAREVIPMAAVKQALREAGDEFELRYR RAFSDLTSQLHITPGTAYQSFEQVVNELFRDGVNWGRIVAFFSFGGALCVESVDKEMQ VLVSRIAAWMATYLNDHLEPWIQENGGWDTFVELYGNNAAAESRKGQERFNRWFLTGM TVAGVVLLGSLFSRK" BASE COUNT 220 a 249 c 264 g 193 t ORIGIN 1 gaatctcttt ctctcccttc agaatcttat cttggctttg gatcttagaa gagaatcact 61 aaccagagac gagactcagt gagtgagcag gtgttttgga caatggactg gttgagccca 121 tccctattat aaaaatgtct cagagcaacc gggagctggt ggttgacttt ctctcctaca 181 agctttccca gaaaggatac agctggagtc agtttagtga tgtggaagag aacaggactg 241 aggccccaga agggactgaa tcggagatgg agacccccag tgccatcaat ggcaacccat 301 cctggcacct ggcagacagc cccgcggtga atggagccac tgcgcacagc agcagtttgg 361 atgcccggga ggtgatcccc atggcagcag taaagcaagc gctgagggag gcaggcgacg 421 agtttgaact gcggtaccgg cgggcattca gtgacctgac atcccagctc cacatcaccc 481 cagggacagc atatcagagc tttgaacagg tagtgaatga actcttccgg gatggggtaa 541 actggggtcg cattgtggcc tttttctcct tcggcggggc actgtgcgtg gaaagcgtag 601 acaaggagat gcaggtattg gtgagtcgga tcgcagcttg gatggccact tacctgaatg 661 accacctaga gccttggatc caggagaacg gcggctggga tacttttgtg gaactctatg 721 ggaacaatgc agcagccgag agccgaaagg gccaggaacg cttcaaccgc tggttcctga 781 cgggcatgac tgtggccggc gtggttctgc tgggctcact cttcagtcgg aaatgaccag 841 acactgacca tccactctac cctcccaccc ccttctctgc tccaccacat cctccgtcca 901 gccgccattg ccaccaggag aacccg // LOCUS HSBCTCF4 2444 bp RNA PRI 14-AUG-1997 DEFINITION Homo sapiens mRNA for hTCF-4. ACCESSION Y11306 NID g1938212 KEYWORDS HMG box; HMG box protein; hTcf-4 gene; transcription factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2444) AUTHORS Korinek,V., Barker,N., Morin,P.J., van Wichen,D., de Weger,R., Kinzler,K.W., Vogelstein,B. and Clevers,H. TITLE Constitutive transcriptional activation by a beta-catenin-Tcf complex in APC-/- colon carcinoma JOURNAL Science 275 (5307), 1784-1787 (1997) MEDLINE 97218301 REFERENCE 2 (bases 1 to 2444) AUTHORS Korinek,V. TITLE Direct Submission JOURNAL Submitted (17-FEB-1997) V. Korinek, University Hopsital Utrecht, Department of Immunology, Universtiy Hopsital,, PO Box 85500, 3508 GA Utrecht, NETHERLANDS FEATURES Location/Qualifiers source 1..2444 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="foetus 12 weeks old" mRNA 1..2444 gene 308..2098 /gene="hTcf-4" CDS 308..2098 /gene="hTcf-4" /function="HMG box transcription factor" /codon_start=1 /product="hTcf-4" /db_xref="PID:e309297" /db_xref="PID:g1938213" /translation="MPQLNGGGGDDLGANDELISFKDEGEQEEKSSENSSAERDLADV KSSLVNESETNQNSSSDSEAERRPPPRSESFRDKSRESLEEAAKRQDGGLFKGPPYPG YPFIMIPDLTSPYLPKRSVSPTARTYLQMKWPLLDVQAGSLQSRQALKDARSPSPAHI VSNKVPVVQHPHHVHPLTPLITYSNEHFTPGNPPPHLPADVDPKTGIPRPPHPPDISP YYPLSPGTVGQIPHPLGWLVPQQGQPVYPITTGGFRHPYPTALTVNASVSRFPPHMVP PHHTLHTTGIPHPAIVTPTVKQESSQSDVGSLHSSKHQDSKKEEEKKKPHIKKPLNAF MLYMKEMRAKVVAECTLKESAAINQILGRRWHALSREEQAKYYELARKERQLHMQLYP GWSARDNYGKKKKRKRDKQPGETNEHSECFLNPCLSLPPITDLSAPKKCRARFGLDQQ NNWCGPCRRKKKCVRYIQGEGSCLSPPSSDGSLLDSPPPSPNLLGSPPRDAKSQTEQT QPLSLSLKPDPLAHLSMMPPPPALLLAEATHKASALCPNGALDLPPAALQPAAPSSSI AQPSTSWLHSHSSLAGTQPQPLSLVTKSLE" BASE COUNT 635 a 734 c 522 g 553 t ORIGIN 1 ggtttttttt ttttaccccc cttttttatt tattattttt ttgcacattg agcggatcct 61 tgggaacgag agaaaaaaga aacccaaact cacgcgtgca gaagatctcc ccccccttcc 121 cctcccctcc tccctctttt cccctcccca ggagaaaaag acccccaagc agaaaaaagt 181 tcaccttgga ctcgtctttt tcttgcaata ttttttgggg gggcaaaact ttgagggggt 241 gatttttttt ggcttttctt cctccttcat ttttcttcca aaattgctgc tggtgggtga 301 aaaaaaaatg ccgcagctga acggcggtgg aggggatgac ctaggcgcca acgacgaact 361 gatttccttc aaagacgagg gcgaacagga ggagaagagc tccgaaaact cctcggcaga 421 gagggattta gctgatgtca aatcgtctct agtcaatgaa tcagaaacga atcaaaacag 481 ctcctccgat tccgaggcgg aaagacggcc tccgcctcgc tccgaaagtt tccgagacaa 541 atcccgggaa agtttggaag aagcggccaa gaggcaagat ggagggctct ttaaggggcc 601 accgtatccc ggctacccct tcatcatgat ccccgacctg acgagcccct acctccccaa 661 gcgatccgtc tcgcccaccg cccgaaccta tctccagatg aaatggccac tgcttgatgt 721 ccaggcaggg agcctccaga gtagacaagc cctcaaggat gcccggtccc catcaccggc 781 acacattgtc tctaacaaag tgccagtggt gcagcaccct caccatgtcc accccctcac 841 gcctcttatc acgtacagca atgaacactt cacgccggga aacccacctc cacacttacc 901 agccgacgta gaccccaaaa caggaatccc acggcctccg caccctccag atatatcccc 961 gtattaccca ctatcgcctg gcaccgtagg acaaatcccc catccgctag gatggttagt 1021 accacagcaa ggtcaaccag tgtacccaat cacgacagga ggattcagac acccctaccc 1081 cacagctctg accgtcaatg cttccgtgtc caggttccct ccccatatgg tcccaccaca 1141 tcatacgcta cacacgacgg gcattccgca tccggccata gtcacaccaa cagtcaaaca 1201 ggaatcgtcc cagagtgatg tcggctcact ccatagttca aagcatcagg actccaaaaa 1261 ggaagaagaa aagaagaagc cccacataaa gaaacctctt aatgcattca tgttgtatat 1321 gaaggaaatg agagcaaagg tcgtagctga gtgcacgttg aaagaaagcg cggccatcaa 1381 ccagatcctt gggcggaggt ggcatgcact gtccagagaa gagcaagcga aatactacga 1441 gctggcccgg aaggagcgac agcttcatat gcaactgtac cccggctggt ccgcgcggga 1501 taactatgga aagaagaaga agaggaaaag ggacaagcag ccgggagaga ccaatgaaca 1561 cagcgaatgt ttcctaaatc cttgcctttc acttcctccg attacagacc tcagcgctcc 1621 taagaaatgc cgagcgcgct ttggccttga tcaacagaat aactggtgcg gcccttgcag 1681 gagaaaaaaa aagtgcgttc gctacataca aggtgaaggc agctgcctca gcccaccctc 1741 ttcagatgga agcttactag attcgcctcc cccctccccg aacctgctag gctcccctcc 1801 ccgagacgcc aagtcacaga ctgagcagac ccagcctctg tcgctgtccc tgaagcccga 1861 ccccctggcc cacctgtcca tgatgcctcc gccacccgcc ctcctgctcg ctgaggccac 1921 ccacaaggcc tccgccctct gtcccaacgg ggccctggac ctgcccccag ccgctttgca 1981 gcctgccgcc ccctcctcat caattgcaca gccgtcgact tcttggttac attcccacag 2041 ctccctggcc gggacccagc cccagccgct gtcgctcgtc accaagtctt tagaatagct 2101 ttagcgtcgt gaaccccgct gctttgttta tggttttgtt tcacttttct taatttgccc 2161 cccaccccca ccttgaaagg ttttgttttg tactctctta attttgtgcc atgtggctac 2221 attagttgat gtttatcgag ttcattggtc aatatttgac ccattcttat ttcaatttct 2281 ccttttaaat atgtagatga gagaagaacc tcatgattgg taccaaaatt tttatcaaca 2341 gctgtttaaa gtctttgtag cgtttaaaaa atatatatat atacataact gttatgtagt 2401 tcggatagct tagttttaaa agactgatta aaaaacaaaa aaaa // LOCUS HSBDP1 2810 bp RNA PRI 25-MAR-1997 DEFINITION H.sapiens BDP1 mRNA for protein-tyrosine-phosphatase. ACCESSION X79568 NID g1871530 KEYWORDS BDP1 gene; protein-tyrosine-phosphatase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2810) AUTHORS Kim,Y.W., Wang,H., Sures,I., Lammers,R., Martell,K.J. and Ullrich,A. TITLE Characterization of the PEST family protein tyrosine phosphatase BDP1 JOURNAL Oncogene 13 (10), 2275-2279 (1996) MEDLINE 97108674 REFERENCE 2 (bases 1 to 2810) AUTHORS Kim,Y.W. TITLE Direct Submission JOURNAL Submitted (07-JUN-1994) Y.W. Kim, Max Planck Inst. fuer Biochemie, Dept of Molecular Biology, Am Klopferspitz 18a, 82152 Planegg, FRG FEATURES Location/Qualifiers source 1..2810 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /cell_line="hematopoietic" gene 44..1420 /gene="BDP1" CDS 44..1420 /gene="BDP1" /EC_number="3.1.3.48" /codon_start=1 /product="protein-tyrosine-phosphatase" /db_xref="PID:e109926" /db_xref="PID:g1871531" /translation="MSRSLDSARSFLERLEARGGREGAVLAGEFSDIQACSAAWKADG VCSTVAGSRPENVRKNRYKDVLPYDQTRVILSLLQEEGHSDYINGNFIRGVDGSLAYI ATQGPLPHTLLDFWRLVWEFGVKVILMACREIENGRKRCERYWAQEQEPLQTGLFCIT LIKEKWLNEDIMLRTLKVTFQKESRSVYQLQYMSWPDRGVPSSPDHMLAMVEEARRLQ GSGPEPLCVHCSAGCGRTGVLCTVDYVRQLLLTQMIPPDFSLFDVVLKMRKQRPAAVQ TEEQYRFLYHTVAQMFCSTLQNASPHYQNIKENCAPLYDDALFLRTPQALLAIPRPPG GVLRSISVPGSPGHAMADTYAEEQKRGAPAGAGSGTQTGTGTGARSAEEAPLYSKVTP RAQRPGAHAEDARGTLPGRVPADQSPAGSGAYEDVAGGAQTGGLGFNLRIGRPKGPRD PPAEWTRV" BASE COUNT 700 a 807 c 810 g 493 t ORIGIN 1 gaattcggca cgagcgggct ggaccttgct ggcccgcggc gccatgagcc gcagcctgga 61 ctcggcgcgg agcttcctgg agcggctgga agcgcggggc ggccgggagg gggcagtcct 121 cgccggcgag ttcagcgaca tccaggcctg ctcggccgcc tggaaggctg acggcgtgtg 181 ctccaccgtg gccggcagtc ggccagagaa cgtgaggaag aaccgctaca aagacgtgct 241 gccttatgat cagacgcgag taatcctctc cctgctccag gaagagggac acagcgacta 301 cattaatggc aacttcatcc ggggcgtgga tggaagcctg gcctacattg ccacgcaagg 361 acccttgcct cacaccctgc tagacttctg gagactggtc tgggagtttg gggtcaaggt 421 gatcctgatg gcctgtcgag agatagagaa tgggcggaaa aggtgtgagc ggtactgggc 481 ccaggagcag gagccactgc agactgggct tttctgcatc actctgataa aggagaagtg 541 gctgaatgag gacatcatgc tcaggaccct caaggtcaca ttccagaagg agtcccgttc 601 tgtgtaccag ctacagtata tgtcctggcc agaccgtggg gtccccagca gtcctgacca 661 catgctcgcc atggtggagg aagcccgtcg cctccaggga tctggccctg aacccctctg 721 tgtccactgc agtgcgggtt gtgggcgaac aggcgtcctg tgcaccgtgg attatgtgag 781 gcagctgctc ctgacccaga tgatcccacc tgacttcagt ctctttgatg tggtccttaa 841 gatgaggaag cagcggcctg cggccgtgca gacagaggag cagtacaggt tcctgtacca 901 cacggtggct cagatgttct gctccacact ccagaatgcc agcccccact accagaacat 961 caaagagaat tgtgccccac tctacgacga tgccctcttc ctccggactc cccaggcact 1021 tctcgccata ccccgcccac caggaggggt cctcaggagc atctctgtgc ccgggtcccc 1081 gggccacgcc atggctgaca cctacgcgga ggagcagaag cgcggggctc cagcgggcgc 1141 cgggagtggg acgcagacgg ggacggggac gggggcgcgc agcgcggagg aggcgccgct 1201 ctacagcaag gtgacgccgc gcgcccagcg acccggggcg cacgcggagg acgcgagggg 1261 gacgctgcct ggccgcgttc ctgctgacca aagtcctgcc ggatctggcg cctacgagga 1321 cgtggcgggt ggagctcaga ccggtgggct aggtttcaac ctgcgcattg ggaggccgaa 1381 gggtccccgg gacccgcctg ctgagtggac ccgggtgtaa gtctaacgcc agttcctgcc 1441 tgttgcctct tgtgagctcg gactgctgat gccccggtgc tgctgagcgc cgtgccgaga 1501 atggaaacag tgggcctgga tcaaagttaa agtttctcag ggtgggaaat gtgggggctt 1561 tgcccaatga ctgtagcatt caaggcttga ggctggagga ggtagctagg gtatagtggc 1621 tggtgaggct gcacagagca gattcaagaa agaagatcag gaaggggcat gacccctgag 1681 ttatgaaggg gagaagggac agatgagctt ccggagactg ctctcctcac cacacagcac 1741 tagtccatcc tcagcacctg agcctccctc acttggacac tcaggggacc acacagagaa 1801 gtggatggac acttcgccat ccaggcagaa ctaagccagg cataaccaca gccaagcaga 1861 ttaaccccag gcagaccgat aaaaagacct ccagataggc agacagacag atggaccacc 1921 aacctggaca gacagccaaa gcttcagaga tacagtccac aggtggacaa agggatcccc 1981 agccagagag agagagacca gccaacagct tgatagacca gtgcagccag agagaccacc 2041 aaacacagcc cccaaaagac agacatctct gctagctgga cagccaggtg gaccccctaa 2101 gttagtcaga ttactagaca gatataaaca gatcccctgc tgaacagata tacagagttc 2161 tcagacccca ctccctcagg tgggctggct ggctgacaga ccttctggcc agacagactc 2221 ctaaccaacc agatggactg ccagacaggc agacatcagt ccacatggaa tcctgacatc 2281 ccagccagcc ggccagactc tcatcttgat gtcttgatgg atggacccca gctagtcaga 2341 catgatcctc cagattgaca gacaagtccc ccaaatgagt acacatctcc agctattcag 2401 acagatggag ccccagcaaa tcaggaccta tctaggcaga ccccagccag acccccgcca 2461 gacagactcc caaccagact gaccccttgc tgttcacaca gcctgccgag tagctgggac 2521 tacaggtcta attttttttt tttttaagaa atgagttttt gccatgttgc ccagactggt 2581 cttgaactcc caacctcaag caatcctcct gcctcagcct cccaaagtgc tgagattaca 2641 ggtgtgagcc accaggctca gccccctaag atttgaaaca ctttaaatgg cccatggtag 2701 ggttcctgct aggataaaac attaagtggc tgttaaaaga aataaaagga ggacacgtct 2761 ctgtgcaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa // LOCUS HSBECATA 2585 bp RNA PRI 16-FEB-1995 DEFINITION H.sapiens of beta catenin gene. ACCESSION Z19054 NID g38519 KEYWORDS beta catenin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2585) AUTHORS Hulsken,J., Birchmeier,W. and Behrens,J. TITLE E-cadherin and APC compete for the interaction with beta-catenin and the cytoskeleton JOURNAL J. Cell Biol. 127 (6 Pt 2), 2061-2069 (1994) MEDLINE 95105247 REFERENCE 2 (bases 1 to 2585) AUTHORS Huelsken,J. TITLE Direct Submission JOURNAL Submitted (11-DEC-1992) Huelsken J., Institut fuer Zellbiologie (Tumorforschung), Virchowstr. 173, Essen, Deutschland, 4300 FEATURES Location/Qualifiers source 1..2585 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="Clontech CatNr.: HL1075b" /sex="Female" CDS 201..2546 /standard_name="beta catenin" /citation=[1] /codon_start=1 /product="beta catenin" /db_xref="PID:g38520" /db_xref="SWISS-PROT:P35222" /translation="MATQADLMELDMAMEPDRKAAVSHWQQQSYLDSGIHSGATTTAP SLSGKGNPEEEDVDTSQVLYEWEQGFSQSFTQEQVADIDGQYAMTRAQRVRAAMFPET LDEGMQIPSTQFDAAHPTNVQRLAEPSQMLKHAVVNLINYQDDAELATRAIPELTKLL NDEDQVVVNKAAVMVHQLSKKEASRHAIMRSPQMVSAIVRTMQNTNDVETARCTAGTL HNLSHHREGLLAIFKSGGIPALVKMLGSPVDSVLFYAITTLHNLLLHQEGAKMAVRLA GGLQKMVALLNKTNVKFLAITTDCLQILAYGNQESKLIILASGGPQALVNIMRTYTYE KLLWTTSRVLKVLSVCSSNKPAIVEAGGMQALGLHLTDPSQRLVQNCLWTLRNLSDAA TKQEGMEGLLGTLVQLLGSDDINVVTCAAGILSNLTCNNYKNKMMVCQVGGIEALVRT VLRAGDREDITEPAICALRHLTSRHQEAEMAQNAVRLHYGLPVVVKLLHPPSHWPLIK ATVGLIRNLALCPANHAPLREQGAIPRLVQLLVRAHQDTQRRTSMGGTQQQFVEGVRM EEIVEGCTGALHILARDVHNRIVIRGLNTIPLFVQLLYSPIENIQRVAAGVLCELAQD KEAAEAIEAEGATAPLTELLHSRNEGVATYAAAVLFRMSEDKPQDYKKRLSVELTSSL FRTEPMAWNETADLGLDIGAQGEPLGYRQDDPSYRSFHSGGYGQDALGMDPMMEHEMG GHHPGADYPVDGLPDLGHAQDLMDGLPPGDSNQLAWFDTDL" BASE COUNT 659 a 605 c 666 g 655 t ORIGIN 1 ggggcagcag cgttggcccg gccccgggag cggagagcga ggggaggcgg agacggagga 61 aggtctgagg agcagcttca gtccccgccg agccgccacc gcaggtcgag gacggtcgga 121 ctcccgcggc gggaggagcc tgttcccctg agggtatttg aagtatacca tacaactgtt 181 ttgaaaatcc agcgtggaca atggctactc aagctgattt gatggagttg gacatggcca 241 tggaaccaga cagaaaagcg gctgttagtc actggcagca acagtcttac ctggactctg 301 gaatccattc tggtgccact accacagctc cttctctgag tggtaaaggc aatcctgagg 361 aagaggatgt ggatacctcc caagtcctgt atgagtggga acagggattt tctcagtcct 421 tcactcaaga acaagtagct gatattgatg gacagtatgc aatgactcga gctcagaggg 481 tacgagctgc tatgttccct gagacattag atgagggcat gcagatccca tctacacagt 541 ttgatgctgc tcatcccact aatgtccagc gtttggctga accatcacag atgctgaaac 601 atgcagttgt aaacttgatt aactatcaag atgatgcaga acttgccaca cgtgcaatcc 661 ctgaactgac aaaactgcta aatgacgagg accaggtggt ggttaataag gctgcagtta 721 tggtccatca gctttctaaa aaggaagctt ccagacacgc tatcatgcgt tctcctcaga 781 tggtgtctgc tattgtacgt accatgcaga atacaaatga tgtagaaaca gctcgttgta 841 ccgctgggac cttgcataac ctttcccatc atcgtgaggg cttactggcc atctttaagt 901 ctggaggcat tcctgccctg gtgaaaatgc ttggttcacc agtggattct gtgttgtttt 961 atgccattac aactctccac aaccttttat tacatcaaga aggagctaaa atggcagtgc 1021 gtttagctgg tgggctgcag aaaatggttg ccttgctcaa caaaacaaat gttaaattct 1081 tggctattac gacagactgc cttcaaattt tagcttatgg caaccaagaa agcaagctca 1141 tcatactggc tagtggtgga ccccaagctt tagtaaatat aatgaggacc tatacttacg 1201 aaaaactact gtggaccaca agcagagtgc tgaaggtgct atctgtctgc tctagtaata 1261 agccggctat tgtagaagct ggtggaatgc aagctttagg acttcacctg acagatccaa 1321 gtcaacgtct tgttcagaac tgtctttgga ctctcaggaa tctttcagat gctgcaacta 1381 aacaggaagg gatggaaggt ctccttggga ctcttgttca gcttctgggt tcagatgata 1441 taaatgtggt cacctgtgca gctggaattc tttctaacct cacttgcaat aattataaga 1501 acaagatgat ggtctgccaa gtgggtggta tagaggctct tgtgcgtact gtccttcggg 1561 ctggtgacag ggaagacatc actgagcctg ccatctgtgc tcttcgtcat ctgaccagcc 1621 gacaccaaga agcagagatg gcccagaatg cagttcgcct tcactatgga ctaccagttg 1681 tggttaagct cttacaccca ccatcccact ggcctctgat aaaggctact gttggattga 1741 ttcgaaatct tgccctttgt cccgcaaatc atgcaccttt gcgtgagcag ggtgccattc 1801 cacgactagt tcagttgctt gttcgtgcac atcaggatac ccagcgccgt acgtccatgg 1861 gtgggacaca gcagcaattt gtggaggggg tccgcatgga agaaatagtt gaaggttgta 1921 ccggagccct tcacatccta gctcgggatg ttcacaaccg aattgttatc agaggactaa 1981 ataccattcc attgtttgtg cagctgcttt attctcccat tgaaaacatc caaagagtag 2041 ctgcaggggt cctctgtgaa cttgctcagg acaaggaagc tgcagaagct attgaagctg 2101 agggagccac agctcctctg acagagttac ttcactctag gaatgaaggt gtggcgacat 2161 atgcagctgc tgttttgttc cgaatgtctg aggacaagcc acaagattac aagaaacggc 2221 tttcagttga gctgaccagc tctctcttca gaacagagcc aatggcttgg aatgagactg 2281 ctgatcttgg acttgatatt ggtgcccagg gagaacccct tggatatcgc caggatgatc 2341 ctagctatcg ttcttttcac tctggtggat atggccagga tgccttgggt atggacccca 2401 tgatggaaca tgagatgggt ggccaccacc ctggtgctga ctatccagtt gatgggctgc 2461 cagatctggg gcatgcccag gacctcatgg atgggctgcc tccaggtgac agcaatcagc 2521 tggcctggtt tgatactgac ctgtaaatca tcctttagga gtaacaatac aaatggattt 2581 tgccc // LOCUS HSBETPP2A 2183 bp RNA PRI 07-OCT-1996 DEFINITION H.sapiens mRNA for beta 2 isoform of 61 kDa regulatory subunit of PP2A. ACCESSION Z69028 NID g1418762 KEYWORDS 61kDa regulatory subunit; beta 2 isoform; PP2A gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2183) AUTHORS Zolnierowicz,S., Van Hoof,C., Andjelkovic,N., Cron,P., Stevens,I., Merlevede,W., Goris,J. and Hemmings,B.A. TITLE The variable subunit associated with protein phosphatase 2A0 defines a novel multimember family of regulatory subunits JOURNAL Biochem. J. 317 (Pt 1), 187-194 (1996) MEDLINE 96276417 REFERENCE 2 (bases 1 to 2183) AUTHORS Hemmings,B.B.A. TITLE Direct Submission JOURNAL Submitted (27-JAN-1996) Hemmings B.B.A., Friedrich Miescher-Institut, Maulbeerstrasse 66, BASEL, Switzerland, CH-4002 FEATURES Location/Qualifiers source 1..2183 /organism="Homo sapiens" /isolate="17-18 weeks gestation" /db_xref="taxon:9606" /clone="PR61-P14" /dev_stage="Fetus" /tissue_type="Brain" /clone_lib="cDNA library (Stratagene)" /sex="Female" 5'UTR 1..69 CDS 70..1554 /codon_start=1 /product="beta 2 isoform of 61kDa regulatory subunit of PP2A" /db_xref="PID:e220194" /db_xref="PID:g1418763" /translation="MITVNPPLPQDTVNLFSPVPPPDKVDGFSRRSLRRARPRRSHSS SQFRYQSNQQELTPLPLLKDVPASELHELLSRKLAQCGVMFDFLDCVADLKGKEVKRA ALNELVECVGSTRGVLIEPVYPDIIRMISVNIFRTLPPSENPEFDPEEDEPNLEPSWP HLQLVYEFFLRFLESPDFQPSVAKRYVDQKFVLMLLELFDSEDPREREYLKTILHRVY GKFLGLRAYIRKQCNHIFLRFIYEFEHFNGVAELLEILGSIINGFALPLKTEHKQFLV RVLIPLHSVKSLSVFHAQLAYCVVQFLEKDATLTEHVIRGLLKYWPKTCTQKEVMFLG EMEEILDVIEPSQFVKIQEPLFKQVARCVSSPHFQVAERALYFWNNEYILSLIEDNCH TVLPAVFGTLYQVSKEHWNQTIVSLIYNVLKTFMEMNGKLFDELTASYKLEKQQEQQK AQERQELWQGLEELRLRRLQGTQGAKEAPLQRLTPQVAASGGQS" BASE COUNT 458 a 645 c 597 g 483 t ORIGIN 1 gaattccttt tttttgaatt tctctcttta attaaggtct accccaagcc acggaaagag 61 gagtagacca tgattactgt gaaccccccc ttaccccagg acactgtgaa tctcttctcg 121 cctgtgcccc cacccgacaa ggtggacggc ttctcccgcc gttccctccg cagagcccgg 181 ccccgccgct cccacagctc ctctcagttc cgctatcaga gcaaccagca agagctcaca 241 ccgctgcccc tgctcaaaga tgtgccggct tccgagctgc acgagctgct gagccggaag 301 ctggcccagt gtggggtgat gtttgacttc ttggactgtg tggccgacct caaggggaag 361 gaggtgaagc gggcagccct caacgagctg gtggagtgtg tggggagcac ccggggtgtc 421 ctcatcgagc ccgtctaccc agacatcatc cgcatgatct cagtgaatat cttccggact 481 ctgccgccca gtgagaaccc tgaatttgac cctgaagagg atgagcccaa tcttgagcct 541 tcgtggccac acctgcagct ggtatatgag tttttcctgc gtttcttgga gagcccagac 601 ttccagccct ccgtggccaa gagatatgtg gatcaaaagt ttgtcctgat gctcctggag 661 ctatttgata gtgaggatcc ccgggagcgt gagtacctca agaccatcct gcaccgggtc 721 tatggcaagt tcctgggtct ccgggcctac atccgcaaac agtgcaacca catcttcctc 781 cggttcatct atgaattcga gcacttcaat ggtgtggctg agctgctgga gatcctagga 841 agcatcatca atggctttgc gctgcccctg aagacggagc acaagcagtt cctggttcgc 901 gtcctgatcc ccctgcactc tgtcaagtcg ctgtctgtct tccatgccca gctggcatac 961 tgtgtggtgc agttcctgga gaaggatgcc actctgacag agcacgtgat ccgggggctg 1021 ctcaaatact ggccaaaaac ctgcacccag aaggaggtga tgtttctggg ggagatggaa 1081 gagattcttg atgtcatcga gccctcccag tttgtgaaga tccaggagcc cctttttaag 1141 caggtggctc gctgtgtttc cagcccccat ttccaggttg cagagcgggc tctgtatttc 1201 tggaacaatg agtatatcct aagcctcatt gaggacaact gccacactgt gctgcctgct 1261 gtgtttggga ccctctacca agtctccaag gagcactgga accaaaccat cgtatcactg 1321 atctacaatg tgctcaagac cttcatggag atgaatggga agctgtttga tgagctcaca 1381 gcctcctaca agctggaaaa gcagcaggag cagcagaagg cccaggagcg tcaggagtta 1441 tggcaaggtc tggaggagct gcggctacgc cggctacagg ggacccaggg ggccaaggag 1501 gcccccctcc agcggcttac accccaggtg gccgccagtg ggggtcagag ctagacagca 1561 cctcagaagg ggaaaagcta aacccagagc tgtcagtccc tctatccctt ctcctgtcca 1621 ggggcccaga gagaaacaca cctacccctg gccttgccag agtggcttct gaggactccc 1681 agcccagccc agctttcact ggggggagac gaggagaggc aatggtggtc ttggcaacag 1741 aatgctcagc ccctcgtggc aggacttgac aagggcaagc ttgaccagga agctgccatc 1801 agggatcttc ccctgccccg caaagctagg ctccagctgc aggcgggctc ccaccctctg 1861 ctcctggcct tgggcaaggg cactcagcgc ctcgcctgcc cctgccttgg ccaatgcgag 1921 gtccttcctt atccccacca tggggtccat ggtctattta ttctcgccca gctcaccctc 1981 tacacagaca ctgtcctggg tgcacactcc tcccttccct cgctgtgtac ttccttgtcc 2041 cctttttatt tattgggcag ggggaggggg agggcacagg caagaagaga ttcacagtgt 2101 cctggggtaa gggggggttc acagtaatca tggtctactc ctctttccgt ggctgggggt 2161 agacttaata aagagagaaa ttc // LOCUS HSBHLH 1457 bp RNA PRI 11-SEP-1996 DEFINITION H.sapiens mRNA for B-HLH DNA binding protein. ACCESSION X99268 NID g1495422 KEYWORDS B-HLH protein; H-twist gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1457) AUTHORS Bourgeois,P., Stoetzel,C., Bolcato-Bellemin,A.L., Mattei,M.G. and Perrin-Schmitt,F. TITLE The human H-twist gene is located at 7p21 and encodes a B-HLH protein which is 96% similar to its murine M-twist counterpart JOURNAL Mamm. Genome In press REFERENCE 2 (bases 1 to 1457) AUTHORS Bourgeois,P. TITLE Direct Submission JOURNAL Submitted (05-JUL-1996) P. Bourgeois, LGME du CNRS / U184 de INSERM, Institut de Chimie Biologique, 11 rue Human, 67085 STRASBOURG Cedex, France FEATURES Location/Qualifiers source 1..1457 /organism="Homo sapiens" /strain="caucasian" /db_xref="taxon:9606" /chromosome="7" /clone="pSK+" /clone_lib="lambda EXlox" /dev_stage="adult" /map="p21" /sex="female" /tissue_type="placenta" gene 111..731 /gene="H-twist" CDS 111..731 /gene="H-twist" /codon_start=1 /product="B-HLH DNA binding protein" /db_xref="PID:e254401" /db_xref="PID:g1495423" /translation="MMQDVSSSPVSPADDSLSNSEEEPDRQQPPSAKRGARKRRSSRR SAGGGAGPGGAAGGAVGGGDEPGSPAQGKRGKKSAGCGGGGGAGGGGGGGGGSSSGGG SPQSYEELQTQRVMANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYI DFLYQVLQSDELDSKMASCSYVAHERLSYAFSVWRMEGAWSMSASH" BASE COUNT 378 a 370 c 420 g 289 t ORIGIN 1 tccgttgctg tcggcgcgcg gcggcccggg cgggggaagc tggcgggctg aggcgccccg 61 ctcttctcct ctgccccggg cccgcgaggc cacgcgtcgc cgcacgagag atgatgcagg 121 acgtgtccag ctcgccagtc tcgccggccg acgacagcct gagcaacagc gaggaagagc 181 cagaccggca gcagccgccg agcgcgaagc gcggggcacg caagcggcgc agcagcaggc 241 gcagcgcggg cggcggcgcg gggcccggcg gagccgcggg tggggccgtc ggaggcggcg 301 acgagccggg cagcccggcc cagggcaagc gcggcaagaa gtctgcgggc tgtggcggcg 361 gcggcggcgc gggcggcggc ggcggcggcg gcggcggcag cagcagcggc ggcgggagtc 421 cgcagtctta cgaggagctg cagacgcagc gggtcatggc caacgtgcgg gagcgccagc 481 gcacccagtc gctgaacgag gcgttcgccg cgctgcggaa gatcatcccc acgctgccct 541 cggacaagct gagcaagatt cagaccctca agctggcggc caggtacatc gacttcctct 601 accaggtcct ccagagcgac gagctggact ccaagatggc aagctgcagc tatgtggctc 661 acgagcggct cagctacgcc ttctcggtct ggaggatgga gggggcctgg tccatgtccg 721 cgtcccacta gcagcggagc cccccacccc ctcagcaggg ccggagacct agatgtcatt 781 gtttccagag aaggagaaaa tggacagtct agagactctg gagctggata actaaaaata 841 aaaatatatg ccaaagattt tcttggaaat tagaagagca aaatccaaat tcaaagaaac 901 agggcgtggg gcgcactttt aaaagagaaa gcgagacagg cccgtggaca gtgattccca 961 gacgggcagc gcaccatcct cacatcctct gcattctgat agaagtctga acagttgttt 1021 gtgttttttt tttttttttt ttgacgaaga atgtttttat ttttattttt ttcatgcatg 1081 cattctcaag aggtcgtgcc aatcatcagc cactgaaagg aaaggcatca ctatggactt 1141 tctctatttt aaaatggtaa caatcagagg aactataaga acacctttag aaataaaaat 1201 actgggatca aactggcctg caaaaccata gtcagttaat tctttttttc atccttcctc 1261 tgaggggaaa aacaaaaaaa aacttaaaat acaaaaaata acattctatt tatttattga 1321 ggacccatgg taaatgcaat agtccggtgt ctaaatgcat tcatattttt atgattgttt 1381 tgtaaatatc tttgtatatt tttctgcaat aaataaatat aaaaaattta gagaaaaaaa 1441 aaaaaaaaaa aaaaaaa // LOCUS HSBHRPO77 1521 bp RNA PRI 16-OCT-1997 DEFINITION H.sapiens mRNA for biphenyl hydrolase-related protein. ACCESSION X81372 NID g984662 KEYWORDS biphenyl hydrolase; serine hydrolase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1521) AUTHORS Puente,X.S. and Lopez-Otin,C. TITLE Cloning and expression analysis of a novel human serine hydrolase with sequence similarity to prokaryotic enzymes involved in the degradation of aromatic compounds JOURNAL J. Biol. Chem. 270 (21), 12926-12932 (1995) MEDLINE 95279440 REFERENCE 2 (bases 1 to 1521) AUTHORS Lopez-Otin,C. TITLE Direct Submission JOURNAL Submitted (06-SEP-1994) C. Lopez-Otin, Universidad de Oviedo, Dept de Biologia Funcional, Area de Bioquimica, Fac. de Medicina, C/Julian Claveria S/N, 33006-Oviedo, SPAIN FEATURES Location/Qualifiers source 1..1521 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="breast carcinoma" /clone="O7-7" CDS 213..1037 /codon_start=1 /product="biphenyl hydrolase-related protein" /db_xref="PID:g984663" /db_xref="SWISS-PROT:Q13855" /translation="MPRNLLYSLLSSHLSPHFSTSVTSAKVAVNGVQLHYQQTGEGDH AVLLLPGMLGSGETDFGPQLKNLNKKLFTVVAWDPRGYGHSRPPDRDFPADFFERDAK DAVDLMKALKFKKVSLLGWSDGGITALIAAAKYPSYIHKMVIWGANAYVTDEDSMIYE GIRDVSKWSERTRKPLEALYGYDYFARTCEKWVDGIRQFKHLPDGNICRHLLPRVQCP ALIVHGEKDPLVPRFHADFIHKHVKGSRLHLMPEGKHNLHLRFADEFNKLAEDFLQ" BASE COUNT 400 a 375 c 378 g 368 t ORIGIN 1 ggatccacgt cccacgggcc ggacccgcgg ccgcgttcgg aaatcagcct gagcctgagt 61 accgctaagg ctttaatcac gggtcccgag agccctaagt cttctctttg cttgctgatc 121 tcgtacctta atgtgcaaaa gaatcacgtt gggaactgaa aattcagaat cctgggcctc 181 actcccagag gatctgatct acatgtgtgg agatgcccag gaatctgctt tattctcttt 241 tgtcctccca cctgtccccc catttcagca cctcggtaac ctctgccaaa gtggctgtga 301 atggcgttca gctgcattac cagcagactg gagagggaga tcacgcagtc ctgctacttc 361 ctgggatgtt aggaagtgga gagactgatt ttggacctca gctcaagaac ctcaataaga 421 agctcttcac ggtggtcgcc tgggatcctc gaggctatgg acattccagg cccccagatc 481 gcgatttccc agcagacttt tttgaaaggg atgcaaaaga tgctgttgat ttgatgaagg 541 cgctgaagtt taagaaggtt tctctgctgg ggtggagtga tgggggcata accgcactca 601 ttgctgctgc aaaatatcca tcttacatcc acaagatggt gatctggggc gccaacgcct 661 acgtcactga cgaagacagc atgatatatg agggcatccg agatgtttcc aaatggagtg 721 agagaacaag aaagcctcta gaagccctct atgggtatga ctactttgcc agaacctgtg 781 aaaagtgggt ggatggcata agacagttta aacatctccc agatggtaac atctgccggc 841 acctgctgcc ccgggtccag tgccccgcct tgattgtgca cggtgagaag gatcctctgg 901 tcccacggtt tcatgccgac ttcattcata agcacgtgaa aggctcacgg ctgcatttga 961 tgccagaagg caaacacaac ctgcatttgc gttttgcaga tgaattcaac aagttagcag 1021 aagacttcct acaatgagaa tgcacactcc agtcttggtg gttccttcgt gtggggcttg 1081 atcgtgttgc tgcctgttaa catgatgcct ttgaaactct ccgcctttga aactttctac 1141 ccctcccttc aatcttatcc taaccaaatg agaataatga catattgaaa acagcctcta 1201 gcttcaggct gggcacggtg gctcacagct ataatctcag cactttggga ggctgaggtg 1261 ggagaattgc ctgagcccag gagttcaaga ccagcttgtg caatataggg agactccggc 1321 tctacaaaaa agagtttttc aaaattagcc aggcgaagtg gcacacatct gtggtcccag 1381 gtgctcagga agctgaggtg ggaggatcac ttgagcccaa ttcaaagctg cagtgagctg 1441 taattgcatc actgcactcc aacctgggca acagagtaag accttgtctt aaaaaaaaat 1501 aaaaacataa aaaaaaaaaa a // LOCUS HSBITPTK 2235 bp RNA PRI 22-MAR-1995 DEFINITION H.sapiens mRNA for human lymphoid tyrosine kinase related to murine Blk. ACCESSION Z33998 NID g601951 KEYWORDS protein tyrosine kinase; protein-tyrosine kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2235) AUTHORS Islam,K.B., Rabbani,H., Larsson,C., Sanders,R. and Smith,C.I. TITLE Molecular cloning, characterization, and chromosomal localization of a human lymphoid tyrosine kinase related to murine Blk JOURNAL J. Immunol. 154 (3), 1265-1272 (1995) MEDLINE 95123078 REFERENCE 2 (bases 1 to 2235) AUTHORS Islam,K.B. TITLE Direct Submission JOURNAL Submitted (26-MAY-1994) Khalid B. Islam, Center for BioTechnology, NOVUM, Karolinska, Institute, Halsovagen 7, Huddinge, 141 57, Sweden FEATURES Location/Qualifiers source 1..2235 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="clone BLK 1A.1" /cell_type="B-lymphocyte" /cell_line="Burkitt's lymphoma cell line BL-29" /clone_lib="cDNA library BL-29 of Dr. P. Sideras" /germline CDS 223..1740 /standard_name="Human lymphoid protein tyrosine kinase related to murine BLK" /codon_start=1 /product="Protein-Tyrosine Kinase" /db_xref="PID:g601952" /translation="MGLVSSKKPDKEKPIKEKDKGQWSPLKVSAQDKDAPPLPPLVVF NHLTPPPPDEHLDEDKHFVVALYDYTAMNDRDLQMLKGEKLQVLKGTGDWWLARSLVT GREGYVPSNFVARVESLEMERWFFRSQGRKEAERQLLAPINKAGSFLIRESETNKGAF SLSVKDVTTQGELIKHYKIRCLDEGGYYISPRITFPSLQALVQHYSKKGDGLCQRLTL PCVRPAPQNPWAQDEWEIPRQSLRLVRKLGSGQFGEVWMGYYKNNMKVAIKTLKEGTM SPEAFLGEANMMKALQHERLVRLYAVVTKEPIYIVTEYMARGCLLDFLKTDEGSRLSL PRLIDMSAQIAEGMAYIERMNSIHRDLRAANILVSEALCCKIADFGLARIIDSEYTAQ EGAKFPIKWTAPEAIHFGVFTIKADVWSFGVLLMEVVTYGRVPYPGMSNPEVIRNLER GYRMPRPDTCPPELYRGVIAECWRSRPEERPTFEFLQSVLEDFYTATERQYELQP" polyA_signal 2211..2216 polyA_site 2231 BASE COUNT 484 a 661 c 674 g 416 t ORIGIN 1 gggaggctct gatcgcagac cgggggtgct gccacctctg tctgctgccg gcagaaagcc 61 acaagccatg aaaactgatt gagatgagaa gaattcatct gggactggct tttgctttag 121 gatggtgttg gaagttgctc gttgtcgcta ggagcctgct ccactgtaag ggtgtcggga 181 tctgaagagc tatggtgaaa caccactgaa gcattgccaa ggatggggct ggtaagtagc 241 aaaaagccgg acaaggaaaa gccgatcaaa gagaaggaca agggccaatg gagccccctg 301 aaggtcagcg cccaagacaa ggacgccccg ccactgccgc ccctggttgt cttcaaccac 361 cttactcctc caccgcccga tgaacacctg gatgaagaca agcatttcgt ggtggctctg 421 tatgactaca ccgctatgaa tgatcgggac ctgcagatgc tgaaggggga gaagctacag 481 gtcctgaagg gaactggaga ctggtggctg gccaggtcac tcgtcacagg aagagaaggc 541 tatgtgccca gcaactttgt ggcccgagtg gagagcctgg aaatggaaag gtggttcttt 601 agatcacagg gtcggaagga ggctgagagg cagcttcttg ctccaatcaa caaggccggc 661 tcctttctta tcagagagag tgaaaccaac aaaggtgcct tctccctgtc tgtgaaggat 721 gtcaccaccc agggggagct gatcaagcac tataagatcc gctgcctgga tgaagggggc 781 tactacatct ccccccggat caccttcccc tcgctccagg ccctggtgca gcactattct 841 aagaaggggg atggtctatg ccagaggctg accctgccct gtgtgcgccc ggccccgcag 901 aatccctggg cccaggatga atgggagatc ccccggcagt ctctcaggct ggtcaggaaa 961 ctcgggtctg gacaattcgg cgaagtctgg atgggttact acaaaaacaa catgaaggtg 1021 gccattaaga cgctgaagga gggaaccatg tctccagaag ccttcctggg tgaggccaac 1081 atgatgaagg ctctgcagca cgagcggctg gtccgactct acgcagtggt caccaaggag 1141 cccatctaca ttgtcaccga gtacatggcc agaggatgcc tgctggattt cctgaagaca 1201 gatgaaggga gcagattgtc actcccaagg ctgattgaca tgtcggcgca gattgctgaa 1261 gggatggcat acattgagcg catgaattcc atccaccgcg acctgcgggc ggccaacatc 1321 ctggtgtctg aggccttgtg ctgcaaaatt gctgattttg gcttggctcg aatcatcgac 1381 agtgaataca cggcccaaga gggggccaag ttccccatca agtggacagc cccggaagcc 1441 atccacttcg gggtcttcac catcaaagca gacgtgtggt cgtttggagt cctcctgatg 1501 gaagttgtca cttatgggcg ggtgccatac ccagggatga gcaaccccga ggtcatccgc 1561 aacctggagc gcggctaccg catgccgcgc cccgacacct gcccgcccga gctgtaccgc 1621 ggcgtcatcg ccgagtgctg gcgcagccgg cccgaggagc ggcccacctt cgagttcctg 1681 cagtcggtgc tggaggactt ctacacggcc accgagcggc agtacgagct gcagccctag 1741 ccggccgcgc ccgcctgcgc cccgtgccca cctctgcgcg gacgaccccg acttccgtgc 1801 catcccagac gggccgcgaa ggcggggtgt cgcctgtgcc cttttctcag acccggaatc 1861 cagtgggcag aggcagcttc gcagggggtc cccggacgga ctccttcacc gactgcaccc 1921 ccgggcgagt tacgcggcct ctctgtgccg cttcatttgt agagggctgt aacagtgacc 1981 tcgcacggtc atccggagta ctaagcccca gtaaggtgtt caggactggt aagcgactgt 2041 catcaagtaa ggcccccgtg ctgggcaccc cccgtgctgg ccgcgtcccc gcctctgcgc 2101 cctgcgtgga ccccgccctg ccccgctaca gaagccagac tgggtcccgc ggacgccagc 2161 aggggcaacc ccagcctagg ctgcgctcca gcactgcggg gcttttctgc aataaagtca 2221 cgagcgttcg aaaaa // LOCUS HSBLEO 1932 bp RNA PRI 13-MAY-1996 DEFINITION H.sapiens mRNA for bleomycin hydrolase. ACCESSION X92106 NID g1321857 KEYWORDS bleomycin hydrolase; cysteine protease. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1932) AUTHORS Ferrando,A.A., Velasco,G., Campo,E. and Lopez-Otin,C. TITLE Cloning and expression analysis of human bleomycin hydrolase, a cysteine proteinase involved in chemotherapy resistance JOURNAL Cancer Res. 56 (8), 1746-1750 (1996) MEDLINE 96184962 REFERENCE 2 (bases 1 to 1932) AUTHORS Lopez-Otin,C. TITLE Direct Submission JOURNAL Submitted (09-OCT-1995) C. Lopez-Otin, Universidad de Oviedo, Dept de Biologie Funcional, Area de Bioquimica, Facultad de Medicina, C/ Julian Claveria S/N, 33006-Oviedo, SPAIN FEATURES Location/Qualifiers source 1..1932 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /clone_lib="lambda gt11" /clone="1-1" CDS 79..1446 /codon_start=1 /product="bleomycin hydrolase" /db_xref="PID:e205512" /db_xref="PID:g1321858" /translation="MSSSGLNSEKVAALIQKLNSDPQFVLAQNVGTTHDLLDICLKRA TVQRAQHVFQHAVPQEGKPITNQKSSGRCWIFSCLNVMRLPFMKKLNIEEFEFSQSYL FFWDKVERCYFFLSAFVDTAQRKEPEDGRLVQFLLMNPANDGGQWDMLVNIVEKYGVI PKKCFPESYTTEATRRMNDILNHKMREFCIRLRNLVHSGATKGEISATQDVMMEEIFR VVCICLGNPPETFTWEYRDKDKNYQKIGPITPLEFYREHVKPLFNMEDKICLVNDPRP QHKYNKLYTVEYLSNMVGGRKTLYNNQPIDFLKKMVAASIKDGEAVWFGCDVGKHFNS KLGLSDMNLYDHELVFGVSLKNMNKAERLTFGESLMTHAMTFTAVSEKDDQDGAFTKW RVENSWGEDHGHKGYLCMTDEWFSEYVYEVVVDRKHVPEEVLAVLEQEPIILPAWDPM GALAE" BASE COUNT 536 a 412 c 504 g 480 t ORIGIN 1 cagcgagccg cagcgcaatc ccggcgctcg cccaaggacc ctggaagcta ccgttacccc 61 gccggcagcg tgggcgccat gagcagctcg ggactgaatt cggagaaggt agctgctctg 121 atacagaaac tgaattccga cccccagttc gtacttgccc agaatgtcgg gaccacccac 181 gacctgctgg acatctgtct gaagcgggcc acggtgcagc gcgcgcagca tgtgttccag 241 cacgccgtgc cccaggaggg caagccaatc accaaccaga agagctcagg gcgatgctgg 301 atcttttctt gtctgaatgt tatgaggctt ccattcatga aaaagttaaa tattgaagaa 361 tttgagttta gccaatctta cctgtttttt tgggacaagg ttgaacgctg ttatttcttc 421 ttgagtgctt ttgtggacac agcccagaga aaggagcctg aggatgggag gctggtgcag 481 tttttgctta tgaaccctgc aaatgatggt ggccaatggg atatgcttgt taatattgtt 541 gaaaaatatg gtgttatccc taagaaatgc ttccctgaat cttatacaac agaggcaacc 601 agaaggatga atgatattct gaatcacaag atgagagaat tctgtatacg actgcggaac 661 ctggtacaca gtggagcaac caaaggagaa atctcggcca cacaggacgt catgatggag 721 gagatattcc gagtggtgtg catctgtttg ggtaatccac cagagacatt cacctgggaa 781 tatcgagaca aagataaaaa ttatcagaaa attggcccca taacaccctt ggagttttac 841 agggaacatg tcaagccact cttcaatatg gaagataaga tttgtttagt gaatgaccct 901 aggccccagc acaagtacaa caaactttac acagtggaat acttaagcaa tatggttggt 961 gggagaaaaa ctctatacaa caaccagccc attgacttcc tgaaaaagat ggttgctgcc 1021 tccatcaaag atggagaggc tgtgtggttt ggctgtgatg ttggaaaaca cttcaatagc 1081 aagctgggcc tcagtgacat gaatctctat gaccatgagt tagtgtttgg tgtctccttg 1141 aagaacatga ataaagcgga gaggctgact tttggtgagt cacttatgac ccacgccatg 1201 accttcactg ctgtctcaga gaaggatgat caggatggtg ctttcacaaa atggagagtg 1261 gagaattcat ggggtgaaga ccatggccac aaaggttacc tgtgcatgac agatgagtgg 1321 ttctctgagt atgtctacga agtggtggtg gacaggaagc atgtccctga agaggtgcta 1381 gctgtgttag agcaggaacc cattatcctg ccagcatggg accccatggg agctttggct 1441 gagtgatact gccctccagc tctttcctcc ttccatggaa cctgacgtag ctgcaaagga 1501 cagatccagg gactgaagcc aaagttatgc aagggactgt gtgttgccac aggacacagt 1561 cagatttcca gtctccacca ggaacctctt cagaaagtgt gctttatgct gaaacagaat 1621 actgttaaag gaaaaaaaag aggggggaag atcaggtcat actatctact ctcctcatct 1681 ctaacagctc aggatctctt agcattttaa ttagatgtaa ttgtttgtct ttaactgtca 1741 aaaagtttgg ttctgtgtct gtgttttaat aagacgagag gacgagcgat tgaggtgtat 1801 ggagagaaaa cagacctaat gctccttgtt cctagagtag agtggaggga gggtggccta 1861 agagttgagc tctcggaact gcatgctgct ggacagtatc actgtctttc ctagatggca 1921 gtcactgaat tc // LOCUS HSBM28 3379 bp RNA PRI 21-OCT-1996 DEFINITION H.sapiens BM28 mRNA. ACCESSION X67334 NID g468703 KEYWORDS BM28 polypeptide. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3379) AUTHORS Todorov,I.T., Pepperkok,R., Philipova,R.N., Kearsey,S.E., Ansorge,W. and Werner,D. TITLE A human nuclear protein with sequence homology to a family of early S phase proteins is required for entry into S phase and for cell division JOURNAL J. Cell. Sci. 107 (Pt 1), 253-265 (1994) MEDLINE 94230605 REFERENCE 2 (bases 1 to 3379) AUTHORS Todorov,I. TITLE Direct Submission JOURNAL Submitted (17-JUL-1992) I. Todorov, German Cancer Res Center, Div of Cellular Biochemestry, Im Neuenheimer Feld 280, D-6900 Heidelberg, FRG REMARK revised by [3] REFERENCE 3 (bases 1 to 3379) AUTHORS Todorov,I. TITLE Direct Submission JOURNAL Submitted (29-MAR-1994) I. Todorov, German Cancer Res Center, Div of Cellular Biochemestry, Im Neuenheimer Feld 280, D-6900 Heidelberg, FRG REFERENCE 4 (bases 1 to 3379) AUTHORS Mincheva,A., Todorov,I., Werner,D., Fink,T.M. and Lichter,P. TITLE The human gene for nuclear protein BM28 (CDCL1), a new member of the early S-phase family of proteins, maps to chromosome band 3q21 JOURNAL Cytogenet. Cell Genet. 65 (4), 276-277 (1994) MEDLINE 94080845 REFERENCE 5 (bases 1 to 3379) AUTHORS Ishimi,Y., Ichinose,S., Omori,A., Sato,K. and Kimura,H. TITLE Binding of human minichromosome maintenance proteins with histone H3 JOURNAL J. Biol. Chem. 271 (39), 24115-24122 (1996) MEDLINE 96394544 COMMENT Related sequence: X53539. FEATURES Location/Qualifiers source 1..3379 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Colon carcinoma cell line CaCO2 (ATTC HTB37)" /clone_lib="Lambda gt11 cDNA" /clone="lambda NrMCC11" /chromosome="3" /map="3q21" gene 31..2709 /gene="BM28" CDS 31..2709 /gene="BM28" /codon_start=1 /product="polypeptide BM28" /db_xref="PID:g468704" /db_xref="SWISS-PROT:P49736" /translation="MASSPAQRRRGNDPLTSSPGRSSRRTDALTSSPGRDLPPFEDES EGLLGTEGPLEEEEDGEELIGDGMERDYRAIPELDAYEAEGLALDDEDVEELTASRRE AADGPCGTVTGSWPGLGACAVGSCMTAMRRTRSALPASAASGAGTEDGEEDEQMIESI ENLEDLKGHSVREWVSMAGPRLEIHHRFKNFLRTHVDSHGHNVFKERISDMCKENRES LVVNYEDLAAREHVLAYFLPEAPAELLQIFDEAALEVVLAMYPKYDRITNHIHVRISH LPLVEELRSLRQLHLNQLIRTSGVVTSCTGVLPQLSMVKYNCNKCNFVLGPFCQSQNQ EVKPGSCPECQSAGPFEVNMEETIYQNYQRIRIQESPGKVAARRLPRSKDAILLADLV DSCNAGDEIELTGIYHNNYDGSLNTANGFPVFATVILANHVAKKDNKVAVGELTDEDV KMITSLSKDQQIGEKIFASIAPSIYGHEDIKRGPALALFGGEPKNPGGKHKVRGDINV LLCGDPGTAKSQFLKYIEKVSSRAIFTTGQGASAVAVTAYVQRHPVSREWTLEAGALV LADRGVCLIDEFDKMNDQDRTSIHEAMEQQSISISKAGIVTSLQARCTVIAAANPIGG RYDPSLTFSENVDLTEPIISRFDILCVVRDTVDPVQDEMLARFVVGSHVRHHPSNKEE EGLANGSAAEPAMPNTYGVEPLPQEVLKKYIIYAKERVHPKLNQMDQDKVAKMYSDLR KESMATGSIPITVRHIESMSHGGGPRAHPSAGLCDRRRRQHGHPRDAGELHRHTEVQR HRSMRKTFARYLSFRRDNNELLLFILKQLVAEQVTYQRNRFGAQQDTIEVPEKDLVDK ARQINIHNLSAFYDSELFRMNKFSHDLKRKMILQQF" BASE COUNT 749 a 936 c 993 g 701 t ORIGIN 1 aattccgcgg aatcatcgga atccttcacc atggcatcca gcccggccca gcgtcggcga 61 ggcaatgatc ctctcacctc cagccctggc cgaagctccc ggcgtactga tgccctcacc 121 tccagccctg gccgtgacct tccaccattt gaggatgagt ccgaggggct cctaggcaca 181 gaggggcccc tggaggaaga agaggatgga gaggagctca ttggagatgg catggaaagg 241 gactaccgcg ccatcccaga gctggacgcc tatgaggccg agggactggc tctggatgat 301 gaggacgtag aggagctgac ggccagtcga agggaggcag cagacgggcc atgcggcacg 361 gtgaccggga gctggccggg gctgggcgca tgcgccgtgg gctcctgtat gacagcgatg 421 aggaggacga ggagcgccct gcccgcaagc gccgccagtg gagccggcac ggaggacggc 481 gaggaggacg agcagatgat tgagagcatc gagaacctgg aggatctcaa aggccactct 541 gtgcgcgagt gggtgagcat ggcgggcccc cggctggaga tccaccaccg cttcaagaac 601 ttcctgcgca ctcacgtcga cagccacggc cacaacgtct tcaaggagcg catcagcgac 661 atgtgcaaag agaaccgtga gagcctggtg gtgaactatg aggacttggc agccagggag 721 cacgtgctgg cctacttcct gcctgaggca ccggcggagc tgctgcagat ctttgatgag 781 gctgccctgg aggtggtact ggccatgtac cccaagtacg accgcatcac caaccacatc 841 catgtccgca tctcccacct gcctctggtg gaggagctgc gctcgctgag gcagctgcat 901 ctgaaccagc tgatccgcac cagtggggtg gtgaccagct gcactggcgt cctgccccag 961 ctcagcatgg tcaagtacaa ctgcaacaag tgcaatttcg tcctgggtcc tttctgccag 1021 tcccagaacc aggaggtgaa accaggctcc tgtcctgagt gccagtcggc cggccccttt 1081 gaggtcaaca tggaggagac catctatcag aactaccagc gtatccgaat ccaggagagt 1141 ccaggcaaag tggcggctcg gcggctgccc cgctccaagg acgccattct cctcgcagat 1201 ctggtggaca gctgcaacgc aggagacgag atagagctga ctggcatcta tcacaacaac 1261 tatgatggct ccctcaacac tgccaatggc ttccctgtct ttgccactgt catcctagcc 1321 aaccacgtgg ccaagaagga caacaaggtt gctgtagggg aactgaccga tgaagatgtg 1381 aagatgatca ctagcctctc caaggatcag cagatcggag agaagatctt tgccagcatt 1441 gctccttcca tctatggtca tgaagacatc aagagaggcc ctgctctggc cctgttcgga 1501 ggggagccca aaaacccagg tggcaagcac aaggtacgtg gtgatatcaa cgtgctcttg 1561 tgcggagacc ctggcacagc gaagtcgcag tttctcaagt atattgagaa agtgtccagc 1621 cgagccatct tcaccactgg ccagggggcg tcggctgtgg ccgtcacggc gtatgtccag 1681 cggcaccctg tcagcaggga gtggaccttg gaggctgggg ccctggttct ggctgaccga 1741 ggagtgtgtc tcattgatga atttgacaag atgaatgacc aggacagaac cagcatccat 1801 gaggccatgg agcaacagag catctccatc tcgaaggctg gcatcgtcac ctccctgcag 1861 gctcgctgca cggtcattgc tgccgccaac cccataggag ggcgctacga cccctcgctg 1921 actttctctg agaacgtgga cctcacagag cccatcatct cacgctttga catcctgtgt 1981 gtggtgaggg acaccgtgga cccagtccag gacgagatgc tggcccgctt cgtggtgggc 2041 agccacgtca gacaccaccc cagcaacaag gaggaggagg ggctggccaa tggcagcgct 2101 gctgagcccg ccatgcccaa cacgtatggc gtggagcccc tgccccagga ggtcctgaag 2161 aagtacatca tctacgccaa ggagagggtc cacccgaagc tcaaccagat ggaccaggac 2221 aaggtggcca agatgtacag tgacctgagg aaagaatcta tggcgacagg cagcatcccc 2281 attacggtgc ggcacatcga gtccatgagt catggcggag gcccacgcgc gcatccatct 2341 gcgggactat gtgatcgaag acgacgtcaa catggccatc cgcgtgatgc tggagagctt 2401 catagacaca cagaagttca gcgtcatcgc agcatgcgca agacttttgc ccgctacctt 2461 tcattccggc gtgacaacaa tgagctgttg ctcttcatac tgaagcagtt agtggcagag 2521 caggtgacat atcagcgcaa ccgctttggg gcccagcagg acactattga ggtccctgag 2581 aaggacttgg tggataaggc tcgtcagatc aacatccaca acctctctgc attttatgac 2641 agtgagctct tcaggatgaa caagttcagc cacgacctga aaaggaaaat gatcctgcag 2701 cagttctgag gccctatgcc atccataagg attccttggg attctggttt ggggtggtca 2761 gtgccctctg tgctttatgg acacaaaacc agagcacttg atgaactcgg ggtactaggg 2821 tcagggctta tagcaggatg tctggctgca cctggcatga ctgtttgttt ctccaagcct 2881 gctttgtgct tctcaccttt gggtgggatg ccttgccagt gtgtcttact tggttgctga 2941 acatcttgcc acctccgagt gctttgtctc cactcagtac cttggatcag agctgctgag 3001 ttcaggatgc ctgcgtgtgg tttaggtgtt agccttctta catggatgtc aggagagctg 3061 ctgccctctt ggcgtgagtt gcgtattcag gctgcttttg ctcgctttgg ccagagagct 3121 ggttgaagat gtttgtaatc gttttcagtc tcctgcaggt ttctgtgccc ctgtggtgga 3181 agaggcacga cagtgccagc gcagcgttct gggctcctca gtcgcagggg tgggatgtga 3241 gtcatgcgga ttatccactc gccacagtta tcagctgcca ttgctccctg tctgtttccc 3301 cactctctta tttgtgcatt cggtttggtt tctgtagttt taatttttaa taaagttgaa 3361 taaaatataa aaaaaaaaa // LOCUS HSBMXGENE 2456 bp RNA PRI 22-AUG-1995 DEFINITION H.sapiens Bmx mRNA for cytoplasmic tyrosine kinase. ACCESSION X83107 NID g951234 KEYWORDS cytoplasmic; Tyrosine kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2456) AUTHORS Tamagnone,L., Lahtinen,I., Mustonen,T., Virtaneva,K., Francis,F., Muscatelli,F., Alitalo,R., Smith,C.I., Larsson,C. and Alitalo,K. TITLE BMX, a novel nonreceptor tyrosine kinase gene of the BTK/ITK/TEC/TXK family located in chromosome Xp22.2 JOURNAL Oncogene 9 (12), 3683-3688 (1994) MEDLINE 95060827 REFERENCE 2 (bases 1 to 2456) AUTHORS Tamagnone,L. TITLE Direct Submission JOURNAL Submitted (01-DEC-1994) L. Tamagnone, University of Helsinki, Molecular/Cancer Biology Lab., PL21 (Haartmaninkatu 3), 00014 Helsinki, FINLAND COMMENT Related sequence: U08341. FEATURES Location/Qualifiers source 1..2456 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="bone marrow" /clone_lib="endothelial cell cDNA" /chromosome="X" /map="p22.2" gene 34..2061 /gene="bmx" CDS 34..2061 /gene="bmx" /codon_start=1 /db_xref="PID:g951235" /translation="MDTKSILEELLLKRSQQKKKMSPNNYKERLFVLTKTNLSYYEYD KMKRGSRKGSIEIKKIRCVEKVNLEEQTPVERQYPFQIVYKDGLLYVYASNEESRSQW LKALQKEIRGNPHLLVKYHSGFFVDGKFLCCQQSCKAAPGCTLWEAYANLHTAVNEEK HRVPTFPDRVLKIPRAVPVLKMDAPSSSTTLAQYDNESKKNYGSQPPSSSTSLAQYDS NSKKIYGSQPNFNMQYIPREDFPDWWQVRKLKSSSSSEDVASSNQKERNVNHTTSKIS WEFPESSSSEEEENLDDYDWFAGNISRSQSEQLLRQKGKEGAFMVRNSSQVGMYTVSL FSKAVNDKKGTVKHYHVHTNAENKLYLAENYCFDSIPKLIHYHQHNSAGMITRLRHPV STKANKVPDSVSLGNGIWELKREEITLLKELGSGQFGVVQLGKWKGQYDVAVKMIKEG SMSEDEFFQEAQTMMKLSHPKLVKFYGVCSKEYPIYIVTEYISNGCLLNYLRSHGKGL EPSQLLEMCYDVCEGMAFLESHQFIHRDLAARNCLVDRDLCVKVSDFGMTRYVLDDQY VSSVGTKFPVKWSAPEVFHYFKYSSKSDVWAFGILMWEVFSLGKQPYDLYDNSQVVLK VSQGHRLYRPHLASDTIYQIMYSCWHELPEKRPTFQQLLSSIEPLREKDKH" BASE COUNT 805 a 495 c 549 g 607 t ORIGIN 1 gcaagcacgg aacaagctga gacggatgat aatatggata caaaatctat tctagaagaa 61 cttcttctca aaagatcaca gcaaaagaag aaaatgtcac caaataatta caaagaacgg 121 ctttttgttt tgaccaaaac aaacctttcc tactatgaat atgacaaaat gaaaaggggc 181 agcagaaaag gatccattga aattaagaaa atcagatgtg tggagaaagt aaatctcgag 241 gagcagacgc ctgtagagag acagtaccca tttcagattg tctataaaga tgggcttctc 301 tatgtctatg catcaaatga agagagccga agtcagtggt tgaaagcatt acaaaaagag 361 ataaggggta acccccacct gctggtcaag taccatagtg ggttcttcgt ggacgggaag 421 ttcctgtgtt gccagcagag ctgtaaagca gccccaggat gtaccctctg ggaagcatat 481 gctaatctgc atactgcagt caatgaagag aaacacagag ttcccacctt cccagacaga 541 gtgctgaaga tacctcgggc agttcctgtt ctcaaaatgg atgcaccatc ttcaagtacc 601 actctagccc aatatgacaa cgaatcaaag aaaaactatg gctcccagcc accatcttca 661 agtaccagtc tagcgcaata tgacagcaac tcaaagaaaa tctatggctc ccagccaaac 721 ttcaacatgc agtatattcc aagggaagac ttccctgact ggtggcaagt aagaaaactg 781 aaaagtagca gcagcagtga agatgttgca agcagtaacc aaaaagaaag aaatgtgaat 841 cacaccacct caaagatttc atgggaattc cctgagtcaa gttcatctga agaagaggaa 901 aacctggatg attatgactg gtttgctggt aacatctcca gatcacaatc tgaacagtta 961 ctcagacaaa agggaaaaga aggagcattt atggttagaa attcgagcca agtgggaatg 1021 tacacagtgt ccttatttag taaggctgtg aatgataaaa aaggaactgt caaacattac 1081 cacgtgcata caaatgctga gaacaaatta tacctggcag aaaactactg ttttgattcc 1141 attccaaagc ttattcatta tcatcaacac aattcagcag gcatgatcac acggctccgc 1201 caccctgtgt caacaaaggc caacaaggtc cccgactctg tgtccctggg aaatggaatc 1261 tgggaactga aaagagaaga gattaccttg ttgaaggagc tgggaagtgg ccagtttgga 1321 gtggtccagc tgggcaagtg gaaggggcag tatgatgttg ctgttaagat gatcaaggag 1381 ggctccatgt cagaagatga attctttcag gaggcccaga ctatgatgaa actcagccat 1441 cccaagctgg ttaaattcta tggagtgtgt tcaaaggaat accccatata catagtgact 1501 gaatatataa gcaatggctg cttgctgaat tacctgagga gtcacggaaa aggacttgaa 1561 ccttcccagc tcttagaaat gtgctacgat gtctgtgaag gcatggcctt cttggagagt 1621 caccaattca tacaccggga cttggctgct cgtaactgct tggtggacag agatctctgt 1681 gtgaaagtat ctgactttgg aatgacaagg tatgttcttg atgaccagta tgtcagttca 1741 gtcggaacaa agtttccagt caagtggtca gctccagagg tgtttcatta cttcaaatac 1801 agcagcaagt cagacgtatg ggcatttggg atcctgatgt gggaggtgtt cagcctgggg 1861 aagcagccct atgacttgta tgacaactcc caggtggttc tgaaggtctc ccagggccac 1921 aggctttacc ggccccacct ggcatcggac accatctacc agatcatgta cagctgctgg 1981 cacgagcttc cagaaaagcg tcccacattt cagcaactcc tgtcttccat tgaaccactt 2041 cgggaaaaag acaagcattg aagaagaaat taggagtgct gataagaatg aatatagatg 2101 ctggccagca ttttcattca ttttaaggaa agtaggaagg cataagtaat tttagctagt 2161 ttttaatagt gttctctgta ttgtctatta tttagaaatg aacaaggcag gaaacaaaag 2221 attcccttga aatttagatc aaattagtaa ttttgtttta tgctgctcct gatataacac 2281 tttccagcct atagcagaag cacattttca gactgcaata tagagactgt gttcatgtgt 2341 aaagactgag cagaactgaa aaattactta ttggatattc attcttttct ttatattgtc 2401 attgtcacaa caattaaata tactaccaag tacagaaatg tggaaaaaaa aaaccg // LOCUS HSBMYB 2627 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for B-myb gene. ACCESSION X13293 NID g29471 KEYWORDS B-myb gene; myb homologue. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2627) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (12-OCT-1988) Nomura N., Molecular Oncology Laboratory, Nippon Veterinary and Zootechnical College, Sakuragi, 1-10-19 Uenosakuragi, Taito-ku, Tokyo 110, Japan REFERENCE 2 (bases 1 to 2627) AUTHORS Nomura,N., Takahashi,M., Matsui,M., Ishii,S., Date,T., Sasamoto,S. and Ishizaki,R. TITLE Isolation of human cDNA clones of myb-related genes, A-myb and B-myb JOURNAL Nucleic Acids Res. 16 (23), 11075-11089 (1988) MEDLINE 89083548 REMARK Erratum:[Nucleic Acids Res 1989 Feb 11;17(3):1282]] COMMENT The B-myb protein is homologous with the myb protein in several regions, especially in the amino terminal domain. FEATURES Location/Qualifiers source 1..2627 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T-cell" /clone_lib="T-cell(HPB-MLT)" /clone="lambda-Bmyb1" CDS 128..2230 /note="B-myb protein (AA 1-700)" /codon_start=1 /db_xref="PID:g29472" /db_xref="SWISS-PROT:P10244" /translation="MSRRTRCEDLDELHYQDTDSDVPEQRDSKCKVKWTHEEDEQLRA LVRQFGQQDWKFLASHFPNRTDQQCQYRWLRVLNPDLVKGPWTKEEDQKVIELVKKYG TKQWTLIAKHLKGRLGKQCRERWHNHLNPEVKKSCWTEEEDRIICEAHKVLGNRWAEI AKMLPGRTDNAVKNHWNSTIKRKVDTGGFLSESKDCKPPVYLLLELEDKDGLQSAQPT EGQGSLLTNWPSVPPTIKEEENSEEELAAATTSKEQEPIGTDLDAVRTPEPLEEFPKR EDQEGSPPETSLPYKWVVEAANLLIPAVGSSLSEALDLIESDPDAWCDLSKFDLPEEP SAEDSINNSLVQLQASHQQQVLPPRQPSALVPSVTEYRLDGHTISDLSRSSRGELIPI SPSTEVGGSGIGTPPSVLKRQRKRRVALSPVTENSTSLSFLDSCNSLTPKSTPVKTLP FSPSQFLNFWNKQDTLELESPSLTSTPVCSQKVVVTTPLHRDKTPLHQKHAAFVTPDQ KYSMDNTPHTPTPFKNALEKYGPLKPLPQTPHLEEDLKEVLRSEAGIELIIEDDIRPE KQKRKPGLRRSPIKKVRKSLALDIVDEDVKLMMSTLPKSLSLPTTAPSNSSSLTLSGI KEDNSLLNQGFLQAKPEKAAVAQKPRSHFTTPAPMSSAWKTVACGGTRDQLFMQEKAR QLLGRLKPSHTSRTLILS" misc_feature 2606..2611 /note="polyA signal" BASE COUNT 594 a 806 c 769 g 458 t ORIGIN 1 gctgacgcct tcgagcgcgg cccggggccc ggagcggccg gagcagcccg ggtcctgacc 61 ccggcccggc tcccgctccg ggctctgccg gcgggcgggc gagcgcggcg cggtccgggc 121 cggggggatg tctcggcgga cgcgctgcga ggatctggat gagctgcact accaggacac 181 agattcagat gtgccggagc agagggatag caagtgcaag gtcaaatgga cccatgagga 241 ggacgagcag ctgagggccc tggtgaggca gtttggacag caggactgga agttcctggc 301 cagccacttc cctaaccgca ctgaccagca atgccagtac aggtggctga gagttttgaa 361 tccagacctt gtcaaggggc catggaccaa agaggaagac caaaaagtca tcgagctggt 421 taagaagtat ggcacaaagc agtggacact gattgccaag cacctgaagg gccggctggg 481 gaagcagtgc cgtgaacgct ggcacaacca cctcaaccct gaggtgaaga agtcttgctg 541 gaccgaggag gaggaccgca tcatctgcga ggcccacaag gtgctgggca accgctgggc 601 cgagatcgcc aagatgttgc cagggaggac agacaatgct gtgaagaatc actggaactc 661 taccatcaaa aggaaggtgg acacaggagg cttcttgagc gagtccaaag actgcaagcc 721 cccagtgtac ttgctgctgg agctcgagga caaggacggc ctccagagtg cccagcccac 781 ggaaggccag ggaagtcttc tgaccaactg gccctccgtc cctcctacca taaaggagga 841 ggaaaacagt gaggaggaac ttgcagcagc caccacatcg aaggaacagg agcccatcgg 901 tacagatctg gacgcagtgc gaacaccaga gcccttggag gaattcccga agcgtgagga 961 ccaggaaggc tccccaccag aaacgagcct gccttacaag tgggtggtgg aggcagctaa 1021 cctcctcatc cccgctgtgg gttctagcct ctctgaagcc ctggacttga tcgagtcgga 1081 ccctgatgct tggtgtgacc tgagtaaatt tgacctccct gaggaaccat ctgcagagga 1141 cagtatcaac aacagcctag tgcagctgca agcgtcacat cagcagcaag tcctgccacc 1201 ccgccagcct tccgccctgg tgcccagtgt gaccgagtac cgcctggatg gccacaccat 1261 ctcagacctg agccggagca gccggggcga gctgatcccc atctccccca gcactgaagt 1321 cgggggctct ggcattggca caccgccctc tgtgctcaag cggcagagga agaggcgtgt 1381 ggctctgtcc cctgtcactg agaatagcac cagtctgtcc ttcctggatt cctgtaacag 1441 cctcacgccc aagagcacac ctgttaagac cctgcccttc tcgccctccc agtttctgaa 1501 cttctggaac aaacaggaca cattggagct ggagagcccc tcgctgacat ccaccccagt 1561 gtgcagccag aaggtggtgg tcaccacacc actgcaccgg gacaagacac ccctgcacca 1621 gaaacatgct gcgtttgtaa ccccagatca gaagtactcc atggacaaca ctccccacac 1681 gccaaccccg ttcaagaacg ccctggagaa gtacggaccc ctgaagcccc tgccacagac 1741 cccgcacctg gaggaggact tgaaggaggt gctgcgttct gaggctggca tcgaactcat 1801 catcgaggac gacatcaggc ccgagaagca gaagaggaag cctgggctgc ggcggagccc 1861 catcaagaaa gtccggaagt ctctggctct tgacattgtg gatgaggatg tgaagctgat 1921 gatgtccaca ctgcccaagt ctctatcctt gccgacaact gccccttcaa actcttccag 1981 cctcaccctg tcaggtatca aagaagacaa cagcttgctc aaccagggct tcttgcaggc 2041 caagcccgag aaggcagcag tggcccagaa gccccgaagc cacttcacga cacctgcccc 2101 tatgtccagt gcctggaaga cggtggcctg cggggggacc agggaccagc ttttcatgca 2161 ggagaaagcc cggcagctcc tgggccgcct gaagcccagc cacacatctc ggaccctcat 2221 cttgtcctga ggtgttgagg gtgtcacgag cccattctca tgtttacagg ggttgtgggg 2281 gcagaggggg tctgtgaatc tgagagtcat tcaggtgacc tcctgcaggg agccttctgc 2341 caccagcccc tccccagact ctcaggtgga ggcaacaggg ccatgtgctg ccctgttgcc 2401 gagcccagct gtgggcggct cctggtgcta acaacaaagt tccacttcca ggtctgcctg 2461 gttccctccc caaggccaca gggagctccg tcagcttctc ccaagcccac gtcaggcctg 2521 gcctcatctc agaccctgct taggatgggg gatgtggcca ggggtgctcc tgtgctcacc 2581 ctctcttggt gcattttttt ggaagaataa aattgcctct ctctttg // LOCUS HSBPGMR 1673 bp RNA PRI 12-SEP-1993 DEFINITION Human erythrocyte 2,3-bisphosphoglycerate mutase mRNA EC 2.7.5.4. ACCESSION X04327 NID g29480 KEYWORDS bisphosphoglycerate mutase; multifunctional enzyme. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1673) AUTHORS Joulin,V., Peduzzi,J., Romeo,P.H., Rosa,R., Valentin,C., Dubart,A., Lapeyre,B., Blouquit,Y., Garel,M.C., Goossens,M., Rosa,J. and Cohen-Solal,M. TITLE Molecular cloning and sequencing of the human erythrocyte 2,3-bisphosphoglycerate mutase cDNA: revised amino acid sequence JOURNAL EMBO J. 5 (9), 2275-2283 (1986) MEDLINE 87053869 COMMENT Data kindly reviewed (17-MAR-1988)) by Cohen-Solal M. FEATURES Location/Qualifiers source 1..1673 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..>1673 /note="BPDGM mRNA" CDS 111..890 /note="2,3 biphosphoglycerated mutase (AA 1 - 259)" /codon_start=1 /db_xref="PID:g29481" /db_xref="SWISS-PROT:P07738" /translation="MSKYKLIMLRHGEGAWNKENRFCSWVDQKLNSEGMEEARNCGKQ LKALNFEFDLVFTSVLNRSIHTAWLILEELGQEWVPVESSWRLNERHYGALIGLNREQ MALNHGEEQVRLWRRSYNVTPPPIEESHPYYQEIYNDRRYKVCDVPLDQLPRSESLKD VLERLLPYWNERIAPEVLRGKTILISAHGNSSRALLKHLEGISDEDIINITLPTGVPI LLELDENLRAVGPHQFLGDQEAIQAAIKKVEDQGKVKQAKK" misc_feature 1551..1556 /note="pot. polyadenylation signal" misc_feature 1631..1636 /note="pot. polyadenylation signal" misc_feature 1647..1652 /note="pot. polyadenylation signal" BASE COUNT 495 a 324 c 390 g 464 t ORIGIN 1 ggaggagcgg ctgctgctgc tgctgctgct gctggtggcc cctttgcaga tgtattgctg 61 tccttgaata ttagcccatt tgaaaacgcc tgggaagttc agccatcagt atgtccaagt 121 acaaacttat tatgttaaga catggagagg gtgcttggaa taaggagaac cgtttttgta 181 gctgggtgga tcagaaactc aacagcgaag gaatggagga agctcggaac tgtgggaagc 241 aactcaaagc gttaaacttt gagtttgatc ttgtattcac atctgtcctt aatcggtcca 301 ttcacacagc ctggctgatc ctggaagagc taggccagga atgggtgcct gtggaaagct 361 cctggcgtct aaatgagcgt cactatgggg ccttgatcgg tctcaacagg gagcagatgg 421 ctttgaatca tggtgaagaa caagtgaggc tctggagaag aagctacaat gtaaccccgc 481 ctcccattga ggagtctcat ccttactacc aagaaatcta caacgaccgg aggtataaag 541 tatgcgatgt gcccttggat caactgccac ggtcggaaag cttaaaggat gttctggaga 601 gactccttcc ctattggaat gaaaggattg ctcccgaagt attacgtggc aaaaccattc 661 tgatatctgc tcatggaaat agcagtaggg cactcctaaa acacctggaa ggtatctcag 721 atgaagacat catcaacatt actcttccta ctggagtccc cattcttctg gaattggatg 781 aaaacctgcg tgctgttggg cctcatcagt tcctgggtga ccaagaggcg atccaagcag 841 ccattaagaa agtagaagat caaggaaaag tgaaacaagc taaaaaatag tctttctcaa 901 ctgttggcta agaagaaatg caaaagaagt ggcataggag tgtgttatgg gtgctgaact 961 ctctctcttt ttccccgatt ttccagagct aggctgtgga gtagagtttg tataggtaac 1021 taggtaactt attgtggccc agataaggct ttaggatgcc tcagtgctta tgtcatagcc 1081 ttatgagtta gctttcttgc tagcccccta gtcggtcacc aaactagtaa ctagtggggc 1141 ttaatgaagg tcataagttt ctgagatggg agagcaacaa gtagagatga agttaaaggt 1201 atttatcatt caagaaatca ttattgagtc accaattgac aggcactatt ctaatcagta 1261 gttcacttta atatttaata agattttctg ggataacagt aagggatatt agataatata 1321 ccgtatgtat ttattactag tcttttcctc taggaaaagg gatactttga taattaaggc 1381 cagaggccca ttagttgaga aagtcacaga tatatttctc caagaaagcc aacaaccacc 1441 accacaatga cagaaatgac aacaaggccc tttaacttgt cttctagttt agagacatcc 1501 ttcatttgac atttagtaga attcctcttt ggccacaaga ataagcagca aataaacaac 1561 tatggctgtt gaggttctca ttttggtttg ttttaatttt ttgaactttg ggtacctgta 1621 attagtttaa aaataaagtt cctgataata aagtgactga aaatggcatc ccc // LOCUS HSBRACHYT 2180 bp RNA PRI 09-OCT-1997 DEFINITION Homo sapiens mRNA for Brachyury (T) protein. ACCESSION AJ001699 NID g2558580 KEYWORDS Brachyury (T); brachyury gene; T gene; T protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2180) AUTHORS Edwards,Y. TITLE Direct Submission JOURNAL Submitted (29-SEP-1997) Edwards Y., Dept of Biology, University College London, MRC Human Biochemical Genetics Unit, Wolfson House, 4 Stephenson Way, London, NW1 2HE, UNITED KINGDOM REMARK Revised by author 09-OCT-97 REFERENCE 2 (bases 1 to 2180) AUTHORS Edwards,Y.H., Putt,W., Lekoape,K.M., Stott,D., Fox,M., Hopkinson,D.A. and Sowden,J. TITLE The human homolog T of the mouse T(Brachyury) gene; gene structure, cDNA sequence, and assignment to chromosome 6q27 JOURNAL Genome Res. 6 (3), 226-233 (1996) MEDLINE 96402060 FEATURES Location/Qualifiers source 1..2180 /organism="Homo sapiens" /db_xref="taxon:9606" gene 160..1467 /gene="T (Brachyury)" CDS 160..1467 /gene="T (Brachyury)" /codon_start=1 /product="Brachyury (T) protein" /db_xref="PID:e354228" /db_xref="PID:g2558581" /translation="MSSPGTESAGKSLQYRVDHLLSAVENELQAGSEKGDPTERELRV GLEESELWLRFKELTNEMIVTKNGRRMFPVLKVNVSGLDPNAMYSFLLDFVAADNHRW KYVNGEWVPGGKPEPQAPSCVYIHPDSPNFGAHWMKAPVSFSKVKLTNKLNGGGQIML NSLHKYEPRIHIVRVGGPQRMITSHCFPETQFIAVTAYQNEEITALKIKYNPFAKAFL DAKERSDHKEMMEEPGDSQQPGYSQWGWLLPGTSTLCPPANPHPQFGGALSLPSTHSC DRYPTLRSHRSSPYPSPYAHRNNSPTYSDNSPACLSMLQSHDNWSSLGMPAHPSMLPV SHNASPPTSSSQYPSLWSVSNGAVTPGSQAAAVSNGLGAQFFRGSPAHYTPLTHPVSA PSSSGSPLYEGAAAATDIVDSQYDAAAQGRLIASWTPVSPPSM" BASE COUNT 509 a 628 c 581 g 462 t ORIGIN 1 tcccgggtct gtgccgggac ccgggacccg ggagccgtcg caggtctcgg tccaaggggc 61 cccttttctc ggaagggcgg cggccaagag cagggaaggt ggatctcagg tagcgagtct 121 gggcttcggg gacggcgggg aggggagccg gacgggagga tgagctcccc tggcaccgag 181 agcgcgggaa agagcctgca gtaccgagtg gaccacctgc tgagcgccgt ggagaatgag 241 ctgcaggcgg gcagcgagaa gggcgacccc acagagcgcg aactgcgcgt gggcctggag 301 gagagcgagc tgtggctgcg cttcaaggag ctcaccaatg agatgatcgt gaccaagaac 361 ggcaggagga tgtttccggt gctgaaggtg aacgtgtctg gcctggaccc caacgccatg 421 tactccttcc tgctggactt cgtggcggcg gacaaccacc gctggaagta cgtgaacggg 481 gaatgggtgc cggggggcaa gccggagccg caggcgccca gctgcgtcta catccacccc 541 gactcgccca acttcggggc ccactggatg aaggctcccg tctccttcag caaagtcaag 601 ctcaccaaca agctcaacgg agggggccag atcatgctga actccttgca taagtatgag 661 cctcgaatcc acatagtgag agttgggggt ccacagcgca tgatcaccag ccactgcttc 721 cctgagaccc agttcatagc ggtgactgct tatcagaacg aggagatcac agctcttaaa 781 attaagtaca atccatttgc aaaagctttc cttgatgcaa aggaaagaag tgatcacaaa 841 gagatgatgg aggaacccgg agacagccag caacctgggt actcccaatg ggggtggctt 901 cttcctggaa ccagcaccct gtgtccacct gcaaatcctc atcctcagtt tggaggtgcc 961 ctctccctcc cctccacgca cagctgtgac aggtacccaa ccctgaggag ccaccggtcc 1021 tcaccctacc ccagccccta tgctcatcgg aacaattctc caacctattc tgacaactca 1081 cctgcatgtt tatccatgct gcaatcccat gacaattggt ccagccttgg aatgcctgcc 1141 catcccagca tgctccccgt gagccacaat gccagcccac ctaccagctc cagtcagtac 1201 cccagcctgt ggtctgtgag caacggcgcc gtcaccccgg gctcccaggc agcagccgtg 1261 tccaacgggc tgggggccca gttcttccgg ggctcccccg cgcactacac acccctcacc 1321 catccggtct cggcgccctc ttcctcggga tccccactgt acgaaggggc ggccgcggcc 1381 acagacatcg tggacagcca gtacgacgcc gcagcccaag gccgcctcat agcctcatgg 1441 acacctgtgt cgccaccttc catgtgaagc agcaaggccc aggtcccgaa agatgcagtg 1501 actttttgtc gtggcagcca gtggtgactg gattgaccta ctaggtaccc agtggcagtc 1561 tcaggttaag aaggaaatgc agcctcagta acttcctttt caaagcagtg gaggagcaca 1621 cggacctttc cccagagccc ccagcatccc ttgctcacac ctgcagtagc ggtgctgtcc 1681 aggtggctta cagatgaacc caactgtgga gatgatgcag ttggcccaac ctcactgacg 1741 gtgaaaaaat gtttgccagg gtccagaaac tttttttggt ttatttctca tacagtgtat 1801 tggcaacttt ggcacaccag aatttgtaaa ctccaccagt cctactttag tgagataaaa 1861 agcacactct taatcttctt ccttgttgct ttcaagtagt tagagttgag ctgttaagga 1921 cagaataaaa tcatagttga ggacagcagg ttttagttga attgaaaatt tgactgctct 1981 gccccctaga atgtgtgtat tttaagcata tgtagctaat ctcttgtgtt aaactataac 2041 tgtttcatat ttttcttttg acaaagtagc caaagacaat cagcagaaag cattttctgc 2101 aaaataaacg caatatgcaa aatgtgattc gtccagttat tagtgaagcc cctccttttg 2161 tgagtattta ctgtttattg // LOCUS HSBRCOX 2225 bp RNA PRI 13-JAN-1997 DEFINITION H.sapiens mRNA for Branched chain Acyl-CoA Oxidase. ACCESSION X95190 NID g1780990 KEYWORDS acyl-CoA oxidase; BRCOX gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2225) AUTHORS Van Veldhoven,P.P. TITLE Direct Submission JOURNAL Submitted (17-JAN-1996) P.P. Van Veldhoven, K.U. Leuven, Farmakologie, Campus Gasthuisberg (O&N), Herestraat 49, B-3000 Leuven, BELGIUM REFERENCE 2 (bases 1 to 2225) AUTHORS Baumgart,E., Vanhooren,J.C., Fransen,M., Marynen,P., Puype,M., Vandekerckhove,J., Leunissen,J.A., Fahimi,H.D., Mannaerts,G.P. and Veldhoven,P.P. TITLE Molecular characterization of the human peroxisomal branched-chain acyl-CoA oxidase: cDNA cloning, chromosomal assignment, tissue distribution, and evidence for the absence of the protein in Zellweger syndrome JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (24), 13748-13753 (1996) MEDLINE 97098466 FEATURES Location/Qualifiers source 1..2225 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /clone_lib="lambda UNI-ZAP" /chromosome="3" /map="q14.3" gene 93..2138 /gene="BRCOX" CDS 93..2138 /gene="BRCOX" /codon_start=1 /product="branched chain acyl-CoA oxidase" /db_xref="PID:e225440" /db_xref="PID:g1780991" /translation="MGSPVHRVSLGDTWSRQMHPDIESERYMQSFDVERLTNILDGGA QNTALRRKVESIIHSYPEFSCKDNYFMTQNERYKAAMRRAFHIRLIARRLGWLEDGRE LGYAYRALSGDVALNIHRVFVRALRSLGSEEQIAKWDPLCKNIQIIATYAQTELGHGT YLQGLETEATYDAATQEFVIHSPTLTATKWWPGDLGRSATHALVQAQLICSGARRGMH AFIVPIRSLQDHTPLPGIIIGDIGPKMDFDQTDNGFLQLNHVRVPRENMLSRFAQVLP DGTYVKLGTAQSNYLPMVVVRVELLSGEILPILQKACVIAMRYSVIRRQSRLRPSDPE AKVLDYQTQQQKLFPQLAISYAFHFLAVSLLEFFQHSYTAILNQDFSFLPELHALSTG MKAMMSEFCTQGAEMCRRACGGHGYSKLSGLPSLVTKLSASCTYEGENTVLYLQVARF LVKSYLQTQMSPGSTPQRSLSPSVAYLTAPDLARCPAQRAADFLCPELYTTAWAHVAV RLIKDSVQHLQTLTQSGADQHEAWNQTTVIHLQAAKVHCYYVTVKGFTEALEKLENEP AIQQVLKRLCDLHAIHGILTNSGDFLHDAFLSGAQVDMARTAYLDLLRLIRKDAILLT DAFDFTDQCLNSALGCYDGNVYERLFQWAQKSPTNTQENPAYEEYIRPLLQSWRSKL" BASE COUNT 550 a 630 c 573 g 472 t ORIGIN 1 ggttctttgc actgaccaat gctgagagca gaccctcgga gcagccgggt tggaagtgtc 61 tctccacagt caccagacag atccaggata ggatgggcag cccagtgcac cgagtgtcat 121 tgggggatac ctggagcagg caaatgcacc ccgacataga gagcgagagg tatatgcagt 181 cctttgacgt ggaacggctc accaacatcc ttgatggagg tgcccagaac actgcactcc 241 gcaggaaagt tgagagcatc atccacagtt acccggagtt tagctgtaag gacaattatt 301 tcatgaccca gaatgagcgt tataaggctg ccatgcggag ggcattccac atccggttga 361 tagctcggcg cctgggttgg ttagaagatg gtcgtgaatt aggctacgct tacagagccc 421 tttctggaga cgtggcctta aatatacaca gagtcttcgt gagagccctc aggagcctgg 481 gctcagagga gcagattgcc aaatgggacc cactctgcaa aaacatccag atcatcgcaa 541 cgtatgcaca gacagagttg ggacatggga catatcttca gggcctggag actgaagcca 601 cctatgacgc agccacccag gagtttgtga tacacagccc cacgctgact gccaccaaat 661 ggtggcctgg agacttggga cggtcagcca cccatgccct ggtccaggcc cagctgatct 721 gctcaggagc caggcggggc atgcacgctt ttattgtgcc aatccggagt cttcaggacc 781 acaccccact gccaggaatc atcattgggg acatcggacc caagatggac tttgatcaaa 841 cagacaatgg cttcctgcag ctgaaccatg tgcgggtccc cagggagaac atgctgagtc 901 gctttgcaca ggtcttgcca gatggcacct acgtcaaact cggtacagca cagagcaact 961 accttcccat ggtggtggtg cgggtggagc tgctgtcagg ggagatcctc cctatactgc 1021 agaaggcctg tgtcatcgcc atgcgctact cggtcatccg ccgccaatcc cggctccggc 1081 ccagtgaccc agaggcaaag gtcctggact accagacaca acagcagaaa ctctttcctc 1141 agctggccat cagttatgcc ttccatttcc tggcagtcag cctcttggag ttcttccagc 1201 actcctacac tgccattctg aaccaagact tcagcttcct gcctgagctc cacgcgctga 1261 gcacgggcat gaaggccatg atgtcagaat tctgcaccca gggagctgag atgtgccgca 1321 gggcctgtgg cggacatggc tactcaaagc tgagtggcct gccatcactg gtcaccaaat 1381 tgtcggcctc ctgcacctac gagggtgaga acacagtgct ctacctgcag gtggccaggt 1441 tcctggtgaa gagctacctg cagactcaga tgtcccctgg ctccacgcca cagagatctc 1501 tctctccatc tgtcgcatat ctcaccgcac ctgacctggc caggtgtcca gcccagaggg 1561 cagccgactt cctctgcccg gagctctaca ccacggcctg ggcacatgtg gcagtaaggc 1621 tcataaagga ctcagtgcag catttacaga ccctgacgca atccggagct gaccagcacg 1681 aggcttggaa ccagaccact gtcatacacc tccaggctgc taaggtgcac tgctactatg 1741 tcactgtgaa gggttttaca gaagctctgg agaaactaga aaatgaacca gcgattcagc 1801 aggtgctcaa gcgcctctgt gacctccatg ccatacatgg aatcttgact aactcgggtg 1861 actttctcca tgacgccttc ctgtctggtg cccaagtgga catggcaaga acagcctacc 1921 tggacctgct ccgcctgatc cggaaggatg ccatcctgtt aactgatgct tttgacttca 1981 ccgatcagtg tttaaattca gcccttggct gttatgatgg aaacgtctac gaacgcctgt 2041 tccagtgggc tcagaagtca ccaaccaata ctcaggagaa ccctgcctat gaggaatata 2101 taagaccact tttacaaagt tggagatcca agctatgaaa taaccaacag tattcaagaa 2161 gcaaccagca ccatcatgtg ataatggtac tatggcatat atgcaacatt aaaattttaa 2221 attag // LOCUS HSBRK 2507 bp RNA PRI 19-JUL-1994 DEFINITION H.sapiens brk mRNA for tyrosine kinase. ACCESSION X78549 NID g515025 KEYWORDS brk gene; Tyrosine kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2507) AUTHORS Mitchell,P.J., Barker,K.T., Martindale,J.E., Kamalati,T., Lowe,P.N., Page,M.J., Gusterson,B.A. and Crompton,M.R. TITLE Cloning and characterisation of cDNAs encoding a novel non-receptor tyrosine kinase, brk, expressed in human breast tumours JOURNAL Oncogene 9 (8), 2383-2390 (1994) MEDLINE 94309916 REFERENCE 2 (bases 1 to 2507) AUTHORS Mitchell,P.J. TITLE Direct Submission JOURNAL Submitted (28-MAR-1994) P.J. Mitchell, Institute of Cancer Research, Haddon Labs, 15 Cotswold Road, Sutton, Surrey, SM2 5NG, UK FEATURES Location/Qualifiers source 1..2507 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="breast carcinoma" /cell_type="epithelial like" /cell_line="T-47D" /clone_lib="lambda Zap" /clone="lambda T2" gene 29..1384 /gene="brk" CDS 29..1384 /gene="brk" /codon_start=1 /product="tyrosine kinase" /db_xref="PID:g515026" /translation="MVSRDQAHLGPKYVGLWDFKSRTDEELSFRAGDVFHVARKEEQW WWATLLDEAGGAVAQGYVPHNYLAERETVESEPWFFGCISRSEAVRRLQAEGNATGAF LIRVSEKPSADYVLSVRDTQAVRHYKIWRRAGGRLHLNEAVSFLSLPELVNYHRAQSL SHGLRLAAPCRKHEPEPLPHWDDWERPREEFTLCRKLGSGYFGEVFEGLWKDRVQVAI KVISRDNLLHQQMLQSEIQAMKKLRHKHILALYAVVSVGDPVYIITELMAKGSLLELL RDSDEKVLPVSELLDIAWQVAEGMCYLESQNYIHRDLAARNILVGENTLCKVGDFGLA RLIKEDVYLSHDHNIPYKWTAPEALSRGHYSTKSDVWSFGILLHEMFSRGQVPYPGMS NHEAFLRVDAGYRMPCPLECPPSVHKLMLTCWCRDPEQRPCFKALRERLSSFTSYENP T" BASE COUNT 458 a 726 c 809 g 514 t ORIGIN 1 cctggtcctg ccgctgcgcc cgcccgccat ggtgtcccgg gaccaggctc acctgggccc 61 caagtatgtg ggcctctggg acttcaagtc ccggacggac gaggagctga gcttccgcgc 121 gggggacgtc ttccacgtgg ccaggaagga ggagcagtgg tggtgggcca cgctgctgga 181 cgaggcgggt ggggccgtgg cccagggcta tgtgccccac aactacctgg ccgagaggga 241 gacggtggag tcggaaccgt ggttctttgg ctgcatctcc cgctcggaag ctgtgcgtcg 301 gctgcaggcc gagggcaacg ccacgggcgc cttcctgatc agggtcagcg agaagccgag 361 tgccgactac gtcctgtcgg tgcgggacac gcaggctgtg cggcactaca agatctggcg 421 gcgtgccggg ggccggctgc acctgaacga ggcggtgtcc ttcctcagcc tgcccgagct 481 tgtgaactac cacagggccc agagcctgtc ccacggcctg cggctggccg cgccctgccg 541 gaagcacgag cctgagcccc tgccccattg ggatgactgg gagaggccga gggaggagtt 601 cacgctctgc aggaagctgg ggtccggcta ctttggggag gtcttcgagg ggctctggaa 661 agaccgggtc caggtggcca ttaaggtgat ttctcgagac aacctcctgc accagcagat 721 gctgcagtcg gagatccagg ccatgaagaa gctgcggcac aaacacatcc tggcgctgta 781 cgccgtggtg tccgtggggg accccgtgta catcatcacg gagctcatgg ccaagggcag 841 cctgctggag ctgctccgcg actctgatga gaaagtcctg cccgtttcgg agctgctgga 901 catcgcctgg caggtggctg agggcatgtg ttacctggag tcgcagaatt acatccaccg 961 ggacctggcc gccaggaaca tcctcgtcgg ggaaaacacc ctctgcaaag ttggggactt 1021 cgggttagcc aggcttatca aggaggacgt ctacctctcc catgaccaca atatccccta 1081 caagtggacg gcccctgaag cgctctcccg aggccattac tccaccaaat ccgacgtctg 1141 gtcctttggg attctcctgc atgagatgtt cagcaggggt caggtgccct acccaggcat 1201 gtccaaccat gaggccttcc tgagggtgga cgccggctac cgcatgccct gccctctgga 1261 gtgcccgccc agcgtgcaca agctgatgct gacatgctgg tgcagggacc ccgagcagag 1321 accctgcttc aaggccctgc gggagaggct ctccagcttc accagctacg agaacccgac 1381 ctgagctgct gtggagcggg catggccggg ccctgctgag gaggggcctg ggcagagggc 1441 ctggacctgg gatcaaggcc cacgcgcttc cctggggttt actgaggtga tgggtgcagg 1501 aaaggttcac aaatgtggag tgtctgcgtc caatacacgc gtgtgctcct ctccttactc 1561 catcgtgtgt gccttgggtc tcagctgctg acacgcagcc tgctctggag cctgcagatg 1621 agatccggga gactgacacg aagccagcag aggtcagagg ggactctgac cacagcccgc 1681 tctctggctg tctgtctgca gtgcccggct gagggtggga ggcaaacacg ccttgttcct 1741 gctcttccca gttcagcttg gtgggagaaa gtcattcgcg tggctcggga cgctcatgta 1801 aatttggttt tggtgctcaa gggttctttc ctcccagggg caggtgtttc tttcctgttt 1861 gtcttgtgtc ttgagagctt ggccttatga ccagtgagaa ctctctccct ggtctctgcc 1921 agcccaagca tcactgcccg aggcgccagc tcagtttcac cgtccacgtc cacaaggggc 1981 ttttcccacc ttcacctttg tcgctgggtc agtgctggaa agcgcccctc actcctgcgc 2041 tgacaagggc ccttctctac tgtctgtggg gtggttccgg gctggggggg ctgcctcctt 2101 tgcacctgat tttgaaggtg tctctttcat ccatggttaa gtcataaaaa gcttattggt 2161 tttggttttg actcacctga aagttttttt ggtttaaaag aagaataggc ggggcacggt 2221 ggctcgtgcc tgtaatccca gcactttggg aggctgaggc aggtggatca cgaggtcagg 2281 agatcgacac catcctggct aacacggtga agccccgtct ctactaaaaa atacaaaaaa 2341 ttagctgggt gtggtggtgg gggtgggcgc ctgtggtccc agctacgtgg gaggctgagg 2401 cagcagactg gtgtgaaccc gggaggtgga gcttgcagtg agccgaggtc gcgccactgc 2461 actccagcct gggcgacaga gcgagactcc atctcaaaaa aaaaaaa // LOCUS HSBTF2P35 912 bp RNA PRI 23-OCT-1995 DEFINITION H.sapiens mRNA for basic transcription factor 2, 34 kD subunit. ACCESSION Z30093 NID g1039317 KEYWORDS basic transcription factor; BTF2 protein; transcription factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 909) AUTHORS Humbert,S., van Vuuren,H.A., Lutz,Y., Hoeiijmakers,J.J., Egly,J. and Moncollin,V. TITLE p44 and p34 subunits of the BTF2/TFIIH transcription factor have homologies with SSL1 a yeast protein involved in DNA repair JOURNAL EMBO J. (1994) In press REFERENCE 2 (bases 1 to 912) AUTHORS Moncollin,V. TITLE Direct Submission JOURNAL Submitted (15-FEB-1994) Vincent Moncollin, INSERM U-184/CNRS UPR 6520, Laboratoire de, Genetique Moleculaire des Eucaryotes, 11 rue Humann, Strasbourg, 67085, France REMARK revised by [3] REFERENCE 3 (bases 1 to 912) AUTHORS Moncollin,V. TITLE Direct Submission JOURNAL Submitted (23-OCT-1995) Vincent Moncollin, INSERM U-184/CNRS UPR 6520, Laboratoire de, Genetique Moleculaire des Eucaryotes, 11 rue Humann, Strasbourg, 67085, France FEATURES Location/Qualifiers source 1..912 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 1..912 /codon_start=1 /product="basic transcription factor 2, 35 kD subunit" /db_xref="PID:g1039318" /translation="MVSDEDELNLLVIVVDANPIWWGKQALKESQFTLSKCIDAVMVL GNSHLFMNRSNKLAVIASHIQESRFLYPGKNGRLGDFFGDPGNPPEFNPSGSKDGKYE LLTSANEVIVEEIKDLMTKSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSR ILVIKAAEDSALQYMNFMNVIFAAQKQNILIDACVLDSDSGLLQQACDITGGLYLKVP QMPSLLQYLLWVFLPDQDQRSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFC NFSPICTTCETAFKISLPPVLKAKKRN" BASE COUNT 267 a 186 c 199 g 260 t ORIGIN 1 atggtttcag acgaagatga attgaatctt ctggttattg tagttgatgc caacccaatt 61 tggtggggaa agcaagcatt aaaggaatct cagttcactt tatccaaatg catagatgcc 121 gtgatggtgc tgggaaattc gcatttattc atgaatcgtt ccaacaaact tgctgtgata 181 gcaagtcaca ttcaagaaag ccgattctta tatcctggaa agaatggcag acttggagac 241 ttcttcggag accctggcaa ccctcctgaa tttaatccct ctgggagtaa agatggaaaa 301 tacgaacttt taacctcagc aaatgaagtt attgttgaag agattaaaga tctaatgacc 361 aaaagtgaca taaagggtca acatacagaa actttgctgg caggatccct ggccaaagcc 421 ctttgctaca ttcatagaat gaacaaggaa gttaaagaca atcaggaaat gaaatcaagg 481 atattggtga ttaaggctgc agaagacagt gcgttgcagt atatgaactt catgaatgtc 541 atctttgcag cacagaaaca gaatattttg attgatgcct gtgttttaga ctccgactca 601 gggctcctcc aacaggcttg tgacatcacg ggaggactgt acctgaaggt gcctcagatg 661 ccttctcttc tgcagtattt gctgtgggtg tttcttcccg atcaagatca gagatctcag 721 ttaatcctcc cacccccggt tcatgttgac tacagggctg cttgcttctg tcatcgaaat 781 ctcattgaaa ttggttatgt ctgttctgtg tgtttgtcaa tattctgcaa tttcagcccc 841 atttgtacta cgtgcgagac agcctttaaa atttctctgc ctccagtgct gaaagccaag 901 aaaagaaact ga // LOCUS HSBTG1 1783 bp RNA PRI 18-NOV-1993 DEFINITION Human BTG1 mRNA. ACCESSION X61123 S38424 S38425 NID g29508 KEYWORDS BTG1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1783) AUTHORS Rouault,J.P. TITLE Direct Submission JOURNAL Submitted (30-JUL-1991) J.P. Rouault, Hopital Edouard Herriot, Pavillon E 2eme etage, place d'Arsonval 69003 Lyon, FRANCE REFERENCE 2 (bases 1 to 1783) AUTHORS Rouault,J.P., Rimokh,R., Tessa,C., Paranhos,G., Ffrench,M., Duret,L., Garoccio,M., Germain,D., Samarut,J. and Magaud,J.P. TITLE BTG1, a member of a new family of antiproliferative genes JOURNAL EMBO J. 11 (4), 1663-1670 (1992) MEDLINE 92224907 FEATURES Location/Qualifiers source 1..1783 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lymphoblastoid" /cell_line="UD53" /chromosome="12" /map="12q22" mRNA 1..1783 /gene="BTG1" /evidence=experimental gene 1..1783 /gene="BTG1" CDS 309..824 /gene="BTG1" /codon_start=1 /db_xref="PID:g29509" /db_xref="SWISS-PROT:P31607" /translation="MHPFYTRAATMIGEIAAAVSFISKFLRTKGLTSERQLQTFSQSL QELLAEHYKHHWFPEKPCKGSGYRCIRINHKMDPLIGQAAQRIGLSSQELFRLLPSEL TLWVDPYEVSYRIGEDGSICVLYEASPAGGSTQNSTNVQMVDSRISCKEELLLGRTSP SKNYNMMTVSG" polyA_signal 1751..1756 /gene="BTG1" BASE COUNT 476 a 401 c 404 g 502 t ORIGIN 1 cctctcggag ctggaaatgc agctattgag atcttcgaat gctgcggagc tggaggcgga 61 ggcagctggg gaggtccgag cgatgtgacc aggccgccat cgctcgtctc ttcctctctc 121 ctgccgcctc ctgtgtcgaa aataactttt ttagtctaaa gaaagaaaga caaaagtagt 181 cgtccgcccc tcacgccctc tcttcctctc agccttccgc ccggtgagga agcccggggt 241 ggctgctccg ccgtcggggc cgcgccgccg agccccagcg ccccgggccg cccccgcacg 301 ccgcccccat gcatcccttc tacacccggg ccgccaccat gataggcgag atcgccgccg 361 ccgtgtcctt catctccaag tttctccgca ccaaggggct gacgagcgag cgacagctgc 421 agaccttcag ccagagcctg caggagctgc tggcagaaca ttataaacat cactggttcc 481 cagaaaagcc atgcaaggga tcgggttacc gttgtattcg catcaaccat aaaatggatc 541 ctctgattgg acaggcagca cagcggattg gactgagcag tcaggagctg ttcaggcttc 601 tcccaagtga actcacactc tgggttgacc cctatgaagt gtcctacaga attggagagg 661 atggctccat ctgtgtgctg tatgaagcct caccagcagg aggtagcact caaaacagca 721 ccaacgtgca aatggtagac agccgaatca gctgtaagga ggaacttctc ttgggcagaa 781 cgagcccttc caaaaactac aatatgatga ctgtatcagg ttaagatata gtctgtggat 841 ggatcatctg atgatgatcc ataaatttga tttttgcttt gggtgggctc ctcttgggga 901 tggattatgg aatttaaacc atgtcacagc tgtgaagatc tggcacaaga tagaatggta 961 aaaaaaaaaa aaaattttaa gtgacagtgc catagtttgg acagtacctt tcaatgatta 1021 attttaatag cctgtgagtc caagtaaatg atcactttat ttgctaggga gggaagtcct 1081 agggtggttt cagtttctcc cagacatacc taaattttta catcaatcct tttaaagaaa 1141 atctgtattt caaagaatct ttctctgcag taaatctcgc aggggaattt gcactattac 1201 acttgaaagt tgttattgtt aaccttttcg gcagctttta ataggaaagt taaacgtttt 1261 aaacatggta gtactggaaa ttttacaaga cttttaccta gcacttaaat atgtataaat 1321 gtacataaag acaaactagt aagcatgacc tggggaaatg gtcagacctt gtattgtgtt 1381 tttggccttg aaagtagcaa gtgaccagaa tctgccatgg caacaggctt taaaaaagac 1441 ccttaaaaag acactgtctc aactgtggtg ttagcaccag ccagctctct gtacatttgc 1501 tagcttgtag ttttctaaga ctgagtaaac ttcttatttt tagaaagtgg aggtctggtt 1561 tgtaactttc cttgtactta attgggtaaa agtcttttcc acaaaccacc atctattttg 1621 tgaactttgt tagtcatctt ttatttggta aattatgaac tggtgtaaat ttgtacagtt 1681 catgtatatt gattgtggca aagttgtaca gatttctata ttttggatga gaaatttttc 1741 ttctctctat aataaatcgt ttcttatctt ggcattttta acc // LOCUS HSBVRGENE 1070 bp RNA PRI 12-APR-1996 DEFINITION H.sapiens mRNA for biliverdin IX alpha reductase. ACCESSION X93086 NID g1246748 KEYWORDS biliverdin IX alpha reductase; BVR gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1070) AUTHORS Maines,M.D., Polevoda,B.V., Huang,T.J. and McCoubrey,W.K. Jr. TITLE Human biliverdin IXalpha reductase is a zinc-metalloprotein. Characterization of purified and Escherichia coli expressed enzymes JOURNAL Eur. J. Biochem. 235 (1-2), 372-381 (1996) MEDLINE 96202961 REFERENCE 2 (bases 1 to 1070) AUTHORS McCoubrey,W.K. TITLE Direct Submission JOURNAL Submitted (14-NOV-1995) W.K. McCoubrey, University of Rochester, 575 Elmwood Avenue, Rochester, New York 14642, USA COMMENT Related sequence:- Maines, M.D. Arch. Biochem. Biophys. 300:320-326 (1995). FEATURES Location/Qualifiers source 1..1070 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="placental and kidney cDNA (Clontech)" /clone="hBVR-1" gene 61..951 /gene="BVR" CDS 61..951 /gene="BVR" /codon_start=1 /product="biliverdin IX alpha reductase" /db_xref="PID:e213259" /db_xref="PID:g1246749" /translation="MNAEPERKFGVVVVGVGRAGSVRMRDLRNPHPSSAFLNLIGFVS RRELGSIDGVQQISLEDALSSQEVEVAYICSESSSHEDYIRQFLNAGKHVLVEYPMTL SLAAAQELWELAEQKGKVLHEEHVELLMEEFAFLKKEVVGKDLLKGSLLFTSDPLEED RFGFPAFSGISRLTWLVSLFGELSLVSATLEERKEDQYMKMTVCLETEKKSPLSWIEE KGPGLKRNRYLSFHFKSGSLENVPNVGVNKNIFLKDQNIFVQKLLGQFSEKELAAEKK RILHCLGLAEEIQKYCCSRK" BASE COUNT 294 a 221 c 305 g 250 t ORIGIN 1 ggggtggcgc ccggagctgc acggagagcg tgcccgtcag tgaccgaaga agagaccaag 61 atgaatgcag agcccgagag gaagtttggc gtggtggtgg ttggtgttgg ccgagccggc 121 tccgtgcgga tgagggactt gcggaatcca cacccttcct cagcgttcct gaacctgatt 181 ggcttcgtgt cgagaaggga gctcgggagc attgatggag tccagcagat ttctttggag 241 gatgctcttt ccagccaaga ggtggaggtc gcctatatct gcagtgagag ctccagccat 301 gaggactaca tcaggcagtt ccttaatgct ggcaagcacg tccttgtgga ataccccatg 361 acactgtcat tggcggccgc tcaggaactg tgggagctgg ctgagcagaa aggaaaagtc 421 ttgcacgagg agcatgttga actcttgatg gaggaattcg ctttcctgaa aaaagaagtg 481 gtggggaaag acctgctgaa agggtcgctc ctcttcacat ctgacccgtt ggaagaagac 541 cggtttggct tccctgcatt cagcggcatc tctcgactga cctggctggt ctccctcttt 601 ggggagcttt ctcttgtgtc tgccactttg gaagagcgaa aggaagatca gtatatgaaa 661 atgacagtgt gtctggagac agagaagaaa agtccactgt catggattga agaaaaagga 721 cctggtctaa aacgaaacag atatttaagc ttccatttca agtctgggtc cttggagaat 781 gtgccaaatg taggagtgaa taagaacata tttctgaaag atcaaaatat atttgtccag 841 aaactcttgg gccagttctc tgagaaggaa ctggctgctg aaaagaaacg catcctgcac 901 tgcctggggc ttgcagaaga aatccagaaa tattgctgtt caaggaagta agaggaggag 961 gtgatgtagc acttccaaga tggcaccagc atttggttct tctcaagagt tgaccattat 1021 ctctattctt aaaattaaac atgttgggga aacaaaaaaa aaaaaaaaaa // LOCUS HSC1DPROT 1172 bp RNA PRI 13-FEB-1996 DEFINITION H.sapiens mRNA for C1D protein. ACCESSION X95592 NID g1185118 KEYWORDS C1D gene; C1D protein; DNA-binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1172) AUTHORS Keck,T., Nehls,P. and Werner,D. JOURNAL Unpublished REFERENCE 2 (bases 1 to 1172) AUTHORS Werner,D. TITLE Direct Submission JOURNAL Submitted (08-FEB-1996) D. Werner, Dt. Krebsforschungszentrum, Biochemistry of the Cell, Im Neuenheimer Feld 280-0225, D-69120 Heidelberg, FRG FEATURES Location/Qualifiers source 1..1172 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="term placenta" /clone_lib="lambda gt10 (OG1)" /clone="C1D-4" gene 118..543 /gene="C1D" CDS 118..543 /gene="C1D" /note="DNA-binding protein" /codon_start=1 /product="C1D protein" /db_xref="PID:e222184" /db_xref="PID:g1185119" /translation="MAGEEINEDYPVEIHEYLSAFENSIGAVDEMLKTMMSVSRNELL QKLDPLEQAKVDLVSAYTLNSMFWVYLATQGVNPKEHPVKQELERIRVYMNRVKEITD KKKAGKLDRGAASRFVKNALWEPKSKNASKVANKGKSKS" polyA_signal 1050..1055 BASE COUNT 403 a 155 c 237 g 377 t ORIGIN 1 ctttccggga gactggagtc gaaggccgtg agtattttct aagccagtgt ttagagagta 61 tgtgaggcaa gagtacctat agaacccgga ggagggtgag gagcagagct ggccataatg 121 gcaggtgaag aaattaatga agactatcca gtagaaattc acgagtattt gtcagcgttt 181 gagaattcca ttggtgctgt ggatgagatg ctgaagacca tgatgtctgt ttctagaaat 241 gagttgttgc agaagttgga tccacttgaa caagcaaaag tggatttggt ttctgcatac 301 acattaaatt caatgttttg ggtttatttg gcaacccaag gagttaatcc taaggaacat 361 ccagtaaaac aggaattgga aagaatcaga gtatatatga acagagtcaa ggaaataaca 421 gacaagaaaa aggctggcaa gctggacaga ggtgcagctt caagatttgt aaaaaatgcc 481 ctctgggaac caaaatcgaa aaatgcatca aaagttgcca ataaaggaaa aagtaaaagt 541 taactttttg gttttgatgt acacatattc aaaaagtaca ttaatatgta atcacagtaa 601 tatgtaaagc taaatacttc ctctccaaag atcattatct ttattgatta gcactgagga 661 ttttaacatt gtgatatatt atatatttat aatttaccat ctcttgatga gactcttatt 721 tctttatata ggtcagtctt gcaagtacca ttttataagc agctgtgaaa tttaagtgaa 781 atgttctttg taaacatttg tactatttta aatgaataat gaccttatga agtatgctat 841 ctgtaggctg aaattatagg tacatctgtt ttcactatat gatattaaga aagcgtgaaa 901 tgacttaaat gttcattttt ttctgtatag atactttatc atgttttcat gattttagga 961 attactgctt tgttgatatt caaagtgtga aactaaaagt ttatggttgt actttaattc 1021 ttggcatgtt gcctctatgt cccatttaaa ataaaataca ttctcattaa ctttagatgg 1081 gaaataaggt tgtatgttga tggatgaatt ttggcatgat gactgtactc tcaataaagg 1141 ctgaaaatgt tgtaaaaaaa aaaaaaaaaa aa // LOCUS HSC2PI3K 7654 bp mRNA PRI 20-JAN-1998 DEFINITION H.sapiens mRNA for phosphoinositide 3-kinase. ACCESSION Y11312 NID g2808446 KEYWORDS C2 domain; phosphoinositide 3-kinase; PI3K gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7654) AUTHORS Brown,R.A., Ho,L.K., Weber-Hall,S.J., Shipley,J.M. and Fry,M.J. TITLE Identification and cDNA cloning of a novel mammalian C2 domain-containing phosphoinositide 3-kinase, HsC2-PI3K JOURNAL Biochem. Biophys. Res. Commun. 233 (2), 537-544 (1997) MEDLINE 97289668 REFERENCE 2 (bases 1 to 7654) AUTHORS Brown,R.A. TITLE Direct Submission JOURNAL Submitted (17-FEB-1997) R.A. Brown, Institute Of Cancer Research, Cell Biology And Experimental Pathology, 15 Cotswold Road, Sutton, Surrey, SM2 5NG, UK REMARK revised by [3] REFERENCE 3 (bases 1 to 7654) AUTHORS Brown,R.A. TITLE Direct Submission JOURNAL Submitted (19-JAN-1998) R.A. Brown, Institute Of Cancer Research, Cell Biology And Experimental Pathology, 15 Cotswold Road, Sutton, Surrey, SM2 5NG, UK FEATURES Location/Qualifiers source 1..7654 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /cell_line="MCF-7" /map="q32" 5'UTR 1..515 /gene="C2-PI3K" gene 1..7654 /gene="C2-PI3K" CDS 516..5420 /gene="C2-PI3K" /EC_number="2.7.1.137" /codon_start=1 /evidence=not_experimental /product="phosphoinositide 3-kinase" /db_xref="PID:e311430" /db_xref="PID:g2076604" /translation="MSSTQDNGEHWKSLESVGISRKELAMAEALQMEYDALSRLRHDK EENRAKQNADPSLISWDEPGVDFYSKPAGRRTDLKLLRGLSGSDPTLNYNSLSPQEGP PNHSTSQGPQPGSDPWPKGSLSGDYLYIFDGSDGGVSSSPGPGDIEGSCKKLSPPPLP PRASIWDTPPLPPRKGSPSSSKISQPSDINTFSLVEQLPGKLLEHRILEEEEVLGGGG QGRLLGSVDYDGINDAITRLNLKSTYDVEMLRDATRGWKEGRGPLDFSKDTSGKPVAR SKTMPPQVPPRTYASRYGNRKNATPGKNRRISAAPVGSRPHTVANGHELFEVSEERDE EVAAFCHMLDILRSGSDIQDYFLTGYVWSAVTPSPEHLGDEVNLKVTVLCDRLQEALT FTCNCSSTVDLLIYQTLCYTHDDLRNVDVGDFVLKPCGLEEFLQNKHALGSHEYIQYC RKFDIDIRLQLMEQKVVRSDLARTVNDDQSPSTLNYLVHLQERPVKQTISRQALSLLF DTYHNEVDAFLLADGDFPLKADRVVQSVKAICNALAAVETPEITSALNQLPPCPSRMQ PKIQKDPSVLAVRENREKVVEALTAAILDLVELYCNTFNADFQTAVPGSRKHDLVQEA CHFARSLAFTVYATHRIPIIWATSYEDFYLSCSLSHGGKDMCSPLQTRRAHFSKYLFH LIVWDQQICFPVQVNRLPRETLLCATLYALPIPPPGSSSEANKQRRVPEALGWVTTPL FNFRQVLTCGRKLLGLWPATQENPSARWSAPNFHQPDSVILQIDFPTSAFDIKFTSPP GDKFSPRYEFGSLREEDQRKLKDIMQKESLYWLTDADKKRLWEKRYYCHSEVSSLPLV LASAPSWEWACLPDIYVLLKQWTHMNHQDALGLLHATFPDQEVRRMAVQWIGSLSDAE LLDYLPQLVQALKYECYLDSPLVRFLLKRAVSDLRVTHYFFWLLKDGLKDSQFSIRYQ YLLAALLCCCGKGLREEFNRQCWLVNALAKLAQQVREAAPSARQGILRTGLEEVKQFF ALNGSCRLPLSPSLLVKGIVPRDCSYFNSNAVPLKLSFQNVDPLGENIRVIFKCGDDL RQDMLTLQMIRIMSKIWVQEGLDMRMVIFRCFSTGRGRGMVEMIPNAETLRKIQVEHG VTGSFKDRPLADWLQKHNPGEDEYEKAVENFIYSCAGCCVATYVLGICDRHNDNIMLK TTGHMFHIDFGRFLGHAQMFGNIKRDRAPFVFTSDMAYVINGGDKPSSRFHDFVDLCC QAYNLIRKHTHLFLNLLGLMLSCGIPELSDLEDLKYVYDALRPQDTEANATTYFTRLI ESSLGSVATKLNFFIHNLAQMKFTGSDDRLTLSFASRTHTLKSSGRISDVFLCRHEKI FHPNKGYIYVVKVMRENTHEATYIQRTFEEFQELHNKLRLLFPSSHLPSFPSRFVIGR SRGEAVAERRREELNGYIWHLIHAPPEVAECDLVYTFFHPLPRDEKAMGTSPAPKSSD GTWARPVGKVGGEVKLSISYKNNKLFIMVMHIRGLQLLQDGNDPDPYVKIYLLPDPQK TTKRKTKVARKTCNPTYNEMLVYDGIPKGDLQQRELQLSVLSEQGFWENVLLGEVNIR LRELDLAQEKTGWFALGSRSHGTL" 3'UTR 5421..7654 /gene="C2-PI3K" BASE COUNT 1693 a 2102 c 2031 g 1828 t ORIGIN 1 actcactata gggctcgagc ggccgcccgg gcaggtaaga atcagaagac atttgtgctt 61 tggggagcag aggccctcag ggtatagaga aggaagaaga gagaggttca ctgtagtcct 121 gaagcagaaa taagacctgt ggctgaagga agccttagca attcactcct tcctcttcct 181 gagaactctc tgtaggaagt ctcacctagc agaggcttca cagtatttca gagaagccaa 241 agattgtttg cctctttgga aactgttatc cttccatcat gactgtgtca ctcctgccac 301 tgttccacca tagagatggc gtcctttgca gcaaaccgta agttataagg atgagggaag 361 aagagtagag ggccaaaagg attccatttt gaggaaaaac tacagtttgc cttgccaggt 421 agaagaatca ggcgcccaga caccatgtca caaccctcca gaactgacgt tggcaggaag 481 tagagacttt gttgcctgtg tcccccatcc tcaccatgtc ttcgactcag gacaatgggg 541 aacactggaa gtccctggag tctgtgggca tcagccgcaa agaactagcg atggccgaag 601 ccctgcagat ggagtatgat gccctgtccc ggctccggca tgacaaggag gagaacagag 661 ccaagcagaa cgcagacccc tctctcatca gctgggatga gcctggggta gacttttaca 721 gcaagccagc aggaaggcgg accgacctca agctgttacg cggtctctct ggctctgatc 781 ctacccttaa ctacaactca ctatccccac aggaagggcc gcccaaccac tctacctccc 841 aagggccaca gcctggctca gatccctggc ccaaaggctc cctgtctgga gactatctct 901 acatttttga tggttcagat gggggagtct cttcgtcccc aggaccaggg gacatagagg 961 gctcttgcaa gaaactatcc ccacctcctc tgcctccccg agcttctatc tgggataccc 1021 ctcccctgcc tcccagaaag gggtccccct catcctccaa gatctcccag cccagtgaca 1081 tcaacacttt ctctttggtc gaacaattgc caggcaaact gctagagcat cggatcctag 1141 aagaggaaga ggtgctggga ggtgggggtc aggggcgcct actggggtct gtggactatg 1201 atggtatcaa tgatgcaatt actaggctca acttgaaatc gacctatgat gtggagatgt 1261 tgcgggatgc caccaggggc tggaaggagg gccgagggcc gctggacttc agcaaagaca 1321 cctctggaaa acccgtggcc aggagcaaga ctatgccccc tcaggtgccc ccccgcacct 1381 atgcctcccg ctatggcaac cgaaagaatg cgacgcctgg caagaaccgc cggatttctg 1441 cagccccggt gggctcccgg ccccacactg ttgccaatgg ccatgagttg tttgaggtct 1501 cagaagagag agatgaggag gttgctgcat tttgccacat gctggatatc cttcgatctg 1561 gctctgacat ccaagactac ttcctcactg gctatgtctg gagtgctgtc acccctagcc 1621 cagagcacct cggggatgag gtcaacctga aggtgactgt gttgtgtgac aggcttcaag 1681 aggcactcac tttcacctgc aactgttcct ccactgtaga cttgcttatc taccagaccc 1741 tgtgctacac ccatgatgac ctgaggaatg tggacgtggg tgactttgtg ctaaagccct 1801 gcgggctgga ggagttcctg cagaacaagc atgccttggg cagtcatgag tacatccaat 1861 actgccgcaa gtttgacatt gacattcggc tacagctgat ggagcagaag gttgtgcgca 1921 gtgacctggc ccggacggtg aatgatgacc agagcccctc caccttgaac tacctcgtcc 1981 atctccaaga gaggcctgtc aagcagacca tcagcaggca ggccctgagt cttctgttcg 2041 acacttacca caatgaggtg gatgccttcc tgctggctga tggagacttc ccactgaagg 2101 ctgacagggt ggtccagtcc gtcaaggcca tctgcaacgc cctggccgcc gtggaaaccc 2161 ctgagatcac cagtgctctc aaccagctgc ccccctgccc ctcccgcatg cagcctaaaa 2221 ttcagaagga tcccagtgtc ttggctgtga gggaaaaccg agagaaggtc gtggaagccc 2281 tgaccgctgc catcttggac ctggtggagc tgtactgcaa cacattcaac gcagacttcc 2341 agacggcagt gcccgggagc cgcaagcatg acctggtcca ggaggcctgc catttcgcca 2401 ggtccctggc cttcactgtc tatgccaccc accgcatccc catcatctgg gctaccagct 2461 atgaagattt ctacctctcc tgctccctca gccatggcgg caaggacatg tgcagccccc 2521 tgcagacccg aagagctcac ttctccaagt acctcttcca cctcatcgtc tgggaccagc 2581 agatctgctt cccagtgcag gtgaaccggc tgcctcggga gacactgctg tgtgccactc 2641 tctatgctct gcccatcccc ccaccgggga gctcctcaga ggccaataag cagcggcggg 2701 tgcctgaagc cctgggctgg gtcactaccc cactcttcaa cttcaggcag gtcctgacct 2761 gtggccggaa gcttctgggt ttgtggccag caacacagga aaatcccagc gcccgttgga 2821 gtgcacctaa tttccaccag ccagacagtg tcatcctgca gattgacttc cccacctcgg 2881 cctttgacat caagttcacc agcccccctg gagacaagtt cagcccccgc tatgagtttg 2941 gcagcctccg ggaagaagac cagcgcaagc ttaaagacat catgcagaaa gagtccttgt 3001 actggctcac tgatgctgac aagaagcgcc tgtgggagaa gcgatattac tgccactcgg 3061 aggtgagctc gctccccctg gtgctcgcca gcgcccccag ctgggagtgg gcttgcctgc 3121 ctgacatcta tgttctcctg aagcagtgga cccacatgaa ccaccaggat gccctggggc 3181 tcctgcatgc caccttcccg gaccaggagg tgcgtcgtat ggctgtgcag tggattggct 3241 cactctcaga tgctgagctg ctagactacc tgccccagct ggtacaggcc ctgaagtatg 3301 aatgctacct ggacagcccg ttggtgcgct tcctcctgaa acgagctgtg tctgacttga 3361 gagtgactca ctacttcttc tggttactga aggacggcct caaggactct cagttcagca 3421 tccgctacca gtatctgctg gcagccttac tgtgctgctg tggcaagggg ctgagagaag 3481 agtttaaccg ccagtgctgg cttgtcaatg ccctggccaa actggcccag caggtccggg 3541 aggcagcccc atctgcaagg cagggaatcc tccgcacggg cctggaggag gtgaagcagt 3601 tctttgccct caatggctcg tgccgcttgc cactcagccc cagtctgctg gttaagggaa 3661 ttgtgcccag ggactgttcc tacttcaact ccaatgctgt ccccctcaaa ctctccttcc 3721 aaaatgtgga tcccctgggt gagaacatcc gtgtcatctt caagtgtggg gacgaccttc 3781 gccaggacat gctaacgctg cagatgattc gcatcatgag caagatctgg gtccaggagg 3841 ggctggacat gcgcatggtc atcttccgct gcttctccac cggccggggc agagggatgg 3901 tggagatgat ccctaatgct gagaccctgc gtaagatcca ggtggagcat ggggtgaccg 3961 gctcgttcaa ggaccggccc ctggcagact ggctgcagaa acacaaccct ggggaggacg 4021 agtatgagaa ggctgtggag aactttatct actcctgcgc tggctgctgc gtggccacgt 4081 acgtcttggg catctgtgac cgacataatg acaacatcat gctgaagacc actggtcaca 4141 tgttccacat tgattttggc cgcttcctgg gccatgccca gatgtttggc aacatcaagc 4201 gggaccgtgc cccctttgtc ttcacctcgg acatggcgta tgtcatcaac gggggtgaca 4261 agccttccag ccgcttccat gattttgttg acctttgctg ccaagcctac aacctcattc 4321 gcaagcacac ccacctcttc ctcaaccttc tgggcctgat gttgtcctgt gggatccctg 4381 aactctcaga cctggaggac ctcaagtatg tgtacgatgc cctgaggcct caggatacag 4441 aggccaatgc cactacctac ttcactaggt tgattgagtc cagcctgggc agtgtagcca 4501 caaagctcaa ttttttcatc cataatctgg ctcagatgaa gttcacgggc tcagatgacc 4561 ggctgaccct ctcctttgcc tcccgaacac acactctcaa gagctctggc cgaatcagtg 4621 atgttttcct ctgccgccat gagaagatct tccaccccaa caaaggctat atatatgtgg 4681 taaaggtgat gcgagagaac actcacgagg ccacctacat ccagcggacc tttgaggagt 4741 tccaggaatt acacaataag ttgcggctgc tcttcccttc ttcccacttg cccagcttcc 4801 ctagtcgctt cgtgatcggc cgctcccggg gagaggcggt ggccgagcgg cggagggagg 4861 agctaaacgg ttacatctgg cacttgatcc acgcaccccc tgaggtggcc gagtgtgatt 4921 tggtgtacac cttcttccac ccactgcccc gggatgagaa ggctatgggc accagcccag 4981 ctcctaagtc ctcagatggc acatgggccc ggcccgtcgg aaaggtggga ggggaggtga 5041 agctgtccat ctcctacaaa aacaataaac tcttcatcat ggtgatgcat attcggggct 5101 tgcaactgct ccaggatgga aatgaccctg acccctatgt gaaaatttac ctccttcctg 5161 accctcagaa aaccactaag aggaaaacca aagtggcccg gaaaacctgc aatcctacct 5221 acaatgagat gttggtatat gatgggatcc ccaagggtga cctgcagcag cgggagctcc 5281 agctgagcgt gctgagtgag cagggattct gggagaacgt cctcctcggt gaggtgaaca 5341 tccgcctgcg agagctggac ctggctcagg agaagaccgg ctggttcgcc ctgggatctc 5401 gaagtcatgg caccttgtga gcccagcaga gccaccaccc agcatcccag gctggtggca 5461 ggagctgggg gagaggactc tcccctgtga gactcctcct tgtgaagggc cagggccctg 5521 ggcaggcctc cagctcggtc caggtgattc tggcctctgt ggtaggaggc agggagagta 5581 agacatgctc tgctgtctct tcctctggag actgaacttg ggttggttgt gatgagcagc 5641 cccttggagg ctgtgaggtt gcagcaaagt tttaagttta ccttgtgtca agggagcaat 5701 gcttggtttg gggaatgtgt ggggtgggct gtatgaagta ccattttggg ggtgggtggg 5761 tggatatctt aatttttatt tttaaaaaat gaaatagtga tgttgtccta actgggacag 5821 gaagccttgc gagaagggac gtacctatgc cccacaaggc aagagaggaa cactatttgg 5881 actttttgta tgattaaggt tcttattgga cttttcccta ggtttttttt ttttgttatt 5941 gttgttgttg ttccgttttc tagctatagg aactatctgg ggaggggccc agtgggtcct 6001 cggccagagc cctctctaag gacaggttgg ggagggttgg ggagggctgc ctgtgctgga 6061 ctgaggcttg tgccactggg cctttctgat tttgcctcca aaggagagcg ctgtgatacc 6121 tacatgtgta aggaagggcc ttccgtattg gggttctgcc aaggacccgt attcagggac 6181 ccatgctctt ttggggggac ttttcctctt gtcttcccta ctttattagg acttgccctg 6241 aataccattt tctacccctt gcccctccat tctcctggcc cttctggggg tcagctggtc 6301 tctatgaata tgctgggggt gcttccccat aggtctctcc cttcatttgt ctctggtggg 6361 acaaaatact gactcagtcc ttagatgtag tttcacccaa gagcatcttg gccctgggaa 6421 gaggtcccta ggctgcagat gctactgact gcttgctagg tagcctctgg aaagcattcc 6481 ccatccatca ctccccactt ctttctgctg tgctgcttcc ctcccaaact ccatttctgt 6541 cacccttttt ataagacttt tcctcattct gtggggccat aaacctattt agtctggagc 6601 caaagggatg ccctatctga aggaaagggg catggggtgg gggattccat caaaactgtt 6661 gttttttgcc ccatgatttt tctttggtca gtaggaggct ggattggagt ggtgattatt 6721 cccctggagc taagctcagg agcccgaagg gagagactga gactgactcc cttatctctt 6781 catattcttt attccctacc agatggattt tttttttttt ttttggagac ggagtctcgc 6841 cctgtcgcca ggctggagtg tagtggcatg atctcgactc actgcaaaat ctgcctcccg 6901 ggttcaagcg attctcctac ctcagcctcc cgagtagctg ggattacagg catgtgccac 6961 cacgccaagc taatttttgt atttttagta gagacggggt ttcaccatgt tggccaggat 7021 ggtctcgatc tcttgacctc gtgatctgcc tgccttggcc tcccaaagtg ctgggattac 7081 aggcgtgagc caccatgccc cgccccagat ggattttaca tttgctcttt tgtgtttcgc 7141 tccaaagggt tgtcttcctc gccaaaagga gggagggact ttgaatttga tatgaatctt 7201 taaaaccaga attggctgga tatttcccat gattgggaaa agagtgaaat gaggacattc 7261 tgtaaactgt ccctccctaa ttccaaggat cagaaactcc ccgttttgct gactcattcc 7321 ataactggag aaagaagctc cattgaccga agccacaggg cagcatggaa gtttaaattt 7381 tctctaaaat taaaatgcca aggataaagc tggctgcttc caggaggggg aagaggagtg 7441 gggagtgggc ggtgaaactt ttccagatga acggaccata aatgtgttac tggctttgtg 7501 cctgtagctc attttattat gacctatatg ctcctgattt aaagagatct gtgtactgtt 7561 tacttcccac ttcccagaat cccttgtatc tcctttctcg ggaattgtat tttctaataa 7621 atgacatttg agaaaaaaaa aaaaaaaaaa aaaa // LOCUS HSC54 1807 bp RNA PRI 25-MAY-1994 DEFINITION Human homolog of yeast IPP isomerase. ACCESSION X17025 NID g488749 KEYWORDS isopentenyl diphosphate:dimethylallyl diphosphate isomerase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1807) AUTHORS Kowalski,J. TITLE Direct Submission JOURNAL Submitted (25-OCT-1989) Kowalski J., VIDO (Veterinary Infectious Disease Organization), University of Saskatechewan, 124 Veterinary Rd., Saskatoon SK S7N OWO, Canada REMARK revised by [3] MAT REFERENCE 2 (bases 1 to 1807) AUTHORS Xuan,J.W., Kowalski,J., Chambers,A.F. and Denhardt,D.T. TITLE A human promyelocyte mRNA transiently induced by TPA is homologous to yeast IPP isomerase JOURNAL Genomics 20 (1), 129-131 (1994) MEDLINE 94292171 REFERENCE 3 (bases 1 to 1807) AUTHORS Kowalski,J. TITLE Direct Submission JOURNAL Submitted (19-MAY-1994) Kowalski J., VIDO (Veterinary Infectious Disease Organization), University of Saskatechewan, 124 Veterinary Rd., Saskatoon SK S7N OWO, Canada COMMENT Data kindly reviewed (19-APR-1990) by Kowalski J. FEATURES Location/Qualifiers source 1..1807 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="promyelocyte" /cell_line="HL60" /clone="c5" CDS 51..737 /note="homologue of yeast IPP isomerase" /codon_start=1 /db_xref="PID:g488750" /translation="MMPEINTNHLDKQQVQLLAEMCILIDENDNKIGAETKKNCHLNE NIEKGLLHRAFSVFLFNTENKLLLQQRSDAKITFPGCFTNTCCSHPLSNPAELEESDA LGVRRAAQRRLKAELGIPLEEVPPEEINYLTRIHYKAQSDGIWGEHEIDYILLVRKNV TLNPDPNEIKSYCYVSKEELKELLKKAASGEIKITPWFKIIAATFLFKWWDNLNHLNQ FVDHEKIYRM" BASE COUNT 593 a 290 c 366 g 558 t ORIGIN 1 tctgtggccg gaggctgatc agtgttctag aacagatcag acattttgta atgatgcctg 61 aaataaacac taaccacctc gacaagcaac aggttcaact cctggcagag atgtgtatcc 121 ttattgatga aaatgacaat aaaattggag ctgagaccaa gaagaattgt cacctgaacg 181 agaacattga gaaaggatta ttgcatcgag cttttagtgt cttcttattc aacaccgaaa 241 ataagcttct gctacagcaa agatcagatg ctaagattac ctttccaggt tgttttacga 301 atacgtgttg tagtcatcca ttaagcaatc cagccgagct tgaggaaagt gacgcccttg 361 gagtgaggcg agcagcacag agacggctga aagctgagct aggaattccc ttggaagagg 421 ttcctccaga agaaattaat tatttaacac gaattcacta caaagctcag tctgatggta 481 tctggggtga acatgaaatt gattacattt tgttggtgag gaagaatgta actttgaatc 541 cagatcccaa tgagattaaa agctattgtt atgtgtcaaa ggaagaacta aaagaacttc 601 tgaaaaaagc agccagtggt gaaattaaga taacgccatg gtttaaaatt attgcagcga 661 cttttctctt taaatggtgg gataacttaa atcatttgaa tcagtttgtt gaccatgaga 721 aaatatacag aatgtgaata tgtaggtaaa tgattacaga aaaatttatg tgcttaacaa 781 acttagaatg actttttcct tttaaattta gttctatcat taatttatca ttaaatttag 841 ttctatcatt tggtactatc attaatgtat tataaaactt gtgtggaaaa aactaactta 901 taattttgta tcacacaccc tggatatgtg ttctgtttct aagcgacatt tgtgagagat 961 tattgtaaaa tgagagcgag aaataaaact taatttaatc tttgcagata catacttatg 1021 ggaaatttga acaaatgagt gaaactctgt ttttagtagg ccgtgataaa catttccgga 1081 gcacttgcag aggacttgct atttgccagg tgctttatgt atcattaaat ttttctcata 1141 gttcagaaaa atgtgcaaag gaaactattg tctcgctcct tcaaaacagt cttaattaac 1201 tttcatatta gcagattaaa ctagcagagc aaggttcaaa ttaaatgata tgaccctaat 1261 ttgtatcatt ctgagttgat tgtgtggttt attcattctg aaacatgttg atacttacag 1321 tcaccgactg cttttgataa gtgatattga ttaggttgaa tcttcttgta aatagtattt 1381 accagttagc aaagtctgtg ttttcagaat tacagtgagc acagaggtgt tcataaaatg 1441 ggaattgagt cccactcggt aagagttgct taaacttgac actgttgaca tttgggctgg 1501 ataaaacccc tgtggtgggg tctgtgctgt gcattgcagg atggtgagca gcgtccctct 1561 catgtgacac ccacagttat gccggatgtt gccagatgcc cctagggaca gagtcaaccc 1621 ccaactgagg accactgtct acagagtcag gaaatattgt agggagaaaa aaataacaac 1681 aacaaaggcc tatattaatg ttaaatagag gagattatgg aatgtgtata ttaatgttaa 1741 aaattattcc ttattcaatg tatttttatc aaatcgatag atatctcaga tttgaaactc 1801 aagacag // LOCUS HSC9R1 2026 bp RNA PRI 03-OCT-1996 DEFINITION Human mRNA fragment for complement component C9. ACCESSION X02176 NID g29580 KEYWORDS C9 complement component protein; complement protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2026) AUTHORS Stanley,K.K., Kocher,H.P., Luzio,J.P., Jackson,P. and Tschopp,J. TITLE The sequence and topology of human complement component C9 JOURNAL EMBO J. 4 (2), 375-382 (1985) MEDLINE 85257464 REFERENCE 2 (bases 1 to 2026) AUTHORS DiScipio,R.G., Gehring,M.R., Podack,E.R., Kan,C.C., Hugli,T.E. and Fey,G.H. TITLE Nucleotide sequence of cDNA and derived amino acid sequence of human complement component C9 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81 (23), 7298-7302 (1984) MEDLINE 85063778 REFERENCE 3 (bases 1 to 2026) AUTHORS Marazitti,D., Eggertsen,G., Fey,G.H. and Stanley,K.K. TITLE Relationships between the gene and protein structure in human complement component C9 JOURNAL Unpublished FEATURES Location/Qualifiers source 1..2026 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 5..67 /note="signal peptide" CDS 5..1684 /note="precursor" /codon_start=1 /db_xref="PID:g29581" /db_xref="SWISS-PROT:P02748" /translation="MSACRSFAVAICILEISILTAQYTTSYDPELTESSGSASHIDCR MSPWSEWSQCDPCLRQMFRSRSIEVFGQFNGKRCTDAVGDRRQCVPTEPCEDAEDDCG NDFQCSTGRCIKMRLRCNGDNDCGDFSDEDDCESEPRPPCRDRVVEESELARTAGYGI NILGMDPLSTPFDNEFYNGLCNRDRDGNTLTYYRRPWNVASLIYETKGEKNFRTEHYE EQIEAFKSIIQEKTSNFNAAISLKFTPTETNKAEQCCEETASSISLHGKGSFRFSYSK NETYQLFLSYSSKKEKMFLHVKGEIHLGRFVMRNRDVVLTTTFVDDIKALPTTYEKGE YFAFLETYGTHYSSSGSLGGLYELIYVLDKASMKRKGVELKDIKRCLGYHLDVSLAFS EISVGAEFNKDDCVKRGEGRAVNITSENLIDDVVSLIRGGTRKYAFELKEKLLRGTVI DVTDFVNWASSINDAPVLISQKLSPIYNLVPVKMKNAHLKKQNLERAIEDYINEFSVR KCHTCQNGGTVILMDGKCLCACPFKFEGIACEISKQKISEGLPALEFPNEK" mat_peptide 68..1681 /note="mature C9 protein [2]" conflict 131 /note="U is C in [2]" /citation=[2] misc_feature 299..412 /note="LDL receptor homology" misc_feature 727..728 /note="chymotrypsin cleavage site (in vitro)" misc_feature 799..800 /note="thrombin cleavage site (in vitro)" misc_feature 833..835 /note="N-linked oligosaccharide" old_sequence 940..944 /note="u guu g was ug in [1]; (1 codon inserted [3])" /citation=[1] /citation=[3] misc_feature 1240..1241 /note="trypsin cleavage site (in vitro)" misc_feature 1247..1249 /note="N-linked oligosaccharide" conflict 1253 /note="A is C in [2]" /citation=[2] misc_feature 1526..1630 /note="urokinase homology" BASE COUNT 675 a 385 c 426 g 540 t ORIGIN 1 cagcatgtca gcctgccgga gctttgcagt tgcaatctgc attttagaaa taagcatcct 61 cacagcacag tacacgacca gttatgaccc agagctaaca gaaagcagtg gctctgcatc 121 acacatagac tgcagaatga gcccctggag tgaatggtca caatgcgatc cttgtctcag 181 acaaatgttt cgttcaagaa gcattgaggt ctttggacaa tttaatggga aaagatgcac 241 cgacgctgtg ggagacagac gacagtgtgt gcccacagag ccctgtgagg atgctgagga 301 tgactgcgga aatgactttc aatgcagtac aggcagatgc ataaagatgc gacttcggtg 361 taatggtgac aatgactgcg gagacttttc agatgaggat gattgtgaaa gtgagccccg 421 tcccccctgc agagacagag tggtagaaga gtctgagctg gcacgaacag caggctatgg 481 gatcaacatt ttagggatgg atcccctaag cacacctttt gacaatgagt tctacaatgg 541 actctgtaac cgggatcggg atggaaacac tctgacatac taccgaagac cttggaacgt 601 ggcttctttg atctatgaaa ccaaaggcga gaaaaatttc agaaccgaac attacgaaga 661 acaaattgaa gcatttaaaa gtatcatcca agagaagaca tcaaatttta atgcagctat 721 atctctaaaa tttacaccca ctgaaacaaa taaagctgaa caatgttgtg aggaaacagc 781 ctcctcaatt tctttacatg gcaagggtag ttttcggttt tcatattcca aaaatgaaac 841 ttaccaacta tttttgtcat attcttcaaa gaaggaaaaa atgtttctgc atgtgaaagg 901 agaaattcat ctgggaagat ttgtaatgag aaatcgcgat gttgtgctca caacaacttt 961 tgtggatgat ataaaagctt tgccaactac ctatgaaaag ggagaatatt ttgccttttt 1021 ggaaacctat ggaactcact acagtagctc tgggtctcta ggaggactct atgaactaat 1081 atatgttttg gataaagctt ccatgaagcg gaaaggtgtt gaactaaaag acataaagag 1141 atgccttggg tatcatctgg atgtatctct ggctttctct gaaatctctg ttggagctga 1201 atttaataaa gatgattgtg taaagagggg agagggtaga gctgtaaaca tcaccagtga 1261 aaacctcata gatgatgttg tttcactcat aagaggtgga accagaaaat atgcatttga 1321 actgaaagaa aagcttctcc gaggaaccgt gattgatgtg actgactttg tcaactgggc 1381 ctcttccata aatgatgctc ctgttctcat tagtcaaaaa ctgtctccta tatataatct 1441 ggttccagtg aaaatgaaaa atgcacacct aaagaaacaa aacttggaaa gagccattga 1501 agactatatc aatgaattta gtgtaagaaa atgccacaca tgccaaaatg gaggtacagt 1561 gattctaatg gatggaaagt gtttgtgtgc ctgcccattc aaatttgagg gaattgcctg 1621 tgaaatcagt aaacaaaaaa tttctgaagg attgccagcc ctagagttcc ccaatgaaaa 1681 atagagctgt tggcttctct gagctccagt ggaagaagaa aacactagta ccttcagact 1741 cctacccctg aagataatct tagctgccaa gtaaatagca acatgcttca tgaaaatcct 1801 accaacctct gaagtctctt ctctcttagg tctataattt tttttttaat ttttcttcct 1861 taaactcctg tgatgtttcc attttttgtt ccctaatgag aagtcaacag tgaaatacgc 1921 cagaactgct ttatcccacg gaaaatgcca atctcttcta aaaaaaaaca aaattaaatt 1981 aaaaacagaa tgttggttta aaaaacttca aagaaaaaaa aaaaaa // LOCUS HSCAB3A 1611 bp RNA PRI 01-APR-1996 DEFINITION H.sapiens CAB3a mRNA for calcium channel beta3 subunit. ACCESSION X76555 NID g435134 KEYWORDS CAB3a gene; calcium channel; calcium channel beta3 subunit. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1611) AUTHORS Flockerzi,V. TITLE Direct Submission JOURNAL Submitted (02-DEC-1993) V. Flockerzi, Inst. f. Pharmakiologie und Toxikologie, Technische Universitaet Muenchen, 80802 Muenchen, Biedersteiner Str. 29, FRG REFERENCE 2 (bases 1 to 1611) AUTHORS Murakami,M., Wissenbach,U. and Flockerzi,V. TITLE Gene structure of the murine calcium channel beta3 subunit, cDNA and characterization of alternative splicing and transcription products JOURNAL Eur. J. Biochem. 236 (1), 138-143 (1996) MEDLINE 96184890 FEATURES Location/Qualifiers source 1..1611 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /cell_line="human medullary thyroid carcinoma cells hMTC" /clone_lib="cDNA in plasmid vectorpSport (BRl" gene 39..1493 /gene="CAB3a" CDS 39..1493 /gene="CAB3a" /codon_start=1 /product="CAB3a" /db_xref="PID:g435135" /translation="MYDDSYVPGFEDSEAGSADSYTSRPSLDSDVSLEEDRESARREV ESQAQQQLERAKHKPVAFAVRTNVSYCGVLDEECPVQGSGVNFEAKDFLHIKEKYSND WWIGRLVKEGGDIAFIPSPQRLESIRLKQEQKARRSGNPSSLSDIGNRRSPPPSLAKQ KQKQAEHVPPYDVVPSMRPVVLVGPSLKGYEVTDMMQKALFDFLKHRFDGRISITRVT ADLSLAKRSVLNNPGKRTIIERSSARSSIAEVQSEIERIFELAKSLQLVVLDADTINH PAQLAKTSLAPIIVFVKVSSPKVLQRLIRSRGKSQMKHLTVQMMAYDKLVQCPPESFD VILDENQLEDACEHLAEYLEVYWRATHHPAPGPGLLGPPSAIPGLQNQQLLGERGEEH SPLERDSLMPSDEASESSRQAWTGSSQRSSRHLEEDYADAYQDLYQPHRQHTSGLPSA NGHDPQDRLLAQDSEHNHSDRNWQRNRPWPKDSY" BASE COUNT 355 a 515 c 458 g 283 t ORIGIN 1 cccccggcgc cgctcgctcc cccgacccgg actcccccat gtatgacgac tcctacgtgc 61 ccgggtttga ggactcggag gcgggttcag ccgactccta caccagccgc ccatctctgg 121 actcagacgt ctccctggag gaggaccggg agagtgcccg gcgtgaagta gagagccagg 181 ctcagcagca gctcgaaagg gccaagcaca aacctgtggc atttgcggtg aggaccaatg 241 tcagctactg tggcgtactg gatgaggagt gcccagtcca gggctctgga gtcaactttg 301 aggccaaaga ttttctgcac attaaagaga agtacagcaa tgactggtgg atcgggcggc 361 tagtgaaaga gggcggggac atcgccttca tccccagccc ccagcgcctg gagagcatcc 421 ggctcaaaca ggagcagaag gccaggagat ctgggaaccc ttccagcctg agtgacattg 481 gcaaccgacg ctcccctccg ccatctctag ccaagcagaa gcaaaagcag gcggaacatg 541 ttcccccata tgacgtggtg ccctccatgc ggcctgtggt gctggtggga ccctctctga 601 aaggttatga ggtcacagac atgatgcaga aggctctctt cgacttcctc aaacacagat 661 ttgatggcag gatctccatc acccgagtca cagccgacct ctccctggca aagcgatctg 721 tgctcaacaa tccgggcaag aggaccatca ttgagcgctc ctctgcccgc tccagcattg 781 cggaagtgca gagtgagatc gagcgcatat ttgagctggc caaatccctg cagctagtag 841 tgttggacgc tgacaccatc aaccacccag cacagctggc caagacctcg ctggccccca 901 tcatcgtctt tgtcaaagtg tcctcaccaa aggtactcca gcgtctcatt cgctcccggg 961 ggaagtcaca gatgaagcac ctgaccgtac agatgatggc atatgataag ctggttcagt 1021 gcccaccgga gtcatttgat gtgattctgg atgagaacca gctggaggat gcctgtgagc 1081 acctggctga gtacctggag gtttactggc gggccacgca ccacccagcc cctggccccg 1141 gacttctggg tcctcccagt gccatccccg gacttcagaa ccagcagctg ctgggggagc 1201 gtggcgagga gcactccccc cttgagcggg acagcttgat gccctctgat gaggccagcg 1261 agagctcccg ccaagcctgg acaggatctt cacagcgtag ctcccgccac ctggaggagg 1321 actatgcaga tgcctaccag gacctgtacc agcctcaccg ccaacacacc tcggggctgc 1381 ctagtgctaa cgggcatgac ccccaagacc ggcttctagc ccaggactca gagcacaacc 1441 acagtgaccg gaactggcag cgcaaccggc cttggcccaa ggatagctac tgacagcctc 1501 ctgctgccct accctggcag gcacaggcgc acgtggctgg ggggcccact ccaggcaggg 1561 tggcgttaga ctggcatcag gctggcacta ggctcagccc caaaaccccc t // LOCUS HSCALBR 2376 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for 27-kDa calbindin. ACCESSION X06661 NID g29603 KEYWORDS calbindin; calcium binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2376) AUTHORS Parmentier,M., Lawson,D.E. and Vassart,G. TITLE Human 27-kDa calbindin complementary DNA sequence. Evolutionary and functional implications JOURNAL Eur. J. Biochem. 170 (1-2), 207-215 (1987) MEDLINE 88082818 REFERENCE 2 (bases 1 to 2376) AUTHORS Parmentier,M. TITLE Direct Submission JOURNAL Submitted (09-MAY-1988) to the EMBL/GenBank/DDBJ databases COMMENT Data kindly reviewed (09-MAY-1988) by Parmentier M. FEATURES Location/Qualifiers source 1..2376 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /clone_lib="cDNA in lambda gt11" /clone="HBSC21, HBSC27" old_sequence 1 /note="c was cuc in [1] (ligation artefact)" /citation=[1] CDS 24..809 /note="calbindin (AA 1-261)" /codon_start=1 /db_xref="PID:g29604" /db_xref="SWISS-PROT:P05937" /translation="MAESHLQSSLITASQFFEIWLHFDADGSGYLEGKELQNLIQELQ QARKKAGLELSPEMKTFVDQYGQRDDGKIGIVELAHVLPTEENFLLLFRCQQLKSCEE FMKTWRKYDTDHSGFIETEELKNFLKDLLEKANKTVDDTKLAEYTDLMLKLFDSNNDG KLELTEMARLLPVQENFLLKFQGIKMCGKEFNKAFELYDQDGNGYIDENELDALLKDL CEKNKQDLDINNITTYKKNIMALSDGGKLYRTDLALILCAGDN" misc_feature 2348..2357 /note="two overlapping pot. polyA signals" polyA_site 2372 /note="put. polyA site" BASE COUNT 786 a 409 c 436 g 745 t ORIGIN 1 ccagacacac accccgctgt acaatggcag aatcccacct gcagtcatcc ctcatcacag 61 cctcacagtt tttcgagatc tggctccatt tcgacgctga cggaagtggt tacctggaag 121 gaaaggagct gcagaacttg atccaggagc tccagcaggc gcgaaagaag gctggattgg 181 agttatcacc tgaaatgaaa acttttgtgg atcagtatgg gcaaagagat gatggaaaaa 241 taggaattgt agagttggct cacgtattac ccacagaaga gaatttcctg ctgctcttcc 301 gatgccagca gctgaagtcc tgtgaggaat tcatgaagac atggagaaaa tatgatactg 361 accacagtgg cttcatagaa actgaggagc ttaagaactt tctaaaggac ctgctagaaa 421 aagcaaacaa gactgttgat gacacaaaat tagccgagta tacagaccta atgctgaaac 481 tatttgattc aaataatgat gggaagctgg aattaactga gatggccagg ttactaccag 541 tgcaggagaa ttttcttctt aaattccagg gaatcaaaat gtgtgggaaa gagttcaata 601 aggcttttga gctgtatgat caggacggca atggatacat agatgaaaat gaactggatg 661 ctttactgaa ggatctgtgc gagaagaata aacaggatct ggatattaat aatattacaa 721 catacaagaa gaacataatg gctttgtcgg atggagggaa gctgtaccga acggatcttg 781 ctcttattct ctgtgctggg gataactaga gttggtggcc gcaaccactt gctagtgata 841 cactgtatct aaaaaataac tgtgcactat aagggagtag gctgtatttt cttttatatc 901 tgtaaattta actgcatata gataattatc caggatgtgt ggctcattct tttcagcttg 961 tttctatact gtttgtaata tacagttttt gtaaccatat gattgaaaag aagaaagtct 1021 atgcttaggc cagtcagtac acccaatttt aaaaaataac atattcttgc tttcacaaat 1081 atagttgaac aagatttccc taaaaattcc accaggatta atctctaaaa ttctagtctc 1141 tgatttgcaa atgcacattt gtcactgaat aatggaatta tgtataacaa gccaaacatt 1201 cttattttag acaaccatag aactgtccca caaaatattt ctaagcttat ttctaactat 1261 taggaggaat gtgcttttcc atctaaaata ctcaccaaaa tatagttaat tgtggcttta 1321 tgaagttaac agtctcatta cagatttagt ttaccaatca acagcatgtc tactgcttgg 1381 atccatacaa aactatcggt tcaagttgat gtgacaaggg aagggagcac cagatgacac 1441 ataaatctgt ctgattctat gcctgtattt ccaacaaact tactgtcaga gaatatgacc 1501 taaatccatt ttctaaactg ttttcatgtg ttgcaaatta ttctagtcaa ctgctgtttt 1561 atgtcatact ctgtgtaatc tctgattaaa tttaatatac tgcatatcct ggtgtctagt 1621 ttgcatactt cctggatttt ctttctatgt agaactgttc atttccacca agggtatctg 1681 ctgcctctga aaatattttt ttctagctat aacaactcta ttttttacta cataattaaa 1741 ttttaatgta aaattcatag catcctgatt attgaatgtt atatcatcaa tacttttgtg 1801 tattctgtgg attctatatt tcatattgag atcagcattc aaaatagttc tatttctatc 1861 tgcaaatagt ttcaaatgag tttaaaaaaa taacatctga aaagaaatgc taatgtaatc 1921 atttatctta tctagcaaga agattctaaa acattcttta acatacatct aagtcagttt 1981 cacatatttg tagctagaat atcctatact ggttatagtt gatatgtaac agttggtgat 2041 tttagatttc tttgattgtg aaacagggag ctatgagaga tgtgtccatg tgaaatttac 2101 agttactgcc taggagttaa tgatcgttct gggtcagctt gaatgtcccc attctataaa 2161 ttcaacactt attttctgaa ttcataaaaa taaccaaaaa atgtgagcta taatgtttcc 2221 ctcaagaaca aacagaaacg agatttgcca aaaactaaaa ttcaacaaat gatgttgagt 2281 gggagattgg ctttgccttt agcgtgtaaa tggaagcact gccattagac tgaatttaac 2341 tactaagaat aaataaagaa gaaaataacc ttaaaa // LOCUS HSCALCY 570 bp RNA PRI 04-DEC-1997 DEFINITION H.sapiens mRNA for calcyphosine. ACCESSION X97966 NID g1359716 KEYWORDS calcyphosine. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 570) AUTHORS El Housni,H., Radulescu,A., Lecocq,R., Dumont,J.E. and Christophe,D. TITLE Cloning and sequence analysis of human calcyphosine complementary DNA JOURNAL Biochim. Biophys. Acta 1352 (3), 249-252 (1997) MEDLINE 97368181 REFERENCE 2 (bases 1 to 570) AUTHORS El Housni,H. TITLE Direct Submission JOURNAL Submitted (17-MAY-1996) H. El Housni, IRIBHN - Universit Libre de Bruxelles, ULB-Erasme Bt. C, 808 route de Lennik, 1070 Bruxelles, BELGIUM FEATURES Location/Qualifiers source 1..570 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /dev_stage="adult" /tissue_type="thyroid" /map="19p13.3" CDS 1..570 /codon_start=1 /product="calcyphosine" /db_xref="PID:e245872" /db_xref="PID:g1359717" /db_xref="SWISS-PROT:Q13938" /translation="MDAVDATMEKLRAQCLSRGASGIQGLARFFRQLDRDGSRSLDAD EFRQGLAKLGLVLDQAEAEGVCRKWDRNGSGTLDLEEFLRALRPPMSQAREAVIAAAF AKLDRSGDGVVTVDDLRGVYSGRAHPKVRSGEWTEDEVLRRFLDNFDSSEKDGQVTLA EFQDYYSGVSASMNTDEEFVAMMTSAWQL" BASE COUNT 102 a 161 c 212 g 95 t ORIGIN 1 atggacgccg tggatgccac catggagaaa ctccgggcac agtgcctgtc ccgcggggcc 61 tcgggcatcc agggcctggc caggtttttc cgccaactag accgggacgg gagcagatcc 121 ctggacgctg atgagttccg gcagggtctg gccaaactcg ggctggtgct ggaccaggcg 181 gaggcagagg gtgtgtgcag gaagtgggac cgcaatggca gcgggacgct ggatctggag 241 gagttccttc gggcgctgcg gccccccatg tcccaggccc gggaggctgt catcgcagct 301 gcatttgcca agctggaccg cagtggggac ggcgtcgtga cggtggacga cctccgcggg 361 gtgtacagtg gccgtgccca ccccaaggtg cgcagtgggg agtggaccga ggacgaggtg 421 ctgcgccgct tcctggacaa cttcgactcc tctgagaagg atgggcaggt cacactggcg 481 gaattccagg actactacag cggcgtgagt gcctccatga acacggatga ggagttcgtg 541 gccatgatga ccagtgcctg gcagctgtga // LOCUS HSCALRTR 1426 bp RNA PRI 07-MAY-1991 DEFINITION Human mRNA for calretinin. ACCESSION X56667 NID g29635 KEYWORDS calcium binding protein; calretinin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1426) AUTHORS Parmentier,M. TITLE Direct Submission JOURNAL Submitted (15-NOV-1990) M. Parmentier, I R I B M N, U L B CAMPUS ERASME, 808 ROUTE DE LENNIK, 1070 BRUSSELS, BELGIUM REFERENCE 2 (bases 1 to 1426) AUTHORS Parmentier,M. and Lefort,A. TITLE Structure of the human brain calcium-binding protein calretinin and its expression in bacteria JOURNAL Eur. J. Biochem. 196 (1), 79-85 (1991) MEDLINE 91160569 FEATURES Location/Qualifiers source 1..1426 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA 1..1426 /note="calretinin" /evidence=experimental CDS 44..859 /codon_start=1 /product="calretinin" /db_xref="PID:g29636" /db_xref="SWISS-PROT:P22676" /translation="MAGPQQQPPYLHLAELTASQFLEIWKHFDADGNGYIEGKELENF FQELEKARKGSGMMSKSDNFGEKMKEFMQKYDKNSDGKIEMAELAQILPTEENFLLCF RQHVGSSAEFMEAWRKYDTDRSGYIEANELKGFLSDLLKKANRPYDEPKLQEYTQTIL RMFDLNGDGKLGLSEMSRLLPVQENFLLKFQGMKLTSEEFNAIFTFYDKDRSGYIDEH ELDALLKDLYEKNKKEINIQQLTNYRKSVMSLAEAGKLYRKDLEIVLCSEPPM" polyA_site 1426 BASE COUNT 353 a 392 c 382 g 299 t ORIGIN 1 cggagcggga gcggtgcagg ctgaggtctc cgagcggctc gccatggctg gcccgcagca 61 gcagccccct tacctgcacc tggccgagct gacggcgtcc cagttcctgg aaatatggaa 121 gcactttgac gcagacggaa atgggtatat tgaaggtaaa gagctagaaa actttttcca 181 agagctggag aaggcaagga aaggctctgg catgatgtca aagagtgaca actttggaga 241 aaagatgaag gagttcatgc agaagtatga taaaaactca gatgggaaaa tcgagatggc 301 agagctggcg cagatcctgc caaccgaaga gaacttcctt ctgtgcttca ggcagcacgt 361 gggctccagc gccgagttta tggaggcttg gcggaagtac gacacagaca ggagtggcta 421 catcgaagcc aatgagctca agggattcct gtcagacctg ctgaagaagg cgaaccggcc 481 gtacgatgag cccaagctcc aggaatacac ccaaaccata ctacggatgt ttgacttgaa 541 cggggatggc aaattgggcc tctcagagat gtcccgactc ctgcctgtcc aggaaaactt 601 cctgcttaaa tttcagggca tgaagctgac ctcagaggag tttaacgcga tcttcacatt 661 ttacgacaag gatagaagcg gctacattga cgagcatgag ctggatgccc ttttgaagga 721 tctgtacgag aaaaacaaaa aggaaatcaa tattcaacag ctcaccaact acagaaagag 781 cgtcatgtcc ttggcagagg cagggaagct ctaccgcaag gacctggaga ttgtgctctg 841 cagcgagccc cccatgtaaa gtggggacgg gggctgcttc tccacctccc ccaaaccctg 901 cttctgctgc cctgatgcgt ctacccagac tcagagaccg tgagcgcccc gcccccaccc 961 ctacagcctg cacacacctg cctgcagagc aggaaacgag agatagagga tgggcagctg 1021 gggggctgtc ctgagccccc tgcacccacc cctgcccagg cagtctttgc tcagtggatc 1081 acacacatgg aaggtgatgg gggcatgggt ggagggtccc taattctctt cgctgtgatg 1141 catgagctcc ctcgctgtat gatttaggct tctatgtcca acagagtgga ctcttccctc 1201 tcgctcccct ctgccggtcc cccatgccac cacccacccc aaacttccag gttccatcca 1261 ccaccttgcc aatggtgtag ctgtcctctc agaactcctg tgtgtggaag gcacccgccc 1321 tttccttgcc ttctttactc ggcgtgctcc ttttctcttt gggtttcttg tttaccaaag 1381 aagagtttac agacaataaa atggaaaggt cctgctgtgg aaactt // LOCUS HSCALT 1087 bp RNA PRI 10-FEB-1994 DEFINITION H.sapiens mRNA for caltractin. ACCESSION X72964 NID g441311 KEYWORDS calcium binding; caltractin; EF-hand. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1087) AUTHORS Lee,V.D. and Huang,B. TITLE Molecular cloning and centrosomal localization of human caltractin JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90 (23), 11039-11043 (1993) MEDLINE 94068540 REFERENCE 2 (bases 1 to 1087) AUTHORS Lee,V.D. TITLE Direct Submission JOURNAL Submitted (29-MAR-1993) V.D. Lee, The Scripps Research Institute, Dept. of Cell Biology, 10666 North Torrey Pines Road, La Jolla, CA 92037, USA FEATURES Location/Qualifiers source 1..1087 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="umbilical vein" /cell_type="endothelial" /clone_lib="cDNA, lambda gt11" CDS 48..566 /codon_start=1 /product="caltractin" /db_xref="PID:g454248" /db_xref="SWISS-PROT:P41208" /translation="MASNFKKANMASSSQRKRMSPKPELTEEQKQEIREAFDLFDADG TGTIDVKELKVAMRALGFEPKKEEIKKMISEIDKEGTGKMNFGDFLTVMTQKMSEKDT KEEILKAFKLFDDDETGKISFKNLKRVAKELGENLTDEELQEMIDEADRDGDGEVSEQ EFLRIMKKTSLY" BASE COUNT 337 a 212 c 241 g 297 t ORIGIN 1 agtgtacacg tcggttgcct aacaaccggc agcggactcc tttggctatg gcctccaact 61 ttaagaaggc aaacatggca tcaagttctc agcgaaaaag aatgagccct aagcctgagc 121 ttactgaaga gcaaaagcag gagatccggg aagcttttga tcttttcgat gcggatggaa 181 ctggcaccat agatgttaaa gaactgaagg tggcaatgag ggccctgggc tttgaaccca 241 agaaagaaga aattaagaaa atgataagtg aaattgataa ggaagggaca ggaaaaatga 301 actttggtga ctttttaact gtgatgaccc agaaaatgtc tgagaaagat actaaagaag 361 aaatcctgaa agctttcaag ctctttgatg atgatgaaac tgggaagatt tcgttcaaaa 421 atctgaaacg cgtggccaag gagttgggtg agaacctgac tgatgaggag ctgcaggaaa 481 tgattgatga agctgatcga gatggagatg gagaggtcag tgagcaagag ttcctgcgca 541 tcatgaaaaa gaccagcctc tattaagatc agtgtcttct ttttctactg caagcacatg 601 taactagatt tagtgcctgc catggtgtga aatctggctt ttgagaacac aaacttttcc 661 cccacggacc tccctttatc actttaatag tgaccttgag cctattttag ccgtttggaa 721 gtgttctttg atattacagt tctttgtaaa atgacctgcg aattacccta attctcaaaa 781 gcaaaacaag agcacacaag cgtgaagaaa aggatcttaa agctttgagc acctgccatt 841 ttgccttgca tcgtttccct cgtcatgcat ttccacatat ccacaaacac agaacgactt 901 tagacaagca catgttacac ctgtgttgcc acaagcagtc attcttgacg gctccagttt 961 ttatttgaca cttgagttta gttttctctt ttataaaccc agtgaactcc tgcactggca 1021 tttggatgtg tgttaatgct atttgttttg tcttaaaagt aaaacctttc tcagtttgaa 1081 aaaaaaa // LOCUS HSCAN 6597 bp RNA PRI 19-MAY-1997 DEFINITION H.sapiens can mRNA. ACCESSION X64228 S89710 NID g29652 KEYWORDS can gene; putative oncogene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6597) AUTHORS von Lindern,M. TITLE Direct Submission JOURNAL Submitted (20-JAN-1992) M. Von Lindern, Dept of Cell Biology, Erasmus University, P.O. Box 1738, 3000 DR Rotterdam, THE NETHERLANDS REFERENCE 2 (bases 1 to 6597) AUTHORS von Lindern,M., Fornerod,M., van Baal,S., Jaegle,M., de Wit,T., Buijs,A. and Grosveld,G. TITLE The translocation (6;9), associated with a specific subtype of acute myeloid leukemia, results in the fusion of two genes, dek and can, and the expression of a chimeric, leukemia-specific dek-can mRNA JOURNAL Mol. Cell. Biol. 12 (4), 1687-1697 (1992) MEDLINE 92195315 COMMENT See also X64229. FEATURES Location/Qualifiers source 1..6597 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /tissue_type="testes" /clone_lib="human testes (Clonetech)" /clone="hXT23, hXT37, hXT54 & hXT65" /chromosome="9q34" gene 95..6367 /gene="can" CDS 95..6367 /gene="can" /codon_start=1 /product="putative oncogene" /db_xref="PID:g29653" /db_xref="SWISS-PROT:P35658" /translation="MGDEMDAMIPEREMKDFQFRALKKVRIFDSPEELPKERSSLLAV SNKYGLVFAGGASGLQIFPTKNLLIQNKPGDDPNKIVDKVQGLLVPMKFPIHHLALSC DNLTLSACMMSSEYGSIIAFFDVRTFSNEAKQQKRPFAYHKLLKDAAGMVIDMKWNPT VPSMVAVCLADGSIDVLQVTETVKVCATLPSTVAVTSVCWSPKGKQLAVGKQNGTVVQ YLPTLQEKKVIPCPPFYESDHPVRVLDVLWIGTYVFAIVYAAADGTLETSPDVVMALL PKKEEKHPEIFVNFMEPCYGSCTERQHHYYLSYIEEWDLVLAASAASTEVSILARQSD QINWESWLLEDSSRAELPVTDKSDDSLPMGVVVDYTNQVEITISDEKTLPPAPVLMLL STDGVLCPFYMINQNPGVKSLIKTPERLSLEGERQPKSPGSTPTTPTSSQAPQKLDAS AAAAPASLPPSSPAAPIATFSLLPAGGAPTVFSFGSSSLKSSATVTGEPPSYSSGSDS SKAAPGPGPSTFSFVPPSKASLAPTPAASPVAPSAASFSFGSSGFKPTLESTPVPSVS APNIAMKSSFPPSTSAVKVNLSEKFTAAATSTPVSSSQSAPPMSPFSSASKPAASGPL SHPTPLSAPPSSVPLKSSVLPSPSGRSAQGSSSPVPSMVQKSPRITPPAAKPGSPQAK SLQPAVAEKQGHQWKDSDPVMAGIGEEIAHFQKELEELKARTSKACFQVGTSEEMKML RTESDDLHTFLLEIKETTESLHGDISSLKTTLLEGFAGVEEAREQNERNRDSGYLHLL YKRPLDPKSEAQLQEIRRLHQYVKFAVQDVNDVLDLEWDQHLEQKKKQRHLLVPERET LFNTLANNREIINQQRKRLNHLVDSLQQLRLYKQTSLWSLSSAVPSQSSIHSFDSDLE SLCNALLKTTIESHTKSLPKVPAKLSPMKQAQLRNFLAKRKTPPVRSTAPASLSRSAF LSQRYYEDLDEVSSTSSVSQSLESEDARTSCKDDEAVVQAPRHAPVVRTPSIQPSLLP HAAPFAKSHLVHGSSPGVMGTSVATSASKIIPQGADSTMLATKTVKHGAPSPSHPISA PQQLAAAALRRQMASQAPAVNTLTESTLKNVPQVVNVQELKNNPATPSTAMGSSVPYS TAKTPHPVLTPVAANQAKQGSLINSLKPSGPTPASGQLSSGDKASGTAKIETAVTSTP SASGQFSKPFSFSPSGTGFNFGIITPTPSSNFTAAQGATPSTKESSQPDAFSSGGGSK PSYEAIPESSPPSGITSASNTTPGEPAASSSRPVAPSGTALSTTSSKLETPPSKLGEL LFPSSLAGETLGSFSGLRVGQADDSTKPTNKASSTSLTSTQPTKTSGVPSGFNFTAPP VLGKHTEPPVTSSATTTSVAPPAATSTSSTAVFGSLPVTSAGSSGVISFGGTSLSAGK TSFSFGSQQTNSTVPPSAPPPTTAATPLPTSFPTLSFGSLLSSATTPSLPMSAGRSTE EATSSALPEKPGDSEVSASAASLLEEQQSAQLPQAPPQTSDSVKKEPVLAQPAVSNSG TAASSTSLVALSAEATPATTGVPDARTEAVPPASSFSVPGQTAVTAAAISSAGPVAVE TSSTPIASSTTSIVAPGPSAEAAAFGTVTSGSSVFAQPPAASSSSAFNQLTNNTATAP SATPVFGQVAASTAPSLFGQQTGSTASTAAATPQVSSSGFSSPAFGTTAPGVFGQTTF GQASVFGQSASSAASVFSFSQPGFSSVPAFGQPASSTPTSTSGSVFGAASSTSSSSSF SFGQSSPNTGGGLFGQSNAPAFGQSPGFGQGGSVFGGTSAATTTAATSGFSFCQASGF GSSNTGSVFGQAASTGGIVFGQQSSSSSGSVFGSGNTGRGGGFFSGLGGKPSQDAANK NPFSSASGGFGSTATSNTSNLFGNSGAKTFGGFASSSFGEQKPTGTFSSGGGSVASQG FGFSSPNKTGGFGAAPVFGSPPTFGGSPGFGGVPAFGSAPAFTSPLGSTGGKVFGEGT AAASAGGFGFGSSSNTTSFGTLASQNAPTFGSLSQQTSGFGTQSSGFSGFGSGTGGFS FGSNNSSVQGFGGWRS" misc_feature 4052 /gene="can" /note="translocation breakpoint" BASE COUNT 1590 a 1870 c 1549 g 1588 t ORIGIN 1 aggggaggaa gtttgctgtc gagcggcctg ggttccgtgg gcaaggcgtg ggtggcagcg 61 ttggctcgtt cgacgacaca ctgagggcgg cgcgatggga gacgagatgg atgccatgat 121 tcccgagcgg gagatgaagg attttcagtt tagagcgcta aagaaggtga gaatctttga 181 ctcccctgag gaattgccca aggaacgctc gagtctgctt gctgtgtcca acaaatatgg 241 tctggtcttc gctggtggag ccagtggctt gcagattttt cctactaaaa atcttcttat 301 tcaaaataaa cccggagatg atcccaacaa aatagttgat aaagtccaag gcttgctagt 361 tcctatgaaa ttcccaatcc atcacctggc cttgagctgt gataacctca cactctctgc 421 gtgcatgatg tccagtgaat atggttccat tattgctttt tttgatgttc gcacattctc 481 aaatgaggct aaacagcaaa aacgcccatt tgcctatcat aagcttttga aagatgcagc 541 aggcatggtg attgatatga agtggaaccc cactgtcccc tccatggtgg cagtttgtct 601 ggctgatggt agtattgatg tcctgcaagt cacggaaaca gtgaaagtat gtgcaactct 661 tccttccacg gtagcagtaa cctctgtgtg ctggagcccc aaaggaaagc agctggcagt 721 gggaaaacag aatggaactg tggtccagta tcttcctact ttgcaggaaa aaaaagtcat 781 tccttgtcct ccgttttatg agtcagatca tcctgtcaga gttctggatg tgctgtggat 841 tggtacctac gtcttcgcca tagtgtatgc tgctgcagat gggaccctgg aaacgtctcc 901 agatgtggtg atggctctac taccgaaaaa agaagaaaag cacccagaga tatttgtgaa 961 ctttatggag ccctgttatg gcagctgcac ggagagacag catcattact acctcagtta 1021 cattgaggaa tgggatttag tgctggcagc atctgcggct tcaacagaag ttagtatcct 1081 tgctcgacaa agtgatcaga ttaattggga atcttggcta ctggaggatt ctagtcgagc 1141 tgaattgcct gtgacagaca agagtgatga ctccttgccc atgggagttg tcgtagacta 1201 tacaaaccaa gtggaaatca ccatcagtga tgaaaagact cttcctcctg ctccagttct 1261 catgttactt tcaacagatg gtgtgctttg tccattttat atgattaatc aaaatcctgg 1321 ggttaagtct ctcatcaaaa caccagagcg actttcatta gaaggagagc gacagcccaa 1381 gtcaccagga agtactccca ctaccccaac ctcctctcaa gccccacaga aactggatgc 1441 ttctgcagct gcagcccctg cctctctgcc accttcatca cctgctgctc ccattgccac 1501 tttttctttg cttcctgctg gtggagcccc cactgtgttc tcctttggtt cttcatcttt 1561 gaagtcatct gctacggtca ctggggagcc cccttcatat tccagtggct ccgacagctc 1621 caaagcagcc ccaggccctg gcccatcaac cttctctttt gttccccctt ctaaagcctc 1681 cctagccccc acccctgcag cgtctcctgt ggctccatca gctgcttcat tctcctttgg 1741 atcatctggt tttaagccta ccctggaaag cacaccagtg ccaagtgtgt ctgctccaaa 1801 tatagcaatg aagtcctcct tcccaccctc aacctctgct gtcaaagtca accttagtga 1861 aaagtttact gctgcagcta cctctactcc tgttagtagc tcccagagcg cacccccgat 1921 gtcgccattc tcttctgcct ccaagccagc tgcttctgga ccactcagcc accccacgcc 1981 tctctcagca ccacctagtt ccgtgccatt gaagtcctca gtcttgccct caccatcagg 2041 acgatctgct cagggcagtt caagcccagt gccctcaatg gtacagaaat cacccaggat 2101 aacccctcca gcggcaaagc caggctctcc ccaggcaaag tcacttcagc ctgctgttgc 2161 agaaaagcag ggacatcagt ggaaagattc agatcctgta atggctggaa ttggggagga 2221 gattgcacac tttcagaagg agttggaaga gttaaaagcc cgaacttcca aagcctgttt 2281 ccaagtgggc acttctgagg agatgaagat gctgcgaaca gaatcagatg acttgcatac 2341 ctttcttttg gagattaaag agaccacaga gtcgcttcat ggagatataa gtagcctgaa 2401 aacaacttta cttgagggct ttgctggtgt tgaggaagcc agagaacaaa atgaaagaaa 2461 tcgtgactct ggttatctgc atttgcttta taaaagacca ctggatccca agagtgaagc 2521 tcagcttcag gaaattcggc gccttcatca gtatgtgaaa tttgctgtcc aagatgtgaa 2581 tgatgttcta gacttggagt gggatcagca tctggaacaa aagaaaaaac aaaggcacct 2641 gcttgtgcca gagcgagaga cactgtttaa caccctagcc aacaatcggg aaatcatcaa 2701 ccaacagagg aagaggctga atcacctggt ggatagtctt cagcagctcc gcctttacaa 2761 acagacttcc ctgtggagcc tgtcctcggc tgttccttcc cagagcagca ttcacagttt 2821 tgacagtgac ctggaaagcc tgtgcaatgc tttgttgaaa accaccatag aatctcacac 2881 caaatccttg cccaaagtac cagccaaact gtcccccatg aaacaggcac aactgagaaa 2941 cttcttggcc aagaggaaga ccccaccagt gagatccact gctccagcca gcctgtctcg 3001 atcagccttt ctgtctcaga gatattatga agacttggat gaagtcagct caacgtcatc 3061 tgtctcccag tctctggaga gtgaagatgc acggacgtcc tgtaaagatg acgaggcagt 3121 ggttcaggcc cctcggcacg cccccgtggt tcgcactcct tccatccagc ccagtctctt 3181 gccccatgca gcaccttttg ctaaatctca cctggttcat ggttcttcac ctggtgtgat 3241 gggaacttca gtggctacat ctgctagcaa aattattcct caaggggccg atagcacaat 3301 gcttgccacg aaaaccgtga aacatggtgc acctagtcct tcccacccca tctcagcccc 3361 gcagcagctg gccgcagcag cactcaggcg gcagatggcc agtcaggcac cagctgtaaa 3421 cactttgact gaatcaacgt tgaagaatgt ccctcaagtg gtaaatgtgc aggaattgaa 3481 gaataaccct gcaacccctt ctacagccat gggttcttca gtgccctact ccacagccaa 3541 aacacctcac ccagtgttga ccccagtggc tgctaaccaa gccaagcagg ggtctctaat 3601 aaattccctt aagccatctg ggcctacacc agcatccggt cagttatcat ctggtgacaa 3661 agcttcaggg acagccaaga tagaaacagc tgtgacttca accccatctg cttctgggca 3721 gttcagcaag cctttctcat tttctccatc agggactggc tttaattttg ggataatcac 3781 accaacaccg tcttctaatt tcactgctgc acaaggggca acaccctcca ctaaagagtc 3841 aagccagccg gacgcattct catctggtgg gggaagcaaa ccttcttatg aggccattcc 3901 tgaaagctca cctccctcag gaatcacatc cgcatcaaac accaccccag gagaacctgc 3961 cgcatctagc agcagacctg tggcaccttc tggaactgct ctttccacca cctctagtaa 4021 gctggaaacc ccaccgtcca agctgggaga gcttctgttt ccaagttctt tggctggaga 4081 gactctggga agtttttcag gactgcgggt tggccaagca gatgattcta caaaaccaac 4141 caataaggct tcatccacaa gcctaactag tacccagcca accaagacgt caggcgtgcc 4201 ctcagggttt aattttactg cccccccggt gttagggaag cacacggagc cccctgtgac 4261 atcctctgca accaccacct cagtagcacc accagcagcc accagcactt cctcaactgc 4321 cgtttttggc agtctgccag tcaccagtgc aggatcctct ggggtcatca gttttggtgg 4381 gacatctcta agtgctggca agactagttt ttcatttgga agccaacaga ccaatagcac 4441 agtgccccca tctgccccac caccaactac agctgccact ccccttccaa catcattccc 4501 cacattgtca tttggtagcc tcctgagttc agcaactacc ccctccctgc ctatgtccgc 4561 tggcagaagc acagaagagg ccacttcatc agctttgcct gagaagccag gtgacagtga 4621 ggtctcagca tcagcagcct cacttctaga ggagcaacag tcagcccagc ttccccaggc 4681 tcctccgcaa acttctgact ctgttaaaaa agaacctgtt cttgcccagc ctgcagtcag 4741 caactctggc actgcagcat ctagtactag tcttgtagca ctttctgcag aggctacccc 4801 agccaccacg ggggtccctg atgccaggac ggaggcagta ccacctgctt cctccttttc 4861 tgtgcctggg cagactgctg tcacagcagc tgctatctca agtgcaggcc ctgtggccgt 4921 cgaaacatca agtaccccca tagcctccag caccacgtcc attgttgctc ccggcccatc 4981 tgcagaggca gcagcatttg gtaccgtcac ttctggctca tccgtctttg ctcagcctcc 5041 tgctgccagt tctagctcag ctttcaacca gctcaccaac aacacagcca ctgccccctc 5101 tgccacgccc gtgtttgggc aagtggcagc cagcaccgca ccaagtctgt ttgggcagca 5161 gactggtagc acagccagca cagcagctgc cacaccacag gtcagcagct cagggtttag 5221 cagcccagct tttggtacca cagccccagg ggtctttgga cagacaacct tcgggcaggc 5281 ctcagtcttt gggcagtcgg cgagcagtgc tgcaagtgtc ttttccttca gtcagcctgg 5341 gttcagttcc gtgcctgcct tcggtcagcc tgcttcctcc actcccacat ccaccagtgg 5401 aagtgtcttt ggtgccgcct caagtaccag tagctccagt tccttctcat ttggacagtc 5461 ttctcccaac acaggagggg ggctgtttgg ccaaagcaac gctcctgctt ttgggcagag 5521 tcctggcttt ggacagggag gctctgtctt tggtggtacc tcagctgcca ccacaacagc 5581 agcaacctct gggttcagct tttgccaagc ttcaggtttt gggtctagta atactggttc 5641 tgtgtttggt caagcagcca gtactggtgg aatagtcttt ggccagcaat catcctcttc 5701 cagtggtagc gtgtttgggt ctggaaacac tggaagaggg ggaggtttct tcagtggcct 5761 tggaggaaaa cccagtcagg atgcagccaa caaaaaccca ttcagctcgg ccagtggggg 5821 ctttggatcc acagctacct caaatacctc taacctattt ggaaacagtg gggccaagac 5881 atttggtgga tttgccagct cgtcgtttgg agagcagaaa cccactggca ctttcagctc 5941 tggaggagga agtgtggcat cccaaggctt tgggttttcc tctccaaaca aaacaggtgg 6001 cttcggtgct gctccagtgt ttggcagccc tcctactttt gggggatccc ctgggtttgg 6061 aggggtgcca gcattcggtt cagccccagc ctttacaagc cctctgggct cgacgggagg 6121 caaagtgttc ggagagggca ctgcagctgc cagcgcagga ggattcgggt ttgggagcag 6181 cagcaacacc acatccttcg gcacgctcgc gagtcagaat gcccccactt tcggatcact 6241 gtcccaacag acttctggtt ttgggaccca gagtagcgga ttctctggtt ttggatcagg 6301 cacaggaggg ttcagctttg ggtcaaataa ctcgtctgtc cagggttttg gtggctggcg 6361 aagctgaggg cgtgtcagca ggcctttcga tccctgggac caaccgcatc ctcagcttct 6421 tccccgagaa atgctggagc aggctgttca gaccgacgtt gccatcaaaa cacatacacc 6481 cagaaagaaa caacagaaac caaaactcac aaggcgcatg attacttgtt ttatatttca 6541 tgttgggttt tccctcccac tattaaacag tctgtttccg tacaaaaaaa aaaaaaa // LOCUS HSCANPR 3007 bp RNA PRI 15-NOV-1996 DEFINITION Human mRNA for calcium activated neutral protease large subunit (muCANP, calpain, EC 3.4.22.17). ACCESSION X04366 NID g29663 KEYWORDS calcium dependant protein; calpain; neutral protease; protease. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3007) AUTHORS Aoki,K., Imajoh,S., Ohno,S., Emori,Y., Koike,M., Kosaki,G. and Suzuki,K. TITLE Complete amino acid sequence of the large subunit of the low-Ca2+-requiring form of human Ca2+-activated neutral protease (muCANP) deduced from its cDNA sequence JOURNAL FEBS Lett. 205 (2), 313-317 (1986) MEDLINE 86301172 REFERENCE 2 (bases 1 to 3007) AUTHORS Zhang,W., Lane,R.D. and Mellgren,R.L. TITLE The major calpain isozymes are long-lived proteins. Design of an antisense strategy for calpain depletion in cultured cells JOURNAL J. Biol. Chem. 271 (31), 18825-18830 (1996) MEDLINE 96324965 COMMENT In domain IV (Ca(2+)-binding domain) four consecutive EF-hand sequences are found. Data kindly reviewed (10-DEC-1986) by K. Suzuki. FEATURES Location/Qualifiers source 1..3007 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 144..2288 /note="CANP, large subunit (aa 1-714)" /codon_start=1 /db_xref="PID:g29664" /db_xref="SWISS-PROT:P07384" /translation="MSEEIITPVYCTGVSAQVQKQRARELGLGRHENAIKYLGQDYEQ LRVRCLQSGTLFRDEAFPPVPQSLGYKDLGPNSSKTYGIKWKRPTELLSNPQFIVDGA TRTDICQGALGDCWLLAAIASLTLNDTLLHRVVPHGQSFQNGYAGIFHFQLWQFGEWV DVVVDDLLPIKDGKLVFVHSAEGNEFWSALLEKAYAKVNGSYEALSGGSTSEGFEDFT GGVTEWYELRKAPSDLYQIILKALERGSLLGCSIDISSVLDMEAITFKKLVKGHAYSV TGAKQVNYRGQVVSLIRMRNPWGEVEWTGAWSDSSSEWNNVDPYERDQLRVKMEDGEF WMSFRDFMREFTRLEICNLTPDALKSRTIRKWNTTLYEGTWRRGSTAGGCRNYPATFW VNPQFKIRLDETDDPDDYGDRESGCSFVLALMQKHRRRERRFGRDMETIGFAVYEVPP ELVGQPAVHLKRDFFLANASRARSEQFINLREVSTRFRLPPGEYVVVPSTFEPNKEGD FVLRFFSEKSAGTVELDDQIQANLPDEQVLSEEEIDENFKALFRQLAGEDMEISVKEL RTILNRIISKHKDLRTKGFSLESCRSMVNLMDRDGNGKLGLVEFNILWNRIRNYLSIF RKFDLDKSGSMSAYEMRMAIESAGFKLNKKLYELIITRYSEPDLAVDFDNFVCCLVRL ETMFRFFKTLDTDLDGVVTFDLFKWLQLTMFA" misc_feature 404..405 /note="boundary of domains I/II" misc_feature 1124..1125 /note="boundary of domains II/III" misc_feature 1850..1851 /note="boundary of domains III/IV" misc_feature 2982..2988 /note="pot. polyA signal" polyA_site 3007 /note="polyA site" BASE COUNT 592 a 917 c 918 g 580 t ORIGIN 1 aaggagagag ggagggcgga gggcggaggg gcggcgggag gagggcgggg aggagcgctc 61 ttcctggttg ggccctgccc tgagctgcca ccgggaagcc agcctcaggg actgcagcga 121 cccccaaaca cccctccccc aggatgtcgg aggagatcat cacgccggtg tactgcactg 181 gggtgtcagc ccaagtgcag aagcagcggg ccagggagct gggcctgggc cgccatgaga 241 atgccatcaa gtacctgggc caggattatg agcagctgcg ggtgcgatgc ctgcagagtg 301 ggaccctctt ccgtgatgag gccttccccc cggtacccca gagcctgggt tacaaggacc 361 tgggtcccaa ttcctccaag acctatggca tcaagtggaa gcgtcccacg gaactgctgt 421 caaaccccca gttcattgtg gatggagcta cccgcacaga catctgccag ggagcactgg 481 gggactgctg gctcttggcg gccattgcct ccctcactct caacgacacc ctcctgcacc 541 gagtggttcc gcacggccag agcttccaga atggctatgc cggcatcttc catttccagc 601 tgtggcaatt tggggagtgg gtggacgtgg tcgtggatga cctgctgccc atcaaggacg 661 ggaagctagt gttcgtgcac tctgccgaag gcaacgagtt ctggagcgcc ctgcttgaga 721 aggcctatgc caaggtaaat ggcagctacg aggccctgtc agggggcagc acctcagagg 781 gctttgagga cttcacaggc ggggttaccg agtggtacga gttgcgcaag gctcccagtg 841 acctctacca gatcatcctc aaggcgctgg agcggggctc cctgctgggc tgctccatag 901 acatctccag cgttctagac atggaggcca tcactttcaa gaagttggtg aagggccatg 961 cctactctgt gaccggggcc aagcaggtga actaccgagg ccaggtggtg agcctgatcc 1021 ggatgcggaa cccctggggc gaggtggagt ggacgggagc ctggagcgac agctcctcag 1081 agtggaacaa cgtggaccca tatgaacggg accagctccg ggtcaagatg gaggacgggg 1141 agttctggat gtcattccga gacttcatgc gggagttcac ccgcctggag atctgcaacc 1201 tcacacccga cgccctcaag agccggacca tccgcaaatg gaacaccaca ctctacgaag 1261 gcacctggcg gcgggggagc accgcggggg gctgccgaaa ctacccagcc accttctggg 1321 tgaaccctca gttcaagatc cggctggatg agacggatga cccggacgac tacggggacc 1381 gcgagtcagg ctgcagcttc gtgctcgccc ttatgcagaa gcaccgtcgc cgcgagcgcc 1441 gcttcggccg cgacatggag actattggct tcgcggtcta cgaggtccct ccggagctgg 1501 tgggccagcc ggccgtacac ttgaagcgtg acttcttcct ggccaatgcg tctcgggcgc 1561 gctcagagca gttcatcaac ctgcgagagg tcagcacccg cttccgcctg ccacccgggg 1621 agtatgtggt ggtgccctcc accttcgagc ccaacaagga gggcgacttc gtgctgcgct 1681 tcttctcaga gaagagtgct gggactgtgg agctggatga ccagatccag gccaatctcc 1741 ccgatgagca agtgctctca gaagaggaga ttgacgagaa cttcaaggcc ctcttcaggc 1801 agctggcagg ggaggacatg gagatcagcg tgaaggagtt gcggacaatc ctcaatagga 1861 tcatcagcaa acacaaagac ctgcggacca agggcttcag cctagagtcg tgccgcagca 1921 tggtgaacct catggatcgt gatggcaatg ggaagctggg cctggtggag ttcaacatcc 1981 tgtggaaccg catccggaat tacctgtcca tcttccggaa gtttgacctg gacaagtcgg 2041 gcagcatgag tgcctacgag atgcggatgg ccattgagtc ggcaggcttc aagctcaaca 2101 agaagctgta cgagctcatc atcacccgct actcggagcc cgacctggcg gtcgactttg 2161 acaatttcgt ttgctgcctg gtgcggctag agaccatgtt ccgatttttc aaaactctgg 2221 acacagatct ggatggagtt gtgacctttg acttgtttaa gtggttgcag ctgaccatgt 2281 ttgcatgagg cagggactcg gtcccccttg ccgtgctccc ctccctcctc gtctgccaag 2341 cctcgcctcc taccacacca caccaggcca ccccagctgc aagtgccttc cttggagcag 2401 agaggcagcc tcgtcctcct gtcccctctc ctcccagcca ccatcgttca tctgctccgg 2461 gcagaactgt gtggcccctg cctgtgccag ccatgggctc gggatggact ccctgggccc 2521 cacccattgc caagccagga aggcagcttt cgcttgttcc tgcctcggga cagccccggg 2581 tttccccagc atcctgatgt gtcccctctc cccacttcag aggccaccca ctcagcacca 2641 ccggcctggc cttgcctgca gactataaac tataaccact agctcgacac agtctgcagt 2701 ccaggcgtgt ggagccgcct cccggctcgg ggaggccccg gggctgggaa cgcctgtgcc 2761 ttcctgcgcc gaagccaacg ccccctctgt ccttccctgg ccctgctgcc gaccaggagc 2821 tgcccagcct gtgggcggtc ggccttccct ccttcgctcc ttttttatat tagtgatttt 2881 aaaggggact cttcagggac ttgtgtactg gttatggggg tgccagaggc actaggcttg 2941 gggtggggag gtcccgtgtt ccatatagag gaaccccaaa taataaaagg ccccacatct 3001 gtctgtg // LOCUS HSCAP35MR 1281 bp RNA PRI 11-APR-1996 DEFINITION H.sapiens mRNA for p35, cyclin-like CAK1-associated protein. ACCESSION X92669 NID g1109756 KEYWORDS CAP35 gene; cyclin-like protein; p35. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1281) AUTHORS Yee,A., Nichols,M.A., Wu,L., Hall,F.L., Kobayashi,R. and Xiong,Y. TITLE Molecular cloning of CDK7-associated human MAT1, a cyclin-dependent kinase-activating kinase (CAK) assembly factor JOURNAL Cancer Res. 55 (24), 6058-6062 (1995) MEDLINE 96105021 REFERENCE 2 (bases 1 to 1281) AUTHORS Hall,F.L. TITLE Direct Submission JOURNAL Submitted (28-OCT-1995) F.L. Hall, Childrens Hospital Los Angeles, Division of Orthopaedic Surgery, 4650 Sunset Boulevard, Los Angeles, CA, 90027, USA FEATURES Location/Qualifiers source 1..1281 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="tumor" /cell_type="cervical cancer" /cell_line="HeLa" gene 35..964 /gene="CAP35" CDS 35..964 /gene="CAP35" /note="cyclin-like CAK1-associated protein; cyclin box domain" /codon_start=1 /product="p35" /db_xref="PID:e213391" /db_xref="PID:g1109757" /translation="MDDQGCPRCKTTKYRNPSLKLMVNVCGHTLCESCVDLLFVRGAG NCPECGTPLRKSNFRVQLFEDPTVDKEVEIRKKVLKIYNKREEDFPSLREYNDFLEEV EEIVFNLTNNVDLDNTKKKMEIYQKENKDVIQKNKLKLTREQEELEEALEVERQENEQ RRLFIQKEEQLQQILKRKNKQAFLDELESSDLPVALLLAQHKDRSTQLEMQLEKPKPV KPVTFSTGIKMGQHISLAPIHKLEEALYEYQPLQIETYGPHVPELEMLGRLGYLNHVR AASPQDLAGGYTSSLACHRALQDAFSGLFWQPS" BASE COUNT 442 a 226 c 286 g 327 t ORIGIN 1 cgcgcttccg agagtctgta ggagggaaac cgccatggac gatcagggtt gccctcggtg 61 taagaccacc aaatatcgga acccctcctt gaagctgatg gtgaatgtgt gcggacacac 121 tctctgtgaa agttgtgtag atttactgtt tgtgagagga gctggaaact gccctgagtg 181 tggtactcca ctcagaaaga gcaacttcag ggtacaactc tttgaagatc ccactgttga 241 caaggaggtt gagatcagga aaaaagtgct aaagatatac aataaaaggg aagaagattt 301 tcctagtcta agagaataca atgatttctt ggaagaagtg gaagaaattg ttttcaactt 361 gaccaacaat gtggatttgg acaacaccaa aaagaaaatg gagatatacc aaaaggaaaa 421 caaagatgtt attcagaaaa ataaattaaa gctgactcga gaacaggaag aactggaaga 481 agctttagaa gtggaacgac aggaaaatga acaaagaaga ttatttatac aaaaagaaga 541 acaactgcag cagattctaa aaaggaagaa taagcaggct tttttagatg agctggagag 601 ttctgatctc cctgttgctc tgcttttggc tcagcataaa gatagatcta cccaattaga 661 aatgcaactt gagaaaccca aacctgtaaa accagtgacg ttttccacag gcatcaaaat 721 gggtcaacat atttcactgg cacctattca caagcttgaa gaagctctgt atgaatacca 781 gccactgcag atagagacat atggaccaca tgttcctgag cttgagatgc taggaagact 841 tgggtattta aaccatgtca gagctgcctc accacaggac cttgctggag gctatacttc 901 ttctcttgct tgtcacagag cactacagga tgcattcagt gggcttttct ggcagcccag 961 ttaaccattt ataagatttg gaccttggag ctgaaccagg gagctagcaa aagtaaagca 1021 gacttataaa attatagcta tgtgcagctg cacaacacag tccttccact agcagctgtg 1081 ttaaagtatt tataaggaga aaatttcaga actaagttga gtaatatagg ggatatatat 1141 ttgtgaaaaa taatttttac ttatattttt cagaggattt gacacgataa gcctcatctg 1201 atggaagaga ggaataaata attcacctat atgtgtttga ggttgtgaca gacttataaa 1261 atcttttaaa aaataaagct g // LOCUS HSCARBE 2443 bp RNA PRI 23-MAR-1995 DEFINITION Human mRNA for carboxypeptidase E (EC 3.4.17.10). ACCESSION X51405 NID g29666 KEYWORDS carboxypeptidase E; carboxypeptidase H; enkephalin convertase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2443) AUTHORS Hall,C. TITLE Direct Submission JOURNAL Submitted (19-JAN-1990) Hall C., Institute of Neurology, Department of Neurochemistry, 1 Wakefield Street, London WC1 N1PJ, UK REFERENCE 2 (bases 1 to 2443) AUTHORS Manser,E., Fernandez,D., Loo,L., Goh,P.Y., Monfries,C., Hall,C. and Lim,L. TITLE Human carboxypeptidase E. Isolation and characterization of the cDNA, sequence conservation, expression and processing in vitro JOURNAL Biochem. J. 267 (2), 517-525 (1990) MEDLINE 90241164 COMMENT For Rat cDNA for carboxypetidase E see . FEATURES Location/Qualifiers source 1..2443 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /clone_lib="human retina cDNA" /clone="P1622" CDS 291..1721 /note="pre-pro polypeptide (AA -25 to 451)" /codon_start=1 /db_xref="PID:g29667" /db_xref="SWISS-PROT:P16870" /translation="MAGRGGSALLALCGALAACGWLLGAEAQEPGAPAAGMRRRRRLQ QEDGISFEYHRYPELREALVSVWLQCTAISRIYTVGRSFEGRELLVIELSDNPGVHEP GEPEFKYIGNMHGNEAVGRELLIFLAQYLCNEYQKGNETIVNLIHSTRIHIMPSLNPD GFEKAASQPGELKDWFVGRSNAQGIDLNRNFPDLDRIVYVNEKEGGPNNHLLKNMKKI VDQNTKLAPETKAVIHWIMDIPFVLSANLHGGDLVANYPYDETRSGSAHEYSSSPDDA IFQSLARAYSSFNPAMSDPNRPPCRKNDDDSSFVDGTTNGGAWYSVPGGMQDFNYLSS NCFEITVELSCEKFPPEETLKTYWEDNKNSLISYLEQIHRGVKGFVRDLQGNPIANAT ISVEGIDHDVTSAKDGDYWRLLIPGNYKLTASAPGYLAITKKVAVPYSPAAGVDFELE SFSERKEEEKEELMEWWKMMSETLNF" sig_peptide 291..365 /note="signal peptide (AA -25 to -1)" misc_feature 366..1718 /note="pro-peptide (AA 1-451)" mat_peptide 414..1718 /note="mature carboxypeptidase E (AA 17-451)" BASE COUNT 658 a 538 c 618 g 629 t ORIGIN 1 aaatggcgtg cccgtctctc cgccggcccc ctgcctcgca gtggtttctc ctgcagctcc 61 cctgggctcc gcggccagta gtgcagcccg tggagccgcg gctttgcccg tctcctctgg 121 gtggccccag tgcgcgggct gacactcatt cagccgggga aggtgaggcg agtagaggct 181 ggtgcggaac ttgccgcccc cagcagcgcc ggcgggctaa gcccagggcc gggcagacaa 241 aagaggccgc ccgcgtagga aggcacggcc ggcggcggcg gagcgcagcg atggccgggc 301 gagggggcag cgcgctgctg gctctgtgcg gggcactggc tgcctgcggg tggctcctgg 361 gcgccgaagc ccaggagccc ggggcgcccg cggcgggcat gaggcggcgc cggcggctgc 421 agcaagagga cggcatctcc ttcgagtacc accgctaccc cgagctgcgc gaggcgctcg 481 tgtccgtgtg gctgcagtgc accgccatca gcaggattta cacggtgggg cgcagcttcg 541 agggccggga gctcctggtc atcgagctgt ccgacaaccc tggcgtccat gagcctggtg 601 agcctgaatt taaatacatt gggaatatgc atgggaatga ggctgttgga cgagaactgc 661 tcattttctt ggcccagtac ctatgcaacg aataccagaa ggggaacgag acaattgtca 721 acctgatcca cagtacccgc attcacatca tgccttccct gaacccagat ggctttgaga 781 aggcagcgtc tcagcctggt gaactcaagg actggtttgt gggtcgaagc aatgcccagg 841 gaatagatct gaaccggaac tttccagacc tggataggat agtgtacgtg aatgagaaag 901 aaggtggtcc aaataatcat ctgttgaaaa atatgaagaa aattgtggat caaaacacaa 961 agcttgctcc tgagaccaag gctgtcattc attggattat ggatattcct tttgtgcttt 1021 ctgccaatct ccatggagga gaccttgtgg ccaattatcc atatgatgag acgcggagtg 1081 gtagtgctca cgaatacagc tcctccccag atgacgccat tttccaaagc ttggcccggg 1141 catactcttc tttcaacccg gccatgtctg accccaatcg gccaccatgt cgcaagaatg 1201 atgatgacag cagctttgta gatggaacca ccaacggtgg tgcttggtac agcgtacctg 1261 gagggatgca agacttcaat taccttagca gcaactgttt tgagatcacc gtggagctta 1321 gctgtgagaa gttcccacct gaagagactc tgaagaccta ctgggaggat aacaaaaact 1381 ccctcattag ctaccttgag cagatacacc gaggagttaa aggatttgtc cgagaccttc 1441 aaggtaaccc aattgcgaat gccaccatct ccgtggaagg aatagaccac gatgttacat 1501 ccgcaaagga tggtgattac tggagattgc ttatacctgg aaactataaa cttacagcct 1561 cagctccagg ctatctggca ataacaaaga aagtggcagt tccttacagc cctgctgctg 1621 gggttgattt tgaactggag tcattttctg aaaggaaaga agaggagaag gaagaattga 1681 tggaatggtg gaaaatgatg tcagaaactt taaattttta aaaaggcttc tagttagctg 1741 ctttaaatct atctatataa tgtagtatga tgtaatgtgg tctttttttt agattttgtg 1801 cagttaatac ttaacattga tttatttttt aatcatttaa atattaatca actttcctta 1861 aaataaatag cctcttaggt aaaaatataa gaacttgata tatttcattc tcttatatag 1921 tattcatttt cctacctata ttacacaaaa aagtatagaa aagatttaag taattttgcc 1981 atcctaggct taaatgcaat attcctggta ttatttacaa tgcagaattt tttgagtaat 2041 tctagctttc aaaaattagt gaagttcttt tactgtaatt ggtgacaatg tcacataatg 2101 aatgctattg aaaaggttaa cagatacagc tcggagttgt gagcactcta ctgcaagact 2161 taaatagttc agtataaatt gtcgtttttt tcttgtgctg actaactata agcatgatct 2221 tgttaatgca tttttgatgg gaagaaaagg tacatgttta caaagaggtt ttatgaaaag 2281 aataaaaatt gacttcttgc ttgtacatat aggagcaata ctattatatt atgtagtccg 2341 ttaacactac ttaaaagttt agggttttct cttggttgta gagtggccca gaattgcatt 2401 ctgaatgaat aaaggttaaa aaaaaatccc cagtgaaaaa aaa // LOCUS HSCARNCAR 1245 bp RNA PRI 08-JAN-1998 DEFINITION H.sapiens mRNA for carnitine carrier. ACCESSION Y10319 NID g2765074 KEYWORDS carnitine carrier protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1245) AUTHORS Iacobazzi,V. and Palmieri,F. TITLE Cloning, sequencing and expression of human liver mitochondrial carnitine/acylcarnitine carrier JOURNAL Unpublished REFERENCE 2 (bases 1 to 1245) AUTHORS Iacobazzi,V. TITLE Direct Submission JOURNAL Submitted (03-JAN-1997) V. Iacobazzi, University of Bari, Dept. Pharmaco-Biology, via Orabona, 70125 Bari, ITALY REMARK revised by submitter 12-MAY-1997 FEATURES Location/Qualifiers source 1..1245 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 63..968 /codon_start=1 /product="carnitine carrier" /db_xref="PID:e309219" /db_xref="PID:g2765075" /translation="MADQPKPISPLKNLLAGGFGGVCLVFVGHPLDTVKVRLQTQPPS LPGQPPMYSGTFDCFRKTLFREGITGLYRGMAAPIIGVTPMFAVCFFGFGLGKKLQQK HPEDVLSYPQLFAAGMLSGVFTTGIMTPGERIKCLLQIQASSGESKYTGTLDCAKKLY QEFGIRGIYKGTVLTLMRDVPASGMYFMTYEWLKNIFTPEGKRVSELSAPRILVAGGI AGIFNWAVAIPPDVLKSRFQTAPPGKYPNGFRDVLRELIRDEGVTSLYKGFNAVMIRA FPANAACFLGFEVAMKFLNWATPNL" BASE COUNT 295 a 293 c 352 g 305 t ORIGIN 1 gggctcgagc ggccgcccgg gcaggtcgag aactgacaga cggagtgaca gacggactga 61 ccatggccga ccagccaaaa cccatcagcc cgctcaagaa cctgctggcc ggcggctttg 121 gcggcgtgtg cctggtgttc gtcggtcacc ctctggacac ggtcaaggtc cgactgcaga 181 cacagccacc gagtttgcct ggacaacctc ccatgtactc tgggaccttt gactgtttcc 241 ggaagactct ttttagagag ggcatcacgg ggctatatcg gggaatggct gcccctatca 301 tcggggtcac tcccatgttt gccgtgtgct tctttgggtt tggtttgggg aagaaactac 361 aacagaaaca cccagaagat gtgctcagct atccccagct ttttgcagct gggatgttat 421 ctggcgtatt caccacagga atcatgactc ctggagaacg gatcaagtgc ttattacaga 481 ttcaggcttc ttcaggagaa agcaagtaca ctggtacctt ggactgtgca aagaagctgt 541 accaggagtt tgggatccga ggcatctaca aagggactgt gcttaccctt atgcgagatg 601 tcccagctag tggaatgtat ttcatgacat atgaatggct gaaaaatatc ttcactccgg 661 agggaaagag ggtcagtgag ctcagtgccc ctcggatctt ggtggctggg ggcattgcag 721 ggatcttcaa ctgggctgtg gcaatccccc cagatgtgct caagtctcga ttccagactg 781 cacctcctgg gaaatatcct aatggtttca gagatgtgct gagggagctg atccgggatg 841 aaggagtcac atccttgtac aaagggttca atgcagtgat gatccgagcc ttcccagcca 901 atgcggcctg tttccttggc tttgaagttg ccatgaagtt ccttaattgg gccaccccca 961 acttgtgagg ctgaaggctg ctcaagttca cttctggatg ctggaagctg tcgttgagga 1021 gaaggagtag taagcagaac taagcagtct tggagggcaa ggggagggga atggtgagat 1081 ccgagccctg tgcatggact tggtgagact gttgccttaa tgacatcctg caccgtgtat 1141 aacttagtgt gtcattttga aacttgaatt cattcttatc aatttaaggg atcttaaaag 1201 gatttggaaa tggaacaagt agcttccaga ccagatacta cctgt // LOCUS HSCATDC 1988 bp RNA PRI 21-MAR-1995 DEFINITION Human mRNA for cathepsin D from oestrogen responsive breast cancer cells. ACCESSION X05344 NID g29677 KEYWORDS cathepsin D. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1988) AUTHORS Westley,B.R. and May,F.E. TITLE Oestrogen regulates cathepsin D mRNA levels in oestrogen responsive human breast cancer cells JOURNAL Nucleic Acids Res. 15 (9), 3773-3786 (1987) MEDLINE 87231068 COMMENT Data kindly reviewed (8.1.88 ) by Westley. FEATURES Location/Qualifiers source 1..1988 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="ZR-75" precursor_RNA <1..1988 /note="primary transcript" sig_peptide 3..62 /note="put. signal peptide (AA -20 to -1)" CDS 3..1241 /note="precursor polypeptide (AA -20 to 392)" /codon_start=1 /db_xref="PID:g29678" /db_xref="SWISS-PROT:P07339" /translation="MQPSSLLPLALCLLAAPASALVRIPLHKFTSIRRTMSEVGGSVE DLIAKGPVSKYSQAVPAVTEGPIPEVLKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSN LWVPSIHCKLLDIACWIHHKYNSDKSSTYVKNGTSFDIHYGSGSLSGYLSQDTVSVPC QSASSASALGGVKVERQVFGEATKQPGITFIAAKFDGILGMAYPRISVNNVLPVFDNL MQQKLVDQNIFSFYLSRDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHLDQ VEVASGLTLCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVPLIQGEYMIPCEKVST LPAITLKLGGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIPPPSGPLWILGDVFIGR YYTVFDRDNNRVGFAEAARL" misc_feature 63..194 /note="propeptide (AA 1-44)" mat_peptide 195..1238 /note="mature cathepsin D (AA 45-392)" misc_feature 402..410 /note="pot.N-glycosylation site" misc_feature 789..797 /note="pot.N-glycosylation site" variation 1308 /note="pot. polymorphism (a->g)" misc_feature 1959..1964 /note="pot.polyA signal" variation 1979 /note="pot. polymorphism (g->a)" polyA_site 1988 /note="polyA site" BASE COUNT 350 a 666 c 590 g 382 t ORIGIN 1 ccatgcagcc ctccagcctt ctgccgctcg ccctctgcct gctggctgca cccgcctccg 61 cgctcgtcag gatcccgctg cacaagttca cgtccatccg ccggaccatg tcggaggttg 121 ggggctctgt ggaggacctg attgccaaag gccccgtctc aaagtactcc caggcggtgc 181 cagccgtgac cgaggggccc attcccgagg tgctcaagaa ctacatggac gcccagtact 241 acggggagat tggcatcggg acgccccccc agtgcttcac agtcgtcttc gacacgggct 301 cctccaacct gtgggtcccc tccatccact gcaaactgct ggacatcgct tgctggatcc 361 accacaagta caacagcgac aagtccagca cctacgttaa gaatggtacc tcgtttgaca 421 tccactatgg ctcgggcagc ctctccgggt acctgagcca ggacactgtg tcggtgccct 481 gccagtcagc gtcgtcagcc tctgccctgg gcggtgtcaa agtggagagg caggtctttg 541 gggaggccac caagcagcca ggcatcacct tcatcgcagc caagttcgat ggcatcctgg 601 gcatggccta cccccgcatc tccgtcaaca acgtgctgcc cgtcttcgac aacctgatgc 661 agcagaagct ggtggaccag aacatcttct ccttctacct gagcagggac ccagatgcgc 721 agcctggggg tgagctgatg ctgggtggca cagactccaa gtattacaag ggttctctgt 781 cctacctgaa tgtcacccgc aaggcctact ggcaggtcca cctggaccag gtggaggtgg 841 ccagcgggct gaccctgtgc aaggagggct gtgaggccat tgtggacaca ggcacttccc 901 tcatggtggg cccggtggat gaggtgcgcg agctgcagaa ggccatcggg gccgtgccgc 961 tgattcaggg cgagtacatg atcccctgtg agaaggtgtc caccctgccc gcgatcacac 1021 tgaagctggg aggcaaaggc tacaagctgt ccccagagga ctacacgctc aaggtgtcgc 1081 aggccgggaa gaccctctgc ctgagcggct tcatgggcat ggacatcccg ccacccagcg 1141 ggccactctg gatcctgggc gacgtcttca tcggccgcta ctacactgtg tttgaccgtg 1201 acaacaacag ggtgggcttc gccgaggctg cccgcctcta gttcccaagg cgtccgcgcg 1261 ccagcacaga aacagaggag agtcccagag caggaggccc ctggcccagc ggcccctccc 1321 acacacaccc acacactcgc ccgcccactg tcctgggcgc cctggaagcc ggcggcccaa 1381 gcccgacttg ctgttttgtt ctgtggtttt cccctccctg ggttcagaaa tgctgcctgc 1441 ctgtctgtct ctccatctgt ttggtggggg tagagctgat ccagagcaca gatctgtttc 1501 gtgcattgga agaccccacc caagcttggc agccgagctc gtgtatcctg gggctccctt 1561 catctccagg gagtcccctc cccggcccta ccagcgcccg ctggctgagc ccctacccca 1621 caccaggccg tcctcccggg ccctcccttg gaaacctgcc ctgcctgagg gcccctctgc 1681 ccagcttggg cccagctggg ctctgccacc ctacctgttc agtgtcccgg gcccgttgag 1741 gatgaggccg ctagaggcct gaggatgagc tggaaggagt gagaggggac aaaacccacc 1801 ttgttggagc ctgcagggtg gtgctgggac tgagccagtc ccaggggcat gtattggcct 1861 ggaggtgggg ttgggattgg gggctggtgc cagccttcct ctgcagctga cctctgttgt 1921 cctccccttg ggcggctgag agccccagct gacatggaaa tacagttgtt ggcctccggc 1981 ctcccctc // LOCUS HSCATHH 1399 bp RNA PRI 22-MAR-1995 DEFINITION Human mRNA for cathepsin H (EC 3.4.22.16). ACCESSION X16832 NID g29709 KEYWORDS cathepsin; cathepsin H; glycoprotein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1399) AUTHORS Fuchs,R. TITLE Direct Submission JOURNAL Submitted (11-OCT-1989) Fuchs R., EMBL, Meyerhofstr. 1, 6900 Heidelberg, FRG REFERENCE 2 (bases 1 to 1399) AUTHORS Fuchs,R. and Gassen,H.G. TITLE Nucleotide sequence of human preprocathepsin H, a lysosomal cysteine proteinase JOURNAL Nucleic Acids Res. 17 (22), 9471 (1989) MEDLINE 90067944 COMMENT For overlapping sequences see . FEATURES Location/Qualifiers source 1..1399 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver (nt 294-1399)" /cell_type="monocyte (nt 1-917)" /cell_line="monocytes U937 (nt 1-917)" /clone_lib="lambda gt10 (nt 1-917), pUC9 (nt 294-1399)" /clone="1CH21 (nt 1-917),pRF15 (nt 294-1399)" sig_peptide 35..100 /note="signal peptide (AA -22 to -1)" CDS 35..1042 /note="preprocathepsin H (AA -22 to 314)" /codon_start=1 /db_xref="PID:g29710" /db_xref="SWISS-PROT:P09668" /translation="MWATLPLLCAGAWLLGVPVCGAAELSVNSLEKFHFKSWMSKHRK TYSTEEYHHRLQTFASNWRKINAHNNGNHTFKMALNQFSDMSFAEIKHKYLWSEPQNC SATKSNYLRGTGPYPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGK MLSLAEQQLVDCAQDFNNYGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKDGYCKFQP GKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFMMYRTGIYSSTSCHKTP DKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYPIPL V" misc_feature 101..379 /note="propeptide (AA 1-93)" mat_peptide 380..1039 /note="mature cathepsin H (AA 94-314)" misc_feature 1375..1380 /note="polyA signal" BASE COUNT 349 a 383 c 367 g 300 t ORIGIN 1 ttgccggcgc aagagccaag ccgccagcgc tgctatgtgg gccacgctgc cgctgctctg 61 cgccggggcc tggctcctgg gagtccccgt ctgcggtgcc gccgaactgt ccgtgaactc 121 cttagagaag tttcacttca agtcatggat gtctaagcac cgtaagacct acagtacgga 181 ggagtaccac cacaggctgc agacgtttgc cagcaactgg aggaagataa acgcccacaa 241 caatgggaac cacacattta aaatggcact gaaccaattt tcagacatga gctttgctga 301 aataaaacac aagtatctct ggtcagagcc tcagaattgc tcagccacca aaagtaacta 361 ccttcgaggt actggtccct acccaccttc cgtggactgg cggaaaaaag gaaattttgt 421 ctcacctgtg aaaaatcagg gtgcctgcgg cagttgctgg actttctcca ccactggggc 481 cctggagtct gcaatcgcca tcgcaaccgg aaagatgctg tccttggcgg aacagcagct 541 ggtggactgc gcccaggact tcaataatta cggctgccaa gggggtctcc ccagccaggc 601 tttcgagtat atcctgtaca acaaggggat catgggtgaa gacacctacc cctaccaggg 661 caaggatggt tattgcaagt tccaacctgg aaaggccatc ggctttgtca aggatgtagc 721 caacatcaca atctatgacg aggaagcgat ggtggaggct gtggccctct acaaccctgt 781 gagctttgcc tttgaggtga ctcaggactt catgatgtat agaacgggca tctactccag 841 tacttcctgc cataaaactc cagataaagt aaaccatgca gtactggctg ttgggtatgg 901 agaaaaaaat gggatccctt actggatcgt gaaaaactct tggggtcccc agtggggaat 961 gaacgggtac ttcctcatcg agcgcggaaa gaacatgtgt ggcctggctg cctgcgcctc 1021 ctaccccatc cctctggtgt gagccgtggc agccgcagcg cagactggcg gagaaggaga 1081 ggaacgggca gcctgggcct gggtggaaat cctgccctgg aggaagttgt ggggagatcc 1141 actgggaccc ccaacattct gccctcacct ctgtgcccag cctggaaacc tacagacaag 1201 gaggagttcc accatgagct cacccgtgtc tatgacgcaa agatcaccag ccatgtgcct 1261 tagtgtcctt cttaacagac tcaaaccaca tggaccacga atattctttc tgtccagaag 1321 ggctactttc cacatataga gctccaggga ctgtcttttc tgtattcgct gttcaataaa 1381 cattgagtga gcacctcca // LOCUS HSCATHO 1647 bp RNA PRI 14-NOV-1994 DEFINITION H.sapiens mRNA for cathepsin-O. ACCESSION X77383 NID g574803 KEYWORDS cathepsin O. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1647) AUTHORS Velasco,G., Ferrando,A.A., Puente,X.S., Sanchez,L.M. and Lopez-Otin,C. TITLE Human cathepsin O. Molecular cloning from a breast carcinoma, production of the active enzyme in Escherichia coli, and expression analysis in human tissues JOURNAL J. Biol. Chem. 269 (43), 27136-27142 (1994) MEDLINE 95014586 REFERENCE 2 (bases 1 to 1647) AUTHORS Lopez-Otin,C. TITLE Direct Submission JOURNAL Submitted (28-JAN-1994) C. Lopez-Otin, Universidad de Oviedo, Dept de Biologia Funcional, Area de Bioquimica, Facultad de Medicina, C/Julian Claveria S/N, 33006 Oviedo, SPAIN FEATURES Location/Qualifiers source 1..1647 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="breast carcinoma" /clone="O7-7" /clone_lib="lambda gt11" CDS 50..1015 /codon_start=1 /product="cathepsin O" /db_xref="PID:g574804" /db_xref="SWISS-PROT:P43234" /translation="MDVRALPWLPWLLWLLCRGGGDADSRAPFTPTWPRSREREAAAF RESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPRYSAEVHMS IPNVSLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQV IDCSYNNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHYFSGSHSGFSIKG YSAYDFSDQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHCSSGEANHAVLITGF DKTGSTPYWIVRNSWGSSWGVDGYAHVKMGSNVCGIADSVSSIFV" BASE COUNT 456 a 328 c 403 g 460 t ORIGIN 1 gaattccgga aaacaggccg cgcgggcggc agaggagccg ggcgccgcaa tggacgtgcg 61 ggcgctgccg tggctgccgt ggctgctgtg gctgctgtgc cggggcggcg gcgatgcgga 121 ctcccgcgcc cccttcaccc cgacctggcc gcggagccgc gagcgtgaag ccgccgcctt 181 ccgggaaagt cttaatagac atcgatactt gaattcttta tttcccagtg aaaactccac 241 cgccttctat ggaataaatc agttttccta tttgtttcct gaagagttta aagccattta 301 tttaagaagc aaaccttcca agtttcccag atactcagca gaagtacata tgtccatccc 361 caatgtgtct ttgccgttaa gatttgactg gagggacaag caggttgtga cacaagtgag 421 aaaccagcag atgtgtggag gatgctgggc cttcagcgtg gtgggggcag tggaatctgc 481 ttatgcaata aaggggaagc ccctggaaga cctaagtgtc cagcaggtca ttgactgttc 541 gtataataat tatggctgca atggaggctc tactctcaat gctttgaact ggttaaacaa 601 gatgcaagta aaactggtga aagattcaga atatcctttt aaagcacaaa atggtctgtg 661 ccattacttt tctggttcac attctggatt ttcaatcaaa ggttattctg catatgactt 721 cagtgaccaa gaagatgaaa tggcaaaagc acttcttacc tttggccctt tggtagtcat 781 agtagatgca gtgagctggc aagattatct gggaggcatt atacagcatc actgctctag 841 tggagaagca aatcatgcag ttctcataac tgggtttgat aaaacaggaa gcactccata 901 ttggattgtg cggaattcct ggggaagttc ttggggagta gatggttatg cccatgtcaa 961 aatgggaagt aatgtttgtg gtattgcaga ttccgtttct tctatatttg tgtgacatgt 1021 tgggcagatc aagagacagc tacaaaaatg aaggttttca taatgcaatg taacatagta 1081 cttcaaagta ttattcaact tcaagtttca gcaactacct acaaaagatt ctaaggccta 1141 gtagtattta aactaagttt cagaatgttc ccttcttgta gagagatgga caaccaaagt 1201 cagtgggaca aactccagca cagaagcctg cgaggaagcc tatggaatag tttcctgtcc 1261 tgagacgaaa ttcagattag gagatatttt aggcccctgc aactggggaa ggctactgtt 1321 tgtttttgtt tgcttattat ttatttgttt gtttattgtg agatatttca ggtgggatca 1381 aagaggtcat aagaatttat tttcttttgt ggggtgtaac tactagcttt agattacccc 1441 tatacacaag aatggccaac ctaaaattat gtgtgtcttg tacagttagt tatattagca 1501 gccctctgag atggcgtatc tatcggaagg atttcaaaca ccaattgctt tacctgaaca 1561 aatggtgctt accctttgaa cagcagagtg accacgtaga aggaaggaaa agggcaaaat 1621 cgcttcagtt aaactgaaac cgaattc // LOCUS HSCATR 2264 bp RNA PRI 12-SEP-1993 DEFINITION Human kidney mRNA for catalase. ACCESSION X04076 NID g29720 KEYWORDS catalase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2264) AUTHORS Bell,G.I., Najarian,R.C., Mullenbach,G.T. and Hallewell,R.A. TITLE cDNA sequence coding for human kidney catalase JOURNAL Nucleic Acids Res. 14 (13), 5561-5562 (1986) MEDLINE 86286565 COMMENT See also for fibroblast catalase mRNA. FEATURES Location/Qualifiers source 1..2264 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 71..1654 /note="catalase (aa 1-527)" /codon_start=1 /db_xref="PID:g29721" /db_xref="SWISS-PROT:P04040" /translation="MADSRDPASDQMQHWKEQRAAQKADVLTTGAGNPVGDKLNVITV GPRGPLLVQDVVFTDEMAHFDRERIPERVVHAKGAGAFGYFEVTHDITKYSKAKVFEH IGKKTPIAVRFSTVAGESGSADTVRDPRGFAVKFYTEDGNWDLVGNNTPIFFIRDPIL FPSFIHSQKRNPQTHLKDPDMVWDFWSLRPESLHQVSFLFSDRGIPDGHRHMNGYGSH TFKLVNANGEAVYCKFHYKTDQGIKNLSVEDAARLSQEDPDYGIRDLFNAIATGKYPS WTFYIQVMTFNQAETFPFNPFDLTKVWPHKDYPLIPVGKLVLNRNPVNYFAEVEQIAF DPSNMPPGIEASPDKMLQGRLFAYPDTHRHRLGPNYLHIPVNCPYRARVANYQRDGPM CMQDNQGGAPNYYPNSFGAPEQQPSALEHSIQYSGEVRRFNTANDDNVTQVRAFYVNV LNEEQRKRLCENIAGHLKDAQIFIQKKAVKNFTEVHPDYGSHIQALLDKYNAEKPKNA IHTFVQSGSHLAAREKANL" BASE COUNT 634 a 511 c 509 g 610 t ORIGIN 1 tttgcctgct gagggtggag acccacgagc cgaggcctcc tgcagtgttc tgcacagcaa 61 accgcacgct atggctgaca gccgggatcc cgccagcgac cagatgcagc actggaagga 121 gcagcgggcc gcgcagaaag ctgatgtcct gaccactgga gctggtaacc cagtaggaga 181 caaacttaat gttattacag tagggccccg tgggcccctt cttgttcagg atgtggtttt 241 cactgatgaa atggctcatt ttgaccgaga gagaattcct gagagagttg tgcatgctaa 301 aggagcaggg gcctttggct actttgaggt cacacatgac attaccaaat actccaaggc 361 aaaggtattt gagcatattg gaaagaagac tcccatcgca gttcggttct ccactgttgc 421 tggagaatcg ggttcagctg acacagttcg ggaccctcgt gggtttgcag tgaaatttta 481 cacagaagat ggtaactggg atctcgttgg aaataacacc cccattttct tcatcaggga 541 tcccatattg tttccatctt ttatccacag ccaaaagaga aatcctcaga cacatctgaa 601 ggatccggac atggtctggg acttctggag cctacgtcct gagtctctgc atcaggtttc 661 tttcttgttc agtgatcggg ggattccaga tggacatcgc cacatgaatg gatatggatc 721 acatactttc aagctggtta atgcaaatgg ggaggcagtt tattgcaaat tccattataa 781 gactgaccag ggcatcaaaa acctttctgt tgaagatgcg gcgagacttt cccaggaaga 841 tcctgactat ggcatccggg atctttttaa cgccattgcc acaggaaagt acccctcctg 901 gactttttac atccaggtca tgacatttaa tcaggcagaa acttttccat ttaatccatt 961 cgatctcacc aaggtttggc ctcacaagga ctaccctctc atcccagttg gtaaactggt 1021 cttaaaccgg aatccagtta attactttgc tgaggttgaa cagatagcct tcgacccaag 1081 caacatgcca cctggcattg aggccagtcc tgacaaaatg cttcagggcc gcctttttgc 1141 ctatcctgac actcaccgcc atcgcctggg acccaattat cttcatatac ctgtgaactg 1201 tccctaccgt gctcgagtgg ccaactacca gcgtgatggc ccgatgtgca tgcaggacaa 1261 tcagggtggt gctccaaatt actaccccaa cagctttggt gctccggaac aacagccttc 1321 tgccctggag cacagcatcc aatattctgg agaagtgcgg agattcaaca ctgccaatga 1381 tgataacgtt actcaggtgc gggcattcta tgtgaacgtg ctgaatgagg aacagaggaa 1441 acgtctgtgt gagaacattg ccggccacct gaaggatgca caaattttca tccagaagaa 1501 agcggtcaag aacttcactg aggtccaccc tgactacggg agccacatcc aggctcttct 1561 ggacaagtac aatgctgaga agcctaagaa tgcgattcac acctttgtgc agtccggatc 1621 tcacttggcg gcaagggaga aggcaaatct gtgaggccgg ggccctgcac ctgtgcagcg 1681 aacgttagcg ttcatccgtg taacccgctc atcactggat gaagattctc ctgtgctaga 1741 tgtgcaaatg caagctagtg gcttcaaaat agagaatccc actttctata gcagattgtg 1801 taacaatttt aatgctattt ccccagggga aaatgaaggt taggatttaa cagtcattta 1861 aaaaaaaaat ttgttttgac ggatgattgg attattcatt taaaatgatt agaaggcaag 1921 tttctagctt agaaatatga ttttatttga caaaatttgt tgaaattatg tatgtttaca 1981 tatcacctca tggcctatta tattaaaata tggctataaa tatataaaaa gaaaagataa 2041 agatgatcta ctcagaaatt tttatttttc taaggttctc ataggaaaag tacatttaat 2101 acagcagtgt catcagaaga taacttgagc accgtcatgg cttaatgttt attcctgata 2161 ataattgatc aaattcattt ttttcactgg agttacatta atgttaattc agcactgatt 2221 tcacaacaga tcaatttgta attgcttaca tttttacaat aaat // LOCUS HSCAVEOMR 838 bp RNA PRI 30-JUN-1993 DEFINITION H.sapiens mRNA for caveolin. ACCESSION Z18951 S49856 NID g38515 KEYWORDS caveolin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 838) AUTHORS Glenney,J.R. Jr. TITLE The sequence of human caveolin reveals identity with VIP21, a component of transport vesicles JOURNAL FEBS Lett. 314 (1), 45-48 (1992) MEDLINE 93083646 REFERENCE 2 (bases 1 to 838) AUTHORS Glenney,J.R. TITLE Direct Submission JOURNAL Submitted (21-OCT-1992) John R. Glenney Jr., Department of Surgery, Lucille Markey Cancer, Center, University of Kentucky, 800 Rose Street, Lexington, KY, 40536-0093, USA FEATURES Location/Qualifiers source 1..838 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="Lung" /clone_lib="Clontech/human lung" /clone="cav lambda c" CDS 35..571 /codon_start=1 /product="caveolin" /db_xref="PID:g38516" /db_xref="SWISS-PROT:Q03135" /translation="MSGGKYVDSEGHLYTVPIREQGNIYKPNNKAMADELSEKQVYDA HTKEIDLVNRDPKHLNDDVVKIDFEDVIAEPEGTHSFHGIWKASFTTFTVTKYWFYRL LSALFGIPMALIWGIYFAILSFLHIWAVVPCIKSFLIEIQCTSRVYSIYVHTVCDPLF EAVGKIFSNVRINLQKEI" BASE COUNT 222 a 203 c 166 g 247 t ORIGIN 1 gaattccgga gttttcatcc agccacgggc cagcatgtct gggggcaaat acgtagactc 61 ggagggacat ctctacaccg ttcccatccg ggaacagggc aacatctaca agcccaacaa 121 caaggccatg gcagacgagc tgagcgagaa gcaagtgtac gacgcgcaca ccaaggagat 181 cgacctggtc aaccgcgacc ctaaacacct caacgatgac gtggtcaaga ttgactttga 241 agatgtgatt gcagaaccag aagggacaca cagttttcac ggcatttgga aggccagctt 301 caccaccttc actgtgacga aatactggtt ttaccgcttg ctgtctgccc tctttggcat 361 cccgatggca ctcatctggg gcatttactt cgccattctc tctttcctgc acatctgggc 421 agttgtacca tgcattaaga gcttcctgat tgagattcag tgcaccagcc gtgtctattc 481 catctacgtc cacaccgtct gtgacccact ctttgaagct gttgggaaaa tattcagcaa 541 tgtccgcatc aacttgcaga aagaaatata aatgacattt caaggataga agtatacctg 601 attttttttc cttttaattt tcctggtgcc aatttcaagt tccaagttgc taatacagca 661 acgaatttat gaattgaatt atcttggttg aaaataaaaa gatcactttc tcagttttca 721 taagtattat gtctcttctg agctatttca tctatttttg gcagtctgaa tttttaaaac 781 ccatttatat ttctttcctt acctttttat ttgcatgtgg atcaaccatc gctttatt // LOCUS HSCB2CANR 1790 bp RNA PRI 30-MAR-1995 DEFINITION H.sapiens mRNA for CB2 (peripheral) cannabinoid receptor. ACCESSION X74328 NID g407806 KEYWORDS cannabinoid receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1790) AUTHORS Munro,S. TITLE Direct Submission JOURNAL Submitted (28-JUL-1993) S. Munro, MRC Laboratory of Molecular Biology, Hills Road, Cambridge CB2 2QH, UK REMARK sequence revised by author 13-OCT-93 REFERENCE 2 (bases 1 to 1790) AUTHORS Munro,S., Thomas,K.L. and Abu-Shaar,M. TITLE Molecular characterization of a peripheral receptor for cannabinoids JOURNAL Nature 365 (6441), 61-65 (1993) MEDLINE 93368659 FEATURES Location/Qualifiers source 1..1790 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HL60" mRNA 1..1776 mat_peptide 127..1206 /product="CB2 (peripheral) cannabinoid receptor" CDS 127..1209 /codon_start=1 /product="CB2 (peripheral) cannabinoid receptor" /db_xref="PID:g407807" /db_xref="SWISS-PROT:P34972" /translation="MEECWVTEIANGSKDGLDSNPMKDYMILSGPQKTAVAVLCTLLG LLSALENVAVLYLILSSHQLRRKPSYLFIGSLAGADFLASVVFACSFVNFHVFHGVDS KAVFLLKIGSVTMTFTASVGSLLLTAIDRYLCLRYPPSYKALLTRGRALVTLGIMWVL SALVSYLPLMGWTCCPRPCSELFPLIPNDYLLSWLLFIAFLFSGIIYTYGHVLWKAHQ HVASLSGHQDRQVPGMARMRLDVRLAKTLGLVLAVLLICWFPVLALMAHSLATTLSDQ VKKAFAFCSMLCLINSMVNPVIYALRSGEIRSSAHHCLAHWKKCVRGLGSEAKEEAPR SSVTETEADGKITPWPDSRDLDLSDC" polyA_site 1776 BASE COUNT 399 a 510 c 466 g 415 t ORIGIN 1 caggtcctgg gagaggacag aaaacaactg gactcctcag cccccggcag ctcccagtgc 61 ccagccaccc acaacacaac ccaaagcctt ctagacaagc tcagtggaat ctgaagggcc 121 caccccatgg aggaatgctg ggtgacagag atagccaatg gctccaagga tggcttggat 181 tccaacccta tgaaggatta catgatcctg agtggtcccc agaagacagc tgttgctgtg 241 ttgtgcactc ttctgggcct gctaagtgcc ctggagaacg tggctgtgct ctatctgatc 301 ctgtcctccc accaactccg ccggaagccc tcatacctgt tcattggcag cttggctggg 361 gctgacttcc tggccagtgt ggtctttgca tgcagctttg tgaatttcca tgttttccat 421 ggtgtggatt ccaaggctgt cttcctgctg aagattggca gcgtgactat gaccttcaca 481 gcctctgtgg gtagcctcct gctgaccgcc attgaccgat acctctgcct gcgctatcca 541 ccttcctaca aagctctgct cacccgtgga agggcactgg tgaccctggg catcatgtgg 601 gtcctctcag cactagtctc ctacctgccc ctcatgggat ggacttgctg tcccaggccc 661 tgctctgagc ttttcccact gatccccaat gactacctgc tgagctggct cctgttcatc 721 gccttcctct tttccggaat catctacacc tatgggcatg ttctctggaa ggcccatcag 781 catgtggcca gcttgtctgg ccaccaggac aggcaggtgc caggaatggc ccgaatgagg 841 ctggatgtga ggttggccaa gaccctaggg ctagtgttgg ctgtgctcct catctgttgg 901 ttcccagtgc tggccctcat ggcccacagc ctggccacta cgctcagtga ccaggtcaag 961 aaggcctttg ctttctgctc catgctgtgc ctcatcaact ccatggtcaa ccctgtcatc 1021 tatgctctac ggagtggaga gatccgctcc tctgcccatc actgcctggc tcactggaag 1081 aagtgtgtga ggggccttgg gtcagaggca aaagaagaag ccccgagatc ctcagtcacc 1141 gagacagagg ctgatgggaa aatcactccg tggccagatt ccagagatct agacctctct 1201 gattgctgat gaggcctctt cccaatttaa acaactcaag tcagaaatca gttcactccc 1261 tggaagagag agaggggtct tggcactctc ttcttactta aaccagtccc agacacctag 1321 acacggaccc ctttttgctg atgagtgttg ggactgactc ctggaagaca gcctggcctt 1381 gcccacctgc acacagtctg ttggataggt agggccacga ggagtagcca ggtaggcgag 1441 acacaaaaag gcctgggaca gggtcagtac aagtcaggac aggcttcatg cctgcatcct 1501 ccagagacca ccaggagcca aagcgagcct ccaggcccag caatgaggga cttgggagaa 1561 atctgagaag aatgggttgt tctcttggga agtcagggta tcagatggga tggacatcca 1621 ggtcttctct ctgcctaatt gtcaaggcct ccttggctct ggagctatga aaggccccac 1681 tttcaagtca cccttgccac tgaggaccga ggactatgct atgatgagga ttaaggtgtt 1741 gacttgcctc tttcagagat aaatgacaag ccttcaaaaa aaaaaaaaaa // LOCUS HSCBP20 572 bp RNA PRI 11-SEP-1995 DEFINITION H.sapiens mRNA for CBP20. ACCESSION X84157 NID g984138 KEYWORDS CAP-binding protein complex; cbp20 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 572) AUTHORS Izaurralde,E. TITLE Direct Submission JOURNAL Submitted (25-JAN-1995) E. Izaurralde, EMBL, Meyerhofstrasse 1, 69117 Heidelberg, FRG REFERENCE 2 (bases 1 to 572) AUTHORS Izaurralde,E., Lewis,J., Gamberi,C., Jarmolowski,A., McGuigan,C. and Mattaj,I.W. TITLE A cap-binding protein complex mediating U snRNA export JOURNAL Nature 376 (6542), 709-712 (1995) MEDLINE 95379956 COMMENT The second subunit, called CBP80, that together with the submited here form the complex CBC has the accession number X80030. FEATURES Location/Qualifiers source 1..572 /organism="Homo sapiens" /db_xref="taxon:9606" gene 30..500 /gene="CBP20" CDS 30..500 /gene="CBP20" /codon_start=1 /product="subunit of the dimeric cap binding complex CBC" /db_xref="PID:g984139" /translation="MSGGLLKALRSDSYVELSQYRDQHFRGDNEEQEKLLKKSCTLYV GNLSFYTTEEQIYELFSKSGDIKKIIMGLDKMKKTACGFCFVEYYSRADAENAMRYIN GTRLDDRIIRTDWDAGFKEGRQYGRGRSGGQVRDEYRQDYDAGRGGYGKLAQNQ" BASE COUNT 163 a 119 c 163 g 127 t ORIGIN 1 cgccgcattg tggtccgctt ctctgcacta tgtcgggtgg cctcctgaag gcgctgcgca 61 gcgactccta cgtggagctg agccagtacc gggaccagca cttccggggt gacaatgaag 121 aacaagaaaa attactgaag aaaagctgta cgttatatgt tggaaatctt tctttttaca 181 caactgaaga acaaatctat gaactcttca gcaaaagtgg tgacataaag aaaatcatta 241 tgggtctgga taaaatgaag aaaacagcat gtggattctg ttttgtggaa tattactcac 301 gcgcagatgc ggaaaacgcc atgcggtaca taaatgggac gcgtctggat gaccgaatca 361 ttcgcacaga ctgggacgca ggctttaagg agggcaggca atacggccgt gggcgatctg 421 ggggccaggt tcgggatgag tatcggcagg actacgatgc tgggagagga ggctatggaa 481 aactggcaca gaaccagtga gtggtgagag ctctgtcagt gacaaacact cctttggcct 541 gttgaatttg ctgaagaaca tcacctaaag tc // LOCUS HSCBP80 2828 bp RNA PRI 08-NOV-1994 DEFINITION H.sapiens CBP80 mRNA. ACCESSION X80030 NID g563367 KEYWORDS cbp80 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2828) AUTHORS Izaurralde,E., Lewis,J., McGuigan,C., Jankowska,M., Darzynkiewicz,E. and Mattaj,I.W. TITLE A nuclear cap binding protein complex involved in pre-mRNA splicing JOURNAL Cell 78 (4), 657-668 (1994) MEDLINE 94349369 REFERENCE 2 (bases 1 to 2828) AUTHORS Izaurralde,E.L. TITLE Direct Submission JOURNAL Submitted (04-JUL-1994) E.L. Izaurralde, EMBL, Meyerhofstr. 1, 69117 Heidelberg, FRG FEATURES Location/Qualifiers source 1..2828 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" gene 31..2816 /gene="cbp80" CDS 31..2403 /gene="cbp80" /codon_start=1 /product="cap binding protein" /db_xref="PID:g563368" /translation="MSRRRHSDENDGGQPHKRRKTSDANETEDHLESLICKVGEKSAC SLESNLEGLAGVLEADLPNYKSKILRLLCTVARLLPEKLTIYTTLVGLLNARNYNFGG EFVEAMIRQLKESLKANNYNEAVYLVRFLSDLVNCHVIAAPSMVAMFENFVSVTQEED VPQVRRDWYVYAFLSSLPWVGKELYEKKDAEMDRIFANTESYLKRRQKTHVPMLQVWT ADKPHPQEEYLDCLWAQIQKLKKDRWQERHILRPYLAFDSILCEALQHNLPPFTPPPH TEDSVYPMPRVIFRMFDYTDDPEGPVMPGSHSVERFVIEENLHCIIKSHWKERKTCAA QLVSYPGKNKIPLNYHIVEVIFAELFQLPAPPHIDVMYTTLLIELCKLQPGSLPQVLA QATEMLYMRLDTMNTTCVDRFINWFSHHLSNFQFRWSWEDWSDCLSQDPESPKPKFVR EVLEKCMRLSYHQRILDIVPPTFSALCPANPTCIYKYGDESSNSLPGHSVALCLAVAF KSKATNDEIFSILKDVPNPNQDDDDDEGFSFNPLKIEVFVQTLLHLAAKSFSHSFSAL AKFHEVFKTLAESDEGKLHVLRVMFEVWRNHPQMIAVLVDKMIRTQIVDCAAVANWIF SSELSRDFTRLFVWEILHSTIRKMNKHVLKIQKELEEAKEKLARQHKRRSDDDDRSSD RKDGVLEEQIERLQEKVESAQSEQKNLFLVIFQRFIMILTEHLVRCETDGTSVLTPWY KNCIERLQQIFLQHHQIIQQYMVTLENLLFTAELDPHILAVFQQFCALQA" sig_peptide 37..90 /gene="cbp80" polyA_signal 2811..2816 /gene="cbp80" BASE COUNT 846 a 572 c 616 g 794 t ORIGIN 1 cctcggttcc gcggcgcacc ggagggcagc atgtcgcggc ggcggcacag cgacgagaac 61 gacggtgggc agcctcacaa aaggagaaag acctctgatg caaatgaaac tgaagatcat 121 ttggaatctt taatatgtaa agtaggagaa aagagtgcct gctctttgga gagcaaccta 181 gaaggcttgg ctggtgtttt ggaagctgat cttcctaact acaagagcaa gatcttaagg 241 cttctttgta cagttgcacg cctattacct gagaagctga caatttatac aacattagtt 301 ggactactga atgccaggaa ttacaatttt ggtggagaat ttgtagaagc catgattcgt 361 caacttaaag aatcattgaa agcaaacaat tataatgaag ccgtgtattt ggtccgtttt 421 ttatctgatc ttgtgaattg tcatgtgatt gccgccccat caatggttgc tatgtttgaa 481 aattttgtaa gcgtaactca ggaagaagat gtacctcagg tgcgacgaga ttggtatgtg 541 tatgcatttc tgtcatcttt gccctgggtt ggaaaggagt tgtacgaaaa gaaagatgca 601 gagatggacc gcatctttgc caacactgaa agctatctta aaagacgcca aaagactcat 661 gtacccatgt tacaggtatg gactgctgat aaaccacatc cacaagaaga gtatttagat 721 tgcctgtggg cccagattca gaaattgaaa aaggatcgct ggcaggaacg gcacatccta 781 agaccttatc ttgcctttga cagcatcctg tgtgaagcac tgcagcacaa tctgcctcct 841 tttacaccac ctcctcacac tgaagattca gtgtacccaa tgccaagggt catcttcaga 901 atgtttgatt acacagatga tcccgagggt cctgtcatgc cagggagtca ttcagtggaa 961 agatttgtaa tagaagagaa tcttcactgc atcattaagt cccactggaa ggaaaggaag 1021 acttgtgctg cacagttagt gagctatcca gggaagaaca agatcccctt gaactaccac 1081 atagttgagg tgatctttgc agagctgttt caacttccag caccccctca cattgatgtg 1141 atgtacacaa cactcctcat tgaactgtgc aaacttcaac ctggctctct accccaagtt 1201 cttgcacagg caactgaaat gctatacatg cgtttggaca caatgaacac tacctgtgta 1261 gacaggttta ttaattggtt ttctcatcat ctaagtaact tccagttccg ttggagctgg 1321 gaagattggt cagattgtct tagtcaagat cctgaaagtc ccaaaccgaa gtttgtaaga 1381 gaagttctag aaaaatgtat gaggttgtct taccatcagc gtatattaga tattgttcct 1441 cctaccttct cagctctgtg tcctgcaaac ccaacctgca tttacaagta tggagatgaa 1501 agtagcaatt ctcttcctgg acattctgtt gccctctgtt tagctgttgc ctttaaaagt 1561 aaggcaacca atgatgaaat cttcagcatt ctgaaagatg taccaaatcc taaccaggat 1621 gatgacgacg atgaaggatt cagttttaac ccattgaaaa tagaagtctt tgtacagact 1681 ctgctacact tggcagccaa atcattcagc cactccttca gtgctcttgc aaagtttcat 1741 gaagtcttca aaaccctagc tgaaagtgat gaaggaaagt tacatgtgct aagagttatg 1801 tttgaggtct ggaggaacca tccacagatg attgctgtac tagtggataa gatgattcgt 1861 acacaaatag ttgattgtgc tgccgtagca aattggatct tctcttcaga actatctcgt 1921 gactttacca gattgtttgt ttgggaaatt ttgcactcta caattcgtaa gatgaacaaa 1981 catgtcctga agatccagaa agagctggaa gaagctaaag agaaacttgc taggcaacac 2041 aaacggcgaa gtgatgatga cgacagaagc agtgacagga aagacggggt tcttgaggaa 2101 caaatagaac gacttcagga aaaagtggaa tctgctcaga gtgaacaaaa gaatcttttc 2161 ctcgttatat ttcagcggtt tatcatgatc ttgaccgagc acctagtacg atgcgaaact 2221 gatgggacca gtgtattaac accatggtat aagaactgta tagagaggct gcagcagatc 2281 ttcctacagc atcaccaaat aatccagcag tacatggtga ccctggagaa ccttctcttc 2341 actgctgaat tagaccctca tatcttggcc gtgttccagc agttctgtgc cctgcaggcc 2401 taagggtcat tttttcctca tgtcaaggtt ttttttgata tcttaaaata atttgtctta 2461 ttttttgatg gtttgaatgc ttgctttctt gtagtatcct ttcacttctt aaaggaaaca 2521 aaggggaaga ggacagtgaa tgaacatggc attactttta attgccctga aaagcaaata 2581 cttcctaacg gcagtaatgt gactatgacc atgatatatt atatatgtga cagatacaaa 2641 ttctctgtga tcagtttgtt attttttttc tccttaaggc aacaaaataa ttggtttgag 2701 gtatgtgaaa cactagaggt caaccttaca tagtatatag aactgatggg tttacccagc 2761 tacccagtag cataactttt cacagctcgg ggatgaatta acatggctga aataaaacta 2821 aaagtatg // LOCUS HSCC21 925 bp RNA PRI 03-MAY-1996 DEFINITION H.sapiens mRNA for chemokine CC-2 and CC-1. ACCESSION Z70292 NID g1296608 KEYWORDS chemokine CC-1; chemokine CC-2. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 925) AUTHORS Pardigol,A., Maegert,H.J., Zucht,HD., Forssmann,W.G. and Schulz-Knappe,P. TITLE Transcription of a Human Tandem Gene results in a Mature Bicistronic mRNA encoding two Novel CC-Chemokines JOURNAL Unpublished REFERENCE 2 (bases 1 to 925) AUTHORS Pardigol,A. TITLE Direct Submission JOURNAL Submitted (25-MAR-1996) Andreas Pardigol, IV - Molecular Biology, Lower Saxony Institute for Peptide Research, Feodor-Lynen-Strasse 31, Hannover, Lower Saxony, 30625, Germany FEATURES Location/Qualifiers source 1..925 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="liver" /clone_lib="PCR fragments" 5'UTR 1..55 CDS 56..397 /note="putative; first coding region of a bicistronic mRNA" /codon_start=1 /product="chemokine CC-2" /db_xref="PID:e233855" /db_xref="PID:g1296609" /translation="MKVSVAALSCLMLVAVLGSQAQFTNDAETELMMSKLPLENPVVL NSFHFAADCCTSYISQSIPCSLMKSYFETSSECSKPGVIFLTKKGRQVCAKPSGPGVQ DCMKKLKPYSI" misc_feature 398..498 /note="spacing region between two coding regions of the bicistronic mRNA" CDS 499..780 /codon_start=1 /evidence=experimental /product="chemokine CC-1" /db_xref="PID:e233856" /db_xref="PID:g1296610" /translation="MKISVAAIPFFLLITIALGTKTESSSRGPYHPSECCFTYTTYKI PRQRIMDYYETNSQCSKPGIVFITKRGHSVCTNPSDKWVQDYIKDMKEN" 3'UTR 781..925 polyA_signal 902..908 BASE COUNT 240 a 296 c 199 g 190 t ORIGIN 1 ccaggaagca gtgagcccag gagtcctcgg ccagccctgc ctgcccacca ggaggatgaa 61 ggtctccgtg gctgccctct cctgcctcat gcttgttgct gtccttggat cccaggccca 121 gttcacaaat gatgcagaga cagagttaat gatgtcaaag cttccactgg aaaatccagt 181 agttctgaac agctttcact ttgctgctga ctgctgcacc tcctacatct cacaaagcat 241 cccgtgttca ctcatgaaaa gttattttga aacgagcagc gagtgctcca agccaggtgt 301 catattcctc accaagaagg ggcggcaagt ctgtgccaaa cccagtggtc cgggagttca 361 ggattgcatg aaaaagctga agccctactc aatataataa taaagagaca aaagaggcca 421 gccacccacc tccaacacct cctgagcctc tgaagctccc accaggccag ctctcctccc 481 acaacagctt cccacagcat gaagatctcc gtggctgcca ttcccttctt cctcctcatc 541 accatcgccc tagggaccaa gactgaatcc tcctcacggg gaccttacca cccctcagag 601 tgctgcttca cctacactac ctacaagatc ccgcgtcagc ggattatgga ttactatgag 661 accaacagcc agtgctccaa gcccggaatt gtcttcatca ccaaaagggg ccattccgtc 721 tgtaccaacc ccagtgacaa gtgggtccag gactatatca aggacatgaa ggagaactga 781 gtgacccaga aggggtggcg aaggcacagc tcagagacat aaagagaaga tgccaaggcc 841 ccctcctcca cccaccgcta actctcagcc ccagtcaccc tcttggagct tccctgcttt 901 gaattaaaga ccactcatgc tcttc // LOCUS HSCCBL 3090 bp RNA PRI 12-DEC-1992 DEFINITION Human mRNA for c-cbl proto-oncogene. ACCESSION X57110 NID g29730 KEYWORDS cbl oncogene; nuclear protein; oncogene cellular. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3090) AUTHORS Langdon,W.Y. TITLE Direct Submission JOURNAL Submitted (02-JAN-1991) W.Y. Langdon, INSTITUTE OF MEDICAL & VETERINARY SCIENE, IMVS, DIVISION OF HUMAN IMMUNOLOGY, BOX 14 RUNDEL MALL POST OFFICE, ADELAIDE SA 5000, AUSTRALIA REFERENCE 2 (bases 1 to 3090) AUTHORS Blake,T.J., Shapiro,M., Morse,H.C. III. and Langdon,W.Y. TITLE The sequences of the human and mouse c-cbl proto-oncogenes show v-cbl was generated by a large truncation encompassing a proline-rich domain and a leucine zipper-like motif JOURNAL Oncogene 6 (4), 653-657 (1991) MEDLINE 91232862 REFERENCE 3 (bases 1 to 3090) AUTHORS Blake,T.J. and Langdon,W.Y. TITLE A rearrangement of the c-cbl proto-oncogene in HUT78 T-lymphoma cells results in a truncated protein JOURNAL Oncogene 7 (4), 757-762 (1992) MEDLINE 92228506 FEATURES Location/Qualifiers source 1..3090 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA 1..3090 /gene="c-cbl" /evidence=experimental gene 1..3090 /gene="c-cbl" CDS 149..2869 /gene="c-cbl" /note="c-cbl protein" /codon_start=1 /db_xref="PID:g29731" /db_xref="SWISS-PROT:P22681" /translation="MAGNVKKSSGAGGGTGSGGSGSGGLIGLMKDAFQPHHHHHHHLS PHPPGTVDKKMVEKCWKLMDKVVRLCQNPKLALKNSPPYILDLLPDTYQHLRTILSRY EGKMETLGENEYFRVFMENLMKKTKQTISLFKEGKERMYEENSQPRRNLTKLSLIFSH MLAELKGIFPSGLFQGDTFRITKADAAEFWRKAFGEKTIVPWKSFRQALHEVHPISSG LEAMALKSTIDLTCNDYISVFEFDIFTRLFQPWSSLLRNWNSLAVTHPGYMAFLTYDE VKARLQKFIHKPGSYIFRLSCTRLGQWAIGYVTADGNILQTIPHNKPLFQALIDGFRE GFYLFPDGRNQNPDLTGLCEPTPQDHIKVTQEQYELYCEMGSTFQLCKICAENDKDVK IEPCGHLMCTSCLTSWQESEGQGCPFCRCEIKGTEPIVVDPFDPRGSGSLLRQGAEGA PSPNYDDDDDERADDTLFMMKELAGAKVERPPSPFSMAPQASLPPVPPRLDLLPQRVC VPSSASALGTASKAASGSLHKDKPLPVPPTLRDLPPPPPPDRPYSVGAESRPQRRPLP CTPGDCPSRDKLPPVPSSRLGDSWLPRPIPKVPVSAPSSSDPWTGRELTNRHSLPFSL PSQMEPRPDVPRLGSTFSLDTSMSMNSSPLVGPECDHPKIKPSSSANAIYSLAARPLP VPKLPPGEQCEGEEDTEYMTPSSRPLRPLDTSQSSRACDCDQQIDSCTYEAMYNIQSQ APSITESSTFGEGNLAAAHANTGPEESENEDDGYDVPKPPVPAVLARRTLSDISNASS SFGWLSLDGDPTTNVTEGSQVPERPPKPFPRRINSERKAGSCQQGSGPAASAATASPQ LSSEIENLMSQGYSYQDIQKALVIAQNNIEMAKNILREFVSISSPAHVAT" BASE COUNT 757 a 840 c 764 g 729 t ORIGIN 1 gaattccggg cccggatagc cggcggcggc ggcggcggcg gcggcggcgg cggccgggag 61 aggcccctcc ttcacgccct gcttctctcc ctcgctcgca gtcgagccga gccggcggac 121 ccgcctgggc tccgaccctg cccaggccat ggccggcaac gtgaagaaga gctctggggc 181 cgggggcggc acgggctccg ggggctcggg ttcgggtggc ctgattgggc tcatgaagga 241 cgccttccag ccgcaccacc accaccacca ccacctcagc ccccacccgc cggggacggt 301 ggacaagaag atggtggaga agtgctggaa gctcatggac aaggtggtgc ggttgtgtca 361 gaacccaaag ctggcgctaa agaatagccc accttatatc ttagacctgc taccagatac 421 ctaccagcat ctccgtacta tcttgtcaag atatgagggg aagatggaga cacttggaga 481 aaatgagtat tttagggtgt ttatggagaa tttgatgaag aaaactaagc aaaccataag 541 cctcttcaag gagggaaaag aaagaatgta tgaggagaat tctcagccta ggcgaaacct 601 aaccaaactg tccctcatct tcagccacat gctggcagaa ctaaaaggaa tctttccaag 661 tggactcttt cagggagaca catttcggat tactaaagca gatgctgcgg aattttggag 721 aaaagctttt ggggaaaaga caatagtccc ttggaagagc tttcgacagg ctctacatga 781 agtgcatccc atcagttctg ggctggaggc catggctctg aaatccacta ttgatctgac 841 ctgcaatgat tatatttcgg tttttgaatt tgacatcttt acccgactct ttcagccctg 901 gtcctctttg ctcaggaatt ggaacagcct tgctgtaact catcctggct acatggcttt 961 tttgacgtat gacgaagtga aagctcggct ccagaaattc attcacaaac ctggcagtta 1021 tatcttccgg ctgagctgta ctcgtctggg tcagtgggct attgggtatg ttactgctga 1081 tgggaacatt ctccagacaa tccctcacaa taaacctctc ttccaagcac tgattgatgg 1141 cttcagggaa ggcttctatt tgtttcctga tggacgaaat cagaatcctg atctgactgg 1201 cttatgtgaa ccaactcccc aagaccatat caaagtgacc caggaacaat atgaattata 1261 ctgtgagatg ggctccacat tccaactatg taaaatatgt gctgaaaatg ataaggatgt 1321 aaagattgag ccctgtggac acctcatgtg cacatcctgt cttacatcct ggcaggaatc 1381 agaaggtcag ggctgtcctt tctgccgatg tgaaattaaa ggtactgaac ccatcgtggt 1441 agatccgttt gatcctagag ggagtggcag cctgttgagg caaggagcag agggagctcc 1501 ctccccaaat tatgatgatg atgatgatga acgagctgat gatactctct tcatgatgaa 1561 ggaattggct ggtgccaagg tggaacggcc gccttctcca ttctccatgg ccccacaagc 1621 ttcccttccc ccggtgccac cacgacttga ccttctgccg cagcgagtat gtgttccctc 1681 aagtgcttct gctcttggaa ctgcttctaa ggctgcttct ggctcccttc ataaagacaa 1741 accattgcca gtacctccca cacttcgaga tcttccacca ccaccgcctc cagaccggcc 1801 atattctgtt ggagcagaat cccgacctca aagacgcccc ttgccttgta caccaggcga 1861 ctgtccctcc agagacaaac tgccccctgt cccctctagc cgccttggag actcatggct 1921 gccccggcca atccccaaag taccagtatc tgccccaagt tccagtgatc cctggacagg 1981 aagagaatta accaaccggc actcacttcc attttcattg ccctcacaaa tggagcccag 2041 accagatgtg cctaggctcg gaagcacgtt cagtctggat acctccatga gtatgaatag 2101 cagcccatta gtaggtccag agtgtgacca ccccaaaatc aaaccttcct catctgccaa 2161 tgccatttat tctctggctg ccagacctct tcctgtgcca aaactgccac ctggggagca 2221 atgtgagggt gaagaggaca cagagtacat gactccctct tccaggcctc tacggccttt 2281 ggatacatcc cagagttcac gagcatgtga ttgcgaccag cagattgata gctgtacgta 2341 tgaagcaatg tataatattc agtcccaggc gccatctatc accgagagca gcacctttgg 2401 tgaagggaat ttggccgcag cccatgccaa cactggtccc gaggagtcag aaaatgagga 2461 tgatgggtat gatgtcccaa agccacctgt gccggccgtg ctggcccgcc gaactctctc 2521 agatatctct aatgccagct cctcctttgg ctggttgtct ctggatggtg atcctacaac 2581 aaatgtcact gaaggttccc aagttcccga gaggcctcca aaaccattcc cgcggagaat 2641 caactctgaa cggaaagctg gcagctgtca gcaaggtagt ggtcctgccg cctctgctgc 2701 caccgcctca cctcagctct ccagtgagat cgagaacctc atgagtcagg ggtactccta 2761 ccaggacatc cagaaagctt tggtcattgc ccagaacaac atcgagatgg ccaaaaacat 2821 cctccgggaa tttgtttcca tttcttctcc tgcccatgta gctacctagc acaccatctc 2881 cctgctgcag gtttagagga ccagtgagtt gggagttatt actcaagtgg cacctagaag 2941 ggcaggagtt cctttggtga cttcacagtg aagtcttgcc ctctctgtgg gatatcacat 3001 cagtggttcc aagatttcaa agtggtgaaa tgaaaatgga gcagctagta tgttttatta 3061 ttttatgggt cttgagtgca tttgaaggtg // LOCUS HSCCCR3 1677 bp RNA PRI 04-JUN-1996 DEFINITION H.sapiens mRNA for C-C chemokine receptor-4. ACCESSION X85740 NID g1370103 KEYWORDS c-c chemokine receptor type 4; c-c ckr-4 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1677) AUTHORS Power,C.A., Meyer,A., Nemeth,K., Bacon,K.B., Hoogewerf,A.J., Proudfoot,A.E. and Wells,T.N. TITLE Molecular cloning and functional expression of a novel CC chemokine receptor cDNA from a human basophilic cell line JOURNAL J. Biol. Chem. 270 (33), 19495-19500 (1995) MEDLINE 95370289 REFERENCE 2 (bases 1 to 1677) AUTHORS Power,C.A. TITLE Direct Submission JOURNAL Submitted (20-MAR-1995) C.A. Power, Glaxo Institute for Molecular Biology, 14, chemin des Aulx, CH- 1228 Plan-les-Ouates, Geneva, SWITZERLAND REMARK Revised by [3] REFERENCE 3 (bases 1 to 1677) AUTHORS Power,C.A. TITLE Direct Submission JOURNAL Submitted (04-JUN-1996) C.A. Power, Glaxo Institute for Molecular Biology, 14, chemin des Aulx, CH- 1228 Plan-les-Ouates, Geneva, SWITZERLAND FEATURES Location/Qualifiers source 1..1677 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda gt11" mRNA <1..1677 intron <1..88 /note="unspliced intron" gene 183..1265 /gene="c-c ckr-4" CDS 183..1265 /gene="c-c ckr-4" /codon_start=1 /product="c-c chemokine receptor-4" /db_xref="PID:g971452" /translation="MNPTDIADTTLDESIYSNYYLYESIPKPCTKEGIKAFGELFLPP LYSLVFVFGLLGNSVVVLVLFKYKRLRSMTDVYLLNLAISDLLFVFSLPFWGYYAADQ WVFGLGLCKMISWMYLVGFYSGIFFVMLMSIDRYLAIVHAVFSLRARTLTYGVITSLA TWSVAVFASLPGFLFSTCYTERNHTYCKTKYSLNSTTWKVLSSLEINILGLVIPLGIM LFCYSMIIRTLQHCKNEKKNKAVKMIFAVVVLFLGFWTPYNIVLFLETLVELEVLQDC TFERYLDYAIQATETLAFVHCCLNPIIYFFLGEKFRKYILQLFKTCRGLFVLCQYCGL LQIYSADTPSSSYTQSTMDHDLHDAL" BASE COUNT 377 a 432 c 374 g 494 t ORIGIN 1 cgggggtttt gatcttcttc cccttctttt cttccccttc ttctttcctt cctccctccc 61 tctctcattt cccttctcct tctccctcag tctccacatt caacattgac aagtccattc 121 agaaaagcaa gctgcttctg gttgggccca gacctgcctt gaggagcctg tagagttaaa 181 aaatgaaccc cacggatata gcagatacca ccctcgatga aagcatatac agcaattact 241 atctgtatga aagtatcccc aagccttgca ccaaagaagg catcaaggca tttggggagc 301 tcttcctgcc cccactgtat tccttggttt ttgtatttgg tctgcttgga aattctgtgg 361 tggttctggt cctgttcaaa tacaagcggc tcaggtccat gactgatgtg tacctgctca 421 accttgccat ctcggatctg ctcttcgtgt tttccctccc tttttggggc tactatgcag 481 cagaccagtg ggtttttggg ctaggtctgt gcaagatgat ttcctggatg tacttggtgg 541 gcttttacag tggcatattc tttgtcatgc tcatgagcat tgatagatac ctggcgatag 601 tgcacgcggt gttttccttg agggcaagga ccttgactta tggggtcatc accagtttgg 661 ctacatggtc agtggctgtg ttcgcctccc ttcctggctt tctgttcagc acttgttata 721 ctgagcgcaa ccatacctac tgcaaaacca agtactctct caactccacg acgtggaagg 781 ttctcagctc cctggaaatc aacattctcg gattggtgat ccccttaggg atcatgctgt 841 tttgctactc catgatcatc aggaccttgc agcattgtaa aaatgagaag aagaacaagg 901 cggtgaagat gatctttgcc gtggtggtcc tcttccttgg gttctggaca ccttacaaca 961 tagtgctctt cctagagacc ctggtggagc tagaagtcct tcaggactgc acctttgaaa 1021 gatacttgga ctatgccatc caggccacag aaactctggc ttttgttcac tgctgcctta 1081 atcccatcat ctactttttt ctgggggaga aatttcgcaa gtacatccta cagctcttca 1141 aaacctgcag gggccttttt gtgctctgcc aatactgtgg gctcctccaa atttactctg 1201 ctgacacccc cagctcatct tacacgcagt ccaccatgga tcatgatctt catgatgctc 1261 tgtaggaaaa atgaaatggt gaaatgcaga gtcaatgaac ttttccacat tcagagctta 1321 ctttaaaatt ggtattttta ggtaagagat ccctgagcca gtgtcaggag gaaggcttac 1381 acccacagtg gaaagacagc ttctcatcct gcaggcagct ttttctctcc cactagacaa 1441 gtccagcctg gcaagggttc acctgggctg aggcatcctt cctcacacca ggcttgcctg 1501 caggcatgag tcagtctgat gagaactctg agcagtgctt gaatgaagtt gtaggtaata 1561 ttgcaaggca aagactattc ccttctaacc tgaactgatg ggtttctcca gagggaattg 1621 cagagtactg gctgatggag taaatcgcta ccttttgctg tggcaaatgg gcccccg // LOCUS HSCCG1 5257 bp RNA PRI 12-MAR-1996 DEFINITION Human X chromsome mRNA for CCG1 protein inv. in cell proliferation. ACCESSION X07024 NID g29732 KEYWORDS CCG1 gene; cell cycle control. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5257) AUTHORS Nishimto,T. TITLE Direct Submission JOURNAL Submitted (29-FEB-1988) Nishimto T., Dept of Molecular Biology, Graduate School of Medical Science, Kyushu University, Fukoka 812, Japan REFERENCE 2 (bases 1 to 5257) AUTHORS Sekoguchi,T., Miyata,T. and Nishimoto,T. TITLE Molecular cloning of the cDNA of human X chromosomal gene (CCG1) which complement the temperature sensitive G1 mutants, tsBN462 and ts13 of BHK cells JOURNAL Unpublished REFERENCE 3 (bases 1 to 5257) AUTHORS Aves,S.J., Hindley,J., Phear,G.A. and Tongue,N. TITLE A fission yeast gene mapping close to suc1 encodes a protein containing two bromodomains JOURNAL Mol. Gen. Genet. 248 (4), 491-498 (1995) MEDLINE 96004771 FEATURES Location/Qualifiers source 1..5257 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KB" /chromosome="X chromosome, q/ter-p/21" CDS 332..4996 /note="CCG1 protein (AA 1 - 1554)" /codon_start=1 /db_xref="PID:g29733" /db_xref="SWISS-PROT:P21675" /translation="MYRDECKKHLAGLGALGLGSLITELTANEELTGTDGALVNDEGW VRSTEDAVDYSDINEVAEDESRRYQQTMGSLQPLCHSDYDEDDYDADCEDIDCKLMPP PPPPPGPMKKDKDQDSITGEKVDFSSSSDSESEMGPQEATQAESEDGKLTLPLAGIMQ HDATKLLPSVTELFPEFRPGKVLRFLRLFGPGKNVPSVWRSARRKRKKKHRELIQEEQ IQEVECSVESEVSQKSLWNYDYAPPPPPEQCLSDDEITMMAPVESKFSQSTGDIDKVT DTKPRVAEWRYGPARLWYDMLGVPEDGSGFDYGFKLRKTEHEPVIKSRMIEEFRKLEE NNGTDLLADENFLMVTQLHWEDDIIWDGEDVKHKGTKPQRASLAGWLPSSMTRNAMAY NVQQGFAATLDDDKPWYSIFPIDNEDLVYGRWEDNIIWDAQAMPRLLEPPVLTLDPND ENLILEIPDEKEEATSNSPSKESKKESSLKKSRILLGKTGVIKEEPQQNMSQPEVKDP WNLSNDEYYYPKQQGLRGTFGGNIIQHSIPAVELRQPFFPTHMGPIKLRQFHRPPLKK YSFGALSQPGPHSVQPLLKHIKKKAKMREQERQASGGGEMFFMRTPQDLTGKDGDLIL AEYSEENGPLMMQVGMATKIKNYYKRKPGKDPGAPDCKYGETVYCHTSPFLGSLHPGQ LLQAFENNLFRAPIYLHKMPETDFLIIRTRQGYYIRELVDIFVVGQQCPLFEVPGPNS KRANTHIRDFLQVFIYRLFWKSKDRPRRIRMEDIKKAFPSHSESSIRKRLKLCADFKR TGMDSNWWVLKSDFRLPTEEEIRAMVSPEQCCAYYSMIAAEQRLKDAGYGEKSFFAPE EENEEDFQMKIDDEVRTAPWNTTRAFIAAMKGKCLLEVTGVADPTGCGEGFSYVKIPN KPTQQKDDKEPQPVKKTVTGTDADLRRLSLKNAKQLLRKFGVPEEEIKKLSRWEVIDV VRTMSTEQARSGEGPMSKFARGSRFSVAEHQERYKEECQRIFDLQNKVLSSTEVLSTD TDSSSAEDSDFEEMGKNIENMLQNKKTSSQLSREREEQERKELQRMLLAAGSAASGNN HRDDDTASVTSLNSSATGRCLKIYRTFRDEEGKEYVRCETVRKPAVIDAYVRIRTTKD EEFIRKFALFDEQHREEMRKERRRIQEQLRRLKRNQEKEKLKGPPEKKPKKMKERPDL KLKCGACGAIGHMRTNKFCPLYYQTNAPPSNPVAMTEEQEEELEKTVIHNDNEELIKV EGTKIVLGKQLIESADEVRRKSLVLKFPKQQLPPKKKRRVGTTVHCDYLNRPHKSIHR RRTDPMVTLSSILESIINDMRDLPNTYPFHTPVNAKVVKDYYKIITRPMDLQTLRENV RKRLYPSREEFREHLELIVKNSATYNGPKHSLTQISQSMLDLCDEKLKEKEDKLARLE KAINPLLDDDDQVAFSFILDNIVTQKMMAVPDSWPFHHPVNKKFVPDYYKVIVNPMDL ETIRKNISKHKYQSRESFLDDVNLILANSVKYNDNECSSKANDIVCLIQYCSSQIEEL RF" misc_feature 4215..4236 /note="pot. nuclear translocation signal" repeat_region 4256..4618 /note="direct repeat 1" repeat_region 4619..4993 /note="direct repeat 1" misc_feature 5236..5241 /note="polyA signal" polyA_site 5257 /note="polyA site" BASE COUNT 1562 a 1137 c 1296 g 1262 t ORIGIN 1 gaattccttt tttttttgag ctttaaataa agcatttatt catgagcgga agcttacagt 61 ttgcatagat tcttcatacc ttatctggaa gggcgatgga aaccccaagg cactagagag 121 catcagaaga aatcagtgac atgatttgag tagggctggg ggactgggtc cctgcacccc 181 agccacatcc tatgggcctt aggcccatac tcggagaacg agtccattgg acaaagaaca 241 tggctgagag accttctggg ggccttgaag aggccgcctc cttggtctcc tcaaccccag 301 tgtaagtctg gggaggccca aggtgagggt catgtatcgg gatgaatgta agaagcactt 361 ggcaggcttg ggggctttgg ggctgggcag cctgatcact gaactcacgg caaatgaaga 421 attgaccggg actgacggtg ccttggtaaa tgatgaaggg tgggttagga gtacagaaga 481 tgctgtggac tattcagaca tcaatgaggt ggcagaagat gaaagccgaa gataccagca 541 gacgatgggg agcttgcagc ccctttgcca ctcagattat gatgaagatg actatgatgc 601 tgattgtgaa gacattgatt gcaagttgat gcctcctcca cctccacccc cgggaccaat 661 gaagaaggat aaggaccagg attctattac tggtgagaaa gtggacttca gtagttcctc 721 tgactcagaa tctgagatgg gacctcagga agcaacacag gcagaatctg aagatggaaa 781 gctgaccctt ccattggctg ggattatgca gcatgatgcc accaagctgt tgccaagtgt 841 cacagaactt tttccagaat ttcgacctgg aaaggtgtta cgttttctac gtctttttgg 901 accagggaag aatgtcccat ctgtttggcg gagtgctcgg agaaagagga agaagaagca 961 ccgtgagctg atacaggaag agcagatcca ggaggtggag tgctcagtag aatcagaagt 1021 cagccagaag tctttgtgga actacgacta cgctccacca ccacctccag agcagtgtct 1081 ctctgatgat gaaatcacga tgatggctcc tgtggagtcc aaattttccc aatcaactgg 1141 agatatagat aaagtgacag ataccaaacc aagagtggct gagtggcgtt atgggcctgc 1201 ccgactgtgg tatgatatgc tgggtgtccc tgaagatggc agtgggtttg actatggctt 1261 caaactgaga aagacagaac atgaacctgt gataaaatct agaatgatag aggaatttag 1321 gaaacttgag gaaaacaatg gcactgatct tctggctgat gaaaacttcc tgatggtgac 1381 acagctgcat tgggaggatg atatcatctg ggatggggag gatgtcaaac acaaagggac 1441 aaaacctcag cgtgcaagcc tggcaggctg gcttccttct agcatgacta ggaatgcgat 1501 ggcttacaat gttcagcaag gttttgcagc cactcttgat gatgacaaac cttggtactc 1561 catttttccc attgacaatg aggatctggt atatggacgc tgggaggaca atatcatttg 1621 ggatgctcag gccatgcccc ggctgttgga acctcctgtt ttgacacttg atcccaatga 1681 tgagaacctc attttggaaa ttcctgatga gaaggaagag gccacctcta actccccctc 1741 caaggagagt aagaaggaat catctctgaa gaagagtcga attctcttag ggaaaacagg 1801 agtcatcaag gaggaaccac agcagaacat gtctcagcca gaagtgaaag atccatggaa 1861 tctctccaat gatgagtatt attatcccaa gcaacagggt cttcgaggca cctttggagg 1921 gaatattatc cagcattcaa ttcctgctgt ggaattacgg cagcccttct ttcccaccca 1981 catggggccc atcaaactcc ggcagttcca tcgcccacct ctgaaaaagt actcatttgg 2041 tgcactttct cagccaggtc cccactcagt ccaacctttg ctaaagcaca tcaaaaaaaa 2101 ggccaagatg agagaacaag agaggcaagc ttcaggtggt ggagagatgt tttttatgcg 2161 cacacctcag gacctcacag gcaaagatgg tgatcttatt cttgcagaat atagtgagga 2221 aaatggaccc ttaatgatgc aggttggcat ggcaaccaag ataaagaact attataaacg 2281 gaaacctgga aaagatcctg gagcaccaga ttgtaaatat ggggaaactg tttactgcca 2341 tacatctcct ttcctgggtt ctctccatcc tggccaattg ctgcaagcat ttgagaacaa 2401 cctttttcgt gctccaattt atcttcataa gatgccagaa actgatttct tgatcattcg 2461 gacaagacag ggttactata ttcgggaatt agtggatatt tttgtggttg gccagcagtg 2521 tcccttgttt gaagttcctg ggcctaactc caaaagggcc aatacgcata ttcgagactt 2581 tctacaggtt tttatttacc gccttttctg gaaaagtaaa gatcggccac ggaggatacg 2641 aatggaagat ataaaaaaag cctttccttc ccattcagaa agcagcatcc ggaagaggct 2701 aaagctctgc gctgacttca aacgcacagg gatggactca aactggtggg tgcttaagtc 2761 tgattttcgt ttaccaacgg aagaagagat cagagctatg gtgtcaccag agcagtgctg 2821 tgcttattat agcatgatag ctgcagagca acgactgaag gatgctggct atggtgagaa 2881 atcctttttt gctccagaag aagaaaatga ggaagatttc cagatgaaga ttgatgatga 2941 agttcgcact gccccttgga acaccacaag ggccttcatt gctgccatga agggcaagtg 3001 tctgctagag gtgactgggg tggcagatcc cacggggtgt ggtgaaggat tctcctatgt 3061 gaagattcca aacaaaccaa cacagcagaa ggatgataaa gaaccgcagc cagtgaagaa 3121 gacagtgaca ggaacagatg cagaccttcg tcgcctttcc ctgaaaaatg ccaagcaact 3181 tctacgtaaa tttggtgtgc ctgaggaaga gattaaaaag ttgtcccgct gggaagtgat 3241 tgatgtggtg cgcacaatgt caacagaaca ggctcgttct ggagaggggc ccatgagtaa 3301 atttgcccgt ggatcaaggt tttctgtggc tgagcatcaa gagcgttaca aagaggaatg 3361 tcagcgcatc tttgacctac agaacaaggt tctgtcatca actgaagtct tatcaactga 3421 cacagacagc agctcagctg aagatagtga ctttgaagaa atgggaaaga acattgagaa 3481 catgttgcag aacaagaaaa ccagctctca gctttcacgt gaacgggagg aacaggagcg 3541 gaaggaacta cagcgaatgc tactggcagc aggctcagca gcatccggaa acaatcacag 3601 agatgatgac acagcttccg tgactagcct taactcttct gccactggac gctgtctcaa 3661 gatttatcgc acgtttcgag atgaagaggg gaaagagtat gttcgctgtg agacagtccg 3721 aaaaccagct gtcattgatg cctatgtgcg catacggact acaaaagatg aggaattcat 3781 tcgaaaattt gccctttttg atgaacaaca tcgggaagag atgcgaaaag aacggcggag 3841 gattcaagag caactgaggc ggcttaagag gaaccaggaa aaggagaagc ttaagggtcc 3901 tcctgagaag aagcccaaga aaatgaagga gcgtcctgac ctaaaactga aatgtggggc 3961 atgtggtgcc attggacaca tgaggactaa caaattctgc cccctctatt atcaaacaaa 4021 tgcgccacct tccaaccctg ttgccatgac agaagaacag gaggaggagt tggaaaagac 4081 agtcattcat aatgataatg aagaacttat caaggttgaa gggaccaaaa ttgtcttggg 4141 gaaacagcta attgagagtg cggatgaggt tcgcagaaaa tctctggttc tcaagtttcc 4201 taaacagcag cttcctccaa agaagaaacg gcgagttgga accactgttc actgtgacta 4261 tttgaataga cctcataagt ccatccaccg gcgccgcaca gaccctatgg tgacgctgtc 4321 gtccatcttg gagtctatca tcaatgacat gagagatctt ccaaatacat accctttcca 4381 cactccagtc aatgcaaagg ttgtaaagga ctactacaaa atcatcactc ggccaatgga 4441 cctacaaaca ctccgcgaaa acgtgcgtaa acgcctctac ccatctcggg aagagttcag 4501 agagcatctg gagctaattg tgaaaaatag tgcaacctac aatgggccaa aacactcatt 4561 gactcagatc tctcaatcca tgctggatct ctgtgatgaa aaactcaaag agaaagaaga 4621 caaattagct cgcttagaga aagctatcaa ccccttgctg gatgatgatg accaagtggc 4681 gttttctttc attctggaca acattgtcac ccagaaaatg atggcagttc cagattcttg 4741 gccatttcat cacccagtta ataagaaatt tgttccagat tattacaaag tgattgtcaa 4801 tccaatggat ttagagacca tacgtaagaa catctccaag cacaagtatc agagtcggga 4861 gagctttctg gatgatgtaa accttattct ggccaacagt gttaagtata atgacaatga 4921 gtgttcatct aaagcaaatg acatagtttg cctaatccag tactgtagtt cacagataga 4981 agaattaaga ttttaatggg acggtgattt gccagcagtc cctactgaat ttcttaatta 5041 agatttgtgc ccaactgtcc tggtctctaa actggtgtca tgtttcctcc ttattccatc 5101 atgtccctga tcatagcctg ccaatctgga tgtagaactc tctgctgctc tcctggaatg 5161 atgtctacct gcatgctgcc atgcctccca ccatgacaat aattgactga agctctgaac 5221 tgtaaggcag ccccaattaa atgctttcct ttatagg // LOCUS HSCCR9 1181 bp RNA PRI 17-JUN-1997 DEFINITION H.sapiens mRNA for chemokine receptor CCR-9. ACCESSION Y12815 NID g2204204 KEYWORDS chemokine receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1181) AUTHORS Nibbs,R.J.B., Lowe,S. and Graham,G.J. TITLE Cloning and characterisation of a novel human chemokine receptor CCR-9 JOURNAL Unpublished REFERENCE 2 (bases 1 to 1181) AUTHORS Graham,G.J. TITLE Direct Submission JOURNAL Submitted (29-APR-1997) G.J. Graham, Beatson Institute for Cancer Research, Garscube Estate, Switchback Road, Bearsden, Glasgow, G61 1BD, UK FEATURES Location/Qualifiers source 1..1181 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="U937 monocytic cells" CDS 13..1167 /codon_start=1 /product="chemokine receptor CCR-9" /db_xref="PID:e322086" /db_xref="PID:g2204205" /translation="MAATASPQPLATEDADSENSSFYYYDYLDEVAFMLCRKDAVVSF GKVFLPVFYSLIFVLGLSGNLLLLMVLLRYVPRRRMVEIYLLNLAISNLLFLVTLPFW GISVAWHWVFGSFLCKMVSTLYTINFYSGIFFISCMSLDKYLEIVHAQPYHRLRTRAK SLLLATIVWAVSLAVSIPDMVFVQTHENPKGVWNCHADFGGHGTIWKLFLRFQQNLLG FLLPLLAMIFFYSRIGCVLVRLRPAGQGRALKIAAALVVAFFVLWFPYNLTLFLHTLL DLQVFGNCEVSQHLDYALQVTESIAFLHCCFSPILYAFSSHRFRQYLKAFLAAVLGWH LAPGTAQASLSSCSESSILTAQEEMTGMNDLGERQSENYPNKEDVGNKSA" BASE COUNT 224 a 337 c 298 g 322 t ORIGIN 1 ggatcctcca acatggccgc cactgcctct ccgcagccac tcgccactga ggatgccgat 61 tctgagaata gcagcttcta ttactatgac tacctggatg aagtggcctt catgctctgc 121 aggaaggatg cagtggtgtc ctttggcaaa gtcttcctcc cagtcttcta tagcctgatt 181 tttgtgttgg gcctcagcgg gaacctcctt cttctcatgg tcttgctccg ttacgtgcct 241 cgcaggcgga tggttgagat ctatctgctg aatctggcca tctccaacct tctgtttctg 301 gtgacactgc ccttctgggg catctccgtg gcctggcatt gggtcttcgg gagtttcttg 361 tgcaagatgg tgagcactct ttatactatt aacttttaca gtggcatctt tttcattagc 421 tgcatgagcc tggacaagta cctggagatc gttcatgctc agccctacca caggctgagg 481 acccgggcca agagcctgct ccttgctacc atagtatggg ctgtgtccct ggccgtctcc 541 atccctgata tggtctttgt acagacacat gaaaatccca agggtgtgtg gaactgccac 601 gcagatttcg gcgggcatgg gaccatttgg aagctcttcc tccgcttcca gcagaacctc 661 ctagggtttc tccttccact ccttgccatg atcttcttct actcccgtat tggttgtgtc 721 ttggtgaggc tgaggcccgc aggccagggc cgggctttaa aaatagctgc agccttggtg 781 gtggccttct tcgtgctatg gttcccatac aatctcacct tgtttctgca tacgctgttg 841 gacctgcaag tattcgggaa ctgtgaggtc agccagcatc tagactacgc actccaggta 901 acagagagca tcgccttcct tcactgctgc ttttccccca tcctgtatgc cttctccagt 961 caccgcttcc gccagtacct gaaggctttc ctggctgccg tgcttggatg gcacctggca 1021 cctggcactg cccaggcctc attatccagc tgttctgaga gcagcatact tactgcccaa 1081 gaggaaatga ctggcatgaa tgaccttgga gagaggcagt ctgagaacta ccctaacaag 1141 gaggatgtgg ggaataaatc agcctgagtg accgcggccg c // LOCUS HSCD22AG 2116 bp RNA PRI 04-DEC-1992 DEFINITION H.sapiens CD22 mRNA. ACCESSION X52785 NID g29778 KEYWORDS CD22 antigen. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2116) AUTHORS Stamenkovic,I. and Seed,B. TITLE The B-cell antigen CD22 mediates monocyte and erythrocyte adhesion JOURNAL Nature 345 (6270), 74-77 (1990) MEDLINE 90231465 FEATURES Location/Qualifiers source 1..2116 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 35..1978 /codon_start=1 /product="CD22 antigen" /db_xref="PID:g29779" /db_xref="SWISS-PROT:P20273" /translation="MHLLGPWLLLLVLEYLAFSDSSKWVFEHPETLYAWEGACVWIPC TYRALDGDLESFILFHNPEYNKNTSKFDGTRLYESTKDGKVPSEQKRVQFLGDKNKNC TLSIHPVHLNDSGQLGLRMESKTEKWMERIHLNVSERPFPPHIQLPPEIQESQEVTLT CLLNFSCYGYPIQLQWLLEGVPMRQAAVTSTSLTIKSVFTRSELKFSPQWSHHGKIVT CQLQDADGKFLSNDTVQLNVKHPPKKVTTVIQNPMPIREGDTVTLSCNYNSSNPSVTR YEWKPHGAWEEPSLGVLKIQNVGWDNTTIACAACNSWCSWASPVALNVQYAPRDVRVR KIKPLSEIHSGNSVSLQCDFSSSHPKEVQFFWEKNGRLLGKESQLNFDSISPEDAGSY SCWVNNSIGQTASKAWTLEVLYAPRRLRVSMSPGDQVMEGKSATLTCESDANPPVSHY TWFDWNNQSLPYHSQKLRLEPVKVQHSGAYWCQGTNSVGKGRSPLSTLTVYYSPETIG RRVAVGLGSCLAILILAICGLKLQRRWKRTQSQQGLQENSSGQSFFVRNKKVRRAPLS EGPHSLGCYNPMMEDGISYTTLRFPEMNIPRTGDAESSEMQRPPPDCDDTVTYSALHK RQVGTMRTSFQIFQKMRGFITQS" BASE COUNT 525 a 594 c 569 g 428 t ORIGIN 1 acgcggaaac aggcttgcac ccagacacga caccatgcat ctcctcggcc cctggctcct 61 gctcctggtt ctagaatact tggctttctc tgactcaagt aaatgggttt ttgagcaccc 121 tgaaaccctc tacgcctggg agggggcctg cgtctggatc ccctgcacct acagagccct 181 agatggtgac ctggaaagct tcatcctgtt ccacaatcct gagtataaca agaacacctc 241 gaagtttgat gggacaagac tctatgaaag cacaaaggat gggaaggttc cttctgagca 301 gaaaagggtg caattcctgg gagacaagaa taagaactgc acactgagta tccacccggt 361 gcacctcaat gacagtggtc agctggggct gaggatggag tccaagactg agaaatggat 421 ggaacgaata cacctcaatg tctctgaaag gccttttcca cctcatatcc agctccctcc 481 agaaattcaa gagtcccagg aagtcactct gacctgcttg ctgaatttct cctgctatgg 541 gtatccgatc caattgcagt ggctcctaga gggggttcca atgaggcagg ctgctgtcac 601 ctcgacctcc ttgaccatca agtctgtctt cacccggagc gagctcaagt tctccccaca 661 gtggagtcac catgggaaga ttgtgacctg ccagcttcag gatgcagatg ggaagttcct 721 ctccaatgac acggtgcagc tgaacgtgaa gcatcctccc aagaaggtga ccacagtgat 781 tcaaaacccc atgccgattc gagaaggaga cacagtgacc ctttcctgta actacaattc 841 cagtaacccc agtgttaccc ggtatgaatg gaaaccccat ggcgcctggg aggagccatc 901 gcttggggtg ctgaagatcc aaaacgttgg ctgggacaac acaaccatcg cctgcgcagc 961 ttgtaatagt tggtgctcgt gggcctcccc tgtcgccctg aatgtccagt atgccccccg 1021 agacgtgagg gtccggaaaa tcaagcccct ttccgagatt cactctggaa actcggtcag 1081 cctccaatgt gacttctcaa gcagccaccc caaagaagtc cagttcttct gggagaaaaa 1141 tggcaggctt ctggggaaag aaagccagct gaattttgac tccatctccc cagaagatgc 1201 tgggagttac agctgctggg tgaacaactc cataggacag acagcgtcca aggcctggac 1261 acttgaagtg ctgtatgcac ccaggaggct gcgtgtgtcc atgagcccgg gggaccaagt 1321 gatggagggg aagagtgcaa ccctgacctg tgagagcgac gccaaccctc ccgtctccca 1381 ctacacctgg tttgactgga ataaccaaag cctcccctac cacagccaga agctgagatt 1441 ggagccggtg aaggtccagc actcgggtgc ctactggtgc caggggacca acagtgtggg 1501 caagggccgt tcgcctctca gcaccctcac cgtctactat agcccggaga ccatcggcag 1561 gcgagtggct gtgggactcg ggtcctgcct cgccatcctc atcctggcaa tctgtgggct 1621 caagctccag cgacgttgga agaggacaca gagccagcag gggcttcagg agaattccag 1681 cggccagagc ttctttgtga ggaataaaaa ggttagaagg gcccccctct ctgaaggccc 1741 ccactccctg ggatgctaca atccaatgat ggaagatggc attagctaca ccaccctgcg 1801 ctttcccgag atgaacatac cacgaactgg agatgcagag tcctcagaga tgcagagacc 1861 tcccccggac tgcgatgaca cggtcactta ttcagcattg cacaagcgcc aagtgggcac 1921 tatgagaacg tcattccaga ttttccagaa gatgagggga ttcattactc agagctgatc 1981 cagtttgggg tcggggagcg gcctcaggca caagaaaatg tggactatgt gatcctcaaa 2041 cattgacact ggatgggctg cagcagaggc actgggggca gcgggggcca gggaagtccc 2101 cgagttttcc ccagac // LOCUS HSCD37 1125 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for leukocyte antigen CD37. ACCESSION X14046 NID g29793 KEYWORDS antigen; CD37 antigen; cell surface glycoprotein; transmembrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1125) AUTHORS Classon,B.J. TITLE Direct Submission JOURNAL Submitted (13-JAN-1989) Classon B.J., MRC Cellular Immunology Unit, Sir William Dunn, School of Pathology, University of Oxford, Oxford, OX1 3RE, England REMARK revised by [3] REFERENCE 2 (bases 1 to 1125) AUTHORS Classon,B.J., Williams,A.F., Willis,A.C., Seed,B. and Stamenkovic,I. TITLE The primary structure of the human leukocyte antigen CD37, a species homologue of the rat MRC OX-44 antigen JOURNAL J. Exp. Med. 169 (4), 1497-1502 (1989) MEDLINE 89176904 REMARK revised by [3] Erratum:[J Exp Med 1990 Sep 1;172(3):1007]] REFERENCE 3 (bases 1 to 1125) AUTHORS Classon,B.J. TITLE Direct Submission JOURNAL Submitted (25-MAY-1990) to the EMBL/GenBank/DDBJ databases COMMENT The human leukocyte antigen CD37 is a species homolog of the rat MRC OX-44 antigen. Data kindly reviewed (23-Jun-1989) by Classon B.J. FEATURES Location/Qualifiers source 1..1125 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="CLL leukemia" /clone="CD37.1" CDS 64..909 /note="CD37 (AA 1-244)" /codon_start=1 /db_xref="PID:g29794" /db_xref="SWISS-PROT:P11049" /translation="MSAQESCLSLIKYFLFVFNLFFFVLGSLIFCFGIWILIDKTSFV SFVGLAFVPLQIWSKVLAISGIFTMGIALLGCVGALKELRCLLGLYFGMLLLLFATQI TLGILISTQRAQLERSLRDVVEKTIQKYGTNPEETAAEESWDYVQFQLRCCGWHYPQD WFQVLILRGNGSEAHRVPCSCYNLSATNDSTILDKVILPQLSRLGHLARSRHSADICA VPAESHIYREGCAQGLQKWLHNNLISIVGICLGVGLLELGFMTLSIFLCRNLDHVYNR LARYR" BASE COUNT 191 a 388 c 291 g 255 t ORIGIN 1 gtctccccca ctgtcagcac ctcttctgtg tggtgagtgg accgcttacc ccactaggtg 61 aagatgtcag cccaggagag ctgcctcagc ctcatcaagt acttcctctt cgttttcaac 121 ctcttcttct tcgtcctcgg cagcctgatc ttctgcttcg gcatctggat cctcatcgac 181 aagaccagct tcgtgtcctt tgtgggcttg gccttcgtgc ctctgcagat ctggtccaaa 241 gtcctggcca tctcaggaat cttcaccatg ggcatcgccc tcctgggttg tgtgggggcc 301 ctcaaggagc tccgctgcct cctgggcctg tattttggga tgctgctgct cctgtttgcc 361 acacagatca ccctgggaat cctcatctcc actcagcggg cccagctgga gcgaagcttg 421 cgggacgtcg tagagaaaac catccaaaag tacggcacca accccgagga gaccgcggcc 481 gaggagagct gggactatgt gcagttccag ctgcgctgct gcggctggca ctacccgcag 541 gactggttcc aagtcctcat cctgagaggt aacgggtcgg aggcgcaccg cgtgccctgc 601 tcctgctaca acttgtcggc gaccaacgac tccacaatcc tagataaggt gatcttgccc 661 cagctcagca ggcttggaca cctggcgcgg tccagacaca gtgcagacat ctgcgctgtc 721 cctgcagaga gccacatcta ccgcgagggc tgcgcgcagg gcctccagaa gtggctgcac 781 aacaacctta tttccatagt gggcatttgc ctgggcgtcg gcctactcga gctcgggttc 841 atgacgctct cgatattcct gtgcagaaac ctggaccacg tctacaaccg gctcgctcga 901 taccgttagg ccccgccctc cccaaagtcc cgccccgccc ccgtcacgtg cgctgggcac 961 ttccctgctg cctgtaaata tttgtttaat ccccagttcg cctggagccc tccgccttca 1021 cattcccctg gggacccacg tggctgcgtg cccctgctgc tgtcacctct cccacgggac 1081 ctggggcttt cgtccacagc ttcctgtccc catctgtcgg cctac // LOCUS HSCD69GNA 1702 bp RNA PRI 14-OCT-1994 DEFINITION H.sapiens CD69 gene. ACCESSION Z22576 NID g397938 KEYWORDS CD69. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1702) AUTHORS Lopez-Cabrera,M., Santis,A.G., Fernandez-Ruiz,E., Blacher,R., Esch,F., Sanchez-Mateos,P. and Sanchez-Madrid,F. TITLE Molecular cloning, expression, and chromosomal localization of the human earliest lymphocyte activation antigen AIM/CD69, a new member of the C-type animal lectin superfamily of signal-transmitting receptors JOURNAL J. Exp. Med. 178 (2), 537-547 (1993) MEDLINE 93340630 REFERENCE 2 (bases 1 to 1702) AUTHORS SANCHEZ-MADRID,F. TITLE Direct Submission JOURNAL Submitted (21-APR-1993) SANCHEZ-MADRID F., HOSPITAL DE LA PRINCESA, INMUNOLOGIA, C/ DIEGO DE LEON 62, MADRID, MADRID, SPAIN, 28006 FEATURES Location/Qualifiers source 1..1702 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="blood" /cell_type="activated peripheral lymphocyte" 5'UTR 1..81 CDS 82..681 /standard_name="CD69" /codon_start=1 /product="CD69" /db_xref="PID:g397939" /db_xref="SWISS-PROT:Q07108" /translation="MSSENCFVAENSSLHPESGQENDATSPHFSTRHEGSFQVPVLCA VMNVVFITILIIALIALSVGQYNCPGQYTFSMPSDSHVSSCSEDWVGYQRKCYFISTV KRSWTSAQNACSEHGATLAVIDSEKDMNFLKRYAGREEHWVGLKKEPGHPWKWSNGKE FNNWFNVTGSDKCVFLKNTEVSSMECEKNLYWICNKPYK" 3'UTR 682..1702 BASE COUNT 567 a 283 c 331 g 521 t ORIGIN 1 agactcaaca agagctccag caaagacttt cactgtagct tgacttgacc tgagattaac 61 tagggaatct tgagaataaa gatgagctct gaaaattgtt tcgtagcaga gaacagctct 121 ttgcatccgg agagtggaca agaaaatgat gccaccagtc cccatttctc aacacgtcat 181 gaagggtcct tccaagttcc tgtcctgtgt gctgtaatga atgtggtctt catcaccatt 241 ttaatcatag ctctcattgc cttatcagtg ggccaataca attgtccagg ccaatacaca 301 ttctcaatgc catcagacag ccatgtttct tcatgctctg aggactgggt tggctaccag 361 aggaaatgct actttatttc tactgtgaag aggagctgga cttcagccca aaatgcttgt 421 tctgaacatg gtgctactct tgctgtcatt gattctgaaa aggacatgaa ctttctaaaa 481 cgatacgcag gtagagagga acactgggtt ggactgaaaa aggaacctgg tcacccatgg 541 aagtggtcaa atggcaaaga atttaacaac tggttcaacg ttacagggtc tgacaagtgt 601 gtttttctga aaaacacaga ggtcagcagc atggaatgtg agaagaattt atactggata 661 tgtaacaaac cttacaaata ataaggaaac atgttcactt attgactatt atagaatgga 721 actcaaggaa atctgtgtca gtggatgctg ctctgtggtc cgaagtcttc catagagact 781 ttgtgaaaaa aaattttata gtgtcttggg aattttcttc caaacagaac tatggaaaaa 841 aaggaagaaa ttccaggaaa atctgcactg tgggctttta ttgccatgag ctagaagcat 901 cacaggttga ccaataacca tgcccaagaa tgagaagaat gactatgcaa cctttggatg 961 cactttatat tattttgaat ccagaaataa tgaaataact aggcgtggac ttactattta 1021 ttgctgaatg actaccaaca gtgagagccc ttcatgcatt tgcactactg gaaggagtta 1081 gatgttggta ctagatactg aatgtaaaca aaggaattat ggctggtaac ataggttttt 1141 agtctaattg aatcccttaa actcagggag catttataaa tggacaaatg cttatgaaac 1201 taagatttgt aatatttctc tctttttaga gaaatttgcc aatttacttt gttatttttc 1261 cccaaaaaga atgggatgat cgtgtattta tttttttact tcctcagctg tagacaggtc 1321 cttttcgatg gtacatattt ctttgccttt ataatctttt atacagtgtc ttacagagaa 1381 aagacataag caaagactat gaggaatatt tgcaagacat agaatagtgt tggaaaatgt 1441 gcaatatgtg atgtggcaaa tctctattag gaaatattct gtaatcttca gacctagaat 1501 aatactagtc ttataatagg tttgtgactt tcctaaatca attctattac gtgcaatact 1561 tcaatacttc atttaaaata tttttatgtg caataaaatg tatttgtttg tattttgtgt 1621 tcagtacaat tataagctgt ttttatatat gtgaaataaa agtagaataa acacaaaaaa 1681 aaaaaaaaaa aaaaaaaaaa aa // LOCUS HSCD97 2921 bp RNA PRI 30-OCT-1995 DEFINITION H.sapiens mRNA for leucocyte antigen CD97. ACCESSION X84700 NID g840770 KEYWORDS antigen CD97. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2921) AUTHORS Hamann,J., Eichler,W., Hamann,D., Kerstens,H.M., Poddighe,P.J., Hoovers,J.M., Hartmann,E., Strauss,M. and van Lier,R.A. TITLE Expression cloning and chromosomal mapping of the leukocyte activation antigen CD97, a new seven-span transmembrane molecule of the secretion receptor superfamily with an unusual extracellular domain JOURNAL J. Immunol. 155 (4), 1942-1950 (1995) MEDLINE 95363161 REFERENCE 2 (bases 1 to 2921) AUTHORS Hamann,J. TITLE Direct Submission JOURNAL Submitted (10-FEB-1995) J. Hamann, Central Lab. Netherlands Red Cross Blood Transfusion Service, CLB, Dept. KVI, Plesmanlaan 125, 1066 CX Amsterdam, NETHERLANDS FEATURES Location/Qualifiers source 1..2921 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="PBMC" /chromosome="19" /map="19p13.12-13.2" CDS 71..2299 /codon_start=1 /product="leucocyte antigen CD97" /db_xref="PID:g840771" /db_xref="SWISS-PROT:P48960" /translation="MGGRVFLAFCVWLTLPGAETQDSRGCARWCPQNSSCVNATACRC NPGFSSFSEIITTPTETCDDINECATPSKVSCGKFSDCWNTEGSYDCVCSPGYEPVSG AKTFKNESENTCQDVDECSSGQHQCDSSTVCFNTVGSYSCRCRPGWKPRHGIPNNQKD TVCEDMTFSTWTPPPGVHSQTLSRFFDKVQDLGRDSKTSSAEVTIQNVIKLVDELMEA PGDVEALAPPVRHLIATQLLSNLEDIMRILAKSLPKGPFTYISPSNTELTLMIQERGD KNVTMGQSSARMKLNWAVAAGAEDPGPAVAGILSIQNMTTLLANASLNLHSKKQAELE EIYESSIRGVQLRRLSAVNSIFLSHNNTKELNSPILFAFSHLESSDGEAGRDPPAKDV MPGPRQELLCAFWKSDSDRGGHWATEVCQVLGSKNGSTTCQCSHLSSFTILMAHYDVE DWKLTLITRVGLALSLFCLLLCILTFLLVRPIQGSRTTIHLHLCICLFVGSTIFLAGI ENEGGQVGLRCRLVAGLLHYCFLAAFCWMSLEGLELYFLVVRVFQGQGLSTRWLCLIG YGVPLLIVGVSAAIYSKGYGRPRYCWLDFEQGFLWSFLGPVTFIILCNAVIFVTTVWK LTQKFSEINPDMKKLKKARALTITAIAQLFLLGCTWVFGLFIFDDRSLVLTYVFTILN CLQGAFLYLLHCLLNKKVREEYRKWACLVAGGSKYSEFTSTTSGTGHNQTRALRASES GI" BASE COUNT 609 a 891 c 799 g 622 t ORIGIN 1 agcctgtgga gacgggacag ccctgtccca ctcactcttt cccctgccgc tcctgccggc 61 agctccaacc atgggaggcc gcgtctttct cgcattctgt gtctggctga ctctgccggg 121 agctgaaacc caggactcca ggggctgtgc ccggtggtgc cctcagaact cctcgtgtgt 181 caatgccacc gcctgtcgct gcaatccagg gttcagctct ttttctgaga tcatcaccac 241 cccgacggag acttgtgacg acatcaacga gtgtgcaaca ccgtcgaaag tgtcatgcgg 301 aaaattctcg gactgctgga acacagaggg gagctacgac tgcgtgtgca gcccgggata 361 tgagcctgtt tctggggcaa aaacattcaa gaatgagagc gagaacacct gtcaagatgt 421 ggacgagtgc agctccgggc agcatcagtg tgacagctcc accgtctgct tcaacaccgt 481 gggttcatac agctgccgct gccgcccagg ctggaagccc agacacggaa tcccgaataa 541 ccaaaaggac actgtctgtg aagatatgac tttctccacc tggaccccgc cccctggagt 601 ccacagccag acgctttccc gattcttcga caaagtccag gacctgggca gagactccaa 661 gacaagctca gccgaggtca ccatccagaa tgtcatcaaa ttggtggatg aactgatgga 721 agctcctgga gacgtagagg ccctggcgcc acctgtccgg cacctcatag ccacccagct 781 gctctcaaac cttgaagata tcatgaggat cctggccaag agcctgccta aaggcccctt 841 cacctacatt tccccttcga acacagagct gaccctgatg atccaggagc ggggggacaa 901 gaacgtcact atgggtcaga gcagcgcacg catgaagctg aattgggctg tggcagctgg 961 agccgaggat ccaggccccg ccgtggcggg catcctctcc atccagaaca tgacgacatt 1021 gctggccaat gcctccttga acctgcattc caagaagcaa gccgaactgg aggagatata 1081 tgaaagcagc atccgtggtg tccaactcag acgcctctct gccgtcaact ccatctttct 1141 gagccacaac aacaccaagg aactcaactc ccccatcctt ttcgccttct cccaccttga 1201 gtcctccgat ggggaggcgg gaagagaccc tcctgccaag gacgtgatgc ctgggccacg 1261 gcaggagctg ctctgtgcct tctggaagag tgacagcgac aggggagggc actgggccac 1321 cgaggtctgc caggtgctgg gcagcaagaa cggcagcacc acctgccaat gcagccacct 1381 gagcagcttt acgatcctta tggctcatta tgacgtggag gactggaagc tgaccctgat 1441 caccagggtg ggactggcgc tgtcactctt ctgcctgctg ctgtgcatcc tcactttcct 1501 gctggtgcgg cccatccagg gctcgcgcac caccatacac ctgcacctct gcatctgcct 1561 cttcgtgggc tccaccatct tcctggccgg catcgagaac gaaggcggcc aggtggggct 1621 gcgctgccgc ctggtggccg ggctgctgca ctactgtttc ctggccgcct tctgctggat 1681 gagcctcgaa ggcctggagc tctactttct tgtggtgcgc gtgttccaag gccagggcct 1741 gagtacgcgc tggctctgcc tgatcggcta tggcgtgccc ctgctcatcg tgggcgtctc 1801 ggctgccatc tacagcaagg gctacggccg ccccagatac tgctggttgg actttgagca 1861 gggcttcctc tggagcttct tgggacctgt gaccttcatc attttgtgca atgctgtcat 1921 tttcgtgact accgtctgga agctcactca gaagttttct gaaatcaatc cagacatgaa 1981 gaaattaaag aaggcgaggg cgctgaccat cacggccatc gcgcagctct tcctgttggg 2041 ctgcacctgg gtctttggcc tgttcatctt cgacgatcgg agcttggtgc tgacctatgt 2101 gtttaccatc ctcaactgcc tgcagggcgc cttcctctac ctgctgcact gcctgctcaa 2161 caagaaggtt cgggaagaat accggaagtg ggcctgccta gttgctgggg ggagcaagta 2221 ctcagaattc acctccacca cgtctggcac tggccacaat cagacccggg ccctcagggc 2281 atcagagtcc ggcatatgaa ggcgcatggt tctggacggc ccagcagctc ctgtggccac 2341 agcagctttg tacacgaaga ccatccatcc tcccttcgtc caccactcta ctccctccac 2401 cctccctccc tgatcccgtg tgccaccagg agggagtggc agctatagtc tggcaccaaa 2461 gtccaggaca cccagtgggg tggagtcgga gccactggtc ctgctgctgg ctgcctctct 2521 gctccacctt gtgacccagg gtggggacag gggctggccc agggctgcaa tgcagcatgt 2581 tgccctggca cctgtggcca gtactcggga cagactaagg gcgcttgtcc catcctggac 2641 ttttcctctc atgtctttgc tgcagaactg aagagactag gcgctggggc tcagcttccc 2701 tcttaagcta agactgatgt cagaggcccc atggcgaggc cccttggggc cactgcctga 2761 ggctcacggt acagaggcct gccctgcctg gccgggcagg aggttctcac tgttgtgaag 2821 gttgtagacg ttgtgtaatg tgtttttatc tgttaaaatt tttcagtgtt gacacttaaa 2881 attaaacaca tgcatacaga aaaaaaaaaa aaaaaaaaaa a // LOCUS HSCDC2 1050 bp RNA PRI 12-SEP-1993 DEFINITION Human CDC2 gene involved in cell cycle control. ACCESSION X05360 NID g29838 KEYWORDS CDC2 gene; cell cycle control. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1050) AUTHORS Lee,M.G. and Nurse,P. TITLE Complementation used to clone a human homologue of the fission yeast cell cycle control gene cdc2 JOURNAL Nature 327 (6117), 31-35 (1987) MEDLINE 87201915 FEATURES Location/Qualifiers source 1..1050 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 127..1020 /note="CDC2 polypeptide (CDC2) (AA 1-297)" /codon_start=1 /db_xref="PID:g29839" /db_xref="SWISS-PROT:P06493" /translation="MEDYTKIEKIGEGTYGVVYKGRHKTTGQVVAMKKIRLESEEEGV PSTAIREISLLKELRHPNIVSLQDVLMQDSRLYLIFEFLSMDLKKYLDSIPPGQYMDS SLVKSYLYQILQGIVFCHSRRVLHRDLKPQNLLIDDKGTIKLADFGLARAFGIPIRVY THEVVTLWYRSPEVLLGSARYSTPVDIWSIGTIFAELATKKPLFHGDSEIDQLFRIFR ALGTPNNEVWPEVESLQDYKNTFPKWKPGSLASHVKNLDENGLDLLSKMLIYDPAKRI SGKMALNHPYFNDLDNQIKKM" BASE COUNT 324 a 195 c 235 g 296 t ORIGIN 1 gggggggggg ggcacttggc ttcaaagctg gctcttggaa attgagcgga gacgagcggc 61 ttgttgtagc tgccgtgcgg ccgccgcgga ataataagcc gggatctacc ataccattga 121 ctaactatgg aagattatac caaaatagag aaaattggag aaggtaccta tggagttgtg 181 tataagggta gacacaaaac tacaggtcaa gtggtagcca tgaaaaaaat cagactagaa 241 agtgaagagg aaggggttcc tagtactgca attcgggaaa tttctctatt aaaggaactt 301 cgtcatccaa atatagtcag tcttcaggat gtgcttatgc aggattccag gttatatctc 361 atctttgagt ttctttccat ggatctgaag aaatacttgg attctatccc tcctggtcag 421 tacatggatt cttcacttgt taagagttat ttataccaaa tcctacaggg gattgtgttt 481 tgtcactcta gaagagttct tcacagagac ttaaaacctc aaaatctctt gattgatgac 541 aaaggaacaa ttaaactggc tgattttggc cttgccagag cttttggaat acctatcaga 601 gtatatacac atgaggtagt aacactctgg tacagatctc cagaagtatt gctggggtca 661 gctcgttact caactccagt tgacatttgg agtataggca ccatatttgc tgaactagca 721 actaagaaac cacttttcca tggggattca gaaattgatc aactcttcag gattttcaga 781 gctttgggca ctcccaataa tgaagtgtgg ccagaagtgg aatctttaca ggactataag 841 aatacatttc ccaaatggaa accaggaagc ctagcatccc atgtcaaaaa cttggatgaa 901 aatggcttgg atttgctctc gaaaatgtta atctatgatc cagccaaacg aatttctggc 961 aaaatggcac tgaatcatcc atattttaat gatttggaca atcagattaa gaagatgtag 1021 ctttctgaca aaaagtttcc atatgttatg // LOCUS HSCDC27 2592 bp mRNA PRI 13-JAN-1995 DEFINITION Human homologue of S. pombe nuc2+ and A. nidulans bimA. ACCESSION U00001 M78440 T03211 NID g405832 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2592) AUTHORS Tugendreich,S., Boguski,M.S., Seldin,M. and Hieter,P.H. TITLE Linking yeast genetics to mammalian genomes: identification and mapping of the human homolog of CDC27 via the expressed sequence tag database JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90, 10031-10035 (1993) MEDLINE 94052097 REFERENCE 2 (bases 1 to 2592) AUTHORS Tugendreich,S., Boguski,M.S., Seldin,M. and Hieter,P.H. TITLE Direct Submission JOURNAL Submitted (04-MAR-1993) Department of Molecular Biology and Genetics, The Johns Hopkins University School of Medicine, 725 N. Wolfe Street, Baltimore, Maryland 21205-2185, U.S.A. FEATURES Location/Qualifiers source 1..2592 /organism="Homo sapiens" /db_xref="taxon:9606" /map="17q21-24 between ERBB2 and PKCA genes" gene 1..2592 /gene="CDC27" CDS 59..2530 /gene="CDC27" /codon_start=1 /product="CDC27" /db_xref="PID:g405833" /translation="MTVLQEPVQAAIWQALNHYAYRDAVFLAERLYAEVHSEEALFLL ATCYYRSGKAYKAYRLLKGHSCTTPQCKYLLAKCCVDLSKLAEGEQILSGGVFNKQKS HDDIVTEFGDSACFTLSLLGHVYCKTDRLAKGSECYQKSLSLNPFLWSPFESLCEIGE KPDPDQTFKFTSLQNFSNCLPNSCTTQVPNHSLSHRQPETVLTETPQDTIELNRLNLE SSNSKYSLNTDSSVSYIDSAVISPDTVPLGTGTSILSKQVQNKPKTGRSLLGGPAALS PLTPSFGILPLETPSPGDGSYLQNYTNTPPVIDVPSTGAPSKKSVARIGQTGTKSVFS QSGNSREVTPILAQTQSSGPQTSTTPQVLSPTITSPPNALPRRSSRLFTSDSSTTKEN SKKLKMKFPPKIPNRKTKSKTNKGGITQPNINDSLEITKLDSSIISEGKISTITPQIQ AFNLQKAAAGLMSLLREMGKGYLALCSYNCKEAINILSHLPSHHYNTGWVLCQIGRAY FELSEYMQAERIFSEVRRIENYRVEGMEIYSTTLWHLQKDVALSVLSKDLTDMDKNSP EAWCAAGNCFSLQREHDIAIKFFQRAIQVDPNYAYAYTLLGHEFVLTEELDKALACFR NAIRVNPRHYNAWYGLGMIYYKQEKFSLAEMHFQKALDINPQSSVLLCHIGVVQHALK KSEKALDTLNKAIVIDPKNPLCKFHRASVLFRNEKYKSALQELEELKQIVPKESLVYF LIGKVYKKLGQTHLALMNFSWAMDLDPKGANNQIKEAIDKRYLPDDEEPITQEEQIMG TDESQESSMTDADDTQLHAAESDEF" BASE COUNT 855 a 547 c 500 g 690 t ORIGIN 1 gaattcccgc tacagggggg gcctgaggca ctgcagaaag tgggcctgag cctcgaggat 61 gacggtgctg caggaacccg tccaggctgc tatatggcaa gcactaaacc actatgctta 121 ccgagatgcg gttttcctcg cagaacgcct ttatgcagaa gtacactcag aagaagcctt 181 gtttttactg gcaacctgtt attaccgctc aggaaaggca tataaagcat atagactctt 241 gaaaggacac agttgtacta caccgcaatg caaatacctg cttgcaaaat gttgtgttga 301 tctcagcaag cttgcagaag gggaacaaat cttatctggt ggagtgttta ataagcagaa 361 aagccatgat gatattgtta ctgagtttgg tgattcagct tgctttactc tttcattgtt 421 gggacatgta tattgcaaga cagatcggct tgccaaagga tcagaatgtt accaaaagag 481 ccttagttta aatcctttcc tctggtctcc ctttgaatca ttatgtgaaa taggtgaaaa 541 gccagatcct gaccaaacat ttaaattcac atctttacag aactttagca actgtctgcc 601 caactcttgc acaacacaag tacctaatca tagtttatct cacagacagc ctgagacagt 661 tcttacggaa acaccccagg acacaattga attaaacaga ttgaatttag aatcttccaa 721 ttcaaagtac tccttgaata cagattcctc agtgtcttat attgattcag ctgtaatttc 781 acctgatact gtcccactgg gaacaggaac ttccatatta tctaaacagg ttcaaaataa 841 accaaaaact ggtcgaagtt tattaggagg accagcagct cttagtccat taaccccaag 901 ttttgggatt ttgccattag aaaccccaag tcctggagat ggatcctatt tacaaaacta 961 cactaataca cctcctgtaa ttgatgtgcc atccaccgga gccccttcaa aaaagtctgt 1021 tgccagaatc ggccaaactg gaacaaagtc tgtcttctca cagagtggaa atagccgaga 1081 ggtaactcca attcttgcac aaacacaaag ttctggtcca caaacaagta caacacctca 1141 ggtattgagc cccactatta catctccccc aaacgcacta cctcgaagaa gttcacgact 1201 ctttactagt gacagctcca caaccaagga gaatagcaaa aaattaaaaa tgaagtttcc 1261 acctaaaatc ccaaacagaa aaacaaaaag taaaactaat aaaggaggaa taactcaacc 1321 taacataaat gatagcctgg aaattacaaa attggactct tccatcattt cagaagggaa 1381 aatatccaca atcacacctc agattcaggc ctttaatcta caaaaagcag cagcaggttt 1441 gatgagcctt cttcgtgaaa tggggaaagg ttatttagct ttgtgttcat acaactgcaa 1501 agaagctata aatattttga gccatctacc ttctcaccac tacaatactg gttgggtact 1561 gtgccaaatt ggaagggcct attttgaact ttcagagtac atgcaagctg aaagaatatt 1621 ctcagaggtt agaaggattg agaattatag agttgaaggc atggagatct actctacaac 1681 actttggcat cttcaaaaag atgttgctct ttcagttctg tcaaaagact taacagacat 1741 ggataaaaat tcgccagagg cctggtgtgc tgcagggaac tgtttcagtc tgcaacggga 1801 acatgatatt gcaattaaat tcttccagag agctatccaa gttgatccaa attacgctta 1861 tgcctatact ctattagggc atgagtttgt cttaactgaa gaattggaca aagcattagc 1921 ttgttttcga aatgctatca gagtcaatcc tagacattat aatgcatggt atggtttagg 1981 aatgatttat tacaagcaag aaaaattcag ccttgcagaa atgcatttcc aaaaagcgct 2041 tgatatcaac cctcaaagtt cagttttact ttgccacatt ggagtagttc aacatgcact 2101 gaaaaaatca gagaaggctt tggataccct aaacaaagcc attgtcattg atcccaagaa 2161 ccctctatgc aaatttcaca gagcctcagt tttatttcga aatgaaaaat ataagtctgc 2221 tttacaagaa cttgaagaat tgaaacaaat tgttcccaaa gaatccctcg tttacttctt 2281 aataggaaag gtttacaaga agttaggtca aacgcacctc gccctgatga atttctcttg 2341 ggctatggat ttagatccta aaggagccaa taaccagatt aaagaggcaa ttgataagcg 2401 ttatcttcca gatgatgagg agccaataac ccaagaagaa cagatcatgg gaacagatga 2461 atcccaggag agcagcatga cagatgcgga tgacacacaa cttcatgcag ctgaaagtga 2521 tgaattttaa cttctggaaa tcagactttt acaactggat gtgtgactag tgctgacgtg 2581 tttctggaat tc // LOCUS HSCDEIBPA 3723 bp RNA PRI 14-MAR-1995 DEFINITION H.sapiens CDEI binding protein mRNA. ACCESSION Z22572 NID g394763 KEYWORDS amyloid precursor-like protein; CDEI binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3723) AUTHORS von der Kammer,H., Loffler,C., Hanes,J., Klaudiny,J., Scheit,K.H. and Hansmann,I. TITLE The gene for the amyloid precursor-like protein APLP2 is assigned to human chromosome 11q23-q25 JOURNAL Genomics 20 (2), 308-311 (1994) MEDLINE 94292219 REFERENCE 2 (bases 1 to 3723) AUTHORS von der Kammer,H. TITLE Direct Submission JOURNAL Submitted (20-APR-1993) Heinz von der Kammer, Abteilung fuer molekulare Biologie, Max-Planck-Institut fuer biophysikalische Chemie, Am Fassberg 11, Goettingen, D-3400, Germany REFERENCE 3 (bases 1 to 3723) AUTHORS von der Kammer,H., Hanes,J., Klaudiny,J. and Scheit,K.H. TITLE A human amyloid precursor-like protein is highly homologous to a mouse sequence-specific DNA-binding protein JOURNAL DNA Cell Biol. 13 (11), 1137-1143 (1994) MEDLINE 95217334 FEATURES Location/Qualifiers source 1..3723 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="ovary" /map="11q23-q25" CDS 68..2359 /note="putative" /codon_start=1 /product="CDEI binding protein" /db_xref="PID:g394764" /db_xref="SWISS-PROT:Q06481" /translation="MAATGTAAAAATGRLLLLLLVGLTAPALALAGYIEALAANAGTG FAVAEPQIAMFCGKLNMHVNIQTGKWEPDPTGTKSCFETKEEVLQYCQEMYPELQITN VMEANQRVSIDNWCRRDKKQCKSRFVTPFKCLVGEFVSDVLLVPEKCQFFHKERMEVC ENHQHWHTVVKEACLTQGMTLYSYGMLLPCGVDQFHGTEYVCCPQTKIIGSVSKEEEE EDEEEEEEEDEEEDYDVYKSEFPTEADLEDFTEAAVDEDDEDEEEGEEVVEDRDYYYD TFKGDDYNEENPTEPGSDGTMSDKEITHDVKAVCSQEAMTGPCRAVMPRWYFDLSKGK CVRFIYGGCGGNRNNFESEDYCMAVCKAMIPPTPLPTNDVDVYFETSADDNEHARFQK AKEQLEIRHRNRMDRVKKEWEEAELQAKNLPKAERQTLIQHFQAMVKALEKEAASEKQ QLVETHLARVEAMLNDRRRMALENYLAALQSDPPRPHRILQALRRYVRAENKDRLHTI RHYQHVLAVDPEKAAQMKSQVMTHLHVIEERRNQSLSLLYKVPYVAQEIQEEIDELLQ EQRADMDQFTASISETPVDVRVSSEESEEIPPFHPFHPFPALPENEDTQPELYHPMKK GSGVGEQDGGLIGAEEKVINSKNKVDENMVIDETLDVKEMIFNAERVGGLEEERESVG PLREDFSLSSSALIGLLVIAVAIATVIVISLVMLRKRQYGTISHGIVEVDPMLTPEER HLNKMQNHGYENPTYKYLEQMQI" polyA_signal 2685..2690 polyA_site 2721 polyA_signal 3696..3701 polyA_site 3715 BASE COUNT 992 a 843 c 975 g 913 t ORIGIN 1 ggtgtgctaa gcgaggagtc cgagtgtgtg agcttgagag ccgcgcgcta gagcgacccg 61 gcgagggatg gcggccaccg ggaccgcggc cgccgcagcc acgggcaggc tcctgcttct 121 gctgctggtg gggctcacgg cgcctgcctt ggcgctggcc ggctacatcg aggctcttgc 181 agccaatgcc ggaacaggat ttgctgttgc tgagcctcaa atcgcaatgt tttgtgggaa 241 gttaaatatg catgtgaaca ttcagactgg gaaatgggaa cctgatccaa caggcaccaa 301 gagctgcttt gaaacaaaag aagaagttct tcagtactgt caggagatgt atccagagct 361 acagatcaca aatgtgatgg aggcaaacca gcgggttagt attgacaact ggtgccggag 421 ggacaaaaag caatgcaaga gtcgctttgt tacacctttc aagtgtctcg tgggtgaatt 481 tgtaagtgat gtcctgctag ttccagaaaa gtgccagttt ttccacaaag agcggatgga 541 ggtgtgtgag aatcaccagc actggcacac ggtagtcaaa gaggcatgtc tgactcaggg 601 aatgacctta tatagctacg gcatgctgct cccatgtggg gtagaccagt tccatggcac 661 tgaatatgtg tgctgccctc agacaaagat tattggatct gtgtcaaaag aagaggaaga 721 ggaagatgaa gaggaagagg aagaggaaga tgaagaggaa gactatgatg tttataaaag 781 tgaatttcct actgaagcag atctggaaga cttcacagaa gcagctgtgg atgaggatga 841 tgaggatgag gaagaagggg aggaagtggt ggaggaccga gattactact atgacacctt 901 caaaggagat gactacaatg aggagaatcc tactgaaccc ggcagcgacg gcaccatgtc 961 agacaaggaa attactcatg atgtcaaagc tgtctgctcc caggaggcga tgacggggcc 1021 ctgccgggcc gtgatgcctc gttggtactt cgacctctcc aagggaaagt gcgtgcgctt 1081 tatatatggt ggctgcggcg gcaacaggaa caattttgag tctgaggatt attgtatggc 1141 tgtgtgtaaa gcgatgattc ctccaactcc tctgccaacc aatgatgttg atgtgtattt 1201 cgagacctct gcagatgata atgagcatgc tcgcttccag aaggctaagg agcagctgga 1261 gattcggcac cgcaaccgaa tggacagggt aaagaaggaa tgggaagagg cagagcttca 1321 agctaagaac ctccccaaag cagagaggca gactctgatt cagcacttcc aagccatggt 1381 taaagcttta gagaaggaag cagccagtga gaagcagcag ctggtggaga cccacctggc 1441 ccgagtggaa gctatgctga atgaccgccg tcggatggct ctggagaact acctggctgc 1501 cttgcagtct gacccgccac ggcctcatcg cattctccag gccttacggc gttatgtccg 1561 tgctgagaac aaagatcgct tacataccat ccgtcattac cagcatgtgt tggctgttga 1621 cccagaaaag gcggcccaga tgaaatccca ggtgatgaca catctccacg tgattgaaga 1681 aaggaggaac caaagcctct ctctgctcta caaagtacct tatgtagccc aagaaattca 1741 agaggaaatt gatgagctcc ttcaggagca gcgtgcagat atggaccagt tcactgcctc 1801 aatctcagag acccctgtgg acgtccgggt gagctctgag gagagtgagg agatcccacc 1861 gttccacccc ttccacccct tcccagccct acctgagaac gaagacactc agccggagtt 1921 gtaccaccca atgaaaaaag gatctggagt gggagagcag gatgggggac tgatcggtgc 1981 cgaagagaaa gtgattaaca gtaagaataa agtggatgaa aacatggtca ttgacgagac 2041 tctggatgtt aaggaaatga ttttcaatgc cgagagagtt ggaggcctcg aggaagagcg 2101 ggaatccgtg ggcccactgc gggaggactt cagtctgagt agcagtgctc tcattggcct 2161 gctggtcatc gcagtggcca ttgccacggt catcgtcatc agcctggtga tgctgaggaa 2221 gaggcagtat ggcaccatca gccacgggat cgtggaggtt gatccaatgc tcaccccaga 2281 agagcgtcac ctgaacaaga tgcagaacca tggctatgag aaccccacct acaaatacct 2341 ggagcagatg cagatttagg tggcagggag cgcggcagcc ctggcggagg gatgcaggtg 2401 ggccggaaga tcccacgatt ccgatcgact gccaagcagc agccgctgcc aggggctgcg 2461 tctgacatcc tgacctcctg gactgtagga ctatataaag tactactgta gaactgcaat 2521 ttccattctt ttaaatgggt gaaaaatggt aatataacaa tatatgatat ataaacctta 2581 aatgaaaaaa atgatctatt gcagatattt gatgtagttt tcttttttaa attaatcaga 2641 aaccccactt ccattgtatt gtctgacaca tgctctcaat atataataaa tgggaaatgt 2701 cgattttcaa taatagactt atatgcaggc tgtcgttccg gttatgttgt gtaagtcaac 2761 tcttcagcct cattcactgt cctggctttt atttaaagaa aaaaaaggca gtattccctt 2821 tttaaatgag ctttcaggaa gttgctgaga aatggggtgg aatagggaac tgtaatggcc 2881 actgaagcac gtgagagacc ctcgcaaaat gatgtgaaag gaccagtttc ttgaagtcca 2941 gtgtttccac ggctggatac ctgtgtgtct ccataaaagt cctgtcacca aggacgttaa 3001 aggcatttta ttccagcgtc ttctagagag cttagtgtat acagatgagg gtgtctgctg 3061 ctgctttcct tcggaatcca gtgcttccac agagattagc ctgtagctta tatttgacat 3121 tcttcactgt ctgttgttta cctaccgtag ctttttaccg ttcacttccc cttccaacta 3181 tgtccagatg tgcaggctcc tcctctctgg actttctcca aaggcactga ccctcggcct 3241 ctactttgtc ccctcacctc caccccctcc tgtcaccggc cttgtgacat tcactcagag 3301 aagaccacac caaggaggcg gccgctggcc caggagagaa cacggggagg tttgtttgtg 3361 tgaaaggaaa gtagtccagg ctgtccctga aactgagtct gtggacactg tggaaagctt 3421 tgaacaattg tgttttcgtc acaggagtct ttgtaatgct tgtacagttg atgtcgatgc 3481 tcactgcttc tgctttttct ttcttttatt ttaaatctga aggttctggt aacctgtggt 3541 gtatttttat tttcctgtga ctgtttttgt tttgtttttt tcctttttcc tcccctttga 3601 ccctattcat gtctctaccc actatgcaca gattaaactt cacctacaaa ctccttaata 3661 tgatctgtgg agaatgtaca cagtttaaac acatcaataa atactttaac ttccaaaaaa 3721 aaa // LOCUS HSCDK2MR 1476 bp RNA PRI 15-JAN-1992 DEFINITION H.sapiens CDK2 mRNA. ACCESSION X61622 NID g29848 KEYWORDS CDK2 gene; cell cycle regulation protein; cyclin A binding; protein kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1476) AUTHORS Elledge,S.J. and Spottswood,M.R. TITLE A new human p34 protein kinase, CDK2, identified by complementation of a cdc28 mutation in Saccharomyces cerevisiae, is a homolog of Xenopus Eg1 JOURNAL EMBO J. 10 (9), 2653-2659 (1991) MEDLINE 91330891 REFERENCE 2 (bases 1 to 1476) AUTHORS Elledge,S.J. TITLE Direct Submission JOURNAL Submitted (28-NOV-1991) S.J. Elledge, Dept. of Biochemistry, Baylor College of Medicine, 1 Baylor Place, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..1476 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="EBV transformed Human peripheral lymphocyte (B-cell)" /clone_lib="lambda YES-R cDNA library" /clone="pSE1000" gene 1..897 /gene="CDK2" CDS 1..897 /gene="CDK2" /function="protein kinase" /note="cell division kinase. CDC2 homolog" /codon_start=1 /db_xref="PID:g29849" /db_xref="SWISS-PROT:P24941" /translation="MENFQKVEKIGEGTYGVVYKARNKLTGEVVALKKIRLDTETEGV PSTAIREISLLKELNHPNIVKLLDVIHTENKLYLVFEFLHQDLKKFMDASALTGIPLP LIKSYLFQLLQGLAFCHSHRVLHRDLKPQNLLINTEGAIKLADFGLARAFGVPVRTYT HEVVTLWYRAPEILLGSKYYSTAVDIWSLGCIFAEMVTRRALFPGDSEIDQLFRIFRT LGTPDEVVWPGVTSMPDYKPSFPKWARQDFSKVVPPLDEDGRSLLSQMLHYDPNKRIS AKAALAHPFFQDVTKPVPHLRL" BASE COUNT 368 a 372 c 351 g 385 t ORIGIN 1 atggagaact tccaaaaggt ggaaaagatc ggagagggca cgtacggagt tgtgtacaaa 61 gccagaaaca agttgacggg agaggtggtg gcgcttaaga aaatccgcct ggacactgag 121 actgagggtg tgcccagtac tgccatccga gagatctctc tgcttaagga gcttaaccat 181 cctaatattg tcaagctgct ggatgtcatt cacacagaaa ataaactcta cctggttttt 241 gaatttctgc accaagatct caagaaattc atggatgcct ctgctctcac tggcattcct 301 cttcccctca tcaagagcta tctgttccag ctgctccagg gcctagcttt ctgccattct 361 catcgggtcc tccaccgaga ccttaaacct cagaatctgc ttattaacac agagggggcc 421 atcaagctag cagactttgg actagccaga gcttttggag tccctgttcg tacttacacc 481 catgaggtgg tgaccctgtg gtaccgagct cctgaaatcc tcctgggctc gaaatattat 541 tccacagctg tggacatctg gagcctgggc tgcatctttg ctgagatggt gactcgccgg 601 gccctgttcc ctggagattc tgagattgac cagctcttcc ggatctttcg gactctgggg 661 accccagatg aggtggtgtg gccaggagtt acttctatgc ctgattacaa gccaagtttc 721 cccaagtggg cccggcaaga ttttagtaaa gttgtacctc ccctggatga agatggacgg 781 agcttgttat cgcaaatgct gcactacgac cctaacaagc ggatttcggc caaggcagcc 841 ctggctcacc ctttcttcca ggatgtgacc aagccagtac cccatcttcg actctgatag 901 ccttcttgaa gcccccgacc ctaatcggct caccctctcc tccagtgtgg gcttgaccag 961 cttggccttg ggctatttgg actcaggtgg gccctctgaa cttgccttaa acactcacct 1021 tctagtctta accagccaac tctgggaata caggggtgaa aggggggaac cagtgaaaat 1081 gaaaggaagt ttcagtatta gatgcactta agttagcctc caccaccctt tcccccttct 1141 cttagttatt gctgaagagg gttggtataa aaataatttt aaaaaagcct tcctacacgt 1201 tagatttgcc gtaccaatct ctgaatgccc cataattatt atttccagtg tttgggatga 1261 ccaggatccc aagcctcctg ctgccacaat gtttataaag gccaaatgat agcgggggct 1321 aagttggtgc ttttgagaat taagtaaaac aaaaccactg ggaggagtct attttaaaga 1381 attcggttaa aaaatagatc caatcagttt ataccctagt tagtgttttc ctcacctaat 1441 aggctgggag actgaagact cagcccgggt gggggt // LOCUS HSCDKAK 1373 bp RNA PRI 16-FEB-1995 DEFINITION H.sapiens CDK activating kinase mRNA. ACCESSION X77743 NID g468788 KEYWORDS Cdk-activating kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1373) AUTHORS Darbon,J.M., Devault,A., Taviaux,S., Fesquet,D., Martinez,A.M., Galas,S., Cavadore,J.C., Doree,M. and Blanchard,J.M. TITLE Cloning, expression and subcellular localization of the human homolog of p40MO15 catalytic subunit of cdk-activating kinase JOURNAL Oncogene 9 (11), 3127-3138 (1994) MEDLINE 95022621 REFERENCE 2 (bases 1 to 1373) AUTHORS Darbon,J. TITLE Direct Submission JOURNAL Submitted (17-FEB-1994) J. Darbon, CRBM CNRS et INSERM, 1919 route de Mende, BP 5051, 34033 Montpellier Cedex, FRANCE FEATURES Location/Qualifiers source 1..1373 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Hela" /chromosome="5" gene 35..1075 /gene="HUM015" CDS 35..1075 /gene="HUM015" /codon_start=1 /product="CDK activating kinase" /db_xref="PID:g468789" /translation="MALDVKSRAKRYEKLDFLGEGQFATVYKARDKNTNQIVAIKKIK LGHRSEAKDGINRTALREIKLLQELSHPNIIGLLDAFGHKSNISLVFDFMETDLEVII KDNSLVLTPSHIKAYMLMTLQGLEYLHQHWILHRDLKPNNLLLDENGVLKLADFGLAK SFGSPNRAYTHQVVTRWYRAPELLFGARMYGVGVDMWAVGCILAELLLRVPFLPGDSD LDQLTRIFETLGTPTEEQWPDMCSLPDYVTFKSFPGIPLHHIFSAAGDDLLDLIQGLF LFNPCARITATQALKMKYFSNRPGPTPGCQLPRPNCPVETLKEQSNPALAIKRKRTEA LEQGGLPKKLIF" BASE COUNT 464 a 235 c 294 g 380 t ORIGIN 1 gcttttcggc tggagtcggg ctttacggcg ccggatggct ctggacgtga agtctcgggc 61 aaagcgttat gagaagctgg acttccttgg ggagggacag tttgccaccg tttacaaggc 121 cagagataag aataccaacc aaattgtcgc cattaagaaa atcaaacttg gacatagatc 181 agaagctaaa gatggtataa atagaaccgc cttaagagag ataaaattat tacaggagct 241 aagtcatcca aatataattg gtctccttga tgcttttgga cataaatcta atattagcct 301 tgtctttgat tttatggaaa ctgatctaga ggttataata aaggataata gtcttgtgct 361 gacaccatca cacatcaaag cctacatgtt gatgactctt caaggattag aatatttaca 421 tcaacattgg atcctacata gggatctgaa accaaacaac ttgttgctag atgaaaatgg 481 agttctaaaa ctggcagatt ttggcctggc caaatctttt gggagcccca atagagctta 541 tacacatcag gttgtaacca ggtggtatcg ggcccccgag ttactatttg gagctaggat 601 gtatggtgta ggtgtggaca tgtgggctgt tggctgtata ttagcagagt tacttctaag 661 ggttcctttt ttgccaggag attcagacct tgatcagcta acaagaatat ttgaaacttt 721 gggcacacca actgaggaac agtggccgga catgtgtagt cttccagatt atgtgacatt 781 taagagtttc cctggaatac ctttgcatca catcttcagt gcagcaggag acgacttact 841 agatctcata caaggcttat tcttatttaa tccatgtgct cgaattacgg ccacacaggc 901 actgaaaatg aagtatttca gtaatcggcc agggccaaca cctggatgtc agctgccaag 961 accaaactgt ccagtggaaa ccttaaagga gcaatcaaat ccagctttgg caataaaaag 1021 gaaaagaaca gaggccttag aacaaggagg attgcccaag aaactaattt tttaaagaga 1081 acactggaca acattttact actgagggaa atagccaaaa aggcaaataa tggaaaaata 1141 gtaaacatta agtaaatgct gtagaagtga gtttgtaaat attctacaca tgtaaaatat 1201 gtaaaactat gggttatttt tattaaatgt attttaaaat aaaaatttaa ttctggtttt 1261 tctgattaga gtcccaaagt gagaaaagtt caatactctt gaaatgtaga attgaaaatg 1321 cattagggaa aacttaataa aaattattac cagttatttg gaaaaaaaaa aaa // LOCUS HSCDMRN 1314 bp RNA PRI 28-JUN-1995 DEFINITION H.sapiens CDM mRNA. ACCESSION Z31696 NID g479156 KEYWORDS CDM gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1314) AUTHORS Mosser,J., Sarde,C.O., Vicaire,S., Yates,J.R. and Mandel,J.L. TITLE A new human gene (DXS1357E) with ubiquitous expression, located in Xq28 adjacent to the adrenoleukodystrophy gene JOURNAL Genomics 22 (2), 469-471 (1994) MEDLINE 95104864 REFERENCE 2 (bases 1 to 1314) AUTHORS Sarde,C.O. TITLE Direct Submission JOURNAL Submitted (30-MAR-1994) Claude O Sarde, Human Genetics, U184/INSERM-LGME/CNRS, Institut de Chimie Biologique, Faculte de Medecine, 11, Rue Humann, Strasbourg, 67085 Cedex, France FEATURES Location/Qualifiers source 1..1314 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="CDM" /cell_type="Fibroblast" /clone_lib="Human fetal fibroblast cDNA library" /chromosome="X" CDS 137..877 /codon_start=1 /product="CDM" /db_xref="PID:g479157" /translation="MSLQWTAVATFLYAEVFVVLLLCIPFISPKRWQKIFKSRLVELL VSYGNTFFVVLIVILVLLVIDAVREIRKYDDVTEKVNLQNNPGAMEHFHMKLFRAQRN LYIAGFSLLLSFLLRRLVTLISQQATLLASNEAFKKQAESASEAAKKYMEENDQLKKG AAVDGGKLDVGNAEVKLEEENRSLKADLQKLKDELASTKQKLEKAENQVLAMRKQSEG LTKEYDRLLEEHAKLQAAVDGPMDKKEE" polyA_signal 1295..1300 BASE COUNT 262 a 336 c 388 g 328 t ORIGIN 1 tttccggccg cggtatgagg ggcggggccg gggctgctgt gggagagttc ggttgctgcg 61 gcggggcctg cacgttgact gtgggaaact cggaaacaag ctcacatctt cctgtgggaa 121 accttctagc aacaggatga gtctgcagtg gactgcagtt gccaccttcc tctatgcgga 181 ggtctttgtt gtgttgcttc tctgcattcc cttcatttct cctaaaagat ggcagaagat 241 tttcaagtcc cggctggtgg agttgttagt gtcctatggc aacaccttct ttgtggttct 301 cattgtcatc cttgtgctgt tggtcatcga tgccgtgcgc gaaattcgga agtatgatga 361 tgtgacggaa aaggtgaacc tccagaacaa tcccggggcc atggagcact tccacatgaa 421 gcttttccgt gcccagagga atctctacat tgctggcttt tccttgctgc tgtccttcct 481 gcttagacgc ctggtgactc tcatttcgca gcaggccacg ctgctggcct ccaatgaagc 541 ctttaaaaag caggcggaga gtgctagtga ggcggccaag aagtacatgg aggagaatga 601 ccagctcaag aagggagctg ctgttgacgg aggcaagttg gatgtcggga atgctgaggt 661 gaagttggag gaagagaaca ggagcctgaa ggctgacctg cagaagctaa aggacgagct 721 ggccagcact aagcaaaaac tagagaaagc tgaaaaccag gttctggcca tgcggaagca 781 gtctgagggc ctcaccaagg agtacgaccg cttgctggag gagcacgcaa agctgcaggc 841 tgcagtagat ggtcccatgg acaagaagga agagtaaggg cctccttcct cccctgcctg 901 cagctggctt ccacctggca cgtgcctgct gcttcctgag agcccggcct ctccctccag 961 tacttctgtt tgtgcccttc tgcttccccc attcccttcc acagctcata gctcgtcatc 1021 tcggcccttg tccacactct ccaagcacat tacaggggac ctgattgcta cacgttcaga 1081 atgcgtttgc tgtcatcctg cttggcctgg ccaggcctgg cacagccttg gcttccacgc 1141 ctgagcgtgg agagcacgag ttagttgtag tccggcttgc ggtggggctg acttcctgtt 1201 ggtttgagcc cctttttgtt ttgccctctg ggtgttttct ttggtcccgc aggagggtgg 1261 gtggaacagg tggactggag tttctcttga gggcaataaa agttttcatg gtgg // LOCUS HSCDW40 1004 bp RNA PRI 14-NOV-1997 DEFINITION Human CDw40 mRNA for nerve growth factor receptor-related B-lymphocyte activation molecule. ACCESSION X60592 NID g29850 KEYWORDS B-lymphocyte activation molecule; CD40 gene; CDw40 gene; nerve growth factor receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1004) AUTHORS Stamenkovic,I., Clark,E.A. and Seed,B. TITLE A B-lymphocyte activation molecule related to the nerve growth factor receptor and induced by cytokines in carcinomas JOURNAL EMBO J. 8 (5), 1403-1410 (1989) MEDLINE 89356608 FEATURES Location/Qualifiers source 1..1004 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Raji" mRNA 1..1004 /evidence=experimental CDS 48..881 /codon_start=1 /product="CDw40" /db_xref="PID:e1175755" /db_xref="PID:g29851" /translation="MVRLPLQCVLWGCLLTAVHPEPPTACREKQYLINSQCCSLCQPG QKLVSDCTEFTETECLPCGESEFLDTWNRETHCHQHKYCDPNLGLRVQQKGTSETDTI CTCEEGWHCTSEACESCVLHRSCSPGFGVKQIATGVSDTICEPCPVGFFSNVSSAFEK CHPWTSCETKDLVVQQAGTNKTDVVCGPQDRLRALVVIPIIFGILFAILLVLVFIKKV AKKPTNKAPHPKQEPQEINFPDDLPGSNTAAPVQETLHGCQPVTQEDGKESRISVQER Q" BASE COUNT 230 a 297 c 276 g 201 t ORIGIN 1 gcctcgctcg ggcgcccagt ggtcctgccg cctggtctca cctcgccatg gttcgtctgc 61 ctctgcagtg cgtcctctgg ggctgcttgc tgaccgctgt ccatccagaa ccacccactg 121 catgcagaga aaaacagtac ctaataaaca gtcagtgctg ttctttgtgc cagccaggac 181 agaaactggt gagtgactgc acagagttca ctgaaacgga atgccttcct tgcggtgaaa 241 gcgaattcct agacacctgg aacagagaga cacactgcca ccagcacaaa tactgcgacc 301 ccaacctagg gcttcgggtc cagcagaagg gcacctcaga aacagacacc atctgcacct 361 gtgaagaagg ctggcactgt acgagtgagg cctgtgagag ctgtgtcctg caccgctcat 421 gctcgcccgg ctttggggtc aagcagattg ctacaggggt ttctgatacc atctgcgagc 481 cctgcccagt cggcttcttc tccaatgtgt catctgcttt cgaaaaatgt cacccttgga 541 caagctgtga gaccaaagac ctggttgtgc aacaggcagg cacaaacaag actgatgttg 601 tctgtggtcc ccaggatcgg ctgagagccc tggtggtgat ccccatcatc ttcgggatcc 661 tgtttgccat cctcttggtg ctggtcttta tcaaaaaggt ggccaagaag ccaaccaata 721 aggcccccca ccccaagcag gaaccccagg agatcaattt tcccgacgat cttcctggct 781 ccaacactgc tgctccagtg caggagactt tacatggatg ccaaccggtc acccaggagg 841 atggcaaaga gagtcgcatc tcagtgcagg agagacagtg aggctgcacc cacccaggag 901 tgtggccacg tgggcaaaca ggcagttggc cagagagcct ggtgctgctg ctgcaggggt 961 gcaggcagaa gcggggagct atgcccagtc agtgccagcc cctc // LOCUS HSCEBP1 1360 bp RNA PRI 05-MAY-1995 DEFINITION H.sapiens BAK mRNA for BCl-2 homologue. ACCESSION X84213 NID g804984 KEYWORDS Bcl-2 protein; CEBP-1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1360) AUTHORS Farrow,S.N., White,J.H., Martinou,I., Raven,T., Pun,K.T., Grinham,C.J., Martinou,J.C. and Brown,R. TITLE Cloning of a bcl-2 homologue by interaction with adenovirus E1B 19K JOURNAL Nature 374 (6524), 731-733 (1995) MEDLINE 95231652 REMARK Erratum:[Nature 1995 Jun 1;375(6530):431]] REFERENCE 2 (bases 1 to 1360) AUTHORS Brown,R. TITLE Direct Submission JOURNAL Submitted (25-JAN-1995) R. Brown, Glaxo Research & Development, Greenford Road, Greenford, Middlesex UB6 0HE, UK FEATURES Location/Qualifiers source 1..1360 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="B-cell" /clone_lib="EBV-transformed B-cell" gene 193..828 /gene="BAK" CDS 193..828 /gene="BAK" /codon_start=1 /product="bcl-2 homologue" /db_xref="PID:g804985" /translation="MASGQGPGPPRQECGEPALPSASEEQVAQDTEEVFRSYVFYRHQ QEQEAEGVAAPADPEMVTLPLQPSSTMGQVGRQLAIIGDDINRRYDSEFQTMLQHLQP TAENAYEYFTKIATSLFESGINWGRVVALLGFGYRLALHVYQHGLTGFLGQVTRFVVD FMLHHCIARWIAQRGGWVAALNLGNGPILNVLVVLGVVLLGQFVVRRFFKS" BASE COUNT 257 a 405 c 400 g 298 t ORIGIN 1 acagggacaa gtaaaggcta catccagatg ccgggaatgc actgacgccc attcctggaa 61 actgggctcc cactcagccc ctgggagcag cagccgccag cccctcggga cctccatctc 121 caccctgctg agccacccgg gttgggccag gatcccggca ggctgatccc gtcctccact 181 gagacctgaa aaatggcttc ggggcaaggc ccaggtcctc ccaggcagga gtgcggagag 241 cctgccctgc cctctgcttc tgaggagcag gtagcccagg acacagagga ggttttccgc 301 agctacgttt tttaccgcca tcagcaggaa caggaggctg aaggggtggc tgcccctgcc 361 gacccagaga tggtcacctt acctctgcaa cctagcagca ccatggggca ggtgggacgg 421 cagctcgcca tcatcgggga cgacatcaac cgacgctatg actcagagtt ccagaccatg 481 ttgcagcacc tgcagcccac ggcagagaat gcctatgagt acttcaccaa gattgccacc 541 agcctgtttg agagtggcat caattggggc cgtgtggtgg ctcttctggg cttcggctac 601 cgtctggccc tacacgtcta ccagcatggc ctgactggct tcctaggcca ggtgacccgc 661 ttcgtggtcg acttcatgct gcatcactgc attgcccggt ggattgcaca gaggggtggc 721 tgggtggcag ccctgaactt gggcaatggt cccatcctga acgtgctggt ggttctgggt 781 gtggttctgt tgggccagtt tgtggtacga agattcttca aatcatgact cccaagggtg 841 ccctttgggt cccggttcag acccctgcct ggacttaagc gaagtctttg ccttctctgt 901 tcccttgcag gggtcccccc tcaagagtac agaagcttta gcaagtgtgc actccagctt 961 cggagggccc ctgcgtgggg gcagtcaggc tgcagaggca cctcaacatt gcatggtgct 1021 agtgggccct ctctctgggc caggggtgtg ccgtctcctc cctcagctct ctgggacctc 1081 cttaaaccct gtcctgctag gcgctgggga gactgataac ttggggagac aagagactgg 1141 gacccacttc tccccaggaa gtgtttaacg gttttagctt tttataatac ccttgtgaga 1201 gcccattccc accattctac ctgaggccag gacgtctggg gtgtggggat tggtgggtct 1261 atgttcccca ggattcagct attctggaag atcagcaccc taagagatgg gactaggacc 1321 tgatcctggt cctggccgtc cctaagcatg tgtcccaggg // LOCUS HSCENPE 8257 bp RNA PRI 10-JAN-1993 DEFINITION H.sapiens CENP-E mRNA. ACCESSION Z15005 NID g29864 KEYWORDS kinetochore motor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8257) AUTHORS Yen,T.J., Li,G., Schaar,B.T., Szilak,I. and Cleveland,D.W. TITLE CENP-E is a putative kinetochore motor that accumulates just before mitosis JOURNAL Nature 359 (6395), 536-539 (1992) MEDLINE 93024922 REFERENCE 2 (bases 1 to 8257) AUTHORS Cleveland,D.W. TITLE Direct Submission JOURNAL Submitted (27-AUG-1992) Don W. Cleveland, Biological Chemistry, Johns Hopkins University, School of Medicine, 725 N. Wolfe St., Baltimore, Maryland, 21205, USA FEATURES Location/Qualifiers source 1..8257 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="CENPE" CDS 91..8082 /codon_start=1 /product="CENP-E" /db_xref="PID:g29865" /db_xref="SWISS-PROT:Q02224" /translation="MAEEGAVAVCVRVRPLNSREESLGETAQVYWKTDNNVIYQVDGS KSFNFDRVFHGNETTKNVYEEIAAPIIDSAIQGYNGTIFAYGQTASGKTYTMMGSEDH LGVIPRAIHDIFQKIKKFPDREFLLRVSYMEIYNETITDLLCGTQKMKPLIIREDVNR NVYVADLTEEVVYTSEMALKWITKGEKSRHYGETKMNQRSSRSHTIFRMILESREKGE PSNCEGSVKVSHLNLVDLAGSERAAQTGAAGVRLKEGCNINRSLFILGQVIKKLSDGQ VGGFINYRDSKLTRILQNSLGGNPKTRIICTITPVSFDETLTALQFASTAKYMKNTPY VNEVSTDEALLKRYRKEIMDLKKQLEEVSLETRAQAMEKDQLAQLLEEKDLLQKVQNE KIENLTRMLVTSSSLTLQQELKAKRKRRVTWCLGKINKMKNSNYADQFNIPTNITTKT HKLSINLLREIDESVCSESDVFSNTLDTLSEIEWNPATKLLNQENIESELNSLRADYD NLVLDYEQLRTEKEEMELKLKEKNDLDEFEALERKTKKDQEMQLIHEISNLKNLVKHR EVYNQDLENELSSKVELLREKEDQIKKLQEYIDSQKLENIKMDLSYSLESIEDPKQMK QTLFDAETVALDAKRESAFLRSENLELKEKMKELATTYKQMENDIQLYQSQLEAKKKM QVDLEKELQSAFNEITKLTSLIDGKVPKDLLCNLELEGKITDLQKELNKEVEENEALR EEVILLSELKSLPSEVERLRKEIQDKSEELHIITSEKDKLFSEVVHKESRVQGLLEEI GKTKDDLATTQSNYKSTDQEFQNFKTLHMDFEQKYKMVLEENERMNQEIVNLSKEAQK FDSSLGALKTELSYKTQELQEKTREVQERLNEMEQLKEQLENRDSPLQTVEREKTLIT EKLQQTLEEVKTLTQEKDDLKQLQESLQIERDQLKSDIHDTVNMNIDTQEQLRNALES LKQHQETINTLKSKISEEVSRNLHMEENTGETKDEFQQKMVGIDKKQDLEAKNTQTLT ADVKDNEIIEQQRKIFSLIQEKNELQQMLESVIAEKEQLKTDLKENIEMTIENQEELR LLGDELKKQQEIVAQEKNHAIKKEGELSRTCDRLAEVEEKLKEKSQQLQEKQQQLLNV QEEMSEMQKKINEIENLKNELKNKELTLEHMETERLELAQKLNENYEEVKSITKERKV LKELQKSFETERDHLRGYIREIEATGLQTKEELKIAHIHLKEHQETIDELRRSVSEKT AQIINTQDLEKSHTKLQEEIPVLHEEQELLPNVKKVSETQETMNELELLTEQSTTKDS TTLARIEMERLRLNEKFQESQEEIKSLTKERDNLKTIKEALEVKHDQLKEHIRETLAK IQESQSKQEQSLNMKEKDNETTKIVSEMEQFKPKDSALLRIEIEMLGLSKRLQESHDE MKSVAKEKDDLQRLQEVLQSESDQLKENIKEIVAKHLETEEELKVAHCCLKEQEETIN ELRVNLSEKETEISTIQKQLEAINDKLQNKIQEIYEKEEQLNIKQISEVQENVNELKQ FKEHRKAKDSALQSIESKMLELTNRLQESQEEIQIMIKEKEEMKRVQEALQIERDQLK ENTKEIVAKMKESQEKEYQFLKMTAVNETQEKMCEIEHLKEQFETQKLNLENIETENI RLTQILHENLEEMRSVTKERDDLRSVEETLKVERDQLKENLRETITRDLEKQEELKIV HMHLKEHQETIDKLRGIVSEKTNEISNMQKDLEHSNDALKAQDLKIQEELRIAHMHLK EQQETIDKLRGIVSEKTDKLSNMQKDLENSNAKLQEKIQELKANEHQLITLKKDVNET QKKVSEMEQLKKQIKDQSLTLSKLEIENLNLAQELHENLEEMKSVMKERDNLRRVEET LKLERDQLKESLQETKARDLEIQQELKTARMLSKEHKETVDKLREKISEKTIQISDIQ KDLDKSKDELQKKIQELQKKELQLLRVKEDVNMSHKKINEMEQLKKQFEPNYLCKCEM DNFQLTKKLHESLEEIRIVAKERDELRRIKESLKMERDQFIATLREMIARDRQNHQVK PEKRLLSDGQQHLMESLREKCSRIKELLKRYSEMDDHYECLNRLSLDLEKEIEFHRIM KKLKYVLSYVTKIKEEQHECINKFEMDFIDEVEKQKELLIKIQHLQQDCDVPSRELRD LKLNQNMDLHIEEILKDFSESEFPSIKTEFQQVLSNRKEMTQFLEEWLNTRFDIEKLK NGIQKENDRICQVNNFFNNRIIAIMNESTEFEERSATISKEWEQDLKSLKEKNEKLFK NYQTLKTSLASGAQVNPTTQDNKNPHVTSRATQLTTEKIRELENSLHEAKESAMHKES KIIKMQKELEVTNDIIAKLQAKVHESNKCLEKTKETIQVLQDKVALGAKPYKEEIEDL KMKLVKIDLEKMKNAKEFEKEISATKATVEYQKEVIRLLRENLRRSQQAQDTSVISEH TDPQPSNKPLTCGGGSGIVQNTKALILKSEHIRLEKEISKLKQQNEQLIKQKNELLSN NQHLSNEVKTWKERTLKREAHKQVTCENSPKSPKVTGTASKKKQITPSQCKERNLQDP VPKESPKSCFFDSRSKSLPSPHPVRYFDNSSLGLCPEVQNAGAESVDSQPGPWHASSG KDVPECKTQ" BASE COUNT 3429 a 1263 c 1677 g 1888 t ORIGIN 1 taaatttaaa ggcggggcgg cctgtgagcc ctgaagtgcc ggccgcggag ggtcctggcc 61 attttggtgg gaccagttca gcctgatagg atggcggagg aaggagccgt ggccgtctgc 121 gtgcgagtgc ggccgctgaa cagcagagaa gaatcacttg gagaaactgc ccaagtttac 181 tggaaaactg acaataatgt catttatcaa gttgatggaa gtaaatcctt caattttgat 241 cgtgtctttc atggtaatga aactaccaaa aatgtgtatg aagaaatagc agcaccaatc 301 atcgattctg ccatacaagg ctacaatggt actatatttg cctatggaca gactgcttca 361 ggaaaaacat ataccatgat gggttcagaa gatcatttgg gagttatacc cagggcaatt 421 catgacattt tccaaaaaat taagaagttt cctgataggg aatttctctt acgtgtatct 481 tacatggaaa tatacaatga aaccattaca gatttactct gtggcactca aaaaatgaaa 541 cctttaatta ttcgagaaga tgtcaatagg aatgtgtatg ttgctgatct cacagaagaa 601 gttgtatata catcagaaat ggctttgaaa tggattacaa agggagaaaa gagcaggcat 661 tatggagaaa caaaaatgaa tcaaagaagc agtcgttctc ataccatctt taggatgatt 721 ttggaaagca gagagaaggg tgaaccttct aattgtgaag gatctgttaa ggtatcccat 781 ttgaatttgg ttgatcttgc aggcagtgaa agagctgctc aaacaggcgc tgcaggtgtg 841 cggctcaagg aaggctgtaa tataaatcga agcttattta ttttgggaca agtgatcaag 901 aaacttagtg atggacaagt tggtggtttc ataaattatc gagatagcaa gttaacacga 961 attcttcaga attccttggg aggaaatcca aagacacgta ttatctgcac aattactcca 1021 gtatcttttg atgaaactct tactgctctc cagtttgcca gtactgctaa atatatgaag 1081 aatactcctt atgttaatga ggtatcaact gatgaagctc tcctgaaaag gtatagaaaa 1141 gaaataatgg atcttaaaaa acaattagag gaggtttctt tagagacgcg ggctcaggca 1201 atggaaaaag accaattggc ccaacttttg gaagaaaaag atttgcttca gaaagtacag 1261 aatgagaaaa ttgaaaactt aacacggatg ctggtgacct cttcttccct cacgttgcaa 1321 caggaattaa aggctaaaag aaaacgaaga gttacttggt gccttggcaa aattaacaaa 1381 atgaagaact caaactatgc agatcaattt aatataccaa caaatataac aacaaaaaca 1441 cataagcttt ctataaattt attacgagaa attgatgaat ctgtctgttc agagtctgat 1501 gttttcagta acactcttga tacattaagt gagatagaat ggaatccagc aacaaagcta 1561 ctaaatcagg agaatataga aagtgagttg aactcacttc gtgctgacta tgataatctg 1621 gtattagact atgaacaact acgaacagaa aaagaagaaa tggaattgaa attaaaagaa 1681 aagaatgatt tggatgaatt tgaggctcta gaaagaaaaa ctaaaaaaga tcaagagatg 1741 caactaattc atgaaatttc gaacttaaag aatttagtta agcatcgaga agtatataat 1801 caagatcttg agaatgaact cagttcaaaa gtagagctgc ttagagaaaa ggaagaccag 1861 attaagaagc tacaggaata catagactct caaaagctag aaaatataaa aatggacttg 1921 tcatactcat tggaaagcat tgaagaccca aaacaaatga agcagactct gtttgatgct 1981 gaaactgtag cccttgatgc caagagagaa tcagcctttc ttagaagtga aaatctggag 2041 ttgaaggaga aaatgaaaga acttgcaact acatacaagc aaatggaaaa tgatattcag 2101 ttatatcaaa gccaattgga ggcaaaaaag aaaatgcaag ttgatctgga gaaagaatta 2161 caatctgctt ttaatgagat aacaaaactc acctccctta tagatggcaa agttccaaaa 2221 gatttgctct gtaatttgga attggaagga aagattactg atcttcagaa agaactaaat 2281 aaagaagttg aagaaaatga agctttgcgg gaagaagtca ttttgctttc agaattgaaa 2341 tctttacctt ctgaagtaga aaggctgagg aaagagatac aagacaaatc tgaagagctc 2401 catataataa catcagaaaa agataaattg ttttctgaag tagttcataa ggagagtaga 2461 gttcaaggtt tacttgaaga aattgggaaa acaaaagatg acctagcaac tacacagtcg 2521 aattataaaa gcactgatca agaattccaa aatttcaaaa cccttcatat ggactttgag 2581 caaaagtata agatggtcct tgaggagaat gagagaatga atcaggaaat agttaatctc 2641 tctaaagaag cccaaaaatt tgattcgagt ttgggtgctt tgaagaccga gctttcttac 2701 aagacccaag aacttcagga gaaaacacgt gaggttcaag aaagactaaa tgagatggaa 2761 cagctgaagg aacaattaga aaatagagat tctccgctgc aaactgtaga aagggagaaa 2821 acactgatta ctgagaaact gcagcaaact ttagaagaag taaaaacttt aactcaagaa 2881 aaagatgatc taaaacaact ccaagaaagc ttgcaaattg agagggacca actcaaaagt 2941 gatattcacg atactgttaa catgaatata gatactcaag aacaattacg aaatgctctt 3001 gagtctctga aacaacatca agaaacaatt aatacactaa aatcgaaaat ttctgaggaa 3061 gtttccagga atttgcatat ggaggaaaat acaggagaaa ctaaagatga atttcagcaa 3121 aagatggttg gcatagataa aaaacaggat ttggaagcta aaaataccca aacactaact 3181 gcagatgtta aggataatga gataattgag caacaaagga agatattttc tttaatacag 3241 gagaaaaatg aactccaaca aatgttagag agtgttatag cagaaaagga acaattgaag 3301 actgacctaa aggaaaatat tgaaatgacc attgaaaacc aggaagaatt aagacttctt 3361 ggggatgaac ttaaaaagca acaagagata gttgcacaag aaaagaacca tgccataaag 3421 aaagaaggag agctttctag gacctgtgac agactggcag aagttgaaga aaaactaaag 3481 gaaaagagcc agcaactcca agaaaaacag caacaacttc ttaatgtaca agaagagatg 3541 agtgagatgc agaaaaagat taatgaaata gagaatttaa agaatgaatt aaagaacaaa 3601 gaattgacat tggaacatat ggaaacagag aggcttgagt tggctcagaa acttaatgaa 3661 aattatgagg aagtgaaatc tataaccaaa gaaagaaaag ttctaaagga attacagaag 3721 tcatttgaaa cagagagaga ccaccttaga ggatatataa gagaaattga agctacaggc 3781 ctacaaacca aagaagaact aaaaattgct catattcacc taaaagaaca ccaagaaact 3841 attgatgaac taagaagaag cgtatctgag aagacagctc aaataataaa tactcaggac 3901 ttagaaaaat cccataccaa attacaagaa gagatcccag tgcttcatga ggaacaagag 3961 ttactgccta atgtgaaaaa agtcagtgag actcaggaaa caatgaatga actggagtta 4021 ttaacagaac agtccacaac caaggactca acaacactgg caagaataga aatggaaagg 4081 ctcaggttga atgaaaaatt tcaagaaagt caggaagaga taaaatctct aaccaaggaa 4141 agagacaacc ttaaaacgat aaaagaagcc cttgaagtta aacatgacca gctgaaagaa 4201 catattagag aaactttggc taaaatccag gagtctcaaa gcaaacaaga acagtcctta 4261 aatatgaaag aaaaagacaa tgaaactacc aaaatcgtga gtgagatgga gcaattcaaa 4321 cccaaagatt cagcactact aaggatagaa atagaaatgc tcggattgtc caaaagactt 4381 caagaaagtc atgatgaaat gaaatctgta gctaaggaga aagatgacct acagaggctg 4441 caagaagttc ttcaatctga aagtgaccag ctcaaagaaa acataaaaga aattgtagct 4501 aaacacctgg aaactgaaga ggaacttaaa gttgctcatt gttgcctgaa agaacaagag 4561 gaaactatta atgagttaag agtgaatctt tcagagaagg aaactgaaat atcaaccatt 4621 caaaagcagt tagaagcaat caatgataaa ttacagaaca agatccaaga gatttatgag 4681 aaagaggaac aacttaatat aaaacaaatt agtgaggttc aggaaaacgt gaatgaactg 4741 aaacaattca aggagcatcg caaagccaag gattcagcac tacaaagtat agaaagtaag 4801 atgctcgagt tgaccaacag acttcaagaa agtcaagaag aaatacaaat tatgattaag 4861 gaaaaagagg aaatgaaaag agtacaggag gcccttcaga tagagagaga ccaactgaaa 4921 gaaaacacta aagaaattgt agctaaaatg aaagaatctc aagaaaaaga atatcagttt 4981 cttaagatga cagctgtcaa tgagactcag gagaaaatgt gtgaaataga acacttgaag 5041 gagcaatttg agacccagaa gttaaacctg gaaaacatag aaacggagaa tataaggttg 5101 actcagatac tacatgaaaa ccttgaagaa atgagatctg taacaaaaga aagagatgac 5161 cttaggagtg tggaggagac tctcaaagta gagagagacc agctcaagga aaaccttaga 5221 gaaactataa ctagagacct agaaaaacaa gaggagctaa aaattgttca catgcatctg 5281 aaggagcacc aagaaactat tgataaacta agagggattg tttcagagaa aacaaatgaa 5341 atatcaaata tgcaaaagga cttagaacac tcaaatgatg ccttaaaagc acaggatctg 5401 aaaatacaag aggaactaag aattgctcac atgcatctga aagagcagca ggaaactatt 5461 gacaaactca gaggaattgt ttctgagaag acagataaac tatcaaatat gcaaaaagat 5521 ttagaaaatt caaatgctaa attacaagaa aagattcaag aacttaaggc aaatgaacat 5581 caacttatta cgttaaaaaa agatgtcaat gagacacaga aaaaagtgtc tgaaatggag 5641 caactaaaga aacaaataaa agaccaaagc ttaactctga gtaaattaga aatagagaat 5701 ttaaatttgg ctcaagaact tcatgaaaac cttgaagaaa tgaaatctgt aatgaaagaa 5761 agagataatc taagaagagt agaggagaca ctcaaactgg agagagacca actcaaggaa 5821 agcctgcaag aaaccaaagc tagagatctg gaaatacaac aggaactaaa aactgctcgt 5881 atgctatcaa aagaacacaa agaaactgtt gataaactta gagaaaaaat ttcagaaaag 5941 acaattcaaa tttcagacat tcaaaaggat ttagataaat caaaagatga attacagaaa 6001 aagatccaag aacttcagaa aaaagaactt caactgctta gagtgaaaga agatgtcaat 6061 atgagtcata aaaaaattaa tgaaatggaa cagttgaaga agcaatttga gccaaactat 6121 ctatgcaagt gtgagatgga taacttccag ttgactaaga aacttcatga aagccttgaa 6181 gaaataagaa ttgtagctaa agaaagagat gagctaagga ggataaaaga atctctcaaa 6241 atggaaaggg accaattcat agcaacctta agggaaatga tagctagaga ccgacagaac 6301 caccaagtaa aacctgaaaa aaggttacta agtgatggac aacagcacct tatggaaagc 6361 ctgagagaaa agtgctctag aataaaagag cttttgaaga gatactcaga gatggatgat 6421 cattatgagt gcttgaatag attgtctctt gacttggaga aggaaattga attccacaga 6481 atcatgaaga aactgaagta tgtgttaagc tatgttacaa aaataaaaga agaacaacat 6541 gaatgcatca ataaatttga aatggatttt attgatgaag tggaaaagca aaaggaattg 6601 ctaattaaaa tacagcacct tcaacaagat tgtgatgtac catccagaga attaagggat 6661 ctcaaattga accagaatat ggatctacat attgaggaaa ttctcaaaga tttctcagaa 6721 agtgagttcc ctagcataaa gactgaattt caacaagtac taagtaatag gaaagaaatg 6781 acacagtttt tggaagagtg gttaaatact cgttttgata tagaaaagct taaaaatggc 6841 atccagaaag aaaatgatag gatttgtcaa gtgaataact tctttaataa cagaataatt 6901 gccataatga atgaatcaac agagtttgag gaaagaagtg ctaccatatc caaagagtgg 6961 gaacaggacc tgaaatcact gaaagagaaa aatgaaaaac tatttaaaaa ctaccaaaca 7021 ttgaagactt ccttggcatc tggtgcccag gttaatccta ccacacaaga caataagaat 7081 cctcatgtta catcaagagc tacacagtta accacagaga aaattcgaga gctggaaaat 7141 tcactgcatg aagctaaaga aagtgctatg cataaggaaa gcaagattat aaagatgcag 7201 aaagaacttg aggtgactaa tgacataata gcaaaacttc aagccaaagt tcatgaatca 7261 aataaatgcc ttgaaaaaac aaaagagaca attcaagtac ttcaggacaa agttgcttta 7321 ggagctaagc catataaaga agaaattgaa gatctcaaaa tgaagcttgt gaaaatagac 7381 ctagagaaaa tgaaaaatgc caaagaattt gaaaaggaaa tcagtgctac aaaagccact 7441 gtagaatatc aaaaggaagt tataaggcta ttgagagaaa atctcagaag aagtcaacag 7501 gcccaagata cctcagtgat atcagaacat actgatcctc agccttcaaa taaaccctta 7561 acttgtggag gtggcagcgg cattgtacaa aacacaaaag ctcttatttt gaaaagtgaa 7621 catataaggc tagaaaaaga aatttctaag ttaaagcagc aaaatgaaca gctaataaaa 7681 caaaagaatg aattgttaag caataatcag catctttcca atgaggtcaa aacttggaag 7741 gaaagaaccc ttaaaagaga ggctcacaaa caagtaactt gtgagaattc tccaaagtct 7801 cctaaagtga ctggaacagc ttctaaaaag aaacaaatta caccctctca atgcaaggaa 7861 cggaatttac aagatcctgt gccaaaggaa tcaccaaaat cttgtttttt tgatagccga 7921 tcaaagtctt taccatcacc tcatccagtt cgctattttg ataactcaag tttaggcctt 7981 tgtccagagg tgcaaaatgc aggagcagag agtgtggatt ctcagccagg tccttggcac 8041 gcctcctcag gcaaggatgt gcctgagtgc aaaactcagt agactcctct ttgtcacttc 8101 tctggagatc cagcattcct tatttggaaa tgactttgtt tatgtgtcta tccctggtaa 8161 tgatgttgta gtgcagctta atttcaattc agtctttact ttgccactag agttgaaaga 8221 taagggaaca ggaaatgaat gcattgtggt aatttag // LOCUS HSCENTRIN 504 bp RNA PRI 28-JUL-1997 DEFINITION H.sapiens mRNA for centrin gene. ACCESSION Y12473 NID g2246400 KEYWORDS centrin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 504) AUTHORS Middendorp,S., Paoletti,A., Schiebel,E. and Bornens,M. JOURNAL Unpublished REFERENCE 2 (bases 1 to 504) AUTHORS Bornens,M. TITLE Direct Submission JOURNAL Submitted (08-APR-1997) M. Bornens, Institut Curie, UMR 144-CNRS, 26 Rue dUlm, 75231 Paris Cedex 05, FRANCE FEATURES Location/Qualifiers source 1..504 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 1..504 /codon_start=1 /product="centrin" /db_xref="PID:e314005" /db_xref="PID:g2246401" /translation="MSLALRSELLVDKTKRKKRRELSEEQKQEIKDAFELFDTDKDEA IDYHELKVAMRALGFDVKKADVLKILKDYDREATGKITFEDFNEVVTDWILERDPHEE ILKAFKLFDDDDSGKISLRNLRRVARELGENMSDEELRAMIEEFDKDGDGEINQEEFI AIMTGDI" BASE COUNT 197 a 55 c 124 g 128 t ORIGIN 1 atgagtttag ctctgagaag tgagcttcta gtggacaaaa caaagaggaa aaaaagaaga 61 gaactgtctg aggaacagaa acaagaaatt aaagatgctt ttgaactatt tgatacagac 121 aaagatgaag caatagatta tcatgaatta aaggtggcaa tgagagcctt ggggtttgat 181 gtaaaaaaag ctgatgtact gaagattctt aaagattatg acagagaagc cacagggaaa 241 atcacctttg aagattttaa tgaagttgtg acagactgga tattggaaag agatccccat 301 gaagaaatac tcaaggcatt taaactattt gatgatgatg attcaggtaa aataagcttg 361 aggaatttgc gacgtgttgc tagagaattg ggtgaaaaca tgagtgatga agaacttcga 421 gctatgatag aagaatttga caaagatggt gatggagaaa taaaccaaga ggagttcatt 481 gctattatga ctggtgacat ttaa // LOCUS HSCERBAR 2288 bp RNA PRI 19-MAR-1991 DEFINITION Human c-erbA-1 mRNA for thyroid hormone receptor alpha. ACCESSION X55005 NID g29878 KEYWORDS c-erbA-1 gene; thyroid hormone receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2288) AUTHORS Laudet,V. TITLE Direct Submission JOURNAL Submitted (25-OCT-1990) V. Laudet, INSERM U186/CNRSUA041160, INSTITUT PASTEUR, 1 RUE CALMETTE, 59019 LILLE CEDEX, FRANCE REFERENCE 2 (bases 1 to 2288) AUTHORS Laudet,V., Begue,A., Henry-Duthoit,C., Joubel,A., Martin,P., Stehelin,D. and Saule,S. TITLE Genomic organization of the human thyroid hormone receptor alpha (c-erbA-1) gene JOURNAL Nucleic Acids Res. 19 (5), 1105-1112 (1991) MEDLINE 91212192 COMMENT Overlaps with Nakai, Mol. Endocrinol. 2:1087-1092(1988) and Benbrook, Science 238:788-791(1987). FEATURES Location/Qualifiers source 1..2288 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="K562" /cell_line="K562" /clone_lib="K562 cDNA" /clone="14c" /chromosome="17Q21" mRNA 1..2276 /gene="c-erbA-1" /evidence=experimental gene 1..2276 /gene="c-erbA-1" CDS 467..1699 /gene="c-erbA-1" /codon_start=1 /product="thyriod hormone receptor alpha" /db_xref="PID:g29879" /db_xref="SWISS-PROT:P21205" /translation="MEQKPSKVECGSDPEENSARSPDGKRKRKNGQCSLKTSMSGYIP SYLDKDEQCVVCGDKATGYHYRCITCEGCKGFFRRTIQKNLHPTYSCKYDSCCVIDKI TRNQCQLCRFKKCIAVGMAMDLVLDDSKRVAKRKLIEQNRERRRKEEMIRSLQQRPEP TPEEWDLIHIATEAHRSTNAQGSHWKQRRKFLPDDIGQSPIVSMPDGDKVDLEAFSEF TKIITPAITRVVDFAKKLPMFSELPCEDQIILLKGCCMEIMSLRAAVRYDPESDTLTL SGEMAVKREQLKNGGLGVVSDAIFELGKSLSAFNLDDTEVALLQAVLLMSTDRSGLLC VDKIEKSQEAYLLAFEHYVNHRKHNIPHFWPKLLMKVTDLRMIGACHASRFLHMKVEC PTELFPPLFLEVFEDQEV" BASE COUNT 518 a 702 c 673 g 395 t ORIGIN 1 ccgccgcccg ccacactcgc cccccgcccc ccccgcgcct cactcgcact cacacccggg 61 cgcaggaggc gggcggcccg ggccccaccg gccccccatg gacgccccca gcacggggcg 121 ctgagacccc cgcgtcgctg cccagcccgg tccggcgcgc cacgccgagg gatctctgga 181 caggacaaga ctccgaagct actcccccag cacacagccc gggacccaca aacccagctt 241 gcccccagcc ctcccacctg ccactccctg gcccctccca ccgcccgccc cccttggggc 301 gcagggcatg gtgtgaaagg ccaagtgctg aggcgggtat catgggtgct gtgccctagg 361 gcctgggtgg cagggggtgg gtggcctgtg ggtgtgccgg gggggccagt gtgcccacca 421 cagtctcttg gcgtgctgga gggcatcctg gatggaattg aagtgaatgg aacagaagcc 481 aagcaaggtg gaatgtgggt cagacccaga ggagaacagt gccaggtcac cagatggaaa 541 gcgaaaaaga aagaacggcc aatgttccct gaaaaccagc atgtcagggt atatccctag 601 ttacctggac aaagacgagc agtgtgtcgt gtgtggggac aaggcaactg gttatcacta 661 ccgctgtatc acttgtgagg gctgcaaggg cttctttcgc cgcacaatcc agaagaacct 721 ccatcccacc tattcctgca aatatgacag ctgctgtgtc attgacaaga tcacccgcaa 781 tcagtgccag ctgtgccgct tcaagaagtg catcgccgtg ggcatggcca tggacttggt 841 tctagatgac tcgaagcggg tggccaagcg taagctgatt gagcagaacc gggagcggcg 901 gcggaaggag gagatgatcc gatcactgca gcagcgacca gagcccactc ctgaagagtg 961 ggatctgatc cacattgcca cagaggccca tcgcagcacc aatgcccagg gcagccattg 1021 gaaacagagg cggaaattcc tgcccgatga cattggccag tcacccattg tctccatgcc 1081 ggacggagac aaggtggacc tggaagcctt cagcgagttt accaagatca tcaccccggc 1141 catcacccgt gtggtggact ttgccaaaaa actgcccatg ttctccgagc tgccttgcga 1201 agaccagatc atcctcctga aggggtgctg catggagatc atgtccctgc gggcggctgt 1261 ccgctacgac cctgagagcg acaccctgac gctgagtggg gagatggctg tcaagcggga 1321 gcagctcaag aatggcggcc tgggcgtagt ctccgacgcc atctttgaac tgggcaagtc 1381 actctctgcc tttaacctag atgacacgga agtggctctg ctgcaggctg tgctgctaat 1441 gtcaacagac cgctcgggcc tgctgtgtgt ggacaagatc gagaagagtc aggaggcgta 1501 cctgctggcg ttcgagcact acgtcaacca ccgcaaacac aacattccgc acttctggcc 1561 caagctgctg atgaaggtga ctgacctccg catgatcggg gcctgccacg ccagccgctt 1621 cctccacatg aaagtcgagt gccccaccga actcttcccc ccactcttcc tcgaggtctt 1681 tgaggatcag gaagtctaaa gcctcaggcg gccagagggt gtgcggagct ggtggggagg 1741 agcctggaga gaaggggcag agctgggggc tgagggagac ccccccacac cccttctctc 1801 cttcctctcg tccttggata gattcagctc ccacacacac acccgcactg cccaggtccc 1861 tcctcagacc tccagccctg ggacaggcaa acaactgaac ttgctatgga aaggacagtg 1921 tgggaggctg ggggagctgt gttctgcagt tcccaggacc ccatcctctc agaaggtagg 1981 ggaagggcgg gaggattgag aagggacaag ccaccttgac cgtagggaag gaggaatgtg 2041 ggctggggga agatgccctc aactcacccc ctacacacac atgagagaga gcccccaccc 2101 agttccttgg cctaggtctc ccctccaggc tgagggcctc tctacttccc cagatgcctg 2161 ggtgcaaaga acggcttggc ttggctcctc ctctggagtt aaaatttata gtcattctaa 2221 ctgcactttg gaaaccaagc aaggggagaa gacaaatgaa gaaaaactag acagagaaaa 2281 aaaaaaaa // LOCUS HSCGBS08 1755 bp RNA PRI 15-OCT-1991 DEFINITION Human mRNA for cannabinoid receptor. ACCESSION X54937 NID g29914 KEYWORDS G-protein coupled receptor; receptor; transmembrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1755) AUTHORS Gerard,C. TITLE Direct Submission JOURNAL Submitted (23-OCT-1990) Gerard C., IRIBHN, Campus Erasme, 808 v De Lennik, 1070 Bruxelles, Belgium REFERENCE 2 (bases 1 to 1755) AUTHORS Gerard,C., Mollereau,C., Vassart,G. and Parmentier,M. TITLE Nucleotide sequence of a human cannabinoid receptor cDNA JOURNAL Nucleic Acids Res. 18 (23), 7142 (1990) MEDLINE 91088303 REFERENCE 3 (bases 1 to 1755) AUTHORS Gerard,C.M., Mollereau,C., Vassart,G. and Parmentier,M. TITLE Molecular cloning of a human cannabinoid receptor which is also expressed in testis JOURNAL Biochem. J. 279 (Pt 1), 129-134 (1991) MEDLINE 92028798 COMMENT Data kindly reviewed (14-MAR-1991) by Gerard c. FEATURES Location/Qualifiers source 1..1755 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="brain stem" /clone_lib="cDNA in lambda GT11" mRNA 1..1755 /gene="CGBS08" gene 1..1755 /gene="CGBS08" CDS 149..1567 /gene="CGBS08" /codon_start=1 /product="cannabinoid receptor" /db_xref="PID:g29915" /db_xref="SWISS-PROT:P21554" /translation="MKSILDGLADTTFRTITTDLLYVGSNDIQYEDIKGDMASKLGYF PQKFPLTSFRGSPFQEKMTAGDNPQLVPADQVNITEFYNKSLSSFKENEENIQCGENF MDIECFMVLNPSQQLAIAVLSLTLGTFTVLENLLVLCVILHSRSLRCRPSYHFIGSLA VADLLGSVIFVYSFIDFHVFHRKDSRNVFLFKLGGVTASFTASVGSLFLTAIDRYISI HRPLAYKRIVTRPKAVVAFCLMWTIAIVIAVLPLLGWNCEKLQSVCSDIFPHIDETYL MFWIGVTSVLLLFIVYAYMYILWKAHSHAVRMIQRGTQKSIIIHTSEDGKVQVTRPDQ ARMDIRLAKTLVLILVVLIICWGPLLAIMVYDVFGKMNKLIKTVFAFCSMLCLLNSTV NPIIYALRSKDLRHAFRSMFPSCEGTAQPLDNSMGDSDCLHKHANNAASVHRAAESCI KSTVKIAKVTMSVSTDTSAEAL" mat_peptide 149..1564 /gene="CGBS08" /product="cannabinoid receptor" misc_feature 377..379 /gene="CGBS08" /note="putative glycosylation site" misc_feature 397..399 /gene="CGBS08" /note="putative glycosylation site" misc_feature 482..484 /gene="CGBS08" /note="putative glycosylation site" misc_feature 497..574 /gene="CGBS08" /note="putative transmembrane segment" misc_feature 611..673 /gene="CGBS08" /note="putative transmembrane segment" misc_feature 710..784 /gene="CGBS08" /note="putative transmembrane segment" misc_feature 845..916 /gene="CGBS08" /note="putative transmembrane segment" misc_feature 971..1045 /gene="CGBS08" /note="putative transmembrane segment" misc_feature 1187..1243 /gene="CGBS08" /note="putative transmembrane segment" misc_feature 1280..1345 /gene="CGBS08" /note="putative transmembrane segment" BASE COUNT 395 a 487 c 441 g 432 t ORIGIN 1 ggggactacg gagagctctg cagggagccg aggcccccgc ccgggccaag ggagcttctg 61 tcccgaggac caggggatgc gaagggattg ccccctgtgg gtcactttct cagtcatttt 121 gagctcagcc taatcaaaga ctgaggttat gaagtcgatc ctagatggcc ttgcagatac 181 caccttccgc accatcacca ctgacctcct gtacgtgggc tcaaatgaca ttcagtacga 241 agacatcaaa ggtgacatgg catccaaatt agggtacttc ccacagaaat tccctttaac 301 ttcctttagg ggaagtccct tccaagagaa gatgactgcg ggagacaacc cccagctagt 361 cccagcagac caggtgaaca ttacagaatt ttacaacaag tctctctcgt ccttcaagga 421 gaatgaggag aacatccagt gtggggagaa cttcatggac atagagtgtt tcatggtcct 481 gaaccccagc cagcagctgg ccattgcagt cctgtccctc acgctgggca ccttcacggt 541 cctggagaac ctcctggtgc tgtgcgtcat cctccactcc cgcagcctcc gctgcaggcc 601 ttcctaccac ttcatcggca gcctggcggt ggcagacctc ctggggagtg tcatttttgt 661 ctacagcttc attgacttcc acgtgttcca ccgcaaagat agccgcaacg tgtttctgtt 721 caaactgggt ggggtcacgg cctccttcac tgcctccgtg ggcagcctgt tcctcacagc 781 catcgacagg tacatatcca ttcacaggcc cctggcctat aagaggattg tcaccaggcc 841 caaggccgtg gtggcgtttt gcctgatgtg gaccatagcc attgtgatcg ccgtgctgcc 901 tctcctgggc tggaactgcg agaaactgca atctgtttgc tcagacattt tcccacacat 961 tgatgaaacc tacctgatgt tctggatcgg ggtcaccagc gtactgcttc tgttcatcgt 1021 gtatgcgtac atgtatattc tctggaaggc tcacagccac gccgtccgca tgattcagcg 1081 tggcacccag aagagcatca tcatccacac gtctgaggat gggaaggtac aggtgacccg 1141 gccagaccaa gcccgcatgg acattaggtt agccaagacc ctggtcctga tcctggtggt 1201 gttgatcatc tgctggggcc ctctgcttgc aatcatggtg tatgatgtct ttgggaagat 1261 gaacaagctc attaagacgg tgtttgcatt ctgcagtatg ctctgcctgc tgaactccac 1321 cgtgaacccc atcatctatg ctctgaggag taaggacctg cgacacgctt tccggagcat 1381 gtttccctct tgtgaaggca ctgcgcagcc tctggataac agcatggggg actcggactg 1441 cctgcacaaa cacgcaaaca atgcagccag tgttcacagg gccgcagaaa gctgcatcaa 1501 gagcacggtc aagattgcca aggtaaccat gtctgtgtcc acagacacgt ctgccgaggc 1561 tctgtgagcc tgatgcctcc ctggcagcac aggaaaagaa tttttttttt taagctcaaa 1621 atctagaaga gtctattgtc tccttggtta tattttttta actttaccat gctcaatgaa 1681 aaggtgattg ccacatgtca cttatttgct tagtttccgt ttgggctaat cttccggggt 1741 tcgtaggaaa ccttt // LOCUS HSCGGBP 779 bp RNA PRI 22-JUL-1997 DEFINITION Homo sapiens trinucleotide repeat 5-d(CGG)n-3ds binding protein p20-CGGBP. ACCESSION AJ000258 NID g2274963 KEYWORDS CGGBP gene; DNA binding protein; trinucleotide repeat. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 779) AUTHORS Deissler,H. TITLE Direct Submission JOURNAL Submitted (15-JUL-1997) Deissler H., Department for Medical Genetics and Virology, Institute of Genetics, University of Cologne, Weyertal 121, NRW, D-50931 Cologne, GERMANY REFERENCE 2 (bases 1 to 779) AUTHORS Deissler,H., Wilm,M., Genc,B., Schmitz,B., Ternes,T., Naumann,F., Mann,M. and Doerfler,W. TITLE Rapid protein sequencing by tandem mass spectrometry and cDNA cloning of p20-CGGBP. A novel protein that binds to the unstable triplet repeat 5'-d(cgg)n-3' in the human fmr1 gene JOURNAL J. Biol. Chem. 272 (27), 16761-16768 (1997) MEDLINE 97347474 COMMENT EST clone ID269133 genbank accession no. N36676 and N24697. FEATURES Location/Qualifiers source 1..779 /organism="Homo sapiens" /plasmid="pT7T3D" /db_xref="taxon:9606" /cell_type="melanocytes" /chromosome="3" /clone_lib="soares melanocytes 2NbHM" /lab_host="DH10B" /sex="male" gene 197..759 /gene="CGGBP" CDS 197..700 /gene="CGGBP" /function="DNA binding protein" /note="p20-CGGBP binds highly sequence-specific to the double-stranded triplet repeat 5-d(CGG)n-3. Binding is severely inhibited by cytosine-specific DNA-methylation. p20-CGGBP was isolated from human HeLa cells, three internal peptides were sequenced that were completely contained in the corrected sequence of EST clone ID269133 (genbank acc.-No: N36676, N24697). The derived aa sequence lacks any overall homology to any known protein. p20-CGGBP RNA is expressed in a variety of human tissues. The cDNA-sequence of p20-CGGBP is highly conserved among mammals. Sequenced Peptides: peptide1: aa 4 to 12 FVVTAPPAR peptide2: aa 19 to 21 LYV peptide3: aa 101 to 104 VSVIQ putative nuclear localization signal: aa 69 to 84" /codon_start=1 /evidence=experimental /product="p20-CGGBP" /db_xref="PID:e329707" /db_xref="PID:g2274964" /translation="MERFVVTAPPARNRSKTALYVTPLDRVTEFGGELHEDGGKLFCT SCNVVLNHVRKSAISDHLKSKTHTKRKAEFEEQNVRKKQRPLTASLQCNSTAQTEKVS VIQDFVKMCLEANIPLEKADHPAVRAFLSRHVKNGGSIPKSDQLRRAYLPDGYENENQ LLNSQDC" polyA_signal 729..732 /gene="CGGBP" polyA_site 759 /gene="CGGBP" BASE COUNT 242 a 161 c 175 g 201 t ORIGIN 1 ggcacgaggg tttcgctctg gagaccattc cctgctaagt atcaagacga aaaaaactgg 61 aaactaatcc gaatttctgt ggaatgttta atcttctgga tccatgactg tctgatacgt 121 tggcaattta aagtcctttt gaaagagagt tcatgttacc cagctattct ctaaaccata 181 tttatttaga gtcagaatgg agcgatttgt agtaacagca ccacctgctc gaaaccgttc 241 taagactgct ttgtatgtga ctcccctgga tcgagtcact gagtttggag gtgagctgca 301 tgaagatgga ggaaaactct tctgcacttc ttgcaatgtg gttctgaatc atgttcgcaa 361 gtctgccatt agtgaccacc tcaagtcaaa gactcatacc aagaggaagg cagaatttga 421 agagcagaat gtgagaaaga agcagaggcc cctaactgca tctcttcagt gcaacagtac 481 tgcgcaaaca gagaaagtca gtgttatcca ggactttgtg aaaatgtgcc tggaagccaa 541 catcccactt gagaaggctg atcacccagc agtccgtgct ttcctatctc gccatgtgaa 601 gaatggaggc tccataccta agtcagacca gctacggagg gcatatcttc ctgatggata 661 tgagaatgag aatcaactcc tcaactcaca agattgttga ctaggaggtt accaccattg 721 tgatcaagat aaatgtggag tattaaagtt atgtgttgaa aaaaaaaaaa aaaaaaact // LOCUS HSCGJP 3038 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for cardiac gap junction protein. ACCESSION X52947 NID g29916 KEYWORDS gap junction protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3038) AUTHORS Fishman,G.I. TITLE Direct Submission JOURNAL Submitted (02-MAY-1990) Fishman G.I., Albert Einstein College of Medicine, Dpt. of Microbiology and Immunology, 1300 Morris Park Avenue, Bronx, 10461 NY, USA REFERENCE 2 (bases 1 to 3038) AUTHORS Fishman,G.I., Spray,D.C. and Leinwand,L.A. TITLE Molecular characterization and functional expression of the human cardiac gap junction channel JOURNAL J. Cell Biol. 111 (2), 589-598 (1990) MEDLINE 90338113 FEATURES Location/Qualifiers source 1..3038 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="foetal" /tissue_type="cardiac muscle" CDS 158..1306 /note="gap junction protein (AA 1-382)" /codon_start=1 /db_xref="PID:g29917" /db_xref="SWISS-PROT:P17302" /translation="MGDWSALGKLLDKVQAYSTAGGKVWLSVLFIFRILLLGTAVESA WGDEQSAFRCNTQQPGCENVCYDKSFPISHVRFWVLQIIFVSVPTLLYLAHVFYVMRK EEKLNKKEEELKVAQTDGVNVDMHLKQIEIKKFKYGIEEHGKVKMRGGLLRTYIISIL FKSIFEVAFLLIQWYIYGFSLSAVYTCKRDPCPHQVDCFLSRPTEKTIFIIFMLVVSL VSLALNIIELFYVFFKGVKDRVKGKSDPYHATSGALSPAKDCGSQKYAYFNGCSSPTA PLSPMSPPGYKLVTGDRNNSSCRNYNKQASEQNWANYSAEQNRMGQAGSTISNSHAQP FDFPDDNQNSKKLAAGHELQPLAIVDQRPSSRASSRASSRPRPDDLEI" misc_feature 3012..3017 /note="polyA signal" BASE COUNT 859 a 584 c 645 g 950 t ORIGIN 1 gcgtgaggaa agtaccaaac agcagcggag ttttaaactt taaatagaca ggtctgagtg 61 cctgaacttg ccttttcatt ttacttcatc ctccaaggag ttcaatcact tggcgtgact 121 tcactacttt taagcaaaag agtggtgccc aggcaacatg ggtgactgga gcgccttagg 181 caaactcctt gacaaggttc aagcctactc aactgctgga gggaaggtgt ggctgtcagt 241 acttttcatt ttccgaatcc tgctgctggg gacagcggtt gagtcagcct ggggagatga 301 gcagtctgcc tttcgttgta acactcagca acctggttgt gaaaatgtct gctatgacaa 361 gtctttccca atctctcatg tgcgcttctg ggtcctgcag atcatatttg tgtctgtacc 421 cacactcttg tacctggctc atgtgttcta tgtgatgcga aaggaagaga aactgaacaa 481 gaaagaggaa gaactcaagg ttgcccaaac tgatggtgtc aatgtggaca tgcacttgaa 541 gcagattgag ataaagaagt tcaagtacgg tattgaagag catggtaagg tgaaaatgcg 601 aggggggttg ctgcgaacct acatcatcag tatcctcttc aagtctatct ttgaggtggc 661 cttcttgctg atccagtggt acatctatgg attcagcttg agtgctgttt acacttgcaa 721 aagagatccc tgcccacatc aggtggactg tttcctctct cgccccacgg agaaaaccat 781 cttcatcatc ttcatgctgg tggtgtcctt ggtgtccctg gccttgaata tcattgaact 841 cttctatgtt ttcttcaagg gcgttaagga tcgggttaag ggaaagagcg acccttacca 901 tgcgaccagt ggtgcgctga gccctgccaa agactgtggg tctcaaaaat atgcttattt 961 caatggctgc tcctcaccaa ccgctcccct ctcgcctatg tctcctcctg ggtacaagct 1021 ggttactggc gacagaaaca attcttcttg ccgcaattac aacaagcaag caagtgagca 1081 aaactgggct aattacagtg cagaacaaaa tcgaatgggg caggcgggaa gcaccatctc 1141 taactcccat gcacagcctt ttgatttccc cgatgataac cagaattcta aaaaactagc 1201 tgctggacat gaattacagc cactagccat tgtggaccag cgaccttcaa gcagagccag 1261 cagtcgtgcc agcagcagac ctcggcctga tgacctggag atctagatac aggcttgaaa 1321 gcatcaagat tccactcaat tgtggagaag aaaaaaggtg ctgtagaaag tgcaccaggt 1381 gttaattttg atccggtgga ggtggtactc aacagcctta ttcatgaggc ttagaaaaca 1441 caaagacatt agaataccta ggttcactgg gggtgtatgg ggtagatggg tggagaggga 1501 ggggataaga gaggtgcatg ttggtattta aagtagtgga ttcaaagaac ttagattata 1561 aataagagtt ccattaggtg atacatagat aagggctttt tctccccgca aacaccccta 1621 agaatggttc tgtgtatgtg aatgagcggg tggtaattgt ggctaaatat ttttgtttta 1681 ccaagaaact gaaataattc tggccaggaa taaatacttc ctgaacatct taggtctttt 1741 caacaagaaa aagacagagg attgtcctta agtccctgct aaaacattcc attgttaaaa 1801 tttgcacttt gaaggtaagc tttctaggcc tgaccctcca ggtgtcaatg gacttgtgct 1861 actatatttt tttattcttg gtatcagttt aaaattcaga caaggcccac agaataagat 1921 tttccatgca tttgcaaata cgtatattct ttttccatcc acttgcacaa tatcattacc 1981 atcacttttt catcattcct cagctactac tcacattcat ttaatggttt ctgtaaacat 2041 ttttaagaca gttgggatgt cacttaacat tttttttttt tgagctaaag tcagggaatc 2101 aagccatgct taatatttaa caatcactta tatgtgtgtc gaagagtttg ttttgtttgt 2161 catgtattgg tacaagcaga tacagtataa actcacaaac acagatttga aaataatgca 2221 catatggtgt tcaaatttga acctttctca tggatttttg tggtgtgggc caatatggtg 2281 tttacattat ataattcctg ctgtggcaag taaagcacac tttttttttc tcctaaaatg 2341 tttttccctg tgtatcctat tatggatact ggttttgtta attatgattc tttattttct 2401 ctcctttttt taggatatag cagtaatgct attactgaaa tgaatttcct ttttctgaaa 2461 tgtaatcatt gatgcttgaa tgatagaatt ttagtactgt aaacaggctt tagtcattaa 2521 tgtgagagac ttagaaaaaa tgcttagagt ggactattaa atgtgcctaa atgaattttg 2581 cagtaactgg tattcttggg ttttcctact taatacacag taattcagaa cttgtattct 2641 attatgagtt tagcagtctt ttggagtgac cagcaacttt gatgtttgca ctaagatttt 2701 atttggaatg caagagaggt tgaaagagga ttcagtagta cacatacaac taatttattt 2761 gaactatatg ttgaagacat ctaccagttt ctccaaatgc cttttttaaa actcatcaca 2821 gaagattggt gaaaatgctg agtatgacac ttttcttctt gcatgcatgt cagctacata 2881 aacagttttg tacaatgaaa attactaatt tgtttgacat tccatgttaa actacggtca 2941 tgttcagctt cattgcatgt aatgtagacc tagtccatca gatcatgtgt tctggagagt 3001 gttctttatt caataaagtt ttaatttagt ataaacat // LOCUS HSCGMPPM 2833 bp RNA PRI 16-NOV-1993 DEFINITION H.sapiens mRNA for rod cGMP phosphodiesterase. ACCESSION X66142 S59192 NID g396492 KEYWORDS 3',5'-cyclic-nucleotide phosphodiesterase; photoreceptor protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2833) AUTHORS Khramtsov,N.V. TITLE Direct Submission JOURNAL Submitted (03-JUN-1992) N.V. Khramtsov, Shemyakin Institute of Bioorganic Chemistry, Russian Academy of Sciences, ul.Miklukho-Maklaya, 16/10, 117871 GSP Moscow V-437, Russia REFERENCE 2 (bases 1 to 2833) AUTHORS Khramtsov,N.V., Feshchenko,E.A., Suslova,V.A., Shmukler,B.E., Terpugov,B.E., Rakitina,T.V., Atabekova,N.V. and Lipkin,V.M. TITLE The human rod photoreceptor cGMP phosphodiesterase beta-subunit. Structural studies of its cDNA and gene JOURNAL FEBS Lett. 327 (3), 275-278 (1993) MEDLINE 93351644 REFERENCE 3 (bases 1 to 2833) AUTHORS Khramtsov,N.V., Feshchenko,E.A., Suslova,V.A., Terpugov,B.E., Rakitina,T.V., Atabekova,N.V., Shmukler,B.E. and Lipkin,V.M. TITLE Structural studies of cDNA and the gene for the beta-subunit of cGMP phosphodiesterase from human retina JOURNAL Bioorg. Khim. 18 (12), 1551-1554 (1992) MEDLINE 93244036 FEATURES Location/Qualifiers source 1..2833 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="retinal" /cell_type="rod photoreceptor" /clone_lib="lambda gt10 retinal cDNA" /clone="pBh-10, pBh-20, pBh-14" gene 6..2570 /gene="Rod cG-PDE Hum" CDS 6..2570 /gene="Rod cG-PDE Hum" /EC_number="3.1.4.17" /function="cGMP hydrolysis" /note="Rod cGMP phosphodiesterase" /codon_start=1 /product="3',5'-cyclic-nucleotide phosphodiesterase" /db_xref="PID:g396493" /db_xref="SWISS-PROT:P35913" /translation="MSLSEEQARSFLDQNPDFARQYFGKKLSPENVGRGCEDGCPPDC DSLRDLCQVEESTALLELVQDMQESINMERVVFKVLRRLCTLLQADRCSLFMYRQRNG VAELATRLFSVQPDSVLEDCLVPPDSEIVFPLDIGVVGHVAQTKKMVNVEDVAECPHF SSFADELTDYKTKNMLATPIMNGKDVVAVIMAVNKLNGPFFTSEDEDVFLKYLNFATL YLKIYHLSYLHNCETRRGQVLLWSANKVFEELTDIERQFHKAFYTVRAYLNCERYSVG LLDMTKEKEFFDVWSVLMGESQPYSGPRTPDGREIVFYKVIDYILHGKEEIKVIPTPS ADHWALASGLPSYVAESGFICNIMNASADEMFKFQEGALDDSGWLIKNVLSMPIVNKK EEIVGVATFYNRKDGKPFDEQDEVLMESLTQFLGWSVMNTDTYDKMNKLENRKDIAQD MVLYHVKCDRDEIQLILPTRARLGKEPADCDEDELGEILKEELPGPTTFDIYEFHFSD LECTELDLVKCGIQMYYELGVVRKFQIPQEVLVRFLFSISKGYRRITYHNWRHGFNVA QTMFTLLMTGKLKSYYTDLEAFAMVTAGLCHDIDHRGTNNLYQMKSQNPLAKLHGSSI LERHHLEFGKFLLSEETLNIYQNLNRRQHEHVIHLMDIAIIATDLALYFKKRAMFQKI VDESKNYQDKKSWVEYLSLETTRKEIVMAMMMTACDLSAITKPWEVQSKVALLVAAEF WEQGDLERTVLDQQPIPMMDRNKAAELPKLQVGFIDFVCTFVYKEFSRFHEEILPMFD RLQNNRKEWKALADEYEAKVKALEEKEEEERVAAKKVGTEICNGGPAPKSSTCCIL" BASE COUNT 662 a 791 c 811 g 569 t ORIGIN 1 ccaccatgag cctcagtgag gagcaggccc ggagctttct ggaccagaac cccgattttg 61 cccgccagta ctttgggaag aaactgagcc ctgagaatgt tggccgcggc tgcgaggacg 121 ggtgcccgcc ggactgcgac agcctccggg acctctgcca ggtggaggag agcacggcgc 181 tgctggagct ggtgcaggat atgcaggaga gcatcaacat ggagcgcgtg gtcttcaagg 241 tcctgcggcg cctctgcacc ctcctgcagg ccgaccgctg cagcctcttc atgtaccgcc 301 agcgcaacgg cgtggccgag ctggccacca ggcttttcag cgtgcagccg gacagcgtcc 361 tggaggactg cctggtgccc cccgactccg agatcgtctt cccactggac atcggggtcg 421 tgggccacgt ggctcagacc aaaaagatgg tgaacgtcga ggacgtggcc gagtgccctc 481 acttcagctc atttgctgac gagctcactg actacaagac aaagaatatg ctggccacac 541 ccatcatgaa tggcaaagac gtcgtggcgg tgatcatggc agtgaacaag ctcaacggcc 601 cattcttcac cagcgaagac gaagatgtgt tcttgaagta cctgaatttt gccacgttgt 661 acctgaagat ctatcacctg agctacctcc acaactgcga gacgcgccgc ggccaggtgc 721 tgctgtggtc ggccaacaag gtgtttgagg agctgacgga catcgagagg cagttccaca 781 aggccttcta cacggtgcgg gcctacctca actgcgagcg gtactccgtg ggcctcctgg 841 acatgaccaa ggagaaggaa ttttttgacg tgtggtctgt gctgatggga gagtcccagc 901 cgtactcggg cccacgcacg cctgatggcc gggaaattgt cttctacaaa gtgatcgact 961 acatcctcca cggcaaggag gagatcaagg tcattcccac accctcagcc gatcactggg 1021 ccctggccag cggccttcca agctacgtgg cagaaagcgg ctttatttgt aacatcatga 1081 atgcttccgc tgacgaaatg ttcaaatttc aggaaggggc cctggacgac tccgggtggc 1141 tcatcaagaa tgtgctgtcc atgcccatcg tcaacaagaa ggaggagatt gtgggagtcg 1201 ccacatttta caacaggaaa gacgggaagc cctttgacga acaggacgag gttctcatgg 1261 agtccctgac acagttcctg ggctggtcag tgatgaacac cgacacctac gacaagatga 1321 acaagctgga gaaccgcaag gacatcgcac aggacatggt cctttaccac gtgaagtgcg 1381 acagggacga gatccagctc atcctgccaa ccagagcgcg cctggggaag gagcctgctg 1441 actgcgatga ggacgagctg ggcgaaatcc tgaaggagga gctgccaggg cccaccacat 1501 ttgacatcta cgaattccac ttctctgacc tggagtgcac cgaactggac ctggtcaaat 1561 gtggcatcca gatgtactac gagctgggcg tggtccgaaa gttccagatc ccccaggagg 1621 tcctggtgcg gttcctgttc tccatcagca aagggtaccg gagaatcacc taccacaact 1681 ggcgccacgg cttcaacgtg gcccagacga tgttcacgct gctcatgacc ggcaaactga 1741 agagctacta cacggacctg gaggccttcg ccatggtgac agccggcctg tgccatgaca 1801 tcgaccaccg cggcaccaac aacctgtacc agatgaagtc ccagaacccc ttggctaagc 1861 tccacggctc ctcgattttg gagcggcacc acctggagtt tgggaagttc ctgctctcgg 1921 aggagaccct gaacatctac cagaacctga accggcggca gcacgagcac gtgatccacc 1981 tgatggacat cgccatcatc gccacggacc tggccctgta cttcaagaag agagcgatgt 2041 ttcagaagat cgtggatgag tccaagaact accaggacaa gaagagctgg gtggagtacc 2101 tgtccctgga gacgacccgg aaggagatcg tcatggccat gatgatgaca gcctgcgacc 2161 tgtctgccat caccaagccc tgggaagtcc agagcaaggt cgcacttctc gtggctgctg 2221 agttctggga gcaaggtgac ttggaaagga cagtcttgga tcagcagccc attcctatga 2281 tggaccggaa caaggcggcc gagctcccca agctgcaagt gggcttcatc gacttcgtgt 2341 gcacattcgt gtacaaggag ttctctcgtt tccacgaaga gatcctgccc atgttcgacc 2401 gactgcagaa caataggaaa gagtggaagg cgctggctga tgagtatgag gccaaagtga 2461 aggctctgga ggagaaggag gaggaggaga gggtggcagc caagaaagta ggcacagaaa 2521 tttgcaatgg cggcccagca cccaagtctt caacctgctg tatcctgtga gcactggccc 2581 gtgggacact atggctccct caatcttcac ccactaggat ttgggttctg cctgtggcta 2641 tttgctacaa gaggttagga agcccaagaa aatgactgaa gatcattctg gatattttaa 2701 tttttttttt tttttttttt tttttttttg agatggagtc ttgctctgtc acccaggctg 2761 gagtgccgtg gcacgatctc agctcactgc aacctccacc tcccaggttc aagcgattct 2821 cgtgcctcag cct // LOCUS HSCH16FAA 5503 bp RNA PRI 07-NOV-1996 DEFINITION H.sapiens mRNA for FAA protein. ACCESSION X99226 NID g1657311 KEYWORDS complementation group A protein; FAA gene; Fanconi anemia. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5503) AUTHORS Lo Ten Foe,J.R., Rooimans,M.A., Bosnoyan-Collins,L., Alon,N., Wijker,M., Parker,L., Lightfoot,J., Carreau,M., Callen,D.F., Savoia,A., Cheng,N.C., Van Berkel,C.G.M., Strunk,M.H.P., Gille,J.J.P., Pals,G., Kruyt,F.A.E., Pronk,J.C., Arwert,F., Buchwald,M. and Joenje,H. TITLE Expression cloning of a cDNA for the major Fanconi anaemia gene, FAA JOURNAL Nature Genet. 14 (3), 320-323 (1996) MEDLINE 97051928 REFERENCE 2 (bases 1 to 5503) AUTHORS Joenje,H. TITLE Direct Submission JOURNAL Submitted (09-JUL-1996) H. Joenje, Free University of Amsterdam, Department of Human Genetics, Van der Boechorststraat 7, NL-1081 BT Amsterdam, NETHERLANDS REMARK Revised by [3] REFERENCE 3 (bases 1 to 5503) AUTHORS Joenje,H. TITLE Direct Submission JOURNAL Submitted (24-SEP-1996) H. Joenje, Free University of Amsterdam, Department of Human Genetics, Van der Boechorststraat 7, NL-1081 BT Amsterdam, NETHERLANDS COMMENT Related ESTs: HS618125, HS950128, HST35075, HS21376, HSC1VH032, HS16281, HS04071. FEATURES Location/Qualifiers source 1..5503 /organism="Homo sapiens" /isolate="healthy control" /db_xref="taxon:9606" /cell_type="lymphoblastoid" /cell_line="HSC93" /clone_lib="pREP4" /clone="D" /chromosome="16" /map="q24.3" gene 32..4399 /gene="FAA" CDS 32..4399 /gene="FAA" /function="acts with other genes to control FA pathway" /codon_start=1 /product="Fanconi anemia complementation group A protein" /db_xref="PID:e266883" /db_xref="PID:g1657312" /translation="MSDSWVPNSASGQDPGGRRRAWAELLAGRVKREKYNPERAQKLK ESAVRLLRSHQDLNALLLEVEGPLCKKLSLSKVIDCDSSEAYANHSSSFIGSALQDQA SRLGVPVGILSAGMVASSVGQICTAPAETSHPVLLTVEQRKKLSSLLEFAQYLLAHSM FSRLSFCQELWKIQSSLLLEAVWHLHVQGIVSLQELLESHPDMHAVGSWLFRNLCCLC EQMEASCQHADVARAMLSDFVQMFVLRGFQKNSDLRRTVEPEKMPQVTVDVLQRMLIF ALDALAAGVQEESSTHKIVRCWFGVFSGHTLGSVISTDPLKRFFSHTLTQILTHSPVL KASDAVQMQREWSFARTHPLLTSLYRRLFVMLSAEELVGHLQEVLETQEVHWQRVLSF VSALVVCFPEAQQLLEDWVARLMAQAFESCQLDSMVTAFLVVRQAALEGPSAFLSYAD WFKASFGSTRGYHGCSKKALVFLFTFLSELVPFESPRYLQVHILHPPLVPSKYRSLLT DYISLAKTRLADLKVSIENMGLYEDLSSAGDITEPHSQALQDVEKAIMVFEHTGNIPV TVMEASIFRRPYYVSHFLPALLTPRVLPKVPDSRVAFIESLKRADKIPPSLYSTYCQA CSAAEEKPEDAALGVRAEPNSAEEPLGQLTAALGELRASMTDPSQRDVISAQVAVISE RLRAVLGHNEDDSSVEISKIQLSINTPRLEPREHIAVDLLLTSFCQNLMAASSVAPPE RQGPWAALFVRTMCGRVLPAVLTRLCQLLRHQGPSLSAPHVLGLAALAVHLGESRSAL PEVDVGPPAPGAGLPVPALFDSLLTCRTRDSLFFCLKFCTAAISYSLCKFSSQSRDTL CSCLSPGLIKKFQFLMFRLFSEARQPLSEEDVASLSWRPLHLPSADWQRAALSLWTHR TFREVLKEEDVHLTYQDWLHLELEIQPEADALSDTERQDFHQWAIHEHFLPESSASGG CDGDLQAACTILVNALMDFHQSSRSYDHSENSDLVFGGRTGNEDIISRLQEMVADLEL QQDLIVPLGHTPSQEHFLFEIFRRRLQALTSGWSVAASLQRQRELLMYKRILLRLPSS VLCGSSFQAEQPITARCEQFFHLVNSEMRNFCSHGGALTQDITAHFFRGLLNACLRSR DPSLMVDFILAKCQTKCPLILTSALVWWPSLEPVLLCRWRRHCQSPLPRELQKLQEGR QFASDFLSPEAASPAPNPDWLSAAALHFAIQQVREENIRKQLKKLDCEREELLVFLFF FSLMGLLSSHLTSNSTTDLPKAFHVCAAILECLEKRKISWLALFQLTESDLRLGRLLL RVAPDQHTRLLPFAFYSLLSYFHEDAAIREEAFLHVAVDMYLKLVQLFVAGDTSTVSP PAGRSLELKGQGNPVELITKARLFLLQLIPRCPKKSFSHVAELLADRGDCDPEVSAAL QSRQQAAPDADLSQEPHLF" BASE COUNT 1208 a 1527 c 1492 g 1276 t ORIGIN 1 agccgccgcc ggggctgtag gcgccaaggc catgtccgac tcgtgggtcc cgaactccgc 61 ctcgggccag gacccagggg gccgccggag ggcctgggcc gagctgctgg cgggaagggt 121 caagagggaa aaatataatc ctgaaagggc acagaaatta aaggaatcag ctgtgcgcct 181 cctgcgaagc catcaggacc tgaatgccct tttgcttgag gtagaaggtc cactgtgtaa 241 aaaattgtct ctcagcaaag tgattgactg tgacagttct gaggcctatg ctaatcattc 301 tagttcattt ataggctctg ctttgcagga tcaagcctca aggctggggg ttcccgtggg 361 tattctctca gccgggatgg ttgcctctag cgtgggacag atctgcacgg ctccagcgga 421 gaccagtcac cctgtgctgc tgactgtgga gcagagaaag aagctgtctt ccctgttaga 481 gtttgctcag tatttattgg cacacagtat gttctcccgt ctttccttct gtcaagaatt 541 atggaaaata cagagttctt tgttgcttga agcggtgtgg catcttcacg tacaaggcat 601 tgtgagcctg caagagctgc tggaaagcca tcccgacatg catgctgtgg gatcgtggct 661 cttcaggaat ctgtgctgcc tttgtgaaca gatggaagca tcctgccagc atgctgacgt 721 cgccagggcc atgctttctg attttgttca aatgtttgtt ttgaggggat ttcagaaaaa 781 ctcagatctg agaagaactg tggagcctga aaaaatgccg caggtcacgg ttgatgtact 841 gcagagaatg ctgatttttg cacttgacgc tttggctgct ggagtacagg aggagtcctc 901 cactcacaag atcgtgaggt gctggttcgg agtgttcagt ggacacacgc ttggcagtgt 961 aatttccaca gatcctctga agaggttctt cagtcatacc ctgactcaga tactcactca 1021 cagccctgtg ctgaaagcat ctgatgctgt tcagatgcag agagagtgga gctttgcgcg 1081 gacacaccct ctgctcacct cactgtaccg caggctcttt gtgatgctga gtgcagagga 1141 gttggttggc catttgcaag aagttctgga aacgcaggag gttcactggc agagagtgct 1201 ctcctttgtg tctgccctgg ttgtctgctt tccagaagcg cagcagctgc ttgaagactg 1261 ggtggcgcgt ttgatggccc aggcattcga gagctgccag ctggacagca tggtcactgc 1321 gttcctggtt gtgcgccagg cagcactgga gggcccctct gcgttcctgt catatgcaga 1381 ctggttcaag gcctcctttg ggagcacacg aggctaccat ggctgcagca agaaggccct 1441 ggtcttcctg tttacgttct tgtcagaact cgtgcctttt gagtctcccc ggtacctgca 1501 ggtgcacatt ctccacccac ccctggttcc cagcaagtac cgctccctcc tcacagacta 1561 catctcattg gccaagacac ggctggccga cctcaaggtt tctatagaaa acatgggact 1621 ctacgaggat ttgtcatcag ctggggacat tactgagccc cacagccaag ctcttcagga 1681 tgttgaaaag gccatcatgg tgtttgagca tacggggaac atcccagtca ccgtcatgga 1741 ggccagcata ttcaggaggc cttactacgt gtcccacttc ctccccgccc tgctcacacc 1801 tcgagtgctc cccaaagtcc ctgactcccg tgtggcgttt atagagtctc tgaagagagc 1861 agataaaatc cccccatctc tgtactccac ctactgccag gcctgctctg ctgctgaaga 1921 gaagccagaa gatgcagccc tgggagtgag ggcagaaccc aactctgctg aggagcccct 1981 gggacagctc acagctgcac tgggagagct gagagcctcc atgacagacc ccagccagcg 2041 tgatgttata tcggcacagg tggcagtgat ttctgaaaga ctgagggctg tcctgggcca 2101 caatgaggat gacagcagcg ttgagatatc aaagattcag ctcagcatca acacgccgag 2161 actggagcca cgggaacaca ttgctgtgga cctcctgctg acgtctttct gtcagaacct 2221 gatggctgcc tccagtgtcg ctcccccgga gaggcagggt ccctgggctg ccctcttcgt 2281 gaggaccatg tgtggacgtg tgctccctgc agtgctcacc cggctctgcc agctgctccg 2341 tcaccagggc ccgagcctga gtgccccaca tgtgctgggg ttggctgccc tggccgtgca 2401 cctgggtgag tccaggtctg cgctcccaga ggtggatgtg ggtcctcctg cacctggtgc 2461 tggccttcct gtccctgcgc tctttgacag cctcctgacc tgtaggacga gggattcctt 2521 gttcttctgc ctgaaatttt gtacagcagc aatttcttac tctctctgca agttttcttc 2581 ccagtcacga gatactttgt gcagctgctt atctccaggc cttattaaaa agtttcagtt 2641 cctcatgttc agattgttct cagaggcccg acagcctctt tctgaggagg acgtagccag 2701 cctttcctgg agacccttgc accttccttc tgcagactgg cagagagctg ccctctctct 2761 ctggacacac agaaccttcc gagaggtgtt gaaagaggaa gatgttcact taacttacca 2821 agactggtta cacctggagc tggaaattca acctgaagct gatgctcttt cagatactga 2881 acggcaggac ttccaccagt gggcgatcca tgagcacttt ctccctgagt cctcggcttc 2941 agggggctgt gacggagacc tgcaggctgc gtgtaccatt cttgtcaacg cactgatgga 3001 tttccaccaa agctcaagga gttatgacca ctcagaaaat tctgatttgg tctttggtgg 3061 ccgcacagga aatgaggata ttatttccag attgcaggag atggtagctg acctggagct 3121 gcagcaagac ctcatagtgc ctctcggcca caccccttcc caggagcact tcctctttga 3181 gattttccgc agacggctcc aggctctgac aagcgggtgg agcgtggctg ccagccttca 3241 gagacagagg gagctgctaa tgtacaaacg gatcctcctc cgcctgcctt cgtctgtcct 3301 ctgcggcagc agcttccagg cagaacagcc catcactgcc agatgcgagc agttcttcca 3361 cttggtcaac tctgagatga gaaacttctg ctcccacgga ggtgccctga cacaggacat 3421 cactgcccac ttcttcaggg gcctcctgaa cgcctgtctg cggagcagag acccctccct 3481 gatggtcgac ttcatactgg ccaagtgcca gacgaaatgc cccttaattt tgacctctgc 3541 tctggtgtgg tggccgagcc tggagcctgt gctgctctgc cggtggagga gacactgcca 3601 gagcccgctg ccccgggaac tgcagaagct acaagaaggc cggcagtttg ccagcgattt 3661 cctctcccct gaggctgcct ccccagcacc caacccggac tggctctcag ctgctgcact 3721 gcactttgcg attcaacaag tcagggaaga aaacatcagg aagcagctaa agaagctgga 3781 ctgcgagaga gaggagctat tggttttcct tttcttcttc tccttgatgg gcctgctgtc 3841 gtcacatctg acctcaaata gcaccacaga cctgccaaag gctttccacg tttgtgcagc 3901 aatcctcgag tgtttagaga agaggaagat atcctggctg gcactctttc agttgacaga 3961 gagtgacctc aggctggggc ggctcctcct ccgtgtggcc ccggatcagc acaccaggct 4021 gctgcctttc gctttttaca gtcttctctc ctacttccat gaagacgcgg ccatcaggga 4081 agaggccttc ctgcatgttg ctgtggacat gtacttgaag ctggtccagc tcttcgtggc 4141 tggggataca agcacagttt cacctccagc tggcaggagc ctggagctca agggtcaggg 4201 caaccccgtg gaactgataa caaaagctcg tctttttctg ctgcagttaa tacctcggtg 4261 cccgaaaaag agcttctcac acgtggcaga gctgctggct gatcgtgggg actgcgaccc 4321 agaggtgagc gccgccctcc agagcagaca gcaggctgcc cctgacgctg acctgtccca 4381 ggagcctcat ctcttctgac gggacctgcc actgcacacc agcccagctc ccgtgtaaat 4441 aatttattac aagcataaca tggagctctt gttgcactaa aaagtggatt acaaatctcc 4501 tcgactgctt tagtggggaa aggaatcaat tatttatgaa ctgtccggcc ccgagtcact 4561 cagcgtttgc gggaaaataa accactggtc ccagagcaga ggaaggctac ttgagccgga 4621 caccaagccc gcctccagca ccaagggcgg gcagcaccct ccgaccctcc catgcgggtg 4681 cacacgaagg gtgaggctga cacagccact gcggagtcca ggctgctaga ggtgctcatc 4741 ctcactgccg tcctcaggtg ggttcgggct tcaccgcctg gccctctgtg gtcacagagg 4801 ggctcggtgg cccaggtggt ggttccgcct ccaggggcag ggccttgtcc tgggtctgtg 4861 tcagcgggtg caccatggac atgtgtacat tgaggttgtg ggccttctca aaccgccggc 4921 cacactggtc acaggcaaag tccagctcag tctcagcctt gtgtttggtc atgtggtact 4981 tgagggatgc ccgctgcctg cactggaacc cacagacctc acacctgggg gacagaggca 5041 gataagaagg tgcgaggcca cagccctggg agggggtcct gactcacact tactgcaaag 5101 gcttggctcc cgaatgtcgc atttggtgga cgagaaggtg cttccgctgc ttgaaggttt 5161 gtccacattc gtcacagata tagttccgca cctctgagag gggagagtcc agtgagtcca 5221 ggcccctgat gctccaacct cccgggggga cgacgatgac aatgtgaaac catcacagct 5281 gggaagacat ttctgcacat ggttcaccat gcagtgggcc caagcaaggg gcctatgagg 5341 gcctcgttta ttaagatctt taaactgctt tatacactgt cacgtggctt catcagctgt 5401 gtgcatttca ggatggtttt taaagaaacc tcagaaagct atttccttaa aaaaaaaaaa 5461 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa // LOCUS HSCHK2 5292 bp RNA PRI 17-FEB-1997 DEFINITION H.sapiens HK2 mRNA for hexokinase II. ACCESSION Z46376 NID g587201 KEYWORDS hexokinase II. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5292) AUTHORS Lehto,M., Huang,X., Davis,E.M., Le Beau,M.M., Laurila,E., Eriksson,K.F., Bell,G.I. and Groop,L. TITLE Human hexokinase II gene: exon-intron organization, mutation screening in NIDDM, and its relationship to muscle hexokinase activity JOURNAL Diabetologia 38 (12), 1466-1474 (1995) MEDLINE 96238411 REFERENCE 2 (bases 1 to 5292) AUTHORS Deeb,S.S., Malkki,M. and Laakso,M. TITLE Human hexokinase II: sequence and homology to other hexokinases JOURNAL Biochem. Biophys. Res. Commun. 197 (1), 68-74 (1993) MEDLINE 94071972 REMARK (sites) REFERENCE 3 (bases 1 to 5292) AUTHORS Lehto,M. TITLE Direct Submission JOURNAL Submitted (24-OCT-1994) Lehto M., University of Lund, Endocrinology/Wallenberg lab., Malmo, Sweden, 21401 FEATURES Location/Qualifiers source 1..5292 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lambda gt10" /clone_lib="Clontech lab." mRNA <1..>5292 /citation=[1] /evidence=experimental 5'UTR <1..1490 /citation=[1] /evidence=experimental conflict 1377 /citation=[2] /replace="c" conflict 1440..1444 /citation=[2] /replace="ctgc" conflict 1482 /citation=[2] /replace="t" gene 1491..4244 /gene="HK2" CDS 1491..4244 /gene="HK2" /citation=[1] /codon_start=1 /evidence=experimental /product="Human hexokinase II cDNA" /db_xref="PID:g587202" /db_xref="SWISS-PROT:P52789" /translation="MIASHLLAYFFTELNHDQVQKVDQYLYHMRLSDETLLEISKRFR KEMEKGLGATTHPTAAVKMLPTFVRSTPDGTEHGEFLALDLGGTNFRVLWVKVTDNGL QKVEMENQIYAIPEDIMRGSGTQLFDHIAECLANFMDKLQIKDKKLPLGFTFSFPCHQ TKLDESFLVSWTKGFKSSGVEGRDVVALIRKAIQRRGDFDIDIVAVVNDTVGTMMTCG YDDHNCEIGLIVGTGSNACYMEEMRHIDMVEGDEGRMCINMEWGAFGDDGSLNDIRTE FDQEIDMGSLNPGKQLFEKMISGMYMGELVRLILVKMAKEELLFGGKLSPELLNTGRF ETKDISDIEGEKDGIRKAREVLMRLGLDPTQEDCVATHRICQIVSTRSASLCAATLAA VLQRIKENKGEERLRSTIGVDGSVYKKHPHFAKRLHKTVRRLVPGCDVRFLRSEDGSG KGAAMVTAVAYRLADQHRARQKTLEHLQLSHDQLLEVKRRMKVEMERGLSKETHASAP VKMLPTYVCATPDGTEKGDFLALDLGGTNFRVLLVRVRNGKWGGVEMHNKIYAIPQEV MHGTGDELFDHIVQCIADFLEYMGMKGVSLPLGFTFSFPCQQNSLDESILLKWTKGFK ASGCEGEDVVTLLKEAIHRREEFDLDVVAVVNDTVGTMMTCGFEDPHCEVGLIVGTGS NACYMEEMRNVELVEGEEGRMCVNMEWGAFGDNGCLDDFRTEFDVAVDELSLNPGKQR FEKMISGMYLGEIVRNILIDFTKRGLLFRGRISERLKTRGIFETKFLSQIESDCLALL QVRATLQHLGLESTCDDSIIVKEVCTVVARRAAQLCGAGMAAVVDRIRENRGLDALKV TVGVDGTLYKLHPHFAKVMHETVKDLAPKCDVSFLQSEDGSGKGAALITAVACRIREA GQR" conflict 1524 /gene="HK2" /citation=[2] /replace="t" conflict 1808 /gene="HK2" /citation=[2] /replace="a" conflict 2703..2708 /gene="HK2" /citation=[2] /replace="cgcgct" conflict 3391 /gene="HK2" /citation=[2] /replace="c" conflict 3623 /gene="HK2" /citation=[2] /replace="a" conflict 3898 /gene="HK2" /citation=[2] /replace="t" 3'UTR 4245..>5292 /citation=[1] /evidence=experimental conflict 4262..4264 /citation=[2] /replace="gg" conflict 4373..4375 /citation=[2] /replace="ccc" conflict 4388..4389 /citation=[2] /replace="ta" conflict 4453..4455 /citation=[2] /replace="tttt" BASE COUNT 1137 a 1420 c 1502 g 1233 t ORIGIN 1 ctcacaatgc aggcgctcca gctctgacac gtgcacttct ttattggaaa agaataaagc 61 agtcactgat gtggcagggg caggacacga gcagctgccg tcctcctccc agcgtgcctg 121 gcatggtcgc aggggagcgg gtgcctggag tcccggtgac accacggggc acactgaggg 181 agctgaggag ccggggccgc gcagcctcct ggatgctcag cggatcgtgt acttgtccca 241 cttcttttca gggtcgtagg gttcccagcg gctggcggga aagatgtgct tgttcttctc 301 gtaccagctc ctcagcacca ccttgcctgc atgggactca tccttctcca cagtggcgtc 361 actgagcaac cgcacatcgt catgaacatc aaagttgaag agtggtccac tcttcccccg 421 tgccttggtg acgatgaagt cgtagaagct gtgatggtga gggatgatca agtcctcctt 481 gatgtacatg agctgctcca cccctgcgga cctcagctca ctgaagtctt tccgaaggat 541 ctcgagcgcc ttctgcagga actgctgcat ggtgttgccc tttctcatct tgactgtccg 601 ccggtgccca gagccatccc agtagctgaa ggtgatctcg atctcctcac tcttgatctt 661 ctcctgcttg gcttcccact cctgccgcag ctcttcccga agccgattct cctcctcctc 721 acggtctcga tcaggcaaga agcttgtgtc aacgtctggg ttcttcccca gttttctctt 781 cttcgtggtg atctcttccc tttccatctc ctcctcatac atggccgcct cctcttcctc 841 ctcgcctccc tcttcttcct cctccagggt gaaggacagg ctggagatct tccgcttggc 901 ttccttctta cgctccttct ctcgaagctt ctccagcttc atctgcagct ccttggactg 961 ctccttcttg gccagctgct tctcccgctc cttcaccaga gcctcctgct tggccttcat 1021 gtcattcagg gtcacgagac ccacggtgct ggacttgagc tctgcctccc gggctgccgc 1081 tggcaacatc gtgtcaccca gctaagaaaa tccgcgggcc cgagccacgc gcctgtgaat 1141 cggagaggtc ccactgcccg agtggagccg ggctgagatt cttctcaagt tgagcctcag 1201 tgatcctgtg gccgaagtta gcgccttgac gtgggacaac cggacacgtc gccaggagag 1261 aactgaggcg ccttctagca gttgtgacgc caaaatcacg tctccggaga cccgcgccct 1321 ccgccagccg ggcgcaccct cgccggtagc cttctttgtg cgccgtccgg actcccagct 1381 cccggcccgg cagccgagcc ccagcacaaa gcagtcggac cgcgccgccc gcctcccctc 1441 tcgcgtctcc gcctcggttt cccaactctg cgccgtcggg ccgcggcagg atgattgcct 1501 cgcatctgct tgcctacttc ttcacggagc tcaaccatga ccaagtgcag aaggttgacc 1561 agtatctcta ccacatgcgc ctctctgatg agaccctctt ggagatctct aagcggttcc 1621 gcaaggagat ggagaaaggg cttggagcca ccactcaccc tactgcagca gtgaagatgc 1681 tgcccacctt tgtgaggtcc actccagatg ggacagaaca cggagagttc ctggctctgg 1741 atcttggagg gaccaacttc cgtgtgcttt gggtgaaagt aacggacaat gggctccaga 1801 aggtggagat ggagaatcag atctatgcca tccctgagga catcatgcga ggcagtggca 1861 cccagctgtt tgaccacatt gccgaatgcc tggctaactt catggataag ctacaaatca 1921 aagacaagaa gctcccactg ggttttacct tctcgttccc ctgccaccag actaaactag 1981 acgagagttt cctggtctca tggaccaagg gattcaagtc cagtggagtg gaaggcagag 2041 acgttgtggc tctgatccgg aaggccatcc agaggagagg ggactttgat atcgacattg 2101 tggctgtggt gaatgacaca gttgggacca tgatgacctg tggttatgat gaccacaact 2161 gtgagattgg tctcattgtg ggcacgggca gcaacgcctg ctacatggaa gagatgcgcc 2221 acatcgacat ggtggaaggc gatgaggggc ggatgtgtat caatatggag tggggggcct 2281 tcggggacga tggctcgctc aacgacattc gcactgagtt tgaccaggag attgacatgg 2341 gctcactgaa cccgggaaag caactgtttg agaagatgat cagtgggatg tacatggggg 2401 agctggtgag gcttatcctg gtgaagatgg ccaaggagga gctgctcttt ggggggaagc 2461 tcagcccaga gcttctcaac accggtcgct ttgagaccaa agacatctca gacattgaag 2521 gggagaagga tggcatccgg aaggcccgtg aggtcctgat gcggttgggc ctggacccga 2581 ctcaggagga ctgcgtggcc actcaccgga tctgccagat cgtgtccaca cgctccgcca 2641 gcctgtgcgc agccaccctg gccgccgtgc tgcagcgcat caaggagaac aaaggcgagg 2701 agcggctgcg ctctactatt ggggtcgacg gttccgtcta caagaaacac ccccattttg 2761 ccaagcgtct acataagacc gtgcggcggc tggtgcccgg ctgcgatgtc cgcttcctcc 2821 gctccgagga tggcagtggc aaaggtgcag ccatggtgac agcagtggct taccggctgg 2881 ccgatcaaca ccgtgcccgc cagaagacat tagagcatct gcagctgagc catgaccagc 2941 tgctggaggt caagaggagg atgaaggtag aaatggagcg aggtctgagc aaggagactc 3001 atgccagtgc ccccgtcaag atgctgccca cctacgtgtg tgctaccccg gacggcacag 3061 agaaagggga cttcttggcc ttggaccttg gaggaacaaa tttccgggtc ctgctggtcc 3121 gtgttcggaa tgggaagtgg ggtggagtgg agatgcacaa caagatctac gccatcccgc 3181 aggaggtcat gcacggcacc ggggacgagc tctttgacca cattgtccag tgcatcgcgg 3241 acttcctcga gtacatgggc atgaagggcg tgtccctgcc tctgggtttt accttctcct 3301 tcccctgcca gcagaacagc ctggacgaga gcatcctcct caagtggaca aaaggcttca 3361 aggcatctgg ctgcgagggc gaggacgtgg tgaccctgct gaaggaagcg atccaccggc 3421 gagaggagtt tgacctggat gtggttgctg tggtgaacga cacagtcgga actatgatga 3481 cctgtggctt tgaagaccct cactgtgaag ttggcctcat tgttggcacg ggcagcaatg 3541 cctgctacat ggaggagatg cgcaacgtgg aactggtgga aggagaagag gggcggatgt 3601 gtgtgaacat ggaatggggg gccttcgggg acaatggatg cctagatgac ttccgcacag 3661 aatttgatgt ggctgtggat gagctttcac tcaaccccgg caagcagagg ttcgagaaaa 3721 tgatcagtgg aatgtacctg ggtgagattg tccgtaacat tctcatcgat ttcaccaagc 3781 gtggactact cttccgaggc cgcatctcag agcggctcaa gacaaggggc atctttgaaa 3841 ccaagttctt gtctcagatt gagagtgact gcctggccct gctgcaagtc cgagccaccc 3901 tgcaacactt agggcttgag agcacctgtg acgacagcat cattgttaag gaggtgtgca 3961 ctgtggtggc ccggcgggca gcccagctct gtggcgcagg catggccgct gtggtggaca 4021 ggatacgaga aaaccgtggg ctggacgctc tcaaagtgac agtgggtgtg gatgggaccc 4081 tctacaagct acatcctcac tttgccaaag tcatgcatga gacagtgaag gacctggctc 4141 cgaaatgtga tgtgtctttc ctgcagtcag aggatggcag cgggaagggg gcggcgctca 4201 tcactgctgt ggcctgccgc atccgtgagg ctggacagcg atagaacccc tgaaatcgga 4261 agggacttcc tctttctctc cttcttccct gttttaaatt ataagatgtc atccccttgt 4321 gtcagagaca gaccccttgg cttttgcttg gcagagagga ccccactgga ctgggttttg 4381 tctctgcatc tcattgtaga gcttggtggc tgagcttggc cctattaaga taaatagagt 4441 tccaaataag gatttgttca catgcatcat aaccattccc attggttctc ctaaaacatg 4501 aaaattatct cccttagtaa tcccccttgc caaattccat gtccctgtat aattctacag 4561 gatggggaca ctaatgaaga tacggttgct tcaccttgga gcctgaacat gacatttcta 4621 agtggggtgc atcccccagc actgatgttg ttactgattc tcctgtcaga gatctgggag 4681 gtctccactg aggatgtgag cctgattatc ctataggcag acgtggggag ggtggagggg 4741 tgacagtgga ggaaaatcca tggatatcca cgcagcagcc cctctttaac ctcatctaca 4801 agcatttgcc ctgtggattc cagcatttgc cattcctgga atcaaggaat cctgagtctg 4861 ggcaatgaaa ccaaagccag gagttgacgc atcctgcagt tgggccagct gtcgcatctc 4921 agcggggcgc acatgttatc cacaagcaat ggacctttgg ggaaggggga gtttttagtt 4981 tgttttacaa atttttcctg caaaagtgga atcactgtat tttcatttta atttatattt 5041 gaaattttat ttagttcttg agtagatctg cttcttcatc ttgacatgta atgaatggtc 5101 agttgtacgt aatgtattta tatgttaatt tgttatgtat atagatgtgc aagtcttgtc 5161 agaattggcc tcagtgtagt taaagggcag aaggggaaga tactgactag tcatagaaat 5221 acctcattcg cctgtgggaa gagaagggaa gcctcttcag ggtgagtgaa tggcaaagcg 5281 gttgcttctc cg // LOCUS HSCHL1 3755 bp RNA PRI 20-NOV-1997 DEFINITION H.sapiens mRNA for CHL1 protein. ACCESSION X99583 NID g2632246 KEYWORDS CHL1 protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3755) AUTHORS Frank,S. and Werner,S. TITLE The human homologue of the yeast CHL1 gene is a novel keratinocyte growth factor-regulated gene JOURNAL J. Biol. Chem. 271 (40), 24337-24340 (1996) MEDLINE 96394579 REFERENCE 2 (bases 1 to 3755) AUTHORS Frank,S. TITLE Direct Submission JOURNAL Submitted (19-JUL-1996) S. Frank, Max-Planck Institut fuer Biochemie, Am Klopferspitz 18a, D- 82152 Martinsried, FRG REMARK revised by [3] REFERENCE 3 (bases 1 to 3755) AUTHORS Frank,S. TITLE Direct Submission JOURNAL Submitted (19-NOV-1997) S. Frank, Max-Planck Institut fuer Biochemie, Am Klopferspitz 18a, D- 82152 Martinsried, FRG COMMENT Related sequence: Genomics 32, 260-265 (1996). FEATURES Location/Qualifiers source 1..3755 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="keratinocyte" /cell_line="HaCaT" gene 741..2783 /gene="hCHL-1/KRG-2" CDS 741..2783 /gene="hCHL-1/KRG-2" /codon_start=1 /product="CHL1 protein" /db_xref="PID:e1184874" /db_xref="PID:g2632247" /translation="MLETGPEAERLEQLESGEEELVLAEYESDEEKKVASRVDEDEDD LEEEHITKIYYCSRTHSQLAQFVHEVKKSPFGKDVRLVSLGSRQNLCVNEDVKSLGSV QLINDRCVDMQRSRHEKKKGAEEEKPKRRRQEKQAACPFYNHEQMGLLRDEALAEVKD MEQLLALGKEARACPYYGSRLAIPAAQLVVLPYQMLLHAATRQAAGIRLQDQVVIIDE AHNLIDTITGMHSVEVSGSQLCQAHSQLLQYVERYGKRLKAKNLMYLKQILYLLEKFV AVLGGNIKQNPNTQSLSQTGTELKTINDFLFQSQIDNINLFKVQRYCEKSMISRKLFG FTERYGAVFSSREQPKLAGFQQFLQSLQPRTTEALAAPADESQASTLRPASPLMHIQG FLAALTTANQDGRVILSRQGSLSQSTLKFLLLNPAVHFAQVVKECRAVVIAGGTMQPV SDFRQQLLACAGVEAERGGGVFCGHVIPPDNILPLVICSGISNQPLEFTFQKRELPQM IFQEPKSAHQVEQVLLAYSRCIQACGQERGQVTGALLLSVVGGKMSEGINFSDNLGRC VVMVGMPFPNIRSAELQEKMAYLDQTLPRAPGQAPPGKALVENLCMKAVNQSIGRAIR HQKDFASVVLLDQRYARPPVLAKLPAWIRARVEVKATFGPAIAAVQKFHREKSASS" BASE COUNT 851 a 1037 c 1114 g 753 t ORIGIN 1 gaattcggca cgagttctaa ctcagtggcg tttgccctga ttcccggggc ctggctttca 61 gcgtagcaat gctgccggcg aagaaggtga gcgcagtgct gtgtggcagc agagctcctt 121 aggacgagga gcagcgggac gaggaagggc agactggtga aatcgcaaac tgggcgtctg 181 ttccggcgcc ggacccctat ttgcaaaggt ccatggctaa tgaaacacag aaggttggtg 241 ccatccattt tccttttccc ttcacaccct attccatcca ggaagacttc atggcagagc 301 tgtaccgggt tttggaggct ggcaagagtg ggatatttga gagtccaact ggcactggga 361 agtccttaag tcttatttgt ggggccctct cttggctccg tgactttgaa cagaagaagc 421 gtgaagaaga ggcacgactc cttgaaactg gaactggccc cttacatgat gagaaagatg 481 aatccctgtg tctgtcttct tcctgcgaag gggctgcagg caccccgagg cctgctggag 541 aaccggcctg ggttactcag tttgtgcaga agaaagaaga gagggacctg gtggaccgac 601 taaaggcgga gcaggccagg aggaagcagc gagaagaacg cctgcagcag ctgcagcaca 661 gggtgcagct caagtatgca gccaagcgcc tgaggcagga agaagaagaa agagagaatc 721 tcctccgcct cagcagggag atgctagaga caggcccgga ggctgagcgg ctggagcagc 781 tggagtctgg ggaggaggag ctggtcctcg ccgaatacga gagtgatgag gagaaaaagg 841 tggcgagcag agtggatgag gatgaggatg acctggagga agaacacata actaagattt 901 attactgtag tcggacacac tcccagctgg cccagtttgt gcatgaggtg aagaagagcc 961 cctttggcaa ggatgttcgg ctggtctccc ttggctcccg gcagaacctt tgtgtaaatg 1021 aagacgtgaa aagcctaggt tctgtgcagc ttatcaacga ccgctgtgtg gacatgcaga 1081 gaagcaggca cgagaagaag aaaggagctg aggaggagaa gccaaagagg aggaggcagg 1141 agaagcaggc agcctgcccc ttctacaacc acgagcagat gggccttctc cgggatgagg 1201 ccctggcaga ggtgaaggac atggagcagc tgctggccct tgggaaggag gcccgggcct 1261 gtccctatta cgggagccgc cttgccatcc ctgcagccca gctggtggtg ctgccctatc 1321 agatgctgct gcatgcggcc actcggcagg ccgcgggcat ccggctgcag gaccaggtgg 1381 tgatcatcga cgaggcgcac aacctgatcg acaccatcac gggcatgcac agcgtggagg 1441 tcagcggctc ccagctctgc caggcccatt cccagctgct gcagtacgtg gagcgatacg 1501 ggaagcgttt gaaggccaag aacctgatgt acctgaagca gatcctgtat ttgctggaga 1561 aattcgtggc tgtgctaggg gggaacatta agcaaaatcc caatacacag agtctgtcac 1621 agacagggac ggagctgaag accatcaacg actttctctt ccagagccag atcgacaaca 1681 tcaacctgtt caaggtgcag cgatactgtg agaagagcat gatcagcaga aagctctttg 1741 gattcactga acggtacgga gcagtgttct catcccggga gcagcccaaa ctggctgggt 1801 ttcagcaatt cctgcagagc ctgcagccca ggacgactga agctcttgca gcccctgcag 1861 acgagagtca ggccagcacc ctgcgaccag cttctccact gatgcacatc caaggcttcc 1921 tggcagctct cactacggcc aaccaggacg gcagggtcat cctgagccgc caaggcagcc 1981 tcagtcagag caccctgaag tttttgctcc tgaatccagc tgtgcacttt gcccaagtgg 2041 tgaaggaatg ccgggcagtg gtcattgcgg ggggtaccat gcagccggtg tctgacttcc 2101 ggcagcagct gctggcctgt gccggggtgg aagctgagcg cggtggtgga gttttctgtg 2161 gtcacgtgat ccctccagac aacatcctgc ccctcgtcat ctgcagcggg atctccaacc 2221 agccgctgga attcacgttc cagaaaagag agctgcctca gatgatattc caggaaccta 2281 agagcgcaca ccaggtggag caggtgctgc tggcatattc caggtgcatc caggcctgtg 2341 gccaggagag aggccaggtg acaggggccc tgctcctctc tgtggttgga ggaaagatga 2401 gtgaagggat caacttctct gacaacctag gccggtgtgt ggtgatggtg ggcatgccct 2461 tccccaacat caggtctgca gagctgcagg agaagatggc ctacttggat caaaccctcc 2521 ccagagcccc cggccaggca cccccaggga aggctctggt ggagaacctg tgcatgaagg 2581 ccgtcaacca gtccataggc agggccatca ggcaccagaa ggattttgcc agcgtagtgc 2641 tcctggacca gcgatatgcc cggccccctg tcctggccaa gctgccggcc tggatccgag 2701 cccgtgtgga ggtcaaagct acctttggcc ccgccattgc tgctgtgcag aagtttcacc 2761 gggagaagtc ggcctcttcc tgatgggcaa ccacaccact gcctggcgcc gtgcccttcc 2821 tttgtcctgc ccgctggaga cagtgtttgt cgtgggcgtg gtctgcgggg atcctgttac 2881 aaaggtgaaa cccaggagga gagtgtggag tccagagtgc tgccaggacc caggcacagg 2941 cgttagctcc cgtaggagaa aatgggggaa tcctgaatga acagtgggtc ctggctgtcc 3001 ttggggcgtt ccagggcagc tcccctcctg gaatagaatc tttctttcca tcctgcatgg 3061 ctgagagcca ggcttccttc ctggtctccg caggaggctg tggcagctgt ggcatccact 3121 gtggcatctc cgtcctgccc accttcttaa gaggcgagat ggagcaggcc catctgcctc 3181 tgccctttct agccaaggtt atagctgccc tggactgctc actctctggt ctcaatttaa 3241 aatgatccat ggccacaggc tcctgcccag gggcttgtca ccttcccctc ctccttcctg 3301 agtcactcct tcagtagaag gccctgctcc ctatcctgtc ccacagccct gcctggattt 3361 gtatccttgg cttcgtgcca gttcctccaa gtctatggca cctccctccc tctcaaccac 3421 ttgagcaaac tccaagacac cttctacccc aacaccagca attatgccaa gggccgttag 3481 gctctcaaca tgactataga gaccccgtgt catcacggag acctttgttc ctgtgggaaa 3541 atatccctcc cacctgcaac agctgcccct gctgactgcg cctgtcttct ccctctgacc 3601 ccagagaaag gggctgtggt cagctgggat cttctgccac catcagggac aaacgggggc 3661 aggaggaaag tcactgatgc ccagatgttt gcatcctgca cagctatagg tccttaaata 3721 aaagtgtgct gttggttaaa aaaaaaaaaa aaaaa // LOCUS HSCHM 2115 bp RNA PRI 23-JUN-1994 DEFINITION H.sapiens mRNA for choroideremia. ACCESSION X78121 NID g460794 KEYWORDS choroideremia gene; rab geranylgeranyl transferase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2115) AUTHORS van Bokhoven,H., van den Hurk,J.A., Bogerd,L., Philippe,C., Gilgenkrantz,S., de Jong,P., Ropers,H.H. and Cremers,F.P. TITLE Cloning and characterization of the human choroideremia gene JOURNAL Hum. Mol. Genet. 3 (7), 1041-1046 (1994) MEDLINE 95072565 REFERENCE 2 (bases 1 to 2115) AUTHORS van Bokhoven,H. TITLE Direct Submission JOURNAL Submitted (11-MAR-1994) H. van Bokhoven, Human Genetics Department, University Hospital Nijmegen, PO Box 9101, 6500 HB Nijmegen, NETHERLANDS FEATURES Location/Qualifiers source 1..2115 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HER XC2 (retina)" /clone_lib="commercial fetal brain libr., adult retina library, fetal retina library" /dev_stage="fetus, adult" /tissue_type="retina, brain" /chromosome="X" /map="Xq21.2" mRNA join(<1..79,80..146,147..219,220..343,344..732,733..849, 850..970,971..1196,1197..1274,1275..1379,1380..1443, 1444..1540,1541..1639,1640..1800,1801..>1992) /gene="CHM" /evidence=experimental gene 1..1992 /gene="CHM" exon <1..79 /gene="CHM" /number=1 /evidence=experimental CDS 31..1992 /gene="CHM" /codon_start=1 /product="choroidermia, Rab geranylgeranyltransferase component A (REP-1)" /db_xref="PID:g460795" /db_xref="SWISS-PROT:P24386" /translation="MADTLPSEFDVIVIGTGLPESIIAAACSRSGRRVLHVDSRSYYG GNWASFSFSGLLSWLKEYQENSDIVSDSPVWQDQILENEEAIALSRKDKTIQHVEVFC YASQDLHEDVEEAGALQKNHALVTSANSTEAADSAFLPTEDESLSTMSCEMLTEQTPS SDPENALEVNGAEVTGEKENHCDDKTCVPSTSAEDMSENVPIAEDTTEQPKKNRIAYA QIIKEGRRFNIDLVSKLLYSRGLLIDLLIKSNVSRYAEFKNITRILAFREGRVEQVPC SRADVFNSKQLTMVEKRMLMKFLTFCMEYEKYPDEYKGYEEITFYEYLKTQKLTPNLQ YIVMHSIAMTSETASSTIDGLKATKNFLHCLGRYGNTPFLFPLYGQGELPQCFCRMCA VFGGIYCLRHSVQCLVVDKESRKCKAIIDQFGQRIISEHFLVEDSYFPENMCSRVQYR QISRAVLITDRSVLKTDSDQQISILTVPAEEPGTFAVRVIELCSSTMTCMKGTYLVHL TCTSSKTAREDLESVVQKLFVPYTEMEIENEQVEKPRILWALYFNMRDSSDISRSCYN DLPSNVYVCSGPDCGLGNDNAVKQAETLFQEICPNEDFCPPPPNPEDIILDGDSLQPE ASESSAIPEANSETFKESTNLGNLEESSE" exon 80..146 /gene="CHM" /number=2 /evidence=experimental exon 147..219 /gene="CHM" /number=3 /evidence=experimental exon 220..343 /gene="CHM" /number=4 /evidence=experimental exon 344..732 /gene="CHM" /number=5 /evidence=experimental exon 733..849 /gene="CHM" /number=6 /evidence=experimental exon 850..970 /gene="CHM" /number=7 /evidence=experimental exon 971..1196 /gene="CHM" /number=8 /evidence=experimental exon 1197..1274 /gene="CHM" /number=9 /evidence=experimental exon 1275..1379 /gene="CHM" /number=10 /evidence=experimental exon 1380..1443 /gene="CHM" /number=11 /evidence=experimental exon 1444..1540 /gene="CHM" /number=12 /evidence=experimental exon 1541..1639 /gene="CHM" /number=13 /evidence=experimental exon 1640..1800 /gene="CHM" /number=14 /evidence=experimental exon 1801..>1992 /gene="CHM" /number=15 /evidence=experimental BASE COUNT 683 a 401 c 463 g 568 t ORIGIN 1 taatagtcac atgacacgtt tcccgtcaag atggcggata ctctcccttc ggagtttgat 61 gtgatcgtaa tagggacggg tttgcctgaa tccatcattg cagctgcatg ttcaagaagt 121 ggccggagag ttctgcatgt tgattcaaga agctactatg gaggaaactg ggccagtttt 181 agcttttcag gactattgtc ctggctaaag gaataccagg aaaacagtga cattgtaagt 241 gacagtccag tgtggcaaga ccagatcctt gaaaatgaag aagccattgc tcttagcagg 301 aaggacaaaa ctattcaaca tgtggaagta ttttgttatg ccagtcagga tttgcatgaa 361 gatgtcgaag aagctggtgc actgcagaaa aatcatgctc ttgtgacatc tgcaaactcc 421 acagaagctg cagattctgc cttcctgcct acggaggatg agtcattaag cactatgagc 481 tgtgaaatgc tcacagaaca aactccaagc agcgatccag agaatgcgct agaagtaaat 541 ggtgctgaag tgacagggga aaaagaaaac cattgtgatg ataaaacttg tgtgccatca 601 acttcagcag aagacatgag tgaaaatgtg cctatagcag aagataccac agagcaacca 661 aagaaaaaca gaattgctta cgcacaaatt attaaagaag gcaggagatt taatattgat 721 ttagtatcaa agctgctgta ttctcgagga ttactaattg atcttctaat caaatctaat 781 gttagtcgat atgcagagtt taaaaatatt accaggattc ttgcatttcg agaaggacga 841 gtggaacagg ttccgtgttc cagagcagat gtctttaata gcaaacaact tactatggta 901 gaaaagcgaa tgctaatgaa atttcttaca ttttgtatgg aatatgagaa atatcctgat 961 gaatataaag gatatgaaga gatcacattt tatgaatatt taaagactca aaaattaacc 1021 cccaacctcc aatatattgt catgcattca attgcaatga catcagagac agccagcagc 1081 accatagatg gtctcaaagc taccaaaaac tttcttcact gtcttgggcg gtatggcaac 1141 actccatttt tgtttccttt atatggccaa ggagaactcc cccagtgttt ctgcaggatg 1201 tgtgctgtgt ttggtggaat ttattgtctt cgccattcag tacagtgcct tgtagtggac 1261 aaagaatcca gaaaatgtaa agcaattata gatcagtttg gtcagagaat aatctctgag 1321 catttcctcg tggaggacag ttactttcct gagaacatgt gctcacgtgt gcaatacagg 1381 cagatctcca gggcagtgct gattacagat agatctgtcc taaaaacaga ttcagatcaa 1441 cagatttcca ttttgacagt gccagcagag gaaccaggaa cttttgctgt tcgggtcatt 1501 gagttatgtt cttcaacgat gacatgcatg aaaggcacct atttggttca tttgacttgc 1561 acatcttcta aaacagcaag agaagattta gaatcagttg tgcagaaatt gtttgttcca 1621 tatactgaaa tggagataga aaatgaacaa gtagaaaagc caagaattct gtgggctctt 1681 tacttcaata tgagagattc gtcagacatc agcaggagct gttataatga tttaccatcc 1741 aacgtttatg tctgctctgg cccagattgt ggtttaggaa atgataatgc agtcaaacag 1801 gctgaaacac ttttccagga aatctgcccc aatgaagatt tctgtccccc tccaccaaat 1861 cctgaagaca ttatccttga tggagacagt ttacagccag aggcttcaga atccagtgcc 1921 ataccagagg ctaactcgga gactttcaag gaaagcacaa accttggaaa cctagaggag 1981 tcctctgaat aatggatata caccaaactg gatacccaac tttggaaatt ctgactggtc 2041 tcagagtcta cttgatagaa ggactgtttg agaaatgtta gaaagcagca gcaattataa 2101 ggcaaaatag gtaat // LOCUS HSCHRX 1872 bp RNA PRI 05-AUG-1992 DEFINITION H.sapiens DNA for ORF1 and ORF2 from chromosome X. ACCESSION X65724 NID g29946 KEYWORDS X chromosome. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1872) AUTHORS Berger,W., Meindl,A., van de Pol,T.J., Cremers,F.P., Ropers,H.H., Doerner,C., Monaco,A., Bergen,A.A., Lebo,R., Warburg,M. et,al. TITLE Isolation of a candidate gene for Norrie disease by positional cloning JOURNAL Nature Genet. 1 (3), 199-203 (1992) MEDLINE 93265103 REMARK Erratum:[Nat Genet 1992 Sep;2(1):84]] REFERENCE 2 (bases 1 to 1872) AUTHORS Berger,W. TITLE Direct Submission JOURNAL Submitted (08-APR-1992) W. Berger, University Hospital Nijmegen, Dept of Human Genetics, Geert Grooteplein 20, P.O.Box 9101, 6500 HB Nijmegen, THE NETHERLANDS FEATURES Location/Qualifiers source 1..1872 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="adult and fetal retina libr." /chromosome="X" /map="p11.4" CDS 417..818 /note="ORF1" /codon_start=1 /db_xref="PID:g29947" /db_xref="SWISS-PROT:Q00604" /translation="MRKHVLAASFSMLSLLVIMGDTDSKTDSSFIMDSDPRRCMRHHY VDSISHPLYKCSSKMVLLARCEGHCSQASRSEPLVSFSTVLKQPFRSSCHCCRPQTSK LKALRLRCSGGMRLTATYRYILSCHCEECNS" CDS 727..1200 /note="ORF2; Author-given protein sequence is in conflict with the conceptual translation" /codon_start=1 /db_xref="PID:e47283" /db_xref="PID:g1335017" /translation="RHCGCDAQGACDSLPPTGTSSPVTARNAIPEARCCVWLLDGTTV EAVRPARERLARKELRQKRMQQFSRDSAYSSNKDSTCLLTERDTLGTSLQFPSPFSGT ISFGSFSDSGIFPLGSQCCLGFQQFSISGKKWALIHKRVRLSVFGARWGRIYFGK" BASE COUNT 521 a 425 c 409 g 517 t ORIGIN 1 ggcacaagcc tctctctctc tccctctctc tctccctctc tctctctccc tgtgtcgctt 61 aaacaacagt cctaactttt gtgtgttgca aatataaaag gcaagccatg tgacagaggg 121 acagaagaac aaaagcattt ggaagtaaca ggacctcttt ctagctctca gaaaagtctg 181 agaagaaagg agccctgcgt tcccctaagc tgtgcagcag atactgtgat gatggattgc 241 aagtgcaaag agtaagacaa aactccagca cataaaggac aatgacaacc agaaagcttc 301 agcccgatcc tgccctttcc ttgaacggga ctggatccta ggaggtgaag ccatttccaa 361 ttttttgtcc tctgcctccc tctgctgttc ttctagagaa gtttttcctt acaacaatga 421 gaaaacatgt actagctgca tccttttcta tgctctccct gctggtgata atgggggata 481 cagacagtaa aacggacagc tcattcataa tggactcgga ccctcgacgc tgcatgaggc 541 accactatgt ggattctatc agtcacccat tgtacaagtg tagctcaaag atggtgctcc 601 tggccaggtg cgaggggcac tgcagccagg cgtcacgctc cgagcctttg gtgtcgttca 661 gcactgtcct caagcaaccc ttccgttcct cctgtcactg ctgccggccc cagacttcca 721 agctgaaggc actgcggctg cgatgctcag ggggcatgcg actcactgcc acctaccggt 781 acatcctctc ctgtcactgc gaggaatgca attcctgagg cccgctgctg tgtgtggctt 841 ctggatggga caactgtaga ggcagttcga ccagccaggg aaagactggc aagaaaagag 901 ttaaggcaaa aaaggatgca acaattctcc cgggactctg catattctag taataaagac 961 tctacatgct tgttgacaga gagagatact ctgggaactt ctttgcagtt cccatctcct 1021 ttctctggta caatttcttt tggttcattt tcagattcag gcattttccc ccttggctct 1081 caatgctgtt tgggtttcca acaattcagc attagtggga aaaagtgggc cctcatacac 1141 aagcgtgtca ggctgtcagt gtttggtgca cgctggggaa gaatttactt tggaaagtag 1201 aaaagcccag cttttcctgg gacatcttct gttattgttg atgttttttt ttaccttgtc 1261 attttggtct aaggttgcca ttgctgctaa aggttaccga tttcaaagtc cagataccaa 1321 gcatgtggat atgtttagct acgtttactc acagccagcg aaactgacat taaaataact 1381 aacaaacaga ttcttttatg tgatgctgga actcttgaca gctataatta ttattcagaa 1441 atgacttttt gaaagtaaaa gcagcataaa gaatttgtca caggaaggct gtctcagata 1501 aattatggta aaattttgta agggagcaga cttttaaaga cttgcacaaa tacggatcct 1561 gcactgactc tggaaaaggc atatatgtac tagtggcatg gagaatgcac catactcatg 1621 catgcaaatt agacaaccaa gtatgaatct atttgtgggt gtgctatagc tttagccgtg 1681 tcacgggcat cattctctaa tatccacttg tccatgtgaa acatgttgcc aaaatggtcc 1741 tggcttgtct tctgaacgtt tgggtcaaat gtgttttggt cctggaggct caaattttga 1801 gttattccca cgttttgaaa taaaaagagt atattcaaaa aaaaaaaaaa aaaaaaaaaa 1861 aaaaaaaaaa aa // LOCUS HSCHTOG 6449 bp RNA PRI 17-FEB-1997 DEFINITION H.sapiens mRNA for ch-TOG protein. ACCESSION X92474 NID g1045056 KEYWORDS ch-TOG protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6449) AUTHORS Charrasse,S., Mazel,M., Taviaux,S., Berta,P., Chow,T. and Larroque,C. TITLE Characterization of the cDNA and pattern of expression of a new gene over-expressed in human hepatomas and colonic tumors JOURNAL Eur. J. Biochem. 234 (2), 406-413 (1995) MEDLINE 96128167 REFERENCE 2 (bases 1 to 6449) AUTHORS Larroque,C. TITLE Direct Submission JOURNAL Submitted (11-OCT-1995) C. Larroque, inserm, unite 128, cnrs, route de mende, bp 5051, 34033 montpellier, FRANCE REFERENCE 3 (bases 1 to 6449) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., kawarabayashi,Y., Sato,S., Nagase,T., Seki,T., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. III. The coding sequences of 40 new genes (KIAA 0081 - KIAA 0120) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG1 JOURNAL Unpublished FEATURES Location/Qualifiers source 1..6449 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda gt11 human brain library, extended with lambda gt10 human breast cancer cell library" CDS 27..5945 /note="ch-TOG" /codon_start=1 /db_xref="PID:g1045057" /translation="MGDDSEWLKLPVDQKCEHKLWKARLSGYEEALKIFQKIKDEKSP EWSKFLGLIKKFVTDSNAVVQLKGLEAALVYVENAHVAGKTTGEVVSGVVSKVFNQPK AKAKELGIEICLMYIEIEKGEAVQEELLKGLDNKNPKIIVACIETLRKALSEFGSKII LLKPIIKVLPKLFESREKAVRDEAKLIAVEIYRWIRDALRPPLQNINSVQLKELEEEW VKLPTSAPRPTRFLRSQQELEAKLEQQQSAGGDAEGGGDDGDEVPQIDAYELLEAVEI LSKLPKDFYDKIEAKKWQERKEALESVEVLIKNPKLEAGDYADLVKALKKVVGKDTNV MLVALAAKCLTGLAVGLRKKFGQYAGHVVPTILEKFKEKKPQVVQALQEAIDAIFLTT TLQNISEDVLAVMDNKNPTIKQQTSLFIARSFRHCTASTLPKSLLKPFCAALLKHIND SAPEVRDAAFEALGTALKVVGEKAVKPFLADVDKLKLDKIKECSEKVELIHGKKAGLA ADKKEFKPLPGRTAASGAAGDKDTKDISAPKPGPLKKAPAAKAGGPPKKGKPAAPGGA GNTGTKNKKGLETKEIVEPELSIEVCEEKASAVLPPTCIQLLDSSNWKERLACMEEFQ KAVELMDRTEMPCQALVRMLAKKPGWKETNFQVMQMKLHIVALIAQKGNFSKTSAQVV LDGLVDKIGDVKCGNNAKEAMTAIAEACMLPWTAEQVVSMAFSQKNPKNQSETLNWLS NAIKEFGFSGLNVKAFISNVKTALAATNPAVRTAAITLLGVMYLYVGPSLRMFFEDEK PALLSQIDAEFEKMQGQSPPAPTRGISKHSTSGTDEGEDGDEPDDGSNDVVDLLPRTE ISDKITSELVSKIGDKNWKIRKEGLDEVAGIINDAKFIQPNIGELPTALKGRLNDSNK ILVQQTLNILQQLAVAMGPNIKQHVKNLGIPIITVLGDSKNNVRAAALATVNAWAEQT GMKEWLEGEDLSEELKKENPFLRQELLGWLAEKLPTLRSTPTDLILCVPHLYSCLEDR NGDVRKKAQDALPFFMMHLGYEKMAKATGKLKPTSKDQVLAMLEKAKVNMPAKPAPPT KATSKPMGGSAPAKFQPASAPAEDCISSSTEPKPDPKKAKAPGLSSKAKSAQGKKMPS KTSLKEDEDKSGPIFIVVPNGKEQRMKDEKGLKVLKWNFTTPRDEYIEQLKTQMSSCV AKWLQDEMFHSDFQHHNKALAVMVDHLESEKEGVIGCLDLILKWLTLRFFDTNTSVLM KALEYLKLLFTLLSEEEYHLTENEASSFIPYLVVKVGEPKDVIRKDVRAILNRMCLVY PASKMFPFIMEGTKSKNSKQRAECLEELGCLVESYGMNVCQPTPGKALKEIAVHIGDR DNAVRNAALNTIVTVYNVHGDQVFKLIGNLSEKDMSMLEERIKRSAKRPSAAPIKQVE EKPQRAQNISSNANMLRKGPAEDMSSKLNQARSMSGHPEAAQMVRREFQLDLDEIEND NGTVRCEMPELVQHKLDDIFEPVLIPEPKIRAVSPHFDDMHSNTASTINFIISQVASG DINTSIQALTQLFQIESLAREASTGVLKDLMHGLITLMLDSRIEDLEEGQQVIRSVNL LVVKVLEKSDQTNILSALLVLLQDSLLATASSPKFSELVMKCLWRMVRLLPDTINSIN LDRILLDIHIFMKVFPKEKLKQCKSEFPIRTLKTLLHTLCKLKGPKILDHLTMIDNKN ESELEAHLCRMMKHSMDQTGSKSDKETAKGASRIDAKSSKAKVNDFLAEIFKKIGSKE NTKEGLAELYEYKKKYSDADIEPFLKNSSQFFQSYVERGLRVIEMEREGKGRISTSTG ISPQMEVTCVPTPTSTVSSIGNTNGEEVGPSVYLERLKILRQRCGLDNTKQDDRPPLT SLLSKPAVPTVASSTDMLHSKLSQLRESREQHQHSDLDSNQTHSSGTVTSSSSTANID DLKKRLERIKSSRK" conflict 1802 /citation=[3] /replace="t" conflict 5287 /citation=[3] /replace="a" conflict 5311 /citation=[3] /replace="a" conflict 6078 /citation=[3] /replace="a" conflict 6216 /citation=[3] /replace="a" conflict 6229 /citation=[3] /replace="" conflict 6231..6234 /citation=[3] /replace="" conflict 6370 /citation=[3] /replace="g" polyA_signal 6405..6411 BASE COUNT 2048 a 1354 c 1492 g 1555 t ORIGIN 1 aattctaagg aaaacctgga agcacaatgg gagatgacag tgagtggttg aaactgccag 61 ttgatcagaa atgtgaacac aagctgtgga aagcaaggtt aagtgggtat gaagaggccc 121 tgaagatctt ccagaaaata aaggatgaaa agagcccaga gtggtccaaa tttttaggat 181 tgatcaaaaa atttgtcact gattccaatg cagtggttca attgaaagga ttagaagctg 241 cacttgttta tgttgaaaat gcccatgtag caggaaaaac cacaggagaa gttgtgtcag 301 gtgttgtaag taaggtgttc aatcaaccta aagctaaagc caaggagctg ggcatagaga 361 tctgtcttat gtacatagag attgagaaag gagaggctgt tcaagaagag ctcctgaaag 421 gcttggacaa taagaatccc aagatcatag tggcctgtat agagacactg aggaaagcct 481 taagtgaatt tggttccaaa atcatcttgc ttaagccaat tatcaaagtg ttgccaaaac 541 tctttgagtc tcgagagaag gctgttcgag atgaagccaa actaattgct gtggagattt 601 acagatggat tcgggatgct ctgagacccc cattacaaaa tataaactct gttcagttga 661 aagaactaga agaagaatgg gtcaaactgc caacaagtgc tcctagacct actcgatttc 721 ttcgttccca acaagaacta gaagctaaat tggaacaaca acagtctgct ggtggagatg 781 ctgaaggagg tggtgatgat ggtgatgagg tgccacaaat agatgcttat gagcttttag 841 aagctgtaga aatcctttcc aaacttccca aagactttta tgacaaaatt gaggcaaaaa 901 aatggcaaga gagaaaagag gccctggagt ctgtagaagt actaataaaa aaccccaaac 961 tggaagctgg cgattatgca gatttagtaa aagcattaaa gaaggttgtt ggaaaggaca 1021 ccaatgtcat gttggtggct ttggcagcaa aatgtcttac tggcctggct gttgggctaa 1081 ggaagaaatt tggacaatat gcaggacatg ttgtgccaac catcttggag aaattcaaag 1141 agaagaaacc tcaagtggta caagccctgc aggaggcaat tgatgcaatc ttccttacta 1201 ccacactaca gaacatcagt gaggatgttt tagcagtaat ggataataaa aatccaacca 1261 tcaagcagca gacatctctt tttattgcaa gaagtttccg ccactgcact gcttctaccc 1321 tgccaaagag cttgctaaag cccttttgtg ctgcactact taagcacatc aatgattctg 1381 ctcctgaagt cagagatgcc gcatttgaag cattgggtac tgctttgaag gtggttggcg 1441 agaaagcagt aaaaccattc ctagctgatg tggacaaact caagcttgat aagatcaaag 1501 aatgttcaga aaaggtagaa ctgatacatg gtaagaaagc tggactagct gctgataaga 1561 aggaattcaa acctctgcct ggaaggactg ctgcttcagg ggctgcagga gataaggaca 1621 caaaggacat ttctgcaccc aaaccaggac ctctaaaaaa ggcacctgct gctaaggctg 1681 gtgggccacc aaaaaagggg aaaccagctg caccaggagg cgcagggaat actggaacca 1741 agaacaagaa aggactggag actaaagaaa tagtggagcc tgagctctcg atagaagtat 1801 gcgaagaaaa agcttcagct gttcttcccc ctacctgtat acagcttctt gacagcagta 1861 actggaaaga aaggctggct tgtatggaag agttccagaa ggctgttgag ctaatggacc 1921 gaactgaaat gccatgccag gcattagtga ggatgctagc caagaaacct ggatggaaag 1981 aaactaattt tcaggtgatg caaatgaagc ttcatatagt tgctttgatt gcccagaagg 2041 gaaatttttc caaaacgtca gctcaggttg tattagatgg ccttgtggac aagattggag 2101 atgtgaaatg tgggaacaat gcaaaagaag ctatgacagc aatagccgaa gcctgtatgt 2161 taccatggac tgctgaacag gttgtgtcaa tggctttctc acaaaagaat cccaaaaatc 2221 agtcagaaac tctgaattgg ctatcaaatg ccataaaaga atttggtttt tctgggttga 2281 atgtcaaagc tttcattagc aatgtgaaga cagctcttgc tgcaacaaac ccagctgtga 2341 ggactgctgc cataaccctg cttggcgtga tgtatctgta tgttggtccc tctttgcgaa 2401 tgttctttga ggatgagaag cctgccctcc tatcccagat agatgcagaa tttgagaaga 2461 tgcagggaca aagtccacct gctccaacca gaggaatttc caagcatagc acaagtggta 2521 cagatgaagg agaagatgga gatgaaccag atgacgggag caatgatgtc gttgatcttt 2581 tgccgaggac ggagatcagt gataaaatca cttcagagtt ggtatctaag attggtgata 2641 agaattggaa gattaggaaa gaaggcctag atgaagtggc aggtattatt aatgacgcaa 2701 aatttatcca accgaatata ggtgaacttc caactgcctt gaagggtcga ctcaatgatt 2761 caaataaaat cttggtacag caaacgctga atatcctgca acaactggca gtagccatgg 2821 gcccaaatat taagcaacat gtaaaaaatt taggcatccc tatcatcaca gtccttggag 2881 acagcaagaa caatgttcga gctgctgccc tagcgactgt gaatgcttgg gcagaacaga 2941 ctggcatgaa ggaatggctg gaaggagaag atctttctga agagctcaaa aaggaaaatc 3001 ctttcttgag gcaagagctt ctgggctggc tggctgagaa actacctact cttcgttcca 3061 cccctacaga ccttatcctt tgtgttcctc atctctactc ctgcctagaa gatcgaaatg 3121 gagatgtgcg aaagaaggcc caagatgcct tgccattctt catgatgcat ttaggatatg 3181 aaaaaatggc caaggctact gggaaactaa agccaacttc taaagatcag gtattggcca 3241 tgctagagaa agccaaagtt aacatgccag ccaagcctgc tccacccact aaagcaactt 3301 ctaaaccaat gggagggtcc gctccagcca aattccagcc tgcatcagca cctgctgaag 3361 attgtatttc cagcagtaca gaacccaaac ctgatccaaa aaaggccaaa gctccaggat 3421 tatcctctaa agcaaagagt gcacaaggga agaagatgcc aagcaaaacc agcttaaagg 3481 aggatgaaga caaatccggg cctattttta ttgttgttcc aaatggaaaa gagcaaagga 3541 tgaaagatga aaaaggattg aaggtgctaa agtggaattt tactacccca cgggatgaat 3601 acattgagca actaaagact caaatgtcta gctgtgtggc taaatggtta caagatgaga 3661 tgtttcactc agactttcag catcataaca aagcccttgc tgttatggtt gatcacttgg 3721 agagtgaaaa agaaggagtt attggttgcc tggatcttat cttaaagtgg cttaccctga 3781 ggttttttga caccaataca agcgtcctga tgaaagcact agaatattta aaattgctct 3841 tcaccttgct aagtgaagaa gaatatcatc ttactgagaa tgaagcatct tccttcatcc 3901 cctatcttgt cgtcaaggtt ggagaaccaa aggatgtcat tcgtaaagat gttcgtgcca 3961 tcctgaaccg gatgtgcctt gtctacccag ctagcaagat gtttcccttt atcatggaag 4021 gaaccaaatc caaaaactct aagcagagag cagagtgcct ggaagagctg ggatgtctgg 4081 ttgagtccta tggcatgaat gtttgccaac caaccccagg aaaagcctta aaggaaatag 4141 ctgttcacat aggagaccgt gacaatgctg tacgcaatgc tgcactcaac accattgtaa 4201 cggtgtacaa tgtacatggg gatcaggtgt tcaaactgat tggaaatctt tctgaaaagg 4261 atatgagcat gctcgaggag aggattaagc ggtcagcaaa gagaccctct gctgcaccaa 4321 taaaacaggt ggaagagaaa cctcagcgtg cacagaacat aagctccaat gccaacatgt 4381 tacgcaaggg accagctgag gacatgtctt ccaaactcaa ccaagcccga agcatgagtg 4441 ggcatcctga ggcagcccag atggtccgcc gagaattcca gctggatcta gatgagattg 4501 agaatgacaa tggtacagtc cgatgtgaaa tgccagaact tgttcagcac aaactggatg 4561 acatttttga gccagtcctt attcctgaac ccaagatccg ggctgtttct ccacacttcg 4621 atgacatgca cagtaataca gcatccacaa tcaatttcat tatctcccaa gtagccagtg 4681 gtgacatcaa cacaagtatc caagctctga cacagctgtt tcagatagag agccttgccc 4741 gggaggcctc cactggagta ctaaaagacc taatgcatgg cctcatcacc ttaatgctgg 4801 attctcggat tgaagatctt gaggaaggac aacaggtcat ccgctctgtg aacctcttgg 4861 tggtgaaggt tctggagaag tcagaccaga ccaacatcct gagtgcccta cttgttttgc 4921 tccaagacag cctgctagca acagccagtt ctcccaaatt ctcagagctt gttatgaagt 4981 gtctctggag aatggttcga ctgttgcctg ataccatcaa tagcattaac ctagacagaa 5041 ttcttctgga tatccacatt ttcatgaagg tcttccccaa agagaaactg aagcaatgca 5101 aaagtgaatt tcccataagg accctaaaga ccctgctaca caccttatgc aaattaaaag 5161 ggcccaagat cctggaccac ctaacgatga tcgacaacaa aaacgagtct gagctggagg 5221 cccatctctg ccggatgatg aagcacagta tggaccagac tgggagcaag tctgataagg 5281 aaacagcaaa gggagcatct cgaatagatg caaaatcatc aaaggccaaa gtgaatgatt 5341 tcttagctga gatttttaag aagattggct ctaaagaaaa cactaaagag ggactagcag 5401 agttatatga atataagaag aaatactcag atgctgacat tgaaccattt ctgaaaaatt 5461 cctcacagtt cttccagagc tatgtcgaaa gaggccttcg ggtgattgag atggagaggg 5521 agggcaaagg tcgtatttcc acttcaacag gcatctcccc tcagatggaa gtcacatgtg 5581 tgcccacgcc cacaagcaca gtgtcctcca taggtaacac aaatggggaa gaagtggggc 5641 catctgtcta cttggaaagg ctaaagatcc tccgacagcg atgtggtctg gacaacacaa 5701 agcaagatga ccgacctcct ttgacctctt tgctctccaa accagcagtt cctactgtcg 5761 cctcttccac agacatgctc cacagcaaac tctctcagct ccgggagtca cgggagcagc 5821 accagcattc agacctggat tctaaccaga ctcactcttc aggaactgtg acctcctcct 5881 cctccacagc taacatagac gacttgaaaa aaagactgga gagaataaag agcagtcgca 5941 aatgaagctg ccccactccc ccggcaccct gcagctttag tttactaaac tagaagtcct 6001 catagtttaa aatggcctca gcaggcctag tgtatacaaa ctggttgtat gtatcatgcc 6061 gtggagctag ggggaggggt cattgtggca caagtatttg tacatactct gcttctctct 6121 gtcagcgtcc tgctgctcta gaagactgtc cgtggatgag tttagtgtac agacttgtaa 6181 acagctgccc cctctctgct cagtctagtt cccaggtcct tttcttttct ttttaattgc 6241 tcatttgtaa aattgtccta atctttccta gctttttaat agttaatatt agaaactctt 6301 taatagtttt cctttcagtt tgtgagctct tctctgtcgc cctgaagggt cactgtattc 6361 tgtatgaatc catggcatga tacaactaat ttaagagtct tttataaata aagtttgcat 6421 taactaaaaa aaaaaaaaaa aaaaaaaaa // LOCUS HSCHYPRO 1184 bp RNA PRI 01-JAN-1994 DEFINITION H.sapiens mRNA for chymotrypsin-like protease CTRL-1. ACCESSION X71877 NID g438038 KEYWORDS chymotrypsin-like; hydrolase; serine protease. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1184) AUTHORS Larsen,F. TITLE Direct Submission JOURNAL Submitted (10-MAY-1993) F. Larsen, Biotechnology Centre of Oslo, University of Oslo, PO Box 1125 Blindern, N0317 Oslo, NORWAY REFERENCE 2 (bases 1 to 1184) AUTHORS Larsen,F., Soliheim,J., reseland,J., Thorsen,L., Eriksen,J.A. and Prydz,H. TITLE Molecular cloning and immunological detection of a novel Chymotrypsin-like pancreatic protease JOURNAL Unpublished FEATURES Location/Qualifiers source 1..1184 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="pancreas" /clone_lib="Clontech cDNA lambd_gt11 HL11630" /clone="C1 to C25" /map="16q22.1" CDS 11..805 /codon_start=1 /product="chymotrypsin-like protease CTRL-1" /db_xref="PID:g438039" /db_xref="SWISS-PROT:P40313" /translation="MLLLSLTLSLVLLGSSWGCGIPAIKPALSFSQRIVNGENAVLGS WPWQVSLQDSSGFHFCGGSLISQSWVVTAAHCNVSPGRHFVVLGEYDRSSNAEPLQVL SVSRAITHPSWNSTTMNNDVTLLKLASPAQYTTRISPVCLASSNEALTEGLTCVTTGW GRLSGVGNVTPAHLQQVALPLVTVNQCRQYWGSSITDSMICAGGAGASSCQGDSGGPL VCQKGNTWVLIGIVSWGTKNCNVRAPAVYTRVSKFSTWINQVIAYN" sig_peptide 11..64 misc_feature 65..109 /note="activation peptide" mat_peptide 110..805 /product="chymotrypsin-like protease CTRL-1" polyA_signal 836..841 /note="A, primary" polyA_site 856 /note="A3" polyA_site 862 /note="A1, primary" polyA_site 866 /note="A2, primary" polyA_signal 1108..1113 /note="B" polyA_site 1130 /note="B" polyA_signal 1143..1148 /note="C" polyA_site 1169 /note="C" BASE COUNT 258 a 361 c 307 g 258 t ORIGIN 1 atctgccacg atgttgctgc tcagcctgac cctaagcctg gttctcctcg gctcctcctg 61 gggctgcggc attcctgcca tcaaaccggc actgagcttc agccagagga ttgtcaacgg 121 ggagaatgca gtgttgggct cctggccctg gcaggtgtcc ctgcaggaca gcagcggctt 181 ccacttctgc ggtggttctc tcatcagcca gtcctgggtg gtcactgctg cccactgcaa 241 tgtcagccct ggccgccatt ttgttgtcct gggcgagtat gaccgatcat caaacgcaga 301 gcccttgcag gttctgtccg tctctcgggc cattacacac cctagctgga actctaccac 361 catgaacaat gacgtgacgc tgctgaagct cgcctcgcca gcccagtaca caacacgcat 421 ctcgccagtt tgcctggcat cctcaaacga ggctctgact gaaggcctca cgtgtgtcac 481 caccggctgg ggtcgcctca gtggcgtggg caatgtgaca ccagcacatc tgcagcaggt 541 ggctttgccc ctggtcactg tgaatcagtg ccggcagtac tggggctcaa gtatcactga 601 ctccatgatc tgtgcaggtg gcgcaggtgc ctcctcgtgc cagggtgact ccggaggccc 661 tcttgtctgc cagaagggaa acacatgggt gcttattggt attgtctcct ggggcaccaa 721 aaactgcaat gtgcgcgcac ctgctgtgta tactcgagtt agcaagttca gcacctggat 781 caaccaggtc atagcctaca actgagctca ccacaggccc tccccagctc aacccattaa 841 agacccaggc cctgtcccat catgcattca tgtctgtctt cctggctcag gagaaagaag 901 aggctgttga gggtccgact ccctacttgg acttctggca cagaaggggc tgagtgactc 961 cttgagtagc agtggctctt cctagagtag ccatgccgag gccggggccc ccacccctcc 1021 tccagggcaa ccccttggtc ctacagcaag aagccagaac tgttggaatg aatggcagcc 1081 ctccctggag aggcagcctg tttactgaat acagaggata cgtttacaaa ctgaatacgc 1141 ataataaata actgcacatt ctccatccaa aaaaaaaaaa aaaa // LOCUS HSCIITA 4543 bp RNA PRI 04-JUN-1995 DEFINITION H.sapiens mRNA for MHC class II transactivator. ACCESSION X74301 NID g414112 KEYWORDS CIITA gene; MHC class II transactivator. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4543) AUTHORS Steimle,V. TITLE Direct Submission JOURNAL Submitted (27-JUL-1993) V. Steimle, University of Geneva, Dept of Genetics & Microbiology, Centre Medical Universitaire, 9, avenue de Champel, 1211 Geneva 4, SWITZERLAND REFERENCE 2 (bases 1 to 4543) AUTHORS Steimle,V., Otten,L.A., Zufferey,M. and Mach,B. TITLE Complementation cloning of an MHC class II transactivator mutated in hereditary MHC class II deficiency (or bare lymphocyte syndrome) JOURNAL Cell 75 (1), 135-146 (1993) MEDLINE 94006536 FEATURES Location/Qualifiers source 1..4543 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Raji" /cell_type="Burkitt lymphoma" /clone_lib="Raji cDNA library in expression vector DV, pool 10" /clone="pDCP10-1" gene 116..3508 /gene="CIITA" CDS 116..3508 /gene="CIITA" /codon_start=1 /product="(MHC) class II transactivator" /db_xref="PID:g414113" /db_xref="SWISS-PROT:P33076" /translation="MRCLAPRPAGSYLSEPQGSSQCATMELGPLEGGYLELLNSDADP LCLYHFYDQMDLAGEEEIELYSEPDTDTINCDQFSRLLCDMEGDEETREAYANIAELD QYVFQDSQLEGLSKDIFKHIGPDEVIGESMEMPAEVGQKSQKRPFPEELPADLKHWKP AEPPTVVTGSLLVGPVSDCSTLPCLPLPALFNQEPASGQMRLEKTDQIPMPFSSSSLS CLNLPEGPIQFVPTISTLPHGLWQISEAGTGVSSIFIYHGEVPQASQVPPPSGFTVHG LPTSPDRPGSTSPFAPSATDLPSMPEPALTSRANMTEHKTSPTQCPAAGEVSNKLPKW PEPVEQFYRSLQDTYGAEPAGPDGILVEVDLVQARLERSSSKSLERELATPDWAERQL AQGGLAEVLLAAKEHRRPRETRVIAVLGKAGQGKSYWAGAVSRAWACGRLPQYDFVFS VPCHCLNRPGDAYGLQDLLFSLGPQPLVAADEVFSHILKRPDRVLLILDAFEELEAQD GFLHSTCGPAPAEPCSLRGLLAGLFQKKLLRGCTLLLTARPRGRLVQSLSKADALFEL SGFSMEQAQAYVMRYFESSGMTEHQDRALTLLRDRPLLLSHSHSPTLCRAVCQLSEAL LELGEDAKLPSTLTGLYVGLLGRAALDSPPGALAELAKLAWELGRRHQSTLQEDQFPS ADVRTWAMAKGLVQHPPRAAESELAFPSFLLQCFLGALWLALSGEIKDKELPQYLALT PRKKRPYDNWLEGVPRFLAGLIFQPPARCLGALLGPSAAASVDRKQKVLARYLKRLQP GTLRARQLLELLHCAHEAEEAGIWQHVVQELPGRLSFLGTRLTPPDAHVLGKALEAAG QDFSLDLRSTGICPSGLGSLVGLSCVTRFRAALSDTVALWESLRQHGETKLLQAAEEK FTIEPFKAKSLKDVEDLGKLVQTQRTRSSSEDTAGELPAVRDLKKLEFALGPVSGPQA FPKLVRILTAFSSLQHLDLDALSENKIGDEGVSQLSATFPQLKSLETLNLSQNNITDL GAYKLAEALPSLAASLLRLSLYNNCICDVGAESLARVLPDMVSLRVMDVQYNKFTAAG AQQLAASLRRCPHVETLAMWTPTIPFSVQEHLQQQDSRISLR" repeat_region 3890..4190 /rpt_family="Alu" polyA_signal 4497..4503 /note="putative" polyA_site 4521 BASE COUNT 916 a 1419 c 1325 g 883 t ORIGIN 1 tgatgaggct gtgtgcttct gagctgggca tccgaaggca tccttgggga agctgagggc 61 acgaggaggg gctgccagac tccgggagct gctgcctggc tgggattcct acacaatgcg 121 ttgcctggct ccacgccctg ctgggtccta cctgtcagag ccccaaggca gctcacagtg 181 tgccaccatg gagttggggc ccctagaagg tggctacctg gagcttctta acagcgatgc 241 tgaccccctg tgcctctacc acttctatga ccagatggac ctggctggag aagaagagat 301 tgagctctac tcagaacccg acacagacac catcaactgc gaccagttca gcaggctgtt 361 gtgtgacatg gaaggtgatg aagagaccag ggaggcttat gccaatatcg cggaactgga 421 ccagtatgtc ttccaggact cccagctgga gggcctgagc aaggacattt tcaagcacat 481 aggaccagat gaagtgatcg gtgagagtat ggagatgcca gcagaagttg ggcagaaaag 541 tcagaaaaga cccttcccag aggagcttcc ggcagacctg aagcactgga agccagctga 601 gccccccact gtggtgactg gcagtctcct agtgggacca gtgagcgact gctccaccct 661 gccctgcctg ccactgcctg cgctgttcaa ccaggagcca gcctccggcc agatgcgcct 721 ggagaaaacc gaccagattc ccatgccttt ctccagttcc tcgttgagct gcctgaatct 781 ccctgaggga cccatccagt ttgtccccac catctccact ctgccccatg ggctctggca 841 aatctctgag gctggaacag gggtctccag tatattcatc taccatggtg aggtgcccca 901 ggccagccaa gtaccccctc ccagtggatt cactgtccac ggcctcccaa catctccaga 961 ccggccaggc tccaccagcc ccttcgctcc atcagccact gacctgccca gcatgcctga 1021 acctgccctg acctcccgag caaacatgac agagcacaag acgtccccca cccaatgccc 1081 ggcagctgga gaggtctcca acaagcttcc aaaatggcct gagccggtgg agcagttcta 1141 ccgctcactg caggacacgt atggtgccga gcccgcaggc ccggatggca tcctagtgga 1201 ggtggatctg gtgcaggcca ggctggagag gagcagcagc aagagcctgg agcgggaact 1261 ggccaccccg gactgggcag aacggcagct ggcccaagga ggcctggctg aggtgctgtt 1321 ggctgccaag gagcaccggc ggccgcgtga gacacgagtg attgctgtgc tgggcaaagc 1381 tggtcagggc aagagctatt gggctggggc agtgagccgg gcctgggctt gtggccggct 1441 tccccagtac gactttgtct tctctgtccc ctgccattgc ttgaaccgtc cgggggatgc 1501 ctatggcctg caggatctgc tcttctccct gggcccacag ccactcgtgg cggccgatga 1561 ggttttcagc cacatcttga agagacctga ccgcgttctg ctcatcctag acgccttcga 1621 ggagctggaa gcgcaagatg gcttcctgca cagcacgtgc ggaccggcac cggcggagcc 1681 ctgctccctc cgggggctgc tggccggcct tttccagaag aagctgctcc gaggttgcac 1741 cctcctcctc acagcccggc cccggggccg cctggtccag agcctgagca aggccgacgc 1801 cctatttgag ctgtccggct tctccatgga gcaggcccag gcatacgtga tgcgctactt 1861 tgagagctca gggatgacag agcaccaaga cagagccctg acgctcctcc gggaccggcc 1921 acttcttctc agtcacagcc acagccctac tttgtgccgg gcagtgtgcc agctctcaga 1981 ggccctgctg gagcttgggg aggacgccaa gctgccctcc acgctcacgg gactctatgt 2041 cggcctgctg ggccgtgcag ccctcgacag cccccccggg gccctggcag agctggccaa 2101 gctggcctgg gagctgggcc gcagacatca aagtacccta caggaggacc agttcccatc 2161 cgcagacgtg aggacctggg cgatggccaa aggcttagtc caacacccac cgcgggccgc 2221 agagtccgag ctggccttcc ccagcttcct cctgcaatgc ttcctggggg ccctgtggct 2281 ggctctgagt ggcgaaatca aggacaagga gctcccgcag tacctagcat tgaccccaag 2341 gaagaagagg ccctatgaca actggctgga gggcgtgcca cgctttctgg ctgggctgat 2401 cttccagcct cccgcccgct gcctgggagc cctactcggg ccatcggcgg ctgcctcggt 2461 ggacaggaag cagaaggtgc ttgcgaggta cctgaagcgg ctgcagccgg ggacactgcg 2521 ggcgcggcag ctgcttgagc tgctgcactg cgcccacgag gccgaggagg ctggaatttg 2581 gcagcacgtg gtacaggagc tccccggccg cctctctttt ctgggcaccc gcctcacgcc 2641 tcctgatgca catgtactgg gcaaggcctt ggaggcggcg ggccaagact tctccctgga 2701 cctccgcagc actggcattt gcccctctgg attggggagc ctcgtgggac tcagctgtgt 2761 cacccgtttc agggctgcct tgagcgacac ggtggcgctg tgggagtccc tgcggcagca 2821 tggggagacc aagctacttc aggcagcaga ggagaagttc accatcgagc ctttcaaagc 2881 caagtccctg aaggatgtgg aagacctggg aaagcttgtg cagactcaga ggacgagaag 2941 ttcctcggaa gacacagctg gggagctccc tgctgttcgg gacctaaaga aactggagtt 3001 tgcgctgggc cctgtctcag gcccccaggc tttccccaaa ctggtgcgga tcctcacggc 3061 cttttcctcc ctgcagcatc tggacctgga tgcgctgagt gagaacaaga tcggggacga 3121 gggtgtctcg cagctctcag ccaccttccc ccagctgaag tccttggaaa ccctcaatct 3181 gtcccagaac aacatcactg acctgggtgc ctacaaactc gccgaggccc tgccttcgct 3241 cgctgcatcc ctgctcaggc taagcttgta caataactgc atctgcgacg tgggagccga 3301 gagcttggct cgtgtgcttc cggacatggt gtccctccgg gtgatggacg tccagtacaa 3361 caagttcacg gctgccgggg cccagcagct cgctgccagc cttcggaggt gtcctcatgt 3421 ggagacgctg gcgatgtgga cgcccaccat cccattcagt gtccaggaac acctgcaaca 3481 acaggattca cggatcagcc tgagatgatc ccagctgtgc tctggacagg catgttctct 3541 gaggacacta accacgctgg accttgaact gggtacttgt ggacacagct cttctccagg 3601 ctgtatccca tgaggcctca gcatcctggc acccggcccc tgctggttca gggttggccc 3661 ctgcccggct gcggaatgaa ccacatcttg ctctgctgac agacacaggc ccggctccag 3721 gctcctttag cgcccagttg ggtggatgcc tggtggcagc tgcggtccac ccaggagccc 3781 cgaggccttc tctgaaggac attgcggaca gccacggcca ggccagaggg agtgacagag 3841 gcagccccat tctgcctgcc caggcccctg ccaccctggg gagaaagtac ttcttttttt 3901 ttatttttag acagagtctc actgttgccc aggctggcgt gcagtggtgc gatctgggtt 3961 cactgcaacc tccgcctctt gggttcaagc gattcttctg cttcagcctc ccgagtagct 4021 gggactacag gcacccacca tcatgtctgg ctaatttttc atttttagta gagacagggt 4081 tttgccatgt tggccaggct ggtctcaaac tcttgacctc aggtgatcca cccacctcag 4141 cctcccaaag tgctggggat tacaagcgtg agccactgca ccgggccaca gagaaagtac 4201 ttctccaccc tgctctccga ccagacacct tgacagggca caccgggcac tcagaagaca 4261 ctgatgggca acccccagcc tgctaattcc ccagattgca acaggctggg cttcagtggc 4321 aggctgcttt tgtctatggg actcaatgca ctgacattgt tggccaaagc caaagctagg 4381 cctggccaga tgcaccaggc ccttagcagg gaaacagcta atgggacact aatggggcgg 4441 tgagagggga acagactgga agcacagctt catttcctgt gtcttttttc actacattat 4501 aaatgtctct ttaatgtcac aaaaaaaaaa aaaaaaaaaa aaa // LOCUS HSCIP4 2001 bp RNA PRI 22-JUL-1997 DEFINITION Homo sapiens mRNA for Cdc42-interacting protein 4 (CIP4). ACCESSION AJ000414 NID g2274965 KEYWORDS Cdc42-interacting protein 4; CIP4 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2001) AUTHORS Aspenstroem,P. TITLE Direct Submission JOURNAL Submitted (15-JUL-1997) Aspenstroem P., Biomedical Center, Ludwig Institute for Cancer Research, Box 595, S-751 24 Uppsala, SWEDEN REFERENCE 2 (bases 1 to 2001) AUTHORS Aspenstroem,P. TITLE A Cdc42 target protein with homology to the non-kinase domain of FER has a potential role in regulating the actin cytoskeleton JOURNAL Curr. Biol. 7, 479-487 (1997) FEATURES Location/Qualifiers source 1..2001 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="EBV-transformed B-cells" gene 40..1677 /gene="CIP4" CDS 40..1677 /gene="CIP4" /function="CIP4 is a target for the small GTPase Cdc42" /codon_start=1 /evidence=experimental /product="Cdc42-interacting protein 4" /db_xref="PID:e329500" /db_xref="PID:g2274966" /translation="MDWGTELWDQFEVLERHTQWGLDLLDRYVKFVKERTEVEQAYAK QLRSLVKKYLPKRPAKDDPESKFSQQQSFVQILQEVNDFAGQRELVAENLSVRVCLEL TKYSQEMKQERKMHFQEGRRAQQQLENGFKQLENSKRKFERDCREAEKAAQTAERLDQ DINATKADVEKAKQQAHLRSHMAEESKNEYAAQLQRFNRDQAHFYFSQMPQIFDKLQD MDERRATRLGAGYGLLSEAELEVVPIIAKCLEGMKVAANAVDPKNDSHVLIELHKSGF ARPGDVEFEDFSQPMNRAPSDSSLGTPSDGRPELRGPGRSRTKRWPFGKKNKTVVTED FSHLPPEQQRKRLQQQLEERSRELQKEVDQREALKKMKDVYEKTPQMGDPASLEPQIA ETLSNIERLKLEVQKYEAWLAEAESRVLSNRGDSLSRHARPPDPPASAPPDSSSNSAS QDTKESSEEPPSEESQDTPIYTEFDEDFEEEPTSPIGHCVAIYHFEGSSEGTISMAEG EDLSLMEEDKGDGWTRVRRKEGGEGYVPTSYLRVTLN" BASE COUNT 497 a 542 c 581 g 381 t ORIGIN 1 ggccgcggtg gtggctgcgg cggcggcggc gggagcagca tggattgggg cactgagctg 61 tgggatcagt tcgaggtgct cgagcgccac acgcagtggg ggctggacct gttggacaga 121 tatgtaaagt tcgtgaaaga acgcaccgaa gtggaacagg cttacgccaa acaactgcgg 181 agcctggtga aaaaatatct gcccaagaga cctgccaagg atgatcctga gtccaaattc 241 agccagcaac agtccttcgt acagattctc caggaggtga atgactttgc aggccagcgg 301 gagctggtgg ctgagaacct cagtgtccgt gtatgtcttg agctgaccaa gtactcacaa 361 gagatgaaac aggagaggaa gatgcacttc caagaagggc ggcgggccca gcagcagctg 421 gaaaatggct ttaaacagct ggagaatagt aagcgtaaat ttgagcggga ctgccgggag 481 gcagagaagg cagcccagac tgctgaacgg ctagaccagg atatcaacgc caccaaggct 541 gatgtggaga aggccaagca gcaagcccac cttcggagtc acatggccga agaaagcaaa 601 aacgaatatg cggctcaact gcagcgcttc aaccgagacc aagcccactt ctatttttca 661 cagatgcccc agatattcga taagctccaa gacatggatg aacgcagggc cacccgcctg 721 ggtgccgggt atgggctcct gtcggaggcc gagctggagg tggtgcccat aatagccaag 781 tgcttggagg gcatgaaggt ggctgcaaat gctgtggatc ccaagaacga ctcccacgtc 841 cttatagagc tgcacaagtc aggttttgcc cgcccgggcg acgtggaatt cgaggacttc 901 agccagccca tgaaccgtgc accctccgac agcagtctgg gcaccccctc ggatggacgg 961 cctgaactcc gaggcccggg tcgcagccgc accaagcgct ggccttttgg caagaagaac 1021 aagacagtgg tgaccgagga ttttagccac ttgcccccag agcagcagcg aaaacggctt 1081 caacagcagt tggaagaacg cagtcgtgaa cttcagaagg aggttgacca gagggaagcc 1141 ctaaagaaaa tgaaggatgt ctatgagaag acacctcaga tgggggaccc cgccagcttg 1201 gagccccaga tcgctgaaac cctgagcaac attgaacggc tgaaattgga agtgcagaag 1261 tatgaggcgt ggctggcaga agctgaaagt cgagtcctta gcaaccgggg agacagcctg 1321 agccggcacg cccggcctcc cgaccccccc gctagcgccc cgccagacag cagcagcaac 1381 agcgcatcac aggacaccaa ggagagctct gaagagcctc cctcagaaga gagccaggac 1441 acccccattt acacggagtt tgatgaggat ttcgaggagg aacccacatc ccccataggt 1501 cactgtgtgg ccatctacca ctttgaaggg tccagcgagg gcactatctc tatggccgag 1561 ggtgaagacc tcagtcttat ggaagaagac aaaggggacg gctggacccg ggtcaggcgg 1621 aaagagggag gcgagggcta cgtgcccacc tcctacctcc gagtcacgct caattgaacc 1681 ctgccagaga cgggaagagg ggggctgtcg gctgctgctt ctgggccacg gggagcccca 1741 ggacctatgc actttatttc tgaccccgtg gcttcggctg agacctgtgt aacctgctgc 1801 cccctccacc cccaacccag tcctacctgt cacaccggac ggacccgctg tgccttctac 1861 catcgttcca ccattgatgt acatactcat gttttacatc ttttctttct gcgctcggct 1921 ccggccattt tgttttatac aaaaatgggt tttttttttt tctttaatat atttcaagag 1981 attttttttt tttttttttt t // LOCUS HSCK1MR 1259 bp RNA PRI 14-AUG-1995 DEFINITION H.sapiens mRNA for protein kinase CK1. ACCESSION X80693 NID g940506 KEYWORDS ck1 gene; protein kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1259) AUTHORS Tapia,C., Featherstone,T., Gomez,C., Taillon-Miller,P., Allende,C.C. and Allende,J.E. TITLE Cloning and chromosomal localization of the gene coding for human protein kinase CK1 JOURNAL FEBS Lett. 349 (2), 307-312 (1994) MEDLINE 94326941 REFERENCE 2 (bases 1 to 1259) AUTHORS Allende,J.E. TITLE Direct Submission JOURNAL Submitted (28-JUL-1994) J.E. Allende, Univ. de Chile, Dept de Bioquimica, Facultad de Medicina, Casilla 70086, Santiago 7, CHILE REMARK revised by [3] MAT REFERENCE 3 (bases 1 to 1259) AUTHORS Allende,J.E. TITLE Direct Submission JOURNAL Submitted (11-AUG-1995) J.E. Allende, Univ. de Chile, Dept de Bioquimica, Facultad de Medicina, Casilla 70086, Santiago 7, CHILE FEATURES Location/Qualifiers source 1..1259 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /dev_stage="foetal" /clone="cDNA ZAP" /chromosome="13" /map="q13" gene 126..1139 /gene="ck1" CDS 126..1139 /gene="ck1" /note="Author-given protein sequence is in conflict with the conceptual translation." /codon_start=1 /product="protein kinase CK1 (casein kinase)" /db_xref="PID:g1335025" /db_xref="SWISS-PROT:P48729" /translation="MASSSGSKAEFIVGGKYKLVRKIGSGSFGDIYLAINITNGEEVA LKLESQKARHPQLLYESKLYKILQGGVGIPHIRWYGQEKDYNVLVMDLLGPSLEDLFN FCSRRFTMKTVLMLADQMISRIEYHVTKNFIHRDIKPDNFLMGIGRHCNKLFLIDFGL AKKYRDNRTRQHIPYREDKNLTGTADYASINAHLGIEQSRRDDMESLGYVLMYFNRTS LPWQGLKAATKKQKYEKISEKKMSTPVEVLCKGFPAEFAMYLNYCRGLRFEEAPDYMY LRQLFRILFRTLNHQYDYTFDWTMLKQKAAQQAASSSGQGQQAQTPTGKQTDKTKSNM KGF" BASE COUNT 371 a 282 c 295 g 311 t ORIGIN 1 ccgcctccgt gttccgtttc ctgccgccct cctctcgtag ccttgcctag tgtggagccc 61 caggcctccg tcctcttccc agaggtgtcg aggcttggcc ccagcctcca tcttcgtctc 121 tcaggatggc gagtagcagc ggctccaagg ctgaattcat tgtcggtggg aaatataaac 181 tggtacggaa gatcgggtct ggctccttcg gggacatcta tttggcgatc aacatcacca 241 acggcgagga agtggcactg aagctagaat ctcagaaggc caggcatccc cagttgctgt 301 acgagagcaa gctctataag attcttcaag gtggggttgg catcccccac atacggtggt 361 atggtcagga aaaagactac aatgtactag tcatggatct tctgggacct agcctcgaag 421 acctcttcaa tttctgttca agaaggttca caatgaaaac tgtacttatg ttagctgacc 481 agatgatcag tagaattgaa tatgtgcata caaagaattt tatacacaga gacattaaac 541 cagataactt cctaatgggt attgggcgtc actgtaataa gttattcctt attgattttg 601 gtttggccaa aaagtacaga gacaacagga caaggcaaca cataccatac agagaagata 661 aaaacctcac tggcactgcc cgatatgcta gcatcaatgc acatcttggt attgagcaga 721 gtcgccgaga tgacatggaa tcattaggat atgttttgat gtattttaat agaaccagcc 781 tgccatggca agggctaaag gctgcaacaa agaaacaaaa atatgaaaag attagtgaaa 841 agaagatgtc cacgcctgtt gaagttttat gtaaggggtt tcctgcagaa tttgcgatgt 901 acttaaacta ttgtcgtggg ctacgctttg aggaagcccc agattacatg tatctgaggc 961 agctattccg cattcttttc aggaccctga accatcaata tgactacaca tttgattgga 1021 caatgttaaa gcagaaagca gcacagcagg cagcctcttc aagtgggcag ggtcagcagg 1081 cccaaacccc cacaggcaag caaactgaca aatccaagag taacatgaaa ggtttctaat 1141 ttctaagcat gaattgagga acagaagaag cagacgagat gatcggagca gcatttgttt 1201 ctccccaaat ctagaaattt tagttcatat gtacactagc cagtggttgt ggacaacca // LOCUS HSCKRL2 1563 bp DNA PRI 26-JUL-1997 DEFINITION H.sapiens G protein-coupled receptor CKR-L2. ACCESSION Z79783 NID g2281709 KEYWORDS G Protein-coupled Receptor CKR-L2. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1563) AUTHORS Gutierrez,J., Varona,R., Zaballos,A., Lind,P. and Marquez,G. TITLE unpublished JOURNAL Unpublished REFERENCE 2 (bases 1 to 1563) AUTHORS Zaballos,A. TITLE Direct Submission JOURNAL Submitted (03-SEP-1996) Angel Zaballos, Research, Pharmacia & Upjohn, Antonio Lopez 109, Madrid, 28026, Spain FEATURES Location/Qualifiers source 1..1563 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 255..1502 /codon_start=1 /product="G PROTEIN-COUPLED RECEPTOR CKR-L2" /db_xref="PID:e264773" /db_xref="PID:g2281710" /translation="MELRKYGPGRLAGTVIGGAAQSKSQTKSDSITKEFLPGLYTAPS SPFPPSQVSDHQVLNDAEVAALLENFSSSYDYGENESDSCCTSPPCPQDFSLNFDRAF LPALYSLLFLLGLLGNGAVRAVLLSRRTALSSTDTFLLHLAVADTLLVLTLPLWAVDA AVQWVFGSGLCKVAGALFNINFYAGALLLACISFDRYLNIVHATQLYRRGPPARVTLT CLAVWGLCLLFALPDFIFLSAHHDERLNATHCQYNFPQVGRTALRVLQLVAGFLLPLL VMAYCYAHILAVLLVSRGQRRLRAMRLVVVVVVAFALCWTPYHLVVLVDILMDLGALA RNCGRESRVDVAKSVTSGLGYMHCCLNPLLYAFVGVKFRERMWMLLLRLGCPNQRGLQ RQPSSSRRDSSWSETSEASYSGL" BASE COUNT 272 a 497 c 479 g 315 t ORIGIN 1 aaaaggggat tgtggagagg cgctggggag gaaggtggtt gaactatgtg ttgggggtgg 61 ggcactgagg acgcaaggca agcctgaagg gagagcaggg agagagagga cagtgggcag 121 agagggctct gggcactgga gggacgctct tcttcctgcc caggggtccc tgggcggatg 181 ggatcacgca gaagaatgcg agagaagcag cctttgagaa gggaagtcac tatcccagag 241 cccaggctga gcggatggag ttgaggaagt acggccctgg aagactggcg gggacagtta 301 taggaggagc tgctcagagt aaatcacaga ctaaatcaga ctcaatcaca aaagagttcc 361 tgccaggcct ttacacagcc ccttcctccc cgttcccgcc ctcacaggtg agtgaccacc 421 aagtgctaaa tgacgccgag gttgccgccc tcctggagaa cttcagctct tcctatgact 481 atggagaaaa cgagagtgac tcgtgctgta cctccccgcc ctgcccacag gacttcagcc 541 tgaacttcga ccgggccttc ctgccagccc tctacagcct cctctttctg ctggggctgc 601 tgggcaacgg cgcggtgcga gccgtgctgc tgagccggcg gacagccctg agcagcaccg 661 acaccttcct gctccaccta gctgtagcag acacgctgct ggtgctgaca ctgccgctct 721 gggcagtgga cgctgccgtc cagtgggtct ttggctctgg cctctgcaaa gtggcaggtg 781 ccctcttcaa catcaacttc tacgcaggag ccctcctgct ggcctgcatc agctttgacc 841 gctacctgaa catagttcat gccacccagc tctaccgccg ggggcccccg gcccgcgtga 901 ccctcacctg cctggctgtc tgggggctct gcctgctttt cgccctccca gacttcatct 961 tcctgtcggc ccaccacgac gagcgcctca acgccaccca ctgccaatac aacttcccac 1021 aggtgggccg cacggctctg cgggtgctgc agctggtggc tggctttctg ctgcccctgc 1081 tggtcatggc ctactgctat gcccacatcc tggccgtgct gctggtttcc aggggccagc 1141 ggcgcctgcg ggccatgcgg ctggtggtgg tggtcgtggt ggcctttgcc ctctgctgga 1201 ccccctatca cctggtggtg ctggtggaca tcctcatgga cctgggcgct ttggcccgca 1261 actgtggccg agaaagcagg gtagacgtgg ccaagtcggt cacctcaggc ctgggctaca 1321 tgcactgctg cctcaacccg ctgctctatg cctttgtagg ggtcaagttc cgggagcgga 1381 tgtggatgct gctcttgcgc ctgggctgcc ccaaccagag agggctccag aggcagccat 1441 cgtcttcccg ccgggattca tcctggtctg agacctcaga ggcctcctac tcgggcttgt 1501 gaggccggaa tccgggctcc cctttcgccc acagtctgac ttccccgcat tccaggctcc 1561 tcc // LOCUS HSCKSHS1 717 bp RNA PRI 30-APR-1992 DEFINITION H.sapiens ckshs1 mRNA for Cks1 protein homologue. ACCESSION X54941 X55505 NID g29976 KEYWORDS Cdc28 protein kinase; Cks1 protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 717) AUTHORS Richardson,H.E. TITLE Direct Submission JOURNAL Submitted (28-AUG-1990) Richardson H.E., Biochemistry Dept, University of Adelaide, P O Box 498, Adelaide 5001, South Australia REFERENCE 2 (bases 1 to 717) AUTHORS Richardson,H.E., Stueland,C.S., Thomas,J., Russell,P. and Reed,S.I. TITLE Human cDNAs encoding homologs of the small p34Cdc28/Cdc2-associated protein of Saccharomyces cerevisiae and Schizosaccharomyces pombe JOURNAL Genes Dev. 4 (8), 1332-1344 (1990) MEDLINE 91032985 FEATURES Location/Qualifiers source 1..717 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /clone_lib="lambda-ZAPII (strategene)" /clone="ckshs1" gene 10..249 /gene="ckshs1" CDS 10..249 /gene="ckshs1" /codon_start=1 /product="Cks1 protein homologue" /db_xref="PID:g29977" /db_xref="SWISS-PROT:P33551" /translation="MSHKQIYYSDKYDDEEFEYRHVMLPKDIAKLVPKTHLMSESEWR NLGVQQSQGWVHYMIHEPEPHILLFRRPLPKKPKK" polyA_signal 696..701 BASE COUNT 192 a 157 c 155 g 213 t ORIGIN 1 agagcgatca tgtcgcacaa acaaatttac tattcggaca aatacgacga cgaggagttt 61 gagtatcgac atgtcatgct gcccaaggac atagccaagc tggtccctaa aacccatctg 121 atgtctgaat ctgaatggag gaatcttggc gttcagcaga gtcagggatg ggtccattat 181 atgatccatg aaccagaacc tcacatcttg ctgttccggc gcccactacc caagaaacca 241 aagaaatgaa gctggcaagc tacttttcag cctcaagctt tacacagctg tccttacttc 301 ctaacatctt tctgataaca ttattatgtt gccttcttgt ttctcacttt gatatttaaa 361 agatgttcaa tacactgttt gaatgtgctg gtaactgctt tgcttcttga gtagagccac 421 caccaccata gcccagccag atgagtgctc tgtggaccca cagcctaagc tgagtgtgac 481 cccagaagcc acgatgtgct ctgtatccag aacacacttg gcagatggag gaagcatctg 541 agtttgagac catggctgtt acagggatca tgtaaacttg ctgtttttgt tttttctgcc 601 gggtgttgta tgtgtggtga cttgcggatt tatgtttcag tgtactggaa actttccatt 661 ttattcaaga aatctgttca tgttaaaagc cttgattaaa gaggaagttt ttataat // LOCUS HSCKSHS2 627 bp RNA PRI 30-APR-1992 DEFINITION H.sapiens ckshs2 mRNA for Cks1 protein homologue. ACCESSION X54942 X55506 NID g29978 KEYWORDS Cdc28 protein kinase; Cks1 protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 627) AUTHORS Richardson,H.E. TITLE Direct Submission JOURNAL Submitted (28-AUG-1990) Richardson H.E., Biochemistry Dept, University of Adelaide, P O Box 498, Adelaide 5001, South Australia REFERENCE 2 (bases 1 to 627) AUTHORS Richardson,H.E., Stueland,C.S., Thomas,J., Russell,P. and Reed,S.I. TITLE Human cDNAs encoding homologs of the small p34Cdc28/Cdc2-associated protein of Saccharomyces cerevisiae and Schizosaccharomyces pombe JOURNAL Genes Dev. 4 (8), 1332-1344 (1990) MEDLINE 91032985 FEATURES Location/Qualifiers source 1..627 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /clone_lib="plasmid PCD (Steve Hanks, Salk Inst.)" /clone="ckshs2" gene 96..335 /gene="ckshs2" CDS 96..335 /gene="ckshs2" /codon_start=1 /product="Cks1 protein homologue" /db_xref="PID:g29979" /db_xref="SWISS-PROT:P33552" /translation="MAHKQIYYSDKYFDEHYEYRHVMLPRELSKQVPKTHLMSEEEWR RLGVQQSLGWVHYMIHEPEPHILLFRRPLPKDQQK" polyA_signal 590..595 polyA_site 612 BASE COUNT 181 a 126 c 128 g 192 t ORIGIN 1 agtctccggc gagttgttgc ctgggctgga cgtggttttg tctgctgcgc ccgctcttcg 61 cgctctcgtt tcattttctg cagcgcgcca cgaggatggc ccacaagcag atctactact 121 cggacaagta cttcgacgaa cactacgagt accggcatgt tatgttaccc agagaacttt 181 ccaaacaagt acctaaaact catctgatgt ctgaagagga gtggaggaga cttggtgtcc 241 aacagagtct aggctgggtt cattacatga ttcatgagcc agaaccacat attcttctct 301 ttagacgacc tcttccaaaa gatcaacaaa aatgaagttt atctggggat cgtcaaatct 361 ttttcaaatt taatgtatat gtgtatataa ggtagtattc agtgaatact tgagaaatgt 421 acaaatcttt catccatacc tgtgcatgag ctgtattctt cacagcaaca gagctcagtt 481 aaatgcaact gcaagtaggt tactgtaaga tgtttaagat aaaagttctt ccagtcagtt 541 tttctcttaa gtgcctgttt gagtttactg aaacagttta cttttgttca ataaagtttg 601 tatgttgcat ttaaaaaaaa aaaaaaa // LOCUS HSCL100 2000 bp RNA PRI 25-JUL-1993 DEFINITION H.sapiens CL 100 mRNA for protein tyrosine phosphatase. ACCESSION X68277 S46269 NID g29980 KEYWORDS tyrosine phosphatase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2000) AUTHORS Keyse,S.M. TITLE Direct Submission JOURNAL Submitted (25-AUG-1992) S.M. Keyse, ICRF Molecular Pharmacology Group, University Dept of Biochemistry, Hugh Robson Bldg, George Square, Edinburgh EH8 9XD, UK REFERENCE 2 (bases 1 to 2000) AUTHORS Keyse,S.M. and Emslie,E.A. TITLE Oxidative stress and heat shock induce a human gene encoding a protein-tyrosine phosphatase JOURNAL Nature 359 (6396), 644-647 (1992) MEDLINE 93024952 FEATURES Location/Qualifiers source 1..2000 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="foreskin" /cell_type="fibroblast" /cell_line="EK 4" /clone_lib="cDNA/ Lambda gt10" /clone="CL 100" gene 234..1337 /gene="CL 100" CDS 234..1337 /gene="CL 100" /EC_number="3.1.3.48" /codon_start=1 /product="protein-tyrosine phosphatase" /db_xref="PID:g29981" /db_xref="SWISS-PROT:P28562" /translation="MVMEVGTLDAGGLRALLGERAAQCLLLDCRSFFAFNAGHIAGSV NVRFSTIVRRRAKGAMGLEHIVPNAELRGRLLAGAYHAVVLLDERSAALDGAKRDGTL ALAAGALCREARAAQVFFLKGGYEAFSASCPELCSKQSTPMGLSLPLSTSVPDSAESG CSSCSTPLYDQGGPVEILPFLYLGSAYHASRKDMLDALGITALINVSANCPNHFEGHY QYKSIPVEDNHKADISSWFNEAIDFIDSIKNAGGRVFVHCQAGISRSATICLAYLMRT NRVKLDEAFEFVKQRRSIISPNFSFMGQLLQFESQVLAPHCSAEAGSPAMAVLDRGTS TTTVFNFPVSIPVHSTNSALSYLQSPITTSPSC" misc_signal 1441..1449 /note="CL 100" /function="mRNA destabilizing signal" polyA_signal 1978..1983 BASE COUNT 414 a 566 c 539 g 481 t ORIGIN 1 tttgggctgt gtgtgcgacg cgggtcggag gggcagtcgg gggaaccgcg aagaagccga 61 ggagcccgga gccccgcgtg acgctcctct ctcagtccaa aagcggcttt tggttcggcg 121 cagagagacc cgggggtcta gcttttcctc gaaaagcgcc gccctgccct tggccccgag 181 aacagacaaa gagcaccgca gggccgatca cgctgggggc gctgaggccg gccatggtca 241 tggaagtggg caccctggac gctggaggcc tgcgggcgct gctgggggag cgagcggcgc 301 aatgcctgct gctggactgc cgctccttct tcgctttcaa cgccggccac atcgccggct 361 ctgtcaacgt gcgcttcagc accatcgtgc ggcgccgggc caagggcgcc atgggcctgg 421 agcacatcgt gcccaacgcc gagctccgcg gccgcctgct ggccggcgcc taccacgccg 481 tggtgttgct ggacgagcgc agcgccgccc tggacggcgc caagcgcgac ggcaccctgg 541 ccctggcggc cggcgcgctc tgccgcgagg cgcgcgccgc gcaagtcttc ttcctcaaag 601 gaggatacga agcgttttcg gcttcctgcc cggagctgtg cagcaaacag tcgaccccca 661 tggggctcag ccttcccctg agtactagcg tccctgacag cgcggaatct gggtgcagtt 721 cctgcagtac cccactctac gatcagggtg gcccggtgga aatcctgccc tttctgtacc 781 tgggcagtgc gtatcacgct tcccgcaagg acatgctgga tgccttgggc ataactgcct 841 tgatcaacgt ctcagccaat tgtcccaacc attttgaggg tcactaccag tacaagagca 901 tccctgtgga ggacaaccac aaggcagaca tcagctcctg gttcaacgag gccattgact 961 tcatagactc catcaagaat gctggaggaa gggtgtttgt ccactgccag gcaggcattt 1021 cccggtcagc caccatctgc cttgcttacc ttatgaggac taatcgagtc aagctggacg 1081 aggcctttga gtttgtgaag cagaggcgaa gcatcatctc tcccaacttc agcttcatgg 1141 gccagctgct gcagtttgag tcccaggtgc tggctccgca ctgttcggca gaggctggga 1201 gccccgccat ggctgtgctc gaccgaggca cctccaccac caccgtgttc aacttccccg 1261 tctccatccc tgtccactcc acgaacagtg cgctgagcta ccttcagagc cccattacga 1321 cctctcccag ctgctgaaag gccacgggag gtgaggctct tcacatccca ttgggactcc 1381 atgctccttg agaggagaaa tgcaataact ctgggagggg ctcgagaggg ctggtcctta 1441 tttatttaac ttcacccgag ttcctctggg tttctaagca gttatggtga tgacttagcg 1501 tcaagacatt tgctgaactc agcacattcg ggaccaatat atagtgggta catcaagtcc 1561 atctgacaaa atggggcaga agagaaagga ctcagtgtgt gatccggttt ctttttgctc 1621 gcccctgttt tttgtagaat ctcttcatgc ttgacatacc taccagtatt attcccgacg 1681 acacatatac atatgagaat ataccttatt tatttttgtg taggtgtctg ccttcacaaa 1741 tgtcattgtc tactcctaga agaaccaaat acctcaattt ttgtttttga gtactgtact 1801 atcctgtaaa tatatcttaa gcaggtttgt tttcagcact gatggaaaat accagtgttg 1861 ggtttttttt tagttgccaa cagttgtatg tttgctgatt atttatgacc tgaaataata 1921 tatttcttct tctaagaaga cattttgtta cataaggatg acttttttat acaatggaat 1981 aaattatggc atttctattg // LOCUS HSCL1042 2400 bp RNA PRI 07-OCT-1996 DEFINITION H.sapiens cl.1042 mRNA of DEAD box protein family. ACCESSION X70649 NID g1370104 KEYWORDS 16S ribosomal RNA; DEAD box protein; retinoblastoma; ribosomal RNA; subunit. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2400) AUTHORS Godbout,R. TITLE Direct Submission JOURNAL Submitted (13-JAN-1993) R. Godbout, Cross Cancer Institute, 11560 University Avenue, Edmonton, Alberta, T6G 1Z2, CANADA REMARK revised by [4] REFERENCE 2 (bases 1 to 2400) AUTHORS Godbout,R. and Squire,J. TITLE Amplification of a DEAD box protein gene in retinoblastoma cell lines JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90, 7578-7582 (1993) MEDLINE 93361490 REFERENCE 3 (bases 1 to 2400) AUTHORS Godbout,R. TITLE Direct Submission JOURNAL Submitted (11-MAR-1996) R. Godbout, Cross Cancer Institute, 11560 University Avenue, Edmonton, Alberta, T6G 1Z2, CANADA REMARK Revised by [5] REFERENCE 4 (bases 1 to 2400) AUTHORS Godbout,R. TITLE Direct Submission JOURNAL Submitted (04-JUN-1996) R. Godbout, Cross Cancer Institute, 11560 University Avenue, Edmonton, Alberta, T6G 1Z2, CANADA FEATURES Location/Qualifiers source 1..2400 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="retinoblastoma" gene 1..2205 /gene="cl. 1042" CDS 1..2205 /gene="cl. 1042" /codon_start=1 /product="member of DEAD box protein family" /db_xref="PID:e264415" /db_xref="PID:g1370105" /translation="MGVMPEIAQAVEEMDWLLPTDIQAESIPLILGGGDVLMAAETGS GKTGAFSIPVIQIVYETLKDQQEGKKGKTTIKTGASVLNKWQMNPYDRGSAFAIGSDG LCCQSREVKEWHGCRATKGLMKGKHYYEVSCHDQGLCRVGWSTMQASLDLGTDKFGFG FGGTGKKSHNKQFDNYGEEFTMHDTIGCYLDIDKGHVKFSKNGKDLGLAFEIPPHMKN QALFPACVLKNAELKFNFGEEEFKFPPKDGFVALSKAPDGYIVKSQHSGNAQVTQTKF LPNAPKALIVEPSRELAEQTLNNIKQFKKYIDNPKLRELLIIGGVAARDQLSVLENGV DIVVGTPGRLDDLVSTGKLNLSQVRFLVLDEADGLLSQGYSDFINRMHNQIPQVTSDG KRLQVIVCSATLHSFDVKKLSEKIMHFPTWVDLKGEDSVPDTVHHVVVPVNPKTDRLW ERLGKSHIRTDDVHAKDNTRPGANSPEMWSEAIKILKGEYAVRAIKEHKMDQAIIFCR TKIDCDNLEQYFIQQGGGPDKKGHQFSCVCLHGDRKPHERKQNLERFKKGDVRFLICT DVAARGIDIHGVPYVINVTLPDEKQNYVHRIGRVGRAERMGLAISLVATEKEKVWYHV CSSRGKGCYNTRLKEDGGCTIWYNEMQLLSEIEEHLNCTISQVEPDIKVPVDEFDGKV TYGQKRAAGGGSYKGHVDILAPTVQELAALEKEAQTSFLHLGYLPNQLFRTF" polyA_site 2400 BASE COUNT 768 a 424 c 563 g 645 t ORIGIN 1 atgggtgtaa tgcctgagat tgcacaagct gtggaagaga tggattggct cctcccaact 61 gatatccagg ctgaatctat cccattgatc ttaggaggag gtgatgtact tatggctgca 121 gaaacaggaa gtggcaaaac tggtgctttt agtattccag ttatccagat agtttatgaa 181 actctgaaag accaacagga aggcaaaaaa ggaaaaacaa caattaaaac tggtgcttca 241 gtgctgaaca aatggcagat gaacccatat gacagaggat ctgcttttgc aattgggtca 301 gatggtcttt gttgtcaaag cagagaagta aaggaatggc atgggtgtag agctactaaa 361 ggattaatga aagggaaaca ctactatgaa gtatcctgtc atgaccaagg gttatgcagg 421 gtcgggtggt ctaccatgca ggcctctttg gacctaggta ctgacaagtt tggatttggc 481 tttggtggaa caggaaagaa atcccataac aaacaatttg ataattatgg agaggaattc 541 actatgcatg ataccattgg atgttacctg gatatagata agggacatgt caagttctcc 601 aaaaatggaa aagatcttgg tctggcattt gaaataccac cacatatgaa aaaccaagcc 661 ctctttcctg cctgtgtttt gaagaatgct gaactgaaat ttaacttcgg tgaagaggaa 721 tttaagtttc caccaaaaga tggctttgtt gctctttcca aggcaccgga tggttacatt 781 gtcaaatcac agcactcagg taatgcacag gtgacacaaa caaagtttct ccccaatgct 841 ccgaaagctc tcattgttga accttcccgg gagttagctg aacaaacttt gaacaacatc 901 aagcagttta agaaatacat tgataatcct aaattaaggg agcttctgat aattggaggt 961 gttgcagccc gggatcagct ctctgttttg gaaaatggag tagatatagt tgtaggtact 1021 ccgggaagac tagatgactt ggtgtcaact ggaaagctga acttatctca agttagattc 1081 ctggtcctgg atgaagctga tgggcttctt tctcaaggtt attctgattt tataaatagg 1141 atgcacaatc agattcctca ggttacctct gatggaaaaa gacttcaggt gattgtttgc 1201 tctgccactt tgcattcttt cgatgtaaag aaactgtccg agaagataat gcattttcct 1261 acatgggttg acttaaaagg agaagactct gttccagata ctgtacacca tgttgttgtc 1321 ccagtaaatc ccaaaactga cagactctgg gaaaggcttg gaaagagcca cattagaact 1381 gatgatgtac atgcaaaaga taacacaaga cctggtgcta atagtccaga gatgtggtct 1441 gaagctatta aaatcctgaa aggggagtat gctgtccggg caatcaagga acataagatg 1501 gatcaagcaa ttatcttctg tagaaccaaa attgactgtg ataacttgga gcagtacttt 1561 atacaacaag gaggaggacc tgataaaaaa ggacaccagt tctcatgtgt ttgtcttcat 1621 ggtgacagaa agcctcatga gagaaagcaa aacttggaaa gatttaagaa aggagatgta 1681 agattcttga tttgcacaga tgtagctgct agaggaattg atatccacgg tgttccttat 1741 gttataaatg tcactctgcc cgatgaaaag caaaactacg tacatcgaat tggcagagta 1801 ggaagagctg aaaggatggg tctggcaatt tccctggtgg caacagaaaa agaaaaggtt 1861 tggtaccatg tatgtagcag ccgtggaaaa gggtgttata acacaagact caaggaagat 1921 ggaggctgta ccatatggta caacgagatg cagttactat ctgagataga agaacacctg 1981 aactgtacca tttctcaggt tgagccggat ataaaggtac cagtggatga atttgatggg 2041 aaagttacct acggtcagaa aagggctgct ggtggtggaa gctataaagg ccatgtggat 2101 attttggcac ctactgttca agagttggct gcccttgaaa aggaggcgca gacatctttc 2161 ctgcatcttg gctaccttcc taaccagctg ttcagaacct tctgattttt acatttactg 2221 aataagattt gagtaatgaa agtctgtagt cttaaaactc taaaacagtt gtactgcttc 2281 caagcagcag tatttatagt aacgtaagct attaatgcta actcttgcat gtcaagaaac 2341 attagtctta ggaattcttc aaaaaatggc atcccaatga aaataaattt gatgactata // LOCUS HSCLA1GNA 2566 bp RNA PRI 06-OCT-1993 DEFINITION H.sapiens encoding CLA-1 mRNA. ACCESSION Z22555 NID g397606 KEYWORDS CLA-1. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2566) AUTHORS Calvo,D. and Vega,M.A. TITLE Identification, primary structure, and distribution of CLA-1, a novel member of the CD36/LIMPII gene family JOURNAL J. Biol. Chem. 268 (25), 18929-18935 (1993) MEDLINE 93366811 REFERENCE 2 (bases 1 to 2566) AUTHORS VEGA,M. TITLE Direct Submission JOURNAL Submitted (15-APR-1993) VEGA M., HOSPITAL DE LA PRINCESA, UNIDAD DE BIOLOGIA MOLECULAR, C/ DIEGO DE LEON 62, MADRID, MADRID, SPAIN, 28006 FEATURES Location/Qualifiers source 1..2566 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="promyelocytes" /cell_line="HL60" /clone_lib="HL60 cDNA library, Angel L. Corbi" 5'UTR 1..69 CDS 70..1599 /codon_start=1 /product="CLA-1" /db_xref="PID:g397607" /translation="MGCSAKARWAAGALGVAGLLCAVLGAVMIVMVPSLIKQQVLKNV RIDPSSLSFNMWKEIPIPFYLSVYFFDVMNPSEILKGEKPQVRERGPYVYRESRHKSN ITFNNNDTVSFLEYRTFQFQPSKSHGSESDYIVMPNILVLGAAVMMENKPMTLKLIMT LAFTTLGERAFMNRTVGEIMWGYKDPLVNLINKYFPGMFPFKDKFGLFAELNNSDSGL FTVFTGVQNISRIHLVDKWNGLSKVDFWHSDQCNMINGTSGQMWPPFMTPESSLEFYS PEACRSMKLMYKESGVFEGIPTYRFVAPKTLFANGSIYPPNEGFCPCLESGIQNVSTC RFSAPLFLSHPHFLNADPVLAEAVTGLHPNQEAHSLFLDIHPVTGIPMNCSVKLQLSL YMKSVAGIGQTGKIEPVVLPLLWFAESGAMEGETLHTFYTQLVLMPKVMHYAQYVLLA LGCVLLLVPVICQIRSQEKCYLFWSSSKKGSKDKEAIQAYSESLMTSAPKGSVLQEAK L" 3'UTR 1600..2566 polyA_site 2532..2537 BASE COUNT 528 a 811 c 695 g 532 t ORIGIN 1 cgtcgccgtc cccgtctcct gccaggcgcg gagccctgcg agccgcgggt gggccccagg 61 cgcgcagaca tgggctgctc cgccaaagcg cgctgggctg ccggggcgct gggcgtcgcg 121 gggctactgt gcgctgtgct gggcgctgtc atgatcgtga tggtgccgtc gctcatcaag 181 cagcaggtcc ttaagaacgt gcgcatcgac cccagtagcc tgtccttcaa catgtggaag 241 gagatcccta tccccttcta tctctccgtc tacttctttg acgtcatgaa ccccagcgag 301 atcctgaagg gcgagaagcc gcaggtgcgg gagcgcgggc cctacgtgta cagggagtcc 361 aggcacaaaa gcaacatcac cttcaacaac aacgacaccg tgtccttcct cgagtaccgc 421 accttccagt tccagccctc caagtcccac ggctcggaga gcgactacat cgtcatgccc 481 aacatcctgg tcttgggtgc ggcggtgatg atggagaata agcccatgac cctgaagctc 541 atcatgacct tggcattcac caccctcggc gaacgtgcct tcatgaaccg cactgtgggt 601 gagatcatgt ggggctacaa ggaccccctt gtgaatctca tcaacaagta ctttccaggc 661 atgttcccct tcaaggacaa gttcggatta tttgctgagc tcaacaactc cgactctggg 721 ctcttcacgg tgttcacggg ggtccagaac atcagcagga tccacctcgt ggacaagtgg 781 aacgggctga gcaaggttga cttctggcat tccgatcagt gcaacatgat caatggaact 841 tctgggcaaa tgtggccgcc cttcatgact cctgagtcct cgctggagtt ctacagcccg 901 gaggcctgcc gatccatgaa gctaatgtac aaggagtcag gggtgtttga aggcatcccc 961 acctatcgct tcgtggctcc caaaaccctg tttgccaacg ggtccatcta cccacccaac 1021 gaaggcttct gcccgtgcct ggagtctgga attcagaacg tcagcacctg caggttcagt 1081 gcccccttgt ttctctccca tcctcacttc ctcaacgccg acccggttct ggcagaagcg 1141 gtgactggcc tgcaccctaa ccaggaggca cactccttgt tcctggacat ccacccggtc 1201 acgggaatcc ccatgaactg ctctgtgaaa ctgcagctga gcctctacat gaaatctgtc 1261 gcaggcattg gacaaactgg gaagattgag cctgtggtcc tgccgctgct ctggtttgca 1321 gagagcgggg ccatggaggg ggagactctt cacacattct acactcagct ggtgttgatg 1381 cccaaggtga tgcactatgc ccagtacgtc ctcctggcgc tgggctgcgt cctgctgctg 1441 gtccctgtca tctgccaaat ccggagccaa gagaaatgct atttattttg gagtagtagt 1501 aaaaagggct caaaggataa ggaggccatt caggcctatt ctgaatccct gatgacatca 1561 gctcccaagg gctctgtgct gcaggaagca aaactgtagg gtcctgagga caccgtgagc 1621 cagccaggcc tggccgctgg gcctgaccgg ccccccagcc cctacacccc gcttctcccg 1681 gactctccca gcagacagcc ccccagcccc acagcctgag cctcccagct gccatgtgcc 1741 tgttgcacac ctgcacacac gccctggcac acatacacac atgcgtgcag gcttgtgcag 1801 acactcaggg atggagctgc tgctgaaggg acttgtaggg agaggctcgt caacaagcac 1861 tgttctggaa ccttctctcc acgtggccca caggctgacc acaggggctg tgggtcctgc 1921 gtccccttcc tcgggtgagc ctggcctgtc ccgttcagcc gttgggccag gcttcctccc 1981 ctccaaggtg aaacactgca gtcccggtgt ggtggctccc catgcaggac gggccaggct 2041 gggagtgccg ccttcctgtg ccaaattcag tggggactca gtgcccaggc cctggcacga 2101 gctttggcct tggtctacct gccaggccag gcaaagcgcc tttacacagg cctcggaaaa 2161 caatggagtg agcacaagat gccctgtgca gctgcccgag ggtctccgcc caccccggcc 2221 ggactttgat ccccccgaag tcttcacagg cactgcatcg ggttgtctgg cgcccttttc 2281 ctccagccta aactgacatc atcctatgga ctgagccggc cactctctgg ccgaagtggc 2341 gcaggctgtg cccccgagct gcccccaccc cctcacaggg tccctcagat tataggtgcc 2401 caggctgagg tgaagaggcc tgggggccct gccttccggg cgctcctgga ccctggggca 2461 aacctgtgac ccttttctac tggaatagaa atgagtttta tcatctttga aaaataattc 2521 actcttgaag taataaacgt ttaaaaaaat ggaaaaaaaa aaaaaa // LOCUS HSCLC1MR 3093 bp RNA PRI 23-MAR-1995 DEFINITION H.sapiens mRNA for ClC-1 muscle chloride channel protein. ACCESSION Z25884 NID g398160 KEYWORDS chloride channel protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3093) AUTHORS Steinmeyer,K., Lorenz,C., Pusch,M., Koch,M.C. and Jentsch,T.J. TITLE Multimeric structure of ClC-1 chloride channel revealed by mutations in dominant myotonia congenita (Thomsen) JOURNAL EMBO J. 13 (4), 737-743 (1994) MEDLINE 94155836 REFERENCE 2 (bases 1 to 3093) AUTHORS Jentsch,T.J. TITLE Direct Submission JOURNAL Submitted (03-SEP-1993) Thomas J Jentsch, Centre for Molecular Neurobiology, ZMNH, Hamburg University, Martinistr. 52, Hamburg, D-20246, Germany REFERENCE 3 (bases 1 to 3093) AUTHORS Lorenz,C., Meyer-Kleine,C., Steinmeyer,K., Koch,M.C. and Jentsch,T.J. TITLE Genomic organization of the human muscle chloride channel CIC-1 and analysis of novel mutations leading to Becker-type myotonia JOURNAL Hum. Mol. Genet. 3 (6), 941-946 (1994) MEDLINE 95038751 FEATURES Location/Qualifiers source 1..3093 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="Caucasian male fetus placenta" /clone_lib="Stratagene Lambda FixII genomic library #946203" /chromosome="7/q" 5'UTR 1..87 /partial /note="sequence is derived from a genomic clone; shows sequence homology to the rat ClC-1 5`UTR region (GenBank # X62894)" CDS 88..3054 /note="88..596 is derived from a genomic clone; 597..3054 is identical to partial human ClC-1 cDNA clone (GenBank M97820)" /codon_start=1 /product="human ClC-1 muscle chloride channel" /db_xref="PID:g398161" /db_xref="SWISS-PROT:P35523" /translation="MEQSRSQQRGGEQSWWGSDPQYQYMPFEHCTSYGLPSENGGLQH RLRKDAGPRHNVHPTQIYGHHKEQFSDREQDIGMPKKTGSSSTVDSKDEDHYSKCQDC IHRLGQVVRRKLGEDWIFLVLLGLLMALVSWSMDYVSAKSLQAYKWSYAQMQPSLPLQ FLVWVTFPLVLILFSALFCHLISPQAVGSGIPEMKTILRGVVLKEYLTMKAFVAKVVA LTAGLGSGIPVGKEGPFVHIASICAAVLSKFMSVFCGVYEQPYYYSDILTVGCAVGVG CCFGTPLGGVLFSIEVTSTYFAVRNYWRGFFAATFSAFVFRVLAVWNKDAVTITALFR TNFRMDFPFDLKELPAFAAIGICCGLLGAVFVYLHRQVMLGVRKHKALSQFLAKHRLL YPGIVTFVIASFTFPPGMGQFMAGELMPREAISTLFDNNTWVKHAGDPESLGQSAVWI HPRVNVVIIIFLFFVMKFWMSIVATTMPIPCGGFMPVFVLGAAFGRLVGEIMAMLFPD GILFDDIIYKILPGGYAVIGAAALTGAVSHTVSTAVICFELTGQIAHILPMMVAVILA NMVAQSLQPSLYDSIIQVKKLPYLPDLGWNQLSKYTIFVEDIMVRDVKFVSASYTYGE LRTLLQTTTVKTLPLVDSKDSMILLGSVERSELQALLQRHLCPERRLRAAQEMARKLS ELPYDGKARLAGEGPPGAPPGRPESFAFVDEDEDEDLSGKSELPPSLALHPSTTAPLS PEEPNGPLPGHKQQPEAPEPAGQRPSIFQSLLHCLLGRARPTKKKTTQDSTDLVDNMS PEEIEAWEQEQLSQPVCFDSCCIDQSPFQLVEQTTLHKTHTLFSLLGLHLAYVTSMGK LRGVLALEELQKAIEGHTKSGVQLRPPLASFRNTTSTRKSTGAPPSSAENWNLPEDRP GATGTGDVIAASPETPVPSPSPEPPLSLAPGKVEGELEELELVESPGLEEELADILQG PSLRSTDEEDEDELIL" 3'UTR 3055..3093 /partial /note="sequence is identical to a partial human ClC-1 cDNA clone (GenBank M97820)" BASE COUNT 656 a 884 c 853 g 700 t ORIGIN 1 agcaagagca gaggcttaag gagctacact gggggaagga caggggcaag caggccaagg 61 cctggccggg gctcgggggg agggaatatg gagcaatccc ggtcacagca gcgtgggggt 121 gaacaaagct ggtggggtag tgacccccag taccagtata tgccctttga acactgcacc 181 agctacggac tgccctctga gaatgggggc ctccagcaca ggctccggaa ggatgcaggc 241 ccccgccaca acgtccaccc cacacagatt tatggccatc acaaagaaca attctcagac 301 agggagcagg acatagggat gcccaagaag acaggctcca gttctaccgt ggacagcaag 361 gatgaggatc actattctaa atgtcaagat tgtatccacc gcctgggaca ggtggtgaga 421 agaaaattag gggaagactg gatctttctg gtgcttctgg gactgctgat ggctctggtc 481 agctggagca tggactacgt cagtgccaaa agccttcagg cctacaagtg gtcctacgcg 541 cagatgcagc ccagccttcc tctgcagttc ctggtctggg tcaccttccc actagtcctc 601 atcctcttca gcgccctctt ctgccacctc atctctcccc aggctgttgg ctctggaatc 661 cccgaaatga agacaatact tcgtggggtt gtcctgaagg aatacctcac aatgaaagcc 721 tttgtggcca aggttgtcgc cctgactgcg ggcctgggca gtggcatccc cgtggggaaa 781 gagggcccct tcgtccacat tgccagcatc tgtgctgctg tcctcagcaa attcatgtct 841 gtgttctgcg gggtatatga gcagccatac tactactctg atatcctgac ggtgggctgt 901 gctgtgggag tcggctgttg ttttgggaca ccacttggag gagtgctatt tagcatcgag 961 gtcacctcca cctactttgc tgttcggaac tactggagag gattctttgc agccacgttc 1021 agcgcctttg tgtttcgagt gctggcagtg tggaacaagg atgctgtcac catcactgct 1081 ctgttcagaa ccaatttccg aatggatttc ccctttgacc tgaaggaact accagctttt 1141 gctgccatcg ggatttgctg tgggctcctg ggagctgtat ttgtgtatct gcatcgccaa 1201 gtcatgctcg gtgtccgaaa gcacaaggcc ctcagccagt ttcttgctaa gcaccgcctg 1261 ctgtatcctg gaattgttac ctttgtcatt gcctcattca ccttcccacc aggaatgggt 1321 caattcatgg ctggagagtt gatgccccgc gaagccatca gtactttgtt tgacaacaat 1381 acatgggtga aacacgcggg tgatcctgag agcctgggcc agtcagctgt gtggattcac 1441 ccccgggtca acgttgtcat catcatcttt ctcttcttcg tcatgaagtt ctggatgtcc 1501 atcgtggcca ccactatgcc cataccctgc ggaggcttca tgcctgtgtt tgtgctagga 1561 gctgcatttg gaaggctggt aggagaaatc atggccatgc tctttcctga tggtattttg 1621 tttgatgaca tcatctacaa gatcctacct gggggctatg cagtaattgg agcagcagcg 1681 ctgactggtg ccgtttccca cacagtctcc acagctgtga tttgcttcga attaacgggt 1741 cagattgctc acatcctgcc catgatggtg gctgttatct tggccaacat ggtggcccag 1801 agcctgcagc cctctctcta tgacagcatc atccaggtca agaagctacc ctacttgcct 1861 gaccttggct ggaaccagct cagcaaatat accatctttg ttgaggacat catggtacgt 1921 gatgtgaagt ttgtttcagc ttcttacaca tatggggagt tgcgaaccct gctccagacc 1981 accacagtca agactttacc actggttgac tcaaaagatt caatgatcct gctgggctcg 2041 gtggagcggt cggaactgca ggccctcctg cagcgccacc tgtgtcctga gcgcaggctg 2101 cgcgcagccc aagagatggc gcggaagttg tcggagctgc cttacgacgg gaaggcgcgg 2161 ctggctgggg aggggccccc cggcgcgcct ccaggccggc ccgagtcctt cgcctttgtg 2221 gatgaggatg aggacgaaga tctctctggc aagagcgagc ttcctccttc ccttgctctc 2281 cacccctcta ctactgcccc tctgtcccca gaagagccca atgggcctct gcctggccac 2341 aaacagcagc cggaagcacc agagcctgca ggtcaaagac cctccatctt ccagtccctg 2401 cttcactgct tgctgggcag agctcgcccc acaaagaaga aaacaaccca ggattccaca 2461 gatttagtgg ataacatgtc acctgaagag attgaggcct gggagcagga gcagctgagc 2521 cagcctgtct gttttgattc ctgctgtatt gaccagtctc ccttccagct ggtggagcag 2581 acaaccctgc acaagactca taccctgttt tcactccttg gcctccacct cgcttacgtg 2641 accagcatgg ggaagctcag gggcgtcctg gccctggagg agctacagaa ggccattgag 2701 gggcacacca agtctggggt gcagctccgc cctccccttg ccagcttccg gaacacgact 2761 tcaactcgaa agagtaccgg ggcacctcca tcttctgcag agaactggaa cctgcctgag 2821 gacaggcctg gggccactgg aacaggggat gtgattgctg cctccccaga gacccctgtg 2881 ccatctcctt ccccagagcc ccctctctcc ctggccccag gcaaggtaga gggcgagttg 2941 gaggagctgg agctggtgga gagtccaggg ctggaagagg agctggccga catcttgcag 3001 ggccccagcc tgcgatccac agacgaggag gatgaggatg aactgatcct ttgaccccct 3061 cccacgacct cctcataaag accgtggaga ggc // LOCUS HSCLCHL 5541 bp RNA PRI 16-APR-1996 DEFINITION H.sapiens mRNA for putative chloride channel. ACCESSION X83378 NID g1154676 KEYWORDS chloride channel. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5541) AUTHORS Jentsch,T.J. TITLE Direct Submission JOURNAL Submitted (12-DEC-1994) T.J. Jentsch, Zentrum f. Molekulare Neurobiologie, Inst.f. Molekulare Neuropathobiologie, Martinistr. 52, D- 20246 Hamburg, FRG REMARK Revised by [3] REFERENCE 2 (bases 1 to 5541) AUTHORS Jentsch,T.J. TITLE Direct Submission JOURNAL Submitted (12-JAN-1996) T.J. Jentsch, Zentrum f. Molekulare Neurobiologie, Inst.f. Molekulare Neuropathobiologie, Martinistr. 52, D- 20246 Hamburg, FRG REFERENCE 3 (bases 1 to 5541) AUTHORS Brandt,S. and Jentsch,T.J. TITLE ClC-6 and ClC-7 are two novel broadly expressed members of the CLC chloride channel family JOURNAL FEBS Lett. 377 (1), 15-20 (1995) MEDLINE 96130311 COMMENT Bases 726-5440 from D28475 (partly verified). FEATURES Location/Qualifiers source 1..5541 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="ClC-6" /dev_stage="adult" /tissue_type="brain" /clone_lib="human brain cDNA" /sex="Male" CDS 27..2636 /note="member of CLC chloride channel family" /codon_start=1 /product="putative chloride channel" /db_xref="PID:g1263890" /translation="MAGCRGSLCCCCRWCCCCGERETRTPEELTILGETQEEEDEILP RKDYESLDYDRCINDPYLEVLETMDNKKGRRYEAVKWMVVFAIGVCTGLVGLFVDFFV RLFTQLKFGVVQTSVEECSQKGCLALSLLELLGFNLTFVFLASLLVLIEPVAAGSGIP EVKCYLNGVKVPGIVRLRTLLCKVLGVLFSVAGGLFVGKEGPMIHSGSVVGAGLPQFQ SISLRKIQFNFPYFRSDRDKRDFVSAGAAAGVAAAFGAPIGGTLFSLEEGSSFWNQGL TWKVLFCSMSATFTLNFFRSGIQFGSWGSFQLPGLLNFGEFKCSDSDKKCHLWTAMDL GFFVVMGVIGGLLGATFNCLNKRLAKYRMRNVHPKPKLVRVLESLLVSLVTTVVVFVA SMVLGECRQMSSSSQIGNDSFQLQVTEDVNSSIKTFFCPNDTYNDMATLFFNPQESAI LQLFHQDGTFSPVTLALFFVLYFLLACWTYGISVPSGLFVPSLLCGAAFGRLVANVLK SYIGLGHIYSGTFALIGAAAFLGGVVRMTISLTVILIESTNEITYGLPIMVTLMVAKW TGDFFNKGIYDIHVGLRGVPLLEWETEVEMDKLRASDIMEPNLTYVYPHTRIQSLVSI LRTTVHHAFPVVTENRGNEKEFMKGNQLISNNIKFKKSSILTRAGEQRKRSQSMKSYP SSELRNMCDEHIASEEPAEKEDLLQQMLERRYTPYPNLYPDQSPSEDWTMEERFRPLT FHGLILRSQLVTLLVRGVCYSESQSSASQPRLSYAEMAEDYPRYPDIHDLDLTLLNPR MIVDVTPYMNPSPFTVSPNTHVSQVFNLFRTMGLRHLPVVNAVGEIVGIITRHNLTYE FLQARLRQHYQTI" BASE COUNT 1198 a 1524 c 1406 g 1413 t ORIGIN 1 gtccagagtg gcagtaaagg aggaagatgg cggggtgcag ggggtctctg tgctgctgct 61 gcaggtggtg ctgctgctgc ggtgagcgtg agacccgcac ccccgaggag ctgaccatcc 121 ttggagaaac acaggaggag gaggatgaga ttcttccaag gaaagactat gagagtttgg 181 attatgatcg ctgtatcaat gacccttacc tggaagtttt ggagaccatg gataataaga 241 aaggtcgaag atatgaggcg gtgaagtgga tggtggtgtt tgccattgga gtctgcactg 301 gcctggtggg tctctttgtg gacttttttg tgcgactctt cacccaactc aagttcggag 361 tggtacagac atcggtggag gagtgcagcc agaaaggctg cctcgctctg tctctccttg 421 aactcctggg ttttaacctc acctttgtct tcctggcaag cctccttgtt ctcattgagc 481 cggtggcagc aggttccggg atacccgagg tcaaatgcta tctgaatggc gtaaaggtgc 541 caggaatcgt ccgtctccgg accctgctct gcaaggtcct tggagtgctg ttcagtgtgg 601 ctggagggct cttcgtgggg aaggaaggcc ccatgatcca cagtggttcg gtggtgggag 661 ctggcctccc tcagtttcag agcatctcct tacggaagat ccagtttaac ttcccctatt 721 tccgaagcga cagagacaag agagactttg tatcagcagg agcggctgct ggagttgctg 781 cagctttcgg ggcgccaatc gggggtacct tgttcagtct agaggagggt tcgtccttct 841 ggaaccaagg gctcacgtgg aaagtgctct tttgttccat gtctgccacc ttcaccctca 901 acttcttccg ttctgggatt cagtttggaa gctggggttc cttccagctc cctggattgc 961 tgaactttgg cgagtttaag tgctctgact ctgataaaaa atgtcatctc tggacagcta 1021 tggatttggg tttcttcgtc gtgatggggg tcattggggg cctcctggga gccacattca 1081 actgtctgaa caagaggctt gcaaagtacc gtatgcgaaa cgtgcacccg aaacctaagc 1141 tcgtcagagt cttagagagc ctccttgtgt ctctggtaac caccgtggtg gtgtttgtgg 1201 cctcgatggt gttaggagaa tgccgacaga tgtcctcttc gagtcaaatc ggtaatgact 1261 cattccagct ccaggtcaca gaagatgtga attcaagtat caagacattt ttttgtccca 1321 atgataccta caatgacatg gccacactct tcttcaaccc gcaggagtct gccatcctcc 1381 agctcttcca ccaggatggt actttcagcc ccgtcactct ggccttgttc ttcgttctct 1441 atttcttgct tgcatgttgg acttacggca tttctgttcc aagtggcctt tttgtgcctt 1501 ctctgctgtg tggagctgct tttggacgtt tagttgccaa tgtcctaaaa agctacattg 1561 gattgggcca catctattcg gggacctttg ccctgattgg tgcagcggct ttcttgggcg 1621 gggtggtccg catgaccatc agcctcacgg tcatcctgat cgagtccacc aatgagatca 1681 cctacgggct ccccatcatg gtcacactga tggtggccaa atggacaggg gactttttca 1741 ataagggcat ttatgatatc cacgtgggcc tgcgaggcgt gccgcttctg gaatgggaga 1801 cagaggtgga aatggacaag ctgagagcca gcgacatcat ggagcccaac ctgacctacg 1861 tctacccgca cacccgcatc cagtctctgg tgagcatcct gcgcaccacg gtccaccatg 1921 ccttcccggt ggtcacagag aaccgcggta acgagaagga gttcatgaag ggcaaccagc 1981 tcatcagcaa caacatcaag ttcaagaaat ccagcatcct cacccgggct ggcgagcagc 2041 gcaaacggag ccagtccatg aagtcctacc catccagcga gctacggaac atgtgtgatg 2101 agcacatcgc ctctgaggag ccagccgaga aggaggacct cctgcagcag atgctggaaa 2161 ggagatacac tccctacccc aacctatacc ctgaccagtc cccaagtgaa gactggacca 2221 tggaggagcg gttccgccct ctgaccttcc acggcctgat ccttcggtcg cagcttgtca 2281 ccctgcttgt ccgaggagtt tgttactctg aaagccagtc gagcgccagc cagccgcgcc 2341 tctcctatgc cgagatggcc gaggactacc cgcggtaccc cgacatccac gacctggacc 2401 tgacgctgct caacccgcgc atgatcgtgg atgtcacccc atacatgaac ccttcgcctt 2461 tcaccgtctc gcccaacacc cacgtctccc aagtcttcaa cctgttcaga acgatgggcc 2521 tgcgccacct gcccgtggtg aacgctgtgg gagagatcgt ggggatcatc acacggcaca 2581 acctcaccta tgaatttctg caggcccggc tgaggcagca ctaccagacc atctgacagc 2641 ccagcccacc ctctcctggt gctgcctggg gaggcaaatc atgctcactc cggcgggcac 2701 agctggctgg ggctgttccg gggcatggaa gattcccagt tacccactca ctcagaaagc 2761 cgggagtcat cggacacctt gctggtcaga ggccctgggg gtggttttga accatcagag 2821 cttggacttt tctgacttcc ccagcaagga tcttcccact tcctgctccc tgtgttccca 2881 ccctccagtg ttggcacagg cccacccctg gctccaccag agccagaagc agaggtagaa 2941 tcaggcgggc cccgggctgc actccgagca gtgttcctgg ccatctttgc tactttccta 3001 gagaacccgg ctgttgcctt aaatgtgtga gagggacttg gccaaggcaa aagctgggga 3061 gatgccagtg acaacataca gttcatgact aggtttagga attgggcact gagaaaattc 3121 tcaatatttc agagagtcct tcccttattt gggactccta acacggtatc ctcgctagtt 3181 tgttttaagg gaaacactct gctcctgggt gtgagcagag gctctggtct tgccctgtgg 3241 tttgactctc cttagaacca ccgcccacca gaaacataaa ggattaaaat cacactaata 3301 acccctggat ggtcaatctg ataataggat cagatttacg tctaccctaa ttcttaacat 3361 tgcagctttc tctccatctg cagattattc ccagtctccc agtaacacgt ttctacccag 3421 atcctttttc atttccttaa gttttgatct ccgtcttcct gatgaagcag gcagagctca 3481 gaggatcttg gcatcaccca ccaaagttag ctgaaagcag ggcactcctg gataaagcag 3541 cttcactcaa ctctggggaa tgctaccatt ttttttccaa agtagaaagg aagcacttct 3601 gagccagtga ccactgaaag gtatgtgcta tgataaagca gatggcctat ttgaggaaga 3661 gggtgtctgc ccttcacaaa cacctctctc tcccctgcac tagctgtccc aagcttacat 3721 acagaggccc ttcaggaggg cctcctgtgc cgcagggagg gtgcgtgggg aagatgcttc 3781 ctgccagcac gtgcctgaag gtttcacatg aagcatggga agcgcaccct gtcgttcagt 3841 gacgtcattc ttctccaggc tggcccgccc cctctgacta ggcacccaaa gtgagcatct 3901 gggcattggg cattcatgct tatcttcccc caccttctac atggtatcag tcccagcagg 3961 catccctggg gcagacgtgc tttggctcaa gatggccttc atttacgttt agtttttttt 4021 aaaaccgtgg aggttgccca cgggcctcgg cacctggccc tggcagcaca gctctcaggc 4081 ccagccctgg gcgacctcct tggccaagtc tgcctttcac cctggggtga gcatcagtcc 4141 tggctctgct ggtccagatc ttgcgctcag cacactctag ggaataattc cactccagag 4201 atggggctgc ttcaaggtct tttctagctg attgtggccc ctccattttc cccattttct 4261 tatctccctg accaaaattg ctttgacttc taaatgtttc tgcttcccag aatgcacctg 4321 acttatgaaa tggggataat actcccagga aatagcgcag gacatcacaa ggaccaaaaa 4381 ggcaattctt atttaaatgt tactatttgg ccagctgctg ctgtgtttta tggcagtgtt 4441 cagagcttga tcacgttatt tcttcctttt attaagaagg aagccaattg tccaagtcag 4501 gagaatggtg tgatcacctg tcacagacac tttgtcccct ctccccgccc cttcctggag 4561 ctggcagagc taacgccctg caggaggacc ccggcctctc gagggctgga tcagcagccg 4621 cctgccctga ggctgccccg gtgaatgtta ttggaattca tccctcgtgc acatcctgtt 4681 gtgtttaagt caccagatat tttgttccca tcagtttagc ccagagatag acagtagaat 4741 gcaaatacct ccctccccta aactgactgg acggctgcca aggaggcccc aaacccaggc 4801 cccatgcaaa ggcacgtggt ttccttttct cctctctctg catctgcgct ttccagataa 4861 gcccaaagac agcaacttct ccactcatga caaatcaact gtgaccctcg ctccttccat 4921 ttctgtccat tagaaaccag ccttttcagc atctcaccca ttagcagccc catcacccag 4981 tgatcagtcg cctcagtaaa gcagatctgt ggatggggag cctacgggtg gtaagaagtg 5041 gtgttttgtg tttcatctcc agcttggtgt tccatggccc ctaggcgagg tgatcaggga 5101 gtggggccaa tgggcccccg gccctggctt tgggaccttg tgctgaggga tgatttgctc 5161 ctgaccttga ttaacttaac agttcccagc tggaagggac actttcagga cccagtccac 5221 tgtatggcat ttgtgatgca gaattatgca ctgacatgac cctgggtgac aggaaagcct 5281 ttcgagaggc ccaaggtggc ctcgccagcc ctgcagtatt gatgtgcagt attgcaccac 5341 agctctgcgg accttggcca ttgccgcagt cgcagcttcc ttttttctgt ttgcactgtt 5401 tgtttgtatg atgttagcta attccactgt gtatataaat tgtatttttt ttaatttgta 5461 aaatgctatt tttatttgaa cctttggaac ttgggagttc tcattgtaac cctaacatgt 5521 gagaataaaa tgtcttctgt c // LOCUS HSCLCHPRA 2139 bp RNA PRI 02-AUG-1994 DEFINITION H.sapiens mRNA for chloride channel (putative) 2139bp. ACCESSION Z30643 NID g521071 KEYWORDS chloride channel; chloride channel protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2139) AUTHORS Kieferle,S., Fong,P., Bens,M., Vandewalle,A. and Jentsch,T.J. TITLE Two highly homologous members of the ClC chloride channel family in both rat and human kidney JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91 (15), 6943-6947 (1994) MEDLINE 94316614 REFERENCE 2 (bases 1 to 2139) AUTHORS Kieferle,S. TITLE Direct Submission JOURNAL Submitted (09-MAR-1994) Stefanie Kieferle, Zentrum fuer Molekulare Neurobiologie, Martinistr. 52, Hamburg, 220246, Germany FEATURES Location/Qualifiers source 1..2139 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 23..2086 /codon_start=1 /product="chloride channel (putative)" /db_xref="PID:g521072" /translation="MEELVGLREGFSGDPVTLQELWGPCPHIRRAIQGGLEWLKQKVF RLGEDWYFLMTLGVLMALVSYAMNFAIGCVVRAHQWLYREIGDSHLLRYLSWTVYPVA LVSFSSGFSQSITPSSGGSGIPELKTMLAGVILEDYLDIKNFGAKVVGLSCTLATGST LFLGKVGPFVHLSVMIAAYLGRVRTTTIGEPENKSKQNEMLVAAAAVGVATVFAAPFS GVLFSIEVMSSHFSVRDYWRGFFAATCGAFIFRLLAVFNSEQETITSLYKTSFRVDVP FDLPEIFFFVALGGICGVLSCAYLFCQRTFLSFIKTNRYSSKLLATSKPVYSALATLL LASITYPPGVGHFLASRLSMKQHLDSLFDNHSWALMTQNSSPPWPEELDPQHLWWEWY HPRFTIFGTLAFFLVMKFWMLILATTIPMPAGYFMPIFILGAAIGRLLGEALAVAFPE GIVTGGVTNPIMPGGYALAGAAAFSGAVTHTISTALLAFELTGQIVHALPVLMAVLAA NAIAQSCQPSFYDGTIIVKKLPYLPRILGRNIGSHHVRVEHFMNHSITTLAKDTPLEE VVKVVTSTDVTEYPLVESTESQILVGIVQRAQLVQALQAEPPSRAPGHQQCLQDILAR GCPTEPVTLTLFSETTLHQAQNLFKLLNLQSLFVTSRGRAVGCVSWVEMKKAISNLTN PPAPK" BASE COUNT 386 a 672 c 614 g 467 t ORIGIN 1 ggggaggact gacaggggcc tgatggagga gttggtgggg ctgcgtgagg gcttctcagg 61 ggaccctgtg actctgcagg agctgtgggg cccctgtccc cacatccgcc gagccatcca 121 aggtggcctg gagtggctaa agcagaaggt gttccgcctg ggagaagact ggtacttcct 181 gatgaccctc ggggtgctca tggccctggt cagctatgcc atgaactttg ccatcgggtg 241 tgtggtccga gcacaccagt ggctgtacag ggagattggg gacagccacc tgctccggta 301 tctttcctgg actgtgtacc ctgtggccct cgtctctttc tcctcaggct tctcccagag 361 catcacgccc tcctctggag gttctggaat cccggagctg aagaccatgt tggcgggtgt 421 gatcttggag gactacctgg atatcaagaa ctttggggcc aaggtggtgg gcctctcctg 481 caccctggcc accggcagca ccctgttcct gggcaaagtg ggccctttcg tgcacctgtc 541 tgtaatgatc gctgcctacc tgggccgtgt gcgcaccacg accatcgggg agcctgagaa 601 caagagcaag caaaacgaaa tgctggtggc agcggcggca gtgggcgtgg ccacagtctt 661 tgcagctccc ttcagcggcg tcctgttcag catcgaggtc atgtcttccc acttctctgt 721 ccgggattac tggaggggct tctttgcggc cacctgcggg gccttcatat tccggctcct 781 ggcagtcttc aacagcgagc aggagaccat cacctccctc tacaagacca gtttccgggt 841 ggacgttccc ttcgacctgc ctgagatctt cttttttgtg gcgctgggtg gcatctgcgg 901 cgtcctgagc tgtgcttacc tcttctgtca gcgaaccttc ctcagcttca tcaagaccaa 961 tcggtacagc tccaaactgc tggctactag caagcctgtg tactccgctc tggccacctt 1021 gcttctcgcc tccatcacct acccgcctgg tgtgggccac ttcctagctt ctcggctgtc 1081 catgaagcag catctggact cgctgttcga caaccactcc tgggcgctga tgacccagaa 1141 ctccagccca ccctggcccg aggagctcga cccccagcac ctttggtggg aatggtacca 1201 cccgcggttc accatctttg ggacccttgc cttcttcctg gttatgaagt tctggatgct 1261 gattctggcc accaccatcc ccatgcctgc cgggtacttc atgcccatct ttatccttgg 1321 agctgccatc gggcgcctct tgggagaggc tcttgccgtc gccttccctg agggcattgt 1381 gactggaggg gttaccaatc ccatcatgcc cggggggtat gctctggcag gggctgcagc 1441 cttctcaggg gctgtgaccc acaccatctc cacggcgctg ctggcctttg agctgaccgg 1501 ccagatagtg catgcactgc ccgtgctgat ggcggtgctg gcagccaacg ccattgcaca 1561 gagctgccag ccctccttct atgatggcac catcattgtc aagaagctgc catacctgcc 1621 acggattctg ggccgcaaca tcggctccca ccatgtgagg gtggagcact tcatgaacca 1681 cagcatcacc acactggcca aggacacgcc gctggaggag gtggtcaagg ttgtgacctc 1741 cacagacgtg accgagtatc ccctggtgga gagcacagag tcccagatcc tggtaggcat 1801 cgtgcagagg gcccagctgg tgcaggccct ccaggctgag cctccttcca gggctccagg 1861 acaccagcag tgtctccagg acatcttggc caggggctgc cccacggaac cagtgaccct 1921 gacgctattc tcagagacca ccttgcacca ggcacaaaac ctctttaagc tgttgaacct 1981 tcagtccctc ttcgtgacat cgcggggcag agctgtgggc tgcgtgtcct gggtggagat 2041 gaagaaagca atttccaacc tgacaaatcc gccagctcca aagtgagccg gcccagcaag 2101 atgaaacagg gcaccccagc tgacctggta ctgaggccg // LOCUS HSCLCN3 3953 bp RNA PRI 21-NOV-1995 DEFINITION H. sapiens RNA for CLCN3. ACCESSION X78520 NID g854101 KEYWORDS chloride channel; chloride channel 3; chloride channel protein; CLCL3 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3953) AUTHORS Borsani,G., Rugarli,E.I., Taglialatela,M., Wong,C. and Ballabio,A. TITLE Characterization of a human and murine gene (CLCN3) sharing similarities to voltage-gated chloride channels and to a yeast integral membrane protein JOURNAL Genomics 27 (1), 131-141 (1995) MEDLINE 95394449 REFERENCE 2 (bases 1 to 3953) AUTHORS Borsani,G. TITLE Direct Submission JOURNAL Submitted (29-MAR-1994) G. Borsani, Baylor College of Medicine, Institute for Molecular Genetics, One Baylor Plaza, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..3953 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="retina" /clone_lib="cDNA" /map="4q32" /chromosome="4" gene 489..2951 /gene="CLCN3" CDS 489..2951 /gene="CLCN3" /note="putative" /codon_start=1 /product="chloride channel 3" /db_xref="PID:g854102" /translation="MESEQLFHRGYYRNSYNSITSASSDEELLDGAGVIMDFQTSEDD NLLDGDTAVGTHYTMTNGGSINSSTHLLDLLDEPIPGVGTYDDFHTIDWVREKCKDRE RHRRINSKKKESAWEMTKSLYDAWSGWLVVTLTGLASGALAGLIDIAADWMTDLKEGI CLSALWYNHEQCCWGSNETTFEERDKCPQWKTWAELIIGQAEGPGSYIMNYIMYIFWA LSFAFLAVSLVKVFAPYACGSGIPEIKTILSGFIIRGYLGKWTLMIKTITLVLAVASG LSLGKEGPLVHVACCCGNIFSYLFPKYSTNEAKKREVLSAASAAGVSVAFGAPIGGVL FSLEEVSYYFPLKTLWRSFFAALVAAFVLRSINPFGNSRLVLFYVEYHTPWYLFELFP FILLGVFGGLWGAFFIRANIAWCRRRKSTKFGKYPVLEVIIVAAITAVIAFPNPYTRL NTSELIKELFTDCGPLESSSLCDYRNDMNASKIVDDIPDRPAGIGVYSAIWQLCLALI FKIIMTVFTFGIKVPSGLFIPSMAIGAIAGRIVGIAVEQLAYYHHDWFIFKEWCEVGA DCITPGLYAMVGAAACLGGVTRMTVSLVVIVFELTGGLEYIVPLMAAVMTSKWVGDAF GREGIYEAHIRLNGYPFLDAKEEFEFTHTTLAADVMRPRRNDPPLAVLTQDNMTVDDI ENMINETSYNGFPVIMSKESQRLVGFALRRDLTIAIESARKKQEGIVGSSRVCFAQHT PSLPAESPRPLKLRSILDMSPFTVTDHTPMEIVVDIFRKLGLRQCLVTHNGRLLGIIT KKDILRHMAQTANQDPASIMFN" CDS 663..2951 /gene="CLCN3" /note="putative" /codon_start=1 /product="chloride channel 3" /db_xref="PID:g854103" /translation="MTNGGSINSSTHLLDLLDEPIPGVGTYDDFHTIDWVREKCKDRE RHRRINSKKKESAWEMTKSLYDAWSGWLVVTLTGLASGALAGLIDIAADWMTDLKEGI CLSALWYNHEQCCWGSNETTFEERDKCPQWKTWAELIIGQAEGPGSYIMNYIMYIFWA LSFAFLAVSLVKVFAPYACGSGIPEIKTILSGFIIRGYLGKWTLMIKTITLVLAVASG LSLGKEGPLVHVACCCGNIFSYLFPKYSTNEAKKREVLSAASAAGVSVAFGAPIGGVL FSLEEVSYYFPLKTLWRSFFAALVAAFVLRSINPFGNSRLVLFYVEYHTPWYLFELFP FILLGVFGGLWGAFFIRANIAWCRRRKSTKFGKYPVLEVIIVAAITAVIAFPNPYTRL NTSELIKELFTDCGPLESSSLCDYRNDMNASKIVDDIPDRPAGIGVYSAIWQLCLALI FKIIMTVFTFGIKVPSGLFIPSMAIGAIAGRIVGIAVEQLAYYHHDWFIFKEWCEVGA DCITPGLYAMVGAAACLGGVTRMTVSLVVIVFELTGGLEYIVPLMAAVMTSKWVGDAF GREGIYEAHIRLNGYPFLDAKEEFEFTHTTLAADVMRPRRNDPPLAVLTQDNMTVDDI ENMINETSYNGFPVIMSKESQRLVGFALRRDLTIAIESARKKQEGIVGSSRVCFAQHT PSLPAESPRPLKLRSILDMSPFTVTDHTPMEIVVDIFRKLGLRQCLVTHNGRLLGIIT KKDILRHMAQTANQDPASIMFN" BASE COUNT 1111 a 766 c 940 g 1136 t ORIGIN 1 ggggtcacgg gcgaactaga acactgggaa aggggctgca ggttccggac cggaccggcc 61 ctgacccgga ataatgagca aggagggtgt ggtgggttga aagccatcct actttactcc 121 cgagttagag catggattca gttttagtct taagggggaa gtgagattgg agatttttat 181 ttttaatttt gggcagaagc aggttgactc tagggatctc cagagcgaga ggatttaact 241 tcatgttgct cccgtgtttg aaggaggaca ataaaagtcc caccgggcaa aattttcgta 301 acctctgcgg tagaaaacgt caggtatctt ttaaatcgcg atagttttcg ctgtgtcagg 361 ctttcttcgg tggagctccg agggtagcta ggttctaggt ttgaaacaga tgcagaatcc 421 aaaggcagcg caaaaaacag ccaccgattt tgctatgtct ctgagctgcg agataatcag 481 acagctaaat ggagtctgag cagctgttcc atagaggcta ctatagaaac agctacaaca 541 gtataacaag tgcaagtagt gatgaggaac ttttagatgg agcaggtgtt attatggact 601 ttcaaacatc tgaagatgac aatttattag atggtgacac tgcagttgga actcattata 661 caatgacaaa tggaggcagc attaacagtt ctacacattt actggatctt ttggatgaac 721 caattccagg tgttggtaca tatgatgatt tccatactat tgattgggtg cgagaaaaat 781 gtaaagacag agaaaggcat agacggatca acagcaaaaa gaaagaatca gcatgggaaa 841 tgacaaaaag tttgtatgat gcgtggtcag gatggctagt agtaacacta acaggattgg 901 catcaggggc actggccgga ttaatagaca ttgctgccga ttggatgact gacctaaagg 961 agggcatttg ccttagtgcg ttgtggtaca accacgaaca gtgctgttgg ggatctaatg 1021 aaacaacatt tgaagagagg gataaatgtc cacagtggaa aacatgggca gaattaatca 1081 taggtcaagc agagggtcct ggttcttata tcatgaacta cataatgtac atcttctggg 1141 ccttgagttt tgcctttctt gcagtttccc tggtaaaggt atttgctcca tatgcctgtg 1201 gctctggaat tccagagatt aaaactattt taagtggatt catcatcaga ggttacttgg 1261 gaaaatggac tttaatgatt aaaaccatca cattagtcct ggctgtggca tcaggtttga 1321 gtttaggaaa agaaggtccc ctggtacatg ttgcctgttg ctgcggaaat atcttttcct 1381 acctctttcc aaagtatagc acaaacgaag ctaaaaaaag ggaggtgcta tcagctgcct 1441 cagctgcagg ggtttctgta gcttttggtg caccaattgg aggagttctt tttagcctgg 1501 aagaggttag ctattatttt cctctcaaaa ctttatggag atcatttttt gctgctttag 1561 tggctgcatt tgttttgagg tccatcaatc catttggtaa cagccgtctg gtcctttttt 1621 atgtggagta tcatacacca tggtaccttt ttgaactgtt tccttttatt cttctagggg 1681 tatttggagg gctttgggga gcctttttca ttagggcaaa tattgcctgg tgtcgtcgac 1741 gcaagtccac gaaatttgga aagtatcccg ttctggaagt cattattgtt gcagccatta 1801 ctgctgtgat agccttccct aatccataca ctaggctaaa caccagtgaa ctgatcaaag 1861 agctttttac agactgtggt cccctggaat cctcttctct ttgtgactac agaaatgaca 1921 tgaatgccag taaaattgtc gatgacattc ctgatcgtcc agcaggcatt ggagtatatt 1981 cagctatatg gcagttatgc ctggcactca tatttaaaat cataatgaca gtattcactt 2041 ttggcatcaa ggttccatca ggcttgttca tccccagcat ggccattgga gcgatcgcag 2101 gaaggattgt ggggattgcg gtggagcagc ttgcctacta tcaccacgac tggtttatct 2161 ttaaggagtg gtgtgaggtc ggggctgatt gcattacacc tggcctttat gccatggttg 2221 gtgctgctgc atgcttaggt ggtgtgacaa gaatgactgt ctccctggtg gttattgttt 2281 ttgagcttac tggaggcttg gaatatattg ttccccttat ggctgcagtc atgaccagta 2341 aatgggttgg agatgccttt ggcagggaag gcatttatga agcacacatc cgattaaatg 2401 gatacccttt cttggatgca aaagaagaat tcgaattcac tcataccacc ctggctgctg 2461 acgttatgag acctcgaagg aatgatcctc ccttagctgt cctgacacag gacaatatga 2521 cagtggatga tatagaaaac atgattaatg aaaccagcta caatggattt cctgtcataa 2581 tgtcaaaaga atctcagaga ttagtgggat ttgccctcag aagagacctg acaattgcaa 2641 tagaaagtgc caggaaaaaa caagaaggta tcgttggcag ttctcgggtg tgttttgcac 2701 agcacacccc atctcttcca gcagaaagtc ctcggccatt gaagcttcga agcattcttg 2761 acatgagccc ttttacagtg acagaccaca ccccaatgga gattgtggtg gatattttcc 2821 gaaagctggg actgaggcag tgccttgtaa ctcacaatgg gcgcctcctt ggcattataa 2881 caaaaaaaga tatcctccgg catatggccc agacggcaaa ccaagacccc gcttcaataa 2941 tgttcaactg aatctcacag atgaggagag agaagaaacg gaagaggaag tttatttgtt 3001 gaatagcaca actctttaac ctgagggagt catctacttt tttttcctcc tttacaaaaa 3061 aagaaaggaa atataaaagc cgggtttttg caacatggtt tgcaaataat gctggtggaa 3121 tggaggagtt gtttggggag ggaaaggaga gagaaggaaa ggagtgaggt atttcccgtc 3181 taacagaaag cagcgtatca actcctattg ttctgcactg gatgcattca gctgaggatg 3241 tgcctgatag tgcaggcttg cgcctcaaca gagatgacag cagagtcctc gagcacctgg 3301 cctgtttgct cacatgcaag acacatacag ccctattcta gaggatactt gaatggacct 3361 ctataaacgc aaggttcttg ccttttttta atcaaaactg ttctgtttaa ttcatgaatt 3421 gtatagttaa gcattacctt tctacattcc agaagagcct ttatttctct ctctctctct 3481 ctctctctct ctctctctac tgagctgtaa caaagcctct ttaaatcggt gtatcctttt 3541 gaagcagtcc tttctcatat tgagatgtac tgtgatttta ctgaggtttc atcacaagaa 3601 gggagtgttt cttgtgccat taaccatgta gtttgtacca tcactaaatg cttggaacag 3661 tacacatgca ccacaacaaa ggctcatcaa acaggtaaag tctcgaagga agcgagaacg 3721 aaatctctca ttgtgtgccg tgtggctcaa aaccgaaaac aatgaagctt ggttttaaag 3781 gataaagttt tcttttttgt tttcctctca gactttatgg ataatgtgac cgggtcttat 3841 gcaaattttc tatttctaaa actactacta tgatatacaa gtgctgttga gcataattaa 3901 ataaaatgct gctgctttga cagtaaagag aaggaagtat tctgaaaaaa aac // LOCUS HSCLCN5GN 3173 bp RNA PRI 07-MAY-1996 DEFINITION H.sapiens voltage-gated chloride ion channel CLCN5. ACCESSION X91906 NID g1067131 KEYWORDS CLCN5 gene; voltage-gated chloride channel. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3173) AUTHORS Fisher,S.E., van Bakel,I., Lloyd,S.E., Pearce,S.H., Thakker,R.V. and Craig,I.W. TITLE Cloning and characterization of CLCN5, the human kidney chloride channel gene implicated in Dent disease (an X-linked hereditary nephrolithiasis) JOURNAL Genomics 29 (3), 598-606 (1995) MEDLINE 96121370 REFERENCE 2 (bases 1 to 3173) AUTHORS Craig,I.W. TITLE Direct Submission JOURNAL Submitted (30-AUG-1995) I.W. Craig, University of Oxford, Genetics Laboratory, Dept.of Biochemistry, South Parks Road, Oxford OX1 3QU, UK REFERENCE 3 (bases 1 to 3173) AUTHORS Lloyd,S.E., Pearce,S.H.S., Fisher,S.E., Steinmeyer,K., Schwappach,B., Scheinman,S.J., Harding,B., Bolino,A., Devoto,M., Goodyer,P., Rigden,S.P.A., Wrong,O., Jentsch,T.J., Craig,I.W. and Thakker,R.V. TITLE A common molecular basis for three inherited kidney stone diseases JOURNAL Nature 379 (6564), 445-449 (1996) MEDLINE 96158876 COMMENT Related sequences X81836 and Z56277. FEATURES Location/Qualifiers source 1..3173 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" /clone_lib="Clontech adult renal tissue library" /clone="RL.3, RL.6, RL.7, RL.8" /chromosome="X" /map="p11.22" gene 292..2532 /gene="CLCN5" CDS 292..2532 /gene="CLCN5" /codon_start=1 /product="voltage-gated chloride ion channel" /db_xref="PID:e220083" /db_xref="PID:g1171562" /translation="MDFLEEPIPGVGTYDDFNTIDWVREKSRDRDRHREITNKSKEST WALIHSVSDAFSGWLLMLLIGLLSGSLAGLIDISAHWMTDLKEGICTGGFWFNHEHCC WNSEHVTFEERDKCPEWNSWSQLIISTDEGAFAYIVNYFMYVLWALLFAFLAVSLVKV FAPYACGSGIPEIKTILSGFIIRGYLGKWTLVIKTITLVLAVSSGLSLGKEGPLVHVA CCCGNILCHCFNKYRKNEAKRREVLSAAAAAGVSVAFGAPIGGVLFSLEEVSYYFPLK TLWRSFFAALVAAFTLRSINPFGNSRLVLFYVEFHTPWHLFELVPFILLGIFGGLWGA LFIRTNIAWCRKRKTTQLGKYPVIEVLVVTAITAILAFPNEYTRMSTSELISELFNDC GLLDSSKLCDYENRFNTSKGGELPDRPAGVGVYSAMWQLALTLILKIVITIFTFGMKI PSGLFIPSMAVGAIAGRLLGVGMEQLAYYHQEWTVFNSWCSQGADCITPGLYAMVGAA ACLGGVTRMTVSLVVIMFELTGGLEYIVPLMAAAMTSKWVADALGREGIYDAHIRLNG YPFLEAKEEFAHKTLAMDVMKPRRNDPLLTVLTQDSMTVEDVETIISETTYSGFPVVV SRESQRLVGFVLRRDLIISIENARKKQDGVVSTSIIYFTEHSPPLPPYTPPTLKLRNI LDLSPFTVTDLTPMEIVVDIFRKLGLRQCLVTHNGRLLGIITKKDVLKHIAQMANQDP DSILFN" BASE COUNT 811 a 682 c 732 g 948 t ORIGIN 1 tgatgtgata tggctgcaag tgcctttgac ccttttgtct cccttccata aactgaaata 61 cctaagctgc tccaacctcc tttttgtctt ttgtttcata aatcctttcc cattgcacat 121 caactcctgt ctctctttgt actgtcactc tcatctgttg ctttccattc acactgcctt 181 tagccactca tcattttgtg cctacaccac agaaacctct gaatgtaatg gatgttccta 241 ccagaggaca agtcgtacaa tggtggagga ataggttctt caaataggat catggacttc 301 ttggaggagc caatccctgg tgtagggacc tatgatgatt tcaatacaat tgattgggtg 361 agagagaagt ctcgagaccg ggataggcac cgagagatta ccaataaaag caaagagtca 421 acatgggcct taattcacag tgtgagtgat gctttttccg gctggttgtt gatgctcctt 481 attgggcttt tatcaggttc gttagctggt ttgatagaca tctctgctca ttggatgaca 541 gacttaaaag aaggtatatg cacaggggga ttctggttta accatgaaca ttgttgctgg 601 aactctgagc atgtcacctt tgaagagaga gacaaatgtc cagagtggaa tagttggtcc 661 cagcttatca tcagcacaga tgagggagcc tttgcctaca tagtcaatta tttcatgtac 721 gtcctctggg ctctcctatt tgccttcctt gccgtatctc ttgtcaaggt gtttgcgcct 781 tatgcctgtg gctctggaat ccctgagata aaaactatct tgagtggttt cattattagg 841 ggctatttgg gtaagtggac tctggttatc aaaaccatca ccttggtgct ggcagtgtcg 901 tctggcttga gcctgggcaa agagggccct ctagtgcacg tggcttgctg ctgtgggaac 961 atcctgtgcc actgcttcaa caaatacagg aagaatgaag ccaagcgcag agaggtcttg 1021 tcggctgcag cagcagctgg tgtatctgta gcctttggag cacctatagg tggagtatta 1081 ttcagccttg aagaggtcag ctactatttt cccctcaaaa cattgtggcg ttcattcttt 1141 gctgccttgg tggcagcatt cactctacgc tccatcaatc catttgggaa cagccgcctg 1201 gtactatttt atgtggagtt tcacacccca tggcatctct ttgagctcgt gccattcatt 1261 ctgctgggca tatttggtgg tctgtgggga gcactgttta tccgcacaaa cattgcctgg 1321 tgtcggaagc gaaagaccac ccagttgggc aagtatcctg ttatagaggt actcgtcgtg 1381 acagccatca ctgccatcct ggctttcccc aatgaataca ctcggatgag cacaagtgag 1441 ctcatttctg agctgtttaa tgactgtggc cttctggact cctccaagct ctgtgattat 1501 gagaaccgtt tcaacacaag caaagggggt gaactgcctg acagaccggc tggcgtggga 1561 gtctacagtg caatgtggca gctggcttta acactcatac tgaaaattgt cattactata 1621 ttcacctttg gcatgaagat cccttctggc ctctttatcc ctagcatggc tgttggtgct 1681 atagcaggtc gacttctagg agtaggaatg gaacagctgg cttattacca ccaggaatgg 1741 accgtcttca atagctggtg tagtcaggga gctgattgca tcacccccgg cctttatgca 1801 atggttgggg ctgcagcctg cttaggtggg gtgactcgga tgactgtttc tcttgttgtc 1861 ataatgtttg aactgactgg tggcttagaa tacatcgtgc ctctgatggc tgcagccatg 1921 acaagcaagt gggtggcaga tgctcttggg cgggagggca tctatgatgc ccacatccgt 1981 ctcaatggat acccctttct tgaagccaaa gaagagtttg ctcataagac cctggcaatg 2041 gatgtgatga aaccccggag aaatgatcct ttgttgactg tccttactca ggacagtatg 2101 actgtggaag atgtagagac cataatcagt gaaaccactt acagtggctt cccagtggtg 2161 gtatcccggg agtcccaaag acttgtgggc tttgtcctcc gaagagatct cattatttca 2221 attgaaaatg ctcgaaagaa acaggatggg gttgttagca cttccatcat ttatttcacg 2281 gagcattctc ctccattgcc accatacact ccacccactc taaagcttcg gaacatcctc 2341 gatctcagcc ccttcactgt gactgacctt acacccatgg agatcgtagt ggatattttc 2401 cgaaagctgg gactgcggca gtgcctggtt acacacaacg ggcgattgct tggaatcatt 2461 accaaaaagg atgtgttaaa gcatatagca cagatggcga accaagatcc tgattccatt 2521 ctcttcaact agaatcatag agttctggat gtaaagcggg aaggacatta cagaccatgg 2581 atatgttgtt taacggtacc caaaacacat tttccatatt tggatggtga agtcacatta 2641 gtgtgttgtc tctttcctac aagttaacca gttgcactac ataatctctg gaaattaatt 2701 ttctctttag gagaaattat agttaggctt ccatgatgtt acattaggaa gatatcatga 2761 aagaataaat aagattgcta tggtttaatt atatttgctt tttaaaagat ttttttaact 2821 taaaaagtag ttagccaata tgcaatcact gaaaactatg caagagaaat tccaaccgtc 2881 ctgacctata acctgtagga aaccgacgaa aaagtcactc ttttgggatc taactgttgt 2941 tactggaaga cgaaggtaaa ctaaggggct ttgcttttca aaccagagaa aggaaagcca 3001 gaaggaaaag agtaatggta ttttctagac tgtgaagatt cagttcaaat gttatccttg 3061 ttcctgttac aatatttagc attattagtt tgttatgtgt gtatgtttat gttaatttta 3121 atttctgatt ataagacaat gctgctttgg ttaatctctt ctaaaggaat tta // LOCUS HSCLCPX 3214 bp RNA PRI 31-MAY-1995 DEFINITION H.sapiens mRNA for chloride channel. ACCESSION X77197 NID g479158 KEYWORDS chloride channel; CLCN4 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3214) AUTHORS van Slegtenhorst,M.A., Bassi,M.T., Borsani,G., Wapenaar,M.C., Ferrero,G.B., de Conciliis,L., Rugarli,E., Grillo,A., Franco,B., Zoghbi,H.Y. and Ballabio,A. TITLE A gene from the Xp22.3 region shares homology with voltage-gated chloride channels JOURNAL Hum. Mol. Genet. 3 (4), 547-552 (1994) MEDLINE 94348498 REFERENCE 2 (bases 1 to 3214) AUTHORS Borsani,G. TITLE Direct Submission JOURNAL Submitted (22-JAN-1994) G. Borsani, Baylor College of Medicine, Institute for Molecular Genetics Rm S911, One Baylor Plaza, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..3214 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="retina" /clone_lib="adult retina cDNA library" /clone="4A" /chromosome="X" /map="Xp22.3" gene 384..2666 /gene="CLCN4" CDS 384..2666 /gene="CLCN4" /codon_start=1 /product="chloride channel" /db_xref="PID:g479159" /translation="MVNAGAMSGSGNLMDFLDEPFPDVGTYEDFHTIDWLREKSRDTD RHRKITSKSKESIWEFIKSLLDAWSGWVVMLLIGLLAGTLAGVIDLAVDWMTDLKEGV CLSAFWYSHEQCCWTSNETTFEDRDKCPLWQKWSELLVNQSEGASAYILNYLMYILWA LLFAFLAVSLVRVFAPYRCGSGIPEIKTILSGFIIRGYLGKWTLLIKTVTLVLVVSSG LSLGKEGPLVHVACCCGNFFSSLFSKYSKNEGKRREVLSAAAAAGVSVAFGAPIGGVL FSLEEVSYYFPLKTLWRSFFAALVAAFTLRSINPFGNSRLVLFYVEYHTPWYMAELFP FILLGVFGGLWGTLFIRCNIAWCRRRKTTRLGKYPVLEVIVVTAITAIIAYPNPYTRQ STSELISELFNDCGALESSQLCDYINDPNMTRPVDDIPDRPAGVGVYTAMWQLALALI FKIVVTIFTFGMKIPSGLFIPSMAVGAIAGRMVGIGVEQLAYHHHDWYYFRNWCRPGA DCVTPGLYAMVGAAACLGGVTRMTVSLVVIMFELTGGLEYIVPLMAAAVTSKWVADAF GKEGIYEAHIHLNGYPFLDVKDEFTHRTLATDVMRPRRGEPPLSVLTQDSMTVEDVET LIKETDYNGFPVVVSRDSERLIGFAQRRELILAINNARQRQEGIVSNSIMYFTEEPPE LPANSPHPLKLRRILNLSPFTVTDHTPMETVVDIFRKLGLRQCLVTRSGRLLGIITKK DVLRHMAQMANQDPESIMFN" polyA_signal 3175..3181 BASE COUNT 745 a 803 c 914 g 752 t ORIGIN 1 ccgagaggca aatgagcttg tcagtttctg cctccattag accctcaacc caaaggagca 61 ggagatttcg gtagcgtttt aactttatct cagaatctga aagcggaagg ccaggcaagc 121 tgcacacatc aagcgaaacg cctgaggggc ggccagcgcg agggtttctg gccatcgacc 181 ctcacctccc gggacttcca gggtcttccc cccaccccgc gcacacctcc ctgcctcgcc 241 ccgagggcgt cacgtggcag cgtggggccc gcctcctggt gatgtcacgg cgctcgcagc 301 cgtcgcgctg aagaaaggat gctcgaggat gctgtccagg tgggcggccg cgggcgcgat 361 gcggcactgc aggtgtaatt agcatggtca atgcgggagc gatgagtggc tctggaaacc 421 tgatggattt cctcgatgag ccgttccctg atgtggggac gtatgaggac ttccacacca 481 tcgactggct aagggaaaag tcacgggaca ccgacagaca caggaagatc accagcaaga 541 gcaaggagtc catatgggag ttcatcaaga gcctgctgga tgcctggtcg ggatgggtgg 601 tgatgctgct catcggcctg ctggcgggca ccttggctgg ggtcatcgat ctcgccgtgg 661 actggatgac ggacctgaag gagggggtct gcctgtctgc cttctggtat agccatgagc 721 agtgttgctg gacttctaac gagaccactt ttgaggacag agacaagtgt cccctgtggc 781 agaaatggtc ggagctgctg gtgaatcagt cagagggtgc cagtgcttac attctgaatt 841 acttaatgta catcctatgg gcgctgctgt ttgcattttt ggctgtctcc ctggtgcgtg 901 tatttgcacc atatcgctgt ggctctggca taccagagat aaagaccatt ttgagcggct 961 ttatcatcag gggctacttg gggaagtgga ccctgctaat caagacagtc acgctggtgc 1021 tggtagtgtc ctccggtctg agccttggga aggaagggcc gctagtgcac gtggcttgtt 1081 gctgtggcaa cttcttcagc agccttttct ccaagtacag caagaatgag ggcaagaggc 1141 gggaggtgct ttcagctgca gcggctgctg gagtctctgt tgcctttggt gcaccaattg 1201 gaggcgtgct tttcagtcta gaagaggtca gttactactt tcccctgaag accttgtgga 1261 ggtcattttt cgcagccctg gtggcggcct ttacgctgag atccatcaat ccctttggga 1321 atagccgtct cgttctcttt tatgtggaat accacacgcc ctggtacatg gctgaactct 1381 tccccttcat cctgcttggg gtcttcgggg gcttgtgggg aaccctcttc atccgctgca 1441 acatcgcctg gtgcaggagg cgcaagacca ccaggctggg gaagtacccg gtgctggagg 1501 tcattgtggt gactgccatc actgccatca ttgcctaccc caatccctac acacgccaga 1561 gcaccagcga gctcatttct gagctgttca atgactgtgg agcccttgag tcttcccagc 1621 tctgtgacta catcaatgac cccaacatga ctcggcctgt ggatgacatt ccagaccggc 1681 cggctggtgt cggtgtttac acggccatgt ggcagctggc cctggcactg atcttcaaaa 1741 tcgtcgttac catatttacc tttggcatga agatcccgtc gggcctcttc atccccagca 1801 tggctgtggg cgcgatagcg ggcaggatgg tgggaattgg cgtggagcag ctggcctacc 1861 atcaccatga ctggtactac ttcaggaact ggtgcagacc cggtgcagac tgtgtcacgc 1921 cagggctgta cgcaatggtg ggagctgcgg cctgcctcgg tggagttacc aggatgacgg 1981 tgtcattggt ggtcatcatg tttgaattaa ccgggggtct ggagtacatc gtgcccctga 2041 tggcggcggc tgtgaccagc aagtgggtag ctgatgcatt tgggaaagaa ggcatctacg 2101 aggcccacat ccacttaaat gggtaccctt tccttgacgt gaaggacgag tttactcacc 2161 gcacactggc caccgacgtc atgcggcccc ggcggggaga gccgccactg tcggtgctca 2221 cccaggacag catgactgtc gaggacgtgg agacgctcat caaggagacc gactacaacg 2281 gcttccccgt ggtggtctcc agagactccg agcgcctcat tggatttgcc cagaggaggg 2341 aactgattct cgcaataaat aacgccagac agaggcagga gggcattgtg agcaattcca 2401 tcatgtactt cacggaggaa ccccccgagc tgccggccaa cagcccacat cccctgaagc 2461 tgcggcgcat cctgaacctc agcccgttta cagtgacaga ccacactccg atggaaacgg 2521 tggtggatat cttccggaaa ctggggcttc ggcagtgcct ggtgacgcgg agcgggagac 2581 ttcttggcat catcacaaaa aaggatgttc tgagacatat ggcccagatg gcaaaccagg 2641 accccgaatc catcatgttt aattagcaac aaggtggcaa ttattttcag aaaaacactg 2701 actgtgtcat ttaaaaagaa ataaatgata tgttattatc ccaatgaaag atcatgcatt 2761 ggggacagca gaaacaaaag cttttttgga aaggcgggga agaaggatga aacctttaaa 2821 aacaaaaaca aaaacatcaa tgagtaggca ttttatagct ttaaccccgt atgagtttca 2881 agctgtgttt cctaatgagt ttgctactgc tgtgggggca tgtgggtggg taaatgatgt 2941 aaatgatgtg atctgtacaa gtatgtggag catgaatgct gactcaagaa acttttactc 3001 cttctgctca aggctgatgt ttgtaactta tgaacacacg tgaagtgttg agtccaaaag 3061 acaaaggggc atcggcatgt cagcgtcctt atttattggt tcttgaagtt ttgctgctat 3121 gttactgaat catactaaag acatttgcgc ttactttgtt gaaaaagaaa aagaaattaa 3181 atttgaacac agtgaaagct gcaaaaaaaa aaaa // LOCUS HSCLPPMR 1044 bp RNA PRI 10-JAN-1996 DEFINITION H.sapiens mRNA for CLPP. ACCESSION Z50853 NID g963047 KEYWORDS CLPP; protease. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1044) AUTHORS Bross,P., Andresen,B.S., Knudsen,I., Kruse,T.A. and Gregersen,N. TITLE Human ClpP protease: cDNA sequence, tissue-specific expression and chromosomal assignment of the gene JOURNAL FEBS Lett. 377 (2), 249-252 (1995) MEDLINE 96128239 REFERENCE 2 (bases 1 to 1044) AUTHORS Bross,P. TITLE Direct Submission JOURNAL Submitted (24-AUG-1995) Bross P., Skejby Sygehus Center for Medical Molecular Biology Brendstrupgaardsvej AARHUS Denmark 8200 FEATURES Location/Qualifiers source 1..1044 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" CDS 20..853 /function="protease" /codon_start=1 /product="CLPP" /db_xref="PID:g963048" /translation="MWPGILVGGARVASCRYPALGPRLAAHFPAQRPPQRTLQNGLAL QRCLHATATRALPLIPIVVEQTGRGERAYDIYSRLLRERIVCVMGPIDDSVASLVIAQ LLFLQSESNKKPIHMYINSPGGVVTAGLAIYDTMQYILNPICTWCVGQAASMGSLLLA AGTPGMRHSLPNSRIMIHQPSGGARGQATDIAIQAEEIMKLKKQLYNIYAKHTKQSLQ VIESAMERDRYMSPMEAQEFGILDKVLVHPPQDGEDEPTLVQKEPVEAAPAAEPVPAS T" polyA_site 1025..1031 BASE COUNT 200 a 339 c 319 g 186 t ORIGIN 1 gaccggggcg tgcggaggga tgtggcccgg aatattggta gggggggccc gggtggcgtc 61 atgcaggtac cccgcgctgg ggcctcgcct cgccgctcac tttccagcgc agcggccgcc 121 gcagcggaca ctccagaacg gcctggccct gcagcggtgc ctgcacgcga cggcgacccg 181 ggctctcccg ctcattccca tcgtggtgga gcagacgggt cgcggcgagc gcgcctatga 241 catctactcg cggctgctgc gggagcgcat cgtgtgcgtc atgggcccga tcgatgacag 301 cgttgccagc cttgttatcg cacagctcct cttcctgcaa tccgagagca acaagaagcc 361 catccacatg tacatcaaca gccctggtgg tgtggtgacc gcgggcctgg ccatctacga 421 cacgatgcag tacatcctca acccgatctg cacctggtgc gtgggccagg ccgccagcat 481 gggctccctg cttctcgccg ccggcacccc aggcatgcgc cactcgctcc ccaactcccg 541 tatcatgatc caccagccct caggaggcgc ccggggccaa gccacagaca ttgccatcca 601 ggcagaggag atcatgaagc tcaagaagca gctctataac atctacgcca agcacaccaa 661 acagagcctg caggtgatcg agtccgccat ggagagggac cgctacatga gccccatgga 721 ggcccaggag tttggcatct tagacaaggt tctggtccac cctccccagg acggtgagga 781 tgagcccacg ctggtgcaga aggagcctgt agaagcagcg ccggcagcag aacctgtccc 841 agctagcacc tgagagctgg gcctcctctc cagaatcatg tggaggggcc agaggcttgc 901 cagaccccca gctgggccct gctcacccct tgttgctggg cttggagggg cctcttgagg 961 aacttttaat ttgcaggggt gcccgctatg gacggggcat tccagctgag acactgtgat 1021 tttaaattaa atctttgtgg tctt // LOCUS HSCMRF35A 1151 bp RNA PRI 02-AUG-1993 DEFINITION H.sapiens CMRF35 mRNA, complete CDS. ACCESSION X66171 NID g396169 KEYWORDS antigen; cell membrane; cell surface glycoprotein; CMRF35 gene; monoclonal antibody. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1151) AUTHORS Jackson,D.G., Hart,D.N., Starling,G. and Bell,J.I. TITLE Molecular cloning of a novel member of the immunoglobulin gene superfamily homologous to the polymeric immunoglobulin receptor JOURNAL Eur. J. Immunol. 22 (5), 1157-1163 (1992) MEDLINE 92249405 FEATURES Location/Qualifiers source 1..1151 /organism="Homo sapiens" /db_xref="taxon:9606" gene 240..914 /gene="CMRF35" CDS 240..914 /gene="CMRF35" /codon_start=1 /product="CMRF-35 antigen" /db_xref="PID:g396170" /translation="MTARAWASWRSSALLLLLVPGYFPLSHPMTVAGPVGGSLSVQCR YEKEHRTLNKFWCRPPQILRCDKIVETKGSAGKRNGRVSIRDSPANLSFTVTLENLTE EDAGTYWCGVDTPWLRDFHDPIVEVEVSVFPAGTTTASSPQSSMGTSGPPTKLPVHTW PSVTRKDSPEPSPHPGSLFSNVRFLLLVLLELPLLLSMLGAVLWVNRPQRSSRSRQNW PKGENQ" BASE COUNT 246 a 367 c 312 g 226 t ORIGIN 1 ctctaaaggc cactagcacc catcccagag ctgtcagcac cggcctcagc ccaggcggct 61 ctctccctga gcttcctgta gccctgaccc tctccagcct cagacctgag acagggctgg 121 acaaggaagc agagagcaga agaaaagcag aagcgaagct cagatctgct gggaggaaga 181 ttacattttg tcccctcctg gggtcttgca cagtggcagg tgacattcgt gttacaggaa 241 tgactgccag ggcctgggcc tcgtggcggt cttcagctct gctcctcctg cttgtcccag 301 gctattttcc tctgagccac cccatgaccg tggcgggccc cgtgggggga tccctgagtg 361 tgcagtgtcg ctatgagaag gaacacagga ccctcaacaa attctggtgc agaccaccac 421 agattctccg atgtgacaag attgtggaga ccaaagggtc agcagggaaa aggaatggcc 481 gagtgtccat cagggacagt cctgcaaacc tcagcttcac agtgaccctg gagaatctca 541 cagaggagga cgcaggcacc tactggtgtg gggtggatac accgtggctc cgagactttc 601 atgatcccat tgtcgaggtt gaggtgtccg tgttcccggc cgggacgacc acagcctcca 661 gcccccagag ctccatgggc acctcaggtc ctcccacgaa gctgcccgtg cacacctggc 721 ccagcgtgac cagaaaggac agccccgaac ccagcccaca ccctggctcc ctgttcagca 781 atgtccgctt cctgctcctg gtcctcttgg agctgcccct gctcctgagc atgctgggtg 841 ccgtcctctg ggtgaacaga cctcagagaa gctctagaag caggcagaat tggcccaagg 901 gtgagaacca gtagcatctg ctgtccatca aggccctgtg ctgcaacaga gcccctctgg 961 ggactggaat gacctcctga ccatcaaggc ctgcaacaga gcccctctgg gggactggaa 1021 tgacctcctg accactccct cccgggctgc tctctccaac atctcctgga atcctttgtg 1081 agcctccttc agccttttcc ctgtgcccga tccaacatgt gacacatgag gactttagag 1141 cacaatggat c // LOCUS HSCMRP 4864 bp RNA PRI 04-SEP-1996 DEFINITION H.sapiens mRNA for canalicular multidrug resistance protein. ACCESSION X96395 NID g1507819 KEYWORDS ABC transporter protein; canalicular multidrug resistance protein; cmrp gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4864) AUTHORS Keppler,D. TITLE Direct Submission JOURNAL Submitted (27-FEB-1996) D. Keppler, DKFZ, Abteilung Tumorbiochemie, Im Neuenheimer Feld 280, D-69120 Heidelberg, FRG REMARK revised by [4] MAT REFERENCE 2 (bases 1 to 4864) AUTHORS Buchler,M., Konig,J., Brom,M., Kartenbeck,J., Spring,H., Horie,T. and Keppler,D. TITLE cDNA cloning of the hepatocyte canalicular isoform of the multidrug resistance protein, cMrp, reveals a novel conjugate export pump deficient in hyperbilirubinemic mutant rats JOURNAL J. Biol. Chem. 271 (25), 15091-15098 (1996) MEDLINE 96279006 REFERENCE 3 (bases 1 to 4864) AUTHORS Keppler,D. TITLE Direct Submission JOURNAL Submitted (21-AUG-1996) D. Keppler, DKFZ, Abteilung Tumorbiochemie, Im Neuenheimer Feld 280, D-69120 Heidelberg, FRG FEATURES Location/Qualifiers source 1..4864 /organism="Homo sapiens" /db_xref="taxon:9606" gene 38..4675 /gene="cmrp" CDS 38..4675 /gene="cmrp" /codon_start=1 /product="canalicular multidrug resistance protein" /db_xref="PID:e261872" /db_xref="PID:g1514568" /translation="MLEKFCNSTFWNSSFLDSPEADLPLCFEQTVLVWIPLGFLWLLA PWQLLHVYKSRTKRSSTTKLYLAKQVFVGFLLILAAIELALVLTEDSGQATVPAVRYT NPSLYLGTWLLVLLIQYSRQWCVQKNSWFLSLFWILSILCGTFQFQTLIRTLLQGDNS NLAYSCLFFISYGFQILILIFSAFSENNESSNNPSSIASFLSSITYSWYDSIILKGYK RPLTLEDVWEVDEEMKTKTLVSKFETHMKRELQKARRALQRRQEKSSQQNSGARLPGL NKNQSQSQDALVLEDVEKKKKKSGTKKDVPKSWLMKALFKTFYMVLLKSFLLKLVNDI FTFVSPQLLKLLISFASDRDTYLWIGYLCAILLFTAALIQSFCLQCYFQLCFKLGVKV RTAIMASVYKKALTLSNLARKEYTVGETVNLMSVDAQKLMDVTNFMHMLWSSVLQIVL SIFFLWRELGPSVLAGVGVMVLVIPINAILSTKSKTIQVKNMKNKDKRLKIMNEILSG IKILKYFAWEPSFRDQVQNLRKKELKNLLAFSQLQCVVIFVFQLTPVLVSVVTFSVYV LVDSNNILDAQKAFTSITLFNILRFPLSMLPMMISSMLQASVSTERLEKYLGGDDLDT SAIRHSCNFDKAMQFSEASFTWEHDSEATVRDVNLDIMAGQLVAVIGPVGSGKSSLIS AMLGEMENVHGHITIKGTTAYVPQQSWIQNGTIKDNILFGTEFNEKRYQQVLEACALL PDLEMLPGGDLAEIGEKGINLSGGQKQRISLARATYQNLDIYLLDDPLSAVDAHVGKH IFNKVLGPNGLLKGKTRLLVTHSMHFLPQVDEIVVLGNGTIVEKGSYSALLAKKGEFA KNLKTFLRHTGPEEEATVHDGSEEEADDYGLISSVEEIPEDAASITMRRENSFRRTLS RSSRSNGRHLKSLRNSLKTRNVNSLKEDEELVKGQKLIKKEFIETGKVKFSIYLEYLQ AIGLFSIFFIILAFVMNSVAFIGSNLWLSAWTSDSKIFNSTDYPASQRDMRVGVYGAL GLAQGIFVFIAHFWSAFGFVHASNILHKQLLNNILRAPMRFFDTTPTGRIVNRFAGDI STVDDTLPQSLRTWITCFLGIISTLVMICMATPVFTIIVIPLGIIYVSVQMFYVSTSR QLRRLDSVTRSPIYSHFSETVSGLPVIRAFEHQQRFLKHNEVRIDTNQKCVFSWITSN RWLAIRLELVGNLTVFFSALMMVIYRDTLSGDTVGFVLSNALNITQTLNWLVRMTSEI ETNIVAVERITEYTKVENEAPWVTDKRPPPDWPSKGKIQFNNYQVRYRPELDLVLRGI TCDIGSMEKIGVVGRTGAGKSSLTNCLFRILEAAGGQIIIDGVDIASIGLHDLREKLT IIPQDPILFSGSLRMNLDPFNNYSDEEIWKALELAHLKSFVASLQLGLSHEGTEAGGN LSIGQRQLLCLGRALLRKSKILVLDEATAAVDLETDNLIQTTIQNEFAHCTVITIAHR LHTIMDSDKVMVLDNGKIIECGSPEELLQIPGPFYFMAKEAGIENVNSTKF" BASE COUNT 1334 a 1141 c 1130 g 1259 t ORIGIN 1 atagaagagt cttcgttcca gacgcagtcc aggaatcatg ctggagaagt tctgcaactc 61 tactttttgg aattcctcat tcctggacag tccggaggca gacctgccac tttgttttga 121 gcaaactgtt ctggtgtgga ttcccttggg cttcctatgg ctcctggccc cctggcagct 181 tctccacgtg tataaatcca ggaccaagag atcctctacc accaaactct atcttgctaa 241 gcaggtattc gttggttttc ttcttattct agcagccata gagctggccc ttgtactcac 301 agaagactct ggacaagcca cagtccctgc tgttcgatat accaatccaa gcctctacct 361 aggcacatgg ctcctggttt tgctgatcca atacagcaga caatggtgtg tacagaaaaa 421 ctcctggttc ctgtccctat tctggattct ctcgatactc tgtggcactt tccaatttca 481 gactctgatc cggacactct tacagggtga caattctaat ctagcctact cctgcctgtt 541 cttcatctcc tacggattcc agatcctgat cctgatcttt tcagcatttt cagaaaataa 601 tgagtcatca aataatccat catccatagc ttcattcctg agtagcatta cctacagctg 661 gtatgacagc atcattctga aaggctacaa gcgtcctctg acactcgagg atgtctggga 721 agttgatgaa gagatgaaaa ccaagacatt agtgagcaag tttgaaacgc acatgaagag 781 agagctgcag aaagccaggc gggcactcca gagacggcag gagaagagct cccagcagaa 841 ctctggagcc aggctgcctg gcttgaacaa gaatcagagt caaagccaag atgcccttgt 901 cctggaagat gttgaaaaga aaaaaaagaa gtctgggacc aaaaaagatg ttccaaaatc 961 ctggttgatg aaggctctgt tcaaaacttt ctacatggtg ctcctgaaat cattcctact 1021 gaagctagtg aatgacatct tcacgtttgt gagtcctcag ctgctgaaat tgctgatctc 1081 ctttgcaagt gaccgtgaca catatttgtg gattggatat ctctgtgcaa tcctcttatt 1141 cactgcggct ctcattcagt ctttctgcct tcagtgttat ttccaactgt gcttcaagct 1201 gggtgtaaaa gtacggacag ctatcatggc ttctgtatat aagaaggcat tgaccctatc 1261 caacttggcc aggaaggagt acaccgttgg agaaacagtg aacctgatgt ctgtggatgc 1321 ccagaagctc atggatgtga ccaacttcat gcacatgctg tggtcaagtg ttctacagat 1381 tgtcttatct atcttcttcc tatggagaga gttgggaccc tcagtcttag caggtgttgg 1441 ggtgatggtg cttgtaatcc caattaatgc gatactgtcc accaagagta agaccattca 1501 ggtcaaaaat atgaagaata aagacaaacg tttaaagatc atgaatgaga ttcttagtgg 1561 aatcaagatc ctgaaatatt ttgcctggga accttcattc agagaccaag tacaaaacct 1621 ccggaagaaa gagctcaaga acctgctggc ctttagtcaa ctacagtgtg tagtaatatt 1681 cgtcttccag ttaactccag tcctggtatc tgtggtcaca ttttctgttt atgtcctggt 1741 ggatagcaac aatattttgg atgcacaaaa ggccttcacc tccattaccc tcttcaatat 1801 cctgcgcttt cccctgagca tgcttcccat gatgatctcc tccatgctcc aggccagtgt 1861 ttccacagag cggctagaga agtacttggg aggggatgac ttggacacat ctgccattcg 1921 acatagctgc aattttgaca aagccatgca gttttctgag gcctccttta cctgggaaca 1981 tgattcggaa gccacagtcc gagatgtgaa cctggacatt atggcaggcc aacttgtggc 2041 tgtgataggc cctgtcggct ctgggaaatc ctccttgata tcagccatgc tgggagaaat 2101 ggaaaatgtc cacgggcaca tcaccatcaa gggcaccact gcctatgtcc cacagcagtc 2161 ctggattcag aatggcacca taaaggacaa catccttttt ggaacagagt ttaatgaaaa 2221 gaggtaccag caagtactgg aggcctgtgc tctcctccca gacttggaaa tgctgcctgg 2281 aggagatttg gctgagattg gagagaaggg tataaatctt agtgggggtc agaagcagcg 2341 gatcagcctg gccagagcta cctaccaaaa tttagacatc tatcttctag atgaccccct 2401 gtctgcagtg gatgctcatg taggaaaaca tatttttaat aaggtcttgg gccccaatgg 2461 cctgttgaaa ggcaagactc gactcttggt tacacatagc atgcactttc ttcctcaagt 2521 ggatgagatt gtagttctgg ggaatggaac aattgtagag aaaggatcct acagtgctct 2581 cctggccaaa aaaggagagt ttgctaagaa tctgaagaca tttctaagac atacaggccc 2641 tgaagaggaa gccacagtcc atgatggcag tgaagaagaa gcagatgact atgggctgat 2701 atccagtgtg gaagagatcc ccgaagatgc agcctccata accatgagaa gagagaacag 2761 ctttcgtcga acacttagcc gcagttctag gtccaatggc aggcatctga agtccctgag 2821 aaactccttg aaaactcgga atgtgaatag cctgaaggaa gacgaagaac tagtgaaagg 2881 acaaaaacta attaagaagg aattcataga aactggaaag gtgaagttct ccatctacct 2941 ggagtaccta caagcaatag gattgttttc gatattcttc atcatccttg cgtttgtgat 3001 gaattctgtg gcttttattg gatccaacct ctggctcagt gcttggacca gtgactctaa 3061 aatcttcaat agcaccgact atccagcatc tcagagggac atgagagttg gagtctacgg 3121 agctctggga ttagcccaag gtatatttgt gttcatagca catttctgga gtgcctttgg 3181 tttcgtccat gcatcaaata tcttgcacaa gcaactgctg aacaatatcc ttcgagcacc 3241 tatgagattt tttgacacaa cacccacagg ccggattgtg aacaggtttg ccggcgatat 3301 ttccacagtg gatgacaccc tgcctcagtc cttgcgcacg tggattacat gcttcctggg 3361 gataatcagc acccttgtca tgatctgcat ggccactcct gtcttcacca tcatcgtcat 3421 tcctcttggc attatttatg tatctgttca gatgttttat gtgtctacct cccgccagct 3481 gaggcgtctg gactctgtca ccaggtcccc aatctactct cacttcagcg agaccgtatc 3541 aggtttgcca gttatccgtg cctttgagca ccagcagcga tttctgaaac acaatgaggt 3601 gaggattgac accaaccaga aatgtgtctt ttcctggatc acctccaaca ggtggcttgc 3661 aattcgcctg gagctggttg ggaacctgac tgtcttcttt tcagccttga tgatggttat 3721 ttatagagat accctaagtg gggacactgt tggctttgtt ctgtccaatg cactcaatat 3781 cacacaaacc ctgaactggc tggtgaggat gacatcagaa atagagacca acattgtggc 3841 tgttgagcga ataactgagt acacaaaagt ggaaaatgag gcaccctggg tgactgataa 3901 gaggcctccg ccagattggc ccagcaaagg caagatccag tttaacaact accaagtgcg 3961 gtaccgacct gagctggatc tggtcctcag agggatcact tgtgacatcg gtagcatgga 4021 gaagattggt gtggtgggca ggacaggagc tggaaagtca tccctcacaa actgcctctt 4081 cagaatctta gaggctgccg gtggtcagat tatcattgat ggagtagata ttgcttccat 4141 tgggctccac gacctccgag agaagctgac catcatcccc caggacccca tcctgttctc 4201 tggaagcctg aggatgaatc tcgacccttt caacaactac tcagatgagg agatttggaa 4261 ggccttggag ctggctcacc tcaagtcttt tgtggccagc ctgcaacttg ggttatccca 4321 cgaaggtaca gaggctggtg gcaacctgag cataggccag aggcagctgc tgtgcctggg 4381 cagggctctg cttcggaaat ccaagatcct ggtcctggat gaggccactg ctgcggtgga 4441 tctagagaca gacaacctca ttcagacgac catccaaaac gagttcgccc actgcacagt 4501 gatcaccatc gcccacaggc tgcacaccat catggacagt gacaaggtaa tggtcctaga 4561 caacgggaag attatagagt gcggcagccc tgaagaactg ctacaaatcc ctggaccctt 4621 ttactttatg gctaaggaag ctggcattga gaatgtgaac agcacaaaat tctagcagaa 4681 ggccccatgg gttagaaaag gactataaga ataatttctt atttaatttt attttttata 4741 aaatacagaa tacatacaaa agtgtgtata aaatgtacgt tttaaaaaag gataagtgaa 4801 cacccatgaa cctactaccc aggttaagaa aataaatgtc accaggtact tgaaaaaaaa 4861 aaaa // LOCUS HSCMYBA1 3302 bp RNA PRI 12-SEP-1993 DEFINITION Human alternatively spliced c-myb mRNA (clone=pMbm-1). ACCESSION X52125 NID g29988 KEYWORDS alternative splicing; c-myb oncogene; DNA binding protein; oncogene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3302) AUTHORS Westin,E.H. TITLE Direct Submission JOURNAL Submitted (22-MAR-1990) Westin E.H., Medical College of Virginia, Box 230, MCV Station, Richmond VA 23298, U S A REFERENCE 2 (bases 1 to 3302) AUTHORS Westin,E.H., Gorse,K.M. and Clarke,M.F. TITLE Alternative splicing of the human c-myb gene JOURNAL Oncogene 5 (8), 1117-1124 (1990) MEDLINE 90363543 COMMENT See , and for overlapping sequences. FEATURES Location/Qualifiers source 1..3302 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lymphoid leukemic" /cell_line="CCRF-CEM" /clone="pMbm-1" CDS 198..2111 /note="MYB protein (AA 1-637)" /codon_start=1 /db_xref="PID:g29989" /translation="MARRPRHSIYSSDEDDEDFEMCDHDYDGLLPKSGKRHLGKTRWT REEDEKLKKLVEQNGTDDWKVIANYLPNRTDVQCQHRWQKVLNPELIKGPWTKEEDQR VIELVQKYGPKRWSVIAKHLKGRIGKQCRERWHNHLNPEVKKTSWTEEEDRIIYQAHK RLGNRWAEIAKLLPGRTDNAIKNHWNSTMRRKVEQEGYLQESSKASQPAVATSFQKNS HLMGFAQAPPTAQLPATGQPTVNNDYSYYHISEAQNVSSHVPYPVALHVNIVNVPQPA AAAIQRHYNDEDPEKEKRIKELELLLMSTENELKGQQTQNHTCSYPGWHSTTIADHTR PHGDSAPVSCLGEHHSTPSLPADPGSLPEESASPARCMIVHQGTILDNVKNLLEFAET LQFIDSFLNTSSNHENSDLEMPSLTSTPLIGHKLTVTTPFHRDQTVKTQKENTVFRTP AIKRSILESSPRTPTPFKHALAAQEIKYGPLKMLPQTPSHLVEDLQDVIKQESDESGI VAEFQENGPPLLKKIKQEVESPTDKSGNFFCSHHWEGDSLNTQLFTQTSPVADAPNIL TSSVLMAPASEDEDNVLKAFTVPKNRSLASPLQPCSSTWEPASCGKMEEQMTSSSQAR KYVNAFSARTLVM" BASE COUNT 986 a 722 c 704 g 890 t ORIGIN 1 aatatcaacc tgtttcctcc tcctccttct cctcctcctc cgtgacctcc tcctcctctt 61 tctcctgaga aacttcgccc cagcggtgcg gagcgccctg cgcagccggg gagggacgca 121 ggcaggcggc gggcagcggg aggcggcagc ccggtcggtc cccgcggctc tcgcggagcc 181 ccgccgcccg ccgcgccatg gcccgaagac cccggcacag catatatagc agtgacgagg 241 atgatgagga ctttgagatg tgtgaccatg actatgatgg gctgcttccc aagtctggaa 301 agcgtcactt ggggaaaaca aggtggaccc gggaagagga tgaaaaactg aagaagctgg 361 tggaacagaa tggaacagat gactggaaag ttattgccaa ttatctcccg aatcgaacag 421 atgtgcagtg ccagcaccga tggcagaaag tactaaaccc tgagctcatc aagggtcctt 481 ggaccaaaga agaagatcag agagtgatag agcttgtaca gaaatacggt ccgaaacgtt 541 ggtctgttat tgccaagcac ttaaagggga gaattggaaa acaatgtagg gagaggtggc 601 ataaccactt gaatccagaa gttaagaaaa cctcctggac agaagaggaa gacagaatta 661 tttaccaggc acacaagaga ctggggaaca gatgggcaga aatcgcaaag ctactgcctg 721 gacgaactga taatgctatc aagaaccact ggaattctac aatgcgtcgg aaggtcgaac 781 aggaaggtta tctgcaggag tcttcaaaag ccagccagcc agcagtggcc acaagcttcc 841 agaagaacag tcatttgatg ggttttgctc aggctccgcc tacagctcaa ctccctgcca 901 ctggccagcc cactgttaac aacgactatt cctattacca catttctgaa gcacaaaatg 961 tctccagtca tgttccatac cctgtagcgt tacatgtaaa tatagtcaat gtccctcagc 1021 cagctgccgc agccattcag agacactata atgatgaaga ccctgagaag gaaaagcgaa 1081 taaaggaatt agaattgctc ctaatgtcaa ccgagaatga gctaaaagga cagcagacac 1141 agaaccacac atgcagctac cccgggtggc acagcaccac cattgccgac cacaccagac 1201 ctcatggaga cagtgcacct gtttcctgtt tgggagaaca ccactccact ccatctctgc 1261 cagcggatcc tggctcccta cctgaagaaa gcgcctcgcc agcaaggtgc atgatcgtcc 1321 accagggcac cattctggat aatgttaaga acctcttaga atttgcagaa acactccaat 1381 ttatagattc tttcttaaac acttccagta accatgaaaa ctcagacttg gaaatgcctt 1441 ctttaacttc cacccccctc attggtcaca aattgactgt tacaacacca tttcatagag 1501 accagactgt gaaaactcaa aaggaaaata ctgtttttag aaccccagct atcaaaaggt 1561 caatcttaga aagctctcca agaactccta caccattcaa acatgcactt gcagctcaag 1621 aaattaaata cggtcccctg aagatgctac ctcagacacc ctctcatcta gtagaagatc 1681 tgcaggatgt gatcaaacag gaatctgatg aatctggaat tgttgctgag tttcaagaaa 1741 atggaccacc cttactgaag aaaatcaaac aagaggtgga atctccaact gataaatcag 1801 gaaacttctt ctgctcacac cactgggaag gggacagtct gaatacccaa ctgttcacgc 1861 agacctcgcc tgtggcagat gcaccgaata ttcttacaag ctccgtttta atggcaccag 1921 catcagaaga tgaagacaat gttctcaaag catttacagt acctaaaaac aggtccctgg 1981 cgagcccctt gcagccttgt agcagtacct gggaacctgc atcctgtgga aagatggagg 2041 agcagatgac atcttccagt caagctcgta aatacgtgaa tgcattctca gcccggacgc 2101 tggtcatgtg agacatttcc agaaaagcat tatggttttc agaacacttc aagttgactt 2161 gggatatatc attcctcaac atgaaacttt tcatgaatgg gagaagaacc tatttttgtt 2221 gtggtacaac agttgagagc agcaccaagt gcatttagtt gaatgaagtc ttcttggatt 2281 tcacccaact aaaaggattt ttaaaaataa ataacagtct tacctaaatt attaggtaat 2341 gaattgtagc cagttgttaa tatcttaatg cagatttttt taaaaaaaac ataaaatgat 2401 ttatctgtat tttaaaggat ccaacagatc agtatttttt cctgtgatgg gttttttgaa 2461 atttgacaca ttaaaaggta ctccagtatt tcacttttct cgatcactaa acatatgcat 2521 atatttttaa aaatcagtaa aagcattact ctaagtgtag acttaatacc atgtgacatt 2581 taatccagat tgtaaatgct catttatggt taatgacatt gaaggtacat ttattgtacc 2641 aaaccatttt atgagttttc tgttagcttg ctttaaaaat tattactgta agaaatagtt 2701 ttataaaaaa ttatattttt attcagtaat ttaattttgt aaatgccaaa tgaaaaacgt 2761 tttttgctgc tatggtctta gcctgtagac atgctgctag tatcagaggg gcagtagagc 2821 ttggacagaa agaaaagaaa cttggtgtta ggtaattgac tatgcactag tatttcagac 2881 tttttaattt tatatatata tacatttttt ttccttctgc aatacatttg aaaacttgtt 2941 tgggagactc tgcatttttt attgtggttt ttttgttatt gttggtttat acaagcatgc 3001 gttgcacttc ttttttggga gatgtgtgtt gttgatgttc tatgttttgt tttgagtgta 3061 gcctgactgt tttataattt gggagttctg catttgatcc gcatcccctg tggtttctaa 3121 gtgtatggtc tcagaactgt tgcatggatc ctgtgtttgc aactggggag acagaaactg 3181 tggttgatag ccagtcactg ccttaagaac atttgatgca agatggccag cactgaactt 3241 ttgagatatg acggtgtact tactgccttg tagcaaaata aagatgtgcc cttattttac 3301 ct // LOCUS HSCNTCTNA 3360 bp RNA PRI 08-NOV-1993 DEFINITION H.sapiens contactin mRNA,. ACCESSION Z21488 NID g414790 KEYWORDS cell adhesion molecule; contactin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3360) AUTHORS Reid,R.A. and Hemperly,J.J. TITLE Identification and Characterization of the Human Cell Adhesion Molecule Contactin JOURNAL Brain Res. 21, 1-8 (1994) REFERENCE 2 (bases 1 to 3360) AUTHORS Reid,R. TITLE Direct Submission JOURNAL Submitted (22-JAN-1993) Reid R., Becton Dickinson and Company, 21 Davis Drive, Research Triangle Park, North Carolina, U.S.A FEATURES Location/Qualifiers source 1..3360 /organism="Homo sapiens" /db_xref="taxon:9606" 5'UTR 10..121 CDS 122..3178 /codon_start=1 /product="contactin" /db_xref="PID:g414791" /translation="MKMWLLVSHLVIISITTCLAEFTWYRRYGHGVSEEDKGFGPIFE EQPINTIYPEESLEGKVSLNCRARASPFPVYKWRMNNGDVDLTSDRYSMVGGNLVINN PDKQKDAGIYYCLASNNYGMVRSTEATLSFGYLDPFPPEERPEVRVKEGKGMVLLCDP PYHFPDDLSYRWLLNEFPVFITMDKRRFVSQTNGNLYIANVEASDKGNYSCFVSSPSI TKSVFSKFIPLIPIPERTTKPYPADIVVQFKDVYALMGQNVTLECFALGNPVPDIRWR KVLEPMPSTAEISTSGAVLKIFNIQLEDEGIYECEAENIRGKDKHQARIYVQAFPEWV EHINDTEVDIGSDLYWPCVATGKPIPTIRWLKNGYAYHKGELRLYDVTFENAGMYQCI AENTYGAIYANAELKILALAPTFEMNPMKKKILAAKGGRVIIECKPKAAPKPKFSWSK GTEWLVNSSRILIWEDGSLEINNITRNDGGIYTCFAENNRGKANSTGTLVITDPTRII LAPINADITVGENATMQCAASFDPALDLTFVWSFNGYVIDFNKENIHYQRNFMLDSNG ELLIRNAQLKHAGRYTCTAQTIVDNSSASADLVVRGPPGPPGGLRIEDIRATSVALTW SRGSDNHSPISKYTIQTKTILSDDWKDAKTDPPIIEGNMEAARAVDLIPWMEYEFRVV ATNTLGRGEPSIPSNRIKTDGAAPNVAPSDVGGGGGRNRELTITWAPLSREYHYGNNF GYIVAFKPFDGEEWKKVTVTNPDTGRYVHKDETMSPSTAFQVKVKAFNNKGDGPYSLL AVINSAQDAPSEAPTEVGVKVLSSSEISVHWEHVLEKIVESYQIRYWAAHDKEEAANR VQVTSQEYSARLENLLPDTQYFIEVGACNSAGCGPPSDMIEAFTKKAPPSQPPRIISS VRSGSRYIITWDHVVALSNESTVTGYKVLYRPDGQHDGKLYSTHKHSIEVPIPRDGEY VVEVRAHSDGGDGVVSQVKISGAPTLSPSLLGLLLPAFGILVYLEF" sig_peptide 122..181 mat_peptide 182..3175 /product="contactin" 3'UTR 3176..3360 polyA_signal 3281..3286 BASE COUNT 1036 a 713 c 758 g 853 t ORIGIN 1 gaattccggc tgtgccgcac cgaggcgagc aggagcaggg aacaggtgtt taaaattatc 61 caactgccat agagctaaat tcttttttgg aaaattgaac cgaacttcta ctgaatacaa 121 gatgaaaatg tggttgctgg tcagtcatct tgtgataata tctattacta cctgtttagc 181 agagtttaca tggtatagaa gatatggtca tggagtttct gaggaagaca aaggatttgg 241 accaattttt gaagagcagc caatcaatac catttatcca gaggaatcac tggaaggaaa 301 agtctcactc aactgtaggg cacgagccag ccctttcccg gtttacaaat ggagaatgaa 361 taatggggac gttgatctca caagtgatcg atacagtatg gtaggaggaa accttgttat 421 caacaaccct gacaaacaga aagatgctgg aatatactac tgtttagcat ctaataacta 481 cgggatggtc agaagcactg aagcaaccct gagctttgga tatcttgatc ctttcccacc 541 tgaggaacgt cctgaggtca gagtaaaaga agggaaagga atggtgcttc tctgtgaccc 601 cccataccat tttccagatg atcttagcta tcgctggctt ctaaatgaat ttcctgtatt 661 tatcacaatg gataaacggc gatttgtgtc tcagacaaat ggcaatctct acattgcaaa 721 tgttgaggct tccgacaaag gcaattattc ctgctttgtt tccagtcctt ctattacaaa 781 gagcgtgttc agcaaattca tcccactcat tccaatacct gaacgaacaa caaaaccata 841 tcctgctgat attgtagttc agttcaagga tgtatatgca ttgatgggcc aaaatgtgac 901 cttagaatgt tttgcacttg gaaatcctgt tccggatatc cgatggcgga aggttctaga 961 accaatgcca agcactgctg agattagcac ctctggggct gttcttaaga tcttcaatat 1021 tcagctagaa gatgaaggca tctatgaatg tgaggctgag aacattagag gaaaggataa 1081 acatcaagca agaatttatg ttcaagcatt ccctgagtgg gtagaacaca tcaatgacac 1141 agaggtggac ataggcagtg atctctactg gccttgtgtg gccacaggaa agcccatccc 1201 tacaatccga tggttgaaaa atggatatgc gtatcataaa ggggaattaa gactgtatga 1261 tgtgactttt gaaaatgccg gaatgtatca gtgcatagct gaaaacacat atggagccat 1321 ttatgcaaat gctgagttga agatcttggc gttggctcca acttttgaaa tgaatcctat 1381 gaagaaaaag atcctggctg ctaaaggtgg aagggtgata attgaatgca aacctaaagc 1441 tgcaccgaaa ccaaagtttt catggagtaa agggacagag tggcttgtca atagcagcag 1501 aatactcatt tgggaagatg gtagcttgga aatcaacaac attacaagga atgatggagg 1561 tatctataca tgctttgcag aaaataacag agggaaagct aatagcactg gaacccttgt 1621 tatcacagat cctacgcgaa ttatattggc cccaattaat gccgatatca cagttggaga 1681 aaacgccacc atgcagtgtg ctgcgtcctt tgatcctgcc ttggatctca catttgtttg 1741 gtccttcaat ggctatgtga tcgattttaa caaagagaat attcactacc agaggaattt 1801 tatgctggat tccaatgggg aattactaat ccgaaatgcg cagctgaaac atgctggaag 1861 atacacatgc actgcccaga caattgtgga caattcttca gcttcagctg accttgtagt 1921 gagaggccct ccaggccctc caggtggtct gagaatagaa gacattagag ccacttctgt 1981 ggcacttact tggagccgtg gttcagacaa tcatagtcct atttctaaat acactatcca 2041 gaccaagact attctttcag atgactggaa agatgcaaag acagatcccc caattattga 2101 aggaaatatg gaggcagcaa gagcagtgga cttaatccca tggatggagt atgaattccg 2161 cgtggtagca accaatacac tgggtagagg agagcccagt ataccatcta acagaattaa 2221 aacagacggt gctgcaccaa atgtggctcc ttcagatgta ggaggtggag gtggaagaaa 2281 cagagagctg accataacat gggcgccttt gtcaagagaa taccactatg gcaacaattt 2341 tggttacata gtggcattta agccatttga tggagaagaa tggaaaaaag tcacagttac 2401 taatcctgat actggccgat atgtccataa agatgaaacc atgagccctt ccactgcatt 2461 tcaagttaaa gtcaaggcct tcaacaacaa aggagatgga ccttacagcc tactagcagt 2521 cattaattca gcacaagacg ctcccagtga agccccaaca gaagtaggtg taaaagtctt 2581 atcatcttct gagatatctg ttcattggga acatgtttta gaaaaaatag tggaaagcta 2641 tcagattcgg tattgggctg cccatgacaa agaagaagct gcaaacagag ttcaagtcac 2701 cagccaagag tactcggcca ggctcgagaa ccttctgcca gacacccagt attttataga 2761 agtcggggcc tgcaatagtg cagggtgtgg acctccaagt gacatgattg aggctttcac 2821 caagaaagca cctcctagcc agcctccaag gatcatcagt tcagtaaggt ctggttcacg 2881 ctatataatc acctgggatc atgtcgttgc actatcaaat gaatctacag tgacgggata 2941 taaggtactc tacagacctg atggccagca tgatggcaag ctgtattcaa ctcacaaaca 3001 ctccatagaa gtcccaatcc ccagagatgg agaatacgtt gtggaggttc gcgcgcacag 3061 tgatggagga gatggagtgg tgtctcaagt caaaatttca ggtgcaccca ccctatcccc 3121 aagtcttctc ggcttactgc tgcctgcctt tggcatcctt gtctacttgg aattctgaat 3181 gtgttgtgac agctgctgtt cccatcccag ctcagaagac acccttcaac cctgggatga 3241 ccacaattcc ttccaatttc tgcggctcca tcctaagcca aataaattat actttaacaa 3301 actattcaac tgatttacaa cacacatgat gactgaggca ttcaggaacc ccttcatcca // LOCUS HSCOA2IT 469 bp DNA PRI 22-JUL-1996 DEFINITION Human pro-alpha2 (I) collagen gene transcription start region. ACCESSION X03892 NID g30007 KEYWORDS collagen alpha; collagen type I. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 469) AUTHORS Dickson,L.A., de Wet,W., Di Liberto,M., Weil,D. and Ramirez,F. TITLE Analysis of the promoter region and the N-propeptide domain of the human pro alpha 2(I) collagen gene JOURNAL Nucleic Acids Res. 13 (10), 3427-3438 (1985) MEDLINE 85242047 FEATURES Location/Qualifiers source 1..469 /organism="Homo sapiens" /db_xref="taxon:9606" promoter 249..253 /note="pot. CAAT-box" promoter 302..307 /note="pot. TATA-box" precursor_RNA 335..>469 /note="primary transcript" CDS 389..409 /note="pot. URF" /codon_start=1 /db_xref="PID:g579785" /translation="MPAPAM" CDS 453..467 /note="pot. URF" /codon_start=1 /db_xref="PID:g579786" /translation="MSKC" BASE COUNT 97 a 143 c 139 g 90 t ORIGIN 1 gtgtcccata gtgtttccaa acttggaaag ggcgggggag ggcgggagga tgcggagggc 61 ggaggtatgc agacaacgag tcagagtttc cccttgaaag cctcaaaagt gtccacgtcc 121 tcaaaaagaa tggaaccaat ttaagaagcc agccccgtgg ccacgtccct tcccccattc 181 gggccctcct ctgcgccccc gcaggctcct cccagctgtg gctgcccggg cccccagccc 241 cagccctccc attggtggag gcccttttgg aggcacccta gggccaggga aacttttgcc 301 gtataaatag ggcagatccg ggatttgtta ttttagcacc acggcagcag gaggtttcgg 361 ctaagttgga ggtactggcc acgactgcat gcccgcgccc gccatgtgat acctccgccg 421 gtgacccagg gctctgcgac acaaggagtc gcatgtctaa gtgctagac // LOCUS HSCOAS 1685 bp RNA PRI 12-SEP-1993 DEFINITION H.sapiens mRNA for HMG-CoA-synthase. ACCESSION X66435 S48133 NID g30008 KEYWORDS Hydroxymethylglutaryl CoA Synthase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1679) AUTHORS Russ,A. TITLE Direct Submission JOURNAL Submitted (26-MAY-1992) A. Russ, Labor fur angewandte Biochemie, Theodor Stern-Kai 7, W-6000 Frankfurt am Main 70, FRG REMARK revised by [2] REFERENCE 2 (bases 1 to 1685) AUTHORS Russ,A. TITLE Direct Submission JOURNAL Submitted (10-AUG-1992) Andreas Russ, Zentrum der biologischen Chemie, J.W.-Goethe-Universitaet Frankfurt, Theodor-Stern-Kai 7, Frankfurt, 6000, Germany REFERENCE 3 (bases 1 to 1685) AUTHORS Russ,A.P., Ruzicka,V., Maerz,W., Appelhans,H. and Gross,W. TITLE Amplification and direct sequencing of a cDNA encoding human cytosolic 3-hydroxy-3-methylglutaryl-coenzyme A synthase JOURNAL Biochim. Biophys. Acta 1132 (3), 329-331 (1992) MEDLINE 93041939 FEATURES Location/Qualifiers source 1..1685 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="Fibroblast" /clone="HMGCoASynthase" CDS 123..1685 /codon_start=1 /product="Hydroxymethylglutaryl CoA Synthase" /db_xref="PID:g30009" /db_xref="SWISS-PROT:Q01581" /translation="MPGSLPLNAEACWPKDVGIVALEIYFPSQYVDQAELEKYDGVDA GKYTIGLGQAKMGFCTDREDINSLCMTVVQNLMERNNLSYDCIGRLEVGTETIIDKSK SVKTNLMQLFEESGNTDIEGIDTTNACYGGTAAVFNAVNWIESSSWDGRYALVVAGDI AVYATGNARPTGGVGAVALLIGPNAPLIFERGLRGTHMQHAYDFYKPDMLSEYPIVDG KLSIQCYLSALDRCYSVYCKKIHAQWQKEANDNDFTLNDFGFMIFHSPYCKLVQKSLA RMLLNDFLNDQNRDKNSIYSGLKAFGDVKLEDTYFDRDVEKAFMKASSELFSQKTKAS LLVSNQNGNMYTSSVYGSLASVLAQYSPQHLAGKRIGVFSYGSGLAATLYSLKVTQDA TPGSALDKITASLCDLKSRLDSRTGVAQDVFAENMKLREDTHHLVNYIPQGSIDSLFE GTWYLVRVDEKHRRTYARRPTPNDDTLDEGVGLVHSNIATEHIPSPAKKVPRLPATAA EPEAAVISNGVW" BASE COUNT 482 a 342 c 390 g 471 t ORIGIN 1 actgtccttt cgtggctcac tccctttcct ctcgtgccgc tcggtcacgc ttgtgcccga 61 aggaggaaac agtgacagac ctggagactg cagttctcta tccttcacac agctctttca 121 ccatgcctgg atcacttcct ttgaatgcag aagcttgctg gccaaaagat gtgggaattg 181 ttgcccttga gatctatttt ccttctcaat atgttgatca agcagagttg gaaaaatatg 241 atggtgtaga tgctggaaag tataccattg gcttgggcca ggccaagatg ggcttctgca 301 cagatagaga agatattaac tctctttgca tgactgtggt tcagaatctt atggagagaa 361 ataacctttc ctatgattgc attgggcggc tggaagttgg aacagagaca atcatcgaca 421 aatcaaagtc tgtgaagact aatttgatgc agctgtttga agagtctggg aatacagata 481 tagaaggaat cgacacaact aatgcatgct atggaggcac agctgctgtc ttcaatgctg 541 ttaactggat tgagtccagc tcttgggatg gacggtatgc cctggtagtt gcaggagata 601 ttgctgtata tgccacagga aatgctagac ctacaggtgg agttggagca gtagctctgc 661 taattgggcc aaatgctcct ttaatttttg aacgagggct tcgtgggaca catatgcaac 721 atgcctatga tttttacaag cctgatatgc tatctgaata tcctatagta gatggaaaac 781 tctccataca gtgctacctc agtgcattag accgctgcta ttctgtctac tgcaaaaaga 841 tccatgccca gtggcagaaa gaggcaaatg ataacgattt taccttgaat gattttggct 901 tcatgatctt tcactcacca tattgtaaac tggttcagaa atctctagct cggatgttgc 961 tgaatgactt ccttaatgac cagaatagag ataaaaatag tatctatagt ggcctgaagg 1021 cctttgggga tgttaagtta gaagacacct actttgatag agatgtggag aaggcattta 1081 tgaaggctag ctctgaactc ttcagtcaga aaacaaaggc atctttactt gtatcaaatc 1141 aaaatggaaa tatgtacaca tcttcagtat atggttccct tgcatctgtt ctagcacagt 1201 actcacctca gcatttagca gggaagagaa ttggagtgtt ttcttatggt tctggtttgg 1261 ctgccactct gtactctctt aaagtcacac aagatgctac accggggtct gctcttgata 1321 aaataacagc aagtttatgt gatcttaaat caaggcttga ttcaagaact ggtgtggcac 1381 aagatgtctt cgctgaaaac atgaagctca gagaggacac ccatcatttg gtcaactata 1441 ttccccaggg ttcaatagat tcactctttg aaggaacgtg gtacttagtt agggtggatg 1501 aaaagcacag aagaacttac gctcggcgtc ccactccaaa tgatgacact ttggatgaag 1561 gagtaggact tgtgcattca aacatagcaa ctgagcatat tccaagccct gccaagaaag 1621 taccaagact ccctgctaca gcagcagaac ctgaagcagc agttattagt aatggggtat 1681 ggtaa // LOCUS HSCOLL1 1970 bp RNA PRI 02-FEB-1993 DEFINITION H.sapiens mRNA for type I interstitial collagenase. ACCESSION X54925 NID g30125 KEYWORDS Collagenase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1970) AUTHORS Templeton,N.S., Brown,P.D., Levy,A.T., Margulies,I.M., Liotta,L.A. and Stetler-Stevenson,W.G. TITLE Cloning and characterization of human tumor cell interstitial collagenase JOURNAL Cancer Res. 50 (17), 5431-5437 (1990) MEDLINE 90352587 FEATURES Location/Qualifiers source 1..1970 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="A2058 melanoma" /clone_lib="cDNA in pCD-X Okayama-Berg vector" /sex="Male" CDS 69..1478 /codon_start=1 /product="type I interstitial collagenase" /db_xref="PID:g30126" /db_xref="SWISS-PROT:P03956" /translation="MHSFPPLLLLLFWGVVSHSFPATLETQEQDVDLVQKYLEKYYNL KNDGRQVEKRRNSGPVVEKLKQMQEFFGLKVTGKPDAETLKVMKQPRCGVPDVAQFVL TEGNPRWEQTHLTYRIENYTPDLPRADVDHAIEKAFQLWSNVTPLTFTKVSEGQADIM ISFVRGDHRDNSPFDGPGGNLAHAFQPGPGIGGDAHFDEDERWTNNFREYNLHRVAAH ELGHSLGLSHSTDIGALMYPSYTFSGDVQLAQDDIDGIQAIYGRSQNPVQPIGPQTPK ACDSKLTFDAITTIRGEVMFFKDRFYMRTNPFYPEVELNFISVFWPQLPNGLEAAYEF ADRDEVRFFKGNKYWAVQGQNVLHGYPKDIYSSFGFPRTVKHIDAALSEENTGKTYFF VANKYWRYDEYKRSMDPGYPKMIAHDFPGIGHKVDAVFMKDGFFYFFHGTRQYKFDPK TKRILTLQKANSWFNCRKN" BASE COUNT 586 a 410 c 443 g 531 t ORIGIN 1 atattggagt agcaagaggc tgggaagcca tcacttacct tgcactgaga aagaagacaa 61 aggccagtat gcacagcttt cctccactgc tgctgctgct gttctggggt gtggtgtctc 121 acagcttccc agcgactcta gaaacacaag agcaagatgt ggacttagtc cagaaatacc 181 tggaaaaata ctacaacctg aagaatgatg ggaggcaagt tgaaaagcgg agaaatagtg 241 gcccagtggt tgaaaaattg aagcaaatgc aggaattctt tgggctgaaa gtgactggga 301 aaccagatgc tgaaaccctg aaggtgatga agcagcccag atgtggagtg cctgatgtgg 361 ctcagtttgt cctcactgag gggaaccctc gctgggagca aacacatctg acctacagga 421 ttgaaaatta cacgccagat ttgccaagag cagatgtgga ccatgccatt gagaaagcct 481 tccaactctg gagtaatgtc acacctctga cattcaccaa ggtctctgag ggtcaagcag 541 acatcatgat atcttttgtc aggggagatc atcgggacaa ctctcctttt gatggacctg 601 gaggaaatct tgctcatgct tttcaaccag gcccaggtat tggaggggat gctcattttg 661 atgaagatga aaggtggacc aacaatttca gagagtacaa cttacatcgt gttgcggctc 721 atgaactcgg ccattctctt ggactctccc attctactga tatcggggct ttgatgtacc 781 ctagctacac cttcagtggt gatgttcagc tagctcagga tgacattgat ggcatccaag 841 ccatatatgg acgttcccaa aatcctgtcc agcccatcgg cccacaaacc ccaaaagcat 901 gtgacagtaa gctaaccttt gatgctataa ctacgattcg gggagaagtg atgttcttta 961 aagacagatt ctacatgcgc acaaatccct tctacccgga agttgagctc aatttcattt 1021 ctgttttctg gccacaactg ccaaatgggc ttgaagctgc ttacgaattt gccgacagag 1081 atgaagtccg gtttttcaaa gggaataagt actgggctgt tcagggacag aatgtgctac 1141 acggataccc caaggacatc tacagctcct ttggcttccc tagaactgtg aagcatatcg 1201 atgctgctct ttctgaggaa aacactggaa aaacctactt ctttgttgct aacaaatact 1261 ggaggtatga tgaatataaa cgatctatgg atccaggtta tcccaaaatg atagcacatg 1321 actttcctgg aattggccac aaagttgatg cagttttcat gaaagatgga tttttctatt 1381 tctttcatgg aacaagacaa tacaaatttg atcctaaaac gaagagaatt ttgactctcc 1441 agaaagctaa tagctggttc aactgcagga aaaattgaac attactaatt tgaatggaaa 1501 acacatggtg tgagtccaaa gaaggtgttt tcctgaagaa ctgtctattt tctcagtcat 1561 ttttaacctc tagagtcact gatacacaga atataatctt atttatacct cagtttgcat 1621 atttttttac tatttagaat gtagcccttt ttgtactgat ataatttagt tccacaaatg 1681 gtgggtacaa aaagtcaagt ttgtggctta tggattcata taggccagag ttgcaaagat 1741 cttttccaga gtatgcaact ctgacgttga tcccagagag cagcttcagt gacaaacata 1801 tcctttcaag acagaaagag acaggagaca tgagtctttg ccggaggaaa agcagctcaa 1861 gaacacatgt gcagtcactg gtgtcaccct ggataggcaa gggataactc ttctaacaca 1921 aaataagtgt tttatgtttg gaataaagtc aaccttgttt ctactgtttt // LOCUS HSCOLLIG 1936 bp RNA PRI 25-NOV-1992 DEFINITION H.sapiens mRNA for colligin (a collagen-binding protein). ACCESSION X61598 NID g30129 KEYWORDS collagen-binding protein; colligin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1936) AUTHORS Clarke,E. TITLE Direct Submission JOURNAL Submitted (23-SEP-1991) E. Clarke, The University of Western Ontario, Health Sciences Centre, Faculty of Medicine, Dept of Biochemistry, London Ontario N6A 5C1, CANADA REFERENCE 2 (bases 1 to 1936) AUTHORS Clarke,E.P. and Sanwal,B.D. TITLE Cloning of a human collagen-binding protein, and its homology with rat gp46, chick hsp47 and mouse J6 proteins JOURNAL Biochim. Biophys. Acta 1129 (2), 246-248 (1992) MEDLINE 92110393 COMMENT Sequence comparisons reveal that human colligin, gp46, HSP47 and J6 represent the same protein in different cell lines and thus all should be refered to as colligin. FEATURES Location/Qualifiers source 1..1936 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="skin" /cell_type="fibroblast" CDS 34..1287 /function="collagen-binding protein" /codon_start=1 /product="colligin" /db_xref="PID:g30130" /db_xref="SWISS-PROT:P29043" /translation="MRSLLLGTLCLLAVALAAEVKKPVEAAAPGTAEKLSSKATTLAE PSTGLAFSLYQAMAKDQAVENILVSPVVVASSLGLVSLGGKATTASQAKAVLSAEQLR DEEVHAGLGELLRSLSNSTARNVTWKLGSRLYGPSSVSFADDFVRSSKQHYNCEHSKI NFPDKRSALQSINEWAAQTTDGKLPEVTKDVERTDGALLVNAMFFKPHWDEKFHHKMV DNRGFMVTRSYTVGVTMMHRTGLYNYYDDEKEKLQLVEMPLAHKLSSLIILMPHHVEP LERLEKLLTKEQLKIWMGKMQKKAVAISLPKGVVEVTHDLQKHLAGLGLTEAIDKNKA DLSRMSGKKDLYLASVFHATAFELDTDGNPFDQDIYGREELRSPKLFYADHPFIFLVR DTQSGSLLFIGRLVRLKGDKMRDEL" sig_peptide 34..84 /note="colligin" mat_peptide 85..1284 /function="collagen-binding protein" /product="colligin" misc_feature 1273..1284 /note="C-terminal RDEL sequence" BASE COUNT 424 a 568 c 562 g 382 t ORIGIN 1 ggtcctctgt ggtgcacagc ccacccccca gccatgcgct ctctccttct gggcacctta 61 tgcctcctgg ctgtggccct ggcagccgag gtgaagaaac ctgtagaggc cgcagcccct 121 ggtactgcgg agaagctgag ttccaaggcg accacactgg cagagcccag cacaggcctg 181 gccttcagcc tgtatcaggc aatggccaag gaccaggcag tggagaacat cctggtgtca 241 cccgtggtgg tggcctcgtc gctgggtctc gtgtcgctgg gcggcaaggc gaccacggcg 301 tcgcaggcca aggcagtgct gagcgccgag cagctgcgcg acgaggaggt gcacgccggc 361 ctgggtgagc tgctgcgctc actcagcaac tcgacggcgc gcaacgtgac ctggaagctg 421 ggcagccgac tgtacggacc cagctcagtg agcttcgctg atgacttcgt gcgcagcagc 481 aagcagcact acaactgcga gcactccaag atcaacttcc cggacaagcg cagcgcgctg 541 cagtccatca acgagtgggc cgcgcagacc accgacggca agctgcccga ggtcaccaag 601 gacgtggagc gcacggacgg cgccctgcta gtcaacgcca tgttcttcaa gccacactgg 661 gatgagaaat tccaccacaa gatggtggac aaccgtggct tcatggtgac tcggtcctat 721 actgtgggtg ttacgatgat gcaccggaca ggcctctaca actactacga cgacgagaag 781 gagaagctgc agctggtgga gatgcccctg gctcacaagc tctccagcct catcatcctc 841 atgccccatc acgtggagcc tctcgagcgc cttgaaaagc tgctaaccaa agagcagctg 901 aagatctgga tggggaagat gcagaagaag gctgttgcca tctccttgcc caagggtgtg 961 gtggaggtga cccatgacct gcagaaacac ctggctgggc tgggcctgac tgaggccatt 1021 gacaagaaca aggccgactt atcacgcatg tctggcaaga aggatctgta cctggccagt 1081 gtgttccacg ccaccgcctt tgagttggac acagatggca acccctttga ccaggacatc 1141 tacgggcgcg aggagctgcg cagccccaag ctgttctacg ccgaccaccc cttcatcttc 1201 ctggtgcggg acacccaaag cggctccctg ctattcattg ggcgcctggt ccggctcaag 1261 ggtgacaaga tgcgagacga gttatagggc ctcagggtgc acacaggatg gcaggaggca 1321 tccaaaggct cctgagacac atgggtgcta ttggggttgg gggggaggtg aggtaccagc 1381 cttggatact ccatggaatt cgagctccac ttggacatgg gccccagata ccatgatgct 1441 gagcccggaa actccacatc ctgtgggacc tgggccatag tcattctgcc tgccctgaaa 1501 gtcccagatc aagcctgcct caatcagtat tcatatttat agccaggtac cttctcacct 1561 gtgagaccaa attgagctcg gggggtcagc cagccctctt ctgacactaa aacacctcag 1621 ctgcctcccc agctctatcc caacctctcc caactataaa actaggtgct gcagcctggg 1681 accaggcacc cccagaatga cctggccgca gtgaggcgat tgagaaggag ctcccaggag 1741 gggcttctgg gaagaccctg gtcaagaagc atcgtctggc gttgtgggga tgaacttttt 1801 gttttgtttc ttcctttttt agttcttcaa ggaatggggg gccagggggg caatgagcct 1861 ttgttgctaa tcaaatccgg gacttgtttg tacgtttttt tttctcactg aaaccttttc 1921 cagtgccaaa aaaaaa // LOCUS HSCORONIN 1563 bp RNA PRI 23-DEC-1995 DEFINITION H.sapiens mRNA for coronin. ACCESSION X89109 NID g1136139 KEYWORDS coronin homologue. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1563) AUTHORS Keep,N.H. TITLE Direct Submission JOURNAL Submitted (23-JUN-1995) N.H. Keep, University College London, Medicine, Rayne Institute, 5 University Street, London, WC1E 6JJ, UK REFERENCE 2 (bases 1 to 1563) AUTHORS Grogan,A., Keep,N.H., Reeves,E. and Segal,A.W. JOURNAL Unpublished FEATURES Location/Qualifiers source 1..1563 /organism="Homo sapiens" /strain="ATCC 85531" /db_xref="taxon:9606" /dev_stage="infant" /tissue_type="brain" /lab_host="E.coli" CDS 94..1479 /codon_start=1 /product="coronin homologue" /db_xref="PID:e185568" /db_xref="PID:g1136140" /translation="MSRQVVRSSKFRHVFGQPAKADQCYEDVRVSQTTWDSGFCAVNP KFVALICEASGGGAFLVLPLGKTGRVDKNAPTVCGHTAPVLDIAWCPHNDNVIASGSE DCTVMVWEIPDGGLMLPLREPVVTLEGHTKRVGIVAWHTTAQNVLLSAGCDNVIMVWD VGTGAAMLTLGPEVHPDTIYSVDWSRDGGLICTSCRDKRVRIIEPRKGTVVAEKDRPH EGTRPVRAVFVSEGKILTTGFSRMSERQVALWDTKHLEEPLSLQELDTSSGVLLPFFD PDTNIVYLCGKGDSSIRYFEITSEAPFLHYLSMFSSKESQRGMGYMPKRGLEVNKCEI ARFYKLHERRCEPIAMTVPRKSDLFQEDLYPPTAGPDPALTAEEWLGGRDAGPLLISL KDGYVPPKSRELRVNRGLDTGRRRAAPEASGTPSSDAVSRLEEEMRKLQATVQELQKR LDRLEETVQAK" BASE COUNT 300 a 492 c 494 g 277 t ORIGIN 1 gcgagtcccc ggctcctcca gctccttcct cctcttcctc ctcctcctcc acctccggct 61 tttgggggat cactgtcctc tctcggcagc agaatgagcc ggcaggtggt ccgctccagc 121 aagttccgcc acgtgtttgg acagccggcc aaggccgacc agtgctatga agatgtgcgc 181 gtctcacaga ccacctggga cagtggcttc tgtgctgtca accctaagtt tgtggccctg 241 atctgtgagg ccagcggggg aggggccttc ctggtgctgc ccctgggcaa gactggacgt 301 gtggacaaga atgcgcccac ggtctgtggc cacacagccc ctgtgctaga catcgcctgg 361 tgcccgcaca atgacaacgt cattgccagt ggctccgagg actgcacagt catggtgtgg 421 gagatcccgg atgggggcct gatgctgccc ctgcgggagc ccgtcgtcac cctggagggc 481 cacaccaagc gtgtgggcat tgtggcctgg cacaccacag cccagaacgt gctgctcagt 541 gcaggttgtg acaacgtgat catggtgtgg gacgtgggca ctggggcggc catgctgaca 601 ctgggcccag aggtgcaccc agacacgatc tacagtgtgg actggagccg agatggaggc 661 ctcatttgta cctcctgccg tgacaagcgc gtgcgcatca tcgagccccg caaaggcact 721 gtcgtagctg agaaggaccg tccccacgag gggacccggc ccgtgcgtgc agtgttcgtg 781 tcggagggga agatcctgac cacgggcttc agccgcatga gtgagcggca ggtggcgctg 841 tgggacacaa agcacctgga ggagccgctg tccctgcagg agctggacac cagcagcggt 901 gtcctgctgc ccttctttga ccctgacacc aacatcgtct acctctgtgg caagggtgac 961 agctcaatcc ggtactttga gatcacttcc gaggcccctt tcctgcacta tctctccatg 1021 ttcagttcca aggagtccca gcggggcatg ggctacatgc ccaaacgtgg cctggaggtg 1081 aacaagtgtg agatcgccag gttctacaag ctgcacgagc ggaggtgtga gcccattgcc 1141 atgacagtgc ctcgaaagtc ggacctgttc caggaggacc tgtacccacc caccgcaggg 1201 cccgaccctg ccctcacggc tgaggagtgg ctggggggtc gggatgctgg gcccctcctc 1261 atctccctca aggatggcta cgtaccccca aagagccggg agctgagggt caaccggggc 1321 ctggacaccg ggcgcaggag ggcagcacca gaggccagtg gcactcccag ctcggatgcc 1381 gtgtctcggc tggaggagga gatgcggaag ctccaggcca cggtgcagga gctccagaag 1441 cgcttggaca ggctggagga gacagtccag gccaagtaga gccccgcagg gcctccagca 1501 gggtcagcca ttcacaccca tccactcacc tcccattccc agccacatgg cagagaaaaa 1561 aaa // LOCUS HSCOVIC 422 bp RNA PRI 17-FEB-1997 DEFINITION Human mRNA for cytochrome c oxidase subunit VIc. ACCESSION X13238 NID g1200056 KEYWORDS cytochrome c oxidase; cytochrome c oxidase subunit VIc. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 422) AUTHORS Ohta,S. TITLE Direct Submission JOURNAL Submitted (13-OCT-1988) Ohta S., Department of Biochemistry, Jichi Medical School, Minamikawachi machi, Tochigi ken, 329 04 Japan REMARK Revised by [3] REFERENCE 2 (bases 1 to 422) AUTHORS Otsuka,M., Mizuno,Y., Yoshida,M., Kagawa,Y. and Ohta,S. TITLE Nucleotide sequence of cDNA encoding human cytochrome c oxidase subunit VIc JOURNAL Nucleic Acids Res. 16 (22), 10916 (1988) MEDLINE 89083509 REFERENCE 3 (bases 1 to 422) AUTHORS Ohta,S. TITLE Direct Submission JOURNAL Submitted (20-FEB-1996) Ohta S., Department of Biochemistry, Jichi Medical School, Minamikawachi machi, Tochigi ken, 329 04 Japan FEATURES Location/Qualifiers source 1..422 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="fibroblast" /clone_lib="pcD" CDS 68..295 /codon_start=1 /product="cytochrome c oxidase subunit VIc preprotein" /db_xref="PID:e223120" /db_xref="PID:g1200057" /db_xref="SWISS-PROT:P09669" /translation="MAPEVLPKPRMRGLLARRLRNHMAVAFVLSLGVAALYKFRVADQ RKKAYADFYRNYDVMKDFEEMRKAGIFQSVK" mat_peptide 74..292 /product="cytochrome c oxidase subunit VIc" old_sequence 222 /citation=[1] /replace="c" polyA_signal 408..413 polyA_site 422 BASE COUNT 118 a 71 c 115 g 118 t ORIGIN 1 gggggggggg gggtcaggaa ggacgttggt gttgaggtta gcatacgtat caaggacagt 61 aactaccatg gctcccgaag ttttgccaaa acctcggatg cgtggccttc tggccaggcg 121 tctgcgaaat catatggctg tagcattcgt gctatccctg ggggttgcag ctttgtataa 181 gtttcgtgtg gctgatcaaa gaaagaaggc atacgcagat ttctacagaa actacgatgt 241 catgaaagat tttgaggaga tgaggaaggc tggtatcttt cagagtgtaa agtaatcttg 301 gaatataaag aatttcttca ggttgaatta cctagaagtt tgtcactgac ttgtgttcct 361 gaactatgcc acatgaatat gtgggctaag aatagttcct cttgataaat aaacaattaa 421 ca // LOCUS HSCOX7AL 408 bp RNA PRI 27-MAR-1995 DEFINITION Human COX VIIa-L mRNA for liver-specific cytochrome c oxidase (EC 1.9.3.1.). ACCESSION X15822 NID g30146 KEYWORDS cytochrome c oxidase; cytochrome c oxidase subunit VIIa. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 408) AUTHORS Schon,E.A. TITLE Direct Submission JOURNAL Submitted (14-JUL-1989) Schon E.A., Department of Neurology, Columbia University, 630 West 168th Street, New York NY 10032, U S A REFERENCE 2 (bases 1 to 408) AUTHORS Fabrizi,G.M., Rizzuto,R., Nakase,H., Mita,S., Lomax,M.I., Grossman,L.I. and Schon,E.A. TITLE Sequence of a cDNA specifying subunit VIIa of human cytochrome c oxidase JOURNAL Nucleic Acids Res. 17 (17), 7107 (1989) MEDLINE 89386065 FEATURES Location/Qualifiers source 1..408 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="endothelial" /clone_lib="lambda gt11" /clone="pCOX7.22" CDS 14..265 /note="precursor (AA -23 to 60)" /codon_start=1 /db_xref="PID:g30147" /db_xref="SWISS-PROT:P14406" /translation="MLRNLLALRQIGQRTISTASRRHFKNKVPEKQKLFQEDDEIPLY LKGGVADALLYRATMILTVGGTAYAIYELAVASFPKKQE" transit_peptide 14..82 /note="transit peptide (AA -23 to -1)" mat_peptide 83..262 /note="mature peptide (AA 1-60)" misc_feature 388..393 /note="polyA signal" BASE COUNT 113 a 89 c 95 g 111 t ORIGIN 1 agtaacagcc aagatgctgc ggaatctgct ggctcttcgt cagattgggc agaggacgat 61 aagcactgct tcccgcaggc attttaaaaa taaagttccg gagaagcaaa aactgttcca 121 ggaggatgat gaaattccac tgtatctaaa gggtggggta gctgatgccc tcctgtatag 181 agccaccatg attcttacag ttggtggaac agcatatgcc atatatgagc tggctgtggc 241 ttcatttccc aagaagcagg agtgacttca gtcatcccag caatcgcttg gttcagtttc 301 attcagctct ctatggacca gtaatctgat aaataaccga gctcttcttt ggggatcaat 361 atttattgac ttgtagtaac tgccaccaat aaagcagtct ttaccatg // LOCUS HSCOX7BM 468 bp RNA PRI 15-MAR-1993 DEFINITION H.sapiens coxVIIb mRNA for cytochrome c oxidase subunit VIIb. ACCESSION Z14244 NID g30150 KEYWORDS coxVIIb gene; cytochrome c oxidase; cytochrome c oxidase subunit VIIb. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 468) AUTHORS Sadlock,J.E., Lightowlers,R.N., Capaldi,R.A. and Schon,E.A. TITLE Isolation of a cDNA specifying subunit VIIb of human cytochrome c oxidase JOURNAL Biochim. Biophys. Acta 1172 (1-2), 223-225 (1993) MEDLINE 93176819 REFERENCE 2 (bases 1 to 468) AUTHORS Schon,E.A. TITLE Direct Submission JOURNAL Submitted (12-AUG-1992) Eric A. Schon, Neurology, Columbia University, 630 West 168th, Street, New York, NY, 10032, USA FEATURES Location/Qualifiers source 1..468 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="Adult" /cell_type="Endothelial cell" /clone_lib="Lambda gt10 (from M. Chao & D. Littman, Cornell U.)" /clone="pHCOX7b.7.1" 5'UTR 1..102 misc_feature 1..22 /note="Poly(T) stretch present in cDNA; unknown if present in mRNA or gene" gene 103..345 /gene="cox VIIb" sig_peptide 103..174 /gene="cox VIIb" /note="Presumed mitochondrial importation presequence" CDS 103..345 /gene="cox VIIb" /codon_start=1 /product="cytochrome c oxidase subunit VIIb" /db_xref="PID:g30151" /db_xref="SWISS-PROT:P24311" /translation="MFPLVKSALNRLQVRSIQQTMARQSHQKRTPDFHDKYGNAVLAS GATFCIVTWTYVATQVGIEWNLSPVGRVTPKEWRNQ" mat_peptide 175..342 /gene="cox VIIb" /product="cytochrome c oxidase subunit VIIb" 3'UTR 343..468 terminator 343..345 /gene="cox VIIb" polyA_signal 446..451 BASE COUNT 145 a 97 c 92 g 134 t ORIGIN 1 tttttttttt ttttttgttt ttcagctcac ttcaagggta cctgaagcga attggcacca 61 aagcagcagc tgtattgccg cagttctagc ttcaccttca cgatgtttcc cttggtcaaa 121 agcgcactaa atcgtctcca agttcgaagc attcagcaaa caatggcaag gcagagccac 181 cagaaacgta cacctgattt tcatgacaaa tacggtaatg ctgtattagc tagtggagcc 241 actttctgta ttgttacatg gacatatgta gcaacacaag tcggaataga atggaacctg 301 tcccctgttg gcagagttac cccaaaggaa tggaggaatc agtaatcatc ccagctggtg 361 taataatgaa ttgtttaaaa aacagctcat aattgatgcc aaattaaagc actgtgtacc 421 cattaagata tggcattatt gaagaaataa agtacatttg aaaccttc // LOCUS HSCOXIVR 696 bp RNA PRI 28-JUL-1994 DEFINITION Human mRNA for cytochrome c oxidase subunit IV (EC 1.9.3.1). ACCESSION X54802 NID g517251 KEYWORDS cox gene; cytochrome c oxidase; cytochrome c oxidase subunit IV; oxidase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 696) AUTHORS Ernst,S.G. TITLE Direct Submission JOURNAL Submitted (11-OCT-1990) Ernst S.G., Tufts University, Dept. of Biology, Medford, MA 02155, USA REFERENCE 2 (bases 1 to 696) AUTHORS Park,S.J., Modica-Napolitano,J., Gross,A., Ernst,S.G. and Aprille,J.R. JOURNAL Unpublished COMMENT Data kindly reviewed (11-FEB-1991) by Park S.J. FEATURES Location/Qualifiers source 1..696 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="skeletal muscle" /clone="lambda gt11" mRNA <1..>696 mat_peptide 63..569 /gene="coxIV" /product="cytochrome-c oxidase subunit IV" gene 63..572 /gene="coxIV" CDS 63..572 /gene="coxIV" /codon_start=1 /product="cytochrome-c oxidase subunit IV" /db_xref="PID:g517252" /db_xref="SWISS-PROT:P13073" /translation="MLATRVFSLVGKRAISTSVCVRAHESVVKSEDFSLPAYMDRRDH PLPEVAHVKHLSASQKALKEKEKASWSSLSMDEKVELYRIKFKESFAEMNRGSNEWKT VVGGAMFFIGFTALVIMWQKHYVYGPLPQSFDKEWVAKQTKRMLDMKVNPIQGLASKW DYEKNEWKK" BASE COUNT 163 a 174 c 209 g 150 t ORIGIN 1 gctctcttcc ggtcgcggga caccgggtgt agagggcggt cgcggcgggc agtggcggca 61 gaatgttggc taccagggta tttagcctag ttggcaagcg agcaatttcc acctctgtgt 121 gtgtacgagc tcatgaaagt gttgtgaaga gcgaagactt ttcgctccca gcttatatgg 181 atcggcgtga ccaccccttg ccggaggtgg cccatgtcaa gcacctgtct gccagccaga 241 aggcactgaa ggagaaggag aaggcctcct ggagcagcct ctccatggat gagaaagtcg 301 agttgtatcg cattaagttc aaggagagct ttgctgagat gaacaggggc tcgaacgagt 361 ggaagacggt tgtgggcggt gccatgttct tcatcggttt caccgcgctc gttatcatgt 421 ggcagaagca ctatgtgtac ggccccctcc cgcaaagctt tgacaaagag tgggtggcca 481 agcagaccaa gaggatgctg gacatgaagg tgaaccccat ccagggctta gcctccaagt 541 gggactacga aaagaacgag tggaagaagt gagagatgct gcctgcgcct gcacctgcgc 601 ctggctctgt caccgccatg caactccatg cctatttact ggaaacctgt tatgccaaac 661 agttgtacca ctgctaataa atgaccagtt tacctg // LOCUS HSCOXVII 334 bp RNA PRI 27-MAR-1995 DEFINITION Human COX VIIc gene for subunit VIIc of cytochrome c oxidase (EC 1.9.3.1). ACCESSION X16560 NID g30154 KEYWORDS cytochrome c oxidase; respiratory chain enzyme. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 334) AUTHORS Schon,E.A. TITLE Direct Submission JOURNAL Submitted (21-SEP-1989) Schon E.A., Columbia University, Dept. of Neurology - Room BB324, 630 West 168th Street, New York, NY 10032, USA REFERENCE 2 (bases 1 to 334) AUTHORS Koga,Y., Fabrizi,G.M., Mita,S., Arnaudo,E., Lomax,M.I., Aqua,M.S., Grossman,L.I. and Schon,E.A. TITLE Sequence of a cDNA specifying subunit VIIc of human cytochrome c oxidase JOURNAL Nucleic Acids Res. 18 (3), 684 (1990) MEDLINE 90175022 COMMENT Data kindly reviewed (20-APR-1990) by Schon E.A. FEATURES Location/Qualifiers source 1..334 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="skeletal muscle" /clone_lib="lambda gt10" /clone="pCOX7.183" transit_peptide 19..66 /note="transit peptide (AA -16 to -1)" CDS 19..210 /note="precursor polypeptide (AA -16 to 47)" /codon_start=1 /db_xref="PID:g30155" /db_xref="SWISS-PROT:P15954" /translation="MLGQSIRRFTTSVVRRSHYEEGPGKNLPFSVENKWSLLAKMCLY FGSAFATPFLVVRHQLLKT" mat_peptide 67..207 /note="mature polypeptide cytochrome c oxidase (AA 1-47)" misc_feature 318..323 /note="polyA signal" polyA_site 334 /note="polyA addition site" BASE COUNT 92 a 68 c 80 g 94 t ORIGIN 1 gcagagcttc cagcggctat gttgggccag agcatccgga ggttcacaac ctctgtggtc 61 cgtaggagcc actatgagga gggccctggg aagaatttgc cattttcagt ggaaaacaag 121 tggtcgttac tagctaagat gtgtttgtac tttggatctg catttgctac acccttcctt 181 gtagtaagac accaactgct taaaacataa ggatgtttca gttcctccat ttaacagata 241 tgaagagcat tttaagaggt gcagcctctg gaagtggatc aaactagaac tcatatgcca 301 tactagatat gtttgtcaat aaacttatga cgtg // LOCUS HSCR1 6951 bp RNA PRI 23-MAR-1995 DEFINITION Human mRNA for complement receptor type 1 (CR1, C3b/C4b receptor, CD35). ACCESSION Y00816 NID g30185 KEYWORDS C3b/C4b complement component receptor; C3b/C4b receptor; CD35 antigen; complement receptor; glycoprotein; membrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6951) AUTHORS Klickstein,L.B. TITLE Direct Submission JOURNAL Submitted (20-OCT-1988) Klickstein L.B., Room 617 Hunterian Bldg., 725 N. Wolfe St., Baltimore, MD 21205 REFERENCE 2 (bases 1 to 1531) AUTHORS Klickstein,L.B., Bartow,T.J., Miletic,V., Rabson,L.D., Smith,J.A. and Fearon,D.T. TITLE Identification of distinct C3b and C4b recognition sites in the human C3b/C4b receptor (CR1, CD35) by deletion mutagenesis JOURNAL J. Exp. Med. 168 (5), 1699-1717 (1988) MEDLINE 89035992 COMMENT This is the sequence of the F allotype of CR1. seq pos. 1532-6951 are already published see x05309 Data kindly reviewed (16/5/89) by Klickstein L. FEATURES Location/Qualifiers source 1..6951 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="tonsil and HL-60" /clone_lib="lambda S2T (ATCC #37546)." sig_peptide 28..150 /note="signal peptide (-41 to -1)" CDS 28..6147 /note="CR1 precursor protein" /codon_start=1 /db_xref="PID:g30186" /db_xref="SWISS-PROT:P17927" /translation="MGASSPRSPEPVGPPAPGLPFCCGGSLLAVVVLLALPVAWGQCN APEWLPFARPTNLTDEFEFPIGTYLNYECRPGYSGRPFSIICLKNSVWTGAKDRCRRK SCRNPPDPVNGMVHVIKGIQFGSQIKYSCTKGYRLIGSSSATCIISGDTVIWDNETPI CDRIPCGLPPTITNGDFISTNRENFHYGSVVTYRCNPGSGGRKVFELVGEPSIYCTSN DDQVGIWSGPAPQCIIPNKCTPPNVENGILVSDNRSLFSLNEVVEFRCQPGFVMKGPR RVKCQALNKWEPELPSCSRVCQPPPDVLHAERTQRDKDNFSPGQEVFYSCEPGYDLRG AASMRCTPQGDWSPAAPTCEVKSCDDFMGQLLNGRVLFPVNLQLGAKVDFVCDEGFQL KGSSASYCVLAGMESLWNSSVPVCEQIFCPSPPVIPNGRHTGKPLEVFPFGKAVNYTC DPHPDRGTSFDLIGESTIRCTSDPQGNGVWSSPAPRCGILGHCQAPDHFLFAKLKTQT NASDFPIGTSLKYECRPEYYGRPFSITCLDNLVWSSPKDVCKRKSCKTPPDPVNGMVH VITDIQVGSRINYSCTTGHRLIGHSSAECILSGNAAHWSTKPPICQRIPCGLPPTIAN GDFISTNRENFHYGSVVTYRCNPGSGGRKVFELVGEPSIYCTSNDDQVGIWSGPAPQC IIPNKCTPPNVENGILVSDNRSLFSLNEVVEFRCQPGFVMKGPRRVKCQALNKWEPEL PSCSRVCQPPPDVLHAERTQRDKDNFSPGQEVFYSCEPGYDLRGAASMRCTPQGDWSP AAPTCEVKSCDDFMGQLLNGRVLFPVNLQLGAKVDFVCDEGFQLKGSSASYCVLAGME SLWNSSVPVCEQIFCPSPPVIPNGRHTGKPLEVFPFGKAVNYTCDPHPDRGTSFDLIG ESTIRCTSDPQGNGVWSSPAPRCGILGHCQAPDHFLFAKLKTQTNASDFPIGTSLKYE CRPEYYGRPFSITCLDNLVWSSPKDVCKRKSCKTPPDPVNGMVHVITDIQVGSRINYS CTTGHRLIGHSSAECILSGNTAHWSTKPPICQRIPCGLPPTIANGDFISTNRENFHYG SVVTYRCNLGSRGRKVFELVGEPSIYCTSNDDQVGIWSGPAPQCIIPNKCTPPNVENG ILVSDNRSLFSLNEVVEFRCQPGFVMKGPRRVKCQALNKWEPELPSCSRVCQPPPEIL HGEHTPSHQDNFSPGQEVFYSCEPGYDLRGAASLHCTPQGDWSPEAPRCAVKSCDDFL GQLPHGRVLFPLNLQLGAKVSFVCDEGFRLKGSSVSHCVLVGMRSLWNNSVPVCEHIF CPNPPAILNGRHTGTPSGDIPYGKEISYTCDPHPDRGMTFNLIGESTIRCTSDPHGNG VWSSPAPRCELSVRAGHCKTPEQFPFASPTIPINDFEFPVGTSLNYECRPGYFGKMFS ISCLENLVWSSVEDNCRRKSCGPPPEPFNGMVHINTDTQFGSTVNYSCNEGFRLIGSP STTCLVSGNNVTWDKKAPICEIISCEPPPTISNGDFYSNNRTSFHNGTVVTYQCHTGP DGEQLFELVGERSIYCTSKDDQVGVWSSPPPRCISTNKCTAPEVENAIRVPGNRSFFS LTEIIRFRCQPGFVMVGSHTVQCQTNGRWGPKLPHCSRVCQPPPEILHGEHTLSHQDN FSPGQEVFYSCEPSYDLRGAASLHCTPQGDWSPEAPRCTVKSCDDFLGQLPHGRVLLP LNLQLGAKVSFVCDEGFRLKGRSASHCVLAGMKALWNSSVPVCEQIFCPNPPAILNGR HTGTPFGDIPYGKEISYACDTHPDRGMTFNLIGESSIRCTSDPQGNGVWSSPAPRCEL SVPAACPHPPKIQNGHYIGGHVSLYLPGMTISYTCDPGYLLVGKGFIFCTDQGIWSQL DHYCKEVNCSFPLFMNGISKELEMKKVYHYGDYVTLKCEDGYTLEGSPWSQCQADDRW DPPLAKCTSRAHDALIVGTLSGTIFFILLIIFLSWIILKHRKGNNAHENPKEVAIHLH SQGGSSVHPRTLQTNEENSRVLP" mat_peptide 151..6144 /note="mature CR1 protein (AA 1-1998)" misc_feature 151..1500 /note="long homologous repeat A coding sequence" misc_feature 1501..2850 /note="long homologous repeat B coding sequence" misc_feature 2851..4209 /note="long homologous repeat C coding sequence" misc_feature 4210..5565 /note="long homologous repeat D coding sequence" BASE COUNT 1802 a 1680 c 1661 g 1808 t ORIGIN 1 cgtggtttgt agatgtgctt ggggagaatg ggggcctctt ctccaagaag cccggagcct 61 gtcgggccgc cggcgcccgg tctccccttc tgctgcggag gatccctgct ggcggttgtg 121 gtgctgcttg cgctgccggt ggcctggggt caatgcaatg ccccagaatg gcttccattt 181 gccaggccta ccaacctaac tgatgagttt gagtttccca ttgggacata tctgaactat 241 gaatgccgcc ctggttattc cggaagaccg ttttctatca tctgcctaaa aaactcagtc 301 tggactggtg ctaaggacag gtgcagacgt aaatcatgtc gtaatcctcc agatcctgtg 361 aatggcatgg tgcatgtgat caaaggcatc cagttcggat cccaaattaa atattcttgt 421 actaaaggat accgactcat tggttcctcg tctgccacat gcatcatctc aggtgatact 481 gtcatttggg ataatgaaac acctatttgt gacagaattc cttgtgggct accccccacc 541 atcaccaatg gagatttcat tagcaccaac agagagaatt ttcactatgg atcagtggtg 601 acctaccgct gcaatcctgg aagcggaggg agaaaggtgt ttgagcttgt gggtgagccc 661 tccatatact gcaccagcaa tgacgatcaa gtgggcatct ggagcggccc cgcccctcag 721 tgcattatac ctaacaaatg cacgcctcca aatgtggaaa atggaatatt ggtatctgac 781 aacagaagct tattttcctt aaatgaagtt gtggagttta ggtgtcagcc tggctttgtc 841 atgaaaggac cccgccgtgt gaagtgccag gccctgaaca aatgggagcc ggagctacca 901 agctgctcca gggtatgtca gccacctcca gatgtcctgc atgctgagcg tacccaaagg 961 gacaaggaca acttttcacc tgggcaggaa gtgttctaca gctgtgagcc cggctacgac 1021 ctcagagggg ctgcgtctat gcgctgcaca ccccagggag actggagccc tgcagccccc 1081 acatgtgaag tgaaatcctg tgatgacttc atgggccaac ttcttaatgg ccgtgtgcta 1141 tttccagtaa atctccagct tggagcaaaa gtggattttg tttgtgatga aggatttcaa 1201 ttaaaaggca gctctgctag ttactgtgtc ttggctggaa tggaaagcct ttggaatagc 1261 agtgttccag tgtgtgaaca aatcttttgt ccaagtcctc cagttattcc taatgggaga 1321 cacacaggaa aacctctgga agtctttccc tttggaaaag cagtaaatta cacatgcgac 1381 ccccacccag acagagggac gagcttcgac ctcattggag agagcaccat ccgctgcaca 1441 agtgaccctc aagggaatgg ggtttggagc agccctgccc ctcgctgtgg aattctgggt 1501 cactgtcaag ccccagatca ttttctgttt gccaagttga aaacccaaac caatgcatct 1561 gactttccca ttgggacatc tttaaagtac gaatgccgtc ctgagtacta cgggaggcca 1621 ttctctatca catgtctaga taacctggtc tggtcaagtc ccaaagatgt ctgtaaacgt 1681 aaatcatgta aaactcctcc agatccagtg aatggcatgg tgcatgtgat cacagacatc 1741 caggttggat ccagaatcaa ctattcttgt actacagggc accgactcat tggtcactca 1801 tctgctgaat gtatcctctc gggcaatgct gcccattgga gcacgaagcc gccaatttgt 1861 caacgaattc cttgtgggct accccccacc atcgccaatg gagatttcat tagcaccaac 1921 agagagaatt ttcactatgg atcagtggtg acctaccgct gcaatcctgg aagcggaggg 1981 agaaaggtgt ttgagcttgt gggtgagccc tccatatact gcaccagcaa tgacgatcaa 2041 gtgggcatct ggagcggccc ggcccctcag tgcattatac ctaacaaatg cacgcctcca 2101 aatgtggaaa atggaatatt ggtatctgac aacagaagct tattttcctt aaatgaagtt 2161 gtggagttta ggtgtcagcc tggctttgtc atgaaaggac cccgccgtgt gaagtgccag 2221 gccctgaaca aatgggagcc ggagctacca agctgctcca gggtatgtca gccacctcca 2281 gatgtcctgc atgctgagcg tacccaaagg gacaaggaca acttttcacc cgggcaggaa 2341 gtgttctaca gctgtgagcc cggctatgac ctcagagggg ctgcgtctat gcgctgcaca 2401 ccccagggag actggagccc tgcagccccc acatgtgaag tgaaatcctg tgatgacttc 2461 atgggccaac ttcttaatgg ccgtgtgcta tttccagtaa atctccagct tggagcaaaa 2521 gtggattttg tttgtgatga aggatttcaa ttaaaaggca gctctgctag ttattgtgtc 2581 ttggctggaa tggaaagcct ttggaatagc agtgttccag tgtgtgaaca aatcttttgt 2641 ccaagtcctc cagttattcc taatgggaga cacacaggaa aacctctgga agtctttccc 2701 tttggaaaag cagtaaatta cacatgcgac ccccacccag acagagggac gagcttcgac 2761 ctcattggag agagcaccat ccgctgcaca agtgaccctc aagggaatgg ggtttggagc 2821 agccctgccc ctcgctgtgg aattctgggt cactgtcaag ccccagatca ttttctgttt 2881 gccaagttga aaacccaaac caatgcatct gactttccca ttgggacatc tttaaagtac 2941 gaatgccgtc ctgagtacta cgggaggcca ttctctatca catgtctaga taacctggtc 3001 tggtcaagtc ccaaagatgt ctgtaaacgt aaatcatgta aaactcctcc agatccagtg 3061 aatggcatgg tgcatgtgat cacagacatc caggttggat ccagaatcaa ctattcttgt 3121 actacagggc accgactcat tggtcactca tctgctgaat gtatcctctc aggcaatact 3181 gcccattgga gcacgaagcc gccaatttgt caacgaattc cttgtgggct acccccaacc 3241 atcgccaatg gagatttcat tagcaccaac agagagaatt ttcactatgg atcagtggtg 3301 acctaccgct gcaatcttgg aagcagaggg agaaaggtgt ttgagcttgt gggtgagccc 3361 tccatatact gcaccagcaa tgacgatcaa gtgggcatct ggagcggccc cgcccctcag 3421 tgcattatac ctaacaaatg cacgcctcca aatgtggaaa atggaatatt ggtatctgac 3481 aacagaagct tattttcctt aaatgaagtt gtggagttta ggtgtcagcc tggctttgtc 3541 atgaaaggac cccgccgtgt gaagtgccag gccctgaaca aatgggagcc agagttacca 3601 agctgctcca gggtgtgtca gccgcctcca gaaatcctgc atggtgagca taccccaagc 3661 catcaggaca acttttcacc tgggcaggaa gtgttctaca gctgtgagcc tggctatgac 3721 ctcagagggg ctgcgtctct gcactgcaca ccccagggag actggagccc tgaagccccg 3781 agatgtgcag tgaaatcctg tgatgacttc ttgggtcaac tccctcatgg ccgtgtgcta 3841 tttccactta atctccagct tggggcaaag gtgtcctttg tctgtgatga agggtttcgc 3901 ttaaagggca gttccgttag tcattgtgtc ttggttggaa tgagaagcct ttggaataac 3961 agtgttcctg tgtgtgaaca tatcttttgt ccaaatcctc cagctatcct taatgggaga 4021 cacacaggaa ctccctctgg agatattccc tatggaaaag aaatatctta cacatgtgac 4081 ccccacccag acagagggat gaccttcaac ctcattgggg agagcaccat ccgctgcaca 4141 agtgaccctc atgggaatgg ggtttggagc agccctgccc ctcgctgtga actttctgtt 4201 cgtgctggtc actgtaaaac cccagagcag tttccatttg ccagtcctac gatcccaatt 4261 aatgactttg agtttccagt cgggacatct ttgaattatg aatgccgtcc tgggtatttt 4321 gggaaaatgt tctctatctc ctgcctagaa aacttggtct ggtcaagtgt tgaagacaac 4381 tgtagacgaa aatcatgtgg acctccacca gaacccttca atggaatggt gcatataaac 4441 acagatacac agtttggatc aacagttaat tattcttgta atgaagggtt tcgactcatt 4501 ggttccccat ctactacttg tctcgtctca ggcaataatg tcacatggga taagaaggca 4561 cctatttgtg agatcatatc ttgtgagcca cctccaacca tatccaatgg agacttctac 4621 agcaacaata gaacatcttt tcacaatgga acggtggtaa cttaccagtg ccacactgga 4681 ccagatggag aacagctgtt tgagcttgtg ggagaacggt caatatattg caccagcaaa 4741 gatgatcaag ttggtgtttg gagcagccct ccccctcggt gtatttctac taataaatgc 4801 acagctccag aagttgaaaa tgcaattaga gtaccaggaa acaggagttt cttttccctc 4861 actgagatca tcagatttag atgtcagccc gggtttgtca tggtagggtc ccacactgtg 4921 cagtgccaga ccaatggcag atgggggccc aagctgccac actgctccag ggtgtgtcag 4981 ccgcctccag aaatcctgca tggtgagcat accctaagcc atcaggacaa cttttcacct 5041 gggcaggaag tgttctacag ctgtgagccc agctatgacc tcagaggggc tgcgtctctg 5101 cactgcacgc cccagggaga ctggagccct gaagccccta gatgtacagt gaaatcctgt 5161 gatgacttcc tgggccaact ccctcatggc cgtgtgctac ttccacttaa tctccagctt 5221 ggggcaaagg tgtcctttgt ttgcgatgaa gggttccgat taaaaggcag gtctgctagt 5281 cattgtgtct tggctggaat gaaagccctt tggaatagca gtgttccagt gtgtgaacaa 5341 atcttttgtc caaatcctcc agctatcctt aatgggagac acacaggaac tccctttgga 5401 gatattccct atggaaaaga aatatcttac gcatgcgaca cccacccaga cagagggatg 5461 accttcaacc tcattgggga gagctccatc cgctgcacaa gtgaccctca agggaatggg 5521 gtttggagca gccctgcccc tcgctgtgaa ctttctgttc ctgctgcctg cccacatcca 5581 cccaagatcc aaaacgggca ttacattgga ggacacgtat ctctatatct tcctgggatg 5641 acaatcagct acacttgtga ccccggctac ctgttagtgg gaaagggctt cattttctgt 5701 acagaccagg gaatctggag ccaattggat cattattgca aagaagtaaa ttgtagcttc 5761 ccactgttta tgaatggaat ctcgaaggag ttagaaatga aaaaagtata tcactatgga 5821 gattatgtga ctttgaagtg tgaagatggg tatactctgg aaggcagtcc ctggagccag 5881 tgccaggcgg atgacagatg ggaccctcct ctggccaaat gtacctctcg tgcacatgat 5941 gctctcatag ttggcacttt atctggtacg atcttcttta ttttactcat cattttcctc 6001 tcttggataa ttctaaagca cagaaaaggc aataatgcac atgaaaaccc taaagaagtg 6061 gctatccatt tacattctca aggaggcagc agcgttcatc cccgaactct gcaaacaaat 6121 gaagaaaata gcagggtcct tccttgacaa agtactatac agctgaagaa catctcgaat 6181 acaattttgg tgggaaagga gccaattgat ttcaacagaa tcagatctga gcttcataaa 6241 gtctttgaag tgacttcaca gagacgcaga catgtgcact tgaagatgct gccccttccc 6301 tggtacctag caaagctcct gcctctttgt gtgcgtcact gtgaaacccc cacccttctg 6361 cctcgtgcta aacgcacaca gtatctagtc aggggaaaag actgcattta ggagatagaa 6421 aatagtttgg attacttaaa ggaataaggt gttgcctgga atttctggtt tgtaaggtgg 6481 tcactgttct tttttaaaat atttgtaata tggaatgggc tcagtaagaa gagcttggaa 6541 aatgcagaaa gttatgaaaa ataagtcact tataattatg ctacctactg ataaccactc 6601 ctaatatttt gattcatttt ctgcctatct tctttcacat atgtgttttt ttacatacgt 6661 acttttcccc ccttagtttg tttcctttta ttttatagag cagaacccta gtcttttaaa 6721 cagtttagag tgaaatatat gctatatcag tttttacttt ctctagggag aaaaattaat 6781 ttactagaaa ggcatgaaat gatcatggga agagtggtta agactactga agagaaatat 6841 ttggaaaata agatttcgat atcttctttt tttttgagat ggagtctggc tctgtctccc 6901 aggctggagt gcagtggcgt aatctcggct cactgcaacg tccgcctccc g // LOCUS HSCREBP1 1647 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for cAMP response element (CRE-BP1) binding protein. ACCESSION X15875 NID g30214 KEYWORDS CREBP1 gene; DNA binding protein; leucine zipper; transcription factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1647) AUTHORS Maekawa,T., Sakura,H., Kanei-Ishii,C., Sudo,T., Yoshimura,T., Fujisawa,J., Yoshida,M. and Ishii,S. TITLE Leucine zipper structure of the protein CRE-BP1 binding to the cyclic AMP response element in brain JOURNAL EMBO J. 8 (7), 2023-2028 (1989) MEDLINE 90005408 COMMENT Data kindly reviewed (20-FEB-1990) by Ishii S. FEATURES Location/Qualifiers source 1..1647 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="foetus" /tissue_type="brain" /clone_lib="lambda gt11" CDS 27..1544 /note="cAMP response element binding protein (AA 1-505)" /codon_start=1 /db_xref="PID:g30215" /db_xref="SWISS-PROT:P15336" /translation="MKFKLHVNSARQYKDLWNMSDDKPFLCTAPGCGQRFTNEDHLAV HKHKHEMTLKFGPARNDSVIVADQTPTPTRFLKNCEEVGLFNELASPFENEFKKASED DIKKMPLDLSPLATPIIRSKIEEPSVVETTHQDSPLPHPESTTSDEKEVPLAQTAQPT SAIVRPASLQVPNVLLTSSDSSVIIQQAVPSPTSSTVITQAPSSNRPIVPVPGPFPLL LHLPSGQTMPVAIPASITSSNVHVPAAVPLVRPVTMVPSVPGIPGPSSPQPVQSEAKM RLKAALTQQHPPVTNGDTVKGHGSGLVRTQSEESRPQSLQQPATSTTETPASPAHTTP QTQSTSGRRRRAANEDPDEKRRKFLERNRAAASRCRQKRKVWVQSLEKKAEDLSSLNG QLQSEVTLLRNEVAQLKQLLLAHKDCPVTAMQKKSGYHTADKDDSSEDISVPSSPHTE AIQHSSVSTSNGVSSTSKAEAVATSVLTQMADQSTEPALSQIVMAPSSQSQPSGS" BASE COUNT 504 a 396 c 341 g 406 t ORIGIN 1 gaattctgtg ataagttatt caacttatga aattcaagtt acatgtgaat tctgccaggc 61 aatacaagga cctgtggaat atgagtgatg acaaaccctt tctatgtact gcgcctggat 121 gtggccagcg ttttaccaac gaggatcatt tggctgtcca taaacataaa catgagatga 181 cactgaaatt tggtccagca cgtaatgaca gtgtcattgt ggctgatcag accccaacac 241 caacaagatt cttgaaaaac tgtgaagaag tgggtttgtt taatgagttg gcgagtccat 301 ttgagaatga attcaagaaa gcttcagaag atgacattaa aaaaatgcct ctagatttat 361 cccctcttgc aacacctatc ataagaagca aaattgagga gccttctgtt gtagaaacaa 421 ctcaccagga tagtccttta cctcacccag agtctactac cagtgatgag aaggaagtac 481 cattggcaca aactgcacag cccacatcag ctattgttcg tccagcatca ttacaggttc 541 ccaatgtgct gcttacaagt tctgactcaa gtgtaattat tcagcaggca gtaccttcac 601 caacctcaag tactgtaatc acccaggcac catcctctaa caggccaatt gtccctgtac 661 caggcccatt tcctcttctg ttacatcttc ctagtggaca aaccatgcct gttgctattc 721 ctgcatcaat tacaagttct aatgtgcatg ttccagctgc agtcccactc gttcgaccag 781 tcaccatggt gcctagtgtt ccaggaatcc caggtccttc ctctccccaa ccagtacagt 841 cagaagcaaa aatgagatta aaagctgctt tgacccagca acatcctcca gttaccaatg 901 gtgatactgt caaaggtcat ggtagcggat tggttaggac tcagtcagag gaatctcgac 961 cgcagtcatt acaacagcca gccacatcca ctacagaaac tccggcttct ccagctcaca 1021 caactccaca gacccaaagt acaagtggtc gtcggagaag agcagctaac gaagatcctg 1081 atgaaaaaag gagaaagttt ttagagcgaa atagagcagc agcttcaaga tgccgacaaa 1141 aaaggaaagt ctgggttcag tctttagaga agaaagctga agacttgagt tcattaaatg 1201 gtcagctgca gagtgaagtc accctgctga gaaatgaagt ggcacagctg aaacagcttc 1261 ttctggctca taaagattgc cctgtaaccg ccatgcagaa gaaatctggc tatcatactg 1321 ctgataaaga tgatagttca gaagacattt cagtgccgag tagtccacat acggaagcta 1381 tacagcatag ttcggtcagc acatccaatg gagtcagttc aacctccaag gcagaagctg 1441 tagccacttc agtcctcacc cagatggcgg accagagtac agagcctgct ctttcacaga 1501 tcgttatggc tccttcctcc cagtcacagc cctcaggaag ttgattaaaa acctgcagta 1561 caacagttta gatactcatt agtgacttca aagggaaatc aaggaaagac cagtttccat 1621 ttatgcgaaa tctgtggttg taaattt // LOCUS HSCRFBP 1248 bp RNA PRI 24-MAY-1991 DEFINITION Human mRNA for corticotropin-releasing factor binding protein (CRF-BP). ACCESSION X58022 NID g30218 KEYWORDS corticotropin-releasing factor binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1248) AUTHORS Potter,E., Behan,D.P., Fischer,W.H., Linton,E.A., Lowry,P.J. and Vale,W.W. TITLE Cloning and characterization of the cDNAs for human and rat corticotropin releasing factor-binding proteins JOURNAL Nature 349 (6308), 423-426 (1991) MEDLINE 91125460 REFERENCE 2 (bases 1 to 1248) AUTHORS Potter,E. TITLE Direct Submission JOURNAL Submitted (30-APR-1991) E. Potter, Salk Institute, Peptide Biology Laboratory, P O Box 85800, San Diego CA 92186, USA FEATURES Location/Qualifiers source 1..1248 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="liver" /clone_lib="Llamda Zap II" /sex="Male" mRNA 1..1248 /note="corticotropin releasing factor-binding protein" /evidence=experimental CDS 47..1015 /codon_start=1 /product="corticotropin releasing factor-binding protein" /db_xref="PID:g30219" /db_xref="SWISS-PROT:P24387" /translation="MSPNFKLQCHFILIFLTALRGESRYLELREAADYDPFLLFSANL KRDVAGEQPYRRALRCLDMLSLQGQFTFTADRPQLHCAAFFISEPEEFITIHYDQVSI DCQGGDFLKVFDGWILKGEKFPSSQDHPLPSAERYIDFCESGLSRRSIRSSQNVAMIF FRVHEPGNGFTLTIKTDPNLFPCNVISQTPNGKFTLVVPHQHRNCSFSIIYPVVIKIS DLTLGHVNGLQLKKSSAGCEGIGDFVELLEGTGLDPSKMTPLADLCYPFHGPAQMKVG CDNTVVRMVSSGKHVNRVTFEYRQLEPYELENPNGNSIGEFCLSGL" BASE COUNT 308 a 325 c 301 g 314 t ORIGIN 1 ggacctccgg agcagacagc acagcagctg cagaggcaag gccagcatgt cgcccaactt 61 caaacttcag tgtcacttca ttctcatctt cctgacggct ctaagagggg aaagccggta 121 cctagagctg agggaagcgg cggactacga tcctttcctg ctcttcagcg ccaacctgaa 181 gcgggacgtg gctggggagc agccgtaccg ccgcgctctg cggtgcctgg acatgctgag 241 cctccagggc cagttcacct tcaccgccga ccggccgcag ctgcactgcg cagccttctt 301 catcagcgag cccgaggagt tcattaccat ccactacgac caggtctcca tcgactgtca 361 gggcggcgac ttcctgaagg tatttgatgg ttggattctc aagggggaga agttccccag 421 ttcccaggat catcctctcc cctcagctga gcggtacata gatttctgtg agagtggtct 481 tagcaggagg agcatcagat cttcccagaa tgtggccatg atcttcttcc gagtccatga 541 accaggaaat ggattcacat taaccataaa gacagacccc aacctctttc cttgcaatgt 601 catttctcag actccaaatg gaaagtttac cctggtagtt ccacaccagc atcgaaactg 661 cagcttctcc ataatttatc ctgtggtgat caaaatatct gatcttaccc tgggacacgt 721 aaatggtctt cagttaaaga aatcctcagc aggttgcgag ggaataggag actttgtgga 781 gctgctggag ggaactggat tggacccttc caagatgacg cctttagctg atctctgcta 841 cccctttcat ggcccggccc agatgaaagt tggctgtgac aacactgtgg tgcgcatggt 901 ctccagtgga aaacacgtaa atcgtgtgac ttttgagtat cgtcagctgg agccgtacga 961 gctggaaaac ccaaatggaa acagtatcgg ggaattctgt ttgtctggtc tttgaataac 1021 caacccagtg atttacatgc tgatagctaa gtgagttttt aatggccatt gtgtatgatt 1081 ttgatgcaca actagttaaa agcctttcat accagtcagt atttcccagc cttgagcgca 1141 cgcacacacc acacacatac acacacgcat tatttttgtt actttgcttc tttttatgtt 1201 tgtaatctgt aaatgaacac atggcagaaa ataaccctga ttggtagg // LOCUS HSCRISP1D 1797 bp RNA PRI 12-APR-1996 DEFINITION H.sapiens mRNA for cysteine-rich secretory protein-1 delta. ACCESSION X95238 NID g1262812 KEYWORDS CRISP-1delta gene; cysteine-rich secretory protein-1 delta. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1797) AUTHORS Kratzschmar,J., Haendler,B., Eberspaecher,U., Roosterman,D., Donner,P. and Schleuning,W.D. TITLE The human cysteine-rich secretory protein (CRISP) family. Primary structure and tissue distribution of CRISP-1, CRISP-2 and CRISP-3 JOURNAL Eur. J. Biochem. 236 (3), 827-836 (1996) MEDLINE 96270732 REFERENCE 2 (bases 1 to 1797) AUTHORS Haendler,B. TITLE Direct Submission JOURNAL Submitted (18-JAN-1996) B. Haendler, Schering AG, ICMB, S109/517, 13342 Berlin, FRG FEATURES Location/Qualifiers source 1..1797 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="epididymis cDNA library" gene 74..610 /gene="CRISP-1delta" sig_peptide 74..136 /gene="CRISP-1delta" CDS 74..610 /gene="CRISP-1delta" /codon_start=1 /product="cysteine-rich secretory protein-1 delta" /db_xref="PID:e221224" /db_xref="PID:g1262813" /translation="MEIKHLLFLVAAACLLPMLSMKKKSARDQFNKLVTDLPNVQEEI VNIHNALRRRVVPPASNMLKMSWSEEAAQNARIFSKYCDMTESNPLERRLPNTFCGEN MHMTSYPVSWSSVIGVWYSESTSFKHGEWTTTDDDITTDHYTQIVWATSYLIGCAIAS CRQQGSPRYLYVCHYCHD" polyA_signal 960..965 variation 1066 /note="variation with CRISP-1" polyA_signal 1291..1296 polyA_signal 1301..1306 polyA_signal 1767..1772 BASE COUNT 590 a 337 c 328 g 542 t ORIGIN 1 gcacaaatac actacataga gaaaggcttg gttcttatca ggacacaaat ttaaaggctg 61 tgtggacttg gggatggaaa ttaaacacct cttgtttttg gttgctgctg cttgcttact 121 gcctatgttg tccatgaaaa agaaatcagc tagagaccaa tttaataagc tcgtcaccga 181 cttgccaaat gtacaagaag agatcgttaa tatacacaac gccctcagga gaagagtagt 241 tccaccagcc agcaacatgc tgaagatgag ttggagtgaa gaggctgcac aaaatgccag 301 aattttttca aagtattgtg atatgacaga gagcaacccc cttgagagga gacttccaaa 361 taccttttgt ggagaaaata tgcatatgac atcttatcct gtatcatggt caagtgtaat 421 tggagtctgg tacagtgagt ctacaagttt caaacatgga gaatggacaa caacggatga 481 tgacataact actgaccact acactcagat tgtttgggcc acatcttacc tgattggctg 541 tgccattgca tcttgccgcc aacaaggatc acctcgatat ctctacgttt gtcactattg 601 tcatgactaa cccctgcatc tactatgatg aatacttcga ctgtgacata caagtccatt 661 atctgggatg caaccactca acaactatcc tattctgtaa agccacttgt ctgtgtgaca 721 ctgagataaa ataggtcttt gttattttca actgttctat gctgtgacga tgaggaggag 781 atgtctgttg gattcatgtc ttttgctata gttcagtagc ttctgctaaa tttcactgat 841 tttaatcatg ctggagacct taactcccat cctgatacat cctgaagtaa cactgtttta 901 aactttctta gtgctggagt aaaaggtcaa gtccaacacc tgccttaaat ttaaatcatg 961 tgatttatag tttttaagtt ggcataattc aacttatggt ataactgggt ccctcaacag 1021 taacctgggc taaaataggt cttatgtggt tcaactccca cccccacctt ccccatattt 1081 tcaaccactc tgattatctt ccctgcacaa ctaacatcca gtaataattc ttcactttta 1141 aaattttact tctactttaa atcaatcatt aaaggaatcc acaaagcaaa cagagttcag 1201 tctcatcttg caaggtaaat atcatttaat tggaagtagt ttaaatgtct cattgtttta 1261 ttgacacatc tatatataca tttgtgaagc aagaaacaat aaaaaagctt cgtatgccat 1321 taatttaaca aaatatgtat tcagtactga ttgcatacaa gatgcatgtt tatatatatg 1381 gaaggaatat agtttcattt cattgcaaag gcagtataaa agatatataa aatagcataa 1441 tatgagaaat taagtcccta aagacatata ggtcacatat tattattgcc agatgagcat 1501 aaatagcttc tgtttggaga ttcaggaaag ccttagggtg gaatgaggaa catcttctga 1561 gtaaacaggg ttgcaaaggt tatgattatt tcaacacaat ggaagagcac agttaaggcc 1621 aactaacgta aaatgcactg aagccttagg gaatattgaa gggcctgaca tggggaaagg 1681 gaaggctaga aatacttggt caaattttaa cattatacca aagttatacc cagttctacc 1741 tacttgtata tttctttact catttcaata aagtgtttga aaaaaaaaaa aaaaaaa // LOCUS HSCRISP2I 1406 bp RNA PRI 12-APR-1996 DEFINITION H.sapiens mRNA for cysteine-rich secretory protein-2/type I. ACCESSION X95239 NID g1262816 KEYWORDS CRISP-2 gene; cysteine-rich secretory protein-2/type I. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1406) AUTHORS Kratzschmar,J., Haendler,B., Eberspaecher,U., Roosterman,D., Donner,P. and Schleuning,W.D. TITLE The human cysteine-rich secretory protein (CRISP) family. Primary structure and tissue distribution of CRISP-1, CRISP-2 and CRISP-3 JOURNAL Eur. J. Biochem. 236 (3), 827-836 (1996) MEDLINE 96270732 REFERENCE 2 (bases 1 to 1406) AUTHORS Haendler,B. TITLE Direct Submission JOURNAL Submitted (18-JAN-1996) B. Haendler, Schering AG, ICMB, S109/517, 13342 Berlin, FRG COMMENT Related sequence J04741. FEATURES Location/Qualifiers source 1..1406 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="testis cDNA library" sig_peptide 241..300 /gene="CRISP-2" CDS 241..972 /gene="CRISP-2" /codon_start=1 /product="cysteine-rich secretory protein-2/type I" /db_xref="PID:e221226" /db_xref="PID:g1262817" /translation="MALLPVLFLVTVLLPSLPAEGKDPAFTALLTTQLQVQREIVNKH NELRKAVSPPASNMLKMEWSREVTTNAQRWANKCTLQHSDPEDRKTSTRCGENLYMSS DPTSWSSAIQSWYDEILDFVYGVGPKSPNAVVGHYTQLVWYSTYQVGCGIAYCPNQDS LKYYYVCQYCPAGNNMNRKNTPYQQGTPCAGCPDDCDKGLCTNSCQYQDLLSNCDSLK NTAGCEHELLKEKCKATCLCENKIY" gene 241..972 /gene="CRISP-2" polyA_signal 1362..1367 BASE COUNT 458 a 293 c 289 g 366 t ORIGIN 1 cggtgagagg ggcgcgcagc agcagctcct caacgccgca acgcgccggc ccaactgcag 61 gaaggtctgt gctctggagc cagggtaaat ggttataaaa ttatacacca tggccctcct 121 aaagacactc taggaaaacc atgtcatcct gatcttaaaa cacctgcaag aaagagcaca 181 gtacttcacc attaataaag tagatatttc atcctgctca gaaaaccaac atttccagca 241 atggctttac taccggtgtt gtttctggtt actgtgctgc ttccatcttt acctgcagaa 301 ggaaaggatc ccgcttttac tgctttgtta accacccagt tgcaagtgca aagggagatt 361 gtaaataaac acaatgaact aaggaaagca gtctctccac ctgccagtaa catgctaaag 421 atggaatgga gcagagaggt aacaacgaat gcccaaaggt gggcaaacaa gtgcacttta 481 caacatagtg atccagagga ccgcaaaacc agtacaagat gtggtgagaa tctctatatg 541 tcaagtgacc ctacttcctg gtcttctgca atccaaagct ggtatgacga gatcctagat 601 tttgtctatg gtgtaggacc aaagagtccc aatgcagttg ttggacatta tactcagctt 661 gtttggtact cgacttacca ggtaggctgt ggaattgcct actgtcccaa tcaagatagt 721 ctaaaatact actatgtttg ccaatattgt cctgctggta ataatatgaa tagaaagaat 781 accccgtacc aacaaggaac accttgtgcc ggttgccctg atgactgtga caaaggacta 841 tgcaccaata gttgccagta tcaagatctc ctaagtaact gtgattcctt gaagaataca 901 gctggctgtg aacatgagtt actcaaggaa aagtgcaagg ctacttgcct atgtgagaac 961 aaaatttact gatttaccta gtgagcattg tgcaagactg catggataag ggctgcatca 1021 tttaattgcg acataccagt ggaaattgta tgtatgttag tgacaaattt gatttcaaag 1081 agcaatgcat cttctccccc agatcatcac agaaatcact ttcaggcaat gatttacaaa 1141 agtagcatag tagatgatga caactgtgaa ctctgacata aatttagtgc tttataacga 1201 actgaatcag gttgaggatt ttgaaaactg tataaccata ggatttaggt cactaggact 1261 ttggatcaaa atggtgcatt acgtatttcc tgaaacatgc taaagaagaa gactgtaaca 1321 tcattgccat tcctactacc tgagttttta cttgcataaa caataaattc aaagctttac 1381 atctgcaaaa aaaaaaaaaa aaaaaa // LOCUS HSCRISP3G 2128 bp RNA PRI 12-APR-1996 DEFINITION H.sapiens mRNA for cysteine-rich secretory protein-3. ACCESSION X95240 NID g1262818 KEYWORDS CRISP-3 gene; cysteine-rich secretory protein-3. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2128) AUTHORS Kratzschmar,J., Haendler,B., Eberspaecher,U., Roosterman,D., Donner,P. and Schleuning,W.D. TITLE The human cysteine-rich secretory protein (CRISP) family. Primary structure and tissue distribution of CRISP-1, CRISP-2 and CRISP-3 JOURNAL Eur. J. Biochem. 236 (3), 827-836 (1996) MEDLINE 96270732 REFERENCE 2 (bases 1 to 2128) AUTHORS Haendler,B. TITLE Direct Submission JOURNAL Submitted (18-JAN-1996) B. Haendler, Schering AG, ICMB, S109/517, 13342 Berlin, FRG FEATURES Location/Qualifiers source 1..2128 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="testis cDNA library" sig_peptide 16..75 /gene="CRISP-3" CDS 16..753 /gene="CRISP-3" /codon_start=1 /product="cysteine-rich secretory protein-3" /db_xref="PID:e221225" /db_xref="PID:g1262819" /translation="MTLFPVLLFLVAGLLPSFPANEDKDPAFTALLTTQTQVQREIVN KHNELRRAVSPPARNMLKMEWNKEAAANAQKWANQCNYRHSNPKDRMTSLKCGENLYM SSASSSWSQAIQSWFDEYNDFDFGVGPKTPNAVVGHYTQVVWYSSYLVGCGNAYCPNQ KVLKYYYVCQYCPAGNWANRLYVPYEQGAPCASCPDNCDDGLCTNGCKYEDLYSNCKS LKLTLTCKHQLVRDSCKASCNCSNSIY" gene 16..753 /gene="CRISP-3" polyA_signal 2084..2089 BASE COUNT 734 a 397 c 380 g 617 t ORIGIN 1 ctggaaacca ctgcaatgac attattccca gtgctgttgt tcctggttgc tgggctgctt 61 ccatcttttc cagcaaatga agataaggat cccgctttta ctgctttgtt aaccacccaa 121 acacaagtgc aaagggagat tgtgaataag cacaatgaac tgaggagagc agtatctccc 181 cctgccagaa acatgctgaa gatggaatgg aacaaagagg ctgcagcaaa tgcccaaaag 241 tgggcaaacc agtgcaatta cagacacagt aacccaaagg atcgaatgac aagtctaaaa 301 tgtggtgaga atctctacat gtcaagtgcc tccagctcat ggtcacaagc aatccaaagc 361 tggtttgatg agtacaatga ttttgacttt ggtgtagggc caaagactcc caacgcagtg 421 gttggacatt atacacaggt tgtttggtac tcttcatacc tcgttggatg tggaaatgcc 481 tactgtccca atcaaaaagt tctaaaatac tactatgttt gccaatattg tcctgctggt 541 aattgggcta atagactata tgtcccttat gaacaaggag caccttgtgc cagttgccca 601 gataactgtg acgatggact atgcaccaat ggttgcaagt acgaagatct ctatagtaac 661 tgtaaaagtt tgaagctcac attaacctgt aaacatcagt tggtcaggga cagttgcaag 721 gcctcctgca attgttcaaa cagcatttat taaatacgca ttacacaccg agtagggcta 781 tgtagagagg agtcagatta tctacttaga tttggcatct acttagattt aacatatact 841 agctgagaaa ttgtaggcat gtttgataca catttgattt caaatgtttt tcttctggat 901 ctgcttttta ttttacaaaa atatttttca tacaaatggt taaaaagaaa caaaatctat 961 aacaacaact ttggattttt atatataaac tttgtgattt aaatttactg aatttaatta 1021 gggtgaaaat tttgaaagtt gtattctcat atgactaagt tcactaaaac cctggattga 1081 aagtgaaaat tatgttccta gaacaaaatg tacaaaaaga acaatataat tttcacatga 1141 acccttggct gtagttgcct ttcctagctc cactctaagg ctaagcatct tcaaagacgt 1201 tttcccatat gctgtcttaa ttcttttcac tcattcaccc ttcttcccaa tcatctggct 1261 ggcatcctca caattgagtt gaagctgttc ctcctaaaac aatcctgact tttattttgc 1321 caaaatcaat acaatccttt gaatttttta tctgcataaa ttttacagta gaatatgatc 1381 aaaccttcat ttttaaacct ctcttctctt tgacaaaact tccttaaaaa agaatacaag 1441 ataatatagg taaataccct ccactcaagg aggtagaact cagtcctctc ccttgtgagt 1501 cttcactaaa atcagtgact cacttccaaa gagtggagta tggaaaggga aacatagtaa 1561 ctttacaggg gagaaaaatg acaaatgacg tcttcaccaa gtgatcaaaa ttaacgtcac 1621 cagtgataag tcattcagat ttgttctaga taatctttct aaaaattcat aatcccaatc 1681 taattatgag ctaaaacatc cagcaaactc aagttgaagg acattctaca aaatatccct 1741 ggggtatttt agagtattcc tcaaaactgt aaaaatcatg gaaaataagg gaatcctgag 1801 aaacaatcac agaccacatg agactaagga gacatgtgag ccaaatgcaa tgtgcttctt 1861 ggatcagatc ctggaacaga aaaagatcag taatgaaaaa actgatgaag tctgaataga 1921 atctggagta tttttaacag tagtgttgat ttcttaatct tgacaaatat agcagggtaa 1981 tgtaagatga taacgttaga gaaactgaaa ctgggtgagg gctatctagg aattctctgt 2041 actatcttac caaattttcg gtaagtctaa gaaagcaatg caaaataaaa agtgtcttga 2101 aaaaaaaaaa aaaaaaaaaa aaaaaaaa // LOCUS HSCRKL 1881 bp RNA PRI 16-NOV-1993 DEFINITION H.sapiens crk-like gene CRKL. ACCESSION X59656 NID g416519 KEYWORDS CRKL gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1881) AUTHORS ten Hoeve,J., Morris,C., Heisterkamp,N. and Groffen,J. TITLE Isolation and chromosomal localization of CRKL, a human crk-like gene JOURNAL Oncogene 8 (9), 2469-2474 (1993) MEDLINE 93368949 REFERENCE 2 (bases 1 to 1881) AUTHORS Groffen,J. TITLE Direct Submission JOURNAL Submitted (02-NOV-1993) J. Groffen, Childrens Hospital of Los Angeles, 4650 Sunset, Blud, Mailstop # 103, Dept. of Pathology, Los Angeles CA 90027, USA FEATURES Location/Qualifiers source 1..1881 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="22" /map="q11" gene 510..1421 /gene="CRKL" CDS 510..1421 /gene="CRKL" /codon_start=1 /db_xref="PID:g416520" /db_xref="SWISS-PROT:P46109" /translation="MSSARFDSSDRSAWYMGPVSRQEAQTRLQGQRHGMFLVRDSSTC PGDYVLSVSENSRVSHYIINSLPNRRFKIGDQEFDHLPALLEFYKIHYLDTTTLIEPA PRYPSPPMGSVSAPNLPTAEDNLEYVRTLYDFPGNDAEDLPFKKGEILVIIEKPEEQW WSARNKDGRVGMIPVPYVEKLVRSSPHGKHGNRNSNSYGIPEPAHAYAQPQTTTPLPA VSGSPGAAITPLPSTQNGPVFAKAIQKRVPCAYDKTALALEVGDIVKVTRMNINGQWE GEVNGRKGLFPFTHVKIFDPQNPDENE" BASE COUNT 399 a 538 c 532 g 412 t ORIGIN 1 cggaggggga ggtggctgcc gcttctcccg cgtccgccat tttgttgctg tggctattgg 61 gaacaagctg ggcaaaagca ccccggaggc gcgacgctcc ttcgagttcg gtgcctcgtg 121 tgacggcggg ggtcggtgaa gacccgtcga gctgcggcgc cggcgcgttc caggccggga 181 gtcactggag gcacccctgg gacgccgagc agcccgagaa ccccggggtg gcctccgctg 241 cggctcgggt ttgcctgccc cgaccccccg gctctgccgt gcattcccgg gcggctctct 301 ccgtgtggcg gccccggagc aggcgggcgg cgtcggagga tgctgcgggc ccggagccga 361 gaggaaagtg ctggcccagc cctctgagcg ctcctcgagg tgtgcgagag gcccttcctc 421 ggccccaaag ccgtctgccg ggctaaggcg tgcagagcag gcgaggacag ccgccgcccc 481 taccgccgca gagtccccgg tccaacacca tgtcctccgc caggttcgac tcctcggacc 541 gctccgcctg gtatatgggg ccggtgtctc gccaggaggc gcagacccgg ctccagggcc 601 agcgccacgg tatgttcctc gtccgcgatt cttccacctg ccctggggac tatgtgctgt 661 cggtgtccga gaactcgcgg gtctcccact acattatcaa ctcgctgccc aaccgccgtt 721 ttaagatcgg ggaccaggaa tttgaccatt tgccggccct gctggagttt tacaagatcc 781 actacctgga caccaccacc ctcatcgagc ctgcgcccag gtatccaagc ccaccaatgg 841 gatctgtctc agcacccaac ctgcctacag cagaagataa cctggaatat gtacggactc 901 tgtatgattt tcctgggaat gatgccgaag acctgccctt taaaaagggt gagatcctag 961 tgataataga gaagcctgaa gaacagtggt ggagtgcccg gaacaaggat ggccgggttg 1021 ggatgattcc tgtcccttat gtcgaaaagc ttgtgagatc ctcaccacac ggaaagcatg 1081 gaaataggaa ttccaacagt tatgggatcc cagaacctgc tcatgcatac gctcaacctc 1141 agaccacaac tcctctacct gcagtttccg gttctcctgg ggcagcaatc acccctttgc 1201 catccacaca gaatggacct gtctttgcga aagcaatcca gaaaagagta ccctgtgctt 1261 atgacaagac tgccttggca ttagaggttg gtgacatcgt gaaagtcaca aggatgaata 1321 taaatggcca gtgggaaggc gaagtgaacg ggcgcaaagg gcttttcccc tttacgcacg 1381 tcaaaatctt tgaccctcaa aacccagatg aaaacgagtg attgctgttg ccctgtttcc 1441 tgctgctttg ttgttctgcc tgtcctagtc tcctttgaag tgggaaagca ttttctctca 1501 taggcaagtc acactgcatt gccgaagtcc agctttctgc agactggcag tcgcacacac 1561 atttggaatg cacacagcgg ctgcctcctg atgtttgtat catagtcgta ttgtgcaaag 1621 agtagccgat tttagagttc ttttggatca taaactggaa atactgatgg aagcacacaa 1681 gtggagagaa gttgacgtgg aaagggtctt ccttctcatt gctgcccgtt tgtacatggg 1741 actgattctg tcgtgttcac cagagaaagc ttgaggccat ggcgagatac tgcatgtttg 1801 ctgttccaca aagcagtggc ttagctgcca tcttgctttt ctttggacaa caggaagtga 1861 accttaagga agagagaatt c // LOCUS HSCSP40 1676 bp RNA PRI 22-MAR-1995 DEFINITION Human SP-40,40 mRNA for complement-associated protein SP-40,40 alpha-1 and beta-1 chain. ACCESSION X14723 NID g30250 KEYWORDS complement-associated protein; serum protein; SP-40,40 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1676) AUTHORS Kirszbaum,L. TITLE Direct Submission JOURNAL Submitted (17-MAR-1989) Kirszbaum L., The University of Melbourne, The Preclinical Centre, School of Veterinary Science, Parkville Victoria 3052, Australia REFERENCE 2 (bases 1 to 1676) AUTHORS Kirszbaum,L., Sharpe,J.A., Murphy,B., d'Apice,A.J., Classon,B., Hudson,P. and Walker,I.D. TITLE Molecular cloning and characterization of the novel, human complement-associated protein, SP-40,40: a link between the complement and reproductive systems JOURNAL EMBO J. 8 (3), 711-718 (1989) MEDLINE 89251601 COMMENT The sequence overlaps with that reported by Murphy et. al. in J. Clin. Invest. 81:1858-1864(1988). FEATURES Location/Qualifiers source 1..1676 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /clone_lib="lambda gt11" /clone="LK (107)" sig_peptide 48..113 /note="signal peptide (AA -22 to -1)" CDS 48..1397 /note="SP-40,40 prepropetide (AA -22 to 427)" /codon_start=1 /db_xref="PID:g30251" /db_xref="SWISS-PROT:P10909" /translation="MMKTLLLFVGLLLTWESGQVLGDQTVSDNELQEMSNQGSKYVNK EIQNAVNGVKQIKTLIEKTNEERKTLLSNLEEAKKKKEDALNETRESETKLKELPGVC NETMMALWEECKPCLKQTCMKFYARVCRSGSGLVGRQLEEFLNQSSPFYFWMNGDRID SLLENDRQQTHMLDVMQDHFSRASSIIDELFQDRFFTREPQDTYHYLPFSLPHRRPHF FFPKSRIVRSLMPFSPYEPLNFHAMFQPFLEMIHEAQQAMDIHFHSPAFQHPPTEFIR EGDDDRTVCREIRHNSTGCLRMKDQCDKCREILSVDCSTNNPSQAKLRRELDESLQVA ERLTRKYNELLKSYQWKMLNTSSLLEQLNEQFNWVSRLANLTQGEDQYYLRVTTVASH TSDSDVPSGVTEVVVKLFDSDPITVTVPVEVSRKNPKFMETVAEKALQEYRKKHREE" misc_feature 114..>114 /note="beta-chain" misc_feature 114..1394 /note="SP-40,40 propetide (AA 1-427)" mat_peptide 729..1394 /note="mature alpha-chain (AA 205-427)" misc_feature 1622..1627 /note="pot. polyA signal" BASE COUNT 436 a 488 c 437 g 315 t ORIGIN 1 gaattccgcc gctgaccgag gcgtgcaaag actccagaat tggaggcatg atgaagactc 61 tgctgctgtt tgtggggctg ctgctgacct gggagagtgg gcaggtcctg ggggaccaga 121 cggtctcaga caatgagctc caggaaatgt ccaatcaggg aagtaagtac gtcaataagg 181 aaattcaaaa tgctgtcaac ggggtgaaac agataaagac tctcatagaa aaaacaaacg 241 aagagcgcaa gacactgctc agcaacctag aagaagccaa gaagaagaaa gaggatgccc 301 taaatgagac cagggaatca gagacaaagc tgaaggagct cccaggagtg tgcaatgaga 361 ccatgatggc cctctgggaa gagtgtaagc cctgcctgaa acagacctgc atgaagttct 421 acgcacgcgt ctgcagaagt ggctcaggcc tggttggccg ccagcttgag gagttcctga 481 accagagctc gcccttctac ttctggatga atggtgaccg catcgactcc ctgctggaga 541 acgaccggca gcagacgcac atgctggatg tcatgcagga ccacttcagc cgcgcgtcca 601 gcatcataga cgagctcttc caggacaggt tcttcacccg ggagccccag gatacctacc 661 actacctgcc cttcagcctg ccccaccgga ggcctcactt cttctttccc aagtcccgca 721 tcgtccgcag cttgatgccc ttctctccgt acgagcccct gaacttccac gccatgttcc 781 agcccttcct tgagatgata cacgaggctc agcaggccat ggacatccac ttccacagcc 841 cggccttcca gcacccgcca acagaattca tacgagaagg cgacgatgac cggactgtgt 901 gccgggagat ccgccacaac tccacgggct gcctgcggat gaaggaccag tgtgacaagt 961 gccgggagat cttgtctgtg gactgttcca ccaacaaccc ctcccaggct aagctgcggc 1021 gggagctcga cgaatccctc caggtcgctg agaggttgac caggaaatac aacgagctgc 1081 taaagtccta ccagtggaag atgctcaaca cctcctcctt gctggagcag ctgaacgagc 1141 agtttaactg ggtgtcccgg ctggcaaacc tcacgcaagg cgaagaccag tactatctgc 1201 gggtcaccac ggtggcttcc cacacttctg actcggacgt tccttccggt gtcactgagg 1261 tggtcgtgaa gctctttgac tctgatccca tcactgtgac ggtccctgta gaagtctcca 1321 ggaagaaccc taaatttatg gagaccgtgg cggagaaagc gctgcaggaa taccgcaaaa 1381 agcaccggga ggagtgagat gtggatgttg cttttgcacc ttacgggggc atcttgagtc 1441 cagctccccc caagatgagc tgcagccccc cagagagagc tctgcacgtc accaagtaac 1501 caggccccag cctccaggcc cccaactccg cccagcctct ccccgctctg gatcctgcac 1561 tctaacactc gactctgctg ctcatgggaa gaacagaatt gctcctgcat gcaactaatt 1621 caataaaact gtcttgtgag ctgaaaaaaa aaaaaaaaaa aaaaaaaaag gaattc // LOCUS HSCTF1 1755 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for CAAT-box binding transcription factor CTF-1 (syn. CTF/NFI or CTF or NF-I or NF-1). ACCESSION X12492 NID g30265 KEYWORDS DNA-binding protein; transcription factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1755) AUTHORS Mermod,N. TITLE Direct Submission JOURNAL Submitted (27-JUL-1988) Mermod N., Howard Hughes Medical Institute, Dept of Biochemistry, University of California, Berkeley, CA 94720, USA REFERENCE 2 (bases 1 to 1755) AUTHORS Santoro,C., Mermod,N., Andrews,P.C. and Tjian,R. TITLE A family of human CCAAT-box-binding proteins active in transcription and DNA replication: cloning and expression of multiple cDNAs JOURNAL Nature 334 (6179), 218-224 (1988) MEDLINE 88288392 COMMENT Data kindly reviewed (14-SEP-1988) by Mermod N. FEATURES Location/Qualifiers source 1..1755 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /clone_lib="lambda gt10" /clone="CTF-1" CDS 68..1567 /note="CTF-1 factor (AA 1 - 499)" /codon_start=1 /db_xref="PID:g30266" /db_xref="SWISS-PROT:P08651" /translation="MDEFHPFIEALLPHVRAFAYTWFNLQARKRKYFKKHEKRMSKDE ERAVKDELLGEKPEVKQKWASRLLAKLRKDIRPECREDFVLSITGKKAPGCVLSNPDQ KGKMRRIDCLRQADKVWRLDLVMVILFKGIPLESTDGERLVKAAQCGHPVLCVQPHHI GVAVKELDLYLAYFVRERDAEQSGSPRTGMGSDQEDSKPITLDTTDFQESFVTSGVFS VTELIQVSRTPVVTGTGPNFSLGELQGHLAYDLNPASTGLRRTLPSTSSSGSKRHKSG SMEEDVDTSPGGDYYTSPSSPTSSSRNWTEDMEGGISSPVKKTEMDKSPFNSPSPQDS PRLSSFTQHHRPVIAVHSGIARSPHPSSALHFPTTSILPQTASTYFPHTAIRYPPHLN PQDPLKDLVSLACDPASQQPGPLNGSGQLKMPSHCLSAQMLAPPPPGLPRLALPPATK PATTSEGGATSPTSPSYSPPDTSPANRSFVGLGPRDPAGIYQAQSWYLG" misc_feature 602..673 /note="sequence absent in CTF-3" misc_feature 1310..1463 /note="sequence absent in CTF-2 and CTF-3" BASE COUNT 366 a 623 c 496 g 270 t ORIGIN 1 gaattccggc ggcggccgcg agcgcgctcg gtccccggcg ccggcctcgc ctcctcgcag 61 cagcgccatg gatgagttcc acccgttcat cgaggccctg ctgcctcacg tccgcgcctt 121 cgcctacacc tggttcaacc tgcaggcgcg gaagcgcaag tacttcaaga agcacgagaa 181 gcggatgtcg aaggacgagg agcgtgcggt caaggacgag ctgctgggcg agaagcccga 241 ggtcaagcag aagtgggcgt cgcggctgct ggccaagctg cgcaaggaca tccggcccga 301 gtgccgcgag gacttcgtgc tgagcatcac cggcaagaag gcgccgggct gcgtgctctc 361 caaccccgac cagaagggca agatgcggcg catcgactgt ctccggcagg cggacaaggt 421 gtggcggctg gacctggtca tggtcatcct gttcaagggc atcccgctgg agagcaccga 481 cggcgagcgc ctggtcaagg ctgcgcagtg cggtcacccg gtcctgtgcg tgcagccgca 541 ccacattggc gtggccgtca aggagctgga cctctacctg gcctacttcg tgcgtgagcg 601 agatgcagag caaagcggca gtccccggac agggatgggc tctgaccagg aggacagcaa 661 gcccatcacg ctggacacga ccgacttcca ggagagcttt gtcacctccg gcgtgttcag 721 cgtcactgag ctcatccaag tgtcccggac acccgtggtg actggaacag gacccaactt 781 ctccctgggg gagctgcagg ggcacctggc atacgacctg aacccagcca gcactggcct 841 cagaagaacg ctgcccagca cctcctccag tgggagcaag cggcacaaat cgggctcgat 901 ggaggaagac gtggacacga gccctggcgg cgattactac acttcgccca gctcgcccac 961 gagtagcagc cgcaactgga cggaggacat ggaaggaggc atctcgtccc cggtgaagaa 1021 gacagagatg gacaagtcac cattcaacag cccgtccccc caggactctc cccgcctctc 1081 cagcttcacc cagcaccacc ggcccgtcat cgccgtgcac agcgggatcg cccggagccc 1141 acacccgtcc tccgctctgc atttccctac gacgtccatc ctaccccaga cggcctccac 1201 ctacttcccc cacacggcca tccgctaccc acctcatctc aacccccagg acccgctcaa 1261 agatcttgtc tcgctggcct gcgacccagc cagccagcaa cctggaccgt taaatggaag 1321 tggtcagctc aaaatgccca gccactgcct ttctgctcag atgctggcac ctccgccccc 1381 ggggctgcca cggctggcgc tcccccctgc caccaaaccc gccaccacct ccgagggagg 1441 agccacgtcg ccgacctcgc cttcctactc tccgcccgac acgtcccctg caaaccgttc 1501 ctttgtggga ttaggaccaa gggatcctgc gggcatttat caggcacagt cctggtatct 1561 gggatagcaa aggtcttctt ccctcgcccc ttctccatcg tcccaggaat cccagggggc 1621 agcacagccg cccccggccc acgtttttgg tggaaaatta gagtgaacaa gaacacccct 1681 gccgactccc agcccggcca aaaagacaaa acacatagac gcacacactc aggaggaaaa 1741 gaaaaaccgg aattc // LOCUS HSCTMRSU 3067 bp RNA PRI 24-AUG-1993 DEFINITION H.sapiens subunit of coatomer complex. ACCESSION X70476 NID g298096 KEYWORDS coatomer complex. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3067) AUTHORS Harrison-Lavoie,K.J., Lewis,V.A., Hynes,G.M., Collison,K.S., Nutland,E. and Willison,K.R. TITLE A 102 kDa subunit of a Golgi-associated particle has homology to beta subunits of trimeric G proteins JOURNAL EMBO J. 12 (7), 2847-2853 (1993) MEDLINE 93327774 REFERENCE 2 (bases 1 to 3067) AUTHORS Harrison-Lavoie,K.J. TITLE Direct Submission JOURNAL Submitted (12-MAY-1993) K.J. Harrison-Lavoie, Institute of Cancer Research, Chester Beatty Labs., Section of Cell & Molecular Biology, 237 Fulham Road, SW3 6JB London, UK FEATURES Location/Qualifiers source 1..3067 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HT14.2" /tissue_type="Fibrosarcoma" /cell_line="HT1080" /sex="Male" mRNA 1..3067 /function="This cDNA encodes a subunit of coatomer complex which is involved in vesicle formation in the constitutive exocytic pathway." CDS 69..2789 /codon_start=1 /product="subunit of coatomer complex" /db_xref="PID:g298097" /db_xref="SWISS-PROT:P35606" /translation="MPLRLDIKRKLTARSDRVKSVDLHPTEPWMLASLYNGSVCVWNH ETQTLVKTFEVCDLPVRAAKFVARKNWVVTGADDMQIRVFNYNTLERVHMFEAHSDYI RCIAVHPTQPFILTSSDDMLIKLWDWDKKWSCSQVFEGHTHYVMQIVINPKDNNQFAS ASLDRTIKVWQLGSSSPNFTLEGHEKGVNCIDYYSGGDKPYLISGADDRLVKIWDYQN KTCVQTLEGHAQNVSCASFHPELPIIITGSEDGTVRIWHSSTYRLESTLNYGMERVWC VASLRGSNNVALGYDEGSIIVKLGREEPAMSMDANGKIIWAKHSEVQQANLKAMGDAE IKDGERLPLAVKDMGSCEIYPQTIQHNPNGRFVVVCGDGEYIIYTAMALRNKSFGSAQ EFAWAHDSSEYAIRESNSIVKIFKNFKEKKSFKPDFGAESIYGGFLLGVRSVNGLAFY DWDNTELIRRIEIQPKHIFWSDSGELVCIATEESFFILKYLSEKVLAAQETHEGVTED GIEDAFEVLGEIQEIVKTGLWVGDCFIYTSSVNRLNYYVGGEIVTIAHLDRTMYLLGY IPKDNRLYLGDKELNIISYSLLVSVLEYQTAVMRRDFSMADKVLPTIPKEQRTRVAHF LEKQGFKQQALTVSTDPEHRFELALQLGELKIAYQLAVEAESEQKWKQLAELAISKCQ FGLAQECLHHAQDYGGLLLLATASGNANMVNKLAEGAERDGKNNVAFMSYFLQGKVDA CLELLIRTGRLPEAAFLARTYLPSQVSRVVKLWRENLSKVNQKAAESLADPTEYENLF PGLKEAFVVEEWVKETHADLWPAKQYPLVTPNEERNVMEEGKDFQPSRSTAQQELDGK PASPTPVIVASHTANKEEKSLLELEVDLDNLELEDIDTTDINLDEDILDD" BASE COUNT 922 a 582 c 741 g 822 t ORIGIN 1 ggtgggttta tctcaaggcc tgagtagccg gtaacaaacg agggttcccg ggattggacc 61 gacgcaccat gcctctgcga cttgatatca aaagaaagct aactgctaga tctgatcgag 121 ttaagagtgt ggatctgcat cctacagagc catggatgtt ggcaagtctt tacaatggca 181 gtgtgtgtgt ttggaatcat gaaacacaga cactggtgaa gacatttgaa gtatgtgatc 241 ttcctgttcg agctgcaaag tttgttgcaa ggaagaattg ggttgtgaca ggagcggatg 301 acatgcagat tagagtgttc aattacaata ctctggagag agttcatatg tttgaagcac 361 actcagacta cattcgctgt attgctgttc atccaaccca gcctttcatt ctaactagca 421 gtgatgacat gcttattaag ctctgggact gggataaaaa atggtcttgc tcacaagtgt 481 ttgaaggaca cacccattat gttatgcaga ttgtgatcaa ccccaaagat aacaatcagt 541 ttgccagtgc ctctttggac aggactatca aggtgtggca gttgggctct tcgtcaccaa 601 acttcacttt ggaaggacat gagaaaggcg tgaattgcat tgattactac agtggtgggg 661 acaagccata cctcatttca ggtgcagatg accgtcttgt taaaatatgg gattatcaga 721 ataaaacatg tgtgcagaca ctggaaggac atgcccaaaa tgtgtcttgt gccagctttc 781 atcctgagtt gccaatcatt atcacaggtt cagaagatgg aacagtacgt atttggcatt 841 caagcaccta ccggcttgag agcacactga attatggaat ggagagggta tggtgcgtgg 901 ccagtctaag agggtcaaac aatgtcgctt tgggctatga tgaagggagc atcattgtta 961 agcttggtcg ggaggaacct gccatgtcca tggatgccaa tggaaagata atttgggcca 1021 agcattcaga agtccagcag gccaacctaa aagcaatggg agatgctgaa attaaagatg 1081 gtgaaagatt gccactggca gtaaaggata tgggcagttg tgaaatatac cctcagacta 1141 ttcagcacaa tcctaatggg cggtttgtgg tggtgtgtgg tgatggggag tatatcatct 1201 acacagcaat ggcattgaga aacaagagct ttggatctgc tcaggagttt gcatgggccc 1261 acgattcttc agagtatgca ataagagaga gcaacagcat tgtaaagata tttaagaact 1321 ttaaggaaaa aaaatcattt aaaccagatt ttggagcaga aagtatctac ggcggcttct 1381 tattgggagt cagatctgta aatggcttag ccttctatga ctgggacaat acagaactca 1441 tacgaagaat tgaaattcag cccaaacata ttttctggtc tgactctgga gagctagtct 1501 gtattgctac tgaggaatca ttttttatcc ttaagtatct gtcagaaaaa gtcttggctg 1561 cacaggaaac acatgaggga gttactgaag atggcattga agatgccttt gaggttcttg 1621 gtgagattca ggaaattgtg aaaacagggc tttgggtagg cgattgcttc atttacacaa 1681 gttctgtgaa cagattaaat tattatgttg gaggagaaat agtcaccatt gcccacttgg 1741 acaggacgat gtatctccta ggctacattc ctaaagacaa caggctttat ctgggggata 1801 aagaattgaa catcattagc tattccctgc tggtttcagt cctggaatac cagacagctg 1861 tcatgcggag ggactttagc atggctgata aggtccttcc taccattcca aaagaacaga 1921 ggaccagagt tgcacacttt ttggaaaagc agggcttcaa gcagcaagct cttacagtat 1981 ccacagatcc tgagcatcgt tttgagcttg ctcttcagct tggagagtta aaaattgcat 2041 accagttagc agtggaagca gagtcagaac agaagtggaa acaacttgct gaacttgcca 2101 ttagtaaatg tcagtttggc ctagcccagg agtgcctgca tcatgcacag gattatgggg 2161 gcctgctgct tttggccact gcctctggaa atgctaatat ggtgaacaag ctagcagagg 2221 gtgcggagag agatggcaaa aataatgtgg cattcatgag ctacttttta cagggcaagg 2281 ttgatgcctg cctagagctc ttaattagaa ctggacggct gccagaagct gccttcttgg 2341 cccgaactta cttacccagt caggtttcaa gggtagtgaa actctggaga gagaatctct 2401 caaaagtcaa tcagaaagca gcagaatccc ttgctgaccc aacagagtat gaaaacctgt 2461 tccctggatt aaaagaagcc tttgttgttg aagaatgggt gaaggaaaca catgctgatc 2521 tgtggccagc caaacaatac ccacttgtca cgccaaatga agagagaaat gtcatggaag 2581 agggaaaaga ctttcagccc tcaagatcta cagctcaaca ggaacttgat gggaaacctg 2641 cttctcctac tccggttatt gtggcctccc acacagccaa caaagaagaa aagagtttac 2701 tcgaactaga agtagatttg gataatttgg aattagaaga tattgacaca acagatatca 2761 atctggatga agatattttg gatgattgac tgtaatgctt tccatttacc tgactaaaca 2821 gatcattatt atatataggt attgattgct accctgacca cagtgctttg gactatgaga 2881 aacttcttag atttttatat gtaaatgctg tggaccactg ggagcacaat gcccacatca 2941 tcttaagaag agtttatgtg cagcatttaa atcactgtgt tttccttgtt aactaaaaca 3001 gacatgggct ttgatttttt tcatactatt agaccatatc tcataaaacc ttttgaatta 3061 aaaaaaa // LOCUS HSCTPSYN 2758 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for CTP synthetase (EC 6.3.4.2). ACCESSION X52142 NID g30292 KEYWORDS CTP synthetase; synthetase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2758) AUTHORS Meuth,M. TITLE Direct Submission JOURNAL Submitted (26-MAR-1990) Meuth M., Imperial Cancer Research Fund, Clare Hall Laboratories, South Mimms Herts EN6 3LD, UK REFERENCE 2 (bases 1 to 2758) AUTHORS Yamauchi,M., Yamauchi,N. and Meuth,M. TITLE Molecular cloning of the human CTP synthetase gene by functional complementation with purified human metaphase chromosomes JOURNAL EMBO J. 9 (7), 2095-2099 (1990) MEDLINE 90291972 COMMENT Data kindly reviewed (23-JUL-1990) by Meuth M. FEATURES Location/Qualifiers source 1..2758 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa and testis" CDS 76..1851 /note="CTP synthetase (AA 1-591)" /codon_start=1 /db_xref="PID:g30293" /db_xref="SWISS-PROT:P17812" /translation="MKYILVTGGVISGIGKGIIASSVGTILKSCGLHVTSIKIDPYIN IDAGTFSPYEHGEVFVLDDGGEVDLDLGNYERFLDIRLTKDNNLTTGKIYQYVINKER KGDYLGKTVQVVPHITDAIQEWVMRQALIPVDEDGLEPQVCVIELGGTVGDIESMPFI EAFRQFQFKVKRENFCNIHVSLVPQPSSTGEQKTKPTQNSVRELRGLGLSPDLVVCRC SNPLDTSVKEKISMFCHVEPEQVICVHDVSSIYRVPLLLEEQGVVDYFLRRLDLPIER QPRKMLMKWKEMADRYDRLLETCSIALVAKYTEFSDSYASVIKALEHSALAINHKLEI KYIDSADLEPITSQEEPVRYHEAWQKLCSAHGVLVPGGFGVRGTEGKIQAIAWARNQK KPFLGVCLGMQLAVVEFSRNVLGWQDANSTEFDPTTSHPVVVDMPEHNPGQMGGTMRL GKRRTLFQTKNSVMRKLYGDADYLEERHRHRFEVNPVWKKCLEEQGLKFVGQDVEGER MEIVELEDHPFFVGVQYHPEFLSRPIKPSPPYFGLLLASVGRLSHYLQKGCRLSPRDT YSDRSGSSSPDSEITELKFPSINHD" BASE COUNT 725 a 614 c 702 g 717 t ORIGIN 1 ctatccgcgc gcgtcgccgg cccagtcctg tcgctgacgg gaggatctga agccggccgc 61 aggtcaaaga gtaaaatgaa gtacattctg gttactggtg gtgttatatc aggaattgga 121 aaaggaatca ttgccagcag tgtgggcaca atactcaagt catgtggttt acatgtaact 181 tcaatcaaaa ttgaccccta cattaacatt gatgcaggaa cattctctcc ttatgagcat 241 ggtgaggttt ttgtgctgga tgatggtggg gaagtagacc ttgacctggg taactatgag 301 cggttccttg acatccgcct caccaaggac aataatctga ccactggaaa gatataccag 361 tatgtcatta acaaggaacg gaaaggagat tacttgggga aaactgtcca agttgtccct 421 catatcacag atgcaatcca ggagtgggtg atgagacagg cgttaatacc tgtagatgaa 481 gatggcctgg aacctcaagt gtgtgttatt gagcttggtg gaaccgtggg ggacatagaa 541 agcatgccct ttattgaggc cttccgtcag ttccaattca aggtcaaaag agagaacttt 601 tgtaacatcc acgtcagtct agttccccag ccaagttcaa caggggaaca gaagactaaa 661 cctacccaga atagtgttcg ggaacttaga ggacttgggc tttccccaga tctggttgta 721 tgcaggtgct caaatccact tgacacatca gtgaaggaga aaatatcaat gttctgccat 781 gttgagcctg aacaagtgat ctgtgtccac gatgtctcat ccatctaccg agtccccttg 841 ttgttagagg agcaaggggt tgtagattat tttcttcgaa gacttgacct tcctattgag 901 aggcagccaa gaaaaatgct gatgaaatgg aaagagatgg ctgacagata tgatcgcttg 961 ctggagacct gctctattgc ccttgtggcg aaatacaccg agttctcaga ctcctatgcc 1021 tctgtcatta aggctctgga gcattctgca ctggccatca accacaaatt ggaaatcaag 1081 tacatagatt ctgcggactt ggagcccatc acctcgcaag aagagcccgt gcgctaccac 1141 gaagcttggc agaagctctg tagtgctcat ggagtgctgg ttccaggagg atttggtgtt 1201 cgaggaacag aaggaaaaat ccaagcaatt gcctgggctc ggaatcagaa aaagcctttt 1261 ttgggcgtgt gcttagggat gcagttggca gtggttgaat tctcaagaaa cgtgctggga 1321 tggcaagatg ccaattctac agagtttgac cctacgacca gtcatcccgt ggtcgtagac 1381 atgccagaac acaacccagg gcagatgggc ggaaccatga ggctgggcaa gaggagaacc 1441 ctgttccaga ccaagaactc agtcatgagg aaactctatg gagacgcaga ctacttggaa 1501 gagaggcacc gccaccgatt tgaggtgaat ccagtctgga aaaagtgttt ggaagaacaa 1561 ggcttgaagt ttgttggcca agatgttgaa ggagagagaa tggaaattgt ggagttagaa 1621 gatcatccct tttttgttgg ggttcagtac caccctgagt tcctgtccag gcctatcaag 1681 ccctccccac catactttgg cctcctcctg gcctctgtgg ggcggctctc acattacctc 1741 cagaaaggct gcaggctctc acccagggac acctatagtg acaggagtgg aagcagctcc 1801 cctgactctg aaatcaccga actgaagttt ccatcaataa atcatgactg atcttgtagc 1861 ggatgattct tcaagagacc cttcaaactt gggtagagtt tacagctctg actttacact 1921 cggctttgga gactttcttt aaattatgtt tttattaaga ttattttatt atgcggaaag 1981 gtatttggga aacttgtcac ttgcatgtcc catcacgtgt actggctcct ctgtggtgtc 2041 tgcctgttgc gtgacactct ccttgcagtt cttgagttgc ggcagaacat cgcgatggga 2101 accgatggtg ggtggggctg cagatgtccc catcggtcac cttgtttctc aactacctcg 2161 catcattgca gatcgtagcg cgttgcctgt cgctttccct tggataccta gaccgttata 2221 aagtgtgcca catggactta ccgagcatgg agagaggatt ttagctagga tttgaacact 2281 tggtgctggg aacctcaggg tattgcttgc cactaagcca tgaaaccaga gacaaaatct 2341 ctatactgcc ctgagttggg gggaattctc agtgccaact gtggctggtc ctcattcaaa 2401 gggacggtca gtttggtgtc aacatgaaac accaagatgt ctgtctctga agcgtgattt 2461 taaaatcccc atgcctgtgc gtgcgcttcc tatttctagg gctgggaaac actccttgca 2521 tcaaggggtc acttacagaa caaagaatct tttgggggaa acttcctcta aaaccctctc 2581 atatatagac agctttgact ggagggtcca tttttcttcc aggatggtgt tactgcagtt 2641 gaagggcaat atgaagttac tttcttaatg tgacctagca ataggcatag ctacgtggca 2701 ctatattctg gccagactcg atgtgtactc taacttaaga aataaatcag taaggcag // LOCUS HSCXYP 1702 bp RNA PRI 22-MAR-1995 DEFINITION Human mRNA for carboxypeptidase N small subunit (EC 3.4.17.3). ACCESSION X14329 NID g30296 KEYWORDS carboxypeptidase; metalloproteinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1702) AUTHORS Gebhard,W., Schube,M. and Eulitz,M. TITLE cDNA cloning and complete primary structure of the small, active subunit of human carboxypeptidase N (kininase 1) JOURNAL Eur. J. Biochem. 178 (3), 603-607 (1989) MEDLINE 89107181 FEATURES Location/Qualifiers source 1..1702 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda gt11" CDS 214..1590 /note="carboxypeptidase N precursor (AA -20 to 438)" /codon_start=1 /db_xref="PID:g30297" /db_xref="SWISS-PROT:P15169" /translation="MSDLLSVFLHLLLLFKLVAPVTFRHHRYDDLVRTLYKVQNECPG ITRVYSIGRSVEGRHLYVLEFSDHPGIHEPLEPEVKYVGNMHGNEALGRELMLQLSEF LCEEFRNRNQRIVQLIQDTRIHILPSMNPDGYEVAAAQGPNKPGYLVGRNNANGVDLN RNFPDLNTYIYYNEKYGGPNHHLPLPDNWKSQVEPETRAVIRWMHSFNFVLSANLHGG AVVANYPYDKSFEHRVRGVRRTASTPTPDDKLFQKLAKVYSYAHGWMFQGWNCGDYFP DGITNGASWYSLSKGMQDFNYLHTNCFEITLELSCDKFPPEEELQREWLGNREALIQF LEQVHQGIKGMVLDENYNNLANAVISVSGINHDVTSGDHGDYFRLLLPGIYTVSATAP GYDPETVTVTVGPAEPTLVNFHLKRSIPQVSPVRRAPSRRHGVRAKVQPQARKKEMEM RQLQRGPA" sig_peptide 214..273 /note="signal peptide (AA -20 to -1)" mat_peptide 274..1587 /note="mature carboxypeptidase N (AA 1-438)" polyA_site 1702 /note="polyA site" BASE COUNT 422 a 452 c 461 g 367 t ORIGIN 1 tgaaagggag tgagggagga gagatgagtg gctattccag aacgacataa agaatttcca 61 gccttggacg gacagctggg aacgtcttcc aatttggact ggtgtttaca agcgggaagc 121 taggtggacc ttggattttg gcgggtgaag aggctaggtt gtttaaggag gtggggcgcg 181 tttcagtggc tctctttgaa aaagcccagc aagatgtcag acctgctctc agtcttcctc 241 cacctcctcc ttctcttcaa gttggttgcc ccggtgacct ttcgccacca ccgctatgat 301 gatcttgtgc ggacgctgta caaggtgcaa aacgaatgcc ccggcatcac gcgggtctac 361 agcattgggc gcagcgtgga ggggagacac ctctacgtgc tggagttcag cgaccaccct 421 ggaatccacg agcccttgga accagaggtc aagtatgtgg ggaacatgca cggcaacgaa 481 gcgttgggcc gcgagctgat gctgcagctg tcggagtttc tgtgcgagga gttccggaac 541 aggaaccagc gcatcgtcca gctcatccag gacacgcgca ttcacatcct gccatccatg 601 aaccccgacg gctacgaggt ggctgctgcc cagggcccaa acaagcctgg gtatctagtt 661 ggcaggaaca atgcaaatgg agtggacctg aaccgcaact tccctgatct caatacctat 721 atctactata acgagaagta cggaggcccc aaccaccacc tgccccttcc agacaactgg 781 aaaagtcagg tggaacccga gacccgggcg gtgatccggt ggatgcactc cttcaacttt 841 gttctttcag ccaatctcca cggaggggcg gtggtggcca attacccgta tgacaagtcc 901 tttgagcacc gggtccgagg ggtccgccgc accgccagca cccccacgcc tgacgacaag 961 ctcttccaga agctggccaa ggtctactcc tatgcacatg gatggatgtt ccaaggttgg 1021 aactgcggag attacttccc agatggcatc accaatgggg cttcctggta ttctctcagc 1081 aagggaatgc aagactttaa ttatctccat accaactgct ttgagatcac gctggaactg 1141 agttgcgaca agtttccccc cgaagaggag ttacagcggg agtggctggg taatcgggaa 1201 gccctaatcc agttcctgga acaggttcac cagggcatca agggaatggt gcttgatgag 1261 aattacaata atctcgccaa tgctgtcatt tctgtcagtg ggattaacca tgatgtcact 1321 tcaggtgacc atggtgatta cttccggctg ctgcttccag gtatctacac tgttagtgcc 1381 acagcacctg ggtatgaccc agagacagta actgtgaccg tgggtcctgc ggaaccaacg 1441 ttggttaact tccacctcaa aagaagcatc cctcaagtaa gccctgtgag gagagctccc 1501 agcagaaggc acggagtcag agccaaagtg cagccccaag ccagaaagaa agaaatggag 1561 atgaggcagc tgcagagagg ccctgcctga aacccacagt gccaggcaac ccttcagaaa 1621 ggctttgctc ctgctctcag atcagatcaa gcattctttc tattttatta tctgggacat 1681 atttaaatac aaacatattc ag // LOCUS HSCYCD2 1129 bp RNA PRI 30-MAR-1993 DEFINITION H.sapiens mRNA for cyclin D2. ACCESSION X68452 NID g38415 KEYWORDS binding protein; CCND2 gene; cyclin D2; protein kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1129) AUTHORS Peters,G. TITLE Direct Submission JOURNAL Submitted (17-SEP-1992) G. Peters, Imperial Cancer Research Fund, P.O.Box 123, Lincoln's Inn Fields, London WC2A 3PX, UK REFERENCE 2 (bases 1 to 1129) AUTHORS Palmero,I., Holder,A., Sinclair,A.J., Dickson,C. and Peters,G. TITLE Cyclins D1 and D2 are differentially expressed in human B-lymphoid cell lines JOURNAL Oncogene 8 (4), 1049-1054 (1993) MEDLINE 93205384 COMMENT related sequences: M90814 & M88080-85. FEATURES Location/Qualifiers source 1..1129 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Raji" /chromosome="12p13" gene 156..1025 /gene="CCND2" CDS 156..1025 /gene="CCND2" /codon_start=1 /product="cyclin D2" /db_xref="PID:g38416" /db_xref="SWISS-PROT:P30279" /translation="MELLCHEVDPVRRAVRDRNLLRDDRVLQNLLTIEERYLPQCSYF KCVQKDIQPYMRRMVATWMLEVCEEQKCEEEVFPLAMNYLDRFLAGVPTPKSHLQLLG AVCMFLASKLKETSPLTAEKLCIYTDNSIKPQELLEWELVVLGKLKWNLAAVTPHDFI EHILRKLPQQREKLSLIRKHAQTFIALCATDFKFAMYPPSMIATGSVGAAICGLQQDE EVSSLTCDALTELLAKITNTDVDCLKACQEQIEAVLLNSLQQYRQDQRDGSKSEDELD QASTPTDVRDIDL" BASE COUNT 258 a 321 c 316 g 234 t ORIGIN 1 ctctctctgc cctcacctct cccccgaaaa ccccctattt agccaaagga aggaggtcag 61 gggaacgctc tcccctcccc ttccaaaaaa caaaaacaga aaaacccttt tccaggccgg 121 ggaaagcagg agggagaggg gccgccgggc tggccatgga gctgctgtgc cacgaggtgg 181 acccggtccg cagggccgtg cgggaccgca acctgctccg agacgaccgc gtcctgcaga 241 acctgctcac catcgaggag cgctaccttc cgcagtgctc ctacttcaag tgcgtgcaga 301 aggacatcca accctacatg cgcagaatgg tggccacctg gatgctggag gtctgtgagg 361 aacagaagtg cgaagaagag gtcttccctc tggccatgaa ttacctggac cgtttcttgg 421 ctggggtccc gactccgaag tcccatctgc aactcctggg tgctgtctgc atgttcctgg 481 cctccaaact caaagagacc agcccgctga ccgcggagaa gctgtgcatt tacaccgaca 541 actccatcaa gcctcaggag ctgctggagt gggaactggt ggtgctgggg aagttgaagt 601 ggaacctggc agctgtcact cctcatgact tcattgagca catcttgcgc aagctgcccc 661 agcagcggga gaagctgtct ctgatccgca agcatgctca gaccttcatt gctctgtgtg 721 ccaccgactt taagtttgcc atgtacccac cgtcgatgat cgcaactgga agtgtgggag 781 cagccatctg tgggctccag caggatgagg aagtgagctc gctcacttgt gatgccctga 841 ctgagctgct ggctaagatc accaacacag acgtggattg tctcaaagct tgccaggagc 901 agattgaggc ggtgctcctc aatagcctgc agcagtaccg tcaggaccaa cgtgacggat 961 ccaagtcgga ggatgaactg gaccaagcca gcacccctac agacgtgcgg gatatcgacc 1021 tgtgaggatg ccagttgggc cgaaagagag agacgcgtcc ataatctggt ctcttcttct 1081 ttctggttgt ttttgttctt tgtgttttag ggtgaaactt aaaaaaaaa // LOCUS HSCYCLF 4238 bp RNA PRI 11-AUG-1995 DEFINITION H.sapiens mRNA for cyclin F. ACCESSION Z36714 NID g562752 KEYWORDS cyclin; cyclin F. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4238) AUTHORS Kraus,B., Pohlschmidt,M., Leung,A.L., Germino,G.G., Snarey,A., Schneider,M.C., Reeders,S.T. and Frischauf,A.M. TITLE A novel cyclin gene (CCNF) in the region of the polycystic kidney disease gene (PKD1) JOURNAL Genomics 24 (1), 27-33 (1994) MEDLINE 95203887 REFERENCE 2 (bases 1 to 4238) AUTHORS Kraus,B. TITLE Direct Submission JOURNAL Submitted (17-AUG-1994) Barbara Kraus, Molecular Analysis of Mammalian Mutation, Imperial, Cancer Research Fund, 44 Lincoln's Inn Fields, London, WC2A 3PX, United Kingdom REFERENCE 3 (bases 1 to 4238) AUTHORS Nehls,M., Luno,K., Schorpp,M., Pfeifer,D., Krause,S., Matysiak-Scholze,U., Dierbach,H. and Boehm,T. TITLE YAC/P1 contigs defining the location of 56 microsatellite markers and several genes across a 3.4-cM interval on mouse chromosome 11 JOURNAL Mamm. Genome 6 (5), 321-331 (1995) MEDLINE 95352957 FEATURES Location/Qualifiers source 1..4238 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA 1..4238 CDS 44..2404 /codon_start=1 /product="cyclin F" /db_xref="PID:g562753" /db_xref="SWISS-PROT:P41002" /translation="MGSGGVVHCRCAKCFCYPTKRRIRRRPRNLTILSLPEDVLFHIL KWLSVEDILAVRAVHSQLKDLVDNHASVWACASFQELWPSPGNLKLFERAAEKGNFEA AVKLGIAYLYNEGLSVSDEARAEVNGLKASRFFSLAERLNVGAAPFIWLFIRPPWSVS GSCCKAVVHESLRAECQLQRTHKASILHCLGRVLSLFEDEEKQQQAHDLFEEAAHQGC LTSSYLLWESDRRTDVSDPGRCLHSFRKLRDYARKGCWEAQLSLAKACANANQLGLEV RASSEIVCQLFQASQAVSKQQVFSVQKGLNDTMRYILIDWLVEVATMNDFTSLCLHLT VECVDRYLRRRLVPRYRLQLLGIACMVICTRFISKEILTIREAVWLTDNTYKYEDLVR MMGEIVSALEGKIRVPTVVDYKEVLLTLVPVELRTQHLCSFLCELSLLHTSLSAYAPA RLAAAALLLARLTHGQTQPWTTQLWDLTGFSYEDLIPCVLSLHKKCFHDDAPKDYRQV SLTAVKQRFEDKRYGEISQEEVLSYSQLCAALGVTQDSPDPPTFLSTGEIHAFLSSPS GRRTKRKRENSLQEDRGSFVTTPTAELSSQEETVLGSFLDWSLDCCSGYEGDQESEGE KEGDVTAPSGILDVTVVYLNPEQHCCQESSDEEACPEAKGPQDPQALALDTQIPATPG PKPLVRTSREPGKDVTTSGYSSVSTASPTSSVDGGLGALPQPTSVLSLHSDSHTQPCH HQARKSCLQCRPPSPPESSVPQQQVKRINLCIHSEEEDMNLGLVRL" BASE COUNT 918 a 1215 c 1212 g 893 t ORIGIN 1 gcgctctcag gcgggctccg gcggcagcga cgcgagcgcg gcgatgggga gcggcggcgt 61 ggtccactgt aggtgtgcca agtgtttctg ttatcctaca aagcgaagaa taaggaggag 121 gccccgaaac ctgaccatct tgagtctccc cgaagatgtg ctctttcaca tcctgaaatg 181 gctttctgta gaggacatcc tggccgtccg agctgtacac tcccagctga aggacctggt 241 ggacaaccac gccagtgtgt gggcatgtgc cagcttccag gagctgtggc cgtctccagg 301 gaacctgaag ctctttgaaa gggctgctga aaaggggaat ttcgaagctg ctgtgaagct 361 gggcatagcc tacctctaca atgaaggcct gtctgtgtct gatgaggccc gcgcagaagt 421 gaatggcctg aaggcctctc gcttcttcag tctcgctgag cggctgaatg tgggtgccgc 481 acctttcatc tggctcttca tccgccctcc gtggtcggtg agcggaagct gctgcaaggc 541 cgtggttcac gagagcctca gggcagagtg ccagctgcag aggactcaca aagcatccat 601 attgcactgc ttgggcagag tgctgagtct gttcgaggat gaggagaagc agcagcaggc 661 ccatgacctg tttgaggagg ctgctcatca gggatgtctg accagctcct acctcctctg 721 ggaaagcgac aggaggacag atgtgtcaga tcctgggcga tgcctccaca gcttccgaaa 781 actcagggac tacgctcgca aaggctgctg ggaagcgcag ctgtctttag ccaaagcctg 841 tgcaaatgca aaccagcttg gactggaggt gagagcttcc agtgagatcg tctgccagct 901 atttcaggct tcccaggctg tcagtaaaca acaagtcttc tccgtgcaga agggactcaa 961 tgacacaatg aggtacattc tgatcgactg gctggtggaa gttgccacca tgaatgactt 1021 cacaagcctg tgcctgcacc tgaccgtgga gtgtgtggac cggtacctgc ggaggaggct 1081 ggtgccgcgg tacaggctcc agctgctggg catcgcctgc atggtcatct gcacccggtt 1141 tatcagtaaa gagatcctga ccatccggga ggccgtatgg ctcacggaca acacttacaa 1201 gtacgaggac ctggtgagaa tgatgggcga gatcgtctcc gccttggaag ggaagattcg 1261 agtccccact gtggtggatt acaaggaggt cctgctgacg ctagtccctg tggagctgag 1321 aacccagcac ctgtgcagct tcctctgcga gctctccctg ctgcacacca gcctgtccgc 1381 ctacgcccca gcccgcctgg ctgccgcagc cctgctcctg gccagactga cgcacgggca 1441 gacacagccc tggaccactc agctgtggga cctcaccgga ttctcctatg aagacctcat 1501 tccctgcgtc ttgagcctcc ataagaagtg cttccatgat gacgccccca aggactacag 1561 gcaagtctct ctgaccgccg tgaagcagcg gtttgaggac aagcgctatg gagaaatcag 1621 ccaggaagag gtgctgagct acagccagtt gtgtgctgca ttaggagtga cacaagacag 1681 ccccgacccc ccgactttcc tcagcacagg ggagatccac gccttcctca gctctccctc 1741 ggggcggaga accaaacgga agcgggagaa cagcctccag gaagacagag gcagcttcgt 1801 taccaccccc actgcggagc tgtccagcca ggaggagacc gtgctgggca gcttcctcga 1861 ctggagcctg gactgctgct ctggctatga aggcgaccag gagagtgagg gcgagaagga 1921 gggcgacgtg acagctccca gcggcatcct cgatgtcacc gtggtctacc tgaacccaga 1981 acagcattgc tgccaggaat ccagtgatga ggaggcttgt ccagaggcaa agggacccca 2041 ggacccacag gcactggcgc tggacaccca gatccctgca acccctggac ccaaacccct 2101 ggtccgcacc agccgggagc cagggaagga cgtcacgacc tcagggtact cctccgtcag 2161 caccgcaagt cccacaagct ccgtggacgg tggcttgggg gccctgcccc aacctacctc 2221 agtgctgtcc ctgcacagtg actcgcacac acagccctgc caccatcagg ccaggaagtc 2281 atgtttacag tgtcgtcccc caagtccccc ggagagcagt gttccccagc aacaggtgaa 2341 gcggataaac ctatgcatac acagtgagga ggaggacatg aacctgggcc ttgtgaggct 2401 gtaagtgtgt cagcacattt gccgcagtgg atgtgtactg agggggctgg aggcgaaggg 2461 tgggagcata gcataggaac gctgcataga ccatggaggc ctttgcgcag agagcagaga 2521 ggatgacttg cggccaccaa gtttctgtct ccgcgggagt cccgtgcaag ccatcagaat 2581 gttgaaatga gggtgaagag ctcagatccc tctctttgga aagtttagcc tggaagcagt 2641 tggccacact gtgtggaggg cacctctctg tcccttccgt gtctcactgt ctctggaagc 2701 ttcagcccat gtgtgtcctg gtgttcccag ccccaccaga gccccgtgcc gggagctgac 2761 agctttcacg cttaaggcac gtgtgacctg ggtagtcaga caccacttga gcccctgccc 2821 acatctgctg gtttggggct tcagtgggga gctgacagct gtgagcacac cactgtcccc 2881 tcatccacct cggcctgcat ggggcaccca cttccttctg ggtggggctt ccatggtaag 2941 ggggcctgcg tccctgcaca ctgcgaggac tgccttgcca caggcccact ccctacgaca 3001 cgtgactcgt tttagagctc tgtcccagag gcgttcgtat gtgacccaca gatggcgtca 3061 atgtgaacac ctctctttgt gctgaatttc tgggccattc ttttcctgtc ttatttctaa 3121 atttccttct tccaagatga aaacaaaaga aaaacttaaa acagaaggta ttaaaaaaac 3181 aagagattcc caccattatt taggttcacc tgcaaaacaa aaatcttact ccagcccctc 3241 aatgccatcc tgacacactt tatgcaaaaa gaattttccc agataggcta gccagaaaaa 3301 acttcaagtc ctctgtaaca tctgaggtga ccaagaggca gaagagcaga gcagtcgggg 3361 gccgtgtcct ggctgatccc aactgcagct ctgctgtggg ggcccgtggg agggaggcag 3421 acccctgggc tttcctgctg gccacggaga ctctgctcct gcatggaaag ggagcctggg 3481 agccagcagc ccacgcctgg ggagcctgcc tggggccatg tgaccatggc ctctccctgg 3541 gaacgggctg accacaacac accctgctgc catccacttc tgtttactct gcaaatgtaa 3601 gaaagaacca cttggccaga agtgtccccc agatgctttt tttttttttt ttttggagac 3661 agttttgctc ttgtctcccc ggctggagtg cagtggcatg atctcaactc tcaactcact 3721 gtaacctccg cctcccggat actcctgcct cagcctcctg ggtagctggg attacaagca 3781 cccaaccacg cccagctaat ttttgtattt tcggtagaga cgggatttca ccatgttggc 3841 caggctagtc tcgaactcat gacctcaagt gatccgccca cttcggtctc ccaaagtgct 3901 gggattacag gcatgagcca cggcgcctgg cccccaaatg ctcttgaacc ggaaacccag 3961 ggatgggaga tgctcactga gctgctgctt ttatgtgtgc tggtgctatg tgtgttcatg 4021 tccgcggcag ctgtcttttt gctactataa gggaattctg gccaccctgg gtggggtgtg 4081 gtcggggtga gaacccaagc gttggaactg tagacccgtc ctgtcgactg tgtgcccctg 4141 ggcatgtgtg agcctcagtt tcctcatctg taaggggggc aatgatacct acctcacagg 4201 gggttgtgag gattaaatgt gaggaggata gtggcaac // LOCUS HSCYCNII 2119 bp RNA PRI 13-JUL-1995 DEFINITION H.sapiens mRNA for cylicin II. ACCESSION Z46788 NID g758586 KEYWORDS cylicin II. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2119) AUTHORS Hess,H., Heid,H., Zimbelmann,R. and Franke,W.W. TITLE The protein complexity of the cytoskeleton of bovine and human sperm heads: the identification and characterization of cylicin II JOURNAL Exp. Cell Res. 218 (1), 174-182 (1995) MEDLINE 95255491 REFERENCE 2 (bases 1 to 2119) AUTHORS Hess,H. TITLE Direct Submission JOURNAL Submitted (22-NOV-1994) Holger Hess, Division for Cell Biology, German Cancer Research, Center, Im Neuenheimer Feld 280, Heidelberg, Baden-Wuerttemberg, 69120, Germany FEATURES Location/Qualifiers source 1..2119 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="5.21/B20" /tissue_type="testis" /cell_type="spermatozoon" /clone_lib="lambda ZAP II cDNA" /sex="Male" CDS 4..1050 /codon_start=1 /product="human cylicin II" /db_xref="PID:g758587" /translation="MSLPRFQRVNFGPYDNYIPVSELSKKSWNQQHFALLFPKPQRPG TKRRSKPSQIRDNTVSIIDEEQLRGDRRQPLWMYRSLMRISERPSVYLAARRQPLKPT RTVEVDSKAAEIGKKGEDKTTQKDTTDSESELKQGKKDSKKGKDIEKGKEEKLDAKKD SKKGKKDAEKGKDSATESEDEKGGAKKDNKKDKKDSNKGKDSATESEGEKGGTEKDSK KGKKDSKKGKDSAIELQAVKADEKKDEDGKKDANKGDESKDAKKDAKEIKKGKKDKKK PSSTDSDSKDDVKKESKKDATKDAKKVAKKDTEKESADSKKDAKKNAKKDAKKDAKKN AKKDEKKDAKKKGK" BASE COUNT 898 a 338 c 453 g 430 t ORIGIN 1 gggatgtctc tcccaagatt ccaaagagta aactttgggc catatgataa ttacattcca 61 gtcagtgaat taagcaaaaa atcatggaat cagcaacact ttgccctgtt atttcccaaa 121 ccacaacggc caggaaccaa aaggagatca aaaccttctc aaatacggga caacacggtt 181 tctataattg atgaagaaca attaagagga gatcgtagac aaccattatg gatgtaccgt 241 tctttaatga gaatttctga gagaccatct gtttatttag ctgccaggag gcagcctctc 301 aaaccaactc gtactgtcga ggtggattct aaagcagcag aaattggtaa gaaaggtgaa 361 gacaagacaa cacagaagga cacaacagat tcggaatcag aattaaaaca aggaaaaaaa 421 gattcaaaga aaggcaagga tatagagaaa ggaaaagaag aaaagctaga tgcaaagaaa 481 gatagcaaaa aaggtaaaaa ggatgcagag aagggcaaag actcagcaac agaatctgaa 541 gatgaaaaag gaggtgcaaa gaaagataac aaaaaagata aaaaggattc aaacaaaggc 601 aaagactcgg caacagaatc tgaaggtgaa aaaggaggta cagagaaaga tagcaaaaaa 661 ggtaaaaagg attcaaagaa gggcaaggat tcagccatag aattacaagc tgtaaaagca 721 gatgaaaaga aggatgagga tggaaaaaaa gatgcaaaca aaggtgatga atcgaaggat 781 gccaagaaag atgcaaagga gattaaaaaa ggtaagaaag ataagaagaa gcccagtagt 841 acagacagtg actcaaagga tgatgtcaag aaagagtcta agaaggacgc cacgaaagat 901 gccaagaaag ttgccaagaa agatactgag aaagaatctg ctgattcaaa gaaggatgca 961 aagaaaaatg ctaagaagga tgcaaagaag gatgcaaaga agaatgcaaa gaaggatgaa 1021 aagaaggatg caaagaagaa gggcaagtag gccttggata agaatttgaa ccgaaagaat 1081 aattcaaaag catatttgat gaaacaatag tggtagtctg cagctgaatt tgtgagaaaa 1141 caagaggcct caaagaatta aataattttt aaaaggtggt aaagaaggat acaaaggaga 1201 actcagcaga gatttataaa aatatataag aaagatgtta agaaaaatta aggggggatc 1261 cattgaaaga cttgaagaat acatatactt atattcgggg atatgaagga ttcaatgaat 1321 gatctctaga aagatttaaa gaagaatatt cagataagga tgttgaagat aatgacacta 1381 aatctatgga cacacacaca cacacacaca cacacacaca cacacacaca cacacacaca 1441 gtttaatgaa ggcttaaaga atccaaggag acagatgttg tatctataga tttaaaataa 1501 tctcaagctg catccacaga tacaaaaaaa tataggagcc atgaaatgaa gttaaacacc 1561 ttgtagaaag cctactttca agtaaatata gcattccttt ggcaaaaaaa aaaaaaaaag 1621 taaaagaaaa gaaaaaaaat gtattaccac ccgatgagcc tacatctttc tttctgaagt 1681 ttaaacaacc acagtcatgt ttatggttgt tggatgtgcc acttccaatc caaaagaagc 1741 acacttgtct cgctctgttg ccctggctgg agttcagtga catgatctca ggcttactac 1801 agcctccacc ttcaggttca agtgattctc ctgcctcagc ctctctggta gctgggatta 1861 cagggaagga aaatagctgg catttcttgc tggagaaagt tatcacaagt ggaaaggtta 1921 ccctagaaac aaaagatgac taccgtgagg atggattgga cattacacca cactggaaaa 1981 gtgaaacatc tcattcagtt ggtgatcttc ctgtcatgaa agttgcttga cagctctaat 2041 tacgaattga acacattaaa cacctggtga aattatcaat taaaaacatc tagtcaccat 2101 aaaaaaaaaa aaaaaaccc // LOCUS HSCYSUV 439 bp RNA PRI 12-SEP-1993 DEFINITION Human radiated keratinocyte mRNA for cysteine protease inhibitor. ACCESSION X05978 NID g30374 KEYWORDS cystatin A; cysteine protease inhibitor; protease inhibitor; stefin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 439) AUTHORS Kartasova,T., Cornelissen,B.J., Belt,P. and van de Putte,P. TITLE Effects of UV, 4-NQO and TPA on gene expression in cultured human epidermal keratinocytes JOURNAL Nucleic Acids Res. 15 (15), 5945-5962 (1987) MEDLINE 87316861 COMMENT clone 283 deduced AA sequence is completely identical to stefin AA sequence (isolated from human polymorphonuclear granulocytes); stefin has been found to be immunologically identical to cystatin A, isolated from human skin Data kindly reviewed (15-April-1988) by Putte P. FEATURES Location/Qualifiers source 1..439 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="epidermal keratinocytes(UV radiated)" /clone="283" CDS 46..342 /note="gene product (clone 283) (AA 1-98); put. stefin" /codon_start=1 /db_xref="PID:g30375" /db_xref="SWISS-PROT:P01040" /translation="MIPGGLSEAKPATPEIQEIVDKVKPQLEEKTNETYGKLEAVQYK TQVVAGTNYYIKVRAGDNKYMHLKVFKSLPGQNEDLVLTGYQVDKNKDDELTGF" variation 100 /note="u is c in clones 301 and 242" variation 184 /note="g is c in clone 301" misc_feature 418..423 /note="pot. polyA signal" polyA_site 439 /note="polyA site" BASE COUNT 153 a 87 c 90 g 109 t ORIGIN 1 actttggttc cagcatcctg tccagcaaag aagcaatcag ccaaaatgat acctggaggc 61 ttatctgagg ccaaacccgc cactccagaa atccaggaga ttgttgataa ggttaaacca 121 cagcttgaag aaaaaacaaa tgagacttat ggaaaattgg aagctgtgca gtataaaact 181 caagttgttg ctggaacaaa ttactacatt aaggtacgag caggtgataa taaatatatg 241 cacttgaaag tattcaaaag tcttcccgga caaaatgagg acttggtact tactggatac 301 caggttgaca aaaacaagga tgacgagctg acgggctttt agcagcatgt acccaaagtg 361 ttctgattcc ttcaactggc tactgagtca tgatccttgc tgataaatat aaccatcaat 421 aaagaagcat tcttttcca // LOCUS HSCYTASNS 1874 bp mRNA PRI 14-JAN-1998 DEFINITION Homo sapiens mRNA for cytosolic asparaginyl-tRNA synthetase. ACCESSION AJ000334 NID g2764504 KEYWORDS asnS gene; asparaginyl-tRNA synthetase; cytosolic. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1874) AUTHORS Beaulande,M., Tarbouriech,N. and Hartlein,M. TITLE Human cytosolic asparaginyl-tRNA synthetase: cDNA sequence, functional expression in Escherichia coli and characterization as human autoantigen JOURNAL Nucleic Acids Res. 26, 521-524 (1998) REFERENCE 2 (bases 1 to 1874) AUTHORS Beaulande,M.M. TITLE Direct Submission JOURNAL Submitted (08-JUL-1997) Beaulande M.M., Outstation Grenoble, EMBL, c/o ILL, BP 156X, 38042 Grenoble cedex, FRANCE FEATURES Location/Qualifiers source 1..1874 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Marathon ready Clontech" /tissue_type="liver" gene 74..1720 /gene="asnS" CDS 74..1720 /gene="asnS" /EC_number="6.1.1.22" /function="aminoacyl-tRNA synthetase" /codon_start=1 /evidence=experimental /product="asparaginyl-tRNA synthetase" /db_xref="PID:e1227566" /db_xref="PID:g2764505" /translation="MVLAELYVSDREGSDATGDGTKEKPFKTGLKALMTVGKEPFPTI YVDSQKENERWNVISKSQLKNIKKMWHREQMKSESREKKEAEDSLRREKNLEEAKKIT IKNDPSLPEPKCVKIGALEGYRGQRVKVFGWVHRLRRQGKNLMFLVLRDGTGYLQCVL ADELCQCYNGVLLSTESSVAVYGMLNLTPKGKQAPGGHELSCDFWELIGLAPAGGADN LINEESDVDVQLNNRHMMIRGENMSKILKARSMVTRCFRDHFFDRGYYEVTPPTLVQT QVEGGATLFKLDYFGEEAFLTQSSQLYLETCLPALGDVFCIAQSYRAEQSRTRRHLAE YTHVEAECPFLTFDDLLNRLEDLVCDVVDRILKSPAGSIVHELNPNFQPPKRPFKRMN YSDAIVWLKEHDVKKEDGTFYEFGEDIPEAPERLMTDTINEPILLCRFPVEIKSFYMQ RCPEDSRLTESVDVLMPNVGEIVGGSMRIFDSEEILAGYKREGIDPTPYYWYTDQRKY GTCPHGGYGLGLERFLTWILNRYHIRDVCLYPRFVQRCTP" BASE COUNT 545 a 370 c 481 g 478 t ORIGIN 1 ccgacatgtt gagtcataag acgcgtcggt gttgcagtct gtgtccttgg aggtgaccag 61 ggccactgca ggcatggtgc tagcagagct gtacgtctct gaccgagagg gaagcgatgc 121 cacgggagat ggaaccaagg agaaaccatt taaaacaggt ctaaaggctt tgatgacagt 181 agggaaagaa ccatttccta ccatttacgt agattcacaa aaagaaaatg agaggtggaa 241 tgttatttct aaatcacagt tgaagaacat taaaaagatg tggcataggg aacaaatgaa 301 gagtgaatcc cgggaaaaga aagaggcaga agatagttta cgaagagaaa agaacctgga 361 agaagcaaag aagattacca ttaaaaatga tccaagtctc ccagagccaa aatgtgtgaa 421 gattggtgcg ttagaaggat atagaggcca aagagtaaag gtgtttggct gggtccacag 481 gctgcgcagg caaggaaaga atttaatgtt tctggtgttg cgagatggta caggttatct 541 tcagtgtgtc ttggcggatg agttgtgtca gtgctacaat ggagttctct tgtccacgga 601 gagcagtgtt gcagtgtatg gaatgctaaa tcttacccca aagggcaagc aggctccagg 661 tggccatgag ctgagttgtg acttctggga actaattggg ttggcccctg ctggaggagc 721 tgacaacctg atcaatgagg agtctgacgt tgatgtccag ctcaacaaca gacacatgat 781 gatccgagga gaaaacatgt ccaaaatcct aaaagcacga tccatggtca ccaggtgctt 841 tagagatcac ttctttgata gggggtacta tgaagttact cctccaacat tagtgcaaac 901 acaagtagaa ggtggtgcca cactcttcaa gcttgactat tttggggaag aggcattttt 961 gactcaatcc tctcagttgt acttggagac ctgcctccca gccctgggag atgttttttg 1021 tattgctcag tcataccggg cagagcagtc cagaacacga aggcacctgg ctgagtacac 1081 tcacgtggaa gctgagtgtc ctttcctgac ttttgacgac ctcctgaacc ggttggagga 1141 cttggtttgt gatgtggtag atcgaatatt gaagtcacct gcagggagca tagtgcatga 1201 gctcaacccg aactttcagc cccccaaacg gcctttcaaa cggatgaact attcagatgc 1261 tatcgtttgg ctaaaagaac atgatgtaaa gaaagaagat ggaactttct atgaatttgg 1321 agaagatatc ccagaagctc ctgagagact gatgacagac accattaatg aaccaatctt 1381 gctgtgtcga tttcctgtgg agatcaagtc cttctacatg cagcgatgtc ctgaggattc 1441 ccgtcttact gaatctgtcg acgtgttgat gcccaatgtt ggtgagattg tgggaggctc 1501 aatgcgtatc tttgatagtg aagaaatact ggcaggttat aaaagggaag ggattgaccc 1561 cactccctat tactggtata cggatcagag aaaatacggt acatgtcccc atggaggata 1621 tggcttgggc ttggaacgat tcttaacgtg gattctgaat aggtatcaca tccgagacgt 1681 gtgcttatac cctcgatttg tccagcgttg cacgccataa ccattttctc cagaagcgtg 1741 gaggaaagat tatgaaagga acaggctctt taaaaaagaa aacaaaaagc cagaatcttc 1801 ctttttttgt ttcattgggg tttctctttc tgtttttctt tctactacca taaaaactat 1861 ctcaaatcac ctga // LOCUS HSD13S106 1181 bp RNA PRI 25-MAR-1992 DEFINITION H.sapiens D13S106 mRNA for a highly charged amino acid sequene. ACCESSION X59131 NID g30387 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1181) AUTHORS Wilton,A.N. TITLE Direct Submission JOURNAL Submitted (20-APR-1991) A.N. Wilton, Macquarie University, School of Biological Sciences, NSW 2109, AUSTRALIA REFERENCE 2 (bases 1 to 1181) AUTHORS Wilton,A.N., Zehavi-Feferman,T., Fleming,J., Baker,E., Chen,L.Z. and Cooper,D.W. TITLE A unique intronless gene or gene seqment on chromosome 13 specifying a highly charged amino acid sequence JOURNAL Cytogenet. Cell Genet. 58, 1985-1985 (1991) FEATURES Location/Qualifiers source 1..1181 /organism="Homo sapiens" /isolate="IGR3" /db_xref="taxon:9606" /tissue_type="melanoma" /cell_line="IGR3" /clone_lib="lambda-gt11 (RM Hope, Univ.Adelaide)" /clone="39" /chromosome="13" /map="13q12-q14" CDS 42..989 /codon_start=1 /product="highly charged amino acid sequence" /db_xref="PID:g30388" /translation="MASSVSAPCNEKLIQDQFVDISFPSQVVNTNMQSVQLNTEDTVN TKSVNNTDATGLIQGVKSVEIEKDAQLKQFLTPKTEQLKPERVTSQVSNLKKKETTAD SQTTTSKSLQNQSLKENQKKPFVGSWVKGLISRGASFMPLCVSAHNRNTITDLQPSVK GVNNFGGFKTKGINQKASHVSKKARKSASKPPPISKPPAGPPSSNGTAAHPHAHAASE VLEKSGSTSCGAQLNHSSYGNGISSANHEDLVEGQIHKLRLKLRKKLKAEKKKLAALM SSPQSRTVRSENLEQVPQDGSPNDCESIEDLLNEATISN" BASE COUNT 401 a 245 c 231 g 304 t ORIGIN 1 gaattctcaa aaccaatact ttgctatcac aagaatcact aatggcttct tcagtatcag 61 ctccatgtaa tgaaaagctt attcaagacc aatttgtgga cataagtttt ccatcccaag 121 ttgtaaatac aaacatgcag tcagtacagc tgaatacaga agatactgta aatactaaat 181 ctgtgaataa tactgatgct actggtctta tacagggagt gaagtcagta gaaattgaga 241 aggacgctca gttaaaacaa ttccttacac caaaaactga acaattaaaa ccagaacgtg 301 tcacatctca ggtatctaat ttgaagaaaa aagaaactac agcagattct caaaccacaa 361 catccaagtc attacagaat cagtctctga aagaaaatca gaagaagcca tttgtgggaa 421 gttgggttaa aggcttaata agcaggggtg cttcttttat gccactctgt gtttcagctc 481 ataatagaaa cactataact gatttacaac cttcagttaa aggggtaaat aattttggtg 541 gctttaaaac taaaggtata aaccagaagg ccagccacgt atccaagaaa gctcgtaaga 601 gtgcaagtaa gcctcctccc atcagtaagc caccagcagg ccctccatcg tctaatggca 661 cagctgccca cccacatgct catgctgctt cagaagtttt ggaaaagtct ggaagcacct 721 catgtggagc tcaactcaac cacagttctt atgggaatgg tatttcttca gcaaaccatg 781 aagacttggt ggaaggtcag attcataaac ttcgtctaaa acttcgtaaa aagctaaagg 841 cagaaaagaa gaaattagct gctcttatgt cttccccgca aagcagaaca gttcgaagtg 901 aaaatctaga acaggtgccc caggatgggt ctccaaatga ttgtgaatca atagaggact 961 tgttaaatga agctaccata tccaattgat attgccagtg agtctgcatg caccactgtt 1021 cctggtgttt ccctgtacgt agtcaaactc atgaagaaat tttagcggaa ttattgtctc 1081 ctacacctct ttcaacagag ctgtcagaaa atggggaagg tgactttagg tatttgggaa 1141 tgggagatag tcatatccca ccaccagtac caagtgaatt c // LOCUS HSD1DORE 2342 bp RNA PRI 04-JUN-1991 DEFINITION Human mRNA for D-1 dopamine receptor. ACCESSION X58987 X59308 NID g30398 KEYWORDS dopamine receptor D1. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2342) AUTHORS Zhou,Q.Y., Grandy,D.K., Thambi,L., Kushner,J.A., Van Tol,H.H., Cone,R., Pribnow,D., Salon,J., Bunzow,J.R. and Civelli,O. TITLE Cloning and expression of human and rat D1 dopamine receptors JOURNAL Nature 347 (6288), 76-80 (1990) MEDLINE 90370094 REFERENCE 2 (bases 1 to 2342) AUTHORS Zhou,Q. TITLE Direct Submission JOURNAL Submitted (29-MAY-1991) Zhou Q., Vollum Institute, Oregon Health Sciences University, Vollum Institute Mail Code: L474, 3181 S.W. Sam Jackson Park Road, Portland OR 97201-3098, U.S.A FEATURES Location/Qualifiers source 1..2342 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HGR213-1" mRNA 1..2342 /note="D-1 dopamine receptor" /evidence=experimental CDS 277..1617 /codon_start=1 /product="D-1 dopamine receptor" /db_xref="PID:g30399" /db_xref="SWISS-PROT:P21728" /translation="MRTLNTSAMDGTGLVVERDFSVRILTACFLSLLILSTLLGNTLV CAAVIRFRHLRSKVTNFFVISLAVSDLLVAVLVMPWKAVAEIAGFWPFGSFCNIWVAF DIMCSTASILNLCVISVDRYWAISSPFRYERKMTPKAAFILISVAWTLSVLISFIPVQ LSWHKAKPTSPSDGNATSLAETIDNCDSSLSRTYAISSSVISFYIPVAIMIVTYTRIY RIAQKQIRRIAALERAAVHAKNCQTTTGNGKPVECSQPESSFKMSFKRETKVLKTLSV IMGVFVCCWLPFFILNCILPFCGSGETQPFCIDSNTFDVFVWFGWANSSLNPIIYAFN ADFRKAFSTLLGCYRLCPATNNAIETVSINNNGAAMFSSHHEPRGSISKECNLVYLIP HAVGSSEDLKKEEAAGIARPLEKLSPALSVILDYDTDVSLEKIQPMTQNGQHPT" BASE COUNT 588 a 552 c 546 g 656 t ORIGIN 1 gaattcaggg gctttctggt gcccaagaca gtgacctgca gcaagggagt cagaagacag 61 atgtagaaat caagagtgac catccacggg attgacttgg attgccactc aagcggtcct 121 ctcatggaat gttggtgagg ccctctgcca gggaagcaat ctggctgtgc aaagtgctgc 181 ctggtgggga ggactcctgg aaatctgact gacccctatt ccctgcttag gaacttgagg 241 ggtgtcagag cccctgatgt gctttctctt aggaagatga ggactctgaa cacctctgcc 301 atggacggga ctgggctggt ggtggagagg gacttctctg ttcgtatcct cactgcctgt 361 ttcctgtcgc tgctcatcct gtccacgctc ctggggaaca cgctggtctg tgctgccgtt 421 atcaggttcc gacacctgcg gtccaaggtg accaacttct ttgtcatctc cttggctgtg 481 tcagatctct tggtggccgt cctggtcatg ccctggaagg cagtggctga gattgctggc 541 ttctggccct ttgggtcctt ctgtaacatc tgggtggcct ttgacatcat gtgctccact 601 gcatccatcc tcaacctctg tgtgatcagc gtggacaggt attgggctat ctccagccct 661 ttccggtatg agagaaagat gacccccaag gcagccttca tcctgatcag tgtggcatgg 721 accttgtctg tactcatctc cttcatccca gtgcagctca gctggcacaa ggcaaaaccc 781 acaagcccct ctgatggaaa tgccacttcc ctggctgaga ccatagacaa ctgtgactcc 841 agcctcagca ggacatatgc catctcatcc tctgtaataa gcttttacat ccctgtggcc 901 atcatgattg tcacctacac caggatctac aggattgctc agaaacaaat acggcgcatt 961 gcggccttgg agagggcagc agtccacgcc aagaattgcc agaccaccac aggtaatgga 1021 aagcctgtcg aatgttctca accggaaagt tcttttaaga tgtccttcaa aagagaaact 1081 aaagtcctga agactctgtc ggtgatcatg ggtgtgtttg tgtgctgttg gctacctttc 1141 ttcatcttga actgcatttt gcccttctgt gggtctgggg agacgcagcc cttctgcatt 1201 gattccaaca cctttgacgt gtttgtgtgg tttgggtggg ctaattcatc cttgaacccc 1261 atcatttatg cctttaatgc tgattttcgg aaggcatttt caaccctctt aggatgctac 1321 agactttgcc ctgcgacgaa taatgccata gagacggtga gtatcaataa caatggggcc 1381 gcgatgtttt ccagccatca tgagccacga ggctccatct ccaaggagtg caatctggtt 1441 tacctgatcc cacatgctgt gggctcctct gaggacctga aaaaggagga ggcagctggc 1501 atcgccagac ccttggagaa gctgtcccca gccctatcgg tcatattgga ctatgacact 1561 gacgtctctc tggagaagat ccaacccatg acacaaaacg gtcagcaccc aacctgaact 1621 cgcagatgaa tcctgccaca catgctcatc ccaaaagcta gaggagattg ctctggggtt 1681 tgctattaag aaactaaggt acggtgagac tctgaggtgt caggagagcc ctctgctgct 1741 ttccaacaca caattaactc cgtttccaaa tacattccag tgtattttct gtgttgttca 1801 tagtcaatca aacagggaca ctacaaacat ggggagccat aagggacatg tctttggctt 1861 cagaattgtt tttagaaatt tattcttatc ttaggattta ccaaataggg caaagaatca 1921 acagtgaaca gcttcactta aaatcaaatt tttctgggaa gaaaatgaga tgggttgagt 1981 ttgctgtata caaacaggtg ctaacactgt tcccagcaaa gttttcagat tgtaaaggta 2041 ggtgcatgcc ttcataaatt atttctaaaa cattaattga ggcttacagt aggagtgaga 2101 aatttttttc cagaattgag agatgttttg ttgatattgg ttctatttat ttattgtata 2161 tatggatatt tttaatttat gatataataa atatatattt atcatattta ataggataaa 2221 ttaatgagtt ttatccaaga ccttacaacc acatttctgg ccatttaact agcactttat 2281 aagccaatga agcaaacaca cagactctgt gagattctaa atgttcatgt gtaacttcta 2341 ga // LOCUS HSDAAO 1633 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for D-amino acid oxidase (EC 1.4.3.3). ACCESSION X13227 NID g30445 KEYWORDS amino acid oxidase; D-amino acid oxidase; oxidase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1633) AUTHORS Momoi,K., Fukui,K., Watanabe,F. and Miyake,Y. TITLE Molecular cloning and sequence analysis of cDNA encoding human kidney D-amino acid oxidase JOURNAL FEBS Lett. 238 (1), 180-184 (1988) MEDLINE 89005666 REFERENCE 2 (bases 1 to 1633) AUTHORS Momoi,K. TITLE Direct Submission JOURNAL Submitted (26-SEP-1990) Momoi K., Dept. of Biochemistry, National Cardiovascular Center Research Institute, 5-7-1 Fujishiro-dai, Suita, Osaka 565, Japan FEATURES Location/Qualifiers source 1..1633 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" /clone_lib="lambda gt11" CDS 201..1244 /note="D-amino acid oxidase (AA 1 - 347)" /codon_start=1 /db_xref="PID:g30446" /db_xref="SWISS-PROT:P14920" /translation="MRVVVIGAGVIGLSTALCIHERYHSVLQPLHIKVYADRFTPLTT TDVAAGLWQPYLSDPNNPQEADWSQQTFDYLLSHVHSPNAENLGLFLISGYNLFHEAI PDPSWKDTVLGFRKLTPRELDMFPDYGYGWFHTSLILEGKNYLQWLTERLTERGVKFF QRKVESFEEVAREGADVIVNCTGVWAGALQRDPLLQPGRGQIMKVDAPWMKHFILTHD PERGIYNSPYIIPGTQTVTLGGIFQLGNWSELNNIQDHNTIWEGCCRLEPTLKNARII GEATGFRPVRPQIRLEREQLRTGPSNTEVIHNYGHGGYGLTIHWGCALEAAKLFGRIL EEKKLSRMPPSHL" old_sequence 291 /note="c was g in [1]" /citation=[1] old_sequence 1380 /note="r was a in [1]" /citation=[1] misc_feature 1406..1411 /note="polyA signal [2]" old_sequence 1524 /note="g was a in [1]" /citation=[1] misc_feature 1606..1611 /note="pot. polyA signal" polyA_site 1629 /note="polyA site" BASE COUNT 426 a 457 c 418 g 331 t 1 others ORIGIN 1 ttggggtcca ttgcaacccg aggcgagact agagttccca agcgagaagg gaagaggcag 61 tgggtgcacg tggaaggcgg acagagggct ggaaacaaga cgctccagaa tcaggagctt 121 cccctcagga aatagcatcc tgtgtccccg cactgcagtt gtctggtctc tccagcagtt 181 tggtacttcc ggctgctgca atgcgtgtgg tggtgattgg agcaggagtc atcgggctgt 241 ccaccgccct ctgcatccat gagcgctacc actcagtcct gcagccactg cacataaagg 301 tctacgcgga ccgcttcacc ccactcacca ccaccgacgt ggctgccggc ctctggcagc 361 cctacctttc tgaccccaac aacccacagg aggcggactg gagccaacag acctttgact 421 atctcctgag ccatgtccat tctcccaacg ctgaaaacct gggcctgttc ctaatctcgg 481 gctacaacct cttccatgaa gccattccgg acccttcctg gaaggacaca gttctgggat 541 ttcggaagct gacccccaga gagctggata tgttcccaga ttacggctat ggctggttcc 601 acacaagcct aattctggag ggaaagaact atctacagtg gctgactgaa aggttaactg 661 agaggggagt gaagttcttc cagcggaaag tggagtcttt tgaggaggtg gcaagagaag 721 gcgcagacgt gattgtcaac tgcactgggg tatgggctgg ggcgctacaa cgagaccccc 781 tgctgcagcc aggccggggg cagatcatga aggtggacgc cccttggatg aagcacttca 841 ttctcaccca tgacccagag agaggcatct acaattcccc gtacatcatc ccagggaccc 901 agacagttac tcttggaggc atcttccagt tgggaaactg gagtgaacta aacaatatcc 961 aggaccacaa caccatttgg gaaggctgct gcagactgga gcccacactg aagaatgcaa 1021 gaattattgg tgaagcaact ggcttccggc cagtacgccc ccagattcgg ctagaaagag 1081 aacagcttcg cactggacct tcaaacacag aggtcatcca caactatggc catggaggct 1141 acgggctcac catccactgg ggatgtgccc tggaggcagc caagctcttt gggagaatcc 1201 tggaagaaaa gaaattgtcc agaatgccac catcccacct ctgaagactc cagtgactgc 1261 tgcctccccc cacaagaact cccttctccc ctcagccaat gaatcaatgt gctccttcat 1321 aagccattgc ttctccctca cttctttcct caaagaagca tgaggtgaga gaaagccacr 1381 aagtcagtgc ctggagaagg gttcagccca acatggggcc cctctcatca ctgaaatccc 1441 tctaccttct ctgggtctgg cattataaag aacagctgag gctgtcattc catgagtctt 1501 cagaagaaag gacagctcag aaagtcaaag aggccaactg cccagagcca cagaaaatgg 1561 aggataattg aggctaagta acctgattac aagttgtact aacatattaa aggttctgaa 1621 aagtcctgca aaa // LOCUS HSDAP1 2232 bp RNA PRI 24-FEB-1995 DEFINITION H.sapiens DAP-1 mRNA. ACCESSION X76105 NID g434844 KEYWORDS DAP-1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2232) AUTHORS Deiss,L.P., Feinstein,E., Berissi,H., Cohen,O. and Kimchi,A. TITLE Identification of a novel serine/threonine kinase and a novel 15-kD protein as potential mediators of the gamma interferon-induced cell death JOURNAL Genes Dev. 9 (1), 15-30 (1995) MEDLINE 95129831 REFERENCE 2 (bases 1 to 2232) AUTHORS Kimchi,A. TITLE Direct Submission JOURNAL Submitted (12-NOV-1993) A. Kimchi, The Weizmann Institute of Science, Dept of Molecular Genetics & Virology, Rehovot POB 26, ISRAEL FEATURES Location/Qualifiers source 1..2232 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="myeloid leukemia" /cell_line="HL-60" /clone_lib="HL-60 cDNA" gene 160..2208 /gene="DAP-1" CDS 160..468 /gene="DAP-1" /codon_start=1 /product="DAP-1" /db_xref="PID:g434845" /translation="MSSPPEGKLETKAGHPPAVKAGGMRIVQKHPHTGDTKEEKDKDD QEWESPSPPKPTVFISGVIARGDKDFPPAAAQVAHQKPHASMDKHPSPRTQHIQQPRK " polyA_signal 2203..2208 /gene="DAP-1" BASE COUNT 542 a 648 c 568 g 474 t ORIGIN 1 cgtggcactc acccggctcg cgcggccccg gccgcccacg ccgcgcgtcg ttctcccgcc 61 cgctcgctcc ccggcgctca cacctgagct cactcgcgca cgcccgcccg gcccgagaac 121 cgcgccgccg cctcggcccc gcggaagccc cgccgcgcca tgtcttcgcc tcccgaaggg 181 aaactagaga ctaaagctgg acacccgccc gccgtgaaag ctggtggaat gcgaattgtg 241 cagaaacacc cacatacagg agacaccaaa gaagagaaag acaaggatga ccaggaatgg 301 gaaagcccca gtccacctaa acccactgtg ttcatctctg gggtcatcgc ccggggtgac 361 aaagatttcc ccccggcggc tgcgcaggtg gctcaccaga agccgcatgc ctccatggac 421 aagcatcctt ccccaagaac ccagcacatc cagcagccac gcaagtgagc ctggagtcca 481 ccagcctgcc ccatggcccc ggctctgctg cacttggtat ttccctgaca gagagaacca 541 gcagtttcgc ccaaatccta ctctgctggg aaatctaagg caaaaccaag tgctctgtcc 601 tttgccttac atttccatat ttaaaactag aaacagcttc agcccaaacc ttgtttatgg 661 ggagtctggt tggatgtcat ttgaggatca ttgtgcccct agaggtgcca ttagcagaat 721 ttgccaagat ccgagaaaaa ttttagcttt agttctattt cagcagtcac ctgacgtcct 781 tgtctatggt cttaaaaaca agaaggcaca catttgagaa gatgagatta aggttaggag 841 aaaacctcag tcattgcatg ctttttagta tgggccaata aaatctcaac acctgtggga 901 gagtaagaac taagggaatg agtttgggcg ccccctcata aaggacctta gaggcaggga 961 acagcaatgc caaatttccc tctctcgtga gatgggggat cctgtgcagg ctgatgaggc 1021 acccatgaga aaagccgaaa aagcatgcat cttagaaata gcccctcaat tccaggagtc 1081 aacatgccaa agaatgaggc tggagacagg tagctccgag ggaggacttc tggcatgaga 1141 tctcggcacg gcaagcccag catcgcctca gcccagacag gctccaccag gagatcaagc 1201 aagggctgcc tttcaggagt cacctcctga gccacttcag agttctggaa gtgaccacgg 1261 accagggtgg aggaatagac ttctagttca ttctgggaca cttgagccag agagttgaaa 1321 gcttggaaag accagataag aaacctgccc tttgtctccc tagggacatg agacaccaca 1381 ttccatttgt gctagaaaaa cctatccact gatgagtcta actgttccaa acgcctccca 1441 cctggtgtgc acagctgcct gggtccattg tcacttgggt gcatcaggtt gtcctccgat 1501 ttttagatga gtttcctgtc tagagatgtc ctagtctgct cactggctgg tggcagtagg 1561 gtaccctgcg tcctcgaaaa gccagagggt tcacctagtc agacgaaact ccagaacagt 1621 gcttgtggag ggcctgactg tcctgctcac ccacagccga tctgctgcag gtcagcaact 1681 gtgtcgtgag cagctgccaa ccaccagcct ttctggtgct gttctccagt tcacgtctgc 1741 cagctggtga gggcagaggc agacctggtc agacccagcg cccctcctcc ctgagggagc 1801 atggcacagc ctcacacttg aaagacggtg tttggtttcc catctaatca acttaaggga 1861 agccggcatg tacccttcaa ggccctgtca ccacctattt tcctgatcag ttggtataaa 1921 ctgagggtgg cttttagaga cccagacttg gttggcagcg ctgccatgga acaccccagc 1981 aagcacctcc cagcctgcct ttcggagcag cacccaggag gggatgccgc gctccagcaa 2041 caccaggtca ggcctgtgca gacccctgcc ctgccgctgc agaaatccag aagcatcctt 2101 aatgcttctc agtcttcagc cagagggagg gctgttattt ccagaggtgc gctttttatg 2161 tacttttagc tagatgtggc atgcatctgt gagctttaga tcattaaatc caaaatgttt 2221 gcctaaatga gg // LOCUS HSDAP3 1608 bp RNA PRI 30-NOV-1995 DEFINITION H.sapiens DAP-3 mRNA. ACCESSION X83544 NID g1089849 KEYWORDS DAP-3 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1608) AUTHORS Kissil,J.L., Deiss,L.P., Bayewitch,M., Raveh,T., Khaspekov,G. and Kimchi,A. TITLE Isolation of DAP3, a novel mediator of interferon-gamma-induced cell death JOURNAL J. Biol. Chem. 270 (46), 27932-27936 (1995) MEDLINE 96070931 REFERENCE 2 (bases 1 to 1608) AUTHORS Kissil,J.L. TITLE Direct Submission JOURNAL Submitted (19-DEC-1994) J.L. Kissil, Weizmann Institute of Science, Dept of Molec. Genetics & Virology, Rehovot 76100, ISRAEL FEATURES Location/Qualifiers source 1..1608 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="K562 cDNA library" /cell_line="K562" gene 74..1270 /gene="DAP-3" CDS 74..1270 /gene="DAP-3" /codon_start=1 /db_xref="PID:e131922" /db_xref="PID:g1089850" /translation="MMLKGITRLISRIHKLDPGRFLHMGTQARQSIAAHLDNQVPVES PRAISRTNENDPAKHGDQHEGQHYNISPQDLETVFPHGLPPRFVMQVKTFSEACLMVR KPALELLHYLKNTSFAYPAIRYLLYGEKGTGKTLSLCHVIHFCAKQDWLILHIPDAHL WVKNCRDLLQSSYNKQRFDQPLEASTWLKNFKTTNERFLNQIKVQEKYVWNKRESTEK GSPLGEVVEQGITRVRNATDAVGIVLKELKRQSSLGMFHLLVAVDGINALWGRTTLKR EDKSPIAPEELALVHNLRKMMKNDWHGGAIVSALSQTGSLFKPRKAYLPQELLGKEGF DALDPFIPILVSNYNPKEFESCIQYYLENNWLQHEKAPTEEGKKELLFLSNANPSLLE RHCAYL" BASE COUNT 498 a 340 c 383 g 387 t ORIGIN 1 gaattccgcc ggccccaggc agcgtgtgtc ggtcgcctag gctggagaac tagtcctcga 61 ctcacgtgca aggatgatgc tgaaaggaat aacaaggctt atctctagga tccataagtt 121 ggaccctggg cgttttttac acatggggac ccaggctcgc caaagcattg ctgctcacct 181 agataaccag gttccagttg agagtccgag agctatttcc cgcaccaatg agaatgaccc 241 ggccaagcat ggggatcagc acgagggtca gcactacaac atctcccccc aggatttgga 301 gactgtattt ccccatggcc ttcctcctcg ctttgtgatg caggtgaaga cattcagtga 361 agcttgcctg atggtaagga aaccagccct agaacttctg cattacctga aaaacaccag 421 ttttgcttat ccagctatac gatatcttct gtatggagag aagggaacag gaaaaaccct 481 aagtctttgc catgttattc atttctgtgc aaaacaggac tggctgatac tacatattcc 541 agatgctcat ctttgggtga aaaattgtcg ggatcttctg cagtccagct acaacaaaca 601 gcgctttgat caacctttag aggcttcaac ctggctgaag aatttcaaaa ctacaaatga 661 gcgcttcctg aaccagataa aagttcaaga gaagtatgtc tggaataaga gagaaagcac 721 tgagaaaggg agtcctctgg gagaagtggt tgaacagggc ataacacggg tgaggaacgc 781 cacagatgca gttggaattg tgctgaaaga gctaaagagg caaagttctt tgggtatgtt 841 tcacctccta gtggccgtgg atggaatcaa tgctctttgg ggaagaacca ctctgaaaag 901 agaagataaa agcccgattg cccccgagga attagcactt gttcacaact tgaggaaaat 961 gatgaaaaat gattggcatg gaggcgccat tgtgtcggct ttgagccaga ctgggtctct 1021 ctttaagccc cggaaagcct atctgcccca ggagttgctg ggaaaggaag gatttgatgc 1081 cctggatccc tttattccca tcctggtttc caactataac ccaaaggaat ttgaaagttg 1141 tattcagtat tatttggaaa acaattggct tcaacatgag aaagctccta cagaagaagg 1201 gaaaaaagag ctgctgttcc taagtaacgc gaacccctcg ctgctggagc ggcactgtgc 1261 ctacctctaa gccaagatca cagcatgtga ggaagacagt ggacatctgc tttatgctgg 1321 acccagtaag atgaggaagt cgggcagtac acaggaagag gagccaggcc cttgtaccta 1381 tgggattgga caggactgca gttggctctg gacctgcatt aaaatgggtt tcactgtgaa 1441 tgcgtgacaa taagatattc ccttgttcct aaaactttat atcagtttat tggatgtggt 1501 ttttcacatt taagataatt atggctcttt tcctaaaaaa taaaatatct ttctaaaaaa 1561 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa // LOCUS HSDAPK 5910 bp RNA PRI 20-APR-1997 DEFINITION H.sapiens DAP-kinase mRNA. ACCESSION X76104 NID g2094872 KEYWORDS DAP-kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5910) AUTHORS Deiss,L.P., Feinstein,E., Berissi,H., Cohen,O. and Kimchi,A. TITLE Identification of a novel serine/threonine kinase and a novel 15-kD protein as potential mediators of the gamma interferon-induced cell death JOURNAL Genes Dev. 9 (1), 15-30 (1995) MEDLINE 95129831 REFERENCE 2 (bases 1 to 5910) AUTHORS Kimchi,A. TITLE Direct Submission JOURNAL Submitted (12-NOV-1993) A. Kimchi, The Weizmann Institute of Science, Dept of Molecular Genetics & Virology, Rehovot POB 26, ISRAEL REMARK revised by [3] REFERENCE 3 (bases 1 to 5910) AUTHORS Feinstein,E. TITLE Direct Submission JOURNAL Submitted (20-APR-1997) E. Feinstein, The Weizmann Institute of Science, Dept of Molecular Genetics & Virology, Rehovot POB 26, ISRAEL FEATURES Location/Qualifiers source 1..5910 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="leukemia" /cell_line="K562" /clone_lib="cDNA library K562" gene 337..5884 /gene="DAP-kinase" CDS 337..4632 /gene="DAP-kinase" /codon_start=1 /product="DAP-kinase" /db_xref="PID:e312975" /db_xref="PID:g2094873" /translation="MTVFRQENVDDYYDTGEELGSGQFAVVKKCREKSTGLQYAAKFI KKRRTKSSRRGVSREDIEREVSILKEIQHPNVITLHEVYENKTDVILILELVAGGELF DFLAEKESLTEEEATEFLKQILNGVYYLHSLQIAHFDLKPENIMLLDRNVPKPRIKII DFGLAHKIDFGNEFKNIFGTPEFVAPEIVNYEPLGLEADMWSIGVITYILLSGASPFL GDTKQETLANVSAVNYEFEDEYFSNTSALAKDFIRRLLVKDPKKRMTIQDSLQHPWIK PKDTQQALSRKASAVNMEKFKKFAARKKWKQSVRLISLCQRLSRSFLSRSNMSVARSD DTLDEEDSFVMKAIIHAINDDNVPGLQHLLGSLSNYDVNQPNKHGTPPLLIAAGCGNI QILQLLIKRGSRIDVQDKGGSNAVYWAARHGHVDTLKFLSENKCPLDVKDKSGEMALH VAARYGHADVAQVTCAASAQIPISRTKEEETPLHCAAWHGYYSVAKALCEAGCNVNIK NREGETPLLTASARGYHDIVECLAEHGADLNACDKDGHIALHLAVRRCQMEVIKTLLS QGCFVDYQDRHGNTPLHVACKDGNMPIVVALCEANCNLDISNKYGRTPLHLAANNGIL DVVRYLCLMGASVEALTTDGKTAEDLARSEQHEHVAGLLARLRKDTHRGLFIQQLRPT QNLQPRIKLKLFGHSGSGKTTLVESLKCGLLRSFFRRRRPRLSSTNSSRFPPSPLASK PTVSVSINNLYPGCENVSVRSRSMMFEPGLTKGMLEVFVAPTHHPHCSADDQSTKAID IQNAYLNGVGDFSVWEFSGNPVYFCCYDYFAANDPTSIHVVVFSLEEPYEIQLNPVIF WLSFLKSLVPVEEPIAFGGKLKNPLQVVLVATHADIMNVPRPAGGEFGYDKDTSLLKE IRNRFGNDLHISNKLFVLDAGASGSKDMKVLRNHLQEIRSQIVSVCPPMTHLCEKIIS TLPSWRKLNGPNQLMSLQQFVYDVQDQLNPLASEEDLRRIAQQLHSTGEINIMQSETV QDVLLLDPRWLCTNVLGKLLSVETPRALHHYRGRYTVEDIQRLVPDSDVEELLQILDA MDICARDLSSGTMVDVPALIKTDNLHRSWADEEDEVMVYGGVRIVPVEHLTPFPCGIF HKVQVNLCRWIHQQSTEGDADIRLWVNGCKLANRGAELLVLLVNHGQGIEVQVRGLET EKIKCCLLLDSVCSTIENVMATTLPGLLTVKHYLSPQQLREHHEPVMIYQPRDFFRAQ TLKETSLTNTMGGYKESFSSIMCFGCHDVYSQASLGMDIHASDLNLLTRRKLSRLLDP PDPLGKDWCLLAMNLGLPDLVAKYNTNNGAPKDFLPSPLHALLREWTTYPESTVGTLM SKLRELGRRDAADLLLKASSVFKINLDGNGQEAYASSCNSGTSYNSISSVVSR" polyA_signal 5652..5657 /gene="DAP-kinase" polyA_signal 5879..5884 /gene="DAP-kinase" BASE COUNT 1453 a 1528 c 1506 g 1423 t ORIGIN 1 cggaggacag ccggaccgag ccaacgccgg ggactttgtt ccctccacgg aggggactcg 61 gcaactcgca gcggcagggt ctggggccgg cgcctgggag ggatctgcgc cccccactca 121 ctccctagct gtgttcccgc cgccgccccg gctagtctcc ggcgctggcg cctatggtcg 181 gcctccgaca gcgctccgga gggaccgggg gagctcccag gcgcccggga ctggagactg 241 atgcatgagg ggcctacgga ggcgcaggag cggtggtgat ggtctgggaa gcggagctga 301 agtcccctgg gctttggtga ggcgtgacag tttatcatga ccgtgttcag gcaggaaaac 361 gtggatgatt actacgacac cggcgaggaa cttggcagtg gacagtttgc ggttgtgaag 421 aaatgccgtg agaaaagtac cggcctccag tatgccgcca aattcatcaa gaaaaggagg 481 actaagtcca gccggcgggg tgtgagccgc gaggacatcg agcgggaggt cagcatcctg 541 aaggagatcc agcaccccaa tgtcatcacc ctgcacgagg tctatgagaa caagacggac 601 gtcatcctga tcttggaact cgttgcaggt ggcgagctgt ttgacttctt agctgaaaag 661 gaatctttaa ctgaagagga agcaactgaa tttctcaaac aaattcttaa tggtgtttac 721 tacctgcact cccttcaaat cgcccacttt gatcttaagc ctgagaacat aatgcttttg 781 gatagaaatg tccccaaacc tcggatcaag atcattgact ttgggttggc ccataaaatt 841 gactttggaa atgaatttaa aaacatattt gggactccag agtttgtcgc tcctgagata 901 gtcaactatg aacctcttgg tcttgaggca gatatgtgga gtatcggggt aataacctat 961 atcctcctaa gtggggcctc cccatttctt ggagacacta agcaagaaac gttagcaaat 1021 gtatccgctg tcaactacga atttgaggat gaatacttca gtaataccag tgccctagcc 1081 aaagatttca taagaagact tctggtcaag gatccaaaga agagaatgac aattcaagat 1141 agtttgcagc atccctggat caagcctaaa gatacacaac aggcacttag tagaaaagca 1201 tcagcagtaa acatggagaa attcaagaag tttgcagccc ggaaaaaatg gaaacaatcc 1261 gttcgcttga tatcactgtg ccaaagatta tccaggtcat tcctgtccag aagtaacatg 1321 agtgttgcca gaagcgatga tactctggat gaggaagact cctttgtgat gaaagccatc 1381 atccatgcca tcaacgatga caatgtccca ggcctgcagc accttctggg ctcattatcc 1441 aactatgatg ttaaccaacc caacaagcac gggacacctc cattactcat tgctgctggc 1501 tgtgggaata ttcaaatact acagttgctc attaaaagag gctcgagaat cgatgtccag 1561 gataagggcg ggtccaatgc cgtctactgg gctgctcggc atggccacgt cgataccttg 1621 aaatttctca gtgagaacaa atgccctttg gatgtgaaag acaagtctgg agagatggcc 1681 ctccacgtgg cagctcgcta tggccatgct gacgtggctc aagttacttg tgcagcttcg 1741 gctcaaatcc caatatccag gacaaaggaa gaagaaaccc ccctgcactg tgctgcttgg 1801 cacggctatt actctgtggc caaagccctt tgtgaagccg gctgtaacgt gaacatcaag 1861 aaccgagaag gagagacgcc cctcctgaca gcctctgcca ggggctacca cgacatcgtg 1921 gagtgtctgg ccgaacatgg agccgacctt aatgcttgcg acaaggacgg acacattgcc 1981 cttcatctgg ctgtaagacg gtgtcagatg gaggtaatca agactctcct cagccaaggg 2041 tgtttcgtcg attatcaaga caggcacggc aatactcccc tccatgtggc atgtaaagat 2101 ggcaacatgc ctatcgtggt ggccctctgt gaagcaaact gcaatttgga catctccaac 2161 aagtatgggc gaacgcctct gcaccttgcg gccaacaacg gaatcctaga cgtggtccgg 2221 tatctctgtc tgatgggagc cagcgttgag gcgctgacca cggacggaaa gacggcagaa 2281 gatcttgcta gatcggaaca gcacgagcac gtagcaggtc tccttgcaag acttcgaaag 2341 gatacgcacc gaggactctt catccagcag ctccgaccca cacagaacct gcagccaaga 2401 attaagctca agctgtttgg ccactcggga tccgggaaaa ccacccttgt agaatctctc 2461 aagtgtgggc tgctgaggag ctttttcaga aggcgtcggc ccagactgtc ttccaccaac 2521 tccagcaggt tcccaccttc acccctggct tctaagccca cagtctcagt gagcatcaac 2581 aacctgtacc caggctgcga gaacgtgagt gtgaggagcc gcagcatgat gttcgagccg 2641 ggtcttacca aagggatgct ggaggtgttt gtggccccga cccaccaccc gcactgctcg 2701 gccgatgacc agtccaccaa ggccatcgac atccagaacg cttatttgaa tggagttggc 2761 gatttcagcg tgtgggagtt ctctggaaat cctgtgtatt tctgctgtta tgactatttt 2821 gctgcaaatg atcccacgtc aatccatgtt gttgtcttta gtctagaaga gccctatgag 2881 atccagctga acccagtgat tttctggctc agtttcctga agtcccttgt cccagttgaa 2941 gaacccatag ccttcggtgg caagctgaag aacccactcc aagttgtcct ggtggccacc 3001 cacgctgaca tcatgaatgt tcctcgaccg gctggaggcg agtttggata tgacaaagac 3061 acatcgttgc tgaaagagat taggaacagg tttggaaatg atcttcacat ttcaaataag 3121 ctgtttgttc tggatgctgg ggcttctggg tcaaaggaca tgaaggtact tcgaaatcat 3181 ctgcaagaaa tacgaagcca gattgtttcg gtctgtcctc ccatgactca cctgtgtgag 3241 aaaatcatct ccacgctgcc ttcctggagg aagctcaatg gacccaacca gctgatgtcg 3301 ctgcagcagt ttgtgtacga cgtgcaggac cagctgaacc ccctggccag cgaggaggac 3361 ctcaggcgca ttgctcagca gctccacagc acaggcgaga tcaacatcat gcaaagtgaa 3421 acagttcagg acgtgctgct cctggacccc cgctggctct gcacaaacgt cctggggaag 3481 ttgctgtccg tggagacccc acgggcgctg caccactacc ggggccgcta caccgtggag 3541 gacatccagc gcctggtgcc cgacagcgac gtggaggagc tgctgcagat cctcgatgcc 3601 atggacatct gcgcccggga cctgagcagc gggaccatgg tggacgtccc agccctgatc 3661 aagacagaca acctgcaccg ctcctgggct gatgaggagg acgaggtgat ggtgtatggt 3721 ggcgtgcgca tcgtgcccgt ggaacacctc acccccttcc catgtggcat ctttcacaag 3781 gtccaggtga acctgtgccg gtggatccac cagcaaagca cagagggcga cgcggacatc 3841 cgcctgtggg tgaatggctg caagctggcc aaccgtgggg ccgagctgct ggtgctgctg 3901 gtcaaccacg gccagggcat tgaggtccag gtccgtggcc tggagacgga gaagatcaag 3961 tgctgcctgc tgctggactc ggtgtgcagc accattgaga acgtcatggc caccacgctg 4021 ccagggctcc tgaccgtgaa gcattacctg agcccccagc agctgcggga gcaccatgag 4081 cccgtcatga tctaccagcc acgggacttc ttccgggcac agactctgaa ggaaacctca 4141 ctgaccaaca ccatgggggg gtacaaggaa agcttcagca gcatcatgtg cttcgggtgt 4201 cacgacgtct actcacaggc cagcctcggc atggacatcc atgcatcaga cctgaacctc 4261 ctcactcgga ggaaactgag tcgcctgctg gacccgcccg accccctggg gaaggactgg 4321 tgccttctcg ccatgaactt aggcctccct gacctcgtgg caaagtacaa caccaataac 4381 ggggctccca aggatttcct ccccagcccc ctccacgccc tgctgcggga atggaccacc 4441 taccctgaga gcacagtggg caccctcatg tccaaactga gggagctggg tcgccgggat 4501 gccgcagacc ttttgctgaa ggcatcctct gtgttcaaaa tcaacctgga tggcaatggc 4561 caggaggcct atgcctcgag ctgcaacagc ggcacctctt acaattccat tagctctgtt 4621 gtatcccggt gagggcagcc tctggcttgg acagggtctg tttggactgc agaaccaagg 4681 gggtgatgta gcccatcctt ccctttggag atgctgaggg tgtttcttcc tgcacccaca 4741 gccaggggga tgccactcct ccctccggct tgacctgttt ctctgccgct acctccctcc 4801 ccgtctcatt ccgttgtctg tggatggtca ttgcagttta agagcagaac agatctttta 4861 ctttggccgc ttgaaaagct agtgtacctc ctctcagtgt tttggactcc atctctcatc 4921 ctccagtacc ttgcttctta ctgataattt tgctggaatt cctaactttt caatgacatt 4981 ttttttaact atcatattga ttgtccttta aaaaagaaaa gtgcatattt atccaaaatg 5041 tgtatttctt atacgctttt ctgtgttata ccatttcctc agcttatctc ttttatattt 5101 gtaggagaaa ctcccatgta tggaatccca ctgtatgatt tataaacaga caatatgtga 5161 gtgccttttg cagaagaggg tgtgtttgaa atcatcggag tcagccagga gctgtcacca 5221 aggaaacgct acctctctgt cccttgctgt atgctgatca tcgccagagg tgcttcaccc 5281 tgagttttgt tttgtattgt tttctgacag tttttctgtt ttgtttggca aggaaagggg 5341 agaagggaat cctcctccag ggtgatttta tgatcagtgt tgttgctcta ggaagacatt 5401 tttccgtttg cttttgttcc aatgtcaatg tgaacgtcca catgaaacct acacactgtc 5461 atgcttcatc attccctctc atctcaggta gaaggttgac acagttgtag ggttacagag 5521 acctatgtaa gaattcagaa gacccctgac tcatcatttg tggcagtccc ttataattgg 5581 tgcatagcag atggtttcca catttagatc ctggtttcat aacttcctgt acttgaagtc 5641 taaaagcaga aaataaagga agcaagtttt cttccatgat tttaaattgt gatcgagttt 5701 taaattgata ggagggaaca tgtcctaatt cttctgtcct gagaagcatg taatgttaat 5761 gttatatcat atgtatatat atatatgcac tatgtatata catatatatt aatactggta 5821 tttttactta atctataaaa tgtcgttaaa aagttgtttg tttttttctt tttttataaa 5881 taaactgttg ctcgttaaaa aaaaaaaaaa // LOCUS HSDAUDI6 2017 bp RNA PRI 30-JUN-1993 DEFINITION H.sapiens mRNA DAUDI6 for retinoic acid X receptor b. ACCESSION X63522 S54072 NID g30447 KEYWORDS DAUDI6 gene; hormone receptor; retinoic acid X receptor b; transcription factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2017) AUTHORS Fleischhauer,K. TITLE Direct Submission JOURNAL Submitted (13-DEC-1991) K. Fleischhauer, Memorial Sloan Kettering Cancer Center, 1275 York Avenue, New York NY 10021, USA REFERENCE 2 (bases 1 to 2017) AUTHORS Fleischhauer,K., Park,J.H., DiSanto,J.P., Marks,M., Ozato,K. and Yang,S.Y. TITLE Isolation of a full-length cDNA clone encoding a N-terminally variant form of the human retinoid X receptor beta JOURNAL Nucleic Acids Res. 20 (7), 1801 (1992) MEDLINE 92253386 REFERENCE 3 (bases 1 to 2017) AUTHORS Fleischhauer,K., McBride,O.W., DiSanto,J.P., Ozato,K. and Yang,S.Y. TITLE Cloning and chromosome mapping of human retinoid X receptor beta: selective amino acid sequence conservation of a nuclear hormone receptor in mammals JOURNAL Hum. Genet. 90 (5), 505-510 (1993) MEDLINE 93154716 FEATURES Location/Qualifiers source 1..2017 /organism="Homo sapiens" /isolate="DAUDI" /db_xref="taxon:9606" /cell_type="T cell lymphoma" /cell_line="DAUDI" /clone_lib="cDNA DAUDI" /clone="DAUDI6" gene 180..1781 /gene="DAUDI6" CDS 180..1781 /gene="DAUDI6" /codon_start=1 /product="retinoic acid X receptor b" /db_xref="PID:g30448" /db_xref="SWISS-PROT:P28703" /translation="MSWAARPPFLPQRHAAGQCGPVGVRKEMHCGVASRWRRRRPWLD PAAAAAAAVAGGEQQTPEPEPGEAGRDGMGDSGRDSRSPDSSSPNPLPQGVPPPSPPG PPLPPSTAPSLGGSGAPPPPPMPPPPLGSPFPVISSSMGSPGLPPPAPPGFSGPVSSP QINSTVSLPGGGSGPPEDVKPPVLGVRGLHCPPPPGGPGAGKRLCAICGDRSSGKHYG VYSCEGCKGFFKRTIRKDLTYSCRDNKDCTVDKRQRNRCQYCRYQKCLATGMKREAVQ EERQRGKDKDGDGEGAGGAPEEMPVDRILEAELAVEQKSDQGVEGPGGTGGSGSSPND PVTNICQAADKQLFTLVEWAKRIPHFSSLPLDDQVILLRAGWNELLIASFSHRSIDVR DGILLATGLHVHRNSAHSAGVGAIFDRVLTELVSKMRDMRMDKTELGCLRAIILFNPD AKGLSNPSEVEVLREKVYASLETYCKQKYPEQQGRFAKLLLRLPALRSIGLKCLEHLF FFKLIGDTPIDTFLMEMLEAPHQLA" BASE COUNT 411 a 607 c 598 g 401 t ORIGIN 1 gaattccggc tgccacattg gcgctgtcat tttggtactg agcagagcga cgggcttaat 61 tcgacccaat ccaggccaga gtctttctct caggggcttc ctcgtgctca gctaatcctc 121 cgatcaatcc ttgggaatcc ctgggacctc ttcggtatcc ctactctcag ccagggatca 181 tgtcttgggc cgctcgcccg cccttcctcc ctcagcggca tgccgcaggg cagtgtgggc 241 cggtgggggt gcgaaaagaa atgcattgtg gggtcgcgtc ccggtggcgg cggcgacggc 301 cctggctgga tcccgcagcg gcggcggcgg cggcggtggc aggcggagaa caacaaaccc 361 cggagccgga gccaggggag gctggacggg acgggatggg cgacagcggg cgggactccc 421 gaagcccaga cagctcctcc ccaaatcccc ttccccaggg agtccctccc ccttctcctc 481 ctgggccacc cctaccccct tcaacagctc catcccttgg aggctctggg gccccacccc 541 cacccccgat gccaccaccc ccactgggct ctccctttcc agtcatcagt tcttccatgg 601 ggtcccctgg tctgccccct ccagctcccc caggattctc cgggcctgtc agcagccccc 661 agattaactc aacagtgtca ctccctgggg gtgggtctgg cccccctgaa gatgtgaagc 721 caccagtctt aggggtccgg ggcctgcact gtccaccccc tccaggtggc cctggggctg 781 gcaaacggct atgtgcaatc tgcggggaca gaagctcagg caaacactac ggggtttaca 841 gctgtgaggg ttgcaagggc ttcttcaaac gcaccatccg caaagacctt acatactctt 901 gccgggacaa caaagactgc acagtggaca agcgccagcg gaaccgctgt cagtactgcc 961 gctatcagaa gtgcctggcc actggcatga agagggaggc ggtacaggag gagcgtcagc 1021 ggggaaagga caaggatggg gatggggagg gggctggggg agcccccgag gagatgcctg 1081 tggacaggat cctggaggca gagcttgctg tggaacagaa gagtgaccag ggcgttgagg 1141 gtcctggggg aaccgggggt agcggcagca gcccaaatga ccctgtgact aacatctgtc 1201 aggcagctga caaacagcta ttcacgcttg ttgagtgggc gaagaggatc ccacactttt 1261 cctccttgcc tctggatgat caggtcatat tgctgcgggc aggctggaat gaactcctca 1321 ttgcctcctt ttcacaccga tccattgatg ttcgagatgg catcctcctt gccacaggtc 1381 ttcacgtgca ccgcaactca gcccattcag caggagtagg agccatcttt gatcgggtgc 1441 tgacagagct agtgtccaaa atgcgtgaca tgaggatgga caagacagag cttggctgcc 1501 tgagggcaat cattctgttt aatccagatg ccaagggcct ctccaaccct agtgaggtgg 1561 aggtcctgcg ggagaaagtg tatgcatcac tggagaccta ctgcaaacag aagtaccctg 1621 agcagcaggg acggtttgcc aagctgctgc tacgtcttcc tgccctccgg tccattggcc 1681 ttaagtgtct agagcatctg tttttcttca agctcattgg tgacaccccc atcgacacct 1741 tcctcatgga gatgcttgag gctccccatc aactggcctg agctcagacc cagacgtggt 1801 gcttctcaca ctggaggagc acacatccaa gagggactcc aagccctggg gagggtgggg 1861 ggccatgttc ccagaacctt gatggggtga gaagtacagg gcagaaccaa gaacataaac 1921 cctccaaggg atctgcttga tatcccaagt tggaagggac cccagatacc tgtgaggact 1981 ggttgtctct cttcggtgcc cttgagtctc tgaattt // LOCUS HSDBH 1955 bp RNA PRI 31-MAR-1995 DEFINITION Human mRNA for dopamine beta-hydroxylase (EC 1.14.17.1). ACCESSION Y00096 NID g30455 KEYWORDS dopamine beta-hydroxylase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1955) AUTHORS Lamouroux,A., Vigny,A., Faucon Biguet,N., Darmon,M.C., Franck,R., Henry,J.P. and Mallet,J. TITLE The primary structure of human dopamine-ss-hydroxylase: insights into the relationship between the soluble and the membrane-bound forms of the enzyme JOURNAL EMBO J. 6, 3921-3937 (1987) FEATURES Location/Qualifiers source 1..1955 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda gt11" /clone="DBH2" sig_peptide 40..114 /note="signal peptide (AA -25 to -1)" CDS 40..1851 /note="dopamine-beta-hydroxylase preprotein" /codon_start=1 /db_xref="PID:g30456" /db_xref="SWISS-PROT:P09172" /translation="MREAAFMYSTAVAIFLVILVAALQGSAPRESPLPYHIPLDPEGS LELSWNVSYTQEAIHFQLLVRRLKAGVLFGMSDRGELENADLVVLWTDGDTAYFADAW SDQKGQIHLDPQQDYQLLQVQRTPEGLTLLFKRPFGTCDPKDYLIEDGTVHLVYGILE EPFRSLEAINGSGLQMGLQRVQLLKPNIPEPELPSDTCTMEVQAPNIQIPSQETTYWC YIKELPKGFSRHHIIKYEPIVTKGNEALVHHMEVFQCAPEMDSVPHFSGPCDSKMKPD RLNYCRHVLAAWALGAKAFYYPEEAGLAFGGPGSSRYLRLEVHYHNPLVIEGRNDSSG IRLYYTAKLRRFNAGIMELGLVYTPVMAIPPRETAFILTGYCTDKCTQLALPPSGIHI FASQLHTHLTGRKVVTVLVRDGREWEIVNQDNHYSPHFQEIRMLKKVVSVHPGDVLIT SCTYNTEDRELATVGGFGILEEMCVNYVHYYPQTQLELCKTAVDAGFLQKYFHLINRF NNEDVCTCPQASVSQQFTSVPWNSFNCDVLKALYSFAPISMHCNKSSAVRFQGEWNLQ PLPKVISTLEEPTPQCPTSQGRSPAGPTVVSIGGGKG" mat_peptide 115..1848 /note="mat. dopamine-beta-hydroxylase (AA 1-578)" BASE COUNT 394 a 665 c 538 g 358 t ORIGIN 1 cccgccctca gtcgctgggc cagcctgccc ggccccagca tgcgggaggc agccttcatg 61 tacagcacag cagtggccat cttcctggtc atcctggtgg ccgcactgca gggctcggct 121 ccccgtgaga gccccctccc ctatcacatc cccctggacc cggaggggtc cctggagctc 181 tcatggaatg tcagctacac ccaggaggcc atccatttcc agctcctggt gcggaggctc 241 aaggctggcg tcctgtttgg gatgtccgac cgtggcgagc ttgagaacgc agatctcgtg 301 gtgctctgga ccgatgggga cactgcctat tttgcggacg cctggagtga ccagaagggg 361 cagatccacc tggatcccca gcaggactac cagctgctgc aggtgcagag gaccccagaa 421 ggcctgaccc tgcttttcaa gaggcccttt ggcacctgcg accccaagga ttacctcatt 481 gaggacggca ctgtccactt ggtctacggg atcctggagg agccgttccg gtcactggag 541 gccatcaacg gctcgggcct gcagatgggg ctgcagaggg tgcagctcct gaagcccaat 601 atccccgaac cggagttgcc ctcagacacg tgcaccatgg aggtccaagc tcccaatatc 661 cagatcccca gccaggagac cacgtactgg tgctacatta aggagcttcc aaagggcttc 721 tctcggcacc acattatcaa gtacgagccc atcgtcacca agggcaatga ggcccttgtc 781 caccacatgg aagtcttcca gtgcgccccc gagatggaca gcgtccccca cttcagcggg 841 ccctgcgact ccaagatgaa acccgaccgc ctcaactact gccgccacgt gctggccgcc 901 tgggccctgg gtgccaaggc attttactac ccagaggaag ccggccttgc cttcgggggt 961 ccagggtcct ccagatatct ccgcctggaa gttcactacc acaacccact ggtgatagaa 1021 ggacgaaacg actcctcagg catccgcttg tactacacag ccaagctgcg gcgcttcaac 1081 gcggggatca tggagctggg actggtgtac acgccagtga tggccattcc accacgggag 1141 accgccttca tcctcactgg ctactgcacg gacaagtgca cccagctggc actgcctccc 1201 tccgggatcc acatcttcgc ctctcagctc cacacacacc tgactgggag aaaggtggtc 1261 acagtgctgg tccgggacgg ccgggagtgg gagatcgtga accaggacaa tcactacagc 1321 cctcacttcc aggagatccg catgttgaag aaggtcgtgt cggtccatcc gggagatgtg 1381 ctcatcacct cctgcacgta caacacagaa gaccgggagc tggccacagt ggggggcttc 1441 gggatcctgg aggagatgtg tgtcaactac gtgcactact acccccagac gcagctggag 1501 ctctgcaaga cggctgtgga cgccggcttc ctgcagaagt acttccacct catcaacagg 1561 ttcaacaacg aggatgtctg cacctgccct caggcgtccg tgtctcagca gttcacctct 1621 gttccctgga actccttcaa ctgcgacgta ctgaaggccc tgtacagctt cgcgcccatc 1681 tccatgcact gcaacaagtc ctcagccgtc cgcttccagg gtgaatggaa cctgcagccc 1741 ctgcccaagg tcatctccac actggaagag cccaccccac agtgccccac cagccagggc 1801 cgaagccctg ctggccccac cgttgtcagc attggtgggg gcaaaggctg aggggggacc 1861 tactcctccc cctcctccat gctgtccctg tgggctcaca ccggcactgt gcactctact 1921 ctgcgacgat ccccaaggaa cagccctgca cgccc // LOCUS HSDBLPRO 3652 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for dbl proto-oncogene. ACCESSION X12556 NID g30481 KEYWORDS dbl oncogene; glycoprotein; oncogene; phosphoprotein; proto-oncogene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3652) AUTHORS Ron,D., Tronick,S.R., Aaronson,S.A. and Eva,A. TITLE Molecular cloning and characterization of the human dbl proto-oncogene: evidence that its overexpression is sufficient to transform NIH/3T3 cells JOURNAL EMBO J. 7 (8), 2465-2473 (1988) MEDLINE 89052660 REFERENCE 2 (bases 1 to 3652) AUTHORS Ron,D. TITLE Direct Submission JOURNAL Submitted (26-JUN-1989) to the EMBL/GenBank/DDBJ databases FEATURES Location/Qualifiers source 1..3652 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain stem" /clone_lib="lambda gt11" /clone="p18-nd-3.6." mRNA <1..3652 /note="dbl proto-oncogene mRNA" CDS 175..2952 /note="dbl protein (AA1 - 925)" /codon_start=1 /db_xref="PID:g30482" /db_xref="SWISS-PROT:P10911" /translation="MAEANPRRGKMRFRRNAASFPGNLHLVLVLRPTSFLQRTFTDIG FWFSQEDFMPKLPVVMLSSVSDLLTYIDDKQLTPELGGTLQYCHSEWIIFRNAIENFA LTVKEMAQMLQSFGTELAETELPDDIPSIEEILAIRAERYHLLKNDITAVTKEGKILL TNLEVPDTEGAVSSRLECHRQISGDWQTINKLLTQVHDMETAFDGFWEKHQLKMEQYL QLWKFEQDFQQLVTEVEFLLNQQAELADVTGTIAQVKQKIKKLENLDENSQELLSKAQ FVILHGHKLAANHHYALDLICQRCNELRYLSDILVNEIKAKRIQLSRTFKMHKLLQQA RQCCDEGECLLANQEIDKFQSKEDAQKALQDIENFLEMALPFINYEPETLQYEFDVIL SPELKVQMKTIQLKLENIRSIFENQQAGFRNLADKHVRPIQFVVPTPENLVTSGTPFF SSKQGKKTWRQNQSNLKIEVVPDCQEKRSSGPSSSLDNGNSLDVLKNHVLNELIQTER VYVRELYTVLLGYRAEMDNPEMFDLMPPLLRNKKDILFGNMAEIYEFHNDIFLSSLEN CAHAPERVGPCFLERKDDFQMYAKYCQNKPRSETIWRKYSECAFFQECQRKLKHRLRL DSYLLKPVQRITKYQLLLKELLKYSKDCEGSALLKKALDAMLDLLKSVNDSMHQIAIN GYIGNLNELGKMIMQGGFSVWIGHKKGATKMKDLARFKPMQRHLFLYEKAIVFCKRRV ESGEGSDRYPSYSFKHCWKMDEVGITEYVKGDNRKFEIWYGEKEEVYIVQASNVDVKM TWLKEIRNILLKQQELLTVKKRKQQDQLTERDKFQISLQQNDEKQQGAFISTEETELE HTSTVVEVCEAIASVQAEANTVWTEASQSAEISEEPAEWSSNYFYPTYDENEEENRPL MRPVSEMALLY" misc_feature 1201..1212 /note="pot. palmitylation site" misc_feature 1558..1566 /note="pot. N-linked glycosylation site" old_sequence 1783..1785 /note="ctc was ccc in [1]" /citation=[1] misc_feature 2191..2199 /note="pot. N-linked glycosylation site" misc_feature 2386..2403 /note="pot. serine phosphorylation site" misc_feature 3630..3635 /note="polyA signal" polyA_site 3652 /note="polyA site" BASE COUNT 1198 a 613 c 771 g 1070 t ORIGIN 1 tttttttttt ttcctcccaa cattgctgcc actgtgctaa tggaagcacc acggcagctt 61 tgtttgatag agatttttgg ctgccgtttt taaatactac ccaagaagca gctcgtattt 121 catcaatgtt gcgttgacaa ttggaaaaga aaagtgtaat tgcgtacagg cgaaatggca 181 gaagcaaatc cccggagagg caagatgagg ttcagaagga atgcggcttc cttccctggg 241 aacttgcact tggttttggt tttacgtcct accagctttc ttcaacgaac gttcacagac 301 attggatttt ggtttagtca ggaggatttt atgcctaaat taccagttgt tatgctgagc 361 tcagttagtg atttgctgac atacattgat gacaagcaat taacccctga gttaggcggc 421 accttgcagt actgccacag tgaatggatc atcttcagaa atgctataga aaattttgcc 481 ctcacagtga aagaaatggc tcagatgtta cagtcctttg gaactgaact ggctgagaca 541 gaactaccag atgatattcc ctcaatagaa gaaattctgg caattcgtgc tgaaaggtat 601 catctgttga agaatgatat tacagctgta accaaagaag gaaaaattct gctaacaaat 661 ctggaagtgc ctgacactga aggagctgtc agttcaagac tagaatgtca tcggcaaata 721 agtggtgact ggcaaactat taataagttg ctgactcaag tacatgatat ggaaacagct 781 tttgatggat tttgggaaaa acatcaatta aaaatggagc agtatctgca actatggaag 841 tttgagcagg attttcaaca gcttgtgact gaagttgaat ttctattaaa ccaacaagca 901 gaactggctg atgtaacagg gactatagct caagtaaaac aaaaaataaa aaaattggaa 961 aacttagatg aaaattctca ggagctatta tcaaaggccc agtttgtgat attacatgga 1021 cacaagcttg cagcaaatca ccattatgca cttgatttaa tctgccagag gtgcaatgag 1081 ctacgttacc tttctgatat tttggttaat gagataaaag caaaacggat acaactcagc 1141 aggaccttca aaatgcataa actcctacag caggctcgtc aatgctgtga tgaaggggaa 1201 tgtcttctag ctaatcagga aatagataag tttcagtcta aagaagatgc tcagaaagct 1261 ctccaagaca ttgaaaattt tcttgaaatg gctctaccct ttataaatta tgaacctgaa 1321 acactgcagt atgaatttga tgtaatatta tctcctgagc ttaaggttca aatgaagact 1381 atacaactca agcttgaaaa cattcgaagt atatttgaga accagcaggc tggtttcagg 1441 aacctggcag ataagcatgt gaggccaatc caatttgtgg tacccacacc tgaaaatttg 1501 gtcacatctg ggacaccatt tttttcatct aaacaaggga agaagacttg gagacaaaat 1561 cagagcaact taaaaattga agtggtgcct gattgtcagg agaagagaag ttctggtcca 1621 tcctccagtt tggacaatgg caatagcttg gatgttttaa agaaccacgt actaaatgaa 1681 ctgatacaga ctgagagagt ttatgttcga gaactgtata ctgttttgtt gggttataga 1741 gcggagatgg ataatccaga gatgtttgat cttatgccac ctctcctgag aaataaaaag 1801 gacattctct ttggaaacat ggcagaaata tatgaattcc ataacgacat tttcttgagc 1861 agcctggaaa attgtgctca tgctccagaa agagtgggac cttgtttcct ggaaaggaag 1921 gatgattttc agatgtatgc aaaatattgt cagaataagc ccagatcaga aacaatttgg 1981 aggaagtatt cagaatgcgc atttttccag gaatgtcaaa gaaagttaaa acacagactt 2041 agactggatt cctatttact caaaccagtg caacgaatca ctaaatatca gttattgttg 2101 aaggagctat taaaatatag caaagactgt gaaggttctg ctctgttgaa gaaggcactc 2161 gatgcaatgc tggatttact gaagtcagtt aatgattcta tgcatcagat tgcaataaat 2221 ggctatattg gaaacttaaa tgaactgggc aagatgataa tgcaaggtgg attcagcgtt 2281 tggatagggc acaagaaagg tgctacaaaa atgaaggatt tggctagatt caaaccaatg 2341 cagcgacacc ttttcttgta tgaaaaagcc attgtttttt gcaaaaggcg tgttgaaagt 2401 ggagaaggct ctgacagata cccgtcatac agttttaaac actgttggaa aatggatgaa 2461 gttggaatca ctgaatatgt aaaaggtgat aaccgcaagt ttgaaatctg gtatggtgaa 2521 aaggaagaag tttatattgt ccaggcttct aatgtagatg tgaagatgac gtggctaaaa 2581 gaaataagaa atattttgtt gaagcagcag gaacttttga cagttaaaaa aagaaagcaa 2641 caggatcaat taacagaacg ggataagttt cagatttctc ttcagcagaa tgatgaaaag 2701 caacagggag cttttataag tactgaggaa actgaattgg aacacaccag cactgtggtg 2761 gaggtctgtg aggcaattgc gtcagttcag gcagaagcaa atacagtttg gactgaggca 2821 tcacaatctg cagaaatctc tgaagaacct gcggaatggt caagcaacta tttctaccct 2881 acttatgatg aaaatgaaga agaaaatagg cccctcatga gacctgtgtc ggagatggct 2941 ctcctatatt gatgaagcta ctatgtcaaa tggcaagtag ctctttcctg cctgcttctc 3001 agctcatttg gaaaaatact gcgcaaaaga cattgagctc aaatgatgca gatgttgttt 3061 tcaggttaat ggacacgcaa agaaaccaca gcacatactt cttttctttc atttaataaa 3121 gcttttaatt atggtacgct gtctttttaa aatcatgtat ttaatgtgtc agatattgtg 3181 cttgaaagat tctcatctca gaatactttt ggacttgaaa attatttctt ctctactttg 3241 taaccaaatg caatcggtgt gccttggatt atttagttta ttaatgaatt aagtcaaaat 3301 tacggctgca aaatggctaa ggtcaagtaa agcacaacat tatgatttaa tatgcttttg 3361 ttgaaaccac agcttttgtg cccattgttt taacttgtgt gaaacaatac aaagcccaga 3421 aattcttttc ggggcatgag taaattttgt tcagggctac tgtctgtatg tgcccagata 3481 aaattttcat gagagtagtt tacaaaagcc gtatttaaaa gttaatattt tcacactttt 3541 tttctggatt tctgcttata attaatgtaa cttaaattag ttgtgctctg ctattttctg 3601 tatatttcat gttgtaattc tttttttcaa ataaaaatta attcttcagg tt // LOCUS HSDBN1 2524 bp mRNA PRI 10-FEB-1994 DEFINITION Human drebrin E2 mRNA (DBN1), complete cds. ACCESSION U00802 NID g392889 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2524) AUTHORS Fisher,L.W., McBride,O.W., Filpula,D., Ibaraki,K. and Young,M.F. TITLE Human drebrin: cDNA sequence, mRNA tissue distribution and chromosomal localization JOURNAL Neurosci. Res. Commun. 14, 35-42 (1994) REFERENCE 2 (bases 1 to 2524) AUTHORS Fisher,L.W. TITLE Direct Submission JOURNAL Submitted (17-AUG-1993) Larry W Fisher, Bone Research Branch, National Institute of Dental Research, NIH, Room 106, Building 30, Bethesda, Maryland, 20892, USA FEATURES Location/Qualifiers source 1..2524 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="bone cells" /dev_stage="adult" /chromosome="5" gene 73..2022 /gene="DBN1" CDS 73..2022 /gene="DBN1" /codon_start=1 /product="drebrin E2" /db_xref="PID:g392890" /translation="MAGVSFSGHRLELLAAYEEVIREESAADWALYTYEDGSDDLKLA ASGEGGLQELSGHFENQKVMYGFCSVKDSQAALPKYVLINWVGEDVPDARKCACASHV AKVAEFFQGVDVIVNASSVEDIDAGAIGQRLSNGLARLSSPVLHRLRLREDENAEPVG TTYQKTDAAVEMKRINREQFWEQAKKEEELRKEEERKKALDERLRFEQERMEQERQEQ EERERRYREREQQIEEHRRKQQTLEAEEAKRRLKEQSIFGDHRDEEEETHMKKSESEV EEAAAIIAQRPDNPREFFKQQERVASASAGSCDVPSPFNHRPGSHLDSHRRMAPTPIP TRSPSDSSTASTPVAEQIERALDEVTSSQPPPLPPPPPPAQETQEPSPILDSEETRAA APQAWAGPMEEPPQAQAPPRGPGSPAEDLMFMESAEQAVLAAPVEPATADATEVHDAA DTIETDTATADTTVANNVPPAATSLIDLWPGNGEGASTLQGEPRAPTPPSGTEVTLAE VPLLDEVAPEPLLPAGEGCATLLNFDELPEPPATFCDPEEVEGEPLAAPQTPTLPSAL EELEQEQEPEPHLLTNGETTQKEGTQASEGYFSQSQEEEFAQSEELCAKAPPPVFYNK PPEIDITCWDADPVPEEEEGFEGGD" polyA_signal 2497 BASE COUNT 526 a 778 c 777 g 443 t ORIGIN 1 ctttccctcc ctcctcctcc gtccgcccgt ccgtccgcgc gtctgtccgt tcggcccggt 61 ccggcccgaa gcatggccgg cgtcagcttc agcggccacc gcctggagct gctggcggct 121 tacgaggagg tgatccgaga ggagagcgcg gccgactggg ctctgtacac atatgaagat 181 ggctccgatg acctcaagct tgcagcatca ggagaagggg gcttgcagga gctttcggga 241 cactttgaga accagaaggt gatgtacggc ttctgcagtg tcaaggactc ccaagctgct 301 ctgccaaaat acgtgctcat caactgggtg ggcgaagatg tgcctgatgc ccgcaagtgc 361 gcttgtgcca gccacgtggc taaggtggca gagttcttcc agggtgtcga cgtgatcgtg 421 aacgccagca gcgtggaaga catagacgcg ggtgccatcg ggcagcggct ctctaacggg 481 ctggcgcgac tctccagccc tgtgctgcac cgactgcggc tgcgagagga tgagaacgca 541 gagcccgtgg gcaccaccta ccagaagacg gatgcagctg tggaaatgaa gcggattaac 601 cgagagcagt tctgggagca ggccaagaag gaagaagagc tgcggaagga ggaggagcgg 661 aagaaggccc tggatgagag gctcaggttc gagcaggagc ggatggagca ggagcggcag 721 gagcaagagg agcgcgagcg gcgctaccgg gagcgggagc agcagatcga ggagcacagg 781 aggaaacagc agactttaga agcggaagag gccaagaggc ggttgaagga gcagtctatc 841 tttggtgacc atcgggatga ggaggaagag acccacatga agaagtcaga gtcggaggtg 901 gaggaggcag cagctattat tgcccagcgg cctgacaacc caagggagtt cttcaagcag 961 caggaaagag tcgcatcggc ctctgcgggc agctgtgatg taccctcgcc cttcaaccat 1021 cgaccaggca gccacctgga cagccaccgg aggatggcgc ccactcccat ccccacgcgg 1081 agcccgtctg actccagcac cgcctccacc cctgtcgctg agcagataga gcgggccctg 1141 gatgaggtca cctcctcgca gcctccacca ctgccaccgc cacccccacc agcccaagag 1201 acccaggagc ccagccccat cctagacagt gaggagacca gagcagcagc ccctcaggcc 1261 tgggccggcc ccatggagga gccccctcag gcacaggcgc ctccccgggg gccaggcagc 1321 cctgcagagg acttgatgtt catggagtct gcagagcagg ctgtcctggc tgctcccgtg 1381 gagcctgcca cagctgacgc cacggaggtc cacgatgcag ctgacaccat tgaaactgac 1441 actgccactg ctgacaccac tgttgccaac aacgtacccc ccgccgccac cagcctcatt 1501 gacctatggc ctggcaacgg ggaaggggcc tccacactcc agggtgagcc cagggccccc 1561 acgccaccct cgggtactga ggtcaccctg gcagaggtgc ccctgctgga tgaggtggct 1621 ccggagccac tgctgccagc aggcgaaggc tgtgccaccc ttctcaactt tgatgagctg 1681 cctgagccgc cagccacctt ctgtgaccca gaggaagtgg aaggggagcc cctggctgcc 1741 ccccagaccc caactctgcc ctcagccctt gaggagctgg agcaagagca ggagccggag 1801 ccccacctgc taaccaatgg cgagaccacc cagaaggagg ggacccaggc cagtgagggg 1861 tacttcagtc aatcacagga ggaggagttt gcccaatcgg aagagctctg tgccaaggct 1921 ccgcctcctg tgttctacaa caagcctcca gagatcgaca tcacatgctg ggatgcagac 1981 ccagttccag aagaggagga gggcttcgag ggtggtgatt agcggtggcg ccagccctag 2041 gctacccttg ccaaggccgc ccacctgcat cagcctctgg ccagacggcc cgccgtgcct 2101 gcattcgcag cagctccgcc tggcacccac tccggattcc ggccctggct ggggacttgg 2161 ccgcttccct acccacaggg cctgactttt acagcttttc tcttttttta aaaagttgat 2221 aggagacttg tacagttgac tggctttcct ctcgttggta gttgagacgc tgttgcaaat 2281 tccacccctc cttccctggt ccagattgta gctcttagtc ctccctgctc agctggccgg 2341 gttggaggcc tcaccctgct tggggcctgg cgtgggggga gctctggtgg gaaaatgtcc 2401 cccacctctt ttcctagttt tatgtttctt gggaaaatat cactttgtat tctctgtcca 2461 gggcttcaga tattttgcac gaattttaaa acatggcaat aaatggctcg tgggctctgg 2521 ctcc // LOCUS HSDBP5 4972 bp RNA PRI 30-JUN-1993 DEFINITION H.sapiens mRNA for novel DNA binding protein. ACCESSION X63071 S50007 NID g30487 KEYWORDS DNA-binding protein; nuclear protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4972) AUTHORS Mattioni,T.M. TITLE Direct Submission JOURNAL Submitted (05-NOV-1991) T.M. Mattioni, Memorial Sloan Kettering Cancer Center, 1275 York Avenue, New York, N.Y. 10021, USA REMARK sequence corrected according to publication REFERENCE 2 (bases 1 to 4972) AUTHORS Mattioni,T., Hume,C.R., Konigorski,S., Hayes,P., Osterweil,Z. and Lee,J.S. TITLE A cDNA clone for a novel nuclear protein with DNA binding activity JOURNAL Chromosoma 101 (10), 618-624 (1992) MEDLINE 93048367 COMMENT See also M36428. FEATURES Location/Qualifiers source 1..4972 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="B cell" /cell_line="721.180" /clone_lib="pCDM8-180" /clone="5-1, 5-9" mRNA 1..4972 /gene="DBP-5" gene 1..4972 /gene="DBP-5" CDS 73..3612 /gene="DBP-5" /codon_start=1 /product="DBP-5 protein" /db_xref="PID:g30488" /translation="MVPPLPPEEPPTMPPLPPEEPPMTPPLPPEEPPEGPALPTEQSA LTAENTWPTEVPSLPSEESVSQPEPPVSQSEISEPSAVPTDYSVSASDPSVLVSEAAV TVPEPPPEPESSITLTPVESAVVAEEHEVVPERPVTCMVSETPAMSAEPTVLASEPPV MSETAETFDSMRASGHVASEVSTSLLVPAVTTPVLAESILEPPAMAAPESSAMAVLES SAVTVLESSTVTVLESSTVTVLEPSVVTVPEPPVVAEPDYVTIPVPVVSALEPSVPVL EPAVSVLQPSMIVSEPSVSVQESTVTVSEPAVTVSEQTQVIPTEVAIESTPMILESSI MSSHVMKGINLSSGDQNLAPEIGMQEIALHSGEEPHAEEHLKGDFYESEHGINIDLNI NNHLIAKEMEHNTVCAAGTSPVGEIGEEKILPTSETKQRTVLDTYPGVSEADAGETLS STGPFALEPDATGTSKGIEFTTASTLSLVNKYDVDLSLTTQDTEHDMVISTSPSGGSE ADIEGPLPAKDIHLDLPSNNNLVSKDTEEPLPVKESDQTLAALLSPKESSGGEKEVPP PPKETLPDSGFSANIEDINEADLVRPLLPKDMERLTSLRAGIEGPLLASDVGRDRSAA SPVVSSMPERASESSSEEKDDYEIFVKVKDTHEKSKKNKNRDKGEKEKKRDSSLRSRS KRSKSSEHKSRKRTSESRSRARKRSSKSKSHRSQTRSRSRSRRRRRSSRSRSKSRGRR SVSKEKRKRSPKHRSKSRERKRKRSSSRDNRKTVRARSRTPSRRSRSHTPSRRRRSRS VGRRRSFSISPSRRSRTPSRRSRTPSRRSRTPSRRSRTPSRRSRTPSRRSRTPSRRRR SRSVVRRRSFSISPVRLRRSRTPLRRRFSRSPIRRKRSRSSERGRSPKRLTDLDKAQL LEIAKANAAAMCAKAGVPLPPNLKPAPPPTIEEKVAKKSGGATIEELTEKCKQIAQSK EDDDVIVNKPHVSDEEEEEPPFYHHPFKLSEPKPIFFNLNIAAAKPTPPKSQVTLTKE FPVSSGSQHRKKEADSVYGEWVPVEKNGEENKDDDNVFSSNLPSEPVDISTAMSERAL AQKRLSENAFDLEAMSMLNRAQERIDAWAQLNSIPGQFTGSTGVQVLTQEQLANTGAQ AWIKKDQFLRAAPVTGGMGAVLMRKWAGEKEKD" BASE COUNT 1507 a 1063 c 1113 g 1289 t ORIGIN 1 gatcgttcaa tgatgtctat ggctgctgat tcttacaccg attcttacac tgacacatat 61 acagaggcat atatggtgcc acctttgcct cctgaagagc ccccaacaat gccaccgttg 121 ccacctgagg agccaccaat gacaccacca ttgcctcctg aggaaccacc agagggtcca 181 gcattgccca ctgagcagtc agcattaaca gctgaaaata cttggcctac agaggtgcca 241 tcattaccat ctgaagagtc tgtatcgcag cctgagcctc ctgtgagtca aagtgagatt 301 tcggagcctt cagcagtgcc tactgattat tcagtgtcag catcagatcc ctcagtttta 361 gtatcagagg ctgctgtgac tgttccagaa ccaccaccag agccagaatc ttcaattacg 421 ttaacacctg tagagtctgc agtagtagca gaagaacatg aagttgttcc agagagacca 481 gtgacttgta tggtatctga aactcccgcc atgtcagctg aaccaactgt gttagcatca 541 gagcctcctg ttatgtcaga gacagcagaa acatttgatt ccatgagagc ctcaggacat 601 gttgcctcag aagtatctac atccttgttg gttccagcag taactactcc agtgctggca 661 gagagcattc tggagccgcc agccatggct gccccagagt cttcagctat ggctgtcctg 721 gagtcttcgg ctgtgaccgt cctggagtct tcgactgtga ctgtcctgga gtcttcgact 781 gtaactgtcc tggagccttc ggttgtgact gtcccggagc ctcctgttgt ggctgagcca 841 gactatgtta ccattcctgt gccagttgtt tctgcgctgg agccttctgt gcctgttctg 901 gaaccagcgg tgtcagtcct tcaaccttct atgattgttt cagaaccatc tgtttctgtc 961 caggaatcga ctgtgacagt ttcagagcct gctgtcacag tctcagagca gactcaagta 1021 ataccaactg aggtggctat agagtccaca ccaatgatac tggaatctag tatcatgtca 1081 tcacatgtta tgaaaggaat taatctatcc tctggtgatc aaaatcttgc tccagagatt 1141 ggcatgcagg agattgcatt gcattcaggt gaagaaccac atgctgagga acacctgaaa 1201 ggtgactttt acgaaagcga acatggtata aatatagacc ttaatataaa taatcattta 1261 attgctaaag agatggaaca taatacagtg tgtgctgctg gtactagtcc tgttggggaa 1321 attggtgaag agaaaatttt gcccaccagt gagactaaac agcgcacagt attggatacc 1381 taccctggtg ttagtgaagc tgatgcagga gaaactctat cttctactgg tccttttgct 1441 ctggaacctg atgcaacagg aactagtaag ggtattgaat ttaccacagc atctactctc 1501 agtttagtta ataaatatga tgttgattta tctttaacta ctcaagatac tgaacatgac 1561 atggtaattt ccaccagtcc tagtggtggt agtgaagctg acattgaagg gcctttgcct 1621 gctaaagata ttcatcttga tttaccatct aataataacc ttgttagtaa ggatacagaa 1681 gaaccattac ctgtaaaaga gagtgaccag acattagcag ctctgctcag ccctaaagaa 1741 agtagtggag gagaaaaaga agtacctccc cctcctaaag agacactgcc tgattcagga 1801 ttttctgcca atattgagga tattaatgaa gcagatttag tgagaccgtt acttcctaag 1861 gacatggaac gtcttacaag ccttagagct ggcattgaag gacctttact tgcaagtgat 1921 gttggacgtg acagatctgc tgccagcccg gttgtaagta gtatgccaga aagagcttca 1981 gagtcttctt cagaggaaaa agatgattat gaaatttttg taaaagttaa ggacactcac 2041 gaaaaaagca agaaaaataa gaaccgtgat aagggggaga aagagaagaa aagagactct 2101 tcattaagat ctcgaagtaa gcgttccaaa tcttctgaac acaaatcacg caagcgtacc 2161 agtgaatctc gttctagggc aagaaagaga tcatctaagt ccaagtctca tcgctctcag 2221 acacgttcac ggtcacgttc aagacgcagg aggagaagca gcagatcaag atcaaagtct 2281 agaggaagaa gatctgtatc aaaagagaag cgcaaaagat ctccaaagca cagatccaag 2341 tctagggaaa gaaaaagaaa aagatcaagc tccagggata accgaaagac agttagagct 2401 cgaagtcgaa ccccaagtcg tcggagtcgg agtcatactc caagtcgtcg acgaaggtct 2461 agatctgtgg gtagaagaag gagctttagc atttccccaa gccgccgcag ccgcaccccc 2521 agccgccgca gccgcacccc cagccgccgc agccgcaccc ccagccgccg cagccgcacc 2581 cccagccgcc ggagccgcac ccctagccgt cggagccgca ccccaagccg ccggagaaga 2641 tcaaggtctg tggtaagaag acgaagcttc agtatctcac cagtcagatt aaggcgatca 2701 agaacaccct taagaagaag gtttagcaga tctcccatcc gtcgtaaaag atccaggtct 2761 tctgaacgag gcagatcacc caaacgtctg acagatttgg ataaggctca attacttgaa 2821 atagccaaag ctaatgcagc tgccatgtgt gctaaggctg gtgtcccttt accaccaaac 2881 ctaaagcctg cacctccacc tactatagaa gagaaagttg ctaaaaagtc aggaggagct 2941 actatagaag aactaactga gaaatgtaaa cagatcgcac agagtaaaga agatgatgat 3001 gtaatagtga ataaacctca tgtttcggat gaagaggaag aagaacctcc tttttatcat 3061 catcccttta aactcagtga acccaaacct atttttttca atctgaatat tgctgcagca 3121 aaaccaactc caccaaaaag ccaggtaaca ttaacaaaag aattccctgt atcatctgga 3181 tctcaacatc ggaaaaaaga agcggatagt gtttatggag aatgggttcc tgtggagaaa 3241 aatggtgaag aaaacaaaga tgatgataat gttttcagca gcaatttgcc ctcagagcct 3301 gtggacatct ctacagcaat gagtgaacgg gcacttgctc agaaaagact cagtgagaat 3361 gcatttgatc ttgaagccat gagcatgtta aatagagctc aggaaaggat tgatgcctgg 3421 gctcagctga actctattcc tggccagttc acaggaagta caggagtaca ggttttgaca 3481 caagaacagt tggccaatac tggtgcccaa gcctggatta aaaaggatca gttcttaaga 3541 gcagccccgg taactggagg aatgggagcc gttttgatga gaaaatgggc tggagagaag 3601 gagaaggatt aggaaaaaac aaagaaggca acaaggaacc catcctagtt gattttaaga 3661 cagaccgaaa aggtcttgtt gcagtaggag aaagagcaca aaagaggtct gggaacttct 3721 ctgctgcaat gaaagatctg tcaggcaaac atcctgtgtc tgctttgatg gagatctgta 3781 ataaaagaag gtggcaacca cctgaatttc tattggtcca tgatagtggc cctgatcatc 3841 gcaaacattt tctctttagg gtattgagaa atggaagccc ttaccagccc aattgtatgt 3901 ttttcttgaa taggtattga taaatggaag cgcttaccag cccagctttg ccagccctaa 3961 taagaagcat gctaaagcca cagcagctac tgtggttctt caagcaatgg gccttgtacc 4021 aaaggacctc atgctaatgc cacttgcttc aggagtgcct cacgtagata gattgaggtt 4081 ttataataat catttcagaa ttttactctg catcacaatg tatttcctct ttaatgttgt 4141 aaatatttgg caatttaaga cattgtgtaa aaagcaatct gtaaaaacat ctccaggctt 4201 tgatttttgt accatggaaa ttgtatttaa ccatacaggg ttttggtatg tttatattgt 4261 ttaccttagt gatgtatttg tttaagtggc taacatccaa acgactgttt gaaggcatca 4321 gagtaatctt cagtgtggaa tgttaaataa cgcttttata ctgtattttg tactatgatg 4381 taactcccct tccttatggc taggctactg taacacttgc ctgtaatcag tgaagggctg 4441 tgcaccttgt actatttcac aatgggttct gctggacaga taatgggcca gtgttattga 4501 ggtgatcaag atctgttcca cagggctaat gccaccatct cccctcaaaa tttgtagagg 4561 ttctaaaaag aaagtggtat gttgtgtgat gatcagcact aagtcctgca ttcctgttaa 4621 agccacttgg gtcataagaa gggaagtaaa aaatgaagtc tgactagaaa ttctattgca 4681 gaggccaagt acatttagta tggcattgag ttgtgatata gttttcattt gatgtgcatt 4741 ttgaatttca gctacaccta gatagacgta aaatgataat taaaatgctg taaccaactt 4801 atctaataaa attggcaacc agccactatt ttgttgacta tgagaaagtt aaaagtttat 4861 gttaattttt agggtctgat agaatatttc atgtgtatta cagtggtatt catatgctat 4921 gtctctaaac tttattttca aaagcttaag gcccaaatac aaacttcctt ta // LOCUS HSDBPAV 1392 bp RNA PRI 07-OCT-1996 DEFINITION H.sapiens mRNA for DNA binding protein A variant. ACCESSION X95325 NID g1167837 KEYWORDS dbpA gene; DNA-binding protein; Y-box binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1392) AUTHORS Coles,L.S., Diamond,P., Occhiodoro,F., Vadas,M.A. and Shannon,M.F. TITLE Cold shock domain proteins repress transcription from the GM-CSF promoter JOURNAL Nucleic Acids Res. 24 (12), 2311-2317 (1996) MEDLINE 96279731 REFERENCE 2 (bases 1 to 1392) AUTHORS Coles,L.S. TITLE Direct Submission JOURNAL Submitted (24-JAN-1996) L.S. Coles, Hanson Centre for Cancer Research, Div. Human Immunology, Inst Medical & Veterinary Science, Frome Road, Adelaide, South Australia 5000, AUSTRALIA COMMENT Overlaps with M24069. FEATURES Location/Qualifiers source 1..1392 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HUT-78, ATCC #TIB161" /cell_type="T-lymphoblast" /clone_lib="lambda gt11" /clone="A2" mRNA <1..>1392 gene 184..1302 /gene="dbpAv" CDS 184..1302 /gene="dbpAv" /note="variant A" /codon_start=1 /product="DNA-binding protein" /db_xref="PID:e219699" /db_xref="PID:g1167838" /translation="MSEAGEATTTTTTTLPQAPTEAAAAAPQDPAPKSPVGSGAPQAA APAPAAHVAGNPGGDAAPAATGTAAAASLATAAGSEDAEKKVLATKVLGTVKWFNVRN GYGFINRNDTKEDVFVHQTAIKKNNPRKYLRSVGDGETVEFDVVEGEKGAEAANVTGP DGVPVEGSRYAADRRRYRRGYYGRRRGPPRNYAGEEEEEGSGSSEGFDPPATDRQFSG ARNQLRRPQYRPQYRQRRFPPYHVGQTFDRRSRVLPHPNRIQAGEIGEMKDGVPEGAQ LQGPVHRNPTYRPRYRSRGPPRPRPAPAVGEAEDKENQQATSGPNQPSVRRGYRRPYN YRRRPRPPNAPSQDGKEAKAGEAPTENPAPPTQQSSAE" BASE COUNT 330 a 456 c 412 g 194 t ORIGIN 1 gaattccggc acgaagctcg agccgcctcc gccgcgcgac cccacctcgg ccgccgccgc 61 ctgcgccgcg agatccgccc cggcctcccc gagagcgagc cccggccgcc gcgaccacca 121 gccgcgctaa ccgccgacca accgccaccg aggcgcctga gcgagagcag aggaggagga 181 ggcatgagtg aggcgggcga ggccaccacc accaccacca ccaccctccc gcaggctccg 241 acggaggcgg ccgccgcggc tccccaggac cccgcgccca agagcccggt gggcagcggt 301 gcgccccagg ccgcggcccc ggcgcccgcc gcccacgtcg caggaaaccc cggtggggac 361 gcggcccccg cagccacggg caccgcggcc gccgcctctt tagccaccgc cgccggcagc 421 gaagacgcgg agaaaaaagt tctcgccacc aaagtccttg gcactgtcaa atggttcaac 481 gtcagaaatg gatatggatt tataaatcga aatgacacca aagaagatgt atttgtacat 541 cagactgcca tcaagaagaa taacccacgg aaatatctgc gcagtgtagg agatggagaa 601 actgtagagt ttgatgtggt tgaaggagag aagggtgcag aagctgccaa tgtgactggc 661 ccggatggag ttcctgtgga agggagtcgt tacgctgcag atcggcgccg ttacagacgt 721 ggctactatg gaaggcgccg tggccctccc cggaattacg ctggggagga ggaggaggaa 781 gggagcggca gcagtgaagg atttgacccc cctgccactg ataggcagtt ctctggggcc 841 cggaatcagc tgcgccgccc ccagtatcgc cctcagtacc ggcagcggcg gttcccgcct 901 taccacgtgg gacagacctt tgaccgtcgc tcacgggtct taccccatcc caacagaata 961 caggctggtg agattggaga gatgaaggat ggagtcccag agggagcaca acttcaggga 1021 ccggttcatc gaaatccaac ttaccgccca aggtaccgta gcaggggacc tcctcgccca 1081 cgacctgccc cagcagttgg agaggctgaa gataaagaaa atcagcaagc caccagtggt 1141 ccaaaccagc cgtctgttcg ccgtggatac cggcgtccct acaattaccg gcgtcgcccg 1201 cgtcctccta acgctccttc acaagatggc aaagaggcca aggcaggtga agcaccaact 1261 gagaaccctg ctccacccac ccagcagagc agtgctgagt aacaccaggc tcctcaggca 1321 ccttcaccat cggcaggtga cctaaagaat taatgaccat tcagaaataa agcaaaaagc 1381 aggccggaat tc // LOCUS HSDBT 3535 bp RNA PRI 31-JUL-1993 DEFINITION H.sapiens mRNA for transacylase (DBT). ACCESSION X66785 S48130 NID g30489 KEYWORDS transacylase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3535) AUTHORS Lau,K.S. TITLE Direct Submission JOURNAL Submitted (07-JUN-1992) K.S. Lau, University of Texas, Southwestern Medical Center, Dept of Biochemistry, 5323 Harry Hines Blvd, Dallas TX 75235-9038, USA REFERENCE 2 (bases 1 to 3535) AUTHORS Lau,K.S., Chuang,J.L., Herring,W.J., Danner,D.J., Cox,R.P. and Chuang,D.T. TITLE The complete cDNA sequence for dihydrolipoyl transacylase (E2) of human branched-chain alpha-keto acid dehydrogenase complex JOURNAL Biochim. Biophys. Acta 1132 (3), 319-321 (1992) MEDLINE 93041936 FEATURES Location/Qualifiers source 1..3535 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" /chromosome="1p31" gene 15..1463 /gene="DBT" CDS 15..1463 /gene="DBT" /note="product subunit structure 24mer, mitochondrial multi-enzyme complex" /codon_start=1 /product="transacylase" /db_xref="PID:g30490" /db_xref="SWISS-PROT:P11182" /translation="MAAVRMLRTWSRNAGKLICVRYFQTCGNVHVLKPNYVCFFGYPS FKYSHPHHFLKTTAALRGQVVQFKLSDIGEGIREVTVKEWYVKEGDTVSQFDSICEVQ SDKASVTITSRYDGVIKKLYYNLDDIAYVGKPLVDIETEALKDSEEDVVETPAVSHDE HTHQEIKGRKTLATPAVRRLAMENNIKLSEVVGSGKDGRILKEDILNYLEKQTGAILP PSPKVEIMPPPPKPKDMTVPILVSKPPVFTGKDKTEPIKGFQKAMVKTMSAALKIPHF GYCDEIDLTELVKLREELKPIAFARGIKLSFMPFFLKAASLGLLQFPILNASVDENCQ NITYKASHNIGIAMDTEQGLIVPNVKNVQICSIFDIATELNRLQKLGSVGQLSTTDLT GGTFTLSNIGSIGGTFAKPVIMPPEVAIGALGSIKAIPRFNQKGEVYKAQIMNVSWSA DHRVIDGATMSRFSNLWKSYLENPAFMLLDLK" repeat_region 1763..2032 /rpt_family="Alu 1" polyA_signal 2901..2906 polyA_signal 2923..2928 repeat_region 2954..3054 /rpt_family="Alu 2" repeat_region 3073..3359 /rpt_family="Alu 3" BASE COUNT 1105 a 644 c 763 g 1023 t ORIGIN 1 atttccgggg taagatggct gcagtccgta tgctgagaac ctggagcagg aatgcgggga 61 agctgatttg tgttcgctat tttcaaacat gtggtaatgt tcatgttttg aagccaaatt 121 atgtgtgttt ctttggttat ccttcattca agtatagtca tccacatcac ttcctgaaaa 181 caactgctgc tctccgtgga caggttgttc agttcaagct ctcagacatt ggagaaggga 241 ttagagaagt aactgttaaa gaatggtatg taaaagaagg agatacagtg tctcagtttg 301 atagcatctg tgaagttcaa agtgataaag cttctgttac catcactagt cgttatgatg 361 gagtcattaa aaaactctat tataatctag acgatattgc ctatgtgggg aagccattag 421 tagacataga aacggaagct ttaaaagatt cagaagaaga tgttgttgaa actcctgcag 481 tgtctcatga tgaacataca caccaagaga taaagggccg aaaaacactg gcaactcctg 541 cagttcgccg tctggcaatg gaaaacaata ttaagctgag tgaagttgtt ggctcaggaa 601 aagatggcag aatacttaaa gaagatatcc tcaactattt ggaaaagcag acaggagcta 661 tattgcctcc ttcacccaaa gttgaaatta tgccacctcc accaaagcca aaagacatga 721 ctgttcctat actagtatca aaacctccgg tattcacagg caaagacaaa acagaaccca 781 taaaaggctt tcaaaaagca atggtcaaga ctatgtctgc agccctgaag atacctcatt 841 ttggttattg tgatgagatt gaccttactg aactggttaa gctccgagaa gaattaaaac 901 ccattgcatt tgctcgtgga attaaactct cctttatgcc tttcttctta aaggctgctt 961 ccttgggatt actacagttt cctatcctta acgcttctgt ggatgaaaac tgccagaata 1021 taacatataa ggcttctcat aacattggga tagcaatgga tactgagcag ggtttgattg 1081 tccctaatgt gaaaaatgtt cagatctgct ctatatttga catcgccact gaactgaacc 1141 gcctccagaa attgggctct gtgggtcagc tcagcaccac tgatcttaca ggaggaacat 1201 ttactctttc caacattgga tcaattggtg gtacctttgc caaaccagtg ataatgccac 1261 ctgaagtagc cattggggcc cttggatcaa ttaaggccat tccccgattt aaccagaaag 1321 gagaagtata taaggcacag ataatgaatg tgagctggtc agctgatcac agagttattg 1381 atggtgctac aatgtcacgc ttctccaatt tgtggaaatc ctatttagaa aacccagctt 1441 ttatgctact agatctgaaa tgaagactga taagacattc ttgaactttt tgagcttcca 1501 aagagtatgt aaaccctagc tgtgccagca catgttcatc tttacaattt atattgtaaa 1561 cgatttgtat cgtatgatta aggatctaag gcacaatatt tgtcactgtt ctattagact 1621 ttttactgaa aatgaataat ggtgtaatgg ttctcctggg gctgtcacat tttataggtc 1681 agagtgtgac ttcttaatat ggtgctgatg tttttgtgtc aatggcttga aactggcaag 1741 attaacaaaa ttaggccggg catggtggct cacgcctgta atccagcact ttgggaggcc 1801 caggtggggc gatcacctga ggttagaagt ttgagaccag cctggccaac atggtgaaac 1861 ctggcctcta cctaaaaaat acaaaattga ccgggtgtgg tggtgggtac cgctacttgg 1921 gaggctgagg caggagaatc gcttgaacct gggaggtgga ggttgcagtg agctgagatc 1981 gtgctattgc actccagcct gggcgacaga gcaagacgcc atctcaaaaa caaaaaaaac 2041 aaaattcatg ttactaaaag acaggtagcc atatacagac agtatatgcc ctattttttt 2101 taactgactc ttaatgaaac tttaatttta cttaattaag aaatggaatt tatatacaaa 2161 aatattttcc atttccgtta ttatgctaat tgttgtatga aataagtgca attatacttc 2221 tcttttgaga tatccaagag tatattcttg ctctgtatag agaatatcat ctgatagtgt 2281 cttatttata ttaattaatg tctttgaaaa gggaaaagta taaactggcc ttaaaattgt 2341 ccaattatag ttttataacc agtctattaa aggtgtttgt ttaaaatgga tatagtttta 2401 gatttgtggt aatgctttgg tattttcttg gggaagacct tcacctttgc aaacttccct 2461 catgtaagga aggtacttta aatgtagcag ccactgacat ttcttttttt aaaaaaaatt 2521 tgagaagtct acttcctttt aacttttttg gtcttcagct aaaaaatagg ataagaaatt 2581 aaggtctatt ccattctcca tatcctgggt aagaatgtaa ataagaggag aaggaagagt 2641 ctaatagtaa ttatggatat aaaaaataag aaattttgta tagaaatgaa ggtttcataa 2701 tgatcatttt gttaaaggtc tactttaatc agaaatagca acgagatgaa tgtatccaac 2761 atttcaattt gcattcggaa atccatgttg tttctaatat tgtccagttg aaaactgtat 2821 gccaaaatta gttgtttaag tgaagttttg tgacagaaaa aaggttgttt taatatctac 2881 ttggtttttc tcaaaatgga aataatttta aaatcaggaa agaataaatc agccaggtgt 2941 gatgacttgt aactgtaatc ccagttatag gggaggctga agcaggagga tcacttgagg 3001 ccaggagttt gagaccagcc tgggcaacat agtgagatcc catctcaaaa aacattattt 3061 ttaaaattag cctggtggct cacgcctgta atcccagcac tttgggaggc cgaggtggcc 3121 agatcacctg aggtcaggag ttcgagacca ccctggccaa catggtgaaa ccccatctct 3181 acagttttgt aaaaatacaa aaattacctg ggcctggtgc acaggcctgt agtcccagct 3241 acttgggagg ctgaggcagg agaattgctt gagcccaaga ggtggaggtt acagtgagca 3301 gagatcacac cactgcactc cagcctgggt ggcagagcaa cacttcgtct cagaaaaaaa 3361 aaaaaaaacc aaaaaccaaa aagccaagtg tggtggtgtg cacctatagt cccagctact 3421 caggaagctg agacaagagg atcaattgag cccaggagtt caaagctgta gtgagctgtc 3481 attgtgccac tatcctccag tatgggtgac agagtgagac ctggtctcta aaaat // LOCUS HSDCREB 1024 bp RNA PRI 07-AUG-1991 DEFINITION Human delta CREB mRNA for cAMP-responsive element (CRE) binding protein. ACCESSION X60003 NID g30493 KEYWORDS cAMP response element; cAMP response element binding protein; CRE binding protein; delta CREB gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1024) AUTHORS Jungmann,R.A. TITLE Direct Submission JOURNAL Submitted (29-MAY-1991) R.A. Jungmann, Northwestern Univ Med Scool, Dept of CMS Biology, 303 E. Chicago Avenue, Chicago, IL 60611, USA REFERENCE 2 (bases 1 to 1024) AUTHORS Short,M.L., Manohar,C.F., Furtado,M.R., Ghadge,G.D., Wolinsky,S.M., Thimmapaya,B. and Jungmann,R.A. TITLE Nucleotide and derived amino-acid sequences of the CRE-binding proteins from rat C6 glioma and HeLa cells JOURNAL Nucleic Acids Res. 19 (15), 4290 (1991) MEDLINE 91334144 COMMENT See also X60002, M27691, M34353 & X14788. FEATURES Location/Qualifiers source 1..1024 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa S3" mRNA 1..1024 /gene="delta CREB" /evidence=experimental gene 1..1024 /gene="delta CREB" CDS 1..984 /gene="delta CREB" /codon_start=1 /db_xref="PID:g30494" /db_xref="SWISS-PROT:P16220" /translation="MTMDSGADNQQSGDAAVTEAENQQMTVQAQPQIATLAQVSMPAA HATSSAPTVTLVQLPNGQTVQVHGVIQAAQPSVIQSPQVQTVQISTIAESEDSQESVD SVTDSQKRREILSRRPSYRKILNDLSSDAPGVPRIEEEKSEEEASAPAITAVAVPTPI YRTSSGQYITITQRGAIQLASNGTDGVQGLQTLTMANAAATQPGTTILQYAQTTDGQQ ILVPSNQVVVQAASGDVQTYQIRTAPTSTIAPGVVMASSPALPTQPAEEAARKREVRL MKNREAARECRRKKKEYVKCLENRVAVLENQNKTLIEELKALKDLYCHKSD" misc_feature 346..360 /gene="delta CREB" /note="cAMP-dependent protein kinase consensus phosphorylation site" repeat_region 889..954 /note="leucine heptad repeat" BASE COUNT 325 a 243 c 241 g 215 t ORIGIN 1 atgaccatgg actctggagc agacaaccag cagagtggag atgcagctgt aacagaagct 61 gaaaaccaac aaatgacagt tcaagcccag ccacagattg ccacattagc ccaggtatct 121 atgccagcag ctcatgcaac atcatctgct cccaccgtaa ctctagtaca gctgcccaat 181 gggcagacag ttcaagtcca tggagtcatt caggcggccc agccatcagt tattcagtct 241 ccacaagtcc aaacagttca gatttcaact attgcagaaa gtgaagattc acaggagtca 301 gtggatagtg taactgattc ccaaaagcga agggaaattc tttcaaggag gccttcctac 361 aggaaaattt tgaatgactt atcttctgat gcaccaggag tgccaaggat tgaagaagag 421 aagtctgaag aggaggcttc agcacctgcc atcaccgctg tagcggtgcc aacgccaatt 481 taccggacta gcagtggaca gtatattacc attacccaga gaggagcaat acagctggct 541 agcaatggta ccgatggggt acagggcctg caaacattaa ccatggccaa tgcagcagcc 601 actcagccgg gtactaccat tctacagtat gcacagacca ctgatggaca gcagatctta 661 gtgcccagca accaagttgt tgttcaagct gcctctggag acgtacaaac ataccagatt 721 cgcacagcac ccactagcac tattgcccct ggagttgtta tggcatcctc cccagcactt 781 cctacacagc ctgctgaaga agcagcacga aagagagagg tccgtctaat gaagaacagg 841 gaagcagctc gtgagtgtcg tagaaagaag aaagaatatg tgaaatgttt agaaaacaga 901 gtggcagtgc ttgaaaatca aaacaagaca ttgattgagg agctaaaagc acttaaggac 961 ctttactgcc acaaatcaga ttaatttggg atttaaattt tcacctgtta aggtggaaga 1021 tgga // LOCUS HSDD2 2625 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for dopamine D2 receptor. ACCESSION X51362 NID g30495 KEYWORDS dopamine receptor D2; dopamine receptor D3; receptor; transmembrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2625) AUTHORS Robakis,N.K. TITLE Direct Submission JOURNAL Submitted (04-JAN-1990) Robakis N.K., Mount Sinai School of Medicine, City University of New York, One Gustave Levy Place, Box 1229, New York, NY 10029, USA REFERENCE 2 (bases 1 to 2625) AUTHORS Robakis,N.K., Mohamadi,M., Fu,D.Y., Sambamurti,K. and Refolo,L.M. TITLE Human retina D2 receptor cDNAs have multiple polyadenylation sites and differ from a pituitary clone at the 5' non-coding region JOURNAL Nucleic Acids Res. 18 (5), 1299 (1990) MEDLINE 90206805 COMMENT Data kindly reviewed (27-JUN-1990) by Robalis N. FEATURES Location/Qualifiers source 1..2625 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="retina" /clone_lib="lambda gt10" /clone="11.1 and 11.2" CDS 166..1497 /note="dopamine D2 receptor (AA 1-443)" /codon_start=1 /db_xref="PID:g30496" /db_xref="SWISS-PROT:P14416" /translation="MDPLNLSWYDDDLERQNWSRPFNGSDGKADRPHYNYYATLLTLL IAVIVFGNVLVCMAVSREKALQTTTNYLIVSLAVADLLVATLVMPWVVYLEVVGEWKF SRIHCDIFVTLDVMMCTASILNLCAISIDRYTAVAMPMLYNTRYSSKRRVTVMISIVW VLSFTISCPLLFGLNNADQNECIIANPAFVVYSSIVSFYVPFIVTLLVYIKIYIVLRR RRKRVNTKRSSRAFRAHLRAPLKGNCTHPEDMKLCTVIMKSNGSFPVNRRRVEAARRA QELEMEMLSSTSPPERTRYSPIPPSHHQLTLPDPSHHGLHSTPDSPAKPEKNGHAKDH PKIAKIFEIQTMPNGKTRTSLKTMSRRKLSQQKEKKATQMLAIVLGVFIICWLPFFIT HILNIHCDCNIPPVLYSAFTWLGYVNSAVNPIIYTTFNIEFRKAFLKILHC" BASE COUNT 541 a 895 c 663 g 526 t ORIGIN 1 ggcagccgtc cggggccgcc actctcctcg gccggtccct ggctcccgga ggcggccgcg 61 cgtggatgcg gcgggagctg gaagcctcaa gcagccggcg ccgtctctgc cccggggcgc 121 cctatggctt gaagagcctg gccacccagt ggctccaccg ccctgatgga tccactgaat 181 ctgtcctggt atgatgatga tctggagagg cagaactgga gccggccctt caacgggtca 241 gacgggaagg cggacagacc ccactacaac tactatgcca cactgctcac cctgctcatc 301 gctgtcatcg tcttcggcaa cgtgctggtg tgcatggctg tgtcccgcga gaaggcgctg 361 cagaccacca ccaactacct gatcgtcagc ctcgcagtgg ccgacctcct cgtcgccaca 421 ctggtcatgc cctgggttgt ctacctggag gtggtaggtg agtggaaatt cagcaggatt 481 cactgtgaca tcttcgtcac tctggacgtc atgatgtgca cggcgagcat cctgaacttg 541 tgtgccatca gcatcgacag gtacacagct gtggccatgc ccatgctgta caatacgcgc 601 tacagctcca agcgccgggt caccgtcatg atctccatcg tctgggtcct gtccttcacc 661 atctcctgcc cactcctctt cggactcaat aacgcagacc agaacgagtg catcattgcc 721 aacccggcct tcgtggtcta ctcctccatc gtctccttct acgtgccctt cattgtcacc 781 ctgctggtct acatcaagat ctacattgtc ctccgcagac gccgcaagcg agtcaacacc 841 aaacgcagca gccgagcttt cagggcccac ctgagggctc cactaaaggg caactgtact 901 caccccgagg acatgaaact ctgcaccgtt atcatgaagt ctaatgggag tttcccagtg 961 aacaggcgga gagtggaggc tgcccggcga gcccaggagc tggagatgga gatgctctcc 1021 agcaccagcc cacccgagag gacccggtac agccccatcc cacccagcca ccaccagctg 1081 actctccccg acccgtccca ccacggtctc cacagcactc ctgacagccc cgccaaacca 1141 gagaagaatg ggcatgccaa agaccacccc aagattgcca agatctttga gatccagacc 1201 atgcccaatg gcaaaacccg gacctccctc aagaccatga gccgtagaaa gctctcccag 1261 cagaaggaga agaaagccac tcagatgctc gccattgttc tcggcgtgtt catcatctgc 1321 tggctgccct tcttcatcac acacatcctg aacatacact gtgactgcaa catcccgcct 1381 gtcctgtaca gcgccttcac gtggctgggc tatgtcaaca gcgccgtgaa ccccatcatc 1441 tacaccacct tcaacattga gttccgcaag gccttcctga agatccttca ctgctgactc 1501 tgctgcctgc ccgcacagca gcctgcttcc cacctcctgc ccaggccagc cagcctcacc 1561 cttgcgaacc gtgagcagga aggcctgggt ggatcggcct cctcttcacc ccggcagccc 1621 tgcagtgttc gcttggctcc atgctcctca ctgcccgcac accctcactc tgccagggca 1681 gtgctagtga gctgggcatg gtaccagccc tggggtgccc ccagctcagg ggcagctcat 1741 agagtccccc ctcccacctc cagtccccct atccttggca ccaaagatgc agccgccttc 1801 cttgaccttc ctctggggct ctagggttgc tggagcctga gtcagggccc agaggctgag 1861 ttttctcttt gtggggcttg gcgtggagca ggcggtgggg agagatggac agttcacacc 1921 ctgcaaggcc cacaggaggc aagcaagctc tcttgccgag gagccaggca acttcagtcc 1981 tgggagacca tgtaaatacc agactgcagg ttggacccag agattcccaa gccaaaacct 2041 tagctccctc cgcacccgat gtgacctcta ctttccagct agtccgaccc acctcacccc 2101 gttacagctc cccaagtggt ttccacatgc tctgagaaga ggagccctca tcttgaaggg 2161 cccaggaggg tctatgggga gaggaactcc ttgcctagcc caccctgctg ccttctgacg 2221 gccctgcaat gtatcccttc tcacagcaca tgctgccagc ctggggcctg gcagggaggt 2281 caggccctgg aactctatct gggcctgggc taggggacat cagaggttct ttgagggact 2341 gcctctgcca cactctgacg caaaaccact ttccttttct attccttctg gcctttcctc 2401 tctcctgttt cccttccctt ccactgcctc tgccttagag gagcccacgg ctaagaggct 2461 gctgaaaacc atctgcctgg cctggccctg ccctgaggaa ggaggggaag ctgcagcttg 2521 ggagagcccc tggggctaga ctctgtaaca tcactatcca tgcaccaaac taataaaact 2581 ttgacgagtc accttccagg acccctgggt aaaaaaaaaa aaaaa // LOCUS HSDEK9 2699 bp RNA PRI 19-MAY-1997 DEFINITION H.sapiens dek mRNA. ACCESSION X64229 S89712 NID g30502 KEYWORDS dek gene; putative oncogene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2699) AUTHORS von Lindern,M. TITLE Direct Submission JOURNAL Submitted (20-JAN-1992) M. Von Lindern, Dept of Cell Biology, Erasmus University, P.O. Box 1738, 3000 DR Rotterdam, THE NETHERLANDS REFERENCE 2 (bases 1 to 2699) AUTHORS von Lindern,M., Fornerod,M., van Baal,S., Jaegle,M., de Wit,T., Buijs,A. and Grosveld,G. TITLE The translocation (6;9), associated with a specific subtype of acute myeloid leukemia, results in the fusion of two genes, dek and can, and the expression of a chimeric, leukemia-specific dek-can mRNA JOURNAL Mol. Cell. Biol. 12 (4), 1687-1697 (1992) MEDLINE 92195315 COMMENT See also X64228. FEATURES Location/Qualifiers source 1..2699 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /tissue_type="testes" /clone_lib="human testes (Clonetech)" /clone="Dk14, Dk9" /chromosome="6p23" gene 34..1161 /gene="dek" CDS 34..1161 /gene="dek" /codon_start=1 /product="putative oncogene" /db_xref="PID:g30503" /db_xref="SWISS-PROT:P35659" /translation="MSASAPAAEGEGTPTQPASEKEPEMPGPREESEEEEDEDDEEEE EEEKEKSLIVEGKREKKKVERLTMQVSSLQREPFTIAQGKGQKLCEIERIHFFLSKKK TDELRNLHKLLYNRPGTVSSLKKNVGQFSGFPFEKGSVQYKKKEEMLKKFRNAMLKSI CEVLDLERSGVNSELVKRILNFLMHPKPSGKPLPKSKKTCSKGSKKERNSSGMARKAK RTKCPEILSDESSSDEDEKKNKEESSDDEDKESEEEPPKKTAKREKPKQKATSKSKKS VKSANVKKADSSTTKKNQNSSKKESESEDSSDDEPLIKKLKKPPTDEELKETIKKLLA SANLEEVTMKQICKKVYENYPTYDLTERKDFIKTTVKELIS" BASE COUNT 962 a 413 c 546 g 777 t 1 others ORIGIN 1 ggcccgcggc ggccgaaatc cgcggttcac agcatgtccg cctcggcccc tgctgcggag 61 ggggagggaa cccccaccca gcccgcgtcc gagaaagaac ccgaaatgcc cggtcccaga 121 gaggagagcg aggaggaaga ggacgaggac gacgaggagg aggaggagga ggaaaaagaa 181 aagagtctca tcgtggaagg caagagggaa aagaaaaaag tagagaggtt gacaatgcaa 241 gtctcttcct tacagagaga gccatttaca attgcacaag gaaaggggca gaaactttgt 301 gaaattgaga ggatacattt ttttctaagt aagaagaaaa ccgatgaact tagaaatcta 361 cacaaactgc tttacaacag gccaggcact gtgtcctcat taaagaagaa tgtgggtcag 421 ttcagtggct ttccatttga aaaaggaagt gtccaatata aaaagaagga agaaatgttg 481 aaaaaattta gaaatgccat gttaaagagc atctgtgagg ttcttgattt ggagagatca 541 ggtgtaaata gtgaactagt gaagaggatc ttgaatttct taatgcatcc aaagccttct 601 ggcaaaccat tgccgaaatc taaaaaaact tgtagcaaag gcagtaaaaa ggaacggaac 661 agttctggaa tggcaaggaa ggctaagcga accaaatgtc ctgaaattct gtcagatgaa 721 tctagtagtg atgaagatga aaagaaaaac aaggaagagt cttcagatga tgaagataaa 781 gaaagtgaag aggagccacc aaaaaagaca gccaaaagag aaaaacctaa acagaaagct 841 acttctaaaa gtaaaaaatc tgtgaaaagt gccaatgtta agaaagcaga tagcagcacc 901 accaagaaga atcaaaacag ttccaaaaaa gaaagtgagt ctgaggatag ttcagatgat 961 gaacctttaa ttaaaaagtt gaagaaaccc cctacagatg aagagttaaa ggaaacaata 1021 aagaaattac tggccagtgc taacttggaa gaagtcacaa tgaaacagat ttgcaaaaag 1081 gtctatgaaa attatcctac ttatgattta actgaaagaa aagatttcat aaaaacaact 1141 gtaaaagagc taatttcttg agatagagga cagagaagat gactcgttcc catagatttg 1201 aagatctgat ttataccatt ataccagcaa agagaatgta tttccttttc taaatccttg 1261 ttaagcaacg ttagtagaac ttactgctga cctttttatc ttgagtgtta tgtgaatttg 1321 agtttgctgt tttaaattgc atttctatgc catttttagt ttaaaatctt gcatggcatt 1381 aattgttcct tgcttttata gttgtatttt gtacattttg gatttcttta tataaggtca 1441 tagattcttg agctgttgtg gtttttagtg cacttaatat tagcttgctt aaggcatact 1501 tttaatcaag tagaacaaaa actattatca ccaggattta tacatacaga gattgtagta 1561 tttagtatat gaaatatttt gaatacacat ctctgtcagt gtgaaaattc agcggcagtg 1621 tgtccatcat attaaaaata tacaagctac agttgtccag atcactgaat tggaactttt 1681 ctcctgcatg tgtatatatg tcaaattgtc agcatgacaa aagtgacaga tgttattttn 1741 gtatttttaa aaaacaattg gttgtatata aagttttttt atttcttttg tgcagatcac 1801 tttttaaact cacataggta ggtatcttta tagttgtaga ctatggaatg tcagtgttca 1861 gccaaacagt atgatggaac agtgaaagtc aattcagtga tggcaacact gaaggaacag 1921 ttaccctgct ttgcctcgaa agtgtcatca atttgtaatt ttagtattaa ctctgtaaaa 1981 gtgtctgtag gtacgtttta tattatataa ggacagacca aaaatcaacc tatcaaagct 2041 tcaaaaactt tgggaaaggg tgggattaag tacaagcaca tttggcttac agtaaatgaa 2101 ctgattttta ttaactgctt ttgcccatat aaaatgctga tatttactgg aaacctagcc 2161 agcttcacga ttatgactaa agtaccagat tataatgcca gaatataatg tgcaggcaat 2221 cgtggatgtc tctgacaaag tgtgtctcaa aaataatata cttttacatt aaagaaattt 2281 aatgtttctc tggagttggg gctcttggct ttcagagttt ggttaatcag tgttgattct 2341 agatgatcaa cataatggac cactcctgaa tgagacttaa ttttgtcttt caaatttact 2401 gtcttaaatc agtttattaa atctgaattt taaaacatgc tgtttatgac acaatgacac 2461 atttgttgca ccaattaagt gttgaaaaat atctttgcat catagaacag aaatatataa 2521 aaatatatgt tgaatgttaa caggtatttt cacaggtttg tttcttgata gttactcaga 2581 cactagggaa aggtaaatac aagtgaacaa aataagcaac taaatgagac ctaataattg 2641 gccttcgatt ttaaatattt gttcttataa accttgtcaa taaaaataaa tctaaatca // LOCUS HSDERMATA 729 bp RNA PRI 03-NOV-1993 DEFINITION H.sapiens dermatopontin mRNA, complete CDS. ACCESSION Z22865 NID g311613 KEYWORDS dermatopontin; proteoglycan-binding cell-adhesion protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 729) AUTHORS Superti-Furga,A., Rocchi,M., Schafer,B.W. and Gitzelmann,R. TITLE Complementary DNA sequence and chromosomal mapping of a human proteoglycan-binding cell-adhesion protein (dermatopontin) JOURNAL Genomics 17 (2), 463-467 (1993) MEDLINE 94010945 REFERENCE 2 (bases 1 to 729) AUTHORS Superti-Furga,A. TITLE Direct Submission JOURNAL Submitted (28-MAY-1993) Andrea Superti-Furga, Pediatrics, University of Zurich, Steinwiesstrasse 75, Zurich, 8032, Switzerland FEATURES Location/Qualifiers source 1..729 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="fibrosarcoma" /cell_line="HT-1080" /clone_lib="lambda gt11" sig_peptide 13..66 CDS 13..618 /codon_start=1 /product="dermatopontin" /db_xref="PID:g311614" /translation="MDLSLLWVLMPLVTMAWGQYGDYGYPYQQYHDYSDDGWVNLNRQ GFSYQCPQGQVIVAVRSIFSKKEGSDRQWNYACMPTPQSLGEPTECWWEEINRAGMEW YQTCSNNGLVAGFQSRYFESVLDREWQFYCCRYSKRCPYSCWLTTEYPGHYGEEMDMI SYNYDYYIRGATTTFSAVERDRQWKFIMCRMTEYDCEFANV" BASE COUNT 189 a 161 c 207 g 172 t ORIGIN 1 gaattcggga gcatggacct cagtcttctc tgggtactta tgcccctagt caccatggcc 61 tggggccagt atggcgatta tggataccca taccagcagt atcatgacta cagcgatgat 121 gggtgggtga atttgaatcg gcaaggcttc agctaccagt gtccccaggg gcaggtgata 181 gtggccgtga ggagcatctt cagtaagaag gaaggttctg acagacaatg gaactacgcc 241 tgcatgccca cgccacagag cctcggggaa cccacggagt gctggtggga ggagatcaac 301 agggctggca tggaatggta ccagacgtgc tccaacaatg ggctggtggc aggattccag 361 agccgctact tcgagtcagt gctggatcgg gagtggcagt tttactgttg tcgctacagc 421 aagaggtgcc catattcctg ctggctaaca acagaatatc caggtcacta tggtgaggaa 481 atggacatga tttcctacaa ttatgattac tatatccgag gagcaacaac cactttctct 541 gcagtggaaa gggatcgcca gtggaagttc ataatgtgcc ggatgactga atacgactgt 601 gaatttgcaa atgtttagat ttgccacata ccaaatctgg gtgaaaggaa aggggccctc 661 cagctttcca ctgcagagaa agtggttgtt gctcctcggt atatgtaatc ataattgtag 721 atcgaattc // LOCUS HSDESMOC 4195 bp RNA PRI 22-JUN-1994 DEFINITION H.sapiens mRNA for desmocollin type 1 (diferentially spliced). ACCESSION Z34522 NID g505536 KEYWORDS desmocollin type 1; desmocollin type 2; Dsc1a; Dsc1b. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4195) AUTHORS Theis,D.G., Koch,P.J. and Franke,W.W. TITLE Differential synthesis of type 1 and type 2 desmocollin mRNAs in human stratified epithelia JOURNAL Int. J. Dev. Biol. 37 (1), 101-110 (1993) MEDLINE 93283249 REFERENCE 2 (bases 1 to 4195) AUTHORS Zimbelmann,R. TITLE Direct Submission JOURNAL Submitted (21-JUN-1994) Zimbelmann R., German Cancer Research Center, Institute for Cell Biology, Im Neuenheimer Feld 280, D-69120 Heidelberg, Federal Republik of Germany FEATURES Location/Qualifiers source 1..4195 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HEDCT1" /tissue_type="Foreskin" CDS 191..2713 /codon_start=1 /product="Dsc1a precursor" /db_xref="PID:g505537" /translation="MALASAAPGSIFCKQLLFSLLVLTLLCDACQKVYLRVPSHLQAE TLVGKVNLEECLKSASLIRSSDPAFRILEDGSIYTTHDLILSSERKSFSIFLSDGQRR EQQEIKVVLSARENKSPKKRHTKDTALKRTKRRWAPIPASLMENSLGPFPQHVQQIQS DAAQNYTIFYSISGPGVDKEPFNLFYIEKDTGDIFCTRSIDREKYEQFALYGYATTAD GYAPEYPLPLIIKIEDDNDNAPYFEHRVTIFTVPENCRSGTSVGKVTATDLDEPDTLH TRLKYKILQQIPDHPKHFSIHPDTGVITTTTPFLDREKCDTYQLIMEVRDMGGQPFGL FNTGTITISLEDENDNPPSFTETSYVTEVEENRIDVEILRMKVQDQDLPNTPHSKAVY KILQGNENGNFIISTDPNTNEGVLCVVKPLNYEVNRQVILQVGVINEAQFSKAASSQT PTMCTTTVTVKIIDSDEGPECHPPVKVIQSQDGFPAGQELLGYKALDPEISSGEGLRY QKLGDEDNWFEINQHTGDLRTLKVLDRESKFVKNNQYNISVVAVDAVGRSCTGTLVVH LDDYNDHAPQIDKEVTICQNNEDFAVLKPVDPDGPENGPPFQFFLDNSASKNWNIEEK DGKTAILRQRQNLDYNYYSVPIQIKDRHGLVATHMLTVRVCDCSTPSECRMKDKSTRD VRPNVILGRWAILAMVLGSVLLLCILFTCFCVTAKRTVKKCFPEDIAQQNLIVSNTEG PGEEVTEANIRLPMQTSNICDTSMSVGTVGGQGIKTQQSFEMVKGGYTLDSNKGGGHQ TLESVKGVGQGDTGRYAYTDWQSFTQPRLGEESIRGHTLIKN" CDS join(191..2676,2723..2921) /note="alternative spliced" /codon_start=1 /product="Dsc1b precursor" /db_xref="PID:g505538" /translation="MALASAAPGSIFCKQLLFSLLVLTLLCDACQKVYLRVPSHLQAE TLVGKVNLEECLKSASLIRSSDPAFRILEDGSIYTTHDLILSSERKSFSIFLSDGQRR EQQEIKVVLSARENKSPKKRHTKDTALKRTKRRWAPIPASLMENSLGPFPQHVQQIQS DAAQNYTIFYSISGPGVDKEPFNLFYIEKDTGDIFCTRSIDREKYEQFALYGYATTAD GYAPEYPLPLIIKIEDDNDNAPYFEHRVTIFTVPENCRSGTSVGKVTATDLDEPDTLH TRLKYKILQQIPDHPKHFSIHPDTGVITTTTPFLDREKCDTYQLIMEVRDMGGQPFGL FNTGTITISLEDENDNPPSFTETSYVTEVEENRIDVEILRMKVQDQDLPNTPHSKAVY KILQGNENGNFIISTDPNTNEGVLCVVKPLNYEVNRQVILQVGVINEAQFSKAASSQT PTMCTTTVTVKIIDSDEGPECHPPVKVIQSQDGFPAGQELLGYKALDPEISSGEGLRY QKLGDEDNWFEINQHTGDLRTLKVLDRESKFVKNNQYNISVVAVDAVGRSCTGTLVVH LDDYNDHAPQIDKEVTICQNNEDFAVLKPVDPDGPENGPPFQFFLDNSASKNWNIEEK DGKTAILRQRQNLDYNYYSVPIQIKDRHGLVATHMLTVRVCDCSTPSECRMKDKSTRD VRPNVILGRWAILAMVLGSVLLLCILFTCFCVTAKRTVKKCFPEDIAQQNLIVSNTEG PGEEVTEANIRLPMQTSNICDTSMSVGTVGGQGIKTQQSFEMVKGGYTLDSNKGGGHQ TLESVKGVGQGDTGRYAYTDWQSFTQPRLGEKVYLCGQDEEHKHCEDYVFSYNYEGKG SLAGSVGCCSDRQEEEGLEFLDHLEPKFRTLAKTCIKK" mat_peptide 592..2713 /product="Dsc1a" mat_peptide 592..2921 /note="alternative spliced" /product="Dsc1b" exon 2678..2723 /note="alternatively spliced exon" BASE COUNT 1290 a 851 c 885 g 1169 t ORIGIN 1 cgaagaaatt ctcccgttgc tcctcctact gtttatcact tgcctccgga ctgtcttcca 61 aaccaagctc agctgcatca aggtggcagc agaataccct gtgcaagtgc cagcgtcttc 121 ttagccgctc tgtgcatccc aggctgccct gttatctggc caccgtccct ggccattggg 181 actgcttctg atggctctgg cctctgctgc cccagggagc atcttctgta agcagctcct 241 tttctctctc ctggttttaa cattactttg cgatgcttgt cagaaagttt atcttcgagt 301 tccttctcat cttcaggctg aaacacttgt aggcaaagtg aatctggagg agtgtctcaa 361 gtcggccagc ctaatccggt ccagtgaccc tgccttcaga attctagaag atggctcaat 421 ttacacaaca catgacctca ttttgtcttc tgaaaggaaa agtttttcca ttttcctttc 481 agatggtcag agacgggaac aacaagagat aaaagttgta ctgtcagcaa gagaaaacaa 541 gtctcctaag aagagacata ccaaagacac agccctcaag cgcacgaaga gacgatgggc 601 tcctattcca gcttcattga tggagaactc gttgggtcca tttccacaac acgttcagca 661 gatccaatct gatgctgcac agaattacac catcttttat tccataagtg ggccaggcgt 721 ggacaaagaa cccttcaatt tgttttacat agagaaagac actggggata tcttttgtac 781 aaggagcatt gaccgtgaga aatatgaaca gtttgcgtta tatggctatg caacaactgc 841 agatggctat gcaccagaat atccactccc tttgatcatc aaaattgaag atgataatga 901 taacgcccca tattttgaac acagagtgac tatctttact gtgcctgaaa attgccgatc 961 cggaacttca gtgggaaaag tgaccgccac agaccttgac gaacctgaca ctctccatac 1021 tcgtctgaaa tataaaatct tacaacaaat cccagatcat ccaaagcatt tctccataca 1081 cccagatacc ggtgtcatca ccacaactac accttttctg gatagagaaa aatgtgatac 1141 ttaccagtta ataatggaag tgcgagacat gggtggtcag cctttcggtt tatttaatac 1201 aggaacaatt actatttcac ttgaggatga aaatgacaat ccaccatctt tcacagaaac 1261 ttcttatgtt acagaagtag aagaaaacag aattgacgtg gagattttgc gaatgaaggt 1321 acaggatcag gatttgccaa acactcctca ctcaaaggct gtatacaaaa tcttacaagg 1381 aaatgaaaat ggaaacttca taattagcac agatccaaat acaaatgaag gagtgctgtg 1441 tgttgtcaag ccattgaact atgaagtcaa tcgccaagtt attttgcaag ttggtgtcat 1501 taacgaggca caattctcta aagcagcgag ctcacaaact cctacaatgt gcactacaac 1561 tgtcaccgtt aaaattatag acagtgatga gggccctgaa tgccaccctc cagtgaaagt 1621 tattcagagt caagatggct tcccagctgg ccaagaactc cttggataca aagcactgga 1681 cccggaaata tccagtggtg aaggcttaag gtatcagaag ttaggggatg aagataactg 1741 gtttgaaatt aatcaacaca ctggcgactt gagaactcta aaagtactag atagagaatc 1801 caaatttgta aaaaacaacc aatacaatat ttcagttgtt gcagtggatg cagttggccg 1861 atcttgcact ggaacattag tagttcattt ggatgattac aacgatcacg cacctcaaat 1921 tgacaaagaa gtgaccattt gtcagaataa tgaggatttt gctgttctga aacctgtaga 1981 tccagatgga cctgaaaatg gaccaccttt tcaattcttt ctggataatt ctgccagtaa 2041 aaactggaac atagaagaaa aggatggtaa aactgccatt cttcgtcaac ggcaaaatct 2101 tgattataac tattattctg tgcctattca aataaaagac aggcatggtt tagttgcaac 2161 acatatgtta acagtgagag tatgtgactg ttcaactcca tctgagtgta gaatgaagga 2221 taaaagtaca agagacgtta gaccaaatgt aatacttgga agatgggcta ttcttgctat 2281 ggtgttgggt tctgtattgt tattatgtat tctgtttacg tgtttctgtg tcactgctaa 2341 gagaacagtc aagaaatgtt ttccagaaga catagcccag caaaatttaa ttgtatcaaa 2401 tactgaagga cctggagaag aagtaacgga agcaaatatt agactcccca tgcagacatc 2461 caacatttgt gacacaagca tgtctgttgg tactgttggt ggccagggaa tcaaaacaca 2521 gcaaagtttt gagatggtca aaggaggcta cactttggat tccaacaaag gaggtggaca 2581 tcagaccttg gagtccgtca agggagtggg gcagggagat actggcagat atgcgtacac 2641 ggactggcag agtttcaccc aacctcggct tggcgaagaa tccattagag gacacactct 2701 gattaaaaat taaacagtaa aagaaggtgt atttgtgtgg acaagatgag gagcataaac 2761 attgtgaaga ctacgttttt tcttataact atgaaggcaa aggttctctg gccggctcag 2821 taggttgctg cagcgatcgg caggaagaag agggactgga gtttctagat cacctggaac 2881 ccaaatttag gacattagca aagacatgca tcaagaaata aatgtgcctt ttaatagtgt 2941 aatatccaca gatgcataag taggaattta ttacttgcag aatgttagca gcatctgcta 3001 atgtttttgt ttatggaggt aaactttgtc atgtataggt aagggtacta taaatatgag 3061 attcccctac attctccttg tctggtataa cttccatgtt ctctagaaat caaggttttg 3121 tttgttaatt ctcttttata tgcatgtata tattgccctt ttcacgactg tactgtacac 3181 cttcttgcac cttttatttg caaactgatg ttactttttg tgctgtggaa gagcatttgg 3241 gaaagctggg tattatagag gccaatgaaa gatgaatttg cattgtagat gtacgaatta 3301 aatatgttct tcaaaatctt ggggagaatt atgttcttag aacatagttg gtgccagata 3361 attgcattct ctccacctga gtgtttaaaa aggactttta agtattcttc agtgcaatct 3421 tcagttttgt gattaagttc atttctcttt tacacttttg tactcctcag agcagtgctc 3481 cagcattgtt ttctttcagg atccttcaga gctcagtccc tggacctctg cccatgtgga 3541 tttgttgtta ggtcactcca acttctaggg ttcttggaaa gataaggacc agaacaagct 3601 catagcaaat tgaggggcag agattttatg aagattacat gagaagattt ccatgaaaga 3661 attgcagccc tgaggtccat gggttgactt atgctcacaa atatgtttcg tttgctcaac 3721 atggtttact actaacattt taaaaatata aatactttag caaaaacatt cactcttgag 3781 tttgacatag gcctgcctta tctgtgttgc cacctgccat ctccaagcat ttggacaact 3841 agccctgagt cattaggctg caactctgat atacagagac tagcaccttg aatatgccag 3901 aaattgaatt accatctgta ttagaactta agactcagcc taaatttaca gttactttaa 3961 gaaaatgggc agtcagaatt agggactaga atgtatatga gaaaccccca ctctactaaa 4021 aatataagaa attagccgga catggtggcg aatgactgta atcccagcta ctcaggaggc 4081 tgaggcagga gaatcgcttg aatccaggag gcggaggttg cagtgagccg agattgccac 4141 tgcactccag cctgggcaac aagagcgaaa ctccgtctca aaaaaaaaaa aaaaa // LOCUS HSDESMOG2 3516 bp RNA PRI 05-DEC-1994 DEFINITION H.sapiens mRNA for desmoglein 2. ACCESSION Z26317 S64273 NID g416177 KEYWORDS desmoglein; desmoglein type 2. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3516) AUTHORS Schaefer,S., Troyanovsky,S.J., Heid,H.W., Eshkind,L., Koch,P.J. and Franke,W.W. TITLE Cytoskeletal architecture and epithelial differentiation: Molecular determinations of cell interactions cytoskeletal filament anchorage JOURNAL Unpublished REFERENCE 2 (bases 1 to 1987) AUTHORS Koch,P.J., Goldschmidt,M.D., Walsh,M.J., Zimbelmann,R. and Franke,W.W. TITLE Complete amino acid sequence of the epidermal desmoglein precursor polypeptide and identification of a second type of desmoglein gene JOURNAL Eur. J. Cell Biol. 55 (2), 200-208 (1991) MEDLINE 92037656 REFERENCE 3 (bases 1 to 3516) AUTHORS Zimbelmann,R. TITLE Direct Submission JOURNAL Submitted (16-SEP-1993) Zimbelmann R., German Cancer Research Center, Division of Cell Biology, Im Neuenheimer Feld 280, 69120 Heidelberg, Fed. Rep. of Germany REFERENCE 4 (bases 1 to 3516) AUTHORS Schafer,S., Koch,P.J. and Franke,W.W. TITLE Identification of the ubiquitous human desmoglein, Dsg2, and the expression catalogue of the desmoglein subfamily of desmosomal cadherins JOURNAL Exp. Cell Res. 211 (2), 391-399 (1994) MEDLINE 94192736 FEATURES Location/Qualifiers source 1..3516 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hdsg2" /cell_line="Colon Carcinoma CaCo2" /clone_lib="Colon Carcinoma CaCo2 cDNA library" CDS 12..3365 /codon_start=1 /product="desmoglein 2" /db_xref="PID:g416178" /translation="MARTRDRVRLLLLLICFNVGSGLHLQVLSTRNENKLLPKHPHLV RQKRAWITAPVALREGEDLSKKNPIAKIHSDLAEERGLKITYKYTGKGITEPPFGIFV FNKDTGELNVTSILDREETPFFLLTGYALDARGNNVEKPLELRIKVLDINDNEPVFTQ DVFVGSVEELSAAHTLVMKINATDADEPNTLNSKISYRIVSLEPAYPPVFYLNKDTGE IYTTSVTLDREEHSSYTLTVEARDGNGEVTDKPVKQAQVQIRILDVNDNIPVVENKVL EGMVEENQVNVEVTRIKVFDADEIGSDNWLANFTFASGNEGGYFHIETDAQTNEGIVT LIKEVDYEEMKNLDFSVIVANKAAFHKSIRSKYKPTPIPIKVKVKNVKEGIHFKSSVI SIYVSESMDRSSKGQIIGNFQAFDEDTGLPAHARYVKLEDRDNWISVDSVTSEIKLAK LPDFESRYVQNGTYTVKIVAISEDYPRKTITGTVLINVEDINDNCPTLIEPVQTICHD AEYVNVTAEDLDGHPNSGPFSFSVIDKPPGMAEKWKIARQESTSVLLQQSEKKLGRSE IQFLISDNQGFSCPEKQVLTLTVCEVLHGSGCREAQHDSYVGLGPAAIALMILAFLLL LLVPLLLLMCHCGKGAKAFTPIPGTIEMLHPWNNEGAPPEDKVVPSFLPVDQGGSLVG RNGVGGMAKEATMKGSSSASIVKGQHEMSEMDGRWEEHRSLLSGRATQFTGATGAIMT TETTKTARATGASRDMAGAQAAAVALNEEFLRNYFTDKAASYTEEDENHTAKDCLLVY SQEETESLNASIGCCSFIEGELDDRFLDDLGLKFKTLAEVCLGQKIDINKEIEQRQKP ATETSMNTASHSLCEQTMVNSENTYSSGSSFPVPKSLQEANAEKVTQEIVTERSVSSR QAQKVATPLPDPMASRNVIATETSYVTGSTMPPTTVILGPSQPQSLIVTERVYAPAST LVDQPYANEGTVVVTERVIQPHGGGSNPLEGTQHLQDVPYVMVRERESFLAPSSGVQP TLAMPNIAVGQNVTVTERVLAPASTLQSSYQIPTENSMTARNTTVSGAGVPGPLPDFG LEESGHSNSTITTSSTRVTKHSTVQHSYS" mat_peptide 156..3362 /product="desmoglein 2" BASE COUNT 1097 a 745 c 803 g 871 t ORIGIN 1 tcgagggtgc gatggcgcgg acgcgggacc gcgtacgcct gctgcttctc ctgatctgct 61 ttaacgttgg aagtggactt cacttacagg tcttaagcac aagaaatgaa aataagctgc 121 ttcctaaaca tcctcattta gtgcggcaaa agcgcgcctg gatcaccgcc cccgtggctc 181 ttcgggaggg agaggatctg tccaagaaga atccaattgc caagatacat tctgatcttg 241 cagaagaaag aggactcaaa attacttaca aatacactgg aaaagggatt acagagccac 301 cttttggtat atttgtcttt aacaaagata ctggagaact gaatgttacc agcattcttg 361 atcgagaaga aacaccattt tttctgctaa caggttacgc tttggatgca agaggaaaca 421 atgtagagaa acccttagag ctacgcatta aggttcttga tatcaatgac aacgaaccag 481 tgttcacaca ggatgtcttt gttgggtctg ttgaagagtt gagtgcagca catactcttg 541 tgatgaaaat caatgcaaca gatgcagatg agcccaatac cctgaattcg aaaatttcct 601 atagaatcgt atctctggag cctgcttatc ctccagtgtt ctacctaaat aaagatacag 661 gagagattta tacaaccagt gttaccttgg acagagagga acacagcagc tacactttga 721 cagtagaagc aagagatggc aatggagaag ttacagacaa acctgtaaaa caagctcaag 781 ttcagattcg tattttggat gtcaatgaca atatacctgt agtagaaaat aaagtgcttg 841 aagggatggt tgaagaaaat caagtcaatg tagaagttac gcgcataaaa gtgttcgatg 901 cagatgaaat aggttctgat aattggctgg caaattttac atttgcatca ggaaatgaag 961 gaggttattt ccacatagaa acagatgctc aaactaacga aggaattgtg acccttatta 1021 aggaagtaga ttatgaagaa atgaagaatc ttgacttcag tgttattgtc gctaataaag 1081 cagcttttca caagtcgatt aggagtaaat acaagcctac acccattccc atcaaggtca 1141 aagtgaaaaa tgtgaaagaa ggcattcatt ttaaaagcag cgtcatctca atttatgtta 1201 gcgagagcat ggatagatca agcaaaggcc aaataattgg aaattttcaa gcttttgatg 1261 aggacactgg actaccagcc catgcaagat atgtaaaatt agaagataga gataattgga 1321 tctctgtgga ttctgtcaca tctgaaatta aacttgcaaa acttcctgat tttgaatcta 1381 gatatgttca aaatggcaca tacactgtaa agattgtggc catatcagaa gattatccta 1441 gaaaaaccat cactggcaca gtccttatca atgttgaaga catcaacgac aactgtccca 1501 cactgataga gcctgtgcag acaatctgtc acgatgcaga gtatgtgaat gttactgcag 1561 aggacctgga tggacaccca aacagtggcc ctttcagttt ctccgtcatt gacaaaccac 1621 ctggcatggc agaaaaatgg aaaatagcac gccaagaaag taccagtgtg ctgctgcaac 1681 aaagtgagaa aaagcttggg agaagtgaaa ttcagttcct gatttcagac aatcagggtt 1741 ttagttgtcc tgaaaagcag gtccttacac tcacagtttg tgaggttctg catggcagcg 1801 gctgcaggga agcacagcat gactcctatg tgggcctggg acccgcagca attgcgctca 1861 tgattttggc ctttctgctc ctgctattgg taccactttt actgctgatg tgccattgcg 1921 gaaagggcgc caaagcgttt acccccatac ctggcaccat agagatgctg catccttgga 1981 ataatgaagg agcaccacct gaagacaagg tggtgccatc atttctgcca gtggatcaag 2041 ggggcagtct agtaggaaga aatggagtag gaggtatggc caaggaagcc acgatgaaag 2101 gaagtagctc tgcttccatt gtcaaagggc aacatgagat gtccgagatg gatggaaggt 2161 gggaagaaca cagaagcctg ctttctggta gagctaccca gtttacaggg gccacaggcg 2221 ctatcatgac cactgaaacc acgaagaccg caagggccac aggggcttcc agagacatgg 2281 ccggagctca ggcagctgct gttgcactga acgaagaatt cttaagaaat tatttcactg 2341 ataaagcggc ctcttacact gaggaagatg aaaatcacac agccaaagat tgccttctgg 2401 tttattctca ggaagaaact gaatcgctga atgcttctat tggttgttgc agttttattg 2461 aaggagagct agatgaccgc ttcttagatg atttgggact taaattcaag acgctagctg 2521 aagtttgcct gggtcaaaaa atagatataa ataaggaaat tgagcagaga caaaaacctg 2581 ccacagaaac aagtatgaac acagcttcac attcactctg tgagcaaact atggttaatt 2641 cagagaatac ctactcctct ggcagtagct tcccagttcc aaaatctttg caagaagcca 2701 atgcagagaa agtaactcag gaaatagtca ctgaaagatc tgtgtcttct aggcaggcgc 2761 aaaaggtagc tacacctctt cctgacccaa tggcttctag aaatgtgata gcaacagaaa 2821 cttcctatgt cacagggtcc actatgccac caaccactgt gatcctgggt cctagccagc 2881 cacagagcct tattgtgaca gagagggtgt atgctccagc ttctaccttg gtagatcagc 2941 cttatgctaa tgaaggtaca gttgtggtca ctgaaagagt aatacagcct catgggggtg 3001 gatcgaatcc tctggaaggc actcagcatc ttcaagatgt accttacgtc atggtgaggg 3061 aaagagagag cttccttgcc cccagctcag gtgtgcagcc tactctggcc atgcctaata 3121 tagcagtagg acagaatgtg acagtgacag aaagagttct agcacctgct tccactctgc 3181 aatccagtta ccagattccc actgaaaatt ctatgacggc taggaacacc acggtgtctg 3241 gagctggagt ccctggccct ctgccagatt ttggtttaga ggaatctggt cattctaatt 3301 ctaccataac cacatcttcc accagagtca ccaagcatag cactgtacag cattcttact 3361 cctaaacagc agtcagccac aaactgaccc agagtttaat tagcagtgac taatttcatg 3421 tttccaatgt acctgatttt tcatgagcct tacagacaca cagagacaca tacacattga 3481 tcttaaaatt tttctcagtc actgatatgc aaagga // LOCUS HSDEUBIQ 3158 bp RNA PRI 18-DEC-1995 DEFINITION H.sapiens mRNA for de-ubiquitinase. ACCESSION X91349 NID g1122277 KEYWORDS de-ubiquitinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3158) AUTHORS Falquet,L., Paquet,N., Frutiger,S., Hughes,G.J., Hoang-Van,K. and Jaton,J.C. TITLE cDNA cloning of a human 100 kDa de-ubiquitinating enzyme: the 100 kDa human de-ubiquitinase belongs to the ubiquitin C-terminal hydrolase family 2 (UCH2) JOURNAL FEBS Lett. 376 (3), 233-237 (1995) MEDLINE 96105388 REFERENCE 2 (bases 1 to 3152) AUTHORS Falquet,L. TITLE Direct Submission JOURNAL Submitted (07-SEP-1995) L. Falquet, University Medical Center, Medical Biochemistry, 1, Rue Michel-Servet, 1211 Geneva 4, SWITZERLAND FEATURES Location/Qualifiers source 1..3158 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="brain" CDS 30..2606 /codon_start=1 /product="de-ubiquitinase" /db_xref="PID:e208113" /db_xref="PID:g1122278" /db_xref="SWISS-PROT:P45974" /translation="MADVSEEALLSVLPTIRVPKAGDRVHKDECAFSFDTPESEGGLY VCMNTFLGFGKQYVERHFNKTGQRVYLHLRRTRRPKEEDPATGTGDPPRKKPTRLAIG VEGGFDLSEEKFELDEDVKIVILPDYLEIARDGLGGLPDIVRDRVTSAVEALLSADSA SRKQEVQAWDGEVRQVSKHAFSLKQLDNPARIPPCGWKCSKCDMRENLWLNLTDGSIL CGRRYFDGSGGNNHAVEHYRETGYPLAVKLGTITPDGADVYSYDEDDMVLDPSLAEHL SHFGIDMLKMQKTDKTMTELEIDMNQRIGEWELIQESGVPLKPLFGPGYTGIRNLGNS CYLNSVVQVLFSIPDFQRKYVDKLEKIFQNAPTDPTQDFSTQVAKLGHGLLSGEYSKP VPESGDGERVPEQKEVQDGIAPRMFKALIGKGHPEFSTNRQQDAQEFFLHLINMVERN CRSSENPNEVFRFLVEEKIKCLATEKVKYTQRVDYIMQLPVPMDAALNKEELLEYEEK KRQAEEEKMALPELVRAQVPFSSCLEAYGAPEQVDDFWSTALQAKSVAVKTTRFASFP DYLVIQIKKFTFGLDWVPKKLDVSIEMPEELDISQLRGTGLQPGEEELPDIAPPLVTP DEPKGSLGFYGNEDEDSFCSPHFSSPTSPMLDESVIIQLVEMGFPMDACRKAVYYTGN SGAEAAMNWVMSHMDDPDFANPLILPGSSGPGSTSAAADPPPEDCVTTIVSMGFSRDQ ALKALRATNNSLERAVDWIFSHIDDLDAEAAMDISEGRSAADSISESVPVGPKVRDGP GKYQLFAFISHMGTSTMCGHYVCHIKKEGRWVIYNDQKVCASEKPPKDLGYIYFYQRV AS" BASE COUNT 693 a 879 c 940 g 646 t ORIGIN 1 ccgtgtgtgg agaagctgct gccggtgtca tggcggacgt gagtgaggag gcgctgctgt 61 cagtattacc gacgatccgg gtccctaagg ctggagaccg ggtccacaaa gacgagtgcg 121 ccttctcctt cgacacgccg gagtctgaag ggggcctcta cgtctgtatg aacacgtttc 181 tgggctttgg gaaacagtat gtggagagac atttcaataa gaccggccag cgagtctact 241 tgcacctccg gcggacccgg cgcccgaaag aggaggaccc tgctacaggc actggagacc 301 caccccggaa gaagcccacg cggctggcta ttggtgttga aggcggattt gaccttagcg 361 aggagaagtt tgaattagac gaggatgtga agattgtcat tttgccagat tacctggaga 421 ttgcccggga tggactgggg ggactgcctg acattgtcag agatcgggtg accagtgcag 481 tggaggccct actgtcggcc gactcagcct cccgcaagca ggaggtgcag gcatgggatg 541 gggaagtacg gcaggtgtct aagcatgcct tcagcctcaa gcagttggac aaccctgctc 601 gaatccctcc ctgtggctgg aagtgctcca agtgtgacat gagagagaac ctgtggctca 661 acctgactga tggctccatc ctctgtgggc gacgctactt cgatggcagt gggggcaaca 721 accacgctgt ggagcactac cgagagacag gctacccgtt agctgtcaag ctgggcacca 781 tcacccctga tggagctgac gtgtactcat atgatgagga tgacatggtc ctggacccca 841 gcctggctga gcacctgtcc cacttcggca tcgacatgct gaagatgcag aagacagaca 901 agacgatgac tgagttggag atagacatga accagcggat tggtgaatgg gagctgatcc 961 aggagtcagg tgtgccactc aagcccctgt ttgggcctgg ctacacaggc atccggaacc 1021 tgggtaacag ctgctacctc aactctgtgg tccaggtgct cttcagcatc cctgacttcc 1081 agaggaagta tgtggataag ctggagaaga tcttccagaa tgccccgacg gaccctaccc 1141 aggatttcag cacccaggtg gccaagctgg gccatggcct tctctccggg gagtattcca 1201 agccagtacc ggagtcgggc gatggggagc gggtgccaga acagaaggaa gttcaagatg 1261 gcattgcccc tcggatgttc aaggccctca tcggcaaggg ccaccctgaa ttctccacca 1321 accggcagca ggatgcccag gagttcttcc ttcaccttat caacatggtg gagaggaatt 1381 gccggagctc tgaaaatcct aatgaagtgt tccgcttctt ggtggaggaa aagatcaagt 1441 gcctggccac agagaaggtg aagtacaccc agcgagttga ctacatcatg cagctgcctg 1501 tgcccatgga tgcagccctt aacaaagagg agcttctgga gtacgaggag aagaagcggc 1561 aagccgaaga ggagaagatg gcactgccag aactggttcg ggcccaggtg cccttcagct 1621 cttgcctgga ggcctacggg gcccctgagc aggtcgatga cttctggagc acggccctgc 1681 aggccaagtc agtagctgtc aagaccacac gatttgcctc attccctgac tacctggtca 1741 tccagatcaa gaagttcacc ttcggcttag actgggtgcc caagaaactg gatgtgtcca 1801 tcgagatgcc agaggagctc gacatctccc agttgagggg cacagggctg cagcccggag 1861 aggaggagct gccagacatt gccccacccc tggtcactcc ggatgagccc aaaggtagcc 1921 ttggtttcta tggcaacgaa gacgaagact ccttctgctc ccctcacttc tcctctccga 1981 catcgcccat gctggatgaa tcagtcatca tccagctggt ggagatggga ttccctatgg 2041 acgcctgccg caaagctgtc tactacacgg gcaacagcgg ggctgaggcc gccatgaact 2101 gggtcatgtc acacatggat gatccagatt ttgcaaaccc cctcatcctg cctggctcta 2161 gtgggccggg ctccacaagc gcagcagccg acccccctcc tgaggactgt gtgaccacca 2221 ttgtctccat gggcttctcc cgggaccagg ccttgaaagc gctgcgggcc acgaacaata 2281 gtttagaacg ggctgtggac tggatcttca gtcacattga cgacctggat gctgaagctg 2341 ccatggacat ctcagagggc cgctcagctg ccgactccat ctctgagtct gtgccagtgg 2401 gacctaaagt ccgggatggt cctggaaagt atcagctctt tgccttcatt agtcacatgg 2461 gcacctctac catgtgtggt cactacgtct gccacatcaa gaaagaaggc agatgggtga 2521 tctacaatga ccagaaagtg tgtgcctccg agaagccgcc caaggacctg ggctacatct 2581 acttctacca gagagtggcc agctaagagc ctgcctcacc ccttaccaat gagggcaggg 2641 gaagaccacc tggcatgagg gagaggggct gagggatgga cttcagcccc tctgctctgt 2701 accctttttc cttttgtccc cggcagcagg gaagaagctg gaggccgtgg gagaatggct 2761 gggcagagca gaggggcagc gatagactct ggggatggag caggacgggg acgggagggg 2821 ccggccacct gtctgtaagg agactttgtt gcttcccctg cccccggaat ccaaagtgct 2881 ctgcttctct gtgtcgcccc gcccagcccc ctggtgtgga gggaggggtc tcgtttgtgc 2941 gcgtgggtgt agctttgtgc atcctctccc agtggagcga tcacctgtgc ctcccctccc 3001 cctttgtttg cccctgtgtg gttggtcaag gagggatgtg agggaaatag ggaccccccg 3061 acttgccctc ctgcctcagt ctttccccca ccctgtctct tccttgtcct tctctggaaa 3121 atgccaaaat acacgatgtg aataaaagta caacggct // LOCUS HSDF1F01 994 bp RNA PRI 26-OCT-1994 DEFINITION H.sapiens mRNA for delta-subunit of mitochondrial F1F0 ATP-synthase (clone #1). ACCESSION X63422 S87916 S87918 NID g12585 KEYWORDS ATP synthase delta subunit; delta subunit; F1F0-ATP synthase; H+-translocation. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 994) AUTHORS Breen,G.A.M. TITLE Direct Submission JOURNAL Submitted (02-DEC-1991) G.A.M. Breen, The Univesity of Texas at Dallas, Biology Programs, FO 3.1, P.O.Box 830688, Richardson, TX 75083-0688, USA REFERENCE 2 (bases 1 to 994) AUTHORS Jordan,E.M. and Breen,G.A. TITLE Molecular cloning of an import precursor of the delta-subunit of the human mitochondrial ATP synthase complex JOURNAL Biochim. Biophys. Acta 1130 (1), 123-126 (1992) MEDLINE 92182007 COMMENT See also X63423. FEATURES Location/Qualifiers source 1..994 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="cDNA in lambda YES-R" /clone="#1" 5'UTR 1..83 sig_peptide 84..149 CDS 84..590 /EC_number="3.6.1.34" /note="delta subunit of mitochondrial F1F0 ATP synthase" /codon_start=1 /product="H(+)-transporting ATP synthase" /db_xref="PID:g12586" /db_xref="SWISS-PROT:P30049" /translation="MLPAALLRRPGLGRLVRHARAYAEAAAAPAAASGPNQMSFTFAS PTQVFFNGANVRQVDVPTLTGAFGILAAHVPTLQVLRPGLVVVHAEDGTTSKYFVSSG SIAVNADSSVQLLAEEAVTLDMLDLGAAKANLEKAQAELVGTADEATRAEIQIRIEAN EALVKALE" mat_peptide 150..587 /EC_number="3.6.1.34" /note="delta subunit of mitochondrial F1F0 ATP synthase" /product="H(+)-transporting ATP synthase" 3'UTR 591..977 polyA_signal 958..963 polyA_site 977 BASE COUNT 160 a 367 c 313 g 154 t ORIGIN 1 gtcctcctcg ccctccaggc cgcccgcgcc gcgccggagt ccgctgtccg ccagctaccc 61 gcttcctgcc gcccgccgct gccatgctgc ccgccgcgct gctccgccgc ccgggacttg 121 gccgcctcgt ccgccacgcc cgtgcctatg ccgaggccgc cgccgccccg gctgccgcct 181 ctggccccaa ccagatgtcc ttcaccttcg cctctcccac gcaggtgttc ttcaacggtg 241 ccaacgtccg gcaggtggac gtgcccacgc tgaccggagc cttcggcatc ctggcggccc 301 acgtgcccac gctgcaggtc ctgcggccgg ggctggtcgt ggtgcatgca gaggacggca 361 ccacctccaa atactttgtg agcagcggtt ccatcgcagt gaacgccgac tcttcggtgc 421 agttgttggc cgaagaggcc gtgacgctgg acatgttgga cctgggggca gccaaggcaa 481 acttggagaa ggcccaggcg gagctggtgg ggacagctga cgaggccacg cgggcagaga 541 tccagatccg aatcgaggcc aacgaggccc tggtgaaggc cctggagtag gcggtgcgta 601 cccggtgtcc cgaggcccgg ccaggggctg ggcagggatg ccaggtgggc ccagccagct 661 cctggggtcc cggccacctg gggaagccgc gcctgccaag gaggccacca gagggcagtg 721 caggcttctg cctgggcccc aggccctgcc tgtgttgaaa gctctgggga ctgggccagg 781 gaagctcctc ctcagctttg agctgtggct gccacccatg gggctctcct tccgcctctc 841 aagatccccc cagcctgacg ggccgcttac catcccctct gccctgcaga gccagccgcc 901 aaggttgacc tcagcttcgg agccacctct ggatgaactg cccccagccc ccgccccatt 961 aaagacccgg aagcctgaaa aaaaaaaaaa aaaa // LOCUS HSDGCR2 4398 bp RNA PRI 11-OCT-1995 DEFINITION H.sapiens mRNA for DGCR2. ACCESSION X84076 NID g809021 KEYWORDS adhesion protein; C-type lectin; CATCH 22; DGCR2 gene; DiGeorge syndrome; LDLR cys-rich repeats; transmembrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4398) AUTHORS Demczuk,S., Aledo,R., Zucman,J., Delattre,O., Desmaze,C., Dauphinot,L., Jalbert,P., Rouleau,G.A., Thomas,G. and Aurias,A. TITLE Cloning of a balanced translocation breakpoint in the DiGeorge syndrome critical region and isolation of a novel potential adhesion receptor gene in its vicinity JOURNAL Hum. Mol. Genet. 4 (4), 551-558 (1995) MEDLINE 95359957 REFERENCE 2 (bases 1 to 4398) AUTHORS Demczuk,S. TITLE Direct Submission JOURNAL Submitted (25-JAN-1995) S. Demczuk, INSERM U434, Genetique des Tumeurs, Institut Curie, 26 rue d'Ulm, 75231 Paris, FRANCE FEATURES Location/Qualifiers source 1..4398 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="foetus" /tissue_type="brain" /chromosome="22" /map="q11.2 chromosomal band" gene 147..1799 /gene="DGCR2 gene" CDS 147..1799 /gene="DGCR2 gene" /codon_start=1 /db_xref="PID:g809022" /translation="MVPKADSGAFLLLFLLVLTVTEPLRPELRCNPGQFACRSGTIQC IPLPWQCDGWATCEDESDEANCPEVTGEVRPHHGKEAVDPRQGRARGGDPSHFHAVNV AQPVRFSSFLGKCPTGWHHYEGTASCYRVYLSGENYWDAAQTCQRLNGSLATFSTDQE LRFVLAQEWDQPERSFGWKDQRKLWVGYQYVITGRNRSLEGRWEVAFKGSSEVFLPPD PIFASAMSENDNVFCAQLQCFHFPTLRHHDLHSWHAESCYEKSSFLCKRSQTCVDIKD NVVDEGFYFTPKGDDPCLSCTCHGGEPEMCVAALCERPQGCQQYRKDPKECCKFMCLD PDGNSLFDSMASGMRLVVSCISSFLILSLLLFMVHRLRQRRRERIESLIGANLHHFNL GRRIPGFDYGPDGFGTGLTPLHLSDDGEGGTFHFHDPPPPYTAYKYPDIGQPDDPPPP YEASIHPDSVFYDPADDDAFEPVEVSLPAPGDGGSEGALLRRLEQPLPTAGASLADLE DSADSSSALLVPPDPAQSGSTPAAEALPGGGRHSRSSLNTVV" mat_peptide 219..1796 /gene="DGCR2 gene" repeat_region 229..344 /note="LDLR cys-rich repeats" misc_feature 489..869 /gene="DGCR2 gene" /note="C-type lectin" polyA_signal 4377..4381 BASE COUNT 798 a 1303 c 1308 g 989 t ORIGIN 1 cgttttctct cctctgtgtt ggtcgcctgt ctcgctccct ctcatgctct ctttctctcc 61 tctattttgt ctctcactat tcgtgttttg tgtttgtttc taggtttggc tcaattgctg 121 ccccggggac gatgaacgga ggataaatgg tgcccaaggc agacagcggc gccttcctgc 181 tgctcttcct gctcgtgctc actgtcaccg agccgctgcg gccagagctg cggtgcaacc 241 ctgggcagtt tgcgtgtcgc agcggcacca tccagtgcat ccccctcccc tggcagtgtg 301 acggctgggc gacttgcgag gatgagagcg acgaagccaa ctgtccagaa gtgaccgggg 361 aggtgcgtcc tcatcatggg aaggaggctg tggatccgcg gcaggggcgg gccagaggag 421 gcgacccttc gcacttccac gcggtgaacg tggcgcagcc cgttcgcttc agcagtttcc 481 tagggaagtg cccgacaggg tggcaccact acgaaggcac ggccagctgc taccgggtct 541 acctgagcgg ggagaactac tgggatgccg cgcagacctg ccagcgcctg aatggctctc 601 tcgccacctt ctccactgac caggagctgc gctttgtcct ggcccaggaa tgggaccagc 661 ccgagcggag ctttggttgg aaggaccagc gcaagttgtg ggttggctat cagtatgtta 721 tcactggccg gaaccgctcc ttggaaggtc gctgggaggt ggcattcaaa ggctcttcag 781 aggtgttcct gcccccagac cccatctttg cctcggccat gtctgagaac gacaacgtgt 841 tctgtgccca gcttcagtgc ttccatttcc ccaccctgcg gcaccacgac ctccacagct 901 ggcacgccga gagctgctac gagaagtctt catttctgtg taaaagaagt caaacatgtg 961 ttgacatcaa ggacaacgtg gtggatgaag ggttctactt cacccctaag ggggacgacc 1021 catgcctgag ctgcacctgc catggagggg agcctgagat gtgtgtggct gctctctgtg 1081 agaggcccca gggctgccaa cagtaccgca aggaccccaa agagtgctgc aagttcatgt 1141 gtctggaccc agatggcaac agtctgtttg actccatggc cagcgggatg cgcctggtcg 1201 tcagctgcat ctcctccttc ctcatcctgt cactgctgct cttcatggtc caccggctgc 1261 gccagcggcg ccgggagcgc atcgagtccc tgattggagc aaacttgcac cacttcaacc 1321 tcggccgcag gatccctggc tttgattacg gcccagacgg gtttggcacg ggcctcacgc 1381 cgctgcatct ttctgacgac ggagagggtg ggactttcca tttccacgac cctccacctc 1441 cctacacggc atacaagtac ccggacatcg gccagcccga cgaccctccg ccgccctacg 1501 aggcctccat ccacccggac agtgtgttct atgaccctgc agacgatgat gcttttgagc 1561 ctgtggaggt cagcctgcca gcccctgggg atggtgggag tgaaggtgca ttactccggc 1621 gcctggagca gcctctgccc actgcggggg cctctctggc agacctggaa gactctgccg 1681 acagcagcag cgccctgctc gtgccccctg accctgccca gagcgggagc accccagctg 1741 cagaggcact gccagggggt ggccgccaca gccgcagctc cctcaatact gtggtgtaga 1801 cggcctggcc tgtaccccaa cggtctggga gcacctgtct gttgcagaaa acaccggtcc 1861 ctggggagac ttgaaaggcc cctgtcccag cctggacgcc acgcactgcc gcacgtcact 1921 ggcgggctcg cgtgtgtaca tagagaccac agcccgcctt ctgccaaaag aagtgatggc 1981 ctgcaccgag cttccttgag ggcttcagaa acatgcatag ctttggatca ctgtcttctc 2041 ctttataaat ggcagaagag tgacaaaatt cattcagacc gcacatgtta gaggcaggga 2101 atgaagaagg tactgtgggc catggccaca cctgatgcgt ttttggtggg cctacttggt 2161 gcagtgtgct gtccagagag acctgctgac ccagtctggg acaggcacag tgggagctgc 2221 cacagtgccc cttgctggcc gccctcagga gggggcctct ggaccgtcag tgtggcgtag 2281 gcagtgggtc tgcttcaggg aggcagcctc ttgactttgt cacaacggtt gcactgaaga 2341 tggcccccac aagcccagtt gtgaatatca aggtgaccct gcccctggct gggagctccc 2401 ctggggctct ggaacctgaa gccctgagaa ggagagcttg gaaggaggtt gagctcttca 2461 ctgtgtcttt ccatctgggc tctgcagccc agctctgtgg caggaggcct gaccccaccc 2521 catcagtccc tctcccagca ttgctgtgca tggctccctc aggaagaagc tctggagtgg 2581 ggccgaggcc ccagatgctc tgctggggtc tggggactga gctgcctcct gtctctccac 2641 tctggagccc tgggctcttg cctcctgttg aattgcccct gggcctgccc ccggccccat 2701 ttgtgccata aagggttgct tcattgcagg aggggtggct gaaaccacca tcctgggctg 2761 catctatctc cttaaagtcc actccttaca tcaccgccac tactgcagct cagtgcccag 2821 tggccgcatg cacacctccc ggcccctcct cagcagcgca ggggctgggg gcccttcctg 2881 gcaccctttt gcctgacgga ccctgttgcc tgctcctggg cagtggtttt gttgcctgac 2941 acagggcctt taacatttct gaggtttgga gattaagcgg gttctgtgcg atttttagca 3001 catccatatc ttttctacgt tgaatgtctg gatctttcct ttgtatgttt gtgtctctgt 3061 gtgtgtatgt ggccaagcgt ctgcaggtga gggtggcaca tgaaacttca cgggtcactg 3121 ggggcccact cttccagggt catcgatgca tcctgggctg tgctgaggca gtgctgtctc 3181 gctcagtctt tccagaccag cggggcagca cctgggaggc ctgtctgaag catgcagttc 3241 tgtggtgctg ctcggggatg gactccatgc acagtccact tcttgacaca ttctcacact 3301 gtggctggag acccattttt tccatcccat gtggaatgag aatcttgagt tgcccacaga 3361 cagacttgac ctttgtgcca gaactgtcca gagctgtgag ttttgttact gaattgcctg 3421 tccctgagct tggcctgctg gagcatgatg ctggaggctg tgtgcgggca ggcttgtgct 3481 ccccttggag aagggctgct acttaacaag gaagtcctgg ccacaagagt cccctactgg 3541 ctgcagagga cgaagcagga agcatgccgt ggatcccaca tgcactgggg cagggcagcc 3601 aggttccaga gcagcaggca tgtgcattat gggtgccagg tggccagtga gggcccgagg 3661 accacgcagc agagtccccc acgggcctga gcctgagagc tcagaggatt cctgccaggc 3721 ctcagaactg tgtgtgcggg acagggtggc tggcacgaga tgtgtgggac tgggacgctt 3781 cctttgggga ccagaggaac attaggggcc ggtcagacac gtagggggca gtgaggaaac 3841 ggggtaaagt ggaccatgca ggctgcagag ggtgggcctt gggctggccg gggttgctgg 3901 gtggcccctt ccccatgggc ctccacaagc actcgggcct ccacaagcac tcaggccact 3961 ggcacgttgg gccaggcaga aggtccacat ggcagggctg cctggaacac cgctgccact 4021 gtggctccag gaggcccttg ggagcatgag gagagctgga ctcgctcatc tgttcttgca 4081 ccaccatcca gaatgccccc tttgcaaggc ctgctgcagt ccccagtctg accagcaagt 4141 cccggggtca cccttgctga accttgtgct ccaggggtcc tcctcctcta ggaccagctt 4201 gccacggttt ctcagagccc aggtgccctc tgacctgcca tgcggaggtg ggatttgata 4261 ctgtacattg tcttgatgcc tgttttttta tgttttcatt aagggttttt agtttttggt 4321 tgggttgaca ctaacttttc ttaagatgct gtgagaatca tttcatccaa atgccaaata 4381 aacttgtgtt ccggaaaa // LOCUS HSDGCR6 1080 bp RNA PRI 03-MAY-1996 DEFINITION H.sapiens mRNA for DGCR6 protein. ACCESSION X96484 NID g1223740 KEYWORDS DGCR6 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1080) AUTHORS Demczuk,S., Thomas,G. and Aurias,A. TITLE Isolation of a novel gene from the DiGeorge syndrome critical region with homology to Drosophila gdl and to human LAMC1 genes JOURNAL Hum. Mol. Genet. 5 (5), 633-638 (1996) MEDLINE 96311558 REFERENCE 2 (bases 1 to 1080) AUTHORS Demczuk,S. TITLE Direct Submission JOURNAL Submitted (07-MAR-1996) S. Demczuk, Institut Curie, Section de Recherche, INSERM U434 Genetique des Tumeurs, 26 rue dUlm, Paris, 75231, France FEATURES Location/Qualifiers source 1..1080 /organism="Homo sapiens" /note="DiGeorge syndrome critical region" /db_xref="taxon:9606" /cell_line="HeLa" /dev_stage="adult" /chromosome="22" /map="q11.2" CDS 423..677 /codon_start=1 /product="DGCR6" /db_xref="PID:e228435" /db_xref="PID:g1223741" /translation="MDQKIVLELDRKVADQQSTLEKAGVAGFYVTTNPQELMLQMNLL ELIRKLQQRGCWAGKAALGLGGPWQLPAAQCDQKGSPVPP" BASE COUNT 230 a 334 c 342 g 174 t ORIGIN 1 cggtccggga catcccaaag gagccaaggc tgggtaccag cccagggggg acggtgcccg 61 gcagcaggag cgacactacc agctgctgtc ggcgttacag agcctggtga aggagttgcc 121 cagctcattc cagcagcgct tgtcctacac cacgctgagc gacctggccc tggcgcttct 181 cgacggcacc gtgttcgaaa tcgtgcaggg gctactggag gatccagcac ctcaccgaaa 241 agagcctgta caaccagcgc ctgcgcctac agaacgagca tcgagtgctc aggcaggcgc 301 tgcggcagaa gcaccaggaa gcccagcagg cctgccggcc ccataacctg cctgtgcttc 361 aggcggctca gcagcgagaa ctagaggcgg tggagcaccg gatccgtgag gagcagcggg 421 cgatggacca gaagatcgtc ctggagctgg accggaaggt ggctgaccag cagagcacac 481 tggagaaggc gggggtggct ggcttctacg tgaccaccaa cccacaggag ctgatgctgc 541 agatgaacct gctggaactc atccggaagc tgcagcagag gggctgctgg gcagggaagg 601 cagccctggg gctaggaggt ccctggcagt tgcctgctgc ccagtgtgac cagaaaggca 661 gccctgtccc accatagcca caggcagcag aagtctgggc agagttcatc ttcttgacct 721 ttggccactg ccttcccagc tgcccgcagg gggttccccc tgctgaggag agaccaggtg 781 gaccccagct gcctgtcacc cttcatctgg gacttgctgt caaaccctag gatagtctca 841 taaaggggag gctgggccag cctgctgctg tctgcttcag ggccaggcag agagtgaggc 901 tgggggttct cacaccttac tccaccgggc acatcccaac ctgcactggg gcccactcga 961 gtgcttgttc tggtctcagc cgctcccttg gcagctgcag cccccatgca gaagaggctc 1021 ccaggcccaa gctctgtgtg acccagagaa ataaagatgc ctcagtggcc caaaaaaaaa // LOCUS HSDGIGLY 3750 bp RNA PRI 03-AUG-1993 DEFINITION Human DSG1 mRNA for desmoglein type 1. ACCESSION X56654 NID g30505 KEYWORDS desmoglein; desmosomal glycoprotein; DSG1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3750) AUTHORS Buxton,R.S. TITLE Direct Submission JOURNAL Submitted (22-NOV-1990) R.S. Buxton, NATIONAL INSTITUTE FOR MEDICAL RESEARCH, THE RIDGEWAY, MILL HILL, LONDON NW7 1AA, ENGLAND REFERENCE 2 (bases 147 to 3604) AUTHORS Wheeler,G.N., Parker,A.E., Thomas,C.L., Ataliotis,P., Poynter,D., Arnemann,J., Rutman,A.J., Pidsley,S.C., Watt,F.M., Rees,D.A., Buxton,R.S. and Magee,A.I. TITLE Desmosomal glycoprotein DGI, a component of intercellular desmosome junctions, is related to the cadherin family of cell adhesion molecules JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88 (11), 4796-4800 (1991) MEDLINE 91271279 FEATURES Location/Qualifiers source 1..3750 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="keratinocytes" /clone="G4" /sub_clone="lambda gt11" /chromosome="18q12.1" mRNA 1..3750 /gene="DGS1" /evidence=experimental gene 1..3750 /gene="DGS1" CDS 78..3227 /gene="DGS1" /codon_start=1 /product="desmoglein type 1" /db_xref="PID:g30506" /db_xref="SWISS-PROT:Q02413" /translation="MDWSFFRVVAVLFIFLVVVEVNSEFRIQVRDYNTKNGTIKWHSI RRQKREWIKFAAACREGEDNSKRNPIAKIHSDCAANQQVTYRISGVGIDQPPYGIFVI NQKTGEINITSIVDREVTPFFIIYCRALNSMGQDLERPLELRVRVLDINDNPPVFSMA TFAGQIEENSNANTLVMILNATDADEPNNLNSKIAFKIIRQEPSDSPMFIINRNTGEI RTMNNFLDREQYGQYALAVRGSDRDGGADGMSAECECNIKILDVNDNIPYMEQSSYTI EIQENTLNSNLLEIRVIDLDEEFSANWMAVIFFISGNEGNWFEIEMNERTNVGILKVV KPLDYEAMQSLQLSIGVRNKAEFHHSIMSQYKLKASAISVTVLNVIEGPVFRPGSKTY VVTGNMGSNDKVGDFVATDLDTGRPSTTVRYVMGNNPADLLAVDSRTGKLTLKNKVTK EQYNMLGGKYQGTILSIDDNLQRTCTGTININIQSFGNDDRTNTEPNTKITTNTGRQE STSSTNYDTSTTSTDSSQVYSSEPGNGAKDLLSDNVHFGPAGIGLLIMGFLVLGLVPF LMICCDCGGAPRSAAGFEPVPECSDGAIHSWAVEGPQPEPRDITTVIPQIPPDNANII ECIDNSGVYTNEYGGREMQDLGGGERMTGFELTEGVKTSGMPEICQEYSGTLRRNSMR ECREGGLNMNFMESYFCQKAYAYADEDEGRPSNDCLLIYDIEGVGSPAGSVGCCSFIG EDLDDSFLDTLGPKFKKLADISLGKESYPDLDPSWPPQSTEPVCLPQETEPVVSGHPP ISPHFGTTTVISESTYPSGPGVLHPKPILDPLGYGNVTVTESYTTSDTLKPSVHVHDN RPASNVVVTERVVGPISGADLHGMLEMPDLRDGSNVIVTERVIAPSSSLPTSLTIHHP RESSNVVVTERVIQPTSGMIGSLSMHPELANAHNVIVTERVVSGAGVTGISGTTGISG GIGSSGLVGTSMGAGSGALSGAGISGGGIGLSSLGGTASIGHMRSSSDHHFNQTIGSA SPSTARSRITKYSTVQYSK" sig_peptide 99..134 /gene="DGS1" /note="hydrophobic core" mat_peptide 225..3227 /gene="DGS1" /product="desmoglein type 1" polyA_signal 3367..3372 /gene="DGS1" polyA_signal 3671..3676 /gene="DGS1" BASE COUNT 1170 a 755 c 840 g 985 t ORIGIN 1 tgagtgggag aaaggaaaag aacagagaag aacaaacaaa actcccttgg tcttggatgt 61 aagagaatcc agcagagatg gactggagtt tcttcagagt agttgcagtg ctgttcattt 121 ttctggtggt ggtagaagtt aacagtgaat tccgaatcca ggtaagagat tataacacta 181 aaaatggcac catcaaatgg cattcaatcc gaaggcagaa acgtgaatgg atcaagttcg 241 cagcagcctg tcgtgaaggt gaagacaact caaagaggaa cccaatcgcc aaaattcact 301 cagattgtgc tgcaaaccag caagttacat accgcatctc tggagtagga attgatcagc 361 caccatatgg gatctttgtc attaatcaga aaactggtga aattaatata acatccatag 421 ttgatcgaga ggtcactcct ttcttcatta tctactgccg agctctgaac tcaatgggcc 481 aagatttaga gaggcctcta gagctcagag tcagggtttt ggatataaat gacaaccctc 541 cagtgttttc aatggctaca tttgcaggac aaatagaaga aaattctaat gcaaatacac 601 tggtgatgat actcaatgct actgacgcag atgaaccgaa caatttgaac tcaaaaatag 661 ccttcaagat tataagacaa gaaccttcag attcaccaat gtttattatc aacagaaata 721 ctggagaaat tcgaacgatg aataattttc tagacagaga gcaatacggc cagtatgctc 781 ttgctgtaag aggctctgac cgagatggtg gggcagatgg catgtcagcg gaatgtgagt 841 gcaacattaa aatcctcgat gtcaatgata atatccctta catggaacag tcttcatata 901 ccatagaaat tcaagaaaat actctaaatt caaatttgct cgagattaga gtaattgatt 961 tggatgaaga gttctcagct aactggatgg cagtaatttt ctttatctct ggaaatgaag 1021 gaaattggtt tgagatagaa atgaatgaaa gaacaaatgt gggaatttta aaggttgtta 1081 agcccttaga ttatgaagct atgcagagtc tgcaactcag tattggtgtc agaaataaag 1141 ctgaatttca tcattcaatt atgtctcaat ataaactgaa agcatctgca atttctgtga 1201 ctgtgttaaa tgtaattgaa ggcccagtgt ttcgtccagg ttcaaagaca tatgttgtaa 1261 ctggtaatat gggatcaaat gataaagtgg gagactttgt agctactgac ctggacacag 1321 gtagaccttc aacgactgtt aggtatgtaa tgggaaataa tccagctgac ctgctagctg 1381 ttgattcaag aacaggcaaa ctcactttga aaaataaagt taccaaggaa cagtacaata 1441 tgctcggagg aaaataccaa ggaacgattc tctctataga tgataatctt caaagaactt 1501 gcactggtac aattaatatt aacattcaaa gttttggtaa tgacgacagg actaatacag 1561 agccgaacac taaaattact accaatactg gcagacaaga aagtacttct tccactaact 1621 atgataccag cacaacttct actgactcta gccaagtata ttcttctgaa cccggaaacg 1681 gagccaaaga tttgttatca gacaatgtac attttggtcc tgctggcatt ggactcctca 1741 tcatgggatt cttggtctta ggattggtcc catttttgat gatctgttgt gattgtggag 1801 gtgctcctcg tagtgcagct ggctttgagc ctgttcccga atgttcagat ggagcaattc 1861 attcatgggc agtagaagga ccacagcctg aacccaggga tataaccact gtcataccac 1921 aaataccacc tgataacgca aatataattg aatgcattga caactcagga gtttatacaa 1981 atgagtatgg tggcagagaa atgcaagatc tgggaggagg agagagaatg acaggatttg 2041 aactaacaga gggagttaaa acttcaggaa tgcctgagat atgtcaagaa tactctggaa 2101 cattaagaag aaattctatg agggaatgta gagaaggagg tctgaatatg aatttcatgg 2161 aaagctactt ctgtcagaaa gcatatgctt acgcagatga agatgaagga cgcccatcta 2221 atgactgttt gctcatatat gacatcgaag gtgtaggttc ccctgctggc tctgtgggtt 2281 gttgtagctt cattggagaa gacctggatg acagcttctt ggataccctg ggacctaaat 2341 ttaagaagtt ggcagacatc agcctaggaa aagaatcata tccagacctt gatccttctt 2401 ggccaccaca aagcactgaa ccagtttgcc ttcctcagga aacagagccc gttgttagtg 2461 gacacccacc aatctcccca catttcggca ctaccacagt aatttctgag agcacctatc 2521 cctcgggacc tggtgtactg catcctaagc ctattctcga tcctctgggc tatggtaatg 2581 tcactgtgac cgagtcttac accacctctg acactctgaa gccctctgtg cacgttcacg 2641 ataaccgacc agcatcaaac gtggtagtga cagagagagt ggtcggccca atctctggcg 2701 ctgatttgca tggaatgtta gagatgcctg acttgcgaga tgggtcgaat gttatagtga 2761 cagaaagggt aatagcacca agctctagtc tacccacctc tctgactatc catcatccta 2821 gagagtcttc aaatgtggta gtgacagaaa gagtaatcca accaacttcc ggcatgatag 2881 gtagtctgag tatgcacccc gagttagcca atgcccacaa tgtcattgtg acagagaggg 2941 ttgtttctgg tgctggcgta actggaatta gtggcaccac tgggatcagc ggtggcatag 3001 gcagcagtgg cctggttggc accagcatgg gtgctgggag cggtgccctg agtggagctg 3061 gcataagtgg tggtggcatt ggcctgagca gcttgggagg gacagccagc attggccaca 3121 tgaggagttc ctctgaccat cactttaacc aaaccattgg gtccgcctcc cctagcacag 3181 ctcgaagtcg aatcacaaag tatagtaccg tgcaatatag caagtagtca ggaccccagc 3241 tcactttttc atagtcattg tggtttagat ccaattccca ccactaaaaa actaacaatg 3301 tgatttataa cgcacaactt cgtgctcagg tcatctagga gcaaggtgag aaatcacaat 3361 gagaaaaata aatggaaaca ccactgctag gggagagctc tccttagcat tcataaactt 3421 ttctcttata ttaggactaa ggaactaaaa cttgaggcag agtcttcttt gtgcctgagt 3481 ggcctgtagt ccatctccag catgtaactg gccttacgat ggcaattggc atcattctcc 3541 ttgctctgtt ttgcttttcc atatagctcg agcaaaattc aaaaagaact aaatatgcaa 3601 tatatgttca tatctatggg aaaaatctaa aatgtgtgcc agatgccctg ttggtttcac 3661 agataacata aataaaaatt caaccacaga tttatacaag ggttaaccat tttttttaag 3721 tttgactaca tagtcaagtc cacggaattc // LOCUS HSDGK 1025 bp RNA PRI 03-AUG-1996 DEFINITION H.sapiens mRNA for deoxyguanosine kinase. ACCESSION X97386 NID g1480197 KEYWORDS deoxyguanosine kinase; dgk gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1025) AUTHORS Wang,L., Hellman,U. and Eriksson,S. TITLE Cloning and expression of human mitochondrial deoxyguanosine kinase cDNA JOURNAL FEBS Lett. 390 (1), 39-43 (1996) MEDLINE 96314545 REFERENCE 2 (bases 1 to 1025) AUTHORS Wang,L. TITLE Direct Submission JOURNAL Submitted (17-APR-1996) L. Wang, Swedish University of Agriculture Science, Veterinary Medical Chemistry, BMC, Box 575, UPPSALA, S-751 23, SWEDEN FEATURES Location/Qualifiers source 1..1025 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" gene 12..794 /gene="dGK" CDS 12..794 /gene="dGK" /EC_number="2.7.1.113" /codon_start=1 /product="deoxyguanosine kinase" /db_xref="PID:e242999" /db_xref="PID:g1480198" /translation="MAKSPLEGVSSSRGLHAGRGPRRLSIEGNIAVGKSTFVKLLTKT YPEWHVATEPVATWQNIQAAGTQKACTAQSLGNLLDMMYREPARWSYTFQTFSFLSRL KVQLEPFPEKLLQARKPVQIFERSVYSDRYIFAKNLFENDSLSDIEWHIYQDWHSFLL WEFASRITLHGFIYLQASPQVCLKRLYQRAREEEEGIELAYLEQLHGQHEAWLIHKTT KLHFEALMNIPVLVLDVNDDFSEEVTKQEDLMREVNTFVKNL" BASE COUNT 272 a 250 c 238 g 263 t 2 others ORIGIN 1 ccttcagttc catggccaag agcccactcg agggcgtttc ctcctccaga ggcctgcacg 61 cggggcgcgg gccccgaagg ctctccatcg aaggcaacat tgctgtggga aagtccacgt 121 ttgtgaagtt actcacgaaa acttacccag aatggcacgt agctacagaa cctgtagcaa 181 catggcagaa tatccaggct gctggcaccc aaaaagcctg cactgcccaa agtcttggaa 241 acttgctgga tatgatgtac cgggagccag cacgatggtc ctacacattc cagacatttt 301 cctttttgag ccgcctgaaa gtacagctgg agcccttccc tgagaaactc ttacaggcca 361 ggaagccagt acagatcttt gagaggtctg tgtacagtga caggtatatc tttgcaaaga 421 atctttttga aaatgattcc ctcagtgaca tcgagtggca tatctatcag gactggcatt 481 cttttctcct gtgggagttt gccagccgga tcacattaca tggcttcatc tacctccagg 541 cttctcccca ggtttgtttg aagagactgt accagagggc cagggaggag gaggaaggaa 601 ttgagctggc ctatctagag cagctgcatg gccaacacga agcctggctt attcacaaga 661 caacgaagct ccactttgag gctctgatga acattccagt gctggtgttg gatgtcaatg 721 atgatttttc tgaggaagta accaaacaag aagacctcat gagagaggta aacacctttg 781 taaagaatct gtaaccaata ccatgaagtt caggctgtga tctgggctcc ctgactttct 841 gaagctagaa aaatgttgtg tctcccaacc acctttccat ccnnagcccc tctcatccct 901 ggagcactct gccgctcaag agctggtttg ttaattattg ttagactttg ccattgttgc 961 cattgttttc ttttgtacct gaagcatttt gaaaataaag tttacttaag ttaaaaaaaa 1021 aaaaa // LOCUS HSDHEAST 1064 bp RNA PRI 30-JUN-1993 DEFINITION H.sapiens mRNA for dehydroepiandrosterone sulphotransferase. ACCESSION X70222 S53620 NID g312804 KEYWORDS dehydroepiandrosterone sulphotransferase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1064) AUTHORS Comer,K.A., Falany,J.L. and Falany,C.N. TITLE Cloning and expression of human liver dehydroepiandrosterone sulphotransferase JOURNAL Biochem. J. 289 (Pt 1), 233-240 (1993) MEDLINE 93143674 FEATURES Location/Qualifiers source 1..1064 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 53..910 /codon_start=1 /product="dehydroepiandrosterone sulphotransferase" /db_xref="PID:g312805" /db_xref="SWISS-PROT:Q06520" /translation="MSDDFLWFEGIAFPTMGFRSETLRKVRDEFVIRDEDVIILTYPK SGTNWLAEILCLMHSKGDAKWIQSVPIWERSPWVESEIGYTALSESESPRLFSSHLPI QLFPKSFFSSKAKVIYLMRNPRDVLVSGYFFWKNMKFIKKPKSWEEYFEWFCQGTVLY GSWFDHIHGWMPMREEKNFLLLSYEELKQDTGRTIEKICQFLGKTLEPEELNLILKNS SFQSMKENKMSNYSLLSVDYVVDKAQLLRKGVSGDWKNHFTVAQAEDFDKLFQEKMAD LPRELFPWE" BASE COUNT 331 a 211 c 240 g 282 t ORIGIN 1 gaattcggca cgaggttgaa accctcacac cacgcaggaa gaggtcatca tcatgtcgga 61 cgatttctta tggtttgaag gcatagcttt ccctactatg ggtttcagat ccgaaacctt 121 aagaaaagta cgtgatgagt tcgtgataag ggatgaagat gtaataatat tgacttaccc 181 caaatcagga acaaactggt tggctgagat tctctgcctg atgcactcca agggggatgc 241 caagtggatc caatctgtgc ccatctggga gcgatcaccc tgggtagaga gtgagattgg 301 gtatacagca ctcagtgaaa gcgagagtcc acgtttattc tcctcccacc tccccatcca 361 gttattcccc aagtctttct tcagttccaa ggccaaggtg atttatctca tgagaaatcc 421 cagagatgtt ttggtgtctg gttatttttt ctggaaaaac atgaagttta ttaagaaacc 481 aaagtcatgg gaagaatatt ttgaatggtt ttgtcaagga actgtgctat atgggtcatg 541 gtttgaccac attcatggct ggatgcccat gagagaggag aaaaacttcc tgttactgag 601 ttatgaggag ctgaaacagg acacaggaag aaccatagag aagatctgtc aattcctggg 661 aaagacgtta gaacccgaag aactgaactt aattctcaag aacagctcct ttcagagcat 721 gaaagaaaac aagatgtcca attattccct cctgagtgtt gattatgtag tggacaaagc 781 acaacttctg agaaaaggtg tatctgggga ctggaaaaat cacttcacag tggcccaagc 841 tgaagacttt gataaattgt tccaagagaa gatggcagat cttcctcgag agctgttccc 901 atgggaataa cgtccaaaac actctggatc ttatatggag aatgacattg attctcctgt 961 ccttgtacat gtacctgact ggggtcattg tgtaagactt attattttat cctgaaacct 1021 ttataaacaa acctctgcaa aaaaaaaaaa aaaaaaaact cgag // LOCUS HSDHFR 564 bp RNA PRI 07-JAN-1995 DEFINITION Human mRNA for dihydrofolic acid reductase (coding region only). ACCESSION V00507 NID g30774 KEYWORDS complementary DNA; dihydrofolate reductase; reductase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 564) AUTHORS Masters,J.N. and Attardi,G. TITLE The nucleotide sequence of the cDNA coding for the human dihydrofolic acid reductase JOURNAL Gene 21 (1-2), 59-63 (1983) MEDLINE 83183667 FEATURES Location/Qualifiers source 1..564 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..>564 /note="messenger RNA of DHFR" CDS 1..564 /note="coding sequence of DHFR (1 is 1st base in codon) (561 is 3rd base in codon)" /codon_start=1 /db_xref="PID:g30775" /db_xref="SWISS-PROT:P00374" /translation="MVGSLNCIVAVSQNMGIGKNGDLPWPPLRNEFRYFQRMTTTSSV EGKQNLVIMGKKTWFSIPEKNRPLKGRINLVLSRELKEPPQGAHFLSRSLDDALKLTE QPELANKVDMVWIVGGSSVYKEAMNHPGHLKLFVTRIMQDFESDTFFPEIDLEKYKLL PEYPGVLSDVQEEKGIKYKFEVYEKND" BASE COUNT 182 a 108 c 127 g 147 t ORIGIN 1 atggttggtt cgctaaactg catcgtcgct gtgtcccaga acatgggcat cggcaagaac 61 ggggacctgc cctggccacc gctcaggaat gaattcagat atttccagag aatgaccaca 121 acctcttcag tagaaggtaa acagaatctg gtgattatgg gtaagaagac ctggttctcc 181 attcctgaga agaatcgacc tttaaagggt agaattaatt tagttctcag cagagaactc 241 aaggaacctc cacaaggagc tcattttctt tccagaagtc tagatgatgc cttaaaactt 301 actgaacaac cagaattagc aaataaagta gacatggtct ggatagttgg tggcagttct 361 gtttataagg aagccatgaa tcacccaggc catcttaaac tatttgtgac aaggatcatg 421 caagactttg aaagtgacac gttttttcca gaaattgatt tggagaaata taaacttctg 481 ccagaatacc caggtgttct ctctgatgtc caggaggaga aaggcattaa gtacaaattt 541 gaagtatatg agaagaatga ttaa // LOCUS HSDHPR 1216 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for dihydropteridine reductase (hDHPR). ACCESSION X04882 NID g30818 KEYWORDS dihydropteridine reductase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1216) AUTHORS Dahl,H.H.M. TITLE Direct Submission JOURNAL Submitted (20-APR-1987) Dahl H.-H.M., Murdoch Institute, Royal Children's Hospital, Melbourne, Australia 3052 REFERENCE 2 (bases 1 to 1216) AUTHORS Dahl,H.H., Hutchison,W., McAdam,W., Wake,S., Morgan,F.J. and Cotton,R.G. TITLE Human dihydropteridine reductase: characterisation of a cDNA clone and its use in analysis of patients with dihydropteridine reductase deficiency JOURNAL Nucleic Acids Res. 15 (5), 1921-1932 (1987) MEDLINE 87174727 REFERENCE 3 (bases 176 to 177) AUTHORS Dahl,H.H. TITLE Direct Submission JOURNAL Submitted (21-JUL-1987) to the EMBL/GenBank/DDBJ databases COMMENT Data kindly reviewed (21-JUL-1987) by Dahl H.HM. FEATURES Location/Qualifiers source 1..1216 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="lambda hDHPR 19" CDS 25..759 /note="hDHPR (AA1-244)" /codon_start=1 /db_xref="PID:g30819" /db_xref="SWISS-PROT:P09417" /translation="MAAAAAAGEARRVLVYGGRGALGSRCVQAFRARNWWVASVDVVE NEEASATIIVKMTDSFTEQADQVTAEVGKLLGEEKVDAILCVAGGWAGGNAKSKSLFK NCDLMWKQSIWTSTISSHLATKHLKEGGLLTLAGAKAALDGTPGMIGYGMAKGAVHQL CQSLAGKNSGMPPGAAAIAVLPVTLDTPMNRKSMPEADFSSWTPLEFLVETFHDWITG KNRPSSGSLIQVVTTEGRTELTPAYF" old_sequence 176..177 /note="cg was gc in [1] & [2]" /citation=[2] polyA_site 1216 /note="polyA site" BASE COUNT 266 a 285 c 354 g 311 t ORIGIN 1 cggagccggg ctggcaggag caggatggcg gcggcggcgg ctgcaggcga ggcgcgccgg 61 gtgctggtgt acggcggcag gggcgctctg ggttctcgat gcgtgcaggc ttttcgggcc 121 cgcaactggt gggttgccag cgttgatgtg gtggagaatg aagaggccag cgctacgatc 181 attgttaaaa tgacagactc gttcactgag caggctgacc aggtgactgc tgaggttgga 241 aagctcttgg gtgaagagaa ggtggatgca attctttgcg ttgctggagg atgggccggg 301 ggcaatgcca aatccaagtc tctctttaag aactgtgacc tgatgtggaa gcagagcata 361 tggacatcga ccatctccag ccatctggct accaagcatc tcaaggaagg aggcctcctg 421 accttggctg gcgcaaaggc tgccctggat gggactcctg gtatgatcgg gtacggcatg 481 gccaagggtg ctgttcacca gctctgccag agcctggctg ggaagaacag cggcatgccg 541 cccggggcag ccgccatcgc tgtgctcccg gttaccctgg ataccccgat gaacaggaaa 601 tcaatgcctg aggctgactt cagctcctgg acacccttag aattcctagt tgaaactttc 661 catgactgga tcacagggaa aaaccgaccg agctcaggaa gcctaatcca ggtggtaacc 721 acagaaggaa ggacggaact caccccagca tatttttagg cctcatctca gtgcctatga 781 ggggcctgcc agaaaagtca ctaacctgtc tcagtgtggc cttgtccagc cttgtgtttt 841 ctgtaacccc tgtttgtggt acgagataat gagtcctatt tttctctcac ataatatgca 901 tttgctctcc taggacagtg taatacattt atgtgaagta aagacatgcg agactggtgg 961 cctgcaaata gcatccgtca atctgtgtta actgcatagg gagggctctg catagcacct 1021 gctatagcgg tgtcatgttg gatcgctttt gtgactgttc atctgtcctt gacagtggct 1081 gtcatcttga ctactttgtt gatttgttgg tattggggac attttaaagg ctgagttatt 1141 tttgaatgtc atgtttatgt catagacgta gttttcgcat ccttgaatta aactgcctta 1201 actccttttg tggtat // LOCUS HSDINFIG 2608 bp RNA PRI 26-MAY-1992 DEFINITION H.sapiens mRNA for IFN-inducible gamma2 protein. ACCESSION X59892 NID g30820 KEYWORDS gamma2 protein; interferon inducible protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2608) AUTHORS Fleckner,J. TITLE Direct Submission JOURNAL Submitted (05-JUN-1991) J. Fleckner, Dept of Mol Biology, Univ of Aarhus, CF Moellers Alle 130, DK 8000 Aarhus C, DENMARK REFERENCE 2 (bases 1 to 2608) AUTHORS Fleckner,J., Rasmussen,H.H. and Justesen,J. TITLE Human interferon gamma potently induces the synthesis of a 55-kDa protein (gamma 2) highly homologous to rabbit peptide chain release factor and bovine tryptophanyl-tRNA synthetase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88 (24), 11520-11524 (1991) MEDLINE 92107982 FEATURES Location/Qualifiers source 1..2608 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="AMA amniotic epithelial" /clone_lib="lambda gt10" /clone="gamma2 lambda1" CDS 112..1527 /codon_start=1 /product="471 aa polypeptide (gamma2)" /db_xref="PID:g30821" /db_xref="SWISS-PROT:P23381" /translation="MPNSEPASLLELFNSIATQGELVRSLKAGNASKDEIDSAVKMLV SLKMSYKAAAGEDYKADCPPGNPAPTSNHGPDATEAEEDFVDPWTVQTSSAKGIDYDK LIVRFGSSKIDKELINRIERATGQRPHHFLRRGIFFSHRDMNQVLDAYENKKPFYLYT GRGPSSEAMHVGHLIPFIFTKWLQDVFNVPLVIQMTDDEKYLWKDLTLDQAYSYAVEN AKDIIACGFDINKTFIFSDLDYMGMSSGFYKNVVKIQKHVTFNQVKGIFGFTDSDCIG KISFPAIQAAPSFSNSFPQIFRDRTDIQCLIPCAIDQDPYFRMTRDVAPRIGYPKPAL LHSTFFPALQGAQTKMSASDPNSSIFLTDTAKQIKTKVNKHAFSGGRDTIEEHRQFGG NCDVDVSFMYLTFFLEDDDKLEQIRKDYTSGAMLTGELKKALIEVLQPLIAEHQARRK EVTDEIVKEFMTPRKLSFDFQ" polyA_signal 2588..2593 polyA_site 2608 BASE COUNT 685 a 663 c 621 g 639 t ORIGIN 1 gaccagtggc cacctctgca gtgtcttcca caacctggtc ttgactcgtc tgctgaacaa 61 atcctctgac ctcaggccgg ctgtgaacgt agttcctgag agatagcaaa catgcccaac 121 agtgagcccg catctctgct ggagctgttc aacagcatcg ccacacaagg ggagctcgta 181 aggtccctca aagcgggaaa tgcgtcaaag gatgaaattg attctgcagt aaagatgttg 241 gtgtcattaa aaatgagcta caaagctgcc gcgggggagg attacaaggc tgactgtcct 301 ccagggaacc cagcacctac cagtaatcat ggcccagatg ccacagaagc tgaagaggat 361 tttgtggacc catggacagt acagacaagc agtgcaaaag gcatagacta cgataagctc 421 attgttcggt ttggaagtag taaaattgac aaagagctaa taaaccgaat agagagagcc 481 accggccaaa gaccacacca cttcctgcgc agaggcatct tcttctcaca cagagatatg 541 aatcaggttc ttgatgccta tgaaaataag aagccatttt atctgtacac gggccggggc 601 ccctcttctg aagcaatgca tgtaggtcac ctcattccat ttattttcac aaagtggctc 661 caggatgtat ttaacgtgcc cttggtcatc cagatgacgg atgacgagaa gtatctgtgg 721 aaggacctga ccctggacca ggcctatagc tatgctgtgg agaatgccaa ggacatcatc 781 gcctgtggct ttgacatcaa caagactttc atattctctg acctggacta catggggatg 841 agctcaggtt tctacaaaaa tgtggtgaag attcaaaagc atgttacctt caaccaagtg 901 aaaggcattt tcggcttcac tgacagcgac tgcattggga agatcagttt tcctgccatc 961 caggctgctc cctccttcag caactcattc ccacagatct tccgagacag gacggatatc 1021 cagtgcctta tcccatgtgc cattgaccag gatccttact ttagaatgac aagggacgtc 1081 gcccccagga tcggctatcc taaaccagcc ctgttgcact ccaccttctt cccagccctg 1141 cagggcgccc agaccaaaat gagtgccagc gaccccaact cctccatctt cctcaccgac 1201 acggccaagc agatcaaaac caaggtcaat aagcatgcgt tttctggagg gagagacacc 1261 atcgaggagc acaggcagtt tgggggcaac tgtgatgtgg acgtgtcttt catgtacctg 1321 accttcttcc tcgaggacga cgacaagctc gagcagatca ggaaggatta caccagcgga 1381 gccatgctca ccggtgagct caagaaggca ctcatagagg ttctgcagcc cttgatcgca 1441 gagcaccagg cccggcgcaa ggaggtcacg gatgagatag tgaaagagtt catgactccc 1501 cggaagctgt ccttcgactt tcagtagcac tcgttttaca tatgcttata aaagaagtga 1561 tgtatcagta atgtatcaat aatcccagcc cagtcaaagc accgccacct gtaggcttct 1621 gtctcatggt aattactggg cctggcctct gtaagcctgt gtatgttatc aatactgttt 1681 cttcctgtga gttccattat ttctatctct tatgggcaaa gcattgtggg taattggtgc 1741 tggctaacat tgcatggtcg gatagagaag tccagctgtg agtctctccc caaagcagcc 1801 ccacagtgga gcctttggct ggaagtccat gggccaccct gttcttgtcc atggaggact 1861 ccgagggttc caagtatact cttaagaccc actctgttta aaaatatata ttctatgtat 1921 gcgtatatgg aattgaaatg tcattattgt aacctagaaa gtgctttgaa atattgatgt 1981 gggggaggtt tattgagcac aagatgtatt tcagcccatg ccccctccca aaaagaaatt 2041 gataagtaaa agcttcgtta tacatttgac taagaaatca cccagcttta aagctgcttt 2101 taacaatgaa gattgaacag agttcagcaa ttttgattaa attaagactt gggggtgaaa 2161 ctttccagtt tactgaactc cagaccatgc atgtagtcca ctccagaaat catgctcgct 2221 tcccttggca caccagtgtt ctcctgccaa atgaccctag accctctgtc ctgcagagtc 2281 agggtggctt tcccctgcat gtgtccgatg ccaagagtcc tggccttccg agatgcttca 2341 ttttgaccct tggctgcagt ggaagtcagc acagagcagt gccctggctg tgtcctggac 2401 gggtggactt agctagggag aaagtcgagc agcagccctc gaggccctca cagatgtcta 2461 gcaggcctca tttcatcacg cagcatgtgc aggcctggaa gagcaaagcc aaatctcagg 2521 gaagtccttg gttgatgtat ctgggttcct ctggagcact ctgccctcct gtcacccagt 2581 agagttaaat aaacttcctt ggctcctg // LOCUS HSDING 1625 bp RNA PRI 17-JAN-1997 DEFINITION H.sapiens mRNA for dinG gene. ACCESSION Y10571 NID g1785642 KEYWORDS dinG gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1625) AUTHORS Dyer,M.J.S., Abdul-Rauf,M., Heward,J.M., Cui,X., Cleary,M.L. and Catovsky,D. TITLE Interactions of BCL7A with novel ring finger proteins JOURNAL Unpublished REFERENCE 2 (bases 1 to 1625) AUTHORS Dyer,M.J.S. TITLE Direct Submission JOURNAL Submitted (15-JAN-1997) M.J.S. Dyer, Institute of Cancer Research, Academic Heamatology and Cytogenetics, Haddow Laboratories, Sutton, Surrey SM2 5NG, UK FEATURES Location/Qualifiers source 1..1625 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="brain" /chromosome="12" /map="p12" /germline gene 13..1023 /gene="dinG" CDS 13..1023 /gene="dinG" /codon_start=1 /evidence=experimental /db_xref="PID:e291103" /db_xref="PID:g1785643" /translation="MSQAVQTNGTQPLSKTWELSLYELQRTPQEAITDGLEIVVSPRS LHSELMCPICLDMLKNTMTTKECLHRFCADCIITALRSGNKECPTCRKKLVSKRSLRP DPNFDALISKIYPSRDEYEAHQERVLARINKHNNQQALSHSIEEGLKIQAMNRLQRGK KQQIENGSGAEDNGDSSHCSNASTHSNQEAGPSNKRTKTSDDSGLELDNNNAAMAIDP VMDGASEIELVFRPHPTLMEKDDSAQTRYIKTSGNATVDHLSKYLAVRLALEELRSKG ESNQMNLDTASEKQYTIYIATASGQFTVLNGSFSLELVSEKYWKVNKPMELYYAPTKE HK" BASE COUNT 518 a 314 c 304 g 489 t ORIGIN 1 gcaggagccg caatgtctca ggctgtgcag acaaacggaa ctcaaccatt aagcaaaaca 61 tgggaactca gtttatatga gttacaacga acacctcagg aggcaataac agatggctta 121 gaaattgtgg tttcacctcg aagtctacac agtgaattaa tgtgcccaat ttgtttggat 181 atgttgaaga acaccatgac tacaaaggag tgtttacatc gtttttgtgc agactgcatc 241 atcacagccc ttagaagtgg caacaaagaa tgtcctacct gtcggaaaaa actagtttcc 301 aaaagatcac taaggccaga cccaaacttt gatgcactca tcagcaaaat ttatccaagt 361 cgtgatgagt atgaagctca tcaagagaga gtattagcca ggatcaacaa gcacaataat 421 cagcaagcac tcagtcacag cattgaggaa ggactgaaga tacaggccat gaacagactg 481 cagcgaggca agaaacaaca gattgaaaat ggtagtggag cagaagataa tggtgacagt 541 tcacactgca gtaatgcatc cacacatagc aatcaggaag caggccctag taacaaacgg 601 accaaaacat ctgatgattc tgggctagag cttgataata acaatgcagc aatggcaatt 661 gatccagtaa tggatggtgc tagtgaaatt gaattagtat tcaggcctca tcccacactt 721 atggaaaaag atgacagtgc acagacgaga tacataaaga cttctggtaa cgccactgtt 781 gatcacttat ccaagtatct ggctgtgagg ttagctttag aagaacttcg aagcaaaggt 841 gaatcaaacc agatgaacct tgatacagcc agtgagaagc agtataccat ttatatagca 901 acagccagtg gccagttcac tgtattaaat ggctcttttt ctttggaatt ggtcagtgag 961 aaatactgga aagtgaacaa acccatggaa ctttattacg cacctacaaa ggagcacaaa 1021 tgagccttta aaaaccaatt ctgagactga acttttttat agcctatttc tttaatatta 1081 aagatgtact ggcattactt ttatggacag atcttggata tgttgttcaa ttttctttct 1141 gagccagaat agtttacgct attcaaatct tttccccctt atttaagatt tcctttttgg 1201 aagggactgc aattattcag tatttttttc tttcctttaa aaaaatatat ctgaagtttc 1261 ttgtgttttt tttttttccc cacaaagtgt gtttccactt ggagcaccat tttgacccag 1321 gaatttttca tagtttctgt attcttataa gattcagtgg ctgtcctttt cctgctcccc 1381 tcaaaagatt tttagtcata cagaatgtta aatattatgt attctgacct tttttttttc 1441 ccccggagtc ttggtatatt tatagttttc tatataaact gtagtatctt catgaagacc 1501 caaggctcaa atttactgtc cttaaaaaca attctcatag gattattctt ttcatggtat 1561 cttcttccat aatatctcat tttaaaaaga agttctatat gaactttttg tccattgtca 1621 tgcaa // LOCUS HSDISPRO 2187 bp RNA PRI 04-SEP-1997 DEFINITION Homo sapiens mRNA for disintegrin-protease. ACCESSION Y13323 NID g2370106 KEYWORDS disintegrin; protease. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2187) AUTHORS Mueller,C.G., Rissoan,M.C., Salinas,B., Ait-Yahia,S., Ravel,O., Bridon,J.M., Briere,F., Lebecque,S. and Liu,Y.J. TITLE Polymerase chain reaction selects a novel disintegrin proteinase from CD40-activated germinal center dendritic cells JOURNAL J. Exp. Med. 186 (5), 655-663 (1997) MEDLINE 97419183 REFERENCE 2 (bases 1 to 2187) AUTHORS Mueller,C.G.F. TITLE Direct Submission JOURNAL Submitted (20-MAY-1997) C.G.F. Mueller, Schering-Plough, Immunological Research, 27, Chemin Des Peupliers, Bp11, 69571 Dardilly, FRANCE FEATURES Location/Qualifiers source 1..2187 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="dendritic cells" CDS 61..1473 /codon_start=1 /product="disintegrin-protease" /db_xref="PID:e332729" /db_xref="PID:g2370107" /translation="MLRGISQLPAVATMSWVLLPVLWLIVQTQAIAIKQTPELTLHEI VCPKKLHILHKREIKNNQTEKHGKEERYEPEVQYQMILNGEEIILSLQKTKHLLGPDY TETLYSPRGEEITTKPENMEHCYYKGNILNEKNSVASISTCDGLRGYFTHHHQRYQIK PLKSTDEKEHAVFTSNQEEQDPANHTCGVKSTDGKQGPIRISRSLKSPEKEDFLRAQK YIDLYLVLDNAFYKNYNENLTLIRSFVFDVMNLLNVIYNTIDVQVALVGMEIWSDGDK IKVVPSASTTFDNFLRWHSSNPGKKIHDHAQLLSGISFNNRRVGLAASNSLCSPSSVA VIEAKKKNNVALVGVMSHELGHVLGMPDVPFNTKCPSGSCVMNQYLSSKFPKDFSTSC RAHFERYLLSQKPKCLLQAPIPTNIMTTPVCGNHLLEVGEDCDCGSPKECTNLCCEAL TCKLKPGTDCGGDAPNHTTE" polyA_signal 2165..2170 BASE COUNT 690 a 457 c 431 g 609 t ORIGIN 1 cgcccgggca ggtgagaaat tggagaagat aaaactggac actggggaga ccacaacttc 61 atgctgcgtg ggatctccca gctacctgca gtggccacca tgtcttgggt cctgctgcct 121 gtactttggc tcattgttca aactcaagca atagccataa agcaaacacc tgaattaacg 181 ctccatgaaa tagtttgtcc taaaaaactt cacattttac acaaaagaga gatcaagaac 241 aaccagacag aaaagcatgg caaagaggaa aggtatgaac ctgaagttca atatcagatg 301 atcttaaatg gagaagaaat cattctctcc ctacaaaaaa ccaagcacct cctggggcca 361 gactacactg aaacattgta ctcacccaga ggagaggaaa ttaccacgaa acctgagaac 421 atggaacact gttactataa aggaaacatc ctaaatgaaa agaattctgt tgccagcatc 481 agtacttgtg acgggttgag aggatacttc acacatcatc accaaagata ccagataaaa 541 cctctgaaaa gcacagacga gaaagaacat gccgtcttta catctaacca ggaggaacaa 601 gacccagcta accacacatg tggtgtgaag agcactgacg ggaaacaagg cccaattcga 661 atctctagat cactcaaaag cccagagaaa gaagactttc ttcgggcaca gaaatacatt 721 gatctctatt tggtgctgga taatgccttt tataagaact ataatgagaa tctaactctg 781 ataagaagct ttgtgtttga tgtgatgaac ctactcaatg tgatatataa caccatagat 841 gttcaagtgg ccttggtagg tatggaaatc tggtctgatg gggataagat aaaggtggtg 901 cccagcgcaa gcaccacgtt tgacaacttc ctgagatggc acagttctaa cccggggaaa 961 aagatccacg accatgctca gcttctcagc gggattagct tcaacaatcg acgtgtggga 1021 ctggcagctt caaattcctt gtgttcccca tcttcggttg ctgttattga ggctaaaaaa 1081 aagaataatg tggctcttgt aggagtgatg tcacatgagc tgggccatgt ccttggtatg 1141 cctgatgttc cattcaacac caagtgtccc tctggcagtt gtgtgatgaa tcagtatctg 1201 agttcaaaat tcccaaagga tttcagtaca tcttgccgtg cacattttga aagatacctt 1261 ttatctcaga aaccaaagtg cctgctgcaa gcacctattc ctacaaatat aatgacaaca 1321 ccagtgtgtg ggaaccacct tctagaagtg ggagaagact gtgattgtgg ctctcctaag 1381 gagtgtacca atctctgctg tgaagcccta acgtgtaaac tgaagcctgg aactgattgc 1441 ggaggagatg ctccaaacca taccacagag tgaatccaaa gtctgcttca ctgagatgct 1501 accttgccag gacaagaacc aagaactcta actgtcccag gaatcttgtg aattttcacc 1561 cataatggtc tttcacttgt cattctactt tctatattgt tatcagtcca ggaaacaggt 1621 aaacagatgt aattagagac attggctctt tgtttaggcc taatctttct ttttactttt 1681 ttttttcttt tttctttttt tttaaagatc atgaatttgt gacttagttc tgccctttgg 1741 agaacaaaag aaagcagtct tccatcaaat caccttaaaa tgcacggcta aactattcag 1801 agttaacact ccagaattgt taaattacaa gtactatgct ttaatgcttc tttcatctta 1861 ctagtatggc ctataaaaaa aataatacca cttgatgggt gaaggctttg gcaatagaaa 1921 gaagaataga attcaggttt tatgttattc ctctgtgttc acttcgcctt gctcttgaaa 1981 gtgcagtatt tttctacatc atgtcgagaa tgattcaatg taaatatttt tcattttatc 2041 atgtatatcc tatacacaca tctccttcat catcatatat gaagtttatt ttgagaagtc 2101 tacattgctt acattttaat tgagccagca aagaaggctt aatgatttat tgaaccataa 2161 tgtcaataaa aacacaactt ttgaggc // LOCUS HSDIUBIQU 777 bp RNA PRI 16-OCT-1997 DEFINITION H.sapiens mRNA for diubiquitin. ACCESSION Y12653 NID g2546963 KEYWORDS diubiquitin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 777) AUTHORS Bates,E.E.M., Ravel,O., Dieu,M.C., Ho,S., Guret,C., Caux,C., Banchereau,J. and Lebecque,S. TITLE Identification and analysis of a novel member of the ubiquitin family expressed in Dentric cells and mature B cells JOURNAL Unpublished REFERENCE 2 (bases 1 to 777) AUTHORS Bates,E.E.M. TITLE Direct Submission JOURNAL Submitted (17-APR-1997) E.E.M. Bates, Schering-Plough, Laboratory for Immunological Research, 27 Chemin des Peupliers, BP11, 69571 Dardilly Cedex, FRANCE FEATURES Location/Qualifiers source 1..777 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="dendritic cells" CDS 19..516 /codon_start=1 /product="diubiquitin" /db_xref="PID:e321293" /db_xref="PID:g2546964" /translation="MAPNASCLCVHVRSEEWDLMTFDANPYDSVKKIKEHVRSKTKVP VQDQVLLLGSKILKPRRSLSSYGIDKEKTIHLTLKVVKPSDEELPLFLVESGDEAKRH LLQVRRSSSVAQVKAMIETKTGIIPETQIVTCNGKRLEDGKMMADYGIRKGNLLFLAS YCIGG" polyA_signal 748..753 BASE COUNT 223 a 153 c 195 g 206 t ORIGIN 1 ggccccttgt ctgcagagat ggctcccaat gcttcctgcc tctgtgtgca tgtccgttcc 61 gaggaatggg atttaatgac ctttgatgcc aacccatatg acagcgtgaa aaaaatcaaa 121 gaacatgtcc ggtctaagac caaggttcct gtgcaggacc aggttctttt gctgggctcc 181 aagatcttaa agccacggag aagcctctca tcttatggca ttgacaaaga gaagaccatc 241 caccttaccc tgaaagtggt gaagcccagt gatgaggagc tgcccttgtt tcttgtggag 301 tcaggtgatg aggcaaagag gcacctcctc caggtgcgaa ggtccagctc agtggcacaa 361 gtgaaagcaa tgatcgagac taagacgggt ataatccctg agacccagat tgtgacttgc 421 aatggaaaga gactggaaga tgggaagatg atggcagatt acggcatcag aaagggcaac 481 ttactcttcc tggcatctta ttgtattgga gggtgaccac cctggggatg gggtgttggc 541 aggggtcaaa aagcttattt cttttaatct cttactcaac gaacacatct tctgatgatt 601 tcccaaaatt aatgagaatg agatgagtag agtaagattt gggtgggatg ggtaggatga 661 agtatattgc ccaactctat gtttctttga ttctaacaca attaattaag tgacatgatt 721 tttactaatg tattactgag actagtaaat aaatttttaa ggcaaaatag agcattc // LOCUS HSDKRNA 2564 bp RNA PRI 19-JUN-1992 DEFINITION H.sapiens mRNA for diacylglycerol kinase. ACCESSION X62535 NID g30822 KEYWORDS calcium binding protein; cysteine repeat; diacylglycerol kinase; EF-hand. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2564) AUTHORS Schaap,D., de Widt,J., van der Wal,J., Vandekerckhove,J., van Damme,J., Gussow,D., Ploegh,H.L., van Blitterswijk,W.J. and van der Bend,R.L. TITLE Purification, cDNA-cloning and expression of human diacylglycerol kinase JOURNAL FEBS Lett. 275 (1-2), 151-158 (1990) MEDLINE 91085550 REFERENCE 2 (bases 1 to 2564) AUTHORS Schaap,D. TITLE Direct Submission JOURNAL Submitted (07-OCT-1991) D. Schaap, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX, Amsterdam, THE NETHERLANDS FEATURES Location/Qualifiers source 1..2564 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="Lymphocytes" /cell_type="Jurkat" /clone_lib="Jurkat T-cell cDNA library and DND41 T-cell leukemic cell cDNA library" mRNA 1..2564 /evidence=experimental CDS 104..2311 /EC_number="2.7.1.107" /codon_start=1 /product="diacylglycerol kinase" /db_xref="PID:g30823" /db_xref="SWISS-PROT:P23743" /translation="MAKERGLISPSDFAQLQKYMEYSTKKVSDVLKLFEDGEMAKYVQ GDAIGYEGFQQFLKIYLEVDNVPRHLSLALFQSFETGHCLNETNVTKDVVCLNDVSCY FSLLEGGRPEDKLEFTFKLYDTDRNGILDSSEVDKIILQMMRVAEYLDWDVSELRPIL QEMMKEIDYDGSGSVSQAEWVRAGATTVPLLVLLGLEMTLKDDGQHMWRPKRFPRPVY CNLCESSIGLGKQGLSCNLCKYTVHDQCAMKALPCEVSTYAKSRKDIGVQSHVWVRGG CESGRCDRCQKKIRIYHSLTGLHCVWCHLEIHDDCLQAVGHECDCGLLRDHILPPSSI YPSVLASGPDRKNSKTSQKTMDDLNLSTSEALRIDPVPNTHPLLVFVNPKSGGKQGQR VLWKFQYILNPRQVFNLLKDGPEIGLRLFKDVPDSRILVCGGDGTVGWILETIDKANL PVLPPVAVLPLGTGNDLARCLRWGGGYEGQNLAKILKDLEMSKVVHMDRWSVEVIPQQ TEEKSDPVPFQIINNYFSIGVDASIAHRFHIMREKYPEKFNSRMKNKLWYFEFATSES IFSTCKKLEESLTVEICGKPLDLSNLSLEGIAVLNIPSMHGGSNLWGDTRRPHGDIYG INQALGATAKVITDPDILKTCVPDLSDKRLEVVGLEGAIEMGQIYTKLKNAGRRLAKC SEITFHTTKTLPMQIDVEPWMQTPCTIKITHKNQMPMLMGPPPRSTNFFGFLS" misc_feature 369..537 /note="double EF-hand" repeat_region 618..957 /note="double cysteine repeat" misc_feature 681..747 /note="ATP motif" misc_feature 1422..1479 /note="ATP motif" BASE COUNT 668 a 644 c 653 g 599 t ORIGIN 1 ggggcggtcg cagctgaagc aggcctaccc tctgaagagg tccaagcaac ggaagtacta 61 ctacgaagct gcctttctgg ccatccttga gaaaaataga cagatggcca aggagagggg 121 cctaataagc cccagtgatt ttgcccagct gcaaaaatac atggaatact ccaccaaaaa 181 ggtcagtgat gtcctaaagc tcttcgagga tggcgagatg gctaaatatg tccaaggaga 241 tgccattggg tacgagggat tccagcaatt cctgaaaatc tatctcgaag tggataatgt 301 tcccagacac ctaagcctgg cactgtttca atcctttgag actggtcact gcttaaatga 361 gacaaatgtg acaaaagatg tggtgtgtct caatgatgtt tcctgctact tttcccttct 421 ggagggtggt cggccagaag acaagttaga attcaccttc aagctgtacg acacggacag 481 aaatgggatc ctggacagct cagaagtgga caaaattatc ctacagatga tgcgagtggc 541 tgaatacctg gattgggatg tgtctgagct gaggccgatt cttcaggaga tgatgaaaga 601 gattgactat gatggcagtg gctctgtctc tcaagctgag tgggtccggg ctggggccac 661 caccgtgcca ctgctagtgc tgctgggtct ggagatgact ctgaaggacg acggacagca 721 catgtggagg cccaagaggt tccccagacc agtctactgc aatctgtgcg agtcaagcat 781 tggtcttggc aaacagggac tgagctgtaa cctctgtaag tacactgttc acgaccagtg 841 tgccatgaaa gccctgcctt gtgaagtcag cacctatgcc aagtctcgga aggacattgg 901 tgtccaatca catgtgtggg tgcgaggagg ctgtgagtcc gggcgctgcg accgctgtca 961 gaaaaagatc cggatctacc acagtctgac cgggctgcat tgtgtatggt gccacctaga 1021 gatccacgat gactgcctgc aagcggtggg ccatgagtgt gactgtgggc tgctccggga 1081 tcacatcctg cctccatctt ccatctatcc cagtgtcctg gcctctggac cggatcgtaa 1141 aaatagcaaa acaagccaga agaccatgga tgatttaaat ttgagcacct ctgaggctct 1201 gcggattgac cctgttccta acacccaccc acttctcgtc tttgtcaatc ctaagagtgg 1261 cgggaagcag gggcagaggg tgctctggaa gttccagtat atattaaacc ctcgacaggt 1321 gttcaacctc ctaaaggatg gtcctgagat agggctccga ttattcaagg atgttcctga 1381 tagccggatt ttggtgtgtg gtggagacgg cacagtaggc tggattctag agaccattga 1441 caaagctaac ttgccagttt tgcctcctgt tgctgtgttg cccctgggta ctggaaatga 1501 tctggctcga tgcctaagat ggggaggagg ttatgaagga cagaatctgg caaagatcct 1561 caaggattta gagatgagta aagtggtaca tatggatcga tggtctgtgg aggtgatacc 1621 tcaacaaact gaagaaaaaa gtgacccagt cccctttcaa atcatcaata actacttctc 1681 tattggcgtg gatgcctcta ttgctcatcg attccacatc atgcgagaga aatatccgga 1741 gaagttcaac agcagaatga agaacaagct atggtacttc gaatttgcca catctgaatc 1801 catcttctca acatgcaaaa agctggagga gtctttgaca gttgagatct gtgggaaacc 1861 gctggatctg agcaacctgt ccctagaagg catcgcagtg ctaaacatcc ctagcatgca 1921 tggtggctcc aacctctggg gtgataccag gagaccccat ggggatatct atgggatcaa 1981 ccaggcctta ggtgctacag ctaaagtcat caccgaccct gatatcctga aaacctgtgt 2041 accagaccta agtgacaaga gactggaagt ggttgggctg gagggtgcaa ttgagatggg 2101 ccaaatctat accaagctca agaatgctgg acgtcggctg gccaagtgct ctgagatcac 2161 cttccacacc acaaaaaccc ttcccatgca aattgacgta gaaccctgga tgcagacgcc 2221 ctgtacaatc aagatcaccc acaagaacca gatgcccatg ctcatgggcc cacccccccg 2281 ctccaccaat ttctttggct tcttgagcta agggggacac ccttggcctc caagccagcc 2341 ttgaacccac ctccctgtcc ctggactcta ctcccgaggc tctgtacatt gctgccacat 2401 actcctgcca gcttggggga gtgttccttc accctcacag tatttattat cctgcaccac 2461 ctcactgttc cccatgcgca cacacataca cacaccccaa aacacataca ttgaaagtgc 2521 ctcatctgaa taaaatgact tgtgtttccc tttgggatct gctg // LOCUS HSDLG2 3461 bp RNA PRI 08-AUG-1995 DEFINITION H.sapiens mRNA for DLG2. ACCESSION X82895 NID g939884 KEYWORDS DLG2 gene; tumor supressor gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3461) AUTHORS Mazoyer,S., Gayther,S.A., Nagai,M.A., Smith,S.A., Dunning,A., van Rensburg,E.J., Albertsen,H., White,R. and Ponder,B.A. TITLE A gene (DLG2) located at 17q12-q21 encodes a new homologue of the Drosophila tumor suppressor dIg-A JOURNAL Genomics 28 (1), 25-31 (1995) MEDLINE 96070428 REFERENCE 2 (bases 1 to 3461) AUTHORS Mazoyer,S. TITLE Direct Submission JOURNAL Submitted (24-NOV-1994) S. Mazoyer, CRC Human Cancer Genetics Research Group, Laboratories Block, Addenbrooke's Hospital, Hill Road, Cambridge CB2 2QP, UK FEATURES Location/Qualifiers source 1..3461 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="foetal" /tissue_type="brain" /clone_lib="cDNA library" /clone="38B1/1" /chromosome="17" /map="q12-21" exon 88..237 /gene="DLG2" /number=1 /evidence=experimental CDS 88..1818 /gene="DLG2" /codon_start=1 /db_xref="PID:g939885" /translation="MPVAATNSETAMQQVLDNLGSLPSATGAAELDLIFLRGIMESPI VRSLAKVIMVLWFMQQNVFVPMKYMLKYFGAHERLEETKLEAVRDNNLELVQEILRDL AQLAEQSSTAAELAHILQEPHFQSLLETHDSVASKTYETPPPSPGLDPTFSNQPVPPD AVRMVGIRKTAGEHLGVTFRVEGGELVIARILHGGMVAQQGLLHVGDIIKEVNGQPVG SDPRALQELLRNASGSVILKILPNYQEPHLPRQVFVKCHFDYDPARDSLIPCKEAGLR FNAGDLLQIVNQDDANWWQACHVEGGSAGLIPSQLLEEKRKAFVKRDLELTPNSGTLC GSLSGKKKKRMMYLTTKNAEFDRHELLIYEEVARMPPFRRKTLVLIGAQGVGRRSLKN KLIMWDPDRYGTTVPYTSRRPKDSEREGQGYSFVSRGEMEADVRAGRYLEHGEYEGNL YGTRIDSIRGVVAAGKVCVLDVNPQAVKVLRTAEFVPYVVFIEAPDFETLRAMNRAAL ESGISTKQLTEADLRRTVEESSRIQRGYGHYFDLCLVNSNLERTFRELQTAMEKLRTE PQWVPVSWVY" gene 88..1818 /gene="DLG2" exon 238..309 /gene="DLG2" /number=2 /evidence=experimental exon 310..462 /gene="DLG2" /number=3 /evidence=experimental exon 463..612 /gene="DLG2" /number=4 /evidence=experimental exon 613..840 /gene="DLG2" /number=5 /evidence=experimental repeat_region 640..807 /note="dlg homology repeat" exon 841..972 /gene="DLG2" /number=6 /evidence=experimental misc_feature 844..1029 /gene="DLG2" /note="SH3 domain" exon 973..1078 /gene="DLG2" /number=7 /evidence=experimental exon 1079..1147 /gene="DLG2" /number=8 /evidence=experimental exon 1148..1309 /gene="DLG2" /number=9 /evidence=experimental misc_feature 1207..1779 /gene="DLG2" /note="guanylate domain" exon 1310..1512 /gene="DLG2" /number=10 /evidence=experimental exon 1513..1641 /gene="DLG2" /number=11 /evidence=experimental exon 1642..1818 /gene="DLG2" /number=12 /evidence=experimental BASE COUNT 710 a 1009 c 1019 g 723 t ORIGIN 1 ggagcgcccg gctgcgctgg agccgcccgg agctaggggc tccccggggc gcaggagaga 61 cgtttcagag cccttgcctc cttcaccatg ccggttgccg ccaccaactc tgaaactgcc 121 atgcagcaag tcctggacaa cttgggatcc ctccccagtg ccacgggggc tgcagagctg 181 gacctgatct tccttcgagg cattatggaa agtcccatag taagatccct ggccaaggtg 241 ataatggtat tgtggtttat gcagcagaat gtctttgttc ctatgaaata catgctgaaa 301 tactttgggg cccatgagag gctggaggag acgaagctgg aggccgtgag agacaacaac 361 ctggagctgg tgcaggagat cctgcgggac ctggcgcagc tggctgagca gagcagcaca 421 gccgccgagc tggcccacat cctccaggag ccccacttcc agtccctcct ggagacgcac 481 gactctgtgg cctcaaagac ctatgagaca ccacccccca gccctggcct ggaccctacg 541 ttcagcaacc agcctgtacc tcccgatgct gtgcgcatgg tgggcatccg caagacagcc 601 ggagaacatc tgggtgtaac gttccgcgtg gagggcggcg agctggtgat cgcgcgcatt 661 ctgcatgggg gcatggtggc tcagcaaggc ctgctgcatg tgggtgacat catcaaggag 721 gtgaacgggc agccagtggg cagtgacccc cgcgcactgc aggagctcct gcgcaatgcc 781 agtggcagtg tcatcctcaa gatcctgccc aactaccagg agccccatct gccccgccag 841 gtatttgtga aatgtcactt tgactatgac ccggcccgag acagcctcat cccctgcaag 901 gaagcaggcc tgcgcttcaa cgccggggac ttgctccaga tcgtaaacca ggatgatgcc 961 aactggtggc aggcatgcca tgtcgaaggg ggcagtgctg ggctcattcc cagccagctg 1021 ctggaggaga agcggaaagc atttgtcaag agggacctgg agctgacacc aaactcaggg 1081 accctatgcg gcagcctttc aggaaagaaa aagaagcgaa tgatgtattt gaccaccaag 1141 aatgcagagt ttgaccgtca tgagctgctc atttatgagg aggtggcccg catgcccccg 1201 ttccgccgga aaaccctggt actgattggg gctcagggcg tgggacggcg cagcctgaag 1261 aacaagctca tcatgtggga tccagatcgc tatggcacca cggtgcccta cacctcccgg 1321 cggccgaaag actcagagcg ggaaggtcag ggttacagct ttgtgtcccg tggggagatg 1381 gaggctgacg tccgtgctgg gcgctacctg gagcatggcg aatacgaggg caacctgtat 1441 ggcacacgta ttgactccat ccggggcgtg gtcgctgctg ggaaggtgtg cgtgctggat 1501 gtcaaccccc aggcggtgaa ggtgctacga acggccgagt ttgtccctta cgtggtgttc 1561 atcgaggccc cagacttcga gaccctgcgg gccatgaaca gggctgcgct ggagagtgga 1621 atatccacca agcagctcac ggaggcggac ctgagacgga cagtggagga gagcagccgc 1681 atccagcggg gctacgggca ctactttgac ctctgcctgg tcaatagcaa cctggagagg 1741 accttccgcg agctccagac agccatggag aagctacgga cagagcccca gtgggtgcct 1801 gtcagctggg tgtactgagc ctgttcacct ggtccttggc tcactctgtg ttgaaaccca 1861 gaacctgaat ccatccccct cctgacctgt gaccccctgc cacaatcctt agcccccata 1921 tctggctgtc cttgggtaac agctcccagc aggccctaag tctggcttca gcacagaggc 1981 gtgcactgcc agggaggtgg gcattcatgg ggtaccttgt gcccaggtgc tgcccactcc 2041 tgatgcccat tggtcaccag atatctctga gggccaagct atgcccagga atgtgtcaga 2101 gtcacctcca taatggtcag tacagagaag agaaaagctg ctttgggacc acatggtcag 2161 taggcacact gccccctgcc acccctcccc agtcaccagt tctcctctgg actggccaca 2221 cccaccccat tcctggactc ctcccacctc tcacctctgt gtcggaggaa caggccttgg 2281 gctgtttccg tgtgaccagg ggaatgtgtg gcccgctggc agccaggcag gcccgggtgg 2341 tggtgccagc ctggtgccat cttgaaggct ggaggagtca gagtgagagc cagtggccac 2401 agctgcagag cactgcagct cccagctcct ttggaaaggg acagggtcgc agggcagatg 2461 ctgctcggtc cttccctcat ccacagcttc tcactgccga agtttctcca gatttctcca 2521 atgtgtcctg acaggtcagc cctgctcccc acagggccag gctggcaggg gccagtgggt 2581 tcagcccagg taggggcagg atggagggct gagccctgtg acaacctgct gttaccaact 2641 gaagagcccc aagctctcca tggcccacag caggcacagg tctgagctct atgtccttga 2701 ccttggtcca tttggttttc tgtctagcca ggtccaggta gcccacttgc atcagggctg 2761 ctgggttgga ggggctaagg aggagtgcag aggggacctt gggagcctgg gcttgaagga 2821 cagttgccct ccaggaggtt cctcacacac aactccagag gcgccattta cactgtagtc 2881 tgtacaacct gtggttccac gtgcatgttc ggcacctgtc tgtgcctctg gcaccaggtt 2941 gtgtgtgtgt gcgtgtgcac gtgcgtgtgt gtgtgtgtgt gtcaggttta gtttggggag 3001 gaaccaaagg gttttgtttt ggaggtcact ctttggggcc cctttctggg ggttccccat 3061 cagccctcat ttcttataat accctgatcc cagactccaa agccctggtc ctttcctgat 3121 gtctcctccc ttgtcttatt gtccccctac cctaaatgcc cccctgccat aacttgggga 3181 gggcagtttt gtaaaatagg agactccctt taagaaagaa tgctgtccta gatgtacttg 3241 ggcatctcat ccttcattat tctctgcatt ccttccgggg ggagcctgtc ctcagagggg 3301 acaacctgtg acaccctgag tccaaaccct tgtgcctccc agttcttcca agtgtctaac 3361 tagtcttcgc tgcagcgtca gccaaagctg gcccctgaac cactgtgtgc ccatttccta 3421 gggaagggga aggagaataa acagaatatt tattacaaaa a // LOCUS HSDMBT1 5802 bp RNA PRI 10-SEP-1997 DEFINITION Homo sapiens mRNA for DMBT1 protein. ACCESSION AJ000342 NID g2398620 KEYWORDS DMBT1 protein; tumour suppressor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5802) AUTHORS Mollenhauer,J. TITLE Direct Submission JOURNAL Submitted (10-JUL-1997) Mollenhauer J., Molecular Genome Analysis, German Cancer Research Center, Im Neuenheimer Feld 280, Heidelberg, D-69120, GERMANY REFERENCE 2 (bases 1 to 5802) AUTHORS Mollenhauer,J., Wiemann,S., Scheurlen,W., Korn,B., Hayashi,Y., Wilgenbus,K.K., von Deimling,A. and Poustka,A. TITLE DMBT1, a new member of the SRCR superfamily, on chromosome 10q25.3-26.1 is deleted in malignant brain tumours JOURNAL Nature Genet. 17 (1), 32-39 (1997) MEDLINE 97434209 FEATURES Location/Qualifiers source 1..5802 /organism="Homo sapiens" /note="sequence assembled from three cDNA clones" /db_xref="taxon:9606" /chromosome="10" /dev_stage="fetal" /map="q25.3-q26.1" /tissue_type="lung" sig_peptide 107..181 /gene="DMBT1" CDS 107..5464 /gene="DMBT1" /function="putative tumour suppressor" /note="deleted in malignant brain tumours" /codon_start=1 /product="DMBT1 protein, 5.8 kb transcript" /db_xref="PID:e328724" /db_xref="PID:g2398621" /translation="MGISTVILEMCLLWGQVLSTGGWIPRTTDYASLIPSEVPLDQTV AEGSPFPSESTLESTAAEGSPISLESTLESTVAEGSLIPSESTLESTVAEGSDSGLAL RLVNGDGRCQGRVEILYRGSWGTVCDDSWDTNDANVVCRQLGCGWAMSAPGNAWFGQG SGPIALDDVRCSGHESYLWSCPHNGWLSHNCGHGEDAGVICSAAQPQSTLRPESWPVR ISPPVPTEGSESSLALRLVNGGDRCRGRVEVLYRGSWGTVCDDYWDTNDANVVCRQLG CGWAMSAPGNAQFGQGSGPIVLDDVRCSGHESYLWSCPHNGWLTHNCGHSEDAGVICS APQSRPTPSPDTWPTSHASTAGPESSLALRLVNGGDRCQGRVEVLYRGSWGTVCDDSW DTSDANVVCRQLGCGWATSAPGNARFGQGSGPIVLDDVRCSGYESYLWSCPHNGWLSH NCQHSEDAGVICSAAHSWSTPSPDTLPTITLPASTVGSESSLALRLVNGGDRCQGRVE VLYQGSWGTVCDDSWDTNDANVVCRQPGCGWAMSAPGNARFGQGSGPIVLDDVRCSGH ESYPWSCPHNGWLSHNCGHSEDAGVICSASQSRPTPSPDTWPTSHASTAGSESSLALR LVNGGDRCQGRVEVLYRGSWGTVCDDYWDTNDANVVCRQLGCGWAMSAPGNARFGQGS GPIVLDDVRCSGHESYLWSCPHNGWLSHNCGHHEDAGVICSASQSQPTPSPDTWPTSH ASTAGSESSLALRLVNGGDRCQGRVEVLYRGSWGTVCDDYWDTNDANVVCRQLGCGWA TSAPGNARFGQGSGPIVLDDVRCSGHESYLWSCPHNGWLSHNCGHHEDAGVICSASQS QPTPSPDTWPTSRASTAGSESTLALRLVNGGDRCRGRVEVLYQGSWGTVCDDYWDTND ANVVCRQLGCGWAMSAPGNAQFGQGSGPIVLDDVRCSGHESYLWSCPHNGWLSHNCGH HEDAGVICSAAQSQSTPRPDTWLTTNLPALTVGSESSLALRLVNGGDRCRGRVEVLYR GSWGTVCDDSWDTNDANVVCRQLGCGWAMSAPGNARFGQGSGPIVLDDVRCSGNESYL WSCPHKGWLTHNCGHHEDAGVICSATQINSTTTDWWHPTTTTTARPSSNCGGFLFYAS GTFSSPSYPAYYPNNAKCVWEIEVNSGYRINLGFSNLKLEAHHNCSFDYVEIFDGSLN SSLLLGKICNDTRQIFTSSYNRMTIHFRSDISFQNTGFLAWYNSFPSDATLRLVNLNS SYGLCAGRVEIYHGGTWGTVCDDSWTIQEAEVVCRQLGCGRAVSALGNAYFGSGSGPI TLDDVECSGTESTLWQCRNRGWFSHNCNHREDAGVICSGNHLSTPAPFLNITRPNTDY SCGGFLSQPSGDFSSPFYPGNYPNNAKCVWDIEVQNNYRVTVIFRDVQLEGGCNYDYI EVFDGPYRSSPLIARVCDGARGSFTSSSNFMSIRFISDHSITRRGFRAEYYSSPSNDS TNLLCLPNHMQASVSRSYLQSLGFSASDLVISTWNGYYECRPQITPNLVIFTIPYSGC GTFKQADNDTIDYSNFLTAAVSGGIIKRRTDLRIHVSCRMLQNTWVDTMYIANDTIHV ANNTIQVEEVQYGNFDVNISFYTSSSFLYPVTSRPYYVDLNQDLYVQAEILHSDAVLT LFVDTCVASPYSNDFTSLTYDLIRSGCVRDDTYGPYSSPSLRIARFRFRAFHFLNRFP SVYLRCKMVVCRAYDPSSRCYRGCVLRSKRDVGSYQEKVDVVLGPIQLQTPPRREEEP R" gene 107..5464 /gene="DMBT1" mat_peptide 182..5461 /gene="DMBT1" /product="DMBT1 protein, 5.8 kb transcript" BASE COUNT 1253 a 1614 c 1576 g 1359 t ORIGIN 1 tttatagcag cagcagaaat ataccaccct agaggacaca cctcctttta gctaggtacc 61 tataaatgtc caggattttc tattcaattg agaagaaccc agcaaaatgg ggatctccac 121 agtcatcctt gaaatgtgtc ttttatgggg acaagttcta tctacaggtg ggtggatccc 181 aaggactaca gactacgctt cactgattcc ctcggaggtg cccttggatc aaactgtagc 241 agaaggttct ccatttccct cggagtcgac cctggagtca actgcagcag aaggttctcc 301 gatttccttg gagtcaaccc tggagtcaac tgtagcagaa ggttctctga ttccctcaga 361 gtcaaccctg gagtcaactg tagcagaagg atctgattct ggtttggccc tgaggctggt 421 gaatggagat ggcaggtgtc agggccgagt ggagatccta taccgaggct cctggggcac 481 cgtgtgtgat gacagctggg acaccaatga tgccaacgtg gtctgtaggc agctgggttg 541 tggctgggcc atgtcagctc caggaaatgc ctggtttggc cagggctcag gacccattgc 601 cctggatgat gtgcgctgct caggacacga atcctacctg tggagctgcc cccacaatgg 661 ctggctctcc cataactgtg gccatggtga agatgctggt gttatctgct cagctgccca 721 gcctcagtca acactcaggc cagaaagttg gcctgtcagg atatcaccac ctgtacccac 781 agaaggatct gaatccagtt tggccctgag gctggtgaat ggaggcgaca ggtgtcgagg 841 ccgagtggag gtcctatacc gaggctcctg gggcaccgtg tgtgatgact actgggacac 901 caatgatgcc aatgtggtct gcaggcagct gggctgtggc tgggccatgt cagccccagg 961 aaatgcccag tttggccagg gctcaggacc cattgtcctg gatgatgtgc gctgctcagg 1021 acacgagtcc tacctgtgga gctgccccca caatggctgg ctcacccaca actgtggcca 1081 tagtgaagac gctggtgtca tctgctcagc tccccagtcc cggccgacac ccagcccaga 1141 tacttggccg acctcacatg catcaacagc aggacctgaa tccagtttgg ccctgaggct 1201 ggtgaatgga ggtgacaggt gtcagggccg agtggaggtc ctataccgag gctcctgggg 1261 caccgtgtgt gatgatagct gggacaccag tgacgccaat gtggtctgcc ggcagctggg 1321 ctgtggctgg gccacgtcag ccccaggaaa tgcccggttt ggccagggtt caggacccat 1381 tgtcctggat gacgtgcgct gctcaggcta tgagtcctac ctgtggagct gcccccacaa 1441 tggctggctc tcccataact gtcagcacag tgaagacgct ggtgtcatct gctcagctgc 1501 ccactcctgg tcgacgccca gtccagacac attgccgacc atcaccttgc ctgcatcgac 1561 agtaggatct gaatccagtt tggccctgag gctggtgaat ggaggtgaca ggtgtcaggg 1621 ccgagtggag gtcctatacc aaggctcctg gggcaccgtg tgcgatgaca gctgggacac 1681 caatgatgcc aatgtcgtct gcaggcaacc gggctgtggc tgggccatgt cagccccagg 1741 aaatgcccgg tttggtcagg gctcaggacc cattgtcctg gatgatgtgc gctgctcagg 1801 acacgagtct tacccgtgga gctgccccca caatggctgg ctctcccaca actgtggcca 1861 tagtgaagac gctggtgtca tctgctcagc ttcccagtcc cggccaacac ctagtccaga 1921 cacttggcca acctcacatg catcaacagc aggatctgaa tccagtttgg ccctgaggct 1981 ggtgaatgga ggtgacaggt gtcagggccg agtggaggtc ctataccgag gctcctgggg 2041 caccgtgtgt gatgactact gggacaccaa tgatgccaat gtggtttgca ggcagctggg 2101 ctgtggctgg gccatgtcag ccccaggaaa tgcccggttt ggccagggtt caggacccat 2161 tgtcctggat gatgtgcgct gctcaggaca tgagtcctat ctgtggagct gcccccacaa 2221 tggctggctc tcccacaact gtggccatca tgaagacgct ggtgtcatct gctcagcttc 2281 ccagtcccag ccgacaccca gcccagacac ttggccaacc tcacatgcat caacagcagg 2341 atctgaatcc agtttggccc tgaggctggt gaatggaggt gacaggtgtc agggccgagt 2401 ggaggtccta taccgaggct cctggggcac cgtgtgtgat gactactggg acaccaatga 2461 tgccaatgtg gtttgcaggc agctgggctg tggctgggcc acgtcagccc caggaaatgc 2521 ccggtttggc cagggttcag gacccattgt cctggatgat gtgcgctgct caggacatga 2581 gtcctatctg tggagctgcc cccacaatgg ctggctctcc cacaactgtg gccatcatga 2641 agacgctggt gtcatctgct cagcttccca gtcccagccg acacccagcc cagacacttg 2701 gccaacctct cgtgcatcaa cagcaggatc tgaatccact ttggccctga gactggtgaa 2761 tggaggtgac aggtgtcgag gccgagtgga ggtcctatac caaggctcct ggggcaccgt 2821 gtgtgatgac tactgggaca ccaatgatgc caacgtggtc tgcaggcagc tgggctgtgg 2881 ctgggccatg tcagccccag gaaatgccca gtttggccag ggctcaggac ccattgtcct 2941 ggatgatgtg cgctgctcag gacacgagtc ttacctgtgg agctgccccc acaatggctg 3001 gctctcccac aactgtggcc atcatgaaga tgctggtgtc atctgctcag ctgctcagtc 3061 ccagtcaacg cccaggccag atacttggct gaccaccaac ttaccggcat tgacagtagg 3121 atctgaatcc agtttggctc tgaggctggt gaatggaggt gacaggtgtc gaggccgagt 3181 ggaggtcctg tatcgaggct cctggggaac cgtgtgtgat gacagctggg acaccaatga 3241 tgccaatgtg gtctgcaggc agctgggctg tggctgggcc atgtcggccc caggaaatgc 3301 ccggtttggc cagggctcag gacccattgt cctggatgat gtgcgctgct cagggaatga 3361 gtcctacctg tggagctgcc cccacaaagg ctggctcacc cacaactgtg gccatcacga 3421 agacgctggt gtcatctgct cagccaccca aataaattct actacgacag attggtggca 3481 tccaacaact acaaccactg caagaccctc ttcaaattgt ggtggcttct tattctatgc 3541 cagtgggaca ttctccagcc catcctaccc tgcatactac cccaacaatg ctaagtgtgt 3601 ttgggaaata gaagtgaatt ctggttatcg cataaacctg ggcttcagta atctgaaatt 3661 ggaggcacac cataactgca gttttgatta tgttgaaatc tttgatggat cattgaatag 3721 cagtctcctg ctggggaaaa tctgtaatga taccaggcaa atatttacat cttcttacaa 3781 ccgaatgacc attcactttc gaagtgacat cagtttccaa aacactggct ttttggcttg 3841 gtataactcc ttcccaagcg atgccacctt gaggttggtc aatttaaatt catcctatgg 3901 tctatgtgcc gggcgtgtag aaatttacca tggtggcacc tgggggacag tttgtgatga 3961 ctcctggacc attcaggaag ctgaggtggt ctgcagacag ctagggtgtg gacgtgcagt 4021 ttcagccctt ggaaatgcat attttggctc tggctctggc cccatcaccc tggacgatgt 4081 agagtgctca gggacggaat ccactctctg gcagtgccgg aaccgaggct ggttctccca 4141 caactgtaat catcgtgaag atgctggtgt catctgctca ggaaaccatc tatcgacacc 4201 tgctcctttt ctcaacatca cccgtccaaa cacagattat tcctgcggag gcttcctatc 4261 ccaaccatca ggggactttt ccagcccatt ctatcccggg aactatccaa acaatgccaa 4321 gtgtgtgtgg gacattgagg tgcaaaacaa ctaccgtgtg actgtgatct tcagagatgt 4381 ccagcttgaa ggtggctgca actatgatta tattgaagtt ttcgatggcc cctaccgcag 4441 ttcccctctc attgctcgag tttgtgatgg ggccagaggc tccttcactt cttcctccaa 4501 cttcatgtcc attcgcttca tcagtgacca cagcatcaca aggagagggt tccgggctga 4561 gtactactcc agtccctcca atgacagcac caacctgctc tgtctgccaa atcacatgca 4621 agccagtgtg agcaggagct atctccaatc cttgggcttt tctgccagtg accttgtcat 4681 ttccacctgg aatggatact acgagtgtcg gccccagata acgccgaacc tggtgatatt 4741 cacaattccc tactcaggct gcggcacctt caagcaggca gacaatgaca ccatcgacta 4801 ttccaacttc ctcacagcag ctgtctcagg tggcatcatc aagaggagga cagacctccg 4861 tattcacgtc agctgcagaa tgcttcagaa cacctgggtc gacaccatgt acattgctaa 4921 tgacaccatc cacgttgcta ataacaccat ccaggtcgag gaagtccagt atggcaattt 4981 tgacgtgaac atttcctttt atacttcctc atctttcttg tatcctgtga ccagccgccc 5041 ttactacgtg gacctgaacc aggacttgta cgttcaggct gaaatcctcc attctgatgc 5101 tgtactgacc ttgtttgtgg acacctgcgt ggcatcacca tactccaatg acttcacgtc 5161 tttgacttat gatctaatcc ggagtggatg cgtgagggat gacacctacg gaccctactc 5221 ctcgccgtct cttcgcattg cccgcttccg gttcagggcc ttccacttcc tgaaccgctt 5281 cccctccgtg tacctgcgtt gtaaaatggt ggtgtgcaga gcgtatgacc cctcttcccg 5341 ctgctaccga ggctgtgtgt tgaggtcgaa gagggatgtg ggctcctacc aggaaaaggt 5401 ggacgtcgtc ctgggtccca tccagctgca gaccccccca cgccgagaag aggagcctcg 5461 gtaggtggtc gctctcagac cccactgtcc accggggcgc agacccctga ctcggggact 5521 tgggatgttc ctcttggtgt catattccaa ctcagattga gccctacatt gtgctgcacc 5581 tggtcatacg gagttgaatc agacctggtt cccgcctccc ccaaggctca tggtccttgg 5641 aggacccgtt gcagggcgag gtcaagagag ttctgacctg gatggcccat agacctgacg 5701 tcccagaatc catgcttctc atctgcaaaa tgaaaatgtc aatacttact tcttagcact 5761 gttgagaggg ttacttacat aaaggaattt tggtgaaact gc // LOCUS HSDMDR 12446 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for dystrophin. ACCESSION X14298 NID g30845 KEYWORDS Dmd gene; Duchenne muscular dystrophy; dystrophin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 12446) AUTHORS Rosenthal,A. TITLE Direct Submission JOURNAL Submitted (09-FEB-1989) Rosenthal A., Akademie der Wissenschaften der DDr, Zentralinstitut fuer Molekularbiologie, Robert-Roessle Str.10, 1115 Berlin Buch, DDR REFERENCE 2 (bases 1 to 12446) AUTHORS Rosenthal,A., Speer,A., Billwitz,H., Cross,G.S., Forrest,S.M. and Davies,K.E. TITLE Two human cDNA molecules coding for the Duchenne muscular dystrophy (DMD) locus are highly homologous JOURNAL Nucleic Acids Res. 17 (13), 5391 (1989) MEDLINE 89345106 COMMENT see also M18533 and M20250 for Dmd seqs.; discrepancies compared to M18533 cDNA were located at x14298 pos. 496, 1772, 1965, 2449, 3687, 4229, 4504, 5075, 5332, 5630 and 7194. FEATURES Location/Qualifiers source 1..12446 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal and adult." /tissue_type="muscle" /chromosome="X chromosomal, Xp21." CDS 99..11156 /note="dystrophin (AA 1 - 3685)" /codon_start=1 /db_xref="PID:g30846" /db_xref="SWISS-PROT:P11532" /translation="MLWWEEVEDCYEREDVQKKTFTKWVNAQFSKFGKQHIENLFSDL QDGRRLLDLLEGLTGQKLPKEKGSTRVHALNNVNKALRVLQNNNVDLVNIGSTDIVDG NHKLTLGLIWNIILHWQVKNVMKNIMAGLQPTNSEKILLSWVRQSTRNYPQVNVINFT TSWSDGLALNALIHSHRPDLFDWNSVVCQQSATQRLEHAFNIARYQLGIEKLLDPEDV DTTYPDKKSILMYITSLFQVLPQQVSIEAIQEVEMLPRPPKVTKEEHFQLHHQMHYSQ QITVSLAQGYERTSSPKPRFKSYAYTQAAYVTTSDPTRSPFPSQHLEAPEDKSFGSSL MESEVNLDRYQTALEEVLSWLLSAEDTLQAQGEISNDVEVVKDQFHTHEGYMMDLTAH QGRVGNILQLGSKLIGTGKLSEDEETEVQEQMNLLNSRWECLRVASMEKQSNLHRVLM DLQNQKLKELNDWLTKTEERTRKMEEEPLGPDLEDLKRQVQQHKVLQEDLEQEQVRVN SLTHMVVVVDESSGDHATAALEEQLKVLGDRWANICRWTEDRWVLLQDILLKWQRLTE EQCLFSAWLSEKEDAVNKIHTTGFKDQNEMLSSLQKLAVLKADLEKKKQSMGKLYSIK QDLLSTLKNKSVTQKTEAWLDNFARCWDNLVQKLEKSTAQISQAVTTTQPSLTQTTVM ETVTTVTTREQILVKHAQEELPPPPPQKKRQITVDSEIRKRLDVDITELHSWITRSEA VLQSPEFAIFRKEGNFSDLKEKVNAIEREKAEKFRKLQDASRSGQALVEQMVNEGVNA DSIKQASEQLNSRWIEFCQLLSERLNWLEYQNNIIAFYNQLQQLEQMTTTAENWLKIQ PTTPSEPTAIKSQLKICKDEVNRLSGLQPQIERLKIQSIALKEKGQGPMFLDADFVAF TNHFKQVFSDVQAREKELQTIFDTLPPMRYQETMSAIRTWVQQSETKLSIPQLSVTDY EIMEQRLGELQALQSSLQEQQSGLYYLSTTVKEMSKKAPSEISRKYQSEFEEIEGRWK KLSSQLVEHCQKLEEQMNKLRKIQNHIQTLKKWMAEVDVFLKEEWPALGDSEILKKQL KQCRLLVSDIQTIQPSLNSVNEGGQKIKNEAEPEFASRLETELKELNTQWDHMCQQVY ARKEALKGGLEKTVSLQKDLSEMHEWMTQAEEEYLERDFEYKTPDELQKAFEEMKRAK EEAQQKEAKVKLLTESVNSVIAQAPPVAQEALKKELETLTTNYQWLCTRLNGKCKTLE EVWACWHELLSYLEKANKWLNEVEFKLKTTENIPGGAEEISEVLDSLENLMRHSEDNP NQIRILAQTLTDGGVMDELINEELETFNSRWRELHEEAVRRQKLLEQSIQSAQETENS LHLIQESLTFIDKQLAAYIADKVDAAQMPQEAQKIQSDLTSHEISLEEMKKHNQGKEA AQRVLSQIDVAQKKLQDVSMKFRLFQKPANFEQRLQESKMILDEVKMHLPALETKSVE QEVVQSQLNHCVNLYKSLSEVKSEVEMVIKTGRQIVQKKQTENPKELDERVTALKLHY NELGAKVTERKQQLEKCLKLSRKMRKEMNVLTEWLAATDMELTKRSAVEGMPSNLDSE VAWGKATQKEIEKQKVHLKSITEVGEALKTVLGKKETLVEDKLSLLNSNWIAVTSRAE EWLNLLLEYQKHMETFDQNVDHITKWIIQADTLLDESEKKKPQQKEDVLKRLKAELND IRPKVDSTRDQAANLMANHGDHCRKLVEPQISELNHRFAAISHRIKTGKASIPLKELE QFNSDIQKLLEPLEAEIQQGVNLKEEDFNKDMNEDNEGTVKELLQRGDNLQQRITDER KSEEIKIKQQLLQTKHNALKDLRSQRRKKALEISHQWYQYKRQADDLLKCLDDIEKKL ASLPEPRDERKIKEIDRELQKKKEELNAVRRQAEGLSEDGAAMAVEPTQIQLSKRWRE IESKFAQFRRLNFAQIHTVREETMMVMTEDMPLEISYVPSTYLTEITHVSQALLEVEQ LLNAPDLCAKDFEDLFKQEESLKNIKDSLQQSSGRIDIIHSKKTAALQSATPVERVKL QEALSQLDFQWEKVNKMYKDRQGRFDRSVEKWRRFHYDIKIFNQWLTEAEQFLRKTQI PENWEHAKYKWYLKELQDGIGQRQTVVRTLNATGEEIIQQSSKTDASILQEKLGSLNL RWQEVCKQLSDRKKRLEEQKNILSEFQRDLNEFVLWLEEADNIASIPLEPGKEQQLKE KLEQVKLLVEELPLRQGILKQLNETGGPVLVSAPISPEEQDKLENKLKQTNLQWIKVS RALPEKQGEIEAQIKDLGQLEKKLEDLEEQLNHLLLWLSPIRNQLEIYNQPNQEGPFD VKETEIAVQAKQPDVEEILSKGQHLYKEKPATQPVKRKLEDLSSEWKAVNRLLQELRA KQPDLAPGLTTIGASPTQTVTLVTQPVVTKETAISKLEMPSSLMLEVPALADFNRAWT ELTDWLSLLDQVIKSQRVMVGDLEDINEMIIKQKATMQDLEQRRPQLEELITAAQNLK NKTSNQEARTIITDRIERIQNQWDEVQEHLQNRRQQLNEMLKDSTQWLEAKEEAEQVL GQARAKLESWKEGPYTVDAIQKKITETKQLAKDLRQWQTNVDVANDLALKLLRDYSAD DTRKVHMITENINASWRSIHKRVSEREAALEETHRLLQQFPLDLEKFLAWLTEAETTA NVLQDATRKERLLEDSKGVKELMKQWQDLQGEIEAHTDVYHNLDENSQKILRSLEGSD DAVLLQRRLDNMNFKWSELRKKSLNIRSHLEASSDQWKRLHLSLQELLVWLQLKDDEL SRQAPIGGDFPAVQKQNDVHRAFKRELKTKEPVIMSTLETVRIFLTEQPLEGLEKLYQ EPRELPPEERAQNVTRLLRKQAEEVNTEWEKLNLHSADWQRKIDETLERLQELQEATD ELDLKLRQAEVIKGSWQPVGDLLIDSLQDHLEKVKALRGEIAPLKENVSHVNDLARQL TTLGIQLSPYNLSTLEDLNTRWKLLQVAVEDRVRQLHEAHRDFGPASQHFLSTSVQGP WERAISPNKVPYYINHETQTTCWDHPKMTELYQSLADLNNVRFSAYRTAMKLRRLQKA LCLDLLSLSAACDALDQHNLKQNDQPMDILQIINCLTTIYDRLEQEHNNLVNVPLCVD MCLNWLLNVYDTGRTGRIRVLSFKTGIISLCKAHLEDKYRYLFKQVASSTGFCDQRRL GLLLHDSIQIPRQLGEVASFGGSNIEPSVRSCFQFANNKPEIEAALFLDWMRLEPQSM VWLPVLHRVAAAETAKHQAKCNICKECPIIGFRYRSLKHFNYDICQSCFFSGRVAKGH KMHYPMVEYCTPTTSGEDVRDFAKVLKNKFRTKRYFAKHPRMGYLPVQTVLEGDNMET PVTLINFWPVDSAPASSPQLSHDDTHSRIEHYASRLAEMENSNGSYLNDSISPNESID DEHLLIQHYCQSLNQDSPLSQPRSPAQILISLESEERGELERILADLEEENRNLQAEY DRLKQQHEHKGLSPLPSPPEMMPTSPQSPRDAELIAEAKLLRQHKGRLEARMQILEDH NKQLESQLHRLRQLLEQPQAEAKVNGTTVSSPSTSLQRSDSSQPMLLRVVGSQTSDSM GEEDLLSPPQDTSTGLEEVMEQLNNSFPSSRGRNTPGKPMREDTM" BASE COUNT 4135 a 2524 c 2876 g 2911 t ORIGIN 1 tgttggtttc tcattgtttt taagcctact ggagcaataa agtttgaaga acttttacca 61 ggtttttttt atcgctgcct tgatatacac ttttcaaaat gctttggtgg gaagaagtag 121 aggactgtta tgaaagagaa gatgttcaaa agaaaacatt cacaaaatgg gtaaatgcac 181 aattttctaa gtttgggaag cagcatattg agaacctctt cagtgaccta caggatggga 241 ggcgcctcct agacctcctc gaaggcctga cagggcaaaa actgccaaaa gaaaaaggat 301 ccacaagagt tcatgccctg aacaatgtca acaaggcact gcgggttttg cagaacaata 361 atgttgattt agtgaatatt ggaagtactg acatcgtaga tggaaatcat aaactgactc 421 ttggtttgat ttggaatata atcctccact ggcaggtcaa aaatgtaatg aaaaatatca 481 tggctggatt gcaaccaacc aacagtgaaa agattctcct gagctgggtc cgacaatcaa 541 ctcgtaatta tccacaggtt aatgtaatca acttcaccac cagctggtct gatggcctgg 601 ctttgaatgc tctcatccat agtcataggc cagacctatt tgactggaat agtgtggttt 661 gccagcagtc agccacacaa cgactggaac atgcattcaa catcgccaga tatcaattag 721 gcatagagaa actactcgat cctgaagatg ttgataccac ctatccagat aagaagtcca 781 tcttaatgta catcacatca ctcttccaag ttttgcctca acaagtgagc attgaagcca 841 tccaggaagt ggaaatgttg ccaaggccac ctaaagtgac taaagaagaa cattttcagt 901 tacatcatca aatgcactat tctcaacaga tcacggtcag tctagcacag ggatatgaga 961 gaacttcttc ccctaagcct cgattcaaga gctatgccta cacacaggct gcttatgtca 1021 ccacctctga ccctacacgg agcccatttc cttcacagca tttggaagct cctgaagaca 1081 agtcatttgg cagttcattg atggagagtg aagtaaacct ggaccgttat caaacagctt 1141 tagaagaagt attatcgtgg cttctttctg ctgaggacac attgcaagca caaggagaga 1201 tttctaatga tgtggaagtg gtgaaagacc agtttcatac tcatgagggg tacatgatgg 1261 atttgacagc ccatcagggc cgggttggta atattctaca attgggaagt aagctgattg 1321 gaacaggaaa attatcagaa gatgaagaaa ctgaagtaca agagcagatg aatctcctaa 1381 attcaagatg ggaatgcctc agggtagcta gcatggaaaa acaaagcaat ttacatagag 1441 ttttaatgga tctccagaat cagaaactga aagagttgaa tgactggcta acaaaaacag 1501 aagaaagaac aaggaaaatg gaggaagagc ctcttggacc tgatcttgaa gacctaaaac 1561 gccaagtaca acaacataag gtgcttcaag aagatctaga acaagaacaa gtcagggtca 1621 attctctcac tcacatggtg gtggtagttg atgaatctag tggagatcac gcaactgctg 1681 ctttggaaga acaacttaag gtattgggag atcgatgggc aaacatctgt agatggacag 1741 aagaccgctg ggttctttta caagacatcc tgctcaaatg gcaacgtctt actgaagaac 1801 agtgcctttt tagtgcatgg ctttcagaaa aagaagatgc agtgaacaag attcacacaa 1861 ctggctttaa agatcaaaat gaaatgttat caagtcttca aaaactggcc gttttaaaag 1921 cggatctaga aaagaaaaag caatccatgg gcaaactgta ttcaatcaaa caagatcttc 1981 tttcaacact gaagaataag tcagtgaccc agaagacgga agcatggctg gataactttg 2041 cccggtgttg ggataattta gtccaaaaac ttgaaaagag tacagcacag atttcacagg 2101 ctgtcaccac cactcagcca tcactaacac agacaactgt aatggaaaca gtaactacgg 2161 tgaccacaag ggaacagatc ctggtaaagc atgctcaaga ggaacttcca ccaccacctc 2221 cccaaaagaa gaggcagatt actgtggatt ctgaaattag gaaaaggttg gatgttgata 2281 taactgaact tcacagctgg attactcgct cagaagctgt gttgcagagt cctgaatttg 2341 caatctttcg gaaggaaggc aacttctcag acttaaaaga aaaagtcaat gccatagagc 2401 gagaaaaagc tgagaagttc agaaaactgc aagatgccag cagatcaggt caggccctgg 2461 tggaacagat ggtgaatgag ggtgttaatg cagatagcat caaacaagcc tcagaacaac 2521 tgaacagccg gtggatcgaa ttctgccagt tgctaagtga gagacttaac tggctggagt 2581 atcagaacaa catcatcgct ttctataatc agctacaaca attggagcag atgacaacta 2641 ctgctgaaaa ctggttgaaa atccaaccca ccaccccatc agagccaaca gcaattaaaa 2701 gtcagttaaa aatttgtaag gatgaagtca accggctatc aggtcttcaa cctcaaattg 2761 aacgattaaa aattcaaagc atagccctga aagagaaagg acaaggaccc atgttcctgg 2821 atgcagactt tgtggccttt acaaatcatt ttaagcaagt cttttctgat gtgcaggcca 2881 gagagaaaga gctacagaca atttttgaca ctttgccacc aatgcgctat caggagacca 2941 tgagtgccat caggacatgg gtccagcagt cagaaaccaa actctccata cctcaactta 3001 gtgtcaccga ctatgaaatc atggagcaga gactcgggga attgcaggct ttacaaagtt 3061 ctctgcaaga gcaacaaagt ggcctatact atctcagcac cactgtgaaa gagatgtcga 3121 agaaagcgcc ctctgaaatt agccggaaat atcaatcaga atttgaagaa attgagggac 3181 gctggaagaa gctctcctcc cagctggttg agcattgtca aaagctagag gagcaaatga 3241 ataaactccg aaaaattcag aatcacatac aaaccctgaa gaaatggatg gctgaagttg 3301 atgtttttct gaaggaggaa tggcctgccc ttggggattc agaaattcta aaaaagcagc 3361 tgaaacagtg cagactttta gtcagtgata ttcagacaat tcagcccagt ctaaacagtg 3421 tcaatgaagg tgggcagaag ataaagaatg aagcagagcc agagtttgct tcgagacttg 3481 agacagaact caaagaactt aacactcagt gggatcacat gtgccaacag gtctatgcca 3541 gaaaggaggc cttgaaggga ggtttggaga aaactgtaag cctccagaaa gatctatcag 3601 agatgcacga atggatgaca caagctgaag aagagtatct tgagagagat tttgaatata 3661 aaactccaga tgaattacag aaagcatttg aagagatgaa gagagctaaa gaagaggccc 3721 aacaaaaaga agcgaaagtg aaactcctta ctgagtctgt aaatagtgtc atagctcaag 3781 ctccacctgt agcacaagag gccttaaaaa aggaacttga aactctaacc accaactacc 3841 agtggctctg cactaggctg aatgggaaat gcaagacttt ggaagaagtt tgggcatgtt 3901 ggcatgagtt attgtcatac ttggagaaag caaacaagtg gctaaatgaa gtagaattta 3961 aacttaaaac cactgaaaac attcctggcg gagctgagga aatctctgag gtgctagatt 4021 cacttgaaaa tttgatgcga cattcagagg ataacccaaa tcagattcgc atattggcac 4081 agaccctaac agatggcgga gtcatggatg agctaatcaa tgaggaactt gagacattta 4141 attctcgttg gagggaacta catgaagagg ctgtaaggag gcaaaagttg cttgaacaga 4201 gcatccagtc tgcccaggag actgaaaatt ccttacactt aatccaggag tccctcacat 4261 tcattgacaa gcagttggca gcttatattg cagacaaggt ggacgcagct caaatgcctc 4321 aggaagccca gaaaatccaa tctgatttga caagtcatga gatcagttta gaagaaatga 4381 agaaacataa tcaggggaag gaggctgccc aaagagtcct gtctcagatt gatgttgcac 4441 agaaaaaatt acaagatgtc tccatgaagt ttcgattatt ccagaaacca gccaattttg 4501 agcagcgtct acaagaaagt aagatgattt tagatgaagt gaagatgcac ttgcctgcat 4561 tggaaacaaa gagtgtggaa caggaagtag tacagtcaca gctaaatcat tgtgtgaact 4621 tgtataaaag tctgagtgaa gtgaagtctg aagtggaaat ggtgataaag actggacgtc 4681 agattgtaca gaaaaagcag acggaaaatc ccaaagaact tgatgaaaga gtaacagctt 4741 tgaaattgca ttataatgag ctgggagcaa aggtaacaga aagaaagcaa cagttggaga 4801 aatgcttgaa attgtcccgt aagatgcgaa aggaaatgaa tgtcttgaca gaatggctgg 4861 cagctacaga tatggaattg acaaagagat cagcagttga aggaatgcct agtaatttgg 4921 attctgaagt tgcctgggga aaggctactc aaaaagagat tgagaaacag aaggtgcacc 4981 tgaagagtat cacagaggta ggagaggcct tgaaaacagt tttgggcaag aaggagacgt 5041 tggtggaaga taaactcagt cttctgaata gtaattggat agctgtcacc tcccgagcag 5101 aagagtggtt aaatcttttg ttggaatacc agaaacacat ggaaactttt gaccagaatg 5161 tggaccacat cacaaagtgg atcattcagg ctgacacact tttggatgaa tcagagaaaa 5221 agaaacccca gcaaaaagaa gacgtgctta agcgtttaaa ggcagaactg aatgacatac 5281 gcccaaaggt ggactctaca cgtgaccaag cagcaaactt gatggcaaac cacggtgacc 5341 actgcaggaa attagtagag ccccaaatct cagagctcaa ccatcgattt gcagccattt 5401 cacacagaat taagactgga aaggcctcca ttcctttgaa ggaattggag cagtttaact 5461 cagatataca aaaattgctt gaaccactgg aggctgaaat tcagcagggg gtgaatctga 5521 aagaggaaga cttcaataaa gatatgaatg aagacaatga gggtactgta aaagaattgt 5581 tgcaaagagg agacaactta caacaaagaa tcacagatga gagaaagagc gaggaaataa 5641 agataaaaca gcagctgtta cagacaaaac ataatgctct caaggatttg aggtctcaaa 5701 gaagaaaaaa ggctctagaa atttctcatc agtggtatca gtacaagagg caggctgatg 5761 atctcctgaa atgcttggat gacattgaaa aaaaattagc cagcctacct gagcccagag 5821 atgaaaggaa aataaaggaa attgatcggg aattgcagaa gaagaaagag gagctgaatg 5881 cagtgcgtag gcaagctgag ggcttgtctg aggatggggc cgcaatggca gtggagccaa 5941 ctcagatcca gctcagcaag cgctggcggg aaattgagag caaatttgct cagtttcgaa 6001 gactcaactt tgcacaaatt cacactgtcc gtgaagaaac gatgatggtg atgactgaag 6061 acatgccttt ggaaatttct tatgtgcctt ctacttattt gactgaaatc actcatgtct 6121 cacaagccct attagaagtg gaacaacttc tcaatgctcc tgacctctgt gctaaggact 6181 ttgaagatct ctttaagcaa gaggagtctc tgaagaatat aaaagatagt ctacaacaaa 6241 gctcaggtcg gattgacatt attcatagca agaagacagc agcattgcaa agtgcaacgc 6301 ctgtggaaag ggtgaagcta caggaagctc tctcccagct tgatttccaa tgggaaaaag 6361 ttaacaaaat gtacaaggac cgacaagggc gatttgacag atctgttgag aaatggcggc 6421 gttttcatta tgatataaag atatttaatc agtggctaac agaagctgaa cagtttctca 6481 gaaagacaca aattcctgag aattgggaac atgctaaata caaatggtat cttaaggaac 6541 tccaggatgg cattgggcag cggcaaactg ttgtcagaac attgaatgca actggggaag 6601 aaataattca gcaatcctca aaaacagatg ccagtattct acaggaaaaa ttgggaagcc 6661 tgaatctgcg gtggcaggag gtctgcaaac agctgtcaga cagaaaaaag aggctagaag 6721 aacaaaagaa tatcttgtca gaatttcaaa gagatttaaa tgaatttgtt ttatggttgg 6781 aggaagcaga taacattgct agtatcccac ttgaacctgg aaaagagcag caactaaaag 6841 aaaagcttga gcaagtcaag ttactggtgg aagagttgcc cctgcgccag ggaattctca 6901 aacaattaaa tgaaactgga ggacccgtgc ttgtaagtgc tcccataagc ccagaagagc 6961 aagataaact tgaaaataag ctcaagcaga caaatctcca gtggataaag gtttccagag 7021 ctttacctga gaaacaagga gaaattgaag ctcaaataaa agaccttggg cagcttgaaa 7081 aaaagcttga agaccttgaa gagcagttaa atcatctgct gctgtggtta tctcctatta 7141 ggaatcagtt ggaaatttat aaccaaccaa accaagaagg accatttgac gttaaggaaa 7201 ctgaaatagc agttcaagct aaacaaccgg atgtggaaga gattttgtct aaagggcagc 7261 atttgtacaa ggaaaaacca gccactcagc cagtgaagag gaagttagaa gatctgagct 7321 ctgagtggaa ggcggtaaac cgtttacttc aagagctgag ggcaaagcag cctgacctag 7381 ctcctggact gaccactatt ggagcctctc ctactcagac tgttactctg gtgacacaac 7441 ctgtggttac taaggaaact gccatctcca aactagaaat gccatcttcc ttgatgttgg 7501 aggtacctgc tctggcagat ttcaaccggg cttggacaga acttaccgac tggctttctc 7561 tgcttgatca agttataaaa tcacagaggg tgatggtggg tgaccttgag gatatcaacg 7621 agatgatcat caagcagaag gcaacaatgc aggatttgga acagaggcgt ccccagttgg 7681 aagaactcat taccgctgcc caaaatttga aaaacaagac cagcaatcaa gaggctagaa 7741 caatcattac ggatcgaatt gaaagaattc agaatcagtg ggatgaagta caagaacacc 7801 ttcagaaccg gaggcaacag ttgaatgaaa tgttaaagga ttcaacacaa tggctggaag 7861 ctaaggaaga agctgagcag gtcttaggac aggccagagc caagcttgag tcatggaagg 7921 agggtcccta tacagtagat gcaatccaaa agaaaatcac agaaaccaag cagttggcca 7981 aagacctccg ccagtggcag acaaatgtag atgtggcaaa tgacttggcc ctgaaacttc 8041 tccgggatta ttctgcagat gataccagaa aagtccacat gataacagag aatatcaatg 8101 cctcttggag aagcattcat aaaagggtga gtgagcgaga ggctgctttg gaagaaactc 8161 atagattact gcaacagttc cccctggacc tggaaaagtt tcttgcctgg cttacagaag 8221 ctgaaacaac tgccaatgtc ctacaggatg ctacccgtaa ggaaaggctc ctagaagact 8281 ccaagggagt aaaagagctg atgaaacaat ggcaagacct ccaaggtgaa attgaagctc 8341 acacagatgt ttatcacaac ctggatgaaa acagccaaaa aatcctgaga tccctggaag 8401 gttccgatga tgcagtcctg ttacaaagac gtttggataa catgaacttc aagtggagtg 8461 aacttcggaa aaagtctctc aacattaggt cccatttgga agccagttct gaccagtgga 8521 agcgtctgca cctttctctg caggaacttc tggtgtggct acagctgaaa gatgatgaat 8581 taagccggca ggcacctatt ggaggcgact ttccagcagt tcagaagcag aacgatgtac 8641 atagggcctt caagagggaa ttgaaaacta aagaacctgt aatcatgagt actcttgaga 8701 ctgtacgaat atttctgaca gagcagcctt tggaaggact agagaaactc taccaggagc 8761 ccagagagct gcctcctgag gagagagccc agaatgtcac tcggcttcta cgaaagcagg 8821 ctgaggaggt caatactgag tgggaaaaat tgaacctgca ctccgctgac tggcagagaa 8881 aaatagatga gacccttgaa agactccagg aacttcaaga ggccacggat gagctggacc 8941 tcaagctgcg ccaagctgag gtgatcaagg gatcctggca gcccgtgggc gatctcctca 9001 ttgactctct ccaagatcac ctcgagaaag tcaaggcact tcgaggagaa attgcgcctc 9061 tgaaagagaa cgtgagccac gtcaatgacc ttgctcgcca gcttaccact ttgggcattc 9121 agctctcacc gtataacctc agcactctgg aagacctgaa caccagatgg aagcttctgc 9181 aggtggccgt cgaggaccga gtcaggcagc tgcatgaagc ccacagggac tttggtccag 9241 catctcagca ctttctttcc acgtctgtcc agggtccctg ggagagagcc atctcgccaa 9301 acaaagtgcc ctactatatc aaccacgaga ctcaaacaac ttgctgggac catcccaaaa 9361 tgacagagct ctaccagtct ttagctgacc tgaataatgt cagattctca gcttatagga 9421 ctgccatgaa actccgaaga ctgcagaagg ccctttgctt ggatctcttg agcctgtcag 9481 ctgcatgtga tgccttggac cagcacaacc tcaagcaaaa tgaccagccc atggatatcc 9541 tgcagattat taattgtttg accactattt atgaccgcct ggagcaagag cacaacaatt 9601 tggtcaacgt ccctctctgc gtggatatgt gtctgaactg gctgctgaat gtttatgata 9661 cgggacgaac agggaggatc cgtgtcctgt cttttaaaac tggcatcatt tccctgtgta 9721 aagcacattt ggaagacaag tacagatacc ttttcaagca agtggcaagt tcaacaggat 9781 tttgtgacca gcgcaggctg ggcctccttc tgcatgattc tatccaaatt ccaagacagt 9841 tgggtgaagt tgcatccttt gggggcagta acattgagcc aagtgtccgg agctgcttcc 9901 aatttgctaa taataagcca gagatcgaag cggccctctt cctagactgg atgagactgg 9961 aaccccagtc catggtgtgg ctgcccgtcc tgcacagagt ggctgctgca gaaactgcca 10021 agcatcaggc caaatgtaac atctgcaaag agtgtccaat cattggattc aggtacagga 10081 gtctaaagca ctttaattat gacatctgcc aaagctgctt tttttctggt cgagttgcaa 10141 aaggccataa aatgcactat cccatggtgg aatattgcac tccgactaca tcaggagaag 10201 atgttcgaga ctttgccaag gtactaaaaa acaaatttcg aaccaaaagg tattttgcga 10261 agcatccccg aatgggctac ctgccagtgc agactgtctt agagggggac aacatggaaa 10321 ctcccgttac tctgatcaac ttctggccag tagattctgc gcctgcctcg tcccctcagc 10381 tttcacacga tgatactcat tcacgcattg aacattatgc tagcaggcta gcagaaatgg 10441 aaaacagcaa tggatcttat ctaaatgata gcatctctcc taatgagagc atagatgatg 10501 aacatttgtt aatccagcat tactgccaaa gtttgaacca ggactccccc ctgagccagc 10561 ctcgtagtcc tgcccagatc ttgatttcct tagagagtga ggaaagaggg gagctagaga 10621 gaatcctagc agatcttgag gaagaaaaca ggaatctgca agcagaatat gaccgtctaa 10681 agcagcagca cgaacataaa ggcctgtccc cactgccgtc ccctcctgaa atgatgccca 10741 cctctcccca gagtccccgg gatgctgagc tcattgctga ggccaagcta ctgcgtcaac 10801 acaaaggccg cctggaagcc aggatgcaaa tcctggaaga ccacaataaa cagctggagt 10861 cacagttaca caggctaagg cagctgctgg agcaacccca ggcagaggcc aaagtgaatg 10921 gcacaacggt gtcctctcct tctacctctc tacagaggtc cgacagcagt cagcctatgc 10981 tgctccgagt ggttggcagt caaacttcgg actccatggg tgaggaagat cttctcagtc 11041 ctccccagga cacaagcaca gggttagagg aggtgatgga gcaactcaac aactccttcc 11101 ctagttcaag aggaagaaat acccctggaa agccaatgag agaggacaca atgtaggaag 11161 tcttttccac atggcagatg atttgggcag agcgatggag tccttagtat cagtcatgac 11221 agatgaagaa ggagcagaat aaatgtttta caactcctga ttcccgcatg gtttttataa 11281 tattcataca acaaagagga ttagacagta agagtttaca agaaataaat ctatattttt 11341 gtgaagggta gtggtattat actgtagatt tcagtagttt ctaagtctgt tattgttttg 11401 ttaacaatgg caggttttac acgtctatgc aattgtacaa aaaagttata agaaaactac 11461 atgtaaaatc ttgatagcta aataacttgc catttcttta tatggaacgc attttgggtt 11521 gtttaaaaat ttataacagt tataaagaaa gattgtaaac taaagtgtgc tttataaaaa 11581 aaagttgttt ataaaaaccc ctaaaaacaa aacaaacaca cacacacaca catacacaca 11641 cacacacaaa actttgaggc agcgcattgt tttgcatcct tttggcgtga tatccatatg 11701 aaattcatgg ctttttcttt ttttgcatat taaagataag acttcctcta ccaccacacc 11761 aaatgactac tacacactgc tcatttgaga actgtcagct gagtggggca ggcttgagtt 11821 ttcatttcat atatctatat gtctataagt atataaatac tatagttata tagataaaga 11881 gatacgaatt tctatagact gactttttcc attttttaaa tgttcatgtc acatcctaat 11941 agaaagaaat tacttctagt cagtcatcca ggcttacctg cttggtctag aatggatttt 12001 tcccggagcc ggaagccagg aggaaactac accacactaa aacattgtct acagctccag 12061 atgtttctca ttttaaacaa ctttccactg acaacgaaag taaagtaaag tattggattt 12121 ttttaaaggg aacatgtgaa tgaatacaca ggacttatta tatcagagtg agtaatcggt 12181 tggttggttg attgattgat tgattgatac attcagcttc ctgctgctag caatgccacg 12241 atttagattt aatgatgctt cagtggaaat caatcagaag gtattctgac cttgtgaaca 12301 tcagaaggta ttttttaact cccaagcagt agcaggacga tgatagggct ggagggctat 12361 ggattcccag cccatccctg tgaaggagta ggccactctt taagtgaagg attggatgat 12421 tgttcataat acataaagtt ctctgt // LOCUS HSDNALIG3 3422 bp RNA PRI 16-FEB-1996 DEFINITION H.sapiens mRNA for DNA ligase III. ACCESSION X84740 NID g860962 KEYWORDS DNA ligase III; LIG3 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3422) AUTHORS Schar,P. TITLE Direct Submission JOURNAL Submitted (14-FEB-1995) P. Schar, Imperial Cancer Research Fund, Clare Hall Laboratories, South Mimms, Herts, EN6 3LD, UK REFERENCE 2 (bases 1 to 3422) AUTHORS Wei,Y.F., Robins,P., Carter,K., Caldecott,K., Pappin,D.J.C., Yu,G.L., Wang,R.P., Shell,B.K., Nash,R., Schar,P., Barnes,D.E., Haseltine,W.A. and Lindahl,T. TITLE Molecular cloning and expression of human cDNAs encoding a novel DNA ligase IV and DNA ligase III, an enzyme active in DNA repair and recombination JOURNAL Mol. Cell. Biol. 15 (6), 3206-3216 (1995) MEDLINE 95280920 COMMENT CDNA sequence deposited by: Ying-Fei Wei, Human Genome Sciences Inc., 920 Medical Center Drive, Rockville, MD 20850-3338 cDNA sequence deposited b: Ying-Fei Wei, Human Genonme Sciences Inc., 9620 Medical Center Drive. Rockville, MD 20850-3338. FEATURES Location/Qualifiers source 1..3422 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="prostate" /clone_lib="prostrate library HGS" /clone="HGS473238" /chromosome="17" /map="q11.2-12" mRNA 1..3422 /gene="LIG3" gene 1..3422 /gene="LIG3" CDS 334..3102 /gene="LIG3" /codon_start=1 /product="DNA ligase III" /db_xref="PID:g860963" /translation="MAEQRFCVDYAKRGTAGCKKCKEKIVKGVCRIGKVVPNPFSESG GDMKEWYHIKCMFEKLERARATTKKIEDLTELEGWEELEDNEKEQITQHIADLSSKAA GTPKKKAVVQAKLTTTGQVTSPVKGASFVTSTNPRKFSGFSAKPNNSGEAPSSPTPKR SLSSSKCDPRHKDCLLREFRKLCAMVADNPSYNTKTQIIQDFLRKGSAGDGFHGDVYL TVKLLLPGVIKTVYNLNDKQIVKLFSRIFNCNPDDMARDLEQGDVSETIRVFFEQSKS FPPAAKSLLTIQEVDEFLLRLSKLTKEDEQQQALQDIASRCTANDLKCIIRLIKHDLK MNSGAKHVLDALDPNAYEAFKASRNLQDVVERVLHNAQEVEKEPGQRRALSVQASLMT PVQPMLAEACKSVEYAMKKCPNGMFSEIKYDGERVQVHKNGDHFSYFSRSLKPVLPHK VAHFKDYIPQAFPGGHSMILDSEVLLIDNKTGKPLPFGTLGVHKKAAFQDANVCLFVF DCIYFNDVSLMDRPLCERRKFLHDNMVEIPNRIMFSEMKRVTKALDLADMITRVIQEG LEGLVLKDVKGTYEPGKRHWLKVKKDYLNEGAMADTADLVVLGAFYGQGSKGGMMSIF LMGCYDPGSQKWCTVTKCAGGHDDATLARLQNELDMVKISKDPSKIPSWLKVNKIYYP DFIVPDPKKAAVWEITGAEFSKSEAHTADGISIRFPRCTRIRDDKDWKSATNLPQLKE LYQLSKEKADFTVVAGDEGSSTTGGSSEENKGPSGSAVSRKAPSKPSASTKKAEGKLS NSNSKDGNMQTAKPSAMKVGEKLATKSSPVKVGEKRKAADETLCQTKVLLDIFTGVRL YLPPSTPDFSRLRRYFVAFDGDLVQEFDMTSATHVLGSRDKNPAAQQVSPEWIWACIR KRRLVAPC" polyA_signal 3382..3387 /gene="LIG3" BASE COUNT 906 a 851 c 912 g 753 t ORIGIN 1 ccacgcgtcc ggcagcctgt atgagcaagt gccgaggcct acggtgagcg ccggagccgg 61 agaggcagct atatgtcttt ggctttcaag atcttctttc cacaaaccct ccgtgcactc 121 agccgaaaag aactgtgcct attccgaaaa catcactggc gtgatgtaag acaattcagc 181 cagtggtcag aaacagatct gcttcatgga catcccctct tcctgagaag aaagcctgtt 241 ctatcattcc agggaagcca tctaagatca cgtgccacct accttgtttt cttgccaggg 301 ttgcatgtgg gactctgcag tggcccctgt gagatggctg agcaacggtt ctgtgtggac 361 tatgccaagc gtggcacagc tggctgcaaa aaatgcaagg aaaagattgt gaagggcgta 421 tgccgaattg gcaaagtggt gcccaatccc ttctcagagt ctgggggtga tatgaaagag 481 tggtaccaca ttaaatgcat gtttgagaaa ctagagcggg cccgggccac cacaaaaaaa 541 atcgaggacc tcacagagct ggaaggctgg gaagagctgg aagataatga gaaggaacag 601 ataacccagc acattgcaga tctgtcttct aaggcagcag gtacaccaaa gaagaaagct 661 gttgtccagg ctaagttgac aaccactggc caggtgactt ctccagtgaa aggcgcctca 721 tttgtcacca gtaccaatcc ccggaaattt tctggctttt cagccaagcc caacaactct 781 ggggaagccc cctcgagccc cacccctaag agaagtctgt cttcaagcaa atgtgacccc 841 aggcataagg actgtctgct acgggagttt cgaaagttat gcgccatggt ggccgataat 901 cctagctaca acacgaagac ccagatcatc caggacttcc ttcggaaagg ctcagcagga 961 gatggtttcc acggtgatgt gtacctaaca gtgaagctgc tgctgccagg agtcattaag 1021 actgtttaca acttgaacga taagcagatt gtgaagcttt tcagtcgcat ttttaactgc 1081 aacccagatg atatggcacg ggacctagag cagggtgacg tgtcagagac aatcagagtc 1141 ttctttgagc agagcaagtc tttcccccca gctgccaaga gcctccttac catccaggaa 1201 gtggatgagt tccttctgcg gctgtccaag ctcaccaagg aggatgagca gcaacaggcc 1261 ctacaggaca ttgcctccag gtgtacagcc aatgacctta aatgcatcat caggttgatc 1321 aaacatgatc tgaagatgaa ctcaggtgca aaacatgtgt tagacgccct tgaccccaat 1381 gcctatgaag ccttcaaagc ctcgcgcaac ctgcaggatg tggtggagcg ggtccttcac 1441 aacgcgcagg aggtggagaa ggagccgggc cagagacgag ctctgagcgt ccaggcctcg 1501 ctgatgacac ctgtgcagcc catgttggcg gaggcctgca agtccgttga gtatgcaatg 1561 aagaaatgtc ccaatggcat gttctctgag atcaagtacg atggagagcg agtccaggtg 1621 cataagaatg gagaccactt cagctacttc agccgcagtc tcaagcccgt ccttcctcac 1681 aaggtggccc actttaagga ctacattccc caggcttttc ctgggggcca cagcatgatc 1741 ttggattctg aagtgcttct gattgacaac aagacaggca aaccactgcc ctttgggact 1801 ctgggagtac acaagaaagc agccttccag gatgctaatg tctgcctgtt tgtttttgat 1861 tgtatctact ttaatgatgt cagcttgatg gacagacctc tgtgtgagcg gcggaagttt 1921 cttcatgaca acatggttga aattccaaac cggatcatgt tctcagaaat gaagcgagtc 1981 acaaaagctt tggacttggc tgacatgata acccgggtga tccaggaggg attggagggg 2041 ctggtgctga aggatgtgaa gggtacatat gagcctggga agcggcactg gctgaaagtg 2101 aagaaagact atttgaacga gggggccatg gccgacacag ctgacctggt ggtccttgga 2161 gccttctatg ggcaagggag caaaggcggc atgatgtcaa tcttcctcat gggctgctac 2221 gaccctggca gccagaagtg gtgcacagtc accaagtgtg caggaggcca tgatgatgcc 2281 acgcttgccc gcctgcagaa tgaactagac atggtgaaga tcagcaagga ccccagcaaa 2341 atacccagct ggttgaaggt caacaagatc tactatcctg acttcatcgt cccagaccca 2401 aagaaagctg ccgtgtggga gatcacaggg gctgaattct ccaaatcgga ggctcataca 2461 gctgacggga tctccatccg attccctcgc tgcacccgaa tccgagatga taaggactgg 2521 aaatctgcca ctaaccttcc ccaactcaag gaactgtacc agttgtccaa ggagaaggca 2581 gacttcactg tagtggctgg agatgagggg agctccacta cagggggtag cagtgaagag 2641 aataagggtc cctcagggtc tgctgtgtcc cgcaaggccc ccagcaagcc ctcagccagt 2701 accaagaaag cagaagggaa gctgagtaac tccaacagca aagatggcaa catgcagact 2761 gcaaagcctt ccgctatgaa ggtgggggag aagctggcca caaagtcttc tccagtgaaa 2821 gtaggggaga agcggaaagc tgctgatgag acgctgtgcc aaacaaaggt attgctggac 2881 atcttcactg gggtgcggct ttacttgcca ccctccacac cagacttcag ccgtctcaga 2941 cgctactttg tggcattcga cggggacctg gtacaggaat ttgatatgac ttcagccacg 3001 cacgtgctgg gtagcaggga caagaaccct gcggcccagc aggtctcccc agagtggatt 3061 tgggcatgta tccggaaacg gagactggta gctccctgct aggtttgctg tcttccctct 3121 ccctcaggcc atactctcct ttaccatact attggactgg actcaggctg gaggcagata 3181 gacacagtat agggggaatg ggcttgcttc tcccaaaccc accagttctc cactgtctct 3241 tctggaccag gaattagttg ctgtgggtgc cacagctgaa gtcagtttgt cttgctggtt 3301 taaatagatc tttcagagct gggtgctggg tttgccatct ttttgttttc tttgaaaagc 3361 agcttagtta ccctttttat aaataaaata tcttgcagtt aaaaaaaaaa aaaaaaaaaa 3421 aa // LOCUS HSDNMTASE 5434 bp RNA PRI 11-DEC-1996 DEFINITION H.sapiens mRNA for DNA (cytosin-5)-methyltransferase. ACCESSION X63692 NID g1632818 KEYWORDS DNA methyltransferase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5434) AUTHORS Yen,R. TITLE Direct Submission JOURNAL Submitted (31-JAN-1992) R. Yen, The Oncology Center, The Johns Hopkins Medical Inst, 424 N Bond Street, Baltimore MD 21213, USA REMARK Revised by [3] REFERENCE 2 (bases 1 to 5194) AUTHORS Yen,R.W., Vertino,P.M., Nelkin,B.D., Yu,J.J., el-Deiry,W., Cumaraswamy,A., Lennon,G.G., Trask,B.J., Celano,P. and Baylin,S.B. TITLE Isolation and characterization of the cDNA encoding human DNA methyltransferase JOURNAL Nucleic Acids Res. 20 (9), 2287-2291 (1992) MEDLINE 92279022 REFERENCE 3 (bases 1 to 5434) AUTHORS Yen,R. TITLE Direct Submission JOURNAL Submitted (23-OCT-1996) R. Yen, The Oncology Center, The Johns Hopkins Medical Inst, 424 N Bond Street, Baltimore MD 21213, USA REFERENCE 4 (bases 1 to 5434) AUTHORS Bestor,T., Laudano,A., Mattaliano,R. and Ingram,V. TITLE Cloning and sequencing of a cDNA encoding DNA methyltransferase of mouse cells. The carboxyl-terminal domain of the mammalian enzymes is related to bacterial restriction methyltransferases JOURNAL J. Mol. Biol. 203 (4), 971-983 (1988) MEDLINE 89094873 REFERENCE 5 (bases 1 to 5434) AUTHORS Yoder,J.A., Yen,R.W.C., Vertino,P.M., Bestor,T.H. and Baylin,S.B. TITLE New 5' regions of the murine and human genes for DNA (cytosine-5)-methyltransferase JOURNAL J. Biol. Chem. 271 (49), 31092-31097 (1996) MEDLINE 97094871 FEATURES Location/Qualifiers source 1..5434 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="medullary thyroid carcinoma" /cell_line="TT cells" CDS 238..5088 /EC_number="2.1.1.37" /codon_start=1 /product="DNA (cytosine-5-)-methyltransferase" /db_xref="PID:g1632819" /db_xref="SWISS-PROT:P26358" /translation="MPARTAPARVPTLAVPAISLPDDVRRRLKDLERDSLTEKECVKE KLNLLHEFLQTEIKNQLCDLETKLRKEELSEEGYLAKVKSLLNKDLSLENGAHAYNRE VNGRLENGNQARSEARRVGMADANSPPKPLSKPRTPRRSKSDGEAKPEPSPSPRITRK STRQTTITSHFAKGPAKRKPQEESERAKSDESIKEEDKDQDEKRRRVTSRERVARPLP AEEPERAKSGTRTEKEEERDEKEEKRLRSQTKEPTPKQKLKEEPDREARAGVQADEDE DGDEKDEKKHRSQPKDLAAKRRPEEKEPEKVNPQISDEKDEDEKEEKRRKTTPKEPTE KKMARAKTVMNSKTHPPKCIQCGQYLDDPDLKYGQHPPDAVDEPQMLTNEKLSIFDAN ESGFESYEALPQHKLTCFSVYCKHGHLCPIDTGLIEKNIELFFSGSAKPIYDDDPSLE GGVNGKNLGPINEWWITGFDGGEKALIGFSTSFAEYILMDPSPEYAPIFGLMQEKIYI SKIVVEFLQSNSDSTYEDLINKIETTVPPSGLNLNRFTEDSLLRHAQFVVEQVESYDE AGDSDEQPIFLTPCMRDLIKLAGVTLGQRRAQARRQTIRHSTREKDRGPTKATTTKLV YQIFDTFFAEQIEKDDREDKENAFKRRRCGVCEVCQQPECGKCKACKDMVKFGGSGRS KQACQERRCPNMAMKEADDDEEVDDNIPEMPSPKKMHQGKKKKQNKNRISWVGEAVKT DGKKSYYKKVCIDAETLEVGDCVSVIPDDSSKPLYLARVTALWEDSSNGQMFHAHWFC AGTDTVLGATSDPLELFLVDECEDMQLSYIHSKVKVIYKAPSENWAMEGGMDPESLLE GDDGKTYFYQLWYDQDYARFESPPKTQPTEDNKFKFCVSCARLAEMRQKEIPRVLEQL EDLDSRVLYYSATKNGILYRVGDGVYLPPEAFTFNIKLSSPVKRPRKEPVDEDLYPEH YRKYSDYIKGSNLDAPEPYRIGRIKEIFCPKKSNGRPNETDIKIRVNKFYRPENTHKS TPASYHADINLLYWSDEEAVVDFKAVQGRCTVEYGEDLPECVQVYSMGGPNRFYFLEA YNAKSKSFEDPPNHARSPGNKGKGKGKGKGKPKSQACEPSEPEIEIKLPKLRTLDVFS GCGGLSEGFHQAGISDTLWAIEMWDPAAQAFRLNNPGSTVFTEDCNILLKLVMAGETT NSRGQRLPQKGDVEMLCGGPPCQGFSGMNRFNSRTYSKFKNSLVVSFLSYCDYYRPRF FLLENVRNFVSFKRSMVLKLTLRCLVRMGYQCTFGVLQAGQYGVAQTRRRAIILAAAP GEKLPLFPEPLHVFAPRACQLSVVVDDKKFVSNITRLSSGPFRTITVRDTMSDLPEVR NGASALEISYNGEPQSWFQRQLRGAQYQPILRDHICKDMSALVAARMRHIPLAPGSDW RDLPNIEVRLSDGTMARKLRYTHHDRKNGRSSSGALRGVCSCVEAGKACDPAARQFNT LIPWCLPHTGNRHNHWAGLYGRLEWDGFFSTTVTNPEPMGKQGRVLHPEQHRVVSVRE CARSQGFPDTYRLFGNILDKHRQVGNAVPPPLAKAIGLEIKLCMLAKARESASAKIKE EEAAKD" polyA_site 5408 BASE COUNT 1418 a 1488 c 1511 g 1017 t ORIGIN 1 cgtccgcgtg gggggggtgt gtgcccgcct tgcgcatgcg tgttccctgg gcatggccgg 61 ctccgttcca tccttctgca cagggtatcg cctctctccg tttggtacat cccctcctcc 121 cccacgcccg gactggggtg gtagacgcgc ctccgctcat cgcccctccc catcggtttc 181 cgcgcgaaaa gccggggcgc ctgcgctgcc gccgccgcgt ctgctgaagc ctccgagatg 241 ccggcgcgta ccgccccagc ccgggtgccc acactggccg tcccggccat ctcgctgccc 301 gacgatgtcc gcaggcggct caaagatttg gaaagagaca gcttaacaga aaaggaatgt 361 gtgaaggaga aattgaatct cttgcacgaa tttctgcaaa cagaaataaa gaatcagtta 421 tgtgacttgg aaaccaaatt acgtaaagaa gaattatccg aggagggcta cctggctaaa 481 gtcaaatccc ttttaaataa agatttgtcc ttggagaacg gtgctcatgc ttacaaccgg 541 gaagtgaatg gacgtctaga aaacgggaac caagcaagaa gtgaagcccg tagagtggga 601 atggcagatg ccaacagccc ccccaaaccc ctttccaaac ctcgcacgcc caggaggagc 661 aagtccgatg gagaggctaa gcctgaacct tcacctagcc ccaggattac aaggaaaagc 721 accaggcaaa ccaccatcac atctcatttt gcaaagggcc ctgccaaacg gaaacctcag 781 gaagagtctg aaagagccaa atcggatgag tccatcaagg aagaagacaa agaccaggat 841 gagaagagac gtagagttac atccagagaa cgagttgcta gaccgcttcc tgcagaagaa 901 cctgaaagag caaaatcagg aacgcgcact gaaaaggaag aagaaagaga tgaaaaagaa 961 gaaaagagac tccgaagtca aaccaaagaa ccaacaccca aacagaaact gaaggaggag 1021 ccggacagag aagccagggc aggcgtgcag gctgacgagg acgaagatgg agacgagaaa 1081 gatgagaaga agcacagaag tcaacccaaa gatctagctg ccaaacggag gcccgaagaa 1141 aaagaacctg aaaaagtaaa tccacagatt tctgatgaaa aagacgagga tgaaaaggag 1201 gagaagagac gcaaaacgac ccccaaagaa ccaacggaga aaaaaatggc tcgcgccaaa 1261 acagtcatga actccaagac ccaccctccc aagtgcattc agtgcgggca gtacctggac 1321 gaccctgacc tcaaatatgg gcagcaccca ccagacgcgg tggatgagcc acagatgctg 1381 acaaatgaga agctgtccat ctttgatgcc aacgagtctg gctttgagag ttatgaggcg 1441 cttccccagc acaaactgac ctgcttcagt gtgtactgta agcacggtca cctgtgtccc 1501 atcgacaccg gcctcatcga gaagaatatc gaactcttct tttctggttc agcaaaacca 1561 atctatgatg atgacccgtc tcttgaaggt ggtgttaatg gcaaaaatct tggccccata 1621 aatgaatggt ggatcactgg ctttgatgga ggtgaaaagg ccctcatcgg cttcagcacc 1681 tcatttgccg aatacattct gatggatccc agtcccgagt atgcgcccat atttgggctg 1741 atgcaggaga agatctacat cagcaagatt gtggtggagt tcctgcagag caattccgac 1801 tcgacctatg aggacctgat caacaagatc gagaccacgg ttcctccttc tggcctcaac 1861 ttgaaccgct tcacagagga ctccctcctg cgacacgcgc agtttgtggt ggagcaggtg 1921 gagagttatg acgaggccgg ggacagtgat gagcagccca tcttcctgac gccctgcatg 1981 cgggacctga tcaagctggc tggggtcacg ctgggacaga ggcgagccca ggcgaggcgg 2041 cagaccatca ggcattctac cagggagaag gacaggggac ccacgaaagc caccaccacc 2101 aagctggtct accagatctt cgatactttc ttcgcagagc aaattgaaaa ggatgacaga 2161 gaagacaagg agaacgcctt taagcgccgg cgatgtggcg tctgtgaggt gtgtcagcag 2221 cctgagtgtg ggaaatgtaa agcctgcaag gacatggtta aatttggtgg cagtggacgg 2281 agcaagcagg cttgccaaga gcggaggtgt cccaatatgg ccatgaagga ggcagatgac 2341 gatgaggaag tcgatgataa catcccagag atgccgtcac ccaaaaaaat gcaccagggg 2401 aagaagaaga aacagaacaa gaatcgcatc tcttgggtcg gagaagccgt caagactgat 2461 gggaagaaga gttactataa gaaggtgtgc attgatgcgg aaaccctgga agtgggggac 2521 tgtgtctctg ttattccaga tgattcctca aaaccgctgt atctagcaag ggtcacggcg 2581 ctgtgggagg acagcagcaa cgggcagatg tttcacgccc actggttctg cgctgggaca 2641 gacacagtcc tcggggccac gtcggaccct ctggagctgt tcttggtgga tgaatgtgag 2701 gacatgcagc tttcatatat ccacagcaaa gtgaaagtca tctacaaagc cccctccgaa 2761 aactgggcca tggagggagg catggatccc gagtccctgc tggaggggga cgacgggaag 2821 acctacttct accagctgtg gtatgatcaa gactacgcga gattcgagtc ccctccaaaa 2881 acccagccaa cagaggacaa caagttcaaa ttctgtgtga gctgtgcccg tctggctgag 2941 atgaggcaaa aagaaatccc cagggtcctg gagcagctcg aggacctgga tagccgggtc 3001 ctctactact cagccaccaa gaacggcatc ctgtaccgag ttggtgatgg tgtgtacctg 3061 ccccctgagg ccttcacgtt caacatcaag ctgtccagtc ccgtgaaacg cccacggaag 3121 gagcccgtgg atgaggacct gtacccagag cactaccgga aatactccga ctacatcaaa 3181 ggcagcaacc tggatgcccc tgagccctac cgaattggcc ggatcaaaga gatcttctgt 3241 cccaagaaga gcaacggcag gcccaatgag actgacatca aaatccgggt caacaagttc 3301 tacaggcctg agaacaccca caagtccact ccagcgagct accacgcaga catcaacctg 3361 ctctactgga gcgacgagga ggccgtggtg gacttcaagg ctgtgcaggg ccgctgcacc 3421 gtggagtatg gggaggacct gcccgagtgc gtccaggtgt actccatggg cggccccaac 3481 cgcttctact tcctcgaggc ctataatgca aagagcaaaa gctttgaaga tcctcccaac 3541 catgcccgta gccctggaaa caaagggaag ggcaagggaa aagggaaggg caagcccaag 3601 tcccaagcct gtgagccgag cgagccagag atagagatca agctgcccaa gctgcggacc 3661 ctggatgtgt tttctggctg cggggggttg tcggagggat tccaccaagc aggcatctct 3721 gacacgctgt gggccatcga gatgtgggac cctgcggccc aggcgttccg gctgaacaac 3781 cccggctcca cagtgttcac agaggactgc aacatcctgc tgaagctggt catggctggg 3841 gagaccacca actcccgcgg ccagcggctg ccccagaagg gagacgtgga gatgctgtgc 3901 ggcgggccgc cctgccaggg cttcagcggc atgaaccgct tcaattcgcg cacctactcc 3961 aagttcaaaa actctctggt ggtttccttc ctcagctact gcgactacta ccggccccgg 4021 ttcttcctcc tggagaatgt caggaacttt gtctccttca agcgctccat ggtcctgaag 4081 ctcaccctcc gctgcctggt ccgcatgggc tatcagtgca ccttcggcgt gctgcaggcc 4141 ggtcagtacg gcgtggccca gactaggagg cgggccatca tcctggccgc ggcccctgga 4201 gagaagctcc ctctgttccc ggagccactg cacgtgtttg ctccccgggc ctgccagctg 4261 agcgtggtgg tggatgacaa gaagtttgtg agcaacataa ccaggttgag ctcgggtcct 4321 ttccggacca tcacggtgcg agacacgatg tccgacctgc cggaggtgcg gaatggagcc 4381 tcggcactgg agatctccta caacggggag cctcagtcct ggttccagag gcagctccgg 4441 ggcgcacagt accagcccat cctcagggac cacatctgta aggacatgag tgcattggtg 4501 gctgcccgca tgcggcacat ccccttggcc ccagggtcag actggcgcga tctgcccaac 4561 atcgaggtgc ggctctcaga cggcaccatg gccaggaagc tgcggtatac ccaccatgac 4621 aggaagaacg gccgcagcag ctctggggcc ctccgtgggg tctgctcctg cgtggaagcc 4681 ggcaaagcct gcgaccccgc agccaggcag ttcaacaccc tcatcccctg gtgcctgccc 4741 cacaccggga accggcacaa ccactgggct ggcctctatg gaaggctcga gtgggacggc 4801 ttcttcagca caaccgtcac caaccccgag cccatgggca agcagggccg cgtgctccac 4861 ccagagcagc accgtgtggt gagcgtgcgg gagtgtgccc gctcccaggg cttccctgac 4921 acctaccggc tcttcggcaa catcctggac aagcaccggc aggtgggcaa tgccgtgcca 4981 ccgcccctgg ccaaagccat tggcttggag atcaagcttt gtatgttggc caaagcccga 5041 gagagtgcct cagctaaaat aaaggaggag gaagctgcta aggactagtt ctgccctccc 5101 gtcacccctg tttctggcac caggaatccc caacatgcac tgatgttgtg tttttaacat 5161 gtcaatctgt ccgttcacat gtgtggtaca tggtgtttgt ggccttggct gacatgaagc 5221 tgttgtgtga ggttcgctta tcaactaatg atttagtgat caaattgtgc agtactttgt 5281 gcattctgga ttttaaaagt tttttattat gcattatatc aaatctacca ctgtatgagt 5341 ggaaattaag actttatgta gtttttatat gttgtaatat ttcttcaaat aaatctctcc 5401 tataaaccaa aaaaaaaaaa aaaaaaaaaa aaaa // LOCUS HSDOCKP 2932 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for docking protein (signal recognition particle receptor). ACCESSION X06272 NID g30865 KEYWORDS docking protein; signal recognition particle receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2932) AUTHORS Hortsch,M. TITLE Direct Submission JOURNAL Submitted (19-NOV-1987) Hortsch M., EMBL, Meyerhofstr.1, Heidelberg REFERENCE 2 (bases 1 to 2932) AUTHORS Hortsch,M., Labeit,S. and Meyer,D.I. TITLE Complete cDNA sequence coding for human docking protein JOURNAL Nucleic Acids Res. 16 (1), 361-362 (1988) MEDLINE 88124220 COMMENT Data kindly reviewed (12.2.88) by Hortsch M. FEATURES Location/Qualifiers source 1..2932 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa-cells" /clone_lib="cDNA in lambda NM1149 phage vector" misc_feature 1..33 /note="5'-UT region" CDS 34..1950 /note="docking protein" /codon_start=1 /db_xref="PID:g30866" /db_xref="SWISS-PROT:P08240" /translation="MLDFFTIFSKGGLVLWCFQGVSDSCTGPVNALIRSVLLQERGGN NSFTHEALTLKYKLDNQFELVFVVGFQKILTLTYVDRLIDDVHRLFRDKYRTEIQQQS ALSLLNGTFDFQNDFLRLLREAEESSKIRAPTTMKKFEDSEKAKKPVRSMIETRGEKP KEKAKNSKKKGAKKEGSDGPLATSKPVPAEKSGLPVGPENEVELSKEELIRRKREEFI QKHGRGMEKSNKSTKSDAPKEKGKKAPRVWELGGCANKEVLDYSTPTTNGTPEAALSE DINLIRGTGSGGQLQDLDCSSSDDEGAAQTLTKPSATKGTLGGMFGMLKGLVGSKSLS REDMESVLDKMRDHLIAKNVAADIAVQLCESVANKLEGKVMGTFSTVTSTVKQALQES LVQILQPQRRVDMLRDIMDAQRRQRPYVVTFCGVNGVGKSTNLAKISFWLLENGFSVL IAACDTFRAGAVEQLRTHTRRLSALHPPEKHGGRTMVQLFEKGYGKDAAGIAMEAIAF ARNQGFDVVLVDTAGRMQDNAPLMTALAKLITVNTPDLVLFVGEALVGNEAVDQLVKF NRALADHSMAQTPRLIDGIVLTKFDTIDDKVGAAISMTYITSKPIVFVGTGQTYCDLR SLNAKAVVAALMKA" misc_feature 1948..2932 /note="3'-UT region" BASE COUNT 733 a 750 c 744 g 705 t ORIGIN 1 aggcccggtt cgccgccgct tcctgctgcc gccatgctcg acttcttcac cattttctcc 61 aagggcgggc ttgtgctctg gtgcttccag ggcgttagcg actcatgcac cggacccgtt 121 aacgcgttga ttcgttccgt gctgctgcag gaacggggag gtaacaactc cttcacccat 181 gaggcactga cactcaagta taaactggac aaccagtttg agctggtgtt tgtggttggt 241 tttcagaaga tcctgacact gacatatgta gacagattga tagatgacgt gcatcggctg 301 tttcgggaca agtaccgcac agagatccaa cagcaaagtg ctttaagttt attaaatggc 361 acttttgatt tccaaaatga cttcctgcgg ctccttcgtg aagcagagga gagcagtaag 421 atccgtgctc ccactaccat gaagaaattt gaagattctg aaaaggccaa gaaacctgtg 481 aggtccatga ttgagacacg gggggaaaag cccaaggaaa aagcaaagaa tagcaaaaaa 541 aagggggcca agaaggaagg ttctgatggt cctttggcta ccagcaaacc agtccctgca 601 gaaaagtcag gtcttccagt gggtcctgag aacgaggtag aactttccaa agaggagctg 661 atccgcagga agcgcgagga gttcattcag aagcatggga ggggtatgga gaagtccaac 721 aagtccacga agtcagatgc tccaaaggag aagggcaaaa aagcaccccg ggtgtgggaa 781 ctgggtggct gtgctaacaa agaagtgttg gattacagta ctcccaccac caatggaacc 841 cctgaggctg ccttgtctga ggacatcaac ctgattcgag ggactgggtc tggggggcag 901 cttcaggatc tggactgcag cagctctgat gacgaagggg ctgctcaaac tctcaccaaa 961 cctagtgcga ccaagggaac actgggtggc atgtttggta tgctgaaggg ccttgtgggt 1021 tcaaagagct tgagtcgtga agacatggaa tctgtgctgg acaagatgcg tgatcatctc 1081 attgctaaga acgtggctgc agacattgcc gtccagctct gtgaatctgt tgccaacaag 1141 ttggaaggga aggtgatggg gacgttcagc acggtgactt ccacagtaaa gcaagcccta 1201 caggagtccc tggtgcagat tctgcagcca cagcgtcgtg tagacatgct ccgggacatc 1261 atggatgccc agcgtcgcca gcgcccttat gtcgtcacct tctgcggcgt taatggagtg 1321 gggaaatcta ctaatcttgc caagatttcc ttctggttgt tagagaatgg cttcagtgtc 1381 ctcattgctg cctgtgatac atttcgtgct ggggccgtgg agcagctgcg tacacacacc 1441 cggcgtttga gtgccctaca ccctccagag aagcatggtg gccgcaccat ggtgcagttg 1501 tttgaaaagg gctatggcaa ggatgctgct ggcattgcca tggaagccat tgcttttgca 1561 cgtaaccaag gctttgacgt ggtgctggtg gacacggcag gccgcatgca agacaatgcc 1621 cctctgatga ctgccctggc caaactcatt actgtcaata cacctgattt ggtgctgttt 1681 gtaggagaag ccttagtagg caatgaagcc gtggaccagc tggtcaagtt caacagagcc 1741 ttggctgacc attctatggc tcagacacct cggctcattg atggcattgt tcttaccaaa 1801 tttgatacca ttgatgacaa ggtgggagct gctatttcta tgacgtacat cacaagcaaa 1861 cccatcgtct ttgtgggcac cggccagacc tactgtgacc tacgcagcct caatgccaag 1921 gctgtggtgg ctgccctcat gaaggcttaa cgtggctctt gcccaatacc aaatcgccgc 1981 tttccccaca agcccttctt cctgtatcaa gaatgtgctt tagagtatgt gagcaacctg 2041 tcttaagtgt agtacaaagg cagagtgagg gggcttgtgg ctccttccaa ccccactccc 2101 cgttcagcac agccgccatt tgcaaggaag gcctaatcat gttacaatca ctgcccgact 2161 gaccctctcc cagcggcctc ccccttccta ctcaggcacc cccttcactc tgcctacaga 2221 ctcagtctta ttacagcttt gaccaatggt tggaacccaa caccagagct ttgctaataa 2281 tgagtgtggt caagagccgt ctgagcctaa tgagtcccag ctgcattagg ttaagagact 2341 cttccagagc cagcgccagg tcttgaatgg cacctctccc taggatacac agcctgcagg 2401 tccccaggac ctgatgacac ccgcctcact gtggcagtgt attgcctgtt aattgctgct 2461 aattctaatt ctgatgatga ctcctactcc attgtttacc ccaaagcatc agctaggctg 2521 gagtgatttg ttacaaatga gcaaaagatg agtccttgct tccctcagaa ataaaagtag 2581 cccagctgca gcgttgcatt ggcttcttgg cctcccaact cttccactcc cagaatcaga 2641 agtaagctct gcatgttccc ttcctggagg aaaccaattg tcagaaggtg tatgatgacc 2701 ccctcccctc ccatccttca cctcctaagc agtcctggct tttcctcatc actcccctct 2761 acagtgcctg gtagacaagt gctacattga agaacacaaa cctcttgtta agacttgtcc 2821 tgtagcttga tattacaggt gtgctattag tgcaataagg tgaaggctgt ctgcccagag 2881 aaataagtaa tttatataag aaaataaatt tcataaataa attggaaatt cc // LOCUS HSDOXBR 2694 bp RNA PRI 29-APR-1994 DEFINITION H.sapiens mRNA for delta 4-3-oxosteroid 5 beta-reductase. ACCESSION Z28339 NID g431856 KEYWORDS delta 4-3-oxosteroid 5 beta-reductase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2694) AUTHORS Kondo,K.H., Kai,M.H., Setoguchi,Y., Eggertsen,G., Sjoblom,P., Setoguchi,T., Okuda,K.I. and Bjorkhem,I. TITLE Cloning and expression of cDNA of human delta 4-3-oxosteroid 5 beta-reductase and substrate specificity of the expressed enzyme JOURNAL Eur. J. Biochem. 219 (1-2), 357-363 (1994) MEDLINE 94139710 REFERENCE 2 (bases 1 to 2694) AUTHORS Kondo,K. TITLE Direct Submission JOURNAL Submitted (18-NOV-1993) Kazu-Hiro kondo, Surgery 1, Miyazaki Medical College, 5200 Kiwara, Kiyotake, Miyazaki, Miyazaki, 889-16, Japan FEATURES Location/Qualifiers source 1..2694 /organism="Homo sapiens" /strain="mongolian" /isolate="patient 92100" /db_xref="taxon:9606" /clone="pH5bR" /tissue_type="liver" /clone_lib="pBluescript" CDS 70..1050 /EC_number="1.3.1.23" /codon_start=1 /product="delta 4-3-oxosteroid 5 beta-reductase" /db_xref="PID:g431857" /translation="MDLSAASHRIPLSDGNSIPIIGLGTYSEPKSTPKGACATSVKVA IDTGYRHIDGAYIYQNEHEVGEAIREKIAEGKVRREDIFYCGKLWATNHVPEMVRPTL ERTLRVLQLDYVDLYIIEVPMAFKPGDEIYPRDENGKWLYHKSNLCATWEAMEACKDA GLVKSLGVSNFNRRQLELILNKPGLKHKPVSNQVECHPYFTQPKLLKFCQQHDIVITA YSPLGTSRNPIWVNVSSPPLLKDALLNSLGKRYNKTAAQIVLRFNIQRGVVVIPKSFN LERIKENFQIFDFSLTEEEMKDIEALNKNVRFVELLMWRDHPEYPFHDEY" polyA_signal 2669..2674 BASE COUNT 865 a 520 c 573 g 736 t ORIGIN 1 ccctaggaca cctttctaaa aagactccct gtggtgttca gaatcactcc tacagtcagg 61 ttctccacaa tggatctcag tgctgcaagt caccgcatac ctctaagtga tggaaacagc 121 attcccatca tcggacttgg tacctactca gaacctaaat cgacccctaa gggagcctgt 181 gcaacatcgg tgaaggttgc tattgacaca gggtaccgac atattgatgg ggcctacatc 241 taccaaaatg aacacgaagt tggggaggcc atcagggaga agatagcaga aggaaaggtg 301 cggagggaag atatcttcta ctgtggaaag ctatgggcta caaatcatgt cccagagatg 361 gtccgcccaa ccctggagag gacactcagg gtcctccagc tagattatgt ggatctttac 421 atcattgaag tacccatggc ctttaagcca ggagatgaaa tataccctag agatgagaat 481 ggcaaatggt tatatcacaa gtcaaatctg tgtgccactt gggaggcgat ggaagcttgc 541 aaagacgctg gcttggtgaa atccctggga gtgtccaatt ttaaccgcag gcagctggag 601 ctcatcctga acaagccagg actcaaacac aagccagtca gcaaccaggt tgagtgccat 661 ccgtatttca cccagccaaa actcttgaaa ttttgccaac aacatgacat tgtcattact 721 gcatatagcc ctttggggac cagtaggaat ccaatctggg tgaatgtttc ttctccacct 781 ttgttaaagg atgcacttct aaactcattg gggaaaaggt acaataagac agcagctcaa 841 attgttttgc gtttcaacat ccagcgaggg gtggttgtca ttcctaaaag ctttaatctt 901 gaaaggatca aagaaaattt tcagatcttt gacttttctc tcactgaaga agaaatgaag 961 gacattgaag ccttgaataa aaatgtccgc tttgtagaat tgctcatgtg gcgcgatcat 1021 cctgaatacc catttcatga tgaatactga ctgccgggag ttcctgaaca gatttttcac 1081 tcccatgagt gccaagacgg tgcaatgggt agtcccctag atgtgaaaat gaagagagag 1141 ggttttacca tcctgagaag aaataatgat ggaaacatgt ttaatgtttg tgcagtgtaa 1201 atgactttga ctcagtcaca ttgaagtaaa aatattaaaa tctgttgaaa taactcttag 1261 gaaattatca actaattttt tcagatcagt atcttctaga ttccagacag aaaaaaatta 1321 cacttcagaa aagacatcaa aggcaacata tgacaacaag taatttatga atctgggtag 1381 tagcgttggt aatctgagtt ctttaagggt tcacaggaca acgaagtgca tgtggcagtg 1441 tgctggcagt ggccttgagg ctttggacca ttggttacaa aacagacaca gccaagataa 1501 gatccacaca cacattatta acaaggaagt gatttgctgc accttgagtt gagaggacta 1561 catgtagaaa agtcttaaaa tagagctaaa caccacagtg gtcaacaaag ccatcataat 1621 gttggtgttt gtttccctcc aatgtatgta tgtttagttt ttatccaacc tgaggaatga 1681 aaacttaact ggatctctct tgcatcctta aagggcctga gtctcaacat ggctgctgat 1741 ccatacttac acatcttact gtcaatcttg cctacattga ttatagaacc actattacgt 1801 gaaaaggctt gaaacaacca acatatacaa ataaaaccct gccttgtaaa atagtaaaag 1861 agaagccata tattggcttt tcttcttaac ttgggagata tattgaaaca aggtgcttta 1921 taagattatt gtacttaaga ctttaatagt gttacttgga tagcttatat gaattttgag 1981 aattttatat gaattttgag aaagcaagtt caaaagaact ctggtaattt tcctgtatgt 2041 acaatttaaa gagtgaataa gattattaga attcagcaat agagatatat ctattttcaa 2101 ttcaactaca gaaatatatt ttattggccg ggtgcggtgg ctcatgccta taatcccagc 2161 actttgggag gccaaggtgg gcagatcagg aggtcaggag atcgagacca tcttggctaa 2221 caaggtgaaa ccccgtctct actaaaaata caaaaaatta gccaggcgcg gtggcggggg 2281 cctgtaatcc cagctactca ggaggctgag gcagaagaat ggcatgaacc cgggaggagg 2341 agcttgcagt gagccgagat agcgccactg cagtccggcc tgggtgaaag agcgagactc 2401 cgtctcaaaa acaaaaaaaa aaaagaaaag aaatatattt tattcattca cattaggtca 2461 ctgtcatact gtcataggct gagagagttc ttcaaaaatt atgttttccc aagatcagtt 2521 gcttatagat aatgttcaat gacctcaaga catatatttt tgagaaatta tcattttaaa 2581 aaatttggtc tatactgatt gttttcactg attccaatat tattacttat aacactgacc 2641 tctggaaaat attttgttca caagaaataa taaagtataa tgatttgttg catc // LOCUS HSDRB1R 1181 bp RNA PRI 21-MAR-1995 DEFINITION Human mRNA for HLA class II DR-beta 1 (Dw14). ACCESSION X02902 NID g30884 KEYWORDS cell surface glycoprotein; class II antigen; signal peptide. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1181) AUTHORS Cairns,J.S., Curtsinger,J.M., Dahl,C.A., Freeman,S., Alter,B.J. and Bach,F.H. TITLE Sequence polymorphism of HLA DR beta 1 alleles relating to T-cell-recognized determinants JOURNAL Nature 317 (6033), 166-168 (1985) MEDLINE 85296375 FEATURES Location/Qualifiers source 1..1181 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 57..857 /note="precursor" /codon_start=1 /db_xref="PID:g30885" /db_xref="SWISS-PROT:P13759" /translation="MVCLKFPGGSCMAALTVTLMVLSSPLALAGDTRPRFLEQVKHEC HFFNGTERVRFLDRYFYHQEEYVRFDSDVGEYRAVTELGRPDAEYWNSQKDLLEQRRA AVDTYCRHNYGVVESFTVQRRVYPEVTVYPAKTQPLQHHNLLVCSVNGFYPGSIEVRW FRNGQEEKTGVVSTGLIQNGDWTFQTLVMLETVPRSGEVYTCQVEHPSLTSPLTVEWR ARSESAQSKMLSGVGGFVLGLLFLGAGLFIYFRNQKGHSGLQPTGFLS" sig_peptide 57..143 /note="signal peptide (aa -29 to -1)" mat_peptide 144..854 /note="mature HLA class II DR-beta 1 chain (aa 1-237)" allele 363..365 /note="GCG (Alu) is GAG (Glu) in HLA DR-beta 1 (DW 13)" BASE COUNT 250 a 327 c 330 g 273 t 1 others ORIGIN 1 tccctgagtg agactcacct gctcctctgg cccctggtcc tgtcctgttc tccagcatgg 61 tgtgtctgaa gttccctgga ggctcctgca tggcagctct gacagtgaca ctgatggtgc 121 tgagctcccc actggctttg gctggggaca cccgaccacg tttcttggag caggttaaac 181 atgagtgtca tttcttcaac gggacggagc gggtgcggtt cctggacaga tacttctatc 241 accaagagga gtacgtgcgc ttcgacagcg acgtggggga gtaccgggcg gtgacggagc 301 tggggcggcc tgatgccgag tactggaaca gccagaagga cctcctggag cagaggcggg 361 ccgcggtgga cacctactgc agacacaact acggggttgt ggagagcttc acagtgcagc 421 ggcgagtcta tcctgaggtg actgtgtatc ctgcaaagac ccagcccctg cagcaccaca 481 acctcctggt ctgctctgtg aatggtttct atccaggcag cattgaagtc aggtggttcc 541 ggaacggcca ggaagagaag actggggtgg tgtccacagg cctgatccag aatggagact 601 ggaccttcca gaccctggtg atgctggaaa cagttcctcg gagtggagag gtttacacct 661 gccaagtgga gcacccaagc ctgacgagcc ctctcacagt ggaatggaga gcacggtctg 721 aatctgcaca gagcaagatg ctgagtggag tcgggggctt cgtgctgggc ctgctcttcc 781 ttggggccgg gctgttcatc tacttcagga atcagaaagg acactctgga cttcagccaa 841 caggattcct gagctgaagt gaagatgacc acattcaagg aagaaccttc tgccccagct 901 ttgcaggatg aaacacttcc ccgcttggct ctcattcttc cacaagagag acctttctcc 961 ggacctggtt gctactggtt cagcagctct gcagaaaatg tcctcccttg tggctgcctc 1021 agctcgtacc tttggcctga agtcccagca ttaatggcag cccctcatct tccaagtttt 1081 gtgctcccct ttacctaatg cttcctgcct cccatgcatc tgtactcctg ctgtgccaca 1141 aacanattac attattaaat gtttctcaaa catggagtta a // LOCUS HSDRES9 4211 bp RNA PRI 25-JUL-1997 DEFINITION H.sapiens mRNA for DRES9 protein. ACCESSION X98654 NID g2245316 KEYWORDS DRES9 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4211) AUTHORS Rubboli,F. TITLE Direct Submission JOURNAL Submitted (19-JUN-1996) F. Rubboli, Tigem - Telethon Institute of Genetics, and Medicine, Via Olgettina 58, I- 20132 Milan - Italy, ITALY REFERENCE 2 (bases 1 to 4211) AUTHORS Vihtelic,T.S., Hyde,D.R. and O'Tousa,J.E. TITLE Isolation and characterization of the Drosophila retinal degeneration B (rdgB) gene JOURNAL Genetics 127 (4), 761-768 (1991) MEDLINE 91231170 REFERENCE 3 (bases 1 to 4211) AUTHORS Rubboli,F., Bulfone,A., Bogni,S., Marchitiello,A., Zollo,M., Borsani,G., Ballabio,A. and Banfi,S. TITLE A mammalian homolog of the Drosophila retinal degeneration B gene: implications for the evolution of phototransduction mechanisms JOURNAL Genes Funct. 1, 205-214 (1997) REFERENCE 4 (bases 1 to 4211) AUTHORS Rubboli,F. TITLE Direct Submission JOURNAL Submitted (04-JUL-1997) F. Rubboli, Tigem - Telethon Institute of Genetics, and Medicine, Via Olgettina 58, I- 20132 Milan - Italy, ITALY FEATURES Location/Qualifiers source 1..4211 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="NT2D1" /chromosome="11" /map="11q13" gene 190..3924 /gene="DRES9" CDS 190..3924 /gene="DRES9" /codon_start=1 /product="homologue of Drosphila retinal degeneration B gene" /db_xref="PID:e326231" /db_xref="PID:g2245317" /translation="MLIKEYHILLPMSLDEYQVAQLYMIQKKSREESSGEGSGVEILA NRPYTDGPGGSGQYTHKVYHVGSHIPGWFRALLPKAALQVEEESWNAYPYTRTRYTCP FVEKFSIEIETYYLPDGGQQPNVFNLSGAERRQRILDTIDIVRDAVAPGEYKAEEDPR LYHSVKTGRGPLSDDWARTAAQTGPLMCAYKLCKVEFRYWGMQAKIEQFIHDVGLRRV MLRAHRQAWCWQDEWTELSMADIRALEEETARMLAQRMAKCNTGSEGSEAQPPGKPST EARSAASNTGTPDGPEAPPGPDASPDASFGKQWSSSSRSSYSSQHGGAVSPQSLSEWR MQNIARDSENSSEEEFFDAHEGFSDSEEVFPKEMTKWNSNDFIDAFASPVEAEGTPEP GAEAAKGIEDGAQAPRDSEGLDGAGELGAEACAVHALFLILHSGNILDSGPGDANSKQ ADVQTLSSAFEAVTRIHFPEALGHVALRLVPCPPICAAAYALVSNLSPYSHDGDSLSR SQDHIPLAALPLLATSSSRYQGAVATVIARTNQAYSAFLRSPEGAGFCGQVALIGDGV GGILGFDALCHSANAGTGSRGSSRRGSMNNELLSPEFGPVRDPLADGVEGLGRGSPEP SALPPQRIPSDMASPEPEGSQNSLQAAPATTSSWEPRRASTAFCPPAASSEAPDGPSS TARLDFKVSGFFLFGSPLGLVLALRKTVMPALEAAQMRPACEQIYNLFHAADPCASRL EPLLAPKFQAIAPLTVPRYQKFPLGDGSSLLLADTLQTHSSLFLEELEMLVPSTPTST SGAFWKGSELATDPPAQPAAPSTTSEVVKILERWWGTKRIDYSLYCPEALTAFPTVTL PHLFHASYWESADVVAFILRQVIEKERPQLAECEEPSIYSPAFPREKWQRKRTQVKIR NVTSNHRASDTVVCEGRPQVLSGRFMYGPLDVVTLTGEKVDVYIMTQPLSGKWIHFGT EVTNSSGRLTFPVPPERALGIGVYPVRMVVRGDHTYAECCLTVVARGTEAVVFSIDGS FTASVSIMGSDPKVRAGAVDVVRHWQDSGYLIVYVTGRPDMQKHRVVAWLSQHNFPHG VVSFCDGLTHDPLRQKAMFLQSLVQEVGLNIVAGYGSPKDVAVYTALGLSPSQTYIVG RAVRKLQAQCQFLSDGYVAHLGQLEAGSHSHASSGPPRAALGKSSYGVAAPVDFLRKQ SQLLRSRGPSQAEREGPGTPPTTLARGKARSISLKLDSEE" BASE COUNT 787 a 1417 c 1300 g 707 t ORIGIN 1 cgcgggcggc ggtgctaggc tcgcctctgc ctccctcgtg gcgggcccgg acatggggtc 61 ccgtggcctg agtccctcgg ccggcgcgca gggctcgggg ccgcgcgcac cttccccgca 121 ctgactgtcg cgccgtgccc tgcgccagga ggagcggagg ccgcgcgcgg cccgccgagc 181 gccttcagga tgctcatcaa ggaataccac attctgctgc ccatgagcct ggacgagtac 241 caggtggccc agctctacat gatccagaaa aagagccggg aggagtctag tggtgagggc 301 agcggcgtgg agatcctggc caaccggccc tacacggatg ggcccggggg cagcgggcaa 361 tacacacaca aggtgtacca cgtgggctcc cacatcccag gctggttccg ggcactgctg 421 cccaaggctg ccctgcaggt agaagaggaa tcctggaatg cctaccccta cacccgaacc 481 cggtacacct gccctttcgt ggagaaattc tccattgaaa ttgagaccta ttacctgcct 541 gatggggggc agcagccaaa cgtcttcaac ctgagcgggg ccgagaggag acagcgcatc 601 ctggacacca tcgacatcgt gcgggatgca gtggccccag gcgagtacaa agcagaagag 661 gacccccggc tttatcactc ggtcaagacg ggccgagggc cactgtctga tgactgggca 721 cggacggcgg cacagacggg gccccttatg tgtgcctata agctgtgcaa ggttgagttc 781 cgctactggg gcatgcaagc caagatcgag cagttcatcc atgatgtagg tctgcgtcgg 841 gtgatgctgc gggcccaccg ccaggcctgg tgctggcagg atgagtggac agagctgagc 901 atggctgaca tccgggcact ggaagaggag actgctcgca tgctggccca gcgcatggcc 961 aagtgcaaca caggcagtga ggggtccgag gcccagcccc ccgggaaacc gagcaccgag 1021 gcccggtctg cggccagcaa cactggcacc cccgatgggc ctgaggcccc cccaggccca 1081 gatgcctccc ccgatgccag ctttgggaag cagtggtcct catcctcccg ttcctcctac 1141 tcatcccaac atggaggggc tgtgtctccc cagagcttgt ctgagtggcg catgcagaac 1201 attgcccgag actctgagaa cagctccgag gaagagttct ttgatgccca cgaaggcttc 1261 tcggacagtg aggaggtctt ccccaaggag atgaccaagt ggaactccaa tgacttcatt 1321 gatgcctttg cctccccagt ggaggcagag ggaacgccag agcctggagc cgaggcagct 1381 aaaggcattg aggatggggc ccaagcaccc agggactcag agggcctgga tggagccggg 1441 gagctggggg ctgaggcatg cgcagtccac gccctcttcc ttatcctgca cagcggcaac 1501 atcctggact caggccctgg agacgccaac tccaagcagg cggatgtgca gacgctgagc 1561 tccgccttcg aggccgtcac ccgcatccac ttccctgagg ccttgggcca cgtggcgctg 1621 cgactggtgc cctgtccacc catctgcgcc gccgcctatg cccttgtctc caacctgagc 1681 ccttacagcc acgatgggga cagcctgtct cgctcccaag accacattcc actggctgcc 1741 ctgccactgc tggccacctc atcctcccgc taccagggcg ccgtggccac cgtcattgcc 1801 cgcaccaacc aggcctactc agccttcctg cgctcacctg agggtgccgg cttctgtggg 1861 caggtcgcac tgattggaga tggtgttggt ggcatcctgg gctttgatgc actctgccac 1921 agtgctaacg cgggcaccgg gagtcggggc agcagccgcc gtgggagcat gaacaatgag 1981 ctgctctctc cggagtttgg cccagtgcgg gaccccctgg cagatggtgt ggaaggcctg 2041 ggtcggggca gcccagaacc ctcggccttg cctccccagc gcatccccag cgacatggcc 2101 agtcctgagc ccgagggctc tcagaacagc cttcaggcag cccccgcaac cacctcctcc 2161 tgggagcccc ggcgggcaag cacggccttc tgcccacccg ctgccagttc cgaggcacct 2221 gacggcccca gcagcactgc ccgccttgac ttcaaggtct ctggcttctt cctcttcggc 2281 tccccactgg gcctggtgct ggctctgcgc aaaactgtga tgcccgccct ggaggcagcc 2341 cagatgcgcc cagcctgtga acagatctac aacctcttcc acgcggccga cccctgcgcc 2401 tcacgcctcg agcccctgct ggccccgaag ttccaggcca tcgccccact gaccgtgccc 2461 cgctaccaga agttccccct gggagatggc tcatccctgc tgctggccga cactctgcag 2521 acgcactcca gcctctttct ggaggagctg gagatgctgg tgccctcaac acccacctct 2581 actagcggtg ccttctggaa gggcagtgag ttggccactg accccccggc ccagccagcc 2641 gcccccagca ccaccagtga ggtggttaag atcctggagc gctggtgggg gaccaagcgg 2701 atcgactact cgctgtactg ccccgaggcg ctcaccgcct ttcccaccgt cacgctgccc 2761 cacctcttcc acgccagcta ctgggagtcc gccgacgtgg tggcgttcat cctgcgccag 2821 gtgatcgaga aggagcggcc acagctggcg gaatgcgagg agccgtccat ctacagcccg 2881 gccttcccca gggagaagtg gcagcgaaaa cgcacgcagg tcaagatccg gaacgtcact 2941 tccaaccacc gggcgagcga cacggtggtg tgcgagggcc gcccccaggt gctaagcggg 3001 cgcttcatgt acgggcccct ggacgtcgtc acgctcactg gagagaaggt ggatgtctac 3061 atcatgacgc agccgctgtc gggcaagtgg atccactttg gcaccgaagt caccaatagc 3121 tcgggccgcc tcaccttccc agttccccca gaacgcgcgc tgggcattgg tgtctacccc 3181 gtgcgcatgg tggtcagggg cgaccacacc tatgccgaat gctgcctgac tgtggtggcc 3241 cgcggcacgg aggctgtggt cttcagcatc gacggctcct tcaccgccag cgtctccatc 3301 atgggcagcg accccaaggt gcgagctggc gccgtggacg tggtcaggca ctggcaggac 3361 tccggctacc tgatcgtgta tgtcacaggc cggccggata tgcagaagca ccgcgtggtg 3421 gcatggctgt cgcagcacaa cttcccccac ggcgtcgtct ccttctgcga cggcctcacc 3481 cacgacccac tacgccagaa ggcaatgttt ctccagagcc tggttcagga ggtaggactg 3541 aacatcgtgg ccggttatgg gtctcccaaa gatgtggctg tatacacggc gctggggctg 3601 tccccgagcc agacctacat cgtgggccgt gccgtgcgga agctacaggc gcagtgccag 3661 ttcctgtcag acggctatgt ggcccacctg ggccagctgg aagcgggctc gcactcgcat 3721 gcctcctcgg gacccccgag agctgccttg ggcaagagca gctatggtgt ggctgccccc 3781 gtggacttcc tgcgcaaaca gagccagctg cttcgctcga ggggccccag ccaggcggag 3841 cgtgagggcc cgggaacacc acccaccacc ctggcacggg gcaaagcacg gagcatcagc 3901 ctgaagctgg acagcgagga gtgaggccca caccagcctg gacctgggtt atttattgac 3961 acacccaagg ggcccgaggg gctgcgtgtg ggaggctggg gacccagact tttggcccca 4021 gcgctggccc ccccagcccc acaccctata tctccgtgtg ctcctcggtg ttacttccct 4081 ttcatatgag gggacccagc gccgggggga gggaggaggg cgtgggcatg ggcgcagagg 4141 cttttccagt gtgtataaat ccatgaaaat aaacgccacc tgcaccccaa aaaaaaaaaa 4201 aaaaaaaaaa a // LOCUS HSDS1GENE 888 bp RNA PRI 19-MAR-1996 DEFINITION H.sapiens DS-1 mRNA. ACCESSION X81788 NID g1045058 KEYWORDS DS-1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 888) AUTHORS van Belzen,N., Diesveld,M.P., van der Made,A.C., Nozawa,Y., Dinjens,W.N., Vlietstra,R., Trapman,J. and Bosman,F.T. TITLE Identification of mRNAs that show modulated expression during colon carcinoma cell differentiation JOURNAL Eur. J. Biochem. 234 (3), 843-848 (1995) MEDLINE 96163468 REFERENCE 2 (bases 1 to 888) AUTHORS Van Belzen,N. TITLE Direct Submission JOURNAL Submitted (20-SEP-1994) N. Van Belzen, Dept. of Pathology, Erasmus University Rotterdam, PO Box 1738, 3000 DR Rotterdam, NETHERLANDS FEATURES Location/Qualifiers source 1..888 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HT29-D4 colon carcinoma" gene 3..623 /gene="DS-1" CDS 3..623 /gene="DS-1" /note="putative" /codon_start=1 /product="DS-1 protein" /db_xref="PID:g1045059" /translation="MAATRCLRWGLSRAGVWLLPPPARCPRRALHKQKDGTEFKSIYS LDKLYPESQGSDTAWRVPNGAKQADSDIPLDRLTISYCRSSGPGGQNVNKVNSKAEVR FHLATAEWIAEPVRQKIAITHKNKINRLGELILTSESSRYQFRNLADCLQKIRDMITE ASQTPKEPTKEDVKLHRIRIENMNRERLRQKRIHSAVKTSRRVDMD" BASE COUNT 255 a 214 c 237 g 182 t ORIGIN 1 gcatggcggc caccaggtgc ctgcgctggg gcctgagccg agccggagtc tggctgctcc 61 caccgcccgc acggtgccca cgccgggcgc tgcacaagca gaaagacggc actgagttca 121 agagcatcta cagcctggac aagctctacc ccgaatctca gggctcggac accgcctgga 181 gggtcccgaa tggtgcaaag caagccgaca gtgacatccc tctagatcgc ttgacaatat 241 cttattgtcg gagtagtggt cctggggggc agaatgtgaa caaagtgaat tccaaggcag 301 aagtcaggtt ccatttggca actgccgagt ggatcgcgga gcccgtgcgg cagaagatag 361 ccatcacgca taaaaacaag atcaacaggt taggagagtt gatccttacc tctgagagca 421 gccgctatca gttccggaat ctggcagatt gcctgcagaa aattcgagac atgatcactg 481 aggccagcca gacaccgaag gagccaacaa aagaagatgt taaacttcat agaatcagga 541 tagaaaacat gaatcgggaa aggctgagac aaaagagaat tcattctgct gtaaagacaa 601 gcaggagggt cgacatggac tgaaatcacc ctctgcagct gggagggctc ttctgggcgt 661 ccgggcagct gcagctgaga ggactttcac accataagga gatttctgtt tttctttttg 721 gctgttaatg cttgtctata acattggagc catcacaaga atgttcattt ggaatgaagg 781 ctgcaggcac tggttgcaga cgtctttata ggcagtcacc atgttgtcaa accttaataa 841 tgcacctcat gtattagtca caataaaaat cagaactcaa aaaaaaaa // LOCUS HSDS33 1252 bp RNA PRI 04-AUG-1995 DEFINITION H.sapiens mRNA for unknown protein expressed in macrophages. ACCESSION X89059 NID g929630 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1252) AUTHORS Krause,S.W., Rehli,M., Kreutz,M., Schwarzfischer,L., Paulauskis,J.D. and Andreesen,J.D. TITLE Differential screening leads to novel genetic markers of monocyte to macrophage maturation JOURNAL Unpublished REFERENCE 2 (bases 1 to 1252) AUTHORS Krause,S. TITLE Direct Submission JOURNAL Submitted (19-JUL-1995) S. Krause, Klinik und Poliklinik fuer Innere Med I, Abt.Haematologie und Internistische Onkologie, Klinikum der Universitaet Regensburg, D- 93042 Regensburg, FRG FEATURES Location/Qualifiers source 1..1252 /organism="Homo sapiens" /sub_species="caucasian" /db_xref="taxon:9606" /dev_stage="adult" /cell_type="macrophage" /clone_lib="lambda ZAPII" /clone="ds33" CDS <1..463 /note="putative ORF" /codon_start=2 /db_xref="PID:g929631" /translation="GTSNSKDIQNLSVGLPRADEGLPANESFLNGNLAGASLSPLHTK TYQASSQPGSTSKDLTNNNIPHLLSPKEAKSKQSLILILTQSLQKAQGQSTSSQTADL SRTATHSWKALKAKLGHCSPMKSRVGIAILTQFPSPLGVPPTGPRPKAMGH" CDS 1046..1234 /note="putative ORF" /codon_start=1 /db_xref="PID:g929632" /translation="MTHTLMMAQPPKKIDTYTMILCQGELVAFTECHLHVQTILSMKI LVIQSNSRKKRSKDFSGQ" BASE COUNT 391 a 348 c 272 g 241 t ORIGIN 1 cggcacgagt aacagcaagg acatccagaa cctgagtgta ggcctgcccc gggctgacga 61 aggtctccct gccaatgaaa gcttcctaaa tggaaacctt gctggagcta gtcttagtcc 121 actgcacacc aaaacctacc aagcaagcag ccagcctggg tctaccagca aagatctcac 181 caacaacaac ataccacacc ttcttagccc aaaagaagcc aagtcaaaac agagtttgat 241 tttaatattg acccaaagcc ttcagaaggc ccagggacaa agtacctcaa gtcaaacagc 301 agatctcagc agaaccgcca ctcattcatg gaaagctctc aaagcaaagc tgggacactg 361 cagcccaatg aaaagcagag tcggcatagc tatattgaca caattcccca gtcctctagg 421 agtccctcct acaggaccaa ggccaaaagc catggggcac tgagtgactc caagtctgtg 481 agcaaccttt ctgaagccag ggcccaaatt gcggagccca gtaccagtag gtacttccca 541 tctagctgct tagacttgaa ttctcccacc agcccaaccc ccaccagaca cagtgacacg 601 agaacttggc tcagcccttc tggaagaaat aaccgaaatg agggaacgct ggactcacgt 661 cgaaccacaa ccagacattc taagacgatg gaggaattga agctgccgga gcacatggac 721 agtagccatt cccattcact gtctgcacct cacgaatctt tttcttatgg actgggctac 781 accagcccct tttcttccca gcaacgtcct cataggcatt ctatgtatgt gacccgtgac 841 aaagtgagag ccaagggctt ggatggaagc ttgagcatag ggcaagggat ggcagctaga 901 gccaacagcc tgcaactctt gtcaccccag cctggagaac agctccctcc agagatgact 961 gtggcaagat cttcggtcaa agagacctcc agagaaggca cctcttcctt ccatacacgc 1021 cagaagtctg agggtggagt gtatcatgac ccacactctg atgatggcac agcccccaaa 1081 gaaaatagac acctatacaa tgatcctgtg ccaaggagag ttggtagctt ttacagagtg 1141 ccatctccac gtccagacaa ttctttccat gaaaatatta gtcattcaga gcaactcaag 1201 gaaaaagaga agcaaggatt tttcaggtca atgaaaaaaa aaaaaaaaaa aa // LOCUS HSDSARCOG 873 bp RNA PRI 22-AUG-1996 DEFINITION H.sapiens mRNA for delta-sarcoglycan. ACCESSION X95191 NID g1495426 KEYWORDS delta-sarcoglycan; sarcoglycan. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 873) AUTHORS Nigro,V., Piluso,G., Belsito,A., Politano,L., Puca,A.A., Papparella,S., Rossi,E., Viglietto,G., Esposito,M.G., Abbondanza,C., Medici,N., Molinari,A.M., Nigro,G. and Puca,G.A. TITLE Identification of a novel sarcoglycan gene at 5q33 encoding a sarcolemmal 35 kDa glycoprotein JOURNAL Hum. Mol. Genet. 5 (8), 1179-1186 (1996) MEDLINE 96440427 REFERENCE 2 (bases 1 to 873) AUTHORS Nigro,V. TITLE Direct Submission JOURNAL Submitted (17-JAN-1996) V. Nigro, Seconda Universita degli Studi di Napoli, Ist. di Patologia Generale e Oncologia, Larghetto S. Aniello a Caponapoli 2, Napoli, Napoli, 80138 I, ITALY FEATURES Location/Qualifiers source 1..873 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="skelatal muscle" CDS 1..873 /codon_start=1 /product="delta-sarcoglycan" /db_xref="PID:e218673" /db_xref="PID:g1495427" /translation="MMPQEQYTHHRSTMPGSVGPQVYKVGIYGWRKRCLYFFVLLLMI LILVNLAMTIWILKVMNFTIDGMGNLRITEKGLKLEGDSEFLQPLYAKEIQSQPGNAL YFKSARNVTVNILNDQTKVLTQLITGPKAVEAYGKKFEVKTVSGKLLFSADNNEVVVG AERLRVLGAEGTVFPKSIETPNVRADPFKELRLESPTRSLVMEAPKGVEINAEAGNME ATCRTELRLESKDGEIKLDAAKIRLPRLPHGSYTPTGTRQKVFEICVCANGRLFLSQA GAGSTCQINTSVCL" BASE COUNT 264 a 194 c 214 g 201 t ORIGIN 1 atgatgcctc aggagcagta cactcaccac cggagcacca tgcctggctc tgtggggcca 61 caggtataca aggtggggat ttatggctgg cggaaacgat gcctgtattt ctttgtcctg 121 ctcctcatga ttttaatact ggtgaacttg gccatgacca tctggattct caaagtcatg 181 aacttcacaa ttgatggaat gggaaacctg aggatcacag aaaaaggtct aaagctagaa 241 ggagactctg aattcttaca acctctctac gccaaagaaa tccagtccca accaggtaat 301 gccctgtact tcaagtctgc cagaaatgtt acagtgaaca ttctcaatga ccagactaaa 361 gtgctaactc agcttataac aggtccaaaa gccgtagaag cttatggtaa aaaatttgag 421 gtaaaaactg tttctggaaa attgctcttc tctgcagaca ataatgaagt ggtagtagga 481 gctgaaagat tacgagtttt aggagcggag ggcacagtgt tccctaaatc tatagaaaca 541 cctaatgtca gggcagaccc cttcaaagaa ctaaggttgg agtccccaac ccggtctcta 601 gtgatggagg ccccaaaagg agtggaaatc aatgcagaag ctggcaatat ggaagccacc 661 tgcaggacag agctgagact ggaatccaaa gatggagaga ttaagttaga tgctgcgaaa 721 atcaggctac ctagactgcc tcatggatcc tacacgccta caggaacgag gcagaaggtc 781 ttcgagatct gcgtctgcgc caatgggaga ttattcctgt ctcaggcagg agctgggtcc 841 acttgtcaga taaacacaag tgtctgcctc tga // LOCUS HSDSC3MR 3299 bp RNA PRI 14-DEC-1995 DEFINITION H.sapiens mRNA for type 3 desmocollin. ACCESSION X83929 NID g1122882 KEYWORDS desmocollin; desmocollin type 3; DSC3 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3299) AUTHORS King,I.A., Sullivan,K.H., Bennett,R. Jr. and Buxton,R.S. TITLE The desmocollins of human foreskin epidermis: identification and chromosomal assignment of a third gene and expression patterns of the three isoforms JOURNAL J. Invest. Dermatol. 105 (3), 314-321 (1995) MEDLINE 95395282 REFERENCE 2 (bases 1 to 3299) AUTHORS Buxton,R.S. TITLE Direct Submission JOURNAL Submitted (16-JAN-1995) R.S. Buxton, National Inst. of Medical Research, The Ridgeway, Mill Hill, London, NW7 1AA, UK FEATURES Location/Qualifiers source 1..3299 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="keratinocyte" /clone_lib="lambda gt11" /chromosome="18" /map="18q12.1" gene 22..2712 /gene="DSC3" CDS 22..2712 /gene="DSC3" /codon_start=1 /product="desmocollin type 3" /db_xref="PID:e135090" /db_xref="PID:g1122883" /translation="MAAAGPRRSVRGAVCLHLLLTLVIFSRDGEACKKVILNVPSKLE ADKIIGRVNLEECFRSADLIRSSDPDFRVLNDGSVYTARAVALSDKKRSFTIWLSDKR KQTQKEVTVLLEHQKKVSKTRHTRETVLRRAKRRWAPIPCSMQENSLGPFPLFLQQVE SDAAQNYTVFYSISGRGVDKEPLNLFYIERDTGNLFCTRPVDREEYDVFDLIAYASTA DGYSADLPLPLPIRVEDENDNHPVFTEAIYNFEVLESSRPGTTVGVVCATDRDEPDTM HTRLKYSILQQTPRSPGLFSVHPSTGVITTVSHYLDREVVDKYSLIMKVQDMDGQFFG LIGTSTCIITVTDSNDNAPTFRQNAYEAFVEENAFNVEILRIPIEDKDLINTANWRVN FTILKGNENGHFKISTDKETNEGVLSVVKPLNYEENRQVNLEIGVNNEAPFARDIPRV TALNRALVTVHVRDLDEGPECTPAAQYVRIKENLAVGSKINGYKAYDPENRNGNGLRY KKLHDPKGWITIDEISGSIITSKILDREVETPKNELYNITVLAIDKDDRSCTGTLAVN IEDVNDNPPEILQEYVVICKPKMGYTDILAVDPDEPVHGAPFYFSLPNTSPEISRLWS LTKVNDTAARLSYQKNAGFQEYTIPITVKDRAGQAATKLLRVNLCECTHPTQCRATSR STGVILGKWAILAILLGIALLFSVLLTLVCGVFGATKGKRFPEDLAQQNLIISNTEAP GDDRVCSANGFMTQTTNNSSQGFCGTMGSGMKNGGQETIEMMKGGNQTLESCRGAGHH HTLDSCRGGHTEVDNCRYTYSEWHSFTQPRLGEKLHRCNQNEDRMPSQDYVLTYNYEG RGSPAGSVGCCSEKQEEDGLDFLNNLEPKFITLAEACTKR" mat_peptide 427..2709 /gene="DSC3" /product="desmocollin type 3" misc_feature 427..2091 /gene="DSC3" /note="extracellular domain" misc_feature 517..525 /gene="DSC3" /note="pot. N-linked glycosylation site" misc_feature 1195..1203 /gene="DSC3" /note="pot. N-linked glycosylation site" misc_feature 1657..1665 /gene="DSC3" /note="pot. N-linked glycosylation site" misc_feature 1906..1914 /gene="DSC3" /note="pot. N-linked glycosylation site" misc_feature 2092..2163 /gene="DSC3" /note="transmembrane domain" misc_feature 2164..2709 /gene="DSC3" /note="cytoplasmic domain Dsc3a" misc_feature 2590..2598 /gene="DSC3" /note="pot. O-phosphorylation site" BASE COUNT 1067 a 619 c 714 g 899 t ORIGIN 1 gaattccggc ccggcatccc gatggccgcc gctgggcccc ggcgctccgt gcgcggagcc 61 gtctgcctgc atctgctgct gaccctcgtg atcttcagtc gtgatggtga agcctgcaaa 121 aaggtgatac ttaatgtacc ttctaaacta gaggcagaca aaataattgg cagagttaat 181 ttggaagagt gcttcaggtc tgcagacctc atccggtcaa gtgatcctga tttcagagtt 241 ctaaatgatg ggtcagtgta cacagccagg gctgttgcgc tgtctgataa gaaaagatca 301 tttaccatat ggctttctga caaaaggaaa cagacacaga aagaggttac tgtgctgcta 361 gaacatcaga agaaggtatc gaagacaaga cacactagag aaactgttct caggcgtgcc 421 aagaggagat gggcacctat tccttgctct atgcaagaga attccttggg ccctttccca 481 ttgtttcttc aacaagttga atctgatgca gcacagaact atactgtctt ctactcaata 541 agtggacgtg gagttgataa agaaccttta aatttgtttt atatagaaag agacactgga 601 aatctatttt gcactcggcc tgtggatcgt gaagaatatg atgtttttga tttgattgct 661 tatgcgtcaa ctgcagatgg atattcagca gatctgcccc tcccactacc catcagggta 721 gaggatgaaa atgacaacca ccctgttttc acagaagcaa tttataattt tgaagttttg 781 gaaagtagta gacctggtac tacagtgggg gtggtttgtg ccacagacag agatgaaccg 841 gacacaatgc atacgcgcct gaaatacagc attttgcagc agacaccaag gtcacctggg 901 ctcttttctg tgcatcccag cacaggcgta atcaccacag tctctcatta tttggacaga 961 gaggttgtag acaagtactc attgataatg aaagtacaag acatggatgg ccagtttttt 1021 ggattgatag gcacatcaac ttgtatcata acagtaacag attcaaatga taatgcaccc 1081 actttcagac aaaatgctta tgaagcattt gtagaggaaa atgcattcaa tgtggaaatc 1141 ttacgaatac ctatagaaga taaggattta attaacactg ccaattggag agtcaatttt 1201 accattttaa agggaaatga aaatggacat ttcaaaatca gcacagacaa agaaactaat 1261 gaaggtgttc tttctgttgt aaagccactg aattatgaag aaaaccgtca agtgaacctg 1321 gaaattggag taaacaatga agcgccattt gctagagata ttcccagagt gacagccttg 1381 aacagagcct tggttacagt tcatgtgagg gatctggatg aggggcctga atgcactcct 1441 gcagcccaat atgtgcggat taaagaaaac ttagcagtgg ggtcaaagat caacggctat 1501 aaggcatatg accccgaaaa tagaaatggc aatggtttaa ggtacaaaaa attgcatgat 1561 cctaaaggtt ggatcaccat tgatgaaatt tcagggtcaa tcataacttc caaaatcctg 1621 gatagggagg ttgaaactcc caaaaatgag ttgtataata ttacagtcct ggcaatagac 1681 aaagatgata gatcatgtac tggaacactt gctgtgaaca ttgaagatgt aaatgataat 1741 ccaccagaaa tacttcaaga atatgtagtc atttgcaaac caaaaatggg gtataccgac 1801 attttagctg ttgatcctga tgaacctgtc catggagctc cattttattt cagtttgccc 1861 aatacttctc cagaaatcag tagactgtgg agcctcacca aagttaatga tacagctgcc 1921 cgtctttcat atcagaaaaa tgctggattt caagaatata ccattcctat tactgtaaaa 1981 gacagggccg gccaagctgc aacaaaatta ttgagagtta atctgtgtga atgtactcat 2041 ccaactcagt gtcgtgcgac ttcaaggagt acaggagtaa tacttggaaa atgggcaatc 2101 cttgcaatat tactgggtat agcactgctc ttttctgtat tgctaacttt agtatgtgga 2161 gtttttggtg caactaaagg gaaacgtttt cctgaagatt tagcacagca aaacttaatt 2221 atatcaaaca cagaagcacc tggagacgat agagtgtgct ctgccaatgg atttatgacc 2281 caaactacca acaactctag ccaaggtttt tgtggtacta tgggatcagg aatgaaaaat 2341 ggagggcagg aaaccattga aatgatgaaa ggaggaaacc agaccttgga atcctgccgg 2401 ggggctgggc atcatcatac cctggactcc tgcaggggag gacacacgga ggtggacaac 2461 tgcagataca cttactcgga gtggcacagt tttactcagc cccgtctcgg tgaaaaattg 2521 catcgatgta atcagaatga agaccgcatg ccatcccaag attatgtcct cacttataac 2581 tatgagggaa gaggatctcc agctggttct gtgggctgct gcagtgaaaa gcaggaagaa 2641 gatggccttg actttttaaa taatttggaa cccaaattta ttacattagc agaagcatgc 2701 acaaagagat aatgtcacag tgctacaatt aggtctttgt cagacattct ggaggtttcc 2761 aaaaataata ttgtaaagtt caatttcaac atgtatgtat atgatgattt ttttctcaat 2821 tttgaattat gctactcacc aattatattt ttaaagcaag ttgttgctta tcttttccaa 2881 aaagtgaaaa atgttaaaac agacaactgg taaatctcaa actccagcac tggaattaag 2941 gtctctaaag catctgctct tttttttttt acggatattt tagtaataaa tatgctggat 3001 aaatattagt ccaacaatag ctaagttatg ctaatatcac attattatgt attcacttta 3061 agtgatagtt taaaaaataa acaagaaata ttgagtatca ctatgtgaag aaagttttgg 3121 aaaagaaaca atgaagactg aattaaatta aaaatgttgc agctcataaa gaattgggac 3181 tcacccctac tgcactacca aattcatttg actttggagg caaaatgtgt tgaagtgccc 3241 tatgaagtag caattttcta taggaatata gttggaagta aacggaatcc gcggaattc // LOCUS HSDYNEIN 3163 bp RNA PRI 08-DEC-1996 DEFINITION H.sapiens mRNA dynein-related protein. ACCESSION X99947 NID g1729763 KEYWORDS dynein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3163) AUTHORS Milisav,I., Jones,M.H. and Affara,N.A. TITLE Characterization of a novel human dynein-related gene that is specifically expressed in the testis JOURNAL Unpublished REFERENCE 2 (bases 1 to 3163) AUTHORS Milisav,I. TITLE Direct Submission JOURNAL Submitted (12-AUG-1996) I. Milisav, University of Cambridge, Department of Pathology, Tennis Court Road, Cambridge, CB2 1QP, UNITED KINGDOM FEATURES Location/Qualifiers source 1..3163 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="testis" /chromosome="17" CDS 530..2926 /codon_start=1 /product="dynein-related protein" /db_xref="PID:e258465" /db_xref="PID:g1729764" /translation="MNDLSKIHPMYQFSLKAFSIVFQKAVERAAPDESLRERVANLID SITFSVYQYTIRGLFECDKLTYLAQLTFQILLMNREVNAVELDFLLRSPVQTGTASPV EFLSHQAWGAVKVLSSMEEFSNLDRDIEGSAKSWKKFVESECPEKEKLPQEWKNKTAL QRLCMLRAMRPDRMTYALRDFVEEKLGSKYVVGRALDFATSFEESGPATPMFFILSPG VDPLKDVESQGRKLGYTFNNQNFHNVSLGQGQEVVAEAALDLAAKKGHWVILQNIHLV AKWLSTLEKKLEEHSENSHPEFRVFMSAEPAPSPEGHIIPQGILENSIKITNEPPTGM HANLHKALDNFTQDTLEMCSRETEFKSILFALCYFHAVVAERRKFGPQGWNRSYPFNT GDLTISVNVLYNFLEANAKVPYDDLRYLFGEIMYGGHITDDWDRRLCRTYLGEFIRPE MLEGELSLAPGFPLPGNMDYNGYHQYIDAELPPESPYLYGLHPNAEIGFLTQTSEKLF RTVLELQPRDSQARDGAGATREEKVKALLEEILERVTDEFNIPELMAKVEERTPYIVV AFQECGRMNILTREIQRSLRELELGLKGELTMTSHMENLQNALYFDMVPESWARRAYP STAGLAARFPDLLNRIKELEAWTGDFTIPSTVWLTGFFNPQSFLTAIMQSTARKNEWP LDQMALQCDMTKKNREEFRSPPREGAYIHGLFMEGACWDTQAGIITEAKLKDLTPPMP VMFIKAIPADKQDCRSVYSCPVYKTSQRGPTYVWTFNLKTKENPSKWVLAGVALLLQI " polyA_site 2949 polyA_site 3126 BASE COUNT 850 a 809 c 831 g 673 t ORIGIN 1 tcgtgttaag aaacaagaaa ggtcagggtg gctgtggttg gaacctccaa caggaaggtt 61 attcttccca cggatccaaa ggctgtacaa gggcaaagac actacagctg cctgaagtta 121 gtgctttagg atatggagca taaagagagg aaaagatgag atccacttgg ataccagcac 181 cccactccag cctcgaggaa gcttttacac agtctgccat ggttttacac agtttaaaac 241 tggaagacag attctgctga ttttctttta atccgatctc acaaagcagc agaatggatt 301 caaaattacc ctgaaaacgt tggaagacag tcttctctct cgcctctcct ccgcctctgg 361 gaacttcctg ggagaaacag tgctggtgga aaacctagag atcaccaagc agactgctgc 421 cgaagttgag aaaaaggtcc agggggccaa ggtgactgaa gtgaaaatca acgaggcccg 481 agagcactac cggccagcag ctgccagggc ctcactgctc tacttcatca tgaacgacct 541 cagcaagatc catccaatgt accagttttc tctcaaggcc ttcagtatcg tcttccagaa 601 ggctgtggag agggctgctc ctgacgaaag cctcagggag cgggtggcca acctaataga 661 cagcataacc ttctctgtgt accagtacac catccgcggg ctctttgagt gtgataagct 721 gacctacctt gcccagctca cctttcagat tctcctcatg aaccgagaag tcaatgcagt 781 ggagttggat ttcctgcttc gatctccagt gcagacgggc accgccagcc ccgtggagtt 841 cctctcccat caggcgtggg gagctgtcaa ggtactttca tcaatggaag aattctctaa 901 tctggatcgg gacatagagg gatctgctaa gagctggaaa aagtttgtgg agtccgaatg 961 tcctgagaaa gagaagctcc cacaggagtg gaagaacaag acagccctgc agcgcctctg 1021 catgctgaga gccatgcggc ccgaccggat gacctatgct ttgcgagatt ttgttgaaga 1081 gaagttagga agcaaatacg tggtgggaag agccctagat tttgcaacct catttgaaga 1141 atcgggacca gccactccta tgtttttcat cctgtctcca ggggtggacc cactgaagga 1201 tgtagaaagt caaggaagaa aacttggata caccttcaac aatcagaact ttcacaacgt 1261 gtctttgggg caaggacagg aagtggtggc tgaggctgcg ctggacctcg ctgccaagaa 1321 aggtcactgg gttattttgc agaacattca cctggtggcc aagtggctca gcaccctgga 1381 gaagaagctg gaggagcaca gtgagaacag ccacccagag ttcagggtct tcatgagtgc 1441 agagccagca ccctcccctg agggccacat catcccccag ggcatcctgg agaactccat 1501 taagatcacc aatgagcccc ccacgggcat gcatgccaac ctgcacaagg ccctggacaa 1561 cttcactcag gacactctgg agatgtgttc tcgggagacg gagtttaaga gcatcctctt 1621 tgctctttgt tacttccatg cggtggtggc agaaagacga aaatttgggc cccagggatg 1681 gaatcgctca taccccttta acactggaga cctcactatc tctgtgaatg tcctctacaa 1741 cttcctggag gccaacgcaa aggtccccta tgatgatttg cgctacctgt ttggagagat 1801 catgtatgga ggccatatca cagatgactg ggacagaaga ctctgcagaa cctacctggg 1861 ggaattcatt cgaccagaaa tgttagaagg agaactgtct ttggccccag ggttcccact 1921 cccaggcaac atggactaca atggttatca tcagtacatc gatgctgagc tgcccccaga 1981 atccccctac ctctatggcc tccacccgaa cgcagagatt ggcttcctga cccaaacctc 2041 agaaaagctc ttccgcactg tgctggagct gcagcctcgg gacagccagg ccagagacgg 2101 agcgggcgcc acaagagaag aaaaggtcaa ggcacttctg gaagaaatat tggagcgggt 2161 gacagacgag tttaacatcc cagaactgat ggccaaagtg gaggagcgca ccccttacat 2221 tgtagttgcc ttccaggagt gtggccggat gaatatcctc accagagaga ttcagcgctc 2281 actgagggag ctggagctcg gcttaaaggg ggagctgact atgaccagcc acatggagaa 2341 cttacagaat gccctgtact tcgatatggt gccagagtcc tgggctagac gagcctaccc 2401 ttccacagca ggcctggcag cccggtttcc agacctcctc aacagaatca aggagctaga 2461 ggcttggacg ggtgacttta caatcccctc cactgtgtgg ctgacaggct tcttcaaccc 2521 ccagtcgttc ctgactgcca tcatgcagtc cacggctcgc aagaatgagt ggccactgga 2581 ccagatggcc ctgcaatgtg acatgacgaa gaagaacaga gaagagttta ggagtcctcc 2641 tcgggaaggg gcctacatcc atggcctctt catggaaggt gcctgctggg acacacaggc 2701 tgggatcatt acagaggcaa agctgaagga tctgacaccc cctatgcctg tgatgttcat 2761 caaggccatt cctgcagata agcaggactg ccgcagtgtc tattcctgtc ctgtgtacaa 2821 gactagtcag cggggaccca cctacgtgtg gactttcaac ctgaagacta aggaaaaccc 2881 atccaagtgg gttctggctg gagtagcctt gcttctccag atttagcatc ctgcagagcc 2941 accgagaaaa taaaaaagct gggcttggag gctgcctaga gggacaggtg ggtgaagggt 3001 caccacagac acttagaacg gtaagaaacc atgagcactc acaattctgt agaattcctc 3061 tagggaactt ggagaggtgt gcctaaggtg aggctgagct gaaggaatgt gggcccaggt 3121 ttcttaataa aatgatttac tcttcaaaaa aaaaaaaaaa aaa // LOCUS HSDYRK3 2141 bp RNA PRI 08-JAN-1998 DEFINITION Homo sapiens mRNA for protein kinase, Dyrk3. ACCESSION Y12735 NID g2765228 KEYWORDS dual specificity protein kinase; Dyrk3 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2141) AUTHORS Becker,W., Wetzel,K. and Joost,H.G. JOURNAL Unpublished REFERENCE 2 (bases 1 to 2141) AUTHORS Becker,W. TITLE Direct Submission JOURNAL Submitted (22-APR-1997) W. Becker, Inst.f Pharmakologie u. Toxikologie, Wendlingweg 2, D- 52057 Aachen, FRG FEATURES Location/Qualifiers source 1..2141 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /tissue_type="brain" /dev_stage="foetus" /map="q32" gene 253..1914 /gene="Dyrk3" CDS 253..1914 /gene="Dyrk3" /note="dual specificity protein kinase" /codon_start=1 /product="Dyrk3 protein" /db_xref="PID:e1227574" /db_xref="PID:g2765229" /translation="MMIDETKCPPCSNVLCNPSEPPPPRRLNMTAEQFTGDHTQHFLD GGEMKVEQLFQEFGNRKSNTIQSDGISDSEKCSPTVSQGKSSDCLNTVKSNSSSKAPK VVPLTPEQALKQYKHHLTAYEKLEIINYPEIYFVGPNAKKRHGVIGGPNNGGYDDADG AYIHVPRDHLAYRYEVLKIIGKGSFGQVARVYDHKLRQYVALKMVRNEKRFHRQAAEE IRILEHLKKQDKTGSMNVIHMLESFTFRNHVCMAFELLSIDLYELIKKNKFQGFSVQL VRKFAQSILQSLDALHKNKIIHCDLKPENILLKHHGRSSTKVIDFGSSCFEYQKLYTY IQSRFYRAPEIILGSRYSTPIDIWSFRCILAELLTGQPLFPGEDEGDQLACMMELLGM PPPKLLEQSKRAKYFINSKGIPRYCSVTTQADGRVVLVGGRSRRGKKRGPPGSKDWGT ALKGCDDYLFIEFLKRCLHWDPSARLTPAQALRHPWISKSVPRPLTTIDKVSGKRVVN PASAFQGLGSKLPPVVGIANKLKANLMSETNGSIPLCSVLPKLIS" polyA_site 2141 BASE COUNT 597 a 485 c 530 g 529 t ORIGIN 1 cgggagcgaa agtgcgctga gctgcagtgt ctggtcgaga gtacccgtgg gagcgtcgcg 61 ccgcggaggc agccgtcccg gcgtaggtgg cgtggccgac cggaccccca actggcgcct 121 ctccccgagc ggggtcccga gctaggagat gggaggcaca gctcgtgggc ctgggcggaa 181 ggatgcgggg ccgcctgggg ccgggctccc gccccagcag cggagttggg ggatggtgtc 241 tatgacacct tcatgatgat agatgaaacc aaatgtcccc cctgttcaaa tgtactctgc 301 aatccttctg aaccacctcc acccagaaga ctaaatatga ccgctgagca gtttacagga 361 gatcatactc agcacttttt ggatggaggt gagatgaagg tagaacagct gtttcaagaa 421 tttggcaaca gaaaatccaa tactattcag tcagatggca tcagtgactc tgaaaaatgc 481 tctcctactg tttctcaggg taaaagttca gattgcttga atacagtaaa atccaacagt 541 tcatccaagg cacccaaagt ggtgcctctg actccagaac aagccctgaa gcaatataaa 601 caccacctca ctgcctatga gaaactggaa ataattaatt atccagaaat ttactttgta 661 ggtccaaatg ccaagaaaag acatggagtt attggtggtc ccaataatgg agggtatgat 721 gatgcagatg gggcctatat tcatgtacct cgagaccatc tagcttatcg atatgaggtg 781 ctgaaaatta ttggcaaggg gagttttggg caggtggcca gggtctatga tcacaaactt 841 cgacagtacg tggccctaaa aatggtgcgc aatgagaagc gctttcatcg tcaagcagct 901 gaggagatcc ggattttgga gcatcttaag aaacaggata aaactggtag tatgaacgtt 961 atccacatgc tggaaagttt cacattccgg aaccatgttt gcatggcctt tgaattgctg 1021 agcatagacc tttatgagct gattaaaaaa aataagtttc agggttttag cgtccagttg 1081 gtacgcaagt ttgcccagtc catcttgcaa tctttggatg ccctccacaa aaataagatt 1141 attcactgcg atctgaagcc agaaaacatt ctcctgaaac accacgggcg cagttcaacc 1201 aaggtcattg actttgggtc cagctgtttc gagtaccaga agctctacac atatatccag 1261 tctcggttct acagagctcc agaaatcatc ttaggaagcc gctacagcac accaattgac 1321 atatggagtt ttcgctgcat ccttgcagaa cttttaacag gacagcctct cttccctgga 1381 gaggatgaag gagaccagtt ggcctgcatg atggagcttc tagggatgcc accaccaaaa 1441 cttctggagc aatccaaacg tgccaagtac tttattaatt ccaagggcat accccgctac 1501 tgctctgtga ctacccaggc agatgggagg gttgtgcttg tggggggtcg ctcacgtagg 1561 ggtaaaaagc ggggtccccc aggcagcaaa gactggggga cagcactgaa agggtgtgat 1621 gactacttgt ttatagagtt cttgaaaagg tgtcttcact gggacccctc tgcccgcttg 1681 accccagctc aagcattaag acacccttgg attagcaagt ctgtccccag acctctcacc 1741 accatagaca aggtgtcagg gaaacgggta gttaatcctg caagtgcttt ccagggattg 1801 ggttctaagc tgcctccagt tgttggaata gccaataagc ttaaagctaa cttaatgtca 1861 gaaaccaatg gtagtatacc cctatgcagt gtattgccaa aactgattag ctagtggaca 1921 gagatatgcc cagagatgca tatgtgtata tttttatgat cttacaaacc tgcaaatgga 1981 aaaaatgcaa gcccattggt ggatgttttt gttagagtag acttttttta aacaagacaa 2041 aacattttta tatgattata aaagaattct tcaagggcta attacctaac cagcttgtat 2101 tggccatctg gaatatgcat taaatgactt tttataggtc a // LOCUS HSDYSTRAO 2247 bp RNA PRI 08-JAN-1998 DEFINITION H.sapiens mRNA for dystrophin-associated protein A0. ACCESSION Y12712 NID g2765226 KEYWORDS dystrophin-associated protein A0. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2247) AUTHORS Puca,A.A. and Nigro,V. JOURNAL Unpublished REFERENCE 2 (bases 1 to 2247) AUTHORS Puca,A.A. TITLE Direct Submission JOURNAL Submitted (22-APR-1997) A.A. Puca, Seconda Universita di Napoli, Istituto di Patologia Generale, Larghetto S.Aniello a Caponapoli 2, Napoli 80138, ITALY FEATURES Location/Qualifiers source 1..2247 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="2" /map="p23" /dev_stage="adult" /tissue_type="brain" CDS 184..1887 /codon_start=1 /product="dystrophin-associated protein AO" /db_xref="PID:e315132" /db_xref="PID:g2765227" /translation="MIEESGNKRKTMAEKRQLFIEMRAQNFDVIRLSTYRTACKLRFV QKRCNLHLVDIWNMIEAFRDNGLNTLDHTTEISVSRLETVISSIYYQLNKRLPSTHQI SVEQSISLLLNFMIAAYDSEGRGKLTVFSVKAMLATMCGGKMLDKLRYVFSQMSDSNG LMIFSKFDQFLKEVLKLPTAVFEGPSFGYTEHSVRTCFPQQRKIMLNMFLDTMMADPP PQCLVWLPLMHRLAHVENVFHPVECSYCRCESMMGFRYRCQQCHNYQLCQNCFWRGHA GGPHSNQHQMKEHSSWKSPAKKLSHAISKSLGCVPTREPPHPVFPEQPEKPLDLAHIV PPRPLTNMNDTMVSHMSSGVPTPTKSVLDSPSRLDEEHRLIARYAARLAAEAGNVTRP PTDLSFNFDANKQQRQLIAELENKNREILQEIQRLRLEHEQASQPTPEKAQQNPTLLA ELRLLRQRKDELEQRMSALQESRRELMVQLEELMKLLKEEEQKQAAQATGSPHTSPTH GGGRPMPMPVRSTSAGSTPTHCPQDSLSGVGGDVQEAFAQAEEGAEEEEEKMQNGKDR G" BASE COUNT 591 a 606 c 612 g 438 t ORIGIN 1 accaagcttg gcacgagggc ggcgcgagcc gggcgctgcg aacgttcgcc gcgggggtgg 61 ctccggggcc tgagtaggcg ctgccgctgc ctcagccgag ggggctgggc cggagcgtgc 121 ggaggagtga ggccgcagga gaccttcccg acgacccctg ctccggcggg gaagtgagca 181 aggatgattg aggaaagtgg gaacaagcgg aagaccatgg cagagaagag gcagctgttc 241 atagaaatgc gtgctcagaa ttttgatgtc atacgactat caacttacag aacagcctgc 301 aaattacgat ttgtacaaaa acgatgcaac cttcatcttg ttgatatctg gaacatgatt 361 gaagccttcc gagacaatgg ccttaataca ctggaccata ccaccgagat cagtgtgtcc 421 cgcctcgaaa ctgtcatctc ctccatctac tatcagttga acaagcgcct tccttctact 481 caccaaatta gtgtggaaca atctatcagc ctcctcctca actttatgat tgctgcatat 541 gacagtgagg gccgaggcaa gttgacggta ttttcagtta aagctatgtt agcaaccatg 601 tgtggtggaa aaatgctgga caaattgaga tatgttttct cccagatgtc agattccaat 661 ggcttaatga tatttagcaa gtttgaccag tttctgaagg aagttctgaa gctcccaaca 721 gctgtctttg aagggccatc ttttggttac acagagcact cagtccgcac ctgttttcca 781 cagcagagaa agataatgct aaatatgttt ttagacacaa tgatggctga ccctcctccc 841 cagtgccttg tctggctacc tctcatgcac aggcttgccc atgttgagaa tgtcttccat 901 cccgtggagt gctcctactg ccgatgtgag agtatgatgg gtttccggta ccgatgccag 961 cagtgccaca actatcagct ctgccagaat tgcttttggc gtggccatgc cggcggccct 1021 cacagcaacc agcaccagat gaaggagcat tcctcttgga aatctcctgc aaagaagctg 1081 agccatgcaa ttagtaaatc tttggggtgt gtacccacga gagaaccccc gcatcctgtt 1141 tttcctgagc aaccagagaa accacttgac cttgcacata tagttcctcc tcgccctctg 1201 actaatatga atgacaccat ggttagccac atgtcctctg gagtgcccac tcccaccaag 1261 agtgttctgg acagtcctag ccgactggat gaggaacacc gtcttatagc tcgctatgct 1321 gcccggctgg ctgcagaagc aggaaacgtg actcgtcctc ccactgactt gagctttaac 1381 tttgatgcca acaaacaaca aagacagctt attgcagaac tggaaaacaa aaacagagag 1441 atcctgcagg agattcagcg tctccgcctg gaacacgagc aggcctccca gcccacccct 1501 gagaaggcac agcagaaccc cacgctgctg gcagagctgc ggctgctgag gcaaaggaag 1561 gatgaactgg agcagaggat gtcggccctg caggagagca ggcgggagct gatggtccag 1621 ctggaagagc tgatgaagtt gctgaaggag gaagagcaaa agcaggcagc tcaggccaca 1681 gggtcaccac atacatcgcc cacccatgga ggcggccggc caatgcccat gccagtgcgc 1741 tccacgtctg ccggctccac ccccacccac tgtccgcagg actcgctgag cggagtcggg 1801 ggagacgtgc aggaggcctt cgcacaagca gaggaaggtg cagaggaaga agaagagaag 1861 atgcagaatg ggaaagacag aggttagcag aggagccgga cacagaggaa gctcaggcac 1921 agaggacgag gagcaagctg gcgccgacat ggcgaaggca aggtcttccc ccagaggcac 1981 attcctctcc atctttccac cgcacacctg gaccaggctt gcaggctgcc agacgtcact 2041 ccacccgcca gggagagggg agccagagcc ggtgggaagc ggggaggggc tgcgtggcac 2101 agctagtggg cctccccctg cacagccctg catgtactag caccttcatc actcccctca 2161 gggcatggtc tcatctccgc atcaggaatt cacctggagg ttgaaaagag aaaagaaaaa 2221 gcaccaaaaa aaaaaaaaaa aaaaaaa // LOCUS HSE14 5895 bp RNA PRI 01-JUL-1996 DEFINITION H.sapiens mRNA for E14 protein. ACCESSION X97186 NID g1418773 KEYWORDS E14 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5895) AUTHORS Cooper,P.R., Byrd,P.J. and Taylor,A.M.R. JOURNAL Unpublished REFERENCE 2 (bases 1 to 5895) AUTHORS Byrd John,P. TITLE Direct Submission JOURNAL Submitted (09-APR-1996) P. Byrd John, University of Birmingham, CRC Institute for Cancer Studies, Edgbaston, Birmingham, B15 2TJ, UK REFERENCE 3 (bases 1 to 5895) AUTHORS Byrd,P.J., McConville,C.M., Cooper,P.R., Parkhill,J., McGuire,G., Stankovic,T., Thick,J. and Taylor,A.M.R. TITLE Sequencing of the 5' half of the gene for Ataxia telangiectasia identifies additional mutations and a bidirectional promoter JOURNAL Unpublished FEATURES Location/Qualifiers source 1..5895 /organism="Homo sapiens" /db_xref="taxon:9606" /map="11q22-23" gene 35..4318 /gene="E14" CDS 35..4318 /gene="E14" /codon_start=1 /db_xref="PID:e238806" /db_xref="PID:g1418774" /translation="MLLPSDVARLVLGYLQQENLISTCQTFILESSDLKEYAEHCTDE GFIPACLLSLFGKNLTTILNEYVAMKTKETSNNVPAIMSSLWKKLDHTLSQIRSMQSS PRFAGSQRARTRTGIAEIKRQRKLASQTAPASAELLTLPYLSGQFTTPPSTGTQVTRP SGQISDPSRSYFVVVNHSQSQDTVTTGEALNVIPGAQEKKAHASLMSPGRRKSESQRK STTLSGPHSTIRNFQDPNAFAVEKQMVIENAREKILSNKSLQEKLAENINKFLTSDNN IAQVPKQTDNNPTEPETSLDEFLGLPSEIHMSEEAIQDILEQTESDPAFQALFDLFDY GKTKNNKNISQSISSQPMESNPSIVLADETNLAVKGSFETEESDGQSGQPAFCTSYQN DDPLNAMKNSNNHDVLRQEDQENFSQISTSIQKKAFKTAVPTEQKCDIDITFESVPNL NDFNQRGNSNAECNPHCAELNTNQMSTETEMAIGIEKNSLSSNVPSESQLQPDQPDIP ITSFVSLGCEANNENLILSGKSSQLLSQDTSLTGKPSKKSQFCENSNDTVKLKINFHG SKSSDSSEIHKSKIEINVLEPVMSQLSNCQDNSCLQSEILPVSVESSHLNVSGQIEIH LGDSLSSTKQPSNDSASVELNHTENEAQASKSENSQEPSSSVKEENTIFLSLGGNANC EKVALTPPEGTPVENSHSLPPESVCSSVGDSHPESQNTDDKPSSNNSAEIDASNIVSL KVIISDDPFVSSDTELTSAVSSINGENLPTIILSSPTKSPTKNAELVKCLSSEETVGA VVYAEVGDSASMEQSLLTFKSEDSAVNNTQNEDGIAFSANVTPCVSKDGGYIQLMPAT STAFGNSNNILIATCVTDPTALGTSVSQSNVVVLPGNSAPMTAQPLPPQLQTPPRSNS VFAVNQAVSPNFSQGSAIIIASPVQPVLQGMVGMIPVSVVGQNGNNFSTPPREVLHMP VTAPVCNRSIPQFPAPPKSQKAQGLRNKPCIGKQVNNLVDSSGHSVGCHAQKTEVSDK SIATDLGKKSEETTVPFPEESIVPAAKPCHRRVLCFDSTTAPVANTQGPNHKMVSQNK ERNAVSFPNLDSPNVSSTLKPPSNNAIKREKEKPPLPKILSKSESAISRHTTIRETQS EKKVSPTEIVLESFHKATANKENELCSDVERQKNPENSKLSIGQRNGGLRSEKSIASL QEMTKKQGTSSNNKNVLSVGTAVKDLKQEQTKSASSLITTEMLQDIQRHSSVSRLADS SDLPVPRTPGSGAGEKHKEEPIDIIKAPSSRRFSEDSSTSKVMVPPVTPDLPACSPAS ETGSENSVNMAAHTLMILSRAAISRTTSATPLKDNTQQFRASSRSTTKKRKIEELDER ERNSRPSSKNLTNSSIPMKKKKIKKKKLPSSFPAGMDVDKFLLSLHYDE" conflict replace(74,"a") /gene="E14" /citation=[3] BASE COUNT 1986 a 1153 c 1046 g 1710 t ORIGIN 1 gttccttatt gtggttcctg ctgtggtttt gatcatgttg ttaccctcgg acgtagcccg 61 gcttgtattg ggttacttac agcaagaaaa cctcatttct acctgccaga cttttatttt 121 ggaaagttca gatttaaaag aatatgcaga acattgtaca gatgaagggt ttattccagc 181 ctgcttactg tccttatttg gaaaaaactt gacaacaatt ttaaatgagt atgtagctat 241 gaaaacaaaa gaaacatcaa ataatgtccc agcaataatg tcatctctat ggaagaaatt 301 ggaccataca ctttctcaga tcaggagcat gcaaagttcc ccaaggtttg ctggcagtca 361 gagagcccga acgagaactg gaattgcaga aatcaaacgg cagagaaagc ttgcatctca 421 aacagctcca gccagtgcag agttgctcac tttaccttac ctttcaggac agtttaccac 481 tcctccttcc acaggtacac aggttactcg accaagtggc caaatttcag atccatcgag 541 gtcatatttt gtagtggtca accactcaca gtcacaagat actgtaacca ctggagaagc 601 tttaaatgtc attcctggtg ctcaggaaaa gaaagcacat gccagtttaa tgtctcccgg 661 tagacgcaaa agtgaatctc agagaaaaag taccactttg tctggccctc attcaacaat 721 acggaatttc caagatccaa acgcttttgc agtagaaaaa caaatggtta ttgaaaatgc 781 acgagaaaaa atactaagca acaaatctct tcaagaaaag ctagcagaaa acataaataa 841 atttttaact agtgataaca atattgccca agtacctaag caaacagata acaaccctac 901 ggagccagag acttcacttg atgaattcct aggacttccg agtgaaattc acatgtctga 961 agaagctata caggacatat tggaacagac agaatcagac ccagcatttc aggcactctt 1021 tgatctcttt gactatggca aaacaaagaa taataaaaat atatcacaaa gtatttccag 1081 tcaacctatg gaatccaatc ccagtatagt cttagcagat gaaactaatc tagcagttaa 1141 aggttctttt gaaacagaag aatctgatgg tcagtctggt cagcccgctt tttgtacatc 1201 ctatcagaat gatgacccat taaatgctat gaagaatagc aacaaccatg atgtgcttag 1261 acaagaagac caggaaaatt tttcccaaat aagtaccagc atacagaaaa aggcctttaa 1321 aacagctgta cccactgaac agaagtgtga cattgacatt acctttgagt ccgtgcctaa 1381 tttgaatgac tttaaccaaa gagggaattc taatgctgaa tgtaatccac attgtgctga 1441 attaaacacc aatcagatgt ccactgaaac tgaaatggct atagggattg aaaagaactc 1501 tttgtcttca aatgtaccga gtgaatctca gttacagcct gatcagcctg atataccaat 1561 aacttcattt gtttcacttg gttgtgaagc taacaatgaa aacttaattc tctctgggaa 1621 gagttctcaa cttttatccc aagatacttc attaactgga aagccatcta aaaaaagtca 1681 attttgtgaa aattctaatg atacagtaaa acttaaaatt aattttcatg gttccaagtc 1741 atcagattct agtgaaattc acaagagtaa aatagaaatt aatgtgttag aaccagttat 1801 gtcacagcta tcaaattgcc aagataattc ttgtcttcaa agtgaaatac tacctgtgtc 1861 tgttgaaagt tcacatttaa atgtatctgg acaaatagaa attcatcttg gagattcgct 1921 gtcttctact aaacaaccat ctaatgattc agcatctgtt gagttaaatc atacagaaaa 1981 tgaagctcag gcatccaagt ctgagaattc acaggagcct tcatcttctg taaaagaaga 2041 gaatactatt tttctctctt taggaggaaa tgctaactgt gagaaagttg cactgacgcc 2101 tccagaaggc actcctgtag aaaacagtca ctctcttcct ccagaatctg tgtgttcttc 2161 agtgggagat tctcaccctg agtcccaaaa tactgatgat aaaccttcta gcaacaactc 2221 agcagagata gatgcatcaa atatcgtctc tctcaaagtt atcattagtg atgatccatt 2281 tgtttcctca gatactgaac ttaccagtgc tgtttctagt attaatggag aaaacctgcc 2341 aactataatc ttgtcttctc ctactaaatc acctactaaa aatgcagaac tagttaaatg 2401 cctatcttca gaagaaactg taggtgctgt tgtatatgcc gaagtagggg attcagcctc 2461 aatggaacag agtcttttaa cattcaaatc tgaagactct gcagtaaaca atactcagaa 2521 tgaagatggc attgcttttt cagctaatgt tacaccatgt gtttccaagg atggaggata 2581 tatacagttg atgccagcca caagcacagc ttttggcaat tcaaataaca ttctgatagc 2641 tacctgtgtg actgatccaa cagcgttagg aacatctgta agtcagtcta atgtagtggt 2701 gttgcctgga aattctgcac ctatgactgc tcaacctcta ccacctcagt tacagacacc 2761 accaaggtca aacagtgtat ttgctgtcaa ccaagctgtg tcaccaaact tttcacaagg 2821 atctgccata ataattgcct ctccagtcca gcctgtactc caaggaatgg tagggatgat 2881 cccagtatct gtggttggac agaatggaaa taacttttct actcctcctc gggaggttct 2941 tcatatgcct gtgacagcac ctgtatgcaa tagaagtatc cctcaattcc ccgcccctcc 3001 aaaatctcag aaggctcagg gactaagaaa caagccttgt ataggaaaac aagtaaataa 3061 tttggtggat tcgtcaggtc attcagttgg atgtcatgca caaaaaactg aagtttctga 3121 caaaagtatt gccacagatc ttgggaaaaa atcagaagaa accacagttc ccttcccaga 3181 agagagtata gttccagctg ctaaaccatg ccacagacgt gtactctgtt tcgacagcac 3241 tactgctcct gtggcaaata cgcaggggcc aaaccataag atggtgtccc aaaacaaaga 3301 aaggaatgca gtctcttttc ctaatcttga ctcacccaat gtgtcctcca ccttaaaacc 3361 cccttctaat aatgctatca aaagagagaa agagaagcct cctctgccta agattttatc 3421 taaatcggaa agtgccatta gccggcatac caccataaga gaaactcaat cagaaaagaa 3481 agtttcacca acagaaattg tgcttgaatc tttccataaa gcaacagcta ataaggagaa 3541 tgaattatgc agcgatgtag aaagacagaa aaatccagaa aattcaaaac tatctattgg 3601 gcagcgaaat gggggtttgc gaagtgagaa atctatagct tcactgcaag aaatgaccaa 3661 aaaacaaggc acatcttcaa acaataaaaa tgtactttca gtaggtacag ctgtgaagga 3721 tctaaaacaa gaacaaacta aatccgccag ttctttgatt accacagaaa tgttacagga 3781 tatacagagg cacagctcag taagtaggct tgctgatagt agtgatttac ctgtgccccg 3841 gacacctggc tcaggggcag gggaaaaaca taaagaagaa cctatagata ttatcaaggc 3901 cccctctagt aggcgtttca gtgaagacag tagtacatca aaagtaatgg tccctcctgt 3961 caccccagac ttgcctgcct gcagccctgc cagtgaaaca ggaagtgaaa acagtgtaaa 4021 tatggctgcc cacacattaa tgattctctc cagggcagcc atttctagga ctacttcagc 4081 aactcctctg aaagataaca cacaacagtt tagagcatct tcaaggagca ccacaaaaaa 4141 gcggaaaatt gaggaattag atgaacgtga gcgaaactct cgtccttcta gtaaaaatct 4201 tacaaattca tcaataccaa tgaaaaagaa gaaaattaag aaaaagaagc ttcccagttc 4261 atttccagca ggaatggatg tagacaaatt tttgttatca ttgcattatg atgagtaaac 4321 attctggaca cataagaaca gtaggtgtta aaaactcata tcccttaaac tgagtgtagg 4381 gaatgggata ttgacagaat ctgaaagcat gacctgcact ttcattgtac tgaaacttca 4441 ctttatatct aaatcatgct gtttctgaaa cagcttccta gtttgtaaat agacttactt 4501 ggtatatttt tattttggga aaaacgtttg cagaaatgta agtaaagcca atctgcaaaa 4561 ctgtatagct ttgacaattc catattgtaa atactgtgta aatcttgttg aaatagaggt 4621 taaatcaacc taatgttctt acactgtgtt ttttgacttc ttatgatccc taattctgag 4681 acttactcat ctggaatagt ttctcacttg ttttggagga aaatgatgct tattcttata 4741 aattaccttc agtaactatt caataagaca tttaaataca caatcactaa gtactacaaa 4801 ataataacca cctcaactga tagcaaaata tctaattaga aatcaacagt ttgatgagtt 4861 tttttcctag agtgaatact acccctttct tataattgct gtaagtatta tatttctgga 4921 atttcacact ttacctactc tgatcactct gttctctttg ttttaaagag agaattttgt 4981 aaaccattta ttgaatgttc tgatctttca tttataactt aagatttttt acagaacttt 5041 aagtggttag actattgagc cataaaattt tggtttgtag tagagatttc tcggtatgta 5101 cttttcccct aaatgctggg gatttaaaga attgcactaa ttcatagaac tctttacatt 5161 gtagtgacca atgcacatat aatttaaatc ttgtgttcta tgtgagagtt tatgtggagg 5221 attatcttct ggtgttgtgt ccatcataag atgtaaaaag agaaatcata gttttatatt 5281 ttagtttctc ctacaatggg tatgttaaat aactatatct atacttagat atagtgcatt 5341 gtctgtgtcc cttaaagtgt tacaaagagt ttctctcaaa aggtccttaa ggagagtgag 5401 agatgagaga ggtgccttct ctgtattaga ctcacacaca tcagtccagt tgaagaattg 5461 gtttaaactt ttaaatgaga acagaaatta taattagtac ctgaccaatt gtgcctagat 5521 attaaagtat gttctcttta gtactttccg ttcaattaaa tatcagtata ttagggattt 5581 ttttctacag gtgtatattt tgttgaagag tcactttcct caaccatttt tttgtactgg 5641 gcctttaaaa aaataaatcc atccaacttt aacacaacat ttctcagtgt agaaatcatg 5701 tcttcttaat tgctgaacct tactgcaaaa acttgtgatg ttaagaaatt tgtatggtgt 5761 ggcagtggtc tattcctaag gaactaaata tcatatagtt aatgtttatt taactcagct 5821 tgagactgta ctacagttag gtttgaataa atattttcat taacttcaaa aaaaaaaaaa 5881 aaaaaaaaaa aaaaa // LOCUS HSE2 1110 bp RNA PRI 22-MAR-1995 DEFINITION Human mRNA for T-cell surface glycoprotein E2. ACCESSION X16996 NID g30948 KEYWORDS glycoprotein E2; MIC2 gene; pseudoautosome. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1110) AUTHORS Gelin,C., Aubrit,F., Phalipon,A., Raynal,B., Cole,S., Kaczorek,M. and Bernard,A. TITLE The E2 antigen, a 32 kd glycoprotein involved in T-cell adhesion processes, is the MIC2 gene product JOURNAL EMBO J. 8 (11), 3253-3259 (1989) MEDLINE 90059916 COMMENT E2 is the product of the MIC2 gene which is the only pseudoautosomal gene sofar described in man. FEATURES Location/Qualifiers source 1..1110 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="thymic" /clone_lib="lambda gt11" /clone="BC1" CDS 124..681 /note="pre-pro polypeptide (AA -22 to 163)" /codon_start=1 /db_xref="PID:g30949" /db_xref="SWISS-PROT:P14209" /translation="MARGAALALLLFGLLGVLVAAPDGGFDLSDALPDNENKKPTAIP KKPSAGDDFDLGDAVVDGENDDPRPPNPPKPMPNPNPNHPSSSGSFSDADLADGVSGG EGKGGSDGGGSHRKEGEEADAPGVIPGIVGAVVVAVAGAISSFIAYQKKKLCFKENAE QGEVDMESHRNANAEPAVQRTLLEK" sig_peptide 124..189 /note="signal peptide (AA -22 to -1)" mat_peptide 190..678 /note="mature polypeptide (AA 1-163)" misc_feature 1083..1090 /note="polyA signal" BASE COUNT 246 a 312 c 313 g 239 t ORIGIN 1 gggcgggcct cacccgcttc gagtcctcgg gcttccccca cccggcccgt gggggagtat 61 ctgtcctgcc gccttcgccc acgccctgca ctccgggacc gtccctgcgc gctctgggcg 121 accatggccc gcggggctgc gctggcgctg ctgctcttcg gcctgctggg tgttctggtc 181 gccgccccgg atggtggttt cgatttatct gatgcccttc ctgacaatga aaacaagaaa 241 cccactgcaa tccccaagaa acccagtgct ggggatgact ttgacttagg agatgctgtt 301 gttgatggag aaaatgacga cccacgacca ccgaacccac ccaaaccgat gccaaatcca 361 aaccccaacc accctagttc ctccggtagc ttttcagatg ctgaccttgc ggatggcgtt 421 tcaggtggag aaggaaaagg aggcagtgat ggtggaggca gccacaggaa agaaggggaa 481 gaggccgacg ccccaggcgt gatccccggg attgtggggg ctgtcgtggt cgccgtggct 541 ggagccatct ctagcttcat tgcttaccag aaaaagaagc tatgcttcaa agaaaatgca 601 gaacaagggg aggtggacat ggagagccac cggaatgcca acgcagagcc agctgttcag 661 cgtactcttt tagagaaata gaagattgtc ggcagaaaca gcccaggcgt tggcagcagg 721 gttagaacag ctgcctgagg ctcctccctg aaggacacct gcctgagagc agagatggag 781 gccttctgtt cacggcggat tctttgtttt aatcttgcga tgtgctttgc ttgttgctgg 841 gcggatgatg tttactaacg atgaatttta catccaaagg gggataggca cttggacccc 901 cattctccaa ggcccggggg ggcggtttcc catgggatgt gaaaggctgg ccattattaa 961 gtccctgtaa ctcaaatgtc aaccccaccg aggcaccccc ccgtccccca gaatcttggc 1021 tgtttacaaa tcacgtgtcc atcgagcacg tctgaaaccc ctggtagccc cgacttcttt 1081 ttaattaaaa taaggtaagc ccttcaattt // LOCUS HSE2F3TF 1515 bp RNA PRI 06-MAR-1997 DEFINITION H.sapiens mRNA for E2F-3 transcription factor. ACCESSION Y10479 NID g1783322 KEYWORDS E2F-3 transcription factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1515) AUTHORS Lees,J.A., Saito,M., Vidal,M., Valentine,M., Look,T., Harlow,E., Dyson,N. and Helin,K. TITLE The retinoblastoma protein binds to a family of E2F transcription factors JOURNAL Mol. Cell. Biol. 13 (12), 7813-7825 (1993) MEDLINE 94067142 REFERENCE 2 (bases 1 to 1515) AUTHORS Helin,K. TITLE Direct Submission JOURNAL Submitted (13-JAN-1997) K. Helin, European Institute Of Oncology, Department Of Experimental Oncology, Via Ripamonti 435, 20141 Milan, ITALY FEATURES Location/Qualifiers source 1..1515 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="NALM-6" /cell_type="pre-B cells" CDS 67..1464 /codon_start=1 /product="E2F-3 transcription factor" /db_xref="PID:e291067" /db_xref="PID:g1783323" /translation="MRKGIQPALEQYLVTAGGGEGAAVVAAAAAASMDKRALLASPGF AAAAAAAAAPGAYIQILTTNTSTTSCSSSLQSGAVAAGPLLPSAPGAEQTAGSLLYTT PHGPSSRAGLLQQPPALGRGGSGGGGGPPAKRRLELGESGHQYLSDGLKTPKGKGRAA LRSPDSPKTPKSPSEKTRYDTSLGLLTKKFIQLLSQSPDGVLDLNKAAEVLKVQKRRI YDITNVLEGIHLIKKKSKNNVQWMGCSLSEDGGMLAQCQGLSKEVTELSQEEKKLDEL IQSCTLDLKLLTEDSENQRLAYVTYQDIRKISGLKDQTVIVVKAPPETRLEVPDSIES LQIHLASTQGPIEVYLCPEETETHSPMKTNNQDHNGNIPKPASKDLASTNSGHSDCSV SMGNLSPLASPANLLQQTEDQIPSNLEGPFVNLLPPLLQEDYLLSLGEEEGISDLFDA YDLEKLPLVEDFMCS" BASE COUNT 429 a 425 c 370 g 291 t ORIGIN 1 aaataataaa gaaattgaaa acaatacatt aatataccat aacactaaaa agagcaggag 61 cgagagatga gaaagggaat ccagcccgct ctggagcagt acctggtgac cgccgggggt 121 ggggaggggg cggctgtcgt cgccgccgcc gctgcagcct ccatggacaa aagggcactg 181 ctagccagcc ccggcttcgc cgccgccgcc gccgctgccg ccgccccggg cgcgtacatc 241 cagatcctca ccacgaacac ttccaccacc tcctgttcct cctccctcca aagcggcgcc 301 gtagccgccg gccccctcct ccccagtgcc cccggcgcgg agcagaccgc cggcagcctc 361 ctctacacca cgccgcacgg accctccagc agagccgggc tgctgcagca gccaccagcg 421 ctgggacgcg gcggcagcgg cggcggcggc ggccctccgg caaagcgaag gctggagcta 481 ggagaaagcg gtcatcagta cctctcagat ggtttaaaaa cccccaaggg caaaggaaga 541 gctgcactac gaagtccaga tagtccaaaa actccaaaat ctccctcaga aaaaacgcgg 601 tatgatacgt ctcttggtct gctcaccaag aagttcattc agctcctgag ccagtcaccc 661 gatggggtat tggatttgaa caaggcagca gaagtgctaa aagtgcaaaa gagaaggatt 721 tatgatatca ccaacgttct ggaaggcatc cacctcatta agaagaagtc taaaaacaac 781 gtccaatgga tgggctgcag tctgtctgag gatgggggca tgctggccca gtgtcaaggc 841 ctgtcaaaag aagtgaccga gctcagtcag gaagagaaga aattagatga actgatccaa 901 agctgcaccc tggacctcaa actgttaacc gaggattcag agaatcaaag gttagcttat 961 gttacatatc aagatattcg aaaaattagt ggccttaaag accaaactgt tatagttgtg 1021 aaagcccctc cagaaacaag acttgaagtg cctgactcaa tagagagcct acaaatacat 1081 ttggcaagta cccaagggcc cattgaggtt tacttatgtc cagaagagac tgaaacacac 1141 agtccaatga aaacaaacaa ccaagaccac aatgggaata tccctaaacc cgcttccaaa 1201 gacttggctt caaccaactc aggacatagc gattgctcag tttctatggg aaacctttct 1261 cctctggcct ccccagccaa cctcttacag cagactgagg accaaattcc ttccaaccta 1321 gaaggaccgt ttgtgaactt actgcctccc ctgctgcaag aggactatct cctgagcctc 1381 ggggaggagg aaggcatcag cgatctcttc gatgcttacg atttggaaaa gctcccactg 1441 gtggaagact tcatgtgtag ttgattatgc ttcgtgtgaa ctctccttaa aaaccgatat 1501 ttttttatca tggaa // LOCUS HSE2P1 921 bp RNA PRI 30-DEC-1992 DEFINITION H.sapiens mRNA for E2 protein. ACCESSION X53251 NID g30953 KEYWORDS E2 protein; ubiquitin-conjugating enzyme. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 921) AUTHORS Schneider,R., Eckerskorn,C., Lottspeich,F. and Schweiger,M. TITLE The human ubiquitin carrier protein E2(Mr = 17,000) is homologous to the yeast DNA repair gene RAD6 JOURNAL EMBO J. 9 (5), 1431-1435 (1990) MEDLINE 90228340 FEATURES Location/Qualifiers source 1..921 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="cDNA" /clone="U18" CDS 39..497 /codon_start=1 /product="E2 protein" /db_xref="PID:g30954" /db_xref="SWISS-PROT:P23567" /translation="MSTPARRRLMRDFKRLQEDPPVGVSGAPSENNIMQWNAVIFGPE GTPFEDGTFKLVIEFSEEYPNKPPTVRFLSKMFHPNVYADGSICLDILQNRWSPTYDV SSILTSIQSLLDEPNPNSPANSQAAQLYQENKREYEKRVSAIVEQSWNDS" polyA_signal 904..909 polyA_site 921 BASE COUNT 288 a 153 c 176 g 304 t ORIGIN 1 tttttttttt tcagactgac cgcggggcag ctgcggacat gtcgaccccg gcccggagga 61 ggctcatgcg ggatttcaag cggttacaag aggacccacc tgtgggtgtc agtggcgcac 121 catctgaaaa caacatcatg cagtggaatg cagttatatt tggaccagaa gggacacctt 181 ttgaagatgg tacttttaaa ctagtaatag aattttctga agaatatcca aataaaccac 241 caactgttag gtttttatcc aaaatgtttc atccaaatgt gtatgctgat ggtagcatat 301 gtttagatat ccttcagaat cgatggagtc caacatatga tgtatcttct atcttaacat 361 caattcagtc tctgctggat gaaccgaatc ctaacagtcc agccaatagc caggcagcac 421 agctttatca ggaaaacaaa cgagaatatg agaaaagagt ttcggccatt gttgaacaaa 481 gctggaatga ttcataatag acaactggtc tgttaatctt tttcatcatt gttgtgtata 541 atttacctct cattagaaag gctaacaaat tttaagtgcc acaggtttta aggattctgc 601 agaaaaaaaa gaaaaaagtc cttcagttta gaacctacaa aagcttgtgt atcttgatta 661 atgtactttt tattgcatgg tgtgaactaa gttattgctg cataaatttg taatatatcc 721 tgtttgtatt tttttccaag tgtataatgt tggtgtggag ttttcatgac agaatataca 781 cattttgtaa atctgtactt ttttcaaata ttgaatgcct tatttttgaa ttctttagat 841 ttttaaattg gagaaaagca cttaaagttt tttatatatg aatattacat gtaaagctgt 901 taaaatacat aacttcagtg c // LOCUS HSE4BP4RN 1923 bp RNA PRI 30-JUN-1993 DEFINITION H.sapiens E4BP4 gene. ACCESSION X64318 S38949 NID g30955 KEYWORDS E4BP4 gene; transcriptional repressor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1923) AUTHORS Cowell,I.G. TITLE Direct Submission JOURNAL Submitted (24-JAN-1992) I.G. Cowell, Oncology Group, Imperial Cancer Research Fund, Cyclotron Bldg, Royal Postgraduate Medical School, Du Cane Rd London, W12 0HS, UK REFERENCE 2 (bases 1 to 1923) AUTHORS Cowell,I.G., Skinner,A. and Hurst,H.C. TITLE Transcriptional repression by a novel member of the bZIP family of transcription factors JOURNAL Mol. Cell. Biol. 12 (7), 3070-3077 (1992) MEDLINE 92318924 FEATURES Location/Qualifiers source 1..1923 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="placental lambda gt11" gene 214..1602 /gene="E4BP4" CDS 214..1602 /gene="E4BP4" /note="product possesses binding site dependent transcriptional suppressing activity" /codon_start=1 /db_xref="PID:g30956" /translation="MQLRKMQTVKKEQASLDASSNVDKMMVLNSALTEVSEDSTTGED VLLSEGSVGKNKSSACRRKREFIPDEKKDAMYWEKRRKNNEAAKRSREKRRLNDLVLE NKLIALGEENATLKAELLSLKLKFGLISSTAYAQEIQKLSNSTAVYFQDYQTSKSNVS SFVDEHEPSMVSSSCISVIKHSPQSSLSDVSEVSSVEHTQESSVQGSCRSPENKFQII KQEPMELESYTREPRDDRGSYTASIYQNYMGNSFSGYSHSPPLLQVNRSSSNSPRTSE TDDGVVGKSSDGEDEQQVPKGPIHSPVELKHVHATVVKVPEVNSSALPHKLRIKAKAM QIKVEAFDNEFEATQKLSSPIDMTSKRHFELEKHSAPSMVHSSLTPFSVQVTNIQDWS LKSEHWHQKELSGKTQNSFKTGVVEMKDSGYKVSDPENLYLKQGIANLSAEVVSLKRL IATQPISASDSG" BASE COUNT 610 a 385 c 428 g 499 t 1 others ORIGIN 1 gcccctttct ttctcctcgt cggcccgaga gcaggaacac gataacgaag gaggcccaac 61 ttcattcaat aaggagcctg acggatttat cccagacggt agaacaaaag gaagaatatt 121 gatggatttt aaaccagagt ttttaaagag cttgagaata cggggaaatt aatttgttct 181 cctacacaca tagatagggt aaggttgttt ctgatgcagc tgagaaaaat gcagaccgtc 241 aaaaaggagc aggcgtctct tgatgccagt agcaatgtgg acaagatgat ggtccttaat 301 tctgctttaa cggaagtgtc agaagactcc acaacaggtg aggacgtgct tctcagtgaa 361 ggaagtgtgg ggaagaacaa atcttctgca tgtcggagga aacgggaatt cattcctgat 421 gaaaagaaag atgctatgta ttgggaaaaa aggcggaaaa ataatgaagc tgccaaaaga 481 tctcgtgaga agcgtcgact gaatgacctg gttttagaga acaaactaat tgcactggga 541 gaagaaaacg ccactttaaa agctgagctg ctttcactaa aattaaagtt tggtttaatt 601 agctccacag catatgctca agagattcag aaactcagta attctacagc tgtgtacttt 661 caagattacc agacttccaa atccaatgtg agttcatttg tggacgagca cgaaccctcg 721 atggtgtcaa gtagttgtat ttctgtcatt aaacactctc cacaaagctc gctgtccgat 781 gtttcagaag tgtcctcagt agaacacacg caggagagct ctgtgcaggg aagctgcaga 841 agtcctgaaa acaagttcca gattatcaag caagagccga tggaattaga gagctacaca 901 agggagccaa gagatgaccg aggctcttac acagcgtcca tctatcaaaa ctatatgggg 961 aattctttct ctgggtactc acactctccc ccactactgc aagtcaaccg atcctccagc 1021 aactccccga gaacgtcgga aactgatgat ggtgtggtag gaaagtcatc tgatggagaa 1081 gacgagcaac aggtccccaa gggccccatc cattctccag ttgaactcaa gcatgtgcat 1141 gcaactgtgg ttaaagttcc agaagtgaat tcctctgcct tgccacacaa gctccggatc 1201 aaagccaaag ccatgcagat caaagtagaa gcctttgata atgaatttga ggccacgcaa 1261 aaactttcct cacctattga catgacatct aaaagacatt tcgaactcga aaagcatagt 1321 gccccaagta tggtacattc ttctcttact cctttctcag tgcaagtgac taacattcaa 1381 gattggtctc tcaaatcgga gcactggcat caaaaagaac tgagtggcaa aactcagaat 1441 agtttcaaaa ctggagttgt tgaaatgaaa gacagtggct acaaagtttc tgacccagag 1501 aacttgtatt tgaagcaggg gatagcaaac ttatctgcag aggttgtctc actcaagaga 1561 cttatagcca cacaaccaat ctctgcttca gactctgggt aaattactac tgagtaagag 1621 ctgggcattt agaaagatgt catttgcaat agagcagtcc attttgtatt atgctgaatt 1681 ttcactggac ctgtgatgtc atttcactgt gatgtgcaca tgttgtctgt ttggtgtctt 1741 tttgtgcaca gattatgatg aagattagat tgtgttatca ctctgcctgt gtatagtcag 1801 atagtcatat gcgtaaggct gtatatatta agnttttatt tttgttgttc tattataaag 1861 tgtgtaagtt accagtttca ataaaggatt ggtgacaaac acagaaaaaa aaaaaaaaaa 1921 aaa // LOCUS HSE6AP1 2559 bp RNA PRI 29-APR-1997 DEFINITION H.sapiens mRNA for E6-AP isoform-I. ACCESSION X98032 NID g1495429 KEYWORDS E6-AP gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2559) AUTHORS Yamamoto,Y., Huibregtse,J.M. and Howley,P.M. TITLE The human E6-AP gene (UBE3A) encodes three potential protein isoforms generated by differential splicing JOURNAL Genomics 41 (2), 263-266 (1997) MEDLINE 97288525 REFERENCE 2 (bases 1 to 2559) AUTHORS Yamamoto,Y. TITLE Direct Submission JOURNAL Submitted (13-MAY-1996) Y. Yamamoto, Harvard Medical School, Pathology, 200 Longwood Ave. Boston, MA. 02115, USA FEATURES Location/Qualifiers source 1..2559 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="primary keratinocyte" gene 1..2559 /gene="E6-AP" CDS 1..2559 /gene="E6-AP" /note="isoform I" /codon_start=1 /db_xref="PID:e246074" /db_xref="PID:g1495430" /translation="MKRAAAKHLIERYYHQLTEGCGNEACTNEFCASCPTFLRMDNNA AAIKALELYKINAKLCDPHPSKKGASSAYLENSKGAPNNSCSEIKMNKKGARIDFKDV TYLTEEKVYEILELCREREDYSPLIRVIGRVFSSAEALVQSFRKVKQHTKEELKSLQA KDEDKDEDEKEKAACSAAAMEEDSEASSSRIGDSSQGDNNLQKLGPDDVSVDIDAIRR VYTRLLSNEKIETAFLNALVYLSPNVECDLTYHNVYSRDPNYLNLFIIGMENRNLHSP EYLEMALPLFCKAMSKLPLAAQGKLIRLWSKYNADQIRRMMETFQQLITYKVISNEFN SRNLVNDDDAIVAASKCLKMVYYANVVGGEVDTNHNEEDDEEPIPESSELTLQELLGE ERRNKKGPRVDPLETELGVKTLDCRKPLIPFEEFINEPLNEVLEMDKDYTFFKVETEN KFSFMTCPFILNAVTKNLGLYYDNRIRMYSERRITVLYSLVQGQQLNPYLRLKVRRDH IIDDALVRLEMIAMENPADLKKQLYVEFEGEQGVDEGGVSKEFFQLVVEEIFNPDIGM FTYDESTKLFWFNPSSFETEGQFTLIGIVLGLAIYNNCILDVHFPMVVYRKLMGKKGT FRDLGDSHPVLYQSLKDLLEYEGNVEDDMMITFQISQTDLFGNPMMYDLKENGDKIPI TNENRKEFVNLYSDYILNKSVEKQFKAFRRGFHMVTNESPLKYLFRPEEIELLICGSR NLDFQALEETTEYDGGYTRDSVLIREFWEIVHSFTDEQKRLFLQFTTGTDRAPVGGLG KLKMIIAKNGPDTERLPTSHTCFNVLLLPEYSSKEKLKERLLKAITYAKGFGML" BASE COUNT 853 a 443 c 556 g 707 t ORIGIN 1 atgaagcgag cagctgcaaa gcatctaata gaacgctact accaccagtt aactgagggc 61 tgtggaaatg aagcctgcac gaatgagttt tgtgcttcct gtccaacttt tcttcgtatg 121 gataataatg cagcagctat taaagccctc gagctttata agattaatgc aaaactctgt 181 gatcctcatc cctccaagaa aggagcaagc tcagcttacc ttgagaactc gaaaggtgcc 241 cccaacaact cctgctctga gataaaaatg aacaagaaag gcgctagaat tgattttaaa 301 gatgtgactt acttaacaga agagaaggta tatgaaattc ttgaattatg tagagaaaga 361 gaggattatt cccctttaat ccgtgttatt ggaagagttt tttctagtgc tgaggcattg 421 gtacagagct tccggaaagt taaacaacac accaaggaag aactgaaatc tcttcaagca 481 aaagatgaag acaaagatga agatgaaaag gaaaaagctg catgttctgc tgctgctatg 541 gaagaagact cagaagcatc ttcctcaagg ataggtgata gctcacaggg agacaacaat 601 ttgcaaaaat taggccctga tgatgtgtct gtggatattg atgccattag aagggtctac 661 accagattgc tctctaatga aaaaattgaa actgcctttc tcaatgcact tgtatatttg 721 tcacctaacg tggaatgtga cttgacgtat cacaatgtat actctcgaga tcctaattat 781 ctgaatttgt tcattatcgg aatggagaat agaaatctcc acagtcctga atatctggaa 841 atggctttgc cattattttg caaagcgatg agcaagctac cccttgcagc ccaaggaaaa 901 ctgatcagac tgtggtctaa atacaatgca gaccagattc ggagaatgat ggagacattt 961 cagcaactta ttacttataa agtcataagc aatgaattta acagtcgaaa tctagtgaat 1021 gatgatgatg ccattgttgc tgcttcgaag tgcttgaaaa tggtttacta tgcaaatgta 1081 gtgggagggg aagtggacac aaatcacaat gaagaagatg atgaagagcc catccctgag 1141 tccagcgagc tgacacttca ggaacttttg ggagaagaaa gaagaaacaa gaaaggtcct 1201 cgagtggacc ccctggaaac tgaacttggt gttaaaaccc tggattgtcg aaaaccactt 1261 atcccttttg aagagtttat taatgaacca ctgaatgagg ttctagaaat ggataaagat 1321 tatacttttt tcaaagtaga aacagagaac aaattctctt ttatgacatg tccctttata 1381 ttgaatgctg tcacaaagaa tttgggatta tattatgaca atagaattcg catgtacagt 1441 gaacgaagaa tcactgttct ctacagctta gttcaaggac agcagttgaa tccatatttg 1501 agactcaaag ttagacgtga ccatatcata gatgatgcac ttgtccggct agagatgatc 1561 gctatggaaa atcctgcaga cttgaagaag cagttgtatg tggaatttga aggagaacaa 1621 ggagttgatg agggaggtgt ttccaaagaa ttttttcagc tggttgtgga ggaaatcttc 1681 aatccagata ttggtatgtt cacatacgat gaatctacaa aattgttttg gtttaatcca 1741 tcttcttttg aaactgaggg tcagtttact ctgattggca tagtactggg tctggctatt 1801 tacaataact gtatactgga tgtacatttt cccatggttg tctacaggaa gctaatgggg 1861 aaaaaaggaa cttttcgtga cttgggagac tctcacccag ttctatatca gagtttaaaa 1921 gatttattgg agtatgaagg gaatgtggaa gatgacatga tgatcacttt ccagatatca 1981 cagacagatc tttttggtaa cccaatgatg tatgatctaa aggaaaatgg tgataaaatt 2041 ccaattacaa atgaaaacag gaaggaattt gtcaatcttt attctgacta cattctcaat 2101 aaatcagtag aaaaacagtt caaggctttt cggagaggtt ttcatatggt gaccaatgaa 2161 tctcccttaa agtacttatt cagaccagaa gaaattgaat tgcttatatg tggaagccgg 2221 aatctagatt tccaagcact agaagaaact acagaatatg acggtggcta taccagggac 2281 tctgttctga ttagggagtt ctgggaaatc gttcattcat ttacagatga acagaaaaga 2341 ctcttcttgc agtttacaac gggcacagac agagcacctg tgggaggact aggaaaatta 2401 aagatgatta tagccaaaaa tggcccagac acagaaaggt tacctacatc tcatacttgc 2461 tttaatgtgc ttttacttcc ggaatactca agcaaagaaa aacttaaaga gagattgttg 2521 aaggccatca cgtatgccaa aggatttggc atgctgtaa // LOCUS HSEAP 602 bp RNA PRI 27-MAY-1991 DEFINITION Human mRNA for Epstein-Barr virus small RNAs (EBERs)associated protein (EAP). ACCESSION X59357 NID g31061 KEYWORDS small nuclear ribonucleoprotein; small nuclear RNP. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 602) AUTHORS Cali,J.J. and Wrighton,N.C. TITLE EAP, a highly conserved cellular protein associated with Epstein-Barr virus small RNA's (EBERs) JOURNAL EMBO J. 10, 459-466 (1991) MEDLINE 91122054 FEATURES Location/Qualifiers source 1..602 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="b lymphocytes" /cell_line="cytoplasmic extract from Raji (Burkitts lymphoma)" /tissue_lib="placental cDNA" mRNA 1..602 /gene="EAP" /evidence=experimental gene 1..602 /gene="EAP" CDS 52..438 /gene="EAP" /codon_start=1 /product="Epstein-Barr virus small RNA associated protein" /db_xref="PID:g31062" /db_xref="SWISS-PROT:P35268" /translation="MAPVKKLVVKGGKKKKQVLKFTLDCTHPVEDGIMDAANFEQFLQ ERIKVNGKAGNLGGGVVTIERSKSKITVTSEVPFSKRYLKYLTKKYLKKNNLRDWLRV VANSKESYELRYFQINQDEEEEEDED" mat_peptide 55..438 /gene="EAP" /product="Epstein-Barr virus small RNA associated protein" BASE COUNT 186 a 100 c 151 g 165 t ORIGIN 1 gaattctgat gtcgtaccta aggcttgtcc atctttgttg ttggaggtgc catggctcct 61 gtgaaaaagc ttgtggtgaa ggggggcaaa aaaaagaagc aagttctgaa gttcactctt 121 gattgcaccc accctgtaga agatggaatc atggatgctg ccaattttga gcagtttttg 181 caagaaagga tcaaagtgaa cggaaaagct gggaaccttg gtggaggggt ggtgaccatc 241 gaaaggagca agagcaagat caccgtgaca tccgaggtgc ctttctccaa aaggtatttg 301 aaatatctca ccaaaaaata tttgaagaag aataatctac gtgactggtt gcgcgtagtt 361 gctaacagca aagagagtta cgaattacgt tacttccaga ttaaccagga cgaagaagag 421 gaggaagacg aggattaaat ttcatttatc tggaaaattt tgtatgagtt cttgaataaa 481 acttgggaac caaaatggtg gtttatcctt gtatctctgc agtgtggatt gaacagaaaa 541 ttggaaatca tagtcaaagg gcttccttgg ttcgccactc atttatttgt aacttgactt 601 ct // LOCUS HSECAD 4778 bp RNA PRI 27-APR-1996 DEFINITION H.sapiens mRNA for E-cadherin. ACCESSION Z13009 NID g31072 KEYWORDS E-cadherin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4778) AUTHORS Bussemakers,M.J., van Bokhoven,A., Mees,S.G., Kemler,R. and Schalken,J.A. TITLE Molecular cloning and characterization of the human E-cadherin cDNA JOURNAL Mol. Biol. Rep. 17 (2), 123-128 (1993) MEDLINE 93211394 REFERENCE 2 (bases 1 to 4778) AUTHORS Bussemakers,M. TITLE Direct Submission JOURNAL Submitted (22-JUN-1992) Bussemakers M., University Hospital Nijmegen, Geert Grooteplein 16, Nijmegen, The Netherlands REFERENCE 3 (bases 1 to 4778) AUTHORS Bussemakers,M.J., Giroldi,L.A., van Bokhoven,A. and Schalken,J.A. TITLE Transcriptional regulation of the human E-cadherin gene in human prostate cancer cell lines: characterization of the human E-cadherin gene promoter JOURNAL Biochem. Biophys. Res. Commun. 203 (2), 1284-1290 (1994) MEDLINE 94380041 COMMENT . FEATURES Location/Qualifiers source 1..4778 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA 1..4778 CDS 95..2743 /codon_start=1 /product="E-cadherin" /db_xref="PID:g31073" /db_xref="SWISS-PROT:P12830" /translation="MGPWSRSLSALLLLLQVSSWLCQEPEPCHPGFDAESYTFTVPRR HLERGRVLGRVNFEDCTGRQRTAYFSLDTRFKVGTDGVITVKRPLRFHNPQIHFLVYA WDSTYRKFSTKVTLNTVGHHHRPPPHQASVSGIQAELLTFPNSSPGLRRQKRDWVIPP ISCPENEKGPFPKNLVQIKSNKDKEGKVFYSITGQGADTPPVGVFIIERETGWLKVTE PLDRERIATYTLFSHAVSSNGNAVEDPMEILITVTDQNDNKPEFTQEVFKGSVMEGAL PGTSVMEVTATDADDDVNTYNAAIAYTILSQDPELPDKNMFTINRNTGVISVVTTGLD RESFPTYTLVVQAADLQGEGLSTTATAVITVTDTNDNPPIFNPTTYKGQVPENEANVV ITTLKVTDADAPNTPAWEAVYTILNDDGGQFVVTTNPVNNDGILKTAKGLDFEAKQQY ILHVAVTNVVPFEVSLTTSTATVTVDVLDVNEAPIFVPPEKRVEVSEDFGVGQEITSY TAQEPDTFMEQKITYRIWRDTANWLEINPDTGAISTRAELDREDFEHVKNSTYTALII ATDNGSPVATGTGTLLLILSDVNDNAPIPEPRTIFFCERNPKPQVINIIDADLPPNTS PFTAELTHGASANWTIQYNDPTQESIILKPKMALEVGDYKINLKLMDNQNKDQVTTLE VSVCDCEGAAGVCRKAQPVEAGLQIPAILGILGGILALLILILLLLLFLRRRAVVKEP LLPPEDDTRDNVYYYDEEGGGEEDQDFDLSQLHRGLDARPEVTRNDVAPTLMSVPRYL PRPANPDEIGNFIDENLKAADTDPTAPPYDSLLVFDYEGSGSEAASLSSLNSSESDKD QDYDYLNEWGNRFKKLADMYGGGEDD" mat_peptide 557..2740 /product="E-cadherin" BASE COUNT 1237 a 1177 c 1128 g 1236 t ORIGIN 1 gcttgcggaa gtcagttcag actccagccc gctccagccc ggcccgaccc gaccgcaccc 61 ggcgcctgcc ctcgctcggc gtccccggcc agccatgggc ccttggagcc gcagcctctc 121 ggcgctgctg ctgctgctgc aggtctcctc ttggctctgc caggagccgg agccctgcca 181 ccctggcttt gacgccgaga gctacacgtt cacggtgccc cggcgccacc tggagagagg 241 ccgcgtcctg ggcagagtga attttgaaga ttgcaccggt cgacaaagga cagcctattt 301 ttccctcgac acccgattca aagtgggcac agatggtgtg attacagtca aaaggcctct 361 acggtttcat aacccacaga tccatttctt ggtctacgcc tgggactcca cctacagaaa 421 gttttccacc aaagtcacgc tgaatacagt ggggcaccac caccgccccc cgccccatca 481 ggcctccgtt tctggaatcc aagcagaatt gctcacattt cccaactcct ctcctggcct 541 cagaagacag aagagagact gggttattcc tcccatcagc tgcccagaaa atgaaaaagg 601 cccatttcct aaaaacctgg ttcagatcaa atccaacaaa gacaaagaag gcaaggtttt 661 ctacagcatc actggccaag gagctgacac accccctgtt ggtgtcttta ttattgaaag 721 agaaacagga tggctgaagg tgacagagcc tctggataga gaacgcattg ccacatacac 781 tctcttctct cacgctgtgt catccaacgg gaatgcagtt gaggatccaa tggagatttt 841 gatcacggta accgatcaga atgacaacaa gcccgaattc acccaggagg tctttaaggg 901 gtctgtcatg gaaggtgctc ttccaggaac ctctgtgatg gaggtcacag ccacagacgc 961 ggacgatgat gtgaacacct acaatgccgc catcgcttac accatcctca gccaagatcc 1021 tgagctccct gacaaaaata tgttcaccat taacaggaac acaggagtca tcagtgtggt 1081 caccactggg ctggaccgag agagtttccc tacgtatacc ctggtggttc aagctgctga 1141 ccttcaaggt gaggggttaa gcacaacagc aacagctgtg atcacagtca ctgacaccaa 1201 cgataatcct ccgatcttca atcccaccac gtacaagggt caggtgcctg agaacgaggc 1261 taacgtcgta atcaccacac tgaaagtgac tgatgctgat gcccccaata ccccagcgtg 1321 ggaggctgta tacaccatat tgaatgatga tggtggacaa tttgtcgtca ccacaaatcc 1381 agtgaacaac gatggcattt tgaaaacagc aaagggcttg gattttgagg ccaagcagca 1441 gtacattcta cacgtagcag tgacgaatgt ggtacctttt gaggtctctc tcaccacctc 1501 cacagccacc gtcaccgtgg atgtgctgga tgtgaatgaa gcccccatct ttgtgcctcc 1561 tgaaaagaga gtggaagtgt ccgaggactt tggcgtgggc caggaaatca catcctacac 1621 tgcccaggag ccagacacat ttatggaaca gaaaataaca tatcggattt ggagagacac 1681 tgccaactgg ctggagatta atccggacac tggtgccatt tccactcggg ctgagctgga 1741 cagggaggat tttgagcacg tgaagaacag cacgtacaca gccctaatca tagctacaga 1801 caatggttct ccagttgcta ctggaacagg gacacttctg ctgatcctgt ctgatgtgaa 1861 tgacaacgcc cccataccag aacctcgaac tatattcttc tgtgagagga atccaaagcc 1921 tcaggtcata aacatcattg atgcagacct tcctcccaat acatctccct tcacagcaga 1981 actaacacac ggggcgagtg ccaactggac cattcagtac aacgacccaa cccaagaatc 2041 tatcattttg aagccaaaga tggccttaga ggtgggtgac tacaaaatca atctcaagct 2101 catggataac cagaataaag accaagtgac caccttagag gtcagcgtgt gtgactgtga 2161 aggggccgcc ggcgtctgta ggaaggcaca gcctgtcgaa gcaggattgc aaattcctgc 2221 cattctgggg attcttggag gaattcttgc tttgctaatt ctgattctgc tgctcttgct 2281 gtttcttcgg aggagagcgg tggtcaaaga gcccttactg cccccagagg atgacacccg 2341 ggacaacgtt tattactatg atgaagaagg aggcggagaa gaggaccagg actttgactt 2401 gagccagctg cacaggggcc tggacgctcg gcctgaagtg actcgtaacg acgttgcacc 2461 aaccctcatg agtgtccccc ggtatcttcc ccgccctgcc aatcccgatg aaattggaaa 2521 ttttattgat gaaaatctga aagcggctga tactgacccc acagccccgc cttatgattc 2581 tctgctcgtg tttgactatg aaggaagcgg ttccgaagct gctagtctga gctccctgaa 2641 ctcctcagag tcagacaaag accaggacta tgactacttg aacgaatggg gcaatcgctt 2701 caagaagctg gctgacatgt acggaggcgg cgaggacgac taggggactc gagagaggcg 2761 ggccccagac ccatgtgctg ggaaatgcag aaatcacgtt gctggtggtt tttcagctcc 2821 cttcccttga gatgagtttc tggggaaaaa aaagagactg gttagtgatg cagttagtat 2881 agctttatac tctctccact ttatagctct aataagtttg tgttagaaaa gtttcgactt 2941 atttcttaaa gctttttttt ttttcccatc actctttaca tggtggtgat gtccaaaaga 3001 tacccaaatt ttaatattcc agaagaacaa ctttagcatc agaaggttca cccagcacct 3061 tgcagatttt cttaaggaat tttgtctcac ttttaaaaag aaggggagaa gtcagctact 3121 ctagttctgt tgttttgtgt atataatttt ttaaaaaaaa tttgtgtgct tctgctcatt 3181 actacactgg tgtgtccctc tgcctttttt ttttttttta agacagggtc tcattctatc 3241 ggccaggctg gagtgcagtg gtgcaatcac agctcactgc agccttgtcc tcccaggctc 3301 aagctatcct tgcacctcag cctcccaagt agctgggacc acaggcatgc accactacgc 3361 atgactaatt ttttaaatat ttgagacggg gtctccctgt gttacccagg ctggtctcaa 3421 actcctgggc tcaagtgatc ctcccatctt ggcctcccag agtattggga ttacagacat 3481 gagccactgc acctgcccag ctccccaact ccctgccatt ttttaagaga cagtttcgct 3541 ccatcgccca ggcctgggat gcagtgatgt gatcatagct cactgtaacc tcaaactctg 3601 gggctcaagc agttctccca ccagcctcct ttttattttt ttgtacagat ggggtcttgc 3661 tatgttgccc aagctggtct taaactcctg gcctcaagca atccttctgc cttggccccc 3721 caaagtgctg ggattgtggg catgagctgc tgtgcccagc ctccatgttt taatatcaac 3781 tctcactcct gaattcagtt gctttgccca agataggagt tctctgatgc agaaattatt 3841 gggctctttt agggtaagaa gtttgtgtct ttgtctggcc acatcttgac taggtattgt 3901 ctactctgaa gacctttaat ggcttccctc tttcatctcc tgagtatgta acttgcaatg 3961 ggcagctatc cagtgacttg ttctgagtaa gtgtgttcat taatgtttat ttagctctga 4021 agcaagagtg atatactcca ggacttagaa tagtgcctaa agtgctgcag ccaaagacag 4081 agcggaacta tgaaaagtgg gcttggagat ggcaggagag cttgtcattg agcctggcaa 4141 tttagcaaac tgatgctgag gatgattgag gtgggtctac ctcatctctg aaaattctgg 4201 aaggaatgga ggagtctcaa catgtgtttc tgacacaaga tccgtggttt gtactcaaag 4261 cccagaatcc ccaagtgcct gcttttgatg atgtctacag aaaatgctgg ctgagctgaa 4321 cacatttgcc caattccagg tgtgcacaga aaaccgagaa tattcaaaat tccaaatttt 4381 ttcttaggag caagaagaaa atgtggccct aaagggggtt agttgagggg tagggggtag 4441 tgaggatctt gatttggatc tctttttatt taaatgtgaa tttcaacttt tgacaatcaa 4501 agaaaagact tttgttgaaa tagctttact gtttctcaag tgttttggag aaaaaaatca 4561 accctgcaat cactttttgg aattgtcttg atttttcggc agttcaagct atatcgaata 4621 tagttctgtg tagagaatgt cactgtagtt ttgagtgtat acatgtgtgg gtgctgataa 4681 ttgtgtattt tctttggggg tggaaaagga aaacaattca agctgagaaa agtattctca 4741 aagatgcatt tttataaatt ttattaaaca attttgtt // LOCUS HSECP 715 bp RNA PRI 22-MAR-1995 DEFINITION Human mRNA for eosinophil cationic protein ECP. ACCESSION X15161 NID g31076 KEYWORDS cytotoxin; eosinophil cationic protein; glycoprotein; helminthotoxin; matrix protein; neurotoxin; perforin; ribonuclease. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 715) AUTHORS Rosenberg,H.F. TITLE Direct Submission JOURNAL Submitted (29-APR-1989) Rosenberg H.F., Beth Israel Hospital, 330 Brookline Avenue, Boston MA 02215, USA REFERENCE 2 (bases 1 to 715) AUTHORS Rosenberg,H.F., Ackerman,S.J. and Tenen,D.G. TITLE Human eosinophil cationic protein. Molecular cloning of a cytotoxin and helminthotoxin with ribonuclease activity JOURNAL J. Exp. Med. 170 (1), 163-176 (1989) MEDLINE 89310354 FEATURES Location/Qualifiers source 1..715 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="peripheral blood granulocytes" /cell_type="eosinophil" /cell_line="HL-60" /clone_lib="lambda gt11" mRNA <1..715 /note="ECP mRNA" sig_peptide 55..135 /note="signal peptide ((AA -27 to -1)" CDS 55..537 /note="ECP preprotein" /codon_start=1 /db_xref="PID:g31077" /db_xref="SWISS-PROT:P12724" /translation="MVPKLFTSQICLLLLLGLMGVEGSLHARPPQFTRAQWFAIQHIS LNPPRCTIAMRAINNYRWRCKNQNTFLRTTFANVVNVCGNQSIRCPHNRTLNNCHRSR FRVPLLHCDLINPGAQNISNCRYADRPGRRFYVVACDNRDPRDSPRYPVVPVHLDTTI " mat_peptide 136..534 /note="mature ECP (AA 1-133)" polyA_site 715 /note="polyA site" BASE COUNT 185 a 199 c 141 g 190 t ORIGIN 1 gaacaaccag ctggatcagt tctcacagga gccacagctc agagactggg aaacatggtt 61 ccaaaactgt tcacttccca aatttgtctg cttcttctgt tggggcttat gggtgtggag 121 ggctcactcc atgccagacc cccacagttt acgagggctc agtggtttgc catccagcac 181 atcagtctga acccccctcg atgcaccatt gcaatgcggg caattaacaa ttatcgatgg 241 cgttgcaaaa accaaaatac ttttcttcgt acaacttttg ctaatgtagt taatgtttgt 301 ggtaaccaaa gtatacgctg ccctcataac agaactctca acaattgtca tcggagtaga 361 ttccgggtgc ctttactcca ctgtgacctc ataaatccag gtgcacagaa tatttcaaac 421 tgcaggtatg cagacagacc aggaaggagg ttctatgtag ttgcatgtga caacagagat 481 ccacgggatt ctccacggta tcctgtggtt ccagttcacc tggataccac catctaagct 541 cctgtatcag cagtcctcat catcactcat ctgccaagct cctcaatcat agccaagatc 601 ccatccctcc atgtactctg ggtatcagca actgtcctca tcagtctcca taccccttca 661 gctttcctga gctgaagtcc cttgtgaacc ctgcaataaa ctgctttgca aattc // LOCUS HSECPTP 2736 bp RNA PRI 03-SEP-1996 DEFINITION H.sapiens mRNA for protein-tyrosine-phosphatase. ACCESSION X82635 NID g1524068 KEYWORDS protein-tyrosine-phosphatase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2736) AUTHORS Knyazev,P.G. and Ullrich,A. TITLE Molecular cloning and characterization of human epithelial cells specific protein tyrosine phosphatase (EC-PTP) JOURNAL Unpublished REFERENCE 2 (bases 1 to 2736) AUTHORS Knyazev,P. TITLE Direct Submission JOURNAL Submitted (11-NOV-1994) P. Knyazev, Max-Plank-Inst. fuer Biochemie, Dept. of Molecular Biology, Am Klopferspitz 18A, 82152 Planegg, FRG FEATURES Location/Qualifiers source 1..2736 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="breast" /clone_lib="colon carcinoma" gene 503..1651 /gene="EC-PTP" CDS 503..1651 /gene="EC-PTP" /EC_number="3.1.3.48" /codon_start=1 /product="protein-tyrosine-phosphatase" /db_xref="PID:e124057" /db_xref="PID:g1524069" /translation="MVQPEQAPKVLNVVVDPQGRGAPEIRATTATSVCPSPFKMKPIG LQERRGSNVSLTLDMSSLGNIEPFVSIPTPREKVAMEYLQSASRILTRSQLRDVVASS HLLQSEFMEIPMNFVDPKEIDIPRHGTKNRYKTILPNPLSRVCLRPKNVTDSLSTYIN ANYIRGYSGKEKAFIATQGPMINTVDDFWQMVWQEDSPVIVMITKLKEKNEKCVLYWP EKRGIYGKVEVLVISVNECDNYTIRNLVLKQGSHTQHVKHYWYTSWPDHKTPDSAQPL LQLMLDVEEDRLASQGRGPVVVHCSAGIGRTGCFIATSIGCQQLKEEGVVDALSIVCQ LRMDRGGMVQTSEQYEFVHHALCLYESRLQQRLSSESLKTCQTINLLG" BASE COUNT 833 a 533 c 590 g 780 t ORIGIN 1 gaattcggca cgagagaaag cctgggacct gcagatgcca tgtcaggcac gcttgctcct 61 gcataggaga ctaaataatc tcgatatata aggatggcag tctgttgtct tagatcagtt 121 tgagaagcag ctctggcagc ggggggtgta ggtgtgttgc actacactga atggaataag 181 gctaaaaata tgtttagtgt ctgataagaa cgccagtttt ctcaagctct catttaacgt 241 cggactttct gttttgcttt taaagaaaaa tgttttacaa gggcagcatg aagcggacaa 301 aatctggagc aaagaaggat tttatgctgt tgtcattttt ctcagcatct ttgttattat 361 agtaacgtgt ttgatgattc tttacagatt aaaagaaaga tttcagcttt ccttaagaca 421 agacaaagag aaaaaccagg agatccacct atcgcccatc acattacagc cagcactgtc 481 cgaggcaaag acagtccaca gcatggtcca acctgagcag gccccaaagg tactgaatgt 541 tgtcgtggac cctcaaggcc gaggtgctcc tgagatcaga gctaccaccg ctacctctgt 601 ttgcccttct cctttcaaaa tgaagcccat aggacttcaa gagagaagag ggtccaacgt 661 atctcttaca ttggacatga gtagcttggg gaacattgaa ccctttgtgt ctataccaac 721 accacgggag aaggtagcaa tggagtatct gcagtcagcc agccgaattc tcacaaggtc 781 tcagctgagg gacgtcgtgg caagttcaca tttactccaa agtgaattca tggaaatacc 841 aatgaacttt gtggatccca aagaaattga tattccgcgt catggaacta aaaatcgcta 901 taagaccatt ttaccaaatc ccctcagcag agtgtgttta agaccaaaaa atgtaaccga 961 ttcattgagc acctacatta atgctaatta tattaggggc tacagtggca aggagaaagc 1021 cttcattgcc acgcagggcc ccatgatcaa caccgtggat gatttctggc agatggtttg 1081 gcaggaagac agccctgtga ttgttatgat cacaaaactc aaagaaaaaa atgagaaatg 1141 tgtgctatac tggccggaaa agagagggat atatggaaaa gttgaggttc tggttatcag 1201 tgtaaatgaa tgtgataact acaccattcg aaaccttgtc ttaaagcaag gaagccacac 1261 ccaacatgtg aagcattact ggtacacctc atggcctgat cacaagactc cagacagtgc 1321 ccagcccctc ctacagctca tgctggatgt agaagaagac agacttgctt cccagggccg 1381 agggcctgtg gttgtccact gcagtgcagg aataggtaga acagggtgtt ttattgctac 1441 atccattggc tgtcaacagc tgaaagaaga aggagttgtg gatgcactaa gcattgtctg 1501 ccagcttcgt atggatagag gtggaatggt gcaaaccagt gagcagtatg aatttgtgca 1561 ccatgctctg tgcctgtatg agagcagact tcagcagaga ctgtccagtg agtcattgaa 1621 gacttgtcag accatcaatc tcttggggtg attaatcaaa ttacccaccc aaggcttcta 1681 gaaggagctt cctgcaatgg aaggaaggag aagctctgaa gcccatgtat ggcatggatt 1741 gtggaagact gggcaacata tttaagattt ccagctcctt gtgtatatga atgcatttgt 1801 aagcatcccc caaattattc tgaaggtttt ttgatgatgg aggtatgata ggtttatcac 1861 acagcctaag gcagattttg ttttgtctgt actgactcta tctgccacac agaatgtatg 1921 tatgtaatat tcagtaataa atgtcatcag gtgatgactg gatgagctgc tgaagacatt 1981 cgtattatgt gttagatgct ttaatgtttg caaaatctgt cttgtgaatg gactgtcagc 2041 tgttaaactg ttcctgtttt gaagtgctat tacctttctc agttaccaga atcttgctgc 2101 taaagttgca agtgattgat aatggatttt taacagagaa gtctttgttt ttgaaaaaca 2161 aaaatcaaaa acagtaacta ttttatatgg aaatgtgtct tgataatatt acctattaaa 2221 tgtgtattta tagtccctcc tatcaaacaa ttacagagca caatgattgt cattgggtat 2281 atatgtatgg aaagatttac tctctattat tgggcataaa ggtggcttct gctccagaac 2341 tctatccact gtatttccac atcgtgagtc attttacttt aaaagggaaa aacaaatttg 2401 tagcaactct gaagtatcaa gagtttaact acttgtctct cttttgctaa gaagggattt 2461 ttgatatgct atctacctgg aatctctctc tcaacaaaag gtatatgcct tcaggaatga 2521 tataatctgt cccattttcg aggctcctta taaggacatt tccatgtatg tccttacatt 2581 tctgaaagct ttcaatcttc aagagacaaa aaaaattaaa ataactaccc ttagcaaaca 2641 ctagctgttc tgctcatata tgaattttta atgcagcaat gttgactttg tttcatactg 2701 ccaataaact cttaatacta aataaaaaaa aaaaaa // LOCUS HSEDG2 1217 bp RNA PRI 29-MAY-1997 DEFINITION H.sapiens mRNA for G protein-coupled receptor Edg-2. ACCESSION Y09479 NID g1679601 KEYWORDS G protein-coupled receptor; G protein-coupled receptor Edg-2. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1217) AUTHORS Moolenaar,W.H., Kranenburg,O., Postma,F.R. and Zondag,G.C. TITLE Lysophosphatidic acid: G-protein signalling and cellular responses JOURNAL Curr. Opin. Cell Biol. 9 (2), 168-173 (1997) MEDLINE 97224241 REFERENCE 2 (bases 1 to 1217) AUTHORS Zondag,G.C.M. TITLE Direct Submission JOURNAL Submitted (15-NOV-1996) G.C.M. Zondag and W.H. Moolenaar, The Netherlands Cancer Institute, Div. Cellular Biochemistry, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands FEATURES Location/Qualifiers source 1..1217 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_lib="brain" /dev_stage="fetal" /clone="RC6" CDS 123..1217 /codon_start=1 /product="G protein-coupled receptor Edg-2" /db_xref="PID:e281935" /db_xref="PID:g1679602" /translation="MAAISTSIPVISQPQFTAMNEPQCFYNESIAFFYNRSGKHLATE WNTVSKLVMGLGITVCIFIMLANLLVMVAIYVNRRFHFPIYYLMANLAAADFFAGLAY FYLMFNTGPNTRRLTVSTWLLRQGLIDTSLTASVANLLAIAIERHITVFRMQLHTRMS NRRVVVVIVVIWTMAIVMGAIPSVGWNCICDIENCSNMAPLYSDSYLVFWAIFNLVTF VVMVVLYAHIFGYVRQRTMRMSRHSSGPRRNRDTMMSLLKTVVIVLGAFIICWTPGLV LLLLDVCCPQCDVLAYEKFFLLLAEFNSAMNPIIYSYRDKEMSATFRQILCCQRSENP TGPTEGSDRSASSLNHTILAGVHSNDHSVV" CDS 177..1217 /note="alternative startcodon" /codon_start=1 /product="G protein-coupled receptor Edg-2" /db_xref="PID:e281936" /db_xref="PID:g1679603" /translation="MNEPQCFYNESIAFFYNRSGKHLATEWNTVSKLVMGLGITVCIF IMLANLLVMVAIYVNRRFHFPIYYLMANLAAADFFAGLAYFYLMFNTGPNTRRLTVST WLLRQGLIDTSLTASVANLLAIAIERHITVFRMQLHTRMSNRRVVVVIVVIWTMAIVM GAIPSVGWNCICDIENCSNMAPLYSDSYLVFWAIFNLVTFVVMVVLYAHIFGYVRQRT MRMSRHSSGPRRNRDTMMSLLKTVVIVLGAFIICWTPGLVLLLLDVCCPQCDVLAYEK FFLLLAEFNSAMNPIIYSYRDKEMSATFRQILCCQRSENPTGPTEGSDRSASSLNHTI LAGVHSNDHSVV" BASE COUNT 269 a 332 c 274 g 342 t ORIGIN 1 ctgacaccta cagcatcagg tacacagctt ctcctagcat gacttcgatc tgatcagcaa 61 acaagaaaat ttgtctcccg tagttctggg gcgtgttcac cacctacaac cacagagctg 121 tcatggctgc catctctact tccatccctg taatttcaca gccccagttc acagccatga 181 atgaaccaca gtgcttctac aacgagtcca ttgccttctt ttataaccga agtggaaagc 241 atcttgccac agaatggaac acagtcagca agctggtgat gggacttgga atcactgttt 301 gtatcttcat catgttggcc aacctattgg tcatggtggc aatctatgtc aaccgccgct 361 tccattttcc tatttattac ctaatggcta atctggctgc tgcagacttc tttgctgggt 421 tggcctactt ctatctcatg ttcaacacag gacccaatac tcggagactg actgtcagca 481 catggctcct tcgtcagggc ctcattgaca ccagcctgac ggcatctgtg gccaacttac 541 tggctattgc aatcgagagg cacattacgg ttttccgcat gcagctccac acacggatga 601 gcaaccggcg ggtagtggtg gtcattgtgg tcatctggac tatggccatc gttatgggtg 661 ctatacccag tgtgggctgg aactgtatct gtgatattga aaattgttcc aacatggcac 721 ccctctacag tgactcttac ttagtcttct gggccatttt caacttggtg acctttgtgg 781 taatggtggt tctctatgct cacatctttg gctatgttcg ccagaggact atgagaatgt 841 ctcggcatag ttctggaccc cggcggaatc gggataccat gatgagtctt ctgaagactg 901 tggtcattgt gcttggggcc tttatcatct gctggactcc tggattggtt ttgttacttc 961 tagacgtgtg ctgtccacag tgcgacgtgc tggcctatga gaaattcttc cttctccttg 1021 ctgaattcaa ctctgccatg aaccccatca tttactccta ccgcgacaaa gaaatgagcg 1081 ccacctttag gcagatcctc tgctgccagc gcagtgagaa ccccaccggc cccacagaag 1141 gctcagaccg ctcggcttcc tccctcaacc acaccatctt ggctggagtt cacagcaatg 1201 atcactctgt ggtttag // LOCUS HSEF1B 964 bp RNA PRI 26-AUG-1991 DEFINITION Human mRNA for elongation factor-1-beta. ACCESSION X60489 NID g31099 KEYWORDS elongation factor; elongation factor 1-beta. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 964) AUTHORS Sanders,J.P. TITLE Direct Submission JOURNAL Submitted (18-JUL-1991) J.P. Sanders, State University Leiden, Medical Biochemistry, Wassenaarseweg 72, Leiden, THE NETHERLANDS REFERENCE 2 (bases 1 to 964) AUTHORS Sanders,J., Maassen,J.A., Amons,R. and Moller,W. TITLE Nucleotide sequence of human elongation factor-1 beta cDNA JOURNAL Nucleic Acids Res. 19 (16), 4551 (1991) MEDLINE 91360360 FEATURES Location/Qualifiers source 1..964 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="skin" /cell_type="fibroblast" mRNA 1..964 /evidence=experimental CDS 236..913 /codon_start=1 /product="elongation factor-1-beta" /db_xref="PID:g31100" /db_xref="SWISS-PROT:P24534" /translation="MGFGDLKSPAGLQVLNDYLADKSYIEGYVPSQADVAVFEAVSSP PPADLCHALRWYNHIKSYEKEKASLPGVKKALGKYGPADVEDTTGSGATDSKDDDDID LFGSDDEEESEEAKRLREERLAQYESKKAKKPALVAKSSILLDVKPWDDETDMAKLEE CVRSIQADGLVWGSSKLVPVGYGIKKLQIQCVVEDDKVGTDMLEEQITAFEDYVQSMD VAAFNKI" BASE COUNT 274 a 208 c 251 g 231 t ORIGIN 1 ggacaatttg tgggccattt aattcagggc ccccaattcg tacgtggaga agtgggaatg 61 caaaagtact ttgaccttta accttcggtc cggcgcggtg gagggaaacg cctccgtctc 121 tatataagga attttccggt ctcttcgggt cctttttcct ctcttcagcg tggggcgccc 181 acaatttgcg cgctctcttt ctgctgctcc ccagctctcg gatacagccg acaccatggg 241 tttcggagac ctgaaaagcc ctgccggcct ccaggtgctc aacgattacc tggcggacaa 301 gagctacatc gaggggtatg tgccatcaca agcagatgtg gcagtatttg aagccgtgtc 361 cagcccaccg cctgccgact tgtgtcatgc cctacgttgg tataatcaca tcaagtctta 421 cgaaaaggaa aaggccagcc tgccaggagt gaagaaagct ttgggcaaat atggtcctgc 481 cgatgtggaa gacactacag gaagtggagc tacagatagt aaagatgatg atgacattga 541 cctctttgga tctgatgatg aggaggaaag tgaagaagca aagaggctaa gggaagaacg 601 tcttgcacaa tatgaatcaa agaaagccaa aaaacctgca cttgttgcca agtcttccat 661 cttactagat gtgaaacctt gggatgatga gacagatatg gcgaaattag aggagtgcgt 721 cagaagcatt caagcagacg gcttagtctg gggctcatct aaactagttc cagtgggata 781 cggaattaag aaacttcaaa tacagtgtgt agttgaagat gataaagttg gaacagatat 841 gctggaggag cagatcactg cttttgagga ctatgtgcag tccatggatg tggctgcttt 901 caacaagatc taaaatccat cctggatcat ggcatttaaa taaaagattg aaagattaaa 961 accc // LOCUS HSEF1DELA 991 bp RNA PRI 07-JAN-1994 DEFINITION H.sapiens EF-1delta gene encoding human elongation factor-1-delta. ACCESSION Z21507 NID g38521 KEYWORDS human elongation factor-1-delta. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 988) AUTHORS Sanders,J., Raggiaschi,R., Morales,J. and Moller,W. TITLE The human leucine zipper-containing guanine-nucleotide exchange protein elongation factor-1 delta JOURNAL Biochim. Biophys. Acta 1174 (1), 87-90 (1993) MEDLINE 93326642 REFERENCE 2 (bases 1 to 991) AUTHORS Sanders,J.J. TITLE Direct Submission JOURNAL Submitted (27-JAN-1993) Sanders J., University of Leiden, Medical Biochemistry, Wassenaarseweg 72, Leiden, The Netherlands, 2333 AL REMARK revised by [3] FEATURES Location/Qualifiers source 1..991 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="skin" /cell_type="fibroblast" /clone_lib="lambda gt10" /clone="Deltah2" CDS 70..915 /standard_name="EF-1delta" /function="guanine-nucleotide exchange factor" /citation=[1] /codon_start=1 /product="human elongation factor-1-delta" /db_xref="PID:g38522" /db_xref="SWISS-PROT:P29692" /translation="MATNFLAHEKIWFDKFKYDDAERRFYEQMNGPVRGASRQENGAT VILRDIARARENIQKSLAGSSGPGASSGTSGDHGELVVRIASLEVENQSLRGVVQELQ QAISKLEARLNVLEKSSPGHRATAPQTQHVSPMRQVEPPAKKPATPAEDDEDDDIDLF GSDNEEEDKEAAQLREERLRQYAEKKAKKPALVAKSSILLDVKPWDDETDMAQLEACV RSIQLDGLVWGASKLVPVGYGIRKLQIQCVVEDDKVGTDLLEEEITKFEEHVQSVDIA AFNKI" BASE COUNT 234 a 265 c 328 g 164 t ORIGIN 1 gggatcagtc ttcccgcgtc cgccgattcc tcctccttgg tcgccgcgtc cttggctggc 61 gtcagaaaaa tggctacaaa cttcctagca catgagaaga tctggttcga caagttcaaa 121 tatgacgacg cagaaaggag attctacgag cagatgaacg ggcctgtgcg aggtgcctcc 181 cgccaggaga acggcgccac ggtgatcctc cgtgacattg cgagagccag agagaacatc 241 cagaaatccc tggctggaag ctcaggcccc ggggcctcca gcggcaccag cggagaccac 301 ggtgagctcg tcgtccggat tgccagtctg gaagtggaga accagagtct gcgtggcgtg 361 gtacaggagc tgcagcaggc catctccaag ctggaggccc ggctgaacgt gctggagaag 421 agctcgcctg gccaccgggc cacggcccca cagacccagc acgtatctcc catgcgccaa 481 gtggagcccc cagccaagaa gccagccaca ccagcagagg atgacgagga tgatgacatt 541 gacctgtttg gcagtgacaa tgaggaggag gacaaggagg cggcacagct gcgggaggag 601 cggctacggc agtacgcgga gaagaaggcc aagaagcctg cactggtggc caagtcctcc 661 atcctgctgg atgtcaagcc ttgggatgat gagacggaca tggcccagct ggaggcctgt 721 gtgcgctcta tccagctgga cgggctggtc tggggggctt ccaagctggt gcccgtgggc 781 tacggtatcc ggaagctaca gattcagtgt gtggtggagg acgacaaggt ggggacagac 841 ttgctggagg aggagatcac caagtttgag gagcacgtgc agagtgtcga tatcgcagct 901 ttcaacaaga tctgaagcct gagtgtgtgt acgtgcgcgc gtgcgtgagg gccctgccac 961 gattaaagac tgagaccggc aaaaaaaaaa a // LOCUS HSEF1G 1410 bp RNA PRI 22-SEP-1992 DEFINITION H.sapiens mRNA for protein homologous to elongation factor 1-gamma from A.salina. ACCESSION X63526 NID g31101 KEYWORDS elongation factor 1-gamma homologue. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1410) AUTHORS Kumabe,T. TITLE Direct Submission JOURNAL Submitted (06-DEC-1991) T. Kumabe, Tohoku Univ Gene Research Center, 1-1 Tsutsumidori-Amamiyamachi, Aobaku, Sendai, 981, JAPAN REFERENCE 2 (bases 1 to 1410) AUTHORS Kumabe,T., Sohma,Y. and Yamamoto,T. TITLE Human cDNAs encoding elongation factor 1 gamma and the ribosomal protein L19 JOURNAL Nucleic Acids Res. 20 (10), 2598 (1992) MEDLINE 92285147 FEATURES Location/Qualifiers source 1..1410 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="monocyte" /cell_line="THP1" CDS 20..1333 /codon_start=1 /product="homologue to elongation factor 1-gamma from A.salina" /db_xref="PID:g31102" /db_xref="SWISS-PROT:P26641" /translation="MAAGTLYTYPENWRAFKALIAAQYSGAQVRVLSAPPHFHFGQTN RTPEFLRKFPAGKVPAFEGDDGFCVFESNAIAYYVSNEELRGSTPEAAAQVVQWVSFA DSDIVPPASTWVFPTLGIMHHNKQATENAKEEVRRILGLLDAYLKTRTFLVGERVTLA DITVVCTLLWLYKQVLEPSFRQAFPNTNRWFLTCINQPQFRAVLGEVKLCEKMAQFDA KKFAETQPKKDTPRKEKGSREEKQKPQAERKEEKKAAAPAPEEEMDECEQALAAEPKA KDPFAHLPKSTFVLDEFKRKYSNEDTLSVALPYFWEHFDKDGWSLWYSEYRFPEELTQ TFMSCNLITGMFQRLDKLRKNAFASVILFGTNNSSSISGVWVFRGQELAFPLSPDWQV DYESYTWRKLDPGSEETQTLVREYFSWEGAFQHVGKAFNQGKIFK" BASE COUNT 332 a 373 c 392 g 313 t ORIGIN 1 ctttctttgc ggaatcacca tggcggctgg gaccctgtac acgtatcctg aaaactggag 61 ggccttcaag gctctcatcg ctgctcagta cagcggggct caggtccgcg tgctctccgc 121 accaccccac ttccattttg gccaaaccaa ccgcacccct gaatttctcc gcaaatttcc 181 tgccggcaag gtcccagcat ttgagggtga tgatggattc tgtgtgtttg agagcaacgc 241 cattgcctac tatgtgagca atgaggagct gcggggaagt actccagagg cagcagccca 301 ggtggtgcag tgggtgagct ttgctgattc cgatatagtg cccccagcca gtacctgggt 361 gttccccacc ttgggcatca tgcaccacaa caaacaggcc actgagaatg caaaggagga 421 agtgaggcga attctggggc tgctggatgc ttacttgaag acgaggactt ttctggtggg 481 cgaacgagtg acattggctg acatcacagt tgtctgcacc ctgttgtggc tctataagca 541 ggttctagag ccttctttcc gccaggcctt tcccaatacc aaccgctggt tcctcacctg 601 cattaaccag ccccagttcc gggctgtctt gggcgaagtg aaactgtgtg agaagatggc 661 ccagtttgat gctaaaaagt ttgcagagac ccaacctaaa aaggacacac cacggaaaga 721 gaagggttca cgggaagaga agcagaagcc ccaggctgag cggaaggagg agaaaaaggc 781 ggctgcccct gctcctgagg aggagatgga tgaatgtgag caggcgctgg ctgctgagcc 841 caaggccaag gaccccttcg ctcacctgcc caagagtacc tttgtgttgg atgaatttaa 901 gcgcaagtac tccaatgagg acacactctc tgtggcactg ccatatttct gggagcactt 961 tgataaggac ggctggtccc tgtggtactc agagtatcgc ttccctgaag aactcactca 1021 gaccttcatg agctgcaatc tcatcactgg aatgttccag cgactggaca agctgaggaa 1081 gaatgccttc gccagtgtca tcctttttgg aaccaacaat agcagctcca tttctggagt 1141 ctgggtcttc cgaggccagg agcttgcctt tccgctgagt ccagattggc aggtggacta 1201 cgagtcatac acatggcgga aactggatcc tggcagcgag gagacccaga cgctggttcg 1261 agagtacttt tcctgggagg gggccttcca gcatgtgggc aaagccttca atcagggcaa 1321 gatcttcaag tgaacatctc ttgccatcac ctagctgcct gcacctgccc ttcagggaga 1381 tgggggtcat taaaggaaac ctgaacattg // LOCUS HSEF2 3075 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for elongation factor 2. ACCESSION X51466 M30456 NID g31105 KEYWORDS elongation factor; elongation factor 2. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3075) AUTHORS Scheit,K.H. TITLE Direct Submission JOURNAL Submitted (18-JAN-1990) Scheit K.H., MPI fuer Biophysikalische Chemie, 3400 Goettingen REFERENCE 2 (bases 1 to 3075) AUTHORS Rapp,G., Klaudiny,J., Hagendorff,G., Luck,M.R. and Scheit,K.H. TITLE Complete sequence of the coding region of human elongation factor 2 (EF-2) by enzymatic amplification of cDNA from human ovarian granulosa cells JOURNAL Biol. Chem. Hoppe-Seyler 370 (10), 1071-1075 (1989) MEDLINE 90121741 COMMENT See also for partial elongation factor 2 mRNA. Data kindly reviewed (16-JUL-1990) by Sheit K.H. entry is replaced by Z11692. FEATURES Location/Qualifiers source 1..3075 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="ovary" /cell_type="granulosa" CDS 1..2577 /codon_start=1 /product="elongation factor 2" /db_xref="PID:g31106" /db_xref="SWISS-PROT:P13639" /translation="MVNFTVDQIRAIMDKKANIRNMSVIAHVDHGKSTLTDSLVCKAG IIASARAGETRFTDTRKDEQERCITIKSTAISLFYELSENDLNFIKQSKDGAGFLINL IDSPGHVDFSSEVTAALRVTDGALVVVDCVSGVCVQTETVLRQAIAERIKPVLMMNKM DRALLELQLEPEELYQTFQRIVENVNVIISTYGEGESGPMGNIMIDPVLGTVGFGSGL HGWAFTLKQFAEMYVAKFAAKGEGQLGPAERAKKVEDMMKKLWGDRYFDPANGKFSKS ATSPEGKKLPRTFCQLILDPIFKVFDAIMNFKKEETAKLIEKLDIKLDSEDKDKEGKP LLKAVMRRWLPAGDALLQMITIHLPSPVTAQKYRCELLYEGPPDDEAAMGIKSCDPKG PLMMYISKMVPTSDKGRFYAFGRVFSGLVSTGLKVRIMGPNYTPGKKEDLYLKPIQRT ILMMGRYVEPIEDVPCGNIVGLVGVDQFLVKTGTITTFEHAHNMRVMKFSVSPVVRVA VEAKNPADLPKLVEGLKRLAKSDPMVQCIIEESGEHIIAGAGELHLEICLKDLEEDHA CIPIKKSDPVVSYRETVSEESNVLCLSKSPNKHNRLYMKARPFPDGLAEDIDKGEVSA RQELKQRARYLAEKYEWDVAEARKIWCFGPDGTGPNILTDITKGVQYLNEIKDSVVAG FQWATKEGALCEENMRGVRFDVHDVTLHADAIHRGGGQIIPTARRCLYASVLTAQPRL MEPIYLVEIQCPEQVVGGIYGVLNRKRGHVFEESQVAGTPMFVVKAYLPVNESFGFTA DLRSNTGGQAFPQCVFDHWQILPGDPFDNSSRPSQVVAETRKRKGLKEGIPALDNFLD KL" BASE COUNT 678 a 906 c 924 g 567 t ORIGIN 1 atggttaatt ttacggtgga tcagatccgc gccatcatgg acaagaaggc caacatccgc 61 aacatgtctg tcatcgccca cgtggaccat ggcaagtcca cgctgacaga ctccctggtg 121 tgcaaggcgg gcatcatcgc ctcggcccgg gccggggaga cacgcttcac tgatacccgg 181 aaggacgagc aggagcgttg catcaccatc aagtcaactg ccatctccct cttctacgag 241 ctctcggaga atgacttgaa cttcatcaag cagagcaagg acggtgccgg cttcctcatc 301 aacctcattg actcccccgg gcatgtcgac ttctcctcgg aggtgactgc tgccctccga 361 gtcaccgatg gcgcattggt ggtggtggac tgcgtgtcag gcgtgtgcgt gcagacggag 421 acagtgctgc ggcaggccat tgccgagcgc atcaagcctg tgctgatgat gaacaagatg 481 gaccgcgccc tgctggagct gcagctggag cccgaggagc tctaccagac tttccagcgc 541 atcgtggaga acgtgaacgt catcatctcc acctacggcg agggcgagag cggccccatg 601 ggcaacatca tgatcgatcc tgtcctcggt accgtgggct ttgggtctgg cctccacggg 661 tgggccttca ccctgaagca gtttgccgag atgtatgtgg ccaagttcgc cgccaagggg 721 gagggccagt tggggcctgc cgagcgggcc aagaaagtag aggacatgat gaagaagctg 781 tggggtgaca ggtactttga cccagccaac ggcaagttca gcaagtcagc caccagcccc 841 gaagggaaga agctgccacg caccttctgc cagctgatcc tggaccccat cttcaaggtg 901 tttgatgcga tcatgaattt caagaaagag gagacagcaa aactgataga gaaactggac 961 atcaaactgg acagcgagga caaggacaaa gaaggcaaac ccctgctgaa ggctgtgatg 1021 cgccgctggc tgcctgccgg agacgccttg ttgcagatga tcaccatcca cctgccctcc 1081 cctgtgacgg cccagaagta ccgctgcgag ctcctgtacg aggggccccc ggacgacgag 1141 gctgccatgg gcattaaaag ctgtgacccc aaaggccctc ttatgatgta tatttccaaa 1201 atggtgccaa cctccgacaa aggtcggttc tacgcctttg gacgagtctt ctcggggctg 1261 gtctccactg gcctgaaggt caggatcatg gggcccaact atacccctgg gaagaaggag 1321 gacctctacc tgaagccaat ccagagaaca atcttgatga tgggccgcta cgtggagccc 1381 atcgaggatg tgccttgtgg gaacattgtg ggcctcgtgg gcgtggacca gttcctggtg 1441 aagacgggca ccatcaccac cttcgagcac gcgcacaaca tgcgggtgat gaagttcagc 1501 gtcagccctg ttgtcagagt ggccgtggag gccaagaacc cggctgacct gcccaagctg 1561 gtggaggggc tgaagcggct ggccaagtcc gaccccatgg tgcagtgcat catcgaggag 1621 tcgggagagc atatcatcgc gggcgccggc gagctgcacc tggagatctg cctgaaggac 1681 ctggaggagg accacgcctg catccccatc aagaaatctg acccggtcgt ctcgtaccgc 1741 gagacggtca gtgaagagtc gaacgtgctc tgcctctcca agtcccccaa caagcacaac 1801 cggctgtaca tgaaggcgcg gcccttcccc gacggcctgg ccgaggacat cgataaaggc 1861 gaggtgtccg cccgtcagga gctcaagcag cgggcgcgct acctggccga gaagtacgag 1921 tgggacgtgg ctgaggcccg caagatctgg tgctttgggc ccgacggcac cggccccaac 1981 atcctcaccg acatcaccaa gggtgtgcag tacctcaacg agatcaagga cagtgtggtg 2041 gccggcttcc agtgggccac caaggagggc gcactgtgtg aggagaacat gcggggtgtg 2101 cgcttcgacg tccacgacgt caccctgcac gccgacgcca tccaccgcgg agggggccag 2161 atcatcccca cagcacggcg ctgcctctat gccagtgtgc tgaccgccca gccacgcctc 2221 atggagccca tctaccttgt ggagatccag tgtccagagc aggtggtcgg tggcatctac 2281 ggggttttga acaggaagcg gggccacgtg ttcgaggagt cccaggtggc cggcaccccc 2341 atgtttgtgg tcaaggccta tctgcccgtc aacgagtcct ttggcttcac cgctgacctg 2401 aggtccaaca cgggcggcca ggcgttcccc cagtgtgtgt ttgaccactg gcagatcctg 2461 cccggagacc ccttcgacaa cagcagccgc cccagccagg tggtggcgga gacccgcaag 2521 cgcaagggcc tgaaagaagg catccctgcc ctggacaact tcctggacaa attgtaggcg 2581 gcccttcctg cagcgcctgc cgccccgggg actcgcagca cccacagcac cacgtcctcg 2641 aattctcaga cgacacctgg agactgtccc gacacagcga cgctcccctg agaggtttct 2701 ggggcccgct gcgtgccatc actcaaccat aacacttgat gccgtttctt tcaatattta 2761 tttccagagt ccggaggcag cagacacgcc ctcttagtag ggacttaatg ggccggtcgg 2821 ggagggggag gcgggatggg acacccaaca ctttttccat ttcttcagag ggaaactcag 2881 atgtccaaac taattttaac aaacgcatta agaggtttat ttgggtacat ggcccgcagt 2941 ggcttttgcc ccagaaaggg gaaaggaaca cgcgggtaga tgatttctag caggcaggaa 3001 gtcctgtgcg gtgtcaccat gagcactcag ctgtactagt gccattggaa taataaattt 3061 gataaggtgt gaaaa // LOCUS HSEFAC1A2 1755 bp RNA PRI 18-AUG-1993 DEFINITION H.sapiens mRNA for elongation factor 1 alpha-2. ACCESSION X70940 NID g38455 KEYWORDS elongation factor; elongation factor 1-alpha-2. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1755) AUTHORS Knudsen,S.M., Frydenberg,J., Clark,B.F. and Leffers,H. TITLE Tissue-dependent variation in the expression of elongation factor-1 alpha isoforms: isolation and characterisation of a cDNA encoding a novel variant of human elongation-factor 1 alpha JOURNAL Eur. J. Biochem. 215 (3), 549-554 (1993) MEDLINE 93358875 REFERENCE 2 (bases 1 to 1755) AUTHORS Leffers,H. TITLE Direct Submission JOURNAL Submitted (16-FEB-1993) H. Leffers, Inst of Medical Biochemistry & Danish Centre for Human Genome Research, Ole Worms Alle 170, Aarhus University, 8000 Aarhus C, DENMARK FEATURES Location/Qualifiers source 1..1755 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda ZAPII; AMA" CDS 84..1475 /codon_start=1 /product="elongation factor 1 alpha-2" /db_xref="PID:g38456" /db_xref="SWISS-PROT:Q05639" /translation="MGKEKTHINIVVIGHVDSGKSTTTGHLIYKCGGIDKRTIEKFEK EAAEMGKGSFKYAWVLDKLKAERERGITIDISLWKFETTKYYITIIDAPGHRDFIKNM ITGTSQADCAVLIVAAGVGEFEAGISKNGQTREHALLAYTLGVKQLIVGVNKMDSTEP AYSEKRYDEIVKEVSAYIKKIGYNPATVPFVPISGWHGDNMLEPSPNMPWFKGWKVER KEGNASGVSLLEALDTILPPTRPTDKPLRLPLQDVYKIGGIGTVPVGRVETGILRPGM VVTFAPVNITTEVKSVEMHHEALSEALPGDNVGFNVKNVSVKDIRRGNVCGDSKSDPP QEAAQFTSQVIILNHPGQISAGYSPVIDCHTAHIACKFAELKEKIDRRSGKKLEDNPK SLKSGDAAIVEMVPGKPMCVESFSQYPPLGRFAVRDMRQTVAVGVIKNVEKKSGGAGK VTKSAQKAQKAGK" polyA_signal 1736..1741 BASE COUNT 368 a 585 c 548 g 254 t ORIGIN 1 cctcggctcc ggaatcactg cagcccccct cgccctgagc cagagcaccc cgggtcccgc 61 cagcccctca cactcccagc aaaatgggca aggagaagac ccacatcaac atcgtggtca 121 tcggccacgt ggactccgga aagtccacca ccacgggcca cctcatctac aaatgcggag 181 gtattgacaa aaggaccatt gagaagttcg agaaggaggc ggctgagatg gggaagggat 241 ccttcaagta tgcctgggtg ctggacaagc tgaaggcgga gcgtgagcgc ggcatcacca 301 tcgacatctc cctctggaag ttcgagacca ccaagtacta catcaccatc atcgatgccc 361 ccggccaccg cgacttcatc aagaacatga tcacgggtac atcccaggcg gactgcgcag 421 tgctgatcgt ggcggcgggc gtgggcgagt tcgaggcggg catctccaag aatgggcaga 481 cgcgggagca tgccctgctg gcctacacgc tgggtgtgaa gcagctcatc gtgggcgtga 541 acaaaatgga ctccacagag ccggcctaca gcgagaagcg ctacgacgag atcgtcaagg 601 aagtcagcgc ctacatcaag aagatcggct acaacccggc caccgtgccc tttgtgccca 661 tctccggctg gcacggcgac aacatgctgg agccctcccc caacatgccg tggttcaagg 721 gctggaaggt ggagcgtaag gagggcaacg caagcggcgt gtccctgctg gaggccctgg 781 acaccatcct gccccccacg cgccccacgg acaagcccct gcgcctgccg ctgcaggacg 841 tgtacaagat tggcggcatt ggcacggtgc ccgtgggccg ggtggagacc ggcatcctgc 901 ggccgggcat ggtggtgacc tttgcgccag tgaacatcac cactgaggtg aagtcagtgg 961 agatgcacca cgaggctctg agcgaagctc tgcccggcga caacgtcggc ttcaatgtga 1021 agaacgtgtc ggtgaaggac atccggcggg gcaacgtgtg tggggacagc aagtctgacc 1081 cgccgcagga ggctgctcag ttcacctccc aggtcatcat cctgaaccac ccggggcaga 1141 ttagcgccgg ctactccccg gtcatcgact gccacacagc ccacatcgcc tgcaagtttg 1201 cggagctgaa ggagaagatt gaccggcgct ctggcaagaa gctggaggac aaccccaagt 1261 ccctgaagtc tggagacgcg gccatcgtgg agatggtgcc gggaaagccc atgtgtgtgg 1321 agagcttctc ccagtacccg cctctcggcc gcttcgccgt gcgcgacatg aggcagacgg 1381 tggccgtagg cgtcatcaag aacgtggaaa agaagagcgg cggcgccggc aaggtcacca 1441 agtcggcgca gaaggcgcag aaggcgggca agtgaagcgc gggccgcggc gcgaccctcc 1501 ccggcggcgc cgcgctccga accccggccc ggcccccgcc ccgcccccgc cccgcgcgcc 1561 gctccggcgc cccgcacccc cgccaggcgc atgtctgcac ctccgcttgc cagaggccct 1621 cggtcagcga ctggatgctc gccatcaagg tccagtggaa gttcttcaag aggaaaggcg 1681 cccccgcccc aggcttccgc gcccagcgct cgccacgctc agtgcccgtt ttaccaataa 1741 actgagcgac cccag // LOCUS HSEGFPRE 5532 bp RNA PRI 30-MAR-1995 DEFINITION Human mRNA for precursor of epidermal growth factor receptor. ACCESSION X00588 NID g31113 KEYWORDS epidermal growth factor receptor; signal peptide. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5532) AUTHORS Ullrich,A., Coussens,L., Hayflick,J.S., Dull,T.J., Gray,A., Tam,A.W., Lee,J., Yarden,Y., Libermann,T.A., Schlessinger,J., Downward,J., Mayes,E.L., Whittle,N., Waterfield,M.D. and Seeburg,P.H. TITLE Human epidermal growth factor receptor cDNA sequence and aberrant expression of the amplified gene in A431 epidermoid carcinoma cells JOURNAL Nature 309 (5967), 418-425 (1984) MEDLINE 84219729 FEATURES Location/Qualifiers source 1..5532 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 187..258 /note="put. signal peptide" CDS 187..3819 /codon_start=1 /product="epidermal growth factor receptor" /db_xref="PID:g757924" /db_xref="SWISS-PROT:P00533" /translation="MRPSGTAGAALLALLAALCPASRALEEKKVCQGTSNKLTQLGTF EDHFLSLQRMFNNCEVVLGNLEITYVQRNYDLSFLKTIQEVAGYVLIALNTVERIPLE NLQIIRGNMYYENSYALAVLSNYDANKTGLKELPMRNLQEILHGAVRFSNNPALCNVE SIQWRDIVSSDFLSNMSMDFQNHLGSCQKCDPSCPNGSCWGAGEENCQKLTKIICAQQ CSGRCRGKSPSDCCHNQCAAGCTGPRESDCLVCRKFRDEATCKDTCPPLMLYNPTTYQ MDVNPEGKYSFGATCVKKCPRNYVVTDHGSCVRACGADSYEMEEDGVRKCKKCEGPCR KVCNGIGIGEFKDSLSINATNIKHFKNCTSISGDLHILPVAFRGDSFTHTPPLDPQEL DILKTVKEITGFLLIQAWPENRTDLHAFENLEIIRGRTKQHGQFSLAVVSLNITSLGL RSLKEISDGDVIISGNKNLCYANTINWKKLFGTSGQKTKIISNRGENSCKATGQVCHA LCSPEGCWGPEPRDCVSCRNVSRGRECVDKCKLLEGEPREFVENSECIQCHPECLPQA MNITCTGRGPDNCIQCAHYIDGPHCVKTCPAGVMGENNTLVWKYADAGHVCHLCHPNC TYGCTGPGLEGCPTNGPKIPSIATGMVGALLLLLVVALGIGLFMRRRHIVRKRTLRRL LQERELVEPLTPSGEAPNQALLRILKETEFKKIKVLGSGAFGTVYKGLWIPEGEKVKI PVAIKELREATSPKANKEILDEAYVMASVDNPHVCRLLGICLTSTVQLITQLMPFGCL LDYVREHKDNIGSQYLLNWCVQIAKGMNYLEDRRLVHRDLAARNVLVKTPQHVKITDF GLAKLLGAEEKEYHAEGGKVPIKWMALESILHRIYTHQSDVWSYGVTVWELMTFGSKP YDGIPASEISSILEKGERLPQPPICTIDVYMIMVKCWMIDADSRPKFRELIIEFSKMA RDPQRYLVIQGDERMHLPSPTDSNFYRALMDEEDMDDVVDADEYLIPQQGFFSSPSTS RTPLLSSLSATSNNSTVACIDRNGLQSCPIKEDSFLQRYSSDPTGALTEDSIDDTFLP VPEYINQSVPKRPAGSVQNPVYHNQPLNPAPSRDPHYQDPHSTAVGNPEYLNTVQPTC VNSTFDSPAHWAQKGSHQISLDNPDYQQDFFPKEAKPNGIFKGSTAENAEYLRVAPQS SEFIGA" misc_feature 259..2127 /note="EGF extracellular domain" misc_feature 568..576 /note="Asn-linked glycosylation site" misc_feature 709..717 /note="Asn-linked glycosylation site" misc_feature 772..780 /note="Asn-linked glycosylation site" misc_feature 997..1006 /note="Asn-linked glycosylation site" misc_feature 1267..1275 /note="Asn-linked glycosylation site" misc_feature 1290..1298 /note="Asn-linked glycosylation site" misc_feature 1423..1431 /note="Asn-linked glycosylation site" misc_feature 1506..1514 /note="Asn-linked glycosylation site" misc_feature 1718..1726 /note="Asn-linked glycosylation site" misc_feature 1838..1846 /note="Asn-linked glycosylation site" misc_feature 1924..3732 /note="v-erb B homology" misc_feature 1993..2001 /note="Asn-linked glycosylation site" misc_feature 2053..2061 /note="Asn-linked glycosylation site" misc_feature 2128..2190 /note="EGF transmembrane region" misc_feature 2191..3816 /note="EGF cytoplasmatic domain" misc_feature 3313..3321 /note="Asn-linked glycosylation site" misc_feature 3316..3324 /note="Asn-linked glycosylation site" misc_feature 3466..3474 /note="Asn-linked glycosylation site" misc_feature 3628..3636 /note="Asn-linked glycosylation site" misc_feature 5512..5517 /note="polyadenylation signal" polyA_site 5532 /note="polyadenylation site" BASE COUNT 1472 a 1484 c 1337 g 1239 t ORIGIN 1 gccgcgctgc gccggagtcc cgagctagcc ccggcgccgc cgccgcccag accggacgac 61 aggccacctc gtcggcgtcc gcccgagtcc ccgcctcgcc gccaacgcca caaccaccgc 121 gcacggcccc ctgactccgt ccagtattga tcgggagagc cggagcgagc tcttcgggga 181 gcagcgatgc gaccctccgg gacggccggg gcagcgctcc tggcgctgct ggctgcgctc 241 tgcccggcga gtcgggctct ggaggaaaag aaagtttgcc aaggcacgag taacaagctc 301 acgcagttgg gcacttttga agatcatttt ctcagcctcc agaggatgtt caataactgt 361 gaggtggtcc ttgggaattt ggaaattacc tatgtgcaga ggaattatga tctttccttc 421 ttaaagacca tccaggaggt ggctggttat gtcctcattg ccctcaacac agtggagcga 481 attcctttgg aaaacctgca gatcatcaga ggaaatatgt actacgaaaa ttcctatgcc 541 ttagcagtct tatctaacta tgatgcaaat aaaaccggac tgaaggagct gcccatgaga 601 aatttacagg aaatcctgca tggcgccgtg cggttcagca acaaccctgc cctgtgcaac 661 gtggagagca tccagtggcg ggacatagtc agcagtgact ttctcagcaa catgtcgatg 721 gacttccaga accacctggg cagctgccaa aagtgtgatc caagctgtcc caatgggagc 781 tgctggggtg caggagagga gaactgccag aaactgacca aaatcatctg tgcccagcag 841 tgctccgggc gctgccgtgg caagtccccc agtgactgct gccacaacca gtgtgctgca 901 ggctgcacag gcccccggga gagcgactgc ctggtctgcc gcaaattccg agacgaagcc 961 acgtgcaagg acacctgccc cccactcatg ctctacaacc ccaccacgta ccagatggat 1021 gtgaaccccg agggcaaata cagctttggt gccacctgcg tgaagaagtg tccccgtaat 1081 tatgtggtga cagatcacgg ctcgtgcgtc cgagcctgtg gggccgacag ctatgagatg 1141 gaggaagacg gcgtccgcaa gtgtaagaag tgcgaagggc cttgccgcaa agtgtgtaac 1201 ggaataggta ttggtgaatt taaagactca ctctccataa atgctacgaa tattaaacac 1261 ttcaaaaact gcacctccat cagtggcgat ctccacatcc tgccggtggc atttaggggt 1321 gactccttca cacatactcc tcctctggat ccacaggaac tggatattct gaaaaccgta 1381 aaggaaatca cagggttttt gctgattcag gcttggcctg aaaacaggac ggacctccat 1441 gcctttgaga acctagaaat catacgcggc aggaccaagc aacatggtca gttttctctt 1501 gcagtcgtca gcctgaacat aacatccttg ggattacgct ccctcaagga gataagtgat 1561 ggagatgtga taatttcagg aaacaaaaat ttgtgctatg caaatacaat aaactggaaa 1621 aaactgtttg ggacctccgg tcagaaaacc aaaattataa gcaacagagg tgaaaacagc 1681 tgcaaggcca caggccaggt ctgccatgcc ttgtgctccc ccgagggctg ctggggcccg 1741 gagcccaggg actgcgtctc ttgccggaat gtcagccgag gcagggaatg cgtggacaag 1801 tgcaagcttc tggagggtga gccaagggag tttgtggaga actctgagtg catacagtgc 1861 cacccagagt gcctgcctca ggccatgaac atcacctgca caggacgggg accagacaac 1921 tgtatccagt gtgcccacta cattgacggc ccccactgcg tcaagacctg cccggcagga 1981 gtcatgggag aaaacaacac cctggtctgg aagtacgcag acgccggcca tgtgtgccac 2041 ctgtgccatc caaactgcac ctacggatgc actgggccag gtcttgaagg ctgtccaacg 2101 aatgggccta agatcccgtc catcgccact gggatggtgg gggccctcct cttgctgctg 2161 gtggtggccc tggggatcgg cctcttcatg cgaaggcgcc acatcgttcg gaagcgcacg 2221 ctgcggaggc tgctgcagga gagggagctt gtggagcctc ttacacccag tggagaagct 2281 cccaaccaag ctctcttgag gatcttgaag gaaactgaat tcaaaaagat caaagtgctg 2341 ggctccggtg cgttcggcac ggtgtataag ggactctgga tcccagaagg tgagaaagtt 2401 aaaattcccg tcgctatcaa ggaattaaga gaagcaacat ctccgaaagc caacaaggaa 2461 atcctcgatg aagcctacgt gatggccagc gtggacaacc cccacgtgtg ccgcctgctg 2521 ggcatctgcc tcacctccac cgtgcaactc atcacgcagc tcatgccctt cggctgcctc 2581 ctggactatg tccgggaaca caaagacaat attggctccc agtacctgct caactggtgt 2641 gtgcagatcg caaagggcat gaactacttg gaggaccgtc gcttggtgca ccgcgacctg 2701 gcagccagga acgtactggt gaaaacaccg cagcatgtca agatcacaga ttttgggctg 2761 gccaaactgc tgggtgcgga agagaaagaa taccatgcag aaggaggcaa agtgcctatc 2821 aagtggatgg cattggaatc aattttacac agaatctata cccaccagag tgatgtctgg 2881 agctacgggg tgaccgtttg ggagttgatg acctttggat ccaagccata tgacggaatc 2941 cctgccagcg agatctcctc catcctggag aaaggagaac gcctccctca gccacccata 3001 tgtaccatcg atgtctacat gatcatggtc aagtgctgga tgatagacgc agatagtcgc 3061 ccaaagttcc gtgagttgat catcgaattc tccaaaatgg cccgagaccc ccagcgctac 3121 cttgtcattc agggggatga aagaatgcat ttgccaagtc ctacagactc caacttctac 3181 cgtgccctga tggatgaaga agacatggac gacgtggtgg atgccgacga gtacctcatc 3241 ccacagcagg gcttcttcag cagcccctcc acgtcacgga ctcccctcct gagctctctg 3301 agtgcaacca gcaacaattc caccgtggct tgcattgata gaaatgggct gcaaagctgt 3361 cccatcaagg aagacagctt cttgcagcga tacagctcag accccacagg cgccttgact 3421 gaggacagca tagacgacac cttcctccca gtgcctgaat acataaacca gtccgttccc 3481 aaaaggcccg ctggctctgt gcagaatcct gtctatcaca atcagcctct gaaccccgcg 3541 cccagcagag acccacacta ccaggacccc cacagcactg cagtgggcaa ccccgagtat 3601 ctcaacactg tccagcccac ctgtgtcaac agcacattcg acagccctgc ccactgggcc 3661 cagaaaggca gccaccaaat tagcctggac aaccctgact accagcagga cttctttccc 3721 aaggaagcca agccaaatgg catctttaag ggctccacag ctgaaaatgc agaataccta 3781 agggtcgcgc cacaaagcag tgaatttatt ggagcatgac cacggaggat agtatgagcc 3841 ctaaaaatcc agactctttc gatacccagg accaagccac agcaggtcct ccatcccaac 3901 agccatgccc gcattagctc ttagacccac agactggttt tgcaacgttt acaccgacta 3961 gccaggaagt acttccacct cgggcacatt ttgggaagtt gcattccttt gtcttcaaac 4021 tgtgaagcat ttacagaaac gcatccagca agaatattgt ccctttgagc agaaatttat 4081 ctttcaaaga ggtatatttg aaaaaaaaaa aaaaagtata tgtgaggatt tttattgatt 4141 ggggatcttg gagtttttca ttgtcgctat tgatttttac ttcaatgggc tcttccaaca 4201 aggaagaagc ttgctggtag cacttgctac cctgagttca tccaggccca actgtgagca 4261 aggagcacaa gccacaagtc ttccagagga tgcttgattc cagtggttct gcttcaaggc 4321 ttccactgca aaacactaaa gatccaagaa ggccttcatg gccccagcag gccggatcgg 4381 tactgtatca agtcatggca ggtacagtag gataagccac tctgtccctt cctgggcaaa 4441 gaagaaacgg aggggatgaa ttcttcctta gacttacttt tgtaaaaatg tccccacggt 4501 acttactccc cactgatgga ccagtggttt ccagtcatga gcgttagact gacttgtttg 4561 tcttccattc cattgttttg aaactcagta tgccgcccct gtcttgctgt catgaaatca 4621 gcaagagagg atgacacatc aaataataac tcggattcca gcccacattg gattcatcag 4681 catttggacc aatagcccac agctgagaat gtggaatacc taaggataac accgcttttg 4741 ttctcgcaaa aacgtatctc ctaatttgag gctcagatga aatgcatcag gtcctttggg 4801 gcatagatca gaagactaca aaaatgaagc tgctctgaaa tctcctttag ccatcacccc 4861 aaccccccaa aattagtttg tgttacttat ggaagatagt tttctccttt tacttcactt 4921 caaaagcttt ttactcaaag agtatatgtt ccctccaggt cagctgcccc caaaccccct 4981 ccttacgctt tgtcacacaa aaagtgtctc tgccttgagt catctattca agcacttaca 5041 gctctggcca caacagggca ttttacaggt gcgaatgaca gtagcattat gagtagtgtg 5101 aattcaggta gtaaatatga aactagggtt tgaaattgat aatgctttca caacatttgc 5161 agatgtttta gaaggaaaaa agttccttcc taaaataatt tctctacaat tggaagattg 5221 gaagattcag ctagttagga gcccattttt tcctaatctg tgtgtgccct gtaacctgac 5281 tggttaacag cagtcctttg taaacagtgt tttaaactct cctagtcaat atccacccca 5341 tccaatttat caaggaagaa atggttcaga aaatattttc agcctacagt tatgttcagt 5401 cacacacaca tacaaaatgt tccttttgct tttaaagtaa tttttgactc ccagatcagt 5461 cagagcccct acagcattgt taagaaagta tttgattttt gtctcaatga aaataaaact 5521 atattcattt cc // LOCUS HSEGFRER 4871 bp RNA PRI 21-MAR-1995 DEFINITION Human mRNA for kidney epidermal growth factor (EGF) precursor. ACCESSION X04571 NID g31120 KEYWORDS epidermal growth factor; glycoprotein; growth factor; membrane protein; signal peptide. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4871) AUTHORS Bell,G.I., Fong,N.M., Stempien,M.M., Wormsted,M.A., Caput,D., Ku,L.L., Urdea,M.S., Rall,L.B. and Sanchez-Pescador,R. TITLE Human epidermal growth factor precursor: cDNA sequence, expression in vitro and gene organization JOURNAL Nucleic Acids Res. 14 (21), 8427-8446 (1986) MEDLINE 87066721 REFERENCE 2 (bases 4242 to 4243) AUTHORS Bell,G.I. TITLE Direct Submission JOURNAL Submitted (21-MAY-1987) to the EMBL/GenBank/DDBJ databases FEATURES Location/Qualifiers source 1..4871 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="kidney" CDS 437..4060 /note="precursor polypeptide (AA -22 to 1185)" /codon_start=1 /db_xref="PID:g31121" /db_xref="SWISS-PROT:P01133" /translation="MLLTLIILLPVVSKFSFVSLSAPQHWSCPEGTLAGNGNSTCVGP APFLIFSHGNSIFRIDTEGTNYEQLVVDAGVSVIMDFHYNEKRIYWVDLERQLLQRVF LNGSRQERVCNIEKNVSGMAINWINEEVIWSNQQEGIITVTDMKGNNSHILLSALKYP ANVAVDPVERFIFWSSEVAGSLYRADLDGVGVKALLETSEKITAVSLDVLDKRLFWIQ YNREGSNSLICSCDYDGGSVHISKHPTQHNLFAMSLFGDRIFYSTWKMKTIWIANKHT GKDMVRINLHSSFVPLGELKVVHPLAQPKAEDDTWEPEQKLCKLRKGNCSSTVCGQDL QSHLCMCAEGYALSRDRKYCEDVNECAFWNHGCTLGCKNTPGSYYCTCPVGFVLLPDG KRCHQLVSCPRNVSECSHDCVLTSEGPLCFCPEGSVLERDGKTCSGCSSPDNGGCSQL CVPLSPVSWECDCFPGYDLQLDEKSCAASGPQPFLLFANSQDIRHMHFDGTDYGTLLS QQMGMVYALDHDPVENKIYFAHTALKWIERANMDGSQRERLIEEGVDVPEGLAVDWIG RRFYWTDRGKSLIGRSDLNGKRSKIITKENISQPRGIAVHPMAKRLFWTDTGINPRIE SSSLQGLGRLVIASSDLIWPSGITIDFLTDKLYWCDAKQSVIEMANLDGSKRRRLTQN DVGHPFAVAVFEDYVWFSDWAMPSVIRVNKRTGKDRVRLQGSMLKPSSLVVVHPLAKP GADPCLYQNGGCEHICKKRLGTAWCSCREGFMKASDGKTCLALDGHQLLAGGEVDLKN QVTPLDILSKTRVSEDNITESQHMLVAEIMVSDQDDCAPVGCSMYARCISEGEDATCQ CLKGFAGDGKLCSDIDECEMGVPVCPPASSKCINTEGGYVCRCSEGYQGDGIHCLDID ECQLGVHSCGENASCTNTEGGYTCMCAGRLSEPGLICPDSTPPPHLREDDHHYSVRNS DSECPLSHDGYCLHDGVCMYIEALDKYACNCVVGYIGERCQYRDLKWWELRHAGHGQQ QKVIVVAVCVVVLVMLLLLSLWGAHYYRTQKLLSKNPKNPYEESSRDVRSRRPADTED GMSSCPQPWFVVIKEHQDLKNGGQPVAGEDGQAADGSMQPTSWRQEPQLCGMGTEQGC WIPVSSDKGSCPQVMERSFHMPSYGTQTLEGGVEKPHSLLSANPLWQQRALDPPHQME LTQ" sig_peptide 437..502 /note="pot. signal peptide (AA -22 to -1)" misc_feature 437..3532 /note="put. extracellular domain (AA 1 to 1010)" misc_feature 548..556 /note="pot. N-glycosylation site" misc_feature 746..754 /note="pot. N-glycosylation site" misc_feature 785..793 /note="pot. N-glycosylation site" misc_feature 878..886 /note="pot. N-glycosylation site" misc_feature 1406..1414 /note="pot. N-glycosylation site" misc_feature 1646..1654 /note="pot. N-glycosylation site" misc_feature 2222..2230 /note="pot. N-glycosylation site" misc_feature 2879..2887 /note="pot. N-glycosylation site" misc_feature 3212..3220 /note="pot. N-glycosylation site" mat_peptide 3347..3505 /note="mature EGF (AA 949-1001)" misc_feature 3533..3607 /note="transmembrane domain (AA 1011-1035)" misc_feature 3608..4057 /note="cytoplasmic domain (AA 1036-1185)" old_sequence 4242..4243 /note="gc was cg in [1]" /citation=[1] misc_feature 4852..4857 /note="pot. polyA signal" BASE COUNT 1356 a 974 c 1187 g 1354 t ORIGIN 1 gggagaggaa tcgtatctcc atatttcttc tttcagcccc aatccaaggg ttgtagctgg 61 aactttccat cagttcttcc tttctttttc ctctctaagc ctttgccttg ctctgtcaca 121 gtgaagtcag ccagagcagg gctgttaaac tctgtgaaat ttgtcataag ggtgtcaggt 181 atttcttact ggcttccaaa gaaacataga taaagaaatc tttcctgtgg cttcccttgg 241 caggctgcat tcagaaggtc tctcagttga agaaagagct tggaggacaa cagcacaaca 301 ggagagtaaa agatgcccca gggctgaggc ctccgctcag gcagccgcat ctggggtcaa 361 tcatactcac cttgcccggg ccatgctcca gcaaaatcaa gctgttttct tttgaaagtt 421 caaactcatc aagattatgc tgctcactct tatcattctg ttgccagtag tttcaaaatt 481 tagttttgtt agtctctcag caccgcagca ctggagctgt cctgaaggta ctctcgcagg 541 aaatgggaat tctacttgtg tgggtcctgc acccttctta attttctccc atggaaatag 601 tatctttagg attgacacag aaggaaccaa ttatgagcaa ttggtggtgg atgctggtgt 661 ctcagtgatc atggattttc attataatga gaaaagaatc tattgggtgg atttagaaag 721 acaacttttg caaagagttt ttctgaatgg gtcaaggcaa gagagagtat gtaatataga 781 gaaaaatgtt tctggaatgg caataaattg gataaatgaa gaagttattt ggtcaaatca 841 acaggaagga atcattacag taacagatat gaaaggaaat aattcccaca ttcttttaag 901 tgctttaaaa tatcctgcaa atgtagcagt tgatccagta gaaaggttta tattttggtc 961 ttcagaggtg gctggaagcc tttatagagc agatctcgat ggtgtgggag tgaaggctct 1021 gttggagaca tcagagaaaa taacagctgt gtcattggat gtgcttgata agcggctgtt 1081 ttggattcag tacaacagag aaggaagcaa ttctcttatt tgctcctgtg attatgatgg 1141 aggttctgtc cacattagta aacatccaac acagcataat ttgtttgcaa tgtccctttt 1201 tggtgaccgt atcttctatt caacatggaa aatgaagaca atttggatag ccaacaaaca 1261 cactggaaag gacatggtta gaattaacct ccattcatca tttgtaccac ttggtgaact 1321 gaaagtagtg catccacttg cacaacccaa ggcagaagat gacacttggg agcctgagca 1381 gaaactttgc aaattgagga aaggaaactg cagcagcact gtgtgtgggc aagacctcca 1441 gtcacacttg tgcatgtgtg cagagggata cgccctaagt cgagaccgga agtactgtga 1501 agatgttaat gaatgtgctt tttggaatca tggctgtact cttgggtgta aaaacacccc 1561 tggatcctat tactgcacgt gccctgtagg atttgttctg cttcctgatg ggaaacgatg 1621 tcatcaactt gtttcctgtc cacgcaatgt gtctgaatgc agccatgact gtgttctgac 1681 atcagaaggt cccttatgtt tctgtcctga aggctcagtg cttgagagag atgggaaaac 1741 atgtagcggt tgttcctcac ccgataatgg tggatgtagc cagctctgcg ttcctcttag 1801 cccagtatcc tgggaatgtg attgctttcc tgggtatgac ctacaactgg atgaaaaaag 1861 ctgtgcagct tcaggaccac aaccattttt gctgtttgcc aattctcaag atattcgaca 1921 catgcatttt gatggaacag actatggaac tctgctcagc cagcagatgg gaatggttta 1981 tgccctagat catgaccctg tggaaaataa gatatacttt gcccatacag ccctgaagtg 2041 gatagagaga gctaatatgg atggttccca gcgagaaagg cttattgagg aaggagtaga 2101 tgtgccagaa ggtcttgctg tggactggat tggccgtaga ttctattgga cagacagagg 2161 gaaatctctg attggaagga gtgatttaaa tgggaaacgt tccaaaataa tcactaagga 2221 gaacatctct caaccacgag gaattgctgt tcatccaatg gccaagagat tattctggac 2281 tgatacaggg attaatccac gaattgaaag ttcttccctc caaggccttg gccgtctggt 2341 tatagccagc tctgatctaa tctggcccag tggaataacg attgacttct taactgacaa 2401 gttgtactgg tgcgatgcca agcagtctgt gattgaaatg gccaatctgg atggttcaaa 2461 acgccgaaga cttacccaga atgatgtagg tcacccattt gctgtagcag tgtttgagga 2521 ttatgtgtgg ttctcagatt gggctatgcc atcagtaata agagtaaaca agaggactgg 2581 caaagataga gtacgtctcc aaggcagcat gctgaagccc tcatcactgg ttgtggttca 2641 tccattggca aaaccaggag cagatccctg cttatatcaa aacggaggct gtgaacatat 2701 ttgcaaaaag aggcttggaa ctgcttggtg ttcgtgtcgt gaaggtttta tgaaagcctc 2761 agatgggaaa acgtgtctgg ctctggatgg tcatcagctg ttggcaggtg gtgaagttga 2821 tctaaagaac caagtaacac cattggacat cttgtccaag actagagtgt cagaagataa 2881 cattacagaa tctcaacaca tgctagtggc tgaaatcatg gtgtcagatc aagatgactg 2941 tgctcctgtg ggatgcagca tgtatgctcg gtgtatttca gagggagagg atgccacatg 3001 tcagtgtttg aaaggatttg ctggggatgg aaaactatgt tctgatatag atgaatgtga 3061 gatgggtgtc ccagtgtgcc cccctgcctc ctccaagtgc atcaacaccg aaggtggtta 3121 tgtctgccgg tgctcagaag gctaccaagg agatgggatt cactgtcttg atattgatga 3181 gtgccaactg ggggtgcaca gctgtggaga gaatgccagc tgcacaaata cagagggagg 3241 ctatacctgc atgtgtgctg gacgcctgtc tgaaccagga ctgatttgcc ctgactctac 3301 tccaccccct cacctcaggg aagatgacca ccactattcc gtaagaaata gtgactctga 3361 atgtcccctg tcccacgatg ggtactgcct ccatgatggt gtgtgcatgt atattgaagc 3421 attggacaag tatgcatgca actgtgttgt tggctacatc ggggagcgat gtcagtaccg 3481 agacctgaag tggtgggaac tgcgccacgc tggccacggg cagcagcaga aggtcatcgt 3541 ggtggctgtc tgcgtggtgg tgcttgtcat gctgctcctc ctgagcctgt ggggggccca 3601 ctactacagg actcagaagc tgctatcgaa aaacccaaag aatccttatg aggagtcgag 3661 cagagatgtg aggagtcgca ggcctgctga cactgaggat gggatgtcct cttgccctca 3721 accttggttt gtggttataa aagaacacca agacctcaag aatgggggtc aaccagtggc 3781 tggtgaggat ggccaggcag cagatgggtc aatgcaacca acttcatgga ggcaggagcc 3841 ccagttatgt ggaatgggca cagagcaagg ctgctggatt ccagtatcca gtgataaggg 3901 ctcctgtccc caggtaatgg agcgaagctt tcatatgccc tcctatggga cacagaccct 3961 tgaagggggt gtcgagaagc cccattctct cctatcagct aacccattat ggcaacaaag 4021 ggccctggac ccaccacacc aaatggagct gactcagtga aaactggaat taaaaggaaa 4081 gtcaagaaga atgaactatg tcgatgcaca gtatcttttc tttcaaaagt agagcaaaac 4141 tataggtttt ggttccacaa tctctacgac taatcaccta ctcaatgcct ggagacagat 4201 acgtagttgt gcttttgttt gctcttttaa gcagtctcac tgcagtctta tttccaagta 4261 agagtactgg gagaatcact aggtaactta ttagaaaccc aaattgggac aacagtgctt 4321 tgtaaattgt gttgtcttca gcagtcaata caaatagatt tttgtttttg ttgttcctgc 4381 agccccagaa gaaattaggg gttaaagcag acagtcacac tggtttggtc agttacaaag 4441 taatttcttt gatctggaca gaacatttat atcagtttca tgaaatgatt ggaatattac 4501 aataccgtta agatacagtg taggcattta actcctcatt ggcgtggtcc atgctgatga 4561 ttttgccaaa atgagttgtg atgaatcaat gaaaaatgta atttagaaac tgatttcttc 4621 agaattagat ggccttattt tttaaaatat ttgaatgaaa acattttatt tttaaaatat 4681 tacacaggag gccttcggag tttcttagtc attactgtcc ttttccccta cagaattttc 4741 cctcttggtg tgattgcaca gaatttgtat gtattttcag ttacaagatt gtaagtaaat 4801 tgcctgattt gttttcatta tagacaacga tgaatttctt ctaattattt aaataaaatc 4861 accaaaaaca t // LOCUS HSEGR1 3132 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for early growth response protein 1 (hEGR1). ACCESSION X52541 NID g31129 KEYWORDS early growth response gene; human early growth response gene 1. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3132) AUTHORS Sukhatme,V.P. TITLE Direct Submission JOURNAL Submitted (09-MAR-1990) Sukhatme V.P., University of Chicago, 5841 South Maryland Avenue, Box 391, Chicago Illinois 60637, U S A REFERENCE 2 (bases 1 to 3132) AUTHORS Suggs,S.V., Katzowitz,J.L., Tsai-Morris,C. and Sukhatme,V.P. TITLE cDNA sequence of the human cellular early growth response gene Egr-1 JOURNAL Nucleic Acids Res. 18 (14), 4283 (1990) MEDLINE 90332455 COMMENT See for human early growth response protein 4 mRNA. Data kindly reviewed (07-SEP-1990) by Sukhatme V.P. FEATURES Location/Qualifiers source 1..3132 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="foreskin" /cell_type="fibroblast 303" /clone_lib="lambda-ZAP" /clone="hEGR1.364" /chromosome="5q23-31" CDS 271..1902 /note="early growth response protein 1 (AA 1-543)" /codon_start=1 /db_xref="PID:g31130" /db_xref="SWISS-PROT:P18146" /translation="MAAAKAEMQLMSPLQISDPFGSFPHSPTMDNYPKLEEMMLLSNG APQFLGAAGAPEGSGSNSSSSSSGGGGGGGGGSNSSSSSSTFNPQADTGEQPYEHLTA ESFPDISLNNEKVLVETSYPSQTTRLPPITYTGRFSLEPAPNSGNTLWPEPLFSLVSG LVSMTNPPASSSSAPSPAASSASASQSPPLSCAVPSNDSSPIYSAAPTFPTPNTDIFP EPQSQAFPGSAGTALQYPPPAYPAAKGGFQVPMIPDYLFPQQQGDLGLGTPDQKPFQG LESRTQQPSLTPLSTIKAFATQSGSQDLKALNTSYQSQLIKPSRMRKYPNRPSKTPPH ERPYACPVESCDRRFSRSDELTRHIRIHTGQKPFQCRICMRNFSRSDHLTTHIRTHTG EKPFACDICGRKFARSDERKRHTKIHLRQKDKKADKSVVASSATSSLSSYPSPVATSY PSPVTTSYPSPATTSYPSPVPTSFSSPGSSTYPSPVHSGFPSPSVATTYSSVPPAFPA QVSSFPSSAVTNSFSASTGLSDMTATFSPRTIEIC" polyA_site 3132 /note="polyadenylation site" BASE COUNT 687 a 1004 c 730 g 711 t ORIGIN 1 ccgcagaact tggggagccg ccgccgccat ccgccgccgc agccagcttc cgccgccgca 61 ggaccggccc ctgccccagc ctccgcagcc gcggcgcgtc cacgcccgcc cgcgcccagg 121 gcgagtcggg gtcgccgcct gcacgcttct cagtgttccc cgcgccccgc atgtaacccg 181 gccaggcccc cgcaacggtg tcccctgcag ctccagcccc gggctgcacc cccccgcccc 241 gacaccagct ctccagcctg ctcgtccagg atggccgcgg ccaaggccga gatgcagctg 301 atgtccccgc tgcagatctc tgacccgttc ggatcctttc ctcactcgcc caccatggac 361 aactacccta agctggagga gatgatgctg ctgagcaacg gggctcccca gttcctcggc 421 gccgccgggg ccccagaggg cagcggcagc aacagcagca gcagcagcag cgggggcggt 481 ggaggcggcg ggggcggcag caacagcagc agcagcagca gcaccttcaa ccctcaggcg 541 gacacgggcg agcagcccta cgagcacctg accgcagagt cttttcctga catctctctg 601 aacaacgaga aggtgctggt ggagaccagt taccccagcc aaaccactcg actgcccccc 661 atcacctata ctggccgctt ttccctggag cctgcaccca acagtggcaa caccttgtgg 721 cccgagcccc tcttcagctt ggtcagtggc ctagtgagca tgaccaaccc accggcctcc 781 tcgtcctcag caccatctcc agcggcctcc tccgcctccg cctcccagag cccacccctg 841 agctgcgcag tgccatccaa cgacagcagt cccatttact cagcggcacc caccttcccc 901 acgccgaaca ctgacatttt ccctgagcca caaagccagg ccttcccggg ctcggcaggg 961 acagcgctcc agtacccgcc tcctgcctac cctgccgcca agggtggctt ccaggttccc 1021 atgatccccg actacctgtt tccacagcag cagggggatc tgggcctggg caccccagac 1081 cagaagccct tccagggcct ggagagccgc acccagcagc cttcgctaac ccctctgtct 1141 actattaagg cctttgccac tcagtcgggc tcccaggacc tgaaggccct caataccagc 1201 taccagtccc agctcatcaa acccagccgc atgcgcaagt atcccaaccg gcccagcaag 1261 acgccccccc acgaacgccc ttacgcttgc ccagtggagt cctgtgatcg ccgcttctcc 1321 cgctccgacg agctcacccg ccacatccgc atccacacag gccagaagcc cttccagtgc 1381 cgcatctgca tgcgcaactt cagccgcagc gaccacctca ccacccacat ccgcacccac 1441 acaggcgaaa agcccttcgc ctgcgacatc tgtggaagaa agtttgccag gagcgatgaa 1501 cgcaagaggc ataccaagat ccacttgcgg cagaaggaca agaaagcaga caaaagtgtt 1561 gtggcctctt cggccacctc ctctctctct tcctacccgt ccccggttgc tacctcttac 1621 ccgtccccgg ttactacctc ttatccatcc ccggccacca cctcataccc atcccctgtg 1681 cccacctcct tctcctctcc cggctcctcg acctacccat cccctgtgca cagtggcttc 1741 ccctccccgt cggtggccac cacgtactcc tctgttcccc ctgctttccc ggcccaggtc 1801 agcagcttcc cttcctcagc tgtcaccaac tccttcagcg cctccacagg gctttcggac 1861 atgacagcaa ccttttctcc caggacaatt gaaatttgct aaagggaaag gggaaagaaa 1921 gggaaaaggg agaaaaagaa acacaagaga cttaaaggac aggaggagga gatggccata 1981 ggagaggagg gttcctctta ggtcagatgg aggttctcag agccaagtcc tccctctcta 2041 ctggagtgga aggtctattg gccaacaatc ctttctgccc acttcccctt ccccaattac 2101 tattcccttt gacttcagct gcctgaaaca gccatgtcca agttcttcac ctctatccaa 2161 agaacttgat ttgcatggat tttggataaa tcatttcagt atcatctcca tcatatgcct 2221 gaccccttgc tcccttcaat gctagaaaat cgagttggca aaatggggtt tgggcccctc 2281 agagccctgc cctgcaccct tgtacagtgt ctgtgccatg gatttcgttt ttcttggggt 2341 actcttgatg tgaagataat ttgcatattc tattgtatta tttggagtta ggtcctcact 2401 tgggggaaaa aaaaaaaaaa aagccaagca aaccaatggt gatcctctat tttgtgatga 2461 tgctgtgaca ataagtttga accttttttt ttgaaacagc agtcccagta ttctcagagc 2521 atgtgtcaga gtgttgttcc gttaaccttt ttgtaaatac tgcttgaccg tactctcaca 2581 tgtggcaaaa tatggtttgg tttttctttt ttttttttga aagtgttttt tcttcgtcct 2641 tttggtttaa aaagtttcac gtcttggtgc cttttgtgtg atgccccttg ctgatggctt 2701 gacatgtgca attgtgaggg acatgctcac ctctagcctt aaggggggca gggagtgatg 2761 atttggggga ggctttggga gcaaaataag gaagagggct gagctgagct tcggttctcc 2821 agaatgtaag aaaacaaaat ctaaaacaaa atctgaactc tcaaaagtct atttttttaa 2881 ctgaaaatgt aaatttataa atatattcag gagttggaat gttgtagtta cctactgagt 2941 aggcggcgat ttttgtatgt tatgaacatg cagttcatta ttttgtggtt ctattttact 3001 ttgtacttgt gtttgcttaa acaaagtgac tgtttggctt ataaacacat tgaatgcgct 3061 ttattgccca tgggatatgt ggtgtatatc cttccaaaaa attaaaacga aaataaagta 3121 gctgcgattg gg // LOCUS HSEHK1 3903 bp RNA PRI 12-AUG-1997 DEFINITION H.sapiens mRNA for EHK-1 receptor tyrosine kinase. ACCESSION X95425 NID g1177465 KEYWORDS EHK-1; receptor tyrosine kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3903) AUTHORS Miescher,G.C., Luetzelschwab,R., Erne,B., Ferracin,F., Huber,S. and Steck,A.J. TITLE Reciprocal Expression of Myelin Associated Glycoprotein Splice Variants in the Adult Human Peripheral and Central Nervous Systems JOURNAL Mol. Brain Res. In press REFERENCE 2 (bases 1 to 3903) AUTHORS Miescher Constant,G. TITLE Direct Submission JOURNAL Submitted (26-JAN-1996) G. Miescher Constant, University Hospitals Basel, Department of Research, Departement Forschung, Kantonsspital Basel, 4031 Basel, Switzerland COMMENT Partial human EHK-1 cDNA without information on mRNA splicing variants has been published by Fox, G.M. et al. (1995). Oncogene 10:897-905. Overlaps with L36642-L36645. FEATURES Location/Qualifiers source 1..3903 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda ZAPII #936206" /clone="HFB415" /clone="HFB115" /clone="HFB129" /dev_stage="embryo" /tissue_type="brain" /cell_type="CNS neurons" /lab_host="XL1-blue" /chromosome="4" /map="q12" sig_peptide 712..783 CDS 712..3825 /codon_start=1 /product="EHK-1 receptor tyrosine kinase" /db_xref="PID:e220355" /db_xref="PID:g1177466" /db_xref="SWISS-PROT:P54756" /translation="MRGSGPRGAGHRRPPSGGGDTPITPASLAGCYSAPRRAPLWTCL LLCAALRTLLASPSNEVNLLDSRTVMGDLGWIAFPKNGWEEIGEVDENYAPIHTYQVC KVMEQNQNNWLLTSWISNEGASRIFIELKFTLRDCNSLPGGLGTCKETFNMYYFESDD QNGRNIKENQYIKIDTIAADESFTELDLGDRVMKLNTEVRDVGPLSKKGFYLAFQDVG ACIALVSVRVYYKKCPSVVRHLAVFPDTITGADSSQLLEVSGSCVNHSVTDEPPKMHC SAEGEWLVPIGKCMCKAGYEEKNGTCQVCRPGFFKASPHIQSCGKCPPHSYTHEEAST SCVCEKDYFRRESDPPTMACTRPPSAPRNAISNVNETSVFLEWIPPADTGGRKDVSYY IACKKCNSHAGVCEECGGHVRYLPRQSGLKNTSVMMVDLLAHTNYTFEIEAVNGVSDL SPGARQYVSVNVTTNQAAPSPVTNVKKGKIAKNSISLSWQEPDRPNGIILEYEIKHFE KDQETSYTIIKSKETTITAEGLKPASVYVFQIRARTAAGYGVFSRRFEFETTPVFAAS SDQSQIPVIAVSVTVGVILLAVVIGVLLSGSCCECGCGRASSLCAVAHPILIWRCGYS KAKQDPEEEKMHFHNGHIKLPGVRTYIDPHTYEDPNQAVHEFAKEIEASCITIERVIG AGEFGEVCSGRLKLPGKRELPVAIKTLKVGYTEKQRRDFLGEASIMGQFDHPNIIHLE GVVTKSKPVMIVTEYMENGSLDTFLKKNDGQFTVIQLVGMLRGISAGMKYLSDMGYVH RDLAARNILINSNLVCKVSDFGLSRVLEDDPEAAYTTRGGKIPIRWTAPEAIAFRKFT SASDVWSYGIVMWEVVSYGERPYWEMTNQDVIKAVEEGYRLPSPMDCPAALYQLMLDC WQKERNSRPKFDEIVNMLDKLIRNPSSLKTLVNASCRVSNLLAEHSPLGSGAYRSVGE WLEAIKMGRYTEIFMENGYSSMDAVAQVTLEDLRRLGVTLVGHQKKIMNSLQEMKVQL VNGMVPL" mat_peptide 784..3822 misc_feature 1622..1777 /note="RNA splice domain I" misc_feature 1778..2113 /note="RNA splice domain IIa" misc_feature 2114..2392 /note="RNA splice domain IIb" misc_feature 2502..2567 /note="RNA splice domain III" BASE COUNT 1091 a 882 c 953 g 977 t ORIGIN 1 aatggtcagt caatacatta taacataata caccaaatgc tagaatagaa ggggaggggg 61 gcacacataa tgactcactg ctggaagaag ggtgcatcag tgaattaaaa aatgtccctc 121 ccctcttcag cactcagcgc gcagctattt ccttctgcca gtctctttga actctggatc 181 tttgcttttg ctcgctgctc tcctgttttt cattctccac attttctcaa tcctctttct 241 ttatccttag ccaccctgct tttttcctcc ttttttaaaa aatcggagat ttcgtcttaa 301 aatgatttgt cttccttacc ttcgtccatt tcaacactga aggctgcaaa gaacttcacc 361 tttcccctag tggtatttaa aaattctcaa tccgtaaaaa gtctttttga aaggcaaagg 421 aacaggaccc agaccctctc gacacccttg atccgagtca gatctgcact agcaaccaga 481 actaatattt catttaaccc accaaaaggg ggaggcgaga ggagccagaa gcaaacttca 541 tctgtctcag acggatccgt ggttcctaca tttggaggag ccgcgtgtca gaaggcgtag 601 gaccccaagg ggggacaagg aggactcccg agtctccctt ctccgctctc cgagaccgaa 661 gaggtggact gagccgctcg ggacagcggc accggaggag gctcggagaa gatgcggggc 721 tcggggcccc ggggtgcggg acaccggcgg cccccaagcg gcggcggcga cacccccatc 781 accccagcgt ccctggccgg ctgctactct gcacctcgac gggctcccct ctggacgtgc 841 cttctcctgt gcgccgcact ccggaccctc ctggccagcc ccagcaacga agtgaattta 901 ttggattcac gcactgtcat gggggacctg ggatggattg cttttccaaa aaatgggtgg 961 gaagagattg gtgaagtgga tgaaaattat gcccctatcc acacatacca agtatgcaaa 1021 gtgatggaac agaatcagaa taactggctt ttgaccagtt ggatctccaa tgaaggtgct 1081 tccagaatct tcatagaact caaatttacc ctgcgggact gcaacagcct tcctggagga 1141 ctggggacct gtaaggaaac ctttaatatg tattactttg agtcagatga tcagaatggg 1201 agaaacatca aggaaaacca atacatcaaa attgatacca ttgctgccga tgaaagcttt 1261 acagaacttg atcttggtga ccgtgttatg aaactgaata cagaggtcag agatgtagga 1321 cctctaagca aaaagggatt ttatcttgct tttcaagatg ttggtgcttg cattgctctg 1381 gtttctgtgc gtgtatacta taaaaaatgc ccttctgtgg tacgacactt ggctgtcttc 1441 cctgacacca tcactggagc tgattcttcc caattgctcg aagtgtcagg ctcctgtgtc 1501 aaccattctg tgaccgatga acctcccaaa atgcactgca gcgccgaagg ggagtggctg 1561 gtgcccatcg ggaaatgcat gtgcaaggca ggatatgaag agaaaaatgg cacctgtcaa 1621 gtgtgcagac ctgggttctt caaagcctca cctcacatcc agagctgcgg caaatgtcca 1681 cctcacagtt atacccatga ggaagcttca acctcttgtg tctgtgaaaa ggattatttc 1741 aggagagagt ctgatccacc cacaatggca tgcacaagac ccccctctgc tcctcggaat 1801 gccatctcaa atgttaatga aactagtgtc tttctggaat ggattccgcc tgctgacact 1861 ggtggaagga aagacgtgtc atattatatt gcatgcaaga agtgcaactc ccatgcaggt 1921 gtgtgtgagg agtgtggcgg tcatgtcagg taccttcccc ggcaaagcgg cctgaaaaac 1981 acctctgtca tgatggtgga tctactcgct cacacaaact atacctttga gattgaggca 2041 gtgaatggag tgtccgactt gagcccagga gcccggcagt atgtgtctgt aaatgtaacc 2101 acaaatcaag cagctccatc tccagtcacc aatgtgaaaa aagggaaaat tgcaaaaaac 2161 agcatctctt tgtcttggca agaaccagat cgtcccaatg gaatcatcct agagtatgaa 2221 atcaagcatt ttgaaaagga ccaagagacc agctacacga ttatcaaatc taaagagaca 2281 actattactg cagagggctt gaaaccagct tcagtttatg tcttccaaat tcgagcacgt 2341 acagcagcag gctatggtgt cttcagtcga agatttgagt ttgaaaccac cccagtgttt 2401 gcagcatcca gcgatcaaag ccagattcct gtaattgctg tgtctgtgac agtaggagtc 2461 attttgttgg cagtggttat cggcgtcctc ctcagtggaa gttgctgcga atgtggctgt 2521 gggagggctt cttccctgtg cgctgttgcc catccaatcc taatatggcg gtgtggctac 2581 agcaaagcaa aacaagatcc agaagaggaa aagatgcatt ttcataatgg gcacattaaa 2641 ctgccaggag taagaactta cattgatcca catacctatg aggatcccaa tcaagctgtc 2701 cacgaatttg ccaaggagat agaagcatca tgtatcacca ttgagagagt tattggagca 2761 ggtgaatttg gtgaagtttg tagtggacgt ttgaaactac caggaaaaag agaattacct 2821 gtggctatca aaacccttaa agtaggctat actgaaaagc aacgcagaga tttcctaggt 2881 gaagcaagta tcatgggaca gtttgatcat cctaacatca tccatttaga aggtgtggtg 2941 accaaaagta aaccagtgat gatcgtgaca gagtatatgg agaatggctc tttagataca 3001 tttttgaaga aaaacgatgg gcagttcact gtgattcagc ttgttggcat gctgagaggt 3061 atctctgcag gaatgaagta cctttctgac atgggctatg tgcatagaga tcttgctgcc 3121 agaaacatct taatcaacag taaccttgtg tgcaaagtgt ctgactttgg actttcccgg 3181 gtactggaag atgatcccga ggcagcctac accacaaggg gaggaaaaat tccaatcaga 3241 tggactgccc cagaagcaat agctttccga aagtttactt ctgccagtga tgtctggagt 3301 tatggaatag taatgtggga agttgtgtct tatggagaga gaccctactg ggagatgacc 3361 aatcaagatg tgattaaagc ggtagaggaa ggctatcgtc tgccaagccc catggattgt 3421 cctgctgctc tctatcagtt aatgctggat tgctggcaga aagagcgaaa tagcaggccc 3481 aagtttgatg aaatagtcaa catgttggac aagctgatac gtaacccaag tagtctgaag 3541 acgctggtta atgcatcctg cagagtatct aatttattgg cagaacatag cccactagga 3601 tctggggcct acagatcagt aggtgaatgg ctagaggcaa tcaagatggg ccggtataca 3661 gagattttca tggaaaatgg atacagttca atggacgctg tggctcaggt gaccttggag 3721 gatttgagac ggcttggagt gactcttgtc ggtcaccaga agaagatcat gaacagcctt 3781 caagaaatga aggtgcagct ggtaaacgga atggtgccat tgtaacttca tgtaaatgtc 3841 gcttcttcaa gtgaatgatt ctgcactttg taaacagcac tgagatttat tttaacaaaa 3901 aaa // LOCUS HSEIF2BAS 1658 bp RNA PRI 23-FEB-1996 DEFINITION H.sapiens mRNA for eIF-2B alpha subunit. ACCESSION X95648 NID g1200231 KEYWORDS eIF-2B alfa subunit. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1658) AUTHORS Torp,A. JOURNAL Unpublished REFERENCE 2 (bases 1 to 1658) AUTHORS Torp,A. TITLE Direct Submission JOURNAL Submitted (13-FEB-1996) A. Torp, Institute of Chemical Physics and Biophysics, Riia 23, EE2400 Tartu, ESTONIA COMMENT related sequences F00367, R27407, H96127, R01373, T95796 and Flowers et al, 1995, Proc.Natl.Acad.Sci.U.S.A. 92, 4274-4278. FEATURES Location/Qualifiers source 1..1658 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="U937-B" /chromosome="12" gene 11..928 /gene="eIF-2B" CDS 11..928 /gene="eIF-2B" /note="alfa subunit" /codon_start=1 /db_xref="PID:e222930" /db_xref="PID:g1200232" /translation="MDDKELIEYFKSQMKEDPDMASAVAAIRTLLEFLKRDKGETIQG LRANLTSAIETLCGVDSSVAVSSGGELFLRFISLASLEYSDYSKCKKIMIERGELFLR RISLSRNKIADLCHTFIKDGATILTHAYSRVVLRVLEAAVAAKKRFSVYVTESQPDLS GKKMAKALCHLNVPVTVVLDAAVGYIMEKADLVIVGAEGVVENGGIINKIGTNQMAVC AKAQNKPFYVVAESFKFVRLFPLNQQDVPDKFKYKADTLKVAQTGQDLKEEHPWVDYT APSLITLLFTDLGVLTPSAVSDELIKLYL" BASE COUNT 476 a 384 c 380 g 418 t ORIGIN 1 gaggacgcct atggacgaca aggagttaat tgaatacttt aagtctcaga tgaaagaaga 61 tcctgacatg gcctcagcag tggctgccat ccggacgttg ctggagttct tgaagagaga 121 taaaggggag acaatccagg gtctgagggc caatctcacc agtgccatag aaaccctgtg 181 tggtgtggac tcctctgtgg cagtgtcctc tggcggggag ctcttcctcc gcttcatcag 241 tcttgcctcc ctggaatact ccgattactc caaatgtaaa aagatcatga ttgagagagg 301 agagcttttt ctcaggagaa tatcactgtc aagaaacaaa attgcagatc tgtgccatac 361 tttcatcaaa gatggagcga caatattgac tcacgcctac tccagagtgg tcctgagagt 421 cctggaagca gccgtggcgg ccaagaagcg atttagtgta tacgtcacag agtcacagcc 481 tgatttgtca ggtaagaaaa tggccaaagc cctctgccac ctcaacgtcc ctgtcactgt 541 ggtgctagat gctgctgtcg gctacatcat ggagaaagca gatcttgtca tagttggtgc 601 tgaaggagtt gttgaaaacg gaggaattat taacaagatt ggaaccaacc agatggctgt 661 gtgtgccaaa gcacagaaca aacctttcta tgtggttgca gaaagtttca agtttgtccg 721 gctctttcca ctaaaccagc aagacgtccc agataagttt aagtataagg cagacactct 781 caaggtcgcg cagactggac aagacctcaa agaggagcat ccgtgggtcg actacactgc 841 cccttcctta atcactctgc tgtttacaga cctgggcgtg ctgacaccct cagcagtcag 901 cgatgagctc atcaagctct atctgtaacc tgtgagccct ttcctgccaa ggtgcagctt 961 acgtagttga ggcagggtga gtagctgctt gacaccccag tgagtcaggc caaaactgag 1021 atgtgtttaa tgaagattta tggagtaagg acttaaaatc atacatcttg gagaaccttt 1081 cttactcatt tcagtcccat ctaaaaatgt gtcagctatt ctaaatccca acttaaattg 1141 ttcttacggt ttctagaaac tttccttttc agtttccaga aatacaagtt agataattgg 1201 ctacttaact gatgaaagat gagcccaagt ccacctgtct tcatcctccc ctgcactcca 1261 gactgatctg cctggggcac gcgagatgca ggcgaaaagc agccacaccc ctctgccaca 1321 aatgaccaac agctggtcag gacgttacac gcggtgcctt gtaagaggca agaaacactt 1381 gccgaatctg catctggctt ccagtggtaa gcacattcct cagcaggatc aagccaaaca 1441 gtaaaaacta ccaagagaac acgaggaagg cagaaacgat gtttagcaac agtattctgc 1501 atggttcact gcttaagaaa atgccttctg gaatatttgt aaactgaaat tctgtatgtg 1561 taagagatgt ttaaatatgt ttgggtccta aacagctttt taaaattata cttgggaata 1621 aattcagcat cttttcaaat aaactctatg aaccacac // LOCUS HSEMAP 3120 bp RNA PRI 24-NOV-1993 DEFINITION H.sapiens E-MAP-115 mRNA. ACCESSION X73882 NID g414114 KEYWORDS microtubule-associated protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3120) AUTHORS Masson,D. TITLE Direct Submission JOURNAL Submitted (06-JUL-1993) D. Masson, University of Geneva, Dept de Biologie Cellulaire, Sciences III, 30 Quai Ernest-Ansermet, 1211 Geneve 4, SWITZERLAND REFERENCE 2 (bases 1 to 3120) AUTHORS Masson,D. and Kreis,T.E. TITLE Identification and molecular characterization of E-MAP-115, a novel microtubule-associated protein predominantly expressed in epithelial cells JOURNAL J. Cell Biol. 123 (2), 357-371 (1993) MEDLINE 94012982 FEATURES Location/Qualifiers source 1..3120 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="HeLa" /cell_line="HeLa" /clone_lib="pUEX HeLa cDNA library" gene 101..2350 /gene="E-MAP-115" CDS 101..2350 /gene="E-MAP-115" /codon_start=1 /product="microtubule associated protein" /db_xref="PID:g414115" /translation="MAELGAGGDGHRGGDGAVRSETAPDSYKVQDKKNASSRPASAIS GQNNNHSGNKPDPPPVLRVDDRQRLARERREEREKQLAAREIVWLEREERARQHYEKH LEERKKRLEEQRQKEERRRAAVEEKRRQRLEEDKERHEAVVRRTMERSQKPKQKHNRW SWGGSLHGSPSIHSADPDRRSVSTMNLSKYVDPVISKRLSSSSATLLNSPDRARRLQL SPWESSVVNRLLTPTHSFLARSKSTAALSGEAASCSPIIMPYKAAHSRNSMDRPKLFV TPPEGSSRRRIIHGTASYKKERERENVLFLTSGTRRAVSPSNPKARQPARSRLWLPSK SLPHLPGTPRPTSSLPPGSVKAAPAQVRPPSPGNIRPVKREVKVEPEKKDPEKEPQKV ANEPSLKGRAPLVKVEEATVEERTPAEPEVGPAAPAMAPAPASAPAPASAPAPAPVPT PAMVSAPSSTVNASASVKTSAGTTDPEEATRLLAEKRRLAREQREKEERERREQEELE RQKREELAQRVAEERTTRREEESRRLEAEQAREKEEQLQRQAEERALREREEAERAQR QKEEEARVREEAERVRQEREKHFQREEQERLERKKRLEEIMKRTRRTEATDKKTSDQR NGDIAKGALTGGTEVSALPCTTNAPGNGKPVGSPHVVTSHQSKVTVESTPDLEKQPNE NGVSVQNENFEEIINLPIGSKPSRLDVTNSESPEIPLNPILAFDDEGTLGPLPQVDGV QTQQTAEVI" BASE COUNT 948 a 734 c 797 g 641 t ORIGIN 1 gcgctcacct gtctgggccg ctggcctggg aggcgggggc cggcgggagc caagccgagg 61 aaagggcgga gcggctctcc gggcgcgtca tcggagcacc atggcggagc taggagctgg 121 cggcgacggc cacaggggcg gcgacggcgc agtgcgaagc gaaacagcac ccgacagcta 181 caaagtgcaa gataagaaaa atgcctccag ccgccctgcc tctgcaattt caggacaaaa 241 taacaaccac tcaggaaata aaccagaccc tccgcctgtg ttacgtgttg atgaccggca 301 gcggctggcc cgggagcgac gtgaggaacg ggagaaacag ctagctgcaa gagaaatagt 361 gtggttagaa agagaagagc gagccaggca gcactacgag aagcacctgg aagagcggaa 421 gaagaggttg gaggagcaga ggcagaagga ggagcggagg agggctgctg tggaggagaa 481 gcggaggcag agacttgagg aggacaaaga acgccacgaa gctgttgtac ggcgcacaat 541 ggaaaggagc cagaagccaa aacagaagca taaccgttgg tcgtggggag gctctctcca 601 tgggagccct agcatccaca gtgcagatcc agacaggcgg tcagtttcca ccatgaatct 661 ttcgaaatat gttgatcccg tcattagcaa gcggctctcc tcttcatctg caactttact 721 aaattctcca gatagagctc gccgcctgca gctcagccca tgggagagca gcgttgttaa 781 cagactcctg acgcccacac attcgttcct ggccagaagt aaaagcacag ctgccttgtc 841 tggagaagca gcatcttgca gccccatcat catgccctac aaagctgcac actctagaaa 901 ttcgatggat cgaccaaaac tctttgtaac accacctgag ggctcttctc gcaggaggat 961 cattcatggc acagcgagct ataaaaaaga aagagagaga gaaaatgtac tcttcctcac 1021 atctggcacc cgaagggctg tatctccatc taatcccaaa gcaagacaac cagctcgctc 1081 ccgactttgg cttccgtcca agtctcttcc tcatttgcct ggcacaccca gaccgacatc 1141 ctccttgcca cccggctcag tcaaagctgc tcctgctcag gtccggcccc catcccccgg 1201 caacatccgc cctgtcaaga gggaagtcaa agtggagcct gagaagaaag atcctgagaa 1261 ggaacctcag aaagttgcca atgagccctc actaaagggc agagcacctt tagtgaaggt 1321 agaagaagcc acagttgaag agcggacacc tgctgaacca gaagttggcc ctgctgctcc 1381 agccatggcc ccagctccag cctcggcccc agctccagcc tcggccccag ctccagcccc 1441 ggtccccacc ccagccatgg tctcagcccc gtcatccact gtgaatgcca gtgcttctgt 1501 taagacttct gcaggcacca ccgacccaga ggaggccaca aggcttctag ctgagaagag 1561 gcggctggcc cgagagcaga gagaaaagga agaaagggag aggagggagc aggaagagct 1621 tgaaagacaa aagagagagg aattggctca acgtgtggct gaagagagga cgactcgccg 1681 tgaggaggag tcgcgcaggc tggaagccga gcaggcccgg gagaaggagg agcagctgca 1741 gcggcaggcg gaggagcggg cgctgcgcga gcgggaggag gcagagcgcg cccagaggca 1801 gaaagaagaa gaagctcgcg ttcgtgaaga agcagagagg gtccggcagg aacgagagaa 1861 gcatttccag agagaagagc aagagcgcct ggagagaaag aagcgacttg aggagattat 1921 gaaaagaacc aggagaacag aagctacaga taagaaaacc agtgatcaga gaaacggtga 1981 tatagccaag ggagctctca ctggaggaac agaggtgtct gcacttccat gtacaacaaa 2041 cgctccggga aatggaaagc cagttggcag cccacatgtg gttacctcac accagtcaaa 2101 agtgacagtg gagagcactc ccgatttgga aaaacaacca aatgaaaatg gtgtatctgt 2161 tcagaatgaa aattttgaag aaattataaa cttacccatt ggatctaaac catccagatt 2221 agatgtcacc aacagtgaga gcccagaaat tcctttgaat ccaattttgg cctttgatga 2281 tgaagggaca cttgggcccc tgcctcaggt agatggtgtt cagacacagc agactgcaga 2341 agttatatga gtgtttcttc tgaagaacca aagctgaaat ttaatgagaa tttctacaat 2401 taatggaatt cctttcctgc tataaaggag catcccctcc acccgttttc tagagttctt 2461 gaccatcatt ttgaaaagat ttattaaaac tagctaaaga caacagactg gatagctttt 2521 ctaataattt tcatcaatag gaaaaaagaa atacgtctca ttcttcaata ctttaaaatg 2581 gctttttcca gtgtgctcct tcttagcaat caatattttt ctgcattctt taaaagacaa 2641 gagaatttgg ttataaaaga aatgggctga ctaggcatga tttttttggt cttaaaaggc 2701 ttaacatgta aaattggcaa aaaaaatttt ttacctttta taatacttga aaaataagta 2761 cctctttgtt ctacaagtag aatgaatagg agaagagttt aagcctgttt ttttaaaata 2821 ttattgcaaa gagctctatt tgtagaagca aattataggc agattaccag gttcttataa 2881 atacagcttg tacatggaca ttctgcaaac ccagctgtca catttttctt gcaactcctt 2941 ttgcaaaagc agactaaaat gttttaaaat gtgaaaaaac attatttttt caaagcaaga 3001 aaataattta ctgccctctt acataatgta tttataaagt ttttccagat aaactaatca 3061 aataaattag aataatgtga caacattaca aatttaattt gccatggtac cttcgttgcc // LOCUS HSEMR1 3149 bp RNA PRI 20-APR-1995 DEFINITION H.sapiens mRNA for EMR1 hormone receptor. ACCESSION X81479 NID g784993 KEYWORDS emr1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3149) AUTHORS Baud,V., Chissoe,S.L., Viegas-Pequignot,E., Diriong,S., N'Guyen,V.C., Roe,B.A. and Lipinski,M. TITLE EMR1, an unusual member in the family of hormone receptors with seven transmembrane segments JOURNAL Genomics 26 (2), 334-344 (1995) MEDLINE 95324926 REFERENCE 2 (bases 1 to 3149) AUTHORS Lipinski,M. TITLE Direct Submission JOURNAL Submitted (05-SEP-1994) M. Lipinski, INST. LAB. DE BIOL.DE TUMEURS HUMAINES, CNRS URA 1156, INST. GUSTAVE ROUSSY, 94805 VILLEJUIF CEDEX, FRANCE FEATURES Location/Qualifiers source 1..3149 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="germ line" /tissue_type="neuroectodermal tumour" /cell_type="Ewing cell-line" /cell_line="IARC-EW11" /sub_clone="36EW11 and 36B" /map="19p" gene 39..2699 /gene="EMR1" CDS 39..2699 /gene="EMR1" /codon_start=1 /db_xref="PID:g784994" /translation="MRGFNLLLFWGCCVMHSWEGHIRPTRKPNTKGNNCRDSTLCPAY ATCTNTVDSYYCTCKQGFLSSNGQNHFKDPGVRCKDIDECSQSPQPCGPNSSCKNLSG RYKCSCLDGFSSPTGNDWVPGKPGNFSCTDINECLTSRVCPEHSDCVNSMGSYSCSCQ VGFISRNSTCEDVNECADPRACPEHATCNNTVGNYSCFCNPGFESSSGHLSCQGLKAS CEDIDECTEMCPINSTCTNTPGSYFCTCHPGFAPSSGQLNFTDQGVECRDIDECRQDP STCGPNSICTNALGSYSCGCIVGFHPNPEGSQKDGNFSCQRVLFKCKEDVIPDNKQIQ QCQEGTAVKPAYVSFCAQINNIFSVLDKVCENKTTVVSLKNTTESFVPVLKQISMWTK FTKEETSSLATVFLESVESMTLASFWKPSANVTPAVRAEYLDIESKVINKECSEENVT LDLVAKGDKMKIGCSTIEESESTETTGVAFVSFVGMESVLNERFFQDHQAPLTTSEIK LKMNSRVVGGIMTGEKKDGFSDPIIYTLENVQPKQKFERPICVSWSTDVKGGRWTSFG CVILEASETYTICSCNQMANLAVIMASGELTMDFSLYIISHVGIIISLVCLVLAIATF LLCRSIRNHNTYLHLHLCVCLLLAKTLFLAGIHKTDNKTGCAIIAGFLHYLFLACFFW MLVEAVILFLMVRNLKVVNYFSSRNIKMLHICAFGYGLPMLVVVISASVQPQGYGMHN RCWLNTETGFIWSFLGPVCTVIVINSLLLTWTLWILRQRLSSVNAEVSTLKDTRLLTF KAFAQLFILGCSWVLGIFQIGPVAGVMAYLFTIINSLQGAFIFLIHCLLNGQVREEYK RWITGKTKPSSQSQTSRILLSSMPSASKTG" polyA_signal 3082..3087 BASE COUNT 811 a 789 c 739 g 810 t ORIGIN 1 ctaaagtttt tttctttgaa tgacagaact acagcataat gcgtggcttc aacctgctcc 61 tcttctgggg atgttgtgtt atgcacagct gggaagggca cataagaccc acacggaaac 121 caaacacaaa gggtaataac tgtagagaca gtaccttgtg cccagcttat gccacctgca 181 ccaatacggt ggacagttac tattgcactt gcaaacaagg cttcctgtcc agcaatgggc 241 aaaatcactt caaggatcca ggagtgcgat gcaaagatat tgatgaatgt tctcaaagcc 301 cccagccctg tggtcctaac tcatcctgca aaaacctgtc agggaggtac aagtgcagct 361 gtttagatgg tttctcttct cccactggaa atgactgggt cccaggaaag ccgggcaatt 421 tctcctgtac tgatatcaat gagtgcctca ccagcagggt ctgccctgag cattctgact 481 gtgtcaactc catgggaagc tacagttgca gctgtcaagt tggattcatc tctagaaact 541 ccacctgtga agacgtgaat gaatgtgcag atccaagagc ttgcccagag catgcaactt 601 gtaataacac tgttggaaac tactcttgtt tctgcaaccc aggatttgaa tccagcagtg 661 gccacttgag ttgccagggt ctcaaagcat cgtgtgaaga tattgatgaa tgcactgaaa 721 tgtgccccat caattcaaca tgcaccaaca ctcctgggag ctacttttgc acctgccacc 781 ctggctttgc accaagcagt ggacagttga atttcacaga ccaaggagtg gaatgtagag 841 atattgatga gtgccgccaa gatccatcaa cctgtggtcc taattctatc tgcaccaatg 901 ccctgggctc ctacagctgt ggctgcattg taggctttca tcccaatcca gaaggctccc 961 agaaagatgg caacttcagc tgccaaaggg ttctcttcaa atgtaaggaa gatgtgatac 1021 ccgataataa gcagatccag caatgccaag agggaaccgc agtgaaacct gcatatgtct 1081 ccttttgtgc acaaataaat aacatcttca gcgttctgga caaagtgtgt gaaaataaaa 1141 cgaccgtagt ttctctgaag aatacaactg agagctttgt ccctgtgctt aaacaaatat 1201 ccatgtggac taaattcacc aaggaagaga cgtcctccct ggccacagtc ttcctggaga 1261 gtgtggaaag catgacactg gcatcttttt ggaaaccctc agcaaatgtc actccggctg 1321 ttcgggcgga atacttagac attgagagca aagttatcaa caaagaatgc agtgaagaga 1381 atgtgacgtt ggacttggta gccaaggggg ataagatgaa gatcgggtgt tccacaattg 1441 aggaatctga atccacagag accactggtg tggcttttgt ctcctttgtg ggcatggaat 1501 cggttttaaa tgagcgcttc ttccaagacc accaggctcc cttgaccacc tctgagatca 1561 agctgaagat gaattctcga gtcgttgggg gcataatgac tggagagaag aaagacggct 1621 tctcagatcc aatcatctac actctggaga acgttcagcc aaagcagaag tttgagaggc 1681 ccatctgtgt ttcctggagc actgatgtga agggtggaag atggacatcc tttggctgtg 1741 tgatcctgga agcttctgag acatatacca tctgcagctg taatcagatg gcaaatcttg 1801 ccgttatcat ggcgtctggg gagctcacga tggacttttc cttgtacatc attagccatg 1861 taggcattat catctccttg gtgtgcctcg tcttggccat cgccaccttt ctgctgtgtc 1921 gctccatccg aaatcacaac acctacctcc acctgcacct ctgcgtgtgt ctcctcttgg 1981 cgaagactct cttcctcgcc ggtatacaca agactgacaa caagacgggc tgcgccatca 2041 tcgcgggctt cctgcactac cttttccttg cctgcttctt ctggatgctg gtggaggctg 2101 tgatactgtt cttgatggtc agaaacctga aggtggtgaa ttacttcagc tctcgcaaca 2161 tcaagatgct gcacatctgt gcctttggtt atgggctgcc gatgctggtg gtggtgatct 2221 ctgccagtgt gcagccacag ggctatggaa tgcataatcg ctgctggctg aatacagaga 2281 cagggttcat ctggagtttc ttggggccag tttgcacagt tatagtgatc aactcccttc 2341 tcctgacctg gaccttgtgg atcctgaggc agaggctttc cagtgttaat gccgaagtct 2401 caacgctaaa agacaccagg ttactgacct tcaaggcctt tgcccagctc ttcatcctgg 2461 gctgctcctg ggtgctgggc atttttcaga ttggacctgt ggcaggtgtc atggcttacc 2521 tgttcaccat catcaacagc ctgcaggggg ccttcatctt cctcatccac tgtctgctca 2581 acggccaggt acgagaagaa tacaagaggt ggatcactgg gaagacgaag cccagctccc 2641 agtcccagac ctcaaggatc ttgctgtcct ccatgccatc cgcttccaag acgggttaaa 2701 gcctttcttg ctttcaaata tgctatggag ccacagttga ggacagtagt ttcctgcagg 2761 agcctaccct gaaatctctt ctcagcttaa catggaaatg aggatcccac cagccccaga 2821 accctctggg gaagaatgtt gggggccgtc ttcctgtggt tgtatgcact gatgagaaat 2881 cagacgtttc tgctccaaac gaccatttta tcttcgtgct ctgcaacttc ttcaattcca 2941 gagtttctga gaacagaccc aaattcaatg gcatgaccaa gaacacctgg ctaccatttt 3001 gttttctcct gcccttgttg gtgcatggtt ctaagcgtgc ccctccagcg cctatcatac 3061 gcctgacaca gagaacctct caataaatga tttgtcgcct gtctgactga tttaccctaa 3121 aaaaaaaaaa aaaaaaaaaa aaaaaaaaa // LOCUS HSENDCE 2720 bp RNA PRI 31-MAR-1995 DEFINITION H.sapiens mRNA for endothelin-converting-enzyme 1. ACCESSION Z35307 NID g535181 KEYWORDS endothelin-converting-enzyme 1. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2720) AUTHORS Schmidt,M., Kroeger,B., Jacob,E., Seulberger,H., Subkowski,T., Otter,R., Meyer,T., Schmalzing,G. and Hillen,H. TITLE Molecular characterization of human and bovine Endothelin-Converting-Enzyme (ECE-1) JOURNAL Unpublished REFERENCE 2 (bases 1 to 2720) AUTHORS Kroeger,B. TITLE Direct Submission JOURNAL Submitted (21-JUL-1994) Burkhard Kroeger, Department of Pharmaceutical Research, BASF, Aktiengesellschaft, Carl-Bosch-Str., Ludwigshafen, D-67056, Germany REFERENCE 3 (bases 1 to 2720) AUTHORS Schmidt,M., Kroger,B., Jacob,E., Seulberger,H., Subkowski,T., Otter,R., Meyer,T., Schmalzing,G. and Hillen,H. TITLE Molecular characterization of human and bovine endothelin converting enzyme (ECE-1) JOURNAL FEBS Lett. 356 (2-3), 238-243 (1994) MEDLINE 95104423 FEATURES Location/Qualifiers source 1..2720 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="Placenta" CDS 38..2299 /codon_start=1 /product="endothelin-converting-enzyme 1" /db_xref="PID:g535182" /db_xref="SWISS-PROT:P42892" /translation="MSTYKRATLDEEDLVDSLSEGDAYPNGLQVNFHSPRSGQRCWAA RTQVEKRLVVLVVLLAAGLVACLAALGIQYQTRSPSVCLSEACVSVTSSILSSMDPTV DPCHDFFSYACGGWIKANPVPDGHSRWGTFSNLWEHNQAIIKHLLENSTASVSEAERK AQVYYRACMNETRIEELRAKPLMELIERLGGWNITGPWAKDNFQDTLQVVTAHYRTSP FFSVYVSADSKNSNSNVIQVDQSGLGLPSRDYYLNKTENEKVLTGYLNYMVQLGKLLG GGDEEAIRPQMQQILDFETALANITIPQEKRRDEELIYHKVTAAELQTLAPAINWLPF LNTIFYPVEINESEPIVVYDKEYLEQISTLINTTDRCLLNNYMIWNLVRKTSSFLDQR FQDADEKFMEVMYGTKKTCLPRWKFCVSDTENNLGFALGPMFVKATFAEDSKSIATEI ILEIKKAFEESLSTLKWMDEETRKSAKEKADAIYNMIGYPNFIMDPKELDKVFNDYTA VPDLYFENAMRFFNFSWRVTADQLRKAPNRDQWSMTPPMVNAYYSPTKNEIVFPAGIL QAPFYTRSSPKALNFGGIGVVVGHELTHAFDDQGREYDKDGNLRPWWKNSSVEAFKRQ TECMVEQYSNYSVNGEPVNGRHTLGENIADNGGLKAAYRAYQNWVKKNGAEHSLPTLG LTNNQLFFLGFAQVWCSVRTPESSHEGLITDPHSPSRFRVIGSLSNSKEFSEHFRCPP GSPMNPPHKCEVW" BASE COUNT 651 a 808 c 742 g 519 t ORIGIN 1 cgcccccccg gtgtccgccc tgctgtcggc gctggggatg tcgacgtaca agcgggccac 61 gctggacgag gaggacctgg tggactcgct ctccgagggc gacgcatacc ccaacggcct 121 gcaggtgaac ttccacagcc cccggagtgg ccagaggtgc tgggctgcac ggacccaggt 181 ggagaagcgg ctggtggtgt tggtggtact tctggcggca ggactggtgg cctgcttggc 241 agcactgggc atccagtacc agacaagatc cccctctgtg tgcctgagcg aagcttgtgt 301 ctcagtgacc agctccatct tgagctccat ggaccccaca gtggacccct gccatgactt 361 cttcagctac gcctgtgggg gctggatcaa ggccaaccca gtccctgatg gccactcacg 421 ctgggggacc ttcagcaacc tctgggaaca caaccaagca atcatcaagc acctcctcga 481 aaactccacg gccagcgtga gcgaggcaga gagaaaggcg caagtatact accgtgcgtg 541 catgaacgag accaggatcg aggagctcag ggccaaacct ctaatggagt tgattgagag 601 gctcgggggc tggaacatca caggtccctg ggccaaggac aacttccagg acaccctgca 661 ggtggtcacc gcccactacc gcacctcacc cttcttctct gtctatgtca gtgccgattc 721 caagaactcc aacagcaacg tgatccaggt ggaccagtct ggcctgggct tgccctcgag 781 agactattac ctgaacaaaa ctgaaaacga gaaggtgctg accggatatc tgaactacat 841 ggtccagctg gggaagctgc tgggcggcgg ggacgaggag gccatccggc cccagatgca 901 gcagatcttg gactttgaga cggcactggc caacatcacc atcccacagg agaagcgccg 961 tgatgaggag ctcatctacc acaaagtgac ggcagccgag ctgcagacct tggcacccgc 1021 catcaactgg ttgccttttc tcaacaccat cttctacccc gtggagatca atgaatccga 1081 gcctattgtg gtctatgaca aggaatacct tgagcagatc tccactctca tcaacaccac 1141 cgacagatgc ctgctcaaca actacatgat ctggaacctg gtgcggaaaa caagctcctt 1201 ccttgaccag cgctttcagg acgccgatga gaagttcatg gaagtcatgt acgggaccaa 1261 gaagacctgt cttcctcgct ggaagttttg cgtgagtgac acagaaaaca acctgggctt 1321 tgcgttgggc cccatgtttg tcaaagcaac cttcgccgag gacagcaaga gcatagccac 1381 cgagatcatc ctggagatta agaaggcatt tgaggaaagc ctgagcaccc tgaagtggat 1441 ggatgaggaa acccgaaaat cagccaagga aaaggccgat gccatctaca acatgatagg 1501 ataccccaac ttcatcatgg atcccaagga gctggacaaa gtgtttaatg actacactgc 1561 agttccagac ctctactttg aaaatgccat gcggtttttc aacttctcat ggagggtcac 1621 tgccgatcag ctcaggaaag cccccaacag agatcagtgg agcatgaccc cgcccatggt 1681 gaacgcctac tactcgccca ccaagaatga gattgtgttt ccggccggga tcctgcaggc 1741 accattctac acacgctcct cacccaaggc cttaaacttt ggtggcatag gtgtcgtcgt 1801 gggccatgag ctgactcatg cttttgatga tcaaggacgg gagtatgaca aggacgggaa 1861 cctccggcca tggtggaaga actcatccgt ggaggccttc aagcgtcaga ccgagtgcat 1921 ggtagagcag tacagcaact acagcgtgaa cggggagccg gtgaacgggc ggcacaccct 1981 gggggagaac atcgccgaca acgggggtct caaggcggcc tatcgggctt accagaactg 2041 ggtgaagaag aacggggctg agcactcgct ccccaccctg ggcctcacca ataaccagct 2101 cttcttcctg ggctttgcac aggtctggtg ctccgtccgc acacctgaga gctcccacga 2161 aggcctcatc accgatcccc acagcccctc tcgcttccgg gtcatcggct ccctctccaa 2221 ttccaaggag ttctcagaac acttccgctg cccacctggc tcacccatga acccgcctca 2281 caagtgcgaa gtctggtaag gacgaagcgg agagagccaa gacggaggag gggaaggggc 2341 tgaggacgag acccccatcc agcctccagg gcattgctca gcccgcttgg ccacccgggg 2401 ccctgcttcc tcacactggc gggttttcag ccggaaccga gcccatggtg ttggctctca 2461 acgtgacccg cagtctgatc ccctgtgaag agccggacat cccaggcaca cgtgtgcgcc 2521 accttcagca ggcattcggg tgctgggctg gtggctcatc aggcctgggc cccacactga 2581 caagcgccag atacgccaca aataccactg tgtcaaatgc tttcaagata tatttttggg 2641 gaaactattt tttaaacact gtggaataca ctggaaatct tcagggaaaa acacatttaa 2701 acactttttt ttttaagccc // LOCUS HSENDOG 3073 bp RNA PRI 17-SEP-1993 DEFINITION H.sapiens end mRNA for endoglin. ACCESSION X72012 NID g402206 KEYWORDS END gene; endoglin; TGF beta binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3073) AUTHORS Bellon,T., Corbi,A., Lastres,P., Cales,C., Cebrian,M., Vera,S., Cheifetz,S., Massague,J., Letarte,M. and Bernabeu,C. TITLE Identification and expression of two forms of the human transforming growth factor-beta-binding protein endoglin with distinct cytoplasmic regions JOURNAL Eur. J. Immunol. 23 (9), 2340-2345 (1993) MEDLINE 93380509 REFERENCE 2 (bases 1 to 3073) AUTHORS Bernabeu,C. TITLE Direct Submission JOURNAL Submitted (19-MAY-1993) C. Bernabeu, Centro de Investigaciones Biologicas, C.S.I.C., Velazquez, 144, 28006 Madrid, SPAIN REFERENCE 3 (bases 1 to 3073) AUTHORS Bellon,T. JOURNAL Thesis (1993) Universidad Autonoma, Madrid, SPAIN COMMENT Related sequence: J05481. FEATURES Location/Qualifiers source 1..3073 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HL-60" /clone_lib="lambda gt10" /clone="3.3" /chromosome="9q34qter" sig_peptide 282..356 /gene="END" gene 282..2159 /gene="END" CDS 282..2159 /gene="END" /codon_start=1 /product="endoglin" /db_xref="PID:g402207" /translation="MDRGTLPLAVALLLASCSLSPTSLAETVHCDLQPVGPERGEVTY TTSQVSKGCVAQAPNAILEVHVLFLEFPTGPSQLELTLQASKQNGTWPREVLLVLSVN SSVFLHLQALGIPLHLAYNSSLVTFQEPPGVNTTELPSFPKTQILEWAAERGPITSAA ELNDPQSILLRLGQAQGSLSFCMLEASQDMGRTLEWRPRTPALVRGCHLEGVAGHKEA HILRVLPGHSAGPRTVTVKVELSCAPGDLDAVLILQGPPYVSWLIDANHNMQIWTTGE YSFKIFPEKNIRGFKLPDTPQGLLGEARMLNASIVASFVELPLASIVSLHASSCGGRL QTSPAPIQTTPPKDTCSPELLMSLIQTKCADDAMTLVLKKELVAHLKCTITGLTFWDP SCEAEDRGDKFVLRSAYSSCGMQVSASMISNEAVVNILSSSSPQRKKVHCLNMDSLSF QLGLYLSPHFLQASNTIEPGQQSFVQVRVSPSVSEFLLQLDSCHLDLGPEGGTVELIQ GRAAKGNCVSLLSPSPEGDPRFSFLLHFYTVPIPKTGTLSCTVALRPKTGSQDQEVHR TVFMRLNIISPDLSGCTSKGLVLPAVLGITFGAFLIGALLTAALWYIYSHTREYPRPP Q" BASE COUNT 607 a 1094 c 841 g 531 t ORIGIN 1 cctggggcca ggactgctgc tgtcactgcc atccattgga gcccagcacc ccctccccgc 61 ccatccttcg gacagcaact ccagcccagc cccgcgtccc tgtgtccact tctcctgacc 121 cctcggccgc caccccagaa ggctggagca gggacgccgt cgctccggcc gcctgctccc 181 ctcgggtccc cgtgcgagcc cacgccggcc ccggtgcccg cccgcagccc tgccactgga 241 cacaggataa ggcccagcgc acaggccccc acgtggacag catggaccgc ggcacgctcc 301 ctctggctgt tgccctgctg ctggccagct gcagcctcag ccccacaagt cttgcagaaa 361 cagtccattg tgaccttcag cctgtgggcc ccgagagggg cgaggtgaca tataccacta 421 gccaggtctc gaagggctgc gtggctcagg cccccaatgc catccttgaa gtccatgtcc 481 tcttcctgga gttcccaacg ggcccgtcac agctggagct gactctccag gcatccaagc 541 aaaatggcac ctggccccga gaggtgcttc tggtcctcag tgtaaacagc agtgtcttcc 601 tgcatctcca ggccctggga atcccactgc acttggccta caattccagc ctggtcacct 661 tccaagagcc cccgggggtc aacaccacag agctgccatc cttccccaag acccagatcc 721 ttgagtgggc agctgagagg ggccccatca cctctgctgc tgagctgaat gacccccaga 781 gcatcctcct ccgactgggc caagcccagg ggtcactgtc cttctgcatg ctggaagcca 841 gccaggacat gggccgcacg ctcgagtggc ggccgcgtac tccagccttg gtccggggct 901 gccacttgga aggcgtggcc ggccacaagg aggcgcacat cctgagggtc ctgccgggcc 961 actcggccgg gccccggacg gtgacggtga aggtggaact gagctgcgca cccggggatc 1021 tcgatgccgt cctcatcctg cagggtcccc cctacgtgtc ctggctcatc gacgccaacc 1081 acaacatgca gatctggacc actggagaat actccttcaa gatctttcca gagaaaaaca 1141 ttcgtggctt caagctccca gacacacctc aaggcctcct gggggaggcc cggatgctca 1201 atgccagcat tgtggcatcc ttcgtggagc taccgctggc cagcattgtc tcacttcatg 1261 cctccagctg cggtggtagg ctgcagacct cacccgcacc gatccagacc actcctccca 1321 aggacacttg tagcccggag ctgctcatgt ccttgatcca gacaaagtgt gccgacgacg 1381 ccatgaccct ggtactaaag aaagagcttg ttgcgcattt gaagtgcacc atcacgggcc 1441 tgaccttctg ggaccccagc tgtgaggcag aggacagggg tgacaagttt gtcttgcgca 1501 gtgcttactc cagctgtggc atgcaggtgt cagcaagtat gatcagcaat gaggcggtgg 1561 tcaatatcct gtcgagctca tcaccacagc ggaaaaaggt gcactgcctc aacatggaca 1621 gcctctcttt ccagctgggc ctctacctca gcccacactt cctccaggcc tccaacacca 1681 tcgagccggg gcagcagagc tttgtgcagg tcagagtgtc cccatccgtc tccgagttcc 1741 tgctccagtt agacagctgc cacctggact tggggcctga gggaggcacc gtggaactca 1801 tccagggccg ggcggccaag ggcaactgtg tgagcctgct gtccccaagc cccgagggtg 1861 acccgcgctt cagcttcctc ctccacttct acacagtacc catacccaaa accggcaccc 1921 tcagctgcac ggtagccctg cgtcccaaga ccgggtctca agaccaggaa gtccatagga 1981 ctgtcttcat gcgcttgaac atcatcagcc ctgacctgtc tggttgcaca agcaaaggcc 2041 tcgtcctgcc cgccgtgctg ggcatcacct ttggtgcctt cctcatcggg gccctgctca 2101 ctgctgcact ctggtacatc tactcgcaca cgcgtgagta ccccaggccc ccacagtgag 2161 catgccgggc ccctccatcc acccggggga gcccagtgaa gcctctgagg gattgagggg 2221 ccctggcagg accctgacct ccgcccctgc ccccgctccc gctcccaggt tcccccagca 2281 agcgggagcc cgtggtggcg gtggctgccc cggcctcctc ggagagcagc agcaccaacc 2341 acagcatcgg gagcacccag agcaccccct gctccaccag cagcatggca tagccccggc 2401 cccccgcgct cgcccagcag gagagactga gcagccgcca gctgggagca ctggtgtgaa 2461 ctcaccctgg gagccagtcc tccactcgac ccagaatgga gcctgctctc cgcgcctacc 2521 cttcccgcct ccctctcaga ggcctgctgc cagtgcagcc actggcttgg aacaccttgg 2581 ggtccctcca ccccacagaa ccttcaaccc agtgggtctg ggatatggct gcccaggaga 2641 cagaccactt gccacgctgt tgtaaaaacc caagtccctg tcatttgaac ctggatccag 2701 cactggtgaa ctgagctggg caggaaggga gaacttgaaa cagattcagg ccagcccagc 2761 caggccaaca gcacctcccc gctgggaaga gaagagggcc cagcccagag ccacctggat 2821 ctatccctgc ggcctccaca cctgaacttg cctaactaac tggcagggga gacaggagcc 2881 tagcggagcc cagcctggga gcccagaggg tggcaagaac agtgggcgtt gggagcctag 2941 ctcctgccac atggagcccc ctctgccggt cgggcagcca gcagaggggg agtagccaag 3001 ctgcttgtcc tgggcctgcc cctgtgtatt caccaccaat aaatcagacc atgaaacctg 3061 aaaaaaaaaa aaa // LOCUS HSENDOGMR 1132 bp RNA PRI 20-AUG-1997 DEFINITION H.sapiens endonuclease G (ENDOG) mRNA. ACCESSION X79444 NID g1480384 KEYWORDS ENDOG gene; endonuclease G. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 388 to 1132) AUTHORS Tiranti,V., Rossi,E., Ruiz-Carrillo,A., Rossi,G., Rocchi,M., DiDonato,S., Zuffardi,O. and Zeviani,M. TITLE Chromosomal localization of mitochondrial transcription factor A (TCF6), single-stranded DNA-binding protein (SSBP), and endonuclease G (ENDOG), three human housekeeping genes involved in mitochondrial biogenesis JOURNAL Genomics 25 (2), 559-564 (1995) MEDLINE 95309925 REFERENCE 2 (bases 1 to 1132) AUTHORS Zeviani,M. TITLE Direct Submission JOURNAL Submitted (24-MAY-1994) M. Zeviani, Istituto Neurologico C. Besta, Via Celoria 11, 20133 Milan, ITALY REMARK Revised by [3] REFERENCE 3 (bases 1 to 1132) AUTHORS Zeviani,M. TITLE Direct Submission JOURNAL Submitted (18-APR-1996) M. Zeviani, Istituto Neurologico C. Besta, Via Celoria 11, 20133 Milan, ITALY REFERENCE 4 (bases 1 to 1132) AUTHORS Prats,E., Noel,M., Letourneau,J., Tiranti,V., Vaque,J., Debon,R., Zeviani,M., Cornudella,L. and Ruiz-Carrillo,A. TITLE Characterization and expression of the mouse endonuclease G gene JOURNAL DNA Cell Biol. 16 (9), 1111-1122 (1997) MEDLINE 97464492 FEATURES Location/Qualifiers source 1..1132 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /chromosome="9" /map="q34.1" /clone="33899" /clone_lib="Soares infant brain" /dev_stage="Infant female" /sex="female" /lab_host="DH10B (ampicillin resistant)" /tissue_type="brain" CDS 166..1059 /function="mitochondrial endonuclease" /note="high similarity with bovine endonuclease G" /codon_start=1 /product="endonuclease" /db_xref="PID:e236577" /db_xref="PID:g1480385" /translation="MRALRAGLTMASGAGLGAVVEGWRRRREDARAALGLLGRLPVLP VAAAAELPPVPGGPRGPGELAKYGLPGLAQLKSRESYVLCYDPRTRGALWVVEQLRPE RLRGDGDRRECDFREDDSVHAYHRATNVDYRGSGFDRGHLAAAANHRWSQKAMDDTFY LSKVAPQVTHLNQNAWNNLEKYSRSLTRSYQNVYVCTGPLFLPRTEADGKSYVKYQVI GKNHVAVPTHFFKVLILEAAGGQIELRTYVMPNAPVDEAIPLERFLVPIESIERASGL LFVPNILARAGSLKAITAGSK" BASE COUNT 194 a 362 c 398 g 178 t ORIGIN 1 ggcacgaggc tgggttccga ggcccaagcc cttggcagtg tttgtgagtg gaagggaggt 61 cacgctatcg tccgcggccc cagcagccct gtgccctcgt tggatcccgc gacgcggctc 121 ctttaagagc tcgcgggtcg cccgccgcta ggtcgctccc cggccatgcg ggcgctgcgg 181 gccggcctga ccatggcgtc gggcgcgggg ctgggtgcgg tcgtcgaggg ctggcggcgg 241 cggcgggagg acgcgcgggc ggcgctggga ctgctgggcc ggctgcccgt gctgcccgtg 301 gcggcggcag ccgagttgcc ccctgtgccc gggggacccc gcggcccggg cgagttggcc 361 aagtacgggc tgccggggct ggcgcagctc aagagccgcg agtcgtacgt gctgtgctac 421 gacccgcgca cccgcggcgc gctctgggtg gtggagcagc tgcgacccga gcgtctccgc 481 ggcgacggcg accggcgcga gtgcgacttc cgcgaggacg actcggtgca cgcgtaccac 541 cgtgccacca acgtcgacta ccgcggcagt ggcttcgacc gcggtcacct ggccgccgcc 601 gccaaccacc gctggagcca gaaggccatg gacgacacgt tctacctgag caaagtcgcg 661 ccccaggtga cccacctcaa ccagaatgcc tggaacaacc tggagaaata tagccgcagc 721 ttgacccgca gctaccaaaa cgtctatgtc tgcacagggc cactcttcct gcccaggaca 781 gaggctgatg ggaaatccta cgtaaagtac caggtcatcg gcaagaacca cgtggcagtg 841 cccacacact tcttcaaggt gctgatcctg gaggcagcag gtggccaaat tgagctccgc 901 acctacgtga tgcccaacgc acctgtggat gaggccatcc cactggagcg cttcctggtg 961 cccatcgaga gcattgagcg ggcttcgggg ctgctctttg tgccaaacat cctggcgcgg 1021 gcaggcagcc tcaaggccat cacggcgggc agtaagtgag ggtggagccc agtgagactg 1081 tgggtgtgtg caggccgggg agtattaaag gtggtgattt ttggaaaaaa aa // LOCUS HSENO3BE 1390 bp RNA PRI 12-SEP-1993 DEFINITION Human ENO3 mRNA for beta-enolase (EC 4.2.1.11). ACCESSION X16504 NID g31169 KEYWORDS beta-enolase; ENO3 gene; enolase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1390) AUTHORS Day,I.N.M. TITLE Direct Submission JOURNAL Submitted (12-SEP-1989) Day I.N.M., University of Clinical Biochemistry, South Laboratory and Pathology Block, Level D, Southampton General Hospital, Tremona Road, Southampton S09 4XY, UK REFERENCE 2 (bases 1 to 1390) AUTHORS Peshavaria,M., Hinks,L.J. and Day,I.N. TITLE Structure of human muscle (beta) enolase mRNA and protein deduced from a genomic clone JOURNAL Nucleic Acids Res. 17 (21), 8862 (1989) MEDLINE 90067857 COMMENT Data kindly reviewed (02-JAN-1990) by Day I.N.M. FEATURES Location/Qualifiers source 1..1390 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="peripheral blood leucocytes" /clone_lib="human genomic lambda-2001" /clone="lambda-HGM1" CDS 1..1305 /note="beta-enoiase (AA 1-434)" /codon_start=1 /db_xref="PID:g31170" /db_xref="SWISS-PROT:P13929" /translation="MAMQKIFAREILDSRGNPTVEVDLHTAKGRFRAAVPSGASTGIY EALELRDGDKGRYLGKGVLKAVENINNTLGPALLQKKLSVVDQEKVDKFMIELDGTEN KSKFGANAILGVSLAVCKAGAAEKGVPLYRHIADLAGNPDLILPVPAFNVINGGSHAG NNLAMQEFMILPVGASSFKEAMRIGAEVYHHLKGVIKAKYGKDATNVGDEGGFAPNIL ENNEALELLKTAIQAAGYPDKVVIGMDVAASEFYRNGKYDLDFKSPDDPARHITGEKL GELYKSFIKNYPVVSIEDPFDQDDWATWTSFLSGVNIQIVGDDLTVTNPKRIAQAVEK KACNCLLLKVNQIGSVTESIQACKLAQSNGWGVMVSHRSGETEDTFIADLVVGLCTGQ IKTGAPCRSERLAKYNQLMRIEEALGDKAIFAGRKFRNPKAK" misc_feature 1366..1371 /note="polyA signal" BASE COUNT 339 a 365 c 415 g 271 t ORIGIN 1 atggccatgc agaaaatctt tgcccgggaa atcttggact ccaggggcaa ccccacggtg 61 gaggtggacc tgcacacggc caagggccga ttccgagcag ctgtgcccag tggggcttcc 121 acgggtatct atgaggctct ggaactaaga gacggagaca aaggccgcta cctggggaaa 181 ggagtcctga aggctgtgga gaacatcaac aatactctgg gccctgctct gctgcaaaag 241 aaactaagcg ttgttgatca agaaaaagtt gacaaattta tgattgagct agatgggacc 301 gagaataagt ccaagtttgg ggccaatgcc atcctgggcg tgtccttggc cgtgtgtaag 361 gcgggagcag ctgagaaggg ggtccccctg taccgccaca tcgcagatct cgctgggaac 421 cctgacctca tactcccagt gccagccttc aatgtgatca acgggggctc ccatgctgga 481 aacaacttgg ccatgcagga gttcatgatt ctgcctgtgg gagccagctc cttcaaggaa 541 gccatgcgca ttggcgccga ggtctaccac cacctcaagg gggtcatcaa ggccaagtat 601 gggaaggatg ccaccaatgt gggtgatgaa ggtggcttcg cacccaacat cctggagaac 661 aatgaggccc tggagctgct gaagacggcc atccaggcgg ctggttaccc agacaaggtg 721 gtgatcggca tggatgtggc agcatctgag ttctatcgca atgggaagta cgatcttgac 781 ttcaagtcgc ctgatgatcc cgcacggcac atcactgggg agaagctcgg agagctgtat 841 aagagcttta tcaagaacta tcctgtggtc tccatcgaag acccctttga ccaggatgac 901 tgggccactt ggacctcctt cctctcgggg gtgaacatcc agattgtggg ggatgacttg 961 acagtcacca accccaagag gattgcccag gccgttgaga agaaggcctg caactgtctg 1021 ctgctgaagg tcaaccagat cggctcggtg accgaatcga tccaggcgtg caaactggct 1081 cagtctaatg gctggggggt gatggtgagc caccgctcag gggagactga ggacacattc 1141 attgctgacc ttgtggtggg gctctgcaca ggacagatca agactggcgc cccctgccgc 1201 tcggagcgtc tggccaaata caaccaactc atgaggatcg aggaggctct tggggacaag 1261 gcaatctttg ctggacgcaa gttccgtaac ccgaaggcca agtgagaagc tggaggctcc 1321 aggactccac tggacagacc caggtcttcc agacctgctt cctgaaataa acactggtgc 1381 caaccaagac // LOCUS HSEP2PR 1077 bp RNA PRI 19-JAN-1995 DEFINITION H.sapiens mRNA for EP2 prostaglandin receptor. ACCESSION X83868 NID g633205 KEYWORDS prostaglandin E2 receptor; prostaglandin E2 receptor EP2 subtype. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1077) AUTHORS Oakley,C.J. JOURNAL Unpublished REFERENCE 2 (bases 1 to 1077) AUTHORS Oakley,C.J. TITLE Direct Submission JOURNAL Submitted (12-JAN-1995) C.J. Oakley, Fisons Pharmaceuticals, Bakewell Road, Loughborough, Leicestershire, LE11 ORH, UK COMMENT In conflict with Regan, J.W., Molecular Pharmacology, 46:213-220 (1994). FEATURES Location/Qualifiers source 1..1077 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="promyelocytic leukemia" /cell_line="HL60" CDS 1..1077 /codon_start=1 /product="EP2 prostaglandin receptor" /db_xref="PID:g633206" /db_xref="SWISS-PROT:P43116" /translation="MGNASNDSQSEDCETRQWLPPGESPAISSVMFSAGVLGNLIALA LLARRWRGDVGCSAGRRSSLSLFHVLVTELVFTDLLGTCLISPVVLASYARNQTLVAL APESRACTYFAFAMTFFSLATMLMLFAMALERYLSIGHPYFYQRRVSRSGGLAVLPVI YAVSLLFCSLPLLDYGQYVQYCPGTWCFIRHGRTAYLQLYATLLLLLIVSVLACNFSV ILNLIRMHRRSRRSRCGPSLGSGRGGPGARRRGERVSMAEETDHLILLAIMTITFAVC SLPFTIFAYMNETSSRKEKWDLQALRFLSINSIIDPWVFAILRPPVLRLMRSVLCCRI SLRTQDATQTSCSTQSDASKQADL" BASE COUNT 183 a 355 c 293 g 246 t ORIGIN 1 atgggcaatg cctccaatga ctcccagtct gaggactgcg agacgcgaca gtggcttccc 61 ccaggcgaaa gcccagccat cagctccgtc atgttctcgg ccggggtgct ggggaacctc 121 atagcactgg cgctgctggc gcgccgctgg cggggggacg tggggtgcag cgccggccgc 181 aggagctccc tctccttgtt ccacgtgctg gtgaccgagc tggtgttcac cgacctgctc 241 gggacctgcc tcatcagccc agtggtactg gcttcgtacg cgcggaacca gaccctggtg 301 gcactggcgc ccgagagccg cgcgtgcacc tacttcgctt tcgccatgac cttcttcagc 361 ctggccacga tgctcatgct cttcgccatg gccctggagc gctacctctc gatcgggcac 421 ccctacttct accagcgccg cgtctcgcgc tccgggggcc tggccgtgct gcctgtcatc 481 tatgcagtct ccctgctctt ctgctcgctg ccgctgctgg actatgggca gtacgtccag 541 tactgccccg ggacctggtg cttcatccgg cacgggcgga ccgcttacct gcagctgtac 601 gccaccctgc tgctgcttct cattgtctcg gtgctcgcct gcaacttcag tgtcattctc 661 aacctcatcc gcatgcaccg ccgaagccgg agaagccgct gcggaccttc cctgggcagt 721 ggccggggcg gccccggggc ccgcaggaga ggggaaaggg tgtccatggc ggaggagacg 781 gaccacctca ttctcctggc tatcatgacc atcaccttcg ccgtctgctc cttgcctttc 841 acgatttttg catatatgaa tgaaacctct tcccgaaagg aaaaatggga cctccaagct 901 cttaggtttt tatcaattaa ttcaataatt gacccttggg tctttgccat ccttaggcct 961 cctgttctga gactaatgcg ttcagtcctc tgttgtcgga tttcattaag aacacaagat 1021 gcaacacaaa cttcctgttc tacacagtca gatgccagta aacaggctga cctttga // LOCUS HSEP3A1 1911 bp RNA PRI 31-MAY-1995 DEFINITION H.sapiens mRNA for prostaglandin E receptor (EP3a1). ACCESSION X83857 NID g633207 KEYWORDS prostaglandin E2 receptor EP3 subtype. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1911) AUTHORS Schmid,A., Thierauch,K.H., Schleuning,W.D. and Dinter,H. TITLE Splice variants of the human EP3 receptor for prostaglandin E2 JOURNAL Eur. J. Biochem. 228 (1), 23-30 (1995) MEDLINE 95188908 REFERENCE 2 (bases 1 to 1911) AUTHORS Dinter,H. TITLE Direct Submission JOURNAL Submitted (12-JAN-1995) H. Dinter, Schering AG, IZMB S109/618, 13342 Berlin, FRG FEATURES Location/Qualifiers source 1..1911 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="uterus" CDS 208..1380 /codon_start=1 /product="prostaglandin E receptor, subtype EP3a1" /db_xref="PID:g633208" /db_xref="SWISS-PROT:P43115" /translation="MKETRGYGGDAPFCTRLNHSYTGMWAPERSAEARGNLTRPPGSG EDCGSVSVAFPITMLLTGFVGNALAMLLVSRSYRRRESKRKKSFLLCIGWLALTDLVG QLLTTPVVIVVYLSKQRWEHIDPSGRLCTFFGLTMTVFGLSSLFIASAMAVERALAIR APHWYASHMKTRATRAVLLGVWLAVLAFALLPVLGVGQYTVQWPGTWCFISTGRGGNG TSSSHNWGNLFFASAFAFLGLLALTVTFSCNLATIKALVSRCRAKATASQSSAQWGRI TTETAIQLMGIMCVLSVCWSPLLIMMLKMIFNQTSVEHCKTHTEKQKECNFFLIAVRL ASLNQILDPWVYLLLRKILLRKFCQIRYHTNNYASSSTSLPCQCSSTLMWSDHLER" BASE COUNT 408 a 575 c 472 g 456 t ORIGIN 1 gaaggcgtgg ctccctcccg ggccagtgag cctggcgccg ccgcggccgc gtcccagcag 61 cggagtaggg cggcggctgc gccccgcacc atgggggcag cccagcccca gccgcggtaa 121 acgccgacct ccgccgccgc ccgcgcccgt ctgccccctc ccgctgcggc tctctggacg 181 ccatcccctc ctcacctcga agccaacatg aaggagaccc ggggctacgg aggggatgcc 241 cccttctgca cccgcctcaa ccactcctac acaggcatgt gggcgcccga gcgttccgcc 301 gaggcgcggg gcaacctcac gcgccctcca gggtctggcg aggattgcgg atcggtgtcc 361 gtggccttcc cgatcaccat gctgctcact ggtttcgtgg gcaacgcact ggccatgctg 421 ctcgtgtcgc gcagctaccg gcgccgggag agcaagcgca agaagtcctt cctgctgtgc 481 atcggctggc tggcgctcac cgacctggtc gggcagcttc tcaccacccc ggtcgtcatc 541 gtcgtgtacc tgtccaagca gcgttgggag cacatcgacc cgtcggggcg gctctgcacc 601 tttttcgggc tgaccatgac tgttttcggg ctctcctcgt tgttcatcgc cagcgccatg 661 gccgtcgagc gggcgctggc catcagggcg ccgcactggt atgcgagcca catgaagacg 721 cgtgccaccc gcgctgtgct gctcggcgtg tggctggccg tgctcgcctt cgccctgctg 781 ccggtgctgg gcgtgggcca gtacaccgtc cagtggcccg ggacgtggtg cttcatcagc 841 accgggcgag ggggcaacgg gactagctct tcgcataact ggggcaacct tttcttcgcc 901 tctgcctttg ccttcctggg gctcttggcg ctgacagtca ccttttcctg caacctggcc 961 accattaagg ccctggtgtc ccgctgccgg gccaaggcca cggcatctca gtccagtgcc 1021 cagtggggcc gcatcacgac cgagacggcc attcagctta tggggatcat gtgcgtgctg 1081 tcggtctgct ggtctccgct cctgataatg atgttgaaaa tgatcttcaa tcagacatca 1141 gttgagcact gcaagacaca cacggagaag cagaaagaat gcaacttctt cttaatagct 1201 gttcgcctgg cttcactgaa ccagatcttg gatccttggg tttacctgct gttaagaaag 1261 atccttcttc gaaagttttg ccagatcagg taccacacaa acaactatgc atccagctcc 1321 acctccttac cctgccagtg ttcctcaacc ttgatgtgga gcgaccattt ggaaagatga 1381 gaaaaagaag actcagagag caagaggaat tttggggaaa ttaaaacctg cctttctgcc 1441 aggatcacat cactggaagc tccatgactc tctttttgta aaagaaaaaa aaatcacaga 1501 aacacccacc tcccaaacta ttctctttta cttcttcccc caagcccacc cccaaatata 1561 actgttatcc agaagctgtt atgtcctgtt tccatacatg tttttgtact tttactatat 1621 ctacatacat caattaaact tatgtcctat tgttttgtga atttatattt gcgtatacat 1681 tatcatatgt aaaatttgca tttttttatt gaaaattatg tttcttgaga tttatccaca 1741 ttgaaacatg gagctctaaa tcgttaattt taaccgctat agagtattcc ataatttgaa 1801 taaagcataa tttgtttgta catctcccgc caagggaaaa ttatttccac actcatcatg 1861 acaaggagca ctgcaaaaat aaaaataaaa attacattca tacatgttta a // LOCUS HSEPAR 760 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for EPA glycoprotein (erythroid-potentiating activity). ACCESSION X02598 NID g31188 KEYWORDS erythroid-potentiating activity; glycoprotein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 760) AUTHORS Gasson,J.C., Golde,D.W., Kaufman,S.E., Westbrook,C.A., Hewick,R.M., Kaufman,R.J., Wong,G.G., Temple,P.A., Leary,A.C., Brown,E.L., Orr,E.C. and Clark,S.C. TITLE Molecular characterization and expression of the gene encoding human erythroid-potentiating activity JOURNAL Nature 315 (6022), 768-771 (1985) MEDLINE 85240567 COMMENT Data kindly reviewed (13-MAY-1986) by S. Clark. FEATURES Location/Qualifiers source 1..760 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 42..665 /note="EPA glycoprotein" /codon_start=1 /db_xref="PID:g31189" /translation="MAPFEPLASGILLLLWLIAPSRPCTCVPPHPQTAFCNSDLVIRA KFVGTPEVNQTTLYQRYEIKMTKMYKGFQALGDAADIRFVYTPAMESVCGYFHRSHNR SEEFLIAGKLQDGLLHITTCSFVAPWNSLSLAQRRGFTKTYTVGCEECTVFPCLSIPC KLQSGTHCLWTDQLLQGSEKGFQSRHLACLPREPGLCTWQSLRSQIA" BASE COUNT 163 a 248 c 188 g 161 t ORIGIN 1 gccgcagatc cagcgcccag agagacacca gagaacccac catggccccc tttgagcccc 61 tggcttctgg catcctgttg ttgctgtggc tgatagcccc cagcaggccc tgcacctgtg 121 tcccacccca cccacagacg gccttctgca attccgacct cgtcatcagg gccaagttcg 181 tggggacacc agaagtcaac cagaccacct tataccagcg ttatgagatc aagatgacca 241 agatgtataa agggttccaa gccttagggg atgccgctga catccggttc gtctacaccc 301 ccgccatgga gagtgtctgc ggatacttcc acaggtccca caaccgcagc gaggagtttc 361 tcattgctgg aaaactgcag gatggactct tgcacatcac tacctgcagt tttgtggctc 421 cctggaacag cctgagctta gctcagcgcc ggggcttcac caagacctac actgttggct 481 gtgaggaatg cacagtgttt ccctgtttat ccatcccctg caaactgcag agtggcactc 541 attgcttgtg gacggaccag ctcctccaag gctctgaaaa gggcttccag tcccgtcacc 601 ttgcctgcct gcctcgggag ccagggctgt gcacctggca gtccctgcgg tcccagatag 661 cctgaatcct gcccggagtg gaagctgaag cctgcacagt gtccaccctg ttcccactcc 721 catctttctt ccggacaatg aaataaagag ttacaccagc // LOCUS HSEPIC 2265 bp RNA PRI 25-JUL-1993 DEFINITION H.sapiens mRNA for epican. ACCESSION X66733 S45674 NID g31190 KEYWORDS Epican. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2265) AUTHORS Kugelmann,L.C. TITLE Direct Submission JOURNAL Submitted (19-JUN-1992) L.C. Kugelmann, Yale University, Dept of Dermatology 500 LCI, New Haven, CT 06510, USA REFERENCE 2 (bases 1 to 2265) AUTHORS Kugelman,L.C., Ganguly,S., Haggerty,J.G., Weissman,S.M. and Milstone,L.M. TITLE The core protein of epican, a heparan sulfate proteoglycan on keratinocytes, is an alternative form of CD44 JOURNAL J. Invest. Dermatol. 99, 381-385 (1992) FEATURES Location/Qualifiers source 1..2265 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="Neonatal" /tissue_type="Foreskin" /cell_type="Keratinocytes" /clone_lib="lambda gt11" /clone="lambda 1" /chromosome="11" CDS 130..2229 /codon_start=1 /product="epican" /db_xref="PID:g31191" /translation="MDKFWWHAAWGLCLVPLSLAQIDLNITCRFAGVFHVEKNGRYSI SRTEAADLCKAFNSTLPTMAQMEKALSIGFETCRYGFIEGHVVIPRIHPNSICAANNT GVYILTSNTSQYDTYCFNASAPPEEDCTSVTDLPNAFDGPITITIVNRDGTRYVQKGE YRTNPEDIYPSNPTDDDVSSGSSSERSSTSGGYIFYTFSTVHPIPDEDSPWITDSTDR IPATSTSSNTISAGWEPNEENEDERDRHLSFSGSGIDDDEDFISSTISTTPRAFDHTK QNQDWTQWNPSHSNPEVLLQTTTRMTDVDRNGTTAYEGNWNPEAHPPLIHHEHHEEEE TPHSTSTIQATPSSTTEETATQKEQWFGNRWHVGYRQTPKEDSHSTTGTAAASAHTSH PMQGRTTPSPEDSSWTDFFNPISHPMGRGHQAGRRMDMDSSHSTTLQPTANPNTGLVE DLDRTGPLSMTTQQSNSQSFSTSHEGLEEDKDHPTTSTLTSSNRNDVTGGRRDPNHSE GSTTLLEGYTSHYPHTKESRTFIPVTSAKTGSFGVTAVTVGDSNSNVNRSLSGDQDTF HPSGGSHTTHGSESDGHSHGSQEGGANTTSGPIRTPQIPEWLIILASLLALALILAVC IAVNSRRRCGQKKKLVINSGNGAVEDRKPSGLNGEASKSQEMVHLVNKESSETPDQFM TADETRNLQNVDMKIGV" BASE COUNT 666 a 622 c 519 g 458 t ORIGIN 1 cggcccagcg gaccccagcc tctgccaggt tcggtccgcc atcctcgtcc cgtcctccgc 61 cggcccctgc cccgcgccca gggatcctcc agctcctttc gcccgcgccc tccgttcgct 121 ccggacacca tggacaagtt ttggtggcac gcagcctggg gactctgcct cgtgccgctg 181 agcctggcgc agatcgattt gaatataacc tgccgctttg caggtgtatt ccacgtggag 241 aaaaatggtc gctacagcat ctctcggacg gaggccgctg acctctgcaa ggctttcaat 301 agcaccttgc ccacaatggc ccagatggag aaagctctga gcatcggatt tgagacctgc 361 aggtatgggt tcatagaagg gcacgtggtg attccccgga tccaccccaa ctccatctgt 421 gcagcaaaca acacaggggt gtacatcctc acatccaaca cctcccagta tgacacatat 481 tgcttcaatg cttcagctcc acctgaagaa gattgtacat cagtcacaga cctgcccaat 541 gcctttgatg gaccaattac cataactatt gttaaccgtg atggcacccg ctatgtccag 601 aaaggagaat acagaacgaa tcctgaagac atctacccca gcaaccctac tgatgatgac 661 gtgagcagcg gctcctccag tgaaaggagc agcacttcag gaggttacat cttttacacc 721 ttttctactg tacaccccat cccagacgaa gacagtccct ggatcaccga cagcacagac 781 agaatccctg ctaccagtac gtcttcaaat accatctcag caggctggga gccaaatgaa 841 gaaaatgaag atgaaagaga cagacacctc agtttttctg gatcaggcat tgatgatgat 901 gaagatttta tctccagcac catttcaacc acaccacggg cttttgacca cacaaaacag 961 aaccaggact ggacccagtg gaacccaagc cattcaaatc cggaagtgct acttcagaca 1021 accacaagga tgactgatgt agacagaaat ggcaccactg cttatgaagg aaactggaac 1081 ccagaagcac accctcccct cattcaccat gagcatcatg aggaagaaga gaccccacat 1141 tctacaagca caatccaggc aactcctagt agtacaacgg aagaaacagc tacccagaag 1201 gaacagtggt ttggcaacag atggcatgtg ggatatcgcc aaacacccaa agaagactcc 1261 cattcgacaa cagggacagc tgcagcctca gctcatacca gccatccaat gcaaggaagg 1321 acaacaccaa gcccagagga cagttcctgg actgatttct tcaacccaat ctcacacccc 1381 atgggacgag gtcatcaagc aggaagaagg atggatatgg actccagtca tagtacaacg 1441 cttcagccta ctgcaaatcc aaacacaggt ttggtggaag atttggacag gacaggacct 1501 ctttcaatga caacgcagca gagtaattct cagagcttct ctacatcaca tgaaggcttg 1561 gaagaagata aagaccatcc aacaacttct actctgacat caagcaatag gaatgatgtc 1621 acaggtggaa gaagagaccc aaatcattct gaaggctcaa ctactttact ggaaggttat 1681 acctctcatt acccacacac gaaggaaagc aggaccttca tcccagtgac ctcagctaag 1741 actgggtcct ttggagttac tgcagttact gttggagatt ccaactctaa tgtcaatcgt 1801 tccttatcag gagaccaaga cacattccac cccagtgggg ggtcccatac cactcatgga 1861 tctgaatcag atggacactc acatgggagt caagaaggtg gagcaaacac aacctctggt 1921 cctataagga caccccaaat tccagaatgg ctgatcatct tggcatccct cttggccttg 1981 gctttgattc ttgcagtttg cattgcagtc aacagtcgaa gaaggtgtgg gcagaagaaa 2041 aagctagtga tcaacagtgg caatggagct gtggaggaca gaaagccaag tggactcaac 2101 ggagaggcca gcaagtctca ggaaatggtg catttggtga acaaggagtc gtcagaaact 2161 ccagaccagt ttatgacagc tgatgagaca aggaacctgc agaatgtgga catgaagatt 2221 ggggtgtaac acctacacca ttatcttgga aagaaacaac cgttg // LOCUS HSEPIT1 2152 bp RNA PRI 26-AUG-1992 DEFINITION H.sapiens mRNA for epithelin 1 and 2. ACCESSION X62320 NID g31192 KEYWORDS epithelial cell growth regulator; Epithelin 1; Epithelin 2; soluble protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2152) AUTHORS Plowman,G.D., Green,J.M., Neubauer,M.G., Buckley,S.D., McDonald,V.L., Todaro,G.J. and Shoyab,M. TITLE The epithelin precursor encodes two proteins with opposing activities on epithelial cell growth JOURNAL J. Biol. Chem. 267 (18), 13073-13078 (1992) MEDLINE 92317004 REFERENCE 2 (bases 1 to 2152) AUTHORS Plowman,G.D. TITLE Direct Submission JOURNAL Submitted (15-JAN-1991) G.D. Plowman, Oncogen, 3005 1st Avenue, Seattle, WA 98121, U S A COMMENT See also X62320-2. FEATURES Location/Qualifiers source 1..2152 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" /clone_lib="PCR clones in Bluescript" sig_peptide 41..91 CDS 41..1822 /codon_start=1 /product="Epithelin 1 & 2" /db_xref="PID:g31193" /db_xref="SWISS-PROT:P28799" /translation="MWTLVSWVALTAGLVAGTRCPDGQFCPVACCLDPGGASYSCCRP LLDKWPTTLSRHLGGPCQVDAHCSAGHSCIFTVSGTSSCCPFPEAVACGDGHHCCPRG FHCSADGRSCFQRSGNNSVGAIQCPDSQFECPDFSTCCVMVDGSWGCCPMPQASCCED RVHCCPHGAFCDLVHTRCITPTGTHPLAKKLPAQRTNRAVALSSSVMCPDARSRCPDG STCCELPSGKYGCCPMPNATCCSDHLHCCPQDTVCDLIQSKCLSKENATTDLLTKLPA HTVGDVKCDMEVSCPDGYTCCRLQSGAWGCCPFTQAVCCEDHIHCCPAGFTCDTQKGT CEQGPHQVPWMEKAPAHLSLPDPQALKRDVPCDNVSSCPSSDTCCQLTSGEWGCCPIP EAVCCSDHQHCCPQGYTCVAEGQCQRGSEIVAGLEKMPARRASLSHPRDIGCDQHTSC PVGQTCCPSLGGSWACCQLPHAVCCEDRQHCCPAGYTCNVKARSCEKEVVSAQPATFL ARSPHVGVKDVECGEGHFCHDNQTCCRDNRQGWACCPYRQGVCCADRRHCCPAGFRCA ARGTKCLRREAPRWDAPLRDPALRQLL" mat_peptide 92..1819 /product="Epithelin 1 & 2" misc_feature 215..382 /note="cysteine motif 1" misc_feature 392..394 /note="pot. glycosylation site" misc_feature 410..580 /note="cysteine motif 2" misc_feature 656..826 /note="Epithelin 2" misc_feature 746..748 /note="pot. glycosylation site" misc_feature 833..835 /note="pot. glycosylation site" misc_feature 884..1051 /note="Epithelin 1" misc_feature 1130..1294 /note="cysteine motif 5" misc_feature 1142..1144 /note="pot. glycosylation site" misc_feature 1364..1531 /note="cysteine motif 6" misc_feature 1595..1762 /note="cysteine motif 7" misc_feature 1628..1630 /note="pot. glycosylation site" BASE COUNT 394 a 691 c 639 g 428 t ORIGIN 1 gctgctgccc aaggaccgcg gagtcggacg caggcagacc atgtggaccc tggtgagctg 61 ggtggcctta acagcagggc tggtggctgg aacgcggtgc ccagatggtc agttctgccc 121 tgtggcctgc tgcctggacc ccggaggagc cagctacagc tgctgccgtc cccttctgga 181 caaatggccc acaacactga gcaggcatct gggtggcccc tgccaggttg atgcccactg 241 ctctgccggc cactcctgca tctttaccgt ctcagggact tccagttgct gccccttccc 301 agaggccgtg gcatgcgggg atggccatca ctgctgccca cggggcttcc actgcagtgc 361 agacgggcga tcctgcttcc aaagatcagg taacaactcc gtgggtgcca tccagtgccc 421 tgatagtcag ttcgaatgcc cggacttctc cacgtgctgt gttatggtcg atggctcctg 481 ggggtgctgc cccatgcccc aggcttcctg ctgtgaagac agggtgcact gctgtccgca 541 cggtgccttc tgcgacctgg ttcacacccg ctgcatcaca cccacgggca cccaccccct 601 ggcaaagaag ctccctgccc agaggactaa cagggcagtg gccttgtcca gctcggtcat 661 gtgtccggac gcacggtccc ggtgccctga tggttctacc tgctgtgagc tgcccagtgg 721 gaagtatggc tgctgcccaa tgcccaacgc cacctgctgc tccgatcacc tgcactgctg 781 cccccaagac actgtgtgtg acctgatcca gagtaagtgc ctctccaagg agaacgctac 841 cacggacctc ctcactaagc tgcctgcgca cacagtgggg gatgtgaaat gtgacatgga 901 ggtgagctgc ccagatggct atacctgctg ccgtctacag tcgggggcct ggggctgctg 961 cccttttacc caggctgtgt gctgtgagga ccacatacac tgctgtcccg cggggtttac 1021 gtgtgacacg cagaagggta cctgtgaaca ggggccccac caggtgccct ggatggagaa 1081 ggccccagct cacctcagcc tgccagaccc acaagccttg aagagagatg tcccctgtga 1141 taatgtcagc agctgtccct cctccgatac ctgctgccaa ctcacgtctg gggagtgggg 1201 ctgctgtcca atcccagagg ctgtctgctg ctcggaccac cagcactgct gcccccaggg 1261 ctacacgtgt gtagctgagg ggcagtgtca gcgaggaagc gagatcgtgg ctggactgga 1321 gaagatgcct gcccgccggg cttccttatc ccaccccaga gacatcggct gtgaccagca 1381 caccagctgc ccggtggggc agacctgctg cccgagcctg ggtgggagct gggcctgctg 1441 ccagttgccc catgctgtgt gctgcgagga tcgccagcac tgctgcccgg ctggctacac 1501 ctgcaacgtg aaggctcgat cctgcgagaa ggaagtggtc tctgcccagc ctgccacctt 1561 cctggcccgt agccctcacg tgggtgtgaa ggacgtggag tgtggggaag gacacttctg 1621 ccatgataac cagacctgct gccgagacaa ccgacagggc tgggcctgct gtccctaccg 1681 ccagggcgtc tgttgtgctg atcggcgcca ctgctgtcct gctggcttcc gctgcgcagc 1741 caggggtacc aagtgtttgc gcagggaggc cccgcgctgg gacgcccctt tgagggaccc 1801 agccttgaga cagctgctgt gagggacagt actgaagact ctgcagccct cgggacccca 1861 ctcggagggt gccctctgct caggcctccc tagcacctcc ccctaaccaa attctccctg 1921 gaccccattc tgagctcccc atcaccatgg gaggtggggc ctcaatctaa ggccttccct 1981 gtcagaaggg ggttgtggca aaagccacat tacaagctgc catcccctcc ccgtttcagt 2041 ggaccctgtg gccaggtgct tttccctatc cacaggggtg tttgtgtgtg tgcgcgtgtg 2101 cgtttcaata aagtttgtac actttcaaaa aaaaaaaaaa aaaaaaaaaa aa // LOCUS HSEPMG50 1927 bp RNA PRI 30-JUN-1993 DEFINITION H.sapiens mRNA for 50 kDa erythrocyte plasma membrane glycoprotein. ACCESSION X64594 S46252 NID g31194 KEYWORDS glycoprotein; plasma glycoprotein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1927) AUTHORS Ridgwell,K. TITLE Direct Submission JOURNAL Submitted (23-MAR-1992) K. Ridgwell, University of Bristol, Dept of Biochemistry, School of Medical Sciences, University Walk, Bristol BS8 1TD, UK REFERENCE 2 (bases 1 to 1927) AUTHORS Ridgwell,K., Spurr,N.K., Laguda,B., MacGeoch,C., Avent,N.D. and Tanner,M.J. TITLE Isolation of cDNA clones for a 50 kDa glycoprotein of the human erythrocyte membrane associated with Rh (rhesus) blood-group antigen expression JOURNAL Biochem. J. 287 (Pt 1), 223-228 (1992) MEDLINE 93038558 FEATURES Location/Qualifiers source 1..1927 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="foetal" /tissue_type="bone marrow, liver" /chromosome="6" /map="6p21-qter" CDS 28..1257 /codon_start=1 /product="50 kDa erythrocyte plasma membrane glycoprotein" /db_xref="PID:g31195" /db_xref="SWISS-PROT:Q02094" /translation="MRFTFPLMAIVLEIAMIVLFGLFVEYETDQTVLEQLNITKPTDM GIFFELYPLFQDVHVMIFVGFGFLMTFLKKYGFSSVGINLLVAALGLQWGTIVQGILQ SQGQKFNIGIKNMINADFSAATVLISFGAVLGKTSPTQMLIMTILEIVFFAHNEYLVS EIFKASDIGASMTIHAFGAYFGLAVAGILYRSGLRKGHENEESAYYSDLFAMIGTLFL WMFWPSFNSAIAEPGDKQCRAIVDTYFSLAACVLTAFAFSSLVEHRGKLNMVHIQNAT LAGGVAVGTCADMAIHPFGSMIIGSIAGMVSVLGYKFLTPLFTTKLRIHDTCGVHNLH GLPGVVGGLAGIVAVAMGASNTSMAMQAAALGSSIGTAVVGGLMTGLILKLPLWGQPS DQNCYDDSVYWKVPKTR" misc_feature 136..138 /standard_name="glycosylation site" misc_feature 307..309 /standard_name="glycosylation site" misc_feature 390..392 /standard_name="glycosylation site" misc_feature 751 /note="ASN/ASP polymorphism" BASE COUNT 563 a 392 c 411 g 561 t ORIGIN 1 agtgtgcctc tgtcctttgc cacaaacatg aggttcacat tccctctcat ggctatagtc 61 ctggaaattg ccatgattgt tttatttgga ttatttgttg agtatgaaac ggaccagact 121 gttctcgagc agctcaacat caccaagcca acagacatgg gcatattctt tgagttatat 181 cctctgttcc aagatgtaca tgttatgata tttgttgggt ttggcttcct catgaccttc 241 ctgaagaaat atggcttcag cagtgtgggt atcaacctac tcgttgctgc tttgggcctc 301 cagtggggca ctattgtaca gggaatcctg caaagccagg gacagaaatt taacattgga 361 atcaaaaaca tgataaatgc agacttcagt gcagccacag ttctgatatc ttttggagct 421 gtcctgggaa aaacgagccc cacccaaatg ctgatcatga caattttaga aattgttttc 481 tttgcccaca atgaatacct ggttagtgaa atatttaagg cctctgacat tggagcatca 541 atgacgatcc atgcctttgg ggcctacttt ggcttggctg tagcaggcat cttgtatcga 601 tctggactga gaaaggggca tgaaaatgaa gagtccgcat actactcaga cttgtttgca 661 atgattggga ctctctttct gtggatgttt tggcccagct ttaactcggc cattgctgaa 721 cctggagaca aacagtgcag ggccattgta gacacgtact tctctctcgc tgcctgtgtg 781 ctcacagcct ttgccttctc cagcctagtg gagcaccgag gcaagctcaa catggttcac 841 attcagaatg ccacccttgc tggaggagtt gctgtgggca cttgtgcgga tatggcaatt 901 cacccatttg gttctatgat tattgggagc attgcaggaa tggtctctgt gcttggatac 961 aagttcctga ctccactttt tactactaaa ctgaggatcc atgatacatg tggggtccat 1021 aacctccacg gcttacctgg tgtagtggga ggccttgcag gcattgtggc agtagcaatg 1081 ggcgcctcca acacgtctat ggccatgcag gcagctgcac tgggttcctc tatcggaaca 1141 gcagttgttg gaggtctgat gacaggttta attctaaagt tgcctctctg gggacagcca 1201 tctgaccaga actgctatga tgattctgtt tattggaagg tccctaagac gagataactt 1261 gacaatcagt tccatggaca tggtgaccac agccagctgg aacctgaagt ctaaacacca 1321 ttcctgctct ccagcttcct ttcccattat ccagaatcaa gtccaaataa acaaaaaggg 1381 agtaaccaaa gagagtatgg accagagtga atagatccta agtcccaaat ggccagtgta 1441 aaaatgtcct tatgtctgat gctgtctctt gctcttcaat gattaattga ggggatgtta 1501 ctcataaaac agataatcaa atagatcttc tccaggattc ccaaaaagct tttggcagtg 1561 agtaaataca gagtaaacat gtcagtttct taatgtagac actatgtctt caatcccaaa 1621 aattataaaa ctgaaaccca tgaagcaaga atagatgtga gaaatctatg taaaaaaata 1681 attaaagaaa tgcatgtgtg taaagtagta atatgatgat tttaggtagt gctttttatt 1741 ttaaaaatag tctagttagt aatgttgtat ccttgcatga atattattct taattccttt 1801 tgcatgttga ctatttgcaa cgagctcaaa tgctatctga tcaaagtcta ttttgcataa 1861 aatgtccaat aattaaatat tgttataaaa taaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1921 aaaaaaa // LOCUS HSEPSPP2A 3197 bp RNA PRI 07-OCT-1996 DEFINITION H.sapiens mRNA for epsilon isoform of 61kDa regulatory subunit of PP2A. ACCESSION Z69029 NID g1418775 KEYWORDS 61kDa regulatory subunit; epsilon isoform; PP2A gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3197) AUTHORS Hemmings,B.B.A. TITLE Direct Submission JOURNAL Submitted (27-JAN-1996) Hemmings B.B.A., Friedrich Miescher-Institut, Maulbeerstrasse 66, BASEL, Switzerland, CH-4002 REFERENCE 2 (bases 1 to 3197) AUTHORS Zolnierowicz,S., Van Hoof,C., Andjelkovic,N., Cron,P., Stevens,I., Merlevede,W., Goris,J. and Hemmings,B.A. TITLE The variable subunit associated with protein phosphatase 2A0 defines a novel multimember family of regulatory subunits JOURNAL Biochem. J. 317 (Pt 1), 187-194 (1996) MEDLINE 96276417 FEATURES Location/Qualifiers source 1..3197 /organism="Homo sapiens" /isolate="pooled tissue" /db_xref="taxon:9606" /clone="PR61-8a" /dev_stage="Fetus" /tissue_type="Retina" /clone_lib="cDNA library (Stratagene)" 5'UTR 1..505 CDS 506..1909 /codon_start=1 /product="epsilon isoform of 61kDa regulatory subunit of PP2A" /db_xref="PID:e220196" /db_xref="PID:g1418776" /translation="MSSAPTTPPSVDKVDGFSRKSVRKARQKRSQSSSQFRSQGKPIE LTPLPLLKDVPSSEQPELFLKKLQQCCVIFDFMDTLSDLKMKEYKRSTLNELVDYITI SRGCLTEQTYPEVVRMVSCNIFRTLPPSDSNEFDPEEDEPTLEASWPHLQLVYEFFIR FLESQEFQPSIAKKYIDQKFVLQLLELFDSEDPRERDYLKTVLHRIYGKFLGLRAFIR KQINNIFLRFVYETEHFNGVAELLEILGSIINGFALPLKAEHKQFLVKVLIPLHTVRS LSLFHAQLAYCIVQFLEKDPSLTEPVIRGLMKFWPKTCSQKEVMFLGELEEILDVIEP SQFVKIQEPLFKQIAKCVSSPHFQVAERALYYWNNEYIMSLIEENSNVILPIMFSSLY RISKEHWNPAIVALVYNVLKAFMEMNSTMFDELTATYKSDRQREKKKEKEREELWKKL EDLELKRGLRRDGIIPT" BASE COUNT 943 a 689 c 684 g 881 t ORIGIN 1 gggtagcgaa aggtggctct ggcagcggcg gctccagctc ctgcggctcc tcctccttat 61 tctgtcccct tctcttgctg ccgctgcaga tccagtcttc ctccctccct tccccccctc 121 cccacgtcgt cgccgccgcc gccgggtccg gggcaacgag ctgaggcgcc gcccgccagg 181 aatgtgagcg aggagccacc ggcggagcca aacggggtcg gtgccgattt gatgggacgg 241 gcccgcgggg gaggatcgtg aggccgccgc cgccaccgga acgctgaggt tcgggtccgg 301 ccgtgaggcc taggaggctc cgccgccgcg gaaccggagg gacccgtacc ggacagccgt 361 cgccccaggc tccccgcagc tgcccggacc tccccctgca cgtcccggtc ccgccgcccg 421 cccgctgcgg ccacctcgcc cgtctcccgc ccctccaagc cacagatcat cttcggattc 481 ttccccagaa gcttcaagta gggatatgtc ctcagcacca actactcctc catcagtgga 541 taaagtagac ggattttctc ggaagtccgt cagaaaagcg agacagaaga ggtcgcaaag 601 ttcctcacag tttaggtctc aaggcaagcc tattgagtta acacctctgc cgctgctaaa 661 agacgttcca tcctcagagc agcctgaact gttcctaaag aaacttcagc agtgctgtgt 721 catttttgac ttcatggaca cgctatctga tcttaaaatg aaagaataca agcgctccac 781 tcttaatgaa ctggtggact acattacaat aagcagaggc tgtttgacag agcagactta 841 ccctgaagta gttagaatgg tatcttgcaa tatattcaga actctccctc ctagtgacag 901 caatgaattt gatccagaag aagatgaacc tacccttgag gcatcgtggc cacacttaca 961 gcttgtatat gaatttttca tacgattttt ggaaagccaa gaattccaac ccagcattgc 1021 caaaaaatat atagatcaga aatttgtatt acagcttctg gagctatttg acagcgaaga 1081 ccctcgggaa cgggactact taaaaacagt cttacacaga atttatggca agtttcttgg 1141 tcttagagca tttatccgaa aacagattaa caatattttt ctaaggtttg tttatgaaac 1201 agaacacttc aatggtgtag ctgaactgct ggaaatatta ggaagtatta tcaatggctt 1261 tgctttacct cttaaggcag aacacaaaca gtttctggtg aaagtattga tccctttaca 1321 cactgtcagg agcttatcac tcttccatgc acagctggca tattgtatag tacagtttct 1381 ggagaaagat ccttcactca cagaaccagt tattaggggg ttaatgaaat tttggcctaa 1441 aacatgtagt caaaaagagg tcatgttcct tggggaactg gaagaaatat tggatgtgat 1501 tgaaccttca caatttgtta aaatccaaga acctttgttt aaacaaatcg ccaagtgtgt 1561 atctagcccc cattttcagg tggcagaaag agcactctat tattggaata atgaatacat 1621 catgagtttg atagaagaaa actctaacgt catccttccc atcatgtttt ccagccttta 1681 taggatttca aaagaacatt ggaatccggc tattgtggcg ttggtgtaca atgtgttgaa 1741 ggcatttatg gaaatgaaca gcaccatgtt tgacgagctg acagccacat acaagtcaga 1801 tcgtcagcgt gagaaaaaga aagaaaagga gcgtgaagaa ttgtggaaaa aattggagga 1861 tctggagtta aagagaggtc ttagacgtga tggaataatt ccaacttaac aaaaacaatg 1921 acaacaacat tactaacctg tggagtcaca cgtttatgta gtagaagatg gagcaacagt 1981 tttctgtatt gtgcacttta cagtagattt cacctttgtt tcattattac agcagcactg 2041 tatatacctg tctctaagta aaggaaaaaa caaaataagg acttcaatcc aaagtttgga 2101 cagtagatgg acttctcaga actttgcaaa cataatcatt gttctcaccc tcttttaaaa 2161 aaaaaatcgg tcttcaaaga tctgttgatg aaattgctat gttaaaattc cattatcggg 2221 agttccttat ttatcactag cagagagtat gatacaattt tcaaatgtga acaatcttaa 2281 atttagcttg tctttctgct aagctgttaa atgtatttat agtaaaggaa gaaaaaaaga 2341 ctgtcatttc cttataagtt tgtgtaacat cctcctctgg ataacttgac tgtaatttga 2401 catctttttc ttttgcacat cttcctgagt tgaatgtcca cgtggaatgg ggtcatgaat 2461 tataaaagtc cctgataaaa gttttgttta ctggggtgaa catctttcca gtaaccaggt 2521 agtcctggta ctcctttagt tttaaaatta ggagttaaga gagaagaggt gataaacata 2581 gtagggaagg gaatatcgga ttcatgcatc agtttatggt gaatccaaat caatgtcttg 2641 aatcctttga aaacaggcac tgggacatca caggcttcag tacctgacca gtattagttg 2701 catatatcat tgaacacaca taccagagat gttttagaaa tgcgagaaaa acatcctttt 2761 ggaccatttg aataagaaga caacactaac atacaccatg aattgatcac cgggattgca 2821 atctattggg aaagagttga gcaacagctt ggactgtttg gagttgttgc cttacttttt 2881 aatatgtatt tataaagtat tccagcaaaa gaggatgtag cctctgggaa aaaacaaaca 2941 tgttacagtg ttttttgtag attctcgttc tatatctcat cacagcgcca gccctgtttt 3001 tagccggaaa ggattcagga taaacattat tatgcattct gaattggatg catattccta 3061 actactgtat ttgttaccaa aagtggttct acaaatgcta ctgaaaaaaa tctggaaatt 3121 cctaatgtcc tgagtattaa taataaagtt taaaaatgct tttatatcaa aggtgcatcg 3181 tgaccaaatt gtttaag // LOCUS HSER81TFR 1603 bp RNA PRI 30-OCT-1995 DEFINITION H.sapiens mRNA for ER81 transcription factor. ACCESSION X87175 NID g1045060 KEYWORDS er81 gene; ER81 protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1603) AUTHORS De Launoit,Y.P. TITLE Direct Submission JOURNAL Submitted (11-MAY-1995) Y.P. de Launoit, CNRS URA 1160 - Unite d Oncologie Mol., Institut Pasteur de Lille, 1 rue Calmette, F- 59019 - Lille Cedex, FRANCE REFERENCE 2 (bases 1 to 1603) AUTHORS Monte,D., Coutte,L., Baert,J.L., Angeli,I., Stehelin,D. and de Launoit,Y. TITLE Molecular characterization of the ets-related human transcription factor ER81 JOURNAL Oncogene 11 (4), 771-779 (1995) MEDLINE 95380185 FEATURES Location/Qualifiers source 1..1603 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" gene 219..1595 /gene="ER81" CDS 219..1595 /gene="ER81" /codon_start=1 /product="ER81 protein" /db_xref="PID:g1045061" /translation="MDGFYDQQVPYMVTNSQRGRNCNEKPTNVRKRKFINRDVAHDSE ELFQDLSQLQETWLAEVAFHGLPLKIKKEPHSPCSEISSACSQEQPFKFSYGEKCLYN VSAYDQKPQVGMRPSNPPTPSSTPVSPLHHASPNSTHTPKPDRAFPAHLPPSQSIPDS SYPMDHRFRRQLSEPCNSFPPLPTMPREGRPMYQRQMSEPNIPFPPQGFKQEYHDPVY EHNTMVGSAASQSFPPLMIKQEPRDFAYDSEVPSCHSIYMRQEGFLAHPSRTEGCMFE KGPRQFYDDTCVVPEKFDGDIKQEPGMYREGPTYQRRGSLQLWQFLVALLDDPANSHF IAWTGRGMEFKLIEPEEVARRWGIQKNRPAMNYDKLSRSLRYYYEKGIMQKVAGERYV YKFVCDPEALFSMAFPDNQRPLLKTDMERHINEEDTVPLSHFDESMAYMPEGGCCNPH PYNEGYVY" BASE COUNT 461 a 427 c 357 g 358 t ORIGIN 1 acccccatgc attggttgca ccctcagata gtacccatga gcttcactgt tcagcctcgg 61 ggcccaggcg cttcctggaa tctctccttg gccggggtta aaatgcagtt cccgctcaaa 121 atgcttcata ggttgataga agtccagatc ctgaggaaat ctccagctaa atgctcaaaa 181 tataaaaact gaagctcaca tttgcgaaga gcagcagcat ggatggattt tatgaccagc 241 aagtgcctta catggtcacc aatagtcagc gtgggagaaa ttgtaacgag aaaccaacaa 301 atgtcaggaa aagaaaattc attaacagag atgtggctca tgattcagaa gaactctttc 361 aagatctaag tcaattacag gaaacatggc ttgcagaagt ggcttttcat ggcctgccac 421 tgaaaatcaa gaaagaaccc cacagtccat gttcagaaat cagctctgcc tgcagtcaag 481 aacagccctt taaattcagc tatggagaaa agtgcctgta caatgtcagt gcctatgatc 541 agaagccaca agtgggaatg aggccctcca acccccccac accatccagc acgccagtgt 601 ccccactgca tcatgcatct ccaaactcaa ctcatacacc gaaacctgac cgggccttcc 661 cagctcacct ccctccatcg cagtccatac cagatagcag ctaccccatg gaccacagat 721 ttcgccgcca gctttctgaa ccctgtaact cctttcctcc tttgccgacg atgccaaggg 781 aaggacgtcc tatgtaccaa cgccagatgt ctgagccaaa catccccttc ccaccacaag 841 gctttaagca ggagtaccac gacccagtgt atgaacacaa caccatggtt ggcagtgcgg 901 ccagccaaag cttccctcct ttgatgatta aacaggaacc cagagatttt gcatatgact 961 cagaagtgcc tagctgccac tccatttata tgaggcaaga aggcttcctg gctcatccca 1021 gcagaacaga aggctgtatg tttgaaaagg gccccaggca gttttatgat gacacctgtg 1081 ttgtcccaga aaaattcgat ggagacatca aacaagagcc aggaatgtat cgggaaggac 1141 ccacatacca acggcgagga tcacttcagc tctggcagtt tttggtagct cttctggatg 1201 acccggcaaa ttctcatttt attgcctgga ctggtcgagg catggaattt aaactgattg 1261 agcctgaaga ggtggcccga cgttggggca ttcagaaaaa caggccagct atgaactatg 1321 ataaacttag ccgttcactc cgctattact atgagaaagg aattatgcaa aaggtggctg 1381 gagagagata tgtctacaag tttgtgtgtg atccagaagc ccttttctcc atggcctttc 1441 cagataatca gcgtccactg ctgaagacag acatggaacg tcacatcaac gaggaggaca 1501 cagtgcctct ttctcacttt gatgagagca tggcctacat gccggaaggg ggctgctgca 1561 acccccaccc ctacaacgaa ggctacgtgt attaacacaa gtg // LOCUS HSERB2R 4473 bp RNA PRI 30-MAR-1995 DEFINITION Human c-erb-B-2 mRNA. ACCESSION X03363 NID g31197 KEYWORDS cell surface glycoprotein; cellular oncogene; erB-2 cellular; glycoprotein; growth factor receptor; kinase; neu cellular oncogene; transmembrane protein; tyrosine kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4473) AUTHORS Yamamoto,T., Ikawa,S., Akiyama,T., Semba,K., Nomura,N., Miyajima,N., Saito,T. and Toyoshima,K. TITLE Similarity of protein encoded by the human c-erb-B-2 gene to epidermal growth factor receptor JOURNAL Nature 319 (6050), 230-234 (1986) MEDLINE 86118663 REFERENCE 2 (bases 1 to 4473) AUTHORS Papewalis,J., Nikitin,A.Yu. and Rajewsky,M.F. TITLE G to A polymorphism at amino acid codon 655 of the human erbB-2/HER2 gene JOURNAL Nucleic Acids Res. 19 (19), 5452 (1991) MEDLINE 92020265 COMMENT The c-erb-B-2 protein shows similarity to the epidermal growth factor receptor. FEATURES Location/Qualifiers source 1..4473 /organism="Homo sapiens" /strain="cell line MKN-7" /db_xref="taxon:9606" sig_peptide 175..237 /note="put. signal peptide (aa -21 to -1)" CDS 175..3942 /note="c-erb-B-2 precursor" /codon_start=1 /db_xref="PID:g31198" /db_xref="SWISS-PROT:P04626" /translation="MELAALCRWGLLLALLPPGAASTQVCTGTDMKLRLPASPETHLD MLRHLYQGCQVVQGNLELTYLPTNASLSFLQDIQEVQGYVLIAHNQVRQVPLQRLRIV RGTQLFEDNYALAVLDNGDPLNNTTPVTGASPGGLRELQLRSLTEILKGGVLIQRNPQ LCYQDTILWKDIFHKNNQLALTLIDTNRSRACHPCSPMCKGSRCWGESSEDCQSLTRT VCAGGCARCKGPLPTDCCHEQCAAGCTGPKHSDCLACLHFNHSGICELHCPALVTYNT DTFESMPNPEGRYTFGASCVTACPYNYLSTDVGSCTLVCPLHNQEVTAEDGTQRCEKC SKPCARVCYGLGMEHLREVRAVTSANIQEFAGCKKIFGSLAFLPESFDGDPASNTAPL QPEQLQVFETLEEITGYLYISAWPDSLPDLSVFQNLQVIRGRILHNGAYSLTLQGLGI SWLGLRSLRELGSGLALIHHNTHLCFVHTVPWDQLFRNPHQALLHTANRPEDECVGEG LACHQLCARGHCWGPGPTQCVNCSQFLRGQECVEECRVLQGLPREYVNARHCLPCHPE CQPQNGSVTCFGPEADQCVACAHYKDPPFCVARCPSGVKPDLSYMPIWKFPDEEGACQ PCPINCTHSCVDLDDKGCPAEQRASPLTSIISAVVGILLVVVLGVVFGILIKRRQQKI RKYTMRRLLQETELVEPLTPSGAMPNQAQMRILKETELRKVKVLGSGAFGTVYKGIWI PDGENVKIPVAIKVLRENTSPKANKEILDEAYVMAGVGSPYVSRLLGICLTSTVQLVT QLMPYGCLLDHVRENRGRLGSQDLLNWCMQIAKGMSYLEDVRLVHRDLAARNVLVKSP NHVKITDFGLARLLDIDETEYHADGGKVPIKWMALESILRRRFTHQSDVWSYGVTVWE LMTFGAKPYDGIPAREIPDLLEKGERLPQPPICTIDVYMIMVKCWMIDSECRPRFREL VSEFSRMARDPQRFVVIQNEDLGPASPLDSTFYRSLLEDDDMGDLVDAEEYLVPQQGF FCPDPAPGAGGMVHHRHRSSSTRSGGGDLTLGLEPSEEEAPRSPLAPSEGAGSDVFDG DLGMGAAKGLQSLPTHDPSPLQRYSEDPTVPLPSETDGYVAPLTCSPQPEYVNQPDVR PQPPSPREGPLPAARPAGATLERPKTLSPGKNGVVKDVFAFGGAVENPEYLTPQGGAA PQPHPPPAFSPAFDNLYYWDQDPPERGAPPSTFKGTPTAENPEYLGLDVPV" mat_peptide 238..3939 /note="put. c-erb-B-2 protein (aa 1-1234)" misc_feature 376..384 /note="pot. glycosylation site" misc_feature 544..558 /note="pot. glycosylation site" misc_feature 733..741 /note="pot. glycosylation site" misc_feature 949..957 /note="pot. glycosylation site" misc_feature 1762..1770 /note="pot. glycosylation site" misc_feature 1885..1893 /note="pot. glycosylation site" misc_feature 2059..2067 /note="pot. glycosylation site" misc_feature 2353..3132 /note="aa 727-986, seq. homologous to EGF receptor kinase domain" misc_feature 2446..2454 /note="pot. glycosylation site" misc_feature 4455..4460 /note="put. polyA signal" polyA_site 4473 /note="polyA site" BASE COUNT 902 a 1383 c 1329 g 859 t ORIGIN 1 aaggggaggt aaccctggcc cctttggtcg gggccccggg cagccgcgcg ccccttccca 61 cggggccctt tactgcgccg cgcgcccggc ccccacccct cgcagcaccc cgcgccccgc 121 gccctcccag ccgggtccag ccggagccat ggggccggag ccgcagtgag caccatggag 181 ctggcggcct tgtgccgctg ggggctcctc ctcgccctct tgccccccgg agccgcgagc 241 acccaagtgt gcaccggcac agacatgaag ctgcggctcc ctgccagtcc cgagacccac 301 ctggacatgc tccgccacct ctaccagggc tgccaggtgg tgcagggaaa cctggaactc 361 acctacctgc ccaccaatgc cagcctgtcc ttcctgcagg atatccagga ggtgcagggc 421 tacgtgctca tcgctcacaa ccaagtgagg caggtcccac tgcagaggct gcggattgtg 481 cgaggcaccc agctctttga ggacaactat gccctggccg tgctagacaa tggagacccg 541 ctgaacaata ccacccctgt cacaggggcc tccccaggag gcctgcggga gctgcagctt 601 cgaagcctca cagagatctt gaaaggaggg gtcttgatcc agcggaaccc ccagctctgc 661 taccaggaca cgattttgtg gaaggacatc ttccacaaga acaaccagct ggctctcaca 721 ctgatagaca ccaaccgctc tcgggcctgc cacccctgtt ctccgatgtg taagggctcc 781 cgctgctggg gagagagttc tgaggattgt cagagcctga cgcgcactgt ctgtgccggt 841 ggctgtgccc gctgcaaggg gccactgccc actgactgct gccatgagca gtgtgctgcc 901 ggctgcacgg gccccaagca ctctgactgc ctggcctgcc tccacttcaa ccacagtggc 961 atctgtgagc tgcactgccc agccctggtc acctacaaca cagacacgtt tgagtccatg 1021 cccaatcccg agggccggta tacattcggc gccagctgtg tgactgcctg tccctacaac 1081 tacctttcta cggacgtggg atcctgcacc ctcgtctgcc ccctgcacaa ccaagaggtg 1141 acagcagagg atggaacaca gcggtgtgag aagtgcagca agccctgtgc ccgagtgtgc 1201 tatggtctgg gcatggagca cttgcgagag gtgagggcag ttaccagtgc caatatccag 1261 gagtttgctg gctgcaagaa gatctttggg agcctggcat ttctgccgga gagctttgat 1321 ggggacccag cctccaacac tgccccgctc cagccagagc agctccaagt gtttgagact 1381 ctggaagaga tcacaggtta cctatacatc tcagcatggc cggacagcct gcctgacctc 1441 agcgtcttcc agaacctgca agtaatccgg ggacgaattc tgcacaatgg cgcctactcg 1501 ctgaccctgc aagggctggg catcagctgg ctggggctgc gctcactgag ggaactgggc 1561 agtggactgg ccctcatcca ccataacacc cacctctgct tcgtgcacac ggtgccctgg 1621 gaccagctct ttcggaaccc gcaccaagct ctgctccaca ctgccaaccg gccagaggac 1681 gagtgtgtgg gcgagggcct ggcctgccac cagctgtgcg cccgagggca ctgctggggt 1741 ccagggccca cccagtgtgt caactgcagc cagttccttc ggggccagga gtgcgtggag 1801 gaatgccgag tactgcaggg gctccccagg gagtatgtga atgccaggca ctgtttgccg 1861 tgccaccctg agtgtcagcc ccagaatggc tcagtgacct gttttggacc ggaggctgac 1921 cagtgtgtgg cctgtgccca ctataaggac cctcccttct gcgtggcccg ctgccccagc 1981 ggtgtgaaac ctgacctctc ctacatgccc atctggaagt ttccagatga ggagggcgca 2041 tgccagcctt gccccatcaa ctgcacccac tcctgtgtgg acctggatga caagggctgc 2101 cccgccgagc agagagccag ccctctgacg tccatcatct ctgcggtggt tggcattctg 2161 ctggtcgtgg tcttgggggt ggtctttggg atcctcatca agcgacggca gcagaagatc 2221 cggaagtaca cgatgcggag actgctgcag gaaacggagc tggtggagcc gctgacacct 2281 agcggagcga tgcccaacca ggcgcagatg cggatcctga aagagacgga gctgaggaag 2341 gtgaaggtgc ttggatctgg cgcttttggc acagtctaca agggcatctg gatccctgat 2401 ggggagaatg tgaaaattcc agtggccatc aaagtgttga gggaaaacac atcccccaaa 2461 gccaacaaag aaatcttaga cgaagcatac gtgatggctg gtgtgggctc cccatatgtc 2521 tcccgccttc tgggcatctg cctgacatcc acggtgcagc tggtgacaca gcttatgccc 2581 tatggctgcc tcttagacca tgtccgggaa aaccgcggac gcctgggctc ccaggacctg 2641 ctgaactggt gtatgcagat tgccaagggg atgagctacc tggaggatgt gcggctcgta 2701 cacagggact tggccgctcg gaacgtgctg gtcaagagtc ccaaccatgt caaaattaca 2761 gacttcgggc tggctcggct gctggacatt gacgagacag agtaccatgc agatgggggc 2821 aaggtgccca tcaagtggat ggcgctggag tccattctcc gccggcggtt cacccaccag 2881 agtgatgtgt ggagttatgg tgtgactgtg tgggagctga tgacttttgg ggccaaacct 2941 tacgatggga tcccagcccg ggagatccct gacctgctgg aaaaggggga gcggctgccc 3001 cagcccccca tctgcaccat tgatgtctac atgatcatgg tcaaatgttg gatgattgac 3061 tctgaatgtc ggccaagatt ccgggagttg gtgtctgaat tctcccgcat ggccagggac 3121 ccccagcgct ttgtggtcat ccagaatgag gacttgggcc cagccagtcc cttggacagc 3181 accttctacc gctcactgct ggaggacgat gacatggggg acctggtgga tgctgaggag 3241 tatctggtac cccagcaggg cttcttctgt ccagaccctg ccccgggcgc tgggggcatg 3301 gtccaccaca ggcaccgcag ctcatctacc aggagtggcg gtggggacct gacactaggg 3361 ctggagccct ctgaagagga ggcccccagg tctccactgg caccctccga aggggctggc 3421 tccgatgtat ttgatggtga cctgggaatg ggggcagcca aggggctgca aagcctcccc 3481 acacatgacc ccagccctct acagcggtac agtgaggacc ccacagtacc cctgccctct 3541 gagactgatg gctacgttgc ccccctgacc tgcagccccc agcctgaata tgtgaaccag 3601 ccagatgttc ggccccagcc cccttcgccc cgagagggcc ctctgcctgc tgcccgacct 3661 gctggtgcca ctctggaaag gcccaagact ctctccccag ggaagaatgg ggtcgtcaaa 3721 gacgtttttg cctttggggg tgccgtggag aaccccgagt acttgacacc ccagggagga 3781 gctgcccctc agccccaccc tcctcctgcc ttcagcccag ccttcgacaa cctctattac 3841 tgggaccagg acccaccaga gcggggggct ccacccagca ccttcaaagg gacacctacg 3901 gcagagaacc cagagtacct gggtctggac gtgccagtgt gaaccagaag gccaagtccg 3961 cagaagccct gatgtgtcct cagggagcag ggaaggcctg acttctgctg gcatcaagag 4021 gtgggagggc cctccgacca cttccagggg aacctgccat gccaggaacc tgtcctaagg 4081 aaccttcctt cctgcttgag ttcccagatg gctggaaggg gtccagcctc gttggaagag 4141 gaacagcact ggggagtctt tgtggattct gaggccctgc ccaatgagac tctagggtcc 4201 agtggatgcc acagcccagc ttggcccttt ccttccagat cctgggtact gaaagcctta 4261 gggaagctgg cctgagaggg gaagcggccc taagggagtg tctaagaaca aaagcgaccc 4321 attcagagac tgtccctgaa acctagtact gccccccatg aggaaggaac agcaatggtg 4381 tcagtatcca ggctttgtac agagtgcttt tctgtttagt ttttactttt tttgttttgt 4441 ttttttaaag atgaaataaa gacccagggg gag // LOCUS HSERB5FR 3350 bp DNA PRI 25-JUN-1997 DEFINITION H.sapiens 5' flanking region for estrogen receptor (breast) gene. ACCESSION X62462 NID g31201 KEYWORDS 5' flanking region; estrogen receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3350) AUTHORS Keaveney,M. TITLE Direct Submission JOURNAL Submitted (01-OCT-1991) M. Keaveney, University College Galway, Microbiology Department, Galway, IRELAND REFERENCE 2 (bases 1 to 3350) AUTHORS Green,S., Walter,P., Kumar,V., Krust,A., Bornert,J.M., Argos,P. and Chambon,P. TITLE Human oestrogen receptor cDNA: sequence, expression and homology to v-erb-A JOURNAL Nature 320 (6058), 134-139 (1986) MEDLINE 86146892 REFERENCE 3 (bases 1 to 3350) AUTHORS Ponglikitmongkol,M., Green,S. and Chambon,P. TITLE Genomic organization of the human oestrogen receptor gene JOURNAL EMBO J. 7 (11), 3385-3388 (1988) MEDLINE 89091079 REFERENCE 4 (bases 1 to 3350) AUTHORS Keaveney,M., Klug,J. and Gannon,F. TITLE Sequence analysis of the 5' flanking region of the human estrogen receptor gene JOURNAL DNA Seq. 2 (6), 347-358 (1992) MEDLINE 93075998 REMARK Erratum:[DNA Seq 1992;3(3):201] FEATURES Location/Qualifiers source 1..3350 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="breast" /chromosome="6" /map="q24-27" /tissue_lib="human breast tumour genomic DNA in lambda EMBL III" CAAT_signal 41..46 TATA_signal 128..136 misc_signal 645..652 /note="1" /function="initiator element" mRNA <815..>859 /gene="14 amino acid ORF" /note="transcript 2" prim_transcript 815..>2952 /gene="ORF" /note="transcript 2" gene 815..859 /gene="14 amino acid ORF" gene 815..2952 /gene="ORF" exon <815..>859 /gene="14 amino acid ORF" /note="transcript 2" /number=1 CDS 815..859 /gene="14 amino acid ORF" /note="transcript 2" /codon_start=1 /db_xref="PID:g31202" /translation="MRAHSFLPSHSLGP" misc_signal 839..848 /gene="14 amino acid ORF" /note="2" /function="initiator element" gene 873..2952 /gene="18 amino acid ORF" exon 873..910 /partial /gene="18 amino acid ORF" /note="transcript 2" /number=1 mRNA join(<873..910,2934..>2952) /gene="18 amino acid ORF" /note="transcript 2" gene 873..2952 /gene="18aa ORF" CDS join(873..910,2934..2952) /gene="18 amino acid ORF" /note="transcript 2" /codon_start=1 /db_xref="PID:g31203" /translation="MEHFWKDVLDPAGWPAGF" intron 911..2933 /gene="18 amino acid ORF" /note="transcript 2" misc_feature 1028..1036 /gene="18 amino acid ORF" /note="octamer motif 1" misc_feature 1075..1083 /gene="18 amino acid ORF" /note="octamer motif 2" misc_feature 1616..1686 /gene="18 amino acid ORF" /note="alternating purine /pyrimidine tract" misc_binding complement(2214..2220) /gene="18 amino acid ORF" /bound_moiety="SP1" misc_binding complement(2550..2556) /gene="18 amino acid ORF" /bound_moiety="SP1" misc_signal 2569..2577 /gene="18 amino acid ORF" /note="3" /function="initiator element" CAAT_signal 2666..2773 /gene="18 amino acid ORF" TATA_signal 2742..2750 /gene="18 amino acid ORF" prim_transcript 2770..>2952 /gene="20 amino acid ORF" /note="transcript 1" mRNA 2770..>2952 /gene="20 amino acid ORF" /note="transcript 1" gene 2770..2952 /gene="20 amino acid ORF" conflict 2812..2814 /gene="20 amino acid ORF" /citation=[2] /replace="cg" conflict 2824..2826 /gene="20 amino acid ORF" /citation=[2] /replace="cg" conflict 2839..2841 /gene="20 amino acid ORF" /citation=[2] /replace="cg" CDS 2890..2952 /gene="20 amino acid ORF" /note="transcript 1" /codon_start=1 /db_xref="PID:g31204" /translation="MRCVASNLGLCSFSRWPAGF" exon 2934..2952 /partial /gene="18 amino acid ORF" /note="transcript 2" /number=2 gene 3004..3350 /gene="estrogen receptor" CDS 3004..>3350 /gene="estrogen receptor" /note="major ORF" /codon_start=1 /db_xref="PID:g31205" /translation="MTMTLHTKASGMALLHQIQGNELEPLNRPQLKIPLERPLGEVYL DSSKPAVYNYPEGAAYEFNAAAAANAQVYGQTGLPYGPGSEAAAFGSNGLGGFPPLNS VSPSPLMLLHPPP" mutation 3033 /gene="estrogen receptor" /citation=[2] /replace="c" BASE COUNT 818 a 839 c 776 g 917 t ORIGIN 1 ggatccatgt gaacgccact gggaaatgag agacctcgtt cccaatcacg gtcagtgcaa 61 ctcgaaagcc taaaatcagt ttaaaacaaa ggtatctacc tttatcttat gttcatatcc 121 taggctttta ataatacgta tttttcacat gtttacagaa agcagtcaac tgagctattc 181 atggaaaggt ttgtgggttt ggttaacgaa gtgaggagta ttacatttca gctggaaaca 241 catccctaga atgccaaaac atttattcca aagtctggtt tcctggtgca atcggaggca 301 tggcaatgcc tctgttcaga gactgggggc tagggccagt aaggcatttg atccacatgt 361 atcccagaag gcttttattg ttaaattata ttctttcgga aaaaccaccc atgtcctatt 421 ttgtaaactt gatatccata cacttttgac tggcattcta ttttagccgt aagactatga 481 ttcacagcaa gcctgttttt cctcttgctt ggggtggcag cagaaagcat agggtacttt 541 ccagcctcca agggtagggg caaaggggct ggggtttctc ctccccagta cagctttctc 601 tggctgtgcc acactgctcc ctgtgagcag acagcaagtc tcccctcact ccccactgcc 661 attcatccag cgctgtgcag tagcccagct gcgtgtctgc cgggaggggc tgccaagtgc 721 cctgcctact ggctgcttcc cgaatccctg ccattccacg cacaaacaca tccacacact 781 ctctctgcct agttcacaca ctgagccatc gcacatgcga gcacattcct tccttccttc 841 tcactctctc ggcccttgac ttctacaagc ccatggaaca tttctggaaa gacgttcttg 901 atccagcagg gtaggcttgt tttgatttct ctctctgtag ctttagcatt ttgagaaagc 961 aacttacctt tctggctagt gtctgtatcc tagcagggag atgaggattg ctgttctcca 1021 tgggggtatg tgtgtgtctc ctttttcttt caggacttgt aggattcttt gtgccatttg 1081 catataattt ggcaggttca cattttttaa gagccctatg aagtgctttt tgcatgtgtt 1141 ttaaaaaggc atttgaaaat tgaaagtgtg atttatggaa attaaatcat ctgtaaaaaa 1201 ttgctttgga aagtaatgat tgctggccat aaagggaaat atctgcgatg cacctaatgt 1261 gtttttaacc ctttatttgc tgacaatcta tagtcattaa tgctaaactc gattttggct 1321 tcagctacat ttgcatattg tccaacaatg gtctattttt gtaagaatta gataaaatgt 1381 atacttgata taaaatagtc aaaaatgtaa ctcttagtaa cagtaagctt ggcatttaga 1441 tagaccatga accacttcgt cagatactct gttgggtgtt tgggatagca attaaaacaa 1501 agtattgata gttgtatcag agtctattag gctgcagcaa aggaagttta ttcaaaagta 1561 taaactatcc aagattatag acgcatgata tacttcacct attttttgtc tccttaatat 1621 gtatatatat atatatatat atatatatat acacatatat gtgtgtgtgt atgtgcgtgt 1681 gcatgtttaa cttttaattc agttaaaaac ttttttctat ttgtttttca tctggatatt 1741 tgattctgca tatcctagcc caagtgaacc gagaagatcg agttgtagga ctaaaggata 1801 gacatgcaga aatgcatttt aaaaatctgt tagctggacc agaccgacaa tgtaacataa 1861 ttgccaaagc tttggttcgt gacctgaggt tatgtttggt atgaaaaggt cacattttat 1921 attcagtttt ctgaagtttt ggttgcataa ccaacctgtg gaaggcatga acacccatgt 1981 gcgccctaac caaaggtttt tctgaatcat ccttcacatg agaattccta atgggaccaa 2041 gtacagtact gtggtccaac ataaacacac aagtcaggct gagagaatct cagaaggttg 2101 tggaagggtc tatctacttt gggagcattt tgcagaggaa gaaactgagg tcctggcagg 2161 ttgcattctc ctgatggcaa aatgcagctc ttcctatatg tataccctga atctccgccc 2221 ccttcccctc agatgccccc tgtcagttcc cccagctgct aaatatagct gtctgtggct 2281 ggctgcgtat gcaaccgcac accccattct atctgcccta tctcggttac agtgtagtcc 2341 tccccagggt catcctatgt acacactacg tatttctagc caacgaggag ggggaatcaa 2401 acagaaagag agacaaacag agatatatcg gagtctggca cggggcacat aaggcagcac 2461 attagagaaa gccggcccct ggatccgtct ttcgcgttta ttttaagccc agtcttccct 2521 gggccacctt tagcagatcc tcgtgcgccc ccgccccctg gccgtgaaac tcagcctcta 2581 tccagcagcg acgacaagta aagtaaagtt cagggaagct gctctttggg atgctcaaat 2641 cgagttgtgc ctggagtgat gtttaagcca atgtcagggc aaggcaacag tccctggccg 2701 tcctccagca cctttgtaat gcatatgagc tcgggagacc agtacttaaa gttggaggcc 2761 cgggagccca ggagctggcg gagggcgttc gtcctgggac tgcacttgct cccgtcgggt 2821 cgcccggctt caccggaccc gcaggctccc ggggcagggc cggggccaga gctcgcgtgt 2881 cggcgggaca tgcgctgcgt cgcctctaac ctcgggctgt gctctttttc caggtggccc 2941 gccggtttct gagccttctg ccctgcgggg acacggtctg caccctgccc gcggccacgg 3001 accatgacca tgaccctcca caccaaagca tctgggatgg ccctactgca tcagatccaa 3061 gggaacgagc tggagcccct gaaccgtccg cagctcaaga tccccctgga gcggcccctg 3121 ggcgaggtgt acctggacag cagcaagccc gccgtgtaca actaccccga gggcgccgcc 3181 tacgagttca acgccgcggc cgccgccaac gcgcaggtct acggtcagac cggcctcccc 3241 tacggccccg ggtctgaggc tgcggcgttc ggctccaacg gcctgggggg tttcccccca 3301 ctcaacagcg tgtctccgag cccgctgatg ctactgcacc cgccgccgca // LOCUS HSERBAR 1698 bp RNA PRI 26-JUN-1995 DEFINITION Human c-erb-A mRNA for thyroid hormone receptor. ACCESSION X04707 NID g31206 KEYWORDS cellular oncogene; erbA cellular oncogene; erbA oncogene; hormone receptor; steroid hormone receptor; thyroid hormone receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1698) AUTHORS Weinberger,C., Thompson,C.C., Ong,E.S., Lebo,R., Gruol,D.J. and Evans,R.M. TITLE The c-erb-A gene encodes a thyroid hormone receptor JOURNAL Nature 324 (6098), 641-646 (1986) MEDLINE 87090375 FEATURES Location/Qualifiers source 1..1698 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="(lambda)gt10" /clone="pheA4 ans pheA12" CDS 301..1671 /codon_start=1 /product="put.thyroid hormone receptor" /db_xref="PID:g31207" /db_xref="SWISS-PROT:P10828" /translation="MTENGLTAWDKPKHCPDREHDWKLVGMSEACLHRKSHSERRSTL KNEQSSPHLIQTTWTSSIFHLDHDDVNDQSVSSAQTFQTEEKKCKGYIPSYLDKDELC VVCGDKATGYHYRCITCEGCKGFFRRTIQKNLHPSYSCKYEGKCVIDKVTRNQCQECR FKKCIYVGMATDLVLDDSKRLAKRKLIEENREKRRREELQKSIGHKPEPTDEEWELIK TVTEAHVATNAQGSHWKQKPKFLPEDIGQAPIVNAPEGGKVDLEAFSHFTKIITPAIT RVVDFAKKLPMFCELPCEDQIILLKGCCMEIMSLRAAVRYDPESETLTLNGEMAVIRG QLKNGGLGVVSDAIFDLGMSLSSFNLDDTEVALLQAVLLMSSDRPGLACVERIEKYQD SFLLAFEHYINYRKHHVTHFWPKLLMKVTDLRMIGACHASRFLHMKVECPTELLPPLF LEVFED" polyA_site 1698 /note="polyadenylation site" BASE COUNT 477 a 413 c 446 g 362 t ORIGIN 1 cggcggggat caactttgca tgaataatgt gagtgcgctt ggaaaagaga cctcctgctc 61 cgcgggctcg gggcaagagc ccgcaggcta ccttccccgg gcaggggcgc tcaacccaac 121 cggctccagg gcactggtaa tttggctaga ggaccgcgcg gaggcagcgg gatctgcgat 181 ttccttctgg ttggctgtcc tgcgtgggtg ccaagttcca cacatgattt aatgaataag 241 aaggagatgt cagtgaaaaa agggatccag aatgattact aacctataac ccccaacagt 301 atgacagaaa atggccttac agcttgggac aaaccgaagc actgtccaga ccgagaacac 361 gactggaagc tagtaggaat gtctgaagcc tgcctacata ggaagagcca ttcagagagg 421 cgcagcacgt tgaaaaatga acagtcgtcg ccacatctca tccagaccac ttggactagc 481 tcaatattcc atctggacca tgatgatgtg aacgaccaga gtgtctcaag tgcccagacc 541 ttccaaacgg aggagaagaa atgtaaaggg tacatcccca gttacttaga caaggacgag 601 ctctgtgtag tgtgtggtga caaagccacc gggtatcact accgctgtat cacgtgtgaa 661 ggctgcaagg gtttctttag aagaaccatt cagaaaaatc tccatccatc ctattcctgt 721 aaatatgaag gaaaatgtgt catagacaaa gtcacgcgaa atcagtgcca ggaatgtcgc 781 tttaagaaat gcatctatgt tggcatggca acagatttgg tgctggatga cagcaagagg 841 ctggccaaga ggaagctgat agaggagaac cgggagaaaa gacggcggga agagctgcag 901 aagtccatcg ggcacaagcc agagcccaca gacgaggaat gggagctcat caaaactgtc 961 accgaagccc atgtggcgac caacgcccaa ggcagccact ggaagcaaaa accgaaattt 1021 ctgccagaag acattggaca agcaccaata gtcaatgccc cagaaggtgg aaaggttgac 1081 ttggaagcct tcagccattt tacaaaaatc atcacaccag caattaccag agtggtggat 1141 tttgccaaaa agttgcctat gttttgtgag ctgccatgtg aagaccagat catcctcctc 1201 aaaggctgct gcatggagat catgtccctt cgcgctgctg tgcgctatga cccggaaagt 1261 gagactttaa ccttgaatgg ggaaatggca gtgatacggg gccagctgaa aaatgggggt 1321 cttggggtgg tgtcagacgc catctttgac ctaggcatgt ctctgtcttc tttcaacctg 1381 gatgacactg aagtagccct ccttcaggcc gtcctgctga tgtcttcaga tcgcccgggg 1441 cttgcctgtg ttgagagaat agaaaagtac caagatagtt tcctgctggc ctttgaacac 1501 tatatcaatt accgaaaaca ccacgtgaca cacttttggc caaaactcct gatgaaggtg 1561 acagatctgc ggatgatagg agcctgccat gccagccgct tcctgcacat gaaggtggaa 1621 tgccccacag aactcctccc ccctttgttc ctggaagtgt tcgaggatta gactgactgg 1681 attccttcct ataattcc // LOCUS HSERC55R 1700 bp RNA PRI 27-SEP-1994 DEFINITION H.sapiens ERC-55 mRNA. ACCESSION X78669 NID g469884 KEYWORDS calcium binding; ERC-55 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1700) AUTHORS Weis,K., Griffiths,G. and Lamond,A.I. TITLE The endoplasmic reticulum calcium-binding protein of 55 kDa is a novel EF-hand protein retained in the endoplasmic reticulum by a carboxyl-terminal His-Asp-Glu-Leu motif JOURNAL J. Biol. Chem. 269 (29), 19142-19150 (1994) MEDLINE 94308182 REFERENCE 2 (bases 1 to 1700) AUTHORS Weis,K. TITLE Direct Submission JOURNAL Submitted (07-APR-1994) K. Weis, EMBL, Meyerhofstr 1, 69117 Heidelberg, FRG FEATURES Location/Qualifiers source 1..1700 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Hela" /clone_lib="pBluescript" gene 67..1020 /gene="ERC-55" CDS 67..1020 /gene="ERC-55" /function="calcium binding protein" /codon_start=1 /product="EF-hand protein" /db_xref="PID:g469885" /translation="MRLGPRTAALGLLLLCAAAAGAGKAEELHYPLGERRSDYDREAL LGVQEDVDEYVKLGHEEQQKRLQAIIKKIDLDSDGFLTESELSSWIQMSFKHYAMQEA KQQFVEYDKNSDDTVTWDEYNIQMYDRVIDFDENTALDDAEEESFRKLHLKDKKRFEK ANQDSGPGLSLEEFIAFEHPEEVDYMTEFVIQEALEEHDKNGDGFVSLEEFLGDYRWD PTANEDPEWILVEKDRFVNDYDKDNDGRLDPQELLPWVVPNNQGIAQEEALHLIDEMD LNGDKKLSEEEILENPDLFLTSEATDYGRQLHDDYFYHDEL" polyA_signal 1675..1680 BASE COUNT 506 a 295 c 387 g 512 t ORIGIN 1 gcagccgccc gggcccccgc cagcctccct cctcgcgtcc ctcggtgtcc tccgcgggcc 61 ggcgcgatgc ggctgggccc gaggaccgcg gcgttggggc tgctgctgct gtgcgccgcc 121 gcggccggcg ccggcaaggc cgaggagctg cactacccgc tgggcgagcg ccgcagcgac 181 tacgaccgcg aggcgctgct gggcgtccag gaagatgtgg atgaatatgt taaactcggc 241 cacgaagagc agcaaaaaag actgcaggcg atcataaaga aaatcgactt ggactcagat 301 ggctttctca ctgaaagtga actcagttca tggattcaga tgtcttttaa gcattatgct 361 atgcaagaag caaaacaaca gtttgttgaa tatgataaaa acagtgatga tactgtgact 421 tgggatgaat ataacattca gatgtatgat cgtgtgattg actttgatga gaacactgct 481 ctggatgatg cagaagagga gtcctttagg aagcttcact taaaggacaa gaagcgattt 541 gaaaaagcta accaggattc aggtcccggt ttgagtcttg aagaatttat tgcttttgag 601 catcctgaag aagttgatta tatgacggaa tttgtcattc aagaagcttt agaagaacat 661 gacaaaaatg gtgatggatt tgttagtttg gaagaatttc ttggtgatta caggtgggat 721 ccaactgcaa atgaagatcc agaatggata cttgttgaga aagacagatt cgtgaatgat 781 tatgacaaag ataacgatgg caggcttgat ccccaagagc tgttaccttg ggtagtacct 841 aataatcagg gcattgcaca agaggaggcg cttcatctaa ttgatgaaat ggatttgaat 901 ggtgacaaaa agctctctga agaagagatt ctggaaaacc cggacttgtt tctcaccagt 961 gaagccacag attatggcag acagctccat gatgactatt tctatcatga tgagctttaa 1021 tctccgagcc tgtctcagta gagtactggc tccttttata atttgttacc agctttactt 1081 ttgtgataaa atattgatgt tgtattttac actcttaagt cttaaccaca gtcagaatta 1141 tcttaatgta gaattataat tttggctctt ttaggaaaaa acaaaatctg atatttttcc 1201 aaacgtattg agcaacaaaa tattaatatt gtgccatatg acaacaaagt ctttcctaaa 1261 tactccatct gtttagtact gtattgtgga atatttgagt tctatttcca gacttgaaaa 1321 catggaggat tttagagatg cctgaacaat attatttaag tagtatgtga ccgagctata 1381 aattttttgt ttttgttcta agtagattta atttgggaac tgacaggaca atgtttttag 1441 gtttagcatt ttgtttaaaa acctttaaag aaacctttag aaggacttag acctcacata 1501 ttaatgttga gaagttctgc ttaattttaa aatggtttct ataaagggtt ttattgtatg 1561 aaatagaact ttatattttt gcatatgtat agaggataat tatatttaat gtataactat 1621 agcattatgg tgagtggaat ttgacattgt ccaaaccttt ttcatttttg agtgattaaa 1681 aatgaaatgt cctttgtaaa // LOCUS HSERD22 688 bp RNA PRI 12-JUL-1996 DEFINITION H.sapiens ERD2.2 mRNA for KDEL receptor. ACCESSION X63745 NID g31217 KEYWORDS ERD2.2 gene; KDEL receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 688) AUTHORS Lewis,M.J. and Pelham,H.R. TITLE Sequence of a second human KDEL receptor JOURNAL J. Mol. Biol. 226 (4), 913-916 (1992) MEDLINE 92389337 REFERENCE 2 (bases 1 to 688) AUTHORS Lewis,M.J. TITLE Direct Submission JOURNAL Submitted (24-FEB-1992) M.J. Lewis, MRC, Laboratory of Mol.Biology, Hills Road, Cambridge CB2 2QH, UK FEATURES Location/Qualifiers source 1..688 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="placental cDNA, RACE from Raji cells" gene 13..651 /gene="ERD2.2" CDS 13..651 /gene="ERD2.2" /codon_start=1 /product="KDEL receptor" /db_xref="PID:g31218" /db_xref="SWISS-PROT:P33947" /translation="MNIFRLTGDLSHLAAIVILLLKIWKTRSCAGISGKSQLLFALVF TTRYLDLFTSFISLYNTSMKVIYLACSYATVYLIYLKFKATYDGNHDTFRVEFLVVPV GGLSFLVNHDFSPLEILWTFSIYLESVAILPQLFMISKTGEAETITTHYLFFLGLYRA LYLVNWIWRFYFEGFFDLIAVVAGVVQTILYCDFFYLYITKVLKGKKLSLPA" BASE COUNT 139 a 193 c 149 g 207 t ORIGIN 1 gccgccgccg ccatgaacat tttccggctg actggggacc tgtcccacct ggcggccatc 61 gtcatcctgc tgctgaagat ctggaagacc cgctcctgcg ccggtatttc tgggaaaagc 121 cagcttctgt ttgcactggt cttcacaact cgttacctgg atctttttac ttcatttatt 181 tcattgtata acacatctat gaaggttatc taccttgcct gctcctatgc cacagtgtac 241 ctgatctacc tgaaatttaa ggcaacctac gatggaaatc atgatacctt ccgagtggag 301 tttctggtgg tccctgtggg aggcctctca tttttagtta atcacgattt ctctcctctt 361 gagatcctct ggaccttctc catctacctg gagtccgtgg ctatccttcc gcagctgttt 421 atgatcagca agactgggga ggccgagacc atcaccaccc actacctgtt cttcctgggc 481 ctctatcgtg ctttgtatct tgtcaactgg atctggcgct tctactttga gggcttcttt 541 gacctcattg ctgtggtggc cggcgtagtc cagaccatcc tatactgtga cttcttctac 601 ttgtacatta caaaagtact caagggaaag aagctcagtt tgccagcata agtgccaaag 661 accatcacca gcatctgtcc tttcaggg // LOCUS HSERF2 1629 bp RNA PRI 03-MAY-1995 DEFINITION H.sapiens ERF-2 mRNA. ACCESSION X78992 NID g509777 KEYWORDS ERF-2 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1629) AUTHORS Nie,X.F., Maclean,K.N., Kumar,V., McKay,I.A. and Bustin,S.A. TITLE ERF-2, the human homologue of the murine Tis11d early response gene JOURNAL Gene 152 (2), 285-286 (1995) MEDLINE 95137407 REFERENCE 2 (bases 1 to 1629) AUTHORS Bustin,S.A. TITLE Direct Submission JOURNAL Submitted (25-APR-1994) S.A. Bustin, London Hospital Medical College, Surgical Unit, 4th Floor Alexandra Wing, Turner Street, London E1 1BB, UK FEATURES Location/Qualifiers source 1..1629 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="human epidermal cylindroma" /clone_lib="lambda gt11" gene 67..1545 /gene="ERF-2" CDS 67..1545 /gene="ERF-2" /codon_start=1 /db_xref="PID:g509778" /translation="MSTTLLSAFYDVDFLCKTEKSLANLNLNNMLDKKAVGTPVAAAP SSGFAPGFLRRHSASNLHALAHPAPSPGSCSPKFPGAANGSSCGSAAAGGPTSYGTLK EPSGGGGTALLNKENKFRDRSFSENGDRSQHLLHLQQQQKGGGGSQINSTRYKTELCR PFEESGTCKYGEKCQFAHGFHELRSLTRHPKYKTELCRTFHTIGFCPYGPRCHFIHNA DERRPAPSGGASGDLRAFGTRDALHLGFPREPRPKLHHSLSFSGFPSGHHQPPGGLES PLLLDSPTSRTPPPPSCSSASSCSSSASSCSSASAASTPSGTPTCCASAAAALRLLYG TGGAEDLLAPGAPCAACSSASCANNAFAFGPELSSLITPLAIQTHNFAAVAAAAYYRS QQQQQQQGLAPPAQPPAPPSATLPAGAAAPPSPPFSFQLPRRLSDSPVFDAPPSPPDS LSDRDSYLSGSLSSGSLSGSESPSLDPGRRLPIFSRLSISDD" BASE COUNT 237 a 669 c 487 g 236 t ORIGIN 1 gggccgcccc aagggctcct cccgacctcc cggcctgccg ctccggccac tgcgggatcc 61 agaaacatgt cgaccacact tctgtccgcc ttctacgatg tcgacttctt gtgcaagaca 121 gagaaatccc tggccaacct caacctgaac aacatgctgg acaagaaggc ggtggggacg 181 cctgtggccg ccgcccccag ctcgggcttc gcgccgggat tcctccgacg gcactcggcc 241 agcaacctgc atgcactcgc ccaccccgcg cccagccccg gcagctgctc gcccaagttc 301 ccgggcgccg ctaacggcag cagctgcggc agcgcggcgg ccggcggtcc gacctcctac 361 ggcaccctta aggagccgtc ggggggcggc ggcacagccc tgctcaacaa ggagaacaaa 421 ttccgggacc gctcgtttag cgagaacggc gatcgcagcc agcacctcct gcacctgcag 481 cagcagcaga aggggggcgg cggctcccag atcaactcca cgcgctacaa gaccgagctg 541 tgccggccct tcgaggagag cggcacgtgc aagtacggcg aaaagtgcca gttcgcgcat 601 ggcttccacg agctgcgcag cctgactcgc catccgaagt acaagaccga gctgtgccgc 661 acctttcata ccatcggctt ctgcccctat gggccgcgct gccacttcat ccacaacgcg 721 gacgagcggc ggcccgcgcc gtcggggggc gcctccgggg acctgcgtgc ctttggcacg 781 cgcgatgcgt tgcacctggg cttcccgcgg gagccgcggc ccaagttgca ccacagcctc 841 agcttctcgg gcttcccgtc gggccaccat cagcccccgg gcggcctcga gtcgccgctg 901 ctgctcgaca gccccacgtc gcgcacgccg ccgccgccct cctgctcttc ggcctcgtcc 961 tgctcctcct ccgcctcctc ctgttcctcg gcctccgcgg cctccacgcc ctcggggacc 1021 ccgacatgct gcgcctccgc ggcggccgcg ctgcgtctgc tgtacggcac cgggggcgcc 1081 gaggacctgc tggcgccggg ggccccgtgc gcggcctgct cgtcggcctc gtgcgccaac 1141 aacgccttcg ccttcggtcc ggagctcagc agcctcatca cgccgctcgc catccagacc 1201 cacaactttg ccgccgtggc cgccgccgcc tactaccgca gtcagcagca gcagcagcag 1261 cagggcctgg cgccccccgc gcagccgccg gcgccgccca gcgcgaccct ccccgccggg 1321 gccgccgcac ctccctcgcc gcccttcagc ttccagctgc cgcgccgcct gtccgactcg 1381 cccgtgttcg acgcgccccc cagccccccg gactcgctgt cggaccgcga cagctaccta 1441 agcggctccc tgagctccgg cagcctcagc ggctctgagt ctcccagcct cgaccctggc 1501 cgccgcctgc caatcttcag ccgcctctcc atctccgacg actgaggcaa gagggcgcca 1561 gtgaggagga agggaaggcg gttcagagat gttggaggac acccctcgcc atctcgccct 1621 tgctggggg // LOCUS HSERGICA 2768 bp RNA PRI 01-AUG-1994 DEFINITION H.sapiens ERGIC-53 mRNA. ACCESSION X71661 NID g433937 KEYWORDS ER-golgi intermediate compartment; membrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2768) AUTHORS Schindler,R., Itin,C., Zerial,M., Lottspeich,F. and Hauri,H.P. TITLE ERGIC-53, a membrane protein of the ER-Golgi intermediate compartment, carries an ER retention motif JOURNAL Eur. J. Cell Biol. 61 (1), 1-9 (1993) MEDLINE 94039195 REFERENCE 2 (bases 1 to 2768) AUTHORS Hauri,H.P. TITLE Direct Submission JOURNAL Submitted (22-MAR-1993) H.P. Hauri, Biocenter, University of Basel, Klingelbergstr. 70, 4056 Basel, SWITZERLAND REFERENCE 3 (bases 1 to 2768) AUTHORS Fiedler,K. and Simons,K. TITLE A putative novel class of animal lectins in the secretory pathway homologous to leguminous lectins JOURNAL Cell 77 (5), 625-626 (1994) MEDLINE 94265253 FEATURES Location/Qualifiers source 1..2768 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver, placenta" sig_peptide 22..112 /gene="ERGIC53" gene 22..1554 /gene="ERGIC53" CDS 22..1554 /gene="ERGIC53" /function="ER-golgi intermediate compartment protein" /codon_start=1 /db_xref="PID:g433938" /db_xref="SWISS-PROT:P49257" /translation="MAGSRQRGLRARVRPLFCALLLSLGRFVRGDGVGGDPAVALPHR RFEYKYSFKGPHLVQSDGTVPFWAHAGNAIPSSDQIRVAPSLKSQRGSVWTKTKAAFE NWEVEVTFRVTGRGRIGADGLAIWYAENQGLEGPVFGSADLWNGVGIFFDTFDNDGKK NNPAIVIIGNNGQIHYDHQNDGASQALASCQRDFRNKPYPVRAKITYYQNTLTVMINN GFTPDKNDYEFCAKVENMIIPAQGHFGISAATGGLADDHDVLSFLTFQLTEPGKEPPT PDKEISEKEKEKYQEEFEHFQQELDKKKEEFQKGHPDLQGQPAEEIFESVGDRELRQV FEGQNRIHLEIKQLNRQLDMILDEQRRYVSSLTEEISKRGAGMPGQHGQITQQELDTV VKTQHEILRQVNEMKNSMSETVRLVSGMQHPGSAGGVYETTQHFIDIKEHLHIVKRDI DNLVQRNMPSNEKPKCPELPPFPSCLSTVHFIIFVVVQTVLFIGYIMYRSQQEAAAKK FF" BASE COUNT 853 a 515 c 595 g 805 t ORIGIN 1 ggtcgcgttc cagaatccaa gatggcggga tccaggcaaa ggggtctccg ggccagagtt 61 cggccgctgt tctgcgcctt gctgctgtca ctcggtcgct tcgtccgggg cgacggcgtg 121 ggaggagacc ccgcggtcgc gttgccacat cgccgtttcg agtacaaata cagcttcaag 181 gggccgcacc tggtgcagag cgacgggacc gtgcccttct gggcccacgc ggggaatgct 241 attccaagtt cagatcaaat tcgagtagca ccatctttaa aaagccaaag aggctcagtg 301 tggacaaaga caaaagcggc ctttgagaac tgggaagttg aggtgacatt tcgagtgact 361 ggaagaggtc gaattggagc tgatggccta gcaatttggt atgcagaaaa tcaaggcttg 421 gagggccctg tgtttggatc agctgatctg tggaatggtg ttggaatatt ttttgatact 481 tttgacaatg atggaaagaa aaataatcct gctatagtaa ttataggcaa caatggacaa 541 atccattatg accatcaaaa tgacggggct agtcaagctt tggcaagttg ccagagggac 601 ttccgcaaca aaccctatcc tgtccgagca aagattacct attaccagaa cacactgaca 661 gtaatgatca ataatggctt tacaccagat aaaaatgatt atgaattttg tgccaaagtg 721 gaaaatatga ttatccctgc acaagggcat tttggaatat ctgctgcaac tggaggtctt 781 gcagatgacc atgatgtcct ttcttttctg actttccagt tgactgaacc tggaaaagag 841 ccgcccacac cagataaaga aatttcggaa aaggaaaaag aaaagtatca ggaggaattt 901 gagcactttc aacaagaatt ggataaaaaa aaagaggaat tccagaaggg ccaccccgac 961 ctccaagggc agcctgcgga ggaaatattt gagagtgtag gagatcgaga gctaagacaa 1021 gtctttgaag gacagaatcg tattcatctt gaaatcaagc agctgaaccg gcagttagat 1081 atgattcttg atgaacagag aagatatgtc tcttccttaa cagaggaaat ctctaaaaga 1141 ggagcaggaa tgcctgggca gcatgggcag attactcaac aagaactgga tactgttgtg 1201 aaaactcagc atgagattct gagacaagta aatgaaatga aaaattccat gagtgaaacc 1261 gtcagactgg tcagtggaat gcagcaccct ggctctgctg gaggcgtcta tgagacaaca 1321 cagcacttca ttgacatcaa agagcacctg cacatagtaa agagggacat agataactta 1381 gtgcagcgaa atatgccatc aaatgaaaag ccgaaatgcc cagaactacc accatttcca 1441 tcatgtttgt ctacggtcca cttcattata tttgttgtgg tgcaaactgt attattcatt 1501 ggttatatca tgtataggtc tcagcaagaa gcagctgcca aaaaattctt ttgactacca 1561 ttttcctgtg tacttcatct atttgtgtac aaaatgagtc gttttgaggg aatttaagta 1621 tttaaattgc ttcatagtct aaattattaa ttttcttaat aaaataactg tttaaacatt 1681 gatttgcagt taagaataaa ccttaaagca aagacaacca cattttaatt tgttcacagt 1741 atgtaaatct gtctaaattt cagtgaattt ctggtcagta tgatgcagcc tctgagcaga 1801 atattgacca gtaagagggt aaataaagtg ggggcaaccc tggatatgaa tgttaccccc 1861 taagtctcca atattgcagg tttccctgta taacgtaaac acacttgccc tcatgcctcc 1921 cagaatatga ggtctaatta agaagtccat caggtttatt ttgtaaccaa agtctttttt 1981 agaggtcaga cttcctaatc aaaggcctgg gcctgcagtc cctttcatct taatgcaact 2041 tcctttgaaa tcaaagaata ttttgtctga gagctttaag gatctggtaa tagacttcaa 2101 aatgttaagt gaaatttttt tttcctctat ttatcaatga tatatttcac ttttaaagga 2161 aattttagag gaaaattaat agctgctttt tgcactaaaa aaccttgtgg gtggaaatat 2221 tcctctgaga atggctttta taggtatttt gcctggtaat gtattcattc atgattgccc 2281 atattcttga atgtttcttc attccaatgg ggtcaggtca atattatgaa aataattttt 2341 atatttatat ttgtaactaa gaatttattt ctccctttac tacacgatgt aaattcacgt 2401 caaattcgat gatctgagga tttaaattca caaaacctgc cactacattc tggtttacat 2461 tagttacttc atgctggctg gggttagtga ccatttgcat actcttttaa atcaaggagg 2521 ctgtagtaga ggcagtttta agattcttga aggcaaaatt tgaaaaacag tgaatacttc 2581 taattgtttc cttttagtgc cagaactaag acattgtgaa gcacttgtta gtaaacttaa 2641 ccttgaaatg tcagactgga aggagttttt atgtctttgt gcatacttct gggtattaca 2701 gaaacagtct gtaaataaca ttttaagatg caaatttaat tctgttcaca gctgatttat 2761 actgattt // LOCUS HSERK1 1866 bp RNA PRI 06-SEP-1993 DEFINITION Human ERK1 mRNA for protein serine/threonine kinase. ACCESSION X60188 NID g31220 KEYWORDS erk1 gene; protein-serine/threonine kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1866) AUTHORS Pelech,S.L. TITLE Direct Submission JOURNAL Submitted (23-JUN-1991) S.L. Pelech, Biomedical Res Centre, 2222 Health Science Hall, Univ of British Columbia, Vancouver B C V6T 1Z3, CANADA REFERENCE 2 (bases 1 to 1866) AUTHORS Charest,D.L., Mordret,G., Harder,K.W., Jirik,F. and Pelech,S.L. TITLE Molecular cloning, expression, and characterization of the human mitogen-activated protein kinase p44erk1 JOURNAL Mol. Cell. Biol. 13 (8), 4679-4690 (1993) MEDLINE 93330262 FEATURES Location/Qualifiers source 1..1866 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver tumor" /cell_type="hepatoblastoma" /cell_line="HepG2" /clone_lib="HepG2" /clone="p26a-Beta-3" /chromosome="16" mRNA 1..1866 /gene="ERK1" /evidence=experimental gene 1..1866 /gene="ERK1" CDS 73..1212 /gene="ERK1" /codon_start=1 /product="protein serine/threonine kinase" /db_xref="PID:g31221" /db_xref="SWISS-PROT:P27361" /translation="MAAAAAQGGGGGEPRRTEGVGPGVPGEVEMVKGQPFDVGPRYTQ LQYIGEGAYGMVSSAYDHVRKTRVAIKKISPFEHQTYCQRTLREIQILLRFRHENVIG IRDILRASTLEAMRDVYIVQDLMETDLYKLLKSQQLSNDHICYFLYQILRGLKYIHSA NVLHRDLKPSNLLSNTTCDLKICDFGLARIADPEHDHTGFLTEYVATRWYRAPEIMLN SKGYTKSIDIWSVGCILAEMLSNRPIFPGKHYLDQLNHILGILGSPSQEDLNCIINMK ARNYLQSLPSKTKVAWAKLFPKSDSKALDLLDRMLTFNPNKRITVEEALAHPYLEQYY DPTDEPVAEEPFTFAMELDDLPKERLKELIFQETARFQPGVLEAP" BASE COUNT 380 a 605 c 535 g 346 t ORIGIN 1 cgttcctcgg cgccgccggg gccccagagg gcagcggcag caacagcagc agcagcagca 61 gcgggagtgg agatggcggc ggcggcggct caggggggcg ggggcgggga gccccgtaga 121 accgaggggg tcggcccggg ggtcccgggg gaggtggaga tggtgaaggg gcagccgttc 181 gacgtgggcc cgcgctacac gcagttgcag tacatcggcg agggcgcgta cggcatggtc 241 agctcggcct atgaccacgt gcgcaagact cgcgtggcca tcaagaagat cagccccttc 301 gaacatcaga cctactgcca gcgcacgctc cgggagatcc agatcctgct gcgcttccgc 361 catgagaatg tcatcggcat ccgagacatt ctgcgggcgt ccaccctgga agccatgaga 421 gatgtctaca ttgtgcagga cctgatggag actgacctgt acaagttgct gaaaagccag 481 cagctgagca atgaccatat ctgctacttc ctctaccaga tcctgcgggg cctcaagtac 541 atccactccg ccaacgtgct ccaccgagat ctaaagccct ccaacctgct cagcaacacc 601 acctgcgacc ttaagatttg tgatttcggc ctggcccgga ttgccgatcc tgagcatgac 661 cacaccggct tcctgacgga gtatgtggct acgcgctggt accgggcccc agagatcatg 721 ctgaactcca agggctatac caagtccatc gacatctggt ctgtgggctg cattctggct 781 gagatgctct ctaaccggcc catcttccct ggcaagcact acctggatca gctcaaccac 841 attctgggca tcctgggctc cccatcccag gaggacctga attgtatcat caacatgaag 901 gcccgaaact acctacagtc tctgccctcc aagaccaagg tggcttgggc caagcttttc 961 cccaagtcag actccaaagc ccttgacctg ctggaccgga tgttaacctt taaccccaat 1021 aaacggatca cagtggagga agcgctggct cacccctacc tggagcagta ctatgacccg 1081 acggatgagc cagtggccga ggagcccttc accttcgcca tggagctgga tgacctacct 1141 aaggagcggc tgaaggagct catcttccag gagacagcac gcttccagcc cggagtgctg 1201 gaggccccct agcccagaca gacatctctg caccctgggg cctggacctg cctcctgcct 1261 gcccctctcc cgccagactg ttagaaaatg gacactgtgc ccagcccgga ccttggcagc 1321 ccaggccggg gtggagcatg ggcctggcca cctctctcct ttgctgaggc ctccagcttc 1381 aggcaggcca aggccttctc ctccccaccc gccctcccca cggggcctcg ggagctcagg 1441 tggccccagt tcaatctccc gctgctgctg ctgctgcgcc cttaccttcc ccagcgtccc 1501 agtctctggc agttctggaa tggaagggtt ctggctgccc caacctgctg aagggcagag 1561 gtggagggtg gggggcgctg agtagggact cagggccatg cctgcccccc tcatctcatt 1621 caaaccccac cctagtttcc ctgaaggaac attccttagt ctcaagggct agcatccctg 1681 aggagccagg ccgggccgaa tcccctccct gtcaaagctg tcacttcgcg tgccctcgct 1741 gcttctgtgt gtggtgagca gaagtggagc tggggggcgt ggagagcccg gcgcccctgc 1801 cacctccctg acccgtctaa tatataaata tagagatgtg tctatggctg aaaaaaaaaa 1861 aaaaaa // LOCUS HSERK3 3920 bp RNA PRI 07-APR-1995 DEFINITION H.sapiens ERK3 mRNA. ACCESSION X80692 NID g763112 KEYWORDS erk3 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3920) AUTHORS Zhu,A.X., Zhao,Y., Moller,D.E. and Flier,J.S. TITLE Cloning and characterization of p97MAPK, a novel human homolog of rat ERK-3 JOURNAL Mol. Cell. Biol. 14 (12), 8202-8211 (1994) MEDLINE 95059049 REFERENCE 2 (bases 1 to 3920) AUTHORS Zhu,A.X. TITLE Direct Submission JOURNAL Submitted (28-JUL-1994) A.X. Zhu, Beth Israel Hospital, Harvard Medical School, Div. of Endocrinology & Metabolism, 330 Brookline Ave., Boston MA 02215, USA FEATURES Location/Qualifiers source 1..3920 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="muscle" /cell_line="human fetal muscle" gene 479..2644 /gene="ERK3" CDS 479..2644 /gene="ERK3" /codon_start=1 /product="p97mapk" /db_xref="PID:g763113" /translation="MAEKFESLMNIHGFDLGSRYMDLKPLGCGGNGLVFSAVDNDCDK RVAIKKIVLTDPQSVKHALREIKIIRRLDHDNIVKVFEILGPSGSQLTDDVGSLTELN SVYIVQEYMETDLANVLEQGPLLEEHARLFMYQLLRGLKYIHSANVLHRDLKPANLFI NTEDLVLKIGDFGLARIMDPHYSHKGHLSEGLVTKWYRSPRLLLSPNNYTKAIDMWAA GCIFAEMLTGKTLFAGAHELEQMQLILESIPVVHEEDRQELLSVIPVYIRNDMTEPHK PLTQLLPGISREALDFLEQILTFSPMDRLTAEEALSHPYMSIYSFPMDEPISSHPFHI EDEVDDILLMDETHSHIYNWERYHDCQFSEHDWPVHNNFDIDEVQLDPRALSDVTDEE EVQVDPRKYLDGDREKYLEDPAFDTNYSTEPCWQYSDHHENKYCDLECSHTCNYKTRS SSYLDNLVWRESEVNHYYEPKLIIDLSNWKEQSKEKSDKKGKSKCERNGLVKAQIALE EASQQLAGKEREKNQGFDFDSFIAGTIQLSSQHEPTDVVDKLNDLNSSVSQLELKSLI SKSVSQEKQEKGMANLAQLEALYQSSWDSQFVSGGEDCFFINQFCEVRKDEQVEKENT YTSYLDKFFSRKEDTEMLETEPVEDGKLGERGHEEGFLNNSGEFLFNKQLESIGIPQF HSPVGSPLKSIQATLTPSAMKSSPQIPHQTYSSILKHLN" BASE COUNT 1230 a 675 c 763 g 1252 t ORIGIN 1 caacggaaat agtgcttacc agcaccttag aatgatgctg ctcaggacca gtccaacact 61 gaatgtatct gcactgtgag gagaatgttc atagaagcct gttgtgtgca tatttattca 121 ctttttgtta aatgttaaat cgtttagcac ggtaatctga gtgcacagta tgtcatttca 181 ttccgttgga gtttcttgtt ttcgttaaat gtctgcagag ttgctgcccc tttcttgaac 241 tatgagtact gcaatctttt taattctcaa tatgaataga gctttttgag ctttaaatct 301 aaggggaact cgacaggcct gtttggcata tgcaatgaac atcaagaaac catcttgctg 361 tggaagcata attatttttc ttctcccttt ttgaaagatc tttccttttg atgccagttt 421 tcttccttgt ttacacaagt tcaatttgaa aggaaaaggc aatagtaagg gtttcaaaat 481 ggcagagaaa tttgaaagtc tcatgaacat tcatggtttt gatctgggtt ctaggtatat 541 ggacttaaaa ccattgggtt gtggaggcaa tggcttggtt ttttctgctg tagacaatga 601 ctgtgacaaa agagtagcca tcaagaaaat tgtccttact gatccccaga gtgtcaaaca 661 tgctctacgt gaaatcaaaa ttattagaag acttgaccat gataacattg tgaaagtgtt 721 tgagattctt ggtcccagtg gaagccaatt aacagacgat gtgggctctc ttacggaact 781 gaacagtgtt tacattgttc aggagtacat ggagacagac ttggctaatg tgctggagca 841 gggcccttta ctggaagagc atgccaggct tttcatgtat cagctgctac gggggctcaa 901 gtatattcac tctgcaaatg tactgcacag agatctcaaa ccagctaatc ttttcattaa 961 tacggaagac ttggtgctga agataggtga ctttggtctt gcacggatca tggatcctca 1021 ttattcccat aagggtcatc tttctgaagg attggttact aaatggtaca gatctccacg 1081 tcttttactt tctcctaata attatactaa agccattgac atgtgggctg caggctgcat 1141 ctttgctgaa atgctgactg gtaaaaccct ttttgcaggt gcacatgaac ttgaacagat 1201 gcagctgatt ttagaatcta ttcctgttgt acatgaggaa gatcgtcagg agcttctcag 1261 cgtaattcca gtttacatta gaaatgacat gactgagcca cacaaacctt taactcagct 1321 gcttccagga attagtcgag aagcactgga tttcctggaa caaattttga catttagccc 1381 catggatcgg ttaacagcag aagaagcact ctcccatcct tacatgagca tatattcttt 1441 tccaatggat gagccaattt caagccatcc ttttcatatt gaagatgaag ttgatgatat 1501 tttgcttatg gatgaaactc acagtcacat ttataactgg gaaaggtatc atgattgtca 1561 gttttcagag catgattggc ctgtacataa caactttgat attgatgaag ttcagcttga 1621 tccaagagct ctgtccgatg tcactgatga agaagaagta caagttgatc cccgaaaata 1681 tttggatgga gatcgggaaa agtatctgga ggatcctgct tttgatacca attactctac 1741 tgagccttgt tggcaatact cagatcatca tgaaaacaaa tattgtgatc tggagtgtag 1801 ccatacttgt aactacaaaa cgaggtcatc atcatattta gataacttag tttggagaga 1861 gagtgaagtt aaccattact atgaacccaa gcttattata gatctttcca attggaaaga 1921 acaaagcaaa gaaaaatctg ataagaaagg caaatcaaaa tgtgaaagga atggattggt 1981 taaagcccag atagcgctag aggaagcatc acagcaactg gctggaaaag aaagggaaaa 2041 gaatcaggga tttgattttg attcctttat tgcaggaact attcagctta gttcccagca 2101 tgagcctact gatgttgttg ataaattaaa tgacttgaat agctcagtgt cccaactaga 2161 attgaaaagt ttgatatcaa agtcagtaag ccaagaaaaa caggaaaaag gaatggcaaa 2221 tctggctcaa ttagaagcct tgtaccagtc ttcttgggac agccagtttg tgagtggtgg 2281 ggaggactgt tttttcataa atcagttttg tgaggtaagg aaggatgaac aagttgagaa 2341 ggaaaacact tacactagtt acttggacaa gttctttagc aggaaagaag atactgaaat 2401 gctagaaact gagccagtag aggatgggaa gcttggggag agaggacatg aggaaggatt 2461 tctgaacaac agtggggagt tcctctttaa caagcagctc gagtccatag gcatcccaca 2521 gtttcacagt ccagttgggt caccacttaa gtcaatacag gccacattaa caccttctgc 2581 tatgaaatct tcccctcaaa ttcctcatca aacatacagc agcattctga aacatctgaa 2641 ctaaaacact cagcagacat ttatctttgt attcttcatg aaatgtgttt tgtctttttt 2701 ttttattact agtgtttaag tcatttttta cttgaatcag atggtgtcat ttagtaagga 2761 ttttatgagt tcttgttttt taaaatccag actttctttt tctacatgtg agatagtttt 2821 cattttaact ggcatgtcat ttgcacacaa aaataaagac tagagcaaaa taatgcaacg 2881 caggaggaga aaagaaatgc actaagacaa gaacattctc tcatagaaca ttgatctgtt 2941 ttacaggaaa caaaccttgc cttgaaattt acacagtgag actgtacata attgcatgaa 3001 aatagctatt tttttcctaa gacatttttc attcatgaat attttcaagt ttttcatact 3061 gtacacattt cttaaaacac atgataccag cagcaactga aaatgaatgc cgaatttggt 3121 acacatgtgt tatctacctc aaggtaacaa gagtatgtgg caaaacatat accacccata 3181 gtgcttcaca aaatgcactt ctatttagcc agcgtttatt gtagtaaact attcttaata 3241 aaactcactc actgtttata aatgttctgg tatgcattct ttatagtgaa gtgttaatac 3301 atcacatctt atttatttta gcaaatcagt atattttctg tatttaatta taaaaaatta 3361 acttagtttt taaaatttat ttgcaaatat actttttcca tttggcacta tggtttgttg 3421 cctacctagc tgcatctata atgtcagctt atcctaaggc tgtccacgta cttaatttac 3481 ttaagtgttc attttaagta acgtgctcac tgtgtatagg aatttgtatt ttggaggtgc 3541 ttgatctatc tacaaaagaa aaaattaatt aggaattact ttattataaa atgctcctag 3601 aagtcttaat tgtgtttatt ttttaaaaaa acaaatgtta gacttgtgtg catggaagta 3661 attaaggtac atcattattg tagtttgaaa gttgtacatg ataagacatt ttgtttttac 3721 tgtatgtttt tactgaatga tctattcccc atcccaaggc aagcatgaat aaaattaggt 3781 taaacgtagc atgtggcatc gcagtctctt agaatttgtt tcatctattt tattttattg 3841 aatactgtct gtatctttgg ttatcctgtt tgaagaaaaa ggacaaataa aacatggcca 3901 gcaaatacaa aaaaaaaaaa // LOCUS HSERV9 3918 bp RNA PRI 23-FEB-1993 DEFINITION Human endogenous retrovirus pHE.1 (ERV9). ACCESSION X57147 M37638 NID g38332 KEYWORDS endogenous retrovirus; retrovirus. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3918) AUTHORS Lania,L. TITLE Direct Submission JOURNAL Submitted (28-DEC-1990) Lania L., Department of Genetics, General and Molecular Biology, University of Naples via Mezzocannone, 8 80124 Napoli, Italy REFERENCE 2 (bases 1 to 3918) AUTHORS La Mantia,G., Maglione,D., Pengue,G., Di Cristofano,A., Simeone,A., Lanfrancone,L. and Lania,L. TITLE Identification and characterization of novel human endogenous retroviral sequences prefentially expressed in undifferentiated embryonal carcinoma cells JOURNAL Nucleic Acids Res. 19 (7), 1513-1520 (1991) MEDLINE 91227143 FEATURES Location/Qualifiers source 1..3918 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="teratocarcinoma" /cell_type="ntera" /cell_line="NT2/D1" /clone_lib="NT2/D1 gt10" mRNA <1..3904 /gene="endogenous retrovirus ERV9" gene 1..3904 /gene="endogenous retrovirus ERV9" mat_peptide 1592..1979 /gene="endogenous retrovirus ERV9" /note="potential mat peptide" CDS 1592..1981 /gene="endogenous retrovirus ERV9" /note="potential CDS" /codon_start=1 /db_xref="PID:g38333" /translation="MDPWIQRDSQAPLIKETQRANTHLVEWEPEAETTFKTLKQALVQ APALSLPTGQNFSLYVRERARIALGVLTQTHGTTPQPVAYLSKEIDVVAKGWPHCLRV VAAVAVLASEAIKIIQGKDLTSRLLMM" BASE COUNT 1155 a 962 c 894 g 907 t ORIGIN 1 cggagaaagc tccaaaagca agccctgggc cctgaacaaa atctagagac attattaaac 61 ctggcaacct cggtgttcta taatagggac caagaggaac aggtccaaaa ggaaaagcga 121 gatcagagaa aggccgcagc cttagtcatg gccctcagac aaacaaacct tggtggttca 181 gagaggacag aacatgaagc aggccaatca cctggtaagg cttgttatca gtgtggttta 241 ctgggacact ttaaaaaaga ttgtccaatg agaaacaagc tgccccctcg tccgtgtcca 301 ctatgctgag gcaatcactg gaaggtgcac tgccccagag gatgaaggtt ccctgggtca 361 gaagccccca gccagacgat ccaacaacag gactgagggt gcctggggca agcgccagct 421 catgtcatca ccctcactga gccccgggta tgtttaacta ttgagggcca ggaaattgac 481 ttcctcctgg acactggcgc ggccttctca gtgttaatct cctgtcctgg acgactctcc 541 tcaaggtccg ttaccatcca aggaatcctg ggacagcctg taaccaggta tttctcccac 601 ctcctcagtt gtaattggga gactttgctc ttttcacatg cctttcttgt tatgcctgaa 661 agtcccacag ggttattagg gagggatata ttagccaagg ctggagctat tatctacatg 721 aatatgggga acaagttacc catttgttgt cccctacttg aggagggaat caaccctgaa 781 gtctgggcat tggaaggaca atttggaagg gcaaaaaatg cccacccagt ccaaatcagg 841 ttaaaagatc ccaccacttt tccttatcaa aggcaatatc cctaaggcct gaagctcata 901 aaggattaca gaatattgtt aaacatttga aagctcaagg cttagtaagg aaatgcagca 961 gtccctgcaa caccccaatt ctggaagtac aaagaccgag actagtgcaa gatcttagac 1021 tcattaatga ggcagtaatt tcactatatc cagttgtacc caacccctat accctgctct 1081 ctcaaatacc agaggaagca gaatggttca cagttctgga cctcaaggat gccttcttct 1141 gtgttcccct gcactctgat tcacagttcc tctttgcttt tgaggatccc acaaaccaca 1201 catcccaact tacatggatg gtcttgcccc aagggtttag ggatagccct catctgtttg 1261 gtcaggccct agccaaagat ctaggccact tctcaagtcc aggcactctg gtccttcaat 1321 atgtggatga tttacttttg gctaccagtt aggaagcctt gtgccagcag gctactctag 1381 atctcttgaa ctttctagct aatcaagggt acaaggtgtc tatgtcgaag gcccagcttt 1441 gcctacagca ggttaaatat ctaggcctaa tcttagccaa agggaccagg gccctcagca 1501 aggaatgaat acagcctata ctggcttatc ctcgccctaa gacattaaaa cagttgaggg 1561 agttccttgg aattaccagc ttttgccgac tatggatccc tggatacagc gagacagcca 1621 ggcccctcta atcaaggaaa cccagagggc aaatactcat ctagttgagt gggaaccaga 1681 ggcagaaaca accttcaaaa ccttaaagca ggctctagta caagctccag ctttaagcct 1741 tcccacagga cagaatttct ctttatatgt cagagagaga gccaggatag ctcttggagt 1801 cctcactcaa actcatggga caaccccaca accagtggca tacctaagta aggaaattga 1861 tgtagtagca aaaggctggc ctcactgttt aagggtagtt gcagcagtgg ccgtcttagc 1921 atcagaggct atcaaaataa tacaaggaaa ggatctcact tctagactac ttatgatgta 1981 aatggcatac taggtgccaa aggaagttta tggctatcag acaactgcct acttagatac 2041 caggcactac tccttgaggg accagtgctt caaatatgca catgcatggc cctcaaccct 2101 gccacttttc tcccagagga tggggaacca atcaagcatg actgccaaca aattatagtc 2161 cagacttatg ccgcccgaga tgatctctta gaagtcccct taactaatcc tgaccttaac 2221 ctatatactg atggaagttc atttgtggag aatgggatac gaaggttagt gatgtaacca 2281 tacttgaaag caagcctctt cccccaggga ccagtgccca gttagcggaa ctagtggcac 2341 ttacctgggc cttagaactg ggaaagggaa aaagaataaa tgtgtataca gatagcaagt 2401 atgcttatct aatcctacat gcccatgctg caatatggaa agagagggag ttcctaacct 2461 ctgtaggaac ccccattaaa taccacaagg aaattataga gttattgcac gcaatgcaaa 2521 aacacaaaga ggtgggaatc ttacactgac aaagccatca gaataggaag gagaggggag 2581 aacagcagca taagcagctg gcagaggcag cagaaaggaa agaaagagac aggaagtcaa 2641 agaaagagac agagaggaag agacaagaag ttagagagaa agagggacag acacagaagt 2701 caaagagagg gtaaaaaaga gaggaagaga caaagaagaa gtcaaagaga gagatggaag 2761 tagtaaagaa aaaacagtgt acctattcct ttaaaagcca gggtaaattt ctctctaccc 2821 acgcaaggca attctctatt ggatctcaac ccatatctgc ctctcaaaca gttgaagaaa 2881 taatgaaatc tatccttact ttacaatccc aaataagact ctttggcagc agtgactctc 2941 caaaaccgct gaggcctaga tctcctcact gctgaaaaag gaggactctg caccttctta 3001 ggggaagagt gttgttttta cactaaccag tcagggatag catgagatgc cacccagcgt 3061 ttacaggaaa aggcttctga aatcagacgc ctttcaaatt cttataccaa cctctggagt 3121 tgggcaacat ggcttctccc ctttctaggt cccgtggcag ccatcttgct gttactcgcc 3181 tttgggcccc gtatttttaa ccttcttgtc aaatttgttt ggtctagaat cgaggccatc 3241 aagctacaga tggtcttaca aatcgaaccc caaatgagtt caactaacaa cttctaccga 3301 ggacccctgg actgaccagc tggcacttcc cctggcctag agagttcccc tctgaaggac 3361 actacaactg caaagcccct tcttcgcccc tatccagcag gaagtagcta gagcagtcat 3421 cggccaaatt cccaacagca gttggggtgt cctgttgatt gaggggtgac agcatgctgg 3481 cagtcctcac agccctcact cgctcgctca ctctcggcac ctcctctgcc tgggctccca 3541 ctttggcagc acttgaggag cccttcagct ctgtatctag ctactctgat gggtccttgg 3601 agaaccttta tgtctagctc agggattgta aatacaccaa tcagcaccct gtgtctagct 3661 cagggtttgt gaatgcacca atggacactc tgtatctagc tactctggtg gggccttgga 3721 gaacctttgt gtcaacactc tgtatctaac taacctggtg gggatgtgga gaacctttgt 3781 gtctagctca gggatgtaaa cgcaccaatc agtgccctgt caaaccactc ggctctacca 3841 atcagcagga tgtgggtggg gccagataag agaataaaag caggctgccc gagccagcag 3901 tggcaaaaaa aaaaaaaa // LOCUS HSERYF1 1498 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for the transcription factor Eryf1. ACCESSION X17254 NID g31242 KEYWORDS DNA-binding protein; Eryf1 gene; transcription factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1498) AUTHORS Trainor,C.D. TITLE Direct Submission JOURNAL Submitted (06-DEC-1989) Trainor C.D., LMB, NIDDK National Institute of Health, Bldg 2 Room 310, Bethesda MD 20892, U S A REFERENCE 2 (bases 1 to 1498) AUTHORS Trainor,C.D., Evans,T., Felsenfeld,G. and Boguski,M.S. TITLE Structure and evolution of a human erythroid transcription factor JOURNAL Nature 343 (6253), 92-96 (1990) MEDLINE 90114418 COMMENT Eryf1 is sometimes also referred to as GF-1 or NF-E1. FEATURES Location/Qualifiers source 1..1498 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="bone marrow" /clone_lib="lambda gt11" /clone="H9" CDS 113..1354 /note="Eryf1 transcription factor (AA 1-413)" /codon_start=1 /db_xref="PID:g31243" /db_xref="SWISS-PROT:P15976" /translation="MEFPGLGSLGTSEPLPQFVDPALVSSTPESGVFFPSGPEGLDAA ASSTAPSTATAAAAALAYYRDAEAYRHSPVFQVYPLLNCMEGIPGGSPYAGWAYGKTG LYPASTVCPTREDSPPQAVEDLDGKGSTSFLETLKTERLSPDLLTLGPALPSSLPVPN SAYGGPDFSSTFFSPTGSPLNSAAYSSPKLRGTLPLPPCEARECVNCGATATPLWRRD RTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCTNCQTTTTTLWRRNASGD PVCNACGLYYKLHQVNRPLTMRKDGIQTRNRKASGKGKKKRGSSLGGTGAAEGPAGGF MVVAGGSGSGNCGEVASGLTLGPPGTAHLYQGLGPVVLSGPVSHLMPFPGPLLGSPTG SFPTGPMPPTTSTTVVAPLSS" misc_feature 1478..1483 /note="put. polyA signal" polyA_site 1498 /note="polyA addition site" BASE COUNT 307 a 502 c 410 g 279 t ORIGIN 1 gcaaaggcca aggccagcca ggacaccccc tgggatcaca ctgagcttgc cacatcccca 61 aggcggccga accctccgca accaccagcc caggttaatc cccagaggct ccatggagtt 121 ccctggcctg gggtccctgg ggacctcaga gcccctcccc cagtttgtgg atcctgctct 181 ggtgtcctcc acaccagaat caggggtttt cttcccctct gggcctgagg gcttggatgc 241 agcagcttcc tccactgccc cgagcacagc caccgctgca gctgcggcac tggcctacta 301 cagggacgct gaggcctaca gacactcccc agtctttcag gtgtacccat tgctcaactg 361 tatggagggg atcccagggg gctcaccata tgccggctgg gcctacggca agacggggct 421 ctaccctgcc tcaactgtgt gtcccacccg cgaggactct cctccccagg ccgtggaaga 481 tctggatgga aaaggcagca ccagcttcct ggagactttg aagacagagc ggctgagccc 541 agacctcctg accctgggac ctgcactgcc ttcatcactc cctgtcccca atagtgctta 601 tgggggccct gacttttcca gtaccttctt ttctcccacc gggagccccc tcaattcagc 661 agcctattcc tctcccaagc ttcgtggaac tctccccctg cctccctgtg aggccaggga 721 gtgtgtgaac tgcggagcaa cagccactcc actgtggcgg agggacagga caggccacta 781 cctatgcaac gcctgcggcc tctatcacaa gatgaatggg cagaacaggc ccctcatccg 841 gcccaagaag cgcctgattg tcagtaaacg ggcaggtact cagtgcacca actgccagac 901 gaccaccacg acactgtggc ggagaaatgc cagtggggat cccgtgtgca atgcctgcgg 961 cctctactac aagctacacc aggtgaaccg gccactgacc atgcggaagg atggtattca 1021 gactcgaaac cgcaaggcat ctggaaaagg gaaaaagaaa cggggctcca gtctgggagg 1081 cacaggagca gccgaaggac cagctggtgg ctttatggtg gtggctgggg gcagcggtag 1141 cgggaattgt ggggaggtgg cttcaggcct gacactgggc cccccaggta ctgcccatct 1201 ctaccaaggc ctgggccctg tggtgctgtc agggcctgtt agccacctca tgcctttccc 1261 tggaccccta ctgggctcac ccacgggctc cttccccaca ggccccatgc cccccaccac 1321 cagcactact gtggtggctc cgctcagctc atgagggcac agagcatggc ctccagagga 1381 ggggtggtgt ccttctcctc ttgtagccag aattctggac aacccaagtc tctgggcccc 1441 aggcaccccc tggcttgaac cttcaaagct tttgtaaaat aaaaccacca aagtcctg // LOCUS HSESCORF 2563 bp RNA PRI 18-JAN-1993 DEFINITION H.sapiens (Ewing's sarcoma cell line) mRNA encoding open reading frame. ACCESSION Z14138 NID g31244 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2563) AUTHORS Chan,A.M., Chedid,M., Aaronson,S.A., Miki,T. and McGovern,E.S. TITLE A transforming gene isolated by expression cloning from Ewing's sarcoma Cell line JOURNAL Unpublished REFERENCE 2 (bases 1 to 2563) AUTHORS Chan,A.M. TITLE Direct Submission JOURNAL Submitted (28-JUL-1992) Andrew M. L. Chan, Laboratory of Cellular & Molecular Biol., National Institute of Health, National Cancer Institute, Building 37, Room 1E24, Bethesda, Maryland, MD 20892, U.S.A FEATURES Location/Qualifiers source 1..2563 /organism="Homo sapiens" /isolate="From Ewing's sarcoma Cell line, SK-ES-1" /db_xref="taxon:9606" /tissue_type="Bone" /cell_type="Mesenchymal cells" /cell_line="SK-ES-1" /clone_lib="SK-ES-1 cDNA library" /clone="est" /chromosome="10" /sex="Male" mRNA 1..2563 CDS 161..1564 /note="open reading frame" /codon_start=1 /db_xref="PID:g31245" /translation="MEYMSTGSDNKEEIDLLIKHLNVSDVIDIMENLYASEEPAVYEP SLMTMCQDSNQNDERSKSLLLSGQEVPWLSSVRYGTVEDLLAFANHISNTAKHFYGQR PQESGILLNMVITPQNGRYQIDSDVLLIPWKLTYRNIGSDFIPRGAFGKVYLAQDIKT KKRMACKLIPVDQFKPSDVEIQACFRHENIAELYGAVLWGETVHLFMEAGEGGSVLEK LESCGPMREFEIIWVTKHVLKGLDFLHSKKVIHHDIKPSNIVFMSTKAVLVDFGLSVQ MTEDVYFPKDLRGTEIYMSPEVILCRGHSTKADIYSLGATLIHMQTGTPPWVKRYPRS AYPSYLYIIHKQAPPLEDIADDCSPGMRELIEASLERNPNHRPRAADLLKHEALNPPR EDQPRCQSLDSALLERKRLLSRKELELPENIADSSCTGSTEESEMLKRQRSLYIDLGA LAGYFNLVRGPPTLEYG" polyA_signal 2541..2546 polyA_site 2563 BASE COUNT 750 a 518 c 556 g 739 t ORIGIN 1 caatcgcgaa accggtttcg cgattggtac caatcgcgaa accaaatcct gttgccaatc 61 cttgtatgtc agtttcccat gggtcttgaa tgcaaataca aatatcgtaa actaaatatt 121 tgtgttttct ttcctagact ctccagaaag agcaacagta atggagtaca tgagcactgg 181 aagtgacaat aaagaagaga ttgatttatt aattaaacat ttaaatgtgt ctgatgtaat 241 agacattatg gaaaatcttt atgcaagtga agagccagca gtttatgaac ccagtctaat 301 gaccatgtgt caagacagta atcaaaacga tgagcgttct aagtctctgc tgcttagtgg 361 ccaagaggta ccatggttgt catcagtcag atatggaact gtggaggatt tgcttgcttt 421 tgcaaaccat atatccaaca ctgcaaagca tttttatgga caacgaccac aggaatctgg 481 aattttatta aacatggtca tcactcccca aaatggacgt taccaaatag attccgatgt 541 tctcctgatc ccctggaagc tgacttacag gaatattggt tctgatttta ttcctcgggg 601 cgcctttgga aaggtatact tggcacaaga tataaagacg aagaaaagaa tggcgtgtaa 661 actgatccca gtagatcaat ttaagccatc tgatgtggaa atccaggctt gcttccggca 721 cgagaacatc gcagagctgt atggcgcagt cctgtggggt gaaactgtcc atctctttat 781 ggaagcaggc gagggagggt ctgttctgga gaaactggag agctgtggac caatgagaga 841 atttgaaatt atttgggtga caaagcatgt tctcaaggga cttgattttc tacactcaaa 901 gaaagtgatc catcatgata ttaaacctag caacattgtt ttcatgtcca caaaagctgt 961 tttggtggat tttggcctaa gtgttcaaat gaccgaagat gtctattttc ctaaggacct 1021 ccgaggaaca gagatttaca tgagcccaga ggtcatcctg tgcaggggcc attcaaccaa 1081 agcagacatc tacagcctgg gggccacgct catccacatg cagacgggca ccccaccctg 1141 ggtgaagcgc taccctcgct cagcctatcc ctcctacctg tacataatcc acaagcaagc 1201 acctccactg gaagacattg cagatgactg cagtccaggg atgagagagc tgatagaagc 1261 ttccctggag agaaacccca atcaccgccc aagagccgca gacctactaa aacatgaggc 1321 cctgaacccg cccagagagg atcagccacg ctgtcagagt ctggactctg ccctcttgga 1381 gcgcaagagg ctgctgagta ggaaggagct ggaacttcct gagaacattg ctgattcttc 1441 gtgcacagga agcaccgagg aatctgagat gctcaagagg caacgctctc tctacatcga 1501 cctcggcgct ctggctggct acttcaatct tgttcgggga ccaccaacgc ttgaatatgg 1561 ctgaaggatg ccatgtttgc tctaaattaa gacaggcatt gatctcctgg aggctggttc 1621 tgctgcctct acacaggggc cctgtacagt gaatggtgcc attttcgaag gagcagcagt 1681 gtgacctcct gtgacccatg aatgtgcctc caagcggccc tgtgtgtttg acatgtgaag 1741 ctatttgata tgcaccaggt ctcaaggttc tcatttctca ggtgacgtga ttctaaggca 1801 ggatttgaga gttcacagaa ggatcgtgtc tgctgactgt ttcattcact gtgcactttg 1861 ctcaaaattt taaaaatacc aatcacaagg ataatagagt agcctaaaat tactattctt 1921 ggttcttatt taagtatgga atattcattt tactcagaat agctgttttg tgtatattgg 1981 tgtatattat ataactcttt gagcctttat tggtaaattc tggtatacat tgaattcatt 2041 ataatttggg tgactagaac aacttgaaga ttgtagcaat aagctggact agtgtcctaa 2101 aaatggctaa ctgatgaatt agaagccatc tgacagcagg ccactagtga cagtttcttt 2161 tgtgttccta tggaaacatt ttatactgta catgctatct gaagacattc aaaacgtgat 2221 gttttgaatg tggataaaac tgtgtaaacc acataatttt tgtacatccc aaaggatgag 2281 cctgttgacc tttaagaaaa atgaaaactt ttgtaaatta ttgatgattt tgtaattctt 2341 atgactaaat tttcttttaa gcatttgtat attaaaatag catactgtgt atgttttata 2401 tcaaatgcct tcatgaatct ttcatacata tatatatttg taacattgta aagtatgtga 2461 gtagtcttat gtaaagtatg ttttacatta tgcaaataaa acccaatact tttgtccaat 2521 gtggttggtc aaatcaactg aataaattca gtattttgcc tta // LOCUS HSET1R 4105 bp RNA PRI 23-MAR-1993 DEFINITION H.sapiens mRNA for endothelin-1 receptor. ACCESSION X61950 NID g288312 KEYWORDS endothelin-1 receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4105) AUTHORS Hosoda,K., Nakao,K., Hiroshi-Arai, Suga,S., Ogawa,Y., Mukoyama,M., Shirakami,G., Saito,Y., Nakanishi,S. and Imura,H. TITLE Cloning and expression of human endothelin-1 receptor cDNA JOURNAL FEBS Lett. 287 (1-2), 23-26 (1991) MEDLINE 91348221 FEATURES Location/Qualifiers source 1..4105 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 485..1768 /codon_start=1 /product="endothelin-1 receptor" /db_xref="PID:g288313" /db_xref="SWISS-PROT:P25101" /translation="METLCLRASFWLALVGCVISDNPERYSTNLSNHVDDFTTFRGTE LSFLVTTHQPTNLVLPSNGSMHNYCPQQTKITSAFKYINTVISCTIFIVGMVGNATLL RIIYQNKCMRNGPNALIASLALGDLIYVVIDLPINVFKLLAGRWPFDHNDFGVFLCKL FPFLQKSSVGITVLNLCALSVDRYRAVASWSRVQGIGIPLVTAIEIVSIWILSFILAI PEAIGFVMVPFEYRGEQHKTCMLNATSKFMEFYQDVKDWWLFGFYFCMPLVCTAIFYT LMTCEMLNRRNGSLRIALSEHLKQRREVAKTVFCLVVIFALCWFPLHLSRILKKTVYN EMDKNRCELLSFLLLMDYIGINLATMNSCINPIALYFVSKKFKNCFQSCLCCCCYQSK SLMTSVPMNGTSIQWKNHDQNNHNTDRSSHKDSMN" BASE COUNT 1138 a 859 c 845 g 1263 t ORIGIN 1 gaattcgcgg ccgcctcttg cggtcccaga gtggagtgga aggtctggag ctttgggagg 61 agacggggag gacagactgg aggcgtgttc ctccggagtt ttctttttcg tgcgagccct 121 cgcgcgcgcg tacagtcatc ccgctggtct gacgattgtg gagaggcggt ggagaggctt 181 catccatccc acccggtcgt cgccggggat tggggtccca gcgacacctc cccgggagaa 241 gcagtgccca ggaagttttc tgaagccggg gaagctgtgc agccgaagcc gccgccgcgc 301 cggagcccgg gacaccggcc accctccgcg ccacccaccc tcgctttctc cggcttcctc 361 tggcccaggc gccgcgcgga cccggcagct gtctgcgcac gccgagctcc acggtgaaaa 421 aaaaagtgaa ggtgtaaaag cagcacaagt gcaataagag atatttcctc aaatttgcct 481 caagatggaa accctttgcc tcagggcatc cttttggctg gcactggttg gatgtgtaat 541 cagtgataat cctgagagat acagcacaaa tctaagcaat catgtggatg atttcaccac 601 ttttcgtggc acagagctca gcttcctggt taccactcat caacccacta atttggtcct 661 acccagcaat ggctcaatgc acaactattg cccacagcag actaaaatta cttcagcttt 721 caaatacatt aacactgtga tatcttgtac tattttcatc gtgggaatgg tggggaatgc 781 aactctgctc aggatcattt accagaacaa atgtatgagg aatggcccca acgcgctgat 841 agccagtctt gcccttggag accttatcta tgtggtcatt gatctcccta tcaatgtatt 901 taagctgctg gctgggcgct ggccttttga tcacaatgac tttggcgtat ttctttgcaa 961 gctgttcccc tttttgcaga agtcctcggt ggggatcacc gtcctcaacc tctgcgctct 1021 tagtgttgac aggtacagag cagttgcctc ctggagtcgt gttcagggaa ttgggattcc 1081 tttggtaact gccattgaaa ttgtctccat ctggatcctg tcctttatcc tggccattcc 1141 tgaagcgatt ggcttcgtca tggtaccctt tgaatatagg ggtgaacagc ataaaacctg 1201 tatgctcaat gccacatcaa aattcatgga gttctaccaa gatgtaaagg actggtggct 1261 cttcgggttc tatttctgta tgcccttggt gtgcactgcg atcttctaca ccctcatgac 1321 ttgtgagatg ttgaacagaa ggaatggcag cttgagaatt gccctcagtg aacatcttaa 1381 gcagcgtcga gaagtggcaa aaacagtttt ctgcttggtt gtaatttttg ctctttgctg 1441 gttccctctt cacttaagcc gtatattgaa gaaaactgtg tataacgaaa tggacaagaa 1501 ccgatgtgaa ttacttagtt tcttactgct catggattac atcggtatta acttggcaac 1561 catgaattca tgtataaacc ccatagctct gtattttgtg agcaagaaat ttaaaaattg 1621 tttccagtca tgcctctgct gctgctgtta ccagtccaaa agtctgatga cctcggtccc 1681 catgaacgga acaagcatcc agtggaagaa ccacgatcaa aacaaccaca acacagaccg 1741 gagcagccat aaggacagca tgaactgacc acccttagaa gcactcctcg gtactcccat 1801 aatcctctcg gagaaaaaaa tcacaaggca actgtgactc cgggaatctc ttctctgatc 1861 cttcttcctt aattcactcc cacacccaag aagaaatgct ttccaaaacc gcaaggtaga 1921 ctggtttatc cacccacaac atctacgaat cgtacttctt taattgatct aatttacata 1981 ttctgcgtgt tgtattcagc actaaaaaat ggtgggagct gggggagaat gaagactgtt 2041 aaatgaaacc agaaggatat ttactacttt tgcatgaaaa tagagctttc aagtacatgg 2101 ctagctttta tggcagttct ggtgaatgtt caatgggaac tggtcaccat gaaactttag 2161 agattaacga caagattttc tacttttttt aagtgatttt ttgtccttca gccaaacaca 2221 atatgggctc aggtcacttt tatttgaaat gtcatttggt gccagtattt tttaactgca 2281 taatagccta acatgattat ttgaacttat ttacacatag tttgaaaaaa aaaagacaaa 2341 aatagtattc aggtgagcaa ttagattagt attttccacg tcactattta tttttttaaa 2401 acacaaattc taaagctaca acaaatacta caggccctta aagcacagtc tgatgacaca 2461 tttggcagtt taatagatgt tactcaaaga attttttaag aactgtattt tattttttaa 2521 atggtgtttt attacaaggg accttgaaca tgttttgtat gttaaattca aaagtaatgc 2581 ttcaatcaga tagttctttt tcacaagttc aatactgttt ttcatgtaaa ttttgtatga 2641 aaaatcaatg tcaagtacca aaatgttaat gtatgtgtca tttaactctg cctgagactt 2701 tcagtgcact gtatatagaa gtctaaaaca cacctaagag aaaaagatcg aatttttcag 2761 atgattcgga aattttcatt caggtatttg taatagtgac atatatatgt atatacatat 2821 cacctcctat tctcttaatt tttgttaaaa tgttaactgg cagtaagtct tttttgatca 2881 ttcccttttc catataggaa acataatttt gaagtggcca gatgagttta tcatgtcagt 2941 gaaaaataat tacccacaaa tgccaccagt aacttaacga ttcttcactt cttggggttt 3001 tcagtatgaa cctaactccc caccccaaca tctccctccc acattgtcac catttcaaag 3061 ggcccacagt gacttttgct gggcattttc ccagatgttt acagactgtg agtacagcag 3121 aaaatctttt actagtgtgt gtgtgtatat atataaacaa ttgtaaattt cttttagccc 3181 atttttctag actgtctctg tggaatatat ttgtgtgtgt gatatatgca tgtgtgtgat 3241 ggtatgtatg gatttaatct aatctaataa ttgtgccccg cagttgtgcc aaagtgcata 3301 gtctgagcta aaatctaggt gattgttcat catgacaacc tgcctcagtc cattttaacc 3361 tgtagcaacc ttctgcattc ataaatcttg taatcatgtt accattacaa atgggatata 3421 agaggcagcg tgaaagcaga tgagctgtgg actagcaata tagggttttg tttggttggt 3481 tggtttgata aagcagtatt tggggtcata ttgtttcctg tgctggagca aaagtcatta 3541 cactttgaag tattatattg ttcttatcct caattcaatg tggtgatgaa attgccaggt 3601 tgtctgatat ttctttcaga cttcgccaga cagattgctg ataataaatt aggtaagata 3661 atttgttggg ccatatttta ggacaggtaa aataacatca ggttccagtt gcttgaattg 3721 caaggctaag aagtactgcc cttttgtgtg ttagcagtca aatctattat tccactggcg 3781 catcatatgc agtgatatat gcctataata taagccatag gttcacacca ttttgtttag 3841 acaattgtct ttttttcaag atgctttgtt tctttcatat gaaaaaaatg cattttataa 3901 attcagaaag tcatagattt ctgaaggcgt caacgtgcat tttatttatg gactggtaag 3961 taactgtggt ttactagcag gaatatttcc aatttctacc tttactacat cttttcaaca 4021 agtaactttg tagaaatgag ccagaagcca aggccctgag ttggcagtgg cccataagtg 4081 taaaataaaa gtttacagaa acctt // LOCUS HSET2RNA 1230 bp RNA PRI 24-JUN-1992 DEFINITION H.sapiens ET-2 mRNA for endothelin-2. ACCESSION X55177 NID g31258 KEYWORDS endothelin; endothelin 2. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1230) AUTHORS Ohkubo,S., Ogi,K., Hosoya,M., Matsumoto,H., Suzuki,N., Kimura,C., Ondo,H. and Fujino,M. TITLE Specific expression of human endothelin-2 (ET-2) gene in a renal adenocarcinoma cell line. Molecular cloning of cDNA encoding the precursor of ET-2 and its characterization JOURNAL FEBS Lett. 274 (1-2), 136-140 (1990) MEDLINE 91071415 FEATURES Location/Qualifiers source 1..1230 /organism="Homo sapiens" /db_xref="taxon:9606" gene 59..595 /gene="ET-2" sig_peptide 59..130 /gene="ET-2" CDS 59..595 /gene="ET-2" /codon_start=1 /product="prepro-endothelin-2" /db_xref="PID:g31259" /db_xref="SWISS-PROT:P20800" /translation="MVSVPTTWCSVALALLVALHEGKGQAAATLEQPASSSHAQGTHL RLRRCSCSSWLDKECVYFCHLDIIWVNTPEQTAPYGLGNPPRRRRRSLPRRCQCSSAR DPACATFCLRRPWTEAGAVPSRKSPADVFQTGKTGATTGELLQRLRDISTVKSLFAKR QQEAMREPRSTHSRWRKR" mat_peptide 131..592 /gene="ET-2" /product="pro-endothelin-2" BASE COUNT 240 a 389 c 344 g 257 t ORIGIN 1 aggacgctgg caacaggcac tccctgctcc agtccagcct gcgcgctcca ccgccgctat 61 ggtctccgtg cctaccacct ggtgctccgt tgcgctagcc ctgctcgtgg ccctgcatga 121 agggaagggc caggctgctg ccaccctgga gcagccagcg tcctcatctc atgcccaagg 181 cacccacctt cggcttcgcc gttgctcctg cagctcctgg ctcgacaagg agtgcgtcta 241 cttctgccac ttggacatca tctgggtgaa cactcctgaa cagacagctc cttacggcct 301 gggaaacccg ccaagacgcc ggcgccgctc cctgccaagg cgctgtcagt gctccagtgc 361 cagggacccc gcctgtgcca ccttctgcct tcgaaggccc tggactgaag ccggggcagt 421 cccaagccgg aagtcccctg cagacgtgtt ccagactggc aagacagggg ccactacagg 481 agagcttctc caaaggctga gggacatttc cacagtcaag agcctctttg ccaagcgaca 541 acaggaggcc atgcgggagc ctcggtccac acattccagg tggaggaaga gatagtgtcg 601 tgagctggag gaacattggg aaggaagccc gcggggagag aggaggagag aagtggccag 661 ggcttgtgga ctctctgcct gcttcctgga ccggggcctt ggtcccagac agctggaccc 721 atttgccagg attggcacaa gctccctggt gagggagcct cgtccaaggc agttctgtgt 781 cctcgcactg cccagggaag ccctcggcct ccagactgcg gagcagcctc cagtgctggc 841 tgctggccca cagctctgct ggaagaactg catggggagt acattcatct ggaggctgcg 901 tcctgaggag tgtcctgtct gctgggctac aaaccaggag caaccgtgca gccacgaaca 961 cgcatgcctc agccagccct ggagactgga tggctcccct gaggctggca tcctggctgg 1021 ctgtgtcctc tccagctttc cctccccaga gttcttgcac cctcattccc tcgggaccct 1081 cccagtgaga agggcctgct ctgcttttcc tgtctgtata taacttattt gccctaagaa 1141 ctttgagaat cccaattatt tattttaatg tattttttag accctctatt tacctgcgaa 1201 cttgtgttta taataaatga ggaaacatca // LOCUS HSET3AA 2299 bp RNA PRI 11-DEC-1992 DEFINITION H.sapiens endothelin 3 mRNA. ACCESSION X52001 NID g31260 KEYWORDS endothelin; endothelin 3. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2299) AUTHORS Onda,H., Ohkubo,S., Ogi,K., Kosaka,T., Kimura,C., Matsumoto,H., Suzuki,N. and Fujino,M. TITLE One of the endothelin gene family, endothelin 3 gene, is expressed in the placenta JOURNAL FEBS Lett. 261 (2), 327-330 (1990) MEDLINE 90184472 FEATURES Location/Qualifiers source 1..2299 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" gene 111..785 /gene="ET3" CDS 111..785 /gene="ET3" /codon_start=1 /product="prepro-endothelin 3" /db_xref="PID:g31261" /translation="MEPGLWLLFGLTVTSAAGFVPCSQSGDAGRRGVSQAPTAARSEG DCEETVAGPGEETVAGPGEGTVAPTALQGPSPGSPGQEQAAEGAPEHHRSRRCTCFTY KDKECVYYCHLDIIWINTPEQTVPYGLSNYRGSFRGKRSAGPLPGNLQLSHRPHLRCA CVGRYDKACLHFCTQTLDVSRQVEVKDQQSKQALDLHHPKLMPGSGLALAPSTCPRCL FQEGAP" mat_peptide 159..782 /gene="ET3" /product="endothelin 3" BASE COUNT 592 a 571 c 582 g 554 t ORIGIN 1 cgaaccccca cagctggagg gcgaggccag ctgtacccgg ccccagtgcc ctttcgcggc 61 cacaagcggc cgtcctcctg gtccggtgct ccggcgcctg atctaggttc atggagccgg 121 ggctgtggct ccttttcggg ctcacagtga cctccgccgc aggattcgtg ccttgctccc 181 agtctgggga tgctggcagg cgcggcgtgt cccaggcccc cactgcagcc agatctgagg 241 gggactgtga agagactgtg gctggccctg gcgaggagac tgtggctggc cctggcgagg 301 ggactgtggc cccgacagca ctgcagggtc caagccctgg aagccctggg caggagcagg 361 cggccgaggg ggcccctgag caccaccgat ccaggcgctg cacgtgcttc acctacaagg 421 acaaggagtg tgtctactat tgccacctgg acatcatttg gatcaacact cccgaacaga 481 cggtgcccta tggactgtcc aactacagag gaagcttccg gggcaagagg tctgcggggc 541 cacttccagg gaatctgcag ctctcacatc ggccacactt gcgctgcgct tgtgtgggga 601 gatatgacaa ggcctgcctg cacttttgca cccaaactct ggacgtcagc agacaggttg 661 aagtcaagga ccaacaaagc aagcaggctt tagacctcca ccatccaaag ctcatgcccg 721 gcagtggact cgccctcgct ccatctacct gcccccgctg cctctttcag gaaggagccc 781 cttaggagga caggcctgca gcatcctggt ctcgggaggc ttctgtcatt gctcacacac 841 agttcagatt tccacctctt tatagacaag aagtgaattt gcctggggca gaacacccac 901 ccaaagagtc cccacttaac aatacccccc ccccacggca agaatgccca aatccgaatg 961 accccagttt tcctaatgag taaaatgatc ccagatgtgc cccagagcat gacgcctgca 1021 gctccggttt catgcaggaa attggttttg gagagttttg gcaagttgga aagccactta 1081 ctggcttttg acatgacttc tcttggagaa taagtggact ccaagctaac tctttgcaaa 1141 tgtaaacaca tgtccatctt gtaataaatg caaaatgccc gtgcagcaga agcatgcgac 1201 tttcatatcc ttgcctagaa taggctgcat ggtgtatgtc agtgagggcc acgaggcgtc 1261 ggctttagac acagatcata gctctacagg agtttatgaa tttgaagctt atgggatttt 1321 ggcagagaaa ttttcagctg tgcttgatac ccaccaaaag aatgtatctc gaaagaatga 1381 aggaagaaga aaaaaggatc cttgatgttt gtgacaagaa aatgagaaag ttagtatctg 1441 caatacagag cttgttcctg ttcagtgact gaccctctgt attctgtata gacaccaggc 1501 cgatacacag tggagttccc aggccttgtt tgcaggaagc cgactgtaaa gacagcccca 1561 gctcaaggct attaggttga atatttgctt tcatgagtaa atgtggatct ttggggaatg 1621 gcttcaaaat aagtcacgaa cacaaattct ttgtaaatta tgtaaattcc tgtttatata 1681 aattggcaac aacttatacc gtctgacagt tcaaaatctc tttcagctgc gctcttccca 1741 ccgagccgag cttactgtga gtgtggagat gttatcccac catgtaaagt cgcctgcgca 1801 ggggagggct gcccatctcc ccaacccagt cacagagaga taggaaacgg catttgagtg 1861 ggtgtccagg gccccgtaga gagacattta agatggtgta tgacagagca ttggccttga 1921 ccaaatgtta aatcctctgt gtgtatttca taagttatta caggtataaa agtgatgacc 1981 tatcatgagg aaatgaaagt ggctgatttg ctggtaggat tttgtacagt ttagagaagc 2041 gattatttat tgtgaaactg ttctccactc caactccttt atgtggatct gttcaaagta 2101 gtcactgtat atacgtatag agaggtagat aggtaggtag attttaaatt gcattctgaa 2161 tacaaactca tactccttag agcttgaatt acatttttaa aatgcatatg tgctgtttgg 2221 caccgtggca agatggtatc agagagaaac ccatcaattg ctcaaatact cagaaagtac 2281 tgtcaaaagc ctaataaaa // LOCUS HSETFBS 852 bp RNA PRI 13-MAY-1993 DEFINITION H.sapiens mRNA for electron transfer flavoprotein beta subunit. ACCESSION X71129 NID g297901 KEYWORDS electron transfer flavoprotein; electron transfer flavoprotein beta subunit. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 852) AUTHORS Finocchiaro,G., Colombo,I., Garavaglia,B., Gellera,C., Valdameri,G., Garbuglio,N. and Didonato,S. TITLE cDNA cloning and mitochondrial import of the beta-subunit of the human electron-transfer flavoprotein JOURNAL Eur. J. Biochem. 213 (3), 1003-1008 (1993) MEDLINE 93279298 REFERENCE 2 (bases 1 to 852) AUTHORS Finocchiaro,G. TITLE Direct Submission JOURNAL Submitted (04-MAR-1993) G. Finocchiaro, Istituto Nazionale Neurologico 'C.Besta', Via Celoria 11, 20133 Milano, ITALY FEATURES Location/Qualifiers source 1..852 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="liver" /clone_lib="lambda gt11" /clone="pE6.1" /chromosome="19" CDS 28..795 /codon_start=1 /product="electron transfer flavoprotein beta subunit" /db_xref="PID:g297902" /db_xref="SWISS-PROT:P38117" /translation="MAELRVLVAVKRVIDYAVKIRVKPDRTGVVTDGVKHSMNPFCEI AVEEAVRLKEKKLVKEVIAVSCGPAQCQETIRTALAMGADRGIHVEVPPAEAERLGPL QVARVLAKLAEKEKVDLVLLGKQAIDDDCNQTGQMTAGFLDWPQGTFASQVTLEGDKL KVEREIDGGLETLRLKLPAVVTADLRLNEPRYATLPNIMKAKKKKIEVIKPGDLGVDL TSKLSVISVEDPPQRTAGVKVETTEDLVAKLKEIGRI" polyA_signal 814..819 BASE COUNT 205 a 218 c 290 g 139 t ORIGIN 1 caccctgtaa gtggctgcgg cgggaagatg gcggagctgc gcgtgctcgt agctgtcaag 61 agggtcatcg actacgccgt gaagatccga gtgaagcctg acaggaccgg tgtggtcacg 121 gatggtgtga agcactccat gaaccccttc tgtgagatcg cggtggagga ggctgtgcgg 181 ctcaaggaga agaagctggt gaaggaggtc atcgccgtca gctgtgggcc tgcacagtgc 241 caggagacga ttcgtaccgc cctggccatg ggtgcagacc gaggtatcca cgtggaggtg 301 cccccagcag aagcagaacg cttgggtccc ctgcaggtgg ctcgggtcct ggccaagctg 361 gcagagaagg agaaggtgga cctggtgctg ctgggcaaac aggccatcga tgatgactgt 421 aaccagacag ggcagatgac agctggattt cttgactggc cacagggcac attcgcctcc 481 caggtgacgc tggaggggga caagttgaaa gtggagcggg agatcgatgg gggcctggag 541 accctgcgcc tgaagctgcc agctgtggtg acagctgacc tgaggctcaa cgagccccgc 601 tacgccacgc tgcccaacat catgaaagcc aagaagaaga agatcgaggt gatcaagcct 661 ggggacctgg gtgtggacct gacctccaag ctctctgtga tcagtgtgga ggacccgccc 721 cagcgcacgg ccggcgtcaa ggtggagacc actgaggacc tggtggccaa gctgaaggag 781 attgggcgga tttgagcccc tcccagagat ggcaataaaa ctgactctca acatcaaaaa 841 aaaaaaaaaa aa // LOCUS HSETSRP 2130 bp RNA PRI 06-MAR-1996 DEFINITION H.sapiens mRNA for ets-related protein. ACCESSION X76184 NID g479166 KEYWORDS erm gene; ets-related protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2130) AUTHORS De Launoit,Y.P. TITLE Direct Submission JOURNAL Submitted (19-NOV-1993) Y.P. De Launoit, CNRS 1160 Inst Pasteur de Lille, 1 rue Calmette, 59019 Lille, Cedex, FRANCE REFERENCE 2 (bases 1 to 2130) AUTHORS Monte,D., Baert,J.L., Defossez,P.A., de Launoit,Y. and Stehelin,D. TITLE Molecular cloning and characterization of human ERM, a new member of the Ets family closely related to mouse PEA3 and ER81 transcription factors JOURNAL Oncogene 9 (5), 1397-1406 (1994) MEDLINE 94203669 COMMENT See X96374-82 for genomic DNA of erm gene. FEATURES Location/Qualifiers source 1..2130 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="testis" /clone_lib="lambda GT11" gene 196..1728 /gene="erm" CDS 196..1728 /gene="erm" /codon_start=1 /product="ets-related protein" /db_xref="PID:g479167" /db_xref="SWISS-PROT:P41161" /translation="MDGFYDQQVPFMVPGKSRSEECRGRPVIDRKRKFLDTDLAHDSE ELFQDLSQLQEAWLAEAQVPDDEQFVPDFQSDNLVLHAPPPTKIKRELHSPSSELSSC SHEQALGANYGEKCLYNYCAYDRKPPSGFKPLTPPTTPLSPTHQNPLFPPPQATLPTS GHAPAAGPVQGVGPAPAPHSLPEPGPQQQTFAVPRPPHQPLQMPKMMPENQYPSEQRF QRQLSEPCHPFPPQPGVPGDNRPSYHRQMSEPIVPAAPPPPQGFKQEYHDPLYEHGVP GMPGPPAHGFQSPMGIKQEPRDYCVDSEVPNCQSSYMRGGYFSSSHEGFSYEKDPRLY FDDTCVVPERLEGKVKQEPTMYREGPPYQRRGSLQLWQFLVTLLDDPANAHFIAWTGR GMEFKLIEPEEVARRWGIQKNRPAMNYDKLSRSLRYYYEKGIMQKVAGERYVYKFVCD PDALFSMAFPDNQRPFLKAESECHLSEEDTLPLTHFEDSPAYLLDMDRCSSLPYAEGF AY" misc_feature 1285..1542 /gene="erm" /note="ets domain" BASE COUNT 543 a 591 c 532 g 464 t ORIGIN 1 ggggcgagaa gagacgcttg gcggacgtcc gcagttggga ggagggggcg ggaagcagtt 61 tgaggggaat gtctgagagg cgctgccccc agagcaccgg gtaggggggt aaatgacaca 121 ggaggatccc ttttccccca gaaattactc aatgctgaaa cctctcaaag tggtattaga 181 gacgctgaaa gcaccatgga cgggttttat gatcagcaag tcccttttat ggtcccaggg 241 aaatctcgat ctgaggaatg cagagggcgg cctgtgattg acagaaagag gaagtttttg 301 gacacagatc tggctcacga ttctgaagag ctatttcagg atctcagtca acttcaagag 361 gcttggttag ctgaagcaca agttcctgat gatgaacagt ttgtcccaga ttttcagtct 421 gataacctgg tgcttcatgc cccacctcca accaagatca aacgggagct gcacagcccc 481 tcctctgagc tgtcgtcttg tagccatgag caggctcttg gtgctaacta tggagaaaag 541 tgcctctaca actattgtgc ctatgatagg aagcctccct ctgggttcaa gccattaacc 601 cctcctacaa cccccctctc acccacccat cagaatcccc tatttccccc acctcaggca 661 actctgccca cctcagggca tgcccctgca gctggcccag ttcaaggtgt gggccccgcc 721 cccgcccccc attcgcttcc agagcctgga ccacagcagc aaacatttgc ggtcccccga 781 ccaccacatc agcccctgca gatgccaaag atgatgcctg aaaaccagta tccatcagaa 841 cagagatttc agagacaact gtctgaaccc tgccacccct tccctcctca gccaggagtt 901 cctggagata atcgccccag ttaccatcgg caaatgtcag aacctattgt ccctgcagct 961 cccccgcccc ctcagggatt caaacaagaa taccatgacc cactctatga acatggggtc 1021 ccgggcatgc cagggccccc agcacacggg ttccagtcac caatgggaat caagcaggag 1081 cctcgggatt actgcgtcga ttcagaagtg cctaactgcc agtcatccta catgagaggg 1141 ggttatttct ccagcagcca tgaaggtttt tcatatgaaa aagatccccg attatacttt 1201 gacgacactt gtgttgtgcc tgagagactg gaaggcaaag tcaaacagga gcctaccatg 1261 tatcgagagg ggccccctta ccagaggcga ggttcccttc agctgtggca gttcctggtc 1321 acccttcttg atgacccagc caatgcccac ttcattgcct ggacaggtcg aggcatggag 1381 ttcaagctga tagaaccgga agaggttgct cggcgctggg gcatccagaa gaaccggcca 1441 gccatgaact atgacaagct gagccgctct ctccgctatt actatgaaaa gggcatcatg 1501 cagaaggtgg ctggagagcg atacgtctac aaatttgtct gtgacccaga tgccctcttc 1561 tccatggctt tcccggataa ccagcgtccg ttcctgaagg cagagtccga gtgccacctc 1621 agcgaggagg acaccctgcc gctgacccac tttgaagaca gccccgctta cctcctggac 1681 atggaccgct gcagcagcct cccctatgcc gaaggctttg cttactaagt ttctgagtgg 1741 cggagtggcc aaaccctaga gctagcagtt cccattcagg caaacaaggg cagtggtttt 1801 gtttgtgttt ttggttgttc ctaaagcttg ccctttgagt attatctgga gaacccaagc 1861 tgtctctgga ttggcaccct taaagacaga tacattggct ggggagtggg aacagggagg 1921 ggcagaaaac caccaaaagg ccagtgcctc aactcttgat tctgatgagg tttctgggaa 1981 gagatcaaaa tggagtctcc ttaccatgga caatacatgc aaagcaatat cttgttcagg 2041 ttagtacccg caaaacggga catgatgtga caatctgcat cgatcatgga ctactaaatg 2101 cctttacata gaaaaaaaaa aaaaaaaaaa // LOCUS HSEWS 2390 bp RNA PRI 28-JUN-1995 DEFINITION H.sapiens EWS mRNA. ACCESSION X66899 NID g547565 KEYWORDS RNA binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2371) AUTHORS Delattre,O. TITLE Direct Submission JOURNAL Submitted (26-MAY-1992) O. Delattre, Lab. de Genet. des Tumeurs. Inst. Curie, 26 rue D'Ulm, 75231 Paris Cedex, FRANCE REFERENCE 2 (bases 1 to 2390) AUTHORS Delattre,O., Zucman,J., Plougastel,B., Desmaze,C., Melot,T., Peter,M., Kovar,H., Joubert,I., de Jong,P., Rouleau,G., Aurias,A. and Thomas,G. TITLE Gene fusion with an ETS DNA-binding domain caused by chromosome translocation in human tumours JOURNAL Nature 359 (6391), 162-165 (1992) MEDLINE 92396239 FEATURES Location/Qualifiers source 1..2390 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="foetus" /tissue_type="brain" /clone_lib="cDNA, Stratagene 936206" /clone="BF1AC5" /chromosome="22q12" gene 44..2014 /gene="EWS" CDS 44..2014 /gene="EWS" /codon_start=1 /product="RNA binding protein" /db_xref="PID:g31280" /db_xref="SWISS-PROT:Q01844" /translation="MASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTY GQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTT ATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQ PSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPS SYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMG VYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFN KPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKR TGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPP MNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGN PSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRG GPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGP LMEQMGGRRGGRGGPGKMDKGEHRQERRDRPY" misc_binding 1127..1387 /gene="EWS" /bound_moiety="RNA" polyA_signal 2162..2167 /evidence=experimental polyA_signal 2350..2355 /evidence=experimental polyA_site 2371 BASE COUNT 645 a 589 c 668 g 488 t ORIGIN 1 agagggagac ggacgttgag agaacgagga ggaaggagag aaaatggcgt ccacggatta 61 cagtacctat agccaagctg cagcgcagca gggctacagt gcttacaccg cccagcccac 121 tcaaggatat gcacagacca cccaggcata tgggcaacaa agctatggaa cctatggaca 181 gcccactgat gtcagctata cccaggctca gaccactgca acctatgggc agaccgccta 241 tgcaacttct tatggacagc ctcccactgg ttatactact ccaactgccc cccaggcata 301 cagccagcct gtccaggggt atggcactgg tgcttatgat accaccactg ctacagtcac 361 caccacccag gcctcctatg cagctcagtc tgcatatggc actcagcctg cttatccagc 421 ctatgggcag cagccagcag ccactgcacc tacaagaccg caggatggaa acaagcccac 481 tgagactagt caacctcaat ctagcacagg gggttacaac cagcccagcc taggatatgg 541 acagagtaac tacagttatc cccaggtacc tgggagctac cccatgcagc cagtcactgc 601 acctccatcc taccctccta ccagctattc ctctacacag ccgactagtt atgatcagag 661 cagttactct cagcagaaca cctatgggca accgagcagc tatggacagc agagtagcta 721 tggtcaacaa agcagctatg ggcagcagcc tcccactagt tacccacccc aaactggatc 781 ctacagccaa gctccaagtc aatatagcca acagagcagc agctacgggc agcagagttc 841 attccgacag gaccacccca gtagcatggg tgtttatggg caggagtctg gaggattttc 901 cggaccagga gagaaccgga gcatgagtgg ccctgataac cggggcaggg gaagaggggg 961 atttgatcgt ggaggcatga gcagaggtgg gcggggagga ggacgcggtg gaatgggcag 1021 cgctggagag cgaggtggct tcaataagcc tggtggaccc atggatgaag gaccagatct 1081 tgatctaggc cctcctgtag atccagatga agactctgac aacagtgcaa tttatgtaca 1141 aggattaaat gacagtgtga ctctagatga tctggcagac ttctttaagc agtgtggggt 1201 tgttaagatg aacaagagaa ctgggcaacc catgatccac atctacctgg acaaggaaac 1261 aggaaagccc aaaggcgatg ccacagtgtc ctatgaagac ccacccactg ccaaggctgc 1321 cgtggaatgg tttgatggga aagattttca agggagcaaa cttaaagtct cccttgctcg 1381 gaagaagcct ccaatgaaca gtatgcgggg tggtctgcca ccccgtgagg gcagaggcat 1441 gccaccacca ctccgtggag gtccaggagg cccaggaggt cctgggggac ccatgggtcg 1501 catgggaggc cgtggaggag atagaggagg cttccctcca agaggacccc ggggttcccg 1561 agggaacccc tctggaggag gaaacgtcca gcaccgagct ggagactggc agtgtcccaa 1621 tccgggttgt ggaaaccaga acttcgcctg gagaacagag tgcaaccagt gtaaggcccc 1681 aaagcctgaa ggcttcctcc cgccaccctt tccgcccccg ggtggtgatc gtggcagagg 1741 tggccctggt ggcatgcggg gaggaagagg tggcctcatg gatcgtggtg gtcccggtgg 1801 aatgttcaga ggtggccgtg gtggagacag aggtggcttc cgtggtggcc ggggcatgga 1861 ccgaggtggc tttggtggag gaagacgagg tggccctggg gggccccctg gacctttgat 1921 ggaacagatg ggaggaagaa gaggaggacg tggaggacct ggaaaaatgg ataaaggcga 1981 gcaccgtcag gagcgcagag atcggcccta ctagatgcag agaccccgca gagctgcatt 2041 gactaccaga tttatttttt aaaccagaaa atgttttaaa tttataattc catatttata 2101 atgttggcca caacattatg attattcctt gtctgtactt tagtattttt caccatttgt 2161 gaagaaacat taaaacaagt taaatggtag tgtgcggagt ttttttttct tccttctttt 2221 aaaaatggtt gtttaagact ttaacaatgg gaaccccttg tgagcatgct cagtatcatt 2281 gtggagaacc aagagggcct cttaactgta acaatgttca tggttgtgat gttttttttt 2341 tttttttaaa ataaaattcc aaatgtttaa taaaaaaaaa aaaaaaaaaa // LOCUS HSEWSGAR 40141 bp DNA PRI 26-JUN-1997 DEFINITION H.sapiens EWS, gar22, rrp22 and bam22 genes. ACCESSION Y07848 NID g1666067 KEYWORDS bam22 gene; EWS gene; gar22 gene; rrp22 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 40141) AUTHORS Zucman-Rossi,J., Legoix,P. and Thomas,G. TITLE Identification of new members of the Gas2 and Ras families in the 22q12 chromosome region JOURNAL Genomics 38 (3), 247-254 (1996) MEDLINE 97131501 REFERENCE 2 (bases 1 to 40141) AUTHORS Zucman-Rossi,J. TITLE Direct Submission JOURNAL Submitted (05-SEP-1996) J. Zucman-Rossi, Inserm U434, Institut Curie, 26 Rue d'Ulm, Paris 75231 Cedex 05, FRANCE FEATURES Location/Qualifiers source 1..40141 /organism="Homo sapiens" /isolate="G6" /db_xref="taxon:9606" /chromosome="22" /map="q12" /clone_lib="g6" /germline mRNA join(<1..33,352..470,4102..4231,5690..5812,6596..6758, 7097..7194,7462..7714,7915..8311) /gene="EWS" gene 1..8311 /gene="EWS" CDS join(<1..33,352..470,4102..4231,5690..5812,6596..6758, 7097..7194,7462..7714,7915..7954) /gene="EWS" /codon_start=3 /product="RNA binding protein" /db_xref="PID:e280555" /db_xref="PID:g1666068" /translation="PMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFK QCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKL KVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFP PRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPF PPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGR RGGPGGPPGPLMEQMGGRRGGRGGPGKMDKGEHRQERRDRPY" exon <1..33 /gene="EWS" /number=10 intron 34..351 /gene="EWS" /number=10 exon 352..470 /gene="EWS" /number=11 intron 471..4101 /gene="EWS" /number=11 repeat_region 638..915 /note="subfamily Sp" /rpt_family="Alu" repeat_region 948..1173 /note="subfamily Sq" /rpt_family="Alu" repeat_region 1442..1731 /note="subfamily Sx" /rpt_family="Alu" repeat_region 1753..2027 /note="subfamily Sx" /rpt_family="Alu" repeat_region 2189..2476 /note="subfamily Sb2" /rpt_family="Alu" repeat_region 2497..2782 /note="subfamily Sc" /rpt_family="Alu" repeat_region 2939..3113 /note="subfamily Sx" /rpt_family="Alu" repeat_region 3201..3341 /note="subfamily J" /rpt_family="Alu" exon 4102..4231 /gene="EWS" /number=12 intron 4232..5689 /gene="EWS" /number=12 repeat_region 4622..4919 /note="subfamily J" /rpt_family="Alu" repeat_region 5022..5232 /note="subfamily Sx" /rpt_family="Alu" repeat_region 5273..5545 /note="subfamily J" /rpt_family="Alu" exon 5690..5812 /gene="EWS" /number=13 intron 5813..6595 /gene="EWS" /number=13 repeat_region 6088..6377 /note="subfamily Sx" /rpt_family="Alu" exon 6596..6758 /gene="EWS" /number=14 intron 6759..7096 /gene="EWS" /number=14 exon 7097..7194 /gene="EWS" /number=15 intron 7195..7461 /gene="EWS" /number=15 exon 7462..7714 /gene="EWS" /number=16 intron 7715..7914 /gene="EWS" /number=16 exon 7915..8311 /gene="EWS" /number=17 repeat_region 9390..9614 /note="subfamily J" /rpt_family="Alu" repeat_region 11596..11889 /note="subfamily Sbo" /rpt_family="Alu" repeat_region 12185..12279 /rpt_family="Mir" repeat_region 12544..12841 /note="subfamily Sb1" /rpt_family="Alu" repeat_region 13433..13716 /note="subfamily Sx" /rpt_family="Alu" exon 14832..14945 /gene="gar22" /number=1 mRNA join(14832..14945,15879..16530,18231..18338,18430..18526, 19254..19864) /gene="gar22" gene 14832..19864 /gene="gar22" intron 14946..15878 /gene="gar22" /number=1 repeat_region 15212..15320 /rpt_family="Mir" repeat_region 15658..15747 /rpt_family="Mir" exon 15879..16530 /gene="gar22" /number=2 CDS join(15898..16530,18231..18338,18430..18526,19254..19405) /gene="gar22" /codon_start=1 /product="GAR22 protein" /db_xref="PID:e284042" /db_xref="PID:g1707491" /translation="MADPVAGIAGSAAKSVRPFRSSEAYVEAMKEDLAEWLNALYGLG LPGGGDGFLTGLATGTTLCQHANAVTEAARALAAARPARGVAFQAHSVVPGSFMARDN VATFIGWCRVELGVPEVLMFETEDLVLRKNEKSVVLCLLEVARRGARLGLLAPRLVQF EQEIERELRAAPPAPNAPAAGEDTTETAPAPGTPARGPRMTPSDLRNLDELVREILGR CTCPDQFPMIKVSEGKYRVGDSSLLIFVRVLRSHVMVRVGGGWDTLEHYLDKHDPCRC SSTGPGISCPPIPAPAATPGTVTPQPPPPRAAPLVPAVMTQALAPGGSDPAGG" intron 16531..18230 /gene="gar22" /number=2 repeat_region 17773..18042 /note="subfamily Sp" /rpt_family="Alu" exon 18231..18338 /gene="gar22" /number=3 intron 18339..18429 /gene="gar22" /number=3 exon 18430..18526 /gene="gar22" /number=4 intron 18527..19253 /gene="gar22" /number=4 exon 18619..18791 /gene="gar22" /number=5 exon 18792..19253 /gene="gar22" /note="5B; alternatively spliced" /number=5 exon 19254..19864 /gene="gar22" /note="5C; alternatively spliced" /number=5 exon complement(20731..21370) /gene="rrp22" /number=3 mRNA complement(join(20731..21370,21655..21780,22821..23360)) /gene="rrp22" gene complement(20731..23360) /gene="rrp22" intron complement(21371..21654) /gene="rrp22" /number=2 exon complement(21655..21780) /gene="rrp22" /number=2 intron complement(21781..22820) /gene="rrp22" /number=1 exon complement(22821..23360) /gene="rrp22" /number=1 repeat_region 25055..25331 /note="subfamily J" /rpt_family="Alu" repeat_region 25488..25778 /note="subfamily Sx" /rpt_family="Alu" repeat_region 27361..27433 /note="subfamily J" /rpt_family="Alu" repeat_region 27501..27785 /note="subfamily Sx" /rpt_family="Alu" repeat_region 27818..28050 /note="subfamily Sb0" /rpt_family="Alu" repeat_region 28051..28251 /note="subfamily J" /rpt_family="Alu" repeat_region 28506..30413 /rpt_family="MER42A" repeat_region 28606..28884 /note="subfamily J" /rpt_family="Alu" repeat_region 28912..29202 /note="subfamily Sx" /rpt_family="Alu" repeat_region 29702..30009 /note="subfamily J" /rpt_family="Alu" repeat_region 30428..30716 /note="subfamily Sx" /rpt_family="Alu" repeat_region 30966..31141 /note="subfamily J" /rpt_family="Alu" repeat_region 31152..31434 /note="subfamily J" /rpt_family="Alu" repeat_region 31936..32217 /note="subfamily J" /rpt_family="Alu" repeat_region 32256..32549 /note="subfamily Sx" /rpt_family="Alu" repeat_region 32575..32841 /note="subfamily Sx" /rpt_family="Alu" mRNA join(35651..36688,37505..37513,38171..38325,38416..38502, 39242..39326,39580..39709) /gene="bam22" /product="beta adaptin protein" gene 35651..39709 /gene="bam22" exon 35651..36688 /gene="bam22" intron 36689..37504 /gene="bam22" exon 37505..37513 /gene="bam22" intron 37514..38170 /gene="bam22" exon 38171..38325 /gene="bam22" intron 38326..38415 /gene="bam22" exon 38416..38502 /gene="bam22" intron 38503..39241 /gene="bam22" exon 39242..39326 /gene="bam22" intron 39327..39579 /gene="bam22" exon 39580..39709 /gene="bam22" BASE COUNT 9163 a 11174 c 11126 g 8678 t ORIGIN 1 gacccatgga tgaaggacca gatcttgatc taggtaattt tgaattctag ttgtgcttca 61 tatcgtgctt tgtaaattaa tggtacagag gtaaatgcat gcgtagagtt cagcagcctt 121 atagaccagt gtgatattct tgctgtcaag gacagtttgg gaaatcctat gtgagcatct 181 actcataatt gccttaagtg aagtaaacca aaagtttgat agcattcttc ttagtatgct 241 tggtagtttt cttagattta tgatgaatat ccttgggcag tagtgatcag gtagttaaga 301 aacccctata gatacatgat aataattctc ctgtcttgtt gtctctgaaa ggcccacctg 361 tagatccaga tgaagactct gacaacagtg caatttatgt acaaggatta aatgacagtg 421 tgactctaga tgatctggca gacttcttta agcagtgtgg ggttgttaag gtcagtaaaa 481 gcataaccag gtcatctggc agaactttaa accacagagc attttaaaga gattgaattg 541 atgttgagag ttgttttcta aggttgcctt tgcctgactt ctttctaggt atcttatggt 601 ttgttccttt ttaaaaaaga ttagcctttt tttttttttt tttttgagac ggagtttcct 661 tcttggtgcc caggctggag tgcaatggcg caagtcttgg ctcactgcaa cctctgttca 721 agcgaatctc ctgcctcagc ctcccaagta gctgggatta caggtgtgcg ccaccagcta 781 attgtgtctt tttagtaaag atggggtttc accgtgttgg gcaggcgggt ctcaaactcc 841 tgacctcagg tgatctgccc accttggcgc ctcccaaagt gctgggatta caggtgtgat 901 cccctacgcc cggccttccc tcctccaccc ctagggacag agtgttgctc aatctcagcc 961 tcccaggttc aagtgattct cccaccacag cctcttgagg tagctggact tataggcaca 1021 tgccaccaca cccaactaat ttttatattt ttagtggaga tggatttttg cctgttggcc 1081 gggctggtct ctaactcctg gccacctcaa gtggccacct acctcagcct cccaaattgc 1141 tgggattaca agcatgagcc actacaccca acctcaaatt attgtgtctt tatgctaatt 1201 cttcatagct agtaacgcag attttttctt cagggatttt gactgttttt cagttcttgc 1261 tcttctgtat tagagttagt ttatgaaccc cataaacctc aaaaccttaa aaccccaatt 1321 gaatatttac taaatctatg aagtaatttc aggagaattt acatttttat gatactgagt 1381 gatctttttt tccccatgaa gattttattt ctccatttat ttgtatattt atttgcttat 1441 ttatttactg agacagtctt gctgtgtcac caggctggag tgcagtggca cgatctcggc 1501 tcactgcagc ctctggctcc tgggttcaag cgattgtcct gcctcagcct cctgagtagc 1561 taggattaca ggtgcccgcc accacaccca gctaattttt gtatttttag tagagatggg 1621 gtttggtgaa accatgttgg tcaggctggt cttgaactcc tgacctcagg tgatctgccg 1681 cctcaactcc caaagttttg tgattgcaga catgagccac tgtgcctggc ctgtctctat 1741 ttattatttt attttacttt gagacagagt ctcactcctt cacccggctg gattgcagtg 1801 gcgcagtctt ggtttactgt aacctctgcc tcccgggtct ccttgcctca gcctcccgag 1861 tagctggaat tacaggcgcc tgccagcacc cctggctaat ttttgtaagt agagatggga 1921 tttcaccagg ttggtcaggc tggtcttgaa cttctgacct cacgtgatcc acccacctag 1981 gcctcccaaa gtgctgtgat tacaggtgtg agccaccaca cccagcctat ttagatcttc 2041 agtaaagact atcgatttcc ttaaaggtct tgcaaatttt actagtttta cttctaggtt 2101 tcgtttattt tcaagttgct gtggttaagg gtcttcgtta aatgacactt ttctggttat 2161 tactggtgta tagaaataca gttgatttgg ccgggcgcgg tagctcacgc ctgtaatccc 2221 agcactttgg gaggccgagg cgggtggatc acgaggtcag gagatcgaga ccatcctggc 2281 taacacgatg aaaccccgtc tctactataa atacaaaaaa ttagccaggc ttggtggcgg 2341 gcacctgtag tcccagctac tcggaggctg aggcgggaga atggcatgaa cctgggaggc 2401 ggagcttgca gtgagctgag atggcgccac tgcattccag cctgggtgac tgagtgagac 2461 tccgcctcaa aaaaaaaaaa aaatacagtt gattttgggc gggcgtggtg gctcatgcct 2521 gtaatcccgg cactatggga ggccgaggtg ggtggatcac gaagtcaaga gatcgagacc 2581 atcctggcca acgtggtgaa accccgtatc tactaaaaat acagaaatta gccgggtgtg 2641 gtgacacgcg cctgtagttg agctacttgg gagtctgagg caggagaatc agttgatccc 2701 aggatgcaga ggttgcagtg agctgagatg gcgccactgc actccagcct ggcgacagag 2761 caagactcca tctcaaaaaa aaaaaggaaa tacagttgat tttggtaaat cttatataag 2821 gtgatttctc cctgcttgta ggtgtgtgtg tgcacagata tcacacacac ctcaaaagta 2881 aaatgctttt agtgtctact gtcacattct accctgtgca agactccttg tttgcttttt 2941 tgttttgaga tggagtcttg ctctgttgcc cagactggag tgcagtggca caatttcagc 3001 tcactgcaac ctccgcctcc tgggttcaag cctcaagcct cttgagtagc tgggagtaca 3061 ggcttgagca tattaaaaag actgcttttt aatatgtata taacatgtag tttattgagg 3121 aatgtggagt ttatcctggt taagaaagta gtatttacca tcaaaatttt ggaaataaag 3181 atgttaaaat agatttccaa gctggataca gtggctcaca tgtgtaatcc cagcactttg 3241 ggaggccaag gcaggtggat cccaaggatt gcttgagccc aggagtttga gaccagcctg 3301 gacaacagag tgagactctg tctcacaaaa aagaaaaaaa tgcatttcca cattattcat 3361 accacagttt tgccatattt tcttttcagt ttttaggtat cacgttcctt tttttaagtc 3421 agaggtatca atatatatat cctgttgtaa tcttttttaa gtcctataaa tagattgctc 3481 ttgtgttact ggatatatgt aaacaattac agactaaact gatcgtgtgc atatataatg 3541 caaaaaagta cacgtgaaga tattcttata caaggtagct cctgaattca agtcaaccac 3601 atctgatatt cagctgaaag actagtcatc tgttctaact ggaggaagta gtggctgtgt 3661 tagactaacc aagaatccct gtagttcact cacagcctgt acactgtcct tcacggttta 3721 aggataattt tctttatgcc agtcatgtct tatgtggtgg tatcaaagtc tgactacccc 3781 tacgaggtgg ctgtgctcag tggatacctg tctgggcata gaaaatgcca gggcaggcaa 3841 accagcatca gaagatggct aaggcaggag tggggcctat cctgttcact tccacacttt 3901 ggtctacttt ggtttttatg ataaagcagg gatagaggag aaaataactt agaattctga 3961 aaaacttaaa agcggagaaa cggtgacatt tagtgactta ctacatgtga gaaattggta 4021 cttttcctgg attttgcctg gacttttttc tcccaaatta gtatattcta gtcatgccta 4081 actatgctat tctttgtcta gatgaacaag agaactgggc aacccatgat ccacatctac 4141 ctggacaagg aaacaggaaa gcccaaaggc gatgccacag tgtcctatga agacccaccc 4201 actgccaagg ctgccgtgga atggtttgat ggtgagatgt actcactggc attcttaatc 4261 tccctggcta tagaatatgg catgagggag aaacttgtga accataggag caagaagacc 4321 ttccatctct tcctggggga ggtagatggc cggtctccct gcagtagtag tagcacccag 4381 ccattgaccc tggatttgga gatcccttta ttttgaggca ttagtattac aaatcaacct 4441 tgcttaaagt ggcaaacact tcctaagcac agattatcag aaggtacaga aaacccttca 4501 aaagaacatc ttagccagtg tacatttgca tatagataaa tatatatatt taaatgaggc 4561 cttgacatta atggtgggat ttcacttcag acaaaaccaa cttgaaagca ttgaatgact 4621 tggccatgtc cagtggcttg cacctgtaat cccagcactt tgggaggcca acatcggtgc 4681 attgcttgaa tccaggggct tgagaccagc ctggggaaca tggcaaaacc ctgtctctac 4741 aaaaaaaata caaaaattag ccaggtgttg tggcacactc ctatacctat agtaccagcc 4801 actcaggagg ctgaggtggg aggatcactt gagcccggag agtttgaagc tccagtgagc 4861 tgagattgca ccactgcact ccaacctggg caacggagca agactctgtc tcaaagaaag 4921 tgtcaaataa caagtcctgg ttctgttttc ttggtcatgc agtcagcttc ccaccagctt 4981 attttcctcc ttatggtaga cagtgttttt tgcttgtgtt gttttttttg agatggtctc 5041 actttgtcac ccaggctgga gtgcagtgac acaagccatc tcagcttact gcaacctcca 5101 cctcccaggt tcaagccaat cttgtgcgtc agccactcga gtagctgctg gactacaggc 5161 gcatgtcacc acgctgagcc taacttttgt attttttagt agagatgggg tttcaccatg 5221 ttggtcaggc tgtaggaagg tttttaagtg tatttaagta tggccttttt ctggccaggc 5281 acagtggctc ttgcctataa tcctaccact tcaggaggcc aaggcaggag gattgtttga 5341 accaaggagt tcactagcag cctgggcaac ctagtgagac actgcctcta caaacaattt 5401 tgtaaaatta gtcgggcatg gtgatgcgtg cctgtagtcc cagctacttg ggaggattgc 5461 ttgagcaagt gagattgaaa ctgcagtgag ctgtgccact gcactccagc ccgggcaaca 5521 gagtgagact cccatctcaa aaaaagcctt tttctggcct tgtcattaaa gatcttagag 5581 aagattacag gcagacctaa tgcatctggg agtggtcatt tgatgatgat gtggagttgg 5641 tgaacaggga gtacagggga gtaattgatg ttctgttgtc ttgttccagg gaaagatttt 5701 caagggagca aacttaaagt ctcccttgct cggaagaagc ctccaatgaa cagtatgcgg 5761 ggtggtctgc caccccgtga gggcagaggc atgccaccac cactccgtgg aggtactttt 5821 tctgagctcc tatgttgcat taaaaggttt tcagtacact tcataccctt gagaaacttg 5881 attattagag tgaagaaata taaaattgtg tgtagagtca atactagact atcgagagct 5941 aacaatgaat gtttgttggg aataaaagga agagaagaac atgggaggct ggaagccact 6001 ctgcctgtca actccagact gccatttatt cagctttggt tgtgtctgta tagacatgcc 6061 tattccttac agaattgtgg gagttcagcc aggtgcagtg gctcacgcct gtaatcccag 6121 cactttggga ggctgaggtg ggcggatcac ctgaggtcag gagtttaaga ccagcctggc 6181 caacatggtg aaactccctc tctactaaaa atacaaaaat tagccgggta tggtggtgca 6241 tgcctgtaat cccagctact cgggaggctg aggcaggaga attgtttgaa tctggggggg 6301 tggaggttgc agtgagcaaa gatcgtgcca ctgcactcca gcctgggcaa cagtgcgaga 6361 ctccgtctcc aaaaaaaaaa aaaaaaaatt gtgggagctc tgtttctgta gagcacgtgg 6421 aacacgctcc tcacagggaa gggggctgat ggcctgagcc acacggaaac acgggacagg 6481 tgatggggaa atgacagcag tagtatctgt gggtttactt agtgattttt atttcctata 6541 gcaaatttgg tgctacagag aaatgatttg ctgtttcttg ttgttcttgt tgtaggtcca 6601 ggaggcccag gaggtcctgg gggacccatg ggtcgcatgg gaggccgtgg aggagataga 6661 ggaggcttcc ctccaagagg accccggggt tcccgaggga acccctctgg aggaggaaac 6721 gtccagcacc gagctggaga ctggcagtgt cccaatccgt atgtacttgt cttggcaaat 6781 tgatacccta cgagtgaagc cacccttccc tcaccccatc cccactctag agtggattgc 6841 tctgtctaga ggaacagaat gatgaccctg atggctggtt agggacacta gtcagccatt 6901 cactggacgc ttcagagcct tctgaagatt gatttgacct gtcctgtggg tgcaatgctg 6961 cctgaggctg tgccctaaag catgggtgta catagatcct cttgatagtg agtgtgtacc 7021 tgttcacaca ccacctttcc ttgtttatct tccttagttc aattggtgat ttctgctgtg 7081 atgtaattgt atgcaggggt tgtggaaacc agaacttcgc ctggagaaca gagtgcaacc 7141 agtgtaaggc cccaaagcct gaaggcttcc tcccgccacc ctttccgccc ccgggtaggt 7201 gcaggtttca tgagtgtccc ctcagcttcc tggtgctaaa cctcttttct tatttgtggg 7261 cttggtaaac tgcagttgcc ctctgcttaa caactttgag ttgtcgtgtc ctcatttcta 7321 aattgtcagc ccgatgccga gattgagtga agtgtctggt ttgttctgct gtgagagaag 7381 gaagcagagc agcttccaca gtgtccacag ggcctctgca gccacccact gactgctttc 7441 gccctgctat tctcacctta ggtggtgatc gtggcagagg tggccctggt ggcatgcggg 7501 gaggaagagg tggcctcatg gatcgtggtg gtcccggtgg aatgttcaga ggtggccgtg 7561 gtggagacag aggtggcttc cgtggtggcc ggggcatgga ccgaggtggc tttggtggag 7621 gaagacgagg tggccctggg gggccccctg gacctttgat ggaacagatg ggaggaagaa 7681 gaggaggacg tggaggacct ggaaaaatgg ataagtaagt gctggtgaaa agcagctgtg 7741 ggccgccagg cacagtaaga ggacagccct tcccagcttg gttggcgcaa gtcctcatgt 7801 cgctaggaag cttgtgatag tggttgggag gagccaggaa ggggcacctg ggggctctgg 7861 aagggcttcc tcaccccttc ccattctaac cgaagggccc tctttacctt gcagaggcga 7921 gcaccgtcag gagcgcagag atcggcccta ctagatgcag agaccccgca gagctgcatt 7981 gactaccaga tttatttttt aaaccagaaa atgttttaaa tttataattc catatttata 8041 atgttggcca caacattatg attattcctt gtctgtactt tagtattttt caccatttgt 8101 gaagaaacat taaaacaagt taaatggtag tgtgcggagt ttttttttct tccttctttt 8161 aaaaatggtt gtttaagact ttaacaatgg gaaccccttg tgagcatgct cagtatcatt 8221 gtggagaacc aagagggcct cttaactgta acaatgttca tggttgtgat gttttttttt 8281 tttttttaaa taaaattcca aatgtttata aagagtcatc cttctcggcc tctgttccac 8341 agtcactgtg tgtctgctgg gagcatgctc ccaccccacc caggagaggg gtgccttcca 8401 ggtaaacggt tggttcaggg tttgtgtgag gggaaataag cggttgtgac atggaaacag 8461 actctggccc tgatgcagcc tctgagaccc actagtgtcc aaaggttcaa ggggagatcc 8521 aggtaaggga ggcacaggaa gaactggaaa taactttgcc ttccatggga tcacggatca 8581 ggcactagga gcatctaagg ggctgctccc tcaggtgagg gacgatccct ctcaggcgga 8641 ctgggtcggg tcctctcatc cccagcactt ctgggcatgc attgtgtgca cctctgctgt 8701 ttctttccct ggtcatttcc tgttctaata tgtgtttgac tccctgcgtt tggaaaaccc 8761 tcccttaacc agccatcatc cccttccctc catcaaacca tcttcctggc cctttctaca 8821 ggaagcacat ggaaagagct ttgtccaccc ctccctctct ggaacccact tcattcttca 8881 cctgcactaa aacaagcagc tctcagtcac caaccacctc taatgttgcc caatcaatgg 8941 tctttccttc tctgttctcc catctccttc attgcttcct ttttctccta tccctaaatg 9001 caggaaggcc ccaaagctag cattgcgctc ccctgagccg ccacactgcc tggctgtcct 9061 ttccatactg tggctttagc cctttctccc tagggccaac tcggcttggg tgcatgatgg 9121 gtgtagagca gggcttgctc ttgtcccgtc ctgcctacac cccagttcag taaacacaca 9181 cctgccccaa accttggagt catccatgac ccattctcaa accccagccc ccaacctagc 9241 tctcttgcaa atccaggacg gtgcaaatcc acccacagcc cccacctcca taacgccccc 9301 tcactgggtt ccctgcttct cccctcagta tctccagccc ttcctcagaa gccagaccaa 9361 ggtttgtggg gttttttgtt gtttgagatg gtcttgctct caccaagact ggagtacagt 9421 ggtgcaatca cagcttcacc tcctgagctg agtagctggg actacaggtg tagcacccag 9481 ctaatttttt tgatgttttt gtagagacca ggtctcactg ttgcccaggc tggtctcaac 9541 ttgggctcaa gtgatcatcc caccttgacc tctcaaagtg ctaggattac aggcgtaagc 9601 caccacactt ggcccaaacc aacttgacac gggaggtcat ctccagtctc aaaaccattt 9661 ggtggctcca agcacgcttg ccatgcccca aaagacctct gggaccccag ctgccctgct 9721 ctaccttgca cgccagtcca gcactgttcc tcaaggggcc gaggacgctc tcaccgcagc 9781 cttggtgtgt cagagccaag atgcacacat aacctaggag gcccaaactc gactagggca 9841 tcactgggtc cctgcgtcgt gccaggcact gcgctaagta cctctgtgcg tttaattcct 9901 aataaccttc aaggcaggta acataggaca gccacacgtc agctgagatg ggaagagaga 9961 ggtcaaggga ggacttgcca ttgagaggct gaattgagct aagggtggtc gccctccagg 10021 ccccgcacgc tcaccctccc accaacccac tgtatacagg cacctgcttc tgaaaaggat 10081 gggtgctctg acatgacccg catgctgcca gcactaaggg aagacaagca gcctgctctg 10141 gaagctgcta ctgcacaagc atctgtcaga ctctggcatt gcataggctc cccaggagta 10201 gccttatgtc cctgcccgag aggggtttgt cactggcact aaatagccct ggtagagatg 10261 tttacagcag gcaggtctat ttttctcagc ccaaagaaac ttctggatgc tctgggcaca 10321 gattccctca ggcagcatgg atgccaagcc tgtctcaaag ggcggcagga agctggctgg 10381 tgccagggca tgaaggaggt gggcagggca gagagcagag caggtggaga aaaagcctag 10441 tctctaccca caccctggtc aaactcccca aaagcaccta gactaaaagt tgccaacaca 10501 aacttgaggt cccacgaggc cacatggcac aggcctgggc aaaggtggcg gggtccatgg 10561 ggagactgat ctatgcccac agggagtggc tcccctgctc gacggggaag agcacagggc 10621 tgcagcaaca gcagtaccag cagctgtggg gctgaggact gtgcagtctg tgacagggac 10681 aacagctggc cctcgtgctg tcccctccta ccccactcct gacagatctg ctgggcgcag 10741 ggccggcagt gaggacagtg tgatgcaggc aggttggaaa gagcccactt gggttttcca 10801 gttttgcgtc tctcttgtga agggcactgg aggcagctca gggtcacctg tcccacagcc 10861 agagcaggag ctgaccagca ggtgtccttg gagctgagcc cacacctgga cacagcagcc 10921 ggaagcctcc tgcgagtccc tcacttagcc tgggctgtgc tcagagcctc agtccagact 10981 ggagaatttg gagttctaat ccaaccagtg ccagagcttc ctggagctgc ccggtagcag 11041 ggatgtaggc cacaatgaga gttttgccac agccatcatc tgacctctag ctgcaaccaa 11101 ggccaagcag gctgctggct ggcacccagc agtgggcgca agaaactggc tatgtgccct 11161 ctggccaaag gccagaagga gcccacatgg gagccacagc attctgtgca tagaggtggg 11221 tggcctcaag ggtccgtgca gatgggctca gggtctctat agcccaaacc tcctgtccct 11281 ccagcagttt ctgtggtttc tcttggtcct gcaaggaaca agggagacca tgtagaggct 11341 gaggctttcc cagaggtggc cccacaggcc ctgggccacc acacgacaga gggggtcaaa 11401 gctacggtct accatgccct gggccagggg tctgaccccc cagtgaggtc tgcatcatag 11461 cagctacatg tcctagaatc aggtgggagg ctgtggtcta gaacaggcct cttatcacaa 11521 gccacagctc atgggagaga aggaggaagg aatctcgtgt gtgtgtgtgt gtgtgtgtgt 11581 gtgtgtgtgt gtgtgtgtgt gtgtgacgga gtctcgctct gtcgcccaga ctggagtgca 11641 gtggcgcaat ctctgatctc tgctcactgc aagctccgcc tctcgggctc acgccattct 11701 cctgcctcag cctccagagt agctgggact acaggcaccc gccaccacac ccagagaatt 11761 ttttgtattt ttagtggaga cggggtttca ccgtgttaac caggatggtc tcgatctccc 11821 gacctcatga tccacccgcc tcggcctccc aaagtgccgg gattacaggc gtgagctact 11881 gcgcccagca gaaggaattt ctttataatc cctcctcccc agagaagtgg atatgctgag 11941 ccaggccaca ctgagctggg tagtggcatt atttttcaca accccatgat gtaactacta 12001 ctactgtccc attctgcaga tggtgccact gaggctcagg gagcttcacc atcccaccca 12061 atgaccaatg acagagcctg ggttccgtcc aacttcaccc atgtccaaaa ccccaaaatg 12121 cagcaaggta ggtataagca ggagtcaaat tcctgcacag ctgctggcta cacatatccc 12181 atgccttctc tgggcctcag tttccccaaa ttgtaaaatg agaacaccta agagtatcat 12241 agctcatcgg ggtattttga agatgaaact gttaaatgtc cacaccctga ttcaccccca 12301 ccccaaacct gcaaccctcc tccacctcca gcagcccttg cctccccagg tgttagcacc 12361 tccgccctcc cagtgctcag ccctgggagt tgtcctggat tccgtcctca cactcaacca 12421 atccaccagc acatcagctc tcccctcaca atatggccag aatccagcgc cctctcacca 12481 cctggtttcc tcaggctcca gctcctgccc tggcccacca cagcagcccc agggaatgtt 12541 tttttttttt tgagacggag tctcgctctg tcgcccaggc tggagtgcag tggcacgatc 12601 tgggctcact gcaagctccg cctcccaggt tcacaccatt ctccttctcc tgcctcagcc 12661 tgccaagtag ctgggactac aggtgcccgc caccacgcct ggctaatttt tttgtatttt 12721 tttagtagag acagggtttc accatgttag ccaggatggt ctcgatttcc cgatctcgtg 12781 atccgcacgc ctcggcctct caaagtgctg ggattacagg cgtgagccgc cgcgcccggc 12841 cgggaatgtt aagacctaag ccaaccaccc ctgctctaaa tcactgcatg gctcccatgt 12901 ccctcaaaag agacacggcc cccaaggccc tgcttaaccg gccctggggt ctctctaatc 12961 tctctacctc ctccctcacc cacctccctg cactccagcc tcagccacct ccccagtctt 13021 cttgagcacg ccggacacat gcccaccatg gggcctttgc actggccgtt ccctctccct 13081 ctccagatat ccccgacctt cctccctcac ttcctttcct gatgagggat tttaaattgc 13141 agccccctca cacccacacc ctcacccggc tccatccctg tcttctctgc acctcatccc 13201 tgtctctgcc ctggggctgc ctccctgcca gacttggaca acaggccctg agagctggct 13261 ctgcttggtt ccctgctgtt ttgctgttga tttccagtgc ccaaagagac ccagcacaca 13321 gtaggtgcca cgcaggtgtc agccttgagg attgctagtt actgccttcc tcactggagg 13381 aagtgccatg gggggcgctg aggtgagaaa acgcctttga aaaggtcaca caggccaggc 13441 aagtggctca cacctgtaat cccagccctt tgggaggtgg aggcgggcag atcaccttag 13501 gtcaggagtt cgagaccaac ctggccaaca tggtgaactt gtctctacta aaaatacaaa 13561 aagccgggcg tggtggcagg tatcagtaat cccagctact cgggaggctg aggcaggaga 13621 atcgcttgaa ccccggaggt agaggttgca atgagcccag atccagccac tgtactccag 13681 cctgagcgac agagcgagac aaaaaaaaaa aaaaaaaaaa aaaggaagaa aataaaaaag 13741 aaaaggtcac aaggactctc tgggatgcct gtgaatgcct ggcaccaaga ggcctggctc 13801 acaggtgctc aaccaatgct gctgatccgt ggatgagggc gtccccataa tcgtcatgtt 13861 caaatcctgg ctccaccagc tgctctcctg actgtgcccc agttgggcat gagtgtcctt 13921 cacagaggtg acacaccagt ccattcaggt caagggcgca gagcagtcct ggcagcacac 13981 agtaggagat ttattggagc tctgagcctg gctgtggttg gcggcttccc tgcaggcctg 14041 ccccacccgt gtagcatggc tgccgcccag acccgcccga cgtcccgccc agcccagccg 14101 gcagcagcag ggggcaggat ttcccgctga tcccggcggt ccgtcctctg cagtttaaaa 14161 agagcccagc ggggcttaac cctcccgcgt ccggccggcc tccgatgccc cccgtcccct 14221 gttctgtgcg gaactccggc tttggccatc tgcctacttc tctgagcttc agtatcctca 14281 tctgtaaaac cggttcccct catttcaggg tcaaaaatga gcattcagtg agcacaagga 14341 gccctcagtc gccaggggca tcctccccct ccccgcggcg gtgccgttta tctggccagg 14401 ggaggtccac ccgggaggag gtgactttgc agcctccagc cgctatcgct tcctgccttc 14461 atcttgggaa caggccgtgc tgtgcgccag cccggggagc ccaccgtctg gggagggggg 14521 cgattaaaag gtcccaacct cctccccagg agaccaaaga ggttggacgg ggccccgggt 14581 actaggctgg gtgggggttg ggcgcctggg ccggtcgcag ttgtctgggg gccctggggc 14641 tctctgcgtc gagagcgctc gaagacccgg gattcctggc ccgatcgcgg gcggggggag 14701 accccagctc caccccagct cccgccggct cggggaaggg gcggcccctt taagagcgcg 14761 cggccccgcc cgccccctcc gggcaggatc cgaattccag ggaggcgggg cggagacggc 14821 ggcgaggagg aggccgcggc gcgggacgca tagagctgcg gctcgggcgg cgcctccctg 14881 cggcggcccg gcccggctcc ggcccccgct ggggcaatgc tccccggggc cgcgggatga 14941 gccaggtgag cgcggagacc cccagcactc cctgcctggt gcgcatggac gggggtgggg 15001 aggcggggcg ggaagcggcg tcctcggcct ctgttccccg gggctgtgtg accttgggcg 15061 gccccgcacc ctctctgggc ccagagcccc agcaccggca gggcctgcgg aggccgctct 15121 gagtcctggc cccgcaggca ccggcctgga gcgctgggga ggcttcttgg aggaggcggg 15181 gcctggccct ggccccggat cgccgagacc ggccacgctg cgccgcctgg acgccgggct 15241 cctccaccca cagctctgcg accttcacga gctcgaccct ttctgcgcct cagtttcctc 15301 atctataaaa tggagataat gcctccgcgc gctgagcccg agcgcggtcg tgctgttaca 15361 ctattgctgc tgctatgtcg ttaatgacgg cacagccctt atggaagggg ggccaaggaa 15421 ggcacgggga ccttcccgca ggacatgggg tgggcaccag ccctaagcca tgcacataac 15481 ccactctgag gtcaggacct gggttccttt cgtgtccact gggggcctta gccgttgtct 15541 tcacttctct aggtcccagc ccttcacaat gggcgcgcct ccctgcctga gagggaggcc 15601 ccagccaagc cccagtttcc accgcaaaaa gccacacagc aaacacttaa tggaccctaa 15661 atttataaaa attatcccat agagtcctat gaggcaggga atgttactgt acccatttcg 15721 cagatgagga aacttggggc accgacattt cccagagcca acatgtaaac caggcatttg 15781 gcctgagtga cagactggtg caggtggcca gcagcttaat ttgtactgag cgcttactgg 15841 gtccccacag cgatcctgaa ctgtgttcct gcccacagtg actcggccgg tccgggcatg 15901 gcagacccag tggcgggcat cgcgggctcg gcggccaaga gcgtgcggcc atttcgctcc 15961 agtgaggcct acgtggaggc catgaaggag gacctggccg agtggctcaa tgccttgtac 16021 ggcctgggtc tcccgggtgg tggcgatggc ttcctgacag ggctggccac gggcacgacc 16081 ctgtgccaac atgccaacgc cgtgaccgag gctgcccgtg cattggcagc cgcccgcccg 16141 gcccgaggtg tggccttcca ggcgcacagt gtagtgcctg gctccttcat ggcgcgcgac 16201 aacgtggcca ccttcatcgg ctggtgccgc gtggagctgg gtgtgccgga ggtgctcatg 16261 tttgagactg aggacctggt gctgcgcaag aacgagaaga gcgtggtgct gtgcctgctg 16321 gaggtggcgc ggcgtggggc acgcctgggc ctgctggccc cacgcctcgt gcagtttgag 16381 caggagattg agcgggagct gcgtgctgca cccccagccc ccaacgcccc tgccgctggg 16441 gaggacacca ctgaaaccgc ccccgcacca gggactcctg cccgcggccc ccgcatgaca 16501 cccagcgacc tgcgcaacct cgacgagctg gtgagtcccc cagcaccagg tgccagggac 16561 ccgacagcac agccagtgcc tgcaatctgt ccctgaacct cccagggtcc accctgctat 16621 ctgcagaaca gtctgcccca caaaaagagt gcacatgttg agcaccaaac cgaggaatgt 16681 atacctgacc ccaaggaaat gggtagccca cagagaggga ggtgccgggg agatgacgca 16741 gcccagccgg gcctgggaca ggatgcgggc tcctggcacg atcttccctc actttgtgtg 16801 ctaactgtgc ctcaactgag cttgtgcatt ccttcttctg gggggccgca tgatcggggc 16861 gcctcctctg tgccctccct gcagtcagtc ttaggcagat gtacccacaa cctcatggag 16921 gctcgaggtt gagcgggtgc tgtgatgctc ggttgagacg aggcatggaa aaacctgggc 16981 aaggtgcctg gcacacagta ggtgccccag ggggctctgc aggtctcccc actccccagg 17041 cacccctggg aagcctagct gggttgctgc tgttgctggg agcagggatc ctgtttggaa 17101 ttacaggatt cgttctactc taagcatagg cgttgctggg ccctgccccc aagtctctcc 17161 tctcccagga tctgggtggc ccctgccacc cctgagggct ggaccaagag tttccttctt 17221 atcagaaggt caggactaca atgtgcctgc ctgggaggca gatagcggat ggcagtgggc 17281 agagcctggg gccccacccc ctgtcctgga ccagggtcac tcaaggcccc acctgcttat 17341 ttcctctggc ggcaccactg ccgctgtggg cggggaggcg ggcagggctg cccacatagc 17401 cagcttggca gcgctctggc tctggtgggg ggctgtatgt ggtgcgtgct cacccggcca 17461 gtggtctgtg tgtgcatgtg tccgatgtgt ctgtggtacg cctccacatc agtgcccacc 17521 caggggcatg gggccccctg cacctgtgtc ctgtagcctg cctggccttc ctgtgggccc 17581 cagccttctt gtctgagaag tgggtgaagc gggagtgagg acgggctctg ggccttcccc 17641 aggtctaggg gctgtgggat gttcgtcacc gggtctcaga ccatggcctg accacagtcc 17701 catcatctca tcctgggcag ctccctggac ccagagagcc aagaaagggg agccggtgaa 17761 aggtcatgtg tgggccgggc gtggtggctc gcacctgtaa tccagcaatt tgggaggccg 17821 aggcgggtgg aactcctgag cctgaccaac atggagaaac cccatctcta ctaaaaatac 17881 aaaattagcc gggagtggtg gtgcatgcct gtaatcccag ccactcagga ggctgaggca 17941 ggagaattgc ttgaacctgg aaggcggagg ttgcggtgag acgagatcgt gctattgcac 18001 tccagcctgg gcaacaagag caaaactcca actcaaaaaa aaaaaaaaat aaaaaaaaat 18061 aaaaaaaata aaagaaaggt cacgtgtgga agctgtaccc catccggggc aacactgcct 18121 ccatccactt acccggaaag cctgacttcc tcctgaccct ggaatggctg acattaagcc 18181 ccaggaggag gacttgacct ctgaccccta ccctctctct ctggcctcag gtgagggaga 18241 ttctgggccg ctgcacctgc cctgaccagt ttcccatgat caaggtctca gaggggaagt 18301 accgtgtggg ggactcgagc ctgctcatct ttgtgcgggt aagggcctgg ggccgcccca 18361 gcgggcagca gccaaggtgg tgggctgcgg ggcgcccggg gcacagccgt gacctgccca 18421 cacctgcagg tgctgaggag ccacgtgatg gtgcgagtgg gtggtggctg ggacacgctg 18481 gagcattacc tggacaagca cgacccgtgc cgctgctcct ccactggtca gtgccagggt 18541 ggggctgggg ctggacgggc aggggacttg cttctgtggc tctgtccctc acatgctgcc 18601 tgtcctctct ccccgcagct catcgcccac cccagccgag ggtctgcacc ttttctccac 18661 agagggtgtc gcccaccacc agtccccgcc ctgctagccc agtccctggg agtgagcgcc 18721 ggggctcccg gcctgagatg actcccgtta gcttacgaag cacaaaggag gggcccgaga 18781 ccccacccag gtgagatgca ggaggacgag gagtgagggg tccagagggt gggggggcgt 18841 cagccctggc ctgcatgatg ggttgcctgt gcgccagagt gactcacacc ctgggaaaag 18901 ttcccgtggg tgggggcggg cctggcttcc catctccatg gcaacccaac caaaccaaca 18961 acacagctgg ggcgggccct gtggctgggc ggcagccagt ccagctcttg cttcctctgg 19021 cccgtcctgg gaaggggggt gtctagagcc caggaaactt cttcttccgt ggcttttggg 19081 gccctgggcc gctggaggaa gctgctcact tctccctgga agtccccagg acagaccgat 19141 gcccctgaca gccccgtcca ccagccacat accctgctgt tcctctccct gcctgttctg 19201 catgccagtc cttccgtggt accccatctg tctctattgt ccccctgccc caggccccgg 19261 gatcagctgc ccccccatcc ccgctcccgc cgctactccg gggacagtga ctcctcagcc 19321 tcctccgccc agagcggccc ccttggtacc cgcagtgatg acacaggcac tggcccccgg 19381 agggagcgac ccagccggcg gctgaccaca ggcaccccgg cctctccgag acggcctcct 19441 gccctgcgca gccagtcccg agaccggctg gatcgcggcc ggccccgggg ggccccagga 19501 ggcaggggag cccagctgtc ggtccccagc cctgcccggc gggcccggag ccagagccgc 19561 gaggagcagg ctgtgctgct tgtgcgcagg gatcgagacg ggcagcactc atgggtgcca 19621 aggggcaggg gcagtggggg ctcgggcagg agcacccccc agactccccg tgcccgcagc 19681 cctgcagcac cccggctttc ccgggtctcc agccccagtc cagagttggg caccacaccg 19741 gccagcatct tccgcacacc cctgcagctc gacccgcagc aggagcagca gctgttccgg 19801 cgcctggaag aggagttcct ggccaatgcc cgggcccttg aggctgttgc tagcgtgacc 19861 cccactggac cagcccctga cccagctcgg gcccccgacc ctccagctcc tgactctgcc 19921 tattgttcct ccagttcctc ctcttcgtcc ctcagcgtcc tgggtggcaa atgtggccaa 19981 cctggggact ctggccggac ggccaatggg ctgcctgggc cccgaagcca agccctttcc 20041 agctcctccg atgaaggcag cccctgccct ggcatggggg ggccactaga tgcacctggg 20101 agccccctgg cttgcactga accctcgagg acctgggcac ggggtcggat ggacacacag 20161 ccagaccgta aaccctcacg tatccccacg cctcggggcc cccgccgccc ctccggaccc 20221 gcagagctgg ggacatggca tgccctgcac tcagtcaccc cgagggctga gccagattcc 20281 tggatgtgat ggaccagctc agctgtcccc agaccccatc ccttctcctt ttcctttgtg 20341 gccttaaccc ttctgcatca gggagccccc tctgcctctt gagtaccaga cctcatggga 20401 ccagacccct tgggaccaca tggcacaatg ggacctctgt tgtacattcc ggttggggga 20461 tgagcgttgc tatttaatta ctaatattat tgaatgcctt agaggaggcc gggcgagccc 20521 ggtgttctga agacctgtgg cccagcagag cctctgacag taaagttttg ctccagccac 20581 ctgtctgttt cttggggact ccaccttggg gccattcaga ccctttcgag gaccctggag 20641 ttaactgcct ggaggccaag cctggcaact ctgacctcta ggcttaaggc gttatcccgc 20701 ccagcccaga aatcccaggc gcactcaaat tattttccct tttattatcc cgtggtaggt 20761 gtggcataaa gaatatgtcc agtgaagctc caggagcagg ctgcgtgttt ccagtcaagt 20821 tcatagccaa gtcctttcca tccaatggga ttgtgaccca ttgaggtccc agaaaggact 20881 ggtcatctcc aatggagctt gggacctggt tgggtccaat gaggttcaaa agggggccag 20941 tcgcttagga gactgggttg gagctttcct catccaatca gggcggagac ttccctgtcc 21001 agtcccagtg aagttgggcg atcccgtcca atccaggtcc ctgattgtcc cagtcacaag 21061 gtggggccca tggatggcac tgtccgatcg ggtcacatga ggctgcagcg cgcgggatgc 21121 agcgccccct gcaggcgcag ggccgggtgt gcagggcgcg cgcgcaccag agcgcagcgc 21181 agcagctcgc ggaagagacg cagcacgtgc cagttgtact tggcggagca ctcgaggtag 21241 ccgcagcgcc agcccctgcg cactagggcg gccagcgcgc gccgcggtcc gaagcgcagc 21301 cgctgcctgt cccgcttgtt gcctaccacg aggatgggcg cttcgggcgc gcccgccggc 21361 ctgggggccg aatggagatg agagacgcgg ggaccccacg gccggagaat tcccccagtg 21421 ggtatacaca cgcaggcgca ttcccttcct acggccagag acaccgcccc cccccgccgc 21481 cccaacgaaa ccccaggccc atgtgggcca aagagcacat acagcttaag accttcccac 21541 gggcgaaggc ctaagacccg gcccacgacg ttgggagacc ctacacactc tcctcaggca 21601 aaggccctgc cagacctgtc ctagctcccc accgccagaa ctgccgggcc cctacctggt 21661 ctccgcgatg cgctgccgca gggccttcac gtagtcgaaa ctgtccgggc tgcagatgtc 21721 gtagacgagc acgaaggcgt ccgtgtcctg caagctccag tccttagcgt ctggccactc 21781 ctggggacag gaggccaggg tttgcggaag cctcctttcc actcttccgc tctcgaaccc 21841 tctgcagggg cctggacgct aatcctcagg ccaacatcac ccccatacag acctctgtgc 21901 ccctccccca agctttccgg gccgtctcct catgaccacc agggtaaaat tttccaaatc 21961 gtgcctggag cctgtcagcc agcctccttc atccttggtc tcccctctgt ttaccccgaa 22021 tcttcctctt ttcagctttc cccaactcgg ctcaggccgc tctcctccct agatgctctc 22081 ggtgcttacc acggcctcta agactaattc caaagccctt ggcccgagcc ccgtggtcca 22141 ccacgccccc agattgtcag ctgtaaccgt ttatgaccgt agcgtaagca cctgctggtc 22201 tttccacctg gtatgttttt gccacttcct ttactcagaa cttcactcgg tcttccaggt 22261 tcagctcaaa tgccactccc tccaggaagc cctgcttaat cctgcctgct ctttcctctt 22321 cactgctcta gggtcggagg tctgtccttt actttgcaca aattactaaa cgagtcttcg 22381 ctccagggga caaaaatgat agggcagaga atgaacccct ccaccagccc ccgtagaaag 22441 tacacccctg ctcctgaccc cagtctctcc cattgtagct ctgctcaagg ccccaaacac 22501 aaaacgccac actctcggga aggcaccgtt atgcaccacc aagagctgct ccacaccccc 22561 agccgaggca ggcaaaggtg gaggcgctca caggagcaag ggaacgcccc tgggcaaaat 22621 cgcccggcct ccagcctggg cttgggcgtc tccaggtcgg cgggaagggc tgaaacgcaa 22681 gtatcccagg acgcacacac atccctccat cccggagcac agccaggcgc ccgaggcctc 22741 cacgcggaca cgcacgtgtg caccgagcac gcacacccct cacactgtca cacgcccctg 22801 cccgctcctc gagccgctac ctccggaccc ccggggctcg agccggggcc agcgacgtcg 22861 ccgtcgcgga tgctcaagtc gtagacggcg ccgtcgagca gcaccgcggg tcggtagagg 22921 cgcggcccgt ccgtgggccg gtggcgctcg gggtagtcac cgaacaggaa ctggcggatg 22981 atggccgtct tgcccacgcc cggggcgcct agaacggcca cccgcaggct accccccatg 23041 gccggccggc gctgccgctc cccgcgctgg aaagcctcat gggccggcgc cgcaccgtgc 23101 gcccccaggc cgtgcgcccc gcgcgccctg cccggtgcgc cacggccccg tcgcgcttct 23161 cgtcgcccct cggagcaccg agaccggaga gggcaggccc gaggcaggag ctggcggcgg 23221 gagggcagac agacagccga gtgggcagac aggtggcggg cgcgcgagcg gctcgggcgg 23281 gtgccgaagc tcgagagaga gcagagccgg agcggcgctc agacaccgcc tccgccccgc 23341 agcgccaacc caggcgccgg cggacgcgcg aggcgccgaa gtccgcccct ctcgcggcgc 23401 gggtcccgca gcggtgcggg gcgcgggggg cagcgctgcc tcccggagcc tctgtgcctc 23461 tatccttctc tcgcttccgc gcctggagtc gccgccccag ccgccggcgc gagcgagcgg 23521 ggcctcccgg aggcggcgat gaggtaaccg cccgcccctc ccaccttccc ccaccgccca 23581 ggacgcccct cccaggaagg gtcgcaggaa ggtgactggg ggacagagat cccggagagg 23641 gcctacccct tccttacctc ctccgcgtgt cccgctcctc tcacccgctc caaaatagaa 23701 gccccttccc acccactgcc gcggccgctc ttccaaacac tgctcgcccc cacgctggct 23761 cccgacaagc cccggccccc ggagtcaggc cgcccggggt ccgaattggg gggggcggct 23821 gtgtgacctt gggcgaatcg ccgcactgcg ctgggtctgc gctccgcatc catcacaggc 23881 agactcctca agaggctcca accttttctt gagagcctac tatgtgccag gcccattacc 23941 tacgtgattc tcgaacctac agcctcagcc tggcatcgct ggttgagaga agaggaaaca 24001 ggcccgagag gggaatctgc aaggttagaa gggcagtggg ggggcccttg aagtctcctt 24061 tgaagtccct cctaccttcc tcctgcccac gggggtaaat ggcccacact agacaatatt 24121 caaacagccg tcctcgaggg gaagtgggga cagagggtgc cgagaccaca gcagcttagc 24181 cccagatccc tgtcccagca aggctggagg tgactaaata aagatccaga aattggccca 24241 ctccacctcc atcactacta aggtcacctt tcctgcgtga gttgcaaccc gtagcgcccc 24301 tcaaagggat gcaacctcca aaacaggcgc ccagagccca agttccccct ccctcgagct 24361 agtggagaag ctcctgcccc atcacagctt ggaaagcctg ggttagccca gcccccccat 24421 cctccgcctc tgctctcctc cccgctgtgt gggggctccc acgtctctga cctgcacatg 24481 gcaatgctca agaaatggcg gcaatgctca agaaatggcc gcaatgcaca gatctctgca 24541 ctgactaagg agttcattta tttatcccct caacaaatat tccaaatctt attgcttgtc 24601 aggaggctga gatagattta aaaaacagaa aacaggaaaa aaaaaaaaag ttctgctttg 24661 ggagagaagg ttctagaaca catgggatct agggactgtc aaagcacaga caatgtgccc 24721 acaccagccc aggggaagta gaggagctaa atcctgaagg gggaggaggc acatcctcca 24781 gggagcctgt ataatgacaa ccagaaagtg tctccagaac aatggctgtg ctttcatcct 24841 cccatgctgg atcaggggct ggggtccgct ttcactcctc tcctcagcca ctaggaagct 24901 cagaaccacc tctgtctagt gttccaggtc ttctagctca gtgagtgggg actgctggct 24961 tcatctttct ttcctaccat gatttttccc ttcacttgat gtcctctttc acgttgtaac 25021 ttttcagact tctgggtttt gttgtttgag acaaggtctt gctctgtcac ccaagctgga 25081 gtgcagtggt gtgatcttag ctcactgcat cctcaacctc ctgggttcaa gtgatcctcc 25141 ttgcctcagt ctctcaaata gctgggactg caggtgtgta ctacgacatc cagctgattt 25201 atttttattt tttgtagagg tggaatctca ctatgttgcc caggctggtc ttgaactcct 25261 ggcttaagca atcctccctc ctcagcctcc aaaagtgcga ggattacagg cgtgagccac 25321 cacatctggc caaacttctg ggtttttgac ttttcaggct tctgggtttt gtgagctgct 25381 tcaattcctt cggggaacaa ggcaaggagt aaataacaag gcactcagca acacaacaac 25441 aaaacccaga agtctgaaaa gctgacacca ggacggtctt ttgtttgttt gttttgagat 25501 ggagtctcgg tctgtcaccc aggctggagt gcagtggtgc gatcttggct cactgcaacc 25561 tccacctcct gggttcaagc aattctcctg tctcagcctc tagagtagct gggactacag 25621 gcacatgcca ccatgcccgg ctaatttttg tgttttttag tagagacggg gtttcaccat 25681 attggtcagg ctggtctcga actgctgacc tcaggtgatc cacccgcctc agcctcccaa 25741 agtgctgggg tttcaggcat gagccactgc gcccggccag gacggtcttt tttaaggtgt 25801 tctcaatgct aggcgcactc ccgcccaccc ctgggtattc tttctctgaa aaactttcct 25861 tgcttctcct tccgacactg caaacagagc tgtgtgtgag gcgtctctgg ctcccagtat 25921 gtgtttttgg atggcgaggg gaaggggcac tgtggactgg gaacgtcctg aggctcccat 25981 gaagccctgc aaacgtgcca tgttccctag gagactctca cagcagcgtt cctagcagca 26041 gtgcccattt atcaagggcc tgcgatgtgc caagcctgac ccacagcctc atgctcacta 26101 aggctccctg ggacgccatg attgggccac agttgcctag caggaaaatg gcagcacagg 26161 ggccggaacc caggcccttg ggctccagag cctggctgag aagcctggct tcatctggac 26221 ccagagggtt ctcaggaagt actcaactga agctcagccc ccagtgacca ctcagccccc 26281 tgtcgggagg gacttgggga ggcctctagg tcccgtggag ccccccatcc acaggggctg 26341 aaatgagctc taatgggagt gtgcagccct gtgccaggct tcctaaagac cagcaggctg 26401 gggggctgaa gctgcctgtc tgcctgggat gtggatttca gggttctctc atcagtcccc 26461 tcattccttc cccaaggaaa ggctgtaggc ctcctgcctg ctgcccccct ccaatcccta 26521 ttccccaggg gcctcctggg cactccttga atgcccctat ccagggctcc agggaccacc 26581 tgccaactat gcatgcatct tctccttcag cagaggggac tggcactagg cacacagtgg 26641 ccaaggtggt cctgcctacc ctctgtggca gggagaaaga gacagacatg cagctaggca 26701 gggacgggtg aagacaggag ctctgggctg gttcaggcct ggggagatga ggaggcctag 26761 ggtaagcacc agtccagcgg gagttgaggg gcttatactg agccctgaag gatcaagcca 26821 cagatagtgg ggagggctgg gcaggggcag gcaccaggtg gggaattctg ctttccccag 26881 ttcattgccc acaggctcct aacaccagct acctggcctt cacccatgtg tttcttccca 26941 actgggcctg attccagccc atccctgaag tcaggcaccc aagactgcat ccagctccca 27001 gggggtgtta gttctggact cctttgctag gccacaccag tcaccactaa ttatggtctg 27061 tcctggccac gaatcatggc tccacaccac cagtaactca cctcctggac ccacgaatgg 27121 agggatcctc cacagctcac atgcttgggg tgtagtgggt ccttggacat cctcatgatg 27181 actcccacta atcacagcca ggcagcctct tgagggctgg gctggtcttc cagccatcag 27241 gacacaagga ggctgggcag ggctgggctt ccaagcctct gaaagtttct tttcttgggt 27301 ttgaaacctc cttccagggc tgtgtcatct gtgcaagtta cttaaattct ctgcaccttg 27361 ggctgggtgc agtggctgac gcctgtaatt ccagcacctt ggggaggccc aggcgggagg 27421 atcgcttggg cccaggagtt ccagaccagc ctgggcaaca cagtgagacc ccgactctag 27481 aaaaaatttt aaaaattggc tgggtgcagt ggctcacgcc tgtaatccca gcattttggg 27541 aggccgaggc aggggaatca cgaggtcagg catttgagac cagcctggcc aacatggtga 27601 aaccccatct ctattaaaaa tataaaaatt agccaggcat ggtggcgggg gcctgtagtc 27661 ccagctactc tggaggctga ggcaggagag tcacttgaac ccgggaggca gaggttgcag 27721 tgagccgaga ttgcgccact gcactccagc ctgggtgaca gagcgagact ccgtctcaaa 27781 aaaaaaaaaa aaaaaaaatt aaaaaactag ctgggtgctg taatcccagc acttcgggag 27841 gctgaggcag gtggatcacg aggtcaggag atacagatca tcctggctaa catggtgaaa 27901 ccacgtctct actaaaaaat acaaaaaatt agtcgggcat ggtggtgggc gcctgtagtc 27961 ccagttactc gggaggctga ggcaggagaa tggcgtgaac ccgggaggca gaggttgcag 28021 tgagccaaga tcgcaccact gcactccagc ctgggcgaca gagcaagact ctgtctcaaa 28081 aaaaaaaaaa aaaaattagc taggcgtggt ggtgcacacc tgtggtccca gctactcagg 28141 aagctgaggt gggaggattg cttgagccct ggaggtcgag gctacagtaa gccatgatca 28201 caccactgta cagcagcctg ggtgacagag caagaccctg tctcgaaaaa acaaaaaacg 28261 aacaaacaaa aaaacacctc tgcttctcag cttccttctc tgtgaaagaa atattaattt 28321 tcttgacttc actgagctgt gggatgcaat gagatgatac aaagagaaaa acaagacaac 28381 ttgttaagtt tggggagaca aaattggtac atataaaact tccatttttc tatgtaaatg 28441 gctacaaata caaaacataa tggaggcata tgtgtgtata cgtgggtaaa caggcacaca 28501 tatattttct agctctgtca gcagagggcc tggaagaagt gacacccagt ggcagtgaat 28561 acatccagca tctagctctt ggtttctttc ctttttatta tttatttatt tttgaaacag 28621 ggtctcattc tgttgtccag gctggaatgc acaggcacaa acatggctca ctgcagcctg 28681 gacctccagg gctcaagcaa tcctcccact ttagcctccc tgagtagctg ggaccacagg 28741 cataggccac cacacccggc taaatttttg catttctttg tagaaacagg gttttgctgg 28801 tctcaaactc ctgggctcca gcaatcagcc tgcctgggcc tcccaaagtg ctgggattac 28861 aggtgtgagt caccacgcct ggccgctctt ggtttctgtt ttttcttttt ttcttttttg 28921 agatggagtc tcgctctgtt gcccaggctg gagtgcagag gcgtgatctc ggctcactac 28981 aacctcggcc tcccaggttg aagcaattct cctgtctcag cctcctgagt agctgggact 29041 acaaggcgcc tgccaccacg ccccgctaat ttttgtattt ttaatagaga ctgggtttca 29101 ccttgttggt caggctggtc tcgaactccc gacctcaggt gatccaccca cctcgccctc 29161 ccaaagggcc gggattaccg gcatgagcca tcgcgcccag cctggctgct cctggtttct 29221 aatatggttt tccagtgaaa ggaaccacgc tccttggaga gatgactgtt tctagggttg 29281 gcgcaggacc agtacaagat gatcctggag ccacttatgg tggcagaaag gagggaagtg 29341 ctcaaaaacg aggatggtgg tatgtcacag ggacatagga accttccaga aggagctccc 29401 agtggccaaa cctgcaacaa ttcaagcaac aaagtaaata atgttgtatg ggattaagcc 29461 cagattataa aataaatatc catgagtcca aactgatata aatgactgag taaataaatg 29521 aataaatgtg atcattcagc ttgcttgctc ccaggggaaa aaaatacatc tgagaagaaa 29581 ccagtcttgc ttaaagaaga attccaaaca cttccatgta gctactcccc caccccgctt 29641 tccaaagtgt aggctacatt tagttatttg cttccaaaga atagagtaca taaaggagga 29701 tggccgggca cggtggctca tgcctgtaat cccaacactc tgggaggcca aggttggtga 29761 atcacttgag cccaggagtt cagtaccagc ctgggcaaca tagtgagacc tcctctctac 29821 taaaaataat aataaaaaaa atattgaatg aaattagcca ggtgtgatgg tgtgtgactg 29881 tagtcccagc tacttgggag gctgaggtgg gaggatcaac tgagctccag aagtcaaggc 29941 tgcagtgagc tgtgtttatg cctctgcact ccagcctgga ggacagaatg agaccctgtc 30001 ttaaaaaaat aaataaaata aaaaataaag aggggaaaag taacttcacg gtagagaaaa 30061 ctgcaaacat gcaaacatta tcttggctag gtgatccagg caaacgtgag gtgacaagaa 30121 atgtcgatag catgtcaccc tgattatgat gcgtaggaaa agtcacctgt gtggtattct 30181 ttctaaaaaa ccataatccc aggctgacca tgagaaaaac acccaacaaa cccaaactga 30241 gggactttct acagaatacc caagcaggac tccttgaaaa cagtcaaggt catgaaacac 30301 caagaatagc tgagaaactc tcacagatat aaggaggcat gagtaccacg tggaatgtgg 30361 gattctggaa cagaaagagg acacgagcgc acaactagtg aaacacgaat aaaaaaccga 30421 aatgggcggc caggcatgat ggctcatgcc tgtaatccca gcactttggg aggctgaggc 30481 aggcagatca cttgaggtca ggtgtttgag accagcctgg ccaatatggt gaaaccccgt 30541 ctctactaaa aatacaaaaa ttggctgggc acagtggcgc atgcctgtaa tcccagctac 30601 ttgggaggct gaggcaggag aatcacttga accagggagt cggaggttgc agtgagctga 30661 gatcgtgcca ctgcactcca gcctggcaac aaagggaggc tccgtctaaa aaaaaaaaaa 30721 aaaaaaaaaa gaacttttac atgaagaaaa cttaaaatat tacagagaca taaaagaaaa 30781 cctgaatcaa tgggaaagca tctactcctg ttctttacat ataaagatcc aatattgtta 30841 ttgacatcat aacaaggcca ttctccataa ataaacattt gatgtgatcc aaatgatacg 30901 atcaattttt ttttaaggac aaactagttc taaagttcaa atagaatgaa atgggaataa 30961 gagggtacaa gaagttatca aaacatggct gggagctgtg gctcacactt gtaatcccaa 31021 tgacttgaga ggctgaggca ggaggatcgc ttgagcccag gtgttcgagg ctgcagtgag 31081 ctatgatcat gccactgcac tccagcctgg gtgacagagt gagaccctga ctcaaaaaat 31141 aaaaaatgat tgggcatggt ggctcatgcc tgtaatccca gcactttggg aggccaacgc 31201 gggtggatca cttgagttca ggagttcgag accagcctgg gccacgtggg ggaaccctat 31261 ctctacaaaa accacaaaaa ttaggtgtgg tagctctcac ctgtagcccc agctatttgg 31321 gagactgagg caggaggatt gcttgagcct gggaggtgga agttgcagtg agctcagatc 31381 acaccactgc acaacagcct gggtgacagg agtgagaccc tgtctcaaaa ataataaatg 31441 aataaaaata aaaataaaag ttatcaaaac atgaatagac aaattatctt gtatagaaca 31501 gagtccagaa aaacacccag atatatgtgg aaatttagaa tatgataaaa atggcatttc 31561 aaatctgtgg ggaaaagatg aattattcag taaatggtgt ggagacaagt ggggaaccat 31621 ctggggaaaa agggaagttg gatccctacc tcattcctac agaaagccca gatggaacag 31681 gattgaaatg caaaaatgaa gagcagacaa gtactagaaa gaaaaaatta aaagaaaatt 31741 acatcttaaa aagcccaatt tcagcatgga aaagtaaaaa gcagtaacaa aatagggagg 31801 aataatggta aatcagtaag aaaaggatca ataatcagat agaaaaatgg cagaaaatgt 31861 gaagacagct cacagaaagg gacatacaag tggctcaaaa acacatgcaa agaggcacat 31921 tgaaaacatg ggaagggcca ggcgcgatgg ctcatgcctg taatcctagc attttgggaa 31981 gctgaggcag gcaggttatt gagcccagga gtttgagacc agcctgggca acacggcaag 32041 accgtctaca aaaacataca aaaatttact tggtgtggtg gtgcgcattt gtagtcccag 32101 ctactcagga ggctgaggtg ggaggatcac ttgagcctgg gaggttaaga ctgcagtgag 32161 tggtaatcac atcactgcac tccagcctgg gcaacagaga ccctgtcaca acaaaaaaga 32221 aaaaaaaaag ggaaagaaaa gaagaagaga aaaaaggcca ggtgcgatgg cttacgcctg 32281 taatcccaac aatttgggag gccaaggagg gcggatcacc tgaggtcagg agttcaagac 32341 cagcctagcc aacatggtga aaccccatct ctaataaaaa tacaaaaaat aattagccgg 32401 gtatggtggt gtgctcctgt aatcccggct acttgggagg ctgaggcagg agaatcactt 32461 gggtctggga ggcggaggtt gcagtgagct gagattacac cactgcactc cagcctgggc 32521 gacagagcaa ggctctgtct caaaaaaaaa aaaaaaaaaa aaaaaaaaaa ggcagctggg 32581 tgcagtggct cacgtctgta atcccagaac tttaggaggc agaggcgggc agatcacctg 32641 aggtcaggag ttcaagacca gcctggccaa catggcgaaa atctgtctct actaaaaata 32701 caaaaattag ccaagcgtgg tggcgggcgc ctgtaatccc agctacttgg gaggctaagg 32761 caggaggaac acttgagcct gggaggcaga ggttgcagtg agctgagatc ataccattga 32821 acgccagcct gggaggcaga ggttgcagtg agctgagatc ataccactga acgccagcct 32881 gggcgtcaga gtgagactcc atctcaaaaa agaaataaaa atttaaaaat gaaaaaataa 32941 aaaaaaggca tgaggacact ctctataatg ggctaataat ggaaaaacct ctgcgatata 33001 agcccaacat gtattccgtg ctactatttc cacacaagaa gggaaaaaaa tccaattagg 33061 cactgccaag tctctttgga taattacgtg gtttcctttg gggaggaaaa tgagggaaag 33121 gggtaggttt ttacatctgt cttgtcttta gattttgagc catgtgaatg tgttaccttt 33181 tcaaaacaag aactaaaata gaaaatgcat cttgggtata ccctctggtt cagcagttcc 33241 ggggctagat cctacagcta catccctgcc caggcaggag ggtgtatgtg ggaagtattc 33301 gctgcagcat tgcttagggt gcagaaacag cctgaatgcc ccgcagcagg gactggctaa 33361 cggtggccca ttcacacaag gaagcctgag gcggtctgct cctactgata gggaaagagc 33421 tccagaaacg actgtcaggt cagagaagcg atgtgctagc agtgtctagg taggccatca 33481 ttcgtttagt aaaactctat atggatttgc ttatacacat gtgcagaata tccctggagg 33541 aaaatcaaaa tctagtgaca atgtctgcct tcaagggagg aaaattaagg actgaggatg 33601 gggtgggagg gagaacttta ttctctggct ggctgacctc accatggtcc tgaattactt 33661 gctccacaaa aatgaacgca aaccactagc tgaaaattaa acaacgtaaa accaggtccc 33721 gcaacagcct tggcagtgtc tggtgcacag gggagcccat cggcccttgg gtatgtgttg 33781 gaccctccag aaataactgg ttgctgttgt tggatttcct gattggctgg gaaggttcaa 33841 gacacttggg aatcttccca agaaaagtct gggaagttcc ccaagctgcg ggatgccaag 33901 gcctccaaga ggcgagacgg ggaggcctga gcgtgtcagg accaggaagt gccacatgag 33961 caggagggag gcggcctgtg atgaccctca gaggtgaggg agggacaggg aagctgaagg 34021 gcacagactg gctatgccct gaggtactct tcctaaaggc aggggcccaa agtagcatcc 34081 tgtgtgtcta ttctcctgat gcagtagtgt gggggactgg gagcatgaag gccctccctg 34141 cctcagtttc cctgcatgta aactaagccg cattatatgc ttagtccagt atatgctggg 34201 tcgccttatg ctgagcactg tgggggtgaa agaggggccc aactgtagtg aggcaccgct 34261 cacaatcacc tgaggcttga gcagactcat tcaaacacaa ggtgtctagg gatggtcctc 34321 ctttcttgcc agctctcctc acccgggtgg cttgtcctgg gcctccggtg gggacagcga 34381 tcccacactt gcactcttcc gaccttgcaa tggaggggtg gggaggtgaa gtgccaagct 34441 ccctgaatgg ctgagctggc aggccatggg cgtggtggcc cgtgcagctg cttcccaggc 34501 agagtgcgaa ggtattgggc tcaggggagg cagacgacac ggcagggtga gagtccctct 34561 cattaagggc tcaccaacct ctagttccat ggcagggcag ggactccaga tgcatttggg 34621 cagagactct caggcgaggc aagaaccagg gttgcacagt caagggcctt ggttggttct 34681 ggactctttg gaggggccac agctgggcac acagctcgag cccttgaccc tggggtcccc 34741 agggcagcac agtgacacca cagaggtcag aggtggggca gtggagggcc aggtgcccct 34801 gtgatgccct gcaagagggc aggctgtctc acccagcttc cagcagccag gaaggcagcg 34861 gtgtgtgtga ggtgggcagc atgtgggggg actaagtgca gacagaacac gagggagcgg 34921 ctggagctga aacaggctga gggacacctg ggctgctcct ccctcagaag caggctgagg 34981 cccaccagcc aggaggcagc tcagccgatg aagccctgcc cagggtctgg ccaccctcac 35041 tgggagccag gccagtgcca ggactggggc cacacaaggt gtcctcggtg gcggctggcc 35101 tggtccacac gcccaccccc tctcccacgg ctgcccctga tcaggggatc tcccacggcc 35161 cctggagcct cccagatggc agggagctgg ggaaaaccac aaggggagcc cagccttctg 35221 gccagcatca caaggcgagg cattgtcagc tctgggggag gtgggatcaa tggcaggaga 35281 ggggtcaggg acagcagcag gccgcagcag tcagtttgcc aaggatagga tggagggaag 35341 gcgctgtcct cagagtgcca cgggaggggc tgctgccgca cataccactg gagcctttga 35401 ccctggggtc cctgcagccc gggactcctg cccagaagag cctcgggcct gacacactgg 35461 ctggggacac acttgatgtc atcattttta ttcattttct ctttctgaaa ataagagtaa 35521 acatacaata tatattttct gatgatatat attagatcta gttaccgtag ggtcagaaaa 35581 ccatccccaa ccactcggag gcagctttca atcctgacat tcggtggagg ggaaaaaagt 35641 gtccgtgatt gccaaaagcc gaatctttct ttattattac acagcagtaa caggaaaacc 35701 taatgctctg tcagacacaa ctgaaacctg ggaggccagg aggagggcag cagacactac 35761 agagaaaccc cgaagaaacc caaacaggag gcgcacccgc ttacccagcc ccatctctcg 35821 gcccagtctt tgtggcagga ggcctgggct cagaggcgac actgaggaat gtatcacagg 35881 cagtgaccac ccttggggca gaagtgggga gttggggata gtgcagctgc cagtcggtgg 35941 ggagcccgag acaggggctc tgggagcttc tgcctggtct tgaccatgtg ggtcagcact 36001 cgggaggctg tggcccaact ggcacagctg cagggcacca gcctggtggt tgcaagtggg 36061 tggtgcaggc cctgggctcc tcccagccta gctctgcccc tctgccggca cctacagctc 36121 cttgagcccc aggccctaag tgagcagctc tcctggtacc aaggggaacc cctccctgca 36181 cacccaggcc aggttggcag ggtccatccc atgagaggtg gcagtggagg gccctttaac 36241 accaaccgga gagaccaagg agccaggggc tgccggggct gtgagccgga ggggcaagct 36301 ccacactcac ccttcaggca cagcttctgt ggctgctctg gctctgtttc ctttggtccc 36361 cacctcactt aaccccacat cacagccaca ggagcagggg tgtcccagag cacgggagtg 36421 cactgcgctc tcaccccagg agggggtggt gccctacccc agggatcggg tgggttctgc 36481 catcaggacc agggagccca ctgagtggcc tggagccccg cctggtccct cctgcgagga 36541 ggaagatgtg ctgcccccga ggggcctcct cgatggggcg ggcagaaggc tggggtgggc 36601 gctggccggg gtctcagttc ttgaggatgg tctcgtaggc ctggtacacg tgctgggaca 36661 cctctggtgc tcgacacttc agggacagct gcaggggaga gaggggtcgg gggaaagagc 36721 gctcatccct ggggttcctc tcaggaagga aaggggtggg gaagagagca ggaaccaatg 36781 ggacagcgtg gagtgcacac agcctggcgg cagctgcgcc tggcacagat gtgaggtaaa 36841 gcggaggggg cggctgtcgc agccagccgg gtgtggacca aatatacttg cagcccacac 36901 acctgctgtc agcccagtca ccagagccct gctggcgagc agtgtgagtc acgagccgga 36961 ggctgccagg gagtgcccgg cccccaccct gagggctgcc cggcccccca gagtccctga 37021 aggtctgggg tcctcaggac tgaagcctga cagaaaatcc cgagggggtg cggagggctg 37081 ggagggagcc tggaggggtg ctgggctcat tcctggtttc ctgcgtagga agcaggcacg 37141 gtagagactt tgcctccgct ctggggcctg agcttgagat cacaggcaga ggagagagga 37201 gcccccaccc atccactcta aagcactggt tctgcagggt gcagttagga agtgggaaca 37261 atggagttcc gcaggtcagc agcagcccac gggccctcca ggtctcctag gaaggactct 37321 gtgcacactg tgaatgtgtg gaagtggggt gtcagcacct caatcagggc cacctggctg 37381 aagccccccg ggtgaggtga ggccaagaga tgaggtgcag acaggagaca tggggtgcat 37441 gggggccatg acaggagaca agactgggga gggcggacgg ggaaagagaa agcgatctgc 37501 taacctctaa gtcctgcacc acgggaacca gcagggaagg tgacatgcag caaccaaaca 37561 cagggagacg gaagttacca ccaccgaagc cacgccacag ccccccaccg accagcactg 37621 ccagcaggac ccaggagggg cagtgtctgg agggccatgc ctgccagaga ctcaggggtg 37681 tgggggagaa gcagagcttg gggctgtccg gggccctgga acagcggtgc aggcctccag 37741 gacaactgtg tgccccaggg aagccattct ctaacaggtc tacctcaaag gctgggaggt 37801 tcaggctcct gggaaccact gtcctcccag ctggacatgc aggcactcac aggacagggc 37861 cagggcctgt gcccagcaat gtcacttccc agtcaggggt gccaaaccaa ctgcaccacc 37921 ttacagggcc caattttaca caatcttcta gtccaactca ctgttccgag acccaattgg 37981 tgtcttatct gggtaacctg acggggttga tttggaagct ttctagaaag gtccttctca 38041 gggaccaaac tgattctagt tcaagccagg cttgaaggga cgcttggacc gcgtcctcca 38101 ccagggccac cccctggcat gggacgaagg ggaggctcca aggctgcagg gcgggtgccg 38161 gggctctcac cgtgcagctg gggttgcccg gctggatccg cagctccgcc agcacccaga 38221 tgccgttggt cagcttcagg gactggtaga gcatgtcctg gccctccacg ttcctcttgg 38281 cgacagtgaa gatgttgctg ctctgcagct tgctgctcgc agcctctgtg gggtcacatg 38341 gccgtgagag gccccagtca gcgcgggggc ctgggtacag gcgaggggct ggggtcgggg 38401 gctcggagtc ctcacctgca ttgagggggc agtctctgat ctggaactgg gcctcattct 38461 cattgggaat atccttccat gtggccagga acatctgccg gtctgcgggg tgagcagggt 38521 ggggatgaga ggcgagcccc tcataaacct gtgggccctc tgtagctcag caggaaatgg 38581 gtctggcctt tactgtgcca cgtgaaagct gacaacaggc actagctgcc gcacagccct 38641 cctgcttccc taagcacacc ctccgctctg tgagggaggc cccagtgccg cttccaaatg 38701 accctggctt cctgcctcca ggccttggat ctcccctccc cacccccaca cgccttcctc 38761 ctcctcggtg agcagcctgc tcgttagtca cagcctccca tgcaggcctt ctgtggcaag 38821 caagagatga catgggccaa gggctccacg cagccagggg cagggtgcag gccgtctgtg 38881 tgctgtgatc aacacccttc ttgggggttc cagaaagggc acagaggagg gggttaccag 38941 gtgtggggca tgacctgggc tcgctgcagc ccatgggaca aggctgagga gcgagcagga 39001 gagcctggga gtggggtgag ttccatgtgg cacagcccac ctgccagacc agaactggcc 39061 actaaggccg ggggggtcct gggtataccc ctcactccag gccctgcagg gaagcagcaa 39121 ctccccgact ccatttacgg gcaaggcagt ccatctggac cttggtaaag gcaccacctt 39181 ggccaggttc ccaagccaca gccccatttc aggcacccca gcgggctccc tcatgactca 39241 cccatcttcc cgtcctccac aaagaggatg tgcagtgggt acaaggtgct gaagtagaag 39301 acatcgatgt tgttcttcac ggccacctag gcacaagggg gtccccagtc agtctctggg 39361 gcttgacgcc acctctgagg ccccaggaag agtgtcactg agcagattct ctcccagggc 39421 cttccctggt aaacacaccc cttcgttccc caacctgggc ctccccaaca catctgcagc 39481 taagagcatg actccgcaga cacgaccttc tactgacccc agtggggccc ggctggtagg 39541 tgagtaagga ctggggtgcc tgccctgccc ggaacccacc tggaggttgt tcagaggctc 39601 catcttcatg accgagccca ccgtgctgag aggcagggag atctccactg tctggttggg 39661 gctgagtggt gcgtggacct ggaggggggc ggcgggggcc aggccaaagc tggggagaga 39721 gaagccccac agggatggca gggggagtag gtgctgatgc gggacaggtg agctcctccg 39781 actcgggagc tggggctgtg gttggcaggg agcctctcac cgtgccagcc ttcccatgtt 39841 ctggaccctg gacagaagga tgggctgtgg gtcccctccc ccagatggcc atctggacag 39901 aaagaaagag ccccaggcca attgctctgc tctgccccag cctggctgca ggcctgaagc 39961 ccagctctgt tcctctcctc tgggccaggc actggtgggc ccaggtaggc aggccttgcc 40021 ctgtcctgcc ctcaaggagc tctgtctaat gccaggggcc tggagtataa gctgcttcta 40081 ccttgaaaac tctgggacca cccaagtctc tgctgccaat cctcaactcc cagaagggat 40141 c // LOCUS HSEYA2 1617 bp DNA PRI 05-SEP-1997 DEFINITION H.sapiens EYA2 gene. ACCESSION Y10261 NID g1834488 KEYWORDS EYA2 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1617) AUTHORS Abdekhak,S., Kalatzis,V., Heilig,R., Compain,S., Samson,D., Vincent,C., Weil,D., Cruaud,C., Sahly,I., Leibovici,M., Bitner-Glindzicz,M., Francis,M., Lacomde,D., Vigneron,J., Charachon,R., Boven,K., Bedbeder,P., Van Regemorter,N., Weissenbach,J. and Petit,C. TITLE A human homologue of the Drosophila eyes absent gene underlies branchio-oto-renal (BOR) syndrome and identifies a novel gene family JOURNAL Nature Genet. 15 (2), 157-164 (1997) MEDLINE 97172972 REFERENCE 2 (bases 1 to 1617) AUTHORS Abdelhak,S. TITLE Direct Submission JOURNAL Submitted (23-DEC-1996) S. Abdelhak, Genetique Moleculaire Humaine, URA CNRS 1968, Institut Pasteur, 25 Rue du Dr Roux, 75724 Paris Cedex 15, FRANCE FEATURES Location/Qualifiers source 1..1617 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="9 week embryo" /chromosome="20" /map="20q13.1" gene 1..1617 /gene="EYA2" CDS 1..1617 /gene="EYA2" /codon_start=1 /db_xref="PID:e290305" /db_xref="PID:g1834489" /translation="MVELVISPSLTVNSDCLDKLKFNRADAAVWTLSDRQGITKSAPL RVSQLFSRSCPRVLPRQPSTAMAAYGQTQYSAGIQQATPYTAYPPPAQAYGIPSYSIK TEDSLNHSPGQSGFLSYGSSFSTSPTGQSPYTYQMHGTTGFYQGGNGLGNAAGFGSVH QDYPSYPGFPQSQYPQYYGSSYNPPYVPASSICPSPLSTSTYVLQEASHNVPNQSSES LAGEYNTHNGPSTPAKEGDTDRPHRASDGKLRGRSKRSSDPSPAGDNEIERVFVWDLD ETIIIFHSLLTGTFASRYGKDTTTSVRIGLMMEEMIFNLADTHLFFNDLEDCDQIHVD DVSSDDNGQDLSTYNFSADGFHSSAPGANLCLGSGVHGGVDWMRKLAFRYRRVKEMYN TYKNNVGGLIGTPKRETWLQLRAELEALTDLWLTHSLKALNLINSRPNCVNVLVTTTQ LIPALAKVLLYGLGSVFPIENIYSATKTGKESCFERIMQRFGRKAVYVVIGDGVEEEQ GAKKHNMPFWRISCHADLEALRHALELEYL" BASE COUNT 399 a 488 c 409 g 321 t ORIGIN 1 atggtagaac tagtgatctc acccagcctc actgtaaaca gcgattgtct ggataaactg 61 aagtttaacc gtgctgacgc tgctgtgtgg actctgagtg acagacaagg catcaccaaa 121 tcggcccccc tgagagtgtc ccagctcttc tccagatctt gcccacgtgt cctcccccgc 181 cagccttcca cagccatggc agcctacggc cagacgcagt acagtgcggg gatccagcag 241 gctaccccct atacagctta cccacctcca gcacaagcct atggaatccc ttcctacagc 301 atcaagacag aagacagctt gaaccattcc cctggccaga gtggattcct cagctatggc 361 tccagcttca gcacctcacc cactggacag agcccataca cctaccagat gcacggcaca 421 acagggttct atcaaggagg aaatggactg ggcaacgcag ccggtttcgg gagtgtgcac 481 caggactatc cttcctaccc cggcttcccc cagagccagt acccccagta ttacggctca 541 tcctacaacc ctccctacgt cccggccagc agcatctgcc cttcgcccct ctccacgtcc 601 acctacgtcc tccaggaggc atctcacaac gtccccaacc agagttccga gtcacttgct 661 ggtgaataca acacacacaa tggaccttcc acaccagcga aagagggaga cacagacagg 721 ccgcaccggg cctccgacgg gaagctccga ggccggtcta agaggagcag tgacccgtcc 781 ccggcagggg acaatgagat tgagcgtgtg ttcgtgtggg acttggatga gacaataatt 841 atttttcact ccttactcac ggggacattt gcatccagat acgggaagga caccacgacg 901 tccgtgcgca ttggccttat gatggaagag atgatcttca accttgcaga tacacatctg 961 ttcttcaatg acctggagga ttgtgaccag atccacgttg atgacgtctc atcagatgac 1021 aatggccaag atttaagcac atacaacttc tccgctgacg gcttccacag ttcggcccca 1081 ggagccaacc tgtgcctggg ctctggcgtg cacggcggcg tggactggat gaggaagctg 1141 gccttccgct accggcgggt gaaggagatg tacaatacct acaagaacaa cgttggtggg 1201 ttgataggca ctcccaaaag ggagacctgg ctacagctcc gagctgagct ggaagctctc 1261 acagacctct ggctgaccca ctccctgaag gcactaaacc tcatcaactc ccggcccaac 1321 tgtgtcaatg tgctggtcac caccactcaa ctaattcctg ccctggccaa agtcctgcta 1381 tatggcctgg ggtctgtgtt tcctattgag aacatctaca gtgcaaccaa gacagggaag 1441 gagagctgct tcgagaggat aatgcagaga ttcggcagaa aagctgtcta cgtggtgatc 1501 ggtgatggtg tggaagagga gcaaggagcg aaaaagcaca acatgccttt ctggcggata 1561 tcctgccacg cagacctgga ggcactgagg cacgccctgg aactggagta tttatag // LOCUS HSEYA3 1722 bp DNA PRI 05-SEP-1997 DEFINITION H.sapiens EYA3 gene. ACCESSION Y10262 NID g1834490 KEYWORDS EYA3 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1722) AUTHORS Abdekhak,S., Kalatzis,V., Heilig,R., Compain,S., Samson,D., Vincent,C., Weil,D., Cruaud,C., Sahly,I., Leibovici,M., Bitner-Glindzicz,M., Francis,M., Lacomde,D., Vigneron,J., Charachon,R., Boven,K., Bedbeder,P., Van Regemorter,N., Weissenbach,J. and Petit,C. TITLE A human homologue of the Drosophila eyes absent gene underlies branchio-oto-renal (BOR) syndrome and identifies a novel gene family JOURNAL Nature Genet. 15 (2), 157-164 (1997) MEDLINE 97172972 REFERENCE 2 (bases 1 to 1722) AUTHORS Abdelhak,S. TITLE Direct Submission JOURNAL Submitted (23-DEC-1996) S. Abdelhak, Genetique Moleculaire Humaine, URA CNRS 1968, Institut Pasteur, 25 Rue du Dr Roux, 75724 Paris Cedex 15, FRANCE REMARK revised by submitter 31-JAN-1997 FEATURES Location/Qualifiers source 1..1722 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="9 week embryo" /chromosome="1" gene 1..1722 /gene="EYA3" CDS 1..1722 /gene="EYA3" /codon_start=1 /db_xref="PID:e299512" /db_xref="PID:g1834491" /translation="MEEEQDLPEQPVKKAKMQESGEQTISQVSNPDVSDQKPETSSLA SNLPMSEEIMTCTDYIPRSSNDYTSQMYSAKPYAHILSVPVSETAYPGQTQYQTLQQT QPDAVYPQATQTYGLPPFGALWPGMKPESGLIQTPSPSQHSVLTCTTGLTTSQPSPAH YSYPIQASSTNASLISTSSTIANIPAAAVASISNQDYPTYTILGQNQYQACYPSSSFG VTGQTNSDAESTTLAATTYQSEKPSVMAPAPAAQKLSSGDPSTSPSLSQTTPSKDTDD QSRKNMNSKNRGKKKADATSSQDSELERVFLWDLDETIIIFHSLLTGSYAQKYGKDPT VVIGSGLTMEKMIFEVADTHLFSNDLKECDQVHVEDVAPNDKGQNLNNYSFSTNGFSG SGGSGSHGSSVGVQGGVDWMRKLAFRYRKVREIYDKHKSNVGGLLSPQRKEALQKLKA EIEVLTNSWLGTALKSLLLIQSKKNCVNVLITTTQLLPALAKVLLYGLGKIFPIENIY SATKIGKESCFERIVTSLGKKLTYVVIGDGRDEEIAAKQHNMPFWRITNHGDLVSLHQ ALELDFL" BASE COUNT 544 a 411 c 353 g 414 t ORIGIN 1 atggaagaag agcaagattt accagagcaa ccagtgaaaa aagccaagat gcaggaatca 61 ggagagcaaa ctataagtca agtaagcaat ccagatgtca gtgatcagaa gcctgaaaca 121 tcaagccttg cttcaaacct tcccatgtca gaggaaatta tgacatgcac cgattacatc 181 cctcgctcat ccaatgatta tacctcacaa atgtattctg caaaacctta tgcacatatt 241 ctctcagttc ctgtttcgga aactgcttac cctggacaga ctcaatacca gacactacag 301 cagactcaac cagatgctgt ctaccctcag gcaacccaaa cgtatggact acctcctttt 361 ggtgcattgt ggccaggtat gaaacctgaa agtggtttaa ttcagactcc atctccaagt 421 caacacagtg ttcttacctg cactacaggg ttaaccacaa gccagccaag cccagcacat 481 tattcttatc ccattcaagc ttcaagcaca aatgccagcc tgatatctac ttcttctaca 541 attgccaata ttccagcagc agcagtagcc agcatctcaa accaggatta tcccacctat 601 actattcttg gtcagaatca gtaccaggcc tgctacccca gctccagctt tggagtcaca 661 ggtcagacta acagtgatgc agagagcacc acattagcag caaccacata ccagtcggag 721 aagcctagtg tcatggcgcc tgcacctgca gcacagaaac tttcctctgg agacccttct 781 acaagtccat ctttgtccca gactacacca agtaaagata ctgatgatca gtccaggaaa 841 aacatgaata gcaagaaccg gggcaagaag aaagctgatg ccacttcttc ccaagacagt 901 gaattagaac gggtatttct gtgggacttg gatgaaacca tcatcatctt ccactcactt 961 cttactggat cctatgccca gaaatatgga aaggacccaa cagtagtgat tggctcaggt 1021 ttaacaatgg aaaaaatgat ttttgaagtg gctgataccc atctattttc caatgactta 1081 aaagagtgtg accaggtaca tgtggaagat gtggctccta atgacaaggg ccaaaacttg 1141 aacaactaca gtttctcaac aaatggtttc agtggctcag gaggtagtgg cagccatggt 1201 tcatctgtgg gtgttcaggg aggtgtggac tggatgagga aactagcttt ccgctaccgg 1261 aaagtgagag aaatctatga taagcataaa agcaacgtgg gtggtctcct cagtccccaa 1321 aggaaggaag cactgcaaaa attaaaagca gaaattgaag ttttaacaaa ttcctggtta 1381 ggaactgcat taaagtcctt acttctcatc cagtccaaaa agaattgtgt gaatgttctg 1441 atcactacca cccagctgct tccagccctg gccaaggttc tcctatatgg actaggaaaa 1501 atatttccta ttgagaacat ctatagtgct accaaaattg gtaaggagag ctgctttgag 1561 agaattgtca caagccttgg aaagaaactc acatatgttg tgattggaga tggacgagat 1621 gaagaaattg cagccaaaca gcacaacatg cctttctgga ggatcacaaa ccatggagac 1681 ctagtatccc ttcaccaggc tttagagctt gattttctct aa // LOCUS HSEZRIN 3044 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for ezrin. ACCESSION X51521 NID g31282 KEYWORDS ezrin; kinase substrate; microvilli protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3044) AUTHORS Hunter,T. TITLE Direct Submission JOURNAL Submitted (25-JAN-1990) Hunter T., The Salk Institute, Molecular Biology and Virology Laboratory, 10010 North Torrey Pines Road, San Diego, CA 92138, USA REFERENCE 2 (bases 1 to 3044) AUTHORS Gould,K.L., Bretscher,A., Esch,F.S. and Hunter,T. TITLE cDNA cloning and sequencing of the protein-tyrosine kinase substrate, ezrin, reveals homology to band 4.1 JOURNAL EMBO J. 8 (13), 4133-4142 (1989) MEDLINE 90076135 COMMENT See also . FEATURES Location/Qualifiers source 1..3044 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /clone_lib="Okayama-Berg" /clone="F6" CDS 118..1878 /note="ezrin (AA 1-586)" /codon_start=1 /db_xref="PID:g31283" /db_xref="SWISS-PROT:P15311" /translation="MPKPINVRVTTMDAELEFAIQPNTTGKQLFDQVVKTIGLREVWY FGLHYVDNKGFPTWLKLDKKVSAQEVRKENPLQFKFRAKFYPEDVAEELIQDITQKLF FLQVKEGILSDEIYCPPETAVLLGSYAVQAKFGDYNKEVHKSGYLSSERLIPQRVMDQ HKLTRDQWEDRIQVWHAEHRGMLKDNAMLEYLKIAQDLEMYGINYFEIKNKKGTDLWL GVDALGLNIYEKDDKLTPKIGFPWSEIRNISFNDKKFVIKPIDKKAPDFVFYAPRLRI NKRILQLCMGNHELYMRRRKPDTIEVQQMKAQAREEKHQKQLERQQLETEKKRRETVE REKEQMMREKEELMLRLQDYEEKTKKAERELSEQIQRALQLEEERKRAQEEAERLEAD RMAALRAKEELERQAVDQIKSQEQLAAELAEYTAKIALLEEARRRKEDEVEEWQHRAK EAQDDLVKTKEELHLVMTAPPPPPPPVYEPVSYHVQESLQDEGAEPTGYSAELSSEGI RDDRNEEKRITEAEKNERVQRQLVTLSSELSQARDENKRTHNDIIHNENMRQGRDKYK TLRQIRQGNTKQRIDEFEAL" BASE COUNT 826 a 687 c 855 g 675 t 1 others ORIGIN 1 aggcagggcg ggcgggcgct ctaagggttc tgctctgact ccaggttggg acagcgtctt 61 cgctgctgct ggatagtcgt gttttcgggg atcgaggata ctcaccagaa accgaaaatg 121 ccgaaaccaa tcaatgtccg agttaccacc atggatgcag agctggagtt tgcaatccag 181 ccaaatacaa ctggaaaaca gctttttgat caggtggtaa agactatcgg cctccgggaa 241 gtgtggtact ttggcctcca ctatgtggat aataaaggat ttcctacctg gctgaagctg 301 gataagaagg tgtctgccca ggaggtcagg aaggagaatc ccctccagtt caagttccgg 361 gccaagttct accctgaaga tgtggctgag gagctcatcc aggacatcac ccagaaactt 421 ttcttcctcc aagtgaagga aggaatcctt agcgatgaga tctactgccc ccctgagact 481 gccgtgctct tggggtccta cgctgtgcag gccaagtttg gggactacaa caaagaagtg 541 cacaagtctg ggtacctcag ctctgagcgg ctgatccctc aaagagtgat ggaccagcac 601 aaacttacca gggaccagtg ggaggaccgg atccaggtgt ggcatgcgga acaccgtggg 661 atgctcaaag ataatgctat gttggaatac ctgaagattg ctcaggacct ggaaatgtat 721 ggaatcaact atttcgagat aaaaaacaag aaaggaacag acctttggct tggagttgat 781 gcccttggac tgaatattta tgagaaagat gataagttaa ccccaaagat tggctttcct 841 tggagtgaaa tcaggaacat ctctttcaat gacaaaaagt ttgtcattaa acccatcgac 901 aagaaggcac ctgactttgt gttttatgcc ccacgtctga gaatcaacaa gcggatcctg 961 cagctctgca tgggcaacca tgagttgtat atgcgccgca ggaagcctga caccatcgag 1021 gtgcagcaga tgaaggccca ggcccgggag gagaagcatc agaagcagct ggagcggcaa 1081 cagctggaaa cagagaagaa aaggagagaa accgtggaga gagagaaaga gcagatgatg 1141 cgcgagaagg aggagttgat gctgcggctg caggactatg aggagaagac aaagaaggca 1201 gagagagagc tctcggagca gattcagagg gccctgcagc tggaggagga gaggaagcgg 1261 gcacaggagg aggccgagcg cctagaggct gaccgtatgg ctgcactgcg ggctaaggag 1321 gagctggaga gacaggcggt ggatcagata aagagccagg agcagctggc tgcggagctt 1381 gcagaataca cagccaagat tgccctcctg gaagaggcgc ggaggcgcaa ggaggatgaa 1441 gttgaagagt ggcagcacag ggccaaagaa gcccaggatg acctggtgaa gaccaaggag 1501 gagctgcacc tggtgatgac agcacccccg cccccaccac cccccgtgta cgagccggtg 1561 agctaccatg tccaggagag cttgcaggat gagggcgcag agcccacggg ctacagcgcg 1621 gagctgtcta gtgagggcat ccgggatgac cgcaatgagg agaagcgcat cactgaggca 1681 gagaagaacg agcgtgtgca gcggcagctc gtgacgctga gcagcgagct gtcccaggcc 1741 cgagatgaga ataagaggac ccacaatgac atcatccaca acgagaacat gaggcaaggc 1801 cgggacaagt acaagacgct gcggcagatc cggcagggca acaccaagca gcgcatcgac 1861 gagttcgagg ccctgtaaca gccaggccag gaccaagggc agaggggtgc tcatagcggg 1921 cgctgccagc cccgccacgc ttgtctttag tgctccaagt ctaggaactc cctcagatcc 1981 cagttccttt agaaagcagt tacccaacag aaacattctg ggctgggaac cagggaggcg 2041 ccctggtttg ttttccccag ttgtaatagt gccaagcagg cctgattctc gcgattattc 2101 tcgaatcacc tcctgtgttg tgctgggagc aggactgatt gaattacgga aaatgcctgt 2161 aaagtctgag taagaaactt catgctggcc tgtgtgatac aagagtcagc atcattaaag 2221 gaaacgtggc aggacttcca tctgtgccat acttgttctg tattcgaaat gagctcaaat 2281 tgattttttt aatttctatg aaggatccat ctttgtatat ttacatgctt agaggggtga 2341 aaattatttt ggaaattgag tctgaagcac tctcgcacac acagtgattc cctcctcccg 2401 tcactccacg cagctggcag agagcacagt gatcaccagc gtgagtggtg gaggaggaca 2461 cttggatatt tttttagttc tttttttttt ggcttaacag ttttagaata cattgtactt 2521 atacacctta ttaatgatca gctatatact atttatatac aagtgataat acagatttgt 2581 aacattagtt ttaaaaaggg aaagttttgt tctgtatatt ttgttacctt ttacagaata 2641 aaagaattac atatgaaaaa ccctctaaac catggcactt gatgtgatgt ggcaggaggg 2701 nagtggtgga gctggacctg cctgctgcag ctgcagtcac gtgtaaacag gattattatt 2761 agtgttttat gcatgtaatg gactatgcac acttttaatt ttgtcagatt cacacatgcc 2821 actatgagct ttcagactcc agctgtgaag agactctgtc tgcttgtgtt tgtttgcagt 2881 ctctctctgc catggccttg gcaggctgct ggaaggcagc ttgtggaggc cgttggttcc 2941 gcccactcat tccttctcgt gcactgcttt ctccttcaca gctaagatgc catgtgcagg 3001 tggattccat gccgcagaca tgaaataaaa gctttgcaaa ggca // LOCUS HSF0811 40127 bp DNA PRI 04-FEB-1998 DEFINITION Human DNA sequence from cosmid F0811 on chromosome 6. Contains Daxx, BING1, Tapasin, RGL2, KE2, BING4, BING5, ESTs and CpG islands. ACCESSION Z97184 NID g2648017 KEYWORDS 6p21.3. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 40127) AUTHORS Beck,S. TITLE Direct Submission JOURNAL Submitted (13-NOV-1997) Chromosome 6 Project Group (http://www.sanger.ac.uk/chr6/); E-mail enquires: humquery@sanger.ac.uk Clone requests: clonerequest@sanger.ac.uk COMMENT Herberg J. A., Beck S. and Trowsdale J. Tapasin, Daxx, RGL2, KE2 and four new genes (BING1, 3-5) form a dense cluster at the centromeric end of the MHC (submitted). This sequence has been finished according to sequence map criteria as follows. An attempt is made to resolve all sequencing problems, such as compressions and repeats, but not necessarily within known annotated human repeat sequence elements (e.g. Alu). Where the sequence is ambiguous, there is an annotation using the 'unsure' feature key. The true left end of clone F0811 is at 1 in this sequence. The true right end of clone B2046 is at 9988. The true left end of clone F0811 is at 40127. F0811 is from the ICRF flow-sorted human chromosome 6 cosmid library (cell line RPETO1) IMPORTANT: This sequence is the entire clone insert. This sequence was generated from part of bacterial clone contigs of human chromosome 6, constructed in collaboration by the Sanger Centre chromosome 6 mapping group and Ioannis Ragoussis, Jethro Herberg & John Trowsdale. Further information can be found at http://www.sanger.ac.uk/HGP/Chr6/ Herberg, J.A., Beck, S. and Trowsdale, J. (1998) Tapasin, Daxx, RGL2, KE2 and four new genes (BING1, 3-5) form a dense cluster at the centromeric end of the MHC. J. Mol. Biol., in press Herber, J.A., Sgouros, J., Jones, T., Copeman, J., Humphray, S.J., Sheer, D., Cresswell, P., Beck, S. and Towsdale, J. (1998) Genomic analysis of the Tapasin gene, located close to the TAP loci in the MHC. European J. Immunol., in press. FEATURES Location/Qualifiers source 1..40127 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6p21.3" /clone="F0811" prim_transcript <1..>2522 /note="match: multiple ESTs; match: AA129657 Z45843 T79567 W39776 H13819; match: W02333 H34947 H73773 H77913AA075116; match: AA383223 AA317661 C03211 W45308 AA301687; match: AA405738 W51682 AA355087 AA129682 C00363; match: AA374951 Z41479 H73021 T79477 R02432 H74158; match: AA356200 H45863 AA308321 AA075005 R48034; match: T36090 AA312483 AA234490 N73287 T30378; match: R47926 AA380467 AA365654 AA306929" CDS join(2..345,490..701,857..1070,1227..1701,1862..2084, 2279..2337) /note="Author-given protein sequence is in conflict with the conceptual translation." /codon_start=1 /product="Daxx" /db_xref="PID:e1186793" /db_xref="PID:g2648018" /translation="IRLFGRLCELKDCSSLTGRVIEQRIPYRGTRYPEVNRRIERLIN KPGPDTFPDYGDVLRAVEKAAARHSLGLPRQQLQLMAQDAFRDVGIRLQERRHLDLIY NFGCHLTDDYRPGVDPALSDPVLARRLRENRSLAMSRLDEVISKYAMLQDKSEEGERK KRRARLQGTSSHSADTPEASLDSGEGPSGMASQGCPSASRAETDDEDDEESDEEEEEE EEEEEEEATDSEEEEDLEQMQEGQEDDEEEDEEEEAAAGKDGDKSPMSSLQISNEKNL EPGKQISRSSGEQQNKGRIVSPSLLSEEPLAPSSIDAESNGEQPEELTLEEESPVSQL FELEIEALPLDTPSSVETDISSSRKQSEEPFTTVLENGAGMVSSTSFNGGVSPHNWGD SGPPCKKSRKEKKQTGSGPLGNRYVERQRSVHEKNGKKICTLPSPPSPLASLAPVADS STRVDSPSHGLVTSSLCIPSPARLSQTPHSQPPRPGTCKASVATQCDPEEIIVLSDSD " repeat_region 2639..2855 /note="AluJb repeat: matches 302..88 of consensus; incomplete repeat" misc_feature 3240..3642 /note="Putative CpG island" repeat_region 3855..4023 /note="MIR repeat: matches 20..206 of consensus" CDS 4165..6069 /codon_start=1 /product="BING1" /db_xref="PID:e1186794" /db_xref="PID:g2648019" /translation="MEPSPLSPSGAALPLPLSLAPPPLPLPAAAVVHVSFPEVTSALL ESLNQQRLQGQLCDVSIRVQGREFRAHRAVLAASSPYFHDQVLLKGMTSISLPSVMDP GAFETVLASAYTGRLSMAAADIVNFLTVGSVLQMWHIVDKCTELLREGRASATTTITT AAATSVTVPGAGVPSGSGGTVAPATMGSARSHASSRASENQSPSSSNYFSPRESTDFS SSSQEAFAASAVGSGERRGGGPVFPAPVVGSGGATSGKLLLEADELCDDGGDGRGAVV PGAGLRRPTYTPPSIMPQKHWVYVKRGGNCPAPTPLVPQDPDLEEEEEEEDLVLTCED DEDEELGGSSRVPVGGGPEATLSISDVRTLSEPPDKGEEQVNFCESSNDFGPYEGGGP VAGLDDSGGPTPSSYAPSHPPRPLLPLDMQGNQILVFPSSSSSSSSQAPGQPPGNQAE HGAVTVGGTSVGSLGVPGSVGGVPGGTGSGDGNKIFLCHCGKAFSHKSMRDRHVNMHL NLRPFDCPVCNKKFKMKHHLTEHMKTHTGLKPYECGVCAKKFMWRDSFMRHRGHCERR HRLGGVGAVPGPGTPTGPSLPSKRESPGVGGGSGDEASAATPPSSRRVWSPPRVHKVE MGFGGGGGAN" prim_transcript <5009..>5868 /note="match: multiple ESTs; match: AA085230 AA056630 AA309450 N43919 R53654; match: R36276 T07529 H15337 W78260 AA051601" prim_transcript <5940..>6663 /note="match: multiple ESTs; match: T78477 T78559 AA359884 D82810 H34158 H15719; match: AA062251 T07530 T03652 N34845 R53543; match: AA056605 N23868 T27481 AA018565 T18609; match: R49563 W74632 W94380 N29958" misc_feature 7031..7745 /note="Putative CpG island" CDS join(7040..7076,7217..7387,7604..7864,15694..16092, 16443..16784,16864..16953,17092..17126,19309..19320) /codon_start=1 /product="tapasin" /db_xref="PID:e1186795" /db_xref="PID:g2648020" /translation="MKSLSLLLAVALGLATAVSAGPAVIECWFVEDASGKGLAKRPGA LLLRQGPGEPPPRPDLDPELYLSVHDPAGALQAAFRRYPRGAPAPHCEMSRFVPLPAS AKWASGLTPAQNCPRALDGAWLMVSISSPVLSLSSLLRPQPEPQQEPVLITMATVVLT VLTHTPAPRVRLGQDALLDLSFAYMPPTSEAASSLAPGPPPFGLEWRRQHLGKGHLLL AATPGLNGQMPAAQEGAVAFAAWDDDEPWGPWTGNGTFWLPTVQPFQEGTYLATIHLP YLQGQVTLELAVYKPPKVSLMPATLARAAPGEAPPELLCLVSHFYPSGGLEVEWELRG GPGGRSQKAEGQRWLSALRHHSDGSVSLSGHLQPPPVTTEQHGARYACRIHHPSLPAS GRSAEVTLEVAGLSGPSLEDSVGLFLSAFLLLGLFKALGWAAVYLSTCKDSKKKAE" prim_transcript <7998..>8890 /note="match: multiple ESTs; match: H95651 H95725 H95087 T02997" repeat_region 9188..9216 /note="MER5B repeat: matches 79..108 of consensus" repeat_region 9432..9730 /note="AluSp repeat: matches 303..4 of consensus" repeat_region 9739..10046 /note="AluSq repeat: matches 303..1 of consensus" repeat_region 10228..10542 /note="AluJb repeat: matches 300..5 of consensus" repeat_region 10555..10856 /note="AluSq repeat: matches 303..1 of consensus" repeat_region 10924..11055 /note="AluJo repeat: matches 2..133 of consensus; incomplete repeat" repeat_region 11058..11356 /note="AluSp repeat: matches 2..303 of consensus" repeat_region 11363..11474 /note="FLAM_A repeat: matches 4..133 of consensus" repeat_region 11526..11657 /note="FLAM_C repeat: matches 132..1 of consensus" repeat_region 11683..11966 /note="AluSp repeat: matches 294..1 of consensus" repeat_region 12231..12452 /note="MER30 repeat: matches 1..230 of consensus" repeat_region 12730..13036 /note="AluSq repeat: matches 293..3 of consensus" repeat_region 13174..13373 /note="MIR repeat: matches 262..64 of consensus" repeat_region 13378..13678 /note="AluSq repeat: matches 1..303 of consensus" repeat_region 13770..14073 /note="AluSq repeat: matches 303..1 of consensus" repeat_region 14084..14384 /note="AluSp repeat: matches 302..1 of consensus" repeat_region 14618..14735 /note="MIR2 repeat: matches 17..146 of consensus" repeat_region 14994..15182 /note="MIR repeat: matches 199..2 of consensus" prim_transcript 16051..16385 /note="match: 5' EST AA324236" prim_transcript <16785..>21386 /note="match: multiple ESTs; match: AA281362 AA299378 T95134 H40477 H40478; match: T95038 W48760 W48761 U51697 AA369475; match: R65889 H27117 N25151 D19918 R22460; match: T61710 AA404406 N35653 AA411318 R32826; match: AA303745 AA235390 AA143464 N98901 H39975; match: R48048 AA364715 T28708 AA290846 W74553; match: AA379282 T87753 H53201 AA357767 W74659; match: H45803 H98631 AA303092 H59364 AA128601; match: R48019 T61625 AA062872 T87603 T69380; match: AA339923 AA295964 AA411317 N69910 R32928; match: AA379524 AA102006 R70438 AA335972 AA292457" repeat_region 17352..17477 /note="MER33 repeat: matches 324..197 of consensus" repeat_region 17485..17603 /note="MER46 repeat: matches 12..125 of consensus" repeat_region 17647..17947 /note="AluSx repeat: matches 302..1 of consensus" repeat_region 18000..18091 /note="MER46 repeat: matches 145..234 of consensus" repeat_region 18139..18437 /note="AluSq repeat: matches 1..303 of consensus" repeat_region 18454..18757 /note="AluSx repeat: matches 1..302 of consensus" repeat_region 18765..18935 /note="AluJo repeat: matches 131..301 of consensus; incomplete repeat" repeat_region 18947..19091 /note="MER33 repeat: matches 145..1 of consensus" repeat_region 19487..19788 /note="AluSx repeat: matches 302..1 of consensus" repeat_region 20097..20395 /note="AluY repeat: matches 299..2 of consensus" misc_feature 20119..22554 /note="Putative CpG island" repeat_region 20405..20699 /note="AluY repeat: matches 296..2 of consensus" repeat_region 21134..21270 /note="MIR2 repeat: matches 1..139 of consensus" CDS join(22470..22625,23965..24048,24304..24482,24616..24666, 24755..25052,25321..25572,25658..25761,25869..25953, 26035..26103,26318..26392,27014..27046,27164..27285, 27381..27476,27556..27667,27774..28064,28536..28650, 28767..28978) /codon_start=1 /product="RGL2" /db_xref="PID:e1186796" /db_xref="PID:g2648021" /translation="MLPRPLRLLLDTSPPGGVVLSSFRSRDPEEGGGPGGLVVGGGQE EEEEEEEEAPVSVWDEEEDGAVFTVTSRQYRPLDPLVPMPPPRSSRRLRAGTLEALVR HLLDTRTSGTDVSFMSAFLATHRAFTSTPALLGLMADRLEALESHPTDELERTTEVAI SVLSTWLASHPEDFGSEAKGQLDRLESFLLQTGYAAGKGVGGGSADLIRNLRSRVDPQ APDLPKPLALPGDPPADPTDVLVFLADHLAEQLTLLDAELFLNLIPSQCLGGLWGHRD RPGHSHLCPSVRATVTQFNKVAGAVVSSVLGATSTGEGPGEVTIRPLRPPQRARLLEK WIRVAEECRLLRNFSSVYAVVSALQSSPIHRLRAAWGEATRDSLRVFSSLCQIFSEED NYSQSRELLVQEVKLQSPLEPHSKKAPRSGSRGGGVVPYLGTFLKDLVMLDAASKDEL ENGYINFDKRRKEFAVLSELRRLQNECRGYNLQPDHDIQRWLQGLRPLTEAQSHRVSC EVEPPGSSDPPAPRVLRPTLVISQWTEVLGSVGVPTPLVSCDRPSTGGDEAPTTPAPL LTRLAQHMKWPSVSSLDSALESSPSLHSPADPSHLSPPASSPRPSRGHRRSASCGSPL SGGAEEASGGTGYGGEGSGPGASDCRIIRVQMELGEDGSVYKSILVTSQDKAPSVISR VLKKNNRDSAVASEYELVQLLPGERELTIPASANVFYAMDGASHDFLLRQRRRSSTAT PGVTSGPSASGTPPSEGGGGSFPRIKATGRKIARALF" prim_transcript <24790..>29426 /note="match: multiple ESTs and cDNAs; match: R37588 R25789 AA345798 C04496 D31579; match: AA400504 T32323 N31477 N80842 AA401259; match: R17716 AA401972 AA339803 C02440 AA066942; match: T08668 C03271 T08669 N99972 AA369649 D30835; match: N27045 AA400614 AA402117 AA203308 AA404233; match: R86012 AA135350 AA007622 T31165 N27365; match: AA007661 W07350 AA311078 T80173 T49124; match: T32376 AA135176 AA035367 T30397 W05771; match: N92937 T87606" repeat_region 26520..26647 /note="MIR repeat: matches 110..240 of consensus" prim_transcript <30147..>31460 /note="match: multiple ESTs; match: C21091 AA023721 AA409103 AA277765 AA234979; match: AA020628 AA270316 AA240167 W88772 D25580; match: W05039 AA152842 AA369134 N74677 AA289910" CDS complement(join(30235..30364,30630..30754,30836..30906, 31160..31223)) /codon_start=1 /product="HKE2" /db_xref="PID:e1186797" /db_xref="PID:g2648022" /translation="MAELIQKKLQGEVEKYQQLQKDLSKSMSGRQKLEAQLTENNIVK EELALLDGSNVVFKLLGPVLVKQELGEARATVGKRLDYITAEIKRYESQLRDLERQSE QQRETLAQLQQEFQRAQAAKAGAPGKA" prim_transcript <31845..>32746 /note="match: multiple ESTs; match: H52575 T97040" misc_feature 31884..32317 /note="Putative CpG island" misc_feature complement(31893..32197) /note="match: CpG island DNA genomic Mse1 fragment Z63744" CDS join(31910..31978,32067..32276,32386..32466,32610..32722, 32845..32932,33054..33115,33362..33466,33575..33725, 33854..33989,34186..34285,40088..40127) /codon_start=1 /product="BING4" /db_xref="PID:e1186798" /db_xref="PID:g2648023" /translation="METAPKPGKDVPPKKDKLQTKRKKPRRYWEEETVPTTAGASPGP PRNKKNRELRPQRPKNAYILKKSRISKKPQVPKKPREWKNPESQRGLSGAQDPFPGPA PVPVEVVQKFCRIDKSRKLPHSKAKTRSRLEVAEAEEEETSIKAARSELLLAEEPGFL EGEDGEDTAKICQADIVEAVDIASAAKHFDLNLRQFGPYRLNYSRTGRHLAFGGRRGH VAALDWVTKKLMCEINVMEAVRDIRFLHSEALLAVAQNRWLHIYDNQGIELHCIRRCD RVTRLEFLPFHFLLATASETGFLTYLDVSVGKIVAALNARAGRLDVMSQNPYNAVIHL GHSNGTVSLWSPAMKEPLAKILCHRGGVRAVAVDSTGTYMATSGLDHQLKI" prim_transcript <32897..>40127 /note="match: multiple ESTs; match: R13118 AA166557 AA156666 H10139 F06119; match: H52171 H09026 R14111 H07039 AA306917 Z42940; match: H17751 H16617" repeat_region 34354..34504 /note="MIR repeat: matches 18..172 of consensus" repeat_region 34624..34915 /note="AluSx repeat: matches 292..1 of consensus" repeat_region 34916..35221 /note="AluSx repeat: matches 302..1 of consensus" repeat_region 35257..35559 /note="AluSq repeat: matches 1..303 of consensus" repeat_region 35572..35877 /note="AluJb repeat: matches 1..300 of consensus" CDS complement(36387..36734) /note="Author-given protein sequence is in conflict with the conceptual translation." /codon_start=1 /product="BING5" /db_xref="PID:e1186799" /db_xref="PID:g2648024" /translation="NLSAEPCPQQVPSEYLMTKGIRGKYKSDRKAVENADTPSGTAEI QSRHHLHPSHQPDLERTIILNDNDEHSAPYFAKRCAKLYALTHLNVYNSLYEGFKPRL GKLLPMGQTWPII" repeat_region 36446..36524 /note="MIR repeat: matches 162..241 of consensus" repeat_region 36673..36753 /note="MIR2 repeat: matches 146..67 of consensus" repeat_region 36815..37118 /note="AluSg repeat: matches 296..1 of consensus" repeat_region 37163..37461 /note="AluSx repeat: matches 302..2 of consensus" repeat_region 37602..37778 /note="AluJb repeat: matches 165..2 of consensus; incomplete repeat" repeat_region 37855..38150 /note="AluSg repeat: matches 297..1 of consensus" repeat_region 38978..39237 /note="AluSx repeat: matches 20..293 of consensus; incomplete repeat" repeat_region 39868..39961 /note="MIR repeat: matches 110..214 of consensus" BASE COUNT 8915 a 10354 c 10696 g 10162 t ORIGIN 1 gatccgcctc tttgggcgac tatgtgagct gaaagactgc tcttcactga ccggccgtgt 61 catagagcag cgcatcccct accgtggcac ccgctaccca gaggttaaca ggcgcattga 121 gcggctcatc aacaagccag ggcctgatac cttccctgac tatggggatg tgcttcgggc 181 tgtagagaag gcagctgccc gacacagcct tggcctcccc cgacagcagc tccagctcat 241 ggctcaggat gccttccgag atgtgggcat caggttacag gagcgacgtc acctcgatct 301 catctacaac tttggctgcc acctcacaga tgactatagg ccaggtaggg ggttggtggg 361 atgcctctca gtgaccctta tatgtcaggt atgaggcgga tggggcatct ctctagtcct 421 ttctagggtt tttactcttc tagtcccttc aagggctgag tgctctgact ttatgtcttc 481 ccacgtaggc gttgaccctg cactatcaga tcctgtgttg gcccggcgcc ttcgggaaaa 541 ccggagtttg gccatgagtc ggctggatga ggtcatctcc aaatatgcaa tgttgcaaga 601 caaaagtgag gagggcgaga gaaaaaagag aagagctcgg ctccaaggca cctcttccca 661 ctctgcagac acccccgaag cctccttgga ttctggtgag gtgtggatgg ggtacagcct 721 tcagagagac attgtccttc ccctgcactg gccaccaggg agtccaggtt gactgatggg 781 ggagcatgag aaggaaagca agaaccaaac cctctggggc aagggattcc ttagagaaac 841 ttctttgtct cccagggccc tagtggaatg gcatcccagg ggtgcccttc tgcctccaga 901 gctgagacag atgacgaaga cgatgaggag agtgatgagg aagaggagga ggaggaggaa 961 gaagaagagg aggaggccac agattctgaa gaggaggagg atctggaaca gatgcaggag 1021 ggtcaggagg atgatgaaga ggaggacgaa gaggaagaag cagcagcagg tatgtcaagg 1081 gaacattctc ctcacctgtc atttctgttt tgttaggcat gagccacctg tttgaaacct 1141 ttcccttcct cctcctagtg tcctctcagc agtatttttt ccccctcatt ctcaggctac 1201 cctctcccct tttcttcttt tccaggtaaa gatggagaca agagccccat gtcctcacta 1261 cagatctcca atgaaaagaa cctggaacct ggcaaacaga tcagcagatc ttcaggggag 1321 cagcaaaaca aaggacgcat agtgtcacca tcgttactgt cagaagaacc cctggccccc 1381 tccagcatag atgctgaaag caatggagaa cagcctgagg agctgaccct ggaggaagaa 1441 agccctgtgt ctcagctctt tgagctagag attgaagctt tgcccctgga taccccttcc 1501 tctgtggaga cggacatttc ctcttccagg aagcaatcag aggagccctt caccactgtc 1561 ttagagaatg gagcaggcat ggtctcttct acttccttca atggaggcgt ctctcctcac 1621 aactggggag attctggtcc cccctgcaaa aaatctcgga aggagaagaa gcaaacagga 1681 tcagggccat taggaaacag gtaataacaa gaggaggagg aagagggaag ccaaggaggg 1741 attgcagggg actttcggag agcaatactg gggaactgag tctgctggga gagactggac 1801 tgcccccctc cagcctcagt cttcccgtac ccctccacat atttatgttt ctgtttctag 1861 ctatgtggaa aggcaaaggt cagtgcatga gaagaatggg aaaaagatat gtaccctgcc 1921 cagcccacct tcccccttgg cttccttggc cccagttgct gattcctcca cgagggtgga 1981 ctctcccagc catggcctgg tgaccagctc cctctgcatc ccttctccag cccggctgtc 2041 ccaaaccccc cattcacagc ctcctcggcc tggtacttgc aaggtaagaa ggggagggcg 2101 tgtttcctct gacaatttgt tctctttcat tggctgtttc agctgctctg tctgaatatt 2161 ccacgtgcca catcctgtct cttccttttt cccacccatt ttatcctcta aatcctcctg 2221 tttcttttct cctgtactcc agatctcttt gactctttgt ctctcacttc cctgcagaca 2281 agtgtggcca cacaatgcga tccagaagag atcatcgtgc tctcagactc tgattagctg 2341 cctccccttc tccctgcctc cagaatgttc tgggataaca tttggaggaa ggtgggaagc 2401 agatgactga ggaagggatg gactaagcta atcccctttt ggtggtgttt ctttaaaaaa 2461 aaaaaaaagc ttaagtttta cacagaaaca ttaataaaca ataaagttct tttcttactg 2521 tatcactgtc tttttcttgt ccttctactg ttacagattg gcattagctg tccccttgct 2581 cagggttagc ccagtatccc cggggtagtc ccgctctgtc ctttctagcc tggccttttt 2641 tttttttttt ttttttttgt ctcgctgtgt cgcccaggcc agggtgcagt ggtgagatca 2701 cagctcactg cagcctggac ctcccagtct caagcgataa tgcctcagcc tcccgagtag 2761 ccgggactac aggcgcgtgc caccatgcct ggctaatttt tgtttttgtt gtttgtattt 2821 tttgtagaga cggggtttcg ccacgttgtt caggcagtac tttttttttt ttttaagtgc 2881 ttcagccgcc ttaacgcact ttactttttc acttcgtgca gtaggagtga gaaagccaat 2941 gaggggccga ctcaaggtta tttagccaat ccaggcccgg agaaaggggg cggggcttca 3001 gtgggactgt aaggagctag gaagagggcg attagatccc accaaccagc taatggtcac 3061 agctcagagg cggagcctgg ggaacgatac ctggacccat cgccaatggg aaaaggcagg 3121 cttatagctt taaaggggaa taaatgcttc ggccgaacca agtcctgagc tcccagcgga 3181 ggaggggacc tgatctttta aggtggaatg tggcgatgtt tctttaaagg caaagtggcc 3241 gagtcggccg agtattgccc agcggcacgg gcggggtgcc cggctgtgcc tttaaaggcg 3301 gaggggcggg aggcggagcg gaggcggagg gcggagggag tcggcgcaag atggcggcgg 3361 gaggggccca ggttgctgct ctggccgccg agtgaggggc ggggggggcc cgggggcgcg 3421 cggcccggta agcggggacg cgcgaggttg gggggcgcct tccgggtggg agaagaggcc 3481 gaggtagcgc ttggggcgga gcgggggggg gcggggtccg ttgtgggcgg aggtagacaa 3541 ggggcggggt cgaaaggccc gatagtgggc ggggcacggg gggggtgggc ctggaaaggg 3601 gcggggctag agaagctggg gcgaaagggg gcgtggtcgg cgagagagct tggggggcat 3661 agcctgaggg gagatggagc aagcgttggt tttcagtagc agagctgccc tgccccctcc 3721 caaaaagtgg gtgcaaaaca ccaaaaaggt agattttgga atgtcaggct ctcaggtaag 3781 tcggaaatag ggtataattc cgaaggaagc cactccagcc cctaagagca ccttatttct 3841 ctaacctcca aggaaagggc acaagatttg tcagatacct gtgtttgctg gcgctgccac 3901 ttgctgtgtg accttggata atctcttaat ctttctgagc tttagtttcc tcatctgtca 3961 aatggggttg agaggaatac ctacctcact gttgtgggga gttacacaga taatgagcgt 4021 gaacatattt ttgagttgta acgggctgtg tcgttatgag atgttatcat tattcttttc 4081 tctttttctt cccagagacc cccccggccg ccctcctcct ttctttgttc ctgtggctgg 4141 gggggtatcc cctccctcca caacatggag ccatctcctc tgtctcccag tggggcagca 4201 cttcccctgc cgctgtcgct ggctccgccc ccactacccc tgccagcagc tgcagtggta 4261 catgtgtcct tccctgaggt gaccagtgcc ctcttggagt ccctcaatca gcagcgtctg 4321 cagggccagc tctgcgatgt atctatcaga gtgcagggcc gggagttccg ggctcatcgg 4381 gctgtcctgg ctgcctcctc cccttacttc catgatcagg tcctactcaa aggcatgacc 4441 tccatctcgc tgcccagtgt catggaccca ggcgcctttg agactgtcct agcctccgct 4501 tacactggcc gcctcagcat ggctgctgct gacattgtca acttccttac agtggggtct 4561 gtgctccaaa tgtggcacat tgtggacaag tgcactgaac tactccgaga aggccgggcc 4621 tcagctacca ccaccatcac tactgctgca gccacctctg tcactgtccc tggtgctggg 4681 gtgccatccg ggagtggggg cactgtggcc cctgctacca tgggctctgc gcgctcccat 4741 gcctccagcc gggccagtga gaatcaatct cccagcagca gcaactactt cagccccagg 4801 gagtccactg atttctcatc ttcctcccaa gaggcatttg cagcttctgc agtgggcagt 4861 ggggagcgtc gaggaggtgg ccctgtattc ccagcccctg tcgttggcag tggaggggcc 4921 acatctggaa agctgctgct ggaggcagat gagctgtgcg atgatggtgg ggatgggagg 4981 ggggcagtgg ttcctggggc tgggctccgg agacccacct acacaccccc tagcatcatg 5041 ccacagaaac actgggtata cgtgaagcga ggtggtaatt gcccagcgcc aacacccctg 5101 gttccccaag acccagatct ggaggaggaa gaggaggagg aagatctggt gttgacctgt 5161 gaggatgatg aagatgaaga actagggggt agctccaggg ttccagtggg gggagggcct 5221 gaggctaccc tcagcataag tgatgtccgt accctgagtg agcccccaga caagggggag 5281 gagcaggtca acttctgtga gtcctccaat gactttggcc catatgaggg tgggggtcct 5341 gtggcaggtc ttgatgactc aggggggcca actccctctt cctatgcccc ctcccaccct 5401 cctcgaccgc tccttccctt ggacatgcag ggcaaccaga tcctggtctt cccgtcgtcg 5461 tcttcatcct catcctcaca ggctcctggc caaccaccag ggaaccaagc agaacacggg 5521 gcagtgaccg tggggggcac gtcggtgggg agcctgggtg tgccgggtag cgttggtggg 5581 gtccctggag ggactggcag tggggacggg aataagatct ttctgtgcca ttgtgggaag 5641 gccttctccc acaagagcat gcgggaccgg cacgtgaaca tgcacctcaa tctgcggccg 5701 tttgactgcc ccgtgtgcaa caaaaagttc aagatgaagc accatctgac tgagcacatg 5761 aagacgcaca caggtctcaa gccctacgag tgcggagtct gcgccaagaa gttcatgtgg 5821 cgagacagct tcatgcgcca ccgaggacac tgtgagcgcc ggcaccgcct gggcggggtc 5881 ggggccgtac ctgggcctgg gactcccacg gggccatcct tgccgtccaa gagagagtct 5941 cccggagtgg gcgggggcag cggcgacgaa gcgagtgcgg ccacgccccc gtccagcaga 6001 cgtgtctggt ccccacccag agtccacaag gtggagatgg gcttcggtgg aggtggagga 6061 gcaaactgaa ggggcaggct actggggtgg ggtagctttc gggaaaggga ataaggagca 6121 cgatgcaagg gcgctgtggc ccccgggtga tctcccacca cacttactgt cttcctttat 6181 ctctgtggac ttgtatatat tctggaaggg gaaccacagt ttcaccatcg cccgcccatt 6241 ctactactca acccctcccc cccaaggtat ttccagaact aaacccttcc tttccctctg 6301 atgggtacac tgaagcccct gctccacaga gtagattgca catggaggga gggagagggg 6361 gcgtgttgaa catcctgcag tcacagggtc aggggtcagg tggttgtagt ctgtgcctga 6421 agtctgtgtt tgtgttgtcg tggagacaag gcctttgagc cccacccttg tcctagaacc 6481 taccccctct caaggatgcg ctctttattt ctaccctgtc tctccccgcc acccccgact 6541 tcccgtggaa attcccaact cggttctcat ggaggagtgg gtggagacaa ggagggagta 6601 agtcgtagga gtacaaggtt tttatttttt ttaacagtga ttaaaatatt tattggtcat 6661 ttacttggct tcccgataac ccgtgtgttt gctggggacg cggcacagat agggggaagc 6721 cggagtaatg gttttcgggc aagtggatgt tggagagcac acacaggagt tggggggcgg 6781 gggagggcct ggggttgggg agggctcgaa ctcggggctg ctgggtagtc caggagggcg 6841 cggtaaggct ggggtgtcct ggtgagaact ggagaggatc tacccgggtc cctgcctggc 6901 cagtggggaa acaccggtcc cccaggcacc ttcacctaac cagagcgggg atttccaccg 6961 cccctcatgc cgccctttgg aggaaagtga aagtgaaagg aggaagagga ggcttcatgg 7021 ctgaggaggt cgcagcgcca tgaagtccct gtctctgctc ctcgctgtgg ctttgggtga 7081 gcgaaccccg cgactcatcg cccaagaact agagggaagc ggagggaggt ggccccactg 7141 gagccgatgc cagggtggga gtggggcagg tcaccagaca tacaaaccgc tcctcactcg 7201 cgtccctata cgccaggcct ggcgaccgcc gtctcagcag gacccgcggt gatcgagtgt 7261 tggttcgtgg aggatgcgag cggaaagggc ctggccaaga gacccggtgc actgctgttg 7321 cgccagggac cgggggaacc gccgccccgg ccggacctcg accctgagct ctatctcagt 7381 gtacacggtg agtctctagg gactcgccgc ccccctacct ctgtcgcctc caccgaaacc 7441 ccctcctctt tagatccggc agtgacctca ggcctcagct tcccctttgt aaagtgagtc 7501 tcactacggg gtagtctgtg catttgaagt tccccgaacg ctgcccttcc agcccctttc 7561 ccggcggtga ctctacagct gcaacttcct tctctacact cagaccccgc gggcgccctc 7621 caggctgcct tcaggcggta tccccggggc gcccccgcac cacactgcga gatgagccgc 7681 ttcgtgcctc tccccgcctc tgcgaaatgg gccagcggcc tgacccccgc gcagaactgc 7741 ccgcgggccc tggatggggc ttggctgatg gtcagcatat ccagcccagt cctcagcctc 7801 tccagcctct tgcgaccaca gccagagcct cagcaggagc ctgttctcat caccatggca 7861 acaggtagct ggggagggga ggtggagaag ggtgggtaga ttctaagggt ccagatcagc 7921 cggtggtctc gctttatgga cttgagcaag acatttcgcc aatcggggtt cagtttcctt 7981 tttttgttaa gagaggtggt ttggctagaa tgaatcaatc tctaccgctc cttctagccc 8041 tcacccagtt aaaaaaaaaa aacaaaaaaa aaactgcagc ttggcagggg gtagggggaa 8101 agcagggccg ggcagggggg ttactttctg gtgtttcgaa gggcggggct ttgaagaggt 8161 ggggtttccc gacaccagac cttgagagac ttgtcaggtg atgcgaggtg ggaggggttc 8221 agaagcagga gcgttttttc tctgctcttc ccagatctgt gtcttgctct gcaccctcag 8281 ctttgcgggt cactctttca aaacccagca ctctatcccc acctgcgccc acacccggcg 8341 ctgcggcaat ccgcaaagca aaggctcttg cactctagcg cgttgacctt gcctgacagc 8401 cccgtgtagg gatgtgctgc cgctctgtgg tccgcaaaca ggtgggctcg agtgggcgga 8461 tggacgcggc tggagattgg atagctccta ttgcactgaa aggtgcattg cagtttgagg 8521 cccaggagtc agaagctttt ctaggctggg acgtgggagg gactgtgcag ataattcaag 8581 aatgaggaga gtctgacctg gatatctggg agctctccct cctggcagcg gaggcggggt 8641 attccggggt agaataccgg tagagttctt agcttttcta tctggatttt atgtcagcga 8701 cacagcccga tggacagggc aaccccaacc aatccctgca ccctcccttc cctccaccct 8761 cctgagaatt acccacagaa ccagggagac agacttgaaa cccacgctga aaccctcacc 8821 tctgcctata cctccattct tagtctcaag cccccaccct ggcagaagct gaggaacact 8881 gctcttgtgt agacagtagc tctctctggt ggggtggaat atttcctagg gctactggct 8941 ccaccgaggt gatgatgaga atggtctttg ctttctgagt ttgtttttcc aactgtaaaa 9001 tgtggatact atttcccagc tgcttttctt gttgtgacta tcaagagatg gcactgtgtt 9061 aatttatcca ttgattcaat aaatactttt tagcaccaag aatgagttag gactcagcag 9121 ttctcagtcc acactgttgg aggtttctaa aatatactga aacctgattc aatcccatgc 9181 ttagcttggc cgattaaatc agaatctctg gggtggccct gggtgccctt aagggtcaca 9241 aagatgatct gatgtagact ctgctctcca ggaggttctg ggcgtatttg ccacggataa 9301 tcctaatgta aagtagaaag tgaaggacat gcagacaggg agccctggaa gtgcccgggt 9361 ggtaagattg tctgtatttc acaaacaact ggggtcaaag caaggtgctg tgctatttat 9421 ttatttttat tttatttatt tatttatttt tgagacagag ttttgctctt gttgcccagg 9481 ctggagtgca atggcgtgat ctcacgctca ccggaacctc cgcctccctg gttcaagcaa 9541 ttctcctgcc tcagcctccc gagtagctgg gattacaggt gcctgccacc acacccagct 9601 aattttgtat ttttagtaga gatgaggttt ctccatgttg gtaaggctgg tctcgaactc 9661 ccaacttcag gtgatcctcc cacctcgcct cccgaagtgc tgggattaca agcgtgagcc 9721 actgcgcctg cctgctattt atttatttat ttatttttga gacggagtct ccctcttatt 9781 gcccaggcta gagtgcagtt gcagtggcat gatcttggct cactgcaacc tccgtctccc 9841 atgttcaagc aattcttgtg ccttagcctc cccagtagct gggattacag gcttgtgcca 9901 ccacgcccac ctaatttttg tatttttatt ggagacaggt ttcaccatgt tggccaggat 9961 ggtcttgaac tcctgaccta aggtgatcca cccaccttgg ccttccaaag tgctgggatt 10021 acaggcatga gccaccacgc ccggcctgct gtgctctttc ttttcaagca tttctgcttg 10081 atcagggatt taagaaaatg aaagctgttg tactgatgta aaattagaaa tgaaatctct 10141 ccagggtcca agatcattca tgattttgct aatatacatt tatacttgtc ttatcaaata 10201 tttttaaatg tttattgaca aatatgctta tttatttatt tatttgagac agggtctccc 10261 tctgtcgccc aggctggagt gcagtgcagg ctggagagca gtggcatgat ctcagctcac 10321 tgcaacctcc acctccgtgg ctcaagtgat cctcgtgcct cagtctcctg agtagctgag 10381 accacaggca tgtgccacca tgcccagcta atttttgtat tttttttgtg gagatgggtt 10441 tttgtcaggt tgcccaggct ggtcttggac tcctgggctc aattgatctg cccaccttgg 10501 cctccgaaac tgctgggatt acacgtgtga gctgcctggc cccaaatata cttttttttt 10561 tttttttttt ttttgagaca gagtctcacc ctgttgccca ggctggagtg caatggtgtg 10621 atctctgctc atttcaactt ctgcctcctg agttcaagta attctcctgt ctcagcctcc 10681 caagtagctg agactacagg cgcccaccac caggcccagc taatttttgt atttttagtg 10741 gagacagggt ttggtcatgt tagccaggct ggtcttgaac tcctgacctc aggtgatcca 10801 cccacctcag cctcccaaat tgctgggatt acaggcgtga gccaccacgc caggcctata 10861 ctttcttaat atactttgtg tgtatagtgt ttaattctca ccattctttt aaaatattat 10921 ggagtcaggt gaggtgactc acgcctgtac ttccagcact ttgggaggtc taggtgggag 10981 gatcatttgc actcaggagt tcaagaccag cctgggcaac ataaccagac cttgtctcta 11041 ctaaaaatta aaaaaatgct gggcgcagtg gctcatgcct gtaatcccag cactttggga 11101 ggccgaggtg ggcagatcac ctgaggtcga aagttcaaga ccagcctgac caacatggag 11161 aaaccccgtc tgtactaaaa atacaaaatt agccaggcat ggtggtgcat gcctgtaatc 11221 ccagctactc ggaaggctga ggcaagagaa ttgcttgaac ctgggaggcg gaggttgcag 11281 tgagccaaga tcatgccact gcactccagc ctgggcaaca agcgtgaaac tgtctcaaaa 11341 aaaaaaaaaa aaaaaaaagg ttgggtgtcg tggcatgtac ctatggtccc agctactcgg 11401 gaggctgtgt tgggaggatt gcacctttga tgggtgacag agtgagaccc tgtctccaaa 11461 aaaaaaaaaa aaaaaaaaag agttaaactg taaaaccttt aggtcttgtt tattattttt 11521 ttaaattttt tcttttaaat agagatgggg tcttgctatg ttgcccaggc tggtctcaaa 11581 ctgctgggtt caagtgatcc tcccacctca gcctcccaag gtactgggat tacaagtctg 11641 agccaccctg cccagccagg ttttgtttat taattctatg gatttttttt tttgagatca 11701 agttttgctc ttgttgccca gtctggaggg caatggcatg atctcggctc accacaacct 11761 ccgcctccca ggttcaagtg attctcctgc ctcagcttcc caagcagctg tgattacagg 11821 cgtgtgccac catgcccgga taattctgta tttttagtag agacggggtt tctccagtct 11881 ggtctcaaac tcccaacctc aggtgatcca ccggcctcgg cctcccaaag tgctgggatt 11941 acaggcgtga gccactgtgc ccagcctgtg gatctttttt ttttttttaa caattcaaaa 12001 gtatgtttct ggattttgac aaactaagtg aaattgcata ctgtttttac tctactgaac 12061 aattccatga agatgagaac agtttaacct tataggcctc agaagctagg attaaaaaga 12121 acataaaaat aattacaaga agaaaaagat tacttctacc tgaaggacct agacatttct 12181 gggaagaaac agcagtgaaa gtgaccttca tgtacatgga gtatttagat caggggtctc 12241 caatcttttg gcttccctga gccacactgg gagaagaatt gtcttgggcc atgcataaaa 12301 tatacgaaca ctaacgatgg ctgataagct aaaaaaaaaa aatgtgaaaa aatctcataa 12361 tgttttaaga aagtttatga atttgtgtcg ggctgtattc caagccgtcc ttggctgcat 12421 gtggtccctg ggctggaggt tggacaagct tgatagatgt ttgggagaaa atgggaaaga 12481 caatgtaaac tgaagggacc tcatgaccaa agcaaagaga tgaccaatga tgaccacttc 12541 tactaggcac aagaaagggg ttctatgtgg ctggtttgta gggtgcagga gaagcctgga 12601 agggtgggtt aggggctgaa tgttcaaata ccttgaaagc tgggctttgt tctgtaggtg 12661 agagcgaacc atcagaagtt cttttggagg gcgtgttatt ctaaaaagcc tgtttttatt 12721 atgtacatat ttattttttg agacggagtt tcactcttgt cacccatgct ggagtgcaat 12781 ggggtgatct tggctcactg caacctctgc ctcctgggtt caagcaattc tcctgcctca 12841 gcctcccaag ttgatgggat cacaggcacc cacaaccacg ccccgctaat tttttttttt 12901 tttttttttt gtatttttag tagagatggg gtttcaccat gttggccagg ctgctctcga 12961 actcctgacc tcaggtaatc cactcgcctc agcctcccaa agtgctggga ttacatgtgt 13021 tagccaccat gcccggaaaa agcctgttat tttacttgat tcttttaaaa gataggcaaa 13081 ggtcagatct ttggactagt gaggaaaata caggttaaat gatttttcca agttaaccag 13141 caagtccatg atagtgatga aggtaatgaa aatagtaaca gctaacattt atcaagtatt 13201 ttctatatgc caggccctgt tctaagcact ttatatgcat aatctcactt attcctctca 13261 acaaccatag gttgtaggtt ttaatattat ccctgtattg catatggata aatataggac 13321 acaaaccagt taaataattt gcccaaggac acacagtgtg taagaagcag agctagaggc 13381 caggcgtggt ggctcatgcc tgtaatccca gcactttggg agaccaaagc aggcggatca 13441 cctgaggtcg ggagttcaag accagcctgg gcaacatggt gaaaccccat ctctaccaaa 13501 aatacaaaaa ttagctgggc acggttgtgg acgcctgtaa tcccagctac tcaggaggct 13561 gaggcaggag actcacttga acctgggagg cagaggttgc agtgagccaa gatcacacca 13621 ctatactcta gcctaggtga cgagcaaaac tctatctcaa aaaaaaaaaa aaaaaaaaaa 13681 aaaagaagca gggctagaaa ttgaacccag gccatatagc ttcagcctat gcctttatca 13741 gctgacttcc taggtatttg ggatttttat tttatttatt tatttagttt tgagacggag 13801 tctcactctg ttgcccaggc tggagtgtag tggcatgatt tcagcttacg gcaacctcta 13861 cctcccaggt tcaagtgatt cttctgcctc agcctcccaa gtagctagga ctacaggtgc 13921 ctgcccccac acctggctaa tttttgtatt tttagtagag acagggtttt gccatgttgg 13981 ccaggctggt cttgaactcc tgacctcagg taatccaccc gcctcaggct accaaagtgt 14041 tgggattaca ggtgtgagcc accacgcctg gccgtatttg ggattttttt tttttttttt 14101 ttgaaacggg gtttccctct tattgcccag gctggagtgc aatggcacaa tctcggctca 14161 ccgcaacctc tgcttcccgg gttcaagcaa ttctactgcc tcggtctccc gagtagctgg 14221 gattacaggc atgtgccacc acaccaggct aattttgtat ttttagtaga gacggggttt 14281 caccatgttg gtcaggctgg tttcaaactc ccgaactcag gtgatccacc cgcctcgtcc 14341 tcccaaagtg ctgggattat aggcatgaac cactgcgtct ggcccatatt tgggattttt 14401 aaacaggtag gtgatgggat cagacttgta gtcactttgt ttggctacag tgtgaagaag 14461 gagttgaagg agagactgag aatagggaga ccagacagga gggctgttgt aattgtctaa 14521 acgtacattg atgagaacct aaattggaaa cagatgtgag atctgttagg tcctggtggg 14581 gatagcacat agcacattat ttttgacaat ttgcttgtct gtcattggac tataggcaat 14641 ttgagagcag ggactgcttg tcttgttcac cattctgtct tctaccagga taccttgaac 14701 atcttcagtg ctcaatagtt gctgaatgaa taaatgagtc tgagtaactc caagggtttg 14761 cttttagtgt ctggcgggct tggtgatcat tcatcaaaga cagggaagat gggcattgga 14821 gcagggtggg gaatgttgca gagatgctgt tgacttgggg tatgctaagt taggtgtaac 14881 tatagcaaac acgtgtagac ttactgtgtt ccagacagtg ttgtaagcac ccttcatggg 14941 taggtgtctc tgttatctaa aattgggaac tgaatagact tgagtagcaa gaggtattaa 15001 tgatttaacc atcacaacaa cctcatgaag taggtactat tattatcatc aacttttaat 15061 tttagagatt aggaagctga gccatggaaa aaggaagtgg cccaaggcca tataaataag 15121 ctgcagaggg gatttgaacc cggtcagcca ggctccaggc acttagccac tgcactgtac 15181 tgcttcagta gagcatagag gtctgtgctc aggtctaggc tagagctgga accttctaga 15241 gtcagtaaat attctgtaca gcaccacaag actgacagtc atggctcaat cattgtcttc 15301 gcctgtaatc aagaaaggcc tggttcctca agggtggtgc ttggagatag agtggttcag 15361 gttgtgtcag aagtggcctc ggattctgca gtcccccgag ctagagtgtg ctatgtgttc 15421 agagtaggct tcaaaggcca taccccctca ttgcccctac tcaccacccc tagacctctc 15481 tgctcaccct cagccacact aggtgccacc cagtcgcaga gcaaaaggag tttgcaggga 15541 gggctaggaa atggtgagta tcagaaaagg tgtcctctga ggggtgggta aactgcagtt 15601 tagccctccc caataactgc atttctctat ttttttctcc ttctttctct ccctcatgcc 15661 cctcccaacc cctcatctcc ctgtcttcct cagtggtact gactgtcctc acccacaccc 15721 ctgcccctcg agtgagactg ggacaagatg ctctgctgga cttgagcttt gcctacatgc 15781 cccccacctc cgaggccgcc tcatctctgg ctccgggtcc ccctcccttt gggctagagt 15841 ggcgacgcca gcacctgggt aagggacatc tgctcctggc tgcaactcct gggctgaatg 15901 gccagatgcc agcagcccaa gaaggggccg tggcatttgc tgcttgggat gatgatgagc 15961 catggggccc atggaccgga aatgggacct tctggctgcc tacagttcaa ccctttcagg 16021 agggcaccta tctggccacc atacacctgc catacctgca aggacaggtc accctggagc 16081 ttgctgtgta cagtgagttg ggggacagag gtctccaggg gtagagggtg ggcactggat 16141 tgtggggacc gtaatagggg gagggatgat ggataagagg gtgcctgggc aagtaagtag 16201 agatagaaag aggttcctgg gagttagagg ggtaatggga ggctagaagt ttcctggaaa 16261 tttgaggggc tttgacatgg gtatttctgt gacgcaccaa tggagagaca gtgggttccc 16321 tatttcagga gaagaaacct aaccttcttt agttctgagg aagccagcag gcaaactgag 16381 ggtctcttag ggaggacagt atggactgat ttccctatgc tcatttcgtc ctctttcccc 16441 agaacccccc aaagtgtccc tgatgccagc aacccttgca cgggccgccc caggggaggc 16501 acccccggaa ttgctctgcc ttgtgtccca cttctaccct tctgggggcc tggaggtgga 16561 gtgggaactc cggggtggcc cagggggccg ctctcagaag gccgaggggc agaggtggct 16621 ctcggccctg cgccaccatt ccgatggctc tgtcagcctc tctgggcact tgcagccgcc 16681 cccagtcacc actgagcagc atggggcacg ctatgcctgt cgaattcacc atcccagcct 16741 gcctgcctcg gggcgcagcg ctgaggtcac cctggaggta gcaggtaaga gctgggagct 16801 ctgcggaatc tgagccagca cccaggaaga caggagctca ctctcacccc ttctgcctgc 16861 caggtctttc agggccctcc cttgaggaca gcgtaggcct tttcctgtct gcctttcttc 16921 tgcttgggct cttcaaggca ctgggctggg ctggtaagtg tcagccctac cctgaccatg 16981 acctgaggtt ggtggacttt ccccacctac tcccaagagc ctgacaccca tccctctgcc 17041 ctccacctct accactccag catcccatcc cttctcaatc tttccccaca gctgtctacc 17101 tgtccacctg caaggattca aagaaggtac agtgctccac ctctctgtat ctttcccttg 17161 tcactttatc tcctcatcct atctcaaaac ccatggaggg aggctgctgg tgtggtaggc 17221 agaacctagg cttggaattc acactgatct gggttaaaac ctggcactat atcctaactg 17281 taggactctt tgagcatgct acttaatcta tatgtttcct tgggtgtagg gattttaaaa 17341 agttacttag gcagtgctgt ccaatagaaa gataatgcaa gccacatatg taaatttaaa 17401 tactctagta ctcacattaa aaaaataagc agaaacaggt aaaattaact taataatata 17461 ccttatttaa tctaataatt ctggtccctt atctgaaatg cttgggtcag aagtgttttt 17521 ggatgttgga ttttttcaga ctttggaata tttgcatata tttaatatct ttgggatcag 17581 acccaaatct aaacacagaa atcatttatg tttcctatac accttataca catggcctga 17641 agattttttt ccttctcttt ttttttgaga cagagtctct gtcgcccagg ctggggtgca 17701 atggtgcgat ctcggctcac tgcaacctcc acctcccagg ttcaagcgat tctcctgcct 17761 tagcctcccg agtagttggg attacaggca cccaccacca cgcctggcta atttttgtat 17821 ttttagtaga gacggggttt caccatgttg gtcaggctgg tctcgaactc ccgacctcag 17881 gtgatccgcc cgctgccttg gcctcccaaa gcgttgggat tacaggtgtg aaccgctgca 17941 cccgaccaca atatttttaa taatatatgt gacccatcac atgagggcag ctgtgggatt 18001 ttccacttgt ggcatcatgt tggcactaaa aaaatttcag gttttggaac atttcagatt 18061 ttagagtttt ggatcggcga tgctcaacct gtacatccaa aatattattg caacatgtaa 18121 acaatataag aaattacagg ccgggtgcga tggctcatgc ctgtaatccc tgcactttgg 18181 gaggccgagg cgggaggatc acctgaggtc aggagttcaa gaccagctta gccaacatga 18241 tgaaacccca tctctactaa aaatacaaaa aattagccag atgtgacggc acctgtaatc 18301 ccagctactc gggagactgg ggcaggagaa tcgcttgaac ctgggaggca gaggttgcat 18361 gagctgagat cacaccattg cactccagcc tgggtgacaa aaggaaaact tcgtctcaac 18421 aacaaaaaaa aaagaaaaag aaattacagt cttggccagg tgcgttggtt cacgcctgta 18481 atcctagcac tttgggaggc tgaggagggc ggatcacctg aggtcaggag ttcaagacca 18541 gcctggccaa catggtgaaa ccccgtctct actaaaaata cagaaaaatt agccgggcat 18601 ggtggtgcac acctgtaatc tcagctactc aggaggctga ggcaggagaa ttgcttgaac 18661 ctgggaggtg gaggttgcag tgagccgaga tcgcaccact gcactccagc ctgggcaaca 18721 gagtgaggct ctgtctcaaa aaacaaaaca aaacaaaaca aaacaaatta gctgggcatg 18781 gtggcacaca gctatagtcc cagctacttg gggagctgtg gcagaaggat cgcttgaatc 18841 tgggagggtg aggctgcaat gagctaaggt tgtcccactg tactccagcc tgggtgacaa 18901 agtgagaccc tgtctcaaaa aaaaaaaaaa aaaaagaaag aaaagaaaat aaattgcagc 18961 ccttttctca ccttaagtat tcaaacacag gtgtttattt taaatagtca gcacatctca 19021 attcggacta gccacatatc aaatgctcaa tagcctaatg tgggtggtgg ctgtcatatt 19081 ggacagcgga gcctgagagg acttttagga ggattaagga agacagtgta tctccttaag 19141 aagatgcagg agatacagca ctcagcagtg tctggagggc agaaaatatt agcctcatcc 19201 ttctcccatc atctgtgccc atgatagttt atatgtctct aaactgacca tgtacacctc 19261 aactgccttt taatatcctg aattcctcac caccttcctc tcttccagaa agcagagtga 19321 gggcactcac tgccatcctg tggaagccac catcatctct ggcccaagct tctgtagtag 19381 ctccctaaaa taatacccta tcatctgctc ctaatccctc caatctctct ccactgagtg 19441 gctggaatgc tttttttttt ttctttcact tatataaggg ataatttttc tttttttttt 19501 ttttttgaga cggagtctca ctcttccgcc caggctgcag tgcagtggca tgatcttggc 19561 ttactgcaac ctccgcctcc tgggttcaag caattctgtg gcttcagcct ccggagtagc 19621 tgggattaca ggcacatgcc accacaccca gtgaattttt gtatttttag tagagacggg 19681 gtttcaccat gttggccagg ctggtcttga attcctgacc tcaggtgatc tgcccacctc 19741 agcctcccaa agtgctggga ttacaggcgt gagccaccac accaggcccg agaaatgctt 19801 ttttaaaaaa cacacatctt atggcattca ccttcttgga gctctaggac agtggttctc 19861 aaaatttttt tctctcagga cctcttaaaa atcatcaagg accccaaaaa gcttttgggt 19921 atgtgggtta tagctatcaa tatttatggt actagaactt aaaagtgaga aaaatttaaa 19981 acacgagaat acataggcac acattctatt catcgtggga accatggtgt caatacatat 20041 catgtagctt ctgaaaaact ccactgtaca cttatagaat gaagaaggca aaaaactttt 20101 tttttttttt ttttgagacg gagtctcgct ctgtcgccca ggctggagtg cagtggcgcg 20161 atctcggctc actgcaagct ccgcctctcg ggttcacgcc attctcctgc ctcagcctcc 20221 caagtagctc ggactacagg cgtcctccac catgcctggc taatattttg tattttttag 20281 tagagacggg gtttcaccgt gttagccagg atggtctcga tctcctaacc tggtgatccg 20341 cccgcctcgg cctcccaaag tattgggatt acccgcgtga gccaccgcgc ccggctgcaa 20401 ataatctttc tttttttctg agacagagtc tcgctctgtt gcccaggctg gagtgcagtg 20461 gcacgatctc ggctcacggc acgctccgcc tcccgggttc acgccattct cctgcctcag 20521 cttcccgagt agctgggact acaggggccc gccaccacgc ccggctaact ttttgtgttt 20581 ttagtagaga cggggtttca ccgtgttagc caggatggtc tcgatctcct gaccttgtga 20641 tctgcccgcc tcggcctccc aaagtgctgg gattacaggc gtgagccacc gcgcccggcg 20701 gcgaaacacg atattgtact aacatcttaa ttttgttata aaatctcaca aaccccctga 20761 catagtctca gagatctgta gggccgaggt tacatttgga gaacccgtac tctagggcca 20821 aatccattct tcttgccctg gctcacttgt cccccccacc gccccgcgct ggagccactg 20881 cctagttctt cagccctaga tggtgctcgc cagacctcct ctcaatgctc atcacacaca 20941 gggctattcc tttcctccaa tgaaccaaac gcctcccgcc cacctccagg tcccagtcct 21001 ctgttccctt tgcctggtcc acccttgccc tccctgggtc gcagacgagg tcggcctcgt 21061 cattccccgc agaccgccgc gcgtccctct tgtgcggttc accacagttg tatttaagtg 21121 atcgtgtgag tcgtcgttaa atgcctgtct ccccgcggat catgggctcc tcgaggacag 21181 ggactggcct gtctgtccac tgctgtaacc ccgcgccggc atagggacct aaggcccact 21241 ggagggcgct catcaagtag ctgctggatg ttgacgaagg aagcggcggc gcagctcagg 21301 gatctccgag tcaggacggt cggccagacc cacggggtaa cgggtctaat cgtgtaggaa 21361 taaagctgta ttccagtgct tccaaacggt tctctcattc caaccccttt ccaagctcaa 21421 tgaatattcc aatcacctct ggtcccttcc agtgaccctg aaccgctcag agaacgcccc 21481 gtcgacaccg ttcggggcgg gcgccactac tggcgcatga gatgcgctca agcagagggt 21541 actccaccag cgaggataca ccgagccgac ggggatctcg gtctcggcgc cggaagcctc 21601 tgagagccga atctggaacc ggatgtggct gcttcctgcc ccgccccctg ccgagggggc 21661 gggaatcgag ggcccttcgg aaaacccggc cgggatttcg gtcagctaca ttcgtgggtt 21721 gtggtaggga gggtaatccc gtggggatgg agtgaccggc tctctctccg cctgaaacgc 21781 ttcccgcggg agttttagcc gtgttattgt tggggaaggg aatgtcaaat tctgctcctg 21841 ccttccccca ggggctggag actggaatct ttattccagt tcattgcggg tcgacgtggt 21901 ctccagaatc cccgagtgta gaggcggagg acagtggagc tggggactcg ggcgcctggg 21961 tcctggtccc tgagcgtcca gcgctcccct agtccgcggc cctggctcct gtcctcactt 22021 ccttcccctt ttggctcctg ctctcggagc gggcggacgc gggggggagc ggaaagggcg 22081 ggagggccca ggagcccggc gggggggttc aagggggtac ggaaaagccg gggaggggac 22141 tcggtccggg gccggagacc gacggcaaca gcggctcagg acccacgctg cccccacccc 22201 tcccgagcag gtcagagcct gagcaccctg cgcgtccccg tactcgtgac ctcctccttc 22261 cccaacagca ccaagggccc ccctattcca cccttctgtt gccccctccc ccaaccttcc 22321 tgcagaactc ccttccggtc ttcttgacac ccacccacgt cccttccctg cccggttccc 22381 ccgcgcctgc ctgactctcc acccccaccc caccccgtct ctccccaggc gcccccatgg 22441 cccgaccccg ctgattcctt cactcggcca tgctcccgcg gcccctgcgg ctgcttttgg 22501 acacgagccc ccccggggga gtcgtactga gcagcttccg aagccgggac cccgaagagg 22561 gtgggggccc aggtggcctg gtcgtgggcg gggggcagga ggaagaggag gaggaagaag 22621 aagaggtgag acgcgggtgg tgggaggtag ggggcgtatg tatttctgaa gtctccactt 22681 ttctcttgtc tctgggtccc tacatctcct cctccttgcc tcagttattc tcgcgcctcc 22741 ctgggtgtcg gcccgtttct ctgctgcggg gagtcctggc gtctccgtgg ttctctgtca 22801 gtgtgactca ggacctgccc gcaggaacct gccctgttcc tgcctggaac ttctggctgg 22861 ctgtcctgtc cctgcgggtc tttggccctc cacctctgcc tgcctgagtg tgcctacctc 22921 tctgagcagt cgtgtcttgg tccctgtctc cctccttgtt tttgtttggc ggggaggggt 22981 ttgtttccac agtaaccagc tcctctgact ctgggattgg ggagggaacc ctcagatttc 23041 gtgacccctc cttccctctc tcccccttcc ccccgccccc ccgccccgcc ccggtcctgg 23101 gtagaggccg ggaagccacg gttctgggac ctggaggaaa gatcctaggg tgggtagaga 23161 gtggtcatct gttttcatta gggattaatg gtggtggcca ggagcaggat ttcgggcctc 23221 aggaggggcc agaatttgga tttggggccc cgaggcccta aactcctagg tccagatcct 23281 aggtgagggc aggttgactg gcagggctgg tttttcggga tcggtcgggg gaggggctgg 23341 ggccacgtga ctctctgagt ctaccgcccc caacccccta tcatcccact tccccttttc 23401 cttcctgtac tgggattgga gctgctacct tcgggggtcc cttagggctg agatagtgga 23461 ggcgctaccc gagggaccag ggtgaaaggg gtcctgaaaa agactaattc tctgggctgg 23521 tatagtcctt ggccgggagg ggcggggcat gcggcagtgc ctggactgag gtttcagcta 23581 cccctagtcg ctgcctctgg cttgctgcct ctcgcttccc tctcctgccc tttgtaccct 23641 gcttctctgt cgtgtgtggg ttcagctctg gggagaaggg gctttggggg tctccaggat 23701 cagaattcct gtactccatc cacgcctcca ctgcagcccc ttcgactcca ccggggccct 23761 gaggcggctg gggtggggct gtctgggccc cagtggggga gacctggggt caccagctcc 23821 cccaaccctt cctcgcactc gctggtacta tgccctgcca ccacaggtga ggggtacaga 23881 gggaggagca gggcctgggg tgaaggctga gggcaatggg gggtggtgtg gaactgtctg 23941 acttcccctt ccttgctcct ccaggcccct gtgtccgtct gggatgagga ggaggatggt 24001 gccgtgttta ccgtcacaag ccgccaatat cgacctcttg atcccttggt gagatcatga 24061 ctttagcttc tcacctctga ctcttcattc ccagttttcc ttcctctggc tgactttctg 24121 gccagagtgc taggttggag ttctggagtc ctggactctg ctgtgtgaca taggccaaga 24181 ctcttgccct ctctgaatat cagaaaaatg agtctggaac atctcctggg tctgggattg 24241 tcctcagaaa tctaggtgca gagtgggaga aagggttagc gatcatctct ctgtgttctc 24301 caggtcccta tgcctccccc acgttcctcc cgacggctcc gagctggcac tctggaggcc 24361 ctggtcagac acctactgga tacccggaca tcagggactg atgtgagctt catgtcagcc 24421 ttcctggcta cccaccgggc cttcacctcc acgcctgcct tgctagggct tatggctgac 24481 aggtcagagt cataagggac gcagggtagt ggagtatctg cccggatttc ctaaagccgc 24541 aacatcccac caaaatagtc aaagccactg agggtttggg aaaaatggac agatagatcc 24601 tcttctcccc ctcaggctgg aagcccttga atctcatcct accgacgaac tagagaggac 24661 aacagagtga gtgacccctg gttcttaatc tcacagtccc ctctgcccag gctgcagctt 24721 tacctctcac ccttggctga ttcccttctc ccagggtagc catctctgta ctgtcaacct 24781 ggctggcctc tcaccctgag gattttggct ctgaggccaa gggtcagctt gaccggcttg 24841 agagcttctt acttcagaca gggtatgcag cagggaaggg tgttgggggg ggcagcgctg 24901 acctcatccg caatctccgg tcccgggtgg acccccaggc ccccgacctt cctaagcccc 24961 tggccctccc cggcgatccc cctgctgacc ccacggatgt cctggtgttc ctcgctgacc 25021 acttggccga acagctgacc ctgctagatg cggtgagacc ctgacctctg gcccctatgc 25081 cccctgaccc cttactaacc tctccttgac tccagatcct ttccttgctt ccaccccttc 25141 ctgatccagc tcctagcctc ctccacctac cattttgatt ctcccctttt ccctttggtc 25201 gttacccttc aaaccctgtg cccctctggt tccttggcca cctgagatcc tcagatccct 25261 gatattggaa ggctgacctc acttgactat gaactttaac ctctgacctc taccctgcag 25321 gaactttttc tcaatttgat cccctctcag tgcctgggag gcctgtgggg tcacagagac 25381 cggccaggac attctcacct ctgcccatct gtccgagcta ctgtcacaca gtttaacaag 25441 gtggcagggg cagtggttag ttctgtcctg ggggctactt ccactggaga gggacctggg 25501 gaggtgacca tacggccact ccgtccccca cagagggccc ggctcctgga gaagtggatc 25561 cgcgtggcag aggtgagaga gaagattgcc ctacggtttg tggccatgga atggagaggc 25621 ctcccatagc cttagctctc ttctttgacc cccacaggag tgccggctgc tccgaaactt 25681 ctcttcagtt tatgccgtgg tgtcagccct gcagtccagc cccatccaca ggcttcgggc 25741 agcctggggg gaagcaacca ggtgcggagg ctgaggcatt ggactggggt gggggttcct 25801 cagaaggcgg ggaagagggg gtgacagagc caggcctgtc ctcagggcca cttttctccc 25861 ttccccaggg acagcctcag agtcttttcc agcctctgcc agattttctc cgaggaggat 25921 aattattccc agagtcggga gctgctcgtg caggtgagag cctggtttgt ggcattccca 25981 cctctccttg ttccctactg ctcccccatg tctcttttat tttgtcacca ccaggaggtg 26041 aagctgcagt ctcctctgga gccacactcc aagaaggccc cgaggtctgg ctcccggggt 26101 ggggtgagtg actagcgggg tggtgggcgt gggggtggtg attatgaagg ggtgaggagt 26161 atgttttggg ggagcagcct gaaggatggg taagggaggc acagaggggt gaggagaatg 26221 aaatctgttc tctgaggcaa tgaggagggc aagtgcggag ggatggaggg ccctcacgtc 26281 aatggcaggc acttctgacc ctgacccctg acttcagggt gtggtcccat accttggcac 26341 cttcctgaag gaccttgtga tgctggatgc agcctccaag gatgagttgg aggtcagtgg 26401 tgtgtgtgtg tgtcattgca atgatgtaac aagtgggctg cagggtcgga cagacccccc 26461 cgccccttgc aaggggctgt aaggcacaaa tagctgagtg tgggacagag ccagatctgt 26521 ttctgtgcca agtcgcctca gtgatctgta aatgggaaac tagaaatagc atctacattg 26581 atctgttata agggttaaat gaatttgtat ttgtcaagtg cttagaacgt gcctgaaacg 26641 taagcacatc taggttttgt gatgggctcc tgctctattg ctagagtttg ttaaataaac 26701 aaatactgag accttggaca agttccttat cctctctgga taaggaatcc tgtcgaattg 26761 gatactatcc agtcctctca tctgcaaagc agactcgttg tctacctgag aaattcaatt 26821 taaagggtca agtggagtgc ttaggccatg atagattgag gaggaaggca gggagggatc 26881 gaagagtcag agaggctgga gggaagtggt tgtgagttgg atgtggctgt gtgtgtttaa 26941 gagaggggat attctctctc tgtggccgga aggtagggtt ggaactttca ccccatcccc 27001 attttcctcg cagaatggat acatcaattt tgacaagcgg aggaaggtga gcggagtgtc 27061 tgggctggat gctggacttc cctatccatg ttccaggaag gggaggggga aaagtcaggg 27121 gtccctgagt tttggctcct gcagtttgag ggctccttcc caggagtttg cagtcctttc 27181 tgagttgcga cggctccaga atgaatgtcg tggctataac ctccaacctg accatgatat 27241 ccagaggtgg ctacaggggc tccggccact gacagaggct cagaggtgac tggcgggtga 27301 ggttgggact ccagggttct ggccaggtgt gggggattgg tgtcatgtcc tgagctctgc 27361 ctattgcccc cctaacccag ccatcgtgta tcctgtgagg tggagccacc tggttccagt 27421 gaccctcctg ccccacgggt gcttcggcca acattggtca tctcgcagtg gacagagtga 27481 gagattcact ggctgggggg tggtttcttg ggacttccac ttttctctga tcctcccaat 27541 ctcctgccac tgcagggttt tgggctctgt tggggtccct accccgcttg tgtcctgtga 27601 ccggcccagt actgggggag atgaggcgcc tacaactcct gctcctctgc tgactcggct 27661 ggcccaggtg agctctgctc ctgactctga ccttgacgct gaccctcact ctgtaaatgt 27721 ttcttcctta aacctccctg ccttcatgcc agtcccatgt tgtttgttcc tagcacatga 27781 agtggccatc tgtctcgtca ctagactctg ccttggaaag cagtccatcc ctgcacagtc 27841 cagctgaccc cagccacctc tccccaccag cctcctcccc taggccttct cgaggtcacc 27901 gccgctcagc ctcctgtggc tccccgctga gtgggggtgc agaagaggcc tccgggggga 27961 ctggatatgg gggagaggga tctgggccag gggcctctga ttgccgtatc atccgagtcc 28021 agatggagtt gggggaagat ggcagtgtct ataagagcat tttggtgagg gagccttggg 28081 atggagttgg ggtgaaggat ggtgtcttgt tggactataa atgttgtctg aggtttgggc 28141 agccaaatgg aattgaatgt agttttaatt ttgacctttg gttttggcag ttatgtttta 28201 tttttaattt tatcttctaa actctagtct tcaatagaag ctgaaatttg ttttattttg 28261 atggctattt tgcattagtt gctacacctt aaacttttgg tcgccgtgtt aaaatttcag 28321 aatcaacagc agtttcccca cagggaagca gctctgtttc actgggcacc atgtgggctg 28381 gcttatgggg tataggaagc caccatattg ggttttcctg ttcattttaa agccttggac 28441 acaagtaaca ttttgtggtt gggttcagtg gctgagggta gaaggatggg aaatcatggc 28501 tgattgattg ctcttttggt ctcctgttct gccaggtgac aagccaggac aaggctccaa 28561 gtgtcatcag tcgtgtcctt aagaaaaaca atcgtgactc tgcagtggct tcagagtatg 28621 agctggtaca gctgctacca ggggagcgag gtcagaggcc atgagggaaa ggcagactcg 28681 ggaggagagt ggagtacttc cacatctggg cggctgtggg gggaacaact gtgtgtgtgc 28741 tttacatcca tcccctgaac cttcagagct gactatccca gcctcggcta atgtattcta 28801 cgccatggat ggagcttcac acgatttcct cctgcggcag cggcgaaggt cctctactgc 28861 tacacctggc gtcaccagtg gcccgtctgc ctcaggaact cctccgagtg agggaggagg 28921 gggctccttt cccaggatca aggccacagg gaggaagatt gcacgggcac tgttctgagg 28981 aggaagcccc gttggcttac agaagtcatg gtgttcatac cagatgtggg tagccatcct 29041 gaatggtggc aattatatca cattgagaca gaaattcaga aagggagcca gccaccctgg 29101 ggcagtgaag tgccactggt ttaccagaca gctgagaaat ccagccctgt gggaactggt 29161 gtcttataac caagttggat acctgtgtat agcttcccac cttccatgag tgcagcacac 29221 aggtagtgct ggaaaaacgc atcagtttct gattcttggc catatcctaa catgcaaggg 29281 ccaagcaaag gcttcaaggc tctgagcccc agggcagagg ggaatggcaa aatgtaggtc 29341 ctcgcaggag ctcttcttcc cactctgggg gtttctatca ctgtgacaac actaagataa 29401 taaaccaaaa cactacctga attctactcc cctgtccttg cagtacatat gaactggctg 29461 ctatgagtgg gggtggggaa ttggctgaag gtagatgcca tggaacagga aggggcacaa 29521 tgttttctgt ccccatgaac agagcaaaaa gtgaggtatt ggtgaaaaaa gtttcctcag 29581 aacagtttct ctccagtaac ctctctacca ctggtccctt gctgcaaatg tgtataaaac 29641 aattttagga caaggaaatg gcagaacaac tttatggaca acatataaac ttgggaggac 29701 ttataatacc ataaagcatt acatgctggg attttaagtc agtttaaagc tacattcaag 29761 caaattctag gaagacagag gctgaaaagt actaatagag atgccagggc tgttggcaag 29821 gagggtaacc acatttcatc ctggggcctc aggttagaga actgtgtatt ctttttcaat 29881 taaaccgaca ttcctagcaa tatggtatgt tgcaaatgcc cttccaaaag aatcatgcca 29941 accaagatga gggagagtta agggttgtat atacaggcaa gaagtaggtt tgaagacatt 30001 taggatttca ctttcaagaa agaacactgt gccacaaaga gattccaagt gcaagagttt 30061 gaacctgagg gggtaataca ggtggtgtgt gttaaggtgg aaggggctgt ggggatatga 30121 cagaagtgcc tggtcaggaa acagagccag gtgtttttac attttattag ctacagtata 30181 gatcctagag ctgcctcatt ccctcccctc ccctcccctc cccccaccat ggggtcaggc 30241 cttgccagga gcccctgcct ttgctgcctg ggcccgctgg aactcctgct gcagctgagc 30301 aagggtctcc ctctgttgct ctgactgccg ctcaagatcc cgaagctggg attcgtatcg 30361 cttactaagg agagacaagg gagacaagca gaaagggaga aattagtagg actcactatg 30421 gagagtgcta aacgcaagaa agagataggt ggaaaacttg agatccaggg gagggagaag 30481 aaggggggtt aaaagggaag gaaagggttg gaggaactgg aggactaggg gccactatgg 30541 agagctgttc tgcaatggtt caacacctct actccaatgg ttcacttgca tcacagggtg 30601 cagcacacgg tggtggaata aaaactcaca tttcagctgt gatatagtcc agcctcttcc 30661 ctactgtggc ccgagcctcc cccagctcct gtttgactag caccggaccc agaagtttaa 30721 agaccacgtt ggacccatcc agcagggcca gttcctgatg gagggatggg acatatgtga 30781 tcagaaccat ggctagtaca ggtccctcct cgccccacaa atcccagtcc ctcacctctt 30841 tcacgatatt attttctgtt agttgtgctt caagtttctg cctccccgac atggatttac 30901 ttaagtctgg ggatgagagg aaaggcaatt tagaaacatc aaatgtgcca cctatcccca 30961 caatcctagt caggtggata tggatctgag gaagcagaaa aggatggggg cagcgggtga 31021 gtgaacaggg aagagtactg agactgaggc ggggaagtct ctgcacgcga gggcccagat 31081 ggggcaacaa accttatctc ggtaatggct ttgggttgta agtgcattgg gcgagaccat 31141 accgaccctg ttcccttacc cttctgtagc tgttgatatt tctccacttc tccctgtagc 31201 ttcttctgga tcagctccgc catggcgggg atgaaagcct actgggtgcg agacaagggc 31261 gctgggtctc actctctgga aggtacctga gctccctttt ctgatctggc ctgcggaaat 31321 gatagcactc ttgagaggta gccctgaggg gcaccgcaac ctcccccccg tcccggatat 31381 cgactccacc ctgtccccag gagatggtgg gcgaaacgca agaccccaag cctctcaacc 31441 gaaatcctaa cctactcacg ttcttccttc agtagtaata aacccggaag taaacaagga 31501 atgctgggaa aagactgtgc cggaagtttc tttctgaccc ttgatgggaa gtgtagttct 31561 gacatgttta gggagtggag ttctgctaaa aagactagac taggacattt ttaggcaaga 31621 tggggaaact aatccagaaa aagaaaccgc accgctactt agcccctggc tccggacctg 31681 cctttctttt ttaagtcact cttgcataag ctgccgctcg cgataaggtg ccaaaaactg 31741 ttccgcccct ctaaggagag cgtgccctca ctcaagatgg cacctagaga gcttcatacc 31801 tggtacgctg ctgattggat gaaggacaga gggcttccgg gagttttcaa gccgactgtg 31861 tggcagctga gaggagtttt gcacgtggat cgccgttcgg gtgggcgaga tggagacagc 31921 ccccaagccg ggcaaggatg tcccgcccaa gaaagacaaa cttcagacca agagaaaggt 31981 agaggcctcc ctgggtggga aacgaagttt ttagctgtgg ggtcgggggg cggggcgtga 32041 gtgcggagtt cctgatgtgc ctgtagaaac cgcggcgata ctgggaggaa gagaccgttc 32101 cgaccacagc cggagcctct ccagggcctc ctcgtaacaa gaagaatcgg gagctccgtc 32161 ctcagagacc aaaaaatgct tacatcttaa agaagtctcg gatctctaag aagcctcagg 32221 tcccgaagaa accccgagaa tggaagaacc cggagtccca gcgcggcttg tccggggtga 32281 gcgtgggacc tgatgggtgg cgaggcaggc cgcctcgttc cttgagagtg agagggttgg 32341 ggtcaatcca aggctgtctg ctaattgcgt cctcctttcc cctaggccca agatccattc 32401 ccaggccccg cccccgtccc tgtggaagtg gtccagaagt tctgtcgcat tgacaaatcc 32461 cgaaaggtga ggtccagccg gagagttggg aagtgctgga ggcagggagt gtctgggtga 32521 gttggatgga ggctgggaca ggtagctctc ctgtgacccc acttatcctg tggttcttct 32581 gccactcctc cccaccaaca atgttacagc taccacattc taaagccaaa actcgaagcc 32641 gacttgaggt ggctgaagct gaggaagagg aaacaagtat caaagctgct cgttctgagc 32701 tgctgcttgc tgaagaacct gggtgagtga gccctaatct ggacccccat tccctgcctt 32761 ttgggactgt cttttctcgt ttttatgttg gtatacttgc ttcatattgg gaagctttac 32821 tgccatccta acccttgctt tcaggtttct ggaaggggag gatggggaag acacagcaaa 32881 gatatgccag gctgacattg tggaggctgt ggacattgca agtgcagcca aggtgagcct 32941 gaggaggtaa aggagccaag ggattgattg gtggtgcagg acaatagagg aatgggggct 33001 agaagaaggc gttactgcag ggcactcttt tttttcactc ttctctttcc cagcactttg 33061 acttgaatct gcggcagttt ggaccctaca gactaaacta ctctcgaact ggaaggtaag 33121 gttgaattct agtgactctt gaactaagat gtgtttcctt aaccacttca gccattccca 33181 gtgtatgttt gggttgctga tgaggggagg gtccttcgat ttgcttgggt gtgagggtaa 33241 gcacctacag caacatgtgt ctgcccgcct ggagagatgg ggctggcgtg gggcagacct 33301 caagttgtct gagtcggtgg tcccctgcct taacaccctg cctgcccctc acctccaaca 33361 gacacctggc ttttggaggg cgccgaggtc atgtggctgc ccttgattgg gtaacaaaga 33421 agcttatgtg cgagatcaac gtcatggagg cggtgcggga catccggtca gtggcctcac 33481 tgtcagcggt cagttggggt gagatagtcc attcctgatt gaatgatagc ctgtgacctc 33541 atttcccaat tgaaccactc ttcctctccc ccaggtttct ccattctgag gcactgcttg 33601 ctgttgctca gaaccgctgg ctccacatct atgacaatca gggcattgag ctccactgta 33661 tccgccgctg tgaccgagta acacggcttg agttcctgcc cttccacttc ctcctggcta 33721 cagctgtgag tggccatgga gctcaggaac tggttggaag cccttgggat gaccacctct 33781 cctttaggac cccagcagag ggaatacaga gggcaatcag gactgggtca ttctctctgt 33841 ctttctctct cagtcagaaa cagggtttct aacctacctg gatgtgtcag tggggaagat 33901 tgtggcagct ctgaatgctc gagctgggcg gctcgatgtt atgagtcaga acccttacaa 33961 tgccgtcatc catctcggac acagcaatgg tcagtacctg gcttagtttt gactctgacc 34021 atcctgactt gcttttcttc tatatttgta cttcatgagt cccttaaagt taccctttta 34081 tttccctttt ttgttatctc ttggtcttga gttcccatct ttcccatgtt tagtaacctc 34141 aggcttaggt gtgtattagc aactttggtt cttcttctct tccaggtact gtgtctttat 34201 ggagtccagc tatgaaggag ccactggcaa agattctctg tcatcgtggt ggggtccggg 34261 ctgtggcagt agattctaca ggcacgtaag tcactggtgg ggtgaggtgt taggagtcat 34321 aggtgggcag aaaggtgtgg aaggcagtgt gctttaggag cacagagtct aaagccagga 34381 tgcccaggtt taaatcgcag tgttaccacg gatgggcctt gcaagtatag gcatatttca 34441 taacctctgt gtgccacagt ttcctgaccc cgaaaatgga aatatgagtg tccatttcaa 34501 gggtccacaa actttttctg taaagagtca gatagtaaat attttatgat ttgctgataa 34561 gaggtaaaat caaagggtac catgtaggca tttaaatacc aagagaaaac aaattatcac 34621 aacttttttt tttgagatgg agtctcgctc tgacacccag gctggagtgc aatggcgcaa 34681 tctcagctca ctgcaacctc cacctctcag gttcaagtga ttctcctgcc ttggcctctc 34741 gagtatctgg gactacaggc gcctgctaca acacctggct aatttttgta tttttagtag 34801 agacagggtt tcagcatgtt gttcaggctg gtctcgaatt cctgacctca agtgatctgc 34861 ccgcctcagc ctcccaaagt gctgggatta taggcgtgag ccactgcgcc tgaccttttt 34921 ttttttaaat cttttgagag agacgtagtc ttgctctgtc tcccaggttg gagtgcagtg 34981 gcgtgatctc ggctcactgc aacctccgcc tcccagattc aagcgattct cctgcctcag 35041 tctcccaagt agctgggatt acaggcacct gccatcatgc ccagctaatt ttgtattttt 35101 ttgtagagac ggggttttac tgtgttggcc aggctggtct tgaactcctg acctcagatg 35161 atctgcccgc ctcggcctcc caaagtgttg ggattacagg cgtgagccac tgcgcctggc 35221 ccacacattt ttaggttata aaattaaacg taatatggcc aggtgcggtg gctcacgcct 35281 gtaatcccag cactttggag gccaaggcgg gtggatcgcc tgaggtcagg agtttgagac 35341 cagcctggcc aacatggcga aactctgtct ctactaaaaa ttcacaaaat tagccgggtg 35401 tcgtggcggg ggcctgtaat tccggcaact tgggaggctg aggcaggaga attgcttgaa 35461 cctgggaggc aggggttgca gtgagccaag actgtgccat tgcactccaa cctgggcaac 35521 aagagcaaaa ctccgtctca aaaacaaaca aacaaacaaa acataatatg agactggaca 35581 cagtggctca tgcctgtaat cttaacagtt tggtaggctg aggtgggcag atcacttgag 35641 cccagaagtt cgaaacaagc catgtcaccc atgacatggc aaaactctgt ctctacagaa 35701 gatagaaaaa ttagccgggt gtggtggtgc atgcctgtag tcccagctac tcaggaggct 35761 gaggtgatcc tcccacctca gcccaggagg ttaaggctgc agtgagctgt gatcatgcca 35821 ctgcactcta gcatgggcaa cagagtgaga cccggtctca gaaaaaaaaa ataataataa 35881 tcaaatatat ttgtgtaata cagatctact aatgggaaga actgaatttc tctttttgag 35941 gttaacattt tgcctaattg atgtacaaag ttagtgttcc agatggtcaa atttgactgt 36001 agatattcat gttcatgctg atctgtagag attgcaagta tttcatcttt gaaaacatct 36061 tttcacacag gtaatgatag gtgatatgtg aggtgcttga aatgctgtga agcacttacg 36121 actgtgtcac tgtgacttgt agtgtacaaa gcagcagtgc aaatcaggat gctgtagtcg 36181 ctgtcgtgac cactcggctg tgtgttgtag agcaaaagca gccacagtat taagtaaata 36241 gtgtggcccc gttccaataa aactttattt gttggatatt ggaatttgaa tttcatacgg 36301 ttttcacagt ctcaaaatat tcttttgatt ttttttcaac cacttaaaaa tgtaaaaacc 36361 attcttggct tgtgggctat acagaactag atgatgggcc aggtttggcc catgggcagt 36421 agtttaccaa gccttggttt aaagccctca tataagctgt tgtagacatt aagatgagtt 36481 aaggcatata gtttagcaca gcgcttagca aaatagggag cgctgtgttc atcattgtca 36541 ttcaagatga tcgttctctc caggtctggc tgatgtgagg ggtggaggtg gtgtctgctt 36601 tggatttctg ctgttcctga gggagtatct gcattttcca cagcttttct gtctgatttg 36661 tattttcctc tgattccttt tgtcatcagg tattcactgg gcacctgctg tgggcagggc 36721 tctgcgctga ggttctggag acaaaaggat gaatcgttga gcctgccctg gtgtggtgct 36781 cccgttatcc tttaggtata aaaacttgtg gctatttttt tttttttttt gagacagagt 36841 cttgatctgt cacccaggct ggagtgcagt ggcacaatct cagctcacca caacctctgc 36901 cccccggggt tcaagcgatt ctcctgcctc agcctcctga gtagctggga ttacaggcgc 36961 ccacgaacca cgcccagtta attttttaat gtttagtaga gatggggttt caccatcttg 37021 gccaggctgg tcttgaactc ctgacatcgt gatccacctg cttcggcctc ccaaagtgca 37081 ggtgttggga ttacaggcgt gagccactgc acccggccat ggctatggtt tttgagaatg 37141 attggccagg tgatgcattt atttatttta ttactatttt tcgagacgga gtcttgatct 37201 atcacccagg ctggagtgca gtggcgcgat ctcggttcat tacaacctcc gcctcctagg 37261 ctcaagtgat tgttctgcct cagcctccaa gtagctggga gtacaagtgc atgccactgc 37321 atgcgctaat ttttgtattt ttagtagaga tggggttttg ccatgttggc ttggctggtc 37381 tcaaactcct gacctcaggt gatccgccca cctcggcctc ccaaagtgct gggattacag 37441 gcatgagcca ccacgcctag ctcaggtgat gccattagtt tctacacagt tatcctctgt 37501 catcccagac tcaatgtgtc catcactgga taggtcaccg cctcctaaag tttctcttga 37561 cctgctctca tctcccagaa ttttcctgtc acccagaatt tgactcaggt acacaccacc 37621 acacctggat aatttttcta ttttttgtag agatggggtt tcaccatgta gcccaggctg 37681 gtctctgtct tgaactcctg ggctcaagcg gtcctcctac ctcaacctcc caaagtactg 37741 ggattacaga gtaattactg tgagccacca catccagctt ggccaccagc ttattcaaag 37801 tctccaggat gctgggacca ccccccccac cacccgctcc ctgtttttgc ataattgtat 37861 cttttttttt tgagacggag tctcactctg tcacccaggc tggagtgcaa tggtgtggtc 37921 tcggctcact gcaacctcca cctcctgggt tcaagcgatt ctcctgcctc agcctcctga 37981 gaagctggga ctacaggtgt gtgccaccac acccggctaa tttttgtatt tgtagtacag 38041 atggaatttc accatgttgg ccaggctggt cttgaactcc tgaccttgtg atccacccgc 38101 ctgtcctccc aaagtgctgg gattataggc gtgagccacc gtgtctggcc tttgtaattg 38161 tatctttgtg aatgagtgat ttggttctgc cctttttact ccatatttat accagcctgg 38221 ctctaggaga gtcagaaggc ctgcccaagg gtttctgccc tctctgggcc actgggccaa 38281 agctgtagct tgccctccgt gggctacctg ggtcagccac tcctgtgctt agggcttaat 38341 cactagttcg ctgaggcctg aagtttaatc aacaccttga ggctaaacag ctctggtcct 38401 tgtgatcttc agcccatccc tgcttttctc tggtcctcct caggagcttg tcaggccacg 38461 ggggctccag tgatgaggct gaccatcttt cttaagaggt gttctggtag cttgtattat 38521 gattggattg cgttgacttc tcaaagccga actgctgctt actagtagta cagtagactc 38581 ccatttggca ctgggctggt ctttacccag gggtcccata gatgggtgtg tggaggggag 38641 aagagccagc atgcatcctt gagttgttag ttcaccaaag gatgtgaccc tgtgttccac 38701 gagcgctcgg taattcctgg tttgcactga tggcctttct ctttggtaaa gtcagggcgc 38761 tgattatctt gtcggaagta tccagctggt cctctcttgg gctccatcac tgtgcaagtc 38821 tctgctcata agaactcttc agttcccttc tcctgggttc acagccaagg caaaaaagag 38881 actccttggc ctcttggcaa acatacatcc cctacctggc tggtccccct tcagcttccc 38941 tctctaagga acaacttgaa catagaaact ccaacctcac ctgtaatccc agcactctgg 39001 gaggccaagg tgggcaggtc atttgaggcc aggagttcaa gaccagcgtg gtcaacatgg 39061 cgaaaccctg gctctactaa aaattagcgg tggtggcgca tgcctgtaat cccagctact 39121 tgggaggctg aggcacgaga atcacttgaa ccagggaggc agaggctgca gtgagccagg 39181 gtcatgccac tgcactccag cttgggcaac agagagaccc tgtctcaaaa aaaggaactc 39241 caacttctga aagtaaacaa cttatgcagt acatgattcc tggagaggat ttggtcacca 39301 accttccctc cttccttcca gggaaagcac acagaggggc agggcaaggg cctccagtgc 39361 attacttcac cacttccctg agtaggtctt ctcaagtcct tggagccttc actttgtgat 39421 ttgtcactgc cagagagcaa tgttgggatc atgagctgtg ggtcctcagt ctgactttaa 39481 agaaacctca caaattgtct gctcctgatt ttcctgacca tgtttcccac caatcctaac 39541 aaacctctgc ttccctctca gtctcaacac tctggtgtta atgcatttga acctgctgtg 39601 tatctacttc ctgtaatttt ttgagaatta tctagaatgg ctcaaatccc acctttccca 39661 tgaagccttt ctgaccactg cacatgctcc ttccccttgc tgggctgtgt gctgcatgtg 39721 tgtacagggg gacatcacat ttgggtgcag cctgtcgtgt gggcgttcct tttgacgccc 39781 gtgtcttgag agggctgtgt ggtgttagca ggttagaggc agatgctaaa gagacaggcc 39841 tgaccattca ctgacctagc tttcctttcc ctgagcttca gtttcctcac ctcttaaagg 39901 gaaataaaaa tagtatgagt tgtgtgcagc ttaactgaga ggatgcctgt ggagtgttca 39961 gggtgatggc agtggttatt actattaatc tttttctccc cagccagact tgatgcctgg 40021 acttggccac agcttgcttg gtaaacccat ttactgacat attaattgat ccctcttctc 40081 tcaccaggta catggccacc tctggcctag accaccagct gaagatc // LOCUS HSF25B33 2309 bp RNA PRI 19-JUN-1997 DEFINITION H.sapiens mRNA for F25B3.3 kinase like protein from C.elegans. ACCESSION Y12336 NID g2208955 KEYWORDS F25B3.3 gene; kinase-like protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2309) AUTHORS Kedra,D. TITLE Direct Submission JOURNAL Submitted (20-MAR-1997) D. Kedra, Karolinska Hospital, Department Of Molecular Medicine, Building L-6, S-171 76 Stockholm, SWEDEN FEATURES Location/Qualifiers source 1..2309 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11q13" CDS 254..2083 /note="similar to C.elegans F25B3.3 kinase like gene" /codon_start=1 /product="F25B3.3 kinase like protein" /db_xref="PID:e309050" /db_xref="PID:g2208956" /translation="MAGTLDLDKGCTVEELLRGCIEAFDDSGKVRDPQLVRMFLMMHP WYIPSSQLAAKLLHIYQQSRKDNSNSLQVKTCHLVRYWISAFPAEFDLNPELAEQIKE LKALLDQEGNRRHSSLIDIDSVPTYKWKRQVTQRNPVGQKKRKMSLLFDHLEPMELAE HLTYLEYRSFCKILFQDYHSFVTHGCTVDNPVLERFISLFNSVSQWVQLMILSKPTAP QRALVITHFVHVAEKLLQLQNFNTLMAVVGGLSHSSISRLKETHSHVSPETIKLWEGL TELVTATGNYGNYRRRLAACVGFRFPILGVHLKDLVALQLALPDWLDPARTRLNGAKM KQLFSILEELAMVTSLRPPVQANPDLLSLLTVSLDQYQTEDELYQLSLQREPRSKSSP TSPTSCTPPPRPPVLEEWTSAAKPKLDQALVVEHIEKMVESVFRNFDVDGDGHISQEE FQIIRGNFPYLSAFGDLDQNQDGCISREEMVSYFLRSSSVLGGRMGFVHNFQESNSLR PVACRHCKALILGIYKQGLKCRACGVNCHKQCKDRLSVECRRRAQSVSLEGSAPSPSP MHSHHHRAFSFSLPRPGRRGSRPPEIREEEVQTVEDGVFDIHL" variation 2002 /note="c or a" /phenotype="no change of amino acid seq." polyA_signal 2282..2287 BASE COUNT 468 a 728 c 678 g 435 t ORIGIN 1 cgatttcatt cctcgctccc cacaggtccc tctccccaaa atattcccat cttgtcctag 61 cccatccccc agactatctc aaggaccagc tgtccccacg cccccgacct ccactaggcc 121 tgtgccaccc gctgcctgca ggaagacgcc cggtcccggg ccgggttagc cccatgggaa 181 cggggttcgg tccgagcccg gtgggaggct cccggagcgc agcctgggcc cagcccaccc 241 cgcgccggcg gccatggcag gcaccctgga cctggacaag ggctgcacgg tggaggagct 301 gctccgcggg tgcatcgaag ccttcgatga ctccgggaag gtgcgggacc cgcagctggt 361 gcgcatgttc ctcatgatgc acccctggta catcccctcc tctcagctgg cggccaagct 421 gctccacatc taccaacaat cccggaagga caactccaat tccctgcagg tgaaaacgtg 481 ccacctggtc aggtactgga tctccgcctt cccagcggag tttgacttga acccggagtt 541 ggctgagcag atcaaggagc tgaaggctct gctagaccaa gaagggaacc gacggcacag 601 cagcctaatc gacatagaca gcgtccctac ctacaagtgg aagcggcagg tgactcagcg 661 gaaccctgtg ggacagaaaa agcgcaagat gtccctgttg tttgaccacc tggagcccat 721 ggagctggcg gagcatctca cctacttgga gtatcgctcc ttctgcaaga tcctgtttca 781 ggactatcac agtttcgtga ctcatggctg cactgtggac aaccccgtcc tggagcggtt 841 catctccctc ttcaacagcg tctcacagtg ggtgcagctc atgatcctca gcaaacccac 901 agccccgcag cgggccctgg tcatcacaca ctttgtccac gtggcggaga agctgctaca 961 gctgcagaac ttcaacacgc tgatggcagt ggtcgggggc ctgagccaca gctccatctc 1021 ccgcctcaag gagacccaca gccacgttag ccctgagacc atcaagctct gggagggtct 1081 cacggaacta gtgacggcga caggcaacta tggcaactac cggcgtcggc tggcagcctg 1141 tgtgggcttc cgcttcccga tcctgggtgt gcacctcaag gacctggtgg ccctgcagct 1201 ggcactgcct gactggctgg acccagcccg gacccggctc aacggggcca agatgaagca 1261 gctctttagc atcctggagg agctggccat ggtgaccagc ctgcggccac cagtacaggc 1321 caaccccgac ctgctgagcc tgctcacggt gtctctggat cagtatcaga cggaggatga 1381 gctgtaccag ctgtccctgc agcgggagcc gcgctccaag tcctcgccaa ccagccccac 1441 gagttgcacc ccaccacccc ggcccccggt actggaggag tggacctcgg ctgccaaacc 1501 caagctggat caggccctcg tggtggagca catcgagaag atggtggagt ctgtgttccg 1561 gaactttgac gtcgatgggg atggccacat ctcacaggaa gaattccaga tcatccgtgg 1621 gaacttccct tacctcagcg cctttgggga cctcgaccag aaccaggatg gctgcatcag 1681 cagggaggag atggtttcct atttcctgcg ctccagctct gtgttggggg ggcgcatggg 1741 cttcgtacac aacttccagg agagcaactc cttgcgcccc gtcgcctgcc gccactgcaa 1801 agccctgatc ctgggcatct acaagcaggg cctcaaatgc cgagcctgtg gagtgaactg 1861 ccacaagcag tgcaaggatc gcctgtcagt tgagtgtcgg cgcagggccc agagtgtgag 1921 cctggagggg tctgcaccct caccctcacc catgcacagc caccatcacc gcgccttcag 1981 cttctctctg ccccgccctg gcaggcgagg ctccaggcct ccagagatcc gtgaggagga 2041 ggtacagacg gtggaggatg gggtgtttga catccacttg taatagatgc tgtggttgga 2101 tcaaggactc attcctgcct tggagaaaat acttcaacca gagcagggag cctgggggtg 2161 tcggggcagg aggctgggga tgggggtggg atatgagggt ggcatgcagc tgagggcagg 2221 gccagggctg gtgtccctaa ggttgtacag actcttgtga atatttgtat tttccagatg 2281 gaataaaaag gcccgtgtaa ttaaccttc // LOCUS HSFAA 1310 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for fumarylacetoacetase (EC 3.7.1.2). ACCESSION X51728 NID g31290 KEYWORDS fumarylacetoacetase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1310) AUTHORS Agsteribble,E. TITLE Direct Submission JOURNAL Submitted (21-OCT-1988) Agsteribble E., State University Groningen, Laboratory of Physiological Chemistry, Bloemsingel 10, 9713 KZ Groningen, The Netherlands REFERENCE 2 (bases 1 to 1310) AUTHORS Agsteribbe,E., van Faassen,H., Hartog,M.V., Reversma,T., Taanman,J.W., Pannekoek,H., Evers,R.F., Welling,G.M. and Berger,R. TITLE Nucleotide sequence of cDNA encoding human fumarylacetoacetase JOURNAL Nucleic Acids Res. 18 (7), 1887 (1990) MEDLINE 90245581 FEATURES Location/Qualifiers source 1..1310 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /clone_lib="pUC9" CDS 138..1187 /note="fumarylacetoacetase (AA 1-349)" /codon_start=1 /db_xref="PID:g31291" /db_xref="SWISS-PROT:P16930" /translation="MGLGQAAWKEARVFLQNLLSVSQARLRDDTELRKCAFISQASAT MHLPATIGDYTDFYSSRQHATNVGIMFRDKENALMPNWLHLPVGYHGRASSVVVSGTP IRRPMGQMKPDDSKPPVYGACKLLDMELEMAFFVGPGNRLGEPIPISKAHEHIFGMVL MNDWSARDIQKWEYVPLGPFLGKSFGTTVSPWVVPMDALMPFAVPNPKQDPRPLPYLC HDEPYTFDINLSVNLKGEGMSQAATICKSNFKYMYWTMLQQLTHHSVNGCNLRPGDLL ASGTISGPEPENFGSMLELSWKGTKPIDLGNGQTRKFLLDGDEVIITGYCQGDGYRIG FGQCAGKVLPALLPS" polyA_site 1310 /note="polyA attachment site" BASE COUNT 292 a 367 c 350 g 301 t ORIGIN 1 cctcctagcc aagaccgagg ataggtgtgg ccattggcga ccagatcctg gacctcagca 61 tcatcaagca cctctttact ggtcctgtcc tctccaaaca ccaggatgtc ttcaatcagc 121 ctacactcaa cagcttcatg ggcctgggtc aggctgcctg gaaggaggcg agagtgttct 181 tgcagaactt gctgtctgtg agccaagcca ggctcagaga tgacaccgaa cttcggaagt 241 gtgcattcat ctcccaggct tctgccacga tgcaccttcc agccaccata ggagactaca 301 cagacttcta ttcctctcgg cagcatgcta ccaacgtcgg aatcatgttc agggacaagg 361 agaatgcgtt gatgccaaat tggctgcact taccagtggg ctaccatggc cgtgcctcct 421 ctgtcgtggt gtctggcacc ccaatccgaa ggcccatggg acagatgaaa cctgatgact 481 ctaagcctcc cgtatatggt gcctgcaagc tcttggacat ggagctggaa atggcttttt 541 ttgtaggccc tggaaacaga ttgggagagc cgatccccat ttccaaggcc catgagcaca 601 tttttggaat ggtccttatg aacgactgga gtgcacgaga cattcagaag tgggagtatg 661 tccctctcgg gccattcctt gggaagagtt ttgggaccac tgtctctccg tgggtggtgc 721 ccatggatgc tctcatgccc tttgctgtgc ccaacccgaa gcaggacccc aggcccctgc 781 cgtatctgtg ccatgacgag ccctacacat ttgacatcaa cctctctgtt aacctgaaag 841 gagaaggaat gagccaggcg gctaccatat gcaagtccaa ttttaagtac atgtactgga 901 cgatgctgca gcagctcact caccactctg tcaacggctg caacctgcgg ccgggggacc 961 tcctggcttc tgggaccatc agcgggccgg agccagaaaa cttcggctcc atgttggaac 1021 tgtcgtggaa gggaacgaag cccatagacc tggggaatgg tcagaccagg aagtttctgc 1081 tggacgggga tgaagtcatc ataacagggt actgccaggg ggatggttac cgcatcggct 1141 ttggccagtg tgctggaaaa gtgctgcctg ctctcctgcc atcatgagat tttctctgct 1201 cttctggaaa caaagggctc aagcacccct ttcaaccctg tgactggggt cctccctcgg 1261 gctgtaggcc tggtccgcta ttcagtgaca aataaagcca ttgtgctctg // LOCUS HSFACC1 4567 bp RNA PRI 21-NOV-1994 DEFINITION H.sapiens FACC mRNA from complementation group C (FA(C)). ACCESSION X66893 X66184 NID g31294 KEYWORDS DNA-repair disorder. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4566) AUTHORS Strathdee,C.A., Gavish,H., Shannon,W.R. and Buchwald,M. TITLE Cloning of cDNAs for Fanconi's anaemia by functional complementation JOURNAL Nature 356 (6372), 763-767 (1992) MEDLINE 92244337 REMARK Erratum:[Nature 1992 Jul 30;358(6385):434]] REFERENCE 2 (bases 1 to 4567) AUTHORS Strathdee,C.A., Duncan,A.M. and Buchwald,M. TITLE Evidence for at least four Fanconi anaemia genes including FACC on chromosome 9 JOURNAL Nature Genet. 1 (3), 196-198 (1992) MEDLINE 93265102 REMARK (sites) REFERENCE 3 (bases 1 to 4567) AUTHORS Buchwald,M. TITLE Direct Submission JOURNAL Submitted (17-JUN-1992) buchwald m., hospital for sick children, 555 university avenue, toronto, ontario, canada COMMENT . FEATURES Location/Qualifiers source 1..4567 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="lymphoid" /cell_line="hsc93" /sex="Female" 5'UTR 1..255 gene 256..1929 /gene="FACC" CDS 256..1929 /gene="FACC" /codon_start=1 /db_xref="PID:g31295" /translation="MAQDSVDLSCDYQFWMQKLSVWDQASTLETQQDTCLHVAQFQEF LRKMYEALKEMDSNTVIERFPTIGQLLAKACWNPFILAYDESQKILIWCLCCLINKEP QNSGQSKLNSWIQGVLSHILSALRFDKEVALFTQGLGYAPIDYYPGLLKNMVLSLASE LRENHLNGFNTQRRMAPERVASLSRVCVPLITLTDVDPLVEALLICHGREPQEILQPE FFEAVNEAILLKKISLPMSAVVCLWLRHLPSLEKAMLHLFEKLISSERNCLRRIECFI KDSSLPQAACHPAIFRVVDEMFRCALLETDGALEIIATIQVFTQCFVEALEKASKQLR FALKTYFPYTSPSLAMVLLQDPQDIPRGHWLQTLKHISELLREAVEDQTHGSCGGPFE SWFLFIHFGGWAEMVAEQLLMSAAEPPTALLWLLAFYYGPRDGRQRAQTMVQVKAVLG HLLAMSRSSSLSAQDLQTVAGQGTDTDLRAPAQQLIRHLLLNFLLWAPGGHTIAWDVI TLMAHTAEITHEIIGFLDQTLYRWNRLGIESPRSEKLARELLKELRTQV" 3'UTR 1929..4566 polyA_signal 2319..2324 polyA_site 2336 polyA_signal 3121..3126 polyA_site 3234 repeat_region 3355..3477 polyA_signal 4444..4449 polyA_site 4566 BASE COUNT 1080 a 1133 c 1178 g 1176 t ORIGIN 1 actgctgaca cgtgtgcgcg cgcgcggctc cactgccggg cgaccgcggg aaaattccaa 61 aaaaactcaa aaagccaata cgaggcaaag ccaaattttc aagccacaga tcccgggcgg 121 tggcttcctt tccgccactg cccaaactgc tgaagcagct cccgcgagga ccacccgatt 181 taatgtgtgc cgaccatttc cttcagtgct ggacaggctg ctgtgaaggg acatcacctt 241 ttcgcttttt ccaagatggc tcaagattca gtagatcttt cttgtgatta tcagttttgg 301 atgcagaagc tttctgtatg ggatcaggct tccactttgg aaacccagca agacacctgt 361 cttcacgtgg ctcagttcca ggagttccta aggaagatgt atgaagcctt gaaagagatg 421 gattctaata cagtcattga aagattcccc acaattggtc aactgttggc aaaagcttgt 481 tggaatcctt ttattttagc atatgatgaa agccaaaaaa ttctaatatg gtgcttatgt 541 tgtctaatta acaaagaacc acagaattct ggacaatcaa aacttaactc ctggatacag 601 ggtgtattat ctcatatact ttcagcactc agatttgata aagaagttgc tcttttcact 661 caaggtcttg ggtatgcacc tatagattac tatcctggtt tgcttaaaaa tatggtttta 721 tcattagcgt ctgaactcag agagaatcat cttaatggat ttaacactca aaggcgaatg 781 gctcccgagc gagtggcgtc cctgtcacga gtttgtgtcc cacttattac cctgacagat 841 gttgaccccc tggtggaggc tctcctcatc tgtcatggac gtgaacctca ggaaatcctc 901 cagccagagt tctttgaggc tgtaaacgag gccattttgc tgaagaagat ttctctcccc 961 atgtcagctg tagtctgcct ctggcttcgg caccttccca gccttgaaaa agcaatgctg 1021 catctttttg aaaagctaat ctccagtgag agaaattgtc tgagaaggat cgaatgcttt 1081 ataaaagatt catcgctgcc tcaagcagcc tgccaccctg ccatattccg ggttgttgat 1141 gagatgttca ggtgtgcact cctggaaacc gatggggccc tggaaatcat agccactatt 1201 caggtgttta cgcagtgctt tgtagaagct ctggagaaag caagcaagca gctgcggttt 1261 gcactcaaga cctactttcc ttacacttct ccatctcttg ccatggtgct gctgcaagac 1321 cctcaagata tccctcgggg acactggctc cagacactga agcatatttc tgaactgctc 1381 agagaagcag ttgaagacca gactcatggg tcctgcggag gtccctttga gagctggttc 1441 ctgttcattc acttcggagg atgggctgag atggtggcag agcaattact gatgtcggca 1501 gccgaacccc ccacggccct gctgtggctc ttggccttct actacggccc ccgtgatggg 1561 aggcagagag cacagactat ggtccaggtg aaggccgtgc tgggccacct cctggcaatg 1621 tccagaagca gcagcctctc agcccaggac ctgcagacgg tagcaggaca gggcacagac 1681 acagacctca gagctcctgc acaacagctg atcaggcacc ttctcctcaa cttcctgctc 1741 tgggctcctg gaggccacac gatcgcctgg gatgtcatca ccctgatggc tcacactgct 1801 gagataactc acgagatcat tggctttctt gaccagacct tgtacagatg gaatcgtctt 1861 ggcattgaaa gccctagatc agaaaaactg gcccgagagc tccttaaaga gctgcgaact 1921 caagtctaga aggcacgcag gccgtgtggg tgcccggcgt gagggatcag gctcgccagg 1981 gccacaggac aggtgatgac ctgtggccac gcatttgtgg agtaagtgcc ctcgctgggc 2041 tgtgagaatg agctgtacac atcttgggac aatctgctag tatctatttt acaaaatgca 2101 gagccaggtc cctcagccca gactcagtca gacatgttca ctaatgactc aagtgagctt 2161 cggtactcct ggtgcccgcc cggccagacc gtcagcttga taattactaa agcaaaggcc 2221 tgggtgggag aacaggtttc tagtttttac ccaagtcaag ctgcacatct attatttaaa 2281 aattcaaagt cttagaacca agaatttggt catgaaccat taaagaattt agagagaact 2341 tagctctttt tagactcttt ttaggagtca gggatctggg ataaagccac actgtcttgc 2401 tgtatggaga aattcttcaa ggggagtcag ggtccctcag gcttcccttg tgtctccctg 2461 gacctgcctg acaggccaca ggagcagaca gcacacccaa gcccgggcct ccggcacact 2521 ctttccactc tgtatttgct aaatgatgct aactgctacc aaaaggccct tgggacatca 2581 gaggagccgg cagcgaaggt agaggatgtg ttccagaaac attagaaggc aggattaatt 2641 cagttagtta gtctcttgtt aaatggaaat gggaattgga aattcctgat aaagaattgg 2701 cctggctggg tgcagtggct cacacctgtg atcccagcac tttgggaggc caaggcaggg 2761 ggattacttc agcccaggag ttccagactg cctggctaac atggcaatac cctatctcta 2821 ctaaaaatac aaaaattatc ggggtgcaat ggcatgcatc tgtaacccag ctattcaaga 2881 ggctgaggca tgaggatctc ttgaacccgg gaggtgggag ttgtagtgag ccgagatcat 2941 gacactgcac tccagcctgg gcaacagagc gagaccatct cttaaaaaaa ggcattgtta 3001 gtgtaactca aggttaacat ttatttcatg tcagtacagg gtgctttttc ctttcaggga 3061 cattctggaa ttgtattggt tgtacattct tttgtgtcta ttctgtttgt caagtgagtc 3121 aagacttgct tttgtccatt ttgatttgtg tgtattagtc tgagtcttgg ctccgttttg 3181 aggtatgagc aaagttttgc tggatagagt taacctttag ggaaattcct tattttggta 3241 tgtggcaatg ctaatagatc cactgaagat ctggaaaatt ccaggaactt tttcacctga 3301 gcctttcttc tgagaaatgc tgcagtcaga agggtgtgct ggtaaagtat tttggtggca 3361 gctgccatca tggtcattgc cttcatataa catgcttcgt gctcatggtc attgccttca 3421 tataacatgc ttcgtgccat catgatcctt gccttcatat aacaaacatg cttcgtcaga 3481 ggtgttgggg ttgaaaaagg agctgcatgc ttcactggag ttgagggcct ctcctgtctg 3541 actttaagcc agaacttgtg gctgggccat ggaagctgtg actcctctgt ggacatggtg 3601 gcagcaggga acccctagag agaggggcca ctgggaccag gcctcctgtt gtggagggac 3661 tcctgggaca gtcctccacc ctgtcctgtg gtcctgtgta cagggttggc ctcttcctcc 3721 tcccctgcca ggcctctgcc catgcccctt ccttccttct cctgggactg gtgaagctag 3781 gcatctggaa gacttcttcc tagcctggaa gccctgacct cggcccatct gcagaatctc 3841 ccagttcctt cacagctgcc gagtcctctc acgggtgcgg tggaggcggc cttgcggtgg 3901 tgctttctgg gcagccaggg gttcctgggt gggaggactg tccctctggg gacgtggcac 3961 tgaagtgcct gctggcttca tgtggccctt tgccctttcc cagcctgaga gatgctcaaa 4021 ggtggggagc tgggggagcc acccctcggc cattccctcc acctccaaga caggtggcgg 4081 ccgggcaggc actcttaagc ccacctcccc ctcttgttgc cttcgatttc ggcaaagcct 4141 gggcaggtgc caccgggaag gaatggcatc gagatgctgg gcggggacgc ggcgtggcga 4201 gggggcttga cggcgttggc ggggctgggc acaggggcag ccgcagggag gcagggatgg 4261 caaggcgtga agccaccctg gaaggaactg gaccaaggtc ttcagaggtg cgacagggtc 4321 tggaatctga ccttactcta gcaggagttt ttgtagactc tccctgatag tttagttttt 4381 gataaagcat gctggtaaaa ccactaccct cagagagagc caaaaataca gaagaggcgg 4441 agagcgcccc tccaaccagg ctgttattcc cctggactcc gtgacatctg tggaattttt 4501 tagctcttta aaatctgtaa tttgttgtct attttttcat tctaaataaa acttcagttt 4561 gcaccta // LOCUS HSFAN 3380 bp RNA PRI 07-OCT-1996 DEFINITION H.sapiens mRNA for FAN protein. ACCESSION X96586 NID g1556398 KEYWORDS FAN protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3380) AUTHORS Adam-Klages,S., Adam,D., Wiegmann,K., Struve,S., Kolanus,W., Schneider-Mergener,J. and Kronke,M. TITLE FAN, a novel WD-repeat protein, couples the p55 TNF-receptor to neutral sphingomyelinase JOURNAL Cell 86 (6), 937-947 (1996) MEDLINE 96404447 REFERENCE 2 (bases 1 to 3380) AUTHORS Adam-Klages,S. TITLE Direct Submission JOURNAL Submitted (11-MAR-1996) S. Adam-Klages, Institut fuer Immunologie, Christian-Albrechts-Universitaet Kiel, Brunswikerstr. 4, D-24105 Kiel, FRG FEATURES Location/Qualifiers source 1..3380 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="muscle" /clone_lib="human muscle cDNA" CDS 13..2766 /codon_start=1 /product="FAN protein" /db_xref="PID:e229615" /db_xref="PID:g1556399" /translation="MAFIRKKQQEQQLQLYSKERFSLLLLNLEEYYFEQHRANHILHK GSHHERKIRGSLKICSKSVIFEPDSISQPIIKIPLRDCIKIGKHGENGANRHFTKAKS GGISLIFSQVYFIKEHNVVAPYKIERGKMEYVFELDVPGKVEDVVETLLQLHRASCLD KLGDQTAMITAILQSRLARTSFDKNRFQNISEKLHMECKAEMVTPLVTNPGHVCITDT NLYFQPLNGYPKPVVQITLQDVRRIYKRRHGLMPLGLEVFCTEDDLCSDIYLKFYEPQ DRDDLYFYIATYLEHHVAEHTAESYMLQWQRGHLSNYQYLLHLNNLADRSCNDLSQYP VFPWIIHDYSSSELDLSNPGTFRDLSKPVGALNKERLERLLTRYQEMPEPKFMYGSHY SSPGYVLFYLVRIAPEYMLCLQNGRFDNADRMFNSIAETWKNCLDGATDFKELIPEFY GDDVSFLVNSLKLDLGKRQGGQMVDDVELPPWASSPEDFLQKSKDALESNYVSEHLHE WIDLIFGYKQKGSDAVGAHNVFHPLTYEGGVDLNSIQDPDEKVAMLTQILEFGQTPKQ LFVTPHPRRITPKFKSLSQTSSYNASMADSPGEESFEDLTEESKTLAWNNITKLQLHE HYKIHKEAVTGITVSRNGSSVFTTSQDSTLKMFSKESKMLQRSISFSNMALSSCLLLP GDATVITSSWDNNVYFYSIAFGRRQDTLMGHDDAVSKICWHDNRLYSASWDSTVKVWS GVPAEMPGTKRHHFDLLAELEHDVSVDTISLNAASTLLVSGTKEGTVNIWDLTTATLM HQIPCHSGIVCDTAFSPDSRHVLSTGTDGCLNVIDVQTGMLISSMTSDEPQTCFVWDG NSVLSGSQSGELLVWDLLGAKISERIQGHTGAVTCIWMNEQCSSIITGGEDRQIIFWK LQY" BASE COUNT 1023 a 698 c 742 g 917 t ORIGIN 1 gaattcgcct cgatggcgtt tatccggaag aagcagcagg agcagcagct gcagctctac 61 tccaaggaga gattttcctt gctgctgctt aacttggagg agtactactt tgaacagcat 121 agagccaatc acattttgca caagggcagt caccatgaaa ggaaaatcag aggctcctta 181 aaaatatgtt caaaatcggt gatttttgaa ccagattcaa tatcccagcc catcatcaag 241 attcctttga gagactgtat aaaaatagga aagcatggag aaaatggagc caatagacac 301 ttcacaaagg caaaatctgg gggtatttca ctcattttca gtcaggtata tttcattaaa 361 gaacataatg ttgttgcacc atataaaata gaaaggggca aaatggaata tgtttttgaa 421 ttggatgttc ccgggaaagt ggaagatgtt gtggagacgt tgcttcagct tcacagagca 481 tcctgccttg acaaattggg tgaccaaacc gccatgataa cagctatttt gcagtctcgt 541 ttagctagaa catcatttga caaaaacagg ttccaaaaca tttctgaaaa gctgcacatg 601 gaatgcaaag cagaaatggt gacgcctctg gtgactaatc ctggacacgt gtgcatcacg 661 gacacaaacc tgtattttca gcccctcaac ggctacccga aacctgtggt ccagataaca 721 ctccaagatg tccgccgcat ctacaaaagg aggcacggcc tcatgcctct gggcttggaa 781 gtattttgca cagaagatga tctgtgttcc gacatctacc taaagttcta tgaacctcaa 841 gatagagatg atctctattt ttacattgcc acatacctag agcaccatgt ggcggagcac 901 actgctgaga gctacatgct gcagtggcag cgtggacacc tttccaacta tcagtacctc 961 cttcacctca acaacctggc cgaccgcagc tgcaacgacc tctcccagta ccctgtgttt 1021 ccatggataa tacatgatta ttccagctca gaactagatt tgtcaaatcc aggaaccttc 1081 cgggatctca gtaagccagt aggggcccta aataaggaac ggctggagag actactgaca 1141 cgctaccagg aaatgcctga accaaagttc atgtatggga gtcactactc ttccccgggt 1201 tatgtacttt tttatcttgt taggattgca ccagagtata tgctgtgcct gcagaatgga 1261 agatttgata atgcagatag aatgttcaac agtattgcag aaacttggaa aaactgtctg 1321 gatggtgcaa cggattttaa agagttaatt ccagaattct atggtgatga tgtgagcttt 1381 ctagtcaata gcctgaagtt ggatttggga aagagacaag gaggacagat ggttgacgac 1441 gtggagcttc ccccttgggc ttccagtccc gaggactttc tccagaagag caaagatgca 1501 ttggaaagca attatgtgtc tgaacacctt cacgagtgga ttgatctaat atttggctac 1561 aaacaaaaag ggagtgatgc agttggggcc cataatgtat ttcatcccct gacctatgaa 1621 ggaggtgtag acttgaacag catccaggat cctgatgaga aggtagccat gcttacgcaa 1681 atcttggaat ttgggcagac accaaaacaa ctatttgtga caccacatcc tcgaaggatc 1741 accccaaagt ttaaaagttt gtcccagacc tccagttata atgcttctat ggcagattcc 1801 ccaggtgaag agtcttttga agacctgacc gaagaaagca aaacactggc ctggaataac 1861 atcaccaaac tgcagttaca cgagcactat aaaatccaca aagaagcagt tactggaatc 1921 acggtctctc gcaatggatc ttcagtattc acaacatccc aagattccac cttgaagatg 1981 ttttctaaag aatcaaaaat gctacaaaga agtatatcat tttcaaatat ggctttatcg 2041 tcttgtttac ttttaccagg agatgccact gtcataactt cttcatggga taataatgtc 2101 tatttttatt ccatagcatt tggaagacgc caggacacgt taatgggaca tgatgatgct 2161 gttagtaaga tctgttggca tgacaacagg ctatattctg catcgtggga ctctacagtg 2221 aaggtgtggt ctggtgttcc tgcagagatg ccaggcacca aaagacacca ctttgacttg 2281 ctggccgagc tggaacatga tgtcagtgta gatacaatca gtttaaatgc tgcaagcaca 2341 ctgttagttt ccggcaccaa agaaggcaca gtgaatattt gggacctcac aacggccacc 2401 ttaatgcacc agattccatg ccattcaggg attgtatgtg acactgcttt tagcccagat 2461 agtcgccatg tcctcagcac aggaacagat ggctgtctta atgtcattga tgtgcagaca 2521 ggaatgctca tctcctccat gacatcagat gagccccaga cgtgctttgt ctgggatgga 2581 aattccgttt tatctggcag tcagtctggt gaactgctcg tttgggacct ccttggagca 2641 aaaatcagtg agagaataca gggccacaca ggtgctgtga catgtatatg gatgaatgaa 2701 cagtgtagca gtatcatcac aggaggggaa gacagacaaa ttatattctg gaaattgcag 2761 tattaagtgc cttttcctct cctgaatatt aaattgaact ctatttaatg catttttaaa 2821 ccaaactttt aaacggactg gtgaatgtgc aatgttagta attagaagtt ttaccacatg 2881 gaaaatttgt ggttttaaac tttctaaatc atggtgactt cattgaaagc cattagttgc 2941 cattctctta gggcagataa aatgcggctg tgttaggaaa aacatgttac actgtaaggc 3001 agatgatcgt ccccgtatga tgattgtcag aagacaggac taagtagcag agaatagcta 3061 agagataaat tgggctgggg aaacttgtca gaaagcactg aacaattaag aaattttcca 3121 agaaaatgtg cagtattctc tgctacttct gaatctgttt tgtcttccta atctatcaca 3181 attgccaccc atcgggtttt gggtgtgtgt tttcatagcg tggttacttt ctataatgct 3241 gtacccagat tctaagaacc tggagaagga ttagcagttc ttagtaagtt tactgtgtat 3301 aggaacggtt tgtatttcat tacagctatt catcttttct acattaaaaa tatttttctc 3361 taaaaaaaaa aaaaaaaaaa // LOCUS HSFASLIGA 1909 bp RNA PRI 06-JUL-1995 DEFINITION H.sapiens mRNA for fasligand. ACCESSION X89102 NID g887455 KEYWORDS Fasligand. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1909) AUTHORS Schatzlein,C.E. TITLE Direct Submission JOURNAL Submitted (23-JUN-1995) C.E. Schatzlein, Klinische Forsch. f. Rheumatologie, Klinikum der Albert-Ludwigs-, Universitaet Freiburg, Breisacherstr. 64, 79106 Freiburg, FRG REFERENCE 2 (bases 1 to 1909) AUTHORS Schaetzlein,C.E., Poehlmann,R., Philippsen,P. and Eibel,H. JOURNAL Unpublished COMMENT Related sequences: U08137, D38122 and U11821. FEATURES Location/Qualifiers source 1..1909 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="PBL" /cell_line="Human PBL" misc_signal 19..33 /note="regulatory sequence" gene 158..1003 /gene="fasligand" CDS 158..1003 /gene="fasligand" /codon_start=1 /product="Fasligand" /db_xref="PID:g887456" /db_xref="SWISS-PROT:P48023" /translation="MQQPFNYPYPQIYWVDSSASSPWAPPGTVLPCPTSVPRRPGQRR PPPPPPPPPLPPPPPPPPLPPLPLPPLKKRGNHSTGLCLLVMFFMVLVALVGLGLGMF QLFHLQKELAELRESTSQMHTASSLEKQIGHPSPPPEKKELRKVAHLTGKSNSRSMPL EWEDTYGIVLLSGVKYKKGGLVINETGLYFVYSKVYFRGQSCNNLPLSHKVYMRNSKY PQDLVMMEGKMMSYCTTGQMWARSSYLGAVFNLTSADHLYVNVSELSLVNFEESQTFF GLYKL" polyA_signal 1788..1793 polyA_signal 1834..1839 BASE COUNT 558 a 424 c 450 g 477 t ORIGIN 1 gaggtgtttc ccttagctat ggaaactcta taagagagat ccagcttgcc tcctcttgag 61 cagtcagcaa cagggtcccg tccttgacac ctcagcctct acaggactga gaagaagtaa 121 aaccgtttgc tggggctggc ctgactcacc agctgccatg cagcagccct tcaattaccc 181 atatccccag atctactggg tggacagcag tgccagctct ccctgggccc ctccaggcac 241 agttcttccc tgtccaacct ctgtgcccag aaggcctggt caaaggaggc caccaccacc 301 accgccaccg ccaccactac cacctccgcc gccgccgcca ccactgcctc cactaccgct 361 gccacccctg aagaagagag ggaaccacag cacaggcctg tgtctccttg tgatgttttt 421 catggttctg gttgccttgg taggattggg cctggggatg tttcagctct tccacctaca 481 gaaggagctg gcagaactcc gagagtctac cagccagatg cacacagcat catctttgga 541 gaagcaaata ggccacccca gtccaccccc tgaaaaaaag gagctgagga aagtggccca 601 tttaacaggc aagtccaact caaggtccat gcctctggaa tgggaagaca cctatggaat 661 tgtcctgctt tctggagtga agtataagaa gggtggcctt gtgatcaatg aaactgggct 721 gtactttgta tattccaaag tatacttccg gggtcaatct tgcaacaacc tgcccctgag 781 ccacaaggtc tacatgagga actctaagta tccccaggat ctggtgatga tggaggggaa 841 gatgatgagc tactgcacta ctgggcagat gtgggcccgc agcagctacc tgggggcagt 901 gttcaatctt accagtgctg atcatttata tgtcaacgta tctgagctct ctctggtcaa 961 ttttgaggaa tctcagacgt ttttcggctt atataagctc taagagaagc actttgggat 1021 tctttccatt atgattcttt gttacaggca ccgagaatgt tgtattcagt gagggtcttc 1081 ttacatgcat ttgaggtcaa gtaagaagac atgaaccaag tggaccttga gaccacaggg 1141 ttcaaaatgt ctgtagctcc tcaactcacc taatgtttat gagccagaca aatggaggaa 1201 tatgacggaa gaacatagaa ctctgggctg ccatgtgaag agggagaagc atgaaaaagc 1261 agctaccagg tgttctacac tcatcttagt gcctgagagt atttaggcag attgaaaagg 1321 acacctttta actcacctct caaggtgggc cttgctacct caagggggac tgtctttcag 1381 atacatggtt gtgacctgag gatttaaggg atggaaaagg aagactagag gcttgcataa 1441 taagctaaag aggctgaaag aggccaatgc cccactggca gcatcttcac ttctaaatgc 1501 atatcctgag ccatcggtga aactaacaga taagcaagag agatgttttg gggactcatt 1561 tcattcctaa cacagcatgt gtatttccag tgcaattgta ggggtgtgtg tgtgtgtgtg 1621 tgtgtgtgtg tgtgtatgac taaagagaga atgtagatat tgtgaagtac atattaggaa 1681 aatatgggtt gcatttggtc aagattttga atgcttcctg acaatcaact ctaatagtgc 1741 ttaaaaatca ttgattgtca gctactaatg atgttttcct ataatataat aaatatttat 1801 gtagatgtgc atttttgtga aatgaaaaca tgtaataaaa agtatatgtt aggatacaaa 1861 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaa // LOCUS HSFASTKIN 1778 bp RNA PRI 24-OCT-1995 DEFINITION H.sapiens mRNA for FAST kinase. ACCESSION X86779 NID g1006658 KEYWORDS fast gene; FAST kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1778) AUTHORS Tian,Q., Taupin,J., Elledge,S., Robertson,M. and Anderson,P. TITLE Fas-activated serine/threonine kinase (FAST) phosphorylates TIA-1 during Fas-mediated apoptosis JOURNAL J. Exp. Med. 182 (3), 865-874 (1995) MEDLINE 95378805 REFERENCE 2 (bases 1 to 1778) AUTHORS Anderson,P.J. TITLE Direct Submission JOURNAL Submitted (06-JUN-1995) P.J. Anderson, Dana-Farber Cancer Institute, Division of Tumor Immunology, M748, 44 Binney Street, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..1778 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="peripheral blood lymphocyte" /chromosome="7" /map="q35" gene 22..1671 /gene="fast" CDS 22..1671 /gene="fast" /codon_start=1 /product="FAST kinase" /db_xref="PID:g1006659" /translation="MRRPRGEPGPRAPRPTEGATCAGPGESWSPSPNSMLRVLLSAQT SPARLSGLLLIPPVQPCCLGPSKWGDRPVGGGPSAGPVQGLQRLLEQAKSPGELLRWL GQNPSKVRAHHYSVALRRLGQLLGSRPRPPPVEQVTLQDLSQLIIRNCPSFDIHTIHV CLHLAVLLGFPSDGPLVCALEQERRLRLPPKPPPPLQPLLRGGQGLEAALSCPRFLRY PRQHLISSLAEARPEELTPHVMVLLAQHLARHRLREPQLLEAIAHFLVVQETQLSSKV VQKLVLPFGRLNYLPLEQQFMPCLERILAREAGVAPLATVNILMSLCQLRCLPFRALH FVFSPGFINYISGTPHALIVRRYLSLLDTAVELELPGYRGPRLPRRQQVPIFPQPLIT DRARCKYSHKDIVAEGLRQLLGEEKYRQDLTVPPGYCTDFLLCASSSGAVLPVRTQDP FLPYPPRSCPQGQAASSATTRDPAQRVVLVLRERWHFCRDGRVLLGSRALRERHLGLM GYQLLPLPFEELESQRGLPQLKSYLRQKLQALGLRWGPEGG" BASE COUNT 300 a 582 c 537 g 359 t ORIGIN 1 ggcggactcg gtggctagcc gatgaggagg ccgcgggggg aacccggccc ccgggccccg 61 agaccgactg agggagcgac ctgcgcaggg cccggggagt catggtctcc atcacccaac 121 tccatgcttc gagtcctgct ctctgctcag acctcccctg ctcggctgtc tggcctgctg 181 ctgatccctc cagtacagcc ctgctgttta gggcccagca aatgggggga ccggcctgtt 241 ggaggaggcc ccagtgcagg tcctgtgcaa ggactgcagc ggcttctgga acaggcgaag 301 agccctgggg agctgctgcg ctggctgggc cagaacccca gcaaggtgcg cgcccaccac 361 tactcggtgg cgcttcgtcg tctgggccag ctcttggggt ctcggccacg gccccctcct 421 gtggagcagg tcacactgca ggacttgagt cagctcatca tccgaaactg cccctccttt 481 gacattcaca ccatccacgt gtgtctgcac cttgcagtct tacttggctt tccatctgat 541 ggtcccctgg tgtgtgccct ggaacaggag cgaaggctcc gcctccctcc gaagccacct 601 ccccctttgc agccccttct ccgaggtggg caagggttgg aagctgctct aagctgcccc 661 cgttttctgc ggtatccacg gcagcatctg atcagcagcc tggcagaggc aaggccagag 721 gaactgactc cccacgtgat ggtgctcctg gcccagcacc tggcccggca ccggttgcgg 781 gagccccagc ttctggaagc cattgcccac ttcctggtgg ttcaggaaac gcaactcagc 841 agcaaggtgg tacagaagtt ggtcctgccc tttgggcgac tgaactacct gcccctggaa 901 cagcagttta tgccctgcct tgagaggatc ctggctcggg aagcaggggt ggcacccctg 961 gctacagtca acatcttgat gtcactgtgc caactgcggt gcctgccctt cagagccctg 1021 cactttgttt tttcccctgg cttcatcaac tacatcagtg gcacccctca tgctctgatt 1081 gtgcgtcgct acctctccct gctggacacg gccgtggagc tggagctccc aggataccgg 1141 ggtccccgcc ttccccgaag gcagcaagtg cccatctttc cccagcctct catcaccgac 1201 cgtgcccgct gcaagtacag tcacaaggac atagtagctg aggggttgcg ccagctgctg 1261 ggggaggaga aataccgcca ggacctgact gtgcctccag gctactgcac agacttcctg 1321 ctgtgcgcca gcagctctgg tgctgtgctt cccgtgagga cccaggaccc cttcctgcca 1381 tacccaccaa ggtcctgccc acagggccag gctgcctcta gcgccactac tcgagaccct 1441 gcccagaggg tggtgctggt gttgcgggaa cgctggcatt tctgccggga cggccgggtg 1501 ctgctgggct cgagggccct gagggagcgg cacctaggcc tgatgggcta ccagctcctg 1561 ccgctaccct tcgaggaact ggagtcccag agaggcctgc cccagctcaa gagctacctg 1621 aggcagaagc tccaagccct gggcctgcgc tgggggcctg aagggggctg aggggatgat 1681 gtggggttca ggatggcccc cccatggggg gtggatgatt tgcactttgg ttccctgtgt 1741 tttgatttct cattaaagtt cctggccttc aaaaaaaa // LOCUS HSFB19 4528 bp RNA PRI 21-MAY-1997 DEFINITION Homo sapiens fb19 mRNA. ACCESSION Y13247 NID g2117158 KEYWORDS fb19 gene; FB19 protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4528) AUTHORS Gasparini,P. JOURNAL Unpublished REFERENCE 2 (bases 1 to 4528) AUTHORS Gasparini,P. TITLE Direct Submission JOURNAL Submitted (19-MAY-1997) P. Gasparini, Medical Genetics Service Hospital, I.R.C.C.S. 'CASA Soll. Sofferenza', Viale Cappuccini, 71013 San Giovanni Rotondo, Foggia, ITALY COMMENT Related sequence X90535. FEATURES Location/Qualifiers source 1..4528 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="foetus" /tissue_type="brain" /clone_lib="fetal brain and CaCO2" /chromosome="6" /map="p21.3" gene 540..3362 /gene="fb19" CDS 540..3362 /gene="fb19" /codon_start=1 /product="FB19 protein" /db_xref="PID:e317257" /db_xref="PID:g2117159" /translation="MGSGPIDPKELLKGLDSFLNRDGEVKSVDGISKIFSLMKEARKM VSRCTYLNILLQTRSPEILVKFIDVGGYKLLNNWLTYSKTTNNIPLLQQILLTLQHLP LTVDHLKQNNTAKLVKQLSKSSEDEELRKLASVLVSDWMAVIRSQSSTQPAEKDKKKR KDEGKSRTTLPERPLTEVKAETRAEEAPEKKREKPKSLRTTAPSHAKFRSTGLELETP SLVPVKENASTVVVSDKYNLKPIPLKRQSNVAAPGDATPPAEKKYKPLNTTPNATKEI KVKIIPPQPMEGLGFLDALNSAPVPGIKIKKKKKVLSPTAAKPSPFEGKTSTEPSTAK PSSPEPAPPSEAMDADRPGTPVPPVEVPELMDTASLEPGALDAKPVESPGDPNQLTRK GRKRKSVTWPEEGKLREYFYFELDETERVNVNKIKDFGEAAKREILSDRHAFETARRL SHDNMEEKVPWVCPRPLVLPSPLVTPGSNSQERYIQAEREKGILQELFLNKESPHEPD PEPYEPIPPKLIPLDEECSMDETPYVETLEPGGSGGSPDGAGGSKLPPVLANLMGSMG AGKGPQGPGGGGINVQEILTSIMGSPNSHPSEELLKQPDYSDKIKQMLVPHGLLGPGP IANGFPPGGPGGPKGMQHFPPGPGGPMPGPHGGPGGPVGPRLLGPPPPPRGGDPFWDG PGDPMRGGPMRGGPGPGPGPYHRGRGGRGGNEPPPPPPPFRGARGGRSGGGPPNGRGG PGGGMVGGGGHRPHEGPGGGMGNSSGHRPHEGPGGGMGSGHRPHEGPGGSMGGGGGHR PHEGPGGGISGGSGHRPHEGPGGGMGAGGGHRPHEGPGGSMGGSGGHRPHEGPGHGGP HGHRLHDVPGHRGHDHRGQPPHEHRGHDGPGHGGGGHRGHDGGHSHGGDMSNRPVCRH FMMKGNCRYENNCAFYHPGVNGPPLP" BASE COUNT 1108 a 1246 c 1196 g 978 t ORIGIN 1 gtttagaggt ttgaattttc tcggagaaag acaggccggc cacgaggaaa acagaaacaa 61 gccgcagcaa catctaagcc cttgaaagga tcctgagaga ggggggaaag ggaaaacagc 121 agccaccagc ccaaccactt gtgtcttctg ccccttccca cctatcttgc ccaccccacc 181 agcccacgct gcttgggact tgaaatctgt ggccgaagga ccgtcactac ataacttcaa 241 aaataatcaa ccaccctccc ttcccaaacc acccaaattc actcatccag cgtttacttt 301 tttgaatcca ctcagaactt ttttctgcga cccccctccc taaatggagt tgggtggggg 361 ggaaatgaat actgagttgg cctttatttt taaaagactt tttgatccaa tgaggccccc 421 tatataattg agttttgggt cctggttggt tgttttattt tttttcctcc aaaattttac 481 cccctccccc ctgagcccga ggtgctgacg tcgcaaaaaa attggataaa accaccatca 541 tgggttcggg tcccatagac cccaaagaac ttctcaaggg cctggacagc ttccttaacc 601 gagatgggga agtcaaaagt gtggatggga tttccaagat cttcagtttg atgaaggaag 661 cacgaaagat ggtgagtcga tgcacttact tgaacattct cctgcagacc cgttcaccag 721 aaatattggt caaatttatt gacgttggcg gctacaaact tcttaacaat tggctgacgt 781 attcaaagac aaccaacaac attcccctcc tccagcaaat tctactgacc ctgcagcatt 841 taccgctcac tgtagaccat ctcaagcaga acaacacagc taaactggtg aagcagctga 901 gcaagtcaag tgaggatgaa gagctccgga aattggcctc agtccttgtc agcgactgga 961 tggctgtcat ccgctctcag agcagtaccc agcctgctga gaaagataag aagaaacgta 1021 aagatgaagg aaaaagtcga actacccttc ctgagcgacc tttgacagag gtgaaggctg 1081 agacccgggc tgaggaggcc ccagagaaga agagggagaa gcccaagtct cttcgcacca 1141 cagcacccag tcatgccaag ttccgttcca ctggactaga gctggagaca ccatccttgg 1201 tgcctgtgaa ggagaatgcc agcacagtgg tggtttctga caagtacaac cttaaaccca 1261 tccccctcaa acgtcagagc aacgtagctg ctccaggaga tgccactccc cctgcagaga 1321 agaaatacaa gccactcaac acaacaccta atgccaccaa agagatcaaa gtgaagatca 1381 tcccgccaca gcctatggag ggcctgggct ttctggatgc tcttaattca gcccctgttc 1441 caggcatcaa aattaagaag aaaaaaaaag tactgtcacc tacggctgcc aagccaagcc 1501 cctttgaagg gaaaacgagc acagaaccaa gcacagccaa accttcttcc ccagaaccag 1561 caccaccttc tgaggcaatg gacgcagacc gtccaggcac cccggttccc cctgttgaag 1621 tcccggagct catggataca gcctctttgg agccaggagc tctggatgcc aagccagtgg 1681 agagtcctgg agatcctaac caactgaccc ggaaaggcag gaagaggaaa agtgtgacat 1741 ggcctgagga aggcaaactg agagaatatt tctattttga attggatgaa actgaacgag 1801 taaatgtgaa taagatcaag gactttggtg aggcggctaa gcgagagata ctgtcagacc 1861 gacatgcatt tgagacagcg cggcgtctga gccatgataa catggaggag aaggtgccct 1921 gggtgtgccc ccggcccctg gttctgccct cacctcttgt cacccctgga agcaatagtc 1981 aggagcgata tatccaggct gagcgggaga agggaatcct tcaggagctc ttcctgaaca 2041 aggagagtcc tcatgagcct gatcctgagc cctacgagcc cataccccct aaactcatcc 2101 ccctagatga ggagtgttcc atggatgaga ctccgtatgt tgagactctg gaacctgggg 2161 ggtcaggtgg ctcacctgat ggggcaggag gctccaagtt gcctccagtt ctggccaatc 2221 ttatgggaag catgggtgct ggaaagggcc cccaaggccc tggaggagga ggcattaatg 2281 tccaagagat cctcacctcc atcatgggta gcccaaacag tcatccttca gaggaactac 2341 tgaaacaacc agactattcg gacaagatca agcagatgct ggtgccacat ggactcctag 2401 gccctggccc aatagccaat ggtttcccac cagggggtcc tgggggcccc aagggcatgc 2461 agcactttcc ccctggacct gggggaccta tgccaggtcc ccatggaggc cctggtgggc 2521 cagtgggtcc acgtcttctg ggtcctccac cccctccccg gggaggtgat cccttctggg 2581 atggcccggg cgaccctatg cggggtggcc caatgcgggg gggtccagga ccaggtcctg 2641 gaccatacca tagaggccga ggtggccgag gaggaaacga acctcctcct cctcctcctc 2701 cattccgagg cgccagagga ggtcgctctg gaggaggacc cccaaatgga cgagggggcc 2761 ctggtggggg catggttgga ggtggtgggc atcgtcctca cgaaggccct ggtgggggca 2821 tgggcaacag cagtggacat cgtccccacg aaggccctgg cggtggcatg ggaagtgggc 2881 atcgccccca tgaaggccct ggtggtagca tgggtggggg tggaggacat cgtccccacg 2941 aaggccctgg cggtggcatc agtggtggca gtggccatcg tccccatgaa ggccctggcg 3001 gaggaatggg tgccggtggt ggacatcgcc cccacgaagg ccctggcgga agcatgggtg 3061 gaagtggtgg acatcgtccc catgaaggcc ctggacacgg ggggccccat ggccaccggc 3121 ttcatgatgt ccctggtcac cgaggccatg accatcgagg gcagccacct catgagcacc 3181 gtggccatga tggtcctggc cacgggggag ggggccaccg agggcacgat ggaggccaca 3241 gccatggagg agacatgtca aaccgccctg tctgccgaca tttcatgatg aagggcaact 3301 gccgctatga gaacaactgt gccttctacc acccgggtgt caatgggccc cccctgccct 3361 agggaccatt tgcctgccct gttcacacaa cccctgtgga ctgcagcctc gctctttcca 3421 ccctgttatg gcttctgtga ggcccatttt cccttttccc cagctgatga ggagccggcc 3481 ccctcagttc ccacttgctt gggttcctgg gggttttctg atcactggtg cgcattgatg 3541 tacatatttt cctccagtct ggggaggaga gagactggaa acgttcctgg actgctgaag 3601 aggagaccca gttggcttca ctttttgaga agattcgccc tgtaccccaa acccctttcc 3661 agtattaccc ttaatgcttg agaacctaaa gctggttatc ctggcgaaca cccctaccct 3721 tctattgcgg gtccccacat gcacacagaa ctctgacaca ggatcagctg cacttaagaa 3781 atcatcccag ctaagttcat tattcctcat ggggtgggga gatgctgaaa ggggtattgt 3841 atatcccact gcactgagag ggctcaatca gctggatttg agttctggaa cacacatcat 3901 ccccacccct cccccagcgt gggctcacca ttcttagtcc tttctcaagt gggaccttca 3961 actttctgtg aacacccagt ctgcgtcctg ggtctgctag gttcgatgat ggcgaactcg 4021 tatctgcatc cggtgcaagt tttagctggc agaggtgaga ccggtggtgc tggtctgcct 4081 ttgccaacta tagccagtct ggagacttga taaaatactt cagtgagacc agcttctcat 4141 caacttgggc ccggcgtgct gggcctgaaa gtcacactac atgcactgcc tttgggagtc 4201 agctcactcc ctgctcccac ctggaacctt gccagcgtga aggaggcttc caggtacttc 4261 accctgtcaa ccacctctga atccccacca ggcgccttcc tgggtggatt caacaagatg 4321 attttgccct ttcccagttc tctccttcac tttggcatca gttgttttct atgaaaacag 4381 tggattggtt gggttttgtg cagggtcttg ggttagagcc aaaatggatt tgaggatgag 4441 tatttttttt tttggttttg tatattttgt acattaataa taaacagtgg aaagagaaag 4501 cagcttaaaa aaaaaaaaaa aaaaaaaa // LOCUS HSFBLN2 4139 bp RNA PRI 28-NOV-1994 DEFINITION H.sapiens mRNA for fibulin-2. ACCESSION X82494 NID g575232 KEYWORDS FBLN2 gene; fibulin-2. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4139) AUTHORS Chu,M.L. TITLE Direct Submission JOURNAL Submitted (03-NOV-1994) M.L. Chu, Thomas Jefferson University, Dept of Biochemistry & Molec. Biology, 233 South 10th Street, Philadelphia, PA 19107, USA REFERENCE 2 (bases 1 to 4139) AUTHORS Zhang,R.Z., Pan,T.C., Zhang,Z.Y., Mattei,M.G., Timpl,R. and Chu,M.L. TITLE Fibulin-2 (FBLN2): human cDNA sequence, mRNA expression, and mapping of the gene on human and mouse chromosomes JOURNAL Genomics 22 (2), 425-430 (1994) MEDLINE 95104855 FEATURES Location/Qualifiers source 1..4139 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="cDNA HK9" /chromosome="3" /map="p24-25" /cell_line="diploid dermal fibroblasts" mRNA 1..4139 /gene="FBLN2" /evidence=experimental gene 1..4139 /gene="FBLN2" sig_peptide 70..150 /gene="FBLN2" /evidence=experimental CDS 70..3624 /gene="FBLN2" /codon_start=1 /evidence=experimental /product="fibulin-2" /db_xref="PID:g575233" /db_xref="SWISS-PROT:P98095" /translation="MVLLWEPAGAWLALGLALALGPSVAAAAPRQDCTGVECPPLENC IEEALEPGACCATCVQQGCACEGYQYYDCLQGGFVRGRVPAGQSYFVDFGSTECSCPP GGGKISCQFMLCPELPPNCIEAVVVADSCPQCGQVGCVHAGHEYAAGHTVHLPPCRAC HCPDAGGELICYQLPGCHGNFSDAEEGDPERHYEDPYSYDQEVAEVEAATALGGEVQA GAVQAGAGGPPAALGGGSQPLSTIQAPPWPAVLPRPTAAAALGPPAPVQAKARRVTED SEEEEEEEEEREEMAVTEQLAAGGHRGLDGLPTTAPAGPSLPIQEERAEAGARAEAGA RPEENLILDAQATSRSTGPEGVTHAPSLGKAALVPTQAVPGSPRDPVKPSPHNILSTS LPDAAWIPPTREVPRKPQVLPHSHVEEDTDPNSVHSIPRSSPEGSTKDLIETCCAAGQ QWAIDNDECLEIPESGTEDNVCRTAQRHCCVSYLQEKSCMAGVLGAKEGETCGAEDND SCGISLYKQCCDCCGLGLRVRAEGQSCESNPNLGYPCNHVMLSCCEGEEPLIVPEVRR PPEPAAAPRRVSEAEMAGREALSLGTEAELPNSLPGDDQDECLLLPGELCQHLCINTV GSYHCACFPGFSLQDDGRTCRPEGHPPQPEAPQEPALKSEFSQVASNTIPLPLPQPNT CKDNGPCKQVCSTVGGSAICSCFPGYAIMADGVSCEDINECVTDLHTCSRGEHCVNTL GSFHCYKALTCEPGYALKDGECEDVDECAMGTHTCQPGFLCQNTKGSFYCQARQRCMD GFLQDPEGNCVDINECTSLSEPCRPGFSCINTVGSYTCQRNPLICARGYHASDDGAKC VDVNECETGVHRCGEGQVCHNLPGSYRCDCKAGFQRDAFGRGCIDVNECWASPGRLCQ HTCENTLGSYRCSCASGFLLAADGKRCEDVNECEAQRCSQECANIYGSYQCYCRQGYQ LAEDGHTCTDIDECAQGAGILCTFRCLNVPGSYQCACPEQGYTMTANGRSCKDVDECA LGTHNCSEAETCHNIQGSFRCLRFECPPNYVQVSKTKCERTTCHDFLECQNSPARITH YQLNFQTGLLVPAHIFRIGPAPAFTGDTIALNIIKGNEEGYFGTRRLNAYTGVVYLQR AVLEPRDFALDVEMKLWRQGSVTTFLAKMHIFFTTFAL" BASE COUNT 816 a 1277 c 1279 g 767 t ORIGIN 1 tcgcggccgc cgagcgcagt gccccgcggg tcttacagga gaggggaccg tcctgggctg 61 gcctggacca tggtgctgct ctgggagcct gcaggagcct ggcttgctct gggcctggcc 121 ctggccctgg gccccagcgt ggccgcagct gcccctcggc aggactgcac gggcgtggag 181 tgcccgccgc tggagaactg cattgaggag gcgctggagc cgggtgcctg ctgtgccacg 241 tgtgtgcagc agggctgcgc ctgcgagggc taccagtact atgactgcct acagggtggc 301 ttcgtgcgcg gccgcgtgcc cgccggtcag tcctattttg tggacttcgg gagcactgag 361 tgctcctgcc caccaggcgg cggcaagatc agctgccagt tcatgctgtg cccggagctg 421 ccgcccaact gcatcgaggc tgtagtggtg gctgacagct gcccacagtg cggccaggtg 481 ggctgcgtcc acgcgggcca cgagtacgcc gctggccaca ctgttcacct gccgccctgc 541 cgggcctgcc actgccctga cgccggtgga gagctcatct gctaccagct ccccggttgc 601 cacgggaact tctcagatgc cgaggagggt gaccccgagc gacactacga agacccctac 661 agctatgacc aggaggtggc cgaggtggaa gcagcaacag ccctgggggg tgaggtccag 721 gcgggtgcag tccaggcagg cgcagggggc cccccagctg ctctgggagg tgggagtcag 781 ccactgtcca ccatccaggc acccccctgg ccagctgtcc tccccaggcc cacagcggct 841 gctgccctgg gtcccccagc cccagtgcag gccaaagcta ggagagtgac cgaggacagt 901 gaggaggaag aagaggagga ggaggagaga gaggaaatgg ctgtcactga gcagctggca 961 gcaggtggcc acagggggct ggatgggctg cccactacag ccccagctgg acccagtctt 1021 cctatccagg aggagagggc agaagctggg gcaagggcag aagctggggc aaggcctgaa 1081 gagaacctca tcctggatgc ccaagccacg tcccgcagca ctgggccgga gggcgtgacg 1141 catgcaccga gcctgggcaa ggctgctctc gtcccaactc aggccgtgcc tggctctccc 1201 agggacccag tcaagcccag cccccacaac atcctgtcca catcactgcc tgatgcagcc 1261 tggatcccac ccacccgaga agtgcccagg aagccgcaag ttctgcccca ttcccacgtg 1321 gaggaggaca cagaccccaa ctctgtccat tctatcccca gaagtagccc tgaaggctcc 1381 accaaggacc tgatcgagac ttgctgcgca gccggacagc agtgggccat tgacaatgac 1441 gagtgcctgg agatccctga gagtggcact gaggacaacg tctgcaggac agcccagagg 1501 cactgctgtg tctcctactt gcaggagaag agctgcatgg ccggcgtcct gggagccaag 1561 gagggtgaga cctgtggggc tgaggacaac gacagctgcg gcatctccct gtacaagcaa 1621 tgctgtgact gctgtggcct gggcctccgc gtgcgggccg agggccagtc gtgtgagtcc 1681 aatcctaacc tgggctatcc ctgcaatcat gtcatgctct cctgctgtga gggtgaagag 1741 cctctcatag tacctgaggt tcgccgacct ccagagcccg cagctgcacc acggagagtt 1801 tcagaggcag agatggcggg ccgagaggcc ctgtcactgg gcacagaggc cgagctgccg 1861 aacagcctgc cgggcgatga ccaggatgag tgccttctcc tcccgggaga gctgtgccag 1921 cacctttgca tcaatactgt gggttcttac cactgtgcct gctttcctgg cttctcactg 1981 caggacgatg gccgcacttg ccgcccagag ggtcaccctc cacagccgga agccccacag 2041 gagcctgcac tgaagtcaga attttcccag gtggcctcta acaccatccc gctgccactg 2101 ccgcagccca atacctgcaa agacaatgga ccctgcaagc aggtgtgcag cactgttggg 2161 ggctcagcca tatgctcctg ttttcccggc tatgccatca tggcggatgg cgtgtcctgt 2221 gaagacatca acgagtgtgt gacggacctg cacacgtgca gccggggcga gcactgtgtg 2281 aacacactgg gctccttcca ctgctacaag gcactcacct gtgagccagg ctatgccctc 2341 aaggatggcg agtgcgaaga cgtggatgag tgtgcgatgg gcacgcacac ctgccagccg 2401 ggcttcttgt gccagaacac caagggctcc ttctactgcc aggccaggca gcgctgcatg 2461 gatggcttcc tgcaggatcc tgaaggcaac tgtgtggaca tcaacgagtg cacgtcactg 2521 tccgagccat gtcggccagg cttcagctgc atcaacacgg tgggctccta cacgtgccag 2581 aggaacccgc tgatctgcgc gcgcggctac cacgccagcg atgatggggc caagtgtgtg 2641 gacgtgaatg agtgtgagac aggtgtgcac cgctgcggtg agggccaagt gtgccacaac 2701 ctccctggct cctaccgctg tgactgcaaa gccggctttc agcgggatgc cttcggccgg 2761 ggctgcatcg acgtgaatga gtgctgggcc tcgccaggcc gcctgtgcca gcacacgtgt 2821 gagaacacac tcggctccta ccgctgttcc tgcgcctccg ggttcctgct agcagcggac 2881 ggcaagcgct gtgaagacgt gaatgagtgt gaggcccagc gctgcagcca ggagtgtgcc 2941 aacatctatg gctcctacca gtgctactgc cgccagggct accagctggc tgaggatggg 3001 cacacctgca cagacatcga cgagtgtgct caaggcgccg gcatcctctg caccttccgc 3061 tgtctcaacg tgccagggag ctaccagtgt gcatgccctg agcagggcta caccatgacg 3121 gccaacggga ggtcctgcaa ggacgtggat gagtgtgcac tgggcaccca caactgttcc 3181 gaggctgaga cctgccacaa catccagggt agcttccgct gcctgcgctt cgagtgtcct 3241 cccaactatg tccaagtctc caaaacgaag tgcgagcgca ccacgtgcca tgacttcctg 3301 gagtgccaga actcgccagc gcgcatcacg cactaccagc tcaacttcca gacgggcctc 3361 ctggtgcctg cgcatatctt ccgcattggc cccgcgccag ccttcacggg ggacaccatc 3421 gccctgaaca tcatcaaggg caatgaggag ggctactttg gcacgcgcag gctcaatgcc 3481 tacacgggtg tggtctacct gcagcgggcc gtgctggagc cccgggactt tgccctggac 3541 gtggagatga agctctggag gcagggctcc gtcaccacct tcctggccaa gatgcacatc 3601 ttcttcacca cctttgccct gtgaggtgcc agcacgggcc acctgcgggt gtggcgcagc 3661 cagggctcac actgcgtggg agggactggg tcactattgt ggtttttact ataactttgt 3721 aaattaactt aattttgctg acttgactcc tgtggcttct ggacccctcc tctgccccgc 3781 aggaggaagt tccacggcag gtggtgcgtt cccatgtagg caccaagtgg aagcttgcac 3841 ggtgggccac ggccgtggcg ggtgccctgt gggtgaggct gggtgatgac ctgaggacca 3901 gagacacgcg accatgttgg ggctcttgga ctcctctgga tgacccgtcc ccaaacgttg 3961 acattccatt tcatgttcca ctgtgattaa cttcttttct tttttaaaaa atcattttaa 4021 agttttttgt ttaactataa agtagtacat gtacattata taaaaaaaaa gttcaactag 4081 tatgaaaggg ttataaagta acagaggaaa acgcctcttg gtccctttaa aaaaaaaaa // LOCUS HSFBMBF 1671 bp RNA PRI 01-JAN-1994 DEFINITION H.sapiens mRNA for fetal beta-MHC binding factor. ACCESSION X75917 NID g438046 KEYWORDS beta MHC binding factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1671) AUTHORS Morkin,E. JOURNAL Unpublished REFERENCE 2 (bases 1 to 1671) AUTHORS Morkin,E. TITLE Direct Submission JOURNAL Submitted (05-NOV-1993) E. Morkin, The Univ. of Arizona, Univ. Heart Center, 1501 N. Campbell, Rm 6301, Tucson, AZ 85724, USA FEATURES Location/Qualifiers source 1..1671 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus, 22 weeks" /tissue_type="heart" /clone_lib="clontech" CDS 91..1287 /codon_start=1 /product="fetal beta-MHC binding factor" /db_xref="PID:g438047" /translation="MDIRKFFGAISSGKKPVNETVKKNEKTKPSEGTVKGKKGVKEAK VNNPCKEDASRPKQHNKKKRIIYDSDSESEETVQVKNAKKKSEKLPVSCKPGKISRKD PVTYISETDEDDDFLCKKAASKSKENGVSTNSYLGASNVKKNEENTKTKSKPLSPIKL TPTSVLDYFGTESVQRSGKKMVASKKKESSQTPDDSRLNDEAIAKQLQLDEDAELERQ LHEDEEFARTLALLDEEPKTKKARKDSEEGESFSPAKAELSKAAKQKSPANEHFSIGR KTYSPAKYGKGRGSEGTKQPCRSAHQKEACSSLKASSKLALMKAQEENSYKETELLAA KRESAIEPKGEKTTPRKTKGSPTKRESVSPEDSEKNRTNYQSLSKLLKSRRSQSPGLQ RNTQGS" BASE COUNT 610 a 334 c 434 g 293 t ORIGIN 1 gaattccgcc aatgtattcc gagcggcgat aacggatacc cctccagtca cgggcctgcc 61 cacctgctcc tgttcggtcg tggggctgcg atggacattc ggaaattctt tggggctata 121 tcaagtggaa aaaagcctgt aaatgagaca gtaaagaaga atgagaagac aaaaccttcg 181 gagggaactg tcaaaggaaa gaaaggagta aaggaagcca aggttaataa tccctgcaaa 241 gaggatgcct ccagaccaaa gcagcacaac aagaaaaaga ggatcatcta tgactcagat 301 tcagaatcag aggagacagt gcaagtaaaa aatgctaaaa agaaatcaga aaaattgcca 361 gtgtcttgta aacctggtaa aatctctcgg aaggatcctg ttacctatat ttctgagaca 421 gatgaagacg atgacttttt atgtaaaaag gcagcctcca aatcaaaaga gaatggagta 481 tctacaaaca gttaccttgg agcatcaaac gtgaaaaaga atgaagaaaa cactaagact 541 aagagtaaac cgttatcacc aataaaacta acaccgacat cagtgcttga ttattttgga 601 actgaaagtg tccagagatc tgggaagaag atggtggcga gcaaaaagaa agagtcttct 661 caaaccccag atgattccag attaaatgat gaggccatcg ccaagcagct gcagcttgat 721 gaagatgcag agctggagag gcagttgcat gaagatgaag aatttgcaag aacactggcc 781 ttgttggatg aagaacctaa gaccaaaaag gctcgaaagg actctgaaga gggagaatca 841 ttttcacctg ccaaagctga gttaagcaaa gcagcaaagc agaagagccc tgctaacgag 901 catttctcaa ttggaagaaa gacctacagt cctgctaagt acggcaaggg tagaggctca 961 gaaggcacca agcagccctg cagatcagct caccagaagg aagcctgctc ctctctcaag 1021 gccagctcca agctggctct tatgaaagca caagaagaaa attcttacaa agaaacagaa 1081 ctgctggctg caaaaagaga aagtgccatt gagcccaaag gagagaaaac aactcctagg 1141 aaaacgaaag gctctccaac taaaagagag tctgtaagcc cagaagattc tgaaaagaat 1201 cgcaccaatt atcaaagctt atcgaagcta cttaaatcga gaaggtccca aagccctggg 1261 ctccaaagaa atacccaagg gagctgaaaa ctgcttggag ggcctgacgt tcgtgatcac 1321 cggagtgctg gagtccatcg aagcagaaga agccaagtct ctaattgaac gttatggggg 1381 gaaagtaaca ggaaacgtga gcaagaaaac cagctacctc gtcatgggcc gggacagcgg 1441 gcagtccaag agtgacaagg cagcagctct gggaacaaaa atccttgatg aagacggcct 1501 gttggatctg attcgaacta tgccgggcaa aaaaatccaa gtcgaaatcg ctgctgaggc 1561 tgagatgaag aaagaaaagt ccaaatcaga gagagacaga gagagacaga gaggcagaga 1621 gagacagaga gaggagaagg aggaggagga ataggaggaa gaggaggagg g // LOCUS HSFCGR31 887 bp RNA PRI 12-SEP-1993 DEFINITION Human Fc-gamma RIII-1 cDNA for Fc-gamma receptor III-1 (CD 16). ACCESSION X16863 M31936 NID g31321 KEYWORDS Fc-gamma receptor; Fc-gamma receptor III-1; Fc-gamma RIII-1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 887) AUTHORS Ravetch,J.V. and Perussia,B. TITLE Alternative membrane forms of Fc gamma RIII(CD16) on human natural killer cells and neutrophils. Cell type-specific expression of two genes that differ in single nucleotide substitutions JOURNAL J. Exp. Med. 170 (2), 481-497 (1989) MEDLINE 89328325 REFERENCE 2 (bases 1 to 887) AUTHORS Ravetch,J.V. TITLE Direct Submission JOURNAL Submitted (18-APR-1990) Ravetch J.V., Sloan Kettering Institute, Dept. 6008 RRL 921, 1275 York Ave., New York, NY. USA COMMENT See for Human Fc-gamma RIII-2 receptor. FEATURES Location/Qualifiers source 1..887 /organism="Homo sapiens" /note="Allele: NA-2" /db_xref="taxon:9606" /cell_line="primary-peripheral blood granulocytes" CDS 34..735 /note="(AA 1-233)" /codon_start=1 /product="Fc-gamma receptor III-1 (CD 16)" /db_xref="PID:g31322" /db_xref="SWISS-PROT:P08637" /translation="MWQLLLPTALLLLVSAGMRTEDLPKAVVFLEPQWYSVLEKDSVT LKCQGAYSPEDNSTQWFHNESLISSQASSYFIDAATVNDSGEYRCQTNLSTLSDPVQL EVHIGWLLLQAPRWVFKEEDPIHLRCHSWKNTALHKVTYLQNGKDRKYFHHNSDFHIP KATLKDSGSYFCRGLVGSKNVSSETVNITITQGLAVSTISSFSPPGYQVSFCLVMVLL FAVDTGLYFSVKTNI" allele 141 /note="c is g in NA-1 allele" allele 147 /note="t is c in NA-1 allele" allele 227 /note="g is a in NA-1 allele" allele 277 /note="a is g in NA-1 allele" allele 349 /note="a is g in NA-1 allele" BASE COUNT 228 a 236 c 206 g 217 t ORIGIN 1 tctttggtga cttgtccact ccagtgtggc atcatgtggc agctgctcct cccaactgct 61 ctgctacttc tagtttcagc tggcatgcgg actgaagatc tcccaaaggc tgtggtgttc 121 ctggagcctc aatggtacag cgtgcttgag aaggacagtg tgactctgaa gtgccaggga 181 gcctactccc ctgaggacaa ttccacacag tggtttcaca atgagagcct catctcaagc 241 caggcctcga gctacttcat tgacgctgcc acagtcaacg acagtggaga gtacaggtgc 301 cagacaaacc tctccaccct cagtgacccg gtgcagctag aagtccatat cggctggctg 361 ttgctccagg cccctcggtg ggtgttcaag gaggaagacc ctattcacct gaggtgtcac 421 agctggaaga acactgctct gcataaggtc acatatttac agaatggcaa agacaggaag 481 tattttcatc ataattctga cttccacatt ccaaaagcca cactcaaaga tagcggctcc 541 tacttctgca gggggcttgt tgggagtaaa aatgtgtctt cagagactgt gaacatcacc 601 atcactcaag gtttggcagt gtcaaccatc tcatcattct ctccacctgg gtaccaagtc 661 tctttctgct tggtgatggt actccttttt gcagtggaca caggactata tttctctgtg 721 aagacaaaca tttgaagctc aacaagagac tggaaggacc ataaacttaa atggagaaag 781 gaccctcaag acaaatgacc cccatcccat gggagtaata agagcagtgg cagcagcatc 841 tctgaacatt tctctggatt tgcaacccca tcatcctcag gcctctc // LOCUS HSFCREC 1589 bp RNA PRI 10-JAN-1992 DEFINITION Human mRNA for Fc receptor. ACCESSION X54150 NID g31329 KEYWORDS cell surface glycoprotein; Fc receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1589) AUTHORS Maliszewski,C.S. TITLE Direct Submission JOURNAL Submitted (30-JUL-1990) Maliszewski C.S., Immunex Research and Development Corp., 51 University St., Seattle, WA 98101, USA REFERENCE 2 (bases 1 to 1589) AUTHORS Maliszewski,C.R., March,C.J., Schoenborn,M.A., Gimpel,S. and Shen,L. TITLE Expression cloning of a human Fc receptor for IgA JOURNAL J. Exp. Med. 172 (6), 1665-1672 (1990) MEDLINE 91079769 FEATURES Location/Qualifiers source 1..1589 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA 1..1589 /note="Fc alpha receptor" /evidence=experimental sig_peptide 40..102 /note="Fc alpha receptor" CDS 40..903 /codon_start=1 /product="Fc alpha receptor" /db_xref="PID:g31330" /db_xref="SWISS-PROT:P24071" /translation="MDPKQTTLLCLVLCLGQRIQAQEGDFPMPFISAKSSPVIPLDGS VKIQCQAIREAYLTQLMIIKNSTYREIGRRLKFWNETDPEFVIDHMDANKAGRYQCQY RIGHYRFRYSDTLELVVTGLYGKPFLSADRGLVLMPGENISLTCSSAHIPFDRFSLAK EGELSLPQHQSGEHPANFSLGPVDLNVSGIYRCYGWYNRSPYLWSFPSNALELVVTDS IHQDYTTQNLIRMAVAGLVLVALLAILVENWHSHTALNKEASADVAEPSWSQQMCQPG LTFARTPSVCK" mat_peptide 103..900 /product="Fc alpha receptor" polyA_site 1589 BASE COUNT 382 a 395 c 387 g 425 t ORIGIN 1 ggcacagatc ttggaacgag acgacctgct gtcagcacga tggaccccaa acagaccacc 61 ctcctgtgtc ttgtgctctg tctgggccag aggattcagg cacaggaagg ggactttccc 121 atgcctttca tatctgccaa atcgagtcct gtgattccct tggatggatc tgtgaaaatc 181 cagtgccagg ccattcgtga agcttacctg acccagctga tgatcataaa aaactccacg 241 taccgagaga taggcagaag actgaagttt tggaatgaga ctgatcctga gttcgtcatt 301 gaccacatgg acgcaaacaa ggcagggcgc tatcagtgcc aatataggat agggcactac 361 agattccggt acagtgacac cctggagctg gtagtgacag gcttgtatgg caaacccttc 421 ctctctgcag atcggggtct ggtgttgatg ccaggagaga atatttccct cacgtgcagc 481 tcagcacaca tcccatttga tagattttca ctggccaagg agggagaact ttctctgcca 541 cagcaccaaa gtggggaaca cccggccaac ttctctttgg gtcctgtgga cctcaatgtc 601 tcagggatct acaggtgcta cggttggtac aacaggagcc cctacctgtg gtccttcccc 661 agtaatgcct tggagcttgt ggtcacagac tccatccacc aagattacac gacgcagaac 721 ttgatccgca tggccgtggc aggactggtc ctcgtggctc tcttggccat actggttgaa 781 aattggcaca gccatacggc actgaacaag gaagcctcgg cagatgtggc tgaaccgagc 841 tggagccaac agatgtgtca gccaggattg acctttgcac gaacaccaag tgtctgcaag 901 taaacacctg gaggtgaagg cagagaggag ccaggactgt ggagtccgac aaagctactt 961 gaaggacaca agagagaaaa gctcactaag aagcttgaat ctactttttt ttttttttga 1021 gacagagtct ggctctgtca cccaggctga agtgcagtgg agcaatctcg gctcattgaa 1081 cctcttgggt tcaagtgatt cttgtgcctc agcctcccaa gtagctggaa ttacaggcac 1141 ataccactgc acccagctaa tttttgtatt tttagtagag atggggtttc actgtgttgg 1201 ccaggctggt ctcgaactcc tggacctcag gtgatccacc caccttggcc tcccaaagtg 1261 ctgagattat aggcatgagc caccacgcct ggccagatgc atgttcaaac caatcaaatg 1321 gtgttttctt atgcaggact gatcgatttg cacccacctt tctgcacata agttatggtt 1381 ttccatctta tctgtcttct gattttttat atcctgttta atttcttcct tcattgttct 1441 tctctttttt tatttatttt atttattttt atttttattt ttatttgaga cagagtctca 1501 ctctgttgcc caggaggcgg aggttgcagt gaaccaagag atggcgccag tgcactccac 1561 cctgggtgac agagagactc tttcttttt // LOCUS HSFCRI 1321 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for high affinity Fc receptor (FcRI). ACCESSION X14356 M21091 NID g31331 KEYWORDS Fc receptor; FcRI receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1321) AUTHORS Seed,B. TITLE Direct Submission JOURNAL Submitted (25-OCT-1988) to the EMBL/GenBank/DDBJ databases REFERENCE 2 (bases 1 to 1321) AUTHORS Allen,J.M. and Seed,B. TITLE Nucleotide sequence of three cDNAs for the human high affinity Fc receptor (FcRI) JOURNAL Nucleic Acids Res. 16 (24), 11824 (1988) MEDLINE 89098339 FEATURES Location/Qualifiers source 1..1321 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="fcr135" CDS 37..1161 /note="FcRI (AA 1-374)" /codon_start=1 /db_xref="PID:g31332" /db_xref="SWISS-PROT:P12314" /translation="MWFLTTLLLWVPVDGQVDTTKAVISLQPPWVSVFQEETVTLHCE VLHLPGSSSTQWFLNGTATQTSTPSYRITSASVNDSGEYRCQRGLSGRSDPIQLEIHR GWLLLQVSSRVFTEGEPLALRCHAWKDKLVYNVLYYRNGKAFKFFHWNSNLTILKTNI SHNGTYHCSGMGKHRYTSAGISVTVKELFPAPVLNASVTSPLLEGNLVTLSCETKLLL QRPGLQLYFSFYMGSKTLRGRNTSSEYQILTARREDSGLYWCEAATEDGNVLKRSPEL ELQVLGLQLPTPVWFHVLFYLAVGIMFLVNTVLWVTIRKELKRKKKWDLEISLDSGHE KKVTSSLQEDRHLEEELKCQEQKEEQLQEGVHRKEPQGAT" BASE COUNT 368 a 324 c 324 g 305 t ORIGIN 1 gacagatttc actgctccca ccagcttgga gacaacatgt ggttcttgac aactctgctc 61 ctttgggttc cagttgatgg gcaagtggac accacaaagg cagtgatctc tttgcagcct 121 ccatgggtca gcgtgttcca agaggaaacc gtaaccttgc actgtgaggt gctccatctg 181 cctgggagca gctctacaca gtggtttctc aatggcacag ccactcagac ctcgaccccc 241 agctacagaa tcacctctgc cagtgtcaat gacagtggtg aatacaggtg ccagagaggt 301 ctctcagggc gaagtgaccc catacagctg gaaatccaca gaggctggct actactgcag 361 gtctccagca gagtcttcac ggaaggagaa cctctggcct tgaggtgtca tgcgtggaag 421 gataagctgg tgtacaatgt gctttactat cgaaatggca aagcctttaa gtttttccac 481 tggaattcta acctcaccat tctgaaaacc aacataagtc acaatggcac ctaccattgc 541 tcaggcatgg gaaagcatcg ctacacatca gcaggaatat ctgtcactgt gaaagagcta 601 tttccagctc cagtgctgaa tgcatctgtg acatccccac tcctggaggg gaatctggtc 661 accctgagct gtgaaacaaa gttgctcttg cagaggcctg gtttgcagct ttacttctcc 721 ttctacatgg gcagcaagac cctgcgaggc aggaacacat cctctgaata ccaaatacta 781 actgctagaa gagaagactc tgggttatac tggtgcgagg ctgccacaga ggatggaaat 841 gtccttaagc gcagccctga gttggagctt caagtgcttg gcctccagtt accaactcct 901 gtctggtttc atgtcctttt ctatctggca gtgggaataa tgtttttagt gaacactgtt 961 ctctgggtga caatacgtaa agaactgaaa agaaagaaaa agtgggattt agaaatctct 1021 ttggattctg gtcatgagaa gaaggtaact tccagccttc aagaagacag acatttagaa 1081 gaagagctga aatgtcagga acaaaaagaa gaacagctgc aggaaggggt gcaccggaag 1141 gagccccagg gggccacgta gcagcggctc agtgggtggc catcgatctg gaccgtcccc 1201 tgcccacttg ctccccgtga gcactgcgta caaacatcca aaagttcaac aacaccagaa 1261 ctgtgtgtct catggtatgt aactcttaaa gcaaataaat gaactgactt caaaaaaaaa 1321 a // LOCUS HSFCRII 1403 bp RNA PRI 23-MAR-1995 DEFINITION Human FcRII mRNA for immunoglobulin G receptor. ACCESSION Y00644 NID g31335 KEYWORDS cell surface glycoprotein; IgG receptor; immunoglobulin receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1403) AUTHORS Moore,K.W. and Martens,C.L. TITLE Direct Submission JOURNAL Submitted (26-OCT-1987) Moore K.W., Martens C.L., DNAX, 901 California Ave, Palo Alto, CA, 94304 USA REFERENCE 2 (bases 1 to 1403) AUTHORS Stuart,S.G., Trounstine,M.L., Vaux,D.J., Koch,T., Martens,C.L., Mellman,I. and Moore,K.W. TITLE Isolation and expression of cDNA clones encoding a human receptor for IgG (Fc gamma RII) JOURNAL J. Exp. Med. 166 (6), 1668-1684 (1987) MEDLINE 88061079 FEATURES Location/Qualifiers source 1..1403 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="K937" /clone_lib="pcD vector" /clone="16.2" sig_peptide 20..121 /note="signal peptide (AA -34 to -1)" CDS 20..973 /note="precursor polypeptide (AA -34 to 287)" /codon_start=1 /db_xref="PID:g31336" /db_xref="SWISS-PROT:P12318" /translation="MTMETQMSQNVCPRNLWLLQPLTVLLLLASADSQAAAPPKAVLK LEPPWINVLQEDSVTLTCQGARSPESDSIQWFHNGNLIPTHTQPSYRFKANNNDSGEY TCQTGQTSLSDPVHLTVLSEWLVLQTPHLEFQEGETIMLRCHSWKDKPLVKVTFFQNG KSQKFSRLDPTFSIPQANHSHSGDYHCTGNIGYTLFSSKPVTITVQVPSMGSSSPMGI IVAVVIATAVAAIVAAVVALIYCRKKRISANSTDPVKAAQFEPPGRQMIAIRKRQLEE TNNDYETADGGYMTLNPRAPTDDDKNIYLTLPPNDHVNSNN" mat_peptide 122..970 /note="mature IgG receptor (AA 1-287)" misc_feature 308..316 /note="N-glycosylation site" misc_feature 551..559 /note="N-glycosylation site" misc_feature 656..742 /note="transmembrane domain" BASE COUNT 410 a 372 c 305 g 316 t ORIGIN 1 ggggggggac agtgctggga tgactatgga gacccaaatg tctcagaatg tatgtcccag 61 aaacctgtgg ctgcttcaac cattgacagt tttgctgctg ctggcttctg cagacagtca 121 agctgcagct cccccaaagg ctgtgctgaa acttgagccc ccgtggatca acgtgctcca 181 ggaggactct gtgactctga catgccaggg ggctcgcagc cctgagagcg actccattca 241 gtggttccac aatgggaatc tcattcccac ccacacgcag cccagctaca ggttcaaggc 301 caacaacaat gacagcgggg agtacacgtg ccagactggc cagaccagcc tcagcgaccc 361 tgtgcatctg actgtgcttt ccgaatggct ggtgctccag acccctcacc tggagttcca 421 ggagggagaa accatcatgc tgaggtgcca cagctggaag gacaagcctc tggtcaaggt 481 cacattcttc cagaatggaa aatcccagaa attctcccgt ttggatccca ccttctccat 541 cccacaagca aaccacagtc acagtggtga ttaccactgc acaggaaaca taggctacac 601 gctgttctca tccaagcctg tgaccatcac tgtccaagtg cccagcatgg gcagctcttc 661 accaatgggg atcattgtgg ctgtggtcat tgcgactgct gtagcagcca ttgttgctgc 721 tgtagtggcc ttgatctact gcaggaaaaa gcggatttca gccaattcca ctgatcctgt 781 gaaggctgcc caatttgagc cacctggacg tcaaatgatt gccatcagaa agagacaact 841 tgaagaaacc aacaatgact atgaaacagc tgacggcggc tacatgactc tgaaccccag 901 ggcacctact gacgatgata aaaacatcta cctgactctt cctcccaacg accatgtcaa 961 cagtaataac taaagagtaa cgttatgcca tgtggtcata ctctcagctt gcgtatggat 1021 gcaaaaaaga ggggaattgt taaaggaaaa tttaaatgga gactggaaaa atcctgagca 1081 aacaaaacca cctggccctt agaaatagct ttaactttgc ttaaactaca aacacaagca 1141 aaacttcacg gggtcatact acatacaagc ataagcaaaa cttaacttgg atcatttctg 1201 gtaaatgctt atgttagaaa taagacaacc ccagccaatc acaagcagcc tactaacata 1261 taattaggtg actagggact ttctaagaag atacctaccc ccaaaaaaca acttatgtaa 1321 ttgaaaacca accgattgcc tttattttgc ttccacattt tcccaataaa tacttgcctg 1381 tgacattttg ccactggaac act // LOCUS HSFGFRBE 3415 bp RNA PRI 23-MAR-1995 DEFINITION Human bek mRNA for fibroblast growth factor receptor-BEK. ACCESSION X52832 NID g31373 KEYWORDS cell surface glycoprotein; fibroblast growth factor receptor; tyrosine kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3415) AUTHORS Dionne,C.A. TITLE Direct Submission JOURNAL Submitted (01-JUN-1990) Dionne C., Rorer Central Research, 680 Allelande Road, King of Prussia, PA 19425, USA REFERENCE 2 (bases 1 to 3415) AUTHORS Dionne,C.A., Crumley,G., Bellot,F., Kaplow,J.M., Searfoss,G., Ruta,M., Burgess,W.H., Jaye,M. and Schlessinger,J. TITLE Cloning and expression of two distinct high-affinity receptors cross-reacting with acidic and basic fibroblast growth factors JOURNAL EMBO J. 9 (9), 2685-2692 (1990) MEDLINE 90360977 FEATURES Location/Qualifiers source 1..3415 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="neonatal (1 day)" /tissue_type="brainstem" /clone_lib="lambda gt11" sig_peptide 180..242 /note="signal peptide" CDS 180..2645 /note="fibroblast growth factor receptor-BEK precursor" /codon_start=1 /db_xref="PID:g31374" /db_xref="SWISS-PROT:P21802" /translation="MVSWGRFICLVVVTMATLSLARPSFSLVEDTTLEPEEPPTKYQI SQPEVYVAAPGESLEVRCLLKDAAVISWTKDGVHLGPNNRTVLIGEYLQIKGATPRDS GLYACTASRTVDSETWYFMVNVTDAISSGDDEDDTDGAEDFVSENSNNKRAPYWTNTE KMEKRLHAVPAANTVKFRCPAGGNPMPTMRWLKNGKEFKQEHRIGGYKVRNQHWSLIM ESVVPSDKGNYTCVVENEYGSINHTYHLDVVERSPHRPILQAGLPANASTVVGGDVEF VCKVYSDAQPHIQWIKHVEKNGSKYGPDGLPYLKVLKAAGVNTTDKEIEVLYIRNVTF EDAGEYTCLAGNSIGISFHSAWLTVLPAPGREKEITASPDYLEIAIYCIGVFLIACMV VTVILCRMKNTTKKPDFSSQPAVHKLTKRIPLRRQVTVSAESSSSMNSNTPLVRITTR LSSTADTPMLAGVSEYELPEDPKWEFPRDKLTLGKPLGEGCFGQVVMAEAVGIDKDKP KEAVTVAVKMLKDDATEKDLSDLVSEMEMMKMIGKHKNIINLLGACTQDGPLYVIVEY ASKGNLREYLRARRPPGMEYSYDINRVPEEQMTFKDLVSCTYQLARGMEYLASQKCIH RDLAARNVLVTENNVMKIADFGLARDINNIDYYKKTTNGRLPVKWMAPEALFDRVYTH QSDVWSFGVLMWEIFTLGGSPYPGIPVEELFKLLKEGHRMDKPANCTNELYMMMRDCW HAVPSQRPTFKQLVEDLDRILTLTTNEEYLDLSQPLEQYSPSYPDTRSSCSSGDDSVF SPDPMPYEPCLPQYPHINGSVKT" mat_peptide 243..2642 /note="mature fibroblast growth factor receptor-BEK" BASE COUNT 953 a 780 c 865 g 817 t ORIGIN 1 cccaggtcgc ggaggagcgt tgccattcaa gtgactgcag cagcagcggc accgctcggt 61 tcctgagccc accgcagctg aaggcattgc gcgtagtcca tgcccgtaga ggaagtgtgc 121 agatgggatt aacgtccaca tggagatatg gaagaggacc ggggattggt accgtaacca 181 tggtcagctg gggtcgtttc atctgcctgg tcgtggtcac catggcaacc ttgtccctgg 241 cccggccctc cttcagttta gttgaggata ccacattaga gccagaagag ccaccaacca 301 aataccaaat ctctcaacca gaagtgtacg tggctgcacc aggggagtcg ctagaggtgc 361 gctgcctgtt gaaagatgcc gccgtgatca gttggactaa ggatggggtg cacttggggc 421 ccaacaatag gacagtgctt attggggagt acttgcagat aaagggcgcc acgcctagag 481 actccggcct ctatgcttgt actgccagta ggactgtaga cagtgaaact tggtacttca 541 tggtgaatgt cacagatgcc atctcatccg gagatgatga ggatgacacc gatggtgcgg 601 aagattttgt cagtgagaac agtaacaaca agagagcacc atactggacc aacacagaaa 661 agatggaaaa gcggctccat gctgtgcctg cggccaacac tgtcaagttt cgctgcccag 721 ccggggggaa cccaatgcca accatgcggt ggctgaaaaa cgggaaggag tttaagcagg 781 agcatcgcat tggaggctac aaggtacgaa accagcactg gagcctcatt atggaaagtg 841 tggtcccatc tgacaaggga aattatacct gtgtggtgga gaatgaatac gggtccatca 901 atcacacgta ccacctggat gttgtggagc gatcgcctca ccggcccatc ctccaagccg 961 gactgccggc aaatgcctcc acagtggtcg gaggagacgt agagtttgtc tgcaaggttt 1021 acagtgatgc ccagccccac atccagtgga tcaagcacgt ggaaaagaac ggcagtaaat 1081 acgggcccga cgggctgccc tacctcaagg ttctcaaggc cgccggtgtt aacaccacgg 1141 acaaagagat tgaggttctc tatattcgga atgtaacttt tgaggacgct ggggaatata 1201 cgtgcttggc gggtaattct attgggatat cctttcactc tgcatggttg acagttctgc 1261 cagcgcctgg aagagaaaag gagattacag cttccccaga ctacctggag atagccattt 1321 actgcatagg ggtcttctta atcgcctgta tggtggtaac agtcatcctg tgccgaatga 1381 agaacacgac caagaagcca gacttcagca gccagccggc tgtgcacaag ctgaccaaac 1441 gtatccccct gcggagacag gtaacagttt cggctgagtc cagctcctcc atgaactcca 1501 acaccccgct ggtgaggata acaacacgcc tctcttcaac ggcagacacc cccatgctgg 1561 caggggtctc cgagtatgaa cttccagagg acccaaaatg ggagtttcca agagataagc 1621 tgacactggg caagcccctg ggagaaggtt gctttgggca agtggtcatg gcggaagcag 1681 tgggaattga caaagacaag cccaaggagg cggtcaccgt ggccgtgaag atgttgaaag 1741 atgatgccac agagaaagac ctttctgatc tggtgtcaga gatggagatg atgaagatga 1801 ttgggaaaca caagaatatc ataaatcttc ttggagcctg cacacaggat gggcctctct 1861 atgtcatagt tgagtatgcc tctaaaggca acctccgaga atacctccga gcccggaggc 1921 cacccgggat ggagtactcc tatgacatta accgtgttcc tgaggagcag atgaccttca 1981 aggacttggt gtcatgcacc taccagctgg ccagaggcat ggagtacttg gcttcccaaa 2041 aatgtattca tcgagattta gcagccagaa atgttttggt aacagaaaac aatgtgatga 2101 aaatagcaga ctttggactc gccagagata tcaacaatat agactattac aaaaagacca 2161 ccaatgggcg gcttccagtc aagtggatgg ctccagaagc cctgtttgat agagtataca 2221 ctcatcagag tgatgtctgg tccttcgggg tgttaatgtg ggagatcttc actttagggg 2281 gctcgcccta cccagggatt cccgtggagg aactttttaa gctgctgaag gaaggacaca 2341 gaatggataa gccagccaac tgcaccaacg aactgtacat gatgatgagg gactgttggc 2401 atgcagtgcc ctcccagaga ccaacgttca agcagttggt agaagacttg gatcgaattc 2461 tcactctcac aaccaatgag gaatacttgg acctcagcca acctctcgaa cagtattcac 2521 ctagttaccc tgacacaaga agttcttgtt cttcaggaga tgattctgtt ttttctccag 2581 accccatgcc ttacgaacca tgccttcctc agtatccaca cataaacggc agtgttaaaa 2641 catgaatgac tgtgtctgcc tgtccccaaa caggacagca ctgggaacct agctacactg 2701 agcagggaga ccatgcctcc cagagcttgt tgtctccact tgtatatatg gatcagagga 2761 gtaaataatt ggaaaagtaa tcagcatatg tgtaaagatt tatacagttg aaaacttgta 2821 atcttcccca ggaggagaag aaggtttctg gagcagtgga ctgccacaag ccaccatgta 2881 acccctctca cctgccgtgc gtactggctg tggaccagta ggactcaagg tggacgtgcg 2941 ttctgccttc cttgttaatt ttgtaataat tggagaagat ttatgtcagc acacacttac 3001 agagcacaaa tgcagtatat aggtgctgga tgtatgtaaa tatattcaaa ttatgtataa 3061 atatatatta tatatttaca aggagttatt ttttgtattg attttaaatg gatgtcccaa 3121 tgcacctaga aaattggtct ctcttttttt aatagctatt tgctaaatgc tgttcttaca 3181 cataatttct taattttcac cgagcagagg tggaaaaata cttttgcttt cagggaaaat 3241 ggtataacgt taatttatta ataaattggt aatatacaaa acaattaatc atttatagtt 3301 ttttttgtaa tttaagtggc atttctatgc aggcagcaca gcagactagt taatctattg 3361 cttggactta actagttatc agatcctttg aaaagagaat atttacaata tatga // LOCUS HSFGR1IG 826 bp RNA PRI 29-AUG-1991 DEFINITION Human mRNA for fibroblast growth receptor 1-Ig domain (secreted form). ACCESSION X57118 NID g31384 KEYWORDS FGF receptor; fibroblast growth factor receptor; immunoglobulin domain. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 826) AUTHORS Tronick,S.R. TITLE Direct Submission JOURNAL Submitted (02-JAN-1991) S.R. Tronick, NATIONAL INSTITUTES OF HEALTH, NATIONAL CANCER INSTITUTE, BLDG 37/RM 1E24, BETHESDA MARYLAND 20892, USA REFERENCE 2 (bases 1 to 826) AUTHORS Eisemann,A., Ahn,J.A., Graziani,G., Tronick,S.R. and Ron,D. TITLE Alternative splicing generates at least five different isoforms of the human basic-FGF receptor JOURNAL Oncogene 6 (7), 1195-1202 (1991) MEDLINE 91319400 REMARK Erratum:[Oncogene 1991 Dec;6(12):2379]] COMMENT See also X57118-X57122. FEATURES Location/Qualifiers source 1..826 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /dev_stage="embryo" /tissue_type="lung" /cell_type="fibroblast" /cell_line="M432" mRNA 1..826 /evidence=experimental CDS 1..453 /note="the predicted amino acid sequence of this clone would represent a truncated and secreted form of the FGF receptor since it lacks a transmembrane domain" /codon_start=1 /product="fibroblast growth receptor 1-Ig domain, secreted form" /db_xref="PID:g31385" /db_xref="SWISS-PROT:P11362" /translation="MWSWKCLLFWAVLVTATLCTARPSPTLPEQAQPWGAPVEVESFL VHPGDLLQLRCRLRDDVQSINWLRDGVQLAESNRTRITGEEVEVQDSVPADSGLYACV TSSPSGSDTTYFSVNVSACPDLQEAKWCSASFHSITPLPFGLGTRLSD" BASE COUNT 189 a 247 c 225 g 165 t ORIGIN 1 atgtggagct ggaagtgcct cctcttctgg gctgtgctgg tcacagccac actctgcacc 61 gctaggccgt ccccgacctt gcctgaacaa gcccagccct ggggagcccc tgtggaagtg 121 gagtccttcc tggtccaccc cggtgacctg ctgcagcttc gctgtcggct gcgggacgat 181 gtgcagagca tcaactggct gcgggacggg gtgcagctgg cggaaagcaa ccgcacccgc 241 atcacagggg aggaggtgga ggtgcaggac tccgtgcccg cagactccgg cctctatgct 301 tgcgtaacca gcagcccctc gggcagtgac accacctact tctccgtcaa tgtttcagct 361 tgcccagatc tccaggaggc taagtggtgc tcggccagct tccactccat cactcccttg 421 ccatttggac ttggtactcg gcttagtgat tagaggccct gaacaggtgg tggtatccct 481 gctctgctgg agaggaaccc agatgctctc ccctcctcgg aggatgatga tgatgatgat 541 gactcctctt cagaggagaa agaaacagat aacaccaaac caaaccccgt agctccatat 601 tggacatccc cagaaaagat ggaaaagaaa ttgcatgcag tgccggctgc caagacagtg 661 aagttcaaat gcccttccag tgggacccca aaccccacac tgcgctggtt gaaaaatggc 721 aaagaattca aacctgacca cagaattgga ggctacaagg tccgttatgc cacctggagc 781 atcataatgg actctgtggt gccctctgac aagggcaact acacct // LOCUS HSFIB 1090 bp RNA PRI 06-NOV-1991 DEFINITION Human humFib mRNA for fibrillarin. ACCESSION X56597 NID g31394 KEYWORDS fibrillarin; nucleolar protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1090) AUTHORS Hurt,E.C. TITLE Direct Submission JOURNAL Submitted (12-NOV-1990) E.C. Hurt, EMBL, MEYERHOFSTR 1, POSTFACH 10.22 09, FRG REFERENCE 2 (bases 1 to 1090) AUTHORS Jansen,R.P., Hurt,E.C., Kern,H., Lehtonen,H., Carmo-Fonseca,M., Lapeyre,B. and Tollervey,D. TITLE Evolutionary conservation of the human nucleolar protein fibrillarin and its functional expression in yeast JOURNAL J. Cell Biol. 113 (4), 715-729 (1991) MEDLINE 91225069 FEATURES Location/Qualifiers source 1..1090 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" mRNA 1..1090 /gene="humFib" /evidence=experimental gene 1..1090 /gene="humFib" mat_peptide 60..1022 /gene="humFib" /product="fibrillarin" CDS 60..1025 /gene="humFib" /codon_start=1 /product="fibrillarin" /db_xref="PID:g31395" /db_xref="SWISS-PROT:P22087" /translation="MKPGFSPRGGGFGGRGGFGDRGGRGGRGGFGGGRGRGGGFRGRG RGGGGGGGGGGGGGRGGGGFHSGGNRGRGRGGKRGNQSGKNVMVEPHRHEGVFICRGK EDALVTKNLVPGESVYGEKRVSISEGDDKIEYRAWNPFRSKLAAAILGGVDQIHIKPG AKVLYLGAASGTTVSHVSDIVGPDGLVYAVEFSHRSGRDLINLAKKRTNIIPVIEDAR HPHKYRMLIAMVDVIFADVAQPDQTRIVALNAHTFLRNGGHFVISIKANCIDSTASAE AVFASEVKKMQQENMKPQEQLTLEPYERDHAVVVGVYRPPPKVKN" BASE COUNT 244 a 267 c 364 g 215 t ORIGIN 1 ggatccggca acgaaggtac catggccgga ctccggagcc gcacaaacca gggctcgcca 61 tgaagccagg attcagtccc cgtgggggtg gctttggcgg ccgagggggc tttggtgacc 121 gtggtggtcg tggaggccga gggggctttg gcgggggccg aggtcgaggc ggaggcttta 181 gaggtcgtgg acgaggagga ggtggaggcg gcggcggcgg tggaggagga ggaagaggtg 241 gtggaggctt ccattctggt ggcaaccggg gtcgtggtcg gggaggaaaa agaggaaacc 301 agtcggggaa gaatgtgatg gtggagccgc atcggcatga gggtgtcttc atttgtcgag 361 gaaaggaaga tgcactggtc accaagaacc tggtccctgg ggaatcagtt tatggagaga 421 agagagtctc gatttcggaa ggagatgaca aaattgagta ccgagcctgg aaccccttcc 481 gctccaagct agcagcagca atcctgggtg gtgtggacca gatccacatc aaaccggggg 541 ctaaggttct ctacctcggg gctgcctcgg gcaccacggt ctcccatgtc tctgacatcg 601 ttggtccgga tggtctagtc tatgcagtcg agttctccca ccgctctggc cgtgacctca 661 ttaacttggc caagaagagg accaacatca ttcctgtgat cgaggatgct cgacacccac 721 acaaataccg catgctcatc gcaatggtgg atgtgatctt tgctgatgtg gcccagccag 781 accagacccg gattgtggcc ctgaatgccc acaccttcct gcgtaatgga ggacactttg 841 tgatttccat taaggccaac tgcattgact ccacagcctc agccgaggcc gtgtttgcct 901 ccgaagtgaa aaagatgcaa caggagaaca tgaagccgca ggagcagttg acccttgagc 961 catatgaaag agaccatgcc gtggtcgtgg gagtgtacag gccacccccc aaggtgaaga 1021 actgaagttc agcgctgtca ggattgcgag agatgtgtgt tgataccatg gtaccttcgt 1081 tgccggatcc // LOCUS HSFIBLP 1496 bp RNA PRI 22-MAR-1997 DEFINITION H.sapiens mRNA for fibrinogen-like protein (pT49 protein). ACCESSION Z36531 NID g535184 KEYWORDS fibrinogen-like protein; pT49 protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1496) AUTHORS Ruegg,C. and Pytela,R. TITLE Sequence of a human transcript expressed in T-lymphocytes and encoding a fibrinogen-like protein JOURNAL Gene 160 (2), 257-262 (1995) MEDLINE 95369700 REFERENCE 2 (bases 1 to 1496) AUTHORS Ruegg,C. TITLE Direct Submission JOURNAL Submitted (11-AUG-1994) Curzio Ruegg, Laboratory of the CPO, Swiss Institute for, Experimental Cancer Research, 155 Chemin des Boveresees, Epalinges, CH-1066, Switzerland FEATURES Location/Qualifiers source 1..1496 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="small intestine" /clone_lib="lambda gt11 cDNA lib HL1133b (Clontech, Palo Alto)" /sex="female" CDS 34..1353 /note="expressed in T lymphocytes (pT49 protein)" /codon_start=1 /product="fibrinogen-like protein" /db_xref="PID:g535185" /translation="MKLANWYWLSSAVLATYGFLVVANNETEEIKDERAKDVCPVRLE SRGKCEEAGECPYQVSLPPLTIQLPKQFSRIEEVFKEVQNLKEIVNSLKKSCQDCKLQ ADDNGDPGRNGLLLPSTGAPGEVGDNRVRELESEVNKLSSELKNAKEEINVLHGRLEK LNLVNMNNIENYVDSKVANLTFVVNSLDGKCSKCPSQEQIQSRPVQHLIYKDCSDYYA IGKRSSETYRVTPDPKNSSFEVYCDMETMGGGWTVLQARLDGSTNFTRTWQDYKAGFG NLRREFWLGNDKIHLLTKSKEMILRIDLEDFNGVELYALYDQFYVANEFLKYRLHVGN YNGTAGDALRFNKHYNHDLKFFTTPDKDNDRYPSGNCGLYYSSGWWFDACLSANLNGK YYHQKYRGVRNGIFWGTWPGVSEAHPGGYKSSFKEAKMMIRPKHFKP" BASE COUNT 469 a 296 c 351 g 380 t ORIGIN 1 cgcactccct gctgggtgag cagcactgta aagatgaagc tggctaactg gtactggctg 61 agctcagctg ttcttgccac ttacggtttt ttggttgtgg caaacaatga aacagaggaa 121 attaaagatg aaagagcaaa ggatgtctgc ccagtgagac tagaaagcag agggaaatgc 181 gaagaggcag gggagtgccc ctaccaggta agcctgcccc ccttgactat tcagctcccg 241 aagcaattca gcaggatcga ggaggtgttc aaagaagtcc aaaacctcaa ggaaatcgta 301 aatagtctaa agaaatcttg ccaagactgc aagctgcagg ctgatgacaa cggagaccca 361 ggcagaaacg gactgttgtt acccagtaca ggagccccgg gagaggttgg tgataacaga 421 gttagagaat tagagagtga ggttaacaag ctgtcctctg agctaaagaa tgccaaagag 481 gagatcaatg tacttcatgg tcgcctggag aagctgaatc ttgtaaatat gaacaacata 541 gaaaattatg ttgacagcaa agtggcaaat ctaacatttg ttgtcaatag tttggatggc 601 aaatgttcaa agtgtcccag ccaagaacaa atacagtcac gtccagttca acatctaata 661 tataaagatt gctctgacta ctacgcaata ggcaaaagaa gcagtgagac ctacagagtt 721 acacctgatc ccaaaaatag tagctttgaa gtttactgtg acatggagac catgggggga 781 ggctggacag tgctgcaggc acgtctcgat gggagcacca acttcaccag aacatggcaa 841 gactacaaag caggctttgg aaacctcaga agggaatttt ggctggggaa cgataaaatt 901 catcttctga ccaagagtaa ggaaatgatt ctgagaatag atcttgaaga ctttaatggt 961 gtcgaactat atgccttgta tgatcagttt tatgtggcta atgagtttct caaatatcgt 1021 ttacacgttg gtaactataa tggcacagct ggagatgcat tacgtttcaa caaacattac 1081 aaccacgatc tgaagttttt caccactcca gataaagaca atgatcgata tccttctggg 1141 aactgtgggc tgtactacag ttcaggctgg tggtttgatg catgtctttc tgcaaactta 1201 aatggcaaat attatcacca aaaatacaga ggtgtccgta atgggatttt ctggggtacc 1261 tggcctggtg taagtgaggc acaccctggt ggctacaagt cctccttcaa agaggctaag 1321 atgatgatca gacccaagca ctttaagcca taaatcactc tgttcattcc tccaggtatt 1381 cgttatctaa tagggcaatt aattccttca gcactttaga atatgccttg tttcatattt 1441 ttcatagcta aaaaatgttt gacatccttt gagatatttt attactaaaa tctgcc // LOCUS HSFIBUA 2349 bp RNA PRI 21-AUG-1995 DEFINITION H.sapiens mRNA for fibulin-1 A. ACCESSION X53741 NID g31414 KEYWORDS fibulin-1 A; glycoprotein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2349) AUTHORS Argraves,W.S. TITLE Direct Submission JOURNAL Submitted (03-JUL-1990) Argraves W.S., American Red Cross, 15601 Crabbs Branch Way, Rockville, MD 20855, USA REFERENCE 2 (bases 1 to 2349) AUTHORS Argraves,W.S., Tran,H., Burgess,W.H. and Dickerson,K. TITLE Fibulin is an extracellular matrix and plasma glycoprotein with repeated domain structure JOURNAL J. Cell Biol. 111 (6 Pt 2), 3155-3164 (1990) MEDLINE 91100426 REFERENCE 3 (bases 1 to 2349) AUTHORS Korenberg,J.R., Chen,X.N., Tran,H. and Argraves,W.S. TITLE Localization of the human gene for fibulin-1 (FBLN1) to chromosome band 22q13.3 JOURNAL Cytogenet. Cell Genet. 68 (3-4), 192-193 (1995) MEDLINE 95145011 FEATURES Location/Qualifiers source 1..2349 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /chromosome="22q13.3" mRNA 1..2349 /evidence=experimental sig_peptide 11..85 CDS 11..1711 /codon_start=1 /product="fibulin-1 A" /db_xref="PID:g31415" /db_xref="SWISS-PROT:P23142" /translation="MERAAPSRRVPLPLLLLGGLALLAAGVDADVLLEACCADGHRMA THQKDCSLPYATESKECRMVQEQCCHSQLEELHCATGISLANEQDRCATPHGDNASLE ATFVKRCCHCCLLGRAAQAQGQSCEYSLMVGYQCGQVFRACCVKSQETGDLDVGGLQE TDKIIEVEEEQEDPYLNDRCRGGGPCKQQCRDTGDEVVCSCFVGYQLLSDGVSCEDVN ECITGSHSCRLGESCINTVGSFRCQRDSSCGTGYELTEDNSCKDIDECESGIHNCLPD FICQNTLGSFRCRPKLQCKSGFIQDALGNCIDINECLSISAPCPIGHTCINTEGSYTC QKNVPNCGRGYHLNEEGTRCVDVDECAPPAEPCGKGHRCVNSPGSFRCECKTGYYFDG ISRMCVDVNECQRYPGRLCGHKCENTLGSYLCSCSVGFRLSVDGRSCEDINECSSSPC SQECANVYGSYQCYCRRGYQLSDVDGVTCEDIDECALPTGGHICSYRCINIPGSFQCS CPSSGYRLAPNGRNCQDIDECVTGIHNCSINETCFNIQGAFRCLAFECPENYRRSAAT " mat_peptide 86..1708 /product="fibulin-1 A" polyA_site 2349 BASE COUNT 520 a 668 c 682 g 479 t ORIGIN 1 cccgccgccc atggagcgcg ccgcgccgtc gcgccgggtc ccgcttccgc tgctgctgct 61 cggcggcctt gcgctgctgg cggccggagt ggacgcggat gtcctcctgg aggcctgctg 121 tgcggacgga caccggatgg ccactcatca gaaggactgc tcgctgccat atgctacgga 181 atccaaagaa tgcaggatgg tgcaggagca gtgctgccac agccagctgg aggagctgca 241 ctgtgccacg ggcatcagcc tggccaacga gcaggaccgc tgtgccacgc cccacggtga 301 caacgccagc ctggaggcca catttgtgaa gaggtgctgc cattgctgtc tgctggggag 361 ggcggcccag gcccagggcc agagctgcga gtacagcctc atggttggct accagtgtgg 421 acaggtcttc cgggcatgct gtgtcaagag ccaggagacc ggagatttgg atgtcggggg 481 cctccaagaa acggataaga tcattgaggt tgaggaggaa caagaggacc catatctgaa 541 tgaccgctgc cgaggaggcg ggccctgcaa gcagcagtgc cgagacacgg gtgacgaggt 601 ggtctgctcc tgcttcgtgg gctaccagct gctgtctgat ggtgtctcct gtgaagatgt 661 caatgaatgc atcacgggca gccacagctg ccggcttgga gaatcctgca tcaacacagt 721 gggctctttc cgctgccagc gggacagcag ctgcgggact ggctatgagc tcacagagga 781 caatagctgc aaagatattg acgagtgtga gagtggtatt cataactgcc tccccgattt 841 tatctgtcag aatactctgg gatccttccg ctgccgaccc aagctacagt gcaagagtgg 901 ctttatacaa gatgctctag gcaactgtat tgatatcaat gagtgtttga gtatcagtgc 961 cccgtgccct attgggcata catgcatcaa cacagagggc tcctacacgt gccagaagaa 1021 cgtgcccaac tgtggccgtg gctaccatct caacgaggag ggaacgcgct gtgttgatgt 1081 ggacgagtgc gcgccacctg ctgagccctg tgggaaggga catcgctgcg tgaactctcc 1141 cggcagtttc cgctgcgaat gcaagacggg ttactatttt gacggcatca gcaggatgtg 1201 tgtcgatgtc aacgagtgcc agcgctaccc cgggcgcctg tgtggccaca agtgcgagaa 1261 cacgctgggc tcctacctct gcagctgttc cgtgggcttc cggctctctg tggatggcag 1321 gtcatgtgaa gacatcaatg agtgcagcag cagcccctgt agccaggagt gtgccaacgt 1381 ctacggctcc taccagtgtt actgccggcg aggctaccag ctcagcgatg tggatggagt 1441 cacctgtgaa gacatcgacg agtgcgccct gcccaccggg ggccacatct gctcctaccg 1501 ctgcatcaac atccctggaa gcttccagtg cagctgcccc tcgtctggct acaggctggc 1561 ccccaatggc cgcaactgcc aagacattga tgagtgtgtg actggcatcc acaactgctc 1621 catcaacgag acctgcttca acatccaggg cgcgttccgc tgcctggcct tcgagtgccc 1681 tgagaactac cgccgctccg cagccacatg atcgtaggga actctgcatg aggccatcgg 1741 tgcaggctgg agaagagaag gcaagttggc aggagtggag accacaggca tttgagccac 1801 ttcctcatgt aacttaactt gtgccttcag gacctgctca agcccgatca cgtatatacc 1861 acttccattt gatgatggaa tgctgctgtt catgaccaac tttatggcta gatgggtcag 1921 aaagcaccca gttcatgata ggcagttcag gtcatatggt gacttgatga cccagagtca 1981 aacattcagt ttccaccaaa gcccagtaac aggccaagag ctgtctctca aaagaagagt 2041 agttatctgc agaagatggc agggccttgc tccgaaagcc tagagaccgc cactgtgatt 2101 cacctatggg ggcctgccaa agctgcagcc agcatcctta tctgccactg acacctcaag 2161 caacattgga tctgctgggt catatggccc aagtggcaga gcaacttgca caacagcctg 2221 gacctgtcat agagctttct cctgttctgg accccactca aaactggcag cctttcaggt 2281 cactcaataa atgtgctgga gtaacacact caaacgagga atgtgttgcc tccaaaatcc 2341 aataggccc // LOCUS HSFKBPMR 777 bp RNA PRI 09-FEB-1995 DEFINITION Human FKBP mRNA for FK-506 binding protein. ACCESSION X52220 NID g665649 KEYWORDS FK506-binding protein; immunophilin; immunophilin B gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 777) AUTHORS Peattie,D.A. TITLE Direct Submission JOURNAL Submitted (13-AUG-1990) Peattie D.A., Vertex Pharmaceuticals Incorporated, 40 Allston Street, Cambridge, MA 02139-4211, USA REFERENCE 2 (bases 1 to 777) AUTHORS Peattie,D.A., Hsiao,K., Benasutti,M. and Lippke,J.A. TITLE Three distinct messenger RNAs can encode the human immunosuppressant-binding protein FKBP12 JOURNAL Gene 150 (2), 251-257 (1994) MEDLINE 95121911 FEATURES Location/Qualifiers source 1..777 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="Clontech #HL1067J" mRNA 1..731 /gene="FKBP" gene 1..731 /gene="FKBP" CDS 31..357 /gene="FKBP" /codon_start=1 /product="FK-506 binding protein" /db_xref="PID:g665650" /db_xref="SWISS-PROT:P20071" /translation="MGVQVETISPGDGRTFPKRGQTCVVHYTGMLEDGKKFDSSRDRN KPFKFMLGKQEVIRGWEEGVAQMSVGQRAKLTISPDYAYGATGHPGIIPPHATLVFDV ELLKLE" polyA_site 731 /gene="FKBP" BASE COUNT 229 a 175 c 183 g 190 t ORIGIN 1 ccgcccgccc gctcagcgtc cgccgccgcc atgggagtgc aggtggaaac catctcccca 61 ggagacgggc gcaccttccc caagcgcggc cagacctgcg tggtgcacta caccgggatg 121 cttgaagatg gaaagaaatt tgattcctcc cgggacagaa acaagccctt taagtttatg 181 ctaggcaagc aggaggtgat ccgaggctgg gaagaagggg ttgcccagat gagtgtgggt 241 cagagagcca aactgactat atctccagat tatgcctatg gtgccactgg gcacccaggc 301 atcatcccac cacatgccac tctcgtcttc gatgtggagc ttctaaaact ggaatgacag 361 gaatggcctc ctcccttagc tccctgttct tgggtaagga aatggaatac tgaagggccc 421 ttcactgcct ttgctcctcc catgttatgc ccagcgtttg atgggtagca gagagaacaa 481 aaaacaccac aaggctattt ttccccctgc attctttctg tattgagtat cctttcagtg 541 ttattagtgt atgctttgaa tgtaaaaatt ggtcacccta aggaaaggaa ttggcatgtg 601 tatgttccca gttcaactca tggagatggc agctgtttaa atgtttttct atgtagttta 661 taaattaaaa ctgaattgag gactatggaa atgtaggcca aatttgtagt gccaacattt 721 tagttctttg gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaa // LOCUS HSFLA1A 5133 bp RNA PRI 23-MAR-1995 DEFINITION Human mRNA for leukocyte-associated molecule-1 alpha subunit (LFA-1 alpha subunit). ACCESSION Y00796 NID g31421 KEYWORDS LFA-1 alpha subunit. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5133) AUTHORS Larson,R. TITLE Direct Submission JOURNAL Submitted (26-JAN-1989) Larson R., Center of Blood Research, 800 Huntington Ave., Boston, Ma 02115, USA REFERENCE 2 (bases 1 to 5133) AUTHORS Larson,R.S., Corbi,A.L., Berman,L. and Springer,T. TITLE Primary structure of the leukocyte function-associated molecule-1 alpha subunit: an integrin with an embedded domain defining a protein superfamily JOURNAL J. Cell Biol. 108 (2), 703-712 (1989) MEDLINE 89139587 FEATURES Location/Qualifiers source 1..5133 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HL60" /map="chromosome 16" sig_peptide 89..164 /note="signal peptide (AA -25 to -1)" CDS 89..3601 /note="LFA-1 alpha subunit precursor (AA -25 to 1145)" /codon_start=1 /db_xref="PID:g31422" /db_xref="SWISS-PROT:P20701" /translation="MKDSCITVMAMALLSGFFFFAPASSYNLDVRGARSFSPPRAGRH FGYRVLQVGNGVIVGAPGEGNSTGSLYQCQSGTGHCLPVTLRGSNYTSKYLGMTLATD PTDGSILACDPGLSRTCDQNTYLSGLCYLFRQNLQGPMLQGRPGFQECIKGNVDLVFL FDGSMSLQPDEFQKILDFMKDVMKKLSNTSYQFAAVQFSTSYKTEFDFSDYVKWKDPD ALLKHVKHMLLLTNTFGAINYVATEVFREELGARPDATKVLIIITDGEATDSGNIDAA KDIIRYIIGIGKHFQTKESQETLHKFASKPASEFVKILDTFEKLKDLFTELQKKIYVI EGTSKQDLTSFNMELSSSGISADLSRGHAVVGAVGAKDWAGGFLDLKADLQDDTFIGN EPLTPEVRAGYLGYTVTWLPSRQKTSLLASGAPRYQHMGRVLLFQEPQGGGHWSQVQT IHGTQIGSYFGGELCGVDVDQDGETELLLIGAPLFYGEQRGGRVFIYQRRQLGFEEVS ELQGDPGYPLGRFGEAITALTDINGDGLVDVAVGAPLEEQGAVYIFNGRHGGLSPQPS QRIEGTQVLSGIQWFGRSIHGVKDLEGDGLADVAVGAESQMIVLSSRPVVDMVTLMSF SPAEIPVHEVECSYSTSNKMKEGVNITICFQIKSLYPQFQGRLVANLTYTLQLDGHRT RRRGLFPGGRHELRRNIAVTTSMSCTDFSFHFPVCVQDLISPINVSLNFSLWEEEGTP RDQRAQGKDIPPILRPSLHSETWEIPFEKNCGEDKKCEANLRVSFSPARSRALRLTAF ASLSVELSLSNLEEDAYWVQLDLHFPPGLSFRKVEMLKPHSQIPVSCEELPEESRLLS RALSCNVSSPIFKAGHSVALQMMFNTLVNSSWGDSVELHANVTCNNEDSDLLEDNSAT TIIPILYPINILIQDQEDSTLYVSFTPKGPKIHQVKHMYQVRIQPSIHDHNIPTLEAV VGVPQPPSEGPITHQWSVQMEPPVPCHYEDLERLPDAAEPCLPGALFRCPVVFRQEIL VQVIGTLELVGEIEASSMFSLCSSLSISFNSSKHFHLYGSNASLAQVVMKVDVVYEKQ MLYLYVLSGIGGLLLLLLIFIVLYKVGFFKRNLKEKMEAGRGVPNGIPAEDSEQLASG QEAGDPGCLKPLHEKDSESGGGKD" mat_peptide 164..3598 /note="mature LFA-1 alpha subunit (AA 1-1145)" BASE COUNT 1163 a 1435 c 1400 g 1135 t ORIGIN 1 cctctttcac cctgtctagg ttgccagcaa atcccacggg cctcctgacg ctgcccctgg 61 ggccacaggt ccctcgagtg ctggaaggat gaaggattcc tgcatcactg tgatggccat 121 ggcgctgctg tctgggttct ttttcttcgc gccggcctcg agctacaacc tggacgtgcg 181 gggcgcgcgg agcttctccc caccgcgcgc cgggaggcac tttggatacc gcgtcctgca 241 ggtcggaaac ggggtcatcg tgggagctcc aggggagggg aacagcacag gaagcctcta 301 tcagtgccag tcgggcacag gacactgcct gccagtcacc ctgagaggtt ccaactatac 361 ctccaagtac ttgggaatga ccttggcaac agaccccaca gatggaagca ttttggcctg 421 tgaccctggg ctgtctcgaa cgtgtgacca gaacacctat ctgagtggcc tgtgttacct 481 cttccgccag aatctgcagg gtcccatgct gcaggggcgc cctggttttc aggaatgtat 541 caagggcaac gtagacctgg tatttctgtt tgatggttcg atgagcttgc agccagatga 601 atttcagaaa attctggact tcatgaagga tgtgatgaag aaactcagca acacttcgta 661 ccagtttgct gctgttcagt tttccacaag ctacaaaaca gaatttgatt tctcagatta 721 tgttaaatgg aaggaccctg atgctctgct gaagcatgta aagcacatgt tgctgttgac 781 caataccttt ggtgccatca attatgtcgc gacagaggtg ttccgggagg agctgggggc 841 ccggccagat gccaccaaag tgcttatcat catcacggat ggggaggcca ctgacagtgg 901 caacatcgat gcggccaaag acatcatccg ctacatcatc gggattggaa agcattttca 961 gaccaaggag agtcaggaga ccctccacaa atttgcatca aaacccgcga gcgagtttgt 1021 gaaaattctg gacacatttg agaagctgaa agatctattc actgagctgc agaagaagat 1081 ctatgtcatt gagggcacaa gcaaacagga cctgacttcc ttcaacatgg agctgtcctc 1141 cagcggcatc agtgctgacc tcagcagggg ccatgcagtc gtgggggcag taggagccaa 1201 ggactgggct gggggctttc ttgacctgaa ggcagacctg caggatgaca catttattgg 1261 gaatgaacca ttgacaccag aagtgagagc aggctatttg ggttacaccg tgacctggct 1321 gccctcccgg caaaagactt cgttgctggc ctcgggagcc cctcgatacc agcacatggg 1381 ccgagtgctg ctgttccaag agccacaggg cggaggacac tggagccagg tccagacaat 1441 ccatgggacc cagattggct cttatttcgg tggggagctg tgtggcgtcg acgtggacca 1501 agatggggag acagagctgc tgctgattgg tgccccactg ttctatgggg agcagagagg 1561 aggccgggtg tttatctacc agagaagaca gttggggttt gaagaagtct cagagctgca 1621 gggggacccc ggctacccac tcgggcggtt tggagaagcc atcactgctc tgacagacat 1681 caacggcgat gggctggtag acgtggctgt gggggcccct ctggaggagc agggggctgt 1741 gtacatcttc aatgggaggc acggggggct tagtccccag ccaagtcagc ggatagaagg 1801 gacccaagtg ctctcaggaa ttcagtggtt tggacgctcc atccatgggg tgaaggacct 1861 tgaaggggat ggcttggcag atgtggctgt gggggctgag agccagatga tcgtgctgag 1921 ctcccggccc gtggtggata tggtcaccct gatgtccttc tctccagctg agatcccagt 1981 gcatgaagtg gagtgctcct attcaaccag taacaagatg aaagaaggag ttaatatcac 2041 aatctgtttc cagatcaagt ctctctaccc ccagttccaa ggccgcctgg ttgccaatct 2101 cacttacact ctgcagctgg atggccaccg gaccagaaga cgggggttgt tcccaggagg 2161 gagacatgaa ctcagaagga atatagctgt caccaccagc atgtcatgca ctgacttctc 2221 atttcatttc ccggtatgtg ttcaagacct catctccccc atcaatgttt ccctgaattt 2281 ctctctttgg gaggaggaag ggacaccgag ggaccaaagg gcgcagggca aggacatacc 2341 gcccatcctg agaccctccc tgcactcgga aacctgggag atcccttttg agaagaactg 2401 tggggaggac aagaagtgtg aggcaaactt gagagtgtcc ttctctcctg caagatccag 2461 agccctgcgt ctaactgctt ttgccagcct ctctgtggag ctgagcctga gtaacttgga 2521 agaagatgct tactgggtcc agctggacct gcacttcccc ccgggactct ccttccgcaa 2581 ggtggagatg ctgaagcccc atagccagat acctgtgagc tgcgaggagc ttcctgaaga 2641 gtccaggctt ctgtccaggg cattatcttg caatgtgagc tctcccatct tcaaagcagg 2701 ccactcggtt gctctgcaga tgatgtttaa tacactggta aacagctcct ggggggactc 2761 ggttgaattg cacgccaatg tgacctgtaa caatgaggac tcagacctcc tggaggacaa 2821 ctcagccact accatcatcc ccatcctgta ccccatcaac atcctcatcc aggaccaaga 2881 agactccaca ctctatgtca gtttcacccc caaaggcccc aagatccacc aagtcaagca 2941 catgtaccag gtgaggatcc agccttccat ccacgaccac aacataccca ccctggaggc 3001 tgtggttggg gtgccacagc ctcccagcga ggggcccatc acacaccagt ggagcgtgca 3061 gatggagcct cccgtgccct gccactatga ggatctggag aggctcccgg atgcagctga 3121 gccttgtctc cccggagccc tgttccgctg ccctgttgtc ttcaggcagg agatcctcgt 3181 ccaagtgatc gggactctgg agctggtggg agagatcgag gcctcttcca tgttcagcct 3241 ctgcagctcc ctctccatct ccttcaacag cagcaagcat ttccacctct atggcagcaa 3301 cgcctccctg gcccaggttg tcatgaaggt tgacgtggtg tatgagaagc agatgctcta 3361 cctctacgtg ctgagcggca tcggggggct gctgctgctg ctgctcattt tcatagtgct 3421 gtacaaggtt ggtttcttca aacggaacct gaaggagaag atggaggctg gcagaggtgt 3481 cccgaatgga atccctgcag aagactctga gcagctggca tctgggcaag aggctgggga 3541 tcccggctgc ctgaagcccc tccatgagaa ggactctgag agtggtggtg gcaaggactg 3601 agtccaggcc tgtgaggtgc agagtgccca gaactggact caggatgccc agggccactc 3661 tgcctctgcc tgcattctgc cgtgtgccct cgggcgagtc actgcctctc cctggccctc 3721 agtttcccta tctcgaacat ggaactcatt cctgaatgtc tcctttgcag gctcataggg 3781 aagacctgct gagggaccag ccaagagggc tgcaaaagtg agggcttgtc attaccagac 3841 ggttcaccag cctctcttgg ttccttcctt ggaagagaat gtctgatcta aatgtggaga 3901 aactgtagtc tcaggaccta gggatgttct ggccctcacc cctgccctgg gatgtccaca 3961 gatgcctcca ccccccagaa cctgtccttg cacactcccc tgcactggag tccagtctct 4021 tctgctggca gaaagcaaat gtgacctgtg tcactacgtg actgtggcac acgccttgtt 4081 cttggccaaa gaccaaattc cttggcatgc cttccagcac cctgcaaaat gagaccctcg 4141 tggccttccc cagcctcttc tagagccgtg atgcctccct gttgaagctc tggtgacacc 4201 agcctttctc ccaggccagg ctccttcctg tcttcctgca ttcacccaga cagctccctc 4261 tgcctgaacc ttccatctcg cccacccctc cttccttgac cagcagatcc cagctcacgt 4321 cacacacttg gttgggtcct cacatctttc acacttccac caccctgcac tactccctca 4381 aagcacacgt catgtttctt catccggcag cctggatgtt ttttccctgt ttaatgattg 4441 acgtacttag cagctatctc tcagtgaact gtgagggtaa aggctatact tgtcttgttc 4501 accttgggat gacgccgcat gatatgtcag ggcgtgggac atctagtagg tgcttgacat 4561 aatttcactg aattaatgac agagccagtg ggaagataca gaaaaagagg gccggggctg 4621 ggcgcggtgg ttcacgcctg taatcccagc actttgggag gccaaggagg gtggatcacc 4681 tgaggtcagg agttagaggc cagcctggcg aaaccccatc tctactaaaa atacaaaatc 4741 caggcgtggt ggcacacacc tgtagtccca gctactcagg aggttgaggt aggagaattg 4801 cttgaacctg ggaggtggag gttgcagtga gccaagattg cgccattgca ctccagcctg 4861 ggcaacacag cgagactccg tctcaaggaa aaaataaaaa taaaaagcgg gcacgggccc 4921 ggacatcccc acccttggag gctgtcttct caggctctgc cctgccctag ctccacaccc 4981 tctcccagga cccatcacgc ctgtgcagtg gcccccacag aaagactgag ctcaaggtgg 5041 gaaccacgtc tgctaacttg gagccccagt gccaagcaca gtgcctgcat gtatttatcc 5101 aataaatgtg aaattctgtc caaaaaaaaa aaa // LOCUS HSFLAP 540 bp RNA PRI 26-NOV-1992 DEFINITION H.sapiens mRNA for five-lipoxygenase activating protein (FLAP). ACCESSION X52195 NID g31425 KEYWORDS 5-lipoxygenase activating protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 540) AUTHORS Dixon,R.A., Diehl,R.E., Opas,E., Rands,E., Vickers,P.J., Evans,J.F., Gillard,J.W. and Miller,D.K. TITLE Requirement of a 5-lipoxygenase-activating protein for leukotriene synthesis JOURNAL Nature 343 (6255), 282-284 (1990) MEDLINE 90136904 FEATURES Location/Qualifiers source 1..540 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 31..516 /codon_start=1 /product="five-lipoxygenase activating protein (FLAP)" /db_xref="PID:g31426" /db_xref="SWISS-PROT:P20292" /translation="MDQETVGNVVLLAIVTLISVVQNGFFAHKVEHESRTQNGRSFQR TGTLAFERVYTANQNCVDAYPTFLAVLWSAGLLCSQVPAAFAGLMYLFVRQKYFVGYL GERTQSTPGYIFGKRIILFLFLMSVAGIFNYYLIFFFGSDFENYIKTISTTISPLLLI S" BASE COUNT 119 a 139 c 128 g 154 t ORIGIN 1 tgcgttttgg gggttcctgg agtatcaatc atggatcaag aaactgtagg caatgttgtc 61 ctgttggcca tcgtcaccct catcagcgtg gtccagaatg gattctttgc ccataaagtg 121 gagcacgaaa gcaggaccca gaatgggagg agcttccaga ggaccggaac acttgccttt 181 gagcgggtct acactgccaa ccagaactgt gtagatgcgt accccacttt cctcgctgtg 241 ctctggtctg cggggctact ttgcagccaa gttcctgctg cgtttgctgg actgatgtac 301 ttgtttgtgc ggcaaaagta ctttgtcggt tacctaggag agagaacgca gagcacccct 361 ggctacatat ttgggaaacg catcatactc ttcctgttcc tcatgtccgt tgctggcata 421 ttcaactatt acctcatctt ctttttcgga agtgactttg aaaactacat aaagacgatc 481 tccaccacca tctcccctct acttctcatt tcctaactct ctgctgaata tggggttggt // LOCUS HSFLMON2R 2148 bp RNA PRI 18-AUG-1994 DEFINITION H.sapiens mRNA for flavin-containing monooxygenase 4. ACCESSION Z11737 S46255 NID g31429 KEYWORDS flavin-containing monooxygenase 4. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2148) AUTHORS Dolphin,C.T., Shephard,E.A., Povey,S., Smith,R.L. and Phillips,I.R. TITLE Cloning, primary sequence and chromosomal localization of human FMO2, a new member of the flavin-containing mono-oxygenase family JOURNAL Biochem. J. 287 (Pt 1), 261-267 (1992) MEDLINE 93038564 REFERENCE 2 (bases 1 to 2148) AUTHORS Dolphin,C.T. TITLE Direct Submission JOURNAL Submitted (24-FEB-1992) C.T. Dolphin, Biochemistry, Queen Mary & Westfield College, University of London, Mile End Road, London, E1 4NS, U.K FEATURES Location/Qualifiers source 1..2148 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="Adult" /tissue_type="Liver" /clone_lib="lambda gt11 library from Dr S. Woo" /clone="13W, 2A1" 5'UTR 1..217 CDS 218..1894 /codon_start=1 /product="flavin-containing monooxygenase 4" /db_xref="PID:g31430" /db_xref="SWISS-PROT:P31512" /translation="MAKKVAVIGAGVSGLSSIKCCVDEDLEPTCFERSDDIGGLWKFT ESSKDGMTRVYKSLVTNVCKEMSCYSDFPFHEDYPNFMNHEKFWDYLQEFAEHFDLLK YIQFKTTVCSITKRPDFSETGQWDVVTETEGKQNRAVFDAVMVCTGHFLNPHLPLEAF PGIHKFKGQILHSQEYKIPEGFQGKRVLVIGLGNTGGDIAVELSRTAAQVLLSTRTGT WVLGRSSDWGYPYNMMVTRRCCSFIAQVLPSRFLNWIQERKLNKRFNHEDYGLSITKG KKAKFIVNDELPNCILCGAITMKTSVIEFTETSAVFEDGTVEENIDVVIFTTGYTFSF PFFEEPLKSLCTKKIFLYKQVFPLNLERATLAIIGLIGLKGSILSGTELQARWVTRVF KGLCKIPPSQKLMMEATEKEQLIKRGVFKDTSKDKFDYIAYMDDIAACIGTKPSIPLL FLKDPRLAWEVFFGPCTPYQYRLMGPGKWDGARNAILTQWDRTLKPLKTRIVPDSSKP ASMSHYLKAWGAPVLLASLLLICKSSLFLKLVRDKLQDRMSPYLVSLWRG" 3'UTR 1895..2148 polyA_signal 2116..2121 polyA_site 2138 BASE COUNT 626 a 467 c 443 g 612 t ORIGIN 1 aagacaaaca ctttccttga ctttgagaaa taatttaagt caaagaatct gctctatgct 61 aaccaagaga tagagcacag caaagatctg ccagccccag gcctctacct agtggcctgg 121 aaattcaagt attcttattg gtggaggcca tttgtttctg attagaagct gtctaaacct 181 cctactcctc aactcaaagg aaaacacaga gcataccatg gccaagaaag ttgcagtgat 241 tggagctggt gtgagtggcc tctcctccat caaatgctgt gtggatgagg acctggagcc 301 cacctgcttt gagagaagtg atgacattgg gggattatgg aagtttactg aatcttccaa 361 agatgggatg accagggtct ataagtcatt agtgacaaat gtctgtaagg aaatgtcatg 421 ttacagtgac ttccctttcc acgaagatta tcctaatttc atgaaccatg aaaaattttg 481 ggactatctc caagaatttg ctgagcactt tgacctcctg aaatacattc agtttaagac 541 cactgtgtgc agcataacga agcgtccaga cttctccgaa actggtcagt gggatgttgt 601 cacagagaca gagggcaagc aaaatagagc tgtctttgat gctgttatgg tttgcactgg 661 acatttcctg aatccccatt tacctttgga agcctttcct ggaattcata agtttaaagg 721 tcagatcctg catagtcaag agtacaagat cccagaaggc tttcagggca aacgcgtctt 781 ggtgattggt cttgggaaca ctggaggaga cattgctgtg gaactcagtc gaacggcagc 841 tcaggtactt ctcagtacta gaactggtac ctgggttctt gggcgctctt cagattgggg 901 ctatccttat aatatgatgg ttacaagaag atgctgtagt tttattgcac aagttctgcc 961 ttcacgtttt ctaaactgga ttcaagaaag gaagttgaat aagagattta atcatgagga 1021 ttatggatta agtattacca aagggaaaaa agcaaaattc attgtgaatg atgagctgcc 1081 aaactgtatc ctctgtgggg caatcactat gaaaaccagc gtgattgaat ttacagaaac 1141 ctctgctgtc tttgaagatg ggacagtgga agaaaacatt gatgttgtga tcttcactac 1201 aggatataca ttttcttttc cattttttga agaacctctt aaaagcctct gtacaaagaa 1261 gatatttcta tacaagcaag tctttccctt aaacctagag agagcgacat tagccatcat 1321 cggccttatc ggccttaaag gatccatctt atcaggcaca gagctccaag cacgatgggt 1381 cacaagagta ttcaaaggac tctgtaagat acctccatcc caaaaattga tgatggaggc 1441 tactgaaaag gaacagctca ttaaaagggg agtgtttaaa gacaccagca aagacaaatt 1501 tgactacatt gcctacatgg atgatatcgc tgcctgcata ggcacaaagc ccagcatccc 1561 acttctgttc ctcaaggatc ccagactagc ttgggaagtt ttctttggac catgtactcc 1621 ttatcagtac cgcctcatgg gccctggaaa atgggatgga gccagaaatg ccatcctgac 1681 ccagtgggac agaacattga aacctttaaa aactcgaatt gtccctgatt cctccaagcc 1741 tgcctccatg tcacattatt taaaagcctg gggggcacct gtcctacttg cctctcttct 1801 acttatctgt aaatcttcac ttttcttgaa attggtgaga gataaactac aggacagaat 1861 gtccccttac ctagtaagtc tttggcgagg atgaacctga ttgttacaag ggttacacca 1921 agtcatgcta attctatctc caagtatctt gtgcatccct cctctgctct ccatcataac 1981 tgctattagc caaattcagg cccagtcatc tcctatctga attattgtat tatcttcttc 2041 tttgttttca gtaccctctt tcttgccacc ctttccaatg catcttctac cctgctacct 2101 cagtgattat tctaaaataa atatatatga tatggtttaa aaaaaaaa // LOCUS HSFLT 7680 bp RNA PRI 15-NOV-1993 DEFINITION Human flt mRNA for receptor-related tyrosine kinase. ACCESSION X51602 NID g31431 KEYWORDS flt gene; fms-related tyrosine kinase gene; tyrosine kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7680) AUTHORS Shibuya,M. TITLE Direct Submission JOURNAL Submitted (02-JAN-1989) Shibuya M., Institute of Medical Science, University of Tokyo, 4-6-1 Shirokane-dai, Minato-ku, Tokyo 108, Japan REFERENCE 2 (bases 1 to 7680) AUTHORS Shibuya,M., Yamaguchi,S., Yamane,A., Ikeda,T., Tojo,A., Matsushime,H. and Sato,M. TITLE Nucleotide sequence and expression of a novel human receptor-type tyrosine kinase gene (flt) closely related to the fms family JOURNAL Oncogene 5 (4), 519-524 (1990) MEDLINE 90221591 REFERENCE 3 (bases 1 to 7680) AUTHORS Han,H.J., Fujiwara,T., Shin,S. and Nakamura,Y. TITLE Dinucleotide repeat polymorphism in the 3' non-coding region of the FLTI gene JOURNAL Hum. Mol. Genet. 2 (12), 2204 (1993) MEDLINE 94154724 COMMENT Data kindly reviewed (20-JUL-1990) by Shibuya M. FEATURES Location/Qualifiers source 1..7680 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone="3-7, 3-5" /chromosome="13" /map="13q12" CDS 250..4266 /note="flt gene product (AA 1-1338)" /codon_start=1 /db_xref="PID:g31432" /db_xref="SWISS-PROT:P17948" /translation="MVSYWDTGVLLCALLSCLLLTGSSSGSKLKDPELSLKGTQHIMQ AGQTLHLQCRGEAAHKWSLPEMVSKESERLSITKSACGRNGKQFCSTLTLNTAQANHT GFYSCKYLAVPTSKKKETESAIYIFISDTGRPFVEMYSEIPEIIHMTEGRELVIPCRV TSPNITVTLKKFPLDTLIPDGKRIIWDSRKGFIISNATYKEIGLLTCEATVNGHLYKT NYLTHRQTNTIIDVQISTPRPVKLLRGHTLVLNCTATTPLNTRVQMTWSYPDEKNKRA SVRRRIDQSNSHANIFYSVLTIDKMQNKDKGLYTCRVRSGPSFKSVNTSVHIYDKAFI TVKHRKQQVLETVAGKRSYRLSMKVKAFPSPEVVWLKDGLPATEKSARYLTRGYSLII KDVTEEDAGNYTILLSIKQSNVFKNLTATLIVNVKPQIYEKAVSSFPDPALYPLGSRQ ILTCTAYGIPQPTIKWFWHPCNHNHSEARCDFCSNNEESFILDADSNMGNRIESITQR MAIIEGKNKMASTLVVADSRISGIYICIASNKVGTVGRNISFYITDVPNGFHVNLEKM PTEGEDLKLSCTVNKFLYRDVTWILLRTVNNRTMHYSISKQKMAITKEHSITLNLTIM NVSLQDSGTYACRARNVYTGEEILQKKEITIRDQEAPYLLRNLSDHTVAISSSTTLDC HANGVPEPQITWFKNNHKIQQEPGIILGPGSSTLFIERVTEEDEGVYHCKATNQKGSV ESSAYLTVQGTSDKSNLELITLTCTCVAATLFWLLLTLLIRKMKRSSSEIKTDYLSII MDPDEVPLDEQCERLPYDASKWEFARERLKLGKSLGRGAFGKVVQASAFGIKKSPTCR TVAVKMLKEGATASEYKALMTELKILTHIGHHLNVVNLLGACTKQGGPLMVIVEYCKY GNLSNYLKSKRDLFFLNKDAALHMEPKKEKMEPGLEQGKKPRLDSVTSSESFASSGFQ EDKSLSDVEEEEDSDGFYKEPITMEDLISYSFQVARGMEFLSSRKCIHRDLAARNILL SENNVVKICDFGLARDIYKNPDYVRKGDTRLPLKWMAPESIFDKIYSTKSDVWSYGVL LWEIFSLGGSPYPGVQMDEDFCSRLREGMRMRAPEYSTPEIYQIMLDCWHRDPKERPR FAELVEKLGDLLQANVQQDGKDYIPINAILTGNSGFTYSTPAFSEDFFKESISAPKFN SGSSDDVRYVNAFKFMSLERIKTFEELLPNATSMFDDYQGDSSTLLASPMLKRFTWTD SKPKASLKIDLRVTSKSKESGLSDVSRPSFCHSSCGHVSEGKRRFTYDHAELERKIAC CSPPPDYNSVVLYSTPPI" BASE COUNT 2279 a 1661 c 1739 g 2001 t ORIGIN 1 gcggacactc ctctcggctc ctccccggca gcggcggcgg ctcggagcgg gctccggggc 61 tcgggtgcag cggccagcgg gcctggcggc gaggattacc cggggaagtg gttgtctcct 121 ggctggagcc gcgagacggg cgctcagggc gcggggccgg cggcggcgaa cgagaggacg 181 gactctggcg gccgggtcgt tggccggggg agcgcgggca ccgggcgagc aggccgcgtc 241 gcgctcacca tggtcagcta ctgggacacc ggggtcctgc tgtgcgcgct gctcagctgt 301 ctgcttctca caggatctag ttcaggttca aaattaaaag atcctgaact gagtttaaaa 361 ggcacccagc acatcatgca agcaggccag acactgcatc tccaatgcag gggggaagca 421 gcccataaat ggtctttgcc tgaaatggtg agtaaggaaa gcgaaaggct gagcataact 481 aaatctgcct gtggaagaaa tggcaaacaa ttctgcagta ctttaacctt gaacacagct 541 caagcaaacc acactggctt ctacagctgc aaatatctag ctgtacctac ttcaaagaag 601 aaggaaacag aatctgcaat ctatatattt attagtgata caggtagacc tttcgtagag 661 atgtacagtg aaatccccga aattatacac atgactgaag gaagggagct cgtcattccc 721 tgccgggtta cgtcacctaa catcactgtt actttaaaaa agtttccact tgacactttg 781 atccctgatg gaaaacgcat aatctgggac agtagaaagg gcttcatcat atcaaatgca 841 acgtacaaag aaatagggct tctgacctgt gaagcaacag tcaatgggca tttgtataag 901 acaaactatc tcacacatcg acaaaccaat acaatcatag atgtccaaat aagcacacca 961 cgcccagtca aattacttag aggccatact cttgtcctca attgtactgc taccactccc 1021 ttgaacacga gagttcaaat gacctggagt taccctgatg aaaaaaataa gagagcttcc 1081 gtaaggcgac gaattgacca aagcaattcc catgccaaca tattctacag tgttcttact 1141 attgacaaaa tgcagaacaa agacaaagga ctttatactt gtcgtgtaag gagtggacca 1201 tcattcaaat ctgttaacac ctcagtgcat atatatgata aagcattcat cactgtgaaa 1261 catcgaaaac agcaggtgct tgaaaccgta gctggcaagc ggtcttaccg gctctctatg 1321 aaagtgaagg catttccctc gccggaagtt gtatggttaa aagatgggtt acctgcgact 1381 gagaaatctg ctcgctattt gactcgtggc tactcgttaa ttatcaagga cgtaactgaa 1441 gaggatgcag ggaattatac aatcttgctg agcataaaac agtcaaatgt gtttaaaaac 1501 ctcactgcca ctctaattgt caatgtgaaa ccccagattt acgaaaaggc cgtgtcatcg 1561 tttccagacc cggctctcta cccactgggc agcagacaaa tcctgacttg taccgcatat 1621 ggtatccctc aacctacaat caagtggttc tggcacccct gtaaccataa tcattccgaa 1681 gcaaggtgtg acttttgttc caataatgaa gagtccttta tcctggatgc tgacagcaac 1741 atgggaaaca gaattgagag catcactcag cgcatggcaa taatagaagg aaagaataag 1801 atggctagca ccttggttgt ggctgactct agaatttctg gaatctacat ttgcatagct 1861 tccaataaag ttgggactgt gggaagaaac ataagctttt atatcacaga tgtgccaaat 1921 gggtttcatg ttaacttgga aaaaatgccg acggaaggag aggacctgaa actgtcttgc 1981 acagttaaca agttcttata cagagacgtt acttggattt tactgcggac agttaataac 2041 agaacaatgc actacagtat tagcaagcaa aaaatggcca tcactaagga gcactccatc 2101 actcttaatc ttaccatcat gaatgtttcc ctgcaagatt caggcaccta tgcctgcaga 2161 gccaggaatg tatacacagg ggaagaaatc ctccagaaga aagaaattac aatcagagat 2221 caggaagcac catacctcct gcgaaacctc agtgatcaca cagtggccat cagcagttcc 2281 accactttag actgtcatgc taatggtgtc cccgagcctc agatcacttg gtttaaaaac 2341 aaccacaaaa tacaacaaga gcctggaatt attttaggac caggaagcag cacgctgttt 2401 attgaaagag tcacagaaga ggatgaaggt gtctatcact gcaaagccac caaccagaag 2461 ggctctgtgg aaagttcagc atacctcact gttcaaggaa cctcggacaa gtctaatctg 2521 gagctgatca ctctaacatg cacctgtgtg gctgcgactc tcttctggct cctattaacc 2581 ctccttatcc gaaaaatgaa aaggtcttct tctgaaataa agactgacta cctatcaatt 2641 ataatggacc cagatgaagt tcctttggat gagcagtgtg agcggctccc ttatgatgcc 2701 agcaagtggg agtttgcccg ggagagactt aaactgggca aatcacttgg aagaggggct 2761 tttggaaaag tggttcaagc atcagcattt ggcattaaga aatcacctac gtgccggact 2821 gtggctgtga aaatgctgaa agagggggcc acggccagcg agtacaaagc tctgatgact 2881 gagctaaaaa tcttgaccca cattggccac catctgaacg tggttaacct gctgggagcc 2941 tgcaccaagc aaggagggcc tctgatggtg attgttgaat actgcaaata tggaaatctc 3001 tccaactacc tcaagagcaa acgtgactta ttttttctca acaaggatgc agcactacac 3061 atggagccta agaaagaaaa aatggagcca ggcctggaac aaggcaagaa accaagacta 3121 gatagcgtca ccagcagcga aagctttgcg agctccggct ttcaggaaga taaaagtctg 3181 agtgatgttg aggaagagga ggattctgac ggtttctaca aggagcccat cactatggaa 3241 gatctgattt cttacagttt tcaagtggcc agaggcatgg agttcctgtc ttccagaaag 3301 tgcattcatc gggacctggc agcgagaaac attcttttat ctgagaacaa cgtggtgaag 3361 atttgtgatt ttggccttgc ccgggatatt tataagaacc ccgattatgt gagaaaagga 3421 gatactcgac ttcctctgaa atggatggct cccgaatcta tctttgacaa aatctacagc 3481 accaagagcg acgtgtggtc ttacggagta ttgctgtggg aaatcttctc cttaggtggg 3541 tctccatacc caggagtaca aatggatgag gacttttgca gtcgcctgag ggaaggcatg 3601 aggatgagag ctcctgagta ctctactcct gaaatctatc agatcatgct ggactgctgg 3661 cacagagacc caaaagaaag gccaagattt gcagaacttg tggaaaaact aggtgatttg 3721 cttcaagcaa atgtacaaca ggatggtaaa gactacatcc caatcaatgc catactgaca 3781 ggaaatagtg ggtttacata ctcaactcct gccttctctg aggacttctt caaggaaagt 3841 atttcagctc cgaagtttaa ttcaggaagc tctgatgatg tcagatatgt aaatgctttc 3901 aagttcatga gcctggaaag aatcaaaacc tttgaagaac ttttaccgaa tgccacctcc 3961 atgtttgatg actaccaggg cgacagcagc actctgttgg cctctcccat gctgaagcgc 4021 ttcacctgga ctgacagcaa acccaaggcc tcgctcaaga ttgacttgag agtaaccagt 4081 aaaagtaagg agtcggggct gtctgatgtc agcaggccca gtttctgcca ttccagctgt 4141 gggcacgtca gcgaaggcaa gcgcaggttc acctacgacc acgctgagct ggaaaggaaa 4201 atcgcgtgct gctccccgcc cccagactac aactcggtgg tcctgtactc caccccaccc 4261 atctagagtt tgacacgaag ccttatttct agaagcacat gtgtatttat acccccagga 4321 aactagcttt tgccagtatt atgcatatat aagtttacac ctttatcttt ccatgggagc 4381 cagctgcttt ttgtgatttt tttaatagtg cttttttttt ttgactaaca agaatgtaac 4441 tccagataga gaaatagtga caagtgaaga acactactgc taaatcctca tgttactcag 4501 tgttagagaa atccttccta aacccaatga cttccctgct ccaacccccg ccacctcagg 4561 gcacgcagga ccagtttgat tgaggagctg cactgatcac ccaatgcatc acgtacccca 4621 ctgggccagc cctgcagccc aaaacccagg gcaacaagcc cgttagcccc aggggatcac 4681 tggctggcct gagcaacatc tcgggagtcc tctagcaggc ctaagacatg tgaggaggaa 4741 aaggaaaaaa agcaaaaagc aagggagaaa agagaaaccg ggagaaggca tgagaaagaa 4801 tttgagacgc accatgtggg cacggagggg gacggggctc agcaatgcca tttcagtggc 4861 ttcccagctc tgacccttct acatttgagg gcccagccag gagcagatgg acagcgatga 4921 ggggacattt tctggattct gggaggcaag aaaaggacaa atatcttttt tggaactaaa 4981 gcaaatttta gacctttacc tatggaagtg gttctatgtc cattctcatt cgtggcatgt 5041 tttgatttgt agcactgagg gtggcactca actctgagcc catacttttg gctcctctag 5101 taagatgcac tgaaaactta gccagagtta ggttgtctcc aggccatgat ggccttacac 5161 tgaaaatgtc acattctatt ttgggtatta atatatagtc cagacactta actcaatttc 5221 ttggtattat tctgttttgc acagttagtt gtgaaagaaa gctgagaaga atgaaaatgc 5281 agtcctgagg agagttttct ccatatcaaa acgagggctg atggaggaaa aaggtcaata 5341 aggtcaaggg aagaccccgt ctctatacca accaaaccaa ttcaccaaca cagttgggac 5401 ccaaaacaca ggaagtcagt cacgtttcct tttcatttaa tggggattcc actatctcac 5461 actaatctga aaggatgtgg aagagcatta gctggcgcat attaagcact ttaagctcct 5521 tgagtaaaaa ggtggtatgt aatttatgca aggtatttct ccagttggga ctcaggatat 5581 tagttaatga gccatcacta gaagaaaagc ccattttcaa ctgctttgaa acttgcctgg 5641 ggtctgagca tgatgggaat agggagacag ggtaggaaag ggcgcctact cttcagggtc 5701 taaagatcaa gtgggccttg gatcgctaag ctggctctgt ttgatgctat ttatgcaagt 5761 tagggtctat gtatttagga tgcgcctact cttcagggtc taaagatcaa gtgggccttg 5821 gatcgctaag ctggctctgt ttgatgctat ttatgcaagt tagggtctat gtatttagga 5881 tgtctgcacc ttctgcagcc agtcagaagc tggagaggca acagtggatt gctgcttctt 5941 ggggagaaga gtatgcttcc ttttatccat gtaatttaac tgtagaacct gagctctaag 6001 taaccgaaga atgtatgcct ctgttcttat gtgccacatc cttgtttaaa ggctctctgt 6061 atgaagagat gggaccgtca tcagcacatt ccctagtgag cctactggct cctggcagcg 6121 gcttttgtgg aagactcact agccagaaga gaggagtggg acagtcctct ccaccaagat 6181 ctaaatccaa acaaaagcag gctagagcca gaagagagga caaatctttg ttgttcctct 6241 tctttacaca tacgcaaacc acctgtgaca gctggcaatt ttataaatca ggtaactgga 6301 aggaggttaa actcagaaaa aagaagacct cagtcaattc tctacttttt tttttttttt 6361 tccaaatcag ataatagccc agcaaatagt gataacaaat aaaaccttag ctgttcatgt 6421 cttgatttca ataattaatt cttaatcatt aagagaccat aataaatact ccttttcaag 6481 agaaaagcaa aaccattaga attgttactc agctccttca aactcaggtt tgtagcatac 6541 atgagtccat ccatcagtca aagaatggtt ccatctggag tcttaatgta gaaagaaaaa 6601 tggagacttg taataatgag ctagttacaa agtgcttgtt cattaaaata gcactgaaaa 6661 ttgaaacatg aattaactga taatattcca atcatttgcc atttatgaca aaaatggttg 6721 gcactaacaa agaacgagca cttcctttca gagtttctga gataatgtac gtggaacagt 6781 ctgggtggaa tggggctgaa accatgtgca agtctgtgtc ttgtcagtcc aagaagtgac 6841 accgagatgt taattttagg gacccgtgcc ttgtttccta gcccacaaga atgcaaacat 6901 caaacagata ctcgctagcc tcatttaaat tgattaaagg aggagtgcat ctttggccga 6961 cagtggtgta actgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgggtgtg 7021 ggtgtatgtg tgttttgtgc ataactattt aaggaaactg gaattttaaa gttactttta 7081 tacaaaccaa gaatatatgc tacagatata agacagacat ggtttggtcc tatatttcta 7141 gtcatgatga atgtattttg tataccatct tcatataata tacttaaaaa tatttcttaa 7201 ttgggatttg taatcgtacc aacttaattg ataaacttgg caactgcttt tatgttctgt 7261 ctccttccat aaatttttca aaatactaat tcaacaaaga aaaagctctt ttttttccta 7321 aaataaactc aaatttatcc ttgtttagag cagagaaaaa ttaagaaaaa ctttgaaatg 7381 gtctcaaaaa attgctaaat attttcaatg gaaaactaaa tgttagttta gctgattgta 7441 tggggttttc gaacctttca ctttttgttt gttttaccta tttcacaact gtgtaaattg 7501 ccaataattc ctgtccatga aaatgcaaat tatccagtgt agatatattt gaccatcacc 7561 ctatggatat tggctagttt tgcctttatt aagcaaattc atttcagcct gaatgtctgc 7621 ctatatattc tctgctcttt gtattctcct ttgaacccgt taaaacatcc tgtggcactc // LOCUS HSFLT4X 4450 bp RNA PRI 29-NOV-1993 DEFINITION H.sapiens Flt4 mRNA for transmembrane tyrosine kinase. ACCESSION X69878 S59182 NID g297049 KEYWORDS transmembrane tyrosine kinase; tyrosine kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4450) AUTHORS Galland,F., Karamysheva,A., Pebusque,M.J., Borg,J.P., Rottapel,R., Dubreuil,P., Rosnet,O. and Birnbaum,D. TITLE The FLT4 gene encodes a transmembrane tyrosine kinase related to the vascular endothelial growth factor receptor JOURNAL Oncogene 8 (5), 1233-1240 (1993) MEDLINE 93241723 REFERENCE 2 (bases 776 to 1200) AUTHORS Galland,F., Karamysheva,A., Mattei,M.G., Rosnet,O., Marchetto,S. and Birnbaum,D. TITLE Chromosomal localization of FLT4, a novel receptor-type tyrosine kinase gene JOURNAL Genomics 13 (2), 475-478 (1992) MEDLINE 92307693 REFERENCE 3 (bases 1 to 4450) AUTHORS Galland,F. TITLE Direct Submission JOURNAL Submitted (28-DEC-1992) F. Galland, INSERM, Unite 119, 27 Bd Lei Roure, Marseille 13009, FRANCE FEATURES Location/Qualifiers source 1..4450 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 22..87 /citation=[1] CDS 22..3918 /function="tyrosine kinase" /note="unnamed protein product" /citation=[1] /codon_start=1 /db_xref="PID:g297050" /db_xref="SWISS-PROT:P35916" /translation="MQRGAALCLRLWLCLGLLDGLVSDYSMTPPTLNITEESHVIDTG DSLSISCRGQHPLEWAWPGAQEAPATGDKDSEDTGVVRDCEGTDARPYCKVLLLHEVH ANDTGSYVCYYKYIKARIEGTTAASSYVFVRDFEQPFINKPDTLLVNRKDAMWVPCLV SIPGLNVTLRSQSSVLWPDGQEVVWDDRRGMLVSTPLLHDALYLQCETTWGDQDFLSN PFLVHITGNELYDIQLLPRKSLELLVGEKLVLNCTVWAEFNSGVTFDWDYPGKQAERG KWVPERRSQQTHTELSSILTIHNVSQHDLGSYVCKANNGIQRFRESTEVIVHENPFIS VEWLKGPILEATAGDELVKLPVKLAAYPPPEFQWYKDGKALSGRHSPHALVLKEVTEA STGTYTLALWNSAAGLRRNISLELVVNVPPQIHEKEASSPSIYSRHSRQALTCTAYGV PLPLSIQWHWRPWTPCKMFAQRSLRRRQQQDLMPQCRDWRAVTTQDAVNPIESLDTWT EFVEGKNKTVSKLVIQNANVSAMYKCVVSNKVGQDERLIYFYVTTIPDGFTIESKPSE ELLEGQPVLLSCQADSYKYEHLRWYRLNLSTLHDAHGNPLLLDCKNVHLFATPLAASL EEVAPGARHATLSLSIPRVAPEHEGHYVCEVQDRRSHDKHCHKKYLSVQALEAPRLTQ NLTDLLVNVSDSLEMQCLVAGAHAPSIVWYKDERLLEEKSGVDLADSNQKLSIQRVRE EDAGPYLCSVCRPKGCVNSSASVAVEGSEDKGSMEIVILVGTGVIAVFFWVLLLLIFC NMRRPAHADIKTGYLSIIMDPGEVPLEEQCEYLSYDASQWEFPRERLHLGRVLGYGAF GKVVEASAFGIHKGSSCDTVAVKMLKEGATASEQRALMSELKILIHIGNHLNVVNLLG ACTKPQGPLMVIVEFCKYGNLSNFLRAKRDAFSPCAEKSPEQRGRFRAMVELARLDRR RPGSSDRVLFARFSKTEGGARRASPDQEAEDLWLSPLTMEDLVCYSFQVARGMEFLAS RKCIHRDLAARNILLSESDVVKICDFGLARDIYKDPDYVRKGSARLPLKWMAPESIFD KVYTTQSDVWSFGVLLWEIFSLGASPYPGVQINEEFCQRVRDGTRMRAPELATPAIRH IMLNCWSGDPKARPAFSDLVEILGDLLQGRGLQEEEEVCMAPRSSQSSEEGSFSQVST MALHIAQADAEDSPPSLQRHSLAARYYNWVSFPGCLARGAETRGSSRMKTFEEFPMTP TTYKGSVDNQTDSGMVLASEEFEQIESRHRQESGFR" misc_feature 22..2346 /citation=[1] /product="extacellular domain" misc_structure 160..366 /citation=[1] /function="IG-like domain 1" misc_structure 481..651 /citation=[1] /function="IG-like domain 2" misc_structure 763..963 /citation=[1] /function="IG-like domain 3" misc_structure 1072..1230 /citation=[1] /function="IG-like domain 4" misc_structure 1342..1635 /citation=[1] /function="IG-like domain 5" misc_structure 1741..1992 /citation=[1] /function="IG-like domain 6" misc_feature 2104..2277 /citation=[1] /product="IG-like domain 7" misc_feature 2347..2412 /citation=[1] /citation=[2] /product="transmembrane domain" misc_feature 2413..2547 /citation=[1] /citation=[2] /product="juxtamembrane domain" misc_feature 2548..2853 /citation=[1] /citation=[2] /product="tyrosine kinase domain 1" misc_feature 2854..3048 /citation=[1] /citation=[2] /product="kinase insert" misc_feature 3049..3558 /citation=[1] /citation=[2] /product="tyrosine kinase domain 2" BASE COUNT 960 a 1350 c 1354 g 786 t ORIGIN 1 acccacgcgc agcggccgga gatgcagcgg ggcgccgcgc tgtgcctgcg actgtggctc 61 tgcctgggac tcctggacgg cctggtgagt gactactcca tgaccccccc gaccttgaac 121 atcacggagg agtcacacgt catcgacacc ggtgacagcc tgtccatctc ctgcagggga 181 cagcaccccc tcgagtgggc ttggccagga gctcaggagg cgccagccac cggagacaag 241 gacagcgagg acacgggggt ggtgcgagac tgcgagggca cagacgccag gccctactgc 301 aaggtgttgc tgctgcacga ggtacatgcc aacgacacag gcagctacgt ctgctactac 361 aagtacatca aggcacgcat cgagggcacc acggccgcca gctcctacgt gttcgtgaga 421 gactttgagc agccattcat caacaagcct gacacgctct tggtcaacag gaaggacgcc 481 atgtgggtgc cctgtctggt gtccatcccc ggcctcaatg tcacgctgcg ctcgcaaagc 541 tcggtgctgt ggccagacgg gcaggaggtg gtgtgggatg accggcgggg catgctcgtg 601 tccacgccac tgctgcacga tgccctgtac ctgcagtgcg agaccacctg gggagaccag 661 gacttccttt ccaacccctt cctggtgcac atcacaggca acgagctcta tgacatccag 721 ctgttgccca ggaagtcgct ggagctgctg gtaggggaga agctggtcct caactgcacc 781 gtgtgggctg agtttaactc aggtgtcacc tttgactggg actacccagg gaagcaggca 841 gagcggggta agtgggtgcc cgagcgacgc tcccaacaga cccacacaga actctccagc 901 atcctgacca tccacaacgt cagccagcac gacctgggct cgtatgtgtg caaggccaac 961 aacggcatcc agcgatttcg ggagagcacc gaggtcattg tgcatgaaaa tcccttcatc 1021 agcgtcgagt ggctcaaagg acccatcctg gaggccacgg caggagacga gctggtgaag 1081 ctgcccgtga agctggcagc gtaccccccg cccgagttcc agtggtacaa ggatggaaag 1141 gcactgtccg ggcgccacag tccacatgcc ctggtgctca aggaggtgac agaggccagc 1201 acaggcacct acaccctcgc cctgtggaac tccgctgctg gcctgaggcg caacatcagc 1261 ctggagctgg tggtgaatgt gcccccccag atacatgaga aggaggcctc ctcccccagc 1321 atctactcgc gtcacagccg ccaggccctc acctgcacgg cctacggggt gcccctgcct 1381 ctcagcatcc agtggcactg gcggccctgg acaccctgca agatgtttgc ccagcgtagt 1441 ctccggcggc ggcagcagca agacctcatg ccacagtgcc gtgactggag ggcggtgacc 1501 acgcaggatg ccgtgaaccc catcgagagc ctggacacct ggaccgagtt tgtggaggga 1561 aagaataaga ctgtgagcaa gctggtgatc cagaatgcca acgtgtctgc catgtacaag 1621 tgtgtggtct ccaacaaggt gggccaggat gagcggctca tctacttcta tgtgaccacc 1681 atccccgacg gcttcaccat cgaatccaag ccatccgagg agctactaga gggccagccg 1741 gtgctcctga gctgccaagc cgacagctac aagtacgagc atctgcgctg gtaccgcctc 1801 aacctgtcca cgctgcacga tgcgcacggg aacccgcttc tgctcgactg caagaacgtg 1861 catctgttcg ccacccctct ggccgccagc ctggaggagg tggcacctgg ggcgcgccac 1921 gccacgctca gcctgagtat cccccgcgtc gcgcccgagc acgagggcca ctatgtgtgc 1981 gaagtgcaag accggcgcag ccatgacaag cactgccaca agaagtacct gtcggtgcag 2041 gccctggaag cccctcggct cacgcagaac ttgaccgacc tcctggtgaa cgtgagcgac 2101 tcgctggaga tgcagtgctt ggtggccgga gcgcacgcgc ccagcatcgt gtggtacaaa 2161 gacgagaggc tgctggagga aaagtctgga gtcgacttgg cggactccaa ccagaagctg 2221 agcatccagc gcgtgcgcga ggaggatgcg ggaccgtatc tgtgcagcgt gtgcagaccc 2281 aagggctgcg tcaactcctc cgccagcgtg gccgtggaag gctccgagga taagggcagc 2341 atggagatcg tgatccttgt cggtaccggc gtcatcgctg tcttcttctg ggtcctcctc 2401 ctcctcatct tctgtaacat gaggaggccg gcccacgcag acatcaagac gggctacctg 2461 tccatcatca tggaccccgg ggaggtgcct ctggaggagc aatgcgaata cctgtcctac 2521 gatgccagcc agtgggaatt cccccgagag cggctgcacc tggggagagt gctcggctac 2581 ggcgccttcg ggaaggtggt ggaagcctcc gctttcggca tccacaaggg cagcagctgt 2641 gacaccgtgg ccgtgaaaat gctgaaagag ggcgccacgg ccagcgagca gcgcgcgctg 2701 atgtcggagc tcaagatcct cattcacatc ggcaaccacc tcaacgtggt caacctcctc 2761 ggggcgtgca ccaagccgca gggccccctc atggtgatcg tggagttctg caagtacggc 2821 aacctctcca acttcctgcg cgccaagcgg gacgccttca gcccctgcgc ggagaagtct 2881 cccgagcagc gcggacgctt ccgcgccatg gtggagctcg ccaggctgga tcggaggcgg 2941 ccggggagca gcgacagggt cctcttcgcg cggttctcga agaccgaggg cggagcgagg 3001 cgggcttctc cagaccaaga agctgaggac ctgtggctga gcccgctgac catggaagat 3061 cttgtctgct acagcttcca ggtggccaga gggatggagt tcctggcttc ccgaaagtgc 3121 atccacagag acctggctgc tcggaacatt ctgctgtcgg aaagcgacgt ggtgaagatc 3181 tgtgactttg gccttgcccg ggacatctac aaagaccccg actacgtccg caagggcagt 3241 gcccggctgc ccctgaagtg gatggcccct gaaagcatct tcgacaaggt gtacaccacg 3301 cagagtgacg tgtggtcctt tggggtgctt ctctgggaga tcttctctct gggggcctcc 3361 ccgtaccctg gggtgcagat caatgaggag ttctgccagc gcgtgagaga cggcacaagg 3421 atgagggccc cggagctggc cactcccgcc atacgccaca tcatgctgaa ctgctggtcc 3481 ggagacccca aggcgagacc tgcattctcg gacctggtgg agatcctggg ggacctgctc 3541 cagggcaggg gcctgcaaga ggaagaggag gtctgcatgg ccccgcgcag ctctcagagc 3601 tcagaagagg gcagcttctc gcaggtgtcc accatggccc tacacatcgc ccaggctgac 3661 gctgaggaca gcccgccaag cctgcagcgc cacagcctgg ccgccaggta ttacaactgg 3721 gtgtcctttc ccgggtgcct ggccagaggg gctgagaccc gtggttcctc caggatgaag 3781 acatttgagg aattccccat gaccccaacg acctacaaag gctctgtgga caaccagaca 3841 gacagtggga tggtgctggc ctcggaggag tttgagcaga tagagagcag gcatagacaa 3901 gaaagcggct tcaggtagct gaagcagaga gagagaaggc agcatacgtc agcattttct 3961 tctctgcact tataagaaag atcaaagact ttaagacttt cgctatttct tctactgcta 4021 tctactacaa acttcaaaga ggaaccagga ggacaagagg agcatgaaag tggacaagga 4081 gtgtgaccac tgaagcacca cagggagggg ttaggcctcc ggatgactgc gggcaggcct 4141 ggataatatc cagcctccca caagaagctg gtggagcaga gtgttccctg actcctccaa 4201 ggaaagggag acgccctttc atggtctgct gagtaacagg tgccttccca gacactggcg 4261 ttactgcttg accaaagagc cctcaagcgg cccttatgcc agcgtgacag agggctcacc 4321 tcttgccttc taggtcactt ctcacaatgt cccttcagca cctgaccctg tgcccgccga 4381 ttattccttg gtaatatgag taatacatca aagagtagta ttaaaagcta attaatcatg 4441 tttataaaaa // LOCUS HSFMO2 1713 bp RNA PRI 06-FEB-1997 DEFINITION H.sapiens mRNA for flavin-containing monooxygenase 2. ACCESSION Y09267 NID g1834492 KEYWORDS flavin-containing monooxygenase 2; FMO2 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1713) AUTHORS Dolphin,C.T., Beckett,D.J., Shephard,E.A., Smith,R.L. and Phillips,I.R. TITLE Flavin-containing monooxygenases 2 and 5 of man: cDNA cloning and comparison of the developmental and tissue-specific expression of the corresponding genes to forms 1,3 and 4 JOURNAL Unpublished REFERENCE 2 (bases 1 to 1713) AUTHORS Dolphin,C.T. TITLE Direct Submission JOURNAL Submitted (06-NOV-1996) C.T. Dolphin, Queen Mary and Westfield College, Biochemistry, Mile End Road, London, E1 4NS, UK FEATURES Location/Qualifiers source 1..1713 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" gene 61..1476 /gene="FMO2" CDS 61..1476 /gene="FMO2" /note="truncated CDS derived from major allele of FMO2 gene due to premature stop codon" /codon_start=1 /product="flavin-containing monooxygenase 2" /db_xref="PID:e281394" /db_xref="PID:g1834493" /translation="MAKKVAVIGAGVSGLISLKCCVDEGLEPTCFERTEDIGGVWRFK ENVEDGRASIYQSVVTNTSKEMSCFSDFPMPEDFPNFLHNSKLLEYFRIFAKKFDLLK YIQFQTTVLSVRKCPDFSSSGQWKVVTQSNGKEQSAVFDAVMVCSGHHILPHIPLKSF PGMERFKGQYFHSRQYKHPDGFEGKRILVIGMGNSGSDIAVELSKNAAQVFISTRHGT WVMSRISEDGYPWDSVFHTRFRSMLRNVLPRTAVKWMIEQQMNRWFNHENYGLEPQNK YIMKEPVLNDDVPSRLLCGAIKVKSTVKELTETSAIFEDGTVEENIDVIIFATGYSFS FPFLEDSLVKVENNMVSLYKYIFPAHLDKSTLACIGLIQPLGSIFPTAELQARWVTRV FKGLCSLPSERTMMMDIIKRNEKRIDLFGESQSQTLQTNYVDYLDELALEIGAKPDFC SLLFKDPKLAVRLYFGPCNSY" misc_feature 1666..1668 /note="stop codon of full-length CDS derived from minor alternative FMO2 allele" BASE COUNT 479 a 368 c 395 g 471 t ORIGIN 1 ccaagggaga aaactattct gtcaaagaga cggtgccaaa aggcaaaaac aaaggagctg 61 atggcaaaga aggtagctgt gattggagct ggggtcagtg gcctaatttc tctgaagtgc 121 tgtgtggatg agggacttga gcccacttgc tttgagagaa ctgaagatat tggaggagtg 181 tggaggttca aagagaatgt ggaagatggc cgagcaagta tctatcaatc tgtcgttacc 241 aacaccagca aagaaatgtc ctgtttcagt gactttccaa tgcctgaaga ttttccaaac 301 ttcctgcata attctaaact tctggaatat ttcaggattt ttgctaaaaa atttgatctg 361 ctaaaatata ttcagttcca gacaactgtc cttagtgtga gaaaatgtcc agatttctca 421 tcctctggcc aatggaaggt tgtcactcag agcaacggca aggagcagag tgctgtcttt 481 gacgcagtta tggtttgcag tggccaccac attctacctc atatcccact gaagtcattt 541 ccaggtatgg agaggttcaa aggccaatat ttccatagcc gccaatacaa gcatccagat 601 ggatttgagg gaaaacgcat cctggtgatt ggaatgggaa actcaggctc agatattgct 661 gttgagctga gtaagaatgc tgctcaggtt tttatcagca ccaggcatgg cacctgggtc 721 atgagccgta tctctgaaga tggctatcct tgggactcag tgttccacac ccggtttcgt 781 tctatgctcc gcaatgtact gccacgaaca gctgtaaaat ggatgataga acaacagatg 841 aatcggtggt tcaaccatga aaattatggc cttgagcctc aaaacaaata cattatgaag 901 gaacctgtac taaatgatga tgtcccaagt cgtctactct gtggagccat caaggtgaaa 961 tctacagtga aagagctcac agaaacttct gccatctttg aggatggaac agtggaggag 1021 aacattgatg tcatcatttt tgcaacagga tatagtttct cttttccctt ccttgaagat 1081 tcactcgtta aagtagagaa taatatggtc tcactgtata aatacatatt ccccgctcac 1141 ctggacaagt caaccctcgc gtgcattggt ctcatccagc ccctaggttc cattttccca 1201 actgctgaac ttcaagctcg ttgggtgaca agagttttca aaggcttgtg tagcctgccc 1261 tcagagagaa ctatgatgat ggacattatc aaaaggaatg aaaaaagaat tgacctgttt 1321 ggagaaagcc agagccagac gttgcagacc aattatgttg actacttgga cgagctcgcc 1381 ttagagatag gtgcgaagcc agatttctgc tctctcttgt tcaaagatcc taaactggct 1441 gtgagactct atttcggacc ctgcaactcc tattagtatc gcctggttgg gcctgggcaa 1501 tgggaaggag ccagaaatgc catcttcacc cagaaacaaa gaatactgaa gccactcaag 1561 actcgggccc tgaaggattc atctaatttc tcagtttctt ttctgttgaa aatcctgggc 1621 cttcttgctg ttgttgtggc ctttttttgc caacttcaat ggtcctagtc agcataatgc 1681 tttgggcttt attatcttgt cagtcactac ctc // LOCUS HSFMO3 1913 bp RNA PRI 17-APR-1996 DEFINITION H.sapiens mRNA for flavin-containing monooxygenase 3 (FMO3). ACCESSION Z47552 NID g623239 KEYWORDS flavin-containing monooxygenase 3. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1913) AUTHORS Dolphin,C.T., Cullingford,T.E., Shephard,E.A., Smith,R.L. and Phillips,I.R. TITLE Differential developmental and tissue-specific regulation of expression of the genes encoding three members of the flavin-containing monooxygenase family of man, FMO1, FMO3 and FM04 JOURNAL Eur. J. Biochem. 235 (3), 683-689 (1996) MEDLINE 96184548 REFERENCE 2 (bases 1 to 1913) AUTHORS Dolphin,C.T. TITLE Direct Submission JOURNAL Submitted (12-JAN-1995) Colin T Dolphin, Biochemistry, Queen Mary and Westfield College, University of London, Mile End Road, London, E1 4NS, UK FEATURES Location/Qualifiers source 1..1913 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="1D16A, 1D17A, 1D18A" /dev_stage="adult" /tissue_type="liver" 5'UTR 1..93 mRNA 1..1913 CDS 94..1692 /codon_start=1 /product="flavin-containing monooxygenase 3 (FMO3)" /db_xref="PID:g623240" /translation="MGKKVAIIGAGVSGLASIRSCLEEGLEPTCFEKSNDIGGLWKFS DHAEEGRASIYKSVFSNSSKEMMCFPDFPFPDDFPNFMHNSKIQEYIIAFAKEKNLLK YIQFKTFVSSVNKHPDFATTGQWDVTTERDGKKESAVFDAVMVCSGHHVYPNLPKESF PGLNHFKGKCFHSRDYKEPGVFNGKRVLVVGLGNSGCDIATELSRTAEQVMISSRSGS WVMSRVWDNGYPWDMLLVTRFGTFLKNNLPTAISDWLYVKQMNARFKHENYGLMPLNG VLRKEPVFNDELPASILCGIVSVKPNVKEFTETSAIFEDGTIFEGIDCVIFATGYSFA YPFLDESIIKSRNNEIILFKGVFPPLLEKSTIAVIGFVQSLGAAIPTVDLQSRWAAQV IKGTCTLPSMEDMMNDINEKMEKKRKWFGKSETIQTDYIVYMDELSSFIGAKPNIPWL FLTDPKLAMEVYFGPCSPYQFRLVGPGQWPGARNAMLTQWDRSLKPMQTRVVGRLQKP CFFFHWLKLFAIPILLIAVFLVLT" 3'UTR 1693..>1913 BASE COUNT 535 a 414 c 445 g 519 t ORIGIN 1 tttctctttc aaactgccca gacggttgga caggacgtag acacacagaa gaaaagaaga 61 caaagaacgg gtaggaaaat taaaaaggtt accatgggga agaaagtggc catcattgga 121 gctggtgtga gtggcttggc ctccatcagg agctgtctgg aagaggggct ggagcccacc 181 tgctttgaga agagcaatga cattgggggc ctgtggaaat tctcagacca tgcagaggag 241 ggcagggcta gcatttacaa atcagtcttt tccaactctt ccaaagagat gatgtgtttc 301 ccagacttcc catttcccga tgacttcccc aactttatgc acaacagcaa gatccaggaa 361 tatatcattg catttgccaa agaaaagaac ctcctgaagt acatacaatt taagacattt 421 gtatccagtg taaataaaca tcctgatttt gcaactactg gccagtggga tgttaccact 481 gaaagggatg gtaaaaaaga atcggctgtc tttgatgctg taatggtttg ttccggacat 541 catgtgtatc ccaacctacc aaaagagtcc tttccaggac taaaccactt taaaggcaaa 601 tgcttccaca gcagggacta taaagaacca ggtgtattca atggaaagcg tgtcctggtg 661 gttggcctgg ggaattcggg ctgtgatatt gccacagaac tcagccgcac agcagaacag 721 gtcatgatca gttccagaag tggctcctgg gtgatgagcc gggtctggga caatggttat 781 ccttgggaca tgctgctcgt cactcgattc ggaaccttcc tcaagaacaa tttaccgaca 841 gccatctctg actggttgta cgtgaagcag atgaatgcaa gattcaagca tgaaaactat 901 ggcttgatgc ctttaaatgg agtcctgagg aaagagcctg tatttaacga tgagctccca 961 gcaagcattc tgtgtggcat tgtgtccgta aagcctaacg tgaaggaatt cacagagacc 1021 tcggccattt ttgaggatgg gaccatattt gagggcattg actgtgtaat ctttgcaaca 1081 gggtatagtt ttgcctaccc cttccttgat gagtctatca tcaaaagcag aaacaatgag 1141 atcattttat ttaaaggagt atttcctcct ctacttgaga agtcaaccat agcagtgatt 1201 ggctttgtcc agtcccttgg ggctgccatt cccacagttg acctccagtc ccgctgggca 1261 gcacaagtaa taaagggaac ttgtactttg ccttctatgg aagacatgat gaatgatatt 1321 aatgagaaaa tggagaaaaa gcgcaaatgg tttggcaaaa gcgagaccat acagacagat 1381 tacattgttt atatggatga actctcctcc ttcattgggg caaagcccaa catcccatgg 1441 ctgtttctca cagatcccaa attggccatg gaagtttatt ttggcccttg tagtccctac 1501 cagtttaggc tggtgggccc agggcagtgg ccaggagcca gaaatgccat gctgacccag 1561 tgggaccggt cgttgaaacc catgcagaca cgagtggtcg ggagacttca gaagccttgc 1621 ttctttttcc attggctgaa gctctttgca attcctattc tgttaatcgc tgttttcctt 1681 gtgttgacct aatcatcatt ttctctagga tttctgaaag ttactgacaa tacccagaca 1741 ggggctttgc tatttaaaaa ttaaaatttt cacaccacct gcttttctat tcagcatctt 1801 ttgcagtact ctgtagacat tagtcagtaa tacagtgtta tttctaggct ctgaaatagc 1861 cactttaaga atcatgtcat gatcttaaga gagcactaat catttctgtt tga // LOCUS HSFMO5 2326 bp RNA PRI 12-JAN-1995 DEFINITION H.sapiens mRNA for flavin-containing monooxygenase 5 (FMO5). ACCESSION Z47553 NID g623241 KEYWORDS flavin-containing monooxygenase 5. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2326) AUTHORS Dolphin,C.T., Povey,S., Shephard,E.A., Smith,R.L. and Phillips,I.R. TITLE Cloning, primary sequence and chromosomal localisation of human flavin-containing monooxygenase 5 (FMO5) JOURNAL Unpublished REFERENCE 2 (bases 1 to 2326) AUTHORS Dolphin,C.T. TITLE Direct Submission JOURNAL Submitted (12-JAN-1995) Colin T Dolphin, Biochemistry, Queen Mary and Westfield College, University of London, Mile End Road, London, E1 4NS, UK FEATURES Location/Qualifiers source 1..2326 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="1C1/1b, 2-1b, 2-2b, 2-3b, 3'RACE1C1" /dev_stage="adult" /tissue_type="placenta, liver" /clone_lib="lambda gt11 placenta cDNA library, lambda gt11 liver cDNA library from Dr S. Woo" 5'UTR 1..81 mRNA 1..2326 CDS 82..1683 /codon_start=1 /product="flavin-containing monooxygenase 5 (FMO5)" /db_xref="PID:g623242" /translation="MTKKRIAVIGGGVSGLSSIKCCVEEGLEPVCFERTDDIGGLWRF QENPEEGRASIYKSVIINTSKEMMCFSDYPIPDHYPNFMHNAQVLEYFRMYAKEFDLL KYIRFKTTVCSVKKQPDFATSGQWEVVTESEGKKEMNVFDGVMVCTGHHTNAHLPLES FPGIEKFKGQYFHSRDYKNPEGFTGKRVIIIGIGNSGGDLAVEISQTAKQVFLSTRRG AWILNRVGDYGYPADVLFSSRLTHFIWKICGQSLANKYLEKKINQRFDHEMFGLKPKH RALSQHPTLNDDLPNRIISGLVKVKGNVKEFTETAAIFEDGSREDDIDAVIFATGYSF DFPFLEDSVKVVKNKIPLYKKVFPPNLERPTLAIIGLIQPLGAIMPISELQGRWATQV FKGLKTLPSQSEMMAEISKAQEEIDKRYVESQRHTIQGDYIDTMEELADLVGVRPNLL SLAFTDPKLALHLLLGPCTPIHYRVQGPGKWDGARKAILTTDDRIRKPLMTRVVERSS SMTSTMTIGKFMLALAFFAIIIAYF" 3'UTR 1684..2326 polyA_signal 1987..1992 polyA_signal 2303..2308 BASE COUNT 661 a 473 c 517 g 675 t ORIGIN 1 ccagatcgca gctgaaggat ctgttgagcg cttcaggaaa ggcggacagg cgacactaac 61 aggtgaagat ctcgggagac catgactaag aaaagaattg ctgtgattgg gggaggagtg 121 agcgggctct cttccatcaa gtgctgcgta gaagaaggct tggaacctgt ctgctttgaa 181 aggactgatg acatcggagg gctctggagg ttccaggaaa atcctgaaga aggaagggcc 241 agtatttaca aatcagtgat catcaatact tctaaagaga tgatgtgctt cagtgactat 301 ccaatcccag atcattatcc caacttcatg cataatgccc aggtcctgga gtatttcagg 361 atgtatgcca aagaatttga ccttctaaag tatattcgat ttaagaccac tgtgtgcagt 421 gtgaagaagc agcctgattt tgccacttca ggccaatggg aagtggtcac tgaatctgaa 481 gggaaaaagg agatgaatgt ctttgatgga gtcatggttt gcactggcca tcacaccaat 541 gctcatctac ctctggaaag cttccctgga attgagaagt tcaaagggca gtacttccac 601 agtcgagact ataagaaccc agagggattc actggaaaga gagtcattat aattggcatt 661 gggaattctg gaggggatct ggctgtagag attagccaaa cagccaagca ggttttcctc 721 agcaccagga gaggggcttg gatcctgaat cgtgtagggg actacggata tcctgctgat 781 gtgttgttct cttctcgact tacacatttt atatggaaga tctgtggcca atcattagca 841 aacaaatatt tggaaaaaaa gataaaccaa aggtttgacc atgaaatgtt tggcctgaag 901 cctaaacaca gagctctgag tcagcatcca accttaaatg atgacctgcc aaatcgtatc 961 atttctggct tggtgaaagt gaaaggaaat gtgaaggaat tcacggagac agctgccata 1021 tttgaggatg gctccaggga ggatgacatt gatgctgtta tctttgccac aggctatagc 1081 tttgactttc catttctgga agattccgtc aaagtggtca aaaacaagat acccctgtat 1141 aaaaaggtct tccctcctaa cctggaaagg ccaactcttg caatcatagg cttgattcag 1201 cccttaggag ccattatgcc catttcagag ctccaaggac gctgggccac tcaggtattt 1261 aaaggtctaa agacattgcc ctcacagagt gaaatgatgg cagaaatatc taaagctcaa 1321 gaggaaattg acaaaaggta tgtggagagc caacgccata ccattcaggg agactacata 1381 gataccatgg aagagcttgc tgatttggtg ggggtcaggc ccaatctgct gtctctggcc 1441 ttcactgacc ccaagctggc attacactta ttactgggac cctgcactcc aatccactat 1501 cgtgtacagg gccctggaaa gtgggatggg gctcgaaaag ctatcctcac cacagatgat 1561 cgcatcagga agcctctgat gacaagagta gttgaaagga gtagttctat gacttcaaca 1621 atgacaatag gcaagtttat gctagctctt gccttctttg ctataattat agcttacttc 1681 tagttgtcct attgtcactg ccctgttttt cattgggaag cttatctaca gatgccttca 1741 gaatctgacg agattgactc tcagtttcat attgcccaga aatctacttt aatgtctctt 1801 tcgaaagcat taattcactt tcctttttcc tacaatgaaa cctgttttcc atttgtatta 1861 actcatctcc cttccactca tgatccgtca ctcttccttg tggtaatccc tagactggga 1921 gctcaggtac tcttttagtc atctttgtat gtctttagca gagttcttga catgtggtag 1981 gtgcttaata aatgtttgtt gtttatcaaa ttttatggta gggagagtaa gtcagcatcg 2041 gtataaaatc gcttactcca cgtaactctt cttctgatag ggtttgattt tctattagaa 2101 gctcaatttt agtttttttt catattataa ctaaatatgt ttcctgagag ataagagaaa 2161 taatgttcct acaatagttg tatgtatcta agataagaca tatagatgct taagacattt 2221 tgtttcactt gctattcact agtgtacttg aaacatggtc atttttagcc cttttcctta 2281 ggaaccatgt ctttattttc tcaataaaga aattactttc aactca // LOCUS HSFMR1A 4362 bp RNA PRI 23-APR-1993 DEFINITION H.sapiens FMR-1 mRNA. ACCESSION X69962 NID g296587 KEYWORDS Fragile X mental retardation 1; Fragile X syndrome. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4362) AUTHORS Oostra,B.A. TITLE Direct Submission JOURNAL Submitted (07-JAN-1993) B.A. Oostra, Dep. of Clinical Genetics, Erasmus Uni., P.O. Box 1738, 3000 DR Rotterdam, THE NETHERLANDS REFERENCE 2 (bases 1 to 4362) AUTHORS Verkerk,A.J.H.M., de Graaff,E., de Boulle,K., Eichler,E.E., Konecki,D.S., Reyniers,E., Manca,A., Poustka,A., Willems,P.J., Nelson,D.L. and Oostra,B.A. TITLE Alternative splicing in the fragile X gene FMR1 JOURNAL Hum. Mol. Genet. 2 (4), 399-404 (1993) MEDLINE 93278388 REMARK Erratum:[Hum Mol Genet 1993 Aug;2(8):1348]] FEATURES Location/Qualifiers source 1..4362 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /map="Xq27.3" CDS 220..2118 /note="unnamed protein product" /codon_start=1 /db_xref="PID:g296588" /db_xref="SWISS-PROT:Q06787" /translation="MEELVVEVRGSNGAFYKAFVKDVHEDSITVAFENNWQPDRQIPF HDVRFPPPVGYNKDINESDEVEVYSRANEKEPCCWWLAKVRMIKGEFYVIEYAACDAT YNEIVTIERLRSVNPNKPATKDTFHKIKLDVPEDLRQMCAKEAAHKDFKKAVGAFSVT YDPENYQLVILSINEVTSKRAHMLIDMHFRSLRTKLSLIMRNEEASKQLESSRQLASR FHEQFIVREDLMGLAIGTHGANIQQARKVPGVTAIDLDEDTCTFHIYGEDQDAVKKAR SFLEFAEDVIQVPRNLVGKVIGKNGKLIQEIVDKSGVVRVRIEAENEKNVPQEEEIMP PNSLPSNNSRVGPNAPEEKKHLDIKENSTHFSQPNSTKVQRVLVASSVVAGESQKPEL KAWQGMVPFVFVGTKDSIANATVLLDYHLNYLKEVDQLRLERLQIDEQLRQIGASSRP PPNRTDKEKSYVTDDGQGMGRGSRPYRNRGHGRRGPGYTSGTNSEASNASETESDHRD ELSDWSLAPTEEERESFLRRGDGRRRGGGGRGQGGRGRGGGFKGNDDHSRTDNRPRNP REAKGRTTDGSLQIRVDCNNERSVHTKTLQNTSSEGSRLRTGKDRNQKKEKPDSVDGQ QPLVNGVP" exon 1345..1407 /note="alternative exon" misc_feature 1691..1765 /note="alternative splice" misc_feature 1957..2007 /note="alternative splice" polyA_site 3909 BASE COUNT 1340 a 719 c 995 g 1308 t ORIGIN 1 acggcgagcg cgggcggcgg cggtgacgga ggcgccgctg ccagggggcg tgcggcagcg 61 cggcggcggc ggcggcggcg gcggcggcgg aggcggcggc ggcggcggcg gcggcggcgg 121 aggcggcggc ggcggcggcg gcggcggcgg ctgggcctcg agcgcccgca gcccacctct 181 cgggggcggg ctcccggcgc tagcagggct gaagagaaga tggaggagct ggtggtggaa 241 gtgcggggct ccaatggcgc tttctacaag gcatttgtaa aggatgttca tgaagattca 301 ataacagttg catttgaaaa caactggcag cctgataggc agattccatt tcatgatgtc 361 agattcccac ctcctgtagg ttataataaa gatataaatg aaagtgatga agttgaggtg 421 tattccagag caaatgaaaa agagccttgc tgttggtggt tagctaaagt gaggatgata 481 aagggtgagt tttatgtgat agaatatgca gcatgtgatg caacttacaa tgaaattgtc 541 acaattgaac gtctaagatc tgttaatccc aacaaacctg ccacaaaaga tactttccat 601 aagatcaagc tggatgtgcc agaagactta cggcaaatgt gtgccaaaga ggcggcacat 661 aaggatttta aaaaggcagt tggtgccttt tctgtaactt atgatccaga aaattatcag 721 cttgtcattt tgtccatcaa tgaagtcacc tcaaagcgag cacatatgct gattgacatg 781 cactttcgga gtctgcgcac taagttgtct ctgataatga gaaatgaaga agctagtaag 841 cagctggaga gttcaaggca gcttgcctcg agatttcatg aacagtttat cgtaagagaa 901 gatctgatgg gtctagctat tggtactcat ggtgctaata ttcagcaagc tagaaaagta 961 cctggggtca ctgctattga tctagatgaa gatacctgca catttcatat ttatggagag 1021 gatcaggatg cagtgaaaaa agctagaagc tttctcgaat ttgctgaaga tgtaatacaa 1081 gttccaagga acttagtagg caaagtaata ggaaaaaatg gaaagctgat tcaggagatt 1141 gtggacaagt caggagttgt gagggtgagg attgaggctg aaaatgagaa aaatgttcca 1201 caagaagagg aaattatgcc accaaattcc cttccttcca ataattcaag ggttggacct 1261 aatgccccag aagaaaaaaa acatttagat ataaaggaaa acagcaccca tttttctcaa 1321 cctaacagta caaaagtcca gagggtgtta gtggcttcat cagttgtagc aggggaatcc 1381 cagaaacctg aactcaaggc ttggcagggt atggtaccat ttgtttttgt gggaacaaag 1441 gacagcatcg ctaatgccac tgttcttttg gattatcacc tgaactattt aaaggaagta 1501 gaccagttgc gtttggagag attacaaatt gatgagcagt tgcgacagat tggagctagt 1561 tctagaccac caccaaatcg tacagataag gaaaaaagct atgtgactga tgatggtcaa 1621 ggaatgggtc gaggtagtag accttacaga aatagggggc acggcagacg cggtcctgga 1681 tatacttcag gaactaattc tgaagcatca aatgcttctg aaacagaatc tgaccacaga 1741 gacgaactca gtgattggtc attagctcca acagaggaag agagggagag cttcctgcgc 1801 agaggagacg gacggcggcg tggaggggga ggaagaggac aaggaggaag aggacgtgga 1861 ggaggcttca aaggaaacga cgatcactcc cgaacagata atcgtccacg taatccaaga 1921 gaggctaaag gaagaacaac agatggatcc cttcagatca gagttgactg caataatgaa 1981 aggagtgtcc acactaaaac attacagaat acctccagtg aaggtagtcg gctgcgcacg 2041 ggtaaagatc gtaaccagaa gaaagagaag ccagacagcg tggatggtca gcaaccactc 2101 gtgaatggag taccctaaac tgcataattc tgaagttata tttcctatac catttccgta 2161 attcttattc catattagaa aactttgtta ggccaaagac aaatagtagg caagatggca 2221 cagggcatga aatgaacaca aattatgcta agaatttttt attttttggt attggccata 2281 agcaacaatt ttcagatttg cacaaaaaga taccttaaaa tttgaaacat tgcttttaaa 2341 actacttagc acttcagggc agattttagt tttattttct aaagtactga gcagtgatat 2401 tctttgttaa tttggaccat tttcctgcat tgggtgatca ttcaccagta cattctcagt 2461 ttttcttaat atatagcatt tatggtaatc atattagact tctgttttca atctcgtata 2521 gaagtcttca tgaaatgcta tgtcatttca tgtcctgtgt cagtttatgt tttggtccac 2581 ttttccagta ttttagtgga ccctgaaatg tgtgtgatgt gacatttgtc attttcatta 2641 gcaaaaaaag ttgtatgatc tgtgcctttt ttatatcttg gcaggtagga atattatatt 2701 tggatgcaga gttcagggaa gataagttgg aaacactaaa tgttaaagat gtagcaaacc 2761 ctgtcaaaca ttagtacttt atagaagaat gcatgctttc catatttttt tccttacata 2821 aacatcaggt taggcagtat aaagaatagg acttgttttt gtttttgttt tgttgcactg 2881 aagtttgata aatagtgtta ttgagagaga tgtgtaattt ttctgtatag acaggagaag 2941 aaagaactat cttcatctga gagaggctaa aatgttttca gctaggaaca aatcttcctg 3001 gtcgaaagtt agtaggatat gcctgctctt tggcctgatg accaatttta acttagagct 3061 ttttttttta attttgtctg ccccaagttt tgtgaaattt ttcatatttt aatttcaagc 3121 ttattttgga gagataggaa ggtcatttcc atgtatgcat aataatcctg caaagtacag 3181 gtactttgtc taagaaacat tggaagcagg ttaaatgttt tgtaaacttt gaaatatatg 3241 gtctaatgtt taagcagaat tggaaaagac taagatcggt taacaaataa caactttttt 3301 ttcttttttt cttttgtttt ttgaagtgtt ggggtttggt tttgtttttt gagtcttttt 3361 tttttaagtg aaatttattg aggaaaaata tgtgaaggac cttcactcta agatgttata 3421 tttttcttaa aaagtaactc ctagtagggg taccactgaa tctgtacaga gccgtaaaaa 3481 ctgaagttct gcctctgatg tattttgtga gtttgtttct ttgaattttc attttacagt 3541 tacttttcct tgcatacaaa caagcatata aaatggcaac aaactgcaca tgatttcaca 3601 aatattaaaa agtcttttaa aaagtattgc caaacattaa tgttgatttc tagttattta 3661 ttctgggaat gtatagtatt tgaaaacaga aattggtacc ttgcacacat catctgtaag 3721 ctgtttggtt ttaaaatact gtagataatt aaccaaggta gaatgacctt gtaatgtaac 3781 tgctcttggg caatattctc tgtacatatt agcgacaaca gattggattt tatgttgaca 3841 tttgtttggt tatagtgcaa tatattttgt atgcaagcag tttcaataaa gtttgatctt 3901 cctctgctaa attgatgttg atgcaatcct tacaaatgat tgcttttaaa attttaagct 3961 aggaaaagaa atctatagaa agtgttctgt tacaaaatgt aactgttacc attggaaatt 4021 tcacgtcata ggaagttagc ctttatctac ccaactttca agaaggttct ttaataaagc 4081 gaaaactcaa ccaaatggta cttttccaca gtgtaccatt aaaatatgca ctagtctctt 4141 tttacaaggc tgtattcagc aagggcctaa cttgcttaaa gtgtaattac taacttctaa 4201 aactgtactt tgattcacat gttttcaaat ggagttggag ttcattcata ttacaatatt 4261 tgtgtgctaa acgtgtatgt ttttcagttc aaagtcatga tgtttttaaa atcttattaa 4321 agtttcaaaa atctgaagat tgtttatcta gatgtaaatt tt // LOCUS HSFMYBPC 3575 bp RNA PRI 21-SEP-1993 DEFINITION H.sapiens mRNA for fast MyBP-C. ACCESSION X73113 NID g402646 KEYWORDS C protein; muscle isoform; myosin binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3575) AUTHORS Weber,F.E., Vaughan,K.T., Reinach,F.C. and Fischman,D.A. TITLE Complete sequence of human fast-type and slow-type muscle myosin-binding-protein C (MyBP-C). Differential expression, conserved domain structure and chromosome assignment JOURNAL Eur. J. Biochem. 216 (2), 661-669 (1993) MEDLINE 93387319 REFERENCE 2 (bases 1 to 3575) AUTHORS Vaughan,K.T. TITLE Direct Submission JOURNAL Submitted (01-JUN-1993) K.T. Vaughan, Worcester Foundation for Exper. Biology, Cell Biology Group, 222 Maple Avenue, Shrewsbury MA, 10545, USA FEATURES Location/Qualifiers source 1..3575 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /germline /tissue_type="skeletal muscle cell type" /clone="H7, H5 and H75PPCR" /clone_lib="lambda gt10" /chromosome="19" CDS 20..3448 /codon_start=1 /product="fast MyBP-C" /db_xref="PID:g402647" /translation="MPEAKPAAKKAPKGKDAPKGAPKEAPPKEAPAEAPKEAPPEDQS PTAEEPTGVFLKKPDSVSVETGKDAVVVAKVNGKELPDKPTIKWFKGKWLELGSKSGA RFSFKESHNSASNVYTVELHIGKVVLGDRGYYRLEVKAKDTCDSCGFNIDVEAPRQDA SGQSLESFKRTSEKKSDTAGELDFSGLLKKREVVEEEKKKKKKDDDDLGIPPEIWELL KGAKKSEYEKIAFQYGITDLRGMLKRLKKAKVEVKKSAAFTKKLDPAYQVDRGNKIKL MVEISDPDLTLKWFKNGQEIKPSSKYVFENVGKKRILTINKCTLADDAAYEVAVKDEK CFTELFVKEPPVLIVTPLEDQQVFVGDRVEMAVEVSEEGAQVMWMKDGVELTREDSFK ARYRFKKDGKRHILIFSDVVQEDRGRYQVITNGGQCEAELIVEEKQLEVLQDIADLTV KASEQAVFKCEVSDEKVTGKWYKNGVEVRPSKRITISHVGRFHKLVIDDVRPEDEGDY TFVPDGYALGSLSAKLNFLEIKVEYVPKQEPPKIHLDCSGKTSENAIVVVAGNKLRLD VSITGEPPPVATWLKGDEVFTTTEGRTRIEKRVDCSSFVIESAQREDEGRYTIKVTNP VGEDVASIFLQVVDVPDPPEAVRITSVGEDWAILVWEPPMYDGGKPVTGYLVERKKKG SQRWMKLNFEVFTETTYESTKMIEGILYEMRVFAVNAIGVSQPSMNTKPFMPIAPTSE PLHLIVEDVTDTTTTLKWRPPNRIGAGGIDGYLVEYCLEGSEEWVPANTEPVERCGFT VKNLPTGARILFRVVGVNIAGRTEPATLAQPVTIREIAEPPKIRLPRHLRQTYIRKVG EQLNLVVPFQGKPRPQVVWTKGGAPLDTSRVHVRTSDFDTVFFVRQAARSDSGEYELS VQIENMKDTATIRIRVVEKAGPPINVMVKEVWGTNALVEWQAPKDDGNSEIMGYFVQK ADKKTMEWFNVYERNRHTSCTVSDLIVGNEYYFRVYTENICGLSDSPGVSKNTARILK TGITFKPFEYKEHDFRMAPKFLTPLIDRVVVAGYSAALNCAVRGHPKPKVVWMKNKME IREDPKFLITNYQGVLTLNIRRPSPFDAGTYTCRAVNELGEALAECKLEVRVPQ" 3'UTR 3443..3575 polyA_signal 3544..3550 BASE COUNT 886 a 959 c 1094 g 636 t ORIGIN 1 cgggaggagg tcccccgaca tgcctgaggc aaaaccagcg gccaaaaagg cccccaaagg 61 caaagatgcc cccaaaggag cccccaagga ggctccccct aaggaggctc ctgcagaggc 121 ccccaaagaa gccccacccg aggaccagtc cccgactgca gaggagccca ccggcgtttt 181 cctgaagaag ccggactccg tctcagtgga gactgggaag gacgcagtgg tcgtggccaa 241 ggtgaacggg aaggagctcc cagacaaacc gaccatcaag tggttcaagg ggaagtggct 301 ggagctgggc agcaagagtg gcgcccgctt ctccttcaag gagtcccaca actccgccag 361 caatgtgtac accgtggagc tgcacattgg gaaggtggta ctgggggacc gtgggtatta 421 ccgcctcgag gtcaaagcca aggacacctg tgacagctgt ggcttcaaca tcgatgtgga 481 ggcaccccgt caggatgcct ctgggcagag tctagaaagc ttcaagcgta cgagtgaaaa 541 gaagtcggat actgcaggtg agctggattt cagtggcctg ttgaagaaga gggaggtggt 601 ggaggaggag aagaagaaga aaaagaaaga tgacgatgac ctaggcatcc ccccggagat 661 ttgggagctc ctgaaagggg caaagaagag cgagtacgag aaaatcgcct tccagtatgg 721 catcaccgac ctccggggca tgctgaagcg gctgaaaaag gctaaggtcg aggtcaagaa 781 gagtgcagca ttcacaaaga agctggatcc agcctaccaa gtggacagag gcaacaagat 841 caagttgatg gtagagatca gcgacccaga cctgaccctc aagtggttca agaacggcca 901 ggagatcaaa ccaagcagca agtacgtgtt tgagaacgtt ggtaagaagc gaattcttac 961 catcaacaag tgcacgctgg cggatgacgc tgcctatgaa gtagctgtca aggatgagaa 1021 gtgtttcacc gagctcttcg tcaaagaacc tccagtccta attgtcacac ctcttgagga 1081 ccagcaggtg tttgtgggtg accgggtgga aatggcagtg gaggtgtcag aagagggtgc 1141 ccaggtgatg tggatgaaag atggtgtgga actgactcgg gaggattcct tcaaggcccg 1201 gtaccgcttc aagaaggacg ggaagcgcca catcctcatc ttctcagacg tggtccagga 1261 ggacaggggt cgctatcagg tcataaccaa tggcggccag tgtgaggccg agctgattgt 1321 ggaagagaaa cagctggagg tcctgcagga catcgcggat ctgacggtga aggcctcaga 1381 acaagctgtg ttcaagtgcg aggtgtctga tgagaaagtg acgggcaagt ggtataagaa 1441 tggggtcgag gtgcggccca gcaagaggat caccatttcc catgtaggca ggttccacaa 1501 gctggtgatc gatgacgtcc gccccgagga tgagggagac tacacgtttg tgcctgacgg 1561 ctacgccctt ggttcgctct cggccaagct caacttcctg gaaatcaagg tggagtacgt 1621 tcccaagcaa gagccaccaa agatccactt ggattgctcg gggaagacct cagagaatgc 1681 gattgtggtt gtggctggaa acaagctgag gcttgacgtg tccatcacag gggagccccc 1741 tcccgtcgct acctggctga agggagatga ggtattcacg accaccgagg gcaggacccg 1801 catcgagaag cgggtggact gcagcagctt tgtgattgag agtgcgcagc gggaagacga 1861 gggccgctac accatcaagg tcaccaaccc cgtcggcgag gacgtggctt ccatcttcct 1921 gcaagttgta gatgtcccag accccccgga ggctgtgcgc atcacctcgg ttggagagga 1981 ttgggccatc cttgtctggg agccaccaat gtacgatggg gggaagccag tcaccgggta 2041 cctcgtagag cggaagaaga agggctctca gcgctggatg aagctgaact ttgaggtctt 2101 cacagagacc acctatgagt ccaccaagat gatcgagggc atcctctatg agatgcgtgt 2161 cttcgccgtc aatgctatag gggtctccca gcccagcatg aacaccaagc cttttatgcc 2221 tattgcaccc acgagtgaac ccctgcacct gatagtggag gatgtgacag acaccaccac 2281 cacactcaag tggaggcctc cgaacaggat cggggcaggt ggcatcgatg ggtacctggt 2341 ggagtactgc ctggaaggct ccgaggaatg ggtccctgcc aacaccgagc ccgtggagcg 2401 ctgtggcttc accgtcaaga atctcccgac cggagccaga atcctcttcc gagtagttgg 2461 ggtcaacatc gcggggcgca cggagccggc caccctggcc cagccggtca ccatcaggga 2521 gattgcggag ccacccaaga tccggcttcc ccgccatctc cgccagacct acatccgcaa 2581 agtgggcgag cagctcaacc ttgtcgtccc cttccaggga aagccccggc cccaggtggt 2641 gtggaccaag ggcggggccc cgctggacac ctcccgcgtg cacgtgcgga ccagcgactt 2701 cgacaccgtg ttcttcgtgc gccaggcggc ccgctccgac tccggggagt acgagctgag 2761 cgtgcagatc gagaacatga aggacaccgc caccatccgc atccgcgttg tggaaaaggc 2821 tgggcccccc ataaacgtga tggtgaagga ggtgtggggc acgaacgcgc tggtggagtg 2881 gcaggccccc aaagatgatg ggaacagtga gatcatgggg tatttcgtcc agaaagcaga 2941 caaaaaaacc atggagtggt tcaacgtcta tgaacgtaac aggcacacta gctgtactgt 3001 gtccgacctt atcgtgggca atgaatacta tttccgagtt tacaccgaga acatctgtgg 3061 gctcagtgac tcacctggtg tctccaagaa cacggcccgc atcctcaaga caggaatcac 3121 cttcaaaccg ttcgagtata aggagcatga cttccggatg gctcccaagt tcctgacacc 3181 tctcatagac cgcgtggtcg tggctgggta ctcggcagcc ctcaactgtg ctgtcagagg 3241 ccacccgaag ccgaaggtgg tctggatgaa gaacaagatg gaaatccgtg aagatcccaa 3301 gttcctgata accaattacc aaggagtcct gacgctgaac atccgtcgcc cctcgccctt 3361 cgacgctggg acttacacct gccgggccgt caacgagctg ggcgaggcgc tggctgagtg 3421 caagctggag gtccgagtgc cgcagtgaga cctgtcccct acctgccaag acaattggtg 3481 gtggagtcct gaccccaatc cccaacctcc caggactgtg ttctttctgg agttttcgct 3541 gagaacaaaa cagtgttgtc tggaaaaaaa aaaaa // LOCUS HSFNRA 4204 bp RNA PRI 21-MAR-1995 DEFINITION Human mRNA for fibronectin receptor alpha subunit. ACCESSION X06256 NID g31437 KEYWORDS fibronectin receptor; fibronectin receptor alpha subunit; glycoprotein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4204) AUTHORS Argraves,W.S., Suzuki,S., Arai,H., Thompson,K., Pierschbacher,M.D. and Ruoslahti,E. TITLE Amino acid sequence of the human fibronectin receptor JOURNAL J. Cell Biol. 105 (3), 1183-1190 (1987) MEDLINE 88007843 FEATURES Location/Qualifiers source 1..4204 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="lambda gt11" sig_peptide 24..146 /note="pot. signal peptide (AA -41 to -1)" CDS 24..3173 /note="fibronectin receptor alpha subunit precursor (AA -41 to 1008)" /codon_start=1 /db_xref="PID:g31438" /db_xref="SWISS-PROT:P08648" /translation="MGSRTPESPLHAVQLRWGPRRRPPLVPLLLLLVPPPPRVGGFNL DAEAPAVLSGPPGSFFGFSVEFYRPGTDGVSVLVGAPKANTSQPGVLQGGAVYLCPWG ASPTQCTPIEFDSKGSRLLESSLSSSEGEEPVEYKSLQWFGATVRAHGSSILACAPLY SWRTEKEPLSDPVGTCYLSTDNFTRILEYAPCRSDFSWAAGQGYCQGGFSAEFTKTGR VVLGGPGSYFWQGQILSATQEQIAESYYPEYLINLVQGQLQTRQASSIYDDSYLGYSV AVGEFSGDDTEDFVAGVPKGNLTYGYVTILNGSDIRSLYNFSGEQMASYFGYAVAATD VNGDGLDDLLVGAPLLMDRTPDGRPQEVGRVYVYLQHPAGIEPTPTLTLTGHDEFGRF GSSLTPLGDLDQDGYNDVAIGAPFGGETQQGVVFVFPGGPGGLGSKPSQVLQPLWAAS HTPDFFGSALRGGRDLDGNGYPDLIVGSFGVDKAVVYRGRPIVSASASLTIFPAMFNP EERSCSLEGNPVACINLSFCLNASGKHVADSIGFTVELQLDWQKQKGGVRRALFLASR QATLTQTLLIQNGAREDCREMKIYLRNESEFRDKLSPIHIALNFSLDPQAPVDSHGLR PALHYQSKSRIEDKAQILLDCGEDNICVPDLQLEVFGEQNHVYLGDKNALNLTFHAQN VGEGGAYEAELRVTAPPEAEYSGLVRHPGNFSSLSCDYFAVNQSRLLVCDLGNPMKAG ASLWGGLRFTVPHLRDTKKTIQFDFQILSKNLNNSQSDVVSFRLSVEAQAQVTLNGVS KPEAVLFPVSDWHPRDQPQKEEDLGPAVHHVYELINQGPSSISQGVLELSCPQALEGQ QLLYVTRVTGLNCTTNHPINPKGLELDPEGSLHHQQKREAPSRSSASSGPQILKCPEA ECFRLRCELGPLHQQESQSLQLHFRVWAKTFLQREHQPFSLQCEAVYKALKMPYRILP RQLPQKERQVATAVQWTKAEGSYGVPLWIIILAILFGLLLLGLLIYILYKLGFFKRSL PYGTAMEKAQLKPPATSDA" mat_peptide 147..3170 /note="mature fibronectin receptor alpha subunit (AA 1 - 1008)" misc_feature 273..275 /note="pot. N-linked glycosylation site" misc_feature 567..569 /note="pot. N-linked glycosylation site" misc_feature 912..914 /note="pot. N-linked glycosylation site" misc_feature 942..944 /note="pot. N-linked glycosylation site" misc_feature 969..971 /note="pot. N-linked glycosylation site" misc_feature 1593..1595 /note="pot. N-linked glycosylation site" misc_feature 1611..1613 /note="pot. N-linked glycosylation site" misc_feature 1800..1802 /note="pot. N-linked glycosylation site" misc_feature 1849..1851 /note="pot. N-linked glycosylation site" misc_feature 2046..2048 /note="pot. N-linked glycosylation site" misc_feature 2157..2159 /note="pot. N-linked glycosylation site" misc_feature 2193..2195 /note="pot. N-linked glycosylation site" misc_feature 2340..2342 /note="pot. N-linked glycosylation site" misc_feature 2625..2627 /note="pot. N-linked glycosylation site" misc_feature 2706..2707 /note="pot. cleavage site for light chain" misc_feature 3000..3086 /note="pot. transmembrane domain" BASE COUNT 924 a 1277 c 1155 g 848 t ORIGIN 1 caggacaggg aagagcgggc gctatgggga gccggacgcc agagtcccct ctccacgccg 61 tgcagctgcg ctggggcccc cggcgccgac ccccgctcgt gccgctgctg ttgctgctcg 121 tgccgccgcc acccagggtc gggggcttca acttagacgc ggaggcccca gcagtactct 181 cggggccccc gggctccttc ttcggattct cagtggagtt ttaccggccg ggaacagacg 241 gggtcagtgt gctggtggga gcacccaagg ctaataccag ccagccagga gtgctgcagg 301 gtggtgctgt ctacctctgt ccttggggtg ccagccccac acagtgcacc cccattgaat 361 ttgacagcaa aggctctcgg ctcctggagt cctcactgtc cagctcagag ggagaggagc 421 ctgtggagta caagtccttg cagtggttcg gggcaacagt tcgagcccat ggctcctcca 481 tcttggcatg cgctccactg tacagctggc gcacagagaa ggagccactg agcgaccccg 541 tgggcacctg ctacctctcc acagataact tcacccgaat tctggagtat gcaccctgcc 601 gctcagattt cagctgggca gcaggacagg gttactgcca aggaggcttc agtgccgagt 661 tcaccaagac tggccgtgtg gttttaggtg gaccaggaag ctatttctgg caaggccaga 721 tcctgtctgc cactcaggag cagattgcag aatcttatta ccccgagtac ctgatcaacc 781 tggttcaggg gcagctgcag actcgccagg ccagttccat ctatgatgac agctacctag 841 gatactctgt ggctgttggt gaattcagtg gtgatgacac agaagacttt gttgctggtg 901 tgcccaaagg gaacctcact tacggctatg tcaccatcct taatggctca gacattcgat 961 ccctctacaa cttctcaggg gaacagatgg cctcctactt tggctatgca gtggccgcca 1021 cagacgtcaa tggggacggg ctggatgact tgctggtggg ggcacccctg ctcatggatc 1081 ggacccctga cgggcggcct caggaggtgg gcagggtcta cgtctacctg cagcacccag 1141 ccggcataga gcccacgccc acccttaccc tcactggcca tgatgagttt ggccgatttg 1201 gcagctcctt gacccccctg ggggacctgg accaggatgg ctacaatgat gtggccatcg 1261 gggctccctt tggtggggag acccagcagg gagtagtgtt tgtatttcct gggggcccag 1321 gagggctggg ctctaagcct tcccaggttc tgcagcccct gtgggcagcc agccacaccc 1381 cagacttctt tggctctgcc cttcgaggag gccgagacct ggatggcaat ggatatcctg 1441 atctgattgt ggggtccttt ggtgtggaca aggctgtggt atacaggggc cgccccatcg 1501 tgtccgctag tgcctccctc accatcttcc ccgccatgtt caacccagag gagcggagct 1561 gcagcttaga ggggaaccct gtggcctgca tcaaccttag cttctgcctc aatgcttctg 1621 gaaaacacgt tgctgactcc attggtttca cagtggaact tcagctggac tggcagaagc 1681 agaagggagg ggtacggcgg gcactgttcc tggcctccag gcaggcaacc ctgacccaga 1741 ccctgctcat ccagaatggg gctcgagagg attgcagaga gatgaagatc tacctcagga 1801 acgagtcaga atttcgagac aaactctcgc cgattcacat cgctctcaac ttctccttgg 1861 acccccaagc cccagtggac agccacggcc tcaggccagc cctacattat cagagcaaga 1921 gccggataga ggacaaggct cagatcttgc tggactgtgg agaagacaac atctgtgtgc 1981 ctgacctgca gctggaagtg tttggggagc agaaccatgt gtacctgggt gacaagaatg 2041 ccctgaacct cactttccat gcccagaatg tgggtgaggg tggcgcctat gaggctgagc 2101 ttcgggtcac cgcccctcca gaggctgagt actcaggact cgtcagacac ccagggaact 2161 tctccagcct gagctgtgac tactttgccg tgaaccagag ccgcctgctg gtgtgtgacc 2221 tgggcaaccc catgaaggca ggagccagtc tgtggggtgg ccttcggttt acagtccctc 2281 atctccggga cactaagaaa accatccagt ttgacttcca gatcctcagc aagaatctca 2341 acaactcgca aagcgacgtg gtttcctttc ggctctccgt ggaggctcag gcccaggtca 2401 ccctgaacgg tgtctccaag cctgaggcag tgctattccc agtaagcgac tggcatcccc 2461 gagaccagcc tcagaaggag gaggacctgg gacctgctgt ccaccatgtc tatgagctca 2521 tcaaccaagg ccccagctcc attagccagg gtgtgctgga actcagctgt ccccaggctc 2581 tggaaggtca gcagctccta tatgtgacca gagttacggg actcaactgc accaccaatc 2641 accccattaa cccaaagggc ctggagttgg atcccgaggg ttccctgcac caccagcaaa 2701 aacgggaagc tccaagccgc agctctgctt cctcgggacc tcagatcctg aaatgcccgg 2761 aggctgagtg tttcaggctg cgctgtgagc tcgggcccct gcaccaacaa gagagccaaa 2821 gtctgcagtt gcatttccga gtctgggcca agactttctt gcagcgggag caccagccat 2881 ttagcctgca gtgtgaggct gtgtacaaag ccctgaagat gccctaccga atcctgcctc 2941 ggcagctgcc ccaaaaagag cgtcaggtgg ccacagctgt gcaatggacc aaggcagaag 3001 gcagctatgg cgtcccactg tggatcatca tcctagccat cctgtttggc ctcctgctcc 3061 taggtctact catctacatc ctctacaagc ttggattctt caaacgctcc ctcccatatg 3121 gcaccgccat ggaaaaagct cagctcaagc ctccagccac ctctgatgcc tgagtcctcc 3181 caatttcaga ctcccattcc tgaagaacca gtccccccac cctcattcta ctgaaaagga 3241 ggggtctggg tacttcttga aggtgctgac ggccagggag aagctcctct ccccagccca 3301 gagacatact tgaagggcca gagccagggg ggtgaggagc tggggatccc tcccccccat 3361 gcactgtgaa ggacccttgt ttacacatac cctcttcatg gatgggggaa ctcagatcca 3421 gggacagagg cccagcctcc ctgaagcctt tgcattttgg agagtttcct gaaacaactg 3481 gaaagataac taggaaatcc attcacagtt ctttgggcca gacatgccac aaggacttcc 3541 tgtccagctc caacctgcaa agatctgtcc tcagccttgc cagagatcca aaagaagccc 3601 ccagtaagaa cctggaactt ggggagttaa gacctggcag ctctggacag ccccaccctg 3661 gtgggccaac aaagaacact aactatgcat ggtgccccag gaccagctca ggacagatgc 3721 cacaaggata gatgctggcc cagggccaga gcccagctcc aaggggaatc agaactcaaa 3781 tggggccaga tccagcctgg ggtctggagt tgatctggaa cccagactca gacattggca 3841 ccaatccagg cagatccagg actatatttg ggcctgctcc agacctgatc ctggaggccc 3901 agttcaccct gatttaggag aagccaggaa tttcccagga cctgaagggg ccatgatggc 3961 aacagatctg gaacctcagc ctggccagac acaggccctc cctgttcccc agagaaaggg 4021 gagcccactg tcctgggcct gcagaatttg ggttctgcct gccagctgca ctgatgctgc 4081 ccctcatctc tctgcccaac ccttccctca ccttggcacc agacacccag gacttattta 4141 aactctgttg caagtgcaat aaatctgacc cagtgccccc actgaccaga actagaaaaa 4201 aaaa // LOCUS HSFNRB 3614 bp RNA PRI 21-MAR-1995 DEFINITION Human mRNA for fibronectin receptor beta subunit. ACCESSION X07979 NID g31441 KEYWORDS fibronectin receptor; fibronectin receptor beta subunit; glycoprotein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3614) AUTHORS Argraves,W.S., Suzuki,S., Arai,H., Thompson,K., Pierschbacher,M.D. and Ruoslahti,E. TITLE Amino acid sequence of the human fibronectin receptor JOURNAL J. Cell Biol. 105 (3), 1183-1190 (1987) MEDLINE 88007843 FEATURES Location/Qualifiers source 1..3614 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="lambda gt11" sig_peptide 104..163 /note="put. signal peptide (AA -20 to -1)" CDS 104..2500 /note="fibronectin receptor beta subunit precursor (AA -20 to 778)" /codon_start=1 /db_xref="PID:g31442" /db_xref="SWISS-PROT:P05556" /translation="MNLQPIFWIGLISSVCCVFAQTDENRCLKANAKSCGECIQAGPN CGWCTNSTFLQEGMPTSARCDDLEALKKKGCPPDDIENPRGSKDIKKNKNVTNRSKGT AEKLKPEDIHQIQPQQLVLRLRSGEPQTFTLKFKRAEDYPIDLYYLMDLSYSMKDDLE NVKSLGTDLMNEMRRITSDFRIGFGSFVEKTVMPYISTTPAKLRNPCTSEQNCTTPFS YKNVLSLTNKGEVFNELVGKQRISGNLDSPEGGFDAIMQVAVCGSLIGWRNVTRLLVF STDAGFHFAGDGKLGGIVLPNDGQCHLENNMYTMSHYYDYPSIAHLVQKLSENNIQTI FAVTEEFQPVYKELKNLIPKSAVGTLSANSSNVIQLIIDAYNSLSSEVILENGKLSEG VTISYKSYCKNGVNGTGENGRKCSNISIGDEVQFEISITSNKCPKKDSDSFKIRPLGF TEEVEVILQYICECECQSEGIPESPKCHEGNGTFECGACRCNEGRVGRHCECSTDEVN SEDMDAYCRKENSSEICSNNGECVCGQCVCRKRDNTNEIYSGKFCECDNFNCDRSNGL ICGGNGVCKCRVCECNPNYTGSACDCSLDTSTCEASNGQICNGRGICECGVCKCTDPK FQGQTCEMCQTCLGVCAEHKECVQCRAFNKGEKKDTCTQECSYFNITKVESRDKLPQP VQPDPVSHCKEKDVDDCWFYFTYSVNGNNEVMVHVVENPECPTGPDIIPIVAGVVAGI VLIGLALLLIWKLLMIIHDRREFAKFEKEKMNAKWDTGENPIYKSAVTTVVNPKYEGK " mat_peptide 164..2497 /note="mature fibronectin receptor beta subunit (AA 1-778)" misc_feature 251..253 /note="pot. N-linked glycosylation site" misc_feature 383..385 /note="pot. N-linked glycosylation site" misc_feature 392..394 /note="pot. N-linked glycosylation site" misc_feature 737..739 /note="pot. N-linked glycosylation site" misc_feature 908..910 /note="pot. N-linked glycosylation site" misc_feature 1190..1192 /note="pot. N-linked glycosylation site" misc_feature 1319..1321 /note="pot. N-linked glycosylation site" misc_feature 1352..1354 /note="pot. N-linked glycosylation site" misc_feature 1544..1546 /note="pot. N-linked glycosylation site" misc_feature 1661..1663 /note="pot. N-linked glycosylation site" misc_feature 1853..1855 /note="pot. N-linked glycosylation site" misc_feature 2108..2110 /note="pot. N-linked glycosylation site" BASE COUNT 1110 a 617 c 817 g 1070 t ORIGIN 1 gtccgccaaa acctgcgcgg atagggaaga acagcacccc ggcgccgatt gccgtaccaa 61 acaagcctaa cgtccgctgg gccccggacg ccgcgcggaa aagatgaatt tacaaccaat 121 tttctggatt ggactgatca gttcagtttg ctgtgtgttt gctcaaacag atgaaaatag 181 atgtttaaaa gcaaatgcca aatcatgtgg agaatgtata caagcagggc caaattgtgg 241 gtggtgcaca aattcaacat ttttacagga aggaatgcct acttctgcac gatgtgatga 301 tttagaagcc ttaaaaaaga agggttgccc tccagatgac atagaaaatc ccagaggctc 361 caaagatata aagaaaaata aaaatgtaac caaccgtagc aaaggaacag cagagaagct 421 caagccagag gatattcatc agatccaacc acagcagttg gttttgcgat taagatcagg 481 ggagccacag acatttacat taaaattcaa gagagctgaa gactatccca ttgacctcta 541 ctaccttatg gacctgtctt attcaatgaa agacgatttg gagaatgtaa aaagtcttgg 601 aacagatctg atgaatgaaa tgaggaggat tacttcggac ttcagaattg gatttggctc 661 atttgtggaa aagactgtga tgccttacat tagcacaaca ccagctaagc tcaggaaccc 721 ttgcacaagt gaacagaact gcaccacccc atttagctac aaaaatgtgc tcagtcttac 781 taataaagga gaagtattta atgaacttgt tggaaaacag cgcatatctg gaaatttgga 841 ttctccagaa ggtggtttcg atgccatcat gcaagttgca gtttgtggat cactgattgg 901 ctggaggaat gttacacggc tgctggtgtt ttccacagat gccgggtttc actttgctgg 961 agatgggaaa cttggtggca ttgttttacc aaatgatgga caatgtcacc tggaaaataa 1021 tatgtacaca atgagccatt attatgatta tccttctatt gctcaccttg tccagaaact 1081 gagtgaaaat aatattcaga caatttttgc agttactgaa gaatttcagc ctgtttacaa 1141 ggagctgaaa aacttgatcc ctaagtcagc agtaggaaca ttatctgcaa attctagcaa 1201 tgtaattcag ttgatcattg atgcatacaa ttccctttcc tcagaagtca ttttggaaaa 1261 cggcaaattg tcagaaggag taacaataag ttacaaatct tactgcaaga acggggtgaa 1321 tggaacaggg gaaaatggaa gaaaatgttc caatatttcc attggagatg aggttcaatt 1381 tgaaattagc ataacttcaa ataagtgtcc aaaaaaggat tctgacagct ttaaaattag 1441 gcctctgggc tttacggagg aagtagaggt tattcttcag tacatctgtg aatgtgaatg 1501 ccaaagcgaa ggcatccctg aaagtcccaa gtgtcatgaa ggaaatggga catttgagtg 1561 tggcgcgtgc aggtgcaatg aagggcgtgt tggtagacat tgtgaatgca gcacagatga 1621 agttaacagt gaagacatgg atgcttactg caggaaagaa aacagttcag aaatctgcag 1681 taacaatgga gagtgcgtct gcggacagtg tgtttgtagg aagagggata atacaaatga 1741 aatttattct ggcaaattct gcgagtgtga taatttcaac tgtgatagat ccaatggctt 1801 aatttgtgga ggaaatggtg tttgcaagtg tcgtgtgtgt gagtgcaacc ccaactacac 1861 tggcagtgca tgtgactgtt ctttggatac tagtacttgt gaagccagca acggacagat 1921 ctgcaatggc cggggcatct gcgagtgtgg tgtctgtaag tgtacagatc cgaagtttca 1981 agggcaaacg tgtgagatgt gtcagacctg ccttggtgtc tgtgctgagc ataaagaatg 2041 tgttcagtgc agagccttca ataaaggaga aaagaaagac acatgcacac aggaatgttc 2101 ctattttaac attaccaagg tagaaagtcg ggacaaatta ccccagccgg tccaacctga 2161 tcctgtgtcc cattgtaagg agaaggatgt tgacgactgt tggttctatt ttacgtattc 2221 agtgaatggg aacaacgagg tcatggttca tgttgtggag aatccagagt gtcccactgg 2281 tccagacatc attccaattg tagctggtgt ggttgctgga attgttctta ttggccttgc 2341 attactgctg atatggaagc ttttaatgat aattcatgac agaagggagt ttgctaaatt 2401 tgaaaaggag aaaatgaatg ccaaatggga cacgggtgaa aatcctattt ataagagtgc 2461 cgtaacaact gtggtcaatc cgaagtatga gggaaaatga gtactgcccg tgcaaatccc 2521 acaacactga atgcaaagta gcaatttcca tagtcacagt taggtagctt tagggcaata 2581 ttgccatggt tttactcatg tgcaggtttt gaaaatgtac aatatgtata atttttaaaa 2641 tgttttatta ttttgaaaat aatgttgtaa ttcatgccag ggactgacaa aagacttgag 2701 acaggatggt tattcttgtc agctaaggtc acattgtgcc tttttgacct tttcttcctg 2761 gactattgaa atcaagctta ttggattaag tgatatttct atagcgattg aaagggcaat 2821 agttaaagta atgagcatga tgagagtttc tgttaatcat gtattaaaac tgatttttag 2881 ctttacatat gtcagtttgc agttatgcag aatccaaagt aaatgtcctg ctagctagtt 2941 aaggattgtt ttaaatctgt tattttgcta tttgcctgtt agacatgact gatgacatat 3001 ctgaaagaca agtatgttga gagttgctgg tgtaaaatac gtttgaaata gttgatctac 3061 aaaggccatg ggaaaaattc agagagttag gaaggaaaaa ccaatagctt taaaacctgt 3121 gtgccatttt aagagttact taatgtttgg taacttttat gccttcactt tacaaattca 3181 agccttagat aaaagaaccg agcaattttc tgctaaaaag tccttgattt agcactattt 3241 acatacaggc catactttac aaagtatttg ctgaatgggg accttttgag ttgaatttat 3301 tttattattt ttattttgtt taatgtctgg tgctttctat cacctcttct aatcttttaa 3361 tgtatttgtt tgcaattttg gggtaagact tttttatgag tactttttct ttgaagtttt 3421 agcggtcaat ttgccttttt aatgaacatg tgaagttata ctgtggctat gcaacagctc 3481 tcacctacgc gagtcttact ttgagttagt gccataacag accactgtat gtttacttct 3541 caccatttga gttgcccatc ttgtttcaca ctagtcacat tcttgtttta agtgccttta 3601 gttttaacag ttca // LOCUS HSFRA1M 954 bp RNA PRI 12-SEP-1993 DEFINITION Human fra-1 mRNA. ACCESSION X16707 NID g31462 KEYWORDS fos-related gene; fra-1 gene; leucine-zipper. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 954) AUTHORS Matsui,M. TITLE Direct Submission JOURNAL Submitted (30-OCT-1989) Matsui M., Nippon Veterinary and Zootechnical College, Molecular Oncology Laboratory, 1-10-19 Ueno Sakuragi, Taito ku Tokyo 110, Japan REFERENCE 2 (bases 1 to 954) AUTHORS Matsui,M., Tokuhara,M., Konuma,Y., Nomura,N. and Ishizaki,R. TITLE Isolation of human fos-related genes and their expression during monocyte-macrophage differentiation JOURNAL Oncogene 5 (3), 249-255 (1990) MEDLINE 90191709 COMMENT See also for fra-2 mRNA. FEATURES Location/Qualifiers source 1..954 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="umbilical vein" /cell_type="endothelial" CDS 35..850 /note="fra-1 gene product (AA 1-271)" /codon_start=1 /db_xref="PID:g31463" /db_xref="SWISS-PROT:P15407" /translation="MFRDFGEPGPSSGNGGGYGGPAQPPAAAQAAQQKFHLVPSINTM SGSQELQWMVQPHFLGPSSYPRPLTYPQYSPPQPRPGVIRALGPPPGVRRRPCEQISP EEEERRRVRRERNKLAAAKCRNRRKELTDFLQAETDKLEDEKSGLQREIEELQKQKER LELVLEAHRPICKIPEGAKEGDTGSTSGTSSPPAPCRPVPCISLSPGPVLEPEALHTP TLMTTPSLTPFTPSLVFTYPSTPEPCASAHRKSSSSSGDPSSDPLGSPTLLAL" BASE COUNT 201 a 349 c 264 g 140 t ORIGIN 1 agccgtgtac cccgcagagc cgccagcccc gggcatgttc cgagacttcg gggaacccgg 61 cccgagctcc gggaacggcg gcgggtacgg cggccccgcg cagcccccgg ccgcagcgca 121 ggcagcccag cagaagttcc acctggtgcc aagcatcaac accatgagtg gcagtcagga 181 gctgcagtgg atggtacagc ctcatttcct ggggcccagc agttacccca ggcctctgac 241 ctaccctcag tacagccccc cacaaccccg gccaggagtc atccgggccc tggggccgcc 301 tccaggggta cgtcgaaggc cttgtgaaca gatcagcccg gaggaagagg agcgccgccg 361 agtaaggcgc gagcggaaca agctggctgc ggccaagtgc aggaaccgga ggaaggaact 421 gaccgacttc ctgcaggcgg agactgacaa actggaagat gagaaatctg ggctgcagcg 481 agagattgag gagctgcaga agcagaagga gcgcctagag ctggtgctgg aagcccaccg 541 acccatctgc aaaatcccgg aaggagccaa ggagggggac acaggcagta ccagtggcac 601 cagcagccca ccagccccct gccgccctgt accttgtatc tccctttccc cagggcctgt 661 gcttgaacct gaggcactgc acacccccac actcatgacc acaccctccc taactccttt 721 cacccccagc ctggtcttca cctaccccag cactcctgag ccttgtgcct cagctcatcg 781 caagagtagc agcagcagcg gagacccatc ctctgacccc cttggctctc caaccctcct 841 cgctttgtga ggcgcctgag ccctactccc tgcagatgcc accctagcca atgtctcctc 901 cccttccccc accggtccag ctggcctgga cagtatccca catccaactc cagc // LOCUS HSFRA2M 1007 bp RNA PRI 12-SEP-1993 DEFINITION Human fra-2 mRNA. ACCESSION X16706 NID g31464 KEYWORDS fos-related gene; fra-2 gene; leucine-zipper. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1007) AUTHORS Matsui,M. TITLE Direct Submission JOURNAL Submitted (30-OCT-1989) Matsui M., Nippon Veterinary and Zootechnical College, Molecular Oncology Laboratory, 1-10-19 Ueno Sakuragi, Taito ku Tokyo 110, Japan REFERENCE 2 (bases 1 to 1007) AUTHORS Matsui,M., Tokuhara,M., Konuma,Y., Nomura,N. and Ishizaki,R. TITLE Isolation of human fos-related genes and their expression during monocyte-macrophage differentiation JOURNAL Oncogene 5 (3), 249-255 (1990) MEDLINE 90191709 COMMENT See for fra-1 mRNA. FEATURES Location/Qualifiers source 1..1007 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="umbilical vein" /cell_type="endothelial" CDS 4..984 /note="fra-2 gene product (AA 1-326)" /codon_start=1 /db_xref="PID:g31465" /db_xref="SWISS-PROT:P15408" /translation="MYQDYPGNFDTSSRGSSGSPAHAESYSSGGGGQQKFRVDMPGSG SAFIPTINAITTSQDLQWMVQPTVITSMSNPYPRSHPYSPLPGLASVPGHMALPRPGV IKTIGTTVGRRRRDEQLSPEEEEKRRIRRERNKLAAAKCRNRRRELTEKLQAETEELE EEKSGLQKEIAELQKEKEKLEFMLVAHGPVCKISPEERRSPPAPGLQPMRSGGGSVGA VVVKQEPLEEDSPSSSSAGLDKAQRSVIKPISIAGGFYGEEPLHTPIVVTSTPAVTPG TSNLVFTYPSVLEQESPASPSESCSKAHRRSSSSGDQSSDSLNSPTLLAL" BASE COUNT 208 a 335 c 303 g 161 t ORIGIN 1 atcatgtacc aggattatcc cgggaacttt gacacctcgt cccggggcag cagcggctct 61 cctgcgcacg ccgagtccta ctccagcggc ggcggcggcc agcagaaatt ccgggtagat 121 atgcctggct caggcagtgc attcatcccc accatcaacg ccatcacgac cagccaggac 181 ctgcagtgga tggtgcagcc cacagtgatc acctccatgt ccaacccata ccctcgctcg 241 cacccctaca gccccctgcc gggcctggcc tctgtccctg gacacatggc cctcccaaga 301 cctggcgtga tcaagaccat tggcaccacc gtgggccgca ggaggagaga tgagcagctg 361 tctcctgaag aggaggagaa gcgtcgcatc cggcgggaga ggaacaagct ggctgcagcc 421 aagtgccgga accgacgccg ggagctgaca gagaagctgc aggcggagac agaggagctg 481 gaggaggaga agtcaggcct gcagaaggag attgctgagc tgcagaagga gaaggagaag 541 ctggagttca tgttggtggc tcacggccca gtgtgcaaga ttagccccga ggagcgccga 601 tcgcccccag cccctgggct gcagcccatg cgcagtgggg gtggctcggt gggcgctgta 661 gtggtgaaac aggagcccct ggaagaggac agcccctcgt cctcgtcggc ggggctggac 721 aaggcccagc gctctgtcat caagcccatc agcattgctg ggggcttcta cggtgaggag 781 cccctgcaca cccccatcgt ggtgacctcc acacctgctg tcactccggg cacctcgaac 841 ctcgtcttca cctatcctag cgtcctggag caggagtcac ccgcatctcc ctccgaatcc 901 tgctccaagg ctcaccgcag aagcagtagc agcggggacc aatcatcaga ctccttgaac 961 tcccccactc tgctggctct gtaacccagt gcacctccct ccggagc // LOCUS HSFRUCBIP 1332 bp RNA PRI 05-NOV-1997 DEFINITION Homo sapiens mRNA for fructose-1,6-bisphosphatase. ACCESSION Y10812 NID g2154754 KEYWORDS fructose-1,6-bisphosphatase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1332) AUTHORS Tillmann,H. and Eschrich,K. JOURNAL Unpublished REFERENCE 2 (bases 1 to 1332) AUTHORS Eschrich,K. TITLE Direct Submission JOURNAL Submitted (27-JAN-1997) K. Eschrich, University of Leipzig, Institute of Biochemistry, Liebigstrasse 16, Leipzig, D-04103, FRG FEATURES Location/Qualifiers source 1..1332 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="skeletal muscle" /clone_lib="HL1124b, Clontech" CDS 68..1087 /EC_number="3.1.3.11" /codon_start=1 /product="fructose-1,6-bisphosphatase" /db_xref="PID:e1169817" /db_xref="PID:g2154755" /translation="MTDRSPFETDMLTLTRYVMEKGRQAKGTGELTQLLNSMLTAIKA ISSAVRKAGLAHLYGIAGSVNVTGDEVKKLDVLSNSLVINMLQSSYSTCVLVSEENKD AIITAKEKRGKYVVCFDPLDGSSNIDCLASIGTIFAIYRKTSEDEPSEKDALQCGRNI VAAGYALYGSATLVALSTGQGVDLFMLDPALGEFVLVEKDVKIKKKGKIYSLNEGYAK YFDAATTEYVQKKKFPEDGSAPYGARYVGSMVADVHRTLVYGGIFLYPANQKSPKGKL RLLYECNPVAYIIEQAGGLATTGTQPVLDVKPEAIHQRVPLILGSPEDVQEYLTCVQK NQAGS" BASE COUNT 378 a 330 c 350 g 274 t ORIGIN 1 gctgcagccc tcagaagtaa gcaaggtttc ctgccgggag aaaaggattt gaagcattcc 61 agccaaaatg acggacagaa gccccttcga aaccgacatg ctcaccctga cccgctacgt 121 tatggaaaag gggcgtcagg ccaaagggac tggggagctc acccagctgc tgaactcaat 181 gctgacggcc atcaaagcca tctcctcggc tgtgcgcaag gccggtctgg cccacctgta 241 tggaatcgca ggaagcgtta acgtgacggg agatgaggtg aagaaactgg atgtgctatc 301 caattccctg gtgatcaaca tgctccaatc ctcctatagt acctgcgtcc tggtctcaga 361 agagaataag gacgccatca tcaccgccaa ggagaagcgg gggaaatacg tggtctgctt 421 tgacccactg gatggatctt ccaatattga ctgcctggcc tccatcggaa ccatctttgc 481 catctataga aagacctcag aggatgagcc ttctgaaaag gatgccctgc agtgtggccg 541 caatattgtg gccgcaggtt atgcgctgta cggtagtgca accctggtgg ctctctccac 601 agggcaaggc gtggacctct tcatgcttga cccggctctt ggtgaatttg tcctggtgga 661 aaaagatgtc aagattaaga agaaaggaaa gatttacagc ctgaatgagg gctatgccaa 721 gtattttgat gcggccacca ctgaatatgt gcagaaaaag aaattccctg aggatggcag 781 tgctccctat ggggccaggt atgtgggctc catggtggct gacgtgcacc gcaccctggt 841 ctatggagga atcttcctgt acccagccaa ccagaagagc cctaagggca agctccggct 901 cctgtatgaa tgcaatcccg tggcctacat cattgagcag gcaggaggct tggcgaccac 961 ggggacccag cctgtactgg acgtgaagcc cgaggcaatt caccagcgag tccccctcat 1021 tctggggtca ccagaggatg tgcaggaata tctcacctgt gtgcagaaaa atcaggcagg 1081 cagctagcga gtttgacccc acatgccctc ttctgtttgt tttgcacctt gtctaaggac 1141 cctaaatgaa cgataaacag agatggtagc tatgagtatg caaaaggtaa atccacttaa 1201 tcacatacag aagagcaaca acaaactgct tacgacaggt ttggaagcca caggcgattc 1261 tatggtcaat gtgaaggact agaaataaaa acccacatgt ggaaaaaaaa aaaaaaaaaa 1321 aaaaaaaaaa aa // LOCUS HSFUR 4180 bp RNA PRI 12-SEP-1993 DEFINITION Human fur mRNA for furin. ACCESSION X17094 NID g31477 KEYWORDS fur gene; furin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4180) AUTHORS Van den Ouweland,A.M.W. TITLE Direct Submission JOURNAL Submitted (07-NOV-1989) Van den Ouweland A.M.W., University of Nijmegeen, Molecular Oncology Section, Department of Biochemistry, St. Adelbertusplein 1, 6500 HB Nijmegen, The Netherlands REFERENCE 2 (bases 1 to 4180) AUTHORS van den Ouweland,A.M., van Duijnhoven,H.L., Keizer,G.D., Dorssers,L.C. and Van de Ven,W.J. TITLE Structural homology between the human fur gene product and the subtilisin-like protease encoded by yeast KEX2 JOURNAL Nucleic Acids Res. 18 (3), 664 (1990) MEDLINE 90175002 REMARK Erratum:[Nucleic Acids Res 1990 Mar 11;18(5):1332]] COMMENT See for furin gene; See also for furin C' terminus mRNA. Data kindly reviewed (22-MAR-1990) by Ouweland A.v.d. FEATURES Location/Qualifiers source 1..4180 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="blood cell" /cell_line="KG-1, ML-1, MCF-7" /clone_lib="lambda gt11" CDS 217..2601 /note="furin (AA 1-794)" /codon_start=1 /db_xref="PID:g31478" /db_xref="SWISS-PROT:P09958" /translation="MELRPWLLWVVAATGTLVLLAADAQGQKVFTNTWAVRIPGGPAV ANSVARKHGFLNLGQIFGDYYHFWHRGVTKRSLSPHRPRHSRLQREPQVQWLEQQVAK RRTKRDVYQEPTDPKFPQQWYLSGVTQRDLNVKAAWAQGYTGHGIVVSILDDGIEKNH PDLAGNYDPGASFDVNDQDPDPQPRYTQMNDNRHGTRCAGEVAAVANNGVCGVGVAYN ARIGGVRMLDGEVTDAVEARSLGLNPNHIHIYSASWGPEDDGKTVDGPARLAEEAFFR GVSQGRGGLGSIFVWASGNGGREHDSCNCDGYTNSIYTLSISSATQFGNVPWYSEACS STLATTYSSGNQNEKQIVTTDLRQKCTESHTGTSASAPLAAGIIALTLEANKNLTWRD MQHLVVQTSKPAHLNANDWATNGVGRKVSHSYGYGLLDAGAMVALAQNWTTVAPQRKC IIDILTEPKDIGKRLEVRKTVTACLGEPNHITRLEHAQARLTLSYNRRGDLAIHLVSP MGTRSTLLAARPHDYSADGFNDWAFMTTHSWDEDPSGEWVLEIENTSEANNYGTLTKF TLVLYGTAPEGLPVPPESSGCKTLTSSQACVVCEEGFSLHQKSCVQHCPPGFAPQVLD THYSTENDVETIRASVCAPCHASCATCQGPALTDCLSCPSHASLDPVEQTCSRQSQSS RESPPQQQPPRLPPEVEAGQRLRAGLLPSHLPEVVAGLSCAFIVLVFVTVFLVLQLRS GFSFRGVKVYTMDRGLISYKGLPPEAWQEECPSDSEEDEGRGERTAFIKDQSAL" BASE COUNT 794 a 1335 c 1255 g 796 t ORIGIN 1 gcggggaagc agcagcggcc aggatgaatc ccaggtgctc tggagctgga tggtgaaggt 61 cggcactctt caccctcccg agccctgccc gtctcggccc catgccccca ccagtcagcc 121 ccgggccaca ggcagtgagc aggcacctgg gagccgaggc cctatgacca ggccaaggag 181 acgggcgctc cagggtccca gccacctgtc ccccccatgg agctgaggcc ctggttgcta 241 tgggtggtag cagcaacagg aaccttggtc ctgctagcag ctgatgctca gggccagaag 301 gtcttcacca acacgtgggc tgtgcgcatc cctggaggcc cagcggtggc caacagtgtg 361 gcacggaagc atgggttcct caacctgggc cagatcttcg gggactatta ccacttctgg 421 catcgaggag tgacgaagcg gtccctgtcg cctcaccgcc cgcggcacag ccggctgcag 481 agggagcctc aagtacagtg gctggaacag caggtggcaa agcgacggac taaacgggac 541 gtgtaccagg agcccacaga ccccaagttt cctcagcagt ggtacctgtc tggtgtcact 601 cagcgggacc tgaatgtgaa ggcggcctgg gcgcagggct acacagggca cggcattgtg 661 gtctccattc tggacgatgg catcgagaag aaccacccgg acttggcagg caattatgat 721 cctggggcca gttttgatgt caatgaccag gaccctgacc cccagcctcg gtacacacag 781 atgaatgaca acaggcacgg cacacggtgt gcgggggaag tggctgcggt ggccaacaac 841 ggtgtctgtg gtgtaggtgt ggcctacaac gcccgcattg gaggggtgcg catgctggat 901 ggcgaggtga cagatgcagt ggaggcacgc tcgctgggcc tgaaccccaa ccacatccac 961 atctacagtg ccagctgggg ccccgaggat gacggcaaga cagtggatgg gccagcccgc 1021 ctcgccgagg aggccttctt ccgtggggtt agccagggcc gaggggggct gggctccatc 1081 tttgtctggg cctcggggaa cgggggccgg gaacatgaca gctgcaactg cgacggctac 1141 accaacagta tctacacgct gtccatcagc agcgccacgc agtttggcaa cgtgccgtgg 1201 tacagcgagg cctgctcgtc cacactggcc acgacctaca gcagtggcaa ccagaatgag 1261 aagcagatcg tgacgactga cttgcggcag aagtgcacgg agtctcacac gggcacctca 1321 gcctctgccc ccttagcagc cggcatcatt gctctcaccc tggaggccaa taagaacctc 1381 acatggcggg acatgcaaca cctggtggta cagacctcga agccagccca cctcaatgcc 1441 aacgactggg ccaccaatgg tgtgggccgg aaagtgagcc actcatatgg ctacgggctt 1501 ttggacgcag gcgccatggt ggccctggcc cagaattgga ccacagtggc cccccagcgg 1561 aagtgcatca tcgacatcct caccgagccc aaagacatcg ggaaacggct cgaggtgcgg 1621 aagaccgtga ccgcgtgcct gggcgagccc aaccacatca ctcggctgga gcacgctcag 1681 gcgcggctca ccctgtccta taatcgccgt ggcgacctgg ccatccacct ggtcagcccc 1741 atgggcaccc gctccaccct gctggcagcc aggccacatg actactccgc agatgggttt 1801 aatgactggg ccttcatgac aactcattcc tgggatgagg atccctctgg cgagtgggtc 1861 ctagagattg aaaacaccag cgaagccaac aactatggga cgctgaccaa gttcaccctc 1921 gtactctatg gcaccgcccc tgaggggctg cccgtacctc cagaaagcag tggctgcaag 1981 accctcacgt ccagtcaggc ctgtgtggtg tgcgaggaag gcttctccct gcaccagaag 2041 agctgtgtcc agcactgccc tccaggcttc gccccccaag tcctcgatac gcactatagc 2101 accgagaatg acgtggagac catccgggcc agcgtctgcg ccccctgcca cgcctcatgt 2161 gccacatgcc aggggccggc cctgacagac tgcctcagct gccccagcca cgcctccttg 2221 gaccctgtgg agcagacttg ctcccggcaa agccagagca gccgagagtc cccgccacag 2281 cagcagccac ctcggctgcc cccggaggtg gaggcggggc aacggctgcg ggcagggctg 2341 ctgccctcac acctgcctga ggtggtggcc ggcctcagct gcgccttcat cgtgctggtc 2401 ttcgtcactg tcttcctggt cctgcagctg cgctctggct ttagttttcg gggggtgaag 2461 gtgtacacca tggaccgtgg cctcatctcc tacaaggggc tgccccctga agcctggcag 2521 gaggagtgcc cgtctgactc agaagaggac gagggccggg gcgagaggac cgcctttatc 2581 aaagaccaga gcgccctctg atgagcccac tgcccacccc ctcaagccaa tcccctcctt 2641 gggcactttt taattcacca aagtattttt ttatcttggg actgggtttg gaccccagct 2701 gggaggcaag aggggtggag actgtttccc atcctaccct cgggcccacc tggccacctg 2761 aggtgggccc aggaccagct ggggcgtggg gagggccgta ccccaccctc agcacccctt 2821 ccatgtggag aaaggagtga aacctttagg gcagcttgcc ccggccccgg ccccagccag 2881 agttcctgcg gagtgaagag gggcagccct tgcttgttgg gattcctgac ccaggccgca 2941 gctcttgccc ttccctgtcc ctctaaagca ataatggtcc catccaggca gtcgggggct 3001 ggcctaggag atatctgagg gaggaggcca cctctccaag ggcttctgca ccctccaccc 3061 tgtcccccag ctctggtgag tcttggcggc agcagccatc ataggaaggg accaaggcaa 3121 ggcaggtgcc tccaggtgtg cacgtggcat gtggcctgtg gcctgtgtcc catgacccac 3181 ccctgtgctc cgtgcctcca ccaccactgg ccaccaggct ggcgcagcca aggccgaagc 3241 tctggctgaa ccctgtgctg gtgtcctgac caccctcccc tctcttgcac ccgcctctcc 3301 cgtcagggcc caagtccctg ttttctgagc ccgggctgcc tgggctgttg gcactcacag 3361 acctggagcc cctgggtggg tggtggggag gggcgctggc ccagccggcc tctctggcct 3421 cccacccgat gctgctttcc cctgtgggga tctcaggggc tgtttgagga tatattttca 3481 ctttgtgatt atttcacttt agatgctgat gatttgtttt tgtattttta atgggggtag 3541 cagctggact acccacgttc tcacacccac cgtccgccct gctcctccct ggctgccctg 3601 gccctgaggt gtgggggctg cagcatgttg ctgaggagtg aggaatagtt gagccccaag 3661 tcctgaagag gcgggccagc caggcgggct caaggaaagg gggtcccagt gggaggggca 3721 ggctgacatc tgtgtttcaa gtggggctcg ccatgccggg ggttcatagg tcactggctc 3781 tccaagtgcc agaggtgggc aggtggtggc actgagcccc cccaacactg tgccctggtg 3841 gagaaagcac tgacctgtca tgcccccctc aaacctcctc ttctgacgtg ccttttgcac 3901 ccctcccatt aggacaatca gtcccctccc atctgggagt ccccttttct tttctaccct 3961 agccattcct ggtacccagc catctgccca ggggtgcccc ctcctctccc atccccctgc 4021 cctcgtggcc agcccggctg gttttgtaag atactgggtt ggtgcacagt gatttttttc 4081 ttgtaattta aacaggccca gcattgctgg ttctatttaa tggacatgag ataatgttag 4141 aggttttaaa gtgattaaac gtgcagacta tgcaaaccag // LOCUS HSFVIIIR 8967 bp RNA PRI 21-MAR-1995 DEFINITION Human mRNA for factor VIII. ACCESSION X01179 NID g31498 KEYWORDS factor VIII; signal peptide. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8967) AUTHORS Wood,W.I., Capon,D.J., Simonsen,C.C., Eaton,D.L., Gitschier,J., Keyt,B., Seeburg,P.H., Smith,D.H., Hollingshead,P., Wion,K.L., Delwart,E., Tuddenham,E.G.D., Vehar,G.A. and Lawn,R.M. TITLE Expression of active human factor VIII from recombinant DNA clones JOURNAL Nature 312 (5992), 330-337 (1984) MEDLINE 85061548 COMMENT Data kindly reviewed (20-MAR-1986) by W. Wood. FEATURES Location/Qualifiers source 1..8967 /organism="Homo sapiens" /db_xref="taxon:9606" misc_feature <1..109 /note="5' untranslated region" sig_peptide 110..166 /note="signal peptide (aa -19 to -1)" CDS 110..7165 /note="factor VIII precursor" /codon_start=1 /db_xref="PID:g31499" /translation="MQIELSTCFFLCLLRFCFSATRRYYLGAVELSWDYMQSDLGELP VDARFPPRVPKSFPFNTSVVYKKTLFVEFTDHLFNIAKPRPPWMGLLGPTIQAEVYDT VVITLKNMASHPVSLHAVGVSYWKASEGAEYDDQTSQREKEDDKVFPGGSHTYVWQVL KENGPMASDPLCLTYSYLSHVDLVKDLNSGLIGALLVCREGSLAKEKTQTLHKFILLF AVFDEGKSWHSETKNSLMQDRDAASARAWPKMHTVNGYVNRSLPGLIGCHRKSVYWHV IGMGTTPEVHSIFLEGHTFLVRNHRQASLEISPITFLTAQTLLMDLGQFLLFCHISSH QHDGMEAYVKVDSCPEEPQLRMKNNEEAEDYDDDLTDSEMDVVRFDDDNSPSFIQIRS VAKKHPKTWVHYIAAEEEDWDYAPLVLAPDDRSYKSQYLNNGPQRIGRKYKKVRFMAY TDETFKTREAIQHESGILGPLLYGEVGDTLLIIFKNQASRPYNIYPHGITDVRPLYSR RLPKGVKHLKDFPILPGEIFKYKWTVTVEDGPTKSDPRCLTRYYSSFVNMERDLASGL IGPLLICYKESVDQRGNQIMSDKRNVILFSVFDENRSWYLTENIQRFLPNPAGVQLED PEFQASNIMHSINGYVFDSLQLSVCLHEVAYWYILSIGAQTDFLSVFFSGYTFKHKMV YEDTLTLFPFSGETVFMSMENPGLWILGCHNSDFRNRGMTALLKVSSCDKNTGDYYED SYEDISAYLLSKNNAIEPRSFSQNSRHRSTRQKQFNATTIPENDIEKTDPWFAHRTPM PKIQNVSSSDLLMLLRQSPTPHGLSLSDLQEAKYETFSDDPSPGAIDSNNSLSEMTHF RPQLHHSGDMVFTPESGLQLRLNEKLGTTAATELKKLDFKVSSTSNNLISTIPSDNLA AGTDNTSSLGPPSMPVHYDSQLDTTLFGKKSSPLTESGGPLSLSEENNDSKLLESGLM NSQESSWGKNVSSTESGRLFKGKRAHGPALLTKDNALFKVSISLLKTNKTSNNSATNR KTHIDGPSLLIENSPSVWQNILESDTEFKKVTPLIHDRMLMDKNATALRLNHMSNKTT SSKNMEMVQQKKEGPIPPDAQNPDMSFFKMLFLPESARWIQRTHGKNSLNSGQGPSPK QLVSLGPEKSVEGQNFLSEKNKVVVGKGEFTKDVGLKEMVFPSSRNLFLTNLDNLHEN NTHNQEKKIQEEIEKKETLIQENVVLPQIHTVTGTKNFMKNLFLLSTRQNVEGSYDGA YAPVLQDFRSLNDSTNRTKKHTAHFSKKGEEENLEGLGNQTKQIVEKYACTTRISPNT SQQNFVTQRSKRALKQFRLPLEETELEKRIIVDDTSTQWSKNMKHLTPSTLTQIDYNE KEKGAITQSPLSDCLTRSHSIPQANRSPLPIAKVSSFPSIRPIYLTRVLFQDNSSHLP AASYRKKDSGVQESSHFLQGAKKNNLSLAILTLEMTGDQREVGSLGTSATNSVTYKKV ENTVLPKPDLPKTSGKVELLPKVHIYQKDLFPTETSNGSPGHLDLVEGSLLQGTEGAI KWNEANRPGKVPFLRVATESSAKTPSKLLDPLAWDNHYGTQIPKEEWKSQEKSPEKTA FKKKDTILSLNACESNHAIAAINEGQNKPEIEVTWAKQGRTERLCSQNPPVLKRHQRE ITRTTLQSDQEEIDYDDTISVEMKKEDFDIYDEDENQSPRSFQKKTRHYFIAAVERLW DYGMSSSPHVLRNRAQSGSVPQFKKVVFQEFTDGSFTQPLYRGELNEHLGLLGPYIRA EVEDNIMVTFRNQASRPYSFYSSLISYEEDQRQGAEPRKNFVKPNETKTYFWKVQHHM APTKDEFDCKAWAYFSDVDLEKDVHSGLIGPLLVCHTNTLNPAHGRQVTVQEFALFFT IFDETKSWYFTENMERNCRAPCNIQMEDPTFKENYRFHAINGYIMDTLPGLVMAQDQR IRWYLLSMGSNENIHSIHFSGHVFTVRKKEEYKMALYNLYPGVFETVEMLPSKAGIWR VECLIGEHLHAGMSTLFLVYSNKCQTPLGMASGHIRDFQITASGQYGQWAPKLARLHY SGSINAWSTKEPFSWIKVDLLAPMIIHGIKTQGARQKFSSLYISQFIIMYSLDGKKWQ TYRGNSTGTLMVFFGNVDSSGIKHNIFNPPIIARYIRLHPTHYSIRSTLRMELMGCDL NSCSMPLGMESKAISDAQITASSYFTNMFATWSPSKARLHLQGRSNAWRPQVNNPKEW LQVDFQKTMKVTGVTTQGVKSLLTSMYVKEFLISSSQDGHQWTLFFQNGKVKVFQGNQ DSFTPVVNSLDPPLLTRYLRIHPQSWVHQIALRMEVLGCEAQDLY" mat_peptide 167..7162 /note="mature factor VIII (aa 1-2332)" misc_feature 7163..8967 /note="3' untranslated region" misc_feature 8948..8953 /note="polyadenylation signal" polyA_site 8967 /note="polyA site" BASE COUNT 2841 a 1898 c 1833 g 2395 t ORIGIN 1 cttttcatta aatcagaaat tttacttttt tcccctcctg ggagctaaag atattttaga 61 gaagaattaa ccttttgctt ctccagttga acatttgtag caataagtca tgcaaataga 121 gctctccacc tgcttctttc tgtgcctttt gcgattctgc tttagtgcca ccagaagata 181 ctacctgggt gcagtggaac tgtcatggga ctatatgcaa agtgatctcg gtgagctgcc 241 tgtggacgca agatttcctc ctagagtgcc aaaatctttt ccattcaaca cctcagtcgt 301 gtacaaaaag actctgtttg tagaattcac ggatcacctt ttcaacatcg ctaagccaag 361 gccaccctgg atgggtctgc taggtcctac catccaggct gaggtttatg atacagtggt 421 cattacactt aagaacatgg cttcccatcc tgtcagtctt catgctgttg gtgtatccta 481 ctggaaagct tctgagggag ctgaatatga tgatcagacc agtcaaaggg agaaagaaga 541 tgataaagtc ttccctggtg gaagccatac atatgtctgg caggtcctga aagagaatgg 601 tccaatggcc tctgacccac tgtgccttac ctactcatat ctttctcatg tggacctggt 661 aaaagacttg aattcaggcc tcattggagc cctactagta tgtagagaag ggagtctggc 721 caaggaaaag acacagacct tgcacaaatt tatactactt tttgctgtat ttgatgaagg 781 gaaaagttgg cactcagaaa caaagaactc cttgatgcag gatagggatg ctgcatctgc 841 tcgggcctgg cctaaaatgc acacagtcaa tggttatgta aacaggtctc tgccaggtct 901 gattggatgc cacaggaaat cagtctattg gcatgtgatt ggaatgggca ccactcctga 961 agtgcactca atattcctcg aaggtcacac atttcttgtg aggaaccatc gccaggcgtc 1021 cttggaaatc tcgccaataa ctttccttac tgctcaaaca ctcttgatgg accttggaca 1081 gtttctactg ttttgtcata tctcttccca ccaacatgat ggcatggaag cttatgtcaa 1141 agtagacagc tgtccagagg aaccccaact acgaatgaaa aataatgaag aagcggaaga 1201 ctatgatgat gatcttactg attctgaaat ggatgtggtc aggtttgatg atgacaactc 1261 tccttccttt atccaaattc gctcagttgc caagaagcat cctaaaactt gggtacatta 1321 cattgctgct gaagaggagg actgggacta tgctccctta gtcctcgccc ccgatgacag 1381 aagttataaa agtcaatatt tgaacaatgg ccctcagcgg attggtagga agtacaaaaa 1441 agtccgattt atggcataca cagatgaaac ctttaagact cgtgaagcta ttcagcatga 1501 atcaggaatc ttgggacctt tactttatgg ggaagttgga gacacactgt tgattatatt 1561 taagaatcaa gcaagcagac catataacat ctaccctcac ggaatcactg atgtccgtcc 1621 tttgtattca aggagattac caaaaggtgt aaaacatttg aaggattttc caattctgcc 1681 aggagaaata ttcaaatata aatggacagt gactgtagaa gatgggccaa ctaaatcaga 1741 tcctcggtgc ctgacccgct attactctag tttcgttaat atggagagag atctagcttc 1801 aggactcatt ggccctctcc tcatctgcta caaagaatct gtagatcaaa gaggaaacca 1861 gataatgtca gacaagagga atgtcatcct gttttctgta tttgatgaga accgaagctg 1921 gtacctcaca gagaatatac aacgctttct ccccaatcca gctggagtgc agcttgagga 1981 tccagagttc caagcctcca acatcatgca cagcatcaat ggctatgttt ttgatagttt 2041 gcagttgtca gtttgtttgc atgaggtggc atactggtac attctaagca ttggagcaca 2101 gactgacttc ctttctgtct tcttctctgg atataccttc aaacacaaaa tggtctatga 2161 agacacactc accctattcc cattctcagg agaaactgtc ttcatgtcga tggaaaaccc 2221 aggtctatgg attctggggt gccacaactc agactttcgg aacagaggca tgaccgcctt 2281 actgaaggtt tctagttgtg acaagaacac tggtgattat tacgaggaca gttatgaaga 2341 tatttcagca tacttgctga gtaaaaacaa tgccattgaa ccaagaagct tctcccagaa 2401 ttcaagacac cgtagcacta ggcaaaagca atttaatgcc accacaattc cagaaaatga 2461 catagagaag actgaccctt ggtttgcaca cagaacacct atgcctaaaa tacaaaatgt 2521 ctcctctagt gatttgttga tgctcttgcg acagagtcct actccacatg ggctatcctt 2581 atctgatctc caagaagcca aatatgagac tttttctgat gatccatcac ctggagcaat 2641 agacagtaat aacagcctgt ctgaaatgac acacttcagg ccacagctcc atcacagtgg 2701 ggacatggta tttacccctg agtcaggcct ccaattaaga ttaaatgaga aactggggac 2761 aactgcagca acagagttga agaaacttga tttcaaagtt tctagtacat caaataatct 2821 gatttcaaca attccatcag acaatttggc agcaggtact gataatacaa gttccttagg 2881 acccccaagt atgccagttc attatgatag tcaattagat accactctat ttggcaaaaa 2941 gtcatctccc cttactgagt ctggtggacc tctgagcttg agtgaagaaa ataatgattc 3001 aaagttgtta gaatcaggtt taatgaatag ccaagaaagt tcatggggaa aaaatgtatc 3061 gtcaacagag agtggtaggt tatttaaagg gaaaagagct catggacctg ctttgttgac 3121 taaagataat gccttattca aagttagcat ctctttgtta aagacaaaca aaacttccaa 3181 taattcagca actaatagaa agactcacat tgatggccca tcattattaa ttgagaatag 3241 tccatcagtc tggcaaaata tattagaaag tgacactgag tttaaaaaag tgacaccttt 3301 gattcatgac agaatgctta tggacaaaaa tgctacagct ttgaggctaa atcatatgtc 3361 aaataaaact acttcatcaa aaaacatgga aatggtccaa cagaaaaaag agggccccat 3421 tccaccagat gcacaaaatc cagatatgtc gttctttaag atgctattct tgccagaatc 3481 agcaaggtgg atacaaagga ctcatggaaa gaactctctg aactctgggc aaggccccag 3541 tccaaagcaa ttagtatcct taggaccaga aaaatctgtg gaaggtcaga atttcttgtc 3601 tgagaaaaac aaagtggtag taggaaaggg tgaatttaca aaggacgtag gactcaaaga 3661 gatggttttt ccaagcagca gaaacctatt tcttactaac ttggataatt tacatgaaaa 3721 taatacacac aatcaagaaa aaaaaattca ggaagaaata gaaaagaagg aaacattaat 3781 ccaagagaat gtagttttgc ctcagataca tacagtgact ggcactaaga atttcatgaa 3841 gaaccttttc ttactgagca ctaggcaaaa tgtagaaggt tcatatgacg gggcatatgc 3901 tccagtactt caagatttta ggtcattaaa tgattcaaca aatagaacaa agaaacacac 3961 agctcatttc tcaaaaaaag gggaggaaga aaacttggaa ggcttgggaa atcaaaccaa 4021 gcaaattgta gagaaatatg catgcaccac aaggatatct cctaatacaa gccagcagaa 4081 ttttgtcacg caacgtagta agagagcttt gaaacaattc agactcccac tagaagaaac 4141 agaacttgaa aaaaggataa ttgtggatga cacctcaacc cagtggtcca aaaacatgaa 4201 acatttgacc ccgagcaccc tcacacagat agactacaat gagaaggaga aaggggccat 4261 tactcagtct cccttatcag attgccttac gaggagtcat agcatccctc aagcaaatag 4321 atctccatta cccattgcaa aggtatcatc atttccatct attagaccta tatatctgac 4381 cagggtccta ttccaagaca actcttctca tcttccagca gcatcttata gaaagaaaga 4441 ttctggggtc caagaaagca gtcatttctt acaaggagcc aaaaaaaata acctttcttt 4501 agccattcta accttggaga tgactggtga tcaaagagag gttggctccc tggggacaag 4561 tgccacaaat tcagtcacat acaagaaagt tgagaacact gttctcccga aaccagactt 4621 gcccaaaaca tctggcaaag ttgaattgct tccaaaagtt cacatttatc agaaggacct 4681 attccctacg gaaactagca atgggtctcc tggccatctg gatctcgtgg aagggagcct 4741 tcttcaggga acagagggag cgattaagtg gaatgaagca aacagacctg gaaaagttcc 4801 ctttctgaga gtagcaacag aaagctctgc aaagactccc tccaagctat tggatcctct 4861 tgcttgggat aaccactatg gtactcagat accaaaagaa gagtggaaat cccaagagaa 4921 gtcaccagaa aaaacagctt ttaagaaaaa ggataccatt ttgtccctga acgcttgtga 4981 aagcaatcat gcaatagcag caataaatga gggacaaaat aagcccgaaa tagaagtcac 5041 ctgggcaaag caaggtagga ctgaaaggct gtgctctcaa aacccaccag tcttgaaacg 5101 ccatcaacgg gaaataactc gtactactct tcagtcagat caagaggaaa ttgactatga 5161 tgataccata tcagttgaaa tgaagaagga agattttgac atttatgatg aggatgaaaa 5221 tcagagcccc cgcagctttc aaaagaaaac acgacactat tttattgctg cagtggagag 5281 gctctgggat tatgggatga gtagctcccc acatgttcta agaaacaggg ctcagagtgg 5341 cagtgtccct cagttcaaga aagttgtttt ccaggaattt actgatggct cctttactca 5401 gcccttatac cgtggagaac taaatgaaca tttgggactc ctggggccat atataagagc 5461 agaagttgaa gataatatca tggtaacttt cagaaatcag gcctctcgtc cctattcctt 5521 ctattctagc cttatttctt atgaggaaga tcagaggcaa ggagcagaac ctagaaaaaa 5581 ctttgtcaag cctaatgaaa ccaaaactta cttttggaaa gtgcaacatc atatggcacc 5641 cactaaagat gagtttgact gcaaagcctg ggcttatttc tctgatgttg acctggaaaa 5701 agatgtgcac tcaggcctga ttggacccct tctggtctgc cacactaaca cactgaaccc 5761 tgctcatggg agacaagtga cagtacagga atttgctctg tttttcacca tctttgatga 5821 gaccaaaagc tggtacttca ctgaaaatat ggaaagaaac tgcagggctc cctgcaatat 5881 ccagatggaa gatcccactt ttaaagagaa ttatcgcttc catgcaatca atggctacat 5941 aatggataca ctacctggct tagtaatggc tcaggatcaa aggattcgat ggtatctgct 6001 cagcatgggc agcaatgaaa acatccattc tattcatttc agtggacatg tgttcactgt 6061 acgaaaaaaa gaggagtata aaatggcact gtacaatctc tatccaggtg tttttgagac 6121 agtggaaatg ttaccatcca aagctggaat ttggcgggtg gaatgcctta ttggcgagca 6181 tctacatgct gggatgagca cactttttct ggtgtacagc aataagtgtc agactcccct 6241 gggaatggct tctggacaca ttagagattt tcagattaca gcttcaggac aatatggaca 6301 gtgggcccca aagctggcca gacttcatta ttccggatca atcaatgcct ggagcaccaa 6361 ggagcccttt tcttggatca aggtggatct gttggcacca atgattattc acggcatcaa 6421 gacccagggt gcccgtcaga agttctccag cctctacatc tctcagttta tcatcatgta 6481 tagtcttgat gggaagaagt ggcagactta tcgaggaaat tccactggaa ccttaatggt 6541 cttctttggc aatgtggatt catctgggat aaaacacaat atttttaacc ctccaattat 6601 tgctcgatac atccgtttgc acccaactca ttatagcatt cgcagcactc ttcgcatgga 6661 gttgatgggc tgtgatttaa atagttgcag catgccattg ggaatggaga gtaaagcaat 6721 atcagatgca cagattactg cttcatccta ctttaccaat atgtttgcca cctggtctcc 6781 ttcaaaagct cgacttcacc tccaagggag gagtaatgcc tggagacctc aggtgaataa 6841 tccaaaagag tggctgcaag tggacttcca gaagacaatg aaagtcacag gagtaactac 6901 tcagggagta aaatctctgc ttaccagcat gtatgtgaag gagttcctca tctccagcag 6961 tcaagatggc catcagtgga ctctcttttt tcagaatggc aaagtaaagg tttttcaggg 7021 aaatcaagac tccttcacac ctgtggtgaa ctctctagac ccaccgttac tgactcgcta 7081 ccttcgaatt cacccccaga gttgggtgca ccagattgcc ctgaggatgg aggttctggg 7141 ctgcgaggca caggacctct actgagggtg gccactgcag cacctgccac tgccgtcacc 7201 tctccctcct cagctccagg gcagtgtccc tccctggctt gccttctacc tttgtgctaa 7261 atcctagcag acactgcctt gaagcctcct gaattaacta tcatcagtcc tgcatttctt 7321 tggtgggggg ccaggagggt gcatccaatt taacttaact cttacctatt ttctgcagct 7381 gctcccagat tactccttcc ttccaatata actaggcaaa aagaagtgag gagaaacctg 7441 catgaaagca ttcttccctg aaaagttagg cctctcagag tcaccacttc ctctgttgta 7501 gaaaaactat gtgatgaaac tttgaaaaag atatttatga tgttaacatt tcaggttaag 7561 cctcatacgt ttaaaataaa actctcagtt gtttattatc ctgatcaagc atggaacaaa 7621 gcatgtttca ggatcagatc aatacaatct tggagtcaaa aggcaaatca tttggacaat 7681 ctgcaaaatg gagagaatac aataactact acagtaaagt ctgtttctgc ttccttacac 7741 atagatataa ttatgttatt tagtcattat gaggggcaca ttcttatctc caaaactagc 7801 attcttaaac tgagaattat agatggggtt caagaatccc taagtcccct gaaattatat 7861 aaggcattct gtataaatgc aaatgtgcat ttttctgacg agtgtccata gatataaagc 7921 catttggtct taattctgac caataaaaaa ataagtcagg aggatgcaat tgttgaaagc 7981 tttgaaataa aataacaatg tcttcttgaa atttgtgatg gccaagaaag aaaatgatga 8041 tgacattagg cttctaaagg acatacattt aatatttctg tggaaatatg aggaaaatcc 8101 atggttatct gagataggag atacaaactt tgtaattcta ataatgcact cagtttactc 8161 tctccctcta ctaatttcct gctgaaaata acacaacaaa aatgtaacag gggaaattat 8221 ataccgtgac tgaaaactag agtcctactt acatagttga aatatcaagg aggtcagaag 8281 aaaattggac tggtgaaaac agaaaaaaca ctccagtctg ccatatcacc acacaatagg 8341 atcccccttc ttgccctcca cccccataag attgtgaagg gtttactgct ccttccatct 8401 gcctgacccc ttcactatga ctacacagaa tctcctgata gtaaaggggg ctggaggcaa 8461 ggataagtta tagagcagtt ggaggaagca tccaaagatt gcaacccagg gcaaatggaa 8521 aacaggagat cctaatatga aagaaaaatg gatcccaatc tgagaaaagg caaaagaatg 8581 gctacttttt tctatgctgg agtattttct aataatcctg cttgaccctt atctgacctc 8641 tttggaaact ataacatagc tgtcacagta tagtcacaat ccacaaatga tgcaggtgca 8701 aatggtttat agccctgtga agttcttaaa gtttagaggc taacttacag aaatgaataa 8761 gttgttttgt tttatagccc ggtagaggag ttaaccccaa aggtgatatg gttttatttc 8821 ctgttatgtt taacttgata atcttatttt ggcattcttt tcccattgac tatatacatc 8881 tctatttctc aaatgttcat ggaactagct cttttatttt cctgctggtt tcttcagtaa 8941 tgagttaaat aaaacattga cacatac // LOCUS HSFVT1A 2272 bp RNA PRI 30-JUN-1993 DEFINITION H.sapiens fvt1 mRNA. ACCESSION X63657 S51904 NID g296185 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2272) AUTHORS Rimokh,R., Gadoux,M., Bertheas,M.F., Berger,F., Garoscio,M., Deleage,G., Germain,D. and Magaud,J.P. TITLE FVT-1, a novel human transcription unit affected by variant translocation t(2;18)(p11;q21) of follicular lymphoma JOURNAL Blood 81 (1), 136-142 (1993) MEDLINE 93112945 REFERENCE 2 (bases 1 to 2272) AUTHORS Rimokh,R. TITLE Direct Submission JOURNAL Submitted (29-JAN-1992) R. Rimokh, Hopital Edouard Herriot, Laboratoire Hematologie et Cytogenetique, Pavillon E, F-69437 Lyon cedex 03, FRANCE FEATURES Location/Qualifiers source 1..2272 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /cell_type="B lymphoblastoid" /cell_line="IARC 171" /chromosome="18" /map="18q21" mRNA 1..2272 /gene="FVT1" gene 1..2272 /gene="FVT1" CDS 108..1106 /gene="FVT1" /note="FVT1 gene is disrupted in a t(2;18) chromosomal translocation involving Ig kappa gene in a follicular lymphoma" /codon_start=1 /db_xref="PID:g296186" /db_xref="SWISS-PROT:Q06136" /translation="MLLLAAAFLVAFVLLLYMVSPLISPKPLALPGAHVVVTGGSSGI GKCIAIECYKQGAFITLVARNEDKLLQAKKEIEMHSINDKQVVLCISVDVSQDYNQVE NVIKQAQEKLGPVDMLVNCAGMAVSGKFEDLEVSTFERLMSINYLGSVYPSRAVITTM KERRVGRIVFVSSQAGQLGLFGFTAYSASKFAIRGLAEALQMEVKPYNVYITVAYPPD TDTPGFAEENRTKPLETRLISETTSVCKPEQVAKQIVKDAIQGNFNSSLGSDGYMLSA LTCGMAPVTSITEGLQQVVTMGLFRTIALFYLGSFDSIVRRCMMQREKSENADKTA" repeat_region 1543..1613 polyA_signal 2260..2265 /gene="FVT1" BASE COUNT 612 a 497 c 518 g 645 t ORIGIN 1 aggcgcccgc ccgccgcgcg tgattctcgc ctcgccgcag cccagccctg cgcgccttgc 61 ccggcggccc ccgcccggcc gctccgggcc cctggccccg cggagcgatg ctgctgctgg 121 ctgccgcctt cctcgtggcc ttcgtgctgc tgctgtacat ggtgtctccg ctcatcagcc 181 ccaagcccct cgccctgccc ggggcgcatg tggtggttac aggaggttcc agtggcatcg 241 ggaagtgcat tgctatcgag tgctataaac aaggagcttt tataactctg gttgcacgaa 301 atgaggataa gctgctgcag gcaaagaaag aaattgaaat gcactctatt aatgacaaac 361 aggtggtgct ttgcatatca gttgatgtat ctcaagacta taaccaagta gagaatgtca 421 taaaacaagc acaggagaaa ctgggtccag tggacatgct ggtaaattgt gcaggaatgg 481 cagtgtcagg aaaatttgaa gatcttgaag ttagtacctt tgaaaggtta atgagcatca 541 attacctggg cagcgtgtac cccagccggg ccgtgatcac caccatgaag gagcgccggg 601 tgggcaggat cgtgtttgtg tcctcccagg caggacagtt gggattattc ggtttcacag 661 cctactctgc atccaagttt gccataaggg gattggcaga agctttgcag atggaggtga 721 agccatataa tgtctacatc acagttgctt acccaccaga cacagacaca cctggctttg 781 ccgaagaaaa cagaacaaag cctttggaga ctcgacttat ttcagagacc acatctgtgt 841 gcaaaccaga acaggtggcc aaacaaattg ttaaagatgc catacaagga aatttcaaca 901 gttcccttgg ctcagatggg tacatgctct cggccctgac ctgtgggatg gctccagtaa 961 cttctattac tgaggggctc cagcaggtgg tcaccatggg ccttttccgc actattgctt 1021 tgttttacct tggaagtttt gacagcatag ttcgtcgctg catgatgcag agagaaaaat 1081 ctgaaaatgc agacaaaact gcctaatctt cttacccctt ggaagaagac tgtttccaaa 1141 taatttgaac agcttgctgc taaatgggac ccaatttttg gcctatagac acttatgtat 1201 tgttttcgaa tacgtcagat tggaccagtg ctcttcagga atgtggctgc aagcaagggg 1261 ctagaagttc acctcctgac agtattatta atactatgca aatatggaat aggagaccat 1321 ttgattttct aggctttgtg gtagagaggt gaaggtatga gaattaatag cgtgtgaaca 1381 aagtaaagaa caggattcca gaatgatcat taaatttgtt tctatttatt cttttttgcc 1441 cccctagaga ttaagtccag aaatgtactt tctggcacat aaagaaatct tgaggacttt 1501 gtttaaacct tccataaaaa aacaattttc ggtttctcgg gttctctctc tctgtctctc 1561 tgtctctctg tctctctgtc tctctgtctc tctctctctc tctctttctt tctttgtgta 1621 ttttattcaa gatgagttgg acccattgcc agtgagtctg aatgtcactg acagccctgt 1681 gttgtgctca ggactcactc tgctgctggt ggaaactcat ggcttctctc tctctttgat 1741 cccataaagc tacgaggggg acgggagagg gcagtgcaat gggaagtaaa gagatatttt 1801 ccagtaggaa aagcaatgct ttcttgtctt tagactcaaa tgcttaggga acgtttcatt 1861 tctcattcat ggggaaaggc agcctcctta aatgttttct gaagagcggt aaaatctaga 1921 agcttaagaa tttacagttc cttcaataac catgatgacc tgaagttcac ctatcccatt 1981 ttagcatcta cttgtttttc ccatctcttc ctttccaatt ttgcttatac tgctgtaata 2041 tttttgtaaa aaaaaaaaaa aaggaaaaaa aagaccagct aaaattttcg acttgacttt 2101 ttaacttaac tcatgaatta attaaagcaa atgaaaaaat taaaaagtgt gactttttct 2161 cggagcatat atgtagcttt taggaaaggc tgatgatggt ataaagtttg ctcattaaga 2221 aaaaaagaca aggctgattt tgaagagagt tgcttttgaa ataaaatgat ca // LOCUS HSG11 1230 bp RNA PRI 21-FEB-1994 DEFINITION H.sapiens G11 mRNA. ACCESSION X77386 NID g453211 KEYWORDS G11 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1230) AUTHORS Sargent,C.A., Anderson,M.J., Hsieh,S.L., Kendall,E., Gomez-Escobar,N. and Campbell,R.D. TITLE Characterisation of the novel gene G11 lying adjacent to the complement C4A gene in the human major histocompatibility complex JOURNAL Hum. Mol. Genet. 3 (3), 481-488 (1994) MEDLINE 94282044 REFERENCE 2 (bases 1 to 1230) AUTHORS Campbell,R.D. TITLE Direct Submission JOURNAL Submitted (28-JAN-1994) R.D. Campbell, MRC Immunochemistry Unit, University of Oxford, South Parks Road, Oxford, OX1 3QU, UK FEATURES Location/Qualifiers source 1..1230 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="U937" /clone="G11-Y" /chromosome="6" /map="6p21.3, MHC class III region" gene 53..829 /gene="G11" CDS 53..829 /gene="G11" /codon_start=1 /evidence=experimental /db_xref="PID:g453212" /translation="MSWKRHHLIPETFGVKRRRKRGPVESDPLRGEPGSARAAVSELM QLFPRGLFEDALPPIVLRSQVYSLVPDRTVADRQLKELQEQGEIRIVQLGFDLDAHGI IFTEDYRTRVCDCVLKACDGRPYAGAVQKFLASVLPACGDLSFQQDQMTQTFGFRDSE ITHLVNAGVLTVRDAGSWWLAVPGAGRFIKYFVKGRQAVLSMVRKAKYRELLLSELLG RRAPVVVRLGLTYHVHDLIGAQLVDCISTTSGTLLRLPET" misc_difference 386..397 /gene="G11" /note="12 bp deletion im major mRNA species" polyA_signal 1205..1210 BASE COUNT 249 a 306 c 384 g 291 t ORIGIN 1 gttctcttcc ctccattcct accccttccc cggtaccata aaatcccggg atatgagctg 61 gaagaggcat cacctgatcc cggagacctt tggagttaag aggcggcgga agcgagggcc 121 tgtggagtcg gatcctcttc ggggtgagcc agggtcggcg cgcgcggctg tctcagaact 181 catgcagctg ttcccgcgag gcctgtttga ggacgcgctg ccgcccatcg tgctgaggag 241 ccaggtgtac agccttgtgc ctgacaggac cgtggccgac cggcagctga aggagcttca 301 agagcagggg gagatcagaa tcgtccagct gggcttcgac ttggatgccc atggaattat 361 cttcactgag gactacagga ccagagtatg tgactgtgtc ctcaaggcct gtgatggccg 421 accgtatgct ggggcagtgc agaaatttct agcttcagta cttccagcct gtggggacct 481 tagtttccag caggaccaaa tgacacagac ctttggcttc agggactcag aaatcacgca 541 tctggtgaat gctggagtcc tcaccgtccg agatgctggg agctggtggc tagctgtgcc 601 tggagctggg agattcatca agtactttgt taaagggcgc caggctgtcc ttagcatggt 661 ccggaaggca aagtaccggg aactgctcct atcagagctc ctgggccggc gggcgcctgt 721 cgtggtgcgg cttggcctca cctaccatgt gcacgacctc attggggccc agctagtgga 781 ctgcatctct accacttcag gaaccctcct ccgcctgcca gagacatgaa gattctgctc 841 atcattgctc agctcctcag agtgggccgg gaggggacta gaagagctgc atgatggtgg 901 ctgagacagg gtcaccttgg gaaggcttgg gagccaggat gagtgtcggg ctctcgtgtg 961 tgcaaaaggt cagatgtgac tgctgctgtt tgcctggttt ctgacccagt ggtggggttt 1021 gagcaatgct tctctgccct tccatggaaa gtggaaccag aaatggtgcc aaggctgtgg 1081 ctgttccctt tcgtgtaaaa tggtgctgtt attactctgt cttgaaatag gaaggtggga 1141 tttctgggga ggctggtgaa ggagggcagg gttcttttct ctatgtgtca tgttaaaatt 1201 gccaaataaa gtacctgtgc ctgtgaaaaa // LOCUS HSG6PDR 2625 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for glucose-6-phosphate dehydrogenase (G6PD). ACCESSION X03674 NID g31542 KEYWORDS glucose-6-phosphate dehydrogenase; inverted repeat. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2625) AUTHORS Persico,M.G., Viglietto,G., Martini,G., Toniolo,D., Paonessa,G., Moscatelli,C., Dono,R., Vulliamy,T., Luzzatto,L. and D'Urso,M. TITLE Isolation of human glucose-6-phosphate dehydrogenase (G6PD) cDNA clones: primary structure of the protein and unusual 5' non-coding region JOURNAL Nucleic Acids Res. 14 (6), 2511-2522 (1986) MEDLINE 86176746 REMARK Erratum:[Nucleic Acids Res 1986 Oct 10;14(19):7822]] COMMENT Data kindly reviewed (14-SEP-1986) by M. D'Urso. FEATURES Location/Qualifiers source 1..2625 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 471..2018 /note="G6PD (AA 1-515)" /codon_start=1 /db_xref="PID:g31543" /db_xref="SWISS-PROT:P11413" /translation="MAEQVALSRTHVCGILREELFQGDAFHQSDTHIFIIMGASGDLA KKKIYPTIWWLFRDGLLPENTFIVGYARSRLTVADIRKQSEPFFKATPEEKLKLEDFF ARNSYVAGQYDDAASYQRLNSHMNALHLGSQANRLFYLALPPTVYEAVTKNIHESCMS QIGWNRIIVEKPFGRDLQSSDRLSNHISSLFREDQIYRIDHYLGKEMVQNLMVLRFAN RIFGPIWNRDNIACVILTFKEPFGTEGRGGYFDEFGIIRDVMQNHLLQMLCLVAMEKP ASTNSDDVRDEKVKVLKCISEVQANNVVLGQYVGNPDGEGEATKGYLDDPTVPRGSTT ATFAAVVLYVENERWDGVPFILRCGKALNERKAEVRLQFHDVAGDIFHQQCKRNELVI RVQPNEAVYTKMMTKKPGMFFNPEESELDLTYGNRYKNVKLPDAYERLILDVFCGSQM HFVRSDELREAWRIFTPLLHQIELEKPKPIPYIYGSRGPTEADELMKRVGFQYEGTYK WVNPHKL" repeat_unit 516..525 /note="imp. inverted repeat A" misc_feature 516..554 /note="pot. stem-loop structure" repeat_unit 545..554 /note="imp. inverted repeat A'" misc_feature 2607..2612 /note="polyadenylation signal" polyA_site 2625 /note="polyadenylation site" BASE COUNT 525 a 879 c 794 g 426 t 1 others ORIGIN 1 agggacagcc cagaggaggc gtggccacgc tgccggcgga agtggagccc tccgcgagcg 61 cgcgaggccg ccggggaggc ggggaaaccg gacagtaggg gcggggcggg ccggcgatgg 121 ggatgcggga gcactacgcg gagctgcacc cgtgcccgcc ggaattgggg atgcagagca 181 gcggcagcgg gtatggcagg caccggcggg ccggcctcca gcgcaggtgc ccgagaggca 241 ggggctggcc tgggatgcgc gcgcacctgc cctcgccccg ccccgcccgc acgaggggtg 301 gtggccgagg ccccgccccg cacgcctcgc cgaggcgggt ccgctcagcc caggcgcccg 361 cccccgcccc cgccgattaa atgggccggc ggggctcagc ccccggaaac ggtcgtacac 421 ttcggggctg cgagcgcgga gggcgacgac gacgaagcgc agacagcgtc atggcagagc 481 aggtggccct gagccggacc cacgtgtgcg ggatcctgcg ggaagagctt ttccagggcg 541 atgccttcca tcagtcggat acacacatat tcatcatcat gggtgcatcg ggtgacctgg 601 ccaagaagaa gatctacccc accatctggt ggctgttccg ggatggcctt ctgcccgaaa 661 acaccttcat cgtgggctat gcccgttccc gcctcacagt ggctgacatc cgcaaacaga 721 gtgagccctt cttcaaggcc accccagagg agaagctcaa gctggaggac ttctttgccc 781 gcaactccta tgtggctggc cagtacgatg atgcagcctc ctaccagcgc ctcaacagcc 841 acatgaatgc cctccacctg gggtcacagg ccaaccgcct cttctacctg gccttgcccc 901 cgaccgtcta cgaggccgtc accaagaaca ttcacgagtc ctgcatgagc cagataggct 961 ggaaccgcat catcgtggag aagcccttcg ggagggacct gcagagctct gaccggctgt 1021 ccaaccacat ctcctccctg ttccgtgagg accagatcta ccgcatcgac cactacctgg 1081 gcaaggagat ggtgcagaac ctcatggtgc tgagatttgc caacaggatc ttcggcccca 1141 tctggaaccg ggacaacatc gcctgcgtta tcctcacctt caaggagccc tttggcactg 1201 agggtcgcgg gggctatttc gatgaatttg ggatcatccg ggacgtgatg cagaaccacc 1261 tactgcagat gctgtgtctg gtggccatgg agaagcccgc ctccaccaac tcagatgacg 1321 tccgtgatga gaaggtcaag gtgttgaaat gcatctcaga ggtgcaggcc aacaatgtgg 1381 tcctgggcca gtacgtgggg aaccccgatg gagagggcga ggccaccaaa gggtacctgg 1441 acgaccccac ggtgccccgc gggtccacca ccgccacttt tgcagccgtc gtcctctatg 1501 tggagaatga gaggtgggat ggggtgccct tcatcctgcg ctgcggcaag gccctgaacg 1561 agcgcaaggc cgaggtgagg ctgcagttcc atgatgtggc cggcgacatc ttccaccagc 1621 agtgcaagcg caacgagctg gtgatccgcg tgcagcccaa cgaggccgtg tacaccaaga 1681 tgatgaccaa gaagccgggc atgttcttca accccgagga gtcggagctg gacctgacct 1741 acggcaacag atacaagaac gtgaagctcc ctgacgccta cgagcgcctc atcctggacg 1801 tcttctgcgg gagccagatg cacttcgtgc gcagcgacga gctccgtgag gcctggcgta 1861 ttttcacccc actgctgcac cagattgagc tggagaagcc caagcccatc ccctatattt 1921 atggcagccg aggccccacg gaggcagacg agctgatgaa gagagtgggt ttccagtatg 1981 agggcaccta caagtgggtg aacccccaca agctctgagc cctggcaccc acctccaccc 2041 ccgccacggc caccctcctt cccgccgccc gaccccgagt cgggaggact ccgggaccat 2101 tgacctcagc tgcacattcc nggccccggg ctctggccac cttggcccgc ccctcgctgc 2161 tgctactacc cgagcccagc tacattcctc agctgccaag cactcgagac catcttggcc 2221 cctccagacc ctgcctgagc ctaggagctt gagtcacctc ctccactcac tccagcccaa 2281 cagaaggaag gaggagggcg cccattcgtc tgtcccagag cttattggcc actgggtctc 2341 gctccgagtg gggccagggt gggagggagg gtcaggggga ggaaaggggc gagcacccac 2401 gtgagagaat ctgcctgtgg ccttgcccgc cagcctcagt gccacttgac attccttgtc 2461 accagcaaca tctcgagccc cctagatgtc ccctgtccca ccaactctgc actccatggc 2521 caccccgtgc cacccgtagg cagcctctct gctataagaa aagcagacgc agcagctggg 2581 accccttcca acctcaatgc cctgccatta aatccgcaaa cagcc // LOCUS HSG7A 4107 bp RNA PRI 05-JUN-1992 DEFINITION Human G7a mRNA for valyl-tRNA synthetase. ACCESSION X59303 NID g31544 KEYWORDS major histocompatibility complex; valyl-tRNA synthetase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4107) AUTHORS Hsieh,H.L. TITLE Direct Submission JOURNAL Submitted (23-APR-1991) H.L. Hsieh, M R C Immunochemistry Unit, Department of Biochemistry, University of Oxford, South Parks Road, Oxford OX1 3QU, UK REFERENCE 2 (bases 1 to 4107) AUTHORS Hsieh,S.L. and Campbell,R.D. TITLE Evidence that gene G7a in the human major histocompatibility complex encodes valyl-tRNA synthetase JOURNAL Biochem. J. 278 (Pt 3), 809-816 (1991) MEDLINE 91378943 REMARK Erratum:[Biochem J 1992 Feb 1;281(Pt 3):879]] FEATURES Location/Qualifiers source 1..4107 /organism="Homo sapiens" /db_xref="taxon:9606" /haplotype="HLA:A2,B7,C2C,Bfs,C4A3,C4BQ0,DR2" gene 220..4017 /gene="G7a" CDS 220..4017 /gene="G7a" /codon_start=1 /product="valyl-tRNA synthetase" /db_xref="PID:g31545" /db_xref="SWISS-PROT:P26640" /translation="MSTLYVSPHPDAFPSLRALIAARYGEAGEGPGWGGAHPRICLQP PPTSRTSFPPPRLPALEQGPGGLWVWGATAVAQLLWPAGLGGPGGSRAAVLVQQWVSY ADTELIPAACGATLPALGLRSSAQDPQAVLGALGRALSPLEEWLRLHTYLAGEAPTLA DLAAVTALLLPFRYVLDPPARRIWNNVTRWFVTCVRQPEFRAVLGEVVLYSGARPLSH QPGPEAPALPKTAAQLKKEAKKREKLEKFQQKQKIQQQQPPPGEKKPKPEKREKRDPG VITYDLPTPPGEKKDVSGPMPDSYSPRYVEAAWYPWWEQQGFFKPEYGRPNVSAANPR GVFMMCIPPPNVTGSLHLGHALTNAIQDSLTRWHRMRGETTLWNPGCDHAGIATQVVV EKKLWREQGLSRHQLGREAFLQEVWKWKEEKGDRIYHQLKKLGSSLDWDRACFTMDPK LSAAVTEAFVRLHEEGIIYRSTRLVNWSCTLNSAISDIEVDKKELTGRTLLSVPGYKE KVEFGVLVSFAYKVQGSDSDEEVVVATTRIETMLGDVAVAVHPKDTRYQHLKGKNVIH PFLSRSLPIVFDEFVDMDFGTGAGKITPAHDQNDYEVGQRHGLEAISIMDSRGGPHQC ASAFPGPAQVLRPGKRCLVALKERGLFRGIEDNPMVVPLCNRSKDVVEPLLRPQWYVR CGEMAQAASAAVTRGDLRILPEAHQRTWHAWMDNIREWCISRQLWWGHRIPAYFVTVS DPAVPPGEDPDGRYWVSGRNEAEAREKAAKEFGVSPDKISLQQDEDVLDTWFFSGLFP LSILGWPNQSEDLSVFYPGTLLETGHDILFFWVARMVMLGLKLTGRLPFREVYLHAIV RDAHGRKMSKSLGNVIDPLDVIYGISLQGLHNQLLNSNLDPSEVEKAKEGQKADFPAG IPECGTDALRFGLCAYMSQGRDINLDVNRILGYRHFCNKLWNATKFALRGLGKGFVPS PTSQPGGHESLVDRWIRSRLTEAVRLSNQGFQAYDFPAVTTAQYSFWLYELCDVYLEC LKPVLNGVDQVAAECARQTLYTCLDVGLRLLSPFMPFVTEELFQRLPRRMPQAPPSLC VTPYPEPSECSWKDPEAEAALELALSITRAVRSLRADYNLTRIRPDCFLEVADEATGA LASAVSGYVQALASAGVVAVLALGAPAPQGCAVALASDRCSIHLQLQGLVDPARELGK LQAKRVEAQRQAQRLRERRAASGYPVKVPLEVQEADEAKLQQTEAELRKVDEAIALFQ KML" BASE COUNT 804 a 1267 c 1256 g 780 t ORIGIN 1 gtcacaaagg ggggacacgt gggcgccggc tgccggggcg gcgatcttag ggaactaggg 61 tcacctggag agccgcccac cgtctctgcc cgctcgactc ctccgcccgg gccgctcggc 121 ggtccagccg cggccggcgc ctggctgtga ggtggattcc cggcccagtc tgaccatctc 181 cctccagttt ttccacttcg ttcggacctt ctcataacta tgtccaccct ctacgtctcc 241 cctcacccag atgccttccc cagcctccga gccctcatag ccgctcgcta tggggaggct 301 ggggagggtc ccggatgggg aggagcccac ccccgcatct gtctccagcc acccccgact 361 agcaggacta gctttccccc accccgcctg ccggccctgg agcaggggcc cggtgggctc 421 tgggtgtggg gggccacggc tgtggcccag ctgctgtggc cagcaggcct ggggggccca 481 gggggcagcc gggcggctgt ccttgtccaa cagtgggtca gttacgccga cacggagtta 541 ataccagctg cctgtggagc aacgctgccg gccctgggac tccgaagctc ggcccaggac 601 ccccaggctg tgctgggggc cctgggcagg gccctgagcc ccttggagga gtggcttcgg 661 ctgcacacct acttggccgg ggaggccccc actctggctg acctggcggc tgtcacagcc 721 ttgctgctgc ctttccgata cgtcctagac ccacctgccc gccggatctg gaataatgtg 781 actcgctggt ttgtcacgtg tgtccgacag ccagaattcc gagccgtgct aggagaagtg 841 gttctatact caggagccag gcctctctct catcagccag gccccgaggc tcctgccctc 901 ccaaagacag ctgctcagct caagaaagag gcaaagaaac gggagaagct agagaaattc 961 caacagaagc agaagatcca acagcagcag ccacctccag gggagaagaa accaaaacca 1021 gagaagaggg agaaacggga tcctggggtc attacctatg acctcccaac cccacccggg 1081 gaaaagaaag atgtcagtgg ccccatgccc gactcctaca gccctcggta tgtggaggct 1141 gcctggtacc cttggtggga gcagcagggc ttcttcaagc cagagtatgg gcgtcctaat 1201 gtgtcagcag caaatccccg aggtgtcttc atgatgtgca tcccaccccc caatgtgaca 1261 ggctccctgc acctgggcca tgcactcacc aacgccatcc aggactccct gactcgatgg 1321 caccgcatgc gtggggagac caccctgtgg aaccctggct gtgaccatgc aggtattgcc 1381 acccaggtgg tggtggagaa gaagctatgg cgtgagcagg gactgagccg gcaccagctg 1441 ggccgcgagg cctttctaca ggaagtctgg aagtggaagg aggagaaagg tgaccggatt 1501 taccaccagt tgaagaagct tggcagctcc ttggactggg atcgagcctg tttcaccatg 1561 gaccctaaac tctcagcagc tgtgacagag gcctttgtcc ggcttcacga ggaaggcatc 1621 atctatcgca gtacccgcct tgttaactgg tcctgcaccc tcaactccgc catctctgac 1681 attgaggtgg ataagaagga gctgacaggt cgcaccctgc tctccgtgcc tggctacaag 1741 gagaaggtgg agttcggggt cctcgtgtcc tttgcctata aggtccaagg ctcagatagc 1801 gacgaggagg tggtggtggc aacaactcgg atcgagacaa tgctgggaga tgtggctgta 1861 gctgtgcacc ccaaagatac cagataccag cacctgaagg ggaagaacgt gatccaccca 1921 ttcctgtctc ggagccttcc cattgtcttc gatgaatttg tggacatgga ctttggcaca 1981 ggtgctggga agatcacccc cgcacatgac caaaatgact atgaagttgg gcagcggcac 2041 gggctggagg ccatcagcat catggactcc cgggggggcc ctcatcaatg tgcctccgcc 2101 tttcctgggc ctgcccaggt tttgaggcca ggaaagcggt gcctggtggc gctgaaggag 2161 cggggactgt tccgtggcat tgaggacaac cccatggtgg tgccactttg caaccggtcg 2221 aaggacgtgg tagagcctct gctgcggccg cagtggtacg ttcgctgcgg ggagatggcc 2281 caggctgcca gcgccgctgt gactcggggt gacctccgca tcctgcctga ggcccatcag 2341 cgcacatggc atgcctggat ggacaacatc cgggagtggt gcatttccag gcagctgtgg 2401 tggggccatc gcatcccagc ctactttgtc actgtcagtg acccagcggt gccccctggg 2461 gaggaccctg atgggcggta ctgggtgagt ggacgcaatg aggcggaggc ccgggagaag 2521 gcagccaagg agttcggagt gtcccctgac aagatcagtc tccagcaaga tgaggatgta 2581 ttggatacct ggttcttctc tggcctcttc cccttatcca ttttgggctg gcccaaccag 2641 tcagaagacc tgagtgtgtt ctaccccggg acactgctgg agaccggtca tgacatcctc 2701 ttcttctggg tggcccggat ggtcatgctg ggcctgaagc tcacgggcag gctgcccttt 2761 agagaggtct acctccatgc catcgtgcga gatgctcacg gccggaagat gagcaagtct 2821 ctaggcaatg tcatcgatcc cctggacgtc atctatggaa tctccctgca gggcctccac 2881 aaccagctgc tgaacagcaa cctggatccc agcgaggtgg agaaggccaa agaagggcag 2941 aaagctgact tcccagcggg gattcctgaa tgtggcaccg atgctctccg gtttggatta 3001 tgtgcctaca tgtcccaggg tcgtgacatc aacctggatg tgaaccggat actgggttac 3061 cgccacttct gcaacaagct ctggaatgcc accaagtttg cccttcgtgg ccttgggaag 3121 ggttttgtgc cctcacccac ctcccagccc ggaggccatg agagcctggt ggaccgctgg 3181 atccgcagcc gcctgacaga ggctgtgagg ctcagcaatc aaggcttcca ggcctacgac 3241 ttcccggccg tcaccactgc ccagtacagc ttctggctct atgagctctg tgatgtctac 3301 ttggagtgcc tgaaacctgt actgaatggg gtggaccagg tggcagctga gtgtgcccgc 3361 cagaccctgt acacttgcct ggacgttggc ctgcggctgc tctcaccctt catgcccttc 3421 gtgacggagg agctgttcca gaggctgccc cggaggatgc cgcaagctcc ccctagcctc 3481 tgtgttaccc cctacccgga gccctcagag tgctcctgga aggaccccga ggcagaagcc 3541 gcccttgagc tggcgctaag catcacgcga gccgtgcgct ccctgcgggc cgactacaac 3601 ctcacccgga tccggcctga ctgtttcctg gaagtggcgg atgaggccac gggcgccctg 3661 gcatcggcgg tgtcgggcta cgtgcaggcc ctggccagcg caggtgtggt ggctgttctg 3721 gccctggggg ctcccgcccc ccagggttgc gctgtggctc tggcttctga tcgctgctcc 3781 atccacctgc agcttcaggg gctggtggac cctgcacggg agctgggcaa gctgcaagcc 3841 aagcgagttg aggcccagcg gcaggcccag cgtctgcggg aacgccgtgc tgcctcgggc 3901 tatcctgtca aggtgccgct cgaagtccag gaggcagatg aagccaagct ccaacagaca 3961 gaagcagagc tcaggaaggt ggatgaggcc atcgccctat tccagaagat gctgtgatcc 4021 accacccagc ttcacccctc acccccagcg gctcaccatg gggatggcag caataaaata 4081 ttttcccaca aaaaaaaaaa aaaaaaa // LOCUS HSG9A 3391 bp RNA PRI 16-NOV-1993 DEFINITION H.sapiens mRNA for G9a. ACCESSION X69838 S57461 NID g287864 KEYWORDS ankyrin-like repeat; G9a gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3391) AUTHORS Milner,C.M. TITLE Direct Submission JOURNAL Submitted (18-DEC-1992) C.M. Milner, MRC Immunochemistry Unit, University of Oxford, South Parks Road, Oxford, OX1 3QU, UK REFERENCE 2 (bases 1 to 3391) AUTHORS Milner,C.M. and Campbell,R.D. TITLE The G9a gene in the human major histocompatibility complex encodes a novel protein containing ankyrin-like repeats JOURNAL Biochem. J. 290 (Pt 3), 811-818 (1993) MEDLINE 93207535 REMARK Erratum:[Biochem J 1993 Jun 15;292(Pt 3):952]] FEATURES Location/Qualifiers source 1..3391 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="cDNA" /clone="G9a-4C7" /chromosome="6p21.3 MHC class III region" gene 48..3053 /gene="G9a" CDS 48..3053 /gene="G9a" /codon_start=1 /product="G9a" /db_xref="PID:g287865" /translation="MSDDVHSLGKVTSDLAKRRKLNSGGGLSEELGSARRSGEVTLTK GDPGSLEEWETVVGDDFSLYYDSYSVDERVDSDSKSEVEALTEQLSEEEEEEEEEEEE EEEEEEEEEEEEDEESGNQSDRSGSSGRRKAKKKWRKDSPWVKPSRKRRKREPPRAKE PRGVNGVGSSGPSEYMEVPLGSLELPSEGTLSPNHAGVSNDTSSLETERGFEELPLCS CRMEAPKIDRISERAGHKCMATESVDGELSGCNAAILKRETMRPSSRVALMVLCETHR ARMVKHHCCPGCGYFCTAGTFLECHPDFRVAHRFHKACVSQLNGMVFCPHCGEDASEA QEVTIPRGDGVTPPAGTAAPAPPPLSQDVPGRADTSQPSARMRGHGEPRRPPCDPLAD TIDSSGPSLTLPNGGCLSAVGLPLGPGREALEKALVIQESERRKKLRFHPRQLYLSVK QGELQKVILMLLDNLDPNFQSDQQSKRTPLHAAAQKGSVEICHVLLQAGANINAVDKQ QRTPLMEAVVNNHLEVARYMVQRGGCVYSKEEDGSTCLHHAAKIGNLEMVSLLLSTGQ VDVNAQDSGGWTPIIWAAEHKHIEVIRMLLTRGADVTLTDNEENICLHWASFTGSAAI AEVLLNARCDLHAVNYHGDTPLHIAARESYHDCVLLFLSRGANPELRNKEGDTAWDLT PERSDVWFALQLNRKLRLGVGNRAIRTEKIICRDVARGYENVPIPCVNGVDGEPCPED YKYISENCETSTMNIDRNITHLQHCTCVDDCSSSNCLCGQLSIRRWYDKDGRLLQEFN KIEPPLIFECNQACSCWRNCKNRVVQSGIKVRLQLYRTAKMGWGVRALQTIPQGTFIC EYVGELISDAEADVREDDSYLFDLDNKDGEVYCIDARYYGNISRFINHLCDPNIIPVR VFMLHQDLRFPRIAFFSSRDIRTGEELGFDYGDRFWDIKSKYFTCQCGSEKCKHSAEA IALEQSRLARLDPHPELLPELGSLPPVNT" polyA_signal 3353..3358 BASE COUNT 746 a 1005 c 1026 g 614 t ORIGIN 1 agcccccggt ccctgagaag cggccccctg aaatacagca tttccgcatg agtgatgatg 61 tccactcact gggaaaggtg acctcagatc tggccaaaag gaggaagctg aactcaggag 121 gtggcctgtc ggaggagtta ggttctgccc ggcgttcagg agaagtgacc ctgacgaaag 181 gggaccccgg gtccctggag gagtgggaga cggtggtggg tgatgacttc agtctctact 241 atgattccta ctctgtggat gagcgcgtgg actccgacag caagtctgaa gttgaagctc 301 taactgaaca actaagtgaa gaggaggagg aggaagagga ggaagaagaa gaagaggaag 361 aggaggagga agaggaagaa gaagaggaag atgaggagtc agggaatcag tcagatagga 421 gtggttccag tggccggcgc aaggccaaga agaaatggcg aaaagacagc ccatgggtga 481 agccgtctcg gaaacggcgc aagcgggagc ctccgcgggc caaggagcca cgaggagtga 541 atggtgtggg ctcctcaggc cccagtgagt acatggaggt ccctctgggg tccctggagc 601 tgcccagcga ggggaccctc tcccccaacc acgctggggt gtccaatgac acatcttcgc 661 tggagacaga gcgagggttt gaggagttgc ccctgtgcag ctgccgcatg gaggcaccca 721 agattgaccg catcagcgag agggcggggc acaagtgcat ggccactgag agtgtggacg 781 gagagctgtc aggctgcaat gccgccatcc tcaagcggga gaccatgagg ccatccagcc 841 gtgtggccct gatggtgctc tgtgagaccc accgcgcccg catggtcaaa caccactgct 901 gcccgggctg cggctacttc tgcacggcgg gcaccttcct ggagtgccac cctgacttcc 961 gtgtggccca ccgcttccac aaggcctgtg tgtctcagct gaatgggatg gtcttctgtc 1021 cccactgtgg ggaggatgct tctgaagctc aagaggtgac catcccccgg ggtgacgggg 1081 tgaccccacc ggccggcact gcagctcctg cacccccacc cctgtcccag gatgtccccg 1141 ggagagcaga cacttctcag cccagtgccc ggatgcgagg gcatggggaa ccccggcgcc 1201 cgccctgcga tcccctggct gacaccattg acagctcagg gccctccctg accctgccca 1261 atgggggctg cctttcagcc gtggggctgc cactggggcc aggccgggag gccctggaaa 1321 aggccctggt catccaggag tcagagaggc ggaagaagct ccgtttccac cctcggcagt 1381 tgtacctgtc cgtgaagcag ggcgagctgc agaaggtgat cctgatgctg ttggacaacc 1441 tggaccccaa cttccagagc gaccagcaga gcaagcgcac gcccctgcat gcagccgccc 1501 agaagggctc cgtggagatc tgccatgtgc tgctgcaggc tggagccaac ataaatgcag 1561 tggacaaaca gcagcggacg ccactgatgg aggccgtggt gaacaaccac ctggaggtag 1621 cccgttacat ggtgcagcgt ggtggctgtg tctatagcaa ggaggaggac ggttccacct 1681 gcctccacca cgcagccaaa atcgggaact tggagatggt cagcctgctg ctgagcacag 1741 gacaggtgga cgtcaacgcc caggacagtg gggggtggac gcccatcatc tgggctgcag 1801 agcacaagca catcgaggtg atccgcatgc tactgacgcg gggcgccgac gtcaccctca 1861 ctgataacga ggagaacatc tgcctgcact gggcctcctt cacgggcagc gccgccatcg 1921 ccgaagtcct tctgaatgcg cgctgtgacc tccatgctgt caactaccat ggggacaccc 1981 ccctgcacat cgcagctcgg gagagctacc atgactgcgt gctgttattc ctgtcacgtg 2041 gggccaaccc tgagctgcgg aacaaagagg gggacacagc atgggacctg actcccgagc 2101 gctccgacgt gtggtttgcg cttcaactca accgcaagct ccgacttggg gtgggaaatc 2161 gggccatccg cacagagaag atcatctgcc gggacgtggc tcggggctat gagaacgtgc 2221 ccattccctg tgtcaacggt gtggatgggg agccctgccc tgaggattac aagtacatct 2281 cagagaactg cgagacgtcc accatgaaca tcgatcgcaa catcacccac ctgcagcact 2341 gcacgtgtgt ggacgactgc tctagctcca actgcctgtg cggccagctc agcatccggc 2401 gctggtatga caaggatggg cgattgctcc aggaatttaa caagattgag cctccgctga 2461 ttttcgagtg taaccaggcg tgctcatgct ggagaaactg caagaaccgg gtcgtacaga 2521 gtggcatcaa ggtgcggcta cagctctacc gaacagccaa gatgggctgg ggggtccgcg 2581 ccctgcagac catcccacag gggaccttca tctgcgagta tgtcggggag ctgatctctg 2641 atgctgaggc tgatgtgaga gaggatgatt cttacctctt cgacttagac aacaaggatg 2701 gagaggtgta ctgcatagat gcccgttact atggcaacat cagccgcttc atcaaccacc 2761 tgtgtgaccc caacatcatt cccgtccggg tcttcatgct gcaccaagac ctgcgatttc 2821 cacgcatcgc cttcttcagt tcccgagaca tccggactgg ggaggagcta gggtttgact 2881 atggcgaccg cttctgggac atcaaaagca aatatttcac ctgccaatgt ggctctgaga 2941 agtgcaagca ctcagccgaa gccattgccc tggagcagag ccgtctggcc cgcctggacc 3001 cacaccctga gctgctgccc gagctcggct ccctgccccc tgtcaacaca tgagaacgga 3061 ccacaccctc tctccccagc atggatggcc acagctcagc cgcctcctct gccaccagct 3121 gctcgcaccc atgcctgggg gtgctgccat cttctctccc caccaccctt tcacacattc 3181 ctgaccagag atcccagcca ggccctggag gtctgacagc ccctccctcc cagagctggt 3241 tcctccctgg gagggcaact tcagggctgg ccaccccccg tgttccccat cctcagttga 3301 agtttgatga attgaagtcg ggcctctatg ccaactggtt ccttttgttc tcaataaatg 3361 ttgggtttgg taataaaaaa aaaaaaaaaa a // LOCUS HSGA7331 1793 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for pancreatic carcinoma marker GA733-1. ACCESSION X13425 NID g31590 KEYWORDS GA733-1 gene; glycoprotein; transmembrane protein; tumor-associated antigen. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1793) AUTHORS Linnenbach,A.J. TITLE Direct Submission JOURNAL Submitted (02-NOV-1988) Linnenbach A.J., The Wistar Institute, 3601 Spruce Street, Philadelphia, PA, USA 19104 REFERENCE 2 (bases 1 to 1793) AUTHORS Wu,S., Huebner,K., Pyrc,J., Koprowski,H. and Linnenbach,A. JOURNAL Unpublished FEATURES Location/Qualifiers source 1..1793 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="lambda gt11" /clone="17-1-4" /chromosome="1p36-q12." CDS 71..1042 /note="GA733-1 protein (AA 1-323)" /codon_start=1 /db_xref="PID:g31591" /db_xref="SWISS-PROT:P09758" /translation="MARGPGLAPPPLRLPLLLLVLAAVTGHTAAQDNCTCPTNKMTVC SPDGPGGRCQCRALGSGMAVDCSTLTSKCLLLKARMSAPKNARTLVRPSEHALVDNDG LYDPDCDPEGRFKARQCNQTSVCWCVNSVGVRRTDKGDLSLRCDELVRTHHILIDLRH RPTAGAFNHSDLDAELRRLFRERYRLHPKFVAAVHYEQPTIQIELRQNTSQKAAGDVD IGDAAYYFERDIKGESLFQGRGGLDLRVRGEPLQVERTLIYYLDEIPPKFSMKRLTAG LIAVIVVVVVALVAGMAVLVITNRRKSGKYKKVEIKELGELRKEPSL" misc_feature 110..146 /note="hydrophilic domain" misc_feature 167..175 /note="pot. N-linked glycosylation site" misc_feature 428..436 /note="pot. N-linked glycosylation site" misc_feature 572..580 /note="pot. N-linked glycosylation site" misc_feature 692..700 /note="pot. N-linked glycosylation site" misc_feature 893..961 /note="transmembrane domain" misc_feature 1759..1764 /note="pot. polyA signal" misc_feature 1775..1780 /note="put. polyA signal" polyA_site 1793 /note="polyA site" BASE COUNT 363 a 529 c 513 g 388 t ORIGIN 1 cggagcccga gccccgacga gtccccgcgc ctcatccgcc cgcgtccggt ccgcgttcct 61 ccgccccacc atggctcggg gccccggcct cgcgccgcca ccgctgcggc tgccgctgct 121 gctgctggtg ctggcggcgg tgaccggcca cacggccgcg caggacaact gcacgtgtcc 181 caccaacaag atgaccgtgt gcagccccga cggccccggc ggccgctgcc agtgccgcgc 241 gctgggctcg ggcatggcgg tcgactgctc cacgctgacc tccaagtgtc tgctgctcaa 301 ggcgcgcatg agcgccccca agaacgcccg cacgctggtg cggccgagtg agcacgcgct 361 cgtggacaac gatggcctct acgaccccga ctgcgacccc gagggccgct tcaaggcgcg 421 ccagtgcaac cagacgtcgg tgtgctggtg cgtgaactcg gtgggcgtgc gccgcacgga 481 caagggcgac ctgagcctac gctgcgatga gctggtgcgc acccaccaca tcctcattga 541 cctgcgccac cgccccaccg ccggcgcctt caaccactca gacctggacg ccgagctgag 601 gcggctcttc cgcgagcgct atcggctgca ccccaagttc gtggcggccg tgcactacga 661 gcagcccacc atccagatcg agctgcggca gaacacgtct cagaaggccg ccggtgacgt 721 ggatatcggc gatgccgcct actacttcga gagggacatc aagggcgagt ctctattcca 781 gggccgcggc ggcctggact tgcgcgtgcg cggagaaccc ctgcaggtgg agcgcacgct 841 catctattac ctggacgaga ttcccccgaa gttctccatg aagcgcctca ccgccggtct 901 catcgccgtc atcgtggtgg tcgtggtggc cctcgtcgcc ggcatggccg tcctggtgat 961 caccaaccgg agaaagtcgg ggaagtacaa gaaggtggag atcaaggaac tgggggagtt 1021 gagaaaggaa ccgagcttgt aggtacccgg cggggcaggg gatggggtgg ggtaccggat 1081 ttcggtatcg tcccagaccc aagtgagtca cgcttcctga ttcctcggcg caaaggagac 1141 gtttatcctt tcaaattcct gccttccccc tcccttttgc gcacacacca ggtttaatag 1201 atcctggcct cagggtctcc tttctttctc acttctgtct tgaaggaagc atttctaaaa 1261 tgtatcccct ttcggtccaa caacaggaaa cctgactggg gcagtgaagg aagggatggc 1321 atagcgttat gtgtaaaaaa caagtatctg tatgacaacc cgggatcgtt tgcaagtaac 1381 tgaatccatt gcgacattgt gaaggcttaa atgagtttag atgggaaata gcgttgttat 1441 cgccttgggt ttaaattatt tgatgagttc cacttgtatc atggcctacc cgaggagaag 1501 aggagtttgt taactgggcc tatgtagtag cctcatttac catcgtttgt attactgacc 1561 acatatgctt gtcactggga aagaagcctg tttcagctgc ctgaacgcag tttggatgtc 1621 tttgaggaca gacattgccc ggaaactcag tctatttatt cctcagcttg cccttactgc 1681 cactgatatt ggtaatgttc ttttttgtaa aatgtttgta catatgttgt ctttgataat 1741 gttgctgtaa ttttttaaaa taaaacacga atttaataaa atatgggaaa ggc // LOCUS HSGAA 3624 bp RNA PRI 28-JUL-1995 DEFINITION H.sapiens GAA mRNA for lysosomal alpha-glucosidase (acid maltase). ACCESSION Y00839 NID g31607 KEYWORDS alpha-glucosidase; amylase; glycoprotein; lysosomal enzyme. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3624) AUTHORS Hoefsloot,L.H., Hoogeveen-Westerveld,M., Kroos,M.A., van Beeumen,J., Reuser,A.J. and Oostra,B.A. TITLE Primary structure and processing of lysosomal alpha-glucosidase; homology with the intestinal sucrase-isomaltase complex JOURNAL EMBO J. 7 (6), 1697-1704 (1988) MEDLINE 89005058 REMARK (revised by[3]) REFERENCE 2 (bases 1 to 3624) AUTHORS Reuser,A.J.J. TITLE Direct Submission JOURNAL Submitted (24-JUN-1988) to the EMBL/GenBank/DDBJ databases REMARK (revised by [3]) REFERENCE 3 (bases 1 to 3624) AUTHORS Reuser,A.J.J. TITLE Direct Submission JOURNAL Submitted (08-JUN-1990) Reuser A.J.J., Department of Cell Biology and Genetics Erasmus University, P.O. Box 1738, 300 DR Rotterdam, Netherlands COMMENT Data kindly reviewed (08-JUN-1990) by Reuser A. FEATURES Location/Qualifiers source 1..3624 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta and testis" /clone_lib="lambda gt11 (testis) and lambda gt11 (VII-75-1;placenta A1)" mRNA <1..3624 /note="alpha-proglucosidase mRNA" sig_peptide 220..426 /gene="GAA" gene 220..3078 /gene="GAA" CDS 220..3078 /gene="GAA" /EC_number="3.2.1.3" /codon_start=1 /product="glucan 1, 4-alpha-glucosidase" /db_xref="PID:g31608" /db_xref="SWISS-PROT:P10253" /translation="MGVRHPPCSHRLLAVCALVSLATAALLGHILLHDFLLVPRELSG SSPVLEETHPAHQQGASRPGPRDAQAHPGRPRAVPTQCDVPPNSRFDCAPDKAITQEQ CEARGCCYIPAKQGLQGAQMGQPWCFFPPSYPSYKLENLSSSEMGYTATLTRTTPTFF PKDILTLRLDVMMETENRLHFTIKDPANRRYEVPLETPRVHSRAPSPLYSVEFSEEPF GVIVHRQLDGRVLLNTTVAPLFFADQFLQLSTSLPSQYITGLAEHLSPLMLSTSWTRI TLWNRDLAPTPGANLYGSHPFYLALEDGGSAHGVFLLNSNAMDVVLQPSPALSWRSTG GILDVYIFLGPEPKSVVQQYLDVVGYPFMPPYWGLGFHLCRWGYSSTAITRQVVENMT RAHFPLDVQWNDLDYMDSRRDFTFNKDGFRDFPAMVQELHQGGRRYMMIVDPAISSSG PAGSYRPYDEGLRRGVFITNETGQPLIGKVWPGSTAFPDFTNPTALAWWEDMVAEFHD QVPFDGMWIDMNEPSNFIRGSEDGCPNNELENPPYVPGVVGGTLQAATICASSHQFLS THYNLHNLYGLTEAIASHRALVKARGTRPFVISRSTFAGHGRYAGHWTGDVWSSWEQL ASSVPEILQFNLLGVPLVGADVCGFLGNTSEELCVRWTQLGAFYPFMRNHNSLLSLPQ EPYSFSEPAQQAMRKALTLRYALLPHLYTLFHQAHVAGETVARPLFLEFPKDSSTWTV DHQLLWGEALLITPVLQAGKAEVTGYFPLGTWYDLQTVPIEALGSLPPPPAAPREPAI HSEGQWVTLPAPLDTINVHLRAGYIIPLQGPGLTTTESRQQPMALAVALTKGGEARGE LFWDDGESLEVLERGAYTQVIFLARNNTIVNELVRVTSEGAGLQLQKVTVLGVATAPQ QVLSNGVPVSNFTYSPDTKVLDICVSLLMGEQFLVSWC" misc_feature 427..828 /gene="GAA" /note="propiece of 70 kD alpha-glucosidase (AA 1 - 134)" misc_feature 427..582 /gene="GAA" /note="propiece of 76 kD alpha-glucosidase (AA 1 - 52)" mat_peptide 427..3075 /gene="GAA" /EC_number="3.2.1.3" /product="glucan 1, 4-alpha-glucosidase" misc_feature <583..3075 /gene="GAA" /note="76 kD alpha-glucosidase (AA 54 - <883)" misc_feature 637..639 /gene="GAA" /note="pot. N-linked glycosylation site" CDS <829..3078 /gene="GAA" /note="no start codon" /codon_start=1 /product="70 kD alpha-glucosidase" /db_xref="PID:g31609" /translation="APSPLYSVEFSEEPFGVIVHRQLDGRVLLNTTVAPLFFADQFLQ LSTSLPSQYITGLAEHLSPLMLSTSWTRITLWNRDLAPTPGANLYGSHPFYLALEDGG SAHGVFLLNSNAMDVVLQPSPALSWRSTGGILDVYIFLGPEPKSVVQQYLDVVGYPFM PPYWGLGFHLCRWGYSSTAITRQVVENMTRAHFPLDVQWNDLDYMDSRRDFTFNKDGF RDFPAMVQELHQGGRRYMMIVDPAISSSGPAGSYRPYDEGLRRGVFITNETGQPLIGK VWPGSTAFPDFTNPTALAWWEDMVAEFHDQVPFDGMWIDMNEPSNFIRGSEDGCPNNE LENPPYVPGVVGGTLQAATICASSHQFLSTHYNLHNLYGLTEAIASHRALVKARGTRP FVISRSTFAGHGRYAGHWTGDVWSSWEQLASSVPEILQFNLLGVPLVGADVCGFLGNT SEELCVRWTQLGAFYPFMRNHNSLLSLPQEPYSFSEPAQQAMRKALTLRYALLPHLYT LFHQAHVAGETVARPLFLEFPKDSSTWTVDHQLLWGEALLITPVLQAGKAEVTGYFPL GTWYDLQTVPIEALGSLPPPPAAPREPAIHSEGQWVTLPAPLDTINVHLRAGYIIPLQ GPGLTTTESRQQPMALAVALTKGGEARGELFWDDGESLEVLERGAYTQVIFLARNNTI VNELVRVTSEGAGLQLQKVTVLGVATAPQQVLSNGVPVSNFTYSPDTKVLDICVSLLM GEQFLVSWC" misc_feature 916..918 /gene="GAA" /note="pot. N-linked glycosylation site" misc_feature 1387..1389 /gene="GAA" /note="pot. N-linked glycosylation site" misc_feature 1627..1629 /gene="GAA" /note="pot. N-linked glycosylation site" misc_feature 2173..2175 /gene="GAA" /note="pot. N-linked glycosylation site" misc_feature 2863..2865 /gene="GAA" /note="pot. N-linked glycosylation site" misc_feature 2992..2994 /gene="GAA" /note="pot. N-linked glycosylation site" misc_feature 3603..3607 /note="polyA signal" polyA_site 3624 /note="polyA site" BASE COUNT 643 a 1220 c 1100 g 661 t ORIGIN 1 cagttgggaa agctgaggtt gtcgccgggg ccgcgggtgg aggtcgggga tgaggcagca 61 ggtaggacag tgacctcggt gacgcgaagg accccggcca cctctaggtt ctcctcgtcc 121 gcccgttgtt cagcgaggga ggctctgggc ctgccgcagc tgacggggaa actgaggcac 181 ggagcgggcc tgtaggagct gtccaggcca tctccaacca tgggagtgag gcacccgccc 241 tgctcccacc ggctcctggc cgtctgcgcc ctcgtgtcct tggcaaccgc tgcactcctg 301 gggcacatcc tactccatga tttcctgctg gttccccgag agctgagtgg ctcctcccca 361 gtcctggagg agactcaccc agctcaccag cagggagcca gcagaccagg gccccgggat 421 gcccaggcac accccggccg tcccagagca gtgcccacac agtgcgacgt cccccccaac 481 agccgcttcg attgcgcccc tgacaaggcc atcacccagg aacagtgcga ggcccgcggc 541 tgctgctaca tccctgcaaa gcaggggctg cagggagccc agatggggca gccctggtgc 601 ttcttcccac ccagctaccc cagctacaag ctggagaacc tgagctcctc tgaaatgggc 661 tacacggcca ccctgacccg taccaccccc accttcttcc ccaaggacat cctgaccctg 721 cggctggacg tgatgatgga gactgagaac cgcctccact tcacgatcaa agatccagct 781 aacaggcgct acgaggtgcc cttggagacc ccgcgtgtcc acagccgggc accgtcccca 841 ctctacagcg tggagttctc cgaggagccc ttcggggtga tcgtgcaccg gcagctggac 901 ggccgcgtgc tgctgaacac gacggtggcg cccctgttct ttgcggacca gttccttcag 961 ctgtccacct cgctgccctc gcagtatatc acaggcctcg ccgagcacct cagtcccctg 1021 atgctcagca ccagctggac caggatcacc ctgtggaacc gggaccttgc gcccacgccc 1081 ggtgcgaacc tctacgggtc tcaccctttc tacctggcgc tggaggacgg cgggtcggca 1141 cacggggtgt tcctgctaaa cagcaatgcc atggatgtgg tcctgcagcc gagccctgcc 1201 cttagctgga ggtcgacagg tgggatcctg gatgtctaca tcttcctggg cccagagccc 1261 aagagcgtgg tgcagcagta cctggacgtt gtgggatacc cgttcatgcc gccatactgg 1321 ggcctgggct tccacctgtg ccgctggggc tactcctcca ccgctatcac ccgccaggtg 1381 gtggagaaca tgaccagggc ccacttcccc ctggacgtcc aatggaacga cctggactac 1441 atggactccc ggagggactt cacgttcaac aaggatggct tccgggactt cccggccatg 1501 gtgcaggagc tgcaccaggg cggccggcgc tacatgatga tcgtggatcc tgccatcagc 1561 agctcgggcc ctgccgggag ctacaggccc tacgacgagg gtctgcggag gggggttttc 1621 atcaccaacg agaccggcca gccgctgatt gggaaggtat ggcccgggtc cactgccttc 1681 cccgacttca ccaaccccac agccctggcc tggtgggagg acatggtggc tgagttccat 1741 gaccaggtgc ccttcgacgg catgtggatt gacatgaacg agccttccaa cttcatcaga 1801 ggctctgagg acggctgccc caacaatgag ctggagaacc caccctacgt gcctggggtg 1861 gttgggggga ccctccaggc ggccaccatc tgtgcctcca gccaccagtt tctctccaca 1921 cactacaacc tgcacaacct ctacggcctg accgaagcca tcgcctccca cagggcgctg 1981 gtgaaggctc gggggacacg cccatttgtg atctcccgct cgacctttgc tggccacggc 2041 cgatacgccg gccactggac gggggacgtg tggagctcct gggagcagct cgcctcctcc 2101 gtgccagaaa tcctgcagtt taacctgctg ggggtgcctc tggtcggggc cgacgtctgc 2161 ggcttcctgg gcaacacctc agaggagctg tgtgtgcgct ggacccagct gggggccttc 2221 taccccttca tgcggaacca caacagcctg ctcagtctgc cccaggagcc gtacagcttc 2281 agcgagccgg cccagcaggc catgaggaag gccctcaccc tgcgctacgc actcctcccc 2341 cacctctaca cactgttcca ccaggcccac gtcgcggggg agaccgtggc ccggcccctc 2401 ttcctggagt tccccaagga ctctagcacc tggactgtgg accaccagct cctgtggggg 2461 gaggccctgc tcatcacccc agtgctccag gccgggaagg ccgaagtgac tggctacttc 2521 cccttgggca catggtacga cctgcagacg gtgccaatag aggcccttgg cagcctccca 2581 cccccacctg cagctccccg tgagccagcc atccacagcg aggggcagtg ggtgacgctg 2641 ccggcccccc tggacaccat caacgtccac ctccgggctg ggtacatcat ccccctgcag 2701 ggccctggcc tcacaaccac agagtcccgc cagcagccca tggccctggc tgtggccctg 2761 accaagggtg gagaggcccg aggggagctg ttctgggacg atggagagag cctggaagtg 2821 ctggagcgag gggcctacac acaggtcatc ttcctggcca ggaataacac gatcgtgaat 2881 gagctggtac gtgtgaccag tgagggagct ggcctgcagc tgcagaaggt gactgtcctg 2941 ggcgtggcca cggcgcccca gcaggtcctc tccaacggtg tccctgtctc caacttcacc 3001 tacagccccg acaccaaggt cctggacatc tgtgtctcgc tgttgatggg agagcagttt 3061 ctcgtcagct ggtgttagcc gggcggagtg tgttagtctc tccagaggga ggctggttcc 3121 ccagggaagc agagcctgtg tgcgggcagc agctgtgtgc gggcctgggg gttgcatgtg 3181 tcacctggag ctgggcacta accattccaa gccgccgcat cgcttgtttc cacctcctgg 3241 gccggggctc tggcccccaa cgtgtctagg agagctttct ccctagatcg cactgtgggc 3301 cggggcctgg agggctgctc tgtgttaata agattgtaag gtttgccctc ctcacctgtt 3361 gccggcatgc gggtagtatt agccaccccc ctccatctgt tcccagcacc ggagaagggg 3421 gtgctcaggt ggaggtgtgg ggtatgcacc tgagctcctg cttcgcgcct gctgctctgc 3481 cccaacgcga ccgcttcccg gctgcccaga gggctggatg cctgccggtc cccgagcaag 3541 cctgggaact caggaaaatt cacaggactt gggagattct aaatcttaag tgcaattatt 3601 ttaataaaag gggcatttgg aatc // LOCUS HSGABAAA1 1742 bp RNA PRI 18-MAR-1991 DEFINITION Human mRNA for GABA-A receptor, alpha 1 subunit. ACCESSION X14766 NID g31632 KEYWORDS GABA-A receptor; gamma-aminobutyric acid receptor; gamma-aminobutyric acid receptor alpha-subunit. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1742) AUTHORS Schofield,P.R., Pritchett,D.B., Sontheimer,H., Kettenmann,H. and Seeburg,P.H. TITLE Sequence and expression of human GABAA receptor alpha 1 and beta 1 subunits JOURNAL FEBS Lett. 244 (2), 361-364 (1989) MEDLINE 89153582 FEATURES Location/Qualifiers source 1..1742 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="brain" /clone_lib="human fetal brain in lambda gt10" mRNA 1..1742 /gene="GABA-A receptor alpha 1 subunit" gene 1..1742 /gene="GABA-A receptor alpha 1 subunit" sig_peptide 215..295 /gene="GABA-A receptor alpha 1 subunit" /product="GABA-A receptor alpha 1 subunit" CDS 215..1585 /gene="GABA-A receptor alpha 1 subunit" /codon_start=1 /product="GABA-A receptor alpha 1 subunit" /db_xref="PID:g31633" /db_xref="SWISS-PROT:P14867" /translation="MRKSPGLSDCLWAWILLLSTLTGRSYGQPSLQDELKDNTTVFTR ILDRLLDGYDNRLRPGLGERVTEVKTDIFVTSFGPVSDHDMEYTIDVFFRQSWKDERL KFKGPMTVLRLNNLMASKIRTPDTFFHNGKKSVAHNMTMPNKLLRITEDGTLLYTMRL TVRAECPMHLEDFPMDAHACPLKFGSYAYTRAEVVYEWTREPARSVVVAEDGSRLNQY DLLGQTVDSGIVQSSTGEYVVMTTHFHLKRKIGYFVIQTYLPCIMTVILSQVSFWLNR ESVPARTVFGVTTVLTMTTLSISARNSLPKVAYATAMDWFIAVCYAFVFSALIEFATV NYFTKRGYAWDGKSVVPEKPKKVKDPLIKKNNTYAPTATSYTPNLARGDPGLATIAKS ATIEPKEVKPETKPPEPKKTFNSVSKIDRLSRIAFPLLFGIFNLVYWATYLNREPQLK APTPHQ" mat_peptide 296..1582 /gene="GABA-A receptor alpha 1 subunit" /product="GABA-A receptor alpha 1 subunit" BASE COUNT 500 a 424 c 379 g 439 t ORIGIN 1 gtgaaatctt cagcaaagga gcacgcagag tccatgatgg ctcagaccaa gtgagtgaga 61 ggcagagcga ggacgcccct ctgctctggc gcgcccggac tcggactcgc agactcgcgc 121 tggctccagt ctctccacga ttctctctcc cagacttttc cccggtctta agagatcctg 181 tgtccagagg gggccttagc tgctccagcc cgcgatgagg aaaagtccag gtctgtctga 241 ctgtctttgg gcctggatcc tccttctgag cacactgact ggaagaagct atggacagcc 301 gtcattacaa gatgaactta aagacaatac cactgtcttc accaggattt tggacagact 361 cctagatggc tatgacaatc gcctgagacc aggattggga gagcgtgtaa ccgaagtgaa 421 gactgatatc ttcgtcacca gtttcggacc cgtttcagac catgatatgg aatatacaat 481 agatgtattt ttccgtcaaa gctggaagga tgaaaggtta aaatttaaag gacctatgac 541 agtcctccgg ttaaataacc taatggcaag taaaatccgg actccggaca catttttcca 601 caatggaaag aagtcagtgg cccacaacat gaccatgccc aacaaactcc tgcggatcac 661 agaggatggc accttgctgt acaccatgag gctgacagtg agagctgaat gtccgatgca 721 tttggaggac ttccctatgg atgcccatgc ttgcccacta aaatttggaa gttatgctta 781 tacaagagca gaagttgttt atgaatggac cagagagcca gcacgctcag tggttgtagc 841 agaagatgga tcacgtctaa accagtatga ccttcttgga caaacagtag actctggaat 901 tgtccagtca agtacaggag aatatgttgt tatgaccact catttccact tgaagagaaa 961 gattggctac tttgttattc aaacatacct gccatgcata atgacagtga ttctctcaca 1021 agtctccttc tggctcaaca gagagtctgt accagcaaga actgtctttg gagtaacaac 1081 tgtgctcacc atgacaacat tgagcatcag tgccagaaac tccctcccta aggtggctta 1141 tgcaacagct atggattggt ttattgccgt gtgctatgcc tttgtgttct cagctctgat 1201 tgagtttgcc acagtaaact atttcactaa gagaggttat gcatgggatg gcaaaagtgt 1261 ggttccagaa aagccaaaga aagtaaagga tcctcttatt aagaaaaaca acacttacgc 1321 tccaacagca accagctaca cccctaattt ggccaggggc gacccgggct tagccaccat 1381 tgctaaaagt gcaaccatag aacctaaaga ggtcaagccc gaaacaaaac caccagaacc 1441 caagaaaacc tttaacagtg tcagcaaaat tgaccgactg tcaagaatag ccttcccgct 1501 gctatttgga atctttaact tagtctactg ggctacgtat ttaaacagag agcctcagct 1561 aaaagccccc acaccacatc aatagatctt ttactcacat tctgttgttc agttcctctg 1621 cactgggaat ttatttatgt tctcaacgca gtaattccca tctgccttta ttgcctctgt 1681 cttaaagaat ttgaaagttt ccttattttc ataattcatt taagacaaga gacccctgtc 1741 tg // LOCUS HSGABAAB1 1866 bp RNA PRI 18-MAR-1991 DEFINITION Human mRNA for GABA-A receptor, beta 1 subunit. ACCESSION X14767 NID g31634 KEYWORDS GABA-A receptor; gamma-aminobutyric acid receptor; gamma-aminobutyric acid receptor beta-subunit. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1866) AUTHORS Schofield,P.R., Pritchett,D.B., Sontheimer,H., Kettenmann,H. and Seeburg,P.H. TITLE Sequence and expression of human GABAA receptor alpha 1 and beta 1 subunits JOURNAL FEBS Lett. 244 (2), 361-364 (1989) MEDLINE 89153582 FEATURES Location/Qualifiers source 1..1866 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="brain" /clone_lib="human fetal brain in lambda gt10" mRNA 1..1866 /gene="GABA-A receptor beta 1 subunit" gene 1..1866 /gene="GABA-A receptor beta 1 subunit" sig_peptide 32..106 /gene="GABA-A receptor beta 1 subunit" /product="GABA-A receptor beta 1 subunit" CDS 32..1456 /gene="GABA-A receptor beta 1 subunit" /codon_start=1 /product="GABA-A receptor beta 1 subunit" /db_xref="PID:g31635" /db_xref="SWISS-PROT:P18505" /translation="MWTVQNRESLGLLSFPVMITMVCCAHSTNEPSNMPYVKETVDRL LKGYDIRLRPDFGGPPVDVGMRIDVASIDMVSEVNMDYTLTMYFQQSWKDKRLSYSGI PLNLTLDNRVADQLWVPDTYFLNDKKSFVHGVTVKNRMIRLHPDGTVLYGLRITTTAA CMMDLRRYPLDEQNCTLEIESYGYTTDDIEFYWNGGEGAVTGVNKIELPQFSIVDYKM VSKKVEFTTGAYPRLSLSFRLKRNIGYFILQTYMPSTLITILSWVSFWINYDASAARV ALGITTVLTMTTISTHLRETLPKIPYVKAIDIYLMGCFVFVFLALLEYAFVNYIFFGK GPQKKGASKQDQSANEKNKLEMNKVQVDAHGNILLSTLEIRNETSGSEVLTSVSDPKA TMYSYDSASIQYRKPLSSREAYGRALDRHGVPSKGRIRRRASQLKVKIPDLTDVNSID KWSRMFFPITFSLFNVVYWLYYVH" mat_peptide 107..1453 /gene="GABA-A receptor beta 1 subunit" /product="GABA-A receptor beta 1 subunit" BASE COUNT 516 a 438 c 412 g 500 t ORIGIN 1 gaaaagacaa ttcttttaat cagagttagt aatgtggaca gtacaaaatc gagagagtct 61 ggggcttctc tctttccctg tgatgattac catggtctgt tgtgcacaca gcaccaatga 121 acccagcaac atgccatacg tgaaagagac agtggacaga ttgctcaaag gatatgacat 181 tcgcttgcgg ccggacttcg gagggccccc cgtcgacgtt gggatgcgga tcgatgtcgc 241 cagcatagac atggtctccg aagtgaatat ggattataca ctcaccatgt atttccagca 301 gtcttggaaa gacaaaaggc tttcttattc tggaatccca ctgaacctca ccctagacaa 361 tagggtagct gaccaactct gggtaccaga cacctacttt ctgaatgaca agaaatcatt 421 tgtgcatggg gtcacagtga aaaatcgaat gattcgactg catcctgatg gaacagttct 481 ctatggactc cgaatcacaa ccacagctgc atgtatgatg gatcttcgaa gatatccact 541 ggatgagcag aactgcaccc tggagatcga aagttatggc tataccactg atgacattga 601 attttactgg aatggaggag aaggggcagt cactggtgtt aataaaatcg aacttcctca 661 attttcaatt gttgactaca agatggtgtc taagaaggtg gagttcacaa caggagcgta 721 tccacgactg tcactaagtt ttcgtctaaa gagaaacatt ggttacttca ttttgcaaac 781 ctacatgcct tctacactga ttacaattct gtcctgggtg tctttttgga tcaactatga 841 tgcatctgca gccagagtcg cactaggaat cacgacggtg cttacaatga caaccatcag 901 cacccacctc agggagaccc tgccaaagat cccttatgtc aaagcgattg atatttatct 961 gatgggttgc tttgtgtttg tgttcctggc tctgctggag tatgcctttg taaattacat 1021 cttctttggg aaaggccctc agaaaaaggg agctagcaaa caagaccaga gtgccaatga 1081 gaagaataaa ctggagatga ataaagtcca ggtcgacgcc cacggtaaca ttctcctcag 1141 caccctggaa atccggaatg agacgagtgg ctcggaagtg ctcacgagcg tgagcgaccc 1201 caaggccacc atgtactcct atgacagcgc cagcatccag taccgcaagc ccctgagcag 1261 ccgcgaggcc tacgggcgcg ccctggaccg gcacggggta cccagcaagg ggcgcatccg 1321 caggcgtgcc tcccagctca aagtcaagat ccccgacttg actgatgtga attccataga 1381 caagtggtcc cgaatgtttt tccccatcac cttttctctt tttaatgtcg tctattggct 1441 ttactatgta cactgaggtc tgttctaatg gttccattta gactactttc ctcttctatt 1501 gttttttaac cttacaggtc cccaacagcg atactgctgt ttctcgaggt aagagattca 1561 gccatccaat tggttttagg tcttgcatat cagttttatt actgcaccat gtttacttca 1621 aaaagacaaa acaaaaaaaa aattattttt ccagtctacc gtggtccagg ttatcagctc 1681 tttaagagct ctattaattg ccatgtttac aaacaaacac aaagagagaa gttagacagg 1741 tagatcttta gcagtctttt ctagtttccc tggatttcac tgatttattt tttagggaaa 1801 atgaaaagag gaccttgctg tccgcctgca ctgcttcctg gtaaactata acaaacttat 1861 gctgcc // LOCUS HSGABAAS 1745 bp RNA PRI 15-MAR-1991 DEFINITION Human mRNA for GABA-A receptor, gamma 2 subunit. ACCESSION X15376 NID g31636 KEYWORDS GABA-A receptor; gamma subunit receptor; gamma-aminobutyric acid receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1745) AUTHORS Pritchett,D.B., Sontheimer,H., Shivers,B.D., Ymer,S., Kettenmann,H., Schofield,P.R. and Seeburg,P.H. TITLE Importance of a novel GABAA receptor subunit for benzodiazepine pharmacology JOURNAL Nature 338 (6216), 582-585 (1989) MEDLINE 89181956 FEATURES Location/Qualifiers source 1..1745 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="brain" /tissue_lib="fetal brain cDNA" mRNA 1..>1745 /note="for GABA-A receptor gamma 2 subunit" sig_peptide 226..342 CDS 227..1630 /codon_start=1 /product="GABA-A receptor gamma 2 subunit" /db_xref="PID:g31637" /db_xref="SWISS-PROT:P18507" /translation="MSSPNIWSTGSSVYSTPVFSQKMTVWILLLLSLYPGFTSQKSDD DYEDYASNKTWVLTPKVPEGDVTVILNNLLEGYDNKLRPDIGVKPTLIHTDMYVNSIG PVNAINMEYTIDIFFAQMWYDRRLKFNSTIKVLRLNSNMVGKIWIPDTFFRNSKKADA HWITTPNRMLRIWNDGRVLYSLRLTIDAECQLQLHNFPMDEHSCPLEFSSYGYPREEI VYQWKRSSVEVGDTRSWRLYQFSFVGLRNTTEVVKTTSGDYVVMSVYFDLSRRMGYFT IQTYIPCTLIVVLSWVSFWINKDAVPARTSLGITTVLTMTTLSTIARKSLPKVSYVTA MDLFVSVCFIFVFSALVEYGTLHYFVSNRKPSKDKDKKKKNPAPTIDIRPRSATIQMN NATHLQERDEEYGYECLDGKDCASFFCCFEDCRTGAWRHGRIHIRIAKMDSYARIFFP TAFCLFNLVYWVSYLYL" mat_peptide 343..1627 /product="GABA-A receptor gamma 2 subunit" BASE COUNT 492 a 389 c 378 g 486 t ORIGIN 1 cctgacgctt tgatggtatc tgcaagcgtt tttgctgatc ttatctctgc cccctgaata 61 ttaattccct aatctggtag caatccatct ccccagtgaa ggacctacta gaggcaggtg 121 gggggagcca ccatcagatc atcaagcata agaataatac aaaggggagg gattcttctg 181 caaccaagag gcaagaggcg agagaaggaa aaaaaaaaaa aaagcgatga gttcaccaaa 241 tatatggagc acaggaagct cagtctactc gactcctgta ttttcacaga aaatgacggt 301 gtggattctg ctcctgctgt cgctctaccc tggcttcact agccagaaat ctgatgatga 361 ctatgaagat tatgcttcta acaaaacatg ggtcttgact ccaaaagttc ctgagggtga 421 tgtcactgtc atcttaaaca acctgctgga aggatatgac aataaacttc ggcctgatat 481 aggagtgaag ccaacgttaa ttcacacaga catgtatgtg aatagcattg gtccagtgaa 541 cgctatcaat atggaataca ctattgatat attttttgcg caaatgtggt atgacagacg 601 tttgaaattt aacagcacca ttaaagtcct ccgattgaac agcaacatgg tggggaaaat 661 ctggattcca gacactttct tcagaaattc caaaaaagct gatgcacact ggatcaccac 721 ccccaacagg atgctgagaa tttggaatga tggtcgagtg ctctactccc taaggttgac 781 aattgatgct gagtgccaat tacaattgca caattttcca atggatgaac actcctgccc 841 cttggagttc tccagttatg gctatccacg tgaagaaatt gtttatcaat ggaagcgaag 901 ttctgttgaa gtgggcgaca caagatcctg gaggctttat caattctcat ttgttggtct 961 aagaaatacc accgaagtag tgaagacaac ttccggagat tatgtggtca tgtctgtcta 1021 ctttgatctg agcagaagaa tgggatactt taccatccag acctatatcc cctgcacact 1081 cattgtcgtc ctatcctggg tgtctttctg gatcaataag gatgctgttc cagccagaac 1141 atctttaggt atcaccactg tcctgacaat gaccaccctc agcaccattg cccggaaatc 1201 gctccccaag gtctcctatg tcacagcgat ggatctcttt gtatctgttt gtttcatctt 1261 tgtcttctct gctctggtgg agtatggcac cttgcattat tttgtcagca accggaaacc 1321 aagcaaggac aaagataaaa agaagaaaaa ccctgcccct accattgata tccgcccaag 1381 atcagcaacc attcaaatga ataatgctac acaccttcaa gagagagatg aagagtacgg 1441 ctatgagtgt ctggacggca aggactgtgc cagttttttc tgctgttttg aagattgtcg 1501 aacaggagct tggagacatg ggaggataca tatccgcatt gccaaaatgg actcctatgc 1561 tcggatcttc ttccccactg ccttctgcct gtttaatctg gtctattggg tctcctacct 1621 ctacctgtga ggaggtatgg gttttactga tatggttctt attcactgag tctcatggag 1681 agatgtctgt tctaagtcca cttaaataat cctctatgtg gttgataagt atctgaatct 1741 gtttc // LOCUS HSGABACHL 3153 bp RNA PRI 03-MAR-1997 DEFINITION H.sapiens mRNA for putative GABA-gated chloride channel. ACCESSION Y07637 NID g1747370 KEYWORDS GABA-gated chloride channel. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3153) AUTHORS Garret,M., Bascles,L., Boue-Grabot,E., Sartor,P., Charron,G., Bloch,B. and Margolskee,R.F. TITLE An mRNA encoding a putative GABA-gated chloride channel is expressed in the human cardiac conduction system JOURNAL J. Neurochem. 68 (4), 1382-1389 (1997) MEDLINE 97238072 REFERENCE 2 (bases 1 to 3153) AUTHORS Garret,M. TITLE Direct Submission JOURNAL Submitted (21-AUG-1996) M. Garret, CNRS UMR5543, Laboratoire de Neurophysiologie, Universite de Bordeaux2, 146 rue Leo Saignat, 33076 Bordeaux Cedex, FRANCE FEATURES Location/Qualifiers source 1..3153 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 51..1568 /function="expressed in cardiac conduction system" /codon_start=1 /product="putative GABA-gated chloride channel" /db_xref="PID:e274573" /db_xref="PID:g1747371" /translation="MLSKVLPVLLGILLILQSRVEGPQTESKNEASSRDVVYGPQPQP LENQLLSEETKSTETETGSRVGKLPEASRILNTILSNYDHKLRPGIGEKPTVVTVEIA VNSLGPLSILDMEYTIDIIFSQTWYDERLCYNDTFESLVLNGNVVSQLWIPDTFFRNS KRTHEHEITMPNQMVRIYKDGKVLYTIRMTIDAGCSLHMLRFPMDSHSCPLSFSSFSY PENEMIYKWENFKLEINEKNSWKLFQFDFTGVSNKTEIITTPGDFMVMTIFFNVSRRF GYVAFQNYVPSSVTTMLSWVSFWIKTESAPARTSLGITSVLTMTTLGTFSRKNFPRVS YITALDFYIAICFVFCFCALLEFAVLNFLIYNQTKAHASPKLRHPRINSRAHARTRAR SRACARQHQEAFVCQIVTTEGSDGEERPSCSAQQPPSPGSPEGPRSLCSKLACCEWCK RFKKYFCMVPDCEGSTWQQGRLCIHVYRLDNYSRVVFPVTFFFFNVLYWLVCLNL" polyA_signal 3118..3123 BASE COUNT 726 a 885 c 687 g 855 t ORIGIN 1 agagcgtgag cgcgacctcc gcgcaggtgg tggcgccggt ctccgcggaa atgttgtcca 61 aagttcttcc agtcctccta ggcatcttat tgatcctcca gtcgagggtc gagggacctc 121 agactgaatc aaagaatgaa gcctcttccc gtgatgttgt ctatggcccc cagccccagc 181 ctctggaaaa tcagctcctc tctgaggaaa caaagtcaac tgagactgag actgggagca 241 gagttggcaa actgccagaa gcctctcgca tcctgaacac tatcctgagt aattatgacc 301 acaaactgcg ccctggcatt ggagagaagc ccactgtggt cactgttgag atcgccgtca 361 acagccttgg tcctctctct atcctagaca tggaatacac cattgacatc atcttctccc 421 agacctggta cgacgaacgc ctctgttaca acgacacctt tgagtctctt gttctgaatg 481 gcaatgtggt gagccagcta tggatcccgg acaccttttt taggaattct aagaggaccc 541 acgagcatga gatcaccatg cccaaccaga tggtccgcat ctacaaggat ggcaaggtgt 601 tgtacacaat taggatgacc attgatgccg gatgctcact ccacatgctc agatttccaa 661 tggattctca ctcttgccct ctatctttct ctagcttttc ctatcctgag aatgagatga 721 tctacaagtg ggaaaatttc aagcttgaaa tcaatgagaa gaactcctgg aagctcttcc 781 agtttgattt tacaggagtg agcaacaaaa ctgaaataat cacaacccca ggtgacttca 841 tggtcatgac gattttcttc aatgtgagca ggcggtttgg ctatgttgcc tttcaaaact 901 atgtcccttc ttccgtgacc acgatgctct cctgggtttc cttttggatc aagacagagt 961 ctgctccagc ccggacctct ctagggatca cctctgttct gaccatgacc acgttgggca 1021 ccttttctcg taagaatttc ccgcgtgtct cctatatcac agccttggat ttctatatcg 1081 ccatctgctt cgtcttctgc ttctgcgctc tgttggagtt tgctgtgctc aacttcctga 1141 tctacaacca gacaaaagcc catgcttctc ctaaactccg ccatcctcgt atcaatagcc 1201 gtgcccatgc ccgtacccgt gcacgttccc gagcctgtgc ccgccaacat caggaagctt 1261 ttgtgtgcca gattgtcacc actgagggaa gtgatggaga ggagcgcccg tcttgctcag 1321 cccagcagcc ccctagccca ggtagccctg agggtccccg cagcctctgc tccaagctgg 1381 cctgctgtga gtggtgcaag cgttttaaga agtacttctg catggtcccc gattgtgagg 1441 gcagtacctg gcagcagggc cgcctctgca tccatgtcta ccgcctggat aactactcga 1501 gagttgtttt cccagtgact ttcttcttct tcaatgtgct ctactggctt gtttgcctta 1561 acttgtaggt accagctggt accctgtggg gcaacctctc cagttccccc aggaggtcca 1621 agccccttgc caagggagtt gggggaaagc agcagcagca gcaggagcga ctagagtttt 1681 tcctgcccca ttccccaaac agaagcttgc agagggtttg tctttgctgc ccctctcccc 1741 tacctggccc attcactgag tcttctcagc agaccatttc aaattattaa taaatgggcc 1801 acctccctct tcttcaagga gcatccgtga tgctcagtgt tcaaaaccac agccacttag 1861 tgatcagctc cctaaaacca tgcctaagta caggcggatt agctatcttc caacaatgct 1921 gaccaccaga caattactgc atttttccag aagcccacta ttgcctttgt agtgctttcg 1981 gcccagttct ggcctcagcc tcaaagtgca ccgactagtt gcttgcctat acctggcacc 2041 tcattaagat gctgggcagc agtataacag gaggaagaga tccctctcct ttggtcagat 2101 tattatgttc tcagttctct ctccctgcta cccctttctc tgcagataga tagacactgg 2161 cattatccct ttaggaaggg gggggggcag caagagagcc tatttgggac agcattcctc 2221 tctctctcct gctgtgacat ctccctctcc ttgctggcct ccatctttcg tctgcactac 2281 caattcaatg cccttcatcc aatgggtatc tatttttgtg tgtgattata gtaactactc 2341 cctgctttat atgccccctc ttccttctct ttaccccctg tgactctttc tgtactttcc 2401 cagtgacttg ccctagccct gacccaggca ctaggccttg gtgacttcct ggggccaaga 2461 aactaaggaa actcggcttt gcaacaggca ttactcgcca ttgattgtgc ccacccaggg 2521 cacactgatg gagttctatc acttgcttga cccctggacc cataaaccag tccactgtta 2581 tacccggggc actctaacca tcacaatcaa tcaatcaaat tcccttaaat ttgtatggca 2641 ctggaacttt ggcaaagcac ttttgacaag ttgtgtctga ttggagcttc atgatagcct 2701 tgtgacatct taggcaggat tcttatcccc attttgcaga tgaaaaaccc tgagtctcag 2761 atttctgtgg gactgtggat ctcactggaa gcctatccaa gagcccactg tcaccttcta 2821 gaccacatga tagggctaga cagctcagtt caccatgatc ttttgtcact ctgctggcac 2881 accagtggca aggccagaat gcgacctctc tttagctcaa tttctgggcc tgaggtgctc 2941 agactgcccc caagatcaaa tctctcctgg ctgtagtaac ccagtggaat gaatttggac 3001 atgccccaat gcttctatat gctaagtgaa atctgtgtct gtaatttgtt ggggggtgga 3061 tagggtgggg tctccatcta ctttttgtca ccatcatctg aaatggggaa atatgtaaat 3121 aaatatatca gcaaagcaaa aaaaaaaaaa aaa // LOCUS HSGACNMTS 993 bp RNA PRI 29-FEB-1996 DEFINITION H.sapiens mRNA for guanidinoacetate N-methyltransferase. ACCESSION Z49878 NID g1212945 KEYWORDS guanidinoacetate N-methyltransferase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 993) AUTHORS Isbrandt,D. and von Figura,K. TITLE Cloning and sequence analysis of human guanidinoacetate N-methyltransferase cDNA JOURNAL Biochim. Biophys. Acta 1264 (3), 265-267 (1995) MEDLINE 96138544 REFERENCE 2 (bases 1 to 993) AUTHORS Isbrandt,D. TITLE Direct Submission JOURNAL Submitted (15-JUN-1995) Dirk Isbrandt, Institute for Biochemistry II, University of Goettingen, Gosslerstrasse 12d, Goettingen, 37073, Germany FEATURES Location/Qualifiers source 1..993 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="GAMT5-12" /tissue_type="liver" /clone_lib="cDNA library" 5'UTR 1..43 CDS 44..754 /codon_start=1 /product="guanidinoacetate N-methyltransferase" /db_xref="PID:e184531" /db_xref="PID:g1212946" /translation="MSAPSATPIFAPGENCSPAWGAAPAAYDAADTHLRILGKPVMER WETPYMHALAAAASSKGGRVLEVGFGMAIAASKVQEAPIDEHWIIECNDGVFQRLRDW APRQTHKVIPLKGLWEDVAPTLPDGHFDGILYDTYPLSEETWHTHQFNFIKNHAFRLL KPGGVLTYCNLTSWGELMKSKYSDITIMFEETQVPALLEAGFRRENIRTEVMALVPPA DCRYYAFPQMITPLVTKG" 3'UTR 755..993 BASE COUNT 171 a 347 c 310 g 165 t ORIGIN 1 cggcggcgcg cgatcgaggt cgggtcgccg tccagcctgc agcatgagcg cccccagcgc 61 gacccccatc ttcgcgcccg gcgagaactg cagccccgcg tggggggcgg cgcccgcggc 121 ctacgacgca gcggacacgc acctgcgcat cctgggcaag ccggtgatgg agcgctggga 181 gaccccctat atgcacgcgc tggccgccgc cgcctcctcc aaagggggcc gggtcctgga 241 ggtgggcttt ggcatggcca tcgcagcgtc aaaggtgcag gaggcgccca ttgatgagca 301 ttggatcatc gagtgcaatg acggcgtctt ccagcggctc cgggactggg ccccacggca 361 gacacacaag gtcatcccct tgaaaggcct gtgggaggat gtggcaccca ccctgcctga 421 cggtcacttt gatgggatcc tgtacgacac gtacccactc tcggaggaga cctggcacac 481 acaccagttc aacttcatca agaaccacgc ctttcgcctg ctgaagccgg ggggcgtcct 541 cacctactgc aacctcacct cctgggggga gctgatgaag tccaagtact cagacatcac 601 catcatgttt gaggagacgc aggtgcccgc gctgctggag gccggcttcc ggagggagaa 661 catccgtacg gaggtgatgg cgctggtccc accggccgac tgccgctact acgccttccc 721 acagatgatc acgcccctgg tgaccaaagg ctgagccccc accccggccc ggccacaccc 781 atgccctccg ccgtgccttc ctggccggga gtccagggtg tcgcaccagc cctgggctga 841 tcccagctgt gtgtcaccag aagctttccc ggcttctctg tgaggggtcc caccagccca 901 gggctgatcc cagctgtgtg tcaccagcag ctttcccagc ttgctctgtg agggtcactg 961 ctgcccactg cagggtgccc tgaggtgaag ccg // LOCUS HSGAGMR 3291 bp RNA PRI 03-MAY-1993 DEFINITION Human mRNA for GARS-AIRS-GART. ACCESSION X54199 X56340 NID g31641 KEYWORDS GARS-AIRS-GART. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3291) AUTHORS Aimi,J. TITLE Direct Submission JOURNAL Submitted (31-JUL-1990) Aimi J., Purdue University, West Lafayette, Indiana, USA REFERENCE 2 (bases 1 to 3291) AUTHORS Aimi,J., Qiu,H., Williams,J., Zalkin,H. and Dixon,J.E. TITLE De novo purine nucleotide biosynthesis: cloning of human and avian cDNAs encoding the trifunctional glycinamide ribonucleotide synthetase-aminoimidazole ribonucleotide synthetase-glycinamide ribonucleotide transformylase by functional complementation in E. coli JOURNAL Nucleic Acids Res. 18 (22), 6665-6672 (1990) MEDLINE 91067455 COMMENT Data kindly reviewed (24-APR-1991) by Aimi J. FEATURES Location/Qualifiers source 1..3291 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HEP G2." mat_peptide 79..3108 /standard_name="GARS-AIRS-GART" /product="glycinamide ribonucleotide synthetase-aminoimidazole ribonucleotide synthetase-glycinamide ribonucleotide transformylase" mRNA <79..>3111 CDS 79..3111 /standard_name="GARS-AIRS-GART" /codon_start=1 /product="glycinamide ribonucleotide synthetase-aminoimidazole ribonucleotide synthetase-glycinamide ribonucleotide transformylase" /db_xref="PID:g31642" /db_xref="SWISS-PROT:P22102" /translation="MAARVLIIGSGGREHTLAWKLAQSHHVKQVLVAPGNAGTACSEK ISNTAISISDHTALAQFCKEKKIEFVVVGPEAPLAAGIVGNLRSAGVQCFGPTAEAAQ LESSKRFAKEFMDRHGIPTAQWKAFTKPEEACSFILSADFPALVVKASGLAAGKGVIV AKSKEEACKAVQEIMQEKAFGAAGETIVIEELLDGEEVSCLCFTDGKTVAPMPPAQDH KRLLEGDGGPNTGGMGAYCPAPQVSNDLLLKIKDTVLQRTVDGMQQEGTPYTGILYAG IMLTKNGPKVLEFNCRFGDPECQVILPLLKSDLYEVIQSTLDGLLCTSLPVWLENHTA LTVVMASKGYPGDYTKGVEITGFPEAQALGLEVFHAGTALKNGKVVTHGGRVLAVTAI RENLISALEEAKKGLAAIKFEGAIYRKDVGFRAIAFLQQPRSLTYKESGVDIAAGNML VKKIQPLAKATSRSGCKVDLGGFAGLFDLKAAGFKDPLLASGTDGVGTKLKIAQLCNK HDTIGQDLVAMCVNDILAQGAEPLFFLDYFSCGKLDLSVTEAVVAGIAKACGKAGCAL LGGETAEMPDMYPPGEYDLAGFAVGAMERDQKLPHLERITEGDVVVGIASSGLHSNGF SLVRKIVAKSSLQYSSPAPDGCGDQTLGDLLLTPTRIYSHSLLPVLRSGHVKAFAHIT GGGLLENIPRVLPEKLGVDLDAQTWRIPRVFSWLQQEGHLSEEEMARTFNCGVGAVLV VSKEQTEQILRDIQQHKEEAWVIGSVVARAEGSPRVKVKNLIESMQINGSVLKNGSLT NHFSFEKKKARVAVLISGTGSNLQALIDSTREPNSSAQIDIVISNKAAVAGLDKAERA GIPTRVINHKLYKNRVEFDSAIDLVLEEFSIDIVCLAGFMRILSGPFVQKWNGKMLNI HPSLLPSFKGSNAHEQALETGVTVTGCTVHFVAEDVDAGQIILQEAVPVKRGDTVATL SERVKLAEHKIFPAALQLVASGTVQLGENGKICWVKEE" BASE COUNT 943 a 689 c 814 g 845 t ORIGIN 1 accgggcaag cgggaaccag gtggccaccc ggtgtcggtt tcattttcct ttggaatttc 61 tgctttacag acagaacaat ggcagcccga gtacttataa ttggcagtgg aggaagggaa 121 catacgctgg cctggaaact tgcacagtct catcatgtca aacaagtgtt ggttgcccca 181 ggaaacgcag gcactgcctg ctctgaaaag atttcaaata ccgccatctc aatcagtgac 241 cacactgccc ttgctcaatt ctgcaaagag aagaaaattg aatttgtagt tgttggacca 301 gaagcacctc tggctgctgg gattgttggg aacctgaggt ctgcaggagt gcaatgcttt 361 ggcccaacag cagaagcggc tcagttagag tccagcaaaa ggtttgccaa agagtttatg 421 gacagacatg gaatcccaac cgcacaatgg aaggctttca ccaaacctga agaagcctgc 481 agcttcattt tgagtgcaga cttccctgct ttggttgtga aggccagtgg tcttgcagct 541 ggaaaagggg tgattgttgc aaagagcaaa gaagaggcct gcaaagctgt acaagagatc 601 atgcaggaga aagcctttgg ggcagctgga gaaacaattg tcattgaaga acttcttgac 661 ggagaagagg tgtcgtgtct gtgtttcact gatggcaaga ctgtggcccc catgccccca 721 gcacaggacc ataagcgatt actggaggga gatggtggcc ctaacacagg gggaatggga 781 gcctattgtc cagcccctca ggtttctaat gatctattac taaaaattaa agatactgtt 841 cttcagagga cagtggatgg catgcagcaa gagggtactc catatacagg tattctctat 901 gctggaataa tgctgaccaa gaatggccca aaagttctag agtttaattg ccgttttggt 961 gatccagagt gccaagtaat cctcccactt cttaaaagtg atctttatga agtgattcag 1021 tccaccttag atggactgct ctgcacatct ctgcctgttt ggctagaaaa ccacaccgcc 1081 ctaactgttg tcatggcaag taaaggttat cctggagact acaccaaggg tgtagagata 1141 acagggtttc ctgaggctca agctctagga ctggaggtgt tccatgcagg cactgccctc 1201 aaaaatggca aagtagtaac tcatgggggt agagttcttg cagtcacagc catccgggaa 1261 aatctcatat cagcccttga ggaagccaag aaaggactag ctgctataaa gtttgaggga 1321 gcaatttata ggaaagacgt cggctttcgt gccatagctt tcctccagca gcccaggagt 1381 ttgacttaca aggaatctgg agtagatatc gcagctggaa atatgctggt caagaaaatt 1441 cagcctttag caaaagccac ttccagatca ggctgtaaag ttgatcttgg aggttttgct 1501 ggtctttttg atttaaaagc agctggtttc aaagatcccc ttctggcctc tggaacagat 1561 ggcgttggaa ctaaactaaa gattgcccag ctatgcaata aacatgatac cattggtcaa 1621 gatttggtag caatgtgtgt taatgatatt ctggcacaag gagcagagcc cctcttcttc 1681 cttgattact tttcctgtgg aaaacttgac ctcagtgtaa ctgaagctgt tgttgctgga 1741 attgctaaag cttgtggaaa agctggatgt gctctccttg gaggtgaaac agcagaaatg 1801 cctgacatgt atccccctgg agagtatgac ctagctgggt ttgccgttgg tgccatggag 1861 cgagatcaga aactccctca cctggaaaga atcactgagg gtgatgttgt tgttggaata 1921 gcttcatctg gtcttcatag caatggattt agccttgtga ggaaaatcgt tgcaaaatct 1981 tccctccagt actcctctcc agcacctgat ggttgtggtg accagacttt aggggactta 2041 cttctcacgc ctaccagaat ctacagccat tcactgttac ctgtcctacg ttcaggacat 2101 gtcaaagcct ttgcccatat tactggtgga ggattactag agaacatccc cagagtcctc 2161 cctgagaaac ttggggtaga tttagatgcc cagacctgga ggatccccag ggttttctca 2221 tggttgcagc aggaaggaca cctctctgag gaagagatgg ccagaacatt taactgtggg 2281 gttggcgctg tccttgtggt atcaaaggag cagacagagc agattctgag ggatatccag 2341 cagcacaagg aagaagcctg ggtgattggc agtgtggttg cacgagctga aggttcccca 2401 cgtgtgaaag tcaagaatct gattgaaagc atgcaaataa atgggtcagt gttgaagaat 2461 ggctccctga caaatcattt ctcttttgaa aaaaaaaagg ccagagtggc tgtcttaata 2521 tctggaacag gatcgaacct gcaagcactt atagacagta ctcgggaacc aaatagctct 2581 gcacaaattg atattgttat ctccaacaaa gccgcagtag ctgggttaga taaagcggaa 2641 agagctggta ttcccactag agtaattaat cataaactgt ataaaaatcg tgtagaattt 2701 gacagtgcaa ttgacctagt ccttgaagag ttctccatag acatagtctg tcttgcagga 2761 ttcatgagaa ttctttctgg cccctttgtc caaaagtgga atggaaaaat gctcaatatc 2821 cacccatcct tgctcccttc ttttaagggt tcaaatgccc atgagcaagc cctggaaacc 2881 ggagtcacag ttactgggtg cactgtacac tttgtagctg aagatgtgga tgctggacag 2941 attattttgc aagaagctgt tcccgtgaag aggggtgata ctgtcgcaac tctttctgaa 3001 agagtaaaat tagcagaaca taaaatattt cctgcagccc ttcagctggt ggccagtgga 3061 actgtacagc ttggagaaaa tggcaagatc tgttgggtta aagaggaatg aagcctttta 3121 attcagaaat ggggccagtt tagaaagaat tatttgctgt ttgcatggtg gttttttatc 3181 atggacttgg cccaaaagaa aaactgctaa aagacaaaaa agacctcacc cttacttcat 3241 ctattttttt aataaataga gactcactaa aaaaaaaaaa aaaaaaaaaa a // LOCUS HSGAL8GEN 1107 bp RNA PRI 10-OCT-1997 DEFINITION H.sapiens mRNA for galectin-8. ACCESSION X91790 NID g2511667 KEYWORDS gal-8 gene; galectin-8. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1107) AUTHORS Hadari,Y.R., Eisenstein,M., Zakut,R. and Zick,Y. TITLE Galectin-8: on the road from structure to function JOURNAL Trends Glycosci. Glycotechnol. 9, 103-112 (1997) REFERENCE 2 (bases 1 to 1107) AUTHORS Zick,Y. TITLE Direct Submission JOURNAL Submitted (22-SEP-1995) Y. Zick, Weizmann Institute of Science, Dep. of Chemical Immunology, Weizmann Institute of Science, Rehovot 76100, ISRAEL COMMENT Related sequences U09824 and F08504. FEATURES Location/Qualifiers source 1..1107 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain hippocampus" gene 40..996 /gene="gal-8" CDS 40..996 /gene="gal-8" /note="monomer" /codon_start=1 /product="Galectin-8" /db_xref="PID:e199407" /db_xref="PID:g2511668" /translation="MMLSLNNLQNIIYNPVIPFVGTIPDQLDPGTLIVIRGHVPSDAD RFQVDLQNGSSMKPRADVAFHFNPRFKRAGCIVCNTLINEKWGREEITYDTPFQKEKK SFEIVIMVLKAKFQVAVNGKHTLLYGHRIGPEKIDTLGIYGKVNIHSIGFSFSSDLQS TQASSLELTEISRENVPKSGTPQLRLPFAARLNTPMGPGRTVVVKGEVNANAKSFNVD LLAGKSKDIALHLNPRLNIKAFVRNSFLQESWGEEERNITSFPFSPGMYFEMIIYCDV REFKVAVNGVHSLEYKHRFKELSSIDTLEINGDIHLLEVRSW" BASE COUNT 344 a 229 c 249 g 285 t ORIGIN 1 acacagaaga gactccaatc gacaagaagc tggaaaagaa tgatgttgtc cttaaacaac 61 ctacagaata tcatctataa cccggtaatc ccgtttgttg gcaccattcc tgatcagctg 121 gatcctggaa ctttgattgt gatacgtggg catgttccta gtgacgcaga cagattccag 181 gtggatctgc agaatggcag cagcatgaaa cctcgagccg atgtggcctt tcatttcaat 241 cctcgtttca aaagggccgg ctgcattgtt tgcaatactt tgataaatga aaaatgggga 301 cgggaagaga tcacctatga cacgcctttc caaaaagaga aaaagtcttt tgagatcgtg 361 attatggtgc tgaaggccaa attccaggtg gctgtaaatg gaaaacatac tctgctctat 421 ggccacagga tcggcccaga gaaaatagac actctgggca tttatggcaa agtgaatatt 481 cactcaattg gttttagctt cagctcggac ttacaaagta cccaagcatc tagtctggaa 541 ctgacagaga taagtagaga aaatgttcca aagtctggca cgccccagct taggctgcca 601 ttcgctgcaa ggttgaacac ccccatgggc cctggacgaa ctgtcgtcgt taaaggagaa 661 gtgaatgcaa atgccaaaag ctttaatgtt gacctactag caggaaaatc aaaggatatt 721 gctctacact tgaacccacg cctgaatatt aaagcatttg taagaaattc ttttcttcag 781 gagtcctggg gagaagaaga gagaaatatt acctctttcc catttagtcc tgggatgtac 841 tttgagatga taatttattg tgatgttaga gaattcaagg ttgcagtaaa tggcgtacac 901 agcctggagt acaaacacag atttaaagag ctcagcagta ttgacacgct ggaaattaat 961 ggagacatcc acttactgga agtaaggagc tggtagccta cctacacagc tgctacaaaa 1021 accaaaatac agaatggctt ctgtgatact ggccttgctg aaacgcatct cactgtcatt 1081 ctattgttta tattgttaaa atgacct // LOCUS HSGALNAT1 1680 bp RNA PRI 10-JAN-1996 DEFINITION H.sapiens mRNA for UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase (T1). ACCESSION X85018 NID g971458 KEYWORDS UDP-GalNAc-T1 gene; UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1680) AUTHORS White,T., Bennett,E.P., Takio,K., Sorensen,T., Bonding,N. and Clausen,H. TITLE Purification and cDNA cloning of a human UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase JOURNAL J. Biol. Chem. 270 (41), 24156-24165 (1995) MEDLINE 96025800 REFERENCE 2 (bases 1 to 1680) AUTHORS Bennett,E.P. TITLE Direct Submission JOURNAL Submitted (28-FEB-1995) E.P. Bennett, Dental School, University of Copenhagen, Norre Alle 20, 2200 Copenhagen, DENMARK COMMENT Related sequences: L07780; L17437. FEATURES Location/Qualifiers source 1..1680 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="gastric" /cell_line="MKN45" gene 1..1680 /gene="GalNAc-T1" CDS 1..1680 /gene="GalNAc-T1" /codon_start=1 /product="UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase" /db_xref="PID:g971459" /translation="MRKFAYCKVVLATSLIWVLLDMFLLLYFSECNKCDEKKERGLPA GDVLEPVQKPHEGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVR LEGCKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASER DFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGW LEPLLARIKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMD RRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTL EIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKVD YGDISSRVGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKE NEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEY DPVKLTLQHVNSNQCLDKATEEDSQVPSIRDCNGSRSQQWLLRNVTLPEIF" BASE COUNT 531 a 305 c 397 g 447 t ORIGIN 1 atgagaaaat ttgcatactg caaggtggtc ctagccacct ccttgatttg ggtactcttg 61 gatatgttcc tgctgcttta cttcagtgaa tgcaacaaat gtgatgaaaa aaaggagaga 121 ggacttcctg ctggagatgt tctagagcca gtacaaaagc ctcatgaagg tcctggagaa 181 atggggaaac cagtcgtcat tcctaaagag gatcaagaaa agatgaaaga gatgtttaaa 241 atcaatcagt tcaatttaat ggcaagtgag atgattgcac tcaacagatc tttaccagat 301 gttaggttag aagggtgtaa aacaaaggtg tatccagata atcttcctac aacaagtgtg 361 gtgattgttt tccacaatga ggcttggagc acacttctgc gaactgtcca tagtgtcatt 421 aatcgctcac caagacacat gatagaagaa attgttctag tagatgatgc cagtgaaaga 481 gactttttga aaaggccttt agagagttat gtgaaaaaac taaaagtacc agttcatgta 541 attcgaatgg aacaacgttc tggattgatc agagctagat taaaaggagc tgctgtgtct 601 aaaggccaag tgatcacctt cctggatgcc cattgtgagt gtacagtggg atggctggag 661 cctctcttgg ccaggatcaa acatgacagg agaacagtgg tgtgtcccat catcgatgtg 721 atcagtgatg atacttttga gtacatggca ggctctgata tgacctatgg tgggttcaac 781 tggaagctca attttcgctg gtatcctgtt ccccaaagag aaatggacag aaggaaaggt 841 gatcggactc ttcctgtcag gacacctacc atggcaggag gccttttttc aatagacaga 901 gattactttc aggaaattgg aacatatgat gctggaatgg atatttgggg aggagaaaac 961 ctagaaattt cctttaggat ttggcagtgt ggaggaactt tggaaattgt tacatgctca 1021 catgttggac atgtgtttcg gaaagctaca ccttacacgt ttccaggagg cacagggcag 1081 attatcaata aaaataacag acgacttgca gaagtgtgga tggatgaatt caagaatttc 1141 ttctatataa tttctccagg tgttacaaag gtagattatg gagatatatc gtcaagagtt 1201 ggtctaagac acaaactaca atgcaaacct ttttcctggt acctagagaa tatatatcct 1261 gattctcaaa ttccacgtca ctatttctca ttgggagaga tacgaaatgt ggaaacgaat 1321 cagtgtctag ataacatggc tagaaaagag aatgaaaaag ttggaatttt taattgccat 1381 ggtatggggg gtaatcaggt tttctcttat actgccaaca aagaaattag aacagatgac 1441 ctttgcttgg atgtttccaa acttaatggc ccagttacaa tgctcaaatg ccaccaccta 1501 aaaggcaacc aactctggga gtatgaccca gtgaaattaa ccctgcagca tgtgaacagt 1561 aatcagtgcc tggataaagc cacagaagag gatagccagg tgcccagcat tagagactgc 1621 aatggaagtc ggtcccagca gtggcttctt cgaaacgtca cccttccaga aatattctga // LOCUS HSGALNAT2 1716 bp RNA PRI 10-JAN-1996 DEFINITION H.sapiens mRNA for UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase (T2). ACCESSION X85019 NID g971460 KEYWORDS GalNAc-T2 gene; UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1716) AUTHORS White,T., Bennett,E.P., Takio,K., Sorensen,T., Bonding,N. and Clausen,H. TITLE Purification and cDNA cloning of a human UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase JOURNAL J. Biol. Chem. 270 (41), 24156-24165 (1995) MEDLINE 96025800 REFERENCE 2 (bases 1 to 1716) AUTHORS Bennett,E.P. TITLE Direct Submission JOURNAL Submitted (28-FEB-1995) E.P. Bennett, Dental School, University of Copenhagen, Norre Alle 20, 2200 Copenhagen, DENMARK FEATURES Location/Qualifiers source 1..1716 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="gastric" /cell_line="MKN45" /clone_lib="lambda gt10" /clone="2782" gene 1..1716 /gene="GalNAc-T2" CDS 1..1716 /gene="GalNAc-T2" /codon_start=1 /product="UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase" /db_xref="PID:g971461" /translation="MRRRSRMLLCFAFLWVLGIAYYMYSGGGSALAGGAGGGAGRKED WNEIDPIKKKDLHHSNGEEKAQSMETLPPGKVRWPDFNQEAYVGGTMVRSGQDPYARN KFNQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVL KKSPPHLIKEIILVDDYSNDPEDGALLGKIEKVRVLRNDRREGLMRSRVRGADAAQAK VLTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDW NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGE NLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEY KNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL QQGTNCLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI KLQGCRENDSRQKWEQIEGNSKLRHVGSNLCLDSRTAKSGGLSVEVCGPALSQQWKFT LNLQQ" BASE COUNT 413 a 406 c 542 g 355 t ORIGIN 1 atgcggcggc gctcgcggat gctgctctgc ttcgccttcc tgtgggtgct gggcatcgcc 61 tactacatgt actcgggggg cggctctgcg ctggccgggg gcgcgggcgg cggcgccggc 121 aggaaggagg actggaatga aattgacccc attaaaaaga aagaccttca tcacagcaat 181 ggagaagaga aagcacaaag catggagacc ctccctccag ggaaagtacg gtggccagac 241 tttaaccagg aagcttatgt tggagggacg atggtccgct ccgggcagga cccttacgcc 301 cgcaacaagt tcaaccaggt ggagagtgat aagcttcgaa tggacagagc catccctgac 361 acccggcatg accagtgtca gcggaagcag tggcgggtgg atctgccggc caccagcgtg 421 gtgatcacgt ttcacaatga agccaggtcg gccctactca ggaccgtggt cagcgtgctt 481 aagaaaagcc cgccccatct cataaaagaa atcatcttgg tggatgacta cagcaatgat 541 cctgaggacg gggctctctt ggggaaaatt gagaaagtgc gagttcttag aaatgatcga 601 cgagaaggcc tcatgcgctc acgggttcgg ggggccgatg ctgcccaagc caaggtcctg 661 accttcctgg acagtcactg cgagtgtaat gagcactggc tggagcccct cctggaaagg 721 gtggcggagg acaggactcg ggttgtgtca cccatcatcg atgtcattaa tatggacaac 781 tttcagtatg tgggggcatc tgctgacttg aagggcggtt ttgattggaa cttggtattc 841 aagtgggatt acatgacgcc tgagcagaga aggtcccggc aggggaaccc agtcgcccct 901 ataaaaaccc ccatgattgc tggtgggctg tttgtgatgg ataagttcta ttttgaagaa 961 ctggggaagt acgacatgat gatggatgtg tggggaggag agaacctaga gatctcgttc 1021 cgcgtgtggc agtgtggtgg cagcctggag atcatcccgt gcagccgtgt gggacacgtg 1081 ttccggaagc agcaccccta cacgttcccg ggtggcagtg gcactgtctt tgcccgaaac 1141 acccgccggg cagcagaggt ctggatggat gaatacaaaa atttctatta tgcagcagtg 1201 ccttctgcta gaaacgttcc ttatggaaat attcagagca gattggagct taggaagaaa 1261 ctcagctgca agcctttcaa atggtacctt gaaaatgtct atccagagtt aagggttcca 1321 gaccatcagg atatagcttt tggggccttg cagcagggaa ctaactgcct cgacactttg 1381 ggacactttg ctgatggtgt ggttggagtt tatgaatgtc acaatgctgg gggaaaccag 1441 gaatgggcct tgacgaagga gaagtcggtg aagcacatgg atttgtgcct tactgtggtg 1501 gaccgggcac cgggctctct tataaagctg cagggctgcc gagaaaatga cagcagacag 1561 aaatgggaac agatcgaggg caactccaag ctgaggcacg tgggcagcaa cctgtgcctg 1621 gacagtcgca cggccaagag cgggggccta agcgtggagg tgtgtggccc ggccctttcg 1681 cagcagtgga agttcacgct caacctgcag cagtag // LOCUS HSGAPJR 1558 bp RNA PRI 12-SEP-1993 DEFINITION Human liver mRNA for gap junction protein. ACCESSION X04325 NID g31646 KEYWORDS gap junction protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1558) AUTHORS Kumar,N.M. and Gilula,N.B. TITLE Cloning and characterization of human and rat liver cDNAs coding for a gap junction protein JOURNAL J. Cell Biol. 103 (3), 767-776 (1986) MEDLINE 86304555 FEATURES Location/Qualifiers source 1..1558 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 63..914 /note="gap junction protein (aa 1-283)" /codon_start=1 /db_xref="PID:g31647" /db_xref="SWISS-PROT:P08034" /translation="MNWTGLYTLLSGVNRHSTAIGRVWLSVIFIFRIMVLVVAAESVW GDEKSSFICNTLQPGCNSVCYDQFFPISHVRLWSLQLILVSTPALLVAMHVAHQQHIE KKMLRLEGHGDPLHLEEVKRHKVHISGTLWWTYVISVVFRLLFEAVFMYVFYLLYPGY AMVRLVKCDVYPCPNTVDCFVSRPTEKTVFTVFMLAASGICIILNVAEVVYLIIRACA RRAQRRSNPPSRKGSGFGHRLSPEYKQNEINKLLSEQDGSLKDILRRSPGTGAGLAEK SDRCSAC" misc_feature 1539..1544 /note="put. polyA signal" polyA_site 1558 /note="polyA site" BASE COUNT 310 a 445 c 453 g 350 t ORIGIN 1 cctctgggaa agggcagcag gagccaggtg tggcagtgac agggaggtgt gaatgaggca 61 ggatgaactg gacaggtttg tacaccttgc tcagtggcgt gaaccggcat tctactgcca 121 ttggccgagt atggctctcg gtcatcttca tcttcagaat catggtgctg gtggtggctg 181 cagagagtgt gtggggtgat gagaaatctt ccttcatctg caacacactc cagcctggct 241 gcaacagcgt ttgctatgac caattcttcc ccatctccca tgtgcggctg tggtccctgc 301 agctcatcct agtttccacc ccagctctcc tcgtggccat gcacgtggct caccagcaac 361 acatagagaa gaaaatgcta cggcttgagg gccatgggga ccccctacac ctggaggagg 421 tgaagaggca caaggtccac atctcaggga cactgtggtg gacctatgtc atcagcgtgg 481 tgttccggct gttgtttgag gccgtcttca tgtatgtctt ttatctgctc taccctggct 541 atgccatggt gcggctggtc aagtgcgacg tctacccctg ccccaacaca gtggactgct 601 tcgtgtcccg ccccaccgag aaaaccgtct tcaccgtctt catgctagct gcctctggca 661 tctgcatcat cctcaatgtg gccgaggtgg tgtacctcat catccgggcc tgtgcccgcc 721 gagcccagcg ccgctccaat ccaccttccc gcaagggctc gggcttcggc caccgcctct 781 cacctgaata caagcagaat gagatcaaca agctgctgag tgagcaggat ggctccctga 841 aagacatact gcgccgcagc cctggcaccg gggctgggct ggctgaaaag agcgaccgct 901 gctcggcctg ctgatgccac ataccaggca acctgccatc catccccgac cctgccctgg 961 gcgaagccct cctccttctc ccctgccggt gcacaggcct ctgcctgctg gggattactc 1021 gatcaaaacc ttccttccct ggctacttcc cttcctcccg gggccttcct tttaggtgct 1081 ggagctggag gggtggggag ctagaggcca cctatgccag tgctcaaggt tactgggagt 1141 gtgggctgcc cttgttgcct gcacccttcc ctcttccctc tccctctctc tgggaccact 1201 gggtacaaga gatgggatgc tccgacagcg tctccaatta tgaaactaat cttaaccctg 1261 tgctgtcaga taccctggtt ttctggagtc acagtcagtg aggaggatgt ggtaagagga 1321 ggcagagggc aggggtgctg tggacatgtg ggtggagaag ggagggtggc cagcactagt 1381 aaaggaggaa tagtgcttgc tggccacaag gaaaaggagg aggtgtctgg ggtgagggag 1441 ttagggagag agaagcaggc agataagttg gagcaggggt ggtcaaggcc acctctgcct 1501 ctagtcccca aggcctctct ctgcctgaaa tgttacacat taaacaggat tttacagt // LOCUS HSGARPGNA 4163 bp RNA PRI 26-JUL-1994 DEFINITION H.sapiens garp gene mRNA, complete CDS. ACCESSION Z24680 NID g439295 KEYWORDS garp gene; leucine-rich repeat containing protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4163) AUTHORS Ollendorff,V., Noguchi,T., deLapeyriere,O. and Birnbaum,D. TITLE The GARP gene encodes a new member of the family of leucine-rich repeat-containing proteins JOURNAL Cell Growth Differ. 5 (2), 213-219 (1994) MEDLINE 94235567 REFERENCE 2 (bases 1 to 4163) AUTHORS Birnbaum,D. TITLE Direct Submission JOURNAL Submitted (20-JUL-1993) BIRNBAUM D., U.119 INSERM, 27 Bd. Lei Roure, Marseille, France FEATURES Location/Qualifiers source 1..4163 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..>4163 /gene="garp" 5'UTR 1..94 /gene="garp" gene 1..4163 /gene="garp" sig_peptide 95..151 /gene="garp" CDS 95..2083 /gene="garp" /codon_start=1 /db_xref="PID:g439296" /translation="MRPQILLLLALLTLGLAAQHQDKVPCKMVDKKVSCQVLGLLQVP SVLPPDTETLDLSGNQLRSILASPLGFYTALRHLDLSTNEISFLQPGAFQALTHLEHL SLAHNRLAMATALSAGGLGPLPRVTSLDLSGNSLYSGLLERLLGEAPSLHTLSLAENS LTRLTRHTFRDMPALEQLDLHSNVLMDIEDGAFEGLPRLTHLNLSRNSLTCISDFSLQ QLRVLDLSCNSIEAFQTASQPQAEFQLTWLDLRENKLLHFPDLAALPRLIYLNLSNNL IRLPTGPPQDSKGIHAPSEGWSALPLSAPSGNASGRPLSQLLNLDLSYNEIELIPDSF LEHLTSLCFLNLSRNCLRTFEARRLGSLPCLMLLDLSHNALETLELGARALGSLRTLL LQGNALRDLPPYTFANLASLQRLNLQGNRVSPCGGPDEPGPSGCVAFSGITSLRSLSL VDNEIELLRAGAFLHTPLTELDLSSNPGLEVATGALGGLEASLEVLALQGNGLMVLQV DLPCFICLKRLNLAENRLSHLPAWTQAVSLEVLDLRNNSFSLLPGSAMGGLETSLRRL YLQGNPLSCCGNGWLAAQLHQGRVDVDATQDLICRFSSQEEVSLSHVRPEDCEKGGLK NINLIIILTFILVSAILLTTLAACCCVRRQKFNQQYKA" mat_peptide 152..2080 /gene="garp" 3'UTR 2081..4153 /gene="garp" polyA_site 4136 /gene="garp" BASE COUNT 836 a 1303 c 1112 g 912 t ORIGIN 1 ttgatttggt atagtgggaa catttgcttt ggagacagat gaactggatt ctgatcgtga 61 ccctgctatt ttctccttgt gtgactttgg agccatgaga ccccagatcc tgctgctcct 121 ggccctgctg accctaggcc tggctgcaca acaccaagac aaagtgccct gtaagatggt 181 ggacaagaag gtctcgtgcc aggttctggg cctgctccag gtcccctcgg tgctcccgcc 241 agacactgag acccttgatc tatctgggaa ccagctgcgg agtatcctgg cctcacccct 301 gggcttctac acggcacttc gtcacctgga cctgagcacc aatgagatca gcttcctcca 361 gccaggagcc ttccaggccc tgacccacct ggagcacctc agcctggctc acaaccggct 421 ggcgatggcc actgcgctga gtgctggtgg cctgggcccc ctgccacgcg tgacctccct 481 ggacctgtct gggaacagcc tgtacagcgg cctgctggag cggctgctgg gggaggcacc 541 cagcctgcat accctctcac tggcggagaa cagtctgact cgcctcaccc gccacacctt 601 ccgggacatg cctgcgctgg agcagcttga cctgcatagc aacgtgctga tggacatcga 661 ggatggcgcc ttcgagggcc tgccccgcct gacccatctc aacctctcca ggaattccct 721 cacctgcatc tccgacttca gcctccagca gctgcgggtg ctagacctga gctgcaacag 781 catcgaggcc tttcagacgg cctcccagcc ccaggctgag ttccagctca cctggcttga 841 cctgcgggag aacaaactgc tccatttccc cgacctggcc gcgctcccga gactcatcta 901 cctgaacttg tccaacaacc tcatccggct ccccacaggg ccaccccagg acagcaaggg 961 catccacgca ccttccgagg gctggtcagc cctgcccctc tcagccccca gcgggaatgc 1021 cagcggccgc cccctttccc agctcttgaa tctggatttg agctacaatg agattgagct 1081 catccccgac agctttcttg agcacctgac ctccctgtgc ttcctgaacc tcagcagaaa 1141 ctgcttgcgg acctttgagg cccggcgctt aggctccctg ccctgcctga tgctccttga 1201 cttaagccac aatgccctgg agacactgga actgggcgcc agagccctgg ggtctctgcg 1261 gacgctgctc ctacagggca atgccctgcg ggacctgccc ccatacacct ttgccaatct 1321 ggccagcctg cagcggctca acctgcaggg gaaccgagtc agcccctgtg gggggccaga 1381 tgagcctggc ccctccggct gtgtggcctt ctccggcatc acctccctcc gcagcctgag 1441 cctggtggat aatgagatag agctgctcag ggcaggggcc ttcctccaca ccccactgac 1501 tgagctggac ctttcttcca atcctgggct ggaggtggcc acgggggcct tgggaggcct 1561 ggaggcctcc ttggaggtcc tggcactgca gggcaacggg ctgatggtcc tgcaggtgga 1621 cctgccctgc ttcatctgcc tcaagcggct caatcttgcc gagaaccgcc tgagccacct 1681 tcccgcctgg acacaggctg tgtcactgga ggtgctggac ctgcgaaaca acagcttcag 1741 cctcctgcca ggcagtgcca tgggtggcct ggagaccagc ctccggcgcc tctacctgca 1801 ggggaatcca ctcagctgct gcggcaatgg ctggctggca gcccagctgc accagggccg 1861 tgtggacgtg gacgccaccc aggacctgat ctgccgcttc agctcccagg aggaggtgtc 1921 cctgagccac gtgcgtcccg aggactgtga gaagggggga ctgaagaaca tcaacctcat 1981 catcatcctc accttcatac tggtctctgc catcctcctc accacgctgg ccgcctgctg 2041 ctgcgtccgc cggcagaagt ttaaccaaca gtataaagcc taaagaagcc gggagacact 2101 ctaggtcagt gggggagcct gaggtacaga gaagagtgag gactgactca aggtcacaca 2161 gtgatccgga tcccagaact ctggtctcca aattacagcc caggacacct ttctctgccg 2221 cctgctgcat cagtgggtga cccccttccc gggctgcact ttgggtccag ctgtggaagc 2281 cagaagttgg gcggtttcag ggacagccga gaataatgtt gacctgtcag atcaacaaat 2341 cttcactgag catgtatttt gtgccacacc ctgctctggg cactgggaat gctgggaaat 2401 gagatacatt cccgccctca agaatctccc agtctggtag gagagagtgc tgcagagcca 2461 cgtggccgcc acgcagtgtg cttagggcct gaggtgtgaa agcccagggc tccagagctc 2521 ggcaggcccc gctggtttgg tgcggtgagt cctgccccgg ctgtgcaggg tgagggaggg 2581 ccaagccagg aggatttgtc tgagacattt ccaagcagac tgtttgtcac gtcttctgag 2641 aatgactttc agtctctctg aaaatgaaaa gcttaggacc ggaagagaga attggagctg 2701 tacgagtgtg tctcggatct ggtattgtta ggtgggccac ggcggctcca gcagggtctg 2761 gttaaggggt ccagcccagc actggaccat tccgtctcct gctctggact tgccctctcc 2821 cttcctggca ctctcatgtt gcataccctg accccagtgc tgctctaagc accgtccctg 2881 cccagcccca cttctccatc gcagccccac cttggctgct gagccaggag ctaaaacctt 2941 agatatctgg ttctgttttg cacccagctt ggcagatgtg gatttgaatc caagccttgt 3001 gtctgcccct atgtgacagc tctatatttt atccccgttt tataaaagag gaaactgaag 3061 ttctgaaaat ctccttccag ggccccagct aactaatgcc ataggtgaga ttcaaacctt 3121 catccttctg tctccagggc ctgatcttta ccactgcagg ggctgcaggc cgttaagtgg 3181 acaggaagtg gccccacata gcccgagcag ggtctggaag catcctgtgc tgtgcacacc 3241 tgctctctcc tctctcccag gcaggcagct gcaggcgctc tcctccttct ctgcctgttt 3301 ccctcctccc ttcctttcca ccctggtgtg ggttctcctg ttctctctgt gctcttgcat 3361 tctctcattc ccttttcctc tatggagcag agcctggagt ttgagactat ggaatccaac 3421 ctccccattg cacagatggg gaaactgagg cttaggaaga gaatgaaact tgtggagagc 3481 ttatacagaa cctctggggg aaaaaagagc ccttatttgt ggggtgagat tgggggttgg 3541 accagagtga tgtcctctct cagctatcac atcacaagat aatgctggct ccaaacttcc 3601 tttctgtgcc tcatcatgca aggatctttt ttccctctta caaaaacagg taaaaagcct 3661 cacccagatg acccccatcc ctcataccat ggagtcatga gctgtctggg aagaatggac 3721 gtgctgggac caactcaaga ccttgttttg ctgtcttcat catcttacct gtgcttggcc 3781 cacagtctgg ctcatgatgt gggctcagta atgtgcgaga aagtgaaaat gccactctct 3841 ccaccccatt ttacagagga gaacaccaag gcccagagga agttaaggga gagtcaatgg 3901 gcagagccag ggctaggccc tggtggtgtg tggagcaccc aggcagaccc agtcctggtt 3961 gggatcacac ccacgggtgc tactgcacgt aacactcctc cttaggcctg gaggccaagg 4021 tgtgggtccc cacgcctgat ctttgaaaac actacacagg gctgctgtca cttcccaggg 4081 cccaggcctc agcccaggcc tcgggaccaa ctctttgtat aacctacctg aatgtattaa 4141 aaactaattt tggaaaaaaa aaa // LOCUS HSGAT1MR 2298 bp RNA PRI 17-AUG-1992 DEFINITION H.sapiens GAT1 mRNA for GABA transporter. ACCESSION X54673 NID g31657 KEYWORDS GABA transporter protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2298) AUTHORS Nelson,N. TITLE Direct Submission JOURNAL Submitted (29-AUG-1990) Nelson N., Roche Institute of Molecular Biology, Nutley NJ 07110, USA REFERENCE 2 (bases 1 to 2298) AUTHORS Nelson,H., Mandiyan,S. and Nelson,N. TITLE Cloning of the human brain GABA transporter JOURNAL FEBS Lett. 269 (1), 181-184 (1990) MEDLINE 90353567 FEATURES Location/Qualifiers source 1..2298 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /clone_lib="lambda-ZapII" gene 235..2034 /gene="GAT1" CDS 235..2034 /gene="GAT1" /codon_start=1 /product="GABA transporter" /db_xref="PID:g31658" /db_xref="SWISS-PROT:P30531" /translation="MATNGSKVADGQISTEVSEAPVANDKPKTLVVKVQKKAADLPDR DTWKGRFDFLMSCVGYAIGLGNVWRFPYLCGKNGGGAFLIPYFLTLIFAGVPLFLLEC SLGQYTSIGGLGVWKLAPMFKGVGLAAAVLSFWLNIYYIVIISWAIYYLYNSFTTTLP WKQCDNPWNTDRCFSNYSMVNTTNMTSAVVEFWERNMHQMTDGLDKPGQIRWPLAITL AIAWILVYFCIWKGVGWTGKVVYFSATYPYIMLIILFFRGVTLPGAKEGILFYITPNF RKLSDSEVWLDAATQIFFSYGLGLGSLIALGSYNSFHNNVYRDSIIVCCINSCTSMFA GFVIFSIVGFMAHVTKRSIADVAASGPGLAFLAYPEAVTQLPISPLWAILFFSMLLML GIDSQFCTVEGFITALVDEYPRLLRNRRELFIAAVCIISYLIGLSNITQGGIYVFKLF DYYSASGMSLLFLVFFECVSISWFYGVNRFYDNIQEMVGSRPCIWWKLCWSFFTPIIV AGVFIFSAVQMTPLTMGNYVFPKWGQGVGWLMALSSMVLIPGYMAYMFLALKGSLKQR IQVMVQPSEDTVRPENGPEHAQAGSSTSKEAYI" BASE COUNT 436 a 715 c 616 g 531 t ORIGIN 1 gaattccgct ccggccgcag gatctcccca aggtggcaga aggaggcctt ctggagctga 61 cccacccccg acgaccatca gggtgaggca actccaaggt cctactctct ttctgtgcct 121 gttacccacc ccgtcctcct agggtgccct tgagccgcaa aactgctgtc cacgtggacc 181 gggggtgaca tcgcacgtcc atctgccagg acccctgcgt ccaaattccg agacatggcg 241 accaacggca gcaaggtggc cgacgggcag atctccaccg aggtcagcga ggcccctgtg 301 gccaatgaca agcccaaaac cttggtggtc aaggtgcaga agaaggcggc agacctcccc 361 gaccgggaca cgtggaaggg ccgcttcgac ttcctcatgt cctgtgtggg ctatgccatc 421 ggcctgggca acgtctggag gttcccctat ctctgcggga aaaatggtgg gggagccttc 481 ctgatcccct atttcctgac actcatcttt gcgggggtcc cactcttcct gctggagtgc 541 tccctgggcc agtacacctc catcgggggg ctaggggtat ggaagctggc tcctatgttc 601 aagggcgtgg gccttgcggc tgctgtgcta tcattctggc tgaacatcta ctacatcgtc 661 atcatctcct gggccattta ctacctgtac aactccttca ccacgacact gccgtggaaa 721 cagtgcgaca acccctggaa cacagaccgc tgcttctcca actacagcat ggtcaacact 781 accaacatga ccagcgctgt ggtggagttc tgggagcgca acatgcatca gatgacggac 841 gggctggata agccaggtca gatccgctgg ccactggcca tcacgctggc catcgcctgg 901 atccttgtgt atttctgtat ctggaagggt gttggctgga ctggaaaggt ggtctacttt 961 tcagccacat acccctacat catgctgatc atcctgttct tccgtggagt gacgctgccc 1021 ggggccaagg agggcatcct cttctacatc acacccaact tccgcaagct gtctgactcc 1081 gaggtgtggc tggatgcggc aacccagatc ttcttctcat acgggctggg cctggggtcc 1141 ctgatcgctc tcgggagcta caactctttc cacaacaatg tctacaggga ctccatcatc 1201 gtctgctgca tcaattcgtg caccagcatg ttcgcaggat tcgtcatctt ctccatcgtg 1261 ggcttcatgg cccatgtcac caagaggtcc attgctgatg tggccgcctc aggccccggg 1321 ctggcgttcc tggcataccc agaggcggtg acccagctgc ctatctcccc actctgggcc 1381 atcctcttct tctccatgct gttgatgctg ggcattgaca gccagttctg cactgtggag 1441 ggcttcatca cagccctggt ggatgagtac cccaggctcc tccgcaaccg cagagagctc 1501 ttcattgctg ctgtctgcat catctcctac ctgatcggtc tctctaacat cactcagggg 1561 ggtatttatg tcttcaaact ctttgactac tactctgcca gtggcatgag cctgctgttc 1621 ctcgtgttct ttgaatgtgt ctctatttcc tggttttacg gtgtcaaccg attctatgac 1681 aatatccaag agatggttgg atccaggccc tgcatctggt ggaaactctg ctggtctttc 1741 ttcacaccaa tcattgtggc gggcgtgttc attttcagtg ctgtgcagat gacgccactc 1801 accatgggaa actatgtttt ccccaagtgg ggccagggtg tgggctggct gatggctctg 1861 tcttccatgg tcctcatccc cgggtacatg gcctacatgt tcctcgccct aaagggctcc 1921 ctgaagcagc gcatccaagt catggtccag cccagcgaag acactgttcg cccagagaat 1981 ggtcctgagc acgcccaggc gggcagctcc accagcaagg aggcctacat ctagggtggg 2041 ggccactcac cgacccgaca ctctcacccc ccgacctggc tgagtgcgac caccacttga 2101 tgtctgagga taccttccat ctcaacctac ctcgagtggt gatccagaca ccatcaccac 2161 gcagagaggg gaggtgggag gacagttaga cccctgggtg ggccctgccg tgggcaagga 2221 tacccggtgg cttctggcac tggcgggctg gtgacctttt taatccaggc cccatcagca 2281 tcccactcct ggcgggat // LOCUS HSGATA3 1475 bp RNA PRI 03-MAY-1991 DEFINITION H.sapiens GATA-3 mRNA. ACCESSION X55037 NID g31661 KEYWORDS enhancer binding-protein; hGATA-3 gene; transcription factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1475) AUTHORS Leiden,J.M. TITLE Direct Submission JOURNAL Submitted (29-JAN-1991) J.M. Leiden, Howard Hughes Medical Institute, University of Michigan, 1150 W Medical Centre Drive, MSRB I Room 4510, Ann Arbor MI 48109, U S A REFERENCE 2 (bases 1 to 1475) AUTHORS Ho,I.C., Vorhees,P., Marin,N., Oakley,B.K., Tsai,S.F., Orkin,S.H. and Leiden,J.M. TITLE Human GATA-3: a lineage-restricted transcription factor that regulates the expression of the T cell receptor alpha gene JOURNAL EMBO J. 10 (5), 1187-1192 (1991) MEDLINE 91216113 FEATURES Location/Qualifiers source 1..1475 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T lymphocyte" /cell_line="Jurkat" /clone_lib="lambda-gt11" /clone="hGATA-3" mRNA 1..1475 /gene="hGATA-3" gene 1..1475 /gene="hGATA-3" gene 125..1456 /gene="GATA-3" CDS 125..1456 /gene="GATA-3" /codon_start=1 /db_xref="PID:g31662" /db_xref="SWISS-PROT:P23771" /translation="MEVTADQPRWVSHHHPAVLNGQHPDTHHPGLSHSYMDAAQYPLP EEVDVLFNIDGQGNHVPPYYGNSVRATVQRYPPTHHGSQVCRPPLLHGSLPWLDGGKV LGSHHTASPWNLSPFSKTSIHHGSPGPLSVYPPASSSSLSGGHASPHLFTFPPTPPKD VSPDPSLSTPGSGGSARQDEKECLKYQVPLPDSMKLESSHSRGSMTALGGASSSTHHP ITTYPPYVPEYSSGLFPPSSLLGGSPTGFGCKSRPKARSSTGRECVNCGATSTPLWRR DGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCANCQTTTTTLWRRNANG DPVCNACGLYYKLHNINRPLTMKKEGIQTRNRKMSSKSKKCKKVHDSLEDFPKNSSFN PAALSRHMSSLSHISPFSHSSHMLTTPTPMHPPSSLSFGPHHPSSMVTAMG" misc_feature 911..1148 /gene="GATA-3" /note="zinc finger region" BASE COUNT 303 a 578 c 380 g 214 t ORIGIN 1 aaagcaaatc attcaacgac ccccgaccct ccgacggcag gagccccccg acctcccagg 61 cggacccgct ccctccccgc gcggcgttcc gggcccggcg agaggcgcga gcacagccga 121 ggccatggag gtgacggcgg accagccgcg ctgggtgagc caccaccacc ccgccgtgct 181 caacgggcag cacccggaca cgcaccaccc gggcctcagc cactcctaca tggacgcggc 241 gcagtacccg ctgccggagg aggtggatgt gctttttaac atcgacggtc aaggcaacca 301 cgtcccgccc tactacggaa actcggtcag ggccacggtg cagaggtacc ctccgaccca 361 ccacgggagc caggtgtgcc gcccgcctct gcttcatgga tccctaccct ggctggacgg 421 cggcaaagtc ctgggcagcc accacaccgc ctccccctgg aatctcagcc ccttctccaa 481 gacgtccatc caccacggct ccccggggcc cctctccgtc taccccccgg cctcgtcctc 541 ctccttgtcg gggggccacg ccagcccgca cctcttcacc ttcccgccca ccccgccgaa 601 ggacgtctcc ccggacccat cgctgtccac cccaggctcc ggcggctcgg cccggcagga 661 cgagaaagag tgcctcaagt accaggtgcc cctgcccgac agcatgaagc tggagtcgtc 721 ccactcccgt ggcagcatga ccgccctggg tggagcctcc tcgtcgaccc accaccccat 781 caccacctac ccgccctacg tgcccgagta cagctccgga ctcttccccc ccagcagcct 841 gctgggcggc tcccccaccg gcttcggatg caagtccagg cccaaggccc ggtccagcac 901 aggcagggag tgtgtgaact gtggggcaac ctcgacccca ctgtggcggc gagatggcac 961 gggacactac ctgtgcaacg cctgcgggct ctatcacaaa atgaacggac agaaccggcc 1021 cctcattaag cccaagcgaa ggctgtctgc agccaggaga gcagggacgt cctgtgcgaa 1081 ctgtcagacc accacaacca cactctggag gaggaatgcc aatggggacc ctgtctgcaa 1141 tgcctgtggg ctctactaca agcttcacaa tattaacaga cccctgacta tgaagaagga 1201 aggcatccag accagaaacc gaaaaatgtc tagcaaatcc aaaaagtgca aaaaagtgca 1261 tgactcactg gaggacttcc ccaagaacag ctcgtttaac ccggccgccc tctccagaca 1321 catgtcctcc ctgagccaca tctcgccctt cagccactcc agccacatgc tgaccacgcc 1381 cacgccgatg cacccgccat ccagcctgtc ctttggacca caccacccct ccagcatggt 1441 caccgccatg ggttagagcc ctgctcgatg ctcac // LOCUS HSGBGASIA 1766 bp RNA PRI 21-JUL-1994 DEFINITION H.sapiens mRNA for Gal-beta(1-3/1-4)GlcNAc alpha-2.3-sialyltransferase. ACCESSION X74570 NID g414890 KEYWORDS gal beta (1-3/1-4) GlcNAc alpha-2,3 sialyltransferase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1766) AUTHORS Sasaki,K. TITLE Direct Submission JOURNAL Submitted (12-AUG-1993) K. Sasaki, Tokyo Research Laboratories,, Kyowa Hakko Kogyo Co. Ltd., 3-6-6 Asahimachi, Machidashi, Tokyo 194, JAPAN REFERENCE 2 (bases 1 to 1766) AUTHORS Sasaki,K., Watanabe,E., Kawashima,K., Sekine,S., Dohi,T., Oshima,M., Hanai,N., Nishi,T. and Hasegawa,M. TITLE Expression cloning of a novel Gal beta (1-3/1-4) GlcNAc alpha 2,3-sialyltransferase using lectin resistance selection JOURNAL J. Biol. Chem. 268 (30), 22782-22787 (1993) MEDLINE 94043042 FEATURES Location/Qualifiers source 1..1766 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="melanoma" /cell_line="WM266-4" CDS 163..1152 /codon_start=1 /product="gal beta (1-3/1-4) GlcNAc alpha-2,3 sialyltransferase" /db_xref="PID:g414891" /translation="MVSKSRWKLLAMLALVLVVMVWYSISREDSFYFPIPEKKEPCLQ GEAESKASKLFGNYSRDQPIFLRLEDYFWVKTPSAYELPYGTKGSEDLLLRVLAITSS SIPKNIQSLRCRRCVVVGNGHRLRNSSLGDAINKYDVVIRLNNAPVAGYEGDVGSKTT MRLFYPESAHFDPKVENNPDTLLVLVAFKAMDFHWIETILSDKKRVRKGFWKQPPLIW DVNPKQIRILNPFFMEIAADKLLSLPMQQPRKIKQKPTTGLLAITLALHLCDLVHIAG FGYPDAYNKKQTIHYYEQITLKSMAGSGHNVSQEALAIKRMLEMGAIKNLTSF" BASE COUNT 405 a 489 c 512 g 360 t ORIGIN 1 cggtcaggtc cagcacttgg gagctgactg tgctggaggt gacaggcttt gcggggtccg 61 cctgtgtgca ggagtcgcaa ggtcgctgag caggacccaa aggtggcccg aggcagccgg 121 gatgacagct ctccccagga atcctgctgc ctgctgagaa acatggtcag caagtcccgc 181 tggaagctcc tggccatgtt ggctctggtc ctggtcgtca tggtgtggta ttccatctcc 241 cgggaagaca gtttttattt tcccatccca gagaagaagg agccgtgcct ccagggtgag 301 gcagagagca aggcctctaa gctctttggc aactactccc gggatcagcc catcttcctg 361 cggcttgagg attatttctg ggtcaagacg ccatctgctt acgagctgcc ctatgggacc 421 aaggggagtg aggatctgct cctccgggtg ctagccatca ccagctcctc catccccaag 481 aacatccaga gcctcaggtg ccgccgctgt gtggtcgtgg ggaacgggca ccggctgcgg 541 aacagctcac tgggagatgc catcaacaag tacgatgtgg tcatcagatt gaacaatgcc 601 ccagtggctg gctatgaggg tgacgtgggc tccaagacca ccatgcgtct cttctaccct 661 gaatctgccc acttcgaccc caaagtagaa aacaacccag acacactcct cgtcctggta 721 gctttcaagg caatggactt ccactggatt gagaccatcc tgagtgataa gaagcgggtg 781 cgaaagggtt tctggaaaca gcctcccctc atctgggatg tcaatcctaa acagattcgg 841 attctcaacc ccttcttcat ggagattgca gctgacaaac tgctgagcct gccaatgcaa 901 cagccacgga agattaagca gaagcccacc acgggcctgt tggccatcac gctggccctc 961 cacctctgtg acttggtgca cattgccggc tttggctacc cagacgccta caacaagaag 1021 cagaccattc actactatga gcagatcacg ctcaagtcca tggcggggtc aggccataat 1081 gtctcccaag aggccctggc cattaagcgg atgctggaga tgggagctat caagaacctc 1141 acgtccttct gacctgggca agagctgtag cctgtcggtt gcctactctg ctgtctgggt 1201 gacccccatg cgtggctgtg ggggtggctg gtgccagtat gacccacttg gactcacccc 1261 ctcttgggga gggagttctg ggcctggcca ggtctgagat gaggccatgc ccctggctgc 1321 tcttatggag ccgagatcca gtcagggtgg gggcgctgga gccgtgggag cccggccagg 1381 gcagggggct cgtcgctgtg gcaccccctc tctgccagca ccaagagatt atttaatggg 1441 ctatttaatt aaggggtagg aaggtgctgt gggctggtcc cacacatcca ggaaagaggc 1501 cagtagagaa ttctgcccac tttttataaa aacttacagc gatggcccca ccaaggccta 1561 gacacggcac tggcctccca ggagggcagg ggcattggga atgggtgggt gccctccaga 1621 gaggggctgc tacctcccag caggcatggg aagagcactg gtgtgggggt tccaccgaga 1681 aggggacctc atctagaaaa gaggttacaa acctaccatt aaactatttt tcctaaaacg 1741 gaaaaaaaaa aaaaaaaaaa aaaaaa // LOCUS HSGBR 3088 bp RNA PRI 12-SEP-1993 DEFINITION Human liver mRNA for beta-subunit signal transducing proteins Gs/Gi (beta-G). ACCESSION X04526 NID g31667 KEYWORDS G protein; signal transducing protein; unidentified reading frame. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3088) AUTHORS Codina,J., Stengel,D., Woo,S.L. and Birnbaumer,L. TITLE Beta-subunits of the human liver Gs/Gi signal-transducing proteins and those of bovine retinal rod cell transducin are identical JOURNAL FEBS Lett. 207 (2), 187-192 (1986) MEDLINE 87030912 COMMENT Data kindly reviewed (29-APR-1987) by J. Codina. FEATURES Location/Qualifiers source 1..3088 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1..228 /note="put. ORFX (AA 1-75)" /codon_start=1 /db_xref="PID:g31668" /translation="MGGEWGAGPGVEQPPRRTGPSLAGAHLPAAPADAQRGPSATTCR AAAEAAVWARQAATRALRQIYMYWRPDQKPF" CDS 281..1303 /note="beta subunit (AA 1-340)" /codon_start=1 /db_xref="PID:g31669" /db_xref="SWISS-PROT:P04901" /translation="MSELDQLRQEAEQLKNQIRDARKACADATLSQITNNIDPVGRIQ MRTRRTLRGHLAKIYAMHWGTDSRLLVSASQDGKLIIWDSYTTNKVHAIPLRSSWVMT CAYAPSGNYVACGGLDNICSIYNLKTREGNVRVSRELAGHTGYLSCCRFLDDNQIVTS SGDTTCALWDIETGQQTTTFTGHTGDVMSLSLAPDTRLFVSGACDASAKLWDVREGMC RQTFTGHESDINAICFFPNGNAFATGSDDATCRLFDLRADQELMTYSHDNIICGITSV SFSKSGRLLLAGYDDFNCNVWDALKADRAGVLAGHDNRVSCLGVTDDGMAVATGSWDS FLKIWN" misc_feature 1592..1597 /note="pot. polyA signal" misc_feature 3066..3071 /note="pot. polyA signal" polyA_site 3088 /note="polyA site" BASE COUNT 762 a 769 c 758 g 799 t ORIGIN 1 atgggcggcg agtggggagc ggggccggga gtggagcagc cgccgcggcg gactggaccg 61 agcctcgccg gcgcgcacct gcccgcagcg cccgcggacg cgcagcgcgg cccgagcgcg 121 acgacctgcc gagcggcggc cgaggcggcg gtgtgggcgc gtcaggccgc gacgagggcg 181 ctgagacaaa tttacatgta ttggagacca gaccagaagc ccttctgaat taagatctca 241 cattcttgaa ggtggcattg aagagcacta agatcggaag atgagtgagc ttgaccagtt 301 acggcaggag gccgagcaac ttaagaacca gattcgagac gccaggaaag catgtgcaga 361 tgcaactctc tctcagatca caaacaacat cgacccagtg ggaagaatcc aaatgcgcac 421 gaggaggaca ctgcgggggc acctggccaa gatctacgcc atgcactggg gcacagactc 481 caggcttctc gtcagtgcct cgcaggatgg taaacttatc atctgggaca gctacaccac 541 caacaaggtc cacgccatcc ctctgcgctc ctcctgggtc atgacctgtg catatgcccc 601 ttctgggaac tatgtggcct gcggtggcct ggataacatt tgctccattt acaatctgaa 661 aactcgtgag gggaacgtgc gcgtgagtcg tgagctggca ggacacacag gttacctgtc 721 ctgctgccga ttcctggatg acaatcagat cgtcaccagc tctggagaca ccacgtgtgc 781 cctgtgggac atcgagaccg gccagcagac gaccacgttt accggacaca ctggagatgt 841 catgagcctt tctcttgctc ctgacaccag actgttcgtc tctggtgctt gtgatgcttc 901 agccaaactc tgggatgtgc gagaaggcat gtgccggcag accttcactg gccacgagtc 961 tgacatcaat gccatatgct tctttccaaa tggcaatgca tttgccactg gctcagacga 1021 cgccacctgc aggctgtttg accttcgtgc tgaccaggag ctcatgactt actcccatga 1081 caacatcatc tgcgggatca cctctgtctc cttctccaag agcgggcgcc tcctccttgc 1141 tgggtacgac gacttcaact gcaacgtctg ggatgcactc aaagccgacc gggcaggtgt 1201 cttggctggg catgacaacc gcgtcagctg cctgggcgtg actgacgatg gcatggctgt 1261 ggcgacaggg tcctgggata gcttcctcaa gatctggaac taacgccagt agcatgtgga 1321 tgccatggag actggaagac cattccaact tggacgcgtt accatgagag ccaaccgtac 1381 taacgtgaca accctacacc tcccctcaga acttcaaaag ggcaagatct tttttccttc 1441 acttattgct catatcctat gaaaccaaga gcacaattcc cattgagaga aagatctctg 1501 tgctgtaaac taaaacaaat tgtgcattcc ttccggggcc atcgtctttg ttttcttttt 1561 tgtcttgaat gaattttaaa aggaaatata taataaaaat gttaaccaga aggtaaactt 1621 gagtgtaatt gtcagacaga cacacttttc caccagtgta tttgaatttt agaccagtga 1681 ccctgttttg tggcattcat gcaaaacatg ctgagggctt tgttcatctg gtcatcgtgt 1741 ccaaatttca gtcatgtttg tagcaagatt ttggaagcat tcatatttcc tttttaaaat 1801 gtattccttt gtgttcaaca gttaatcaaa accagagagt ctagggcagc ctctctgatg 1861 ttgtcaatga tgtaaattca gtccctggtt tttaattttc tgtctgatgt cacagatcat 1921 tgttgcacac aaacgtggca tagaaaagaa catgttcaga agccatgggg ccaagcacaa 1981 tgcggggacg gtctcaaaat gcgtgatcag agaatccttc accttatgct gaaaagtgag 2041 ctcagatcca cctccaatgt tcctcctgac ccatcctgtc tatcttctca gttgagtttt 2101 taatctcact ttgggtttcc ttgtgaagtt ggagggaagt ttataatagc ctaacactac 2161 cccaccccca actaggagga acctctgttt tcaagagaga tgcctgtcct gtgcttggat 2221 agtcagtcaa ttatttgtgt atgaaacaat gtacaaatca atgttttgaa aataatgatc 2281 tcagactttc taagttaaag ttttaaaaat tttgattgtt tgccatattg ggtgggttta 2341 ctcttagaat cgcatgctgt agaaaatgct caaaagtgca tatgggactc agtccttagg 2401 tgttcttttt cttttaagaa ataacctctt acagttgtaa ccattgcggc tctgtccact 2461 tctcgttgct gctctgtggc acatatcgga agcagtacag cgcgcggctc tacacgcttg 2521 ggtagcggga taagtcactg ttttctttat ttctttaaaa aaaaaaaaag ttctgttgca 2581 aacgactgct gttggattct gagggtgggg agggagagag agggagggag agggagtgaa 2641 gagcctgccc tcctatagtg gattcttcac gggccctcca catctgaggt ggctcattcc 2701 catcacacac agattgtcct ggtgttcatt tcaaggccag ttgtcagcag cagcgtttgg 2761 aaagcaggtt ctgtgggacc ccccgccccg ccccccgcac tccttcatag cagcagtagt 2821 ggcttctcca tcctgttttc tgcaacattc tatacaaaac tgtgctgtga ccttgcggta 2881 ggcctggatc tggcaaagag aatacaaatg aaaccccttc tttctctttc cgtccaacaa 2941 ctctgtagag ctctctgcac ccttacccct ttccaccttt tgtatttaat tttaaagtca 3001 gtgtactgca aggaagctgg atgcaagata gatactatat taaactgtac tgttatttaa 3061 gatgtaataa agcagtttga catgaggg // LOCUS HSGCA2 2954 bp RNA PRI 06-APR-1992 DEFINITION H.Sapiens mRNA for alpha2-subunit of soluble guanylyl cyclase. ACCESSION X63282 NID g31670 KEYWORDS guanylyl cyclase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2954) AUTHORS Harteneck,C., Wedel,B., Koesling,D., Malkewitz,J., Bohme,E. and Schultz,G. TITLE Molecular cloning and expression of a new alpha-subunit of soluble guanylyl cyclase. Interchangeability of the alpha-subunits of the enzyme JOURNAL FEBS Lett. 292 (1-2), 217-222 (1991) MEDLINE 92070494 REFERENCE 2 (bases 1 to 2954) AUTHORS Harteneck,C. TITLE Direct Submission JOURNAL Submitted (30-MAR-1992) C. Harteneck, Institut fuer Pharmakologie, Freie Universitaet Berlin, Thielallee 69-73, 1000 Berlin 33, FRG FEATURES Location/Qualifiers source 1..2954 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="brain" CDS 391..2589 /EC_number="4.6.1.2" /codon_start=1 /product="alpha2-subunit of soluble guanylyl cyclase" /db_xref="PID:g31671" /db_xref="SWISS-PROT:P33402" /translation="MSRRKISSESFSSLGSDYLETSPEEEGECPLSRLCWNGSRSPPG PLEPSPAAAAAAAAPAPTPAASAAAAAATAGARRVQRRRRVNLDSLGESISRLTAPSP QTIQQTLKRTLQYYEHQVIGYRDAEKNFHNISNRCSYADHSNKEEIEDVSGILQCTAN ILGLKFEEIQKRFGEEFFNICFHENERVLRAVGGTLQDFFNGFDALLEHIRTSFGKQA TLESPSFLCKELPEGTLMLHYFHPHHIVGFAMLGMIKAAGKKIYRLDVEVEQVANEKL CSDVSNPGNCSCLTFLIKECENTNIMKNLPQGTSQVPADLRISINTFCRAFPFHLMFD PSMSVLQLGEGLRKQLRCDTHKVLKFEDCFEIVSPKVNATFERVLLRLSTPFVIRTKP EASGSENKDKVMEVKGQMIHVPESNSILFLGSPCVDKLDELMGRGLHLSDIPIHDATR DVILVGEQAKAQDGLKKRMDKLKATLERTHQALEEEKKKTVDLLYSIFPGDVAQQLWQ GQQVQARKFDDVTMLFSDIVGFTAICAQCTPMQVISMLNELYTRFDHQCGFLDIYKVE TIGDAYCVAAGLHRKSLCHAKPIALMALKMMELSEEVLTPDGRPIQMRIGIHSGSVLA GVVGVRMPRYCLFGNNVTLASKFESGSHPRRINVSPTTYQLLKREESFTFIPRSREEL PDNFPKEIPGICYFLEVRTGPKPPKPSLSSSRIKKVSYNIGTMFLRETSL" BASE COUNT 735 a 779 c 760 g 680 t ORIGIN 1 tgggccgcag ccctccccgc cccgccgacc gcggtcacac actctcggag cctccccgtg 61 agcgggagcg cggcgcacgg cgatgcgccg aggcgggcgc tgaggcggcg ccgcgcgagc 121 agcagcagag gcggcggcgg cccccagccc agcccggcgc cgccgccgag cccgggcccc 181 aaggtgcggc ggcgccccaa gttcccgcca tgagcagccg gctcgggggg ctccgcggcc 241 ccggggactc ccgccccgcc gggcgcgacc gcagcgcccc gcggccccga cgcgcttaac 301 gttgtcgctt gccggtcccg ccaccgccgc ctccgccgcc gctcgcgtcc tcgccgccac 361 cgcctcggcc gctgcagctc cgccggcagc atgtctcgaa ggaagatttc gtccgagtcc 421 ttcagctccc tgggctccga ctacctggag accagcccgg aggaggaggg ggagtgcccc 481 ctgtctaggc tctgctggaa tggcagccgg agcccgcccg ggccgctgga gcccagcccg 541 gccgcagctg ccgctgccgc cgccccggcc ccgaccccgg ctgcttctgc cgccgccgcc 601 gctgccactg ccggggccag gagggtgcag cgccggaggc gggtcaacct ggactcgctg 661 ggcgagagca tcagccgcct gacggcgccc tcgcctcaga cgatacagca gactctcaag 721 aggacactgc agtattatga acatcaagtt attggttaca gggatgcaga aaagaatttc 781 cacaatatct ctaacagatg ctcctatgca gaccactcca acaaagaaga aattgaagat 841 gtctcaggaa ttcttcagtg tactgctaat atactcggtt tgaagtttga ggaaattcaa 901 aaaagatttg gtgaagagtt ctttaatata tgctttcatg agaatgagag agtccttcga 961 gctgtaggtg gcactttgca ggactttttt aacggctttg atgctttgtt ggaacacatt 1021 agaacttctt ttggaaaaca ggccactctg gagtcaccat ctttcctatg caaagagctc 1081 cctgaaggta ctctcatgct ccactacttc caccctcacc atattgtggg gtttgcaatg 1141 ctggggatga ttaaggctgc aggaaagaag atctatcggc tggatgtgga agtggaacag 1201 gttgcaaatg agaagctatg ctctgatgtt tcaaacccag gcaattgtag ctgtcttact 1261 ttccttatca aagaatgtga aaatactaat atcatgaaga accttccaca gggaacctcc 1321 caagttcctg cggacctcag aattagcatc aacaccttct gtagagcctt ccctttccac 1381 ttgatgtttg atcccagcat gtcagtcctt cagttggggg aaggtctaag gaagcagctt 1441 cgatgtgaca ctcacaaagt gctcaagttt gaggactgct tcgagattgt atctccaaag 1501 gttaatgcca cctttgaaag ggtcctgctg cgactgtcta ccccgtttgt gattagaacc 1561 aagcctgagg cttctggctc tgaaaataaa gacaaggtga tggaagtcaa aggacaaatg 1621 atccatgttc cagaatcaaa ttccatttta tttttgggct ctccatgtgt ggacaagttg 1681 gatgaactca tgggccgagg gctacatctc tcagacatcc ctatccatga tgccacccga 1741 gatgtcattt tggttggtga gcaggcaaag gcccaagatg ggttgaagaa aaggatggat 1801 aaattaaagg caactttaga aagaactcac caggccctgg aagaagagaa aaagaagaca 1861 gtggatcttc tatattctat tttccctggt gatgtagccc agcaattatg gcaagggcag 1921 caagtacagg ccagaaagtt tgatgatgtc accatgctct tttcagacat tgttggcttc 1981 acagccatat gtgcccagtg tactcccatg caagtaatca gcatgctgaa tgaactgtac 2041 accagatttg accaccagtg tggatttttg gatatttata aggtggaaac aataggtgat 2101 gcctactgtg ttgcagcagg gctccacaga aaaagcctct gccatgctaa acccattgct 2161 ctgatggcct tgaagatgat ggaactttca gaagaggtgc tgacacctga tggaagaccg 2221 attcagatga ggataggaat tcactcaggc tccgtgctgg ctggagttgt tggggtgcga 2281 atgccacgtt attgcctgtt tggaaataat gtcacactgg caagcaaatt cgagtcggga 2341 agtcaccctc ggcgcatcaa tgtcagccca accacttacc aattattaaa acgagaagaa 2401 agtttcacat tcattccgcg gtctcgtgaa gagcttccag acaactttcc aaaggaaatt 2461 cctgggatct gctatttcct ggaggtaagg actggtccaa agccaccaaa gccttctctt 2521 tcttcgtcga gaataaaaaa ggtttcctac aacatcggca ccatgttcct ccgggagaca 2581 agcctctgag acctgctaca gatcaaagac tcctccaaaa agcacaagcc cagaacatgg 2641 gtcaccaatg gggggtggaa agagattgtg tctctttcat tgctttgttg agaacaagca 2701 gcaaaatttc tgtattatgt caggcaataa tcctactaaa aggtggaggt gaccgctgtc 2761 aataaaaagc cggaggatga gggaaataag atgtgtccat tcatatgagt ggttttggtc 2821 atatatatac acatatattt taattacaag tgtgggtccc ctttcagaac taaccaataa 2881 atagattcca tgttttcttg tttatcacac atacaagtat ctttccctat atatttgtac 2941 cacttttgag agcc // LOCUS HSGCI 330 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for glycophorin C isoform. ACCESSION X51973 NID g31672 KEYWORDS glycophorin; glycophorin C; isoenzyme; transmembrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 330) AUTHORS Le Van Kim,C. TITLE Direct Submission JOURNAL Submitted (26-FEB-1990) Le Van Kim C., Inserm U76, INTS, 6 rue Alexandre Cabanel, 75015 Paris, France REFERENCE 2 (bases 1 to 330) AUTHORS Le Van Kim,C., Mitjavila,M.T., Clerget,M., Cartron,J.P. and Colin,Y. TITLE An ubiquitous isoform of glycophorin C is produced by alternative splicing JOURNAL Nucleic Acids Res. 18 (10), 3076 (1990) MEDLINE 90272439 COMMENT See and for overlapping sequences. Data kindly reviewed (02-JUL-1990) by Le Van Kim C. FEATURES Location/Qualifiers source 1..330 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="spleen" /cell_type="erythroblast" /cell_line="K 562, Meg 01, Jurkat" /chromosome="2" /map="q14-q21" CDS 1..330 /note="glycophorin C isoform (AA 1-109)" /codon_start=1 /db_xref="PID:g31673" /db_xref="SWISS-PROT:P04921" /translation="MWSTRSPNSTAWPLSLEPDPGMSGWPDGRMETSTPTIMDIVVIA GVIAAVAIVLVSLLFVMLRYMYRHKGTYHTNEAKGTEFAESADAALQGDPALQDAGDS SRKEYFI" BASE COUNT 73 a 94 c 95 g 68 t ORIGIN 1 atgtggtcga cgagaagccc caacagcacg gcgtggcctc tcagcctcga gcctgatcca 61 gggatgtctg gatggccgga tggcagaatg gagacctcca cccccaccat aatggacatt 121 gtcgtcattg caggtgtgat tgctgctgtg gccatcgtcc tagtctccct cctcttcgtc 181 atgctgcgct acatgtaccg gcacaagggc acgtaccaca ccaatgaggc caagggcacg 241 gagtttgctg agagtgcaga tgcagccctg cagggagacc ctgccctcca agatgctggt 301 gatagcagca gaaaggagta ctttatttga // LOCUS HSGCIQR 1139 bp RNA PRI 28-SEP-1994 DEFINITION H.sapiens mRNA for gCIq-R. ACCESSION X75913 NID g472955 KEYWORDS cell surface glycoprotein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1139) AUTHORS Lim,B. TITLE Direct Submission JOURNAL Submitted (05-NOV-1993) B. Lim, MRC Immunochemistry Unit, Dept. of Biochemistry, Univ. of Oxford, South Parks Rd., Oxford, OX1 3QU, UK REFERENCE 2 (bases 1 to 1139) AUTHORS Ghebrehiwet,B., Lim,B.L., Peerschke,E.I., Willis,A.C. and Reid,K.B. TITLE Isolation, cDNA cloning, and overexpression of a 33-kD cell surface glycoprotein that binds to the globular 'heads' of C1q JOURNAL J. Exp. Med. 179 (6), 1809-1821 (1994) MEDLINE 94253723 FEATURES Location/Qualifiers source 1..1139 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="B cell" /clone_lib="lambda gt11" gene 79..927 /gene="gCIq-R" CDS 79..927 /gene="gCIq-R" /codon_start=1 /product="gCIq-R" /db_xref="PID:g472956" /db_xref="SWISS-PROT:Q07021" /translation="MLPLLRCVPRVLGSSVAGLRAAAPASPFRQLLQPAPRLCTRPFG LLSVRAGSERRPGLLRPRGPCACGCGCGSLHTDGDKAFVDFLSDEIKEERKIQKHKTL PKMSGGWELELNGTEAKLVRKVAGEKITVTFNINNSIPPTFDGEEEPSQGQKVEEQEP ELTSTPNFVVEVIKNDDGKKALVLDCHYPEDEVGQEDEAESDIFSIREVSFQSTGESE WKDTNYTLNTDSLDWALYDHLMDFLADRGVDNTFADELVELSTALEHQEYITFLEDLK SFVKSQ" BASE COUNT 274 a 279 c 306 g 280 t ORIGIN 1 ccggcggcgc ctcaggtcgc ggggcgccta ggcctgggtt gtccttcgca tctgcacgtg 61 ttcgcagtcg tttccgcgat gctgcctctg ctgcgctgcg tgccccgtgt gctgggctcc 121 tccgtcgccg gcctccgcgc tgccgcgccc gcctcgcctt tccggcagct cctgcagccg 181 gcaccccggc tgtgcacccg gcccttcggg ctgctcagcg tgcgcgcagg ttccgagcgg 241 cggccgggcc tcctgcggcc tcgcggaccc tgcgcctgtg gctgtggctg cggctcgctg 301 cacaccgacg gagacaaagc ttttgttgat ttcctgagtg atgaaattaa ggaggaaaga 361 aaaattcaga agcataaaac cctccctaag atgtctggag gttgggagct ggaactgaat 421 gggacagaag cgaaattagt gcggaaagtt gccggggaaa aaatcacggt cactttcaac 481 attaacaaca gcatcccacc aacatttgat ggtgaggagg aaccctcgca agggcagaag 541 gttgaagaac aggagcctga actgacatca actcccaatt tcgtggttga agttataaag 601 aatgatgatg gcaagaaggc ccttgtgttg gactgtcatt atccagagga tgaggttgga 661 caagaagacg aggctgagag tgacatcttc tctatcaggg aagttagctt tcagtccact 721 ggcgagtctg aatggaagga tactaattat acactcaaca cagattcctt ggactgggcc 781 ttatatgacc acctaatgga tttccttgcc gaccgagggg tggacaacac ttttgcagat 841 gagctggtgg agctcagcac agccctggag caccaggagt acattacttt tcttgaagac 901 ctcaagagtt ttgtcaagag ccagtagagc agacagatgc tgaaagccat agtttcatgg 961 caggctttgg ccagtgaaca aatcctactc tgaagctaga catgtgcttt gaaatgatta 1021 tcatcctaat atcatggggg aaaaaatacc aaatttaaat tatatgtttt gtgttctcat 1081 ttattatcat ttttttctgt acaaatctat tatttctaga tttttgtata acatgatag // LOCUS HSGCKR 2194 bp RNA PRI 20-MAY-1996 DEFINITION H.sapiens GCKR mRNA for glucokinase regulator. ACCESSION Z48475 NID g683571 KEYWORDS GCKR gene; glucokinase regulator. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2194) AUTHORS Warner,J.P., Leek,J.P., Intody,S., Markham,A.F. and Bonthron,D.T. TITLE Human glucokinase regulatory protein (GCKR): cDNA and genomic cloning, complete primary structure, and chromosomal localization JOURNAL Mamm. Genome 6 (8), 532-536 (1995) MEDLINE 96014291 REFERENCE 2 (bases 1 to 2194) AUTHORS Bonthron,D.T. TITLE Direct Submission JOURNAL Submitted (23-FEB-1995) Bonthron D. T., University of Edinburgh, Human Genetics Unit, Western General Hospital, Edinburgh, U.K., EH4 2XU FEATURES Location/Qualifiers source 1..2194 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="phGCKR67" /tissue_type="hepatoblastoma" /cell_type="hepatocyte" /cell_line="HepG2" /clone_lib="HepG2 cDNA" /chromosome="2" /map="2p22-p23" gene 64..1941 /gene="GCKR" CDS 64..1941 /gene="GCKR" /codon_start=1 /product="glucokinase regulator" /db_xref="PID:g683572" /translation="MPGTKRFQHVIETPEPGKWELSGYEAAVPITEKSNPLTQDLDKA DAENIVRLLGQCDAEIFQEEGQALSTYQRLYSESILTTMVQVAGKVQEVLKEPDGGLV VLSGGGTSGRMAFLMSVSFNQLMKGLGQKPLYTYLIAGGDRSVVASREGTEDSALHGI EELKKVAAGKKRVIVIGISVGLSAPFVAGQMDCCMNNTAVFLPVLVGFNPVSMARNDP IEDWSSTFRQVAERMQKMQEKQKAFVLNPAIGPEGLSGSSRMKGGSATKILLETLLLA AHKTVDQGIAASQRCLLEILRTFERAHQVTYSQSPKIATLMKSVSTSLEKKGHVYLVG WQTLGIIAIMDGVECIHTFGADFRDVRGFLIGDHSDMFNQKAELTNQGPQFTFSQEDF PTSILPSLTEIDTVVFIFTLDDNLTEVQTIVEQVKEKTNHIQALAHSTVGQTLPIPLK KLFPSIISITWPLLFFEYEGNFIQKFQRELSTKWVLNTVSTGAHVLLGKILQNHMLDL RISNSKLFWRALAMLQRFSGQSKARCIESLLRAIHFPQPLSDDIRAAPISCRVQVAHE KEQVIPIALLSLLFRCSITEAQAHLAAAPSVCEAVRSALAGPGQKRTADPLEILEPDV Q" polyA_site 2184 BASE COUNT 525 a 570 c 607 g 492 t ORIGIN 1 gtgaccagag gggtttgtgt ggctgaagag gcaggaggaa cagtgtatcc acagcgtggg 61 accatgccag gcacaaaacg gtttcaacat gtcattgaga ccccggagcc tggcaagtgg 121 gagttgtctg ggtacgaggc agctgtgcca atcacggaga agtcaaaccc actgacccag 181 gatctagaca aagcagatgc tgagaacatt gttcgactgc tagggcaatg tgatgctgag 241 atcttccagg aggaggggca agccctgtcc acataccaga gactctacag cgaatccatt 301 ctgaccacca tggtacaggt ggctgggaaa gttcaggaag tgctgaagga gccagatggg 361 gggctggttg tgctgagtgg agggggcacc tctggccgga tggcattcct catgtcggtg 421 tcctttaatc agctgatgaa aggtctggga cagaaacctc tttacaccta cctcattgca 481 ggtggtgaca ggtctgtggt ggcctctagg gaggggacag aagatagtgc cttgcacggg 541 attgaggaac tgaagaaggt ggctgccggg aagaagagag tgattgtcat tggcatttct 601 gtgggactct ctgctccctt tgtggcaggc cagatggact gctgcatgaa caacacagct 661 gtcttcttgc cagtcctggt tggcttcaat ccagtgagca tggccagaaa tgaccccatt 721 gaagactgga gttcaacatt ccgacaagta gcagagcgga tgcagaaaat gcaggagaaa 781 cagaaagctt ttgtgctcaa tcctgccatc gggcccgagg gtctcagcgg ctcctcccgg 841 atgaaaggtg gaagtgccac caagattctg ctggaaaccc tgttattagc agcccataag 901 actgtggacc agggcattgc agcatctcaa agatgcctcc tggaaatctt gcggacattt 961 gagcgagctc atcaggtgac ctacagccaa agccccaaga ttgccaccct gatgaagagt 1021 gtcagcacca gtctggagaa gaaaggccac gtgtacctgg ttggctggca gaccctgggt 1081 atcattgcca tcatggatgg agtagagtgc atccacacct ttggtgctga tttccgagat 1141 gtccgtggct ttctcattgg tgatcacagt gacatgttta accagaaggc tgagctcacc 1201 aaccagggtc cccagttcac cttctcccag gaggacttcc cgacttccat ccttccctct 1261 ctcacggaaa tcgatactgt ggtcttcatt ttcaccctgg atgacaacct cacggaggtg 1321 cagactatag tggagcaggt gaaagagaag accaaccaca tccaggccct ggcacacagc 1381 accgtgggtc agaccttgcc gatccctctg aagaagctct ttccctccat catcagcatc 1441 acatggccac tgcttttctt tgaatatgaa gggaacttca tccagaagtt ccagcgtgag 1501 ctaagcacca aatgggtgct gaatacagtg agtacaggtg ctcatgtgct tcttggtaag 1561 atcctacaaa accacatgtt ggaccttcgg attagcaact ccaagctctt ctggcgggcg 1621 ctggccatgc tgcagcggtt ctctggacag tccaaggctc gatgcatcga gagcctcctc 1681 cgagcgatcc actttcccca gccactgtca gatgatattc gggctgctcc catctcctgc 1741 cgtgtccagg ttgcacatga gaaggaacag gtgataccca tcgccttgct gagcctccta 1801 ttccggtgct cgatcactga ggctcaggca cacctggctg cagctccttc tgtctgtgag 1861 gctgtcagga gtgctcttgc tgggccaggt cagaagcgca ctgcggaccc cctcgagatc 1921 ctagagcctg acgttcagtg aacccatgtt tctgggtggg tgaaaggggc ccaaccctgc 1981 ccacttcagc ccagcccgcc caaggggact tgtgccagca gaacatgtgg gaggaagaag 2041 ccccgtttcc agggcatccg cagcccaggg tagggagaaa tattctctcc actttggggg 2101 agagttcttg ctctcgacct agtggtttct actctcaccg acttattctg atttcagaaa 2161 taaaatgaaa tgtcttattt tggaaaaaaa aaaa // LOCUS HSGCRAR 4788 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for alpha-glucocorticoid receptor (clone OB7). ACCESSION X03225 M10901 NID g31679 KEYWORDS glucocorticoid receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4788) AUTHORS Hollenberg,S.M., Weinberger,C., Ong,E.S., Cerelli,G., Oro,A., Lebo,R., Thompson,E.B., Rosenfeld,M.G. and Evans,R.M. TITLE Primary structure and expression of a functional human glucocorticoid receptor cDNA JOURNAL Nature 318 (6047), 635-641 (1985) MEDLINE 86092206 COMMENT About 500 bp of the 5' region were derived from clone OB10 which represents the beta-glucocorticoid receptor (see X03348). FEATURES Location/Qualifiers source 1..4788 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 133..2466 /note="(aa 1-777)" /codon_start=1 /product="alpha-glucocorticoid receptor" /db_xref="PID:g31680" /db_xref="SWISS-PROT:P04150" /translation="MDSKESLTPGREENPSSVLAQERGDVMDFYKTLRGGATVKVSAS SPSLAVASQSDSKQRRLLVDFPKGSVSNAQQPDLSKAVSLSMGLYMGETETKVMGNDL GFPQQGQISLSSGETDLKLLEESIANLNRSTSVPENPKSSASTAVSAAPTEKEFPKTH SDVSSEQQHLKGQTGTNGGNVKLYTTDQSTFDILQDLEFSSGSPGKETNESPWRSDLL IDENCLLSPLAGEDDSFLLEGNSNEDCKPLILPDTKPKIKDNGDLVLSSPSNVTLPQV KTEKEDFIELCTPGVIKQEKLGTVYCQASFPGANIIGNKMSAISVHGVSTSGGQMYHY DMNTASLSQQQDQKPIFNVIPPIPVGSENWNRCQGSGDDNLTSLGTLNFPGRTVFSNG YSSPSMRPDVSSPPSSSSTATTGPPPKLCLVCSDEASGCHYGVLTCGSCKVFFKRAVE GQHNYLCAGRNDCIIDKIRRKNCPACRYRKCLQAGMNLEARKTKKKIKGIQQATTGVS QETSENPGNKTIVPATLPQLTPTLVSLLEVIEPEVLYAGYDSSVPDSTWRIMTTLNML GGRQVIAAVKWAKAIPGFRNLHLDDQMTLLQYSWMFLMAFALGWRSYRQSSANLLCFA PDLIINEQRMTLPCMYDQCKHMLYVSSELHRLQVSYEEYLCMKTLLLLSSVPKDGLKS QELFDEIRMTYIKELGKAIVKREGNSSQNWQRFYQLTKLLDSMHEVVENLLNYCFQTF LDKTMSIEFPEMLAEIITNQIPKYSNGNIKKLLFHQK" misc_feature 3101..3106 /note="pot. polyA signal" misc_feature 4679..4684 /note="pot. polyA signal" misc_feature 4762..4767 /note="pot. polyA signal" polyA_site 4788 BASE COUNT 1471 a 939 c 970 g 1408 t ORIGIN 1 tttttagaaa aaaaaaatat atttccctcc tgctccttct gcgttcacaa gctaagttgt 61 ttatctcggc tgcggcggga actgcggacg gtggcgggcg agcggctcct ctgccagagt 121 tgatattcac tgatggactc caaagaatca ttaactcctg gtagagaaga aaaccccagc 181 agtgtgcttg ctcaggagag gggagatgtg atggacttct ataaaaccct aagaggagga 241 gctactgtga aggtttctgc gtcttcaccc tcactggctg tcgcttctca atcagactcc 301 aagcagcgaa gacttttggt tgattttcca aaaggctcag taagcaatgc gcagcagcca 361 gatctgtcca aagcagtttc actctcaatg ggactgtata tgggagagac agaaacaaaa 421 gtgatgggaa atgacctggg attcccacag cagggccaaa tcagcctttc ctcgggggaa 481 acagacttaa agcttttgga agaaagcatt gcaaacctca ataggtcgac cagtgttcca 541 gagaacccca agagttcagc atccactgct gtgtctgctg cccccacaga gaaggagttt 601 ccaaaaactc actctgatgt atcttcagaa cagcaacatt tgaagggcca gactggcacc 661 aacggtggca atgtgaaatt gtataccaca gaccaaagca cctttgacat tttgcaggat 721 ttggagtttt cttctgggtc cccaggtaaa gagacgaatg agagtccttg gagatcagac 781 ctgttgatag atgaaaactg tttgctttct cctctggcgg gagaagacga ttcattcctt 841 ttggaaggaa actcgaatga ggactgcaag cctctcattt taccggacac taaacccaaa 901 attaaggata atggagatct ggttttgtca agccccagta atgtaacact gccccaagtg 961 aaaacagaaa aagaagattt catcgaactc tgcacccctg gggtaattaa gcaagagaaa 1021 ctgggcacag tttactgtca ggcaagcttt cctggagcaa atataattgg taataaaatg 1081 tctgccattt ctgttcatgg tgtgagtacc tctggaggac agatgtacca ctatgacatg 1141 aatacagcat ccctttctca acagcaggat cagaagccta tttttaatgt cattccacca 1201 attcccgttg gttccgaaaa ttggaatagg tgccaaggat ctggagatga caacttgact 1261 tctctgggga ctctgaactt ccctggtcga acagtttttt ctaatggcta ttcaagcccc 1321 agcatgagac cagatgtaag ctctcctcca tccagctcct caacagcaac aacaggacca 1381 cctcccaaac tctgcctggt gtgctctgat gaagcttcag gatgtcatta tggagtctta 1441 acttgtggaa gctgtaaagt tttcttcaaa agagcagtgg aaggacagca caattaccta 1501 tgtgctggaa ggaatgattg catcatcgat aaaattcgaa gaaaaaactg cccagcatgc 1561 cgctatcgaa aatgtcttca ggctggaatg aacctggaag ctcgaaaaac aaagaaaaaa 1621 ataaaaggaa ttcagcaggc cactacagga gtctcacaag aaacctctga aaatcctggt 1681 aacaaaacaa tagttcctgc aacgttacca caactcaccc ctaccctggt gtcactgttg 1741 gaggttattg aacctgaagt gttatatgca ggatatgata gctctgttcc agactcaact 1801 tggaggatca tgactacgct caacatgtta ggagggcggc aagtgattgc agcagtgaaa 1861 tgggcaaagg caataccagg tttcaggaac ttacacctgg atgaccaaat gaccctactg 1921 cagtactcct ggatgtttct tatggcattt gctctggggt ggagatcata tagacaatca 1981 agtgcaaacc tgctgtgttt tgctcctgat ctgattatta atgagcagag aatgactcta 2041 ccctgcatgt acgaccaatg taaacacatg ctgtatgttt cctctgagtt acacaggctt 2101 caggtatctt atgaagagta tctctgtatg aaaaccttac tgcttctctc ttcagttcct 2161 aaggacggtc tgaagagcca agagctattt gatgaaatta gaatgaccta catcaaagag 2221 ctaggaaaag ccattgtcaa gagggaagga aactccagcc agaactggca gcggttttat 2281 caactgacaa aactcttgga ttctatgcat gaagtggttg aaaatctcct taactattgc 2341 ttccaaacat ttttggataa gaccatgagt attgaattcc ccgagatgtt agctgaaatc 2401 atcaccaatc agataccaaa atattcaaat ggaaatatca aaaaacttct gtttcatcaa 2461 aagtgactgc cttaataaga atggttgcct taaagaaagt cgaattaata gcttttattg 2521 tataaactat cagtttgtcc tgtagaggtt ttgttgtttt attttttatt gttttcatct 2581 gttgttttgt tttaaatacg cactacatgt ggtttataga gggccaagac ttggcaacag 2641 aagcagttga gtcgtcatca cttttcagtg atgggagagt agatggtgaa atttattagt 2701 taatatatcc cagaaattag aaaccttaat atgtggacgt aatctccaca gtcaaagaag 2761 gatggcacct aaaccaccag tgcccaaagt ctgtgtgatg aactttctct tcatactttt 2821 tttcacagtt ggctggatga aattttctag actttctgtt ggtgtatccc ccccctgtat 2881 agttaggata gcatttttga tttatgcatg gaaacctgaa aaaaagttta caagtgtata 2941 tcagaaaagg gaagttgtgc cttttatagc tattactgtc tggttttaac aatttccttt 3001 atatttagtg aactacgctt gctcattttt tcttacataa ttttttattc aagttattgt 3061 acagctgttt aagatgggca gctagttcgt agctttccca aataaactct aaacattaat 3121 caatcatctg tgtgaaaatg ggttggtgct tctaacctga tggcacttag ctatcagaag 3181 accacaaaaa ttgactcaaa tctccagtat tcttgtcaaa aaaaaaaaaa aaaaagctca 3241 tattttgtat atatctgctt cagtggagaa ttatataggt tgtgcaaatt aacagtccta 3301 actggtatag agcacctagt ccagtgacct gctgggtaaa ctgtggatga tggttgcaaa 3361 agactaattt aaaaaataac taccaagagg ccctgtctgt acctaacgcc ctatttttgc 3421 aatggctata tggcaagaaa gctggtaaac tatttgtctt tcaggacctt ttgaagtagt 3481 ttgtataact tcttaaaagt tgtgattcca gataaccagc tgtaacacag ctgagagact 3541 tttaatcaga caaagtaatt cctctcacta aactttaccc aaaaactaaa tctctaatat 3601 ggcaaaaatg gctagacacc cattttcaca ttcccatctg tcaccaattg gttaatcttt 3661 cctgatggta caggaaagct cagctactga tttttgtgat ttagaactgt atgtcagaca 3721 tccatgtttg taaaactaca catccctaat gtgtgccata gagtttaaca caagtcctgt 3781 gaatttcttc actgttgaaa attattttaa acaaaataga agctgtagta gccctttctg 3841 tgtgcacctt accaactttc tgtaaactca aaacttaaca tatttactaa gccacaagaa 3901 atttgatttc tattcaaggt ggccaaatta tttgtgtaat agaaaactga aaatctaata 3961 ttaaaaatat ggaacttcta atatattttt atatttagtt atagtttcag atatatatca 4021 tattggtatt cactaatctg ggaagggaag ggctactgca gctttacatg caatttatta 4081 aaatgattgt aaaatagctt gtatagtgta aaataagaat gatttttaga tgagattgtt 4141 ttatcatgac atgttatata ttttttgtag gggtcaaaga aatgctgatg gataacctat 4201 atgatttata gtttgtacat gcattcatac aggcagcgat ggtctcagaa accaaacagt 4261 ttgctctagg ggaagaggga gatggagact ggtcctgtgt gcagtgaagg ttgctgaggc 4321 tctgacccag tgagattaca gaggaagtta tcctctgcct cccattctga ccacccttct 4381 cattccaaca gtgagtctgt cagcgcaggt ttagtttact caatctcccc ttgcactaaa 4441 gtatgtaaag tatgtaaaca ggagacagga aggtggtgct tacatcctta aaggcaccat 4501 ctaatagcgg gttactttca catacagccc tcccccagca gttgaatgac aacagaagct 4561 tcagaagttt ggcaatagtt tgcatagagg taccagcaat atgtaaatag tgcagaatct 4621 cataggttgc caataataca ctaattcctt tctatcctac aacaagagtt tatttccaaa 4681 taaaatgagg acatgttttt gttttctttg aatgcttttt gaatgttatt tgttattttc 4741 agtattttgg agaaattatt taataaaaaa acaatcattt gctttttg // LOCUS HSGCSAA 3004 bp RNA PRI 16-OCT-1992 DEFINITION H.sapiens soluble guanylate cyclase large subunit mRNA. ACCESSION X66534 NID g31683 KEYWORDS cytoplasmic protein; GTP pyrophosphate-lyase; Guanylate cyclase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3004) AUTHORS Giuili,G., Scholl,U., Bulle,F. and Guellaen,G. TITLE Molecular cloning of the cDNAs coding for the two subunits of soluble guanylyl cyclase from human brain JOURNAL FEBS Lett. 304 (1), 83-88 (1992) MEDLINE 92316204 REFERENCE 2 (bases 1 to 3004) AUTHORS Guellaen,G. TITLE Direct Submission JOURNAL Submitted (10-JUL-1992) Georges Guellaen, Unite INSERM 99, Hopital Henri Mondor, 94010 Creteil, FRANCE FEATURES Location/Qualifiers source 1..3004 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain, frontal lobe" /clone_lib="lambda GT10" /chromosome="4" /map="4q31.3-q33" gene 523..2676 /gene="GC-S-alpha-3" CDS 523..2676 /gene="GC-S-alpha-3" /EC_number="4.6.1.2" /note="soluble guanylate cyclase large subunit" /codon_start=1 /product="guanylate cyclase" /db_xref="PID:g31684" /db_xref="SWISS-PROT:Q02108" /translation="MFCTKLKDLKITGECPFSLLAPGQVPNESSEEAAGSSESCKATV PICQDIPEKNIQESLPQRKTSRSRVYLHTLAESICKLIFPEFERLNVALQRTLAKHKI KESRKSLEREDFEKTIAEQAVQQSPVELSKNLLVKRFLKYVTRKMKTSLGWLEAPLKI FKQLQYPSETEQPLPRSRKKGQLEDASILCLDKEDDFLHVYYFFPKRTTSLILPGIIK AAAHVLYETEVEVSLMPPCFHNDCSEFVNQPYLLYSVHMKSTKPSLSPSKPQSSLVIP TSLFCKTFPFHFMFDKDMTILQFGNGIRRLMNRRDFQGKPNFEYFEILTPKINQTFSG IMTMLNMQFVVRVRRWDNSVKKSSRVMDLKGQMIYIVESSAILFLGSPCVDRLEDFTG RGLYLSDIPIHNALRDVVLIGEQARAQDGLKKRLGKLKATLEQAHQALEEEKKKTVDL LCSIFPCEVAQQLWQGQVVQAKKFSNVTMLFSDIVGFTAICSQCSPLQVITMLNALYT RFDQQCGELDVYKVETIAMPIVWLGGLHKESDTHAVQIALMALKMMELSDEVMSPHGE PIKMRIGLHSGSVFAGVVGVKMPRYCLFGNNVTLANKFESCSVPRKINVSPTTYRLLK DCPGFVFTPRSREELPPNFPSEIPGICHFLDAYQQGTNSKPCFQKKDVEDASQFFRQS IRNRLATYIPIYKSLGFDSLKMCRASESTLGIVDG" BASE COUNT 874 a 679 c 710 g 741 t ORIGIN 1 cccttatggc gattgggcgg ctgcagagac caggactcag ttcccctgcc ctagtctgag 61 cctagtgggt gggactcagc tcagagtcag ttttcagaag caggtttcag ttgcagagtt 121 ttcctacact tttcctgcgc tagagcagcg agcagcctgg aacagaccca ggcggaggac 181 acctgtgggg gagggagcgc ctggaggagc ttagagaccc cagccgggcg tgatctcacc 241 atgtgcggat ttgcgaggcg cgccctggag ctgctagaga tccggaagca cagccccgag 301 gtgtgcgaag ccaccaagac tgcggctctt ggagaaagcg tgagcagggg gccaccgcgg 361 tctccggcct gtctgcaccc tgtcgcctga gctgcctgac agtgacaatg acatcccagt 421 taccagtgtc cttgaattga tagtggcttc tgtttgtcag tctcatataa gaactacagc 481 tcatcaggag gagatcgcag cagggtaaga gacaccaaca ccatgttctg cacgaagctc 541 aaggatctca agatcacagg agagtgtcct ttctccttac tggcaccagg tcaagttcct 601 aacgagtctt cagaggaggc agcaggaagc tcagagagct gcaaagcaac cgtgcccatc 661 tgtcaagaca ttcctgagaa gaacatacaa gaaagtcttc ctcaaagaaa aaccagtcgg 721 agccgagtct atcttcacac tttggcagag agtatttgca aactgatttt cccagagttt 781 gaacggctga atgttgcact tcagagaaca ttggcaaagc acaaaataaa agaaagcagg 841 aaatctttgg aaagagaaga ctttgaaaaa acaattgcag agcaagcagt gcagcagagt 901 ccagtggagt tatcaaagaa tctcttggtg aagaggtttt taaaatatgt tacgaggaag 961 atgaaaacat ccttggggtg gttggaggca cccttaaaga tttttaaaca gcttcagtac 1021 ccttctgaaa cagagcagcc attgccaaga agcaggaaaa aggggcagct tgaggacgcc 1081 tccattctat gcctggataa ggaggatgat tttctacatg tttactactt cttccctaag 1141 agaaccacct ccctgattct tcccggcatc ataaaggcag ctgctcacgt attatatgaa 1201 acggaagtgg aagtgtcgtt aatgcctccc tgcttccata atgattgcag cgagtttgtg 1261 aatcagccct acttgttgta ctccgttcac atgaaaagca ccaagccatc cctgtccccc 1321 agcaaacccc agtcctcgct ggtgattccc acatcgctat tctgcaagac atttccattc 1381 catttcatgt ttgacaaaga tatgacaatt ctgcaatttg gcaatggcat cagaaggctg 1441 atgaacagga gagactttca aggaaagcct aattttgaat actttgaaat tctgactcca 1501 aaaatcaacc agacctttag cgggatcatg actatgttga atatgcagtt tgttgtacga 1561 gtgaggagat gggacaactc tgtgaagaaa tcttcaaggg ttatggacct caaaggccaa 1621 atgatctaca ttgttgaatc cagtgcaatc ttgtttttgg ggtcaccctg tgtggacaga 1681 ttagaagatt ttacaggacg agggctctac ctctcagaca tcccaattca caatgcactg 1741 agggatgtgg tcttaatagg ggaacaagcc cgagctcaag atggcctgaa gaagaggctg 1801 gggaagctga aggctaccct tgagcaagcc caccaagccc tggaggagga gaagaaaaag 1861 acagtagacc ttctgtgctc catatttccc tgtgaggttg ctcagcagct gtggcaaggg 1921 caagttgtgc aagccaagaa gttcagtaat gtcaccatgc tcttctcaga catcgttggg 1981 ttcactgcca tctgctccca gtgctcaccg ctgcaggtca tcaccatgct caatgcactg 2041 tacactcgct tcgaccagca gtgtggagag ctggatgtct acaaggtgga gaccattgcg 2101 atgcctattg tgtggcttgg gggattacac aaagagagtg atactcatgc tgttcagata 2161 gcgctgatgg ccctgaagat gatggagctc tctgatgaag ttatgtctcc ccatggagaa 2221 cctatcaaga tgcgaattgg actgcactct ggatcagttt ttgctggcgt cgttggagtt 2281 aaaatgcccc gttactgtct ttttggaaac aatgtcactc tggctaacaa atttgagtcc 2341 tgcagtgtac cacgaaaaat caatgtcagc ccaacaactt acagattact caaagactgt 2401 cctggtttcg tgtttacccc tcgatcaagg gaggaacttc caccaaactt ccctagtgaa 2461 atccccggaa tctgccattt tctggatgct taccaacaag gaacaaactc aaaaccatgc 2521 ttccaaaaga aagatgtgga agatgcaagc caatttttta ggcaaagcat caggaataga 2581 ttagcaacct atatacctat ttataagtct ttggggtttg actcattgaa gatgtgtaga 2641 gcctctgaaa gcactttagg gattgtagat ggctaacaag cagtattaaa atttcaggag 2701 ccaagtcaca atctttctcc tgtttaacat gacaaaatgt actcacttca gtacttcagc 2761 tcttcaagaa aaaaaaaaaa accttaaaaa gctacttttg tgggagtatt tctattatat 2821 aaccagcact tactacctgt actcaaaatt cagcaccttg tacatatatc agataattgt 2881 agtcaattgt acaaactgat ggagtcacct gcaatctcat atcctggtgg aatgccatgg 2941 ttattaaagt gtgtttgtga tagttgtcgt caaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3001 aaaa // LOCUS HSGCSAB 2443 bp RNA PRI 19-NOV-1992 DEFINITION H.sapiens soluble guanylate cyclase small subunit mRNA. ACCESSION X66533 NID g31685 KEYWORDS cytoplasmic protein; GTP pyrophosphate-lyase; Guanylate cyclase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2443) AUTHORS Giuili,G., Scholl,U., Bulle,F. and Guellaen,G. TITLE Molecular cloning of the cDNAs coding for the two subunits of soluble guanylyl cyclase from human brain JOURNAL FEBS Lett. 304 (1), 83-88 (1992) MEDLINE 92316204 REFERENCE 2 (bases 1 to 2443) AUTHORS Guellaen,G. TITLE Direct Submission JOURNAL Submitted (10-JUL-1992) Georges Guellaen, Unite INSERM 99, Hopital Henri Mondor, 94010 Creteil, FRANCE FEATURES Location/Qualifiers source 1..2443 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain, frontal lobe" /clone_lib="lambda GT10" /chromosome="4" /map="4q31.3-q33" CDS 89..1948 /EC_number="4.6.1.2" /note="soluble" /codon_start=1 /product="guanylate cyclase" /db_xref="PID:g31686" /db_xref="SWISS-PROT:Q02153" /translation="MYGFVNHALELLVIRNYGPEVWEDIKKEAQLDEEGQFLVRIIYD DSKTYDLVAAASKVLNLNAGEILQMFGKMFFVFCQESGYDTILRVLGSNVREFLQNLD ALHDHLATIYPGMRAPSFRCTDAEKGKGLILHYYSEREGLQDIVIGIIKTVAQQIHGT EIDMKVIQQRNEECDHTQFLIEEKESKEEDFYEDLDRFEENGTQESRISPYTFCKAFP FHIIFDRDLVVTQCGNAIYRVLPQLQPGNCSLLSVFSLVRPHIDISFHGILSHINTVF VLRSKEGLLDVEKLECEDELTGTEISCLRLKGQMIYLPEADSILFLCSPSVMNLDDLT RRGLYLSDIPLHDATRDLVLLGEQFREEYKLTQELEILTDRLQLTLRALEDEKKKTDT LLYSVLPPSVANELRHKRPVPAKRYDNVTILFSGIVGFNAFCSKHASGEGAMKIVNLL NDLYTRFDTLTDSRKNPFVYKVETVGDKYMTVSGLPEPCIHHARSICHLALDMMEIAG QVQVDGESVQITIGIHTGEVVTGVIGQRMPRYCLFGNTVNLTSRTETTGEKGKINVSE YTYRCLMSPENSDPQFHLEHRGPVSMKGKKEPMQVWFLSRKNTGTEETKQDDD" BASE COUNT 709 a 513 c 533 g 688 t ORIGIN 1 cccccccccg ccgctgccgc ctctgcctgg gtcccttcgg ccgtacctct gcgtgggggc 61 tgcctccccg gctcccggtg cagacaccat gtacggattt gtgaatcacg ccctggagtt 121 gctggtgatc cgcaattacg gccccgaggt gtgggaagac atcaaaaaag aggcacagtt 181 agatgaagaa ggacagtttc ttgtcagaat aatatatgat gactccaaaa cttatgattt 241 ggttgctgct gcaagcaaag tcctcaatct caatgctgga gaaatcctcc aaatgtttgg 301 gaagatgttt ttcgtctttt gccaagaatc tggttatgat acaatcttgc gtgtcctggg 361 ctctaatgtc agagaatttc tacagaacct tgatgctctg cacgaccacc ttgctaccat 421 ctacccagga atgcgtgcac cttcctttag gtgcactgat gcagaaaagg gcaaaggact 481 cattttgcac tactactcag agagagaagg acttcaggat attgtcattg gaatcatcaa 541 aacagtggca caacaaatcc atggcactga aatagacatg aaggttattc agcaaagaaa 601 tgaagaatgt gatcatactc aatttttaat tgaagaaaaa gagtcaaaag aagaggattt 661 ttatgaagat cttgacagat ttgaagaaaa tggtacccag gaatcacgca tcagcccata 721 tacattctgc aaagcttttc cttttcatat aatatttgac cgggacctag tggtcactca 781 gtgtggcaat gctatataca gagttctccc ccagctccag cctgggaatt gcagccttct 841 gtctgtcttc tcgctggttc gtcctcatat tgatattagt ttccatggga tcctttctca 901 catcaatact gtttttgtat tgagaagcaa ggaaggattg ttggatgtgg agaaattaga 961 atgtgaggat gaactgactg ggactgagat cagctgctta cgtctcaagg gtcaaatgat 1021 ctacttacct gaagcagata gcatactttt tctatgttca ccaagtgtca tgaacctgga 1081 cgatttgaca aggagagggc tgtatctaag tgacatccct ctgcatgatg ccacgcgcga 1141 tcttgttctt ttgggagaac aatttagaga ggaatacaaa ctcacccaag aactggaaat 1201 cctcactgac aggctacagc tcacgttaag agccctggaa gatgaaaaga aaaagacaga 1261 cacattgctg tattctgtcc ttcctccgtc tgttgccaat gagctgcggc acaagcgtcc 1321 agtgcctgcc aaaagatatg acaatgtgac catcctcttt agtggcattg tgggcttcaa 1381 tgctttctgt agcaagcatg catctggaga aggagccatg aagatcgtca acctcctcaa 1441 cgacctctac accagatttg acacactgac tgattcccgg aaaaacccat ttgtttataa 1501 ggtggagact gttggtgaca agtatatgac agtgagtggt ttaccagagc catgcattca 1561 ccatgcacga tccatctgcc acctggcctt ggacatgatg gaaattgctg gccaggttca 1621 agtagatggt gaatctgttc agataacaat agggatacac actggagagg tagttacagg 1681 tgtcatagga cagcggatgc ctcgatactg tctttttggg aatactgtca acctcacaag 1741 ccgaacagaa accacaggag aaaagggaaa aataaatgtg tctgaatata catacagatg 1801 tcttatgtct ccagaaaatt cagatccaca attccacttg gagcacagag gcccagtgtc 1861 catgaagggc aaaaaagaac caatgcaagt ttggtttcta tccagaaaaa atacaggaac 1921 agaggaaaca aagcaggatg atgactgaat cttggattat ggggtgaaga ggagtacaga 1981 ctaggttcca gttttctcct aacacgtgcc aagcccagga gcagttcttc cctatggata 2041 cagattttct tttgtccttg tccattaccc caagactttc ttctagatat atctctcact 2101 atccgttatt caaccttagc tctgctttct attacttttt aggctttagt atattatcta 2161 aagtttggct tttgatgtgg atgatgtgag cttcatgtgt cttaaaatct actacaagca 2221 ttacctaaca tggtgatctg caagtagtag gcacccaata aatatttgtt gaatttagtt 2281 aaatgaaact gaacagtgtt tggccatgtg tatatttata tcatgtttac caaatctgtt 2341 tagtgttcca catatatgta tatgtatatt ttaatgacta taatgtaata aagtttatat 2401 catgttggtg tatatcatta tagaaatcat tttctaaagg agt // LOCUS HSGCSFR2 2933 bp RNA PRI 06-FEB-1991 DEFINITION Human mRNA coding for granulocyte colony stimulating factor receptor 25-1. ACCESSION X55721 NID g31696 KEYWORDS granulocyte colony stimulating factor receptor; integral membrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2933) AUTHORS Larsen,A.D. TITLE Direct Submission JOURNAL Submitted (02-OCT-1990) Larsen A.D., Immunex Research & Development Corp., 51 University St, Seattle WA 98101, U S A REFERENCE 2 (bases 1 to 2933) AUTHORS Larsen,A., Davis,T., Curtis,B.M., Gimpel,S., Sims,J.E., Cosman,D., Park,L., Sorensen,E., March,C.J. and Smith,C.A. TITLE Expression cloning of a human granulocyte colony-stimulating factor receptor: a structural mosaic of hematopoietin receptor, immunoglobulin, and fibronectin domains JOURNAL J. Exp. Med. 172 (6), 1559-1570 (1990) MEDLINE 91079757 COMMENT contains homologies to immmunoglobulin family, hematopoeitin receptor family, and type III repeats of fibronectin. FEATURES Location/Qualifiers source 1..2933 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placental" /clone="HuGCSFR-25-1" mRNA <1..>2933 CDS 166..2676 /codon_start=1 /product="granulocyte colony stimulating factor receptor 25-1" /db_xref="PID:g31697" /db_xref="SWISS-PROT:Q99062" /translation="MARLGNCSLTWAALIILLLPGSLEECGHISVSAPIVHLGDPITA SCIIKQNCSHLDPEPQILWRLGAELQPGGRQQRLSDGTQESIITLPHLNHTQAFLSCC LNWGNSLQILDQVELRAGYPPAIPHNLSCLMNLTTSSLICQWEPGPETHLPTSFTLKS FKSRGNCQTQGDSILDCVPKDGQSHCCIPRKHLLLYQNMGIWVQAENALGTSMSPQLC LDPMDVVKLEPPMLRTMDPSPEAAPPQAGCLQLCWEPWQPGLHINQKCELRHKPQRGE ASWALVGPLPLEALQYELCGLLPATAYTLQIRCIRWPLPGHWSDWSPSLELRTTERAP TVRLDTWWRQRQLDPRTVQLFWKPVPLEEDSGRIQGYVVSWRPSGQAGAILPLCNTTE LSCTFHLPSEAQEVALVAYNSAGTSRPTPVVFSESRGPALTRLHAMARDPHSLWVGWE PPNPWPQGYVIEWGLGPPSASNSNKTWRMEQNGRATGFLLKENIRPFQLYEIIVTPLY QDTMGPSQHVYAYSQEMAPSHAPELHLKHIGKTWAQLEWVPEPPELGKSPLTHYTIFW TNAQNQSFSAILNASSRGFVLHGLEPASLYHIHLMAASQAGATNSTVLTLMTLTPEGS ELHIILGLFGLLLLLTCLCGTAWLCCSPNRKNPLWPSVPDPAHSSLGSWVPTIMEEDA FQLPGLGTPPITKLTVLEEDEKKPVPWESHNSSETCGLPTLVQTYVLQGDPRAVSTQP QSQSGTSDQVLYGQLLGSPTSPGPGHYLRCDSTQPLLAGLTPSPKSYENLWFQASPLG TLVTPAPSQEDDCVFGPLLNFPLLQGIRVHGMEALGSF" mat_peptide 238..2673 /product="granulocyte colony stimulating factor receptor 25-1" BASE COUNT 607 a 992 c 793 g 541 t ORIGIN 1 ctggactgca gctggtttca ggaacttctc ttgacgagaa gagagaccaa ggaggccaag 61 caggggctgg gccagaggtg ccaacatggg gaaactgagg ctcggctcgg aaaggtgaag 121 taacttgtcc aagatcacaa agctggtgaa catcaagttg gtgctatggc aaggctggga 181 aactgcagcc tgacttgggc tgccctgatc atcctgctgc tccccggaag tctggaggag 241 tgcgggcaca tcagtgtctc agcccccatc gtccacctgg gggatcccat cacagcctcc 301 tgcatcatca agcagaactg cagccatctg gacccggagc cacagattct gtggagactg 361 ggagcagagc ttcagcccgg gggcaggcag cagcgtctgt ctgatgggac ccaggaatct 421 atcatcaccc tgccccacct caaccacact caggcctttc tctcctgctg cctgaactgg 481 ggcaacagcc tgcagatcct ggaccaggtt gagctgcgcg caggctaccc tccagccata 541 ccccacaacc tctcctgcct catgaacctc acaaccagca gcctcatctg ccagtgggag 601 ccaggacctg agacccacct acccaccagc ttcactctga agagtttcaa gagccggggc 661 aactgtcaga cccaagggga ctccatcctg gactgcgtgc ccaaggacgg gcagagccac 721 tgctgcatcc cacgcaaaca cctgctgttg taccagaata tgggcatctg ggtgcaggca 781 gagaatgcgc tggggaccag catgtcccca caactgtgtc ttgatcccat ggatgttgtg 841 aaactggagc cccccatgct gcggaccatg gaccccagcc ctgaagcggc ccctccccag 901 gcaggctgcc tacagctgtg ctgggagcca tggcagccag gcctgcacat aaatcagaag 961 tgtgagctgc gccacaagcc gcagcgtgga gaagccagct gggcactggt gggccccctc 1021 cccttggagg cccttcagta tgagctctgc gggctcctcc cagccacggc ctacaccctg 1081 cagatacgct gcatccgctg gcccctgcct ggccactgga gcgactggag ccccagcctg 1141 gagctgagaa ctaccgaacg ggcccccact gtcagactgg acacatggtg gcggcagagg 1201 cagctggacc ccaggacagt gcagctgttc tggaagccag tgcccctgga ggaagacagc 1261 ggacggatcc aaggttatgt ggtttcttgg agaccctcag gccaggctgg ggccatcctg 1321 cccctctgca acaccacaga gctcagctgc accttccacc tgccttcaga agcccaggag 1381 gtggcccttg tggcctataa ctcagccggg acctctcgcc ccaccccggt ggtcttctca 1441 gaaagcagag gcccagctct gaccagactc catgccatgg cccgagaccc tcacagcctc 1501 tgggtaggct gggagccccc caatccatgg cctcagggct atgtgattga gtggggcctg 1561 ggccccccca gcgcgagcaa tagcaacaag acctggagga tggaacagaa tgggagagcc 1621 acggggtttc tgctgaagga gaacatcagg ccctttcagc tctatgagat catcgtgact 1681 cccttgtacc aggacaccat gggaccctcc cagcatgtct atgcctactc tcaagaaatg 1741 gctccctccc atgccccaga gctgcatcta aagcacattg gcaagacctg ggcacagctg 1801 gagtgggtgc ctgagccccc tgagctgggg aagagccccc ttacccacta caccatcttc 1861 tggaccaacg ctcagaacca gtccttctcc gccatcctga atgcctcctc ccgtggcttt 1921 gtcctccatg gcctggagcc cgccagtctg tatcacatcc acctcatggc tgccagccag 1981 gctggggcca ccaacagtac agtcctcacc ctgatgacct tgaccccaga ggggtcggag 2041 ctacacatca tcctgggcct gttcggcctc ctgctgttgc tcacctgcct ctgtggaact 2101 gcctggctct gttgcagccc caacaggaag aatcccctct ggccaagtgt cccagaccca 2161 gctcacagca gcctgggctc ctgggtgccc acaatcatgg aggaggatgc cttccagctg 2221 cccggccttg gcacgccacc catcaccaag ctcacagtgc tggaggagga tgaaaagaag 2281 ccggtgccct gggagtccca taacagctca gagacctgtg gcctccccac tctggtccag 2341 acctatgtgc tccaggggga cccaagagca gtttccaccc agccccaatc ccagtctggc 2401 accagcgatc aggtccttta tgggcagctg ctgggcagcc ccacaagccc agggccaggg 2461 cactatctcc gctgtgactc cactcagccc ctcttggcgg gcctcacccc cagccccaag 2521 tcctatgaga acctctggtt ccaggccagc cccttgggga ccctggtaac cccagcccca 2581 agccaggagg acgactgtgt ctttgggcca ctgctcaact tccccctcct gcaggggatc 2641 cgggtccatg ggatggaggc gctggggagc ttctagggct tcctggggtt cccttcttgg 2701 gcctgccttt taaaggcctg agctagctgg agaagagggg agggtccata agcccatgac 2761 taaaaactac cccagcccag gctctcacca tctccagtca ccagcatctc cctctcctcc 2821 caatctccat aggctgggcc tcccaggcga tctgcatact ttaaggacca gatcatgctc 2881 catccagccc cacccaatgg ccttttgtgc ttgtttccta taacttcagt att // LOCUS HSGD3S 2117 bp RNA PRI 12-JUL-1994 DEFINITION H.sapiens GD3 synthase mRNA. ACCESSION X77922 NID g510987 KEYWORDS GD3 synthase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2117) AUTHORS Sasaki,K., Kurata,K., Kojima,N., Kurosawa,N., Ohta,S., Hanai,N., Tsuji,S. and Nishi,T. TITLE Expression cloning of a GM3-specific alpha-2,8-sialyltransferase (GD3 synthase) JOURNAL J. Biol. Chem. 269 (22), 15950-15956 (1994) MEDLINE 94253194 REFERENCE 2 (bases 1 to 2117) AUTHORS Nishi,T. TITLE Direct Submission JOURNAL Submitted (28-FEB-1994) T. Nishi, Tokyo Research Labs., Kyowa Hakko Kogyo Co. Ltd, Machida, Tokyo 194, JAPAN FEATURES Location/Qualifiers source 1..2117 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /cell_type="melanoma" /cell_line="WM266-4" /clone="pAMo-GD3" /chromosome="12" CDS 483..1553 /codon_start=1 /product="GD3 synthase" /db_xref="PID:g510988" /translation="MSPCGRARRQTSRGAMAVLAWKFPRTRLPMGASALCVVVLCWLY IFPVYRLPNEKEIVQGVLQQGTAWRRNQTAARAFRKQMEDCCDPAHLFAMTKMNSPMG KSMWYDGEFLYSFTIDNSTYSLFPQATPFQLPLKKCAVVGNGGILKKSGCGRQIDEAN FVMRCNLPPLSSEYTKDVGSKSQLVTANPSIIRQRFQNLLWSRKTFVDNMKIYNHSYI YMPAFSMKTGTEPSLRVYYTLSDVGANQTVLFANPNFLRSIGKFWKSRGIHAKRLSTG LFLVSAALGLCEEVAIYGFWPFSVNMHEQPISHHYYDNVLPFSGFHAMPEEFLQLWYL HKIGALRMQLDPCEDTSLQPTS" BASE COUNT 554 a 502 c 571 g 490 t ORIGIN 1 ggggggggtt gcctggcggc gcagcagcgc gggaggcggc gaagggcgca ggagcatcgc 61 tcggagggga caaggggacg ccacgggcca catgtttagg agggagccga gccttctccc 121 ggaccctcgc cgagggcgac cgtgatgctg cagaaccggc gggagcgact cgccgccgcc 181 gctctctgcg cactcggaga ccccagcgcc cgccttctgc aggggaagcg accatggcca 241 tagatcgtga cttcaccccc agccacttcc cctagaaaga aatccttgga aaagttgcat 301 ttgaaaaaat ccttgcgctg acctttgggg ccgacggggc cgaagaagcg tgcgtgcgtt 361 tgcaagtaag agaaccaaag gtgtgtgtgc atggggggct ggcggtgggg gaccctccgc 421 tgccacttcg cctagctttg tgctgaggcc ccggcccccg cccctgggac gccggggctg 481 cgatgagccc ctgcgggcgg gcccggcgac aaacgtccag aggggccatg gctgtactgg 541 cgtggaagtt cccgcggacc cggctgccca tgggagccag tgccctctgt gtcgtggtcc 601 tctgttggct ctacatcttc cccgtctacc ggctgcccaa cgagaaagag atcgtgcagg 661 gggtgctgca acagggcacg gcgtggagga ggaaccagac cgcggccaga gcgttcagga 721 aacaaatgga agactgctgc gaccctgccc atctctttgc tatgactaaa atgaattccc 781 ctatggggaa gagcatgtgg tatgacgggg agtttttata ctcattcacc attgacaatt 841 caacttactc tctcttccca caggcaaccc cattccagct gccattgaag aaatgcgcgg 901 tggtgggaaa tggtgggatt ctgaagaaga gtggctgtgg ccgtcaaata gatgaagcaa 961 attttgtcat gcgatgcaat ctccctcctt tgtcaagtga atacactaag gatgttggat 1021 ccaaaagtca gttagtgaca gctaatccca gcataattcg gcaaaggttt cagaaccttc 1081 tgtggtccag aaagacattt gtggacaaca tgaaaatcta taaccacagt tacatctaca 1141 tgcctgcctt ttctatgaag acaggaacag agccatcttt gagggtttat tatacactgt 1201 cagatgttgg tgccaatcaa acagtgctgt ttgccaaccc caactttctg cgtagcattg 1261 gaaagttctg gaaaagtaga ggaatccatg ccaagcgcct gtccacagga ctttttctgg 1321 tgagcgcagc tctgggtctc tgtgaagagg tggccatcta tggcttctgg cccttctctg 1381 tgaatatgca tgagcagccc atcagccacc actactatga caacgtctta cccttttctg 1441 gcttccatgc catgcccgag gaatttctcc aactctggta tcttcataaa atcggtgcac 1501 tgagaatgca gctggaccca tgtgaagata cctcactcca gcccacttcc taggaacaat 1561 ggaagaagaa aggactgaac cagggtattt ttgttaggtt ttctatgtga ctccaagagg 1621 gaatggtcaa gttgtttcat gagtttgcat gggcccttgg aaaaacagga aaggagcaat 1681 gaagatccaa gcaaaacttt actttcagcg ttggcttgga ggacaaataa gaaatgaaac 1741 atcctatgaa atactttata gcacatggca gatttgcaac tagtaaaatg ctggtgaaat 1801 gctgttggta aagcacatgg ttcaaatcta gaagatgcag ttcaaaaaca agacagactc 1861 gagttgttag ggctgaggaa ccaatcaagg tagaacaaag aaaatgttgg ggtaaaagtg 1921 ttgctgattg tcaacacaaa ctggcttaat aatattaata agaacctgtc ttattaagac 1981 tggctttaga accgtaggtt tttttaaaaa attattattt atttttgccc tctttgggga 2041 agtgggtggg tagatttaaa aaatcccttc ctgagtaata aagatacaaa atgttactgc 2101 tgataaaaaa aaaaaaa // LOCUS HSGDHR 2970 bp RNA PRI 16-DEC-1994 DEFINITION Human mRNA for glutamate dehydrogenase (EC 1.4.1.3., GDH). ACCESSION X07674 M18377 NID g31706 KEYWORDS glutamate dehydrogenase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2970) AUTHORS Nakatani,Y. TITLE Direct Submission JOURNAL Submitted (11-MAY-1988) Nakatani Y., LMB-NINCDS, Bldg 36, 3C-24, Bethesda MD, 20892 REFERENCE 2 (bases 404 to 2950) AUTHORS Nakatani,Y., Banner,C., von Herrath,M., Schneider,M.E., Smith,H.H. and Freese,E. TITLE Comparison of human brain and liver glutamate dehydrogenase cDNAS JOURNAL Biochem. Biophys. Res. Commun. 149 (2), 405-410 (1987) MEDLINE 88106451 COMMENT cDNAs from liver and brain were identical. FEATURES Location/Qualifiers source 1..2970 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver, brain" /cell_type="fibroblast" /clone_lib="lambda gt11" /clone="pA2" mRNA <1..1960 sig_peptide 14..172 CDS 14..1690 /codon_start=1 /product="GDH" /db_xref="PID:g31707" /db_xref="SWISS-PROT:P00367" /translation="MYRYLGEALLLSRAGPAALGSASADSAALLGWARGQPAAAPQPG LALAARRHYSEAVADREDDPNFFKMVEGFFDRGASIVEDKLVEDLRTRESEEQKRNRV RGILRIIKPCNHVLSLSFPIRRDDGSWEVIEGYRAQHSQHRTPCKGGIRYSTDVSVDE VKALASLMTYKCAVVDVPFGGAKAGVKINPKNYTDNELEKITRRFTMELAKKGFIGPG IDVPAPDMSTGEREMSWIADTYASTIGHYDINAHACVTGKPISQGGIHGRISATGRGV FHGIENFINEASYMSILGMTPGFGDKTFVVQGFGNVGLHSMRYLHRFGAKCIAVGESD GSIWNPDGIDPKELEDFKLQHGSILGFPKAKPYEGSILEADCDILIPAASEKQLTKSN APRVKAKIIAEGANGPTTPEADKIFLERNIMVIPDLYLNAGGVTVSYFEWLKNLNHVS YGRLTFKYERDSNYHLLMSVQESLERKFGKHGGTIPIVPTAEFQDRISGASEKDIVHS GLAYTMERSARQIMRTAMKYNLGLDLRTAAYVNAIEKVFKVYNEAGVTFT" mat_peptide 173..1687 /product="GDH" BASE COUNT 818 a 651 c 702 g 799 t ORIGIN 1 tccgcttgtg gccatgtacc gctacctggg cgaagcgctg ttgctgtccc gggccgggcc 61 cgctgccctg ggctcggcgt ccgccgactc ggccgcgttg ctgggctggg cccggggaca 121 gcccgccgcc gccccgcagc cggggctggc attggccgcc cggcgccact acagcgaggc 181 ggtggccgac cgcgaggacg accccaactt cttcaagatg gtggagggct tcttcgatcg 241 cggcgccagc atcgtggagg acaagctggt ggaggacctg aggacccggg agagcgagga 301 gcagaagcgg aaccgggtgc gcggcatcct gcggatcatc aagccctgca accatgtgct 361 gagtctctcc ttccccatcc ggcgcgacga cggctcctgg gaggtcatcg aaggctaccg 421 ggcccagcac agccagcacc gcacgccctg caagggaggt atccgttaca gcactgatgt 481 gagtgtagat gaagtaaaag ctttggcttc tctgatgaca tacaagtgtg cagtggttga 541 tgtgccgttt gggggtgcta aagctggtgt taagatcaat cccaagaact atactgataa 601 tgaattggaa aagatcacaa ggaggttcac catggagcta gcaaaaaagg gctttattgg 661 tcctggcatt gatgtgcctg ctccagacat gagcacaggt gagcgggaga tgtcctggat 721 cgctgatacc tatgccagca ccatagggca ctatgatatt aatgcacacg cctgtgttac 781 tggtaaaccc atcagccaag ggggaatcca tggacgcatc tctgctactg gccgtggtgt 841 cttccatggg attgaaaatt tcatcaatga agcttcttac atgagcattt taggaatgac 901 accagggttt ggagataaaa catttgttgt tcagggattt ggtaatgtgg gcctacactc 961 tatgagatat ttacatcgtt ttggtgctaa atgtattgct gttggtgagt ctgatgggag 1021 tatatggaat ccagatggta ttgacccaaa ggaactggaa gacttcaaat tgcaacatgg 1081 gtccattctg ggcttcccca aggcaaagcc ctatgaagga agcatcttgg aggccgactg 1141 tgacatactg atcccagctg ccagtgagaa gcagttgacc aaatccaacg cacccagagt 1201 caaagccaag atcattgctg aaggtgccaa tgggccaaca actccagaag ctgacaagat 1261 cttcctggag agaaacatta tggttattcc agatctctac ttgaatgctg gaggagtgac 1321 agtatcttac tttgagtggc tgaagaatct aaatcatgtc agctatggcc gtttgacctt 1381 caaatatgaa agggattcta actaccactt gctcatgtct gttcaagaga gtttagaaag 1441 aaaatttgga aagcatggtg gaactattcc cattgtaccc acggcagagt tccaagacag 1501 gatatcgggt gcatctgaga aagacatcgt gcactctggc ttggcataca caatggagcg 1561 ttctgccagg caaattatgc gcacagccat gaagtataac ctgggattgg acctgagaac 1621 agctgcctat gttaatgcca ttgagaaagt cttcaaagtg tacaatgaag ctggtgtgac 1681 cttcacatag atggatcatg gctgacttcc tcactatcct cttcacatgt aacttctgca 1741 gacctatcac aagtttacat gtaaccacag aaatcccttt ctctcctgac tcattaataa 1801 tggataccat tctcaacaag tcaatccaag tcagcccgtt aaggagaaag aaattaaggt 1861 tagcggatca tgtacaagct gagtgtgaaa gtagaaatca cctacaccag agagccattt 1921 tggtattttg cctttaaata aaaagcctcc tttatctggc tgtgcagcct tgctctgtgg 1981 cttttcccaa cacaatcagt gctagtgctg gggaggaaca gtcaagagca gtcagttgct 2041 tgcttatttt ttctggatga gtctgggaca cactgtaact ttaacacatt taagaagtag 2101 gtgtgtggcc ttttcagaag gtggcatggt cctcaagtga gttcttagta ttttatatca 2161 gcaaaataat tcaattttgc aggttgcaaa caaatataaa acctgtttct gtttatgaat 2221 attattcttt tagaatagaa taagtacatg ctgctgtaat aaaattgcct ttaatcactt 2281 aacaagccta accttgactc aaacagtgaa tgcctataga aataataaat gaaaaaaact 2341 agtattttta tatcataaaa caatgtcatt tatagcttat cattcatgta ttgtccagca 2401 gacattaaaa gccctgtgga taattaagtt atcttcatac ctgcaaaatg gtggaggcta 2461 ttttcattaa aactgtcaga atttgcttac tataattatg atacagtcca aagaatgcag 2521 tcacttttta tcatgttaac taattgttct cttttgaaga tctatggttg actaattaaa 2581 caataattca agtagagtgt cccagaaaaa aaccacttgg gctccctgtt tggagtctgg 2641 ctggctctga gcattgccaa tggcccctac tcacctgact ttgtatcctc tccttttaga 2701 ggctttgcat tctgcaccca gcttcactaa cagtgggctg aaaacatcct tgggttgagt 2761 gtttcatttg ggagttattt ggccagggcc ttttgaacag tagtgtcccc atgaagtgct 2821 agataatata tgtgtaagag tcagcttttt ttttttttta actctaacac ccttcagaaa 2881 tttctaacta ctttgtaact gcatggctta acctggtgat aaaagcagtt attaaaagtc 2941 tacgttttcc aaaaaaaaaa aaaaaaaaaa // LOCUS HSGDIPR 2181 bp RNA PRI 18-OCT-1995 DEFINITION H.sapiens mRNA for glucose-dependant insulinotropic polypeptide receptor gene. ACCESSION X81832 NID g1030050 KEYWORDS glucose-dependent insulinotropic polypeptide; receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2181) AUTHORS Thorens,B. TITLE Direct Submission JOURNAL Submitted (22-SEP-1994) B. Thorens, Institute of Pharmacology & Toxicology, Bugnon 27, 1000 Lausanne, SWITZERLAND REFERENCE 2 (bases 1 to 2181) AUTHORS Gremlich,S., Porret,A., Hani,E.H., Cherif,D., Vionnet,N., Froguel,P. and Thorens,B. TITLE Cloning, functional expression, and chromosomal localization of the human pancreatic islet glucose-dependent insulinotropic polypeptide receptor JOURNAL Diabetes 44 (10), 1202-1208 (1995) MEDLINE 96007224 FEATURES Location/Qualifiers source 1..2181 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /tissue_type="pancreas" /cell_type="pancreatic Langerhans islets cells" /clone_lib="Lambda ZapII" /clone="39, 44 and 17" /chromosome="19" /map="19q13.3" CDS 487..1962 /codon_start=1 /product="glucose-dependent insulinotropic polypeptide receptor" /db_xref="PID:g1030051" /translation="MTTSPILQLLLRLSLCGLLLQRAETGSKGQTAGELYQRWERYRR ECQETLAAAEPPSGLACNGSFDMYVCWDYAAPNATARASCPWYLPWHHHVAAGFVLRQ CGSDGQWGLWRDHTCENPEKNEAFLDQRLILERLQVMYTVGYSLSLATLLLALLILSL FRRLHCTRNYIHINLFTSFMLRAAAILSRDRLLPRPGPYLGDQALALWNQALAACRTA QIVTQYCVGANYTWLLVEGVYLHSLLVLVGGSEEGHFRYYLLLGWGAPALFVIPWVIV RYLYENTQCWERNEVKAIWWIIRTPILMTILINFLIFIRILGILLSKLRTRQMRCRDY RLRLARSTLTLVPLLGVHEVVFAPVTEEQARAPCVAKLGFEIFLSSFQGFLVSVLYCF INKEVGRDPAAAPALWRRRGTAPPLSAIVSQVQSEIRRGWHHCRLRRSLGEEQRQLPE RAFRALPSGSGPGEVPTSRGLSSGTLPGPGNEASRELESYC" exon 1675..1755 /note="alternative exon" BASE COUNT 452 a 622 c 615 g 492 t ORIGIN 1 tgtaagtacc cctgataagc aaattagtaa ttgtcaatac ccctgttaag caattccttt 61 ttgcagtata tttctgaaat gacagaatgc tgttttaaaa acaaagaaat aaaatcctgc 121 tcctgactcg gtcaaaatat ttttaaagtc tattgtttgt tgtgcttgct ggtactaaga 181 ggctatttaa aagtataaaa ctgctttgta tccatgaggg tttcattgtg tgttagcagc 241 agtgagcttc tattaaatgt atatgtcatt tattttgttt aagtggcttt cagcaaacct 301 cagtcatatt cttatgcagg gtattgcgaa acaacttgtg ttctattaat cgtgtcttca 361 attaaaagac cacagacttc tggaaaaaaa aaaaaaaaaa ggggctgcag gagcaagtga 421 ccaggagcag gactggggac aggcctgatc gcccctgcac gaaccagacc cttcgccgcc 481 ctcacgatga ctacctctcc gatcctgcag ctgctgctgc ggctctcact gtgcgggctg 541 ctgctccaga gggcggagac aggctctaag gggcagacgg cgggggagct gtaccagcgc 601 tgggaacggt accgcaggga gtgccaggag accttggcag ccgcggaacc gccttcaggc 661 ctcgcctgta acgggtcctt cgatatgtac gtctgctggg actatgctgc acccaatgcc 721 actgcccgtg cgtcctgccc ctggtacctg ccctggcacc accatgtggc tgcaggtttc 781 gtcctccgcc agtgtggcag tgatggccaa tggggacttt ggagagacca tacatgtgag 841 aacccagaga agaatgaggc ctttctggac caaaggctca tcttggagcg gttgcaggtc 901 atgtacactg tcggctactc cctgtctctc gccacactgc tgctagccct gctcatcttg 961 agtttgttca ggcggctaca ttgcactaga aactatatcc acatcaacct gttcacgtct 1021 ttcatgctgc gagctgcggc cattctcagc cgagaccgtc tgctacctcg acctggcccc 1081 taccttgggg accaggccct tgcgctgtgg aaccaggccc tcgctgcctg ccgcacggcc 1141 cagatcgtga cccagtactg cgtgggtgcc aactacacgt ggctgctggt ggagggcgtc 1201 tacctgcaca gtctcctggt gctcgtggga ggctccgagg agggccactt ccgctactac 1261 ctgctcctcg gctggggggc ccccgcgctt ttcgtcattc cctgggtgat cgtcaggtac 1321 ctgtacgaga acacgcagtg ctgggagcgc aacgaagtca aggccatttg gtggattata 1381 cggaccccca tcctcatgac catcttgatt aatttcctca tttttatccg cattcttggc 1441 attctcctgt ccaagctgag gacacggcaa atgcgctgcc gggattaccg gctgaggctg 1501 gctcgctcca cgctgacgct ggtgcccctg ctgggtgtcc acgaggtggt gtttgctccc 1561 gtgacagagg aacaggcccg ggcgccctgc gtcgccaagc tcggctttga gatcttcctc 1621 agctccttcc agggcttcct ggtcagcgtc ctctactgct tcatcaacaa ggaggtaggc 1681 agagacccgg ccgccgcccc cgccctctgg cgcaggcggg gtacggcgcc gcctctgagc 1741 gccatcgtct cacaggtgca gtcggagatc cgccgtggct ggcaccactg ccgcctgcgc 1801 cgcagcctgg gcgaggagca acgccagctc ccggagcgcg ccttccgggc cctgccctcc 1861 ggctccggcc cgggcgaggt ccccaccagc cgcggcttgt cctcggggac cctcccaggg 1921 cctgggaatg aggccagccg ggagttggaa agttactgct agggggcggg atccccgtgt 1981 ctgttcagtt agcatggatt tattgagtgc caactgcgtg ccaggcccag tacggaggac 2041 gctggggaaa tggtgaagga aacagaaaaa aggtcctgcc cttctggaga tgacaactga 2101 gtggggaaaa cagaccgtga acacaaaaca tcaagttcca cacacgctat ggaatggtta 2161 tgaagggaag cgagaagggg g // LOCUS HSGDPK 3740 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for type I beta cGMP-dependent protein kinase (EC 2.7.1.37). ACCESSION Y07512 NID g31708 KEYWORDS cGMP-dependent protein kinase; kinase; protein kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3740) AUTHORS Sandberg,M., Natarajan,V., Ronander,I., Kalderon,D., Walter,U., Lohmann,S.M. and Jahnsen,T. TITLE Molecular cloning and predicted full-length amino acid sequence of the type I beta isozyme of cGMP-dependent protein kinase from human placenta. Tissue distribution and developmental changes in rat JOURNAL FEBS Lett. 255 (2), 321-329 (1989) MEDLINE 90005998 REFERENCE 2 (bases 1 to 3740) AUTHORS Sandberg,M. TITLE Direct Submission JOURNAL Submitted (25-OCT-1989) to the EMBL/GenBank/DDBJ databases FEATURES Location/Qualifiers source 1..3740 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" CDS 59..2119 /note="cGMP-dependent protein kinase (AA 1-686)" /codon_start=1 /db_xref="PID:g31709" /db_xref="SWISS-PROT:P14619" /translation="MGTLRDLQYALQEKIEELRQRDALIDELELELDQKDELIQKLQN ELDKYRSVIRPATQQAQKQSASTLQGEPRTKRQAISAEPTAFDIQDLSHVTLPFYPKS PQSKDLIKEAILDNDFMKNLELSQIQEIVDCMYPVEYGKDSCIIKEGDVGSLVYVMED GKVEVTKEGVKLCTMGPGKVFGELAILYNCTRTATVKTLVNVKLWAIDRQCFQTIMMR TGLIKHTEYMEFLKSVPTFQSLPEEILSKLADVLEETHYENGEYIIRQGARGDTFFII SKGTVNVTREDSPSEDPVFLRTLGKGDWFGEKALQGEDVRTANVIAAEAVTCLVIDRD SFKHLIGGLDDVSNKAYEDAEAKAKYEAEAAFFANLKLSDFNIIDTLGVGGFGRVELV QLKSEESKTFAMKILKKRHIVDTRQQEHIRSEKQIMQGAHSDFIVRLYRTFKDSKYLY MLMEACLGGELWTILRDRGSFEDSTTRFYTACVVEAFAYLHSKGIIYRDLKPENLILD HRGYAKLVDFGFAKKIGFGKKTWTFCGTPEYVAPEIILNKGHDISADYWSLGILMYEL LTGSPPFSGPDPMKTYNIILRGIDMIEFPKKIAKNAANLIKKLCRDNPSERLGNLKNG VKDIQKHKWFEGFNWEGLRKGTLTPPIIPSVASPTDTSNFDSFPEDNDEPPPDDNSGW DIDF" BASE COUNT 1225 a 719 c 838 g 958 t ORIGIN 1 aggaagcctc aagacgcgga gcagcggcag gaaggagccc ccggcagccc ggaggagcat 61 gggcaccttg cgggatttac agtacgcgct ccaggagaag atcgaggagc tgaggcagcg 121 ggatgctctc atcgacgagc tggagctgga gttggatcag aaggacgaac tgatccagaa 181 gctgcagaac gagctggaca agtaccgctc ggtgatccga ccagccaccc agcaggcgca 241 gaagcagagc gcgagcacct tgcagggcga gccgcgcacc aagcggcagg cgatctccgc 301 cgagcccacc gccttcgaca tccaggatct cagccatgtg accctgccct tctaccccaa 361 gagcccacag tccaaggatc ttataaagga agctatcctt gacaatgact ttatgaagaa 421 cttggagctg tcgcagatcc aggagattgt ggattgtatg tacccggtgg agtatggcaa 481 ggacagttgc atcatcaaag aaggagacgt ggggtcactg gtgtatgtca tggaagatgg 541 taaggttgaa gttacaaaag aaggtgtgaa gttgtgtacc atgggtccag gaaaagtgtt 601 tggggaattg gctattcttt acaactgtac ccggacagcg accgtcaaga ctcttgtaaa 661 tgtaaaactc tgggccattg atcgacaatg ttttcaaaca ataatgatga ggacaggact 721 catcaagcat accgagtata tggaattttt aaaaagcgtt ccaacattcc agagccttcc 781 tgaagagatc ctcagcaagc ttgctgatgt ccttgaagag acccactatg aaaatggaga 841 atatattatc aggcaaggtg caagagggga caccttcttt atcatcagca aaggaacggt 901 aaatgtcact cgtgaagact caccgagtga agacccagtc tttcttagaa ctttaggaaa 961 aggagactgg tttggagaga aagccttgca gggggaagat gtgagaacag caaacgtaat 1021 tgctgcagaa gctgtaacct gccttgtgat tgacagagac tcttttaaac atttgattgg 1081 agggctggat gatgtttcta ataaagcata tgaagatgca gaagctaaag caaaatatga 1141 agctgaagcg gctttcttcg ccaacctgaa gctgtctgat ttcaacatca ttgataccct 1201 tggagttgga ggtttcggac gagtagaact ggtccagttg aaaagtgaag aatccaaaac 1261 gtttgcaatg aagattctca agaaacgtca cattgtggac acaagacagc aggagcacat 1321 ccgctcagag aagcagatca tgcagggggc tcattccgat ttcatagtga gactgtacag 1381 aacatttaag gacagcaaat atttgtatat gttgatggaa gcttgtctag gtggagagct 1441 ctggaccatt ctcagggata gaggttcgtt tgaagattct acaaccagat tttacacagc 1501 atgtgtggta gaagcttttg cctatctgca ttccaaagga atcatttaca gggacctcaa 1561 gccagaaaat ctcatcctag atcaccgagg ttatgccaaa ctggttgatt ttggctttgc 1621 aaagaaaata ggatttggaa agaaaacatg gactttttgt gggactccag agtatgtagc 1681 cccagagatc atcctgaaca aaggccatga catttcagcc gactactggt cactgggaat 1741 cctaatgtat gaactcctga ctggcagccc acctttctca ggcccagatc ctatgaaaac 1801 ctataacatc atattgaggg ggattgacat gatagaattt ccaaagaaga ttgccaaaaa 1861 tgctgctaat ttaattaaaa aactatgcag ggacaatcca tcagaaagat tagggaattt 1921 gaaaaatgga gtaaaagaca ttcaaaagca caaatggttt gagggcttta actgggaagg 1981 cttaagaaaa ggtaccttga cacctcctat aataccaagt gttgcatcac ccacagacac 2041 aagtaatttt gacagtttcc ctgaggacaa cgatgaacca ccacctgatg acaactcagg 2101 atgggatata gacttctaat gtatttctct tacctgcttc tgccttgctg aagacagctt 2161 tttctgagac acagctgcca gcaaacctga gggaaagaga gaagattagt gctcggggtc 2221 accatgatgc ctttgatcga tgctgctcca gtaactacag tggcattagg acttatcgct 2281 tagatgacaa tagtgctctt tacatgtttt ctgtttgaac ctaaaatagc agttgacatg 2341 gtggtcctga agcaaagcct ttcaccagta aagagatgtt ttctattgtt gcaatgacct 2401 tgctttgctc tgattataat ttgaaagact gtaggaaaca cttcaatgta gtataagagt 2461 ctgtaccttg ctggaatatt caagaagatg aaagaataat atattgggta caatagatta 2521 ctatggtaca gaaactgggc tattcccttt cttcaagtga aggctgtggg atctattaca 2581 gctgcaggcc ggtgtatata ccatacaaaa gaggaccaca catctgttgg tcacagagtt 2641 catgtcacac cagtgctaga agtttcatga ttttatttcc cagcagtgct gatgacaaga 2701 ctgaatgtta ccttttcttt ctgacagatt ttaaaaattg atatgataaa agcacaactg 2761 ctatagattc tgctgagacc tctcatagta ggtatatatg agttttcaca gaagactgaa 2821 aaataatgca tgatatttgt ttgttttttt tgataaattg gcatgacaga gtggggaaaa 2881 aaagcaattc acaaaaccat ttcatatttt ttaaaatatt gtgcttaaag atggtcctgg 2941 aagtaaatga ctagcagcca attggtttta cttaacatac cctcaaactg aggcttaaag 3001 tattcccttt tataaaaata aatgcttggg gtagggtgga gtggggaggg attaaaaccc 3061 atccaaaaaa taaataaaaa ctatataggt gctatgtata tctttcatct gtaaatgtca 3121 gtgtctgaac agcaacacaa attcaaatca ttatacgtgt agccagaaac tcaagcattt 3181 tcactaaagt tattaaacca aactcctgtc caatttgact tatacaacat agtcagtcta 3241 gagttgagag acaaaggtaa ttataaacct atttgaacta gcttcttgtc ttaggcctga 3301 accaaaaaac aacaaacaaa caaaaaacaa gaatgaaaaa cagaaataaa agaagtagaa 3361 aagacaaaga aagaaagccc aaagtcaaag ttgttaatat ttacaggttt accagatctg 3421 gaacattact tatttgaggt cagagaacaa aacaagaacc tggccaggtg ttgattacct 3481 tttagtgaat aagctgagtc catatacttg tctaactaag aaagcagtac agaggaaaac 3541 aggaacctga tttttttaaa ataaatttta aataaaatag aattactaca attctgcaat 3601 ttcatactac ctaaaaaaga ctagatttga aaatgtcaag ctgatttact ttattcacat 3661 ggagaaaaga atccacaaat taaactgagt ccttcactgg catgccagtt gactattatt 3721 agctgtcata agtaaccccg // LOCUS HSGGAT 2244 bp RNA PRI 16-NOV-1993 DEFINITION Human mRNA for pancreatic gamma-glutamyltransferase. ACCESSION X60069 S40064 NID g416525 KEYWORDS gamma-glutamyltransferase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2244) AUTHORS Siest,G. TITLE Direct Submission JOURNAL Submitted (04-JUN-1991) G. Siest, Centre du Medicament, 30 rue Lionnois, 54000 Nancy, France REFERENCE 2 (bases 1 to 2244) AUTHORS Courtay,C., Oster,T., Visvikis,A., Diederich,M., Wellman,M. and Siest,G. TITLE Nucleotide sequence of human pancreatic gamma-glutamyltransferase cDNA JOURNAL Unpublished REFERENCE 3 (bases 1 to 2244) AUTHORS Courtay,C., Oster,T., Michelet,F., Visvikis,A., Diederich,M., Wellman,M. and Siest,G. TITLE Gamma-glutamyltransferase: nucleotide sequence of the human pancreatic cDNA. Evidence for a ubiquitous gamma-glutamyltransferase polypeptide in human tissues JOURNAL Biochem. Pharmacol. 43 (12), 2527-2533 (1992) MEDLINE 92337688 COMMENT See also M24903 & De Meyts E.R., Proc. Natl. Acad. Sci. U.S.A. 85:8840-8844(1988). FEATURES Location/Qualifiers source 1..2244 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="pancreas" /clone="pGGT-C19" /tissue_lib="pancreatic cDNA lgtII" 5'UTR 1..358 mRNA 1..2224 /evidence=experimental CDS 359..2068 /EC_number="2.3.2.2" /codon_start=1 /product="gamma-glutamyltranspeptidase" /db_xref="PID:g416526" /db_xref="SWISS-PROT:P19440" /translation="MKKKLVVLGLLAVVLVLVIVGLCLWLPSASKEPDNHVYTRAAVA ADAKQCSKIGRDALRDGGSAVDAAIAALLCVGLMNAHSMGIGGGLFLTIYNSTTRKAE VINAREVAPRLAFATMFNSSEQSQKGGLSVAVPGEIRGYELAHQRHGRLPWARLFQPS IQLARQGFPVGKGLAAALENKRTVIEQQPVLCEVFCRDRKVLREGERLTLPQLADTYE TLAIEGAQAFYNGSLTAQIVKDIQAAGGIVTAEDLNNYRAELIEHPLNISLGDAVLYM PSAPLSGPVLALILNILKGYNFSRESVESPEQKGLTYHRIVEAFRFAYAKRTLLGDPK FVDVTEVVRNMTSEFFAAQLRAQISDDTTHPISYYKPEFYTPDDGGTAHLSVVAEDGS AVSATSTINLYFGSKVRSPVSGILFNNEMDDFSSPSITNEFGVPPSPANFIQPGKQPL SSMCPTIMVGQDGQVRMVVGAAGGTQITTATALAIIYNLWFGYDVKRAVEEPRLHNQL LPNVTTVERNIDQAVTAALETRHHHTQIASTFIAVVQAIVRTAGGWAAASDSRKGGEP AGY" 3'UTR 2069..2244 polyA_signal 2177..2182 polyA_site 2224 BASE COUNT 468 a 689 c 688 g 399 t ORIGIN 1 cggggcaagt gaggtgctgc cgtcatccag gctggacagt tcagtgattt gcctgaggcc 61 ccacagcaga gttcaactgg agacagagaa accagctaga ggcagaggga ggtaacacgg 121 agtcccccag aaaggtctgg gctgcgcgtg cttcaggtaa cctcccttga ccttcaggag 181 aacgagaagg ctgcctgatc agagagtccc tgaagaagat tctgtggcta caggcttcag 241 cagagtgtga gggagacccc ggttatttcc tcagctattt ccaccaaatc ctcctgtctt 301 tcgtggccaa caccccaggc aaggcttggg gcccccgtct gctgctggac gcagagccat 361 gaagaagaag ttagtggtgc tgggcctgct ggccgtggtc ctggtgctgg tcattgtcgg 421 cctctgtctc tggctgccct cagcctccaa ggaacctgac aaccatgtgt acaccagggc 481 tgccgtggcc gcggatgcca agcagtgctc gaagattggg agggatgcac tgcgggacgg 541 tggctctgcg gtggatgcag ccattgcagc cctgttgtgt gtggggctca tgaatgccca 601 cagcatgggc atcgggggtg gcctcttcct caccatctac aacagcacca cacgaaaagc 661 tgaggtcatc aacgcccgcg aggtggcccc caggctggcc tttgccacca tgttcaacag 721 ctcggagcag tcccagaagg gggggctgtc ggtggcggtg cctggggaga tccgaggcta 781 tgagctggca caccagcggc atgggcggct gccctgggct cgcctcttcc agcccagcat 841 ccagctggcc cgccagggct tccccgtggg caagggcttg gcggcagccc tggaaaacaa 901 gcggaccgtc atcgagcagc agcctgtctt gtgtgaggtg ttctgccggg atagaaaggt 961 gcttcgggag ggggagagac tgaccctgcc gcagctggct gacacctacg agacgctggc 1021 catcgagggt gcccaggcct tctacaacgg cagcctcacg gcccagattg tgaaggacat 1081 ccaggcggcc gggggcattg tgacagctga ggacctgaac aactaccgtg ctgagctgat 1141 cgagcacccg ctgaacatca gcctgggaga cgcggtgctg tacatgccca gtgcgccgct 1201 cagcgggccc gtgctggccc tcatcctcaa catcctcaaa gggtacaact tctcccggga 1261 gagcgtggag agccccgagc agaagggcct gacgtaccac cgcatcgtag aggctttccg 1321 gtttgcctac gccaagagga ccctgcttgg ggaccccaag tttgtggatg tgactgaggt 1381 ggtccgcaac atgacctccg agttcttcgc tgcccagctc cgggcccaga tctctgacga 1441 caccactcac ccgatctcct actacaagcc cgagttctac acgccggatg acgggggcac 1501 tgctcacctg tctgtcgtcg cagaggacgg cagtgctgtg tccgccacca gcaccatcaa 1561 cctctacttt ggctccaagg tccgctcccc ggtcagcggg atcctgttca ataatgaaat 1621 ggacgacttc agctctccca gcatcaccaa cgagtttggg gtacccccct cacctgccaa 1681 tttcatccag ccagggaagc agccgctctc gtccatgtgc ccgacgatca tggtgggcca 1741 ggacggccag gtccggatgg tggtgggagc tgctgggggc acacagatca ccacggccac 1801 tgcactggcc atcatctaca acctctggtt cggctatgac gtgaagcggg ccgtggagga 1861 gccccggctg cacaaccagc ttctgcccaa cgtcacgaca gtggagagaa acattgacca 1921 ggcagtgact gcagccctgg agacccggca ccatcacacc cagatcgcgt ccaccttcat 1981 cgctgtggtg caagccatcg tccgcacggc tggtggctgg gcagctgcct cggactccag 2041 gaaaggcggg gagcctgccg gctactgagt gctccaggag gacaaggctg acaagcaatc 2101 cagggacaag atactcacca ggaccaggaa ggggactctg ggggaccggc ttcccctgtg 2161 agcagcagag cagcacaata aatgaggcca ctgtgccagg tgcctccctg gcctgtctcc 2221 ccacaaaaaa aaaaaaaaaa aaaa // LOCUS HSGHR 4414 bp RNA PRI 27-AUG-1997 DEFINITION Human mRNA for growth hormone receptor. ACCESSION X06562 NID g31737 KEYWORDS growth hormone receptor; transmembrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4414) AUTHORS Leung,D.W., Spencer,S.A., Cachianes,G., Hammonds,R.G., Collins,C., Henzel,W.J., Barnard,R., Waters,M.J. and Wood,W.I. TITLE Growth hormone receptor and serum binding protein: purification, cloning and expression JOURNAL Nature 330 (6148), 537-543 (1987) MEDLINE 88065896 FEATURES Location/Qualifiers source 1..4414 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="liver cDNA in lambda gt10" /clone="ghr.262, ghr.210, ghr.501, ghr.110, ghr.281" CDS 44..1960 /note="growth hormone receptor (AA 1-638)" /codon_start=1 /db_xref="PID:g31738" /db_xref="SWISS-PROT:P10912" /translation="MDLWQLLLTLALAGSSDAFSGSEATAAILSRAPWSLQSVNPGLK TNSSKEPKFTKCRSPERETFSCHWTDEVHHGTKNLGPIQLFYTRRNTQEWTQEWKECP DYVSAGENSCYFNSSFTSIWIPYCIKLTSNGGTVDEKCFSVDEIVQPDPPIALNWTLL NVSLTGIHADIQVRWEAPRNADIQKGWMVLEYELQYKEVNETKWKMMDPILTTSVPVY SLKVDKEYEVRVRSKQRNSGNYGEFSEVLYVTLPQMSQFTCEEDFYFPWLLIIIFGIF GLTVMLFVFLFSKQQRIKMLILPPVPVPKIKGIDPDLLKEGKLEEVNTILAIHDSYKP EFHSDDSWVEFIELDIDEPDEKTEESDTDRLLSSDHEKSHSNLGVKDGDSGRTSCCEP DILETDFNANDIHEGTSEVAQPQRLKGEADLLCLDQKNQNNSPYHDACPATQQPSVIQ AEKNKPQPLPTEGAESTHQAAHIQLSNPSSLSNIDFYAQVSDITPAGSVVLSPGQKNK AGMSQCDMHPEMVSLCQENFLMDNAYFCEADAKKCIPVAPHIKVESHIQPSLNQEDIY ITTESLTTAAGRPGTGEHVPGSEMPVPDYTSIHIVQSPQGLILNATALPLPDKEFLSS CGYVSTDQLNKIMP" variation 601 /note="a is g in ghr.262 and ghr.501" variation 1167 /note="g is u in ghr.501" variation 2479 /note="c is u in ghr.110" misc_feature 4342..4347 /note="pot. polyA signal" polyA_site 4414 /note="polyA site" BASE COUNT 1398 a 872 c 869 g 1275 t ORIGIN 1 ccgcgctctc tgatcagagg cgaagctcgg aggtcctaca ggtatggatc tctggcagct 61 gctgttgacc ttggcactgg caggatcaag tgatgctttt tctggaagtg aggccacagc 121 agctatcctt agcagagcac cctggagtct gcaaagtgtt aatccaggcc taaagacaaa 181 ttcttctaag gagcctaaat tcaccaagtg ccgttcacct gagcgagaga ctttttcatg 241 ccactggaca gatgaggttc atcatggtac aaagaaccta ggacccatac agctgttcta 301 taccagaagg aacactcaag aatggactca agaatggaaa gaatgccctg attatgtttc 361 tgctggggaa aacagctgtt actttaattc atcgtttacc tccatctgga taccttattg 421 tatcaagcta actagcaatg gtggtacagt ggatgaaaag tgtttctctg ttgatgaaat 481 agtgcaacca gatccaccca ttgccctcaa ctggacttta ctgaacgtca gtttaactgg 541 gattcatgca gatatccaag tgagatggga agcaccacgc aatgcagata ttcagaaagg 601 atggatggtt ctggagtatg aacttcaata caaagaagta aatgaaacta aatggaaaat 661 gatggaccct atattgacaa catcagttcc agtgtactca ttgaaagtgg ataaggaata 721 tgaagtgcgt gtgagatcca aacaacgaaa ctctggaaat tatggcgagt tcagtgaggt 781 gctctatgta acacttcctc agatgagcca atttacatgt gaagaagatt tctactttcc 841 atggctctta attattatct ttggaatatt tgggctaaca gtgatgctat ttgtattctt 901 attttctaaa cagcaaagga ttaaaatgct gattctgccc ccagttccag ttccaaagat 961 taaaggaatc gatccagatc tcctcaagga aggaaaatta gaggaggtga acacaatctt 1021 agccattcat gatagctata aacccgaatt ccacagtgat gactcttggg ttgaatttat 1081 tgagctagat attgatgagc cagatgaaaa gactgaggaa tcagacacag acagacttct 1141 aagcagtgac catgagaaat cacatagtaa cctaggggtg aaggatggcg actctggacg 1201 taccagctgt tgtgaacctg acattctgga gactgatttc aatgccaatg acatacatga 1261 gggtacctca gaggttgctc agccacagag gttaaaaggg gaagcagatc tcttatgcct 1321 tgaccagaag aatcaaaata actcacctta tcatgatgct tgccctgcta ctcagcagcc 1381 cagtgttatc caagcagaga aaaacaaacc acaaccactt cctactgaag gagctgagtc 1441 aactcaccaa gctgcccata ttcagctaag caatccaagt tcactgtcaa acatcgactt 1501 ttatgcccag gtgagcgaca ttacaccagc aggtagtgtg gtcctttccc cgggccaaaa 1561 gaataaggca gggatgtccc aatgtgacat gcacccggaa atggtctcac tctgccaaga 1621 aaacttcctt atggacaatg cctacttctg tgaggcagat gccaaaaagt gcatccctgt 1681 ggctcctcac atcaaggttg aatcacacat acagccaagc ttaaaccaag aggacattta 1741 catcaccaca gaaagcctta ccactgctgc tgggaggcct gggacaggag aacatgttcc 1801 aggttctgag atgcctgtcc cagactatac ctccattcat atagtacagt ccccacaggg 1861 cctcatactc aatgcgactg ccttgccctt gcctgacaaa gagtttctct catcatgtgg 1921 ctatgtgagc acagaccaac tgaacaaaat catgccttag cctttctttg gtttcccaag 1981 agctacgtat ttaatagcaa agaattgact ggggcaataa cgtttaagcc aaaacaatgt 2041 ttaaaccttt tttgggggag tgacaggatg gggtatggat tctaaaatgc cttttcccaa 2101 aatgttgaaa tatgatgtta aaaaaataag aagaatgctt aatcagatag atattcctat 2161 tgtgcaatgt aaatatttta aagaattgtg tcagactgtt tagtagcagt gattgtctta 2221 atattgtggg tgttaatttt tgatactaag cattgaatgg ctatgttttt aatgtatagt 2281 aaatcacgct ttttgaaaaa gcgaaaaaat caggtggctt ttgcggttca ggaaaattga 2341 atgcaaacca tagcacaggc taattttttg ttgtttctta aataagaaac ttttttattt 2401 aaaaaactaa aaactagagg tgagaaattt aaactataag caagaaggca aaaatagttt 2461 ggatatgtaa aacatttact ttgacataaa gttgataaag attttttaat aatttagact 2521 tcaagcatgg ctattttata ttacactaca cactgtgtac tgcagttggt atgacccctc 2581 taaggagtgt agcaactaca gtctaaagct ggtttaatgt tttggccaat gcacctaaag 2641 aaaaacaaac tcgtttttta caaagccctt ttatacctcc ccagactcct tcaacaattc 2701 taaaatgatt gtagtaatct gcattattgg aatataattg ttttatctga atttttaaac 2761 aagtatttgt taatttagaa aactttaaag cgtttgcaca gatcaactta ccaggcacca 2821 aaagaagtaa aagcaaaaaa gaaaaccttt cttcaccaaa tcttggttga tgccaaaaaa 2881 aaatacatgc taagagaagt agaaatcata gctggttcac actgaccaag atacttaagt 2941 gctgcaattg cacgcggagt gagtttttta gtgcgtgcag atggtgagag ataagatcta 3001 tagcctctgc agcggaatct gttcacaccc aacttggttt tgctacataa ttatccagga 3061 agggaataag gtacaagaag cattttgtaa gttgaagcaa atcgaatgaa attaactggg 3121 taatgaaaca aagagttcaa gaaataagtt tttgtttcac agcctataac cagacacata 3181 ctcatttttc atgataatga acagaacata gacagaagaa acaaggtttt cagtccccac 3241 agataactga aaattattta aaccgctaaa agaaactttc tttctcacta aatcttttat 3301 aggatttatt taaaatagca aaagaagaag tttcatcatt ttttacttcc tctctgagtg 3361 gactggcctc aaagcaagca ttcagaagaa aaagaagcaa cctcagtaat ttagaaatca 3421 ttttgcaatc ccttaatatc ctaaacatca ttcatttttg ttgttgttgt tgttgttgag 3481 acagagtctc gctctgtcgc caggctagag tgcggtggcg cgatcttgac tcactgcaat 3541 ctccacctcc cacaggttca ggcgattccc gtgcctcagc ctcctgagta gctgggacta 3601 caggcacgca ccaccatgcc aggctaattt ttttgtattt tagcagagac ggggtttcac 3661 catgttggcc aggatggtct cgagtctcct gacctcgtga tccacccgac tcggcctccc 3721 aaagtgctgg gattacaggt gtaagccacc gtgcccagcc ctaaacatca ttcttgagag 3781 cattgggata tctcctgaaa aggtttatga aaaagaagaa tctcatctca gtgaagaata 3841 cttctcattt tttaaaaaag cttaaaactt tgaagttagc tttaacttaa atagtatttc 3901 ccatttatcg cagacctttt ttaggaagca agcttaatgg ctgataattt taaattctct 3961 ctcttgcagg aaggactatg aaaagctaga attgagtgtt taaagttcaa catgttattt 4021 gtaatagatg tttgatagat tttctgctac tttgctgcta tggttttctc caagagctac 4081 ataatttagt ttcatataaa gtatcatcag tgtagaacct aattcaattc aaagctgtgt 4141 gtttggaaga ctatcttact atttcacaac agcctgacaa catttctata gccaaaaata 4201 gctaaatacc tcaatcagtc tcagaatgtc attttggtac tttggtggcc acataagcca 4261 ttattcacta gtatgactag ttgtgtctgg cagtttatat ttaactctct ttatgtctgt 4321 ggattttttc cttcaaagtt taataaattt attttcttgg attcctgata atgtgcttct 4381 gttatcaaac accaacataa aaatgatcta aacc // LOCUS HSGIPDE1P 3339 bp RNA PRI 28-OCT-1996 DEFINITION H.sapiens mRNA for cyclic nucleotide phosphodiesterase. ACCESSION X95520 NID g1246754 KEYWORDS cGIPDE1 gene; cyclic nucleotide phosphodiesterase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3339) AUTHORS Lobbert,R.W., Winterpacht,A., Seipel,B. and Zabel,B.U. TITLE Molecular cloning and chromosomal assignment of the human homologue of the rat cGMP-inhibited phosphodiesterase 1 (PDE3A)--a gene involved in fat metabolism located at 11p 15.1 JOURNAL Genomics 37 (2), 211-218 (1996) MEDLINE 97079687 REFERENCE 2 (bases 1 to 3339) AUTHORS Loebbert,R.W. TITLE Direct Submission JOURNAL Submitted (01-FEB-1996) R.W. Loebbert, University of Mainz, Department of Pediatrics, Langenbeckstrasse 1, D-55101 Mainz, FRG FEATURES Location/Qualifiers source 1..3339 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="adipocytes" /chromosome="11" /map="p15.1-2" gene 1..3339 /gene="cGIPDE1" CDS 1..3339 /gene="cGIPDE1" /codon_start=1 /product="cyclic nucleotide phosphodiesterase" /db_xref="PID:e221327" /db_xref="PID:g1246755" /translation="MRRDERDAKAMRSLQPPDGAGSPPESLRNGYVKSCVSPLRQDPP RGFFFHLCRFCNVELRPPPASPQQPRRCSPFCRARLSLGALAVFVLALLLGAEPESWA AGAAWLRTLLSVCSHSLSPLFSIACAFFFLTCFLTRTKRGPGPGRSCGSWWLLALPAC CYLGDFLVWQWWSWPWGDGDAGSAAPHTPPEAAAGRLLLVLSCVGLLLTLAHPLRLRH CVLVLLLASFVWWVSFTSLGSLPSALRPLLSGLVGGAGCLLALGLDHFFQIREAPLHP RLSSAAEEKVPVIRPRRRSSCVSLGETAASYYGSCKIFRRPSLPCISREQMILWDWDL KQWYKPHYQNSGGGNGVDLSVLNEARNMVSDLLTDPSLPPQVISSLRSISSLMGAFSG SCRPKINPLTPFPGFYPCSEIEDPAEKGDRKLNKGLNRNSLPTPQLRRSSGTSGLLPV EQSSRWDRNNGKRPHQEFGISSQGCYLNGPFNSNLLTIPKQRSSSVSLTHHVGLRRAG VLSSLSPVNSSNHGPVSTGSLTNRSPIEFPDTADFLNKPSVILQRSLGNAPNTPDFYQ QLRNSDSNLCNSCGHQMLKYVSTSESDGTDCCSGKSGEEENIFSKESFKLMETQQEEE TEKKDSRKLFQEGDKWLTEEAQSEQQTNIEQEVSLDLILVEEYDSLIEKMSNWNFPIF ELVEKMGEKSGRILSQVMYTLFQDTGLLEIFKIPTQQFMNYFRALENGYRDIPYHNRI HATDVLHAVWYLTTRPVPGLQQIHNGCGTGNETDSDGRINHGRIAYISSKSCSNPDES YGCLSSNIPALELMALYVAAAMHDYDHPGRTNAFLVATNAPQAVLYNDRSVLENHHAA SAWNLYLSRPEYNFLLHLDHVEFKRFRFLVIEAILATDLKKHFDFLAEFNAKANDVNS NGIEWSNENDRLLVCQVCIKLADINGPAKVRDLHLKWTEGIVNEFYEQGDEEANLGLP ISPFMDRSSPQLAKLQESFITHIVGPLCNSYDAAGLLPGQWLEAEEDNDTESGDDEDG EELDTEDEEMENNLNPKPPRRKSRRRIFCQLMHHLTENHKIWKEIVEEEEKCKADGNK LQVENSSLPQADEIQVIEEADEEE" BASE COUNT 879 a 783 c 839 g 838 t ORIGIN 1 atgaggaggg acgagcgaga cgccaaagcc atgcggtccc tgcagccgcc ggatggggcc 61 ggctcgcccc ccgagagtct gaggaacggc tacgtgaaga gctgcgtgag ccccttgcgg 121 caggaccctc cgcgcggctt cttcttccac ctctgccgct tctgcaacgt ggagctgcgg 181 ccgccgccgg cctctcccca gcagccgcgg cgctgctccc ccttctgccg ggcgcgcctc 241 tcgctgggcg ccctggctgt ctttgtcctc gccctgctgc tgggcgcgga acccgagagc 301 tgggctgccg gggccgcctg gctgcggacg ctgctgagcg tgtgttcgca cagcttgagc 361 cccctcttca gcatcgcctg tgccttcttc ttcctcacct gcttcctcac ccggaccaag 421 cggggacccg gcccgggccg gagctgcggc tcctggtggc tgctggcgct gcccgcctgc 481 tgttacctgg gggacttctt ggtgtggcag tggtggtctt ggccttgggg ggatggcgac 541 gcagggtccg cggccccgca cacgcccccg gaggcggcag cgggcaggtt gctgctggtg 601 ctgagctgcg tagggctgct gctgacgctc gcgcacccgc tgcggctccg gcactgcgtt 661 ctggtgctgc tcctggccag cttcgtctgg tgggtctcct tcaccagcct cgggtcgctg 721 ccctccgccc tcaggccgct gctctccggc ctggtggggg gcgctggctg cctgctggcc 781 ctggggttgg atcacttctt tcaaatcagg gaagcgcctc ttcatcctcg actgtccagt 841 gccgccgaag aaaaagtgcc tgtgatccga ccccggagga ggtccagctg cgtgtcgtta 901 ggagaaactg cagccagtta ctatggcagt tgcaaaatat tcaggagacc gtcgttgcct 961 tgtatttcca gagaacagat gattctttgg gattgggact taaaacaatg gtataagcct 1021 cattatcaaa attctggagg tggaaatgga gttgatcttt cagtgctaaa tgaggctcgc 1081 aatatggtgt cagatcttct gactgatcca agccttccac cacaagtcat ttcctctcta 1141 cggagtatta gtagcttaat gggtgctttc tcaggttcct gtaggccaaa gattaatcct 1201 ctcacaccat ttcctggatt ttacccctgt tctgaaatag aggacccagc tgagaaaggg 1261 gatagaaaac ttaacaaggg actaaatagg aatagtttgc caactccaca gctgaggaga 1321 agctcaggaa cttcaggatt gctacctgtt gaacagtctt caaggtggga tcgtaataat 1381 ggcaaaaggc ctcaccaaga atttggcatt tcaagtcaag gatgctatct aaatgggcct 1441 tttaattcaa atctactgac tatcccgaag caaaggtcat cttctgtatc actgactcac 1501 catgtaggtc tcagaagagc tggtgttttg tccagtctga gtcctgtgaa ttcttccaac 1561 catggaccag tgtctactgg ctctctaact aatcgatcac ccatagaatt tcctgatact 1621 gctgattttc ttaataagcc aagcgttatc ttgcagagat ctctgggcaa tgcacctaat 1681 actccagatt tttatcagca acttagaaat tctgatagca atctgtgtaa cagctgtgga 1741 catcaaatgc tgaaatatgt ttcaacatct gaatcagatg gtacagattg ctgcagtgga 1801 aaatcaggtg aagaagaaaa cattttctcg aaagaatcat tcaaacttat ggaaactcaa 1861 caagaagagg aaacagagaa gaaagacagc agaaaattat ttcaggaagg tgataagtgg 1921 ctaacagaag aggcacagag tgaacagcaa acaaatattg aacaggaagt atcactggac 1981 ctgattttag tagaagagta tgactcatta atagaaaaga tgagcaactg gaattttcca 2041 atttttgaac ttgtagaaaa gatgggagag aaatcaggaa ggattctcag tcaggttatg 2101 tataccttat ttcaagacac tggtttattg gaaatattta aaattcccac tcaacaattt 2161 atgaactatt ttcgtgcatt agaaaatggc tatcgagaca ttccttatca caatcgtata 2221 catgccacag atgtgctaca tgcagtttgg tatctgacaa cgcggccagt tcctggctta 2281 cagcagatcc acaatggttg tggaacagga aatgaaacag attctgatgg tagaattaac 2341 catgggcgaa ttgcttatat ttcttcgaag agctgctcta atcctgatga gagttatggc 2401 tgcctgtctt caaacattcc tgcattagaa ttgatggctc tatacgtggc agctgccatg 2461 catgattatg atcacccagg gaggacaaat gcatttctag tggctacaaa tgcccctcag 2521 gcagttttat acaatgacag atctgttctg gaaaatcatc atgctgcgtc agcttggaat 2581 ctatatcttt ctcgcccaga atacaacttc cttcttcatc ttgatcatgt ggaattcaag 2641 cgctttcgtt ttttagtcat tgaagcaatc cttgctacgg atcttaaaaa gcattttgat 2701 tttctcgcag aattcaatgc caaggcaaat gatgtaaata gtaatggcat agaatggagt 2761 aatgaaaatg atcgcctctt ggtatgccag gtgtgcatca aactggcaga tataaatggc 2821 ccagcaaaag ttcgagactt gcatttgaaa tggacagaag gcattgtcaa tgaattttat 2881 gagcagggag atgaagaagc aaatcttggt ctgcccatca gtccattcat ggatcgttct 2941 tctcctcaac tagcaaaact ccaagaatct tttatcaccc acatagtggg tcccctgtgt 3001 aactcctatg atgctgctgg tttgctacca ggtcagtggt tagaagcaga agaggataat 3061 gatactgaaa gtggtgatga tgaagacggt gaagaattag atacagaaga tgaagaaatg 3121 gaaaacaatc taaatccaaa accaccaaga aggaaaagca gacggcgaat attttgtcag 3181 ctaatgcacc acctcactga aaaccacaag atatggaagg aaatcgtaga ggaagaagaa 3241 aaatgtaaag ctgatgggaa taaactgcag gtggagaatt cctccttacc tcaagcagat 3301 gagattcagg taattgaaga ggcagatgaa gaggaatag // LOCUS HSGIR 1702 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for G(i) protein alpha-subunit (adenylate cyclase inhibiting GTP-binding protein). ACCESSION X04828 NID g31743 KEYWORDS adenylate cyclase-inhibiting GTP-binding protein; G protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1702) AUTHORS Didsbury,J.R., Ho,Y.S. and Snyderman,R. TITLE Human Gi protein alpha-subunit: deduction of amino acid structure from a cloned cDNA JOURNAL FEBS Lett. 211 (2), 160-164 (1987) MEDLINE 87105966 COMMENT The poly(A) site was determined 38 bp 3' to the 3' terminus of pJD43. Four pot. GTP binding and hydrolysis sites are found at the AA pos. 32-49, 200-206, 222-230 and 265-278. Data kindly reviewed (24-APR-1987) by J. Didsbury. FEATURES Location/Qualifiers source 1..1702 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="U937" /clone="pJD43." CDS 124..1191 /note="G protein alpha-subunit (AA 1-355)" /codon_start=1 /db_xref="PID:g31744" /db_xref="SWISS-PROT:P04899" /translation="MGCTVSAEDKAAAERSKMIDKNLREDGEKAAREVKLLLLGAGES GKSTIVKQMKIIHEDGYSEEECRQYRAVVYSNTIQSIMAIVKAMGNLQIDFADPSRAD DARQLFALSCTAEEQGVLPDDLSGVIRRLWADHGVQACFGRSREYQLNDSAAYYLNDL ERIAQSDYIPTQQDVLRTRVKTTGIVETHFTFKDLHFKMFDVGGQRSERKKWIHCFEG VTAIIFCVALSAYDLVLAEDEEMNRMHESMKLFDSICNNKWFTDTSIILFLNKKDLFE EKITHSPLTICFPEYTGANKYDEAASYIQSKFEDLNKRKDTKEIYTHFTCATDTKNVQ FVFDAVTDVIIKNNLKDCGLF" BASE COUNT 375 a 504 c 506 g 317 t ORIGIN 1 ccggcagtcc cgagtgcttc ccgcagaggg ctggtggtgg gagcggagtg gagtcgggcg 61 gggccgaagc cgggccgtgg gcgtagatgg gggccgggcg gcggcggagc ggcggaacgc 121 gggatgggct gcaccgtgag cgccgaggac aaggcggcgg ccgagcgctc taagatgatc 181 gacaagaacc tgcgggagga cggagagaag gcggcgcggg aggtgaagtt gctgctgttg 241 ggtgctgggg agtcagggaa gagcaccatc gtcaagcaga tgaagatcat ccacgaggat 301 ggctactccg aggaggaatg ccggcagtac cgggcggttg tctacagcaa caccatccag 361 tccatcatgg ccattgtcaa agccatggga aacctgcaga tcgactttgc cgacccctcc 421 agagcggacg acgccaggca gctatttgca ctgtcctgca ccgccgagga gcaaggcgtg 481 ctccctgatg acctgtccgg cgtcatccgg aggctctggg ctgaccatgg tgtgcaggcc 541 tgctttggcc gctcaaggga ataccagctc aacgactcag ctgcctacta cctgaacgac 601 ctggagcgta ttgcacagag tgactacatc cccacacagc aagatgtgct acggacccgc 661 gtaaagacca cggggatcgt ggagacacac ttcaccttca aggacctaca cttcaagatg 721 tttgatgtgg gtggtcagcg gtctgagcgg aagaagtgga tccactgctt tgagggcgtc 781 acagccatca tcttctgcgt agccttgagc gcctatgact tggtgctagc tgaggacgag 841 gagatgaacc gcatgcatga gagcatgaag ctattcgata gcatctgcaa caacaagtgg 901 ttcacagaca cgtccatcat cctcttcctc aacaagaagg acctgtttga ggagaagatc 961 acacacagtc ccctgaccat ctgcttccct gagtacacag gggccaacaa atatgatgag 1021 gcagccagct acatccagag taagtttgag gacctgaata agcgcaaaga caccaaggag 1081 atctacacgc acttcacgtg cgccaccgac accaagaacg tgcagttcgt gtttgacgcc 1141 gtcaccgatg tcatcatcaa gaacaacctg aaggactgcg gcctcttctg aggggcagcg 1201 gggcctggcg ggatgggcca ccgccgaatt tgtacccccc aacccctgag gaagatgggg 1261 gcaagaagat cacgctcccc gcctgttccc ccgccgcttt tctcctcttt cctctctttg 1321 ttctcagctc cccctgtccc ctcagctcca aacgtagggg aggggttcgc acaggcctcc 1381 ctgtttgaag cctgcccttg tctgagatgc tggtaatggc catggtaccc ccttctgggc 1441 atctgttctg gtttttaacc attgtcttgt tctgtgatga ggggaggggg gcacatgctg 1501 agtctcccaa ggctgcgtct ggaggggccc ctgcttctcc agcctggacc cccagctttg 1561 cccaacacca gcccctgccc cagcccaagt ccaaatgttt acgggagcct cctgcccagt 1621 cccccaaccc cagccgctcg gaggccccaa aggaaaaagc acaagaagcg tgagacgcca 1681 ccattcctgg aaaccacagt cc // LOCUS HSGKTS1 1838 bp RNA PRI 08-SEP-1994 DEFINITION H.sapiens mRNA for glycerol kinase testis specific 1. ACCESSION X78711 NID g515028 KEYWORDS glycerol kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1838) AUTHORS Sargent,C.A., Young,C., Ferguson-Smith,M.A. and Affara,N.A. TITLE The glycerol kinase gene family: structure of the Xp gene and related intronless retroposons JOURNAL Unpublished REFERENCE 2 (bases 1 to 1838) AUTHORS Sargent,C.A. TITLE Direct Submission JOURNAL Submitted (07-APR-1994) C.A. Sargent, University of Cambridge, Dept of Pathology, Tennis Court Road, Cambridge CB2 1QP, UK REFERENCE 3 (bases 1 to 1838) AUTHORS Sargent,C.A., Young,C., Marsh,S., Ferguson-Smith,M.A. and Affara,N.A. TITLE The glycerol kinase gene family: structure of the Xp gene, and related intronless retroposons JOURNAL Hum. Mol. Genet. 3 (8), 1317-1324 (1994) MEDLINE 95078834 FEATURES Location/Qualifiers source 1..1838 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="testis" /clone_lib="Clonetech HL1010B" /chromosome="4" /map="4q13, 4q32" CDS 27..1688 /standard_name="glycerol kinase testis specific 1" /codon_start=1 /product="glycerol kinase" /db_xref="PID:g515029" /translation="MAASKKAVLGPLVGAVDQGTSSTRFLVFNSRTAELLSHHQVEIK QEFPREGWVEQDPKEILHSVYECIEKTCEKLGQLNIGISNIKAIGVSNQRETTVAWDK ITGEPLYNAVVWLDLRTQSTVESLSKRIPGNNNFVKSKTGLPLSTYFSAVKLRWLLDN VRKVQKAVEEKRALFGTIDSWLIWSLTGGVNGGVHCTDVTNASRTMLFNIHSLEWDKQ LCEFFGIPMEILPHVRSSSEIYGLMKAGALEGVPISGCLGDQSAALVGQMCFQIGQAK NTYGTGCFLLCNTGHKCVFSDHGLLTTVAYKLGRDKPVYYALEGSVAIAGAVIRWLRD NLGIIKTSEEIEKLAKEVGTSYGCYFVPAFSGLYAPYWEPSARGIICGLTQFTNKCHI AFAALEAVCFQTREILDAMNRDCGIPLSHLQVDGGMTSNKILMQLQADILYIPVVKPL MPETTALGAAMAAGAAEGVDVWSLEPEDLSAVTMERFEPQINAEESEIRYSTWKKAVM KSMGWVTTQSPEGGDPSVFCSLPLGFFIVSSMAMLIGARYISGIP" BASE COUNT 523 a 362 c 449 g 504 t ORIGIN 1 cgcgggcgga ccatgaagct ggtttcatgg cagcctcaaa gaaggcagtt ttggggccat 61 tggtgggggc agtggaccaa ggcaccagtt ccacgcgctt tttggttttc aattcaagaa 121 cagctgaact acttagtcat catcaagtgg aaataaaaca agagttccca agagaaggat 181 gggtggaaca ggaccctaag gaaattctac attctgtcta tgagtgtata gagaaaacat 241 gtgagaaact tggacagctc aatattggta tttccaacat aaaagctatt ggtgtcagca 301 accagaggga aaccaccgta gcctgggaca agataactgg agagcctctc tacaatgctg 361 tagtgtggct tgatctaaga acacagtcta ccgttgagag tcttagtaaa agaattccag 421 gaaataataa ctttgtcaag tccaagacag gccttccact tagcacttac ttcagtgcag 481 tgaaacttcg ctggctcctc gacaatgtga gaaaagttca aaaggccgtt gaagaaaaac 541 gagctctttt tgggactatt gattcatggc ttatttggag tttgacagga ggcgtcaatg 601 gaggtgtcca ctgtacagat gtaacaaatg caagtaggac tatgcttttc aacattcatt 661 ctttggaatg ggataaacaa ctctgtgaat tttttggaat tccaatggaa attcttccac 721 atgttcggag ttcttctgag atctatggcc taatgaaagc gggggccttg gaaggtgtgc 781 caatatctgg gtgtttaggg gaccagtctg ctgcactggt gggacaaatg tgcttccaga 841 ttggacaagc caaaaatacg tatggaacag gatgtttctt actatgtaat acaggccata 901 agtgtgtatt ttctgatcat ggccttctca ccacagtggc ttacaaactt ggcagagaca 961 aaccggtata ttacgctttg gaaggttctg tagctatagc tggtgctgtt attcgctggc 1021 taagagacaa tcttggaatt ataaagacct cagaagaaat tgaaaaactt gctaaagaag 1081 taggtacttc ttatggctgc tacttcgtcc cagcattttc ggggttatat gcaccttatt 1141 gggagcccag cgcaagaggg ataatctgtg gactcactca attcacgaat aaatgccata 1201 ttgcttttgc tgcattagaa gctgtttgtt tccaaactcg agagattttg gatgccatga 1261 atcgagactg tggaattcca ctcagtcatt tgcaggttga tggaggaatg accagcaaca 1321 aaattcttat gcagctacaa gcagacattc tgtatattcc agtagtgaag cccttgatgc 1381 ccgaaaccac tgcactgggt gctgccatgg cggcaggggc tgcagaagga gtcgacgtat 1441 ggagtcttga acctgaggat ttgtccgccg tcacgatgga gcggtttgaa cctcagatta 1501 atgctgagga aagtgaaatt cgttattcta catggaagaa agctgtgatg aagtcaatgg 1561 gttgggttac aactcaatct ccagaaggtg gtgaccctag tgtcttctgt agtctgccct 1621 tgggcttttt tatagtgagt agcatggcaa tgttaatcgg agcaaggtac atctcaggta 1681 ttccataaaa cctaccaact catggattcc caagatgcga gctttttaca taatgaaaga 1741 acaacccagc aattgtctct taatgcgatg acactattca tagactttga ttttatttat 1801 aagccacttg ctgcgtgacc ctccaagtag acctgtgg // LOCUS HSGLAD2A 1803 bp RNA PRI 16-MAR-1993 DEFINITION H.sapiens mRNA for glutamate decarboxylase. ACCESSION X69936 NID g31757 KEYWORDS autoantigen; glutamate decarboxylase; glutamic acid decarboxylase; histidine-hexapeptide fusion protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1803) AUTHORS Northemann,W. TITLE Direct Submission JOURNAL Submitted (23-DEC-1992) W. Northemann, Elias Entwicklungslabor, Department of molecular Biology, Obere Hardtstrasse 18, D-7800 Freiburg, FRG REFERENCE 2 (bases 1 to 1803) AUTHORS Mauch,L., Abney,C.C., Berg,H., Scherbaum,W.A., Liedvogel,B. and Northemann,W. TITLE Characterization of a linear epitope within the human pancreatic 64-kDa glutamic acid decarboxylase and its autoimmune recognition by sera from insulin-dependent diabetes mellitus patients JOURNAL Eur. J. Biochem. 212 (2), 597-603 (1993) MEDLINE 93185681 FEATURES Location/Qualifiers source 1..1803 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="pancreatic carcinoma" gene 1..1803 /gene="GAD2" CDS 1..1803 /gene="GAD2" /EC_number="4.1.1.15" /note="fusion protein" /codon_start=1 /product="glutamate decarboxylase" /db_xref="PID:g31758" /db_xref="SWISS-PROT:Q05329" /translation="MSPIHHHHHHLVPRGSEASNSGFWSFGSEDGSGDSENPGTARAW CQVAQKFTGGIGNKLCALLYGDAEKPAESGGSQPPRAAARKAACACDQKPCSCSKVDV NYAFLHATDLLPACDGERPTLAFLQDVMNILLQYVVKSFDRSTKVIDFHYPNELLQEY NWELADQPQNLEEILMHCQTTLKYAIKTGHPRYFNQLSTGLDMVGLAADWLTSTANTN MFTYEIAPVFVLLEYVTLKKMREIIGWPGGSGDGIFSPGGAISNMYAMMIARFKMFPE VKEKGMAALPRLIAFTSEHSHFSLKKGAAALGIGTDSVILIKCDERGKMIPSDLERRI LEAKQKGFVPFLVSATAGTTVYGAFDPLLAVADICKKYKIWMHVDAAWGGGLLMSRKH KWKLSGVERANSVTWNPHKMMGVPLQCSALLVREEGLMQNCNQMHASYLFQQDKHYDL SYDTGDKALQCGRHVDVFKLWLMWRAKGTTGFEAHVDKCLELAEYLYNIIKNREGYEM VFDGKPQHTNVCFWYIPPSLRTLEDNEERMSRLSKVAPVIKARMMEYGTTMVSYQPLG DKVNFFRMVISNPAATHQDIDFLIEEIERLGQDL" misc_feature 13..30 /gene="GAD2" /note="histidine-hexapeptide" misc_feature 61..1803 /gene="GAD2" /note="glutamate decarboxylase" BASE COUNT 495 a 404 c 463 g 441 t ORIGIN 1 atgtccccta tacatcacca tcaccatcac ctggttccgc gtggatccga agcttcgaat 61 tctggctttt ggtctttcgg gtcggaagat ggctctgggg attccgagaa tcccggcaca 121 gcgcgagcct ggtgccaagt ggctcagaag ttcacgggcg gcatcggaaa caaactgtgc 181 gccctgctct acggagacgc cgagaagccg gcggagagcg gcgggagcca acccccgcgg 241 gccgccgccc ggaaggccgc ctgcgcctgc gaccagaagc cctgcagctg ctccaaagtg 301 gatgtcaact acgcgtttct ccatgcaaca gacctgctgc cggcgtgtga tggagaaagg 361 cccactttgg cgtttctgca agatgttatg aacattttac ttcagtatgt ggtgaaaagt 421 ttcgatagat caaccaaagt gattgatttc cattatccta atgagcttct ccaagaatat 481 aattgggaat tggcagacca accacaaaat ttggaggaaa ttttgatgca ttgccaaaca 541 actctaaaat atgcaattaa aacagggcat cctagatact tcaatcaact ttctactggt 601 ttggatatgg ttggattagc agcagactgg ctgacatcaa cagcaaatac taacatgttc 661 acctatgaaa ttgctccagt atttgtgctt ttggaatatg tcacactaaa gaaaatgaga 721 gaaatcattg gctggccagg gggctctggc gatgggatat tttctcccgg tggcgccata 781 tctaacatgt atgccatgat gatcgcacgc tttaagatgt tcccagaagt caaggagaaa 841 ggaatggctg ctcttcccag gctcattgcc ttcacgtctg aacatagtca tttttctctc 901 aagaagggag ctgcagcctt agggattgga acagacagcg tgattctgat taaatgtgat 961 gagagaggga aaatgattcc atctgatctt gaaagaagga ttcttgaagc caaacagaaa 1021 gggtttgttc ctttcctcgt gagtgccaca gctggaacca ccgtgtacgg agcatttgac 1081 cccctcttag ctgtcgctga catttgcaaa aagtataaga tctggatgca tgtggatgca 1141 gcttggggtg ggggattact gatgtcccga aaacacaagt ggaaactgag tggcgtggag 1201 agggccaact ctgtgacgtg gaatccacac aagatgatgg gagtcccttt gcagtgctct 1261 gctctcctgg ttagagaaga gggattgatg cagaattgca accaaatgca tgcctcctac 1321 ctctttcagc aagataaaca ttatgacctg tcctatgaca ctggagacaa ggccttacag 1381 tgcggacgcc acgttgatgt ttttaaacta tggctgatgt ggagggcaaa ggggactacc 1441 gggtttgaag cgcatgttga taaatgtttg gagttggcag agtatttata caacatcata 1501 aaaaaccgag aaggatatga gatggtgttt gatgggaagc ctcagcacac aaatgtctgc 1561 ttctggtaca ttcctccaag cttgcgtact ctggaagaca atgaagagag aatgagtcgc 1621 ctctcgaagg tggctccagt gattaaagcc agaatgatgg agtatggaac cacaatggtc 1681 agctaccaac ccttgggaga caaggtcaat ttcttccgca tggtcatctc aaacccagcg 1741 gcaactcacc aagacattga cttcctgatt gaagaaatag aacgccttgg acaagattta 1801 taa // LOCUS HSGLC 1573 bp RNA PRI 04-MAY-1995 DEFINITION H.sapiens mRNA for glutamine cyclotransferase. ACCESSION X71125 NID g398375 KEYWORDS glutamine cyclotransferase; post-translational modification. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1140) AUTHORS Song,I., Chuang,C.Z. and Bateman,R.C. TITLE Molecular cloning, sequence analysis, and expression of human pituitary glutamine cyclotransferase JOURNAL Unpublished REFERENCE 2 (bases 1 to 1140) AUTHORS Bateman,R.C. TITLE Direct Submission JOURNAL Submitted (26-FEB-1993) R.C. Bateman, University of Southern Mississippi, USM/ Department of Chemistry & Biochemistry, S S Box 5043, Hattiesburg, MS 39406-5043, USA REFERENCE 3 (bases 1 to 1573) AUTHORS Song,I., Chuang,C.Z. and Bateman,R.C. Jr. TITLE Molecular cloning, sequence analysis and expression of human pituitary glutaminyl cyclase JOURNAL J. Mol. Endocrinol. 13 (1), 77-86 (1994) MEDLINE 95092194 FEATURES Location/Qualifiers source 1..1573 /organism="Homo sapiens" /strain="caucasians" /db_xref="taxon:9606" /haplotype="23" /dev_stage="15-83 yrs old pool" /tissue_type="pituitary gland" /clone_lib="H. sapiens pituitary cDNA library from Clontech (cat# HL 1139b)" mRNA <1..>1140 CDS 12..1097 /EC_number="2.3.2.5" /codon_start=1 /product="glutaminyl-peptide cyclotransferase" /db_xref="PID:g296949" /translation="MAGGRHRRVVGTLHLLLLVAALPWASRGVSPSASAWPEEKNYHQ PAILNSSALRQIAEGTSISEMWQNDLQPLLIERYPGSPGSYAARQHIMQRIQRLQADW VLEIDTFLSQTPYGYRSFSNIISTLNPTAKRHLVLACHYDSKYFSHWNNRVFVGATDS AVPCAMMLELARALDKKLLSLKTVSDSKPDLSLQLIFFDGEEAFLHWSPQDSLYGSRH LAAKMASTPHPPGARGTSQLHGMDLLVLLDLIGAPNPTFPNFFPNSARWFERLQAIEH ELHELGLLKDHSLEGRYFQNYSYGGVIQDDHIPFLRRGVPVLHLIPSPFPEVWHTMDD NEENLDESTIDNLNKILQVFVLEYLHL" misc_feature 156..158 /note="N-glycosylation site" misc_feature 897..899 /note="N-glycosylation site" 3'UTR 1098..1556 polyA_site 1556 BASE COUNT 449 a 336 c 334 g 454 t ORIGIN 1 ggctgggaga gatggcaggc ggaagacacc ggcgcgtcgt gggcaccctc cacctgctgc 61 tgctggtggc cgccctgccc tgggcatcca ggggggtcag tccgagtgcc tcagcctggc 121 cagaggagaa gaattaccac cagccagcca ttttgaattc atcggctctt cggcaaattg 181 cagaaggcac cagtatctct gaaatgtggc aaaatgactt acagccattg ctgatagagc 241 gatacccggg atcccctgga agctatgctg ctcgtcagca catcatgcag cgaattcaga 301 ggcttcaggc tgactgggtc ttggaaatag acaccttctt gagtcagaca ccctatgggt 361 accggtcttt ctcaaatatc atcagcaccc tcaatcccac tgctaaacga catttggtcc 421 tcgcctgcca ctatgactcc aagtattttt cccactggaa caacagagtg tttgtaggag 481 ccactgattc agccgtgcca tgtgcaatga tgttggaact tgctcgtgcc ttagacaaga 541 aactcctttc cttaaagact gtttcagact ccaagccaga tttgtcactc cagctgatct 601 tctttgatgg tgaagaggct tttcttcact ggtctcctca agattctctc tatgggtctc 661 gacacttagc tgcaaagatg gcatcgaccc cgcacccacc tggagcgaga ggcaccagcc 721 aactgcatgg catggattta ttggtcttat tggatttgat tggagctcca aacccaacgt 781 ttcccaattt ttttccaaac tcagccaggt ggttcgaaag acttcaagca attgaacatg 841 aacttcatga attgggtttg ctcaaggatc actctttgga ggggcggtat ttccagaatt 901 acagttatgg aggtgtgatt caggatgacc atattccatt tttaagaaga ggtgttccag 961 ttctgcatct gataccgtct cctttccctg aagtctggca caccatggat gacaatgaag 1021 aaaatttgga tgaatcaacc attgacaatc taaacaaaat cctacaagtc tttgtgttgg 1081 aatatcttca tttgtaatac tctgatttag tttaggataa ttggttctag aattgaattc 1141 aaaagtcaag gcatcattta aaataatctg atttcagaca aatgctgtgt ggaaacatct 1201 atcctataga tcatcctatt cttatgtgtc tttggttatc agatcaatta cagaataatt 1261 gtgttgtgat attgtgtcct aaattgctca ttaattttta tttacagatt gaaaaagagg 1321 caccgtgtaa agaaaatggc aaaataaata tctttccaag gatcatcatc acgatagcta 1381 aacagtactt aaatagcggt tggaactagg tagcctttcg aattttatga ttttttcata 1441 tgtggaaatc tattacatgt aatacaaaac aaacatgtag tttgaaggcg gtcagatttc 1501 tttgagaaat ctttgtagag ttaattttat ggaaattaaa atcagaatta aatgctaaaa 1561 aaaaaaaaaa aaa // LOCUS HSGLCNACT 1887 bp RNA PRI 01-JUL-1997 DEFINITION H.sapiens mRNA for GlcNac-1-P transferase. ACCESSION Z82022 NID g2239118 KEYWORDS GlcNAc-1-P transferase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1887) AUTHORS Eckert,V., Mazhari-Tabrizi,R., Blank,M., Mumberg,D., Funk,M. and Schwarz,R. TITLE Cloning and functional expression of the human GlcNac-1-P transferase, the enzyme for the committed step of the dolichol-cycle by heterologous complementation in yeast JOURNAL Unpublished REFERENCE 2 (bases 1 to 1887) AUTHORS Eckert,V. TITLE Direct Submission JOURNAL Submitted (08-NOV-1996) Eckert V., University of Marburg, Medizinisches Zentrum fuer Hygiene, Robert-Koch-Str. 17, Marburg, Germany, D-35037 FEATURES Location/Qualifiers source 1..1887 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hsalg7" /tissue_type="lung" /cell_type="fibroblast" /clone_lib="cDNA library (pRS416-vector)" CDS 104..1306 /codon_start=1 /product="GlcNac-1-P transferase" /db_xref="PID:e280927" /db_xref="PID:g2239119" /translation="MPLLINLIVSLLGFVATVTLIPAFLGHFIAARLCGQDLNKTSRQ QIPESQGVISGAVFLIILFCFIPFPFLNCFVKEQCKAFPHHEFVALIGALLAICCMIF LGFADDVLNLRWRHKLLLPTAASLPLLMVYFTNFGNTTIVVPKPFRPILGLHLDLGIL YYVYMGLLAVFCTNAINILAGINGLEAGQSLVISASIIVFNLVELEGDCRDDHVFSLY FMIPFFFTTLGLLYHNWYPSRVFVGDTFCYFAGMTFAVVGILGHFSKTMLLFFMPQVF NFLYSLPQLLHIIPCPRHRIPRLNIKTGKLEMSYSKFKTKSLSFLGTFILKVAESLQL VTVHQSETEDGEFTECNNMTLINLLLKVLGPIHERNLTLLLLLLQILGSAITFSIRYQ LVRLFYDV" BASE COUNT 465 a 514 c 372 g 536 t ORIGIN 1 aaatactata ctatagctat agcttactat acctaactat agtcttcctc tagaactagt 61 ggatcccccg ggctgcagga attcggcacg agcggaattg cccatgccgc tgctgatcaa 121 tttgatcgtc tcgctgctgg gatttgtggc cacagtcacc ctcatcccgg ccttcctggg 181 ccacttcatt gctgcgcgcc tctgtggtca ggacctcaac aaaaccagcc gacagcagat 241 cccagaatcc cagggagtga tcagcggtgc tgttttcctt atcatcctct tctgcttcat 301 ccctttcccc ttcctgaact gctttgtgaa ggagcagtgt aaggcattcc cccaccatga 361 atttgtggcc ctgataggtg ccctccttgc catctgctgc atgatcttcc tgggctttgc 421 ggatgatgta ctgaatctgc gctggcgcca taagctgctg ctacctacag ctgcctcact 481 acctctcctc atggtctatt tcaccaactt tggcaacacg accattgtgg tgcccaagcc 541 cttccgcccg atacttggcc tgcatctgga cttgggaatc ctgtactatg tctacatggg 601 gctgctggca gtgttctgta ccaatgccat caatatccta gcaggaatta acggcctaga 661 ggctggccag tcactagtca tttctgcttc catcattgtc ttcaacctgg tagagttgga 721 aggtgattgt cgggatgatc atgtcttttc cctctacttc atgataccct tttttttcac 781 cactttggga ttgctctacc acaactggta cccatcacgg gtgtttgtgg gagatacctt 841 ctgttacttt gctggcatga cctttgccgt ggtgggcatc ttgggacact tcagcaagac 901 catgctacta ttcttcatgc cccaggtgtt caacttcctc tactcactgc ctcagctcct 961 gcatatcatc ccctgccctc gccaccgcat acccagactc aatatcaaga caggcaaact 1021 ggagatgagc tattccaagt tcaagaccaa gagcctctct ttcttgggca cctttatttt 1081 aaaggtggca gagagcctcc agctggtgac agtacaccag agtgagactg aagatggtga 1141 attcactgaa tgtaacaaca tgaccctcat caacttgcta cttaaagtcc ttgggcccat 1201 acatgagaga aacctcacat tgctcctgct gctgctgcag atcctgggca gtgccatcac 1261 cttctccatt cgatatcagc tcgttcgact cttctatgat gtctgagtcc cttgatacat 1321 tgtcctttac ctcacagtct ctaggattcc tgactcaggc tgacctctct ctctggtccc 1381 agactgcctc cttgcccatg cctctctcac tcttcatact cctccatatt ttgttctcag 1441 cattttcctt tctctgtgat cattggcatc ctgggcgttt cttgccctct gctgactact 1501 gattggattt tacctatggc tttctgcaac ttgctactct ctccctctcc atcccatctt 1561 tgcagcctct tgggtgggat acagcagctt tttttgcagt tatccacact cacatttcag 1621 agtcctgact ctcaaggaac cactggtttt tgggatagaa cttgggccag ggctaggaac 1681 acaggctcca cggtgacatg tcatttgatt gtaaattaag tgttctgatt agtaagaact 1741 aagcaggggg ccacatgctc tcaatggaga caataaagtg ttgtcttttt cttaaaaaaa 1801 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1861 aaaaaaaaaa aaaaaaaaaa aaaaaaa // LOCUS HSGLI 3600 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for GLI protein. ACCESSION X07384 NID g31767 KEYWORDS GLI protein; zinc finger protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3600) AUTHORS Kinzler,K.W. TITLE Direct Submission JOURNAL Submitted (03-MAY-1988) to the EMBL/GenBank/DDBJ databases REFERENCE 2 (bases 1 to 3600) AUTHORS Kinzler,K.W., Ruppert,J.M., Bigner,S.H. and Vogelstein,B. TITLE The GLI gene is a member of the Kruppel family of zinc finger proteins JOURNAL Nature 332 (6162), 371-374 (1988) MEDLINE 88175051 FEATURES Location/Qualifiers source 1..3600 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="D-259 MG" /clone_lib="lambda phage clones M5E, H31, I2G, J36" CDS 79..3399 /note="GLI protein (AA 1-1106)" /codon_start=1 /db_xref="PID:g31768" /db_xref="SWISS-PROT:P08151" /translation="MFNSMTPPPISSYGEPCCLRPLPSQGAPSVGTEGLSGPPFCHQA NLMSGPHSYGPARETNSCTEGPLFSSPRSAVKLTKKRALSISPLSDASLDLQTVIRTS PSSLVAFINSRCTSPGGSYGHLSIGTMSPSLGFPAQMNHQKGPSPSFGVQPCGPHDSA RGGMIPHPQSRGPFPTCQLKSELDMLVGKCREEPLEGDMSSPNSTGIQDPLLGMLDGR EDLEREEKREPESVYETDCRWDGCSQEFDSQEQLVHHINSEHIHGERKEFVCHWGGCS RELRPFKAQYMLVVHMRRHTGEKPHKCTFEGCRKSYSRLENLKTHLRSHTGEKPYMCE HEGCSKAFSNASDRAKHQNRTHSNEKPYVCKLPGCTKRYTDPSSLRKHVKTVHGPDAH VTKRHRGDGPLPRAPSISTVEPKREREGGPIREESRLTVPEGAMKPQPSPGAQSSCSS DHSPAGSAANTDSGVEMTGNAGGSTEDLSSLDEGPCIAGTGLSTLRRLENLRLDQLHQ LRPIGTRGLKLPSLSHTGTTVSRRVGPPVSLERRSSSSSSISSAYTVSRRSSLASPFP PGSPPENGASSLPGLMPAQHYLLRARYASARGGGTSPTAASSLDRIGGLPMPPWRSRA EYPGYNPNAGVTRRASDPAQAADRPAPARVQRFKSLGCVHTPPTVAGGGQNFDPYLPT SVYSPQPPSITENAAMDARGLQEEPEVGTSMVGSGLNPYMDFPPTDTLGYGGPEGAAA EPYGARGPGSLPLGPGPPTNYGPNPCPQQASYPDPTQETWGEFPSHSGLYPGPKALGG TYSQCPRLEHYGQVQVKPEQGCPVGSDSTGLAPCLNAHPSEGPPHPQPLFSHYPQPSP PQYLQSGPYTQPPPDYLPSEPRPCLDFDSPTHSTGQLKAQLVCNYVQSQQELLWEGGG REDAPAQEPSYQSPKFLGGSQVSPSRAKAPVNTYGPGFGPNLPNHKSGSYPTPSPCHE NFVVGANRASHRAAAPPRLLPPLPTCYGPLKVGGTNPSCGHPEVGRLGGGPALYPPPE GQVCNPLDSLDLDNTQLDFVAILDEPQGLSPPPSHDQRGSSGHTPPPSGPPNMAVGNM SVLLRSLPGETEFLNSSA" misc_feature 781..1250 /note="zinc finger region (AA 235-393)" BASE COUNT 785 a 1161 c 949 g 705 t ORIGIN 1 cccagactcc agccctggac cgcgcatccc gagcccagcg cccagacaga gtgtccccac 61 accctcctct gagacgccat gttcaactcg atgaccccac caccaatcag tagctatggc 121 gagccctgct gtctccggcc cctccccagt cagggggccc ccagtgtggg gacagaagga 181 ctgtctggcc cgcccttctg ccaccaagct aacctcatgt ccggccccca cagttatggg 241 ccagccagag agaccaacag ctgcaccgag ggcccactct tttcttctcc ccggagtgca 301 gtcaagttga ccaagaagcg ggcactgtcc atctcacctc tgtcggatgc cagcctggac 361 ctgcagacgg ttatccgcac ctcacccagc tccctcgtag ctttcatcaa ctcgcgatgc 421 acatctccag gaggctccta cggtcatctc tccattggca ccatgagccc atctctggga 481 ttcccagccc agatgaatca ccaaaaaggg ccctcgcctt cctttggggt ccagccttgt 541 ggtccccatg actctgcccg gggtgggatg atcccacatc ctcagtcccg gggacccttc 601 ccaacttgcc agctgaagtc tgagctggac atgctggttg gcaagtgccg ggaggaaccc 661 ttggaaggtg atatgtccag ccccaactcc acaggcatac aggatcccct gttggggatg 721 ctggatgggc gggaggacct cgagagagag gagaagcgtg agcctgaatc tgtgtatgaa 781 actgactgcc gttgggatgg ctgcagccag gaatttgact cccaagagca gctggtgcac 841 cacatcaaca gcgagcacat ccacggggag cggaaggagt tcgtgtgcca ctgggggggc 901 tgctccaggg agctgaggcc cttcaaagcc cagtacatgc tggtggttca catgcgcaga 961 cacactggcg agaagccaca caagtgcacg tttgaagggt gccggaagtc atactcacgc 1021 ctcgaaaacc tgaagacgca cctgcggtca cacacgggtg agaagccata catgtgtgag 1081 cacgagggct gcagtaaagc cttcagcaat gccagtgacc gagccaagca ccagaatcgg 1141 acccattcca atgagaagcc gtatgtatgt aagctccctg gctgcaccaa acgctataca 1201 gatcctagct cgctgcgaaa acatgtcaag acagtgcatg gtcctgacgc ccatgtgacc 1261 aaacggcacc gtggggatgg ccccctgcct cgggcaccat ccatttctac agtggagccc 1321 aagagggagc gggaaggagg tcccatcagg gaggaaagca gactgactgt gccagagggt 1381 gccatgaagc cacagccaag ccctggggcc cagtcatcct gcagcagtga ccactccccg 1441 gcagggagtg cagccaatac agacagtggt gtggaaatga ctggcaatgc agggggcagc 1501 actgaagacc tctccagctt ggacgaggga ccttgcattg ctggcactgg tctgtccact 1561 cttcgccgcc ttgagaacct caggctggac cagctacatc aactccggcc aatagggacc 1621 cggggtctca aactgcccag cttgtcccac accggtacca ctgtgtcccg ccgcgtgggc 1681 cccccagtct ctcttgaacg ccgcagcagc agctccagca gcatcagctc tgcctatact 1741 gtcagccgcc gctcctccct ggcctctcct ttcccccctg gctccccacc agagaatgga 1801 gcatcctccc tgcctggcct tatgcctgcc cagcactacc tgcttcgggc aagatatgct 1861 tcagccagag ggggtggtac ttcgcccact gcagcatcca gcctggatcg gataggtggt 1921 cttcccatgc ctccttggag aagccgagcc gagtatccag gatacaaccc caatgcaggg 1981 gtcacccgga gggccagtga cccagcccag gctgctgacc gtcctgctcc agctagagtc 2041 cagaggttca agagcctggg ctgtgtccat accccaccca ctgtggcagg gggaggacag 2101 aactttgatc cttacctccc aacctctgtc tactcaccac agccccccag catcactgag 2161 aatgctgcca tggatgctag agggctacag gaagagccag aagttgggac ctccatggtg 2221 ggcagtggtc tgaaccccta tatggacttc ccacctactg atactctggg atatggggga 2281 cctgaagggg cagcagctga gccttatgga gcgaggggtc caggctctct gcctcttggg 2341 cctggtccac ccaccaacta tggccccaac ccctgtcccc agcaggcctc atatcctgac 2401 cccacccaag aaacatgggg tgagttccct tcccactctg ggctgtaccc aggccccaag 2461 gctctaggtg gaacctacag ccagtgtcct cgacttgaac attatggaca agtgcaagtc 2521 aagccagaac aggggtgccc agtggggtct gactccacag gactggcacc ctgcctcaat 2581 gcccacccca gtgaggggcc cccacatcca cagcctctct tttcccatta cccccagccc 2641 tctcctcccc aatatctcca gtcaggcccc tatacccagc caccccctga ttatcttcct 2701 tcagaaccca ggccttgcct ggactttgat tcccccaccc attccacagg gcagctcaag 2761 gctcagcttg tgtgtaatta tgttcaatct caacaggagc tactgtggga gggtgggggc 2821 agggaagatg cccccgccca ggaaccttcc taccagagtc ccaagtttct ggggggttcc 2881 caggttagcc caagccgtgc taaagctcca gtgaacacat atggacctgg ctttggaccc 2941 aacttgccca atcacaagtc aggttcctat cccacccctt caccatgcca tgaaaatttt 3001 gtagtggggg caaatagggc ttcacatagg gcagcagcac cacctcgact tctgccccca 3061 ttgcccactt gctatgggcc tctcaaagtg ggaggcacaa accccagctg tggtcatcct 3121 gaggtgggca ggctaggagg gggtcctgcc ttgtaccctc ctcccgaagg acaggtatgt 3181 aaccccctgg actctcttga tcttgacaac actcagctgg actttgtggc tattctggat 3241 gagccccagg ggctgagtcc tcctccttcc catgatcagc ggggcagctc tggacatacc 3301 ccacctccct ctgggccccc caacatggct gtgggcaaca tgagtgtctt actgagatcc 3361 ctacctgggg aaacagaatt cctcaactct agtgcctaaa gagtagggaa tctcatccat 3421 cacagatcgc atttcctaag gggtttctat ccttccagaa aaattggggg agctgcagtc 3481 ccctgcacaa gatgccccag ggatgggagg tatgggctgg gggctatgta tagtctgtat 3541 acgttttgag gagaaatttg ataatgacac tgtttcctga taataaagga actgcatcag // LOCUS HSGLR 1365 bp RNA PRI 30-MAR-1995 DEFINITION Human mRNA for gastric lipase. ACCESSION X05997 NID g31771 KEYWORDS gastric lipase; lipase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1365) AUTHORS Bodmer,M.W., Angal,S., Yarranton,G.T., Harris,T.J., Lyons,A., King,D.J., Pieroni,G., Riviere,C., Verger,R. and Lowe,P.A. TITLE Molecular cloning of a human gastric lipase and expression of the enzyme in yeast JOURNAL Biochim. Biophys. Acta 909 (3), 237-244 (1987) MEDLINE 87299724 COMMENT Data kindly reviewed (10-DEC-1987) by LOWE P.A. FEATURES Location/Qualifiers source 1..1365 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 45..1241 /codon_start=1 /product="gastric lipase precursor" /db_xref="PID:g758063" /db_xref="SWISS-PROT:P07098" /translation="MWLLLTMASLISVLGTTHGLFGKLHPGSPEVTMNISQMITYWGY PNEEYEVVTEDGYILEVNRIPYGKKNSGNTGQRPVVFLQHGLLASATNWISNLPNNSL AFILADAGYDVWLGNSRGNTWARRNLYYSPDSVEFWAFSFDEMAKYDLPATIDFIVKK TGQKQLHYVGHSQGTTIGFIAFSTNPSLAKRIKTFYALAPVATVKYTKSLINKLRFVP QSLFKFIFGDKIFYPHNFFDQFLATEVCSREMLNLLCSNALFIICGFDSKNFNTSRLD VYLSHNPAGTSVQNMFHWTQAVKSGKFQAYDWGSPVQNRMHYDQSQPPYYNVTAMNVP IAVWNGGKDLLADPQDVGLLLPKLPNLIYHKEIPFYNHLDFIWAMDAPQEVYNDIVSM ISEDKK" sig_peptide 45..101 /note="pot.signal peptide (AA -19 to -1)" sig_peptide 63..101 /note="pot.signal peptide (AA -13 to -1)" CDS 63..1241 /codon_start=1 /product="gastric lipase precursor" /db_xref="PID:g758064" /translation="MASLISVLGTTHGLFGKLHPGSPEVTMNISQMITYWGYPNEEYE VVTEDGYILEVNRIPYGKKNSGNTGQRPVVFLQHGLLASATNWISNLPNNSLAFILAD AGYDVWLGNSRGNTWARRNLYYSPDSVEFWAFSFDEMAKYDLPATIDFIVKKTGQKQL HYVGHSQGTTIGFIAFSTNPSLAKRIKTFYALAPVATVKYTKSLINKLRFVPQSLFKF IFGDKIFYPHNFFDQFLATEVCSREMLNLLCSNALFIICGFDSKNFNTSRLDVYLSHN PAGTSVQNMFHWTQAVKSGKFQAYDWGSPVQNRMHYDQSQPPYYNVTAMNVPIAVWNG GKDLLADPQDVGLLLPKLPNLIYHKEIPFYNHLDFIWAMDAPQEVYNDIVSMISEDKK " mat_peptide 102..1238 /note="gastric lipase (AA 1-379)" BASE COUNT 396 a 284 c 268 g 417 t ORIGIN 1 agaaacagaa tcctaactat ttctgaggaa actgcaggtc caaaatgtgg ctgcttttaa 61 caatggcaag tttgatatct gtactgggga ctacacatgg tttgtttgga aaattacatc 121 ctggaagccc tgaagtgact atgaacatta gtcagatgat tacttattgg ggatacccaa 181 atgaagaata tgaagttgtg actgaagatg gttatattct tgaagtcaat agaattcctt 241 atgggaagaa aaattcaggg aatacaggcc agagacctgt tgtgtttttg cagcatggtt 301 tgcttgcatc agccacaaac tggatttcca acctgccgaa caacagcctt gccttcattc 361 tggcagatgc tggttatgat gtgtggctgg gcaacagcag aggaaacacc tgggccagaa 421 gaaacttgta ctattcacca gattcagttg aattctgggc tttcagcttt gatgaaatgg 481 ctaaatatga ccttccagcc acaatcgact tcattgtaaa gaaaactgga cagaagcagc 541 tacactatgt tggccattcc cagggcacca ccattggttt tattgccttt tccaccaatc 601 ccagcctggc taaaagaatc aaaaccttct atgctctagc tcctgttgcc actgtgaagt 661 atacaaaaag ccttataaac aaacttagat ttgttcctca atccctcttc aagtttatat 721 ttggtgacaa aatattctac ccacacaact tctttgatca atttcttgct actgaagtgt 781 gctcccgtga gatgctgaat ctcctttgca gcaatgcctt atttataatt tgtggatttg 841 acagtaagaa ctttaacacg agtcgcttgg atgtgtatct atcacataat ccagcaggaa 901 cttctgttca aaacatgttc cattggaccc aggctgttaa gtctgggaaa ttccaagctt 961 atgactgggg aagcccagtt cagaatagga tgcactatga tcagtcccaa cctccctact 1021 acaatgtgac agccatgaat gtaccaattg cagtgtggaa cggtggcaag gacctgttgg 1081 ctgaccccca agatgttggc cttttgcttc caaaactccc caatcttatt taccacaagg 1141 agattccttt ttacaatcac ttggacttta tctgggcaat ggatgcccct caagaagttt 1201 acaatgacat tgtttctatg atatcagaag ataaaaagta gttctggatt taaagaatta 1261 tccgtttgtt tttccaaaat actttattct ctcatacata gtattttcat aatgtttgac 1321 atgcagtgct tctttctgta attttgactt tagaaatata ttggc // LOCUS HSGLTSY 2437 bp RNA PRI 17-OCT-1994 DEFINITION H.sapiens QRSHs mRNA for glutaminyl-tRNA synthetase. ACCESSION X76013 NID g531595 KEYWORDS glutaminyl-tRNA synthetase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2437) AUTHORS Lamour,V., Quevillon,S., Diriong,S., N'Guyen,V.C., Lipinski,M. and Mirande,M. TITLE Evolution of the Glx-tRNA synthetase family: the glutaminyl enzyme as a case of horizontal gene transfer JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91 (18), 8670-8674 (1994) MEDLINE 94359993 REFERENCE 2 (bases 1 to 2437) AUTHORS Lipinski,M. TITLE Direct Submission JOURNAL Submitted (02-NOV-1993) M. Lipinski, Lab. de Biologie des Tumeurs Humaines, CNRS URA 1156, Inst. Gustave Roussy, 94805 Villeuif Cedex, FRANCE FEATURES Location/Qualifiers source 1..2437 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /tissue_type="Ewing tumour" /cell_line="IARC-EW11" /clone_lib="fetal brain and liver" /clone="cDD4 and F27" /chromosome="3" gene 6..2333 /gene="QRSHs" CDS 6..2333 /gene="QRSHs" /codon_start=1 /product="glutaminyl-tRNA synthetase" /db_xref="PID:g558586" /db_xref="SWISS-PROT:P47897" /translation="MAALDSLSLFTSLGLSEQKARETLKNSALSAQLREAATQAQQTL GSTIDKATGILLYGLASRLRDTRRLSFLVSYIASKKIHTEPQLSAALEYVRSHPLDPI DTVDFERECGVGVIVTPEQIEEAVEAAINRHRPQLLVERYHFNMGLLMGEARAVLKWA DGKMIKNEVDMQVLHLLGPKLEADLEKKFKVAKARLEETDRRTAKDVVENGETADQTL SLMEQLRGEALKFHKPGENYKTPGYVVTPHTMNLLKQHLEITGGQVRTRFPPEPNGIL HIGHAKAINFNFGYAKANNGICFLRFDDTNPEKEEAKFFTAICDMVAWLGYTPYKVTY ASDYFDQLYAWAVELIRRGLAYVCHQRGEELKGHNTLPSPWRDRPMEESLLLFEAMRK GKFSEGEATLRMKLVMEDGKMDPVAYRVKYTPHHRTGDKWCIYPTYDYTHCLCDSIEH ITHSLCTKEFQARRSSYFWLCNALDVYCPVQWEYGRLNLHYAVVSKRKILQLVATGAV RDWDDPRLFTLTALRRRGFPPEAINNFCARVGVTVAQTTMEPHLLEACVRDVLNDTAP RAMAVLESLRVIITNFPAAKSLDIQVPNFPADETKGFHQVPFAPIVFIERTDFKEEPE PGFKRLAWGQPVGLRHTGYVIELQHVVKGPSGCVESLEVTCRRADAGEKPKAFIHWVS QPLMCEVRLYERLFQHKNPEDPTEVPGGFLSDLNLASLHVVDAALVDCSVALAKPFDK FQFERLGYFSVDPDSHQGKLVFNRTVTLKEDPGKV" polyA_signal 2404..2409 BASE COUNT 576 a 665 c 670 g 526 t ORIGIN 1 ctgcaatggc ggctctagac tccctgtcgc tcttcactag cctcggcctg agcgagcaga 61 aggcccgcga gacgctcaag aactcggctc tgagcgcgca gctgcgcgag gccgctactc 121 aggctcagca gaccctgggt tccaccattg acaaagctac cgggatcctg ttatatggct 181 tggcctcccg actcagggat acccggcgtc tctccttcct tgtaagctac atagccagta 241 agaagatcca cactgagccc cagctaagcg ctgcccttga gtatgtgcgg agtcacccct 301 tggaccccat cgacactgtg gacttcgagc gggaatgtgg cgtgggtgtc attgtgaccc 361 cagagcagat tgaggaggct gtggaggctg ctattaacag gcaccggccc cagctcctgg 421 tggaacgtta ccatttcaac atggggctgc tgatgggaga ggctcgggct gtgctgaagt 481 gggcagatgg caaaatgatc aagaatgaag tggacatgca ggtcctccac cttctgggcc 541 ccaagttgga ggctgatctg gagaagaagt tcaaggtggc aaaagctcgg ctagaagaaa 601 cagaccggag gacggcaaag gatgtggtgg agaatggcga gactgctgac cagaccctgt 661 ctctgatgga gcagctccgg ggggaggccc ttaagttcca caagcctggt gagaactaca 721 agaccccagg ctatgtggtc actccacaca ccatgaatct actaaagcag cacctggaga 781 ttactggtgg gcaggtacgt acccggttcc cgccagaacc caatggaatc ctgcatattg 841 gacatgccaa agccatcaat ttcaactttg gctatgccaa ggccaacaat ggcatctgtt 901 ttctgcgttt tgatgacacc aaccctgaga aggaggaagc aaagttcttc acggccatct 961 gtgacatggt agcctggcta ggctacacac cttacaaagt cacatatgcg tctgactatt 1021 ttgaccagct atatgcgtgg gctgtggagc tcatccgcag gggtctggct tatgtgtgcc 1081 accagcgagg agaggagctc aaaggccata atactctgcc ttcaccctgg agagaccgtc 1141 ccatggagga gtcactgctg ctctttgagg caatgcgcaa gggcaagttt tcagagggcg 1201 aggccacact acggatgaag ctggtgatgg aggatggcaa gatggaccct gtagcctatc 1261 gagtcaagta tacaccacac caccgcacag gggacaaatg gtgcatctat cccacctacg 1321 actacacaca ctgcctctgt gactccatcg agcacatcac tcactcactc tgcaccaagg 1381 aattccaggc ccgacgctct tcctacttct ggctttgcaa tgcactggac gtctattgcc 1441 ctgtgcagtg ggagtatggc cgcctcaacc tgcactatgc tgttgtctct aagaggaaga 1501 tcctccagct tgtagcaact ggtgctgtgc gggactggga tgacccacgg ctctttacac 1561 tcacggccct gcgacggcgg ggcttcccac ctgaggccat caacaacttc tgtgcccggg 1621 tgggagtgac tgtggcacaa accacaatgg agccacatct tctagaagcc tgtgtgcgtg 1681 atgtgctgaa tgacacagcc ccacgagcca tggctgtgct ggagtcacta cgggtcatca 1741 tcaccaactt tcctgctgcc aagtccttgg acatccaggt gcccaacttc ccagctgatg 1801 agaccaaagg cttccatcag gttccctttg cacccattgt cttcattgag aggactgact 1861 tcaaggagga gccagagcca ggatttaagc gcctggcttg gggccagcct gtgggcctga 1921 ggcatacagg ctacgtcatt gagctgcagc atgttgtcaa gggccccagt ggttgtgtag 1981 agagtctgga ggtgacctgc agacgggcag atgctggaga gaagccaaag gcctttattc 2041 actgggtgtc acagcctttg atgtgtgagg ttcgcctcta tgagcgacta ttccagcaca 2101 agaaccctga agatcctact gaggtgcctg gtggattttt aagtgacctg aacctggcat 2161 cactacacgt ggtggatgca gcattagtgg actgctctgt ggccctggca aaacccttcg 2221 acaagttcca gtttgagcgt cttggatatt tctccgtgga tccagacagc catcagggaa 2281 agcttgtctt taaccgaact gtcacactga aggaagaccc aggaaaggtg tgagctggaa 2341 gcactgaacc tacctcatcc tcctggaggg tgtggctacc ctcgccaccc caaattccat 2401 gtcaataaag aacagctaaa ttctcctaga aaaaaaa // LOCUS HSGLUCAR 1322 bp RNA PRI 25-JUN-1996 DEFINITION H.sapiens mRNA for novel glucocorticoid receptor-associated protein. ACCESSION Z35491 NID g1143475 KEYWORDS glucocorticoid receptor-associated protein; RAP46. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1322) AUTHORS Zeiner,M. and Gehring,U. TITLE A protein that interacts with members of the nuclear hormone receptor family: identification and cDNA cloning JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (25), 11465-11469 (1995) MEDLINE 96102134 REFERENCE 2 (bases 1 to 1322) AUTHORS Zeiner,M. TITLE Direct Submission JOURNAL Submitted (28-JUL-1994) Zeiner M., Universitaet Heidelberg, Institut fuer Biologische Chemie, Im Neuenheimer Feld 501, Heidelberg, Germany, D-69120 FEATURES Location/Qualifiers source 1..1322 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pK2/1.3" /dev_stage="adult" /tissue_type="liver" /clone_lib="lambda gt11" /sex="male" mRNA 1..1322 CDS 279..1103 /codon_start=1 /product="glucocortoid receptor-associated protein RAP46" /db_xref="PID:e110823" /db_xref="PID:g1143476" /translation="MKKKTRRRSTRSEELTRSEELTLSEEATWSEEATQSEEATQGEE MNRSQEVTRDEESTRSEEVTREEMAAAGLTVTVTHSNEKHDLHVTSQQGSSEPVVQDL AQVVEEVIGVPQSFQKLIFKGKSLKEMETPLSALGIQDGCRVMLIGKKNSPQEEVELK KLKHLEKSVEKIADQLEELNKELTGIQQGFLPKDLQAEALCKLDRRVKATIEQFMKIL EEIDTLILPENFKDSRLKRKGLVKKVQAFLAECDTVEQNICQETERLQSTNFALAE" BASE COUNT 350 a 318 c 412 g 242 t ORIGIN 1 tagtcgggcg gggttgtgag acgccgcgct cagcttccat cgctgggcgg tcaacaagtg 61 cgggcctggc tcagcgcggg ggggcgcgga gaccgcgagg cgaccgggag cggctgggtt 121 cccggctgcg cgcccttcgg ccaggccggg agccgcgcca gtcggagccc ccggcccagc 181 gtggtccgcc tccctctcgg cgtccacctg cccggagtac tgccagcggg catgaccgac 241 ccaccagggg cgccgccgcc ggcgctcgca ggccgcggat gaagaagaaa acccggcgcc 301 gctcgacccg gagcgaggag ttgacccgga gcgaggagtt gaccctgagt gaggaagcga 361 cctggagtga agaggcgacc cagagtgagg aggcgaccca gggcgaagag atgaatcgga 421 gccaggaggt gacccgggac gaggagtcga cccggagcga ggaggtgacc agggaggaaa 481 tggcggcagc tgggctcacc gtgactgtca cccacagcaa tgagaagcac gaccttcatg 541 ttacctccca gcagggcagc agtgaaccag ttgtccaaga cctggcccag gttgttgaag 601 aggtcatagg ggttccacag tcttttcaga aactcatatt taagggaaaa tctctgaagg 661 aaatggaaac accgttgtca gcacttggaa tacaagatgg ttgccgggtc atgttaattg 721 ggaaaaagaa cagtccacag gaagaggttg aactaaagaa gttgaaacat ttggagaagt 781 ctgtggagaa gatagctgac cagctggaag agttgaataa agagcttact ggaatccagc 841 agggttttct gcccaaggat ttgcaagctg aagctctctg caaacttgat aggagagtaa 901 aagccacaat agagcagttt atgaagatct tggaggagat tgacacactg atcctgccag 961 aaaatttcaa agacagtaga ttgaaaagga aaggcttggt aaaaaaggtt caggcattcc 1021 tagccgagtg tgacacagtg gagcagaaca tctgccagga gactgagcgg ctgcagtcta 1081 caaactttgc cctggccgag tgaggtgtag cagaaaaagg ctgtgctgcc ctgaagaatg 1141 gcgccaccag ctctgccgtc tctggagcgg aatttacctg atttcttcag ggctgctggg 1201 ggcaactggc catttgccaa ttttcctact ctcacactgg ttctcaatga aaaatagtgt 1261 ctttgtgatt ttgagtaaag ctcctatctg ttttctaaaa aaaaaaaaaa aaaaaaaaaa 1321 aa // LOCUS HSGLUCOII 2835 bp RNA PRI 22-JUL-1997 DEFINITION Homo sapiens mRNA for glucosidase II. ACCESSION AJ000332 NID g2274967 KEYWORDS glucosidase II; KIAA0088. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2835) AUTHORS Stuerzenhofecker,B. TITLE Direct Submission JOURNAL Submitted (08-JUL-1997) Stuerzenhofecker B., Abteilung Klinische Biochemie, Zentrum Innere Medizin, Robert-Koch-Str. 40, D 370 75 Goettingen, GERMANY REFERENCE 2 (bases 1 to 2835) AUTHORS Stuerzenhofecker,B., Nguyenvan,P. and Soeling,H.D. TITLE Sequence and analysis of the endoplasmic reticulum protein Glucosidase II JOURNAL Unpublished FEATURES Location/Qualifiers source 1..2835 /organism="Homo sapiens" /plasmid="Plasmid pACT2" /db_xref="taxon:9606" /lab_host="Escherichia coli XL1-Blue" /sex="Male" /tissue_type="Brain" sig_peptide 1..96 /gene="KIAA0088" CDS 1..2835 /gene="KIAA0088" /function="Glucose trimming of glycosylated proteins within the endoplasmic reticulum" /codon_start=1 /evidence=experimental /product="Glucosidase II" /db_xref="PID:e328143" /db_xref="PID:g2274968" /translation="MAAVAAVAARRRRSWASLVLAFLGVCLGITLAVDRSNFKTCEES SFCKRQRSIRPGLSPYRALLDSLQLGPDSLTVHLIHEVTKVLLVLELQGLQKNMTRFR IDELEPRRPRYRVPDVLVADPPIARLSVSGRDENSVELTMAEGPYKIILTARPFRLDL LEDRSLLLSVNARGLLEFEHQRAPRVSQGSKDPAEGDGAQPEETPRDGDKPEETQGKA EKDEPGAWEETFKTHSDSKPYGPMSVGLDFSLPGMEHVYGIPEHADNLRLKVTEGGEP YRLYNLDVFQYELYNPMALYGSVPVLLAHNPHRDLGIFWLNAAETWVDISSNTAGKTL FGKMMDYLQGSGETPQTDVRWMSETGIIDVFLLLGPSISDVFRQYASLTGTQALPPLF SLGYHQSRWNYRDEADVLEVDQGFDDHNLPCDVIWLDIEHADGKRYFTWDPSRFPQPR TMLERLASKRRKLVAIVDPHIKVDSGYRVHEELRNLGLYVKTRDGSDYEGWCWPGSAG YPDFTNPTMRAWWANMFSYDNYEGSAPNLFVWNDMNEPSVFNGPEVTMLKDAQHYGGW EHRDVHNIYGLYVHMATADGLRQRSGGMERPFVLARAFFAGSQRFGAVWTGDNTAEWD HLKISIPMCLSLGLVGLSFCGADVGGFFKNPEPELLVRWYQMGAYQPFFRAHAHLDTG RREPWLLPSQHNDIIRDALGQRYSLLPFWYTLLYQAHREGIPVMRPLWVQYPQDVTTF NIDDQYLLGDALLVHPVSDSGAHGVQVYLPGQGEVWYDIQSYQKHHGPQTLYLPVTLS SIPVFQRGGTIVPRWMRVRRSSECMKDDPITLFVALSPQGTAQGELFLDDGYTFNYQT RQEFLLRRFSFSGNTLVSSSADPEGHFETPIWIERVVIIGAGKPAAVVLQTKGSPESR LSFQHDPETSVLVLRKPGINVASDWSIHLR" gene 1..2835 /gene="KIAA0088" mat_peptide 97..2832 /gene="KIAA0088" /function="Glucose trimming of glycosylated proteins within the endoplasmic reticulum" /product="Glucosidase II" BASE COUNT 597 a 748 c 813 g 677 t ORIGIN 1 atggcggcgg tagcggcagt ggcggcgcgt aggaggcggt cttgggcgtc tttggtactg 61 gcttttttag gggtctgcct ggggattacc cttgctgtgg atagaagcaa ctttaagacc 121 tgtgaagaga gttctttctg caagcgacag agaagcatac ggccaggcct ctctccatac 181 cgagccttgc tggactctct acagcttggt cctgattccc tcacggtcca tctgatccat 241 gaggtcacca aggtgttgct ggtgctagag cttcaggggc ttcaaaagaa catgactcgg 301 ttcaggattg atgagctgga gcctcggcga ccccgatacc gtgtaccaga tgttttggtg 361 gctgatccac caatagcccg gctttctgtc tctggtcgtg atgagaacag tgtggagtta 421 accatggctg agggacccta caagatcatc ttgacagcac ggccattccg ccttgaccta 481 ctagaggacc gaagtctttt gcttagtgtc aatgcccgag gactcttgga gtttgagcat 541 cagagggccc ctagggtctc gcaaggatca aaagacccag ctgagggcga tggggcccag 601 cctgaggaaa cacccaggga tggcgacaag ccagaggaga ctcaggggaa ggcagagaaa 661 gatgagccag gagcctggga ggagacattc aaaactcact ctgacagcaa gccgtatggc 721 cccatgtctg tgggtttgga cttctctctg ccaggcatgg agcatgtcta tgggatccct 781 gagcatgcag acaacctgag gctgaaggtc actgagggtg gggagccata tcgcctctac 841 aatttggatg tgttccagta tgagctgtac aacccaatgg ccttgtatgg gtctgtgcct 901 gtgctcctgg cacacaaccc tcatcgcgac ttgggcatct tctggctcaa tgctgcagag 961 acctgggttg atatatcttc caacactgcc gggaagaccc tgtttgggaa gatgatggac 1021 tacctgcagg gctctgggga gaccccacag acagatgttc gctggatgtc agagactggc 1081 atcattgacg tcttcctgct gctggggccc tccatctctg atgttttccg gcaatatgct 1141 agtctcacag gaacccaggc gttgccccca ctcttctccc tcggctacca ccagagccgt 1201 tggaactacc gggacgaggc tgatgtgctg gaagtggatc agggctttga tgatcacaac 1261 ctgccctgtg atgtcatctg gctagacatt gaacatgctg atggcaagcg gtatttcacc 1321 tgggacccca gtcgcttccc tcagccccgc accatgcttg agcgcttggc ttctaagagg 1381 cggaagctgg tggccatcgt agacccccac atcaaggtgg actccggcta ccgagttcac 1441 gaggagctgc ggaacctggg gctgtatgtt aaaacccggg atggctctga ctatgagggc 1501 tggtgctggc caggctcagc tggttaccct gacttcacta atcccacgat gagggcctgg 1561 tgggctaaca tgttcagcta tgacaattat gagggctcag ctcccaacct ctttgtctgg 1621 aatgacatga acgaaccatc tgtgttcaat ggtcctgagg tcaccatgct caaggatgcc 1681 cagcattatg ggggctggga gcaccgggat gtgcataaca tctatggcct ttatgtgcac 1741 atggcgactg ctgatgggct gagacagcgc tctgggggca tggaacgccc ctttgtcctg 1801 gccagggcct tcttcgctgg ctcccagcgc tttggagccg tgtggacagg ggacaacact 1861 gccgagtggg accatttgaa gatctctatt cctatgtgtc tcagcttggg gctggtggga 1921 ctttccttct gtggggcgga tgtgggtggc ttcttcaaaa acccagagcc agagctgctt 1981 gtgcgctggt accagatggg tgcttaccag ccattcttcc gggcacatgc ccacttggac 2041 actgggcgac gagagccatg gctgttacca tctcagcaca atgatataat ccgagatgcc 2101 ttgggccagc gatattcttt gctgcccttc tggtacaccc tcttatatca ggcccatcgg 2161 gaaggcattc ctgtcatgag gcccctgtgg gtgcagtacc ctcaggatgt gactaccttc 2221 aatatagatg atcagtactt gcttggggat gcgttgctgg ttcaccctgt atcagactct 2281 ggagcccatg gtgtccaggt ctatctgcct ggccaagggg aggtgtggta tgacattcaa 2341 agctaccaga agcatcatgg tccccagacc ctgtacctgc ctgtaactct aagcagtatc 2401 cctgtgttcc agcgtggagg gacaatcgtg cctcgatgga tgcgagtgcg gcggtcttca 2461 gaatgtatga aggatgaccc catcactctc tttgttgcac ttagccctca gggtacagct 2521 caaggagagc tctttctgga tgatgggtac acgttcaact atcagactcg ccaagagttc 2581 ctgctgcgtc gattctcatt ctctggcaac acccttgtct ccagctcagc agaccctgaa 2641 ggacactttg agacaccaat ctggattgag cgggtggtga taataggggc tggaaagcca 2701 gcagctgtgg tactccagac aaaaggatct ccagaaagcc gcctgtcctt ccagcatgac 2761 cctgagacct ctgtgttggt cctgcgcaag cctggcatca atgtggcatc tgattggagt 2821 attcacctgc gataa // LOCUS HSGLUR 1618 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for glutathione reductase (EC 1.6.4.2). ACCESSION X15722 NID g31824 KEYWORDS glutathione reductase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1618) AUTHORS Tutic,M. and Werner,D. TITLE Direct Submission JOURNAL Submitted (30-JUN-1989) Tutic M., Werner D., German Cancer Research Centre, Institute of Cell & Tumor Biology, Im Neuenheimerfeld 280, D 6900 Heidelberg, F R G REFERENCE 2 (bases 1 to 1618) AUTHORS Tutic,M., Lu,X.A., Schirmer,R.H. and Werner,D. TITLE Cloning and sequencing of mammalian glutathione reductase cDNA JOURNAL Eur. J. Biochem. 188 (3), 523-528 (1990) MEDLINE 90235822 FEATURES Location/Qualifiers source 1..1618 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="lambda gt10" /clone="GRH-Mev10" /chromosome="8" /map="8p21->8p23" CDS 105..1544 /note="glutathione reductase (AA 1-479)" /codon_start=1 /db_xref="PID:g31825" /db_xref="SWISS-PROT:P00390" /translation="MACRQEPQPQGPPPAAGAVASYDYLVIGGGSGGLASARRAAELG ARAAVVESHKLGGTCVNVGCVPKKVMWNTAVHSEFMHDHADYGFPSCEGKFNWRVIKE KRDAYVSRLNAIYQNNLTKSHIEIIRGHAAFTSDPKPTIEVSGKKYTAPHILIATGGM PSTPHESQIPGASLGITSDGFFQLEELPGRSVIVGAGYIAVEMAGILSALGSKTSLMI RHDKVLRSFDSMISTNCTEELENAGVEVLKFSQVKEVKKTLSGLEVSMVTAVPGRLPV MTMIPDVDCLLWAIGRVPNTKDLSLNKLGIQTDDKGHIIVDEFQNTNVKGIYAVGDVC GKALLTPVAIAAGRKLAHRLFEYKEDSKLDYNNIPTVVFSHPPIGTVGLTEDEAIHKY GIENVKTYSTSFTPMYHAVTKRKTKCVMKMVCANKEEKVVGIHMQGLGCDEMLQGFAV AVKMGATKADFDNTVAIHPTSSEELVTLR" misc_feature 1603..1607 /note="polyA signal" polyA_site 1618 /note="polyA site" BASE COUNT 409 a 400 c 456 g 353 t ORIGIN 1 gagcgccggc gcgggaccga gctggcggcg ggcggcgcgc gcttccgagg cttcctgctg 61 cttctgcccg agcccgcggc ctcacgcgcg ccctctcccg tgccatggcc tgcaggcagg 121 agccgcagcc gcagggcccg ccgcccgctg ctggcgccgt ggcctcctat gactacctgg 181 tgatcggggg cggctcgggc gggctggcca gcgcgcgcag ggcggccgag ctgggtgcca 241 gggccgccgt ggtggagagc cacaagctgg gtggcacttg cgtgaatgtt ggatgtgtac 301 ccaaaaaggt aatgtggaac acagctgtcc actctgaatt catgcatgat catgctgatt 361 atggctttcc aagttgtgag ggtaaattca attggcgtgt tattaaggaa aagcgggatg 421 cctatgtgag ccgcctgaat gccatctatc aaaacaatct caccaagtcc catatagaaa 481 tcatccgtgg ccatgcagcc ttcacgagtg atcccaagcc cacaatagag gtcagtggga 541 aaaagtacac cgccccacac atcctgatcg ccacaggtgg tatgccctcc acccctcatg 601 agagccagat ccccggtgcc agcttaggaa taaccagcga tggatttttt cagctggaag 661 aattgcccgg ccgcagcgtc attgttggtg caggttacat tgctgtggag atggcaggga 721 tcctgtcagc cctgggttct aagacatcac tgatgatacg gcatgataag gtacttagaa 781 gttttgattc aatgatcagc accaactgca cggaggagct ggagaacgct ggcgtggagg 841 tgctgaagtt ctcccaggtc aaggaggtta aaaagacttt gtcgggcttg gaagtcagca 901 tggttactgc agttcccggt aggctaccag tcatgaccat gattccagat gttgactgcc 961 tgctctgggc cattgggcgg gtcccgaata ccaaggacct gagtttaaac aaactgggga 1021 ttcaaaccga tgacaagggt catatcatcg tagacgaatt ccagaatacc aacgtcaaag 1081 gcatctatgc agttggggat gtatgtggaa aagctcttct tactccagtt gcaatagctg 1141 ctggccgaaa acttgcccat cgactttttg aatataagga agattccaaa ttagattata 1201 acaacatccc aactgtggtc ttcagccacc cccctattgg gacagtggga ctcacggaag 1261 atgaagccat tcataaatat ggaatagaaa atgtgaagac ctattcaacg agctttaccc 1321 cgatgtatca cgcagttacc aaaaggaaaa caaaatgtgt gatgaaaatg gtctgtgcta 1381 acaaggaaga aaaggtggtt gggatccata tgcagggact tgggtgtgat gaaatgctgc 1441 agggttttgc tgttgcagtg aagatgggag caacgaaggc agactttgac aacacagtcg 1501 ccattcaccc tacctcttca gaagagctgg tcacacttcg ttgagaacca ggagacacgt 1561 gtggcgggca gtgggaccca tagatcttct gaaatgaaac aaataatcac attgactt // LOCUS HSGLUR1 2929 bp RNA PRI 18-NOV-1993 DEFINITION H.sapiens mRNA for glutamate receptor GLUR1. ACCESSION X58633 S40299 NID g414892 KEYWORDS GluR1 receptor; glutamate receptor; kainate receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2929) AUTHORS Potier,M.C. TITLE Direct Submission JOURNAL Submitted (19-MAR-1991) M.C. Potier, MRC Laboratory of Molecular Biology, Hills Road, Cambridge CB2 3QH, UK REMARK revised by [3] REFERENCE 2 (bases 1 to 2927) AUTHORS Potier,M.C., Spillantini,M.G. and Carter,N.P. TITLE The human glutamate receptor cDNA GluR1: cloning, sequencing, expression and localization to chromosome 5 JOURNAL DNA Seq. 2 (4), 211-218 (1992) MEDLINE 92329975 REFERENCE 3 (bases 1 to 2929) AUTHORS Potier,M.C. TITLE Direct Submission JOURNAL Submitted (09-NOV-1993) M.C. Potier, MRC Laboratory of Molecular Biology, Hills Road, Cambridge CB2 3QH, UK FEATURES Location/Qualifiers source 1..2929 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="hippocampus" /clone="KR4" /chromosome="5" gene 104..2824 /gene="GLUR1" sig_peptide 104..157 /gene="GLUR1" CDS 104..2824 /gene="GLUR1" /codon_start=1 /product="glutamate receptor GLUR1" /db_xref="PID:g414893" /db_xref="SWISS-PROT:P42261" /translation="MQHIFAFFCTGFLGAVVGANFPNNIQIGGLFPNQQSQEHAAFRF ALSQLTEPPKLLPQIDIVNISDSFEMTYRFCSQFSKGVYAIFGFYERRTVNMLTSFCG ALHVCFITPSFPVDTSNQFVLQLRPELQDALISIIDHYKWQKFVYIYDADRGLSVLQK VLDTAAEKNWQVTAVNILTTTEEGYRMLFQDLEKKKERLVVVDCESERLNAILGQIIK LEKNGIGYHYILANLGFMDIDLNKFKESGANVTGFQLVNYTDTIPAKIMQQWKNSDAR DHTRVDWKRPKYTSALTYDGVKVMAEAFQSLRRQRIDISRRGNAGDCLANPAVPWGQG IDIQRALQQVAFEGLTGNVQFNEKGRRTNYTLHVIEMKHDGIRKIGYWNEDDKFVPAA TDAQAGGDNSSVQNRTYIVTTILEDPYVMLKKNANQFEGNDRYEGYCVELAAEIAKHV GYSYRLEIVSDGKYGARDPDTKAWNGMVGELVYGRADVAVAPLTITLVREEVIDFSKP FMSLGISIMIKKPQKSKPGVFSFLDPLAYEIWMCIVFAYIGVSVVLFLVSRFSPYEWH SEEFEEGRDQTTSDQSNEFGIFNSLWFSLGAFMQQGCDISPRSLSGRIVGGVWWFFTL IIISSYTANLAAFLTVERMVSPIESAEDLAKQTEIAYGTLEAGSTKEFFRRSKIAVFE KMWTYMKSAEPSVFVRTTEEGMIRVRKSKGKYAYLLESTMNEYIEQRKPCDTMKVGGN LDSKGYGIATPKGSALRNPVNLAVLKLNEQGLLDKLKNKWWYDKGECGSGGGDSKDKT SALSLSNVAGVFYILIGGLGLAMLVALIEFCYKSRSESKRMKGFCLIPQQSINEAIRT STLPRNSGAGASSGGSGENGRVVSHDFPKSMQSIPCMSHSSGMPLGATGL" mat_peptide 158..2821 /gene="GLUR1" /product="glutamate receptor GLUR1" misc_feature 1712..1774 /gene="GLUR1" /note="transmembrane domain 1" misc_feature 1856..1918 /gene="GLUR1" /note="transmembrane domain 2" misc_feature 1952..2014 /gene="GLUR1" /note="transmembrane domain 3" misc_feature 2519..2581 /gene="GLUR1" /note="transmembrane domain 4" BASE COUNT 781 a 688 c 793 g 667 t ORIGIN 1 cgaaaagaac aggcagaaca gcgagaagaa taaagggaaa gggggggaaa caccaaatct 61 atgattggac ctgggcttct ttttcgccaa tccaaaaagg aatatgcagc acatttttgc 121 cttcttctgc accggtttcc taggcgcggt agtaggtgcc aatttcccca acaatatcca 181 gatcggggga ttatttccaa accagcagtc acaggaacat gctgctttta gatttgcttt 241 gtcgcaactc acagagcccc cgaagctgct cccccagatt gatattgtga acatcagcga 301 cagctttgag atgacctata gattctgttc ccagttctcc aaaggagtct atgccatctt 361 tgggttttat gaacgtagga ctgtcaacat gctgacctcc ttttgtgggg ccctccacgt 421 ctgcttcatt acgccgagct ttccggttga tacatcaaat cagtttgtcc ttcagctgcg 481 ccctgaactg caggatgccc tcatcagcat cattgaccat tacaagtggc agaaatttgt 541 ctacatttat gatgccgacc ggggcttatc cgtcctgcag aaagtcctgg atacagctgc 601 tgagaagaac tggcaggtga cagcagtcaa cattttgaca accacagagg aaggataccg 661 gatgctcttt caggacctgg agaagaaaaa ggagcggctg gtggtggtgg actgtgaatc 721 agaacgcctc aatgctatct tgggccagat tataaagcta gagaagaatg gcatcggcta 781 ccactacatt cttgcaaatc tgggcttcat ggacattgac ttaaacaaat tcaaggagag 841 tggcgccaat gtgacaggtt tccagctggt gaactacaca gacactattc cggccaagat 901 catgcagcag tggaagaata gtgatgctcg agaccacaca cgggtggact ggaagagacc 961 caagtacacc tctgcgctca cctacgatgg ggtgaaggtg atggctgagg ctttccagag 1021 cctgcggagg cagagaattg atatatctcg ccgggggaat gctggggatt gtctggctaa 1081 cccagctgtt ccctggggcc aagggatcga catccagaga gctctgcagc aggtcgcgtt 1141 tgaaggttta acaggaaacg tgcagtttaa tgagaaagga cgccggacca actacacgct 1201 ccacgtgatt gaaatgaaac atgacggcat ccgaaagatt ggttactgga atgaagatga 1261 taagtttgtc cctgcagcca ccgatgccca agctgggggg gataattcaa gtgttcagaa 1321 cagaacatac atcgtcacaa caatcctaga agatccttat gtgatgctca agaagaacgc 1381 caatcagttt gagggcaatg accgttacga gggctactgt gtagagctgg cggcagagat 1441 tgccaagcac gtgggctact cctaccgtct ggagattgtc agtgatggaa aatacggagc 1501 ccgagaccct gacacgaagg cctggaatgg catggtggga gagctggtct atggaagagc 1561 agatgtggct gtggctcccc ttactatcac tttggtccgg gaagaagtta tagatttctc 1621 caaaccattt atgagtttgg ggatctccat catgattaaa aaaccacaga aatccaagcc 1681 gggtgtcttc tccttccttg atcctttggc ttatgagatt tggatgtgca ttgtttttgc 1741 ctacattgga gtgagtgttg tcctcttcct ggtcagccgc ttcagtccct atgaatggca 1801 cagtgaagag tttgaggaag gacgggacca gacaaccagt gaccagtcca atgagtttgg 1861 gatattcaac agtttgtggt tctccctggg agccttcatg cagcaaggat gtgacatttc 1921 tcccaggtcc ctgtctggtc gcatcgtcgg tggcgtctgg tggttcttca ccttaatcat 1981 catctcctca tatacagcca atctggccgc cttcctgacc gtggagagga tggtgtctcc 2041 cattgagagt gcagaggacc tagcgaagca gacagaaatt gcctacggga cgctggaagc 2101 aggatctact aaggagttct tcaggaggtc taaaattgca gtgtttgaga agatgtggac 2161 atacatgaag tcagcagagc catcagtttt tgtgcggaca acagaggagg ggatgattcg 2221 agtgaggaaa tccaaaggca aatatgccta cctcctggag tccaccatga atgagtacat 2281 tgagcagcgg aaaccctgtg acaccatgaa ggtgggaggt aacttggatt ccaaaggcta 2341 tggcattgca acacccaagg ggtctgccct gagaaatcca gtaaacctgg cagtgttaaa 2401 actgaacgag caggggcttt tggacaaatt gaaaaacaaa tggtggtacg acaagggcga 2461 gtgcggcagc gggggaggtg attccaagga caagacaagc gctctgagcc tcagcaatgt 2521 ggcaggcgtg ttctacatcc tgatcggagg acttggacta gccatgctgg ttgccttaat 2581 cgagttctgc tacaaatccc gtagtgaatc caagcggatg aagggttttt gtttgatccc 2641 acagcaatcc atcaacgaag ccatacggac atcgaccctc ccccgcaaca gcggggcagg 2701 agccagcagc ggcggcagtg gagagaatgg tcgggtggtc agccatgact tccccaagtc 2761 catgcaatcg attccttgca tgagccacag ttcagggatg cccttgggag ccacgggatt 2821 gtaactggag cagatggaga ccccttgggg agcaggctcg ggctccccag ccccatccca 2881 aacccttcag tgccaaaaac aacaacaaaa tgaaacgcaa ccggaattc // LOCUS HSGLUS 2727 bp RNA PRI 03-MAR-1992 DEFINITION Human rearranged mRNA for glutamine synthase. ACCESSION X59834 NID g31830 KEYWORDS glutamate-ammonia ligase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2727) AUTHORS Van Den Hoff,M.J.B. TITLE Direct Submission JOURNAL Submitted (24-MAY-1991) M.J.B. Van Den Hoff, Dept. of Anatomy & Embryology, University of Amsterdam, Meibergdreef 15, 1105 AZ Amsterdam, The Netherlands REFERENCE 2 (bases 1 to 2727) AUTHORS Van den Hoff,M.J., Geerts,W.J., Das,A.T., Moorman,A.F. and Lamers,W.H. TITLE cDNA sequence of the long mRNA for human glutamine synthase JOURNAL Biochim. Biophys. Acta 1090 (2), 249-251 (1991) MEDLINE 92031701 FEATURES Location/Qualifiers source 1..2727 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /clone_lib="lambda ZAP; ATDLMC" /clone="pchGS3.1.1, pchGS6.1, pchGS27.4" 5'UTR 1..109 mRNA 1..2727 /note="cDNA; for glutamate--ammonia ligase" /evidence=experimental CDS 110..1231 /EC_number="6.3.1.2" /codon_start=1 /product="glutamate--ammonia ligase" /db_xref="PID:g31831" /db_xref="SWISS-PROT:P15104" /translation="MTTSASSHLNKGIKQVYMSLPQGEKVQAMYIWIDGTGEGLRCKT RTLDSEPKCVEELPEWNFDGSSTLQSEGSNSDMYLVPAAMFRDPFRKDPNKLVLCEVF KYNRRPAETNLRHTCKRIMDMVSNQHPWFGMEQEYTLMGTDGHPFGWPSNGFPGPQGP YYCGVGADRAYGRDIVEAHYRACLYAGVKIAGTNAEVMPAQWEFQIGPCEGISMGDHL WVARFILHRVCEDFGVIATFDPKPIPGNWNGAGCHTNFSTKAMREENGLKYIEEAIEK LSKRHQYHIRAYDPKGGLDNARRLTGFHETSNINDFSAGVANRSARLRIPRTVGQEKK GYFEDRRPSANCEPFSVTEALIRTCLLNETGDEPFQYKN" 3'UTR 1229..2727 polyA_site 2715 BASE COUNT 698 a 619 c 666 g 744 t ORIGIN 1 agaagagcgg agctgtgagc agtactgcgg cctcctctcc tctcctaacc tcgctctcgc 61 ggcctagctt tacccgcccg cctgctcggc gaccagaaca ccttccacca tgaccacctc 121 agcaagttcc cacttaaata aaggcatcaa gcaggtgtac atgtccctgc ctcagggtga 181 gaaagtccag gccatgtata tctggatcga tggtactgga gaaggactgc gctgcaagac 241 ccggaccctg gacagtgagc ccaagtgtgt ggaagagttg cctgagtgga atttcgatgg 301 ctctagtact ttacagtctg agggttccaa cagtgacatg tatctcgtgc ctgctgccat 361 gtttcgggac cccttccgta aggaccctaa caagctggtg ttatgtgaag ttttcaagta 421 caatcgaagg cctgcagaga ccaatttgag gcacacctgt aaacggataa tggacatggt 481 gagcaaccag cacccctggt ttggcatgga gcaggagtat accctcatgg ggacagatgg 541 gcaccccttt ggttggcctt ccaacggctt cccagggccc cagggtccat attactgtgg 601 tgtgggagca gacagagcct atggcaggga catcgtggag gcccattacc gggcctgctt 661 gtatgctgga gtcaagattg cggggactaa tgccgaggtc atgcctgccc agtgggaatt 721 tcagattgga ccttgtgaag gaatcagcat gggagatcat ctctgggtgg cccgtttcat 781 cttgcatcgt gtgtgtgaag actttggagt gatagcaacc tttgatccta agcccattcc 841 tgggaactgg aatggtgcag gctgccatac caacttcagc accaaggcca tgcgggagga 901 gaatggtctg aagtacatcg aggaggccat tgagaaacta agcaagcggc accagtacca 961 catccgtgcc tatgatccca agggaggcct ggacaatgcc cgacgtctaa ctggattcca 1021 tgaaacctcc aacatcaacg acttttctgc tggtgtagcc aatcgtagcg ccagactacg 1081 cattccccgg actgttggcc aggagaagaa gggttacttt gaagatcgtc gcccctctgc 1141 caactgcgag cccttttcgg tgacagaagc cctcatccgc acgtgtcttc tcaatgaaac 1201 cggcgatgag cccttccagt acaaaaatta agtggactag acctccagct gttgagcccc 1261 tcctagttct tcatccctga ctccaactct tccccctctc ccagttgtcc cgattgtaac 1321 tcaaagggtg gaatatcaag gtcgtttttt tcattccatg tgcccagtta atcttgcttt 1381 cttttgtttg gctgggatag aggggtcaag ttattaattt cttcacacct accctccttt 1441 ttttccctat cactgaagct ttttagtgca ttagtgggga ggagggtggg gagacataac 1501 cactgcttcc atttaatggg gtgcacctgt ccaataggcg tacgtatccg gacagagcac 1561 gtttgcagag gggtctctct ccaggtagct gaaagggaag acctgacgta ctctggttag 1621 gttaggactt gccctcgtgg tggaaacttt tcttaaaaag ttataaccaa cttttctatt 1681 aaaagtggga attaggagag aaggtagggg ttgggaatca gagagaatgg ctttggtctc 1741 ttgcttgtgg gactagcctg gcttgggact aaatgccctg ctctgaacac aagcttagta 1801 taaactgatg gatatcccta ccttgaaaga agaaaaggtt cttactgctt ggtccttgat 1861 ttatcacaca aagcagaata gtatttttat atttaaatgt aaagacaaaa aactatatgt 1921 atggttttgt ggattatgtg tgttttggct aaaggaaaaa accatccagg tcacggggca 1981 ccaaatttga gacaaatagt cggattagaa ataaagcatc tcattttgag tagagagcaa 2041 ggaagtggtt cttagatggt gatctgggat taggccctca agaccccttt tgggtttctg 2101 ccctgcccac cctctggaga aggtggcact gattagttaa cagaccaaca ccgttactag 2161 cagtcactga tctccgtggc tttggtttaa aagacacact tgtccacata ggtttagaga 2221 taagagttgg ctggtcaact tgagcatgtt actgacagag ggggtattgg ggttattttc 2281 tggtaggaat agcatgtcac taaagcaggc ctttgatatt aaatttttta aaaagcaaaa 2341 ttatagaagt ttagatttta atcaaatttg tagggtttct aggtatttac agatgctgtt 2401 gctcaacgtc tcctacctct gctctgagag atgggacagg ctgagtcaaa cactgtaatt 2461 ttgtatcttg atgtctttgt taagactgct gaagaattat tttttctttt ataataagga 2521 ataaacccca cctttattcc ttcatttcat ctaccatttt ctggttcttg tgttggctgt 2581 ggcaggccag ctgtggtttt cttttgccat gacaacttct aattgccatg tacagtatgt 2641 tcaaagtcaa ataactcctc attgtaaaca aactgtgtaa ctgcccaaag cagcacttat 2701 aaatcagcct aacataaaaa aaaaaaa // LOCUS HSGLUTTR 1912 bp RNA PRI 08-NOV-1994 DEFINITION H.sapiens mRNA for glutamate transporter. ACCESSION Z32517 NID g471246 KEYWORDS glutamate transporter. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1912) AUTHORS Manfras,B.J., Rudert,W.A., Trucco,M. and Boehm,B.O. TITLE Cloning and characterization of a glutamate transporter cDNA from human brain and pancreas JOURNAL Biochim. Biophys. Acta 1195 (1), 185-188 (1994) MEDLINE 95002073 REFERENCE 2 (bases 1 to 1912) AUTHORS Manfras,B.J. TITLE Direct Submission JOURNAL Submitted (07-APR-1994) Manfras B. J., University of Ulm, Department of Internal Medicine, Robert-Koch-Str.8, Ulm, Germany, 89081 FEATURES Location/Qualifiers source 1..1912 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="GLTRpa1" /tissue_type="pancreas" /germline mRNA 1..1912 CDS 90..1814 /codon_start=1 /product="glutamate transporter" /db_xref="PID:g488752" /translation="MASTEGANNMPKQVEVRMHDSHLGSEGPKHRHLGLRLCDKLGKN LLLTLTVFGVILGAVCGGLLRLASPIHPDVVMLIAFPGDILMRMLKMLILPLIISSLI TGLSGLDAKASGRLGTRAMVYYMSTTIIAAVLGVILVLGIHPGNPKLKKQLGAGKKND EVSSLDAFLDLIRNLFPENLVQACFQQIQTVTKKVLVAPPPDEEANATSAVVSLLNET VTEVPEETKMVIKKGLEFKDGMNVLGLIGFFIAFAIPMGKMGDQGQADGGFLQHFERD CNEVSDHDHVVLSLGIACLICGKIIAIKDLEVVARQLGMYMVTVIIGLIIHGGIFLPL IYFVVTRKNPFSFFAGIFQAWITALGTASSAGTLPVTFRCLEENLGIDKRVTRFVLPV GATINMDGTALYEAVAAIFIAQMNGVVLDGGQIVTVSLTATLASVGAASIPSAGLVTM LLILTAVGLPTEDISLLVAVDWLLDRMRTSVNVVGDSFGAGIVYHLSKSELDTIDSQH RVHEDIEMTKTQSIYDDMKNHRESNSNQCVFAAHNSVIVDECKVTLAGNGKSADRVLE EEPGKREK" BASE COUNT 471 a 460 c 504 g 477 t ORIGIN 1 cccgcacttc gcgctcaccc cggcgtccgc tttctccctc gcccacatct gccggatagt 61 tctgaagagg agggggcgtt ccccagacca tggcatctac ggaaggtgcc aacaatatgc 121 ccaagcaggt ggaagtgcga atgcacgaca gtcatcttgg ctcagaggga cccaagcacc 181 ggcacctggg cctgcgcctg tgtgacaagc tggggaagaa tctgctgctc accctgacgg 241 tgtttggtgt catcctggga gcagtgtgtg gagggcttct tcgcttggca tctcccatcc 301 accctgatgt ggttatgtta atagccttcc caggggatat actcatgagg atgctaaaaa 361 tgctcattct ccctctaatc atctccagct taatcacagg gttgtcaggc ctggatgcta 421 aggctagtgg ccgcttgggc acgagagcca tggtgtatta catgtccacg accatcattg 481 ctgcagtact gggggtcatt ctggtcttgg gtatccatcc aggcaatccc aagctcaaga 541 agcagctggg ggctgggaag aagaatgatg aagtgtccag cctggatgcc ttcctggacc 601 ttattcgaaa tctcttccct gaaaaccttg tccaagcctg ctttcaacag attcaaacag 661 tgacgaagaa agtcctggtt gcaccaccgc cggacgagga ggccaacgca accagcgctg 721 ttgtctctct gttgaacgag actgtgactg aggtgccgga ggagactaag atggttatca 781 agaagggcct ggagttcaag gatgggatga acgtcttagg tctgataggg tttttcattg 841 cttttgccat ccctatgggg aagatgggag atcaaggcca agctgatggt ggatttctcc 901 aacattttga acgagattgt aatgaagtta gtgatcatga tcatgtggta ctctccctgg 961 gtatcgcctg cctgatctgt ggaaagatca ttgcaatcaa ggacttagaa gtggttgcta 1021 ggcaactggg gatgtacatg gtaacagtga tcataggcct catcatccac gggggcatct 1081 ttctcccctt gatttacttt gtagtgacca ggaaaaaccc cttctccttt tttgctggca 1141 ttttccaagc ttggatcact gccctgggca ccgcttccag tgctggaact ttgcctgtca 1201 cctttcgttg cctggaagaa aatctgggga ttgataagcg tgtgactaga ttcgtccttc 1261 ctgttggagc aaccattaac atggatggta cagcccttta tgaagcggta gccgccatct 1321 ttatagccca aatgaatggt gttgtcctgg atggaggaca gattgtgact gtaagcctca 1381 cagccaccct ggcaagcgtc ggcgcggcca gtatccccag tgccgggttg gtcaccatgc 1441 tcctcattct gacagccgtg ggcctgccaa cagaggacat cagcctgctg gtggctgtgg 1501 actggctgct ggacaggatg agaacttcag tcaatgttgt gggtgactct tttggggctg 1561 ggatagtcta tcacctctcc aagtctgagc tggataccat tgactcccag catcgagtgc 1621 atgaagatat tgaaatgacc aagactcaat ccatttatga tgacatgaag aaccacaggg 1681 aaagcaactc taatcaatgt gtctttgctg cacacaactc tgtcatagta gatgaatgca 1741 aggtaactct tgcaggcaat ggaaagtcag ccgaccgagt gttggaggaa gaacctggga 1801 aacgtgagaa ataaggatat gagtctcagc aaattcttga ataaactccc cagcgtatcc 1861 tatggtaact gatgatataa acaagctttc tttaaaaaaa aaaaaaaaaa aa // LOCUS HSGLYCA 967 bp RNA PRI 31-MAR-1995 DEFINITION Human mRNA for erythrocyte membrane glycophorin A. ACCESSION X08054 NID g31834 KEYWORDS antigen; glycophorin A; membrane protein; sialoglycoprotein alpha. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 967) AUTHORS Tate,C.G. TITLE Direct Submission JOURNAL Submitted (07-JUL-1988) Tate C.G., Department of Biochemistry, School of Medical Sciences, University of Bristol, Bristol, United Kingdom REFERENCE 2 (bases 1 to 967) AUTHORS Tate,C.G. and Tanner,M.J. TITLE Isolation of cDNA clones for human erythrocyte membrane sialoglycoproteins alpha and delta JOURNAL Biochem. J. 254 (3), 743-750 (1988) MEDLINE 89061610 COMMENT [1]: see J02578 for overlapping sequence. FEATURES Location/Qualifiers source 1..967 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="reticulocyte" /clone_lib="lambdagt11" /clone="ALP1a, ALP1b" /chromosome="chromosome 4, q28-q31" CDS 57..509 /codon_start=1 /product="preglycophorin A" /db_xref="PID:g31835" /db_xref="SWISS-PROT:P02724" /translation="MYGKIIFVLLLSEIVSISALSTTEVAMHTSTSSSVTKSYISSQT NDTHKRDTYAATPRAHEVSEISVRTVYPPEEETGERVQLAHHFSEPEITLIIFGVMAG VIGTILLISYGIRRLIKKSPSDVKPLPSPDTDVPLSSVEIENPETSDQ" sig_peptide 57..113 /note="signal peptide (AA -19 to -1)" mat_peptide 114..506 /note="glycophorin A (AA 1 - 131)" polyA_site 967 /note="polyA site" BASE COUNT 316 a 175 c 170 g 306 t ORIGIN 1 agttgtcttt ggtagttttt ttgcactaac ttcaggaacc agctcatgat ctcaggatgt 61 atggaaaaat aatctttgta ttactattgt cagaaattgt gagcatatca gcattaagta 121 ccactgaggt ggcaatgcac acttcaactt cttcttcagt cacaaagagt tacatctcat 181 cacagacaaa tgatacgcac aaacgggaca catatgcagc cactcctaga gctcatgaag 241 tttcagaaat ttctgttaga actgtttacc ctccagaaga ggaaaccgga gaaagggtac 301 aacttgccca tcatttctct gaaccagaga taacactcat tatttttggg gtgatggctg 361 gtgttattgg aacgatcctc ttaatttctt acggtattcg ccgactgata aagaaaagcc 421 catctgatgt aaaacctctc ccctcacctg acacagacgt gcctttaagt tctgttgaaa 481 tagaaaatcc agagacaagt gatcaatgag aatctgttca ccaaaccaaa tgtggaaaga 541 acacaaagaa gacataagac ttcagtcaag tgaaaaatta acacgtggac tggacactcc 601 aataaattat atacctgcct aagttgtaca atttcagaat gcaattttca ttataatgag 661 ttccagtgac tcaatgatgg ggaaaaaaat ctctgctcat taatatttca agataaagaa 721 caaatgtttc cttgaatgct tgcttttgtg tgttagcata atttttagaa ttgtttgaga 781 attctgatcc aaaactttag ttgaattcat ctacgtttgt ttaatattaa cttaacctat 841 tctattgtat tataatgatg attctgtcaa atgaaaggct tgaaatacct agatgaagtt 901 tagattttct tcctattgta aacttttgag tctggtttca ttgttttaaa taaattaagg 961 ggacact // LOCUS HSGLYGEN 1563 bp RNA PRI 03-JUN-1994 DEFINITION H.sapiens mRNA for glycogenin. ACCESSION X79537 NID g496894 KEYWORDS glycogenin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1563) AUTHORS Leffers,H. TITLE Direct Submission JOURNAL Submitted (01-JUN-1994) H. Leffers, Inst. of Medical Research Biochemistry & Danish Centre for Human Genome Research, Ole Worms Alle 170, Aarhus Univ., 8000 Aarhus C, DENMARK REFERENCE 2 (bases 1 to 1563) AUTHORS Leffers,H., Wiemann,S. and Ansorge,W. TITLE Cloning and sequencing of a cDNA encoding human glycogenin JOURNAL Unpublished FEATURES Location/Qualifiers source 1..1563 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="skin" /cell_type="keratinocyte" /clone_lib="lambda ZapII" /clone="nuk_169" /cell_line="non fractionated non cultured normal keratinocytes" CDS 84..923 /codon_start=1 /product="glycogenin" /db_xref="PID:g496895" /db_xref="SWISS-PROT:P46976" /translation="MTDQAFVTLTTNDAYAKGALVLGSSLKQHRTTRRLVVLATPQVS DSMRKVLETVFDEVIMVDVLDSGDSAHLTLMKRPELGVTLTKLHCWSLTQYSKCVFMD ADTLVLANIDDLFDREELSAAPDPGWPDCFNSGVFVYQPSVETYNQLLHLASEQGSFD GGDQGILNTFFSSWATTDIRKHLPFIYNLSSISIYSYLPAFKVKMSQEPYHICPLGRS QLWHSRLYPRKNGRNDGNRARLIIWEQIPLTTSRGNLTLTSSRNTAFFCEHIHFTSLV SDT" polyA_signal 1544..1549 BASE COUNT 418 a 355 c 349 g 441 t ORIGIN 1 gctggccgcg ctccctcccg gtgccggctt ctctgagtca ccaacctgag gctgccccgg 61 ccgcctgcgc acccggcagc accatgacag atcaggcctt tgtgacacta accacaaacg 121 atgcctacgc caaaggtgcc ctggtcctgg gatcatctct gaaacagcac aggaccacca 181 ggaggctggt cgtgctcgcc acccctcagg tctcagactc catgagaaaa gttttagaga 241 cagtctttga tgaagtcatc atggtagatg tcttggacag tggcgattct gctcatctaa 301 ccttaatgaa gaggccagag ttgggtgtca cgctgacaaa gctccactgc tggtcgctta 361 cacagtattc aaaatgtgta ttcatggatg cagatactct ggtcctagca aatattgatg 421 atctttttga cagagaagaa ttgtcagcag caccagaccc agggtggcct gactgcttca 481 attccggagt cttcgtttat cagccttcag ttgaaacata caatcagctg ttgcatcttg 541 cttctgagca aggtagtttt gatggtgggg accaaggcat actgaacaca ttttttagca 601 gctgggcaac aacagatatc agaaaacacc tgccgtttat ttataaccta agcagcatct 661 ctatatactc ctacctcccg gcatttaaag tgaagatgtc tcaggagcca tatcacatct 721 gtcccttggg gagatcccag ctatggcaca gccgtttgta tcctcggaag aacggaagga 781 acgatgggaa cagggccagg ctgattatat gggagcagat tcctttgaca acatcaagag 841 gaaacttgac acttacctcc agtagaaaca ctgcattttt ctgtgaacac atccacttca 901 caagccttgt ttctgatact tagtatctag agctgggttg agaaaagtct gttacagttg 961 ctagaggttt tcattaaaac ttatcagatg agaggctttt ttaggataag aggtgagaac 1021 tgggcaaaag ttgtgaagca gcaattctgt tatatggaca gtgttctgct ttttaatcct 1081 atttagcttg tttcagaaat tctcactttt gttgactgcc aacatacaaa gtaagggaaa 1141 ctcaagatat taagatggct gtatcagttc ttaaaatctg cagagcctgg ttcaaaatca 1201 gtcactccct tcagaagcag acatggcatc tgttccttgc ttgcttgttg gttgtgtacc 1261 tttcacgaga cctgaatttt agaattgccc agtgctgcca gagtgagtga gtgtaattct 1321 cctttcaggt aaagataggc tatctcaaca ctgctgagtg attcataaac atatcaacca 1381 atagcattaa cccattttat ttcctgtcct tagtgtctga agatgctcac cagttttctg 1441 tgtacagtaa ggcagcatgc taaaatgctt ttgttcagtt ctgtatattt gaaaatagca 1501 gtgtgttctc tgatggttac ctgcagtggc accctgtaca aaaaataaaa gacttattgc 1561 tgt // LOCUS HSGLYPIC 3692 bp RNA PRI 08-MAR-1991 DEFINITION Human mRNA for heparan sulfate proteaglycan (glypican). ACCESSION X54232 NID g31846 KEYWORDS heparan sulfate proteoglycan; suppressor gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3692) AUTHORS David,G. TITLE Direct Submission JOURNAL Submitted (23-JUL-1990) David G., Center for Human Genetics, Campus Gasthuisberg, Onderwijs EN Navorsing N6, Herestraat 49, B-3000 Leuven, Belgium REFERENCE 2 (bases 1 to 3692) AUTHORS David,G., Lories,V., Decock,B., Marynen,P., Cassiman,J.J. and Van den Berghe,H. TITLE Molecular cloning of a phosphatidylinositol-anchored membrane heparan sulfate proteoglycan from human lung fibroblasts JOURNAL J. Cell Biol. 111 (6 Pt 2), 3165-3176 (1990) MEDLINE 91100427 FEATURES Location/Qualifiers source 1..3692 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lung fibroblast" /clone_lib="Hu LF LZAP cDNA" /clone="64K3, 64K4" mRNA 1..3692 /note="glypican" /evidence=experimental CDS 222..1898 /codon_start=1 /product="glypican" /db_xref="PID:g31847" /db_xref="SWISS-PROT:P35052" /translation="MELRARGWWLLCAAAALVACARGDPASKSRSCGEVRQIYGAKGF SLSDVPQAEISGEHLRICPQGYTCCTSEMEENLANRSHAELETALRDSSRVLQAMLAT QLRSFDDHFQHLLNDSERTLQATFPGAFGELYTQNARAFRDLYSELRLYYRGANLHLE ETLAEFWARLLERLFKQLHPQLLLPDDYLDCLGKQAEALRPFGEAPRELRLRATRAFV AARSFVQGLGVASDVVRKVAQVPLGPECSRAVMKLVYCAHCLGVPGARPCPDYCRNVL KGCLANQADLDAEWRNLLDSMVLITDKFWGTSGVESVIGSVHTWLAEAINALQDNRDT LTAKVIQGCGNPKVNPQGPGPEEKRRRGKLAPRERPPSGTLEKLVSEAKAQLRDVQDF WISLPGTLCSEKMALSTASDDRCWNGMARGRYLPEVMGDGLANQINNPEVEVDITKPD MTIRQQIMQLKIMTNRLRSAYNGNDVDFQDASDDGSGSGSGDGCLDDLCGRKVSRKSS SSRTPLTHALPGLSEQEGQKTSAASCPQPPTFLLPLLLFLALTVARPRWR" sig_peptide 222..290 /note="glypican" mat_peptide 291..1895 /evidence=experimental /product="glypican" BASE COUNT 639 a 1270 c 1186 g 597 t ORIGIN 1 ggctgcccga gcgagcgttc ggacctcgca ccccgcgcgc cccgcgccgc cgccgccgcc 61 ggcttttgtt gtctccgcct cctcggccgc cgccgcctct ggaccgcgag ccgcgcgcgc 121 cgggaccttg gctctgccct tcgcgggcgg gaactgcgca ggacccggcc aggatccgag 181 agaggcgcgg gcgggtggcc gggggcgccg ccggccccgc catggagctc cgggcccgag 241 gctggtggct gctatgtgcg gccgcagcgc tggtcgcctg cgcccgcggg gacccggcca 301 gcaagagccg gagctgcggc gaggtccgcc agatctacgg agccaagggc ttcagcctga 361 gcgacgtgcc ccaggcggag atctcgggtg agcacctgcg gatctgtccc cagggctaca 421 cctgctgcac cagcgagatg gaggagaacc tggccaaccg cagccatgcc gagctggaga 481 ccgcgctccg ggacagcagc cgcgtcctgc aggccatgct tgccacccag ctgcgcagct 541 tcgatgacca cttccagcac ctgctgaacg actcggagcg gacgctgcag gccaccttcc 601 ccggcgcctt cggagagctg tacacgcaga acgcgagggc cttccgggac ctgtactcag 661 agctgcgcct gtactaccgc ggtgccaacc tgcacctgga ggagacgctg gccgagttct 721 gggcccgcct gctcgagcgc ctcttcaagc agctgcaccc ccagctgctg ctgcctgatg 781 actacctgga ctgcctgggc aagcaggccg aggcgctgcg gcccttcggg gaggccccga 841 gagagctgcg cctgcgggcc acccgtgcct tcgtggctgc tcgctccttt gtgcagggcc 901 tgggcgtggc cagcgacgtg gtccggaaag tggctcaggt ccccctgggc ccggagtgct 961 cgagagctgt catgaagctg gtctactgtg ctcactgcct gggagtcccc ggcgccaggc 1021 cctgccctga ctattgccga aatgtgctca agggctgcct tgccaaccag gccgacctgg 1081 acgccgagtg gaggaacctc ctggactcca tggtgctcat caccgacaag ttctggggta 1141 catcgggtgt ggagagtgtc atcggcagcg tgcacacgtg gctggcggag gccatcaacg 1201 ccctccagga caacagggac acgctcacgg ccaaggtcat ccagggctgc gggaacccca 1261 aggtcaaccc ccagggccct gggcctgagg agaagcggcg ccggggcaag ctggccccgc 1321 gggagaggcc accttcaggc acgctggaga agctggtctc tgaagccaag gcccagctcc 1381 gcgacgtcca ggacttctgg atcagcctcc cagggacact gtgcagtgag aagatggccc 1441 tgagcactgc cagtgatgac cgctgctgga acgggatggc cagaggccgg tacctccccg 1501 aggtcatggg tgacggcctg gccaaccaga tcaacaaccc cgaggtggag gtggacatca 1561 ccaagccgga catgaccatc cggcagcaga tcatgcagct gaagatcatg accaaccggc 1621 tgcgcagcgc ctacaacggc aacgacgtgg acttccagga cgccagtgac gacggcagcg 1681 gctcgggcag cggtgatggc tgtctggatg acctctgcgg ccggaaggtc agcaggaaga 1741 gctccagctc ccggacgccc ttgacccatg ccctcccagg cctgtcagag caggaaggac 1801 agaagacctc ggctgccagc tgcccccagc ccccgacctt cctcctgccc ctcctcctct 1861 tcctggccct tacagtagcc aggccccggt ggcggtaact gccccaaggc cccagggaca 1921 gaggccaagg actgactttg ccaaaaatac aacacagacg atatttaatt cacctcagcc 1981 tggagaggcc tggggtggga cagggagggc cggcggctct gagcaggggc aggcgcagag 2041 gtcccagccc caggcctggc ctcgcctgcc tttctgcctt ttaattttgt atgaggtcct 2101 caggtcagct gggagccagt gtgcccaaaa gccatgtatt tcagggacct caggggcacc 2161 tccggctgcc tagccctccc cccagctccc tgcaccgccg cagaagcagc ccctcgaggc 2221 ctacagagga ggcctcaaag caacccgctg gagcccacag cgagcctgtg ccttcctccc 2281 cgcctcctcc cactgggact cccagcagag cccaccagcc agccctggcc caccccccag 2341 cctccagaga agccccgcac gggctgtctg ggtgtccgcc atccagggtc tggcagagcc 2401 tctgagatga tgcatgatgc cctcccctca gcgcaggctg cagagcccgg ccccacctcc 2461 ctgcgccctt gaggggcccc agcgtctgca gggtgacgcc tgagacagca ccactgctga 2521 ggagtctgag gactgtcctc ccacagaccc tgcagtgagg ggccctccat gcgcagatga 2581 ggggccactg acccacctgc gcttctgctg gaggagggga agctgggccc aaaggcccag 2641 ggaggcagcg tgggctctgc caatgtgggc tgcccctcgc acacagggct cacagggcag 2701 gccttgctgg ggtccagggc tgttggagga ccccgagggc tgaggagcag ccaggacccg 2761 cctgctccca tcctcaccca gatcaggaac cagggcctcc ctgttcacgg tgacacaggt 2821 cagggctcag agtgaccctc ggctgtcacc tgctcacagg gatgctggtg gctggtgaga 2881 ccccgcactg cacacgggaa tgcctaggtc ccttcccgac ccagccagct gcactgcagg 2941 gcacggggac ctggatagtt aagggctttt ccaaacatgc atccatttac tgacacttcc 3001 tgtccttgtt catggagagc tgttcgctcc tcccagatgg cttcggaggc ccgcagggcc 3061 caccttggac cctggtgacc tcctgtcact cactgaggcc atcagggccc tgccccaggc 3121 ctggacgggc cctccttccc tcctgtgccc cagctgccag gtggccctgg ggaggggtgg 3181 tgtggtgttg ggaaggggtc ctgcaggggg aggaggactt ggagggtctg ggggcagctg 3241 tcctgaaccg actgaccctg aggaggccgc ttagtgctgc tttgcttttc atcaccgtcc 3301 cgcacagtgg acggaggtcc ccggttgctg gtcaggtccc catggcttgt tctctggaac 3361 ctgactttag atgttttggg atcaggagcc cccaacacag gcaagtccac cccataataa 3421 ccctgccagt gccagggtgg gctggggact ctggcacagt gatgccgggc gccaggacag 3481 cagcactccc gctgcacaca gacggcctag gggtggcgct cagaccccac cctacgctca 3541 tctctggaag gggcagccct gagtggtcac tggtcagggc agtggccaag cctgctgtgt 3601 ccttcctcca caaggtcccc ccaccgctca gtgtcagcgg gtgacgtgtg ttcttttgag 3661 tccttgtatg aataaaaggc tggaaaccta aa // LOCUS HSGLYRA1 1857 bp RNA PRI 28-MAY-1993 DEFINITION H.sapiens alpha-2 strychnine binding subunit of inhibitory glycine receptor mRNA. ACCESSION X52008 NID g31848 KEYWORDS glycine receptor; inhibitory glycine receptor; strychnine binding. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1857) AUTHORS Grenningloh,G., Schmieden,V., Schofield,P.R., Seeburg,P.H., Siddique,T., Mohandas,T.K., Becker,C.M. and Betz,H. TITLE Alpha subunit variants of the human glycine receptor: primary structures, functional expression and chromosomal localization of the corresponding genes JOURNAL EMBO J. 9 (3), 771-776 (1990) MEDLINE 90183975 FEATURES Location/Qualifiers source 1..1857 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="brain" gene 388..1746 /gene="GLYRA1" CDS 388..1746 /gene="GLYRA1" /note="strychnine binding alpha-2 subunit" /codon_start=1 /product="inhibitory glycine receptor" /db_xref="PID:g31849" /db_xref="SWISS-PROT:P23416" /translation="MNRQLVNILTALFAFFLETNHFRTAFCKDHDSRSGKQPSQTLSP SDFLDKLMGRTSGYDARIRPNFKGPPVNVTCNIFINSFGSVTETTMDYRVNIFLRQQW NDSRLAYSEYPDDSLDLDPSMLDSIWKPDLFFANEKGANFHDVTTDNKLLRISKNGKV LYSIRLTLTLSCPMDLKNFPMDVQTCTMQLESFGYTMNDLIFEWLSDGPVQVAEGLTL PQFILKEEKELGYCTKHYNTGKFTCIEVKFHLERQMGYYLIQMYIPSLLIVILSWVSF WINMDAAPARVALGITTVLTMTTQSSGSRASLPKVSYVKAIDIWMAVCLLFVFAALLE YAAVNFVSRQHKEFLRLRRRQKRQNKEEDVTRESRFNFSGYGMGHCLQVKDGTAVKAT PANPLPQPPKDGDAIKKKFVDRAKRIDTISRAAFPLAFLIFNIFYWITYKIIRHEDVH KK" BASE COUNT 526 a 416 c 417 g 498 t ORIGIN 1 ggctttgcta aacagaaaag atataaaaca aaagccacag ctatctagca tggcattgtc 61 accaactccc tttgcatggt gatgcgatta aggtagcagc atttttatta ttcaggaaaa 121 gcagctgggg gattcatcag ttctgaggct ttgtctttct gggttaactg atggtcccaa 181 gcctcggttt gacctgacca tgatgcccag gactggcact ttttcttttt tctcagcaaa 241 ctgtacaaaa ccaaatctct ttttgatttt caaggaaact aggttcctgc caaattttga 301 ttgaatctgg acaataaaca gacactttgt cctagcatct ttctggaatc atttcgggat 361 atttccacaa gcaacacaga aacaggaatg aaccggcagc tagtgaacat tttgacagcc 421 ttgtttgcat ttttcttaga gacaaaccac ttcaggacgg ctttctgcaa agaccatgac 481 tccaggtctg gaaaacaacc ttcacagacc ctatctcctt cagatttctt ggacaagtta 541 atgggaagga catcaggata tgatgcaaga atcaggccaa attttaaagg tcctccagta 601 aacgttactt gcaatatttt tatcaacagt tttggatcag tcacagaaac gaccatggac 661 taccgagtga atatttttct gagacaacag tggaatgatt cacggctggc gtacagtgag 721 tacccagatg actccctgga cttggaccca tccatgctag actccatttg gaaaccagat 781 ttgttctttg ccaatgagaa gggtgccaac ttccacgatg tcaccactga caacaaattg 841 ctacggattt cgaaaaatgg caaagtgctc tacagtatca gactcacctt gaccttatcc 901 tgtcccatgg acttgaagaa ctttccgatg gatgtccaga cctgtacaat gcagctggag 961 agttttgggt acacgatgaa tgacctgata tttgagtggt taagtgatgg tccagtgcaa 1021 gttgctgaag gattgaccct gccccagttt attttgaaag aagagaagga acttggctac 1081 tgtacaaagc actacaacac tggaaagttt acctgcattg aggtcaagtt tcacctggaa 1141 cgccaaatgg gatattattt gatccagatg tacatcccaa gcctgcttat agtaattttg 1201 tcctgggttt ccttttggat aaatatggat gcagcccctg ccagggtcgc actgggcatc 1261 accacagtct taacgatgac cacccagagt tcaggctcca gggcatctct gccaaaggtc 1321 tcctatgtaa aagcgattga catctggatg gcggtgtgcc ttctgtttgt gtttgctgcc 1381 ttactggaat acgcagcggt gaacttcgtc tccaggcaac acaaggagtt cctgcgcctc 1441 cgaagaagac agaagaggca gaataaggaa gaagacgtta ctcgtgaaag tcgttttaat 1501 tttagcggtt atgggatggg tcactgcctc caagtgaaag atggaacagc tgtcaaggcc 1561 acacctgcca acccactccc acaaccgcca aaagatggag atgctatcaa gaagaagttt 1621 gtggaccggg caaaaaggat tgacacgata tctcgagctg ccttcccatt ggccttcctc 1681 attttcaaca tcttttactg gatcacatac aagatcattc ggcatgaaga tgtccacaag 1741 aaatagatgt gccctacaga ccctgggacc ttcttgcctc agtgttgtgc ttgtaaatac 1801 acagtgaaat tgtctttata tcactttgac agaggagaag attgagggag gggggag // LOCUS HSGLYRA2 1715 bp RNA PRI 28-MAY-1993 DEFINITION H.sapiens alpha-1 strychnine binding subunit of inhibitory glycine receptor mRNA. ACCESSION X52009 NID g31850 KEYWORDS glycine receptor; inhibitory glycine receptor; strychnine binding. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1715) AUTHORS Grenningloh,G., Schmieden,V., Schofield,P.R., Seeburg,P.H., Siddique,T., Mohandas,T.K., Becker,C.M. and Betz,H. TITLE Alpha subunit variants of the human glycine receptor: primary structures, functional expression and chromosomal localization of the corresponding genes JOURNAL EMBO J. 9 (3), 771-776 (1990) MEDLINE 90183975 FEATURES Location/Qualifiers source 1..1715 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="brain" /chromosome="X" /map="Xp21.2-p22.1" gene 297..1646 /gene="GLYRA2" CDS 297..1646 /gene="GLYRA2" /note="strychnine binding alpha-1 subunit" /codon_start=1 /product="inhibitory glycine receptor" /db_xref="PID:g31851" /db_xref="SWISS-PROT:P23415" /translation="MYSFNTLRLYLSGAIVFFSLAASKEAEAARSATKPMSPSDFLDK LMGRTSGYDARIRPNFKGPPVNVSCNIFINSFGSIAETTMDYRVNIFLRQQWNDPRLA YNEYPDDSLDLDPSMLDSIWKPDLFFANEKGAHFHEITTDNKLLRISRNGNVLYSIRI TLTLACPMDLKNFPMDVQTCIMQLESFGYTMNDLIFEWQEQGAVQVADGLTLPQFILK EEKDLRYCTKHYNTGKFTCIEARFHLERQMGYYLIQMYIPSLLIVILSWISFWINMDA APARVGLGITTVLTMTTQSSGSRASLPKVSYVKAIDIWMAVCLLFVFSALLEYAAVNF VSRQHKELLRFRRKRRHHKEDEAGEGRFNFSAYGMGPACLQAKDGISVKGANNSNTTN PPPAPSKSPEEMRKLFIQRAKKIDKISRIGFPMAFLIFNMFYWIIYKIVRREDVHNQ" BASE COUNT 424 a 480 c 407 g 404 t ORIGIN 1 cgggaggcaa cagacacgct ggagtttaac aaacagcaat actcttcgcg ctcctgaaaa 61 gcaggtctgg acgctctccg tggtgctgaa acgcctcgca gccgccgctg tccgtggtat 121 ctacgacccc ctcgctccaa tttcccctgg ggctctccct ccgcgcccct gttccccgcc 181 tccctttaac atctggatta ttttttgcaa tagcgctttc tggttttgta agtgccaatt 241 tgaaacattt tttgccccca taactcgtgg actacaaagc acaaaggacc tgaaaaatgt 301 acagcttcaa tactcttcga ctctaccttt cgggagccat tgtattcttc agccttgctg 361 cttctaagga ggctgaagct gctcgctccg caaccaagcc tatgtcaccc tcggatttcc 421 tggataagct aatggggaga acctccggat atgatgccag gatcaggccc aattttaaag 481 gtcccccagt gaacgtgagc tgcaacattt tcatcaacag ctttggttcc attgctgaga 541 caaccatgga ctatagggtc aacatcttcc tgcggcagca atggaacgac ccccgcctgg 601 cctataatga ataccctgac gactctctgg acctggaccc atccatgctg gactccatct 661 ggaaacctga cctgttcttt gccaacgaga agggggccca cttccatgag atcaccacag 721 acaacaaatt gctaaggatc tcccggaatg ggaatgtcct ctacagcatc agaatcaccc 781 tgacactggc ctgccccatg gacttgaaga atttccccat ggatgtccag acatgtatca 841 tgcaactgga aagctttgga tatacgatga atgacctcat ctttgagtgg caggaacagg 901 gagccgtgca ggtagcagat ggactaactc tgccccagtt tatcttgaag gaagagaagg 961 acttgagata ctgcaccaag cactacaaca caggtaaatt cacctgcatt gaggcccggt 1021 tccacctgga gcggcagatg ggttactacc tgattcagat gtatattccc agcctgctca 1081 ttgtcatcct ctcatggatc tccttctgga tcaacatgga tgctgcacct gctcgtgtgg 1141 gcctaggcat caccactgtg ctcaccatga ccacccagag ctccggctct cgagcatctc 1201 tgcccaaggt gtcctatgtg aaagccattg acatttggat ggcagtttgc ctgctctttg 1261 tgttctcagc cctattagaa tatgctgccg ttaactttgt gtctcggcaa cataaggagc 1321 tgctccgatt caggaggaag cggagacatc acaaggagga tgaagctgga gaaggccgct 1381 ttaacttctc tgcctatggg atgggcccag cctgtctaca ggccaaggat ggcatctcag 1441 tcaagggcgc caacaacagt aacaccacca acccccctcc tgcaccatct aagtccccag 1501 aggagatgcg aaaactcttc atccagaggg ccaagaagat cgacaaaata tcccgcattg 1561 gcttccccat ggccttcctc attttcaaca tgttctactg gatcatctac aagattgtcc 1621 gtagagagga cgtccacaac cagtgaaggg tctgaaaggt tgggggaggc tgggagaggg 1681 gaacgtggga atagcacagg aatctgagag acggt // LOCUS HSGM2APT 2436 bp RNA PRI 15-FEB-1995 DEFINITION H.sapiens mRNA for GM2 activator protein. ACCESSION X62078 NID g313158 KEYWORDS G(M2) activator protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2436) AUTHORS Klima,H., Tanaka,A., Schnabel,D., Nakano,T., Schroder,M., Suzuki,K. and Sandhoff,K. TITLE Characterization of full-length cDNAs and the gene coding for the human GM2 activator protein JOURNAL FEBS Lett. 289 (2), 260-264 (1991) MEDLINE 92008637 REFERENCE 2 (bases 1 to 2436) AUTHORS Klima,H., Klein,A., van Echten,G., Schwarzmann,G., Suzuki,K. and Sandhoff,K. TITLE Over-expression of a functionally active human GM2-activator protein in Escherichia coli JOURNAL Biochem. J. 292 (Pt 2), 571-576 (1993) MEDLINE 93277527 FEATURES Location/Qualifiers source 1..2436 /organism="Homo sapiens" /isolate="patient with juvenile form of Sandhoff disease" /db_xref="taxon:9606" /clone_lib="cDNA" /cell_type="fibroblast" /clone="pUC18" CDS 59..640 /note="alternative" /codon_start=1 /product="GM2 activator protein" /db_xref="PID:g673415" /translation="MQSLMQAPLLIALGLLLATPAQAHLKKPSQLSSFSWDNCDEGKD PAVIRSLTLEPDPIVVPGNVTLSVVGSTSVPLSSPLKVDLVLEKEVAGLWIKIPCTDY IGSCTFEHFCDVLDMLIPTGEPCPEPLRTYGLPCHCPFKEGTYSLPKSEFVVPDLELP SWLTTGNYRIESVLSSSGKRLGCIKIAASLKGI" CDS 71..640 /note="alternative" /codon_start=1 /product="GM2 activator protein" /db_xref="PID:g673416" /translation="MQAPLLIALGLLLATPAQAHLKKPSQLSSFSWDNCDEGKDPAVI RSLTLEPDPIVVPGNVTLSVVGSTSVPLSSPLKVDLVLEKEVAGLWIKIPCTDYIGSC TFEHFCDVLDMLIPTGEPCPEPLRTYGLPCHCPFKEGTYSLPKSEFVVPDLELPSWLT TGNYRIESVLSSSGKRLGCIKIAASLKGI" BASE COUNT 570 a 632 c 576 g 658 t ORIGIN 1 aaggcacctc tgccgccaca gaccttgcag ttaactccgc cctgacccac ccttcccgat 61 gcagtccctg atgcaggctc ccctcctgat cgccctgggc ttgcttctcg cgacccctgc 121 gcaagcccac ctgaaaaagc catcccagct cagtagcttt tcctgggata actgtgatga 181 agggaaggac cctgcggtga tcagaagcct gactctggag cctgacccca tcgtcgttcc 241 tggaaatgtg accctcagtg tcgtgggcag caccagtgtc cccctgagtt ctcctctgaa 301 ggtggattta gttttggaga aggaggtggc tggcctctgg atcaagatcc catgcacaga 361 ctacattggc agctgtacct ttgaacactt ctgtgatgtg cttgacatgt taattcctac 421 tggggagccc tgcccagagc ccctgcgtac ctatgggctt ccttgccact gtcccttcaa 481 agaaggaacc tactcactgc ccaagagcga attcgttgtg cctgacctgg agctgcccag 541 ttggctcacc accgggaact accgcataga gagcgtcctg agcagcagtg ggaagcgtct 601 gggctgcatc aagatcgctg cctctctaaa gggcatataa catggcatct gccacagcag 661 aatggagcgg tgtgaggaag gtcccttttc ctctgttttg tgtttgccaa ggccaaactc 721 ccactctctg ccccccttta atcccctttc tacagtgagt ccactaccct cactgaaaat 781 cattttgtac cacttacatt ttaggctggg gcaagcagcc ctgacctaag ggagaatgag 841 ttggacagtt cttgatagcc cagggcatct gctgggctga ccacgttact catccccgtt 901 aacattctct ctaaagagcc tcgttcattt ccaaagcagt taaggaatgg gaaccagagt 961 gttttaggac ctgaagaatc tttatgactc tctctctttc actctttttt ttttttgtca 1021 ctaagttaaa agcgaagtga gagtattaac gtttttgttc tcctccggcc ccctgttaca 1081 atgaaggggc aaaagtattt gctcttagtc tattcctccc ttaacttctg tgactaattt 1141 ttatttcctt tctagatttg cccaattaat actagggtgc agtgtatcct ggagaggtag 1201 ggtgtgtggg ggaggaatcc cttgggggag atattaggag tgctctgttg tttacaaact 1261 cacggtaccc gcagggccta gcaagagact taaatgactg ataagaaccg tgagaaacat 1321 gttgcttcca ggcttgattt cgatttttcg cttttttttt ttttgagaca gaatctcact 1381 ttgtcaccag gctggagtgc agtggtgcaa tctcacctca ctgcaacctc cgcctcctgg 1441 gttcaagcaa ttctcctgcc tcagcctccc aagtagcttg gactacaggc cctgccacca 1501 cgcccggcta atttgtgtat ttttagtaga gatggggttt caccatgttg gccaggatgg 1561 tctcgatctc ttgacctcgt gatctgtcca ccttggcctt gcaaagcgct ggattacagg 1621 catgagccac tacacccagc cgatttttcc tttttgatta aagatgctat tacaatgtaa 1681 atatttctta cacagaaagt cacagcacat gtgcccattg atacaaggct gctgaggcct 1741 ggtctccagt tggaaatata attaagggtg gcaaggactg gagtcagttg gagagtgcat 1801 agccagtctg tgaagacaac tgccagatac tggcaatact ccagcctggt gacagagtga 1861 gactctgtct caaaaaaaaa gtttcaatgt ttactcctag agaagccaaa aatccagatt 1921 tgtatatgaa atcttaccat tttaaaagat tggcagctaa ttattttttt aaaaagctgt 1981 gcagtgtgat gtgtcccaaa cggactggct catgggtggc cacgtcacaa cctctgatct 2041 cagaccgtgc atgccttgtc ctcttaagac aactcctgtg gcaccgtttc tccctccaca 2101 gggccaaagc catagtgtcc ggtcccaagg acaaggctct tccagtgcta ggagaggtat 2161 gagcagcctc tcacctgtga gctgtgggga tcacaaggct gcctgcctca gtcttggagt 2221 cctgttgggt gaatgaggca gatgggaaag agcctcacca gcagctgctt ttggagcagg 2281 ggtccaagga agagagggtg gcctcgacat caaactgcct ggatttttct accaccctgt 2341 tacatcataa caacttctga aacacacacc agccctgagt tctgggctca tttgaagcct 2401 ggaatagcaa taaatctttt taacttgcgg acagtt // LOCUS HSGNAT1 1292 bp RNA PRI 12-SEP-1993 DEFINITION Human GNAT1 mRNA for transducin alpha-chain. ACCESSION X15088 NID g31864 KEYWORDS transducin; transducin alpha-chain. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1292) AUTHORS Van Dop,C. TITLE Direct Submission JOURNAL Submitted (21-APR-1989) Van Dop C., Dept. Pediatrics, UCLA Medical Center, 10833 Le Conte Ave., Los Angeles, CA 90024, USA REFERENCE 2 (bases 1 to 1292) AUTHORS Van Dop,C., Medynski,D.C. and Apone,L.M. TITLE Nucleotide sequence for a cDNA encoding the alpha subunit of retinal transducin (GNAT1) isolated from the human eye JOURNAL Nucleic Acids Res. 17 (12), 4887 (1989) MEDLINE 89315237 COMMENT Data kindly reviewed (03-AUG-1989) by Van Dop C. FEATURES Location/Qualifiers source 1..1292 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="eye" /clone="KA27" CDS 104..1156 /note="transducin alpha-chain" /codon_start=1 /db_xref="PID:g31865" /db_xref="SWISS-PROT:P11488" /translation="MGAGASAEEKHSRELEKKLKEDAEKDARTVKLLLLGAGESGKST IVKQMKIIHQDGYSLEECLEFIAIIYGNTLQSILAIVRAMTTLNIQYGDSARQDDARK LMHMADTIEEGTMPKEMSDIIQRLWKDSGIQACFERASEYQLNDSAGYYLSDLERLVT PGYVPTEQDVLRSRVKTTGIIETQFSFKDLNFRMFDVGGQRSEPKKWIHCFEGVTCII FIAALTAYDMVLVEDDEVNRMHESLHLFNSICNHRYFATTSIVLFLNKKDVFFEKVKK AHLSICFPDYDGPNTYEDAGNYIKVQFLELNMRRDVKEIYSHMTCATDTQNVKFCFDA VTDIIIKENLKDCGLF" misc_feature 1253..1258 /note="pot. polyA site" BASE COUNT 311 a 387 c 362 g 232 t ORIGIN 1 aggtcctcct gggccagaag ggttcctggg agccaggttc tgggatcccc tccatccaga 61 agaaccacct gctcactctg tcccttcgcc tgctgctggg accatggggg ctggggccag 121 tgctgaggag aagcactcca gggagctgga aaagaagctg aaagaggacg ctgagaagga 181 tgctcgaacc gtgaagctgc tgcttctggg tgccggtgag tccgggaaga gcaccatcgt 241 caagcagatg aagattatcc accaggacgg gtactcgctg gaagagtgcc tcgagtttat 301 cgccatcatc tacggcaaca cgttgcagtc catcctggcc atcgtacgcg ccatgaccac 361 actcaacatc cagtacggag actctgcacg ccaggacgac gcccggaagc tgatgcacat 421 ggcagacact atcgaggagg gcacgatgcc caaggagatg tcggacatca tccagcggct 481 gtggaaggac tccggtatcc aggcctgttt tgagcgcgcc tcggagtacc agctcaacga 541 ctcggcgggc tactacctct ccgacctgga gcgcctggta accccgggct acgtgcccac 601 cgagcaggac gtgctgcgct cgcgagtcaa gaccactggc atcatcgaga cgcagttctc 661 cttcaaggat ctcaacttcc ggatgttcga tgtgggcggg cagcgctcgg agccgaagaa 721 gtggatccac tgcttcgagg gcgtgacctg catcatcttc atcgcggcgc tgaccgcgta 781 cgacatggtg ctagtggagg acgacgaagt gaaccgcatg cacgagagcc tgcacctgtt 841 caacagcatc tgcaaccacc gctacttcgc cacgacgtcc atcgtgctct tccttaacaa 901 gaaggacgtc ttcttcgaga aggtcaagaa ggcgcacctc agcatctgtt tcccggacta 961 cgatggaccc aacacctacg aggacgccgg caactacatc aaggtgcagt tcctcgagct 1021 caacatgcgg cgcgacgtga aggagatcta ttcccacatg acgtgcgcca ccgacacgca 1081 gaacgtcaaa ttctgcttcg acgctgtcac cgacatcatc atcaaggaga acctcaaaga 1141 ctgtggcctc ttctgagcca gggcctgtgc tgcagtcggg gacaaggagc ttccgtctgg 1201 caaggccggg gcacaatttg cactcccctc agctagacgc agcagactca gcaataaacc 1261 tttgcatcag gcaaaaaaaa aaaaacaaaa aa // LOCUS HSGNSM 2379 bp RNA PRI 11-DEC-1995 DEFINITION H.sapiens GNS mRNA encoding glucosamine-6-sulphatase. ACCESSION Z12173 M23657 NID g31866 KEYWORDS GNS gene; N-acetylglucosamine-6-sulphatase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 618 to 2379) AUTHORS Robertson,D.A., Freeman,C., Nelson,P.V., Morris,C.P. and Hopwood,J.J. TITLE Human glucosamine-6-sulfatase cDNA reveals homology with steroid sulfatase JOURNAL Biochem. Biophys. Res. Commun. 157 (1), 218-224 (1988) MEDLINE 89061714 REFERENCE 2 (bases 1 to 2379) AUTHORS Morris,C. TITLE Direct Submission JOURNAL Submitted (04-JUN-1992) Morris C., Adelaide Children's Hospital, Chemical Pathology, 72 King William Road, North Adelaide, South Australia, Australia, 5006 REFERENCE 3 (bases 1 to 2379) AUTHORS Robertson,D.A., Freeman,C., Morris,C.P. and Hopwood,J.J. TITLE A cDNA clone for human glucosamine-6-sulphatase reveals differences between arylsulphatases and non-arylsulphatases JOURNAL Biochem. J. 288 (Pt 2), 539-544 (1992) MEDLINE 93098807 FEATURES Location/Qualifiers source 1..2379 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="Endothelial" /cell_type="Fibroblast" /clone_lib="Clontech lambda gt11 endothelial cDNA library" /clone="p6s50, p6s41, p6s18" /chromosome="12" /map="q14" gene 88..1746 /gene="GNS" sig_peptide 88..195 /gene="GNS" /citation=[3] /evidence=experimental CDS 88..1746 /gene="GNS" /EC_number="3.1.6.14" /function="hydrolysis of heparan sulphate" /note="glucosamine-6-sulphatase" /citation=[1] /citation=[3] /codon_start=1 /evidence=experimental /product="N-acetylglucosamine-6-sulphatase" /db_xref="PID:g31867" /db_xref="SWISS-PROT:P15586" /translation="MRLLPLAPGRLRRGSPRHLPSCSPALLLLVLGGCLGVFGVAAGT RRPNVVLLLTDDQDEVLGGMTPLKKTKALIGEMGMTFSSAYVPSALCCPSRASILTGK YPHNHHVVNNTLEGNCSSKSWQKIQEPNTFPAILRSMCGYQTFFAGKYLNEYGAPDAG GLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLTDVLANVSLDFLDY KSNFEPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIHGTNKHWLIRQAKTP MTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYIFYTSDNGYHTGQF SLPIDKRQLYEFDIKVPLLVRGPGIKPNQTSKMLVANIDLGPTILDIAGYDLNKTQMD GMSLLPILRGASNLTWRSDVLVEYQGEGRNVTDPTCPSLSPGVSQCFPDCVCEDAYNN TYACVRTMSALWNLQYCEFDDQEVFVEVYNLTADPDQITNIAKTIDPELLGKMNYRLM MLQSCSGPTCRTPGVFDPGYRFDPRLMFSNRGSVRTRRFSKHLL" mat_peptide 217..1743 /gene="GNS" /citation=[3] /evidence=experimental /product="N-acetylglucosamine-6-sulphatase" BASE COUNT 615 a 583 c 553 g 628 t ORIGIN 1 ggaattccgg tcggcctctc gcccttcagc tacctgtgcg tccctccgtc ccgtcccgtc 61 ccggggtcac cccggagcct gtccgctatg cggctcctgc ctctagcccc aggtcggctc 121 cggcggggca gcccccgcca cctgccctcc tgcagcccag cgctgctact gctggtgctg 181 ggcggctgcc tgggggtctt cggggtggct gcgggaaccc ggaggcccaa cgtggtgctg 241 ctcctcacgg acgaccagga cgaagtgctc ggcggcatga caccactaaa gaaaaccaaa 301 gctctcatcg gagagatggg gatgactttt tccagtgctt atgtgccaag tgctctctgc 361 tgccccagca gagccagtat cctgacagga aagtacccac ataatcatca cgttgtgaac 421 aacactctgg aggggaactg cagtagtaag tcctggcaga agatccaaga accaaatact 481 ttcccagcaa ttctcagatc aatgtgtggt tatcagacct tttttgcagg gaaatattta 541 aatgagtacg gagccccaga tgcaggtgga ctagaacacg ttcctctggg ttggagttac 601 tggtatgcct tggaaaagaa ttctaagtat tataattaca ccctgtctat caatgggaag 661 gcacggaagc atggtgaaaa ctatagtgtg gactacctga cagatgtttt ggctaatgtc 721 tccttggact ttctggacta caagtccaac tttgagccct tcttcatgat gatcgccact 781 ccagcgcctc attcgccttg gacagctgca cctcagtacc agaaggcttt ccagaatgtc 841 tttgcaccaa gaaacaagaa cttcaacatc catggaacga acaagcactg gttaattagg 901 caagccaaga ctccaatgac taattcttca atacagtttt tagataatgc atttaggaaa 961 aggtggcaaa ctctcctctc agttgatgac cttgtggaga aactggtcaa gaggctggag 1021 ttcactgggg agctcaacaa cacttacatc ttctatacct cagacaatgg ctatcacaca 1081 ggacagtttt ccttgccaat agacaagaga cagctgtatg agtttgatat caaagttcca 1141 ctgttggttc gaggacctgg gatcaaacca aatcagacaa gcaagatgct ggttgccaac 1201 attgacttgg gtcctactat tttggacatt gctggctacg acctaaataa gacacagatg 1261 gatgggatgt ccttattgcc cattttgaga ggtgccagta acttgacctg gcgatcagat 1321 gtcctggtgg aataccaagg agaaggccgt aacgtcactg acccaacatg cccttccctg 1381 agtcctggcg tatctcaatg cttcccagac tgtgtatgtg aagatgctta taacaatacc 1441 tatgcctgtg tgaggacaat gtcagcattg tggaatttgc agtattgcga gtttgatgac 1501 caggaggtgt ttgtagaagt ctataatctg actgcagacc cagaccagat cactaacatt 1561 gctaaaacca tagacccaga gcttttagga aagatgaact atcggttaat gatgttacag 1621 tcctgttctg ggccaacctg tcgcactcca ggggtttttg accccggata caggtttgac 1681 ccccgtctca tgttcagcaa tcgcggcagt gtcaggactc gaagattttc caaacatctt 1741 ctgtagcgac ctcacacagc ctctgcagat ggatccctgc acgcctcttt ctgatgaagt 1801 gattgtagta ggtgtctgta gctagtcttc aagaccacac ctggaagagt ttctgggctg 1861 gctttaagtc ctgtttgaaa aagcaaccca gtcagctgac ttcctcgtgc aatgtgttaa 1921 actgtgaact ctgcccatgt gtcaggagtg gctgtctctg gtctcttcct ttagctgaca 1981 aggacactcc tgaggtcttt gttctcactg tatttttttt atcctggggc cacagttctt 2041 gattattcct cttgtggtta aagactgaat ttgtaaaccc attcagataa atggcagtac 2101 tttaggacac acacaaacac acagatacac cttttgatat gtaagcttga cctaaagtca 2161 aaggacctgt gtagcatttc agattgagca cttcactatc aaaaatacta acatcacatg 2221 gcttgaagag taaccatcag agctgaatca tccaagtaag aacaagtacc attgttgatt 2281 gataagtaga gatacatttt ttatgatgtt catcacagtg tggtaaggtt gcaaattcaa 2341 aacatgtcac ccaagctctg ttcatgtttt tgtgaattc // LOCUS HSGNT3 1902 bp RNA PRI 01-OCT-1996 DEFINITION H.sapiens mRNA for UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase. ACCESSION X92689 NID g1296629 KEYWORDS GalNAc-T3 gene; UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1902) AUTHORS Bennett,E.P., Hassan,H. and Clausen,H. TITLE cDNA cloning and expression of a novel human UDP-N-acetyl-alpha-D-galactosamine. Polypeptide N-acetylgalactosaminyltransferase, GalNAc-t3 JOURNAL J. Biol. Chem. 271 (29), 17006-17012 (1996) MEDLINE 96291839 REFERENCE 2 (bases 1 to 1902) AUTHORS Bennett,E.P. TITLE Direct Submission JOURNAL Submitted (31-OCT-1995) E.P. Bennett, Dental School, University of Copenhagen, Norre Alle 20, 2200 Copenhagen, DENMARK REMARK Revised by submittor 15-JAN-96 FEATURES Location/Qualifiers source 1..1902 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="salivary gland" /clone_lib="lambda gt11" /clone="#1" /clone="#8" gene 1..1902 /gene="GalNac-T3" CDS 1..1902 /gene="GalNac-T3" /codon_start=1 /product="UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferas" /db_xref="PID:e209711" /db_xref="PID:g1617312" /translation="MAHLKRLVKLHIKRHYHKKFWKLGAVIFFFIIVLVLMQREVSVQ YSKEESRMERNMKNKNKMLDLMLEAVNNIKDAMPKMQIGAPVRQNIDAGERPCLQGYY TAAELKPVLDRPPQDSNAPGASGKAFKTTNLSVEEQKEKERGEAKHCFNAFASDRISL HRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLK EIILVDDASVDEYLHDKLDEYVKQFSIVKIVRQRERKGLITARLLGATVATAETLTFL DAHCECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNRGNFDWS LSFGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENI EMSFRVWQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDEYKEI FYRRNTDAAKIVKQKAFGDLSKRFEIKHRLRCKNFTWYLNNIYPEVYVPDLNPVISGY IKSVGQPLCLDVGENNQGGKPLIMYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAQ GLVQLKACTYKGHKTVVTGEQIWEIQKDQLLYNPFLKMCLSANGEHPSLVSCNPSDPL QKWILSQND" BASE COUNT 633 a 348 c 405 g 516 t ORIGIN 1 atggctcacc taaagcgact agtaaaatta cacattaaaa gacattacca taaaaagttc 61 tggaagcttg gtgcagtaat ttttttcttt ataatagttt tggttttaat gcaaagagaa 121 gtaagtgttc aatattccaa agaggaatca aggatggaaa ggaacatgaa aaacaaaaac 181 aagatgttgg atttaatgct agaagctgta aacaatatta aggatgccat gccaaaaatg 241 caaataggag cacctgtcag gcaaaacatt gatgctggtg agagaccttg tttgcaagga 301 tattatacag cagcagaatt gaagcctgtc cttgaccgtc cacctcagga ttcaaatgca 361 cctggtgctt ctggtaaagc attcaagaca accaatttaa gtgttgaaga gcaaaaggaa 421 aaggaacgtg gggaagctaa acactgcttt aatgctttcg caagtgacag gatttctttg 481 caccgagatc ttggaccaga cactcgacct cctgaatgta ttgaacaaaa atttaagcgc 541 tgccctcccc tgcccaccac cagtgtcata atagtttttc ataatgaagc gtggtccacg 601 ttgcttagaa ctgtccacag tgtgctctat tcttcacctg caatactgct gaaggaaatc 661 attttggtgg atgatgctag tgtagatgag tacttacatg ataaactaga tgaatatgta 721 aaacaatttt ctatagtaaa aatagtcaga caaagagaaa gaaaaggtct gatcactgct 781 cggttgctag gagcaacagt cgcaacagct gaaacgctca catttttaga tgctcactgt 841 gagtgtttct atggttggct agaacctctg ttggccagaa tagctgagaa ctacacggct 901 gtcgtaagtc cagatattgc atccatagat ctgaacacgt ttgaattcaa caaaccttct 961 ccttatggaa gtaaccataa ccgtggaaat tttgactgga gtctttcatt tggctgggag 1021 tcgcttcctg atcatgagaa gcaaagaagg aaagatgaaa cctacccaat taaaacaccc 1081 acttttgcag gaggactttt ttccatatca aaagaatatt ttgagtatat tggaagctat 1141 gatgaagaaa tggaaatctg gggaggtgaa aatatagaaa tgtctttcag agtatggcaa 1201 tgtggtgggc agttggagat tatgccttgc tctgttgttg gacatgtttt tcgcagcaaa 1261 agccctcata gctttccaaa aggcactcag gtgattgcta gaaaccaagt tcgccttgca 1321 gaagtctgga tggatgaata caaggaaata ttttatagga gaaatacaga tgcagcaaaa 1381 attgttaaac aaaaagcatt tggtgatctt tcaaaaagat ttgaaataaa acaccgtctt 1441 cggtgtaaaa attttacatg gtatctgaac aacatttatc cagaggtgta tgtgccagac 1501 cttaatcctg ttatatctgg atacattaaa agcgttggtc agcctctatg tctggatgtt 1561 ggagaaaaca atcaaggagg caaaccatta attatgtata catgtcatgg acttggggga 1621 aaccagtact ttgaatactc tgctcaacat gaaattcggc acaacatcca gaaggaatta 1681 tgtcttcatg ctgctcaagg tctcgttcag ctgaaggcat gtacctacaa aggtcacaag 1741 acagttgtca ctggagagca gatatgggag atccagaagg atcaacttct atacaatcca 1801 ttcttaaaaa tgtgcctttc agcaaatgga gagcatccaa gtttagtgtc atgcaaccca 1861 tcagatccac tccaaaaatg gatacttagc caaaatgatt aa // LOCUS HSGONA 621 bp RNA PRI 07-JAN-1995 DEFINITION Human messenger RNA for chorionic gonadotropin. ACCESSION V00518 NID g31868 KEYWORDS complementary DNA; gonadotropin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 621) AUTHORS Fiddes,J.C. and Goodman,H.M. TITLE Isolation, cloning and sequence analysis of the cDNA for the alpha-subunit of human chorionic gonadotropin JOURNAL Nature 281 (5730), 351-356 (1979) MEDLINE 80011660 COMMENT KST HSA.GONADOTROPIN [621]. FEATURES Location/Qualifiers source 1..621 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..>398 /note="messenger RNA" CDS 51..401 /codon_start=1 /product="chorionic gonadotropin" /db_xref="PID:g31869" /db_xref="SWISS-PROT:P01215" /translation="MDYYRKYAAIFLVTLSVFLHVLHSAPDVQDCPECTLQENPFFSQ PGAPILQCMGCCFSRAYPTPLRSKKTMLVQKNVTSESTCCVAKSYNRVTVMGGFKVEN HTACHCSTCYYHKS" BASE COUNT 165 a 152 c 124 g 180 t ORIGIN 1 cagtaaccgc cctgaacaca tcctgcaaaa agcccagaga aaggagcgcc atggattact 61 acagaaaata tgcagctatc tttctggtca cattgtcggt gtttctgcat gttctccatt 121 ccgctcctga tgtgcaggat tgcccagaat gcacgctaca ggaaaaccca ttcttctccc 181 agccgggtgc cccaatactt cagtgcatgg gctgctgctt ctctagagca tatcccactc 241 cactaaggtc caagaagacg atgttggtcc aaaagaacgt cacctcagag tccacttgct 301 gtgtagctaa atcatataac agggtcacag taatgggggg tttcaaagtg gagaaccaca 361 cggcgtgcca ctgcagtact tgttattatc acaaatctta aatgttttac caagtgctgt 421 cttgatgact gctgattttc tggaatggaa aattaagttg tttagtgttt atggctttgt 481 gagataaaac tctccttttc cttaccatac cactttgaca cgcttcaagg atatactgca 541 gctttactgc cttcctcctt atcctacagt acaatcagca gtctagttct tttcatttgg 601 aatgaataca gcattaagct t // LOCUS HSGP25L2G 824 bp RNA PRI 25-SEP-1995 DEFINITION H.sapiens mRNA for gp25L2 protein. ACCESSION X90872 NID g996056 KEYWORDS gp25l2 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 824) AUTHORS Dominguez,M., Fazel,A., Parlati,F., Bell,A.W., Thomas,D.Y. and Bergeron,J.J.M. JOURNAL Unpublished REFERENCE 2 (bases 1 to 824) AUTHORS Dominguez,M. TITLE Direct Submission JOURNAL Submitted (07-AUG-1995) M. Dominguez, McGill University, Anatomy and Cell Biology, 3640 University, Montreal, PQ, H3A2B2, CANADA FEATURES Location/Qualifiers source 1..824 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /clone_lib="lambda gt10" /clone="c15181" gene 92..736 /gene="gp25l2" CDS 92..736 /gene="gp25l2" /note="associated to Golgi apparatus" /codon_start=1 /db_xref="PID:g996057" /translation="MRTLLLVLWLATRGSALYFHIGETEKKCFIEEIPDETMVIGNYR TQLYDKQREEYQPATPGFGMCVEVKDPEDKVILAREYGSEGRFTFTSHTPGEHQICLH SNSTKFSLFAGGMLRVHLDIQVGEHANDYAEIPAKDKLSELQLRVRQLVEQVEQIQKE QNYQRWREERFRQTSESTNQRVLWWSILQTLILVAIGVWQMRHLKSFFEAKKLV" BASE COUNT 181 a 214 c 259 g 169 t 1 others ORIGIN 1 tttttcccag tcacggacgt tgtaaaacga cggccattcg gtgggcgtgc tgctcgtccg 61 gccccggccc ggaaccgggc tgggtagagt gatgcggacc ctcctgctgg tgctgtggct 121 ggcgacgcgc ggaagcgcgc tctactttca catcggagag acggagaaga agtgctttat 181 tgaggagatc ccggacgaga ccatggtcat aggaaactac cggacgcagc tgtatgacaa 241 gcagcgggag gagtaccagc cggccacccc ggggtttggt atgtgtgtgg aggtgaagga 301 cccagaggac aaggtcatcc tggcccggga gtatggctcc gaggggaggt tcactttcac 361 ttcccatacc cctggtgagc accagatctg tcttcactcc aattccacca agttttccct 421 ctttgctgga ggcatgctga gagttcacct ggacatccag gtaggtgaac atgccaatga 481 ctatgcagaa attcctgcta aagacaagtt gagtgagttg cagctacgag tgcgacagct 541 ggtggaacaa gtggagcaga tccagaaaga gcagaactac cagcggtggc gagaggagcg 601 cttccggcag accagtgaga gcaccaacca gcgggtgctg tggtggtcca ttctgcagac 661 cctcatcctc gtggccatcg gtgtctggca gatgcggcac ctcaagagct tctttgaagc 721 caagaagctt gtgtagtgtc ccaggtgtca caacccatcc tcccaggctg ggggggagaa 781 aagggncctc ctggaactga cttcttctgt caggaggatg gttt // LOCUS HSGP34 1048 bp RNA PRI 22-AUG-1994 DEFINITION H.sapiens OX40 ligand/gp34. ACCESSION X79929 NID g510293 KEYWORDS gp 34 gene; Ox40. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1048) AUTHORS Godfrey,W.R., Fagnoni,F.F., Harara,M.A., Buck,D. and Engleman,E.G. TITLE Identification of a human OX-40 ligand, a costimulator of CD4+ T cells with homology to tumor necrosis factor JOURNAL J. Exp. Med. 180 (2), 757-762 (1994) MEDLINE 94321936 REFERENCE 2 (bases 1 to 1048) AUTHORS Godfrey,W.R. TITLE Direct Submission JOURNAL Submitted (30-JUN-1994) W.R. Godfrey, Stanford Blood Center, 800 Welch Road, Palo Alto, CA 94304, USA FEATURES Location/Qualifiers source 1..1048 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="PHA activated PBL" /clone="R2A #30" gene 138..689 /gene="OX 40 ligand/ gp 34" CDS 138..689 /gene="OX 40 ligand/ gp 34" /codon_start=1 /db_xref="PID:g510294" /db_xref="SWISS-PROT:P23510" /translation="MERVQPLEENVGNAARPRFERNKLLLVASVIQGLGLLLCFTYIC LHFSALQVSHRYPRIQSIKVQFTEYKKEKGFILTSQKEDEIMKVQNNSVIINCDGFYL ISLKGYFSQEVNISLHYQKDEEPLFQLKKVRSVNSLMVASLTYKDKVYLNVTTDNTSL DDFHVNGGELILIHQNPGEFCVL" polyA_signal 1024..1029 BASE COUNT 277 a 255 c 221 g 295 t ORIGIN 1 ggccctggga cctttgccta ttttctgatt gataggcttt gttttgtctt tacctccttc 61 tttctgggga aaacttcagt tttatcgcac gttccccttt tccatatctt catcttccct 121 ctacccagat tgtgaagatg gaaagggtcc aacccctgga agagaatgtg ggaaatgcag 181 ccaggccaag attcgagagg aacaagctat tgctggtggc ctctgtaatt cagggactgg 241 ggctgctcct gtgcttcacc tacatctgcc tgcacttctc tgctcttcag gtatcacatc 301 ggtatcctcg aattcaaagt atcaaagtac aatttaccga atataagaag gagaaaggtt 361 tcatcctcac ttcccaaaag gaggatgaaa tcatgaaggt gcagaacaac tcagtcatca 421 tcaactgtga tgggttttat ctcatctccc tgaagggcta cttctcccag gaagtcaaca 481 ttagccttca ttaccagaag gatgaggagc ccctcttcca actgaagaag gtcaggtctg 541 tcaactcctt gatggtggcc tctctgactt acaaagacaa agtctacttg aatgtgacca 601 ctgacaatac ctccctggat gacttccatg tgaatggcgg agaactgatt cttatccatc 661 aaaatcctgg tgaattctgt gtcctttgag gggctgatgg caatatctaa aaccaggcac 721 cagcatgaac accaagctgg gggtggacag ggcatggatt cttcattgca agtgaaggag 781 cctcccagct cagccacgtg ggatgtgaca agaagcagat cctggccctc ccgcccccac 841 ccctcaggga tatttaaaac ttattttata taccagttaa tcttatttat ccttatattt 901 tctaaattgc ctagccgtca caccccaaga ttgccttgag cctactaggc acctttgtga 961 gaaagaaaaa atagatgcct cttcttcaag atgcattgtt tctattggtc aggcaattgt 1021 cataataaac ttatgtcatt gaaaacgg // LOCUS HSGP95SOR 3723 bp RNA PRI 14-APR-1997 DEFINITION H.sapiens mRNA for sortilin. ACCESSION X98248 NID g1834494 KEYWORDS Sort1 gene; sortilin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3723) AUTHORS Petersen,C.M., Nielsen,M.S., Nykjaer,A., Jacobsen,L., Tommerup,N., Rasmussen,H.H., Roigaard,H., Gliemann,J., Madsen,P. and Moestrup,S.K. TITLE Molecular identification of a novel candidate sorting receptor purified from human brain by receptor-associated protein affinity chromatography JOURNAL J. Biol. Chem. 272 (6), 3599-3605 (1997) MEDLINE 97166212 REFERENCE 2 (bases 1 to 3723) AUTHORS Moestrup,S.K. TITLE Direct Submission JOURNAL Submitted (03-JUN-1996) S.K. Moestrup, Dep. Medical Biochemistry, University of Aarhus, Ole Worms Alle Bldg 170, 8000 Aarhus C, DENMARK FEATURES Location/Qualifiers source 1..3723 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Jurkat ZAP express" /chromosome="1" /map="p13.1-21.3" mRNA 1..3723 sig_peptide 22..120 /gene="Sort1" gene 22..2523 /gene="Sort1" CDS 22..2523 /gene="Sort1" /codon_start=1 /product="sortilin" /db_xref="PID:e246784" /db_xref="PID:g1834495" /translation="MERPWGAADGLSRWPHGLGLLLLLQLLPPSTLSQDRLDAPPPPA APLPRWSGPIGVSWGLRAAAAGGAFPRGGRWRRSAPGEDEECGRVRDFVAKLANNTHQ HVFDDLRGSVSLSWVGDSTGVILVLTTFHVPLVIMTFGQSKLYRSEDYGKNFKDITDL INNTFIRTEFGMAIGPENSGKVVLTAEVSGGSRGGRIFRSSDFAKNFVQTDLPFHPLT QMMYSPQNSDYLLALSTENGLWVSKNFGGKWEEIHKAVCLAKWGSDNTIFFTTYANGS CKADLGALELWRTSDLGKSFKTIGVKIYSFGLGGRFLFASVMADKDTTRRIHVSTDQG DTWSMAQLPSVGQEQFYSILAANDDMVFMHVDEPGDTGFGTIFTSDDRGIVYSKSLDR HLYTTTGGETDFTNVTSLRGVYITSVLSEDNSIQTMITFDQGGRWTHLRKPENSECDA TAKNKNECSLHIHASISISQKLNVPMAPLSEPKLVGMVIAHGSVGDAISVMVPDVYIS DDGGYSWTKMLEGPHYYTILDSGGIIVAIEHSSRPINVIKFSTDEGQCWQTYTFTRDP IYFTGLASEPGARSMNISIWGFTESFLTSQWVSYTIDFKDILERNCEEKDYTIWLAHS TDPEDYEDGCILGYKEQFLRLRKSSMCQNGRDYVVTKQPSICLCSLEDFLCDFGYYRP ENDSKCVEQPELKGHDLEFCLYGKRREEHLTTNGYRKIPGDKCQGGVNPVREVKDLKK KCTSNFLSPEKQNSKSNSVPIILAIVGLMLVTVVAGVLIVKKYVCGGRFLVHRYSVLQ QHAEANGVDGVDALDTASHTNKSGYHDDSDEDLLE" BASE COUNT 905 a 936 c 995 g 887 t ORIGIN 1 ggtcggcggc attcggcggc gatggagcgg ccctggggag ctgcggacgg cctctcgcgc 61 tggccccatg gcctcggcct cctcctcctc ctgcagctgc tgccgccgtc gaccctcagc 121 caggaccggc tggacgcgcc gccgccgccc gctgcgccgc tgccgcgctg gtctggcccc 181 atcggggtga gctgggggct gcgggcggcc gcagccgggg gcgcgtttcc ccgcggcggc 241 cgttggcgtc gcagcgcgcc gggcgaggac gaggagtgcg gccgggtccg ggacttcgtc 301 gccaagctgg ccaacaacac gcaccagcat gtgtttgatg atctcagagg ctcagtatcc 361 ttgtcctggg ttggagatag cactggggtc attctagtct tgactacctt ccatgtacca 421 ctggtaatta tgacttttgg acagtccaag ctatatcgaa gtgaggatta tgggaagaac 481 tttaaggata ttacagatct catcaataac acctttattc ggactgaatt tggcatggct 541 attggtcctg agaactctgg aaaggtggtg ttaacagcag aggtgtctgg aggaagtcgt 601 ggaggaagaa tcttcagatc atcagatttt gcgaagaatt ttgtgcaaac agatctccct 661 tttcatcctc tcactcagat gatgtatagc cctcagaatt ctgattatct tttagctctc 721 agcactgaaa atggcctgtg ggtgtccaag aattttgggg gaaaatggga agaaatccac 781 aaagcagtat gtttggccaa atggggatca gacaacacca tcttctttac aacctatgca 841 aatggctcct gcaaagctga ccttggggct ctggaattat ggagaacttc agacttggga 901 aaaagcttca aaactattgg tgtgaaaatc tactcatttg gtcttggggg acgtttcctt 961 tttgcctctg tgatggctga taaggatacc acaagaagga tccacgtttc aacagatcaa 1021 ggggacacat ggagcatggc ccagctcccc tccgtgggac aggaacagtt ctattctatt 1081 ctggcagcaa atgatgacat ggtattcatg catgtagatg aacctggaga cactgggttt 1141 ggcacaatct ttacctcaga tgatcgaggc attgtctatt ccaagtcttt ggaccgacat 1201 ctctacacta ccacaggcgg agagacggac tttaccaacg tgacctccct ccgcggcgtc 1261 tacataacaa gcgtgctctc cgaagataat tctatccaga ccatgatcac ttttgaccaa 1321 ggaggaaggt ggacgcacct gaggaagcct gaaaacagtg aatgtgatgc tacagcaaaa 1381 aacaagaatg agtgcagcct tcatattcat gcttccatca gcatctccca gaaactgaat 1441 gttccaatgg ccccactctc agagccgaag ctcgtaggca tggtcattgc tcatggtagc 1501 gtgggggatg ccatctcagt gatggttcca gatgtgtaca tctcagatga tgggggttac 1561 tcctggacaa agatgctgga aggaccccac tattacacca tcctggattc tggaggcatc 1621 attgtggcca ttgagcacag cagccgtcct atcaatgtga ttaagttctc cacagacgaa 1681 ggtcaatgct ggcaaaccta cacgttcacc agggacccca tctatttcac tggcctagct 1741 tcagaacctg gagctaggtc catgaatatc agcatttggg gcttcacaga atctttcctg 1801 accagccagt gggtctccta caccattgat tttaaagata tccttgaaag gaactgtgaa 1861 gagaaggact ataccatatg gctggcacac tccacagacc ctgaagatta tgaagatggc 1921 tgcattttgg gctacaaaga acagtttctg cggctacgca agtcatccat gtgtcagaat 1981 ggtcgagact atgttgtgac caagcagccc tccatctgcc tctgttccct ggaggacttt 2041 ctctgtgatt ttggctacta ccgtccagaa aatgactcca agtgtgtgga acagccagaa 2101 ctgaagggcc acgacctgga gttttgtctg tacggaaaga gaagagaaga acacctaaca 2161 acaaatgggt accggaaaat tccaggggac aaatgccagg gtggggtaaa tccagttcga 2221 gaagtaaaag acttgaaaaa gaaatgcaca agcaactttt tgagtccgga aaaacagaat 2281 tccaagtcaa attctgttcc aattatcctg gccatcgtgg gattgatgct ggtcacagtc 2341 gtagcaggag tgctcattgt gaagaaatat gtctgtgggg gaaggttcct ggtgcatcga 2401 tactctgtgc tgcagcagca tgcagaggcc aatggtgtgg atggtgtgga tgctttggac 2461 acagcctccc acactaataa aagtggttat catgatgact cagatgagga cctcttggaa 2521 tagctcttca gaggagctgg acccagcatg gatggtggaa ccacagtacc tcttacactc 2581 cctgtggctc caacttcagg aaataaattt cccattgcga ggacccagct ctgtttctgc 2641 tgcttccatc aaagccaaaa gacctacact aaagaaatgc agggtggggg tggggaaccc 2701 tgagcacttt tttacaattg gctctgagaa aaagggagac attttaaatt ctttaacttc 2761 ttatttctcg tcctgtctct ttgcaaagta tgggcttttg tttttgtttt ttaagggaaa 2821 cgaaatggaa ttcgaaggga ccttttcact aaccccactt ctgtgtgttc tgcatggtgc 2881 ctgccccagg gcatctgcca actccagtat cagctctcac agtgtacttg gtaccatccc 2941 tgggctctgc tggcgagacg aaacagctgt agagatgaaa acaggctgca gaggctggca 3001 cagctggccg gcttttctcc atctggggac agtcctactc caagaacact gcacaccagc 3061 tcctcacaca gatcccactt acggcgcgca acgggttcta ggctgcaggc agctcgagga 3121 cccgcggccc cgccccggct cggcctggca gatagcagag gcagcaggcg tgccgggggg 3181 gcatgttgct gtaaccagtg gcccagggga tgttacggtg gacagtgcac ctggagggcg 3241 ggccccgcag ggtgaaccat gctgcagtgg ctgtcgggca tcgggtatac tccttcgggg 3301 gttactgctc tggtgaagac tatgagacac tgcgtcagat agatgtgcac attttcaatg 3361 cagtgtcctt gcgttggaca aagctgcccc cggtgaagtc tgccatccgt gggcaagctc 3421 ctgtggtacc ctacatgcgc tatggacact caaccgtcct catcgacgac acagtcctcc 3481 tttggggcgg gcggaatgac accgaagggc ctgcaatgtg ctctatgcct ttgacgtcaa 3541 tacgcacaag tggttcacac cccgagtgtc agggacagtt cctggggccc gggatggaca 3601 ttcagcctgt gtcctaggca agatcatgta catttttggg ggctacgagc agcaggcgac 3661 tgtttttcca atgacattca caagctagat accagcacca tgacatggac tcttatctgt 3721 aca // LOCUS HSGPCP 1341 bp RNA PRI 26-NOV-1997 DEFINITION Homo sapiens mRNA for G-protein coupled receptor. ACCESSION Y13583 NID g2652933 KEYWORDS G-protein coupled receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1341) AUTHORS Hanze,J., Dittrich,K., Dotsch,J. and Rascher,W. TITLE Molecular cloning of a novel human receptor gene with homology to the rat adrenomedullin receptor and high expression in heart and immune system JOURNAL Biochem. Biophys. Res. Commun. 240 (1), 183-188 (1997) MEDLINE 98042541 REFERENCE 2 (bases 1 to 1341) AUTHORS Haenze,J. TITLE Direct Submission JOURNAL Submitted (03-JUN-1997) J. Haenze, Department of Pediatrics, University Giessen, Feulgenstr. 12, D-35385, Giessen, FRG REMARK revised by submitter 15-AUG-1997 FEATURES Location/Qualifiers source 1..1341 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 95..1309 /codon_start=1 /product="G-protein coupled receptor" /db_xref="PID:e1188592" /db_xref="PID:g2652934" /translation="MSVKPSWGPGPSEGVTAVPTSDLGEIHNWTELLDLFNHTLSECH VELSQSTKRVVLFALYLAMFVVGLVENLLVICVNWRGSGRAGLMNLYILNMAIADLGI VLSLPVWMLEVTLDYTWLWGSFSCRFTHYFYFVNMYSSIFFLVCLSVDRYVTLTSASP SWQRYQHRVRRAMCAGIWVLSAIIPLPEVVHIQLVEGPEPMCLFMAPFETYSTWALAV ALSTTILGFLLPFPLITVFNVLTACRLRQPGQPKSRRHCLLLCAYVAVFVMCWLPYHV TLLLLTLHGTHISLHCHLVHLLYFFYDVIDCFSMLHCVINPILYNFLSPHFRGRLLNA VVHYLPKDQTKAGTCASSSSCSTQHSIIITKGDSQPAAAAPHPEPSLSFQAHHLLPNT SPISPTQPLTPS" BASE COUNT 227 a 477 c 327 g 310 t ORIGIN 1 cagcctcctc acagctcccc atagcctgga cctgccggcc ctccctccag gaccgagggg 61 ctcccaaggg aaactcaggc gtgtgctggt cccaatgtca gtgaaaccca gctgggggcc 121 tggcccctcg gagggggtca ccgcagtgcc taccagtgac cttggagaga tccacaactg 181 gaccgagctg cttgacctct tcaaccacac tttgtctgag tgccacgtgg agctcagcca 241 gagcaccaag cgcgtggtcc tctttgccct ctacctggcc atgtttgtgg ttgggctggt 301 ggagaacctc ctggtgatat gcgtcaactg gcgcggctca ggccgggcag ggctgatgaa 361 cctctacatc ctcaacatgg ccatcgcgga cctgggcatt gtcctgtctc tgcccgtgtg 421 gatgctggag gtcacgctgg actacacctg gctctggggc agcttctcct gccgcttcac 481 tcactacttc tactttgtca acatgtatag cagcatcttc ttcctggtgt gcctcagtgt 541 cgaccgctat gtcaccctca ccagcgcctc cccctcctgg cagcgttacc agcaccgagt 601 gcggcgggcc atgtgtgcag gcatctgggt cctctcggcc atcatcccgc tgcctgaggt 661 ggtccacatc cagctggtgg agggccctga gcccatgtgc ctcttcatgg caccttttga 721 aacgtacagc acctgggccc tggcggtggc cctgtccacc accatcctgg gcttcctgct 781 gcccttccct ctcatcacag tcttcaatgt gctgacagcc tgccggctgc ggcagccagg 841 acaacccaag agccggcgcc actgcttgct gctgtgcgcc tacgtggccg tctttgtcat 901 gtgctggctg ccctatcatg tgaccctgct gctgctcaca ctgcatggga cccacatctc 961 cctccactgc cacctggtcc acctgctcta cttcttctat gatgtcattg actgcttctc 1021 catgctgcac tgtgtcatca accccatcct ttacaacttt ctcagcccac acttccgggg 1081 ccggctcctg aatgctgtag tccattacct tcctaaggac cagaccaagg cgggcacatg 1141 cgcctcctct tcctcctgtt ccacccagca ttccatcatc atcaccaagg gtgatagcca 1201 gcctgctgca gcagcccccc accctgagcc aagcctgagc tttcaggcac accatttgct 1261 tccaaatact tcccccatct ctcccactca gcctcttaca cccagctgag gtactagaat 1321 tcagcggccg ctgaattcta g // LOCUS HSGPIP137 3268 bp RNA PRI 19-JUL-1996 DEFINITION H.sapiens mRNA encoding GPI-anchored protein p137. ACCESSION Z48042 NID g662993 KEYWORDS GPI-anchored protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3268) AUTHORS Ellis,J.A. and Luzio,J.P. TITLE Identification and characterization of a novel protein (p137) which transcytoses bidirectionally in Caco-2 cells JOURNAL J. Biol. Chem. 270 (35), 20717-20723 (1995) MEDLINE 95386525 REFERENCE 2 (bases 1 to 3268) AUTHORS Ellis,J.A. TITLE Direct Submission JOURNAL Submitted (19-JAN-1995) Ellis J. A., University of Cambridge, Clinical Biochemistry, Hills Rd, Cambridge, Cambs., U.K., CB2 2QR REFERENCE 3 (bases 1 to 3268) AUTHORS Gessler,M., Klamt,B., Tsaoussidou,S., Ellis,J.A. and Luzio,J.P. TITLE The gene encoding the GPI-anchored membrane protein p137GPI (M11S1) maps to human chromosome 11p13 and is highly conserved in the mouse JOURNAL Genomics 32 (1), 169-170 (1996) MEDLINE 96230346 FEATURES Location/Qualifiers source 1..3268 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Lambdagt11 human colon" CDS 202..2151 /codon_start=1 /product="GPI-anchored protein p137" /db_xref="PID:g662994" /translation="MKQILGVIDKKLRNLEKKKGKLDDYQERMNKGERLNQDQLDAVS KYQEVTNNLEFAKELQRSFMALSQDIQKTIKKTARREQLMREEAEQKRLKTVLELQYV LDKLGDDEVRTDLKQGLNGVPILSEEELSLLDEFYKLVDPERDMSLRLNEQYEHASIH LWDLLEGKEKPVCGTTYKVLKEIVERVFQSNYFDSTHNHQNGLCEEEEADSAPAVEDQ VPEAEPEPAEEYTEQSEVESTEYVNRQFMAETQFTSGEKEQVDEWTVETVEVVNSLQQ QPQAASPSVPEPHSLTPVAQADPLVRRQRVQDLMAQMQGPDNFIQDSMLDFENQTLDP AIVSAQPMNPTQNMDMPQLVCPPVHSESRLAQPNQVPVQPEATQVPLVSSTSEGYTAS QPLYQPSHATEQRPQKEPIDQIQATISLNTDQTTASSSLPAASQPQVFQAGTSKPLHS SGINVNAAPFQSMQTVFNMNAPVPPVNEPETLKQQNQYQASYNQSFSSQPHQVEQTEL QQEQLQTVVGTYHGSPDQSHQVTGNHQQPPQQNTGFPRSNQPYYNSRGVSRGGSRGAR GLMNGYRGPAMDSEEDMMVTALHSLTLQTVVIHSLSSVLPGITLAINGMDISRISSEA LGRVDHGEPHEVVEGPQDPTEGCRK" BASE COUNT 974 a 741 c 736 g 817 t ORIGIN 1 cggcttcctc ccgctttttc ttctctctcc ttgcggtctg aagatgccct cggccaccag 61 ccacagcggg agcgcagcaa gtcgtccgga ccgccaccgc cgtcgggttc ctccgggagt 121 gaggcggccg cgggagccgg ggccgccgcg ccggcttctc agcaccccgc aaccggcacc 181 ggcgctgtcc agaccgaggc catgaagcag attctcgggg tgatcgacaa gaaacttcgg 241 aacctggaga agaaaaaggg taagcttgat gattaccagg aacgaatgaa caaaggggaa 301 aggcttaatc aagatcagct ggatgccgtt tctaagtacc aggaagtcac aaataatttg 361 gagtttgcaa aagaattaca gaggagtttc atggcactaa gtcaagatat tcagaaaaca 421 ataaagaaga cagcacgtcg ggagcagctt atgagagaag aagctgaaca gaaacgttta 481 aaaactgtac ttgagctaca gtatgttttg gacaaattgg gagatgatga agtgcggact 541 gacctgaaac aaggtttgaa tggagtgcca atattgtccg aagaggagtt gtcattgttg 601 gatgaattct ataagctagt agaccctgaa cgggacatga gcttgaggtt gaatgaacag 661 tatgaacatg cctccattca cctgtgggac ctgctggaag ggaaggaaaa acctgtatgt 721 ggaaccacct ataaagttct aaaggaaatt gttgagcgtg tttttcagtc aaactacttt 781 gacagcaccc acaaccacca gaatgggctg tgtgaggaag aagaggcaga ctcagcacct 841 gcagttgaag accaggtacc tgaagctgaa cctgagccag cagaagagta cactgagcaa 901 agtgaagttg aatcaacaga gtatgtaaat agacagttca tggcagaaac acagttcacc 961 agtggtgaaa aggagcaggt agatgagtgg acagttgaaa cggttgaggt ggtaaattca 1021 ctccagcagc aacctcaggc tgcatcccct tcagtaccag agccccactc tttgactcca 1081 gtggctcagg cagatcccct tgtgagaaga cagcgagtac aagaccttat ggcacaaatg 1141 cagggtcccg ataatttcat acaggattca atgctggatt ttgaaaatca gacacttgat 1201 cctgccattg tatctgcaca gcctatgaat ccaacacaaa acatggacat gccccagctg 1261 gtttgccctc cagttcattc tgaatctaga cttgctcagc ctaatcaagt tcctgtacaa 1321 ccagaagcga cacaggttcc tttggtatca tccacaagtg aggggtacac agcatctcaa 1381 cccttgtacc agccttctca tgctacagag caacgaccac agaaggaacc aattgatcag 1441 attcaggcaa caatctcttt aaatacagac cagactacag catcatcatc ccttcctgct 1501 gcgtctcagc ctcaagtatt tcaggctggg acaagcaaac ctttacatag cagtggaatc 1561 aatgtaaatg cagctccatt ccaatccatg caaacggtgt tcaatatgaa tgccccagtt 1621 cctcctgtta atgaaccaga aactttaaaa cagcaaaatc agtaccaggc cagttataac 1681 cagagctttt ctagtcagcc tcaccaagta gaacaaacag agcttcagca agaacagctt 1741 caaacagtgg ttggcactta ccatggttcc ccagaccagt cccatcaagt gactggtaac 1801 caccagcagc ctcctcagca gaacactgga tttccacgta gcaatcagcc ctattacaat 1861 agtcgtggtg tgtctcgtgg aggctcccgt ggtgctagag gcttgatgaa tggataccgg 1921 ggccctgcaa tggattcaga ggaggatatg atggttaccg cccttcattc tctaacactc 1981 caaacagtgg ttatacacag tctcagttca gtgctccccg ggattactct ggctatcaac 2041 gggatggata tcagcagaat ttcaagcgag gctctgggca gagtggacca cggggagccc 2101 cacgaggtcg tggagggccc ccaagaccca acagagggat gccgcaaatg aacactcagc 2161 aagtgaatta atctgattca caggattatg tttaatcgcc aaaaacacac tggccagtgt 2221 accataatat gttaccagaa gagttattat ctatttgttc tccctttcag gaaacttatt 2281 gtaaagggac tgttttcatc ccataaagac aggactacaa ttgtcagctt tctattacct 2341 ggatatggaa ggaaactatt tttactctgc atgttctgtc ctaagcgtca tcttgagcct 2401 tgcacatgat actcagattc ctcacccttg cttaggagta aaacaatata ctttacaggg 2461 tgataataat ctccatagtt atttgaagtg gcttgaaaaa ggcaagattg acttttatga 2521 cattggataa aatctacaaa tcagccctcg agttattcaa tgataactga caaactaaat 2581 tatttcccta gaaaggaaga tgaaaggagt ggagtgtggt ttggccagaa caactgcatt 2641 tcacagcttt tccagttaaa ttggagcact gaacgttcag atgcatacca aattatgcat 2701 gggtcctaat cacacatata aggctggcta ccagctttga cacagcactg ttcatctggc 2761 caaacaactg tggttaaaaa cacatgtaaa atggcttttt aacagctgat actgtataag 2821 acaaagccaa gatgcaaaat taggctttga ttggcacttt ttgaaaaata tgcaacaaat 2881 atgggatgta atccggatgg ccgcttctgt acttaatgtg aaatatttag ataccttttt 2941 gaacacttaa cagtttcttt gagacaatga ctttgtaagg attggtacta tctatcattc 3001 cttatgacat gtacattgtc tgtcactaat ccttggattt tgctgtattg tcaccgggat 3061 tggtacaggt actgatgaaa atctctagtg gataatcata acactctcgg tcacatgttt 3121 ttccttcagc ttgaaagctt tttttaaaag gaaaagatac caaatgcctg ctgctaccac 3181 ccttttcaat tgctatgttt tgaaaggcac cagtatgtgt tttagattga tttccctgtt 3241 tcagggaaat cacggacagt agtttccg // LOCUS HSGRX 837 bp RNA PRI 10-JAN-1996 DEFINITION H.sapiens mRNA for glutaredoxin. ACCESSION X76648 NID g531404 KEYWORDS glutaredoxin; grx gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 837) AUTHORS Padilla,A.C. TITLE Direct Submission JOURNAL Submitted (08-DEC-1993) A.C. Padilla, Karolinska Institute, Medical Nobel Institute, Biochem. I, 17177 Stockholm, SWEDEN REFERENCE 2 (bases 1 to 837) AUTHORS Padilla,C.A., Martinez-Galisteo,E., Barcena,J.A., Spyrou,G. and Holmgren,A. TITLE Purification from placenta, amino acid sequence, structure comparisons and cDNA cloning of human glutaredoxin JOURNAL Eur. J. Biochem. 227 (1-2), 27-34 (1995) MEDLINE 95154298 REFERENCE 3 (bases 1 to 837) AUTHORS Padilla,C.A., Spyrou,G. and Holmgren,A. TITLE High-level expression of fully active human glutaredoxin (thioltransferase) in E. coli and characterization of Cys7 to Ser mutant protein JOURNAL FEBS Lett. 378 (1), 69-73 (1996) MEDLINE 96140711 FEATURES Location/Qualifiers source 1..837 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="spleen" /clone_lib="lambda ZAP" gene 64..384 /gene="grx" CDS 64..384 /gene="grx" /codon_start=1 /product="glutaredoxin" /db_xref="PID:g531405" /translation="MAQEFVNCKIQPGKVVVFIKPTCPYCRRAQEILSQLPIKQGLLE FVDITATNHTNEIQDYLQQLTGARTVPRVFIGKDCIGGCSDLVSLQQSGELLTRLKQI GALQ" BASE COUNT 266 a 175 c 186 g 210 t ORIGIN 1 aattcggcac gaggcaatac ctgcaactga ggattcttcc cggggagacc gcagcccatc 61 ggcatggctc aagagtttgt gaactgcaaa atccagcctg ggaaggtggt tgtgttcatc 121 aagcccacct gcccgtactg caggagggcc caagagatcc tcagtcaatt gcccatcaaa 181 caagggcttc tggaatttgt cgatatcaca gccaccaacc acactaacga gattcaagat 241 tatttgcaac agctcacggg agcaagaacg gtgcctcgag tctttattgg taaagattgt 301 ataggcggat gcagtgatct agtctctttg caacagagtg gggaactgct gacgcggcta 361 aagcagattg gagctctgca gtaaccacag atctcatagg aaatgttcaa caattctgtg 421 aaaggtcaca ggacccaatt ggagaaatca tatgaaaagc atagttggtc ttggtgtcat 481 atggatgaga ggcacaagtg cagaggcctg tggtcatgtg gaacactctg ttatttaaga 541 tggctatcca gataatcctg aacactgtgt atttatttta tttagactac cagcaaagat 601 taaagcatga aatgtaaaac atctgataaa acttacagcc ccctacacca agagtgtatc 661 tgtgaaagag ctcctacact ttgaaaactt aagaatccct tatcatgaag tttgcctgtt 721 ctagaattgt aagattgtta atttccttca atctctagtg acaacactta atttcttttc 781 taataaaaaa aacctataga tgaaaaaaaa aaaaaaaaaa actcgagggg gggcccg // LOCUS HSGSA1R 1516 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for coupling protein G(s) alpha-subunit (alpha-S1) (stimulatory regulatory component Gs of adenylyl cyclase). ACCESSION X04409 NID g31912 KEYWORDS adenylate cyclase; G protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 25 to 1516) AUTHORS Mattera,R., Codina,J., Crozat,A., Kidd,V., Woo,S.L. and Birnbaumer,L. TITLE Identification by molecular cloning of two forms of the alpha-subunit of the human liver stimulatory (GS) regulatory component of adenylyl cyclase JOURNAL FEBS Lett. 206 (1), 36-42 (1986) MEDLINE 87005246 REFERENCE 2 (bases 1 to 1516) AUTHORS Birnbaumer,L. TITLE Direct Submission JOURNAL Submitted (11-MAY-1987) to the EMBL/GenBank/DDBJ databases COMMENT See x04408 for alpha-S2. The sequence differs from alpha-S2 in that it lacks a stretch of 42bp between the replaced nucleotides C and A (instead of G) at pos. 201 and 202. Data kindly reviewed (11-MAY-1987) by Birnbaumer L. FEATURES Location/Qualifiers source 1..1516 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="human liver cDNA in (lambda)gt11" /clone="(lambda)a1, (lambda)a2" CDS 13..1155 /note="alpha-S1 (AA 1-380)" /codon_start=1 /db_xref="PID:g31913" /db_xref="SWISS-PROT:P04895" /translation="MGCLGNSKTEDQRNEEKAQREANKKIEKQLQKDKQVYRATHRLL LLGAGESGKSTIVKQMRILHVNGFNGDSEKATKVQDIKNNLKEAIETIVAAMSNLVPP VELANPENQFRVDYILSVMNVPDFDFPPEFYEHAKALWEDEGVRACYERSNEYQLIDC AQYFLDKIDVIKQADYVPSDQDLLRCRVLTSGIFETKFQVDKVNFHMFDVGGQRDERR KWIQCFNDVTAIIFVVASSSYNMVIREDNQTNRLQEALNLFKSIWNNRWLRTISVILF LNKQDLLAEKVLAGKSKIEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRDEFLRI STASGDGRHYCYPHFTCAVDTENIRRVFNDCRDIIQRMHLRQYELL" misc_feature 1402..1407 /note="put. polyA signal" misc_feature 1406..1411 /note="alternate polyA signal" misc_feature 1428..1433 /note="alternate polyA signal" misc_feature 1438..1443 /note="alternate polyA signal" misc_feature 1456..1461 /note="alternate polyA signal" misc_feature 1493..1498 /note="alternate polyA signal" misc_feature 1500..1505 /note="alternate polyA signal" polyA_site 1516 /note="polyA site" BASE COUNT 444 a 386 c 374 g 312 t ORIGIN 1 gccgccgccg ccatgggctg cctcgggaac agtaagaccg aggaccagcg caacgaggag 61 aaggcgcagc gtgaggccaa caaaaagatc gagaagcagc tgcagaagga caagcaggtc 121 taccgggcca cgcaccgcct gctgctgctg ggtgctggag aatctggtaa aagcaccatt 181 gtgaagcaga tgaggatcct gcatgttaat gggtttaatg gagacagtga gaaggcaacc 241 aaagtgcagg acatcaaaaa caacctgaaa gaggcgattg aaaccattgt ggccgccatg 301 agcaacctgg tgccccccgt ggagctggcc aaccccgaga accagttcag agtggactac 361 atcctgagtg tgatgaacgt gcctgacttt gacttccctc ccgaattcta tgagcatgcc 421 aaggctctgt gggaggatga aggagtgcgt gcctgctacg aacgctccaa cgagtaccag 481 ctgattgact gtgcccagta cttcctggac aagatcgacg tgatcaagca ggctgactat 541 gtgccgagcg atcaggacct gcttcgctgc cgtgtcctga cttctggaat ctttgagacc 601 aagttccagg tggacaaagt caacttccac atgtttgacg tgggtggcca gcgcgatgaa 661 cgccgcaagt ggatccagtg cttcaacgat gtgactgcca tcatcttcgt ggtggccagc 721 agcagctaca acatggtcat ccgggaggac aaccagacca accgcctgca ggaggctctg 781 aacctcttca agagcatctg gaacaacaga tggctgcgca ccatctctgt gatcctgttc 841 ctcaacaagc aagatctgct cgctgagaaa gtccttgctg ggaaatcgaa gattgaggac 901 tactttccag aatttgctcg ctacactact cctgaggatg ctactcccga gcccggagag 961 gacccacgcg tgacccgggc caagtacttc attcgagatg agtttctgag gatcagcact 1021 gccagtggag atgggcgtca ctactgctac cctcatttca cctgcgctgt ggacactgag 1081 aacatccgcc gtgtgttcaa cgactgccgt gacatcattc agcgcatgca ccttcgtcag 1141 tacgagctgc tctaagaagg gaacccccaa atttaattaa agccttaagc acaattaatt 1201 aaaagtgaaa cgtaattgta caagcagtta atcacccacc atagggcatg attaacaaag 1261 caacctttcc cttcccccga gtgattttgc gaaaccccct tttcccttca gcttgcttag 1321 atgttccaaa tttagaaagc ttaaggcggc ctacagaaaa aggaaaaaag gccacaaaag 1381 ttccctctca ctttcagtaa aaataaataa aacagcagca gcaaacaaat aaaatgaaat 1441 aaaagaaaca aatgaaataa atattgtgtt gtgcagcatt aaaaaaaatc aaaataaaaa 1501 ttaaatgtga gcaaag // LOCUS HSGST1 2587 bp RNA PRI 12-SEP-1993 DEFINITION Human GST1-Hs mRNA for GTP-binding protein. ACCESSION X17644 NID g31920 KEYWORDS cell cycle control; GST1-Hs gene; GTP-binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2587) AUTHORS Hoshino,S., Miyazawa,H., Enomoto,T., Hanaoka,F., Kikuchi,Y., Kikuchi,A. and Ui,M. TITLE A human homologue of the yeast GST1 gene codes for a GTP-binding protein and is expressed in a proliferation-dependent manner in mammalian cells JOURNAL EMBO J. 8 (12), 3807-3814 (1989) MEDLINE 90059983 FEATURES Location/Qualifiers source 1..2587 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KB" /clone_lib="lambda gt10" /clone="pGH5" CDS 649..2148 /note="GST1-Hs protein (AA 1-499)" /codon_start=1 /db_xref="PID:g31921" /db_xref="SWISS-PROT:P15170" /translation="MELSEPIVENGETEMSPEESWEHKEEISEAEPGGGSLGDGRPPE ESAHEMMEEEEEIPKPKSVVAPPGAPKKEHVNVVFIGHVDAGKSTIGGQIMYLTGMVD KRTLEKYEREAKEKNRETWYLSWALDTNQEERDKGKTVEVGRAYFETEKKHFTILDAP GHKSFVPNMIGGASQADLAVLVISARKGEFETGFEKGGQTREHAMLAKTAGVKHLIVL INKMDDPTVNWSNERYEECKEKLVPFLKKVGFNPKKDIHFMPCSGLTGANLKEQSDFC PWYIGLPFIPYLDNLPNFNRSVDGPIRLPIVDKYKDMGTVVLGKLESGSICKGQQLVM MPNKHNVEVLGILSDDVETDTVAPGENLKIRLKGIEEEEILPGFILCDPNNLCHSGRT FDAQIVIIEHKSIICPGYNAVLHIHTCIEEVEITALICLVDKKSGEKSKTRPRFVKQD QVCIARLRTAGTICLETFKDFPQMGRFTLRDEGKTIAIGKVLKLVPEKD" misc_feature 2489..2493 /note="pot. mRNA instability signal" misc_feature 2570..2575 /note="pot. polyadenylation signal" polyA_site 2587 /note="polyadenylation site" BASE COUNT 711 a 608 c 670 g 598 t ORIGIN 1 ggcacacacg aggaggaggg ttgagctgct gccgccgccg cctctgtcgt cgtcgcgagt 61 gtggagtcgg gactggagct gctgccgcgg cgacgccggg gatctttgtc gctagctccc 121 ggcccttctg ccccgccgcc ttccctcagt cagcgttgcc cactcctctc cggccgggcg 181 cccctgcctc catttctcgc tctctgtcca ccacacacac ggcccccccg atcatggatc 241 cgggcagtgg cggcggcggc ggcggcggcg gcggcggcgg gagcagcagc ggcagcagca 301 gcagcgactc ggcgcctgac tgctgggacc aggcggacat ggaagccccc gggccgggcc 361 cttgcggcgg cggcggcttc cctggcggcg gcggccgagg cccagcggga gaacctcagc 421 gcggccttca gccggcaact caacgtcaac gccaagccct tcgtgcccaa cgtccacgcc 481 gccgagttcg tgccgtcctt cctgcggggc ccggcagcgc cgccaccccc agctggcggc 541 gccgccaata accacggagc cggcagcggc gcgggaggcc gtgcggcacc tgtggaatcc 601 tctcaagagg aacagtcatt gtgtgaaggt tcaaattcag ctgttagcat ggaactttca 661 gaacctattg tagaaaatgg agagacagaa atgtctccag aagaatcatg ggagcacaaa 721 gaagaaataa gtgaagcaga gccagggggt ggttccttgg gagatggaag gccgccagag 781 gaaagtgccc atgaaatgat ggaggaggaa gaggaaatcc caaaacctaa gtctgtggtt 841 gcaccgccag gtgctcctaa gaaagagcat gtaaatgtag tattcattgg gcacgtagat 901 gctggcaagt caaccattgg aggacaaata atgtatttga ctggaatggt tgacaaaagg 961 acgcttgaaa agtatgaaag agaagctaaa gagaaaaaca gagaaacttg gtacttgtct 1021 tgggccttag acacaaatca ggaagaacga gacaagggta aaacagtaga agtgggtcgt 1081 gcctattttg aaaccgaaaa gaagcatttc acaattctag atgcccctgg ccacaagagt 1141 tttgtcccaa atatgattgg tggtgcctct caagctgatt tggctgtgct ggtaatctca 1201 gccaggaaag gagagtttga aactggattt gaaaaaggag gacagacaag agaacatgca 1261 atgttggcaa agacagcagg tgtaaaacac ctaattgtgc taattaataa gatggatgat 1321 ccaacagtaa attggagcaa tgagagatat gaagaatgta aggagaaact agtgccattt 1381 ttgaaaaaag ttggcttcaa tcccaaaaag gacattcact ttatgccctg ctcaggactt 1441 actggagcaa atctcaaaga gcagtcggat ttctgtcctt ggtacattgg attaccgttt 1501 attccatatc tggataattt gccgaacttc aatagatcag ttgatggacc aatcaggctg 1561 ccaattgtgg ataagtacaa ggatatgggc actgtggtcc tgggaaagct ggaatcagga 1621 tctatttgta aaggccagca gcttgtgatg atgccaaaca agcacaacgt ggaagttctt 1681 ggaatacttt ccgatgatgt agagactgat accgtagccc caggtgaaaa cctcaaaatc 1741 agactgaaag gaattgaaga agaggagatt cttccagggt ttatactttg tgatcctaat 1801 aatctttgtc attctggacg cacatttgat gcccagatag tgattataga gcacaaatcc 1861 atcatctgcc caggctataa tgcggtgctg catattcata cctgtattga ggaggtggaa 1921 ataacagcct taatctgctt ggtagacaaa aaatcaggag aaaaaagtaa gacccgaccc 1981 cgttttgtga aacaagatca agtatgcatt gctcgcttaa ggacagcagg aaccatctgc 2041 cttgagacct ttaaagactt ccctcagatg ggtcgtttca ccttaagaga tgagggtaag 2101 accattgcaa ttggaaaagt tctgaaactg gttccagaga aagactaagc attttcttga 2161 tgaccctgca caatactgtg aggaaaattg actgcagaag cctacttcac accgccttct 2221 cttattttct gcccattgat aaacctctcc ccatattttg caaagaggaa attcacagca 2281 aaagtccaca ttatgtcagc tttctcatat tgagagctct gctatgccac tgttgaattt 2341 ttcccaagat tcctgtccct agccctcact tcaaactctg cttccttgga cagatttggc 2401 aatagctttg taagtgatgt ggacataatt gcctacaata atgaaaacct acaggaattt 2461 ttttattttt cattttcccc ttaggcatat ttagtatttt tcccccaggc cagatcattc 2521 gtgagtgtgc gagtgtgtgt gcacatgtta caaaggcaac taccatgtta ataaaatatt 2581 caatttg // LOCUS HSGSTPI 714 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for class Pi glutathione S-transferase (GST-Pi; E.C.2.5.1.18). ACCESSION X06547 NID g31945 KEYWORDS glutathione S-transferase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 714) AUTHORS Kano,T., Sakai,M. and Muramatsu,M. TITLE Structure and expression of a human class pi glutathione S-transferase messenger RNA JOURNAL Cancer Res. 47 (21), 5626-5630 (1987) MEDLINE 88026724 FEATURES Location/Qualifiers source 1..714 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="lambda gt11" /clone="pGPi2" CDS 7..639 /note="glutathione S-transferase (GST-Pi) (AA 1 - 210)" /codon_start=1 /db_xref="PID:g31946" /db_xref="SWISS-PROT:P09211" /translation="MPPYTVVYFPVRGRCAALRMLLADQGQSWKEEVVTVETWQEGSL KASCLYGQLPKFQDGDLTLYQSNTILRHLGRTLGLYGKDQQEAALVDMVNDGVEDLRC KYISLIYTNYEAGKDDYVKALPGQLKPFETLLSQNQGGKTFIVGDQISFADYNLLDLL LIHEVLAPGCLDAFPLLSAYVGRLSARPKLKAFLASPEYVNLPINGNGKQ" misc_feature 694..699 /note="polyA signal" polyA_site 714 /note="polyA site" BASE COUNT 147 a 221 c 208 g 138 t ORIGIN 1 gccaccatgc cgccctacac cgtggtctat ttcccagttc gaggccgctg cgcggccctg 61 cgcatgctgc tggcagatca gggccagagc tggaaggagg aggtggtgac cgtggagacg 121 tggcaggagg gctcactcaa agcctcctgc ctatacgggc agctccccaa gttccaggac 181 ggagacctca ccctgtacca gtccaatacc atcctgcgtc acctgggccg cacccttggg 241 ctctatggga aggaccagca ggaggcagcc ctggtggaca tggtgaatga cggcgtggag 301 gacctccgct gcaaatacat ctccctcatc tacaccaact atgaggcggg caaggatgac 361 tatgtgaagg cactgcccgg gcaactgaag ccttttgaga ccctgctgtc ccagaaccag 421 ggaggcaaga ccttcattgt gggagaccag atctccttcg ctgactacaa cctgctggac 481 ttgctgctga tccatgaggt cctagcccct ggctgcctgg atgcgttccc cctgctctca 541 gcatatgtgg ggcgcctcag cgcccggccc aagctcaagg ccttcctggc ctcccctgag 601 tacgtgaacc tccccatcaa tggcaacggg aaacagtgag ggttgggggg actctgagcg 661 ggaggcagag tttgccttcc tttctccagg accaataaaa tttctaagag agct // LOCUS HSGSTT1 1004 bp RNA PRI 11-JUL-1994 DEFINITION H.sapiens GSTT1 mRNA. ACCESSION X79389 NID g510904 KEYWORDS glutathione transferase; GSTT1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1004) AUTHORS Pemble,S., Schroeder,K.R., Spencer,S.R., Meyer,D.J., Hallier,E., Bolt,H.M., Ketterer,B. and Taylor,J.B. TITLE Human glutathione S-transferase theta (GSTT1): cDNA cloning and the characterization of a genetic polymorphism JOURNAL Biochem. J. 300 (Pt 1), 271-276 (1994) MEDLINE 94256948 REFERENCE 2 (bases 1 to 1004) AUTHORS Pemble,S.E. TITLE Direct Submission JOURNAL Submitted (22-JUN-1994) S.E. Pemble, University College London, Cancer Res Campaign, Mol Toxicology Res Group, Windeyer Bldg, Cleveland St London W1P 6DB, UK FEATURES Location/Qualifiers source 1..1004 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HEP G2" gene 1..723 /gene="GST T1" CDS 1..723 /gene="GST T1" /EC_number="2.5.1.18" /codon_start=1 /product="glutathione transferase T1" /db_xref="PID:g510905" /db_xref="SWISS-PROT:P30711" /translation="MGLELYLDLLSQPCRAVYIFAKKNDIPFELRIVDLIKGQHLSDA FAQVNPLKKVPALKDGDFTLTESVAILLYLTRKYKVPDYWYPQDLQARARVDEYLAWQ HTTLRRSCLRALWHKVMFPVFLGGPVSPQTLAATLAELDVTLQLLEDKFLQNKAFLTG PHISLADLVAITELMHPVGAGCQVFEGRPKLATWRQRVEAAVGEDLFQEAHEVILKAK DFPPADPTIKQKLMPWVLAMIR" BASE COUNT 212 a 310 c 264 g 218 t ORIGIN 1 atgggtctgg agctctacct ggacctgctg tcccagccct gccgcgctgt ttacatcttt 61 gccaagaaga acgacattcc cttcgagctg cgcatcgtgg atctgattaa aggtcagcac 121 ttaagcgatg cctttgccca ggtgaacccc ctcaagaagg tgccggcctt gaaggacggg 181 gacttcacct tgacggagag tgtggccatc ctgctctacc tgacgcgcaa atataaggtc 241 cctgactact ggtaccctca ggacctgcag gcccgtgccc gtgtggatga gtacctggca 301 tggcagcaca cgactctgcg gagaagctgc ctccgggcct tgtggcataa ggtgatgttc 361 cctgtgttcc tgggtgggcc agtatctccc cagacactgg cagccaccct ggcagagttg 421 gatgtgaccc tgcagttgct cgaggacaag ttcctccaga acaaggcctt ccttactggt 481 cctcacatct ccttagctga cctcgtagcc atcacggagc tgatgcatcc cgtgggtgct 541 ggctgccaag tcttcgaagg ccgacccaag ctggccacat ggcggcagcg cgtggaggca 601 gcagtggggg aggacctctt ccaggaggcc catgaggtca ttctgaaggc caaggacttc 661 ccacctgcag accccaccat aaagcagaag ctgatgccct gggtgctggc catgatccgg 721 tgagctggga aacctcaccc ttgcaccgtc ctcagcagtc cacaaagcat tttcatttct 781 aatggcccat gggagccagg cccagaaagc aggaatggct tgcttaagac ttgcccaagt 841 cccagagcac ctcacctccc gaagccacca tccccaccct gtcttccaca gccgcctgaa 901 agccacaatg agaatgatgc acactgaggc cttgtgtcct ttaatcactg catttcattt 961 tgattttgga taataaacct ggctcagcct gagcctctgc ttct // LOCUS HSGTK 1328 bp RNA PRI 26-JUN-1995 DEFINITION H.sapiens mRNA for glutamine transaminase K. ACCESSION X82224 NID g758590 KEYWORDS beta-lyase gene; glutamine transaminase K. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1328) AUTHORS Perry,S., Harries,H., Scholfield,C., Lock,T., King,L., Gibson,G. and Goldfarb,P. TITLE Molecular cloning and expression of a cDNA for human kidney cysteine conjugate beta-lyase JOURNAL FEBS Lett. 360 (3), 277-280 (1995) MEDLINE 95188952 REFERENCE 2 (bases 1 to 1328) AUTHORS Goldfarb,P.S.G. TITLE Direct Submission JOURNAL Submitted (17-OCT-1994) P.S.G. Goldfarb, School of Biological Sciences, University of Surrey, Guildford, Surrey GU2 5XH, UK FEATURES Location/Qualifiers source 1..1328 /organism="Homo sapiens" /sub_species="caucasian" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="kidney" /clone="pbeta1RT1" /chromosome="9" gene 1..1269 /gene="beta-lyase" CDS 1..1269 /gene="beta-lyase" /EC_number="2.6.1.64" /codon_start=1 /product="glutamine--phenylpyruvate aminotransferase" /db_xref="PID:g758591" /translation="MAKQLQARRLDGIDYNPWVEFVKLASEHDVVNLGQGFPDFPPPD FAVEAFQHAVSGDFMLNQYTKTFGYPPLTKILASFFGELLGQEIDPLRNVLVTVGGYG ALFTAFQALVDEGDEVIIIEPFFDCYEPMTMMAGGRPVFVSLKPGPIQNGELGSSSNW QLDPMELAGKFTSRTKALVLNTPNNPLGKVFSREELELVASLCQQHDVVCITDEVYQW MVYDGHQHISIASLPGMWERTLTIGSAGKTFSATGWKVGWVLGPDHIMKHLRTVHQNS VFHCPTQSQAAVAESFEREQLLFRQPSSYFVQFPQAMQRCRDHMIRSLQSVGLKPIIP QGSYFLITDISDFKRKMPDLPGAVDEPYDRRFVKWMIKNKGLVAIPVSIFYSVPHQKH FDHYIRFCFVKDEATLQAMDEKLRKWKVEL" misc_binding 730..747 /gene="beta-lyase" /bound_moiety="PLP" BASE COUNT 294 a 390 c 371 g 273 t ORIGIN 1 atggccaaac agctgcaggc ccgaaggcta gacgggatcg actacaaccc ctgggtggag 61 tttgtgaaac tggccagtga gcatgacgtc gtgaacttgg gccagggctt cccggatttc 121 ccaccaccag actttgccgt ggaagccttt cagcacgctg tcagtggaga cttcatgctt 181 aaccagtaca ccaagacatt tggttaccca ccactgacga agatcctggc aagtttcttt 241 ggggagctgc tgggtcagga gatagacccg ctcaggaatg tgctggtgac tgttggtggc 301 tatggggccc tgttcacagc cttccaggcc ctggtggacg aaggagacga ggtcatcatc 361 atcgaaccct tttttgactg ctacgagccc atgacaatga tggcaggggg tcgtcctgtg 421 tttgtgtccc tgaagccggg tcccatccag aatggagaac tgggttccag cagcaactgg 481 cagctggacc ccatggagct ggccggcaaa ttcacatcac gcaccaaagc cctggtcctc 541 aacaccccca acaaccccct gggcaaggtg ttctccaggg aagagctgga gctggtggcc 601 agcctttgcc agcagcatga cgtggtgtgt atcactgatg aagtctacca gtggatggtc 661 tacgacgggc accagcacat cagcattgcc agcctccctg gcatgtggga acggaccctg 721 accatcggca gcgccggcaa gaccttcagc gccactggct ggaaggtggg ctgggtcctg 781 ggtccagatc acatcatgaa gcacctgcgg accgtgcacc agaactccgt cttccactgc 841 cccacgcaga gccaggctgc agtagccgag agctttgaac gggagcagct gctcttccgc 901 caacccagca gctactttgt gcagttcccg caggccatgc agcgctgccg tgaccacatg 961 atacgtagcc tacagtcagt gggcctgaag cccatcatcc ctcagggcag ctacttcctc 1021 atcacagaca tctcagactt caagaggaag atgcctgact tgcctggagc tgtggatgag 1081 ccctatgaca gacgcttcgt caagtggatg atcaagaaca agggcttggt ggccatccct 1141 gtctccatct tctatagtgt gccacatcag aagcactttg accactatat ccgcttctgt 1201 tttgtgaagg atgaagccac gctccaggcc atggacgaga agctgcggaa gtggaaggtg 1261 gaactctagc cctgaagtca cgccttggcc ctgacatccc cacatgcccg cagagatcct 1321 ctttgagt // LOCUS HSGTPBIP 1880 bp RNA PRI 30-MAR-1995 DEFINITION H.sapiens mRNA for GTP-binding protein. ACCESSION X80754 NID g577778 KEYWORDS GTP-binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1880) AUTHORS Schenker,T., Lach,C., Kessler,B., Calderara,S. and Trueb,B. TITLE A novel GTP-binding protein which is selectively repressed in SV40 transformed fibroblasts JOURNAL J. Biol. Chem. 269 (41), 25447-25453 (1994) MEDLINE 95014343 REFERENCE 2 (bases 1 to 1880) AUTHORS Trueb,B. TITLE Direct Submission JOURNAL Submitted (29-JUL-1994) B. Trueb, Swiss Federal Institute of Technology, Zurich, Biochemistry I, ETH Zentrum, 8092 Zurich, SWITZERLAND FEATURES Location/Qualifiers source 1..1880 /organism="Homo sapiens" /strain="Caucasian" /db_xref="taxon:9606" /tissue_type="lung" /cell_type="fibroblast" /cell_line="WI-38" /clone="lambda 385" CDS 48..1142 /codon_start=1 /product="GTP-binding protein" /db_xref="PID:g577779" /translation="MGILEKISEIEKEIARTQKNKATEYHLGLLKAKLAKYRAQLLEP SKSASSKGEGFDVMKSGDARVALIGFPSVGKSTFLSLMTSTASEAASYEFTTLTCIPG VIEYKGANIQLLDLPGIIEGAAQGKGRGRQVIAVARTADVIIMMLDATKGEVQRSLLE KELESVGIRLNKHKPNIYFKPKKGGGISFNSTVTLTQCSEKLVQLILHEYKIFNAEVL FREDCSPDEFIDVIVGNRVYMPCLYVYNKIDQISMEEVDRLARKPNSVVISCGMKLNL DYLLEMLWEYLALTCIYTKKRGQRPDFTDAIILRKGASVEHVCHRIHRSLASQFKYAL VWGTSTKYSPQRVGLTHTMEHEDVIQIVKK" polyA_signal 1227..1238 polyA_signal 1853..1858 polyA_site 1871 polyA_site 1880 BASE COUNT 417 a 549 c 524 g 390 t ORIGIN 1 ccggtgccgc cgccaccgct gtctgtgcgc ccacctctgc tgctaccatg gggatcttag 61 agaagatctc ggagatcgag aaggagatcg ctcggacaca gaagaacaag gccactgagt 121 atcatctggg cctgctgaaa gctaagctcg ccaagtatcg ggcccagctc ctggaaccgt 181 ccaaatcggc ctcatccaaa ggagagggct ttgatgtcat gaagtcgggt gatgcccgtg 241 tggcgctgat tggatttccc tctgtgggta agtccacatt cttgagtctg atgacctcca 301 cggccagcga ggcagcgtcc tatgagttca ccactctgac gtgtattcct ggggtcattg 361 aatacaaagg tgccaacatc cagctcctgg accttcctgg aatcattgaa ggcgcagccc 421 aaggaaaagg ccgtggccgg caggtgatcg ctgtggcgcg cacggctgac gtcatcatca 481 tgatgctgga tgccaccaag ggagaggtgc agaggtctct gctggagaag gagctggagt 541 ctgtgggcat ccgcctcaac aagcacaagc ctaacatcta cttcaagccc aagaaaggtg 601 gtggcatctc ctttaactcg acagtcacgc tgacccagtg ctcggaaaag ctggtgcagc 661 tcatcctgca cgaatacaag atcttcaatg cagaagtgct tttccgagaa gactgctccc 721 cggacgagtt catcgatgtg atcgtgggca accgggtgta catgccctgc ctgtatgttt 781 ataacaaaat cgaccagatc tccatggaag aggtggaccg cctggcccga aaacccaaca 841 gtgtggtcat cagctgcggc atgaagctga acctggacta tctgctggag atgctttggg 901 agtacttggc cctgacctgc atctacacca agaagagagg acagaggcca gacttcacag 961 acgccatcat tctccggaaa ggggcctcag tggagcacgt gtgccaccgc atccaccggt 1021 cactcgccag ccagttcaag tacgccctgg tgtggggcac cagcaccaag tacagtccgc 1081 agcgggtggg cctgacccac accatggagc atgaggacgt catccagatc gtgaagaagt 1141 aacggcgcct gccgggcctc ccgcccacct gcctcgtctc cctggggagg tggtcccact 1201 gggacacaca aacacccaaa cagaaaaata caaatacacg taccccagga aggggtccct 1261 caagtctctg ctatttacag aagtttcttc agtaggcaga cgaagagtgt gttggggcaa 1321 aggggctcgg ttggaggcat ttcccataag actgagccct ctcatggggg ttttgagttt 1381 gtagtgctga gcctgcatct gtgcctccca gccccctgca ctgagggagc aagttgccca 1441 catgcccgcc agccagggcc taaagcagat ggcatgctca gtgccaggct ggtagctggg 1501 cctgtttggg tccctggagg ctgtggctgc tgtcatggca cctcactccc tcagctcttg 1561 ccagcttctc tgacacttgg gttgggggcc cttccaggag gaaaccccct tgggtgcccc 1621 acacagggct ctccatgatg ggaaccagtg gttagtggct tcaaaggccc agctgacacc 1681 ctccacagcc taaggggtgt cctaaagtgc ctccccctgt attccccctc ccagggcagc 1741 ccctgcccag cacaaaaccc caggaccctg gctctgcacg cctggggcag ggacttttga 1801 gtttaggatc tgtattttct aagtccccag ttctcctggc tctcctttct gaaataaagg 1861 attgaaaacg gttcctgttc // LOCUS HSGTS 4586 bp RNA PRI 26-OCT-1994 DEFINITION H.sapiens mRNA for glutaminyl-tRNA synthetase. ACCESSION X54326 NID g31957 KEYWORDS glutamate-tRNA ligase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4586) AUTHORS Knippers,R. TITLE Direct Submission JOURNAL Submitted (17-JUL-1990) Knippers R., Universitaet Konstanz, Fakultaet fuer Biologie, Postfach 5560, D 7750 Konstanz REFERENCE 2 (bases 1 to 4586) AUTHORS Fett,R. and Knippers,R. TITLE The primary structure of human glutaminyl-tRNA synthetase. A highly conserved core, amino acid repeat regions, and homologies with translation elongation factors JOURNAL J. Biol. Chem. 266 (3), 1448-1455 (1991) MEDLINE 91107633 FEATURES Location/Qualifiers source 1..4586 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 59..4381 /codon_start=1 /product="glutaminyl-tRNA synthetase" /db_xref="PID:g31958" /db_xref="SWISS-PROT:P07814" /translation="MEHTEIDHWLEFSATKLSSCDSFTSTINELNHCLSLRTYLVGNS LSLADLCVWATLKGNAAWQEQLKQKKAPVHVKRWFGFLEAQQAFQSVGTKWDVSTTKA RVAPEKKQDVGKFVELPGAEMGKVTVRFPPEASGYLHIGHAKAALLNQHYQVNFKGKL IMRFDDTNPEKEKEDFEKVILEDVAMLHIKPDQFTYTSDHFETIMKYAEKLIQEGKAY VDDTPAEQMKAEREQRIESKHRKNPIEKNLQMWEEMKKGSQFGHSCCLRAKIDMSSNN GCMRDPTLYRCKIQPHPRTGNKYNVYPTYDFACPIVDSIEGVTHALRTTEYHDRDEQF YWIIEALGIRKPYIWEYSRLNLNNTVLSKRKLTWFVNEGLVDGWDDPRFPTVRGVLRR GMTVEGLKQFIAAQGSSRSVVNMEWDKIWAFNKKVIDPVAPRYVALLKKEVIPVNVPE AQEEMKEVAKHPKNPEVGLKPVWYSPKVFIEGADAETFSEGEMVTFINWGNLNITKIH KNADGKIISLDAKFNLENKDYKKTTKVTWLAETTHALPIPVICVTYEHLITKPVLGKD EDFKQYVNKNSKHEELMLGDPCLKDLKKGDIIQLQRRGFFICDQPYEPVSPYSCKEAP CVLIYIPDGHTKEMPTSGSKEKTKVEATKNETSAPFKERPTPSLNNNCTTSEDSLVLY NRVAVQGDVVRELKAKKAPKEDVDAAVKQLLSLKAEYKEKTGQEYKPGNPPAEIGQNI SSNSSASILESKSLYDEVAAQGEVVRKLKAEKSPKAKINEAVECLLSLKAQYKEKTGK EYIPGQPPLSQSSDSSPTRNSEPAGLETPEAKVLFDKVASQGEVVRKLKTEKAPKDQV DIAVQELLQLKAQYKSLIGVEYKPVSATGAEDKDKKKKEKENKSEKQNKPQKQNDGQR KDPSKNQGGGLSSSGAGEGQGPKKQTRLGLEAKKEENLADWYSQVITKSEMIEYHDIS GCYILRPWAYAIWEAIKDFFDAEIKKLGVENCYFPMFVSQSALEKEKTHVADFAPEVA WVTRSGKTELAEPIAIRPTSETVMYPAYAKWVQSHRDLPIKLNQWCNVVRWEFKHPQP FLRTREFLWQEGHSAFATMEEAAEEVLQILDLYAQVYEELLAIPVVKGRKTEKEKFAG GDYTTTIEAFISASGRAIQGGTSHHLGQNFSKMFEIVFEDPKIPGEKQFAYQNSWGLT TRTIGVMTMVHGDNMGLVLPPRVACVQVVIIPCGITNALSEEDKEALIAKCNDYRRRL LSVNIRVRADLRDNYSPGWKFNHWELKGVPIRLEVGPRDMKSCQFVAVRRDTGEKLTV AENEAETKLQAILEDIQVTLFTRASEDLKTHMVVANTMEDFQKILDSGKIVQIPFCGE IDCEDWIKKTTARDQDLEPGAPSMGAKSLCIPFKPLCELQPGAKCVCGKNPAKYYTLF GRSY" BASE COUNT 1516 a 861 c 1044 g 1165 t ORIGIN 1 tatacttcgc tacttggcta gagttgcaac tacagctggg ttatatggct ctaatctgat 61 ggaacatact gagattgatc actggttgga gttcagtgct acaaaattat cttcatgtga 121 ttcctttact tctacaatta atgaactcaa tcattgcctg tctctgagaa catacttagt 181 tggaaactcc ttgagtttag cagatttatg tgtttgggcc accctaaaag gaaatgctgc 241 ctggcaagaa cagttgaaac agaagaaagc tccagttcat gtaaaacgtt ggtttggctt 301 tcttgaagcc cagcaggcct tccagtcagt aggtaccaag tgggatgttt caacaaccaa 361 agctcgagtg gcacctgaga aaaagcaaga tgttgggaaa tttgttgagc ttccaggtgc 421 ggagatggga aaggttaccg tcagatttcc tccagaggcc agtggttact tacacattgg 481 gcatgcaaaa gctgctcttc tgaaccagca ctaccaggtt aactttaaag ggaaactgat 541 catgagattt gatgacacaa atcctgaaaa agaaaaggaa gattttgaga aggttatctt 601 ggaagatgtt gcaatgttgc atatcaaacc agatcaattt acttatactt cggatcattt 661 tgaaactata atgaagtatg cagagaagct aattcaagaa gggaaggctt atgtggatga 721 tactcctgct gaacagatga aagcagaacg tgagcagagg atagaatcta aacatagaaa 781 aaaccctatt gagaagaatc tacaaatgtg ggaagaaatg aaaaaaggga gccagtttgg 841 tcactcctgt tgtttgcgag caaaaattga catgagtagt aacaatggat gcatgagaga 901 tccaaccctt tatcgctgca aaattcaacc acatccaaga actggaaata aatacaatgt 961 ttatccaaca tatgattttg cctgccccat agttgacagc atcgaaggtg ttacacatgc 1021 cctgagaaca acagaatacc atgacagaga tgagcagttt tactggatta ttgaagcttt 1081 aggcataaga aaaccatata tttgggaata tagtcggcta aatctcaaca acacagtgct 1141 atccaaaaga aaactcacat ggtttgtcaa tgaaggacta gtagatggat gggatgaccc 1201 aagatttcct acggttcgtg gtgtactgag aagagggatg acagttgaag gactgaaaca 1261 gtttattgct gctcagggct cctcacgttc agtcgtgaac atggagtggg acaaaatctg 1321 ggcgtttaac aaaaaggtta ttgacccagt ggctccacga tatgttgcat tactgaagaa 1381 agaagtgatc ccagtgaatg tacctgaagc tcaggaggag atgaaagaag tagccaaaca 1441 cccaaagaat cctgaggttg gcttgaagcc tgtgtggtat agtcccaaag ttttcattga 1501 aggtgctgat gcagagactt tttcggaggg tgagatggtt acatttataa attggggcaa 1561 cctcaacatt acaaaaatac acaaaaatgc agatggaaaa atcatatctc ttgatgcaaa 1621 gtttaatttg gaaaacaaag actacaagaa aaccactaag gtcacttggc ttgcagagac 1681 tacacatgct cttcctattc cagtaatctg tgtcacttat gagcacttga tcacaaagcc 1741 agtgctagga aaagacgagg actttaagca gtatgtcaac aagaacagta agcatgaaga 1801 gctaatgcta ggggatccct gccttaagga tttgaaaaaa ggagatatta tacaactcca 1861 gagaagagga ttcttcatat gtgatcaacc ttatgaacct gttagcccat atagttgcaa 1921 ggaagccccg tgtgttttga tatacattcc tgatgggcac acaaaggaaa tgccaacatc 1981 agggtcaaag gaaaagacca aagtagaagc cacaaaaaat gagacctctg ctccttttaa 2041 ggaaagacca acaccttctc tgaataataa ttgtactaca tctgaggatt ccttggtcct 2101 ttacaataga gtggctgttc aaggagatgt ggttcgtgaa ttaaaagcca agaaagcacc 2161 aaaggaagat gtagatgcag ctgtaaaaca gcttttgtct ttgaaagctg aatataagga 2221 gaaaactggc caggaatata aacctggaaa ccctcctgct gaaataggac agaatatttc 2281 ttctaattcc tcagcaagta ttctggaaag taaatctctg tatgatgaag ttgctgcaca 2341 aggggaggtg gttcgtaagc taaaagctga aaaatcccct aaggctaaaa taaatgaagc 2401 tgtagaatgc ttactgtccc tgaaggctca gtataaagaa aaaactggga aggagtacat 2461 acctggtcag cccccattat ctcaaagttc ggattcaagc ccaaccagaa attctgaacc 2521 tgctggttta gaaacaccag aagcgaaagt actttttgac aaagtagctt ctcaagggga 2581 agtagttcgg aaacttaaaa ctgaaaaagc ccctaaggat caagtagata tagctgttca 2641 agaactcctt cagctaaagg cacagtacaa gtctttgata ggagtagagt ataagcctgt 2701 gtcggccact ggagctgagg acaaagataa gaagaagaaa gaaaaagaaa ataaatctga 2761 aaagcagaat aagcctcaga aacaaaatga tggccaaagg aaagaccctt ctaaaaacca 2821 aggaggtggg ctctcatcaa gtggagcagg agaagggcag gggcctaaga aacagaccag 2881 gttgggtctt gaggcaaaaa aagaagaaaa tcttgctgat tggtattctc aggtcatcac 2941 aaagtcagaa atgattgaat accatgacat aagtggctgt tatattcttc gtccctgggc 3001 ctatgccatt tgggaagcca tcaaggactt ttttgatgct gagatcaaga aacttggtgt 3061 tgaaaactgc tacttcccca tgtttgtgtc tcaaagtgca ttagagaaag agaagactca 3121 tgttgctgac tttgccccag aggttgcttg ggttacaaga tctggcaaaa ccgagctggc 3181 agaaccaatt gccattcgtc ctactagtga aacagtaatg tatcctgcat atgcaaaatg 3241 ggtacaatca cacagagacc tgcccatcaa gctcaatcag tggtgcaatg tggtgcgttg 3301 ggaattcaag catcctcagc ctttcctacg tactcgtgaa tttctttggc aggaagggca 3361 cagtgctttt gctaccatgg aagaggcagc ggaagaggtc ttgcagatac ttgacttata 3421 tgctcaggta tatgaagaac tcctggcaat tcctgttgtt aaaggaagaa agacggaaaa 3481 ggaaaaattt gcaggaggag actatacaac tacaatagaa gcatttatat ctgctagtgg 3541 aagagctatc cagggaggaa catcacatca tttagggcag aatttttcca aaatgtttga 3601 aatcgttttt gaagatccaa agataccagg agagaagcaa tttgcctatc aaaactcctg 3661 gggcctgaca actcgaacta ttggtgttat gaccatggtt catggggaca acatgggttt 3721 agtattacca ccccgtgtag catgtgttca ggtggtgatt attccttgtg gcattaccaa 3781 tgcactttct gaagaagaca aagaagcgct gattgcaaaa tgcaatgatt atcgaaggcg 3841 attactcagt gttaacatcc gcgttagagc tgatttacga gataattatt ctccaggttg 3901 gaaattcaat cactgggagc tcaagggagt tcccattaga cttgaagttg ggccacgtga 3961 tatgaagagc tgtcagtttg tagccgtcag acgagatact ggagaaaagc tgacagttgc 4021 tgaaaatgag gcagagacta aacttcaagc tattttggaa gacatccagg tcaccctttt 4081 cacaagggct tctgaagacc ttaagactca tatggttgtg gctaatacaa tggaagactt 4141 tcagaagata ctagattctg gaaagattgt tcagattcca ttctgtgggg aaattgactg 4201 tgaggactgg atcaaaaaga ccactgccag ggatcaagat cttgaacctg gtgctccatc 4261 catgggagct aaaagccttt gcatcccctt caaaccactc tgtgaactgc agcctggagc 4321 caaatgtgtc tgtggcaaga accctgccaa gtactacacc ttatttggtc gcagctactg 4381 agggatgaac gaaagccccc tcttcaactc ctctcacttt ttaaagcatt gatattagta 4441 tcttctcaga tacagaccgt tttatgattt tttaaaaagt aaaagttcta aaatgaagtc 4501 acacaggaca attattctta tgcctaagtt aacagtggat aaaagacttt tctgtaaaca 4561 actccagtaa taaatatcat gaacta // LOCUS HSGTSF 1695 bp RNA PRI 27-MAR-1995 DEFINITION Human mRNA for glioblastoma-derived T-cell suppressor factor G-TsF (transforming growth factor-beta2, TGF-beta2). ACCESSION Y00083 NID g31959 KEYWORDS T-cell suppressor factor; transforming growth factor-beta2. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1695) AUTHORS Hofer,E. TITLE Direct Submission JOURNAL Submitted (02-NOV-1987) Hofer, E., Sandoz AG, Department for Biotechnology, Preclinical Research, Building 386/328, Sandoz AG, CH-4002 Basel REFERENCE 2 (bases 1 to 1695) AUTHORS de Martin,R., Haendler,B., Hofer-Warbinek,R., Gaugitsch,H., Wrann,M., Schlusener,H., Seifert,J.M., Bodmer,S., Fontana,A. and Hofer,E. TITLE Complementary DNA for human glioblastoma-derived T cell suppressor factor, a novel member of the transforming growth factor-beta gene family JOURNAL EMBO J. 6 (12), 3673-3677 (1987) MEDLINE 88111555 FEATURES Location/Qualifiers source 1..1695 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="glioblastoma 308" /clone_lib="lambda gt10" /clone="lambda SUP25, lambda SUP40, lambda SUP42" CDS 182..1426 /note="G-Tsf precursor" /codon_start=1 /db_xref="PID:g31960" /db_xref="SWISS-PROT:P08112" /translation="MHYCVLSAFLILHLVTVALSLSTCSTLDMDQFMRKRIEAIRGQI LSKLKLTSPPEDYPEPEEVPPEVISIYNSTRDLLQEKASRRAAACERERSDEEYYAKE VYKIDMPPFFPSENAIPPTFYRPYFRIVRFDVSAMEKNASNLVKAEFRVFRLQNPKAR VPEQRIELYQILKSKDLTSPTQRYIDSKVVKTRAEGEWLSFDVTDAVHEWLHHKDRNL GFKISLHCPCCTFVPSNNYIIPNKSEELEARFAGIDGTSTYTSGDQKTIKSTRKKNSG KTPHLLLMLLPSYRLESQQTNRRKKRALDAAYCFRNVQDNCCLRPLYIDFKRDLGWKW IHEPKGYNANFCAGACPYLWSSDTQHSRVLSLYNTINPEASASPCCVSQDLEPLTILY YIGKTPKIEQLSNMIVKSCKCS" misc_feature 1087..1088 /note="put. protease cleavage site" mat_peptide 1088..1423 /note="put. mature G-Tsf" BASE COUNT 523 a 386 c 354 g 432 t ORIGIN 1 caagcaggat acgtttttct gttgggcatt gactagattg tttgcaaaag tttcgcatca 61 aaaacaaaca acaacaacaa aaaaccaaac aactctcctt gatctatact ttgagaattg 121 ttgatttctt tttttttatt ctgactttta aaaacaactt ttttttccac ttttttaaaa 181 aatgcactac tgtgtgctga gcgcttttct gatcctgcat ctggtcacgg tcgcgctcag 241 cctgtctacc tgcagcacac tcgatatgga ccagttcatg cgcaagagga tcgaggcgat 301 ccgcgggcag atcctgagca agctgaagct caccagtccc ccagaagact atcctgagcc 361 cgaggaagtc cccccggagg tgatttccat ctacaacagc accagggact tgctccagga 421 gaaggcgagc cggagggcgg ccgcctgcga gcgcgagagg agcgacgaag agtactacgc 481 caaggaggtt tacaaaatag acatgccgcc cttcttcccc tccgaaaatg ccatcccgcc 541 cactttctac agaccctact tcagaattgt tcgatttgac gtctcagcaa tggagaagaa 601 tgcttccaat ttggtgaaag cagagttcag agtctttcgt ttgcagaacc caaaagccag 661 agtgcctgaa caacggattg agctatatca gattctcaag tccaaagatt taacatctcc 721 aacccagcgc tacatcgaca gcaaagttgt gaaaacaaga gcagaaggcg aatggctctc 781 cttcgatgta actgatgctg ttcatgaatg gcttcaccat aaagacagga acctgggatt 841 taaaataagc ttacactgtc cctgctgcac ttttgtacca tctaataatt acatcatccc 901 aaataaaagt gaagaactag aagcaagatt tgcaggtatt gatggcacct ccacatatac 961 cagtggtgat cagaaaacta taaagtccac taggaaaaaa aacagtggga agaccccaca 1021 tctcctgcta atgttattgc cctcctacag acttgagtca caacagacca accggcggaa 1081 gaagcgtgct ttggatgcgg cctattgctt tagaaatgtg caggataatt gctgcctacg 1141 tccactttac attgatttca agagggatct agggtggaaa tggatacacg aacccaaagg 1201 gtacaatgcc aacttctgtg ctggagcatg cccgtattta tggagttcag acactcagca 1261 cagcagggtc ctgagcttat ataataccat aaatccagaa gcatctgctt ctccttgctg 1321 cgtgtcccaa gatttagaac ctctaaccat tctctactac attggcaaaa cacccaagat 1381 tgaacagctt tctaatatga ttgtaaagtc ttgcaaatgc agctaaaatt cttggaaaag 1441 tggcaagacc aaaatgacaa tgatgatgat aatgatgatg acgacgacaa cgatgatgct 1501 tgtaacaaga aaacataaga gagccttggt tcatcagtgt taaaaaattt ttgaaaaggc 1561 ggtactagtt cagacacttt ggaagtttgt gttctgtttg ttaaaactgg catctgacac 1621 aaaaaaagtt gaaggcctta ttctacattt cacctacttt gtaagtgaga gagacaagaa 1681 gcaaattttt ttaaa // LOCUS HSH126L12 2023 bp mRNA PRI 29-JAN-1998 DEFINITION Homo sapiens cDNA mapping to 22q13. ACCESSION AL021682 NID g2827692 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2023) AUTHORS Smink,L.J. and Burton,J. TITLE Direct Submission JOURNAL Submitted (20-JAN-1998) E-mail contact: humquery@sanger.ac.uk Clone requests:clonerequest@sanger.ac.uk COMMENT This sequence was generated from a cDNA clone isolated using sequence from the BAC clone CIT987SK-384D8 sequenced by The Institute for Genomic Research(U62317). All matches to EMBL sequences shown 90% or more. Further information can be found at http://www.sanger.ac.uk/HGP/Chr22/. FEATURES Location/Qualifiers source 1..2023 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="22" /tissue_type="placental" /map="q13" /clone="H126L12" misc_feature join(1..35,36..117,117..256,255..321,321..347,347..377) /note="match: 5' EST H14385 clone 48292" misc_feature 1..197 /note="match: 3' EST F08055 clone c-2pa01" exon 1..198 /number=1 misc_feature join(1..256,255..277) /note="match: 5' EST F11333 clone c-2ve05" mRNA 1..1908 /product="hypothetical protein" misc_feature join(10..117,117..256) /note="match: 5' EST R25092 clone 35322" misc_feature join(61..256,255..314) /note="match: 3' EST Z44689 clone c-27b04" misc_feature join(88..117,117..254) /note="match: 5' EST R58812 clone G5169" misc_feature 110..181 /note="match: 5' EST W46005 clone 354698" misc_feature join(153..256,255..451) /note="match: 3' EST T07636 clone HFBEM13" misc_feature 163..256 /note="match: 5' EST AA079864 clone 545466" exon 199..300 /number=2 misc_feature 205..256 /note="match: 5' EST W40662 clone 351519" misc_feature 255..306 /note="match: 5' EST AA032389 clone 466427" exon 301..356 /number=3 exon 357..441 /number=4 exon 442..510 /number=5 exon 511..590 /number=6 CDS 539..1402 /note="unnamed protein product" /codon_start=1 /db_xref="PID:e1248830" /db_xref="PID:g2827693" /translation="MALVAPDEMEKINNPLYSRQGEVLASRKDFRMNTCVPHPRRAFM LEPEGMSPMEPAGVSPMPGTQNDTGRTEEQPMEVSVCRSPVPALGFSQEPGPSPERPM PLGGGEDEDAEEAVELPEASAPKAALEPKESRSPQQSAALPRRYMLREREGAPEPASC VKETPDLWQSLDPLNSLESKPFKKGRPYSVPPCVEEALGQKRKRKGAAKLQDFHQWYL VAYADHADSRRLRRKGPSFADMEVLYWTHVKEQLETLRKLQRREVAEQWLRPAEEDHL EDSPGRPGGSR" exon 591..736 /number=7 exon 737..820 /number=8 misc_feature join(755..1002,990..1059) /note="match: 3' EST AA337520 clone 111369" exon 821..951 /number=9 misc_feature 944..1410 /note="match: EST AA311710" exon 952..1023 /number=10 exon 1024..1090 /number=11 exon 1091..1198 /number=12 exon 1199..1252 /number=13 exon 1253..1323 /number=14 exon 1324..1396 /number=15 misc_feature complement(join(1373..1417,1179..1391)) /note="match: 3' EST Z40533 clone c-27b04" misc_feature complement(join(1373..1417,1122..1391)) /note="match: 3' EST T17152 clone 35322" misc_feature 1387..1430 /note="match: 5' EST AA199365 clone 633722" exon 1397..1465 /number=16 misc_feature 1416..1791 /note="match: 3' EST AA632685 clone IMAGE:1132878" misc_feature join(1433..1708,1726..1820) /note="match: 3' EST AA632841 clone IMAGE:1133034" misc_feature join(1451..1550,1572..1591,1617..1636,1633..1658, 1686..1707,1726..1867) /note="match: 5' EST AA455393 clone 812164" exon 1466..1515 /number=17 misc_feature join(1510..1708,1726..1812) /note="match: 5' EST H18591 clone 171952" exon 1516..1617 /number=18 misc_feature join(1571..1812,1823..1840) /note="match: 5' EST T83097 clone 110744" misc_feature 1571..1812 /note="match: 5' EST T84295 clone 111369" exon 1618..1767 /number=19 misc_feature complement(join(1720..2017,1554..1721)) /note="match: 3' EST AA633816 clone 858163" misc_feature complement(join(1722..1857,1627..1721)) /note="match: 3' EST T90560 clone 110744" misc_feature complement(1729..1857) /note="match: 3' EST R15808 clone 53308" misc_feature complement(1746..1835) /note="match: 3' EST T82687 clone d459-f" misc_feature 1750..2007 /note="match: 3' EST AA306248 clone 951748" exon 1768..1908 /number=20 misc_feature complement(join(1860..2023,1769..1858,1745..1774)) /note="match: 3' EST R45430 clone 35322" misc_feature complement(join(1861..2023,1774..1842)) /note="match: 3' EST AA603948 clone IMAGE:1117595" misc_feature complement(join(1861..2023,1774..1844)) /note="match: 3' EST AA661752 clone IMAGE:1219221" misc_feature complement(join(1862..1901,1553..1844)) /note="match: 3' EST AA661847 clone IMAGE:1219311" misc_feature 1895..1935 /note="match: 5' EST AA184753 clone 637565" misc_feature complement(join(1899..2006,1864..1901,1722..1841, 1627..1721)) /note="match: 3' EST T85179 clone 111369" misc_feature complement(join(1899..2006,1880..1901)) /note="match: 3' EST H18592 clone 171952" misc_feature complement(join(1976..2007,1921..1982)) /note="match: 3' EST AA456032 clone 812164" misc_feature complement(join(1976..2007,1899..1982,1864..1901, 1722..1841,1643..1716,1624..1644)) /note="match: 3' EST T79868 clone 114881" misc_feature complement(join(1976..2006,1861..1982,1824..1844, 1722..1823,1698..1721)) /note="match: 3' EST AA630051 clone 951748" misc_feature complement(join(1981..2017,1149..1178,1079..1155, 1048..1083)) /note="match: 3' EST H14336 clone 48292" polyA_signal 2005..2010 polyA_site 2007 BASE COUNT 432 a 596 c 641 g 354 t ORIGIN 1 aaatggcgcc agaactagtg gcgggctgag gacgccgtac ccctcggaag gcagccctgc 61 ggtccctttg ccgcccgttc cctcccggac atggaggacg tggaggcgcg cttcgcccac 121 ctcttgcagc ccatccgcga cctcaccaag aactgggagg tggacgtggc ggcccagctg 181 ggcgagtatc tggaggacct ggatcagatc tgcatttctt ttgacgaagg caagaccaca 241 atgaacttca ttgaaggcag cgttgttgat ccatggctct gcctgcgtct acagtaagaa 301 ggtggaatac ctctactggc tcgtctacca ggcccttgat ttcatctctg gaaagaggcg 361 ggccaagcag ctctcttcgg tgcaggagga cagggccaat ggggttgcca gctccggggt 421 cccccaggag gcagagaatg agttcctgtc gctggatgac ttccctgact cccggactaa 481 cgtggatctc aagaatgatc agacgcccag tgaggtcctc atcatccccc tcctgcccat 541 ggccctggtg gcccctgatg aaatggagaa aataaacaat cccctgtaca gccgtcaggg 601 tgaggtcctg gccagccgga aggatttcag gatgaacacg tgcgttcccc accccagaag 661 ggccttcatg ttggagccag aaggcatgtc ccccatggaa ccagcgggcg tttcccccat 721 gccagggacc cagaatgaca ccgggaggac tgaggagcag ccaatggaag tttccgtgtg 781 caggagccct gtcccagcac tcggcttctc ccaggagcca ggcccctctc cagaacgccc 841 gatgcccctg ggtgggggcg aggacgagga tgcagaggag gcagtagagc ttcctgaggc 901 ctcggccccc aaggccgctc tggagcccaa ggagtccagg agcccgcagc agagtgctgc 961 cctgcccagg aggtacatgc tgcgggagcg ggagggggcc ccagagcctg catcctgcgt 1021 gaaggagact ccagacctct ggcagagcct ggaccccttg aactccttgg agtctaagcc 1081 cttcaagaaa ggtaggcctt actctgtgcc cccctgtgtg gaggaggctc tgggacagaa 1141 gcgcaagagg aagggcgctg ccaagctgca ggacttccac cagtggtacc tggttgccta 1201 tgcagaccat gccgacagca ggcggcttcg gcgaaagggt ccgtcctttg cagacatgga 1261 ggtcctgtac tggacacacg tgaaggagca gttggaaact ctccggaagc tgcagaggag 1321 ggaggtggct gagcagtggc tgcggcctgc agaggaggac cacctggagg attcccctgg 1381 aagacctggg ggcagcagat gactttctag agcctgagga gtacatggag cccgagggag 1441 cagaccccag ggaagccgct gaccttgacg cagtgccgat gtccctgagc tacgaggagc 1501 tggttcgaag gaaagtggag ctcttcatcg ccacctccca gaagtttatc caggagacag 1561 agctgagcca gcgcatcagg gactgggagg acacagtgca gcctctgctc caggagcagg 1621 agcagcatgt gccctttgac atccacacct atggggacca gctggtctca cggttccccc 1681 agctcaatga gtggtgtccc tttgcggagc tggtggctgg ccagccggcc ttcgaggtgt 1741 gtcgttccat gctggcctcc ctgcagctgg ccaatgacta cacagtggag ataacccagc 1801 agcccgggct ggagatggcc gtggacacca tgtccctgag attgctcacg caccagcgag 1861 cgcacaagcg cttccagacc tacgctgccc cctccatggc ccagccctga gtggggagca 1921 ccgaggcagg ggtgggggaa tgtgtactga ggagccgtgc gtctgctcct ggctggcccg 1981 gcctaataaa gcagtgttgc catctcaaaa aaaaaaaaaa aaa // LOCUS HSH2AX 1585 bp RNA PRI 12-SEP-1993 DEFINITION Human H2A.X mRNA encoding histone H2A.X. ACCESSION X14850 NID g31972 KEYWORDS histone; histone H2A. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1585) AUTHORS Bonner,W. TITLE Direct Submission JOURNAL Submitted (24-MAR-1989) Bonner W., National Cancer Institute/National Institutes of Health, Building 37-Room 5D17, Bethesda MD 20892, U S A REFERENCE 2 (bases 1 to 1585) AUTHORS Mannironi,C., Bonner,W.M. and Hatch,C.L. TITLE H2A.X. a histone isoprotein with a conserved C-terminal sequence, is encoded by a novel mRNA with both DNA replication type and polyA 3' processing signals JOURNAL Nucleic Acids Res. 17 (22), 9113-9126 (1989) MEDLINE 90067914 COMMENT Data kindly reviewed (21-FEB-1990) by Bonner W.M. FEATURES Location/Qualifiers source 1..1585 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /clone_lib="Okayama-Berg" CDS 74..505 /note="histone H2A.X" /codon_start=1 /db_xref="PID:g31973" /db_xref="SWISS-PROT:P16104" /translation="MSGRGKTGGKARAKAKSRSSRAGLQFPVGRVHRLLRKGHYAERV GAGAPVYLAAVLEYLTAEILELAGNAARDNKKTRIIPRHLQLAIRNDEELNKLLGGVT IAQGGVLPNIQAVLLPKKTSATVGPKAPSGGKKATQASQEY" repeat_unit 555..561 /note="inverted repeat A" misc_feature 555..570 /note="stem-loop structure" repeat_unit 563..570 /note="inverted repeat A'" misc_feature 1561..1566 /note="polyA signal" polyA_site 1585 /note="polyA site" BASE COUNT 251 a 558 c 501 g 275 t ORIGIN 1 acagcagtta cactgcggcg ggcgtctgtt ctagtgtttg agccgtcgtg cttcaccggt 61 ctacctcgct agcatgtcgg gccgcggcaa gactggcggc aaggcccgcg ccaaggccaa 121 gtcgcgctcg tcgcgcgccg gcctccagtt cccagtgggc cgtgtacacc ggctgctgcg 181 gaagggccac tacgccgagc gcgttggcgc cggcgcgcca gtgtacctgg cggcagtgct 241 ggagtacctc accgctgaga tcctggagct ggcgggcaat gcggcccgcg acaacaagaa 301 gacgcgaatc atcccccgcc acctgcagct ggccatccgc aacgacgagg agctcaacaa 361 gctgctgggc ggcgtgacga tcgcccaggg aggcgtcctg cccaacatcc aggccgtgct 421 gctgcccaag aagaccagcg ccaccgtggg gccgaaggcg ccctcgggcg gcaagaaggc 481 cacccaggcc tcccaggagt actaagaggg cccgcgccgc ggccggccgc cccagctccc 541 catgccacca caaaggccct tttaagggcc accaccgccc tcatggaaag agctgagccg 601 cttcagactg cggggcaagc gggccgcggc tcccttcccc tcccctcccc tcgcccgcct 661 tcgccgcccg gcctcgagtc cccgcccgcc cccgctcccg tcccgcaccg cctgccgcgt 721 cggcctcggg cctgccctgt ccgccgtccg ccctccggta gggttcgggc cttccggatg 781 cggcttgggc gctcttcggg gacctccgtg gcgcggaaga cccgagcctg ccggggggag 841 gccggcggcg ccgcacctgc ccgcctcggc gttcgtgact cagccgcccc atcccgagtc 901 gctaaggggc tgcggggagg ccgcagcacc ttctggaaga cttggccttc cgctctgacg 961 cagggccgag gtgggcagtc caggccgaga gccggcggcc ctgaaggtga gtgaggccct 1021 cggcagctgc agccggggtg tctggtaccc ccccggcgtg gtgcttagcc caggactttc 1081 agacggccgc tggccgggag gctttggtgg gagagacgcg atcgccgatt tcggtctggc 1141 gccccttctg cggccgggac ccaggccttt cacatcagct ctccctccat cttcattcat 1201 aggtctgcgc tggggccggg acgaagcact tggtaacagg cacatcttcc tcccgagtga 1261 ctgcctccta ggaggacatt taggggaggg cagaggcctg cagtttggct tcacggctgg 1321 ctatgtggac agcaagagtc gttttgcgga acgcgactgg cagccaggcc tgtcgggccc 1381 ccgacgccgc cccatttccc ttccagcaaa ctcaactcgg caatccaagc acctagatac 1441 cagcacaagt cggttaatcc ctgtctggac tgagcctccg ttggcttctg aactggaatt 1501 ctgcagctaa cccttccacg actagaacct taggcattgg ggagttttag atggactaat 1561 tttattaaag gattgttttt ttttt // LOCUS HSH2AZ 869 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for histone H2A.Z. ACCESSION X52317 X06885 NID g31974 KEYWORDS histone; histone H2A; nucleoprotein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 869) AUTHORS Hatch,C.L. TITLE Direct Submission JOURNAL Submitted (25-APR-1990) Hatch C.L., Laboratory of Molecular Pharmacology, National Cancer Institute, Bldg 37 Rm 5C25, 9000 Rockville Pike, Bethesda MD 20982, USA REFERENCE 2 (bases 1 to 869) AUTHORS Hatch,C.L. and Bonner,W.M. TITLE Sequence of cDNAs for mammalian H2A.Z, an evolutionarily diverged but highly conserved basal histone H2A isoprotein species JOURNAL Nucleic Acids Res. 16 (3), 1113-1124 (1988) MEDLINE 88143983 COMMENT See 'M33918','M33917' for human H2A.Z gene and upstream sequences. Data kindly reviewed (06-JUL-1990) by C.L. Hatch. FEATURES Location/Qualifiers source 1..869 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /cell_type="lymphocytic" /cell_line="HUT78" /clone_lib="Okayama-Berg" CDS 107..493 /note="histone H2A.Z (AA 1-127)" /codon_start=1 /db_xref="PID:g31975" /db_xref="SWISS-PROT:P17317" /translation="MAGGKAGKDSGKAKTKAVSRSQRAGLQFPVGRIHRHLKSRTTSH GRVGATAAVYSAAILEYLTAEVLELAGNASKDLKVKRITPRHLQLAIRGDEELDSLIK ATIAGGGVIPHIHKSLIGKKGQQKTV" repeat_region 727..782 /note="pot. stem-repeat" misc_feature 848..853 /note="polyA signal" BASE COUNT 229 a 181 c 209 g 250 t ORIGIN 1 cagtttgaat cgcggtgcga ccgaaggagt aggtgctggg atcgtcaccg tggcaccgat 61 tagccttttc tctgccttgc ttgcttgagc ttcagcggaa ttcgaaatgg ctggcggtaa 121 ggctggaaag gactccggaa aggccaagac aaaggcggtt tcccgctcgc agagagccgg 181 cttgcagttc ccagtgggcc gtattcatcg acacctaaaa tctaggacga ccagtcatgg 241 acgtgtgggc gcgactgccg ctgtgtacag cgcagccatc ctggagtacc tcaccgcaga 301 ggtacttgaa ctggcaggaa atgcatcaaa agacttaaag gtaaagcgta ttacccctcg 361 tcacttgcaa cttgctattc gtggagatga agaattggat tctctcatca aggctacaat 421 tgctggtggt ggtgtcattc cacacatcca caaatctctg attgggaaga aaggacaaca 481 gaagactgtc taaaggatgc ctggattcct tgttatctca ggactctaaa tactctaaca 541 gctgtccagt gttggtgatt ccagtggact gtatctctgt gaaaaacaca attttgcctt 601 tttgtaattc tatttgagca agttggaagt ttaattagct ttccaaccaa ccaaatttct 661 gcattcgagt cttaaccata tttaagtgtt actgtggctt caaagaagct attgattctg 721 aagtagtggg ttttgattga gttgactgtt tttaaaaaac tgtttggatt ttaattgtga 781 tgcagaagtt atagtaacaa acatttggtt ttgttcagac cttatttcca ctctggtgga 841 taagttcaat aaaggtcata tcccaaact // LOCUS HSH4BHIS 814 bp DNA PRI 09-NOV-1992 DEFINITION H.sapiens H4/b gene for H4 histone. ACCESSION X60482 NID g31996 KEYWORDS H4/b gene; histone H4. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 814) AUTHORS Doenecke,D. TITLE Direct Submission JOURNAL Submitted (08-JUL-1991) D. Doenecke, Georg-August Univer, Inst fuer Biochemie, Zentrum 3 des Fachbereichs Medizin Bioce, Humboldtallee 23, 3400 Goettingen, FRG REFERENCE 2 (bases 1 to 814) AUTHORS Doenecke,D. and Kardalinou,E. JOURNAL Unpublished FEATURES Location/Qualifiers source 1..814 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="Placenta" /clone="21" TATA_signal 203..218 gene 257..568 /gene="H4/b" CDS 257..568 /gene="H4/b" /codon_start=1 /product="H4 histone" /db_xref="PID:g31997" /db_xref="SWISS-PROT:P02304" /translation="MSGRGKGGKGLGKGGAKRHRKVLRDNIQGITKPAIRRLARRGGV KRISGLIYEETRGVLKVFLENVIRDAVTYTEHAKRKTVTAMDVVYALKRQGRTLYGFG G" terminator 603..619 /note="Histone mRNA" BASE COUNT 221 a 187 c 202 g 204 t ORIGIN 1 gatcgcgcca ctgcattcca gcctgggcaa cagagcaaga acccgtctca aaaaaaaaaa 61 aaaaaaaaaa aaaaagcacc ctgtgaggaa acagtactaa ttatttgata ttctgggaaa 121 agtgggggac aactgtcagg cttctttgtc gaaagtttat gaactgatgg ctcagttaat 181 ggctgcaagt atagtgtgtg tgtatatata tatatatacc tagcagtatt tattaaatcc 241 cagctgtggt ttcaagatgt ctggccgcgg taagggcgga aagggtctag gtaagggtgg 301 cgccaagcgt caccgtaagg tattgcgtga caatatccaa ggaatcacca agcccgctat 361 ccgccgcctg gctcgccgcg gcggcgtcaa gcgtatttct ggcctcattt atgaggaaac 421 tcgcggagtg ctgaaagttt tcctggaaaa tgtaatccgc gatgctgtca cctacacgga 481 acacgccaaa cgcaagacag tcacagccat ggacgtggtg tacgcgctca agcgccaggg 541 acgcactctt tatggcttcg gcggctgagc ttacctctac agtacactac cgcaaaacca 601 acggcccttt tcagggccac ctatccactc aggagaaaga gtagtagtca ctgctaaaag 661 tgtagtttca cgtgtttagt agctccggtt ttcaagttaa atggtcttat tacgccttgg 721 cttcatatct tactggccgg tgaggcatta gtgtattaaa gtttattttc actcttgctg 781 tgtcgcccat gctggagtaa tcaatggcgc gatc // LOCUS HSH4GHIS 738 bp DNA PRI 08-DEC-1995 DEFINITION H.sapiens H4/g gene for H4 histone. ACCESSION X60486 NID g32003 KEYWORDS H4/g gene; histone H4. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 738) AUTHORS Doenecke,D. TITLE Direct Submission JOURNAL Submitted (08-JUL-1991) D. Doenecke, Georg-August Univer, Inst fuer Biochemie, Zentrum 3 des Fachbereichs Medizin Bioce, Humboldtallee 23, 3400 Goettingen, FRG REFERENCE 2 (bases 1 to 738) AUTHORS Drabent,B., Kardalinou,E., Bode,C. and Doenecke,D. TITLE Association of histone H4 genes with the mammalian testis-specific H1t histone gene JOURNAL DNA Cell Biol. 14 (7), 591-597 (1995) MEDLINE 95352203 FEATURES Location/Qualifiers source 1..738 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="Leucocyte" /clone="D4.1" TATA_signal 185..191 gene 232..543 /gene="H4/g" CDS 232..543 /gene="H4/g" /codon_start=1 /product="H4 histone" /db_xref="PID:g32004" /db_xref="SWISS-PROT:P02304" /translation="MSGRGKGGKGLGKGGAKRHRKVLRDNIQGITKPAIRRLARRGGV KRISGLIYEETRGVLKVFLENVIRDAVTYTEHAKRKTVTAMDVVYALKRQGRTLYGFG G" terminator 595..610 /note="Histone mRNA" BASE COUNT 217 a 178 c 174 g 169 t ORIGIN 1 aatacagcgc attcaacttg caaacaccct tccactccca caaagagcaa gctgtcactg 61 gccaatcaaa acaatgaacc ataatgaaac agtttttctt gctccaccca ctcggtgacc 121 aaatttgaaa aaaaaaaaaa accgcgccaa ctcatgttgt tttcaatcag gtccgccaag 181 tttgtattta aggaactgtt tcagttcata ccttccactg cgataggaat catgtctggt 241 cgcggcaaag gcggaaaagg cttggggaag ggtggtgcta agcgccatcg taaggtgctc 301 cgggataaca tccagggcat tacaaaaccg gctatccgcc gtttggctcg gcgcggtggg 361 gtcaagcgca tttccggtct tatctatgag gagactcgag gtgtgcttaa ggttttctta 421 gagaacgtta ttcgagacgc cgtcacctat acggagcacg ccaagcgcaa aactgtcaca 481 gccatggatg tagtatatgc cctaaaacgt caggggcgca ctctgtatgg cttcggcggc 541 tgaatctaag aatacgcggt ctcctgagaa cttcaaaaaa caaaaaaacc caaaggccct 601 tttcagggcc gctcacaaag tcgtttaaag agctgaaatg cgttgcgaga atgagtttgg 661 atgacagaaa taaccgtgac agcctgcata agaatgaatt gtgtttgcca tgaccggcca 721 cactgtgaca aaatttca // LOCUS HSHACHRB2 1771 bp RNA PRI 31-MAR-1995 DEFINITION Human mRNA for neuronal nicotinic acetylcholine receptor beta-2 subunit. ACCESSION X53179 NID g32016 KEYWORDS acetylcholine receptor; nichotinic acetylcholine receptor; transmembrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1771) AUTHORS Anand,R. TITLE Direct Submission JOURNAL Submitted (22-MAY-1990) Anand R., The Salk Institute for Biological Studies, P.O.Box 85800, San Diego, CA 92138-9216, USA, The Salk Institute foe Biological Studies, P.O.Box 85800, San Diego, CA 92138-9216, USA REFERENCE 2 (bases 1 to 1771) AUTHORS Anand,R. and Lindstrom,J. TITLE Nucleotide sequence of the human nicotinic acetylcholine receptor beta 2 subunit gene JOURNAL Nucleic Acids Res. 18 (14), 4272 (1990) MEDLINE 90332444 FEATURES Location/Qualifiers source 1..1771 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="brain" /cell_type="neuron" /clone_lib="lambda-zap, lambda-gt10 fetal brain cDNA" /clone="HuMM1, HuWK1" sig_peptide 167..241 /note="signal peptide (AA -25 to -1)" CDS 167..1675 /codon_start=1 /product="precursor peptide (AA -25 to 477)" /db_xref="PID:g32017" /db_xref="SWISS-PROT:P17787" /translation="MARRCGPVALLLGFGLLRLCSGVWGTDTEERLVEHLLDPSRYNK LIRPATNGSELVTVQLMVSLAQLISVHEREQIMTTNVWLTQEWEDYRLTWKPEEFDNM KKVRLPSKHIWLPDVVLYNNADGMYEVSFYSNAVVSYDGSIFWLPPAIYKSACKIEVK HFPFDQQNCTMKFRSWTYDRTEIDLVLKSEVASLDDFTPSGEWDIVALPGRRNENPDD STYVDITYDFIIRRKPLFYTINLIIPCVLITSLAILVFYLPSDCGEKMTLCISVLLAL TVFLLLISKIVPPTSLDVPLVGKYLMFTMVLVTFSIVTSVCVLNVHHRSPTTHTMAPW VKVVFLEKLPALLFMQQPRHHCARQRLRLRRRQREREGAGALFFREAPGADSCTCFVN RASVQGLAGAFGAEPAPVAGPGRSGEPCGCGLREAVDGVRFIADHMRSEDDDQSVSED WKYVAMVIDRLFLWIFVFVCVFGTIGMFLQPLFQNYTTTTFLHSDHSAPSSK" mat_peptide 242..1672 /note="nicotinic acetylcholine receptor (AA 1-477)" BASE COUNT 313 a 596 c 507 g 355 t ORIGIN 1 gcagccggct ccctgaggcc caggaaccac cgcggcggcc ggcaccacct ggacccagct 61 ccaggcgggc gcggcttcag caccacggac agcgccccac ccgcggccct ccccccggcg 121 gcgcgctcca gccggtgtag gcgaggcagc gagctatgcc cgcggcatgg cccggcgctg 181 cggccccgtg gcgctgctcc ttggcttcgg cctcctccgg ctgtgctcag gggtgtgggg 241 tacggataca gaggagcggc tggtggagca tctcctggat ccttcccgct acaacaagct 301 tatccgccca gccaccaatg gctctgagct ggtgacagta cagcttatgg tgtcactggc 361 ccagctcatc agtgtgcatg agcgggagca gatcatgacc accaatgtct ggctgaccca 421 ggagtgggaa gattatcgcc tcacctggaa gcctgaagag tttgacaaca tgaagaaagt 481 tcggctccct tccaaacaca tctggctccc agatgtggtc ctgtacaaca atgctgacgg 541 catgtacgag gtgtccttct attccaatgc cgtggtctcc tatgatggca gcatcttctg 601 gctgccgcct gccatctaca agagcgcatg caagattgaa gtaaagcact tcccatttga 661 ccagcagaac tgcaccatga agttccgttc gtggacctac gaccgcacag agatcgactt 721 ggtgctgaag agtgaggtgg ccagcctgga cgacttcaca cctagtggtg agtgggacat 781 cgtggcgctg ccgggccggc gcaacgagaa ccccgacgac tctacgtacg tggacatcac 841 gtatgacttc atcattcgcc gcaagccgct cttctacacc atcaacctca tcatcccctg 901 tgtgctcatc acctcgctag ccatccttgt cttctacctg ccatccgact gtggcgagaa 961 gatgacgttg tgcatctcag tgctgctggc gctcacggtc ttcctgctgc tcatctccaa 1021 gatcgtgcct cccacctccc tcgacgtgcc gctcgtcggc aagtacctca tgttcaccat 1081 ggtgcttgtc accttctcca tcgtcaccag cgtgtgcgtg ctcaacgtgc accaccgctc 1141 gcccaccacg cacaccatgg cgccctgggt gaaggtcgtc ttcctggaga agctgcccgc 1201 gctgctcttc atgcagcagc cacgccatca ttgcgcccgt cagcgcctgc gcctgcggcg 1261 acgccagcgt gagcgcgagg gcgctggagc cctcttcttc cgcgaagccc caggggccga 1321 ctcctgcacg tgcttcgtca accgcgcgtc ggtgcagggg ttggccgggg ccttcggggc 1381 tgagcctgca ccagtggcgg gccccgggcg ctcaggggag ccgtgtggct gtggcctccg 1441 ggaggcggtg gacggcgtgc gcttcatcgc agaccacatg cggagcgagg acgatgacca 1501 gagcgtgagt gaggactgga agtacgtcgc catggtgatc gaccgcctct tcctctggat 1561 ctttgtcttt gtctgtgtct ttggcaccat cggcatgttc ctgcagcctc tcttccagaa 1621 ctacaccacc accaccttcc tccactcaga ccactcagcc cccagctcca agtgaggccc 1681 ttcctcatct ccatgctctt tcacccagcc accctctgct gcacagtagt gttgggtgga 1741 ggatggacga gtgagctacc aggaagaggg g // LOCUS HSHAGH1 1011 bp RNA PRI 28-MAR-1996 DEFINITION H.sapiens mRNA for Glyoxalase II. ACCESSION X90999 NID g1237212 KEYWORDS glyoxalase II gene; HAGH1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1011) AUTHORS Ridderstrom,M., Saccucci,F., Hellman,U., Bergman,T., Principato,G. and Mannervik,B. TITLE Molecular cloning, heterologous expression, and characterization of human glyoxalase II JOURNAL J. Biol. Chem. 271 (1), 319-323 (1996) MEDLINE 96132921 REFERENCE 2 (bases 1 to 1011) AUTHORS Ridderstroem,M. TITLE Direct Submission JOURNAL Submitted (25-AUG-1995) M. Ridderstroem, Uppsala University, Dept of Biochemistry, BMC, Box 576, S-751 23 Uppsala, SWEDEN FEATURES Location/Qualifiers source 1..1011 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="liver" /clone_lib="Clontech HL1001b/2102" gene 37..819 /gene="HAGH1" CDS 37..819 /gene="HAGH1" /EC_number="3.1.2.6" /codon_start=1 /product="glyoxalase II" /db_xref="PID:e197127" /db_xref="PID:g1237213" /translation="MKVEVLPALTDNYMYLVIDDETKEAAIVDPVQPQKVVDAARKHG VKLTTVLTTHHHWDHAGGNEKLVKLESGLKVYGGDDRIGALTHKITHLSTLQVGSLNV KCLATPCHTSGHICYFVSKPGGSEPPAVFTGDTLFVAGCGKFYEGTADEMCKALLEVL GRLPPDTRVYCGHEYTINNLKFARHVEPGNAAIREKLAWAKEKYSIGEPTVPSTLAEE FTYNPFMRVREKTVQQHAGETDPVTTMRAVRREKDQFKMPRD" BASE COUNT 242 a 266 c 302 g 201 t ORIGIN 1 gatttgcgga agaacctgac cgtggacgag ggcaccatga aggtagaggt gctgcctgcc 61 ctgaccgaca actacatgta cctggtcatt gatgatgaga ccaaggaggc tgccattgtg 121 gatccggtgc agccccagaa ggtcgtggac gcggcgagaa agcacggggt gaaactgacc 181 acagtgctca ccacccacca ccactgggac catgctggcg ggaatgagaa actggtcaag 241 ctggagtcgg gactgaaggt gtacgggggt gacgaccgta tcggggccct gactcacaag 301 atcactcacc tgtccacact gcaggtgggg tctctgaacg tcaagtgcct ggcgaccccg 361 tgccacactt caggacacat ttgttacttc gtgagcaagc ccggaggctc ggagccccct 421 gccgtgttca caggtgacac cttgtttgtg gctggctgcg ggaagttcta tgaagggact 481 gcggatgaga tgtgtaaagc tctgctggag gtcttgggcc ggctcccccc ggacacaaga 541 gtctactgtg gccacgagta caccatcaac aacctcaagt ttgcacgcca cgtggagccc 601 ggcaatgccg ccatccggga gaagctggcc tgggccaagg agaagtacag catcggggag 661 cccacagtgc catccaccct ggcagaggag tttacctaca accccttcat gagagtgagg 721 gagaagacgg tgcagcagca cgcaggtgag acggacccgg tgaccaccat gcgggccgtg 781 cgcagggaga aggaccagtt caagatgccc cgggactgag gccgccctgc accttcagcg 841 gatttgggga ttaggctctt ttaggtaact ggctttcctg ctggtccgtg cgggaaattc 901 agtcttgatt taaccttaat tttacagccc ttggcttgtg ttatcggaca ttctaatgca 961 tatttataag agaagtttaa caagtattta ttcccataaa aaaaaaaaaa a // LOCUS HSHAPRA 2972 bp RNA PRI 14-JAN-1991 DEFINITION Human hap mRNA encoding a DNA-binding hormone receptor. ACCESSION Y00291 NID g32025 KEYWORDS DNA binding hormone receptor; steroid hormone receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2972) AUTHORS de The,H., Marchio,A., Tiollais,P. and Dejean,A. TITLE A novel steroid thyroid hormone receptor-related gene inappropriately expressed in human hepatocellular carcinoma JOURNAL Nature 330 (6149), 667-670 (1987) MEDLINE 88065931 FEATURES Location/Qualifiers source 1..2972 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 322..1668 /note="hap protein" /codon_start=1 /db_xref="PID:g32026" /db_xref="SWISS-PROT:P10826" /translation="MFDCMDVLSVSPGQILDFYTASPSSCMLQEKALKACFSGLTQTE WQHRHTAQSIETQSTSSEELVPSPPSPLPPPRVYKPCFVCQDKSSGYHYGVSACEGCK GFFRRSIQKNMIYTCHRDKNCVINKVTRNRCQYCRLQKCFEVGMSKESVRNDRNKKKK ETSKQECTESYEMTAELDDLTEKIRKAHQETFPSLCQLAKYTTNSSADHRVRLDLGLW DKFSELATKCIIKIVEFAKRLPGFTGLTIADQITLLKAACLDILILRICTRYTPEQDT MTFSDGLTLNRTQMHNAGFGPLTDLVFTFANQLLPLEMDDTETGLLSAICLICGDRQD LEEPTKVDKLQEPLLEALKIYIRKRRPSKPHMFPKILMKITDLRSISAKGAERVITLK MEIPGSMPPLIQEMMENSEGHEPLTPSSSGNTAEHSPSISPSSVENSGVSQSPLVQ" BASE COUNT 879 a 650 c 602 g 841 t ORIGIN 1 cggggtagga tccggaaccc attcggaagg ctttttgcaa gcatttactt ggaaggagaa 61 cttgggatct ttctgggaac cccccgcccc ggctggattg gccgagcaag cctggaaaat 121 ggtaaatgat catttggatc aattacaggc ttttagctgg cttgtctgtc ataattcatg 181 attcggggct gggaaaaaga ccaacagcct acgtgccaaa aaaggggcag agtttgatgg 241 agttgggtgg acttttctat gccatttgcc tccacaccta gaggataagc acttttgcag 301 acattcagtg caagggagat catgtttgac tgtatggatg ttctgtcagt gagtcctggg 361 caaatcctgg atttctacac tgcgagtccg tcttcctgca tgctccagga gaaagctctc 421 aaagcatgct tcagtggatt gacccaaacc gaatggcagc atcggcacac tgctcaatca 481 attgaaacac agagcaccag ctctgaggaa ctcgtcccaa gccccccatc tccacttcct 541 ccccctcgag tgtacaaacc ctgcttcgtc tgccaggaca aatcatcagg gtaccactat 601 ggggtcagcg cctgtgaggg atgtaagggc tttttccgca gaagtattca gaagaatatg 661 atttacactt gtcaccgaga taagaactgt gttattaata aagtcaccag gaatcgatgc 721 caatactgtc gactccagaa gtgctttgaa gtgggaatgt ccaaagaatc tgtcaggaat 781 gacaggaaca agaaaaagaa ggagacttcg aagcaagaat gcacagagag ctatgaaatg 841 acagctgagt tggacgatct cacagagaag atccgaaaag ctcaccagga aactttccct 901 tcactctgcc agctggctaa atacaccacg aattccagtg ctgaccatcg agtccgactg 961 gacctgggcc tctgggacaa attcagtgaa ctggccacca agtgcattat taagatcgtg 1021 gagtttgcta aacgtctgcc tggtttcact ggcttgacca tcgcagacca aattaccctg 1081 ctgaaggccg cctgcctgga catcctgatt cttagaattt gcaccaggta taccccagaa 1141 caagacacca tgactttctc agacggcctt accctaaatc gaactcagat gcacaatgct 1201 ggatttggtc ctctgactga ccttgtgttc acctttgcca accagctcct gcctttggaa 1261 atggatgaca cagaaacagg ccttctcagt gccatctgct taatctgtgg agaccgccag 1321 gaccttgagg aaccgacaaa agtagataag ctacaagaac cattgctgga agcactaaaa 1381 atttatatca gaaaaagacg acccagcaag cctcacatgt ttccaaagat cttaatgaaa 1441 atcacagatc tccgtagcat cagtgctaaa ggtgcagagc gtgtaattac cttgaaaatg 1501 gaaattcctg gatcaatgcc acctctcatt caagaaatga tggagaattc tgaaggacat 1561 gaacccttga ccccaagttc aagtgggaac acagcagagc acagtcctag catctcaccc 1621 agctcagtgg aaaacagtgg ggtcagtcag tcaccactcg tgcaataaga cattttctag 1681 ctacttcaaa cattccccag taccttcagt tccaggattt aaaatgcaag aaaaaacatt 1741 tttactgctg cttagttttt ggactgaaaa gatattaaaa ctcaagaagg accaagaagt 1801 tttcatatgt atcaatatat atactcctca ctgtgtaact tacctagaaa tacaaacttt 1861 tccaatttta aaaaatcagc catttcatgc aaccagaaac tagttaaaag cttctatttt 1921 cctctttgaa cactcaagat gcatggcaaa gacccagtca aaatgattta cccctggtta 1981 agtttctgaa gactttgtac atacagaagt atggctctgt tctttctata ctgtatgttt 2041 ggtgctttcc ttttgtcttg catactcaaa ataaccatga caccaaggtt atgaaataga 2101 ctactgtaca cgtctaccta ggttcaaaaa gataactgtc ttgctttcat ggaatagtca 2161 agacatcaag gtaaggaaac aggactattg acaggactat tgtacagtat gacaagataa 2221 ggctgaagat attctacttt agttagtatg gaagcttgtc tttgctcttt ctgatgctct 2281 caaactgcat cttttatttc atgttgccca gtaaaagtat acaaattccc tgcactagca 2341 gaagagaatt ctgtatcagt gtaactgcca gttcagttaa tcaaatgtca tttgttcaat 2401 tgttaatgtc actttaaatt aaaagtggtt tattacttgt ttaatgacat aactacacag 2461 ttagttaaaa aaaatttttt tacagtaatg atagcctcca aggcagaaac acttttcagt 2521 gttaagtttt tgtttacttg ttcacaagcc attagggaaa tttcatggga taattagcag 2581 gctggtctac cactggacca tgtaactcta gtgtccttcc tgattcatgc ctgatattgg 2641 gatttttttc cagcccttct tgatgccaag ggctaattat attacatccc aaagaaacag 2701 gcatagaatc tgcctccttt gaccttgttc aatcactatg aagcagagtg aaagctgtgg 2761 tagagtggtt aacagataca agtgtcagtt tcttagttct catttaagca ctactggaat 2821 tttttttttt gatatattag caagtctgtg atgtactttc actggctctg tttgtacatt 2881 gagattgttt gtttaacaat gctttctatg ttcatatact gtttaccttt ttccatggac 2941 tctcctggca aagaataaaa tatatttatt tt // LOCUS HSHB15RNA 1761 bp RNA PRI 31-JUL-1992 DEFINITION Homo sapiens mRNA for HB15. ACCESSION Z11697 NID g32027 KEYWORDS HB15 gene; immunoglobulin superfamily. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1761) AUTHORS Zhou,L.J., Schwarting,R., Smith,H.M. and Tedder,T.F. TITLE A novel cell-surface molecule expressed by human interdigitating reticulum cells, Langerhans cells, and activated lymphocytes is a new member of the Ig superfamily JOURNAL J. Immunol. 149 (2), 735-742 (1992) MEDLINE 92325513 REFERENCE 2 (bases 1 to 1761) AUTHORS Tedder,T.F. TITLE Direct Submission JOURNAL Submitted (11-FEB-1992) T.F. Tedder, Division of Tumor Immunology, Dana-Farber Cancer Institute/Harvard Medical School, 44 Binney St., Boston, MA, 02115-6084, USA FEATURES Location/Qualifiers source 1..1761 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="human tonsil" /cell_type="lymphocyte" /clone_lib="cDNA library in lambda gt-11" /clone="pHB15" CDS 11..628 /note="a cell-surface molecule expressed by interdigitating reticulum cells, Langerhans cells and activated lymphocytes.A member of the immunoglobulin superfamily" /codon_start=1 /evidence=experimental /product="HB15" /db_xref="PID:g32028" /db_xref="SWISS-PROT:Q01151" /translation="MSRGLQLLLLSCAYSLAPATPEVKVACSEDVDLPCTAPWDPQVP YTVSWVKLLEGGEERMETPQEDHLRGQHYHQKGQNGSFDAPNERPYSLKIRNTTSCNS GTYRCTLQDPDGQRNLSGKVILRVTGCPAQRKEETFKKYRAEIVLLLALVIFYLTLII FTCKFARLQSIFPDFSKAGMERAFLPVTSPNKHLGLVTPHKTELV" sig_peptide 11..67 mat_peptide 68..625 /note="proposed amino-terminus of mature protein product" /product="HB15" polyA_signal 1248..1253 BASE COUNT 453 a 399 c 442 g 467 t ORIGIN 1 gaattccgcc atgtcgcgcg gcctccagct tctgctcctg agctgcgcct acagcctggc 61 tcccgcgacg ccggaggtga aggtggcttg ctccgaagat gtggacttgc cctgcaccgc 121 cccctgggat ccgcaggttc cctacacggt ctcctgggtc aagttattgg agggtggtga 181 agagaggatg gagacacccc aggaagacca cctcagggga cagcactatc atcagaaggg 241 gcaaaatggt tctttcgacg cccccaatga aaggccctat tccctgaaga tccgaaacac 301 taccagctgc aactcgggga catacaggtg cactctgcag gacccggatg ggcagagaaa 361 cctaagtggc aaggtgatct tgagagtgac aggatgccct gcacagcgta aagaagagac 421 ttttaagaaa tacagagcgg agattgtcct gctgctggct ctggttattt tctacttaac 481 actcatcatt ttcacttgta agtttgcacg gctacagagt atcttcccag atttttctaa 541 agctggcatg gaacgagctt ttctcccagt tacctcccca aataagcatt tagggctagt 601 gactcctcac aagacagaac tggtatgagc aggatttctg caggttcttc ttcctgaagc 661 tgaggctcag gggtgtgcct gtctgttaca ctggaggaga gaagaatgag cctacgctga 721 agatggcatc ctgtgaagtc cttcacctca ctgaaaacat ctggaagggg atcccacccc 781 attttctgtg ggcaggcctc gaaaaccatc acatgaccac atagcatgag gccactgctg 841 cttctccatg gccacctttt cagcgatgta tgcagctatc tggtcaacct cctggacatt 901 ttttcagtca tataaaagct atggtgagat gcagctggaa aagggtcttg ggaaatatga 961 atgcccccag ctggcccgtg acagactcct gaggacagct gtcctcttct gcatcttggg 1021 gacatctctt tgaattttct gtgttttgct gtaccagccc agatgtttta cgtctgggag 1081 aaattgacag atcaagctgt gagacagtgg gaaatattta gcaaataatt tcctggtgtg 1141 aaggtcctgc tattactaag gagtaatctg tgtacaaaga aataacaagt cgatgaacta 1201 ttccccagca gggtcttttc atctgggaaa gacatccata aagaagcaat aaagaagagt 1261 gccacattta tttttatatc tatatgtact tgtcaaagaa ggtttgtgtt tttctgcttt 1321 tgaaatctgt atctgtagtg agatagcatt gtgaactgac aggcagcctg gacatagaga 1381 gggagaagaa gtcagagagg gtgacaagat agagagctat ttaatggccg gctggaaatg 1441 ctgggctgac ggtgcagtct gggtgctcgt ccacttgtcc cactatctgg gtgcatgatc 1501 ttgagcaagt tccttctggt gtctgctttc tccattgtaa accacaaggc tgttgcatgg 1561 gctaatgaag atcatatacg tgaaaattct ttgaaaacat ataaagcact atacagattc 1621 gaaactccat tgagtcatta tccttgctat gatgatggtg ttttggggat gagagggtgc 1681 tatccatttc tcatgttttc cattgtttga aacaaagaag gttaccaaga agcctttcct 1741 gtagccttct gtaggaattc c // LOCUS HSHB2B 734 bp DNA PRI 28-FEB-1995 DEFINITION H.sapiens HB2B gene for high sulfur keratin. ACCESSION X63338 S47244 NID g311881 KEYWORDS hair microfibrill matrix protein; HB2B gene; high sulphur keratin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 734) AUTHORS Zhumabayeva,B.D. TITLE Direct Submission JOURNAL Submitted (25-NOV-1991) B.D. Zhumabayeva, Institute of Molecular Genetics, USSR Academy of Science, Kurchatov sq. 46, Moscow, 123182, USSR REFERENCE 2 (bases 1 to 734) AUTHORS Zhumabaeva,B.D., Gening,L.V. and Gazaryan,K.G. TITLE Cloning and structural characterization of human hair sulfur-rich keratin genes JOURNAL Mol. Biol. 26, 550-555 (1992) REFERENCE 3 (bases 1 to 734) AUTHORS Zhumabaeva,B.D., Gening,L.V. and Gazarian,K.G. TITLE [Cloning and structural characteristics of human hair keratin genes rich in sulfur] JOURNAL Mol. Biol. 26, 813-820 (1992) FEATURES Location/Qualifiers source 1..734 /organism="Homo sapiens" /db_xref="taxon:9606" CAAT_signal 13..17 TATA_signal 59..66 gene 142..669 /gene="HB2B" CDS 142..669 /gene="HB2B" /codon_start=1 /product="high sulfur keratin" /db_xref="PID:g311882" /translation="MTCCQTSFCGYPSCSTSGTCGSSCCQPSCCETSCCQPSCCETSC CQPSCCQTSFCDFLASQLVDLQLSCCQPSCCETSCCQPSCCQTSSCGTGCGIGGGIGY GQEGSSGAVSTRIRWCRPDCRVEGTCLPPCCVVSCHTPTCCQLHHAEASCCRPSYCGQ SCCRPVCCCYSCSHC" BASE COUNT 161 a 234 c 183 g 156 t ORIGIN 1 aattatccaa cacaataagg cagagcttct gaattatgta aacagtagct ggccaggctt 61 ataaaaggcc aatgtggcag ccatcaccaa aactcagaaa ctcctccaag caacccagac 121 ttcataccag ctcccaacac catgacctgc tgccagacca gcttctgtgg atatcccagc 181 tgctccacca gtgggacatg cggctccagc tgctgccagc caagctgctg tgagaccagc 241 tgctgccagc caagctgctg tgagaccagc tgctgccagc caagctgctg ccagaccagc 301 ttctgcgatt tcctagcttc tcaactagtg gacctgcagc tcagttgctg ccagccaagc 361 tgctgtgaga ccagctgctg ccagccaagc tgctgccaga ccagctcctg cggaactggc 421 tgtggcattg gtggtggcat tggctatggc caggagggca gcagtggagc tgtgagcacc 481 cgtatcaggt ggtgccgccc agactgccgt gtggagggta cctgcctgcc cccctgctgt 541 gtggtgagct gccacacccc aacctgctgc cagctgcacc acgccgaggc ctcctgctgc 601 cgcccatcct actgtggaca gtcctgctgc cgcccagtct gctgctgcta ctcctgtagc 661 cactgctaaa gcagtttgct gatttaactg aaattccatt tcagttccat tcagttaagc 721 aataattcta agaa // LOCUS HSHBD1 362 bp RNA PRI 08-SEP-1997 DEFINITION H.sapiens mRNA for hBD-1 protein. ACCESSION X92744 NID g1617087 KEYWORDS hBD-1; hBD-1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 362) AUTHORS Liu,L., Zhao,C., Heng,H.H. and Ganz,T. TITLE The human beta-defensin-1 and alpha-defensins are encoded by adjacent genes: two peptide families with differing disulfide topology share a common ancestry JOURNAL Genomics 43 (3), 316-320 (1997) MEDLINE 97422608 REFERENCE 2 (bases 1 to 362) AUTHORS Zhao,C. TITLE Direct Submission JOURNAL Submitted (03-NOV-1995) C. Zhao, UCLA, Department of Medicine, 37-055, Center for the Health Sciences, LosAngeles, CA. 90095-1678, USA FEATURES Location/Qualifiers source 1..362 /organism="Homo sapiens" /note="caucasian" /db_xref="taxon:9606" /sex="female" /tissue_type="kidney" gene 68..274 /gene="hBD-1" CDS 68..274 /gene="hBD-1" /codon_start=1 /db_xref="PID:e209045" /db_xref="PID:g1617088" /db_xref="SWISS-PROT:Q09753" /translation="MRTSYLLLFTLCLLLSEMASGGNFLTGLGHRSDHYNCVSSGGQC LYSACPIFTKIQGTCYRGKAKCCK" mat_peptide 164..271 /gene="hBD-1" BASE COUNT 92 a 87 c 82 g 101 t ORIGIN 1 gctcagcctc caaaggagcc agcctctccc cagttcctga aatcctgagt gttgcctgcc 61 agtcgccatg agaacttcct accttctgct gtttactctc tgcttacttt tgtctgagat 121 ggcctcaggt ggtaactttc tcacaggcct tggccacaga tctgatcatt acaattgcgt 181 cagcagtgga gggcaatgtc tctattctgc ctgcccgatc tttaccaaaa ttcaaggcac 241 ctgttacaga gggaaggcca agtgctgcaa gtgagctggg agtgaccaga agaaatgacg 301 cagaagtgaa atgaactttt tataagcatt cttttaataa aggaaaattg cttttgaagt 361 at // LOCUS HSHBF1A 2559 bp RNA PRI 25-AUG-1995 DEFINITION H.sapiens HBF-1 mRNA for transcription factor. ACCESSION X74142 NID g516380 KEYWORDS HBF-1 gene; transcription factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2559) AUTHORS Murphy,D.B., Wiese,S., Burfeind,P., Schmundt,D., Mattei,M.G., Schulz-Schaeffer,W. and Thies,U. TITLE Human brain factor 1, a new member of the fork head gene family JOURNAL Genomics 21 (3), 551-557 (1994) MEDLINE 95048332 REFERENCE 2 (bases 1 to 2559) AUTHORS Wiese,S. TITLE Direct Submission JOURNAL Submitted (19-JUL-1993) S. Wiese, Institut f Humangenetik, Gosslerstr 12D, 37073 Goettingen, FRG REFERENCE 3 (bases 1 to 2559) AUTHORS Wiese,S., Murphy,D.B., Schlung,A., Burfeind,P., Schmundt,D., Schnulle,V., Mattei,M.G. and Thies,U. TITLE The genes for human brain factor 1 and 2, members of the fork head gene family, are clustered on chromosome 14q JOURNAL Biochim. Biophys. Acta 1262 (2-3), 105-112 (1995) MEDLINE 95322450 FEATURES Location/Qualifiers source 1..2559 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="brain" gene 219..1752 /gene="HBF-1" CDS 219..1652 /gene="HBF-1" /codon_start=1 /product="transcription factor" /db_xref="PID:g516381" /translation="MLDMGDRKEVKMIPKSSFSINSLVPEGLQNDNHHASHGHHNSHH PQHHHHHHHHHHHPPPPAPQPPPPRAAQQQQPPPPPLAPQAGGAAQSNDEKGPQLLLL PPTDHHRPPSGAKAGGCCRPGELGPVGPDEKEKGAGAGGEEKKGAGEGGKDGEGGKEG EKKNGKYEKPPFSYNALIMMAIRQSPEKRLTLNGIYEFIMKNFPYYRENKQGWQNSIR HNLSLNKCFVKVPRHYDDPGKGNYWMLDPSSDDVFIGGTTGKLRRRSTTSPAKLAFKR GAALTSTGLTFMDRAGSLYWPMSPFLSLHHPRASSTLSYNGTTSAYPSHPMPYSSVLT QNSLGNNHSFSTANGLSVDRLVNGEIPYATHHLTAAALAASVPCGLSVPCSGTYSLNP CSVNLLAGQTSYFFPHVPHPSMTSQSSTSMSARAASSSTSPPAPRPLPCESLRPSLPS FTTGLSGGLSDYFTHQNQGSSSNPLIH" polyA_signal 1747..1752 /gene="HBF-1" BASE COUNT 579 a 787 c 627 g 566 t ORIGIN 1 ttttttttta attcctgagg ggtggttgct gcttttgcta catgacttgc cagcgcccga 61 gcctgcgtcc aactgcgctg ctgccggagc gctcagtgcc gccgctgccg cccgccgccc 121 ccccgcgccc cgttcggcac ccaccggtcg ccgcgccgcc cgcgcgccgc tgtcccgctc 181 ccgcgccgcc gccgccgttt ccccccgacg actgggtgat gctggacatg ggagatagga 241 aagaggtgaa aatgatcccc aagtcctcgt tcagcatcaa cagcctggtg cccgagggcc 301 tccagaacga caaccaccac gcgagccacg gccaccacaa cagccaccac ccccagcacc 361 accaccacca ccaccaccat caccaccacc cgccgccgcc cgccccgcaa ccgccgccgc 421 cccgagccgc gcagcagcag cagccgccgc cgccgccgct cgccccgcag gccggcggcg 481 ccgcgcaatc gaacgacgaa aagggccccc agctgcttct gctcccgccg accgaccacc 541 accggccgcc gtccggagct aaagccggag gctgctgccg gcctggggag ctggggcccg 601 tcgggccgga cgagaaggag aagggcgccg gcgctggggg ggaggagaag aagggcgcgg 661 gcgagggcgg caaggacggg gaggggggca aggagggcga gaagaagaac ggcaagtacg 721 agaagccgcc gttcagctac aacgcgctca tcatgatggc catccggcag agccccgaga 781 agcgcctcac gctcaacggc atctacgagt tcatcatgaa gaacttccct tactaccgcg 841 agaacaagca gggctggcag aactccatcc gccacaatct gtccctcaac aagtgcttcg 901 tgaaggtgcc gcgccactac gacgacccgg gcaagggcaa ctactggatg ctggacccgt 961 cgagcgacga cgtgttcatc ggcggcacca cgggcaagct gcggcgccgc tccaccacct 1021 cgccggccaa gctggccttc aagcgcggtg ccgcgctcac ctccaccggc ctcaccttca 1081 tggaccgcgc cggctccctc tactggccca tgtcgccctt cctgtccctg caccaccctc 1141 gcgccagcag cactttgagt tacaacggga ccacgtcggc ctaccccagc caccccatgc 1201 cctacagctc cgtgttgact caaaactcgc tgggcaacaa ccactccttc tccaccgcca 1261 acgggctgag cgtggaccgg ctggtcaacg gggagatccc gtacgccacg caccacctca 1321 cggccgctgc gctcgccgcc tccgtgccct gcggcctgtc ggtgccctgc tccgggacct 1381 actccctcaa cccctgctcc gtcaacctgc tcgcgggcca gaccagttac tttttccccc 1441 acgtccccca cccgtcaatg acttcgcaga gcagcacgtc catgagcgcc agggccgcgt 1501 cctcctccac gtcgccgcca gcccctcgcc ccctgccctg tgagtcttta agaccctctt 1561 tgccaagttt tacgacggga ctgtctgggg gactgtctga ttatttcaca catcaaaatc 1621 aggggtcttc ttccaaccct ttaatacatt aacatccctg ggaccagact gtaagtgaac 1681 gttttacaca catttgcatt gtaaatgata attaaaaaaa taagtccagg tattttttat 1741 taagcccccc cctcccattt ctgtacgttt gttcagtctc tagggttgtt tattattcta 1801 acaaggtgtg gagtgtcagc gaggtgcaat gtggggagaa tacattgtag aatataaggt 1861 ttggaagtca aattatagta gaatgtgtat ctaaatagtg actgctttgc catttcattc 1921 aaacctgaca agtctatctc taagagccgc cagatttcca tgtgtgcagt attataagtt 1981 atcatggaac tatatggtgg acgcagacct tgagaacaac ctaaattatg gggagaattt 2041 taaaatgtta aactgtaatt tgtatttaaa aagcattcgt agtaaaggtg cccaagaaat 2101 tattttggcc atttattgtt ttctcctttt ctttaaagaa ctgttttttt ttcttttgtt 2161 tacttttaga ccaaagattg ggttctagaa aatgcgcctt ggtatactaa gtattaaaac 2221 aaacaaaaag gaaagttgtt tcagttggca acgctgccca ttcaattgaa tcagaagggg 2281 acaaaattaa cgattgcctt cagtttgtgt tgtgtatatt ttgatgtatg tggtcactaa 2341 caggtcactt ttattttttc taaatgtagt gaaatgttaa tacctattgt acttataggt 2401 aaaccttgca aatatgtaac ctgtgttgcg caaatgccgc ataaatttga gtgattgtta 2461 atgttgtctt aaaatttctt gattgtgata ctgtggtcat atgcccgtgt ttgtcactta 2521 caaaaatgtt tactatgaac acacataaat aaaaaatag // LOCUS HSHBGF8 889 bp RNA PRI 12-SEP-1993 DEFINITION Human pleiotrophin (PTN) mRNA. ACCESSION X52946 NID g32030 KEYWORDS growth factor; heparin-binding growth factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 889) AUTHORS Li,S. TITLE Direct Submission JOURNAL Submitted (02-MAY-1990) Li Y. S., Jewish Hospital of St.Louis at Washington Univ. Med. Sch., 216 South Kingshighway Blvd, St, Louis, MO 63110 USA REFERENCE 2 (bases 1 to 889) AUTHORS Li,Y.S., Milner,P.G., Chauhan,A.K., Watson,M.A., Hoffman,R.M., Kodner,C.M., Milbrandt,J. and Deuel,T.F. TITLE Cloning and expression of a developmentally regulated protein that induces mitogenic and neurite outgrowth activity JOURNAL Science 250 (4988), 1690-1694 (1990) MEDLINE 91102543 FEATURES Location/Qualifiers source 1..889 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" mRNA 1..889 /evidence=experimental CDS 252..758 /codon_start=1 /product="pleiotrophin" /db_xref="PID:g32031" /db_xref="SWISS-PROT:P21246" /translation="MQAQQYQQQRRKFAAAFLAFIFILAAVDTAEAGKKEKPEKKVKK SDCGEWQWSVCVPTSGDCGLGTREGTRTGAECKQTMKTQRCKIPCNWKKQFGAECKYQ FQAWGECDLNTALKTRTGSLKRALHNAECQKTVTISKPCGKLTKPKPQAESKKKKKEG KKQEKMLD" sig_peptide 252..347 mat_peptide 348..755 /product="pleiotrophin" polyA_site 889 /note="polyA site" BASE COUNT 298 a 204 c 244 g 143 t ORIGIN 1 gtcaaaggca ggatcaggtt ccccgccttc cagtccaaaa atcccgccaa gagagcccca 61 gagcagagga aaatccaaag tggagagagg ggaagaaaga gaccagtgag tcatccgtcc 121 agaaggcggg gagagcagca gcggcccaag caggagctgc agcgagccgg gtacctggac 181 tcagcggtag caacctcgcc ccttgcaaca aaggcagact gagcgccaga gaggacgttt 241 ccaactcaaa aatgcaggct caacagtacc agcagcagcg tcgaaaattt gcagctgcct 301 tcttggcatt cattttcata ctggcagctg tggatactgc tgaagcaggg aagaaagaga 361 aaccagaaaa aaaagtgaag aagtctgact gtggagaatg gcagtggagt gtgtgtgtgc 421 ccaccagtgg agactgtggg ctgggcacac gggagggcac tcggactgga gctgagtgca 481 agcaaaccat gaagacccag agatgtaaga tcccctgcaa ctggaagaag caatttggcg 541 cggagtgcaa ataccagttc caggcctggg gagaatgtga cctgaacaca gccctgaaga 601 ccagaactgg aagtctgaag cgagccctgc acaatgccga atgccagaag actgtcacca 661 tctccaagcc ctgtggcaaa ctgaccaagc ccaaacctca agcagaatct aagaagaaga 721 aaaaggaagg caagaaacag gagaagatgc tggattaaaa gatgtcacct gtggaacata 781 aaaaggacat cagcaaacag gatcagttaa ctattgcatt tatatgtacc gtaggctttg 841 tattcaaaaa ttatctatag ctaagtacac aataagcaaa aacaaaaag // LOCUS HSHBK2 4234 bp RNA PRI 12-SEP-1993 DEFINITION Human HBK2 mRNA for potassium channel protein. ACCESSION X17622 NID g32032 KEYWORDS membrane protein; potassium channel protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4234) AUTHORS Pongs,O. TITLE Direct Submission JOURNAL Submitted (23-NOV-1989) Pongs O., Ruhr Universitaet, Lehrstuhl fuer Biochemie, Universitaetstr 150, D-4630 Bochum REFERENCE 2 (bases 1 to 4234) AUTHORS Grupe,A., Schroter,K.H., Ruppersberg,J.P., Stocker,M., Drewes,T., Beckh,S. and Pongs,O. TITLE Cloning and expression of a human voltage-gated potassium channel. A novel member of the RCK potassium channel family JOURNAL EMBO J. 9 (6), 1749-1756 (1990) MEDLINE 90269208 FEATURES Location/Qualifiers source 1..4234 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus." /tissue_type="brain" /clone="HBK2" CDS 863..2452 /note="put. HBK2 protein (AA 1-529)" /codon_start=1 /db_xref="PID:g32033" /db_xref="SWISS-PROT:P17658" /translation="MRSEKSLTLAAPGEVRGPEGEQQDAGDFPEAGGGGGCCSSERLV INISGLRFETQLRTLSLFPDTLLGDPGRRVRFFDPLRNEYFFDRNRPSFDAILYYYQS GGRLRRPVNVPLDIFLEEIRFYQLGDEALAAFREDEGCLPEGGEDEKPLPSQPFQRQV WLLFEYPESSGPARGIAIVSVLVILISIVIFCLETLPQFRVDGRGGNNGGVSRVSPVS RGSQEEEEDEDDSYTFHHGITPGEMGTGGSSSLSTLGGSFFTDPFFLVETLCIVWFTF ELLVRFSACPSKPAFFRNIMNIIDLVAIFPYFITLGTELVQQQEQQPASGGGGQNGQQ AMSLAILRVIRLVRVFRIFKLSRHSKGLQILGKTLQASMRELGLLIFFLFIGVILFSS AVYFAEADDDDSLFPSIPDAFWWAVVTMTTVGYGDMYPMTVGGKIVGSLCAIAGVLTI ALPVPVIVSNFNYFYHRETEQEEQGQYTHVTCGQPAPDLRATDNGLGKPDFPEANRER RPSYLPTPHRAYAEKRMLTEV" BASE COUNT 851 a 1174 c 1227 g 982 t ORIGIN 1 aagcttactg gtgaggcaag tgtgcgtcta tttccatggc gccctggctc gcggcagccc 61 ctggctgggc gaggggtgtg atgtgggagt ggggtgggag ggggcagcag gcggggcctg 121 ccacgtcact tggagagtgt gtgttgggaa ggaagggcag agcggagagc cgagccgctg 181 cagctgcggc ggcggcagcg aagccttgag ccgtggggag gtgggtcccc ggctcgggcg 241 ccggggcagc cccgggcctc tgcgaggcct gcggcgcggc tcctagggag gaggtggcgg 301 ctgtggcggc cggaaccgcg accttggccg gacccagccc cgcggtggac gcagggcgga 361 ggccgagccc cgcaggagtc tttgccgagc cggaggaggc gcatctggcg cttcggtacc 421 agcggcagcc gggggtccgg agcggctgga ggagcgcagt ggagaactgg gaagagctag 481 cccggctgga gggcggacct ctgcgtccgg gagccggggt ctcaaggcac cgctgggggc 541 gaagcacggc gtcttttcgg gcagccagtt tcacacgcgc ctgtgtgccg gttccgggca 601 tcccagtaag ctctagcacc cgggcgcggg taacgggaag cgcagaacca aatccccagc 661 gcccaggtca cctccccaga cccagccttg cagggaccag ggctttaggg ctcacggacc 721 caacggccag gtcagaccgc gaaccgggag agcgcggccc caccctaaag agggcgcacg 781 ggagctgggg agcgggtgcc gcgctccaga gattgtgtcg tgggcgccgt cctagtggcg 841 gggagcgcac ctccgagggg gcatgagatc ggagaaatcc cttacgctgg cggcgccggg 901 ggaggtccgt gggccggagg gggagcaaca ggatgcggga gacttcccgg aggccggcgg 961 gggcgggggc tgctgtagta gcgagcggct ggtgatcaat atctccgggc tgcgctttga 1021 gacacaattg cgcaccctgt cgctgtttcc ggacacgctg ctcggagacc ctggccggcg 1081 agtccgcttc ttcgaccccc tgaggaacga gtacttcttc gaccgcaacc ggcccagctt 1141 cgacgccatc ctctactact accagtctgg gggccgcctg cggaggccgg tcaacgtgcc 1201 cctggacatt ttcctggagg agatccgctt ctaccagctg ggggacgagg ccctggcggc 1261 cttccgggag gacgagggct gcctgcccga aggtggcgag gacgagaagc cgctgccctc 1321 ccagcccttc cagcgccagg tgtggctgct ctttgagtac ccagagagct ctgggccggc 1381 caggggcatc gccatcgtct ccgtgttggt cattctcatc tccatagtca tcttttgcct 1441 ggagacctta ccccagttcc gtgtagatgg tcgaggtgga aacaatggtg gtgtgagtcg 1501 agtctcccca gtttccaggg ggagtcagga ggaagaggag gatgaagacg attcctacac 1561 atttcatcat ggcatcaccc ctggggaaat ggggaccggg ggctcctcct cactcagtac 1621 tcttgggggc tccttcttta cagacccctt ctttctggtg gagacgctgt gcattgtctg 1681 gttcactttt gagctcctgg tgcgcttctc cgcctgccct agcaagccgg ccttcttccg 1741 gaacatcatg aacatcattg acttggtggc tatcttcccc tacttcatca ccctgggcac 1801 tgagctggtg cagcagcagg agcagcaacc agccagtgga ggaggcggcc agaatgggca 1861 gcaggccatg tccctggcca tcctccgagt catccgcctg gtccgggtgt tccgcatctt 1921 caagctctcc cgccactcca aggggctgca gatcctgggc aagaccttgc aggcctccat 1981 gagggagctg gggctgctca tcttcttcct cttcatcggg gtcatcctct tctccagtgc 2041 cgtctacttc gcagaggctg acgatgacga ttcgcttttt cccagcatcc cggatgcctt 2101 ctggtgggca gtggttacaa tgaccacggt aggttacggg gacatgtacc ccatgactgt 2161 ggggggaaag atcgtgggct cgctgtgtgc catcgctggg gtcctcacca ttgccctgcc 2221 tgtgcccgtc atcgtctcca acttcaacta cttctaccac cgggagacgg agcaggagga 2281 gcaaggccag tatacccacg tcacttgtgg gcagcctgcg ccggacctga gggcaactga 2341 caacggactt ggcaagcctg acttccccga ggctaaccgg gaacggagac ccagctacct 2401 tcctacacca catcgggcct atgcagagaa aagaatgctc acggaggtct gacccatgca 2461 ggcagggcct gcaggagggg agcactgagc taacagtctc ttaggcttcc ttctcatttc 2521 cactactcac tctagcttca gttgacttct tgactctctc ccctacaccc actacctggc 2581 atccaggacc aaatacctgg actatcaacc ttgttgctta atccctgcag cattcaaggt 2641 taatccatct aagtgacatt tttgaaattc cagcggtgcc acccaatcat gcccagcttc 2701 tgtcatatga atgagatata catttatatg acagaagctg ggcatgattg ggtggcaccg 2761 ctggaatttc aaaaatgtca aggaacagca aatgtcaaca ggatggaaac cagccctatc 2821 tgagtcttcg ctccctcctt agtgttcttt gctttgggtc atgtgcgttt cctagcttca 2881 ggccacttgg taactggaag aagctggagg acagaagcag tactcaactt gctgttattc 2941 cagtgccctg taacaaccac tggtcctcct gcagatgacc cttggtagag tctttatttg 3001 catagcctca aaataggtta ttcgttctaa acttggatgg aattagagaa tacaatcaaa 3061 ctttaccact tggaggacac ggggttagtc caggaccaaa gaggccaatg gatttttcaa 3121 agtgtgcccc agcacacaga ggcactggtg ttcggtctac atttagttct ccccactctg 3181 atcccctgac tctccagctt ccaggaaggt tccttctcag agccaaatac tctttgtgca 3241 agtgccttcc tgagcagaag aactggagaa agggaaccac agagccagga ggaatgtctg 3301 agcagagtca agcaactggc ttgaccacag tctgaagcaa ggtgccactt aaacagatac 3361 tgttttctca aaggggcaga ggaatcgtgt tgcagatggc agccttttct ccttcatttt 3421 ccccacattt tctctggccc tctaccttgc ttcctgggag tttgatttag gattgctgtt 3481 gaaggcttcc tcaggcaaac tccagcttaa agccctagac aggtaaaagc acacattgga 3541 tggcagcatg ggtttcttcc cattttatgg gcatgaaata tgtggtttag aataaggaac 3601 aagcattatt cctttgccaa cagcctcact ctaagaggct tttttgctga gtcaagcaaa 3661 cacttgcctg ctctgcccct tggaggtgca tttgacctgc tctcactggt aaggtgactt 3721 ggtggcgttc ccacttgatt tagccatttt cttccattgt gagaccactg ccatctatcc 3781 acctgcccac ctcccctttt gtttctcagt aacattgcca tttgtttttt gcctttgata 3841 aactgtgatg tactgttctg agatcttttg ggtgcagttc tgaaactgaa aggactgtta 3901 acatgttttt aattttatat ctatgctttc agactctttg atgataattt ttttttttaa 3961 aaattatctc tcgaagagca acttacgaga ggacagcctt atgagggttt gcttgagagg 4021 cagtgtggct tctgtgactg ccagctctaa atctcgatct tgccataact ttacagggta 4081 acttgggtcc acagtcactc tttgtgcctc agtttaccca cccattaaat gggaacatta 4141 ctgtcttccc ctccctacct catggggaat gtctgggaag ctggggacat tgctatgcaa 4201 atgtgtgaat cttagtcatg gatttgattt ttag // LOCUS HSHBPMR 1716 bp RNA PRI 19-JUN-1997 DEFINITION H.sapiens mRNA histone RNA hairpin-binding protein. ACCESSION Z71188 NID g1279473 KEYWORDS HBP; histone RNA hairpin-binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1716) AUTHORS Martin,F. TITLE Direct Submission JOURNAL Submitted (18-APR-1996) Martin F., Institut of Zoology, Developmental Biology, Baltzerstrasse 4, Bern, Switzerland, CH-3012 REFERENCE 2 (bases 1 to 1716) AUTHORS Martin,F., Schaller,A., Eglite,S., Schumperli,D. and Muller,B. TITLE The gene for histone RNA hairpin binding protein is located on human chromosome 4 and encodes a novel type of RNA binding protein JOURNAL EMBO J. 16 (4), 769-778 (1997) MEDLINE 97201520 FEATURES Location/Qualifiers source 1..1716 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="Gal4-HBP 2" /cell_type="EBV-transformed peripheral lymphocytes" /clone_lib="lambda-ACT" /chromosome="4p16.3" exon 1..158 /number=1 CDS 105..917 /function="involved in histone mRNA metabolism" /codon_start=1 /product="histone RNA hairpin-binding protein" /db_xref="PID:e236578" /db_xref="PID:g1279474" /translation="MACRPRSPPRHQSRCDGDASPPSPARWSLGRKRRADGRRWRPED AEEAEHRGAERRPESFTTPEGPKPRSRCSDWASAVEEDEMRTRVNKEMARYKRKLLIN DFGRERKSSSGSSDSKESMSTVPADFETDESVLMRRQKQINYGKNTIAYDRYIKEVPR HLRQPGIHPKTPNKFKKYSRRSWDQQIKLWKVALHFWDPPAEEGCDLQEIHPVDLESA ESSSEPQTSSQDDFDVYSGTPTKVRHMDSQVEDEFDLEACLTEPLRDFSAMS" exon 159..280 /number=2 exon 281..385 /number=3 exon 386..445 /number=4 exon 446..583 /number=5 exon 584..733 /number=6 exon 734..800 /number=7 exon 801..1716 /number=8 polyA_signal 1694..1699 polyA_site 1716 BASE COUNT 474 a 393 c 403 g 446 t ORIGIN 1 gcgggtttct gcctcaggcc ctgccctgct ctactctgcg ctctctgccc gcgccgccgc 61 cgcctcagcc tcggccctgc gctgcgcgcc cggcccgtgc tgccatggcc tgccgcccgc 121 gaagcccgcc gaggcatcag agccgctgcg acggtgacgc cagcccgccg tcccccgcgc 181 gatggagcct gggacggaag cgcagagccg acggcaggcg ctggaggccc gaagacgccg 241 aggaggcaga gcaccgcggc gccgagcgca gacccgagag ctttaccact cctgaaggcc 301 ctaaaccccg ttccagatgc tctgactggg caagtgcagt tgaagaagat gaaatgagga 361 ccagagttaa caaagaaatg gcaagatata aaaggaaact cctcatcaat gactttggaa 421 gagagagaaa atcatcatca ggaagttctg attcaaagga gtctatgtct actgtgccgg 481 ctgactttga gacagatgaa agtgtcctaa tgaggagaca gaagcagatc aactatggga 541 agaacacaat tgcctacgat cgttatatta aagaagtccc aagacacctt cgacaacctg 601 gcattcatcc caagacccct aataaattta agaagtatag tcgacgttca tgggaccagc 661 aaatcaaact ctggaaggtg gctctgcatt tttgggatcc tccagcggaa gaaggatgtg 721 atttgcaaga aatacaccct gtagaccttg aatctgcaga aagcagctcc gagccccaga 781 ccagctctca ggatgacttt gatgtgtact ctggcacacc caccaaggtg agacacatgg 841 acagtcaagt ggaggatgag tttgatttgg aagcttgttt aactgaaccc ttgagagact 901 tctcagccat gagctaactg ccccctggcg gccaggaaga gaaacagctc ctccccgact 961 aggtggaagg ctggccaggc accaagcatg tgtgtgcact tgtacctggt ggtttctctg 1021 ttagcagtcc attagctcat gctgaattat ttttgcctta ctttcttaag aaacattaat 1081 tttatgtata gtgagtatat tttgcatgtt ttaaattgta aatggagcta agtccaagaa 1141 agtacttgaa gctctcttcc agcgagctta attgcgtaat ccctgttgtc ctccagggta 1201 agctgacacg tctacataac tggttttcca caggcatctt cagttattgc ttgtcaggtg 1261 gactgttttg gatttaacca tgtaatccat gggaccaatt gagagtcagc tacttttata 1321 ggcatcaaag tattctcaga cacctttaat atctttatgg aaacttaatt tttggccttt 1381 tatcaatatg tcataacagc attctgaagt cagacattgt taaattgagc tattaaacta 1441 atgagtttta tgtaagttat atggtcttaa tttggtactt gtaaatagca ctagttagac 1501 tctttagaat actccaagag ttagggcagc agagtggagc gatttagaaa gaacatttta 1561 aaacaatcag ttaatttacc atgtaaaatt gctgtaaatg ataatgtgta cagattttct 1621 gttcaaatat tcaattgtaa acttcttgtt aagactgtta cgtttctatt gcttttgtat 1681 gggatattgc aaaaataaaa aggaaagaac cctctt // LOCUS HSHBRM 5959 bp RNA PRI 12-MAR-1996 DEFINITION H.sapiens hbrm mRNA. ACCESSION X72889 NID g414116 KEYWORDS glucocorticoid receptor; hbrm gene; helicase; nuclear; transcriptional regulator. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5959) AUTHORS Muchardt,C. and Yaniv,M. TITLE A human homologue of Saccharomyces cerevisiae SNF2/SWI2 and Drosophila brm genes potentiates transcriptional activation by the glucocorticoid receptor JOURNAL EMBO J. 12 (11), 4279-4290 (1993) MEDLINE 94038910 REFERENCE 2 (bases 1 to 5959) AUTHORS Muchardt,C. TITLE Direct Submission JOURNAL Submitted (25-MAR-1993) C. Muchardt, Pasteur Institute Paris, Unite de Virus Oncogenes, 25 rue du Docteur Roux, 75724 Paris Cedex 15, FRANCE REFERENCE 3 (bases 1 to 5959) AUTHORS Aves,S.J., Hindley,J., Phear,G.A. and Tongue,N. TITLE A fission yeast gene mapping close to suc1 encodes a protein containing two bromodomains JOURNAL Mol. Gen. Genet. 248 (4), 491-498 (1995) MEDLINE 96004771 FEATURES Location/Qualifiers source 1..5959 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="liver" /clone_lib="lambda gt10" /map="5 F" gene 223..4983 /gene="hbrm" CDS 223..4983 /gene="hbrm" /codon_start=1 /product="HBRM" /db_xref="PID:g414117" /translation="MSTPTDPGAMPHPGPSPGPGPSPGPILGPSPGPGPSPGSVHSMM GPSPGPPSVSHPMPTMGSTDFPQEGMHQMHKPIDGIHDKGIVEDIHCGSMKGTGMRPP HPGMGPPQSPMDQHSQGYMSPHPSPLGAPEHVSSPMSGGGPTPPQMPPSQPGALIPGD PQAMSQPNRGPSPFSPVQLHQLRAQILAYKMLARGQPLPETLQLAVQGKRTLPGLQQQ QQQQQQQQQQQQQQQQQQQQPPQPQTQQQQQPALVNYNRPSGPGPELSGPSTPQKLPV PAPGGRPSPAPPAAAQPPAAAVPGPSVPQPAPGQPSPVLQLQQKQSRISPIQKPQGLD PVEILQEREYRLQARIAHRIQELENLPGSLPPDLRTKATVELKALRLLNFQRQLRQEV VACMRRDTTLETALNSKAYKRSKRQTLREARMTEKLEKQQKIEQERKRRQKHQEYLNS ILQHAKDFKEYHRSVAGKIQKLSKAVATWHANTEREQKKETERIEKERMRRLMAEDEE GYRKLIDQKKDRRLAYLLQQTDEYVANLTNLVWEHKQAQAAKEKKKRRRRKKKAEENA EGGESALGPDGEPIDESSQMSDLPVKVTHTETGKVLFGPEAPKASQLDAWLEMNPGYE VAPRSDSEESDSDYEEEDEEEESSRQETEEKILLDPNSEEVSEKDAKQIIETAKQDVD DEYSMQYSARGSQSYYTVAHAISEWVEKQSALLINGTLKHYQLQGLEWMVSLYNNNLN GILADEMGLGKTIQTIALITYLMEHKRLNGPYLIIVPLSTLSNWTYEFDKWAPSVVKI SYKGTPAMRRSLVPQLRSGKFNVLLTTYEYIIKDKHILAKIRWKYMIVDEGHRMKNHH CKLTQVLNTHYVAPRRILLTGTPLQNKLPELWALLNFLLPTIFKSCSTFEQWFNAPFA MTGERVDLNEEETILIIRRLHKVLRPFLLRRLKKEVESQLPEKVEYVIKCDMSALQKI LYRHMQAKGILLTDGSEKDKKGKGGAKTLMNTIMQLRKICNHPYMFQHIEESFAEHLG YSNGVINGAELYRASGKFELLDRILPKLRATNHRVLLFCQMTSLMTIMEDYFAFRNFL YLRLDGTTKSEDRAALLKKFNEPGSQYFIFLLSTRAGGLGLNLQAADTVVIFDSDWNP HQDLQAQDRAHRIGQQNEVRVLRLCTVNSVEEKILAAAKYKLNVDQKVIQAGMFDQKS SSHERRAFLQAILEHEEENEEEDEVPDDETLNQMIARREEEFDLFMRMDMDRRREDAR NPKRKPRLMEEDELPSWIIKDDAEVERLTCEEEEEKIFGRGSRQRRDVDYSDALTEKQ WLRAIEDGNLEEMEEEVRLKKRKRRRNVDKDPAKEDVEKAKKRRGRPPAEKLSPNPPK LTKQMNAIIDTCINYKDSCNVEKVPSNSQLEIEGNSSGRQLSEVFIQLPSRKELPEYY ELIRKPVDFKKIKERIRNHKYRSLGDLEKDVMLLCHNAQTFNLEGSQIYEDSIVLQSV FKSARQKIAKEEESEDESNEEEEEEDEEESESEAKSVKVKIKLNKKDDKGRDKGKGKK RPNRGKAKPVVSDFDSDEEQDEREQSEGSGTDDE" BASE COUNT 1822 a 1352 c 1504 g 1281 t ORIGIN 1 gaattccgga tgctcagatg aaagccccga gatcacagag acccggcgag atcacagaga 61 cccggcctga aggaacgtgg aaagaccaat gtacctgttt tgaccggttg cctggagcaa 121 gaagttccag ttggggagaa ttttcagaag ataaagtcgg agattgtgga aagacttgac 181 ttgcagcatt actctactga ctggcagaga caggagaggt agatgtcaac gcccacagac 241 cctggtgcga tgccccaccc agggccttcg ccgggtcctg ggccttcccc tgggccaatt 301 cttgggccta gtccaggacc aggaccatcc ccaggttccg tccacagcat gatggggcca 361 agtcctggac ctccaagtgt ctcccatcct atgccgacga tggggtccac agacttccca 421 caggaaggca tgcatcaaat gcataagccc atcgatggta tacatgacaa ggggattgta 481 gaagacatcc attgtggatc catgaagggc actggtatgc gaccacctca cccaggcatg 541 ggccctcccc agagtccaat ggatcaacac agccaaggtt atatgtcacc acacccatct 601 ccattaggag ccccagagca cgtctccagc cctatgtctg gaggaggccc aactccacct 661 cagatgccac caagccagcc gggggccctc atcccaggtg atccgcaggc catgagccag 721 cccaacagag gtccctcacc tttcagtcct gtccagctgc atcagcttcg agctcagatt 781 ttagcttata aaatgctggc ccgaggccag cccctccccg aaacgctgca gcttgcagtc 841 caggggaaaa ggacgttgcc tggcttgcag caacaacagc agcagcaaca gcagcagcag 901 cagcagcagc agcagcagca gcagcagcaa cagcagccgc cgcaaccaca gacgcagcaa 961 caacagcagc cggcccttgt taactacaac agaccatctg gcccggggcc ggagctgagc 1021 ggcccgagca ccccgcagaa gctgccggtg cccgcgcccg gcggccggcc ctcgcccgcg 1081 ccccccgcag ccgcgcagcc gcccgcggcc gcagtgcccg ggccctcagt gccgcagccg 1141 gccccggggc agccctcgcc cgtcctccag ctgcagcaga agcagagccg catcagcccc 1201 atccagaaac cgcaaggcct ggaccccgtg gaaattctgc aagagcggga atacagactt 1261 caggcccgca tagctcatag gatacaagaa ctggaaaatc tgcctggctc tttgccacca 1321 gatttaagaa ccaaagcaac cgtggaacta aaagcacttc ggttactcaa tttccagcgt 1381 cagctgagac aggaggtggt ggcctgcatg cgcagggaca cgaccctgga gacggctctc 1441 aactccaaag catacaaacg gagcaagcgc cagactctga gagaagctcg catgaccgag 1501 aagctggaga agcagcagaa gattgagcag gagaggaaac gccgtcagaa acaccaggaa 1561 tacctgaaca gtattttgca acatgcaaaa gattttaagg aatatcatcg gtctgtggcc 1621 ggaaagatcc agaagctctc caaagcagtt gcaacttggc atgccaacac tgaaagagag 1681 cagaagaagg agacagagcg gattgaaaag gagagaatgc ggcgactgat ggctgaagat 1741 gaggagggtt atagaaaact gattgatcaa aagaaagaca ggcgtttagc ttaccttttg 1801 cagcagaccg atgagtatgt agccaatctg accaatctgg tttgggagca caagcaagcc 1861 caggcagcca aagagaagaa gaagaggagg aggaggaaga agaaggctga ggagaatgca 1921 gagggtgggg agtctgccct gggaccggat ggagagccca tagatgagag cagccagatg 1981 agtgacctcc ctgtcaaagt gactcacaca gaaaccggca aggttctgtt cggaccagaa 2041 gcacccaaag caagtcagct ggacgcctgg ctggaaatga atcctggtta tgaagttgcc 2101 cctagatctg acagtgaaga gagtgattct gattatgagg aagaggatga ggaagaagag 2161 tccagtaggc aggaaaccga agagaaaata ctcctggatc caaatagcga agaagtttct 2221 gagaaggatg ctaagcagat cattgagaca gctaagcaag acgtggatga tgaatacagc 2281 atgcagtaca gtgccagggg ctcccagtcc tactacaccg tggctcatgc catctcggag 2341 tgggtggaga aacagtctgc cctcctaatt aatgggaccc taaagcatta ccagctccag 2401 ggcctggaat ggatggtttc cctgtataat aacaacttga acggaatctt agccgatgaa 2461 atggggcttg gaaagaccat acagaccatt gcactcatca cttatctgat ggagcacaaa 2521 agactcaatg gcccctatct catcattgtt cccctttcga ctctatctaa ctggacatat 2581 gaatttgaca aatgggctcc ttctgtggtg aagatttctt acaagggtac tcctgccatg 2641 cgtcgctccc ttgtccccca gctacggagt ggcaaattca atgtcctctt gactacttat 2701 gagtatatta taaaagacaa gcacattctt gcaaagattc ggtggaaata catgatagtg 2761 gacgaaggcc accgaatgaa gaatcaccac tgcaagctga ctcaggtctt gaacactcac 2821 tatgtggccc ccagaaggat cctcttgact gggaccccgc tgcagaataa gctccctgaa 2881 ctctgggccc tcctcaactt cctcctccca acaattttta agagctgcag cacatttgaa 2941 caatggttca atgctccatt tgccatgact ggtgaaaggg tggacttaaa tgaagaagaa 3001 actatattga tcatcaggcg tctacataag gtgttaagac catttttact aaggagactg 3061 aagaaagaag ttgaatccca gcttcccgaa aaagtggaat atgtgatcaa gtgtgacatg 3121 tcagctctgc agaagattct gtatcgccat atgcaagcca aggggatcct tctcacagat 3181 ggttctgaga aagataagaa ggggaaagga ggtgctaaga cacttatgaa cactattatg 3241 cagttgagaa aaatctgcaa ccacccatat atgtttcagc acattgagga atcctttgct 3301 gaacacctag gctattcaaa tggggtcatc aatggggctg aactgtatcg ggcctcaggg 3361 aagtttgagc tgcttgatcg tattctgcca aaattgagag cgactaatca ccgagtgctg 3421 cttttctgcc agatgacatc tctcatgacc atcatggagg attattttgc ttttcggaac 3481 ttcctttacc tacgccttga tggcaccacc aagtctgaag atcgtgctgc tttgctgaag 3541 aaattcaatg aacctggatc ccagtatttc attttcttgc tgagcacaag agctggtggc 3601 ctgggcttaa atcttcaggc agctgataca gtggtcatct ttgacagcga ctggaatcct 3661 catcaggatc tgcaggccca agaccgagct caccgcatcg ggcagcagaa cgaggtccgg 3721 gtactgaggc tctgtaccgt gaacagcgtg gaggaaaaga tcctcgcggc cgcaaaatac 3781 aagctgaacg tggatcagaa agtgatccag gcgggcatgt ttgaccaaaa gtcttcaagc 3841 cacgagcgga gggcattcct gcaggccatc ttggagcatg aagaggaaaa tgaggaagaa 3901 gatgaagtac cggacgatga gactctgaac caaatgattg ctcgacgaga agaagaattt 3961 gaccttttta tgcggatgga catggaccgg cggagggaag atgcccggaa cccgaaacgg 4021 aagccccgtt taatggagga ggatgagctg ccctcctgga tcattaagga tgacgctgaa 4081 gtagaaaggc tcacctgtga agaagaggag gagaaaatat ttgggagggg gtcccgccag 4141 cgccgtgacg tggactacag tgacgccctc acggagaagc agtggctaag ggccatcgaa 4201 gacggcaatt tggaggaaat ggaagaggaa gtacggctta agaagcgaaa aagacgaaga 4261 aatgtggata aagatcctgc aaaagaagat gtggaaaaag ctaagaagag aagaggccgc 4321 cctcccgctg agaaactgtc accaaatccc cccaaactga caaagcagat gaacgctatc 4381 atcgatacgt gtataaacta caaagatagt tgtaacgtgg agaaggtgcc cagtaattct 4441 cagttggaaa tagaaggaaa cagttcaggg cgacagctca gtgaagtctt cattcagtta 4501 ccttcaagga aagaattacc agaatactat gaattaatta ggaagccagt ggatttcaaa 4561 aaaataaagg aaaggattcg taatcataag taccggagcc taggcgacct ggagaaggat 4621 gtcatgcttc tctgtcacaa cgctcagacg ttcaacctgg agggatccca gatctatgaa 4681 gactccatcg tcttacagtc agtgtttaag agtgcccggc agaaaattgc caaagaggaa 4741 gagagtgagg atgaaagcaa tgaagaggag gaagaggaag atgaagaaga gtcagagtcc 4801 gaggcaaaat cagtcaaggt gaaaattaag ctcaataaaa aagatgacaa aggccgggac 4861 aaagggaaag gcaagaaaag gccaaatcga ggaaaagcca aacctgtagt gagcgatttt 4921 gacagcgatg aggagcagga tgaacgtgaa cagtcagaag gaagtgggac ggatgatgag 4981 tgatcagtat ggaccttttt ccttggtaga actgaattcc ttcctcccct gtctcatttc 5041 tacccagtga gttcatttgt catataggca ctgggttgtt tctatatcat catcgtctat 5101 aaactagctt taggatagtg ccagacaaac atatgatatc atggtgtaaa aaacacacac 5161 atacacaaat atttgtaaca tattgtgacc aaatgggcct caaagattca gattgaaaca 5221 aacaaaaagc ttttgatgga aaatatgtgg gtggatagta tatttctatg ggtgggtcta 5281 atttggtaac ggtttgattg tgcctggttt tatcacctgt tcagatgaga agatttttgt 5341 cttttgtagc actgataacc aggagaagcc attaaaagcc actggttatt ttatttttca 5401 tcaggcaatt ttcgaggttt ttatttgttc ggtattgttt ttttacactg tggtacatat 5461 aagcaacttt aataggtgat aaatgtacag tagttagatt tcacctgcat atacgttttt 5521 ccattttatg ctctatgatc tgaacaaaag ctttttgaat tgtataagat ttatgtctac 5581 tgtaaacatt gcttaatttt tttgctcttg atttaaaaaa aagttttgtt gaaagcgcta 5641 ttgaatattg caatctatat agtgtattgg atggcttctt ttgtcaccct gatctcctat 5701 gttaccaatg tgtatcgtct ccttctccct aaagtgtact taatctttgc tttctttgca 5761 caatgtcttt ggttgcaagt cataagcctg aggcaaataa attccagtaa tttcgaagaa 5821 tgtggtgttg gtgctttcct aataaagaaa taatttcgct tgaaaaaaaa aaaaaaaaaa 5881 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 5941 aaaaaaaaaa aaggaattc // LOCUS HSHBZ17 4760 bp RNA PRI 19-SEP-1994 DEFINITION H.sapiens HBZ17 mRNA. ACCESSION X77366 NID g541677 KEYWORDS bZIP transcription factor; leucine zipper protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4760) AUTHORS Luna,L., Johnsen,O., Skartlien,A.H., Pedeutour,F., Turc-Carel,C., Prydz,H. and Kolsto,A.B. TITLE Molecular cloning of a putative novel human bZIP transcription factor on chromosome 17q22 JOURNAL Genomics 22 (3), 553-562 (1994) MEDLINE 95095252 REFERENCE 2 (bases 1 to 4760) AUTHORS Luna,L. TITLE Direct Submission JOURNAL Submitted (28-JAN-1994) L. Luna, Biotechnology Centre of Oslo, University of Oslo, Postbox 1125, Blindern, 0317 Oslo, NORWAY FEATURES Location/Qualifiers source 1..4760 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17q22" gene 593..2911 /gene="HBZ17" CDS 593..2911 /gene="HBZ17" /codon_start=1 /product="hbZ17" /db_xref="PID:g541678" /translation="MLSLKKYLTEGLLQFTILLSLIGVRVDVDTYLTSQLPPLREIIL GPSSAYTQTQFHNLRNTLDGYGIHPKSIDLDNYFTARRLLSQVRALDRFQVPTTEVNA WLVHRDPEGSVSGSQPNSGLALESSSGLQDVTGPDNGVRESETEQGFGEDLEDLGAVA PPVSGDLTKEDIDLIDILWRQDIDLGAGREVFDYSHRQKEQDVEKELRDGGEQDTWAG EGAEALARNLLVDGETGESFPAQVPSGEDQTALSLEECLRLLEATCPFGENAEFPADI SSITEAVPSESEPPALQNNLLSPLLTGTESPFDLEQQWQDLMSIMEMQAMEVNTSASE ILYSAPPGDPLSTNYSLAPNTPINQNVSLHQASLGGCSQDFLLFSPEVESLPVASSST LLPLAPSNSTSLNSTFGSTNLTGLFFPPQLNGTANDTAGPELPDPLGGLLDEAMLDEI SLMDLAIEEGFNPVQASQLEEEFDSDSGLSLDSSHSPSSLSSSEGSSSSSSSSSSSSS SASSSASSSFSEEGAVGYSSDSETLDLEEAEGAVGYQPEYSKFCRMSYQDPAQLSCLP YLEHVGHNHTYNMAPSALDSADLPPPSALKKGSKEKQADFLDKQMSRDEHRARAMKIP FTNDKIINLPVEEFNELLSKYQLSEAQLSLIRDIRRRGKNKMAAQNCRKRKLDTILNL ERDVEDLQRDKARLLREKVEFLRSLRQMKQKVQSLYQEVFGRLRDENGRPYSPSQYAL QYAGDGSVLLIPRTMADQQARRQERKPKDRRK" misc_feature 965..1456 /gene="HBZ17" /note="acidic region" misc_feature 1709..1786 /gene="HBZ17" /note="HOB1 motif" misc_feature 2015..2181 /gene="HBZ17" /note="Serine-rich region" misc_feature 2570..2629 /gene="HBZ17" /note="basic region" misc_feature 2636..2743 /gene="HBZ17" /note="Leucine zipper" BASE COUNT 1123 a 1235 c 1383 g 1019 t ORIGIN 1 ggccggcggt ggcggcggcg agggctggac tcgggcttag ggcctgctgt ggaggcagcg 61 gcggacgccg agctaagcag tttctctgga aacccccctg gtaagtgtgg aggaggcggg 121 acactctgac ccaagacgaa aggcctgtag ctccagccaa agaaaataaa ccttaggagg 181 gagaaggaaa aaaaaaaatc catcagctgt tcctgagaac agcctgcatt ggaatctaca 241 gagaggacaa ctaatgtgag tgaggaagtg actgtatgtg gactgtggag aaagtaagtc 301 acgtgggccc ttgaggacct ggactgggtt aggaacagtt gtactttcag aggtgaggtg 361 tcgagaaggg aaagtgaatg tggtctggag tgtgtccttg gccttggctc cacagggtgt 421 gctttcctct ggggccgtca gggagctcat cccttgtgtt ctgccagggt ggggtacggg 481 gtttgacact gaggagggta acctgctggc tggagcggca gagcagtggc cttgatttgt 541 cttttggaag attttaaaaa ccaaaaagca taaacattct ggtccttcag caatgctttc 601 tctgaagaaa tacttaacgg aaggacttct ccagttcacc attctgctga gtttgattgg 661 ggtacgggtg gacgtggata cttacctgac ctcacagctt cccccactcc gggagatcat 721 cctggggccc agttctgcct atactcagac ccagttccac aacctgagga ataccttgga 781 tggctatggt atccacccca agagcataga cctggacaat tacttcactg cccggcggct 841 cctcagtcag gtgagggccc tggacaggtt ccaggtgcca accactgagg taaatgcctg 901 gctggttcac cgagacccag aggggtctgt ctctggcagt cagcccaact caggcctcgc 961 cctcgagagt tccagtggcc tccaagatgt gacaggccca gacaacgggg tgcgagaaag 1021 cgaaacggag cagggattcg gtgaagattt ggaggatttg ggggctgtag cccccccagt 1081 cagtggagac ttaaccaaag aggacataga tctgattgac atcctttggc gacaggatat 1141 tgatctgggg gctgggcgtg aggtttttga ctatagtcac cgccagaagg agcaggatgt 1201 ggagaaggag ctgcgagatg gaggcgagca ggacacctgg gcaggcgagg gcgcggaagc 1261 tctggcacgg aacctgctag tggatggaga gactggggag agcttccctg cacaggtgcc 1321 tagtggggag gaccagacgg ccctgtccct ggaagagtgc cttaggctgc tggaagccac 1381 ctgccccttt ggggagaatg ctgagtttcc agcagacatt tccagcataa cagaagcagt 1441 gcctagtgag agtgagcccc ctgctcttca aaacaacctc ttgtctcctc ttctgaccgg 1501 gacagagtca ccatttgatt tggaacagca gtggcaagat ctcatgtcca tcatggaaat 1561 gcaggccatg gaagtgaaca catcagcaag tgaaatcctg tacagtgccc ctcctggaga 1621 cccactgagc accaactaca gccttgcccc caacactccc atcaatcaga atgtcagcct 1681 gcatcaggcg tccctggggg gctgcagcca ggacttctta ctcttcagcc ccgaggtgga 1741 aagcctgcct gtggccagta gctccacgct gctcccgttg gcccccagca attctaccag 1801 cctcaactcc accttcggct ccaccaacct gacagggctc ttctttccac cccagctcaa 1861 tggcacagcc aatgacacag caggcccaga gctgcctgac cctttggggg gtctgttaga 1921 tgaagctatg ttggatgaga tcagccttat ggacctggcc attgaagaag gctttaaccc 1981 tgtgcaggcc tcccagctgg aggaggaatt tgactctgac tcaggccttt ccttagactc 2041 gagccatagc ccttcttccc taagcagctc tgaaggcagt tcttcctctt cttcctcctc 2101 ctcttcctct tcttcctctg cttcttcctc tgcctcttcc tccttttctg aggaaggtgc 2161 ggttggctac agctctgact ctgagaccct ggatctggaa gaggccgagg gtgctgtggg 2221 ctaccagcct gagtattcca agttctgccg catgagctac caggatccag ctcagctctc 2281 atgcctgccc tacctggagc acgtgggcca caaccacaca tacaacatgg cacccagtgc 2341 cctggactca gccgacctgc caccacccag tgccctcaag aaaggcagca aggagaagca 2401 ggctgacttc ctggacaagc agatgagccg ggatgagcac cgagcccgag ccatgaagat 2461 ccctttcacc aatgacaaaa tcatcaacct gcctgtggag gagttcaatg aactgctgtc 2521 caaataccag ttgagtgaag cccagctgag cctcatccga gacatccggc gccggggcaa 2581 gaacaagatg gcggcgcaga actgccgcaa gcgcaagctg gacaccatcc tgaatctgga 2641 gcgtgatgtg gaggacctgc agcgtgacaa agcccggctg ctgcgggaga aagtggagtt 2701 cctgcgctcc ctgcgacaga tgaagcagaa ggtccagagc ctgtaccagg aggtgtttgg 2761 gcggctgcga gatgagaacg gacgacccta ctcgcccagt cagtatgcgc tccagtacgc 2821 cggggacggc agtgtcctcc tcatcccccg cacgatggcc gaccagcagg cccggcggca 2881 ggagaggaag ccaaaggacc ggagaaagtg agcctgggga agaagggggt ttgaagccca 2941 ccaagaccga aactggagaa gggctggacc tggacctgga cctggaccta cagcggggac 3001 ttaaatgcct tcttatccaa tatatcttct cagatgggat gactgcgggt cagtgtacag 3061 gaagaggcag gcactggctg gctcagctcc actcgggtgg agtggaagtg gccagaccat 3121 ttagacggac agggtcctca ccctacccct ttcctgtgag gcaggggtgg tggtggagtt 3181 gctggaggta gaggagctat gtggagcaaa ggccgacaga ggggaaggaa tggacctgtg 3241 agaggaaggg aaggtggcag aaagtctcat ttcaggaagg agggatagaa ggaaggaagg 3301 aaggaacccc cccccccccc cgaaaaaaaa atcaaagcgg gaagaaaatc agagggaagg 3361 ttaaggttgg ctctggccag gattccaggc agcaggttgg agtgactggt gggcctagat 3421 cactggtgtg ataaacccca tttcaccccg gggggggtgg ggtacacaga cacagggtgg 3481 gggtggggag gggcggtgtt aactctttct gctccttgca ttttgacatc cctgaagggg 3541 agctcttgga tatcattggc catgtttcaa tcgaatggag ccactgggcc ccaacactgg 3601 ctttgagatt tagagtcaaa gggtagagtg aacaggaaag ggtcacgtgg tcccatgttg 3661 caacagcccc aacatcacgc atgtcattca ctgccttgcc actccatctc cctccgtgct 3721 ccagccaccc ctgagctgag gctcccattg tctccatcag agcctgcatg tgtatgccgt 3781 cctcccctgg tccggtgttt gtgttcccca cccctcacag actgcctgag ctcttctgta 3841 agctggggta gggtgatggc agtgctccgg gaactgggcc tgcagccttc ctcttctggg 3901 actgctgtga ggcagaggaa tgatggagaa tctagtgtag cagcctccag gcaggattca 3961 gcacaacact ggggagtcac ccttccctcg ggcctctgcc taccaacaac tgggcttatc 4021 actgggaaaa cacaaaaaat tacacaaccc agcaacaaca aaagaactag tcctcttaga 4081 atttcttgcg ctttgatttt tttagggctt gtgccctgtt tcacttatag ggtctagaat 4141 gcttgtgttg agtaaaaagg agattcccaa tattcaaagc tgctaaatgt tctctttgcc 4201 ataaagactc cgtgtaactg tgtgaacact tgggattttt ctcctctgtc ccgaggtcgt 4261 cgtctgcttt cttttttggg tttctttcta gaagattgag aagtgcatat gacaggctga 4321 gagcacctcc ccaaacacac aagctctcag ccacaggcag cttctccaca gccccagctt 4381 cgcacaggct cctggagggc tgcctggggg aggcagacat gggagtgcca aggtggccag 4441 atggttccag gactacaatg tctttatttt taactgtttg ccactgctgc cctcacccct 4501 gcccggctct ggagtaccgt ctgccccaga caagtgggag tgaaatgggg gtggggggaa 4561 gcactgattc ccagttaggg ggtgcctaac tgagcagtag ggatagaagg tgtgaacctg 4621 ggagtgcttt tataaattat tttccttgta gattttattt ttaatttatc tctgtgacct 4681 gccagggaga ggggagagag agagagatgc tgttgagcac atgacaaaat aaaataaaat 4741 ggatgattca aaaaaaaaaa // LOCUS HSHC21 637 bp RNA PRI 20-MAR-1991 DEFINITION Human mRNA for putative cytokine 21 (HC21). ACCESSION X16166 NID g32035 KEYWORDS cytokine; lymphokine. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 637) AUTHORS Chang,H.C. and Reinherz,E.L. TITLE Isolation and characterization of a cDNA encoding a putative cytokine which is induced by stimulation via the CD2 structure on human T lymphocytes JOURNAL Eur. J. Immunol. 19 (6), 1045-1051 (1989) MEDLINE 89325421 FEATURES Location/Qualifiers source 1..637 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T lymphocyte" /clone="AA8,AA10,I1,20A,5B" mRNA 1..637 /note="cytokine 21" /evidence=experimental sig_peptide 53..121 /note="cytokine 21" CDS 53..331 /codon_start=1 /product="cytokine 21" /db_xref="PID:g32036" /db_xref="SWISS-PROT:P13236" /translation="MKLCVTVLSLLMLVAAFCSPALSAPMGSDPPTACCFSYTARKLP RNFVVDYYETSSLCSQPAVVFQTKRSKQVCADPSESWVQEYVYDLELN" mat_peptide 122..328 /product="cytokine 21" polyA_signal 610..615 polyA_signal 623..628 polyA_site 637 BASE COUNT 148 a 171 c 128 g 190 t ORIGIN 1 cttctgagtt ctgcagcctc acctctgaga aaacctcttt tcgaccaata ccatgaagct 61 ctgcgtgact gtcctgtctc tcctcatgct agtagctgcc ttctgctctc cagcgctctc 121 agcaccaatg ggctcagacc ctcccaccgc ctgctgcttt tcttacaccg cgaggaagct 181 tcctcgcaac tttgtggtag attactatga gaccagcagc ctctgctccc agccagctgt 241 ggtattccaa accaaaagaa gcaagcaagt ctgtgctgat cccagtgaat cctgggtcca 301 ggagtacgtg tatgacctgg aactgaactg agctgctcag agacaggaag tcttcaggga 361 aggtcacctg agcccggatg cttctccatg agacacatct cctccatact caggactcct 421 ctccgcagtt cctgtccctt ctcttaattt aatctttttt atgtgccgtg ttattgtatt 481 aggtgtcatt tccattattt atattagttt agccaaagga taagtgtccc ctatggggat 541 ggtccactgt cactgtttct ctgctgttgc aaatacatgg ataacacatt tgattctgtg 601 tgttttcata ataaaacttt aaaataaaat gcagaat // LOCUS HSHCERN 1195 bp RNA PRI 09-MAY-1995 DEFINITION Human hcerN3 gene mRNA for N snRNP associated protein. ACCESSION X15892 NID g32039 KEYWORDS autoantigen; hcerN3 gene; N protein; ribonucleoprotein; snRNP associated protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1195) AUTHORS Lerner,M. TITLE Direct Submission JOURNAL Submitted (17-JUL-1989) Lerner M., Yale University School of Medicine, Section of Molecular Neurobiology, 333 Cedar Street, New Haven CT06510, U S A REFERENCE 2 (bases 1 to 1195) AUTHORS Schmauss,C., McAllister,G., Ohosone,Y., Hardin,J.A. and Lerner,M.R. TITLE A comparison of snRNP-associated Sm-autoantigens: human N, rat N and human B/B' JOURNAL Nucleic Acids Res. 17 (4), 1733-1743 (1989) MEDLINE 89160326 REMARK Erratum:[Nucleic Acids Res 1989 Aug 25;17(16):6777]] FEATURES Location/Qualifiers source 1..1195 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adolescent" /tissue_type="cerebellum" /clone_lib="lambda gt11" /clone="hcVI1" CDS 343..1065 /note="N protein (AA 1-240)" /codon_start=1 /db_xref="PID:g32040" /db_xref="SWISS-PROT:P14648" /translation="MTVGKSSKMLQHIDYRMRCILQDGRIFIGTFKAFDKHMNLILCD CDEFRKIKPKNAKQPEREEKRVLGLVLLRGENLVSMTVEGPPPKDTGIARVPLAGAAG GPGVGRAAGRGVPAGVPIPQAPAGLAGPVRGVGGPSQQVMTPQGRGTVAAAAVAATAS IAGAPTQYPPGRGTPPPPVGRATPPPGIMAPPPGMRPPMGPPIGLPPARGTPIGMPPP GMRPPPPGIRGPPPPGMRPPRP" misc_feature 1176..1180 /note="polyA signal" polyA_site 1195 /note="polyA site" BASE COUNT 284 a 314 c 315 g 282 t ORIGIN 1 tgatatcgag aattcgggcc ggcggccgct ccactctgcc aaccaagagt gtgagttgta 61 cccgaggctt ctcagcagca gcaagtacct gtgttgggat ttccaggctg aactgaggca 121 ggcattctta gctgagacac caagaggtgg ttaaagccat attggagtag cgaggaatct 181 gattccaagc aaaaaccagg ctccatctac tctttgaagc ttctgcccag cttgcattgt 241 ttctaggaga acccgcgtca tacctttatc tatagccttc ccctaggtct tcagaagcat 301 caagttttaa ctgtggacat tggatttggt ggaacagcaa tcatgactgt tggcaagagt 361 agcaagatgc tgcagcacat tgactataga atgagatgta tcctgcaaga tggccgaatc 421 ttcattggca cctttaaggc ttttgacaag catatgaatt tgatcctctg tgattgtgat 481 gagttcagaa agatcaagcc aaagaatgcg aagcaaccag agcgtgaaga aaagcgggtt 541 ttgggtctgg tgttgctgcg tggggagaac ttggtatcca tgactgtgga ggggccaccc 601 cccaaagata ctggcattgc tcgggtacca cttgctggag ctgctggagg ccctggggtt 661 ggtagggcag ctggtagagg agtaccagct ggtgtgccaa ttccccaggc ccctgctgga 721 ttggcaggcc ctgtccgagg agttggggga ccatcccagc aggtaatgac tccacaggga 781 agaggcactg tagcagctgc tgctgttgct gcgaccgcca gtattgctgg agccccaaca 841 cagtacccac caggacgggg cactccgccc ccacccgtcg gcagagcaac cccacctcca 901 ggcattatgg ctcctccacc tggtatgaga ccacccatgg gcccaccaat tgggcttccc 961 cctgctcgag ggacgccaat aggcatgccc cctccgggaa tgagaccccc tccaccaggc 1021 attagaggtc cacctccccc aggaatgcgt ccaccaagac cttagcatac tgttgatcca 1081 tctcagtcac tttttcccct gcaatgcgtc ttgtgaaatt gtgtagagtg tttgtgagct 1141 ttttgttccc tcattctgca ttaataatag ctaataataa atgcatagag caatt // LOCUS HSHCGIXPT 665 bp RNA PRI 21-OCT-1996 DEFINITION H.sapiens mRNA for HCGIX protein. ACCESSION X95289 NID g1628392 KEYWORDS HCG IX gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 665) AUTHORS Pichon,L., Hampe,A., Giffon,T., Carn,G., Legall,J.Y. and David,V. TITLE A new non-HLA multigene family associated with the PERB11 family within the MHC class I region JOURNAL Immunogenetics 44 (4), 259-267 (1996) MEDLINE 96337915 REFERENCE 2 (bases 1 to 665) AUTHORS Pichon,L. TITLE Direct Submission JOURNAL Submitted (15-JAN-1996) L. Pichon, upr 41 cnrs, recombinaisons genetiques, 2 avenue du pr lon Bernard, 35043 Rennes cedex, FRANCE FEATURES Location/Qualifiers source 1..665 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /dev_stage="fetal" /map="p21.3" /tissue_type="liver/spleen" gene 38..271 /gene="HCGIX" CDS 38..271 /gene="HCGIX" /codon_start=1 /db_xref="PID:e225822" /db_xref="PID:g1628393" /translation="MSWGWEQSQEESSAWSRQTPTPQLETQEPHSPHSNKKHQLLVPK KGGPQLQGLRPALELRTRGLQGEDRASGTREPL" BASE COUNT 199 a 166 c 199 g 101 t ORIGIN 1 agaggaaaga accctgggaa cgggaggcga agggataatg agctggggat gggagcagtc 61 gcaggaagaa tcctctgcct ggagccggca gactccaacc cctcagcttg agactcagga 121 gccccatagt ccccacagca ataagaagca ccagctcctg gtcccgaaaa aaggagggcc 181 ccaactccag ggactgcggc ccgccctgga gctgagaaca cgcggactcc agggagagga 241 cagggcttca gggacccgag agccgctctg agcaccgggg gatgtgactg cctcagcggc 301 agagctggaa gggccctcga atgccattca caggaacagc ccaggaaccc agggacttca 361 gaaggtttgt ccgaaaagtg agaggaggcg gaggagagag cctgcgaggt caagctgcag 421 agaacatgag cttctacctc cagatgtgcc agggtgcatc tcaataaact tggattttgg 481 ccaggcgcgg tggctcaggc ctgtaatccc agctctggaa gctgaaggat agcttgagcc 541 caggagttcg aggctgcagt gagctatgat ctcaccacta cactccagcc tgggtgacag 601 caagagatct tgtctcagaa ataaataaat aaaatttaaa aataaaaaaa aaaaaaaaaa 661 aaaaa // LOCUS HSHCR 1239 bp RNA PRI 11-AUG-1992 DEFINITION Human mRNA for protein HC (alpha-1-microglobulin). ACCESSION X04225 NID g32046 KEYWORDS alpha-1-microglobulin; glycoprotein; microglobulin; protein HC; signal peptide. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1231) AUTHORS Traboni,C. and Cortese,R. TITLE Sequence of a full length cDNA coding for human protein HC (alpha 1 microglobulin) JOURNAL Nucleic Acids Res. 14 (15), 6340 (1986) MEDLINE 86312901 REMARK revised by [2] REFERENCE 2 (bases 1 to 1239) AUTHORS Cortese,R. TITLE Direct Submission JOURNAL Submitted (10-AUG-1992) Cortese R., IRBM, Via Pontina km 30.600, Pomezia (Rome), Italy FEATURES Location/Qualifiers source 1..1239 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 73..1131 /codon_start=1 /product="HC polypeptide" /db_xref="PID:g32047" /db_xref="SWISS-PROT:P02760" /translation="MRSLGALLLLLSACLAVSAGPVPTPPDNIQVQENFNISRIYGKW YNLAIGSTCPWLKKIMDRMTVSTLVLGEGATEAEISMTSTRWRKGVCEETSGAYEKTD TDGKFLYHKSKWNITMESYVVHTNYDEYAIFLTKKFSRHHGPTITAKLYGRAPQLRET LLQDFRVVAQGVGIPEDSIFTMADRGECVPGEQEPEPILIPRVRRAVLPQEEEGSGGG QLVTEVTKKEDSCQLGYSAGPCMGMTSRYFYNGTSMACETFQYGGCMGNGNNFVTEKE CLQTCRTVAACNLPIVRGPCRAFIQLWAFDAVKGKCVLFPYGGCQGNGNKFYSEKECR EYCGVPGDGDEELLRFSN" sig_peptide 73..129 /note="HC polypeptide" mat_peptide 130..1128 /product="HC polypeptide" polyA_site 1239 BASE COUNT 290 a 328 c 375 g 246 t ORIGIN 1 gcagggaggc ggtggccctt ctgttgctag accgagcctg tgggatatac caaggcagag 61 gagcccatag ccatgaggag cctcggggcc ctgctcttgc tgctgagcgc ctgcctggcg 121 gtgagcgctg gccctgtgcc aacgccgccc gacaacatcc aagtgcagga aaacttcaat 181 atctctcgga tctatgggaa gtggtacaac ctggccatcg gttccacctg cccctggctg 241 aagaagatca tggacaggat gacagtgagc acgctggtgc tgggagaggg cgctacagag 301 gcggagatca gcatgaccag cactcgttgg cggaaaggtg tctgtgagga gacgtctgga 361 gcttatgaga aaacagatac tgatgggaag tttctctatc acaaatccaa atggaacata 421 accatggagt cctatgtggt ccacaccaac tatgatgagt atgccatttt cctgaccaag 481 aaattcagcc gccatcatgg acccaccatt actgccaagc tctacgggcg ggcgccgcag 541 ctgagggaaa ctctcctgca ggacttcaga gtggttgccc agggtgtggg catccctgag 601 gactccatct tcaccatggc tgaccgaggt gaatgtgtcc ctggggagca ggaaccagag 661 cccatcttaa tcccgagagt ccggagggct gtgctacccc aagaagagga aggatcaggg 721 ggtgggcaac tggtaactga agtcaccaag aaagaagatt cctgccagct gggctactcg 781 gccggtccct gcatgggaat gaccagcagg tatttctata atggtacatc catggcctgt 841 gagactttcc agtacggcgg ctgcatgggc aacggtaaca acttcgtcac agaaaaggag 901 tgtctgcaga cctgccgaac tgtggcggcc tgcaatctcc ccatagtccg gggcccctgc 961 cgagccttca tccagctctg ggcatttgat gctgtcaagg ggaagtgcgt cctcttcccc 1021 tacgggggct gccagggcaa cgggaacaag ttctactcag agaaggagtg cagagagtac 1081 tgcggtgtcc ctggtgatgg tgatgaggag ctgctgcgct tctccaactg acaactggcc 1141 ggtctgcaag tcagaggatg gccagtgtct gtcccggggt cctgtggcag gcagcgcaag 1201 caacctgggt ccaaataaaa actaaattgt aaactcctg // LOCUS HSHE3A 875 bp RNA PRI 04-MAY-1994 DEFINITION H.sapiens mRNA for HE3(alpha). ACCESSION X76383 NID g434359 KEYWORDS epididymis specific; HE3 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 875) AUTHORS Kirchhoff,C. TITLE Direct Submission JOURNAL Submitted (04-NOV-1993) C. Kirchhoff, Institut fuer Hormon -und, Fortpflanzungsforschung, Grandweg 64, 22529 Hamburg 54, FRG REFERENCE 2 (bases 1 to 875) AUTHORS Kirchhoff,C., Pera,I., Rust,W. and Ivell,R. TITLE Major human epididymis-specific gene product, HE3, is the first representative of a novel gene family JOURNAL Mol. Reprod. Dev. 37 (2), 130-137 (1994) MEDLINE 94235297 FEATURES Location/Qualifiers source 1..875 /organism="Homo sapiens" /isolate="patient 3/13/90" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="epididymis" /cell_type="epitelial cell" /clone_lib="lambda uni-Zap" /clone="HE3-22" gene 80..466 /gene="HE3 alpha" CDS 80..466 /gene="HE3 alpha" /codon_start=1 /product="human epididymis-specific gene product, alpha" /db_xref="PID:g434360" /translation="MTSSLKIWGILLALLCILCRLCVYSNNIYWREFIKLHYLSPSRE FKEYKCDVLMREKEALKGKSFHTFIYSLWFKIQRACINEKGSDRYRNAYVWPQVPSNY SSVTGRSTTIGTQRAEASATLNSIVA" polyA_signal 854..861 BASE COUNT 254 a 193 c 171 g 257 t ORIGIN 1 ccacatgctg atccccacta caatcagtga cctgaactca gagtccaagt aggacacgca 61 ggtggacgtg gtgactgaga tgacatcctc tctaaagatt tggggcatac tcttggccct 121 gctttgcatc ctttgcaggc tgtgtgtata cagtaacaac atttactgga gagaattcat 181 aaaacttcat tacttaagtc caagtcgaga atttaaagag tacaaatgtg atgtcctcat 241 gagagaaaaa gaggctctga aaggcaagag ctttcatacg ttcatctata gcttatggtt 301 caaaattcag cgtgcatgca tcaatgagaa ggggagcgac cgatatagaa atgcatatgt 361 atggccccag gtgccctcaa actactcgag tgtcactggg agaagtacaa caataggtac 421 acagagagca gaagcttcag ctacattgaa ttccattgtg gcgtagatgg atatgttgat 481 aacatagaag acctgaggat tatagaacct atcagcaact agaaagtcta tgcacatcct 541 cagatattgg tagagtattc agtgcttcca aagtggtggg ccctgcctcc atcaatagcc 601 cctgccactc cccgcttaca tttatgtgtc agtgttttcc aactacttag agtttatgta 661 cctcgtgatt tcttgatacc aaatctttgt gtggtttctg tatctgtgat acaattttgt 721 cctaatttgc ctaatttaca cccacatttt ttccaagatt cagctcatat ggcatctgtc 781 ctcgactaac ctaagacttt cctgatattg actctcttta tacctaccca agctgaatga 841 ccctcctttt cttaaataaa atatattatt ctaaa // LOCUS HSHE4MR 583 bp RNA PRI 04-DEC-1991 DEFINITION H.sapiens HE4 mRNA for extracellular proteinase inhibitor homologue. ACCESSION X63187 NID g32050 KEYWORDS HE4 gene; proteinase inhibitor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 583) AUTHORS Kirchoff,C. TITLE Direct Submission JOURNAL Submitted (21-NOV-1991) C. Kirchoff, Inst. f. Hormon- u Fortpflanzungsforsch., Grandweg 64, 2000 Hamburg 54, FRG REFERENCE 2 (bases 1 to 583) AUTHORS Kirchhoff,C., Habben,I., Ivell,R. and Krull,N. TITLE A major human epididymis-specific cDNA encodes a protein with sequence homology to extracellular proteinase inhibitors JOURNAL Biol. Reprod. 45 (2), 350-357 (1991) MEDLINE 92153963 FEATURES Location/Qualifiers source 1..583 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="epididymis" /clone_lib="lambda gt11" mRNA 1..583 CDS 28..405 /codon_start=1 /product="HE4 protein" /db_xref="PID:g32051" /translation="MPACRLGPLAAALLLSLLLFGFTLVSGTGAEKTGVCPELQADQN CTQECVSDSECADNLKCCSAGCATFCLLCPNDKEGSCPQVNINFPQLGLCRDQCQVDT QCPGQMKCCRNGCGKVSCVTPNF" polyA_signal 547..552 BASE COUNT 119 a 199 c 145 g 120 t ORIGIN 1 cccctgcacc ccgcccggca tagcaccatg cctgcttgtc gcctaggccc gctagccgcc 61 gccctcctcc tcagcctgct gctgttcggc ttcaccctag tctcaggcac aggagcagag 121 aagactggcg tgtgccccga gctccaggct gaccagaact gcacgcaaga gtgcgtctcg 181 gacagcgaat gcgccgacaa cctcaagtgc tgcagcgcgg gctgtgccac cttctgcctt 241 ctctgcccca atgataagga gggttcctgc ccccaggtga acattaactt tccccagctc 301 ggcctctgtc gggaccagtg ccaggtggac acgcagtgtc ctggccagat gaaatgctgc 361 cgcaatggct gtgggaaggt gtcctgtgtc actcccaatt tctgaggtcc agccaccacc 421 aggctgagca gtgaggagag aaagtttctg cctggccctg catctggttc cagcccacct 481 gccctcccct ttttcgggac tctgtattcc ctcttggggt gaccacagct tctccctttc 541 ccaaccaata aagtaaccac tttcagcaaa aaaaaaaaaa aaa // LOCUS HSHE5M 430 bp RNA PRI 06-JUL-1994 DEFINITION H.sapiens HE5 mRNA for CDw52 antigen. ACCESSION X67699 S51939 NID g32052 KEYWORDS antigen. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 430) AUTHORS Kirchhoff,C. TITLE Direct Submission JOURNAL Submitted (21-AUG-1992) C. Kirchhoff, Institute for Hormone & Fertility Res, Grandweg 64, 2000 Hamburg 54, FRG REFERENCE 2 (bases 1 to 430) AUTHORS Kirchhoff,C., Krull,N., Pera,I. and Ivell,R. TITLE A major mRNA of the human epididymal principal cells, HE5, encodes the leucocyte differentiation CDw52 antigen peptide backbone JOURNAL Mol. Reprod. Dev. 34 (1), 8-15 (1993) MEDLINE 93119668 REFERENCE 3 (bases 1 to 430) AUTHORS Krull,N., Ivell,R., Osterhoff,C. and Kirchhoff,C. TITLE Region-specific variation of gene expression in the human epididymis as revealed by in situ hybridization with tissue-specific cDNAs JOURNAL Mol. Reprod. Dev. 34 (1), 16-24 (1993) MEDLINE 93119659 FEATURES Location/Qualifiers source 1..430 /organism="Homo sapiens" /isolate="patient 3" /note="Allele: 1" /db_xref="taxon:9606" /tissue_type="epididymis" /cell_type="principal cells" /cell_line="human epididymis tissue" /clone_lib="lambda gt11 (c)" /clone="HE5" gene 25..210 /gene="HE5" sig_peptide 25..96 /gene="HE5" /product="CDw52 antigen" CDS 25..210 /gene="HE5" /codon_start=1 /product="CDw52 antigen" /db_xref="PID:g32053" /db_xref="SWISS-PROT:P31358" /translation="MKRFLFLLLTISLLVMVQIQTGLSGQNDTSQTSSPSASSSMSGG IFLFFVANAIIHLFCFS" mat_peptide 97..207 /gene="HE5" /product="CDw52 antigen" misc_signal 133..207 /gene="HE5" /note="GPI-anchorage signal" variation 143 /gene="HE5" /note="polymorphism" variation 147 /gene="HE5" /note="polymorphism" polyA_signal 408..413 BASE COUNT 112 a 127 c 102 g 89 t ORIGIN 1 gacagccacg aagatcctac caaaatgaag cgcttcctct tcctcctact caccatcagc 61 ctcctggtta tggtacagat acaaactgga ctctcaggac aaaacgacac cagccaaacc 121 agcagcccct cagcatccag cagcatgagc ggaggcattt tccttttctt cgtggccaat 181 gccataatcc acctcttctg cttcagttga ggtgacacgt ctcagcctta gccctgtgcc 241 ccctgaaaca gctgccacca tcactcgcaa gagaatcccc tccatctttg ggaggggttg 301 atgccagaca tcaccaggtt gtagaagttg acaggcagtg ccatgggggc aacagccaaa 361 ataggggggt aatgatgtag gggccaagca gtgcccagct gggggagaat aaagttaccc 421 ttgtactgca // LOCUS HSHE6 4665 bp RNA PRI 21-MAY-1997 DEFINITION H.sapiens mRNA for HE6 Tm7 receptor. ACCESSION X81892 NID g2117160 KEYWORDS HE6 gene; HE6 receptor; seven transmembrane-domain receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4665) AUTHORS Osterhoff,C., Ivell,R. and Kirchhoff,C. TITLE Cloning of a human epididymis-specific mRNA, HE6, encoding a novel member of the seven transmembrane-domain receptor superfamily JOURNAL DNA Cell Biol. 16 (4), 379-389 (1997) MEDLINE 97294669 REFERENCE 2 (bases 1 to 4665) AUTHORS Osterhoff,C. TITLE Direct Submission JOURNAL Submitted (13-AUG-1996) C. Osterhoff, Institute for Hormone & Fertility Res, Grandweg 64, D- 22529 Hamburg, FRG FEATURES Location/Qualifiers source 1..4665 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="epididymis" /clone_lib="lambda Uni-ZAP" gene 73..3117 /gene="HE6" CDS 73..3117 /gene="HE6" /note="human epididymis gene 6" /codon_start=1 /product="seven transmembrane-domain receptor" /db_xref="PID:e274706" /db_xref="PID:g2117161" /translation="MVFSVRQCGHVGRTEEVLLTFKIFLVIICLHVVLVTSLEEDTDN SSLSPPPAKLSVVSFAPSSNEVETTSLNDVTLSLLPSNETEKTKITIVKTFNASGVKP QRNICNLSSICNDSAFFRGEIMFQYDKESTVPQNQHITNGTLTGVLSLSELKRSELNK TLQTLSETYFIMCATAEAQSTLNCTFTIKLNNTMNACAAIAALERVKIRPMEHCCCSV RIPCPSSPEELGKLQCDLQDPIVCLADHPRGPPFSSSQSIPVVPRATVLSQVPKATSF AEPPDYSPVTHNVPSPIGEIQPLSPQPSAPIASSPAIDMPPQSETISSPMPQTHVSGT PPPVKASFSSPTVSAPANVNTTSAPPVQTDIVNTSSISDLENQVLQMEKALSLGSLEP NLAGEMINQVSRLLHSPPDMLAPLAQRLLKVVDDIGLQLNFSNTTISLTSPSLALAVI RVNASSFNTTTFVAQDPANLQVSLETQAPENSIGTITLPSSLMNNLPAHDMELASRVQ FNFFETPALFQDPSLENLSLISYVISSSVANLTVRNLTRNVTVTLKHINPSQDELTVR CVFWDLGRNGGRGGWSDNGCSVKDRRLNETICTCSHLTSFGVLLDLSRTSVLPAQMMA LTFITYIGCGLSSIFLSVTLVTYIAFEKIRRDYPSKILIQLCAALLLLNLVFLLDSWI ALYKMQGLCISVAVFLHYFLLVSFTWMGLEAFHMYLALVKVFNTYIRKYILKFCIVGW GVPAVVVTIILTISPDNYGLGSYGKFPNGSPDDFCWINNNAVFYITVVGYFCVIFLLN VSMFIVVLVQLCRIKKKKQLGAQRKTSIQDLRSIAGLTFLLGITWGFAFFAWGPVNVT FMYLFAIFNTLQGFFIFIFYCVAKENVRKQWRRYLCCGKLRLAENSDWSKTATNGLKK QTVNQGVSSSSNSLQSSSNSTNSTTLLVNNDCSVHASGNGNASTERNGVSFSVQNGDV CLHDFTGKQHMFNEKEDSCNGKGRMALRRTSKRGSLHFIEQM" polyA_signal 4647..4652 BASE COUNT 1295 a 1070 c 939 g 1361 t ORIGIN 1 agccagcccg aggacgcgag cggcaggtgt gcacagaggt tctccacttt gttttctgaa 61 ctcgcggtca ggatggtttt ctctgtcagg cagtgtggcc atgttggcag aactgaagaa 121 gttttactga cgttcaagat attccttgtc atcatttgtc ttcatgtcgt tctggtaaca 181 tccctggaag aagatactga taattccagt ttgtcaccac cacctgctaa attatctgtt 241 gtcagttttg ccccctcctc caatgaggtt gaaacaacaa gcctcaatga tgttacttta 301 agcttactcc cttcaaacga aacagaaaaa actaaaatca ctatagtaaa aaccttcaat 361 gcttcaggcg tcaaacccca gagaaatatc tgcaatttgt catctatttg caatgactca 421 gcatttttta gaggtgagat catgtttcaa tatgataaag aaagcactgt tccccagaat 481 caacatataa cgaatggcac cttaactgga gtcctgtctc taagtgaatt aaaacgctca 541 gagctcaaca aaaccctgca aaccctaagt gagacttact ttataatgtg tgctacagca 601 gaggcccaaa gcacattaaa ttgtacattc acaataaaac tgaataatac aatgaatgca 661 tgtgctgcaa tagccgcttt ggaaagagta aagattcgac caatggaaca ctgctgctgt 721 tctgtcagga taccctgccc ttcctcccca gaagagttgg gaaagcttca gtgtgacctg 781 caggatccca ttgtctgtct tgctgaccat ccacgtggcc caccattttc ttccagccaa 841 tccatcccag tggtgcctcg ggccactgtg ctttcccagg tccccaaagc tacctctttt 901 gctgagcctc cagattattc acctgtgacc cacaatgttc cctctccaat aggggagatt 961 caaccccttt caccccagcc ttcagctccc atagcttcca gccctgccat tgacatgccc 1021 ccacagtctg aaacgatctc ttcccctatg ccccaaaccc atgtctccgg caccccacct 1081 cctgtgaaag cctcattttc ctctcccacc gtgtctgccc ctgcgaatgt caacactacc 1141 agcgcacctc ctgtccagac agacatcgtc aacaccagca gtatttctga tcttgagaac 1201 caagtgttgc agatggagaa ggctctgtcc ttgggcagcc tggagcctaa cctcgcagga 1261 gaaatgatca accaagtcag cagactcctt cattccccgc ctgacatgct ggcccctctg 1321 gctcaaagat tgctgaaagt agtggatgac attggcctac agctgaactt ttcaaacacg 1381 actataagtc taacctcccc ttctttggct ctggctgtga tcagagtgaa tgccagtagt 1441 ttcaacacaa ctacctttgt ggcccaagac cctgcaaatc ttcaggtttc tctggaaacc 1501 caagctcctg agaacagtat tggcacaatt actcttcctt catcgctgat gaataattta 1561 ccagctcatg acatggagct agcttccagg gttcagttca atttttttga aacacctgct 1621 ttgtttcagg atccttccct ggagaacctc tctctgatca gctacgtcat atcatcgagt 1681 gttgcaaacc tgaccgtcag gaacttgaca agaaacgtga cagtcacatt aaagcacatc 1741 aacccgagcc aggatgagtt aacagtgaga tgtgtatttt gggacttggg cagaaatggt 1801 ggcagaggag gctggtcaga caatggctgc tctgtcaaag acaggagatt gaatgaaacc 1861 atctgtacct gtagccatct aacaagcttc ggcgttctgc tggacctatc taggacatct 1921 gtgctgcctg ctcaaatgat ggctctgacg ttcattacat atattggttg tgggctttca 1981 tcaatttttc tgtcagtgac tcttgtaacc tacatagctt ttgaaaagat ccggagggat 2041 tacccttcca aaatcctcat ccagctgtgt gctgctctgc ttctgctgaa cctggtcttc 2101 ctcctggact cgtggattgc tctgtataag atgcaaggcc tctgcatctc agtggctgta 2161 tttcttcatt attttctctt ggtctcattc acatggatgg gcctagaagc attccatatg 2221 tacctggccc ttgtcaaagt atttaatact tacatccgaa aatacatcct taaattctgc 2281 attgtcggtt ggggggtacc agctgtggtt gtgaccatca tcctgactat atccccagat 2341 aactatgggc ttggatccta tgggaaattc cccaatggtt caccggatga cttctgctgg 2401 atcaacaaca atgcagtatt ctacattacg gtggtgggat atttctgtgt gatatttttg 2461 ctgaacgtca gcatgttcat tgtggtcctg gttcagctct gtcgaattaa aaagaagaag 2521 caactgggag cccagcgaaa aaccagtatt caagacctca ggagtatcgc tggccttaca 2581 tttttactgg gaataacttg gggctttgcc ttctttgcct ggggaccagt taacgtgacc 2641 ttcatgtatc tgtttgccat ctttaatacc ttacaaggat ttttcatatt catcttttac 2701 tgtgtggcca aagaaaatgt caggaagcaa tggaggcggt atctttgttg tggaaagtta 2761 cggctggctg aaaattctga ctggagtaaa actgctacta atggtttaaa gaagcagact 2821 gtaaaccaag gagtgtccag ctcttcaaat tccttacagt caagcagtaa ctccactaac 2881 tccaccacac tgctagtgaa taatgattgc tcagtacacg caagcgggaa tggaaatgct 2941 tctacagaga ggaatggggt ctcttttagt gttcagaatg gagatgtgtg ccttcacgat 3001 ttcactggaa aacagcacat gtttaacgag aaggaagatt cctgcaatgg gaaaggccgt 3061 atggctctca gaaggacttc aaagcgggga agcttacact ttattgagca aatgtgattc 3121 ctttcttcta aaatcaaagc atgatgcttg acagtgtgaa atgtccaatt ttacctttta 3181 cacaatgtga gatgtatgaa aatcaactca ttttattctc ggcaacatct ggagaagcat 3241 aagctaatta agggcgatga ttattattac aagaagaaac caagacatta caccatggtt 3301 tttagacatt tctgatttgg tttcttatct ttcattttat aagaaggttg gttttaaaca 3361 atacactaag aatgactcct ataaagaaaa caaaaaaagg tagtgaactt tcagctacct 3421 tttaaagagg ctaagttatc tttgataaca tcatataaag caactgttga cttcagcctg 3481 ttggtgagtt tagttgtgca tgcctttgtt gtatataagc taaattctag tgacccatgt 3541 gtcaaaaatc ttacttctac atttttttgt atttattttc tactgtgtaa atgtattcct 3601 ttgtagaatc atggttgttt tgtctcacgt gataattcag aaaatccttg ctcgttccgc 3661 aaatcctaaa gctccttttg gagatgatat aggatgtgaa atacagaaac ctcagtgaaa 3721 tcaagaaata atgatcccag ccagactgag aaaatgtaag cagacagtgc cacagttagc 3781 tcatacagtg cctttgagca agttaggaaa agatgccccc actgggcaga cacagcccta 3841 tgggtcatgg tttgacaaac agagtgagag accatatttt agccccactc accctcttgg 3901 gtgcacgacc tgtacagcca aacacagcat ccaatatgaa tacccatccc ctgaccgcat 3961 ccccagtagt cagattatag aatctgcacc aagatgttta gctttatacc ttggccacag 4021 agagggatga actgtcatcc agaccatgtg tcaggaaaat tgtgaacgta gatgaggtac 4081 atacactgcc gcttctcaaa tccccagagc ctttaggaac aggagagtag actaggattc 4141 cttctcttaa aaaggtacat atatatggaa aaaaatcata ttgccgttct ttaaaaggca 4201 actgcatggt acattgttga ttgttatgac tggtacactc tggcccagcc agagctataa 4261 ttgtttttta aatgtgtctt gaagaatgca cagtgacaag gggagtagct attgggaaca 4321 gggaactgtc ctacactgct attgttgcta catgtatcga gccttgattg ctcctagtta 4381 tatacagggt ctatcttgct tcctacctac atctgcttga gcagtgcctc aagtacatcc 4441 ttattaggaa catttcaaac cccttttagt taagtctttc actaaggttc tcttgcatat 4501 atttcaagtg aatgttggat ctcagactaa ccatagtaat aatacacatt tctgtgagtg 4561 ctgacttgtc tttgcaatat ttcttttctg atttatttaa ttttcttgta tttatatgtt 4621 aaaatcaaaa atgttaaaat caatgaaata aatttgcagt taaga // LOCUS HSHEAM 1968 bp RNA PRI 27-FEB-1994 DEFINITION Human HS1 gene for heamatopoietic lineage cell specific protein. ACCESSION X16663 NID g32054 KEYWORDS haematopoietic lineage cell specific protein; HS1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1968) AUTHORS Watanabe,T. TITLE Direct Submission JOURNAL Submitted (28-SEP-1989) Watanabe T., Medical Institute of Bioregulation, Kyushu University, Maidashi 3-1-1, Higashi-ku, Fukuoka 812, Japan REFERENCE 2 (bases 1 to 1968) AUTHORS Kitamura,D., Kaneko,H., Miyagoe,Y., Ariyasu,T. and Watanabe,T. TITLE Isolation and characterization of a novel human gene expressed specifically in the cells of hematopoietic lineage JOURNAL Nucleic Acids Res. 17 (22), 9367-9379 (1989) MEDLINE 90067934 COMMENT *source: H6-3C4; library=lambda gt11; Data kindly reviewed (31-JAN-1990) by Watanabe T. FEATURES Location/Qualifiers source 1..1968 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lymphocyte" /cell_line="human-mouse hybridoma" CDS 43..1503 /note="haematopoietic lineage cell protein (AA 1-486)" /codon_start=1 /db_xref="PID:g32055" /db_xref="SWISS-PROT:P14317" /translation="MWKSVVGHDVSVSVETQGDDWDTDPDFVNDISEKEQRWGAKTIE GSGRTEHINIHQLRNKVSEEHDVLRKKEMESGPKASHGYGGRFGVERDRMDKSAVGHE YVAEVEKHSSQTDAAKGFGGKYGVERDRADKSAVGFDYKGEVEKHTSQKDYSRGFGGR YGVEKDKWDKAALGYDYKGETEKHESQRDYAKGFGGQYGIQKDRVDKSAVGFNEMEAP TTAYKKTTPIEAASSGARGLKAKFESMAEEKRKREEEEKAQQVARRQQERKAVTKRSP EAPQPVIAMEEPAVPAPLPKKISSEAWPPVGTPPSSESEPVRTSREHPVPLLPIRQTL PEDNEEPPALPPRTLEGLQVEEEPVYEAEPEPEPEPEPEPENDYEDVEEMDRHEQEDE PEGDYEEVLEPEDSSFSSALAGSSGCPAGAGAGAVALGISAVALYDYQGEGSDELSFD PDDVITDIEMVDEGWWRGRCHGHFGLFPANYVKLLE" BASE COUNT 499 a 459 c 590 g 420 t ORIGIN 1 aattccgccg ggcgcttaga acagaggctt gcacaggtgg agatgtggaa gtctgtagtg 61 ggccatgatg tgtctgtttc cgtggagacc cagggtgatg attgggacac agatcctgac 121 tttgtgaatg acatctctga aaaggagcaa cgatggggag ccaagaccat cgaggggtct 181 ggacgcacag aacacatcaa catccaccag ctgaggaaca aagtatcaga ggagcatgat 241 gttctcagga agaaagagat ggagtcaggg cccaaagcat cccatggcta tggaggtcgg 301 tttggagtag aaagagaccg aatggacaag agtgcagtgg gccatgagta tgttgccgag 361 gtggagaagc actcttctca gacggatgct gccaaaggct ttgggggcaa gtacggagtt 421 gagagggaca gggcagacaa gtcagcagtc ggctttgatt ataaaggaga agtggagaag 481 catacatctc agaaagatta ctctcgtggc tttggtggcc ggtacggggt ggagaaggat 541 aaatgggaca aagcagctct gggatatgac tacaagggag agacggagaa acacgagtcc 601 cagagagatt atgccaaggg ctttggtggc cagtatggaa tccagaagga ccgagtggat 661 aagagcgctg tcggcttcaa tgaaatggag gccccgacca cagcttataa gaagacgacg 721 cccatagaag ccgcttctag tggtgcccgt gggctgaagg cgaaatttga gtccatggct 781 gaggagaaga ggaagcgaga ggaagaggag aaggcacagc aggtggccag gaggcaacag 841 gagcgaaagg ctgtgacaaa gaggagccct gaggctccac agccagtgat agctatggaa 901 gagccagcag taccggcccc actgcccaag aaaatctcct cagaggcctg gcctccagtt 961 gggactcctc catcatcaga gtctgagcct gtgagaacca gcagggaaca cccagtgccc 1021 ttgctgccca ttaggcagac tctcccggag gacaatgagg agcccccagc tctgccccct 1081 aggactctgg aaggcctcca ggtggaggaa gagccagtgt acgaagcaga gcctgagcct 1141 gagcccgagc ctgagcccga gcctgagaat gactatgagg acgttgagga gatggacagg 1201 catgagcagg aggatgaacc agagggggac tatgaggagg tgctcgagcc tgaagattct 1261 tctttttctt ctgctctggc tggatcatca ggctgcccgg ctggggctgg ggctggggct 1321 gtggctctgg ggatctcagc tgtggctcta tatgattacc aaggagaggg aagtgatgag 1381 ctttcctttg atccggacga cgtaatcact gacattgaga tggtggacga gggctggtgg 1441 cggggacgtt gccatggcca ctttggactc ttccctgcaa attatgtcaa gcttctggag 1501 tgactagagc tcactgtcta ctgcaactgt gatttcccat gtccaaagtg gctctgctcc 1561 accccctccc tattcctgat gcaaatgtct aaccagatga gtttctggac agacttccct 1621 ctcctgcttc attaagggct tggggcagag acagcatggg gaaggaggtc cccttcccca 1681 agagtcctct ctatcctgga tgagctcatg aacatttctc ttgtgttcct gactccttcc 1741 caatgaacac ctctctgcca ccccaagctc tgctctcctc ctctgtgagc tctgggcttc 1801 ccagtttgtt tacccgggaa agtacgtcta gattgtgtgg tttgcctcat tgtgctattt 1861 gcccactttc cttccctgaa gaaatatctg tgaaccttct ttctgttcag tcctaaaatt 1921 cgaaataaag tgagactatg gttcacctgt aaaaaaaaaa aaggaatt // LOCUS HSHELAGT 1197 bp RNA PRI 12-JAN-1991 DEFINITION Human mRNA for UDP_galactose:N-acetylglucosaminide-(beta 1->4) galactosyltransferase. ACCESSION X55415 NID g32057 KEYWORDS glycosyl transferase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1197) AUTHORS Watzele,G. TITLE Direct Submission JOURNAL Submitted (01-NOV-1990) Watzele G., Institute of Physiology, University of Zuerich, Winterthurerstr. 190, CH-8057 Zuerich, Switzerland REFERENCE 2 (bases 1 to 1197) AUTHORS Watzele,G. and Berger,E.G. TITLE Near identity of HeLa cell galactosyltransferase with the human placental enzyme JOURNAL Nucleic Acids Res. 18 (23), 7174 (1990) MEDLINE 91088335 COMMENT See (HSGSTE) for the placental cDNA sequence. FEATURES Location/Qualifiers source 1..1197 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="cervix" /cell_type="adenocarcinoma" /cell_line="HeLa." CDS 1..1197 /note="unnamed protein product" /codon_start=1 /db_xref="PID:g32058" /db_xref="SWISS-PROT:P15291" /translation="MRLREPLLSGSAAMPGASLQRACRLLVAVCALHLGVTLVYYLAG RDLSRLPQLVGVSTPLQGGSNSAAAIGQSSGELRTGGARPPPPLGASSQPRPGGDSSP VVDSGPGPASNLTSVPVPHTTALSLPACPEESPLLVGPMLIEFNMPVDLELVAKQNPN VKMGGRYAPRDCVSPHKVAIIIPFRNRQEHLKYWLYYLHPVLQRQQLDYGIYVINQAG DTIFNRAKLLNVGFQEALKDYDYTCFVFSDVDLIPMNDHNAYRCFSQPRHISVAMDKF GFSLPYVQYFGGVSALSKQQFLTINGFPNNYWGWGGEDDDIFNRLVFRGMSISRPNAV VGRCRMIRHSRDKKNEPNPQRFDRIAHTKETMLSDGLNSLTYQVLDVQRYPLYTQITV DIGTPS" mat_peptide 1..1194 /EC_number="2.4.1.38" /EC_number="2.4.1.22" /note="44kD form; lactose synthase" /product="beta-N-acetylglucosaminyl-glycopeptide beta-1,4-galactosyltransferase" mat_peptide 40..1194 /EC_number="2.4.1.22" /EC_number="2.4.1.38" /note="42kD form; beta-N-acetylglucosaminyl-glycopeptide beta-1,4-galactosyltransferase" /product="lactose synthase" CDS 40..1197 /note="unnamed protein product" /codon_start=1 /db_xref="PID:g32059" /translation="MPGASLQRACRLLVAVCALHLGVTLVYYLAGRDLSRLPQLVGVS TPLQGGSNSAAAIGQSSGELRTGGARPPPPLGASSQPRPGGDSSPVVDSGPGPASNLT SVPVPHTTALSLPACPEESPLLVGPMLIEFNMPVDLELVAKQNPNVKMGGRYAPRDCV SPHKVAIIIPFRNRQEHLKYWLYYLHPVLQRQQLDYGIYVINQAGDTIFNRAKLLNVG FQEALKDYDYTCFVFSDVDLIPMNDHNAYRCFSQPRHISVAMDKFGFSLPYVQYFGGV SALSKQQFLTINGFPNNYWGWGGEDDDIFNRLVFRGMSISRPNAVVGRCRMIRHSRDK KNEPNPQRFDRIAHTKETMLSDGLNSLTYQVLDVQRYPLYTQITVDIGTPS" BASE COUNT 254 a 360 c 313 g 270 t ORIGIN 1 atgaggcttc gggagccgct cctgagcggc agcgccgcga tgccaggcgc gtccctacag 61 cgggcctgcc gcctgctcgt ggccgtctgc gctctgcacc ttggcgtcac cctcgtttac 121 tacctggctg gccgcgacct gagccgcctg ccccaactgg tcggagtctc cacaccgctg 181 cagggcggct cgaacagtgc cgccgccatc gggcagtcct ccggggagct ccggaccgga 241 ggggcccggc cgccgcctcc tctaggcgcc tcctcccagc cgcgcccggg tggcgactcc 301 agcccagtcg tggattctgg ccctggcccc gctagcaact tgacctcggt cccagtgccc 361 cacaccaccg cactgtcgct gcccgcctgc cctgaggagt ccccgctgct tgtgggcccc 421 atgctgattg agtttaacat gcctgtggac ctggagctcg tggcaaagca gaacccaaat 481 gtgaagatgg gcggccgcta tgcccccagg gactgcgtct ctcctcacaa ggtggccatc 541 atcattccat tccgcaaccg gcaggagcac ctcaagtact ggctatatta tttgcaccca 601 gtcctgcagc gccagcagct ggactatggc atctatgtta tcaaccaggc gggagacact 661 atattcaatc gtgctaagct cctcaatgtt ggctttcaag aagccttgaa ggactatgac 721 tacacctgct ttgtgtttag tgacgtggac ctcattccaa tgaatgacca taatgcgtac 781 aggtgttttt cacagccacg gcacatttcc gttgcaatgg ataagtttgg attcagccta 841 ccttatgttc agtattttgg aggtgtctct gctctaagta aacaacagtt tctaaccatc 901 aatggatttc ctaataatta ttggggctgg ggaggagaag atgatgacat ttttaacaga 961 ttagttttta gaggcatgtc tatatctcgc ccaaatgctg tggtcgggag gtgtcgcatg 1021 atccgccact caagagacaa gaaaaatgaa cccaatcctc agaggtttga ccgaattgca 1081 cacacaaagg agacaatgct ctctgatggt ttgaactcac tcacctacca ggtgctggat 1141 gtacagagat acccattgta tacccaaatc acagtggaca tcgggacacc gagctag // LOCUS HSHEPLF 1192 bp RNA PRI 22-SEP-1993 DEFINITION H.sapiens mRNA for hepatic leukemia factor. ACCESSION X68985 NID g402775 KEYWORDS hepatic leukemia factor; hepatic protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1192) AUTHORS Hunger,S.P., Ohyashiki,K., Toyama,K. and Cleary,M.L. TITLE Hlf, a novel hepatic bZIP protein, shows altered DNA-binding properties following fusion to E2A in t(17;19) acute lymphoblastic leukemia JOURNAL Genes Dev. 6 (9), 1608-1620 (1992) MEDLINE 92387537 FEATURES Location/Qualifiers source 1..1192 /organism="Homo sapiens" /isolate="patient with t(17;19) (q21-22;p13)" /db_xref="taxon:9606" /clone_lib="HAL-01" CDS 99..986 /codon_start=1 /product="hepatic leukemia factor" /db_xref="PID:g402776" /translation="MEKMSRPLPLNPTFIPPPYGVLRSLLENPLKLPLHHEDAFSKDK DKEKKLDDESNSPTVPQSAFLGPTLWDKTLPYDGDTFQLEYMDLEEFLSENGIPPSPS QHDHSPHPPGLQPASSAAPSVMDLSSRASAPLHPGIPSPNCMQSPIRPGQLLPANRNT PSPIDPDTIQVPVGYEPDPADLALSSIPGQEMFDPRKRKFSEEELKPQPMIKKARKVF IPDDLKDDKYWARRRKNNMAAKRSRDARRLKENQIAIRASFLEKENSALRQEVADLRK ELGKCKNILAKYEARHGPL" BASE COUNT 288 a 344 c 300 g 260 t ORIGIN 1 ctctaggacg gaggaaaagc tcagcaacat tttagggggc ggttgtttct ttctttctta 61 tttctttttt taaggggaaa aaatttgagt gcatcgcgat ggagaaaatg tcccgaccgc 121 tccccctgaa tcccaccttt atcccgcctc cctacggcgt gctcaggtcc ctgctggaga 181 acccgctgaa gctccccctt caccacgaag acgcatttag taaagataaa gacaaggaaa 241 agaagctgga tgatgagagt aacagcccga cggtccccca gtcggcattc ctggggccta 301 ccttatggga caaaaccctt ccctatgacg gagatacttt ccagttggaa tacatggacc 361 tggaggagtt tttgtcagaa aatggcattc cccccagccc atctcagcat gaccacagcc 421 ctcaccctcc tgggctgcag ccagcttcct cggctgcccc ctcggtcatg gacctcagca 481 gccgggcctc tgcacccctt caccctggca tcccatctcc gaactgtatg cagagcccca 541 tcagaccagg tcagctgttg ccagcaaacc gcaatacacc aagtcccatt gatcctgaca 601 ccatccaggt cccagtgggt tatgagccag acccagcaga tcttgccctt tccagcatcc 661 ctggccagga aatgtttgac cctcgcaaac gcaagttctc tgaggaagaa ctgaagccac 721 agcccatgat caagaaagct cgcaaagtct tcatccctga tgacctgaag gatgacaagt 781 actgggcaag gcgcagaaag aacaacatgg cagccaagcg ctcccgcgac gcccggaggc 841 tgaaagagaa ccagatcgcc atccgggcct cgttcctgga gaaggagaac tcggccctcc 901 gccaggaggt ggctgacttg aggaaggagc tgggcaaatg caagaacata cttgccaagt 961 atgaggccag gcacgggccc ctgtaggatg gcatttttgc aggctggctt tggaatagat 1021 ggacagtttg tttcctgtct gatagcacca cacgcaaacc aacctttctg acatcagcac 1081 tttaccagag gcataaacac aactgactcc cattttggtg tgcatctgtg tgtgtgtgcg 1141 tgtatatgtg cttgtgctca tgtgtgtggt cagcggtatg tgcgtgtgcg tg // LOCUS HSHEPSH 2363 bp RNA PRI 22-SEP-1995 DEFINITION Human hepatoma mRNA for serine protease hepsin. ACCESSION X07732 M18930 NID g32063 KEYWORDS hepsin; membrane protein; serine protease; zymogen. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2363) AUTHORS Leytus,S.P., Loeb,K.R., Hagen,F.S., Kurachi,K. and Davie,E.W. TITLE A novel trypsin-like serine protease (hepsin) with a putative transmembrane domain expressed by human liver and hepatoma cells JOURNAL Biochemistry 27 (3), 1067-1074 (1988) MEDLINE 88209431 COMMENT see x07002 for liver hepsin partial cDNA sequence the authors combined the sequence from several overlapping Hep G2 cDNA clones; additional sequences were found in clones HepG2UW17 and HepG2UW2. FEATURES Location/Qualifiers source 1..2363 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /cell_line="Hep G2" /clone_lib="lambda gt11" /clone="HepG2UW63, HepG2UW61, HepG2UW17, HepG2UW2" mRNA 1..191 /note="HepG2UW7 mRNA" misc_feature 83 /note="5' end of HepG2UW19" misc_feature 89 /note="5' end of HepG2UW61" misc_feature 98 /note="5' end of HepG2UW20" misc_feature 116 /note="5' end of HepG2UW17" misc_feature 132 /note="5' end of HepG2UW63 and HepG2UW2" variation 192..771 /note="insert in HepG2UW17 (compared to HepG2UW7; put. unspliced intron)" variation 627..771 /note="insert in HepG2UW2 (compared to HepG2UW7; put. unspliced intron, part.)" mRNA 772..2363 /note="HepG2UW7 mRNA" CDS 826..2079 /codon_start=1 /product="hepsin" /db_xref="PID:g32064" /db_xref="SWISS-PROT:P05981" /translation="MAQKEGGRTVPCCSRPKVAALTAGTLLLLTAIGAASWAIVAVLL RSDQEPLYPVQVSSADARLMVFDKTEGTWRLLCSSRSNARVAGLSCEEMGFLRALTHS ELDVRTAGANGTSGFFCVDEGRLPHTQRLLEVISVCDCPRGRFLAAICQDCGRRKLPV DRIVGGRDTSLGRWPWQVSLRYDGAHLCGGSLLSGDWVLTAAHCFPERNRVLSRWRVF AGAVAQASPHGLQLGVQAVVYHGGYLPFRDPNSEENSNDIALVHLSSPLPLTEYIQPV CLPAAGQALVDGKICTVTGWGNTQYYGQQAGVLQEARVPIISNDVCNGADFYGNQIKP KMFCAGYPEGGIDACQGDSGGPFVCEDSISRTPRWRLCGIVSWGTGCALAQKPGVYTK VSDFREWIFQAIKTHSEASGMVTQL" sig_peptide 826..1311 mat_peptide 1312..2076 polyA_site 2363 BASE COUNT 403 a 808 c 696 g 456 t ORIGIN 1 tcgagcccgc tttccaggga ccctacctga gggcccacag gtgaggcagc ctggcctagc 61 aggccccacg ccaccgcctc tgcctccagg ccgcccgctg ctgcggggcc accatgctcc 121 tgcccaggcc tggagactga cccgaccccg gcactacctc gaggctccgc ccccacctgc 181 tggaccccag ggtaaggaca agggccccca gactcacagt tccagccctg aggacagggg 241 ttccctcatc cccccaccca gcctaatgcc cacctcctaa tagaggggtt cctggggacc 301 tgaagagggg gcactatgac gtctccccaa gcacctaggt gttctgtcct gctcttcctt 361 cagactcagc cgttggaccc cagtcctttc ctccccagac ccaggagttc cagccctcag 421 gcccctcctc cctcatacta gggagtcctg gcccccaaat tcctcctttc ccaagactta 481 tgatttcagg tcctcagctg tctcctccct caaaccggga tcctcagtcc cctgctccac 541 caggctcagg catgggggtc cccatccctg caaatccagg cgtccccccg ctgctggtca 601 gacactgacc ccatccttga acccagccca atctgcgtcc gtgatcacgg cgtgctctgg 661 ccaaggccca gtccctacag cctgcctgga tggacgcctg ggactggggg cgccaggact 721 gggctgggct gggctccccc aggccctgcc tccccgtcca tctcctcaca ggtcccaccc 781 tggcccagga ggtcagccag ggaatcatta acaagaggca gtgacatggc gcagaaggag 841 ggtggccgga ctgtgccatg ctgctccaga cccaaggtgg cagctctcac tgcggggacc 901 ctgctacttc tgacagccat cggggcggca tcctgggcca ttgtggctgt tctcctcagg 961 agtgaccagg agccgctgta cccagtgcag gtcagctctg cggacgctcg gctcatggtc 1021 tttgacaaga cggaagggac gtggcggctg ctgtgctcct cgcgctccaa cgccagggta 1081 gccggactca gctgcgagga gatgggcttc ctcagggcac tgacccactc cgagctggac 1141 gtgcgaacgg cgggcgccaa tggcacgtcg ggcttcttct gtgtggacga ggggaggctg 1201 ccccacaccc agaggctgct ggaggtcatc tccgtgtgtg attgccccag aggccgtttc 1261 ttggccgcca tctgccaaga ctgtggccgc aggaagctgc ccgtggaccg catcgtggga 1321 ggccgggaca ccagcttggg ccggtggccg tggcaagtca gccttcgcta tgatggagca 1381 cacctctgtg ggggatccct gctctccggg gactgggtgc tgacagccgc ccactgcttc 1441 ccggagcgga accgggtcct gtcccgatgg cgagtgtttg ccggtgccgt ggcccaggcc 1501 tctccccacg gtctgcagct gggggtgcag gctgtggtct accacggggg ctatcttccc 1561 tttcgggacc ccaacagcga ggagaacagc aacgatattg ccctggtcca cctctccagt 1621 cccctgcccc tcacagaata catccagcct gtgtgcctcc cagctgccgg ccaggccctg 1681 gtggatggca agatctgtac cgtgacgggc tggggcaaca cgcagtacta tggccaacag 1741 gccggggtac tccaggaggc tcgagtcccc ataatcagca atgatgtctg caatggcgct 1801 gacttctatg gaaaccagat caagcccaag atgttctgtg ctggctaccc cgagggtggc 1861 attgatgcct gccagggcga cagcggtggt ccctttgtgt gtgaggacag catctctcgg 1921 acgccacgtt ggcggctgtg tggcattgtg agttggggca ctggctgtgc cctggcccag 1981 aagccaggcg tctacaccaa agtcagtgac ttccgggagt ggatcttcca ggccataaag 2041 actcactccg aagccagcgg catggtgacc cagctctgac cggtggcttc tcgctgcgca 2101 gcctccaggg cccgaggtga tcccggtggt gggatccacg ctgggccgag gatgggacgt 2161 ttttcttctt gggcccggtc cacaggtcca aggacaccct ccctccaggg tcctctcttc 2221 cacagtggcg ggcccactca gccccgagac cacccaacct caccctcctg acccccatgt 2281 aaatattgtt ctgctgtctg ggactcctgt ctaggtgccc ctgatgatgg gatgctcttt 2341 aaataataaa gatggttttg att // LOCUS HSHEPTP 2565 bp RNA PRI 25-OCT-1991 DEFINITION H.sapiens HePTP mRNA for tyrosine phosphatase. ACCESSION X53364 NID g32066 KEYWORDS tyrosine phosphatase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2565) AUTHORS Jirik,F.R. TITLE Direct Submission JOURNAL Submitted (05-JUN-1990) Jirik F.R., Biomedical Research Centre, 2222 Health Sciences Mall, University of British Columbia Campus, Vancouver, B.C. V6T 1W5, Canada REFERENCE 2 (bases 1 to 2565) AUTHORS Jirik,F.R., Janzen,N.M., Melhado,I.G. and Harder,K.W. TITLE Cloning and chromosomal assignment of a widely expressed human receptor-like protein-tyrosine phosphatase JOURNAL FEBS Lett. 273 (1-2), 239-242 (1990) MEDLINE 91032191 FEATURES Location/Qualifiers source 1..2565 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HePTP" /clone_lib="cDNA; HepG2" misc_feature 179..235 /note="leader sequence" misc_feature 179..598 /note="extracellular domain" CDS 179..2560 /codon_start=1 /product="tyrosine phosphatase precursor" /db_xref="PID:g32067" /translation="MDSWFILVLLGSGLICVSANNATTVAPSVGITRLINSSTAEPVK EEAKTSNPTSSLTSLSVAPTFSPNITLGPTYLTTVNSSDSDNGTTRTASTNSIGITIS PNGTWLPDNQFTDARTEPWPGNSSTAATTPETFPPSDETPIIAVMVALSSLLVIVFII IVLYMLRFKKYKQAGSHSNSFRLSNGRTEDVEPQSVPLLARSPSTNRKYPPLPVDKLE EEINRRMADDNKLFREEFNALPACPIQATCEAASKEENKEKNRYVNILPYDHSRVHLT PVEGVPDSDYINASFINGYQEKNKFIAAQGPKEETVNDFWRMIWEQNTATIVMVTNLK ERKECKCAQYWPDQGCWTYGNIRVSVEDVTVLVDYTVRKFCIQQVGDMTNRKPQRLIT QFHFTSWPDFGVPFTPIGMLKFLKKVKACNPQYAGAIVVHCSAGVGRTGTFVVIDAML DMMHTERKVDVYGFVSRIRAQRCQMVQTDMQYVFIYQALLEHYLYGDTELEVTSLETH LQKIYNKIPGTSNNGLEEEFKKLTSIKIQNDKMRTGNLPANMKKNRVLQIIPYEFNRV IIPVKRGEENTDYVNASFIDGYRQKDSYIASQGPLLHTIEDFWRMIWEWKSCSIVMLT ELEERGQEKCAQYWPSDGLVSYGDITVELKKEEECESYTVRDLLVTNTRENKSRQIRQ FHFHGWPEVGIPSDGKGMISIIAAVQKQQQQSGNHPITVHCSAGAGRTGTFCALSTVL ERVKAEGILDVFQTVKSLRLQRPHMVQTLEQYEFCYKVVQEYIDAFSDYANFK" mat_peptide 236..2557 /product="tyrosine phosphatase" misc_feature 599..676 /note="transmembrane domain" misc_feature 944..1681 /note="catalitic domain 1" misc_feature 1823..2551 /note="catalitic domain 2" BASE COUNT 729 a 637 c 626 g 573 t ORIGIN 1 aattccgtgg taatggatga tgcagttcaa ataactaagg acacatgttc aaagagcata 61 attaactttt taaaagaagc tagacttctt cagaagcttg ccagtttttc aagctgattt 121 ctctcactgg caactcttca gagtgctgtt cctactccac cctcccctgg tgataagcat 181 ggattcctgg ttcattcttg ttctgctcgg cagtggtctg atatgtgtca gtgccaacaa 241 tgctaccaca gttgcacctt ctgtaggaat cacaagatta attaactcat caacggcaga 301 accagttaaa gaagaggcca aaacttcaaa tccaacttct tcactaactt ctctttctgt 361 ggcaccaaca ttcagcccaa atataactct gggacccacc tatttaacca ctgtcaattc 421 ttcagactct gacaatggga ccacaagaac agcaagcacc aattctatag gcattacaat 481 ttcaccaaat ggaacgtggc ttccagataa ccagttcacg gatgccagaa cagaaccctg 541 gccggggaat tccagcaccg cagcaaccac tccagaaact ttccctcctt cagatgagac 601 accaattatt gcggtgatgg tggccctgtc ctctctgcta gtgatcgtgt ttattatcat 661 agttttgtac atgttaaggt ttaagaaata caagcaagct gggagccatt ccaattcttt 721 ccgcttatcc aacggccgca ctgaggatgt ggagccccag agtgtgccac ttctggccag 781 atccccaagc accaacagga aatacccacc cctgcccgtg gacaagctgg aagaggaaat 841 taaccggaga atggcagacg acaataagct cttcagggag gaattcaacg ctctccctgc 901 atgtcctatc caggccacct gtgaggctgc ttccaaggag gaaaacaagg aaaaaaatcg 961 atatgtaaac atcttgcctt atgaccactc tagagtccac ctgacaccgg ttgaaggggt 1021 tccagattct gattacatca atgcttcatt catcaacggt taccaagaaa agaacaaatt 1081 cattgctgca caaggaccaa aagaagaaac ggtgaatgat ttctggcgga tgatctggga 1141 acaaaacaca gccaccatcg tcatggttac caacctgaag gagagaaagg agtgcaagtg 1201 cgcccagtac tggccagacc aaggctgctg gacctatggg aatattcggg tgtctgtaga 1261 ggatgtgact gtcctggtgg actacacagt acggaagttc tgcatccagc aggtgggcga 1321 catgaccaac agaaagccac agcgcctcat cactcagttc cactttacca gctggccaga 1381 ctttggggtg ccttttaccc cgatcggcat gctcaagttc ctcaagaagg tgaaggcctg 1441 taaccctcag tatgcagggg ccatcgtggt ccactgcagt gcaggtgtag ggcgtacagg 1501 tacctttgtc gtcattgatg ccatgctgga catgatgcat acagaacgga aggtggacgt 1561 gtatggcttt gtgagccgga tccgggcaca gcgctgccag atggtgcaaa ccgatatgca 1621 gtatgtcttc atataccaag cccttctgga gcattatctc tatggagata cagaactgga 1681 agtgacctct ctagaaaccc acctgcagaa aatttacaac aaaatcccag ggaccagcaa 1741 caatggatta gaggaggagt ttaagaagtt aacatcaatc aaaatccaga atgacaagat 1801 gcggactgga aaccttccag ccaacatgaa gaagaaccgt gttttacaga tcattccata 1861 tgaattcaac agagtgatca ttccagttaa gcggggcgaa gagaatacag actatgtgaa 1921 cgcatccttt attgatggct accggcagaa ggactcctat atcgccagcc agggccctct 1981 tctccacaca attgaggact tctggcgaat gatctgggag tggaaatcct gctctatcgt 2041 gatgctaaca gaactggagg agagaggcca ggagaagtgt gcccagtact ggccatctga 2101 tggactggtg tcctatggag atattacagt ggaactgaag aaggaggagg aatgtgagag 2161 ctacaccgtc cgagacctcc tggtcaccaa caccagggag aataagagcc ggcagatccg 2221 gcagttccac ttccatggct ggcctgaagt gggcatcccc agtgacggaa agggcatgat 2281 cagcatcatc gccgccgtgc agaagcagca gcagcagtca gggaaccacc ccatcaccgt 2341 gcactgcagc gccggggcag gaaggacggg gaccttctgt gccctgagca ccgtcctgga 2401 gcgtgtgaaa gcagagggga ttttggatgt cttccagact gtcaagagcc tgcggctaca 2461 gaggccacac atggtccaga cactggaaca gtatgagttc tgctacaagg tggtgcagga 2521 gtatattgat gcattctcag attatgccaa cttcaagtaa gcggc // LOCUS HSHEVIN 2645 bp RNA PRI 31-MAR-1995 DEFINITION H.sapiens mRNA for high endothelial venule. ACCESSION X82157 NID g758065 KEYWORDS hevin gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2645) AUTHORS Girard,J.P. and Springer,T.A. TITLE Cloning from purified high endothelial venule cells of hevin, a close relative of the antiadhesive extracellular matrix protein SPARC JOURNAL Immunity 2 (1), 113-123 (1995) MEDLINE 95323677 REFERENCE 2 (bases 1 to 2645) AUTHORS Girard,J. TITLE Direct Submission JOURNAL Submitted (10-OCT-1994) J. Girard, Center for Blood Research and Dept. of Pathology, Harvard Medical School, 200 Longwoode Avenue, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..2645 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="tonsil" /cell_type="high endothelial cells" /cell_line="purified high endothelial cells" /clone="HEC25" sig_peptide 198..260 /gene="hevin" CDS 198..2192 /gene="hevin" /codon_start=1 /db_xref="PID:g758066" /translation="MKTGLFFLCLLGTAAAIPTNARLLSDHSKPTAETVAPDNTAIPS LRAEDEENEKETAVSTEDDSHHKAEKSSVLKSKEESHEQSAEQGKSSSQELGLKDQED SDGDLSVNLEYAPSEGTLDIKEDMSEPQEKKLSENTDFLAPGVSSFTDSNQQESITKR EENQEQPRNYSHHQLNRSSKHSQGLRDQGNQEQDPNISNGEEEEEKEPGEVGTHNDNQ ERKTELPREHANSKQEEDNTQSDDILEESDQPTQVSKMQEDEFDQGNQEQEDNSNAEM EEENASNVNKHIQETEWQSQEGKTGLEAISNHKETEEKTVSEALLMEPTDDGNTTPRN HGVDDDGDDDGDDGGTDGPRHSASDDYFIPSQAFLEAERAQSIAYHLKIEEQREKVHE NENIGTTEPGEHQEAKKAENSSNEEETSSEGNMRVHAVDSCMSFQCKRGHICKADQQG KPHCVCQDPVTCPPTKPLDQVCGTDNQTYASSCHLFATKCRLEGTKKGHQLQLDYFGA CKSIPTCTDFEVIQFPLRMRDWLKNILMQLYEANSEHAGYLNEKQRNKVKKIYLDEKR LLAGDHPIDLLLRDFKKNYHMYVYPVHWQFSELDQHPMDRVLTHSELAPLRASLVPME HCITRFFEECDPNKDKHITLKEWGHCFGIKEEDIDENLLF" gene 198..2192 /gene="hevin" polyA_signal 2608..2613 BASE COUNT 923 a 516 c 580 g 626 t ORIGIN 1 gagcagcaga atttcaactc cagtagactt gaatatgcct ctgggcaaag aagcagagct 61 aacgaggaaa gggatttaaa gagtttttct tgggtgtttg tcaaactttt attccctgtc 121 tgtgtgcaga ggggattcaa cttcaatttt tctgcagtgg ctctgagtcc agccccttac 181 ttaaagatct ggaaagcatg aagactgggc tttttttcct atgtctcttg ggaactgcag 241 ctgcaatccc gacaaatgca agattattat ctgatcattc caaaccaact gctgaaacgg 301 tagcacccga caacactgca atccccagtt taagggctga agatgaagaa aatgaaaaag 361 aaacagcagt atccacagaa gacgattccc accataaggc tgaaaaatca tcagtactaa 421 agtcaaaaga ggaaagccat gaacagtcag cagaacaggg caagagttct agccaagagc 481 tgggattgaa ggatcaagag gacagtgatg gtgacttaag tgtgaatttg gagtatgcac 541 catctgaagg tacattggac ataaaagaag atatgagtga gcctcaggag aaaaaactct 601 cagagaacac tgattttttg gctcctggtg ttagttcctt cacagattct aaccaacaag 661 aaagtatcac aaagagagag gaaaaccaag aacaacctag aaattattca catcatcagt 721 tgaacaggag cagtaaacat agccaaggcc taagggatca aggaaaccaa gagcaggatc 781 caaatatttc caatggagaa gaggaagaag aaaaagagcc aggtgaagtt ggtacccaca 841 atgataacca agaaagaaag acagaattgc ccagggagca tgctaacagc aagcaggagg 901 aagacaatac ccaatctgat gatattttgg aagagtctga tcaaccaact caagtaagca 961 agatgcagga ggatgaattt gatcagggta accaagaaca agaagataac tccaatgcag 1021 aaatggaaga ggaaaatgca tcgaacgtca ataagcacat tcaagaaact gaatggcaga 1081 gtcaagaggg taaaactggc ctagaagcta tcagcaacca caaagagaca gaagaaaaga 1141 ctgtttctga ggctctgctc atggaaccta ctgatgatgg taataccacg cccagaaatc 1201 atggagttga tgatgatggc gatgatgatg gcgatgatgg cggcactgat ggccccaggc 1261 acagtgcaag tgatgactac ttcatcccaa gccaggcctt tctggaggcc gagagagctc 1321 aatccattgc ctatcacctc aaaattgagg agcaaagaga aaaagtacat gaaaatgaaa 1381 atataggtac cactgagcct ggagagcacc aagaggccaa gaaagcagag aactcatcaa 1441 atgaggagga aacgtcaagt gaaggcaaca tgagggtgca tgctgtggat tcttgcatga 1501 gcttccagtg taaaagaggc cacatctgta aggcagacca acagggaaaa cctcactgtg 1561 tctgccagga tccagtgact tgtcctccaa caaaacccct tgatcaagtt tgtggcactg 1621 acaatcagac ctatgctagt tcctgtcatc tattcgctac taaatgcaga ctggagggga 1681 ccaaaaaggg gcatcaactc cagctggatt attttggagc ctgcaaatct attcctactt 1741 gtacggactt tgaagtgatt cagtttcctc tacggatgag agactggctc aagaatatcc 1801 tcatgcagct ttatgaagcc aactctgaac acgctggtta tctaaatgag aagcagagaa 1861 ataaagtcaa gaaaatttac ctggatgaaa agaggctttt ggctggggac catcccattg 1921 accttctctt aagggacttt aagaaaaact accacatgta tgtgtatcct gtgcactggc 1981 agtttagtga acttgaccaa caccctatgg atagagtctt gacacattct gaacttgctc 2041 ctctgcgagc atctctggtg cccatggaac actgcataac ccgtttcttt gaggagtgtg 2101 accccaacaa ggataagcac atcaccctga aggagtgggg ccactgcttt ggaattaaag 2161 aagaggacat agatgaaaat ctcttgtttt gaacgaagat tttaaagaac tcaactttcc 2221 agcatcctcc tctgttctaa ccacttcaga aatatatgca gctgtgatac ttgtagattt 2281 atatttagca aaatgttagc atgtatgaca agacaatgag agtaattgct tgacaacaac 2341 ctatgcacca ggtatttaac attaactttg gaaacaaaaa tgtacaatta agtaaagtca 2401 acatatgcaa aatactgtac attgtgaaca gaagtttaat tcatagtaat ttcactctct 2461 gcattgactt atgagataat taatgattaa actattaatg ataaaaataa tgcatttgta 2521 ttgttcataa tatcatgtgc acttcaagaa aatggaatgc tactcttttg tggtttacgt 2581 gtattatttt caatatctta ataccctaat aaagagtcca taaaaatcca aaaaaaaaaa 2641 aaaaa // LOCUS HSHFATPRO 14756 bp RNA PRI 17-JAN-1996 DEFINITION H.sapiens mRNA for hFat protein. ACCESSION X87241 NID g1107686 KEYWORDS cadherin repeat; EGF repeat; FAT; laminin A-G domain; tumour suppressor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 14756) AUTHORS Dunne,J., Hanby,A.M., Poulsom,R., Jones,T.A., Sheer,D., Chin,W.G., Da,S.M., Zhao,Q., Beverley,P.C. and Owen,M.J. TITLE Molecular cloning and tissue expression of FAT, the human homologue of the Drosophila fat gene that is located on chromosome 4q34-q35 and encodes a putative adhesion molecule JOURNAL Genomics 30 (2), 207-223 (1995) MEDLINE 96163873 REFERENCE 2 (bases 1 to 14756) AUTHORS Dunne,J. TITLE Direct Submission JOURNAL Submitted (16-MAY-1995) J. Dunne, Imperial Cancer Research Fund, Lymphocyte Molecular Biology, PO Box 123, Lincoln's Inn Fields, London, WC2A 3PX, United Kingdom FEATURES Location/Qualifiers source 1..14756 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="J6 and CEM" /chromosome="4" /map="q34-35" sig_peptide 187..249 /gene="hFat" CDS 187..13959 /gene="hFat" /codon_start=1 /product="homologue of Drosophila Fat protein" /db_xref="PID:g1107687" /translation="MGRHLALLLLLLLLFQHFGDSDGSQRLEQTPLQFTHLEYNVTVQ ENSAAKTYVGHPVKMGVYITHPAWEVRYKIVSGDSENLFKAEEYILGDFCFLRIRTKG GNTAILNREVKDHYTLIVKALEKNTNVEARTKVRVQVLDTNDLRPLFSPTSYSVSLPE NTAIRTSIARVSATDADIGTNGEFYYSFKDRTDMFAIHPTSGVIVLTGRLDYLETKLY EMEILAADRGMKLYGSSGISSMAKLTVHIEQANECAPVITAVTLSPSELDRDPAYAIV TVDDCDQGANGDIASLSIVAGDLLQQFRTVRSFPGSKEYKVKAIGDIDWDSHPFGYNL TLQAKDKGTPPQFSSVKVIHVTSPQFKAGPVKFEKDVYRAEISEFAPPNTPVVMVKAI PAYSHLRYVFKRTPGKAKFSLNYNTGLISILEPVKRQQAAHFELEVTTSDRKASTKVL VKVLGANSNPPEFTQTAYKAAFDENVPIGTTIMSLSAVDPDEGENGYVTYSIANLNHV PFAIDHFTGAVSTSENLDYELMPRVYTLRIRASDWGLPYRREVEVLATITLNNLNDNT PLFEKINCEGTIPRDLGVGEQITTVSAIDADELQLVQYQIEAGNELDLFSLNPNSGVL SLKRSLMDGLGAKVSFHSLRITATDGENFATPLYINITVAASHKLVNLQCEETGVAKM LAEKLLQANKLHNQGEVEDIFFDSHSVNAHIPQFRSTLPTGIQVKENQPVGSSVIFMN STDLDTGFNGKLVYAVSGGNEDSCFMIDMETGMLKILSPLDRETTDKYTLNITVYDLG IPQKAAWRLLHVVVVDANDNPPEFLQESYFVEVSEDKEVHSEIIQVEATDKDLGPNGH VTYSILTDTDTFSIDSVTGVVNIARPLDRELQHEHSLKIEARDQAREEPQLFSTVVVK VSLEDVNDNPPTFIPPNYRVKVREDLPEGTVIMWLEAHDPDLGQSGQVRYSLLDHGEG NFDVDKLSGAVRIVQQLDFEKKQVYNLTVRAKDKGKPVSLSSTCYVEVEVVDVNENLH PPVFSSFVEKGTVKEDAPVGSLVMTVSAHDEDAGRDGEIRYSIRDGSGVGVFKIGEET GVIETSDRLDRESTSHYWLTVFATDQGVVPLSSFIEIYIEVEDVNDNAPQTSEPVYYP EIMENSPKDVSVVQIEAFDPDSSSNDKLMYKITSGNPQGFFSIHPKTGLITTTSRKLD REQQDEHILEVTVTDNGSPPKSTIARVIVKILDENDNKPQFLQKFYKIRLPEREKPDR ERNARREPLYRVIATDKDEGPNAEISYSIEDGNEHGKFFIEPKTGVVSSKRFSAAGEY DILSIKAVDNGRPQKSSTTRLHIEWISKPKQSLEPISFEESFFTFTVMESDPVAHMIG VISVEPPGIPLWFDITGGNYDSHFDVDKGTGTIIVAKPLDAEQKSNYNLTVEATDGTT TILTQVFIKVIDTNDHRPQFSTSKYEVVIPEDTAPETEILQISAVDQDEKNKLIYTLQ SSRDPLSLKKFRLDPATGSLYTSEKLDHEAVSPAHLTVMVRDQDVPVKRNFARIVVNV SDTNDHAPWFTASSYKGRVYESAAVGSVVLQVTALDKDKGKNAEVLYSIESGNIGNIG NSFMIDPVLGSIKTAKELDRSNQAEYDLMVKATDKGSPPMSEITSVRIFVTIADNASP KFTSKEYSVELSETVSIGSFVGMVTAHSQSSVVYEIKDGNTGDAFDINPHSGTIITQK ALDFETLPIYTLIIQGTNMAGLSTNTTVLVHLQDENDNAPVFMQAEYTGLISESASIN SVVLTDRNVPLVIRAADADKDSNALLVYHIVEPSVHTYFAIDSSTGAIHTVLSLDYEE TSIFHFTVQVHDMGTPRLFAEYAANVTVHVIDINDCPPVFAKPLYEASLLLPTYKGVK VITVNATDADSSAFSQLIYSITEGNIGEKFSMDYKTGALTVQNTTQLRSRYELTVRAS DGRFAGLTSVKINVKESKESHLKFTQDVYSAVVKENSTEAETLAVITAIGSPINEPLF YHILNPDRRFKISRTSGVLSTTGTPFDREQQEAFDVVVEVIEEHKPSAVAHVVVKVIV EDQNDNAPVFVNLPYYAVVKVDTEVGHVIRYVTAVDRDSGRNGEVHYYLKEHHEHFQI GPLGEISLKKQFELDTLNKEYLVTVVAKDGGNPAFSAEVIVPITVMNKAMPVFEKPFY SAEIAESIQVHSPVVHVQANSPEGLKVFYSITDGDPFSQFTINFNTGVINVIAPLDFE AHPAYKLSIRATDSLTGAHAEVFVDIIVDDINDNPPVFAQQSYAVTLSEASVIGTSVV QVRATDSDSEPNRGISYQMFGNHSKSHDHFHVDSSTGLISLLRTLDYEQSRQHTIFVR AVDGGMPTLSSDVIVTVDVTDLNGNPPLFEQQIYEARISEHAPHGHFVTCVKAYDADS SDIDKLQYSILSGNDHKHFVIDSATGIITLSNLHRHALKPFYSLNLSVSDGVFRSSTQ VHVTVIGGNLHSPAFLQNEYEVELAENAPLHTLVMEVKTTDGDSGIYGHVTYHIVNDF AKDRFYINERGQIFTLEKLDRETPAEKVISVRLMAKDAGGKVAFCTVNVILTDDNDNA PQFRATKYEVNIGSSAAKGTSVVKSASDADEGSNADITYAIEADSESVKENLEINKLS GVITTKESLIGLENEFFTFFVRAVDNGSPSKESVVLVYVKILPPEMQLPKFSEPFYTF TVSEDVPVGTEIDLIRAEHSGTVLYSLVKGNTPESNRDESFVIDRQSGRLKLEKSLDH ETTKWYQFSILARCTQDDHEMVASVDVSIQVKDANDNSPVFESSPYEAFIVENLPGGS RVIQIRASDADSGTNGQVMYSLDQSQSVEVIESFAINMETGWITTLKELDHEKRDNYQ IKVVASDHGEKIQLSSTAIVDVTVTDVNDSPPRFTAEIYKGTVSEDDPQGGVIAILST TDADSEEINRQVTYFITGGDPLGQFAVETIQNEWKVYVKKPLDREKRDNYLLTITATD GTFSSKAIVEVKVLDANDNSPVCEKTLYSDTIPEDVLPGKLIMQISATDADIRSNAEI TYTLLGSGAEKFKLNPDTGELKTSTPLDREEQAVYHLLVRATDGGGRFCQASIVVTLE DVNDNAPEFSADPYAITVFENTEPGTLLTRVQATDADAGLNRKILYSLIDSADGQFSI NELSGIIQLEKPLDRELQAVYTLSLKAVDQGLPRRLTATGTVIVSVLDINDNPPVFEY REYGATVSEDILVGTEVLQVYAASRDIEANAEITYSIISGNEHGKFSIDSKTGAVFII ENLDYESSHEYYLTVEATDGGTPSLSDVATVNVNVTDINDNTPVFSQDTYTTVISEDA VLEQSVITVMADDADGPSNSHIHYSIIDGNQGSSFTIDPVRGEVKVTKLLDRETISGY TLTVQASDNGSPPRVNTTTVNIDVSDVNDNAPVFSRGNYSVIIQENKPVGFSVLQLVV TDEDSSHNGPPFFFTIVTGNDEKAFEVNPQGVLLTSSAIKRKEKDHYLLQVKVADNGK PQLSSLTYIDIRVIEESIYPPAILPLEIFITSSGEEYSGGVIGKIHATDQDVYDTLTY SLDPQMDNLFSVSSTGGKLIAHKKLDIGQYLLNVSVTDGKFTTVADITVHIRQVTQEM LNHTIAIRFANLTPEEFVGDYWRNFQRALRNILGVRRNDIQIVSLQSSEPHPHLDVLL FVEKPGSAQISTKQLLHKINSSVTDIEEIIGVRILNVFQKLCAGLDCPWKFCDEKVSV DESVMSTHSTARLSFVTPRHHRAAVCLCKEGRCPPVHHGCEDDPCPEGSECVSDPWEE KHTCVCPSGRFGQCPGSSSMTLTGNSYVKYRLTENENKLEMKLTMRLRTYSTHAVVMY ARGTDYSILEIHHGRLQYKFDCGSGPGIVSVQSIQVNDGQWHAVALEVNGNYARLVLD QVHTASGTAPGTLKTLNLDNYVFFGGHIRQQGTRHGRSPQVGNGFRGCMDSIYLNGQE LPLNSKPRSYAHIEESVDVSPGCFLTATEDCASNPCQNGGVCNPSPAGGYYCKCSALY IGTHCEISVNPCSSNPCLYGGTCVVDNGGFVCQCRGLYTGQRCQLSPYCKDEPCKNGG TCFDSLDGAVCQCDSGFRGERCQSDIDECSGNPCLHGALCENTHGSYHCNCSHEYRGR HCEDAAPNQYVSTPWNIGLAEGIGIVVFVAGIFLLVVVFVLCRKMISRKKKHQAEPKD KHLGPATAFLQRPYFDSKLNKNIYSDIPPQVPVRPISYTPSIPSDSRNNLDRNSFEGS AIPEHPEFSTFNPESVHGHRKAVAVCSVAPNLPPPPPSNSPSDSDSIQKPSWDFDYDT KVVDLDPCLSKKPLEEKPSQPYSARESLSEVQSLSSFQSESCDDNGYHWDTSDWMPSV PLPDIQEFPNYEVIDEQTPLYSADPNAIDTDYYPGGYDIESDFPPPPEDFPAADELPP LPPEFSNQFESIHPPRDMPAAGSLGSSSRNRQRFNLNQYLPNFYPLDMSEPQTKGTGE NSTCREPHAPYPPGYQRHFEAPAVESMPMSVYASTASCSDVSACCEVESEVMMSDYES GDDGHFEEVTIPPLDSQQHTEV" gene 187..13959 /gene="hFat" polyA_signal 14738..14743 BASE COUNT 4181 a 3331 c 3492 g 3752 t ORIGIN 1 ctgggcggcc gggcgcgggg agagggcgcg ggagcggctc gtgcggcagg taccatgcgg 61 acgcgcgagc ccggcgaggc cccggcaggc ccgtccctgc tcgggggcgc gctgagacgg 121 cgggtgagct ccacgagagc gccgtcgcca cttcgggcca actttgcgat tcccgacagt 181 taagcaatgg ggagacattt ggctttgctc ctgcttctgc tccttctctt ccaacatttt 241 ggagacagtg atggcagcca acgacttgaa cagactcctc tgcagtttac acacctcgag 301 tacaacgtca ccgtgcagga gaactctgca gctaagactt atgtggggca tcctgtcaag 361 atgggtgttt acattacaca tccagcgtgg gaagtaaggt acaaaattgt ttccggagac 421 agtgaaaacc tgttcaaagc tgaagagtac attctcggag acttttgctt tctaagaata 481 aggaccaaag gaggaaatac agctattctt aatagagaag tgaaggatca ctacacattg 541 atagtgaaag cacttgaaaa aaatactaat gtggaggcgc gaacaaaggt cagggtgcag 601 gtgctggata caaatgactt gagaccgtta ttctcaccca cctcatacag cgtttcttta 661 cctgaaaaca cagctataag gaccagtatc gcaagagtca gcgccacgga tgcagacata 721 ggaaccaacg gggaatttta ctacagtttt aaagatcgaa cagatatgtt tgctattcac 781 ccaaccagtg gtgtgatagt gttaactggt agacttgatt acctagagac caagctctat 841 gagatggaaa tcctcgctgc ggaccgtggc atgaagttgt atgggagcag tggcatcagc 901 agcatggcca agctaacggt gcacatcgaa caggccaatg aatgtgctcc ggtgataaca 961 gcagtgacat tgtcaccatc agaactggac agggacccag catatgcaat tgtgacagtg 1021 gatgactgcg atcagggtgc caatggtgac atagcatctt taagcatcgt ggcaggtgac 1081 cttctccagc agtttagaac agtgaggtcc tttccaggga gtaaggagta taaagtcaaa 1141 gccatcggtg acattgattg ggacagtcat cctttcggct acaatctcac actacaggct 1201 aaagataaag gaactccgcc ccagttctct tctgttaaag tcattcacgt gacttctcca 1261 cagttcaaag ccgggccagt caagtttgaa aaggatgttt acagagcaga aataagtgaa 1321 tttgctcctc ccaacacacc tgtggtcatg gtaaaggcca ttcctgctta ttcccatttg 1381 aggtatgttt ttaaaaggac acctggaaaa gctaaattca gtttaaatta caacactggt 1441 ctcatttcta ttttagaacc agttaaaaga cagcaggcag cccattttga acttgaagta 1501 acaacaagtg acagaaaagc gtccaccaag gtcttggtga aagtcttagg tgcaaatagc 1561 aatccccctg aatttaccca gacagcgtac aaagctgctt ttgatgagaa cgtgcccatt 1621 ggtactacta tcatgagcct gagtgccgta gaccctgatg agggtgagaa tgggtacgtg 1681 acatacagta tcgcaaattt aaatcatgtg ccgtttgcga ttgaccattt cactggtgcc 1741 gtgagtacgt cagaaaacct ggactacgaa ctgatgcctc gggtttatac tctgaggatt 1801 cgtgcatcag actggggctt gccgtaccgc cgggaagtcg aagtccttgc tacaattact 1861 ctcaataact tgaatgacaa cacacctttg tttgagaaaa taaattgtga agggacaatt 1921 cccagagatc taggcgtggg agagcaaata accactgttt ctgctattga tgcagatgaa 1981 cttcagttgg tacagtatca gattgaagct ggaaatgaac tggatttgtt tagtttaaac 2041 cccaactcgg gggtattgtc attaaagcga tcgctaatgg atggcttagg tgcaaaggtg 2101 tctttccaca gtctgagaat cacagctaca gatggagaaa attttgccac accattatat 2161 atcaacataa cagtggctgc cagtcacaag ctggtaaact tgcagtgtga agagactggt 2221 gttgccaaaa tgctggcaga gaagctcctg caggcaaata aattacacaa ccagggagag 2281 gtggaggata ttttcttcga ttctcactct gtcaatgctc acataccgca gtttagaagc 2341 actcttccga ctggtattca ggtaaaggaa aaccagcctg tgggttccag tgtaattttc 2401 atgaactcca ctgaccttga cactggcttc aatggaaaac tggtctatgc tgtttctgga 2461 ggaaatgagg atagttgctt catgattgat atggaaacag gaatgctgaa aattttatct 2521 cctcttgacc gtgaaacaac agacaaatac accctgaata ttaccgtcta tgaccttggg 2581 ataccccaga aggctgcgtg gcgtcttcta catgtcgtgg ttgtcgatgc caatgataat 2641 ccacccgagt ttttacagga gagctatttt gtggaagtga gtgaagacaa ggaggtacat 2701 agtgaaatca tccaggttga agccacagat aaagacctgg ggcccaacgg acacgtgacg 2761 tactcaattc ttacagacac agacacattt tcaattgaca gcgtgacggg tgttgttaac 2821 atcgcacgcc ctctggatcg agagctgcag catgagcact ccttaaagat tgaggccagg 2881 gaccaagcca gagaagagcc tcagctgttc tccactgtcg ttgtgaaagt atcactagaa 2941 gatgttaatg acaacccacc tacatttatt ccacctaatt atcgtgtgaa agtccgagag 3001 gatcttccag aaggaaccgt catcatgtgg ttagaagccc acgatcctga tttaggtcag 3061 tctggtcagg tgagatacag ccttctggac cacggagaag gaaacttcga tgtggataaa 3121 ctcagtggag cagttaggat cgtccagcag ttggactttg agaagaagca agtgtataat 3181 ctcactgtga gggccaaaga caagggaaag ccagtttctc tgtcttctac ttgctatgtt 3241 gaagttgagg tggttgatgt gaatgagaac ctgcacccac ccgtgttttc cagctttgtg 3301 gaaaagggga cagtgaaaga agatgcacct gttggttcat tggtaatgac ggtgtcggct 3361 catgatgagg acgccggaag agatggggag atccgatact ccattagaga tggctctggc 3421 gttggtgttt tcaaaatagg tgaagagaca ggtgtcatag agacgtcaga tcgactggac 3481 cgtgaatcga cctcccatta ttggctaaca gtctttgcaa ccgatcaggg tgtcgtgcct 3541 ctttcatcgt tcatagagat ctacatagag gttgaggatg tcaatgacaa tgcaccacag 3601 acatcagagc ctgtttatta cccagaaatc atggaaaatt ctcctaaaga tgtatctgtg 3661 gtccagatcg aggcatttga tccagattcg agctctaatg acaagctcat gtacaaaatt 3721 acaagtggaa atccacaagg attcttttca atacatccta aaacaggtct catcacaact 3781 acgtcaagga agctagaccg agaacagcaa gatgaacaca tattagaggt tactgtgaca 3841 gacaatggta gtccccccaa atcaaccatt gcaagagtca ttgtgaaaat ccttgatgaa 3901 aatgacaaca aacctcagtt tctgcaaaag ttctacaaaa tcagactccc tgagcgggaa 3961 aagccagacc gagaaagaaa tgccagacgg gagccgctct atcgcgtcat agccaccgac 4021 aaggatgagg gccccaatgc agaaatctcc tacagcatcg aagacgggaa tgagcatggc 4081 aaatttttca tcgaaccgaa aactggagtg gtttcgtcca agaggttttc agcagctgga 4141 gaatatgata ttctttcaat taaggcagtt gacaatggtc gccctcaaaa gtcatcaacc 4201 accagactcc atattgaatg gatctccaag cccaaacagt ccctggagcc catttcattt 4261 gaagaatcat tttttacctt tactgtgatg gaaagtgacc ccgttgctca catgattgga 4321 gtaatatctg tggagcctcc tggcataccc ctttggtttg acatcactgg tggcaactac 4381 gacagtcact tcgatgtgga caagggaact ggaaccatca ttgttgccaa acctcttgat 4441 gcagaacaga agtcaaacta caacctcaca gtcgaggcta cagatggaac caccactatc 4501 ctcactcagg tattcatcaa agtaatagac acaaatgacc atcgtcctca gttttctaca 4561 tcaaagtatg aagttgttat tcctgaagat acagcgccag aaacagaaat tttgcaaatc 4621 agtgctgtgg atcaggatga gaaaaacaaa ctaatctaca ctctgcagag cagtagagat 4681 ccactgagtc tcaagaaatt tcgtcttgat cctgcaaccg gctctctcta tacttctgag 4741 aaactggatc atgaagctgt ttcaccagca cacctcacgg tcatggtacg agatcaagat 4801 gtgcctgtaa aacgcaactt tgcaaggatt gtggtcaatg tcagcgacac gaatgaccac 4861 gccccgtggt tcaccgcttc ctcctacaaa gggcgggttt atgaatcggc agccgttggc 4921 tcagttgtgt tgcaggtgac ggctctggac aaggacaaag ggaaaaatgc tgaagtgctg 4981 tactcgatcg agtcaggaaa tattggaaat attggaaatt cttttatgat tgatcctgtc 5041 ttgggctcta ttaaaactgc caaagaatta gatcgaagta accaagcgga gtatgattta 5101 atggtaaaag ctacagataa gggcagtcca ccaatgagtg aaataacttc tgtgcgtatc 5161 tttgtcacaa ttgctgacaa cgcctctccg aagtttacat caaaagaata ttctgttgaa 5221 cttagtgaaa ctgtcagcat tgggagtttc gttgggatgg ttacagccca tagtcaatca 5281 tcagtggtgt atgaaataaa agatggaaat acaggtgatg cttttgatat taatccacat 5341 tctggaacta tcatcactca gaaagccctg gactttgaaa ctttgcccat ttacacattg 5401 ataatacaag gaactaacat ggctggtttg tccactaata caacggttct agttcacttg 5461 caggatgaga atgacaacgc gccagttttt atgcaggcag aatatacagg actcattagt 5521 gaatcagcct caattaacag cgtggtccta acagacagga atgtcccact ggtgattcga 5581 gcagctgatg ctgataaaga ctcaaatgct ttgcttgtat atcacattgt tgaaccatct 5641 gtacacacat attttgctat tgattctagc actggtgcta ttcatacagt actaagtctg 5701 gactatgaag aaacaagtat ttttcacttt accgtccaag tgcatgacat gggaacccca 5761 cgtttatttg ctgagtatgc agcgaatgta acagtacatg taattgacat taatgactgc 5821 ccccctgtgt ttgccaagcc attatatgaa gcatctcttt tgttaccaac atacaaagga 5881 gtaaaagtca tcacagtaaa tgctacagat gctgattcaa gtgcattctc acagttgatt 5941 tactccatca ccgaaggcaa catcggggag aagttttcta tggactacaa gactggtgct 6001 ctcactgtcc aaaacacaac tcagttaaga agccgctacg agctaaccgt tagagcttcc 6061 gatggcagat ttgccggcct tacctctgtc aaaattaatg tgaaagaaag caaagaaagt 6121 cacctaaagt ttacccagga tgtctactct gcggtagtga aagagaattc caccgaggcc 6181 gaaacattag ctgtcattac tgctattggg agtccaatca atgagccttt gttttatcac 6241 atcctcaacc cagatcgcag atttaaaata agccgcactt caggggttct gtcaaccact 6301 ggcacgccct tcgatcgtga gcagcaggag gcgtttgatg tggttgtaga agtgatagag 6361 gaacataagc cttctgcagt ggcccacgtt gtcgtgaagg tcattgtaga agaccaaaat 6421 gataatgcgc cggtgtttgt caaccttccc tactacgccg ttgttaaagt ggacactgag 6481 gtgggccatg tcattcgcta tgtcactgct gtagacagag acagtggcag aaacggggaa 6541 gtgcattact acctcaagga acatcatgaa cactttcaaa ttggaccctt gggtgaaatt 6601 tcactgaaaa agcaatttga gcttgacacc ttaaataaag aatatcttgt tacagtggtt 6661 gcaaaagatg gagggaaccc ggccttttca gcggaagtta tcgttccgat cactgtcatg 6721 aataaagcca tgcctgtgtt tgaaaaacct ttctacagtg cagagattgc agagagcatc 6781 caggtgcaca gccctgtggt ccacgtgcag gctaacagcc cggaaggcct gaaagtgttc 6841 tacagcatca cagacggaga ccctttcagc cagttcacta ttaacttcaa tactggagtt 6901 atcaatgtca tagctcctct ggactttgag gcccacccgg catataagct gagcatacgc 6961 gcaactgact ccttgacggg cgctcatgct gaagtatttg tggacatcat agtagacgac 7021 atcaatgata accctcctgt gtttgctcag cagtcttatg cggtgaccct gtctgaggca 7081 tctgtaattg gaacgtctgt tgttcaagtt agagccaccg attctgattc agaaccaaat 7141 agaggaatct cataccagat gtttgggaat cacagcaaga gtcatgatca ttttcatgta 7201 gacagcagca ctggcctcat ctcactactc agaaccctgg attacgagca gtcccggcag 7261 cacacgattt ttgtgagggc agttgatggt ggtatgccca cgctgagcag tgatgtgatt 7321 gtcacggtgg acgttaccga cctcaatggt aatccaccac tctttgaaca acagatttat 7381 gaagccagaa ttagcgagca cgcccctcat gggcatttcg tgacctgtgt aaaagcctat 7441 gatgcagaca gttcagacat agacaagttg cagtattcca ttctgtctgg caatgatcat 7501 aaacattttg tcattgacag tgcaacaggg attatcaccc tctcaaacct gcaccggcac 7561 gccctgaagc cattttacag tcttaacctg tcagtgtctg atggagtttt tagaagttcc 7621 acccaggttc atgtaactgt aattggaggc aatttgcaca gtcctgcttt ccttcagaac 7681 gaatatgaag tggaactagc tgaaaacgct cccctacata ccctggtgat ggaggtgaaa 7741 actacggatg gggattctgg tatttatggt cacgttactt accatattgt aaatgacttt 7801 gccaaagaca gattttacat aaatgagaga ggacagatat ttactttgga aaaacttgat 7861 cgagaaaccc cggcggagaa agtgatctca gtccgtttaa tggctaagga tgctggagga 7921 aaagttgctt tctgcaccgt gaatgtcatc cttacagatg acaatgacaa tgcaccacaa 7981 tttcgagcaa ccaaatacga agtgaatatc gggtccagtg ctgctaaagg gacttcagtc 8041 gtaaagtctg caagtgatgc cgatgagggc tccaatgccg acatcaccta tgccattgaa 8101 gcagactctg aaagtgtaaa agagaatttg gaaattaaca aactgtccgg cgtaatcact 8161 acaaaggaga gcctcattgg cttggaaaat gaattcttca ctttctttgt tagagctgtg 8221 gataatgggt ctccatcaaa agaatctgtt gttcttgtct atgttaaaat ccttccaccg 8281 gaaatgcagc ttccaaaatt ttcagaacct ttctatacct ttacagtgtc agaggacgtg 8341 cctgttggaa cagagataga tctcatccga gcagaacata gtgggactgt tctttacagc 8401 ctggtcaaag ggaatactcc agaaagcaat agggatgagt cctttgtgat tgacagacag 8461 agcgggagac tgaagttgga gaagagtctt gatcatgaga caactaagtg gtatcagttt 8521 tccatactgg ccaggtgcac tcaagatgac catgagatgg tggcttctgt agatgttagt 8581 atccaagtga aagatgcaaa tgacaacagc ccggtctttg aatctagtcc atatgaggca 8641 ttcattgttg aaaacctgcc agggggaagt agagtaattc agatcagggc atctgatgct 8701 gactcaggaa ccaacggcca agttatgtat agcctggatc agtcacaaag tgtggaagtc 8761 attgaatcct ttgccattaa catggaaaca ggctggatta caactttaaa ggaacttgac 8821 catgaaaaga gagacaatta ccagattaaa gtggttgcat cagatcatgg tgaaaagatc 8881 cagctatcct ccacagccat tgtggatgtt accgtcaccg atgtcaacga tagtccacca 8941 cgattcacgg ccgagatcta taaagggact gtgagtgagg atgaccccca aggtggggtg 9001 attgccatct taagtaccac ggatgctgat tctgaagaga tcaacagaca agttacatat 9061 ttcataacag gaggggatcc tttaggacag tttgccgttg aaactataca gaatgaatgg 9121 aaggtatatg tgaagaaacc tctagacagg gaaaaaaggg acaattacct tcttactatc 9181 acggcaactg atggcacctt ctcatcaaaa gcgatagttg aagtgaaagt tctggatgca 9241 aatgacaaca gtccagtttg tgaaaagact ttatattcag acactattcc tgaagacgtc 9301 cttcctggaa aattgatcat gcagatctct gctacagacg cagacatccg ctctaacgct 9361 gaaattactt acacgttatt gggttcaggt gcagaaaaat tcaaactaaa tccagacaca 9421 ggtgaactga aaacgtcaac cccccttgat cgtgaggagc aagctgttta tcatcttctc 9481 gtcagggcca cagatggagg aggaagattc tgccaagcca gtattgtcgt cacgctagaa 9541 gatgtgaacg ataacgcccc cgaattctct gccgatcctt atgccatcac cgtgtttgaa 9601 aacacagagc cgggaacgct gctgacaaga gtgcaggcca cagatgccga cgcaggatta 9661 aatcggaaga ttttatactc actgattgac tctgctgatg ggcagttctc cattaacgaa 9721 ttatctggaa ttattcagtt agaaaaacct ttggacagag aactccaggc agtatacacc 9781 ctctctttga aagctgtgga tcaaggcttg ccaaggaggc tgactgccac tggcactgtg 9841 attgtatcag ttcttgacat aaatgacaac ccccctgtgt ttgagtaccg tgaatatggt 9901 gccaccgtgt ctgaggacat tcttgttgga actgaagttc ttcaagtgta tgcagcaagt 9961 cgggatattg aagcaaatgc agaaatcacc tactcaataa taagtggaaa tgaacatggg 10021 aaattcagca tagattctaa aacaggggcc gtatttatca ttgagaatct ggattatgag 10081 agctctcatg agtattacct aacagtagag gccactgatg gaggcacgcc ttcactgagc 10141 gacgttgcca ctgtgaacgt taatgtaaca gatatcaacg ataatacccc tgtgttcagc 10201 caagacacct acacgacagt catcagtgaa gatgccgttc ttgagcagtc tgtcatcacg 10261 gttatggccg atgatgccga tggaccttcc aacagccaca tccactactc aattatagat 10321 ggcaaccaag gaagctcgtt cacaattgac cccgtcaggg gagaagtcaa agtgaccaaa 10381 cttctcgacc gagaaacgat ttcaggttac acgctcacgg ttcaagcttc tgataatggc 10441 agtccaccca gagtcaacac gacgaccgtg aacatcgatg tgtccgatgt caatgacaac 10501 gcgcccgtct tctccagggg aaactacagt gtcattatcc aggaaaataa gccagtgggc 10561 ttcagcgtgc tgcagctggt agtaacagat gaggattctt cccataacgg tccacccttc 10621 ttctttacta ttgtaactgg aaatgatgag aaggcttttg aagttaaccc gcaaggagtc 10681 ctcctgacat catctgccat caagaggaag gagaaagatc attacttact gcaggtgaag 10741 gtggcagata atggaaagcc tcagttgtca tctttgacat acattgacat tagggtaatt 10801 gaggagagca tctatccgcc tgcgattttg cccctggaga ttttcatcac ctcttctgga 10861 gaagaatact caggtggcgt cattgggaag atccatgcca cagaccagga cgtgtatgat 10921 actctaacct acagtctcga ccctcagatg gacaacctgt tctctgtttc cagcacaggg 10981 ggcaagctga tagcacacaa aaagctagac atagggcaat accttctcaa tgtcagcgta 11041 acagatggga agttcacgac ggtggccgac atcacagtgc atatcagaca agtcacacag 11101 gagatgttga accacaccat cgcgatccgc tttgccaacc tcactccgga agaattcgtt 11161 ggtgactact ggcgcaactt ccagcgagct ttacggaaca tcctgggtgt gaggaggaac 11221 gacatacaga ttgttagttt gcagtcctct gaacctcacc cacatctgga cgtcttactt 11281 tttgtagaga aaccaggtag tgctcagatc tcaacaaaac aacttctgca caagattaac 11341 tcttccgtga ctgacattga ggaaatcatt ggagttagga tactgaatgt attccagaaa 11401 ctctgcgcgg gactggactg cccctggaag ttctgcgatg aaaaggtgtc tgtggatgaa 11461 agtgtgatgt caacacacag cacagccaga ctgagttttg tgactccccg ccaccacagg 11521 gcagcggtgt gtctctgcaa agagggaagg tgcccacctg tccaccatgg ctgtgaagat 11581 gatccgtgcc ctgagggatc cgaatgtgtg tctgatccct gggaggagaa acacacctgt 11641 gtctgtccca gcggcaggtt tggtcagtgc ccagggagtt catctatgac actgactgga 11701 aacagctacg tgaaataccg tctgacggaa aatgaaaaca aattagagat gaaactgacc 11761 atgaggctca gaacatattc cacgcatgcg gttgtcatgt atgctcgagg aactgactat 11821 agcatcttgg agattcatca tggaaggctg cagtacaagt ttgactgtgg aagtggccct 11881 ggaattgtct ctgttcagag cattcaggtc aatgatgggc agtggcacgc agtggccctg 11941 gaagtgaatg gaaactatgc tcgcttggtt ctagaccaag ttcatactgc atcgggcaca 12001 gccccaggga ctctgaaaac cctgaacctg gataactatg tgttttttgg tggccacatc 12061 cgtcagcagg gaacaaggca tggaagaagt cctcaagttg gtaatggttt caggggttgt 12121 atggactcca tttatttgaa tgggcaggag ctccctttaa acagcaaacc cagaagctat 12181 gcacacatcg aagagtcggt ggatgtatct ccaggctgct tcctgacggc cacggaagac 12241 tgcgccagca acccttgcca gaatggaggc gtttgcaatc cgtcacctgc tggaggttat 12301 tactgcaaat gcagtgcctt gtacataggg acccactgtg agataagcgt caatccgtgt 12361 tcctccaacc catgcctcta tgggggcacg tgtgttgtcg acaacggagg ctttgtttgc 12421 cagtgtagag gattatatac tggtcagagg tgtcagctta gtccatactg caaagatgaa 12481 ccctgtaaga atggcggaac atgctttgac agtttggatg gcgccgtttg tcagtgtgat 12541 tcgggtttta ggggagaaag gtgtcagagt gatatcgacg agtgctctgg aaacccttgc 12601 ctgcacgggg ccctctgtga gaacacgcac ggctcctatc actgcaactg cagccacgag 12661 tacaggggac gtcactgcga ggatgctgcg cccaaccagt atgtgtccac gccgtggaac 12721 attgggttgg cggaaggaat tggaatcgtt gtgtttgttg cagggatatt tttactggtg 12781 gtggtgtttg ttctctgccg taagatgatt agtcggaaaa agaagcatca ggctgaacct 12841 aaagacaagc acctgggacc cgctacggct ttcttgcaaa gaccgtattt tgattccaag 12901 ctaaataaga acatttactc agacatacca ccccaggtgc ctgtccggcc tatttcctac 12961 accccgagta ttccaagtga ctcaagaaac aatctggacc gaaattcctt cgaaggatct 13021 gctatcccag agcatcccga attcagcact tttaaccccg agtctgtgca cgggcaccga 13081 aaagcagtgg cggtctgcag cgtggcgcca aacctgcctc ccccaccccc ttcaaactcc 13141 ccttctgaca gcgactccat ccagaagcct agctgggact ttgactatga cacaaaagtg 13201 gtggatcttg atccctgtct ttccaagaag cctctagagg aaaagccttc ccagccatac 13261 agtgcccggg aaagcctgtc tgaagtgcag tccctgagct ccttccagtc cgaatcgtgc 13321 gatgacaatg ggtatcactg ggatacatca gattggatgc caagcgttcc tctgccggac 13381 atacaagagt tccccaacta tgaggtgatt gatgagcaga cacccctgta ctcagcagat 13441 ccaaacgcca tcgatacgga ctattaccct ggaggctacg acatcgaaag tgattttcct 13501 ccacccccag aagacttccc cgcagctgat gagctaccac cgttaccgcc cgaattcagc 13561 aatcagtttg aatccatcca ccctcctaga gacatgcctg ccgcgggtag cttgggttct 13621 tcatcaagaa accggcagag gttcaacttg aatcagtatt tgcccaattt ttatcccctc 13681 gatatgtctg aacctcaaac aaaaggcact ggtgagaata gtacttgtag agaaccccat 13741 gccccttacc cgccagggta tcaaagacac ttcgaggcgc ccgctgtcga gagcatgccc 13801 atgtctgtgt acgcctccac cgcctcctgc tctgacgtgt cagcctgctg cgaagtggag 13861 tccgaggtca tgatgagtga ctatgagagc ggggacgacg gccacttcga agaggtgacg 13921 atcccgcccc tggattccca gcagcacacg gaagtctgac tctcaactcc ccccaaagtg 13981 cctgacttta gtgaacctag aggtgatgtg agtaatccgc gctgttcttt gcagcagtgc 14041 ttccaagctt tttttggtga gccgaatggg catggctgcg ctggatcctg cgcctctgga 14101 cgtgctagcc atttccagtg tcccaactac tgtcatcgtg aggttttcat cggctgtgcc 14161 atttcccaac gtcttttggg atttacatct gtctgtgtta aaataatcaa acgaaaaatc 14221 agtcctgtgt tgtcagcatg attcatgtat ttatatagat ttgattattt taattttcct 14281 gtctcttttt tttgtaaatt ttatgtacag atttgatttt tcatagtttt aactagattt 14341 ccaagatatt ttgtgcattt gtttcaactg aattttggtg gtgtcagtgc cattatctag 14401 caccctgatt tttttttttt tactataacc agggtttcat tctgtctttt tccactgaag 14461 tgtgacattt tgttagtaca tttcagtgta gtcattcatt tctagctgta cataggatga 14521 aggagagatc agatacatga acatgtctta catgggttgc tgtatttaga attataaaca 14581 tttttcatta ttggaaagtg taacggggac cttctgcata cctgtttaga accaaaacca 14641 ccatgacaca gtttttatag tgtctgtata tttgtgatgc aatggtcttg taaaggtttt 14701 taatgaaaac taccattagc cagtctttct tactgacaat aaattattaa taaaat // LOCUS HSHFH4 2286 bp RNA PRI 01-APR-1997 DEFINITION H.sapiens mRNA for fork head homologue 4. ACCESSION X99349 NID g1922309 KEYWORDS fork head homologue 4; hepatocyte nuclear factor 3. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2286) AUTHORS Murphy,D.B., Seemann,S., Wiese,S., Kirschner,R., Grzeschik,K.H. and Thies,U. TITLE The human hepatocyte nuclear factor 3/fork head gene FKHL13: genomic structure and pattern of expression JOURNAL Genomics 40 (3), 462-469 (1997) MEDLINE 97230460 REFERENCE 2 (bases 1 to 2286) AUTHORS Wiese,S. TITLE Direct Submission JOURNAL Submitted (09-JUL-1996) S. Wiese, Institut fuer Humangenetik, Gosslerstr 12 D, 37073 Goettingen, FRG FEATURES Location/Qualifiers source 1..2286 /organism="Homo sapiens" /db_xref="taxon:9606" /map="q22-q25" /chromosome="17" gene 12..2276 /gene="HFH4" CDS 12..1271 /gene="HFH4" /codon_start=1 /product="fork head homologue 4" /db_xref="PID:e264336" /db_xref="PID:g1922310" /translation="MAESCLRLSGTGPAEEPAGGRLEEPDALDDSLTSLQWLQEFSIL NAKAPALPPGGTDPHGYHQVPGSAAPGSPLAADPACLGQPHTPGKPTSSCTSRSAPPG LQAPPPDDVDYATNPHVKPPYSYATLICMAMQASKATKITLSAIYKWITDNFCYFRHA DPTWQNSIRHNLSLNKCFIKVPREKDEPGKGGFWRIDPQYAERLLSGAFKKRRLPPVH IHPAFARQAAQEPSAVPRAGPLTVNTEAQQLLREFEEPTGEAVWVQARAGWDISPNTL CPRGGQGPAPPSTLLPTPEEQGELEPLKGNFDWEAIFDAGTLGGELGALEALELSPPL SPASHVDVDLTIHGRHIDCPATWGPSVEQAADSLDFDETFLATSFLQHPWDESGSGCL PPEPLFEAGDATLASDLQDWASVGAFL" polyA_signal 2270..2276 /gene="HFH4" BASE COUNT 448 a 772 c 702 g 364 t ORIGIN 1 gaattccaga catggcggag agctgtctgc gcctctcggg aaccggcccg gcggaggagc 61 cggccggagg gcgcctggag gagcccgacg ccctggatga cagcctgacc agcctgcagt 121 ggctgcagga attctccatt ctcaacgcca aggcccccgc cctgcccccg gggggcaccg 181 acccccacgg ctaccaccag gtgccaggtt cagcggcgcc cgggtccccc ctggcggccg 241 accccgcctg cctggggcag ccacacacgc cgggcaagcc cacgtcgtcg tgcacgtcgc 301 ggagcgcgcc cccggggctg caggccccac cccccgacga cgtggactac gccaccaatc 361 cgcacgtgaa gcctccctac tcgtatgcca cgctcatctg catggccatg caggccagca 421 aggccaccaa gatcaccctg tcggccatct acaagtggat cacggacaac ttctgctact 481 tccgccacgc agatcccacc tggcagaatt caatccgcca caacctgtct ctgaacaagt 541 gcttcatcaa agtgcctcgg gagaaggacg aaccaggcaa ggggggcttc tggcgcattg 601 acccccagta cgcggagcgg ctactgagcg gcgctttcaa gaagcggcga ctgccccctg 661 tccacatcca cccagccttt gcccgccagg ccgcgcagga gcccagcgct gtcccccggg 721 ccgggccgct gacggtgaat accgaggccc agcagctgct gcgggagttc gaggagccca 781 ccggggaggc ggtctgggta caggcgaggg caggctggga cataagccca aacaccctct 841 gcccaagggg tggccaaggt cccgcgcccc ccagcaccct gctgcccacc ccggaggagc 901 agggtgagct ggaacccctc aaaggcaact ttgactggga ggccatcttc gacgccggca 961 ctctgggcgg ggagctgggt gcactggagg ccctggagct gagcccgcct ctgagccccg 1021 cctcacacgt ggacgtggac ctcaccatcc acggccgcca catcgactgc cctgccacct 1081 gggggccttc ggtggagcag gctgccgaca gcctggactt cgatgagacc ttcctggcca 1141 catccttcct gcagcacccc tgggacgaga gcggcagtgg ctgcctgccc ccggagcccc 1201 tctttgaggc tggggatgcc accctggcct ccgacctgca ggactgggcc agcgtggggg 1261 ccttcttgta agaggccagg ccctgcccca cctctggaca gtgcccaagt cagggtccaa 1321 aactgccccc caacacaggt ccacagacac cccaccacct atgcaggggc tgggccaggg 1381 ctccaaggct tgccccaaag gccacatggc caccagcccc agctgccatc agattcaagc 1441 ccaggaggct gaaaacgagg gcccaggacc agaatcgctg cctcctttcc ccagccccac 1501 cttgtacaca cagtgtttca ttgttccgtg ttttcccagc cccagaaacc ggctaaagga 1561 ccctgcacca tgagagccga ggcctggagg agcccgggtc aggctgggga ggaacagaac 1621 tgggccctcc cagagcacct ccgcttcccc cctgcttccc caggtctcta tccagagaga 1681 gtccccaggt acaacaaatg ctaattagat gacagcaaat taaccccctg gaggcttctc 1741 ctggcagagc ctccctgggg ccggggcagg ctgtggatgg ggcggagcag ggcagaagat 1801 ggactggggg agggggcaga gagaggagac caaaatgagg tggtgccaca gggtggggca 1861 aggagatcct ctctaaggcc tctggggtct ttgcctggcc ccatccctag ggggcgggga 1921 ggggacgtaa atccctaatc tttaagcccg acttgaggct gagagcagct ggaagtttgg 1981 gtttggtggt ttgggggccg gggcaaccaa gctgtatggg gcaggacaga cagactaatg 2041 tagtgagtgt agctgtagct gaggcttaac tgggagggat gccgagcttg ctggaactac 2101 tgggaccaag aagcggggta ccccacgccc ctgcctgcac tcctcggggg cgtggggcgt 2161 gccttgctcc acccggactc cctgggctgc gtcccacatc caccctcctg ccccgtgggg 2221 caatttaacc tttttcatga aagttattta caatgaaaag tttttaaaaa taaaattttt 2281 gaattc // LOCUS HSHFKH4 3575 bp RNA PRI 08-JAN-1997 DEFINITION H.sapiens HFKH4 mRNA for fork head like protein. ACCESSION X94553 NID g1770431 KEYWORDS fork head domain; fork head related protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3575) AUTHORS Wiese,S., Murphy,D.B., Emmerich,D. and Thies,U. TITLE The novel HNF3/fork head like 4 (HFKH4) gene is JOURNAL Unpublished REFERENCE 2 (bases 1 to 3575) AUTHORS Wiese,S. TITLE Direct Submission JOURNAL Submitted (28-DEC-1995) S. Wiese, Institut fuer Humangenetik, Gosslerstr 12 D, 37073 Goettingen, FRG FEATURES Location/Qualifiers source 1..3575 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /dev_stage="fetal" /clone_lib="lambda ZAPII" gene 2055..3557 /gene="HFKH4" CDS 2055..3557 /gene="HFKH4" /codon_start=1 /product="fork head like protein" /db_xref="PID:e218476" /db_xref="PID:g1770432" /translation="MGGPARSPSASTNCLLCLRAPKPLLRAHNLGSNPKLAGTTDQLQ PPQPRDHFRTPRPPGTSAQGTLQPETRVQSGREATALPASRSTTRAEPSASGSLPSLC LHRASPRPRTLSLQRAPAWAAGLSGTARDDPLSSPQKGRASVPGTPGPPPPPDSVGIQ SPGVWDARAMTVERAVVAKPEVWYREGRAGAPAPPAARKPPYSYIRRHAMAIGSPRLT LGGIYKFITEGFPFYPDNPKKWQNSIRHNLTINDCFLKIPREAGRRRKGNYWALDPNA EDMFESGSFLRRRKASSVGLSTYPAYMQDAAAAAAAAAAAAIFPGAVPPRAPPNRAPS IQAKRAAVAGRPPHLLPAESPGHFRVFGLVPERPLKQELGPAPWGPGGSFAFSSDGAP ATTNGYQPRQASPGPVRPTPSYAAAYAGPDGSTPREKAVRYFADAGRVGGTPCPQRAA AVAGGDHGGLLRRTSPGQFGALEPATTGGQLGGPVQAPTMLAMLPLIPVG" misc_feature 2609..2938 /gene="HFKH4" /note="fork head region" BASE COUNT 696 a 1245 c 1038 g 595 t 1 others ORIGIN 1 gaattcctag actcccgagt cttggaatct tagaattaca gggtcgcagg atcgcaggat 61 cgcattcttt agatttatgg aatcacgtta gccccgtcgt ccagaattta aaatctagat 121 atctgaatct cagaatcttg aaagttactg agagccacga aatcccgtat taaaaggtgc 181 ctataatata atccagttaa aaattttaaa atttgaagtt tttcctcctt tgtaaaaacg 241 aaacagacga aatctgggcg ttttgcttct ccattacttg ggaagaagga aaccaggagt 301 tttgtttagc gagtgtaaag cccctctccc ttcgttcgac caaccctcta ccgcagaagg 361 ttgtcaagca aagcctttga gcgtttccac acaccgggtc gacggagcaa ggatctgggc 421 tttgctcgct cttccgaagc agctgccaac gcttggccgg gcttagctgc cctcagctca 481 ccgccacgaa gaacgcgact aaaaccctcc aagcatgcca cccgttaccg gccagaacct 541 ctcgtcttgg gatctcccac actcacctgg caccaccctc ccgctctcac ccacagcccc 601 gcccgccccc ccccccccac accggaaagc tgcgtccggg ctggagcact ggaacccgcg 661 cccaaggggg gaatcctatg cgtcaggggc ctcggagatc agcacacgcc caccagctat 721 ttaacagagt agaaccctga ggccctgcga ggggacaagg acaggccctg gatctcccag 781 tgaatggcca gggaacgaac ccggcgagag gggcgcgcgc aggatctcag gttaaggacc 841 aagttccggc tcagggacag caggaaagga actcagaaat tggacaccat gaagcaaacg 901 tgtgtcccga ctgcccgccc cttcccccgg agacgcccac ccggccaccg ttctcttccc 961 actcccccat tacccacagc cctcactccc ctgcggaagg ggtgcttggc tgcctctggg 1021 ggcttcagag ctaccctggt cccgggggat tggaggagga ggttacctct cctgcgtcgt 1081 tctttcaatc cgtgcaccac tatccatcaa atagagacag atcctgggcc tctcaaagac 1141 ggatgattgg gggtggtgat tggcctatcc ctaaatatct accacgcaag gactcttgag 1201 agatccagac cccggtacag tcgagggacc tggggcccaa aaagsggaaa gcggctacct 1261 cttaccacac agttgggaag cgcagtccta aaggagacgc aggttggaga ctccgctaag 1321 cggagaagcc gcagtggggc catggaaagt caccttccct ttcggttcta ggaattactc 1381 attcgaaaga tggggggact ggagtgccga gtggctgtgg cagccacgat tggggtttgg 1441 aaaccatcct gaaaggcccg gggagccagt ctcctggaac ttctccctcc ctattcccac 1501 aaaaaccaag cgccctctcg gccaattctc accctctcag gacaaaaaag tgagatgagc 1561 ccgtcctttc acctgcgatc caagcccttg gcagaggcct gaaaagtccg aaaactccga 1621 gttcgggcgg tgaggtctcc cgagccggtt cctgaactct ccgggcctca gtcgatcggg 1681 gtgggacccc cccccccccg cccatctcca agcgccctcc ccaccctgac gttgtggggc 1741 tcctaccggg cgccacagct gctcctacct ggggaggtgc gccgggcccc aggggggcgg 1801 acaagtcggg gggcgggcag ggaaccggtt ccgccccacg cttcgtgtgc ccctttaagg 1861 aggggaagcc ggccgaggga ggagccggtc cagtgtgtgc aggggagcgc ctcgccagcg 1921 gtccgcgggg ctggagacca cgccgtggag aggaccagcc tcaggtcgcc ccgcctgggg 1981 ccgggtccca cctcactgcc ccgcctcgcc tctctgcccg tggcgttacc gccaccttgc 2041 ctcgggggca gggcatgggc ggccccgcca gatcgcccag cgccagtact aactgcctgc 2101 tctgccttcg agccccgaag cctcttctgc gcgcgcacaa cctaggcagt aatcctaaac 2161 tagcgggcac cacagaccag ctgcagccac cccaacccag ggatcacttc cggacccctc 2221 gaccgcccgg caccagcgcg caagggaccc ttcagccgga gaccagagtc cagtccggtc 2281 gcgaggccac cgcgctgccc gcctcgagaa gcacaacgcg ggctgagccg tcggctagcg 2341 ggtcactccc gagcctctgt ctgcaccgcg ccagccccag accacggacg ctgagcctcc 2401 agcgcgcgcc agcctgggcc gctgggctct ccgggacagc ccgtgacgat cccctgagct 2461 ctccgcagaa gggccgagcg tccgttccgg ggacgccagg cccgcccccg ccccccgaca 2521 gcgtggggat ccagagcccg ggggtgtggg acgcccgcgc catgactgtc gagagggccg 2581 tcgtcgcgaa gccggaggtg tggtaccgtg aaggaagagc gggcgcgcca gcgccccctg 2641 cagcgcggaa gccgccctac agctacattc ggcgtcatgc catggccatc ggcagcccga 2701 ggctcacgct gggcggcatc tacaagttca tcaccgaggg cttccccttc tacccggaca 2761 accccaaaaa gtggcagaac agcatccgcc acaacctcac aatcaacgac tgcttcctca 2821 agatcccgcg cgaggccggc cgccgccgta agggcaacta ctgggcgctc gaccccaacg 2881 cggaggacat gttcgagagc ggcagcttcc tgcgccgccg caaggcttca agcgtcggac 2941 tctccaccta cccggcttac atgcaggacg cggcggctgc cgccgccgcc gccgccgccg 3001 ccgccatctt cccaggggcg gtcccgccgc gcgccccccc taacagggct ccgtctattc 3061 aggctaagcg cgccgccgtc gctggccgcc cgcctcatct actacccgcg gagtcgcccg 3121 gccatttccg cgtcttcggc ctggttcctg agcggccgct caagcaagaa ttggggcccg 3181 caccgtgggg gcccggcggc tctttcgcct tttcctccga tggcgccccc gctaccacca 3241 acggctacca accacgacag gcttcaccgg gacccgtccg gccaaccccc tcctatgcgg 3301 ctgcctacgc gggccccgac ggaagtaccc ccagggagaa ggcagtgcga tactttgccg 3361 atgctgggcg ggtcgggggc accccttgcc cccagcgggc ggcagcagtg gcgggtggag 3421 accacggtgg acttctacgg cgcacgtcgc ccggccagtt cggagcgctg gagcctgcta 3481 caactggcgg gcagctcgga gggccagtgc aggcgcctac catgctcgcc atgctgccgc 3541 ttatcccggt gggatagatc ggttcgaggg aattc // LOCUS HSHGDS 1677 bp RNA PRI 07-APR-1992 DEFINITION H.sapiens hGDS mRNA for smg GDS. ACCESSION X63465 NID g32079 KEYWORDS exchange protein; hGDS gene; smg GDS; stimulatory GDP/GTP exchange protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1677) AUTHORS Kikuri,A. TITLE Direct Submission JOURNAL Submitted (13-NOV-1991) A. Kikuri, Department of Biochemistry, Kobe University School of Medicine, Kusonoki-cho 7-5-1, Kobe 650, JAPAN REFERENCE 2 (bases 1 to 1677) AUTHORS Kikuchi,A., Kaibuchi,K., Hori,Y., Nonaka,H., Sakoda,T., Kawamura,M., Mizuno,T. and Takai,Y. TITLE Molecular cloning of the human cDNA for a stimulatory GDP/GTP exchange protein for c-Ki-ras p21 and smg p21 JOURNAL Oncogene 7 (2), 289-293 (1992) MEDLINE 92195658 FEATURES Location/Qualifiers source 1..1677 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..1677 /gene="hGDS" CDS 1..1677 /gene="hGDS" /codon_start=1 /product="smg GDS" /db_xref="PID:g32080" /translation="MDNLSDTLKKLKITAVDKTEDSLEGCLDCLLQALAQNNTETSEK IQASGILQLFASLLTPQSSCKAKVANIIAEVAKNEFMRIPCVDAGLISPLVQLLNSKD QEVLLQTGRALGNICYDSHSLQAQLINMGVIPTLVKLLGIHCQNAALTEMCLVAFGNL AELESSKEQFASTNIAEELVKLFKKQIEHDKREMIFEVLAPLAENDAIKLQLVEAGLV ECLLEIVQQKVDSDKEDDITELKTGSDLMVLLLLGDESMQKLFEGGKGSVFQRVLSWI PSNNHQLQLAGALAIANFARNDANCIHMVDNGIVEKLMDLLDRHVEDGNVTVQHAALS ALRNLAIPVIDKAKMLSAGVTEAVLKFLKSEMPPVQFKLLGTLRMLIDAQAEAAEQLG KNVKLVERLVEWCEAKDHAGVMGESNRLLSALIRHSKSKDVIKTIVQSGGIKHLVTMA TSEHVIMQNEALVALALIAALELGTAEKDLESAKLVQILHRLLADERSAPEIKYNSMV LICALMGSECLHKEVQDLAFLDVVSKLRSHENKSVRQQASLTEQRLTVES" BASE COUNT 547 a 291 c 388 g 451 t ORIGIN 1 atggataatc tcagtgatac cttgaagaag ctgaagataa cagctgttga caagactgag 61 gatagtttag aaggatgctt ggattgtctg cttcaagccc tggctcaaaa taatacggaa 121 acaagtgaaa aaatccaagc aagtggaata cttcagctgt ttgcaagtct gttgactcca 181 cagtcttcct gcaaagccaa agtagctaac atcatagcag aagtagccaa aaatgagttt 241 atgcgaattc catgtgtgga tgctggattg atttcaccac tggtgcagct gctaaatagc 301 aaagaccagg aagtgctgct tcaaacgggc agggctctag gaaacatatg ttacgatagc 361 cattcgcttc aagctcagct tatcaatatg ggtgttattc ctaccttagt gaaattactg 421 ggcatccact gccaaaatgc agctcttaca gaaatgtgtc ttgttgcatt tggtaattta 481 gcagaacttg agtcaagtaa agaacagttt gccagtacaa acattgctga agagctagta 541 aaactcttca agaaacaaat cgaacatgat aagagagaaa tgatttttga agttcttgct 601 ccattggcag aaaatgatgc tattaaacta cagctggttg aagcaggcct agtagagtgt 661 ctactagaga ttgttcagca aaaagtggat agtgacaaag aagatgatat tactgagctc 721 aaaactggtt cagatctcat ggttttatta cttcttggag atgaatccat gcagaagtta 781 tttgaaggag gaaaaggtag tgtatttcaa agggtactct cttggatccc atcaaataac 841 caccagctac agcttgctgg agcattggca attgcaaatt ttgccagaaa tgatgcaaat 901 tgtattcata tggtagacaa tgggattgta gaaaaactta tggatttact ggacagacat 961 gtagaagatg gaaatgtaac agtacagcat gcagcactaa gtgccctcag aaacctggcc 1021 attccagtta tagataaagc aaagatgtta tcagctgggg tcacagaggc agttttgaaa 1081 tttcttaaat ctgaaatgcc tcctgttcag ttcaaacttc tgggaacatt aagaatgtta 1141 atagatgcac aagcagaagc tgctgaacaa ttgggaaaga atgttaagtt agtggagcgt 1201 ttggtggaat ggtgtgaagc caaagatcat gctggtgtga tgggggagtc aaacagactg 1261 ctgtctgccc ttatacgaca cagtaaatca aaagatgtaa ttaaaaccat tgtgcagagt 1321 ggtggcatca agcatctagt taccatggca actagtgaac atgtaataat gcagaatgaa 1381 gctcttgttg ctttggcatt aatagcagct ttagaattgg gcactgctga gaaagatcta 1441 gaaagtgcta aacttgtaca gattttacat agactgctag cagatgagag aagtgctcct 1501 gaaatcaaat ataattccat ggtcctgata tgtgctctta tgggatctga atgtctacac 1561 aaggaagtac aggatttggc ttttctagat gtcgtatcca aacttcgcag tcatgagaac 1621 aaaagtgttc gccagcaggc ctctctcaca gagcagagac ttactgtgga aagctga // LOCUS HSHGF 5898 bp RNA PRI 13-FEB-1994 DEFINITION Human mRNA for hepatocyte growth factor (HGF). ACCESSION X16323 S80567 NID g32081 KEYWORDS growth factor; hepatocyte growth factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5898) AUTHORS Nakamura,T. TITLE Direct Submission JOURNAL Submitted (16-OCT-1989) Nakamura T., Dept of Biology, Faculty of Science, Kyushu University, Hokozaki 6-10-1 Higashi ku, Fukuoka 812, Japan REFERENCE 2 (bases 1 to 5898) AUTHORS Nakamura,T., Nishizawa,T., Hagiya,M., Seki,T., Shimonishi,M., Sugimura,A., Tashiro,K. and Shimizu,S. TITLE Molecular cloning and expression of human hepatocyte growth factor JOURNAL Nature 342 (6248), 440-443 (1989) MEDLINE 90066676 REFERENCE 3 (bases 1 to 5898) AUTHORS Nakamura,T. TITLE Structure and function of hepatocyte growth factor JOURNAL Prog. Growth Factor Res. 3 (1), 67-85 (1991) MEDLINE 92135784 COMMENT Data kindly reviewed (08-JAN-1990) by Nakamura T. FEATURES Location/Qualifiers source 1..5898 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 135..2321 /note="HGF (AA 1-728)" /codon_start=1 /db_xref="PID:g32082" /db_xref="SWISS-PROT:P14210" /translation="MWVTKLLPALLLQHVLLHLLLLPIAIPYAEGHKKRRNTIHEFKK SAKTTLIKIDPALKIKTKKVNTADQCANRCTRNNGLPFTCKAFVFDKARKQCLWFPFN SMSSGVKKEFGHEFDLYENKDYIRNCIIGKGRSYKGTVSITKSGIKCQPWSSMIPHEH SFLPSSYRGKDLQENYCRNPRGEEGGPWCFTSNPEVRYEVCDIPQCSEVECMTCNGES YRGLMDHTESGKICQRWDHQTPHRHKFLPERYPDKGFDDNYCRNPDGQPRPWCYTLDP HTRWEYCAIKTCADNTVNDTDVPMETTECIQGQGEGYRGTANTIWNGIPCQRWDSQYP HKHDMTPENFKCKDLRENYCRNPDGSESPWCFTTDPNIRVGYCSQIPNCDMSNGQDCY RGNGKNYMGNLSQTRSGLTCSMWNKNMEDLHRHIFWEPDASKLNENYCRNPDDDAHGP WCYTGNPLIPWDYCPISRCEGDTTPTIVNLDHPVISCAKTKQLRVVNGIPTRTNVGWM ISLRYRNKHICGGSLIKESWVLTARQCFPSRDLKDYEAWLGIHDVHGRGEEKRKQVLN VSQLVYGPEGSDLVLMKLARPAVLDDFVNTIDLPNYGCTIPEKTSCSVYGWGYTGLIN YDGLLRVAHLYIMGNEKCSQHHRGKVTLNESEICAGAEKIGSGPCEGDYGGPLVCEQH KMRMVLGVIVPGRGCAIPNRPGIFVRVAYYAKWIHKIILTYKVPQS" misc_feature 5875..5879 /note="put. polyA signal" polyA_site 5898 /note="polyA site" BASE COUNT 1920 a 1024 c 1082 g 1872 t ORIGIN 1 cacacaacaa acttagctca tcgcaataaa aagcagctca gagccgactg gctcttttag 61 gcactgactc cgaacaggat tctttcaccc aggcatctcc tccagaggga tccgccagcc 121 cgtccagcag caccatgtgg gtgaccaaac tcctgccagc cctgctgctg cagcatgtcc 181 tcctgcatct cctcctgctc cccatcgcca tcccctatgc agagggacat aagaaaagaa 241 gaaatacaat tcacgaattc aaaaaatcag caaagactac cctaatcaaa atagatccag 301 cactgaagat aaaaaccaaa aaagtgaata ctgcagacca atgtgctaat agatgtacta 361 ggaataatgg acttccattc acttgcaagg cctttgtttt tgataaagcg agaaaacaat 421 gcctctggtt ccccttcaat agcatgtcaa gtggagtgaa gaaagaattt ggccatgaat 481 ttgacctcta tgaaaacaaa gactacatta gaaactgcat catcggtaaa ggacgcagct 541 acaagggaac agtatctatc actaagagtg gcatcaaatg tcagccctgg agttccatga 601 taccacacga acacagcttt ttgccttcga gctatcgggg taaagaccta caggaaaact 661 actgtcgaaa tcctcgaggg gaagaagggg gaccctggtg tttcacaagc aatccagagg 721 tacgctacga agtctgtgac attcctcagt gttcagaagt tgaatgcatg acctgcaatg 781 gggagagtta tcgaggtctc atggatcata cagaatcagg caagatttgt cagcgctggg 841 atcatcagac accacaccgg cacaaattct tgcctgaaag atatcccgac aagggctttg 901 atgataatta ttgccgcaat cccgatggcc agccgaggcc atggtgctat actcttgacc 961 ctcacacccg ctgggagtac tgtgcaatta aaacatgcgc tgacaatact gtaaatgata 1021 ctgatgttcc tatggaaaca actgaatgca tccaaggtca aggagaaggc tacaggggca 1081 ctgccaatac catttggaat ggaattccat gtcagcgttg ggattctcag tatcctcaca 1141 agcatgacat gactcctgaa aatttcaagt gcaaggacct acgagaaaat tactgccgaa 1201 atccagatgg gtctgaatca ccctggtgtt ttaccactga tccaaacatc cgagttggtt 1261 actgctccca aattccaaac tgtgatatgt caaatggaca agattgttat cgtgggaatg 1321 gcaaaaatta tatgggcaac ttatcccaaa caagatctgg actaacgtgt tcaatgtgga 1381 acaagaacat ggaagactta caccgtcata tcttctggga accagatgca agtaagctga 1441 atgagaatta ctgccgaaat ccagatgatg atgctcatgg accctggtgc tacacgggaa 1501 atccactcat tccttgggat tattgcccta tttctcgttg tgaaggtgat accacaccta 1561 caatagtcaa tttagaccat cctgtaatat cttgcgccaa aacgaaacaa ctgcgagttg 1621 taaatgggat tccaacacga acaaatgtag gatggatgat tagtttgaga tacagaaata 1681 aacatatctg cggaggatca ttgataaagg aaagttgggt tcttactgca cgacagtgtt 1741 tcccttctcg agacttgaaa gattatgagg cttggcttgg aattcatgat gtccatggaa 1801 gaggagagga gaaacgcaaa caggttctca atgtttccca gctggtatat ggccctgaag 1861 gatcagatct ggttttaatg aagcttgcca gacctgctgt cctggatgat tttgttaata 1921 caattgattt acctaattat ggatgcacaa ttcctgaaaa gaccagttgc agtgtttatg 1981 gctggggcta cactggattg atcaactatg atggtctatt acgagtggca catctctata 2041 taatgggaaa tgagaaatgc agccagcatc accgagggaa ggtgactctg aatgagtctg 2101 aaatatgtgc tggggctgag aagattggat caggaccatg tgagggggat tatggtggcc 2161 cacttgtttg tgagcaacat aaaatgagaa tggttcttgg tgtcattgtt cccggccgtg 2221 gatgcgccat tccaaatcgt cctggtattt ttgtccgagt agcatattat gcaaaatgga 2281 tacacaaaat tattttaaca tataaggtac cacagtcata gctgaagtaa gtgtgtctga 2341 agcacccacc aatacaactg tcttttacat gaagatttca gagaatgtgg aattaaaaat 2401 accacttaca acaatcctaa gacaactact ggagagtcat gtttgttaaa attctcatta 2461 atgtttatgg gtgttttctg ttgttttgtt tgtcagtgtt attttgtcaa tgttgaagtg 2521 aattaaggta catgcaagtg tagtaacata tctcctgaag atacttgaat ggattaaaaa 2581 aacacacagg tataattgct ggataaagat tttgtgggga aaaaatcaat taatctctct 2641 aagctgcttt ctgaggttgg tttcttaata atgagtaaac cataaattaa atgttatttt 2701 aacctcacca aaacaattta taccttgtgt ccttaaattg taccctatat taaattatat 2761 tacatttcat atgctatatg ttatagttca ttcatttctc ttcaccatgt atcctgcaat 2821 actggtacac gaacacactt tttacaaaac cacataccca tgtacacatg cctaggtaca 2881 catgtacatg cactacagtt taaattatga tgtacttaat gtaacctcta aatattttag 2941 aagtatgtac ctatagtttt acctcaaaaa aatagaaatc tctaaagacc agtagaaata 3001 ttaaaaaatg atgcaaaatc aaaatgagtg gctaattctc catacgtaat ctgcagatga 3061 tcttctctgg ttgacatttt acgtgtggcc atcaccccgg gttaaataac acctaatcta 3121 ggtgtttaca tgtattcaat atcctagttt gtttcatgta gtttctaatt cttaaaggaa 3181 agagggtaat aattctattt gtgtaatttg tttcctccaa acttaaggcc acttatttac 3241 acaagatatt tgtatgtcta ctttcctaaa gcatttcttc agtgctcaga tcagtgtcta 3301 attgaagaag attaaaactg ctttggtcat taaaaacgta tttaaatagg ttaattctaa 3361 gacttgctgc tgtgattgac ttctagctca ctgcctttaa attttaaaaa atttaagagg 3421 aaaattttca tgtctccaaa gttttataaa tacccttcat caagtcatgc attaaagtat 3481 atattagaga aaaaaaaata cttttctcaa cctggaagat tttagcctaa taaagttttt 3541 ttgaagtaaa agaaaacttg taaagggaaa gaaactagtt tgtctaaact ctgtattcat 3601 tttttttttt tttgaagtac agtggaatct gttgaatcag atattttatc aagatatctt 3661 tattttttct tatttcattt ttacaaagat cactcccaat gccatatgta atagacattt 3721 aaatttcgtg ttctgtatga cagccaaatg atcatattta tcattgtatt tgtcatgttt 3781 agctaaaaat catgtattgt tgagaaatag aataacaaaa agtaatagga taggctttga 3841 atttttgcaa aaaatcttcc tgtacaaaac atctttaaaa ataatttttt gagtggtgtg 3901 aatctagtat tcccatttct ctgatttagt tttcttgagt gatttttatc aaggctaagt 3961 ccccaaatga ttccctaaca gctctttaga ataccgttta atctggacta aaatggtttt 4021 aagtttatgg agagtttagt ccacagaact aactggactt ctggcggcaa gtccagaaat 4081 gcttatacaa attttttttt cataataaga tatgtgctgg tatcaaggaa cttaaagtgg 4141 aagcaaaaag acatccaagt agttgctagt ctccatcatc ttatctgatt gtatttctct 4201 tttccttata taatacacca ttttcataag aacacctaga aatttcaaga gtatattgcc 4261 aaaatataaa gtatatttcc tagtttcttc tggctgaacc agtgaaattt tattgttgca 4321 tattaatgat atttttaaaa cttttataaa aattgtcata cttttaaata ctcacatttt 4381 aaaaatactt cttttatgac tcttcctcta aatttcctgg aaatacagat aaagattagc 4441 tagatacaag atgcagctaa gtatttagac attttgagcc cagtattttt cattttatta 4501 aaggctaaaa acaataccac caataaatca tcaaacaaac tgtacaaaat aattctgtct 4561 ttgggaggct ccttttgtga tagagggaca tgggtggaat tgacaatgaa agttagatga 4621 acaaggtccg tgttatttta ggtagtagaa cagggtagag tcatgtcatt atttgcgggc 4681 ggaagatact atttaccacg tgttctttgc tgaatcaatt attaaacatt tttaaaaatc 4741 caattatcca ctttattttg tgtcattgac aaaaggatct tttaagtcag aggtttcaat 4801 gtgatttttg gcttggctgt ttgaataatg gttatgtact gttataattg tagacatttt 4861 ctcatgtcta ccaggaattg aagtgtaaaa ctaaaatatt tttcataatg cctctgccgt 4921 gcggaaggaa tgataatcct tttgtatact tctttaattt tattgtaaaa tgtgtaatga 4981 cttttaccta tatgctgtgg gcaggtcctc agtaaaatct attgagtcaa tttctagtat 5041 taataggctt ttgcttgcta tctaagtgtt tcaaattatg ggaagtgtga gacactggaa 5101 ggcaagaaaa ttaacaataa tggcatgtga tagcaaaatt gtatttcact tattcctgtg 5161 aatatttctt gttggtacca atggtactgt acaaagtgaa tgttatagcc acaacattct 5221 cttgaaaaga acactgtcaa gaagtgggaa attgctgtca ggcatttcgt tgttgttttt 5281 aaacttttta aaaaagaaat actggttttg caagatagag atcatgaggt aaataatttt 5341 aataagctct tatactaaaa agccttaaat cgatttactg agattcaaaa catactatta 5401 taatcaatta tatcccatat atgtaggcaa actcatttaa aaaataaaat taattttggt 5461 aaaagtacat agtgtttgtt tttaaaatac ataattttaa aataaatcgc ttgtcatgat 5521 aaagtccaaa aagaagttat ctttcaatat tcaactaagt ttggagctaa gaatttacta 5581 atacaaaaaa aagttaaaat gttttggacc atatatatct tgacagtgta acttttaagt 5641 aggctcattt ccatttgcac agaaagtttc tgtctttagg aaactgaaaa tgaaatactg 5701 tggatgttat gactgtttgt cttctatgta aataggaaat taataagctg cctattgagt 5761 ggtatagctg tatgcttacc caaaaaaggg aacactgtgg ttatgacttg tattataaac 5821 tttctgtagt taataaagtt gttattttta taaccatgat tatatattat tattaataaa 5881 atattttatc gaaatgct // LOCUS HSHGLHOMO 3480 bp RNA PRI 30-MAY-1995 DEFINITION H.sapiens mRNA for human giant larvae homolog. ACCESSION X87342 NID g854123 KEYWORDS HGL; Human giant larvae homolog. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3480) AUTHORS Wiemann,S., Tommerup,N., Celis,J.E., Ansorge,W. and Leffers,H. TITLE A human homolog of the Drosophila 1(2) giant larvae tumor suppressor maps to 17q24-25 JOURNAL Unpublished REFERENCE 2 (bases 1 to 3480) AUTHORS Leffers,H. TITLE Direct Submission JOURNAL Submitted (22-MAY-1995) H. Leffers, National University Hospital, Dept of Growth & Reproduction, Blegdamsvej 9, DK-2100 Copenhagen, DENMARK FEATURES Location/Qualifiers source 1..3480 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="transformed amnion cells (AMA)" /clone_lib="lambda ZAP II, cDNA" /clone="HGL" /chromosome="17" /map="q24-25" gene 131..3178 /gene="HGL" CDS 131..3178 /gene="HGL" /codon_start=1 /product="Human giant larvae homologue" /db_xref="PID:g854124" /translation="MRRFLRPGHDPVRERLKRDLFQFNKTVEHGFPHQPSALGYSPSL HILAIGTRSGAIKLYGAPGVEFMGLHQENNAVTQIHLLPGQCQLVTLLDDNSLHLWSL KVKGGASELQEDESFTLRGPPGAAPSATQITVVLPHSSCELLYLGTESGNVFVVQLPA FRALEDRTISSDAVLQRLPEEARHRRVFEMVEALQEHPRDPNQILIGYSRGLVVIWDL QGSRVLYHFLSSQQLENIWWQRDGRLLVSCHSDGSYCQWPVSSEAQQPEPLRSLVPYG PFPCKAITRILWLTTRQGLPFTIFQGGMPRASYGDRHCISVIHDGQQTAFDFTSRVIG FTVLTEADPAATFDDPYALVVLAEEELVVIDLQTAGWPPVQLPYLASLHCSAITCSHH VSNIPLKLWERIIAAGSRQNAHFSTMEWPIDGGTSLTPAPPQRDLLLTGHEDGTVRFW DASGVCLRLLYKLSTVRVFLTDTDPNENFSAQGEDEWPPLRKVGSFDPYSDDPRLGIQ KIFLCKYSGYLAVAGTAGQVLVLELNDEAAEQAVEQVEADLLQDQEGYRWKGHERLAA RSGPVRFEPGFQPFVLVQCQPPAVVTSLALHSEWRLVAFGTSHGFGLFDHQQRRQVFV KCTLHPSDQLALEGPLSRVKSLKKSLRQSFRRMRRSRVSSRKRHPAGPPGEAQEGSAK AERPGLQNMELAPVQRKIEARSAEDSFTGFVRTLYFADTYLKDSSRHCPSLWAGTNGG TIYAFSLRVPPAERRMDEPVRAEQAKEIQLMHRAPVVGILVLDGHSVPLPEPLEVAHD LSKSPDMQGSHQLLVVSEEQFKVFTLPKVSAKLKLKLTALEGSRVRRVSVAHFGSRRA EDYGEHHLAVLTNLGDIQVVSLPLLKPQVRYSCIRREDVSGIASCVFTKYGQGFYLIS PSEFERFSLSTKWLVEPRCLVDSAETKNHRPGNGAGPKKAPSRARNSGTQSDGEEKQP GLVMERALLSDERAATGVHIEPPWGAASAMAEQSEWLSVQAAR" polyA_signal 3459..3464 BASE COUNT 623 a 1128 c 1103 g 626 t ORIGIN 1 cgcccagcag cccgtgggca ggcgcggcgg agcgagcggg gccggcggcg ggcgccgagg 61 gacgccgagg cctcgggcgg gggctggccc ggggttccag gtctccagtg ggggctgcag 121 actaagcaaa atgaggcggt tcctgaggcc agggcatgac cctgtgcggg agaggctcaa 181 gcgggacctg ttccagttta acaagacggt ggagcatggc ttcccgcacc agcccagcgc 241 cctcggctac agcccgtccc tgcacatcct ggccatcggc acccgttctg gagccatcaa 301 gctctacgga gccccaggcg tggagttcat ggggctgcac caggagaaca acgctgtgac 361 gcagatccac ctcctgcccg gccagtgcca gctggtcacc ctgctggatg acaacagcct 421 gcacctttgg agcctgaagg tcaagggcgg ggcatcggag ctgcaggagg atgagagctt 481 cacactgcgt ggacccccag gggctgcccc cagtgccaca cagatcaccg tggtcctgcc 541 acattcctcc tgcgagctgc tctacctggg caccgagagt ggcaacgtgt ttgtggtgca 601 gctgccagct tttcgtgcgc tggaggaccg gaccatcagc tcggacgcgg tgctgcagcg 661 gttgccagag gaggcccgcc accggcgtgt gttcgagatg gtggaggcac tgcaggagca 721 ccctcgagac cccaaccaga tcctgatcgg ctacagccga ggcctcgttg tcatctggga 781 cctacagggc agccgcgtgc tctaccactt cctcagcagc cagcaactgg agaacatctg 841 gtggcagcgg gacggccgcc tgctcgtcag ctgtcactct gacggcagct actgccagtg 901 gcccgtgtcc agcgaagccc agcaaccaga gcccctccgc agcctcgtgc cttacggtcc 961 ctttccttgc aaagcgatta ccagaatcct ctggctgacc actaggcagg ggttgccctt 1021 caccatcttc cagggtggca tgccacgggc cagctacggg gaccgccact gcatctcagt 1081 gatccacgat ggccagcaga cggccttcga cttcacctcc cgtgtcatcg gcttcactgt 1141 cctcacagag gcagaccctg cagccacctt tgacgacccc tatgccctgg tggtgctggc 1201 tgaggaggag ctggtggtga ttgacctgca gacagcaggc tggccaccgg tccagctgcc 1261 ctacctggct tctctgcact gttccgccat cacctgctct caccacgtct ccaacatccc 1321 gctgaagctg tgggagcgga tcattgccgc cggcagccgg cagaacgcac acttctccac 1381 catggagtgg ccaattgatg gtggcaccag cctgacccca gccccacccc agagggacct 1441 gctgctcaca gggcacgagg acggcacggt gcggttctgg gatgcctcgg gtgtctgcct 1501 gcggctgctc tacaaactca gcactgtgcg cgtgttcctc accgacacgg accccaacga 1561 gaacttcagt gcccagggcg aggacgagtg gcccccactc cgcaaggtgg gctcctttga 1621 cccctacagt gatgaccccc ggctgggcat ccagaagatc ttcctctgca agtacagcgg 1681 ctacctggct gtggcaggca cggcagggca ggtgctggta ctggaactga atgacgaggc 1741 agcggagcag gctgtggagc aggtggaggc cgacctgctg caggaccaag agggctaccg 1801 ctggaagggg cacgagcgcc tggcagcccg ctcagggccc gtgcgctttg agcctggctt 1861 tcagcccttc gtgttggtgc agtgtcagcc cccggctgtg gtcacctcct tggccctgca 1921 ctctgagtgg cggctcgtgg ccttcggcac cagccatggc tttggcctct ttgaccacca 1981 gcagcggcgg caggtctttg ttaagtgcac actgcacccc agtgaccagc tggccttgga 2041 gggcccactc tcccgcgtca agtccctcaa gaagtccttg cgtcagtcat tccgccggat 2101 gcgtcggagc cgggtgtcca gccggaagcg gcacccggct ggccccccag gagaggcaca 2161 ggaggggagt gccaaggctg agcggccagg cctccagaac atggagctgg cgcctgtgca 2221 gcgcaagatc gaggctcgct cggcagagga ctccttcaca ggcttcgtcc ggaccctgta 2281 ctttgctgac acctacctga aggacagctc ccggcactgc ccctcgctgt gggctggcac 2341 caatgggggc accatctatg ccttctccct gcgtgtgcct cccgccgagc ggagaatgga 2401 tgagcctgtg cgggcagagc aggccaagga gatccagctg atgcaccggg cgccggtggt 2461 gggcatcctg gtgctcgacg gacacagcgt accccttccc gagcccctcg aagtggccca 2521 tgatctgtcg aagagccctg acatgcaggg aagccaccag ctgctcgtcg tatcagagga 2581 gcagttcaag gtgttcacgc tgcccaaggt gagtgccaag ctgaagttga agctgacggc 2641 cctggagggc tcaagagtgc ggcgggtcag cgtggcccac ttcggcagtc gtcgagccga 2701 ggactacggg gagcaccacc tggcagtcct taccaacctg ggcgacatcc aggtggtctc 2761 gctgcccctg ctcaagcccc aggtgcgcta cagctgcatc cgccgggagg acgtcagtgg 2821 catcgcctcc tgcgtcttca ccaaatatgg ccaaggcttc tacctgatct caccctcgga 2881 gtttgagcgc ttctctctct ccaccaagtg gctggtggag ccccggtgtc tggtggattc 2941 agcagaaacc aagaaccacc gccctggtaa cggtgcgggc cccaagaagg ccccgagccg 3001 agccaggaac tcagggactc agagtgatgg cgaggagaag cagcccggcc tggtgatgga 3061 gcgcgctctg ctcagtgatg agagagcggc aactggcgtt cacatcgagc cgccgtgggg 3121 tgcagcctca gcaatggcgg agcagagtga gtggctgagc gtccaggctg cgcgatgagc 3181 acacactact actgatggcc tttcgggggt ccctgcccca accggagagg ccggtgcaca 3241 gggccccgcc aggggctggg ggcatcccgg cttccacaat gcagctgctc tgggcctcgg 3301 gagaggagag accccagtcc cctgggctgc ccttcccggg cctcgtctgt ctgggtcctt 3361 tggtcaatgt tgcacagttt ttattgctcc catccctttt tgtagtgggc tgggttttaa 3421 gttataaatg ttaactgcct ctgggtgaaa aagtttttaa taaacaccta ttacctcttg // LOCUS HSHGM07EG 3459 bp DNA PRI 25-JUN-1997 DEFINITION H.sapiens HGMP07E gene for olfactory receptor. ACCESSION X65857 S59676 NID g425220 KEYWORDS G protein-coupled receptor; HGMP07E gene; olfactory receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3459) AUTHORS Parmentier,M. TITLE Direct Submission JOURNAL Submitted (27-APR-1992) M. Parmentier, Universite Libre de Bruxelles, I.R.I.B.H.N. ULB Campus Erasme, 808 route de Lennik, 1070 Bruxelles, BELGIUM REFERENCE 2 (bases 1 to 3459) AUTHORS Schurmans,S., Muscatelli,F., Miot,F., Mattei,M.G., Vassart,G. and Parmentier,M. TITLE The OLFR1 gene encoding the HGMP07E putative olfactory receptor maps to the 17p13-->p12 region of the human genome and reveals an MspI restriction fragment length polymorphism JOURNAL Cytogenet. Cell Genet. 63 (3), 200-204 (1993) MEDLINE 93251832 FEATURES Location/Qualifiers source 1..3459 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda charon 4a" /chromosome="17" /map="p12-13" gene 883..1821 /gene="HGMP07E" CDS 883..1821 /gene="HGMP07E" /note="putative olfactory receptor" /codon_start=1 /product="G protein-coupled receptor" /db_xref="PID:g425221" /db_xref="SWISS-PROT:P34982" /translation="MDGGNQSEGSEFLLLGMSESPEQQQILFWMFLSMYLVTVVGNVL IILAISSDSRLHTPVYFFLANLSFTDLFFVTNTIPKMLVNLQSHNKAISYAGCLTQLY FLVSLVALDNLILAVMAYDRYVAICCPLHYTTAMSPKLCILLLSLCWVLSVLYGLIHT LLMTRVTFCGSRKIHYIFCEMYVLLRMACSNIQINHTVLIATGCFIFLIPFGFVIISY VLIIRAILRIPSVSKKYKAFSTCASHLGAVSLFYGTLCMVYLKPLHTYSVKDSVATVM YAVVTPMMNPFIYSLRNKDMHGALGRLLDKHFKRLT" BASE COUNT 1074 a 768 c 633 g 984 t ORIGIN 1 gaattcatgc ttggtgacac ttctcttcat attagaaacc ttgatatgtt tatcaattta 61 tttaattttt actttacccc tcactggaga atttgaaggt ttcgcctgat tcttgttaac 121 atctgagtgc catgatttat ttgttgacag tggggatgag atagtttttc attataacat 181 ttttcctgat tctcgttaaa tataacaata ataggcaaca tttctgagtg ctaattgcct 241 accaactcct tgtgttaagc actttactac ttaattttcc cccaccctat aactcatgat 301 tcgcatgaag aaactgagtc ttaaggaact ctggaaccct acctgaggtt tcacagcaag 361 taagtgcagt cgtactatct tactagcagg gaaagacttg atttcaagta ttctaaactc 421 tatttctagt agtctttcag tggatagtcc cacttttttc agatagatca tcatatcaac 481 aaataataat tggtcctact cctttcaaat acttttattt tcttcctttt ttgaactact 541 tatactgcat gtatctatat taattatttt aacacatata catatcataa cacactaatc 601 accaaacact tcaagaatag tgtaagatga caggattgaa aataagaatt acacattatt 661 cctttaacat tgagtttccc agctttgaag tagctgaaat aattatatcg cataaaaact 721 ttgttatatt tttcactttc ttattttcaa aaattataaa attgggtgta agacattctt 781 aattctaaga aaatgttgat tttgcttatc ttcatgtttt tattcaatta aggacttttg 841 gtaaacattt gctggtgtta atgttaaaag agagttgggg aaatggatgg aggcaaccag 901 agtgaaggtt cagagttcct tctcctgggg atgtcagaga gtcctgagca gcagcagatc 961 ctgttttgga tgttcctgtc catgtacctg gtcacggtgg tgggaaatgt gctcatcatc 1021 ctggccatca gctctgattc ccgcctgcac acccccgtgt acttcttcct ggccaacctc 1081 tccttcactg acctcttctt tgtcaccaac acaatcccca agatgctggt gaacctccag 1141 tcccataaca aagccatctc ctatgcaggg tgtctgacgc agctctactt cctggtctcc 1201 ttggtggccc tggacaacct catcctggct gtgatggcat atgaccgcta tgtggccatc 1261 tgctgccccc tccactacac cacagccatg agccctaagc tctgtatctt actcctttcc 1321 ttgtgttggg tcctatccgt cctctatggc ctcatacaca ccctcctcat gaccagagtg 1381 accttctgtg ggtcacgaaa aatccactac atcttctgtg agatgtatgt attgctgagg 1441 atggcatgtt ccaacattca gattaatcac acagtgctga ttgccacagg ctgcttcatc 1501 ttcctcattc cctttggatt cgtgatcatt tcctatgtgc tgattatcag agccatcctc 1561 agaataccct cagtctctaa gaaatacaaa gccttctcca cctgtgcctc ccatttgggt 1621 gcagtctccc tcttctatgg gacactttgt atggtatacc taaagcccct ccatacctac 1681 tctgtgaagg actcagtagc cacagtgatg tatgctgtgg tgacacccat gatgaatccc 1741 ttcatctaca gcctgaggaa caaggacatg catggggctc tgggaagact cctagataaa 1801 cactttaaga ggctgacatg agggcaattt ggaaagacag cattaaagtg gagactagga 1861 atatccttca ccctatgtaa gggattgtcc tgtgtgttat acagcagtga ttgggacatg 1921 gctccagctc agagacagca tatagatatg tggtgataaa aaagacatat ttgtaacctg 1981 gtgtccccca ggtctcatca gccttggccg taaataaggt cacactaaca ccaacactag 2041 aatgttgcag ggtcaaattc ttcaatgtac ttgactacag ggccacattc ttggccttat 2101 ctgactatat ccagtttaaa cctagaagtg tctctcatct agcacacatc caaagtacag 2161 aaagtaaata gtagctgata agaaagttag tcacatggct gtggaggttt gaaagagatt 2221 gcaatcatac atatttgtat cagctgatcc agcacgtgat atagacctcg acaggtggtg 2281 ttcaattcat ttgacattca tgcagtcatt catcaactca ttctattcat aatgacagtg 2341 tacagaggcc tcaaactggg ttacaaatgt gaggtcacag tctactcggg gaagtacata 2401 aatttacatt aaacataaat ggacctaact catcaattaa aaagtaatgt tcaacatact 2461 gattaaaaaa taaaatatag atagatgtat ttaaagagac acgatgaaag cacaggaata 2521 tataaagctt gaaattaaaa agagaaacag acatatttgg aaaatactaa atattttaag 2581 tgaaatattg ctgcaatgac atcaggtaaa agatatttca aaacgggtga aaggagtgat 2641 gtcaacaaga tggtggaatc gacagttgta tctctcatcc accaacatac agactaattt 2701 atcaaccatc cacagatgaa aatacctttg tgagagctcc agaatccaag tgaaagttta 2761 tagcaccctg gtagtacaca gaagtagaaa aaacaccata ttgaacattg tagaaaaaac 2821 tctgtcacat tacctgaatc acccctcacc caagccagca cagagtagca caaagagaga 2881 tcccctcatc tcatgagttc ttccataagt aaaaaagaaa ataaaacata tgtacaactt 2941 cccctgactt tcaggatgct aaccaagagg accacttctg tcatgcctca ccaagaatac 3001 tgaggcaatt cacatggcca gacccctctg agtagctaag aacaagaaaa aaaaaaaaat 3061 aagaaaatgg ttgggagctc ttaatagtca gtatgtgtat tttaacaact ggggccttgc 3121 accctacagt gggcctgtgc atggtgccca gaagctggcc catccaccca catccccaag 3181 cactaggcct gcctgcccat agaccctgcc aactggccag cccagaatat ctggctaagc 3241 tgactggtga aaaacagttc ctatcaaatt ggactttccc tatcaaaacc agcctgtaaa 3301 gactaaaaga gatgactgct tcttcaaatg tgcaaacaca aatgtgaagc tacaggaaac 3361 acaaggaatc aaggaaactt agcaaagcca aaggaacaaa ataaagcttc agaaaccatc 3421 aaagaaatgg agaaatatga actgcctgac aaagaattc // LOCUS HSHGMCSF 1807 bp RNA PRI 22-MAR-1995 DEFINITION Human mRNA for granulocyte-macrophage colony-stimulating factor receptor (hGM-CSF-R). ACCESSION X17648 NID g32087 KEYWORDS differation factor receptor; granulocyte-macrophage colony stimulating factor receptor; growth factor receptor; transmembrane glycoprotein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1807) AUTHORS Gearing,D.P., King,J.A., Gough,N.M. and Nicola,N.A. TITLE Expression cloning of a receptor for human granulocyte-macrophage colony-stimulating factor JOURNAL EMBO J. 8 (12), 3667-3676 (1989) MEDLINE 90059966 COMMENT Data kindly reviewed (20-FEB-1990) by Gearing D.P. FEATURES Location/Qualifiers source 1..1807 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="pi H3M" /clone="pGMR138 and pGMR29" CDS 25..93 /note="unidentified ORF (22 AA)" /codon_start=1 /db_xref="PID:g32088" /translation="MPWGLQQENPWRQQIREAAMFA" sig_peptide 150..215 /note="signal peptide (AA -22 to -1)" CDS 150..1352 /note="precursor protein (AA -22 to 378)" /codon_start=1 /db_xref="PID:g32089" /db_xref="SWISS-PROT:P15509" /translation="MLLLVTSLLLCELPHPAFLLIPEKSDLRTVAPASSLNVRFDSRT MNLSWDCQENTTFSKCFLTDKKNRVVEPRLSNNECSCTFREICLHEGVTFEVHVNTSQ RGFQQKLLYPNSGREGTAAQNFSCFIYNADLMNCTWARGPTAPRDVQYFLYIRNSKRR REIRCPYYIQDSGTHVGCHLDNLSGLTSRNYFLVNGTSREIGIQFFDSLLDTKKIERF NPPSNVTVRCNTTHCLVRWKQPRTYQKLSYLDFQYQLDVHRKNTQPGTENLLINVSGD LENRYNFPSSEPRAKHSVKIRAADVRILNWSSWSEAIEFGSDDGNLGSVYIYVLLIVG TLVCGIVLGFLFKRFLRIQRLFPPVPQIKDKLNDNHEVEDEIIWEEFTPEEGKGYREE VLTVKEIT" mat_peptide 216..1349 /note="mature granulocyte-macrophage colony-stimulating factor receptor (AA 1-378)" misc_feature 1776..1781 /note="pot. polyadenylation signal" misc_feature 1783..1788 /note="pot. polyadenylation signal" polyA_site 1807 /note="polyadenylation site" BASE COUNT 499 a 437 c 453 g 418 t ORIGIN 1 agcaggtgga aggagaggaa gcggatgccg tggggtttac agcaggaaaa tccgtggaga 61 cagcagatcc gagaagcggc gatgtttgcg tagaaccctg tacgtgcttc cttcggcctg 121 tcgctcttcc cttctctctg accagcacca tgcttctcct ggtgacaagc cttctgctct 181 gtgagttacc acacccagca ttcctcctga tcccagagaa atcggatctg cgaacagtgg 241 caccagcctc tagtctcaat gtgaggtttg actccaggac gatgaattta agctgggact 301 gccaagaaaa cacaaccttc agcaagtgtt tcttaactga caagaagaac agagtcgtgg 361 aacccaggct cagtaacaac gaatgttcgt gcacatttcg tgaaatttgt ctgcatgaag 421 gagtcacatt tgaggttcac gtgaatacta gtcaaagagg atttcaacag aaactgcttt 481 atccaaattc aggaagggag ggtaccgctg ctcagaattt ctcctgtttc atctacaatg 541 cggatttaat gaactgtacc tgggcgaggg gtccgacggc cccccgtgac gtccagtatt 601 ttttgtacat acgaaactca aagagaagga gggagatccg gtgtccttat tacatacaag 661 actcaggaac ccatgtggga tgtcacctgg ataacctgtc aggattaacg tctcgcaatt 721 actttctggt taacggaacc agccgagaaa ttggcatcca attctttgat tcacttttgg 781 acacaaagaa aatagaacga ttcaaccctc ccagcaatgt caccgtacgt tgcaacacga 841 cgcactgcct cgtacggtgg aaacagccca ggacctatca gaagctgtcg tacctggact 901 ttcagtacca gctggacgtc cacagaaaga atacccagcc tggcacggaa aacctactga 961 ttaatgtttc tggtgatttg gaaaatagat acaactttcc aagctctgag cccagagcaa 1021 aacacagtgt gaagatcaga gctgcagacg tccgcatctt gaattggagc tcctggagtg 1081 aagccattga atttggttct gacgacggga acctcggctc tgtgtacatt tatgtgctcc 1141 taatcgtggg aacccttgtc tgtggcatcg tcctcggctt cctctttaaa aggttcctta 1201 ggatacagcg gctgttcccg ccagttccac agatcaaaga caaactgaat gataaccatg 1261 aggtggaaga cgagatcatc tgggaggaat tcaccccaga ggaagggaaa ggctaccgcg 1321 aagaggtctt gaccgtgaag gaaattacct gagacccaga gggtgtagga atggcatgga 1381 catctccgcc tccgcgacac gggggaactg ttttcttgat gatgctgtga acctttatat 1441 cattttctat gtttttattt aaaaacatga catttggggc caggcgcggt ggctcacgcc 1501 tgtaatccca gcactttggg aggccaaggc aggcggatca cctgaggtca ggagttcaag 1561 accagcctgc ccaacatggt gaaaccccat ctggactaaa aatgcagaaa tttacccagg 1621 cacggcggcg gacgcccatc atcccagcta cttgggaggc tgaggcagga gaattgcttg 1681 aacccgtgag gcggaggttg tagtgagcca agatcgcacc attgcacacc aacctgcgtg 1741 acagagcaag attgcatctc aaaacaaaca ataataataa ataataaaaa cctgatattt 1801 ggctggg // LOCUS HSHHB5 2454 bp RNA PRI 18-MAR-1997 DEFINITION H.sapiens mRNA for hair keratin, hHb5. ACCESSION X99140 NID g1903217 KEYWORDS hHb5 gene; keratin; type II intermediate filament. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2454) AUTHORS Rogers,M.A., Langbein,L., Praetzel,S., Moll,I., Krieg,T., Winter,H. and Schweizer,J. TITLE Sequences and differential expression of three novel human type-II hair keratins JOURNAL Differentiation 61 (3), 187-194 (1997) MEDLINE 97237682 REFERENCE 2 (bases 1 to 2454) AUTHORS Schweizer,J. TITLE Direct Submission JOURNAL Submitted (06-JUL-1996) J. Schweizer, German Cancer Research Center, Institute of Experimental Pathology, Im Neuenheimer Feld 280, D-69120 Heidelberg, Germany, FRG FEATURES Location/Qualifiers source 1..2454 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="human scalp cDNA in Lambda Zap-II" /clone="phKII-5" /map="17q12-21" gene 67..1590 /gene="hHb5" CDS 67..1590 /gene="hHb5" /codon_start=1 /product="type II intermediate filament of hair keratin" /db_xref="PID:e255345" /db_xref="PID:g1903218" /translation="MSCRSYRISSGCGVTRNFSSCSAVAPKTGNRCCISAAPYRGVSC YRGLTGFGSRSLCNLGSCGPRIAVGGFRAGSCGRSFGYRSGGVCGPSPPCITTVSVNE SLLTPLNLEIDPNAQCVKQEEKEQIKSLNSRFAAFIDKVRFLEQQNKLLETKWQFYQN QRCCESNLEPLFSGYIETLRREAECVEADSGRLASELNHVQEVLEGYKKKYEEEVALR ATAENEFVVLKKDVDCAYLRKSDLEANVEALVEESSFLRRLYEEEIRVLQAHISDTSV IVKMDNSRDLNMDCIIAEIKAQYDDVASRSRAEAESWYRSKCEEMKATVIRHGETLRR TKEEINELNRMIQRLTAEIENAKCQRAKLEAAVAEAEQQGEAALSDARCKLAELEGAL QKAKQDMACLLKEYQEVMNSKLGLDIEIATYRRLLEGEEHRLCEGVGSVNVCVSSSRG GVSCGGLSYSTTPGRQITSGPSAIGGSITVVAPDSCAPCQPRSSSFSCGSSRSVRFA" polyA_signal 2416..2421 BASE COUNT 513 a 740 c 716 g 485 t ORIGIN 1 tgagcctcgc actctgccgc ccgcaccacc ttccgctgcc tctcagactc tgctcagcct 61 cacacgatgt cgtgccgctc ctacaggatc agctcaggat gcggggtcac caggaacttc 121 agctcctgct cagctgtggc ccccaaaact ggcaaccgct gctgcatcag cgccgccccc 181 taccgagggg tgtcctgcta ccgagggctg acgggcttcg gcagccgcag cctctgcaac 241 ctgggctcct gcgggccccg gatagctgta ggtggcttcc gagccggctc ctgcggacgc 301 agcttcggct accgctccgg gggcgtgtgc ggacccagcc ccccatgcat cactaccgtg 361 tcggtcaacg agagcctcct cacgcccctc aacctggaga tcgaccccaa cgcacagtgc 421 gtgaagcagg aggagaagga gcagatcaag tccctcaaca gcaggttcgc ggccttcatc 481 gacaaggtgc gcttcctgga gcagcagaac aagctgctgg agaccaagtg gcagttctac 541 cagaaccagc gctgctgcga gagcaacctg gagccactgt tcagtggcta catcgagact 601 ctgcggcggg aggccgagtg cgtggaggcc gacagcggga ggctggcctc agagctcaac 661 catgtgcagg aggtgctgga gggctacaag aagaagtatg aagaggaggt ggccctgaga 721 gccacagcag agaatgagtt tgtcgttcta aagaaggacg tggactgtgc ctacctgcgg 781 aaatcagacc tggaggccaa tgtggaggcc ctggtggagg agtctagctt cctgaggcgc 841 ctctatgaag aggagatccg cgttctccaa gcccacatct cagacacctc ggtcatagtc 901 aagatggaca acagccgaga cctgaacatg gactgcatca tcgctgagat caaggctcag 961 tatgacgatg ttgccagccg cagccgggcc gaggctgagt cctggtaccg tagcaagtgt 1021 gaggagatga aggccacggt gatcaggcat ggggagaccc tgcgccgcac caaggaggag 1081 atcaacgagc tgaaccgcat gatccagagg ctgacggccg agattgagaa tgccaagtgc 1141 cagcgtgcca agctggaggc tgctgtggct gaggcagagc agcagggtga ggcggccctc 1201 agcgatgccc gctgcaagct ggctgagctg gagggcgccc tgcagaaggc caagcaggac 1261 atggcctgcc tgctcaagga gtaccaggag gtgatgaact ccaagctggg cctggacatc 1321 gagatcgcca cctacaggcg cctgctggag ggcgaggaac acaggctgtg tgaaggtgtg 1381 ggctctgtga atgtctgtgt cagcagctcc cgtggtggag tctcctgtgg gggcctctcc 1441 tacagcacca ccccagggcg ccagatcact tctggcccct cagccatagg cggcagcatc 1501 acggtggtgg cccctgactc ctgtgccccc tgccagcctc gttcctccag cttcagctgc 1561 gggagtagcc ggtcggtccg ctttgcctag tagagtcatg gagccagggc ttcctgccaa 1621 gcacctgcct gcctgcatca ctgcactgaa tggcatgtga atggaaaatg tgtgcttgct 1681 tccagaatct tctggatgtt cctacagagg ggaagaccta cagagggaaa gaccctcggg 1741 ccgctcccct gcgccttttc atgctaggga gatgcatcct agttgtcctc ctggcagctg 1801 ttttcagagg cattcccagc ccttcactta actcctactt agctccaaaa tacctgtatc 1861 caatttgtat tattccccca gctctcaggg acaagaccag tcccccagcg tggtggtcag 1921 cacggaagct ccaccttctg ggtggaggcg ccatcctaac catccagcca ggccacccac 1981 aacccgagaa tcagggagaa agtccctccc cagcagcccc ctcctcctgg ctgggaagaa 2041 tggtccccca gcaagcactt gcctgttcat tcccgttcat gttttgcttc tctctcagac 2101 tgccttcctg cttctgggct aacctgttcc agccaggctc ctcatgtgac ctcgcagttg 2161 agaagcccat tatcgtgggg catccttttg cctacagccc ctggttaggg cactttggac 2221 aggtcttgct attcagtgaa cctttgtaca tttcaaagaa gactccatgg ctgctccaga 2281 tgcccccttg ctgggtgcag gtggggactg tccaatgcag agctggcggg acagagagtt 2341 aagccacttc ctgggtctcc ttcttatgac tgtctatggg tgcattgcct tctgggttgt 2401 ctcgatctgt gtttcaataa atgccgctgc aatgcaaaaa aaaaaaaaaa aaaa // LOCUS HSHIN1 973 bp RNA PRI 12-SEP-1993 DEFINITION H.sapiens mRNA for Hin-1. ACCESSION X68242 NID g32098 KEYWORDS gene activation; promoter insertion. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 973) AUTHORS Senn,H. TITLE Direct Submission JOURNAL Submitted (07-SEP-1992) H. Senn, Inst f Medizinische Mikrobiologie, Universitaet Basel, Petersplatz 10, CH-4003 Basel, SWITZERLAND REFERENCE 2 (bases 1 to 973) AUTHORS Raineri,I. and Senn,H.P. TITLE HIV-1 promotor insertion revealed by selective detection of chimeric provirus-host gene transcripts JOURNAL Nucleic Acids Res. 20 (23), 6261-6266 (1992) MEDLINE 93117099 FEATURES Location/Qualifiers source 1..973 /organism="Homo sapiens" /db_xref="taxon:9606" misc_feature 1..21 /note="5' primer sequence" source 1..973 /organism="Human immunodeficiency virus type 1" /proviral /db_xref="taxon:11676" gene 128..565 /gene="Hin-1" CDS 128..565 /gene="Hin-1" /codon_start=1 /db_xref="PID:g32099" /db_xref="SWISS-PROT:Q01804" /translation="MACIHYLRENREKFEAFIEGSFEEYLKRLENPQEWVGQVEISAL SLMYRKDFIIYREPNVSPSQVTENNFPEKVLLCFSNGNHYDIVYPIKYKESSAMCQSL LYELLYEKVFKTDVSKIVMELDTLEVADEDNSEISDSEDDSCK" polyA_signal 900..905 misc_feature 935..937 /note="3' primer sequence" BASE COUNT 325 a 143 c 182 g 323 t ORIGIN 1 tgtgactctg gtaactagag atccctcaga cccttttagt cagtgtggaa aatctctagc 61 agtttacact aagaactctt tgttacatca ggtattgcac tctcagtctc gccatgttga 121 agtcagaatg gcctgtattc actatcttcg agagaacaga gagaaatttg aagcgtttat 181 agaaggatca tttgaagaat atttaaagcg tttggaaaat ccacaggaat gggtaggaca 241 agtggaaata agtgcccttt ctcttatgta caggaaagat tttataattt atcgggaacc 301 aaatgtttct ccttcacaag taacagaaaa taattttcct gaaaaggtgt tactgtgttt 361 ttcaaatgga aatcattatg atattgtgta tcccataaag tataaagaaa gctctgctat 421 gtgtcagtct ctcctttatg aattgctgta tgagaaggta tttaaaactg atgttagtaa 481 aattgtgatg gaactagaca cgttggaagt agctgatgaa gataacagtg aaatatcaga 541 ttcagaggat gacagttgca agtaagaatg aaatcctaaa atacctttct tagtgccata 601 caaggaaagt taagtaaatg tctttacatt tcagtaaatg tctttctata acatatattg 661 aggtattaat ggtattcata aaatacagca gtctgctgaa gtttctttat ccctgtcact 721 aattttacct aatttctatt atgggactgt ttgctttgaa agttggtgtt tttggttgat 781 gagaatttac gtctgcaatc tagatgcata tttgtagaat aaatttggtc ctacctatat 841 gtgtgtgtat tgtaaaattt taaagttaac ttgtcaattg ttcacatacc cagtaatgaa 901 ataaaatggc cgtttggatt tccttcaaaa aaaaaaaaaa aaaaaaaaaa aaaaactgtc 961 tctgtcacct gtc // LOCUS HSHINGE 515 bp RNA PRI 06-APR-1995 DEFINITION Human mRNA for mitochondrial hinge protein. ACCESSION Y00764 NID g32100 KEYWORDS hinge protein; ubiquinol-cytochrome c reductase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 515) AUTHORS Ohta,S. TITLE Direct Submission JOURNAL Submitted (13-JAN-1988) Ohta S., Jichi Medical School, Dept. of Biochemistry, Minamikawachi-machi, Kawachi-gun, Tochigi-ken, Japan 329-04 REFERENCE 2 (bases 1 to 515) AUTHORS Ohta,S., Goto,K., Arai,H. and Kagawa,Y. TITLE An Extremely Acidic Amino-Terminal Presequence of the Precursor for the Human Mitochondrial Hinge Protein JOURNAL Unpublished FEATURES Location/Qualifiers source 1..515 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 37..75 /note="signal peptide" CDS 37..312 /note="hinge protein precursor (AA -13 to 78)" /codon_start=1 /db_xref="PID:g32101" /db_xref="SWISS-PROT:P07919" /translation="MGLEDEQKMLTESGDPEEEEEEEEELVDPLTTVREQCEQLEKCV KARERLELCDERDSSRSHTEEDCTEELFDFLHARDHCVAHKLFNNLK" mat_peptide 76..309 /note="hinge protein (AA 1- 78)" BASE COUNT 148 a 97 c 137 g 133 t ORIGIN 1 ggggggctcg tgttgaatct agaaccgtag ccagacatgg gactggagga cgagcaaaag 61 atgcttaccg aatccggaga tcctgaggag gaggaagagg aagaggagga attagtggat 121 cccctaacaa cagtgagaga gcaatgcgag cagttggaga aatgtgtaaa ggcccgggag 181 cggctagagc tctgtgatga gcgtgattcc tctcgatcac atacagaaga ggattgcacg 241 gaggagctct ttgacttctt gcatgcgagg gaccattgcg tggcccacaa actctttaac 301 aacttgaaat aaatgtgtgg acttaagttg caccccagtc ttcatcatct gggcatcaga 361 atatttcctt atggttttgg atgtaccatt tgtttcttat ttgtgtaact gtaagttcac 421 atcaacctca tgggtttggc ttgaggctgg tagcttctat gtaattcgca atgattccat 481 ctaaataaaa gttctatgat ctgcaaaaaa aaaaa // LOCUS HSHIP3K 1782 bp RNA PRI 11-JAN-1991 DEFINITION Human mRNA for inositol 1,4,5-triphosphate 3-kinase. ACCESSION X54938 NID g32104 KEYWORDS inositol; inositol 1,4,5-triphosphate 3-kinase; kinase; triphosphate. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1782) AUTHORS Takazawa,K. and Erneux,C. TITLE Direct Submission JOURNAL Submitted (25-OCT-1990) Takazawa K., Erneux C., Institute of Interdisciplinary Research, School of Medicine, Free University of Brussels, Route de Lennik 808, B-1070 Brussels, Belgium REFERENCE 2 (bases 1 to 1782) AUTHORS Takazawa,K., Perret,J., Dumont,J.E. and Erneux,C. TITLE Human brain inositol 1,4,5-trisphosphate 3-kinase cDNA sequence JOURNAL Nucleic Acids Res. 18 (23), 7141 (1990) MEDLINE 91088302 COMMENT Data kindly reviewed (01-JAN-1991) by Takazawa K. FEATURES Location/Qualifiers source 1..1782 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="2 years old" /tissue_type="hippocampus" /clone_lib="lambda ZapII" /clone="HH39" mat_peptide 12..1394 /product="inositol 1,4,5-triphosphate 3-kinase" CDS 12..1397 /codon_start=1 /product="inositol 1,4,5-triphosphate 3-kinase" /db_xref="PID:g32105" /db_xref="SWISS-PROT:P23677" /translation="MTLPGGPTGMARPGGARPCSPGLERAPRRSVGELRLLFEARCAA VAAAAAAGEPRARGAKRRGGQVPNGLPRAPPAPVIPQLTVTAEEPDVPPTSPGPPERE RDCLPAAGSSHLQQPRRLSTSSVSSTGSSSLLEDSEDDLLSDSESRSRGNVQLEAGED VGQKNHWQKIRTMVNLPVISPFKKRYAWVQLAGHTGSFKAAGTSGLILKRCSEPERYC LARLMADALRGCVPAFHGVVERDGESYLQLQDLLDGFDGPCVLDCKMGVRTYLEEELT KARERPKLRKDMYKKMLAVDPEAPTEEEHAQRAVTKPRYMQWREGISSSTTLGFRIEG IKKADGSCSTDFKTTRSREQVLRVFEEFVQGDEEVLRRYLNRLQQIRDTLEVSEFFRR HEVIGSSLLFVHDHCHRAGVWLIDFGKTTPLPDGQILDHRRPWEEGNREDGYLLGLDN LIGILASLAER" BASE COUNT 324 a 575 c 605 g 278 t ORIGIN 1 gaattccgga aatgaccctg cccgggggcc caacgggcat ggcgcggccg gggggcgcga 61 ggccctgcag cccggggctg gagcgggccc cgcgccggag tgtcggggag ctgcgcctgc 121 tcttcgaggc gcgctgtgcg gcggtcgctg cggccgccgc cgcgggggag ccccgggccc 181 gcggggccaa gcggcgtggg ggacaggtcc ccaacgggct tccgcgggct cccccggccc 241 cggtgatccc tcagctgacc gtgacagccg aggagcccga cgtgcccccg accagccctg 301 ggccgccgga gcgggagagg gactgcctcc cggcagcggg ctcttcgcac ctgcagcagc 361 cgcgccgcct ttccacctcg tcggtctcct ccactggctc ctcgtcgctg ctcgaggact 421 cggaggacga cctgctgagc gacagtgaga gccggagccg cggcaacgtg cagctggaag 481 cgggcgagga cgtgggtcag aaaaaccact ggcagaagat ccggaccatg gtcaatctgc 541 cggtcataag ccctttcaag aagcgctacg cctgggtgca gctggcaggg cacactggga 601 gttttaaggc ggcgggcacc agcgggctga tcctgaagcg ctgctcggag ccggagcgct 661 actgcctggc gcggctgatg gctgacgcgc tgcgcggctg cgtgcctgcc ttccacggcg 721 tggtggagcg cgacggcgaa agctacctgc agctgcagga cctgctcgat ggcttcgacg 781 gaccttgtgt gctcgactgc aaaatgggcg tcaggactta cctagaggag gagctgacca 841 aggcccgtga gcggcccaag ctgcggaagg acatgtacaa gaaaatgctg gcggtggatc 901 ctgaagctcc cacggaggag gagcacgcgc agcgcgccgt caccaagccg cgctacatgc 961 agtggcggga aggcatcagc tccagcacca ccctcggctt ccgcatcgag ggcatcaaga 1021 aagcggacgg ctcctgcagc accgacttca agactacgcg aagccgagag caggtgcttc 1081 gcgtctttga agagtttgtg caaggagatg aggaagtgct gaggcggtat ctgaaccgcc 1141 tgcagcagat ccgggacacc ctggaggtat ccgagttctt caggaggcac gaggtgatcg 1201 gcagctcgct cctctttgtg cacgatcact gccatcgcgc cggcgtgtgg ctcatcgact 1261 tcggcaagac cacgcccctc cccgatggcc agatcctgga ccaccggcgg ccctgggagg 1321 agggcaaccg cgaggacggc tatttgctgg ggctggacaa tctcattggc atcctggcca 1381 gcctggctga gagatgaggc tggactcctg tccccgcggg ccgctcacct gacatgtgga 1441 cctgcagctt tgtccccact gtgcatgccg gcttgagact ggagccccgc ggtgcagggc 1501 agttcaccgg gtcctgcagg accaggtgcc agccactaag ggggggcacc gccgatgcca 1561 ggggttttgc ccacccgggc cccagcgttc ccagagccaa atgacactaa cttatagaag 1621 gggagggggc aaagggcttc ttcctcaggc cagctcttct gaggaggctc tgccctctcc 1681 agaggtgcca gaccgcggat tttatttagc aagcccagac cttccggtct aacgtctcac 1741 accacgacgg actccccttc ctaataaaac tcaaagacaa aa // LOCUS HSHIRA 3420 bp RNA PRI 20-OCT-1995 DEFINITION H.sapiens HIRA mRNA. ACCESSION X77633 NID g840773 KEYWORDS HIRAHs gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3420) AUTHORS Lamour,V., Lecluse,Y., Desmaze,C., Spector,M., Bodescot,M., Aurias,A., Osley,M.A. and Lipinski,M. TITLE A human homolog of the S. cerevisiae HIR1 and HIR2 transcriptional repressors cloned from the DiGeorge syndrome critical region JOURNAL Hum. Mol. Genet. 4 (5), 791-799 (1995) MEDLINE 95359996 REFERENCE 2 (bases 1 to 3420) AUTHORS Lipinski,M. TITLE Direct Submission JOURNAL Submitted (07-FEB-1994) M. Lipinski, Laboratoire de Biologie des Tumeurs Humaines CNRS URA 1156, Institut Gustave Roussy, 94805 Villejuif Cedex, FRANCE FEATURES Location/Qualifiers source 1..3420 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /tissue_type="fetal brain, colon carcinoma cell-line" /cell_line="SW613-S" /clone="CF18, SW30, cDAC30.25" /chromosome="22q11.2" /clone_lib="fetal brain, colon carcinoma" gene 109..3030 /gene="HIRAHs" CDS 109..3030 /gene="HIRAHs" /codon_start=1 /db_xref="PID:g840774" /translation="MSPVLQEDDEKDENIPKMLCQMDNHLACVNCVRWSNSGMYLASG GDDKLIMVWKRATYIGPSTVFGSSGKLANVEQWRCVSILRNHSGDVMDVAWSPHDAWL ASCSVDNTVVIWNAVKFPEILATLRGHSGLVKGLTWDPVGKYIASQADDRSLKVWRTL DWQLETSITKPFDECGGTTHVLRLSWSPDGHYLVSAHAMNNSGPTAQIIEREGWKTNM DFVGHRKAVTVVKFNPKIFKKKQKNGSSAKPSCPYCCCAVGSKDRSLSVWLTCLKRPL VVIHELFDKSIMDISWTLNGLGILVCSMDGSVAFLDFSQDELGDPLSEEEKSRIHQST YGKSLAIMTEAQLSTAVIENPEMLKYQRRQQQQQLDQKSAATREMGSATSVAGVVNGE SLEDIRKNLLKKQVETRTADGRRRITPLCIAQLDTGDFSTAFFNSIPLSGSLAGTMLS SHSSPQLLPLDSSTPNSFGASKPCTEPVVAASARPAGDSVNKDSMNATSTPAALSPSV LTTPSKIEPMKAFDSRFTERSKATPGAPALTSMTPTAVERLKEQNLVKELRPRDLLES SSDSDEKVPLAKASSLSKRKLELEVETVEKKKKGRPRKDSRLMPVSLSVQSPAALTAE KEAMCLSAPALALKLPIPSPQRAFTLQVSSDPSMYIEVENEVTVVGGVKLSRLKCNRE GKEWETVLTSRILTAAGSCDVVCVACEKRMLSVFSTCGRRLLSPILLPSPISTLHCTG SYVMALTAAATLSVWDVHRQVVVVKEESLHSILAGSDMTVSQILLTQHGIPVMNLSDG KAYCFNPSLSTWNLVSDKQDSLAQCADFRSSLPSQDAMLCSGPLAINQGRTSNSGRQA ARLFSVPHVVQQETTLAYLENQVAAALTLQSSHEYRHWLLVYARYLVNEGFEYRLREI CKDLLGPVHYSTGSQWESTVVGLRKRELLKELLPVIGQNLRFQRLFTECQEQLDILRD K" BASE COUNT 781 a 954 c 964 g 721 t ORIGIN 1 ggcaaccaca atggcaagcc gattttttca gttgatattc accctgacgg gaccaagttc 61 gcaactggag gacaagggca ggattctggg aaggttgtga tctggaatat gtctccagtc 121 ctccaggagg atgacgagaa ggatgaaaat attcccaaga tgctttgcca gatggacaat 181 cacttagcat gtgtgaactg tgtgcggtgg tcaaacagtg ggatgtattt agcttctggg 241 ggagatgaca aactgattat ggtgtggaag cgggctacgt acatcggccc cagcaccgtg 301 ttcggctcca gtggtaagct tgccaatgtg gagcagtggc ggtgtgtctc tatcctccgg 361 aatcattcag gcgatgtgat ggatgtagca tggtctcccc acgatgcctg gctagcctca 421 tgcagcgtgg ataacactgt cgtcatctgg aatgctgtaa agttcccaga aattctagct 481 actctgagag gtcattctgg cttggtcaaa gggttgacat gggaccctgt tggtaaatac 541 atagcttctc aagctgatga ccgcagccta aaggtgtgga ggacgctgga ctggcagttg 601 gagaccagca tcaccaagcc ttttgatgag tgtggaggaa cgacccatgt gttgcggctc 661 agctggtcac ctgatgggca ttacctggtg tctgcccatg ccatgaacaa ctcaggcccc 721 actgcccaga tcatcgaacg ggagggatgg aagaccaaca tggactttgt tgggcaccgg 781 aaagctgtga ctgtcgtgaa attcaaccca aaaatcttca aaaagaagca gaagaatggg 841 agttctgcga agcctagctg cccgtactgc tgctgtgctg ttggcagcaa ggaccgctcg 901 ctttctgtct ggctcacatg tctgaaacgg ccgctggtgg tcatccatga actgtttgac 961 aaatccatca tggatatttc ctggactctg aatgggctgg gcatcttggt atgctctatg 1021 gacggctctg tggcattcct cgacttctcc caggatgagc ttggcgatcc cctgagcgag 1081 gaggagaaga gccgcattca ccagtccacc tatggcaaga gcctagccat catgaccgag 1141 gcccagctct ccacagccgt cattgagaac cctgagatgc tcaagtacca gcgaaggcag 1201 cagcagcagc agctggacca gaagagtgct gcgaccaggg agatgggctc agccacctca 1261 gtcgcaggcg ttgtcaacgg ggagagtctt gaagatatca ggaagaatct tttgaagaaa 1321 caagttgaga ctcggacagc agatggccgg agaagaatca cgcctctctg catagcacag 1381 ctggacactg gggacttctc cacggcattc tttaacagca tccccctctc gggctccctg 1441 gcgggcacca tgctctcttc tcatagcagt ccacagctac tgccactgga ctccagtacc 1501 cctaactcct tcggcgcctc gaagccttgc acagagcctg tggtggctgc cagtgccaga 1561 cctgcaggcg attctgtcaa taaagacagt atgaatgcta cctctactcc tgctgcattg 1621 tcaccttctg tgttaacgac cccgtccaag atcgaaccca tgaaagcgtt tgactcccgg 1681 ttcacagagc ggtccaaagc cacaccaggt gctcctgccc tgaccagcat gactccgaca 1741 gctgtggaaa ggttaaaaga gcagaacctt gtgaaagagc tgaggccccg agacctcctg 1801 gagagcagca gtgacagcga tgagaaagtc cctttggcta aggcttcctc actgtccaag 1861 cgaaaacttg agcttgaggt agagacagta gagaagaaga agaaagggcg gcctcggaag 1921 gactctcgtc tcatgcctgt gtctctgtct gtccagtctc cagctgccct aaccgcagag 1981 aaggaggcca tgtgtctgtc tgcaccagca cttgcactga agctgccaat tccaagcccc 2041 cagagagcat tcaccctcca ggtcagctcc gatccttcca tgtacattga ggtggagaat 2101 gaagtgacag tggtgggggg cgtgaagctg agccgcctga agtgcaaccg ggaagggaag 2161 gagtgggaga cggtactcac cagccggatc ctcactgctg cgggcagctg tgacgtggtg 2221 tgtgtcgcct gtgaaaaaag gatgctgtca gtgttctcca cctgtggtcg ccgtctcctc 2281 tctcccatcc tcctgccatc cccgatctct actttgcatt gcacaggctc ctacgtcatg 2341 gcgctcaccg ctgcagccac actctctgtc tgggatgttc acagacaggt ggttgtggtg 2401 aaagaagagt ctctgcactc catcctggca ggaagtgata tgacggtatc acagatcttg 2461 ctgacgcagc atggaatccc agtaatgaac ctgtccgatg ggaaggcgta ctgctttaat 2521 ccgtcacttt ccacatggaa cctggtttct gacaagcagg actcactggc tcagtgtgca 2581 gactttagga gcagcctgcc atcccaggac gccatgctgt gctcaggacc gttagccata 2641 aaccagggcc gcacctccaa ctcgggaagg caggctgccc ggctcttctc cgtgcctcat 2701 gtggtgcagc aagagaccac cctggcctac ctagagaacc aggtggcagc agcactcacc 2761 ctgcagtcca gccacgagta ccgccattgg ctcctcgtct acgcacggta cctcgtaaac 2821 gaagggtttg aataccgact tcgagaaata tgcaaggact tactgggtcc ggttcactac 2881 tccactggaa gccagtggga gtcaacagta gtgggtctgc ggaagaggga gctgctgaag 2941 gagctgctac cagtcatcgg gcagaacctc cgattccagc gcctcttcac cgagtgtcag 3001 gaacagctcg acatcctgag ggacaagtag cctgccccag cctgccctgg ctgcagcaag 3061 ggcagggcca cactctcgcc gctgatgaca tgcaggaccg cctctcacct gaccaggctg 3121 tagggggagg agacactggc aggagatgtg ctgtcctgca ccagcgccag cccagctccc 3181 tgggcagatg tgccctgtgt cctgggtctg acattgcctc aggaggggga agctcatccc 3241 tccctccaag cccccgatgc ggagctaggg ctggagctct gccaggtgcc ctggggccag 3301 caaggcagtc ccaggcctgc cgtctccagc acggccccaa ggtggacact agcccctgct 3361 gctgcaggcg ccatgctgct tcagcagtga cgattgagcc atttgtgaga cagaatccgg // LOCUS HSHISH2B 843 bp DNA PRI 12-SEP-1993 DEFINITION Human histone H2b gene. ACCESSION X00088 NID g32112 KEYWORDS histone; histone H2B. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 843) AUTHORS Zhong,R., Roeder,R.G. and Heintz,N. TITLE The primary structure and expression of four cloned human histone genes JOURNAL Nucleic Acids Res. 11 (21), 7409-7425 (1983) MEDLINE 84069776 FEATURES Location/Qualifiers source 1..843 /organism="Homo sapiens" /db_xref="taxon:9606" promoter 297..303 /note="Hogness Box" precursor_RNA 324..805 /note="primary transcript" CDS 370..747 /note="histone H2b" /codon_start=1 /db_xref="PID:g32113" /db_xref="SWISS-PROT:P06899" /translation="MPEPAKSAPAPKKGSKKAVTKAQKKDGKSAAHRKESYSIYVYKV LKQVHPDTGISSKAMGIMNSFVNDIFERIAGEASRLAHYNKRSTITSREIQTAVRLLL PGELAKHAVSEGTKAVTKYTSAK" misc_signal 781..796 /note="dyad symmetry pot. transcription termination" BASE COUNT 208 a 233 c 207 g 195 t ORIGIN 1 cttggcctta gcgcgggctt tgcctccctg cttgccacgt ccagacatag cgagcgcaac 61 tcactacgag caaccacaaa gtgaacggga aaggcggcgc tttttataaa cactattggg 121 cgcgaaaaag aagacgtgtt gttggttggg actgcagttt aatttcaacc aatagtagtg 181 cgtcttctgg atttgcgaat cctgattggg cagacctgac ctctgacgtt accctgaata 241 actaccaatc agacacaaga cttcaactct tcaccttatt tgcataagcg attctatata 301 aaagcgcctt gtcataccct gctcacgctg tttttccttt tcgttggcgc tttatagcta 361 cacagtgcta tgccagagcc agcgaagtct gctcccgccc cgaaaaaggg ctccaagaag 421 gcggtgacta aggcgcagaa gaaagacggc aagagcgcag cgcaccgcaa ggagagctat 481 tccatctatg tgtacaaggt tctgaagcag gtccaccctg acaccggcat ttcgtccaag 541 gccatgggca tcatgaattc gtttgtgaac gacattttcg agcgcatcgc aggtgaggct 601 tcccgccttg cgcattacaa caagcgctcg accatcacct ccagggagat ccagacggcc 661 gtgcgcctgc tgctgcctgg ggagttggcc aagcacgccg tgtccgaggg tactaaggcc 721 gtcaccaagt acaccagcgc taagtaaaca gtgagttggt tgcaaactct caaccctaac 781 ggctctttta agagccaccc atgttctcaa agaaagagct ggtgcttgta tttcctcctc 841 gct // LOCUS HSHISH3 698 bp DNA PRI 12-SEP-1993 DEFINITION Human histone H3 gene. ACCESSION X00090 NID g32114 KEYWORDS histone; histone H3. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 698) AUTHORS Zhong,R., Roeder,R.G. and Heintz,N. TITLE The primary structure and expression of four cloned human histone genes JOURNAL Nucleic Acids Res. 11 (21), 7409-7425 (1983) MEDLINE 84069776 FEATURES Location/Qualifiers source 1..698 /organism="Homo sapiens" /db_xref="taxon:9606" promoter 93..97 /note="CAAT box" promoter 114..120 /note="TATA box" precursor_RNA 148..659 /note="put. primary transcript" CDS 186..596 /note="histone H3" /codon_start=1 /db_xref="PID:g32115" /db_xref="SWISS-PROT:P16106" /translation="MARTKQTARKSTGGKAPRKQLATKAARKSAPATGGVKKPHRYRP GTVALREIRRYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSSAVMALQEACEAYLV GLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA" misc_signal 634..649 /note="dyad symmetry" BASE COUNT 169 a 183 c 177 g 169 t ORIGIN 1 acggtaatga caggaatctc tcttaatctg caactaggca cagagatggg ccaatccaag 61 aagggcgcgg ggatttttga attttcttgg gtccaatagt tggtggtctg actctataaa 121 agaagagtag ctctttcctt tcctccacag acgtctctgc aggcaagctt ttctgtggtt 181 ttgccatggc tcgtactaaa cagacagctc ggaaatccac cggcggtaaa gcgccacgca 241 agcagctggc taccaaggct gctcgcaaga gcgcgccggc taccggcggt gtgaaaaagc 301 ctcaccgtta ccgtccgggt actgtggctc tgcgtgagat ccgccgctac caaaagtcga 361 ccgagttgct gattcggaag ctgccgttcc agcgcctggt gcgagaaatc gcccaagact 421 tcaagaccga tcttcgcttc cagagctctg cggtaatggc gctgcaggag gcttgtgagg 481 cctacttggt agggctcttt gaggacacaa acctttgcgc catccatgct aagcgagtga 541 ctattatgcc caaagacatc cagctcgctc gccgcattcg cggagaaaga gcgtaaatgt 601 aaagttactt tttcatcagt cttaaaaccc aaaggctctt ttcagagcca cccacttatt 661 ccaacgaaag tagctgtgat aattttttgt tgtctcaa // LOCUS HSHK2 2905 bp RNA PRI 02-JUN-1997 DEFINITION H.sapiens mRNA for kinesin-2. ACCESSION Y08319 NID g1922312 KEYWORDS HK2 gene; kinesin-2. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2905) AUTHORS Debernardi,S., Fontanella,E., De Gregorio,L., Pierotti,M.A. and Delia,D. TITLE Identification of a novel human kinesin-related gene (HK2) by the cDNA differential display technique JOURNAL Genomics 42 (1), 67-73 (1997) MEDLINE 97321046 REFERENCE 2 (bases 1 to 2905) AUTHORS Debernardi,S. TITLE Direct Submission JOURNAL Submitted (24-SEP-1996) S. Debernardi, Istituto Nazionale Tumori, O.S.A., Via Venezian 1, I- 20133 Milano, ITALY FEATURES Location/Qualifiers source 1..2905 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="hemopoietic" /cell_type="lymphoid" /chromosome="5q" gene 19..2058 /gene="HK2" CDS 19..2058 /gene="HK2" /codon_start=1 /product="kinesin-2" /db_xref="PID:e267601" /db_xref="PID:g1922313" /translation="MVTSLNEDNESVTVEWIENGDTKGKEIDLESIFSLNPDLVPDEE IEPSPETPPPPASSAKVNKIVKNRRTVASIKNDPPSRDNRVVGSARARPSQFPEQSSS AQQNGSVSDISPVQAAKKEFGPPSRRKSNCVKEVEKLQEKREKRRLQQQELREKRAQD VDATNPNYEIMCMIRDFRGSLDYRPLTTADPIDEHRICVCVRKRPLNKKETQMKDLDV ITIPSKDVVMVHEPKQKVDLTRYLENQTFRFDYAFDDSAPNEMVYRFTAKPLVETIFE RGMATCFAYGQTGSGKTHTMGGDFSGKNQDCSKGIYALAARDVFLMLKKPNYKKLELQ VYATFFEIYSGKVFDLLNRKTKLRVLEDGKQQVQVVGLQEREVKCVEDVLKLIDIGNS CRTSGQTSANAHSSRSHAVFQIILRRKGKLHGKFSLIDLAGNERGADTSSADRQTRLE GAEINKSLLALKECIRALGRNKPHTPFRASKLTQVLRDSFIGENSRTCMIATISPGMA SCENTLNTLRYANRVKELTVDPTAAGDVRPIMHHPPNQIDDLETQWGVGSSPQRDDLK LLCEQNEEEVSPQLFTFHEAVSQMVEMEEQVVEDHRAVFQESIRWLEDEKALLEMTEE VDYDVDSYATQLEAILEQKIDILTELRDKVKSFRAALQEEEQASKQINPKRPRAL" misc_binding 856..915 /gene="HK2" /bound_moiety="ATP" BASE COUNT 999 a 514 c 589 g 803 t ORIGIN 1 ggccgaatac atcaagcaat ggtaacatct ttaaatgaag ataatgaaag tgtaactgtt 61 gaatggatag aaaatggaga tacaaaaggc aaagagattg acctggagag catcttttca 121 cttaaccctg accttgttcc tgatgaagaa attgaaccca gtccagaaac acctccacct 181 ccagcatcct cagccaaagt aaacaaaatt gtaaagaatc gacggactgt agcttctatt 241 aagaatgacc ctccttcaag agataataga gtggttggtt cagcacgtgc acggcccagt 301 caatttcctg aacagtcttc ctctgcacaa cagaatggta gtgtttcaga tatatctcca 361 gttcaagctg caaaaaagga atttggaccc ccttcacgta gaaaatctaa ttgtgtgaaa 421 gaagtagaaa aactgcaaga aaaacgagag aaaaggagat tgcaacagca agaacttaga 481 gaaaaaagag cccaggacgt tgatgctaca aacccaaatt atgaaattat gtgtatgatc 541 agagacttta gaggaagttt ggattataga ccattaacaa cagcagatcc tattgatgaa 601 cataggatat gtgtgtgtgt aagaaaacga ccactcaata aaaaagaaac tcaaatgaaa 661 gatcttgatg taatcacaat tcctagtaaa gatgttgtga tggtacatga accaaaacaa 721 aaagtagatt taacaaggta cctagaaaac caaacatttc gttttgatta tgcctttgat 781 gactcagctc ctaatgaaat ggtttacagg tttactgcta aaccactagt ggaaactata 841 tttgaaaggg gaatggctac atgctttgct tatgggcaga ctggaagtgg aaaaactcat 901 actatgggtg gtgacttttc aggaaagaac caagattgtt ctaaaggaat ttatgcatta 961 gcagctcgag atgtcttttt aatgctaaag aagccaaact ataagaagct agaacttcaa 1021 gtatatgcaa ccttctttga aatttatagt ggaaaggtgt ttgacttgct aaacaggaaa 1081 acaaaattaa gagttctaga agatggaaaa cagcaggttc aagtggtggg attacaggaa 1141 cgggaggtca aatgtgttga agatgtactg aaactcattg acataggcaa cagttgcaga 1201 acatccggtc aaacatctgc aaatgcacat tcatctcgga gccatgcagt gtttcagatt 1261 attcttagaa ggaaaggaaa actacatggc aaattttctc tcattgattt ggctggaaat 1321 gaaagaggag ctgatacttc cagtgcggac aggcaaacta ggcttgaagg tgctgaaatt 1381 aataaaagcc ttttagcact caaggagtgc atcagagcct taggtagaaa taaacctcat 1441 actcctttcc gtgcaagtaa actcactcag gtgttaagag attctttcat aggtgaaaac 1501 tctcgtacct gcatgattgc cacaatctct ccaggaatgg catcctgtga aaatactctt 1561 aatacattaa gatatgcaaa tagggtcaaa gaattgactg tagatccaac tgctgctggt 1621 gatgttcgtc caataatgca ccatccacca aaccagattg atgacttaga gacacagtgg 1681 ggtgtgggga gttcccctca gagagatgat ctaaaacttc tttgtgaaca aaatgaagaa 1741 gaagtctctc cacagttgtt tactttccac gaagctgttt cacaaatggt agaaatggaa 1801 gaacaagttg tagaagatca cagggcagtg ttccaggaat ctattcggtg gttagaagat 1861 gaaaaggccc tcttagagat gactgaagaa gtagattatg atgtcgattc atatgctaca 1921 caacttgaag ctattcttga gcaaaaaata gacattttaa ctgaactgcg ggataaagtg 1981 aaatctttcc gtgcagctct acaagaggag gaacaagcca gcaagcaaat caacccgaag 2041 agaccccgtg ccctttaaac cggcatttgc tgctaaagga tacccagaac cctcactact 2101 gtaacataca acggttcagc tgtaagggcc atttgaaagt ttggaatttt aagtgtctgt 2161 ggaaaatgtt ttgtccttca cctgaattac atttcaattt tgtgaaacac tcttttgtct 2221 acaaaatgct tctagtccag gaggcacaac caagaactgg gattaatgaa gcattttgtt 2281 tcatttacac aaatagtgat ttacttttgg agatccttgt cagttttatt ttctatttga 2341 tgaagtaaga ctgtggactc aatccagagc cagatagtag gggaagccac agcatttcct 2401 tttaactcag ttcaattttt gtagtgagac tgagcagttt taaatccttt gcgtgcatgc 2461 atacctcatc agtgattgta cataccttgc ccactcctag agacagctgt gctcactttt 2521 cctgctttgt gccttgatta aggctactga ccctaaattt ctgaagcaca gccaagaaaa 2581 attacattcc ttgtcattgt aaattacctt tgtgtgtaca tttttactgt atttgagaca 2641 ttttttgtgt gtgactagtt aattttgcag gatgtgccat atcattgaac ggaactaaag 2701 tctgtgacag tggatatagc tgctggacca ttccatctta tatgtaaaga aatctggaat 2761 tattatttta aaaccatata acatgtgatt ataatttttc ttagcatttt ctttgtaaag 2821 aactacaata taaactagtt ggtgtataat aaaaagtaat gaaattctga agaaaaaaaa 2881 aaaaaaaaaa aaaaaaaaaa aaaaa // LOCUS HSHL05 1304 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for HLA-DR antigens associated invariant chain (p33). ACCESSION X00497 M14765 NID g32130 KEYWORDS complementary DNA; glycoprotein; histocompatibility antigen; membrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1304) AUTHORS Strubin,M., Mach,B. and Long,E.O. TITLE The complete sequence of the mRNA for the HLA-DR-associated invariant chain reveals a polypeptide with an unusual transmembrane polarity JOURNAL EMBO J. 3 (4), 869-872 (1984) MEDLINE 84207945 REFERENCE 2 (bases 1 to 1304) AUTHORS Mach,B. TITLE Direct Submission JOURNAL Submitted (01-JUN-1984) to the EMBL/GenBank/DDBJ databases COMMENT Data kindly reviewed (01-JUN-1984) by B. Mach. FEATURES Location/Qualifiers source 1..1304 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 8..706 /note="putative p33" /codon_start=1 /db_xref="PID:g32131" /db_xref="SWISS-PROT:P04233" /translation="MHRRRSRSCREDQKPVMDDQRDLISNNEQLPMLGRRPGAPESKC SRGALYTGFSILVTLLLAGQATTAYFLYQQQGRLDKLTVTSQNLQLENLRMKLPKPPK PVSKMRMATPLLMQALPMGALPQGPMQNATKYGNMTEDHVMHLLQNADPLKVYPPLKG SFPENLRHLKNTMETIDWKVFESWMHHWLLFEMSRHSLEQKPTDAPPKESLELEDPSS GLGVTKQDLGPVPM" CDS 56..706 /note="putative p33" /codon_start=1 /db_xref="PID:g32132" /translation="MDDQRDLISNNEQLPMLGRRPGAPESKCSRGALYTGFSILVTLL LAGQATTAYFLYQQQGRLDKLTVTSQNLQLENLRMKLPKPPKPVSKMRMATPLLMQAL PMGALPQGPMQNATKYGNMTEDHVMHLLQNADPLKVYPPLKGSFPENLRHLKNTMETI DWKVFESWMHHWLLFEMSRHSLEQKPTDAPPKESLELEDPSSGLGVTKQDLGPVPM" misc_feature 149..223 /note="transmembrane region" old_sequence 1201..1203 /note="GCA [1] revised to GGA [2]" /citation=[1] misc_feature 1286..1292 /note="polyadenylation signal" polyA_site 1304 /note="polyadenylation site" BASE COUNT 307 a 426 c 326 g 245 t ORIGIN 1 ttcccagatg cacaggagga gaagcaggag ctgtcgggaa gatcagaagc cagtcatgga 61 tgaccagcgc gaccttatct ccaacaatga gcaactgccc atgctgggcc ggcgccctgg 121 ggccccggag agcaagtgca gccgcggagc cctgtacaca ggcttttcca tcctggtgac 181 tctgctcctc gctggccagg ccaccaccgc ctacttcctg taccagcagc agggccggct 241 ggacaaactg acagtcacct cccagaacct gcagctggag aacctgcgca tgaagcttcc 301 caagcctccc aagcctgtga gcaagatgcg catggccacc ccgctgctga tgcaggcgct 361 gcccatggga gccctgcccc aggggcccat gcagaatgcc accaagtatg gcaacatgac 421 agaggaccat gtgatgcacc tgctccagaa tgctgacccc ctgaaggtgt acccgccact 481 gaaggggagc ttcccggaga acctgagaca ccttaagaac accatggaga ccatagactg 541 gaaggtcttt gagagctgga tgcaccattg gctcctgttt gaaatgagca ggcactcctt 601 ggagcaaaag cccactgacg ctccaccgaa agagtcactg gaactggagg acccgtcttc 661 tgggctgggt gtgaccaagc aggatctggg cccagtcccc atgtgagagc agcagaggcg 721 gtcttcaaca tcctgccagc cccacacagc tacagctttc ttgctccctt cagcccccag 781 cccctccccc atgtcccacc ctgtacctca tcccatgaga cctggtgcct ggctctttcg 841 tcacccttgt acaagacaaa ccaagtcgga acagcagata acaatgcagc aaggccctgc 901 tgcccaatct ccatctgtca acaggggcgt gaggtcccag gaagtggcca aaagctagac 961 agatccccgt tcctgacatc acagcagcct ccaacacaag gctccaagac ctaggctcat 1021 ggacgagatg ggaaggcaca gggagaaggg ataaccctac acccagaccc caggctggac 1081 atgctgactg tcctctcccc tccagccttt ggccttggct tttctagcct atttacctgc 1141 aggctgagcc actctcttcc ctttccccag catcactccc caaggaagag ccaatgtttt 1201 ggacccataa tcctttctgc cgacccctag ttccctctgc tcagccaagc ttgttatcag 1261 ctttcagggc catggttcac attagaataa aaggtagtaa ttag // LOCUS HSHL06 907 bp RNA PRI 30-MAR-1995 DEFINITION Human RNA sequence of the human DS glycoprotein alpha subunit from the HLA-D region of the major histocompatibility complex(MHC). ACCESSION X00033 K01172 NID g32133 KEYWORDS glycoprotein; histocompatibility antigen; signal peptide. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 907) AUTHORS Chang,H.C., Moriuchi,T. and Silver,J. TITLE The heavy chain of human B-cell alloantigen HLA-DS has a variable N-terminal region and a constant immunoglobulin-like region JOURNAL Nature 305 (5937), 813-815 (1983) MEDLINE 84039792 FEATURES Location/Qualifiers source 1..907 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 20..784 /note="DS alpha subunit" /codon_start=1 /db_xref="PID:g32134" /db_xref="SWISS-PROT:P04226" /translation="MILNKALMLGALALTTVMSPCGGEDIVADHVASYGVNLYQSYGP SGQFTHEFDGDEEFYVDLERKETVWKLPLFHRLRFDPQFALTNIAVLKHNLNILIKRS NSTAATNEVPEVTVFSKSPVTLGQPNTLICLVDNIFPPVVNITWLSNGHSVTEGVSET SFLSKSDHSFFKISYLTFLPSADEIYDCKVEHWGLDEPLLKHWEPEIPAPMSELTETV VCALGLSVGLVGIVVGTVLIIRGLRSVGASRHQGPL" sig_peptide 20..88 /note="signal peptide" misc_feature 889..894 /note="polyadenylation signal" polyA_site 907 /note="site of polyadenylation" BASE COUNT 210 a 240 c 224 g 233 t ORIGIN 1 aggctgcctt gggaagaaga tgatcctaaa caaagctctg atgctggggg ccctcgccct 61 gaccaccgtg atgagccctt gtggaggtga agacattgtg gctgaccacg ttgcctctta 121 cggtgtaaac ttgtaccagt cttacggtcc ctctggccag ttcacccatg aatttgatgg 181 agacgaggag ttctatgtgg acctggagag gaaggagact gtctggaagt tgcctctgtt 241 ccacagactt agatttgacc cgcaatttgc actgacaaac atcgctgtgc taaaacataa 301 cttgaacatc ctgattaaac gctccaactc taccgctgct accaatgagg ttcctgaggt 361 cacagtgttt tccaagtctc ccgtgacact gggtcagccc aacaccctca tctgtcttgt 421 ggacaacatc tttcctcctg tggtcaacat cacctggctg agcaatgggc actcagtcac 481 agaaggtgtt tctgagacca gcttcctctc caagagtgat cattccttct tcaagatcag 541 ttacctcacc ttcctccctt ctgctgatga gatttatgac tgcaaggtgg agcactgggg 601 cctggatgag cctcttctga aacactggga gcctgagatt ccagcaccta tgtcagagct 661 cacagagact gtggtctgtg ccctggggtt gtctgtgggc ctcgtgggca ttgtggtggg 721 gaccgtcttg atcatccgag gcctgcgttc agttggtgct tccagacacc aagggccctt 781 gtgaatccca tcctgaaaaa gaaggtgtta cctactaaga gatgcctggg gtaagccgcc 841 cagctaccta attcctcagt aacatcgatc taaaatctcc atggaagcaa taaattccct 901 ttaagag // LOCUS HSHLADPB 14782 bp DNA PRI 24-APR-1993 DEFINITION Human HLA-DP-beta 1 gene and HLA-DP-alpha-1 gene exon 1. ACCESSION X02228 NID g32194 KEYWORDS Alu repetitive sequence; antigen; cell surface antigen; cell surface glycoprotein; class II antigen; glycoprotein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 14782) AUTHORS Kelly,A. and Trowsdale,J. TITLE Complete nucleotide sequence of a functional HLA-DP beta gene and the region between the DP beta 1 and DP alpha 1 genes: comparison of the 5' ends of HLA class II genes JOURNAL Nucleic Acids Res. 13 (5), 1607-1621 (1985) MEDLINE 85215568 FEATURES Location/Qualifiers source 1..14782 /organism="Homo sapiens" /db_xref="taxon:9606" prim_transcript complement(<1..619) /note="HLA-DP-alpha 1 primary transcript" exon complement(441..619) /partial /note="pot. exon 1" CDS complement(441..540) /partial /codon_start=1 /product="pot. HLA-DP-alpha 1 precursor (aa -31 to +2) (441 is 1st base in codon)" /db_xref="PID:g32196" /translation="MRPEDRMFHIRAVILRALSLAFLLSLRGAGAIK" sig_peptide complement(448..540) mRNA join(2943..3112,7644..7907,11919..12200,12748..12858, 13188..13211,13508..13736) exon 2943..3112 /number=1 prim_transcript 2943..13736 /note="pot. HLA-DP-beta 1 primary transcript" CDS join(3012..3111,7644..7907,11919..12200,12748..12858, 13188..13207) /codon_start=1 /product="pot. hla-dp-beta 1" /db_xref="PID:g296648" /db_xref="SWISS-PROT:P04440" /translation="MMVLQVSAAPRTVALTALLMVLLTSVVQGRATPENYLFQGRQEC YAFNGTQRFLERYIYNREEFARFDSDVGEFRAVTELGRPAAEYWNSQKDILEEKRAVP DRMCRHNYELGGPMTLQRRVQPRVNVSPSKKGPLQHHNLLVCHVTDFYPGSIQVRWFL NGQEETAGVVSTNLIRNGDWTFQILVMLEMTPQQGDVYTCQVEHTSLDSPVTVEWKAQ SDSARSKTLTGAGGFVLGLIICGVGIFMHRRSKKVQRGSA" sig_peptide 3012..3098 /note="pot. signal peptide (aa -29 to -1)" mat_peptide join(3099..3111,7644..7907,11919..12200,12748..12858, 13188..13204) /product="pot. mature hla-dp-beta 1" intron 3112..7643 /number=1 repeat_region 3880..3897 /note="18 bp direct repeat 1" misc_feature 3880..4625 /note="region of repetitive DNA" repeat_region 4608..4625 /note="18 bp direct repeat 1'" misc_feature 6043..6136 /note="pot. alternative signal peptide" exon 7644..7907 /number=2 intron 7908..11918 /number=2 repeat_region 10416..10424 /note="9 bp direct repeat 2" misc_feature 10416..10748 /note="Alu-I repeat unit" repeat_region 10740..10748 /note="9 bp direct repeat 2'" misc_feature 11475..11775 /note="Alu-I repeat unit" misc_feature 11776..11878 /note="102 bp highly dATP-rich region (57)" exon 11919..12200 /number=3 intron 12201..12747 /number=3 exon 12748..12858 /number=4 intron 12859..13187 /number=4 exon 13188..13211 /number=5 intron 13212..13507 /number=5 exon 13508..13736 /number=6 polyA_signal 13723..13728 polyA_signal 13824..13829 BASE COUNT 4052 a 3170 c 3629 g 3931 t ORIGIN 1 ggatcccaga gagataggag ggccctgata gtaggtcact gtgtgcagga atctggggaa 61 ggcagtgtat gaccctcaga gctgggtctg gacttcaaac ttggctcgtt gatctgctgt 121 gtaaccttgg aaaacttatt catctttttg agcttcagtt ttttcaaaat aatttctaaa 181 taaaaggaat aatttctaaa tgaatggaat attatcttca ttgaagattc ctgtgagatg 241 taaatgggga aagaaactat gcaggagtct cataaattct ggctgttatt gctgttatta 301 ttatgagggc cagagggaac atagactatg aggaccagat agatcaatga gcccctaaaa 361 tctgtgatcc ctgaagcagc aattgatgtg aaccacccca tcactcaccc cgacgctcct 421 gcgtcctcct gagcactcac ccttgatggc cccagctcct cggagactca gcaggaaagc 481 caaggagagg gctctcaaga tcacagctct gatatggaac attctgtctt cagggcgcat 541 gttgtggggt ctataattga tgactgtgag cacaggaaca gtgatgagga actgaggccg 601 agtggaggca gatgagactg aaactgtggg cctctagcac tggaaatggg tggagaggaa 661 tcagcatggc tgggattcac ctatcagaga aatcatagag ctgacattct ctgttgctgg 721 gtaaagagga cgctggaagg tgctggggaa gagatgggag aattttaggt accagcgtgg 781 tcaagagagc tccagttcac agttcatttt cagagttaga gaaagagatg taaaaagata 841 agttacacct tcttctgacg gcaaatgttt tccattatgt tccttctccc gagccccacc 901 cccatcccag acagtcagat gatcttcgat gttttttggt cactatattt taaatcatgt 961 tttatgttat gttgtcaata ttttacaaaa atattctgct gataattaag aatgaatgtg 1021 ctatctaata aaatatataa ttaatctttc tttcaggtcc acctccctga gatacctcct 1081 ttttatttaa tcatttctgc agaagtgtta taatttctat ttagaggttt taattaactt 1141 gaatgaagtt gatctttaat tgtttatcta ttcctggtta cctttgttag tgaaatttct 1201 agataatttt tatttttcag atttcttagt atttgatttt tcctggtatt taaacagtgt 1261 aataacattt ttatctttaa attactagtc ttgttatttc attttcatat aagaataccc 1321 aggacagcat tacctgtggt aacaatgtgc gcccatattt tgatcttgtt tttaagaagg 1381 gtttctctaa tgtttttctg ttacaggtaa tgttaatttt ttattttata ttctctttac 1441 catatttaag aaatactttt ctagtctcat tttaaatatt tcaattttga gctatttatt 1501 tgatactcat agagaaggtc acaaaacatt tactatttaa tgtaatgatg aagtacatat 1561 attacgttaa tattttatct tatttgtggt agccttacct tgcataaata ataattacta 1621 acagattagg acatgagaga ttctgttatt agtgctttgc atgcattacc tcatttaaac 1681 ctcatattaa acctgaggga ggtattatta atgtctactg taaaaataaa ttacctgaga 1741 catcgaggaa gtatttgtct aattatctat ggcaggtaaa tgacaaggag aaaagtccca 1801 cccaggcagt tactaaaaaa actgagtttt tctccacaat cctctcctgg ccccttaatc 1861 ctactagaca ccttctacta cataattatt ttcttctctt gcattttaca tgctagcctt 1921 ctatttacat tttaatattg atttaaagaa atgatgccaa tttgattttt tttgaaatta 1981 gaattggtgg tccaacagga tcacatttat aagtgtctaa agtaagaagt aatgttcttt 2041 gaaagtttgt aaaaatattc actctaaaca aaatagaatc agatgctttg aaggaggtgg 2101 ggtctttgat gatttttttt cactttcttc cttatttacc agtcaattta tattctctat 2161 ggactttatt tttccaaagc aatttcagac ctattgatct catttgatct taagagcttt 2221 gctataaggc aggttatatc atccccatat tgaagacaag gaatcgaagt ccaagagagg 2281 cagtgtcgtt aaagctgcat atttacatgg tagggtaggt ggtgtgtcca cgctcccagt 2341 gtaaggtccc tagactgagc cctcctgacc ctgatgacag tcctgtggaa gaacctggta 2401 actcctgcac atcgcaggac tcacagacct ctgggagaaa gtaaatatga atgggtgcta 2461 atcttaaaca cacccttgga caaaggcaag acagacagac tcagacctca tttgagttct 2521 gagatgggta ctctaatccc tctaagtcat gccactgaat gaccttttac acactaagat 2581 agcacttttt ccacaacaga ccatgtcctg tgggtgtgtg aggtgtggca gaattgggga 2641 aatgataatc cctgtagatg ggccagcaga atatttgaga tcaccttcag agcaaagaaa 2701 acgcataatc tcgccaaaca tcatgactta tctgactggt taaaatgagt atcactgtct 2761 ttcctccgtc atcttaagtg catcacaggc tttatatttt cagacctttc atactaactt 2821 tctgcctagt gagcaatgac tcatacaaag ctcagtgtcc attggttctt ttctcagact 2881 ctgtccaatc ccagggtcac agaagactac ttgggttcat ggtctctaat atttcaaaca 2941 ggagctccct ttagcgagtc cttcttttcc tgactgcagc tcttttcatt ttgccatcct 3001 tttccagctc catgatggtt ctgcaggttt ctgcggcccc ccggacagtg gctctgacgg 3061 cgttactgat ggtgctgctc acatctgtgg tccagggcag ggccactcca ggtaagagcc 3121 gaactgccat tcttggaggg tctggctcag ggaacaattc ctaggggacg ttatctttaa 3181 gggatcaaat tctgagacag gctgcggggg ctcctgccct aaggcagtgt cctctcttcc 3241 cagctagaga aagaggttca tcccctatag gatagcttgc taccctactg gcctattctc 3301 tctccaagga catgggtaca gtaaacagag agaggtgccc agtggtcagt atgcttgtct 3361 ttggggaaaa tgggaccaag aggtcctgga taaccttgga cagacaaggt ttgcagagag 3421 agaagttggc aagtgcaggc tcctgggcgt gttcatgtct gcatccagcc tggaggggac 3481 tcaggcagag agccctaagc tggagtgtcc aggctctgag gatcactgag gattcagtgc 3541 tcacgaagaa tgcctcttat tccccagggt ggagcaggag cccacatccc ttggacaatt 3601 aaggagagaa gggagggagg gggataggtt ttagcccctg aaggcattct cattaaaggt 3661 acttctccca gcctccccag aacttggtta gggtactaga gtgggttgcg acttgtagga 3721 agaatgagat gaggttgtgt gggtgcatga cagggattga gtgtaggtta tcagacagcc 3781 aaggaagcag taaccaagtg aaaaatctct tcttcctgct gcctccctgt ggctggtgta 3841 atattatggc atctatgatc cattgttttt ctctcaggat actctcagga tatttctttt 3901 tatatatata tatactttaa gttctagggt acatgtgcac aacgtgcagg tttgttacat 3961 atgtatacat gtgccatgtt ggtgtgctgc acccattaac tcgtcattta cattaggtat 4021 atttcctaat gctatccctc ccccctcccc ccaccccaca acaggccccg gtgtatgatg 4081 ttccccttcc tgtgtccatg tgttctcatt gttcagttcc cacctatgag tgagaacatg 4141 tggtctttgg ttttttgtcc ttgcaatagt ttgctgtgaa tgatggtttc cagcttcctc 4201 catgtcccta caaaggacat gaactcatcc ttttttatgg ctgcacagta ttccatggtg 4261 tatatgtgtg cattttctta atccagtcta tcactgatgg acagttgggt tggttccaag 4321 tctttgctat tgtgaatagt gccgctataa acatatgtgt gcatgtgtct ttatagcagc 4381 atgatttata atcctttggg tatataccca gtaatgggat ggctgggtca aatggtattt 4441 ctagttctag atccttgagg aattgccaca ctgtcttgag ataccatctc acaccagtta 4501 aaatggcgat cattaaaaag tcaggaaaca acaggtgctg gagaggatgt ggagaaatag 4561 gaacactttt actctgttgg tgggactgta aactagttca accattgtac tctcaggaca 4621 tttctagtcc aaatttacac caacactctg agaggaagga ctgcaaagta ggtaccttag 4681 ttttccactg acttccactt ttcctgctta cacccttcct cctagacctc tccacacccc 4741 tcctaggaca cacctaaaag gtactgacat catgtcacct cctcatcttt cagggtagca 4801 aggttggaat ctcctgaata cagcccctca agccctaaaa cctcttatct attaccttgg 4861 gttcattgtc caggaagggg aggagaactt gaacttgtag tcacagaagg gtgctgagaa 4921 ctaaccagca ggacggctca gccctgggaa ctgcagaggg gtgaggctgg ggagagagga 4981 ggctggagca gcactggtga cactgaacag tgtcaggagg aagtgacgga tgcagcgccc 5041 ccatcccata ggcagagctg tcatgtggga tgagggacag tgttgggagc caccaaggaa 5101 acccagaggt gggggagcag agagcagaag ggagcatgtg atgctggaca gtgaaaggga 5161 ggacaggcaa aggctgggtt gaggtttgta gggggaatga gatgaggcag tggagccatg 5221 tgacagggac tgagggtaga ttactggagc tccctgcgta gaatgaatgt tcaatcaaaa 5281 tttgctggag ggagagctgg agccataggg gagtgggtaa agtgggcagg gctgattcca 5341 caattccctg catgctcccc caactccaca cacatcccca acctcaaaca gggcacaaga 5401 ccaaagggct gaggagccag gctatagctt aaagaggctg ggggagaaaa gcttggctga 5461 gacaacccat agggagctag aggtttttaa tatatcctat tctgaataag agacgaattc 5521 attcagatca gtggtttcaa accgtgctct gggcaactca attgctaagg gttccacaaa 5581 caggataaag tttcttatat acaaaaaaaa atgaaggttt caaattacac cataaaaccc 5641 ctcattgctt atgtctactt ggcaggtaaa attccatttc aaaagttaaa tgtacttaaa 5701 aaattaccta agactgggta aattaaaaaa attaaatgtt gcaaagaaaa aattcaaaat 5761 tcttattctt gaatgaaaaa cgttctctta ctggtgattg aggaggagaa acaaagacta 5821 acaaatgaaa atgggagaat ccacactcag agtggggcaa ctgaacaggc aggggcggat 5881 ggatggcaga ggaggaggaa tctggaccaa ggagctgggg ggctctgggc ctggaatttt 5941 agggtctggg gcccaacacc aggagagagg caggtcagga tatctgagtc aagacctggg 6001 atcttgcctt agcaatgaca ctggagacta aaggtggact ccatggtgcc cttgagccca 6061 gccctacccc atctccacta tcctctgcca ccagctgtgc aacttctgct aggggtgagg 6121 ttaataaact ggagaagtta atttgtggag catgaaacag atgagcagaa caatcacagc 6181 accttaattt ccccagtgtg cccaagaaca gagcaggcct gaagatactc aaacagaaac 6241 aaacatgtgc cgtgtcactg ataattctgt gtagacacac acctgccaga cactgctcat 6301 ggcactccct aggaagaaca gcatgtggga aaggctgcca aaattgttca tgtaaaaatt 6361 acatcaatgc tgtcttcctc ggtgctgcct atgcagctgg cagccatctc ttcctccaca 6421 tcatggcctc cctcagactc ctcatgaagg ataagatcct caaaaagagg accaacaagt 6481 tcatgaggca ccaatcagac tgaaatgtca aaattaagca taactggcgg aaacccagag 6541 gtcttaacag tagggttcgt agaaggtcca agggccagat cttgatgccc aacattgctt 6601 atgggagcaa caacaacaac aaaaaaaaca tgctgcccag tggcttccag aagtttctgg 6661 tccacagcct caaggagctg aaagtgctgc tgatgtgcaa caaatcttac tgtgctgaga 6721 tcgctcacaa aatttcctcc agaactgcaa agtcatcatg gaaagagtca cccagccggc 6781 catcagagtc accaacccca gtaccagggt gcacagctaa gaaaatgagt agaaagttca 6841 tgtccacgtt ttgtgtgtaa ataaaaccat aaaaactgcc aaaaaaaatt acatcaatgc 6901 ctctaaaccc aaaggactct acccccacag gtccctggtt gttgtggtga ttttcattgt 6961 gtaaaatact ttccacatct tttgacacca agtctttctg cagccatgtt tgaaaattaa 7021 ctttcaggct acagagtctt tcttatacca aagttgaaga aagttttaag aaatatattt 7081 ctacatctcc tacatgcaaa acaacaggag caagttgagg aattctcaag aaactggtcg 7141 agaagagaga gcgcttagct atggaaaaga gaaagaagga agggagggct tcctggagga 7201 ggtggcattt gaaccaggac tgacatcagg atggaaatgt cagtcaggga gttaagtagg 7261 gggagcagct ccgccctcca cgtccccagc tcctcccgcc cctgtttttt ctcccagtga 7321 ccccacgtga aacgtctccg cctcctccag ccaccagcag aagggactgc cttcccctca 7381 gtgctcgccc ctccctagtg atcactcagt gcccctgagc tcattctttt cagtaaattc 7441 tctctctgcg tggtgagaaa acaggcctgg agaggctctg cgacccgctt aggaccacag 7501 aactcggtac taggaaaact cctattttaa aatccagccc tgggtgggaa gatttgggaa 7561 gaatcgttaa tattgagaga gagagggaga aagaggatta gatgagagtg gcgcctccgc 7621 tcatgtccgc cccctccccg cagagaatta ccttttccag ggacggcagg aatgctacgc 7681 gtttaatggg acacagcgct tcctggagag atacatctac aaccgggagg agttcgcgcg 7741 cttcgacagc gacgtggggg agttccgggc ggtgacggag ctggggcggc ctgctgcgga 7801 gtactggaac agccagaagg acatcctgga ggagaagcgg gcagtgccgg acaggatgtg 7861 cagacacaac tacgagctgg gcgggcccat gaccctgcag cgccgaggtg agtgagggct 7921 ttgggccggc ggtcccaggg cagccccgcg ggcccgtgcc cagggcgcag gagcagccgg 7981 gttggcctaa gggaccttag tgccgggcgg aaaggggact ttgggttggg gattcatggg 8041 gggagcccat ctggagcttg tcaggggagc gagcgcgggg acctggactg ggctgagcat 8101 ggagtgagga ggacgagagc agagagaccc ccgggagctt catcaggcct ggcagctgac 8161 tgcatgtggg gtgaaaaaag gaagccacag gacagcgcac aagggtatgg tgtggagatg 8221 gaggtggaga tggcacagca ggccacacag agaagaaacc tacagggagg tagctgggtt 8281 tgaggtgctt gaggggcaga tgggtggtct gatgggcagg tagacagaag ggtctgcagc 8341 cggggaggag actgagatac atgagaccat ccagggagag gggacccagg gggaagagca 8401 aaggaccgga tcctgggaac tggacagttg tgatttggcc aagacagaaa agcctgtgaa 8461 agagaccaaa aaaacccaag tgcagtgtga ggagaggccc gcagagaaga gtcttggaag 8521 ctgaggggag gtgacctcag cagcacagtg gacagcggtg ccagtgactt gggaaggtca 8581 gaaaacagaa gatggaaagt gggtttggaa accagggaga cctggggaga gcaggttggc 8641 cgcagcggca ggagctggaa tgggaggggg tgcatgaggc tgagtgtggc gcatcctcct 8701 cggggctgag atggatttta cttgtcttgg gttccccacg gctgtcacag ggcagtgtct 8761 cagttcattc gtctttttcc ttcaggaagt ctgggtgtaa agggatggag agaggtgagg 8821 tgtgtgcagt aagaggattt ctcaaggatg ggacaggaag gccttggagc tttggcttcc 8881 tcctgtgaac ttgtggggtg gggagcctgg tgcaccaacc tgagggactt gagggagtag 8941 tatcaggatg tgggattgag ccctggacct tttttctaga aagaggaaaa aaatgaaggg 9001 aggaggagga ggaagctggg gagatcacac ctttgatttt cttgttcctg gaaagtgaaa 9061 ggaagttcac ctgctatgag tgagaaggtg gacacactgg gtggggatga ggtgagtgac 9121 atgagcttag gaaagttgct gaggtaattg gttgagagag gtgttcaaat aaaaataacg 9181 caattggcaa aaactgttac taagactttg tagaggcacc aatcagtgac atggcagcat 9241 tttctttcac agtaatcaac tgccagattg cagacagccc tgatgccagc ctaaggagtg 9301 tgggtttctc ctccaggccc gcaggtcccc aacctcactc ctctgaagac tcttctggag 9361 atcctctgtg atgcacagat ctccagactc agtgccccca gactcagatt ccctgggtgg 9421 ggaggtctgg ggatctctgc ttgtaatcag ctccctagag gttcccatgt agccagataa 9481 gtattgtcag aacactgaag atttttgaaa aatgaaaaag agaaggttgg agatgtgtct 9541 tcagaagact actaagggtg ctggctagag gagggaccag aggcagggag atgaggtagg 9601 aaactgctat tatttgtcag ggaaattgca atcaaggcat gagttagaac agggaaaaca 9661 cagaggcaag ggagaggtgg aagggggagg aaagaagtag tgacaattcc agggtggatg 9721 tccacccaaa tctagaagta attgagcaaa tgttttctgg gcattagaga aggcaactag 9781 aacaaacagg aatccttgcc ttggtgaaat gtatttgaac tgggtcagaa atgaggccat 9841 tgggtatcag gccttaactc cagcgcaccc tggaggtcac tgatgtggct ccaggctgac 9901 ctgctcctgt caaagaatat tgagcaagat gcctctcgtg gaatgttctg ggaccttaaa 9961 acagataccc aagtattccc cctgatttca tggttcccag aagctctatg gggaagaaat 10021 tgtaggtaat tcacaactga gatttagaca taagttgaat agtgtaatgg acattgagtt 10081 aaccgaggta atgaagtagt gagacacagg tgcccctgaa ataaactcac attgagggaa 10141 gaggctgaca atgtggatca gtctgaaaac aaggcaaaaa tacaataggg agtaagggtt 10201 gtgtgtcagt tcaagactgt acttttacct ggcccagcgc catgttaggg tatttgtgtt 10261 ctccaggaag tagaaaggaa agaactgagt gattagggac ctagaagact aatttgagac 10321 attcctcttg atgagctgtt ctctagggta gtcctctgaa agagctgttc tctagtggat 10381 ctccctgaat gaactgttct ctaggagcac ttgacccttt tctgtgtttg ttttttgttt 10441 tgtgtttgtg tttgtttttg agacaggttc tcactttgtc tcccaggctg gagtgctgtg 10501 gcaccatcat ggctcactgc agcctcaacc tcctgggctc aagtgatcct cctgcctcag 10561 cctcccatgt agctagaact acagatacac gtaccaccat gtctggctaa tttatttttc 10621 tttttagaga tgggttctca ctatgttgcc caggccggtc tcaaaaccct gggctcaagt 10681 gatcctcatg cctcaacctc ccaaagtgct aagattatag gcatgaccac catgcctggc 10741 cttttctgct ttctgaggag gaaaaaggta ctggtggcag agatccaaaa gaaaagttgc 10801 cagtggcagt gtggaaattc acctgagaac aacaggacaa gctggggcac aaatgcaaag 10861 atgcagaggg aggcaacacc tggtcatctg tgagaccttc atgggacctg aagacgcagc 10921 acagaggagg aacttgaaaa aggacgggat ttctactact caagcatgta ggagctcagg 10981 atattctgta aatatgaaga ttttgagttt ttgtaggtga ggtaaaaaaa tacataggtt 11041 ttttacagaa taagacatgt aaagctctct tcattttctt tgtattttca tgaagttatt 11101 agattcacag gccaccataa tgccattgtc tgtatatctt aatttcaaga tattatttga 11161 gtaaattttg cttcctttgt atcaagatag aactttgaaa aggtaggtaa tttcacagtt 11221 gatcaaatat tctttgccca aattactttt ggttaaaatt tctcctaaat gtgctacaga 11281 gtgcaaactc tgtctccctg ccattccgct atatacttac taactattat tttattcaag 11341 atcatgcatg ctctacttga aggtctattt ctatcttttc aatgctaccc ttacccacta 11401 gcctaatcac attattccta ttttcaacat ctaggaatca attacatagt gaacatgcct 11461 aagaaataat aatctgggca gatgcagtgg ctcaggcccg taatcccagc cctttgagag 11521 gccgagcggg tggatcactt gaggtcaggc gttggtcaag tgctcctaga gaaccaggct 11581 gaccaacatg gagaaacctt gtctctacta ataatacaaa aattagccag gtgaagtggc 11641 aggcacctat aatcccagct attcgggagg ctgaggaagg agaattggtt gaagcccaga 11701 ggtggaggtt gcagtgagcc aatattgcgc cactgcattc cagacttggc aacagagtga 11761 cactccatct caacaaaaag aaagaatgaa agaaagaaag agcgagatta tgtctcaaaa 11821 aaaaggaagg aaggaaggaa ggaaggaagg aaggaaggaa ggaaggaaag aaggacaatc 11881 tcaaattcta tttcattatt tttcttccac gctcctagtc cagcctaggg tgaatgtttc 11941 cccctccaag aaggggccct tgcagcacca caacctgctt gtctgccacg tgacggattt 12001 ctacccaggc agcattcaag tccgatggtt cctgaatgga caggaggaaa cagctggggt 12061 cgtgtccacc aacctgatcc gtaatggaga ctggaccttc cagatcctgg tgatgctgga 12121 aatgaccccc cagcagggag atgtctacac ctgccaagtg gagcacacca gcctggatag 12181 tcctgtcacc gtggagtgga gtgagtctct gatgaccctc tagaccccac ctctgaagag 12241 caggggactc tctggctctg gggtccactc atcttatctt ctgcatctat accctggggc 12301 catgtccaaa ccccatcttt cttctatacc agctcctgag catagtttga agccagggaa 12361 atggagactt cctgaccttg gcttaggggt tcctgaagat tcatagttct cccccttgtc 12421 agagaatcta gggacactga ctggtctcga aaccctcaca cttaggaact gacctcacac 12481 ataggaacag ttctcttcct tcagcatttt agcctcttct caggcatttt gagaggcaac 12541 ttccagaatc agcatttgcc accttgttga ggtcacaccc ctgttccaga tatgagggtg 12601 gctctttctg aatttcctct tagcaagctt tttccgctgc actgtcctca tcccgatatg 12661 ctgcatcagg ctccagaatc tcagacagga catgagtagg gatgcagctg gtggaggtga 12721 cactaaacct gggtctgtcc ttcccagagg cacagtctga ttctgcccgg agtaagacat 12781 tgacgggagc tgggggcttc gtgctggggc tcatcatctg tggagtgggc atcttcatgc 12841 acaggaggag caagaaaggt gagaaagcct gcagggtgag cgggacttac cttcccctgg 12901 catattcaca cttattccac gatgaggggt ttgacagaaa agaaatgtca gaaagctcta 12961 gaggccactg atatcagata atcggggaac aaacatgacc tatagcgaga gagggatccc 13021 aggctgggat cttaatgcag ccagatgcat gaggtcccaa gtactcaggc tcctgcggag 13081 cgtccattga gtgatgggca atggaatttg gtgggatgga aatgtttctc taattatctg 13141 aggtggtttc aatggctgat tatataacct ttcgtctttc atttcagttc aacgaggatc 13201 tgcataaaca ggtaatattc ctgctttgat ttccttgtgg ggtgggttgc aggaggatat 13261 gagtcctttc tgtgcattgt aacactgagg ctcctccagg aagggaatct caggcatgaa 13321 cccctctttc aatgtcagcc ttcaggcaag tggggaaaga gcattgcttg gctccattgc 13381 tgaaggaagc agagatcaac tctgttattt atcagcctga gacgcatcct ctcaccataa 13441 tttttctctc ctggacttac aggaaggagg ctggcaacct gggataactt gtcttttacc 13501 cccacagggt tcctgagctc actgaaaaga ctattgtgcc ttaggaaaag catttgctgt 13561 gtttcgttag catctggctc caggacagac cttcaacttc caaattggat actgctgcca 13621 agaagttgct ctgaagtcag tttctatcat tctgctcttt gattcaaagc actgtttctc 13681 tcactgggcc tccaaccatg ttcccttctt cttagcacca caaataatca aaacccaaca 13741 tgactgtttg ttttccttta aaaatatgca ccaaatcatc tctcatcact tttctctgag 13801 ggttttagta gacagtagga gttaataaag aagttcattt tggtttaaac ataggaaaga 13861 agagaaccat gaaaatgggg atatgttaac tattgtataa tggggcctgt tacacatgac 13921 actcttctga attgactgta tttcagtgag ctgcccccaa atcaagttta gtgccctcat 13981 ccatttatgt ctcagaccac tattcttaac tattcaatgg tgagcagact gcaaatctgc 14041 ctgataggac ccatattccc acagcactaa ttcaacatat accttactga gagcatgttt 14101 tatcattacc attaagaagt taaatgaaca tcagaattta aaatcataaa tataatctaa 14161 tacactttaa ccattttctt tgtgtgccat cacaaatact ccttaaccaa atacggcttg 14221 gacttttgaa tgcatccaat agacgtcatt tgtcgtctaa gtctgcattc atccaccagc 14281 ctaggcctcc tgtcttaatt tcatacagac agaaatgact cccactgggg aaagagcaaa 14341 gcaatacatg tagcactctt tttcaaacac tggtcttttt ttttttctta acaatccaac 14401 attgttatgt gttttgcgtc tcatattgac accttttggt caaggtagag gacatgtttg 14461 ttgtaagctt tctttttcgt gtagaggatg gattcttcac tcctgataca cacaatcagt 14521 gcacagcagc tctcttatac atccagttga tgccttcagt ctccctggct tcttacaagc 14581 atcttctggg ccttgtgtgt ccctgggcac ctgtccctgg tcaattcccg aaagctactg 14641 tgctcctctt gcccatctcc ccttgcaaat aatatcttcc atcgggggac cggcttcctc 14701 caatttcagg agaggtgggg ctgaaggcac agacttgggc gtcactggca cagatataag 14761 taaatacagc tggagtctgc ag // LOCUS HSHLASBA 14646 bp DNA PRI 16-FEB-1995 DEFINITION Human HLA-SB(DP) alpha gene. ACCESSION X03100 NID g32243 KEYWORDS antigen; cell surface glycoprotein; class II antigen; glycoprotein; inverted repeat; Kpn repetitive sequence; major histocompatibility complex; repetitive sequence. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 14646) AUTHORS Lawrance,S.K., Das,H.K., Pan,J. and Weissman,S.M. TITLE The genomic organisation and nucleotide sequence of the HLA-SB(DP) alpha gene JOURNAL Nucleic Acids Res. 13 (20), 7515-7528 (1985) MEDLINE 86041930 COMMENT Data kindly reviewed (05-MAY-1987) by S.K. Lawrance. FEATURES Location/Qualifiers source 1..14646 /organism="Homo sapiens" /db_xref="taxon:9606" intron complement(1..78) /gene="HLA-SB beta" /number=1 prim_transcript complement(<1..247) /gene="HLA-SB beta" gene complement(1..247) /gene="HLA-SB beta" exon complement(79..247) /gene="HLA-SB beta" /number=1 CDS complement(<79..178) /gene="HLA-SB beta" /codon_start=1 /db_xref="PID:g32244" /translation="MMVLQVSAAPRTVALTALLMVLLTSVVQGRATP" misc_signal complement(323..374) /note="beta consensus sequence, put. regulatory region" repeat_unit 514..526 /note="imp. inverted repeat a" repeat_unit 2189..2200 /note="imp. inverted repeat a'" misc_signal 2461..2512 /note="alpha consensus sequence; put. regulatory region" mRNA join(2571..2749,6334..6579,6920..7201,7416..7582, 10872..11200) /gene="HLA-SB alpha" gene 2571..11200 /gene="HLA-SB" prim_transcript 2571..11200 /gene="HLA-SB alpha" exon 2571..2749 /gene="HLA-SB alpha" /number=1 gene 2571..11200 /gene="HLA-SB alpha" sig_peptide 2650..2742 /gene="HLA-SB alpha" misc_feature 2650..2749 /gene="HLA-SB alpha" /note="precursor fragment" CDS join(2650..2749,6334..6579,6920..7201,7416..7570) /gene="HLA-SB alpha" /codon_start=1 /product="class II antigen" /db_xref="PID:g673417" /translation="MRPEDRMFHIRAVILRALSLAFLLSLRGAGAIKADHVSTYAAFV QTHRPTGEFMFEFDEDEMFYVDLDKKETVWHLEEFGQAFSFEAQGGLANIAILNNNLN TLIQRSNHTQATNDPPEVTVFPKEPVELGQPNTLICHIDKFFPPVLNVTWLCNGELVT EGVAESLFLPRTDYSFHKFHYLTFVPSAEDFYDCRVEHWGLDQPLLKHWEAQEPIQMP ETTETVLCALGLVLGLVGIIVGTVLIIKSLRSGHDPRAQGTL" misc_feature 2724..2725 /gene="HLA-SB alpha" /note="pot. alternate signal sequence splice site" intron 2750..6333 /gene="HLA-SB alpha" /number=1 repeat_unit 4964..4990 /gene="HLA-SB alpha" /note="inverted repeat b" repeat_unit 5063..5088 /gene="HLA-SB alpha" /note="inverted repeat b'" exon 6334..6579 /gene="HLA-SB alpha" /number=2 intron 6580..6919 /gene="HLA-SB alpha" /number=2 exon 6920..7201 /gene="HLA-SB alpha" /number=3 intron 7202..7415 /gene="HLA-SB alpha" /number=3 exon 7416..7582 /gene="HLA-SB alpha" /number=4 intron 7583..10871 /gene="HLA-SB alpha" /number=4 misc_feature 8601..9100 /gene="HLA-SB alpha" /note="sequence homologous to IgC epsilon genes" repeat_unit 8991..9029 /gene="HLA-SB alpha" /note="inverted repeat C" repeat_unit 10516..10554 /gene="HLA-SB alpha" /note="inverted repeat C'" exon 10872..11200 /gene="HLA-SB alpha" /number=5 repeat_region 12301..12800 /note="Kpn repetitive sequence" BASE COUNT 4552 a 2957 c 2741 g 4395 t 1 others ORIGIN 1 tttgatccct taaagataac gtcccctagg aattgttccc tgagccagac cctccaagaa 61 tggcagttcg gctcttacct ggagtggccc tgccctggac cacagatgtg agcagcacca 121 tcagtaacgc cgtcagagcc actgtccggg gggccgcaga aacctgcaga accatcatgg 181 agctggaaaa ggatggcaaa atgaaaagag ctgcagtcag gaaaagaagg actcgctaaa 241 gggagctcct gtttgaaata ttagagacca tgaacccaag tagtcttctg tgaccctggg 301 attggacaga gtctgagaaa agaaccaatg gacactgagc tttgtatgag tcattgctca 361 ctaggcagaa agttagtatg aaaggtctga aaatataaag cctgtgatgc acttaagatg 421 acggaggaaa gacagtgata ctcattttaa ccagtcagat aagtcatgat gtttggggag 481 attatgcgtt ttctttgctc tgaaggtgat ctcaaatatt ctgctggccc atctacaggg 541 attatcattt ccccaattct gccacacctc acacacccac aggacatggt ctgttgtgga 601 aaaagtgcta tcttagtgtg taaaaggtca ttcagtggca tgacttagag ggattagagt 661 acccatctca gaactcaaat gaggtctgag tctgtctgtc ttgcctttgt ccaagggtgt 721 gtttaagatt agcacccatt catatttact ttctcccaga ggtctgtgag tcctgcgatg 781 tgcaggagtt accaggttct tccacaggac tgtcatcagg gtcaggaggg ctcagtctag 841 ggaccttaca ctgggagcgt ggacacacca cctaccctac catgtaaata tgcagcttta 901 acgacactgc ctctcttgga cttcgattcc ttgtcttcaa tatggggatg atataacctg 961 ccttatagca aagctcttaa gatcaaatga gatcaatagg tctgaaattg ctttggaaaa 1021 ataaagtcca tagagaatat aaattgactg gtaaataagg aagaaagtga aaaaaaatca 1081 tcaaagaccc cacctccttc aaagcatctg attctatttt gtttagagtg aatattttta 1141 caaactttca aagaacatta cttcttactt tagacactta taaatgtgat cctgttggac 1201 caccaattct aatttcaaaa aaaatcaaat tggcatcatt tctttaaatc aatattaaaa 1261 tgtaaataga aggctagcat gtaaaatgca agagaagaaa ataattatgt agtagaaggt 1321 gtctagtagg attaaggggc caggagagga ttgtggagaa aaactcagtt tttttagtaa 1381 ctgcctgggt gggacttttc tccttgtcat ttacctgcca tagataatta gacaaatact 1441 tcctcgatgt ctcaggtaat ttatttttac agtagacatt aataatacct ccctcaggtt 1501 taatatgagg tttaaatgag gtaatgcatg caaagcacta ataacagaat ctctcatgtc 1561 ctaatctgtt agtaattatt atttatgcaa ggtaaggcta ccacaaataa gataaaatat 1621 taacgtaata tatgtacttc atcattacat taaatagtaa atgttttgtg accttctcta 1681 tgagtatcaa ataaatagct caaaattgaa atatttaaaa tgagactaga aaagtatttc 1741 ttaaatatgg taaagagaat ataaaataaa aaattaacat tacctgtaac agaaaaacat 1801 tagagaaacc cttcttaaaa acaagatcaa aatatgggcg cacattgtta ccacaggtaa 1861 tgctgtcctg ggtattctta tatgaaaatg aaataacaag actagtaatt taaagataaa 1921 aatgttatta cactgtttaa ataccaggaa aaatcaaata ctaagaaatc tgaaaaataa 1981 aaattatcta gaaatttcac taacaaaggt aaccaggaat agataaacaa ttaaagatca 2041 acttcattca agttaattaa aacctctaaa tagaaattat aacacttctg cagaaatgat 2101 taaataaaaa ggaggtatct cagggaggtg gacctgaaag aaagattaat tatatatttt 2161 attagatagc acattcattc ttaattatca gcagaatatt tttgtaaaat attgacaaca 2221 taacataaaa catgatttaa aatatagtga ccaaaaaaca tcaaagatca tctgactgtc 2281 tgggatgggg gtggggctcg ggagaaggaa cataatggaa aacatttgcc gtcagaagaa 2341 ggtgtaactt atctttttac atctctttct ctaactctga aaatgaactg tgaactggag 2401 ctctcttgac cacgctggta cctaaaattc tcccatctct tccccagcac cttccagcgt 2461 cctctttacc cagcaacaga gaatgtcagc tctatgattt ctctgatagg tgaatcccag 2521 ccatgctgat tcctctccac ccatttccag tgctagaggc ccacagtttc agtctcatct 2581 gcctccactc ggcctcagtt cctcatcact gttcctgtgc tcacagtcat caattataga 2641 ccccacaaca tgcgccctga agacagaatg ttccatatca gagctgtgat cttgagagcc 2701 ctctccttgg ctttcctgct gagtctccga ggagctgggg ccatcaaggg tgagtgctca 2761 ggaggacgca ggagcgtcgg ggtgagtgat ggggtggttc acatcaattg ctgcttcagg 2821 gatcacagat tttaggggct cattgatcta tctggtcctc atagtctatg ttccctctgg 2881 ccctcataat aataacagca ataacagcca gaatttatga gactcctgca tagtttcttt 2941 ccccatttac atctcacagg aatcttcaat gaagataata ttccattcat ttagaaatta 3001 ttccttttat ttagaaatta ttttgaaaaa actgaagctc aaaaagatga ataagttttc 3061 caaggttaca cagcagatca acgagccaag tttgaagtcc agacccagct ctgagggtca 3121 tacactgcct tccccagatt cctgcacaca gtgacctact atcagggccc tcctatctct 3181 ctgggatccc cagcctctat cttttgtggc tgctttacag gaactccgag ctatggactc 3241 tgcattagga gacgaagtgc aaagagtgtt tctgtatcct ccctctcttc taggacccta 3301 gggctcttcc tgggtctttg tgggtggtca caagctttcc tctctcaaga cagcagggtt 3361 gcatggtctt gatagccttg tgattcgggt tctgagagat tcaggactgc aagggaggcc 3421 tagacttttg atagctgcaa ggactcagcc agagatggac cgtagtgaat gctccttttt 3481 cctgtagctg aaatcaggga gaatgacatc aagcctgtgc atgatgctgt cattccaaaa 3541 tctagtgatg gggaaggtta gaatccataa cgtacaagat gcacactggc ttcagacagt 3601 tttatttaag atgtgtagaa taaagaggag gtcaggctgg gtagaaccag aagtatctat 3661 tgccctgttc gcggtcacct gagttatttc taatgttatg ttataataaa caccacaata 3721 ggcttctctt catagatgca aatacttttt agtattcttg gtagaaattc ctaatgagct 3781 cagctgtctc ttcagggctt ccctgcccag tctcttaaca tttaaacatg tcatttacct 3841 taaaaacata agtgcaaacc aactgataaa aaacaacctt gccttcagtc tgcatcctgt 3901 cccagagaca ctttctttgt gtcctcacac gtggagctaa gcttctgact tgtctctggt 3961 acatccctga ggatcctctc atcttggcca tcaggaacct ctacagaagg tcaaattcag 4021 tgggttcttc tcagtgcctc tgacttgagt tactaataac atttgcacta taatccactt 4081 ctttctgatg aactaccctg tccttatttt tctcctgttt acctggatcc tccttatcat 4141 cttttaaacc acctcttaac tatcatgttc tctcattata ccctgagatc tcggcaattc 4201 tgatttttgg cactcttcct ggaaaatctt atttaacctg cacctgccac taatgactct 4261 cagttctatg gcctaaattc ctctcctgag accacccata atccacaaat atctatgtat 4321 tatttctcct tagatgactt tcaggtcttc taagtgcaat agccccacag taaactcagt 4381 atcttctccc ggtcaggctg tcttccctga gagaagtggc ttttgccctg ttttctgaat 4441 gcctacattg aagccatctg ttccccagga agccttccct gatgtgctgt ttggtcgcat 4501 cttgtgtata cctacgtatc tgcacttatc cttctgaacc tgctgttgtc ctgtcacttg 4561 tgtttccttc tgtgacttat acgcgtctgc agaacaggac gtatgtatta tttttatttg 4621 ggtatttagc atctaacagt gtttgacata tagtagtctt ttaatacata tttttgtctg 4681 aatggaaatg atattttgaa gaaaaataat ctgttccata gctggctgat ctttggactg 4741 cagaacttgt gaaagtgttt tttaaaaagc attttaaaaa gtacaaggga cattcatgta 4801 ttaagaagat gagtttccaa taactgctag aggactttgt gtctttttat tttaccctct 4861 ttttcctgat gagtcctttg agtcctttaa actgaggagc aagctaagtt tcctagtgaa 4921 atacctatag gatttgtttt gtttagtttc aaataccact ctttgcttgg ccacttactg 4981 tgtcagggag tcattctcag tgaaaaataa gacacaggtc ataccctcta gacacttaca 5041 attacagtgg caaggagtca ttctcctgtc actgtaagtg gccaagcaca gactgggtcc 5101 ccacatgtca gggctgaaaa ctcacaggga aatctgtgag ttgggaggtg agagcagaag 5161 agtcccgtag ttccttctca ctctgatgca tttatcattc taaacccaga ctttcacata 5221 cacattcatc gttttctttc atgataatag ttgcttttat cctcttatct ttgctaattc 5281 ttacaaacta ataaagacta agaaacaaaa taaattaaat cctacaggtg ttccaaactc 5341 agcaataatt tctagttggc ctctaaaaca aaaatcaaaa tataaatgta agaaaagttt 5401 agaatgctta gtacctgtgt gatgaaataa tctgtacact aaacccccaa gtcatgagtt 5461 tacctataca acaaacctgc acatgtactc ctgaacataa aataaatgtt gaaatatttt 5521 taaaaaggaa acaaaagttt ggaacaaatg ccaaaataac tgtactgtac ttttgaattt 5581 atatgcccca aatgaaaaat attatcaaca aagctataca ttctacagtt tcatgttcat 5641 aaactaagac agaaacttta aaactgtcaa gagccctaaa atttgaagga tattttcttc 5701 ttcctctcaa ttttgtattt ttttctacct tttctataat aagaaaaaga aaatgtccat 5761 tcccccaccc ccatgactct aaaaacaatt ttacatctgt gtcatagaaa aattaagatc 5821 ttaatgggag agaaaacctc tctactagtt ccgccagtag ccgtatgacc ttagcaagtt 5881 attaatatgt aacttccctg catttcctta cctgtaaaat gtatgatatg tatttgcttc 5941 atagggttat tgtgacaatt cagcgagtga aatatgtaaa gtatttagaa ggatgcctgg 6001 cacaagtaag tgctcaacaa atgttagctg tcattgttac tattactatt gtgtagggtc 6061 aggatgccca gactttcaaa gaccaggaag cagcttgact tatcagtgat aaacttttca 6121 ttttgttctt tgctcctttc tttttataac tgctcatctg ctctgtatta tttcctttat 6181 ggtgttgctc cttcttcttc cccatatgtc cttcctttga cctcttacct tcttcctttt 6241 tatattcata agtctttatt cattctctag ctttgaccac ttgcatattc aaactgacat 6301 tttgtcgtgt ttttctctac tgtctttatg cagcggacca tgtgtcaact tatgccgcgt 6361 ttgtacagac gcatagacca acaggggagt ttatgtttga atttgatgaa gatgagatgt 6421 tctatgtgga tctggacaag aaggagaccg tctggcatct ggaggagttt ggccaagcct 6481 tttcctttga ggctcagggc gggctggcta acattgctat attgaacaac aacttgaata 6541 ccttgatcca gcgttccaac cacactcagg ccaccaacgg tacgccctat ctttgcctct 6601 tcctctgtag cccaactgga agggatgaga gggcctctct gccaccctca gactaggaag 6661 cctaagtgcc ccctgctgtg tgatcctctt cccctagtgg ccatgggctg atcccactac 6721 agcaagggct tgcatcctct cttctcagga gagagaaagg tgagcagagt gaggctggtc 6781 agtggtgtga tacccctctc tgtgattcag agctgccata aaatctaagg ctgaggtaga 6841 ggaccaccct cccctaagag gtggagcctt tgtgattcat cccagaagag gggcctaacc 6901 tggtgctgtc tccttccaga tccccctgag gtgaccgtgt ttcccaagga gcctgtggag 6961 ctgggccagc ccaacaccct catctgccac attgacaagt tcttcccacc agtgctcaac 7021 gtcacgtggc tgtgcaacgg ggagctggtc actgagggtg tcgctgagag cctcttcctg 7081 cccagaacag attacagctt ccacaagttc cattacctga cctttgtgcc ctcagcagag 7141 gacttctatg actgcagggt ggagcactgg ggcttggacc agccgctcct caagcactgg 7201 ggtatgcaac tgcttttctc tccataatct cctggcatcc tctattccaa agacctggtg 7261 tcctctgcac cagctttccg cactggctgg gtctcagtcc tctcctcgtc ctaacatcca 7321 attaactggt ccataacctt caattcccac aaccatccca ggccatcacc accctcactg 7381 cacctcctga ccctatctct tcattcttcc cccagaggcc caagagccaa tccagatgcc 7441 tgagacaacg gagactgtgc tctgtgccct gggcctggtg ctgggcctag tcggcatcat 7501 cgtgggcacc gtcctcatca taaagtctct gcgttctggc catgaccccc gggcccaggg 7561 gaccctgtga aatactgtaa aggtgggaat gtaaagagga ggccctagga tttgtagaat 7621 gtaaggaagg gaggaaaaat tcaatctgat aagtgttcat tgatcttcta atgggttaaa 7681 agcattcagc cacataacaa caacaacacc gataactaac tgagtagtta atatggtcag 7741 gcgctattct gaggatttac atttattaac tcactttatt ctcacacata gtctttgagg 7801 taggtactat tattttcact atttcacatg agagatactt acatcttttt acatacacag 7861 agactttaag cactttgatc aagttcccac agctatgaag tagtagggct agcttccaat 7921 ccagaaagtc tggatccaag actgtttatc cactgtccta ttcaccctat tttgtgaagg 7981 aaaagaccaa gttcaaattc tccagagtcc attgccaaat aatggagtca gatctatatt 8041 tctatacata attacaacac agtgtggtgg gtgcctgtaa ctacttactg tctctacttg 8101 gactcattcc atggcaatgt tcacacaaaa aatgcccctc cagagatctt acaggtttct 8161 atttatcata acactcacca tgctttatat ttttatatgt tttgggaatt ctcttagcat 8221 tagacagtga acttccatgc agatgaccac atctaattca ttattattat tgttattcat 8281 gctggacctc aggtacaaaa ggttaagaac ttctcagttc attatatgat catcattggt 8341 gcctccgagc tctctctctc tcccttgatt tatttggtcc cttttatctc cagtccttac 8401 tcccatatct aacctcttac ccctacctca taggtaaaca ttttaatgaa tttgatgttt 8461 ccttttattt gcatagatcc tctgtaatat gtagtagtgt ccagtgtaca tgtattttta 8521 attaaccaaa atggcattaa attatagatc taattttgta catccagttt gtttcttcca 8581 aatcttccat agtattttac tttatatgtc catgcattag tccattttgc attgctataa 8641 aggaatatct gaagttacct aatttacgaa gaaaagagct ttaaatggct cacagatctg 8701 caggctgtac gcgaaacatg gcactagcat ctgcttctgt tgggggattc tggaagcttt 8761 tactcatggt ggaaggcaag tggagccagt gcatcacatg gtcatagagg gagaaagaga 8821 catagaaaga ggtgccagcc tctttttaac aaccaggttt catgtgcact aatagagtga 8881 gaactcactc attacccgga gaggggacaa agccattcat gagggtctcc tccatgattc 8941 aaatacctcc caccaggccc cacctgcaac actggggatc aattttcaac atgagacttg 9001 gaagtgacaa atatccaaat catattaatc cacatatcta cattgctcct gggatacctg 9061 gatcattcct ggttctctac tattgcaagc aatgcttgta tctcacatgg aactgcatat 9121 acatgtgggc ctgacctgca tccctggaat gtatgtatcc tagaaagggg ttgcagggtt 9181 gctggagatg cagctcctta atttgactaa acactgctca tcttctcatc agaatggctg 9241 tactcatctg aacttccttt gtcagtactc taattgtcct gcaactccta aatggacttc 9301 aacactggac attatccagt tttctaactt ttgccaattt catgtgcata aagaaatatg 9361 ctgttttatt ttgcatttct ttaattacta ataattgggg ctataattag gactgattag 9421 ccacttgggg gttccttttc tataaattgc ctgttcacat tcattgtcca tttttgtact 9481 atgtgcttcc atcattttct tattgatttg caggtgatcc ttatatagtc ctgctagtag 9541 tcccttgtca gttttaggca ttgcaaatgt tttcctctaa tctgacttct ggcaactgtc 9601 tccttggttt cctttattga agagaaatcc ttaatatttt gtaatgaagt ccatcaactg 9661 tatttttgtt tgtgtgtctt ttttaaaaga agtcttccct atactgagat atcaaagata 9721 ctcttaaaac atctcctaca gttttaaatt tcacatttac tactttaatt catctgggat 9781 tcatctttgt gtttgatggg gatcatgttt tatttttctt tatataatgg gccagtgtgt 9841 tcccacaact actaaatagt tcaccttttc cccataggtt agtagtgtct cctttgctat 9901 actgaaagct cccattatag gtgggcctgt gtctgagttc catcttgttc cactgttctg 9961 tttgtctctt cttgtgccag tgtcctagta ttttgattac tatgacattg tagtgtgtgt 10021 tagtatccag taggacaaat tcttgtttat ttttcttagt tcacacacat ttataattat 10081 atctataatg atttgtaaca gagtgaagtg aatgtagaat gtcagatgtt aagaggaaga 10141 atggaaaaga gggctgggac tagggtgatg taggggatgc acctggctta ggtgcaaaat 10201 ttgggggata ccaaaagaac tcagtaataa ntcatatttt aatgaaatat cttgaaaagg 10261 caaaattaat gcaaagatac atgattaaca aaacatccaa agaggagtat ttaacaaaaa 10321 tggagaagca gagaagcaga agaattagga gaatatgctg tcacatgagc caaggaatta 10381 aagaattcag gaaggaggaa gtactgctgt cagatgttca acagaggtca ttttagaaaa 10441 tttaccttgg tttttgaaat cctttcaaag agcagtatac acaatgtgag caagtatcct 10501 tcgttcattg ccgtcattga tatggtttgg atatttgtcc cttccaattc tcattccagg 10561 gttaagcttc ttctctgccc tcagtaatgt ggcccttccc cttgtctgta tattttggag 10621 acatgaagca tgtgggatgg cctcacagtc agctggggtt tgagggtgaa attcaatgac 10681 tttcgtgaac tccttggctc ctatgtgctc ttcactggag gaccagggca tgtgcaggga 10741 tgaccacctt ctccctggga cctgaacagg gcagagaaat gggaagctcg ggtgcaaagg 10801 gagtggggaa gatgggtccg ggcttacagt actgaaccca ggaatgacaa taactgtgtg 10861 tgttgctgca ggtgacaaaa tatctgaaca gaagaggact taggagagat ctgaactcca 10921 gctgccctac aaactccatc tcagcttttc ttctcacttc atgtgaaaac tactccagtg 10981 gctgactgaa ttgctgaccc ttcaagctct gtccttatcc attacctcaa agcagtcatt 11041 ccttagtaaa gtttccaaca aatagaaatt aatgacactt tggtagcact aatatggaga 11101 ttatcctttc attgagcctt ttatcctctg ttctcctttg aagaacccct cactgtcacc 11161 ttcccgagaa taccctaaga ccaataaata cttcagtatt tcagagcggg gagactctga 11221 gtcattctta ctggaagtct aggaccaggt cacatgtgaa tactatttct tgaaggtgtg 11281 gtttcaacct ctgttgccga tgtggttact aaaggttctg atcccacttg aacggaaagg 11341 tctgaggata ttgattcagt cctgggtttt tccctaacta caggataggg tggggtagag 11401 aaaggatatt tgggggaaat tttacttgga tgaagatttt cttggatgta gtttgaagac 11461 tgcagtgttt gaagtctctg agggaagaga tttggtctgt ctggatcaag atttcaggca 11521 gattaggatt ccattcacag cccctgagct tccttcccaa ggctgtattg taattatagc 11581 aatatttcat ggaggatttt tctacatgat aaactaagag ccaagaaata aaatttttaa 11641 aatgccctaa ttcattgcaa tttttaccag ccatagtcac tccatgtggg agaacttaaa 11701 tcatgattac cagagctttc aaaggtttga gaatagtgat gattatgaag aaaaatatct 11761 tatttgagca aggattttgt ttctttatga gtgttcatta gatattacga tgaaaaaagc 11821 atgaaatggt aaaaattcag ataaatataa aaacatgttc tctagttttt tttaagttaa 11881 aaaaggaatt gtttaaagta aaaattattt gggggtttat aacataccca gaagtaaaat 11941 atgatgacaa tggcacaaag aatagaaggg agaaatggaa gtataatgtt gtaagtttct 12001 tatacatgtt aagtggtgtg ttattatttg aaggtagaat gtattaagat gaatatttta 12061 agctcctgat aactattgaa aaaaaaagag gtatagccaa gaggccaatg gagaagataa 12121 aatagaacac taagcataat taattcaaaa taaagaaata aaaaagggaa agtctggtaa 12181 gacaaaaaga aaacaaactg taagatggta gagtttaaaa caaccatact aataattgaa 12241 ttaaatgcac atggcctaaa tattctaatg aaaagggaaa gattgtcaga atgcacaaaa 12301 aaatctacag gccaacttca tgctctctac ataatgccct ctttaaatat gaaggcaaag 12361 acaggtaaaa agtaaaagaa tgggaaaata catgtatacc gtggaatgct atgcagccat 12421 aaaaaaatga gttcatgttg tttgtgggga catggatgaa gctggaagcc atccttcaca 12481 gcaaactaac acaggaacag aaaaccaaac accacacgtt ctcactcgta agtgggagtt 12541 caacaattag aacacatgga cacagggagg ggaacacctc acaccagggt ctgtcagggc 12601 atggggagca aggggaggga gagcattagg acacataccg aatgtattgc atggcttaaa 12661 acctagatga tgggttgata gatgcagcaa accacatggc acatgtataa ctatgtaaca 12721 aacctgcaca ttctgcacat gtatcccaga acttaaagta aaaaaaaaaa aaacgaaaat 12781 aatgccaacc atggaagtat tggtggctgt gttaatatca gaaatataag actcagaaat 12841 attaccaagg agaaagaagg atagttcata atgataaaag atcattttat tatcaacata 12901 taacaatcct aaatgtgttt gttctagaaa atatgtctta aatcacatta taccaaaaat 12961 gataaaaata aatcagaaat agacaaattc acaattatat tttagtattc tagcactcag 13021 taaacaataa aatatttagg aaaaaacttc atgaggacat gatagattta aataacatta 13081 tcaatgaacc aacgtgatct aatcaagatc tgtagaatat tccacccaat agtggcagaa 13141 tacacattat tttcaaatgc tcaagaatat tccacaggac agactataca ctgggtcata 13201 aacacatata aataaatgtc taaataaata aatgtctcta ttgaaatcat acagaataca 13261 ttctcggacc acaatagcat taaattagaa accaataaca gaaaaatacc ttgaaagtcc 13321 caaataccta gaaattaaaa agtatactgc taaacagcac ctggatttaa aaagagtcag 13381 aaggaaaatt agaaaatatt ttgaactgag tgaatatgaa agcacactat caaaattagt 13441 atgatacact aattacacaa taatttataa ctttacataa ttggaaagga gggaaactct 13501 aaaatcaacc atctatgttc ccatcttaag aagctagaaa aaaaaagtca aatgaatccc 13561 aagattaata agatcagaaa taaatgcaat aaaatggaca aacaataaag aaaataaaca 13621 aagtcaattg ctggttttca ataaggctca atacattcat gaatctctag gtagatggat 13681 caagaaaaag agataagact caaatcccca atatcagaaa tggaagtggg tacgtcacaa 13741 caaatcatac agacattaaa agtattatga cagaatgcta tgaaaatgcc aataaacaaa 13801 aatgacaata aatttgacaa tttacattgt taaattaaat taaatttggt gtaaagcttt 13861 ctccatatct taaattccta catagcaaac taacccaact taacataact gcgttatgca 13921 aacaaactac agcctaactt aagagtgttg taataaatag ctgagtctca gccaatcaca 13981 ggctgccaag tgatcatatt atgtccccca taagcaaatg cctcatcacg ccatgcccat 14041 ataaggcaaa cactgagctg taataaattg gctggttttg agtatcactt cctgttttta 14101 tctataaaca ctgccttcac atgttgctgg acagagcttt ctgaatcttt ctgggttctg 14161 agggctcccc aattcatgaa ttgttctttg ctaaataaac tctgttaaat tcaacttctc 14221 taaaattttt attttaacaa catgaaatag aaacatttct tgaacgattc aaattaccaa 14281 aactaaatca agaagaaatc tctgtcaatg gaagaaatta agtttgtaat tttaaaaatc 14341 cttctcacag agaaggccaa atggtttcca ggtcagatgg cttcattgat taatccatag 14401 tattctgtca taatactgtc ataatacttt taatgtctgt atgattaatt ctattataca 14461 tttaaggaag aaacaataca aactcaatac aaactctttc aacaaaagag cagaagaaaa 14521 cacatcccaa taatttacaa gtccaatata ctaacatcaa acaatgacat taaaagaaaa 14581 ggaaactaaa gacacacaaa catagccatg aaaatgttta acaaaatatt ttttaaaatt 14641 gaattc // LOCUS HSHMGCOAS 2058 bp RNA PRI 31-JUL-1995 DEFINITION H.sapiens mRNA for 3-hydroxy-3-methylglutaryl coenzyme A synthase. ACCESSION X83618 NID g619876 KEYWORDS hydroxymethyl-CoA synthetase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2058) AUTHORS Mascaro,C., Buesa,C., Ortiz,J.A., Haro,D. and Hegardt,F.G. TITLE Molecular cloning and tissue expression of human mitochondrial 3-hydroxy-3-methylglutaryl-CoA synthase JOURNAL Arch. Biochem. Biophys. 317 (2), 385-390 (1995) MEDLINE 95200282 REFERENCE 2 (bases 1 to 2058) AUTHORS Hegardt,F.G. TITLE Direct Submission JOURNAL Submitted (23-DEC-1994) F.G. Hegardt, Unit of Biochemistry,, University of Barcelona, School of Pharmacy, Avda. Diagonal 643, 08028 Barcelona, SPAIN COMMENT Related sequences: M33648, M60657, U12788, X52625, X66435, X73679, X77516 and J. Biol. Chem. 261, 3710-3716 (1986). FEATURES Location/Qualifiers source 1..2058 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 52..1578 /EC_number="4.1.3.5" /codon_start=1 /product="hydroxymethylglutaryl-CoA synthase" /db_xref="PID:g619877" /translation="MQRLLTPVKRILQLTRAVQETSLTPARLLPVAHQRFSTASAVPL AKTDTWPKDVGILALEVYFPAQYVDQTDLEKYNNVEAGKYTVGLGQTRMGFCSVQEDI NSLCLTVVQRLMERIQLPWDSVGRLEVGTETIIDKSKAVKTVLMELFQDSGNTDIEGI DTTNACYGGTASLFNAANWMESSSWDGRYAMVVCGDIAVYPSGNARPTGGAGAVAMLI GPKAPLALERGLRGTHMENVYDFYKPNLASEYPIVDGKLSIQCYLRALDRCYTSYRKK IQNQWKQAGSDRPFTLDDLQYMIFHTPFCKMVQKSLARLMFNDFLSASSDTQTSLYKG LEAFGGLKLEDTYTNKDLDKALLKASQDMFDKKTKASLYLSTHNGNMYTSSLYGCLAS LLSHHSAQELAGSRIGAFSYGSGLAASFFSFRVSQDAAPGSPLDKLVSSTSDLPKRLA SRKCVSPEEFTEIMNQREQFYHKVNFSPPGDTNSLFPGTWYLERVDEQHRRKYARRPV " BASE COUNT 518 a 533 c 500 g 507 t ORIGIN 1 cggtttctgc tgggtttctg aactgctggg tttctgcttg ctcctctgga gatgcagcgt 61 ctgttgactc cagtgaagcg cattctgcaa ctgacaagag cggtgcagga aacctccctc 121 acacctgctc gcctgctccc agtagcccac caaaggtttt ctacagcctc tgctgtcccc 181 ctggccaaaa cagatacttg gccaaaggac gtgggcatcc tggccctgga ggtctacttc 241 ccagcccaat atgtggacca aactgacctg gagaagtata acaatgtgga agcaggaaag 301 tatacagtgg gcttgggcca gacccgtatg ggcttctgct cagtccaaga ggacatcaac 361 tccctgtgcc tgacggtggt gcaacggctg atggagcgca tacagctccc atgggactct 421 gtgggcaggc tggaagtagg cactgagacc atcattgaca agtccaaagc tgtcaaaaca 481 gtgctcatgg aactcttcca ggattcaggc aatactgata ttgagggcat agataccacc 541 aatgcctgct acggtggtac tgcctccctc ttcaatgctg ccaactggat ggagtccagt 601 tcctgggatg gtcgttatgc catggtggtc tgtggagaca ttgccgtcta tcccagtggt 661 aatgctcgtc ccacaggtgg ggccggagct gtggctatgc tgattggccc aaaggcccct 721 ctggccctgg agcgagggct gaggggaacc catatggaga atgtgtatga cttctacaaa 781 ccaaatttgg cctcggagta cccaatagtg gatgggaagc tttccatcca gtgctacttg 841 cgggccttgg atcgatgtta cacatcatac cgtaaaaaaa tccagaatca gtggaagcaa 901 gctggcagcg atcgaccctt cacccttgac gatttacagt atatgatctt tcatacaccc 961 ttttgcaaga tggtccagaa gtctctggct cgcctgatgt tcaatgactt cctgtcagcc 1021 agcagtgaca cacaaaccag cttatataag gggctggagg ctttcggggg gctaaagctg 1081 gaagacacct acaccaacaa ggacctggat aaagcacttc taaaggcctc tcaggacatg 1141 ttcgacaaga aaaccaaggc ttccctttac ctctccactc acaatgggaa catgtacacc 1201 tcatccctgt acgggtgcct ggcctcgctt ctgtcccacc actctgccca agaactggct 1261 ggctccagga ttggtgcctt ctcttatggc tctggtttag cagcaagttt cttttcattt 1321 cgagtatccc aggatgctgc tccaggctct cccctggaca agttggtgtc cagcacatca 1381 gacctgccaa aacgcctagc ctcccgaaag tgtgtgtctc ctgaggagtt cacagaaata 1441 atgaaccaaa gagagcaatt ctaccataag gtgaatttct ccccacctgg tgacacaaac 1501 agccttttcc caggtacttg gtacctggag cgagtggacg agcagcatcg ccgaaagtat 1561 gcccggcgtc ccgtctaaag gtgttctgca gatccatgga aagcttcctg ggaaacgtat 1621 gctagcagag cttctccccg tgaatcatat ttttaagatc ccactcttag ctggtaaatg 1681 aatttgaatc gacatagtag ccccataagc atcagccctg tagagtgagg agccatctct 1741 agcgggccct tcattcctct ccatgctgca atcactgtcc tgggcttatg gtgcctatgg 1801 actaggggtc ctttgtgaaa gagcaagatg gagcaatgga gagaagacct cttcctgaat 1861 cactggactc cagaaatgtg catgcagatc agctgttgcc ttcaagatcc agataaactt 1921 tcctgtcatg tgttagaact ttattattat taatattgtt aaacttctgt gctgttcctg 1981 tgaatctcca aattttgtac cttgttctaa gctaatatat agcaattaaa aagagagaaa 2041 gagaaaaaaa aaaaaaaa // LOCUS HSHMGICP 4633 bp RNA PRI 22-APR-1996 DEFINITION H.sapiens mRNA for HMGI-C protein. ACCESSION X92518 NID g1225979 KEYWORDS HMGI-C gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4633) AUTHORS Ashar,H.R., Cherath,L., Przybysz,K.M. and Chada,K. TITLE Genomic characterization of human HMGIC, a member of the accessory transcription factor family found at translocation breakpoints in lipomas JOURNAL Genomics 31 (2), 207-214 (1996) MEDLINE 96422186 REFERENCE 2 (bases 1 to 4633) AUTHORS Ashar,H.R. TITLE Direct Submission JOURNAL Submitted (24-OCT-1995) H.R. Ashar, Department of Biochemistry, Robert Wood Johnson Medical School, University of Medicine and Dentistry of New Jersey, Piscataway NJ 08854, USA FEATURES Location/Qualifiers source 1..4633 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="DLD-1" /clone_lib="YAC yWPR 384 (CGM)" mRNA join(463..1421,1422..1508,1509..1559,1560..1592, 1593..4633) /gene="HMGI-C" exon 463..1421 /gene="HMGI-C" /number=1 gene 463..4633 /gene="HMGI-C" CDS 1311..1640 /gene="HMGI-C" /codon_start=1 /db_xref="PID:e206539" /db_xref="PID:g1225980" /translation="MSARGEGAGQPSTSAQGQPAAPAPQKRGRGRPRKQQQEPTGEPS PKRPRGRPKGSKNKSPSKAAQKKAEATGEKRPRGRPRKWPQQVVQKKPAQEETEETSS QESAEED" exon 1422..1508 /gene="HMGI-C" /number=2 exon 1509..1559 /gene="HMGI-C" /number=3 exon 1560..1592 /gene="HMGI-C" /number=4 exon 1593..4633 /gene="HMGI-C" /number=5 terminator 1638..1640 /gene="HMGI-C" BASE COUNT 1257 a 1183 c 908 g 1285 t ORIGIN 1 atccatgctt tacactttat gcttcggccg tatgttgtgt ggaattgtga cggataacaa 61 tttcacacag gaaacagcta tgaccatgat tacgccaagc tcgaaattaa ccctcactaa 121 agggaacaaa agctggtacc gggccccccc tcgacggtat cgataagctt gatatcgaat 181 tcctgcagcc cgggggatcc cccgctgtcc ctttaacccc gccgccgggc gcacgtgagc 241 ggctccgggt ggcacccggc gccccggccg ccgaggcagt tgtatttcga acgtgcctct 301 ggctagcagc caggcgcctt ggctcgcggt ccgcctggcc tccctcctcc tcatactttt 361 cttcctgcgc aaccccctcc cctttatccg cccacgatta gaggtgggca ctccccccac 421 caccaccccc tccccaacgc aagcgcgtgc acgcacacac accacacaca ctcacactca 481 cacacactca cacacactca tcccacttga atcttggggc aggaactcag aaaacttcca 541 gcccgggcag cgcgcgcttg gtgcaagact caggagctag cagcccgtcc ccctccgact 601 ctccggtgcc ggcgctgcct gctcccgcca ccctaggagg cgcggtgcca cccactactc 661 tgtcctctgc ctgtgctccg tgcccgaccc tatcccggcg gagtctcccc atcctccttt 721 gctttccgac tgcccaaggc actttcaatc tcaatctctt ctctctctct ctctctctct 781 ctctctctct ctctctctct ctctctctct cgcagggtgg ggggaagagg aggaggaatt 841 ctttccccgc ctaacatttc aagggacaca attcactcca agtctcttcc ctttccaagc 901 cgcttccgaa gtgctcccgg tgcccgcaac tcctgatccc aacccgcgag aggagcctct 961 gcgacctcaa agcctctctt ccttctccct cgcttccctc ctcctcttgc tacctccacc 1021 tccaccgcca cctccacctc cggcacccac ccaccgccgc cgccgccacc ggcagcgcct 1081 cctcctctcc tcctcctcct cccctcttct ctttttggca gccgctggac gtccggtgtt 1141 gatggtggca gcggcggcag ctaagcaaca gcagccctcg cagcccgcca gctcgcgctc 1201 gccccgccgg cgtccccagc cctatcacct catctcccga aaggtgctgg gcagctccgg 1261 ggcggtcgag gcgaacggct gcagcggcgg tacgggcggc gggaggcagg atgagcgcac 1321 gcggtgaggg cgcggggcag ccgtccactt cagcccaggg acaacctgcc gccccagcgc 1381 ctcagaagag aggacgcggc cgccccagga agcagcagca agaaccaacc ggtgagccct 1441 ctcctaagag acccagggga agacccaaag gcagcaaaaa caagagtccc tctaaagcag 1501 ctcaaaagaa agcagaagcc actggagaaa aacggccaag aggcagacct aggaaatggc 1561 cacaacaagt tgttcagaag aagcctgctc aggaggaaac tgaagagaca tcctcacaag 1621 agtctgccga agaggactag ggggcgccaa cgttcgattt ctacctcagc agcagttgga 1681 tcttttgaag ggagaagaca ctgcagtgac cacttattct gtattgccat ggtctttcca 1741 ctttcatctg gggtggggtg gggtggggtg ggggaggggg gggtggggtg gggagaaatc 1801 acataacctt aaaaaggact atattaatca ccttctttgt aatcccttca cagtcccagg 1861 tttagtgaaa aactgctgta aacacagggg acacagctta acaatgcaac ttttaattac 1921 tgttttcttt tttcttaacc tactaatagt ttgttgatct gataagcaag agtgggcggg 1981 tgagaaaaac cgaattgggt ttagtcaatc actgcactgc atgcaaacaa gaaacgtgta 2041 cacttgtgac gtcggcattc atataggaag aacgcggtgt gtaacactgt gtacacctca 2101 aataccaccc caacccactc cctgtagtga atcctctgtt tagaacacca aagataagga 2161 ctagatacta ctttctcttt ttcgtataat cttgtagaca gcttacttga tgatttttaa 2221 ctttttattt ctaaatgaga cgaaatgctg atgtatcctt tcattcagct aacaaactag 2281 aaaaggttat gttcattttt caaaaaggga agtaagcaaa caaatattgc caactcttct 2341 atttatggat atcacacata tcagcaggag taataaattt actcacagca cttgttttca 2401 ggacaacact tcattttcag gaaatctact tcctacagag ccaaaatgcc atttagcaat 2461 aaataacact tgtcagcctc agagcattta aggaaactag acaagtaaaa ttatcctctt 2521 tgtaatttaa tgaaaaggta caacagaata atgcatgatg aactcaccta attatgaggt 2581 gggaggagcg aaatctaaat ttcttttgct atagttatac atcaatttaa aaagcaaaaa 2641 aaaaaaaggg gggggcaatc tctctctgtg tctttctctc tctctccctc tccctctctc 2701 ttttcattgt gtatcagttt ccatgaaaga cctgaatacc acttacctca aattaagcat 2761 atgtgttact tcaagtaata cgttttgaca taagatggtt gaccaaggtg cttttcttcg 2821 gcttgagttc accatctctt cattcaaact gcacttttag ccagagatgc aatatatccc 2881 cactactcaa tactacctct gaatgttaca atgaatttac agtctagtac ttattacatg 2941 ctgctataca caagcaatgc aagaaaaaaa cttactgggt aggtgattct aatcatctgc 3001 agaacaaaaa gtacacttaa ttacagttaa agaagcaatc tccttactgt gtttcagcat 3061 gactatgtat ttttctatgt ttttttaatt aaaaatttta aaatacttgt ttcagcttct 3121 ctgctagatt tctaaattaa cttgaaaatt ttttaaccaa gtcgctccta ggttcttaag 3181 gataattttc cacaatcaca ctacacatca cacaagattt gactgtaata tttaaatatt 3241 accctccaag tctgtacctc aaatgaattc tttaaggaga tggactaatt gacttgcaaa 3301 gacctacctc cagacttcaa aaggaatgaa cttgttactt gcagcattca tttgtttttt 3361 caatgtttga aatagttcaa actgcagcta accctagtca aaactatttt tgtaaaagac 3421 atttgataga aaggaacacg tttttacata cttttgcaaa ataagtaaat aataaataaa 3481 ataaaagcca accttcaaag aaacttgaag ctttgtaggt gagatgcttc ttgccctgct 3541 tttgcataat gcaatcaaaa atatgagttt ttaagattag ttgaatataa gaaaatgctt 3601 gacaaatatt ttcatgtatt ttacacaaat gtgatttttg taatatgtct caaccagatt 3661 tattttaaac gcttcttatg tagagttttt atgcctttct ctcctagtga gtgtgctgac 3721 tttttaacat ggtattatca actgggccag gaggtagttt ctcatgtcgg cttttgtcag 3781 tatggctttt agtactgaag ccaaatgaaa cacaaaacca tctctcaacc agctgcttca 3841 gggaggtagt tcaaggcaca tacctctctg agactgcaga tcgctcactg ttgtgaatca 3901 ccaagagcta tggagagaat aaactcaaca ttactgttaa ctgtgcgtta aataagcaaa 3961 taaacagtgg ctcataaaaa taaaagtcgc attccatatc tttggatggg ccttttagaa 4021 acctcattgg ccagctcata aaatggaagc aattgctcat gttggccaaa catggtgcac 4081 cgagtgattt ccatctctgg taaagttaca cttttatttc ctgtatgttg tacaatcaaa 4141 acacactact acctcttaag tcccagtata cctcattttt catactgaaa aaaaaagctt 4201 gtggccaatg gaacagtaag aacatcataa aatttttata tatatagttt atttttgtgg 4261 gagataaatt ttataggact gttctttgct gttgttggtc gcagctaaat aagactggac 4321 atttaacttt tctaccattt ctgcaagtta ggtatgtttg ccaggagaaa agtatcaaga 4381 cgtttaactg cagttgactt tctccctgtt cctttgagtg tcttctaact ttattctttg 4441 ttctttatgt agaattgctg tctatgattg tactttgaat cgcttgactt gttgaaaata 4501 tttctctagt gtattatcac tgtctgttct gcacaataaa cataacagcc tctgtgatcc 4561 ccatgtgttt tgattcctgc tctttgttac agttccatta aatgagtaat aaagtttggt 4621 caaaatagat caa // LOCUS HSHMLHI 2503 bp mRNA PRI 31-MAR-1994 DEFINITION Human DNA mismatch repair (hmlh1) mRNA, complete cds. ACCESSION U07418 NID g466461 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2503) AUTHORS Papadopoulos,N., Nicolaides,N.C., Wei,Y.F., Ruben,S.M., Carter,K.C., Rosen,C.A., Haseltine,W.H., Fleischmann,R.D., Fraser,C.M., Adams,M.D., Venter,J.C., Hamilton,S.R., Petersen,G.M., Watson,P., Lynch,H.T., Peltomaki,P., Mecklin,J.P., Chapelle,A.D., Kinzler,K.W. and Vogelstein,B. TITLE Mutation of a mutL Homolog is Associated with Hereditary Colon Cancer JOURNAL Science 263, 1625-1629 (1994) MEDLINE 94174309 REFERENCE 2 (bases 1 to 2503) AUTHORS Wei,Y.F. TITLE Direct Submission JOURNAL Submitted (04-MAR-1994) Ying-Fei Wei, Molecular Biology, Human Genome Sciences, Inc., 9620 Medical Center Drive, Rockville, MD 20850, USA FEATURES Location/Qualifiers source 1..2503 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="3" /map="p21" /tissue_type="gall bladder" /dev_stage="adult" gene 42..2312 /gene="hmlh1" CDS 42..2312 /gene="hmlh1" /note="human homolog of E. coli mutL gene product, Swiss-Prot Accession Number P23367" /codon_start=1 /function="DNA mismatch repair" /db_xref="PID:g466462" /translation="MSFVAGVIRRLDETVVNRIAAGEVIQRPANAIKEMIENCLDAKS TSIQVIVKEGGLKLIQIQDNGTGIRKEDLDIVCERFTTSKLQSFEDLASISTYGFRGE ALASISHVAHVTITTKTADGKCAYRASYSDGKLKAPPKPCAGNQGTQITVEDLFYNIA TRRKALKNPSEEYGKILEVVGRYSVHNAGISFSVKKQGETVADVRTLPNASTVDNIRS VFGNAVSRELIEIGCEDKTLAFKMNGYISNANYSVKKCIFLLFINHRLVESTSLRKAI ETVYAAYLPKNTHPFLYLSLEISPQNVDVNVHPTKHEVHFLHEESILERVQQHIESKL LGSNSSRMYFTQTLLPGLAGPSGEMVKSTTSLTSSSTSGSSDKVYAHQMVRTDSREQK LDAFLQPLSKPLSSQPQAIVTEDKTDISSGRARQQDEEMLELPAPAEVAAKNQSLEGD TTKGTSEMSEKRGPTSSNPRKRHREDSDVEMVEDDSRKEMTAACTPRRRIINLTSVLS LQEEINEQGHEVLREMLHNHSFVGCVNPQWALAQHQTKLYLLNTTKLSEELFYQILIY DFANFGVLRLSEPAPLFDLAMLALDSPESGWTEEDGPKEGLAEYIVEFLKKKAEMLAD YFSLEIDEEGNLIGLPLLIDNYVPPLEGLPIFILRLATEVNWDEEKECFESLSKECAM FYSIRKQYISEESTLSGQQSEVPGSIPNSWKWTVEHIVYKALRSHILPPKHFTEDGNI LQLANLPDLYKVFERC" BASE COUNT 723 a 539 c 599 g 642 t ORIGIN 1 gttgaacatc tagacgtttc cttggctctt ctggcgccaa aatgtcgttc gtggcagggg 61 ttattcggcg gctggacgag acagtggtga accgcatcgc ggcgggggaa gttatccagc 121 ggccagctaa tgctatcaaa gagatgattg agaactgttt agatgcaaaa tccacaagta 181 ttcaagtgat tgttaaagag ggaggcctga agttgattca gatccaagac aatggcaccg 241 ggatcaggaa agaagatctg gatattgtat gtgaaaggtt cactactagt aaactgcagt 301 cctttgagga tttagccagt atttctacct atggctttcg aggtgaggct ttggccagca 361 taagccatgt ggctcatgtt actattacaa cgaaaacagc tgatggaaag tgtgcataca 421 gagcaagtta ctcagatgga aaactgaaag cccctcctaa accatgtgct ggcaatcaag 481 ggacccagat cacggtggag gacctttttt acaacatagc cacgaggaga aaagctttaa 541 aaaatccaag tgaagaatat gggaaaattt tggaagttgt tggcaggtat tcagtacaca 601 atgcaggcat tagtttctca gttaaaaaac aaggagagac agtagctgat gttaggacac 661 tacccaatgc ctcaaccgtg gacaatattc gctccgtctt tggaaatgct gttagtcgag 721 aactgataga aattggatgt gaggataaaa ccctagcctt caaaatgaat ggttacatat 781 ccaatgcaaa ctactcagtg aagaagtgca tcttcttact cttcatcaac catcgtctgg 841 tagaatcaac ttccttgaga aaagccatag aaacagtgta tgcagcctat ttgcccaaaa 901 acacacaccc attcctgtac ctcagtttag aaatcagtcc ccagaatgtg gatgttaatg 961 tgcaccccac aaagcatgaa gttcacttcc tgcacgagga gagcatcctg gagcgggtgc 1021 agcagcacat cgagagcaag ctcctgggct ccaattcctc caggatgtac ttcacccaga 1081 ctttgctacc aggacttgct ggcccctctg gggagatggt taaatccaca acaagtctga 1141 cctcgtcttc tacttctgga agtagtgata aggtctatgc ccaccagatg gttcgtacag 1201 attcccggga acagaagctt gatgcatttc tgcagcctct gagcaaaccc ctgtccagtc 1261 agccccaggc cattgtcaca gaggataaga cagatatttc tagtggcagg gctaggcagc 1321 aagatgagga gatgcttgaa ctcccagccc ctgctgaagt ggctgccaaa aatcagagct 1381 tggaggggga tacaacaaag gggacttcag aaatgtcaga gaagagagga cctacttcca 1441 gcaaccccag aaagagacat cgggaagatt ctgatgtgga aatggtggaa gatgattccc 1501 gaaaggaaat gactgcagct tgtacccccc ggagaaggat cattaacctc actagtgttt 1561 tgagtctcca ggaagaaatt aatgagcagg gacatgaggt tctccgggag atgttgcata 1621 accactcctt cgtgggctgt gtgaatcctc agtgggcctt ggcacagcat caaaccaagt 1681 tataccttct caacaccacc aagcttagtg aagaactgtt ctaccagata ctcatttatg 1741 attttgccaa ttttggtgtt ctcaggttat cggagccagc accgctcttt gaccttgcca 1801 tgcttgcctt agatagtcca gagagtggct ggacagagga agatggtccc aaagaaggac 1861 ttgctgaata cattgttgag tttctgaaga agaaggctga gatgcttgca gactatttct 1921 ctttggaaat tgatgaggaa gggaacctga ttggattacc ccttctgatt gacaactatg 1981 tgcccccttt ggagggactg cctatcttca ttcttcgact agccactgag gtgaattggg 2041 acgaagaaaa ggaatgtttt gaaagcctca gtaaagaatg cgctatgttc tattccatcc 2101 ggaagcagta catatctgag gagtcgaccc tctcaggcca gcagagtgaa gtgcctggct 2161 ccattccaaa ctcctggaag tggactgtgg aacacattgt ctataaagcc ttgcgctcac 2221 acattctgcc tcctaaacat ttcacagaag atggaaatat cctgcagctt gctaacctgc 2281 ctgatctata caaagtcttt gagaggtgtt aaatatggtt atttatgcac tgtgggatgt 2341 gttcttcttt ctctgtattc cgatacaaag tgttgtatca aagtgtgata tacaaagtgt 2401 accaacataa gtgttggtag cacttaagac ttatacttgc cttctgatag tattccttta 2461 tacacagtgg attgattata aataaataga tgtgtcttaa cat // LOCUS HSHMPFK 2759 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for muscle phosphofructokinase (E.C. 2.7.1.11). ACCESSION Y00698 NID g32342 KEYWORDS phosphofructokinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2759) AUTHORS Nakajima,H. TITLE Direct Submission JOURNAL Submitted (12-JAN-1988) Nakajima H., The Second Dept. of Internal Medicine, Osaka University Medical School, 1-1-50 Fukushima, Fukushima-ku, Osaka 553, Japan REFERENCE 2 (bases 1 to 2759) AUTHORS Nakajima,H., Noguchi,T., Yamasaki,T., Kono,N., Tanaka,T. and Tarui,S. TITLE Cloning of human muscle phosphofructokinase cDNA JOURNAL FEBS Lett. 223 (1), 113-116 (1987) MEDLINE 88030023 FEATURES Location/Qualifiers source 1..2759 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="muscle" /clone="lambdaHMP2E1.8" CDS 20..2362 /note="phosphofructokinase (AA 1 - 780)" /codon_start=1 /db_xref="PID:g32343" /db_xref="SWISS-PROT:P08237" /translation="MTHEEHHAAKTLGIGKAIAVLTSGGDAQGMNAAVRAVVRVGIFT GARVFFVHEGYQGLVDGGDHIKEATWESVSMMLQLGGTVIGSARCKDFREREGRLRAA YNLVKRGITNLCVIGGDGSLTGADTFRSEWSDLLSDLQKAGKITDEEATKSSYLNIVG LVGSIDNDFCGTDMTIGTDSALHRIMEIVDAITTTAQSHQRTFVLEVMGRHCGYLALV TSLSCGADWVFIPECPPDDDWEEHLCRRLSETRTRGSRLNIIIVAEGAIDKNGKPITS EDIKNLVVKRLGYDTRVTVLGHVQRGGTPSAFDRILGSRMGVEAVMALLEGTPDTPAC VVSLSGNQAVRLPLMECVQVTKDVTKAMDEKKFDEALKLRGRSFMNNWEVYKLLAHVR PPVSKSGSHTVAVMNVGAPAAGMNAAVRSTVRIGLIQGNRVLVVHDGFEGLAKGQIEE AGWSYVGGWTGQGGSKLGTKRTLPKKSFEQISANITKFNIQGLVIIGGFEAYTGGLEL MEGRKQFDELCIPFVVIPATVSNNVPGSDFSVGADTALNTICTTCDRIKQSAAGTKRR VFIIETMGGYCGYLATMAGLAAGADAAYIFEEPFTIRDLQANVEHLVQKMKTTVKRGL VLRNEKCNENYTTDFIFNLYSEEGKGIFDSRKNVLGHMQQGGSPTPFDRNFATKMGAK AMNWMSGKIKESYRNGRIFANTPDSGCVLGMRKRALVFQPVAELKDQTDFEHRIPKEQ WWLKLRPILKILAKYEIDLDTSDHAHLEHITRKRSGEAAV" misc_feature 2735..2740 /note="pot. polyA signal" polyA_site 2759 /note="polyA site" BASE COUNT 668 a 639 c 780 g 672 t ORIGIN 1 gcctgactga gagtggatca tgacccatga agagcaccat gcagccaaaa ccctggggat 61 tggcaaagcc attgctgtct taacctctgg tggagatgcc caaggtatga atgctgctgt 121 cagggctgtg gttcgagttg gtatcttcac cggtgcccgt gtcttctttg tccatgaggg 181 ttatcaaggc ctggtggatg gtggagatca catcaaggaa gccacctggg agagcgtttc 241 gatgatgctt cagctgggag gcacggtgat tggaagtgcc cggtgcaagg actttcggga 301 acgagaagga cgactccgag ctgcctacaa cctggtgaag cgtgggatca ccaatctctg 361 tgtcattggg ggtgatggca gcctcactgg ggctgacacc ttccgttctg agtggagtga 421 cttgttgagt gacctccaga aagcaggtaa gatcacagat gaggaggcta cgaagtccag 481 ctacctgaac attgtgggcc tggttgggtc aattgacaat gacttctgtg gcactgatat 541 gaccattggc actgactctg ccctgcatcg gatcatggaa attgtagatg ccatcactac 601 cactgcccag agccaccaga ggacatttgt gttagaagta atgggccgcc actgtggata 661 cctggccctt gtcacctctc tgtcctgtgg ggccgactgg gtttttattc ctgaatgtcc 721 accagatgac gactgggagg aacacctttg tcgccgactc agcgagacaa ggacccgtgg 781 ttctcgtctc aacatcatca ttgtggctga gggtgcaatt gacaagaatg gaaaaccaat 841 cacctcagaa gacatcaaga atctggtggt taagcgtctg ggatatgaca cccgggttac 901 tgtcttgggg catgtgcaga ggggtgggac gccatcagcc tttgacagaa ttctgggcag 961 caggatgggt gtggaagcag tgatggcact tttggagggg accccagata ccccagcctg 1021 tgtagtgagc ctctctggta accaggctgt gcgcctgccc ctcatggaat gtgtccaggt 1081 gaccaaagat gtgaccaagg ccatggatga gaagaaattt gacgaagccc tgaagctgag 1141 aggccggagc ttcatgaaca actgggaggt gtacaagctt ctagctcatg tcagaccccc 1201 ggtatctaag agtggttcgc acacagtggc tgtgatgaac gtgggggctc cggctgcagg 1261 catgaatgct gctgttcgct ccactgtgag gattggcctt atccagggca accgagtgct 1321 cgttgtccat gatggtttcg agggcctggc caaggggcag atagaggaag ctggctggag 1381 ctatgttggg ggctggactg gccaaggtgg ctctaaactt gggactaaaa ggactctacc 1441 caagaagagc tttgaacaga tcagtgccaa tataactaag tttaacattc agggccttgt 1501 catcattggg ggctttgagg cttacacagg gggcctggaa ctgatggagg gcaggaagca 1561 gtttgatgag ctctgcatcc catttgtggt cattcctgct acagtctcca acaatgtccc 1621 tggctcagac ttcagcgttg gggctgacac agcactcaat actatctgca caacctgtga 1681 ccgcatcaag cagtcagcag ctggcaccaa gcgtcgggtg tttatcattg agactatggg 1741 tggctactgt ggctacctgg ctaccatggc tggactggca gctggggccg atgctgccta 1801 catttttgag gagcccttca ccattcgaga cctgcaggca aatgttgaac atctggtgca 1861 aaagatgaaa acaactgtga aaaggggctt ggtgttaagg aatgaaaagt gcaatgagaa 1921 ctataccact gacttcattt tcaacctgta ctctgaggag gggaagggca tcttcgacag 1981 caggaagaat gtgcttggtc acatgcagca gggtgggagc ccaaccccat ttgataggaa 2041 ttttgccact aagatgggcg ccaaggctat gaactggatg tctgggaaaa tcaaagagag 2101 ttaccgtaat gggcggatct ttgccaatac tccagattcg ggctgtgttc tggggatgcg 2161 taagagggct ctggtcttcc aaccagtggc tgagctgaag gaccagacag attttgagca 2221 tcgaatcccc aaggaacagt ggtggctgaa actgaggccc atcctcaaaa tcctagccaa 2281 gtacgagatt gacttggaca cttcagacca tgcccacctg gagcacatca cccggaagcg 2341 gtccggggaa gctgccgtct aaacctctct ggagtgaggg gaatagatta cctgatcatg 2401 gtcagctcac accctaataa gtccacatct tctcagtgtt ttagctgttt ttttcattag 2461 gtttcctttt attctgtacc ttgcagccat gaccagttct ggccaggagc tggaggagca 2521 ggcagtgggt gggagctcct tttaggtaga atttaacatg acttctgccc cagctttatc 2581 tgtcacacaa ggctgggcac ctctagtgct actgctagat atcacttact cagttagaat 2641 tttcctaaaa ataagcttta tttatttctt tgtgataaca aagagtcttg gttcctctac 2701 tacttttact acagtgacaa attgtaacta cactaataaa tgccaactgg tcactgtga // LOCUS HSHMPLK 1740 bp mRNA PRI 18-JUL-1995 DEFINITION Human (clones 18, 23, 27, 24) c-myeloproliferative leukemia virus type K (c-mpl-K) mRNA, complete cds. ACCESSION M90103 NID g184262 KEYWORDS c-myeloproliferative leukemia virus; hematopoietic growth factor receptor; transmembrane protein; v-myeloproliferative leukemia virus cellular oncogene homologue. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1740) AUTHORS Vigon,I., Mornon,J.P., Cocault,L., Mitjavila,M.T., Tambourin,P., Gisselbrecht,S. and Souyri,M. TITLE Molecular cloning and characterization of MPL, the human homolog of the v-mpl oncogene: identification of a member of the hematopoietic growth factor receptor superfamily JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (12), 5640-5644 (1992) MEDLINE 92302297 FEATURES Location/Qualifiers source 1..1740 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HEL" /map="1p34" /chromosome="1" gene 1..1740 /gene="c-mpl-K" CDS 1..1740 /gene="c-mpl-K" /note="bp 1-1566 is common to c-mpl-P and c-mpl-K" /codon_start=1 /product="c-myeloproliferative leukemia virus type K" /db_xref="PID:g184263" /translation="MPSWALFMVTSCLLLAPQNLAQVSSQDVSLLASDSEPLKCFSRT FEDLTCFWDEEEAAPSGTYQLLYAYPREKPRACPLSSQSMPHFGTRYVCQFPDQEEVR LFFPLHLWVKNVFLNQTRTQRVLFVDSVGLPAPPSIIKAMGGSQPGELQISWEEPAPE ISDFLRYELRYGPRDPKNSTGPTVIQLIATETCCPALQRPHSASALDQSPCAQPTMPW QDGPKQTSPSREASALTAEGGSCLISGLQPGNSYWLQLRSEPDGISLGGSWGSWSLPV TVDLPGDAVALGLQCFTLDLKNVTCQWQQQDHASSQGFFYHSRARCCPRDRYPIWENC EEEEKTNPGLQTPQFSRCHFKSRNDSIIHILVEVTTAPGTVHSYLGSPFWIHQAVRLP TPNLHWREISSGHLELEWQHPSSWAAQETCYQLRYTGEGHQDWKVLEPPLGARGGTLE LRPRSRYRLQLRARLNGPTYQGPWSSWSDPTRVETATETAWISLVTALHLVLGLSAVL GLLLLRWQFPAHYRYRPRQAGDWRWTRWSRTCKQAFLVRSVTPDLRPPPVRTYGFALP ARHLWDSPRLLTL" sig_peptide 7..24 /gene="c-mpl-K" mat_peptide 25..579 /gene="c-mpl-K" misc_signal 269..273 /gene="c-mpl-K" /note="degenerated WS box" misc_signal 473..478 /gene="c-mpl-K" /note="WS box" BASE COUNT 355 a 572 c 467 g 346 t ORIGIN 1 atgccctcct gggccctctt catggtcacc tcctgcctcc tcctggcccc tcaaaacctg 61 gcccaagtca gcagccaaga tgtctccttg ctggcatcag actcagagcc cctgaagtgt 121 ttctcccgaa catttgagga cctcacttgc ttctgggatg aggaagaggc agcgcccagt 181 gggacatacc agctgctgta tgcctacccg cgggagaagc cccgtgcttg ccccctgagt 241 tcccagagca tgccccactt tggaacccga tacgtgtgcc agtttccaga ccaggaggaa 301 gtgcgtctct tctttccgct gcacctctgg gtgaagaatg tgttcctaaa ccagactcgg 361 actcagcgag tcctctttgt ggacagtgta ggcctgccgg ctccccccag tatcatcaag 421 gccatgggtg ggagccagcc aggggaactt cagatcagct gggaggagcc agctccagaa 481 atcagtgatt tcctgaggta cgaactccgc tatggcccca gagatcccaa gaactccact 541 ggtcccacgg tcatacagct gattgccaca gaaacctgct gccctgctct gcagaggcct 601 cactcagcct ctgctctgga ccagtctcca tgtgctcagc ccacaatgcc ctggcaagat 661 ggaccaaagc agacctcccc aagtagagaa gcttcagctc tgacagcaga gggtggaagc 721 tgcctcatct caggactcca gcctggcaac tcctactggc tgcagctgcg cagcgaacct 781 gatgggatct ccctcggtgg ctcctgggga tcctggtccc tccctgtgac tgtggacctg 841 cctggagatg cagtggcact tggactgcaa tgctttacct tggacctgaa gaatgttacc 901 tgtcaatggc agcaacagga ccatgctagc tcccaaggct tcttctacca cagcagggca 961 cggtgctgcc ccagagacag gtaccccatc tgggagaact gcgaagagga agagaaaaca 1021 aatccaggac tacagacccc acagttctct cgctgccact tcaagtcacg aaatgacagc 1081 attattcaca tccttgtgga ggtgaccaca gccccgggta ctgttcacag ctacctgggc 1141 tcccctttct ggatccacca ggctgtgcgc ctccccaccc caaacttgca ctggagggag 1201 atctccagtg ggcatctgga attggagtgg cagcacccat cgtcctgggc agcccaagag 1261 acctgttatc aactccgata cacaggagaa ggccatcagg actggaaggt gctggagccg 1321 cctctcgggg cccgaggagg gaccctggag ctgcgcccgc gatctcgcta ccgtttacag 1381 ctgcgcgcca ggctcaacgg ccccacctac caaggtccct ggagctcgtg gtcggaccca 1441 actagggtgg agaccgccac cgagaccgcc tggatctcct tggtgaccgc tctgcatcta 1501 gtgctgggcc tcagcgccgt cctgggcctg ctgctgctga ggtggcagtt tcctgcacac 1561 tacaggtacc gcccccgcca ggcaggagac tggcggtgga ccaggtggag ccgaacgtgt 1621 aaacaggcat tcttggttcg ctctgtgacc ccagatctcc gtccaccgcc cgtgcgcacc 1681 tacggcttcg cacttcctgc acgtcacctc tgggactcgc cgcggctcct tacactctaa // LOCUS HSHNF4 1441 bp RNA PRI 08-NOV-1994 DEFINITION H.sapiens HNF4 mRNA for hepatocyte nuclear factor 4. ACCESSION X76930 NID g575252 KEYWORDS hepatocyte nuclear factor 4; HNF4 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1441) AUTHORS Chartier,F.L., Bossu,J.P., Laudet,V., Fruchart,J.C. and Laine,B. TITLE Cloning and sequencing of cDNAs encoding the human hepatocyte nuclear factor 4 indicate the presence of two isoforms in human liver JOURNAL Gene 147 (2), 269-272 (1994) MEDLINE 95011627 REFERENCE 2 (bases 1 to 1441) AUTHORS Chartier,F.L. TITLE Direct Submission JOURNAL Submitted (24-DEC-1993) F.L. Chartier, INSERM U325, Inst. Pasteur, 1 Rue Calmette, 59019 Lille Cedex, FRANCE FEATURES Location/Qualifiers source 1..1441 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="liver" /clone="1" gene 20..1417 /gene="HNF4" CDS 20..1417 /gene="HNF4" /codon_start=1 /product="hepatocyte nuclear factor 4" /db_xref="PID:g575253" /db_xref="SWISS-PROT:P41235" /translation="MDMADYSAALDPAYTTLEFENVQVLTMGNDTSPSEGTNLNAPNS LGVSALCAICGDRATGKHYGASSCDGCKGFFRRSVRKNHMYSCRFSRQCVVDKDKRNQ CRYCRLKKCFRAGMKKEAVQNERDRISTRRSSYEDSSLPSINALLQAEVLSRQITSPV SGINGDIRAKKIASIADVCESMKEQLLVLVEWAKYIPAFCELPLDDQVALLRAHAGEH LLLGATKRSMVFKDVLLLGNDYIVPRHCPELAEMSRVSIRILDELVLPFQELQIDDNE YAYLKAIIFFDPDAKGLSDPGKIKRLRSQVQVSLEDYINDRQYDSRGRFGELLLLLPT LQSITWQMIEQIQFIKLFGMAKIDNLLQEMLLGGSPSDAPHAHHPLHPHLMQEHMGTN VIVANTMPTHLSNGQMCEWPRPRGQAATPETPQPSPPGASGSEPYKLLPGAVATIVKP LSAIPQPTITKQEVI" BASE COUNT 319 a 456 c 411 g 255 t ORIGIN 1 ctccaaaacc ctcgtcgaca tggacatggc cgactacagt gctgcactgg acccagccta 61 caccaccctg gaatttgaga atgtgcaggt gttgacgatg ggcaatgaca cgtccccatc 121 agaaggcacc aacctcaacg cgcccaacag cctgggtgtc agcgccctgt gtgccatctg 181 cggggaccgg gccacgggca aacactacgg tgcctcgagc tgtgacggct gcaagggctt 241 cttccggagg agcgtgcgga agaaccacat gtactcctgc agatttagcc ggcagtgcgt 301 ggtggacaaa gacaagagga accagtgccg ctactgcagg ctcaagaaat gcttccgggc 361 tggcatgaag aaggaagccg tccagaatga gcgggaccgg atcagcactc gaaggtcaag 421 ctatgaggac agcagcctgc cctccatcaa tgcgctcctg caggcggagg tcctgtcccg 481 acagatcacc tcccccgtct ccgggatcaa cggcgacatt cgggcgaaga agattgccag 541 catcgcagat gtgtgtgagt ccatgaagga gcagctgctg gttctcgttg agtgggccaa 601 gtacatccca gctttctgcg agctccccct ggacgaccag gtggccctgc tcagagccca 661 tgctggcgag cacctgctgc tcggagccac caagagatcc atggtgttca aggacgtgct 721 gctcctaggc aatgactaca ttgtccctcg gcactgcccg gagctggcgg agatgagccg 781 ggtgtccata cgcatccttg acgagctggt gctgcccttc caggagctgc agatcgatga 841 caatgagtat gcctacctca aagccatcat cttctttgac ccagatgcca aggggctgag 901 cgatccaggg aagatcaagc ggctgcgttc ccaggtgcag gtgagcttgg aggactacat 961 caacgaccgc cagtatgact cgcgtggccg ctttggagag ctgctgctgc tgctgcccac 1021 cttgcagagc atcacctggc agatgatcga gcagatccag ttcatcaagc tcttcggcat 1081 ggccaagatt gacaacctgt tgcaggagat gctgctggga gggtccccca gcgatgcacc 1141 ccatgcccac caccccctgc accctcacct gatgcaggaa catatgggaa ccaacgtcat 1201 cgttgccaac acaatgccca ctcacctcag caacggacag atgtgtgagt ggccccgacc 1261 caggggacag gcagccaccc ctgagacccc acagccctca ccgccaggtg cgtcagggtc 1321 tgagccctat aagctcctgc cgggagccgt cgccacaatc gtcaagcccc tctctgccat 1381 cccccagccg accatcacca agcaggaagt tatctagcaa gccgctgggg cttgggggct 1441 c // LOCUS HSHNF4G 3248 bp RNA PRI 07-MAR-1996 DEFINITION H.sapiens mRNA for hepatocyte nuclear factor 4 gamma. ACCESSION Z49826 NID g1217962 KEYWORDS hepatocyte nuclear factor 4; hepatocyte nuclear factor 4 gamma. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3248) AUTHORS Ryffel,G.U. TITLE Direct Submission JOURNAL Submitted (08-JUN-1995) Ryffel G. U., Universitaetsklinikum Essen, Institut fuer Zellbiologie, Hufelandstr. 55, Essen, Germany, D-45122 REFERENCE 2 (bases 1 to 3248) AUTHORS Drewes,T., Senkel,S., Holewa,B. and Ryffel,G.U. TITLE Human hepatocyte nuclear factor 4 isoforms are encoded by distinct and differentially expressed genes JOURNAL Mol. Cell. Biol. 16 (3), 925-931 (1996) MEDLINE 96182096 FEATURES Location/Qualifiers source 1..3248 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" /clone_lib="cDNA library" CDS 690..3014 /function="transcription factor" /codon_start=1 /evidence=experimental /product="hepatocyte nuclear factor 4 gamma (HNF4gamma)" /db_xref="PID:e183832" /db_xref="PID:g1217963" /translation="MVCAQVFIAKLSSRTPIRFIRRPVRTMSLTSKCPEAKAMELGGV ETGSMKAKEQPTVPGIIRYRGCTLRRMASSARTGRKMMMVAVLLANSVKKAMTTVMSS TARAGGTFSRGCSCPPIHTDSPDSSQPLAMAKPPPRRRMMFQGTVSWAFFQVSKGSVS VLGALKSNHHLREGLWDGKKNKMVTIATPVTASERNPGLKRLAQPGMNFGSRVKRRMA NMQKRMKTACSANLMGPRFWYSSRITARALSSALVLISDLFFLQPLKLSPPYRKEIQS QPANRNNSIRGKAKMNQEPKFTTSHCGKKLLSWPSRMRLGAVPVSVAVPPILDMDMAN YSEVLDPTYTTLEFETMQILYNSSDSSAPETSMNTTDNGVNCLCAICGDRATGKHYGA STCDGCKGFFRRSIRKSHIYSCRFSRQCVVDKDKRNQCRYCRLRKCFRAGMKKEAVQN ERDRISTRRSTFDGSNIPSINTLAQAEVRSRQISVSSPGSSTDINVKKIASIGDVCES MKQQLLVLVEWAKYIPAFCELPLDDQVALLRAHAGEHLLLGATKRSMMYKDILLLGNN YVIHRNSCEVEISRVANRVLDELVRPFQEIQIDDNEYACLKAIVFFDPDAKGLSDPVK IKNMRFQVQIGLEDYINDRQYDSRGRFGELLLLLPTLQSITWQMIEQIQFVKLFGMVK IDNLLQEMLLGGASNDGSHLHHPMHPHLSQDPLTGQTILLGPMSTLVHADQISTPETP LPSPPQGSGQEQYKIAANQASVISHQHLSKQKQL" BASE COUNT 922 a 709 c 833 g 784 t ORIGIN 1 gtccctttcc ctgtctccaa ggattaaaaa agaatctctg actgctggtt atcggcaggg 61 gtaaaacaac tgggtgcaga atgggcacca actaagcaaa atcacacact ggttgtgaac 121 agctggaaca actgggttgg aacattgcag cccaggaaag tgatctttaa agacaaggtg 181 gtctcttcag tgtaatcgaa agtaatcact gggggataat gttagctgtg cccatgggaa 241 tgaatggcct cctatcccaa ctgtacgaga caaacaccat gccagatttg ctcactgttg 301 atgacactgg gctgtcttgt atcctaatga taaaacagct aactgataaa ggagcttatt 361 tctcaggcag cagatctgag agatacctgg gctttgaact gcatgaaaga gagagaaagt 421 atcctagata atggctcatg gacttgagtg ggccactgga ggtcaccctt gcttgctgca 481 tattggatta cagaaaagaa ttatttgatt gttttgtagt gtcctagcag caggtgatgg 541 tgacagccct ttgagagggt tcagcctcaa gggagtcctc cagggggact cagagggtcc 601 gaaatgtgtc attggccaag gtgggtggca atgctgtgac attgaccgag tacatatcag 661 cccagtccgg gaaggtgccc agctggaaga tggtctgtgc ccaggtattc atagccaaac 721 tgagcagcag gacacccatc aggttcatca ggaggcctgt ccgcaccatg tctttgacca 781 gcaagtgtcc agaggcgaag gcgatggagt tggggggcgt tgagaccggg agcatgaagg 841 caaaggagca gccgactgtg cccggaatca tcagatacag ggggtgcact ctcaggcgga 901 tggccagctc tgccaggacc ggcaggaaga tgatgatggt cgccgtgttg ctggcaaact 961 cagtgaagaa ggcgatgacc acagtgatga gcagcacagc cagggcgggg ggcacattct 1021 ccagggggtg cagctgccca ccaatccata cagacagccc cgattcctca cagcctttgg 1081 ccatggcgaa gccccctccc aggagaagga tgatgttcca aggcactgtc tcctgggcct 1141 tcttccaggt cagcaagggc tctgtctctg tgttgggagc tttgaagtca aaccaccact 1201 tgagagaggg cctttgggac gggaagaaga acaagatggt gacaatagcc acgccggtga 1261 cagcatcaga aagaaaccca ggattgaaga ggctggccca gccagggatg aacttcgggt 1321 cccgggtgaa gaggaggatg gcaaacatgc agaaaaggat gaaaacagcc tgttcggcaa 1381 acttgatggg ccccaggttc tggtattctt cccgaattac agctcgagcc ctatcttctg 1441 cattggttct tatctcagat ttattcttcc tccagcccct gaagctcagt cccccgtaca 1501 ggaaggagat ccagagccag cctgccaaca ggaacaacag cataagaggg aaggcgaaaa 1561 tgaaccagga gccgaaattc accacgtcac actgcggaaa gaaactcttg agctggccaa 1621 gcaggatgag gttaggggct gtgcccgtga gtgtggctgt gcccccaata ctggacatgg 1681 acatggcaaa ttacagtgaa gttttggacc caacttacac aactttggag tttgaaacta 1741 tgcagattct atataattca agtgatagtt ctgccccaga gacaagtatg aataccacag 1801 acaacggtgt caactgtctg tgtgctatct gtggggacag agcaacagga aaacactatg 1861 gggcatccac ctgtgatggg tgcaagggtt tcttcagacg cagcattcgt aagagtcaca 1921 tttattcttg caggttcagt cggcaatgtg ttgttgacaa ggacaaaagg aatcaatgta 1981 gatattgtcg attaagaaag tgttttagag cgggaatgaa aaaagaagct gtacaaaatg 2041 aacgtgacag aataagcacc agaagaagca catttgatgg cagcaacatc ccctccatta 2101 acacactggc acaagctgaa gttcggtctc gccagatctc agtctcaagc cctgggtcaa 2161 gcactgacat aaacgttaag aaaattgcaa gtattggtga tgtctgtgaa tctatgaaac 2221 agcagctctt agtcttggtg gaatgggcta aatatattcc tgccttctgt gaattaccat 2281 tggatgatca ggtggcactg ttgagagctc acgcagggga gcacttactg cttggagcta 2341 caaagagatc catgatgtat aaagatattt tgcttttggg aaacaactat gttattcacc 2401 gcaacagctg tgaagttgag attagccgtg tggccaatcg tgttctagat gagctggtta 2461 gaccatttca agaaatccag attgatgaca atgagtatgc ttgtttaaag gcaattgtat 2521 tttttgatcc agatgcaaaa gggctaagcg atccagtaaa aattaagaac atgaggttcc 2581 aagtgcagat cggtttggag gactacatca atgatcggca gtatgactcc cgggggaggt 2641 ttggagagtt gcttctgctc ctgcccacac tgcagagcat cacgtggcaa atgattgagc 2701 aaatacagtt tgttaaactt tttgggatgg ttaaaattga caatctactt caggaaatgc 2761 tattaggtgg ggcttccaat gatggcagtc atctccatca tccaatgcat ccacatttgt 2821 ctcaagaccc attaactgga caaactatac ttttaggtcc catgtcaaca ctggttcatg 2881 cagaccagat ctcaactcct gaaaccccac tcccttcccc accacaaggc tctgggcaag 2941 aacagtacaa aatagctgca aaccaagcat cagtcatttc acaccagcat ctctccaaac 3001 aaaagcaatt gtgaaaatgt gtttacttca gaacggcact acataaatgt gaaaagttgt 3061 tgatcttgaa atatctcaag atagcacttt tggcaaactc ttagccaagg cttcttcatt 3121 ggtgctgtta taagatggta tcctattttc ttgtttatac gttcattcta tttgttattg 3181 ctactatgtg aaactttcac atgcaaccaa tgtatatctg agtttgaagg atgtttatat 3241 agggtagg // LOCUS HSHNP36 2281 bp RNA PRI 22-AUG-1995 DEFINITION H.sapiens mRNA for nucleolar protein, HNP36. ACCESSION X86681 NID g951266 KEYWORDS HNP36 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2281) AUTHORS Williams,J.B. and Lanahan,A.A. TITLE A mammalian delayed-early response gene encodes HNP36, a novel, conserved nucleolar protein JOURNAL Biochem. Biophys. Res. Commun. 213 (1), 325-333 (1995) MEDLINE 95367016 REFERENCE 2 (bases 1 to 2281) AUTHORS Williams,J.B. TITLE Direct Submission JOURNAL Submitted (26-APR-1995) J.B. Williams, Vanderbilt University School of Medicine, Division of Endocrinology, Rm 715, MRB II, Nashville, TN 37232, USA FEATURES Location/Qualifiers source 1..2281 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /clone_lib="Heart cDNA" gene 386..1366 /gene="HNP36" CDS 386..1366 /gene="HNP36" /codon_start=1 /product="HNP36 protein" /db_xref="PID:g951267" /translation="MASVCFINSFSAVLQGSLFGQLGTMPSTYSTLFLSGQGLAGIFA ALAMLLSMASGVDAETSALGYFITPYVGILMSIVCYLSLPHLKFARYYLANKSSQAQA QELETKAELLQSDENGIPSSPQKVALTLDLDLEKEPESEPDEPQKPGKPSVFTVFQKI WLTALCLVLVFTVTLSVFPAITAMVTSSTSPGKWSQFFNPICCFLLFNIMDWLGRSLT SYFLWPDEDSRLLPLLVCLRFLFVPLFMLCHVPQRSRLPILFPQDAYFITFMLLFAVS NGYLVSLTMCLAPRQVLPHEREVAGALMTFFLALGLSCGASLSFLFKALL" BASE COUNT 399 a 758 c 613 g 511 t ORIGIN 1 gaccggtggg gcgggtgcgg cttctctgcc cctttcaccc caggcgcatc cgccgcggcg 61 gccatggccc gaggagacgc cccgcgggac agctaccacc tggtcgggat cagcttcttc 121 atcctggggc tgggcaccct ccttccctgg aacttcttca tcaccgccat cccgtacttc 181 caggcgcgac tggccggggc cggcaacagc acagccagga tcctgagcac caaccacacg 241 ggtcccgagg atgccttcaa cttcaacaat tgggtgacgc tgctgtccca gctgcccctg 301 ctgctcttca ccctcctcaa ctccttcctg taccagtgcg ctggtcaagg tggacatgag 361 ccccggaccc ttcttctcca tcaccatggc ctccgtctgc ttcatcaact ccttcagtgc 421 agtcctacag ggcagcctct tcgggcagct gggcaccatg ccctccacct acagcaccct 481 cttcctcagc ggccagggcc tggctgggat ctttgctgcc cttgccatgc tcctgtccat 541 ggccagtggc gtggacgccg agacctctgc cctggggtac tttatcacgc cctatgtggg 601 catcctcatg tccatcgtgt gttacctgag cctgcctcac ctgaagtttg cccgctacta 661 cctggccaat aaatcatccc aggcccaagc tcaggagctg gagaccaaag ctgagctcct 721 ccagtctgat gagaacggga ttcccagtag tccccagaaa gtagctctga ccctggatct 781 tgacctggag aaggagccgg aatcagagcc agatgagccc cagaagccag gaaaaccttc 841 agtcttcact gtcttccaga agatctggct gacagcgctg tgccttgtgt tggtcttcac 901 agtcaccctg tccgtcttcc ccgccatcac agccatggtg accagctcca ccagtcctgg 961 gaagtggagt cagttcttca accccatctg ctgcttcctc ctcttcaaca tcatggactg 1021 gctgggacgg agcctgacct cttacttcct gtggccagac gaggacagcc ggctgctgcc 1081 cctgctggtc tgcctgcggt tcctgttcgt gcccctcttc atgctgtgcc acgtgcccca 1141 gaggtcccgg ctgcccatcc tcttcccaca ggatgcctac ttcatcacct tcatgctgct 1201 ctttgccgtt tctaatggct acctggtgtc cctcaccatg tgcctggcgc ccaggcaggt 1261 gctgccacac gagagggagg tggccggcgc cctcatgacc ttcttcctgg ccctgggact 1321 ttcctgtgga gcctccctct ccttcctctt caaggcgctg ctctgaagtg gcccctccag 1381 gctctttggc agcctcttct cgacgtctcc ttccggagct gagatccagc ccagggcgaa 1441 tggcgagctt ggctcaggcc tctgcggggt ggaggcccct gggcctgagg ctgccagcag 1501 cgggcaggag ctgctcttca tccacttgga gtgctgcggg gaagaaatca ccaccggtca 1561 ttctaaccct cacccaggaa tgggggtgac tcgctcaaga cctcatggaa agggtgatga 1621 ctagggaaaa gagggtgcag ggcacggctg ctccccacca ccaggtctgc atttgttcat 1681 catcatcagg agcagaggtg accagagggt tcagagtggg aggcggggcc agcccaggcc 1741 aggagcgcct catcttccca ggcctcagcc acccagggta aaaggtgcca gggaagttgt 1801 gggcacctga gaggaggaac agatgtggag gacctgaggg tgctcaaagg gccaggctca 1861 gcctcaagca gtgttttcat tgccaacact tactgtaccc actccgcaga gcccagctgg 1921 gcctgggccc cagggccaca gctagcctgc atgtgtgtac tgcactttac agtttgcaaa 1981 gctcttccat acccactctc tcaccgaagc ctaattgagg ctcttggaag gagtcaggca 2041 aggattgtgc ttcccccatt atacaggtga caaaactgag tcctggggaa ggtggctggt 2101 ccgtggtaga gccgggaccc aatcccctct ctctcctccc tgttggtgct gttcttcctg 2161 cccaacacct gtttctcttt tcctcaaggg gtttggggca ggagcctggg cacttactcc 2221 ccgtttttgc tgtttctcct tctgaccctg ctcttgggtc taataacccc atttatttgg 2281 c // LOCUS HSHNRNPG 1894 bp RNA PRI 10-SEP-1993 DEFINITION H.sapiens mRNA gene for hnRNP G protein. ACCESSION Z23064 NID g398925 KEYWORDS hnRNP G protein; RNA-binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1894) AUTHORS Soulard,M.M., Della Valle,V.V., Siomi,M.M., Pinol-Roma,S.S., Codogno,P.P., Bauvy,C.C., Belli,M.M., Lacroix,J.J., Monod,G.G., Dreyfuss,G.G. and Larsen,C.C. TITLE hnRNP G: sequence and characterization of a glycosylated RNA-binding protein JOURNAL Nucleic Acids Res. 21 (18), 4210-4217 (1993) MEDLINE 94021365 REFERENCE 2 (bases 1 to 1894) AUTHORS Larsen,C.C. TITLE Direct Submission JOURNAL Submitted (17-JUN-1993) Christian-Jacques CJL Larsen, INSERM U-301, 27 rue Juliette Dodu, Paris, 75010, France FEATURES Location/Qualifiers source 1..1894 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="V5" /tissue_type="Breast" /cell_type="Epithelial cell" /cell_line="ZR-75-1 (human breast carcinoma)" /clone_lib="5'stretch cDNA library from Clontech" /chromosome="6p12" /sex="Female" /germline CDS 12..1325 /codon_start=1 /product="hnRNP G protein" /db_xref="PID:g398926" /db_xref="SWISS-PROT:P38159" /translation="MVEADRPGKLFIGGLNTETNEKALEAVFGKYGRIVEVLLMKDRE TNKSRGFAFVTFESPADAKDAARDMNGKSLDGKAIKVEQATKPSFESGRRGPPPPPRS RGPPRGLRGGRGGSGGTRGPPSRGGHMDDGGYSMNFNMSSSRGPLPVKRGPPPRSGGP PPKRSAPSGPVRSSSGMGGRAPVSRGRDSYGGPPRREPLPSRRDVYLSPRDDGYSTKD SYSSRDYPSSRDTRDYAPPPRDYTYRDYGHSSSRDDYPSREYSDRDGYGRDRDYSDHP SGGSYRDSYESYGNSRSAPPTRGPPPSYGGSSRYDDYSSSRDGYGGSRDSYSSSRSDL YSSGRDRVGRQERGLPPSMERGYLLHVIPTAVQAADDQEVVAVEEADLIEGEAEADTR NKQNFGPKSQFKETKSGNYSIITTQGLLKGKIVLLFLNSLLSSPP" BASE COUNT 619 a 394 c 429 g 452 t ORIGIN 1 cggaaaaaaa aatggttgaa gcagatcgcc caggaaagct cttcattggt gggcttaata 61 cggaaacaaa tgagaaagct cttgaagcag tatttggcaa atatggacga atagtggaag 121 tactcttgat gaaagaccgt gaaaccaaca aatcaagagg atttgctttt gtcacctttg 181 aaagcccagc agacgctaag gatgcagcca gagacatgaa tggaaagtca ttagatggaa 241 aagccatcaa ggtggaacaa gccaccaaac catcatttga aagtggtaga cgtggaccgc 301 ctccacctcc aagaagtaga ggccctccaa gaggtcttag aggtggaaga ggaggaagtg 361 gaggaaccag gggacctccc tcacggggag gacacatgga tgacggtgga tattccatga 421 attttaacat gagttcttcc aggggaccac tcccagtaaa aagaggacca ccaccaagaa 481 gtgggggtcc tcctcctaag agatctgcac cttcaggacc agttcgcagt agcagtggaa 541 tgggaggaag agctcctgta tcacgtggaa gagatagtta tggaggtcca cctcgaaggg 601 aaccgctgcc ctctcgtaga gatgtttatt tgtctccaag agatgatggg tattctacta 661 aagacagcta ttcaagcaga gattacccaa gttctcgtga tactagagat tatgcaccac 721 caccacgaga ttatacttac cgtgattatg gtcattccag ttcacgtgat gactatccat 781 caagagaata tagcgataga gatggatatg gtcgtgatcg tgactattca gatcatccaa 841 gtggaggttc ctacagagat tcatatgaga gttatggtaa ctcacgtagt gctccaccta 901 cacgagggcc cccgccatct tatggtggaa gcagtcgcta tgatgattac agcagctcac 961 gtgacggata tggtggaagt cgagacagtt actcaagcag ccgaagtgat ctctactcaa 1021 gtggtcgtga tcgggttggc agacaagaaa gagggcttcc cccttctatg gaaagggggt 1081 acctcctcca cgtgattcct acagcagttc aagccgcgga cgaccaagag gtggtggccg 1141 tggaggaagc cgatctgata gagggggagg cagaagcaga tactagaaac aaacaaaact 1201 ttggaccaaa atcccagttc aaagaaacaa aaagtggaaa ctattctatc ataactaccc 1261 aaggactact aaaaggaaaa attgtgttac tttttttaaa ttccctgtta agttcccctc 1321 cataattttt atgttcttgt gaggaaaaaa gtaaaacatg tttaatttta tttgacttct 1381 gcattgcttt tcaacaagca aatgttaaat gtgttaagac ttgtactagt gttgtaactt 1441 tccaagtaaa agtatcccct aaaggccact tcctatctga tttttcccag caaatgaggc 1501 aggcaattct agtcttccac aaaacatcta gccatctaaa atggagagat gaatcattct 1561 acctatacaa acaagctagc tattagaggg tggttggggt atgctactca taagatttca 1621 gggtgtcttc caactgaaat ctcaatgttc tcagtacgaa aaacctgaaa tcacatgcct 1681 atgtaaggaa agtgctattc acccagtaaa cccaaaaaag caaatggata atgctggcca 1741 ttttgccttt ctgacatttc cttgggaatc tgcaagaacc tcccctttcc cttcccccaa 1801 taagaccatt taagtgtgtg ttaaacaact acagaatact aagtaaaaag tttggccaaa 1861 accaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaa // LOCUS HSHNRNPI 3319 bp RNA PRI 03-DEC-1993 DEFINITION H.sapiens mRNA for heterogeneous nuclear ribonucleoprotein. ACCESSION X66975 S41306 NID g32353 KEYWORDS nuclear ribonucleoprotein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3319) AUTHORS Michael,W.M. TITLE Direct Submission JOURNAL Submitted (23-JUN-1992) W.M. Michael, Howard Hughes Medical Inst, at Univ of Pennsylvania, 422 Curie Boulevard, Room 330, Philadelphia PA 19104-6148, USA REFERENCE 2 (bases 1 to 3319) AUTHORS Ghetti,A., Pinol-Roma,S., Michael,W.M., Morandi,C. and Dreyfuss,G. TITLE hnRNP I, the polypyrimidine tract-binding protein: distinct nuclear localization and association with hnRNAs JOURNAL Nucleic Acids Res. 20 (14), 3671-3678 (1992) MEDLINE 92350668 FEATURES Location/Qualifiers source 1..3319 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /clone_lib="lambda gt11" /clone="pHI" gene 88..1761 /gene="hnRNP I" CDS 88..1761 /gene="hnRNP I" /codon_start=1 /product="nuclear ribonucleoprotein" /db_xref="PID:g32354" /translation="MDGIVPDIAVGTKRGSDELFSTCVTNGPFIMSSNSASAANGNDS KKFKGDSRSAGVPSRVIHIRKLPIDVTEGEVISLGLPFGKVTNLLMLKGKNQAFIEMN TEEAANTMVNYYTSVTPVLRGQPIYIQFSNHKELKTDSSPNQARAQAALQAVNSVQSG NLALAASAAAVDAGMAMAGQSPVLRIIVENLFYPVTLDVLHQIFSKFGTVLKIITFTK NNQFQALLQYADPVSAQHAKLSLDGQNIYNACCTLRIDFSKLTSLNVKYNNDKSRDYT RPDLPSGDSQPSLDQTMAAAFGAPGIISASPYAGAGFPPTFAIPQAAGLSVPNVHGAL APLAIPSAAAAAAAAGRIAIPGLAGAGNSVLLVSNLNPERVTPQSLFILFGVYGDVQR VKILFNKKENALVQMADGNQAQLAMSHLNGHKLHGKPIRITLSKHQNVQLPREGQEDQ GLTKDYGNSPLHRFKKPGSKNFQNIFPPSATLHLSNIPPSVSEEDLKVLFSSNGGVVK GFKFFQKDRKMALIQMGSVEEAVQALIDLHNHDLGENHHLRVSFSKSTI" BASE COUNT 733 a 1016 c 859 g 711 t ORIGIN 1 cttgtgagtc tataactcgg agccgttggg tcggttcctg ctattccggc gcctccactc 61 cgtcccccgc ggtctgctct gtgtgccatg gacggcattg tcccagatat agccgttggt 121 acaaagcggg gatctgacga gcttttctct acttgtgtca ctaacggacc gtttatcatg 181 agcagcaact cggcttctgc agcaaacgga aatgacagca agaagttcaa aggtgacagc 241 cgaagtgcag gcgtcccctc tagagtgatc cacatccgga agctccccat cgacgtcacg 301 gagggggaag tcatctccct ggggctgccc tttgggaagg tcaccaacct cctgatgctg 361 aaggggaaaa accaggcctt catcgagatg aacacggagg aggctgccaa caccatggtg 421 aactactaca cctcggtgac ccctgtgctg cgcggccagc ccatctacat ccagttctct 481 aaccacaagg agctgaagac cgacagctct cccaaccagg cgcgggccca ggcggccctg 541 caggcggtga actcggtcca gtcggggaac ctggccttgg ctgcctcggc ggcggccgtg 601 gacgcaggga tggcgatggc cgggcagagt cctgtgctca ggatcatcgt ggagaacctc 661 ttctaccctg tgaccctgga tgtgctgcac cagattttct ccaagttcgg cacagtgttg 721 aagatcatca ccttcaccaa gaacaaccag ttccaggccc tgctgcagta tgcggacccc 781 gtgagcgccc agcacgccaa gctgtcgctg gacgggcaga acatctacaa cgcctgctgc 841 acgctgcgca tcgacttttc caagctcacc agcctcaacg tcaagtacaa caatgacaag 901 agccgtgact acacacgccc agacctgcct tccggggaca gccagccctc gctggaccag 961 accatggccg cggccttcgg tgcacctggt ataatctcag cctctccgta tgcaggagct 1021 ggtttccctc ccacctttgc cattcctcaa gctgcaggcc tttccgttcc gaacgtccac 1081 ggcgccctgg cccccctggc catcccctcg gcggcggcgg cagctgcggc ggcaggtcgg 1141 atcgccatcc cgggcctggc gggggcagga aattctgtat tgctggtcag caacctcaac 1201 ccagagagag tcacacccca aagcctcttt attcttttcg gcgtctacgg tgacgtgcag 1261 cgcgtgaaga tcctgttcaa taagaaggag aacgccctag tgcagatggc ggacggcaac 1321 caggcccagc tggccatgag ccacctgaac gggcacaagc tgcacgggaa gccgatccgc 1381 atcacgctct cgaagcacca gaacgtgcag ctgccccgcg agggccagga ggaccagggc 1441 ctgaccaagg actacggcaa ctcacccctg caccgcttca agaagccggg ctccaagaac 1501 ttccagaaca tattcccgcc ctcggccacg ctgcacctct ccaacatccc gccctcagtc 1561 tccgaggagg atctcaaggt cctgttttcc agcaatgggg gcgtcgtcaa aggattcaag 1621 ttcttccaga aggaccgcaa gatggcactg atccagatgg gctccgtgga ggaggcggtc 1681 caggccctca ttgacctgca caaccacgac ctcggggaga accaccacct gcgggtctcc 1741 ttctccaagt ccaccatcta ggggcacagg cccccacggc cgggccccct gcgacaactt 1801 ccatcattcc agagaaaagc cactttaaaa acagctgaag tgaccttagc agaccagaga 1861 ttttattttt ttaaagagaa atcagtttac cttgttttta aaaaaattaa atctagttca 1921 ccttgctcac cctgcggtga cagggacagc tcaggcttct tggtgactgt ggcagcggga 1981 gttcccggcc ctccacaccc ggagccacac ccctgggcca tgccttggtg gggcctgtgt 2041 cgggcgtggg gccctgcagg tgggcgcccc gaccacgact tggcttcctt gtgccttaaa 2101 aaacctgcct tcctgcagcc acacacccac ccggggtgtc ctggggaccc aaggggtggg 2161 ggggtcacac cagagagagg cagggggcct ggccggctcc tgcaggatca tgcagctggg 2221 gccggcggcc gcggctgcga caccccaacc ccagccctct aatcaagtca cgtgattctc 2281 ccttcacccc cgccccaggg ccttcccttc tgcccccagg cgggctcccc gctgctccag 2341 ctgcggagct ggtcgacata atctctgtat tatatacttt gcagttgcag acgtctgtgc 2401 ctagcaatat ttccagttga ccaaatattc taatcttttt tcatttatat gcaaaagaaa 2461 tagttttaag taacttttta tagcaagatg atacaatggt atgagtgtaa tctaaacttc 2521 cttgtggtat taccttgtat gctgttactt ttattttatt ccttgtaatt aagtcacagg 2581 caggacccag tttccagaga gcaggcgggg ccgcccagtg ggtcaggcac agggagcccc 2641 ggtcctatct tagagcccct gagcttcagg gaaggggcgg gcgtgtcgcc gcctctggca 2701 tcgcctccgg ttgccttaca ccacgccttc acctgcagtc gcctagaaaa cttgctctca 2761 aacttcaggg ttttttcttc cttcaaattt tggaccaaag tctcatttct gtgttttgcc 2821 tgcctctgat gctgggaccc ggaaggcggg cgctcctctg tcttctctgt gctctttcta 2881 ccgcccccgc gtcctgtccc gggggctctc ctaggatccc ctttccgtaa aagcgtgtaa 2941 caagggtgta aatatttata attttttata cctgttgtga gacccgaggg gcggcggcgc 3001 ggttttttat ggtgacacaa atgtatattt tgctaacagc aattccaggc tcagtattgt 3061 gaccgcggac cacaggggac cccacgcaca ttccgttgcc ttacccgatg gcttgtgacg 3121 cggagagaac cgattaaaac cgtttgagaa actcctccct tgtctagccc tgtgttcgct 3181 gtggacgctg tagaggcagg ttggccagtc tgtacctgga cttcgaataa atcttccgta 3241 tcctcgctcc gttccgcctt aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3301 aaaaaaaaaa aaaaaaaaa // LOCUS HSHNRNPL 2033 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for novel heterogeneous nuclear RNP protein, L protein. ACCESSION X16135 NID g32355 KEYWORDS hnRNP protein; ribonucleoprotein; RNA binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2033) AUTHORS Swanson,M. TITLE Direct Submission JOURNAL Submitted (15-AUG-1989) Swanson M., Northwestern University, Biochemistry Molecular & Cell Biology, 2153 Sheridan Road, Evanston IL 60208, U S A REFERENCE 2 (bases 1 to 2033) AUTHORS Pinol-Roma,S., Swanson,M.S., Gall,J.G. and Dreyfuss,G. TITLE A novel heterogeneous nuclear RNP protein with a unique distribution on nascent transcripts JOURNAL J. Cell Biol. 109 (6 Pt 1), 2575-2587 (1989) MEDLINE 90078296 FEATURES Location/Qualifiers source 1..2033 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa S3" /clone_lib="lambda gt11" /clone="pHCL3" CDS 29..1705 /note="L protein (AA 1-558)" /codon_start=1 /db_xref="PID:g32356" /db_xref="SWISS-PROT:P14866" /translation="MVKMAAAGGGGGGGRYYGGGSEGGRAPKRLKTDNAGDQHGGGGG GGGGAGAAGGGGGGENYDDPHKTPASPVVHIRGLIDGVVEADLVEALQEFGPISYVVV MPKKRQALVEFEDVLGACNAVNYAADNQIYIAGHPAFVNYSTSQKISRPGDSDDSRSV NSVLLFTILNPIYSITTDVLYTICNPCGPVQRIVIFRKNGVQAMVEFDSVQSAQRAKA SLNGADIYSGCCTLKIEYAKPTRLNVFKNDQDTWDYTNPNLSGQGDPGSNPNKRQRQP PLLGDHPAEYGGPHGGYHSHYHDEGYGPPPPHYEGRRMGPPVGGHRRGPSRYGPQYGH PPPPPPPPEYGPHADSPVLMVYGLDQSKMNGDRVFNVFCLYGNVEKVKFMKSKPGAAM VEMADGYAVDRAITHLNNNFMFGQKLNVCVSKQPAIMPGQSYGLEDGSCSYKDFSESR NNRFSTPEQAAKNRIQHPSNVLHFFNAPLEVTEENFFEICDELGVKRPSSVKVFSGKS ERSSSGLLEWESKSDALETLGFLNHYQMKNPNGPYPYTLKLCFSTAQHAS" misc_feature 2005..2010 /note="polyA signal" misc_feature 2022..2027 /note="polyA signal" polyA_site 2033 /note="polyA site" BASE COUNT 478 a 532 c 557 g 466 t ORIGIN 1 ggacgagcag cggaggcggt cgggagcgat ggtgaagatg gcggcggcgg gcggcggagg 61 cggcggtggc cgctactacg gcggcggcag tgagggcggc cgggccccta agcggctcaa 121 gactgacaac gccggcgacc agcacggagg cggcggcggt ggcggtggag gagccggggc 181 ggcgggcggc ggcggcggtg gggagaacta cgatgacccg cacaaaaccc ctgcctcccc 241 agttgtccac atcaggggcc tgattgacgg tgtggtggaa gcagaccttg tggaggcctt 301 gcaggagttt ggacccatca gctatgtggt ggtaatgcct aaaaagagac aagcactggt 361 ggagtttgaa gatgtgttgg gggcttgcaa cgcagtgaac tacgcagccg acaaccaaat 421 atacattgct ggtcacccag cttttgtcaa ctactctacc agccagaaga tctcccgccc 481 tggggactcg gatgactccc ggagcgtgaa cagtgtgctt ctctttacca tcctgaaccc 541 catttattcg atcaccacgg atgttcttta cactatctgt aatccttgtg gccctgtcca 601 gagaattgtc attttcagga agaatggagt tcaggcgatg gtggaatttg actcagttca 661 aagtgcccag cgggccaagg cctctctcaa tggggctgat atctattctg gctgttgcac 721 tctgaagatc gaatacgcaa agcctacacg cttgaatgtg ttcaagaatg atcaggatac 781 ttgggactac acaaacccca atctcagtgg acaaggtgac cctggcagca accccaacaa 841 acgccagagg cagccccctc tcctgggaga tcaccccgca gaatatggag ggccccacgg 901 tgggtaccac agccattacc atgatgaggg ctacgggccc cccccacctc actacgaagg 961 gagaaggatg ggtccaccag tggggggtca ccgtcggggc ccaagtcgct acggccccca 1021 gtatgggcac cccccacccc ctcccccacc acccgagtat ggccctcacg ccgacagccc 1081 tgtgctcatg gtctatggct tggatcaatc taagatgaac ggtgaccgag tcttcaatgt 1141 cttctgctta tatggcaatg tggagaaggt gaaattcatg aaaagcaagc cgggggccgc 1201 catggtggag atggctgatg gctacgctgt agaccgggcc attacccacc tcaacaacaa 1261 cttcatgttt gggcagaagc tgaatgtctg tgtctccaag cagccagcca tcatgcctgg 1321 tcagtcatac gggttggaag acgggtcttg cagttacaaa gacttcagtg aatcccggaa 1381 caatcggttc tccaccccag agcaggcagc caagaaccgc atccagcacc ccagcaacgt 1441 gctgcacttc ttcaacgccc cgctggaggt gaccgaggag aacttctttg agatctgcga 1501 tgagctggga gtgaagcggc catcttctgt gaaagtattc tcaggcaaaa gtgagcgcag 1561 ctcctctgga ctgctggagt gggaatccaa gagcgatgcc ctggagactc tgggcttcct 1621 gaaccattac cagatgaaaa acccaaatgg tccataccct tacactctga agttgtgttt 1681 ctccactgct cagcacgcct cctaattagg tgcctaggaa gagtcccatc tgagcaggaa 1741 gacatttctc tttcctttat gccatttttt gtttttgtta tttgcaaaag atcttgtatt 1801 cctttttttt tttttttttt tttaaatgct aggtttgtag aggcttactt aaccttaatg 1861 gaaacgctgg aaatctgcag ggggagggag aggggaactg ttatctccca agattaacct 1921 tcacttttaa aaaattattg tacatgtgat tttttttttt cctgttcata catttgtgct 1981 gcccatgtac tcttggcaca tttcaataaa attgtttgga aaataaacac agc // LOCUS HSHNRNPU 3223 bp RNA PRI 18-NOV-1993 DEFINITION H.sapiens U21.1 mRNA. ACCESSION X65488 S40037 NID g32357 KEYWORDS hnRNP U protein; nuclear protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3223) AUTHORS Kiledjian,M. and Dreyfuss,G. TITLE Primary structure and binding activity of the hnRNP U protein: binding RNA through RGG box JOURNAL EMBO J. 11 (7), 2655-2664 (1992) MEDLINE 92331618 REFERENCE 2 (bases 1 to 3223) AUTHORS Kiledjian,M. TITLE Direct Submission JOURNAL Submitted (08-APR-1992) M. Kiledjian, Howard Hughes Med Institute at University of Pennsylvania, 422 Curie Boulevard, Philadelphia, PA 19104-6148, USA FEATURES Location/Qualifiers source 1..3223 /organism="Homo sapiens" /strain="Hela D98" /db_xref="taxon:9606" /clone_lib="lambda GT11" /clone="pGem-U21.1" gene 42..2462 /gene="U21.1" CDS 42..2462 /gene="U21.1" /codon_start=1 /product="hnRNP U protein" /db_xref="PID:g32358" /db_xref="SWISS-PROT:Q00839" /translation="MSSSPVNVKKLKVSELKEELKKRRLSDKGLKAELMERLQAALDD EEAGGRPAMEPGNGSLDLGGDSAGRSGAGLEQEAAAGGDEEEEEEEEEEEGISALDGD QMELGEENGAAGAADSGPMEEEEAASEDENGDDQGFQEGEDELGDEEEGAGDENGHGE QQPQPPATQQQQPQQQRGAAKEAAGKSSGPTSLFAVTVAPPGARQGQQQAGGDGKTEQ KGGDKKRGVKRPREDHGRGYFEYIEENKYSRAKSPQPPVEEEDEHFDDTVVCLDTYNC DLHFKISRDRLSASSLTMESFAFLWAGGRASYGVSKGKVCFEMKVTEKIPVRHLYTKD IDIHEVRIGWSLTTSGMLLGEEEFSYGYSLKGIKTCNCETEDYGEKFDENDVITCFAN FESDEVELSYAKNGQDLGVAFKISKEVLAGRPLFPHVLCHNCAVEFNFGQKEKPYFPI PEEYTFIQNVPLEDRVRGPKGPEEKKDCEVVMMIGLPGAGKTTWVTKHAAENPGKYNI LGTNTIMDKMMVAGFKKQMADTGKLNTLLQRAPQCLGKFIEIAARKKRNFILDQTNVS AAAQRRKMCLFAGFQRKAVVVCPKDEDYKQRTQKKAEVEGKDLPEHAVLKMKGNFTLP EVAECFDEITYVELQKEEAQKLLEQYKEESKKALPPEKKQNTGSKKSNKNKSGKNQFN RGGGHRGRGGLNMRGGNFRGGAPGNRGGYNRRGNMPQRGGGGGGSGGIGYPYPRAPVF PGRGSYSNRGNYNRGGMPNRGNYNQNFRGRGNNRGYKNQSQGYNQWQQGQFWGQKPWS QHYHQGYY" BASE COUNT 981 a 606 c 888 g 748 t ORIGIN 1 cgagtttgag gcagcgctag cggtgaatcg gggccctcac catgagttcc tcgcctgtta 61 atgtaaaaaa gctgaaggtg tcggagctga aagaggagct caagaagcga cgcctttctg 121 acaagggtct caaggccgag ctcatggagc gactccaggc tgcgctggac gacgaggagg 181 ccgggggccg ccccgccatg gagcccggga acggcagcct agacctgggc ggggattccg 241 ctgggcgctc gggagcaggc ctcgagcagg aggccgcggc cggcggcgat gaagaggagg 301 aagaagagga agaggaggag gaaggaatct ccgctctgga cggcgaccag atggagctag 361 gagaggagaa cggggccgcg ggggcggccg actcgggccc gatggaggag gaggaggccg 421 cctcggaaga cgagaacggc gacgatcagg gtttccagga aggggaagat gagctcgggg 481 acgaagagga aggcgcgggc gacgagaacg ggcacgggga gcagcagcct caaccgccgg 541 cgacgcagca gcaacagccc caacagcagc gcggggccgc caaggaggcc gcggggaaga 601 gcagcggccc cacctcgctg ttcgcggtga cggtggcgcc gcccggggcg aggcagggcc 661 agcagcaggc gggaggggac ggcaaaacag aacagaaagg cggagataaa aagaggggtg 721 ttaaaagacc acgagaagat catggccgtg gatattttga gtacattgaa gagaacaagt 781 atagcagagc caaatctcct cagccacctg ttgaagaaga agatgaacac ttcgatgaca 841 cagtggtttg tcttgatact tataattgtg atctacattt taaaatatca agagatcgtc 901 tcagtgcttc ttcccttaca atggagagtt ttgcttttct ttgggctgga ggaagagcat 961 cctatggtgt gtcaaaaggc aaagtgtgtt ttgagatgaa ggttacagag aagatcccag 1021 taaggcattt atatacaaaa gatattgaca tacatgaagt tcgtattggc tggtcactaa 1081 ctacaagtgg aatgttactt ggtgaagaag aattttctta tgggtattct ctaaaaggaa 1141 taaaaacatg caactgtgag actgaagatt atggagaaaa gtttgatgaa aatgatgtga 1201 ttacatgttt tgctaacttt gaaagtgatg aagtagaact ctcgtatgct aagaatggac 1261 aagatcttgg cgttgccttc aaaatcagta aggaagttct tgctggacgg ccactgttcc 1321 cgcatgttct ctgccacaac tgtgcagttg aatttaattt tggtcagaag gaaaagccat 1381 attttccaat acctgaagag tatactttca tccagaacgt ccccttagag gatcgagtta 1441 gaggaccaaa ggggcctgaa gagaagaaag attgtgaagt tgtgatgatg attggcttgc 1501 caggagctgg aaaaactacc tgggttacta aacatgcagc agaaaatcca gggaaatata 1561 acattcttgg cacaaatact attatggata agatgatggt ggcaggtttt aagaagcaaa 1621 tggcagatac tggaaaactg aacacactgt tgcagagagc cccccagtgt cttgggaaat 1681 ttattgagat tgctgcccga aagaagcgaa attttattct ggatcagaca aatgtgtctg 1741 ctgctgccca gaggagaaaa atgtgcctgt ttgcaggctt ccagcgaaaa gctgttgtag 1801 tttgcccaaa agatgaagac tataagcaaa gaacacagaa gaaagcagaa gtagagggga 1861 aagacctacc agaacatgcg gtcctcaaaa tgaaaggaaa ctttaccctc ccagaggtag 1921 ctgagtgctt tgatgaaata acctatgttg aacttcagaa ggaagaagcc caaaaactct 1981 tggagcaata taaggaagaa agcaaaaagg ctcttccacc agaaaagaaa cagaacactg 2041 gctcaaagaa aagcaataaa aataagagtg gcaagaacca gtttaacaga ggtggtggcc 2101 atagaggacg tggaggactc aatatgcgtg gtggaaattt cagaggagga gcccctggga 2161 atcgtggcgg atataatagg aggggcaaca tgccacagag aggtggtggc ggtggaggaa 2221 gtggtggaat cggctatcca taccctcgtg cccctgtttt tcctggccgt ggtagttact 2281 caaacagagg gaactacaac agaggtggaa tgcccaacag agggaactac aaccagaact 2341 tcagaggacg aggaaacaat cgtggctaca aaaatcaatc tcagggctac aaccagtggc 2401 agcagggtca attctggggt cagaagccat ggagtcagca ttatcaccaa ggatattatt 2461 gaatacccaa ataaaacgaa ctgatacata tttctccaaa accttcacaa gaagtcgact 2521 gttttcttta gtaggctaac tttttaaaca ttccacaaga ggaagtgcct gcgggttcct 2581 tttttagaag ctttgtgggt tgattttttt tcttttcttt tttgtacatt tttaattgca 2641 gtttaaaagt gaatcgtaag agaacctcag cattgtgcac gataagagaa tgtgtcagta 2701 tttcagggtt ctacatttat ctgtaaaatg tgactttttt ttttttttat cacaacagaa 2761 gtaaaatgtt gctttgtacc tggtgtcttt tattaagaat ttactccccc catttctcac 2821 agagaataac agtcgggagt cattgtcaca atataataga aatgttagca accagattca 2881 tgtaaggact aagtggtcct catgaattgc attaagactc tgtactgctc atattacact 2941 ccatcctctc tgtagtttgc tgggtagtgg agggggtaag ctaaatcata gtttctgaca 3001 ataactggga aggttttttc ttaaaataac aatggaattg gtataattgg gattgaaaac 3061 taaaacttgg aactaagata gagaagatgg agtgtatgta gaagggctgt taaaaatgta 3121 aaacttggtt gcattatttg tggaggctca aacttgtgaa ggttaatacc ataatttttc 3181 catttgttct gcattttgat tctgaaaaga aagctggctt tgc // LOCUS HSHOX22 2786 bp DNA PRI 17-SEP-1992 DEFINITION Human Hox2.2 gene for a homeobox protein. ACCESSION X58431 NID g32369 KEYWORDS homeobox protein; Hox-2.2 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2786) AUTHORS Largman,C. TITLE Direct Submission JOURNAL Submitted (15-MAR-1991) C. Largman, Martinez VA Medical Center, Dept of Research, 150 Muir Road, Martinez CA 94553, USA REFERENCE 2 (bases 1 to 2786) AUTHORS Shen,W.F., Detmer,K., Simonitch-Eason,T.A., Lawrence,H.J. and Largman,C. TITLE Alternative splicing of the HOX 2.2 homeobox gene in human hematopoietic cells and murine embryonic and adult tissues JOURNAL Nucleic Acids Res. 19 (3), 539-545 (1991) MEDLINE 91187672 FEATURES Location/Qualifiers source 1..2786 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_lib="human placental" mRNA <316..2708 /gene="HOX 2.2" /note="alternatively spliced mRNA, contains the intron" exon <316..730 /gene="HOX 2.2" /number=1 mRNA join(<316..730,1793..2708) /gene="HOX 2.2" gene 316..2708 /gene="HOX 2.2" CDS join(316..730,1793..2052) /gene="HOX 2.2" /codon_start=1 /db_xref="PID:g32370" /db_xref="SWISS-PROT:P17509" /translation="MSSYFVNSTFPVTLASGQESFLGHVPLYSSGYRDPLRHYPAPYG PGPGQDKGFATSSYYRPAGGGYGRAAPCAYGPAPAFYREKESACALSGADEQPPFHPE PRKSDCAQDKSVFGETEEQKCSTPVYPWMQRMNSCNSSSFGPSGRRGRQTYTRYQTLE LEKEFHYNRYLTRRRRIEIAHALCLTERQIKIWFQNRRMKWKKESKLLSASQLSAEEE EEKQAE" CDS 316..738 /gene="HOX 2.2" /note="derived from alternatively spliced mRNA" /codon_start=1 /product="putative truncated protein" /db_xref="PID:g32371" /db_xref="SWISS-PROT:P17509" /translation="MSSYFVNSTFPVTLASGQESFLGHVPLYSSGYRDPLRHYPAPYG PGPGQDKGFATSSYYRPAGGGYGRAAPCAYGPAPAFYREKESACALSGADEQPPFHPE PRKSDCAQDKSVFGETEEQKCSTPVYPWMQRMNSCNSE" intron 731..1792 /gene="HOX 2.2" /number=1 exon 1793..2708 /gene="HOX 2.2" /number=2 polyA_signal 2687..2692 /gene="HOX 2.2" BASE COUNT 612 a 789 c 867 g 518 t ORIGIN 1 gctggcggtt ggtgtgccgc gccgccggga gagagaggag aaggcgaagg agaaggagga 61 aaaaaaaagt gatagttgtg cgcggtcctc gtgcgctggc gctcctcctg ggtgctattg 121 acaattcctg cctctgccat tggtcagtgt tggatcagat ggttgtattt ccttctggcc 181 ctcactgcac ctcgccatcc cccctcagct aaaacccaat ctcggatata ctactatagc 241 gcggcgctcg gactataaaa cacaacaaat ataaacccgg cggagcagca gcggccgcgc 301 gcgcctcccc tcccaatgag ttcctatttc gtgaactcca ccttccccgt cactctggcc 361 agcgggcagg agtccttcct gggccacgta ccgctctatt cgtcgggcta tcgggacccg 421 ctgagacatt accccgcgcc ctacgggcca gggccgggcc aggacaaggg ctttgccact 481 tcctcctatt accgcccggc ggggggtggc tacggccgag cggcgccctg cgcctacggc 541 ccggcgccgg ccttctaccg cgagaaagag tcggcctgcg cactctccgg cgccgacgag 601 cagcccccgt tccaccccga gccgcggaag tcggactgcg cgcaggacaa gagcgtgttc 661 ggcgagacag aagagcagaa gtgctccact ccggtctacc cgtggatgca gcggatgaat 721 tcgtgcaaca gtgagtgaga cttcccggtc gccgtcgccc cggctcccct gggcgcccac 781 cccgggacac agaactagtg agcgccccct gccccaatct cccacagggt gctgggtggg 841 catctcagtt aggagataga agaagattgg gcgccggccg ggggtctctc gctgtgtccc 901 ctattaggca ggagttacaa agtttgcaaa gtcccagccg cactgggagc catggggagc 961 aagtgtgctc tcctggggcg cctggccaga gccggggttc caggacaggg aagggagcag 1021 gagcctgcag tcactggctt ggctttactt tgagccccca acccccctct cccagccctg 1081 gtagggttcc ccaaccaaag acggctccat taaaaacgga gtgcgtcatg caagggggta 1141 gggggtgacg ctgaagagga agagttgagt ctgtctggga ctgtgtttcc ctgtggtggg 1201 gagctggggc aagatgaggc accaggaaag gggtggtagt cagaggaggg aggaggaata 1261 aggggaagga agagagaggg agcaggcgga gcgagagagg gagaaacagg cgcggggtct 1321 caggttggat tatttgttgg ccttaagtcc caactgatgt ccattagccg gactcgaaag 1381 tgaggagccg ttaatgtgga ctatggatcg atctacgtca ttacggatta acggcctgga 1441 tttatcattg ggttgggggg atgacggggg gtgggaacaa gatggatgga aagggaggga 1501 aagacaagat gcaactagga agagacacac ctgaccagcc cctcccccca ggctcaggtg 1561 ggagttccaa ctgctcctcc cctcccccat ctagagtcta cagacggcac aggcctagga 1621 gactaggagg gaatctggag ggggcgctgg aggagtgcga aacgggggaa gggagccggc 1681 agaatagagg gcattcccgg tactggctgg agtctgtgtt cgagggtcga ctaggggagg 1741 gggtcctggg cccggtgacc gcaggcctca gcatctccac tctgcgtaac aggttcctcc 1801 tttgggccca gcggccggcg aggccgccag acatacacac gttaccagac gctggagctg 1861 gagaaggagt ttcactacaa tcgctacctg acgcggcggc ggcgcatcga gatcgcgcac 1921 gccctgtgcc tgacggagag gcagatcaag atatggttcc agaaccgacg catgaagtgg 1981 aaaaaggaga gcaaactgct cagcgcgtct cagctcagtg ccgaggagga ggaagaaaaa 2041 caggccgagt aaggtgctgg aaagggaggg aggacgccga gggaaaggcc tgtggggagc 2101 cacgggcgtc agagagaccc gggaaggaag gctctcgggt gggggagcca ggacacctgc 2161 tctccggcgc agacagcggg gcccagcgct ctcctggacg cccccgccgc acagctcccg 2221 gcgggtgctc tgaggcctca ctactcgagc ccacccagca tcccgcgtcg tcccttcctt 2281 cccgaggaac tgccctcagc ctgatcaggc ttcctggtga gaactgagga gcggactcac 2341 ttgatgtttc ctggaagcag agcaaaagtt ctcttgtccc tgtcgcgtct cattttgtcc 2401 atgtcccccg tgcacggttc aatggtagat tcgctgtcct cagcgggggc cttgaagact 2461 ccctgatccc agacctggtc gtctctccca ccccctcccc aaagccactg gaaggagcac 2521 atactaccta gaagtaagaa gaggagcctc agaagaaaac aaagttctat tttattaatt 2581 ttctatgtgt tgtgtttgta gtcttgtctt agctctggac gcgaaatact tcgatgatga 2641 tgatgatgat gataataata ataataataa caacaacaac aacaataata aagatgtgaa 2701 aactcgaacg ctcggtcacc tccaatcctc ccctgccatt ttttcctctc tcctcaaccc 2761 ccagcctctc catctttcct gtgcca // LOCUS HSHOX329 1522 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for cp19 homeobox from HOX-3 locus. ACCESSION X07495 NID g32385 KEYWORDS homeobox. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1522) AUTHORS Boncinelli,E. TITLE Direct Submission JOURNAL Submitted (29-APR-1988) Boncinelli E., I.I.G.B.- CNR, Via G. Marconi 10 80125 Napoli, Italy REFERENCE 2 (bases 1 to 1522) AUTHORS Simeone,A., Pannese,M., Acampora,D., D'Esposito,M. and Boncinelli,E. TITLE At least three human homeoboxes on chromosome 12 belong to the same transcription unit JOURNAL Nucleic Acids Res. 16 (12), 5379-5390 (1988) MEDLINE 88262550 FEATURES Location/Qualifiers source 1..1522 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="full term placenta" /map="HOX-3 locus chromosome 12" CDS 608..1402 /note="translated region (AA 1-264)" /codon_start=1 /db_xref="PID:g32386" /db_xref="SWISS-PROT:P09017" /translation="MIMSSYLMDSNYIDPKFPPCEEYSQNSYIPEHSPEYYGRTRESG FQHHHQELYPPPPPRPSYPERQYSCTSLQGPGNSRGHGPAQAGHHHPEKSQSLCEPAP LSGASASPSPAPPACSQPAPDHPSSAASKQPIVYPWMKKIHVSTVNPNYNGGEPKRSR AAYTRQQVLELEKEFHYNRYLTRRRRIEIAHSLCLSERQIKIWFQNRRMKWKKDHRLP NTKVRSAPPAGAAPSTLSAATPGTSEDHSQSATPPEQQRAEDITRL" misc_feature 1073..1255 /note="cp19 homeobox" BASE COUNT 368 a 464 c 373 g 317 t ORIGIN 1 acctgccctg ggcgctccct tcattagcag tatttttttt aaattaatct gattaataat 61 tatttttccc ccatttaatt ttttttcctc ccaggtggag ttgccgaagc tgggggcagc 121 tggggagggt ggggatggga ggggagagac agaagttgag ggcatctctc tcttccttcc 181 cgaccctctg gcccccaagg ggcaggagga atgcaggagc aggagttgag tttgggagct 241 gcagatgcct ccgcccctcc tctctcccag gctcttcctc ctgccccctt cttgcaactc 301 tccttaattt tgtttggctt ttggatgatt ataattattt ttatttttga atttatataa 361 agtatatgtg tgtgtgtgtg gagctgagac aggctcggca gcggcacaga atgagggaag 421 acgagaaaga gagtgggaga gagagaggca gagagggaga gagggagagt gacagcagcg 481 ctcgcggggg ctcaaccccc agacctccag aaatgacgtc agaatcattt gcatcccgct 541 gcctctacct gcctggtcca gctgggaccc tgcctcgccg gccgcatggc cagagggttg 601 gaaattaatg atcatgagct cgtatttgat ggactctaac tacatcgatc cgaaatttcc 661 tccatgcgaa gaatattcgc aaaatagcta catccctgaa cacagtccgg aatattacgg 721 ccggaccagg gaatcgggat tccagcatca ccaccaggag ctgtacccac caccgcctcc 781 gcgccctagc taccctgagc gccagtatag ctgcaccagt ctccaggggc ccggcaattc 841 gcgaggccac gggccggccc aggcgggcca ccaccacccc gagaaatcac agtcgctctg 901 cgagccggcg cctctctcag gcgcctccgc ctccccgtcc ccagccccgc cagcctgcag 961 ccagccagcc cccgaccatc cctccagcgc cgccagcaag caacccatag tctacccatg 1021 gatgaaaaaa attcacgtta gcacggtgaa ccccaattat aacggagggg aacccaagcg 1081 ctcgagggca gcctataccc ggcagcaagt cctggaatta gagaaagagt ttcattacaa 1141 ccgctacctg acccgaagga gaaggatcga gatcgcccac tcgctgtgcc tctctgagag 1201 gcagatcaaa atctggttcc aaaaccgtcg catgaaatgg aagaaggacc accgactccc 1261 caacaccaaa gtcaggtcag cacccccggc cggcgctgcg cccagcaccc tttcggcagc 1321 taccccgggt acttctgaag accactccca gagcgccacg ccgccggagc agcaacgggc 1381 agaggacatt accaggttat aaaacataac tcacacccct gcccccaccc catgccccca 1441 ccctcccctc acacacaaat tgactcttat ttatagaatt taatatatat atatatatat 1501 atatataggt tcttttctct ct // LOCUS HSHOX40MR 1373 bp RNA PRI 13-MAY-1994 DEFINITION H.sapiens mRNA for OX40 homologue. ACCESSION X75962 NID g472957 KEYWORDS OX40 antigen. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1358) AUTHORS Latza,U., Durkop,H., Schnittger,S., Ringeling,J., Eitelbach,F., Hummel,M., Fonatsch,C. and Stein,H. TITLE The human OX40 homolog: cDNA structure, expression and chromosomal assignment of the ACT35 antigen JOURNAL Eur. J. Immunol. 24 (3), 677-683 (1994) MEDLINE 94170844 REFERENCE 2 (bases 1 to 1373) AUTHORS Latza,U. TITLE Direct Submission JOURNAL Submitted (16-DEC-1993) U. Latza, FU Berlin, Klinikum Steglitz, Institute of Pathology, Hindenburgdamm 30, 12200 Berlin, FRG FEATURES Location/Qualifiers source 1..1373 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 6..89 CDS 6..839 /codon_start=1 /product="OX40 homologue" /db_xref="PID:g472958" /db_xref="SWISS-PROT:P43489" /translation="MCVGARRLGRGPCAALLLLGLGLSTVTGLHCVGDTYPSNDRCCH ECRPGNGMVSRCSRSQNTVCRPCGPGFYNDVVSSKPCKPCTWCNLRSGSERKQLCTAT QDTVCRCRAGTQPLDSYKPGVDCAPCPPGHFSPGDNQACKPWTNCTLAGKHTLQPASN SSDAICEDRDPPATQPQETQGPPARPITVQPTEAWPRTSQGPSTRPVEVPGGRAVAAI LGLGLVLGLLGPLAILLALYLLRRDQRLPPDAHKPPGGGSFRTPIQEEQADAHSTLAK I" mat_peptide 90..836 /product="OX40 homologue" repeat_region 814..958 repeat_region 959..1079 polyA_signal 1341..1346 polyA_site 1358 BASE COUNT 257 a 452 c 423 g 241 t ORIGIN 1 cgaggatgtg cgtgggggct cggcggctgg gccgcgggcc gtgtgcggct ctgctcctcc 61 tgggcctggg gctgagcacc gtgacggggc tccactgtgt cggggacacc taccccagca 121 acgaccggtg ctgccacgag tgcaggccag gcaacgggat ggtgagccgc tgcagccgct 181 cccagaacac ggtgtgccgt ccgtgcgggc cgggcttcta caacgacgtg gtcagctcca 241 agccgtgcaa gccctgcacg tggtgtaacc tcagaagtgg gagtgagcgg aagcagctgt 301 gcacggccac acaggacaca gtctgccgct gccgggcggg cacccagccc ctggacagct 361 acaagcctgg agttgactgt gccccctgcc ctccagggca cttctcccca ggcgacaacc 421 aggcctgcaa gccctggacc aactgcacct tggctgggaa gcacaccctg cagccggcca 481 gcaatagctc ggacgcaatc tgtgaggaca gggacccccc agccacgcag ccccaggaaa 541 cccagggccc cccggccagg cccatcactg tccagcccac tgaagcctgg cccagaacct 601 cacagggacc ctccacccgg cccgtggagg tccccggggg ccgtgcggtt gccgccatcc 661 tgggcctggg cctggtgctg gggctgctgg gccccctggc catcctgctg gccctgtacc 721 tgctccggag ggaccagagg ctgccccccg atgcccacaa gccccctggg ggaggcagtt 781 tccggacccc catccaagag gagcaggccg acgcccactc caccctggcc aagatctgac 841 ctgggcccac caaggtggac gctgggcccc gccaggctgg agcccggagg gtctgctggg 901 cggggcccag cgtccacctt ggtgggccca ggtcagatct tggccagggt ggagtggggc 961 ccagcataac atacagtgcc gccaccagcc ccctgagccc tgtttccccc cactactctt 1021 ccaaatttgt ggagacaggg ctcagggggc tggtggcggc actgtatgtt atgctgggca 1081 tgacggccat gacttcggct atcttctgtt ttaggtcctt gatttcctga tctttctgaa 1141 ggatttgtcc ttgggcaatc tcgagctgcc gctttgcatc gcccagtgcg gagaacaggt 1201 ccagcttgat tctcgtctct gcacttaagc tgttctccag gtgcgtgtga tttgtcaaaa 1261 gaaagccttc tggatgctgt taagatgtac ccttcaggtg aacctggtat cagacccaca 1321 gtacttgctg tttgagaaaa aataaaaaca aaaaggtcaa aaaaaaaaaa aaa // LOCUS HSHOX4C 1133 bp RNA PRI 09-JAN-1992 DEFINITION Human HOX4C mRNA for a homeobox protein. ACCESSION X59372 NID g32390 KEYWORDS homeobox gene; HOX4C gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1133) AUTHORS Duboule,D. TITLE Direct Submission JOURNAL Submitted (02-MAY-1991) D. Duboule, E M B L, Meyerhofstrasse 1, Heidelberg, Germany REFERENCE 2 (bases 1 to 1133) AUTHORS Zappavigna,V., Renucci,A., Izpisua-Belmonte,J.C., Urier,G., Peschle,C. and Duboule,D. TITLE HOX4 genes encode transcription factors with potential auto- and cross-regulatory capacities JOURNAL EMBO J. 10 (13), 4177-4187 (1991) MEDLINE 92097538 FEATURES Location/Qualifiers source 1..1133 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="embryonic, 7wk p.c." /tissue_type="spinal cord" /clone_lib="cDNA" /chromosome="2" mRNA 1..1133 /gene="HOX4C" /note="cDNA" /evidence=experimental gene 1..1133 /gene="HOX4C" exon <37..802 /gene="HOX4C" /number=1 CDS 37..1065 /gene="HOX4C" /codon_start=1 /product="homeobox protein" /db_xref="PID:g32391" /db_xref="SWISS-PROT:P28356" /translation="MSSSGTLSNYYVDSLIGHEGDEVFAARFGPPGPGAQGRPAGVAD GPAATAAEFASCSFAPRSAVFSASWSAVPSQPPAAAAMSGLYHPYVPPPPLAASASEP GRYVRSWMEPLPGFPGGAGGGGGGGGGGPGRGPSPGPSGPANGRHYGIKPETRAAPAP ATAASTTSSSSTSLSSSSKRTECSVARESQGSSGPEFSCNSFLQEKAAGGDGGNGPGA GIGAATGTGGSSEPSACSDHPIPGCSLKEEEKQHSQPQQQQLDPNNPEANWIHARSTR KKRCPYTKYQTLELEKEFLFNMYLTRDRRYEVARILNLTERQVKIWFQNRRMKMKKMS KEKCPKGD" exon 803..1065 /gene="HOX4C" /number=2 BASE COUNT 198 a 381 c 372 g 182 t ORIGIN 1 agtgtaatgt tgggtgggag tgcgggacgc ctcaaaatgt cttccagtgg caccctcagc 61 aactactacg tggactcgct tataggccat gagggcgacg aggtgttcgc ggcgcgcttc 121 gggccgccgg ggccaggcgc gcagggccgg cctgcaggtg tggctgatgg cccggccgcc 181 accgccgccg agttcgcctc gtgtagtttt gcccccagat cggccgtgtt ctctgcctcg 241 tggtccgcgg tgccctccca gcccccggca gcggcggcga tgagcggcct ctaccacccg 301 tacgttcccc cgccgcccct ggccgcctct gcctccgagc ccggccgcta cgtgcgctcc 361 tggatggagc cgctgcccgg cttcccgggc ggtgcgggcg gtggcggtgg tggtggaggc 421 ggcggtccgg gccgcggtcc cagccctggc cccagcggcc cagccaacgg gcgccactac 481 gggattaagc ctgaaacccg agcggccccg gcccccgcca cggccgcctc caccacctcc 541 tcctcctcca cttccttatc ctcctcctcc aaacggactg agtgctccgt ggcccgggag 601 tcccagggga gcagcggccc cgagttctcg tgcaactcgt tcctgcagga gaaggcggca 661 ggcggcgacg gggggaacgg gcctggggca gggatcgggg ccgcgactgg gacgggcggc 721 tcgtcggagc cctcagcttg cagcgaccac ccgatcccag gctgttcgct gaaggaggag 781 gagaagcagc attcgcagcc gcagcagcag caacttgacc caaacaaccc cgaagcgaac 841 tggatccacg ctcgctccac ccggaaaaag cgctgtccct acaccaaata ccagacgctt 901 gagctggaga aagaattcct cttcaacatg tacctcaccc gggaccggcg ctacgaggtg 961 gccaggattc tcaacctaac agagagacag gtcaaaatct ggtttcagaa ccgtaggatg 1021 aaaatgaaaa agatgagcaa ggagaaatgc cccaaaggag actgacccgg cgcggtgctg 1081 gcgggagcgc tcaagggcag cggatttgtt gttgttgctg tttcctttgt ggg // LOCUS HSHOX4D 1126 bp RNA PRI 09-JAN-1992 DEFINITION Human HOX4D mRNA for a homeobox protein. ACCESSION X59373 NID g32392 KEYWORDS homeobox gene; HOX4D gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1126) AUTHORS Duboule,D. TITLE Direct Submission JOURNAL Submitted (02-MAY-1991) D. Duboule, E M B L, Meyerhofstrasse 1, Heidelberg, Germany REFERENCE 2 (bases 1 to 1126) AUTHORS Zappavigna,V., Renucci,A., Izpisua-Belmonte,J.C., Urier,G., Peschle,C. and Duboule,D. TITLE HOX4 genes encode transcription factors with potential auto- and cross-regulatory capacities JOURNAL EMBO J. 10 (13), 4177-4187 (1991) MEDLINE 92097538 FEATURES Location/Qualifiers source 1..1126 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="embryo, 7wk p.c." /tissue_type="spinal cord" /clone_lib="cDNA" /chromosome="2" mRNA 1..1126 /gene="HOX4D" /note="cDNA" /evidence=experimental gene 1..1126 /gene="HOX4D" CDS 34..1056 /gene="HOX4D" /codon_start=1 /product="homeobox protein" /db_xref="PID:g32393" /db_xref="SWISS-PROT:P28358" /translation="MSFPNSSPAANTFLVDSLISACRSDSFYSSSASMYMPPPSADMG TYGMQTCGLLPSLAKREVNHQNMGMNVHPYIPQVDSWTDPNRSCRIEQPVTQQVPTCS FTTNIKEESNCCMYSDKRNKLISAEVPSYQRLVPESCPVENPEVPVPRYFRLSQTYAT GKTQEYNNSPEGSSTVMLQLNPRGAAKPQLSAAQLQMEKKMNEPVSGQEPTKVSQVES PEAKGGLPEERSCLAEVSVSSPEVQEKESKEEIKSDTPTSNWLTAKSGRKKRCPYTKH QTLELEKEFLFNMYLTRERRLEISKSVNLTDRQVKIWFQNRRMKLKKMSRENRIRELT ANLTFS" exon <34..779 /gene="HOX4D" /number=1 exon 780..1056 /gene="HOX4D" /number=2 BASE COUNT 309 a 316 c 275 g 226 t ORIGIN 1 cacaatctct cttcttcaaa ttctttcccc aaaatgtcct ttcccaacag ctctcctgct 61 gctaatactt ttttagtaga ttccttgatc agtgcctgca ggagtgacag tttttattcc 121 agcagcgcca gcatgtacat gccaccacct agcgcagaca tggggaccta tggaatgcaa 181 acctgtggac tgctcccgtc tctggccaaa agagaagtga accaccaaaa tatgggtatg 241 aatgtgcatc cttatatacc tcaagtagac agttggacag atccgaacag atcttgtcga 301 atagagcaac ctgttacaca gcaagtcccc acttgctcct tcaccaccaa cattaaggaa 361 gaatccaatt gctgcatgta ttctgataag cgcaacaaac tcatttcggc cgaggtccct 421 tcgtaccaga ggctggtccc tgagtcttgt cccgttgaga accctgaggt tcccgtccct 481 cgatatttta gactgagtca gacctacgcc accgggaaaa cccaagagta caataatagc 541 cccgaaggca gctccactgt catgctccag ctcaaccctc gtggcgcggc caagccgcag 601 ctctccgctg cccagctgca gatggaaaag aagatgaacg agcccgtgag cggccaggag 661 cccaccaaag tctcccaggt ggagagcccc gaggccaaag gcggccttcc cgaagagagg 721 agctgcctgg ctgaggtctc cgtgtccagt cccgaagtgc aggagaagga aagcaaagag 781 gaaatcaagt ctgatacacc aaccagcaat tggctcactg caaagagtgg cagaaagaag 841 aggtgccctt acactaagca ccaaacgctg gaattagaaa aagagttctt gttcaatatg 901 tacctcaccc gcgagcgccg cctagagatc agtaagagcg ttaacctcac cgacaggcag 961 gtcaagattt ggtttcaaaa ccgccgaatg aaactcaaga agatgagccg agagaaccgg 1021 atccgagaac tgaccgccaa cctcacgttt tcttaggtct gaggccggtc tgaggccgga 1081 tcagaggcca ggattggaga gggggcaccg cgttccaggg cccagt // LOCUS HSHPBRII4 3426 bp RNA PRI 21-JUN-1995 DEFINITION H.sapiens HPBRII-4 mRNA. ACCESSION X67337 NID g871298 KEYWORDS HPBRII-4. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3426) AUTHORS Fleischhauer,K.L. JOURNAL Unpublished REFERENCE 2 (bases 1 to 3426) AUTHORS Fleischhauer,K.L. TITLE Direct Submission JOURNAL Submitted (01-JUN-1992) K.L. Fleischhauer, Memorial Sloan Kettering Cancer Center, 1275 York Ave., New York NY 10021, USA COMMENT See also X67336. FEATURES Location/Qualifiers source 1..3426 /organism="Homo sapiens" /isolate="HPB-ALL" /db_xref="taxon:9606" /cell_line="HPB-ALL" /clone_lib="cDNA HPB-All" /clone="HPB-ALL.1" gene 35..1690 /gene="HPBRII-4" CDS 35..1690 /gene="HPBRII-4" /codon_start=1 /db_xref="PID:g871299" /translation="MADGVDHINIYADVGEEFNQEAEYGGHDQIDLYDDVISPSANNG DAPEDRDYMDTLPPTVGDDVGKGAAPNVVYTYTGKRIALYIGNLTWWTTDEDLTEAVH SLGVNDILEIKFFENRANGQSKGFALVGVGSEASSKKLMDLLPKRELHGQNPVVTPCN KQFLSQFEMQSRKTTQSGQMSGEGKAGPPGGSSRAAFPQGGRGRGRFPGAVPGGDRFP GPAGPGGPPPPFPAGQTPPRPPLGPPGPPGPPGPPPPGQVLPPPLAGPPNRGDRPPPP VLFPGQPFGQPPLGPLPPGPPPPVPGYGPPPGPPPPQQGPPPPPGPFPPRPPGPLGPP LTLAPPPHLPGPPPGAPPPAPHVNPAFFPPPTNSGMPTSDSRGPPPTDPYGRPPPYDR GDYGPPGREMDTARTPLSEAEFEEIMNRNRAISSSAISRAVSDASAGDYGSAIETLVT AISLIKQSKVSADDRCKVLISSLQDCLHGIESKSYGSGSRRERSRERDHSRSREKSRR HKSRSRDRHDDYYRERSRERERHRDRDRDRDRERDREREYRHR" BASE COUNT 983 a 695 c 730 g 1018 t ORIGIN 1 aattccgggc ggcggcggcc gaggctgaag gaagatggcg gacggcgtgg accacataaa 61 catttacgcg gatgtcggcg aagagttcaa ccaggaagct gaatatggtg ggcatgatca 121 gatagatttg tatgacgatg tcatatctcc atctgcaaat aatggagatg ccccagaaga 181 ccgagattac atggatactc tcccaccaac tgttggtgat gatgtgggta aaggagcagc 241 accaaatgtt gtctatacat atactggaaa gagaattgca ttatatattg gaaatctaac 301 atggtggaca acagatgaag acttaactga agcagttcat tctttgggag taaatgatat 361 tttggagata aaattttttg aaaatcgagc aaatggccag tcaaaggggt ttgcccttgt 421 tggtgttgga tctgaagcat cttcaaaaaa gttaatggat ctgttaccta aaagagaact 481 tcatggtcag aatcctgttg taactccatg caataaacag ttcctgagtc aatttgaaat 541 gcagtccagg aaaactacac aatcaggaca aatgtctggg gaaggtaaag ctggtcctcc 601 aggaggcagt tcccgtgcag catttccaca aggtggtaga ggacggggcc gttttccagg 661 ggctgttcct ggtggggaca gatttcctgg gccagcagga ccaggagggc cacccccacc 721 ttttccagct ggacagactc caccacgtcc acccttaggt cctccaggcc cacctggtcc 781 accaggtcct ccacctcctg gtcaggttct gcctcctcct ctagctgggc ctcctaatcg 841 aggagatcgc cctccaccac cagttctttt tcctggacaa ccttttgggc agcctccatt 901 gggtccactt cctcctggcc ctccacctcc agttccaggc tacggccccc ctcctggccc 961 accacctcca caacagggac cacctccacc tccaggcccc tttccacctc gtccacccgg 1021 tccacttggg ccacccctta cactagctcc tcctccgcat cttcctggac cacctccagg 1081 tgccccaccg ccagctccgc atgtgaaccc agctttcttt cctccaccaa ctaacagtgg 1141 catgcctaca tcagatagcc gaggtccacc accaacagat ccatatgggc gacctccacc 1201 atatgatagg ggtgactatg gcccccctgg aagggaaatg gatactgcaa gaacgccatt 1261 gagtgaagct gaatttgaag aaatcatgaa tagaaatagg gcaatctcaa gcagtgctat 1321 ttcgagagct gtgtctgatg ccagtgctgg tgattatggg agtgctattg agacactggt 1381 aactgcaatt tctttaatta aacaatccaa agtatctgct gatgatcgtt gcaaagttct 1441 tattagttct ttgcaagatt gccttcatgg aattgagtcc aagtcttatg gttctggatc 1501 aagacgtgaa cgatcaagag agagggacca tagtagatca cgagaaaaga gtcgacgtca 1561 taaatcccgt agtagagacc gtcatgacga ttattacaga gagagaagca gagaacgaga 1621 gaggcaccgg gatcgtgacc gagaccgtga ccgagagcgt gaccgagagc gcgaatatcg 1681 tcatcgttag aagctgaagg aagaggatca ccttccaaga caaaacagtc ttcatgggcc 1741 aaaaatgacg cttgtccagc agtttgcttc ttgtgattga actgaacctg taaggattca 1801 tggataaaat gaacaggaat agatctgaat aaagcaaatc tgcataaatg gtaaccagta 1861 gctctacttt tattttttat gttgcttaac tgttttattt gaaggaaacc tgtgtgattt 1921 aaaaagttat agcttttgca actttattac tggttatata catttggcca ttatgatgtg 1981 caagcaattg gaaaaaaagt caagtaaatg cttgtttttg tagtagtttg ttcttgttaa 2041 aaatgtttat atgataatgt ctgtaaacag catcactttg attacaatag atgtagtgtt 2101 gtaataaact gtttaatggg gctgatgtgt aaagctgttc aagttatttg atgtttacac 2161 ctcagggaaa gtcttgtgtt cagcaatatc taaagataat gttactatga caacattttt 2221 actgtccttt aaagcattgc aatagcgttt ttggatatgc ctcaatctaa tcttgcgttc 2281 agtgaattaa acatagtaat taagtgtctt ttgcccttga ttttgatatt agaataggtg 2341 attacatgga tatttaatat ttctatattc tgcttttcta gctgttttta cctagttagc 2401 ttgtgacttt gctgaatggt atgtaaactt gtaaaaatag agatttgaca gacatagcaa 2461 tctagtcaat gtgtaagggg tcaaaaaaaa cagaggtttt aacacataag taaaaacccg 2521 tacatatttg atgtgtaatg caggttaatt acaacacaga tgtaccgaaa cacttaattg 2581 tgaaccgcta acattgaaga aattttgaca attccgattt gatgctgcaa ttacttgctg 2641 tttttattga tcttatggtt tatttcttaa gccatagtca gtgtaaatac agccctgcag 2701 caggtaaatg tgagtaaaga gagccttata ttttccaatt ggtataaaat ttttgaagga 2761 tgtgatgttc attaacattc ggttgtattc cccagtattt gtaatgggaa attacagata 2821 aaccgtgtct gcacagttta aggaatacta tgtatattca tgcaccgtat tgattcatgc 2881 tatagttact taatcaaaga tttttttcaa acctgcctta catataggcc cactttaaaa 2941 gcacctgact agcatgtgtt cttgattgca aaattggcag aggcagggtg tcaacttgat 3001 taggtgtttt tatgggaatg taatttgaaa tcactacttc agaaatttga cttaaaattc 3061 ttgagcacgt taatatgttt ttaagatctg attatctttg agagatcttc tgttaataca 3121 cattggttgt taaagagtac ccaaattcta ggacaatgct taaagtgtta aaatacccta 3181 gatactgtgt tatgtgcaac tgtagaaacc ctccagaaat ttccactgct gttcttcact 3241 ttcatcttgt ctgctatcaa accacttctg acaaaattag ctgttttgaa ttacccatat 3301 cactgccagt tttattttaa aatattttgt gtttgaagta tctgtgcatg ggatcgttga 3361 tgtttatcag aactgttcac tttcagaaat gattttttaa agcattttgt tgaaatgcgg 3421 ttgctt // LOCUS HSHPCP 1182 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for hematopoetic proteoglycan core protein. ACCESSION X17042 M25538 NID g32432 KEYWORDS haematopoetic proteoglycan core protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1182) AUTHORS Stellrecht,C.M. and Saunders,G.F. TITLE Nucleotide sequence of a cDNA encoding a hemopoietic proteoglycan core protein JOURNAL Nucleic Acids Res. 17 (18), 7523 (1989) MEDLINE 90016819 COMMENT Data kindly reviewed (29-JAN-1990) by Saunders G. FEATURES Location/Qualifiers source 1..1182 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="blood" /cell_type="myelogenous leukemia" /clone="pD-D2" CDS 25..501 /note="hematopoetic proteoglycan core protein (AA 1 - 158)" /codon_start=1 /db_xref="PID:g32433" /db_xref="SWISS-PROT:P10124" /translation="MMQKLLKCSRLVLALALILVLESSVQGYPTQRARYQWVRCNPDS NSANCLEEKGPMFELLPGESNKIPRLRTDLFPKTRIQDLNRIFPLSEDYSGSGFGSGS GSGSGSGSGFLTEMEQDYQLVDESDAFHDNLRSLDRNLPSDSQDLGQHGLEEDFML" misc_feature 1163..1168 /note="pot. polyA signal" polyA_site 1182 /note="polyA site" BASE COUNT 359 a 223 c 224 g 376 t ORIGIN 1 gaattccgct agactaagtt ggtcatgatg cagaagctac tcaaatgcag tcggcttgtc 61 ctggctcttg ccctcatcct ggttctggaa tcctcagttc aaggttatcc tacgcagaga 121 gccaggtacc aatgggtgcg ctgcaatcca gacagtaatt ctgcaaactg ccttgaagaa 181 aaaggaccaa tgttcgaact acttccaggt gaatccaaca agatcccccg tctgaggact 241 gacctttttc caaagacgag aatccaggac ttgaatcgta tcttcccact ttctgaggac 301 tactctggat caggcttcgg ctccggctcc ggctctggat caggatctgg gagtggcttc 361 ctaacggaaa tggaacagga ttaccaacta gtagacgaaa gtgatgcttt ccatgacaac 421 cttaggtctc ttgacaggaa tctgccctca gacagccagg acttgggtca acatggatta 481 gaagaggatt ttatgttata aaagaggatt ttcccacctt gacaccaggc aatgtagtta 541 gcatatttta tgtaccatgg ttatatgatt aatcttggga caaagaattt tatagaaatt 601 tttaaacatc tgaaaaagaa gcttaagttt tatcatcctt ttttttctca tgaattctta 661 aaggattatg ctttaatgct gttatctatc ttattgttct tgaaaatacc tgcatttttt 721 ggtatcatgt tcaaccaaca tcattatgaa attaattaga ttcccatggc cataaaatgg 781 ctttaaagaa tatatatata tttttaaagt agcttgagaa gcaaattggc aggtaatatt 841 tcatacctaa attaagactc tgacttggat tgtgaattat aatgatatgc cccttttctt 901 ataaaaacaa aaaaaaaata atgaaacaca gtgaatttgt agagtggggg tatttgacat 961 attttacagg gtggagtgta ctatatacta ttacctttga atgtgtttgc agagctagtg 1021 gatgtgtttg tctacaagta tgattgctgt tacataacac cccaaattaa ctcccaaatt 1081 aaaacacagt tgtgctgtca atacctcata ctgctttacc tttttttcct ggatatctgt 1141 gtattttcaa atgttactat atattaaagc agaaatataa cc // LOCUS HSHPDMPK 1598 bp RNA PRI 30-MAY-1997 DEFINITION H.sapiens mRNA for hypothetical protein downstream of DMPK and DMAHP. ACCESSION Y10936 NID g2143253 KEYWORDS hypothetical protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1598) AUTHORS Alwazzan,M., Hamshere,M.G., Lennon,G. and Brook,J.D. TITLE Six transcripts map within 200 kilobases of the myotonic dystrophy expanded repeat JOURNAL Unpublished REFERENCE 2 (bases 1 to 1598) AUTHORS Alwazzan,M. TITLE Direct Submission JOURNAL Submitted (29-JAN-1997) M. Alwazzan, Queens Medical Centre, Genetics, University of Nottingham, Nottingham NG7 2UH, UK FEATURES Location/Qualifiers source 1..1598 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="19" /map="q13.3" /dev_stage="adult" /tissue_type="frontal cortex" /clone="20D7-FC4" CDS 332..940 /note="maps downstream of DMPK and DMAHP" /codon_start=1 /product="hypothetical protein" /db_xref="PID:e305308" /db_xref="PID:g2143254" /translation="MTCLTDVPTGCAAVEPTARLPAAAWASTITTGCCPAMGQAGAGP AGRKGSEAGGGPGRAHHAHPSPLPREPRVRTGPPAHSPTPGRIDPSPELSWGSTGVTQ ESPLLDPVDFLLFRTRAVDPLRRVFFFFYQHLTFFSIQPQPPPCHAFHPKDPPAGSRR QLILVPLKGPPILAPILSLTPILSRWSCYFPRSRIAQGWHLS" BASE COUNT 349 a 473 c 389 g 387 t ORIGIN 1 tccctgtgcc gcttgtaccg gcacgtgtcg cacgacttcc tagagatccg cttcaagatt 61 cagcggctgc tggagccgcg acagtacatg ctgctgctgc ccgagcacgt gctggtcaag 121 atcttcagct tcctgcccac gcgcgcgctg gccgccctca agtgcacctg ccaccacttc 181 aagggcatca tcgaggcgtt tggcgtgcgg gccacagact cgcgctggag ccgagacccg 241 ctctaccgcg atgatccgtg caaacagtgc cgcaagagat acgagaaggg cgacgtgtcg 301 ctctgccgct ggcaccccaa gccctaccac catgacctgc cttacggacg ttcctactgg 361 atgtgctgcc gtcgagccga ccgcgagact cccggctgcc gcctgggcct ccacgataac 421 aactgggtgc tgccctgcaa tgggccaggc gggggccggg ccggccggga ggaagggaag 481 tgaagccggg ggagggccgg ggagagccca ccacgcccac ccctcccctc tcccccggga 541 gccgagggtc cggactgggc cgcctgccca ttcccctact ccaggcagga ttgatccctc 601 accggaactg agctggggtt caacaggggt gacccaggaa agccctctgc ttgatcctgt 661 tgattttcta ctcttcagaa ccagggccgt ggaccctctg agaagggtgt tttttttttt 721 ttatcagcat ctcactttct tctccattca gccccaaccc cctccctgcc atgctttcca 781 ccccaaggac ccaccagcag ggagcagacg gcagctaatt ttggtaccac ttaagggtcc 841 ccccattctg gcccccatcc tctccctcac cccaattctt tctcgctgga gctgttattt 901 cccaagaagc cgcatcgccc aaggttggca cctctcctaa cctgccacac gacagctctc 961 acctctctag gaattggggg ctgtcaggtc acaggtggga tctggcattt ttttatgaca 1021 gtccatttct agatggtttg gctattaaag aagtgggggg gaaatactgt tttctcctta 1081 acctcaagct accagtctct cctcttccgc gtagaggaag aggggggcag acaaaaaaaa 1141 agctgaagta taaaaaccct cctcctcccg ttattattta agctactgcc atcaacccca 1201 cccccataaa gctgtgaagc cctttgcctc gcttttgaca gcgtgggggg ggcccgtggg 1261 cagggacttc ggatttgcat ttctgggttg tttttccttc cactctgggg tctcgttcag 1321 gcttggggtc tcctcacccc agccacatgt tttctttaag aatcctttgg tcaaccagga 1381 ccttgatttt tcaggcattt tggtctgggg tatttttgtt tgttctctct gttttttgtt 1441 ttcttttttt tctttccatc gtggttctgg aagctttcta gcgtgtggca tctgaccaat 1501 tttgaattgg tccttttcta taaatcaaaa ttaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1561 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa // LOCUS HSHPGF1 2259 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for human heparin-binding growth factor 1/ acidic fibroblast growth factor. ACCESSION X51943 NID g32435 KEYWORDS fibroblast growth factor; growth factor; heparin-binding growth factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2259) AUTHORS Chiu,I.M. TITLE Direct Submission JOURNAL Submitted (20-FEB-1990) Chiu I.-M., Dept of Internal Medicine, Davis Medical Research Centre, The Ohio State University, 480 West 9th Ave, Columbus OH 43210, USA REFERENCE 2 (bases 1 to 2259) AUTHORS Chiu,I.M., Wang,W.P. and Lehtoma,K. TITLE Alternative splicing generates two forms of mRNA coding for human heparin-binding growth factor 1 JOURNAL Oncogene 5 (5), 755-762 (1990) MEDLINE 90265618 FEATURES Location/Qualifiers source 1..2259 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="neonate" /tissue_type="brain stem" /clone="pHBGF1.1, 1.2, 1.3, 1.4 and 1.5" CDS 35..502 /note="HBGF-1 (AA 1-155)" /codon_start=1 /db_xref="PID:g32436" /db_xref="SWISS-PROT:P05230" /translation="MAEGEITTFTALTEKFNLPPGNYKKPKLLYCSNGGHFLRILPDG TVDGTRDRSDQHIQLQLSAESVGEVYIKSTETGQYLAMDTDGLLYGSQTPNEECLFLE RLEENHYNTYISKKHAEKNWFVGLKKNGSCKRGPRTHYGQKAILFLPLPVSSD" misc_feature 203..204 /note="ex1/ex2 splice site" misc_feature 307..308 /note="ex2/ex3 splice site" polyA_site 2235 /note="polyA site" BASE COUNT 667 a 497 c 535 g 560 t ORIGIN 1 tcttgaaagc gccacaagca gcagctgctg agccatggct gaaggggaaa tcaccacctt 61 cacagccctg accgagaagt ttaatctgcc tccagggaat tacaagaagc ccaaactcct 121 ctactgtagc aacgggggcc acttcctgag gatccttccg gatggcacag tggatgggac 181 aagggacagg agcgaccagc acattcagct gcagctcagt gcggaaagcg tgggggaggt 241 gtatataaag agtaccgaga ctggccagta cttggccatg gacaccgacg ggcttttata 301 cggctcacag acaccaaatg aggaatgttt gttcctggaa aggctggagg agaaccatta 361 caacacctat atatccaaga agcatgcaga gaagaattgg tttgttggcc tcaagaagaa 421 tgggagctgc aaacgcggtc ctcggactca ctatggccag aaagcaatct tgtttctccc 481 cctgccagtc tcttctgatt aaagagatct gttctgggtg ttgaccactc cagagaagtt 541 tcgaggggtc ctcacctggt tgacccaaaa atgttccctt gaccattggc tgcgctaacc 601 cccagcccac agagcctgaa tttgtaagca acttgcttct aaatgcccag ttcacttctt 661 tgcagagcct tttacccctg cacagtttag aacagaggga ccaaattgct tctaggagtc 721 aactggctgg ccagtctggg tctgggtttg gatctccaat tgcctcttgc aggctgagtc 781 cctccatgca aaagtggggc taaatgaagt gtgttaaggg gtcggctaag tgggacatta 841 gtaactgcac actatttccc tctactgagt aaaccctatc tgtgattccc ccaaacatct 901 ggcatggctc ccttttgtcc ttcctgtgcc ctgcaaatat tagcaaagaa gcttcatgcc 961 aggttaggaa ggcagcattc catgaccaga aacagggaca aagaaatccc cccttcagaa 1021 cagaggcatt taaaatggaa aagagagatt ggattttggt gggtaactta gaaggatggc 1081 atctccatgt agaataaatg aagaaaggga ggcccagccg caggaaggca gaataaatcc 1141 ttgggagtca ttaccacgcc ttgaccttcc caaggttact cagcagcaga gagccctggg 1201 tgacttcagg tggagagcac tagaagtggt ttcctgataa caagcaagga tatcagagct 1261 gggaaattca tgtggatctg gggactgagt gtgggagtgc agagaaagaa agggaaactg 1321 gctgagggga taccataaaa agaggatgat ttcagaagga gaaggaaaaa gaaagtaatg 1381 ccacacattg tgcttggccc ctggtaagca gaggctttgg ggtcctagcc cagtgcttct 1441 ccaacactga agtgcttgca gatcatctgg ggacctggtt tgaatggaga ttctgattca 1501 gtgggttggg ggcagagttt ctgcagttcc atcaggtccc ccccaggtgc aggtgctgac 1561 aatactgctg ccttacccgc catacattaa ggagcagggt cctggtccta aagagttatt 1621 caaatgaagg tggttcgacg ccccgaacct cacctgacct caactaaccc ttaaaaatgc 1681 acacctcatg agtctacctg agcattcagg cagcactgac aatagttatg cctgtactaa 1741 ggagcatgat tttaagaggc tttggccaat gcctataaaa tgcccatttc gaagatatac 1801 aaaaacatac ttcaaaaatg ttaaaccctt accaacagct tttcccagga gaccatttgt 1861 attaccatta cttgtataaa tacacttcct gcttaaactt gacccaggtg gctagcaaat 1921 tagaaacacc attcatctct aacatatgat actgatgcca tgtaaaggcc tttaataagt 1981 cattgaaatt tactgtgaga ctgtatgttt taattgcatt taaaaatata tagcttgaaa 2041 gcagttaaac tgattagtat tcaggcactg agaatgatag taataggata caatgtataa 2101 gctactcact tatctgatac ttatttacct ataaaatgag atttttgttt tccactgtgc 2161 tattacaaat tttcttttga aagtaggaac tcttaagcaa tggtaattgt gaataaaaat 2221 tgatgagagt gttaaaaaaa aaaaaaaaaa cccgaattc // LOCUS HSHPS12 1788 bp RNA PRI 12-SEP-1993 DEFINITION Human pHS1-2 mRNA with ORF homologous to membrane receptor proteins. ACCESSION X12433 NID g32451 KEYWORDS transmembrane protein; unidentified reading frame. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1788) AUTHORS Rapiejko,P.J. TITLE Direct Submission JOURNAL Submitted (19-JUL-1988) State University of New york at Stony Brook, Dept. of Pharmaceutical Sciences, SUNY at Stony Brook, Stony Brook, NY 11794-8651 REFERENCE 2 (bases 1 to 1788) AUTHORS Rapiejko,P.J., George,S.T. and Malbon,C.C. TITLE Primary structure of a human protein which bears structural similarities to members of the rhodopsin/beta-adrenergic receptor family JOURNAL Nucleic Acids Res. 16 (17), 8721 (1988) MEDLINE 88335630 COMMENT the put. ORF encodes a protein showing homolgies to bovine rhodopsin, beta2-adrenergic receptor, musacrinic M1 receptor and G21 protein. FEATURES Location/Qualifiers source 1..1788 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="lambda gt11" CDS 176..1453 /note="put. ORF" /codon_start=1 /db_xref="PID:g32452" /db_xref="SWISS-PROT:P08910" /translation="MNAMLETPELPAVFDGVKLAAVAAVLYVIVRCLNLKSPTAPPDL YFQDSGLSRFLLKSCPLLTKEYIPPLIWGKSGHIQTALYGKMGRVRSPHPYGHRKFIT MSDGATSTFDLFEPLAEHCVGDDITMVICPGIANHSEKQYIRTFVDYAQKNGYRCAVL NHLGALPNIELTSPRMFTYGCTWEFGAMVNYIKKTYPLTQLVVVGFSLGGNIVCKYLG ETQANQEKVLCCVSVCQGYSALRAQETFMQWDQCRRFYNFLMADNMKKIILSHRQALF GDHVKKPQSLEDTDLSRLYTATSLMQIDDNVMRKFHGYNSLKEYYEEESCMRYLHRIY VPLMLVNAADDPLVHESLLTIPKSLSEKRENVMFVLPLHGGHLGFFEGSVLFPEPLTW MDKLVVEYANAICQWERNKLQCSDTEQVEADLE" BASE COUNT 414 a 472 c 468 g 434 t ORIGIN 1 gaattcgggc ggggagctgc aggaaccaga ctgggggcga gctgagcacc tgtagtcaat 61 cacacgcagc ttttaggttt gtttgaataa gagatctgac ctgaccggcc caactgtaca 121 actcttcaag gaaaattcgt atttgcagtg ggaagaataa gtaacattga tcaagatgaa 181 tgccatgctg gagactcccg aactcccagc cgtgtttgat ggagtgaagc tggctgcagt 241 ggctgctgtg ctgtacgtga tcgtccggtg tttgaacctg aagagcccca cagccccacc 301 tgacctctac ttccaggact cggggctctc acgctttctg ctcaagtcct gtcctcttct 361 gaccaaagaa tacattccac cgttgatctg ggggaaaagt ggacacatcc agacagcctt 421 gtatgggaag atgggaaggg tgaggtcgcc acatccttat gggcaccgga agttcatcac 481 tatgtctgat ggagccactt ctacattcga cctcttcgag cccttggctg agcactgtgt 541 tggagatgat atcaccatgg tcatctgccc tggaattgcc aatcacagcg agaagcaata 601 catccgcact ttcgttgact acgcccagaa aaatggctat cggtgcgccg tgctgaacca 661 cctgggtgcc ctgcccaaca ttgaattgac ctcgccacgc atgttcacct atggctgcac 721 gtgggaattt ggagccatgg tgaactacat caagaagaca tatcccctga cccagctggt 781 cgtcgtgggc ttcagcctgg gtggtaacat tgtgtgcaaa tacttggggg agactcaggc 841 aaaccaagag aaggtcctgt gctgcgtcag cgtgtgccag gggtacagtg cactgagggc 901 ccaggaaacc ttcatgcaat gggatcagtg ccggcggttc tacaacttcc tcatggctga 961 caacatgaag aagatcatcc tctcgcacag gcaagctctt tttggagacc atgttaagaa 1021 accccagagc ctggaagaca cggacttgag ccggctctac acagcaacat ccctgatgca 1081 gattgatgac aatgtgatga ggaagtttca cggctataac tccctgaagg aatactatga 1141 ggaagaaagt tgcatgcggt acctgcacag gatttatgtt cctctcatgc tggttaatgc 1201 agctgacgat ccgttggtgc atgaaagtct tctaaccatt ccaaaatctc tttcagagaa 1261 acgagagaac gtcatgtttg tgctgcctct gcatgggggc cacttgggct tctttgaggg 1321 ctctgtgctg ttccccgagc ccctgacatg gatggataag ctggtggtgg agtacgccaa 1381 cgccatttgc caatgggagc gtaacaagtt gcagtgctct gacacggagc aggtggaggc 1441 cgacctggag tgaggcctcc ggactctggc acgctccagc agccctcctc tggaagctgc 1501 gtcccctcac cccctgtttc aggtctccca tctccctcag tgacctggat ctgacctcac 1561 accatcagca gggggcaccc accatgcaca cctgtctcgg agtaggcagc tcttcctggg 1621 agctccaggc tatttttgtg cttagttact ggttttctcc attgcattgt taggcatggt 1681 gacaagtgac agagttcttg ccctctgtcc agtttcagca tctggttgct tttaagccaa 1741 gtacatctag tttccctatt aaaaatgtgt ctgaatcccc ccgaattc // LOCUS HSHREV107 1070 bp RNA PRI 24-NOV-1995 DEFINITION H.sapiens mRNA for rat HREV107-like protein. ACCESSION X92814 NID g1054751 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1070) AUTHORS Husmann,K. and Schaefer,R. JOURNAL Unpublished REFERENCE 2 (bases 1 to 1070) AUTHORS Schaefer,R. TITLE Direct Submission JOURNAL Submitted (27-OCT-1995) R. Schaefer, Division of Cancer Research Dept., Dept. of Pathology, Schmelzbergstr. 12, CH 8091 Zuerich, SWITZERLAND FEATURES Location/Qualifiers source 1..1070 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="liver" /clone_lib="Clontech cat. no. HL1188x" gene 408..896 /gene="orf" CDS 408..896 /gene="orf" /note="homologous to rat HREV107 (ACC.NO. X76453)" /codon_start=1 /db_xref="PID:g1054752" /translation="MRAPIPEPKPGDLIEIFRPFYRHWAIYVGDGYVVHLAPPSEVAG AGAASVMSALTDKAIVKKELLYDVAGSDKYQVNNKHDDKYSPLPCTKIIQRAEELVGQ EVLYKLTSENCEHFVNELRYGVARSDQVRDVIIAASVAGMGLAAMSLIGVMFSRNKRQ KQ" polyA_signal 1016..1021 polyA_signal 1020..1026 BASE COUNT 246 a 259 c 333 g 232 t ORIGIN 1 gcgattgctg gggctgcagc gctgcctccg agaccgagag tgggtggagc gggtcttcct 61 ggaagggtgc gataaggccg ggcgaggtgc ctgggatgct tctccccttc cgcgaggaag 121 agatctaatt gggtagggcg ggtgtagact agcctgccga gccgcccgct ggcacctgca 181 gcctcctggg cgcccgcggg cccggcgaga aagttgttaa agggagcgag gtggttgttc 241 ctggggtccg aggcgcgcct ctcacgccct gcccaacaga agccgcagtc ccgtggggtc 301 tggagacgca gtttccttgt taatgacaat aaatccctgc tccccctgcc tcagacatct 361 acgcagcgaa atcgagcctg gccttgaggg tccacaccgc gaggaagatg cgtgcgccca 421 ttccagagcc taagcctgga gacctgattg agatttttcg ccctttctac agacactggg 481 ccatctatgt tggcgatgga tatgtggttc atctggcccc tccaagtgag gtcgcaggag 541 ctggtgcagc cagtgtcatg tccgccctga ctgacaaggc catcgtgaag aaggaattgc 601 tgtatgatgt ggccgggagt gacaagtacc aggtcaacaa caaacatgat gacaagtact 661 cgccgctgcc ctgcacgaaa atcatccagc gggcggagga gctggtgggg caggaggtgc 721 tctacaagct gaccagtgag aactgcgagc actttgtgaa tgagctgcgc tatggagtcg 781 cccgcagtga ccaggtcaga gatgtcatca tcgctgcaag cgttgcagga atgggcttgg 841 cagccatgag ccttattgga gtcatgttct caagaaacaa gcgacaaaag caataactga 901 aaaagactgt ctgtcagcga tgactttata catcaagggg gtcttgtttt gctagagagt 961 ttggggtttg gtttgtggat ttcattgtga tttataataa ggcttatttt cacagaataa 1021 aataaagcaa aacgagggag gattttattg ggggagtgca gcccaaaaaa // LOCUS HSHRPB70 341 bp RNA PRI 16-OCT-1997 DEFINITION H.sapiens mRNA for RNA polymerase II subunit. ACCESSION Z47727 NID g717186 KEYWORDS RNA polymerase II. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 341) AUTHORS Shpakovski,G.V., Acker,J., Wintzerith,M., Lacroix,J.F., Thuriaux,P. and Vigneron,M. TITLE Four subunits that are shared by the three classes of RNA polymerase are functionally interchangeable between Homo sapiens and Saccharomyces cerevisiae JOURNAL Mol. Cell. Biol. 15 (9), 4702-4710 (1995) MEDLINE 95379812 REFERENCE 2 (bases 1 to 341) AUTHORS Vigneron,M. TITLE Direct Submission JOURNAL Submitted (12-JAN-1995) Marc Vigneron, IGBMC, 1, rue Laurent Fries, Illkirch, Alsace, 67404, FRANCE FEATURES Location/Qualifiers source 1..341 /organism="Homo sapiens" /strain="HeLa cells" /db_xref="taxon:9606" /clone="hRPB7.0" /cell_type="fibroblast, transformed" /cell_line="HeLa" /sex="Female" CDS 50..226 /codon_start=1 /product="RNA polymerase II" /db_xref="PID:g717187" /db_xref="SWISS-PROT:P53803" /translation="MDTQKDVQPPKQQPMIYICGECHTENEIKSRDPIRCRECGYRIM YKKRTKRLVVFDAR" BASE COUNT 101 a 60 c 81 g 99 t ORIGIN 1 ggatttggaa acgcggagtg agtttttccg tgctgtgtag gggctaacaa tggacaccca 61 gaaggacgtt caacctccaa agcagcaacc aatgatatat atctgtggag agtgtcacac 121 agaaaatgaa ataaaatcta gggatccaat cagatgcaga gaatgtggat acagaataat 181 gtacaagaaa aggactaaaa gattggtcgt ttttgatgct cgatgaatgc tgggaattca 241 gaggaatgtc ttcacttata cttggatttg ctctcttccc atttctgatt gttgtatagc 301 tttcgatttt gcttacagta gttccccctt atcttcggga g // LOCUS HSHRPL4 1382 bp RNA PRI 25-OCT-1994 DEFINITION H.sapiens HRPL4 mRNA. ACCESSION X73974 NID g560475 KEYWORDS ribosomal protein L4. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1382) AUTHORS Bagni,C. TITLE Direct Submission JOURNAL Submitted (08-JUL-1993) C. Bagni, Dept of Biology, University of Rome 'Tor Vergata', Via della Ricerca Scientifica, 00133 Rome, ITALY REMARK revised by [3] MAT REFERENCE 2 (bases 1 to 1382) AUTHORS Bagni,C., Mariottini,P., Annesi,F. and Amaldi,F. TITLE Human ribosomal protein L4: cloning and sequencing of the cDNA and primary structure of the protein JOURNAL Biochim. Biophys. Acta 1216 (3), 475-478 (1993) MEDLINE 94092742 REFERENCE 3 (bases 1 to 1382) AUTHORS Bagni,C. TITLE Direct Submission JOURNAL Submitted (25-OCT-1994) C. Bagni, Dept of Biology, University of Rome 'Tor Vergata', Via della Ricerca Scientifica, 00133 Rome, ITALY FEATURES Location/Qualifiers source 1..1382 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /clone="pHL4" gene 16..1296 /gene="HRPL4" CDS 16..1296 /gene="HRPL4" /codon_start=1 /product="ribosomal protein L4" /db_xref="PID:g560476" /db_xref="SWISS-PROT:P36578" /translation="MAVARPLISVYSEKGESSGKNVTLPAVFKAPIRPDMVNFVHTNL RKNNRQPYAVSELAGHQTRAESWGTGRAVARIPRVRGGGTHRSGQGAFGNMCRGGRMF APTKTWRRWHRRVNTTQKRYAICSALAASALPALVMSKGHRIEEFPELPLVVEDKVEG YKKTKEAVLLLKKLKAWNDIKKVYASQRMRAGKGKMRNRRIQRRGPCIIYNEDNGIIK AFRNIPGITLLNVSKLNILKLAPGGHVGRFCIWTESAFRKLDELYGTWRKAASLKSNY NLPMHKMINTDLSRILKSPEIQRALRAPRKKIHRRVLKKNPLKNLRIMLKLNPYAKTM RRNTILRQARNHKLRVDKAAAAAAALQAKSDEKAAVAGKKPVVGKKGKKAAVGVKKQK KPLVGKKAAATKKPAPEKKPAEKKPTTEEKKPAA" BASE COUNT 411 a 321 c 341 g 309 t ORIGIN 1 tctcctctct ccgccatggc tgttgctcgc ccactgatat ccgtgtactc cgaaaagggg 61 gagtcatctg gcaaaaatgt cactttgcct gctgtcttca aggctcctat tcgaccagat 121 atggtgaact ttgttcacac caacttgcgc aaaaacaaca gacagcccta tgctgtcagt 181 gaattagcag gtcatcagac tagagctgag tcttggggta ctggcagagc tgtggctcga 241 attcccagag ttcgaggtgg tgggactcac cgctctggcc agggtgcttt tggaaacatg 301 tgtcgtggag gccgaatgtt tgcaccaacc aaaacctggc gccgttggca tcgtagagtg 361 aacacaaccc aaaaacgata cgccatctgt tctgccctgg ctgcctcagc cctaccagca 421 ctggtcatgt ctaaaggtca tcgtattgag gagttccctg aacttccttt ggtagttgaa 481 gataaagttg aaggctacaa gaagaccaag gaagctgttt tgctccttaa gaaacttaaa 541 gcctggaatg atatcaaaaa ggtctatgcc tctcagcgaa tgagagctgg caaaggcaaa 601 atgagaaacc gtcgtatcca gcgcaggggc ccgtgcatca tctataatga ggataatggt 661 atcatcaagg ccttcagaaa catccctgga attactctgc ttaatgtaag caagctgaac 721 attttgaagc ttgctcctgg tgggcatgtg ggacgtttct gcatttggac tgaaagtgct 781 ttccggaagt tagatgaatt gtacggcact tggcgtaaag ccgcttccct caagagtaac 841 tacaatcttc ccatgcacaa gatgattaat acagatctta gcagaatctt gaaaagccca 901 gagatccaaa gagcccttcg agcaccacgc aagaagatcc atcgcagagt cctaaagaag 961 aacccactga aaaacttgag aatcatgttg aagctaaacc catatgcaaa gaccatgcgc 1021 cggaacacca ttcttcgcca ggccaggaat cacaagctcc gggtggataa ggcagctgct 1081 gcagcagcgg cactacaagc caaatcagat gagaaggcgg cggttgcagg caagaagcct 1141 gtggtaggta agaaaggaaa gaaggctgct gttggtgtta agaagcagaa gaagcctctg 1201 gtgggaaaaa aggcagcagc taccaagaaa ccagcccctg aaaagaagcc tgcagagaag 1261 aaacctacta cagaggagaa gaagcctgct gcataaactc ttaaatttga ttattccata 1321 aaggtcaaat cattttggac agcttctttt gaataaagac ctgttataca ggcagtgaga 1381 aa // LOCUS HSHRPTPU 4998 bp RNA PRI 23-NOV-1992 DEFINITION H.sapiens hR-PTPu gene for protein tyrosine phosphatase. ACCESSION X58288 NID g32455 KEYWORDS hR-PTPu gene; protein tyrosine phosphatase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4998) AUTHORS Gebbink,M.F.B.G. TITLE Direct Submission JOURNAL Submitted (12-JUL-1991) M.F.B.G. Gebbink, Dutch Center Inst., Plesmanlaan 121, 1066 CX Amsterdam, THE NETHERLANDS REFERENCE 2 (bases 1 to 4998) AUTHORS Gebbink,M.F., van Etten,I., Hateboer,G., Suijkerbuijk,R., Beijersbergen,R.L., Geurts van Kessel,A. and Moolenaar,W.H. TITLE Cloning, expression and chromosomal localization of a new putative receptor-like protein tyrosine phosphatase JOURNAL FEBS Lett. 290 (1-2), 123-130 (1991) MEDLINE 92008644 COMMENT See X58287, X58289 for related sequences. FEATURES Location/Qualifiers source 1..4998 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="mc10, mc12, helapcdc" /chromosome="18" /map="18pter-q11" mRNA 1..4998 /gene="hR-PTPu" /evidence=experimental gene 1..4998 /gene="hR-PTPu" sig_peptide 1..60 /gene="hR-PTPu" CDS 1..4359 /gene="hR-PTPu" /EC_number="3.1.3.48" /codon_start=1 /product="protein-tyrosine phosphatase" /db_xref="PID:g32456" /db_xref="SWISS-PROT:P28827" /translation="MRTLGTCLATLAGLLLTAAGETFSGGCLFDEPYSTCGYSQSEGD DFNWEQVNTLTKPTSDPWMPSGSLMLVNASGRPEGQRAHLLLPQLKENDTHCIDFHYF VSSKSNSPPGLLNVYVKVNNGPLGNPIWNISGDPTRTWNRAELAISTFWPNFYQVIFE VITSGHQGYLAIDEVKVLGHPCTRTPHFLRIQNVEVNAGQFATFQCSAIGRTVAGDRL WLQGIDVRDAPLKEIKVTSSRRFIASFNVVNTTKRDAGKYRCMIRTEGGVGISNYAEL VVKEPPVPIAPPQLASVGATYLWIQLNANSINGDGPIVAREVEYCTASGSWNDRQPVD STSYKIGHLDPDTEYEISVLLTRPGEGGTGSPGPALRTRTKCADPMRGPRKLEVVEVK SRQITIRWEPFGYNVTRCHSYNLTVHYCYQVGGQEQVREEVSWDTENSHPQHTITNLS PYTNVSVKLILMNPEGRKESQELIVQTDEDLPGAVPTESIQGSTFEEKIFLQWREPTQ TYGVITLYEITYKAVSSFDPEIDLSNQSGRVSKLGNETHFLFFGLYPGTTYSFTIRAS TAKGFGPPATNQFTTKISAPSMPAYELETPLNQTDNTVTVMLKPAHSRGAPVSVYQIV VEEERPRRTKKTTEILKCYPVPIHFQNASLLNSQYYFAAEFPADSLQAAQPFTIGDNK TYNGYWNTPLLPYKSYRIYFQAASRANGETKIDCVQVATKGAATPKPVPEPEKQTDHT VKIAGVIAGILLFVIIFLGVVLVMKKRKLAKKRKETMSSTRQEMTVMVNSMDKSYAEQ GTNCDEAFSFMDTHNLNGRSVSSPSSFTMKTNTLSTSVPNSYYPDETHTMASDTSSLV QSHTYKKREPADVPYQTGQLHPAIRVADLLQHITQMKCAEGYGFKEEYESFFEGQSAP WDSAKKDENRMKNRYGNIIAYDHSRVRLQTIEGDTNSDYINGNYIDGYHRPNHYIATQ GPMQETIYDFWRMVWHENTASIIMVTNLVEVGRVKCCKYWPDDTEIYKDIKVTLIETE LLAEYVIRTFAVEKRGVHEIREIRQFHFTGWPDHGVPYHATGLLGFVRQVKSKSPPSA GPLVVHCSAGAGRTGCFIVIDIMLDMAEREGVVDIYNCVRELRSRRVNMVQTEEQYVF IHDAILEACLCGDTSVPASQVRSLYYDMNKLDPQTNSSQIKEEFRTLNMVTPTLRVED CSIALLPRNHEKNRCMDILPPDRCLPFLITIDGESSNYINAALMDSYKQPSAFIVTQH PLPNTVKDFWRLVLDYHCTSVVMLNDVDPAQLCPQYWPENGVHRHGPIQVEFVSADLE EDIISRIFRIYNAARPQDGYRMVQQFQFLGWPMYRDTPVSKRSFLKLIRQVDKWQEEY NGGEGPTVVHCLNGGGRSGTFCAISIVCEMLRHQRTVDVFHAVKTLRNNKPNMVDLLD QYKFCYEVALEYLNSG" mat_peptide 61..4356 /gene="hR-PTPu" /EC_number="3.1.3.48" /product="protein-tyrosine phosphatase" BASE COUNT 1416 a 1189 c 1213 g 1176 t 4 others ORIGIN 1 atgaggacac ttgggacttg cctggcgact ttggccggac ttttgctaac tgcggcgggc 61 gagacgttct caggtggctg cctctttgat gagccgtata gcacatgtgg atatagtcaa 121 tctgaaggtg atgacttcaa ttgggagcaa gtgaacacct tgactaaacc gacttctgat 181 ccatggatgc catcaggttc tctcatgctg gtgaatgcct ctgggagacc tgaggggcag 241 agagcccacc tgctcttacc ccaacttaag gaaaatgaca cccactgcat cgattttcac 301 tattttgtgt ccagcaagag taattctcct ccggggttac tcaatgtcta cgtgaaggtc 361 aataacgggc cactggggaa tcctatctgg aatatatctg gagacccaac acgtacatgg 421 aacagggcag aactggccat tagtactttc tggcctaact tttatcaggt gatttttgaa 481 gtgataactt ctggacatca aggctatctc gctatcgatg aggtgaaggt gttaggacat 541 ccatgtacca ggactcctca cttcctgcgg attcagaatg tggaagttaa tgctggccag 601 tttgctacct tccagtgcag tgccatcggc aggaccgtgg caggagacag gctctggtta 661 cagggcattg atgtgcgaga tgctcctctg aaggaaatca aggtgaccag ctcccgacgc 721 ttcattgctt catttaatgt tgtgaatacc accaaacgag atgctggaaa gtaccgctgc 781 atgattcgca ctgaaggagg tgttggaata tcaaactatg cagagttggt agttaaagaa 841 ccacccgttc ctattgcccc acctcagctc gcctctgtag gagccaccta cctgtggata 901 cagctcaacg ccaactccat caatggggat gggcccattg tggcccgaga ggtggagtac 961 tgcacggcca gtgggagctg gaatgaccgg cagccagtcg attccacgag ctataaaatt 1021 ggacaccttg acccagatac agaatatgag attagtgtgc tcctgaccag gccaggggag 1081 ggtggcactg gctctcctgg tccagctctc aggacaagaa caaagtgtgc tgatcccatg 1141 cgaggcccaa gaaaactaga agtagtggag gtcaaatctc ggcaaatcac tatccgctgg 1201 gagccatttg gatataatgt aactcgttgc cacagttata atctcactgt ccactactgt 1261 taccaagttg gaggacaaga acaagtgcga gaagaagtaa gctgggatac agaaaattca 1321 caccctcaac acacgatcac taacctgtca ccatacacca atgtcagtgt gaaactgatc 1381 ctcatgaacc cagagggccg gaaggaaagc caagaactca tagtgcagac agatgaagac 1441 ctcccaggtg ctgttcccac tgaatccata caaggaagta cctttgaaga gaagatattt 1501 cttcagtgga gagaaccaac tcaaacatat ggtgtaatca ctttatatga gatcacctac 1561 aaagcagtca gttcctttga cccagaaata gatttatcca atcagagtgg aagagtttca 1621 aagctgggaa atgaaaccca ttttctgttt tttggactgt atccggggac cacatactcc 1681 tttaccatcc gagctagcac agctaagggt tttgggcctc cagcaacaaa ccagttcacc 1741 accaaaatat cagcaccctc tatgccagct tatgaacttg agacaccttt gaatcaaact 1801 gacaataccg tgacagtcat gctgaaacct gcccacagca gaggagcacc tgtcagtgtc 1861 tatcaaatag ttgttgagga agaacgtcct cgaagaacta aaaagacgac agaaatctta 1921 aagtgctacc cagtgccaat tcacttccag aatgcttctc tgctgaactc acagtactac 1981 tttgctgcag aatttcctgc agacagcctc caagctgcgc agccttttac aattggtgat 2041 aataagacat ataatggata ctggaacact ccccttctcc cctataaaag ctacagaatt 2101 tatttccaag ctgctagtag agccaatggg gaaaccaaaa tagactgtgt ccaagtggcc 2161 acaaaaggag ctgccactcc gaaaccagtc ccagaacccg agaaacagac agaccataca 2221 gttaaaattg ctggagtcat cgcgggcatc ttgctgttcg tgattatatt tcttggagtt 2281 gtgttggtaa tgaagaaaag gaaactggcc aagaagcgga aagagaccat gagcagcacc 2341 cgacaggaga tgactgtgat ggtgaactca atggacaaga gctatgctga gcagggcaca 2401 aactgcgacg aggctttctc attcatggac acgcacaatc tgaatgggag atctgtgtct 2461 tcaccatcgt ccttcacaat gaaaacaaat acactgagca catcggtgcc taattcctat 2521 tacccagatg aaacccacac aatggccagc gataccagca gcctggtgca gtcccatact 2581 tacaagaagc gagagccggc cgacgtgccc tatcagactg ggcagctcca ccccgccatc 2641 cgggtggcag acctccttca gcacatcaca cagatgaagt gtgcggaggg ctacggcttc 2701 aaggaggaat acgagagctt ctttgaaggg cagtctgcac catgggactc ggctaagaaa 2761 gatgagaaca gaatgaagaa cagatacggg aatatcattg catacgatca ttcccgagtg 2821 aggctgcaga caatagaagg agacacaaac tcagactata tcaatggcaa ttatatcgat 2881 ggttatcatc gacccaatca ttacattgct acccaagggc caatgcagga aaccatctat 2941 gacttctgga ggatggtgtg gcacgaaaac actgcaagta tcatcatggt gaccaatctt 3001 gtggaagtgg gaagggtcaa atgctgcaaa tactggccag atgacacaga gatatataaa 3061 gacattaaag ttaccctaat agaaacagaa ctactggcag aatatgtgat aagaacattt 3121 gctgttgaaa agagaggtgt gcatgaaatc cgagagatca gacagtttca cttcactggc 3181 tggccggatc atggggtccc ctaccatgcc accggcctgc tgggattcgt gcggcaagtc 3241 aagtccaaga gcccgcccag tgcaggccca ctggtggtgc actgcagtgc tggtgcaggg 3301 aggactggct gtttcatcgt cattgatatc atgttggaca tggccgaaag ggaaggggtc 3361 gtagacatct acaactgcgt cagggagctg cggtcacgga gggtgaacat ggtgcaaaca 3421 gaggagcagt atgtgtttat ccacgatgcg atcctggaag cctgtctttg tggggacacc 3481 tctgtgcctg cttcccaagt taggtctctg tattatgaca tgaacaaact ggatccacag 3541 acaaactcaa gccagattaa agaggaattc cggacgctaa acatggtgac accaacgctg 3601 cgagtagagg actgcagcat cgcactgttg ccccggaacc atgagaaaaa ccggtgcatg 3661 gacatcctgc ccccagaccg ctgcctgccc ttcctcatca ccatcgatgg ggagagcagc 3721 aactacatca atgctgccct catggacagc tataaacagc cttcagcttt tatagtcacc 3781 cagcatcctt tgccaaacac agtgaaagac ttttggagac tggtcctgga ttatcactgc 3841 acatccgtag ttatgctaaa tgatgtggat cctgcccagt tgtgtccaca gtactggcca 3901 gaaaacggag tacacagaca cggccccatc caggtggaat ttgtctctgc tgacctggaa 3961 gaggacatca tcagcaggat attccgcatt tacaatgccg ccagacccca agatggatat 4021 cggatggtgc agcaattcca gttcctgggc tggccgatgt acagggacac accagtgtct 4081 aagcgctcct tcttgaagct cattcgccag gtggacaagt ggcaagagga atacaatggc 4141 ggggaaggcc cgaccgttgt gcactgcttg aacgggggag gccgcagtgg gacgttctgc 4201 gccatcagca tcgtatgtga gatgctccgg caccagagaa ccgtggatgt ctttcacgct 4261 gtgaagacac tgaggaacaa caagcccaac atggtcgacc tcctggatca gtacaagttc 4321 tgctacgagg tggccctgga atacttgaat tctggctgat ggtgtaaaca gctctgcaaa 4381 caatcccttt cataccacaa agccaagacg ttccatggta tttgtgcaaa agagatgaag 4441 acttctcaat atgcttattt tgctttgact aattggctct ttttaagagc caagaaagtg 4501 tttctaaaat tgcttgcact gcccaatccc agtaatgctg ctgcctgaca gaaacacaca 4561 cacagccaca gttgccaaat ncccgtactc cttgccacgg ttctagagca gcgtagacag 4621 ctggtaaact gaagagcaca actatattct tatgaaggaa tttgtacctt tggggtatta 4681 ttttgtggcc cgtgaccctc gttattgtta cagctgagtg tatgtttttg ttctgtggag 4741 aatgctatct ggcattatgg taatatatta ttttaggtaa tatttgtact ttaacatgtt 4801 gcataatata tgcttatgta gctttccagg actaacagat aaatgtgtaa taacaaagat 4861 atgttgtatg agtngtcgtt tctgtcagat ttgtattgtt tccaagggaa aannttgggg 4921 gaggactcag ttcacaaaat gcaaaactca acgatcagat tcacggaccc agagcttttc 4981 catgtgttta tattgtaa // LOCUS HSHRSR 1953 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for histidyl-tRNA synthetase (HRS). ACCESSION X05345 NID g32457 KEYWORDS histidyl-tRNA synthetase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1953) AUTHORS Tsui,F.W. and Siminovitch,L. TITLE Isolation, structure and expression of mammalian genes for histidyl-tRNA synthetase JOURNAL Nucleic Acids Res. 15 (8), 3349-3367 (1987) MEDLINE 87203366 FEATURES Location/Qualifiers source 1..1953 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 78..1604 /note="histidyl-tRNA synthetase (AA 1-508)" /codon_start=1 /db_xref="PID:g32458" /db_xref="SWISS-PROT:P12081" /translation="MAERAPLEELVKLQGERVRGLKQQKASAELIEEEVAKLLKLKAQ LGPDESKQKFVLKTPKGTRDYSPRQMAVREKVFDVIIRCFKRHGAEVIDTPVFELKET LMGKYGEDSKLIYDLKDQGGELLSLRYDLTVPFARYLAMNKLTNIKRYHIAKVYRRDN PAMTGGRYPNSITVDFDIAGQFDPMNPDAESLKIMCEILSSLQIGNFLVKVNDRRILD GMFAVCGVPDSKFRTICSSVDKLDKVSWEEVKNEMVGEKGLAPEVADRIGDYVQQHGG VSLVEQLVQDPKLSQNKQALEGLGDLKLLFEYLTLFGIDDKISFDLSLARGLDYYTGV IYEAVLLQTPAQEGEEPWCGQCGCWRRYDGLVGMFDPQRRKVAMCGAQHWGGRIFSIV EQRLEALEEKIRTTETQVLVASAQKKLARGKTKACLRLWDAGIKAELLYKKNPKLLNQ LQYCEEAGIPLVAIIGEQELKDGVIKLRSVTSREEVDVRREELVEEIKRRTGQPLCIC " BASE COUNT 515 a 448 c 570 g 420 t ORIGIN 1 agccggaagt catccttgct gaggctgggg caaccaccgc aggtcgagac agcaggcggc 61 tcaagtggac agccgggatg gcagagcgtg cgccgctgga ggagctggtg aaacttcagg 121 gagagcgcgt gcgaggcctc aagcagcaga aggccagcgc cgagctgatc gaggaggagg 181 tggcgaaact cctgaaactg aaggcacagc tgggtcctga tgaaagcaaa cagaaatttg 241 tgctcaaaac ccccaagggc acaagagact atagtccccg gcagatggca gttcgcgaga 301 aggtgtttga cgtaatcatc cgttgcttca agcgccacgg tgcagaagtc attgatacac 361 ctgtatttga actaaaggaa acactgatgg gaaagtatgg ggaagactcc aagcttatct 421 atgacctgaa ggatcagggc ggggagctcc tgtcccttcg ctatgacctc actgttcctt 481 ttgctcggta tttggcaatg aataaactga ccaacattaa acgctaccac atagcaaagg 541 tatatcggcg ggataaccca gccatgaccg gaggccgata tccgaattct atcactgtgg 601 attttgacat cgctggccag tttgatccca tgaatcctga tgcagagtcc ctgaagatca 661 tgtgcgagat cctgagttca cttcagatag gcaacttcct ggtcaaggta aatgatcggc 721 gcatcctaga tggaatgttt gctgtctgtg gtgttcctga tagcaagttc cgtaccatct 781 gctcctcagt ggacaaacta gataaggtgt cctgggagga agtaaagaat gagatggtgg 841 gagagaaggg ccttgcacca gaagtggctg atcgcattgg ggactatgtc cagcaacatg 901 gtggggtttc cctggtggaa caactggtcc aggatcctaa actatcccaa aacaagcagg 961 ccttggaggg cttgggagac ctgaagttgc tctttgagta cctgacccta tttggcattg 1021 atgacaaaat ctcctttgac ctgagccttg ctcgagggct ggattactac actggggtga 1081 tctatgaggc agtgctgcta cagaccccag cccaggaggg ggaagagccc tggtgtgggc 1141 agtgtggctg ctggaggcgc tatgatgggc tagtgggcat gttcgacccc caaaggcgca 1201 aggtcgccat gtgtggggct cagcattggg gtggacggat tttctccatc gtggaacaga 1261 gactagaggc tttggaggag aagatacgga ccacggagac acaggtgctt gtggcatctg 1321 cacagaaaaa gctggctaga ggaaagacta aagcttgtct cagactgtgg gatgctggga 1381 tcaaggctga gctgctgtac aagaagaacc caaagctact gaaccagtta cagtactgtg 1441 aggaggcagg catcccactg gtggctatca tcggcgagca ggaactcaag gatggggtca 1501 tcaagctccg ttcagtgacg agcagggaag aggtggatgt ccgaagagaa gagcttgtgg 1561 aggaaatcaa aaggagaaca ggccagcccc tctgcatctg ctgaactgaa caaactatca 1621 gaggaaagga agtgggactg gcactatttg aggttaagac aaactgcata tgtacttcaa 1681 ttgctttgca cttttccgtt tcagcggaag acctgaagag tggtcagaac agagcctttg 1741 atttttatta tggttatttt attgattatt actggcaaaa acggccaggt acaacacctt 1801 tttcatacaa ggcccaggag gcttagtcca gtctgtgctc ctgggctaca aggacccagc 1861 ctgagatggt cccatctgca gggcccgcac cagttggagc agatacctcc ccaccaccaa 1921 ttgccaaagg tccaataaaa tgcctcaacc acg // LOCUS HSHRYK 2563 bp RNA PRI 20-MAR-1996 DEFINITION H.sapiens mRNA for h-ryk. ACCESSION X69970 NID g32461 KEYWORDS H-RYK gene; transmembrane protein; Tyrosine kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2563) AUTHORS Tamagnone,L. TITLE Direct Submission JOURNAL Submitted (17-DEC-1992) L. Tamagnone, University of Helsinki, Dept of Virology & Pathology, Cancer Biology Laboratory, Haartmaninkatu 3, 00290 Helsinki 29, FINLAND REFERENCE 2 (bases 1 to 2563) AUTHORS Tamagnone,L., Partanen,J., Armstrong,E., Lasota,J., Ohgami,K., Tazunoki,T., LaForgia,S., Huebner,K. and Alitalo,K. TITLE The human ryk cDNA sequence predicts a protein containing two putative transmembrane segments and a tyrosine kinase catalytic domain JOURNAL Oncogene 8 (7), 2009-2014 (1993) MEDLINE 93288416 COMMENT Related sequences: M59373 & M59374. FEATURES Location/Qualifiers source 1..2563 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /tissue_type="blood" /cell_type="leukaemia" /cell_line="HEL" /clone_lib="cDNA in lambda GT-11" /clone="E4" /chromosome="3q,12-25" gene 91..1914 /gene="h-ryk" CDS 91..1914 /gene="h-ryk" /codon_start=1 /product="h-ryk" /db_xref="PID:g32462" /db_xref="SWISS-PROT:Q04696" /translation="MRGAARLGRPGRSCLPGARGLRAPPPPPLLLLLALLPLLPAPGA AAAPAPRPPELQSASAGPSVSLYLSEDEVRRLIGLDAELYYVRNDLISHYALSFNLLV PSETNFLHFTWHAKSKVEYKLGFQVDNVLAMDMPQVNISVQGEVPRTLSVFRVELSCT GKVDSEVMILMQLNLTVNSSKNFTVLNFKRRKMCYKKLEEVKTSALDKNTSRTIYDPV HAAPTTSTRVFYISVGVCCAVIFLVAIILAVLHLHSMKRIELDDSISASSSSQGLSQP STQTTQYLRADTPNNATPITSYPTLRIEKNDLRSVTLLEAKGEVKDIAISRERITLKD VLQEGTFGRIFHGILIDEKDPNKEKQAFVKTVKDQASEIQVTMMLTESCKLRGLHHRN LLPITHVCIEEGEKPMVILPYMNWGNLKLFLRQCKLVEANNPQAISQHDLVHMPIQIA CGMSYLARREVIHKDLAARNCVIDDTLQVKITDNALSRDLFPMDYHCLGDNENRPVRW MALESLVNNEFSSASDVWAFGVTLWELMTLGQTPYVDIDPFEMAAYLKDGYRIAQPIN CPDELFAVMACCWALDPEERPKFQQLVQCLTEFHAALGAYV" BASE COUNT 702 a 557 c 613 g 691 t ORIGIN 1 cggctcgggg ctgtgagcgg ctcggggccg ggggtgggcg gcggtgcggc gggcggccga 61 cgctcctctt cggcggcggc ggcggcggcc atgcgtgggg cggcgcggct ggggcggccg 121 ggccggagtt gcctcccggg ggcccgcggc ctgagggccc cgccgccgcc gccgctgctg 181 cttctgcttg cgctgttgcc gctgctgccc gcgcctggcg ctgccgccgc ccccgccccg 241 cggcccccgg agctgcagtc ggcttccgcg gggcccagcg tgagtctcta cctgagcgag 301 gacgaggtgc gccggctgat cggtcttgat gcagaacttt attatgtgag aaatgacctt 361 attagtcact acgctctatc ctttaatctg ttagtaccca gtgagacaaa tttcctgcac 421 ttcacctggc atgcgaagtc caaggttgaa tataagctgg gattccaagt ggacaatgtt 481 ttggcaatgg atatgcccca ggtcaacatt tctgttcagg gggaagttcc acgcacttta 541 tcagtgtttc gggtagagct ttcctgtact ggcaaagtag attctgaagt tatgatacta 601 atgcagctca acttgacagt aaattcttca aaaaatttta ccgtcttaaa ttttaaacga 661 aggaaaatgt gctacaaaaa acttgaagaa gtaaaaactt cagccttgga caaaaacact 721 agcagaacta tttatgatcc tgtacatgca gctccaacca cttctacgcg tgtgttttat 781 attagtgtag gggtttgttg tgcagtaata tttctcgtag caataatatt agctgttttg 841 caccttcata gtatgaaaag gattgaactg gatgacagca ttagtgccag cagtagttcc 901 caagggctgt ctcagccatc cacccagacg actcagtatc tgagagcaga cacgcccaac 961 aatgcaactc ctatcaccag ttatcctacc ttgcggatag agaagaacga cttgagaagt 1021 gtcactcttt tggaggccaa aggcgaggtg aaggatatag caatatccag agagaggata 1081 actctaaaag atgtactcca agaaggtact tttgggcgta ttttccatgg gattttaata 1141 gatgaaaaag atccaaataa agaaaaacaa gcatttgtca aaacagttaa agatcaagct 1201 tctgaaattc aggtgacaat gatgctcact gaaagttgta agctgcgagg tcttcatcac 1261 agaaatcttc ttcctattac tcatgtgtgt atagaagaag gagaaaagcc catggtgata 1321 ttgccttaca tgaattgggg gaatcttaaa ttgtttttac gacagtgcaa gttagtagag 1381 gccaataatc cacaggcaat ttctcagcat gacctggtac acatgcctat tcagattgcc 1441 tgtggaatga gctacctggc cagaagggaa gtcatccaca aagacctggc tgccaggaac 1501 tgtgtcattg atgacacact tcaagttaag atcacagaca atgccctctc cagagacttg 1561 ttccccatgg actatcactg tctgggggac aatgaaaaca ggccagttcg ttggatggct 1621 cttgaaagtc tggttaataa cgagttctct agcgctagtg atgtgtgggc ctttggagtg 1681 acgctgtggg aactcatgac tctgggccag actccctacg tggacattga ccccttcgag 1741 atggccgcat acctgaaaga tggttaccga atagcccagc caatcaactg tcctgatgaa 1801 ttatttgctg tgatggcctg ttgctgggcc ttagatccag aggagaggcc caagtttcag 1861 cagctggtac agtgcctaac agagtttcat gcagccctgg gggcctacgt ctgactcctc 1921 tccaatccca caccatcagg aagaaggtgc ctgtcggggc tcacttgaag cctgtcaggg 1981 atgctttgta tctaacacaa cgccaacaga agcacatttg tcttccagaa caccgtgcct 2041 tagaaatgct ttagaatctg aactttttaa gacagactta ataatgtggc atattttcta 2101 gatatcactt ttattaggtt gaactgaaag gctttttgta aattttttgg ccaaaatttt 2161 ttaaaacata cttactttgg actaggggta cattcttaca aaataaataa acagttttta 2221 aaattgttta gacacagata ttgttaatta gctatcttag tgccaactgc ttttattttt 2281 ttacttcatc aaggtgatgt aagtgactca cctttaaagt ttttttagtg ttatttttta 2341 tcactactct gggaaatggt ttgtcttcaa gatgcaatac ttttcttagt aaaggaaaaa 2401 cagcataaaa agatacctgg tctgccttgt acaagaaaag gcaatattag aggaagaaaa 2461 tttaaagaaa agctagagga aaaaaaaaaa aattttttaa aaaatactta ttagaagcaa 2521 actgcccttg catggaaaac tgtttatttt tttcagtgaa aag // LOCUS HSHSJ1MR 2895 bp RNA PRI 16-NOV-1993 DEFINITION H.sapiens HSJ1 mRNA. ACCESSION X63368 S37374 S37375 NID g32468 KEYWORDS dnaJ protein homologues; heat shock protein homologue; HSJ1 gene; HSJ1a protein; HSJ1b protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2895) AUTHORS Cheetham,M.E. TITLE Direct Submission JOURNAL Submitted (27-NOV-1991) M.E. Cheetham, Dept of Neuroscience, Inst of Psychiatry, De crespigny Park, Denmark Hill, London, SE5 8AF, UK REFERENCE 2 (bases 1 to 2895) AUTHORS Cheetham,M.E., Brion,J.P. and Anderton,B.H. TITLE Human homologues of the bacterial heat-shock protein DnaJ are preferentially expressed in neurons JOURNAL Biochem. J. 284 (Pt 2), 469-476 (1992) MEDLINE 92287055 FEATURES Location/Qualifiers source 1..2895 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="brain (Alzheimer) frontal cortex" /cell_type="neurone" /clone_lib="cDNA in lambda gt11/Zap" gene 26..1978 /gene="HSJ1" CDS 26..1081 /gene="HSJ1" /codon_start=1 /product="HSJ1b" /db_xref="PID:g32469" /db_xref="SWISS-PROT:P25686" /translation="MASYYEILDVPRSASADDIKKAYRRKALQWHPDKNPDNKEFAEK KFKEVAEAYEVLSDKHKREIYDRYGREGLTGTGTGPSRAEAGSGGPGFTFTFRSPEEV FREFFGSGDPFAELFDDLGPFSELQNRGSRHSGPFFTFSSSFPGHSDFSSSSFSFSPG AGAFRSVSTSTTFVQGRRITTRRIMENGQERVEVEEDGQLKSVTINGVPDDLARGLEL SRREQQPSVTSRSGGTQVQQTPASCPLDSDLSEDEDLQLAMAYSLSEMEAAGKKPAGG REAQHRRQGRPRPSTKIQAWGGPRRVRGVKQPNAVHPQRRRPLAASSSEHRAQPDLIQ ILTGGSDSLWEEKRGVS" CDS join(26..848,1968..1978) /gene="HSJ1" /note="alternatively spliced" /codon_start=1 /product="HSJ1a" /db_xref="PID:g32470" /db_xref="SWISS-PROT:P25686" /translation="MASYYEILDVPRSASADDIKKAYRRKALQWHPDKNPDNKEFAEK KFKEVAEAYEVLSDKHKREIYDRYGREGLTGTGTGPSRAEAGSGGPGFTFTFRSPEEV FREFFGSGDPFAELFDDLGPFSELQNRGSRHSGPFFTFSSSFPGHSDFSSSSFSFSPG AGAFRSVSTSTTFVQGRRITTRRIMENGQERVEVEEDGQLKSVTINGVPDDLARGLEL SRREQQPSVTSRSGGTQVQQTPASCPLDSDLSEDEDLQLAMAYSLSEMEAAGKKPADV F" polyA_signal 2843..2848 BASE COUNT 544 a 827 c 905 g 619 t ORIGIN 1 cccgcctgac gactgaccag ttgccatggc atcctactac gagatcctag acgtgccgcg 61 aagtgcgtcc gctgatgaca tcaagaaggc gtatcggcgc aaggctctcc agtggcaccc 121 agacaaaaac ccagataata aagagtttgc tgagaagaaa tttaaggagg tggccgaggc 181 atatgaagtg ctgtctgaca agcacaagcg ggagatttac gaccgctatg gccgggaagg 241 gctgacaggg acaggaactg gcccatctcg ggcagaagct ggcagtggtg ggcctggctt 301 caccttcacc ttccgcagcc ccgaggaggt cttccgggaa ttctttggga gtggagaccc 361 ttttgcagag ctctttgatg acctgggccc cttctcagag cttcagaacc ggggttcccg 421 acactcaggc cccttcttta ccttctcttc ctccttccct gggcactccg atttctcctc 481 ctcatctttc tccttcagtc ctggggctgg tgcttttcgc tctgtttcta catctaccac 541 ctttgtccaa ggacgccgca tcaccacacg cagaatcatg gagaacgggc aggagcgggt 601 ggaagtggag gaggatgggc agctgaagtc agtcacaatc aatggtgtcc cagatgacct 661 ggcacgtggc ttggagctga gccgtcgcga gcagcagccg tcagtcactt ccaggtctgg 721 gggcactcag gtccagcaga cccctgcctc atgccccttg gacagcgacc tctctgagga 781 tgaggacctg cagctggcca tggcctacag cctgtcagag atggaggcag ctgggaagaa 841 acccgcaggt gggcgggagg cacagcaccg acggcagggg cgcccaaggc ccagcaccaa 901 gatccaggct tgggggggac ccaggagggt gcgaggggtg aagcaaccaa acgcagtcca 961 tccccagagg agaaggcctc tcgctgcctc atcctctgaa caccgggccc aacctgatct 1021 gatccagatc ttgactgggg ggtctgactc actgtgggaa gagaagaggg gagtatcctg 1081 agttgtagga actgctttcc aactccaagc tccctccaca agtttccctc cccaggcccc 1141 ccacacccca gtgtggactt gggatttgct gtgctcagcc cagggctgat aggtccctgg 1201 tgaagcccag ggtggggggt gtcagggcag tggaggggcc cgaggagcca ggttgcattt 1261 attggatggg gagctccaag gggcattagt ggtttgggct gggcttttgt gccctggtac 1321 tctgccacct gtgttgctga tggtgtcaag gaaggaggac ttggcctagg gttgtctgag 1381 ccggagccgg cagctccact ggagagcagt gcaggcagag tggagcctcc tgctctcctg 1441 gaccagctgc agacccccaa ccctggtttc tgtgccatgt tgcgctctga ccgtctctgt 1501 tgcttctctt ctggtgttgc ttctcctccc tcccattctc tctgcaactc ctgcgggcgc 1561 atcgcttgct ttcactgccg tctggctagg actcccttct tccttccttc cccgagaagg 1621 cctcaatgtg gcgaggaaga tgctggggcc ggtagggctg tgagatcttc tggggaggct 1681 agccgggtgg ggcgggagcc tctcagctgt ccagattcag aactggagcc cactcctcct 1741 ccctctcgtt gcctcagccc tgccctcacc ctcagactag gcagaggtga ggctggctca 1801 ccctgaagag gtgggatagg aggggactgc acccatactg cttccctacc acaaatcagg 1861 gctcagggag aggccatgcg gcagcccagg tctgcatgct gagccccatc ctccacagct 1921 tgccgctgac gctctctcct gtcaccccgc ccctgctctc tccccagatg tgttctgagc 1981 tggatgccgg gttccagaat cgctgcacag ttccaacagg acagcgcctt cccccatgcg 2041 ctgggagggg accctccatt tctccccctc acccatgctg agtgtagagc cggggcctgg 2101 gtggcgggtg ggggccgggt gggaggtggc agtagtctta gcctgtgcac tctcttcctt 2161 gggtgtttgg tgctggctcc tggggactac aaatcccaga gtgcggtgtg cccggcctca 2221 tttctgatag atcccgcttg ggggaggtgg tgtatggtta cggagctgtg catcttggga 2281 catgtagtag cccaggtctt gtcactcgct gtgagatggg gagattttgt cttttgattt 2341 atccctgtag ggctggcagg gttgtagatg aagggggaat gatctgagcc ttggttcccc 2401 tgacacgtct tgctagcccc agggttagag tgggcagggc agagccgcgc agcacctggg 2461 agcggtacct ttcccttggg cagcctgggg tcccaggaac aagccagggc gagtggcatg 2521 tctgcctgag cagggtgtgg ccccagaaag ctgaggagtg tgggctggca gagagcttcg 2581 agggcaaggc cacccgcggg ggcgtgtgtg tggtggggct tggcatgtga tggcagctcc 2641 agctccaggc atgccgctgc ttgtatggct ttctttggcc tctgaccctg ctgcccattc 2701 tttccaacat cacagatgaa ctgcctctcc tcctccctgc ctggggagcc cagtggccag 2761 ggagggagtg gtggagccag tcgctgtaac actgagcctc agagacgaac caaaaccagc 2821 tgggctgagc tcagatccag ggggaagaaa tgctggaagt caataaaact gagtttgaga 2881 aaaaaaaaaa aaaaa // LOCUS HSHSKER 1024 bp RNA PRI 19-NOV-1992 DEFINITION H.sapiens mRNA for high-sulphur keratin. ACCESSION X63755 NID g32471 KEYWORDS high sulphur keratin; keratin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1024) AUTHORS Drabent,B. TITLE Direct Submission JOURNAL Submitted (20-DEC-1991) B. Drabent, Abt. Molekularbiologie, Humboldallee 23, 3400 Goettingen, FRG REFERENCE 2 (bases 1 to 1024) AUTHORS Drabent,B. and Doenecke,D. TITLE Nucleotide sequence of a Human high-sulphur keratin cDNA JOURNAL Unpublished FEATURES Location/Qualifiers source 1..1024 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="testes" /clone_lib="lambda-gt11" /clone="I3" CDS 239..748 /codon_start=1 /product="high-sulpher keratin" /db_xref="PID:g32472" /translation="MGCCGCSGGCGSSCGGCDSSCGSCGSGCRGCGPSCCAPVCCCKP VCCCVPACSCSSCGKRGCGSCGGSKGGCGSCGCSQCSCCKPCCCSSGCGSSCCQCSCC KPYCSQCSCCKPCCSSSGRGSSCCQSSCCKPCCSSSGCGSSCCQSSCCKPCCSQSRCC VPVCYQCKI" BASE COUNT 200 a 315 c 271 g 238 t ORIGIN 1 gaattccaca aggaaatcat ctcaggagga agggcttata cttggatcca gaaaatatca 61 acatagccaa agaaaaacaa tcaagacata cctccaggag ctgtgtaaca gcaaccggaa 121 agagaaacaa tggtgtgttc ctatgtggga tataaagagc cggggctcag ggggctccac 181 acctgcacct ccttctcacc tgctcctcta cctgctccac cctcaatcca ccagaaccat 241 gggctgctgt ggctgctccg gaggctgtgg ctccagctgt ggaggctgtg actccagctg 301 tgggagctgt ggctctggct gcaggggctg tggccccagc tgctgtgcac ccgtctgctg 361 ctgcaagccc gtgtgctgct gtgttccagc ctgttcctgc tctagctgtg gcaagcgggg 421 ctgtggctcc tgtgggggct ccaagggagg ctgtggttct tgtggctgct cccagtgcag 481 ttgctgcaag ccctgctgtt gctcttcagg ctgtgggtca tcctgctgcc agtgcagctg 541 ctgcaagccc tactgctccc agtgcagctg ctgtaagccc tgttgctcct cctcgggtcg 601 tgggtcatcc tgctgccaat ccagctgctg caagccctgc tgctcatcct caggctgtgg 661 gtcatcctgc tgccagtcca gctgctgcaa gccctgctgc tcccagtcca gatgctgtgt 721 ccctgtgtgc taccagtgca agatctgagg ctctagtggg aaacctcagg tagctcccga 781 agatctgtgc tttccaacaa gtgactaccc ttgaagcaca tccccttctg gatctgaaaa 841 gagcccttgg ctcagggcgt ctttttccag cccctgagga aaaggaatga accactccct 901 gcccattccc tataagaata tcccaagacc caggcaattt tgcccctctt tcccacatgc 961 ccccatatgt ctgagccaaa ctgcactggg ggctgccctc atgccaagca agagcctgga 1021 attc // LOCUS HSHSP1 2130 bp RNA PRI 18-JAN-1993 DEFINITION H.sapiens h-Sp1 mRNA. ACCESSION X68194 NID g32473 KEYWORDS h-Sp1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2130) AUTHORS Isojima,S. TITLE Direct Submission JOURNAL Submitted (01-SEP-1992) S. Isojima, Dept of Obstetrics and Gynecology, Hyogo Medical college, 1-1 Mukogawa-cho, Nishinomiya, Hyogo-ken 663, JAPAN REFERENCE 2 (bases 1 to 2130) AUTHORS Isojima,S. JOURNAL Unpublished FEATURES Location/Qualifiers source 1..2130 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="testis" /cell_type="spermatozoa" gene 34..813 /gene="h-Sp1" CDS 34..813 /gene="h-Sp1" /codon_start=1 /db_xref="PID:g32474" /translation="MAPNIYLVRQRISRLGQRMSGFQINLNPLKEPLGFIKVLEWIAS IFAFATCGGFKGQTEIQVNCPPAVTENKTVTATFGYPFRLNEASFQPPPGVNICDVNW KDYVLIGDYSSSAQFYVTFAVFVFLYCIAALLLYVGYTSLYLDSRKLPMIDFVVTLVA TFLWLVSTSAWAKALTDIKIATGHNIIDELPPCKKKAVLCYFGSVTSMGSLNVSVIFG FLNMILWGGNAWFVYKETSLHSPSNTSAPHSQGGIPPPTGI" BASE COUNT 614 a 369 c 413 g 734 t ORIGIN 1 tgaaccgagg caagggggcg cggcgcacgc agtatggcgc ccaacatcta cttggttcgc 61 cagcggatca gtcgactcgg ccagaggatg tccggcttcc agatcaacct caacccgctc 121 aaggagccac tcggcttcat caaggtcctc gagtggattg cttctatctt tgcttttgcc 181 acctgtggag gttttaaggg ccaaacagaa attcaagtga attgtcctcc tgcagttact 241 gagaataaaa ctgttacagc tacttttggt tatccattca ggttgaatga ggcatcattt 301 cagccacctc caggtgtaaa catatgtgat gtaaattgga aagattacgt cctcataggc 361 gattactctt cttctgcaca attctatgtt acctttgcag tctttgtgtt cctgtactgc 421 attgctgccc ttctgcttta tgttggctac acgagtctgt atctggatag tcgtaaactt 481 cctatgatag actttgttgt tacacttgtt gccacttttt tgtggttggt gagcacttca 541 gcctgggcta aagctctgac agatattaaa atagctactg gtcacaatat tattgatgaa 601 cttccgcctt gtaagaagaa agcagtactg tgttactttg gctctgtgac cagtatggga 661 tccctaaatg tatctgtgat atttggcttt ctaaatatga tactctgggg aggaaatgct 721 tggtttgtgt acaaggagac cagcctacac agtccatcaa atacatctgc ccctcatagc 781 caaggaggta ttccacctcc taccggaata taattaaagg gagaaataca ctgtatgaag 841 tatatgttga tactatgaca tgttgccaac accttgagaa gcattatttg tttctaataa 901 aagtaatggc tttgtcaata tattggtggg tttaaaactt tgctgctttt ttacataaag 961 cctgtgcctt tcctagaaag ttaagatgta aatgtattct cacatgtaaa tttgaaagtt 1021 caggggtcta ttatgaaatg gattacacat tttaaatgaa cccataattt ttttcactaa 1081 agctgtttgc cctccaaagt gtttacacct aagcctaaca tgtatcgctc attcagaaaa 1141 ctgttatatt gtcataccat agtaggaaga aaaaccttta tttggaatat acactactgt 1201 aagtttgtac agatcatata cctaccacct gtctttgctt aaagagcctt gattacataa 1261 atatgtagga aaaaacatat tgagttcaaa atttatatct aacattgttt atgttatgat 1321 tttttttaat tgcaaagact aggtgtatat ttttttctgt ttttctaaat gacccgtggt 1381 acttaatagg tgtactaaaa ttgtgttggg agcagggatt tggaaatttc tgagagatgt 1441 gtagttaatt agtaattctg tttcatgaga tatgatctgt tatgctagtg gtttaatagg 1501 cttgctatgt aagtagaacg tggctcaact agatatctta tatgtatggg cattacctct 1561 tagtgatatt tgtttcctgt cctttgttgc tcatgctgtt taagtgcagg ctgagaccca 1621 gcctctttgt aagtacagta aaataatcca ccgtttttta cagaccctag tcaaagggtt 1681 aaaaaaatta agattgcttt ccatgtttga aatttaccat tgagagtcaa tgaagttgct 1741 attttgagtt tagcattgat attgtgaaaa taagtgcaat ttggatttca tgtttcttaa 1801 tattcattct tgtttcacaa atgaatgatt aaggaattat gcatcataaa ggaacctaag 1861 tgaggtatat gatgagtgta ttgtctttgc acacacatat aggtatattc tgaatacaag 1921 cttattcatt ttgcttccta atctttttgt tgtacaggga ttcaggtttc ttattcttac 1981 aacatgattg tttatatgtg aagcacatct tgctgttgcc ttatttttga tgcttttatt 2041 catgacaaga attgtcaata taagaatgta tatctttgcc gcaaccaatt taataaagga 2101 gttgaaagaa aaaaaaaaaa aaaaaaaaaa // LOCUS HSHSP10 309 bp RNA PRI 21-JUL-1995 DEFINITION H.sapiens mRNA for heat shock protein 10. ACCESSION X75821 NID g509780 KEYWORDS heat shock protein 10. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 309) AUTHORS Legname,G., Fossati,G., Gromo,G., Monzini,N., Marcucci,F. and Modena,D. TITLE Expression in Escherichia coli, purification and functional activity of recombinant human chaperonin 10 JOURNAL FEBS Lett. 361 (2-3), 211-214 (1995) MEDLINE 95212550 REFERENCE 2 (bases 1 to 309) AUTHORS Monzini,N. TITLE Direct Submission JOURNAL Submitted (14-OCT-1993) N. Monzini, Italfarmaco S.p.A., Via dei Lavoratori 54, 20092 Gnisello Balsamo, Milano, ITALY REMARK revised by author 12-FEB-94 and 12-APR-94 FEATURES Location/Qualifiers source 1..309 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="hepatoma" /cell_line="HEP-G2" gene 1..309 /gene="HU-HSP10" CDS 1..309 /gene="HU-HSP10" /codon_start=1 /product="heat shock protein 10" /db_xref="PID:g509781" /db_xref="SWISS-PROT:Q04984" /translation="MAGQAFRKFLPLFDRVLVERSAAETVTKGGIMLPEKSQGKVLQA TVVAVGSGSKGKGGEIQPVSVKVGDKVLLPEYGGTKVVLDDKDYFLFRDGDILGKYVD " BASE COUNT 99 a 48 c 81 g 81 t ORIGIN 1 atggcaggac aagcgtttag aaagtttctt ccactctttg accgagtatt ggttgaaagg 61 agtgctgctg aaactgtaac caaaggaggc attatgcttc cagaaaaatc tcaaggaaaa 121 gtattgcaag caacagtagt cgctgttgga tcgggttcta aaggaaaggg tggagagatt 181 caaccagtta gcgtgaaagt tggagataaa gttcttctcc cagaatatgg aggcaccaaa 241 gtagttctag atgacaagga ttatttccta tttagagatg gtgacattct tggaaagtac 301 gtagactga // LOCUS HSHST2 744 bp RNA PRI 07-APR-1992 DEFINITION H.sapiens hst-2 (FGF-6) mRNA. ACCESSION X63454 NID g32490 KEYWORDS angiogenic capacity; FGF-related; growth factor; hst-2 gene; transforming capacity. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 744) AUTHORS Terada,M. TITLE Direct Submission JOURNAL Submitted (09-DEC-1991) M. Terada, National Cancer Center Research Inst, 1-1 Tsukiji 5-chome, Chuo-ku, Tokyo 104, JAPAN REFERENCE 2 (bases 1 to 744) AUTHORS Iida,S., Yoshida,T., Naito,K., Sakamoto,H., Katoh,O., Hirohashi,S., Sato,T., Onda,M., Sugimura,T. and Terada,M. TITLE Human hst-2 (FGF-6) oncogene: cDNA cloning and characterization JOURNAL Oncogene 7 (2), 303-309 (1992) MEDLINE 92195660 FEATURES Location/Qualifiers source 1..744 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="cDNA in lambda gt10" /clone="R0.8" gene 75..671 /gene="hst-2 (FGF-6)" CDS 75..671 /gene="hst-2 (FGF-6)" /codon_start=1 /db_xref="PID:g32491" /db_xref="SWISS-PROT:P10767" /translation="MSRGAGRLQGTLWALVFLGILVGMVVPSPAGTRANNTLLDSRGW GTLLSRSRAGLAGEIAGVNWESGYLVGIKRQRRLYCNVGIGFHLQVLPDGRISGTHEE NPYSLLEISTVERGVVSLFGVRSALFVAMNSKGRLYATPSFQEECKFRETLLPNNYNA YESDLYQGTYIALSKYGRVKRGSKVSPIMTVTHFLPRI" BASE COUNT 173 a 199 c 219 g 153 t ORIGIN 1 tttagggcca ttaattctga ccacgtgcct gagaggcaag gtggatggcc ctgggacaga 61 aactgttcat cactatgtcc cggggagcag gacgtctgca gggcacgctg tgggctctcg 121 tcttcctagg catcctagtg ggcatggtgg tgccctcgcc tgcaggcacc cgtgccaaca 181 acacgctgct ggactcgagg ggctggggca ccctgctgtc caggtctcgc gcggggctag 241 ctggagagat tgccggggtg aactgggaaa gtggctattt ggtggggatc aagcggcagc 301 ggaggctcta ctgcaacgtg ggcatcggct ttcacctcca ggtgctcccc gacggccgga 361 tcagcgggac ccacgaggag aacccctaca gcctgctgga aatttccact gtggagcgag 421 gcgtggtgag tctctttgga gtgagaagtg ccctcttcgt tgccatgaac agtaaaggaa 481 gattgtacgc aacgcccagc ttccaagaag aatgcaagtt cagagaaacc ctcctgccca 541 acaattacaa tgcctacgag tcagacttgt accaagggac ctacattgcc ctgagcaaat 601 acggacgggt aaagcggggc agcaaggtgt ccccgatcat gactgtcact catttccttc 661 ccaggatcta aggacccaca aaagaaggct tacagattta aagcatcatc tgttcgattg 721 aaattttgca ccagcgaaga attc // LOCUS HSHT 1658 bp RNA PRI 31-MAR-1995 DEFINITION Human mRNA for truncated form of complement factor H. ACCESSION X07523 Y00716 NID g32492 KEYWORDS complement factor H. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1658) AUTHORS Day,A.J. TITLE Direct Submission JOURNAL Submitted (03-MAY-1988) Day A.J., Dept. of Biochemistry, University of Oxford, MRC Immunochemistry Unit, South Parks Road, Oxford OX1 3QU REFERENCE 2 (bases 1 to 1658) AUTHORS Ripoche,J., Day,A.J., Harris,T.J. and Sim,R.B. TITLE The complete amino acid sequence of human complement factor H JOURNAL Biochem. J. 249 (2), 593-602 (1988) MEDLINE 88134059 COMMENT Data kindly reviewed (06-JUN-1988) by Day A.J. FEATURES Location/Qualifiers source 1..1658 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="pat153/PvuII/8" /clone="B-38-1 tissue=liver" misc_feature 1..1408 /note="seq. identical to full-length factor H" misc_feature <1..73 /note="5'-UT region" CDS 74..1423 /codon_start=1 /product="complement factor H" /db_xref="PID:g758073" /translation="MRLLAKIICLMLWAICVAEDCNELPPRRNTEILTGSWSDQTYPE GTQAIYKCRPGYRSLGNVIMVCRKGEWVALNPLRKCQKRPCGHPGDTPFGTFTLTGGN VFEYGVKAVYTCNEGYQLLGEINYRECDTDGWTNDIPICEVVKCLPVTAPENGKIVSS AMEPDREYHFGQAVRFVCNSGYKIEGDEEMHCSDDGFWSKEKPKCVEISCKSPDVING SPISQKIIYKENERFQYKCNMGYEYSERGDAVCTESGWRPLPSCEEKSCDNPYIPNGD YSPLRIKHRTGDEITYQCRNGFYPATRGNTAKCTSTGWIPAPRCTLKPCDYPDIKHGG LYHENMRRPYFPVAVGKYYSYYCDEHFETPSGSYWDHIHCTQDGWSPAVPCLRKCYFP YLENGYNQNYGRKFVQGKSIDVACHPGYALPKAQTTVTCMENGWSPTPRCIRVSFTL" sig_peptide 74..127 /note="put. signal peptide" mat_peptide 128..1420 /note="translated region" misc_feature 1409..1420 /note="unique coding seq. not found in full-length factor H" misc_feature 1421..1423 /note="stop codon" misc_feature 1424..1658 /note="3'-UT region" misc_feature 1634..1639 /note="pot. polyA signal" BASE COUNT 513 a 296 c 383 g 466 t ORIGIN 1 aattcttgga agaggagaac tggacgttgt gaacagagtt agctggtaaa tgtcctctta 61 aaagatccaa aaaatgagac ttctagcaaa gattatttgc cttatgttat gggctatttg 121 tgtagcagaa gattgcaatg aacttcctcc aagaagaaat acagaaattc tgacaggttc 181 ctggtctgac caaacatatc cagaaggcac ccaggctatc tataaatgcc gccctggata 241 tagatctctt ggaaatgtaa taatggtatg caggaaggga gaatgggttg ctcttaatcc 301 attaaggaaa tgtcagaaaa ggccctgtgg acatcctgga gatactcctt ttggtacttt 361 tacccttaca ggaggaaatg tgtttgaata tggtgtaaaa gctgtgtata catgtaatga 421 ggggtatcaa ttgctaggtg agattaatta ccgtgaatgt gacacagatg gatggaccaa 481 tgatattcct atatgtgaag ttgtgaagtg tttaccagtg acagcaccag agaatggaaa 541 aattgtcagt agtgcaatgg aaccagatcg ggaataccat tttggacaag cagtacggtt 601 tgtatgtaac tcaggctaca agattgaagg agatgaagaa atgcattgtt cagacgatgg 661 tttttggagt aaagagaaac caaagtgtgt ggaaatttca tgcaaatccc cagatgttat 721 aaatggatct cctatatctc agaagattat ttataaggag aatgaacgat ttcaatataa 781 atgtaacatg ggttatgaat acagtgaaag aggagatgct gtatgcactg aatctggatg 841 gcgtccgttg ccttcatgtg aagaaaaatc atgtgataat ccttatattc caaatggtga 901 ctactcacct ttaaggatta aacacagaac tggagatgaa atcacgtacc agtgtagaaa 961 tggtttttat cctgcaaccc ggggaaatac agccaaatgc acaagtactg gctggatacc 1021 tgctccgaga tgtaccttga aaccttgtga ttatccagac attaaacatg gaggtctata 1081 tcatgagaat atgcgtagac catactttcc agtagctgta ggaaaatatt actcctatta 1141 ctgtgatgaa cattttgaga ctccgtcagg aagttactgg gatcacattc attgcacaca 1201 agatggatgg tcgccagcag taccatgcct cagaaaatgt tattttcctt atttggaaaa 1261 tggatataat caaaattatg gaagaaagtt tgtacagggt aaatctatag acgttgcctg 1321 ccatcctggc tacgctcttc caaaagcgca gaccacagtt acatgtatgg agaatggctg 1381 gtctcctact cccagatgca tccgtgtcag ctttaccctc tgaacttctg atcgaaggtc 1441 atccctctcc agcttgagtg gatcaaagat gacaagggcc aatggaacca agtttgagtc 1501 ttgccaggtc aatacttggg tcctgagtat ggtgactagt atctgttttg ttatgtgtgt 1561 attattccag ccagaatggg aaatgctaat tcagctcctc caggcagcca atggggctgg 1621 tggctttgag attattaaac tcttctggat cctctacg // LOCUS HSHTFIIAS 657 bp RNA PRI 21-APR-1995 DEFINITION H.sapiens hTFIIAs mRNA for smallest (gamma) TFIIA subunit. ACCESSION X81713 NID g558213 KEYWORDS small subunit; TFIIA gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 657) AUTHORS Sun,X., Ma,D., Sheldon,M., Yeung,K. and Reinberg,D. TITLE Reconstitution of human TFIIA activity from recombinant polypeptides: a role in TFIID-mediated transcription JOURNAL Genes Dev. 8 (19), 2336-2348 (1994) MEDLINE 95047377 REFERENCE 2 (bases 1 to 657) AUTHORS Sun,X. TITLE Direct Submission JOURNAL Submitted (19-SEP-1994) X. Sun, HHMI, Dept of Biochemistry, UMDNJ-Robert Wood Johnson Medical School, 675 Hoes Lane, Piscataway, NJ08854-5635, USA REFERENCE 3 (bases 1 to 657) AUTHORS Ozer,J., Moore,P.A., Bolden,A.H., Lee,A., Rosen,C.A. and Lieberman,P.M. TITLE Molecular cloning of the small (gamma) subunit of human TFIIA reveals functions critical for activated transcription JOURNAL Genes Dev. 8 (19), 2324-2335 (1994) MEDLINE 95047376 FEATURES Location/Qualifiers source 1..657 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Hela" gene 100..429 /gene="hTFIIAs" CDS 100..429 /gene="hTFIIAs" /codon_start=1 /product="smallest subunit of TFIIA" /db_xref="PID:g558214" /translation="MAYQLYRNTTLGNSLQESLDELIQSQQITPQLALQVLLQFDKAI NAALAQRVRNRVNFRGSLNTYRFCDNVWTFVLNDVEFREVTELIKVDKVKIVACDGKN TGSNTTE" BASE COUNT 210 a 133 c 128 g 186 t ORIGIN 1 cgggctcgtg gcggcttctg tccgctccga gggaagcgcc ttccccacag gacatcaatg 61 caagcttgaa taagaaaaac aaattcttcc tcctaagcca tggcatatca gttatacaga 121 aatactactt tgggaaacag tcttcaggag agcctagatg agctcataca gtctcaacag 181 atcacccccc aacttgccct tcaagttcta cttcagtttg ataaggctat aaatgcagca 241 ctggctcaga gggtcaggaa cagagtcaat ttcaggggct ctctaaatac gtacagattc 301 tgcgataatg tgtggacttt tgtactgaat gatgttgaat tcagagaggt gacagaactt 361 attaaagtgg ataaagtgaa aattgtagcc tgtgatggta aaaatactgg ctccaatact 421 acagaatgaa tagaaaaaat atgacttttt tacaccatct tctgttattc attgcttttg 481 aagagaagca tagaagagac tttttattta ttctagaatt gcagaaatga ctacactgtg 541 ctataccaga gaattccagt agaaagaaac ttgtaactct gtagcctctt acatcacctt 601 tattatacag catgaaaaac cataacttct ttttaaggac aaaagttgtt gccttcc // LOCUS HSHTH1R 1816 bp RNA PRI 23-NOV-1995 DEFINITION Human mRNA for tyrosine hydroxylase (HTH-1). ACCESSION X05290 NID g32501 KEYWORDS tyrosine hydroxylase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1816) AUTHORS Grima,B., Lamouroux,A., Boni,C., Julien,J.F., Javoy-Agid,F. and Mallet,J. TITLE A single human gene encoding multiple tyrosine hydroxylases with different predicted functional characteristics JOURNAL Nature 326 (6114), 707-711 (1987) MEDLINE 87173064 FEATURES Location/Qualifiers source 1..1816 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="pheochromocytoma tumour" CDS 20..1513 /note="HTH-1 (AA 1-497)" /codon_start=1 /db_xref="PID:g32502" /db_xref="SWISS-PROT:P07101" /translation="MPTPDATTPQAKGFRRAVSELDAKQAEAIMSPRFIGRRQSLIED ARKEREAAVAAAAAAVPSEPGDPLEAVAFEEKEGKAVLNLLFSPRATKPSALSRAVKV FETFEAKIHHLETRPAQRPRAGGPHLEYFVRLEVRRGDLAALLSGVRQVSEDVRSPAG PKVPWFPRKVSELDKCHHLVTKFDPDLDLDHPGFSDQVYRQRRKLIAEIAFQYRHGDP IPRVEYTAEEIATWKEVYTTLKGLYATHACGEHLEAFALLERFSGYREDNIPQLEDVS RFLKERTGFQLRPVAGLLSARDFLASLAFRVFQCTQYIRHASSPMHSPEPDCCHELLG HVPMLADRTFAQFSQDIGLASLGASDEEIEKLSTLSWFTVEFGLCKQNGEVKAYGAGL LSSYGELLHCLSEEPEIRAFDPEAAAVQPYQDQTYQSVYFVSESFSDAKDKLRSYASR IQRPFSVKFDPYTLAIDVLDSPQAVRRSLEGVQDELDTLAHALSAIG" misc_feature 1795..1800 /note="pot. polyA signal" polyA_site 1816 /note="poly A site" BASE COUNT 295 a 623 c 573 g 325 t ORIGIN 1 cggacctcca cactgagcca tgcccacccc cgacgccacc acgccacagg ccaagggctt 61 ccgcagggcc gtgtctgagc tggacgccaa gcaggcagag gccatcatgt ccccgcggtt 121 cattgggcgc aggcagagcc tcatcgagga cgcccgcaag gagcgggagg cggcggtggc 181 agcagcggcc gctgcagtcc cctcggagcc cggggacccc ctggaggctg tggcctttga 241 ggagaaggag gggaaggccg tgctaaacct gctcttctcc ccgagggcca ccaagccctc 301 ggcgctgtcc cgagctgtga aggtgtttga gacgtttgaa gccaaaatcc accatctaga 361 gacccggccc gcccagaggc cgcgagctgg gggcccccac ctggagtact tcgtgcgcct 421 cgaggtgcgc cgaggggacc tggccgccct gctcagtggt gtgcgccagg tgtcagagga 481 cgtgcgcagc cccgcggggc ccaaggtccc ctggttccca agaaaagtgt cagagctgga 541 caagtgtcat cacctggtca ccaagttcga ccctgacctg gacttggacc acccgggctt 601 ctcggaccag gtgtaccgcc agcgcaggaa gctgattgct gagatcgcct tccagtacag 661 gcacggcgac ccgattcccc gtgtggagta caccgccgag gagattgcca cctggaagga 721 ggtctacacc acgctgaagg gcctctacgc cacgcacgcc tgcggggagc acctggaggc 781 ctttgctttg ctggagcgct tcagcggcta ccgggaagac aatatccccc agctggagga 841 cgtctcccgc ttcctgaagg agcgcacggg cttccagctg cggcctgtgg ccggcctgct 901 gtccgcccgg gacttcctgg ccagcctggc cttccgcgtg ttccagtgca cccagtatat 961 ccgccacgcg tcctcgccca tgcactcccc tgagccggac tgctgccacg agctgctggg 1021 gcacgtgccc atgctggccg accgcacctt cgcgcagttc tcgcaggaca ttggcctggc 1081 gtccctgggg gcctcggatg aggaaattga gaagctgtcc acgctgtcat ggttcacggt 1141 ggagttcggg ctgtgtaagc agaacgggga ggtgaaggcc tatggtgccg ggctgctgtc 1201 ctcctacggg gagctcctgc actgcctgtc tgaggagcct gagattcggg ccttcgaccc 1261 tgaggctgcg gccgtgcagc cctaccaaga ccagacgtac cagtcagtct acttcgtgtc 1321 tgagagcttc agtgacgcca aggacaagct caggagctat gcctcacgca tccagcgccc 1381 cttctccgtg aagttcgacc cgtacacgct ggccatcgac gtgctggaca gcccccaggc 1441 cgtgcggcgc tccctggagg gtgtccagga tgagctggac acccttgccc atgcgctgag 1501 tgccattggc taggtgcacg gcgtccctga gggcccttcc caacctcccc tggtcctgca 1561 ctgtcccgga gctcaggccc tggtgagggg ctgggtcccg ggtgcccccc atgccctccc 1621 tgctgccagg ctcccactgc ccctgcacct gcttctcagc gcaacagctg tgtgtgcccg 1681 tggtgaggtt gtgctgcctg tggtgaggtc ctgtcctggc tcccagggtc ctgggggctg 1741 ctgcactgcc ctccgccctt ccctgacact gtctgctgcc ccaatcaccg tcacaataaa 1801 agaaactgtg gtctct // LOCUS HSHUBF 3097 bp RNA PRI 28-JUN-1994 DEFINITION Human mRNA for upstream binding factor (hUBF). ACCESSION X53390 NID g509240 KEYWORDS DNA binding protein; transcription factor; upstream binding factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3097) AUTHORS Jantzen,H.M. TITLE Direct Submission JOURNAL Submitted (04-MAY-1990) Jantzen H.-M., University of California, Berkeley, Department of Molecular and Cell Biology, 401 barker Hall, Berkeley, CA 94720, USA REFERENCE 2 (bases 1 to 3097) AUTHORS Jantzen,H.M., Admon,A., Bell,S.P. and Tjian,R. TITLE Nucleolar transcription factor hUBF contains a DNA-binding motif with homology to HMG proteins JOURNAL Nature 344 (6269), 830-836 (1990) MEDLINE 90231434 FEATURES Location/Qualifiers source 1..3097 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /clone_lib="lambda gt11" CDS 148..2442 /note="upstream binding factor (AA 1-764)" /codon_start=1 /db_xref="PID:g509241" /db_xref="SWISS-PROT:P17480" /translation="MNGEADCPTDLEMAAPKGQDRWSQEDMLTLLECMKNNLPSNDSS KFKTTESHMDWEKVAFKDFSGDMCKLKWVEISNEVRKFRTLTELILDAQEHVKNPYKG KKLKKHPDFPKKPLTPYFRFFMEKRAKYAKLHPEMSNLDLTKILSKKYKELPEKKKMK YIQDFQREKQEFERNLARFREDHPDLIQNAKKSDIPEKPKTPQQLWYTHEKKVYLKVR PDATTKEVKDSLGKQWSQLSDKKRLKWIHKALEQRKEYEEIMRDYIQKHPELNISEEG ITKSTLTKAERQLKDKFDGRPTKPPPNSYSLYCAELMANMKDVPSTERMVLCSQQWKL LSQKEKDAYHKKCDQKKKDYEVELLRFLESLPEEEQQRVLGEEKMLNINKKQATSPAS KKPAQEGGKGGSEKPKRPVSAMFIFSEEKRRQLQEERPELSESELTRLLARMWNDLSE KKKAKYKAREAALKAQSERKPGGEREERGKLPESPKRAEEIWQQSVIGDYLARFKNDR VKALKAMEMTWNNMEKKEKLMWIKKAAEDQKRYERELSEMRAPPAATNSSKKMKFQGE PKKPPMNGYQKFSQELLSNGELNHLPLKERMVEIGSRWQRISQSQKEHYKKLAEEQQK QYKVHLDLWVKSLSPQDRAAYKEYISNKRKSMTKLRGPNPKSSRTTLQSKSESEEDDE EDEDDEDEDEEEEDDENGDSSEDGGDSSESSSEDESEDGDENEEDDEDEDDDEDDDED EDNESEGSSSSSSSSGDSSDSDSN" BASE COUNT 863 a 829 c 916 g 489 t ORIGIN 1 gccagacaaa gcccgagatg gcgaagcgga gcgacggcta atggcgagcc ccacgcgccg 61 cgctccgccc gccccgctcc ggtgaggtgg ctttgacccc gggttgcccg gccagcacga 121 ccgaggaggt ggctggacag ctggaggatg aacggagaag ccgactgccc cacagacctg 181 gaaatggccg cccccaaagg ccaagaccgt tggtcccagg aagacatgct gactttgctg 241 gaatgcatga agaacaacct tccatccaat gacagctcca agttcaaaac caccgaatca 301 cacatggact gggaaaaagt agcatttaaa gacttttctg gagacatgtg caagctcaaa 361 tgggtggaga tttctaatga ggtgaggaag ttccgtacat tgacagaatt gatcctcgat 421 gctcaggaac atgttaaaaa tccttacaaa ggcaaaaaac tcaagaaaca cccagacttc 481 ccaaagaagc ccctgacccc ttatttccgc ttcttcatgg agaagcgggc caagtatgcg 541 aaactccacc ctgagatgag caacctggac ctaaccaaga ttctgtccaa gaaatacaag 601 gagcttccgg agaagaagaa gatgaaatat attcaggact tccagagaga gaaacaggag 661 ttcgagcgaa acctggcccg attcagggag gatcaccccg acctaatcca gaatgccaag 721 aaatcggaca tcccagagaa gcccaaaacc ccccagcagc tgtggtacac ccacgagaag 781 aaggtgtatc tcaaagtgcg gccagatgcc actacgaagg aggtgaagga ctccctgggg 841 aagcagtggt ctcagctctc ggacaaaaag aggctgaaat ggattcataa ggccctggag 901 cagcggaagg agtacgagga gatcatgaga gactatatcc agaagcaccc agagctgaac 961 atcagtgagg agggtatcac caagtccacc ctcaccaagg ccgaacgcca gctcaaggac 1021 aagtttgacg ggcgacccac caagccacct ccgaacagct actcgctgta ctgcgcagag 1081 ctcatggcca acatgaagga cgtgcccagc acagagcgca tggtgctgtg cagccagcag 1141 tggaagctgc tgtcccagaa ggagaaggac gcctatcaca agaagtgtga tcagaaaaag 1201 aaagattacg aggtggagct gctccgtttc ctcgagagcc tgcctgagga ggagcagcag 1261 cgggtcttgg gggaagagaa gatgctgaac atcaacaaga agcaggccac cagccccgcc 1321 tccaagaagc cagcccagga agggggcaag ggcggctccg agaagcccaa gcggcccgtg 1381 tcggccatgt tcatcttctc ggaggagaaa cggcggcagc tgcaggagga gcggcctgag 1441 ctctccgaga gcgagctgac ccgcctgctg gcccgaatgt ggaacgacct gtctgagaag 1501 aagaaggcca agtacaaggc ccgagaggcg gcgctcaagg ctcagtcgga gaggaagccc 1561 ggcggggagc gcgaggaacg gggcaagctg cccgagtccc ccaaaagagc tgaggagatc 1621 tggcaacaga gcgttatcgg cgactacctg gcccgcttca agaatgaccg ggtgaaggcc 1681 ttgaaagcca tggaaatgac ctggaataac atggaaaaga aggagaaact gatgtggatt 1741 aagaaggcag ccgaagacca aaagcgatat gagagagagc tgagtgagat gcgggcacct 1801 ccagctgcta caaattcttc caagaagatg aaattccagg gagaacccaa gaagcctccc 1861 atgaacggtt accagaagtt ctcccaggag ctgctgtcca atggggagct gaaccacctg 1921 ccgctgaagg agcgcatggt ggagatcggc agtcgctggc agcgcatctc ccagagccag 1981 aaggagcact acaaaaagct ggccgaggag cagcaaaagc agtacaaggt gcacctggac 2041 ctctgggtta agagcctgtc tccccaggac cgtgcagcat ataaagagta catctccaat 2101 aaacgtaaga gcatgaccaa gctgcgaggc ccaaacccca aatccagccg gactactctg 2161 cagtccaagt cggagtccga ggaggatgat gaagaggatg aggatgacga ggacgaggat 2221 gaagaagagg aagatgatga gaatggggac tcctctgaag atggcggcga ctcctctgag 2281 tccagcagcg aggacgagag cgaggatggg gatgagaatg aagaggatga cgaggacgaa 2341 gacgacgacg aggatgacga tgaggatgaa gataatgagt ccgagggcag cagctccagc 2401 tcctcctcct caggggactc ctcagactct gactccaact gaggctcagc cccaccccag 2461 ggcagccagg gagagcccag gagctcccct ccccaactga ccacctttgt ttctccccca 2521 tgttctgtcc cttgcccccc tggcctcccc cactttcttt ctttctttaa aaaaaaaaaa 2581 aaaaatacgg tgggggtagg gggctggagg agcccaggcc aggactctgc agcctcagag 2641 acatcagccc ttgggggtcc tcctccaggg acagcaacta tcagactaag ccagcaccgg 2701 accagcctgg cccaccccac ccacttctgc acttgcggtt ccggcatgga caatggaccg 2761 gagagtgggg gtggggggtc ccaaagagtt tgatgaggcc ctccacacct gcggcccaat 2821 ccaaggtggg gtggaagctt ggggaagacc cattccttcc cagaggggcc tgccacctgg 2881 acccctgcat tggaactgga ggcagggaac atggggagga ggagggtcgg tgccttcaag 2941 aaaacaggca ctgccctggt ggccctctcc cctgccccct gcaggaagga gctgcctgga 3001 cccgttcatg ggggaggggg cagaagtgtt ttttatatat gtgtatatat ttttttttaa 3061 gctctgagct gtcaacgaga cgtttcctac cgaaaaa // LOCUS HSHUMAPC 1901 bp RNA PRI 17-JAN-1996 DEFINITION H.sapiens Cctg mRNA for chaperonin. ACCESSION X74801 NID g671526 KEYWORDS Cctg gene; chaperonin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1108) AUTHORS Malik,A.N. TITLE Direct Submission JOURNAL Submitted (24-AUG-1993) A.N. Malik, University of Wales College of Cardiff, Dept of Biochemistry, PO Box 903, Cardiff CF1 1st, UK REFERENCE 2 (bases 1 to 1108) AUTHORS Walkley,N.A., Demaine,A.G. and Malik,A.N. TITLE Cloning, structure and mRNA expression of human Cctg, which encodes the chaperonin subunit CCT gamma JOURNAL Biochem. J. 313 (Pt 2), 381-389 (1996) MEDLINE 96152518 REFERENCE 3 (bases 1 to 1901) AUTHORS Malik,A.N. TITLE Direct Submission JOURNAL Submitted (13-FEB-1995) A.N. Malik, University of Wales College of Cardiff, Dept of Biochemistry, PO Box 903, Cardiff CF1 1st, UK FEATURES Location/Qualifiers source 1..1901 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" /clone_lib="lambda GT10 clones, M13 and PBLUESCRIPT subclones" /dev_stage="adult" gene 1..1635 /gene="Cctg" CDS 1..1635 /gene="Cctg" /codon_start=1 /product="gamma subunit of CCT chaperonin" /db_xref="PID:g671527" /translation="MGHRPVLVLSQNTKRESGRKVQSGNINAAKTIADIIRTCLGPKS MMKMLLDPMGGIVMTNDGNAILREIQVQHPAAKSMIEISRTQDEEVGDGTTSVIILAG EMLSVAEHFLEQQMHPTVVISAYRKALDDMISTLKKISIPVDISDSDMMLNIINSSIT TKAISRWSSLACNIALDAVKMVQFEENGRKEIDIKKYARVEKIPGGIIEDSCVLRGVM INKDVTHPRMRRYIKNPRIVLLDSSLEYKKGGSQTDIEITREEDFTRILQMEEEYIQQ LCEDIIQLKPDVVITEKGISDLAQHYLMRANITAIRRVRKTDNNRIARACGARIVSRP EELREDDVGTGAGLLEIKKIGDEYFTFITDCKDPKACTILLRGASKEILSEVERNLQD AMQVCRNVLLDPQLVPGGGASEMAVAHALTEKSKAMTGVEQWPYRAVAQALEVIPRTL IQNCGASTIRLLTSLRAKHTQENCETWGVNGETGTLVDMKELGIWEPLAVKLQTYKTA VETAVLLLRIDDIVSGHKKKGDDQSRQGGAPDAGQE" polyA_signal 1052..1057 /gene="Cctg" BASE COUNT 551 a 438 c 498 g 414 t ORIGIN 1 atggggcatc ggccggtgct cgtgctcagc cagaacacaa agcgtgaatc cggaagaaaa 61 gttcaatctg gaaacatcaa tgctgccaag actattgcag atatcatccg aacatgtttg 121 ggacccaagt ccatgatgaa gatgcttttg gacccaatgg gaggcattgt gatgaccaat 181 gatggcaatg ccattcttcg agagattcaa gtccagcatc cagcggccaa gtccatgatc 241 gaaattagcc ggacccagga tgaagaagtt ggagatggga ccacatcagt aattattctt 301 gcaggggaaa tgctgtctgt agctgagcac ttcctggagc agcagatgca cccaacagtg 361 gtgatcagtg cttaccgcaa ggcattggat gatatgatca gcaccctaaa gaaaataagt 421 atcccagtcg acatcagtga cagtgatatg atgctgaaca tcatcaacag ctctattact 481 accaaagcca tcagccggtg gtcatctttg gcttgcaaca ttgccctgga tgctgtcaag 541 atggtacagt ttgaggagaa tggtcggaaa gagattgaca taaaaaaata tgcaagagtg 601 gaaaagatac ctggaggcat cattgaagac tcctgtgtct tgcgtggagt catgattaac 661 aaggatgtga cccatccacg tatgcggcgc tatatcaaga accctcgcat tgtgctgctg 721 gattcttctc tggaatacaa gaaaggagga agccagactg acattgagat tacacgagag 781 gaggacttca cccgaattct ccagatggag gaagagtaca tccagcagct ctgtgaggac 841 attatccaac tgaagcccga tgtggtcatc actgaaaagg gcatctcaga tttagctcag 901 cactacctta tgcgggccaa tatcacagcc atccgcagag tccggaagac agacaataat 961 cgcattgcta gagcctgtgg ggcccggata gtcagccgac cagaggaact gagagaagat 1021 gatgttggaa caggagcagg cctgttggaa atcaagaaaa ttggagatga atactttact 1081 ttcatcactg actgcaaaga ccccaaggcc tgcaccattc tcctccgggg ggctagcaaa 1141 gagattctct cggaagtaga acgcaacctc caggatgcca tgcaagtgtg tcgcaatgtt 1201 ctcctggacc ctcagctggt gccagggggt ggggcctccg agatggctgt cgcccatgcc 1261 ttgacagaaa aatccaaggc catgactggt gtggaacaat ggccatacag ggctgttgcc 1321 caggccctag aggtcattcc tcgtaccctg atccagaact gtggggccag caccatccgt 1381 ctacttacct cccttcgggc caagcacacc caggagaact gtgagacctg gggtgtaaat 1441 ggtgagacgg gtactttggt ggacatgaag gaactgggca tatgggagcc attggctgtg 1501 aagctgcaga cttataagac agcagtggag acggcagttc tgctactgcg aattgatgac 1561 atcgtttcag gccacaaaaa gaaaggcgat gaccagagcc ggcaaggcgg ggctcctgat 1621 gctggccagg agtgagtgct aggcaaggct acttcaatgc acagaaccag cagagtctcc 1681 ccttttcctg agccagagtg ccaggaacac tgtggacgtc tttgttcaga agggatcagg 1741 ttggggggca gcccccagtc cctttctgtc ccagctcagt tttccaaaag acactgacat 1801 gtaattcttc tctattgtaa ggtttccatt tagtttgctt ccgatgatta aatctaagtc 1861 atttgaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa a // LOCUS HSHUMFLI 2938 bp RNA PRI 28-JUN-1995 DEFINITION H.sapiens HUMFLI-1 mRNA. ACCESSION X67001 S44250 NID g32529 KEYWORDS FLI-1 gene homologue. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2938) AUTHORS Delattre,O. TITLE Direct Submission JOURNAL Submitted (26-MAY-1992) O. Delattre, Lab. de Genet. des Tumeurs. Inst. Curie, 26 rue D'Ulm, 75231 Paris Cedex, FRANCE REFERENCE 2 (bases 1 to 2938) AUTHORS Delattre,O., Zucman,J., Plougastel,B., Desmaze,C., Melot,T., Peter,M., Kovar,H., Joubert,I., de Jong,P., Rouleau,G., Aurias,A. and Thomas,G. TITLE Gene fusion with an ETS DNA-binding domain caused by chromosome translocation in human tumours JOURNAL Nature 359 (6391), 162-165 (1992) MEDLINE 92396239 FEATURES Location/Qualifiers source 1..2938 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="bone marrow" /clone_lib="cDNA, Clontech HL1058" /chromosome="11q24" gene 143..1501 /gene="HUMFLI-1" CDS 143..1501 /gene="HUMFLI-1" /codon_start=1 /product="homologue of the murine FLI-1 gene" /db_xref="PID:g32530" /db_xref="SWISS-PROT:Q01543" /translation="MDGTIKEALSVVSDDQSLFDSAYGAAAHLPKADMTASGSPDYGQ PHKINPLPPQQEWINQPVRVNVKREYDHMNGSRESPVDCSVSKCSKLVGGGESNPMNY NSYMDEKNGPPPPNMTTNERRVIVPADPTLWTQEHVRQWLEWAIKEYSLMEIDTSFFQ NMDGKELCKMNKEDFLRATTLYNTEVLLSHLSYLRESSLLAYNTTSHTDQSSRLSVKE DPSYDSVRRGAWGNNMNSGLNKSPPLGGAQTISKNTEQRPQPDPYQILGPTSSRLANP GSGQIQLWQFLLELLSDSANASCITWEGTNGEFKMTDPDEVARRWGERKSKPNMNYDK LSRALRYYYDKNIMTKVHGKRYAYKFDFHGIAQALQPHPTESSMYKYPSDISYMPSYH AHQQKVNFVPPHPSSMPVTSSSFFGAASQYWTSPTGGIYPNPNVPRHPNTHVPSHLGS YY" misc_feature 971..1222 /gene="HUMFLI-1" /note="ETS domain" polyA_signal 2390..2395 polyA_signal 2411..2416 polyA_signal 2415..2420 polyA_signal 2908..2913 /evidence=experimental polyA_site 2927 BASE COUNT 847 a 692 c 675 g 724 t ORIGIN 1 ggagggcgct cgcagggggc acgcagggag ggcccagggc gccagggagg ccgcgccggg 61 ctaatccgaa ggggctgcga ggtcaggctg taaccgggtc aatgtgtgga atattggggg 121 gctcggctgc agacttggcc aaatggacgg gactattaag gaggctctgt cggtggtgag 181 cgacgaccag tccctctttg actcagcgta cggagcggca gcccatctcc ccaaggccga 241 catgactgcc tcggggagtc ctgactacgg gcagccccac aagatcaacc ccctcccacc 301 acagcaggag tggatcaatc agccagtgag ggtcaacgtc aagcgggagt atgaccacat 361 gaatggatcc agggagtctc cggtggactg cagcgttagc aaatgcagca agctggtggg 421 cggaggcgag tccaacccca tgaactacaa cagctatatg gacgagaaga atggcccccc 481 tcctcccaac atgaccacca acgagaggag agtcatcgtc cccgcagacc ccacactgtg 541 gacacaggag catgtgaggc aatggctgga gtgggccata aaggagtata gcttgatgga 601 gatcgacaca tcctttttcc agaacatgga tggcaaggaa ctgtgtaaaa tgaacaagga 661 ggacttcctc cgcgccacca ccctctacaa cacggaagtg ctgttgtcac acctcagtta 721 cctcagggaa agttcactgc tggcctataa tacaacctcc cacaccgacc aatcctcacg 781 attgagtgtc aaagaagacc cttcttatga ctcagtcaga agaggagcat ggggcaataa 841 catgaattct ggcctcaaca aaagtcctcc ccttggaggg gcacaaacga tcagtaagaa 901 tacagagcaa cggccccagc cagatccgta tcagatcctg ggcccgacca gcagtcgcct 961 agccaaccct ggaagcgggc agatccagct gtggcaattc ctcctggagc tgctctccga 1021 cagcgccaac gccagctgta tcacctggga ggggaccaac ggggagttca aaatgacgga 1081 ccccgatgag gtggccaggc gctggggcga gcggaaaagc aagcccaaca tgaattacga 1141 caagctgagc cgggccctcc gttattacta tgataaaaac attatgacca aagtgcacgg 1201 caaaagatat gcttacaaat ttgacttcca cggcattgcc caggctctgc agccacatcc 1261 gaccgagtcg tccatgtaca agtacccttc tgacatctcc tacatgcctt cctaccatgc 1321 ccaccagcag aaggtgaact ttgtccctcc ccatccatcc tccatgcctg tcacttcctc 1381 cagcttcttt ggagccgcat cacaatactg gacctccccc acggggggaa tctaccccaa 1441 ccccaacgtc ccccgccatc ctaacaccca cgtgccttca cacttaggca gctactacta 1501 gaagcttact catcagtggc cttctagctg aagcccatcc tgcacactta ctggatgctt 1561 tggactcaac aggacatatg tggccttgaa gggaagacaa aactggatgt tctttcttgt 1621 tggatagaac ctttgtattt gttctttaaa aacatttttt ttaatgttgg taacttttgc 1681 ttcctctacc tgaacaaaga gatgaataat tccatgggcc agtatgccag tttgaattct 1741 cagtctccta gcatcttgtg agttgcatat taagattact ggaatggtta agtcatggtt 1801 ctgagaaaga agctgtacgt tttctttatg tttttatgac caaagcagtt tcttgtcaat 1861 acacggggtt cagtatgaca cagaatcatg gacttaaccc gtcatgttct ggtttgagat 1921 ttagtgacaa atagaggtgg gaagcttata atctaatttt aggaggacca aattcagtgg 1981 atggcaactg gaacattgat tgtaaggcca gtgaagtttt cacccaactg gaatttgatg 2041 gaaagaaggt ttgtgtgttt aagacgccaa gggcattgca gaatccctct cagtggacag 2101 tatgcactca gctgaccact ctctctagaa atagtcaaga tatgaactaa gaaattttaa 2161 tgcaaataca tacattcctg aaagacgggg aattaaatta ctaatttttt ttttttttta 2221 aatgatgaca gtggtcccag aacttggaaa agttgtaggg atttctaaac tcaagcagat 2281 tcgcaagtgc tgtgcgcttg tcagaccatc agaccagggc caaccaatca gaaggcaact 2341 tactgtataa attatgcaga gttattttcc tatatctcac agtattaaaa ataaataatt 2401 aaaaattaag aataaataaa cgagttgacc tcggtcacaa aagcagtttt actatcgaat 2461 caatcgctgt tatttttttt aatgtaattt gtacatcttt tttcaatctg tacatttggg 2521 ctgtctgtat gtttttatag ctggttttta aaaagcataa tatgcctata gctgaaaagg 2581 aaacagggct gtttaagtca ctgacttatg agaaagcaaa gcactggtac agttatttaa 2641 caggcataca caagcaggga aagataatcc atttagatct ttaatgcttt ggaaatgcgt 2701 gtaacagtac tgcaataatc acagctctgg gaaaaacaac gaaactttcc cttgtggaga 2761 ggagggattt tcctgctcta tataagcaac atatttttag acattaaaat atatataatt 2821 ttgcaggtaa ttgttgactt ttttaactat attaagcgtt aagctgacaa ctgtcaaaga 2881 agaccatgtt gtaaaataat ttgactaaat aaatggttcc ttctctcaaa aaaaaaaa // LOCUS HSHUMIG 2545 bp RNA PRI 16-NOV-1993 DEFINITION H.sapiens Humig mRNA. ACCESSION X72755 S60728 NID g311375 KEYWORDS chemokine; cytokine; Humig gene; secreted protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2545) AUTHORS Farber,J.M. TITLE Direct Submission JOURNAL Submitted (22-MAR-1993) J.M. Farber, Johns Hopkins Univ. School of Medicine, Ross 1147, 720 Rutland Avenue, Baltimore, MD 21205, USA REFERENCE 2 (bases 1 to 2545) AUTHORS Farber,J.M. TITLE HuMig: a new human member of the chemokine family of cytokines JOURNAL Biochem. Biophys. Res. Commun. 192 (1), 223-230 (1993) MEDLINE 93236577 FEATURES Location/Qualifiers source 1..2545 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /dev_stage="child" /tissue_type="leukaemia" /cell_type="monocyte" /cell_line="THP-1" /clone_lib="THP-1/IFN-gamma cDNA" /clone="H-1-3" misc_feature 13..19 /note="cis-acting element; putative" gene 40..417 /gene="Humig" CDS 40..417 /gene="Humig" /codon_start=1 /db_xref="PID:g311376" /db_xref="SWISS-PROT:Q07325" /translation="MKKSGVLFLLGIILLVLIGVQGTPVVRKGRCSCISTNQGTIHLQ SLKDLKQFAPSPSCEKIEIIATLKNGVQTCLNPDSADVKELIKKWEKQVSQKKKQKNG KKHQKKKVLKVRKSQRSRQKKTT" BASE COUNT 755 a 581 c 457 g 752 t ORIGIN 1 atccaataca ggagtgactt ggaactccat tctatcacta tgaagaaaag tggtgttctt 61 ttcctcttgg gcatcatctt gctggttctg attggagtgc aaggaacccc agtagtgaga 121 aagggtcgct gttcctgcat cagcaccaac caagggacta tccacctaca atccttgaaa 181 gaccttaaac aatttgcccc aagcccttcc tgcgagaaaa ttgaaatcat tgctacactg 241 aagaatggag ttcaaacatg tctaaaccca gattcagcag atgtgaagga actgattaaa 301 aagtgggaga aacaggtcag ccaaaagaaa aagcaaaaga atgggaaaaa acatcaaaaa 361 aagaaagttc tgaaagttcg aaaatctcaa cgttctcgtc aaaagaagac tacataagag 421 accacttcac caataagtat tctgtgttaa aaatgttcta ttttaattat accgctatca 481 ttccaaagga ggatggcata taatacaaag gcttattaat ttgactagaa aatttaaaac 541 attactctga aattgtaact aaagttagaa agttgatttt aagaatccaa acgttaagaa 601 ttgttaaagg ctatgattgt ctttgttctt ctaccaccca ccagttgaat ttcatcatgc 661 ttaaggccat gattttagca atacccatgt ctacacagat gttcacccaa ccacatccca 721 ctcacaacag ctgcctggaa gagcagccct aggcttccac gtactgcagc ctccagagag 781 tatctgaggc acatgtcagc aagtcctaag cctgttagca tgctggtgag ccaagcagtt 841 tgaaattgag ctggacctca ccaagctgct gtggccatca acctctgtat ttgaatcagc 901 ctacaggcct cacacacaat gtgtctgaga gattcatgct gattgttatt gggtatcacc 961 actggagatc accagtgtgt ggctttcaga gcctcctttc tggctttgga agccatgtga 1021 ttccatcttg cccgctcagg ctgaccactt tatttctttt tgttcccctt tgcttcattc 1081 aagtcagctc ttctccatcc taccacaatg cagtgccttt cttctctcca gtgcacctgt 1141 catatgctct gatttatctg agtcaactcc tttctcatct tgtccccaac accccacaga 1201 agtgctttct tctcccaatt catcctcact cagtccagct tagttcaagt cctgcctctt 1261 aaataaacct ttttggacac acaaattatc ttaaaactcc tgtttcactt ggttcagtac 1321 cacatgggtg aacactcaat ggttaactaa ttcttgggtg tttatcctat ctctccaacc 1381 agattgtcag ctccttgagg gcaagagcca cagtatattt ccctgtttct tccacagtgc 1441 ctaataatac tgtggaacta ggttttaata attttttaat tgatgttgtt atgggcagga 1501 tggcaaccag accattgtct cagagcaggt gctggctctt tcctggctac tccatgttgg 1561 ctagcctctg gtaacctctt acttattatc ttcaggacac tcactacagg gaccagggat 1621 gatgcaacat ccttgtcttt ttatgacagg atgtttgctc agcttctcca acaataagaa 1681 gcacgtggta aaacacttgc ggatattctg gactgttttt aaaaaatata cagtttaccg 1741 aaaatcatat aatcttacaa tgaaaaggac tttatagatc agccagtgac caaccttttc 1801 ccaaccatac aaaaattcct tttcccgaag gaaaagggct ttctcaataa gcctcagctt 1861 tctaagatct aacaagatag ccaccgagat ccttatcgaa actcatttta ggcaaatatg 1921 agttttattg tccgtttact tgtttcagag tttgtattgt gattatcaat taccacacca 1981 tctcccatga agaaagggaa cggtgaagta ctaagcgcta gaggaagcag ccaagtcggt 2041 tagtggaagc atgattggtg cccagttagc ctctgcagga tgtggaaacc tccttccagg 2101 ggaggttcag tgaattgtgt aggagaggtt gtctgtggcc agaatttaaa cctatactca 2161 ctttcccaaa ttgaatcact gctcacactg ctgatgattt agagtgctgt ccggtggaga 2221 tcccacccga acgtcttatc taatcatgaa actccctagt tccttcatgt aacttccctg 2281 aaaaatctaa gtgtttcata aatttgagag tctgtgaccc acttaccttg catctcacag 2341 gtagacagta tataactaac aaccaaagac tacatattgt cactgacaca cacgttataa 2401 tcatttatca tatatataca tacatgcata cactctcaaa gcaaataatt tttcacttca 2461 aaacagtatt gacttgtata ccttgtaatt tgaaatattt tctttgttaa aatagaatgg 2521 tatcaataaa tagaccatta atcag // LOCUS HSHUMM9 3250 bp RNA PRI 15-NOV-1993 DEFINITION H.sapiens HUMM9 mRNA. ACCESSION X74837 NID g416179 KEYWORDS Man9-mannosidase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3250) AUTHORS Bause,E. TITLE Direct Submission JOURNAL Submitted (01-SEP-1993) E. Bause, Institut fuer Physiologische Chemie, Chemie der Universitaet Bonn, 53115 Bonn, Nussallee 11, FRG REFERENCE 2 (bases 1 to 3250) AUTHORS Bause,E., Bieberich,E., Rolfs,A., Volker,C. and Schmidt,B. TITLE Molecular cloning and primary structure of Man9-mannosidase from human kidney JOURNAL Eur. J. Biochem. 217 (2), 535-540 (1993) MEDLINE 94039087 FEATURES Location/Qualifiers source 1..3250 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="kidney" /clone_lib="lambda gt10, Clontech HL1123a" sig_peptide 690..794 /gene="HUMM3" CDS 690..2567 /gene="HUMM3" /codon_start=1 /product="Man9-mannosidase" /db_xref="PID:g416180" /db_xref="SWISS-PROT:P33908" /translation="MNSNFITFDLKMSLLPSNLFSAFITLCFGAIFFLPDSSKLLSGV LFHSSPALQPAADHKPGPGARAEDAAEGRARRREEGAPGDPEAALEDNLARIRENHER ALREAKETLQKLPEEIQRDILLEKKKVAQDQLRDKAPFRGLPPVDFVPPIGVESREPA DAAIREKRAKIKEMMKHAWNNYKGYAWGLNELKPISKGGHSSSLFGNIKGATIVDALD TLFIMEMKHEFEEAKSWVEENLDFNVNAEISVFEVNIRFVGGLLSAYYLSGEEIFRKK AVELGVKLLPAFHTPSGIPWALLNMKSGIGRNWPWASGGSSILAEFGTLHLEFMHLSH LSGNPIFAEKVMNIRTVLNKLEKPQGLYPNYLNPSSGQWGQHHVSVGGLGDSFYEYLL KAWLMSDKTDLEAKKMYFDAVQAIETHLIRKSSSGLTYIAEWKRGLLEHKMGHLTCFA GGMFALGADAAPEGMAQHYLELGAEIARTCHESYNRTFMKLGPEAFRFDGGVEAIATR QNEKYYILRPEVMETYMYMWRLTHDPKYRKWAWEAVEALENHCRVNGGYSGLRDVYLL HESYDDVQQSFFLAETLKYLYLIFSDDDLLPLEHWIFNSEAHLLPILPKDKKEVEIRE E" gene 690..2567 /gene="HUMM3" mat_peptide 795..2564 /gene="HUMM3" /product="Man9-mannosidase" BASE COUNT 924 a 684 c 715 g 927 t ORIGIN 1 ccaacttatt taaaacaaaa caattttgta ggtattatta tacccatttc acagatgatg 61 ataaatgaga ccaatagaag ttaaataact tgccaaaggc cacacagctg gtgagtgatg 121 gagaacgaat taaaactcaa gtgagcataa ttctaaaagc catcttctcg ttagtgtttc 181 tcactatcca ggtctgcctt tgccttattt aactgaagtt aagccatcct tacctgtgat 241 cacctagcct ctcagtttgg ggggatcatt acagcgggtt tttaactccc aatgttctgg 301 tccagtttgc tttacatgtt cttatttata cattgtcaag gatgacctca ggacagtaca 361 gcaaggacac agtggcactt cacattttgt tcccacgaaa tgactggggc ataatctcag 421 atcatcttcc tttagaatgt ggaaacatca gcagaagaat attagtcttt atacaagtca 481 aatccaaaat gacacatgtg aaaactaata gagctgactt tcagccatga tagctttggc 541 acacctcaca tccctttgtt caacctctct tccctcaacg gagagctgca ttcctgggaa 601 tttctgttgt gcacttttcc cacttgccct gctgtcattt aaaggtgaac attctagttt 661 tgctaagaaa accctttcct tcatttggaa tgaacagcaa ttttattact tttgacctta 721 aaatgagttt gctgccttca aatcttttca gcgccttcat cacgctctgc ttcggggcga 781 tcttcttcct gccagactcc tccaagctgc tcagcggggt cctgttccac tccagccccg 841 ccttgcagcc ggccgccgac cacaagcccg ggcccggggc gcgcgccgag gacgcggccg 901 aggggcgagc ccggcgccgc gaggaggggg cacccgggga cccggaggcc gccctggagg 961 acaacttggc caggatccgc gaaaaccacg agcgggctct cagggaagcc aaggagaccc 1021 tgcagaagct gcccgaggag atccaaagag acatcctact ggagaagaag aaggtggccc 1081 aggaccagct gcgtgacaag gcgccgttca gaggcctgcc cccggtggac ttcgtgcccc 1141 caatcggggt ggagagccgg gagcccgccg acgccgccat ccgcgagaaa agggcaaaga 1201 tcaaagagat gatgaaacat gcttggaata attataaagg ttatgcctgg ggattaaatg 1261 aactcaaacc tatatcaaaa ggaggccatt caagcagttt gtttggtaac atcaaaggag 1321 caactatagt agatgccctg gatacacttt ttattatgga aatgaaacat gaatttgaag 1381 aagcaaaatc atgggttgaa gaaaatttag attttaatgt gaatgctgaa atttctgtct 1441 ttgaagtaaa tatacgcttt gttggtggac tactctcagc ctactatctg tctggagaag 1501 agatttttcg aaagaaagca gtggaacttg gggtaaaatt gctacctgca tttcatactc 1561 cctctggaat accttgggca ttgctgaata tgaaaagtgg tattggaagg aactggccct 1621 gggcctctgg aggcagcagt attctggcag aatttggaac cctgcatttg gagtttatgc 1681 acttgagcca cttatcagga aaccccatct ttgctgaaaa ggtaatgaat attcgaacag 1741 tactgaacaa actggaaaaa ccacaaggcc tttatcctaa ctatctgaat cccagtagtg 1801 gacagtgggg tcaacatcat gtatcagttg gaggacttgg agacagcttc tatgagtatt 1861 tgctgaaggc ctggttaatg tctgacaaga cagatctgga agctaagaag atgtattttg 1921 atgctgttca ggctatcgag actcatttga tccgcaagtc tagcagcgga ctaacttata 1981 tcgcagagtg gaaaaggggc ctcctggagc acaagatggg ccacctgacc tgcttcgcgg 2041 ggggcatgtt cgcactcggg gctgatgcag ctcccgaagg catggcccaa cactaccttg 2101 aactcggggc tgaaattgcc cgtacttgtc atgaatcata taatcgaaca tttatgaaac 2161 tgggaccaga agctttcaga tttgatggtg gtgttgaagc catcgctaca agacaaaatg 2221 aaaaatacta catcttacgg ccagaagtta tggagactta catgtatatg tggagactga 2281 ctcatgatcc aaagtacagg aaatgggcct gggaagccgt agaggccttg gaaaaccatt 2341 gcagagtgaa tggaggctat tcaggcctaa gggatgttta ccttcttcat gagagttatg 2401 atgatgtgca gcagagtttc ttcctggcag agacattgaa atatttgtac ctaatatttt 2461 ctgacgacga tcttcttcca ctggagcatt ggatcttcaa tagcgaggca catcttctcc 2521 ctatcctccc taaagataaa aaggaagttg aaatcagaga ggaataaaaa agacatttat 2581 attttattct gctccattcc cttcactgta taccttaata attccttttc tggtaatcag 2641 gcacatgatg aactttgatt agtaggtctg tgattaagtt cttaaattgt tttgcagtct 2701 tttatgttta ttatcatagg tataggtgga cctaaattcc ttatcatatc tttattaatt 2761 cagccagtgt atccaccagt tttttgttta tgtttttaag taacctatta tctctggatt 2821 tcatgaaggt gtaatatcgt ttttgttaaa ctgaatagaa ttgtatagcg atgacctctt 2881 aattataatt tgatttgact gcaaaacttt ttcctcctct aagaggagat gatgtctgct 2941 ttaagctgta atgttttgcc atgttgcaaa aagccataat aataagtata aaaaagcttt 3001 ttcctttaca atttcatgtt aatctggttt gtctgtccac cagagacaga tcttctgtga 3061 cagcctcctt atgcaggtct atcattattt gatagaatgt cttctaaaat acttcactca 3121 cattgtaatt caaattagaa agtcattcca aaaggtcatg tcatgttgac ctcatttcat 3181 cggaactgca gtatattttt gttggttaat tatattagtg ttttctattt tgaaaaaaaa 3241 aaaaaaaaaa // LOCUS HSHUMS3 833 bp RNA PRI 12-SEP-1993 DEFINITION Human Hums3 mRNA for 40S ribosomal protein s3. ACCESSION X55715 NID g32531 KEYWORDS ribosomal protein; ribosomal protein S3. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 833) AUTHORS Zhang,X.T. TITLE Direct Submission JOURNAL Submitted (29-SEP-1990) Zhang X.T., Institute of Molecular and Cell Biology, National University of Singapore, 10 Kent Ridge Crescent, Singapore 0511 REFERENCE 2 (bases 1 to 833) AUTHORS Zhang,X.T., Tan,Y.M. and Tan,Y.H. TITLE Isolation of a cDNA encoding human 40S ribosomal protein s3 JOURNAL Nucleic Acids Res. 18 (22), 6689 (1990) MEDLINE 91067464 COMMENT Data kindly reviewed (04-DEC-1990) by Zhang X.-t. FEATURES Location/Qualifiers source 1..833 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="epithelium" /cell_line="Wish cell" /clone="p54-2" CDS 23..754 /note="ribosomal protein s3" /codon_start=1 /db_xref="PID:g32532" /db_xref="SWISS-PROT:P23396" /translation="MAVQISKKRKFVADGIFKAELNEFLTRELAEDGYSGVEVRVTPT RTEIIILATRTQNVLGEKGRRIRELTAVVQKRFGFPEGSVELYAEKVATRGLCAIAQA ECLRYKLLGGLAVRRACYGVLRFIMESGAKGCEVVVSGKLRGQRAKSMKFVDGLMIHS GDPVNYYVDTAVRHVLLRQGVLGIKVKIMLPWDPTGKIGPKKPLPDHVSIVEPKDEIL PTTPISEQKGGKPELPAMPQPVPTA" misc_feature 812..817 /note="polyA signal" BASE COUNT 203 a 189 c 250 g 191 t ORIGIN 1 cgttgctgtc ggcggcggca agatggcagt gcaaatatcc aagaagagga agtttgtcgc 61 tgatggcatc ttcaaagctg aactgaatga gtttcttact cgggagctgg ctgaagatgg 121 ctactctgga gttgaggtgc gagttacacc aaccaggaca gaaatcatta tcttagccac 181 cagaacacag aatgttcttg gtgagaaggg ccggcggatt cgggaactga ctgctgtagt 241 tcagaagagg tttggctttc cagagggcag tgtagagctt tatgctgaaa aggtggccac 301 tagaggtctg tgtgccattg cccaggcaga gtgtctgcgt tacaaactcc taggagggct 361 tgctgtgcgg agggcctgct atggtgtgct gcggttcatc atggagagtg gggccaaagg 421 ctgcgaggtt gtggtgtctg ggaaactccg aggacagagg gctaaatcca tgaagtttgt 481 ggatggcctg atgatccaca gtggagaccc tgttaactac tacgttgaca ctgctgtgcg 541 ccacgtgttg ctcagacagg gtgtgctggg catcaaggtg aagatcatgc tgccctggga 601 cccaactggt aagattggcc ctaagaagcc cctgcctgac cacgtgagca ttgtggaacc 661 taaagatgag atactgccca ccacccccat ctcagaacag aagggtggga agccagagct 721 gcctgccatg ccccagccag tccccacagc ataacagggt ctccttggca gctgcattct 781 ggagtctgga tgttgctctc taaagaactt taataaaatt ttgtacaaaa gac // LOCUS HSHUPROBX 1704 bp RNA PRI 28-JAN-1993 DEFINITION H.sapiens mRNA for proline rich homeobox (Prh) protein. ACCESSION X67235 NID g32547 KEYWORDS homeobox gene; Prh gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1704) AUTHORS Manfioletti,G. TITLE Direct Submission JOURNAL Submitted (09-JUL-1992) G. Manfioletti, Dip. Biochimica, Biofisica e Chimica delle Macromolecole, via Valerio 38, 34100 Trieste, ITALY REFERENCE 2 (bases 1 to 1704) AUTHORS Crompton,M.R., Bartlett,T.J., MacGregor,A.D., Manfioletti,G., Buratti,E., Giancotti,V. and Goodwin,G.H. TITLE Identification of a novel vertebrate homeobox gene expressed in haematopoietic cells JOURNAL Nucleic Acids Res. 20 (21), 5661-5667 (1992) MEDLINE 93087175 FEATURES Location/Qualifiers source 1..1704 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG1" gene 8..820 /gene="huprobox" CDS 8..820 /gene="huprobox" /codon_start=1 /db_xref="PID:g32548" /db_xref="SWISS-PROT:Q03014" /translation="MQYPHPGPAAGAVGVPLYAPTPLLQPAHPTPFYIEDILGRGPAA PTPAPTLPSPNSSFTSLVSPYRTPVYEPTPIHPAFSHHSAAALAAAYGPGGFGGPLYP FPRTVNDYTHALLRHDPLGKPLLWSPFLQRPLHKRKGGQVRFSNDQTIELEKKFETQK YLSPPERKRLAKMLQLSERQVKTWFQNRRAKWRRLKQENPQSNKKEELESLDSSCDQR QDLPSEQNKGASLDSSQCSPSPASQEDLESEISEDSDQEVDIEGDKSYFNAG" misc_feature 413..595 /gene="huprobox" /note="homeobox" polyA_signal 1667..1672 BASE COUNT 483 a 411 c 372 g 438 t ORIGIN 1 cggagccatg cagtacccgc accccgggcc ggcggcgggc gccgtggggg tgccgctgta 61 cgcgcccacg ccgctgctgc aacccgcaca cccgacgccc ttttacatcg aggacatcct 121 gggccgcggg cccgccgcgc ccacgcccgc ccccacgctg ccgtccccca actcctcctt 181 caccagcctc gtgtccccct accggacccc ggtgtacgag cccacgccga tccatccagc 241 cttctcgcac cactccgccg ccgcgctggc cgctgcctac ggacccggcg gcttcggggg 301 ccctctgtac cccttcccgc ggacggtgaa cgactacacg cacgccctgc tccgccacga 361 ccccctgggc aaacctctac tctggagccc cttcttgcag aggcctctgc ataaaaggaa 421 aggcggccag gtgagattct ccaacgacca gaccatcgag ctggagaaga aattcgagac 481 gcagaaatat ctctctccgc ccgagaggaa gcgtctggcc aagatgctgc agctcagcga 541 gagacaggtc aaaacctggt ttcagaatcg acgcgctaaa tggaggagac taaaacagga 601 gaaccctcaa agcaataaaa aagaagaact ggaaagtttg gacagttcct gtgatcagag 661 gcaagatttg cccagtgaac agaataaagg tgcttctttg gatagctctc aatgttcgcc 721 ctcccctgcc tcccaggaag accttgaatc agagatttca gaggattctg atcaggaagt 781 ggacattgag ggcgataaaa gctattttaa tgctggatga tgaccactgg cattggcatg 841 ttcagaaaac tggatttagg aataatgttt tgctacagaa aatcttcata gaagaactgg 901 aaggctatat aagaaaggga atcaattctc tggtattctg gaaacctaaa aatatttggt 961 gcactgctca attaacaaac ctacatggag accttaattt tgacttaaca aatagtttat 1021 gtactgctct taggttgttt tgataaagtg acattatagt gattaaattc ttcccccttt 1081 aaaaaaacag ttagtggttt tcactattta taaaaaatta attttgaact ttttgttaaa 1141 tttttaagtt atagctttaa aggttttaat aggaccttct tgaacgactt ttctgtaatc 1201 tgtttatctc ccacttaatg gaaaggcaaa ggggtacccc aaatccagag ctgcctacat 1261 ttcaggcagc cttggagtat tttaaaagga aaacattctt tacttttata tgacattctt 1321 atactgctgt ctcaaatcca aaaacatttc agagctcttg tctcagagat gtgtgttctt 1381 tttgtcagag atatggttga tgagaatctt aaatgcttgt tttgcactat cacttagtac 1441 ctgtttgacc aaggtgttaa ggggatagta cctcccaatt caagcagaga aactgacctg 1501 actaaagtta atcgcagatg aactagaagt cacaggttaa ttaaatgtaa gtagattgta 1561 gatactgttt tatatcaaac aatgtttata atgtgtatat agaattgttc actgtaaaaa 1621 aaatggccaa aatgtgtttt ttttttaata agtaacttga ctataaaata aagccgtccg 1681 tgggacgact gacaaaaaaa aaaa // LOCUS HSHVLCAD 2219 bp RNA PRI 29-MAR-1996 DEFINITION H.sapiens HVLCAD gene. ACCESSION X86556 NID g790446 KEYWORDS acyl-CoA dehydrogenase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2219) AUTHORS Andresen,B.S., Bross,P., Vianey-Saban,C., Divry,P., Zabot,M.T., Roe,C.R., Nada,M.A., Byskov,A., Kruse,T.A., Neve,S., Kristiansen,K., Knudsen,I., Corydon,M.J. and Gregersen,N. TITLE Cloning and characterization of human very-long-chain acyl-CoA dehydrogenase cDNA, chromosomal assignment of the gene and identification in four patients of nine different mutations within the VLCAD gene JOURNAL Hum. Mol. Genet. 5 (4), 461-472 (1996) MEDLINE 96254975 REMARK Erratum:[[published erratum appears in Hum Mol Genet 1996 Sep;5(9):1390]] REFERENCE 2 (bases 1 to 2219) AUTHORS Andresen,B.S. TITLE Direct Submission JOURNAL Submitted (24-APR-1995) B.S. Andresen, Center for Medical Molecular Biology, Skejby Sygehus, Brendstrupgaardvej, DK-8200 Aarhus N, DENMARK FEATURES Location/Qualifiers source 1..2219 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /chromosome="17" gene 86..2053 /gene="HVLCAD" CDS 86..2053 /gene="HVLCAD" /codon_start=1 /product="very-long-chain acyl-CoA dehydrogenase" /db_xref="PID:g790447" /translation="MQAARMAASLGRQLLRLGGGSSRLTALLGQPRPGPARRPYAGGA AQLALDKSDSHPSDALTRKKPAKAESKSFAVGMFKGQLTTDQVFPYPSVLNEEQTQFL KELVEPVSRFFEEVNDPAKNDALEMVEETTWQGLKELGAFGLQVPSELGGVGLCNTQY ARLVEIVGMHDLGVGITLGAHQSIGFKGILLFGTKAQKEKYLPKLASGETVAAFCLTE PSSGSDAASIRTSAVPSPCGKYYTLNGSKLWISNGGLADIFTVFAKTPVTDPATGAVK EKITAFVVERGFGGITHGPPEKKMGIKASNTAEVFFDGVRVPSENVLGEVGSGFKVAM HILNNGRFGMAAALAGTMRGIIAKAVDHATNRTQFGEKIHNFGLIQEKLARMVMLQYV TESMAYMVSANMDQGATDFQIEAAISKIFGSEAAWKVTDECIQIMGGMGFMKEPGVER VLRDLRIFRIFEGTNDILRLFVALQGCMDKGKELSGLGSALKNPFGNAGLLLGEAGKQ LRRRAGLGSGLSLSGLVHPELSRSGELAVRALEQFATVVEAKLIKHKKGIVNEQFLLQ RLADGAIDLYAMVVVLSRASRSLSEGHPTAQHEKMLCDTWCIEAAARIREGMAALQSD PWQQELYRNFKSISKALVERGGVVTSNPLGF" polyA_signal 2201..2206 BASE COUNT 481 a 585 c 705 g 448 t ORIGIN 1 aggacgtggg cgtgcaggac gcgggcgtgc aggacgccag agctgggtca gagctcgagc 61 cagcggcgcc cggagagatt cggagatgca ggcggctcgg atggccgcga gcttggggcg 121 gcagctgctg aggctcgggg gcggaagctc gcggctcacg gcgctcctgg ggcagccccg 181 gcccggccct gcccggcggc cctatgccgg gggtgccgct cagctggctc tggacaagtc 241 agattcccac ccctctgacg ctctgaccag gaaaaaaccg gccaaggcgg aatctaagtc 301 ctttgctgtg ggaatgttca aaggccagct caccacagat caggtgttcc catacccgtc 361 cgtgctcaac gaagagcaga cacagtttct taaagagctg gtggagcctg tgtcccgttt 421 cttcgaggaa gtgaacgatc ccgccaagaa tgacgctctg gagatggtgg aggagaccac 481 ttggcagggc ctcaaggagc tgggggcctt tggtctgcaa gtgcccagtg agctgggtgg 541 tgtgggcctt tgcaacaccc agtacgcccg tttggtggag atcgtgggca tgcatgacct 601 tggcgtgggc attaccctgg gggcccatca gagcatcggt ttcaaaggca tcctgctctt 661 tggcacaaag gcccagaaag aaaaatacct ccccaagctg gcatctgggg agactgtggc 721 cgctttctgt ctaaccgagc cctcaagcgg gtcagatgca gcctccatcc gaacctctgc 781 tgtgcccagc ccctgtggaa aatactatac cctcaatgga agcaagcttt ggatcagtaa 841 tgggggccta gcagacatct tcacggtctt tgccaagaca ccagttacag atccagccac 901 aggagccgtg aaggagaaga tcacagcttt tgtggtggag aggggcttcg ggggcattac 961 ccatgggccc cctgagaaga agatgggcat caaggcttca aacacagcag aggtgttctt 1021 tgatggagta cgggtgccat cggagaacgt gctgggtgag gttgggagtg gcttcaaggt 1081 tgccatgcac atcctcaaca atggaaggtt tggcatggct gcggccctgg caggtaccat 1141 gagaggcatc attgctaagg cggtagatca tgccactaat cgtacccagt ttggggagaa 1201 aattcacaac tttgggctga tccaggagaa gctggcacgg atggttatgc tgcagtatgt 1261 aactgagtcc atggcttaca tggtgagtgc taacatggac cagggagcca cggacttcca 1321 gatagaggcc gccatcagca aaatctttgg ctcggaggca gcctggaagg tgacagatga 1381 atgcatccaa atcatggggg gtatgggctt catgaaggaa cctggagtag agcgtgtgct 1441 ccgagatctt cgcatcttcc ggatctttga ggggacaaat gacattcttc ggctgtttgt 1501 ggctctgcag ggctgtatgg acaaaggaaa ggagctctct gggcttggca gtgctctaaa 1561 gaatcccttt gggaatgctg gcctcctgct aggagaggca ggcaaacagc tgaggcggcg 1621 ggcagggctg ggcagcggcc tgagtctcag cggacttgtc cacccggagt tgagtcggag 1681 tggcgagctg gcagtacggg ctctggagca gtttgccact gtggtggagg ccaagctgat 1741 aaaacacaag aaggggattg tcaatgaaca gtttctgctg cagcggctgg cagacggggc 1801 catcgacctc tatgccatgg tggtggttct ctcgagggcc tcaagatccc tgagtgaggg 1861 ccaccccacg gcccagcatg agaaaatgct ctgtgacacc tggtgtatcg aggctgcagc 1921 tcggatccga gagggcatgg ccgccctgca gtctgacccc tggcagcaag agctctaccg 1981 caacttcaaa agcatctcca aggccttggt ggagcggggt ggtgtggtca ccagcaaccc 2041 acttggcttc tgaatactcc cggccagggc ctgtcccagt tatgtgcctt ccctcaagcc 2101 aaagccgaag cccctttcct taaggccctg gtttgtcccg aaggggccta gtgttcccag 2161 cactgtgcct gctctcaaga gcacttactg cctcgcaaat aataaaaatt tctagccag // LOCUS HSHYACDO 1247 bp RNA PRI 18-OCT-1995 DEFINITION H.sapiens mRNA for 3-hydroxyanthranilic acid dioxygenase. ACCESSION Z29481 NID g443918 KEYWORDS 3-HAO; 3-hydroxyanthranilic acid dioxygenase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1247) AUTHORS Malherbe,P., Kohler,C., Da Prada,M., Lang,G., Kiefer,V., Schwarcz,R., Lahm,H.W. and Cesura,A.M. TITLE Molecular cloning and functional expression of human 3-hydroxyanthranilic-acid dioxygenase JOURNAL J. Biol. Chem. 269 (19), 13792-13797 (1994) MEDLINE 94245687 REFERENCE 2 (bases 1 to 1247) AUTHORS Malherbe,P. TITLE Direct Submission JOURNAL Submitted (10-JAN-1994) Pari Malherbe, Preclinical Research,PRPN, F. Hoffmann-La Roche, Ltd,Pharma Division,, Basel, CH-4002, Switzerland FEATURES Location/Qualifiers source 1..1247 /organism="Homo sapiens" /strain="Human" /db_xref="taxon:9606" /clone="hHAOF9" /cell_line="human hepatoma cell line,Hep G2,ATCC HB 8065" /clone_lib="Hep G2 Lamda gt11 cDNA library" CDS 41..901 /codon_start=1 /product="3-hydroxyanthranilic acid dioxygenase" /db_xref="PID:g443919" /db_xref="SWISS-PROT:P46952" /translation="MERRLGVRAWVKENRGSFQPPVCNKLMHQEQLKVMFVGGPNTRK DYHIEEGEEVFYQLEGDMVLRVLEQGKHRDVVIRQGEIFLLPARVPHSPQRFANTVGL VVERRRLETELDGLRYYVGDTMDVLFEKWFYCKDLGTQLAPIIQEFFSSEQYRTGKPI PDQLLKEPPFPLSTRSIMEPMSLDAWLDSHHRELQAGTPLSLFGDTYETQVIAYGQGS SEGLRQNVDVWLWQLEGSSVVTMGGRRLSLAPDDSLLVLAGTSYAWERTQGSVALSVT QDPACKKPLG" polyA_signal 1225..1230 BASE COUNT 254 a 399 c 369 g 225 t ORIGIN 1 cgcgggagga cagcgctgcg aggaggcgcc cgggacagtc atggagcgcc gcctgggagt 61 gagggcctgg gtgaaggaga accggggctc cttccagccc ccggtctgca acaagctcat 121 gcaccaggag cagctcaaag tcatgttcgt cggaggcccc aacaccagga aggactatca 181 catcgaagag ggtgaagagg tattttacca gctggaggga gacatggttc tccgagtcct 241 ggagcaaggg aaacaccggg atgtggtcat tcggcaggga gagatattcc tcctgcctgc 301 cagggtgccc cactcaccac agaggtttgc caacaccgtg gggctggtgg ttgagcgaag 361 gcggctggag accgagctag atgggctcag gtactatgtg ggcgacacca tggacgttct 421 gtttgagaag tggttctact gcaaggacct cggcacgcag ttggccccca tcatccagga 481 gttcttcagc tctgagcagt acagaacagg aaagcccatc cctgaccagc tgctcaagga 541 gccaccattc cctctgagca cacgatccat catggagccc atgtccctgg atgcctggct 601 ggacagccac cacagggagc tgcaggcagg cacaccactc agcctgtttg gggacaccta 661 tgagacccag gtgatcgcct atgggcaagg cagcagcgaa ggcctgagac agaatgtgga 721 cgtgtggctg tggcagctgg agggctcctc ggtggtgaca atggggggac ggcgcctgag 781 cctggcccct gatgacagcc tcctggtgct agctgggacc tcgtatgcct gggagcgaac 841 acaaggctct gtggccctgt ctgtgaccca ggaccctgcc tgcaagaagc ccctggggtg 901 accctcttgc catggcctga agcagccaca ggttggccaa gcaccctcga gtgccatccc 961 tgccaaacaa ctctcccagc ccccactacc tctctgtgta ctgccgctgt gtcccccaca 1021 gacctgcaca ttgttgtcac ccaccctcct gcccttctca gcccagatgc catgccctgg 1081 gcgggcagca gctccccatc ttctctggca gactcagccc actgccttgc cagtcttgcc 1141 aggtggtcta cccccggccc cgctcctgcc cattcctctg tccctgcaga ctcagtgcag 1201 cacttccaca ccaagaaggc cctcaataaa ggcttcctga ggaacgc // LOCUS HSHYLTK 1968 bp RNA PRI 10-OCT-1994 DEFINITION H.sapiens HYL tyrosine kinase mRNA. ACCESSION X77278 NID g471312 KEYWORDS HYLTK gene; nonreceptor protein tyrosine kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1968) AUTHORS Sakano,S., Iwama,A., Inazawa,J., Ariyama,T., Ohno,M. and Suda,T. TITLE Molecular cloning of a novel non-receptor tyrosine kinase, HYL (hematopoietic consensus tyrosine-lacking kinase) JOURNAL Oncogene 9 (4), 1155-1161 (1994) MEDLINE 94181267 REFERENCE 2 (bases 1 to 1968) AUTHORS Iwama,A. TITLE Direct Submission JOURNAL Submitted (14-JAN-1994) A. Iwama, Dept of Cell Differentiation, Inst of Mol Embryology & Genetics, Kumamoto University School of Medicine, 2-2-1 Honjo, Kumamoto 860, JAPAN FEATURES Location/Qualifiers source 1..1968 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="hematopoietic cell" /clone_lib="UT-7 cDNA library" /chromosome="19p13" mRNA 1..1968 /gene="HHYLTK" gene 1..1968 /gene="HHYLTK" CDS 208..1731 /gene="HHYLTK" /codon_start=1 /product="HYL tyrosine kinase" /db_xref="PID:g557272" /db_xref="SWISS-PROT:P42679" /translation="MAGRGSLVSWRAFHGCDSAEELPRVSPRFLRAWHPPPVSARMPT RRWAPGTQCITKCEHTRPKPGELAFRKGDVVTILEACENKSWYRVKHHTSGQEGLLAA GALREREALSADPKLSLMPWFHGKISGQEAVQQLQPPEDGLFLVRESARHPGDYVLCV SFGRDVIHYRVLHRDGHLTIDEAVFFCNLMDMVEHYSKDKGAICTKLVRPKRKHGTKS AEEELARAGWLLNLQHLTLGAQIGEGEFGAVLQGEYLGQKVAVKNIKCDVTAQAFLDE TAVMTKMQHENLVRLLGVILHQGLYIVMEHVSKGNLVNFLRTRGRALVNTAQLLQFSL HVAEGMEYLESKKLVHRDLAARNILVSEDLVAKVSDFGLAKAERKGLDSSRLPVKWTA PEALKHGKFTSKSDVWSFGVLLWEVFSYGRAPYPKMSLKEVSEAVEKGYRMEPPEGCP GPVHVLMSSCWEAEPARRPPFRKLAEKLARELRSAGAPASVSGQDADGSTSPRSQEP" BASE COUNT 391 a 614 c 652 g 311 t ORIGIN 1 cggaggccct cctgggggcg ggcgcggggc gcggctcggg ggcgccccct gagcagaaaa 61 caggaagaac caggctcggt ccagtggcac ccagctccct acctcctgtg ccagccgact 121 ggcctgtggc aggccattcc cagcgtcccc gactgtgacc acttgctcag tgtgcctctc 181 acctgcctca gtttccctct gggggcgatg gcggggcgag gctctctggt ttcctggcgg 241 gcatttcacg gctgtgattc tgctgaggaa cttccccggg tgagcccccg cttcctccga 301 gcctggcacc cccctcccgt ctcagccagg atgccaacga ggcgctgggc cccgggcacc 361 cagtgtatca ccaaatgcga gcacacccgc cccaagccag gggagctggc cttccgcaag 421 ggcgacgtgg tcaccatcct ggaggcctgc gagaacaaga gctggtaccg cgtcaagcac 481 cacaccagtg gacaggaggg gctgctggca gctggggcgc tgcgggagcg ggaggccctc 541 tccgcagacc ccaagctcag cctcatgccg tggttccacg ggaagatctc gggccaggag 601 gctgtccagc agctgcagcc tcccgaggat gggctgttcc tggtgcggga gtccgcgcgc 661 caccccggcg actacgtcct gtgcgtgagc tttggccgcg acgtcatcca ctaccgcgtg 721 ctgcaccgcg acggccacct cacaatcgat gaggccgtgt tcttctgcaa cctcatggac 781 atggtggagc attacagcaa ggacaagggc gctatctgca ccaagctggt gagaccaaag 841 cggaaacacg ggaccaagtc ggccgaggag gagctggcca gggcgggctg gttactgaac 901 ctgcagcatt tgacattggg agcacagatc ggagagggag agtttggagc tgtcctgcag 961 ggtgagtacc tggggcaaaa ggtggccgtg aagaatatca agtgtgatgt gacagcccag 1021 gccttcctgg acgagacggc cgtcatgacg aagatgcaac acgagaacct ggtgcgtctc 1081 ctgggcgtga tcctgcacca ggggctgtac attgtcatgg agcacgtgag caagggcaac 1141 ctggtgaact ttctgcggac ccggggtcga gccctcgtga acaccgctca gctcctgcag 1201 ttttctctgc acgtggccga gggcatggag tacctggaga gcaagaagct tgtgcaccgc 1261 gacctggccg cccgcaacat cctggtctca gaggacctgg tggccaaggt cagcgacttt 1321 ggcctggcca aagccgagcg gaaggggcta gactcaagcc ggctgcccgt caagtggacg 1381 gcgcccgagg ctctcaaaca cgggaagttc accagcaagt cggatgtctg gagttttggg 1441 gtgctgctct gggaggtctt ctcatatgga cgggctccgt accctaaaat gtcactgaaa 1501 gaggtgtcgg aggccgtgga gaaggggtac cgcatggaac cccccgaggg ctgtccaggc 1561 cccgtgcacg tcctcatgag cagctgctgg gaggcagagc ccgcccgccg gccacccttc 1621 cgcaaactgg ccgagaagct ggcccgggag ctacgcagtg caggtgcccc agcctccgtc 1681 tcagggcagg acgccgacgg ctccacctcg ccccgaagcc aggagccctg accccacccg 1741 gtggcccttg gccccagagg accgagagag tggagagtgc ggcgtggggg cactgaccag 1801 gcccaaggag ggtccaggcg ggcaagtcat cctcctggtg cccacagcag gggctggccc 1861 acgtaggggg ctctgggcgg cccgtggaca ccccagacct gcgaaggatg atcgcccgat 1921 aaagacggat tctaaggact ctaaaaaaaa aaaaaaaaaa aaaaaaaa // LOCUS HSHZF10 1970 bp RNA PRI 15-MAY-1995 DEFINITION H.sapiens HZF10 mRNA for zinc finger protein. ACCESSION X78933 NID g498720 KEYWORDS zinc finger protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1970) AUTHORS Abrink,M., Aveskogh,M. and Hellman,L. TITLE Isolation of cDNA clones for 42 different Kruppel-related zinc finger proteins expressed in the human monoblast cell line U-937 JOURNAL DNA Cell Biol. 14 (2), 125-136 (1995) MEDLINE 95169271 REFERENCE 2 (bases 1 to 1970) AUTHORS Abrink,M. TITLE Direct Submission JOURNAL Submitted (01-JUN-1994) M. Abrink, Department of Immunology, University of Uppsala, The Biomedical Centre Box 582, 751 23 Uppsala, SWEDEN FEATURES Location/Qualifiers source 1..1970 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="U-937" /clone_lib="Clontech 1029a" gene 200..1666 /gene="HZF10" CDS 200..1666 /gene="HZF10" /codon_start=1 /product="zinc finger protein" /db_xref="PID:g498721" /translation="MENLTKHSIECSSFRGDWECKNQFERKQGSQEGHFSEMIFTPED MPTFSIQHQRIHTDEKLLECKECGKDFSFVSVLVRHQRIHTGEKPYECKECGKAFGSG ANLAYHQRIHTGEKPFECKECGKAFGSGSNLTHHQRIHTGEKPYECKECGKAFSFGSG LIRHQIIHSGEKPYECKECGKSFSFESALIRHHRIHTGEKPYECIDCGKAFGSGSNLT QHRRIHTGEKPYECKACGMAFSSGSALTRHQRIHTGEKPYICNECGKAFSFGSALTRH QRIHTGEKPYVCKECGKAFNSGSDLTQHQRIHTGEKPYECKECEKAFRSGSKLIQHQR MHTGEKPYECKECGKTFSSGSDLTQHHRIHTGEKPYECKECGKAFGSGSKLIQHQLIH TGERPYECKECGKSFSSGSALNRHQRIHTGEKPYECKECGKAFYSGSSLTQHQRIHTG EKLYECKNCGKAYGRDSEFQQHKKSHNGKKLCELETIN" misc_feature 389..1627 /gene="HZF10" /note="zinc finger domain" BASE COUNT 651 a 331 c 448 g 540 t ORIGIN 1 ggccttgggg ctttgtgaga tagctagggt cgacttgtgt ggattctagt agaacggagc 61 tgaccctatc cgaacaggcc cctgcctgct tggatagagg cctccgaaca ggagtaaaga 121 atggctgttg aacatccaca aggcacctgc aagactatga atcaaagttg agaccaagaa 181 attatttctg aaaaaggata tggaaaacct tacaaaacac agcattgagt gttcaagttt 241 cagaggtgat tgggaatgta aaaaccagtt tgagagaaaa cagggatctc aggaaggaca 301 tttcagtgaa atgatattta ctcctgaaga catgcccact ttcagtatcc agcatcagag 361 aattcatact gatgagaaac tccttgaatg taaggaatgt gggaaggatt ttagttttgt 421 atcagtcctt gttcgacatc agcgaattca tactggtgag aaaccttatg aatgcaaaga 481 atgtggcaag gcctttggta gtggtgcaaa ccttgcttac catcaaagaa ttcatactgg 541 tgagaagcct tttgaatgta aagaatgtgg gaaggccttt ggtagtggct caaaccttac 601 tcaccatcag agaattcata ctggtgagaa accctatgag tgtaaggaat gtgggaaagc 661 ctttagtttt ggatcaggcc ttatacgaca tcagatcatt cacagtggag agaagcctta 721 tgagtgtaag gaatgtggga agtcctttag ttttgaatca gcccttattc ggcatcacag 781 aattcacaca ggtgagaaac cttatgaatg tatagattgt ggtaaagcct ttggcagtgg 841 ttcaaacctt actcaacatc ggcggattca tactggtgag aaaccttatg aatgcaaagc 901 atgtggaatg gcctttagca gtggttcggc tcttactcgg catcagagaa ttcataccgg 961 tgagaaacca tatatatgta atgaatgtgg taaggccttt agttttggat cagcccttac 1021 tcgacatcaa agaattcata ctggtgagaa accttatgta tgtaaggaat gtgggaaggc 1081 ttttaatagt ggctcagatc tcactcagca tcagagaatt cacactggtg agaaacccta 1141 tgagtgtaag gagtgtgaga aagcctttag aagtggttca aaacttattc agcatcaaag 1201 aatgcatact ggagagaaac cttatgaatg taaggaatgt gggaagacct ttagtagtgg 1261 ttcagacctt actcaacatc acagaattca tactggtgag aaaccctatg aatgtaagga 1321 atgtgggaag gcctttggta gtggctcaaa acttatccaa caccagctaa tccatactgg 1381 tgaaagaccc tatgaatgta aagaatgtgg aaagtccttt agtagtggtt cagctcttaa 1441 tcggcaccag agaatacaca ctggtgagaa accctatgaa tgtaaggagt gtgggaaggc 1501 tttttatagt ggctcaagcc ttactcagca tcagagaatt catacaggtg agaaacttta 1561 tgaatgtaag aactgtggga aggcttatgg gagggattca gagtttcagc aacataagaa 1621 aagtcataat ggtaagaaac tctgcgaatt ggaaactata aattgaaatt atgtgctgaa 1681 ggaaggactc taaacatatg acttaagaaa attcatagtg gtgaaaatct ctacaaatag 1741 aactaaggta caaatgcctt acttatgctt cacaggttag tcagtctaag aatatttata 1801 caggaaaaaa atcaccccaa ataaaataaa tatttgaaga tccttatcta tattcattcc 1861 ttcattactt ttggaaaatt cttacttgtg aatgttaaaa atgaaaaaaa aatcatttat 1921 tatattttgc ctcaacttta aacattggaa aactcatttc tgggttaatc // LOCUS HSI12SRN 1163 bp RNA PRI 12-SEP-1993 DEFINITION Human 12S RNA induced by poly(rI), poly(rC) and Newcastle disease virus. ACCESSION X13956 NID g32574 KEYWORDS Alu repetitive sequence; induced 12S RNA; induced RNA. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1163) AUTHORS Lammers,R., Gross,G., Mayr,U. and Collins,J. TITLE Alternative mechanisms for gene activation induced by poly(rI).poly(rC) and Newcastle disease virus JOURNAL Eur. J. Biochem. 178 (1), 93-99 (1988) MEDLINE 89078418 COMMENT Data kindly reviewed (22-DEC-1989) by Gross G. FEATURES Location/Qualifiers source 1..1163 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="diploid fibroblast" /cell_line="FS-4" /clone="pG4" precursor_RNA <1..1163 /note="induced 12S RNA" repeat_region 15..135 /note="Alu-like repeat" CDS 163..411 /note="9kD protein (AA 1-82)" /codon_start=1 /db_xref="PID:g32575" /db_xref="SWISS-PROT:P13994" /translation="MEPPSPSPTHLSCIFFLLITVSPLEASSTRARVFPCLPLYAECP EQSLAQGKEKSHPGGGGERPGLAGQGEPDHPAGARDGR" misc_feature 1088..1095 /note="pot.polyA signal" misc_feature 1136..1141 /note="pot.polyA signal" polyA_site 1163 /note="polyA site" BASE COUNT 258 a 396 c 319 g 190 t ORIGIN 1 accccccccc cccctgcttg ggaggctgag gcaggagaat ggcgtgaacc tgggaggtgg 61 agtttgcagt gagctgagat cgtgccactg cactccagcc tggccgacag agcaagactc 121 ttgtctcaaa aaaatataga attaaactaa attaaaaaat aaatggaacc acccagccca 181 tcccctaccc acctgtcctg catttttttt ctccttatca ctgtctcacc actagaggcc 241 agctccacca gggcaagggt ttttccctgc ctgccactgt atgccgagtg tccagaacaa 301 agcctagcac aaggaaaaga aaaaagccat ccaggaggag gaggagagag accaggcctt 361 gcaggccaag gcgagcctga ccatcccgct ggtgcccgag acggaagatg accgcaagct 421 ggcggctctg ctgaagttcc acaccctgga ctcctacgag gacaagcaga aactcaagcg 481 gaccgagatc atcagccgct cctggttccc ctctgccccc ggatccgcct ccagcaagca 541 aggtcagcgg cgtcctgaag aagctggcac agagccgcag aaccgcgctt gccacctccc 601 ccatcaccgt cggggacctg ggcatcgtgc ggcggaggtc tcgggacgtc ccggagagcc 661 cccagcatgc ggccgacacc cccaagtctg gggaaccgcg ggtaccagag gaggctgccc 721 aggaccggcc catgtccccc ggagactgtc ctccggaaac aactgagacc cccaagtgca 781 gcagcccgag ggggcaggaa gggagccgtc aggacaagcc cctgtcgcca gcaggctcct 841 cccaggaggc agctgacacc cccgacacgc ggccacccct gcagtctcgg ctcctccctc 901 gtggcggact actccgactc ggagagtgag tgagcgatcc ccatcctgga gactggaccc 961 gctctagagg cccggacaca cccaggaggc ccctcacaga ctgcagaccc ccggctcgcc 1021 caccagccct gggagagctc agatgccgca tcctccccag accgcgcctt cctgcaaccg 1081 tggagttatt tatttggtcc tggtgagggt gtttgtgcct tgtgagactc cgtacattaa 1141 agacctgtct cttcttccct gtc // LOCUS HSI15PGN1 584 bp RNA PRI 14-MAR-1996 DEFINITION H.sapiens mRNA for I-15P (I-BABP) protein. ACCESSION X90908 NID g971462 KEYWORDS 15 kDa protein; bile acid-binding protein; I-15P (I-BABP) gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 584) AUTHORS Fujita,M., Fujii,H., Kanda,T., Sato,E., Hatakeyama,K. and Ono,T. TITLE Molecular cloning, expression, and characterization of a human intestinal 15-kDa protein JOURNAL Eur. J. Biochem. 233 (2), 406-413 (1995) MEDLINE 96067678 REFERENCE 2 (bases 1 to 584) AUTHORS Fujii,H. TITLE Direct Submission JOURNAL Submitted (11-AUG-1995) H. Fujii, Dept. of Biochemistry, Asahimachidori 1-757, Niigata 951, JAPAN FEATURES Location/Qualifiers source 1..584 /organism="Homo sapiens" /isolate="patient #1" /db_xref="taxon:9606" /dev_stage="adult" /sex="male" /rearranged /tissue_type="ileum" /cell_type="epithelial" /clone="pI15P-1" gene 129..515 /gene="I-15P (I-BABP)" CDS 129..515 /gene="I-15P (I-BABP)" /codon_start=1 /product="15kDa protein (I-15P), bile acid-binding protein (I-BABP)" /db_xref="PID:g975660" /translation="MAFTGKFEMESEKNYDEFMKLLGISSDVIEKARNFKIVTEVQQD GQDFTWSQHYSGGHTMTNKFTVGKESNIQTMGGKTFKATVQMEGGKLVVNFPNYHQTS EIVGDKLVEVSTIGGVTYERVSKRLA" BASE COUNT 165 a 151 c 165 g 103 t ORIGIN 1 gaagaagtgg ggtgacttag gggctgagcc tcagcaactg ggagagttta taagctggga 61 tagcagaccc ctcagcacca cccattctcc tcatccctct gctctctggc ctccagcctc 121 ccagcagcat ggctttcacc ggcaagttcg agatggagag tgagaagaat tatgatgagt 181 tcatgaagct ccttgggatc tccagcgatg taatcgaaaa ggcccgcaac ttcaagatcg 241 tcacggaggt gcagcaggat gggcaggact tcacttggtc ccagcactac tccgggggcc 301 acaccatgac caacaagttc actgttggca aggaaagcaa catacagaca atggggggca 361 agacgttcaa ggccactgtg cagatggagg gcgggaagct ggtggtgaat ttccccaact 421 atcaccagac ctcagagatc gtgggtgaca agctggtgga ggtctccacc atcggaggcg 481 tgacctatga gcgcgtgagc aagagactgg cctaagcagc caggcccggc ccagggagct 541 acaaacccac caataaaact gatataagga caaaaaaaaa aaaa // LOCUS HSI6REC 1486 bp RNA PRI 06-AUG-1992 DEFINITION Human mRNA for interleukin-6-receptor. ACCESSION X58298 NID g32580 KEYWORDS cell surface receptor; interleukin 6 receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1486) AUTHORS Schooltink,H. TITLE Direct Submission JOURNAL Submitted (12-MAR-1991) H. Schooltink, Dept of Biochemistry, RWTH Aachen Klinikum, Pauwelsstrasse 30, 5100 Aachen, Germany REFERENCE 2 (bases 1 to 1486) AUTHORS Schooltink,H., Stoyan,T., Lenz,D., Schmitz,H., Hirano,T., Kishimoto,T., Heinrich,P.C. and Rose-John,S. TITLE Structural and functional studies on the human hepatic interleukin-6 receptor. Molecular cloning and overexpression in HepG2 cells JOURNAL Biochem. J. 277 (Pt 3), 659-664 (1991) MEDLINE 91336983 REFERENCE 3 (bases 1 to 1486) AUTHORS Krause,E., Wegenka,U., Moller,C., Horn,F. and Heinrich,P.C. TITLE Gene expression of the high molecular weight proteinase inhibitor alpha 2-macroglobulin JOURNAL Biol. Chem. Hoppe-Seyler 373 (7), 509-515 (1992) MEDLINE 92384960 COMMENT see also: X12830. FEATURES Location/Qualifiers source 1..1486 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /cell_type="hepatoma" /cell_line="HEP G 2" /clone_lib="lambda gt 10" /clone="7" 5'UTR 1..51 CDS 52..1458 /codon_start=1 /product="interleukin-6-receptor" /db_xref="PID:g32581" /db_xref="SWISS-PROT:P08887" /translation="MLAVGCALLAALLAAPGAALAPRRCPAQEVARGVLTSLPGDSVT LTCPGVEPEDNATVHWVLRKPAAGSHPSRWAGMGRRLLLRSVQLHDSGNYSCYRAGRP AGTVHLLVDVPPEEPQLSCFRKSPLSNVVCEWGPRSTPSLTTKAVLLVRKFQNSPAED FQEPCQYSQESQKFSCQLAVPEGDSSFYIVSMCVASSVGSKFSKTQTFQGCGILQPDP PANITVTAVARNPRWLSVTWQDPHSWNSSFYRLRFELRYRAERSKTFTTWMVKDLQHH CVIHDAWSGLRHVVQLRAQEEFGQGEWSEWSPEAMGTPWTESRSPPAENEVSTPMQAL TTNKDDDNILFRDSANATSLPVQDSSSVPLPTFLVAGGSLAFGTLLCIAIVLRFKKTW KLRALKEGKTSMHPPYSLGQLVPERPRPTPVLVPLISPPVSPSSLGSDNTSSHNRPDA RDPRSPYDISNTDYFFPR" sig_peptide 52..108 mat_peptide 109..1455 /product="interleukin-6-receptor" 3'UTR 1456..1486 BASE COUNT 305 a 453 c 439 g 289 t ORIGIN 1 attagcctgt ccgcctctgc gggaccatgg agtggtagcc gaggaggaag catgctggcc 61 gtcggctgcg cgctgctggc tgccctgctg gccgcgccgg gagcggcgct ggccccaagg 121 cgctgccctg cgcaggaggt ggcgagaggc gtgctgacca gtctgccagg agacagcgtg 181 actctgacct gcccgggggt agagccggaa gacaatgcca ctgttcactg ggtgctcagg 241 aagccggctg caggctccca ccccagcaga tgggctggca tgggaaggag gctgctgctg 301 aggtcggtgc agctccacga ctctggaaac tattcatgct accgggccgg ccgcccagct 361 gggactgtgc acttgctggt ggatgttccc cccgaggagc cccagctctc ctgcttccgg 421 aagagccccc tcagcaatgt tgtttgtgag tggggtcctc ggagcacccc atccctgacg 481 acaaaggctg tgctcttggt gaggaagttt cagaacagtc cggccgaaga cttccaggag 541 ccgtgccagt attcccagga gtcccagaag ttctcctgcc agttagcagt cccggaggga 601 gacagctctt tctacatagt gtccatgtgc gtcgccagta gtgtcgggag caagttcagc 661 aaaactcaaa cctttcaggg ttgtggaatc ttgcagcctg atccgcctgc caacatcaca 721 gtcactgccg tggccagaaa cccccgctgg ctcagtgtca cctggcaaga cccccactcc 781 tggaactcat ctttctacag actacggttt gagctcagat atcgggctga acggtcaaag 841 acattcacaa catggatggt caaggacctc cagcatcact gtgtcatcca cgacgcctgg 901 agcggcctga ggcacgtggt gcagcttcgt gcccaggagg agttcgggca aggcgagtgg 961 agcgagtgga gcccggaggc catgggcacg ccttggacag aatccaggag tcctccagct 1021 gagaacgagg tgtccacccc catgcaggca cttactacta ataaagacga tgataatatt 1081 ctcttcagag attctgcaaa tgcgacaagc ctcccagtgc aagattcttc ttcagtacca 1141 ctgcccacat tcctggttgc tggagggagc ctggccttcg gaacgctcct ctgcattgcc 1201 attgttctga ggttcaagaa gacgtggaag ctgcgggctc tgaaggaagg caagacaagc 1261 atgcatccgc cgtactcttt ggggcagctg gtcccggaga ggcctcgacc caccccagtg 1321 cttgttcctc tcatctcccc accggtgtcc cccagcagcc tggggtctga caatacctcg 1381 agccacaacc gaccagatgc cagggaccca cggagccctt atgacatcag caatacagac 1441 tacttcttcc ccagatagct ggctgggtgg caccagcagc ctggac // LOCUS HSIAI3B 4403 bp RNA PRI 18-MAR-1994 DEFINITION H.sapiens IAI.3B mRNA. ACCESSION X76952 NID g463244 KEYWORDS IAI3B gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4403) AUTHORS Campbell,I.G., Nicolai,H.M., Foulkes,W.D., Senger,G., Stamp,G.W., Allan,G., Boyer,C., Jones,K., Bast Jr,R.C., Solomon,E., Trowsdale,J. and Black,D.M. TITLE A novel gene encoding a B-box protein within the BRCA1 region at 17q21.1 JOURNAL Hum. Mol. Genet. 3 (4), 589-594 (1994) MEDLINE 94348506 REFERENCE 2 (bases 1 to 4403) AUTHORS Campbell,I.G. TITLE Direct Submission JOURNAL Submitted (21-DEC-1993) I.G. Campbell, University of Southampton, Obstetrics and Gynaecology, Princess Anne Hospital, Coxford Road, Southampton S09 4HA Hants, UK FEATURES Location/Qualifiers source 1..4403 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="ovary" /cell_type="epithelium" /cell_line="OVCA432" /clone_lib="lambda gt11" /clone="IAI3B" /map="17q21.1-21.2" /chromosome="17q" gene 93..2993 /gene="IAI3B" CDS 93..2993 /gene="IAI3B" /codon_start=1 /db_xref="PID:g463245" /translation="MEPQVTLNVTFKNEIQSFLVSDPENTTWADIEAMVKVSFDLNTI QIKYLDEENEEVSINSQGEYEEALKMAVKQGNQLQMQVHEGHHVVDEAPPPVVGAKRL AARAGKKPLAHYSSLVRVLGSDMKTPEDPAVQSFPLVPCDTDQPQDKPPDWFTSYLET FREQVVNETVEKLEQKLHEKLVLQNPSLGSCPSEVSMPTSEETLFLPENQFSWHIACN NCQRRIVGVRYQCSLCPSYNICEDCEAGPYGHDTNHVLLKLRRPVVGSSEPFCHSKYS TPRLPAALEQVRLQKQVDKNFLKAEKQRLRAEKKQRKAEVKELKKQLKLHRKIHLWNS IHGLQSPKSPLGRPESLLQSNTLMLPLQPCTSVMPMLSAAFVDENLPDGTHLQPGTKF IKHWRMKNTGNVKWSADTKLKFMWGNLTLASTEKKDVLVPCLKAGHVGVVSVEFIAPA LEGTYTSHWRLSHKGQQFGPRVWCSIIVDPFPSEESPDNIEKGMISSSKTDDLTCQQE ETFLLAKEERQLGEVTEQTEGTAACIPQKAKNVASERELYIPSVDLLTAQDLLSFELL DINIVQELERVPHNTPVDVTPCMSPLPHDSPLIEKPGLGQIEEENEGAGFKALPDSMV SVKRKAENIASVEEAEEDLSGTQFVCETVIRSLTLDAAPDHNPPCRQKSLQMTFALPE GPLGNEKEEIIHIAEEEAVMEEEEDEEDEEEEDELKDEVQSQSSASSEDYIIILPECF DTSRPWGILCTALRSHSQAWSEVLKASLGFEAGQEPAEAGERLPGGENQPQEHSISDI LTTSQTLETVPLIPEVVELPPSLPRSSPCVHHHGSPGVDLPVTIPEVSSVPDQIRGEP RGSSGLVNSRQKSYDHSRHHHGSSIAGGLVKGALSVAASAYKALFAGPPVTAQPIISE DQTAALMARLFEMGFCDRQLNLRLLKKHNYNILQVVTELLQLNNNDWYSQRY" misc_feature 743..854 /gene="IAI3B" /note="B-BOX" BASE COUNT 1298 a 929 c 1056 g 1119 t 1 others ORIGIN 1 ggaattccgg gatagcggca gagccggtag cggacggtcc ttgcattggc ctccggcagg 61 cgccccccgg gggcgggaag ctgcctcaca gcatggaacc acaggttact ctaaatgtga 121 cttttaaaaa tgaaattcaa agctttctgg tttctgatcc agaaaataca acttgggctg 181 atatcgaagc tatggtaaaa gtttcatttg atctgaatac tattcaaata aaatacctgg 241 atgaggaaaa tgaagaggta tccatcaaca gtcaaggaga atatgaagaa gcgcttaaga 301 tggcagttaa acagggaaac caactgcaga tgcaagtcca cgaagggcac catgtcgttg 361 atgaagcccc acccccagtt gtaggagcaa aacgactagc tgccagggca gggaagaagc 421 cacttgcaca ttactcttca ctggtgagag tcttgggatc agacatgaag accccagagg 481 atcctgcagt gcagtcgttt ccacttgttc catgtgacac agaccagcct caagacaagc 541 ccccagactg gttcacaagc tacctggaga cgttcagaga acaagtggtt aacgaaacgg 601 ttgagaagct tgaacagaaa ttacatgaaa agcttgtcct ccagaaccca tccttgggtt 661 cttgtccctc agaagtctca atgcctactt cagaagaaac attgtttttg ccagaaaacc 721 agttcagctg gcatattgct tgcaacaact gccaaagaag gattgttggt gtccgctacc 781 agtgtagcct atgcccatcc tacaatatct gtgaagattg tgaagcaggg ccatatggcc 841 atgacactaa ccacgtcctg ctgaagttgc ggagacctgt tgtgggctcc tctgaaccgt 901 tctgtcactc aaagtactct actcctcgtc ttcctgctgc tctggaacaa gtcaggctcc 961 agaaacaggt tgataagaac tttcttaaag cagaaaagca aaggttgcga gctgagaaga 1021 aacaacgtaa agcagaggtc aaggaactta aaaagcagct taaactccat aggaaaattc 1081 acctgtggaa ttcaatccat ggactccaga gccccaagtc tcctttaggc cgacctgaga 1141 gcttgctcca gtctaatacc ctgatgctcc ctttgcagcc ctgtacctcc gttatgccaa 1201 tgctcagtgc agcatttgtg gatgagaatt tgcctgatgg gactcacctt cagccaggaa 1261 ccaagtttat caaacactgg aggatgaaaa atacaggaaa tgtaaagtgg agtgcagaca 1321 caaagctcaa gttcatgtgg ggaaacctga ctttggcttc cacagaaaag aaggatgttt 1381 tggttccctg cctcaaggcc ggccatgtgg gagttgtatc tgtggagttc attgccccag 1441 ccttggaggg aacgtatact tcccattggc gtctttctca caaaggccag caatttgggc 1501 ctcgggtctg gtgcagtatc atagtagatc ctttcccctc cgaagagagc cctgataaca 1561 ttgaaaaggg catgatcagc tcaagcaaaa ctgatgatct cacctgccag caagaggaaa 1621 cttttcttct ggctaaagaa gaaagacagc ttggtgaagt gactgagcag acagaaggga 1681 cagcagcctg catcccacag aaggcaaaaa atgttgccag tgagagggag ctctacatcc 1741 catctgtgga tcttctgact gcccaggacc tgctgtcctt tgagctgttg gatataaaca 1801 ttgttcaaga gttggagaga gtgccccaca acacccctgt ggatgtgact ccctgcatgt 1861 ctcctctgcc acatgacagt cctttaatag agaagccagg cttggggcag atagaggaag 1921 agaatgaagg ggcaggattt aaagcacttc ctgattctat ggtgtcagta aagaggaagg 1981 ctgagaacat tgcttctgtg gaggaagcag aagaagacct gagtgggacc cagtttgtgt 2041 gtgagacagt aatccgatcc cttaccttgg atgctgcccc agaccacaac cctccttgca 2101 gacagaagtc cttgcagatg acatttgcct tgcctgaagg accacttgga aatgagaagg 2161 aggagattat ccatatcgct gaggaagaag ctgtcatgga ggaggaggag gatgaggagg 2221 atgaggagga ggaggatgag ctcaaagatg aagttcaaag tcagtcctct gcttcctcag 2281 aggattacat catcatcctg cctgagtgct ttgataccag ccgcccctgg gggattctat 2341 gtacagctct gcgctctcac agccaggcct ggagcgaggt gctgaaggca agcctggggt 2401 ttgaggctgg gcaggaacca gctgaggctg gggaaagact ccctggaggg gagaaccagc 2461 cacaggagca cagcataagt gacatcctca cgacctcaca gactctggaa acagtgcccc 2521 taatcccaga ggtagtggag cttccaccgt cactgcccag gagctctcct tgtgtacatc 2581 atcatggttc cccaggagtg gatttaccag ttaccatacc agaagtttct tcagtccctg 2641 atcagatcag aggagagccc agaggctcat caggacttgt aaacagcaga cagaagagct 2701 atgaccactc aaggcaccat catgggagca gcattgctgg aggactggtg aagggggctt 2761 tgtctgttgc tgcctctgca tacaaggccc tgtttgctgg gccaccagtc actgcacagc 2821 caataatttc tgaagatcag acagcagccc tgatggcccg tctctttgaa atgggattct 2881 gtgacaggca gctgaaccta cggctgctga agaaacacaa ttacaatatc ctgcaggttg 2941 tgacagaact tcttcagtta aacaacaacg actggtacag ccaacgctat tgaggagtga 3001 ccttgtatta aataactgcc tgctgctcag agatgatctt tattctgtca ttggggtatg 3061 ggatagaagc ccttgcttat ttttaatctg atgaatctgt atagagccca tcgttgagtt 3121 accaagacaa tacctgctac agtattttgg ggagcaaact aaagaccaga agttaaattt 3181 tcactttang acattggatg aatagtatga agacagtttt tcagttgatt tggataaaac 3241 tattttagtg cattgacaag tgtaacttca acttcatata gaaccatttt tctttctgct 3301 tttattgaaa ctgagtattt ttctttggct aatgtggatt ttttatgggg atatctgtta 3361 attttcaggt tttgaaaaga cattaacctc ggaagttgtt tttaagaatt attctcataa 3421 ttcttattct cataatttct gtaatccacc tcaagcttca tagttatttg gcattgaaat 3481 aacacccaga gcatgataga atgtgtactc ttccctctct caaggagaag taattttcct 3541 gcaatactta ataattggca ccgttgcttt ctaaagactc catggtgcat tcaagagtat 3601 ccaacttcaa gggaatctct gcatttcaat gaaaggagga agagtgtgct gataaaccta 3661 ccagcaccta ttgagcaatg tctattatag taattttgca tacattttta tttaagggaa 3721 aaaatatagg tattgtgaaa tattttgcta atcttataga aaaggaaaaa atcccgttat 3781 ttaaagggaa aagtaaattt aacagttgcc ttttttctta atcgtcaggg cagatcttat 3841 tttacagtac agtcggggga aatagaaaca tgtgaaaggc aaaaggcagg ctcctaaatt 3901 aatgtcagtg aagttcaggg tgggcaaatg agtgtgtgtg aggtatagga aatgctgatg 3961 acttctttaa tgcttgaagt ccgttcacag gtatctagcc ctagaatgcc tagaacagga 4021 agaggcagct ggtgttctgc aaaacttgga caggggcaaa gttgctgaaa aagttttggt 4081 ttaacccgaa gataagtgga aaagagcttg tccatgaacc caggttctca ctctgtttac 4141 agaagtgtgt tgagtacagt tggtgaagga agaggtaaca aaaaatgcta aatattttat 4201 ccatgaaaat gacttccaga aaaggaagaa tatgaacccc agaccgaagg ggaaaagata 4261 gttaatagta ttatctaacc tggttggtat ttgtaatgaa tggtgatttt aattagtcat 4321 tagccataat gatgtttatt tacagtataa ctcctgaatg ctacttaaat aaaccaggat 4381 tcaaactgca aaaaaaaaaa aaa // LOCUS HSIATIH3 2807 bp RNA PRI 30-MAR-1993 DEFINITION H.sapiens mRNA for inter-alpha-trypsin inhibitor heavy chain H3. ACCESSION X67055 NID g288562 KEYWORDS H3 heavy chain; inter-alpha-trypsin inhibitor heavy chain; proteinase inhibitor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2807) AUTHORS Bourguignon,J. TITLE Direct Submission JOURNAL Submitted (23-JUN-1992) J. Bourguignon, Institut National de la Sante et de la Recherche Medicale INSERM, Unite 295 Faculte de Med-Pharm de Rouen, Ave de l'Universite BP 97, F-76803 St Etienne Rouvray Cedex, FRANCE REFERENCE 2 (bases 1 to 2807) AUTHORS Bourguignon,J., Diarra-Mehrpour,M., Thiberville,L., Bost,F., Sesboue,R. and Martin,J.P. TITLE Human pre-alpha-trypsin inhibitor-precursor heavy chain. cDNA and deduced amino-acid sequence JOURNAL Eur. J. Biochem. 212 (3), 771-776 (1993) MEDLINE 93215656 COMMENT See also X14690. FEATURES Location/Qualifiers source 1..2807 /organism="Homo sapiens" /isolate="2" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="liver" /cell_type="hepatocyte" /clone_lib="lambda gt11" /clone="5" /chromosome="3p" /map="p211-p212" CDS 21..2678 /codon_start=1 /product="inter-alpha-trypsin inhibitor heavy chain H3" /db_xref="PID:g288563" /translation="MVALSHLGSALQLGSLCFPRSPFRLLGKRSLPEGVANGIEVYST KINSKVTSRFAHNVVTMRAVNRADTAKEVSFDVELPKTAFITNFTLTIDGVTYPGNVK EKEVAKKQYEKAVSQGKTAGLVKASGRKLEKFTVSVNVAAGSKVTFELTYEELLKRHK GKYEMYLKVQPKQLVKHFEIEVDIFEPQGISMLDAEASFITNDLLGSALTKSFSGKKG HVSFKPSLDQQRSCPTCTDSLLNGDFTITYDVNRESPGNVQIVNGYFVHFFAPQGLPV VPKNVAFVIDISGSMAGRKLEQTKEALLRILEDMKEEDYLNFILFSGDVSTWKEHLVQ ATPENLQEARTFVKSMEDKGMTNINDGLLRGISMLNKAREEHRIPERSTSIVIMLTDG DANVGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFARRIYEDS DADLQLQGFYEEVANPLLTGVEMEYPENAILDLTQNTYQHFYDGSEIVVAGRLVDEDM NSFKADVKGHGATNDLTFTEEVDMKEMEKALQERDYIFGNYIERLWAYLTIEQLLEKR KNAHGEEKENLTARALDLSLKYHFVTPLTSMVVTKPEDNEDERAIADKPGEDAEATPV SPAMSYLTSYQPPQNPYYYVDGDPHFIIQIPEKDDALCFNIDEAPGTVLRLIQDAVTG LTVNGQITGDKRGSPDSKTRKTYFGKLGIRNAQMDFQVEVTTEKITCGTGRASTFSWL DTVTVTQDGLSMMINRKNMVVSFGDGVTFVVVLHQVWKKHPVHRDFLGFYVVDSHRMS AQTHGLLGQFFQPFDFKVSDIRPGSDPTKPDATLVVKNHQLIVTRGSQKDYRKDASIG TKVVCWFVHNNGEGLIDGVHTDYIVPNLF" BASE COUNT 723 a 732 c 785 g 567 t ORIGIN 1 tattcagcga tggactttgc atggtggccc tgtctcatct tggctctgct ctccagcttg 61 gcagcctctg cttcccgaga agcccctttc ggctgcttgg gaaacggagc ctcccggaag 121 gggtggccaa tggcatcgag gtctacagta ccaaaatcaa ctccaaggtg acctcccgtt 181 ttgctcacaa tgttgtcacc atgagagccg tcaaccgtgc agacacggcc aaggaggttt 241 cctttgatgt ggagctgccc aagacggcct tcatcaccaa cttcaccttg accatcgacg 301 gtgttaccta ccctgggaat gtcaaggaga aggaagttgc caagaagcag tatgaaaagg 361 ctgtgtccca gggcaagacg gccggcttgg tcaaggcctc tgggaggaag ttggagaagt 421 tcacagtctc ggtcaacgtg gctgcaggca gcaaagtcac cttcgagcta acctacgagg 481 agctgctgaa gaggcacaag ggcaagtacg agatgtacct caaggtccag cctaagcaac 541 tggtcaaaca ctttgagatc gaggtagaca tcttcgagcc tcagggaatc agcatgctgg 601 atgctgaggc ctctttcatc accaacgacc tcctgggaag cgccctcacc aagtccttct 661 cagggaaaaa gggccatgtg tccttcaagc ccagcttaga ccaacagcgt tcatgcccaa 721 cctgtacaga ttccctcctc aatggagatt tcactatcac ctatgacgtg aacagagaat 781 ctcctggcaa cgtgcagata gtcaatggct acttcgtgca cttctttgca cctcaaggcc 841 ttccagtggt gcctaagaac gtggcctttg tgattgacat cagcggctcc atggctggtc 901 ggaaattaga gcagacaaag gaggcccttc tcagaatcct ggaagatatg aaagaggaag 961 actatctgaa tttcatcctg ttcagtggag atgtgtccac atggaaagag cacttagtcc 1021 aggccacgcc cgagaacctc caggaggcca ggacgtttgt gaagagcatg gaggataaag 1081 gaatgaccaa catcaatgac gggctgctga ggggcatcag tatgctgaac aaggcccgag 1141 aggagcacag aatcccagag aggagcacct ccattgtcat catgctgact gatggggatg 1201 ccaatgttgg tgagagcaga cccgaaaaaa tccaagagaa tgtgcggaat gccatcgggg 1261 gcaagttccc cttgtataac ctgggctttg gcaacaatct gaattataac ttcctggaga 1321 acatggccct ggagaaccat gggtttgccc ggcgcattta tgaggactct gatgccgatt 1381 tgcagttgca gggcttctat gaggaggtgg ccaacccact gctgacgggt gtggagatgg 1441 agtaccccga gaacgctatc ctggacctca cccagaacac ttaccagcac ttctacgatg 1501 gctctgagat cgtggtggcc gggcgcctgg tggacgagga catgaacagc tttaaggcag 1561 atgtgaaggg ccatggggcc accaacgacc tgaccttcac agaggaggtg gacatgaagg 1621 agatggagaa ggccctgcag gagcgggact acatcttcgg gaattacatt gagcggctct 1681 gggcctacct caccattgag cagctgctgg agaagcgcaa gaacgcccat ggcgaggaga 1741 aggagaacct cacggcccgg gccctggacc tgtccctcaa gtatcacttt gtgactccac 1801 tgacctcaat ggtggtgacc aagcctgagg acaacgagga tgagagggcc attgccgaca 1861 agcctgggga agatgcagaa gccacaccgg tgagccccgc catgtcctac ctgaccagct 1921 accagcctcc tcaaaacccc tactactatg tggacgggga tccccacttc atcatccaaa 1981 ttccggagaa agacgatgcc ctctgcttca acatcgatga agccccaggc acagtgctgc 2041 gccttattca ggatgcagtc acaggcctca cagttaatgg gcagatcact ggcgacaaga 2101 gaggcagccc tgactccaag accagaaaga cttactttgg aaaactgggc attcgcaatg 2161 ctcagatgga cttccaggtg gaggtgacaa cggagaagat cacctgtgga acaggccgtg 2221 cgagcacttt cagctggctg gacacagtca cagtcacgca ggatgggctg tccatgatga 2281 tcaacaggaa gaacatggtg gtctcctttg gagatggggt taccttcgtg gtcgtcctac 2341 accaggtgtg gaagaaacat cctgtccacc gtgactttct aggcttctac gtggtggaca 2401 gtcaccggat gtcagcacag acgcatgggc tgctggggca attcttccaa ccctttgact 2461 ttaaagtgtc tgacatccgg ccaggctctg accccacaaa gccagatgcc acattggtgg 2521 tgaagaacca tcagctgatt gtcaccaggg gctcccagaa agactacaga aaggatgcca 2581 gcatcggcac gaaggttgtc tgctggttcg tccacaacaa cggagaaggg ctgattgatg 2641 gtgtccacac tgactacatt gtccccaacc tgttttgagt agacacacca gctcctgttg 2701 ggatggatgg cccggatttt atggcatctg gaacatgggc acagagaggg gcctgtggga 2761 ggggctggga aaataaagtc caaggtcgag ccagaaaaaa aaaaaaa // LOCUS HSICA512 3307 bp RNA PRI 17-MAY-1994 DEFINITION H.sapiens mRNA for islet cell antigen ICA-512 (putative tyrosine phosphatase). ACCESSION X62899 NID g32612 KEYWORDS tyrosine phosphatase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3307) AUTHORS Rabin,D.U., Pleasic,S.M., Shapiro,J.A., Yoo-Warren,H., Oles,J., Hicks,J.M., Goldstein,D.E. and Rae,P.M. TITLE Islet cell antigen 512 is a diabetes-specific islet autoantigen related to protein tyrosine phosphatases JOURNAL J. Immunol. 152 (6), 3183-3188 (1994) MEDLINE 94194080 REFERENCE 2 (bases 1 to 3307) AUTHORS Shapiro,J. TITLE Direct Submission JOURNAL Submitted (29-OCT-1991) J. Shapiro, Molecular Diagnostics Inc, Miles Research Center, 400 Morgan Lane, West Haven CT 06516, USA COMMENT . FEATURES Location/Qualifiers source 1..3307 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 925..2571 /citation=[1] /codon_start=1 /product="Islet Cell Antigen 512" /db_xref="PID:g32613" /translation="MEGPVEGRDTAELPARTSPMPGHPTASPTSSEVQQVPSPVSSEP PKAARPPVTPVLLEKKSPLGQSQPTVAGQPSARPAAEEYGYIVTDQKPLSLAAGVKLL EILAEHVHMSSGSFINISVVGPALTFRIRHNEQNLSLADVTQQAGLVKSELEAQTGLQ ILQTGVGQREEAAAVLPQTAHSTSPMRSVLLTLVALAGVAGLLVALAVALCVRQHARQ QDKERLAALGPEGAHGDTTFEYQDLCRQHMATKSLFNRAEGPPEPSRVSSVSSQFSDA AQASPSSHSSTPSWCEEPAQANMDISTGHMILAYMEDHLRNRDRLAKEWQALCAYQAE PNTCATAQGEGNIKKNRHPDFLPYDHARIKLKVESSPSRSDYINASPIIEHDPRMPAY IATQGPLSHTIADFWQMVWESGCTVIVMLTPLVEDGVKQCDRYWPDEGASLYHVYEVN LVSEHIWCEDFLVRSFYLKNVQTQETRTLTQFHFLSWPAEGTPASTRPLLDFRRKVNK CYRGRSCPIIVHCSDGAGEDRHLHPHRHGPEPHGKRSEGD" BASE COUNT 667 a 1087 c 945 g 608 t ORIGIN 1 ctccaacgct tacaaggtgt gctccgacaa ctcatgtccc aaggattgtc ctggcacgat 61 gacctcaccc agtatgtgat ctctcaggag atggagcgca tccccaggct tcgcccccag 121 agccccgtcc aagggacagg tctggcttgg cacccaagag acctggtcct gctggagagc 181 tgcttttaca ggacatcccc actggctccg ccctgctgcc cagcatcggc ttccacaacc 241 accagtgggc aaaggtggag ctggggccag ctcctctctg tcccctctgc aggctgagct 301 gctcccgcct ctcttggagc acctgctgct gcccccacag cctccccacc cttcactgag 361 ttacgaacct gccttgctgc agccctacct gttccaccag tttggctccc gtgatggctc 421 cagggtctca gagggctccc cagggatggt cagtgtcggc cccctgccca aggctgaagc 481 ccctgccctc ttcagcagaa ctgcctccaa gggcatattt ggggaccacc ctggccactc 541 ctacggggac cttccagggc cttcacctgc ccagcttttt caagactctg ggctgctcta 601 tctggcccag gagttgccag cacccagcag ggccagggtg ccaaggctgc cagagcaagg 661 gagcagcagc cgggcagagg actccccaga gggctatgag aaggaaggac taggggatcg 721 tggagagaag cctgcttccc cagctgtgca gccagatgcg gctctgcaga ggctggcgct 781 gtgctggcgg gctatggggt agagctgcgt cagctgaccc ctgagcagct ctccacactc 841 ctgaccctgc tgcagctact gcccaagggt gcaggaagaa atccgggagg ggttgtaaat 901 gttggagctg atatcaagaa aacaatggag gggccggtgg agggcagaga cacagcagag 961 cttccagccc gcacatcccc catgcctgga caccccactg ccagccctac ctccagtgaa 1021 gtccagcagg tgccaagccc tgtctcctct gagcctccca aagctgccag accccctgtg 1081 acacctgtcc tgctagagaa gaaaagccca ctgggccaga gccagcccac ggtggcagga 1141 cagccctcag cccgcccagc agcagaggaa tatggctaca tcgtcactga tcagaagccc 1201 ctgagcctgg ctgcaggagt gaagctgctg gagatcctgg ctgagcatgt gcacatgtcc 1261 tcaggcagct tcatcaacat cagtgtggtg ggaccagccc tcaccttccg catccggcac 1321 aatgagcaga acctgtcttt ggctgatgtg acccaacaag cagggctggt gaagtctgaa 1381 ctggaagcac agacagggct ccaaatcttg cagacaggag tgggacagag ggaggaggca 1441 gctgcagtcc ttccccaaac tgcgcacagc acctcaccca tgcgctcagt gctgctcact 1501 ctggtggccc tggcaggtgt ggctgggctg ctggtggctc tggctgtggc tctgtgtgtg 1561 cggcagcatg cgcggcagca agacaaggag cgcctggcag ccctggggcc tgagggggcc 1621 catggtgaca ctacctttga gtaccaggac ctgtgccgcc agcacatggc cacgaagtcc 1681 ttgttcaacc gggcagaggg tccaccggag ccttcacggg tgagcagtgt gtcctcccag 1741 ttcagcgacg cagcccaggc cagccccagc tcccacagca gcaccccgtc ctggtgcgag 1801 gagccggccc aagccaacat ggacatctcc acgggacaca tgattctggc atacatggag 1861 gatcacctgc ggaaccggga ccgccttgcc aaggagtggc aggccctctg tgcctaccaa 1921 gcagagccaa acacctgtgc caccgcgcag ggggagggca acatcaaaaa gaaccggcat 1981 cctgacttcc tgccctatga ccatgcccgc ataaaactga aggtggagag cagcccttct 2041 cggagcgatt acatcaacgc cagccccatt attgagcatg accctcggat gccagcctac 2101 atagccacgc agggcccgct gtcccatacc atcgcagact tctggcagat ggtgtgggag 2161 agcggctgca ccgtcatcgt catgctgacc ccgctggtgg aggatggtgt caagcagtgt 2221 gaccgctact ggccagatga gggtgcctcc ctctaccacg tatatgaggt gaacctggtg 2281 tcggagcaca tctggtgcga ggactttctg gtgcggagct tctacctgaa gaacgtgcag 2341 acccaggaga cgcgcacgct cacgcagttc cacttcctca gctggccggc agagggcaca 2401 ccggcctcca cgcggcccct gctggacttc cgcaggaagg tgaacaagtg ctaccggggc 2461 cgctcctgcc ccatcatcgt gcactgcagt gatggtgcgg gggaggaccg gcacctacat 2521 cctcatcgac atggtcctga accgcatggc aaaaggagtg aaggagattg acatcgctgc 2581 caccctggag catgtccgtg accagcggcc tggccttgtc cgctctaagg accagtttga 2641 atttgccctg acagccgtgg cggaggaagt gaatcccatc ctcaaggccc tgccccagtg 2701 agaccctggg gccccttggc gggcagccca gcctctgtcc ctctttgcct gtgtgagcat 2761 ctctgtgtac ccactcctca ctgccccacc agccacctct tgggcatgct cagcccttcc 2821 tagaagagtc aggaagggaa agccagaagg gcacgcctgc ccagcctcgc atgccagagc 2881 ctggggcatc ccagagccca gagcatccca tgggggtgct gcagccagga ggagaggaaa 2941 ggacatgggt agcaattcta cccagagcct tctcctgcct acattccctg gcctggctct 3001 cctgtagctc tcctggggtt ctgggagttc cctgaacatc tgtgtgtgtc cccctatgct 3061 ccagtatgga agaatggggt ggagggtcgc cacacccggc tccccctgct tctcagcccc 3121 gggcctgcct ctgactcaca cttgggcgct ctgccctccc tgcctcacgc ccagcctcct 3181 cccaccaccc tcccaccatg cgctgctcaa cctctctcct tctggcgcaa gagaacattt 3241 ctagaaaaaa ctacttttgt accagtgtga ataaagttag tgtgttgtct gtgctgctgc 3301 aaaaaaa // LOCUS HSICAM1 1846 bp RNA PRI 09-DEC-1995 DEFINITION Human mRNA for intercellular adhesion molecule-1 ICAM-1. ACCESSION X06990 NID g32614 KEYWORDS cell adhesion molecule; intercellular adhesion molecule 1; lymphocyte-function associated antigen-1. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1846) AUTHORS Simmons,D., Makgoba,M.W. and Seed,B. TITLE ICAM, an adhesion ligand of LFA-1, is homologous to the neural cell adhesion molecule NCAM JOURNAL Nature 331 (6157), 624-627 (1988) MEDLINE 88122667 FEATURES Location/Qualifiers source 1..1846 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HL-60" /clone="pICAM-1" sig_peptide 13..87 /note="signal peptide (AA-25 to -1)" CDS 13..1611 /codon_start=1 /product="intercellular adhesion molecule-1 ICAM-1" /db_xref="PID:g758074" /db_xref="SWISS-PROT:P05362" /translation="MAPSSPRPALPALLVLLGALFPGPGNAQTSVSPSKVILPRGGSV LVTCSTSCDQPKLLGIETPLPKKELLLPGNNRKVYELSNVQEDSQPMCYSNCPDGQST AKTFLTVYWTPERVELAPLPSWQPVGKNLTLRCQVEGGAPRANLTVVLLRGEKELKRE PAVGEPAEVTTTVLVRRDHHGANFSCRTELDLRPQGLELFENTSAPYQLQTFVLPATP PQLVSPRVLEVDTQGTVVCSLDGLFPVSEAQVHLALGDQRLNPTVTYGNDSFSAKASV SVTAEDEGTQRLTCAVILGNQSQETLQTVTIYSFPAPNVILTKPEVSEGTEVTVKCEA HPRAKVTLNGVPAQPLGPRAQLLLKATPEDNGRSFSCSATLEVAGQLIHKNQTRELRV LYGPRLDERDCPGNWTWPENSQQTPMCQAWGNPLPELKCLKDGTFPLPIGESVTVTRD LEGTYLCRARSTQGEVTREVTVNVLSPRYEIVIITVVAAAVIMGTAGLSTYLYNRQRK IKKYRLQQAQKGTPMKPNTQATPP" mat_peptide 88..1608 /note="ICAM-1 (AA +1 to +507)" BASE COUNT 416 a 580 c 531 g 319 t ORIGIN 1 ctcagcctcg ctatggctcc cagcagcccc cggcccgcgc tgcccgcact cctggtcctg 61 ctcggggctc tgttcccagg acctggcaat gcccagacat ctgtgtcccc ctcaaaagtc 121 atcctgcccc ggggaggctc cgtgctggtg acatgcagca cctcctgtga ccagcccaag 181 ttgttgggca tagagacccc gttgcctaaa aaggagttgc tcctgcctgg gaacaaccgg 241 aaggtgtatg aactgagcaa tgtgcaagaa gatagccaac caatgtgcta ttcaaactgc 301 cctgatgggc agtcaacagc taaaaccttc ctcaccgtgt actggactcc agaacgggtg 361 gaactggcac ccctcccctc ttggcagcca gtgggcaaga accttaccct acgctgccag 421 gtggagggtg gggcaccccg ggccaacctc accgtggtgc tgctccgtgg ggagaaggag 481 ctgaaacggg agccagctgt gggggagccc gctgaggtca cgaccacggt gctggtgagg 541 agagatcacc atggagccaa tttctcgtgc cgcactgaac tggacctgcg gccccaaggg 601 ctggagctgt ttgagaacac ctcggccccc taccagctcc agacctttgt cctgccagcg 661 actcccccac aacttgtcag cccccgggtc ctagaggtgg acacgcaggg gaccgtggtc 721 tgttccctgg acgggctgtt cccagtctcg gaggcccagg tccacctggc actgggggac 781 cagaggttga accccacagt cacctatggc aacgactcct tctcggccaa ggcctcagtc 841 agtgtgaccg cagaggacga gggcacccag cggctgacgt gtgcagtaat actggggaac 901 cagagccagg agacactgca gacagtgacc atctacagct ttccggcgcc caacgtgatt 961 ctgacgaagc cagaggtctc agaagggacc gaggtgacag tgaagtgtga ggcccaccct 1021 agagccaagg tgacgctgaa tggggttcca gcccagccac tgggcccgag ggcccagctc 1081 ctgctgaagg ccaccccaga ggacaacggg cgcagcttct cctgctctgc aaccctggag 1141 gtggccggcc agcttataca caagaaccag acccgggagc ttcgtgtcct gtatggcccc 1201 cgactggacg agagggattg tccgggaaac tggacgtggc cagaaaattc ccagcagact 1261 ccaatgtgcc aggcttgggg gaacccattg cccgagctca agtgtctaaa ggatggcact 1321 ttcccactgc ccatcgggga atcagtgact gtcactcgag atcttgaggg cacctacctc 1381 tgtcgggcca ggagcactca aggggaggtc acccgcgagg tgaccgtgaa tgtgctctcc 1441 ccccggtatg agattgtcat catcactgtg gtagcagccg cagtcataat gggcactgca 1501 ggcctcagca cgtacctcta taaccgccag cggaagatca agaaatacag actacaacag 1561 gcccaaaaag ggacccccat gaaaccgaac acacaagcca cgcctccctg aacctatccc 1621 gggacagggc ctcttcctcg gccttcccat attggtggca gtggtgccac actgaacaga 1681 gtggaagaca tatgccatgc agctacacct accggccctg ggacgccgga ggacagggca 1741 ttgtcctcag tcagatacaa cagcatttgg ggccatggta cctgcacacc taaaacacta 1801 ggccacgcat ctgatctgta gtcacatgac taagccaaga ggaagg // LOCUS HSICAM2 1041 bp RNA PRI 31-MAR-1995 DEFINITION Human mRNA for ICAM-2, cell adhesion ligand for LFA-1. ACCESSION X15606 NID g32623 KEYWORDS cell adhesion molecule; glycoprotein; ICAM-2 protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1041) AUTHORS Staunton,D.E., Dustin,M.L. and Springer,T.A. TITLE Functional cloning of ICAM-2, a cell adhesion ligand for LFA-1 homologous to ICAM-1 JOURNAL Nature 339 (6219), 61-64 (1989) MEDLINE 89238547 FEATURES Location/Qualifiers source 1..1041 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="endothelial cells" /clone_lib="CDM8" sig_peptide 63..125 /note="signal peptide (AA -21 to -1)" CDS 63..890 /codon_start=1 /product="ICAM-2 preprotein (AA -21 to 254)" /db_xref="PID:g32624" /db_xref="SWISS-PROT:P13598" /translation="MSSFGYRTLTVALFTLICCPGSDEKVFEVHVRPKKLAVEPKGSL EVNCSTTCNQPEVGGLETSLNKILLDEQAQWKHYLVSNISHDTVLQCHFTCSGKQESM NSNVSVYQPPRQVILTLQPTLVAVGKSFTIECRVPTVEPLDSLTLFLFRGNETLHYET FGKAAPAPQEATATFNSTADREDGHRNFSCLAVLDLMSRGGNIFHKHSAPKMLEIYEP VSDSQMVIIVTVVSVLLSLFVTSVLLCFIFGQHLRQQRMGTYGVRAAWRRLPQAFRP" mat_peptide 126..887 /product="ICAM-2 protein (AA 1 - 254)" misc_feature 201..209 /note="pot. N-linked glycosylation site" misc_feature 306..314 /note="pot. N-linked glycosylation site" misc_feature 375..383 /note="pot. N-linked glycosylation site" misc_feature 519..527 /note="pot. N-linked glycosylation site" misc_feature 588..596 /note="pot. N-linked glycosylation site" misc_feature 621..629 /note="pot. N-linked glycosylation site" misc_feature 732..809 /note="transmembrane domain" misc_feature 1023..1028 /note="pot. polyA signal" polyA_site 1041 /note="polyA site" BASE COUNT 218 a 312 c 288 g 223 t ORIGIN 1 ctaaagatct ccctccaggc agcccttggc tggtccctgc gagcccgtgg agactgccag 61 agatgtcctc tttcggttac aggaccctga ctgtggccct cttcaccctg atctgctgtc 121 caggatcgga tgagaaggta ttcgaggtac acgtgaggcc aaagaagctg gcggttgagc 181 ccaaagggtc cctcgaggtc aactgcagca ccacctgtaa ccagcctgaa gtgggtggtc 241 tggagacctc tctaaataag attctgctgg acgaacaggc tcagtggaaa cattacttgg 301 tctcaaacat ctcccatgac acggtcctcc aatgccactt cacctgctcc gggaagcagg 361 agtcaatgaa ttccaacgtc agcgtgtacc agcctccaag gcaggtcatc ctgacactgc 421 aacccacttt ggtggctgtg ggcaagtcct tcaccattga gtgcagggtg cccaccgtgg 481 agcccctgga cagcctcacc ctcttcctgt tccgtggcaa tgagactctg cactatgaga 541 ccttcgggaa ggcagcccct gctccgcagg aggccacagc cacattcaac agcacggctg 601 acagagagga tggccaccgc aacttctcct gcctggctgt gctggacttg atgtctcgcg 661 gtggcaacat ctttcacaaa cactcagccc cgaagatgtt ggagatctat gagcctgtgt 721 cggacagcca gatggtcatc atagtcacgg tggtgtcggt gttgctgtcc ctgttcgtga 781 catctgtcct gctctgcttc atcttcggcc agcacttgcg ccagcagcgg atgggcacct 841 acggggtgcg agcggcttgg aggaggctgc cccaggcctt ccggccatag caaccatgag 901 tggcatggcc accaccacgg tggtcactgg aactcagtgt gactcctcag ggttgaggtc 961 cagccctggc tgaaggactg tgacaggcag cagagacttg ggacattgcc ttttctagcc 1021 cgaatacaaa cacctggact t // LOCUS HSICAM3RN 1817 bp RNA PRI 13-JAN-1993 DEFINITION H.sapiens ICAM-3 mRNA. ACCESSION X69819 NID g32627 KEYWORDS ICAM-3 protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1817) AUTHORS de Fougerolles,A.R., Klickstein,L.B. and Springer,T.A. TITLE Cloning and expression of ICAM-3 reveals strong homology to other Ig family counter receptors for LFA1 JOURNAL Unpublished REFERENCE 2 (bases 1 to 1817) AUTHORS de Fougerolles,A.R. TITLE Direct Submission JOURNAL Submitted (15-DEC-1992) A.R. De Fougerolles, Centre for Blood Research, East Quadrangle Res Facility Rm 250, 200 Longwood Avenue, Boston MA 02115, USA COMMENT Conflicts to the sequence reported by Faqwcett, J. in Nature, 360, 481-484 and by Vazeux, R. in Nature, 360, 485-488. FEATURES Location/Qualifiers source 1..1817 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="tonsil" /clone_lib="lambda S2T, ATCC Acc# 37546" /clone="lambda 14a2.2, lambda 7.3.1, lambda 11.2" gene 9..1652 /gene="ICAM-3" CDS 9..1652 /gene="ICAM-3" /codon_start=1 /db_xref="PID:g32628" /db_xref="SWISS-PROT:P32942" /translation="MATMVPSVLWPRACWTLLVCCLLTPGVQGQEFLLRVEPQNPVLS AGGSLFVNCSTDCPSSEKIALETSLSKELVASGMGWAAFNLSNVTGNSRILCSVYCNG SQITGSSNITVYGLPERVELAPLPPWQPVGQNFTLRCQVEGGSPRTSLTVVLLRWEEE LSRQPAVEEPAEVTATVLASRDDHGAPFSCRTELDMQPQGLGLFVNTSAPRQLRTFVL PVTPPRLVAPRFLEVETSWPVDCTLDGLFPASEAQVYLALGDQMLNATVMNHGDTLTA TATATARADQEGAREIVCNVTLGGERREARENLTVFSFLGPIVNLSEPTAHEGSTVTV SCMAGARVQVTLDGVPAAAPGQPAQLQLNATESDDGRSFFCSATLEVDGEFLHRNSSV QLRVLYGPKIDRATCPQHLKWKDKTRHVLQCQARGNPYPELRCLKEGSSREVPVGIPF FVNVTHNGTYQCQASSSRGKYTLVVVMDIEAGSSHFVPVFVAVLLTLGVVTIVLALMY VFREHQRSGSYHVREESTYLPLTSMQPTEAMGEEPSRAE" sig_peptide 9..95 /gene="ICAM-3" misc_feature 195..221 /gene="ICAM-3" /note="region encoding sequence of ICAM-3 lys-C cleaved peptide 17" misc_feature 1212..1244 /gene="ICAM-3" /note="region encoding sequence of ICAM-3 lys-C cleaved peptide 10" misc_feature 1464..1538 /gene="ICAM-3" /note="transmembrane region" misc_feature 1539..1649 /gene="ICAM-3" /note="cytoplasmic region" BASE COUNT 423 a 524 c 542 g 328 t ORIGIN 1 ctgtcagaat ggccaccatg gtaccatccg tgttgtggcc cagggcctgc tggactctgc 61 tggtctgctg tctgctgacc ccaggtgtcc aggggcagga gttccttttg cgggtggagc 121 cccagaaccc tgtgctctct gctggagggt ccctgtttgt gaactgcagt actgattgtc 181 ccagctctga gaaaatcgcc ttggagacgt ccctatcaaa ggagctggtg gccagtggca 241 tgggctgggc agccttcaat ctcagcaacg tgactggcaa cagtcggatc ctctgctcag 301 tgtactgcaa tggctcccag ataacaggct cctctaacat caccgtgtac gggctcccgg 361 agcgtgtgga gctggcaccc ctgcctcctt ggcagccggt gggccagaac ttcaccctgc 421 gctgccaagt ggagggtggg tcgccccgga ccagcctcac ggtggtgctg cttcgctggg 481 aggaggagct gagccggcag cccgcagtgg aggagccagc ggaggtcact gccactgtgc 541 tggccagcag agacgaccac ggagcccctt tctcatgccg cacagaactg gacatgcagc 601 cccaggggct gggactgttc gtgaacacct cagccccccg ccagctccga acctttgtcc 661 tgcccgtgac ccccccgcgc ctcgtggccc cccggttctt ggaggtggaa acgtcgtggc 721 cggtggactg caccctagac gggctttttc cagcctcaga ggcccaggtc tacctggcgc 781 tgggggacca gatgctgaat gcgacagtca tgaaccacgg ggacacgcta acggccacag 841 ccacagccac ggcgcgcgcg gatcaggagg gtgcccggga gatcgtctgc aacgtgaccc 901 tagggggcga gagacgggag gcccgggaga acttgacggt ctttagcttc ctaggaccca 961 ttgtgaacct cagcgagccc accgcccatg aggggtccac agtgaccgtg agttgcatgg 1021 ctggggctcg agtccaggtc acgctggacg gagttccggc cgcggccccg gggcagccag 1081 ctcaacttca gctaaatgct accgagagtg acgacggacg cagcttcttc tgcagtgcca 1141 ctctcgaggt ggacggcgag ttcttgcaca ggaacagtag cgtccagctg cgagtcctgt 1201 atggtcccaa aattgaccga gccacatgcc cccagcactt gaaatggaaa gataaaacga 1261 gacacgtcct gcagtgccaa gccaggggca acccgtaccc cgagctgcgg tgtttgaagg 1321 aaggctccag ccgggaggtg ccggtgggga tcccgttctt cgtcaacgta acacataatg 1381 gtacttatca gtgccaagcg tccagctcac gaggcaaata caccctggtc gtggtgatgg 1441 acattgaggc tgggagctcc cactttgtcc ccgtcttcgt ggcggtgtta ctgaccctgg 1501 gcgtggtgac tatcgtactg gccttaatgt acgtcttcag ggagcaccaa cggagcggca 1561 gttaccatgt tagggaggag agcacctatc tgcccctcac gtctatgcag ccgacagaag 1621 caatggggga agaaccgtcc agagctgagt gacgctggga tccgggatca aagttggcgg 1681 gggcttggct gtgccctcag attccgcacc aataaagcct tcaaactccc taaaaaaaaa 1741 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1801 aaaaaaaaaa aaaaaaa // LOCUS HSICLNGEN 1368 bp RNA PRI 08-MAY-1996 DEFINITION H.sapiens mRNA for Icln protein. ACCESSION X91788 NID g1001874 KEYWORDS Icln gene; Icln protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1368) AUTHORS Buyse,G., de Greef,C., Raeymaekers,L., Droogmans,G., Nilius,B. and Eggermont,J. TITLE The ubiquitously expressed pICln protein forms homomeric complexes in vitro JOURNAL Biochem. Biophys. Res. Commun. 218 (3), 822-827 (1996) MEDLINE 96158969 REFERENCE 2 (bases 1 to 1368) AUTHORS Eggermont,J. TITLE Direct Submission JOURNAL Submitted (22-SEP-1995) J. Eggermont, K.U.Leuven, Laboratory of Physiology, Campus Gasthuisberg, B-3000 Leuven, BELGIUM FEATURES Location/Qualifiers source 1..1368 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Jurkat leukaemic T cell" gene 89..802 /gene="Icln" CDS 89..802 /gene="Icln" /codon_start=1 /product="Icln protein" /db_xref="PID:g1001875" /translation="MSFLKSFPPPGPAEGLLRQQPDTEAVLNGKGLGTGTLYIAESRL SWLDGSGLGFSLEYPTISLHALSRDRSDCLGEHLYVMVNAKFEEESKEPVADEEEEDS DDDVEPITEFRFVPSDKSALEAMFTAMCECQALHPDPEDEDSDDYDGEEYDVEAHEQG QGDIPTFYTYEEGLSHLTAEGQATLERLEGMLSQSVSSQYNMAGVRTEDSIRDYEDGM EVDTTPTVAGQFEDADVDH" polyA_signal 1346..1351 BASE COUNT 376 a 272 c 339 g 381 t ORIGIN 1 gtgactgcct cttccagggc gggcggtgtg gtgcacgcat tgctgtgctc caactccctc 61 agggcctgtg ttgccgcact ctgctgctat gagcttcctc aaaagtttcc cgccgcctgg 121 gccagcggag gggctcctgc ggcagcagcc agacactgag gctgtgctga acgggaaggg 181 cctcggcact ggtacccttt acatcgctga gagccgcctg tcttggttag atggctctgg 241 attaggattc tcactggaat accccaccat tagtttacat gcattatcca gggaccgaag 301 tgactgtcta ggagagcatt tgtatgttat ggtgaatgcc aaatttgaag aagaatcaaa 361 agaacctgtt gctgatgaag aagaggaaga cagtgatgat gatgttgaac ctattactga 421 atttagattt gtgcctagtg ataaatcagc gttggaggca atgttcactg caatgtgcga 481 atgccaggcc ttgcatccag atcctgagga tgaggattca gatgactacg atggagaaga 541 atatgatgtg gaagcacatg aacaaggaca gggggacatc cctacatttt acacctatga 601 agaaggatta tcccatctaa cagcagaagg ccaagccaca ctggagagat tagaaggaat 661 gctttctcag tctgtgagca gccagtataa tatggctggg gtcaggacag aagattcaat 721 aagagattat gaagatggga tggaggtgga taccacacca acagttgctg gacagtttga 781 ggatgcagat gttgatcact gaaaatgatt tatgcaagtt taagattctg ctcctaagtg 841 taggagagaa cttggtgcct cttccactct ggagtgaagt taatgaaagt ctttttcctt 901 ttccaaaacc caacctgaac cagttctttc ttgagacaga ctatactgag acaacaagtt 961 gtcaccagca gaagatagat aatatgacct ttattaactt gatgaattaa cttaaccaag 1021 agggtatttg tagtttacta tttaccctaa aactttctgt gtctgggtac cctctgagta 1081 ggcctataat tcctaccttg actgtgtgca tcatttgtaa gctagcagat ctatgtggtg 1141 aaaatgcaca ggagcttggt agactgcggg ggaaagagag agctcctttc gccatgtttt 1201 accagtctgc tgttataacc tcttaggttg tatcctttaa tttccagcct tttaggttag 1261 tttctgtaac agaacaagtg agtctgggat gaagtcctca aagtacttca aatggtaatt 1321 gttttgtttt tgtaatagct taacaaataa acctaggttt tctataaa // LOCUS HSID1 926 bp RNA PRI 19-JAN-1995 DEFINITION H.sapiens Id1 mRNA. ACCESSION X77956 NID g457784 KEYWORDS Id1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 926) AUTHORS Deed,R.W., Jasiok,M. and Norton,J.D. TITLE Nucleotide sequence of the cDNA encoding human helix-loop-helix Id-1 protein: identification of functionally conserved residues common to Id proteins JOURNAL Biochim. Biophys. Acta 1219 (1), 160-162 (1994) MEDLINE 94368847 REFERENCE 2 (bases 1 to 926) AUTHORS Deed,R. TITLE Direct Submission JOURNAL Submitted (25-FEB-1994) R. Deed, Paterson Institute for Cancer Research, Dept of Regulation, Christie Hospital NHS Trust, Wilmslow Road, Manchester, UK FEATURES Location/Qualifiers source 1..926 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /tissue_type="placenta" /clone_lib="lambda gt10" /clone="Id1" gene 36..500 /gene="Id1" CDS 36..500 /gene="Id1" /codon_start=1 /db_xref="PID:g457785" /db_xref="SWISS-PROT:P41134" /translation="MKVASGSTATAAAGPSCALKAGKTASGAGEVVRCLSEQSVAISR CRGAGARLPALLDEQQVNVLLYDMNGCYSRLKELVPTLPQNRKVSKVEILQHVIDYIR DLQLELNSESEVGTPGGRGLPVRAPLSTLNGEISALTAEAACVPADDRILCR" polyA_signal 893..898 BASE COUNT 193 a 262 c 281 g 190 t ORIGIN 1 ggggcccatt ctgtttcagc cagtcgccaa gaatcatgaa agtcgccagt ggcagcaccg 61 ccaccgccgc cgcgggcccc agctgcgcgc tgaaggccgg caagacagcg agcggtgcgg 121 gcgaggtggt gcgctgtctg tctgagcaga gcgtggccat ctcgcgctgc cggggcgccg 181 gggcgcgcct gcctgccctg ctggacgagc agcaggtaaa cgtgctgctc tacgacatga 241 acggctgtta ctcacgcctc aaggagctgg tgcccaccct gccccagaac cgcaaggtga 301 gcaaggtgga gattctccag cacgtcatcg actacatcag ggaccttcag ttggagctga 361 actcggaatc cgaagttggg acccccgggg gccgagggct gccggtccgg gctccgctca 421 gcaccctcaa cggcgagatc agcgccctga cggccgaggc ggcatgcgtt cctgcggacg 481 atcgcatctt gtgtcgctga agcgcctccc ccagggaccg gcggacccca gccatccagg 541 gggcaagagg aattacgtgc tctgtgggtc tcccccaacg cgcctcgccg gatctgaggg 601 agaacaagac cgatcggcgg ccactgcgcc cttaactgca tccagcctgg ggctgaggct 661 gaggcactgg cgaggagagg gcgctcctct ctgcacacct actagtcacc agagacttta 721 gggggtggga ttccactcgt gtgtttctat tttttgaaaa gcagacattt taaaaaatgg 781 tcacgtttgg tgcttctcag atttctgagg aaattgcttt gtattgtata ttacaatgat 841 caccgactga gaatattgtt ttacaatagt tctgtggggc tgtttttttg ttattaaaca 901 aataatttag atggtgaaaa aaaaaa // LOCUS HSIDEM 5397 bp RNA PRI 07-OCT-1996 DEFINITION H.sapiens mRNA for phosphatidylinositol 3 kinase gamma. ACCESSION X83368 NID g1507821 KEYWORDS phosphatidylinositol 3-kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5397) AUTHORS Stoyanov,B., Volinia,S., Hanck,T., Rubio,I., Loubtchenkov,M., Malek,D., Stoyanova,S., Vanhaesebroeck,B., Dhand,R., Nuernberg,B., Gierschik,P., Seedorf,K., Hsuan,J.J., Waterfield,M.D. and Wetzker,R. TITLE Cloning and Characterization of a G-Protein-Activated Human Phosphoinositide-3 kinase JOURNAL Science 269, 690-693 (1995) MEDLINE 95350661 REFERENCE 2 (bases 1 to 5397) AUTHORS Waterfield,M.D. TITLE Direct Submission JOURNAL Submitted (08-DEC-1994) M.D. Waterfield, Ludwig-Inst. for Cancer Research, Courtauld Building, 91 Riding House Street, London, W1P 8BT, UK REMARK Revised by [3] REFERENCE 3 (bases 1 to 5397) AUTHORS Waterfield,M.D. TITLE Direct Submission JOURNAL Submitted (23-AUG-1996) M.D. Waterfield, Ludwig-Inst. for Cancer Research, Courtauld Building, 91 Riding House Street, London, W1P 8BT, UK COMMENT X83368 is homologous to M93252. FEATURES Location/Qualifiers source 1..5397 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="U937" CDS 324..3629 /note="phosphatidylinositol 3 kinase gamma, p110 gamma; activated by G protein alpha and betagammma subunits" /codon_start=1 /product="idem" /db_xref="PID:e264520" /db_xref="PID:g1507822" /db_xref="SWISS-PROT:P48736" /translation="MELENYKQPVVLREDNCRRRRRMKPRSAASLSSMELIPIEFVLP TSQRKCKSPETALLHVAGHGNVEQMKAQVWLRALETSVAADFYHRLGPHHFLLLYQKK GQWYEIYDKYQVVQTLDCLRYWKATHRSPGQIHLVQRHPPSEESQAFQRQLTALIGYD VTDVSNVHDDELEFTRRGLVTPRMAEVASRDPKLYAMHPWVTSKPLPEYLWKKIANNC IFIVIHRSTTSQTIKVSPDDTPGAILQSFFTKMAKKKSLMDIPESQSEQDFVLRVCGR DEYLVGETPIKNFQWVRHCLKNGEEIHVVLDTPPDPALDEVRKEEWPLVDDCTGVTGY HEQLTIHGKDHESVFTVSLWDCDRKFRVKIRGIDIPVLPRNTDLTVFVEANIQHGQQV LCQRRTSPKPFTEEVLWNVWLEFSIKIKDLPKGALLNLQIYCGKAPALSSKASAESPS SESKGKVRLLYYVNLLLIDHRFLLRRGEYVLHMWQISGKGEDQGSFNADKLTSATNPD KENSMSISILLDNYCHPIALPKHQPTPDPEGDRVRAEMPNQLRKQLEAIIATDPLNPL TAEDKELLWHFRYESLKHPKAYPKLFSSVKWGQQEIVAKTYQLLARREVWDQSALDVG LTMQLLDCNFSDENVRAIAVQKLESLEDDDVLHYLLQLVQAVKFEPYHDSALARFLLK RGLRNKRIGHFLFWFLRSEIAQSRHYQQRFAVILEAYLRGCGTAMLHDFTQQVQVIEM LQKVTLDIKSLSAEKYDVSSQVISQLKQKLENLQNSQLPESFRVPYDPGLKAGALAIE KCKVMASKKKPLWLEFKCADPTALSNETIGIIFKHGDDLRQDMLILQILRIMESIWET ESLDLCLLPYGCISTGDKIGMIEIVKDATTIAKIQQSTVGNTGAFKDEVLNHWLKEKS PTEEKFQAAVERFVYSCAGYCVATFVLGIGDRHNDNIMITETGNLFHIDFGHILGNYK SFLGINKERVPFVLTPDFLFVMGTSGKKTSPHFQKFQDICVKAYLALRHHTNLLIILF SMMLMTGMPQLTSKEDIEYIRDALTVGKNEEDAKKYFLDQIEVCRDKGWTVQFNWFLH LVLGIKQGEKHSA" BASE COUNT 1534 a 1224 c 1207 g 1432 t ORIGIN 1 gaattcggca cgagcacttc cttctcggct agattatctg aaactgttgt cggttcttga 61 gatgatacta ccaccgaatg tctgtgtttc attgtctagt ccaacctgta ttgtggatat 121 ctacaacgtt ccggcaatag ttttgcaggt gcatcacatt tttgtttttg ttttgggagg 181 aaaagggagg gcacggcagc caggcttcat attcctacaa gtgcatgctt caagattact 241 gtacttacag tgtttccaac atcttctcat aaaaggggaa agcttcatag cctcaaccat 301 gaaggaaacc agtcgcatag ggcatggagc tggagaacta taaacagccc gtggtgctga 361 gagaggacaa ctgccgaagg cgccggagga tgaagccgcg cagtgctgcc agcctgtcct 421 ccatggagct catccccatc gagttcgtgc tgcccaccag ccagcgcaaa tgcaagagcc 481 ccgaaacggc gctgctgcac gtggccggcc acggcaacgt ggagcagatg aaggcccagg 541 tgtggctgcg agcgctggag accagcgtgg cggcggactt ctaccaccgg ctgggaccgc 601 atcacttcct cctgctctat cagaagaagg ggcagtggta cgagatctac gacaagtacc 661 aggtggtgca gactctggac tgcctgcgct actggaaggc cacgcaccgg agcccgggcc 721 agatccacct ggtgcagcgg cacccgccct ccgaggagtc ccaagccttc cagcggcagc 781 tcacggcgct gattggctat gacgtcactg acgtcagcaa cgtgcacgac gatgagctgg 841 agttcacgcg ccgtggcttg gtgaccccgc gcatggcgga ggtggccagc cgcgacccca 901 agctctacgc catgcacccg tgggtgacgt ccaagcccct cccggagtac ctgtggaaga 961 agattgccaa caactgcatc ttcatcgtca ttcaccgcag caccaccagc cagaccatta 1021 aggtctcacc cgacgacacc cccggcgcca tcctgcagag cttcttcacc aagatggcca 1081 agaagaaatc tctgatggat attcccgaaa gccaaagcga acaggatttt gtgctgcgcg 1141 tctgtggccg ggatgagtac ctggtgggcg aaacgcccat caaaaacttc cagtgggtga 1201 ggcactgcct caagaacgga gaagagattc acgtggtact ggacacgcct ccagacccgg 1261 ccctagacga ggtgaggaag gaagagtggc cgctggtgga cgactgcacg ggagtcaccg 1321 gctaccatga gcagcttacc atccacggca aggaccacga gagtgtgttc accgtgtccc 1381 tgtgggactg cgaccgcaag ttcagggtca agatcagagg cattgatatc cccgtcctgc 1441 ctcggaacac cgacctcaca gtttttgtag aggcaaacat ccagcatggg caacaagtcc 1501 tttgccaaag gagaaccagc cccaaaccct tcacagagga ggtgctgtgg aatgtgtggc 1561 ttgagttcag tatcaaaatc aaagacttgc ccaaaggggc tctactgaac ctccagatct 1621 actgcggtaa agctccagca ctgtccagca aggcctctgc agagtccccc agttctgagt 1681 ccaagggcaa agttcggctt ctctattatg tgaacctgct gctgatagac caccgtttcc 1741 tcctgcgccg tggagaatac gtcctccaca tgtggcagat atctgggaag ggagaagacc 1801 aaggaagctt caatgctgac aaactcacgt ctgcaactaa cccagacaag gagaactcaa 1861 tgtccatctc cattcttctg gacaattact gccacccgat agccctgcct aagcatcagc 1921 ccacccctga cccggaaggg gaccgggttc gagcagaaat gcccaaccag cttcgcaagc 1981 aattggaggc gatcatagcc actgatccac ttaaccctct cacagcagag gacaaagaat 2041 tgctctggca ttttagatac gaaagcctta agcacccaaa agcatatcct aagctattta 2101 gttcagtgaa atggggacag caagaaattg tggccaaaac ataccaattg ttggccagaa 2161 gggaagtctg ggatcaaagt gctttggatg ttgggttaac aatgcagctc ctggactgca 2221 acttctcaga tgaaaatgta agagccattg cagttcagaa actggagagc ttggaggacg 2281 atgatgttct gcattacctt ctacaattgg tccaggctgt gaaatttgaa ccataccatg 2341 atagcgccct tgccagattt ctgctgaagc gtggtttaag aaacaaaaga attggtcact 2401 ttttgttttg gttcttgaga agtgagatag cccagtccag acactatcag cagaggttcg 2461 ctgtgattct ggaagcctat ctgaggggct gtggcacagc catgctgcac gactttaccc 2521 aacaagtcca agtaatcgag atgttacaaa aagtcaccct tgatattaaa tcgctctctg 2581 ctgaaaagta tgacgtcagt tcccaagtta tttcacaact taaacaaaag cttgaaaacc 2641 tgcagaattc tcaactcccc gaaagcttta gagttccata tgatcctgga ctgaaagcag 2701 gagcgctggc aattgaaaaa tgtaaagtaa tggcctccaa gaaaaaacca ctatggcttg 2761 agtttaaatg tgccgatcct acagccctat caaatgaaac aattggaatt atctttaaac 2821 atggtgatga tctgcgccaa gacatgctta ttttacagat tctacgaatc atggagtcta 2881 tttgggagac tgaatctttg gatctatgcc tcctgccata tggttgcatt tcaactggtg 2941 acaaaatagg aatgatcgag attgtgaaag acgccacgac aattgccaaa attcagcaaa 3001 gcacagtggg caacacggga gcatttaaag atgaagtcct gaatcactgg ctcaaagaaa 3061 aatcccctac tgaagaaaag tttcaggcag cagtggagag atttgtttat tcctgtgcag 3121 gctactgtgt ggcaaccttt gttcttggaa taggcgacag acacaatgac aatattatga 3181 tcaccgagac aggaaaccta tttcatattg acttcgggca cattcttggg aattacaaaa 3241 gtttcctggg cattaataaa gagagagtgc catttgtgct aacccctgac ttcctctttg 3301 tgatgggaac ttctggaaag aagacaagcc cacacttcca gaaatttcag gacatctgtg 3361 ttaaggctta tctagccctt cgtcatcaca caaacctact gatcatcctg ttctccatga 3421 tgctgatgac aggaatgccc cagttaacaa gcaaagaaga cattgaatat atccgggatg 3481 ccctcacagt ggggaaaaat gaggaggatg ctaaaaagta ttttcttgat cagatcgaag 3541 tttgcagaga caaaggatgg actgtgcagt ttaattggtt tctacatctt gttcttggca 3601 tcaaacaagg agagaaacat tcagcctaat actttaggct agaatcaaaa acaagttagt 3661 gttctatggt ttaaattagc atagcaatca tcgaacttgg atttcaaatg caatagacat 3721 tgtgaaagct ggcatttcag aagtatagct cttttcctac ctgaactctt ccctggagaa 3781 aagatgttgg cattgctgat tgtttggtta agcaatgtcc agtgctagga ttatttgcag 3841 gtttggtttt ttctcatttg tctgtggcat tggagaatat tctcggttta aacagactaa 3901 tgacttcctt attgtccctg atattttgac tatcttacta ttgagtgctt ctggaaattc 3961 tttggaataa ttgatgacat ctattttcat ctgggtttag tctcaatttt ggttatcttt 4021 gtgttcctca agctctttaa agaaaaagat gtaatcgttg taacctttgt ctcattcctt 4081 aaatgatgct tccaaacatc tccttagtgt ctgcaggtgt tagtggtgtg ctaaaagcaa 4141 ggaaagcgag ttagtctttt cagtgtcttt tgcaattcaa ttcttttgtc atgtataact 4201 gagacacaca aacacagcag gagaaatcta aaccgttgtg ccttgacctt cctctgctgg 4261 tcttgttcca gggttatgaa tatgaaaaaa tagagatgag actttttgtg tcaactctgt 4321 ccacaagagt gagttatcta gtatgattag tatagctttc tccagcatgg cagcaggaag 4381 taactacagg gcctctttta tgcctgacat ttcttccctt cctttttccc tgcctccctt 4441 tttcatcaat tgcaatgctc ccacaactct ttacagactt gtgaaatctt caagaacacc 4501 tttactctat aactcaaaaa ttagttgaaa aataattact tctcaaggat tattagaatc 4561 ttaggtactt atttgtaaag atgtttagtg actttttttt caagtatcta taaaggaggc 4621 agattctaga aaatatgaat tagtttccaa atgccttaat tttaaacttt ggcctgaaca 4681 gttttttctt tttcttaatg gaagaagata tttaatatct taaaaatatt ccaagttagg 4741 aagaacacta cttgccttat ccatttccca tttaaaggac ttttaaactt tgacacagtc 4801 cttcagattt cctgaaaatc cttgaaatat cttactttaa aaatattttc atctctgaaa 4861 tatctcgtta tttattggag gtattgttta accttagata gaccattaaa ttatttataa 4921 aatattttgt aattactgta gctaatacat tacatagaaa aaactatgtt aacagtgtct 4981 ctgtttaagt ataatcagat ataaatatat aacttaattt tttaatttta aaaaatagat 5041 acctgtttga ctttgaggta gtccaggcct ttttcttttt tttttttttt aatgtgtgca 5101 aaagcccaaa ggttcctaag cctggctgca aagaagaatc aacagggaca ctttttaaaa 5161 acactcttat cagcctgggg caacacagtg agactccatc tcttaaaaaa aaaattagct 5221 gggtatagtg gtatgtgcct gtagtcccag gtactcagga ggctgaggca ggaggattgc 5281 ctgagcccag gaggtggaaa ctgcagagag tcatgatcat gtccttacac tccagcctgg 5341 ataacagagc gagaccctgt ctcaaaaaaa aaaaaaaaaa aaaaaaaaaa actcgag // LOCUS HSIDNADP 1751 bp RNA PRI 26-JUN-1995 DEFINITION H.sapiens mRNA for mitochondrial isocitrate dehydrogenase (NADP+). ACCESSION X69433 NID g872120 KEYWORDS isocitrate dehydrogenase (NADP+). SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1751) AUTHORS Song,B.J. TITLE Direct Submission JOURNAL Submitted (24-NOV-1992) B.J. Song, National Institute of Health, Laboratory of Metabolism & Molec Biology, NIAAA, 12501 Washington Avenue, Rockville, MD 20852, USA REFERENCE 2 (bases 1 to 1751) AUTHORS Huh,T.L., Oh,I.U., Kim,Y.O., Huh,J.W. and Song,B.J. TITLE Characterization of a cDNA clone for human mitochondrial NADP+-specific isocitrate dehydrogenase JOURNAL Unpublished FEATURES Location/Qualifiers source 1..1751 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="heart" /clone_lib="lambda gt11" CDS 87..1445 /EC_number="1.1.1.42" /codon_start=1 /product="isocitrate dehydrogenase (NADP+)" /db_xref="PID:g872121" /db_xref="SWISS-PROT:P48735" /translation="MAGYLRVVRSLCRASGSRPAWAPAALTAPTSQEHPRRHYADKRI KVAKPVVEMDGDEMTRIIWQFIKEKLILPHVDIQLKYFDLGLPNRDQTDDQVTIDSAL ATQKYSVAVKCATITPDEARVEEFKLKKMWKSPNGTIRNILGGTVFREPIICKNIPRL VPGWTKPITIGRHAHGDQYKATDFVADRAGTFKMVFTPKDGSGVKEWEVYNFPAGGVG MGMYNTDESISGFAHSCFQYAIQKKWPLYMSTKNTILKAYDGRFKDIFQEIFDKHYKT DFDKNKIWYEHRLIDDMVAQVLKSSGGFVWACKNYDGDVQSDILAQGFGSLGLMTSVL VCPDGKTIEAEAAHGTVTRHYREHQKGRPTSTNPIASIFAWTRGLEHRGKLDGNQDLI RFAQMLEKVCVETVESGAMTKDLAGCIHGLSNVKLNEHFLNTMDFLDTIKSNLDRALG RQ" BASE COUNT 398 a 497 c 513 g 343 t ORIGIN 1 ccagcgttag cccgcggcca ggcagccggg aggagcggcg cgcgctcgga cctctcccgc 61 cctgctcgtt cgctctccag cttgggatgg ccggctacct gcgggtcgtg cgctcgctct 121 gcagagcctc aggctcgcgg ccggcctggg cgccggcggc cctgacagcc cccacctcgc 181 aagagcatcc gcggcgccac tatgccgaca aaaggatcaa ggtggcgaag cccgtggtgg 241 agatggatgg tgatgagatg acccgtatta tctggcagtt catcaaggag aagctcatcc 301 tgccccacgt ggacatccag ctaaagtatt ttgacctcgg gctcccaaac cgtgaccaga 361 ctgatgacca ggtcaccatt gactctgcac tggccaccca gaagtacagt gtggctgtca 421 agtgtgccac catcacccct gatgaggccc gtgtggaaga gttcaagctg aagaagatgt 481 ggaaaagtcc caatggaact atccggaaca tcctgggggg gactgtcttc cgggagccca 541 tcatctgcaa aaacatccca cgcctagtcc ctggctggac caagcccatc accattggca 601 ggcacgccca tggcgaccag tacaaggcca cagactttgt ggcagaccgg gccggcactt 661 tcaaaatggt cttcacccca aaagatggca gtggtgtcaa ggagtgggaa gtgtacaact 721 ttcccgcagg cggcgtgggc atgggcatgt acaacaccga cgagtccatc tcaggttttg 781 cgcacagctg cttccagtat gccatccaga agaaatggcc gctgtacatg agcaccaaga 841 acaccatact gaaagcctac gatgggcgtt tcaaggacat cttccaggag atctttgaca 901 agcactataa gaccgacttc gacaagaata agatctggta tgagcaccgg ctcattgatg 961 acatggtggc tcaggtcctc aagtcttcgg gtggctttgt gtgggcctgc aagaactatg 1021 acggagatgt gcagtcagac atcctggccc agggctttgg ctcccttggc ctgatgacgt 1081 ccgtcctggt ctgccctgat gggaagacga ttgaggctga ggccgctcat gggaccgtca 1141 cccgccacta tcgggagcac cagaagggcc ggcccaccag caccaacccc atcgccagca 1201 tctttgcctg gacacgtggc ctggagcacc gggggaagct ggatgggaac caagacctca 1261 tcaggtttgc ccagatgctg gagaaggtgt gcgtggagac ggtggagagt ggagccatga 1321 ccaaggacct ggcgggctgc attcacggcc tcagcaatgt gaagctgaac gagcacttcc 1381 tgaacaccat ggacttcctc gacaccatca agagcaacct ggacagagcc ctgggcaggc 1441 agtaggggga ggcgccaccc atggctgcag tggaggggcc agggctgagc cggcgggtcc 1501 tcctgagcgc ggcagagggt gagcctcaca gcccctctct ggaggccttt ctaggggatg 1561 tttttttata agccagatgt ttttaaaagc atatgtgtgt ttcccctcat ggtgacgtga 1621 ggcaggagca gtgcgtttta cctcagccag tcagtatgtt ttgcatactg taatttatat 1681 tgcccttgga acacatggtg ccatatttag ctactaaaaa gctcttcaca aaaaaaaaaa 1741 cggattccgc g // LOCUS HSIDO 1475 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for indoleamine 2,3-dioxygenase. ACCESSION X17668 NID g32629 KEYWORDS dioxygenase; indoleamine 2,3-dioxygenase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1475) AUTHORS Tone,S. TITLE Direct Submission JOURNAL Submitted (21-NOV-1989) Tone S., Wakayama Medical College, 27 Kyubancho, Wakayama 640, Japan REFERENCE 2 (bases 1 to 1475) AUTHORS Tone,S., Takikawa,O., Habara-Ohkubo,A., Kadoya,A., Yoshida,R. and Kido,R. TITLE Primary structure of human indoleamine 2,3-dioxygenase deduced from the nucleotide sequence of its cDNA JOURNAL Nucleic Acids Res. 18 (2), 367 (1990) MEDLINE 90221825 FEATURES Location/Qualifiers source 1..1475 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="lung" /cell_type="fibroblast" /cell_line="HEL cell (Flow 2000)" /clone_lib="lambda gt11" CDS 23..1234 /note="indoleamine 2,3-dioxygenase" /codon_start=1 /db_xref="PID:g32630" /db_xref="SWISS-PROT:P14902" /translation="MAHAMENSWTISKEYHIDEEVGFALPNPQENLPDFYNDWMFIAK HLPDLIESGQLRERVEKLNMLSIDHLTDHKSQRLARLVLGCITMAYVWGKGHGDVRKV LPRNIAVPYCQLSKKLELPPILVYADCVLANWKKKDPNKPLTYENMDVLFSFRDGDCS KGFFLVSLLVEIAAASAIKVIPTVFKAMQMQERDTLLKALLEIASCLEKALQVFHQIH DHVNPKAFFSVLRIYLSGWKGNPQLSDGLVYEGFWEDPKEFAGGSAGQSSVFQCFDVL LGIQQTAGGGHAAQFLQDMRRYMPPAHRNFLCSLESNPSVREFVLSKGDAGLREAYDA CVKALVSLRSYHLQIVTKYILIPASQQPKENKTSEDPSKLEAKGTGGTDLMNFLKTVR STTEKSLLKEG" BASE COUNT 440 a 319 c 328 g 388 t ORIGIN 1 cccagaggag cagactacaa gaatggcaca cgctatggaa aactcctgga caatcagtaa 61 agagtaccat attgatgaag aagtgggctt tgctctgcca aatccacagg aaaatctacc 121 tgatttttat aatgactgga tgttcattgc taaacatctg cctgatctca tagagtctgg 181 ccagcttcga gaaagagttg agaagttaaa catgctcagc attgatcatc tcacagacca 241 caagtcacag cgccttgcac gtctagttct gggatgcatc accatggcat atgtgtgggg 301 caaaggtcat ggagatgtcc gtaaggtctt gccaagaaat attgctgttc cttactgcca 361 actctccaag aaactggaac tgcctcctat tttggtttat gcagactgtg tcttggcaaa 421 ctggaagaaa aaggatccta ataagcccct gacttatgag aacatggacg ttttgttctc 481 atttcgtgat ggagactgca gtaaaggatt cttcctggtc tctctattgg tggaaatagc 541 agctgcttct gcaatcaaag taattcctac tgtattcaag gcaatgcaaa tgcaagaacg 601 ggacactttg ctaaaggcgc tgttggaaat agcttcttgc ttggagaaag cccttcaagt 661 gtttcaccaa atccacgatc atgtgaaccc aaaagcattt ttcagtgttc ttcgcatata 721 tttgtctggc tggaaaggca acccccagct atcagacggt ctggtgtatg aagggttctg 781 ggaagaccca aaggagtttg cagggggcag tgcaggccaa agcagcgtct ttcagtgctt 841 tgacgtcctg ctgggcatcc agcagactgc tggtggagga catgctgctc agttcctcca 901 ggacatgaga agatatatgc caccagctca caggaacttc ctgtgctcat tagagtcaaa 961 tccctcagtc cgtgagtttg tcctttcaaa aggtgatgct ggcctgcggg aagcttatga 1021 cgcctgtgtg aaagctctgg tctccctgag gagctaccat ctgcaaatcg tgactaagta 1081 catcctgatt cctgcaagcc agcagccaaa ggagaataag acctctgaag acccttcaaa 1141 actggaagcc aaaggaactg gaggcactga tttaatgaat ttcctgaaga ctgtaagaag 1201 tacaactgag aaatcccttt tgaaggaagg ttaatgtaac ccaacaagag cacattttat 1261 catagcagag acatctgtat gcattcctgt cattacccat tgtaacagag ccacaaacta 1321 atactatgca atgttttacc aataatgcaa tacaaaagac ctcaaaatac ctgtgcattt 1381 cttgtaggaa aacaacaaaa ggtaattatg tgtaattata ctagaagttt tgtaatctgt 1441 atcttatcat tggaataaaa tgacattcaa taaat // LOCUS HSIEF7442 1943 bp RNA PRI 13-MAY-1993 DEFINITION H.sapiens IEF 7442 mRNA. ACCESSION X72841 NID g297903 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1943) AUTHORS Nielsen,M.S., Rasmussen,H.H., Dejgaard,K., Celis,J.E. and Leffers,H. TITLE Molecular cloning and expression of two novel human cDNAs coding for proteins containing WD-40 repeats and sharing similarity to yeast MSI1 a negative regulator of the RAS-cAMP pathway JOURNAL Unpublished REFERENCE 2 (bases 1 to 1943) AUTHORS Leffers,H. TITLE Direct Submission JOURNAL Submitted (06-MAY-1993) H. Leffers, Institute of Medical Biochemistry &, Danish Centre for Human Genome Research, Ole Worms Alle 170, Aarhus University, 8000 Aarhus C, DENMARK FEATURES Location/Qualifiers source 1..1943 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="AMA" gene 286..1563 /gene="7442" CDS 286..1563 /gene="7442" /codon_start=1 /product="IEF 7442" /db_xref="PID:g297904" /translation="MASKEMFEDTVEERVINEEYKIWKKNTPFLYDLVMTHALQWPSL TVQWLPEVTKPEGKDYALHWLVLGTHTSDEQNHLVVARVHIPNDDAQFDASHCDSDKG EFGGFGSVTGKIECEIKINHEGEVNRARYMPQNPHIIATKTPSSDVLVFDYTKHPAKP DPSGECNPDLRLRGHQKEGYGLSWNSNLSGHLLSASDDHTVCLWDINAGPKEGKIVDA KAIFTGHSAVVEDVAWHLLHESLFGSVADDQKLMIWDTRSNTTSKPSHLVDAHTAEVN CLSFNPYSEFILATGSADKTVALWDLRNLKLKLHTFESHKDEIFQVHWSPHNETILAS SGTDRRLNVWDLSKIGEEQSAEDAEDGPPELLFIHGGHTAKISDFSWNPNEPWVICSV SEDNIMQIWQMAENIYNDEESDVTTSELEGQGS" polyA_signal 1928..1933 BASE COUNT 551 a 398 c 490 g 504 t ORIGIN 1 gcctcgtcag ctgcctgggc gggctgggag gcgcgggttg aaaagtctcg ttccaagttt 61 ggagagagag agaagagcgc ctcagacctc ggtacccgcg agcggggagg aggcaggaaa 121 gaaggacgcg gcgtctgggg agcacccagg cagcaagacg gggcccgggc tttcgacagt 181 ggggagtgtg acgcgcttgg gaaaggcagg agcgccacgt cgggctgctc ttggctaacg 241 agaggagtcc gaggcggcgg cgaggggcga acgacccgac gcaagatggc gagtaaagag 301 atgtttgaag atactgtgga ggagcgtgtc atcaatgaag aatataaaat ctggaagaag 361 aatacaccgt ttctatatga cctggttatg acccatgctc ttcagtggcc cagtcttacc 421 gttcagtggc ttcctgaagt gactaaacct gaaggaaaag attatgccct tcattggcta 481 gtgctgggga ctcatacgtc tgatgagcag aatcatctgg tggttgctcg agtacatatt 541 cccaatgatg atgcacagtt tgatgcttcc cattgtgaca gtgacaaggg tgaatttggt 601 ggctttggtt ctgtaacagg aaaaattgaa tgtgaaatta aaatcaatca cgaaggagaa 661 gtaaaccgtg ctcgttacat gccgcagaat cctcacatca ttgctacaaa aacaccatct 721 tctgatgtgt tggtttttga ctatacaaaa caccctgcta aaccagaccc aagtggagaa 781 tgtaatcctg atctcagatt aagaggtcac cagaaggaag gctatggtct ctcctggaat 841 tcaaatttga gtggacatct cctaagtgca tctgatgacc atactgtttg tctgtgggat 901 ataaacgcag gaccaaaaga aggcaaaatt gtggatgcta aagccatctt tactggccac 961 tcagctgttg tagaggatgt ggcctggcac ctgctgcacg agtcattgtt tggatctgtt 1021 gctgatgatc agaaacttat gatatgggac accaggtcca ataccacctc caagccgagt 1081 cacttggtgg atgcgcacac tgccgaagtc aactgcctct cattcaatcc ctacagcgaa 1141 tttattctag ccaccggctc tgcggataag accgtagctt tatgggatct gcgtaactta 1201 aaattaaaac tccatacctt cgaatctcat aaagatgaaa ttttccaggt ccactggtct 1261 ccacataatg aaactattct ggcttcaagt ggtactgacc gccgcctgaa tgtgtgggat 1321 ttaagtaaaa ttggggaaga acaatcagca gaagatgcag aagatgggcc tccagaactc 1381 ctgtttattc atggaggaca cactgctaag atttcagatt ttagctggaa ccccaatgag 1441 ccttgggtca tttgctcagt gtctgaggat aacatcatgc agatatggca aatggctgaa 1501 aatatttaca atgatgaaga gtcagatgtc acgacatccg aactggaggg acaaggatct 1561 taaacccaaa gtacgagaaa tgtttctgtt gaatgtaatg ctacatgaat gcttgattta 1621 tcaagcgcca aaaaggcatt gtatagtagg aaatgtaagt ggggtggctt atggcttctt 1681 tatcctctga ttctagcatt tcaagtgagc tgttgcgtac tgtatcatat tgtagctatt 1741 agggaagaga agaatgttgc ttaagaaaga acatcaccat tgattttaaa tacaagtagc 1801 agggtattgc ctttgattca actgttttaa gtcctcattt tctcaaacta agtgcttgct 1861 gttcccaaat atgcaagaat aacttttaca ctttttcctt ccaacacttc ttgattggct 1921 ttgcagaaat aaagttttaa aat // LOCUS HSIEF9306 1417 bp RNA PRI 27-MAY-1993 DEFINITION H.sapiens IEF 9306 mRNA. ACCESSION X71810 NID g297905 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1417) AUTHORS Nielsen,M.S., Rasmussen,H.H., Celis,J.E. and Leffers,H. TITLE Molecular cloning and expression of two novel human cDNAs encoding proteins containing WD-40 repeats and sharing similarity to yeast MSI1 a negative regulation of the RAS-cAMP pathway JOURNAL Unpublished REFERENCE 2 (bases 1 to 1417) AUTHORS Nielsen,M.S. TITLE Direct Submission JOURNAL Submitted (06-MAY-1993) M.S. Nielsen, Institute of Medicine Biochemistry, Ole Worms Alle Building 170, Aarhus University, 8000 Aarhus C, DENMARK FEATURES Location/Qualifiers source 1..1417 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="AMA" gene 74..1306 /gene="9306" CDS 74..1306 /gene="9306" /codon_start=1 /product="IEF SSP 9306" /db_xref="PID:g297906" /translation="MADKEAAFDDAVEERVINEEYKIWKKNTPFLYDLVMTHALEWPS LTAQWLPDVTRPEGKDFSIHRLVLGTHTSDEQNHLVIASVQLPNDDAQFDASHYDSEK GEFGGFGSVSGKIEIEIKINHEGEVNRARYMPQNPCIIATKTPSSDVLVFDYTKHPSK PDPSGECNPDLRLRGHQKEGYGLSWNPNLSGHLLSASDDHTICLWDISAVPKEGKVVD AKTIFTGHTAVVEDVSWHLLHESLFGSVADDQKLMIWDTRSNNTSKPSHSVDAHTAEV NCLSFNPYSEFILATGSADKTVALWDLRNLKLKLHSFESHKDEIFQVQWSPHNETILA SSGTDRRLNVWDLSKIGEEQSPEDAEDGPPELLFIHGGHTAKISDFSWNPNEPWVICS VSEDNIMQVWQMELVLDH" polyA_signal 1392..1397 BASE COUNT 392 a 331 c 322 g 372 t ORIGIN 1 agcgagctct tgcagcctcc ccgcccctcc cgcaacgctc gaccccagga ttcccccggc 61 tcgcctgccc gccatggccg acaaggaagc agccttcgac gacgcagtgg aagaacgagt 121 gatcaacgag gaatacaaaa tatggaaaaa gaacacccct tttctttatg atttggtgat 181 gacccatgct ctggagtggc ccagcctaac tgcccagtgg cttccagatg taaccagacc 241 agaagggaaa gatttcagca ttcatcgact tgtcctgggg acacacacat cggatgaaca 301 aaaccatctt gttatagcca gtgtgcagct ccctaatgat gatgctcagt ttgatgcgtc 361 acactacgac agtgagaaag gagaatttgg aggttttggt tcagttagtg gaaaaattga 421 aatagaaatc aagatcaacc atgaaggaga agtaaacagg gcccgttata tgccccagaa 481 cccttgtatc atcgcaacaa agactccttc cagtgatgtt cttgtctttg actatacaaa 541 acatccttct aaaccagatc cttctggaga gtgcaaccca gacttgcgtc tccgtggaca 601 tcagaaggaa ggctatgggc tttcttggaa cccaaatctc agtgggcact tacttagtgc 661 ttcagatgac cataccatct gcctgtggga catcagtgcc gttccaaagg agggaaaagt 721 ggtagatgcg aagaccatct ttacagggca tacggcagta gtagaagatg tttcctggca 781 tctactccat gagtctctgt ttgggtcagt tgctgatgat cagaaactta tgatttggga 841 tactcgttca aacaatactt ccaaaccaag ccactcagtt gatgctcaca ctgctgaagt 901 gaactgcctt tctttcaatc cttatagtga gttcattctt gccacaggat cagctgacaa 961 gactgttgcc ttgtgggatc tgagaaatct gaaacttaag ttgcattcct ttgagtcaca 1021 taaggatgaa atattccagg ttcagtggtc acctcacaat gagactattt tagcttccag 1081 tggtactgat cgcagactga atgtctggga tttaagtaaa attggagagg aacaatcccc 1141 agaagatgca gaagacgggc caccagagtt gttgtttatt catggtggtc atactgccaa 1201 gatatctgat ttctcctgga atcccaatga accttgggtg atttgttctg tatcagaaga 1261 caatatcatg caagtgtggc aaatggagtt agtccttgac cactagtttg atgccatctc 1321 cattttgggt gacctgtttc accagcaggc ctgttactct ccatgactaa ctgtgtaagt 1381 gcttaaaatg gaataaattg cttttctaca taacccc // LOCUS HSIFD1 9937 bp DNA PRI 04-AUG-1994 DEFINITION Human interferon genes LeIF-L and LeIF-J and pseudogene LeIF-M with intergenic regions. These genes are located on chromosome 9. ACCESSION V00531 J00217 NID g32631 KEYWORDS interferon; pseudogene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 9937) AUTHORS Ullrich,A., Gray,A., Goeddel,D.V. and Dull,T.J. TITLE Nucleotide sequence of a portion of human chromosome 9 containing a leukocyte interferon gene cluster JOURNAL J. Mol. Biol. 156 (3), 467-486 (1982) MEDLINE 83010248 FEATURES Location/Qualifiers source 1..9937 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="lambda-hleif-1" /dev_stage="foetus" /tissue_type="liver" /map="9p22" mRNA <1..>425 /gene="IFNA" /note="pseudo-IFN-alpha-m processed pseudogene" gene 1..7764 /gene="IFNA" CDS <1..153 /gene="IFNA" /note="pseudo-IFN-alpha-m" /pseudo /codon_start=1 repeat_region 704..5233 /note="whole repeat unit of IFNA-L" mRNA 2194..3178 /gene="IFNA" /note="putative" CDS 2262..2831 /gene="IFNA" /pseudo repeat_region 5672..9634 /note="whole repeat unit of IFNA-J" mRNA 7128..8127 /gene="9p22" /note="mRNA of IFNA-J" gene 7128..8127 /gene="9p22" CDS 7195..7764 /gene="IFNA" /note="precursor from leukocyte" /codon_start=1 /product="interferon alpha-j" /db_xref="PID:g32632" /db_xref="SWISS-PROT:P01567" /translation="MARSFSLLMVVLVLSYKSICSLGCDLPQTHSLRNRRALILLAQM GRISPFSCLKDRHEFRFPEEEFDGHQFQKTQAISVLHEMIQQTFNLFSTEDSSAAWEQ SLLEKFSTELYQQLNDLEACVIQEVGVEETPLMNEDFILAVRKYFQRITLYLMEKKYS PCAWEVVRAEIMRSFSFSTNLKKGLRRKD" BASE COUNT 3489 a 1637 c 1732 g 3077 t 2 others ORIGIN 1 ccttgaggag gtacttcnag ggntccatgg gaatccagag aatctacctg aaagagaaga 61 aatacagtga ctgtgcttgg gaggttgtca gagtggaatc atgaaatcct tctcttcatc 121 aacagacttg caaggactga gaagtaagga tgaagacctg gggtctgctt tagtctttct 181 tattttcttc ctcttcctag ctgtgtgttt atttcttctt ttctagttcc ttaacttgta 241 aagttagttc attggtttga ggtctttctt cttttttaat ataagctttt acagctttca 301 atttcccctt tagctctgtt ttcactgcat cacatacgtc ttggtatgtt gtgttttcat 361 tttcacctct ctcaagatat tttctaagtt cccctctggt catctttttt ttttttgtac 421 tatactttaa gttccaggtt aaatgtgctc aacgtgcaag tttgttacat aggtatacat 481 ttgccatgtt ggtttgctgc acccattaac ttgtcattta cattaggtat ttctcctctt 541 acatgtttat cacatttaga catacatttg ttttcttttt gttcccataa attaagatga 601 aaatcagacc acttttacct tctaggaaaa gtgaagtgag aaatataaat atatttgctg 661 ttgtgaatgc catatagaac cattgtatag tccatttaaa aatgagaata aatgagaaat 721 tagtaaaaac accacttact aaatagctga tttgctaaag cagacctcat tccatttaag 781 gactcagtat ctatagggcc tacttataca aaaaaaaact tcttacacaa aaaaaaaaaa 841 aaaaaaaagt tagagccaga gttcaggatc atgctgaaag ttatttcttt gcataataat 901 attcaatatt tataaattta tgaatttaga acaaagatgg tcttttttat ttgataagaa 961 ttgacttgga taggaacttc tgaaaacctt tagggaatat gaacttcaat gtaaaatgcc 1021 aaaaatgatt taaatcatac tattttctaa gtcatatatg tttattggat tgatacttct 1081 tttaagggta caaaaattag ttctcgtagt gtaaatgaat ctaacatatt acaatagttt 1141 ctgactttcc aacaactcta tccaacaaaa ttttattgct taatatacat atttctcatt 1201 gggttttttt gtgtatgata tgagaagcac tggtattgag ttcatgaaga taaacaaaat 1261 atttgtaaga ccaatgttac aaacctatag caaatagatg actgtgattg gaggactttt 1321 tgtccatttt ttgctggatc ttaaagtctt atcacagtat gtggctttaa cctgcatatc 1381 tttgggctgc cattgactat cttatagtta ttagttatgt ttgatcctca gttcttcagg 1441 atgtttggta gactttgaga attcaatcca aatagcttac attatatgtt ttatttctac 1501 taaagttatt caatacatca gtacttgtgt caagtgctga aaagaaaaaa gttttggcaa 1561 tatctggatg aatactgcag ctggtgaatt tacaaattat tttctcatat aaagcaaaat 1621 tcaaagcttc atacactaag agaaaaattt taaaaaatta ttgattcata tttttaggag 1681 ttttgaatga ttaggtaggt aactacattc atattattaa tgtgtattat atagattttt 1741 attttgcata tgtactttga tacaaaattt gcatgaacaa attatactaa aagttattcc 1801 acaaatatac ttatcaaatt aaaataaatg tcaatagctt ttaaacttag attttagttt 1861 aacttttctg tcattcttaa ctttactttg aataaaaaga gcaaactttg tagtttttat 1921 ctgtgaagta gaggtatacg taatatacat aaatagatat gccaaatctg tgttactaaa 1981 atttcatgaa gatttcaatt aaaaaaaaac cataaaaggc tttgagtgca ggtgaaaaat 2041 aggcaatgat gaaaaaaaac gaaaaacttt ttaaacacat ggagagagta cataaagaaa 2101 gcaaaaacag agatagaaag taaaactagg gcatttagaa aatggaaatt agtatgttca 2161 ctatttaaga cctatgcaca gagcaaagtc ttcagaaaac ctagaggccg aagttcaagg 2221 ttatccatct caagtagcct agcaatattt gcaacatccc aatggccctg tccttttctt 2281 tacttatggc cgtgctggtg ctcagctaca aatccatctg atctctgggc tgtgatctgc 2341 ctcagaccca caccctgcgt aataggaggg ccttgatact cctgggacaa atgggaagaa 2401 tctctccttt ctcctgcctg aaggacagac atgatttccg aatcccccag gaggagtttg 2461 atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg atccagcaga 2521 ccttcaatct cttcagcaca gaggactcat ctgctgcttg ggaacagagc ctcctagaaa 2581 aattttccac tgaaatttac cagcaactga atgacctgga agcatgtgtg atacaggagg 2641 ttggggtgga agagactccc ctgatgaatg aggactccat cctggctgtg aggaaatact 2701 tccaaagaat cactctttat ctaatagaga ggaaatacag cccttgtgcc tgggaggttg 2761 tcagagcaga aatcatgaga tccctctcgt tttcaacaaa cttgcaaaaa agattaagga 2821 ggaaggattg aaaactggtt caacatggca atgatcctga ttgactaata cattatctca 2881 cactttcatg agttcttcca tttcaaagac tcacttctat aaccacgacg tgttgaatca 2941 aaattttcaa atgttttcag cagtgtaaag aagtgtcgtg tatacctgtg caggcactag 3001 tcctttacag atgaccattc tgatgtctct gttcatcttt tgtttaaata tttatttaat 3061 tatttttaaa atttatgtaa tatcatgagt cgctttacat tgtggttaat gtaacaatat 3121 atgttcttca tatttagcca atatattaat ttcctttttc attaaatttt tactatacaa 3181 aatttcttgt gtttgtttat tctttaagat aaaatgccaa ggctgacttt acaacctgac 3241 ttaaaaatag atgatttaat tatgttacct atcataattt tattcaagtt ataaaaatat 3301 atttttttct gtacctggtt atatgttgcc ttcaggatat aaacgtgaac ataaaatata 3361 cagtccctgt tctcttgtat ctttgatttt ttcaggaaag aaatctaaaa acaataataa 3421 tgctgaatta atatcagtga tgctaactgc tataatgtga ggaagtaaaa aaacaatgaa 3481 ttcctcttag cagaatgtag attgagacat atctggaaat aaaagcagag atattctctg 3541 taaactgact tcaacatgta attgaaaatg tacattgcaa gtcagatatg tgaatttgca 3601 gtttccaagg aatacgatat ctggaagttc ataactggca atggaaagga cgcaaatgaa 3661 ggctgtcata tggggagcaa gtggagaggg aaaaaaagac ttaaactgga ttctgaggat 3721 cttccaccat taaagtgtgg gaacagaaga gacacaaagg aaacagaggt ggaatacctt 3781 aacattagaa ggacaagagg gaatggtgat aaaagtgtat ttagaaaata aatgtgctta 3841 gaaaaggaat caataaactt atggaaaatg tgaattaaaa ctgagcacta cagcaagaaa 3901 atagatggca atgcagagct tactgagagc tggattcata gaattaatca gcagaagcca 3961 tactggggta gacagaagag tgactcagaa aagagaaatc aagataacac atacagaaaa 4021 tgtgagaaaa ctgcctttgc aatggtggca agtaataagt ttggaccccc caaaaatgtg 4081 gattatcttt tatctgcata gtgtttcctt tttgaaaata tgtcactgaa taaatttcat 4141 aattgtgatg cattggtgaa tcatattaat acatttaata atttatatat ttaaagcata 4201 aaatgtaaaa ttatttacaa tagtaattga tcattatttt gattaatact ctgttaaatg 4261 tcagtaaaaa ctgacacatt ttttcataaa ataaaattgg aaactggaaa agataatctc 4321 tttctgagta ctttaggaat ggggaaagga ttccctcttt aataaatggt gctgggaaaa 4381 ctggatagcc atatgcagag aactgaaact ggaccccttc cttacacctt atacaaaaat 4441 taatgcaaga tggattaaac gcttaaatgt aaaacccaaa accataaaaa ccctagaaga 4501 aaacgtaggc aatagcattc aggacatcgg catgggcaaa gattttatga tgaaatcgcc 4561 aaaagcaact gcaaccaaag ctaaaattga caaatggcat ctaattaaag agattctgca 4621 cagcaaaaga aactgtcaat catcacggtg aacaggcaac ttacagaatg ggagaaaatt 4681 tttacagcct acccaattga cagaggtcta atatccagaa tggacaaaga acttaaacaa 4741 attcacaaaa aaaaaaaaaa gccccatcga aaagtgggca aaggacatga acagacactt 4801 ctcaaaagta gacatttatg tggccaataa acatgaaaaa agctcaacat cactgatcat 4861 tagagaaatg aaaatcaaaa ctgcaatgag atgccatctc atgtcactca gaatggcaat 4921 tactaaaaag tcaggaaaca acagatgctg gtgaagctgt ggaaaaatag gaaagctttt 4981 acactgttgg taggaatgta aattacttca accattgtgg aagacagtgt ggcgattcat 5041 caaggattta gaaccagaaa taccattaga cccagcaatc ctattcctga gtatataccc 5101 aaaggaatac aaataattct attataaaaa tacatgcacg tgtatgttta ttgcaacact 5161 atttacaata gcaaagacag gaaaccaacc caaatgccca tcaatgatag aatggattaa 5221 aaaaattgtg cttcaatttt tttaatccta ctgaagctgt aggattattt gggggacaga 5281 gactattatt ccctactttt gggaatagta aattgtctga ttcctacaaa ctgtgtgaat 5341 tggagagttt gaatttcaat atgtgactct gtatttgaca ccaggctatt tattttctat 5401 tataacaaag tagagggagg attagagatg aagtcataaa tagttaatat agtgccaggc 5461 aaaggatgat attatgctgc tcttgcaact tgaatcccca gatctacatg caccttaaaa 5521 aactagaacc ccagtggttt tagcagtaaa ctaaatgggc attactgatt ttcactaaaa 5581 gctgaatgga aatttttgtc attgtctatt acaatcccaa atatggccat gatgaagaaa 5641 aacagcttca ttcttgaaca ccttccatta gaagaaaaaa tgagaaacta ggaaaaactc 5701 cacctactaa atagctgatt tgctaaaaca gacctcattc catttaagga ctcagtatct 5761 atagggccga ggcaaactaa ctttgccaga gttcaggacc actctgaaag ttaattcctt 5821 acataataat attcaatatt tataaattta tgaatttaga acaaagatgg tcttttatat 5881 ttgataagaa ttgacttgga taggaacttc tgaaaacctt tagggaatat gaacttcaat 5941 gaaaaatgcc aaaaatgatt taattgataa tattttctaa gtcacatatg tttattggaa 6001 tgatacttct tctaagggta caaaaattag ttctcgtagt gtaaacaaat ctgacatatt 6061 gcaaaagttt gtaactctcc aagaactcta tccaacaaaa tttcattgct taatatacat 6121 ctttctcgtt gggttttctt gtgtatgata tgagaagcac tggtattgag ttcatgatga 6181 taaacaaaat atttgcaaga tcaacgttac aaacctatgg caaatagatg actgtgattg 6241 gaggactttt tgtcaatttt tttgctggat cttgaagtct taccacaata tgtggcttta 6301 acctgcctac ctttgtgctg ccattgacca tcttatggtt attagttatg attgatcctc 6361 agttcttcag gatgttttgt acactttgag aattcaatgc aaatagccta tattatatga 6421 tttatttcta caaaagttat tcaacacatc agtacttatg tcaagtgctg aaaagaaaaa 6481 agtgttggca atatctggat gaatactgca gctagtgaag tttacaaatt attttctcat 6541 ataaagcaaa attcaaagct tcatatacta tgagaaaatt tttttaaaat tgattcatat 6601 ttctagcagt tttgaatgat taggtatgta attacattca tattaatgtg tattatacag 6661 atttttattt tgcatatgta atttgaaaca acaaaattta catgaacaaa ttacattaaa 6721 agttattcca caaatatact tatctaatta aacttagatt ttaatagctt ttaaacttag 6781 attttagttt aacttttctg tcattcttaa cttactttga ataaaaagag caaacttcat 6841 actttttatc tgtgaagtag aggtatatgt agaataccta aatagatatg ccaaatctgt 6901 gttattaaaa tttcatgaac atttcaatta gaaaaaaata ccataaaagg ctttgagtgc 6961 aggggaaaaa caggcaatga tgaaaaaaaa aatgaaaaac gtatttaaac acatggagag 7021 agtgcataaa gaaagcaaaa acagagatag aaagtaaaac tagggcattt agaaaatgga 7081 aattagtatg ttcactattt aagacctatg cacagagcaa agtctccaga aaacctagag 7141 gccacggttc aagttaccca cctcaggtag cctagtgata tttgcaaaat cccaatggcc 7201 cggtcctttt ctttactgat ggtcgtgctg gtactcagct acaaatccat ctgctctctg 7261 ggctgtgatc tgcctcagac ccacagcctg cgtaatagga gggccttgat actcctggca 7321 caaatgggaa gaatctctcc tttctcctgc ttgaaggaca gacatgaatt cagattccca 7381 gaggaggagt ttgatggcca ccagttccag aagactcaag ccatctctgt cctccatgag 7441 atgatccagc agaccttcaa tctcttcagc acagaggact catctgctgc ttgggaacag 7501 agcctcctag aaaaattttc cactgaactt taccagcaac tgaatgacct ggaagcatgt 7561 gtgatacagg aggttggggt ggaagagact cccctgatga atgaggactt catcctggct 7621 gtgaggaaat acttccaaag aatcactctt tatctaatgg agaagaaata cagcccttgt 7681 gcctgggagg ttgtcagagc agaaatcatg agatccttct ctttttcaac aaacttgaaa 7741 aaaggattaa ggaggaagga ttgaaaactg gttcatcatg gaaatgattc tcattgacta 7801 atgcatcatc tcacactttc atgagttctt ccatttcaaa gactcacttc tataaccacc 7861 acaagttgaa tcaaaatttc caaatgtttt caggagtgtt aagaagcatc gtgtttacct 7921 gtgcaggcac tagtccttta cagatgacca ttctgatgtc tcctttcatc tatttattta 7981 aatatttatt tatttaacta tttttattat ttaaattatt ttttatgtaa tatcatatgt 8041 acctttacat tgtggttaat gtaacaaata tgttcttcat atttagccaa tatattaatt 8101 tcctttttca ttaaattttt actatacaaa atttcttgtg tttgtttatt ttttaagatt 8161 aaatgccaag cctgactgta taacctgact taaaaataga tgatttaagt aagttaccta 8221 tcataatttt attcaagtta tagaaaaata tatttttcta taccaggtta tctgttgcct 8281 tcatgatata aacgtgaaca taaaaaatac agttcttgtt ctcttgtatc tttgattttt 8341 gtcaggaaag aaatctaaaa acaataataa tgctgaatta atatcggtta tactaactgc 8401 tgtaatgtga ggaagtaaaa aaaaatgaat tcctcttagc agaacataga ttaagaaatg 8461 tctgcaaata aaagtagagg tactctctat aaactgactt tcaacatgta attgaaaatg 8521 tacattgcaa gtcagatata tgagtttgca gtttccaagg aatatgatat ctggaagttc 8581 ataactaagc aatggaaacg ccaaaaatga aggctgtcat gtggggagca agcagagagg 8641 gaaaaaagac ttaaactgga ttctgaggaa cttccactat taaagtgggg gaacagaaga 8701 cacacaaaga aaacagaggt ggaatacctt atcattagaa ggacgagagg gaatgatgat 8761 aaaagtgtat ttggagggaa tgatgataaa tggtgctggg aaaactggat agccataggc 8821 agaaaattga acctggacca cttccttaca ccttatacaa aaattaactc aagatggatt 8881 aaagacttaa atgtaaaacc aaaaaccata aaaaccctag aagaaaacat aggacacagg 8941 atgggcaaag attttatgaa gaaatcgcca aaagcaactg caacaaaagt taaattgaca 9001 aatggcattt aattagagaa cttctgcaca gcaaaagaaa ctacctatca tcagagtgaa 9061 caggcaaccc atagaatggg agaaaatttt tgcagtctac ccaactcaca caggtctaat 9121 atccagaatg tacaaagaac ttaaacaaat ttacaaaaac aaaaagcccc atcaaaaagt 9181 gggtgaagga tatgaacaga atcttctcga aagtagacat ttatgcgacc aataatcatt 9241 aaaaaactca acatcactga tcattagaga aatgcaaatc aaaaccacaa tgaaatacca 9301 tctcatgcca ctcagaatgg caattattaa agagtcaaaa tcaacagatg ctggtgaagc 9361 tgtggaaaaa taggaaatct tttacactgt ttgtgggaat gtaaattact tcaaccattg 9421 tggaagacag tgtggcgatt catcaaggtt ctagaaccag aaataccatt tgacccagca 9481 atctcattac tgagtatata cccaaaggaa tataaatcgt tctgttataa aatacacgca 9541 cgtgtatgtt tattgcagca tgtttacaat agcaaagaca caaaaccaac ccaagtgccc 9601 atcaatgata gactggatta aaaaaaattg tacttaaaac ctaggtgacg gatgagagtg 9661 accttgaact ggcaagacca ggtcagtgca agtgacattg gtttcttaga tactcctcag 9721 ctccgtgtgg gtctcccgat tctcagttct gcctgctgac ttgttcattc atttccatgg 9781 cagggctgca gttgggtcgt gcctggtttt ttggcctccc aatgtctctt aattaactta 9841 atattacttg gtaatactaa agcaatgatg aattatgtgc atggagtcat taatcttctg 9901 tgtaaatatc tcaaacactg tgtcttaatt atagact // LOCUS HSIFD6 997 bp DNA PRI 17-DEC-1994 DEFINITION Gene for human fibroblast interferon beta 1. ACCESSION V00535 NID g32639 KEYWORDS interferon; signal peptide. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 997) AUTHORS Lawn,R.M., Adelman,J., Franke,A.E., Houck,C.M., Gross,M., Najarian,R. and Goeddel,D.V. TITLE Human fibroblast interferon gene lacks introns JOURNAL Nucleic Acids Res. 9 (5), 1045-1052 (1981) MEDLINE 81198952 FEATURES Location/Qualifiers source 1..997 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA 15..860 /note="messenger RNA" mRNA 15..853 /note="messenger RNA" CDS 97..660 /note="interferon beta 1" /codon_start=1 /db_xref="PID:g32640" /db_xref="SWISS-PROT:P01574" /translation="MTNKCLLQIALLLCFSTTALSMSYNLLGFLQRSSNFQCQKLLWQ LNGRLEYCLKDRMNFDIPEEIKQLQQFQKEDAALTIYEMLQNIFAIFRQDSSSTGWNE TIVENLLANVYHQINHLKTVLEEKLEKEDFTRGKLMSSLHLKRYYGRILHYLKAKEYS HCAWTIVRVEILRNFYFINRLTGYLRN" BASE COUNT 300 a 200 c 201 g 296 t ORIGIN 1 ggccataccc atggagaaag gacattctaa ctgcaacctt tcgaagcctt tgctctggca 61 caacaggtag taggcgacac tgttcgtgtt gtcaacatga ccaacaagtg tctcctccaa 121 attgctctcc tgttgtgctt ctccactaca gctctttcca tgagctacaa cttgcttgga 181 ttcctacaaa gaagcagcaa ttttcagtgt cagaagctcc tgtggcaatt gaatgggagg 241 cttgaatact gcctcaagga caggatgaac tttgacatcc ctgaggagat taagcagctg 301 cagcagttcc agaaggagga cgccgcattg accatctatg agatgctcca gaacatcttt 361 gctattttca gacaagattc atctagcact ggctggaatg agactattgt tgagaacctc 421 ctggctaatg tctatcatca gataaaccat ctgaagacag tcctggaaga aaaactggag 481 aaagaagatt tcaccagggg aaaactcatg agcagtctgc acctgaaaag atattatggg 541 aggattctgc attacctgaa ggccaaggag tacagtcact gtgcctggac catagtcaga 601 gtggaaatcc taaggaactt ttacttcatt aacagactta caggttacct ccgaaactga 661 agatctccta gcctgtgcct ctgggactgg acaattgctt caagcattct tcaaccagca 721 gatgctgttt aagtgactga tggctaatgt actgcatatg aaaggacact agaagatttt 781 gaaattttta ttaaattatg agttattttt atttatttaa attttatttt ggaaaataaa 841 ttatttttgg tgcaaaagtc aacatggcag ttttaatttc gatttgattt atataaccat 901 ccatattata aaattgccag tacctattag ttgttctttt taaaatatac ctgcaaagta 961 gtatactttg gttcctgcct taaggaattt aaaattc // LOCUS HSIFI56R 1642 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for 56-KDa protein induced by interferon. ACCESSION X03557 J00108 M24594 NID g32644 KEYWORDS interferon response. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1642) AUTHORS Wathelet,M., Moutschen,S., Defilippi,P., Cravador,A., Collet,M., Huez,G. and Content,J. TITLE Molecular cloning, full-length sequence and preliminary characterization of a 56-kDa protein induced by human interferons JOURNAL Eur. J. Biochem. 155 (1), 11-17 (1986) MEDLINE 86136112 REFERENCE 2 (bases 1301 to 1642) AUTHORS Chebath,J., Merlin,G., Metz,R., Benech,P. and Revel,M. TITLE Interferon-induced 56,000 Mr protein and its mRNA in human cells: molecular cloning and partial sequence of the cDNA JOURNAL Nucleic Acids Res. 11 (5), 1213-1226 (1983) MEDLINE 83143342 REFERENCE 3 (bases ) AUTHORS Wathelet,M. TITLE Direct Submission JOURNAL Submitted (30-JUN-1986) to the EMBL/GenBank/DDBJ databases COMMENT Data kindly reviewed (30-JUN-1986) by Wathelet M. FEATURES Location/Qualifiers source 1..1642 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 65..1501 /note="56-KDa protein (aa 1-478)" /codon_start=1 /db_xref="PID:g32645" /db_xref="SWISS-PROT:P09914" /translation="MSTNGDDHQVKDSLEQLRCHFTWELSIDDDEMPDLENRVLDQIE FLDTKYSVGIHNLLAYVKHLKGQNEEALKSLKEAENLMQEEHDNQANVRSLVTWGNFA WMYYHMGRLAEAQTYLDKVENICKKLSNPFRYRMECPEIDCEEGWALLKCGGKNYERA KACFEKVLEVDPENPESSAGYAISAYRLDGFKLATKNHKPFSLLPLRQAVRLNPDNGY IKVLLALKLQDEGQEAEGEKYIEEALANMSSQTYVFRYAAKFYRRKGSVDKALELLKK ALQETPTSVLLHHQIGLCYKAQMIQIKEATKGQPRGQNREKLDKMIRSAIFHFESAVE KKPTFEVAHLDLARMYIEAGNHRKAEENFQKLLCMKPVVEETMQDIHFYYGRFQEFQK KSDVNAIIHYLKAIKIEQASLTRDKSINSLKKLVLRKLRRKALDLESLSLLGFVYKLE GNMNEALEYYERALRLAADFENSVRQGP" old_sequence 1405 /note="c was u in [1]" /citation=[1] misc_feature 1623..1628 /note="pot. polyadenylation signal" polyA_site 1642 /note="polyadenylation site" BASE COUNT 551 a 318 c 369 g 404 t ORIGIN 1 ccagatctca gaggagcctg gctaagcaaa accctgcaga acggctgcct aatttacagc 61 aaccatgagt acaaatggtg atgatcatca ggtcaaggat agtctggagc aattgagatg 121 tcactttaca tgggagttat ccattgatga cgatgaaatg cctgatttag aaaacagagt 181 cttggatcag attgaattcc tagacaccaa atacagtgtg ggaatacaca acctactagc 241 ctatgtgaaa cacctgaaag gccagaatga ggaagccctg aagagcttaa aagaagctga 301 aaacttaatg caggaagaac atgacaacca agcaaatgtg aggagtctgg tgacctgggg 361 caactttgcc tggatgtatt accacatggg cagactggca gaagcccaga cttacctgga 421 caaggtggag aacatttgca agaagctttc aaatcccttc cgctatagaa tggagtgtcc 481 agaaatagac tgtgaggaag gatgggcctt gctgaagtgt ggaggaaaga attatgaacg 541 ggccaaggcc tgctttgaaa aggtgcttga agtggaccct gaaaaccctg aatccagcgc 601 tgggtatgcg atctctgcct atcgcctgga tggctttaaa ttagccacaa aaaatcacaa 661 gccattttct ttgcttcccc taaggcaggc tgtccgctta aatccagaca atggatatat 721 taaggttctc cttgccctga agcttcagga tgaaggacag gaagctgaag gagaaaagta 781 cattgaagaa gctctagcca acatgtcctc acagacctat gtctttcgat atgcagccaa 841 gttttaccga agaaaaggct ctgtggataa agctcttgag ttattaaaaa aggccttgca 901 ggaaacaccc acttctgtct tactgcatca ccagataggg ctttgctaca aggcacaaat 961 gatccaaatc aaggaggcta caaaagggca gcctagaggg cagaacagag aaaagctaga 1021 caaaatgata agatcagcca tatttcattt tgaatctgca gtggaaaaaa agcccacatt 1081 tgaggtggct catctagacc tggcaagaat gtatatagaa gcaggcaatc acagaaaagc 1141 tgaagagaat tttcaaaaat tgttatgcat gaaaccagtg gtagaagaaa caatgcaaga 1201 catacatttc tactatggtc ggtttcagga atttcaaaag aaatctgacg tcaatgcaat 1261 tatccattat ttaaaagcta taaaaataga acaggcatca ttaacaaggg ataaaagtat 1321 caattctttg aagaaattgg ttttaaggaa acttcggaga aaggcattag atctggaaag 1381 cttgagcctc cttgggttcg tctacaaatt ggaaggaaat atgaatgaag ccctggagta 1441 ctatgagcgg gccctgagac tggctgctga ctttgagaac tctgtgagac aaggtcctta 1501 ggcacccaga tatcagccac tttcacattt catttcattt tatgctaaca tttactaatc 1561 atcttttctg cttactgttt tcagaaacat tataattcac tgtaatgatg taattcttga 1621 ataataaatc tgacaaaata tt // LOCUS HSIFNABR 1296 bp RNA PRI 23-JUL-1994 DEFINITION H.sapiens mRNA for interferon alpha/beta receptor. ACCESSION X77722 NID g488363 KEYWORDS interferon alpha/beta receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1296) AUTHORS Novick,D., Cohen,B. and Rubinstein,M. TITLE The human interferon alpha/beta receptor: characterization and molecular cloning JOURNAL Cell 77 (3), 391-400 (1994) MEDLINE 94236684 REFERENCE 2 (bases 1 to 1296) AUTHORS Rubinstein,M. TITLE Direct Submission JOURNAL Submitted (17-FEB-1994) M. Rubinstein, Weizmann Institute of Science, Dept of Molecular Genetics & Virology, Po Box 26, Rehovot 76100, ISRAEL FEATURES Location/Qualifiers source 1..1296 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="blood" /cell_type="monocytes" /clone_lib="lambda pCEV9" /chromosome="21" gene 226..1243 /gene="HUIFNABR" CDS 226..1221 /gene="HUIFNABR" /codon_start=1 /product="interferon alpha/beta receptor" /db_xref="PID:g488364" /db_xref="SWISS-PROT:P48551" /translation="MLLSQNAFIVRSLNLVLMVYISLVFGISYDSPDYTDESCTFKIS LRNFRSILSWELKNHSIVPTHYTLLYTIMSKPEDLKVVKNCANTTRSFCDLTDEWRST HEAYVTVLEGFSGNTTLFSCSHNFWLAIDMSFEPPEFEIVGFTNHINVMVKFPSIVEE ELQFDLSLVIEEQSEGIVKKHKPEIKGNMSGNFTYIIDKLIPNTNYCVSVYLEHSDEQ AVIKSPLKCTLLPPGQESESAESAKIGGIITVFLIALVLTSTIVTLKWIGYICLRNSL PKVLRQGLTKGWNAVAIHRCSHNALQSETPELKQSSCLSFPSSWDYKRASLCPSD" sig_peptide 226..303 /gene="HUIFNABR" mat_peptide 316..1218 /gene="HUIFNABR" /product="interferon alpha/beta receptor" polyA_signal 1238..1243 /gene="HUIFNABR" BASE COUNT 411 a 274 c 274 g 337 t ORIGIN 1 gcttttgtcc cccgcccgcc gcttctgtcc gagaggccgc ccgcgaggcg catcctgacc 61 gcgagcgtcg ggtcccagag ccgggcgcgg ctggggcccg aggctagcat ctctcgggag 121 ccgcaaggcg agagctgcaa agtttaatta gacacttcag aattttgatc acctaatgtt 181 gatttcagat gtaaaagtca agagaagact ctaaaaatag caaagatgct tttgagccag 241 aatgccttca tcgtcagatc acttaatttg gttctcatgg tgtatatcag cctcgtgttt 301 ggtatttcat atgattcgcc tgattacaca gatgaatctt gcactttcaa gatatcattg 361 cgaaatttcc ggtccatctt atcatgggaa ttaaaaaacc actccattgt accaactcac 421 tatacattgc tgtatacaat catgagtaaa ccagaagatt tgaaggtggt taagaactgt 481 gcaaatacca caagatcatt ttgtgacctc acagatgagt ggagaagcac acacgaggcc 541 tatgtcaccg tcctagaagg attcagcggg aacacaacgt tgttcagttg ctcacacaat 601 ttctggctgg ccatagacat gtcttttgaa ccaccagagt ttgagattgt tggttttacc 661 aaccacatta atgtgatggt gaaatttcca tctattgttg aggaagaatt acagtttgat 721 ttatctctcg tcattgaaga acagtcagag ggaattgtta agaagcataa acccgaaata 781 aaaggaaaca tgagtggaaa tttcacctat atcattgaca agttaattcc aaacacgaac 841 tactgtgtat ctgtttattt agagcacagt gatgagcaag cagtaataaa gtctccctta 901 aaatgcaccc tccttccacc tggccaggaa tcagaatcag cagaatctgc caaaatagga 961 ggaataatta ctgtgttttt gatagcattg gtcttgacaa gcaccatagt gacactgaaa 1021 tggattggtt atatatgctt aagaaatagc ctccccaaag tcttgaggca aggtctcact 1081 aagggctgga atgcagtggc tattcacagg tgcagtcata atgcactaca gtctgaaact 1141 cctgagctca aacagtcgtc ctgcctaagc ttccccagta gctgggatta caagcgtgca 1201 tccctgtgcc ccagtgatta agttttatta tgtagaaaat aaagagcaaa cagttacaaa 1261 agaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa // LOCUS HSIFNIN3 777 bp RNA PRI 19-JUL-1995 DEFINITION Human interferon-inducible mRNA fragment (cDNA 6-16). ACCESSION X02492 X02495 NID g32697 KEYWORDS Alu repetitive sequence; interferon; interferon response; repetitive sequence. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 777) AUTHORS Kelly,J.M., Porter,A.C., Chernajovsky,Y., Gilbert,C.S., Stark,G.R. and Kerr,I.M. TITLE Characterization of a human gene inducible by alpha- and beta-interferons and its expression in mouse cells JOURNAL EMBO J. 5 (7), 1601-1606 (1986) MEDLINE 86300661 REFERENCE 2 (bases 1 to 777) AUTHORS Friedman,R.L., Manly,S.P., McMahon,M., Kerr,I.M. and Stark,G.R. TITLE Transcriptional and posttranscriptional regulation of interferon-induced gene expression in human cells JOURNAL Cell 38 (3), 745-755 (1984) MEDLINE 85024867 REFERENCE 3 (bases 1 to 777) AUTHORS Kerr,I.M. TITLE Direct Submission JOURNAL Submitted (12-AUG-1986) to the EMBL/GenBank/DDBJ databases COMMENT Data kindly reviewed (12-SEP-1986) by Kerr I. FEATURES Location/Qualifiers source 1..777 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 70..462 /note="put. precursor protein" /codon_start=1 /db_xref="PID:g32698" /db_xref="SWISS-PROT:P09912" /translation="MRQKAVSVFLCYLLLFTCSGVEAGKKKCSESSDSGSGFWKALTF MAVGGGLAVAGLPALGFTGAGIAANSVAASLMSWSAILNGGGVPAGGLVATLQSLGAG GSSVVIGNIGALMRYATHKYLDSEEDEE" sig_peptide 70..129 /note="put. signal peptide (aa -20 to -1)" sig_peptide 70..138 /note="alternate signal peptide (aa -23 to -1)" mat_peptide 130..459 /note="put. mature protein (aa 1-110)" mat_peptide 139..459 /note="alternate mature protein (aa 1-107)" repeat_region 513..722 /note="18 nucleotide direct repeats (homology to Alu repeat)" repeat_region 531..704 /note="Alu repeat" misc_feature 758..762 /note="pot. polyA signal" polyA_site 777 /note="polyadenylation site" BASE COUNT 154 a 221 c 216 g 186 t ORIGIN 1 gctccgggct gaagattgct tctcttctct cctccaaggt ctagtgacgg agcccgcgcg 61 cgcgccacca tgcggcagaa ggcggtatcc gttttcttgt gctacctgct gctcttcact 121 tgcagtgggg tggaggcagg taagaaaaag tgctcggaga gctcggacag cggctccggg 181 ttctggaagg ccctgacctt catggccgtc ggaggaggac tcgcagtcgc cgggctgccc 241 gcgctgggct tcaccggcgc cggcatcgcg gccaactcgg tggctgcctc gctgatgagc 301 tggtctgcga tcctgaatgg gggcggcgtg cccgccgggg ggctagtggc cacgctgcag 361 agcctcgggg ctggtggcag cagcgtcgtc ataggtaata ttggtgccct gatgcggtac 421 gccacccaca agtatctcga tagtgaggag gatgaggagt agccagcagc tcccagaacc 481 tcttcttcct tcttggccta actcttccag ttaggatcta gaactttgcc tttttttttt 541 tttttttttt tttgagatgg gttctcacta tattgtccag gctagagtgc agtggctatt 601 cacagatgcg aacatagtac actgcagcct ccaactccta gcctcaagtg atcctcctgt 661 ctcaacctcc caagtaggat tacaagcatg cgccgacgat gcccagaatc cagaactttg 721 tctatcactc tccccaacaa cctagatgtg aaaacagaat aaacttcacc cagaaaa // LOCUS HSIGF1A 616 bp RNA PRI 29-NOV-1993 DEFINITION H.sapiens mRNA for IGF-1a. ACCESSION X56773 S61841 NID g32989 KEYWORDS IGF-1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 616) AUTHORS Sandberg-Nordqvist,A.C., Stahlbom,P.A., Lake,M. and Sara,V.R. TITLE Characterization of two cDNAs encoding insulin-like growth factor 1 (IGF-1) in the human fetal brain JOURNAL Brain Res. Mol. Brain Res. 12 (1-3), 275-277 (1992) MEDLINE 92186627 REFERENCE 2 (bases 1 to 616) AUTHORS Sandberg Nordqvist,A.C. TITLE Direct Submission JOURNAL Submitted (19-NOV-1990) A.C.Sandberg Nordqvist, KAROLINSKA INST'S DEPT OF PATHOLOGY, KAROLINSKA HOSPITAL, BOX 605 00, S-104 01 STOCKHOLM, SWEDEN REFERENCE 3 (bases 1 to 616) AUTHORS Sandberg-Nordqvist,A.C., Stahlbom,P.A., Reinecke,M., Collins,V.P., von Holst,H. and Sara,V. TITLE Characterization of insulin-like growth factor 1 in human primary brain tumors JOURNAL Cancer Res. 53 (11), 2475-2478 (1993) MEDLINE 93265440 FEATURES Location/Qualifiers source 1..616 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="brain" /chromosome="12" /map="q22-q24" gene 1..462 /gene="IGF-1" CDS 1..462 /gene="IGF-1" /codon_start=1 /product="IGF-1a" /db_xref="PID:g32990" /db_xref="SWISS-PROT:P01343" /translation="MGKISSLPTQLFKCCFCDFLKVKMHTMSSSHLFYLALCLLTFTS SATAGPETLCGAELVDALQFVCGDRGFYFNKPTGYGSSSRRAPQTGIVDECCFRSCDL RRLEMYCAPLKPAKSARSVRAQRHTDMPKTQKEVHLKNASRGSAGNKNYRM" mat_peptide 145..354 /gene="IGF-1" /product="IGF-1a" exon 403..616 /note="exon 5" BASE COUNT 159 a 158 c 160 g 139 t ORIGIN 1 atgggaaaaa tcagcagtct tccaacccaa ttatttaagt gctgcttttg tgatttcttg 61 aaggtgaaga tgcacaccat gtcctcctcg catctcttct acctggcgct gtgcctgctc 121 accttcacca gctctgccac ggctggaccg gagacgctct gcggggctga gctggtggat 181 gctcttcagt tcgtgtgtgg agacaggggc ttttatttca acaagcccac agggtatggc 241 tccagcagtc ggagggcgcc tcagacaggc atcgtggatg agtgctgctt ccggagctgt 301 gatctaagga ggctggagat gtattgcgca cccctcaagc ctgccaagtc agctcgctct 361 gtccgtgccc agcgccacac cgacatgccc aagacccaga aggaagtaca tttgaagaac 421 gcaagtagag ggagtgcagg aaacaagaac tacaggatgt aggaagaccc tcctgaggag 481 tgaagagtga catgccaccg caggatcctt tgctctgcac gagttacctg ttaaactttg 541 gaacacctac caaaaaataa gtttgataac atttaaaaga tgggcgtttc ccccaatgaa 601 atacacaagt aaacat // LOCUS HSIGF2 1046 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for IGF-II precursor (insulin-like growth factor). ACCESSION X00910 M17862 NID g32995 KEYWORDS growth factor; insulin super family; insulin-like growth factor II; signal peptide; somatomedin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1046) AUTHORS Bell,G.I., Merryweather,J.P., Sanchez-Pescador,R., Stempien,M.M., Priestley,L., Scott,J. and Rall,L.B. TITLE Sequence of a cDNA clone encoding human preproinsulin-like growth factor II JOURNAL Nature 310 (5980), 775-777 (1984) MEDLINE 84295592 REFERENCE 2 (bases 32 to 876) AUTHORS Jansen,M., van Schaik,F.M., van Tol,H., Van den Brande,J.L. and Sussenbach,J.S. TITLE Nucleotide sequences of cDNAs encoding precursors of human insulin-like growth factor II (IGF-II) and an IGF-II variant JOURNAL FEBS Lett. 179 (2), 243-246 (1985) MEDLINE 85102019 COMMENT Data kindly reviewed (13-JUN-1985) by G. Bell. FEATURES Location/Qualifiers source 1..1046 /organism="Homo sapiens" /db_xref="taxon:9606" misc_feature 1..250 /note="leader" sig_peptide 251..322 CDS 251..793 /note="IGF-II precursor" /codon_start=1 /db_xref="PID:g32996" /db_xref="SWISS-PROT:P01344" /translation="MGIPMGKSMLVLLTFLAFASCCIAAYRPSETLCGGELVDTLQFV CGDRGFYFSRPASRVSRRSRGIVEECCFRSCDLALLETYCATPAKSERDVSTPPTVLP DNFPRYPVGKFFQYDTWKQSTQRLRRGLPALLRARRGHVLAKELEAFREAKRHRPLIA LPTQDPAHGGAPPEMASNRK" misc_feature 323..418 /note="B-domain (aa1-32)" misc_feature 323..523 /note="IGF-II" variation 407..408 /note="GA CTT CCA GG is inserted in IGF-II-var (ref 2)" misc_feature 419..442 /note="C-domain (aa33-40)" misc_feature 443..505 /note="A-domain (aa41-61)" misc_feature 506..523 /note="D-domain (aa62-67)" misc_feature 524..790 /note="E-domain (aa68-156)" misc_feature 791..1046 /note="trailer" BASE COUNT 190 a 387 c 287 g 182 t ORIGIN 1 caggggccga agagtcacca ccgagcttgt gtgggaggag gtggattcca gcccccagcc 61 ccagggctct gaatcgctgc cagctcagcc ccctgcccag cctgccccac agcctgagcc 121 ccagcaggcc agagagccca gtcctgaggt gagctgctgt ggcctgtggc caggcgaccc 181 cagcgctccc agaactgagg ctggcagcca gccccagcct cagccccaac tgcgaggcag 241 agagacacca atgggaatcc caatggggaa gtcgatgctg gtgcttctca ccttcttggc 301 cttcgcctcg tgctgcattg ctgcttaccg ccccagtgag accctgtgcg gcggggagct 361 ggtggacacc ctccagttcg tctgtgggga ccgcggcttc tacttcagca ggcccgcaag 421 ccgtgtgagc cgtcgcagcc gtggcatcgt tgaggagtgc tgtttccgca gctgtgacct 481 ggccctcctg gagacgtact gtgctacccc cgccaagtcc gagagggacg tgtcgacccc 541 tccgaccgtg cttccggaca acttccccag ataccccgtg ggcaagttct tccaatatga 601 cacctggaag cagtccaccc agcgcctgcg caggggcctg cctgccctcc tgcgtgcccg 661 ccggggtcac gtgctcgcca aggagctcga ggcgttcagg gaggccaaac gtcaccgtcc 721 cctgattgct ctacccaccc aagaccccgc ccacgggggc gcccccccag agatggccag 781 caatcggaag tgagcaaaac tgccgcaagt ctgcagcccg gcgccaccat cctgcagcct 841 cctcctgacc acggacgttt ccatcaggtt ccatcccgaa aatctctcgg ttccacgtcc 901 ccctggggct tctcctgacc cagtccccgt gccccgcctc cccgaaacag gctactctcc 961 tcggccccct ccatcgggct gaggaagcac agcagcatct tcaaacatgt acaaaatcga 1021 ttggctttaa acaccttcac atacct // LOCUS HSIGF27 4156 bp DNA PRI 25-JUN-1997 DEFINITION Human DNA for insulin-like growth factor II (IGF-2); exon 7 and additional ORF. ACCESSION X07868 NID g32998 KEYWORDS growth factor; insulin-like growth factor II. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4156) AUTHORS Sussenbach,J.S. TITLE Direct Submission JOURNAL Submitted (10-JUN-1988) Sussenbach J.S., Laboratory for Physiological Chemistry, Vondellaan 24a, 3521 GG Utrecht, The Netherlands REFERENCE 2 (bases 1 to 4156) AUTHORS de Pagter-Holthuizen,P., Jansen,M., van der Kammen,R.A., van Schaik,F.M. and Sussenbach,J.S. TITLE Differential expression of the human insulin-like growth factor II gene. Characterization of the IGF-II mRNAs and an mRNA encoding a putative IGF-II-associated protein JOURNAL Biochim. Biophys. Acta 950 (3), 282-295 (1988) MEDLINE 89000779 COMMENT Data kindly reviewed (23-AUG-1988) by SUSSENBACH J.S. FEATURES Location/Qualifiers source 1..4156 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /chromosome="11" mRNA 1..4038 /note="Exon 7" CDS 1..234 /note="insulin-like growth factor II (78 AA); Author-given protein sequence is in conflict with the conceptual translation" /codon_start=1 /db_xref="PID:e4393" /db_xref="PID:g1335138" /db_xref="SWISS-PROT:P01344" /translation="DNFPRYPVGKFFQYDTWKQSTQRLRRGLPALLRARRGHVLAKEL EAFREAKRHRPLIALPTQDPAHGGAPPEMASNRK" misc_feature 714..719 /note="pot. polyA signal" misc_feature 1020..1029 /note="Nuclear factor III recognition site" misc_feature 2045..2052 /note="pot. Sp1 binding site" misc_feature 2417..2421 /note="region of transcription start of additional ORF" CDS 3508..3762 /note="1.8 kb mRNA (AA 1-84)" /codon_start=1 /db_xref="PID:g33000" /db_xref="SWISS-PROT:P09565" /translation="MTPGVVHASPPQSQRVPRQAPCEWAIRNIGQKPKEPNCHNCGTH IGLRSKTLRGTPNYLPIRQDTHPPSVIFCLAGVGVPGLPV" misc_feature 4020..4025 /note="pot. polyA signal" polyA_site 4038 /note="polyA site" BASE COUNT 1055 a 1348 c 1007 g 746 t ORIGIN 1 gacaacttcc ccagataccc cgtgggcaag ttcttccaat atgacacctg gaagcagtcc 61 acccagcgcc tgcgcagggg cctgcctgcc ctcctgcgtg cccgccgggg tcacgtgctc 121 gccaaggagc tcgaggcgtt cagggaggcc aaacgtcacc gtcccctgat tgctctaccc 181 acccaagacc ccgcccacgg gggcgccccc ccagagatgg ccagcaatcg gaagtgagca 241 aaactgccgc aagtctgcag cccggcgcca ccatcctgca gcctcctcct gaccacggac 301 gtttccatca ggttccatcc cgaaaatctc tcggttccac gtccccctgg ggcttctcct 361 gacccagtcc ccgtgccccg cctccccgaa acaggctact ctcctcggcc ccctccatcg 421 ggctgaggaa gcacagcagc atcttcaaac atgtacaaaa tcgattggct ttaaacaccc 481 ttcacatacc ctccccccaa attatcccca attatcccca cacataaaaa atcaaaacat 541 taaactaacc cccttccccc ccccccacaa caaccctctt aaaactaatt ggctttttag 601 aaacacccca caaaagctca gaaattggct ttaaaaaaaa caaccaccaa aaaaaatcaa 661 ttggctaaaa aaaaaaagta ttaaaaacga attggctgag aaacaattgg caaaataaag 721 gaatttggca ctccccaccc ccctctttct cttctccctt ggactttgag tcaaattggc 781 ctggacttga gtccctgaac cagcaaagag aaaagaaggg ccccagaaat cacaggtggg 841 cacgtcgctc gtaccgccat ctcccttctc acgggaattt tcagggtaaa ctggccatcc 901 gaaaatagca acaacccaga ctggctcctc actccctttt ccatcactaa aaatcacaga 961 gcagtcagag ggacccagta agaccaaagg aggggaggac agagcatgaa aaccaaaatc 1021 catgcaaatg aaatgtaatt ggcacgaccc tcacccccaa atcttacatc tcaattccca 1081 tcctaaaaag cactcatact ttatgcatcc ccgcagctac acacacacaa cacacagcac 1141 acgcatgaac acagcacaca cacgagcaca gcacacacac gagcatacag cacacacaca 1201 aacgcacagc acacacagca cacagatgag cacacagcac acacacaaac gcacagcaca 1261 cacacgcaca cacatgcaca cacagcacac aaacgcacgg cacacacacg cacacacagt 1321 gcacacacag cacacacgca aacgcacacg cacacacaaa cgcacagcac acacgcacac 1381 acagcacaca cacgagcaca cagcacacaa acgcacagca cacgcacaca catgcacaca 1441 cagcacacta gcacacagca cacacacaaa gacacagcac acacatgcac acacagcaca 1501 cacacgcgaa cacagcacac acgaacacag cacacacagc acacacacaa acacagcaca 1561 cacatgcaca cagcacatgc acacacagca cacacatgaa cacagcacac agcacacaca 1621 tgcacacagc acacacgcat gcacagcaca catgaacaca gcacacacaa acacacagca 1681 cacacatgca cacacagcac acacactcat gcgcagcaca tacatgaaca cagctcacag 1741 cacacaaaca cgcagcacac acgttgcaca cgcaagcacc cacctgcaca cacacatgcg 1801 cacacacacg cacaccccca caaaattaga tgaaaacaat aagcatatct aagcaactac 1861 gatatctgta tggatcaggc caaagtcccg ctaagattct ccaatgtttt catggtctga 1921 gcccccctcc tgttcccatc tccactgccc ctcggccctg tctgtgccct gcctctcaga 1981 ggagggggct cagatggtgc ggcctgagtg tgcggccggc ggcatttggg atacacccgt 2041 aggtgggcgg ggtgtgtccc aggcctaatt ccatctttcc accatgacag agatgccctt 2101 gtgaggctgg cctccttggc gcctgtcccc acggcccccg cagcgtgagc cacgatgctc 2161 cccatacccc acccattccc gatacacctt acttactgtg tgttggccca gccagagtga 2221 ggaaggagtt tggccacatt ggagatggcc ggtagctgag cagacatgcc cccacgagta 2281 gcctgactcc ctggtgtgct cctggaagga agatcttggg gaccccccca ccggagcaca 2341 cctagggatc atctttgccc gtctcctggg gaccccccaa gaaatgtgga gtcctcgggg 2401 gccgtgcact gatgcgggga gtgtgggaag tctggcggtt ggaggggtgg gtggggggca 2461 gtgggggctg ggcgggggga gttctggggt aggaagtggt cccgggagat tttggatgga 2521 aaagtcagga ggattgacag cagacttgca gaattacata gagaaattag gaacccccaa 2581 atttcatgtc aattgatcta ttccccctct ttgtttcttg gggcattttt cctttttttt 2641 ttttttttgt ttttttttta cccctcctta gctttatgcg ctcagaaacc aaattaaacc 2701 ccccccccat gtaacagggg ggcagtgaca aaagcaagaa cgcacgaagc cagcctggag 2761 accaccacgt cctgcccccc gccatttatc gccctgattg gattttgttt ttcatctgtc 2821 cctgttgctt gggttgagtt gagggtggag cctcctgggg ggcatggcca tgagccccct 2881 tggagaagtc agaggggagt ggagaaggca tgtccggcct ggcttctggg gacagtggct 2941 ggtccccaga agtcctgagg gcggaggggg gggttgggca gggtctcctc aggtgtcagg 3001 agggtgctcg gaggccacag gagggggctc ctggctggcc tgaggctggc cggaggggaa 3061 ggggctagca ggtgtgtaaa cagagggttc catcagctgg ggcagggtgg ccgccttccg 3121 cacacttgag gaaccctccc ctctccctcg gtgacatctt gcccgcccct cagcaccctg 3181 ccttgtctcc aggaggtccg aagctctgtg ggacctcttg ggggcaaggt ggggtgaggc 3241 cggggagtag ggaggtcagg cgggtctgag cccacagagc aggagagctg ccaggtctgc 3301 ccatcgacca ggttgcttgg gccccggagc ccacgggtct ggtgatgcca tagcagccac 3361 caccgcggcg cctagggctg cggcagggac tcggcctctg ggaggtttac ctcgccccca 3421 cttgtgcccc cagctcagcc cccctgcacg cagcccgact agcagtctag aggcctgagg 3481 cttctgggtc ctggtgacgg ggctggcatg accccggggg tcgtccatgc cagtccgcct 3541 cagtcgcaga gggtccctcg gcaagcgccc tgtgagtggg ccattcggaa cattggacag 3601 aagcccaaag agccaaattg tcacaattgt ggaacccaca ttggcctgag atccaaaacg 3661 cttcgaggca ccccaaatta cctgcccatt cgtcaggaca cccacccacc cagtgttata 3721 ttctgcctcg ccggagtggg tgttcccggg ctgcctgtct gacctccgtg cctagtcgtg 3781 gctctccatc ttgtctcctc cccgtgtccc caatgtcttc agtggggggc cccctcttgg 3841 gtcccctcct ctgccatcac ctgaagaccc ccacgccaaa cactgaatgt cacctgtgcc 3901 tgccgcctcg gtccaccttg cggcccgtgt ttgactcaac tcagctcctt taacgctaat 3961 atttccggca aaatcccatg cttgggtttt gtctttaacc ttgtaacgct tgcaatccca 4021 ataaagcatt aaaagtcatg atcttctgag gtgttccact ctctgacttg ggtactggac 4081 tgccggaggg agggaagggg ctgagcacct ggaagcaggc agagggggat agaagaggga 4141 aggggaagga aggcct // LOCUS HSIGFBP2 1433 bp RNA PRI 22-MAR-1995 DEFINITION Human mRNA for insulin-like growth factor binding protein (IGFBP-2). ACCESSION X16302 NID g33009 KEYWORDS insulin-like growth factor binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1433) AUTHORS Binkert,C., Landwehr,J., Mary,J.L., Schwander,J. and Heinrich,G. TITLE Cloning, sequence analysis and expression of a cDNA encoding a novel insulin-like growth factor binding protein (IGFBP-2) JOURNAL EMBO J. 8 (9), 2497-2502 (1989) MEDLINE 90060007 FEATURES Location/Qualifiers source 1..1433 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="liver" sig_peptide 118..234 /note="signal peptide (AA -39 to -1)" CDS 118..1104 /note="precursor polypeptide (AA -39 to 289)" /codon_start=1 /db_xref="PID:g33010" /db_xref="SWISS-PROT:P18065" /translation="MLPRVGCPALPLPPPPLLPLLPLLLLLLGASGGGGGARAEVLFR CPPCTPERLAACGPPPVAPPAAVAAVAGGARMPCAELVREPGCGCCSVCARLEGEACG VYTPRCGQGLRCYPHPGSELPLQALVMGEGTCEKRRDAEYGASPEQVADNGDDHSEGG LVENHVDSTMNMLGGGGSAGRKPLKSGMKELAVFREKVTEQHRQMGKGGKHHLGLEEP KKLRPPPARTPCQQELDQVLERISTMRLPDERGPLEHLYSLHIPNCDKHGLYNLKQCK MSLNGQRGECWCVNPNTGKLIQGAPTIRGDPECHLFYNEQQEACGVHTQRMQ" mat_peptide 235..1101 /note="mature IGFBP-2 (AA 1 to 289)" misc_feature 1416..1420 /note="pot. polyadenylation signal" polyA_site 1433 /note="polyadenylation site" BASE COUNT 239 a 466 c 501 g 227 t ORIGIN 1 attcggggcg agggaggagg aagaagcgga ggaggcggct cccgctcgca gggccgtgca 61 cctgcccgcc cgcccgctcg ctcgctcgcc cgccgcgccg cgctgccgac cgccagcatg 121 ctgccgagag tgggctgccc cgcgctgccg ctgccgccgc cgccgctgct gccgctgctg 181 ccgctgctgc tgctgctact gggcgcgagt ggcggcggcg gcggggcgcg cgcggaggtg 241 ctgttccgct gcccgccctg cacacccgag cgcctggccg cctgcgggcc cccgccggtt 301 gcgccgcccg ccgcggtggc cgcagtggcc ggaggcgccc gcatgccatg cgcggagctc 361 gtccgggagc cgggctgcgg ctgctgctcg gtgtgcgccc ggctggaggg cgaggcgtgc 421 ggcgtctaca ccccgcgctg cggccagggg ctgcgctgct atccccaccc gggctccgag 481 ctgcccctgc aggcgctggt catgggcgag ggcacttgtg agaagcgccg ggacgccgag 541 tatggcgcca gcccggagca ggttgcagac aatggcgatg accactcaga aggaggcctg 601 gtggagaacc acgtggacag caccatgaac atgttgggcg ggggaggcag tgctggccgg 661 aagcccctca agtcgggtat gaaggagctg gccgtgttcc gggagaaggt cactgagcag 721 caccggcaga tgggcaaggg tggcaagcat caccttggcc tggaggagcc caagaagctg 781 cgaccacccc ctgccaggac tccctgccaa caggaactgg accaggtcct ggagcggatc 841 tccaccatgc gccttccgga tgagcggggc cctctggagc acctctactc cctgcacatc 901 cccaactgtg acaagcatgg cctgtacaac ctcaaacagt gcaagatgtc tctgaacggg 961 cagcgtgggg agtgctggtg tgtgaacccc aacaccggga agctgatcca gggagccccc 1021 accatccggg gggaccccga gtgtcatctc ttctacaatg agcagcagga ggcttgcggg 1081 gtgcacaccc agcggatgca gtagaccgca gccagccggt gcctggcgcc cctgcccccc 1141 gcccctctcc aaacaccggc agaaaacgga gagtgcttgg gtggtgggtg ctggaggatt 1201 ttccagttct gacacacgta tttatatttg gaaagagacc agcaccgagc tcggcacctc 1261 cccggcctct ctcttcccag ctgcagatgc cacacctgct ccttcttgct ttccccgggg 1321 gaggaagggg gttgtggtcg gggagctggg gtacaggttt ggggaggggg aagagaaatt 1381 tttatttttg aacccctgtg tcccttttgc ataagattaa aggaaggaaa agt // LOCUS HSIGFIIR 9090 bp RNA PRI 23-MAR-1995 DEFINITION Human mRNA for insuline-like growth factor II receptor. ACCESSION Y00285 NID g33054 KEYWORDS insuline-like growth factor II receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 9090) AUTHORS Morgan,D.O. TITLE Direct Submission JOURNAL Submitted (08-OCT-1987) Morgan O., Hormine Research Institute, University of California, San Francisco CA 94143 USA REFERENCE 2 (bases 1 to 9090) AUTHORS Morgan,D.O., Edman,J.C., Standring,D.N., Fried,V.A., Smith,M.C., Roth,R.A. and Rutter,W.J. TITLE Insulin-like growth factor II receptor as a multifunctional binding protein JOURNAL Nature 329 (6137), 301-307 (1987) MEDLINE 87315441 REMARK Erratum:[Nature 1988 Jul;20(7):442]] FEATURES Location/Qualifiers source 1..9090 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HepG2 Hepatoma cells" /clone_lib="lambda gt10" misc_feature 1..147 /note="5' UT-region" sig_peptide 148..267 /note="signal peptide (AA -40 to -1)" CDS 148..7623 /note="precursor polypeptide (AA -40 to 2451)" /codon_start=1 /db_xref="PID:g33055" /db_xref="SWISS-PROT:P11717" /translation="MGAAAGRSPHLGPAPARRPQRSLLLLQLLLLVAAPGSTQAQAAP FPELCSYTWEAVDTKNNVLYKINICGSVDIVQCGPSSAVCMHDLKTRTYHSVGDSVLR SATRSLLEFNTTVSCDQQGTNHRVQSSIAFLCGKTLGTPEFVTATECVHYFEWRTTAA CKKDIFKANKEVPCYVFDEELRKHDLNPLIKLSGAYLVDDSDPDTSLFINVCRDIDTL RDPGSQLRACPPGTAACLVRGHQAFDVGQPRDGLKLVRKDRLVLSYVREEAGKLDFCD GHSPAVTITFVCPSERREGTIPKLTAKSNCRYEIEWITEYACHRDYLESKTCSLSGEQ QDVSIDLTPLAQSGGSSYISDGKEYLFYLNVCGETEIQFCNKKQAAVCQVKKSDTSQV KAAGRYHNQTLRYSDGDLTLIYFGGDECSSGFQRMSVINFECNKTAGNDGKGTPVFTG EVDCTYFFTWDTEYACVKEKEDLLCGATDGKKRYDLSALVRHAEPEQNWEAVDGSQTE TEKKHFFINICHRVLQEGKARGCPEDAAVCAVDKNGSKNLGKFISSPMKEKGNIQLSY SDGDDCGHGKKIKTNITLVCKPGDLESAPVLRTSGEGGCFYEFEWRTAAACVLSKTEG ENCTVFDSQAGFSFDLSPLTKKNGAYKVETKKYDFYINVCGPVSVSPCQPDSGACQVA KSDEKTWNLGLSNAKLSYYDGMIQLNYRGGTPYNNERHTPRATLITFLCDRDAGVGFP EYQEEDNSTYNFRWYTSYACPEEPLECVVTDPSTLEQYDLSSLAKSEGGLGGNWYAMD NSGEHVTWRKYYINVCRPLNPVPGCNRYASACQMKYEKDQGSFTEVVSISNLGMAKTG PVVEDSGSLLLEYVNGSACTTSDGRQTTYTTRIHLVCSRGRLNSHPIFSLNWECVVSF LWNTEAACPIQTTTDTDQACSIRDPNSGFVFNLNPLNSSQGYNVSGIGKIFMFNVCGT MPVCGTILGKPASGCEAETQTEELKNWKPARPVGIEKSLQLSTEGFITLTYKGPLSAK GTADAFIVRFVCNDDVYSGPLKFLHQDIDSGQGIRNTYFEFETALACVPSPVDCQVTD LAGNEYDLTGLSTVRKPWTAVDTSVDGRKRTFYLSVCNPLPYIPGCQGSAVGSCLVSE GNSWNLGVVQMSPQAAANGSLSIMYVNGDKCGNQRFSTRITFECAQISGSPAFQLQDG CEYVFIWRTVEACPVVRVEGDNCEVKDPRHGNLYDLKPLGLNDTIVSAGEYTYYFRVC GKLSSDVCPTSDKSKVVSSCQEKREPQGFHKVAGLLTQKLTYENGLLKMNFTGGDTCH KVYQRSTAIFFYCDRGTQRPVFLKETSDCSYLFEWRTQYACPPFDLTECSFKDGAGNS FDLSSLSRYSDNWEAITGTGDPEHYLINVCKSLAPQAGTEPCPPEAAACLLGGSKPVN LGRVRDGPQWRDGIIVLKYVDGDLCPDGIRKKSTTIRFTCSESQVNSRPMFISAVEDC EYTFAWPTATACPMKSNEHDDCQVTNPSTGHLFDLSSLSGRAGFTAAYSEKGLVYMSI CGENENCPPGVGACFGQTRISVGKANKRLRYVDQVLQLVYKDGSPCPSKSGLSYKSVI SFVCRPEAGPTNRPMLISLDKQTCTLFFSWHTPLACEQATECSVRNGSSIVDLSPLIH RTGGYEAYDESEDDASDTNPDFYINICQPLNPMHAVPCPAGAAVCKVPIDGPPIDIGR VAGPPILNPIANEIYLNFESSTPCLADKHFNYTSLIAFHCKRGVSMGTPKLLRTSECD FVFEWETPVVCPDEVRMDGCTLTDEQLLYSFNLSSLSTSTFKVTRDSRTYSVGVCTFA VGPEQGGCKDGGVCLLSGTKGASFGRLQSMKLDYRHQDEAVVLSYVNGDRCPPETDDG VPCVFPFIFNGKSYEECIIESRAKLWCSTTADYDRDHEWGFCRHSNSYRTSSIIFKCD EDEDIGRPQVFSEVRGCDVTFEWKTKVVCPPKKLECKFVQKHKTYDLRLLSSLTGSWS LVHNGVSYYINLCQKIYKGPLGCSERASICRRTTTGDVQVLGLVHTQKLGVIGDKVVV TYSKGYPCGGNKTASSVIELTCTKTVGRPAFKRFDIDSCTYYFSWDSRAACAVKPQEV QMVNGTITNPINGKSFSLGDIYFKLFRASGDMRTNGDNYLYEIQLSSITSSRNPACSG ANICQVKPNDQHFSRKVGTSDKTKYYLQDGDLDVVFASSSKCGKDKTKSVSSTIFFHC DPLVEDGIPEFSHETADCQYLFSWYTSAVCPLGVGFDSENPGDDGQMHKGLSERSQAV GAVLSLLLVALTCCLLALLLYKKERRETVISKLTTCCRRSSNVSYKYSKVNKEEETDE NETEWLMEEIQLPPPRQGKEGQENGHITTKSVKALSSLHGDDQDSEDEVLTIPEVKVH SGRGAGAESSHPVRNAQSNALQEREDDRVGLVRGEKARKGKSSSAQQKTVSSTKLVSF HDDSDEDLLHI" mat_peptide 268..7620 /note="mature insuline-like growth factor II receptor (AA 1-2451)" misc_feature 3235..9090 /note="homologous to bovine mannose-6-phosphate receptor" misc_feature 5845..5973 /note="homologous with fibronectin type II structure" misc_feature 7060..7128 /note="protein transmembrane domain" misc_feature 7621..9090 /note="3' UT-region" BASE COUNT 2248 a 2238 c 2479 g 2125 t ORIGIN 1 cgagcccagt cgagccgcgc tcacctcggg ctcccgctcc gtctccacct ccgcctttgc 61 cctggcggcg cgaccccgtc ccggcgcggc ccccagcagt cgcgcgccgt tagcctcgcg 121 cccgccgcgc agtccgggcc cggcgcgatg ggggccgccg ccggccggag cccccacctg 181 gggcccgcgc ccgcccgccg cccgcagcgc tctctgctcc tgctgcagct gctgctgctc 241 gtcgctgccc cggggtccac gcaggcccag gccgccccgt tccccgagct gtgcagttat 301 acatgggaag ctgttgatac caaaaataat gtactttata aaatcaacat ctgtggaagt 361 gtggatattg tccagtgcgg gccatcaagt gctgtttgta tgcacgactt gaagacacgc 421 acttatcatt cagtgggtga ctctgttttg agaagtgcaa ccagatctct cctggaattc 481 aacacaacag tgagctgtga ccagcaaggc acaaatcaca gagtccagag cagcattgcc 541 ttcctgtgtg ggaaaaccct gggaactcct gaatttgtaa ctgcaacaga atgtgtgcac 601 tactttgagt ggaggaccac tgcagcctgc aagaaagaca tatttaaagc aaataaggag 661 gtgccatgct atgtgtttga tgaagagttg aggaagcatg atctcaatcc tctgatcaag 721 cttagtggtg cctacttggt ggatgactcc gatccggaca cttctctatt catcaatgtt 781 tgtagagaca tagacacact acgagaccca ggttcacagc tgcgggcctg tccccccggc 841 actgccgcct gcctggtaag aggacaccag gcgtttgatg ttggccagcc ccgggacgga 901 ctgaagctgg tgcgcaagga caggcttgtc ctgagttacg tgagggaaga ggcaggaaag 961 ctagactttt gtgatggtca cagccctgcg gtgactatta catttgtttg cccgtcggag 1021 cggagagagg gcaccattcc caaactcaca gctaaatcca actgccgcta tgaaattgag 1081 tggattactg agtatgcctg ccacagagat tacctggaaa gtaaaacttg ttctctgagc 1141 ggcgagcagc aggatgtctc catagacctc acaccacttg cccagagcgg aggttcatcc 1201 tatatttcag atggaaaaga atatttgttt tatttgaatg tctgtggaga aactgaaata 1261 cagttctgta ataaaaaaca agctgcagtt tgccaagtga aaaagagcga tacctctcaa 1321 gtcaaagcag caggaagata ccacaatcag accctccgat attcggatgg agacctcacc 1381 ttgatatatt ttggaggtga tgaatgcagc tcagggtttc agcggatgag cgtcataaac 1441 tttgagtgca ataaaaccgc aggtaacgat gggaaaggaa ctcctgtatt cacaggggag 1501 gttgactgca cctacttctt cacatgggac acggaatacg cctgtgttaa ggagaaggaa 1561 gacctcctct gcggtgccac cgacgggaag aagcgctatg acctgtccgc gctggtccgc 1621 catgcagaac cagagcagaa ttgggaagct gtggatggca gtcagacgga aacagagaag 1681 aagcattttt tcattaatat ttgtcacaga gtgctgcagg aaggcaaggc acgagggtgt 1741 cccgaggacg cggcagtgtg tgcagtggat aaaaatggaa gtaaaaatct gggaaaattt 1801 atttcctctc ccatgaaaga gaaaggaaac attcaactct cttattcaga tggtgatgat 1861 tgtggtcatg gcaagaaaat taaaactaat atcacacttg tatgcaagcc aggtgatctg 1921 gaaagtgcac cagtgttgag aacttctggg gaaggcggtt gcttttatga gtttgagtgg 1981 cgcacagctg cggcctgtgt gctgtctaag acagaagggg agaactgcac ggtctttgac 2041 tcccaggcag ggttttcttt tgacttatca cctctcacaa agaaaaatgg tgcctataaa 2101 gttgagacaa agaagtatga cttttatata aatgtgtgtg gcccggtgtc tgtgagcccc 2161 tgtcagccag actcaggagc ctgccaggtg gcaaaaagtg atgagaagac ttggaacttg 2221 ggtctgagta atgcgaagct ttcatattat gatgggatga tccaactgaa ctacagaggc 2281 ggcacaccct ataacaatga aagacacaca ccgagagcta cgctcatcac ctttctctgt 2341 gatcgagacg cgggagtggg cttccctgaa tatcaggaag aggataactc cacctacaac 2401 ttccggtggt acaccagcta tgcctgcccg gaggagcccc tggaatgcgt agtgaccgac 2461 ccctccacgc tggagcagta cgacctctcc agtctggcaa aatctgaagg tggccttgga 2521 ggaaactggt atgccatgga caactcaggg gaacatgtca cgtggaggaa atactacatt 2581 aacgtgtgtc ggcctctgaa tccagtgccg ggctgcaacc gatatgcatc ggcttgccag 2641 atgaagtatg aaaaagatca gggctccttc actgaagtgg tttccatcag taacttggga 2701 atggcaaaga ccggcccggt ggttgaggac agcggcagcc tccttctgga atacgtgaat 2761 gggtcggcct gcaccaccag cgatggcaga cagaccacat ataccacgag gatccatctc 2821 gtctgctcca ggggcaggct gaacagccac cccatctttt ctctcaactg ggagtgtgtg 2881 gtcagtttcc tgtggaacac agaggctgcc tgtcccattc agacaacgac ggatacagac 2941 caggcttgct ctataaggga tcccaacagt ggatttgtgt ttaatcttaa tccgctaaac 3001 agttcgcaag gatataacgt ctctggcatt gggaagattt ttatgtttaa tgtctgcggc 3061 acaatgcctg tctgtgggac catcctggga aaacctgctt ctggctgtga ggcagaaacc 3121 caaactgaag agctcaagaa ttggaagcca gcaaggccag tcggaattga gaaaagcctc 3181 cagctgtcca cagagggctt catcactctg acctacaaag ggcctctctc tgccaaaggt 3241 accgctgatg cttttatcgt ccgctttgtt tgcaatgatg atgtttactc agggcccctc 3301 aaattcctgc atcaagatat cgactctggg caagggatcc gaaacactta ctttgagttt 3361 gaaaccgcgt tggcctgtgt tccttctcca gtggactgcc aagtcaccga cctggctgga 3421 aatgagtacg acctgactgg cctaagcaca gtcaggaaac cttggacggc tgttgacacc 3481 tctgtcgatg ggagaaagag gactttctat ttgagcgttt gcaatcctct cccttacatt 3541 cctggatgcc agggcagcgc agtggggtct tgcttagtgt cagaaggcaa tagctggaat 3601 ctgggtgtgg tgcagatgag tccccaagcc gcggcgaatg gatctttgag catcatgtat 3661 gtcaacggtg acaagtgtgg gaaccagcgc ttctccacca ggatcacgtt tgagtgtgct 3721 cagatatcgg gctcaccagc atttcagctt caggatggtt gtgagtacgt gtttatctgg 3781 agaactgtgg aagcctgtcc cgttgtcaga gtggaagggg acaactgtga ggtgaaagac 3841 ccaaggcatg gcaacttgta tgacctgaag cccctgggcc tcaacgacac catcgtgagc 3901 gctggcgaat acacttatta cttccgggtc tgtgggaagc tttcctcaga cgtctgcccc 3961 acaagtgaca agtccaaggt ggtctcctca tgtcaggaaa agcgggaacc gcagggattt 4021 cacaaagtgg caggtctcct gactcagaag ctaacttatg aaaatggctt gttaaaaatg 4081 aacttcacgg ggggggacac ttgccataag gtttatcagc gctccacagc catcttcttc 4141 tactgtgacc gcggcaccca gcggccagta tttctaaagg agacttcaga ttgttcctac 4201 ttgtttgagt ggcgaacgca gtatgcctgc ccacctttcg atctgactga atgttcattc 4261 aaagatgggg ctggcaactc cttcgacctc tcgtccctgt caaggtacag tgacaactgg 4321 gaagccatca ctgggacggg ggacccggag cactacctca tcaatgtctg caagtctctg 4381 gccccgcagg ctggcactga gccgtgccct ccagaagcag ccgcgtgtct gctgggtggc 4441 tccaagcccg tgaacctcgg cagggtaagg gacggacctc agtggagaga tggcataatt 4501 gtcctgaaat acgttgatgg cgacttatgt ccagatggga ttcggaaaaa gtcaaccacc 4561 atccgattca cctgcagcga gagccaagtg aactccaggc ccatgttcat cagcgccgtg 4621 gaggactgtg agtacacctt tgcctggccc acagccacag cctgtcccat gaagagcaac 4681 gagcatgatg actgccaggt caccaaccca agcacaggac acctgtttga tctgagctcc 4741 ttaagtggca gggcgggatt cacagctgct tacagcgaga aggggttggt ttacatgagc 4801 atctgtgggg agaatgaaaa ctgccctcct ggcgtggggg cctgctttgg acagaccagg 4861 attagcgtgg gcaaggccaa caagaggctg agatacgtgg accaggtcct gcagctggtg 4921 tacaaggatg ggtccccttg tccctccaaa tccggcctga gctataagag tgtgatcagt 4981 ttcgtgtgca ggcctgaggc cgggccaacc aataggccca tgctcatctc cctggacaag 5041 cagacatgca ctctcttctt ctcctggcac acgccgctgg cctgcgagca agcgaccgaa 5101 tgttccgtga ggaatggaag ctctattgtt gacttgtctc cccttattca tcgcactggt 5161 ggttatgagg cttatgatga gagtgaggat gatgcctccg ataccaaccc tgatttctac 5221 atcaatattt gtcagccact aaatcccatg cacgcagtgc cctgtcctgc cggagccgct 5281 gtgtgcaaag ttcctattga tggtcccccc atagatatcg gccgggtagc aggaccacca 5341 atactcaatc caatagcaaa tgagatttac ttgaattttg aaagcagtac tccttgctta 5401 gcggacaagc atttcaacta cacctcgctc atcgcgtttc actgtaagag aggtgtgagc 5461 atgggaacgc ctaagctgtt aaggaccagc gagtgcgact ttgtgttcga atgggagact 5521 cctgtcgtct gtcctgatga agtgaggatg gatggctgta ccctgacaga tgagcagctc 5581 ctctacagct tcaacttgtc cagcctttcc acgagcacct ttaaggtgac tcgcgactcg 5641 cgcacctaca gcgttggggt gtgcaccttt gcagtcgggc cagaacaagg aggctgtaag 5701 gacggaggag tctgtctgct ctcaggcacc aagggggcat cctttggacg gctgcaatca 5761 atgaaactgg attacaggca ccaggatgaa gcggtcgttt taagttacgt gaatggtgat 5821 cgttgccctc cagaaaccga tgacggcgtc ccctgtgtct tccccttcat attcaatggg 5881 aagagctacg aggagtgcat catagagagc agggcgaagc tgtggtgtag cacaactgcg 5941 gactacgaca gagaccacga gtggggcttc tgcagacact caaacagcta ccggacatcc 6001 agcatcatat ttaagtgtga tgaagatgag gacattggga ggccacaagt cttcagtgaa 6061 gtgcgtgggt gtgatgtgac atttgagtgg aaaacaaaag ttgtctgccc tccaaagaag 6121 ttggagtgca aattcgtcca gaaacacaaa acctacgacc tgcggctgct ctcctctctc 6181 accgggtcct ggtccctggt ccacaacgga gtctcgtact atataaatct gtgccagaaa 6241 atatataaag ggcccctggg ctgctctgaa agggccagca tttgcagaag gaccacaact 6301 ggtgacgtcc aggtcctggg actcgttcac acgcagaagc tgggtgtcat aggtgacaaa 6361 gttgttgtca cgtactccaa aggttatccg tgtggtggaa ataagaccgc atcctccgtg 6421 atagaattga cctgtacaaa gacggtgggc agacctgcat tcaagaggtt tgatatcgac 6481 agctgcactt actacttcag ctgggactcc cgggctgcct gcgccgtgaa gcctcaggag 6541 gtgcagatgg tgaatgggac catcaccaac cctataaatg gcaagagctt cagcctcgga 6601 gatatttatt ttaagctgtt cagagcctct ggggacatga ggaccaatgg ggacaactac 6661 ctgtatgaga tccaactttc ctccatcaca agctccagaa acccggcgtg ctctggagcc 6721 aacatatgcc aggtgaagcc caacgatcag cacttcagtc ggaaagttgg aacctctgac 6781 aagaccaagt actaccttca agacggcgat ctcgatgtcg tgtttgcctc ttcctctaag 6841 tgcggaaagg ataagaccaa gtctgtttct tccaccatct tcttccactg tgaccctctg 6901 gtggaggacg ggatccccga gttcagtcac gagactgccg actgccagta cctcttctct 6961 tggtacacct cagccgtgtg tcctctgggg gtgggctttg acagcgagaa tcccggggac 7021 gacgggcaga tgcacaaggg gctgtcagaa cggagccagg cagtcggcgc ggtgctcagc 7081 ctgctgctgg tggcgctcac ctgctgcctg ctggccctgt tgctctacaa gaaggagagg 7141 agggaaacag tgataagtaa gctgaccact tgctgtagga gaagttccaa cgtgtcctac 7201 aaatactcaa aggtgaataa ggaagaagag acagatgaga atgaaacaga gtggctgatg 7261 gaagagatcc agctgcctcc tccacggcag ggaaaggaag ggcaggagaa cggccatatt 7321 accaccaagt cagtgaaagc cctcagctcc ctgcatgggg atgaccagga cagtgaggat 7381 gaggttctga ccatcccaga ggtgaaagtt cactcgggca ggggagctgg ggcagagagc 7441 tcccacccag tgagaaacgc acagagcaat gcccttcagg agcgtgagga cgatagggtg 7501 gggctggtca ggggtgagaa ggcgaggaaa gggaagtcca gctctgcaca gcagaagaca 7561 gtgagctcca ccaagctggt gtccttccat gacgacagcg acgaggacct cttacacatc 7621 tgactccgca gtgcctgcag gggagcacgg agccgcggga cagccaagca cctccaacca 7681 aataagactt ccactcgatg atgcttctat aattttgcct ttaacagaaa ctttcaaaag 7741 ggaagagttt ttgtgatggg ggagagggtg aaggaggtca ggccccactc cttcctgatt 7801 gtttacagtc attggaataa ggcatggctc agatcggcca cagggcggta ccttgtgccc 7861 agggttttgc cccaagtcct catttaaaag cataaggccg gacgcatctc aaaacagagg 7921 gctgcattcg aagaaaccct tgctgcttta gtcccgatag ggtatttgac cccgatatat 7981 tttagcattt taattctctc cccctattta ttgactttga caattactca ggtttgagaa 8041 aaaggaaaaa aaaacagcca ccgtttcttc ctgccagcag gggtgtgatg taccagtttg 8101 tccatcttga gatggtgagg ctgtcagtgt atggggcagc ttccggcggg atgttgaact 8161 ggtcattaat gtgtcccctg agttggagct cattctgtct cttttctctt ttgctttctg 8221 tttcttaagg gcacacacac gtgcgtgcga gcacacacac acatacgtgc acagggtccc 8281 cgagtgccta ggttttggag agtttgcctg ttctatgcct ttagtcagga atggctgcac 8341 ctttttgcat gatatcttca agcctgggcg tacagagcac atttgtcagt atttttgccg 8401 gctggtgaat tcaaacaacc tgcccaaaga ttgatttgtg tgtttgtgtg tgtgtgtgtg 8461 tgtgtgtgtg tgtgtgagtg gagttgaggt gtcagagaaa atgaattttt tccagatttg 8521 gggtataggt ctcatctctt caggttctca tgataccacc tttactgtgc ttattttttt 8581 aagaaaaaag tgttgatcaa ccattcgacc tataagaagc cttaatttgc acagtgtgtg 8641 acttacagaa actgcatgaa aaatcatggg ccagagcctc ggccctagca ttgcacttgg 8701 cctcatgctg gagggaggct gggcgggtac agcgcggagg aggagggagg ccaggcgggc 8761 atggcgtgga ggaggaggga ggccgggcgg tcacagcatg gaggaggagg gaggcgctgc 8821 tggtgttctt attctggcgg cagcgccttt cctgccatgt ttagtgaatg acttttctcg 8881 cattgtagaa ttgtatatag actctggtgt tctattgctg agaagcaaac cgccctgcag 8941 catccctcag cctgtaccgg tttggctggc ttgtttgatt tcaacatgag tgtatttttt 9001 aaaattgatt tttctcttca tttttttttc aatcaacttt actgtaatat aaagtattca 9061 acaatttcaa taaaagataa attattaaaa // LOCUS HSIGFIRR 4989 bp RNA PRI 09-MAY-1995 DEFINITION Human mRNA for insulin-like growth factor I receptor. ACCESSION X04434 M24599 NID g33058 KEYWORDS glycoprotein; insulin receptor; insulin-like growth factor I receptor; membrane glycoprotein; receptor; tyrosine kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4989) AUTHORS Ullrich,A., Gray,A., Tam,A.W., Yang-Feng,T., Tsubokawa,M., Collins,C., Henzel,W., Bon,T.L., Kathuria,S., Chen,E., Jakobs,S., Francke,U., Ramachandran,J. and Fujita-Yamaguchi,Y. TITLE Insulin-like growth factor I receptor primary structure: comparison with insulin receptor suggests structural determinants that define functional specificity JOURNAL EMBO J. 5 (10), 2503-2512 (1986) MEDLINE 87053815 FEATURES Location/Qualifiers source 1..4989 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="(lamda)gt10" /clone="(lambda)IGF-1-R.85, (lambda)IGF-1-R.76" sig_peptide 46..121 CDS 46..4149 /codon_start=1 /product="IGF-I receptor" /db_xref="PID:g804990" /db_xref="SWISS-PROT:P08069" /translation="MKSGSGGGSPTSLWGLLFLSAALSLWPTSGEICGPGIDIRNDYQ QLKRLENCTVIEGYLHILLISKAEDYRSYRFPKLTVITEYLLLFRVAGLESLGDLFPN LTVIRGWKLFYNYALVIFEMTNLKDIGLYNLRNITRGAIRIEKNADLCYLSTVDWSLI LDAVSNNYIVGNKPPKECGDLCPGTMEEKPMCEKTTINNEYNYRCWTTNRCQKMCPST CGKRACTENNECCHPECLGSCSAPDNDTACVACRHYYYAGVCVPACPPNTYRFEGWRC VDRDFCANILSAESSDSEGFVIHDGECMQECPSGFIRNGSQSMYCIPCEGPCPKVCEE EKKTKTIDSVTSAQMLQGCTIFKGNLLINIRRGNNIASELENFMGLIEVVTGYVKIRH SHALVSLSFLKNLRLILGEEQLEGNYSFYVLDNQNLQQLWDWDHRNLTIKAGKMYFAF NPKLCVSEIYRMEEVTGTKGRQSKGDINTRNNGERASCESDVLHFTSTTTSKNRIIIT WHRYRPPDYRDLISFTVYYKEAPFKNVTEYDGQDACGSNSWNMVDVDLPPNKDVEPGI LLHGLKPWTQYAVYVKAVTLTMVENDHIRGAKSEILYIRTNASVPSIPLDVLSASNSS SQLIVKWNPPSLPNGNLSYYIVRWQRQPQDGYLYRHNYCSKDKIPIRKYADGTIDIEE VTENPKTEVCGGEKGPCCACPKTEAEKQAEKEEAEYRKVFENFLHNSIFVPRPERKRR DVMQVANTTMSSRSRNTTAADTYNITDPEELETEYPFFESRVDNKERTVISNLRPFTL YRIDIHSCNHEAEKLGCSASNFVFARTMPAEGADDIPGPVTWEPRPENSIFLKWPEPE NPNGLILMYEIKYGSQVEDQRECVSRQEYRKYGGAKLNRLNPGNYTARIQATSLSGNG SWTDPVFFYVQAKTGYENFIHLIIALPVAVLLIVGGLVIMLYVFHRKRNNSRLGNGVL YASVNPEYFSAADVYVPDEWEVAREKITMSRELGQGSFGMVYEGVAKGVVKDEPETRV AIKTVNEAASMRERIEFLNEASVMKEFNCHHVVRLLGVVSQGQPTLVIMELMTRGDLK SYLRSLRPEMENNPVLAPPSLSKMIQMAGEIADGMAYLNANKFVHRDLAARNCMVAED FTVKIGDFGMTRDIYETDYYRKGGKGLLPVRWMSPESLKDGVFTTYSDVWSFGVVLWE IATLAEQPYQGLSNEQVLRFVMEGGLLDKPDNCPDMLFELMRMCWQYNPKMRPSFLEI ISSIKEEMEPGFREVSFYYSEENKLPEPEELDLEPENMESVPLDPSASSSSLPLPDRH SGHKAENGPGPGVLVLRASFDERQPYAHMNGGRKNERALPLPQSSTC" mat_peptide 122..4132 /note="IGF-I receptor" misc_feature 122..2251 /note="alpha-subunit (AA 1 - 710)" misc_feature 182..190 /note="pot.N-linked glycosylation site (AA 21 - 23)" misc_feature 335..343 /note="pot.N-linked glycostlation site (AA 72 - 74)" misc_feature 434..442 /note="pot.N-linked glycostlation site (AA 105 - 107)" misc_feature 761..769 /note="pot.N-linked glycostlation site (AA 214 - 216)" misc_feature 971..979 /note="pot.N-linked glycostlation site (AA 284 - 286)" misc_feature 1280..1288 /note="pot.N-linked glycostlation site (AA 387 - 389)" misc_feature 1343..1351 /note="pot.N-linked glycosylation site (AA 408 - 410)" misc_feature 1631..1639 /note="pot.N-linked glycostlation site (AA 504 - 506)" misc_feature 1850..1858 /note="pot.N-linked glycosylation site (AA 577 - 579)" misc_feature 1895..1903 /note="pot.N-linked glycosylation site (AA 592 - 594)" misc_feature 1949..1957 /note="pot.N-linked glycosylation site (AA 610 - 612)" misc_feature 2240..2251 /note="putative proreceptor processing site (AA 707 - 710)" misc_feature 2252..4132 /note="beta-subunit (AA 711 - 1337)" misc_feature 2270..2278 /note="pot.N-linked glycosylation site (AA 717 - 719]" misc_feature 2297..2305 /note="pot.N-linked glycosylation site (AA 726 - 728)" misc_feature 2321..2329 /note="pot.N-linked glycosylation site (AA 734 - 736)" misc_feature 2729..2737 /note="pot.N-linked glycosylation site (AA 870 - 872)" misc_feature 2768..2776 /note="pot.N-linked glycosylation site (AA 883 - 885)" misc_feature 2837..2908 /note="transmembrane region (AA 906 - 929)" misc_feature 2918..2926 /note="pot.N-linked glycosylation site (AA 933 - 935)" misc_feature 3047..3049 /note="pot.ATP binding site (AA 976)" misc_feature 3053..3055 /note="pot.ATP binding site (AA 978)" misc_feature 3062..3064 /note="pot.ATP binding site (AA 981)" misc_feature 3128..3130 /note="pot.ATP binding site (AA 1003)" BASE COUNT 1216 a 1371 c 1320 g 1082 t ORIGIN 1 tttttttttt ttttgagaaa gggaatttca tcccaaataa aaggaatgaa gtctggctcc 61 ggaggagggt ccccgacctc gctgtggggg ctcctgtttc tctccgccgc gctctcgctc 121 tggccgacga gtggagaaat ctgcgggcca ggcatcgaca tccgcaacga ctatcagcag 181 ctgaagcgcc tggagaactg cacggtgatc gagggctacc tccacatcct gctcatctcc 241 aaggccgagg actaccgcag ctaccgcttc cccaagctca cggtcattac cgagtacttg 301 ctgctgttcc gagtggctgg cctcgagagc ctcggagacc tcttccccaa cctcacggtc 361 atccgcggct ggaaactctt ctacaactac gccctggtca tcttcgagat gaccaatctc 421 aaggatattg ggctttacaa cctgaggaac attactcggg gggccatcag gattgagaaa 481 aatgctgacc tctgttacct ctccactgtg gactggtccc tgatcctgga tgcggtgtcc 541 aataactaca ttgtggggaa taagccccca aaggaatgtg gggacctgtg tccagggacc 601 atggaggaga agccgatgtg tgagaagacc accatcaaca atgagtacaa ctaccgctgc 661 tggaccacaa accgctgcca gaaaatgtgc ccaagcacgt gtgggaagcg ggcgtgcacc 721 gagaacaatg agtgctgcca ccccgagtgc ctgggcagct gcagcgcgcc tgacaacgac 781 acggcctgtg tagcttgccg ccactactac tatgccggtg tctgtgtgcc tgcctgcccg 841 cccaacacct acaggtttga gggctggcgc tgtgtggacc gtgacttctg cgccaacatc 901 ctcagcgccg agagcagcga ctccgagggg tttgtgatcc acgacggcga gtgcatgcag 961 gagtgcccct cgggcttcat ccgcaacggc agccagagca tgtactgcat cccttgtgaa 1021 ggtccttgcc cgaaggtctg tgaggaagaa aagaaaacaa agaccattga ttctgttact 1081 tctgctcaga tgctccaagg atgcaccatc ttcaagggca atttgctcat taacatccga 1141 cgggggaata acattgcttc agagctggag aacttcatgg ggctcatcga ggtggtgacg 1201 ggctacgtga agatccgcca ttctcatgcc ttggtctcct tgtccttcct aaaaaacctt 1261 cgcctcatcc taggagagga gcagctagaa gggaattact ccttctacgt cctcgacaac 1321 cagaacttgc agcaactgtg ggactgggac caccgcaacc tgaccatcaa agcagggaaa 1381 atgtactttg ctttcaatcc caaattatgt gtttccgaaa tttaccgcat ggaggaagtg 1441 acggggacta aagggcgcca aagcaaaggg gacataaaca ccaggaacaa cggggagaga 1501 gcctcctgtg aaagtgacgt cctgcatttc acctccacca ccacgtcgaa gaatcgcatc 1561 atcataacct ggcaccggta ccggccccct gactacaggg atctcatcag cttcaccgtt 1621 tactacaagg aagcaccctt taagaatgtc acagagtatg atgggcagga tgcctgcggc 1681 tccaacagct ggaacatggt ggacgtggac ctcccgccca acaaggacgt ggagcccggc 1741 atcttactac atgggctgaa gccctggact cagtacgccg tttacgtcaa ggctgtgacc 1801 ctcaccatgg tggagaacga ccatatccgt ggggccaaga gtgagatctt gtacattcgc 1861 accaatgctt cagttccttc cattcccttg gacgttcttt cagcatcgaa ctcctcttct 1921 cagttaatcg tgaagtggaa ccctccctct ctgcccaacg gcaacctgag ttactacatt 1981 gtgcgctggc agcggcagcc tcaggacggc tacctttacc ggcacaatta ctgctccaaa 2041 gacaaaatcc ccatcaggaa gtatgccgac ggcaccatcg acattgagga ggtcacagag 2101 aaccccaaga ctgaggtgtg tggtggggag aaagggcctt gctgcgcctg ccccaaaact 2161 gaagccgaga agcaggccga gaaggaggag gctgaatacc gcaaagtctt tgagaatttc 2221 ctgcacaact ccatcttcgt gcccagacct gaaaggaagc ggagagatgt catgcaagtg 2281 gccaacacca ccatgtccag ccgaagcagg aacaccacgg ccgcagacac ctacaacatc 2341 accgacccgg aagagctgga gacagagtac cctttctttg agagcagagt ggataacaag 2401 gagagaactg tcatttctaa ccttcggcct ttcacattgt accgcatcga tatccacagc 2461 tgcaaccacg aggctgagaa gctgggctgc agcgcctcca acttcgtctt tgcaaggact 2521 atgcccgcag aaggagcaga tgacattcct gggccagtga cctgggagcc aaggcctgaa 2581 aactccatct ttttaaagtg gccggaacct gagaatccca atggattgat tctaatgtat 2641 gaaataaaat acggatcaca agttgaggat cagcgagaat gtgtgtccag acaggaatac 2701 aggaagtatg gaggggccaa gctaaaccgg ctaaacccgg ggaactacac agcccggatt 2761 caggccacat ctctctctgg gaatgggtcg tggacagatc ctgtgttctt ctatgtccag 2821 gccaaaacag gatatgaaaa cttcatccat ctgatcatcg ctctgcccgt cgctgtcctg 2881 ttgatcgtgg gagggttggt gattatgctg tacgtcttcc atagaaagag aaataacagc 2941 aggctgggga atggagtgct gtatgcctct gtgaacccgg agtacttcag cgctgctgat 3001 gtgtacgttc ctgatgagtg ggaggtggct cgggagaaga tcaccatgag ccgggaactt 3061 gggcaggggt cgtttgggat ggtctatgaa ggagttgcca agggtgtggt gaaagatgaa 3121 cctgaaacca gagtggccat taaaacagtg aacgaggccg caagcatgcg tgagaggatt 3181 gagtttctca acgaagcttc tgtgatgaag gagttcaatt gtcaccatgt ggtgcgattg 3241 ctgggtgtgg tgtcccaagg ccagccaaca ctggtcatca tggaactgat gacacggggc 3301 gatctcaaaa gttatctccg gtctctgagg ccagaaatgg agaataatcc agtcctagca 3361 cctccaagcc tgagcaagat gattcagatg gccggagaga ttgcagacgg catggcatac 3421 ctcaacgcca ataagttcgt ccacagagac cttgctgccc ggaattgcat ggtagccgaa 3481 gatttcacag tcaaaatcgg agattttggt atgacgcgag atatctatga gacagactat 3541 taccggaaag gaggcaaagg gctgctgccc gtgcgctgga tgtctcctga gtccctcaag 3601 gatggagtct tcaccactta ctcggacgtc tggtccttcg gggtcgtcct ctgggagatc 3661 gccacactgg ccgagcagcc ctaccagggc ttgtccaacg agcaagtcct tcgcttcgtc 3721 atggagggcg gccttctgga caagccagac aactgtcctg acatgctgtt tgaactgatg 3781 cgcatgtgct ggcagtataa ccccaagatg aggccttcct tcctggagat catcagcagc 3841 atcaaagagg agatggagcc tggcttccgg gaggtctcct tctactacag cgaggagaac 3901 aagctgcccg agccggagga gctggacctg gagccagaga acatggagag cgtccccctg 3961 gacccctcgg cctcctcgtc ctccctgcca ctgcccgaca gacactcagg acacaaggcc 4021 gagaacggcc ccggccctgg ggtgctggtc ctccgcgcca gcttcgacga gagacagcct 4081 tacgcccaca tgaacggggg ccgcaagaac gagcgggcct tgccgctgcc ccagtcttcg 4141 acctgctgat ccttggatcc tgaatctgtg caaacagtaa cgtgtgcgca cgcgcagcgg 4201 ggtggggggg gagagagagt tttaacaatc cattcacaag cctcctgtac ctcagtggat 4261 cttcagttct gcccttgctg cccgcgggag acagcttctc tgcagtaaaa cacatttggg 4321 atgttccttt tttcaatatg caagcagctt tttattccct gcccaaaccc ttaactgaca 4381 tgggccttta agaaccttaa tgacaacact taatagcaac agagcacttg agaaccagtc 4441 tcctcactct gtccctgtcc ttccctgttc tccctttctc tctcctctct gcttcataac 4501 ggaaaaataa ttgccacaag tccagctggg aagccctttt tatcagtttg aggaagtggc 4561 tgtccctgtg gccccatcca accactgtac acacccgcct gacaccgtgg gtcattacaa 4621 aaaaacacgt ggagatggaa atttttacct ttatctttca cctttctagg gacatgaaat 4681 ttacaaaggg ccatcgttca tccaaggctg ttaccatttt aacgctgcct aattttgcca 4741 aaatcctgaa ctttctccct catcggcccg gcgctgattc ctcgtgtccg gaggcatggg 4801 tgagcatggc agctggttgc tccatttgag agacacgctg gcgacacact ccgtccatcc 4861 gactgcccct gctgtgctgc tcaaggccac aggcacacag gtctcattgc ttctgactag 4921 attattattt gggggaactg gacacaatag gtctttctct cagtgaaggt ggggagaagc 4981 tgaaccggc // LOCUS HSIGG4FCA 768 bp RNA PRI 21-JAN-1994 DEFINITION H.sapiens mRNA for Immunoglobulin G1, Fc fragment. ACCESSION X70421 NID g33068 KEYWORDS immunoglobulin; protein A binding. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 768) AUTHORS Filpula,D. TITLE H. sapiens mRNA for immunoglobulin G1, Fc fragment JOURNAL Unpublished REFERENCE 2 (bases 1 to 768) AUTHORS Filpula,D.R. TITLE Direct Submission JOURNAL Submitted (10-FEB-1993) D.R. Filpula, Enzon Labs, 16020 Industrial Drive, Gaithersburg, Maryland 20877, USA FEATURES Location/Qualifiers source 1..768 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 1..63 /note="OmpA signal peptide of E.coli" CDS 1..768 /codon_start=1 /product="IgG1 Fc fragment" /db_xref="PID:g33069" /translation="MKKTAIAIAVALAGFATVAQADVESKSCDKTHTCPPCPAPELLG GPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREE QYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLP PSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKL TVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK" misc_feature 64..69 /note="synthetic DNA for AatII site" misc_feature 70..114 /note="Ig hinge region" misc_feature 115..444 /note="Ig CH2 region" misc_feature 445..765 /note="Ig CH3 region" BASE COUNT 190 a 246 c 205 g 127 t ORIGIN 1 atgaaaaaga cagctatcgc gattgcagtg gcactggctg gtttcgctac cgtagcgcag 61 gccgacgtcg agtccaaatc ttgtgacaaa actcacacat gcccaccgtg cccagcacct 121 gaactcctgg ggggaccgtc agtcttcctc ttccccccaa aacccaagga caccctcatg 181 atctcccgga cccctgaggt cacatgcgtg gtggtggacg tgagccacga agaccctgag 241 gtcaagttca actggtacgt ggacggcgtg gaggtgcata atgccaagac aaagccgcgg 301 gaggagcagt acaacagcac gtaccgtgtg gtcagcgtcc tcaccgtcct gcaccaggac 361 tggctgaatg gcaaggagta caagtgcaag gtctccaaca aagccctccc agcccccatc 421 gagaaaacca tctccaaagc caaagggcag ccccgagagc cacaggtgta caccctgccc 481 ccatcccggg atgagctgac caagaaccag gtcagcctga cctgcctggt caaaggcttc 541 tatcccagcg acatcgccgt ggagtgggag agcaatgggc agccggagaa caactacaag 601 accacgcctc ccgtgctgga ctccgacggc tccttcttcc tctacagcaa gctcaccgtg 661 gacaagagca ggtggcagca ggggaacgtc ttctcatgct ccgtgatgca tgaggctctg 721 cacaaccact acacgcagaa gagcctctcc ctgtctccgg gtaaatga // LOCUS HSIGLV 870 bp RNA PRI 27-NOV-1995 DEFINITION Human mRNA for Ig lambda-chain. ACCESSION X14583 NID g33394 KEYWORDS Ig light chain; immunoglobulin; lambda-immunoglobulin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 870) AUTHORS Kishimoto,T. TITLE Direct Submission JOURNAL Submitted (03-MAR-1989) Kishimoto T., Yoshitomi Pharmaceutical Industries Ltd, Research Labs, 7-25 Koyata 3-chome, Iruma Shi, Saitama, 358 Japan REFERENCE 2 (bases 1 to 414) AUTHORS Kishimoto,T., Okajima,H., Okumoto,T. and Taniguchi,M. TITLE Nucleotide sequences of the cDNAs encoding the V-regions of H- and L-chains of a human monoclonal antibody with broad reactivity to malignant tumor cells JOURNAL Nucleic Acids Res. 17 (11), 4385 (1989) MEDLINE 89296497 COMMENT hybridoma; clone=4G12 L6 Data kindly reviewed (03-JUL-1989) by Kishimoto T. FEATURES Location/Qualifiers source 1..870 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="lymph node" /cell_type="lymphocyte" /cell_line="4G12" sig_peptide 25..84 /note="signal peptide (AA -20 to -1)" CDS 25..732 /codon_start=1 /product="lambda-chain precursor (AA -20 to 215)" /db_xref="PID:g33395" /translation="MTCSPLLLTLLIHCTGSWAQSVLTQPPSVSAAPGQKVTISCSGS SSNIGNNYVSWYQQLPGTAPKLLIYDNNKRPSGIPDRFSGSKSGTSATLGITGLQTGD EADYYCGTWDSSLSAGVFGGGTKLTVLGQPKAAPSVTLFPPSSEELQANKATLVCLIS DFYPGAVTVAWKADSSPVKAGVETTTPSKQSNNKYAASSYLSLTPEQWKSHRSYSCQV THEGSTVEKTVAPTECS" misc_feature 85..375 /note="V region" misc_feature 376..414 /note="J region" misc_feature 415..729 /note="C region" BASE COUNT 206 a 285 c 213 g 166 t ORIGIN 1 ccgaatttcg ggacaatctt catcatgacc tgctcccctc tcctcctcac ccttctcatt 61 cactgcacag ggtcctgggc ccagtctgtg ttgacgcagc cgccctcagt gtctgcggcc 121 ccaggacaga aggtcaccat ctcctgctct ggaagcagct ccaacattgg gaataattat 181 gtatcctggt accagcagct cccaggaaca gcccccaaac tcctcattta tgacaataat 241 aagcgaccct cagggattcc tgaccgattc tctggctcca agtctggcac gtcagccacc 301 ctgggcatca ccggactcca gactggggac gaggccgatt attactgcgg aacatgggat 361 agcagcctga gtgctggggt attcggcgga gggaccaagc tgaccgtcct aggtcagccc 421 aaggctgccc cctcggtcac tctgttcccg ccctcctctg aggagcttca agccaacaag 481 gccacactgg tgtgtctcat aagtgacttc tacccgggag ccgtgacagt ggcctggaag 541 gcagatagca gccccgtcaa ggcgggagtg gagaccacca caccctccaa acaaagcaac 601 aacaagtacg cggccagcag ctatctgagc ctgacgcctg agcagtggaa gtcccacaga 661 agctacagct gccaggtcac gcatgaaggg agcaccgtgg agaagacagt ggcccctaca 721 gaatgttcat aggttctaaa ccctcacccc ccccacggga gactagagct gcaggatccc 781 aggggagggg tctctcctcc caccccaagg catcaagccc ttctccctgc actcaataaa 841 ccctcaataa atattctcat tgtcaatcag // LOCUS HSIGM201 2213 bp RNA PRI 03-APR-1995 DEFINITION Human mRNA for IgM heavy chain complete sequence. ACCESSION X17115 NID g33450 KEYWORDS Ig heavy chain; IgM gene; IgM heavy chain; transmembrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2213) AUTHORS Friedlander,R.M. TITLE Direct Submission JOURNAL Submitted (03-NOV-1989) Friedlander R. M., Harvard Medical School, Howard Hughes Medical Institute, Department of Genetics, 25 Shattuck St, Boston, MA 02115, USA REFERENCE 2 (bases 1 to 2213) AUTHORS Friedlander,R.M., Nussenzweig,M.C. and Leder,P. TITLE Complete nucleotide sequence of the membrane form of the human IgM heavy chain JOURNAL Nucleic Acids Res. 18 (14), 4278 (1990) MEDLINE 90332450 REFERENCE 3 (bases 1 to 2213) AUTHORS Kristensen,T., Lopez,R. and Prydz,H. TITLE An estimate of the sequencing error frequency in the DNA sequence databases JOURNAL DNA Seq. 2 (6), 343-346 (1992) MEDLINE 93075997 REMARK Erratum:[DNA Seq 1993;3(5):337]] COMMENT For genomic sequence see , and . The author reports various conflicts with these sequences. Data kindly reviewed (30-MAY-1990) by Friedlander R.M. FEATURES Location/Qualifiers source 1..2213 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="lymphoid" /cell_type="B" /cell_line="lymphoma 201" /clone="201-203" misc_feature 1..39 /note="putative VECTOR sequence Bluescript SKP+" /citation=[3] CDS 73..1956 /note="precursor (AA -15 to 612)" /codon_start=1 /db_xref="PID:g33451" /db_xref="SWISS-PROT:P01871" /db_xref="SWISS-PROT:P20769" /translation="MDWTWRFLFVVAAATGVQSQVQLVQSGAEVKKPGSSVKVSCKAS GGTFSSYAISWVRQAPGQGLEWMGGIIPIFGTANYAQKFQGRVTITADESTSTAYMEL SSLRSEDTAVYYCAKTGILGPYSSGWYPNSDYYYYGMDVWGQGTTVTVSSGSASAPTL FPLVSCENSPSDTSSVAVGCLAQDFLPDSITFSWKYKNNSDISSTRGFPSVLRGGKYA ATSQVLLPSKDVMQGTDEHVVCKVQHPNGNKEKNVPLPVIAELPPKVSVFVPPRDGFF GNPRSKSKLICQATGFSPRQIQVSWLREGKQVGSGVTTDQVQAEAKESGPTTYKVTST LTIKESDWLSQSMFTCRVDHRGLTFQQNASSMCVPDQDTAIRVFAIPPSFASIFLTKS TKLTCLVTDLTTYDSVTISWTRQNGEAVKTHTNISESHPNATFSAVGEASICEDDWNS GERFTCTVTHTDLPSPLKQTISRPKGVALHRPDVYLLPPAREQLNLRESATITCLVTG FSPADVFVQWMQRGQPLSPEKYVTSAPMPEPQAPGRYFAHSILTVSEEEWNTGETYTC VVAHEALPNRVTERTVDKSTEGEVSADEEGFENLWATASTFIVLFLLSLFYSTTVTLF KVK" sig_peptide 73..117 /note="leader peptide (AA -15 to -1)" mat_peptide 118..1953 /note="IgM heavy chain (AA 1 to 612)" polyA_site 2213 /note="polyA site" BASE COUNT 462 a 708 c 629 g 414 t ORIGIN 1 gctctagaac tagtggatcc cccgggctgc aggaattctc taaagaagcc cctgggagca 61 cagctcatca ccatggactg gacctggagg ttcctctttg tggtggcagc agctacaggt 121 gtccagtccc aggtgcagct ggtgcagtct ggggctgagg tgaagaagcc tgggtcctcg 181 gtgaaggtct cctgcaaggc ttctggaggc accttcagca gctatgctat cagctgggtg 241 cgacaggccc ctggacaagg gcttgagtgg atgggaggga tcatccctat ctttggtaca 301 gcaaactacg cacagaagtt ccagggcaga gtcacgatta ccgcggacga atccacgagc 361 acagcctaca tggagctgag cagcctgaga tctgaggaca cggccgtgta ttactgtgcg 421 aaaaccggga tcctggggcc gtatagcagt ggctggtacc cgaactcgga ctactactac 481 tacggtatgg acgtctgggg ccaagggacc acggtcaccg tctcctcagg gagtgcatcc 541 gccccaaccc ttttccccct cgtctcctgt gagaattccc cgtcggatac gagcagcgtg 601 gccgttggct gcctcgcaca ggacttcctt cccgactcca tcactttctc ctggaaatac 661 aagaacaact ctgacatcag cagcacccgg ggcttcccat cagtcctgag agggggcaag 721 tacgcagcca cctcacaggt gctgctgcct tccaaggacg tcatgcaggg cacagacgaa 781 cacgtggtgt gcaaagtcca gcaccccaac ggcaacaaag aaaagaacgt gcctcttcca 841 gtgattgctg agctgcctcc caaagtgagc gtcttcgtcc caccccgcga cggcttcttc 901 ggcaaccccc gcagcaagtc caagctcatc tgccaggcca cgggtttcag tccccggcag 961 attcaggtgt cctggctgcg cgaggggaag caggtggggt ctggcgtcac cacggaccag 1021 gtgcaggctg aggccaaaga gtctgggccc acgacctaca aggtgaccag cacactgacc 1081 atcaaagaga gcgactggct cagccagagc atgttcacct gccgcgtgga tcacaggggc 1141 ctgaccttcc agcagaatgc gtcctccatg tgtgtccccg atcaagacac agccatccgg 1201 gtcttcgcca tccccccatc ctttgccagc atcttcctca ccaagtccac caagttgacc 1261 tgcctggtca cagacctgac cacctatgac agcgtgacca tctcctggac ccgccagaat 1321 ggcgaagctg tgaaaaccca caccaacatc tccgagagcc accccaatgc cactttcagc 1381 gccgtgggtg aggccagcat ctgcgaggat gactggaatt ccggggagag gttcacgtgc 1441 accgtgaccc acacagacct gccctcgcca ctgaagcaga ccatctcccg gcccaagggg 1501 gtggccctgc acaggcccga tgtctacttg ctgccaccag cccgggagca gctgaacctg 1561 cgggagtcgg ccaccatcac gtgcctggtg acgggcttct ctcccgcgga cgtcttcgtg 1621 cagtggatgc agagggggca gcccttgtcc ccggagaagt atgtgaccag cgccccaatg 1681 cctgagcccc aggccccagg ccggtacttc gcccacagca tcctgaccgt gtccgaagag 1741 gaatggaaca cgggggagac ctacacctgc gtggtggccc atgaggccct gcccaacagg 1801 gtcaccgaga ggaccgtgga caagtccacc gagggggagg tgagcgccga cgaggagggc 1861 tttgagaacc tgtgggccac cgcctccacc ttcatcgtcc tcttcctcct gagcctcttc 1921 tacagtacca ccgtcacctt gttcaaggtg aaatgatccc aacagaagaa catcggagac 1981 cagagagagg aactcaaagg ggcgctgcct ccgggtctgg ggtcctggcc tgcgtggcct 2041 gttggcacgt gtttctcttc ccgcccggcc tccagttgtg tgctctcaca caggcttcct 2101 tctcgaccgg caggggctgg ctggcttgca ggccacgagg tgggctctac cccacactgc 2161 tttgctgtgt atacgcttgt tgccctgaaa taaatatgca cattttatcc atg // LOCUS HSIGRFX 3086 bp RNA PRI 25-MAY-1993 DEFINITION H.sapiens gene for MHC class II regulatory factor RFX. ACCESSION X58964 NID g311362 KEYWORDS major histocompatibility complex class II. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3086) AUTHORS Reith,W., Herrero-Sanchez,C., Kobr,M., Silacci,P., Berte,C., Barras,E., Fey,S. and Mach,B. TITLE MHC class II regulatory factor RFX has a novel DNA-binding domain and a functionally independent dimerization domain JOURNAL Genes Dev. 4 (9), 1528-1540 (1990) MEDLINE 91071581 FEATURES Location/Qualifiers source 1..3086 /organism="Homo sapiens" /db_xref="taxon:9606" gene 94..3033 /gene="RFX" CDS 94..3033 /gene="RFX" /codon_start=1 /product="MHC class II regulatory factor RFX" /db_xref="PID:g33568" /db_xref="SWISS-PROT:P22670" /translation="MATQAYTELQAAPPPSQPPQAPPQAQPQPPPPPPPAAPQPPQPP TAAATPQPQYVTELQSPQPQAQPPGGQKQYVTELPAVPAPSQPTGAPTPSPAPQQYIV VTVSEGAMRASETVSEASPGSTASQTGVPTQVVQQVQGTQQRLLVQTSVQAKPGHVSP LQLTNIQVPQQALPTQRLVVQSAAPGSKGGQVSLTVHGTQQVHSPPEQSPVQANSSSS KTAGAPTGTVPQQLQVHGVQQSVPVTQERSVVQATPQAPKPGPVQPLTVQGLQPVHVA QEVQQLQQVPVPHVYSSQVQYVEGGDASYTASAIRSSTYSYPETPLYTQTASTSYYEA AGTATQVSTPATSQAVASSGSMPMYVSGSQVVASSASTGAGASNSSGGGGSGGGGGGG GGGGGGGSGSTGGGGSGAGTYVIQGGYMLGSASQSYSHTTRASPATVQWLLDNYETAE GVSLPRSTLYCHYLLHCQEQKLEPVNAASFGKLIRSVFMGLRTRRLGTRGNSKYHYYG LRIKASSPLLRLMEDQQHMAMRGQPFSQKQRLKPIQKMEGMTNGVAVGQQPSTGLSDI SAQVQQYQQFLDASRSLPDFTELDLQGKVLPEGVGPGDIKAFQVLYREHCEAIVDVMV NLQFTLVETLWKTFWRYNLSQPSEAPPLAVHDEAEKRLPKAILVLLSKFEPVLQWTKH CDNVLYQGLVEILIPDVLRPIPSALTQAIRNFAKSLESWLTHAMVNIPEEMLRVKVAA AGAFAQTLRRYTSLNHLAQAARAVLQNTAQINQMLSDLNRVDFANVQEQASWVCRCED RVVQRLEQDFKVTLQQQNSLEQWAAWLDGVVSQVLKPYQGSAGFPKAAKLFLLKWSFY SSMVIRDLTLRSAASFGSFHLIRLLYDEYMYYLIEHRVAQAKGETPIAVMGEFANLAT SLNPLDPDKDEEEEEEEESEDELPQDISLAAGGESPALGPETLEPPAKLARTDARGLF VQALPSS" BASE COUNT 603 a 1107 c 937 g 439 t ORIGIN 1 gcgaaaagcg tttccgcaga cagaagtggg gagaagcgga ggaattaaaa aaaaaaaaaa 61 gccttattta ttatcatttt ccccaccgtt ggcatggcaa cacaggcgta tactgagcta 121 caggcagccc cgccaccatc ccagccgcca caggccccgc cacaagccca gccccagccg 181 ccaccgccac cacccccagc ggcaccccag cccccgcagc cacccaccgc tgctgccacc 241 cctcagcccc aatatgtcac cgagctgcag agcccccagc cccaggcaca gccaccgggt 301 ggccagaagc agtacgtgac ggagctcccg gctgtacccg caccctcgca gccaaccggt 361 gcacccaccc cttcgcctgc accccagcag tacatcgtgg tcactgtctc tgaaggtgcc 421 atgcgggcca gcgagacagt gtcggaggcc agccccggct ccaccgccag ccagaccggc 481 gttcctactc aggtggttca gcaggtgcag ggcacccagc agcggctgct ggtccagacg 541 agcgtgcagg ccaagccagg ccacgtgtcg cccctccagc tgaccaacat ccaagtgccc 601 cagcaggctc ttcccacgca gcgtctggtg gtgcagagcg cagccccagg cagcaaaggt 661 ggccaggtct ccctgacggt ccatggtacc cagcaggtgc actcgccccc agagcagtcg 721 ccggtgcagg ccaacagctc ttccagcaag acagccgggg cccccacggg cacagtgcca 781 cagcagctgc aggtccacgg cgtccagcag agtgtccccg tcacccaaga gagatctgtg 841 gtccaggcca ctccacaagc gcccaaaccc ggcccggtgc agccgctgac cgtgcagggc 901 ctccagccag tccacgtggc tcaagaggtg cagcagctcc agcaggtgcc cgtcccacac 961 gtgtactcca gccaggtgca gtatgtggag ggcggcgatg ccagctacac ggccagtgcc 1021 atccgttcca gcacctactc ctatcccgag acgccgctgt acacgcagac ggcaagcacc 1081 agctactacg aggccgcagg cacggccacc caggtcagca cccccgccac ctcccaggcg 1141 gtggccagca gtggctccat gcccatgtac gtgtccggca gccaggtcgt cgccagctcc 1201 gccagcactg gggctggggc cagcaacagc agcggaggtg gtggcagtgg tggtggcggc 1261 ggcggcgggg gaggcggtgg cgggggtggc agtggcagca ccggaggcgg cggcagcgga 1321 gcaggcacct acgtgatcca aggcggctac atgctgggca gtgccagcca gtcttactct 1381 cacaccaccc gtgcctcgcc agccacggtc cagtggctcc tggacaacta tgagacggct 1441 gagggcgtga gtctgccacg gagcaccctc tactgccact acttactgca ctgccaggag 1501 cagaagctgg agcccgtcaa cgccgcctcc ttcggcaagc tcatccgctc cgtcttcatg 1561 ggcctgcgaa cccgccgtct gggcaccagg ggcaactcca agtaccacta ctatggcctg 1621 cgcatcaagg ccagctcacc cctgctgcgg ctgatggagg accagcagca catggccatg 1681 cggggccagc ccttctcgca gaagcagagg ctcaagccca tccagaagat ggaaggcatg 1741 accaacggcg tggcggtggg gcagcagccg agcacggggc tgtcggacat cagcgcccag 1801 gtgcagcagt accagcaatt tttggatgcc tctcggagcc tccctgactt cacagagctc 1861 gacctccagg gcaaggtgct gcctgagggc gtcgggcccg gggacatcaa agccttccag 1921 gtcctgtacc gggaacactg tgaggccatt gtcgacgtca tggtgaacct gcagttcacc 1981 ctggtggaga cgctgtggaa gaccttctgg aggtacaacc tcagccagcc cagtgaggcg 2041 ccaccgctgg ctgtacatga cgaggccgag aagcgactgc ccaaagccat cctggtgctc 2101 ctctccaagt tcgagcccgt gctccaatgg accaagcact gtgacaacgt gctgtaccag 2161 ggcctggtgg aaatcctcat tcccgacgtg ctgcggccca tccccagtgc cttgacccaa 2221 gcgatccgga actttgccaa gagcctggag agctggctca cccacgccat ggtcaacatc 2281 cccgaggaga tgctgcgggt gaaggtggcc gcggctggcg ccttcgcgca gacactgcgg 2341 cgctacacgt cgctcaacca cctggcgcag gcggcgcgcg ctgtgctgca gaacaccgca 2401 cagatcaacc agatgctgag cgacctcaac cgcgtggact tcgccaacgt gcaggagcag 2461 gcctcgtggg tgtgccgctg cgaggaccgc gtggtgcagc ggctggagca ggacttcaag 2521 gtgacgctgc agcagcagaa ctcgctggag cagtgggcgg cctggctgga cggcgtggtg 2581 agccaggtgc tcaagcccta ccagggcagc gccggcttcc ccaaggccgc caagctcttc 2641 ctcctcaagt ggtccttcta cagctccatg gtgatccggg acctgaccct gcgcagcgcc 2701 gccagcttcg gttccttcca cctcatccgg ctgctctacg acgagtacat gtactacctg 2761 atcgagcacc gcgtagccca ggccaagggc gagaccccca tcgccgtcat gggcgagttc 2821 gccaatctgg ccacctccct gaaccccctg gaccccgaca aagacgagga ggaagaagag 2881 gaggaggaga gcgaggacga gctgccgcag gacatctcac tggcggctgg cggcgagtca 2941 cccgcgctgg gcccggagac cctggagccg ccggccaagc tggcgcggac tgacgcgcgc 3001 ggcctcttcg tgcaggcgct gccctccagc taagcccttg gcctccccgc cccacccgcc 3061 cccgccaccc ctccacgcca gggtcc // LOCUS HSIGVH001 967 bp RNA PRI 12-SEP-1993 DEFINITION Human CLL-12 transcript of unrearranged immunoglobulin V(H)5 gene. ACCESSION X58397 NID g33615 KEYWORDS Ig heavy chain; Ig variable region; immunoglobulin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 967) AUTHORS Berman,J.E. TITLE Direct Submission JOURNAL Submitted (11-MAR-1991) J.E. Berman, Bressler Bldg Rm 13-041, University of Maryland School of Med, Dept of Microbiol and Immunology, 655 West Baltimore Str, Baltimore MD 21201, USA REFERENCE 2 (bases 1 to 967) AUTHORS Berman,J.E., Humphries,C.G., Barth,J., Alt,F.W. and Tucker,P.W. TITLE Structure and expression of human germline VH transcripts JOURNAL J. Exp. Med. 173 (6), 1529-1535 (1991) MEDLINE 91237299 REFERENCE 3 (bases 1 to 967) AUTHORS Richardson,A.L., Humphries,C.G. and Tucker,P.W. TITLE Molecular cloning and characterization of the t(2;14) translocation associated with childhood chronic lymphocytic leukemia JOURNAL Oncogene 7 (5), 961-970 (1992) MEDLINE 92237023 COMMENT see also: X07178; X06907. FEATURES Location/Qualifiers source 1..967 /organism="Homo sapiens" /isolate="CLL-12" /db_xref="taxon:9606" /germline /cell_type="B-cell" /cell_line="CLL-12" /chromosome="14" mRNA 1..967 /gene="immunoglobulin heavy chain" /note="cDNA" gene 1..967 /gene="immunoglobulin heavy chain" CDS 40..426 /gene="immunoglobulin heavy chain" /note="variable region V251 from V(H)5 gene" /codon_start=1 /db_xref="PID:g33616" /translation="MGSTAILALLLAVLQGVCAEVQLVQSGAEVKKPGESLKISCKGS GYSFTSYWIGWVRQMPGKGLEWMGIIYPGDSDTRYSPSFQGQVTISADKSISTAYLQW SSLKASDTAMYYCARHTVRETSPEPV" misc_feature 40..423 /gene="immunoglobulin heavy chain" /note="variable region V251 from V(H)5 gene" misc_feature 393..399 /gene="immunoglobulin heavy chain" /note="heptamer" misc_feature 422..430 /gene="immunoglobulin heavy chain" /note="nonamer" BASE COUNT 238 a 243 c 222 g 264 t ORIGIN 1 agctggatct cagggcttca ttttctgtcc tccaccatca tggggtcaac cgccatcctc 61 gccctcctcc tggctgttct ccaaggagtc tgtgccgagg tgcagctggt gcagtctgga 121 gcagaggtga aaaagcccgg ggagtctctg aagatctcct gtaagggttc tggatacagc 181 tttaccagct actggatcgg ctgggtgcgc cagatgcccg ggaaaggcct ggagtggatg 241 gggatcatct atcctggtga ctctgatacc agatacagcc cgtccttcca aggccaggtc 301 accatctcag ccgacaagtc catcagcacc gcctacctgc agtggagcag cctgaaggcc 361 tcggacaccg ccatgtatta ctgtgcgaga cacacagtga gagaaaccag ccccgagccc 421 gtctaaaacc ctccacaccg caggtgcaga gtgagctgct agagactcac tccccagggg 481 cctctctatt catctgggga ggaaacactg gctgtttgtg tcctcaggag caagaaccag 541 agaacaatgt gggagggttc ccagccccta aggcaactgt ataggggacc tgaccatggg 601 aggtggattc tctgacgggg ctcttgtgtg ttctacaagg ttgttcatgg tgtatattag 661 atggttaaca tcaaaaggct gcctaacagg cacctctcca atatgatagt attttaatta 721 gtgaaaattt tacacagttc atcattgctt gcttgccttc ctccctcctg tccgctctca 781 ctcactcctt cttttatttt ctacttaatt ttacaaaatc atttaacccc tttttgaact 841 attaataggt tatctttgtt tggtgattgt ttttctttta ataatatgta ctgaataatt 901 catctttgta ccaattcata agtattctgg tgtaataaag acttctttca aaaaaaaaaa 961 aaaaaaa // LOCUS HSIIC2 1866 bp RNA PRI 05-MAY-1993 DEFINITION H. sapiens mRNA for IIC2. ACCESSION Y00498 NID g297403 KEYWORDS IIC2 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1866) AUTHORS Kimura,S., Pastewka,J., Gelboin,H.V. and Gonzalez,F.J. TITLE cDNA and amino acid sequences of two members of the human P450IIC gene subfamily JOURNAL Nucleic Acids Res. 15 (23), 10053-10054 (1987) MEDLINE 88096500 FEATURES Location/Qualifiers source 1..1866 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="cDNA human liver" /tissue_type="liver" /clone="P450IIC1 & P450IIC2" gene 43..1515 /gene="IIC2" CDS 43..1515 /gene="IIC2" /codon_start=1 /db_xref="PID:g297404" /db_xref="SWISS-PROT:P10632" /translation="MEPFVVLVLCLSFMLLFSLWRQSCRRRKLPPGPTPLPIIGNMLQ IDVKDICKSFTNFSKVYGPVFTVYFGMNPIVVFHGYEAVKEALIDNGEEFSGRGNSPI SQRITKGLGIISSNGKRWKEIRRFSLTTLRNFGMGKRSIEDRVQEEAHCLVEELRKTK ASPCDPTFILGCAPCNVICSVVFQKRFDYKDQNFLTLMKRFNENFRILNSPWIQVCNN FPLLIDCFPGTHNKVLKNVALTRSYIREKVKEHQASLDVNNPRDFIDCFLIKMEQEKD NQKSEFNIENLVGTVADLFVAGTETTSTTLRYGLLLLLKHPEVTAKVQEEIDHVIGRH RSPCMQDRSHMPYTDAVVHEIQRYSDLVPTGVPHAVTTDTKFRNYLIPKGTTIMALLT SVLHDDKEFPNPNIFDPGHFLDKNGNFKKSDYFMPFSAGKRICAGEGLARMELFLFLT TILQNFNLKSVDDLKNLNTTAVTKGIVSLPPSYQICFIPV" BASE COUNT 547 a 416 c 368 g 535 t ORIGIN 1 agtgcaagct cacagctgtc ttaataagaa gagaaggctt caatggaacc ttttgtggtc 61 ctggtgctgt gtctctcttt tatgcttctc ttttcactct ggagacagag ctgtaggaga 121 aggaagctcc ctcctggccc cactcctctt cctattattg gaaatatgct acagatagat 181 gttaaggaca tctgcaaatc tttcaccaat ttctcaaaag tctatggtcc tgtgttcacc 241 gtgtattttg gcatgaatcc catagtggtg tttcatggat atgaggcagt gaaggaagcc 301 ctgattgata atggagagga gttttctgga agaggcaatt ccccaatatc tcaaagaatt 361 actaaaggac ttggaatcat ttccagcaat ggaaagagat ggaaggagat ccggcgtttc 421 tccctcacaa ccttgcggaa ttttgggatg gggaagagga gcattgagga ccgtgttcaa 481 gaggaagctc actgccttgt ggaggagttg agaaaaacca aggcttcacc ctgtgatccc 541 actttcatcc tgggctgtgc tccctgcaat gtgatctgct ccgttgtttt ccagaaacga 601 tttgattata aagatcagaa ttttctcacc ctgatgaaaa gattcaatga aaacttcagg 661 attctgaact ccccatggat ccaggtctgc aataatttcc ctctactcat tgattgtttc 721 ccaggaactc acaacaaagt gcttaaaaat gttgctctta cacgaagtta cattagggag 781 aaagtaaaag aacaccaagc atcactggat gttaacaatc ctcgggactt tatcgattgc 841 ttcctgatca aaatggagca ggaaaaggac aaccaaaagt cagaattcaa tattgaaaac 901 ttggttggca ctgtagctga tctatttgtt gctggaacag agacaacaag caccactctg 961 agatatggac tcctgctcct gctgaagcac ccagaggtca cagctaaagt ccaggaagag 1021 attgatcatg taattggcag acacaggagc ccctgcatgc aggataggag ccacatgcct 1081 tacactgatg ctgtagtgca cgagatccag agatacagtg accttgtccc caccggtgtg 1141 ccccatgcag tgaccactga tactaagttc agaaactacc tcatccccaa gggcacaacc 1201 ataatggcat tactgacttc cgtgctacat gatgacaaag aatttcctaa tccaaatatc 1261 tttgaccctg gccactttct agataagaat ggcaacttta agaaaagtga ctacttcatg 1321 cctttctcag caggaaaacg aatttgtgca ggagaaggac ttgcccgcat ggagctattt 1381 ttatttctaa ccacaatttt acagaacttt aacctgaaat ctgttgatga tttaaagaac 1441 ctcaatacta ctgcagttac caaagggatt gtttctctgc caccctcata ccagatctgc 1501 ttcatccctg tctgaagaat gctagcccat ctggctgctg atctgctatc acctgcaact 1561 ctttttttat caaggacatt cccactatta tgtcttctct gacctctcat caaatcttcc 1621 cattcactca atatcccata agcatccaaa ctccattaag gagagttgtt caggtcactg 1681 cacaaatata tctgcaatta ttcatactct gtaacacttg tattaattgc tgcatatgct 1741 aatacttttc taatgctgac tttttaatat gttatcactg taaaacacag aaaagtgatt 1801 aatgaatgat aatttagatc catttctttt gtgaatgtgc taaataaaaa gtgttattaa 1861 ttgcta // LOCUS HSIKBL 1459 bp RNA PRI 31-MAY-1994 DEFINITION H.sapiens IKBL mRNA. ACCESSION X77909 NID g496329 KEYWORDS IKBL gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1459) AUTHORS Albertella,M.R. and Campbell,R.D. TITLE Characterization of a novel gene in the human major histocompatibility complex that encodes a potential new member of the I kappa B family of proteins JOURNAL Hum. Mol. Genet. 3 (5), 793-799 (1994) MEDLINE 94362679 REFERENCE 2 (bases 1 to 1459) AUTHORS Campbell,R.D. TITLE Direct Submission JOURNAL Submitted (24-FEB-1994) R.D. Campbell, MRC Immunochemistry Unit, University of Oxford, Dept of Biochemistry, South Parks Road, Oxford OX1 3QU, UK FEATURES Location/Qualifiers source 1..1459 /organism="Homo sapiens" /note="MHC class III region" /db_xref="taxon:9606" /clone="p3U1" /chromosome="6" /map="21.3" gene 69..1459 /gene="IKBL" CDS 69..1214 /gene="IKBL" /codon_start=1 /db_xref="PID:g496330" /translation="MSNPSPQVPEEEASTSVCRPKSSMASTSRRQRRERRFRRYLSAG RLVRAQALLQRHPGLDVDAGQPPPLHRACARHDAPALCLLLRLGADPAHQDRHGDTAL HAAARQGPDAYTDFFLPLLSRCPSAMGIKNKDGETPGQILGWGPPWDSAEEEEEDDAS KEREWRQKLQGELEDEWQEVMGRFEGDASHETQEPESFSAWSDRLAREHAQKCQQQQR EAEGSCRPPRAEGSSQSWRHEEEEQRLFRERARAKEEELRESRARRAQEALGDREPKP TRAGPREEHPRGAGRGSLWRFGDVPWPCPGGGDPEAMAAALVARGPPLEEQGALRRYL RVQQVRWHPDRFLQRFRSQIETWELGRVMGAVTALSQALNRHAEALK" polyA_signal 1409..1414 /gene="IKBL" polyA_site 1432..1459 /gene="IKBL" BASE COUNT 321 a 421 c 492 g 225 t ORIGIN 1 ccgagcttct taaacacagg ccttgggcta cggctctggg ggtacttggg ggggcggggg 61 caggtctgat gagtaacccc tccccccagg ttccagagga agaagcctcc acatctgtct 121 gccggcccaa gagttccatg gcctccactt cccgccgcca acgccgagaa cgtcgctttc 181 gtcgttactt gtctgcagga cggctggtcc gggcccaggc cctcctccag cgacacccag 241 gcctcgatgt agatgctggg cagcccccac cactgcaccg ggcctgtgcc cgccacgatg 301 cccctgccct gtgcctgctg cttcggctcg gggctgaccc tgcccaccag gaccgccatg 361 gggacacggc actgcatgct gctgcccgcc agggcccaga tgcctacacc gatttcttcc 421 tcccgctgct aagccgctgt ccctctgcca tgggaataaa gaataaggat ggggagaccc 481 ctggccaaat tttgggctgg ggacccccct gggattctgc tgaagaggag gaagaagatg 541 atgcctccaa ggagcgggaa tggagacaga agctccaggg tgagctggag gacgagtggc 601 aggaagtcat ggggaggttt gaaggtgatg cctcccatga aacccaggaa cctgagtcct 661 tctcagcctg gtcagatcgc ctggcccggg aacatgccca gaagtgccag cagcagcagc 721 gagaagcaga gggatcctgt cgacccccac gtgctgaggg ctccagccag agctggcgac 781 acgaggagga ggagcagcgg ctcttcaggg agcgagcccg ggccaaggag gaagagctgc 841 gtgagagccg agccaggagg gcgcaggagg ctctagggga ccgagaaccc aagccaacca 901 gggccgggcc cagggaagag caccccagag gagcggggag gggcagcctc tggcgatttg 961 gtgatgtgcc ctggccctgc cctgggggag gggacccaga ggccatggct gcagccctgg 1021 tggccagggg cccccctttg gaggaacagg gggctctgag gaggtacttg agggtccagc 1081 aggtccgctg gcaccctgac cgcttcctgc agcgattccg aagccagatt gagacctggg 1141 agctgggccg tgtgatggga gcagtgacag ccctttctca ggccctgaat cgccatgcag 1201 aggccctcaa gtgaccctag ggaagaagca agaaacttcg gggctgcagc ctcaggatga 1261 ggcagaagga agggtaaggg aaaggatggg gaccacaagg aagagccagg tgctgctcag 1321 cagaggatat gggtgggagc gaaagttgta acaagtgggg gtggggggtg cgggccgcca 1381 ccactgctcc ttgactctgc cgtttcctaa taagacctgg ttccacatct caaaaaaaaa 1441 aaaaaaaaaa aaaaaaaaa // LOCUS HSIL12R 2100 bp mRNA PRI 07-SEP-1994 DEFINITION Human IL12 receptor component mRNA, complete cds. ACCESSION U03187 NID g507150 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 65 to 2050) AUTHORS Chua,A.O., Chizzonite,R., Desai,B.B., Truitt,T.P., Nunes,P., Minetti,L.J., Warrier,R.R., Presky,D.H., Levine,J.F., Gately,M.K. and Gubler,U. TITLE Expression cloning of a human IL-12 receptor component. A new member of the cytokine receptor superfamily with strong homology to gp130 JOURNAL J. Immunol. 153 (1), 128-136 (1994) MEDLINE 94267217 REFERENCE 2 (bases 1 to 2100) AUTHORS Gubler,U. TITLE Direct Submission JOURNAL Submitted (05-NOV-1993) Ueli Gubler, Hoffmann La Roche Inc., 340 Kingsland Street, Nutley, NJ 07110, USA FEATURES Location/Qualifiers source 1..2100 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="PHA activated lymphoblasts" CDS 65..2053 /codon_start=1 /product="IL12 receptor component" /db_xref="PID:g507151" /translation="MEPLVTWVVPLLFLFLLSRQGAACRTSECCFQDPPYPDADSGSA SGPRDLRCYRISSDRYECSWQYEGPTAGVSHFLRCCLSSGRCCYFAAGSATRLQFSDQ AGVSVLYTVTLWVESWARNQTEKSPEVTLQLYNSVKYEPPLGDIKVSKLAGQLRMEWE TPDNQVGAEVQFRHRTPSSPWKLGDCGPQDDDTESCLCPLEMNVAQEFQLRRRQLGSQ GSSWSKWSSPVCVPPENPPQPQVRFSVEQLGQDGRRRLTLKEQPTQLELPEGCQGLAP GTEVTYRLQLHMLSCPCKAKATRTLHLGKMPYLSGAAYNVAVISSNQFGPGLNQTWHI PADTHTEPVALNISVGTNGTTMYWPARAQSMTYCIEWQPVGQDGGLATCSLTAPQDPD PAGMATYSWSRESGAMGQEKCYYITIFASAHPEKLTLWSTVLSTYHFGGNASAAGTPH HVSVKNHSLDSVSVDWAPSLLSTCPGVLKEYVVRCRDEDSKQVSEHPVQPTETQVTLS GLRAGVAYTVQVRADTAWLRGVWSQPQRFSIEVQVSDWLIFFASLGSFLSILLVGVLG YLGLNRAARHLCPPLPTPCASSAIEFPGGKETWQWINPVDFQEEASLQEALVVEMSWD KGERTEPLEKTELPEGAPELALDTELSLEDGDRCKAKM" mat_peptide 137..2050 /product="IL12 receptor component" misc_feature 218..220 /note="conserved cytokine receptor Cys" misc_feature 248..256 /note="conserved cytokine receptor motif: CXW" misc_feature 302..304 /note="conserved cytokine receptor Cys" misc_feature 323..325 /note="conserved cytokine receptor Cys" misc_feature 425..433 /note="potential N-linked glycosylation site" misc_feature 728..742 /note="conserved cytokine receptor motif: WSXWS" misc_feature 1049..1057 /note="potential N-linked glycosylation site" misc_feature 1100..1108 /note="potential N-linked glycosylation site" misc_feature 1118..1126 /note="potential N-linked glycosylation site" misc_feature 1388..1396 /note="potential N-linked glycosylation site" misc_feature 1430..1438 /note="potential N-linked glycosylation site" misc_feature 1685..1777 /note="probable transmembrane region" BASE COUNT 411 a 623 c 658 g 408 t ORIGIN 1 ggtggctgaa cctcgcaggt ggcagagagg ctcccctggg gctgtggggc tctacgtgga 61 tccgatggag ccgctggtga cctgggtggt ccccctcctc ttcctcttcc tgctgtccag 121 gcagggcgct gcctgcagaa ccagtgagtg ctgttttcag gacccgccat atccggatgc 181 agactcaggc tcggcctcgg gccctaggga cctgagatgc tatcggatat ccagtgatcg 241 ttacgagtgc tcctggcagt atgagggtcc cacagctggg gtcagccact tcctgcggtg 301 ttgccttagc tccgggcgct gctgctactt cgccgccggc tcagccacca ggctgcagtt 361 ctccgaccag gctggggtgt ctgtgctgta cactgtcaca ctctgggtgg aatcctgggc 421 caggaaccag acagagaagt ctcctgaggt gaccctgcag ctctacaact cagttaaata 481 tgagcctcct ctgggagaca tcaaggtgtc caagttggcc gggcagctgc gtatggagtg 541 ggagaccccg gataaccagg ttggtgctga ggtgcagttc cggcaccgga cacccagcag 601 cccatggaag ttgggcgact gcggacctca ggatgatgat actgagtcct gcctctgccc 661 cctggagatg aatgtggccc aggaattcca gctccgacga cggcagctgg ggagccaagg 721 aagttcctgg agcaagtgga gcagccccgt gtgcgttccc cctgaaaacc ccccacagcc 781 tcaggtgaga ttctcggtgg agcagctggg ccaggatggg aggaggcggc tgaccctgaa 841 agagcagcca acccagctgg agcttccaga aggctgtcaa gggctggcgc ctggcacgga 901 ggtcacttac cgactacagc tccacatgct gtcctgcccg tgtaaggcca aggccaccag 961 gaccctgcac ctggggaaga tgccctatct ctcgggtgct gcctacaacg tggctgtcat 1021 ctcctcgaac caatttggtc ctggcctgaa ccagacgtgg cacattcctg ccgacaccca 1081 cacagaacca gtggctctga atatcagcgt cggaaccaac gggaccacca tgtattggcc 1141 agcccgggct cagagcatga cgtattgcat tgaatggcag cctgtgggcc aggacggggg 1201 ccttgccacc tgcagcctga ctgcgccgca agacccggat ccggctggaa tggcaaccta 1261 cagctggagt cgagagtctg gggcaatggg gcaggaaaag tgttactaca ttaccatctt 1321 tgcctctgcg caccccgaga agctcacctt gtggtctacg gtcctgtcca cctaccactt 1381 tgggggcaat gcctcagcag ctgggacacc gcaccacgtc tcggtgaaga atcatagctt 1441 ggactctgtg tctgtggact gggcaccatc cctgctgagc acctgtcccg gcgtcctaaa 1501 ggagtatgtt gtccgctgcc gagatgaaga cagcaaacag gtgtcagagc atcccgtgca 1561 gcccacagag acccaagtta ccctcagtgg cctgcgggct ggtgtagcct acacggtgca 1621 ggtgcgagca gacacagcgt ggctgagggg tgtctggagc cagccccagc gcttcagcat 1681 cgaagtgcag gtttctgatt ggctcatctt cttcgcctcc ctggggagct tcctgagcat 1741 ccttctcgtg ggcgtccttg gctaccttgg cctgaacagg gccgcacggc acctgtgccc 1801 gccgctgccc acaccctgtg ccagctccgc cattgagttc cctggaggga aggagacttg 1861 gcagtggatc aacccagtgg acttccagga agaggcatcc ctgcaggagg ccctggtggt 1921 agagatgtcc tgggacaaag gcgagaggac tgagcctctc gagaagacag agctacctga 1981 gggtgcccct gagctggccc tggatacaga gttgtccttg gaggatggag acaggtgcaa 2041 ggccaagatg tgatcgttga ggctcagaga gggtgagtga ctcgcccgag gctacgtagc // LOCUS HSIL13 1376 bp RNA PRI 05-JUN-1997 DEFINITION H.sapiens mRNA for IL-13 receptor. ACCESSION Y08768 NID g1877211 KEYWORDS IL-13 receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1376) AUTHORS Guo,J., Apiou,F., Mellerin,M.P., Lebeau,B., Jacques,Y. and Minvielle,S. TITLE Chromosome mapping and expression of the human interleukin-13 receptor JOURNAL Genomics 42 (1), 141-145 (1997) MEDLINE 97321053 REFERENCE 2 (bases 1 to 1376) AUTHORS Minvielle,S. TITLE Direct Submission JOURNAL Submitted (10-OCT-1996) S. Minvielle, Inserm U 211 Institut de Biologie, 9 Quai De Moncousu, 44035 Nantes, FRANCE FEATURES Location/Qualifiers source 1..1376 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="infant" /tissue_type="brain" CDS 126..1268 /codon_start=1 /product="IL-13 receptor" /db_xref="PID:e291909" /db_xref="PID:g1877212" /translation="MAFVCLAIGCLYTFLISTTFGCTSSSDTEIKVNPPQDFEIVDPG YLGYLYLQWQPPLSLDHFKECTVEYELKYRNIGSETWKTIITKNLHYKDGFDLNKGIE AKIHTLLPWQCTNGSEVQSSWAETTYWISPQGIPETKVQDMDCVYYNWQYLLCSWKPG IGVLLDTNYNLFYWYEGLDHALQCVDYIKADGQNIGCRFPYLEASDYKDFYICVNGSS ENKPIRSSYFTFQLQNIVKPLPPVYLTFTRESSCEIKLKWSIPLGPIPARCFDYEIEI REDDTTLVTATVENETYTLKTTNETRQLCFVVRSKVNIYCSDDGIWSEWSDKQCWEGE DLSKKTLLRFWLPFGFILILVIFVTGLLLRKPNTYPKMIPEFFCDT" BASE COUNT 432 a 247 c 284 g 413 t ORIGIN 1 gtaagaacac tctcgtgagt ctaacggtct tccggatgaa ggctatttga agtcgccata 61 acctggtcag aagtgtgcct gtcggcgggg agagaggcaa tatcaaggtt ttaaatctcg 121 gagaaatggc tttcgtttgc ttggctatcg gatgcttata tacctttctg ataagcacaa 181 catttggctg tacttcatct tcagacaccg agataaaagt taaccctcct caggattttg 241 agatagtgga tcccggatac ttaggttatc tctatttgca atggcaaccc ccactgtctc 301 tggatcattt taaggaatgc acagtggaat atgaactaaa ataccgaaac attggtagtg 361 aaacatggaa gaccatcatt actaagaatc tacattacaa agatgggttt gatcttaaca 421 agggcattga agcgaagata cacacgcttt taccatggca atgcacaaat ggatcagaag 481 ttcaaagttc ctgggcagaa actacttatt ggatatcacc acaaggaatt ccagaaacta 541 aagttcagga tatggattgc gtatattaca attggcaata tttactctgt tcttggaaac 601 ctggcatagg tgtacttctt gataccaatt acaacttgtt ttactggtat gagggcttgg 661 atcatgcatt acagtgtgtt gattacatca aggctgatgg acaaaatata ggatgcagat 721 ttccctattt ggaggcatca gactataaag atttctatat ttgtgttaat ggatcatcag 781 agaacaagcc tatcagatcc agttatttca cttttcagct tcaaaatata gttaaacctt 841 tgccgccagt ctatcttact tttactcggg agagttcatg tgaaattaag ctgaaatgga 901 gcataccttt gggacctatt ccagcaaggt gttttgatta tgaaattgag atcagagaag 961 atgatactac cttggtgact gctacagttg aaaatgaaac atacaccttg aaaacaacaa 1021 atgaaacccg acaattatgc tttgtagtaa gaagcaaagt gaatatttat tgctcagatg 1081 acggaatttg gagtgagtgg agtgataaac aatgctggga aggtgaagac ctatcgaaga 1141 aaactttgct acgtttctgg ctaccatttg gtttcatctt aatattagtt atatttgtaa 1201 ccggtctgct tttgcgtaag ccaaacacct acccaaaaat gattccagaa tttttctgtg 1261 atacatgaag actttccata tcaagagaca tggtattgac tcaacagttt ccagtcatgg 1321 ccaaatgttc aatatgagtc tcaataaact gaatttttct tgcgaatgtt gaaaaa // LOCUS HSIL13RA 4039 bp RNA PRI 22-JAN-1997 DEFINITION H.sapiens IL-13Ra mRNA. ACCESSION Y10659 NID g1806035 KEYWORDS IL13Ra gene; interleukin-13. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4039) AUTHORS Gauchat,J.F.M., Schlagenhauf,E., Feng,N.P., Moser,R., Yamage,M., Jeannin,P., Alouani,S., Elson,G., Notarangelo,L.D., Wells,T., Eugster,H.P. and Bonnefoy,J.Y. TITLE A novel 4 kb IL-13Ra mRNA expressed in human B, T and endothelial cells, encoding for an alternate type two IL-4/IL-13R JOURNAL Unpublished REFERENCE 2 (bases 1 to 4039) AUTHORS Gauchat,J.F.M. TITLE Direct Submission JOURNAL Submitted (20-JAN-1997) J-F.M. Gauchat, Geneva Biomedical Research Institute, Immunology, Glaxo Research And Development, 14 Ch Des Aulx, Plan-Les-Ouates, CH1228, SWITZERLAND FEATURES Location/Qualifiers source 1..4039 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="B cell" /clone_lib="Lambda gt10" /clone="3.1" /dev_stage="adult" gene 44..1327 /gene="IL-13Ra" CDS 44..1327 /gene="IL-13Ra" /codon_start=1 /db_xref="PID:e293160" /db_xref="PID:g1806036" /translation="MEWPARLCGLWALLLCAGGGGGGGGAAPTETQPPVTNLSVSVEN LCTVIWTWNPPEGASSNCSLWYFSHFGDKQDKKIAPETRRSIEVPLNERICLQVGSQC STNESEKPSILVEKCISPPEGDPESAVTELQCIWHNLSYMKCSWLPGRNTSPDTNYTL YYWHRSLEKIHQCENIFREGQYFGCSFDLTKVKDSSFEQHSVQIMVKDNAGKIKPSFN IVPLTSRVKPDPPHIKNLSFHNDDLYVQWENPQNFISRCLFYEVEVNNSQTETHNVFY VQEAKCENPEFERNVENTSCFMVPGVLPDTLNTVRIRVKTNKLCYEDDKLWSNWSQEM SIGKKRNSTLYITMLLIVPVIVAGAIIVLLLYLKRLKIIIFPPIPDPGKIFKEMFGDQ NDDTLHWKKYDIYEKQTKEETDSVVLIENLKKASQ" BASE COUNT 1135 a 839 c 896 g 1169 t ORIGIN 1 tgccaaggct ccagcccggc cgggctccga ggcgagaggc tgcatggagt ggccggcgcg 61 gctctgcggg ctgtgggcgc tgctgctctg cgccggcggc gggggcgggg gcgggggcgc 121 cgcgcctacg gaaactcagc cacctgtgac aaatttgagt gtctctgttg aaaacctctg 181 cacagtaata tggacatgga atccacccga gggagccagc tcaaattgta gtctatggta 241 ttttagtcat tttggcgaca aacaagataa gaaaatagct ccggaaactc gtcgttcaat 301 agaagtaccc ctgaatgaga ggatttgtct gcaagtgggg tcccagtgta gcaccaatga 361 gagtgagaag cctagcattt tggttgaaaa atgcatctca cccccagaag gtgatcctga 421 gtctgctgtg actgagcttc aatgcatttg gcacaacctg agctacatga agtgttcttg 481 gctccctgga aggaatacca gtcccgacac taactatact ctctactatt ggcacagaag 541 cctggaaaaa attcatcaat gtgaaaacat ctttagagaa ggccaatact ttggttgttc 601 ctttgatctg accaaagtga aggattccag ttttgaacaa cacagtgtcc aaataatggt 661 caaggataat gcaggaaaaa ttaaaccatc cttcaatata gtgcctttaa cttcccgtgt 721 gaaacctgat cctccacata ttaaaaacct ctccttccac aatgatgacc tatatgtgca 781 atgggagaat ccacagaatt ttattagcag atgcctattt tatgaagtag aagtcaataa 841 cagccaaact gagacacata atgttttcta cgtccaagag gctaaatgtg agaatccaga 901 atttgagaga aatgtggaga atacatcttg tttcatggtc cctggtgttc ttcctgatac 961 tttgaacaca gtcagaataa gagtcaaaac aaataagtta tgctatgagg atgacaaact 1021 ctggagtaat tggagccaag aaatgagtat aggtaagaag cgcaattcca cactctacat 1081 aaccatgtta ctcattgttc cagtcatcgt cgcaggtgca atcatagtac tcctgcttta 1141 cctaaaaagg ctcaagatta ttatattccc tccaattcct gatcctggca agatttttaa 1201 agaaatgttt ggagaccaga atgatgatac tctgcactgg aagaagtacg acatctatga 1261 gaagcaaacc aaggaggaaa ccgactctgt agtgctgata gaaaacctga agaaagcctc 1321 tcagtgatgg agataattta tttttacctt cactgtgacc ttgagaagat tcttcccatt 1381 ctccatttgt tatctgggaa cttattaaat ggaaactgaa actactgcac catttaaaaa 1441 caggcagctc ataagagcca caggtcttta tgttgagtcg cgcaccgaaa aactaaaaat 1501 aatgggcgct ttggagaaga gtgtggagtc attctcattg aattataaaa gccagcaggc 1561 ttcaaactag gggacaaagc aaaaagtgat gatagtggtg gagttaatct tatcaagagt 1621 tgtgacaact tcctgaggga tctatacttg ctttgtgttc tttgtgtcaa catgaacaaa 1681 ttttatttgt aggggaactc atttggggtg caaatgctaa tgtcaaactt gagtcacaaa 1741 gaacatgtag aaaacaaaat ggataaaatc tgatatgtat tgtttgggat cctattgaac 1801 catgtttgtg gctattaaaa ctcttttaac agtctgggct gggtccggtg gctcacgcct 1861 gtaatcccag caatttggga gtccgaggcg ggcggatcac tcgaggtcag gagttccaga 1921 ccagcctgac caaaatggtg aaacctcctc tctactaaaa ctacaaaaat taactgggtg 1981 tggtggcgcg tgcctgtaat cccagctact cgggaagctg aggcaggtga attgtttgaa 2041 cctgggaggt ggaggttgca gtgagcagag atcacaccac tgcactctag cctgggtgac 2101 agagcaagac tctgtctaaa aaacaaaaca aaacaaaaca aaacaaaaaa acctcttaat 2161 attctggagt catcattccc ttcgacagca ttttcctctg ctttgaaagc cccagaaatc 2221 agtgttggcc atgatgacaa ctacagaaaa accagaggca gcttctttgc caagaccttt 2281 caaagccatt ttaggctgtt aggggcagtg gaggtagaat gactccttgg gtattagagt 2341 ttcaaccatg aagtctctaa caatgtattt tcttcacctc tgctactcaa gtagcattta 2401 ctgtgtcttt ggtttgtgct aggcccccgg gtgtgaagca cagacccctt ccaggggttt 2461 acagtctatt tgagactcct cagttcttgc cacttttttt tttaatctcc accagtcatt 2521 tttcagacct tttaactcct caattccaac actgatttcc ccttttgcat tctccctcct 2581 tcccttcctt gtagcctttt gactttcatt ggaaattagg atgtaaatct gctcaggaga 2641 cctggaggag cagaggataa ttagcatctc aggttaagtg tgagtaatct gagaaacaat 2701 gactaattct tgcatatttt gtaacttcca tgtgagggtt ttcagcattg atatttgtgc 2761 attttctaaa cagagatgag gtggtatctt cacgtagaac attggtattc gcttgagaaa 2821 aaaagaatag ttgaacctat ttctctttct ttacaagatg ggtccaggat tcctcttttc 2881 tctgccataa atgattaatt aaatagcttt tgtgtcttac attggtagcc agccagccaa 2941 ggctctgttt atgcttttgg ggggcatata ttgggttcca ttctcaccta tccacacaac 3001 atatccgtat atatcccctc tactcttact tcccccaaat ttaaagaagt atgggaaatg 3061 agaggcattt cccccacccc atttctctcc tcacacacag actcatatta ctggtaggaa 3121 cttgagaact ttatttccaa gttgttcaaa catttaccaa tcatattaat acaatgatgc 3181 tatttgcaat tcctgctcct aggggagggg agataagaaa ccctcactct ctacaggttt 3241 gggtacaagt ggcaacctgc ttccatggcc gtgtagaagc atggtgccct ggcttctctg 3301 aggaagctgg ggttcatgac aatggcagat gtaaagttat tcttgaagtc agattgaggc 3361 tgggagacag ccgtagtaga tgttctactt tgttctgctg ttctctagaa agaatatttg 3421 gttttcctgt ataggaatga gattaattcc tttccaggta ttttataatt ctgggaagca 3481 aaacccatgc ctccccctag ccatttttac tgttatccta tttagatggc catgaagagg 3541 atgctgtgaa attcccaaca aacattgatg ctgacagtca tgcagtctgg gagtggggaa 3601 gtgatctttt gttcccatcc tcttctttta gcagtaaaat agctgaggga aaagggaggg 3661 aaaaggaagt tatgggaata cctgtggtgg ttgtgatccc taggtcttgg gagctcttgg 3721 aggtgtctgt atcagtggat ttcccatccc ctgtgggaaa ttagtaggct catttactgt 3781 tttaggtcta gcctatgtgg attttttcct aacataccta agcaaaccca gtgtcaggat 3841 ggtaattctt attctttcgt tcagttaagt ttttcccttc atctgggcac tgaagggata 3901 tgtgaaacaa tgttaacatt tttggtagtc ttcaaccagg gattgtttct gtttaacttc 3961 ttataggaaa gcttgagtaa aataaatatt gtctttttgt atgtcaagcg ggccgccacc 4021 gcggtggaaa ctccagctt // LOCUS HSIL1BRNA 1358 bp RNA PRI 02-JUL-1992 DEFINITION H.sapiens mRNA for interleukin-1B converting enzyme. ACCESSION X65019 NID g33792 KEYWORDS interleukin-1B converting enzyme. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1358) AUTHORS Thornberry et,al. TITLE Interleukin-1B converting enzyme: a heterrodimeric cysteine protease required for processing of the IL-1B precursor in human blood monocytes JOURNAL Nature In press REFERENCE 2 (bases 1 to 1358) AUTHORS Tocci,M. TITLE Direct Submission JOURNAL Submitted (27-FEB-1992) M. Tocci, Merck, Sharp and Dohme Research Lab, 126 E. Lincoln Avenue, R80W-183, Rahway, NJ 07065, USA FEATURES Location/Qualifiers source 1..1358 /organism="Homo sapiens" /isolate="1 year old male" /db_xref="taxon:9606" /cell_line="THP.1 - acute monocytic leukemia" /clone_lib="lambda gt10" CDS 8..1222 /note="subunit structure p20 1 p10 1" /codon_start=1 /product="interleukin-1B converting enzyme" /db_xref="PID:g33793" /db_xref="SWISS-PROT:P29466" /translation="MADKVLKEKRKLFIRSMGEGTINGLLDELLQTRVLNKEEMEKVK RENATVMDKTRALIDSVIPKGAQACQICITYICEEDSYLAGTLGLSADQTSGNYLNMQ DSQGVLSSFPAPQAVQDNPAMPTSSGSEGNVKLCSLEEAQRIWKQKSAEIYPIMDKSS RTRLALIICNEEFDSIPRRTGAEVDITGMTMLLQNLGYSVDVKKNLTASDMTTELEAF AHRPEHKTSDSTFLVFMSHGIREGICGKKHSEQVPDILQLNAIFNMLNTKNCPSLKDK PKVIIIQACRGDSPGVVWFKDSVGVSGNLSLPTTEEFEDDAIKKAHIEKDFIAFCSST PDNVSWRHPTMGSVFIGRLIEHMQEYACSCDVEEIFRKVRFSFEQPDGRAQMPTTERV TLTRCFYLFPGH" BASE COUNT 435 a 270 c 314 g 339 t ORIGIN 1 aaaagccatg gccgacaagg tcctgaagga gaagagaaag ctgtttatcc gttccatggg 61 tgaaggtaca ataaatggct tactggatga attattacag acaagggtgc tgaacaagga 121 agagatggag aaagtaaaac gtgaaaatgc tacagttatg gataagaccc gagctttgat 181 tgactccgtt attccgaaag gggcacaggc atgccaaatt tgcatcacat acatttgtga 241 agaagacagt tacctggcag ggacgctggg actctcagca gatcaaacat ctggaaatta 301 ccttaatatg caagactctc aaggagtact ttcttccttt ccagctcctc aggcagtgca 361 ggacaaccca gctatgccca catcctcagg ctcagaaggg aatgtcaagc tttgctccct 421 agaagaagct caaaggatat ggaaacaaaa gtcggcagag atttatccaa taatggacaa 481 gtcaagccgc acacgtcttg ctctcattat ctgcaatgaa gaatttgaca gtattcctag 541 aagaactgga gctgaggttg acatcacagg catgacaatg ctgctacaaa atctggggta 601 cagcgtagat gtgaaaaaaa atctcactgc ttcggacatg actacagagc tggaggcatt 661 tgcacaccgc ccagagcaca agacctctga cagcacgttc ctggtgttca tgtctcatgg 721 tattcgggaa ggcatttgtg ggaagaaaca ctctgagcaa gtcccagata tactacaact 781 caatgcaatc tttaacatgt tgaataccaa gaactgccca agtttgaagg acaaaccgaa 841 ggtgatcatc atccaggcct gccgtggtga cagccctggt gtggtgtggt ttaaagattc 901 agtaggagtt tctggaaacc tatctttacc aactacagaa gagtttgagg atgatgctat 961 taagaaagcc cacatagaga aggattttat cgctttctgc tcttccacac cagataatgt 1021 ttcttggaga catcccacaa tgggctctgt ttttattgga agactcattg aacatatgca 1081 agaatatgcc tgttcctgtg atgtggagga aattttccgc aaggttcgat tttcatttga 1141 gcagccagat ggtagagcgc agatgcccac cactgaaaga gtgactttga caagatgttt 1201 ctacctcttc ccaggacatt aaaataagga aactgtatga atgtctgtgg gcagggtgaa 1261 gagatccttc tgtaaaggtt tttgaattat gtctgctgaa taataaactt ttttgaaata 1321 ataaatctgg tagaaaaatg aaaaaaaaaa aaaaaaaa // LOCUS HSIL1R2II 1308 bp RNA PRI 26-MAY-1992 DEFINITION H.sapiens IL-1R2 mRNA for type II interleukin-1 receptor, (cell line CB23). ACCESSION X59770 NID g33796 KEYWORDS il-1r2 gene; interleukin 1 receptor; interleukin 1 receptor antagonist. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1308) AUTHORS Sims,J.E. TITLE Direct Submission JOURNAL Submitted (01-AUG-1991) J.E. Sims, Immunex Res and Development Corp, 51 University Street, Seattle WA 98101, USA REFERENCE 2 (bases 1 to 1308) AUTHORS McMahan,C.J., Slack,J.L., Mosley,B., Cosman,D., Lupton,S.D., Brunton,L.L., Grubin,C.E., Wignall,J.M., Jenkins,N.A., Brannan,C.I., Copeland,N.G., Huebner,K., Croce,C.M., Cannizzarro,L.A., Benjamin,D., Dower,S.K., Spriggs,M.K. and Sims,J.E. TITLE A novel IL-1 receptor, cloned from B cells by mammalian expression, is expressed in many cell types JOURNAL EMBO J. 10 (10), 2821-2832 (1991) MEDLINE 92007725 COMMENT See also X59769. FEATURES Location/Qualifiers source 1..1308 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="B cell" /cell_line="CB233" /chromosome="2" /map="2Q12-2Q22" mRNA 1..1286 /gene="IL-1R2" /evidence=experimental gene 1..1286 /gene="IL-1R2" sig_peptide 62..100 /gene="IL-1R2" CDS 62..1258 /gene="IL-1R2" /codon_start=1 /product="type II interleukin-1 receptor" /db_xref="PID:g33797" /db_xref="SWISS-PROT:P27930" /translation="MLRLYVLVMGVSAFTLQPAAHTGAARSCRFRGRHYKREFRLEGE PVALRCPQVPYWLWASVSPRINLTWHKNDSARTVPGEEETRMWAQDGALWLLPALQED SGTYVCTTRNASYCDKMSIELRVFENTDAFLPFISYPQILTLSTSGVLVCPDLSEFTR DKTDVKIQWYKDSLLLDKDNEKFLSVRGTTHLLVHDVALEDAGYYRCVLTFAHEGQQY NITRSIELRIKKKKEETIPVIISPLKTISASLGSRLTIPCKVFLGTGTPLTTMLWWTA NDTHIESAYPGGRVTEGPRQEYSENNENYIEVPLIFDPVTREDLHMDFKCVVHNTLSF QTLRTTVKEASSTFSWGIVLAPLSLAFLVLGGIWMHRRCKHRTGKADGLTVLWPHHQD FQSYPK" mat_peptide 101..1255 /gene="IL-1R2" /product="type II interleukin-1 receptor" polyA_signal 1259..1263 /gene="IL-1R2" BASE COUNT 348 a 325 c 322 g 313 t ORIGIN 1 gccacgtgct gctgggtctc agtcctccac ttcccgtgtc ctctggaagt tgtcaggagc 61 aatgttgcgc ttgtacgtgt tggtaatggg agtttctgcc ttcacccttc agcctgcggc 121 acacacaggg gctgccagaa gctgccggtt tcgtgggagg cattacaagc gggagttcag 181 gctggaaggg gagcctgtag ccctgaggtg cccccaggtg ccctactggt tgtgggcctc 241 tgtcagcccc cgcatcaacc tgacatggca taaaaatgac tctgctagga cggtcccagg 301 agaagaagag acacggatgt gggcccagga cggtgctctg tggcttctgc cagccttgca 361 ggaggactct ggcacctacg tctgcactac tagaaatgct tcttactgtg acaaaatgtc 421 cattgagctc agagtttttg agaatacaga tgctttcctg ccgttcatct catacccgca 481 aattttaacc ttgtcaacct ctggggtatt agtatgccct gacctgagtg aattcacccg 541 tgacaaaact gacgtgaaga ttcaatggta caaggattct cttcttttgg ataaagacaa 601 tgagaaattt ctaagtgtga gggggaccac tcacttactc gtacacgatg tggccctgga 661 agatgctggc tattaccgct gtgtcctgac atttgcccat gaaggccagc aatacaacat 721 cactaggagt attgagctac gcatcaagaa aaaaaaagaa gagaccattc ctgtgatcat 781 ttcccccctc aagaccatat cagcttctct ggggtcaaga ctgacaatcc cgtgtaaggt 841 gtttctggga accggcacac ccttaaccac catgctgtgg tggacggcca atgacaccca 901 catagagagc gcctacccgg gaggccgcgt gaccgagggg ccacgccagg aatattcaga 961 aaataatgag aactacattg aagtgccatt gatttttgat cctgtcacaa gagaggattt 1021 gcacatggat tttaaatgtg ttgtccataa taccctgagt tttcagacac tacgcaccac 1081 agtcaaggaa gcctcctcca cgttctcctg gggcattgtg ctggccccac tttcactggc 1141 cttcttggtt ttggggggaa tatggatgca cagacggtgc aaacacagaa ctggaaaagc 1201 agatggtctg actgtgctat ggcctcatca tcaagacttt caatcctatc ccaagtgaaa 1261 taaatggaat gaaataattc aaacacaaaa aaaaaaaaaa aaaaaaaa // LOCUS HSIL1RFT 2156 bp RNA PRI 22-MAR-1995 DEFINITION Human mRNA for interleukin-1 receptor (fibroblast type). ACCESSION X16896 NID g33800 KEYWORDS interleukin 1 receptor; interleukin receptor; receptor; signal transduction; transmembrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2156) AUTHORS Gubler,U. TITLE Direct Submission JOURNAL Submitted (12-OCT-1989) Gubler U., Dept. of Molecular Genetics, HoffmannLaRoche Inc., Nutley, NY 07110, USA REFERENCE 2 (bases 1 to 2156) AUTHORS Chua,A.O. and Gubler,U. TITLE Sequence of the cDNA for the human fibroblast type interleukin-1 receptor JOURNAL Nucleic Acids Res. 17 (23), 10114 (1989) MEDLINE 90098789 FEATURES Location/Qualifiers source 1..2156 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" misc_feature 249..251 /note="upstream stop codon" sig_peptide 309..359 /note="Interleukin-I receptor signal peptide (AA -17 to -1)" CDS 309..2018 /note="Interleukin-I receptor precursor (AA -17 to 552)" /codon_start=1 /db_xref="PID:g33801" /db_xref="SWISS-PROT:P14778" /translation="MKVLLRLICFIALLISSLEADKCKEREEKIILVSSANEIDVRPC PLNPNEHKGTITWYKDDSKTPVSTEQASRIHQHKEKLWFVPAKVEDSGHYYCVVRNSS YCLRIKISAKFVENEPNLCYNAQAIFKQKLPVAGDGGLVCPYMEFFKNENNELPKLQW YKDCKPLLLDNIHFSGVKDRLIVMNVAEKHRGNYTCHASYTYLGKQYPITRVIEFITL EENKPTRPVIVSPANETMEVDLGSQIQLICNVTGQLSDIAYWKWNGSVIDEDDPVLGE DYYSVENPANKRRSTLITVLNISEIESRFYKHPFTCFAKNTHGIDAAYIQLIYPVTNF QKHMIGICVTLTVIIVCSVFIYKIFKIDIVLWYRDSCYDFLPIKASDGKTYDAYILYP KTVGEGSTSDCDIFVFKVLPEVLEKQCGYKLFIYGRDDYVGEDIVEVINENVKKSRRL IIILVRETSGFSWLGGSSEEQIAMYNALVQDGIKVVLLELEKIQDYEKMPESIKFIKQ KHGAIRWSGDFTQGPQSAKTRFWKNVRYHMPVQRRSPSSKHQLLSPATKEKLQREAHV PLG" mat_peptide 360..2015 /note="mature Interleukin-I receptor (AA 1 to 552)" BASE COUNT 633 a 451 c 517 g 555 t ORIGIN 1 gccggagccg actcggagcg cgcggcgcgg ccgggaggag ccgagcgcgc cgggcgcggc 61 gtgggggcgc cggctgcccc gcgcgcccag ggagcggcag gaatgtgaca atcgcgcgcc 121 cgcaccgtag cactcctcgc tcggctccta gggctctcgc cctctgagct gagccgggtt 181 ccgcccgggc tgggatccca tcaccctcca cggccgtccg tccaggtaga cgcaccctct 241 gaagatggtg actccctcct gagaagctgg accccttggt aaaagacaag gccttctcca 301 agaagaatat gaaagtgtta ctcagactta tttgtttcat agctctactg atttcttctc 361 tggaggctga taaatgcaag gaacgtgaag aaaaaataat tttagtgtca tctgcaaatg 421 aaattgatgt tcgtccctgt cctcttaacc caaatgaaca caaaggcact ataacttggt 481 ataaagatga cagcaagaca cctgtatcta cagaacaagc ctccaggatt catcaacaca 541 aagagaaact ttggtttgtt cctgctaagg tggaggattc aggacattac tattgcgtgg 601 taagaaattc atcttactgc ctcagaatta aaataagtgc aaaatttgtg gagaatgagc 661 ctaacttatg ttataatgca caagccatat ttaagcagaa actacccgtt gcaggagacg 721 gaggacttgt gtgcccttat atggagtttt ttaaaaatga aaataatgag ttacctaaat 781 tacagtggta taaggattgc aaacctctac ttcttgacaa tatacacttt agtggagtca 841 aagataggct catcgtgatg aatgtggctg aaaagcatag agggaactat acttgtcatg 901 catcctacac atacttgggc aagcaatatc ctattacccg ggtaatagaa tttattactc 961 tagaggaaaa caaacccaca aggcctgtga ttgtgagccc agctaatgag acaatggaag 1021 tagacttggg atcccagata caattgatct gtaatgtcac cggccagttg agtgacattg 1081 cttactggaa gtggaatggg tcagtaattg atgaagatga cccagtgcta ggggaagact 1141 attacagtgt ggaaaatcct gcaaacaaaa gaaggagtac cctcatcaca gtgcttaata 1201 tatcggaaat tgaaagtaga ttttataaac atccatttac ctgttttgcc aagaatacac 1261 atggtataga tgcagcatat atccagttaa tatatccagt cactaatttc cagaagcaca 1321 tgattggtat atgtgtcacg ttgacagtca taattgtgtg ttctgttttc atctataaaa 1381 tcttcaagat tgacattgtg ctttggtaca gggattcctg ctatgatttt ctcccaataa 1441 aagcttcaga tggaaagacc tatgacgcat atatactgta tccaaagact gttggggaag 1501 ggtctacctc tgactgtgat atttttgtgt ttaaagtctt gcctgaggtc ttggaaaaac 1561 agtgtggata taagctgttc atttatggaa gggatgacta cgttggggaa gacattgttg 1621 aggtcattaa tgaaaacgta aagaaaagca gaagactgat tatcatttta gtcagagaaa 1681 catcaggctt cagctggctg ggtggttcat ctgaagagca aatagccatg tataatgctc 1741 ttgttcagga tggaattaaa gttgtcctgc ttgagctgga gaaaatccaa gactatgaga 1801 aaatgccaga atcgattaaa ttcattaagc agaaacatgg ggctatccgc tggtcagggg 1861 actttacaca gggaccacag tctgcaaaga caaggttctg gaagaatgtc aggtaccaca 1921 tgccagtcca gcgacggtca ccttcatcta aacaccagtt actgtcacca gccactaagg 1981 agaaactgca aagagaggct cacgtgcctc tcgggtagca tggagaagtt gccaagagtt 2041 ctttaggtgc ctcctgtctt atggcgttgc aggccaggtt atgcctcatg ctgacttgca 2101 gagttcatgg aatgtaacta tatcatcctt tatccctgag gtcaccagga atcagg // LOCUS HSIL2REC 2335 bp RNA PRI 22-JUL-1993 DEFINITION Human mRNA for interleukin-2 receptor. ACCESSION X01057 X01058 X01402 NID g33812 KEYWORDS alternate splicing; interleukin receptor; signal peptide. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2335) AUTHORS Leonard,W.J., Depper,J.M., Crabtree,G.R., Rudikoff,S., Pumphrey,J., Robb,R.J., Kroenke,M., Svetlik,P.B., Peffer,N.J., Waldmann,T.A. and Greene,W.C. TITLE Molecular cloning and expression of cDNAs for the human interleukin-2 receptor JOURNAL Nature 311 (5987), 626-631 (1984) MEDLINE 85012733 REFERENCE 2 (bases 1 to 1309) AUTHORS Nikaido,T., Shimizu,A., Ishida,N., Sabe,H., Teshigawara,K., Maeda,M., Uchiyama,T., Yodoi,J. and Honjo,T. TITLE Molecular cloning of cDNA encoding human interleukin-2 receptor JOURNAL Nature 311 (5987), 631-635 (1984) MEDLINE 85012734 REFERENCE 3 (bases 1 to 862) AUTHORS Cosman,D., Cerretti,D.P., Larsen,A., Park,L., March,C., Dower,S., Gillis,S. and Urdal,D. TITLE Cloning, sequence and expression of human interleukin-2 receptor JOURNAL Nature 312 (5996), 768-771 (1984) MEDLINE 85086253 REFERENCE 4 (bases 1 to 2335) AUTHORS Behn-Krappa,A. and Doerfler,W. TITLE The state of DNA methylation in the promoter and exon 1 regions of the human gene for the interleukin-2 receptor alpha chain (IL-2R alpha) in various cell types JOURNAL Hum. Mol. Genet. 2 (7), 993-999 (1993) MEDLINE 93372865 FEATURES Location/Qualifiers source 1..2335 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA join(<1..497,764..>2335) /note="put. fragment of 3,3 Kb mRNA" sig_peptide 181..243 /note="signal peptide (aa -21 to -1)" CDS 181..999 /codon_start=1 /product="interleukin-2 receptor" /db_xref="PID:g33813" /db_xref="SWISS-PROT:P01589" /translation="MDSYLLMWGLLTFIMVPGCQAELCDDDPPEIPHATFKAMAYKEG TMLNCECKRGFRRIKSGSLYMLCTGNSSHSSWDNQCQCTSSATRNTTKQVTPQPEEQK ERKTTEMQSPMQPVDQASLPGHCREPPPWENEATERIYHFVVGQMVYYQCVQGYRALH RGPAESVCKMTHGKTRWTQPQLICTGEMETSQFPGEEKPQASPEGRPESETSCLVTTT DFQIQTEMAATMETSIFTTEYQVAVAGCVFLLISVLLLSGLTWQRRQRKSRRTI" mat_peptide 244..996 /product="interleukin-2 receptor" variation 895 /note="existing variant with T changed to C" polyA_signal 1523..1528 unsure 1604 /note="T maybe C" BASE COUNT 690 a 592 c 573 g 480 t ORIGIN 1 gaattccccc cccccccccc cgagagactg gatggaccca caagggtgac agcccaggcg 61 gaccgatctt cccatcccac atcctccggc gcgatgccaa aaagaggctg acggcaactg 121 ggccttctgc agagaaagac ctccgcttca ctgccccggc tggtcccaag ggtcaggaag 181 atggattcat acctgctgat gtggggactg ctcacgttca tcatggtgcc tggctgccag 241 gcagagctct gtgacgatga cccgccagag atcccacacg ccacattcaa agccatggcc 301 tacaaggaag gaaccatgtt gaactgtgaa tgcaagagag gtttccgcag aataaaaagc 361 gggtcactct atatgctctg tacaggaaac tctagccact cgtcctggga caaccaatgt 421 caatgcacaa gctctgccac tcggaacaca acgaaacaag tgacacctca acctgaagaa 481 cagaaagaaa ggaaaaccac agaaatgcaa agtccaatgc agccagtgga ccaagcgagc 541 cttccaggtc actgcaggga acctccacca tgggaaaatg aagccacaga gagaatttat 601 catttcgtgg tggggcagat ggtttattat cagtgcgtcc agggatacag ggctctacac 661 agaggtcctg ctgagagcgt ctgcaaaatg acccacggga agacaaggtg gacccagccc 721 cagctcatat gcacaggtga aatggagacc agtcagtttc caggtgaaga gaagcctcag 781 gcaagccccg aaggccgtcc tgagagtgag acttcctgcc tcgtcacaac aacagatttt 841 caaatacaga cagaaatggc tgcaaccatg gagacgtcca tatttacaac agagtaccag 901 gtagcagtgg ccggctgtgt tttcctgctg atcagcgtcc tcctcctgag tgggctcacc 961 tggcagcgga gacagaggaa gagtagaaga acaatctaga aaaccaaaag aacaagaatt 1021 tcttggtaag aagccgggaa cagacaacag aagtcatgaa gcccaagtga aatcaaaggt 1081 gctaaatggt cgcccaggag acatccgttg tgcttgcctg cgttttggaa gctctgaagt 1141 cacatcacag gacacggggc agtggcaacc ttgtctctat gccagctcag tcccatcaga 1201 gagcgagcgc tacccacttc taaatagcaa tttcgccgtt gaagaggaag ggcaaaacca 1261 ctagaactct ccatcttatt ttcatgtata tgtgttcatt aaagcatgaa tggtatggaa 1321 ctctctccac cctatatgta gtataaagaa aagtaggttt acattcatct cattccaact 1381 tcccagttca ggagtcccaa ggaaagcccc agcactaacg taaatacaca acacacacac 1441 tctaccctat acaactggac attgtctgcg tggttccttt ctcagccgct tctgactgct 1501 gattctcccg ttcacgttgc ctaataaaca tccttcaaga actctgggct gctacccaga 1561 aatcatttta cccttggctc aatcctctaa gctaaccccc ttctactgag ccttcagtct 1621 tgaatttcta aaaaacagag gccatggcag aataatcttt gggtaacttc aaaacggggc 1681 agccaaaccc atgaggcaat gtcaggaaca gaaggatgaa tgaggtccca ggcagagaat 1741 catacttagc aaagttttac ctgtgcgtta ctaattggcc tctttaagag ttagtttctt 1801 tgggattgct atgaatgata ccctgaattt ggcctgcact aatttgatgt ttacaggtgg 1861 acacacaagg tgcaaatcaa tgcgtacgtt tcctgagaag tgtctaaaaa caccaaaaag 1921 ggatccgtac attcaatgtt tatgcaagga aggaaagaaa gaaggaagtg aagagggaga 1981 agggatggag gtcacactgg tagaacgtaa ccacggaaaa gagcgcatca ggcctggcac 2041 ggtggctcag gcctataacc ccagctccct aggagaccaa ggcgggagca tctcttgagg 2101 ccaggagttt gagaccagcc tgggcagcat agcaagacac atccctacaa aaaattagaa 2161 attggctgga tgtggtggca tacgcctgta gtcctagcca ctcaggaggc tgaggcagga 2221 ggattgcttg agcccaggag ttcgaggctg cagtcagtca tgatggcacc actgcactcc 2281 agcctgggca acagagcaag atcctgtctt taaggaaaaa aagacaaggg aattc // LOCUS HSIL4R 3597 bp RNA PRI 26-MAY-1992 DEFINITION Human IL-4-R mRNA for the interleukin 4 receptor. ACCESSION X52425 NID g33833 KEYWORDS B cell growth factor; IL-4-R gene; interleukin; interleukin 4 receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3597) AUTHORS Idzerda,R.L., March,C.J., Mosley,B., Lyman,S.D., Bos,T.V., Gimpel,S.D., Din,W.S., Grabstein,K.H., Widmer,M.B., Park,L.S., Cosman,D. and Beckmann,M.P. TITLE Human interleukin 4 receptor confers biological responsiveness and defines a novel receptor superfamily JOURNAL J. Exp. Med. 171 (3), 861-873 (1990) MEDLINE 90171849 COMMENT Data kindly reviewed (11-MAR-1991) by Beckmann M.P. FEATURES Location/Qualifiers source 1..3597 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T cell, tissue=peripheral blood, clone=T22-8" mRNA <1..3597 /gene="IL-4-R gene" gene 1..3597 /gene="IL-4-R gene" CDS 176..2653 /gene="IL-4-R gene" /codon_start=1 /product="interleukin 4 receptor" /db_xref="PID:g33834" /db_xref="SWISS-PROT:P24394" /translation="MGWLCSGLLFPVSCLVLLQVASSGNMKVLQEPTCVSDYMSISTC EWKMNGPTNCSTELRLLYQLVFLLSEAHTCIPENNGGAGCVCHLLMDDVVSADNYTLD LWAGQQLLWKGSFKPSEHVKPRAPGNLTVHTNVSDTLLLTWSNPYPPDNYLYNHLTYA VNIWSENDPADFRIYNVTYLEPSLRIAASTLKSGISYRARVRAWAQCYNTTWSEWSPS TKWHNSYREPFEQHLLLGVSVSCIVILAVCLLCYVSITKIKKEWWDQIPNPARSRLVA IIIQDAQGSQWEKRSRGQEPAKCPHWKNCLTKLLPCFLEHNMKRDEDPHKAAKEMPFQ GSGKSAWCPVEISKTVLWPESISVVRCVELFEAPVECEEEEEVEEEKGSFCASPESSR DDFQEGREGIVARLTESLFLDLLGEENGGFCQQDMGESCLLPPSGSTSAHMPWDEFPS AGPKEAPPWGKEQPLHLEPSPPASPTQSPDNLTCTETPLVIAGNPAYRSFSNSLSQSP CPRELGPDPLLARHLEEVEPEMPCVPQLSEPTTVPQPEPETWEQILRRNVLQHGAAAA PVSAPTSGYQEFVHAVEQGGTQASAVVGLGPPGEAGYKAFSSLLASSAVSPEKCGFGA SSGEEGYKPFQDLIPGCPGDPAPVPVPLFTFGLDREPPRSPQSSHLPSSSPEHLGLEP GEKVEDMPKPPLPQEQATDPLVDSLGSGIVYSALTCHLCGHLKQCHGQEDGGQTPVMA SPCCGCCCGDRSSPPTTPLRAPDPSPGGVPLEASLCPASLAPSGISEKSKSSSSFHPA PGNAQSSSQTPKIVNFVSVGPTYMRVS" sig_peptide 176..250 /gene="IL-4-R gene" /product="interleukin 4 receptor" mat_peptide 251..2650 /gene="IL-4-R gene" /product="interleukin 4 receptor" misc_feature 872..943 /gene="IL-4-R gene" /note="putative transmembrane region" /product="interleukin 4 receptor" polyA_signal 3579..3584 /gene="IL-4-R gene" BASE COUNT 794 a 1034 c 1039 g 730 t ORIGIN 1 ggcgaatgga gcaggggcgc gcagataatt aaagatttac acacagctgg aagaaatcat 61 agagaagccg ggcgtggtgg ctcatgccta taatcccagc acttttggag gctgaggcgg 121 gcagatcact tgagatcagg agttcgagac cagcctggtg ccttggcatc tcccaatggg 181 gtggctttgc tctgggctcc tgttccctgt gagctgcctg gtcctgctgc aggtggcaag 241 ctctgggaac atgaaggtct tgcaggagcc cacctgcgtc tccgactaca tgagcatctc 301 tacttgcgag tggaagatga atggtcccac caattgcagc accgagctcc gcctgttgta 361 ccagctggtt tttctgctct ccgaagccca cacgtgtatc cctgagaaca acggaggcgc 421 ggggtgcgtg tgccacctgc tcatggatga cgtggtcagt gcggataact atacactgga 481 cctgtgggct gggcagcagc tgctgtggaa gggctccttc aagcccagcg agcatgtgaa 541 acccagggcc ccaggaaacc tgacagttca caccaatgtc tccgacactc tgctgctgac 601 ctggagcaac ccgtatcccc ctgacaatta cctgtataat catctcacct atgcagtcaa 661 catttggagt gaaaacgacc cggcagattt cagaatctat aacgtgacct acctagaacc 721 ctccctccgc atcgcagcca gcaccctgaa gtctgggatt tcctacaggg cacgggtgag 781 ggcctgggct cagtgctata acaccacctg gagtgagtgg agccccagca ccaagtggca 841 caactcctac agggagccct tcgagcagca cctcctgctg ggcgtcagcg tttcctgcat 901 tgtcatcctg gccgtctgcc tgttgtgcta tgtcagcatc accaagatta agaaagaatg 961 gtgggatcag attcccaacc cagcccgcag ccgcctcgtg gctataataa tccaggatgc 1021 tcaggggtca cagtgggaga agcggtcccg aggccaggaa ccagccaagt gcccacactg 1081 gaagaattgt cttaccaagc tcttgccctg ttttctggag cacaacatga aaagggatga 1141 agatcctcac aaggctgcca aagagatgcc tttccagggc tctggaaaat cagcatggtg 1201 cccagtggag atcagcaaga cagtcctctg gccagagagc atcagcgtgg tgcgatgtgt 1261 ggagttgttt gaggccccgg tggagtgtga ggaggaggag gaggtagagg aagaaaaagg 1321 gagcttctgt gcatcgcctg agagcagcag ggatgacttc caggagggaa gggagggcat 1381 tgtggcccgg ctaacagaga gcctgttcct ggacctgctc ggagaggaga atgggggctt 1441 ttgccagcag gacatggggg agtcatgcct tcttccacct tcgggaagta cgagtgctca 1501 catgccctgg gatgagttcc caagtgcagg gcccaaggag gcacctccct ggggcaagga 1561 gcagcctctc cacctggagc caagtcctcc tgccagcccg acccagagtc cagacaacct 1621 gacttgcaca gagacgcccc tcgtcatcgc aggcaaccct gcttaccgca gcttcagcaa 1681 ctccctgagc cagtcaccgt gtcccagaga gctgggtcca gacccactgc tggccagaca 1741 cctggaggaa gtagaacccg agatgccctg tgtcccccag ctctctgagc caaccactgt 1801 gccccaacct gagccagaaa cctgggagca gatcctccgc cgaaatgtcc tccagcatgg 1861 ggcagctgca gcccccgtct cggcccccac cagtggctat caggagtttg tacatgcggt 1921 ggagcagggt ggcacccagg ccagtgcggt ggtgggcttg ggtcccccag gagaggctgg 1981 ttacaaggcc ttctcaagcc tgcttgccag cagtgctgtg tccccagaga aatgtgggtt 2041 tggggctagc agtggggaag aggggtataa gcctttccaa gacctcattc ctggctgccc 2101 tggggaccct gccccagtcc ctgtcccctt gttcaccttt ggactggaca gggagccacc 2161 tcgcagtccg cagagctcac atctcccaag cagctcccca gagcacctgg gtctggagcc 2221 gggggaaaag gtagaggaca tgccaaagcc cccacttccc caggagcagg ccacagaccc 2281 ccttgtggac agcctgggca gtggcattgt ctactcagcc cttacctgcc acctgtgcgg 2341 ccacctgaaa cagtgtcatg gccaggagga tggtggccag acccctgtca tggccagtcc 2401 ttgctgtggc tgctgctgtg gagacaggtc ctcgccccct acaacccccc tgagggcccc 2461 agacccctct ccaggtgggg ttccactgga ggccagtctg tgtccggcct ccctggcacc 2521 ctcgggcatc tcagagaaga gtaaatcctc atcatccttc catcctgccc ctggcaatgc 2581 tcagagctca agccagaccc ccaaaatcgt gaactttgtc tccgtgggac ccacatacat 2641 gagggtctct taggtgcatg tcctcttgtt gctgagtctg cagatgagga ctagggctta 2701 tccatgcctg ggaaatgcca cctcctggaa ggcagccagg ctggcagatt tccaaaagac 2761 ttgaagaacc atggtatgaa ggtgattggc cccactgacg ttggcctaac actgggctgc 2821 agagactgga ccccgcccag cattgggctg ggctcgccac atcccatgag agtagagggc 2881 actgggtcgc cgtgccccac ggcaggcccc tgcaggaaaa ctgaggccct tgggcacctc 2941 gacttgtgaa cgagttgttg gctgctccct ccacagcttc tgcagcagac tgtccctgtt 3001 gtaactgccc aaggcatgtt ttgcccacca gatcatggcc cacgtggagg cccacctgcc 3061 tctgtctcac tgaactagaa gccgagccta gaaactaaca cagccatcaa gggaatgact 3121 tgggcggcct tgggaaatcg atgagaaatt gaacttcagg gagggtggtc attgcctaga 3181 ggtgctcatt catttaacag agcttcctta ggttgatgct ggaggcagaa tcccggctgt 3241 caaggggtgt tcagttaagg ggagcaacag aggacatgaa aaattgctat gactaaagca 3301 gggacaattt gctgccaaac acccatgccc agctgtatgg ctgggggctc ctcgtatgca 3361 tggaaccccc agaataaata tgctcagcca ccctgtgggc cgggcaatcc agacagcagg 3421 cataaggcac cagttaccct gcatgttggc ccagacctca ggtgctaggg aaggcgggaa 3481 ccttgggttg agtaatgctc gtctgtgtgt tttagtttca tcacctgtta tctgtgtttg 3541 ctgaggagag tggaacagaa ggggtggagt tttgtataaa taaagtttct ttgtctc // LOCUS HSIL5R2 2024 bp RNA PRI 26-MAY-1992 DEFINITION Human HSIL5R2 gene for interleukin-5 receptor type 2. ACCESSION X61177 NID g33839 KEYWORDS cytokine receptor; HSIL5R2 gene; interleukin 5 receptor; transmembrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2024) AUTHORS Murata,Y. TITLE Direct Submission JOURNAL Submitted (31-JUL-1991) Y. Murata, Dept of Biology, Inst for Medical Immunology, Kumamoto Univ Medical School, 2-2-1 Honjo, Kumamoto 860, JAPAN REFERENCE 2 (bases 1 to 2024) AUTHORS Murata,Y., Takaki,S., Migita,M., Kikuchi,Y., Tominaga,A. and Takatsu,K. TITLE Molecular cloning and expression of the human interleukin 5 receptor JOURNAL J. Exp. Med. 175 (2), 341-351 (1992) MEDLINE 92121815 COMMENT See also X61176-X61178. FEATURES Location/Qualifiers source 1..2024 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="peripheral blood" /cell_type="eosinophil" /clone_lib="cDNA" /clone="lambda h5R.27" mRNA 1..2024 /gene="HSIL5R2" /evidence=experimental gene 1..2024 /gene="HSIL5R2" sig_peptide 104..163 /gene="HSIL5R2" /product="interleukin-5 receptor type 2 precursor" CDS 104..1294 /gene="HSIL5R2" /codon_start=1 /product="interleukin-5 receptor type 2 precursor" /db_xref="PID:g33840" /translation="MIIVAHVLLILLGATEILQADLLPDEKISLLPPVNFTIKVTGLA QVLLQWKPNPDQEQRNVNLEYQVKINAPKEDDYETRITESKCVTILHKGFSASVRTIL QNDHSLLASSWASAELHAPPGSPGTSIVNLTCTTNTTEDNYSRLRSYQVSLHCTWLVG TDAPEDTQYFLYYRYGSWTEECQEYSKDTLGRNIACWFPRTFILSKGRDWLAVLVNGS SKHSAIRPFDQLFALHAIDQINPPLNVTAEIEGTRLSIQWEKPVSAFPIHCFDYEVKI HNTRNGYLQIEKLMTNAFISIIDDLSKYDVQVRAAVSSMCREAGLWSEWSQPIYVGND EHKPLREWFVIVIMATICFILLILSLICKICHLWIKLFPPIPAPKSNIKDLFVTTNYE KAGI" mat_peptide 164..1291 /gene="HSIL5R2" /product="interleukin-5 receptor type 2" BASE COUNT 597 a 428 c 439 g 560 t ORIGIN 1 tagatgctgg ggttgcagcc acgagcatag acacgacaga cacggtcctc gccatcttct 61 gttgagtact ggtcggaaca agaggatcgt ctgtagacag gatatgatca tcgtggcgca 121 tgtattactc atccttttgg gggccactga gatactgcaa gctgacttac ttcctgatga 181 aaagatttca cttctcccac ctgtcaattt caccattaaa gttactggtt tggctcaagt 241 tcttttacaa tggaaaccaa atcctgatca agagcaaagg aatgttaatc tagaatatca 301 agtgaaaata aacgctccaa aagaagatga ctatgaaacc agaatcactg aaagcaaatg 361 tgtaaccatc ctccacaaag gcttttcagc aagtgtgcgg accatcctgc agaacgacca 421 ctcactactg gccagcagct gggcttctgc tgaacttcat gccccaccag ggtctcctgg 481 aacctcaatt gtgaatttaa cttgcaccac aaacactaca gaagacaatt attcacgttt 541 aaggtcatac caagtttccc ttcactgcac ctggcttgtt ggcacagatg cccctgagga 601 cacgcagtat tttctctact ataggtatgg ctcttggact gaagaatgcc aagaatacag 661 caaagacaca ctggggagaa atatcgcatg ctggtttccc aggactttta tcctcagcaa 721 agggcgtgac tggcttgcgg tgcttgttaa cggctccagc aagcactctg ctatcaggcc 781 ctttgatcag ctgtttgccc ttcacgccat tgatcaaata aatcctccac tgaatgtcac 841 agcagagatt gaaggaactc gtctctctat ccaatgggag aaaccagtgt ctgcttttcc 901 aatccattgc tttgattatg aagtaaaaat acacaataca aggaatggat atttgcagat 961 agaaaaattg atgaccaatg cattcatctc aataattgat gatctttcta agtacgatgt 1021 tcaagtgaga gcagcagtga gctccatgtg cagagaggca gggctctgga gtgagtggag 1081 ccaacctatt tatgtgggaa atgatgaaca caagcccttg agagagtggt ttgtcattgt 1141 gattatggca accatctgct tcatcttgtt aattctctcg cttatctgta aaatatgtca 1201 tttatggatc aagttgtttc caccaattcc agcaccaaaa agtaatatca aagatctctt 1261 tgtaaccact aactatgaga aagctggaat ttaaattcaa gcatgtttta acttttggtt 1321 taaggtactt gggtgtacct ggcagtgttg taagctcttt acattaatta attaactctc 1381 taggtactgt tatcttcatt ttataaacaa ggcagctgaa gttgagagaa ataagtaacc 1441 tgtcctaggt cacacaatta ggaaatgaca gatctggcag tctatttcca ggcagtctat 1501 ttccacgagg tcatgagtgc gaaagaggga ctaggggaag aatgattaac tccagggagc 1561 tgacttttct agtgtgctta cctgttttgc atctctcaag gatgtgccat gaagctgtag 1621 ccaggtggaa ttgtaccaca gccctgacat gaacacctga tggcagctgc tgggttggag 1681 cctagacaaa aacatgaaga accatggctg ctgcctgagc ccatcgtgct gtaattatag 1741 aaaaccttct aagggaagaa tatgctgata tttttcagat aagtacccct tttataaaaa 1801 tcctccaagt tagccctcga ttttccatgt aaggaaacag aggctttgag ataatgtctg 1861 tctcctaagg gacaaagcca ggacttgatc ctgtcttaaa aatgcaaaat gtagtacttc 1921 ttccatcaaa ggtagacatg cactaaggga caggttttgg cttggtatca gaatacattt 1981 ttaaaagctg tgtaagaatt gaacgggctg tactaggggg tata // LOCUS HSILF 3043 bp RNA PRI 16-JAN-1995 DEFINITION Human mRNA for transcription factor ILF. ACCESSION X60787 NID g33853 KEYWORDS ilF gene; transcription factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3043) AUTHORS Li,C. TITLE Direct Submission JOURNAL Submitted (05-JUL-1991) C. Li, UCLA School of Medicine, Div. Hematology/Oncology Dept. Medicine, 11-934 Louis Factor, 10833 Le Conte Ave., Los Angeles CA 90024-1678, USA REFERENCE 2 (bases 1 to 3043) AUTHORS Li,C., Lai,C.F., Sigman,D.S. and Gaynor,R.B. TITLE Cloning of a cellular factor, interleukin binding factor, that binds to NFAT-like motifs in the human immunodeficiency virus long terminal repeat JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88 (17), 7739-7743 (1991) MEDLINE 91352065 FEATURES Location/Qualifiers source 1..3043 /organism="Homo sapiens" /isolate="ilF 101" /db_xref="taxon:9606" /clone_lib="HeLa and lymphoid lambda gt11" /map="17q25" mRNA 1..3043 /gene="ilF" /evidence=experimental gene 1..3043 /gene="ilF" CDS 518..2149 /gene="ilF" /codon_start=1 /product="transcription factor ILF" /db_xref="PID:g33854" /db_xref="SWISS-PROT:Q01167" /translation="MAQVFVDGVFQRRGAPPLQLPRVCTFRFPSTNIKITFTALSSEK REKQEASESPVKAVQPHISPLTINIPDTMAHLISPLPSPTGTISAANSCPSSPRGAGS SGYKVGRVMPSDLNLMADNSQPENEKEASGGDSPKDDSKPPYSYAQLIVQAITMAPDK QLTLNGIYTHITKNYPYYRTADKGWQNSIRHNLSLNRYFIKVPRSQEEPGKGSFWRID PASESKLIEQAFRKRRPRGVPCFRTPLGPLSSRSAPASPNHAGVLSAHSSGAQTPESL SREGSPAPLEPEPGAAQPKLAVIQEARFAQSAPGSPLSSQPVLITVQRQLPQAIKPVT YTVATPVTTSTSQPPVVQTVHVVHQIPAVSVTSVAGLAPANTYTVSGQAVVTPAAVLA PPKAEAQENGDHREVKVKVEPIPAIGHATLGTASRIIQTAQTTPVQTVTIVQQAPLGQ HQLPIKTVTQNGTHVASVPTAVHGQVNNAAASPLHMLATHASASASLPTKRHNGDQPE QPELKRIKTEDGEGIVIALSVDTPPAAVREKGVQN" misc_feature 932..1225 /gene="ilF" /note="DNA-binding domain" BASE COUNT 744 a 902 c 741 g 656 t ORIGIN 1 acaaggtctg tttcttttcc cttaagaacc acaaaagtgc taattttccc tttacaaatg 61 ttacataaaa tcttgattac aaaactactc aacaacttaa tcctaccatc catacaatta 121 tatgttcctt ttcagcaagg tcaaaagcca acgatgtagt ccctttgtac actagagatt 181 ctttacataa tagaagttca cagactgaaa tatcattgct gcagtctttg gagaacagaa 241 ctggtccatt tctgtgaatc ctttttgcac ggacaagact caagactggc taggtgaaaa 301 ctggtggcag ggaccccagc cctcctgtta actttcagga ccacctactt catcatcaag 361 tctgagaggc tgaggttcgg tcagcagcag ctccgctttg gctctgatca tacagttttt 421 agtaatcttt cctgaaaaac actggctttc tctggaaaag cttcctcagc tggaagcttt 481 actccctcct cgacagaaag tgaggattac agatgacatg gctcaggtat tcgtggacgg 541 cgtgttccag aggcgcgggg cgccgccgct gcagctgccg cgcgtgtgca cattcaggtt 601 cccgagcaca aacatcaaga taacgttcac tgccctgtcc agcgagaaga gagagaagca 661 ggaggcgtct gagtctccag tgaaggccgt acagccacac atctcgcccc tgaccatcaa 721 cattccagac accatggccc acctcatcag ccctctgccc tcccccacgg gaaccatcag 781 cgctgcaaac tcctgcccct ccagcccccg gggagcgggg tcttcagggt acaaggtggg 841 ccgagtgatg ccatctgacc tcaatttaat ggctgacaac tcacagcctg aaaatgaaaa 901 ggaagcttca ggtggagaca gcccgaagga tgattcaaag ccgccttact cctacgcgca 961 gctgatagtt caggcgatta cgatggctcc cgacaaacag ctcaccctga acgggattta 1021 tacacacatc actaaaaatt atccctacta caggactgcg gacaagggct ggcagaattc 1081 aattcgccac aatctctctc tgaatcgtta tttcatcaaa gtgccgcgtt cccaggaaga 1141 accaggcaaa ggctcgttct ggaggataga cccagcctct gaaagcaaat taatagaaca 1201 ggcttttagg aaacgacggc ctaggggcgt gccctgcttt agaacccctc tgggaccgct 1261 ctcttctagg agtgccccag cctctcccaa tcacgcggga gtgctgtctg ctcactctag 1321 tggcgcccag acccctgaga gcctgtcgag ggaaggttcg ccggcccccc tggagcctga 1381 gcctggcgct gcacagccca aactcgctgt catccaggaa gcccggtttg cccagagcgc 1441 cccagggtca cctctgtcca gtcagccagt cttaatcacc gtccagcggc agctaccaca 1501 ggccatcaag cctgtcacct acactgtggc caccccagtg accacctcga cctcccagcc 1561 acccgtcgtg cagacggttc acgtcgtcca ccagatccca gcggtgtcgg tcaccagtgt 1621 ggccggactg gccccagcga acacgtacac tgtctctgga caagctgtgg tcaccccggc 1681 agccgtgctg gcccctccta aggcagaggc ccaggagaat ggagaccaca gggaagtcaa 1741 agtgaaagta gagcctattc ccgccattgg ccacgccacg ctcggcactg ccagccggat 1801 cattcagacg gcacagacca ccccggtcca gacggtgacc atagtacaac aggcacctct 1861 aggtcaacac cagctaccaa taaaaactgt aacacaaaac ggcactcacg tggcatcagt 1921 ccccactgcg gtccacggcc aggtgaacaa tgccgcggcg agtcctttgc acatgttggc 1981 aacacacgca tccgcatcgg cctccctgcc cacaaagcgc cacaacggtg accagccgga 2041 gcagccggag ctgaagcgga tcaagacaga agacggcgag ggcatcgtca ttgccctgag 2101 cgtggacacg ccaccggcag ccgtaaggga aaagggtgtc cagaactagc gaccgggaga 2161 gcttttcttt aacgatatca actctgtggt gccaaaagga gacgcggcct cccgccagca 2221 ctcgggggtg cagggccctg tggttggact tcacctctca gcactgaaaa cccaaaaccc 2281 agctggcctt aacactcctt aaagacagaa gtcacacttg aacaaaaccc acacacaaca 2341 aaacctgatt tgggagacgg tgtctccact gagcacctgc tgggctgagc ttctacctac 2401 gagtgaaact ctgtcctccc gcgaggacca ggcatcgctg tgtgaggacg gcacggccag 2461 cgcctgctgt gagtgggtct cccaagacta ggcctcagga cgcgggggga gccatccccg 2521 ccgccctcac aggacccacc aggcagcgga gacatgtgga attagagtat tttgaggtgt 2581 cctttcttta caaaataatg gggtcttggg catttcacat cactccattt ctactgagac 2641 tttcagaatc acacaggccc tttccgtgga tttcatttgg ggcaaagaaa caacatagtt 2701 ttgtttttgt tttcagccta tggaatgatt tccttttgtc tgtcttgttc aagttcagac 2761 gaagctactc tggcatctgc acatttccgt gttacagcag ctgcctgatg aattttatcc 2821 acctccattt cagcatgtgg ctcgcgtgga caggtggacg gacgctgtgg ccgcatggaa 2881 ccttgagaac ccagggacga gccagtgccg ggaaggaact gccgggactc accgagctgc 2941 acttaactgt tctctttctg gctatttttt gttgtttgtt tctttgtgtt gactttgtcc 3001 ctggcaaaat tttccactct gagtaaaaca agtctcggaa ttc // LOCUS HSIMOGN38 1202 bp RNA PRI 16-SEP-1996 DEFINITION H.sapiens mRNA for imogen 38. ACCESSION Z68747 NID g1546899 KEYWORDS imogen 38. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1202) AUTHORS Hutton,J.C. and Roep,B.O. TITLE Human Imogen 38. T-cell and antibody responses in newly diagnosed diabetic subjects JOURNAL Unpublished REFERENCE 2 (bases 1 to 1202) AUTHORS Hutton,J.C. TITLE Direct Submission JOURNAL Submitted (17-JAN-1996) John C Hutton, Clinical Biochemistry, University of Cambridge, Addenbrooke's Hospital, Hills Road, Cambridge, Cambs., CB2 2QR, United Kingdom FEATURES Location/Qualifiers source 1..1202 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="forearm skin" /cell_type="fibroblast primary cell culture" /clone_lib="PCR product" /sex="Male" CDS 6..1193 /note="proprotein" /codon_start=1 /product="imogen 38" /db_xref="PID:e218584" /db_xref="PID:g1546900" /translation="MFPRVSTFLPLRPLSRHPLSSGSPETSAAAIMLLTVRHGTVRYR SSALLARTKNNIQRYFGTNSVICSKKDKQSVRTEETSKETSESQDSEKENTKKDLLGI IKGMKVELSTVNVRTTKPPKRRPLKSLEATLGRLRRATEYAPKKRIEPLSPELVAAAS AVADSLPFDKQTTKSELLSQLQQHEEESRAQRDAKRPKISFSNIISDMKVARSATARV RSRPELRIQFDEGYDNYPGQEKTDDLKKRKNIFTGKRLNIFDMMAVTKEAPETDTSPS LWNVEFAKQLATVNEQPLQNGFEELIQWTKEGKLWEFPINNEAGFDDDGSEFHEHIFL EKHLESFPKQGPIRHFMELVTCGLSKNPYLSVKQKVEHIEWFRNYFNEKKDILKESNI QFN" BASE COUNT 407 a 242 c 272 g 281 t ORIGIN 1 cggcgatgtt tcctagagtc tcgacgttcc tacctcttcg ccccctttcc cgccaccctt 61 tgtcctctgg aagcccggag acatcagcgg ctgcgattat gctactcact gttcggcacg 121 gaacagtcag gtaccgcagt tcagcgctgt tggcccggac aaaaaataac atccaaagat 181 attttggcac taacagtgtg atctgtagca agaaagataa gcagtctgtt cgaactgagg 241 agacttccaa ggagacttca gagagccaag acagtgaaaa ggaaaatacg aaaaaagact 301 tgttaggcat tattaagggc atgaaagttg aattaagcac agtaaatgta cgaacaacaa 361 agccccccaa aagaagacca cttaaaagtt tggaagctac acttggcagg cttcgaagag 421 ctacagaata tgctccaaag aagagaattg agcccctgag tcctgagttg gtggcagctg 481 catctgctgt ggcagattct ctcccttttg ataagcaaac aaccaagtca gagctgctga 541 gccagctcca gcagcatgag gaagagtcaa gggcacagag agatgcaaag cgacctaaaa 601 ttagtttcag taacataata tcagatatga aagttgccag atctgctaca gctagagttc 661 gttcaagacc agagcttcgg attcagtttg atgaaggcta tgacaattat cctggccagg 721 agaagacgga tgatcttaaa aaaaggaaaa atatattcac agggaaaaga cttaatattt 781 ttgacatgat ggcagttact aaagaagcac ctgaaacaga cacatcacct tcactttgga 841 atgtggaatt tgctaagcag ttagccacag taaatgaaca accccttcag aatggatttg 901 aagagctgat ccagtggaca aaagagggga aactatggga gttcccaatt aacaatgaag 961 caggttttga tgatgatggt tcagaatttc atgaacatat atttctggag aaacacctgg 1021 agagctttcc aaaacaagga ccaattcgcc acttcatgga gctggtgact tgtggccttt 1081 ccaaaaaccc atatcttagt gttaaacaga aggttgaaca catagagtgg tttagaaatt 1141 attttaatga aaaaaaggat attctaaaag aaagtaacat acagttcaat taagaccatg 1201 ga // LOCUS HSINAL2 5373 bp RNA PRI 22-MAR-1995 DEFINITION Human mRNA for integrin alpha-2 subunit. ACCESSION X17033 NID g33906 KEYWORDS collagen receptor; integrin; transmembrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5373) AUTHORS Takada,Y. and Hemler,M.E. TITLE Direct Submission JOURNAL Submitted (22-SEP-1989) Takada Y., Hemler M.E., Hemler M.E., Dana-Farber Cancer Institute, Mayer 613, 44 Binney, Street, Boston, MA 02115, USA REFERENCE 2 (bases 1 to 5373) AUTHORS Takada,Y. and Hemler,M.E. TITLE The primary structure of the VLA-2/collagen receptor alpha 2 subunit (platelet GPIa): homology to other integrins and the presence of a possible collagen-binding domain JOURNAL J. Cell Biol. 109 (1), 397-407 (1989) MEDLINE 89308879 FEATURES Location/Qualifiers source 1..5373 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="endothelial cell" /clone="2.72" sig_peptide 49..135 /note="signal peptide (AA -29 to -1)" CDS 49..3594 /note="integrin alpha-2 preprotein (AA -29 to 1152)" /codon_start=1 /db_xref="PID:g33907" /db_xref="SWISS-PROT:P17301" /translation="MGPERTGAAPLPLLLVLALSQGILNCCLAYNVGLPEAKIFSGPS SEQFGYAVQQFINPKGNWLLVGSPWSGFPENRMGDVYKCPVDLSTATCEKLNLQTSTS IPNVTEMKTNMSLGLILTRNMGTGGFLTCGPLWAQQCGNQYYTTGVCSDISPDFQLSA SFSPATQPCPSLIDVVVVCDESNSIYPWDAVKNFLEKFVQGLDIGPTKTQVGLIQYAN NPRVVFNLNTYKTKEEMIVATSQTSQYGGDLTNTFGAIQYARKYAYSAASGGRRSATK VMVVVTDGESHDGSMLKAVIDQCNHDNILRFGIAVLGYLNRNALDTKNLIKEIKAIAS IPTERYFFNVSDEAALLEKAGTLGEQIFSIEGTVQGGDNFQMEMSQVGFSADYSSQND ILMLGAVGAFGWSGTIVQKTSHGHLIFPKQAFDQILQDRNHSSYLGYSVAAISTGEST HFVAGAPRANYTGQIVLYSVNENGNITVIQAHRGDQIGSYFGSVLCSVDVDKDTITDV LLVGAPMYMSDLKKEEGRVYLFTIKKGILGQHQFLEGPEGIENTRFGSAIAALSDINM DGFNDVIVGSPLENQNSGAVYIYNGHQGTIRTKYSQKILGSDGAFRSHLQYFGRSLDG YGDLNGDSITDVSIGAFGQVVQLWSQSIADVAIEASFTPEKITLVNKNAQIILKLCFS AKFRPTKQNNQVAIVYNITLDADGFSSRVTSRGLFKENNERCLQKNMVVNQAQSCPEH IIYIQEPSDVVNSLDLRVDISLENPGTSPALEAYSETAKVFSIPFHKDCGEDGLCISD LVLDVRQIPAAQEQPFIVSNQNKRLTFSVTLKNKRESAYNTGIVVDFSENLFFASFSL PVDGTEVTCQVAASQKSVACDVGYPALKREQQVTFTINFDFNLQNLQNQASLSFQALS ESQEENKADNLVNLKIPLLYDAEIHLTRSTNINFYEISSDGNVPSIVHSFEDVGPKFI FSLKVTTGSVPVSMATVIIHIPQYTKEKNPLMYLTGVQTDKAGDISCNADINPLKIGQ TSSSVSFKSENFRHTKELNCRTASCSNVTCWLKDVHMKGEYFVNVTTRIWNGTFASST FQTVQLTAAAEINTYNPEIYVIEDNTVTIPLMIMKPDEKAEVPTGVIIGSIIAGILLL LALVAILWKLGFFKRKYEKMTKNPDEIDETTELSS" mat_peptide 136..3591 /note="mature integrin alpha-2 (AA 1-1152)" BASE COUNT 1635 a 1088 c 1134 g 1516 t ORIGIN 1 gaattcctgc aaacccagcg caactacggt cccccggtca gacccaggat ggggccagaa 61 cggacagggg ccgcgccgct gccgctgctg ctggtgttag cgctcagtca aggcatttta 121 aattgttgtt tggcctacaa tgttggtctc ccagaagcaa aaatattttc cggtccttca 181 agtgaacagt ttgggtatgc agtgcagcag tttataaatc caaaaggcaa ctggttactg 241 gttggttcac cctggagtgg ctttcctgag aaccgaatgg gagatgtgta taaatgtcct 301 gttgacctat ccactgccac atgtgaaaaa ctaaatttgc aaacttcaac aagcattcca 361 aatgttactg agatgaaaac caacatgagc ctcggcttga tcctcaccag gaacatggga 421 actggaggtt ttctcacatg tggtcctctg tgggcacagc aatgtgggaa tcagtattac 481 acaacgggtg tgtgttctga catcagtcct gattttcagc tctcagccag cttctcacct 541 gcaactcagc cctgcccttc cctcatagat gttgtggttg tgtgtgatga atcaaatagt 601 atttatcctt gggatgcagt aaagaatttt ttggaaaaat ttgtacaagg ccttgatata 661 ggccccacaa agacacaggt ggggttaatt cagtatgcca ataatccaag agttgtgttt 721 aacttgaaca catataaaac caaagaagaa atgattgtag caacatccca gacatcccaa 781 tatggtgggg acctcacaaa cacattcgga gcaattcaat atgcaagaaa atatgcctat 841 tcagcagctt ctggtgggcg acgaagtgct acgaaagtaa tggtagttgt aactgacggt 901 gaatcacatg atggttcaat gttgaaagct gtgattgatc aatgcaacca tgacaatata 961 ctgaggtttg gcatagcagt tcttgggtac ttaaacagaa acgcccttga tactaaaaat 1021 ttaataaaag aaataaaagc gatcgctagt attccaacag aaagatactt tttcaatgtg 1081 tctgatgaag cagctctact agaaaaggct gggacattag gagaacaaat tttcagcatt 1141 gaaggtactg ttcaaggagg agacaacttt cagatggaaa tgtcacaagt gggattcagt 1201 gcagattact cttctcaaaa tgatattctg atgctgggtg cagtgggagc ttttggctgg 1261 agtgggacca ttgtccagaa gacatctcat ggccatttga tctttcctaa acaagccttt 1321 gaccaaattc tgcaggacag aaatcacagt tcatatttag gttactctgt ggctgcaatt 1381 tctactggag aaagcactca ctttgttgct ggtgctcctc gggcaaatta taccggccag 1441 atagtgctat atagtgtgaa tgagaatggc aatatcacgg ttattcaggc tcaccgaggt 1501 gaccagattg gctcctattt tggtagtgtg ctgtgttcag ttgatgtgga taaagacacc 1561 attacagacg tgctcttggt aggtgcacca atgtacatga gtgacctaaa gaaagaggaa 1621 ggaagagtct acctgtttac tatcaaaaag ggcattttgg gtcagcacca atttcttgaa 1681 ggccccgagg gcattgaaaa cactcgattt ggttcagcaa ttgcagctct ttcagacatc 1741 aacatggatg gctttaatga tgtgattgtt ggttcaccac tagaaaatca gaattctgga 1801 gctgtataca tttacaatgg tcatcagggc actatccgca caaagtattc ccagaaaatc 1861 ttgggatccg atggagcctt taggagccat ctccagtact ttgggaggtc cttggatggc 1921 tatggagatt taaatgggga ttccatcacc gatgtgtcta ttggtgcctt tggacaagtg 1981 gttcaactct ggtcacaaag tattgctgat gtagctatag aagcttcatt cacaccagaa 2041 aaaatcactt tggtcaacaa gaatgctcag ataattctca aactctgctt cagtgcaaag 2101 ttcagaccta ctaagcaaaa caatcaagtg gccattgtat ataacatcac acttgatgca 2161 gatggatttt catccagagt aacctccagg gggttattta aagaaaacaa tgaaaggtgc 2221 ctgcagaaga atatggtagt aaatcaagca cagagttgcc ccgagcacat catttatata 2281 caggagccct ctgatgttgt caactctttg gatttgcgtg tggacatcag tctggaaaac 2341 cctggcacta gccctgccct tgaagcctat tctgagactg ccaaggtctt cagtattcct 2401 ttccacaaag actgtggtga ggatggactt tgcatttctg atctagtcct agatgtccga 2461 caaataccag ctgctcaaga acaacccttt attgtcagca accaaaacaa aaggttaaca 2521 ttttcagtaa cactgaaaaa taaaagggaa agtgcataca acactggaat tgttgttgat 2581 ttttcagaaa acttgttttt tgcatcattc tccctaccgg ttgatgggac agaagtaaca 2641 tgccaggtgg ctgcatctca gaagtctgtt gcctgcgatg taggctaccc tgctttaaag 2701 agagaacaac aggtgacttt tactattaac tttgacttca atcttcaaaa ccttcagaat 2761 caggcgtctc tcagtttcca agccttaagt gaaagccaag aagaaaacaa ggctgataat 2821 ttggtcaacc tcaaaattcc tctcctgtat gatgctgaaa ttcacttaac aagatctacc 2881 aacataaatt tttatgaaat ctcttcggat gggaatgttc cttcaatcgt gcacagtttt 2941 gaagatgttg gtccaaaatt catcttctcc ctgaaggtaa caacaggaag tgttccagta 3001 agcatggcaa ctgtaatcat ccacatccct cagtatacca aagaaaagaa cccactgatg 3061 tacctaactg gggtgcaaac agacaaggct ggtgacatca gttgtaatgc agatatcaat 3121 ccactgaaaa taggacaaac atcttcttct gtatctttca aaagtgaaaa tttcaggcac 3181 accaaagaat tgaactgcag aactgcttcc tgtagtaatg ttacctgctg gttgaaagac 3241 gttcacatga aaggagaata ctttgttaat gtgactacca gaatttggaa cgggactttc 3301 gcatcatcaa cgttccagac agtacagcta acggcagctg cagaaatcaa cacctataac 3361 cctgagatat atgtgattga agataacact gttacgattc ccctgatgat aatgaaacct 3421 gatgagaaag ccgaagtacc aacaggagtt ataataggaa gtataattgc tggaatcctt 3481 ttgctgttag ctctggttgc aattttatgg aagctcggct tcttcaaaag aaaatatgaa 3541 aagatgacca aaaatccaga tgagattgat gagaccacag agctcagtag ctgaaccagc 3601 agacctacct gcagtgggaa ccggcagcat cccagccagg gtttgctgtt tgcgtgcatg 3661 gatttctttt taaatcccat atttttttta tcatgtcgta ggtaaactaa cctggtattt 3721 taagagaaaa ctgcaggtca gtttggatga agaaattgtg gggggtgggg gaggtgcggg 3781 gggcaggtag ggaaataata gggaaaatac ctattttata tgatggggga aaaaaagtaa 3841 tctttaaact ggctggccca gagtttacat tctaatttgc attgtgtcag aaacatgaaa 3901 tgcttccaag catgacaact tttaaagaaa aatatgatac tctcagattt taagggggaa 3961 aactgttctc tttaaaatat ttgtctttaa acagcaacta cagaagtgga agtgcttgat 4021 atgtaagtac ttccacttgt gtatatttta atgaatattg atgttaacaa gaggggaaaa 4081 caaaacacag gttttttcaa tttatgctgc tcatccaaag ttgccacaga tgatacttcc 4141 aagtgataat tttatttata aactaggtaa aatttgttgt tggttccttt tataccacgg 4201 ctgccccttc cacaccccat cttgctctaa tgatcaaaac atgcttgaat aactgagctt 4261 agagtatacc tcctatatgt ccatttaagt taggagaggg ggcgatatag agactaaggc 4321 acaaaatttt gtttaaaact cagaatataa catttatgta aaatcccatc tgctagaagc 4381 ccatcctgtg ccagaggaag gaaaaggagg aaatttcctt tctcttttag gaggcacaac 4441 agttctcttc taggatttgt ttggctgact ggcagtaacc tagtgaattt ttgaaagatg 4501 agtaatttct ttggcaacct tcctcctccc ttactgaacc actctcccac ctcctggtgg 4561 taccattatt atagaagccc tctacagcct gactttctct ccagcggtcc aaagttatcc 4621 cctcctttac ccctcatcca aagttcccac tccttcagga cagctgctgt gcattagata 4681 ttagggggga aagtcatctg tttaatttac acacttgcat gaattactgt atataaactc 4741 cttaacttca gggagctatt ttcatttagt gctaaacaag taagaaaaat aagctagagt 4801 gaatttctaa atgttggaat gttatgggat gtaaacaatg taaagtaaaa cactctcagg 4861 atttcaccag aagttacaga tgaggcactg gaaaccacca ccaaattagc aggtgcacct 4921 tctgtggctg tcttgtttct gaagtacttt ttcttccaca agagtgaatt tgacctaggc 4981 aagtttgttc aaaaggtaga tcctgagatg atttggtcag attgggataa ggcccagcaa 5041 tctgcatttt aacaagcacc ccagtcacta ggatgcagat ggaccacact ttgagaaaca 5101 ccacccattt ctactttttg caccttattt tctctgttcc tgagccccca cattctctag 5161 gagaaactta gattaaaatt cacagacact acatatctaa agctttgaca agtccttgac 5221 ctctataaac ttcagagtcc tcattataaa atgggaagac tgagctggag ttcagcagtg 5281 atgcttttta gttttaaaag tctatgatct gatctggact tcctataata caaatacaca 5341 atcctccaag aatttgactt ggaaaaggaa ttc // LOCUS HSINB4 5645 bp RNA PRI 23-MAR-1995 DEFINITION Human mRNA for integrin beta(4)subunit. ACCESSION X51841 NID g33910 KEYWORDS integrin; integrin beta(4)subunit. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5645) AUTHORS Suzuki,S. TITLE Direct Submission JOURNAL Submitted (10-FEB-1990) Suzuki S., Doheny Eye Institute, 1355 San Pablo Street, Los Angeles CA 90033, U S A REFERENCE 2 (bases 1 to 5645) AUTHORS Suzuki,S. and Naitoh,Y. TITLE Amino acid sequence of a novel integrin beta 4 subunit and primary expression of the mRNA in epithelial cells JOURNAL EMBO J. 9 (3), 757-763 (1990) MEDLINE 90183973 COMMENT Data kindly reviewed (20-JUN-1990) by Suzuki S. FEATURES Location/Qualifiers source 1..5645 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /cell_type="retinal pigment epithelial cells" /clone="lambda-beta(4)-[E(5),1]" sig_peptide 127..207 /note="signal peptide (AA -27 to -1)" CDS 127..5385 /note="integrin beta(4)subunit precursor (AA -27 to 1725)" /codon_start=1 /db_xref="PID:g33911" /db_xref="SWISS-PROT:P16144" /translation="MAGPRPSPWARLLLAALISVSLSGTLANRCKKAPVKSCTECVRV DKDCAYCTDEMFRDRRCNTQAELLAAGCQRESIVVMESSFQITEETQIDTTLRRSQMS PQGLRVRLRPGEERHFELEVFEPLESPVDLYILMDFSNSMSDDLDNLKKMGQNLARVL SQLTSDYTIGFGKFVDKVSVPQTDMRPEKLKEPWPNSDPPFSFKNVISLTEDVDEFRN KLQGERISGNLDAPEGGFDAILQTAVCTRDIGWRPDSTHLLVFSTESAFHYEADGANV LAGIMSRNDERCHLDTTGTYTQYRTQDYPSVPTLVRLLAKHNIIPIFAVTNYSYSYYE KLHTYFPVSSLGVLQEDSSNIVELLEEAFNRIRSNLDIRALDSPRGLRTEVTSKMFQK TRTGSFHIRRGEVGIYQVQLRALEHVDGTHVCQLPEDQKGNIHLKPSFSDGLKMDAGI ICDVCTCELQKEVRSARCSFNGDFVCGQCVCSEGWSGQTCNCSTGSLSDIQPCLREGE DKPCSGRGECQCGHCVCYGEGRYEGQFCEYDNFQCPRTSGFLCNDRGRCSMGQCVCEP GWTGPSCDCPLSNATCIDSNGGICNGRGHCECGRCHCHQQSLYTDTICEINYSAIHPG LCEDLRSCVQCQAWGTGEKKGRTCEECNFKVKMVDELKRAEEVVVRCSFRDEDDDCTY SYTMEGDGAPGPNSTVLVHKKKDCPPGSFWWLIPLLLLLLPLLALLLLLCWKYCACCK ACLALLPCCNRGHMVGFKEDHYMLRENLMASDHLDTPMLRSGNLKGRDVVRWKVTNNM QRPGFATHAASINPTELVPYGLSLRLARLCTENLLKPDTRECAQLRQEVEENLNEVYR QISGVHKLQQTKFRQQPNAGKKQDHTIVDTVLMAPRSAKPALLKLTEKQVEQRAFHDL KVAPGYYTLTADQDARGMVEFQEGVELVDVRVPLFIRPEDDDEKQLLVEAIDVPAGTA TLGRRLVNITIIKEQARDVVSFEQPEFSVSRGDQVARIPVIRRVLDGGKSQVSYRTQD GTAQGNRDYIPVEGELLFQPGEAWKELQVKLLELQEVDSLLRGRQVRRFHVQLSNPKF GAHLGQPHSTTIIIRDPDELDRSFTSQMLSSQPPPHGDLGAPQNPNAKAAGSRKIHFN WLPPSGKPMGYRVKYWIQGDSESEAHLLDSKVPSVELTNLYPYCDYEMKVCAYGAQGE GPYSSLVSCRTHQEVPSEPGRLAFNVVSSTVTQLSWAEPAETNGEITAYEVCYGLVND DNRPIGPMKKVLVDNPKNRMLLIENLRESQPYRYTVKARNGAGWGPEREAIINLATQP KRPMSIPIIPDIPIVDAQSGEDYDSFLMYSDDVLRSPSGSQRPSVSDDTEHLVNGRMD FAFPGSTNSLHRMTTTSAAAYGTHLSPHVPHRVLSTSSTLTRDYNSLTRSEHSHSTTL PRDYSTLTSVSSHDSRLTAGVPDTPTRLVFSALGPTSLRVSWQEPRCERPLQGYSVEY QLLNGGELHRLNIPNPAQTSVVVEDLLPNHSYVFRVRAQSQEGWGREREGVITIESQV HPQSPLCPLPGSAFTLSTPSAPGPLVFTALSPDSLQLSWERPRRPNGDIVGYLVTCEM AQGGGPATAFRVDGDSPESRLTVPGLSENVPYKFKVQARTTEGFGPEREGIITIESQD GGPFPQLGSRAGLFQHPLQSEYSSITTTHTSATEPFLVDGPTLGAQHLEAGGSLTRHV TQEFVSRTLTTSGTLSTHMDQQFFQT" mat_peptide 208..5382 /note="mature integrin beta(4)subunit (AA 1-1725)" misc_feature 5626..5631 /note="polyA signal" polyA_site 5645 /note="polyA site" BASE COUNT 1137 a 1856 c 1694 g 958 t ORIGIN 1 cgcccgcgcg ctgcagcccc atctcctagc ggcagcccag gcgcggaggg agcgagtccg 61 ccccgaggta ggtccaggac gggcgcacag cagcagccga ggctggccgg gagagggagg 121 aagaggatgg cagggccacg ccccagccca tgggccaggc tgctcctggc agccttgatc 181 agcgtcagcc tctctgggac cttggcaaac cgctgcaaga aggccccagt gaagagctgc 241 acggagtgtg tccgtgtgga taaggactgc gcctactgca cagacgagat gttcagggac 301 cggcgctgca acacccaggc ggagctgctg gccgcgggct gccagcggga gagcatcgtg 361 gtcatggaga gcagcttcca aatcacagag gagacccaga ttgacaccac cctgcggcgc 421 agccagatgt ccccccaagg cctgcgggtc cgtctgcggc ccggtgagga gcggcatttt 481 gagctggagg tgtttgagcc actggagagc cccgtggacc tgtacatcct catggacttc 541 tccaactcca tgtccgatga tctggacaac ctcaagaaga tggggcagaa cctggctcgg 601 gtcctgagcc agctcaccag cgactacact attggatttg gcaagtttgt ggacaaagtc 661 agcgtcccgc agacggacat gaggcctgag aagctgaagg agccctggcc caacagtgac 721 ccccccttct ccttcaagaa cgtcatcagc ctgacagaag atgtggatga gttccggaat 781 aaactgcagg gagagcggat ctcaggcaac ctggatgctc ctgagggcgg cttcgatgcc 841 atcctgcaga cagctgtgtg cacgagggac attggctggc gcccggacag cacccacctg 901 ctggtcttct ccaccgagtc agccttccac tatgaggctg atggcgccaa cgtgctggct 961 ggcatcatga gccgcaacga tgaacggtgc cacctggaca ccacgggcac ctacacccag 1021 tacaggacac aggactaccc gtcggtgccc accctggtgc gcctgctcgc caagcacaac 1081 atcatcccca tctttgctgt caccaactac tcctatagct actacgagaa gcttcacacc 1141 tatttccctg tctcctcact gggggtgctg caggaggact cgtccaacat cgtggagctg 1201 ctggaggagg ccttcaatcg gatccgctcc aacctggaca tccgggccct agacagcccc 1261 cgaggccttc ggacagaggt cacctccaag atgttccaga agacgaggac tgggtccttt 1321 cacatccggc ggggggaagt gggtatatac caggtgcagc tgcgggccct tgagcacgtg 1381 gatgggacgc acgtgtgcca gctgccggag gaccagaagg gcaacatcca tctgaaacct 1441 tccttctccg acggcctcaa gatggacgcg ggcatcatct gtgatgtgtg cacctgcgag 1501 ctgcaaaaag aggtgcggtc agctcgctgc agcttcaacg gagacttcgt gtgcggacag 1561 tgtgtgtgca gcgagggctg gagtggccag acctgcaact gctccaccgg ctctctgagt 1621 gacattcagc cctgcctgcg ggagggcgag gacaagccgt gctccggccg tggggagtgc 1681 cagtgcgggc actgtgtgtg ctacggcgaa ggccgctacg agggtcagtt ctgcgagtat 1741 gacaacttcc agtgtccccg cacttccggg ttcctctgca atgaccgagg acgctgctcc 1801 atgggccagt gtgtgtgtga gcctggttgg acaggcccaa gctgtgactg tcccctcagc 1861 aatgccacct gcatcgacag caatgggggc atctgtaatg gacgtggcca ctgtgagtgt 1921 ggccgctgcc actgccacca gcagtcgctc tacacggaca ccatctgcga gatcaactac 1981 tcggcgatcc acccgggcct ctgcgaggac ctacgctcct gcgtgcagtg ccaggcgtgg 2041 ggcaccggcg agaagaaggg gcgcacgtgt gaggaatgca acttcaaggt caagatggtg 2101 gacgagctta agagagccga ggaggtggtg gtgcgctgct ccttccggga cgaggatgac 2161 gactgcacct acagctacac catggaaggt gacggcgccc ctgggcccaa cagcactgtc 2221 ctggtgcaca agaagaagga ctgccctccg ggctccttct ggtggctcat ccccctgctc 2281 ctcctcctcc tgccgctcct ggccctgcta ctgctgctat gctggaagta ctgtgcctgc 2341 tgcaaggcct gcctggcact tctcccgtgc tgcaaccgag gtcacatggt gggctttaag 2401 gaagaccact acatgctgcg ggagaacctg atggcctctg accacttgga cacgcccatg 2461 ctgcgcagcg ggaacctcaa gggccgtgac gtggtccgct ggaaggtcac caacaacatg 2521 cagcggcctg gctttgccac tcatgccgcc agcatcaacc ccacagagct ggtgccctac 2581 gggctgtcct tgcgcctggc ccgcctttgc accgagaacc tgctgaagcc tgacactcgg 2641 gagtgcgccc agctgcgcca ggaggtggag gagaacctga acgaggtcta caggcagatc 2701 tccggtgtac acaagctcca gcagaccaag ttccggcagc agcccaatgc cgggaaaaag 2761 caagaccaca ccattgtgga cacagtgctg atggcgcccc gctcggccaa gccggccctg 2821 ctgaagctta cagagaagca ggtggaacag agggccttcc acgacctcaa ggtggccccc 2881 ggctactaca ccctcactgc agaccaggac gcccggggca tggtggagtt ccaggagggc 2941 gtggagctgg tggacgtacg ggtgcccctc tttatccggc ctgaggatga cgacgagaag 3001 cagctgctgg tggaggccat cgacgtgccc gcaggcactg ccaccctcgg ccgccgcctg 3061 gtaaacatca ccatcatcaa ggagcaagcc agagacgtgg tgtcctttga gcagcctgag 3121 ttctcggtca gccgcgggga ccaggtggcc cgcatccctg tcatccggcg tgtcctggac 3181 ggcgggaagt cccaggtctc ctaccgcaca caggatggca ccgcgcaggg caaccgggac 3241 tacatccccg tggagggtga gctgctgttc cagcctgggg aggcctggaa agagctgcag 3301 gtgaagctcc tggagctgca agaagttgac tccctcctgc ggggccgcca ggtccgccgt 3361 ttccacgtcc agctcagcaa ccctaagttt ggggcccacc tgggccagcc ccactccacc 3421 accatcatca tcagggaccc agatgaactg gaccggagct tcacgagtca gatgttgtca 3481 tcacagccac cccctcacgg cgacctgggc gccccgcaga accccaatgc taaggccgct 3541 gggtccagga agatccattt caactggctg cccccttctg gcaagccaat ggggtacagg 3601 gtaaagtact ggattcaggg tgactccgaa tccgaagccc acctgctcga cagcaaggtg 3661 ccctcagtgg agctcaccaa cctgtacccg tattgcgact atgagatgaa ggtgtgcgcc 3721 tacggggctc agggcgaggg accctacagc tccctggtgt cctgccgcac ccaccaggaa 3781 gtgcccagcg agccagggcg tctggccttc aatgtcgtct cctccacggt gacccagctg 3841 agctgggctg agccggctga gaccaacggt gagatcacag cctacgaggt ctgctatggc 3901 ctggtcaacg atgacaaccg acctattggg cccatgaaga aagtgctggt tgacaaccct 3961 aagaaccgga tgctgcttat tgagaacctt cgggagtccc agccctaccg ctacacggtg 4021 aaggcgcgca acggggccgg ctgggggcct gagcgggagg ccatcatcaa cctggccacc 4081 cagcccaaga ggcccatgtc catccccatc atccctgaca tccctatcgt ggacgcccag 4141 agcggggagg actacgacag cttccttatg tacagcgatg acgttctacg ctctccatcg 4201 ggcagccaga ggcccagcgt ctccgatgac actgagcacc tggtgaatgg ccggatggac 4261 tttgccttcc cgggcagcac caactccctg cacaggatga ccacgaccag tgctgctgcc 4321 tatggcaccc acctgagccc acacgtgccc caccgcgtgc taagcacatc ctccaccctc 4381 acacgggact acaactcact gacccgctca gaacactcac actcgaccac actgccgagg 4441 gactactcca ccctcacctc cgtctcctcc cacgactctc gcctgactgc tggtgtgccc 4501 gacacgccca cccgcctggt gttctctgcc ctggggccca catctctcag agtgagctgg 4561 caggagccgc ggtgcgagcg gccgctgcag ggctacagtg tggagtacca gctgctgaac 4621 ggcggtgagc tgcatcggct caacatcccc aaccctgccc agacctcggt ggtggtggaa 4681 gacctcctgc ccaaccactc ctacgtgttc cgcgtgcggg cccagagcca ggaaggctgg 4741 ggccgagagc gtgagggtgt catcaccatt gaatcccagg tgcacccgca gagcccactg 4801 tgtcccctgc caggctccgc cttcactttg agcactccca gtgccccagg cccgctggtg 4861 ttcactgccc tgagcccaga ctcgctgcag ctgagctggg agcggccacg gaggcccaat 4921 ggggatatcg tcggctacct ggtgacctgt gagatggccc aaggaggagg gccagccacc 4981 gcattccggg tggatggaga cagccccgag agccggctga ccgtgccggg cctcagcgag 5041 aacgtgccct acaagttcaa ggtgcaggcc aggaccactg agggcttcgg gccagagcgc 5101 gagggcatca tcaccataga gtcccaggat ggaggaccct tcccgcagct gggcagccgt 5161 gccgggctct tccagcaccc gctgcaaagc gagtacagca gcatcaccac cacccacacc 5221 agcgccaccg agcccttcct agtggatggg ccgaccctgg gggcccagca cctggaggca 5281 ggcggctccc tcacccggca tgtgacccag gagtttgtga gccggacact gaccaccagc 5341 ggaaccctta gcacccacat ggaccaacag ttcttccaaa cttgaccgca ccctgcccca 5401 cccccgccat gtcccactag gcgtcctccc gactcctctc ccggagcctc ctcagctact 5461 ccatccttgc acccctgggg gcccagccca cccgcatgca cagagcaggg gctaggtgtc 5521 tcctgggagg catgaagggg gcaaggtccg tcctctgtgg gcccaaacct atttgtaacc 5581 aaagagctgg gagcagcaca aggacccagc ctttgttctg cacttaataa atggttttgc 5641 tactg // LOCUS HSINE1 946 bp RNA PRI 04-AUG-1997 DEFINITION H.sapiens INE1 mRNA. ACCESSION Y10696 NID g2239120 KEYWORDS Alu repetitive element; INE1 gene; X-inactivation. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 946) AUTHORS Esposito,T., Gianfrancesco,F., Ciccodicola,A., D'Esposito,M., Nagaraja,R., Mazzarella,R., D'Urso,M. and Forabosco,A. TITLE Escape from X inactivation of two new genes associated with DXS6974E and DXS7020E JOURNAL Genomics 43 (2), 183-190 (1997) MEDLINE 97386586 REFERENCE 2 (bases 1 to 946) AUTHORS Forabosco,A. TITLE Direct Submission JOURNAL Submitted (10-JAN-1997) A. Forabosco, Sezione Istologia, Embriologia e Genetica, Dipartimento Scienze Morfologiche e Medico-Legali, Universita di Modena, Via Del Pozzo 71, 41100 Modena, ITALY FEATURES Location/Qualifiers source 1..946 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="8 week embryo" /chromosome="X" /map="Xp11.3-p11.4" gene 155..433 /gene="INE1" CDS 155..433 /gene="INE1" /codon_start=1 /db_xref="PID:e310101" /db_xref="PID:g2239121" /translation="MTMVKEGLSLQRPQGYEGLECLLLWKSLRKDPEVGGVREPTHFT VSGGGIQGRILSRGGTWPNLGFERILLVCVPASWWWLGESASILGAVT" repeat_region 650..946 /rpt_family="Alu" BASE COUNT 238 a 208 c 283 g 217 t ORIGIN 1 gattcttgga gggagagggg aacggacagc aaacaataga cacaatacga tcgtgaggta 61 tatagttaat gttggtgatg agtgctcctc aaaggaaaca actaaaacca gacaagggaa 121 gtcggggtgt cagggagaat ggtttgtagt attgatgaca atggttaagg aaggcctgag 181 tttgcaaaga cctcaaggat atgagggact agagtgcctt ctgctgtgga agagcctgcg 241 caaagatcct gaggtgggag gggtcaggga gccaactcat tttacggtaa gtggaggtgg 301 gatccagggg aggattctga gcagaggagg gacgtggccc aatttgggct ttgaaaggat 361 cctgcttgtc tgtgtgccag catcctggtg gtggttgggg gagtctgcat caatcctggg 421 tgctgttacc tgaaaaaata gcaggtgtta gacgaaagaa cagatgtctg gcccattgtt 481 tccagtctgt tcctgtccac agcttccctt tatgctgagc ccctgccaca tgcatcatca 541 tcctggccat gtggcattgt cacagaccgt ctctcctgct agcctcctaa ctcagggcct 601 gggtcttccc cagcattaag taaatcagtc tcaggcagac agccctttat aggtgtttgg 661 gccaggtgcg gtggctcacg cctgtaatcc cagctctttg tggggccgag gtctttgagc 721 tcaggagttc aagaccagcc tgggcagcat gacaaaaccc tgtctctacc aaaaatacaa 781 aaattagcca cgcatggtgg cacacacctg tggtcccagc tactcgggag gctgaggtgg 841 gaggatcgct ggagcctggg aagttgaggc tgcagtgagc cgtgatcatg ccactgcatt 901 ccagcctgtg tgatggagag agaccctgtc tcaaaaaaaa aaaaaa // LOCUS HSINFGER 1172 bp RNA PRI 21-MAR-1995 DEFINITION Human mRNA for gamma-interferon inducible early response gene (with homology to platelet proteins). ACCESSION X02530 M17752 NID g33917 KEYWORDS interferon response; signal peptide. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1172) AUTHORS Luster,A.D., Unkeless,J.C. and Ravetch,J.V. TITLE Gamma-interferon transcriptionally regulates an early-response gene containing homology to platelet proteins JOURNAL Nature 315 (6021), 672-676 (1985) MEDLINE 85240552 REFERENCE 2 (bases 1 to 1172) AUTHORS Luster,A.D. TITLE Direct Submission JOURNAL Submitted (29-JUL-1986) to the EMBL/GenBank/DDBJ databases COMMENT Data kindly reviewed (29-JUL-1986) by Luster A.D. FEATURES Location/Qualifiers source 1..1172 /organism="Homo sapiens" /strain="(U 937 histiocytic lymphoma cell line)" /db_xref="taxon:9606" misc_RNA 1 /note="cap site" sig_peptide 67..129 /note="pot. signal peptide (aa-21 to -1)" CDS 67..363 /note="early response precursor polypeptide (aa-21 to 77)" /codon_start=1 /db_xref="PID:g33918" /db_xref="SWISS-PROT:P02778" /translation="MNQTAILICCLIFLTLSGIQGVPLSRTVRCTCISISNQPVNPRS LEKLEIIPASQFCPRVEIIATMKKKGEKRCLNPESKAIKNLLKAVSKEMSKRSP" mat_peptide 130..360 /note="mature early response polypeptide (aa 1-77)" old_sequence 1138..1141 /note="ugaa was uga in [1]" /citation=[1] old_sequence 1146..1148 /note="caa was ca in [1]" /citation=[1] misc_feature 1155..1160 /note="pot. polyA signal" polyA_site 1172 /note="polyA site" BASE COUNT 384 a 231 c 208 g 349 t ORIGIN 1 gagacattcc tcaattgctt agacatattc tgagcctaca gcagaggaac ctccagtctc 61 agcaccatga atcaaactgc gattctgatt tgctgcctta tctttctgac tctaagtggc 121 attcaaggag tacctctctc tagaaccgta cgctgtacct gcatcagcat tagtaatcaa 181 cctgttaatc caaggtcttt agaaaaactt gaaattattc ctgcaagcca attttgtcca 241 cgtgttgaga tcattgctac aatgaaaaag aagggtgaga agagatgtct gaatccagaa 301 tcgaaggcca tcaagaattt actgaaagca gttagcaagg aaatgtctaa aagatctcct 361 taaaaccaga ggggagcaaa atcgatgcag tgcttccaag gatggaccac acagaggctg 421 cctctcccat cacttcccta catggagtat atgtcaagcc ataattgttc ttagtttgca 481 gttacactaa aaggtgacca atgatggtca ccaaatcagc tgctactact cctgtaggaa 541 ggttaatgtt catcatccta agctattcag taataactct accctggcac tataatgtaa 601 gctctactga ggtgctatgt tcttagtgga tgttctgacc ctgcttcaaa tatttccctc 661 acctttccca tcttccaagg gtactaagga atctttctgc tttggggttt atcagaattc 721 tcagaatctc aaataactaa aaggtatgca atcaaatctg ctttttaaag aatgctcttt 781 acttcatgga cttccactgc catcctccca aggggcccaa attctttcag tggctaccta 841 catacaattc caaacacata caggaaggta gaaatatctg aaaatgtatg tgtaagtatt 901 cttatttaat gaaagactgt acaaagtata agtcttagat gtatatattt cctatattgt 961 tttcagtgta catggaataa catgtaatta agtactatgt atcaatgagt aacaggaaaa 1021 ttttaaaaat acagatagat atatgctctg catgttacat aagataaatg tgctgaatgg 1081 ttttcaaata aaaatgaggt actctcctgg aaatattaag aaagactatc taaatgttga 1141 aagatcaaaa ggttaataaa gtaattataa ct // LOCUS HSINH2 1268 bp RNA PRI 11-APR-1997 DEFINITION H.sapiens mRNA for inhibitor 2 gene. ACCESSION X78873 NID g474387 KEYWORDS inhibitor-2. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1268) AUTHORS Helps,N.R., Street,A.J., Elledge,S.J. and Cohen,P.T. TITLE Cloning of the complete coding region for human protein phosphatase inhibitor 2 using the two hybrid system and expression of inhibitor 2 in E. coli JOURNAL FEBS Lett. 340 (1-2), 93-98 (1994) MEDLINE 94164316 REFERENCE 2 (bases 1 to 1268) AUTHORS Helps,N.R. TITLE Direct Submission JOURNAL Submitted (19-APR-1994) N.R. Helps, Medical Research Council Protein, Phosphorylation Unit, Dept of Biochemistry, Univ. of Dundee, Medical Sciences Institute, Dow Street, Dundee DD1 4HN Scotland, UK FEATURES Location/Qualifiers source 1..1268 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="peripheral lymphocytes" /clone_lib="lambda ACT library" gene 4..1268 /gene="inhibitor 2" CDS 4..621 /gene="inhibitor 2" /codon_start=1 /db_xref="PID:g474388" /db_xref="SWISS-PROT:P41236" /translation="MAASTASHRPIKGILKNKTSTTSSMVASAEQPRGNVDEELSKKS QKWDEMNILATYHPADKDYGLMKIDEPSTPYHSMMGDDEDACSDTEATEAMAPDILAR KLAAAEGLEPKYRIQEQESSGEEDSDLSPEEREKKRQFEMKRKLHYNEGLNIKLARQL ISKDLHDDDEDEEMLETADGESMNTEESNQGSTPSDQQQNKLRSS" polyA_signal 1245..1250 /gene="inhibitor 2" polyA_site 1268 /gene="inhibitor 2" BASE COUNT 437 a 226 c 250 g 355 t ORIGIN 1 ccaatggcgg cctcgacggc ctcgcaccgg cccatcaagg ggatcttgaa gaacaagacc 61 tctacgactt cctctatggt ggcgtcggcc gaacagcccc gcgggaatgt cgacgaggag 121 ctgagcaaaa aatcccagaa gtgggatgaa atgaacatct tggcgacgta tcatccagca 181 gacaaagact atggtttaat gaaaatagat gaaccaagca ctccttacca tagtatgatg 241 ggggatgatg aagatgcctg tagtgacacc gaggccactg aagccatggc gccagacatc 301 ttagccagga aattagctgc agctgaaggc ttggagccaa agtatcggat tcaggaacaa 361 gaaagcagtg gagaggagga tagtgacctc tcacctgaag aacgagaaaa aaagcgacaa 421 tttgaaatga aaaggaagct tcactacaat gaaggactca atatcaaact agccagacaa 481 ttaatttcaa aagacctaca tgatgatgat gaagatgaag aaatgttaga gactgcagat 541 ggagaaagca tgaatacgga agaatcaaat caaggatcta ctccaagtga ccaacagcaa 601 aacaaattac gaagttcata gacgagattt gttcaacact gcaattgttt gttagatgta 661 aaccctgtga ctatagtacg ttgcttcttg ttcttcacaa ttcatgactt aagtaccaaa 721 atgcatacca gttattatat attgccaaga attaaatgat aaacttagag actgattaga 781 ctgaaaatgc ctaatcgata tatatattct tgtgcctagt actttaccac aaatacagtg 841 taatatcatc agtccaaaac tgcattactt ttgtaaaaac actggttaat ttgtataaga 901 tattatagag ctttttatgc tttagaagtt aaacaatatc tttggggggg aactaattta 961 ttttcatcac ttgaaatgtg gtagctctta caaagtttat tgatttgatt tttttaaaaa 1021 tcaaaagcca attgaacaac aggatatata gactgataaa tatttaggct gaatagtatt 1081 ttaacacttg tcttcaactt gatttgtctg tttaattgaa aagaattata agagttactg 1141 ttgcattttc tgacctacta tttttaaaat tcctgttgag tttctttgtg tttacaagga 1201 aaggactgaa ctttttctca tcaaaactag cttttttccc cacaaataaa ttatcaggtt 1261 aaactttc // LOCUS HSINOSA 4164 bp RNA PRI 13-JAN-1994 DEFINITION H.sapiens mRNA for nitric oxide synthase. ACCESSION X73029 NID g441452 KEYWORDS nitric oxide synthase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4164) AUTHORS Charles,I.G., Palmer,R.M., Hickery,M.S., Bayliss,M.T., Chubb,A.P., Hall,V.S., Moss,D.W. and Moncada,S. TITLE Cloning, characterization, and expression of a cDNA encoding an inducible nitric oxide synthase from the human chondrocyte JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90 (23), 11419-11423 (1993) MEDLINE 94068614 REFERENCE 2 (bases 1 to 4164) AUTHORS Charles,I. TITLE Direct Submission JOURNAL Submitted (23-APR-1993) I. Charles, Wellcome Research Laboratories, Ble. 113, Deot. of Cekk Biology, Langley Park, Beckenham, Kent, BR3 3BS, UK FEATURES Location/Qualifiers source 1..4164 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="chondrocyte" gene 226..3687 /gene="INOS" CDS 226..3687 /gene="INOS" /codon_start=1 /product="nitric oxide synthase" /db_xref="PID:g441453" /db_xref="SWISS-PROT:P35228" /translation="MACPWKFLFKTKFHQYAMNGEKDINNNVEKAPCATSSPVTQDDL QYHNLSKQQNESPQPLVETGKKSPESLVKLDATPLSSPRHVRIKNWGSGMTFQDTLHH KAKGILTCRSKSCLGSIMTPKSLTRGPRDKPTPPDELLPQAIEFVNQYYGSFKEAKIE EHLARVEAVTKEIETTGTYQLTGDELIFATKQAWRNAPRCIGRIQWSNLQVFDARSCS TAREMFEHICRHVRYSTNNGNIRSAITVFPQRSDGKHDFRVWNAQLIRYAGYQMPDGS IRGDPANVEFTQLCIDLGWKPKYGRFDVVPLVLQANGRDPELFEIPPDLVLEVAMEHP KYEWFRELELKWYALPAVANMLLEVGGLEFPGCPFNGWYMGTEIGVRDFCDVQRYNIL EEVGRRMGLETHKLASLWKDQAVVEINIAVLHSFQKQNVTIMDHHSAAESFMKYMQNE YRSRGGCPADWIWLVPPMSGSITPVFHQEMLNYVLSPFYYYQVEAWKTHVWQDEKRRP KRREIPLKVLVKAVLFACMLMRKTMASRVRVTILFATETGKSEALAWDLGALFSCAFN PKVVCMDKYRLSCLEEERLLLVVTSTFGNGDCPGNGEKLKKSLFMLKELNNKFRYAVF GLGSSMYPRFCAFAHDIDQKLSHLGASQLTPMGEGDELSGQEDAFRSWAVQTFKAACE TFDVRGKQHIQIPKLYTSNVTWDPHHYRLVQDSQPLDLSKALSSMHAKNVFTMRLKSR QNLQSPTSSRATILVELSCEDGQGLNYLPGEHLGVCPGNQPALVQGILERVVDGPTPH QTVRLEALDESGSYWVSDKRLPPCSLSQALTYFLDITTPPTQLLLQKLAQVATEEPER QRLEALCQPSEYSKWKFTNSPTFLEVLEEFPSLRVSAGFLLSQLPILKPRFYSISSSR DHTPTEIHLTVAVVTYHTRDGQGPLHHGVCSTWLNSLKPQDPVPCFVRNASGFHLPED PSHPCILIGPGTGIAPFRSFWQQRLHDSQHKGVRGGRMTLVFGCRRPDEDHIYQEEML EMAQKGVLHAVHTAYSRLPGKPKVYVQDILRQQLASEVLRVLHKEPGHLYVCGDVRMA RDVAHTLKQLVAAKLKLNEEQVEDYFFQLKSQKRYHEDIFGAVFPYEAKKDRVAVQPS SLEMSAL" BASE COUNT 974 a 1210 c 1127 g 853 t ORIGIN 1 agagaactca gcctcattcc tgctttaaaa tctctcggcc acctttgatg aggggactgg 61 gcagttctag acagtcccga agttctcaag gcacaggtct cttcctggtt tgactgtcct 121 taccccgggg aggcagtgca gccagctgca agccccacag tgaagaacat ctgagctcaa 181 atccagataa gtgacataag tgacctgctt tgtaaagcca tagagatggc ctgtccttgg 241 aaatttctgt tcaagaccaa attccaccag tatgcaatga atggggaaaa agacatcaac 301 aacaatgtgg agaaagcccc ctgtgccacc tccagtccag tgacacagga tgaccttcag 361 tatcacaacc tcagcaagca gcagaatgag tccccgcagc ccctcgtgga gacgggaaag 421 aagtctccag aatctctggt caagctggat gcaaccccat tgtcctcccc acggcatgtg 481 aggatcaaaa actggggcag cgggatgact ttccaagaca cacttcacca taaggccaaa 541 gggattttaa cttgcaggtc caaatcttgc ctggggtcca ttatgactcc caaaagtttg 601 accagaggac ccagggacaa gcctacccct ccagatgagc ttctacctca agctatcgaa 661 tttgtcaacc aatattacgg ctccttcaaa gaggcaaaaa tagaggaaca tctggccagg 721 gtggaagcgg taacaaagga gatagaaaca acaggaacct accaactgac gggagatgag 781 ctcatcttcg ccaccaagca ggcctggcgc aatgccccac gctgcattgg gaggatccag 841 tggtccaacc tgcaggtctt cgatgcccgc agctgttcca ctgcccggga aatgtttgaa 901 cacatctgca gacacgtgcg ttactccacc aacaatggca acatcaggtc ggccatcacc 961 gtgttccccc agcggagtga tggcaagcac gacttccggg tgtggaatgc tcagctcatc 1021 cgctatgctg gctaccagat gccagatggc agcatcagag gggaccctgc caacgtggaa 1081 ttcactcagc tgtgcatcga cctgggctgg aagcccaagt acggccgctt cgatgtggtc 1141 cccctggtcc tgcaggccaa tggccgtgac cctgagctct tcgaaatccc acctgacctt 1201 gtgcttgagg tggccatgga acatcccaaa tacgagtggt ttcgggaact ggagctaaag 1261 tggtacgccc tgcctgcagt ggccaacatg ctgcttgagg tgggcggcct ggagttccca 1321 gggtgcccct tcaatggctg gtacatgggc acagagatcg gagtccggga cttctgtgac 1381 gtccagcgct acaacatcct ggaggaagtg ggcaggagaa tgggcctgga aacgcacaag 1441 ctggcctcgc tctggaaaga ccaggctgtc gttgagatca acattgctgt gctccatagt 1501 ttccagaagc agaatgtgac catcatggac caccactcgg ctgcagaatc cttcatgaag 1561 tacatgcaga atgaataccg gtcccgtggg ggctgcccgg cagactggat ttggctggtc 1621 cctcccatgt ctgggagcat cacccccgtg tttcaccagg agatgctgaa ctacgtcctg 1681 tcccctttct actactatca ggtagaggcc tggaaaaccc atgtctggca ggacgagaag 1741 cggagaccca agagaagaga gattccattg aaagtcttgg tcaaagctgt gctctttgcc 1801 tgtatgctga tgcgcaagac aatggcgtcc cgagtcagag tcaccatcct ctttgcgaca 1861 gagacaggaa aatcagaggc gctggcctgg gacctggggg ccttattcag ctgtgccttc 1921 aaccccaagg ttgtctgcat ggataagtac aggctgagct gcctggagga ggaacggctg 1981 ctgttggtgg tgaccagtac gtttggcaat ggagactgcc ctggcaatgg agagaaactg 2041 aagaaatcgc tcttcatgct gaaagagctc aacaacaaat tcaggtacgc tgtgtttggc 2101 ctcggctcca gcatgtaccc tcggttctgc gcctttgctc atgacattga tcagaagctg 2161 tcccacctgg gggcctctca gctcaccccg atgggagaag gggatgagct cagtgggcag 2221 gaggacgcct tccgcagctg ggccgtgcaa accttcaagg cagcctgtga gacgtttgat 2281 gtccgaggca aacagcacat tcagatcccc aagctctaca cctccaatgt gacctgggac 2341 ccgcaccact acaggctcgt gcaggactca cagcctttgg acctcagcaa agccctcagc 2401 agcatgcatg ccaagaacgt gttcaccatg aggctcaaat ctcggcagaa tctacaaagt 2461 ccgacatcca gccgtgccac catcctggtg gaactctcct gtgaggatgg ccaaggcctg 2521 aactacctgc cgggggagca ccttggggtt tgcccaggca accagccggc cctggtccaa 2581 ggtatcctgg agcgagtggt ggatggcccc acaccccacc agacagtgcg cctggaggcc 2641 ctggatgaga gtggcagcta ctgggtcagt gacaagaggc tgcccccctg ctcactcagc 2701 caggccctca cctacttcct ggacatcacc acacccccaa cccagctgct gctccaaaag 2761 ctggcccagg tggccacaga agagcctgag agacagaggc tggaggccct gtgccagccc 2821 tcagagtaca gcaagtggaa gttcaccaac agccccacat tcctggaggt gctagaggag 2881 ttcccgtccc tgcgggtgtc tgctggcttc ctgctttccc agctccccat tctgaagccc 2941 aggttctact ccatcagctc ctcccgggat cacacgccca cagagatcca cctgactgtg 3001 gccgtggtca cctaccacac ccgagatggc cagggtcccc tgcaccacgg cgtctgcagc 3061 acatggctca acagcctgaa gccccaagac ccagtgccct gctttgtgcg gaatgccagc 3121 ggcttccacc tccccgagga tccctcccat ccttgcatcc tcatcgggcc tggcacaggc 3181 atcgcgccct tccgcagttt ctggcagcaa cggctccatg actcccagca caagggagtg 3241 cggggaggcc gcatgacctt ggtgtttggg tgccgccgcc cagatgagga ccacatctac 3301 caggaggaga tgctggagat ggcccagaag ggggtgctgc atgcggtgca cacagcctat 3361 tcccgcctgc ctggcaagcc caaggtctat gttcaggaca tcctgcggca gcagctggcc 3421 agcgaggtgc tccgtgtgct ccacaaggag ccaggccacc tctatgtttg cggggatgtg 3481 cgcatggccc gggacgtggc ccacaccctg aagcagctgg tggctgccaa gctgaaattg 3541 aatgaggagc aggtcgagga ctatttcttt cagctcaaga gccagaagcg ctatcacgaa 3601 gatatctttg gtgctgtatt tccttacgag gcgaagaagg acagggtggc ggtgcagccc 3661 agcagcctgg agatgtcagc gctctgaggg cctacaggag gggttaaagc tgccggcaca 3721 gaacttaagg atggagccag ctctgcatta tctgaggtca cagggcctgg ggagatggag 3781 gaaagtgata tcccccagcc tcaagtctta tttcctcaac gttgctcccc atcaagccct 3841 ttacttgacc tcctaacaag tagcaccctg gattgatcgg agcctcctct ctcaaactgg 3901 ggcctccctg gtcccttgga gacaaaatct taaatgccag gcctggcaag tgggtgaaag 3961 atggaacttg ctgctgagtg caccacttca agtgaccacc aggaggtgct atcgcaccac 4021 tgtgtattta actgccttgt gtacagttat ttatgcctct gtatttaaaa aactaacacc 4081 cagtctgttc cccatggcca cttgggtctt ccctgtatga ttccttgatg gagatattta 4141 catgaattgc attttacttt aatc // LOCUS HSINPO5P 2640 bp RNA PRI 08-SEP-1994 DEFINITION H.sapiens mRNA for 43 kDa inositol polyphosphate 5-phosphatase. ACCESSION Z31695 NID g469143 KEYWORDS inositol polyphosphate 5-phosphatase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2640) AUTHORS Laxminarayan,K.M., Chan,B.K., Tetaz,T., Bird,P.I. and Mitchell,C.A. TITLE Characterization of a cDNA encoding the 43-kDa membrane-associated inositol-polyphosphate 5-phosphatase JOURNAL J. Biol. Chem. (1994) In press REFERENCE 2 (bases 1 to 2640) AUTHORS Mitchell,C.A. TITLE Direct Submission JOURNAL Submitted (29-MAR-1994) Mitchell C.A., Monash University, Medicine, Box Hill Hospital, Box Hill, Victoria, Australia, 3128 FEATURES Location/Qualifiers source 1..2640 /organism="Homo sapiens" /macronuclear /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="placental cDNA" CDS 102..1193 /codon_start=1 /product="43 kDa inositol polyphosphate 5-phosphatase" /db_xref="PID:g469144" /translation="MALHCQEFGGKNYEASMSHVDKFVKELLSSDAMKEYNRARVYLD ENYKSQEHFTALGSFYFLHESLKNIYQFDFKAKKYRKVAGKEIYSDTLESTPMLEKEK FRRLLPRVQMVKKRLHPDEVVIADCAFDLVNIHLFHDASNLVAWETSPSVYSGIRHKA LGYVLDRIIDQRFEKVSYFVFGDFNFRLDSKSVVETLSAKPPMQTVRAADTNEVVKLI FRESDNDRKVMLQLEKKLFDYFNQEVFRDNNGTALLEFDKELSVFKDRLYELDISFPP SYPYSEDARQGEQYMNTRCPAWCDRILMSPSAKELVLRSESEEKVVTYDHIGPNVCMG DHKPVFLAFRIMPGAGKPHAHVHKCCVVQ" BASE COUNT 665 a 647 c 656 g 672 t ORIGIN 1 gggcggccaa cgtgggctcg ctcttcgacg acccagaaaa cctgcagaag aactggcttc 61 gggaatttta ccaggtcgtg cacacacaca agccgcactt catggccttg cactgtcagg 121 agtttggagg gaagaactac gaggcctcca tgtcccacgt ggacaagttc gtcaaagaac 181 tattgtcgag tgatgcgatg aaagaatata acagggctcg agtctacctg gatgaaaact 241 acaaatccca ggagcacttc acggcactag gaagctttta ttttcttcat gagtccttaa 301 aaaacatcta ccagtttgac tttaaagcta agaagtatag aaaggtcgct ggcaaagaga 361 tctactcgga taccttagag agcacgccca tgctggagaa ggagaagttt cgcagactac 421 ttccccgagt gcaaatggtc aagaaaaggc ttcatccgga cgaggtggtg attgcagact 481 gtgcctttga cttggtgaat atccatcttt tccatgatgc ttccaatctg gtcgcctggg 541 aaacaagccc ttccgtgtac tcgggaatcc ggcacaaggc actgggctac gtgctggaca 601 gaatcattga tcagcgattc gagaaggttt cctactttgt atttggtgat ttcaacttcc 661 ggctggattc caagtctgtc gtggagacgc tctcagcaaa accaccgatg cagacggtcc 721 gggccgccga caccaatgaa gtggtgaagc tcatatttcg tgagtcggac aacgaccgga 781 aggttatgct ccagttagaa aagaaactct tcgactactt caaccaggag gttttccgag 841 acaacaacgg caccgcgctc ttggagtttg acaaggagtt gtctgtcttt aaggacagac 901 tgtatgaact ggacatctcg ttccctccca gctacccgta cagtgaggac gcccgccagg 961 gtgagcagta catgaacacc cggtgcccag cctggtgtga ccgcatcctc atgtccccgt 1021 ctgccaagga gctggtgctg cggtcggaga gcgaggagaa ggttgtcacc tatgaccaca 1081 ttgggcccaa cgtctgcatg ggagaccaca agcccgtgtt cctggccttc cgaatcatgc 1141 ccggggcagg taaacctcat gcccatgtgc acaagtgttg tgtcgtgcag tgacgtggtg 1201 ggaagagatg ccagcgccac gagaggacac ttcgtgagcc tccctgtagc cgtggaccga 1261 atacgcactc ttgaaagctg catcgagaac ccgcccaagc gccacctgct agacggccag 1321 ccccacactt cgcttcagcc tccggaccat tccggagcag ccccacatac ctcactgtct 1381 cgtctgtcta tgtgacatta agtagaaata ttggtttttt ttttttttta aataagtcac 1441 agtcctgttg tcaaaactct aatagacagc aaagagggtc tgtaccgtag acttcacagt 1501 tttcagtttt taatgattgc cagtggaggg gcttcttcag cacagagacc ccccactgtg 1561 tccagggacc ccctctgcca ggtggaggtg tgtccagggg ctggggaagc cgagacgggc 1621 actccctctg ccggccggca gcgtggccct gagcatggca agggggtctg tctctgccga 1681 tgctccttcc gcggcactga ctctgcgccg tgtcacatgg tttttgaatc acactgcagc 1741 tgctttccat ttttatatat atataaatat atataaatat atacttttta aaaataattt 1801 ataaatctta ccaaaactta tgctaaatat actttccagt atgaacgcac aggagagtcc 1861 catcagcagg cggcattgga gtctaggagc tcagctgtgt gtccatcaac acacaaattc 1921 gtaaaaaaca cacatggcct cgccatcgtg ggtaaaatcg gccccacagc acgtctgcac 1981 cagcgggccg ttactcccat gccgttcttc tgtgtaatat taagaactga atgtgaagtt 2041 tatagctagc ctgggtgtac cttttaagaa ttttgtaaac cgtttgtctg tcttttgtta 2101 ctgttttatg gtgccaagta tcctacgtta caacaataat atcatgggag aaatagaaat 2161 agcctagttt gcttccaata gaaactgctt ttaacatggg ctgtatataa aaatattaaa 2221 gagaaacaaa actgtacatt tcctcattgc tccgctacag acaacccatg tcataacctt 2281 gttgcaaata tttttctcct atagcagtaa gtacagcatt agaaggtgat tagagagtct 2341 gttgatgaaa cacaaatgta tgtttttatt gatttttact ttagaacact acagagttcc 2401 tgggaccggg gtgaaggcat tagctgggtg tttgtgtggg ataaatacta ccactgcaag 2461 tgactgctgt ccgctgcgga atctgttctt ggtggaagca caggtccgtg tcgctgctgt 2521 ggttgccgct gtccgcggtt caacacggag tccgccccgc gggtttcagc tgttggtcgt 2581 tctgaggggc ctttggaagt gaccggtctg gttcctaagc aataaaattg accgtggtga // LOCUS HSINSP4BP 2837 bp RNA PRI 30-NOV-1997 DEFINITION Homo sapiens mRNA for Ins(1,3,4,5)P4-binding protein. ACCESSION X89399 NID g2653401 KEYWORDS GAP1 IP4BP gene; Ins P4-binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2837) AUTHORS Cullen,P.J., Hsuan,J.J., Truong,O., Letcher,A.J., Jackson,T.R., Dawson,A.P. and Irvine,R.F. TITLE Identification of a specific Ins(1,3,4,5)P4-binding protein as a member of the GAP1 family JOURNAL Nature 376 (6540), 527-530 (1995) MEDLINE 95364929 REFERENCE 2 (bases 1 to 2837) AUTHORS Cullen,P.J. TITLE Direct Submission JOURNAL Submitted (03-JUL-1995) P.J. Cullen, The Babraham Institute, Babraham Hall, Cambridge CB2 4AT, UK REMARK Revised by [3] REFERENCE 3 (bases 1 to 2837) AUTHORS Cullen,P.J. TITLE Direct Submission JOURNAL Submitted (26-NOV-1997) P.J. Cullen, Department of Biochemistry, School of Medical Sciences, University of Bristol, Bristol. BS8 4TD, UK FEATURES Location/Qualifiers source 1..2837 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="circulating blood" /clone_lib="lambda ZAP" gene 47..2551 /gene="GAP1 IP4BP" CDS 47..2551 /gene="GAP1 IP4BP" /codon_start=1 /product="Ins P4-binding protein" /db_xref="PID:e1188702" /db_xref="PID:g2653402" /translation="MAVEDEGLRVFQSVKIKIGEAKNLPSYPGPSKMRDCYCTVNLDQ EEVFRTKIVEKSLCPFYGEDFYCEIPRSFRHLSFYIFDRDVFRRDSIIGKVAIQKEDL QKYHNRDTWFQLQHVDADSEVQGKVHLELRLSEVITDTGVVCHKLATRIVECQGLPIV NGQCDPYATVTLAGPFRSEAKKTKVKRKTNNPQFDEVFYFEVTRPCSYSKKSHFDFEE EDVDKLEIRVDLWNASNLKFGDEFLGELRIPLKVLRQSSSYEAWYFLQPRDNGSKSLK PDDLGSLRLNVVYTEDHVFSSDYYSPLRDLLLKSADVEPVSASAAHILGEVCREKQEA AVPLVRLFLHYGRVVPFISAIASAEVKRTQDPNTIFRGNSLASKCIDETMKLAGMHYL HVTLKPAIEEICQSHKPCEIDPVKLKDGENLENNMENLRQYVDRVFHAITESGVSCPT VMCDIFFSLREAAAKRFQDDPDVRYTAVSSFIFLRFFAPAILSPNLFQLTPHHTDPQT SRTLTLISKTVQTLGSLSKSKSASFKESYMATFYEFFNEQKYADAVKNFLDLISSSGR RDPKSVEQPIVLKEGFMIKRAQGRKRFGMKNFKKRWFRLTNHEFTYHKSKGDQPLYSI PIENILAVEKLEEESFKMKNMFQVIQPERALYIQANNCVEAKDWIDILTKVSQCNQKR LTVYHPSAYLSGHWLCCRAPSDSAPGCSPCTGGLPANIQLDIDGDRETERIYSLFNLY MSKLEKMQEACGSKSVYDGPEQEEYSTFVIDDPQETYKTLKQVIRWVGALEQEHAQYK RDKFKKTKYGSQEHPIGDKSFQNYIRQQSETSTHSI" BASE COUNT 719 a 751 c 787 g 580 t ORIGIN 1 ctcggcgcgc gcttggggcg aggctcggcg ggcgcggacg cgcagcatgg cggtggagga 61 cgaggggctc cgggtcttcc agagcgtgaa gatcaagatc ggtgaagcca aaaaccttcc 121 ctcttacccg gggccgagca agatgaggga ttgctactgc acggtgaacc tggaccagga 181 ggaggttttc aggaccaaaa ttgtggaaaa gtcactctgc ccgttttacg gagaagactt 241 ttactgtgaa attcctcgga gctttcgtca cctgtccttc tacattttcg atagagacgt 301 tttccggagg gattccatca tagggaaggt ggccatccag aaggaggact tgcagaagta 361 ccacaacagg gacacctggt tccagctgca gcacgtggac gctgactcgg aagtgcaggg 421 caaagtgcac ctggagctgc ggctgagcga ggtcatcaca gacactgggg tcgtctgcca 481 caagctcgcc acacgcatcg tcgagtgcca gggcctcccc atcgtgaatg ggcaatgtga 541 cccctacgcc accgtgacgc tggcaggacc cttcagatca gaagcaaaga agacgaaagt 601 gaagaggaag accaacaatc cccagttcga tgaagtgttt tattttgagg tgacccggcc 661 ctgtagctac agcaagaagt cccactttga ctttgaggag gaagacgtgg acaagctcga 721 aatcagagtt gacctctgga atgccagtaa cctgaagttt ggagatgaat tcctgggaga 781 actaaggatc ccgttgaaag tcctgcggca gtccagctcc tacgaggcgt ggtacttcct 841 ccagccccgg gacaatggta gcaagagcct aaagccagac gacctgggct ccctgcggct 901 gaacgtggta tacacggaag accacgtgtt ttcttctgac tattacagcc ctctgcggga 961 cctgctgttg aagtctgcgg atgtggagcc cgtgtcagcg tctgcggccc acatcctggg 1021 cgaggtttgc cgggagaagc aggaggcggc cgtcccgctg gtgcggctct tcctacacta 1081 tggcagggtg gtgccattca tcagtgccat cgccagcgcg gaggtgaagc ggacccagga 1141 ccccaacacc atcttccgag gaaactcact ggcgtccaag tgcattgacg agaccatgaa 1201 gctggcgggg atgcattacc tgcatgtcac cctgaagccc gccatcgagg agatatgcca 1261 gagccacaaa ccctgtgaaa tcgaccctgt gaagttgaaa gacggagaaa accttgaaaa 1321 caacatggag aacctacggc agtatgtgga ccgcgtcttc cacgccatca ccgagtctgg 1381 ggtgagctgc ccgaccgtca tgtgtgacat cttcttctcc ctccgggagg cggcggccaa 1441 gcgcttccag gatgacccgg acgtcaggta cactgcagtg agcagcttca tcttcctgag 1501 gttctttgcg cccgccattc tctcccccaa cctcttccag ctcacgccgc accacacgga 1561 cccccagacg tccaggacgc tgacattgat ctccaagacc gttcagaccc tcggcagcct 1621 gtccaagtcc aaatctgcga gttttaagga gtcctacatg gctacatttt atgaattctt 1681 caatgagcag aaatatgctg atgcggtgaa gaacttcttg gatctgattt cgtcctcggg 1741 gagaagagac cccaagagtg ttgagcagcc catcgtgctt aaagaagggt tcatgatcaa 1801 gagggcccaa ggacggaagc gctttgggat gaagaatttt aagaagagat ggtttcgttt 1861 gaccaaccat gaatttacct accacaaaag caaaggggac cagcctctct acagcattcc 1921 catcgagaac atcctggcag tggagaagct ggaggaggag tctttcaaaa tgaaaaacat 1981 gttccaggtc atccagccag agcgtgcgct gtacatccag gccaacaact gcgtggaggc 2041 caaggactgg atcgacattc tcaccaaagt gagccagtgc aaccagaagc gcctcaccgt 2101 ctaccacccg tccgcctacc tgagcggcca ctggctgtgc tgtagggcgc catccgactc 2161 ggctccgggc tgctcgccct gcactggcgg cctcccagcc aacatccagc tggacattga 2221 tggggaccgt gagacggagc gtatctactc cctcttcaac ttgtacatga gcaagctgga 2281 gaagatgcag gaggcctgtg ggagcaaatc tgtgtatgac ggcccggagc aggaggagta 2341 ttcgacgttc gtcattgacg acccccagga gacctacaag acgctaaagc aagtcatccg 2401 ctgggttggg gctttggagc aggagcacgc ccagtataag agggacaagt tcaagaagac 2461 gaaatatgga agccaggagc accccatcgg agacaagagc ttccagaact acatccggca 2521 gcagtccgag acctccactc attccattta aagtctgcgg gacgcgcccg cggccgcttc 2581 cctttagtga gggttaatgc ttcgagcaga catgataaga tacattgatg agtttggaca 2641 aaccacaact agaatgcagt gaaaaaaatg ctttatttgt gaaatttgtg atgctattgc 2701 tttatttgta accattataa gctgcaataa acaagttaac aacaacaatt gcattcattt 2761 tatgtttcag gttcaagggg agatgtggga ggtttttaaa gcaaagtaaa acaaagtaaa 2821 acctctacaa atgtggt // LOCUS HSINTA6R 5629 bp RNA PRI 24-JUL-1997 DEFINITION Human mRNA for integrin alpha 6. ACCESSION X53586 NID g33943 KEYWORDS integrin; integrin alpha 6 subunit; laminin receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5629) AUTHORS Quaranta,V. TITLE Direct Submission JOURNAL Submitted (19-JUN-1990) Quaranta V., Scripps Clinnic and Research Foundation, Research Institute of Scripps Clinic, Department of Immunnology, IMM-8, 10666 North Torrey Pines Road, La Jolla, CA 92037, USA REFERENCE 2 (bases 1 to 5629) AUTHORS Tamura,R.N., Rozzo,C., Starr,L., Chambers,J., Reichardt,L.F., Cooper,H.M. and Quaranta,V. TITLE Epithelial integrin alpha 6 beta 4: complete primary structure of alpha 6 and variant forms of beta 4 JOURNAL J. Cell Biol. 111 (4), 1593-1604 (1990) MEDLINE 91009492 COMMENT Subunit structure alpha(6)beta(4) = alpha(E)beta(4) = TSP180 Subunit structure alpha(6)beta(1) = VLA-6. FEATURES Location/Qualifiers source 1..5629 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="pancreas" /cell_type="carcinoma" /cell_line="FG" /clone_lib="ptZB FG-2/3, ptZB FG-4" /clone="26.44, 26.1-7" mRNA <147..>5495 sig_peptide 147..215 /note="integrin alpha 6 (or alpha E) protein signal" CDS 147..3368 /codon_start=1 /product="integrin alpha 6 (or alpha E) protein" /db_xref="PID:g33944" /db_xref="SWISS-PROT:P23229" /translation="MAAAGQLCLLYLSAGLLSRLGAAFNLDTREDNVIRKYGDPGSLF GFSLAMHWQLQPEDKRLLLVGAPRGEALPLQRANRTGGLYSCDITARGPCTRIEFDND ADPTSESKEDQWMGVTVQSQGPGGKVVTCAHRYEKRQHVNTKQESRDIFGRCYVLSQN LRIEDDMDGGDWSFCDGRLRGHEKFGSCQQGVAATFTKDFHYIVFGAPGTYNWKGIVR VEQKNNTFFDMNIFEDGPYEVGGETEHDESLVPVPANSYLGFSLDSGKGIVSKDEITF VSGAPRANHSGAVVLLKRDMKSAHLLPEHIFDGEGLASSFGYDVAVVDLNKDGWQDIV IGAPQYFDRDGEVGGAVYVYMNQQGRWNNVKPIRLNGTKDSMFGIAVKNIGDINQDGY PDIAVGAPYDDLGKVFIYHGSANGINTKPTQVLKGISPYFGYSIAGNMDLDRNSYPDV AVGSLSDSVTIFRSRPVINIQKTITVTPNRIDLRQKTACGAPSGICLQVKSCFEYTAN PAGYNPSISIVGTLEAEKERRKSGLSSRVQFRNQGSEPKYTQELTLKRQKQKVCMEET LWLQDNIRDKLRPIPITASVEIQEPSSRRRVNSLPEVLPILNSDEPKTAHIDVHFLKE GCGDDNVCNSNLKLEYKFCTREGNQDKFSYLPIQKGVPELVLKDQKDIALEITVTNSP SNPRNPTKDGDDAHEAKLIATFPDTLTYSAYRELRAFPEKQLSCVANQNGSQADCELG NPFKRNSNVTFYLVLSTTEVTFDTPYLDINLKLETTSNQDNLAPITAKAKVVIELLLS VSGVAKPSQVYFGGTVVGEQAMKSEDEVGSLIEYEFRVINLGKPLTNLGTATLNIQWP KEISNGKWLLYLVKVESKGLEKVTCEPQKEINSLNLTESHNSRKKREITEKQIDDNRK FSLFAERKYQTLNCSVNVNCVNIRCPLRGLDSKASLILRSRLWNSTFLEEYSKLNYLD ILMRAFIDVTAAAENIRLPNAGTQVRVTVFPSKTVAQYSGVPWWIILVAILAGILMLA LLVFILWKCGFFKRNKKDHYDATYHKAEIHAQPSDKERLTSDA" mat_peptide 216..3365 /product="integrin alpha 6 (or alpha E) protein" misc_feature 216..3179 /note="extracellular domain" old_sequence 378..379 /citation=[1] /replace="tt" old_sequence 1113 /citation=[1] /replace="a" misc_feature 3180..3257 /note="transmembane domain" misc_feature 3258..3367 /note="cytoplasmic domain" polyA_signal 5489..5495 /note="putative" BASE COUNT 1666 a 1097 c 1284 g 1582 t ORIGIN 1 gcgcgaccgt cccgggggtg gggccgggcg cagcggcgag aggaggcgaa ggtggctgcg 61 gtagcagcag cgcggcagcc tcggacccag cccggagcgc agggcggccg ctgcaggtcc 121 ccgctcccct ccccgtgcgt ccgcccatgg ccgccgccgg gcagctgtgc ttgctctacc 181 tgtcggcggg gctcctgtcc cggctcggcg cagccttcaa cttggacact cgggaggaca 241 acgtgatccg gaaatatgga gaccccggga gcctcttcgg cttctcgctg gccatgcact 301 ggcaactgca gcccgaggac aagcggctgt tgctcgtggg ggccccgcgc ggagaagcgc 361 ttccactgca gagagccaac agaacgggag ggctgtacag ctgcgacatc accgcccggg 421 ggccatgcac gcggatcgag tttgataacg atgctgaccc cacgtcagaa agcaaggaag 481 atcagtggat gggggtcacc gtccagagcc aaggtccagg gggcaaggtc gtgacatgtg 541 ctcaccgata tgaaaaaagg cagcatgtta atacgaagca ggaatcccga gacatctttg 601 ggcggtgtta tgtcctgagt cagaatctca ggattgaaga cgatatggat gggggagatt 661 ggagcttttg tgatgggcga ttgagaggcc atgagaaatt tggctcttgc cagcaaggtg 721 tagcagctac ttttactaaa gactttcatt acattgtatt tggagccccg ggtacttata 781 actggaaagg gattgttcgt gtagagcaaa agaataacac tttttttgac atgaacatct 841 ttgaagatgg gccttatgaa gttggtggag agactgagca tgatgaaagt ctcgttcctg 901 ttcctgctaa cagttactta ggtttttctt tggactcagg gaaaggtatt gtttctaaag 961 atgagatcac ttttgtatct ggtgctccca gagccaatca cagtggagcc gtggttttgc 1021 tgaagagaga catgaagtct gcacatctcc tccctgagca catattcgat ggagaaggtc 1081 tggcctcttc atttggctat gatgtggcgg tggtggacct caacaaggat gggtggcaag 1141 atatagttat tggagcccca cagtattttg atagagatgg agaagttgga ggtgcagtgt 1201 atgtctacat gaaccagcaa ggcagatgga ataatgtgaa gccaattcgt cttaatggaa 1261 ccaaagattc tatgtttggc attgcagtaa aaaatattgg agatattaat caagatggct 1321 acccagatat tgcagttgga gctccgtatg atgacttggg aaaggttttt atctatcatg 1381 gatctgcaaa tggaataaat accaaaccaa cacaggttct caagggtata tcaccttatt 1441 ttggatattc aattgctgga aacatggacc ttgatcgaaa ttcctaccct gatgttgctg 1501 ttggttccct ctcagattca gtaactattt tcagatcccg gcctgtgatt aatattcaga 1561 aaaccatcac agtaactcct aacagaattg acctccgcca gaaaacagcg tgtggggcgc 1621 ctagtgggat atgcctccag gttaaatcct gttttgaata tactgctaac cccgctggtt 1681 ataatccttc aatatcaatt gtgggcacac ttgaagctga aaaagaaaga agaaaatctg 1741 ggctatcctc aagagttcag tttcgaaacc aaggttctga gcccaaatat actcaagaac 1801 taactctgaa gaggcagaaa cagaaagtgt gcatggagga aaccctgtgg ctacaggata 1861 atatcagaga taaactgcgt cccattccca taactgcctc agtggagatc caagagccaa 1921 gctctcgtag gcgagtgaat tcacttccag aagttcttcc aattctgaat tcagatgaac 1981 ccaagacagc tcatattgat gttcacttct taaaagaggg atgtggagac gacaatgtat 2041 gtaacagcaa ccttaaacta gaatataaat tttgcacccg agaaggaaat caagacaaat 2101 tttcttattt accaattcaa aaaggtgtac cagaactagt tctaaaagat cagaaggata 2161 ttgctttaga aataacagtg acaaacagcc cttccaaccc aaggaatccc acaaaagatg 2221 gcgatgacgc ccatgaggct aaactgattg caacgtttcc agacacttta acctattctg 2281 catatagaga actgagggct ttccctgaga aacagttgag ttgtgttgcc aaccagaatg 2341 gctcgcaagc tgactgtgag ctcggaaatc cttttaaaag aaattcaaat gtcacttttt 2401 atttggtttt aagtacaact gaagtcacct ttgacacccc atatctggat attaatctga 2461 agttagaaac aacaagcaat caagataatt tggctccaat tacagctaaa gcaaaagtgg 2521 ttattgaact gcttttatcg gtctcgggag ttgctaaacc ttcccaggtg tattttggag 2581 gtacagttgt tggcgagcaa gctatgaaat ctgaagatga agtgggaagt ttaatagagt 2641 atgaattcag ggtaataaac ttaggtaaac ctcttacaaa cctcggcaca gcaaccttga 2701 acattcagtg gccaaaagaa attagcaatg ggaaatggtt gctttatttg gtgaaagtag 2761 aatccaaagg attggaaaag gtaacttgtg agccacaaaa ggagataaac tccctgaacc 2821 taacggagtc tcacaactca agaaagaaac gggaaattac tgaaaaacag atagatgata 2881 acagaaaatt ttctttattt gctgaaagaa aataccagac tcttaactgt agcgtgaacg 2941 tgaactgtgt gaacatcaga tgcccgctgc gggggctgga cagcaaggcg tctcttattt 3001 tgcgctcgag gttatggaac agcacatttc tagaggaata ttccaaactg aactacttgg 3061 acattctcat gcgagccttc attgatgtga ctgctgctgc cgaaaatatc aggctgccaa 3121 atgcaggcac tcaggttcga gtgactgtgt ttccctcaaa gactgtagct cagtattcgg 3181 gagtaccttg gtggatcatc ctagtggcta ttctcgctgg gatcttgatg cttgctttat 3241 tagtgtttat actatggaag tgtggtttct tcaagagaaa taagaaagat cattatgatg 3301 ccacatatca caaggctgag atccatgctc agccatctga taaagagagg cttacttctg 3361 atgcatagta ttgatctact tctgtaattg tgtggattct ttaaacgctc taggtacgat 3421 gacagtgttc cccgatacca tgctgtaagg atccggaaag aagagcgaga gatcaaagat 3481 gaaaagtata ttgataacct tgaaaaaaaa cagtggatca caaagtggaa cagaaatgaa 3541 agctactcat agcgggggcc taaaaaaaaa aaagcttcac agtacccaaa ctgctttttc 3601 caactcagaa attcaatttg gatttaaaag cctgctcaat ccctgaggac tgatttcaga 3661 gtgactacac acagtacgaa cctacagttt taactgtgga tattgttacg tagcctaagg 3721 ctcctgtttt gcacagccaa atttaaaact gttggaatgg atttttcttt aactgccgta 3781 atttaacttt ctgggttgcc tttgtttttg gcgtggctga cttacatcat gtgttgggga 3841 agggcctgcc cagttgcact caggtgacat cctccagata gtgtagctga ggaggcacct 3901 acactcacct gcactaacag agtggccgtc ctaacctcgg gcctgctgcg cagacgtcca 3961 tcacgttagc tgtcccacat cacaagacta tgccattggg gtagttgtgt ttcaacggaa 4021 agtgctgtct taaactaaat gtgcaataga aggtgatgtt gccatcctac cgtcttttcc 4081 tgtttcctag ctgtgtgaat acctgctcac gtcaaatgca tacaagtttc attctccctt 4141 tcactaaaaa cacacaggtg caacagactt gaatgctagt tatacttatt tgtatatggt 4201 atttattttt tcttttcttt acaaaccatt ttgttattga ctaacaggcc aaagagtctc 4261 cagtttaccc ttcaggttgg tttaatcaat cagaattaga attagagcat gggagggtca 4321 tcactatgac ctaaattatt tactgcaaaa agaaaatctt tataaatgta ccagagagag 4381 ttgttttaat aacttatcta taaactataa cctctccttc atgacagcct ccaccccaca 4441 acccaaaagg tttaagaaat agaattataa ctgtaaagat gtttatttca ggcattggat 4501 attttttact ttagaagcct gcataatgtt tctggattta catactgtaa cattcaggaa 4561 ttcttggaga agatgggttt attcactgaa ctctagtgcg gtttactcac tgctgcaaat 4621 actgtatatt caggacttga aagaaatggt gaatgcctat ggaactagtg gatccaaact 4681 gatccagtat aagactactg aatctgctac caaaacagtt aatcagtgag tcgagtgttc 4741 tattttttgt tttgtttcct cccctatctg tattcccaaa aattactttg gggctaattt 4801 aacaagaact ttaaattgtg ttttaattgt aaaaatggca gggggtggaa ttattactct 4861 atacattcaa cagagactga atagatatga aagctgattt tttttaatta ccatgcttca 4921 caatgttaag ttatatgggg agcaacagca aacaggtgct aatttgtttt ggatatagta 4981 taagcagtgt ctgtgttttg aaagaataga acacagtttg tagtgccact gttgttttgg 5041 ggggggcttt ttttcttttt ccggaaaatc cttaaacctt aagatactaa ggacgttgtt 5101 ttggttgtac ttggaattct tagtcacaaa atatattttg tttacaaaaa tttctgtaaa 5161 acaggttata acagtgttta aagtctcagt ttcttgcttg gggaacttgt gtccctaatg 5221 tgttagattg ctagattgct aaggagctga tacttgacag ttttttagac ctgtgttact 5281 aaaaaaaaga tgaatgtcgg aaaagggtgt tgggagggtg gtcaacaaag aaacaaagat 5341 gttatggtgt ttagacttat ggttgttaaa aatgtcatct caagtcaagt cactggtctg 5401 tttgcatttg atacattttt gtactaacta gcattgtaaa attatttcat gattagaaat 5461 tacctgtgga tatttgtata aaagtgtgaa ataaattttt tataaaagtg ttcattgttt 5521 cgtaacacag cattgtatat gtgaagcaaa ctctaaaatt ataaatgaca acctgaatta 5581 tctatttcat caaaaaaaaa aaaaaaaaaa actttatggg cacaactgg // LOCUS HSINTAL4 3805 bp RNA PRI 02-AUG-1995 DEFINITION Human mRNA for integrin alpha-4 subunit. ACCESSION X16983 X15356 NID g33945 KEYWORDS cell adhesion molecule; cell adhesion receptor; integrin; transmembrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3805) AUTHORS Hemler,M.E. TITLE Direct Submission JOURNAL Submitted (17-NOV-1989) Hemler M.E., Dana-Farber Cancer Institute, Mayer 613, 44 Binney Street, Boston, MA 02115 REFERENCE 2 (bases 1 to 3805) AUTHORS Takada,Y., Elices,M.J., Crouse,C. and Hemler,M.E. TITLE The primary structure of the alpha 4 subunit of VLA-4: homology to other integrins and a possible cell-cell adhesion function JOURNAL EMBO J. 8 (5), 1361-1368 (1989) MEDLINE 89356603 FEATURES Location/Qualifiers source 1..3805 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T lymphocyte" /cell_line="HPB-MLT" /clone_lib="HPB-MLT lambda gt10" /clone="4.10, 4.39, 4.37, 4.43" CDS 25..3141 /codon_start=1 /product="integrin alpha-4 subunit preprotein" /db_xref="PID:g33946" /db_xref="SWISS-PROT:P13612" /translation="MFPTESAWLGKRGANPGPEAAVRETVMLLLCLGVPTGRPYNVDT ESALLYQGPHNTLFGYSVVLHSHGANRWLLVGAPTANWLANASVINPGAIYRCRIGKN PGQTCEQLQLGSPNGEPCGKTCLEERDNQWLGVTLSRQPGENGSIVTCGHRWKNIFYI KNENKLPTGGCYGVPPDLRTELSKRIAPCYQDYVKKFGENFASCQAGISSFYTKDLIV MGAPGSSYWTGSLFVYNITTNKYKAFLDKQNQVKFGSYLGYSVGAGHFRSQHTTEVVG GAPQHEQIGKAYIFSIDEKELNILHEMKGKKLGSYFGASVCAVDLNADGFSDLLVGAP MQSTIREEGRVFVYINSGSGAVMNAMETNLVGSDKYAARFGESIVNLGDIDNDGFEDV AIGAPQEDDLQGAIYIYNGRADGISSTFSQRIEGLQISKSLSMFGQSISGQIDADNNG YVDVAVGAFRSDSAVLLRTRPVVIVDASLSHPESVNRTKFDCVENGWPSVCIDLTLCF SYKGKEVPGYIVLFYNMSLDVNRKAESPPRFYFSSNGTSDVITGSIQVSSREANCRTH QAFMRKDVRDILTPIQIEAAYHLGPHVISKRSTEEFPPLQPILQQKKEKDIMKKTINF ARFCAHENCSADLQVSAKIGFLKPHENKTYLAVGSMKTLMLNVSLFNAGDDAYETTLH VKLPVGLYFIKILELEEKQINCEVTDNSGVVQLDCSIGYIYVDHLSRIDISFLLDVSS LSRAEEDLSITVHATCENEEEMDNLKHSRVTVAIPLKYEVKLTVHGFVNPTSFVYGSN DENEPETCMVEKMNLTFHVINTGNSMAPNVSVEIMVPNSFSPQTDKLFNILDVQTTTG ECHFENYQRVCALEQQKSAMQTLKGIVRFLSKTDKRLLYCIKADPHCLNFLCNFGKME SGKEASVHIQLEGRPSILEMDETSALKFEIRATGFPEPNPRVIELNKDENVAHVLLEG LHHQRPKRYFTIVIISSSLLLGLIVLLLISYVMWKAGFFKRQYKSILQEENRRDSWSY INSKSNDD" sig_peptide 25..141 /note="signal peptide (AA -39 to -1)" mat_peptide 142..3138 /note="integrin alpha-4 subunit (AA 1-999)" misc_feature 2974..3042 /note="put. transmembrane domain" BASE COUNT 1191 a 707 c 856 g 1051 t ORIGIN 1 gaattccggg ccgcttagtg ttgaatgttc cccaccgaga gcgcatggct tgggaagcga 61 ggcgcgaacc cgggccccga agccgccgtc cgggagacgg tgatgctgtt gctgtgcctg 121 ggggtcccga ccggccgccc ctacaacgtg gacactgaga gcgcgctgct ttaccagggc 181 ccccacaaca cgctgttcgg ctactcggtc gtgctgcaca gccacggggc gaaccgatgg 241 ctcctagtgg gtgcgcccac tgccaactgg ctcgccaacg cttcagtgat caatcccggg 301 gcgatttaca gatgcaggat cggaaagaat cccggccaga cgtgcgaaca gctccagctg 361 ggtagcccta atggagaacc ttgtggaaag acttgtttgg aagagagaga caatcagtgg 421 ttgggggtca cactttccag acagccagga gaaaatggat ccatcgtgac ttgtgggcat 481 agatggaaaa atatatttta cataaagaat gaaaataagc tccccactgg tggttgctat 541 ggagtgcccc ctgatttacg aacagaactg agtaaaagaa tagctccgtg ttatcaagat 601 tatgtgaaaa aatttggaga aaattttgca tcatgtcaag ctggaatatc cagtttttac 661 acaaaggatt taattgtgat gggggcccca ggatcatctt actggactgg ctctcttttt 721 gtctacaata taactacaaa taaatacaag gcttttttag acaaacaaaa tcaagtaaaa 781 tttggaagtt atttaggata ttcagtcgga gctggtcatt ttcggagcca gcatactacc 841 gaagtagtcg gaggagctcc tcaacatgag cagattggta aggcatatat attcagcatt 901 gatgaaaaag aactaaatat cttacatgaa atgaaaggta aaaagcttgg atcgtacttt 961 ggagcttctg tctgtgctgt ggacctcaat gcagatggct tctcagatct gctcgtggga 1021 gcacccatgc agagcaccat cagagaggaa ggaagagtgt ttgtgtacat caactctggc 1081 tcgggagcag taatgaatgc aatggaaaca aacctcgttg gaagtgacaa atatgctgca 1141 agatttgggg aatctatagt taatcttggc gacattgaca atgatggctt tgaagatgtt 1201 gctatcggag ctccacaaga agatgacttg caaggtgcta tttatattta caatggccgt 1261 gcagatggga tctcgtcaac cttctcacag agaattgaag gacttcagat cagcaaatcg 1321 ttaagtatgt ttggacagtc tatatcagga caaattgatg cagataataa tggctatgta 1381 gatgtagcag ttggtgcttt tcggtctgat tctgctgtct tgctaaggac aagacctgta 1441 gtaattgttg acgcttcttt aagccaccct gagtcagtaa atagaacgaa atttgactgt 1501 gttgaaaatg gatggccttc tgtgtgcata gatctaacac tttgtttctc atataagggc 1561 aaggaagttc caggttacat tgttttgttt tataacatga gtttggatgt gaacagaaag 1621 gcagagtctc caccaagatt ctatttctct tctaatggaa cttctgacgt gattacagga 1681 agcatacagg tgtccagcag agaagctaac tgtagaacac atcaagcatt tatgcggaaa 1741 gatgtgcggg acatcctcac cccaattcag attgaagctg cttaccacct tggtcctcat 1801 gtcatcagta aacgaagtac agaggaattc ccaccacttc agccaattct tcagcagaag 1861 aaagaaaaag acataatgaa aaaaacaata aactttgcaa ggttttgtgc ccatgaaaat 1921 tgttctgctg atttacaggt ttctgcaaag attgggtttt tgaagcccca tgaaaataaa 1981 acatatcttg ctgttgggag tatgaagaca ttgatgttga atgtgtcctt gtttaatgct 2041 ggagatgatg catatgaaac gactctacat gtcaaactac ccgtgggtct ttatttcatt 2101 aagattttag agctggaaga gaagcaaata aactgtgaag tcacagataa ctctggcgtg 2161 gtacaacttg actgcagtat tggctatata tatgtagatc atctctcaag gatagatatt 2221 agctttctcc tggatgtgag ctcactcagc agagcggaag aggacctcag tatcacagtg 2281 catgctacct gtgaaaatga agaggaaatg gacaatctaa agcacagcag agtgactgta 2341 gcaatacctt taaaatatga ggttaagctg actgttcatg ggtttgtaaa cccaacttca 2401 tttgtgtatg gatcaaatga tgaaaatgag cctgaaacgt gcatggtgga gaaaatgaac 2461 ttaactttcc atgttatcaa cactggcaat agtatggctc ccaatgttag tgtggaaata 2521 atggtaccaa attcttttag cccccaaact gataagctgt tcaacatttt ggatgtccag 2581 actactactg gagaatgcca ctttgaaaat tatcaaagag tgtgtgcatt agagcagcaa 2641 aagagtgcaa tgcagacctt gaaaggcata gtccggttct tgtccaagac tgataagagg 2701 ctattgtact gcataaaagc tgatccacat tgtttaaatt tcttgtgtaa ttttgggaaa 2761 atggaaagtg gaaaagaagc cagtgttcat atccaactgg aaggccggcc atccatttta 2821 gaaatggatg agacttcagc actcaagttt gaaataagag caacaggttt tccagagcca 2881 aatccaagag taattgaact aaacaaggat gagaatgttg cgcatgttct actggaagga 2941 ctacatcatc aaagacccaa acgttatttc accatagtga ttatttcaag tagcttgcta 3001 cttggactta ttgtacttct gttgatctca tatgttatgt ggaaggctgg cttctttaaa 3061 agacaataca aatctatcct acaagaagaa aacagaagag acagttggag ttatatcaac 3121 agtaaaagca atgatgatta aggacttctt tcaaattgag agaatggaaa acagactcag 3181 gttgtagtaa agaaatttaa aagacactgt ttacaagaaa aaatgaattt tgtttggact 3241 tcttttactc atgatcttgt gacatattat gtcttcatgc aaggggaaaa tctcagcaat 3301 gattactctt tgagatagaa gaactgcaaa ggtaataata cagccaaaga taatctctca 3361 gcttttaaat gggtagagaa acactaaagc attcaattta ttcaagaaaa gtaagccctt 3421 gaagatatct tgaaatgaaa gtataactga gttaaattat actggagaag tcttagactt 3481 gaaatactac ttaccatatg tgcttgcctc agtaaaatga accccactgg gtgggcagag 3541 gttcatttca aatacatctt tgatacttgt tcaaaatatg ttctttaaaa atataatttt 3601 ttagagagct gttcccaaat tttctaacga gtggaccatt atcactttaa agccctttat 3661 ttataataca tttcctacgg gctgtgttcc aacaaccatt ttttttcagc agactatgaa 3721 tattatagta ttataggcca aactggcaaa cttcagactg aacatgtaca ctggtttgag 3781 cttagtgaaa tgacttccgg aatct // LOCUS HSIRF 3498 bp RNA PRI 08-JAN-1993 DEFINITION H.sapiens mRNA for iron regulatory factor. ACCESSION Z11559 NID g33962 KEYWORDS iron regulatory factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3498) AUTHORS Hirling,H., Emery-Goodman,A., Thompson,N., Neupert,B., Seiser,C. and Kuhn,L.C. TITLE Expression of active iron regulatory factor from a full-length human cDNA by in vitro transcription/translation JOURNAL Nucleic Acids Res. 20 (1), 33-39 (1992) MEDLINE 92150156 REFERENCE 2 (bases 1 to 3498) AUTHORS Hirling,H. TITLE Direct Submission JOURNAL Submitted (19-DEC-1991) Hirling H., Swiss Institute for Experimental Cancer Research, Chemin des Boveresses 155, Epalinges sur Lausanne, Vaud, Switzerland COMMENT . FEATURES Location/Qualifiers source 1..3498 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 108..2777 /codon_start=1 /product="iron regulatory factor" /db_xref="PID:g33963" /translation="MSNPFAHLAEPLDPVQPGKKFFNLNKLEDSRYGRLPFSIRVLLE AAIRNCDEFLVKKQDIENILHWNVTQHKNIEVPFKPARVILQDFTGVPAVVDFAAMRD AVKKLGGDPEKINPVCPADLVIDHSIQVDFNRRADSLQKNQDLEFERNRERFEFLKWG SQAFHNMRIIPPGSGIIHQVNLEYLARVVFDQDGYYYPDSLVGTDSHTTMIDGLGILG WGVGGIEAEAVMLGQPISMVLPQVIGYRLMGKPHPLVTSTDIVLTITKHLRQVGVVGK FVEFFGPGVAQLSIADRATIANMCPEYGATAAFFPVDEVSITYLVQTGRDEEKLKYIK KYLQAVGMFRDFNDPSQDPDFTQVVELDLKTVVPCCSGPKRPQDKVAVSDMKKDFESC LGAKQGFKGFQVAPEHHNDHKTFIYDNTEFTLAHGSVVIAAITSCTNTSNPSVMLGAG LLAKKAVDAGLNVMPYIKTSLSPGSGVVTYYLQESGVMPYLSQLGFDVVGYGCMTCIG NSGPLPEPVVEAITQGDLVAVGVLSGNRNFEGRVHPNTRANYLASPPLVIAYAIAGTI RIDFEKEPLGVNAKGQQVFLKDIWPTRDEIQAVERQYVIPGMFKEVYQKIETVNESWN ALATPSDKLFFWNSKSTYIKSPPFFENLTLDLQPPKSIVDAYVLLNLGDSVTTDHISP AGNIARNSPAARYLTNRGLTPREFNSYGSRRGNDAVMARGTFANIRLLNRFLNKQAPQ TIHLPSGEILDVFDAAERYQQAGLPLIVLAGKEYGAGSSRDWAAKGPFLLGIKAVLAE SYERIHRSNLVGMGVIPLEYLPGENADALGLTGQERYTIIIPENLKPQMKVQVKLDTG KTFQAVMRFDTDVELTYFLNGGILNYMIRKMAK" BASE COUNT 918 a 799 c 856 g 925 t ORIGIN 1 gggctcgaac gcgcagcgca cgggaaccgg tcccgctgct tgggtcaggt tcgccggtcg 61 cgggagcccc gccgtgcagt cggaggaaca cgtggccatc agtaatcatg agcaacccat 121 tcgcacacct tgctgagcca ttggatcctg tacaaccagg aaagaaattc ttcaatttga 181 ataaattgga ggattcaaga tatgggcgct taccattttc gatcagagtt cttctggaag 241 cagccattcg gaattgtgat gagtttttgg tgaagaaaca ggatattgaa aatattctac 301 attggaatgt cactcagcac aagaacatag aagtgccatt taagcctgct cgtgtcatcc 361 tgcaggactt tacgggtgtg cccgctgtgg ttgactttgc tgcaatgcgt gatgctgtga 421 aaaagttagg aggagatcca gagaaaataa accctgtctg ccctgctgat cttgtaatag 481 atcattccat ccaggttgat ttcaacagaa gggcagacag tttacagaag aatcaagacc 541 tggaatttga aagaaataga gagcgatttg aatttttaaa gtggggttcc caggcttttc 601 acaacatgcg gattattccc cctggctcag gaatcatcca ccaggtgaat ttggaatatt 661 tggcaagagt ggtatttgat caggatggat attattaccc agacagcctc gtgggcacag 721 actcgcacac taccatgatt gatggcttgg gcattcttgg ttggggtgtc ggtggtattg 781 aagcagaagc tgtcatgctg ggtcagccaa tcagtatggt gcttcctcag gtgattggct 841 acaggctgat ggggaagccc caccctctgg taacatccac tgacatcgtg ctcaccatta 901 ccaagcacct ccgccaggtt ggggtagtgg gcaaatttgt cgagttcttc gggcctggag 961 tagcccagtt gtccattgct gaccgagcta cgattgctaa catgtgtcca gagtacggag 1021 caactgctgc ctttttccca gttgatgaag ttagtatcac gtacctggtg caaacaggtc 1081 gtgatgaaga aaaattaaag tatattaaaa aatatcttca ggctgtagga atgtttcgag 1141 atttcaatga cccttctcaa gacccagact tcacccaggt tgtggaatta gatttgaaaa 1201 cagtagtgcc ttgctgtagt ggacccaaaa ggcctcagga caaagttgct gtgtccgaca 1261 tgaaaaagga ctttgagagc tgccttggag ccaagcaagg atttaaagga ttccaagttg 1321 ctcctgaaca tcataatgac cataagacct ttatctatga taacactgaa ttcacccttg 1381 ctcatggttc tgtggtcatt gctgccatta ctagctgcac aaacaccagt aatccgtctg 1441 tgatgttagg ggcaggattg ttagcaaaga aagctgtgga tgctggcctg aacgtgatgc 1501 cttacatcaa aactagcctg tctcctggga gtggcgtggt cacctactac ctacaagaaa 1561 gcggagtcat gccttatctg tctcagcttg ggtttgacgt ggtgggctat ggctgcatga 1621 cctgcattgg caacagtggg cctttacctg aacctgtggt agaagccatc acacagggag 1681 accttgtagc tgttggagta ctatctggaa acaggaattt tgaaggtcga gttcacccca 1741 acacccgggc caactattta gcctctcccc ccttagtaat agcatatgca attgctggaa 1801 ccatcagaat cgactttgag aaagagccat tgggagtaaa tgcaaaggga cagcaggtat 1861 ttctgaaaga tatctggccg actagagacg agatccaggc agtggagcgt cagtatgtca 1921 tcccggggat gtttaaggaa gtctatcaga aaatagagac tgtgaatgaa agctggaatg 1981 ccttagcaac cccatcagat aagctgtttt tctggaattc caaatctacg tatatcaaat 2041 caccaccatt ctttgaaaac ctgactttgg atcttcagcc ccctaaatct atagtggatg 2101 cctatgtgct gctaaatttg ggagattcgg taacaactga ccacatctcc ccagctggaa 2161 atattgcaag aaacagtcct gctgctcgct acttaactaa cagaggccta actccacgag 2221 aattcaactc ctatggctcc cgccgaggta atgacgccgt catggcacgg ggaacatttg 2281 ccaacattcg cttgttaaac agatttttga acaagcaggc accacagact atccatctgc 2341 cttctgggga aatccttgat gtgtttgatg ctgctgagcg gtaccagcag gcaggccttc 2401 ccctgatcgt tctggctggc aaagagtacg gtgcaggcag ctcccgagac tgggcagcta 2461 agggcccttt cctgctggga atcaaagccg tcctggccga gagctacgag cgcattcacc 2521 gcagtaacct ggttgggatg ggtgtgatcc cacttgaata tctccctggt gagaatgcag 2581 atgccctggg gctcacaggg caagaacgat acactatcat tattccagaa aacctcaaac 2641 cacaaatgaa agtccaggtc aagctggata ctggcaagac cttccaggct gtcatgaggt 2701 ttgacactga tgtggagctc acttatttcc tcaacggggg catcctcaac tacatgatcc 2761 gcaagatggc caagtaggag acgtgcactt ggtcgtgcgc ccagggagga agccgcacca 2821 ccagccagcg caggccctgg tggagaggcc tccctggctg cctctgggag gggtgctgcc 2881 ttgtagatgg agcaagtgag cactgagggt ctggtgccaa tcctgtaggc acaaaaccag 2941 aagtttctac attctctatt tttgttaatc atcttctctt tttccagaat ttggaagcta 3001 gaatggtggg aatgtcagta gtgccagaaa gagagaacca agcttgtctt taaagttact 3061 gatcacagga cgttgctttt tcactgtttc ctattaatct tcagctgaac acaagcaaac 3121 cttctcagga ggtgtctcct accctcttat tgttcctctt acgctctgct caatgaaacc 3181 ttcctcttga gggtcatttt cctttctgta ttaattatac cagtgttaag tgacatagat 3241 aagaactttg cacacttcaa atcagagcag tgattctctc ttctctcccc ttttccttca 3301 gagtgaatca tccagactcc tcatggatag gtcgggtgtt aaagttgttt tgattatgta 3361 ccttttgata gatccacata aaaagaaatg tgaagttttc ttttactatc ttttcattta 3421 tcaagcagag acctttgttg ggaggcggtt tgggagaaca catttctaat ttgaatgaaa 3481 tgaaatctat tttcagtg // LOCUS HSIRF2 2144 bp RNA PRI 07-APR-1994 DEFINITION Human mRNA for interferon regulatory factor-2 (IRF-2). ACCESSION X15949 NID g33966 KEYWORDS interferon regulatory factor 2. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2144) AUTHORS Itoh,S. TITLE Direct Submission JOURNAL Submitted (25-JUL-1989) Itoh S., Dept of Mol. Biol., Institute of Cell. Biol., Osaka University, 1-3 Yamadaoka Suita-shi Osaka 565, Japan REFERENCE 2 (bases 1 to 2144) AUTHORS Itoh,S., Harada,H., Fujita,T., Mimura,T. and Taniguchi,T. TITLE Sequence of a cDNA coding for human IRF-2 JOURNAL Nucleic Acids Res. 17 (20), 8372 (1989) MEDLINE 90045964 COMMENT See for interferon regulatory factor-1 sequence. FEATURES Location/Qualifiers source 1..2144 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T cell" /cell_line="Jurkat III" /clone="pHIRF4S-51" CDS 99..1148 /note="interferon regulatory factor-2 (AA 1-349)" /codon_start=1 /db_xref="PID:g33967" /db_xref="SWISS-PROT:P14316" /translation="MPVERMRMRPWLEEQINSNTIPGLKWLNKEKKIFQIPWMHAARH GWDVEKDAPLFRNRAIHTGKHQPGVDKPDPKTWKANFRCAMNSLPDIEEVKDKSIKKG NNAFRVYRMLPLSERPSKKGKKPKTEKEDKVKHIKQEPVESSLGLSNGVSDLSPEYAV LTSTIKNEVDSTVNIIVVGQSHLDSNIENQEIVTNPPDICQVVEVTTESDEQPVSMSE LYPLQISPVSSYAESETTDSVPSDEESAEGRPHWRKRNIEGKQYLSNMGTRGSYLLPG MASFVTSNKPDLQVTIKEESNPVPYNSSWPPFQDLPLSSSMTPASSSSRPDRETRASV IKKTSDITQARVKSC" misc_feature 1975..1980 /note="pot.polyA signal" polyA_site 2144 /note="polyA site" BASE COUNT 607 a 523 c 460 g 554 t ORIGIN 1 aactgacggg ctttcatttc catttcacac accctagcaa cacttatacc ttgcggaatt 61 gtattggtag cgtgaaaaaa gcacactgag agggcaccat gccggtggaa aggatgcgca 121 tgcgcccgtg gctggaggag cagataaact ccaacacgat cccggggctc aagtggctta 181 acaaggaaaa gaagattttt cagatcccct ggatgcatgc ggctagacat gggtgggatg 241 tggaaaaaga tgcaccactc tttagaaacc gggcaatcca tacaggaaag catcaaccag 301 gagtagataa acctgatccc aaaacatgga aggcgaattt cagatgcgcc atgaattcct 361 tgcctgatat tgaagaagtc aaggataaaa gcataaagaa aggaaataat gccttcaggg 421 tctaccgaat gctgccccta tcagaacggc cttctaagaa aggaaagaaa ccaaagacag 481 aaaaagaaga caaagttaag cacatcaagc aagaaccagt tgagtcatct ctggggctta 541 gtaatggagt aagtgatctt tctcctgagt atgcggtcct gacttcaact ataaaaaatg 601 aagtggatag tacggtgaac atcatagttg taggacagtc ccatctggac agcaacattg 661 agaatcaaga gattgtcacc aatccgccag acatttgcca agttgtagag gtgaccactg 721 agagcgacga gcagccggtc agcatgagcg agctctaccc tctgcagatc tcccccgtgt 781 cttcctatgc agaaagcgaa acgactgata gtgtgcccag cgatgaagag agtgccgagg 841 ggcggccaca ctggcggaag aggaatattg aaggcaaaca gtacctcagc aacatgggga 901 ctcgaggctc ctacctgctg cccggcatgg cgtccttcgt cacttccaac aaaccggacc 961 tccaggtcac catcaaagag gagagcaatc cggtgcctta caacagctcc tggccccctt 1021 ttcaagacct ccccctttct tcctccatga ccccagcatc cagcagcagt cggccagacc 1081 gggagacccg ggccagcgtc atcaagaaaa catcggatat cacccaggcc cgcgtcaaga 1141 gctgttaagc ctctgactct ccgcggtggt tgttggggct tcttggcttt gttttgttgt 1201 ttgtttgtat tttatttttt tctctctgac acctatttta gacaaatcta agggaaaaag 1261 ccttgacaat agaacattga ttgctgtgtc caactccagt acctggagct tctctttaac 1321 tcaggactcc agcccattgg tagacgtgtg tttctagagc ctgctggatc tcccagggct 1381 actcactcaa gttcaaggac caacaagggc agtggaggtg ctgcattgcc tgcggtcaag 1441 gccagcaagg tggagtggat gcctcagaac ggacgagata atgtgaacta gctggaattt 1501 tttattcttg tgaatatgta cataggcagc actagcgaca ttgcagtctg cttctgcacc 1561 ttatcttaaa gcacttacag ataggccttc ttgtgatctt gctctatctc acagcacact 1621 cagcaccccc ttctctgccc attccccagc ctctcttcct atcccatccc atcccatccc 1681 atcccatccc atcccatccc gctcttttcc tacttttcct tccctcaaag cttccattcc 1741 acatccggag gagaagaagg aaatgaattt ctctacagat gtcccatttt cagactgctt 1801 taaaaaaaat ccttctaatc tgctatgctt gaatgccacg cggtacaaag gaaaaagtat 1861 catggaaata ttatgcaaat tcccagattt gaagacaaaa atactctaat tctaaccaga 1921 gcaagctttt ttatttttta tacaggggaa tattttattc aaggtaaaat tctaaataaa 1981 atataattgt tttttatctt ttctacagca aatttataat tttaagattc cttttcttgt 2041 ttatcagcag ttgttattac atccttgtgg cacatttttt tttaattttg taaaggtgaa 2101 aaaagctttt atgagctcat ctagcaatca gattttcctg tgga // LOCUS HSIRF3MR 1407 bp RNA PRI 29-MAR-1996 DEFINITION H.sapiens mRNA for interferon regulatory factor 3. ACCESSION Z56281 NID g1107688 KEYWORDS interferon regulatory factor 3; transcription factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1407) AUTHORS Au,W.C., Moore,P.A., Lowther,W., Juang,Y.T. and Pitha,P.M. TITLE Identification of a member of the interferon regulatory factor family that binds to the interferon-stimulated response element and activates expression of interferon-induced genes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (25), 11657-11661 (1995) MEDLINE 96102173 REFERENCE 2 (bases 1 to 1407) AUTHORS Moore,P. TITLE Direct Submission JOURNAL Submitted (17-OCT-1995) Moore P., University of Aberdeen, Molecular and Cell Biology, Marischal College, Aberdeen, Scotland, AB9 1AS FEATURES Location/Qualifiers source 1..1407 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="Retina" gene 47..1330 /gene="IRF3" CDS 47..1330 /gene="IRF3" /function="transcription factor" /codon_start=1 /product="interferon regulatory factor 3" /db_xref="PID:e205170" /db_xref="PID:g1107689" /translation="MGTPKPRILPWLVSQLDLGQLEGVAWVNKSRTRFRIPWKHGLRQ DAQQEDFGIFQAWAEATGAYVPGRDKPDLPTWKRNFRSALNRKEGLRLAEDRSKDPHD PHKIYEFVNSGVGDFSQPDTSPDTNGGGSTSDTQEDILDELLGNMVLAPLPDPGPPSL AVAPEPCPQPLRSPSLDNPTPFPNLGPSENPLKRLLVPGEEWEFEVTAFYRGRQVFQQ TISCPEGLRLVGSEVGDRTLPGWPVTLPDPGMSLTDRGVMSYVRHVLSCLGGGLALWR AGQWLWAQRLGHCHTYWAVSEELLPNSGHGPDGEVPKDKEGGVFDLGPFIVDLITFTE GSGRSPRYALWFCVGESWPQDQPWTKRLVMVKVVPTCLRALVEMARVGGASSLENTVD LHISNSHPLSLTSDQYKAYLQDLVEGMDFQGPGES" BASE COUNT 277 a 435 c 439 g 256 t ORIGIN 1 ggttccagct gcccgcacgc cccgaccttc catcgtaggc cggaccatgg gaaccccaaa 61 gccacggatc ctgccctggc tggtgtcgca gctggacctg gggcaactgg agggcgtggc 121 ctgggtgaac aagagccgca cgcgcttccg catcccttgg aagcacggcc tacggcagga 181 tgcacagcag gaggatttcg gaatcttcca ggcctgggcc gaggccactg gtgcatatgt 241 tcccgggagg gataagccag acctgccaac ctggaagagg aatttccgct ctgccctcaa 301 ccgcaaagaa gggttgcgtt tagcagagga ccggagcaag gaccctcacg acccacataa 361 aatctacgag tttgtgaact caggagttgg ggacttttcc cagccagaca cctctccgga 421 caccaatggt ggaggcagta cttctgatac ccaggaagac attctggatg agttactggg 481 taacatggtg ttggccccac tcccagatcc gggaccccca agcctggctg tagcccctga 541 gccctgccct cagcccctgc ggagccccag cttggacaat cccactccct tcccaaacct 601 ggggccctct gagaacccac tgaagcggct gttggtgccg ggggaagagt gggagttcga 661 ggtgacagcc ttctaccggg gccgccaagt cttccagcag accatctcct gcccggaggg 721 cctgcggctg gtggggtccg aagtgggaga caggacgctg cctggatggc cagtcacact 781 gccagaccct ggcatgtccc tgacagacag gggagtgatg agctacgtga ggcatgtgct 841 gagctgcctg ggtgggggac tggctctctg gcgggccggg cagtggctct gggcccagcg 901 gctggggcac tgccacacat actgggcagt gagcgaggag ctgctcccca acagcgggca 961 tgggcctgat ggcgaggtcc ccaaggacaa ggaaggaggc gtgtttgacc tggggccctt 1021 cattgtagat ctgattacct tcacggaagg aagcggacgc tcaccacgct atgccctctg 1081 gttctgtgtg ggggagtcat ggccccagga ccagccgtgg accaagaggc tcgtgatggt 1141 caaggttgtg cccacgtgcc tcagggcctt ggtagaaatg gcccgggtag ggggtgcctc 1201 ctccctggag aatactgtgg acctgcacat ttccaacagc cacccactct ccctcacctc 1261 cgaccagtac aaggcctacc tgcaggactt ggtggagggc atggatttcc agggccctgg 1321 ggagagctga gccctcgctc ctcatggtgt gcctccaacc cccctgttcc ccaccacctc 1381 aaccaataaa ctggttcctg ctatgaa // LOCUS HSIRF4 5320 bp mRNA PRI 29-OCT-1996 DEFINITION Human lymphocyte specific interferon regulatory factor/interferon regulatory factor 4 (LSIRF/IRF4) mRNA, complete cds. ACCESSION U52682 NID g1378108 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5320) AUTHORS Grossman,A., Mittrucker,H.W., Nicholl,J., Suzuki,A., Chung,S., Antonio,L., Suggs,S., Sutherland,G.R., Siderovski,D.P. and Mak,T.W. TITLE Cloning of human lymphocyte-specific interferon regulatory factor (hLSIRF/hIRF4) and mapping of the gene to 6p23-p25 JOURNAL Genomics 37 (2), 229-233 (1996) MEDLINE 97079690 REFERENCE 2 (bases 1 to 5320) AUTHORS Grossman,A., Mittrucker,H.-W., Nicholl,J., Baker,E., Suzuki,A., Chung,S., Antonio,L., Suggs,S., Sutherland,G.R., Siderovski,D.P. and Mak,T.W. TITLE Direct Submission JOURNAL Submitted (26-MAR-1996) A. Grossman, Medical Biophysics, Ontario Cancer Institute, 610 University Ave, Toronto, ON M5G 2C1, Canada FEATURES Location/Qualifiers source 1..5320 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="spleen" /chromosome="6" /map="6p23-25" /cell_type="PHA stimulated T lymphocyte, peripheral blood lymphocyte" gene 1..5320 /gene="IRF4" 5'UTR 1..125 /gene="IRF4" exon 1..71 /gene="IRF4" /number=1 exon 72..341 /gene="IRF4" /number=2 CDS 126..1478 /gene="IRF4" /note="Induced by T-cell/ B-cell receptor crosslinking; Not induced by interferon; Binds the interferon stimulated response element (ISRE) in vitro; Binds the immunoglobulin lambda light chain enhancer, together with PU.1; Also known as PIP/NF-EM5; LSIRF/IRF4; Intron4-Exon 5A=tagcagGTTCACAACTAC..; Intron4-Exon 5B=tagCAGGTTCACAACTAC..; Two forms of the protein exist (450/451 aa), one with an additional glutamine at aa 164 (Q/QQ); transcription factor" /codon_start=1 /product="lymphocyte specific interferon regulatory factor/interferon regulatory factor 4" /db_xref="PID:g1272477" /translation="MNLEGGGRGGEFGMSAVSCGNGKLRQWLIDQIDSGKYPGLVWEN EEKSIFRIPWKHAGKQDYNREEDAALFKAWALFKGKFREGIDKPDPPTWKTRLRCALN KSNDFEELVERSQLDISDPYKVYRIVPEGAKKGAKQLTLEDPQMSMSHPYTMTTPYPS LPAQVHNYMMPPLDRSWRDYVPDQPHPEIPYQCPMTFGPRGHHWQGPACENGCQVTGT FYACAPPESQAPGVPTEPSIRSAEALAFSDCRLHICLYYREILVKELTTSSPEGCRIS HGHTYDASNLDQVLFPYPEDNGQRKNIEKLLSHLERGVVLWMAPDGLYAKRLCQSRIY WDGPLALCNDRPNKLERDQTCKLFDTQQFLSELQAFAHHGRSLPRFQVTLCFGEEFPD PQRQRKLITAHVEPLLARQLYYFAQQNSGHFLRGYDLPEHISNPEDYHRSIRHSSIQE " misc_feature 183..537 /gene="IRF4" /note="encodes tryptophan pentad repeat DNA binding domain" exon 342..538 /gene="IRF4" /number=3 exon 539..617 /gene="IRF4" /number=4 exon 618..759 /gene="IRF4" /note="exon 5A/5B" exon 760..867 /gene="IRF4" /number=6 exon 868..1221 /gene="IRF4" /number=7 exon 1222..1334 /gene="IRF4" /number=8 exon 1335..(1489.5320) /gene="IRF4" /number=9 3'UTR 1479..5320 /gene="IRF4" repeat_region 1519..1684 /rpt_family="Alu" polyA_signal 5297..5302 /gene="IRF4" BASE COUNT 1373 a 1256 c 1250 g 1441 t ORIGIN 1 acctcgcact ctcagtttca ccgctcgatc ttgggaccca ccgctgccct cagctccgag 61 tccagggcga gtgcagagca gagcgggcgg aggaccccgg gcgcgggcgc ggacggcacg 121 cgggcatgaa cctggagggc ggcggccgag gcggagagtt cggcatgagc gcggtgagct 181 gcggcaacgg gaagctccgc cagtggctga tcgaccagat cgacagcggc aagtaccccg 241 ggctggtgtg ggagaacgag gagaagagca tcttccgcat cccctggaag cacgcgggca 301 agcaggacta caaccgcgag gaggacgccg cgctcttcaa ggcttgggca ctgtttaaag 361 gaaagttccg agaaggcatc gacaagccgg accctcccac ctggaagacg cgcctgcggt 421 gcgctttgaa caagagcaat gactttgagg aactggttga gcggagccag ctggacatct 481 cagacccgta caaagtgtac aggattgttc ctgagggagc caaaaaagga gccaagcagc 541 tcaccctgga ggacccgcag atgtccatga gccaccccta caccatgaca acgccttacc 601 cttcgctccc agcccaggtt cacaactaca tgatgccacc cctcgaccga agctggaggg 661 actacgtccc ggatcagcca cacccggaaa tcccgtacca atgtcccatg acgtttggac 721 cccgcggcca ccactggcaa ggcccagctt gtgaaaatgg ttgccaggtg acaggaacct 781 tttatgcttg tgccccacct gagtcccagg ctcccggagt ccccacagag ccaagcataa 841 ggtctgccga agccttggcg ttctcagact gccggctgca catctgcctg tactaccggg 901 aaatcctcgt gaaggagctg accacgtcca gccccgaggg ctgccggatc tcccatggac 961 atacgtatga cgccagcaac ctggaccagg tcctgttccc ctacccagag gacaatggcc 1021 agaggaaaaa cattgagaag ctgctgagcc acctggagag gggcgtggtc ctctggatgg 1081 cccccgacgg gctctatgcg aaaagactgt gccagagcag gatctactgg gacgggcccc 1141 tggcgctgtg caacgaccgg cccaacaaac tggagagaga ccagacctgc aagctctttg 1201 acacacagca gttcttgtca gagctgcaag cgtttgctca ccacggccgc tccctgccaa 1261 gattccaggt gactctatgc tttggagagg agtttccaga ccctcagagg caaagaaagc 1321 tcatcacagc tcacgtagaa cctctgctag ccagacaact atattatttt gctcaacaaa 1381 acagtggaca tttcctgagg ggctacgatt taccagaaca catcagcaat ccagaagatt 1441 accacagatc tatccgccat tcctctattc aagaatgaaa aatgtcaaga tgagtggttt 1501 tctttttcct tttttttttt tttttttgat acggggatac ggggtcttgc tctgtctccc 1561 aggctggagt gcagtgacac aatctcagct cactgtgacc tccgcctcct gggttcaaga 1621 gactctcctg cctcagcctc cctggtagct gggattacag gtgtgagcca ctgcacccac 1681 ccaagacaag tgattttcat tgtaaatatt tgactttagt gaaagcgtcc aattgactgc 1741 cctcttactg ttttgaggaa ctcagaagtg gagatttcag ttcagcggtt gaggagaatt 1801 gcggcgagac aagcatggaa aatcagtgac atctgattgg cagatgagct tatttcaaaa 1861 ggaagggtgg ctttgcattt cttgtgttct atagactgcc atcattgatg atcactgtga 1921 aaattgacca agtgatgtgt ttacatttac tgaaatgtgc tctttaattt gttgtagatt 1981 aggtcttgct ggaagacaga gaaaacttgc ctttcagtat tgacactgac tagagtgatg 2041 actgcttgta ggtatgtctg tgccatttct cagggaagta agatgtaaat tgaagaagcc 2101 tcacacgtaa aagaaatgta ttaatgtatg taggagctgc agttcttgtg gaagacactt 2161 gctgagtgaa ggaaatgaat ctttgactga agccgtgcct gtagccttgg ggaggcccat 2221 cccccacctg ccagcggttt cctggtgtgg gtccctctgc cccaccctcc ttcccattgg 2281 ctttctctcc ttggcctttc ctggaagcca gttagtaaac ttcctatttt cttgagtcaa 2341 aaaacatgag cgctactctt ggatgggaca tttttgtctg tcctacaatc tagtaatgtc 2401 taagtaatgg ttaagttttc ttgtttctgc atctttttga ccctcattct ttagagatgc 2461 taaaattctt cgcataaaga agaagaaatt aaggaacata aatcttaata cttgaactgt 2521 tgcccttctg tccaagtact taactatctg ttcccttcct ctgtgccacg ctcctctgtt 2581 tgcttggctg tccagcgatc agccatggcg acactaaagg aggaggagcc ggggactccc 2641 aggctggaga gcactgccag gacccaccac tggaagcagg atggagctga ctacggaact 2701 gcacactcag tgggctgttt ctgcttattt catctgttct atgcttcctc gtgccaatta 2761 tagtttgaca gggccttaaa attacttggc tttttccaaa tgcttctatt tatagaatcc 2821 caaagacctc cacttgctta agtataccta tcacttacat ttttgtggtt ttgagaaagt 2881 acagcagtag actggggcgt cacctccagg ccgtttctca tactacagga tatttactat 2941 tactcccagg atcagcagaa gattgcgtag ctctcaaatg tgtgttcctg cttttctaat 3001 ggatatttta aattcattca acaagcacct agtaagtgcc tgctgtatcc ctacattaca 3061 cagttcagcc tttatcaagc ttagtgagca gtgagcactg aaacattatt ttttaatgtt 3121 taaaaagttt ctaatattaa agtcagaata ttaatacaat taatattaat attaactaca 3181 gaaaagacaa acagtagaga acagcaaaaa aataaaaagg atctcctttt ttcccagccc 3241 aaattctcct ctctaaaagt gtccacaaga aggggtgttt attcttccaa cacatttcac 3301 ttttctgtaa atatacataa acttaaaaag aaaacctcat ggagtcatct tgcacacact 3361 ttcatgcagt gctctttgta gctaacagtg aagatttacc tcgttctgct cagaggcctt 3421 gctgtggagc tccactgcca tgtacccagt agggtttgac atttcattag ccatgcaaca 3481 tggatatgta ttgggcagca gactgtgttt cgtgaactgc agtgatgtat acatcttata 3541 gatgcaaagt attttggggt atattatcct aagggaagat aaagatgata ttaagaactg 3601 ctgtttcacg gggcccttac ctgtgaccct ctttgctgaa gaatatttaa ccccacacag 3661 cacttcaaag aagctgtctt ggaagtctgt ctcaggagca ccctgtcttc ttaattctcc 3721 aagcggatgc tccatttcaa ttgctttgtg acttcttctt ctttgttttt ttaaatatta 3781 tgctgcttta acagtggagc tgaattttct ggaaaatgct tcttggctgg ggccactacc 3841 tcctttccta tctttacatc tatgtgtatg ttgacttttt aaaattctga gtgatccagg 3901 gtatgaccta gggaatgaac tagctatgaa atactcaggg ttaggaatcc tagcacttgt 3961 ctcaggactc tgaaaaggaa cggcttcctc attccttgtc ttgataaagt ggaattggca 4021 aactagaatt tagtttgtac tcagtggaca gtgctgttga agatttgagg acttgttaaa 4081 gagcactggg tcatatggaa aaaatgtatg tgtctcccag gtgcatttct tggtttatgt 4141 cttgttcttg agattttgta tatttaggaa aacctcaagc agtaattaat atctcctgga 4201 acactataga gaaccaagtg accgactcat ttacaactga aacctaggaa gcccctgagt 4261 cctgagcgaa aacaggagag ttagtcgccc tacaggaaac ccagctagac tattgggtat 4321 gaactaaaaa gagactgtgc catggtgaga aaaatgtaaa atcctacagt ggaatgagca 4381 gcccttacag tgttgttacc accaagggca ggtaggtatt agtgtttgaa aaagctggtc 4441 tttgagcgag ggcataaata cagctagccc caggggtgga acaactgtgg gagtcttggg 4501 tactcgcacc tcttggcttt gttgatgctc cgccaggaag gccacttgtg tgtgcgtgtc 4561 agttactttt ttagtaacaa ttcagatcca gtgtaaactt ccgttcattg ctctccagtc 4621 acatgccccc acttccccac aggtgaaagt ttttctgaaa gtgttgggat tggttaaggt 4681 ctttatttgt attacgtatc tccccaagtc ctctgtggcc agctgcatct gtctgaatgg 4741 tgcgtgaagg ctctcagacc ttacacacca ttttgtaagt tatgttttac atgccccgtt 4801 tttgagactg atctcgatgc aggtggatct ccttgagatc ctgatagcct gttacaggaa 4861 tgaagtaaag gtcagttttt tttgtattga ttttcacagc tttgaggaac atgcataaga 4921 aatgtagctg aagtagaggg gacgtgagag aagggccagg ccggcaggcc aaccctcctc 4981 caatggaaat tcccgtgttg cttcaaactg agacagatgg gacttaacag gcaatggggt 5041 ccacttcccc ctcttcagca tcccccgtac cccacttttt gctgaaagaa ctgccagcag 5101 gtaggacccc agaggccccc aaatgaaagc ttgaatttcc cctactggct ctgcgttttg 5161 ctgagatctg taggaaagga tgcttcacaa actgaggtag ataatgctat gctgtcgttg 5221 gtatacatca tgaattttta tgtaaattgc tctgcaaagc aaattgatat gtttgataaa 5281 tttatgtttt taggtaaata aaaactttta aaaagttgtt // LOCUS HSIRP 2301 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for irp protein (int-1 related protein). ACCESSION X07876 Y00838 NID g33970 KEYWORDS int-1 related protein; irp gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2301) AUTHORS Wainwright,B.J., Scambler,P.J., Stanier,P., Watson,E.K., Bell,G., Wicking,C., Estivill,X., Courtney,M., Boue,A., Pedersen,P.S., Williamson,R. and Farrall,M. TITLE Isolation of a human gene with protein sequence similarity to human and murine int-1 and the Drosophila segment polarity mutant wingless JOURNAL EMBO J. 7 (6), 1743-1748 (1988) MEDLINE 89005063 REFERENCE 2 (bases 1 to 2301) AUTHORS Farrall,M. TITLE Direct Submission JOURNAL Submitted (22-APR-1988) Farrall M., Department of Biochemistry and Molecular Genetics, St. Mary's Hospital Medical School, Norfolk Place, London W1 1PG, England COMMENT cDNA from placental library revealed no difference in sequence Data kindly reviewed (10-Nov-1988) by Farrall M. FEATURES Location/Qualifiers source 1..2301 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="lung" misc_feature 273..1374 /note="ORF" CDS 295..1377 /note="Irp protein (AA 1-360)" /codon_start=1 /db_xref="PID:g33971" /db_xref="SWISS-PROT:P09544" /translation="MNAPLGGIWLWLPLLLTWLTPEVNSSWWYMRATGGSSRVMCDNV PGLVSSQRQLCHRHPDVMRAISQGVAEWTAECQHQFRQHRWNCNTLDRDHSLFGRVLL RSSRESAFVYAISSAGVVFAITRACSQGEVKSCSCDPKKMGSAKDSKGIFDWGGCSDN IDYGIKFARAFVDAKERKGKDARALMNLHNNRAGRKAVKRFLKQECKCHGVSGSCTLR TCWLAMADFRKTGDYLWRKYNGAIQVVMNQDGTGFTVANERFKKPTKNDLVYFENSPD YCIRDREAGSLGTAGRVCNLTSRGMDSCEVMCCGRGYDTSHVTRMTKCGCKFHWCCAV RCQDCLEALDVHTCKAPKNADWTTAT" misc_feature 360 /note="pot. cleavage site (AA 22-23)" misc_feature 364..372 /note="pot. N-linked glycosylation site (AA 24-26)" misc_feature 369 /note="pot. cleavage site (AA 25-26)" misc_feature 1177..1185 /note="pot. N-linked glycosylation site (AA 295-297)" misc_feature 1996..2001 /note="pot. alt. polyA signal" polyA_site 2019 /note="alt. polyA site" misc_feature 2283..2288 /note="pot. polyA signal" polyA_site 2301 /note="polyA site" BASE COUNT 560 a 557 c 643 g 541 t ORIGIN 1 agcagagcgg acgggcgcgc gggaggcgcg cagagctttc gggctgcagg cgctcgctgc 61 cgctggggaa ttgggctgtg ggcgaggcgg tccgggctgg cctttatcgc tcgctgggcc 121 catcgtttga aactttatca gcgagtcgcc actcgtcgca ggaccgagcg gggggcgggg 181 gcgcggcgag gcggcggccg tgacgaggcg ctcccggagc tgagcgcttc tgctctgggc 241 acgcatggcg cccgcacacg gagtctgacc tgatgcagac gcaagggggt taatatgaac 301 gcccctctcg gtggaatctg gctctggctc cctctgctct tgacctggct cacccccgag 361 gtcaactctt catggtggta catgagagct acaggtggct cctccagggt gatgtgcgat 421 aatgtgccag gcctggtgag cagccagcgg cagctgtgtc accgacatcc agatgtgatg 481 cgtgccatta gccagggcgt ggccgagtgg acagcagaat gccagcacca gttccgccag 541 caccgctgga attgcaacac cctggacagg gatcacagcc tttttggcag ggtcctactc 601 cgaagtagtc gggaatctgc ctttgtttat gccatctcct cagctggagt tgtatttgcc 661 atcaccaggg cctgtagcca aggagaagta aaatcctgtt cctgtgatcc aaagaagatg 721 ggaagcgcca aggacagcaa aggcattttt gattggggtg gctgcagtga taacattgac 781 tatgggatca aatttgcccg cgcatttgtg gatgcaaagg aaaggaaagg aaaggatgcc 841 agagccctga tgaatcttca caacaacaga gctggcagga aggctgtaaa gcggttcttg 901 aaacaagagt gcaagtgcca cggggtgagc ggctcatgta ctctcaggac atgctggctg 961 gccatggccg acttcaggaa aacgggcgat tatctctgga ggaagtacaa tggggccatc 1021 caggtggtca tgaaccagga tggcacaggt ttcactgtgg ctaacgagag gtttaagaag 1081 ccaacgaaaa atgacctcgt gtattttgag aattctccag actactgtat cagggaccga 1141 gaggcaggct ccctgggtac agcaggccgt gtgtgcaacc tgacttcccg gggcatggac 1201 agctgtgaag tcatgtgctg tgggagaggc tacgacacct cccatgtcac ccggatgacc 1261 aagtgtgggt gtaagttcca ctggtgctgc gccgtgcgct gtcaggactg cctggaagct 1321 ctggatgtgc acacatgcaa ggcccccaag aacgctgact ggacaaccgc tacatgaccc 1381 cagcaggcgt caccatccac cttcccttct acaaggactc cattggatct gcaagaacac 1441 tggacctttg ggttctttct ggggggatat ttcctaaggc atgtggcctt tatctcaacg 1501 gaagccccct cttcctccct gggggcccca ggatgggggg ccacacgctg cacctaaagc 1561 ctaccctatt ctatccatct cctggtgttc tgcagtcatc tcccctcctg gcgagttctc 1621 tttggaaata gcatgacagg ctgttcagcc gggagggtgg tgggcccaga ccactgtctc 1681 cacccacctt gacgtttctt ctttctagag cagttggcca agcagaaaaa aaagtgtctc 1741 aaaggagctt tctcaatgtc ttcccacaaa tggtcccaat taagaaattc catacttctc 1801 tcagatggaa cagtaaagaa agcagaatca actgcccctg acttaacttt aacttttgaa 1861 aagaccaaga cttttgtctg tacaagtggt tttacagcta ccacccttag ggtaattggt 1921 aattacctgg agaagaatgg ctttcaatac ccttttaagt ttaaaatgtg tatttttcaa 1981 ggcatttatt gccatattaa aatctgatgt aacaaggtgg ggacgtgtgt cctttggtac 2041 tatggtgtgt tgtatctttg taagagcaaa agcctcagaa agggattgct ttgcattact 2101 gtccccttga tataaaaaat ctttagggaa tgagagttcc ttctcactta gaatctgaag 2161 ggaattaaaa agaagatgaa tggtctggca atattctgta actattgggt gaatatggtg 2221 gaaaataatt tagtggatgg aatatcagaa gtatatctgt acagatcaag aaaaaaagga 2281 agaataaaat tcctatatca t // LOCUS HSIRPR 5180 bp RNA PRI 30-MAR-1995 DEFINITION Human mRNA for insulin receptor precursor. ACCESSION X02160 NID g33972 KEYWORDS insulin receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5180) AUTHORS Ullrich,A., Bell,J.R., Chen,E.Y., Herrera,R., Petruzzelli,L.M., Dull,T.J., Gray,A., Coussens,L., Liao,Y.C.J., Tsubokawa,M., Mason,A., Seeburg,P.H., Grunfeld,C., Rosen,O.M. and Ramachandran,J. TITLE Human insulin receptor and its relationship to the tyrosine kinase family of oncogenes JOURNAL Nature 313 (6005), 756-761 (1985) MEDLINE 85137889 REFERENCE 2 (bases 2711 to 2713) AUTHORS Chen,E.Y. TITLE Direct Submission JOURNAL Submitted (23-JUL-1985) to the EMBL/GenBank/DDBJ databases COMMENT Data kindly reviewed (23-JUL-1985) by Chen E.Y. FEATURES Location/Qualifiers source 1..5180 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 49..4161 /note="insulin receptor precursor" /codon_start=1 /db_xref="PID:g33973" /db_xref="SWISS-PROT:P06213" /translation="MGTGGRRGAAAAPLLVAVAALLLGAAGHLYPGEVCPGMDIRNNL TRLHELENCSVIEGHLQILLMFKTRPEDFRDLSFPKLIMITDYLLLFRVYGLESLKDL FPNLTVIRGSRLFFNYALVIFEMVHLKELGLYNLMNITRGSVRIEKNNELCYLATIDW SRILDSVEDNYIVLNKDDNEECGDICPGTAKGKTNCPATVINGQFVERCWTHSHCQKV CPTICKSHGCTAEGLCCHSECLGNCSQPDDPTKCVACRNFYLDGRCVETCPPPYYHFQ DWRCVNFSFCQDLHHKCKNSRRQGCHQYVIHNNKCIPECPSGYTMNSSNLLCTPCLGP CPKVCHLLEGEKTIDSVTSAQELRGCTVINGSLIINIRGGNNLAAELEANLGLIEEIS GYLKIRRSYALVSLSFFRKLRLIRGETLEIGNYSFYALDNQNLRQLWDWSKHNLTITQ GKLFFHYNPKLCLSEIHKMEEVSGTKGRQERNDIALKTNGDQASCENELLKFSYIRTS FDKILLRWEPYWPPDFRDLLGFMLFYKEAPYQNVTEFDGQDACGSNSWTVVDIDPPLR SNDPKSQNHPGWLMRGLKPWTQYAIFVKTLVTFSDERRTYGAKSDIIYVQTDATNPSV PLDPISVSNSSSQIILKWKPPSDPNGNITHYLVFWERQAEDSELFELDYCLKGLKLPS RTWSPPFESEDSQKHNQSEYEDSAGECCSCPKTDSQILKELEESSFRKTFEDYLHNVV FVPRPSRKRRSLGDVGNVTVAVPTVAAFPNTSSTSVPTSPEEHRPFEKVVNKESLVIS GLRHFTGYRIELQACNQDTPEERCSVAAYVSARTMPEAKADDIVGPVTHEIFENNVVH LMWQEPKEPNGLIVLYEVSYRRYGDEELHLCVSRKHFALERGCRLRGLSPGNYSVRIR ATSLAGNGSWTEPTYFYVTDYLDVPSNIAKIIIGPLIFVFLFSVVIGSIYLFLRKRQP DGPLGPLYASSNPEYLSASDVFPCSVYVPDEWEVSREKITLLRELGQGSFGMVYEGNA RDIIKGEAETRVAVKTVNESASLRERIEFLNEASVMKGFTCHHVVRLLGVVSKGQPTL VVMELMAHGDLKSYLRSLRPEAENNPGRPPPTLQEMIQMAAEIADGMAYLNAKKFVHR DLAARNCMVAHDFTVKIGDFGMTRDIYETDYYRKGGKGLLPVRWMAPESLKDGVFTTS SDMWSFGVVLWEITSLAEQPYQGLSNEQVLKFVMDGGYLDQPDNCPERVTDLMRMCWQ FNPNMRPTFLEIVNLLKDDLHPSFPEVSFFHSEENKAPESEELEMEFEDMENVPLDRS SHCQREEAGGRDGGSSLGFKRSYEEHIPYTHMNGGKKNGRILTLPRSNPS" sig_peptide 49..129 /note="put. signal peptide (aa -27 to -1)" misc_feature 130..2949 /note="put. proreceptor polypeptide (aa 1-1343)" misc_feature 130..2298 /note="put. alpha-subunit (aa 1-699)" misc_feature 2237..2298 /note="put. proteolytic cleavage site (aa 700-723)" misc_feature 2299..4158 /note="put. beta-subunit (aa 724-1343)" misc_feature 2881..2949 /note="put. transmembrane sequence (aa 918-940)" polyA_site 5180 /note="put. polyadenylation site" BASE COUNT 1217 a 1363 c 1390 g 1210 t ORIGIN 1 accgggagcg cgcgctctga tccgaggaga ccccgcgctc ccgcagccat gggcaccggg 61 ggccggcggg gggcggcggc cgcgccgctg ctggtggcgg tggccgcgct gctactgggc 121 gccgcgggcc acctgtaccc cggagaggtg tgtcccggca tggatatccg gaacaacctc 181 actaggttgc atgagctgga gaattgctct gtcatcgaag gacacttgca gatactcttg 241 atgttcaaaa cgaggcccga agatttccga gacctcagtt tccccaaact catcatgatc 301 actgattact tgctgctctt ccgggtctat gggctcgaga gcctgaagga cctgttcccc 361 aacctcacgg tcatccgggg atcacgactg ttctttaact acgcgctggt catcttcgag 421 atggttcacc tcaaggaact cggcctctac aacctgatga acatcacccg gggttctgtc 481 cgcatcgaga agaacaatga gctctgttac ttggccacta tcgactggtc ccgtatcctg 541 gattccgtgg aggataatta catcgtgttg aacaaagatg acaacgagga gtgtggagac 601 atctgtccgg gtaccgcgaa gggcaagacc aactgccccg ccaccgtcat caacgggcag 661 tttgtcgaac gatgttggac tcatagtcac tgccagaaag tttgcccgac catctgtaag 721 tcacacggct gcaccgccga aggcctctgt tgccacagcg agtgcctggg caactgttct 781 cagcccgacg accccaccaa gtgcgtggcc tgccgcaact tctacctgga cggcaggtgt 841 gtggagacct gcccgccccc gtactaccac ttccaggact ggcgctgtgt gaacttcagc 901 ttctgccagg acctgcacca caaatgcaag aactcgcgga ggcagggctg ccaccagtac 961 gtcattcaca acaacaagtg catccctgag tgtccctccg ggtacacgat gaattccagc 1021 aacttgctgt gcaccccatg cctgggtccc tgtcccaagg tgtgccacct cctagaaggc 1081 gagaagacca tcgactcggt gacgtctgcc caggagctcc gaggatgcac cgtcatcaac 1141 gggagtctga tcatcaacat tcgaggaggc aacaatctgg cagctgagct agaagccaac 1201 ctcggcctca ttgaagaaat ttcagggtat ctaaaaatcc gccgatccta cgctctggtg 1261 tcactttcct tcttccggaa gttacgtctg attcgaggag agaccttgga aattgggaac 1321 tactccttct atgccttgga caaccagaac ctaaggcagc tctgggactg gagcaaacac 1381 aacctcacca tcactcaggg gaaactcttc ttccactata accccaaact ctgcttgtca 1441 gaaatccaca agatggaaga agtttcagga accaaggggc gccaggagag aaacgacatt 1501 gccctgaaga ccaatgggga ccaggcatcc tgtgaaaatg agttacttaa attttcttac 1561 attcggacat cttttgacaa gatcttgctg agatgggagc cgtactggcc ccccgacttc 1621 cgagacctct tggggttcat gctgttctac aaagaggccc cttatcagaa tgtgacggag 1681 ttcgacgggc aggatgcatg tggttccaac agttggacgg tggtagacat tgacccaccc 1741 ctgaggtcca acgaccccaa atcacagaac cacccagggt ggctgatgcg gggtctcaag 1801 ccctggaccc agtatgccat ctttgtgaag accctggtca ccttttcgga tgaacgccgg 1861 acctatgggg ccaagagtga catcatttat gtccagacag atgccaccaa cccctctgtg 1921 cccctggatc caatctcagt gtctaactca tcatcccaga ttattctgaa gtggaaacca 1981 ccctccgacc ccaatggcaa catcacccac tacctggttt tctgggagag gcaggcggaa 2041 gacagtgagc tgttcgagct ggattattgc ctcaaagggc tgaagctgcc ctcgaggacc 2101 tggtctccac cattcgagtc tgaagattct cagaagcaca accagagtga gtatgaggat 2161 tcggccggcg aatgctgctc ctgtccaaag acagactctc agatcctgaa ggagctggag 2221 gagtcctcgt ttaggaagac gtttgaggat tacctgcaca acgtggtttt cgtccccagg 2281 ccatctcgga aacgcaggtc ccttggcgat gttgggaatg tgacggtggc cgtgcccacg 2341 gtggcagctt tccccaacac ttcctcgacc agcgtgccca cgagtccgga ggagcacagg 2401 ccttttgaga aggtggtgaa caaggagtcg ctggtcatct ccggcttgcg acacttcacg 2461 ggctatcgca tcgagctgca ggcttgcaac caggacaccc ctgaggaacg gtgcagtgtg 2521 gcagcctacg tcagtgcgag gaccatgcct gaagccaagg ctgatgacat tgttggccct 2581 gtgacgcatg aaatctttga gaacaacgtc gtccacttga tgtggcagga gccgaaggag 2641 cccaatggtc tgatcgtgct gtatgaagtg agttatcggc gatatggtga tgaggagctg 2701 catctctgcg tctcccgcaa gcacttcgct ctggaacggg gctgcaggct gcgtgggctg 2761 tcaccgggga actacagcgt gcgaatccgg gccacctccc ttgcgggcaa cggctcttgg 2821 acggaaccca cctatttcta cgtgacagac tatttagacg tcccgtcaaa tattgcaaaa 2881 attatcatcg gccccctcat ctttgtcttt ctcttcagtg ttgtgattgg aagtatttat 2941 ctattcctga gaaagaggca gccagatggg ccgctgggac cgctttacgc ttcttcaaac 3001 cctgagtatc tcagtgccag tgatgtgttt ccatgctctg tgtacgtgcc ggacgagtgg 3061 gaggtgtctc gagagaagat caccctcctt cgagagctgg ggcagggctc cttcggcatg 3121 gtgtatgagg gcaatgccag ggacatcatc aagggtgagg cagagacccg cgtggcggtg 3181 aagacggtca acgagtcagc cagtctccga gagcggattg agttcctcaa tgaggcctcg 3241 gtcatgaagg gcttcacctg ccatcacgtg gtgcgcctcc tgggagtggt gtccaagggc 3301 cagcccacgc tggtggtgat ggagctgatg gctcacggag acctgaagag ctacctccgt 3361 tctctgcggc cagaggctga gaataatcct ggccgccctc cccctaccct tcaagagatg 3421 attcagatgg cggcagagat tgctgacggg atggcctacc tgaacgccaa gaagtttgtg 3481 catcgggacc tggcagcgag aaactgcatg gtcgcccatg attttactgt caaaattgga 3541 gactttggaa tgaccagaga catctatgaa acggattact accggaaagg gggcaagggt 3601 ctgctccctg tacggtggat ggcaccggag tccctgaagg atggggtctt caccacttct 3661 tctgacatgt ggtcctttgg cgtggtcctt tgggaaatca ccagcttggc agaacagcct 3721 taccaaggcc tgtctaatga acaggtgttg aaatttgtca tggatggagg gtatctggat 3781 caacccgaca actgtccaga gagagtcact gacctcatgc gcatgtgctg gcaattcaac 3841 cccaacatga ggccaacctt cctggagatt gtcaacctgc tcaaggacga cctgcacccc 3901 agctttccag aggtgtcgtt cttccacagc gaggagaaca aggctcccga gagtgaggag 3961 ctggagatgg agtttgagga catggagaat gtgcccctgg accgttcctc gcactgtcag 4021 agggaggagg cggggggccg ggatggaggg tcctcgctgg gtttcaagcg gagctacgag 4081 gaacacatcc cttacacaca catgaacgga ggcaagaaaa acgggcggat tctgaccttg 4141 cctcggtcca atccttccta acagtgccta ccgtggcggg ggcgggcagg ggttcccatt 4201 ttcgctttcc tctggtttga aagcctctgg aaaactcagg attctcacga ctctaccatg 4261 tccaatggag ttcagagatc gttcctatac atttctgttc atcttaaggt ggactcgttt 4321 ggttaccaat ttaactagtc ctgcagagga tttaactgtg aacctggagg gcaaggggtt 4381 tccacagttg ctgctccttt ggggcaacga cggtttcaaa ccaggatttt gtgttttttc 4441 gttcccccca cccgccccca gcagatggaa agaaagcacc tgtttttaca aattcttttt 4501 tttttttttt ttttttgctg gtgtctgagc ttcagtataa aagacaaaac ttcctgtttg 4561 tggaacaaaa gttcgaaaga aaaaacaaaa caaaaacacc cagccctgtt ccaggagaat 4621 ttcaagtttt acaggttgag cttcaagatg gtttttttgg tttttttttt ttctctcatc 4681 caggctgaag gatttttttt ttctttacaa aatgagttcc tcaaattgac caatagctgc 4741 tgctttcata ttttggataa gggtctgtgg tcccggcgtg tgctcacgtg tgtatgcacg 4801 tgtgtgtgtc cattagacac ggctgacgtg tgtgcaaagt atccatgcgg agttgatgct 4861 ttgggaattg gctcatgaag gttcttctca agggtgcgag ctcatccccc tctctccttc 4921 cttcttattg actgggagac tgtgctctcg acagattctt cttgtgtcag aagtctagcc 4981 tcaggtttct accctccctt cacattggtg gccaagggag gagcatttca tttggagtga 5041 ttatgaatct tttcaagacc aaaccaagct aggacattaa aaaaaaaaaa aagaaaaaga 5101 aagaaaaaac aaaatggaaa aaggaaaaaa aaaaagaact gagatgacag agttttgaga 5161 atatatttgt accatattta // LOCUS HSISANNEX 1475 bp RNA PRI 24-FEB-1992 DEFINITION H.sapiens mRNA for intestine-specific annexin. ACCESSION Z11502 NID g33979 KEYWORDS intestine-specific annexin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1475) AUTHORS Wice,B.M. and Gordon,J.I. TITLE A strategy for isolation of cDNAs encoding proteins affecting human intestinal epithelial cell growth and differentiation: characterization of a novel gut-specific N-myristoylated annexin JOURNAL J. Cell Biol. 116 (2), 405-422 (1992) MEDLINE 92112982 REFERENCE 2 (bases 1 to 1475) AUTHORS Wice,B.M. TITLE Direct Submission JOURNAL Submitted (22-NOV-1991) WICE B., Molecular Biology and Pharmacology, Washington University School of Medicine, 4566 Scott Ave, Saint Louis, MO, 63110, USA FEATURES Location/Qualifiers source 1..1475 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /dev_stage="committed,predifferentiated HT-29/inosine" /cell_line="HT-29" /clone_lib="cDNA in lambda GEM-2" /clone="pBS7" CDS 80..1030 /codon_start=1 /product="intestine-specific annexin" /db_xref="PID:g33980" /db_xref="SWISS-PROT:P27216" /translation="MGNRHAKASSPQGFDVDRDAKKLNKACKGMGTNEAAIIEILSGR TSDERQQIKQKYKATYGKELEEVLKSELSGNFEKTALALLDRPSEYAARQLQKAMKGL GTDESVLIEFLCTRTNKEIIAIKEAYQRLFDRSLESDVKGDTSGNLKKILVSLLQANR NEGDDVDKDLAGQDAKDLYDAGEGRWGTDELAFNEVLAKRSYKQLRATFQAYQILIGK DIEEAIEEETSGDLQKAYLTLVRCAQDCEDYFAERLYKSMKGAGTDEETLIRIVVTRA EVDLQGIKAKFQEKYQKSLSDMVRSDTSGDFRKLLVALLH" polyA_signal 1452..1457 /note="polyA_signal=ATTAAA" polyA_site 1475 BASE COUNT 451 a 309 c 384 g 331 t ORIGIN 1 cgttgctgtc ggtaaacttt gcctgtagga ggactgatct cttaatgaaa tacagaaaaa 61 ccatctcaga aaaaggaaaa tgggcaatcg tcatgctaaa gcgagcagtc ctcagggttt 121 tgatgtggat cgagatgcca aaaagctgaa caaagcctgc aaaggaatgg ggaccaatga 181 agcagccatc attgaaatct tatcgggcag gacatcagat gagaggcaac aaatcaagca 241 aaagtacaag gcaacgtacg gcaaggagct ggaggaagta ctcaagagtg agctgagtgg 301 aaacttcgag aagacagcgt tggcccttct ggaccgtccc agcgagtacg ccgcccggca 361 gctgcagaag gctatgaagg gtctgggcac agatgagtcc gtcctcattg agttcctgtg 421 cacgaggacc aataaggaaa tcatcgccat taaagaggcc taccaaaggc tatttgatag 481 gagcctcgaa tcagatgtca aaggtgatac aagtggaaac ctaaaaaaaa tcctggtgtc 541 tctgctgcag gctaatcgca atgaaggaga tgacgtggac aaagatctag ctggtcagga 601 tgccaaagat ctgtatgatg caggggaagg ccgctggggc actgatgagc ttgcgttcaa 661 tgaagtcctg gccaagagga gctacaagca gttacgagcc acctttcaag cctatcaaat 721 tctcattggc aaagacatag aagaagccat tgaagaagaa acatcaggcg acttgcagaa 781 ggcctattta actctcgtga gatgtgccca ggattgtgag gactattttg ctgaacgtct 841 gtacaagtcg atgaagggtg cggggaccga tgaggagacg ttgattcgca tagtcgtgac 901 cagggccgag gtggaccttc aggggatcaa agcaaagttc caagagaagt atcagaagtc 961 tctctctgac atggttcgct cagatacctc cggggacttc cggaaactgc tagtagccct 1021 cttgcactga gccaagccag ggcaatagga acacagggtg gaaccacctt tgtcaagagc 1081 acattccaaa tcaaacttgc aaatgagact cccgcacgaa aacccttaag agtcccggat 1141 tactttcttg gcagcttaag tggcgcagcc aggccaagct gtgtaagtta agggcagtaa 1201 cgttaagatg cgtgggcagg gcaccttgaa ctctggctta gcaagcatct aggctgcctc 1261 ttcactttct tttagcatgg taactggatg ttttctaaac actaatgaaa tcagcagttg 1321 atgaaaaaac tatgcatttg taatggcaca tttagaagga tatgcatcac acaagtaagg 1381 tacaggaaag acaaaattaa acaatttatt aattttcctt ctgtgtgttc aatttgaaag 1441 cctcattgtt aattaaagtt gtggattatg cctct // LOCUS HSISG20GN 694 bp RNA PRI 30-MAY-1997 DEFINITION H.sapiens mRNA for gpISG20 protein. ACCESSION X89773 NID g2143257 KEYWORDS isg20 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 694) AUTHORS Tissot,C., Nissen,J. and Mechti,N. TITLE Molecular cloning of a new interferon-inductible PML nuclear bodies-associated protein JOURNAL Unpublished REFERENCE 2 (bases 1 to 694) AUTHORS Mechti,N. TITLE Direct Submission JOURNAL Submitted (12-JUL-1995) N. Mechti, Institut de Genetique Moleculaire de Montpellier, 1919 route de Mende, F- 34033 Montpellier Cedex, FRANCE REMARK Revised by author 30-MAY-97 FEATURES Location/Qualifiers source 1..694 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lymphoblastoid" /cell_line="daudi" gene 41..580 /gene="isg20" CDS 41..580 /gene="isg20" /note="interferon induced" /codon_start=1 /db_xref="PID:e319196" /db_xref="PID:g2143258" /translation="MAGSREVVAMDCEMVGLGPHRESGLARCSLVNVHGAVLYDKFIR PEGEITDYRTRVSGVTPQHMVGATPFAVARLEILQLLKGKLVVGHDLKHDFQALKEDM SGYTIYDTSTDRLLWREAKLDHCRRVSLRVLSERLLHKSIQPLGHSSVEDARATMELY QISQRIRARRGLPRLAVSD" BASE COUNT 147 a 192 c 218 g 137 t ORIGIN 1 ctgcagaatt cggcacgagc tctgagggtc cccaaggaac atggctggga gccgtgaggt 61 ggtggccatg gactgcgaga tggtggggct ggggccccac cgggagagtg gcctggctcg 121 ttgcagcctc gtgaacgtcc acggtgctgt gctgtacgac aagttcatcc ggcctgaggg 181 agagatcacc gattacagaa cccgggtcag cggggtcacc cctcagcaca tggtgggggc 241 cacaccattt gccgtggcca ggctagagat cctgcagctc ctgaaaggca agctggtggt 301 gggtcatgac ctgaagcacg acttccaggc actgaaagag gacatgagcg gctacacaat 361 ctacgacacg tccactgaca ggctgttgtg gcgtgaggcc aagctggacc actgcaggcg 421 tgtctccctg cgggtgctga gtgagcgcct cctgcacaag agcatccagc cgcttggaca 481 cagctcggtg gaagatgcga gggcaacgat ggagctctat caaatctccc agagaatccg 541 agcccgccga gggctgcccc gcctggctgt gtcagactga agccccatcc agcccgttcc 601 gcagggacta gaggctttcg gctttttggg acagcaacta ccttgctttt ggaaaataca 661 tttttaatag taaagtggct ctatattttc tcta // LOCUS HSITBA1 1411 bp RNA PRI 23-JAN-1997 DEFINITION H.sapiens mRNA for ITBA1 protein. ACCESSION X92475 NID g1488607 KEYWORDS ITBA1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 73 to 1411) AUTHORS Faranda,S., Frattini,A., Zucchi,I., Patrosso,C., Milanesi,L., Montagna,C. and Vezzoni,P. TITLE Characterization and fine localization of two new genes in Xq28 using the genomic SequencesolidusEST database screening approach JOURNAL Genomics 34 (1), 323-327 (1996) MEDLINE 96299683 REFERENCE 2 (bases 1 to 1411) AUTHORS Faranda,S. TITLE Direct Submission JOURNAL Submitted (20-OCT-1995) S. Faranda, ITBA - CNR, via Ampere 56, I-20131 Milan, ITALY REFERENCE 3 (bases 1 to 72) AUTHORS Zoppe,M., Frattini,A., Faranda,S. and Vezzoni,P. TITLE The complete sequence of the host cell factor 1 (HCFC1) gene and its promoter: a role for YY1 transcription factor in the regulation of its expression JOURNAL Unpublished FEATURES Location/Qualifiers source 1..1411 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="skeletal muscle" /clone_lib="cDNA skeletal muscle (Genethon)" /clone="HSBB5D05" /chromosome="X" /map="Xq28" gene 285..1070 /gene="ITBA1" CDS 285..1070 /gene="ITBA1" /codon_start=1 /db_xref="PID:e206538" /db_xref="PID:g1488608" /translation="MNPEWGQAFVHVAVAGGLCAVAVFTGIFDSVSVQVGYEHYAEAP VAGLPAFLAMPFNSLVNMAYTLLGLSWLHRGGAMGLGPRYLKDVFAAMALLYGPVQWL RLWTQWRRAAVLDQWLTLPIFAWPVAWCLYLDRGWRPWLFLSLECVSLASYGLALLHP QGFEVALGAHVVAAVGQALRTHRHYGSTTSATYLALGVLSCLGFVVLKLCDHQLARWR LFQCLTGHFWSKVCDVLQFHFAFLFLTHFNTHPRFHPSGGKTR" 3'UTR 1071..1411 polyA_site 1350..1355 BASE COUNT 245 a 414 c 421 g 331 t ORIGIN 1 gaatttcccc cagcgaggcg agtgaggcga aatacccgta tggtgatagc tggccttttc 61 gcgccaatac tgaaaaaggc agaacgttcc tccgctggcg ccagccaatc agcaggactc 121 ctgccttcct tcggggcaag gtcgcagcat ctgcctcgga aatcacgaaa tcacggggct 181 tctttctgct ggctcagccg ggaggcccag agtgttctgc agaggctgcg tattgaaggc 241 tgctctctga agctccctgc cccaggtcac gccgccggtt ccagatgaat ccagagtggg 301 ggcaggcctt cgtgcacgtg gccgtggccg gtggcctctg tgccgtggct gtgttcacgg 361 gcattttcga cagtgtttcc gtgcaagtgg gctatgagca ctacgccgag gcgcccgtgg 421 ccggcctccc tgccttcctg gccatgccgt tcaactcact cgtgaacatg gcctacacgc 481 tgctggggct gtcgtggctg cacaggggcg gcgcgatggg gctgggtccc cgctacctga 541 aggacgtgtt cgcagccatg gccctgctct atggccccgt gcagtggctg cgcctgtgga 601 cgcagtggcg ccgtgccgcg gtgctggacc agtggctcac actgcccatc tttgcatggc 661 ccgtggcctg gtgcctctac ctagaccgcg gctggcggcc ctggctgttc ctctctcttg 721 agtgcgtctc cctggccagt tatggcctcg ctctgctgca tccccagggc ttcgaggtcg 781 cactgggtgc tcacgtggtg gccgctgtgg ggcaggcgct gcgcacccac aggcactatg 841 gcagcaccac ctcggctacc tacttagctt tgggggtgct ctcttgcctg ggctttgtgg 901 tcctcaagct gtgtgaccat cagctcgcac ggtggcgtct cttccagtgc ctcacaggcc 961 acttctggtc caaggtctgt gacgtgctcc agttccactt tgcgtttttg tttctgacgc 1021 atttcaacac tcacccaaga ttccatccct ctggcgggaa gacgcgttga acccagggaa 1081 gaacctgctg aaaaccgatg acccccagca ttgaaatgga ctctgagatg gcagcgtggt 1141 gccagtgtca gacatcctgt gtgtgatgat atgcactgat tacacaagac tgccctttcc 1201 tgagaagctg cgggcttcgg tgtggagggg tggagtgctg tgatctcgac aacttacttt 1261 caaagacata aagcacagat ctccgcacag gggatgtgtg tgttcctgat gtaatttgca 1321 taacttttct gtagtttgaa atgtttccaa ataaatattg gcaaggggag tggaaatgac 1381 accaagaagc ccctcatgct catggttgga c // LOCUS HSITBA2 599 bp RNA PRI 12-AUG-1996 DEFINITION H.sapiens mRNA for ITBA2 protein. ACCESSION X92896 NID g1488609 KEYWORDS ITBA2 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 599) AUTHORS Faranda,S. TITLE Direct Submission JOURNAL Submitted (09-NOV-1995) S. Faranda, ITBA - CNR, via Ampere 56, I-20131 Milan, ITALY REFERENCE 2 (bases 1 to 599) AUTHORS Faranda,S., Frattini,A., Zucchi,I., Patrosso,C., Milanesi,L., Montagna,C. and Vezzoni,P. TITLE Characterization and fine localization of two new genes in Xq28 using the genomic sequence/EST database screening approach JOURNAL Genomics 34 (3), 323-327 (1996) MEDLINE 96374823 FEATURES Location/Qualifiers source 1..599 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /chromosome="X" /map="Xq28" 5'UTR 1..10 gene 11..328 /gene="ITBA2" exon 11..184 /gene="ITBA2" /number=1 /evidence=experimental CDS 11..328 /gene="ITBA2" /codon_start=1 /evidence=experimental /db_xref="PID:e209875" /db_xref="PID:g1488610" /translation="MQTQAEALTAGMAGVATAAAGAWTQPQLRPVELPQRTRQVRAET PRLPQGVTNAAAHIHPQRAFPDPLGGGNRPWVPGTRCRAPPKGGWEGSHSEWQDPGRP LES" exon 185..313 /gene="ITBA2" /number=2 /evidence=experimental exon 314..599 /number=3 /evidence=experimental 3'UTR 329..599 polyA_signal 580..585 BASE COUNT 106 a 182 c 188 g 123 t ORIGIN 1 cgggacgcgg atgcagacgc aggcggaggc gctgacggcg gggatggccg gggtggccac 61 agctgccgcg ggggcgtgga cacagccgca gctccggccg gtggagctcc cccagcgcac 121 gcgccaggtc cgggcagaga cgccgcgtct gccgcagggg gtcacgaatg cggccgcaca 181 tattcaccct cagcgtgcct ttcccgaccc ccttggaggc ggaaatcgcc catgggtccc 241 tggcaccaga tgccgagccc caccaaaggg tggttgggaa ggatctcaca gtgagtggca 301 ggatcctggt cgtccgctgg aaagctgaag actgtcgcct gctccgaatt tccgtcatca 361 actttcttga ccagctttcc ctggtggtgc ggaccatgca gcgctttggg ccccccgttt 421 cccgctaagc ctggcctggg caaatggagc gaggtcccac tttgcgtctc cttgtaggca 481 gtgcgtccat ccttccctag ggcaggaatt cccacagttg ctactttcct gggagggcct 541 catgttttat ctggttctta aatgtttgtt actacagaaa ataaaactga ggtattatt // LOCUS HSITIH1 2920 bp RNA PRI 17-FEB-1997 DEFINITION H.sapiens mRNA for inter-alpha-trypsin inhibitor heavy chain ITIH1. ACCESSION X63652 NID g33988 KEYWORDS HI-30 (30KDa protease inhibitor) binding; inter-alpha-trypsin inhibitor heavy chain; ITI H2 binding; ITI H3 binding; protease inhibitor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2920) AUTHORS Diarra-Mehrpour,M. TITLE Direct Submission JOURNAL Submitted (30-JAN-1992) M. Diarra-Mehrpour, INSERM-Unite 295, Faculte de Medecine-Pharmacie de Rouen, Avenue de l'Universite - B.P. 97, F-76803 St Etienne Rouvray Cedex, FRANCE REFERENCE 2 (bases 1 to 2920) AUTHORS Gebhard,W., Schreitmuller,T., Hochstrasser,K. and Wachter,E. TITLE Two out of the three kinds of subunits of inter-alpha-trypsin inhibitor are structurally related JOURNAL Eur. J. Biochem. 181 (3), 571-576 (1989) MEDLINE 89276339 REFERENCE 3 (bases 1 to 2920) AUTHORS Diarra-Mehrpour,M., Bourguignon,J., Bost,F., Sesboue,R., Muschio,F., Sarafan,N. and Martin,J.P. TITLE Human inter-alpha-trypsin inhibitor: full-length cDNA sequence of the heavy chain H1 JOURNAL Biochim. Biophys. Acta 1132 (1), 114-118 (1992) MEDLINE 92379086 COMMENT Related sequences: M18192-3, M22972-3 and X16260. FEATURES Location/Qualifiers source 1..2920 /organism="Homo sapiens" /isolate="3" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="liver/blood" /cell_type="hepatocyte/leucocyte" /clone_lib="liver cDNA library lambda gt11; genomic library EMBL3; 5' end of cDNA PCR ampl." prim_transcript 1..2920 CDS 25..2760 /codon_start=1 /product="inter-alpha-trypsin inhibitor heavy chain ITIH1" /db_xref="PID:g33989" /translation="MDGAMGPRGLLLCMYLVSLLILQAMPALGSATGRSKSSEKRQAV DTAVDGVFIRSLKVNCKVTSRFAHYVVTSQVVNTANEAREVAFDLEIPKTAFISDFAV TADGNAFIGDIKDKVTAWKQYRKAAISGENAGLVRASGRTMEQFTIHLTVNPQSKVTF QLTYEEVLKRNHMQYEIVIKVKPKQLVHHFEIDVDIFEPQGISKLDAQASFLPKELAA QTIKKSFSGKKGHVLFRPTVSQQQSCPTCSTSLLNGHFKVTYDVTRDEICDLLVANNH FAHFFAPQNLTNMNKNVVFVIDISGSMRGQKVKQTKEALLKILGDMQPGDYFDLVLFG TRVQSWKGSLVQASEANLQAAQDFVRGFSLDEATNLNGGLLRGIEILNQVQESLPELS NHASILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSME NNGRAQRIYEDHDATQQLQGFYSQVAKPLLVDVDLQYPQDAVLALTQNHHKQYYEGSE IVVAGRIADNKQSSFKADVQAHGEGQEFSITCLVDEEEMKKLLRERGHMLENHVERLW AYLTIQELLAKRMKVDREVRANLSSQALRMSLDYGFVTPLTSMSIRGMADQDGLKPTI DKPSEDSPPLEMLGPRRTFVLSALQPSPTHSSSNTQRLPDRVTGVDTDPHFIIHVPQK EDTLCFNINEEPGVILSLVQDPNTGFSVNGQLIGNKARSPGQHDGTYFGRLGIANPAT DFQLEVTPQNITLNPGFGGPVFSWRDQAVLRQDGVVVTINKKRNLVVSVDDGGTFEVV LHRVWKGSSVHQDFLGFYVLDSHRMSARTHGLLGQFFHPIGFEVSDIHPGSDPTKPDA TMVVRNRRLTVTRGLQKDYSKDPWHGAEVSCWFIHNNGAGLIDGAYTDYIVPDIF" conflict 1778 /citation=[2] /replace="a" conflict 1808 /citation=[2] /replace="a" conflict 2572 /citation=[2] /replace="t" BASE COUNT 699 a 813 c 820 g 588 t ORIGIN 1 acagggcagc aggagcctta gagcatggac ggtgccatgg ggcctcgggg gctgctgttg 61 tgcatgtacc tggtatctct cctcatcctg caggccatgc ctgccctggg ctcggctaca 121 ggcaggtcca agagcagcga gaagcgacag gctgtggaca ccgctgtcga tggcgtgttc 181 atccggagtt tgaaagtcaa ctgcaaagtc acctctcgct tcgcccacta tgttgtcacc 241 agccaagtgg tcaacactgc caatgaagcc agggaagtgg ccttcgacct ggaaatcccc 301 aagacagcat tcatcagtga ctttgccgtt acagcagatg gaaacgcatt tatcggagac 361 ataaaggaca aggtgactgc atggaagcag taccggaaag cagctatctc aggagagaat 421 gccggccttg tcagggcctc ggggagaact atggagcaat tcaccatcca cctcaccgtc 481 aatccccaga gcaaggtcac gtttcagctg acttatgagg aagtgctgaa gagaaaccat 541 atgcagtatg aaattgtcat caaagtcaag cccaagcagc tggtgcatca ttttgagatt 601 gatgtggaca tcttcgagcc ccaggggatc agcaagctgg atgcccaggc ctctttcctg 661 ccgaaggaac tggcagccca aactatcaag aagtccttct caggaaaaaa gggtcatgtg 721 ctgttccgtc ccaccgtgag ccagcagcag tcctgcccca catgctctac atccttactg 781 aacgggcact tcaaggtgac ctacgatgtc actcgagacg agatctgcga cctcctggtg 841 gccaataacc actttgccca cttctttgcc ccccaaaacc tgacaaacat gaacaagaac 901 gtggtttttg tgattgacat cagtggctcc atgagaggcc agaaagtgaa gcagaccaag 961 gaggcactcc ttaaaattct gggggacatg cagccagggg actactttga cctggttctt 1021 tttgggactc gagtacaatc gtggaagggc tcgctggtgc aagcatctga ggccaaccta 1081 caagcagctc aagactttgt gcggggcttt tccctggatg aggccacaaa cctgaatgga 1141 ggtttgctcc ggggaattga gatcttgaac caagttcagg aaagcctccc agaactcagc 1201 aaccatgcct caatactcat catgttgaca gatggcgatc ccacagaggg ggtgacggac 1261 cgttcccaaa tcctcaagaa cgtccgcaac gccatccggg gcaggttccc gctctacaac 1321 ctgggtttcg gccacaatgt ggactttaac tttctggagg tcatgtccat ggagaacaac 1381 ggacgggccc agagaatcta cgaggaccat gatgccaccc agcagctgca gggtttctac 1441 agccaggtag ccaaacccct gctggtggat gtggatttgc agtaccccca ggatgctgtc 1501 ttggccctga cccagaacca ccataaacag tactacgaag gctcagagat tgtggtggcc 1561 gggcgcattg ctgacaacaa acagagcagc ttcaaggctg atgtgcaggc ccatggggag 1621 ggacaagaat tcagtataac ctgcctagtg gatgaggagg agatgaagaa actgctccga 1681 gagcgtggcc acatgctgga gaaccacgtc gagcgcctct gggcctacct caccatccag 1741 gagctgctgg ccaagcggat gaaggtggac agggaggtga gggccaacct gtcatcccag 1801 gccctgcgga tgtcgctgga ctatgggttt gtgaccccac tgacctccat gagcatcagg 1861 ggcatggcgg accaggacgg cctgaagccc accatcgaca agccctcaga ggattctccg 1921 cctttggaga tgctgggacc cagaaggacg ttcgtgctgt cagccttgca gccttctcct 1981 actcattcca gctccaatac ccagcggctg ccagaccgag tgaccggcgt ggacacagac 2041 cctcacttca tcatccacgt gccccagaaa gaggacaccc tgtgcttcaa catcaatgag 2101 gagcctggtg ttatcctgag cctggtacag gaccccaaca caggcttctc agtgaatgga 2161 cagctcattg gcaacaaggc caggagccct gggcagcatg acggcacgta cttcgggcgg 2221 ctgggaatcg caaaccctgc cacggacttt cagttggaag tgactcctca gaacattacg 2281 ctgaaccccg gctttggtgg gcctgtgttt tcctggaggg accaagctgt gctgcggcag 2341 gacggggtgg tggtgaccat caacaagaag aggaacctgg tggtgtctgt ggacgacggt 2401 ggcacctttg aggttgtttt gcaccgagtg tggaagggga gctcggtcca ccaggacttc 2461 ctgggcttct atgtgctgga cagtcatcgg atgtcagccc ggacgcacgg gctgctgggg 2521 caatttttcc accccatcgg ttttgaagtg tctgacatcc acccaggctc cgaccccaca 2581 aagccagatg ccacgatggt ggtgaggaac cgccggctca cggtcaccag gggtttgcaa 2641 aaagactaca gcaaggaccc gtggcatggg gccgaggtgt cctgctggtt cattcacaac 2701 aatggggctg gactcatcga tggtgcctac actgattata tcgtccccga catcttctga 2761 gccctctggc cagcacgcct gtcctccccc ggggccaagg cagaggagga ggacgacatc 2821 ctgacctgct gctgaggctg tacctccttg actaagctgg ttccttgtgt caaagcacct 2881 catgccttcc attaaagaga ggccgtgtcc aaaaaaaaaa // LOCUS HSITKBI 4505 bp RNA PRI 02-SEP-1992 DEFINITION H.sapiens mRNA for 1D-myo-inositol-trisphosphate 3-kinase B isoenzyme. ACCESSION X57206 NID g33990 KEYWORDS inositol 1,4,5-triphosphate 3-kinase; isoenzyme. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4505) AUTHORS Takazawa,K. TITLE Direct Submission JOURNAL Submitted (11-JAN-1991) K. Takazawa, I R I B H N SCHOOL OF MEDICINE, FREE UNIVERSITY OF BRUSSELS (ULB), CAMPUS ERASME (BAT C), ROUTE DE LENNICK 808, B-1070 BRUSSELS, BELGIUM REFERENCE 2 (bases 1 to 4505) AUTHORS Takazawa,K., Perret,J., Dumont,J.E. and Erneux,C. TITLE Molecular cloning and expression of a new putative inositol 1,4,5-trisphosphate 3-kinase isoenzyme JOURNAL Biochem. J. 278 (Pt 3), 883-886 (1991) MEDLINE 91378954 FEATURES Location/Qualifiers source 1..4505 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="2 years old" /tissue_type="hippocampus" /clone_lib="cDNA" /clone="hh2T" /sex="Female" CDS 102..1520 /EC_number="2.7.1.127" /codon_start=1 /product="1D-myo-inositol-trisphosphate 3-kinase" /db_xref="PID:g33991" /db_xref="SWISS-PROT:P27987" /translation="MLEPLPCWDAAKDLKEPQCPPGDRVGVQPGNSRVWQGTMEKAGL AWTRGTGVQSEGTWESQRQDSDALPSPELLPQDQDKPFLRKACSPSNIPAVIITDMGT QEDGALEETQGSPRGNLPLRKLSSSSASSTGFSSSYEDSEEDISSDPERTLDPNSAFL HTLDQQKPRVSKSWRKIKNMVHWSPFVMSFKKKYPWIQLAGHAGSFKAAANGRILKKH CESEQRCLDRLMVDVLRPFVPAYHGDVVKDGERYNQMDDLLADFDSPCVMDCKMGIRT YLEEELTKARKKPSLRKDMYQKMIEVDPEAPTEEEKAQRAVTKPRYMQWRETISSTAT LGFRIEGIKKEDGTVNRDFKKTKTREQVTEAFREFTKGNHNILIAYRDRLKAIRTTLE VSPFFKCHEVIGSSLLFIHDKKEQAKVWMIDFGKTTPLPEGQTLQHDVPWQEGNREDG YLSGLNNLVDILTEMSQDAPLA" BASE COUNT 1001 a 1213 c 1266 g 1025 t ORIGIN 1 gaattccgga gggagggtcc ccaacgctgg gcttgcttgg gggcagcccc tcagcacagc 61 cggggaccgg gaatgtggag gcgggaattc cttctggcag aatgctggag cctttgccct 121 gttgggacgc tgcgaaagat ctgaaagaac ctcagtgccc tcctggggac agggtgggtg 181 tgcagcctgg gaactccagg gtttggcagg gcaccatgga gaaagccggt ttggcttgga 241 cgcgtggcac aggggtgcaa tcagagggga cttgggaaag ccagcggcag gacagtgatg 301 ccctcccaag tccggagctg ctaccccaag atcaggacaa gcctttcctg aggaaggcct 361 gcagccccag caacatacct gctgtcatca ttacagacat gggcacccag gaggatgggg 421 ccttggagga gacgcaggga agccctcggg gcaacctgcc cctgaggaaa ctgtcctctt 481 cctcggcctc ctccacgggc ttctcctcat cctacgaaga ctcagaggag gacatctcca 541 gtgaccctga gcgcaccctg gaccccaact cagctttcct gcataccctg gaccagcaga 601 aacctagagt gagcaaatca tggaggaaga taaaaaacat ggtgcactgg tctcccttcg 661 tcatgtcctt caagaagaag tacccctgga tccagctggc aggacacgca gggagtttca 721 aggcagctgc caatggcagg atcctgaaga agcactgtga gtcagagcag cgctgcctgg 781 accggctgat ggtggatgtg ctgaggccct tcgtacctgc ctaccatggg gatgtggtga 841 aggacgggga gcgctacaac cagatggacg acctgctggc cgacttcgac tcgccctgtg 901 tgatggactg caagatggga atcaggacct acctggagga ggagctcacg aaggcccgga 961 agaagcccag cctgcggaag gacatgtacc agaagatgat cgaggtggac cccgaggccc 1021 ccaccgagga ggaaaaagca cagcgggctg tgaccaagcc acggtacatg cagtggcggg 1081 agaccatcag ctccacggcc accctggggt tcaggatcga gggaatcaag aaagaagacg 1141 gcaccgtgaa ccgggacttc aagaagacca aaacgaggga gcaggtcacc gaggccttca 1201 gagagttcac taaaggaaac cataacatcc tgatcgccta tcgggaccgg ctgaaggcca 1261 ttcgaaccac tctagaagtt tctcccttct tcaagtgcca cgaggtcatt ggcagctccc 1321 tcctcttcat ccacgacaag aaggaacagg ccaaagtgtg gatgatcgac tttgggaaaa 1381 ccacgcccct gcctgagggc cagaccctgc agcatgacgt cccctggcag gaggggaacc 1441 gggaggatgg ctacctctcg gggctcaata acctcgtcga catcctgacc gagatgtccc 1501 aggatgcccc actcgcctga gctgcccacg ccctccctgg cccccgcctg ggcctccttt 1561 cctcctcctg tgcttccttt ctcgttccta acttttcctt cacttacacc tgactgaccc 1621 tcctgaactg cactacaaga cactttgtag aagaggagat gagagtttct agtcattttc 1681 ctaacttcag ggcttggagg tggtgtttgc actgcttttt gtagagaggg tcacctacta 1741 gaagagaaat gcccagtctt agaggtgggt caggtgtaga gctggagggg gtccctggct 1801 gctgagggga ccctaccaga tgagccctgc ctctgggagc cccctaggaa gcaccagcct 1861 ggacctacca cctgcggagg cctgctgccc cctggcggcc agtgctgtta gagtgctgcc 1921 aagcacagcc ttatttctgc cggggcctcc ccaccggaga gcccaggggg ccggccgggt 1981 tcctggtccc tggctgggag cagggctttc tggtagttgg ggcacaaaac catcggggaa 2041 ccacatgttg actgtgagca aagtgtcttc cgattagcag cctcagggat gccctggtgg 2101 cctctccagg gctgctcagg caaggccccc cacccatctg gtatggaaac ctgccggctc 2161 caggccagac ccaggagcca agagaaggct gaagccagct tggctgtgtt ctctgatcta 2221 ggccttccca gaggaggcga gcagaagctg tgccacttgg aattgcaacc catgagttca 2281 gaaggcacac tctgccatgc tgagctccaa gggtgctacc aggggaagat gggatctata 2341 gagtctctgg gccctggccc cagggaggag cacatttttc ttgaccctca cctacctggt 2401 gctagttggt caaccctgcc tgcatacatg ggctcctgtc atggggccca gagtcccttg 2461 cagatataga aataggggag gagctcaggt ctgcgccagg caggaagaag gcaggcttct 2521 ggcttccaga ggtgccgcgg tggcctcctg gcatcatttg ttattgcctc tgaaacaagc 2581 cttactgcct ggagggctta gattcctgct tccccaatgt agtgtgggta tcttgtaggg 2641 tatgtggtgg atgccagggc gtgctccagg cacctcttcc tgaagtctct gcatttggag 2701 attcgtggag aacctattta agcccaattt taactgaaag ccagtgagtc tgatatggaa 2761 gggaatgtaa aatttgcctg acttcttaag aacaaaaccc ccagctctgt gccccatgct 2821 ccttggggct tgccacccac tcctttgctg tcagaggtac aggagctggg agagtccagg 2881 agctagggac acagagggag actatggacc aaggtgtgtg tgtctggagg aaccactgcc 2941 caccccacca ccccggggtc tctggggaac tgtcaacctg cccacgggac atgtacattt 3001 ccccttttgt gctggaagtg tgagtgacac ttgctggggg tggagggtgg gacacatgag 3061 gatgtataag tacagatttt aaaaaaggaa atcaacttac acttcctggc tcttgtttaa 3121 aacagtggtg agctcctgtg tgggccgact tgctaaaggt cacacacgcg cccggtggag 3181 cacgagagac ctcgtggcag catgtgatct ggaaggcagg caggacgggg gcgttgggga 3241 gccaaagtca actctgggcc tctggagcta tagtgacttt tgggctagaa gggaccctgg 3301 tggtctgtgc ttcagccatt tgcagggcag gggcatcatt aattcagacg taaagattct 3361 atgaatatgg actggccaaa agttatcctt actccatctg tgaaagaagt ttgctaaagc 3421 aaatcatgat atgaacaaaa attacagggg acctgtttaa gagaacaaaa tgttccaagc 3481 actttaggca gacaccagct gtttgcaaac aatgtgctaa tatgcaaatg atgtgcttat 3541 taaaggaggc ccatggggcc tcttattggc aatacttggc tgtgggttac attaaatatg 3601 tgaacatagt atgaagtagc atcattttag ggttattctg ttacttaggg tttttgtttt 3661 ctgttttttt tttctctttt tttgtattta ccgtgctagt tctcttctac acctactctg 3721 tctctcaagc cattttgcca ctcgcttccc tgccatctgg cccttccctt tgtctcagtg 3781 ggatagatgg attgtgaaat ggaatctccc agaacccctg ccctggcagc ctggaagacc 3841 gtgcctgccc agccctcgtc accacaggga ctccttgggt cctggcagtg catgtgccag 3901 caggcaggac aaactctgtg tacctgtgcc caggtgaatg ggcgcagggt cctcttgccc 3961 tgtcctgcgg ggggccccac gagttcctgg cattcagcac tgcttagcat tctcggaagg 4021 tttcttcaac tgcttgcttt tcccaggctt gcctttagtg tcatgtaaga catttttaag 4081 ttatatttat tttgttgggt tttaaaattg cacagaacac taagaccgaa aggctggact 4141 cttgtttctc cttgaaagct ttgcctttgt tttgaacttc ctttcccact tggtagaaag 4201 agcccagaag cagccctggc cctgtaagat ggactctttc atccttcagt tgtatttagc 4261 tttgagtttc tctgcatctg tccaccccat gtgtatataa cccagcccct ggctctgggg 4321 tggtcacctc gtcagtgcct tttgttctgg aggagaggac ccccccgcct gccgagaggc 4381 tctcttcctg ttctgcaccc ctctccccat gggaccttgg agaaaactga actgttacaa 4441 acccctgcac agtgcctgtc aaacagatgc aaaccttcct gaataaagcc ttggagacgg 4501 aattc // LOCUS HSJ000644 1642 bp mRNA PRI 14-JAN-1998 DEFINITION Homo sapiens mRNA for SPOP. ACCESSION AJ000644 NID g2695707 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1642) AUTHORS Nagai,Y., Kojima,T., Muro,Y., Hachiya,T., Nishizawa,Y., Wakabayashi,T. and Hagiwara,M. TITLE Identification of a novel nuclear speckle-type protein, SPOP JOURNAL FEBS Lett. 418 (1-2), 23-26 (1997) MEDLINE 98074898 REFERENCE 2 (bases 1 to 1642) AUTHORS Hagiwara,M. TITLE Direct Submission JOURNAL Submitted (11-AUG-1997) Hagiwara M., Medical Research Institute, Department of Endocrinology, Tokyo Medical and Dental University, 1-5-45 Yushima, Bunkyo-ku, Tokyo, 113, JAPAN FEATURES Location/Qualifiers source 1..1642 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 158..1282 /codon_start=1 /product="SPOP" /db_xref="PID:e1216712" /db_xref="PID:g2695708" /translation="MSRVPSPPPPAEMSSGPVAESWCYTQIKVVKFSYMWTINNFSFC REEMGEVIKSSTFSSGANDKLKWCLRVNPKGLDEESKDYLSLYLLLVSCPKSEVRAKF KFSILNAKGEETKAMESQRAYRFVQGKDWGFKKFIRRDFLLDEANGLLPDDKLTLFCE VSVVQDSVNISGQNTMNMVKVPECRLADELGGLWENSRFTDCCLCVAGQEFQAHKAIL AARSPVFSAMFEHEMEESKKNRVEINDVEPEVFKEMMCFIYTGKAPNLDKMADDLLAA ADKYALERLKVMCEDALCSNLSVENAAEILILADLHSADQLKTQAVDFINYHASDVLE TSGWKSMVVSHPHLVAEAYRSLASAQCPFLGPPRKRLKQS" BASE COUNT 424 a 367 c 449 g 402 t ORIGIN 1 gaatcggcgg tcccgcaggt cccggatgtt gcggacagta tgaggcaagc gcagggggac 61 ggggaccagc agctgtcgcc gccgctctca gggtgaagag ggaacagaaa tctttgcccc 121 ctgactttgg aaatctcgtt taaccttcaa actggcgatg tcaagggttc caagtcctcc 181 acctccggca gaaatgtcga gtggccccgt agctgagagt tggtgctaca cacagatcaa 241 ggtagtgaaa ttctcctaca tgtggaccat caataacttt agcttttgcc gggaggaaat 301 gggtgaagtc attaaaagtt ctacattttc atcaggagca aatgataaac tgaaatggtg 361 tttgcgagta aaccccaaag ggttagatga agaaagcaaa gattacctgt cactttacct 421 gttactggtc agctgtccaa agagtgaagt tcgggcaaaa ttcaaattct ccatcctgaa 481 tgccaaggga gaagaaacca aagctatgga gagtcaacgg gcatataggt ttgtgcaagg 541 caaagactgg ggattcaaga aattcatccg tagagatttt cttttggatg aggccaacgg 601 gcttctccct gatgacaagc ttaccctctt ctgcgaggtg agtgttgtgc aagattctgt 661 caacatttct ggccagaata ccatgaacat ggtaaaggtt cctgagtgcc ggctggcaga 721 tgagttagga ggactgtggg agaattcccg gttcacagac tgctgcttgt gtgttgccgg 781 ccaggaattc caggctcaca aggctatctt agcagctcgt tctccggttt ttagtgccat 841 gtttgaacat gaaatggagg agagcaaaaa gaatcgagtt gaaatcaatg atgtggagcc 901 tgaagttttt aaggaaatga tgtgcttcat ttacacgggg aaggctccaa acctcgacaa 961 aatggctgat gatttgctgg cagctgctga caagtatgcc ctggagcgct taaaggtcat 1021 gtgtgaggat gccctctgca gtaacctgtc cgtggagaac gctgcagaaa ttctcatcct 1081 ggccgacctc cacagtgcag atcagttgaa aactcaggca gtggatttca tcaactatca 1141 tgcttcggat gtcttggaga cctctgggtg gaagtcaatg gtggtgtcac atccccactt 1201 ggtggctgag gcataccgct ctctggcttc agcacagtgc ccttttctgg gacccccacg 1261 caaacgcctg aagcaatcct aagatcctgc ttgttgtaag actccgttta atttccagaa 1321 gcagcagcca ctgttgctgc cactgaccac caggtagaca gcgcaatctg tggagctttt 1381 actctgttgt gaggggaaga gactgcattg tggccccaga cttttaaaac agcactaaat 1441 aacttggggg aaacgggggg agggaaaatg aaatgaaaac cctgttgctg cgtcactgtg 1501 ttccctttgg cctgtctgag tttgatactg tggggattca gtttaggcgc tggcccgagg 1561 atatcccagc ggtggtactt cggagacacc tgtctgcatc tgactgagca gaacaaatcg 1621 tcaggtgcct ggagcaaaaa gg // LOCUS HSJ002190 2470 bp mRNA PRI 06-FEB-1998 DEFINITION Homo sapiens cDNA for dihydroxyacetone phosphate acyltransferase (DAP-AT). ACCESSION AJ002190 NID g2584768 KEYWORDS dihydroxyacetone phosphate acyltransferase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2470) AUTHORS Thai,P.T., Heid,H., Rackwitz,H., Hunziker,A., Gorgas,K. and Just,W. TITLE Ether lipid biosynthesis: isolation and molecular characterization of human dihydroxyacetonephosphate acyltransferase JOURNAL FEBS Lett. 420, 205-211 (1997) REFERENCE 2 (bases 1 to 2470) AUTHORS Just,W.W. TITLE Direct Submission JOURNAL Submitted (30-OCT-1997) Just W.W., Biochemiezentrum Heidelberg (BZH), University of Heidelberg, Im Neuenheimer Feld 328, 69120 Heidelberg, GERMANY FEATURES Location/Qualifiers source 1..2470 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /tissue_lib="Lambda cDNA library" CDS 158..2200 /function="biosynthesis of etherlipids and plasmalogenes" /codon_start=1 /evidence=experimental /product="dihydroxyacetone phosphate acyltransferase" /db_xref="PID:e1169563" /db_xref="PID:g2584769" /translation="MESSSSSNSYFSVGPTSPSAVVLLYSKELKKWDEFEDILEERRH VSDLKFAMKCYTPLVYKGITPCKPIDIKCSVLNSEEIHYVIKQLSKESLQSVDVLREE VSEILDEMSHKLRLGAIRFCAFTLSKVFKQIFSKVCVNEEGIQKLQRAIQEHPVVLLP SHRSYIDFLMLSFLLYNYDLPVPVIAAGMDFLGMKMVGELLRMSGAFFMRRTFGGNKL YWAVFSEYVKTMLRNGYAPVEFFLEGTRSRSAKTLTPKFGLLNIVMEPFFKREVFDTY LVPISISYDKILEETLYVYELLGVPKPKESTTGLLKARKILSENFGSIHVYFGDPVSL RSLAAGRMSRSSYNLVPRYIPQKQSEDMHAFVTEVAYKMELLQIENMVLSPWTLIVAV LLQNRPSMDFDALVEKTLWLKGLTQAFGGFLIWPDNKPAEEVVPASILLHSNIASLVK DQVILKVDSGDSEVVDGLMLQHITLLMCSAYRNQLLNIFVRPSLVAVALQMTPGFRKE DVYSCFRFLRDVFADEFIFLPGNTLKDFEEGCYLLCKSEAIQVTTKDILVTEKGNTVL EFLVGLFKPFVESYQIICKYLLSEEEDHFSEEQYLAAVRKFTSQLLDQGTSQCYDVLS SDVQKNALAACVRLGVVEKKKINNNCIFNVNEPATTKLEEMLGCKTPIGKPATAKL" BASE COUNT 718 a 510 c 562 g 680 t ORIGIN 1 gaattcggca cgagccggga tcctgtgtag cggctgcaga gggtgccgcc gccctaggcg 61 aagtagggcc gtcctgagcg aaagaaccgc ccccagcagg agcaccacca cggcttagca 121 aagaatccca gaccccgccc gggaaggcag ccgcaccatg gagtcttcca gttcatctaa 181 ctcttatttc tccgttggcc caaccagtcc cagcgctgtc gtgctcctct actcgaagga 241 gctcaaaaag tgggatgagt ttgaagatat tttagaagag aggaggcatg tcagtgactt 301 gaaatttgca atgaaatgct acacacctct tgtctataag ggaattactc catgtaaacc 361 aattgatatt aaatgtagtg ttctcaattc tgaggagatt cattatgtca ttaaacagct 421 ttccaaggaa tcccttcaat ctgtggatgt cctccgagag gaagtgagtg agatcttaga 481 tgaaatgagt cacaaactgc gtcttggagc cattcggttt tgtgccttca ccctgagcaa 541 agtatttaaa caaattttct cgaaggtgtg tgtaaatgaa gaaggtattc agaaactaca 601 aagagccatc caggagcatc ctgttgttct gctgcctagt catcgaagtt acattgactt 661 cctcatgttg tcttttcttc tatacaatta tgatttgcct gtgccagtta tagcagcagg 721 aatggacttc ctgggaatga aaatggttgg tgagctgcta cgaatgtcgg gtgccttttt 781 catgcggcgt acctttggtg gcaataaact ctactgggct gtattctctg aatatgtaaa 841 aactatgtta cggaatggtt atgctcctgt tgaatttttc ctcgaaggga caagaagccg 901 ctctgccaag acattgactc ctaaatttgg tcttctgaat attgtgatgg agccattttt 961 taaaagagaa gtttttgata cctaccttgt cccaattagt atcagttatg ataagatctt 1021 ggaagaaact ctttatgtgt atgagcttct aggggttcct aaaccaaaag agtctacaac 1081 tgggttgctg aaagccagaa agattctctc tgaaaatttt ggaagcatcc atgtgtactt 1141 tggagatcct gtgtcacttc gatctttggc agctgggagg atgagtcgga gctcatataa 1201 cttggttcca agatacattc ctcagaaaca gtctgaggac atgcatgcct ttgtcactga 1261 agttgcctac aaaatggagc ttctgcaaat tgaaaacatg gttttgagcc cctggaccct 1321 aatagttgct gttctgcttc agaaccggcc atccatggac tttgatgctc tggtggaaaa 1381 gactttatgg ctaaaaggct taacccaggc atttggaggg tttctcattt ggcctgataa 1441 taaacctgct gaagaagttg tcccggccag cattcttctg cattccaaca ttgccagcct 1501 tgtcaaagac caggtgattc tgaaagtgga ctccggagac tcggaagtgg tcgatgggct 1561 tatgctccag cacatcactc tcctcatgtg ctcagcttat aggaaccagc tgctcaacat 1621 ttttgtgcgc ccatccttag tagcagtagc attgcagatg acaccagggt tcaggaaaga 1681 ggatgtctac agttgctttc gcttcctacg tgatgttttt gcagatgagt tcatcttcct 1741 tccaggaaac acactaaagg actttgaaga aggctgttac ctgctttgta aaagtgaagc 1801 catacaagtg actacgaaag acatcctagt tacagagaaa ggaaatactg tgttagaatt 1861 tttagtagga ctctttaaac cttttgtgga aagctatcag ataatttgca agtacctttt 1921 gagtgaagaa gaggaccact tcagtgagga acagtacttg gctgcagtca gaaaattcac 1981 aagtcagctt ctcgatcaag gtacctctca atgttatgat gtattatctt ctgatgtgca 2041 gaaaaacgcc ttagcagcct gtgtgaggct cggagtagtg gagaagaaga agataaataa 2101 taactgtata tttaatgtga atgaacctgc cacaaccaaa ttagaagaaa tgcttggttg 2161 taagacacca ataggaaaac cagccactgc aaaactttaa taatcaacaa atagttatgg 2221 aaaattcggt cacgtaatta ctctcatcga aggactcatt acaacaaaca gggaagtaaa 2281 ggaagagaca catcctctca tactccctga gactctgaga acagtggacg cagagggaag 2341 agatgatcat tggaagcaat cagtttactc ttccccacca cagtggttaa aaggcgtttg 2401 tatctgacac tatgtgtgtg ttttaaaata aacttttgga aacatgaaaa aaaaaaaaaa 2461 aaaactcgag // LOCUS HSJ002211 663 bp mRNA PRI 03-FEB-1998 DEFINITION Homo sapiens cDNA for a CXC chemokine. ACCESSION AJ002211 NID g2832410 KEYWORDS CXC chemokine. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 663) AUTHORS Legler,D.F., Baggiolini,M. and Moser,B. TITLE BLR1L, a novel CXC chemokine selective for B lymphocytes JOURNAL Unpublished REFERENCE 2 (bases 1 to 663) AUTHORS Moser,B. TITLE Direct Submission JOURNAL Submitted (05-NOV-1997) Moser B., University of Bern, Theodor Kocher Institute, Freiestrasse 1, CH-3012 Bern, SWITZERLAND FEATURES Location/Qualifiers source 1..663 /organism="Homo sapiens" /db_xref="taxon:9606" /haplotype="diploid" /cell_type="PBL" sig_peptide 35..100 /gene="BLR1L" CDS 35..364 /gene="BLR1L" /codon_start=1 /product="CXC chemokine" /db_xref="PID:e1249325" /db_xref="PID:g2832411" /translation="MKFISTSLLLMLLVSSLSPVQGVLEVYYTSLRCRCVQESSVFIP RRFIDRIQILPRGNGCPRKEIIVWKKNKSIVCVDPQAEWIQRMMEVLRKRSSSTLPVP VFKRKIP" gene 35..364 /gene="BLR1L" mat_peptide 101..361 /gene="BLR1L" BASE COUNT 176 a 136 c 145 g 198 t 8 others ORIGIN 1 cagagctcaa gtctgaactc tacctccaga cagaatgaag ttcatctcga catctctgct 61 tctcatgctg ctggtcagca gcctctctcc agtccaaggt gttctggagg tctattacac 121 aagcttgagg tgtagatgtg tccaagagag ctcagtcttt atccctagac gcttcattga 181 tcgaattcaa atcttgcccc gtgggaatgg ttgtccaaga aaagaaatca tagtctggaa 241 gaagaacaag tcaattgtgt gtgtggaccc tcaagctgaa tggatacaaa gaatgatgga 301 agtattgaga aaaagaagtt cttcaactct accagttcca gtgtttaaga gaaagattcc 361 ctgatgctga tatttccact aagaacacct gcattcttcc cttatccctg ctctgggatt 421 ttagttttgt gcttagttaa atcttttcca gggagaaaga acttccccat acaaataagg 481 catgaggact atgtaaaaat aaccttgcag gagctggatg gggggccaaa ctcaagcttc 541 tttcactcca caggcaccct attntacact tgggggtttt gcnttctttn tttcntcagg 601 gggggggaaa gtttcttttg gaaantagtt nttccagttn ttaggtatta cagggttntt 661 ttt // LOCUS HSJGEBFR 1529 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for lymphocyte IgE receptor (low affinity receptor Fc epsilon R). ACCESSION X04772 NID g34002 KEYWORDS IgE receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1529) AUTHORS Ludin,C., Hofstetter,H., Sarfati,M., Levy,C.A., Suter,U., Alaimo,D., Kilchherr,E., Frost,H. and Delespesse,G. TITLE Cloning and expression of the cDNA coding for a human lymphocyte IgE receptor JOURNAL EMBO J. 6 (1), 109-114 (1987) MEDLINE 87218454 COMMENT Data kindly reviewed (21-AUG-1987) by Hofstetter H. FEATURES Location/Qualifiers source 1..1529 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="B-lymphoblast RPMI 8866" CDS 214..1179 /note="IgE receptor (AA 1-321)" /codon_start=1 /db_xref="PID:g34003" /db_xref="SWISS-PROT:P06734" /translation="MEEGQYSEIEELPRRRCCRRGTQIVLLGLVTAALWAGLLTLLLL WHWDTTQSLKQLEERAARNVSQVSKNLESHHGDQMAQKSQSTQISQELEELRAEQQRL KSQDLELSWNLNGLQADLSSFKSQELNERNEASDLLERLREEVTKLRMELQVSSGFVC NTCPEKWINFQRKCYYFGKGTKQWVHARYACDDMEGQLVSIHSPEEQDFLTKHASHTG SWIGLRNLDLKGEFIWVDGSHVDYSNWAPGEPTSRSQGEDCVMMRGSGRWTDAFCDRK LGAWVCDRLATCTPPASEGSAESMGPDSRPDPDGRLPTPSAPLHS" misc_feature 283..345 /note="put. membrane anchor region" misc_feature 400..408 /note="pot. N-linked glycosylation site" misc_feature 1507..1512 /note="pot polyA signal" polyA_site 1529 /note="polyA site" BASE COUNT 351 a 461 c 437 g 280 t ORIGIN 1 agtggctcta ctttcagaag aaagtgtctc tcttcctgct taaacctctg tctctgacgg 61 tccctgccaa tcgctctggt cgaccccaac acactaggag gacagacaca ggctccaaac 121 tccactaacc agagctgtga ttgtgcccgc tgagtggact gcgttgtcag ggagtgagtg 181 ctccatcatc gggagaatcc aagcaggacc gccatggagg aaggtcaata ttcagagatc 241 gaggagcttc ccaggaggcg gtgttgcagg cgtgggactc agatcgtgct gctggggctg 301 gtgaccgccg ctctgtgggc tgggctgctg actctgcttc tcctgtggca ctgggacacc 361 acacagagtc taaaacagct ggaagagagg gctgcccgga acgtctctca agtttccaag 421 aacttggaaa gccaccacgg tgaccagatg gcgcagaaat cccagtccac gcagatttca 481 caggaactgg aggaacttcg agctgaacag cagagattga aatctcagga cttggagctg 541 tcctggaacc tgaacgggct tcaagcagat ctgagcagct tcaagtccca ggaattgaac 601 gagaggaacg aagcttcaga tttgctggaa agactccggg aggaggtgac aaagctaagg 661 atggagttgc aggtgtccag cggctttgtg tgcaacacgt gccctgaaaa gtggatcaac 721 ttccaacgga agtgctacta cttcggcaag ggcaccaagc agtgggtcca cgcccggtat 781 gcctgtgacg acatggaagg gcagctggtc agcatccaca gcccggagga gcaggacttc 841 ctgaccaagc atgccagcca caccggctcc tggattggcc ttcggaactt ggacctgaag 901 ggagagttta tctgggtgga tgggagccat gtggactaca gcaactgggc tccaggggag 961 cccaccagcc ggagccaggg cgaggactgc gtgatgatgc ggggctccgg tcgctggacc 1021 gacgccttct gcgaccgtaa gctgggcgcc tgggtgtgcg accggctggc cacatgcacg 1081 ccgccagcca gcgaaggttc cgcggagtcc atgggacctg attcaagacc agaccctgac 1141 ggccgcctgc ccaccccctc tgcccctctc cactcttgag catggataca gccaggccca 1201 gagcaagacc ctgaagaccc ccaaccacgg cctaaaagcc tctttgtggc tgaaaggtcc 1261 ctgtgacatt ttctgccacc caaacggagg cagctgacac atctcccgct cctctatggc 1321 ccctgccttc ccaggagtac accccaacag caccctctcc agatgggagt gcccccaaca 1381 gcaccctctc cagatgagag ttacacccca acagcaccct ctccagatgc agccccatct 1441 cctcagcacc ccaggacctg agtatcccca gctcagggtg gtgagtcctc ctgtccagcc 1501 tgcatcaata aaatggggca gtgatggcc // LOCUS HSJUNB 1797 bp RNA PRI 12-SEP-1993 DEFINITION Human jun-B mRNA for JUN-B protein. ACCESSION X51345 NID g34014 KEYWORDS jun-B gene; jun-B protein; nuclear protein; oncogene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1797) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (05-JAN-1990) Nomura N., Molecular Oncology Laboratory, Nippon Medical School, Sakuragi, 1-10-19 Uenosakuragi, Taito-ku, Tokyo 110, Japan REFERENCE 2 (bases 1 to 1797) AUTHORS Nomura,N., Ide,M., Sasamoto,S., Matsui,M., Date,T. and Ishizaki,R. TITLE Isolation of human cDNA clones of jun-related genes, jun-B and jun-D JOURNAL Nucleic Acids Res. 18 (10), 3047-3048 (1990) MEDLINE 90272414 COMMENT For jun-D gene see . Data kindly reviewed (04-JUL-1990) by Nomura N. FEATURES Location/Qualifiers source 1..1797 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="myelogeneous" /cell_line="AML,KG-1" CDS 254..1297 /note="jun-B gene product (AA 1-347)" /codon_start=1 /db_xref="PID:g34015" /db_xref="SWISS-PROT:P17275" /translation="MCTKMEQPFYHDDSYTATGYGRAPGGLSLHDYKLLKPSLAVNLA DPYRSLKAPGARGPGPEGGGGGSYFSGQGSDTGASLKLASSELERLIVPNSNGVITTT PTPPGQYFYPRGGGSGGGAGGAGGGVTEEQEGFADGFVKALDDLHKMNHVTPPNVSLG ATGGPPAGPGGVYAGPEPPPVYTNLSSYSPASASSGGAGAAVGTGSSYPTTTISYLPH APPFAGGHPAQLGLGRGASTFKEEPQTVPEARSRDATPPVSPINMEDQERIKVERKRL RNRLAATKCRKRKLERIARLEDKVKTLKAENAGLSSTAGLLREQVAQLKQKVMTHVSN GCQLLLGVKGHAF" BASE COUNT 340 a 608 c 556 g 293 t ORIGIN 1 ccagcaggga gctgggagct gggggaaacg acgccaggaa agctatcgcg ccagagaggg 61 cgacgggggc tcgggaagcc tgacagggct tttgcgcaca gctgccggct ggctgctacc 121 cgcccgcgcc agcccccgag aacgcgcgac caggcaccca gtccggtcac cgcagcggag 181 agctcgccgc tcgctgcagc gaggcccgga gcggccccgc agggaccctc cccagaccgc 241 ctgggccgcc cggatgtgca ctaaaatgga acagcccttc taccacgacg actcatacac 301 agctacggga tacggccggg cccctggtgg cctctctcta cacgactaca aactcctgaa 361 accgagcctg gcggtcaacc tggccgaccc ctaccggagt ctcaaagcgc ctggggctcg 421 cggacccggc ccagagggcg gcggtggcgg cagctacttt tctggtcagg gctcggacac 481 cggcgcgtct ctcaagctcg cctcttcgga gctggaacgc ctgattgtcc ccaacagcaa 541 cggcgtgatc acgacgacgc ctacaccccc gggacagtac ttttaccccc gcgggggtgg 601 cagcggtgga ggtgcagggg gcgcaggggg cggcgtcacc gaggagcagg agggcttcgc 661 cgacggcttt gtcaaagccc tggacgatct gcacaagatg aaccacgtga caccccccaa 721 cgtgtccctg ggcgctaccg gggggccccc ggctgggccc gggggcgtct acgccggccc 781 ggagccacct cccgtttaca ccaacctcag cagctactcc ccagcctctg cgtcctcggg 841 aggcgccggg gctgccgtcg ggaccgggag ctcgtacccg acgaccacca tcagctacct 901 cccacacgcg ccgcccttcg ccggtggcca cccggcgcag ctgggcttgg gccgcggcgc 961 ctccaccttc aaggaggaac cgcagaccgt gccggaggcg cgcagccggg acgccacgcc 1021 gccggtgtcc cccatcaaca tggaagacca agagcgcatc aaagtggagc gcaagcggct 1081 gcggaaccgg ctggcggcca ccaagtgccg gaagcggaag ctggagcgca tcgcgcgcct 1141 ggaggacaag gtgaagacgc tcaaggccga gaacgcgggg ctgtcgagta ccgccggcct 1201 cctccgggag caggtggccc agctcaaaca gaaggtcatg acccacgtca gcaacggctg 1261 tcagctgctg cttggggtca agggacacgc cttctgaacg tcccctgccc ctttacggac 1321 accccctcgc ttggacggct gggcacacgc ctcccactgg ggtccaggga gcaggcggtg 1381 ggcacccacc ctgggaccta ggggcgccgc aaaccacact ggactccggc ccccctaccc 1441 tgcgcccagt ccttccacct cgacgtttac aagccccccc ttccactttt ttttgtatgt 1501 tttttttctg ctggaaacag actcgattca tattgaatat aatatatttg tgtatttaac 1561 agggagggga agagggggcg atcgcggcgg agctggcccc gccgcctggt actcaagccc 1621 gcggggacat tgggaagggg acccccgccc cctgccctcc cctctctgca ccgtactgtg 1681 gaaaagaaac acgcacttag tctctaaaga gtttatttta agacgtgttt gtgtttgtgt 1741 gtgtttgttc tttttattga atctatttaa gtaaaaaaaa aattggttct ttattaa // LOCUS HSJUND 1612 bp RNA PRI 12-SEP-1993 DEFINITION Human jun-D mRNA for JUN-D protein. ACCESSION X51346 NID g34016 KEYWORDS jun-D gene; jun-D protein; nuclear protein; oncogene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1612) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (05-JAN-1990) Nomura N., Molecular Oncology Laboratory, Nippon Medical School, Sakuragi, 1-10-19 Uenosakuragi, Taito-ku, Tokyo 110, Japan REFERENCE 2 (bases 1 to 1612) AUTHORS Nomura,N., Ide,M., Sasamoto,S., Matsui,M., Date,T. and Ishizaki,R. TITLE Isolation of human cDNA clones of jun-related genes, jun-B and jun-D JOURNAL Nucleic Acids Res. 18 (10), 3047-3048 (1990) MEDLINE 90272414 COMMENT For human jun-B gene see . Data kindly reviewed (01-JUL-1990) by Nomura N. FEATURES Location/Qualifiers source 1..1612 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="umbilical vein" /cell_type="endothelial" CDS 3..914 /note="jun-D gene product (AA 1-303)" /codon_start=1 /db_xref="PID:g34017" /db_xref="SWISS-PROT:P17535" /translation="MKKDALTLSLSEQVAAALKPAAAPPPTPLRADGAPSAAPPDGLL ASPDLGLLKLASPELERLIIQSNGLVTTTPTSSQFLYPKVAASEEQEFAEGFVKALED LHKQNQLGAGAAAAAAAAAAGGPSGTATGSAPPGELAPAAAAPEAPVYANLSSYAGGA GGAGGAATVAFAAEPVPFPPPPPPGALGPPRLAALKDEPQTVPDVPSFGESPPLSPID MDTQERIKAERKRLRNRIAASKCRKRKLERISRLEEKVKTLKSQNTELASTASLLREQ VAQLKQKVLSHVNSGCQLLPQHQVPAY" BASE COUNT 242 a 578 c 510 g 282 t ORIGIN 1 tgatgaagaa ggacgcgctg acgctgagcc tgagtgagca ggtggcggca gcgctcaagc 61 ctgcggccgc gccgcctcct acccccctgc gcgccgacgg cgcccccagc gcggcacccc 121 ccgacggcct gctcgcctct cccgacctgg ggctgctgaa gctggcctcc cccgagctcg 181 agcgcctcat catccagtcc aacgggctgg tcaccaccac gccgacgagc tcacagttcc 241 tctaccccaa ggtggcggcc agcgaggagc aggagttcgc cgagggcttc gtcaaggccc 301 tggaggattt acacaagcag aaccagctcg gcgcgggcgc ggccgctgcc gccgccgccg 361 ccgccgccgg ggggccctcg ggcacggcca cgggctccgc gccccccggc gagctggccc 421 cggcggcggc cgcgcccgaa gcgcctgtct acgcgaacct gagcagctac gcgggcggcg 481 ccgggggcgc ggggggcgcc gcgacggtcg ccttcgctgc cgaacctgtg cccttcccgc 541 cgccgccacc cccaggcgcg ttggggccgc cgcgcctggc tgcgctcaag gacgagccac 601 agacggtgcc cgacgtgccg agcttcggcg agagcccgcc gttgtcgccc atcgacatgg 661 acacgcagga gcgcatcaag gcggagcgca agcggctgcg caaccgcatc gccgcctcca 721 agtgccgcaa gcgcaagctg gagcgcatct cgcgcctgga agagaaagtg aagaccctca 781 agagtcagaa cacggagctg gcgtccacgg cgagcctgct gcgcgagcag gtggcgcagc 841 tcaagcagaa agtcctcagc cacgtcaaca gcggctgcca gctgctgccc cagcaccagg 901 tgcccgcgta ctgagtccgc gcgcggggcg catgcgcggc caccctcccc aaggggcggg 961 ctcgcggggg ggtgtcgtgg gcgccccgga cttggagagg gtgcggccct ggggaccccc 1021 ccctccccga gtgtgcccag gaactcagag agggcgcggc ccccggggat tcccccccga 1081 gggtgcccag gactcggaag gggcgccccg gactcgacaa gctggacccc ctgctcccgg 1141 gggggcgagc gcatgacccc cccgccctcg cgctgcctct ttcccccgcg cggccgcccc 1201 gtgttgcaca aacccgcgcg tctcggctgc ccctttgtac accgcgccgc ggaagggggc 1261 tccgaggggg cgcagcctca aaccctgcct ttcctttact tttacttttt tttttttttc 1321 ctttggaaga gagaagaaca gagtgttcga ttctgcccta tttatgtttc tactcgggaa 1381 caaacgttgg ttgtgtgtgt gtgtgttttc ttgtgttggt tttttaaaga aatgggaaga 1441 agaaaaaaaa attctccgcc cctttcctcg atctcgctcc cccttcggtt ctttcgaccg 1501 gtcccccctc ccttttttgt tctgttttgt tttgttttgc tacgagtcca cattcctgtt 1561 tgtaatcctt ggttcgcccg gttttctgtt ttcagtaaag tctcgttacg cc // LOCUS HSKALIG 4094 bp RNA PRI 16-JAN-1995 DEFINITION H.sapiens KALIG-1 mRNA for neural cell adhesion and axonal path-finding molecule homologue. ACCESSION X60299 NID g34024 KEYWORDS axonal pathfinding molecule homologue; KALIG-1 gene; Kallmann syndrome interval gene; neural cell adhesion molecule homologue. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4094) AUTHORS Franco,B. TITLE Direct Submission JOURNAL Submitted (17-SEP-1991) B. Franco, Baylor College of Medicine, One Baylor Plaza, 77030 Houston, Texas, USA REFERENCE 2 (bases 1 to 4094) AUTHORS Franco,B., Guioli,S., Pragliola,A., Incerti,B., Bardoni,B., Tonlorenzi,R., Carrozzo,R., Maestrini,E., Pieretti,M., Taillon-Miller,P., Brown,C.J., Willard,F.H., Lawrence,C.B., Persico,G.M., Camerino,G. and Ballabio,A. TITLE A gene deleted in Kallmann's syndrome shares homology with neural cell adhesion and axonal path-finding molecules JOURNAL Nature 353 (6344), 529-536 (1991) MEDLINE 92018217 FEATURES Location/Qualifiers source 1..4094 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="total embryo, fetal brain male (Clontech), fetal brain female (Stratagene)" /clone="FB2;23A;26;29;12;23B, TE4 and TC3" /chromosome="X" /map="Xp22.3" sig_peptide 64..123 /gene="KALIG-1" CDS 64..2106 /gene="KALIG-1" /note="Kallmann syndrome interval gene" /codon_start=1 /db_xref="PID:g34025" /db_xref="SWISS-PROT:P23352" /translation="MVPGVPGAVLTLCLWLAASSGCLAAGPGAAAARRLDESLSAGSV QRAPCASRCLSLQITRISAFFQHFQNNGSLVWCQNHKQCSKCLEPCKESGDLRKHQCQ SFCEPLFPKKSYECLTSCEFLKYILLVKQGDCPAPEKASGFAAACVESCEVDNECSGV KKCCSNGCGHTCQVPKTLYKGVPLKPRKELRFTELQSGQLEVKWSSKFNISIEPVIYV VQRRWNYGIHPSEDDATHWQTVAQTTDERVQLTDIRPSRWYQFRVAAVNVHGTRGFTA PSKHFRSSKDPSAPPAPANLRLANSTVNSDGSVTVTIVWDLPEEPDIPVHHYKVFWSW MVSSKSLVPTKKKRRKTTDGFQNSVILEKLQPDCDYVVKLQAITYWGQTRLKSAKVSL HFTSTHATNNKEQLVKTRKGGIQTQLPFQRRRPTRPLEVGAPFYQDGQLQVKVYWKKT EDPTVNRYHVRWFPEACAHNRTTGSEASSGMTHENYIILQDLSFSCKYKVTVQPIRPK SHSKAEAVFFTTPPCSALKGKSHKPIGCLGERGHVLSKVLAKPENLSASFIVQDVNIT GHFSWKMAKANLYQPMTGFQVTWAEVTTESRQNSLPNSIISQSQILPSDHYVLTVPNL RPSTLYRLEVQVLTPGGEGPATIKTFRTPELPPSSAHRSHLKHRHPHHYKPSPERY" gene 64..2106 /gene="KALIG-1" mat_peptide 124..2104 /gene="KALIG-1" /note="Kallmann syndrome interval gene" BASE COUNT 1183 a 921 c 872 g 1118 t ORIGIN 1 gaattccagc aaggagcctc ggcccgcgcc cggcgccctc gccctcgccc tcgacccgca 61 gccatggtgc ccggggtgcc cggcgcggtc ctgaccctct gcctctggct ggcggcctcc 121 agcggctgcc tggcggccgg ccccggcgcg gctgctgcgc ggcggctgga cgagtcgctg 181 tctgccggga gcgtccagcg cgctccgtgc gcctccaggt gcctgagcct gcagatcact 241 cgcatctccg ccttcttcca gcacttccag aacaatggtt ccctggtttg gtgccagaat 301 cacaagcaat gttctaagtg cctggagccc tgcaaggaat caggggacct gaggaaacac 361 cagtgccaaa gcttttgtga gcctctcttc cccaagaaga gctacgaatg cttgaccagc 421 tgtgagttcc tcaaatacat cctgttggtg aagcaggggg actgtccggc tcctgagaaa 481 gccagtggat ttgcggccgc ctgtgttgaa agctgcgaag ttgacaatga gtgctctggg 541 gtgaagaaat gttgttcgaa tgggtgtgga cacacctgtc aagtacccaa gactctgtac 601 aaaggtgtcc ccctgaagcc cagaaaagag ttacgattta cagaactgca gtctggacag 661 ctggaggtta agtggtcctc gaaattcaat atttctattg agcctgtgat ctatgtggta 721 caaagaagat ggaattatgg aatccatcct agcgaagatg acgccactca ctggcagaca 781 gtggcccaga ccacagacga gcgagttcaa ctgactgaca taagacccag ccgatggtac 841 cagtttcgag tggctgctgt gaatgtgcat ggaactcgag gcttcactgc ccccagcaaa 901 cacttccgtt cttccaaaga tccatctgcc ccaccagcac cggctaacct ccggctggcc 961 aactccaccg tcaacagtga tgggagtgtg accgtcacta tagtttggga tctccccgag 1021 gagccggaca tccctgtgca tcattacaag gtcttttgga gctggatggt cagcagtaag 1081 tctcttgtcc caacaaagaa gaagcggaga aagactacgg atgggtttca aaattctgtg 1141 atcctggaga aactccagcc agactgtgac tatgttgtga aattgcaagc cataacgtac 1201 tggggacaga cacggctgaa gagtgcaaag gtgtcccttc acttcacatc gacacatgca 1261 accaacaaca aagaacagct tgtgaaaact agaaaaggtg gaattcaaac acaactccct 1321 tttcaaagac gacgacccac tcgcccgctg gaagtcggag ctcccttcta tcaggatggc 1381 caactgcaag ttaaagtcta ctggaagaag acagaagatc ccactgtcaa ccgatatcat 1441 gtgcggtggt ttcctgaagc gtgtgcccac aacagaacaa ccggatcaga ggcatcatct 1501 ggcatgaccc acgaaaatta cataattctt caagatctgt cattttcctg caagtataag 1561 gtgactgtcc aaccaatacg gccaaaaagt cactccaagg cagaagctgt tttcttcact 1621 actccaccat gctctgctct taaggggaag agccacaagc ctattggctg cctgggcgaa 1681 cgaggtcatg ttctttctaa ggtgctagct aagcctgaga acctttctgc ttcattcatc 1741 gtccaggatg tgaacatcac cggtcacttt tcttggaaga tggccaaggc caatctctat 1801 cagcccatga ctgggtttca agtgacttgg gctgaggtca ctacggaaag cagacagaac 1861 agcctaccca acagcattat ttcacagtcc cagattctgc cttccgatca ttatgtccta 1921 acagtgccca atctgagacc atctactctt taccgactgg aagtgcaagt gctgacccca 1981 ggaggggagg ggccggccac catcaagacg ttccggacgc cggagctccc accctcttca 2041 gcacacagat ctcatcttaa gcatcgtcat ccacatcatt acaagccttc tccagaaaga 2101 tactaaactg ttcaaaaaga ttttgtgaaa ttgcacagat gtgtaagctt gttgaacttc 2161 ggccacgaga catgcacact tccagaggca gtgggaactg ctcagaggcc cggactctcc 2221 tatgtgactt tagtgcagga agaacttctg tcaatcatgg acgcatctgg agacaagtga 2281 gaaacagtag attggtgaag acagacacca gttccctaca agcatggaga aaatgaagaa 2341 taggcctgtt taatgctaaa ttttgttttc atgtatggtg tcgctcattt ctattgaatt 2401 acaacagaac tcagttttcc ctgaatttgg agcaccaaac tcgcccaaaa ggagagtaac 2461 aaatacacaa ttcacacata acactaagcg taaatctaat caataaaata tatttttgac 2521 taaattattg attcgatatg aaaaatcaag taagattaca cagctttgtt tttttgaatc 2581 tttcctaaga tcatttttat cctacgtgat ttttaaatga aaatgtgtaa tctaaaatat 2641 accagcgaat ttaaatctaa aaatgctcct actttaagta ccttgtgctg ctctttatgc 2701 aaaggtaaat caaagttccc tctataaatt atgatttaca aaagactccc aagccagagg 2761 aactcaatga aataagctgc taatcagatt ttaccttgga gaaatgaaat tatttcttgg 2821 ggatgccttt aatatttgat cctattatgt gagagatttt cctgatatgt tatcttattt 2881 atatttccct tattttcctc aatgcagata atagcttttg gtgcactttt gtttcaccat 2941 ctgaaaattc acaaaacttc ttgcttcaaa tgaaaaaatc ccaactattg agcatgttta 3001 aatctttgca gagatttgcc ttttcttaat caaagtaagg tctttgtgtg ctagtatatt 3061 attggtaatg ttttaaaaat tcctttgatt gatagagaag gacagttatt tgcatttaat 3121 tcacccatat gctttcaaat ctagtatatc ttactttttg gaaatgtttt atgctacaaa 3181 ttcgtgcctt gtagcatgaa cttaagtcaa aacgtgttat caatatagag tgttgcagtg 3241 tatattgtaa caacctaaaa cgcagagaag tttaatttaa tactgttttt tttcttgaag 3301 gaatactcac atacatggtt tgaaatgtgc atagatatgc atgtctatat aattataaat 3361 gcatgtgtat atatatgcaa atatatgtac atatacatgt atatacacac agacacatgc 3421 atatacatga atataccttg agcatgaatc cctggagaaa tcgttttcgt agctcaccaa 3481 tggtgagtaa agatacagct cttttaaagg tcataaggat aatatatttt ccccatcaat 3541 gctgattctg agaaaagagc aatttatcaa aattaaacac tgtaaaagaa aggtgtccat 3601 atgtctttac ctacctaagt aaaacaggaa gaaaatcagt aacattatcc ttaggttttg 3661 acaatggtac ttgcttcttg ttgttttatt gtttcctgaa ttcatgcaga tgcctggttt 3721 tcctggaaga gtggataact cagaagtcac tgtactccac agagcctcac tgcagtgtct 3781 taaaggtaga tgcaattaaa atgcaggaaa aaaaaacttt tctgatgttg atgcatgtct 3841 ttgggaaaca catttataaa catggatacc tgataataga tattgaaacc catttcctgt 3901 gtgttaaaat atttaaaaag tggatattcc aggaatgttt tgcagctttg tacaagtaac 3961 ataaattgga gacctcagaa tgaaagttca tgttggttct gaatggttca ctgcagctcc 4021 tgtcacaagc tgggatggat ttatcacatt gagttatgaa attacctggt tctaagaatt 4081 tttgagggaa ttcc // LOCUS HSKALLI 871 bp RNA PRI 31-MAR-1995 DEFINITION Human mRNA for preprokallikrein (EC 3.4.21). ACCESSION X13561 NID g34026 KEYWORDS kallikrein; protease; secretory protein; serine protease. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 871) AUTHORS Appelhans,H. TITLE Direct Submission JOURNAL Submitted (17-NOV-1988) Appelhans H., Technische Hochschule Darmstadt, Institut fuer Biochemie, Petersenstrasse 22, 6100 Darmstadt, FRG REFERENCE 2 (bases 1 to 871) AUTHORS Angermann,A., Bergmann,C. and Appelhans,H. TITLE Cloning and expression of human salivary-gland kallikrein in Escherichia coli JOURNAL Biochem. J. 262 (3), 787-793 (1989) MEDLINE 90073574 FEATURES Location/Qualifiers source 1..871 /organism="Homo sapiens" /db_xref="taxon:9606" /haplotype="22,X,Y" /tissue_type="salivary gland" /cell_type="secretoty cells" /clone_lib="pUC9" /clone="pKA21" CDS 37..825 /codon_start=1 /product="preprokallikrein (AA -24 to 238)" /db_xref="PID:g34027" /db_xref="SWISS-PROT:P06870" /translation="MWFLVLCLALSLGGTGAAPPIQSRIVGGWECEQHSQPWQAALYH FSTFQCGGILVHRQWVLTAAHCISDNYQLWLGRHNLFDDENTAQFVHVSESFPHPGFN MSLLENHTRQADEDYSHDLMLLRLTEPADTITDAVKVVELPTEEPEVGSTCLASGWGS IEPENFSFPDDLQCVDLKILPNDECKKAHVQKVTDFMLCVGHLEGGKDTCVGDSGGPL MCDGVLQGVTSWGYVPCGTPNKPSVAVRVLSYVKWIEDTIAENS" sig_peptide 37..108 /note="signal peptide (AA -24 to -1)" mat_peptide 109..822 /note="prokallikrein (AA 1 - 238)" misc_feature 851..856 /note="polyA signal" BASE COUNT 179 a 260 c 241 g 191 t ORIGIN 1 tcctccacct gctggcccct ggacacctct gtcaccatgt ggttcctggt tctgtgcctc 61 gccctgtccc tgggggggac tggtgctgcg cccccgattc agtcccggat tgtgggaggc 121 tgggagtgtg agcagcattc ccagccctgg caggcggctc tgtaccattt cagcactttc 181 cagtgtgggg gcatcctggt gcaccgccag tgggtgctca cagctgctca ttgcatcagc 241 gacaattacc agctctggct gggtcgccac aacttgtttg acgacgaaaa cacagcccag 301 tttgttcatg tcagtgagag cttcccacac cctggcttca acatgagcct cctggagaac 361 cacacccgcc aagcagacga ggactacagc cacgacctca tgctgctccg cctgacagag 421 cctgctgata ccatcacaga tgctgtgaag gtcgtggagt tgcccaccga ggaacccgaa 481 gtggggagca cctgtttggc ttccggctgg ggcagcatcg aaccagagaa tttctcattt 541 ccagatgatc tccagtgtgt ggacctcaaa atcctgccta atgatgagtg caaaaaagcc 601 cacgtccaga aggtgacaga cttcatgctg tgtgtcggac acctggaagg tggcaaagac 661 acctgtgtgg gtgattcagg gggcccgctg atgtgtgatg gtgtgctcca aggtgtcaca 721 tcatggggct acgtcccttg tggcaccccc aataagcctt ctgtcgccgt cagagtgctg 781 tcttatgtga agtggatcga ggacaccata gcggagaact cctgaacgcc cagccctgtc 841 ccctaccccc agtaaaatca aatgtgcatc c // LOCUS HSKATP 1260 bp RNA PRI 09-JAN-1995 DEFINITION H.sapiens mRNA for KATP (cardiac). ACCESSION X83582 NID g619878 KEYWORDS potassium channel. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1260) AUTHORS Ashford,M.L., Bond,C.T., Blair,T.A. and Adelman,J.P. TITLE Cloning and functional expression of a rat heart KATP channel JOURNAL Nature 370 (6489), 456-459 (1994) MEDLINE 94322936 REFERENCE 2 (bases 1 to 1260) AUTHORS Adelman,J.P. TITLE Direct Submission JOURNAL Submitted (19-DEC-1994) J.P. Adelman, Vollum Insitute for Advanced Biomedical, Research (VIABR), Oregon Health Sciences Univeristy, L474, 3181 SW Sam Jackson Park Rd., Portland OR 97201, USA FEATURES Location/Qualifiers source 1..1260 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="human pancreas t10 library from Clontech" CDS 1..1260 /codon_start=1 /product="potassium channel" /db_xref="PID:g619879" /db_xref="SWISS-PROT:P48544" /translation="MAGDSRNAMNQDMEIGVTPWDPKKIPKQARDYVPIATDRTRLLA EGKKPRQRYMEKSGKCNVHHGNVQETYRYLSDLFTTLVDLKWRFNLLVFTMVYTVTWL FFGFIWWLIAYIRGDLDHVGDQEWIPCVENLSGFVSAFLFSIETETTIGYGFRVITEK CPEGIILLLVQAILGSIVNAFMVGCMFVKISQPKKRAETLMFSNNAVISMRDEKLCLM FRVGDLRNSHIVEASIRAKLIKSRQTKEGEFIPLNQTDINVGFDTGDDRLFLVSPLII SHEINEKSPFWEMSQAQLHQEEFEVVVILEGMVEATGMTCQARSSYMDTEVLWGHRFT PVLTLEKGFYEVDYNTFHDTYETNTPSCCAKELAEMKREGRLLQYLPSPPLLGRCAEA GLDAEAEQNEEDEPKGLGGSREARGSV" BASE COUNT 285 a 357 c 359 g 259 t ORIGIN 1 atggctggtg attctaggaa tgccatgaac caggacatgg agattggagt cactccctgg 61 gaccccaaga agattccaaa acaggcccgc gattatgtcc ccattgccac agaccgtacg 121 cgcctgctgg ccgagggcaa gaagccacgc cagcgctaca tggagaagag cggcaagtgc 181 aacgtgcacc acggcaacgt ccaggagacc taccggtacc tgagtgacct cttcaccacc 241 ctggtggacc tcaagtggcg cttcaacttg ctcgtcttca ccatggttta cactgtcacc 301 tggctgttct tcggcttcat ttggtggctc attgcttata tccggggtga cctggaccat 361 gttggcgacc aagagtggat tccttgtgtt gaaaacctca gtggcttcgt gtccgctttc 421 ctgttctcca ttgagaccga aacaaccatt gggtatggct tccgagtcat cacagagaag 481 tgtccagagg ggattatact cctcttggtc caggccatcc tgggctccat cgtcaatgcc 541 ttcatggtgg ggtgcatgtt tgtcaagatc agccagccca agaagagagc ggagaccctc 601 atgttttcca acaacgcagt catctccatg cgggacgaga agctgtgcct catgttccgg 661 gtgggcgacc tccgcaactc ccacatcgtg gaggcctcca tccgggccaa gctcatcaag 721 tcccggcaga ccaaagaggg ggagttcatc cccctgaacc agacagacat caacgtgggc 781 tttgacacgg gcgacgaccg cctcttcctg gtgtctcctc tgatcatctc ccacgagatc 841 aacgagaaga gccctttctg ggagatgtct caggctcagc tgcatcagga agagtttgaa 901 gttgtggtca ttctagaagg gatggtggaa gccacaggca tgacctgcca agcccggagc 961 tcctacatgg atacagaggt gctctggggc caccgattca caccagtcct caccttggaa 1021 aagggcttct atgaggtgga ctacaacacc ttccatgata cctatgagac caacacaccc 1081 agctgctgtg ccaaggagct ggcagaaatg aagagggaag gccggctcct ccagtacctc 1141 cccagccccc cactgctggg gcggtgtgct gaggcagggc tggatgcaga ggctgagcag 1201 aatgaagaag atgagcccaa ggggctgggt gggtccaggg aggccagggg ctcggtgtga // LOCUS HSKCC 3722 bp mRNA PRI 06-JUL-1996 DEFINITION Human K-Cl cotransporter (hKCC1) mRNA, complete cds. ACCESSION U55054 NID g1399211 KEYWORDS bumetanide, furosemide, K-Cl cotransporter, membrane transport, transmembrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3722) AUTHORS Gillen,C.M., Brill,S., Payne,J.A. and Forbush,B. 3rd. TITLE Molecular cloning and functional expression of the K-Cl cotransporter from rabbit, rat, and human. A new member of the cation-chloride cotransporter family JOURNAL J. Biol. Chem. 271 (27), 16237-16244 (1996) MEDLINE 96279170 REFERENCE 2 (bases 1 to 3722) AUTHORS Gillen,C.M. TITLE Direct Submission JOURNAL Submitted (15-APR-1996) Christopher M. Gillen, Cellular and Molecular Physiology, Yale University School of Medicine, 333 Cedar St., New Haven, CT 06520, USA FEATURES Location/Qualifiers source 1..3722 /organism="Homo sapiens" /note="Reported sequence is a composite from complete sequencing of IMAGE consortium EST cDNA clones #154000 (R67622, R67622), #159363 (H15821, H161129), #34953 (R19769, R45189), #151758 (H02923, H04227), and from a reverse transcriptase PCR product from human embryonic kidney tissue culture cells (HEK 293)" /db_xref="taxon:9606" /chromosome="16" /map="16q22.1" gene 56..3313 /gene="hKCC1" CDS 56..3313 /gene="hKCC1" /note="ion cotransport protein" /codon_start=1 /product="K-Cl cotransporter" /db_xref="PID:g1399212" /translation="MPHFTVVPVDGPRRGDYDNLEGLSWVDYGERAELDDSDGHGNHR ESSPFLSPLEASRGIDYYDRNLALFEEELDIRPKVSSLLGKLVSYTNLTQGAKEHEEA ESGEGTRRRAAEAPSMGTLMGVYLPCLQNIFGVILFLRLTWMVGTAGVLQALLIVLIC CCCTLLTAISMSAIATNGVVPAGGSYFMISRSLGPEFGGAVGLCFYLGTTFAAAMYIL GAIEILLTYIAPPAAIFYPSGAHDTSNATLNNMRVYGTIFLTFMTLVVFVGVKYVNKF ASLFLACVIISILSIYAGGIKSIFDPPVFPVCMLGNRTLSRDQFDICAKTAVVDNETV ATQLWSFFCHSPNLTTDSCDPYFMLNNVTEIPGIPGAAAGVLQENLWSAYLEKGDIVE KHGLPSADAPSLKESLPLYVVADIATSFTVLVGIFFPSVTGIMAGSNRSGDLRDAQKS IPVGTILAIITTSLVYFSSVVLFGACIEGVVLRDKYGDGVSRNLVVGTLAWPSPWVIV IGSFFSTCGAGLQSLTGAPRLLQAIAKDNIIPFLRVFGHGKVNGEPTWALLLTALIAE LGILIASLDMVAPILSMFFLMCYLFVNLACAVQTLLRTPNWRPRFKYYHWALSFLGMS LCLALMFVSSWYYALVAMLIAGMIYKYIEYQGAEKEWGDGIRGLSLSAARYALLRLEE GPPHTKNWRPQLLVLLKLDEDLHVKYPRLLTFASQLKAGKGLTIVGSVIQGSFLESYG EAQAAEQTIKNMMEIEKVKGFCQVVVASKVREGLAHLIQSCGLGGMRHNSVVLGWPYG WRQSEDPRAWKTFIDTVRCTTAAHLALLVPKNIAFYPSNHERYLEGHIDVWWIVHDGG MLMLLPFLLRQHKVWRKCRMRIFTVAQMDDNSIQMKKDLAVFLYHLRLEAEVEVVEMH NSDISAYTYERTLMMEQRSQMLRQMRLTKTEREREAQLVKDRHSALRLESLYSDEEDE SAVGADKIQMTWTRDKYMTETWDPSHAPDNFRELVHIKPDQSNVRRMHTAVKLNEVIV TRSHDARLVLLNMPGPPRNSEGDENYMEFLEVLTEGLERVLLVRGGGREVITIYS" BASE COUNT 712 a 1088 c 1126 g 796 t ORIGIN 1 acgaggcagc ggcgggcggc tgggacggcg ggtgcggcgg ggccgagccc gcacgatgcc 61 tcacttcacc gtggtgccag tggacgggcc gaggcgcggc gactatgaca acctcgaggg 121 gctcagttgg gtggactacg gggagcgcgc cgagctggat gactcggacg gacatggcaa 181 ccacagagag agcagccctt ttctttcccc cttggaggct tccagaggaa ttgactacta 241 tgacaggaac ctggcactgt ttgaggaaga gctggacatc cgcccaaagg tatcgtctct 301 tctgggaaag ctcgtcagct acaccaacct cacccagggc gccaaagagc atgaggaggc 361 cgagagtggg gagggcaccc gccggagggc agccgaggca cccagcatgg gcaccctcat 421 gggggtgtac ctgccctgcc tgcagaatat ctttggggtt atcctcttcc tgcggctgac 481 ctggatggtg ggcacagcag gtgtgctaca ggccctcctc atcgtgctta tctgctgctg 541 ttgtaccctg ctgacggcca tctccatgag tgccatcgcc accaacggtg tggttccagc 601 tgggggctcc tatttcatga tctctcgttc actggggcca gaatttggag gtgctgtggg 661 cctgtgcttc tacctgggaa caacattcgc agcagccatg tacatcctgg gggccatcga 721 gatcttgctg acctacattg ccccaccagc tgccattttt tacccatcgg gtgctcatga 781 cacgtcgaat gccactttga acaatatgcg tgtgtatggg accattttcc tgaccttcat 841 gaccctggtg gtgtttgtgg gggtcaagta tgtgaacaaa tttgcctcgc tcttcctggc 901 ctgtgtgatc atctccatcc tctccatcta tgctgggggc ataaagtcta tatttgaccc 961 tcccgtgttt ccggtatgca tgctgggcaa caggaccctg tcccgggacc agtttgacat 1021 ctgtgccaag acagctgtag tggacaatga gacagtggcc acccagctat ggagtttctt 1081 ctgccacagc cccaacctta cgaccgactc ctgtgacccc tacttcatgc tcaacaatgt 1141 gaccgagatc cctggcatcc ccggggcagc tgctggtgtg ctccaggaaa acctgtggag 1201 cgcctacctg gagaagggtg acatcgtgga gaagcatggg ctgccctccg cagatgcccc 1261 gagcctgaag gagagcctgc ctctgtacgt ggtcgctgac atcgccacat ccttcaccgt 1321 gctggtcggc atcttcttcc cttctgtaac aggcatcatg gctggctcaa accgctctgg 1381 ggaccttcgt gacgcccaga agtctatccc tgtggggacc attctggcca tcattacaac 1441 ttccctcgtg tacttcagca gtgtggttct ctttggtgcc tgcattgagg gtgtggttct 1501 ccgggacaag tatggcgatg gtgtcagcag gaacttggtg gtgggcacac tggcctggcc 1561 ttcaccctgg gtcatcgtca tcggctcctt cttttcaacg tgtggcgctg gcctccagag 1621 cctcacaggg gcaccacgcc tattgcaggc cattgccaag gacaacatca tccccttcct 1681 ccgggtgttt ggccacggga aggtgaatgg tgaacccaca tgggcactcc tcctgacggc 1741 actcatcgcc gagctgggca tcctcatcgc ctccctcgac atggtggccc ccatcttatc 1801 catgttcttt ctgatgtgct acctgttcgt gaacctcgcc tgtgcggtgc agacactcct 1861 gaggaccccc aactggcggc cccggttcaa gtactatcac tgggcgctgt ccttcctggg 1921 catgagtctc tgcctggccc ttatgtttgt ctcctcctgg tactatgccc tggtggccat 1981 gctcatcgcc ggcatgatct acaaatacat cgagtaccaa ggggctgaga aggagtgggg 2041 tgacgggatc cgaggcctgt ccctgagcgc tgcccgctac gcgctgttgc ggctggagga 2101 ggggcctcct cacaccaaga actggcggcc gcagctgctg gtgctgctga agctggacga 2161 ggacctccac gtgaagtacc cgcggctcct caccttcgcc tcccagctca aggctggcaa 2221 gggcctgacc attgttggtt ctgtcatcca ggggagcttc ttggagagct atggcgaggc 2281 tcaggccgcc gagcagacca tcaagaacat gatggaaatt gagaaggtga agggcttctg 2341 ccaggtggtg gtggccagca aggtgcggga ggggctggcc cacctcatcc agtcctgtgg 2401 cctgggaggc atgcggcata actccgtggt gctgggctgg ccctacggct ggcgacagag 2461 cgaggacccc cgtgcctgga agaccttcat tgacaccgtg cgctgcacta cggctgccca 2521 cctggccctg ctcgtgccca agaacatcgc cttctacccc agcaaccacg agcgctacct 2581 ggagggccac atagacgtgt ggtggatcgt gcacgatggt ggcatgctca tgcttctgcc 2641 cttcctgctg cgccagcata aggtctggag gaagtgccgg atgcgcatct tcacagtggc 2701 ccagatggat gacaacagca tccagatgaa gaaggacctg gctgtctttc tgtaccatct 2761 gcgccttgag gccgaggtgg aggtggtgga gatgcataac agtgacatct ctgcatacac 2821 ctacgagcgg acgctgatga tggagcagcg gtcgcagatg ctgcggcaga tgagactgac 2881 caagactgag cgggagcgag aagcccagct ggtcaaggat cggcactcgg ccctgcggct 2941 ggagagcctg tactcggacg aggaagatga gtctgcagtg ggggctgaca agatccagat 3001 gacgtggacc agggacaagt acatgactga gacctgggac cccagccatg cccctgacaa 3061 tttccgggag ctggtgcaca ttaagccgga ccaatccaat gtgcggcgca tgcacactgc 3121 tgtgaagctc aatgaagtca ttgtcacgcg ctcccacgac gcccgcctgg ttctcctaaa 3181 catgcctggc ccacccagga acagtgaggg cgacgagaac tacatggagt tcctcgaggt 3241 gctgaccgag ggccttgagc gggtgctgtt ggtgcgcggt ggtggccgtg aagtcatcac 3301 catctactcc tgagcccagt gtcatcttgt ggcctggagt cgaggtcttg gccaggacat 3361 aacaagctgt ggtctggggt aacagcctct tcccagcacc cacctgccag ccctgcttgc 3421 ctggccctgt cctggaccca gctttgctag gtctccttgg aaaccaggcc tgggcctcaa 3481 aatggagatg gatcccaggt cttgtgggac cctgggatgt ttggggactt tactatctag 3541 caccccagta ggcctgtcct ggccagagaa gactggtagg ggccgagtgg ggtttgaagg 3601 cagccggccc ggcccagccc aggagcgcta tttattgcat atttattgtt tggatgtcac 3661 catcagagac gaagggaagg gtagccaggg agggagtcca gcccagctgc ctgcaggaag 3721 at // LOCUS HSKDEL 1086 bp RNA PRI 17-JUN-1991 DEFINITION Human mRNA for a presumptive KDEL receptor. ACCESSION X55885 NID g34030 KEYWORDS ERD2 gene; KDEL receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1086) AUTHORS Lewis,M.J. TITLE Direct Submission JOURNAL Submitted (13-SEP-1990) Lewis M.J., MRC, Lab. of Molecular Biology, Hills Road, Cambridge CB2 2QH, UK REFERENCE 2 (bases 1 to 1086) AUTHORS Lewis,M.J. and Pelham,H.R. TITLE A human homologue of the yeast HDEL receptor JOURNAL Nature 348 (6297), 162-163 (1990) MEDLINE 91043069 FEATURES Location/Qualifiers source 1..1086 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="CDM8 cDNA" mRNA 1..1086 CDS 147..785 /note="homologue of the yeast HDEL receptor" /codon_start=1 /product="KDEL receptor" /db_xref="PID:g34031" /db_xref="SWISS-PROT:P24390" /translation="MNLFRFLGDLSHLLAIILLLLKIWKSRSCAGISGKSQVLFAVVF TARYLDLFTNYISLYNTCMKVVYIACSFTTVWLIYSKFKATYDGNHDTFRVEFLVVPT AILAFLVNHDFTPLEILWTFSIYLESVAILPQLFMVSKTGEAETITSHYLFALGVYRT LYLFNWIWRYHFEGFFDLIAIVAGLVQTVLYCDFFYLYITKVLKGKKLSLPA" BASE COUNT 210 a 339 c 254 g 283 t ORIGIN 1 ctaaaggtcc cctccccgga gcggagcgca cctagggtcc ctcttccgtc cccccagccc 61 agctacccgt tcagaccagc agcctcgggg ggcacccccc gccagcctgc ctccctcccg 121 ctcagccctg ccagggttcc ccagccatga atctcttccg attcctggga gacctctccc 181 acctcctcgc catcatcttg ctactgctca aaatctggaa gtcccgctcg tgcgccggaa 241 tttcagggaa gagccaggtc ctgtttgctg tggtgttcac tgcccgatat ctggacctct 301 tcaccaacta catctcactc tacaacacgt gtatgaaggt ggtctacata gcctgctcct 361 tcaccacggt ctggttgatt tatagcaagt tcaaagctac ttacgatggg aaccatgaca 421 cgttcagagt ggagttcctg gtcgttccca cagccattct ggcgttcctg gtcaatcatg 481 acttcacccc tctggagatc ctctggacct tctccatcta cctggagtca gtggccatct 541 tgccgcagct gttcatggtg agcaagaccg gcgaggcgga gaccatcacc agccactact 601 tgtttgcgct aggcgtttac cgcacgctct atctcttcaa ctggatctgg cgctaccatt 661 tcgagggctt cttcgacctc atcgccattg tggcaggcct ggtccagaca gtcctctact 721 gcgatttctt ctacctctat atcaccaaag tcctaaaggg gaagaagttg agtttgccgg 781 catagccccg gtcctctcca tctctctcct cggcagcagc gggaggcaga ggaaggcggc 841 agaagatgaa gagctttccc atccaggggt gactttttta agaacccacc tcttgtgctc 901 cccatcccgc ctcctgccgg gtttcagggg gacagtggag gatccaggtc ttggggagct 961 caggacttgg gctgtttgta gttttttgcc ttttagacaa gaaaaaaaaa tctttccact 1021 ctttagtttt tgattctgat gactcgtttt ttcttctact ctgtggcccc aaattttata 1081 aagtga // LOCUS HSKHCMR 3688 bp RNA PRI 10-JUN-1992 DEFINITION H.sapiens mRNA for kinesin (heavy chain). ACCESSION X65873 NID g34082 KEYWORDS kinesin; kinesin heavy chain. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3688) AUTHORS Vale,R.D. TITLE Direct Submission JOURNAL Submitted (28-APR-1992) R.D. Vale, Univ of California at San Francisco, Dept of Pharmacology, Box 0450, San Francisco CA 94143, USA REFERENCE 2 (bases 1 to 3688) AUTHORS Navone,F., Niclas,J., Hom-Booher,N., Sparks,L., Bernstein,H., McCaffrey,G. and Vale,R. TITLE Cloning and expression of a human kinesin heavy chain gene: Interaction of the C-terminal domain with cytoplasmic microtubules in transfected CV-1 cells JOURNAL J. Cell Biol. In press FEATURES Location/Qualifiers source 1..3688 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone="HK-3, HK-21, HK-H8" CDS 314..3205 /codon_start=1 /product="kinesin heavy chain" /db_xref="PID:g34083" /db_xref="SWISS-PROT:P33176" /translation="MADLAECNIKVMCRFRPLNESEVNRGDKYIAKFQGEDTVVIASK PYAFDRVFQSSTSQEQVYNDCAKKIVKDVLEGYNGTIFAYGQTSSGKTHTMEGKLHDP EGMGIIPRIVQDIFNYIYSMDENLEFHIKVSYFEIYLDKIRDLLDVSKTNLSVHEDKN RVPYVKGCTERFVCSPDEVMDTIDEGKSNRHVAVTNMNEHSSRSHSIFLINVKQENTQ TEQKLSGKLYLVDLAGSEKVSKTGAEGAVLDEAKNINKSLSALGNVISALAEGSTYVP YRDSKMTRILQDSLGGNCRTTIVICCSPSSYNESETKSTLLFGQRAKTIKNTVCVNVE LTAEQWKKKYEKEKEKNKILRNTIQWLENELNRWRNGETVPIDEQFDKEKANLEAFTV DKDITLTNDKPATAIGVIGNFTDAERRKCEEEIAKLYKQLDDKDEEINQQSQLVEKLK TQMLDQEELLASTRRDQDNMQAELNRLQAENDASKEEVKEVLQALEELAVNYDQKSQE VEDKTKEYELLSDELNQKSATLASIDAELQKLKEMTNHQKKRAAEMMASLLKDLAEIG IAVGNNDVKQPEGTGMIDEEFTVARLYISKMKSEVKTMVKRCKQLESTQTESNKKMEE NEKELAACQLRISQHEAKIKSLTEYLQNVEQKKRQLEESVDALSEELVQLRAQEKVHE MEKEHLNKVQTANEVKQAVEQQIQSHRETHQKQISSLRDEVEAKAKLITDLQDQNQKM MLEQERLRVEHEKLKATDQEKSRKLHELTVMQDRREQARQDLKGLEETVAKELQTLHN LRKLFVQDLATRVKKSAEIDSDDTGGSAAQKQKISFLENNLEQLTKVHKQLVRDNADL RCELPKLEKRLRATAERVKALESALKEAKENASRDRKRYQQEVDRIKEAVRSKNMARR GHSAQIAKPIRPGQHPAASPTHPSAIRGGGAFVQNSQPVAVRGGGGKQV" BASE COUNT 1317 a 694 c 839 g 838 t ORIGIN 1 ccagcccccg cagtccgccc agaccgtaaa gggggacgct gaggagccgc ggacgctctc 61 cccggtgccg ccgccgctgc cgccgccatg gctgccatga tggatcggaa gtgagcatta 121 gggttaacgg ctgccggcgc cggctcttca agtcccggct ccccggccgc ctccacccgg 181 ggaacgcgag cgcggcgcag ctgactgctg cctctcacgg ccctcgcgac cacaagccct 241 caggtccggc gcgttccctg caagactgag cggcggggag tggctcccgg ccgcggcccc 301 ggctgcgaga aagatggcgg acctggccga gtgcaacatc aaagtgatgt gtcgcttcag 361 acctctcaac gagtctgaag tgaaccgcgg cgacaagtac atcgccaagt ttcagggaga 421 agacacggtc gtgatcgcgt ccaagcctta tgcatttgat cgggtgttcc agtcaagcac 481 atctcaagag caagtgtata atgactgtgc aaagaagatt gttaaagatg tacttgaagg 541 atataatgga acaatatttg catatggaca aacatcctct gggaagacac acacaatgga 601 gggtaaactt catgatccag aaggcatggg aattattcca agaatagtgc aagatatttt 661 taattatatt tactccatgg atgaaaattt ggaatttcat attaaggttt catattttga 721 aatatatttg gataagataa gggacctgtt agatgtttca aagaccaacc tttcagttca 781 tgaagacaaa aaccgagttc cctatgtaaa ggggtgcaca gagcgttttg tatgtagtcc 841 agatgaagtt atggatacca tagatgaagg aaaatccaac agacatgtag cagttacaaa 901 tatgaatgaa catagctcta ggagtcacag tatatttctt attaatgtca aacaagagaa 961 cacacaaacg gaacaaaagc tgagtggaaa actttatctg gttgatttag ctggtagtga 1021 aaaggttagt aaaactggag ctgaaggtgc tgtgctggat gaagctaaaa acatcaacaa 1081 gtcactttct gctcttggaa atgttatttc tgctttggct gagggtagta catatgttcc 1141 atatcgagat agtaaaatga caagaatcct tcaagattca ttaggtggca actgtagaac 1201 cactattgta atttgctgct ctccatcatc atacaatgag tctgaaacaa aatctacact 1261 cttatttggc caaagggcca aaacaattaa gaacacagtt tgtgtcaatg tggagttaac 1321 tgcagaacag tggaaaaaga agtatgaaaa agaaaaagaa aaaaataaga tcctgcggaa 1381 cactattcag tggcttgaaa atgagctcaa cagatggcgt aatggggaga cggtgcctat 1441 tgatgaacag tttgacaaag agaaagccaa cttggaagct ttcacagtgg ataaagatat 1501 tactcttacc aatgataaac cagcaaccgc aattggagtt ataggaaatt ttactgatgc 1561 tgaaagaaga aagtgtgaag aagaaattgc taaattatac aaacagcttg atgacaagga 1621 tgaagaaatt aaccagcaaa gtcaactggt agagaaactg aagacgcaaa tgttggatca 1681 ggaggagctt ttggcatcta ccagaaggga tcaagacaat atgcaagctg agctgaatcg 1741 ccttcaagca gaaaatgatg cctctaaaga agaagtgaaa gaagttttac aggccctaga 1801 agaacttgct gtcaattatg atcagaagtc tcaggaagtt gaagacaaaa ctaaggaata 1861 tgaattgctt agtgatgaat tgaatcagaa atcggcaact ttagcgagta tagatgctga 1921 gcttcagaaa cttaaggaaa tgaccaacca ccagaaaaaa cgagcagctg agatgatggc 1981 atctttacta aaagaccttg cagaaatagg aattgctgtg ggaaataatg atgtaaagca 2041 gcctgaggga actggcatga tagatgaaga gttcactgtt gcaagactct acattagcaa 2101 aatgaagtca gaagtaaaaa ccatggtgaa acgttgcaag cagttagaaa gcacacaaac 2161 tgagagcaac aaaaaaatgg aagaaaatga aaaggagtta gcagcatgtc agcttcgtat 2221 ctctcaacat gaagccaaaa tcaagtcatt gactgaatac cttcaaaatg tggaacaaaa 2281 gaaaagacag ttggaggaat ctgtcgatgc cctcagtgaa gaactagtcc agcttcgagc 2341 acaagagaaa gtccatgaaa tggaaaagga gcacttaaat aaggttcaga ctgcaaatga 2401 agttaagcaa gctgttgaac agcagatcca gagccataga gaaactcatc aaaaacagat 2461 cagtagtttg agagatgaag tagaagcaaa agcaaaactt attactgatc ttcaagacca 2521 aaaccagaaa atgatgttag agcaggaacg tctaagagta gaacatgaga agttgaaagc 2581 cacagatcag gaaaagagca gaaaactaca tgaacttacg gttatgcaag atagacgaga 2641 acaagcaaga caagacttga agggtttgga agagacagtg gcaaaagaac ttcagacttt 2701 acacaacctg cgcaaactct ttgttcagga cctggctaca agagttaaaa agagtgctga 2761 gattgattct gatgacaccg gaggcagcgc tgctcagaag caaaaaatct cctttcttga 2821 aaataatctt gaacagctca ctaaagtgca caaacagttg gtacgtgata atgcagatct 2881 ccgctgtgaa cttcctaagt tggaaaagcg acttcgagct acagctgaga gagtgaaagc 2941 tttggaatca gcactgaaag aagctaaaga aaatgcatct cgtgatcgca aacgctatca 3001 gcaagaagta gatcgcataa aggaagcagt caggtcaaag aatatggcca gaagagggca 3061 ttctgcacag attgctaaac ctattcgtcc cgggcaacat ccagcagctt ctccaactca 3121 cccaagtgca attcgtggag gaggtgcatt tgttcagaac agccagccag tggcagtgcg 3181 aggtggagga ggcaaacaag tgtaatcgtt tatacatacc cacaggtgtt aaaaagtaat 3241 cgaagtacga agaggacatg gtatcaagca gtcattcaat gactataacc tctactccct 3301 tgggattgta gaattataac ttttaaaaaa aatgtataaa ttatacctgg cctgtacagc 3361 tgtttcctac ctactcttct tgtaaactct gctgcttccc aacacaacta gagtgcaatt 3421 ttggcatctt aggagggaaa aaggacagtt tacaactgtg gccctattta ttacacagtt 3481 tgtctatcgt gtcttaaatt tagtctttac tgtgccaagc taactctacc ttataggact 3541 gtactttttg tattttttgt gtatgtttat tttttaatct cagtttaaat tacctagcta 3601 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3661 aaaaaaaaaa aaaaaaaaaa aaaaaaaa // LOCUS HSKHK3A 1899 bp RNA PRI 14-OCT-1994 DEFINITION H.sapiens KHK mRNA for ketohexokinase, clone pHKHK3a. ACCESSION X78678 NID g558350 KEYWORDS ketohexokinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1899) AUTHORS Bonthron,D.T., Brady,N., Donaldson,I.A. and Steinmann,B. TITLE Molecular basis of essential fructosuria: molecular cloning and mutational analysis of human ketohexokinase (fructokinase) JOURNAL Hum. Mol. Genet. 3 (9), 1627-1631 (1994) MEDLINE 95135420 REFERENCE 2 (bases 1 to 1899) AUTHORS Bonthron,D.T. TITLE Direct Submission JOURNAL Submitted (08-APR-1994) D.T. Bonthron, University of Edinburgh, Human Genetics Unit, Western General Hospital, Edinburgh EH4 2XU, UK FEATURES Location/Qualifiers source 1..1899 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HepG2" /clone_lib="pCDM8" /clone="pHKHK3a" gene 9..905 /gene="KHK" CDS 9..905 /gene="KHK" /EC_number="2.7.1.3" /codon_start=1 /product="ketohexokinase" /db_xref="PID:g558351" /translation="MEEKQILCVGLVVLDVISLVDKYPKEDSEIRCLSQRWQRGGNAS NSCTVLSLLGAPCAFMGSMAPGHVADFVLDDLRRYSVDLRYTVFQTTGSVPIATVIIN EASGSRTILYYDRSLPDVSATDFEKVDLTQFKWIHIEGRNASEQVKMLQRIDAHNTRQ PPEQKIRVSVEVEKPREELFQLFGYGDVVFVSKDVAKHLGFQSAEEALRGLYGRVRKG AVLVCAWAEEGADALGPDGKLLHSDAFPPPRVVDTLGAGDTFNASVIFSLSQGRSVQE ALRFGCQVAGKKCGLQGFDGIV" BASE COUNT 410 a 523 c 550 g 416 t ORIGIN 1 gtagcctcat ggaagagaag cagatcctgt gcgtggggct agtggtgctg gacgtcatca 61 gcctggtgga caagtaccct aaggaggact cggagataag gtgtttgtcc cagagatggc 121 agcgcggagg caacgcgtcc aactcctgca ccgttctctc cctgctcgga gccccctgtg 181 ccttcatggg ctcaatggct cctggccatg ttgctgattt tgtcctggat gacctccgcc 241 gctattctgt ggacctacgc tacacagtct ttcagaccac aggctccgtc cccatcgcca 301 cggtcatcat caacgaggcc agtggtagcc gcaccatcct atactatgac aggagcctgc 361 cagatgtgtc tgctacagac tttgagaagg ttgatctgac ccagttcaag tggatccaca 421 ttgagggccg gaacgcatcg gagcaggtga agatgctgca gcggatagac gcacacaaca 481 ccaggcagcc tccagagcag aagatccggg tgtccgtgga ggtggagaag ccacgagagg 541 agctcttcca gctgtttggc tacggagacg tggtgtttgt cagcaaagat gtggccaagc 601 acttggggtt ccagtcagca gaggaagcct tgaggggctt gtatggtcgt gtgaggaaag 661 gggctgtgct tgtctgtgcc tgggctgagg agggcgccga cgccctgggc cctgatggca 721 aattgctcca ctcggatgct ttcccgccac cccgcgtggt ggatacactg ggagctggag 781 acaccttcaa tgcctccgtc atcttcagcc tctcccaggg gaggagcgtg caggaagcac 841 tgagattcgg gtgccaggtg gccggcaaga agtgtggcct gcagggcttt gatggcatcg 901 tgtgagagca ggtgccggct cctcacacac catggagact accattgcgg ctgcatcgcc 961 ttctcccctc catccagcct ggcgtccagg ttgccctgtt caggggacag atgcaagctg 1021 tggggaggac tctgcctgtg tcctgtgttc cccacaggga gaggctctgg ggggatggct 1081 gggggatgca gagcctcaga gcaaataaat cttcctcaga gccagcttct cctctcaatg 1141 tctgaactgc tctggctggg cattcctgag gctctgactc ttcgatcctc cctctttgtg 1201 tccattcccc aaattaacct ctccgcccag gcccagagga ggggctgcct gggctagagc 1261 agcgagaagt gccctgggct tgccaccagc tctgccctgg ctggggagga cactcggtgc 1321 cccacaccca gtgaacctgc caaagaaacc gtgagagctc ttcggggccc tgcgttgtgc 1381 agactctatt cccacagctc agaagctggg agtccacacc gctgagctga actgacaggc 1441 cagtgggggg caggggtgcg cctcctctgc cctgcccacc agcctgtgat ttgatggggt 1501 cttcattgtc cagaaatacc tcctcccgct gactgcccca gagcctgaaa gtctcaccct 1561 tggagcccac cttggaatta agggcgtgcc tcagccacaa atgtgaccca ggatacagag 1621 tgttgctgtc ctcagggagg tccgatctgg aacacatatt ggaattgggg ccaactccaa 1681 tatagggtgg gtaaggcctt ataatgtaaa gagcatataa tgtaaagggc tttagagtga 1741 gacagacctg gattcaaatc tgccatttaa ttagctgcat atcaccttag ggtacagcac 1801 ttaacgcaat ctgcctcaat ttcttcatct gtcaaatgga accaattctg cttggctaca 1861 gaattattgt gaggataaaa atcatatata aaaaaaaaa // LOCUS HSKINEC 4613 bp RNA PRI 08-SEP-1995 DEFINITION H.sapiens kinectin gene. ACCESSION Z22551 NID g296163 KEYWORDS 156 kDa protein; kinectin gene; receptor for Kinesin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4613) AUTHORS Futterer,A., Kruppa,G., Kramer,B., Lemke,H. and Kronke,M. TITLE Molecular cloning and characterization of human kinectin JOURNAL Mol. Biol. Cell 6 (2), 161-170 (1995) MEDLINE 95306853 REMARK (sites) REFERENCE 2 (bases 1 to 4613) AUTHORS Fuetterer,A. TITLE Direct Submission JOURNAL Submitted (15-APR-1993) Kroenke M., Technical University Munich, Inst. f. Medical Microbiology, Trogerstr. 32, 81675 Munich, Germany FEATURES Location/Qualifiers source 1..4613 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="Kinectin" /cell_line="L540, YT, K562" 5'UTR 1..69 gene 70..4140 /gene="kinectin" CDS 70..4140 /gene="kinectin" /note="subcellular localization: perinuclear and ER;conserved among several species" /codon_start=1 /product="156 kDa Protein" /db_xref="PID:g296164" /translation="MEFYESAYFIVLIPSIVITVIFLFFWLFMKETLYDEVLAKQKRE QKLIPTKTDKKKAEKKKNKKKEIQNGNLHESDSESVPRDFKLSDALAVEDDQVAPVPL NVVETSSSVRERKKKEKKQKPVLEEQVIKESDASKIPGKKVEPVPVTKQPTPPSEAAA SKKKPGQKKSKNGSDDQDKKVETLMVPSKRQEALPLHQETKQESGSGKKASSKKQKTE NVFVDEPLIHATTYIPLMDNADSSPVVDKREVIDLLKPDQVEGIQKSGTKKLKTETDK ENAEVKFKDFLLSLKTMMFSEDEALCVVDLLKEKSGVIQDALKKSSKGELTTLIHQLQ EKDKLLAAVKEDAAATKDRCKQLTQEMMTEKERSNVVMTRMKDRIGTLEKEHNVFQNK IHVSYQETQQMQMKFQQVREQMEAEIAHLKQENGILRDAVSNTTNQLESKQSAELNKL RQDYARLVNELTEKTGKLQQEEVQKKNAEQAATQLKVQLQEAERRWEEVQSYIRKRTA EHEAAQQDLQSKFVAKENEVQSLHSKLTDTLVSKQQLEQRLMQLMESEQKRVNKEESL QMQVQDILEQNEALKAQIQQFHSQIAAQTSASVLAEELHKVIAEKDKQIKQTEDSLAS ERDRLTSKEEELKDIQNMNFLLKAEVQKLQALANEQAAAAHELEKMQQSVYVKDDKIR LLEEQLQHEISNKMEEFKILNDQNKALKSEVQKLQTLVSEQPNKDVVEQMEKCIQEKD EKLKTVEELLETGLIQVATKEEELNAIRTENSSLTKEVQDLKAKQNDQVSFASLVEEL KKVIHEKDGKIKSVEELLEAELLKVANKEKTVQDLKQEIKALKEEIGNVQLEKAQQLS ITSKVQELQNLLKGKEEQMNTMKAVLEEKEKDLANTGKWLQDLQEENESLKAHVQEVA QHNLKEASSASQFEELEIVLKEKGNELKRLEAMLKERESDLSSKTQLLQDVQDENKLF KSQIEQLKQQNYQQASSFPPHEELLKVISEREKEISGLWNELDSLKDAVEHQRKKNND LREKNWEAMEALASTEKMLQDKVNKTSKERQQQVEAVELEAKEVLKKLFPKVSVPSNL SYGEWLHGFEKKAKECMAGTSGSEEVKVLEHKLKEADEMHTLLQLECEKYKSVLAETE GILQKLQRSVEQEENKWKVKVDESHKTIKQMQSSFTSSEQELERLRSENKDIENLRRE REHLEMELEKAEMERSTYVTEVRELKDLLTELQKKLDDSYSEAVRQNEELNLLKAQLN ETLTKLRTEQNERQKVAGDLHKAQQSLELIQSKIVKAAGDTTVIENSDVSPETESSEK ETMSVSLNQTVTQLQQLLQAVNQQLTKEKEHYQVLE" 3'UTR 4141..4613 BASE COUNT 1770 a 723 c 1020 g 1100 t ORIGIN 1 gcgccgcgtc ttcccggtct cctttcccgg ccgcacaggg ttttatagga tcacattgac 61 aaaagtacca tggagtttta tgagtcagca tattttattg ttcttattcc ttcaatagtt 121 attacagtaa ttttcctctt cttctggctt ttcatgaaag aaacattata tgatgaagtt 181 cttgcaaaac agaaaagaga acaaaagctt attcctacca aaacagataa aaagaaagca 241 gaaaagaaaa agaataaaaa gaaagaaatc cagaatggaa acctccatga atccgactct 301 gagagtgtac ctcgagactt taaattatca gatgctttgg cagtagaaga tgatcaagtt 361 gcacctgttc cattgaatgt cgttgaaact tcaagtagtg ttagggaaag aaaaaagaag 421 gaaaagaaac aaaagcctgt gcttgaagag caggtcatca aagaaagtga cgcatcaaag 481 attcctggca aaaaagtaga acctgtccca gttactaaac agcccacccc tccctctgaa 541 gcagctgcct cgaagaagaa accagggcag aagaagtcta aaaatggaag cgatgaccag 601 gataaaaagg tggaaactct catggtacca tcaaaaaggc aagaagcatt gcccctccac 661 caagagacta aacaagaaag tggatcaggg aagaaagctt catcaaagaa acaaaagaca 721 gaaaatgtct tcgtagatga accccttatt catgcaacta cttatattcc tttgatggat 781 aatgctgact caagtcctgt ggtagataag agagaggtta ttgatttgct taaacctgac 841 caagtagaag ggatccagaa atctgggact aaaaaactga agaccgaaac tgacaaagaa 901 aatgctgaag tgaagtttaa agattttctt ctgtccttga agactatgat gttttctgaa 961 gatgaggctc tttgtgttgt agacttgcta aaggagaagt ctggtgtaat acaagatgct 1021 ttaaagaagt caagtaaggg agaattgact acgcttatac atcagcttca agaaaaggac 1081 aagttactcg ctgctgtgaa ggaagatgct gctgctacaa aggatcggtg taagcagtta 1141 acccaggaaa tgatgacaga gaaagaaaga agcaatgtgg ttatgacaag gatgaaagat 1201 cggattggaa cattagaaaa ggaacataat gtatttcaaa acaaaataca tgtcagttat 1261 caagagactc aacagatgca gatgaagttt cagcaagttc gtgagcagat ggaggcagag 1321 atagctcact tgaagcagga aaatggtata ctgagagatg cagtcagcaa cactacaaat 1381 caactggaaa gcaagcagtc tgcagaacta aataaactac gccaggatta tgctaggttg 1441 gtgaatgagc tgactgagaa aacaggaaag ctacagcaag aggaagtcca aaagaagaat 1501 gctgagcaag cagctactca gttgaaggtt caactacaag aagctgagag aaggtgggaa 1561 gaagttcaga gctacatcag gaagagaaca gcggaacatg aggcagcaca gcaagattta 1621 cagagtaaat ttgtggccaa agaaaatgaa gtacagagtc tgcatagtaa gcttacagat 1681 accttggtat caaaacaaca gttggagcaa agactaatgc agttaatgga atcagagcag 1741 aaaagggtga acaaagaaga gtctctacaa atgcaggttc aggatatttt ggagcagaat 1801 gaggctttga aagctcaaat tcagcagttc cattcccaga tagcagccca gacctccgct 1861 tcagttctag cagaagaatt acataaagtg attgcagaaa aggataagca gataaaacag 1921 actgaagatt ctttagcaag tgaacgtgat cgtttaacaa gtaaagaaga ggaacttaag 1981 gatatacaga atatgaattt cttattaaaa gctgaagtgc agaaattaca ggccctggca 2041 aatgagcagg ctgctgctgc acatgaattg gagaagatgc aacaaagtgt ttatgttaaa 2101 gatgataaaa taagattgct ggaagagcaa ctacaacatg aaatttcaaa caaaatggaa 2161 gaatttaaga ttctaaatga ccaaaacaaa gcattaaaat cagaagttca gaagctacag 2221 actcttgttt ctgaacagcc taataaggat gttgtggaac aaatggaaaa atgcattcaa 2281 gaaaaagatg agaagttaaa gactgtggaa gaattacttg aaactggact tattcaggtg 2341 gcaactaaag aagaggagct gaatgcaata agaacagaaa attcatctct gacaaaagaa 2401 gttcaagact taaaagctaa gcaaaatgat caggtttctt ttgcctctct agttgaagaa 2461 cttaagaaag tgatccatga gaaagatgga aagatcaagt ctgtagaaga gcttctggag 2521 gcagaacttc tcaaagttgc taacaaggag aaaactgttc aggatttgaa acaggaaata 2581 aaggctctaa aagaagaaat aggaaatgtc cagcttgaaa aggctcaaca gttatctatc 2641 acttccaaag ttcaggagct tcagaactta ttaaaaggaa aagaggaaca gatgaatacc 2701 atgaaggctg ttttggaaga gaaagagaaa gacctagcca atacagggaa gtggttacag 2761 gatcttcaag aagaaaatga atctttaaaa gcacatgttc aggaagtagc acaacataac 2821 ttgaaagagg cctcttctgc atcacagttt gaagaacttg agattgtgtt gaaagaaaag 2881 ggaaatgaat tgaagaggtt agaagccatg ctaaaagaga gggagagtga tctttctagc 2941 aaaacacagc tgttacagga tgtacaagat gaaaacaaat tgtttaagtc ccaaattgag 3001 cagcttaaac aacaaaacta ccaacaggca tcttcttttc cccctcatga agaattatta 3061 aaagtaattt cagaaagaga gaaagaaata agtggtctct ggaatgagtt agattctttg 3121 aaggatgcag ttgaacacca gaggaagaaa aacaatgacc ttcgggagaa aaactgggaa 3181 gcaatggaag cattggcatc aactgaaaaa atgctgcagg acaaagtgaa caagacttcc 3241 aaggaaaggc agcaacaggt ggaagctgtt gagttggagg ctaaagaagt tctcaaaaaa 3301 ttatttccaa aggtgtctgt cccttctaat ttgagttatg gtgaatggtt gcatggattt 3361 gaaaaaaagg caaaagaatg tatggctgga acttcagggt cagaggaggt taaggttcta 3421 gagcacaagt tgaaagaagc tgatgaaatg cacacattgt tacagctaga gtgtgaaaaa 3481 tacaaatccg tccttgcaga aacagaagga attttacaga agctacagag aagtgttgag 3541 caagaagaaa ataaatggaa agttaaggtc gatgaatcac acaagactat taaacagatg 3601 cagtcatcat ttacatcttc agaacaagag ctagagcgat taagaagcga aaataaggat 3661 attgaaaatc tgagaagaga acgagaacat ttggaaatgg aactagaaaa ggcagagatg 3721 gaacgatcta cctatgttac agaagtcaga gagctgaaag atctgttgac tgaattgcag 3781 aaaaaacttg atgattcata ttctgaagca gtaagacaga atgaagagct aaatttgttg 3841 aaggcacagt taaatgaaac actcacaaaa cttagaactg aacaaaatga aagacagaag 3901 gtagctggtg atttgcataa ggctcaacag tcactggagc ttatccagtc aaaaatagta 3961 aaagctgctg gagacactac tgttattgaa aatagtgatg tttccccaga aacggagtct 4021 tctgagaagg agacaatgtc tgtaagtcta aatcagactg taacacagtt acagcagttg 4081 cttcaggcgg taaaccaaca gctcacaaag gagaaagagc actaccaggt gttagagtga 4141 agtaattggg aaactgttca tttgaggata aaaaaggcat tgtattatat tttgccaaat 4201 taaagcctta tttatgtttt caccctttct actttgtcag aaacactgaa cagagttttg 4261 tcttttctaa tccttgttag actactgatt taaagaagga aaaaaaaaag ccaactctgt 4321 agacaccttc agagtttagt tttataataa aaactgtttg aataattaga cctttacatt 4381 cctgaagata aacatgtaat cttttatctt attttgctca ataaaattgt tcagaagatc 4441 aaagtggtaa agacaatgta aaatttaaca ttttaatact gatgttgtac actgttttac 4501 ttaacatttt gggaagtaac tgcctctgac ttcaactcaa gaaaacactt ttttgttgct 4561 aatgtaatcg gtttttgtaa tggcgtcagc aaataaaagg atgcttatta ttc // LOCUS HSKINRELP 3741 bp RNA PRI 12-MAR-1996 DEFINITION H.sapiens mRNA for kinesin-related protein. ACCESSION X85137 NID g1155083 KEYWORDS kinesin-related protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3741) AUTHORS Kress,M.M. TITLE Direct Submission JOURNAL Submitted (07-MAR-1995) M.M. Kress, IFC - CNRS - UPR 9044, 7 Rue Guy Moquet, F- 94802 Villejuif Cedex, FRANCE REFERENCE 2 (bases 1 to 3741) AUTHORS Blangy,A., Lane,H.A., d'Herin,P., Harper,M., Kress,M. and Nigg,E.A. TITLE Phosphorylation by p34cdc2 regulates spindle association of human Eg5, a kinesin-related motor essential for bipolar spindle formation in vivo JOURNAL Cell 83 (7), 1159-1169 (1995) MEDLINE 96128120 FEATURES Location/Qualifiers source 1..3741 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="DAUDI cells" /clone="HSEg5-D2" CDS 11..3184 /codon_start=1 /product="kinesin-related protein" /db_xref="PID:e140873" /db_xref="PID:g1155084" /translation="MASQPNSSAKKKEEKGKNIQVVVRCRPFNLAERKASAHSIVECD PVRKEVSVRTGGLADKSSRKTYTFDMVFGASTKQIDVYRSVVCPILDEVIMGYNCTIF AYGQTGTGKTFTMEGERSPNEEYTWEEDPLAGIIPRTLHQIFEKLTDNGTEFSVKVSL LEIYNEELFDLLNPSSDVSERLQMFDDPRNKRGVIIKGLEEITVHNKDEVYQILEKGA AKRTTAATLMNAYSSRSHSVFSVTIHMKETTIDGEELVKIGKLNLVDLAGSENIGRSG AVDKRAREAGNINQSLLTLGRVITALVERTPHVPYRESKLTRILQDSLGGRTRTSIIA TISPASLNLEETLSTLEYAHRAKNILNKPEVNQKLTKKALIKEYTEEIERLKRDLAAA REKNGVYISEENFRVMSGKLTVQEEQIVELIEKIGAVEEELNRVTELFMDNKNELDQC KSDLQNKTQELETTQKHLQETKLQLVKEEYITSALESTEEKLHDAASKLLNTVEETTK DVSGLHSKLDRKKAVDQHNAEAQDIFGKNLNSLFNNMEELIKDGSSKQKAMLEVHKTL FGNLLSSSVSALDTITTVALGSLTSIPENVSTHVSQIFNMILKEQSLAAESKTVLQEL INVLKTDLLSSLEMILSPTVVSILKINSQLKHIFKTSLTVADKIEDQKKRNSDGFLSI LCNNLHELQENTICSLVESQKQCGNLTEDLKTIKQTHSQELCKLMNLWTERFCALEEK CENIQKPLSSVQENIQQKSKDIVNKMTFHSQKFCADSDGFSQELRNFNQEGTKLVEES VKHSDKLNGNLEKISQETEQRCESLNTRTVYFSEQWVSSLNEREQELHNLLEVVSQCC EASSSDITEKSDGRKAAHEKQHNIFLDQMTIDEDKLIAQNLELNETIKIGLTKLNCFL EQDLKLDIPTGTTPQRKSYLYPSTLVRTEPREHLLDQLKRKQPELLMMLNCSENNKEE TIPDVDVEEAVLGQYTEEPLSQEPSVDAGVDCSSIGGVPFFQHKKSHGKDKENRGINT LERSKVEETTEHLVTKSRLPLRAQINL" BASE COUNT 1292 a 652 c 799 g 998 t ORIGIN 1 gaattccgtc atggcgtcgc agccaaattc gtctgcgaag aagaaagagg agaaggggaa 61 gaacatccag gtggtggtga gatgcagacc atttaatttg gcagagcgga aagctagcgc 121 ccattcaata gtagaatgtg atcctgtacg aaaagaagtt agtgtacgaa ctggaggatt 181 ggctgacaag agctcaagga aaacatacac ttttgatatg gtgtttggag catctactaa 241 acagattgat gtttaccgaa gtgttgtttg tccaattctg gatgaagtta ttatgggcta 301 taattgcact atctttgcgt atggccaaac tggcactgga aaaactttta caatggaagg 361 tgaaaggtca cctaatgaag agtatacctg ggaagaggat cccttggctg gtataattcc 421 acgtaccctt catcaaattt ttgagaaact tactgataat ggtactgaat tttcagtcaa 481 agtgtctctg ttggagatct ataatgaaga gctttttgat cttcttaatc catcatctga 541 tgtttctgag agactacaga tgtttgatga tccccgtaac aagagaggag tgataattaa 601 aggtttagaa gaaattacag tacacaacaa ggatgaagtc tatcaaattt tagaaaaggg 661 ggcagcaaaa aggacaactg cagctactct gatgaatgca tactctagtc gttcccactc 721 agttttctct gttacaatac atatgaaaga aactacgatt gatggagaag agcttgttaa 781 aatcggaaag ttgaacttgg ttgatcttgc aggaagtgaa aacattggcc gttctggagc 841 tgttgataag agagctcggg aagctggaaa tataaatcaa tccctgttga ctttgggaag 901 ggtcattact gcccttgtag aaagaacacc tcatgttcct tatcgagaat ctaaactaac 961 tagaatcctc caggattctc ttggagggcg tacaagaaca tctataattg caacaatttc 1021 tcctgcatct ctcaatcttg aggaaactct gagtacattg gaatatgctc atagagcaaa 1081 gaacatattg aataagcctg aagtgaatca gaaactcacc aaaaaagctc ttattaagga 1141 gtatacggag gagatagaac gtttaaaacg agatcttgct gcagcccgtg agaaaaatgg 1201 agtgtatatt tctgaagaaa attttagagt catgagtgga aaattaactg ttcaagaaga 1261 gcagattgta gaattgattg aaaaaattgg tgctgttgag gaggagctga atagggttac 1321 agagttgttt atggataata aaaatgaact tgaccagtgt aaatctgacc tgcaaaataa 1381 aacacaagaa cttgaaacca ctcaaaaaca tttgcaagaa actaaattac aacttgttaa 1441 agaagaatat atcacatcag ctttggaaag tactgaggag aaacttcatg atgctgccag 1501 caagctgctt aacacagttg aagaaactac aaaagatgta tctggtctcc attccaaact 1561 ggatcgtaag aaggcagttg accaacacaa tgcagaagct caggatattt ttggcaaaaa 1621 cctgaatagt ctgtttaata atatggaaga attaattaag gatggcagct caaagcaaaa 1681 ggccatgcta gaagtacata agaccttatt tggtaatctg ctgtcttcca gtgtctctgc 1741 attagatacc attactacag tagcacttgg atctctcaca tctattccag aaaatgtgtc 1801 tactcatgtt tctcagattt ttaatatgat actaaaagaa caatcattag cagcagaaag 1861 taaaactgta ctacaggaat tgattaatgt actcaagact gatcttctaa gttcactgga 1921 aatgatttta tccccaactg tggtgtctat actgaaaatc aatagtcaac taaagcatat 1981 tttcaagact tcattgacag tggccgataa gatagaagat caaaaaaaaa ggaactcaga 2041 tggctttctc agtatactgt gtaacaatct acatgaacta caagaaaata ccatttgttc 2101 cttggttgag tcacaaaagc aatgtggaaa cctaactgaa gacctgaaga caataaagca 2161 gacccattcc caggaacttt gcaagttaat gaatctttgg acagagagat tctgtgcttt 2221 ggaggaaaag tgtgaaaata tacagaaacc acttagtagt gtccaggaaa atatacagca 2281 gaaatctaag gatatagtca acaaaatgac ttttcacagt caaaaatttt gtgctgattc 2341 tgatggcttc tcacaggaac tcagaaattt taaccaagaa ggtacaaaat tggttgaaga 2401 atctgtgaaa cactctgata aactcaatgg caacctggaa aaaatatctc aagagactga 2461 acagagatgt gaatctctga acacaagaac agtttatttt tctgaacagt gggtatcttc 2521 cttaaatgaa agggaacagg aacttcacaa cttattggag gttgtaagcc aatgttgtga 2581 ggcttcaagt tcagacatca ctgagaaatc agatggacgt aaggcagctc atgagaaaca 2641 gcataacatt tttcttgatc agatgactat tgatgaagat aaattgatag cacaaaatct 2701 agaacttaat gaaaccataa aaattggttt gactaagctt aattgctttc tggaacagga 2761 tctgaaactg gatatcccaa caggtacgac accacagagg aaaagttatt tatacccatc 2821 aacactggta agaactgaac cacgtgaaca tctccttgat cagctgaaaa ggaaacagcc 2881 tgagctgtta atgatgctaa actgttcaga aaacaacaaa gaagagacaa ttccggatgt 2941 ggatgtagaa gaggcagttc tggggcagta tactgaagaa cctctaagtc aagagccatc 3001 tgtagatgct ggtgtggatt gttcatcaat tggcggggtt ccatttttcc agcataaaaa 3061 atcacatgga aaagacaaag aaaacagagg cattaacaca ctggagaggt ctaaagtgga 3121 agaaactaca gagcacttgg ttacaaagag cagattacct ctgcgagccc agatcaacct 3181 ttaattcact tgggggttgg caattttatt tttaaagaaa aacttaaaaa taaaacctga 3241 aaccccagaa cttgagcctt gtgtatagat tttaaaagaa tatatatatc agccgggcgc 3301 gtggctctag ctgtaatccc agctaacttt ggaggctgag gcgggtggat tgcttgagcc 3361 caggagtttg agaccagcct ggccaacgtg cgctaaaacc ttcgtctctg ttaaaaatta 3421 gccgggcgtg gtgggcacac tcctgtaatc ccagctactg gggaggctga ggcacgagaa 3481 tcacttgaac ccagaagcgg ggttgcagtg agccaaaggt acaccactac actccagcct 3541 gggcaacaga gcaagactcg gtctcaaaaa taaaatttaa aaaagatata aggcagtact 3601 gtaaattcag ttgaattttg atatctaccc atttttctgt catccctata gttcactttg 3661 tattaaattg ggtttcattt gggatttgca atgtaaatac gtatttctag ttttcatata 3721 aagtagttct tttaggaatt c // LOCUS HSKITCR 5084 bp RNA PRI 12-SEP-1993 DEFINITION Human c-kit proto-oncogene mRNA. ACCESSION X06182 NID g34084 KEYWORDS colony stimulating factor receptor; glycoprotein; growth factor receptor; kit cellular oncogene; kit oncogene; platelet-derived growth factor receptor; proto-oncogene; transmembrane protein; tyrosine kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5084) AUTHORS Yarden,Y., Kuang,W.J., Yang-Feng,T., Coussens,L., Munemitsu,S., Dull,T.J., Chen,E., Schlessinger,J., Francke,U. and Ullrich,A. TITLE Human proto-oncogene c-kit: a new cell surface receptor tyrosine kinase for an unidentified ligand JOURNAL EMBO J. 6 (11), 3341-3351 (1987) MEDLINE 88111521 COMMENT in aditition to the tyrosine kinase region the c-kit deduced AA sequence shares major structural features with the macrophage growth factor ( CSF-1) and platelet-derived growth factor ( PDGF ) receptor subfamily. FEATURES Location/Qualifiers source 1..5084 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="term placenta and fetal brain" /clone_lib="lambda gt11" /clone="HFB-ckit/1, HFB-ckit/171, HP-ckit/63" CDS 22..2952 /note="protein p145-ckit (AA 1 - 976)" /codon_start=1 /db_xref="PID:g34085" /db_xref="SWISS-PROT:P10721" /translation="MRGARGAWDFLCVLLLLLRVQTGSSQPSVSPGEPSPPSIHPGKS DLIVRVGDEIRLLCTDPGFVKWTFEILDETNENKQNEWITEKAEATNTGKYTCTNKHG LSNSIYVFVRDPAKLFLVDRSLYGKEDNDTLVRCPLTDPEVTNYSLKGCQGKPLPKDL RFIPDPKAGIMIKSVKRAYHRLCLHCSVDQEGKSVLSEKFILKVRPAFKAVPVVSVSK ASYLLREGEEFTVTCTIKDVSSSVYSTWKRENSQTKLQEKYNSWHHGDFNYERQATLT ISSARVNDSGVFMCYANNTFGSANVTTTLEVVDKGFINIFPMINTTVFVNDGENVDLI VEYEAFPKPEHQQWIYMNRTFTDKWEDYPKSENESNIRYVSELHLTRLKGTEGGTYTF LVSNSDVNAAIAFNVYVNTKPEILTYDRLVNGMLQCVAAGFPEPTIDWYFCPGTEQRC SASVLPVDVQTLNSSGPPFGKLVVQSSIDSSAFKHNGTVECKAYNDVGKTSAYFNFAF KGNNKEQIHPHTLFTPLLIGFVIVAGMMCIIVMILTYKYLQKPMYEVQWKVVEEINGN NYVYIDPTQLPYDHKWEFPRNRLSFGKTLGAGAFGKVVEATAYGLIKSDAAMTVAVKM LKPSAHLTEREALMSELKVLSYLGNHMNIVNLLGACTIGGPTLVITEYCCYGDLLNFL RRKRDSFICSKQEDHAEAALYKNLLHSKESSCSDSTNEYMDMKPGVSYVVPTKADKRR SVRIGSYIERDVTPAIMEDDELALDLEDLLSFSYQVAKGMAFLASKNCIHRDLAARNI LLTHGRITKICDFGLARDIKNDSNYVVKGNARLPVKWMAPESIFNCVYTFESDVWSYG IFLWELFSLGSSPYPGMPVDSKFYKMIKEGFRMLSPEHAPAEMYDIMKTCWDADPLKR PTFKQIVQLIEKQISESTNHIYSNLANCSPNRQKPVVDHSVRINSVGSTASSSQPLLV HDDV" misc_feature 409..417 /note="pot. N-linked glycosylation site" misc_feature 454..462 /note="pot. N-linked glycosylation site" misc_feature 868..876 /note="pot. N-linked glycosylation site" misc_feature 898..906 /note="pot. N-linked glycosylation site" misc_feature 919..927 /note="pot. N-linked glycosylation site" misc_feature 979..987 /note="pot. N-linked glycosylation site" misc_feature 1075..1083 /note="pot. N-linked glycosylation site" misc_feature 1120..1128 /note="pot. N-linked glycosylation site" misc_feature 1408..1416 /note="pot. N-linked glycosylation site" misc_feature 1582..1650 /note="transmembrane domain" misc_feature 1807..1809 /note="residue involved in ATP binding" misc_feature 1813..1815 /note="residue involved in ATP binding" misc_feature 1822..1824 /note="residue involved in ATP binding" misc_feature 1858..1860 /note="residue involved in ATP binding" BASE COUNT 1450 a 1053 c 1111 g 1470 t ORIGIN 1 gatcccatcg cagctaccgc gatgagaggc gctcgcggcg cctgggattt tctctgcgtt 61 ctgctcctac tgcttcgcgt ccagacaggc tcttctcaac catctgtgag tccaggggaa 121 ccgtctccac catccatcca tccaggaaaa tcagacttaa tagtccgcgt gggcgacgag 181 attaggctgt tatgcactga tccgggcttt gtcaaatgga cttttgagat cctggatgaa 241 acgaatgaga ataagcagaa tgaatggatc acggaaaagg cagaagccac caacaccggc 301 aaatacacgt gcaccaacaa acacggctta agcaattcca tttatgtgtt tgttagagat 361 cctgccaagc ttttccttgt tgaccgctcc ttgtatggga aagaagacaa cgacacgctg 421 gtccgctgtc ctctcacaga cccagaagtg accaattatt ccctcaaggg gtgccagggg 481 aagcctcttc ccaaggactt gaggtttatt cctgacccca aggcgggcat catgatcaaa 541 agtgtgaaac gcgcctacca tcggctctgt ctgcattgtt ctgtggacca ggagggcaag 601 tcagtgctgt cggaaaaatt catcctgaaa gtgaggccag ccttcaaagc tgtgcctgtt 661 gtgtctgtgt ccaaagcaag ctatcttctt agggaagggg aagaattcac agtgacgtgc 721 acaataaaag atgtgtctag ttctgtgtac tcaacgtgga aaagagaaaa cagtcagact 781 aaactacagg agaaatataa tagctggcat cacggtgact tcaattatga acgtcaggca 841 acgttgacta tcagttcagc gagagttaat gattctggag tgttcatgtg ttatgccaat 901 aatacttttg gatcagcaaa tgtcacaaca accttggaag tagtagataa aggattcatt 961 aatatcttcc ccatgataaa cactacagta tttgtaaacg atggagaaaa tgtagatttg 1021 attgttgaat atgaagcatt ccccaaacct gaacaccagc agtggatcta tatgaacaga 1081 accttcactg ataaatggga agattatccc aagtctgaga atgaaagtaa tatcagatac 1141 gtaagtgaac ttcatctaac gagattaaaa ggcaccgaag gaggcactta cacattccta 1201 gtgtccaatt ctgacgtcaa tgctgccata gcatttaatg tttatgtgaa tacaaaacca 1261 gaaatcctga cttacgacag gctcgtgaat ggcatgctcc aatgtgtggc agcaggattc 1321 ccagagccca caatagattg gtatttttgt ccaggaactg agcagagatg ctctgcttct 1381 gtactgccag tggatgtgca gacactaaac tcatctgggc caccgtttgg aaagctagtg 1441 gttcagagtt ctatagattc tagtgcattc aagcacaatg gcacggttga atgtaaggct 1501 tacaacgatg tgggcaagac ttctgcctat tttaactttg catttaaagg taacaacaaa 1561 gagcaaatcc atccccacac cctgttcact cctttgctga ttggtttcgt aatcgtagct 1621 ggcatgatgt gcattattgt gatgattctg acctacaaat atttacagaa acccatgtat 1681 gaagtacagt ggaaggttgt tgaggagata aatggaaaca attatgttta catagaccca 1741 acacaacttc cttatgatca caaatgggag tttcccagaa acaggctgag ttttgggaaa 1801 accctgggtg ctggagcttt cgggaaggtt gttgaggcaa ctgcttatgg cttaattaag 1861 tcagatgcgg ccatgactgt cgctgtaaag atgctcaagc cgagtgccca tttgacagaa 1921 cgggaagccc tcatgtctga actcaaagtc ctgagttacc ttggtaatca catgaatatt 1981 gtgaatctac ttggagcctg caccattgga gggcccaccc tggtcattac agaatattgt 2041 tgctatggtg atcttttgaa ttttttgaga agaaaacgtg attcatttat ttgttcaaag 2101 caggaagatc atgcagaagc tgcactttat aagaatcttc tgcattcaaa ggagtcttcc 2161 tgcagcgata gtactaatga gtacatggac atgaaacctg gagtttctta tgttgtccca 2221 accaaggccg acaaaaggag atctgtgaga ataggctcat acatagaaag agatgtgact 2281 cccgccatca tggaggatga cgagttggcc ctagacttag aagacttgct gagcttttct 2341 taccaggtgg caaagggcat ggctttcctc gcctccaaga attgtattca cagagacttg 2401 gcagccagaa atatcctcct tactcatggt cggatcacaa agatttgtga ttttggtcta 2461 gccagagaca tcaagaatga ttctaattat gtggttaaag gaaacgctcg actacctgtg 2521 aagtggatgg cacctgaaag cattttcaac tgtgtataca cgtttgaaag tgacgtctgg 2581 tcctatggga tttttctttg ggagctgttc tctttaggaa gcagccccta tcctggaatg 2641 ccggtcgatt ctaagttcta caagatgatc aaggaaggct tccggatgct cagccctgaa 2701 cacgcacctg ctgaaatgta tgacataatg aagacttgct gggatgcaga tcccctaaaa 2761 agaccaacat tcaagcaaat tgttcagcta attgagaagc agatttcaga gagcaccaat 2821 catatttact ccaacttagc aaactgcagc cccaaccgac agaagcccgt ggtagaccat 2881 tctgtgcgga tcaattctgt cggcagcacc gcttcctcct cccagcctct gcttgtgcac 2941 gacgatgtct gagcagaatc agtgtttggg tcacccctcc aggaatgatc tcttcttttg 3001 gcttccatga tggttatttt cttttctttc aacttgcatc caactccagg atagtgggca 3061 ccccactgca atcctgtctt tctgagcaca ctttagtggc cgatgatttt tgtcatcagc 3121 caccatccta ttgcaaaggt tccaactgta tatattccca atagcaacgt agcttctacc 3181 atgaacagaa aacattctga tttggaaaaa gagagggagg tatggactgg gggccagagt 3241 cctttccaag gcttctccaa ttctgcccaa aaatatggtt gatagtttac ctgaataaat 3301 ggtagtaatc acagttggcc ttcagaacca tccatagtag tatgatgata caagattaga 3361 agctgaaaac ctaagtcctt tatgtggaaa acagaacatc attagaacaa aggacagagt 3421 atgaacacct gggcttaaga aatctagtat ttcatgctgg gaatgagaca taggccatga 3481 aaaaaatgat ccccaagtgt gaacaaaaga tgctcttctg tggaccactg catgagcttt 3541 tatactaccg acctggtttt taaatagagt ttgctattag agcattgaat tggagagaag 3601 gcctccctag ccagcacttg tatatacgca tctataaatt gtccgtgttc atacatttga 3661 ggggaaaaca ccataaggtt tcgtttctgt atacaaccct ggcattatgt ccactgtgta 3721 tagaagtaga ttaagagcca tataagtttg aaggaaacag ttaataccat tttttaagga 3781 aacaatataa ccacaaagca cagtttgaac aaaatctcct cttttagctg atgaacttat 3841 tctgtagatt ctgtggaaca agcctatcag cttcagaatg gcattgtact caatggattt 3901 gatgctgttt gacaaagtta ctgattcact gcatggctcc cacaggagtg ggaaaacact 3961 gccatcttag tttggattct tatgtagcag gaaataaagt ataggtttag cctccttcgc 4021 aggcatgtcc tggacaccgg gccagtatct atatatgtgt atgtacgttt gtatgtgtgt 4081 agacaaatat ttggaggggt atttttgccc tgagtccaag agggtccttt agtacctgaa 4141 aagtaacttg gctttcatta ttagtactgc tcttgtttct tttcacatag ctgtctagag 4201 tagcttacca gaagcttcca tagtggtgca gaggaagtgg aaggcatcag tccctatgta 4261 tttgcagttc acctgcactt aaggcactct gttatttaga ctcatcttac tgtacctgtt 4321 ccttagacct tccataatgc tactgtctca ctgaaacatt taaattttac cctttagact 4381 gtagcctgga tattattctt gtagtttacc tctttaaaaa caaaacaaaa caaaacaaaa 4441 aactcccctt cctcactgcc caatataaaa ggcaaatgtg tacatggcag agtttgtgtg 4501 ttgtcttgaa agattcaggt atgttgcctt tatggtttcc cccttctaca tttcttagac 4561 tacatttaga gaactgtggc cgttatctgg aagtaaccat ttgcactgga gttctatgct 4621 ctcgcacctt tccaaagtta acagattttg gggttgtgtt gtcacccaag agattgttgt 4681 ttgccatact ttgtctgaaa aattcctttg tgtttctatt gacttcaatg atagtaagaa 4741 aagtggttgt tagttataga tgtctaggta cttcaggggc acttcattga gagttttgtc 4801 ttgccatact ttgtctgaaa aattcctttg tgtttctatt gacttcaatg atagtaagaa 4861 aagtggttgt tagttataga tgtctaggta cttcaggggc acttcattga gagttttgtc 4921 aatgtctttt gaatattccc aagcccatga gtccttgaaa atatttttta tatatacagt 4981 aactttatgt gtaaatacat aagcggcgta agtttaaagg atgttggtgt tccacgtgtt 5041 ttattcctgt atgttgtcca attgttgaca gttctgaaga attc // LOCUS HSKOAT 1599 bp RNA PRI 23-MAR-1995 DEFINITION Human mRNA for kidney ornithine aminotransferase (EC 2.6.1.13). ACCESSION Y07511 NID g34137 KEYWORDS ornithine aminotransferase; ornithine-oxo-acid aminotransferase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1599) AUTHORS Matsuzawa,T. TITLE Direct Submission JOURNAL Submitted (25-OCT-1989) Matsuzawa T., Department of Biochemistry, School of Medicine, Fujita Health University, 1-89 Dengakugakubo, Kutsukake-cho, Toyoake Aichi 470-11, JAPAN REFERENCE 2 (bases 1 to 1599) AUTHORS Kobayashi,T., Nishii,M., Takagi,Y., Titani,K. and Matsuzawa,T. TITLE Molecular cloning and nucleotide sequence analysis of mRNA for human kidney ornithine aminotransferase. An examination of ornithine aminotransferase isozymes between liver and kidney JOURNAL FEBS Lett. 255 (2), 300-304 (1989) MEDLINE 90005995 COMMENT Data kindly reviewed (01-FEB-1990) by Matsuzawa T. FEATURES Location/Qualifiers source 1..1599 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" /clone_lib="lambda gt10" sig_peptide 41..145 /note="signal peptide (AA -35 to -1)" CDS 41..1360 /note="precursor (AA -35 to 404)" /codon_start=1 /db_xref="PID:g34138" /db_xref="SWISS-PROT:P04181" /translation="MFSKLAHLQRFAVLSRGVHSSVASATSVATKKTVQGPPTSDDIF EREYKYGAHNYHPLPVALERGKGIYLWDVEGRKYFDFLSSYSAVNQGHCHPKIVNALK SQVDKLTLTSRAFYNNVLGEYEEYITKLFNYHKVLPMNTGVEAGETACKLARKWGYTV KGIQKYKAKIVFAAGNFWGRTLSAISSSTDPTSYDGFGPFMPGFDIIPYNDLPALERA LQDPNVAAFMVEPIQGEAGVVVPDPGYLMGVRELCTRHQVLFIADEIQTGLARTGRWL AVDYENVRPDIVLLGKALSGGLYPVSAVLCDDDIMLTIKPGEHGSTYGGNPLGCRVAI AALEVLEEENLAENADKLGIILRNELMKLPSDVVTAVRGKGLLNAIVIKETKDWDAWK VCLRLRDNGLLAKPTHGDIIRFAPPLVIKEDELRESIEIINKTILSF" mat_peptide 146..1357 /note="mature OAT (AA 1-404)" BASE COUNT 436 a 321 c 382 g 460 t ORIGIN 1 gaattcgtca gatctgtggt ttttctactt gaaggacaca atgttttcca aactagcaca 61 tttgcagagg tttgctgtac ttagtcgcgg agttcattct tcagtggctt ctgctacatc 121 tgttgcaact aaaaaaacag tccaaggccc tccaacctct gatgacattt ttgaaaggga 181 atataagtat ggtgcacaca actaccatcc tttacctgta gccctggaga gaggaaaagg 241 tatttactta tgggatgtag aaggcagaaa atattttgac ttcctgagtt cttacagtgc 301 tgtcaaccaa gggcattgtc accccaagat tgtgaatgct ctgaagagtc aagtggacaa 361 attgacctta acatctagag ctttctataa taacgtactt ggtgaatatg aggagtatat 421 tactaaactt ttcaactacc acaaagttct tcctatgaat acaggagtgg aggctggaga 481 gactgcctgt aaactagctc gtaagtgggg ctataccgtg aagggcattc agaaatacaa 541 agcaaagatt gtttttgcag ctgggaactt ctggggtagg acgttgtctg ctatctccag 601 ttccacagac ccaaccagtt acgatggttt tggaccattt atgccgggat tcgacatcat 661 tccctataat gatctgcccg cactggagcg tgctcttcag gatccaaatg tggctgcgtt 721 catggtagaa ccaattcagg gtgaagcagg cgttgttgtt ccggatccag gttacctaat 781 gggagtgcga gagctctgca ccaggcacca ggttctcttt attgctgatg aaatacagac 841 aggattggcc agaactggta gatggctggc tgttgattat gaaaatgtca gacctgatat 901 agtcctcctt ggaaaggccc tttctggggg cttataccct gtgtctgcag tgctgtgtga 961 tgatgacatc atgctgacca ttaagccagg ggagcatggg tccacatacg gtggcaatcc 1021 actaggctgc cgagtggcca tcgcagccct tgaggtttta gaagaagaaa accttgctga 1081 aaatgcagac aaattgggca ttatcttgag aaatgaactc atgaagctac cttctgatgt 1141 tgtaactgcc gtaagaggaa aaggattatt aaacgctatt gtcattaaag aaaccaaaga 1201 ttgggatgct tggaaggtgt gtctacgact tcgagataat ggacttctgg ccaagccaac 1261 ccatggcgac attatcaggt ttgcgcctcc gctggtgatc aaggaggatg agcttcgaga 1321 gtccattgaa attattaaca agaccatctt gtctttctga gggtagccag ctgttttcag 1381 tggtccctgg gagccagctg gagacaggtg gtcctgtaaa agctttattc ctaatgtggg 1441 cacattccac tcccatgagt cttcaaaaac tttttttttg aatatatttt tttcagttga 1501 tacataatag aacaacgttt atgaacctgc cgtttgcttt gtaacgtaac taaataatgt 1561 aatggcatct atattcagtt gaagtgtttt gatgaattc // LOCUS HSKSA 1504 bp RNA PRI 22-MAR-1995 DEFINITION Human mRNA for adenocarcinoma-associated antigen (KSA). ACCESSION X14758 NID g34186 KEYWORDS antigen; cell surface glycoprotein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1504) AUTHORS Strnad,J., Hamilton,A.E., Beavers,L.S., Gamboa,G.C., Apelgren,L.D., Taber,L.D., Sportsman,J.R., Bumol,T.F., Sharp,J.D. and Gadski,R.A. TITLE Molecular cloning and characterization of a human adenocarcinoma/epithelial cell surface antigen complementary DNA JOURNAL Cancer Res. 49 (2), 314-317 (1989) MEDLINE 89089570 FEATURES Location/Qualifiers source 1..1504 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="UCLA-P3" /clone_lib="lambda gt11." sig_peptide 155..223 /note="put. signal peptide (most likely cleavage site) (AA -23 to -1)" CDS 155..1099 /note="KSA preproantigen peptide" /codon_start=1 /db_xref="PID:g34187" /db_xref="SWISS-PROT:P16422" /translation="MAPPQVLAFGLLLAAATATFAAAQEECVCENYKLAVNCFVNNNR QCQCTSVGAQNTVICSKLAAKCLVMKAEMNGSKLGRRAKPEGALQNNDGLYDPDCDES GLFKAKQCNGTSTCWCVNTAGVRRTDKDTEITCSERVRTYWIIIELKHKAREKPYDSK SLRTALQKEITTRYQLDPKFITSILYENNVITIDLVQNSSQKTQNDVDIADVAYYFEK DVKGESLFHSKKMDLTVNGEQLDLDPGQTLIYYVDEKAPEFSMQGLKAGVIAVIVVVV MAVVAGIVVLVISRKKRMAKYEKAEIKEMGEMHRELNA" misc_feature 224..1096 /note="KSA proantigen (AA 1-291)" mat_peptide 398..1096 /note="mature KSA antigen (AA 59-291)" polyA_site 1504 /note="polyA site" BASE COUNT 442 a 302 c 356 g 404 t ORIGIN 1 gagcgagcac cttcgacgcg gtccggggac cccctcgtcg ctgtcctccc gacgcggacc 61 cgcgtgcccc aggcctcgcg ctgcccggcc ggctcctcgt gtcccactcc cggcgcacgc 121 cctcccgcgc ccctcttctc ggcgcgcgcg cagcatggcg cccccgcagg tcctcgcgtt 181 cgggcttctg cttgccgcgg cgacggcgac ttttgccgca gctcaggaag aatgtgtctg 241 tgaaaactac aagctggccg taaactgctt tgtgaataat aatcgtcaat gccagtgtac 301 ttcagttggt gcacaaaata ctgtcatttg ctcaaagctg gctgccaaat gtttggtgat 361 gaaggcagaa atgaatggct caaaacttgg gagaagagca aaacctgaag gggccctcca 421 gaacaatgat gggctttatg atcctgactg cgatgagagc gggctcttta aggccaagca 481 gtgcaacggc acctccacgt gctggtgtgt gaacactgct ggggtcagaa gaacagacaa 541 ggacactgaa ataacctgct ctgagcgagt gagaacctac tggatcatca ttgaactaaa 601 acacaaagca agagaaaaac cttatgatag taaaagtttg cggactgcac ttcagaagga 661 gatcacaacg cgttatcaac tggatccaaa atttatcacg agtattttgt atgagaataa 721 tgttatcact attgatctgg ttcaaaattc ttctcaaaaa actcagaatg atgtggacat 781 agctgatgtg gcttattatt ttgaaaaaga tgttaaaggt gaatccttgt ttcattctaa 841 gaaaatggac ctgacagtaa atggggaaca actggatctg gatcctggtc aaactttaat 901 ttattatgtt gatgaaaaag cacctgaatt ctcaatgcag ggtctaaaag ctggtgttat 961 tgctgttatt gtggttgtgg tgatggcagt tgttgctgga attgttgtgc tggttatttc 1021 cagaaagaag agaatggcaa agtatgagaa ggctgagata aaggagatgg gtgagatgca 1081 tagggaactc aatgcataac tatataattt gaagattata gaagaaggga aatagcaaat 1141 ggacacaaat tacaaatgtg tgtgcgtggg acgaagacat ctttgaaggt catgagtttg 1201 ttagtttaac atcatatatt tgtaatagtg aaacctgtac tcaaaatata agcagcttga 1261 aactggcttt accaatcttg aaatttgacc acaagtgtct tatatatgca gatctaatgt 1321 aaaatccaga acttggactc catcgttaaa attatttatg tgtaacattc aaatgtgtgc 1381 attaaatatg cttccacagt aaaatctgaa aaactgattt gtgattgaaa gctgcctttc 1441 tatttacttg agtcttgtac atacatactt ttttatgagc tatgaaataa aacattttaa 1501 actg // LOCUS HSKUPMR 1302 bp RNA PRI 13-MAY-1992 DEFINITION Human KUP mRNA for protein with two zinc fingers. ACCESSION X16576 NID g34191 KEYWORDS zinc finger protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1302) AUTHORS Chardin,P. TITLE Direct Submission JOURNAL Submitted (21-SEP-1989) Chardin P., INSERM U-248, 10 avenue de Verdun, 75010 Paris, France REFERENCE 2 (bases 1 to 1302) AUTHORS Chardin,P., Courtois,G., Mattei,M.G. and Gisselbrecht,S. TITLE The KUP gene, located on human chromosome 14, encodes a protein with two distant zinc fingers JOURNAL Nucleic Acids Res. 19 (7), 1431-1436 (1991) MEDLINE 91227131 FEATURES Location/Qualifiers source 1..1302 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="pheochromocytoma" /clone_lib="lambda gt10" gene 1..1302 /gene="KUP" CDS 1..1302 /gene="KUP" /codon_start=1 /product="KUP protein" /db_xref="PID:g34192" /db_xref="SWISS-PROT:P24278" /translation="MDTASHSLVLLQQLNMQREFGFLCDCTVAIGDVYFKAHRAVLAA FSNYFKMIFIHQTSECIKIQPTDIQPDIFSYLLHIMYTGKGPKQIVDHSRLEEGIRFL HADYLSHIATEMNQVFSPETVQSSNLYGIQISTTQKTVVKQGLEVKEAPSSNSGNRAA VQGDLPQLSLAIGLDDGTADQQRACPATQALEEHQKPPVSIKQERCDPESVISQSHPS PSSEVTGPTFTENSVKIHLCHYCGERFDSRSNLRQHLHTHVSGSLPFGVPASILESND LGEVHPLNENSEALECRRLSSFIVKENEQQPDHTNRGTTEPLQISQVSLISKDTEPVE LNCNFSFSRKRKMSCTICGHKFPRKSQLLEHMYTHKGKSYRYNRCQRFGNALAQRFQP YCDSWSDVSLKSSRLSQEHLDLPCALESELTQENVDTILVE" BASE COUNT 378 a 318 c 276 g 330 t ORIGIN 1 atggacactg ccagccatag ccttgttctt ctccagcagc tgaacatgca gcgagaattt 61 ggttttctgt gtgattgcac agttgcaatt ggagatgttt acttcaaagc ccacagagca 121 gtgcttgctg ctttttctaa ctatttcaag atgatattta ttcaccaaac aagtgaatgc 181 ataaaaatac aaccaactga catccaacct gacatattca gctatttgtt gcacattatg 241 tacacgggga aagggccaaa acagattgtg gatcatagtc gtttggagga agggattcga 301 tttcttcacg ccgactacct ttctcacatt gcaactgaaa tgaatcaagt gttctcacca 361 gagactgtgc agtcctcaaa tttatatggc attcagatct caacaaccca aaaaacagtt 421 gtcaaacaag gactggaggt caaagaagct ccttccagta acagtggaaa cagagctgct 481 gtccagggtg acctccccca gttgtctctt gctattggtc tggatgatgg cactgcagac 541 cagcagaggg cctgtcctgc cacccaggcc ctggaggagc accagaagcc cccagtttcc 601 atcaagcagg agagatgtga cccagaatct gtgatctccc agagccaccc ctcaccctca 661 tcagaggtga caggccccac ttttactgaa aacagtgtca aaatacactt atgccattac 721 tgtggggaac gttttgattc ccgtagtaac ctaaggcaac atctccatac acatgtgtct 781 ggatccctgc cattcggtgt ccctgcttcc attctggaaa gtaatgacct tggtgaagtg 841 catcccctta atgaaaacag cgaggccctt gaatgccgca ggctcagctc cttcattgtt 901 aaggagaatg aacagcagcc agaccacacc aaccggggta ccacagagcc tttgcagatc 961 agtcaagtat ctttgatctc caaagacaca gagccagtag aattaaactg taatttttct 1021 ttttcaagga aaagaaaaat gagctgtacc atctgtggtc ataaattccc tcgaaagagc 1081 caattgttgg aacacatgta tacacacaaa ggtaaatctt acagatataa ccgatgccaa 1141 aggtttggta atgcattggc ccagagattt cagccatact gtgacagctg gtctgatgtc 1201 tccctgaaaa gttctcgctt gtcacaagaa cacttagact tgccttgtgc cttagagtca 1261 gagctcacac aagaaaatgt ggatactatc ctagttgagt ag // LOCUS HSKYNU3MO 1999 bp RNA PRI 01-JUL-1997 DEFINITION Homo sapiens mRNA for kynurenine 3-monooxygenase. ACCESSION Y13153 NID g2239123 KEYWORDS kynurenine 3-monooxygenase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1999) AUTHORS Alberati-Giani,D., Cesura,A.M., Broger,C., Warren,W.D., Roever,S. and Malherbe,P. TITLE Cloning and functional expression of human Kynurenine 3-monooxygenase JOURNAL Unpublished REFERENCE 2 (bases 1 to 1999) AUTHORS Alberati-Giani,D. TITLE Direct Submission JOURNAL Submitted (13-MAY-1997) D. Alberati-Giani, F. Hoffmann-La Roche, Pharma Div., Preclinical Res., CNS, PRPN-D, 70/307B, CH-4070 Basel, SWITZERLAND FEATURES Location/Qualifiers source 1..1999 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hKMOC15" /clone_lib="Lambda uni-ZAP cDNA library" /dev_stage="adult" /tissue_type="liver" CDS 53..1513 /EC_number="1.14.13.9" /codon_start=1 /product="kynurenine 3-monooxygenase" /db_xref="PID:e319208" /db_xref="PID:g2239124" /translation="MDSSVIQRKKVAVIGGGLVGSLQACFLAKRNFQIDVYEAREDTR VATFTRGRSINLALSHRGRQALKAVGLEDQIVSQGIPMRARMIHSLSGKKSAIPYGTK SQYILSVSRENLNKDLLTAAEKYPNVKMHFNHRLLKCNPEEGMITVLGSDKVPKDVTC DLIVGCDGAYSTVRSHLMKKPRFDYSQQYIPHGYMELTIPPKNGDYAMEPNYLHIWPR NTFMMIALPNMNKSFTCTLFMPFEEFEKLLTSNDVVDFFQKYFPDAIPLIGEKLLVQD FFLLPAQPMISVKCSSFHFKSHCVLLGDAAHAIVPFFGQGMNAGFEDCLVFDELMDKF SNDLSLCLPVFSRLRIPDDHAISDLSMYNYIEMRAHVNSSWFIFQKNMERFLHAIMPS TFIPLYTMVTFSRIRYHEAVQRWHWQKKVINKGLFFLGSLIAISSTYLLIHYMSPRSF LCLRRPWNWIAHFRNTTCFPAKAVDSLEQISNLISR" BASE COUNT 612 a 411 c 400 g 576 t ORIGIN 1 gaattcggca cgagcagaag caacaataat tgtgaaaaat acttcagcag ttatggactc 61 atctgtcatt caaaggaaaa aagtagctgt cattggtggt ggcttggttg gctcattaca 121 agcatgcttt cttgcaaaga ggaatttcca gattgatgta tatgaagcta gggaagatac 181 tcgagtggct accttcacac gtggaagaag cattaactta gccctttctc atagaggacg 241 acaagccttg aaagctgttg gcctggaaga tcagattgta tcccaaggta ttcccatgag 301 agcaagaatg atccactctc tttcaggaaa aaagtctgca attccctatg ggacaaagtc 361 tcagtatatt ctttctgtaa gcagagaaaa tctaaacaag gatctattga ctgctgctga 421 gaaatacccc aatgtgaaaa tgcactttaa ccacaggctg ttgaaatgta atccagagga 481 aggaatgatc acagtgcttg gatctgacaa agttcccaaa gatgtcactt gtgacctcat 541 tgtaggatgt gatggagcct attcaactgt cagatctcac ctgatgaaga aacctcgctt 601 tgattacagt cagcagtaca ttcctcatgg gtacatggag ttgactattc cacctaagaa 661 cggagattat gccatggaac ctaattatct gcatatttgg cctagaaata cctttatgat 721 gattgcactt cctaacatga acaaatcatt cacatgtact ttgttcatgc cctttgaaga 781 gtttgaaaaa cttctaacca gtaatgatgt ggtagatttc ttccagaaat actttccgga 841 tgccatccct ctaattggag agaaactcct agtgcaagat ttcttcctgt tgcctgccca 901 gcccatgata tctgtaaagt gctcttcatt tcactttaaa tctcactgtg tactgctggg 961 agatgcagct catgctatag tgccgttttt tgggcaagga atgaatgcgg gctttgaaga 1021 ctgcttggta tttgatgagt taatggataa attcagtaac gaccttagtt tgtgtcttcc 1081 tgtgttctca agattgagaa tcccagatga tcacgcgatt tcagacctat ccatgtacaa 1141 ttacatagag atgcgagcac atgtcaactc aagctggttc atttttcaga agaacatgga 1201 gagatttctt catgcgatta tgccatcgac ctttatccct ctctatacaa tggtcacttt 1261 ttccagaata agataccatg aggctgtgca gcgttggcat tggcaaaaaa aggtgataaa 1321 caaaggactc tttttcttgg gatcactgat agccatcagc agtacctacc tacttataca 1381 ctacatgtca ccacgatctt tcctctgctt gagaagacca tggaactgga tagctcactt 1441 ccggaataca acatgtttcc ccgcaaaggc cgtggactcc ctagaacaaa tttccaatct 1501 cattagcagg tgatagaaag gttttgtggt agcaaatgca tgatttctct gtgaccaaaa 1561 ttaagcatga aaaaaatgtt tccattgcca tatttgattc actagtggaa gatagtgttc 1621 tgcttataat taaactgaat gtagagtatc tctgtatgtt aattgcaatt actggttggg 1681 gggtgcattt taaaagatga aacatgcagc ttccctacat tacacacact caggttgagt 1741 cattctaact ataaaagtgc aatgactaag atccttcact tctctgaaag taaggcccta 1801 gatgcctcag ggaagacagt aatcatgcct tttctttaaa agacacaata ggactcgcaa 1861 cagcattgac tcaacaccta ggactaaaaa tcacaactta actagcatgt taactgcact 1921 tttcattacg tgaatggaac ttacctaacc acagggctca gacttactag ataaaaccag 1981 aaatggaaat aaggaattc // LOCUS HSL17ARP 479 bp RNA PRI 02-DEC-1991 DEFINITION Human mRNA for HL23 ribosomal protein homologue. ACCESSION X55954 NID g34193 KEYWORDS ribosomal protein; ribosomal protein L17A. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 471) AUTHORS Berchtold,M.W. TITLE Direct Submission JOURNAL Submitted (18-SEP-1990) Berchtold M.W., University of Zuerich, Dept of Pharmacology & Biochemistry, Winterthurerstr 190, Zuerich 8057, Switzerland REFERENCE 2 (bases 1 to 471) AUTHORS Berchtold,M.W. and Berger,M.C. TITLE Isolation and analysis of a human cDNA highly homologous to the yeast gene encoding L17A ribosomal protein JOURNAL Gene 102 (2), 283-288 (1991) MEDLINE 91340166 FEATURES Location/Qualifiers source 1..479 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="brain" CDS 13..435 /codon_start=1 /product="HL23 ribosomal protein" /db_xref="PID:g34194" /db_xref="SWISS-PROT:P23131" /translation="MSKRGRGGSSGAKFRISLGLPVGAVINCADNTGAKNLYIISVKG IKGRLNRLPAAGVGDMVMATVKKGKPELRKKVHPAVVIRQRKSYRRKDGVFLYFEDNA GVIVNNKGEMKGSAITGPVAKECADLWPRIASNAGSIA" BASE COUNT 146 a 96 c 128 g 109 t ORIGIN 1 ccggcgttca agatgtcgaa gcgaggacgt ggtgggtcct ctggtgcgaa attccggatt 61 tccttgggtc ttccggtagg agctgtaatc aattgtgctg acaacacagg agccaaaaac 121 ctgtatatca tctccgtgaa ggggatcaag ggacggctga acagacttcc cgctgctggt 181 gtgggtgaca tggtgatggc cacagtcaag aaaggcaaac cagagctcag aaaaaaggta 241 catccagcag tggtcattcg acaacgaaag tcataccgta gaaaagatgg cgtgtttctt 301 tattttgaag ataatgcagg agtcatagtg aacaataaag gcgagatgaa aggttctgcc 361 attacaggac cagtagcaaa ggagtgtgca gacttgtggc cccggattgc atccaatgct 421 ggcagcattg catgattctc cagtatattt gtaaaaaata aaaaaaaact aaacccatt // LOCUS HSL23MR 770 bp RNA PRI 12-SEP-1993 DEFINITION Human L23 mRNA for putative ribosomal protein. ACCESSION X53777 NID g34198 KEYWORDS ribosomal protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 770) AUTHORS Mager,D.L. TITLE Direct Submission JOURNAL Submitted (11-JUL-1990) Mager D.L., B.C. Cancer Research Centre, Terry Fox Laboratory, 601 W. 10th Ave., Vancouver, B.C., Canada V5Z IL3 REFERENCE 2 (bases 1 to 770) AUTHORS Mager,D.L. and Freeman,J.D. TITLE A human gene related to the ribosomal protein L23 gene of Halobacterium marismortui JOURNAL Nucleic Acids Res. 18 (17), 5301 (1990) MEDLINE 90384852 COMMENT Data kindly reviewed (24-SEP-1990) by Mager D. FEATURES Location/Qualifiers source 1..770 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="blood" /cell_type="lymphocytes" CDS 139..693 /note="putative ribosomal protein (AA 1-184)" /codon_start=1 /db_xref="PID:g34199" /db_xref="SWISS-PROT:P18621" /translation="MVRYSLDPENPTKSCKSRGSNLRVHFKNTRETAQAIKGMHIRKA TKYLKDVTLQKQCVPFRRYNGGVGRCAQAKQWGWTQGRWPKKSAEFLLHMLKNAESNA ELKGLDVDSLVIEHIQVNKAPKMRRRTYRAHGRINPYMSSPCHIEMILTEKEQIVPKP EEEVAQKKKISQKKLKKQKLMARE" BASE COUNT 288 a 146 c 162 g 174 t ORIGIN 1 aggacacctt tggattaata atgaaaacaa ctactctctg agcagctgtt cgaatcatct 61 gatatttata ctgaatgagt tactgtaagt acgtattgac agaattacac tgtactttcc 121 tctaggtgat ctgtgaaaat ggttcgctat tcacttgacc cggagaaccc cacgaaatca 181 tgcaaatcaa gaggttccaa tcttcgtgtt cactttaaga acactcgtga aactgctcag 241 gccatcaagg gtatgcatat acgaaaagcc acgaagtatc tgaaagatgt cactttacag 301 aaacagtgtg taccattccg acgttacaat ggtggagttg gcaggtgtgc gcaggccaag 361 caatggggct ggacacaagg tcggtggccc aaaaagagtg ctgaattttt gctgcacatg 421 cttaaaaacg cagagagtaa tgctgaactt aagggtttag atgtagattc tctggtcatt 481 gagcatatcc aagtgaacaa agcacctaag atgcgccgcc ggacctacag agctcatggt 541 cggattaacc catacatgag ctctccctgc cacattgaga tgatccttac ggaaaaggaa 601 cagattgttc ctaaaccaga agaggaggtt gcccagaaga aaaagatatc ccagaagaaa 661 ctgaagaaac aaaaacttat ggcacgggag taaattcagc attaaaataa atgtaattaa 721 aaagaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa // LOCUS HSL35A 430 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for ribosomal protein L35a. ACCESSION X52966 NID g34200 KEYWORDS ribosomal protein; ribosomal protein L35a. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 430) AUTHORS Herzog,H. TITLE Direct Submission JOURNAL Submitted (09-MAY-1990) Herzog H., Institut fuer Biochemie (Nat. Fak), Peter Mayerstrasse 1a, A-6020 Innsbruck, Austria REFERENCE 2 (bases 1 to 430) AUTHORS Herzog,H., Hofferer,L., Schneider,R. and Schweiger,M. TITLE cDNA encoding the human homologue of rat ribosomal protein L35a JOURNAL Nucleic Acids Res. 18 (15), 4600 (1990) MEDLINE 90356408 COMMENT Data kindly reviewed (13-AUG-1990) by Herzog H. FEATURES Location/Qualifiers source 1..430 /organism="Homo sapiens" /db_xref="taxon:9606" misc_feature 1..9 /note="pyrimidine rich sequence" CDS 63..395 /note="ribosomal protein L35a (AA 1-110)" /codon_start=1 /db_xref="PID:g34201" /db_xref="SWISS-PROT:P18077" /translation="MSGRLWSKAIFAGYKRGLRNQREHTALLKIEGVYARDETEFYLG KRCAYVYKAKNNTVTPGGKPNKTRVIWGKVTRAHGNSGMVLAKFRSNLPAKAIGHRIR VMLYPSRI" misc_feature 410..416 /note="polyadenylation site" BASE COUNT 130 a 96 c 102 g 102 t ORIGIN 1 cttctcttac cgccatcttg gctcctgtgg aggcctgctg gaacggactt ctaaaaggaa 61 ctatgtctgg aaggctgtgg tccaaggcca tttttgctgg ctataagcgg ggtctccgga 121 accaaaggga gcacacagct cttcttaaaa ttgaaggtgt ttacgcccga gatgaaacag 181 aattctattt gggcaagaga tgcgcttatg tatataaagc aaagaacaac acagtcactc 241 ctggcggcaa accaaacaaa accagagtca tctggggaaa agtaactcgg gcccatggaa 301 acagtggcat ggttcttgcc aaattccgaa gcaatcttcc tgctaaggcc attggacaca 361 gaatccgagt gatgctgtac ccctcaagga tttaaactaa cgaaaaatca ataaataatt 421 gtggatttgt // LOCUS HSLACT 833 bp RNA PRI 12-SEP-1993 DEFINITION Messenger RNA for human prolactin. ACCESSION V00566 J00299 NID g34210 KEYWORDS complementary DNA; lactin; signal peptide. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 833) AUTHORS Cooke,N.E., Coit,D., Shine,J., Baxter,J.D. and Martial,J.A. TITLE Human prolactin. cDNA structural analysis and evolutionary comparisons JOURNAL J. Biol. Chem. 256 (8), 4007-4016 (1981) MEDLINE 81168179 FEATURES Location/Qualifiers source 1..833 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA 1..833 /note="messenger rna" CDS 5..688 /note="reading frame prolactin" /codon_start=1 /db_xref="PID:g34211" /db_xref="SWISS-PROT:P01236" /translation="MNIKGSPWKGSLLLLLVSNLLLCQSVAPLPICPGGAARCQVTLR DLFDRAVVLSHYIHNLSSEMFSEFDKRYTHGRGFITKAINSCHTSSLATPEDKEQAQQ MNQKDFLSLIVSILRSWNEPLYHLVTEVRGMQEAPEAILSKAVEIEEQTKRLLEGMEL IVSQVHPETKENEIYPVWSGLPSLQMADEESRLSAYYNLLHCLRRDSHKIDNYLKLLK CRIIHNNNC" BASE COUNT 218 a 233 c 180 g 202 t ORIGIN 1 aaacatgaac atcaaaggat cgccatggaa agggtccctc ctgctgctgc tggtgtcaaa 61 cctgctgctg tgccagagcg tggccccctt gcccatctgt cccggcgggg ctgcccgatg 121 ccaggtgacc cttcgagacc tgtttgaccg cgccgtcgtc ctgtcccact acatccataa 181 cctctcctca gaaatgttca gcgaattcga taaacggtat acccatggcc gggggttcat 241 taccaaggcc atcaacagct gccacacttc ttcccttgcc acccccgaag acaaggagca 301 agcccaacag atgaatcaaa aagactttct gagcctgata gtcagcatat tgcgatcctg 361 gaatgagcct ctgtatcatc tggtcacgga agtacgtggt atgcaagaag ccccggaggc 421 tatcctatcc aaagctgtag agattgagga gcaaaccaaa cggcttctag agggcatgga 481 gctgatagtc agccaggttc atcctgaaac caaagaaaat gagatctacc ctgtctggtc 541 gggacttcca tccctgcaga tggctgatga agagtctcgc ctttctgctt attataacct 601 gctccactgc ctacgcaggg attcacataa aatcgacaat tatctcaagc tcctgaagtg 661 ccgaatcatc cacaacaaca actgctaagc ccacatccat ttcatctatt tctgagaagg 721 tccttaatga tccgttccat tgcaagcttc ttttagttgt atctcttttg aatccatgct 781 tgggtgtaac aggtctcctc ttaaaaaata aaaactgact cgttagagac atc // LOCUS HSLACTG 3310 bp DNA PRI 24-APR-1993 DEFINITION Human alpha-lactalbumin gene. ACCESSION X05153 NID g34212 KEYWORDS alpha-lactalbumin; Alu repetitive sequence; lactalbumin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3310) AUTHORS Hall,L., Emery,D.C., Davies,M.S., Parker,D. and Craig,R.K. TITLE Organization and sequence of the human alpha-lactalbumin gene JOURNAL Biochem. J. 242 (3), 735-742 (1987) MEDLINE 87241386 COMMENT Data kindly reviewed (01-OCT-1987) by HALL L. FEATURES Location/Qualifiers source 1..3310 /organism="Homo sapiens" /db_xref="taxon:9606" misc_feature 596..625 /note="pot.regulatory sequence" TATA_signal 706..712 prim_transcript 736..3097 exon 736..894 /number=1 mRNA join(736..894,1542..1700,2190..2265,2765..3097) sig_peptide 762..818 CDS join(762..894,1542..1700,2190..2265,2765..2825) /codon_start=1 /product="alpha-lactalbumin precursor" /db_xref="PID:g296662" /db_xref="SWISS-PROT:P00709" /translation="MRFFVPLFLVGILFPAILAKQFTKCELSQLLKDIDGYGGIALPE LICTMFHTSGYDTQAIVENNESTEYGLFQISNKLWCKSSQVPQSRNICDISCDKFLDD DITDDIMCAKKILDIKGIDYWLAHKALCTEKLEQWLCEKL" CDS join(819..894,1542..1700,2190..2265,2765..2822) /note="Author-given protein sequence is in conflict with the conceptual translation." /codon_start=1 /product="mature alpha-lactalbumin" /db_xref="PID:e74492" /db_xref="PID:g1335200" /translation="KQFTKCELSQLLKDIDGYGGIALPELICTMFHTSGYDTQAIVEN NESTEYGLFQISNKLWCKSSQVPQSRNICDISCDKFLDDDITDDIMCAKKILDIKGID YWLAHKALCTEKLEQWLCEKL" intron 895..1541 /number=1 repeat_region complement(1069..1351) /note="Alu repetitive sequence" exon 1542..1700 /number=2 intron 1701..2189 /number=2 exon 2190..2265 /number=3 intron 2266..2764 /number=3 exon 2765..3097 /number=4 polyA_signal 3075..3080 polyA_site 3098 BASE COUNT 770 a 821 c 687 g 1032 t ORIGIN 1 gagctcctgg gctcaagtga tccaccagac tcggcctccc aaaatgccgg gattacaggt 61 gtgagccact gtgcctggcc tagatgcttt catacaggct tttcaattat gcattttcct 121 taagtaggaa gtcttaagat ccaagttata tcggattgtt gtagtctacg ttcccatatt 181 ctattcctat ttctgagcct tcagtcatga gctaccatat taaagaacta attctgggcc 241 ttgttacatg gctggattgg ttggacaagt gccagctctg atcctgggac tgtggcatgt 301 gatgacatac accccctctc cacattctgc atgtctctag gggggaaggg ggaagctcgg 361 tatagaacct ttattgtatt ttctgattgc ctcacttctt atattgcccc catgcccttc 421 tttgttcctc aagtaaccag agacagtgct tcccagaacc aaccctacaa gaaacaaagg 481 gctaaacaaa gccaaatggg aagcaggatc atggtttgaa ctctttctgg ccagagaaca 541 atacctgcta tggactagat actgggagag ggaaaggaaa agtagggtga attatggaag 601 gaagctggca ggctcagcgt ttctgtcttg gcatgaccag tctctcttca ttctcttcct 661 agatgtaggg cttggtacca gagcccctga ggctttctgc atgaatataa ataaatgaaa 721 ctgagtgatg cttccatttc aggttcttgg gggtagccaa aatgaggttc tttgtccctc 781 tgttcctggt gggcatcctg ttccctgcca tcctggccaa gcaattcaca aaatgtgagc 841 tgtcccagct gctgaaagac atagatggtt atggaggcat cgctttgcct gaatgtgagt 901 tccctgcctc tgtgtttcat ccattcctca tacgcttctc tcctccatcc cctctttctt 961 ccacttcgcc cctccacttt tacttaatta tctaatcatc ctcttttctg ctcatttgca 1021 tactctttta tttcatgtat gtatatatgt atgtatttat ttatttttga ggtggagttt 1081 cgctcttgtt gcccagactg gagtgcaatg gtgtaatctc ggctcactgc aacctccgcc 1141 tcctcggttc aagtgattct cctgcctcag cctcccaagt agctggaatt acaggcaccc 1201 accaccatgc ctggctaatt ttgtattttt tgtagagaca gggtttcacc atgttggcca 1261 ggctggtctc aaacttctga cctcaggtga tccgccctcc tcagcctccc aaagtgttgg 1321 gattacaagc gtgagccatc atgcctggcc ccatttattt tcctatcctt tctttctctt 1381 attgtctgat ttttttttgg aattctccat ctcatcaaga aactctgagc tttgccatct 1441 ttggagattg gctggaaagc atttttgtct gagaattaca gttcctcctt tatgcagatc 1501 ctgtacatct ctgtggtatc tctttctcat ctttccctca gtgatctgta ccatgtttca 1561 caccagtggt tatgacacac aagccatagt tgaaaacaat gaaagcacgg aatatggact 1621 cttccagatc agtaataagc tttggtgcaa gagcagccag gtccctcagt caaggaacat 1681 ctgtgacatc tcctgtgaca gtgagtagcc cctataaccc tctttctctg tttttctgag 1741 gcctgccctt gggataatct cctttttagt gccaagcaga cctcaggctt cattgccttg 1801 gctgggctct ataaaaattg tgggacttga attggcagta ctgagtaaga agctgtttgg 1861 atttttcatg gtcatcaaat ccccagacag ttccttgagg ttcagtggta gacaatcgga 1921 gctgtctgag agtcttggaa tctgattgtc tgcattttca gggtaagtca gttgatgaag 1981 ctgatgattc ctccagagat atcccaggga aatgaaggaa gtccctaccc agggttagac 2041 attaccacat tggtcctttc atatagaaag acaacaggca caagccttga gtttagagaa 2101 cccactggat ccaggggtta ggggaactca gtgcctttct gggtaatact tgtcagctgt 2161 ctcaatcctt tccctgtaac tcctgccaga gttcctggat gatgacatta ctgatgacat 2221 aatgtgtgcc aagaagatcc tggatattaa aggaattgac tactggtgaa tccttattct 2281 attttctatt tccccatcct ccttctcctt accccattag cccagcaccc ctttcctctt 2341 accctatctc ttggtcattt aatctagaat acagtgtctg aaacaaagct tacctagaga 2401 ctcaggtttc tgttattaag cctctctcgc tccgctcctt ggtagcaatt ttcctaataa 2461 ggggttgcct aatggagggc tcagacccag gcctcctttc acttagactt ggacatctaa 2521 ttccacttgt ttagttctat gccctaaagc aagctgttgg taacattgca tctctttttt 2581 aaccctacaa ttttcttgga tattttttat ggactgtatt ccacttgatg gcttgtgtcg 2641 cttgacatca ggccaggaat gtctttctgt aattctcgtc cacgctcttc cacttcagcc 2701 ctcctgggaa tgaatgtaaa gattcagtca gctaactcac cttgtccccc ttctccatta 2761 tcaggttggc ccataaagcc ctctgcactg agaagctgga acagtggctt tgtgagaagt 2821 tgtgagtgtc tgctgtcctt ggcacccctg cccactccac actcctggaa tacctcttcc 2881 ctaatgccac ctcagtttgt ttctttctgt tcccccaaag cttatctgtc tctgagcctt 2941 gggccctgta gtgacatcac cgaattcttg aagactattt tccagggatg cctgagtggt 3001 gcactgagct ctagaccctt actcagtgcc ttcgatggca ctttcactac agcacagatt 3061 tcacctctgt cttgaataaa ggtcccactt tgaagtcact ggctgtaatt tttttccccc 3121 tggagggaag gggaagaaat aggatgagta ggtggacact gaagccatag gtcatagcca 3181 ccttccatct ctactgaaga agaagtaggc tgaatttaca atagaaaggt gaaggttact 3241 gtctgtacca actcaatgca acaaactttt attgatcacc taatctattc aaggaactgt 3301 agacggatcc // LOCUS HSLAG3 1872 bp RNA PRI 07-OCT-1996 DEFINITION Human LAG-3 mRNA for CD4-related protein involved in lymphocyte activation. ACCESSION X51985 NID g1488611 KEYWORDS cell surface glycoprotein; immune response; immunoglobulin superfamily; transmembrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1872) AUTHORS Triebel,F. TITLE Direct Submission JOURNAL Submitted (26-FEB-1990) Triebel F., Laboratoire d'Immunologie Cellulaire U333, Institut Gustave Roussy rue Camille Desmoulins, 94805 Villejuif, France REMARK Revised by [3] REFERENCE 2 (bases 1 to 1872) AUTHORS Triebel,F., Jitsukawa,S., Baixeras,E., Roman-Roman,S., Genevee,C., Viegas-Pequignot,E. and Hercend,T. TITLE LAG-3, a novel lymphocyte activation gene closely related to CD4 JOURNAL J. Exp. Med. 171, 1393-1405 (1990) MEDLINE 90237736 REFERENCE 3 (bases 1 to 1872) AUTHORS Triebel,F. TITLE Direct Submission JOURNAL Submitted (12-AUG-1996) Triebel F., Laboratoire d'Immunologie Cellulaire U333, Institut Gustave Roussy rue Camille Desmoulins, 94805 Villejuif, France COMMENT Data kindly reviewed (08-OCT-1990) by Triebel F. FEATURES Location/Qualifiers source 1..1872 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="MB-F(5)" sig_peptide 231..296 /note="signal peptide" CDS 231..1808 /codon_start=1 /product="LAG-3 protein precursor" /db_xref="PID:e258342" /db_xref="PID:g1491706" /db_xref="SWISS-PROT:P18627" /translation="MWEAQFLGLLFLQPLWVAPVKPLQPGAEVPVVWAQEGAPAQLPC SPTIPLQDLSLLRRAGVTWQHQPDSGPPAAAPGHPLAPGPHPAAPSSWGPRPRRYTVL SVGPGGLRSGRLPLQPRVQLDERGRQRGDFSLWLRPARRADAGEYRAAVHLRDRALSC RLRLRLGQASMTASPPGSLRASDWVILNCSFSRPDRPASVHWFRNRGQGRVPVRESPH HHLAESFLFLPQVSPMDSGPWGCILTYRDGFNVSIMYNLTVLGLEPPTPLTVYAGAGS RVGLPCRLPAGVGTRSFLTAKWTPPGGGPDLLVTGDNGDFTLRLEDVSQAQAGTYTCH IHLQEQQLNATVTLAIITVTPKSFGSPGSLGKLLCEVTPVSGQERFVWSSLDTPSQRS FSGPWLEAQEAQLLSQPWQCQLYQGERLLGAAVYFTELSSPGAQRSGRAPGALPAGHL LLFLTLGVLSLLLLVTGAFGFHLWRRQWRPRRFSALEQGIHPRQAQSKIEELEQEPEP EPEPEPEPEPEPEPEQL" mat_peptide 297..1805 /note="LAG-3 protein" BASE COUNT 300 a 673 c 523 g 376 t ORIGIN 1 tcaggctgcc tgatctgccc agctttccag ctttcctctg gattccggcc tctggtcatc 61 cctccccacc ctctctccaa ggccctctcc tggtctccct tcttctagaa ccccttcctc 121 cacctccctc tctgcagaac ttctccttta ccccccaccc cccaccactg ccccctttcc 181 ttttctgacc tccttttgga gggctcagcg ctgcccagac cataggagag atgtgggagg 241 ctcagttcct gggcttgctg tttctgcagc cgctttgggt ggctccagtg aagcctctcc 301 agccaggggc tgaggtcccg gtggtgtggg cccaggaggg ggctcctgcc cagctcccct 361 gcagccccac aatccccctc caggatctca gccttctgcg aagagcaggg gtcacttggc 421 agcatcagcc agacagtggc ccgcccgctg ccgcccccgg ccatcccctg gcccccggcc 481 ctcacccggc ggcgccctcc tcctgggggc ccaggccccg ccgctacacg gtgctgagcg 541 tgggtcccgg aggcctgcgc agcgggaggc tgcccctgca gccccgcgtc cagctggatg 601 agcgcggccg gcagcgcggg gacttctcgc tatggctgcg cccagcccgg cgcgcggacg 661 ccggcgagta ccgcgccgcg gtgcacctca gggaccgcgc cctctcctgc cgcctccgtc 721 tgcgcctggg ccaggcctcg atgactgcca gccccccagg atctctcaga gcctccgact 781 gggtcatttt gaactgctcc ttcagccgcc ctgaccgccc agcctctgtg cattggttcc 841 ggaaccgggg ccagggccga gtccctgtcc gggagtcccc ccatcaccac ttagcggaaa 901 gcttcctctt cctgccccaa gtcagcccca tggactctgg gccctggggc tgcatcctca 961 cctacagaga tggcttcaac gtctccatca tgtataacct cactgttctg ggtctggagc 1021 ccccaactcc cttgacagtg tacgctggag caggttccag ggtggggctg ccctgccgcc 1081 tgcctgctgg tgtggggacc cggtctttcc tcactgccaa gtggactcct cctgggggag 1141 gccctgacct cctggtgact ggagacaatg gcgactttac ccttcgacta gaggatgtga 1201 gccaggccca ggctgggacc tacacctgcc atatccatct gcaggaacag cagctcaatg 1261 ccactgtcac attggcaatc atcacagtga ctcccaaatc ctttgggtca cctggatccc 1321 tggggaagct gctttgtgag gtgactccag tatctggaca agaacgcttt gtgtggagct 1381 ctctggacac cccatcccag aggagtttct caggaccttg gctggaggca caggaggccc 1441 agctcctttc ccagccttgg caatgccagc tgtaccaggg ggagaggctt cttggagcag 1501 cagtgtactt cacagagctg tctagcccag gtgcccaacg ctctgggaga gccccaggtg 1561 ccctcccagc aggccacctc ctgctgtttc tcacccttgg tgtcctttct ctgctccttt 1621 tggtgactgg agcctttggc tttcaccttt ggagaagaca gtggcgacca agacgatttt 1681 ctgccttaga gcaagggatt caccctcgcc aggctcagag caagatagag gagctggagc 1741 aagaaccgga gccggagccg gagccggaac cggagcccga gcccgagccc gagccggagc 1801 agctctgacc tggagctgag gcagccagca gatctcagca gcccagtcca aataaacgtc 1861 ctgtctagca gc // LOCUS HSLAL 2626 bp RNA PRI 25-FEB-1994 DEFINITION H.sapiens mRNA for lysosomal acid lipase. ACCESSION X76488 NID g434305 KEYWORDS lysosomal acid lipase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2626) AUTHORS Ameis,D., Merkel,M., Eckerskorn,C. and Greten,H. TITLE Purification, characterization and molecular cloning of human hepatic lysosomal acid lipase JOURNAL Eur. J. Biochem. 219 (3), 905-914 (1994) MEDLINE 94155897 REFERENCE 2 (bases 1 to 2626) AUTHORS Ameis,D. TITLE Direct Submission JOURNAL Submitted (29-NOV-1993) D. Ameis, Medical Department, University Hospital Eppendorf, Martinistrasse 52, 20246 Hamburg, FRG FEATURES Location/Qualifiers source 1..2626 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /chromosome="10" gene 146..1345 /gene="LAL/LIPA" CDS 146..1345 /gene="LAL/LIPA" /EC_number="3.1.1.13" /note="sterol esterase" /codon_start=1 /evidence=experimental /product="lysosomal acid lipase" /db_xref="PID:g434306" /db_xref="SWISS-PROT:P38571" /translation="MKMRFLGLVVCLVLWTLHSEGSGGKLTAVDPETNMNVSEIISYW GFPSEEYLVETEDGYILCLNRIPHGRKNHSDKGPKPVVFLQHGLLADSSNWVTNLANS SLGFILADAGFDVWMGNSRGNTWSRKHKTLSVSQDEFWAFSYDEMAKYDLPASINFIL NKTGQEQVYYVGHSQGTTIGFIAFSQIPELAKRIKMFFALGPVASVAFCTSPMAKLGR LPDHLIKDLFGDKEFLPQSAFLKWLGTHVCTHVILKELCGNLCFLLCGFNERNLNMSR VDVYTTHSPAGTSVQNMLHWSQAVKFQKFQAFDWGSSAKNYFHYNQSYPPTYNVKDML VPTAVWSGGHDWLADVYDVNILLTQITNLVFHESIPEWEHLDFIWGLDAPWRLYNKII NLMRKYQ" sig_peptide 146..226 /gene="LAL/LIPA" /evidence=experimental misc_feature 227..373 /gene="LAL/LIPA" /note="propeptide" /evidence=experimental mat_peptide 374..1342 /gene="LAL/LIPA" /evidence=experimental /product="lysosomal acid lipase" BASE COUNT 741 a 527 c 547 g 811 t ORIGIN 1 attggctgaa caaatagtcc cgagggtggt gctaccgccc tcccgacaag gcagaccagg 61 ccccctgcag gtcccctatc cgcaccccgg cccctgagag ctggcactgc gactcgagac 121 agcggcccgg caggacagct ccagaatgaa aatgcggttc ttggggttgg tggtctgttt 181 ggttctctgg accctgcatt ctgaggggtc tggagggaaa ctgacagctg tggatcctga 241 aacaaacatg aatgtgagtg aaattatctc ttactgggga ttccctagtg aggaatacct 301 agttgagaca gaagatggat atattctgtg ccttaaccga attcctcatg ggaggaagaa 361 ccattctgac aaaggtccca aaccagttgt cttcctgcaa catggcttgc tggcagattc 421 tagtaactgg gtcacaaacc ttgccaacag cagcctgggc ttcattcttg ctgatgctgg 481 ttttgacgtg tggatgggca acagcagagg aaatacctgg tctcggaaac ataagacact 541 ctcagtttct caggatgaat tctgggcttt cagttatgat gagatggcaa aatatgacct 601 accagcttcc attaacttca ttctgaataa aactggccaa gaacaagtgt attatgtggg 661 tcattctcaa ggcaccacta taggttttat agcattttca cagatccctg agctggctaa 721 aaggattaaa atgttttttg ccctgggtcc tgtggcttcc gtcgccttct gtactagccc 781 tatggccaaa ttaggacgat taccagatca tctcattaag gacttatttg gagacaaaga 841 atttcttccc cagagtgcgt ttttgaagtg gctgggtacc cacgtttgca ctcatgtcat 901 actgaaggag ctctgtggaa atctctgttt tcttctgtgt ggatttaatg agagaaattt 961 aaatatgtct agagtggatg tatatacaac acattctcct gctggaactt ctgtgcaaaa 1021 catgttacac tggagccagg ctgttaaatt ccaaaagttt caagcctttg actggggaag 1081 cagtgccaag aattattttc attacaacca gagttatcct cccacataca atgtgaagga 1141 catgcttgtg ccgactgcag tctggagcgg gggtcacgac tggcttgcag atgtctacga 1201 cgtcaatatc ttactgactc agatcaccaa cttggtgttc catgagagca ttccggaatg 1261 ggagcatctt gacttcattt ggggcctgga tgccccttgg aggctttata ataaaattat 1321 taatctaatg aggaaatatc agtgaaagct ggacttgagc tgtgtaccac caagtcaatg 1381 attatgtcat gtgaaaatgt gtttgcttca tttctgtaaa acacttgttt ttctttccca 1441 ggtcttttgt ttttttatat ccaagaaaat gataactttg aagatgccca gttcactcta 1501 gtttcaatta gaaacatact agctattttt cctttaatta gggctggaat aggaagccag 1561 tgtctcaacc atagtattgt ctctttaagt cttttaaata tcactgatgt gtaaaaaggt 1621 cattatatcc attctgtttt taaaatttaa aatatattga ctttttgccc ttcataggac 1681 aaagtaatat atgtgttgga attttaaaat tgtgttgtca ttggtaaatc tgtcactgac 1741 ttaagcgagg tataaaagta cgcagttttc atgtccttgc cttaaagagc tctctagtct 1801 aacggtcttg tagttagaga tctaaatgac attttatcat gttttcctgc agcaggtgca 1861 tagtcaaatc cagaaatatc acagctgtgc cagtaataag gatgctaaca attaatttta 1921 tcaaacctaa ctgtgacagc tgtgatttga cacgttttaa ttgctcaggt taaatgaaat 1981 agttttccgg cgtcttcaaa aacaaattgc actgataaaa caaaaacaaa agtatgtttt 2041 aaatgctttg aagactgata cactcaacca tctatattca tgagctctca atttcatggc 2101 aggccatagt tctacttatc tgagaagcaa atccctgtgg agactatacc actatttttc 2161 ctgagattaa tgtactcttg gagcccgcta ctgtcgttat tgatcacatc tgtgtgaagc 2221 caaagccccg tggttgccca tgagaagtgt cctagttcat tttcacccaa atgaagtgtg 2281 aacgtgatgt tttcggatgc aaactcagct cagggattca ttttgtgtct tagttttata 2341 tgcatcctta tttttaatac acctgcttca cgtccctatg ttgggaagtc catatttgtc 2401 tgcttttctt gcagcatcat ttcctggaca atactgtccg gtggacaaaa tgacaattga 2461 tatgttttcc tgatataatt actttagctg cactaacagt acaatgcttg ttaatggtta 2521 atataggcag ggcgaatact actttgtaac ttttaaagtc ttaaactttt caataaaatt 2581 gagtgagact tataggccca aaaaaaaaaa aaaaaaaaaa aaaaaa // LOCUS HSLAMAR 2404 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for nuclear envelope protein lamin A precursor. ACCESSION X03444 NID g34227 KEYWORDS intermediate filament; lamin A. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2404) AUTHORS McKeon,F.D., Kirschner,M.W. and Caput,D. TITLE Homologies in both primary and secondary structure between nuclear envelope and intermediate filament proteins JOURNAL Nature 319 (6053), 463-468 (1986) MEDLINE 86118697 FEATURES Location/Qualifiers source 1..2404 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 211..2319 /note="put. lamin A precursor (aa 1-702)" /codon_start=1 /db_xref="PID:g34228" /db_xref="SWISS-PROT:P02545" /translation="METPSQRRATRSGAQASSTPLSPTRITRLQEKEDLQELNDRLAV YIDRVRSLETENAGLRLRITESEEVVSREVSGIKAAYEAELGDARKTLDSVAKERARL QLELSKVREEFKELKARNTKKEGDLIAAQARLKDLEALLNSKEAALSTALSEKRTLEG ELHDLRGQVAKLEAALGEAKKQLQDEMLRRVDAENRLQTMKEELDFQKNIYSEELRET KRRHETRLVEIDNGKQREFESRLADALQELRAQHEDQVEQYKKELEKTYSAKLDNARQ SAERNSNLVGAAHEELQQSRIRIDSLSAQLSQLQKQLAAKEAKLRDLEDSLARERDTS RRLLAEKEREMAEMRARMQQQLDEYQELLDIKLALDMEIHAYRKLLEGEEERLRLSPS PTSQRSRGRASSHSSQTQGGGSVTKKRKLESTESRSSFSQHARTSGRVAVEEVDEEGK FVRLRNKSNEDQSMGNWQIKRQNGDDPLLTYRFPPKFTLKAGQVVTIWAAGAGATHSP PTDLVWKAQNTWGCGNSLRTALINSTGEEVAMRKLVRSVTVVEDDEDEDGDDLLHHHH GSHCSSSGDPAEYNLRLAHRAVRDLRAACRQGICQRLRSPGGRTHLLWLFCLQCHGHS QLPQCGGQWGWQLRGQSGHPLLPPGQLQPPNPEPPELQHHVIWDLPGRGGGGGFLRPP HLMPTPCPARHGRGLEAKEK" polyA_site 2404 /note="polyA site" BASE COUNT 513 a 732 c 780 g 379 t ORIGIN 1 actcagtgtt cgcgggagcc gcacctacac cagccaaccc agatcccgag gtccgacagc 61 gcccggccca gatccccacg cctgccagga gcaagccgag agccagccgg ccggcgcact 121 ccgactccga gcagtctctg tccttcgacc cgagccccgc gccctttccg ggacccctgc 181 cccgcgggca gcgctgccaa cctgccggcc atggagaccc cgtcccagcg gcgcgccacc 241 cgcagcgggg cgcaggccag ctccactccg ctgtcgccca cccgcatcac ccggctgcag 301 gagaaggagg acctgcagga gctcaatgat cgcttggcgg tctacatcga ccgtgtgcgc 361 tcgctggaaa cggagaacgc agggctgcgc cttcgcatca ccgagtctga agaggtggtc 421 agccgcgagg tgtccggcat caaggccgcc tacgaggccg agctcgggga tgcccgcaag 481 acccttgact cagtagccaa ggagcgcgcc cgcctgcagc tggagctgag caaagtgcgt 541 gaggagttta aggagctgaa agcgcgcaat accaagaagg agggtgacct gatagctgct 601 caggctcggc tgaaggacct ggaggctctg ctgaactcca aggaggccgc actgagcact 661 gctctcagtg agaagcgcac gctggagggc gagctgcatg atctgcgggg ccaggtggcc 721 aagcttgagg cagccctagg tgaggccaag aagcaacttc aggatgagat gctgcggcgg 781 gtggatgctg agaacaggct gcagaccatg aaggaggaac tggacttcca gaagaacatc 841 tacagtgagg agctgcgtga gaccaagcgc cgtcatgaga cccgactggt ggagattgac 901 aatgggaagc agcgtgagtt tgagagccgg ctggcggatg cgctgcagga actgcgggcc 961 cagcatgagg accaggtgga gcagtataag aaggagctgg agaagactta ttctgccaag 1021 ctggacaatg ccaggcagtc tgctgagagg aacagcaacc tggtgggggc tgcccacgag 1081 gagctgcagc agtcgcgcat ccgcatcgac agcctctctg cccagctcag ccagctccag 1141 aagcagctgg cagccaagga ggcgaagctt cgagacctgg aggactcact ggcccgtgag 1201 cgggacacca gccggcggct gctggcggaa aaggagcggg agatggccga gatgcgggca 1261 aggatgcagc agcagctgga cgagtaccag gagcttctgg acatcaagct ggccctggac 1321 atggagatcc acgcctaccg caagctcttg gagggcgagg aggagaggct acgcctgtcc 1381 cccagcccta cctcgcagcg cagccgtggc cgtgcttcct ctcactcatc ccagacacag 1441 ggtgggggca gcgtcaccaa aaagcgcaaa ctggagtcca ctgagagccg cagcagcttc 1501 tcacagcacg cacgcactag cgggcgcgtg gccgtggagg aggtggatga ggagggcaag 1561 tttgtccggc tgcgcaacaa gtccaatgag gaccagtcca tgggcaattg gcagatcaag 1621 cgccagaatg gagatgatcc cttgctgact taccggttcc caccaaagtt caccctgaag 1681 gctgggcagg tggtgacgat ctgggctgca ggagctgggg ccacccacag cccccctacc 1741 gacctggtgt ggaaggcaca gaacacctgg ggctgcggga acagcctgcg tacggctctc 1801 atcaactcca ctggggaaga agtggccatg cgcaagctgg tgcgctcagt gactgtggtt 1861 gaggacgacg aggatgagga tggagatgac ctgctccatc accaccacgg ctcccactgc 1921 agcagctcgg gggaccccgc tgagtacaac ctgcgcctcg cgcaccgtgc tgtgcgggac 1981 ctgcgggcag cctgccgaca aggcatctgc cagcggctca ggagcccagg tgggcggacc 2041 catctcctct ggctcttctg cctccagtgt cacggtcact cgcagctacc gcagtgtggg 2101 gggcagtggg ggtggcagct tcggggacaa tctggtcacc cgctcctacc tcctgggcaa 2161 ctccagcccc cgaacccaga gcccccagaa ctgcagcatc atgtaatctg ggacctgcca 2221 ggcaggggtg ggggtggagg cttcctgcgt cctcctcacc tcatgcccac cccctgccct 2281 gcacgtcatg ggagggggct tgaagccaaa gaaaaataac cctttggttt ttttcttctt 2341 gtattttttt ttctaagaga agttattttc tacagtggtt ttatactgaa ggaaaaacac 2401 aagc // LOCUS HSLAMB2S 5683 bp RNA PRI 05-DEC-1995 DEFINITION H.sapiens LAMB2 mRNA for beta2 laminin. ACCESSION X79683 NID g663206 KEYWORDS lamB2 gene; laminin; laminin B2 chain. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5683) AUTHORS Wewer,U.M., Gerecke,D.R., Durkin,M.E., Kurtz,K.S., Mattei,M.G., Champliaud,M.F., Burgeson,R.E. and Albrechtsen,R. TITLE Human beta 2 chain of laminin (formerly S chain): cDNA cloning, chromosomal localization, and expression in carcinomas JOURNAL Genomics 24 (2), 243-252 (1994) MEDLINE 95213013 REFERENCE 2 (bases 1 to 5683) AUTHORS Wewer,U.M. TITLE Direct Submission JOURNAL Submitted (09-JUN-1994) U.M. Wewer, University of Copenhagen, Laboratory of Molecular Pathology, University Inst. of Pathological Anatomy, Frederik V's Vej 11, Copenhagen 2100, DENMARK REMARK revised by [3] MAT FEATURES Location/Qualifiers source 1..5683 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta, colon" /cell_type="carcinome" /cell_line="human placenta, human colon carcinome" /clone_lib="clontech HL1008" /clone="A, MIP" gene 166..5562 /gene="LAMB2" CDS 166..5562 /gene="LAMB2" /codon_start=1 /product="beta2/S laminin chain" /db_xref="PID:g1335202" /translation="MELTSTERGRGQPLPWELRLPLLLSVLAATLAQAPAPDVPGCSR GSCYPATADLLVGRADRLTASSTCGLNGRQPYCIVSHLQDEKKCFLCDSRRPFSARDN PHTHRIQNVVTSFAPQRRAAWWQSQNGIPAVTIQLDLEAEFHFTHLIMTFKTFRPAAM LVERSADFGRTWHVYRYFSYHCGADFPGVPLAPPRHWDDVVCESRYSEIEPSTEGEVI YRVLDPAIPIPDPYSSRIQNLLKITNLRVNLTRLHTLGDNLLDPRREIREKYYYALYE LVVRGNCFCYGHASECAPAPGAPAHAEGMVHGACICKHNTRGLNCEQCQDFYRDLPWR PAEDGHSHACRKCDRHGHTHSCHFDMAVYLGSGNVSGGVCDGCQHNTAWRHCELCRPF FYRDPTKDLRDPAVCRSCDCDPMGSQDGGRCDSHDDPALGLVSGQCRCKEHVVGTRCQ QCRDGFFGLSISDPSGCRRCQCNARGTVPGSTPCDPNSGSCYCKRLVTGRGCDRCLPG HWGLSLDLLGCRPCDCDVGGALDPQCDEGTGQCHCRQHMVGRRCEQVQPGYFRPFLDH LIWEAENTRGQVLDVVERLVTPGETPSWTGSGFVRLQEGQTLEFLVASVPNAMDYDLL LRLEPQVPEQWAELELIVQRPGPVPAHSLCGHLVPRDDRIQGTLQPHARYLIFPNPVC LEPGISYKLHLKLVRTGGSAQPETPYSGPGLLIDSLVLLPRVLVLEMFSGGDAAALER QATFERYQCHEEGLVPSKTSPSEACAPLLISLSTLIYNGALPCQCNPQGSLSSECNPH GGQCLCKPGVVGRRCDTCAPGYYGFGPTGCQACQCSPRGALSSLCERTSGQCLCRTGA FGLRCDACQRGQWGFPSCRPCVCNGHADECNTHTGACLGCRDLTGGEHCERCIAGFHG DPRLPYGAQCRPCPCPEGPGSQRHFATSCHQDEYSQQIVCHCRAGYTGLRCEACAPGQ FGDPSRPGGRCQLCECSGNIDPMDPDACDPHPGQCLRCLHHTEGPHCAHSKPGFHGQA ARQSCHRCTCNLLGTNPQQCPSPDQCHCDPSSGQCPCLPNVQALAVDRCAPNFWNLTS GHGCQPCACLPSPEEGPTCNEFTGQCHCLCGFGGRTCSECQELHWGDPGLQCHACDCD SRGIDTPQCHRFTGHCTCRPGVSGVRCDQCARGFSGIFPACHPCHACFGDWDRVVQDL AARTQRLEQRAQELQQTGVLGAFESSFWHMQEKLGIVQGIVGARNTSAASTAQLVEAT EELRREIGEATEHLTQLEADLTDVQDENFNANHALSGLERDRLALNLTLRQLDQHLDL LKHSNFLGAYDSIRHAHSQSAEAERRANTSALAVPSPVSNSASARHRTEALMDAQKED FNSKHMANQRALGKLSAHTHTLSLTDINELVCGAQGLHHDRTSPCGGAGCRDEDGQPR CGGLSCNGAAATADLALGRARHTQAELQRALAEGGSILSRVAETRRQASEAQQRAQAA LDKANASRGQVEQANQELQELIQSVKDFLNQEGADPDSIEMVATRVLELSIPASAEQI QHLAGAIAERVRSLADVDAILARTVGDVRRAEQLLQDARRARSWAEDEKQKAETVQAA LEEAQRAQGIAQGAIRGAVADTRDTEQTLYQVQERMAGAERALSSAGERARQLDALLE ALKLKRAGNSLAASTAEETAGSAQGRAQEAEQLLRGPLGDQYQTVKALAERKAQGVLA AQARAEQLPDEARDLLQAAQDKLQRLQELEGTYEENERALESKAAQLDGLEARMRSVL QAINLQVQIYNTCQ" BASE COUNT 1159 a 1690 c 1728 g 1106 t ORIGIN 1 ccgcccggtg ttgcgctcct tcccagaatc cgctccggcc tttccttcct gccgcgattc 61 ccaactttgc tcaaagtcgc cggactctaa gctgtcggag ggaccgctgg acagacctgg 121 gaactgacag agggcctgga gggaaatagg ccaaagaccc acaggatgga gctgacctca 181 accgaaagag ggaggggaca gcctctgccc tgggaacttc gactgcccct actgctaagc 241 gtgctggctg ccacactggc acaggcccct gccccggatg tccctggctg ttccagggga 301 agctgctacc ccgccacggc cgacctgctg gtgggccgag ctgacagact gactgcctca 361 tccacttgtg gcctgaatgg ccgccagccc tactgcatcg tcagtcacct gcaggacgaa 421 aagaagtgct tcctttgtga ctcccggcgc cccttctctg ctagagacaa cccacacacc 481 catcgcatcc agaatgtagt caccagcttt gcaccacagc ggcgggcagc ttggtggcag 541 tcacagaatg gtatccctgc ggtcaccatc cagctggacc tggaggctga gtttcatttc 601 acacacctca ttatgacctt caagacattt cgccctgctg ccatgctggt cgaacgctca 661 gcagactttg gccgcacctg gcatgtgtac cgatatttct cctatcactg tggggctgac 721 ttcccaggag tcccactagc acccccacgg cactgggatg atgtagtctg tgagtcccgc 781 tactcagaga ttgagccatc cactgaaggc gaggtcatct atcgtgtgct ggaccctgcc 841 atccctatcc cagaccccta cagctcacgg attcagaacc tgttgaagat caccaaccta 901 cgggtgaacc tgactcgtct acacacgttg ggagacaacc tactcgaccc acggagggag 961 atccgagaga agtactacta tgccctctat gagctggttg tacgtggcaa ctgcttctgc 1021 tacggacacg cctcagagtg tgcacccgcc ccaggggcac cagcccatgc tgagggcatg 1081 gtgcacggag cttgcatctg caaacacaac acacgtggcc tcaactgcga gcagtgtcag 1141 gatttctatc gtgacctgcc ctggcgtccg gctgaggacg gccatagtca tgcctgtagg 1201 aagtgtgatc ggcatgggca cacccacagc tgccacttcg acatggccgt atacctcgga 1261 tctggcaatg tgagtggagg tgtgtgtgat ggatgtcagc ataacacagc gtggcgccac 1321 tgtgagctct gtcggccctt cttctaccgt gacccaacca aggacctgcg ggatccggct 1381 gtgtgccgct cctgtgattg tgaccccatg ggttctcaag acggtggtcg ctgtgattcc 1441 catgatgacc ctgcactggg actggtctcc ggccagtgtc gctgcaaaga acacgtggtg 1501 ggcactcgct gccagcaatg ccgtgatggc ttctttgggc tcagcatcag tgacccgtct 1561 gggtgccggc gatgtcaatg taatgcacgg ggcacagtgc ctgggagcac tccttgtgac 1621 cccaacagtg gatcctgtta ctgcaaacgt ctagtgactg gacgtggatg tgaccgctgc 1681 ctgcctggcc actggggcct gagcctcgac ctgctcggct gccgcccctg tgactgcgac 1741 gtgggtggtg ctttggatcc ccagtgtgat gagggcacag gtcaatgcca ctgccgccag 1801 cacatggttg ggcgacgctg tgagcaggtg caacctggct acttccggcc cttcctggac 1861 cacctaattt gggaggctga gaacacccga gggcaggtgc tcgatgtggt ggagcgcctg 1921 gtgacccccg gggaaactcc atcctggact ggctcaggct tcgtgcgact acaggaaggt 1981 cagaccctgg agttcctggt ggcctctgtg ccgaacgcga tggactatga cctgctgctg 2041 cgcttagagc cccaggtccc tgagcaatgg gcagagttgg aactgattgt gcagcgtcca 2101 gggcctgtgc ctgcccacag cctgtgtggg catttggtgc ccagggatga tcgcatccaa 2161 gggactctgc aaccacatgc caggtacttg atatttccta atcctgtctg ccttgagcct 2221 ggtatctcct acaagctgca tctgaagctg gtacggacag ggggaagtgc ccagcctgag 2281 actccctact ctggacctgg cctgctcatt gactcgctgg tgctgctgcc ccgtgtcctg 2341 gtgctagaga tgtttagtgg gggtgatgct gctgccctgg agcgccaggc cacctttgaa 2401 cgctaccaat gccatgagga gggtctggtg cccagcaaga cttctccctc tgaggcctgc 2461 gcacccctcc tcatcagcct gtccaccctc atctacaatg gtgccctgcc atgtcagtgc 2521 aaccctcaag gttcactgag ttctgagtgc aaccctcatg gtggtcagtg cctgtgcaag 2581 cctggagtgg ttgggcgccg ctgtgacacg tgtgcccctg gctactatgg ctttggcccc 2641 acaggctgtc aagcctgcca gtgcagccca cgaggggcac tcagcagtct ctgtgaaagg 2701 accagtgggc aatgtctctg tcgaactggt gcctttgggc ttcgctgtga cgcctgccag 2761 cgtggccagt ggggattccc tagctgccgg ccatgtgtct gcaatgggca tgcagatgag 2821 tgcaacaccc acacaggcgc ttgcctgggc tgccgtgatc tcacaggggg tgagcactgt 2881 gaaaggtgca ttgctggttt ccacggggac ccacggctgc catatggggc gcagtgccgg 2941 ccctgtccct gtcctgaagg ccctgggagc caacggcact ttgctacttc ttgccaccag 3001 gatgaatatt cccagcagat tgtgtgccac tgccgggcag gctatacggg gctgcgatgt 3061 gaagcttgtg cccctgggca gtttggggac ccatcaaggc caggtggccg gtgccaactg 3121 tgtgagtgca gtgggaacat tgacccaatg gatcctgatg cctgtgaccc acaccccggg 3181 caatgcctgc gctgtttaca ccacacagag ggtccacact gtgcccactc gaagcctggc 3241 ttccatggcc aggctgcccg gcagagctgt caccgctgca catgcaacct gctgggcaca 3301 aatccgcagc agtgcccatc tcctgaccag tgccactgtg atccaagcag tgggcagtgc 3361 ccatgcctcc ccaatgtcca ggccctagct gtagaccgct gtgcccccaa cttctggaac 3421 ctcaccagtg gccatggttg ccagccttgt gcctgcctcc caagcccgga agaaggcccc 3481 acctgcaacg agttcacagg gcagtgccac tgcctgtgcg gctttggagg gcggacttgt 3541 tctgagtgcc aagagctcca ctggggagac cctgggttgc agtgccatgc ctgtgattgt 3601 gactctcgtg gaatagatac acctcagtgt caccgcttca caggtcactg cacgtgccgc 3661 ccaggggtgt ctggtgtgcg ctgtgaccag tgtgcccgtg gcttctcagg aatctttcct 3721 gcctgccatc cctgccatgc atgcttcggg gattgggacc gagtggtgca ggacttggca 3781 gcccgtacac agcgcctaga gcagcgggcg caggagttgc aacagacggg tgtgctgggt 3841 gcctttgaga gcagcttctg gcacatgcag gagaagctgg gcattgtgca gggcatcgta 3901 ggtgcccgca acacctcagc cgcctccact gcacagcttg tggaggccac agaggagctg 3961 cggcgtgaaa ttggggaggc cactgagcac ctgactcagc tcgaggcaga cctgacagat 4021 gtgcaagatg agaacttcaa tgccaaccat gcactaagtg gtctggagcg agataggctt 4081 gcacttaatc tcacactgcg gcagctcgac cagcatcttg acttgctcaa acattcaaac 4141 ttcctgggtg cctatgacag catccggcat gcccatagcc agtctgcaga ggcagaacgt 4201 cgtgccaata cctcagccct ggcagtacct agccctgtga gcaactcggc aagtgctcgg 4261 catcggacag aggcactgat ggatgctcag aaggaggact tcaacagcaa acacatggcc 4321 aaccagcggg cacttggcaa gctctctgcc catacccaca ccctgagcct gacagacata 4381 aatgagctgg tgtgtggggc ccagggattg catcatgatc gtacaagccc ttgtgggggt 4441 gccggctgtc gagatgagga tgggcagccg cgctgtgggg gcctcagctg caatggggca 4501 gcggctacag cagacctagc actgggccgg gcccggcaca cacaggcaga gctgcagcgg 4561 gcactggcag aaggtggtag catcctcagc agagtggctg agactcgtcg gcaggcaagc 4621 gaggcacagc agcgggccca ggcagccctg gacaaggcta atgcttccag gggacaggtg 4681 gaacaggcca accaggaact tcaagaactt atccagagtg tgaaggactt cctcaaccag 4741 gagggggctg atcctgatag cattgaaatg gtggccacac gggtgctaga gctctccatc 4801 ccagcttcag ctgagcagat ccagcacctg gcgggcgcga ttgcagagcg agtccggagc 4861 ctggcagatg tggatgcgat cctggcacgt actgtaggag atgtgcgtcg tgccgagcag 4921 ctactgcagg atgcacggcg ggcaaggagc tgggctgagg atgagaaaca gaaggcagag 4981 acagtacagg cagcactgga ggaggcccag cgggcacagg gtattgccca gggtgccatc 5041 cggggggcag tggctgacac acgggacaca gagcagaccc tgtaccaggt acaggagagg 5101 atggcaggtg cagagcgggc actgagctct gcaggtgaaa gggctcggca gttggatgct 5161 ctcctggagg ctctgaaatt gaaacgggca ggaaatagtc tggcagcctc tacagcagaa 5221 gaaacggcag gcagtgccca gggtcgtgcc caggaggctg agcagctgct acgcggtcct 5281 ctgggtgatc agtaccagac ggtgaaggcc ctagctgagc gcaaggccca aggtgtgctg 5341 gctgcacagg caagggcaga acaactgccg gatgaggctc gggacctgtt gcaagccgct 5401 caggacaagc tgcagcggct acaggaattg gaaggcacct atgaggaaaa tgagcgggca 5461 ctggagagta aggcagccca gttggacggg ttggaggcca ggatgcgcag cgtgcttcaa 5521 gccatcaact tgcaggtgca gatctacaac acctgccagt gacccctgcc caaggcctac 5581 cccagttcct agcactgccc cacatgcatg tctgcctatg cactgaagag ctcttggccc 5641 ggcagggccc ccaataaacc agtgtgaacc cccaaaaaaa aaa // LOCUS HSLAMB2T 5200 bp RNA PRI 27-MAR-1996 DEFINITION H.sapiens mRNA for laminin. ACCESSION Z15008 S47028 NID g34229 KEYWORDS laminin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5200) AUTHORS Kallunki,P., Sainio,K., Eddy,R., Byers,M., Kallunki,T., Sariola,H., Beck,K., Hirvonen,H., Shows,T.B. and Tryggvason,K. TITLE A truncated laminin chain homologous to the B2 chain: structure, spatial expression, and chromosomal assignment JOURNAL J. Cell Biol. 119 (3), 679-693 (1992) MEDLINE 93016279 REFERENCE 2 (bases 1 to 5200) AUTHORS Tryggvason,K. TITLE Direct Submission JOURNAL Submitted (27-AUG-1992) Tryggvason K., Biocenter and University of Oulu, Biochemistry, Linnanmaa, Oulu, Finland, SF-90570 FEATURES Location/Qualifiers source 1..5200 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="Fibrosarcoma" /cell_line="HT-1080" /chromosome="1" CDS 118..3699 /codon_start=1 /product="Laminin" /db_xref="PID:g34230" /translation="MPALWLGCCLCFSLLLPAARATSRREVCDCNGKSRQCIFDRELH RQTGNGFRCLNCNDNTDGIHCEKCKNGFYRHRERDRCLPCNCNSKGSLSARCDNSGRC SCKPGVTGARCDRCLPGFHMLTDAGCTQDQRLLDSKCDCDPAGIAGPCDAGRCVCKPA VTGERCDRCRSGYYNLDGGNPEGCTQCFCYGHSASCRSSAEYSVHKITSTFHQDVDGW KAVQRNGSPAKLQWSQRHQDVFSSAQRLDPVYFVAPAKFLGNQQVSYGQSLSFDYRVD RGGRHPSAHDVILEGAGLRITAPLMPLGKTLPCGLTKTYTFRLNEHPSNNWSPQLSYF EYRRLLRNLTALRIRATYGEYSTGYIDNVTLISARPVSGAPAPWVEQCICPVGYKGQF CQDCASGYKRDSARLGPFGTCIPCNCQGGGACDPDTGDCYSGDENPDIECADCPIGFY NDPHDPRSCKPCPCHNGFSCSVIPETEEVVCNNCPPGVTGARCELCADGYFGDPFGEH GPVRPCQPCQCNSNVDPSASGNCDRLTGRCLKCIHNTAGIYCDQCKAGYFGDPLAPNP ADKCRACNCNPMGSEPVGCRSDGTCVCKPGFGGPNCEHGAFSCPACYNQVKIQMDQFM QQLQRMEALISKAQGGDGVVPDTELEGRMQQAEQALQDILRDAQISEGASRSLGLQLA KVRSQENSYQSRLDDLKMTVERVRALGSQYQNRVRDTHRLITQMQLSLAESEASLGNT NIPASDHYVGPNGFKSLAQEATRLAESHVESASNMEQLTRETEDYSKQALSLVRKALH EGVGSGSGSPDGAVVQGLVEKLEKTKSLAQQLTREATQAEIEADRSYQHSLRLLDSVS PLQGVSDQSFQVEEAKRIKQKADSLSSLVTRHMDEFKRTQKNLGNWKEEAQQLLQNGK SGREKSDQLLSRANLAKSRAQEALSMGNATFYEVESILKNLREFDLQVDNRKAEAEEA MKRLSYISQKVSDASDKTQQAERALGSAAADAQRAKNGAGEALEISSEIEQEIGSLNL EANVTADGALAMEKGLASLKSEMREVEGELERKELEFDTNMDAVQMVITEAQKVDTRA KNAGVTIQDTLNTLDGLLHLMDQPLSVDEEGLVLLEQKLSRAKTQINSQLRPMMSELE ERARQQRGHLHLLETSIDGILADVKNLENIRDNLPPGCYNTQALEQQ" sig_peptide 118..183 polyA_site 4433 polyA_site 5195 BASE COUNT 1364 a 1236 c 1392 g 1208 t ORIGIN 1 gaccacctga tcgaaggaaa aggaaggcac agcggagcgc agagtgagaa ccaccaaccg 61 aggcgccggg cagcgacccc tgcagcggag acagagactg agcggcccgg caccgccatg 121 cctgcgctct ggctgggctg ctgcctctgc ttctcgctcc tcctgcccgc agcccgggcc 181 acctccagga gggaagtctg tgattgcaat gggaagtcca ggcagtgtat ctttgatcgg 241 gaacttcaca gacaaactgg taatggattc cgctgcctca actgcaatga caacactgat 301 ggcattcact gcgagaagtg caagaatggc ttttaccggc acagagaaag ggaccgctgt 361 ttgccctgca attgtaactc caaaggttct cttagtgctc gatgtgacaa ctctggacgg 421 tgcagctgta aaccaggtgt gacaggagcc agatgcgacc gatgtctgcc aggcttccac 481 atgctcacgg atgcggggtg cacccaagac cagagactgc tagactccaa gtgtgactgt 541 gacccagctg gcatcgcagg gccctgtgac gcgggccgct gtgtctgcaa gccagctgtt 601 actggagaac gctgtgatag gtgtcgatca ggttactata atctggatgg ggggaaccct 661 gagggctgta cccagtgttt ctgctatggg cattcagcca gctgccgcag ctctgcagaa 721 tacagtgtcc ataagatcac ctctaccttt catcaagatg ttgatggctg gaaggctgtc 781 caacgaaatg ggtctcctgc aaagctccaa tggtcacagc gccatcaaga tgtgtttagc 841 tcagcccaac gactagatcc tgtctatttt gtggctcctg ccaaatttct tgggaatcaa 901 caggtgagct atgggcaaag cctgtccttt gactaccgtg tggacagagg aggcagacac 961 ccatctgccc atgatgtgat cctggaaggt gctggtctac ggatcacagc tcccttgatg 1021 ccacttggca agacactgcc ttgtgggctc accaagactt acacattcag gttaaatgag 1081 catccaagca ataattggag cccccagctg agttactttg agtatcgaag gttactgcgg 1141 aatctcacag ccctccgcat ccgagctaca tatggagaat acagtactgg gtacattgac 1201 aatgtgaccc tgatttcagc ccgccctgtc tctggagccc cagcaccctg ggttgaacag 1261 tgtatatgtc ctgttgggta caaggggcaa ttctgccagg attgtgcttc tggctacaag 1321 agagattcag cgagactggg gccttttggc acctgtattc cttgtaactg tcaaggggga 1381 ggggcctgtg atccagacac aggagattgt tattcagggg atgagaatcc tgacattgag 1441 tgtgctgact gcccaattgg tttctacaac gatccgcacg acccccgcag ctgcaagcca 1501 tgtccctgtc ataacgggtt cagctgctca gtgattccgg agacggagga ggtggtgtgc 1561 aataactgcc ctcccggggt caccggtgcc cgctgtgagc tctgtgctga tggctacttt 1621 ggggacccct ttggtgaaca tggcccagtg aggccttgtc agccctgtca atgcaacagc 1681 aatgtggacc ccagtgcctc tgggaattgt gaccggctga caggcaggtg tttgaagtgt 1741 atccacaaca cagccggcat ctactgcgac cagtgcaaag caggctactt cggggaccca 1801 ttggctccca acccagcaga caagtgtcga gcttgcaact gtaaccccat gggctcagag 1861 cctgtaggat gtcgaagtga tggcacctgt gtttgcaagc caggatttgg tggccccaac 1921 tgtgagcatg gagcattcag ctgtccagct tgctataatc aagtgaagat tcagatggat 1981 cagtttatgc agcagcttca gagaatggag gccctgattt caaaggctca gggtggtgat 2041 ggagtagtac ctgatacaga gctggaaggc aggatgcagc aggctgagca ggcccttcag 2101 gacattctga gagatgccca gatttcagaa ggtgctagca gatcccttgg tctccagttg 2161 gccaaggtga ggagccaaga gaacagctac cagagccgcc tggatgacct caagatgact 2221 gtggaaagag ttcgggctct gggaagtcag taccagaacc gagttcggga tactcacagg 2281 ctcatcactc agatgcagct gagcctggca gaaagtgaag cttccttggg aaacactaac 2341 attcctgcct cagaccacta cgtggggcca aatggcttta aaagtctggc tcaggaggcc 2401 acaagattag cagaaagcca cgttgagtca gccagtaaca tggagcaact gacaagggaa 2461 actgaggact attccaaaca agccctctca ctggtgcgca aggccctgca tgaaggagtc 2521 ggaagcggaa gcggtagccc ggacggtgct gtggtgcaag ggcttgtgga aaaattggag 2581 aaaaccaagt ccctggccca gcagttgaca agggaggcca ctcaagcgga aattgaagca 2641 gataggtctt atcagcacag tctccgcctc ctggattcag tgtctccgct tcagggagtc 2701 agtgatcagt cctttcaggt ggaagaagca aagaggatca aacaaaaagc ggattcactc 2761 tcaagcctgg taaccaggca tatggatgag ttcaagcgta cacaaaagaa tctgggaaac 2821 tggaaagaag aagcacagca gctcttacag aatggaaaaa gtgggagaga gaaatcagat 2881 cagctgcttt cccgtgccaa tcttgctaaa agcagagcac aagaagcact gagtatgggc 2941 aatgccactt tttatgaagt tgagagcatc cttaaaaacc tcagagagtt tgacctgcag 3001 gtggacaaca gaaaagcaga agctgaagaa gccatgaaga gactctccta catcagccag 3061 aaggtttcag atgccagtga caagacccag caagcagaaa gagccctggg gagcgctgct 3121 gctgatgcac agagggcaaa gaatggggcc ggggaggccc tggaaatctc cagtgagatt 3181 gaacaggaga ttgggagtct gaacttggaa gccaatgtga cagcagatgg agccttggcc 3241 atggaaaagg gactggcctc tctgaagagt gagatgaggg aagtggaagg agagctggaa 3301 aggaaggagc tggagtttga cacgaatatg gatgcagtac agatggtgat tacagaagcc 3361 cagaaggttg ataccagagc caagaacgct ggggttacaa tccaagacac actcaacaca 3421 ttagacggcc tcctgcatct gatggaccag cctctcagtg tagatgaaga ggggctggtc 3481 ttactggagc agaagctttc ccgagccaag acccagatca acagccaact gcggcccatg 3541 atgtcagagc tggaagagag ggcacgtcag cagaggggcc acctccattt gctggagaca 3601 agcatagatg ggattctggc tgatgtgaag aacttggaga acattaggga caacctgccc 3661 ccaggctgct acaataccca ggctcttgag caacagtgaa gctgccataa atatttctca 3721 actgaggttc ttgggataca gatctcaggg ctcgggagcc atgtcatgtg agtgggtggg 3781 atggggacat ttgaacatgt ttaatgggta tgctcaggtc aactgacctg accccattcc 3841 tgatcccatg gccaggtggt tgtcttattg caccatactc cttgcttcct gatgctgggc 3901 atgaggcaga taggcactgg tgtgagaatg atcaaggatc tggaccccaa agatagactg 3961 gatggaaaga caaactgcac aggcagatgt ttgcctcata atagtcgtaa gtggagtcct 4021 ggaatttgga caagtgctgt tgggatatag tcaacttatt ctttgagtaa tgtgactaaa 4081 ggaaaaaact ttgactttgc ccaggcatga aattcttcct aatgtcagaa cagagtgcaa 4141 cccagtcaca ctgtggccag taaaatacta ttgcctcata ttgtcctctg caagcttctt 4201 gctgatcaga gttcctccta cttacaaccc agggtgtgaa catgttctcc attttcaagc 4261 tggaagaagt gagcagtgtt ggagtgagga cctgtaaggc aggcccattc agagctatgg 4321 tgcttgctgg tgcctgccac cttcaagttc tggacctggg catgacatcc tttcttttaa 4381 tgatgccatg gcaacttaga gattgcattt ttattaaagc atttcctacc agcaaagcaa 4441 atgttgggaa agtatttact ttttcggttt caaagtgata gaaaagtgtg gcttgggcat 4501 tgaaagaggt aaaattctct agatttatta gtcctaattc aatcctactt ttcgaacacc 4561 aaaaatgatg cgcatcaatg tattttatct tattttctca atctcctctc tctttcctcc 4621 acccataata agagaatgtt cctactcaca cttcagctgg gtcacatcca tccctccatt 4681 catccttcca tccatctttc catccattac ctccatccat ccttccaaca tatatttatt 4741 gagtacctac tgtgtgccag gggctggtgg gacagtggtg acatagtctc tgccctcata 4801 gagttgattg tctagtgagg aagacaagca tttttaaaaa ataaatttaa acttacaaac 4861 tttgtttgtc acaagtggtg tttattgcaa taaccgcttg gtttgcaacc tctttgctca 4921 acagaacata tgttgcaaga ccctcccatg ggcactgagt ttggcaagga tgacagagct 4981 ctgggttgtg cacatttctt tgcattccag cgtcactctg tgccttctac aactgattgc 5041 aacagactgt tgagttatga taacaccagt gggaattgct ggaggaacca gaggcacttc 5101 caccttggct gggaagacta tggtgctgcc ttgcttctgt atttccttgg attttcctga 5161 aagtgttttt aaataaagaa caattgttag atgccaaaaa // LOCUS HSLAPR 2097 bp RNA PRI 22-MAR-1995 DEFINITION Human mRNA for lysosomal acid phosphatase (EC 3.1.3.2). ACCESSION X12548 NID g34262 KEYWORDS acid phosphatase; glycoprotein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2097) AUTHORS Pohlmann,R., Krentler,C., Schmidt,B., Schroeder,W., Lorkowski,G., Cully,J., Mersmann,G., Geier,C., Waheed,A., Gottschalk,S., Grzeschik,K.H., Hasilik,A. and von Figura,K. TITLE Human lysosomal acid phosphatase: cloning, expression and chromosomal assignment JOURNAL EMBO J. 7 (8), 2343-2350 (1988) MEDLINE 89052645 COMMENT Data kindly reviewed (5 June 1989) by Von Figura K. FEATURES Location/Qualifiers source 1..2097 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta." /clone_lib="lambda gt11" /clone="lambda CT29" sig_peptide 13..102 /note="signal peptide (AA -30 to -1)" CDS 13..1284 /note="acid phosphatase precursor protein" /codon_start=1 /db_xref="PID:g34263" /db_xref="SWISS-PROT:P11117" /translation="MAGKRSGWSRAALLQLLLGVNLVVMPPTRARSLRFVTLLYRHGD RSPVKTYPKDPYQEEEWPQGFGQLTKEGMLQHWELGQALRQRYHGFLNTSYHRQEVYV RSTDFDRTLMSAEANLAGLFPPNGMQRFNPNISWQPIPVHTVPITEDRLLKFPLGPCP RYEQLQNETRQTPEYQNESSRNAQFLDMVANETGLTDLTLETVWNVYDTLFCEQTHGL RLPPWASPQTMQRLSRLKDFSFRFLFGIYQQAEKARLQGGVLLAQIRKNLTLMATTSQ LPKLLVYSAHDTTLVALQMALDVYNGEQAPYASCHIFELYQEDSGNFSVEMYFRNESD KAPWPLSLPGCPHRCPLQDFLRLTEPVVPKDWQQECQLASGPADTEVIVALAVCGSIL FLLIVLLLTVLFRMQAQPPGYRHVADGEDHA" mat_peptide 103..1281 /note="mature acid phosphatase (AA 1-393)" BASE COUNT 443 a 632 c 574 g 448 t ORIGIN 1 attacaacgg tgatggcggg caagcggtcc ggctggagcc gggcggctct cctccagctc 61 cttctcggcg tgaacctggt ggtgatgccg cccacccggg cccggagtct gcgcttcgtt 121 accttgctgt accgccatgg agaccgttca ccagtgaaga catatcccaa ggacccctat 181 caggaagaag aatggcccca ggggtttggt cagttaacca aggaggggat gctacagcac 241 tgggaactgg gccaggccct gcggcagcgc tatcacggct tcctaaacac ctcttatcac 301 cggcaagagg tttatgtgcg aagcacagac tttgaccgga ctctcatgag tgctgaggcc 361 aacctggctg gactcttccc tcccaacggg atgcagcgct tcaacccgaa catctcgtgg 421 cagcctattc ctgtgcacac tgtgcccatc actgaggaca ggctgctgaa gttcccgttg 481 ggcccatgtc cccgttatga gcagctgcag aacgagaccc ggcagacacc agagtatcag 541 aatgagagtt ctcggaatgc acaatttctg gacatggtgg ccaacgagac agggcttaca 601 gacctgacac tggagaccgt ctggaatgtc tatgacacac tcttctgtga gcaaacgcac 661 gggctgcgcc tgccgccctg ggcctcaccc caaaccatgc agcgtctcag ccggctaaag 721 gacttcagct tccgcttcct cttcggaatc taccagcagg cggagaaggc ccggcttcag 781 gggggagtcc tgctggctca gataaggaag aacctgaccc taatggcgac cacctcccag 841 ctccccaagc tgctggttta ctctgcgcac gacactaccc tggttgccct gcaaatggca 901 ctggatgtct acaatggtga acaagccccc tacgcctcct gccacatatt tgaactgtac 961 caggaagatt ctgggaattt ctcagtggag atgtactttc ggaacgagag tgacaaggcc 1021 ccctggccgc tcagcctgcc tggctgccct caccgctgcc cactgcagga cttccttcgc 1081 ctcacagagc ccgtcgtgcc caaggattgg cagcaggagt gccagctggc aagcggtcct 1141 gcagacacag aggtgattgt ggccttggct gtatgtggct ccatcctctt cctcctcata 1201 gtgctgctcc tcaccgtcct cttccggatg caggcccagc ctcctggcta ccgccacgtc 1261 gcagatgggg aggaccacgc ctgacaacca ctcagccccc ttccctccac ctcctagggg 1321 aggtgggctg ggccctcgct cctgactgtt gctgctcccc agcccatgga caggagatcc 1381 tgggttgggc ctccctctga tgaccccagc cagatgagcg agtggggctc agcgtggccc 1441 atggtgcctg tcactcagca ttcccatgcc tgatgtttac caagtgctgt gttggacact 1501 ggctttctcc aaacaggatt tgcctcctcc acgctcccta cacacctgag atgtaaactg 1561 gcagtcagtg ttcactcagg acctaggatt agaaaatggc agagttggtg ctggatccac 1621 cttgcacttc tatcaagccc tgttcttttt cctccagcct gaagtcttcg gcaaatagct 1681 cagagggaca cggtcttgcc tctcagtgct tattttagtg ggaaaaacag ctaataccag 1741 gggtacaaac attggctccc aaggaactgg atcacccaac agccagccag ccacatttcc 1801 ctgtgtctgg ctagagccac cattagactc agacagaatg cttcaggaat cgttgtcacc 1861 ccttcaactg gagcaggacg gaaggttgtc tgtacttggg agggagtggg gagtggtggg 1921 aagggagtcg cttgtcacac gaatcaggaa actgctctcc ctcagctggg ctggggtctc 1981 cagggacctg agctacatgc aggttgtgag ctggaaaaga aagttctaga ctgtggccca 2041 gaatagggct ggggcagctc ccagagaatg aatagtgctg tttcctattg gactgat // LOCUS HSLARR 7702 bp RNA PRI 19-SEP-1995 DEFINITION Human mRNA for LCA-homolog. LAR protein (leukocyte antigen related). ACCESSION Y00815 NID g34266 KEYWORDS antigen; cell surface glycoprotein; glycoprotein; immunoglobulin superfamily; LAR gene; leukocyte common antigen; neural cell adhesion molecule; transmembrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7702) AUTHORS Saito,H. TITLE Direct Submission JOURNAL Submitted (15-SEP-1988) Saito H., Dana-Farber Cancer Institute, 44 Binney Street, Boston, MA 02115 REFERENCE 2 (bases 1 to 7702) AUTHORS Streuli,M., Krueger,N.X., Hall,L.R., Schlossman,S.F. and Saito,H. TITLE A new member of the immunoglobulin superfamily that has a cytoplasmic region homologous to the leukocyte common antigen JOURNAL J. Exp. Med. 168 (5), 1523-1530 (1988) MEDLINE 89035978 REFERENCE 3 (bases 1 to 7702) AUTHORS Schaapveld,R.Q., van den Maagdenberg,A.M., Schepens,J.T., Weghuis,D.O., Geurts van Kessel,A., Wieringa,B. and Hendriks,W.J. TITLE The mouse gene Ptprf encoding the leukocyte common antigen-related molecule LAR: cloning, characterization, and chromosomal localization JOURNAL Genomics 27 (1), 124-130 (1995) MEDLINE 95394448 FEATURES Location/Qualifiers source 1..7702 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="tonsil" /cell_type="lymphocyte" /clone_lib="lambda gt11" sig_peptide 371..418 /note="put. signal peptide (AA -16 to -1)" CDS 371..6064 /codon_start=1 /product="put. LAR preprotein (AA -16 to 1881)" /db_xref="PID:g34267" /db_xref="SWISS-PROT:P10586" /translation="MVPLVPALVMLGLVAGAHGDSKPVFIKVPEDQTGLSGGVASFVC QATGEPKPRITWMKKGKKVSSQRFEVIEFDDGAGSVLRIQPLRVQRDEAIYECTATNS LGEINTSAKLSVLEEEQLPPGFPSIDMGPQLKVVEKARTATMLCAAGGNPDPEISWFK DFLPVDPATSNGRIKQLRSGALQIESSEESDQGKYECVATNSAGTRYSAPANLYVRVR RVAPRFSIPPSSQEVMPGGSVNLTCVAVGAPMPYVKWMMGAEELTKEDEMPVGRNVLE LSNVVRSANYTCVAISSLGMIEATAQVTVKALPKPPIDLVVTETTATSVTLTWDSGNS EPVTYYGIQYRAAGTEGPFQEVDGVATTRYSIGGLSPFSEYAFRVLAVNSIGRGPPSE AVRARTGEQAPSSPPRRVQARMLSASTMLVQWEPPEEPNGLVRGYRVYYTPDSRRPPN AWHKHNTDAGLLTTVGSLLPGITYSLRVLAFTAVGDGPPSPTIQVKTQQGVPAQPADF QAEVESDTRIQLSWLLPPQERIIMYELVYWAAEDEDQQHKVTFDPTSSYTLEDLKPDT LYRFQLAARSDMGVGVFTPTIEARTAQSTPSAPPQKVMCVSMGSTTVRVSWVPPPADS RNGVITQYSVAHEAVDGEDRGRHVVDGISREHSSWDLVGLEKWTEYRVWVRAHTDVGP GPESSPVLVRTDEDVPSGPPRKVEVEPLNSTAVHVYWKLPVPSKQHGQIRGYQVTYVR LENGEPRGLPIIQDVMLAEAQWRPEESEDYETTISGLTPETTYSVTVAAYTTKGDGAR SKPKIVTTTGAVPGRPTMMISTTAMNTALLQWHPPKELPGELLGYRLQYCRADEARPN TIDFGKDDQHFTVTGLHKGTTYIFRLAAKNRAGLGEEFEKEIRTPEDLPSGFPQNLHV TGLTTSTTELAWDPPVLAERNGRIISYTVVFRDINSQQELQNITTDTRFTLTGLKPDT TYDIKVRAWTSKGSGPLSPSIQSRTMPVEQVFAKNFRVAAAMKTSVLLSWEVPDSYKS AVPFKILYNGQSVEVDGHSMRKLIADLQPNTEYSFVLMNRGSSAGGLQHLVSIRTAPD LLPHKPLPASAYIEDGRFDLSMPHVQDPSLVRWFYIVVVPIDRVGGSMLTPRWSTPEE LELDELLEAIEQGGEEQRRRRRQAERLKPYVAAQLDVLPETFTLGDKKNYRGFYNRPL SPDLSYQCFVLASLKEPMDQKRYASSPYSDEIVVQVTPAQQQEEPEMLWVTGPVLAVI LIILIVIAILLFKRKRTHSPSSKDEQSIGLKDSLLAHSSDPVEMRRLNYQTPGMRDHP PIPITDLADNIERLKANDGLKFSQEYESIDPGQQFTWENSNLEVNKPKNRYANVIAYD HSRVILTSIDGVPGSDYINANYIDGYRKQNAYIATQGPLPETMGDFWRMVWEQRTATV VMMTRLEEKSRVKCDQYWPARGTETCGLIQVTLLDTVELATYTVRTFALHKSGSSEKR ELRQFQFMAWPDHGVPEYPTPILAFLRRVKACNPLDAGPMVVHCSAGVGRTGCFIVID AMLERMKHEKTVDIYGHVTCMRSQRNYMVQTEDQYVFIHEALLEAATCGHTEVPARNL YAHIQKLGQVPPGESVTAMELEFKLLASSKAHTSRFISANLPCNKFKNRLVNIMPYEL TRVCLQPIRGVEGSDYINASFLDGYRQQKAYIATQGPLAESTEDFWRMLWEHNSTIIV MLTKLREMGREKCHQYWPAERSARYQYFVVDPMAEYNMPQYILREFKVTDARDGQSRT IRQFQFTDWPEQGVPKTGEGFIDFIGQVHKTKEQFGQDGPITVHCSAGVGRTGVFITL SIVLERMRYEGVVDMFQTVKTLRTQRPAMVQTEDQYQLCYRAALEYLGSFDHYAT" mat_peptide 419..6061 /note="put. LAR protein (AA 1 - 1881)" misc_feature 419..4120 /note="put. extracellular domain" misc_feature 4121..4192 /note="put. transmembrane domain" misc_feature 4193..6061 /note="put. cytoplasmic domain" BASE COUNT 1636 a 2286 c 2292 g 1488 t ORIGIN 1 cgggagcggc gggagcggtg gcggcggcag aggcggcggc tccagcttcg gctccggctc 61 gggctcgggc tccggctccg gctccggctc cggctccagc tcgggtggcg gtggcgggag 121 cgggaccagg tggaggcggc ggcggcagag gagtgggagc agcggcccta gcggcttgcg 181 gggggacatg cggaccgacg gcccctggat aggcggaagg agtggaggcc ctggtgcccg 241 gcccttggtg ctgagtatcc agcaagagtg accggggtga agaagcaaag actcggttga 301 ttgtcctggg ctgtggctgg ctgtggagct agagccctgg atggcccctg agccagcccc 361 agggaggacg atggtgcccc ttgtgcctgc actggtgatg cttggtttgg tggcaggcgc 421 ccatggtgac agcaaacctg tcttcattaa agtccctgag gaccagactg ggctgtcagg 481 aggggtagcc tccttcgtgt gccaagctac aggagaaccc aagccgcgca tcacatggat 541 gaagaagggg aagaaagtca gctcccagcg cttcgaggtc attgagtttg atgatggggc 601 agggtcagtg cttcggatcc agccattgcg ggtgcagcga gatgaagcca tctatgagtg 661 tacagctact aacagcctgg gtgagatcaa cactagtgcc aagctctcag tgctcgaaga 721 ggaacagctg ccccctgggt tcccttccat cgacatgggg cctcagctga aggtggtgga 781 gaaggcacgc acagccacca tgctatgtgc cgcaggcgga aatccagacc ctgagatttc 841 ttggttcaag gacttccttc ctgtagaccc tgccacgagc aacggccgca tcaagcagct 901 gcgttcaggt gccttgcaga tagagagcag tgaggaatcc gaccaaggca agtacgagtg 961 tgtggcgacc aactcggcag gcacacgtta ctcagcccct gcgaacctgt atgtgcgagt 1021 gcgccgcgtg gctcctcgtt tctccatccc tcccagcagc caggaggtga tgccaggcgg 1081 cagcgtgaac ctgacatgcg tggcagtggg tgcacccatg ccctacgtga agtggatgat 1141 gggggccgag gagctcacca aggaggatga gatgccagtt ggccgcaacg tcctggagct 1201 cagcaatgtc gtacgctctg ccaactacac ctgtgtggcc atctcctcgc tgggcatgat 1261 cgaggccaca gcccaggtca cagtgaaagc tcttccaaag cctccgattg atcttgtggt 1321 gacagagaca actgccacca gtgtcaccct cacctgggac tctgggaact cggagcctgt 1381 aacctactat ggcatccagt accgcgcagc gggcacggag ggcccctttc aggaggtgga 1441 tggtgtggcc accacccgct acagcattgg cggcctcagc cctttctcgg aatatgcctt 1501 ccgcgtgctg gcggtgaaca gcatcgggcg agggccgccc agcgaggcag tgcgggcacg 1561 cacgggagaa caggcgccct ccagcccacc gcgccgcgtg caggcacgca tgctgagcgc 1621 cagcaccatg ctggtgcagt gggagcctcc cgaggagccc aacggcctgg tgcggggata 1681 ccgcgtctac tatactccgg actcccgccg ccccccgaac gcctggcaca agcacaacac 1741 cgacgcgggg ctcctcacga ccgtgggcag cctgctgcct ggcatcacct acagcctgcg 1801 cgtgcttgcc ttcaccgccg tgggcgatgg ccctcccagc cccaccatcc aggtcaagac 1861 gcagcaggga gtgcctgccc agcccgcgga cttccaggcc gaggtggagt cggacaccag 1921 gatccagctc tcgtggctgc tgccccctca ggagcggatc atcatgtatg aactggtgta 1981 ctgggcggca gaggacgaag accaacagca caaggtcacc ttcgacccaa cctcctccta 2041 cacactagag gacctgaagc ctgacacact ctaccgcttc cagctggctg cacgctcgga 2101 tatgggggtg ggcgtcttca cccccaccat tgaggcccgc acagcccagt ccaccccctc 2161 cgcccctccc cagaaggtga tgtgtgtgag catgggctcc accacggtcc gggtaagttg 2221 ggtcccgccg cctgccgaca gccgcaacgg cgttatcacc cagtactccg tggcccacga 2281 ggcggtggac ggcgaggacc gcgggcggca tgtggtggat ggcatcagcc gtgagcactc 2341 cagctgggac ctggtgggcc tggagaagtg gacggagtac cgggtgtggg tgcgggcaca 2401 cacagacgtg ggccccggcc ccgagagcag cccggtgctg gtgcgcaccg atgaggacgt 2461 gcccagcggg cctccgcgga aggtggaggt ggagccactg aactccactg ctgtgcatgt 2521 ctactggaag ctgcctgtcc ccagcaagca gcatggccag atccgcggct accaggtcac 2581 ctacgtgcgg ctggagaatg gcgagccccg tggactcccc atcatccaag acgtcatgct 2641 agccgaggcc cagtggcggc cagaggagtc cgaggactat gaaaccacta tcagcggcct 2701 gaccccggag accacctact ccgttactgt tgctgcctat accaccaagg gggatggtgc 2761 ccgcagcaag cccaaaattg tcactacaac aggtgcagtc ccaggccggc ccaccatgat 2821 gatcagcacc acggccatga acactgcgct gctccagtgg cacccaccca aggaactgcc 2881 tggcgagctg ctgggctacc ggctgcagta ctgccgggcc gacgaggcgc ggcccaacac 2941 catagatttc ggcaaggatg accagcactt cacagtcacc ggcctgcaca aggggaccac 3001 ctacatcttc cggcttgctg ccaagaaccg ggctggcttg ggtgaggagt tcgagaagga 3061 gatcaggacc cccgaggacc tgcccagcgg cttcccccaa aacctgcatg tgacaggact 3121 gaccacgtct accacagaac tggcctggga cccgccagtg ctggcggaga ggaacgggcg 3181 catcatcagc tacaccgtgg tgttccgaga catcaacagc caacaggagc tgcagaacat 3241 cacgacagac acccgcttta cccttactgg cctcaagcca gacaccactt acgacatcaa 3301 ggtccgcgca tggaccagca aaggctctgg cccactcagc cccagcatcc agtcccggac 3361 catgccggtg gagcaagtgt ttgccaagaa cttccgggtg gcggctgcaa tgaagacgtc 3421 tgtgctgctc agctgggagg ttcccgactc ctataagtca gctgtgccct ttaagattct 3481 gtacaatggg cagagtgtgg aggtggacgg gcactcgatg cggaagctga tcgcagacct 3541 gcagcccaac acagagtact cgtttgtgct gatgaaccgt ggcagcagcg cagggggcct 3601 gcagcacctg gtgtccatcc gcacagcccc cgacctcctg cctcacaagc cgctgcctgc 3661 ctctgcctac atagaggacg gccgcttcga tctctccatg ccccatgtgc aagacccctc 3721 gcttgtcagg tggttctaca ttgttgtggt acccattgac cgtgtgggcg ggagcatgct 3781 gacgccaagg tggagcacac ccgaggaact ggagctggac gagcttctag aagccatcga 3841 gcaaggcgga gaggagcagc ggcggcggcg gcggcaggca gaacgtctga agccatatgt 3901 ggctgctcaa ctggatgtgc tcccggagac ctttaccttg ggggacaaga agaactaccg 3961 gggcttctac aaccggcccc tgtctccgga cttgagctac cagtgctttg tgcttgcctc 4021 cttgaaggaa cccatggacc agaagcgcta tgcctccagc ccctactcgg atgagatcgt 4081 ggtccaggtg acaccagccc agcagcagga ggagccggag atgctgtggg tgacgggtcc 4141 cgtgctggca gtcatcctca tcatcctcat tgtcatcgcc atcctcttgt tcaaaaggaa 4201 aaggacccac tctccgtcct ctaaggatga gcagtcgatc ggactgaagg actccttgct 4261 ggcccactcc tctgaccctg tggagatgcg gaggctcaac taccagaccc caggtatgcg 4321 agaccaccca cccatcccca tcaccgacct ggcggacaac atcgagcgcc tcaaagccaa 4381 cgatggcctc aagttctccc aggagtatga gtccatcgac cctggacagc agttcacgtg 4441 ggagaattca aacctggagg tgaacaagcc caagaaccgc tatgcgaatg tcatcgccta 4501 cgaccactct cgagtcatcc ttacctctat cgatggcgtc cccgggagtg actacatcaa 4561 tgccaactac atcgatggct accgcaagca gaatgcctac atcgccacgc agggccccct 4621 gcccgagacc atgggcgatt tctggagaat ggtgtgggaa cagcgcacgg ccactgtggt 4681 catgatgaca cggctggagg agaagtcccg ggtaaaatgt gatcagtact ggccagcccg 4741 tggcaccgag acctgtggcc ttattcaggt gaccctgttg gacacagtgg agctggccac 4801 atacactgtg cgcaccttcg cactccacaa gagtggctcc agtgagaagc gtgagctgcg 4861 tcagtttcag ttcatggcct ggccagacca tggagttcct gagtacccaa ctcccatcct 4921 ggccttccta cgacgggtca aggcctgcaa ccccctagac gcagggccca tggtggtgca 4981 ctgcagcgcg ggcgtgggcc gcaccggctg cttcatcgtg attgatgcca tgttggagcg 5041 gatgaagcac gagaagacgg tggacatcta tggccacgtg acctgcatgc gatcacagag 5101 gaactacatg gtgcagacgg aggaccagta cgtgttcatc catgaggcgc tgctggaggc 5161 tgccacgtgc ggccacacag aggtgcctgc ccgcaacctg tatgcccaca tccagaagct 5221 gggccaagtg cctccagggg agagtgtgac cgccatggag ctcgagttca agttgctggc 5281 cagctccaag gcccacacgt cccgcttcat cagcgccaac ctgccctgca acaagttcaa 5341 gaaccggctg gtgaacatca tgccctacga attgacccgt gtgtgtctgc agcccatccg 5401 tggtgtggag ggctctgact acatcaatgc cagcttcctg gatggttata gacagcagaa 5461 ggcctacata gctacacagg ggcctctggc agagagcacc gaggacttct ggcgcatgct 5521 atgggagcac aattccacca tcatcgtcat gctgaccaag cttcgggaga tgggcaggga 5581 gaaatgccac cagtactggc cagcagagcg ctctgctcgc taccagtact ttgttgttga 5641 cccgatggct gagtacaaca tgccccagta tatcctgcgt gagttcaagg tcacggatgc 5701 ccgggatggg cagtcaagga caatccggca gttccagttc acagactggc cagagcaggg 5761 cgtgcccaag acaggcgagg gattcattga cttcatcggg caggtgcata agaccaagga 5821 gcagtttgga caggatgggc ctatcacggt gcactgcagt gctggcgtgg gccgcaccgg 5881 ggtgttcatc actctgagca tcgtcctgga gcgcatgcgc tatgagggcg tggtcgacat 5941 gtttcagacc gtgaagaccc tgcgtacaca gcgtcctgcc atggtgcaga cagaggacca 6001 gtatcagctg tgctaccgtg cggccctgga gtacctcggc agctttgacc actatgcaac 6061 gtaactaccg ctcccctctc ctccgccacc cccgccgtgg ggctccggag gggacccagc 6121 tcctctgagc cataccgacc atcgtccagc cctcctacgc agatgctgtc actggcagag 6181 cacagcccac ggggatcaca gcgtttcagg aacgttgcca caccaatcag agagcctaga 6241 acatccctgg gcaagtggat ggcccagcag gcaggcactg tggcccttct gtccaccaga 6301 cccacctgga gcccgcttca agctctctgt tgcgctcccg catttctcat gcttcttctc 6361 atggggtggg gttggggcaa agcctccttt ttaatacatt aagtggggta gactgaggga 6421 ttttagcctc ttccctctga tttttccttt cgcgaatccg tatctgcaga atgggccact 6481 gtaggggttg gggtttattt tgttttgttt tttttttttt tttgtatgac ttctgctgaa 6541 ggacagaaca ttgccttcct cgtgcagagc tggggctgcc agcctgagcg gaggctcggc 6601 cgtgggccgg gaggcagtgc tgatccggct gctcctccag cccttcagac gagatcctgt 6661 ttcagctaaa tgcagggaaa ctcaatgttt ttttaagttt tgttttccct ttaaagcctt 6721 tttttaggcc acattgacag tggtgggcgg ggagaagata gggaacactc atccctggtc 6781 gtctatccca gtgtgtgttt aacattcaca gcccagaacc acagatgtgt ctgggagagc 6841 ctggcaaggc attcctcatc accatcgtgt ttgcaaaggt taaaacaaaa acaaaaaacc 6901 acaaaaataa aaaacaaaaa aaacaaaaaa cccaaaaaaa aaaaaaaaaa gagtcagccc 6961 ttggcttctg cttcaaaccc tcaagagggg aagcaactcc gtgtgcctgg ggttcccgag 7021 ggagctgctg gctgacctgg gcccacagag cctggctttg gtccccagca ttgcagtatg 7081 gtgtggtgtt tgtaggctgt ggggtctggc tgtgtggcca aggtgaatag cacaggttag 7141 ggtgtgtgcc acaccccatg cacctcaggg ccaagcgggg gcgtggctgg cctttcaggt 7201 ccaggccagt gggcctggta gcacatgtct gtcctcagag caggggccag atgattttcc 7261 tccctggttt gcagctgttt tcaaagcccc cgataatcgc tcttttccac tccaagatgc 7321 cctcataaac caatgtggca agactactgg acttctatca atggtactct aatcagtcct 7381 tattatccca gcttgctgag gggcagggag agcgcctctt cctctgggca gcgctatcta 7441 gataggtaag tgggggcggg gaagggtgca tagctgtttt agctgaggga cgtggtgccg 7501 acgtccccaa acctagctag gctaagtcaa gatcaacatt ccagggttgg taatgttgga 7561 tgatgaaaca ttcattttta ccttgtggat gctagtgctg tagagttcac tgttgtacac 7621 agtctgtttt ctatttgtta agaaaaacta cagcatcatt gcataattct tgatggtaat 7681 aaatttgaat aatcagattt ct // LOCUS HSLASNA 3151 bp RNA PRI 01-FEB-1994 DEFINITION H.sapiens mRNA for lung amiloride sensitive Na+ channel protein. ACCESSION X76180 NID g452649 KEYWORDS Na+ channel; Na+ channel protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3151) AUTHORS Barbry,P. TITLE Direct Submission JOURNAL Submitted (19-NOV-1993) P. Barbry, CNRS, IPMC 660 Route des Lucioles, 06560 Sophia Antipolis, FRANCE REFERENCE 2 (bases 1 to 3151) AUTHORS Voilley,N., Lingueglia,E., Champigny,G., Mattei,M.G., Waldmann,R., Lazdunski,M. and Barbry,P. TITLE The lung amiloride-sensitive Na+ channel: biophysical properties, pharmacology, ontogenesis, and molecular cloning JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91 (1), 247-251 (1994) MEDLINE 94105144 FEATURES Location/Qualifiers source 1..3151 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="lung" /cell_type="epithelial" /dev_stage="adult" /chromosome="12" CDS 100..2109 /codon_start=1 /product="Na+ channel protein" /db_xref="PID:g452650" /db_xref="SWISS-PROT:P37088" /translation="MEGNKLEEQDSSPPQSTPGLMKGNKREEQGLGPEPAAPQQPTAE EEALIEFHRSYRELFEFFCNNTTIHGAIRLVCSQHNRMKTAFWAVLWLCTFGMMYWQF GLLFGEYFSYPVSLNINLNSDKLVFPAVTICTLNPYRYPEIKEELEELDRITEQTLFD LYKYSSFTTLVAGSRSRRDLRGTLPHPLQRLRVPPPPHGARRARSVASSLRDNNPQVD WKDWKIGFQLCNQNKSDCFYQTYSSGVDAVREWYRFHYINILSRLPETLPSLEEDTLG NFIFACRFNQVSCNQANYSHFHHPMYGNCYTFNDKNNSNLWMSSMPGINNGLSLMLRA EQNDFIPLLSTVTGARVMVHGQDEPAFMDDGGFNLRPGVETSISMRKETLDRLGGDYG DCTKNGSDVPVENLYPSKYTQQVCIHSCFQESMIKECGCAYIFYPRPQNVEYCDYRKH SSWGYCYYKLQVDFSSDHLGCFTKCRKPCSVTSYQLSAGYSRWPSVTSQEWVFQMLSR QNNYTVNNKRNGVAKVNIFFKELNYKTNSESPSVTMVTLLSNLGSQWSLWFGSSVLSV VEMAELVFDLLVIMFLMLLRRFRSRYWSPGRGGRGAQEVASTLASSPPSHFCPHPMSL SLSQPGPAPSPALTAPPPAYATLGPRPSPGGSAGASSSTCPLGGP" BASE COUNT 677 a 995 c 803 g 676 t ORIGIN 1 ccggccagcg ggcgggctcc ccagccaggc cgctgcacct gtcaggggaa caagctggag 61 gagcaggacc ctagacctct gcagcccata ccaggtctca tggaggggaa caagctggag 121 gagcaggact ctagccctcc acagtccact ccagggctca tgaaggggaa caagcgtgag 181 gagcaggggc tgggccccga acctgcggcg ccccagcagc ccacggcgga ggaggaggcc 241 ctgatcgagt tccaccgctc ctaccgagag ctcttcgagt tcttctgcaa caacaccacc 301 atccacggcg ccatccgcct ggtgtgctcc cagcacaacc gcatgaagac ggccttctgg 361 gcagtgctgt ggctctgcac ctttggcatg atgtactggc aattcggcct gcttttcgga 421 gagtacttca gctaccccgt cagcctcaac atcaacctca actcggacaa gctcgtcttc 481 cccgcagtga ccatctgcac cctcaatccc tacaggtacc cggaaattaa agaggagctg 541 gaggagctgg accgcatcac agagcagacg ctctttgacc tgtacaaata cagctccttc 601 accactctcg tggccggctc ccgcagccgt cgcgacctgc gggggactct gccgcacccc 661 ttgcagcgcc tgagggtccc gcccccgcct cacggggccc gtcgagcccg tagcgtggcc 721 tccagcttgc gggacaacaa cccccaggtg gactggaagg actggaagat cggcttccag 781 ctgtgcaacc agaacaaatc ggactgcttc taccagacat actcatcagg ggtggatgcg 841 gtgagggagt ggtaccgctt ccactacatc aacatcctgt cgaggctgcc agagactctg 901 ccatccctgg aggaggacac gctgggcaac ttcatcttcg cctgccgctt caaccaggtc 961 tcctgcaacc aggcgaatta ctctcacttc caccacccga tgtatggaaa ctgctatact 1021 ttcaatgaca agaacaactc caacctctgg atgtcttcca tgcctggaat caacaacggt 1081 ctgtccctga tgctgcgcgc agagcagaat gacttcattc ccctgctgtc cacagtgact 1141 ggggcccggg taatggtgca cgggcaggat gaacctgcct ttatggatga tggtggcttt 1201 aacttgcggc ctggcgtgga gacctccatc agcatgagga aggaaaccct ggacagactt 1261 gggggcgatt atggcgactg caccaagaat ggcagtgatg ttcctgttga gaacctttac 1321 ccttcaaagt acacacagca ggtgtgtatt cactcctgct tccaggagag catgatcaag 1381 gagtgtggct gtgcctacat cttctatccg cggccccaga acgtggagta ctgtgactac 1441 agaaagcaca gttcctgggg gtactgctac tataagctcc aggttgactt ctcctcagac 1501 cacctgggct gtttcaccaa gtgccggaag ccatgcagcg tgaccagcta ccagctctct 1561 gctggttact cacgatggcc ctcggtgaca tcccaggaat gggtcttcca gatgctatcg 1621 cgacagaaca attacaccgt caacaacaag agaaatggag tggccaaagt caacatcttc 1681 ttcaaggagc tgaactacaa aaccaattct gagtctccct ctgtcacgat ggtcaccctc 1741 ctgtccaacc tgggcagcca gtggagcctg tggttcggct cctcggtgtt gtctgtggtg 1801 gagatggctg agctcgtctt tgacctgctg gtcatcatgt tcctcatgct gctccgaagg 1861 ttccgaagcc gatactggtc tccaggccga gggggcaggg gtgctcagga ggtagcctcc 1921 accctggcat cctcccctcc ttcccacttc tgcccccacc ccatgtctct gtccttgtcc 1981 cagccaggcc ctgctccctc tccagccttg acagcccctc cccctgccta tgccaccctg 2041 ggcccccgcc catctccagg gggctctgca ggggccagtt cctccacctg tcctctgggg 2101 gggccctgag agggaaggag aggtttctca caccaaggca gatgctcctc tggtgggagg 2161 gtgctggccc tggcaagatt gaaggatgtg cagggcttcc tctcagagcc gcccaaactg 2221 ccgttgatgt gtggagggga agcaagatgg gtaagggctc aggaagttgc tccaagaaca 2281 gtagctgatg aagctgccca gaagtgcctt ggctccagcc ctgtacccct tggtactgcc 2341 tctgaacact ctggtttccc cacccaactg cggctaagtc tctttttccc ttggatcagc 2401 caagcgaaac ttggagcttt gacaaggaac tttcctaaga aaccgctgat aaccaggaca 2461 aaacacaacc aagggtacac gcaggcatgc acgggtttcc tgcccagcga cggcttaagc 2521 cagcccccga ctggcctggc cacactgctc tccagtagca cagatgtctg ctcctcctct 2581 tgaacttggg tgggaaaccc cacccaaaag ccccctttgt tacttaggca attccccttc 2641 cctgactccc gagggctagg gctagagcag acccgggtaa gtaaaggcag acccagggct 2701 cctctagcct catacccgtg ccctcacaga gccatgcccc ggcacctctg ccctgtgtct 2761 ttcatacctc tacatgtctg cttgagatat ttcctcagcc tgaaagtttc cccaaccatc 2821 tgccagagaa ctcctatgca tcccttagaa ccctgctcag acaccattac ttttgtgaac 2881 gcttctgcca catcttgtct tccccaaaat tgatcactcc gccttctcct gggctcccgt 2941 agcacactat aacatctgct ggagtgttgc tgttgcacca tactttcttg tacatttgtg 3001 tctcccttcc caactagact gtaagtgcct tgcggtcagg gactgaatct tgcccgttta 3061 tgtatgctcc atgtctagcc catcatcctg cttggagcaa gtaggcagga gctcaataaa 3121 tgtttgttgc atgaaaaaaa aaaaaaaaaa a // LOCUS HSLCA 4597 bp RNA PRI 23-MAR-1995 DEFINITION Human mRNA for T200 leukocyte common antigen (CD45, LC-A). ACCESSION Y00062 NID g34275 KEYWORDS alternative splicing; cell surface antigen; cell surface glycoprotein; leukocyte common antigen; phosphoprotein; T200 glycoprotein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4597) AUTHORS Trowbridge,I.S. TITLE Direct Submission JOURNAL Submitted (10-JUN-1987) Trowbridge I.S. The Salk Institute, P.O. Box 85800 San Diego, CA 92138-926 USA REFERENCE 2 (bases 1 to 4597) AUTHORS Ralph,S.J., Thomas,M.L., Morton,C.C. and Trowbridge,I.S. TITLE Structural variants of human T200 glycoprotein (leukocyte-common antigen) JOURNAL EMBO J. 6 (5), 1251-1257 (1987) MEDLINE 87275816 FEATURES Location/Qualifiers source 1..4597 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="IB4 (human B cell, human tonsil)" /clone_lib="pcD and lambda gt10" /clone="pHLC-1 and lambdaHLC-1" /map="long arm of chromosome 1" sig_peptide 147..215 /note="signal peptide (AA -23 to -1)" CDS 147..3578 /note="precursor polypeptide (AA -23 to 1120)" /codon_start=1 /db_xref="PID:g34276" /translation="MYLWLKLLAFGFAFLDTEVFVTGQSPTPSPTDAYLNASETTTLS PSGSAVISTTTIATTPSKPTCDEKYANITVDYLYNKETKLFTAKLNVNENVECGNNTC TNNEVHNLTECKNASVSISHNSCTAPDKTLILDVPPGVEKFQLHDCTQVEKADTTICL KWKNIETFTCDTQNITYRFQCGNMIFDNKEIKLENLEPEHEYKCDSEILYNNHKFTNA SKIIKTDFGSPGEPQIIFCRSEAAHQGVITWNPPQRSFHNFTLCYIKETEKDCLNLDK NLIKYDLQNLKPYTKYVLSLHAYIIAKVQRNGSAAMCHFTTKSAPPSQVWNMTVSMTS DNSMHVKCRPPRDRNGPHERYHLEVEAGNTLVRNESHKNCDFRVKDLQYSTDYTFKAY FHNGDYPGEPFILHHSTSYNSKALIAFLAFLIIVTSIALLVVLYKIYDLHKKRSCNLD EQQELVERDDEKQLMNVEPIHADILLETYKRKIADEGRLFLAEFQSIPRVFSKFPIKE ARKPFNQNKNRYVDILPYDYNRVELSEINGDAGSNYINASYIDGFKEPRKYIAAQGPR DETVDDFWRMIWEQKATVIVMVTRCEEGNRNKCAEYWPSMEEGTRAFGDVVVKINQHK RCPDYIIQKLNIVNKKEKATGREVTHIQFTSWPDHGVPEDPHLLLKLRRRVNAFSNFF SGPIVVHCSAGVGRTGTYIGIDAMLEGLEAENKVDVYGYVVKLRRQRCLMVQVEAQYI LIHQALVEYNQFGETEVNLSELHPYLHNMKKRDPPSEPSPLEAEFQRLPSYRSWRTQH IGNQEENKSKNRNSNVIPYDYNRVPLKHELEMSKESEHDSDESSDDDSDSEEPSKYIN ASFIMSYWKPEVMIAAQGPLKETIGDFWQMIFQRKVKVIVMLTELKHGDQEICAQYWG EGKQTYGDIEVDLKDTDKSSTYTLRVFELRHSKRKDSRTVYQYQYTNWSVEQLPAEPK ELISMIQVVKQKLPQKNSSEGNKHHKSTPLLIHCRDGSQQTGIFCALLNLLESAETEE VVDIFQVVKALRKARPGMVSTFEQYQFLYDVIASTYPAQNGQVKKNNHQEDKIEFDNE VDKVKQDANCVNPLGAPEKLPEAKEQAEGSEPTSGTEGPEHSVNGPASPALNQGS" mat_peptide 216..3575 /note="mature T200 glycoprotein (Aa 1-1120)" misc_feature 252..260 /note="N-glycosylation site" misc_feature 357..365 /note="N-glycosylation site" misc_feature 441..449 /note="N-glycosylation site" misc_feature 471..479 /note="N-glycosylation site" misc_feature 489..497 /note="N-glycosylation site" misc_feature 666..674 /note="N-glycosylation site" misc_feature 795..803 /note="N-glycosylation site" misc_feature 918..926 /note="N-glycosylation site" misc_feature 1065..1073 /note="N-glycosylation site" misc_feature 1125..1133 /note="N-glycosylation site" misc_feature 1248..1256 /note="N-glycosylation site" misc_feature 1389..1454 /note="put. transmembrane region" polyA_site 4574..4579 /note="region of polyA site" BASE COUNT 1554 a 809 c 912 g 1322 t ORIGIN 1 cgacatttta actgaactgc gggataaagt gaaatctttc cgtgcagctc tacgagagga 61 ggaaattgtt cctcgtctga taagacaaca gtggagaaag gacgcatgct gtttcttagg 121 gacacggctg acttccagat atgaccatgt atttgtggct taaactcttg gcatttggct 181 ttgcctttct ggacacagaa gtatttgtga cagggcaaag cccaacacct tcccccactg 241 atgcctacct taatgcctct gaaacaacca ctctgagccc ttctggaagc gctgtcattt 301 caaccacaac aatagctact actccatcta agccaacatg tgatgaaaaa tatgcaaaca 361 tcactgtgga ttacttatat aacaaggaaa ctaaattatt tacagcaaag ctaaatgtta 421 atgagaatgt ggaatgtgga aacaatactt gcacaaacaa tgaggtgcat aaccttacag 481 aatgtaaaaa tgcgtctgtt tccatatctc ataattcatg tactgctcct gataagacat 541 taatattaga tgtgccacca ggggttgaaa agtttcagtt acatgattgt acacaagttg 601 aaaaagcaga tactactatt tgtttaaaat ggaaaaatat tgaaaccttt acttgtgata 661 cacagaatat tacctacaga tttcagtgtg gtaatatgat atttgataat aaagaaatta 721 aattagaaaa ccttgaaccc gaacatgagt ataagtgtga ctcagaaata ctctataata 781 accacaagtt tactaacgca agtaaaatta ttaaaacaga ttttgggagt ccaggagagc 841 ctcagattat tttttgtaga agtgaagctg cacatcaagg agtaattacc tggaatcccc 901 ctcaaagatc atttcataat tttaccctct gttatataaa agagacagaa aaagattgcc 961 tcaatctgga taaaaacctg atcaaatatg atttgcaaaa tttaaaacct tatacgaaat 1021 atgttttatc attacatgcc tacatcattg caaaagtgca acgtaatgga agtgctgcaa 1081 tgtgtcattt cacaactaaa agtgctcctc caagccaggt ctggaacatg actgtctcca 1141 tgacatcaga taatagtatg catgtcaagt gtaggcctcc cagggaccgt aatggccccc 1201 atgaacgtta ccatttggaa gttgaagctg gaaatactct ggttagaaat gagtcgcata 1261 agaattgcga tttccgtgta aaagatcttc aatattcaac agactacact tttaaggcct 1321 attttcacaa tggagactat cctggagaac cctttatttt acatcattca acatcttata 1381 attctaaggc actgatagca tttctggcat ttctgattat tgtgacatca atagccctgc 1441 ttgttgttct ctacaaaatc tatgatctac ataagaaaag atcctgcaat ttagatgaac 1501 agcaggagct tgttgaaagg gatgatgaaa aacaactgat gaatgtggag ccaatccatg 1561 cagatatttt gttggaaact tataagagga agattgctga tgaaggaaga ctttttctgg 1621 ctgaatttca gagcatcccg cgggtgttca gcaagtttcc tataaaggaa gctcgaaagc 1681 cctttaacca gaataaaaac cgttatgttg acattcttcc ttatgattat aaccgtgttg 1741 aactctctga gataaacgga gatgcagggt caaactacat aaatgccagc tatattgatg 1801 gtttcaaaga acccaggaaa tacattgctg cacaaggtcc cagggatgaa actgttgatg 1861 atttctggag gatgatttgg gaacagaaag ccacagttat tgtcatggtc actcgatgtg 1921 aagaaggaaa caggaacaag tgtgcagaat actggccgtc aatggaagag ggcactcggg 1981 cttttggaga tgttgttgta aagatcaacc agcacaaaag atgtccagat tacatcattc 2041 agaaattgaa cattgtaaat aaaaaagaaa aagcaactgg aagagaggtg actcacattc 2101 agttcaccag ctggccagac cacggggtgc ctgaggatcc tcacttgctc ctcaaactga 2161 gaaggagagt gaatgccttc agcaatttct tcagtggtcc cattgtggtg cactgcagtg 2221 ctggtgttgg gcgcacagga acctatatcg gaattgatgc catgctagaa ggcctggaag 2281 ccgagaacaa agtggatgtt tatggttatg ttgtcaagct aaggcgacag agatgcctga 2341 tggttcaagt agaggcccag tacatcttga tccatcaggc tttggtggaa tacaatcagt 2401 ttggagaaac agaagtgaat ttgtctgaat tacatccata tctacataac atgaagaaaa 2461 gggatccacc cagtgagccg tctccactag aggctgaatt ccagagactt ccttcatata 2521 ggagctggag gacacagcac attggaaatc aagaagaaaa taaaagtaaa aacaggaatt 2581 ctaatgtcat cccatatgac tataacagag tgccacttaa acatgagctg gaaatgagta 2641 aagagagtga gcatgattca gatgaatcct ctgatgatga cagtgattca gaggaaccaa 2701 gcaaatacat caatgcatct tttataatga gctactggaa acctgaagtg atgattgctg 2761 ctcagggacc actgaaggag accattggtg acttttggca gatgatcttc caaagaaaag 2821 tcaaagttat tgttatgctg acagaactga aacatggaga ccaggaaatc tgtgctcagt 2881 actggggaga aggaaagcaa acatatggag atattgaagt tgacctgaaa gacacagaca 2941 aatcttcaac ttataccctt cgtgtctttg aactgagaca ttccaagagg aaagactctc 3001 gaactgtgta ccagtaccaa tatacaaact ggagtgtgga gcagcttcct gcagaaccca 3061 aggaattaat ctctatgatt caggtcgtca aacaaaaact tccccagaag aattcctctg 3121 aagggaacaa gcatcacaag agtacacctc tactcattca ctgcagggat ggatctcagc 3181 aaacgggaat attttgtgct ttgttaaatc tcttagaaag tgcggaaaca gaagaggtag 3241 tggatatttt tcaagtggta aaagctctac gcaaagctag gccaggcatg gtttccacat 3301 tcgagcaata tcaattccta tatgacgtca ttgccagcac ctaccctgct cagaatggac 3361 aagtaaagaa aaacaaccat caagaagata aaattgaatt tgataatgaa gtggacaaag 3421 taaagcagga tgctaattgt gttaatccac ttggtgcccc agaaaagctc cctgaagcaa 3481 aggaacaggc tgaaggttct gaacccacga gtggcactga ggggccagaa cattctgtca 3541 atggtcctgc aagtccagcc ttaaatcaag gttcatagga aaagacataa atgaggaaac 3601 tccaaacctc ctgttagctg ttatttctat ttttgtagaa gtaggaagtg aaaataggta 3661 tacagtggat taattaaatg cagcgaacca atatttgtag aagggttata ttttactact 3721 gtggaaaaat atttaagata gttttgccag aacagtttgt acagacgtat gcttatttta 3781 aaattttatc tcttattcag taaaaaacaa cttctttgta atcgttatgt gtgtatatgt 3841 atgtgtgtat gggtgtgtgt ttgtgtgaga gacagagaaa gagagagaat tctttcaagt 3901 gaatctaaaa gcttttgctt ttcctttgtt tttatgaaga aaaaatacat tttatattag 3961 aagtgttaac ttagcttgaa ggatctgttt ttaaaaatca taaactgtgt gcagactcaa 4021 taaaatcatg tacatttctg aaatgacctc aagatgtcct ccttgttcta ctcatatata 4081 tctatcttat atacttacta ttttacttct agagatagta cataaaggtg gtatgtgtgt 4141 gtatgctact acaaaaaagt tgttaactaa attaacattg ggaaatctta tattccatat 4201 attagcattt agtccaatgt ctttttaagc ttatttaatt aaaaaatttc cagtgagctt 4261 atcatgctgt ctttacatgg ggttttcaat tttgcatgct cgattattcc ctgtacaata 4321 tttaaaattt attgcttgat acttttgaca acaaattagg ttttgtacaa ttgaacttaa 4381 ataaatgtca ttaaaataaa taaatgcaat atgtattaat attcattgta taaaaataga 4441 agaatacaaa catatttgtt aaatatttac atatgaaatt taatatagct atttttatgg 4501 aatttttcat tgatatgaaa aatatgatat tgcatatgca tagttcccat gttaaatccc 4561 attcataact ttcattaaag catttacttt gaatttc // LOCUS HSLCKB 2032 bp RNA PRI 12-SEP-1993 DEFINITION Human lck mRNA for membrane associated protein tyrosine kinase. ACCESSION X13529 NID g34294 KEYWORDS lck gene; membrane-associated protein; oncogene; phosphoprotein; tyrosine protein kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2032) AUTHORS Perlmutter,R.M., Marth,J.D., Lewis,D.B., Peet,R., Ziegler,S.F. and Wilson,C.B. TITLE Structure and expression of lck transcripts in human lymphoid cells JOURNAL J. Cell. Biochem. 38 (2), 117-126 (1988) MEDLINE 89123626 COMMENT See for overlapping sequence. FEATURES Location/Qualifiers source 1..2032 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="leukocyte" /chromosome="1p32-35" CDS 52..1581 /note="lck protein (AA 1-509)" /codon_start=1 /db_xref="PID:g34295" /db_xref="SWISS-PROT:P06239" /translation="MGCGCSSHPEDDWMENIDVCENCHYPIVPLDGKGTLLIRNGSEV RDPLVTYEGSNPPASPLQDNLVIALHSYEPSHDGDLGFEKGEPLRILEQSGEWWKAQS LTTGQEGFIPFNFVAKANSLEPEPWFFKNLSRKDAERQLLAPGNTHGSFLIRESESTA GSFSLSVRDFDQNQGEVVKHYKIRNLDNGGFYISPRITFPGLHELVRHYTNASDGLCT RLSRPCQTQKPQKPWWEDEWEVPRETLKLVERLGAGQFGEVWMGYYNGHTKVAVKSLK QGSMSPDAFLAEANLMKQLQHQRLVRLYAVVTQEPIYIITEYMENGSLVDFLKTPSGI KLTINKLLDMAAQIAEGMAFIEERNYIHRDLRAANILVSDTLSCKIADFGLARLIEDN EYTAREGAKFPIKWTAPEAINYGTFTIKSDVWSFGILLTEIVTHGRIPYPGMTNPEVI QNLERGYRMVRPDNCPEELYQLMRLCWKERPEDRPTFDYLRSVLEDFFTATEGQYQPQ P" misc_feature 2008..2013 /note="polyadenylation signal" polyA_site 2032 /note="polyadenylation site" BASE COUNT 450 a 579 c 581 g 422 t ORIGIN 1 cgcctggacc atgtgaatgg ggccagaggg ctcccgggct gggcagggac catgggctgt 61 ggctgcagct cacacccgga agatgactgg atggaaaaca tcgatgtgtg tgagaactgc 121 cattatccca tagtcccact ggatggcaag ggcacgctgc tcatccgaaa tggctctgag 181 gtgcgggacc cactggttac ctacgaaggc tccaatccgc cggcttcccc actgcaagac 241 aacctggtta tcgctctgca cagctatgag ccctctcacg acggagatct gggctttgag 301 aagggggaac cactccgcat cctggagcag agcggcgagt ggtggaaggc gcagtccctg 361 accacgggcc aggaaggctt catccccttc aattttgtgg ccaaagcgaa cagcctggag 421 cccgaaccct ggttcttcaa gaacctgagc cgcaaggacg cggagcggca gctcctggcg 481 cccgggaaca ctcacggctc cttcctcatc cgggagagcg agagcaccgc cgggtccttt 541 tcactgtcgg tccgggactt cgaccaaaac cagggagagg tggtgaaaca ttacaagatc 601 cgtaatctgg acaacggtgg cttctacatc tcccctcgaa tcacttttcc cggcctgcat 661 gaactggtcc gccattacac caatgcttca gatgggctgt gcacacggtt gagccgcccc 721 tgccagaccc agaagcccca gaagccgtgg tgggaggacg agtgggaggt tcccagggag 781 acgctgaagc tggtggagcg gctgggggct ggacagttcg gggaggtgtg gatggggtac 841 tacaacgggc acacgaaggt ggcggtgaag agcctgaagc agggcagcat gtccccggac 901 gccttcctgg ccgaggccaa cctcatgaag cagctgcaac accagcggct ggttcggctc 961 tacgctgtgg tcacccagga gcccatctac atcatcactg aatacatgga gaatgggagt 1021 ctagtggatt ttctcaagac cccttcaggc atcaagttga ccatcaacaa actcctggac 1081 atggcagccc aaattgcaga aggcatggca ttcattgaag agcggaatta tattcatcgt 1141 gaccttcggg ctgccaacat tctggtgtct gacaccctga gctgcaagat tgcagacttt 1201 ggcctagcac gcctcattga ggacaacgag tacacagcca gggagggggc caagtttccc 1261 attaagtgga cagcgccaga agccattaac tacgggacat tcaccatcaa gtcagatgtg 1321 tggtcttttg ggatcctgct gacggaaatt gtcacccacg gccgcatccc ttacccaggg 1381 atgaccaacc cggaggtgat tcagaacctg gagcgaggct accgcatggt gcgccctgac 1441 aactgtccag aggagctgta ccaactcatg aggctgtgct ggaaggagcg cccagaggac 1501 cggcccacct ttgactacct gcgcagtgtg ctggaggact tcttcacggc cacagagggc 1561 cagtaccagc ctcagccttg agaggaggcc ttgagaggcc ctggggttct ccccctttct 1621 ctccagcctg acttggggag atggagttct tgtgccatag tcacatggcc tatgcacata 1681 tggactctgc acatgaatcc cacccacatg tgacacatat gcaccttgtg tctgtacacg 1741 tgtcctgtag ttgcgtggac tctgcacatg tcttgtgcat gtgtagcctg tgcatgtatg 1801 tcttggacac tgtacaaggt acccctttct ggctctccca tttcctgaga ccaccagaga 1861 gaggggagaa gcctgggatt gacagaagct tctgcccacc tacttttctt tcctcagatc 1921 atccagaagt tcctcaaggg ccaggacttt atctaatacc tctgtgtgct cctccttggt 1981 gcctggcctg gcacacatca ggagttcaat aaatgtctgt tgatgactgc cg // LOCUS HSLDHAR 1661 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for lactate dehydrogenase-A (LDH-A, EC 1.1.1.27). ACCESSION X02152 NID g34312 KEYWORDS lactate dehydrogenase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1661) AUTHORS Tsujibo,H., Tiano,H.F. and Li,S.S. TITLE Nucleotide sequences of the cDNA and an intronless pseudogene for human lactate dehydrogenase-A isozyme JOURNAL Eur. J. Biochem. 147 (1), 9-15 (1985) MEDLINE 85127030 COMMENT Data kindly reviewed (03-JAN-1986) by S. Li. FEATURES Location/Qualifiers source 1..1661 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 98..1096 /note="lactate dehydrogenase-A" /codon_start=1 /db_xref="PID:g34313" /db_xref="SWISS-PROT:P00338" /translation="MATLKDQLIYNLLKEEQTPQNKITVVGVGAVGMACAISILMKDL ADELALVDVIEDKLKGEMMDLQHGSLFLRTPKIVSGKDYNVTANSKLVIITAGARQQE GESRLNLVQRNVNIFKFIIPNVVKYSPNCKLLIVSNPVDILTYVAWKISGFPKNRVIG SGCNLDSARFRYLMGERLGVHPLSCHGWVLGEHGDSSVPVWSGMNVAGVSLKTLHPDL GTDKDKEQWKEVHKQVVESAYEVIKLKGYTSWAIGLSVADLAESIMKNLRRVHPVSTM IKGLYGIKDDVFLSVPCILGQNGISDLVKVTLTSEEEARLKKSADTLWGIQKELQF" misc_feature 1644..1649 /note="put. polyA signal" polyA_site 1661 /note="polyA site" BASE COUNT 458 a 340 c 388 g 475 t ORIGIN 1 tgctgcagcc gctgccgccg attccggatc tcattgccac gcgcccccga cgaccgcccg 61 acgtgcattc ccgattcctt ttggttccaa gtccaatatg gcaactctaa aggatcagct 121 gatttataat cttctaaagg aagaacagac cccccagaat aagattacag ttgttggggt 181 tggtgctgtt ggcatggcct gtgccatcag tatcttaatg aaggacttgg cagatgaact 241 tgctcttgtt gatgtcatcg aagacaaatt gaagggagag atgatggatc tccaacatgg 301 cagccttttc cttagaacac caaagattgt ctctggcaaa gactataatg taactgcaaa 361 ctccaagctg gtcattatca cggctggggc acgtcagcaa gagggagaaa gccgtcttaa 421 tttggtccag cgtaacgtga acatatttaa attcatcatt cctaatgttg taaaatacag 481 cccgaactgc aagttgctta ttgtttcaaa tccagtggat atcttgacct acgtggcttg 541 gaagataagt ggttttccca aaaaccgtgt tattggaagt ggttgcaatc tggattcagc 601 ccgattccgt tacctgatgg gggaaaggct gggagttcac ccattaagct gtcatgggtg 661 ggtccttggg gaacatggag attccagtgt gcctgtatgg agtggaatga atgttgctgg 721 tgtctctctg aagactctgc acccagattt agggactgat aaagataagg aacagtggaa 781 agaggttcac aagcaggtgg ttgagagtgc ttatgaggtg atcaaactca aaggctacac 841 atcctgggct attggactct ctgtagcaga tttggcagag agtataatga agaatcttag 901 gcgggtgcac ccagtttcca ccatgattaa gggtctttac ggaataaagg atgatgtctt 961 ccttagtgtt ccttgcattt tgggacagaa tggaatctca gaccttgtga aggtgactct 1021 gacttctgag gaagaggccc gtttgaagaa gagtgcagat acactttggg ggatccaaaa 1081 ggagctgcaa ttttaaagtc ttctgatgtc atatcatttc actgtctagg ctacaacagg 1141 attctaggtg gaggttgtgc atgttgtcct ttttatctga tctgtgatta aagcagtaat 1201 attttaagat ggactgggaa aaacatcaac tcctgaagtt agaaataaga atggtttgta 1261 aaatccacag ctatatcctg atgctggatg gtattaatct tgtgtagtct tcaactggtt 1321 agtgtgaaat agttctgcca cctctgacgc accactgcca atgctgtacg tactgcattt 1381 gccccttgag ccaggtggat gtttaccgtg tgttatataa cttcctggct ccttcactga 1441 acatgcctag tccaacattt tttcccagtg agtcacatcc tgggatccag tgtataaatc 1501 caatatcatg tcttgtgcat aattcttcca aaggatctta ttttgtgaac tatatcagta 1561 gtgtacatta ccatataatg taaaaagatc tacatacaaa caatgcaacc aactatccaa 1621 gtgttatacc aactaaaacc cccaataaac cttgaacagt g // LOCUS HSLDL100 14121 bp RNA PRI 30-MAR-1995 DEFINITION Human mRNA for apolipoprotein B-100. ACCESSION X04506 NID g34330 KEYWORDS apolipoprotein B-100; low density lipoprotein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 14121) AUTHORS Knott,T.J., Wallis,S.C., Powell,L.M., Pease,R.J., Lusis,A.J., Blackhart,B., McCarthy,B.J., Mahley,R.W., Levy-Wilson,B. and Scott,J. TITLE Complete cDNA and derived protein sequence of human apolipoprotein B-100 JOURNAL Nucleic Acids Res. 14 (18), 7501-7503 (1986) MEDLINE 87016385 REFERENCE 2 (bases 1 to 14121) AUTHORS Knott,T.J. TITLE Direct Submission JOURNAL Submitted (31-OCT-1986) Knott T.J., Molecular Medicine Research Group, MRC Clinical Research Centre, Watford Road, Harrow, Middlesex, England FEATURES Location/Qualifiers source 1..14121 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 129..209 /note="signal peptide (aa -27 to -1)" CDS 129..13820 /note="precursor" /codon_start=1 /db_xref="PID:g34331" /db_xref="SWISS-PROT:P04114" /translation="MDPPRPALLALLALPALLLLLLAGARAEEEMLENVSLVCPKDAT RFKHLRKYTYNYEAESSSGVPGTADSRSATRINCKVELEVPQLCSFILKTSQCTLKEV YGFNPEGKALLKKTKNSEEFAAAMSRYELKLAIPEGKQVFLYPEKDEPTYILNIKRGI ISALLVPPETEEAKQVLFLDTVYGNCSTHFTVKTRKGNVATEISTERDLGQCDRFKPI RTGISPLALIKGMTRPLSTLISSSQSCQYTLDAKRKHVAEAICKEQHLFLPFSYNNKY GMVAQVTQTLKLEDTPKINSRFFGEGTKKMGLAFESTKSTSPPKQAEAVLKTLQELKK LTISEQNIQRANLFNKLVTELRGLSDEAVTSLLPQLIEVSSPITLQALVQCGQPQCST HILQWLKRVHANPLLIDVVTYLVALIPEPSAQQLREIFNMARDQRSRATLYALSHAVN NYHKTNPTGTQELLDIANYLMEQIQDDCTGDEDYTYLILRVIGNMGQTMEQLTPELKS SILKCVQSTKPSLMIQKAAIQALRKMEPKDKDQEVLLQTFLDDASPGDKRLAAYLMLM RSPSQADINKIVQILPWEQNEQVKNFVASHIANILNSEELDIQDLKKLVKEALKESQL PTVMDFRKFSRNYQLYKSVSLPSLDPASAKIEGNLIFDPNNYLPKESMLKTTLTAFGF ASADLIEIGLEGKGFEPTLEALFGKQGFFPDSVNKALYWVNGQVPDGVSKVLVDHFGY TKDDKHEQDMVNGIMLSVEKLIKDLKSKEVPEARAYLRILGEELGFASLHDLQLLGKL LLMGARTLQGIPQMIGEVIRKGSKNDFFLHYIFMENAFELPTGAGLQLQISSSGVIAP GAKAGVKLEVANMQAELVAKPSVSVEFVTNMGIIIPDFARSGVQMNTNFFHESGLEAH VALKAGKLKFIIPSPKRPVKLLSGGNTLHLVSTTKTEVIPPLIENRQSWSVCKQVFPG LNYCTSGAYSNASSTDSASYYPLTGDTRLELELRPTGEIEQYSVSATYELQREDRALV DTLKFVTQAEGAKQTEATMTFKYNRQSMTLSSEVQIPDFDVDLGTILRVNDESTEGKT SYRLTLDIQNKKITEVALMGHLSCDTKEERKIKGVISIPRLQAEARSEILAHWSPAKL LLQMDSSATAYGSTVSKRVAWHYDEEKIEFEWNTGTNVDTKKMTSNFPVDLSDYPKSL HMYANRLLDHRVPETDMTFRHVGSKLIVAMSSWLQKASGSLPYTQTLQDHLNSLKEFN LQNMGLPDFHIPENLFLKSDGRVKYTLNKNSLKIEIPLPFGGKSSRDLKMLETVRTPA LHFKSVGFHLPSREFQVPTFTIPKLYQLQVPLLGVLDLSTNVYSNLYNWSASYSGGNT STDHFSLRARYHMKADSVVDLLSYNVQGSGETTYDHKNTFTLSCDGSLRHKFLDSNIK FSHVEKLGNNPVSKGLLIFDASSSWGPQMSASVHLDSKKKQHLFVKEVKIDGQFRVSS FYAKGTYGLSCQRDPNTGRLNGESNLRFNSSYLQGTNQITGRYEDGTLSLTSTSDLQS GIIKNTASLKYENYELTLKSDTNGKYKNFATSNKMDMTFSKQNALLRSEYQADYESLR FFSLLSGSLNSHGLELNADILGTDKINSGAHKATLRIGQDGISTSATTNLKCSLLVLE NELNAELGLSGASMKLTTNGRFREHNAKFSLDGKAALTELSLGSAYQAMILGVDSKNI FNFKVSQEGLKLSNDMMGSYAEMKFDHTNSLNIAGLSLDFSSKLDNIYSSDKFYKQTV NLQLQPYSLVTTLNSDLKYNALDLTNNGKLRLEPLKLHVAGNLKGAYQNNEIKHIYAI SSAALSASYKADTVAKVQGVEFSHRLNTDIAGLASAIDMSTNYNSDSLHFSNVFRSVM APFTMTIDAHTNGNGKLALWGEHTGQLYSKFLLKAEPLAFTFSHDYKGSTSHHLVSRK SISAALEHKVSALLTPAEQTGTWKLKTQFNNNEYSQDLDAYNTKDKIGVELTGRTLAD LTLLDSPIKVPLLLSEPINIIDALEMRDAVEKPQEFTIVAFVKYDKNQDVHSINLPFF ETLQEYFERNRQTIIVVVENVQRNLKHINIDQFVRKYRAALGKLPQQANDYLNSFNWE RQVSHAKEKLTALTKKYRITENDIQIALDDAKINFNEKLSQLQTYMIQFDQYIKDSYD LHDLKIAIANIIDEIIEKLKSLDEHYHIRVNLVKTIHDLHLFIENIDFNKSGSSTASW IQNVDTKYQIRIQIQEKLQQLKRHIQNIDIQHLAGKLKQHIEAIDVRVLLDQLGTTIS FERINDVLEHVKHFVINLIGDFEVAEKINAFRAKVHELIERYEVDQQIQVLMDKLVEL THQYKLKETIQKLSNVLQQVKIKDYFEKLVGFIDDAVKKLNELSFKTFIEDVNKFLDM LIKKLKSFDYHQFVDETNDKIREVTQRLNGEIQALELPQKAEALKLFLEETKATVAVY LESLQDTKITLIINWLQEALSSASLAHMKAKFRETLEDTRDRMYQMDIQQELQRYLSL VGQVYSTLVTYISDWWTLAAKNLTDFAEQYSIQDWAKRMKALVEQGFTVPEIKTILGT MPAFEVSLQALQKATFQTPDFIVPLTDLRIPSVQINFKDLKNIKIPSRFSTPEFTILN TFHIPSFTIDFVEMKVKIIRTIDQMQNSELQWPVPDIYLRDLKVEDIPLARITLPDFR LPEIAIPEFIIPTLNLNDFQVPDLHIPEFQLPHISHTIEVPTFGKLYSILKIQSPLFT LDANADIGNGTTSANEAGIAASITAKGESKLEVLNFDFQANAQLSNPKINPLALKESV KFSSKYLRTEHGSEMLFFGNAIEGKSNTVASLHTEKNTLELSNGVIVKINNQLTLDSN TKYFHKLNIPKLDFSSQADLRNEIKTLLKAGHIAWTSSGKGSWKWACPRFSDEGTHES QISFTIEGPLTSFGLSNKINSKHLRVNQNLVYESGSLNFSKLEIQSQVDSQHVGHSVL TAKGMALFGEGKAEFTGRHDAHLNGKVIGTLKNSLFFSAQPFEITASTNNEGNLKVRF PLRLTGKIDFLNNYALFLSPSAQQASWQVSARFNQYKYNQNFSAGNNENIMEAHVGIN GEANLDFLNIPLTIPEMRLPYTIITTPPLKDFSLWEKTGLKEFLKTTKQSFDLSVKAQ YKKNKHRHSITNPLAVLCEFISQSIKSFDRHFEKNRNNALDFVTKSYNETKIKFDKYK AEKSHDELPRTFQIPGYTVPVVNVEVSPFTIEMSAFGYVFPKAVSMPSFSILGSDVRV PSYTLILPSLELPVLHVPRNLKLSLPHFKELCTISHIFIPAMGNITYDFSFKSSVITL NTNAELFNQSDIVAHLLSSSSSVIDALQYKLEGTTRLTRKRGLKLATALSLSNKFVEG SHNSTVSLTTKNMEVSVAKTTKAEIPILRMNFKQELNGNTKSKPTVSSSMEFKYDFNS SMLYSTAKGAVDHKLSLESLTSYFSIESSTKGDVKGSVLSREYSGTIASEANTYLNSK STRSSVKLQGTSKIDDIWNLEVKENFAGEATLQRIYSLWEHSTKNHLQLEGLFFTNGE HTSKATLELSPWQMSALVQVHASQPSSFHDFPDLGQEVALNANTKNQKIRWKNEVRIH SGSFQSQVELSNDQEKAHLDIAGSLEGHLRFLKNIILPVYDKSLWDFLKLDVTTSIGR RQHLRVSTAFVYTKNPNGYSFSIPVKVLADKFITPGLKLNDLNSVLVMPTFHVPFTDL QVPSCKLDFREIQIYKKLRTSSFALNLPTLPEVKFPEVDVLTKYSQPEDSLIPFFEIT VPESQLTVSQFTLPKSVSDGIAALDLNAVANKIADFELPTIIVPEQTIEIPSIKFSVP AGIVIPSFQALTARFEVDSPVYNATWSASLKNKADYVETVLDSTCSSTVQFLEYELNV LGTHKIEDGTLASKTKGTLAHRDFSAEYEEDGKFEGLQEWEGKAHLNIKSPAFTDLHL RYQKDKKGISTSAASPAVGTVGMDMDEDDDFSKWNFYYSPQSSPDKKLTIFKTELRVR ESDEETQIKVNWEEEAASGLLTSLKDNVPKATGVLYDYVNKYHWEHTGLTLREVSSKL RRNLQNNAEWVYQGAIRQIDDIDVRFQKAASGTTGTYQEWKDKAQNLYQELLTQEGQA SFQGLKDNVFDGLVRVTQKFHMKVKHLIDSLIDFLNFPRFQFPGKPGIYTREELCTMF IREVGTVLSQVYSKVHNGSEILFSYFQDLVITLPFELRKHKLIDVISMYRELLKDLSK EAQEVFKAIQSLKTTEVLRNLQDLLQFIFQLIEDNIKQLKEMKFTYLINYIQDEINTI FNDYIPYVFKLLKENLCLNLHKFNEFIQNELQEASQELQQIHQYIMALREEYFDPSIV GWTVKYYELEEKIVSLIKNLLVALKDFHSEYIVSASNFTSQLSSQVEQFLHRNIQEYL SILTDPDGKGKEKIAELSATAQEIIKSQAIATKKIISDYHQQFRYKLQDFSDQLSDYY EKFIAESKRLIDLSIQNYHTFLIYITELLKKLQSTTVMNPYMKLAPGELTIIL" mat_peptide 210..13817 /note="apolipoprotein B-100" misc_feature 14093..14098 /note="polyA signal" polyA_site 14121 /note="polyA site" BASE COUNT 4371 a 3240 c 2903 g 3607 t ORIGIN 1 attcccaccg ggacctgcgg ggctgagtgc ccttctcggt tgctgccgct gaggagcccg 61 cccagccagc cagggccgcg aggccgaggc caggccgcag cccaggagcc gccccaccgc 121 agctggcgat ggacccgccg aggcccgcgc tgctggcgct gctggcgctg cctgcgctgc 181 tgctgctgct gctggcgggc gccagggccg aagaggaaat gctggaaaat gtcagcctgg 241 tctgtccaaa agatgcgacc cgattcaagc acctccggaa gtacacatac aactatgagg 301 ctgagagttc cagtggagtc cctgggactg ctgattcaag aagtgccacc aggatcaact 361 gcaaggttga gctggaggtt ccccagctct gcagcttcat cctgaagacc agccagtgca 421 ccctgaaaga ggtgtatggc ttcaaccctg agggcaaagc cttgctgaag aaaaccaaga 481 actctgagga gtttgctgca gccatgtcca ggtatgagct caagctggcc attccagaag 541 ggaagcaggt tttcctttac ccggagaaag atgaacctac ttacatcctg aacatcaaga 601 ggggcatcat ttctgccctc ctggttcccc cagagacaga agaagccaag caagtgttgt 661 ttctggatac cgtgtatgga aactgctcca ctcactttac cgtcaagacg aggaagggca 721 atgtggcaac agaaatatcc actgaaagag acctggggca gtgtgatcgc ttcaagccca 781 tccgcacagg catcagccca cttgctctca tcaaaggcat gacccgcccc ttgtcaactc 841 tgatcagcag cagccagtcc tgtcagtaca cactggacgc taagaggaag catgtggcag 901 aagccatctg caaggagcaa cacctcttcc tgcctttctc ctacaacaat aagtatggga 961 tggtagcaca agtgacacag actttgaaac ttgaagacac accaaagatc aacagccgct 1021 tctttggtga aggtactaag aagatgggcc tcgcatttga gagcaccaaa tccacatcac 1081 ctccaaagca ggccgaagct gttttgaaga ctctccagga actgaaaaaa ctaaccatct 1141 ctgagcaaaa tatccagaga gctaatctct tcaataagct ggttactgag ctgagaggcc 1201 tcagtgatga agcagtcaca tctctcttgc cacagctgat tgaggtgtcc agccccatca 1261 ctttacaagc cttggttcag tgtggacagc ctcagtgctc cactcacatc ctccagtggc 1321 tgaaacgtgt gcatgccaac ccccttctga tagatgtggt cacctacctg gtggccctga 1381 tccccgagcc ctcagcacag cagctgcgag agatcttcaa catggcgagg gatcagcgca 1441 gccgagccac cttgtatgcg ctgagccacg cggtcaacaa ctatcataag acaaacccta 1501 cagggaccca ggagctgctg gacattgcta attacctgat ggaacagatt caagatgact 1561 gcactgggga tgaagattac acctatttga ttctgcgggt cattggaaat atgggccaaa 1621 ccatggagca gttaactcca gaactcaagt cttcaatcct caaatgtgtc caaagtacaa 1681 agccatcact gatgatccag aaagctgcca tccaggctct gcggaaaatg gagcctaaag 1741 acaaggacca ggaggttctt cttcagactt tccttgatga tgcttctccg ggagataagc 1801 gactggctgc ctatcttatg ttgatgagga gtccttcaca ggcagatatt aacaaaattg 1861 tccaaattct accatgggaa cagaatgagc aagtgaagaa ctttgtggct tcccatattg 1921 ccaatatctt gaactcagaa gaattggata tccaagatct gaaaaagtta gtgaaagaag 1981 ctctgaaaga atctcaactt ccaactgtca tggacttcag aaaattctct cggaactatc 2041 aactctacaa atctgtttct cttccatcac ttgacccagc ctcagccaaa atagaaggga 2101 atcttatatt tgatccaaat aactaccttc ctaaagaaag catgctgaaa actaccctca 2161 ctgcctttgg atttgcttca gctgacctca tcgagattgg cttggaagga aaaggctttg 2221 agccaacatt ggaagctctt tttgggaagc aaggattttt cccagacagt gtcaacaaag 2281 ctttgtactg ggttaatggt caagttcctg atggtgtctc taaggtctta gtggaccact 2341 ttggctatac caaagatgat aaacatgagc aggatatggt aaatggaata atgctcagtg 2401 ttgagaagct gattaaagat ttgaaatcca aagaagtccc ggaagccaga gcctacctcc 2461 gcatcttggg agaggagctt ggttttgcca gtctccatga cctccagctc ctgggaaagc 2521 tgcttctgat gggtgcccgc actctgcagg ggatccccca gatgattgga gaggtcatca 2581 ggaagggctc aaagaatgac ttttttcttc actacatctt catggagaat gcctttgaac 2641 tccccactgg agctggatta cagttgcaaa tatcttcatc tggagtcatt gctcccggag 2701 ccaaggctgg agtaaaactg gaagtagcca acatgcaggc tgaactggtg gcaaaaccct 2761 ccgtgtctgt ggagtttgtg acaaatatgg gcatcatcat tccggacttc gctaggagtg 2821 gggtccagat gaacaccaac ttcttccacg agtcgggtct ggaggctcat gttgccctaa 2881 aagctgggaa gctgaagttt atcattcctt ccccaaagag accagtcaag ctgctcagtg 2941 gaggcaacac attacatttg gtctctacca ccaaaacgga ggtgatccca cctctcattg 3001 agaacaggca gtcctggtca gtttgcaagc aagtctttcc tggcctgaat tactgcacct 3061 caggcgctta ctccaacgcc agctccacag actccgcctc ctactatccg ctgaccgggg 3121 acaccagatt agagctggaa ctgaggccta caggagagat tgagcagtat tctgtcagcg 3181 caacctatga gctccagaga gaggacagag ccttggtgga taccctgaag tttgtaactc 3241 aagcagaagg tgcgaagcag actgaggcta ccatgacatt caaatataat cggcagagta 3301 tgaccttgtc cagtgaagtc caaattccgg attttgatgt tgacctcgga acaatcctca 3361 gagttaatga tgaatctact gagggcaaaa cgtcttacag actcaccctg gacattcaga 3421 acaagaaaat tactgaggtc gccctcatgg gccacctaag ttgtgacaca aaggaagaaa 3481 gaaaaatcaa gggtgttatt tccatacccc gtttgcaagc agaagccaga agtgagatcc 3541 tcgcccactg gtcgcctgcc aaactgcttc tccaaatgga ctcatctgct acagcttatg 3601 gctccacagt ttccaagagg gtggcatggc attatgatga agagaagatt gaatttgaat 3661 ggaacacagg caccaatgta gataccaaaa aaatgacttc caatttccct gtggatctct 3721 ccgattatcc taagagcttg catatgtatg ctaatagact cctggatcac agagtccctg 3781 aaacagacat gactttccgg cacgtgggtt ccaaattaat agttgcaatg agctcatggc 3841 ttcagaaggc atctgggagt cttccttata cccagacttt gcaagaccac ctcaatagcc 3901 tgaaggagtt caacctccag aacatgggat tgccagactt ccacatccca gaaaacctct 3961 tcttaaaaag cgatggccgg gtcaaatata ccttgaacaa gaacagtttg aaaattgaga 4021 ttcctttgcc ttttggtggc aaatcctcca gagatctaaa gatgttagag actgttagga 4081 caccagccct ccacttcaag tctgtgggat tccatctgcc atctcgagag ttccaagtcc 4141 ctacttttac cattcccaag ttgtatcaac tgcaagtgcc tctcctgggt gttctagacc 4201 tctccacgaa tgtctacagc aacttgtaca actggtccgc ctcctacagt ggtggcaaca 4261 ccagcacaga ccatttcagc cttcgggctc gttaccacat gaaggctgac tctgtggttg 4321 acctgctttc ctacaatgtg caaggatctg gagaaacaac atatgaccac aagaatacgt 4381 tcacactatc atgtgatggg tctctacgcc acaaatttct agattcgaat atcaaattca 4441 gtcatgtaga aaaacttgga aacaacccag tctcaaaagg tttactaata ttcgatgcat 4501 ctagttcctg gggaccacag atgtctgctt cagttcattt ggactccaaa aagaaacagc 4561 atttgtttgt caaagaagtc aagattgatg ggcagttcag agtctcttcg ttctatgcta 4621 aaggcacata tggcctgtct tgtcagaggg atcctaacac tggccggctc aatggagagt 4681 ccaacctgag gtttaactcc tcctacctcc aaggcaccaa ccagataaca ggaagatatg 4741 aagatggaac cctctccctc acctccacct ctgatctgca aagtggcatc attaaaaata 4801 ctgcttccct aaagtatgag aactacgagc tgactttaaa atctgacacc aatgggaagt 4861 ataagaactt tgccacttct aacaagatgg atatgacctt ctctaagcaa aatgcactgc 4921 tgcgttctga atatcaggct gattacgagt cattgaggtt cttcagcctg ctttctggat 4981 cactaaattc ccatggtctt gagttaaatg ctgacatctt aggcactgac aaaattaata 5041 gtggtgctca caaggcgaca ctaaggattg gccaagatgg aatatctacc agtgcaacga 5101 ccaacttgaa gtgtagtctc ctggtgctgg agaatgagct gaatgcagag cttggcctct 5161 ctggggcatc tatgaaatta acaacaaatg gccgcttcag ggaacacaat gcaaaattca 5221 gtctggatgg gaaagccgcc ctcacagagc tatcactggg aagtgcttat caggccatga 5281 ttctgggtgt cgacagcaaa aacattttca acttcaaggt cagtcaagaa ggacttaagc 5341 tctcaaatga catgatgggc tcatatgctg aaatgaaatt tgaccacaca aacagtctga 5401 acattgcagg cttatcactg gacttctctt caaaacttga caacatttac agctctgaca 5461 agttttataa gcaaactgtt aatttacagc tacagcccta ttctctggta actactttaa 5521 acagtgacct gaaatacaat gctctggatc tcaccaacaa tgggaaacta cggctagaac 5581 ccctgaagct gcatgtggct ggtaacctaa aaggagccta ccaaaataat gaaataaaac 5641 acatctatgc catctcttct gctgccttat cagcaagcta taaagcagac actgttgcta 5701 aggttcaggg tgtggagttt agccatcggc tcaacacaga catcgctggg ctggcttcag 5761 ccattgacat gagcacaaac tataattcag actcactgca tttcagcaat gtcttccgtt 5821 ctgtaatggc cccgtttacc atgaccatcg atgcacatac aaatggcaat gggaaactcg 5881 ctctctgggg agaacatact gggcagctgt atagcaaatt cctgttgaaa gcagaacctc 5941 tggcatttac tttctctcat gattacaaag gctccacaag tcatcatctc gtgtctagga 6001 aaagcatcag tgcagctctt gaacacaaag tcagtgccct gcttactcca gctgagcaga 6061 caggcacctg gaaactcaag acccaattta acaacaatga atacagccag gacttggatg 6121 cttacaacac taaagataaa attggcgtgg agcttactgg acgaactctg gctgacctaa 6181 ctctactaga ctccccaatt aaagtgccac ttttactcag tgagcccatc aatatcattg 6241 atgctttaga gatgagagat gccgttgaga agccccaaga atttacaatt gttgcttttg 6301 taaagtatga taaaaaccaa gatgttcact ccattaacct cccatttttt gagaccttgc 6361 aagaatattt tgagaggaat cgacaaacca ttatagttgt agtggaaaac gtacagagaa 6421 acctgaagca catcaatatt gatcaatttg taagaaaata cagagcagcc ctgggaaaac 6481 tcccacagca agctaatgat tatctgaatt cattcaattg ggagagacaa gtttcacatg 6541 ccaaggagaa actgactgct ctcacaaaaa agtatagaat tacagaaaat gatatacaaa 6601 ttgcattaga tgatgccaaa atcaacttta atgaaaaact atctcaactg cagacatata 6661 tgatacaatt tgatcagtat attaaagata gttatgattt acatgatttg aaaatagcta 6721 ttgctaatat tattgatgaa atcattgaaa aattaaaaag tcttgatgag cactatcata 6781 tccgtgtaaa tttagtaaaa acaatccatg atctacattt gtttattgaa aatattgatt 6841 ttaacaaaag tggaagtagt actgcatcct ggattcaaaa tgtggatact aagtaccaaa 6901 tcagaatcca gatacaagaa aaactgcagc agcttaagag acacatacag aatatagaca 6961 tccagcacct agctggaaag ttaaaacaac acattgaggc tattgatgtt agagtgcttt 7021 tagatcaatt gggaactaca atttcatttg aaagaataaa tgatgttctt gagcatgtca 7081 aacactttgt tataaatctt attggggatt ttgaagtagc tgagaaaatc aatgccttca 7141 gagccaaagt ccatgagtta atcgagaggt atgaagtaga ccaacaaatc caggttttaa 7201 tggataaatt agtagagttg acccaccaat acaagttgaa ggagactatt cagaagctaa 7261 gcaatgtcct acaacaagtt aagataaaag attactttga gaaattggtt ggatttattg 7321 atgatgctgt gaagaagctt aatgaattat cttttaaaac attcattgaa gatgttaaca 7381 aattccttga catgttgata aagaaattaa agtcatttga ttaccaccag tttgtagatg 7441 aaaccaatga caaaatccgt gaggtgactc agagactcaa tggtgaaatt caggctctgg 7501 aactaccaca aaaagctgaa gcattaaaac tgtttttaga ggaaaccaag gccacagttg 7561 cagtgtatct ggaaagccta caggacacca aaataacctt aatcatcaat tggttacagg 7621 aggctttaag ttcagcatct ttggctcaca tgaaggccaa attccgagag actctagaag 7681 atacacgaga ccgaatgtat caaatggaca ttcagcagga acttcaacga tacctgtctc 7741 tggtaggcca ggtttatagc acacttgtca cctacatttc tgattggtgg actcttgctg 7801 ctaagaacct tactgacttt gcagagcaat attctatcca agattgggct aaacgtatga 7861 aagcattggt agagcaaggg ttcactgttc ctgaaatcaa gaccatcctt gggaccatgc 7921 ctgcctttga agtcagtctt caggctcttc agaaagctac cttccagaca cctgatttta 7981 tagtccccct aacagatttg aggattccat cagttcagat aaacttcaaa gacttaaaaa 8041 atataaaaat cccatccagg ttttccacac cagaatttac catccttaac accttccaca 8101 ttccttcctt tacaattgac tttgtcgaaa tgaaagtaaa gatcatcaga accattgacc 8161 agatgcagaa cagtgagctg cagtggcccg ttccagatat atatctcagg gatctgaagg 8221 tggaggacat tcctctagcg agaatcaccc tgccagactt ccgtttacca gaaatcgcaa 8281 ttccagaatt cataatccca actctcaacc ttaatgattt tcaagttcct gaccttcaca 8341 taccagaatt ccagcttccc cacatctcac acacaattga agtacctact tttggcaagc 8401 tatacagtat tctgaaaatc caatctcctc ttttcacatt agatgcaaat gctgacatag 8461 ggaatggaac cacctcagca aacgaagcag gtatcgcagc ttccatcact gccaaaggag 8521 agtccaaatt agaagttctc aattttgatt ttcaagcaaa tgcacaactc tcaaacccta 8581 agattaatcc gctggctctg aaggagtcag tgaagttctc cagcaagtac ctgagaacgg 8641 agcatgggag tgaaatgctg ttttttggaa atgctattga gggaaaatca aacacagtgg 8701 caagtttaca cacagaaaaa aatacactgg agcttagtaa tggagtgatt gtcaagataa 8761 acaatcagct taccctggat agcaacacta aatacttcca caaattgaac atccccaaac 8821 tggacttctc tagtcaggct gacctgcgca acgagatcaa gacactgttg aaagctggcc 8881 acatagcatg gacttcttct ggaaaagggt catggaaatg ggcctgcccc agattctcag 8941 atgagggaac acatgaatca caaattagtt tcaccataga aggacccctc acttcctttg 9001 gactgtccaa taagatcaat agcaaacacc taagagtaaa ccaaaacttg gtttatgaat 9061 ctggctccct caacttttct aaacttgaaa ttcaatcaca agtcgattcc cagcatgtgg 9121 gccacagtgt tctaactgct aaaggcatgg cactgtttgg agaagggaag gcagagttta 9181 ctgggaggca tgatgctcat ttaaatggaa aggttattgg aactttgaaa aattctcttt 9241 tcttttcagc ccagccattt gagatcacgg catccacaaa caatgaaggg aatttgaaag 9301 ttcgttttcc attaaggtta acagggaaga tagacttcct gaataactat gcactgtttc 9361 tgagtcccag tgcccagcaa gcaagttggc aagtaagtgc taggttcaat cagtataagt 9421 acaaccaaaa tttctctgct ggaaacaacg agaacattat ggaggcccat gtaggaataa 9481 atggagaagc aaatctggat ttcttaaaca ttcctttaac aattcctgaa atgcgtctac 9541 cttacacaat aatcacaact cctccactga aagatttctc tctatgggaa aaaacaggct 9601 tgaaggaatt cttgaaaacg acaaagcaat catttgattt aagtgtaaaa gctcagtata 9661 agaaaaacaa acacaggcat tccatcacaa atcctttggc tgtgctttgt gagtttatca 9721 gtcagagcat caaatccttt gacaggcatt ttgaaaaaaa cagaaacaat gcattagatt 9781 ttgtcaccaa atcctataat gaaacaaaaa ttaagtttga taagtacaaa gctgaaaaat 9841 ctcacgacga gctccccagg acctttcaaa ttcctggata cactgttcca gttgtcaatg 9901 ttgaagtgtc tccattcacc atagagatgt cggcattcgg ctatgtgttc ccaaaagcag 9961 tcagcatgcc tagtttctcc atcctaggtt ctgacgtccg tgtgccttca tacacattaa 10021 tcctgccatc attagagctg ccagtccttc atgtccctag aaatctcaag ctttctcttc 10081 cacatttcaa ggaattgtgt accataagcc atatttttat tcctgccatg ggcaatatta 10141 cctatgattt ctcctttaaa tcaagtgtca tcacactgaa taccaatgct gaacttttta 10201 accagtcaga tattgttgct catctccttt cttcatcttc atctgtcatt gatgcactgc 10261 agtacaaatt agagggcacc acaagattga caagaaaaag gggattgaag ttagccacag 10321 ctctgtctct gagcaacaaa tttgtggagg gtagtcataa cagtactgtg agcttaacca 10381 cgaaaaatat ggaagtgtca gtggcaaaaa ccacaaaagc cgaaattcca attttgagaa 10441 tgaatttcaa gcaagaactt aatggaaata ccaagtcaaa acctactgtc tcttcctcca 10501 tggaatttaa gtatgatttc aattcttcaa tgctgtactc taccgctaaa ggagcagttg 10561 accacaagct tagcttggaa agcctcacct cttacttttc cattgagtca tctaccaaag 10621 gagatgtcaa gggttcggtt ctttctcggg aatattcagg aactattgct agtgaggcca 10681 acacttactt gaattccaag agcacacggt cttcagtgaa gctgcagggc acttccaaaa 10741 ttgatgatat ctggaacctt gaagtaaaag aaaattttgc tggagaagcc acactccaac 10801 gcatatattc cctctgggag cacagtacga aaaaccactt acagctagag ggcctctttt 10861 tcaccaacgg agaacataca agcaaagcca ccctggaact ctctccatgg caaatgtcag 10921 ctcttgttca ggtccatgca agtcagccca gttccttcca tgatttccct gaccttggcc 10981 aggaagtggc cctgaatgct aacactaaga accagaagat cagatggaaa aatgaagtcc 11041 ggattcattc tgggtctttc cagagccagg tcgagctttc caatgaccaa gaaaaggcac 11101 accttgacat tgcaggatcc ttagaaggac acctaaggtt cctcaaaaat atcatcctac 11161 cagtctatga caagagctta tgggatttcc taaagctgga tgtaaccacc agcattggta 11221 ggagacagca tcttcgtgtt tcaactgcct ttgtgtacac caaaaacccc aatggctatt 11281 cattctccat ccctgtaaaa gttttggctg ataaattcat tactcctggg ctgaaactaa 11341 atgatctaaa ttcagttctt gtcatgccta cgttccatgt cccatttaca gatcttcagg 11401 ttccatcgtg caaacttgac ttcagagaaa tacaaatcta taagaagctg agaacttcat 11461 catttgccct caacctacca acactccccg aggtaaaatt ccctgaagtt gatgtgttaa 11521 caaaatattc tcaaccagaa gactccttga ttcccttttt tgagataacc gtgcctgaat 11581 ctcagttaac tgtgtcccag ttcacgcttc caaaaagtgt ttcagatggc attgctgctt 11641 tggatctaaa tgcagtagcc aacaagatcg cagactttga gttgcccacc atcatcgtgc 11701 ctgagcagac cattgagatt ccctccatta agttctctgt acctgctgga attgtcattc 11761 cttcctttca agcactgact gcacgctttg aggtagactc tcccgtgtat aatgccactt 11821 ggagtgccag tttgaaaaac aaagcagatt atgttgaaac agtcctggat tccacatgca 11881 gctcaaccgt acagttccta gaatatgaac taaatgtttt gggaacacac aaaatcgaag 11941 atggtacgtt agcctctaag actaaaggaa cacttgcaca ccgtgacttc agtgcagaat 12001 atgaagaaga tggcaaattt gaaggacttc aggaatggga aggaaaagcg cacctcaata 12061 tcaaaagccc agcgttcacc gatctccatc tgcgctacca gaaagacaag aaaggcatct 12121 ccacctcagc agcctcccca gccgtaggca ccgtgggcat ggatatggat gaagatgacg 12181 acttttctaa atggaacttc tactacagcc ctcagtcctc tccagataaa aaactcacca 12241 tattcaaaac tgagttgagg gtccgggaat ctgatgagga aactcagatc aaagttaatt 12301 gggaagaaga ggcagcttct ggcttgctaa cctctctgaa agacaacgtg cccaaggcca 12361 caggggtcct ttatgattat gtcaacaagt accactggga acacacaggg ctcaccctga 12421 gagaagtgtc ttcaaagctg agaagaaatc tgcagaacaa tgctgagtgg gtttatcaag 12481 gggccattag gcaaattgat gatatcgacg tgaggttcca gaaagcagcc agtggcacca 12541 ctgggaccta ccaagagtgg aaggacaagg cccagaatct gtaccaggaa ctgttgactc 12601 aggaaggcca agccagtttc cagggactca aggataacgt gtttgatggc ttggtacgag 12661 ttactcaaaa attccatatg aaagtcaagc atctgattga ctcactcatt gattttctga 12721 acttccccag attccagttt ccggggaaac ctgggatata cactagggag gaactttgca 12781 ctatgttcat aagggaggta gggacggtac tgtcccaggt atattcgaaa gtccataatg 12841 gttcagaaat actgttttcc tatttccaag acctagtgat tacacttcct ttcgagttaa 12901 ggaaacataa actaatagat gtaatctcga tgtataggga actgttgaaa gatttatcaa 12961 aagaagccca agaggtattt aaagccattc agtctctcaa gaccacagag gtgctacgta 13021 atcttcagga ccttttacaa ttcattttcc aactaataga agataacatt aaacagctga 13081 aagagatgaa atttacttat cttattaatt atatccaaga tgagatcaac acaatcttca 13141 atgattatat cccatatgtt tttaaattgt tgaaagaaaa cctatgcctt aatcttcata 13201 agttcaatga atttattcaa aacgagcttc aggaagcttc tcaagagtta cagcagatcc 13261 atcaatacat tatggccctt cgtgaagaat attttgatcc aagtatagtt ggctggacag 13321 tgaaatatta tgaacttgaa gaaaagatag tcagtctgat caagaacctg ttagttgctc 13381 ttaaggactt ccattctgaa tatattgtca gtgcctctaa ctttacttcc caactctcaa 13441 gtcaagttga gcaatttctg cacagaaata ttcaggaata tcttagcatc cttaccgatc 13501 cagatggaaa agggaaagag aagattgcag agctttctgc cactgctcag gaaataatta 13561 aaagccaggc cattgcgacg aagaaaataa tttctgatta ccaccagcag tttagatata 13621 aactgcaaga tttttcagac caactctctg attactatga aaaatttatt gctgaatcca 13681 aaagattgat tgacctgtcc attcaaaact accacacatt tctgatatac atcacggagt 13741 tactgaaaaa gctgcaatca accacagtca tgaaccccta catgaagctt gctccaggag 13801 aacttactat catcctctaa ttttttaaaa gaaatcttca tttattcttc ttttccaatt 13861 gaactttcac atagcacaga aaaaattcaa actgcctata ttgataaaac catacagtga 13921 gccagccttg cagtaggcag tagactataa gcagaagcac atatgaactg gacctgcacc 13981 aaagctggca ccagggctcg gaaggtctct gaactcagaa ggatggcatt ttttgcaagt 14041 taaagaaaat caggatctga gttattttgc taaacttggg ggaggaggaa caaataaatg 14101 gagtctttat tgtgtatcat a // LOCUS HSLDLCP 2904 bp RNA PRI 15-NOV-1994 DEFINITION H.sapiens LDLC mRNA. ACCESSION Z34975 NID g575653 KEYWORDS LDLC gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2904) AUTHORS Podos,S.D., Reddy,P., Ashkenas,J. and Krieger,M. TITLE LDLC encodes a brefeldin A-sensitive, peripheral Golgi protein required for normal Golgi function JOURNAL J. Cell Biol. 127 (3), 679-691 (1994) MEDLINE 95050941 REFERENCE 2 (bases 1 to 2904) AUTHORS Podos,S.D. TITLE Direct Submission JOURNAL Submitted (04-JUL-1994) Steven D Podos, BIOLOGY, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA FEATURES Location/Qualifiers source 1..2904 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 96..2312 /codon_start=1 /product="ldlCp" /db_xref="PID:g575654" /translation="MEKSRMNLPKGPDTLCFDKDEFMKEDFDVDHFVSDCRKRVQLEE LRDDLELYYKLLKTAMVELINKDYADFVNLSTNLVGMDKALNQLSVPLGQLREEVLSL RSSVSEGIRAVDERMSKQEDIRKKKMCVLRLIQVIRSVEKIEKILNSQSSKETSALEA SSPLLTGQILERIATEFNQLQFHAVQSKGMPLLDKVRPRIAGITAMLQQSLEGLLLEG LQTSDVDIIRHCLRTYATIDKTRDAEALVGQVLVKPYIDEVIIEQFVESHPNGLQVMY NKLLEFVPHHCRLLREVTGGAISSEKGNTVPGYDFLVNSVWPQIVQGLEEKLPSLFNP GNPDAFHEKYTISMDFVRRLERQCGSQASVKRLRAHPAYHSFNKKWNLPVYFQIRFRE IAGSLEAALTDVLEDAPAESPYCLLASHRTWSSLRRCWSDEMFLPLLVHRLWRLTLQI LARYSVFVNELSLRPISNESPKEIKKPLVTGSKEPSITQGNTEDQGSGPSETKPVVSI SRTQLVYVVADLDKLQEQLPELLEIIKPKLEMIGFKNFSSISAALEDSQSSFSACVPS LSSKIIQDLSDSCFGFLKSALEVPRLYRRTNKEVPTTASSYVDSALKPLFQLQSGHKD KLKQAIIQQWLEGTLSESTHKYYETVSDVLNSVKKMEESLKRLKQARKTTPANPVGPS GGMSDDDKIRLQLALDVEYLGEQIQKLGLQASDIKSFSALAELVAAAKDQATAEQP" exon 1324..1475 BASE COUNT 834 a 639 c 688 g 743 t ORIGIN 1 ggaaactggc ggtggccgcg gccgccgagt cggtctgcgc atcctcctgc gttttctcgc 61 ttggatcttg gcactgagag gcggtggccg gcgggatgga gaaaagtagg atgaacctgc 121 ccaaggggcc ggacacgctc tgcttcgaca aggacgagtt catgaaggaa gatttcgatg 181 tcgatcattt tgtgtctgac tgtaggaagc gggtccagct ggaagaactg agagatgacc 241 tggagctcta ctataaactt cttaaaacag ccatggtcga actcatcaac aaggattatg 301 cagattttgt caatctttca acaaacttgg ttggcatgga caaagccctc aaccagcttt 361 ctgtgccttt gggacaatta cgagaagagg ttctgagcct tagatcgtct gtcagtgaag 421 gaattcgggc agttgatgaa cgaatgtcta aacaagagga cattaggaaa aaaaagatgt 481 gtgtattgag gcttatacaa gttattcggt cagttgagaa aattgaaaaa atcttaaact 541 ctcaaagttc taaagaaacc tctgcactag aagcaagcag cccccttttg actggacaaa 601 ttttggagag aattgccaca gaatttaatc agttacagtt tcatgctgtt caaagcaaag 661 gcatgcctct tttggacaaa gtaagaccgc gtatagctgg cattacagcc atgttacagc 721 agtcactgga aggtctccta ttagaaggcc ttcagacgtc tgacgtcgat ataatacggc 781 actgcttgcg gacttacgcc acgattgaca agacacggga cgcggaggcc ttagttggcc 841 aagtactagt gaaaccatac atagacgagg tgattataga gcagtttgtt gaatctcatc 901 ccaatggcct tcaggtcatg tataataaac tcctggagtt tgttcctcac cattgccgcc 961 ttcttcgaga agtcacagga ggtgccatct ccagtgaaaa aggcaatact gttcctggat 1021 atgacttttt ggtgaattct gtttggccac aaatagtaca aggattagaa gaaaagttac 1081 cctcgctttt taatcctggg aatcccgatg catttcatga gaaatatacc ataagtatgg 1141 attttgtcag aagattggaa cggcagtgtg gatcacaggc tagtgtaaag agattaagag 1201 cccatcctgc ctatcacagc ttcaataaga agtggaactt gcctgtttat tttcaaataa 1261 gatttagaga aatagcggga tccttagaag cagcacttac agatgtcctg gaagatgccc 1321 cagctgaaag tccgtattgc cttttggctt ctcatagaac ttggagcagc cttaggaggt 1381 gttggtcaga tgagatgttc ttgccattac tggtgcatcg cctgtggaga ctcactctgc 1441 agattttggc acgatactct gtgtttgtca atgagctttc actcaggccc atttctaatg 1501 aaagtcccaa ggagatcaag aaacctttgg taactggtag caaagaacct tccatcaccc 1561 aaggaaacac tgaagaccaa ggaagtggtc cttcggaaac aaagcctgtg gtttccattt 1621 cccgcactca gctcgtgtat gtggttgcag acctggacaa gcttcaggag cagcttccag 1681 aactcttgga aataatcaag ccaaaacttg aaatgattgg ctttaagaat ttttcttcta 1741 tctcagcagc cctggaggac tcccagagct ctttttcagc ctgtgtgccc tccttgagta 1801 gcaagatcat ccaggattta agtgactctt gcttcggttt cctaaaaagc gccctggagg 1861 ttcccaggct ttaccgaaga accaataagg aggtcccaac cacagcttcc tcctatgtgg 1921 acagtgctct gaagccctta ttccagcttc agagcggaca caaggataag ctcaaacaag 1981 caataattca gcagtggcta gaaggcactc tcagtgaaag cactcataag tactatgaaa 2041 ccgtgtcaga tgtattaaac tctgtgaaga agatggaaga gagcctgaaa aggctgaaac 2101 aagccagaaa aaccactccc gccaaccccg tcggtcccag tggtggcatg agcgacgacg 2161 acaaaatcag gctgcagttg gccctagatg ttgagtactt gggagagcag atacaaaagt 2221 tgggactaca agcaagtgac ataaaaagct tctcagctct cgcagagctt gttgctgctg 2281 ccaaggacca ggcaacagca gagcagcctt aagcatcttg gaagatcccg aggttagatt 2341 cttaagcaag agaagagttg gacttccagg ctgaagggga gaaagtgact ctgttctctt 2401 agcaaccgtc tgtagcaaag aagtgcttcc agcatcactc cagcaacacg cccatgcgtc 2461 ttctctcagc gtatttgggt cttctttgcc caaaagaaca caaaagcctt tttccattgt 2521 atggaagata gtttttaaga catttgaaac tttctactat agtttacaga acaaattatt 2581 ttatttttat tgtaaatctt agtgtggaag agctgatttc taaaatatga ttaaagtaaa 2641 tatataccta tgaatatcaa gagtcgtctc cctgagcctg tagttggaag tgacgactgt 2701 aatggaatga tgtcttgtat agaaatgccc ttctctgaaa taaagagaac tcctgggctt 2761 tctaaagagg ctgcgggaag ccatcctcca ctcccactgt gtgtgagagc agtgcttctg 2821 atcctgctgt caccccgacc tctggcagga gccggcgcca gtaggaaaga cctccttcct 2881 aaataaaaga agtgtctccc aaaa // LOCUS HSLDLRRL 14896 bp RNA PRI 18-APR-1996 DEFINITION Human mRNA for LDL-receptor related protein. ACCESSION X13916 NID g34338 KEYWORDS calcium binding protein; cell surface protein; lipoprotein receptor; low density lipoprotein receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 14896) AUTHORS Herz,J.J. TITLE Direct Submission JOURNAL Submitted (24-NOV-1988) Herz J.J., EMBL, Meyerhofstrasse 1, 6900 Heidelberg, FRG REFERENCE 2 (bases 1 to 14896) AUTHORS Herz,J., Hamann,U., Rogne,S., Myklebost,O., Gausepohl,H. and Stanley,K.K. TITLE Surface location and high affinity for calcium of a 500-kd liver membrane protein closely related to the LDL-receptor suggest a physiological role as lipoprotein receptor JOURNAL EMBO J. 7 (13), 4119-4127 (1988) MEDLINE 89210795 REFERENCE 3 (bases 1 to 14896) AUTHORS Myklebost,O., Arheden,K., Rogne,S., Geurts van Kessel,A., Mandahl,N., Herz,J., Stanley,K., Heim,S. and Mitelman,F. TITLE The gene for the human putative apoE receptor is on chromosome 12 in the segment q13-14 JOURNAL Genomics 5 (1), 65-69 (1989) MEDLINE 89357986 COMMENT Data kindly reviewed (16-MAY-1989) by Herz J. J. FEATURES Location/Qualifiers source 1..14896 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /clone_lib="HL9" /clone="LRP 1-9" /map="chromosome 12 q13-14" sig_peptide 467..523 /note="put. signal peptide (-19 to -1)" CDS 467..14101 /codon_start=1 /product="LDL-receptor related precursor (AA -19 to 4525)" /db_xref="PID:g34339" /translation="MLTPPLLLLLPLLSALVAAAIDAPKTCSPKQFACRDQITCISKG WRCDGERDCPDGSDEAPEICPQSKAQRCQPNEHNCLGTELCVPMSRLCNGVQDCMDGS DEGPHCRELQGNCSRLGCQHHCVPTLDGPTCYCNSSFQLQADGKTCKDFDECSVYGTC SQLCTNTDGSFICGCVEGYLLQPDNRSCKAKNEPVDRPPVLLIANSQNILATYLSGAQ VSTITPTSTRQTTAMDFSYANETVCWVHVGDSAAQTQLKCARMPGLKGFVDEHTINIS LSLHHVEQMAIDWLTGNFYFVDDIDDRIFVCNRNGDTCVTLLDLELYNPKGIALDPAM GKVFFTDYGQIPKVERCDMDGQNRTKLVDSKIVFPHGITLDLVSRLVYWADAYLDYIE VVDYEGKGRQTIIQGILIEHLYGLTVFENYLYATNSDNANAQQKTSVIRVNRFNSTEY QVVTRVDKGGALHIYHQRRQPRVRSHACENDQYGKPGGCSDICLLANSHKARTCRCRS GFSLGSDGKSCKKPEHELFLVYGKGRPGIIRGMDMGAKVPDEHMIPIENLMNPRALDF HAETGFIYFADTTSYLIGRQKIDGTERETILKDGIHNVEGVAVDWMGDNLYWTDDGPK KTISVARLEKAAQTRKTLIEGKMTHPRAIVVDPLNGWMYWTDWEEDPKDSRRGRLERA WMDGSHRDIFVTSKTVLWPNGLSLDIPAGRLYWVDAFYDRIETILLNGTDRKIVYEGP ELNHAFGLCHHGNYLFWTEYRSGSVYRLERGVGGAPPTVTLLRSERPPIFEIRMYDAQ QQQVGTNKCRVNNGGCSSLCLATPGSRQCACAEDQVLDADGVTCLANPSYVPPPQCQP GEFACANSRCIQERWKCDGDNDCLDNSDEAPALCHQHTCPSDRFKCENNRCIPNRWLC DGDNDCGNSEDESNATCSARTCPPNQFSCASGRCIPISWTCDLDDDCGDRSDESASCA YPTCFPLTQFTCNNGRCININWRCDNDNDCGDNSDEAGCSHSCSSTQFKCNSGRCIPE HWTCDGDNDCGDYSDETHANCTNQATRPPGGCHTDEFQCRLDGLCIPLRWRCDGDTDC MDSSDEKSCEGVTHVCDPSVKFGCKDSARCISKAWVCDGDNDCEDNSDEENCESLACR PPSHPCANNTSVCLPPDKLCDGNDDCGDGSDEGELCDQCSLNNGGCSHNCSVAPGEGI VCSCPLGMELGPDNHTCQIQSYCAKHLKCSQKCDQNKFSVKCSCYEGWVLEPDGESCR SLDPFKPFIIFSNRHEIRRIDLHKGDYSVLVPGLRNTIALDFHLSQSALYWTDVVEDK IYRGKLLDNGALTSFEVVIQYGLATPEGLAVDWIAGNIYWVESNLDQIEVAKLDGTLR TTLLAGDIEHPRAIALDPRDGILFWTDWDASLPRIEAASMSGAGRRTVHRETGSGGWP NGLTVDYLEKRILWIDARSDAIYSARYDGSGHMEVLRGHEFLSHPFAVTLYGGEVYWT DWRTNTLAKANKWTGHNVTVVQRTNTQPFDLQVYHPSRQPMAPNPCEANGGQGPCSHL CLINYNRTVSCACPHLMKLHKDNTTCYEFKKFLLYARQMEIRGVDLDAPYYNYIISFT VPDIDNVTVLDYDAREQRVYWSDVRTQAIKRAFINGTGVETVVSADLPNAHGLAVDWV SRNLFWTSYDTNKKQINVARLDGSFKNAVVQGLEQPHGLVVHPLRGKLYWTDGDNISM ANMDGSNRTLLFSGQKGPVGLAIDFPESKLYWISSGNHTINRCNLDGSGLEVIDAMRS QLGKATALAIMGDKLWWADQVSEKMGTCSKADGSGSVVLRNSTTLVMHMKVYDESIQL DHKGTNPCSVNNGDCSQLCLPTSETTRSCMCTAGYSLRSGQQACEGVGSFLLYSVHEG IRGIPLDPNDKSDALVPVSGTSLAVGIDFHAENDTIYWVDMGLSTISRAKRDQTWRED VVTNGIGRVEGIAVDWIAGNIYWTDQGFDVIEVARLNGSFRYVVISQGLDKPRAITVH PEKGYLFWTEWGQYPRIERSRLDGTERVVLVNVSISWPNGISVDYQDGKLYWCDARTD KIERIDLETGENREVVLSSNNMDMFSVSVFEDFIYWSDRTHANGSIKRGSKDNATDSV PLRTGIGVQLKDIKVFNRDRQKGTNVCAVANGGCQQLCLYRGRGQRACACAHGMLAED GASCREYAGYLLYSERTILKSIHLSDERNLNAPVQPFEDPEHMKNVIALAFDYRAGTS PGTPNRIFFSDIHFGNIQQINDDGSRRITIVENVGSVEGLAYHRGWDTLYWTSYTTST ITRHTVDQTRPGAFERETVITMSGDDHPRAFVLDECQNLMFWTNWNEQHPSIMRAALS GANVLTLIEKDIRTPNGLAIDHRAEKLYFSDATLDKIERCEYDGSHRYVILKSEPVHP FGLAVYGEHIFWTDWVRRAVQRANKHVGSNMKLLRVDIPQQPMGIIAVANDTNSCELS PCRINNGGCQDLCLLTHQGHVNCSCRGGRILQDDLTCRAVNSSCRAQDEFECANGECI NFSLTCDGVPHCKDKSDEKPSYCNSRRCKKTFRQCSNGRCVSNMLWCNGADDCGDGSD EIPCNKTACGVGEFRCRDGTCIGNSSRCNQFVDCEDASDEMNCSATDCSSYFRLGVKG VLFQPCERTSLCYAPSWVCDGANDCGDYSDERDCPGVKRPRCPLNYFACPSGRCIPMS WTCDKEDDCEHGEDETHCNKFCSEAQFECQNHRCISKQWLCDGSDDCGDGSDEAAHCE GKTCGPSSFSCPGTHVCVPERWLCDGDKDCADGADESIAAGCLYNSTCDDREFMCQNR QCIPKHFVCDHDRDCADGSDESPECEYPTCGPSEFRCANGRCLSSRQWECDGENDCHD QSDEAPKNPHCTSPEHKCNASSQFLCSSGRCVAEALLCNGQDDCGDSSDERGCHINEC LSRKLSGCSQDCEDLKIGFKCRCRPGFRLKDDGRTCADVDECSTTFPCSQRCINTHGS YKCLCVEGYAPRGGDPHSCKAVTDEEPFLIFANRYYLRKLNLDGSNYTLLKQGLNNAV ALDFDYREQMIYWTDVTTQGSMIRRMHLNGSNVQVLHRTGLSNPDGLAVDWVGGNLYW CDKGRDTIEVSKLNGAYRTVLVSSGLREPRALVVDVQNGYLYWTDWGDHSLIGRIGMD GSSRSVIVDTKITWPNGLTLDYVTERIYWADAREDYIEFASLDGSNRHVVLSQDIPHI FALTLFEDYVYWTDWETKSINRAHKTTGTNKTLLISTLHRPMDLHVFHALRQPDVPNH PCKVNNGGCSNLCLLSPGGGHKCACPTNFYLGSDGRTCVSNCTASQFVCKNDKCIPFW WKCDTEDDCGDHSDEPPDCPEFKCRPGQFQCSTGICTNPAFICDGDNDCQDNSDEANC DIHVCLPSQFKCTNTNRCIPGIFRCNGQDNCGDGEDERDCPEVTCAPNQFQCSITKRC IPRVWVCDRDNDCVDGSDEPANCTQMTCGVDEFRCKDSGRCIPARWKCDGEDDCGDGS DEPKEECDERTCEPYQFRCKNNRCVPGRWQCDYDNDCGDNSDEESCTPRPCSESEFSC ANGRCIAGRWKCDGDHDCADGSDEKDCTPRCDMDQFQCKSGHCIPLRWRCDADADCMD GSDEEACGTGVRTCPLDEFQCNNTLCKPLAWKCDGEDDCGDNSDENPEECARFVCPPN RPFRCKNDRVCLWIGRQCDGTDNCGDGTDEEDCEPPTAHTTHCKDKKEFLCRNQRCLS SSLRCNMFDDCGDGSDEEDCSIDPKLTSCATNASICGDEARCVRTEKAAYCACRSGFH TVPGQPGCQDINECLRFGTCSQLCNNTKGGHLCSCARNFMKTHNTCKAEGSEYQVLYI ADDNEIRSLFPGHPHSAYEQAFQGDESVRIDAMDVHVKAGRVYWTNWHTGTISYRSLP PAAPPTTSNRHRRQIDRGVTHLNISGLKMPRGIAIDWVAGNVYWTDSGRDVIEVAQMK GENRKTLISGMIDEPHAIVVDPLRGTMYWSDWGNHPKIETAAMDGTLRETLVQDNIQW PTGLAVDYHNERLYWADAKLSVIGSIRLNGTDPIVAADSKRGLSHPFSIDVFEDYIYG VTYINNRVFKIHKFGHSPLVNLTGGLSHASDVVLYHQHKQPEVTNPCDRKKCEWLCLL SPSGPVCTCPNGKRLDNGTCVPVPSPTPPPDAPRPGTCNLQCFNGGSCFLNARRQPKC RCQPRYTGDKCELDQCWEHCRNGGTCAASPSGMPTCRCPTGFTGPKCTQQVCAGYCAN NSTCTVNQGNQPQCRCLPGFLGDRCQYRQCSGYCENFGTCQMAADGSRQCRCTAYFEG SRCEVNKCSRCLEGACVVNKQSGDVTCNCTDGRVAPSCLTCVGHCSNGGSCTMNSKMM PECQCPPHMTGPRCEEHVFSQQQPGHIASILIPLLLLLLLVLVAGVVFWYKRRVQGAK GFQHQRMTNGAMNVEIGNPTYKMYEGGEPDDVGGLLDADFALDPDKPTNFTNPVYATL YMGGHGSRHSLASTDEKRELLGRGPEDEIGDPLA" mat_peptide 524..14098 /note="LDL-receptor related protein (AA 1-4525)" misc_feature 524..790 /note="LDL-receptor-like domain 1" misc_feature 3026..4012 /note="LDL-receptor-like domain 2" misc_feature 8036..9283 /note="LDL-receptor-like domain 3" misc_feature 10466..11794 /note="LDL-receptor-like domain 4" BASE COUNT 3192 a 4562 c 4395 g 2747 t ORIGIN 1 cagcggtgcg agctccaggc ccatgcactg aggaggcgga aacaagggga gcccccagag 61 ctccatcaag ccccctccaa aggctcccct acccggtcca cgccccccac cccccctccc 121 cgcctcctcc caattgtgca tttttgcagc cggaggcggc tccgagatgg ggctgtgagc 181 ttcgcccggg gagggggaaa gagcagcgag gagtgaagcg ggggggtggg gtgaagggtt 241 tggatttcgg ggcagggggc gcacccccgt cagcaggccc tccccaaggg gctcggaact 301 ctacctcttc acccacgccc ctggtgcgct ttgccgaagg aaagaataag aacagagaag 361 gaggaggggg aaaggaggaa aagggggacc ccccaactgg ggggggtgaa ggagagaagt 421 agcaggacca gaggggaagg ggctgctgct tgcatcagcc cacaccatgc tgaccccgcc 481 gttgctcctg ctgctgcccc tgctctcagc tctggtcgcg gcggctatcg acgcccctaa 541 gacttgcagc cccaagcagt ttgcctgcag agatcaaata acctgtatct caaagggctg 601 gcggtgcgac ggtgagaggg actgcccaga cggatctgac gaggcccctg agatttgtcc 661 acagagtaag gcccagcgat gccagccaaa cgagcataac tgcctgggta ctgagctgtg 721 tgttcccatg tcccgcctct gcaatggggt ccaggactgc atggacggct cagatgaggg 781 gccccactgc cgagagctcc aaggcaactg ctctcgcctg ggctgccagc accattgtgt 841 ccccacactc gatgggccca cctgctactg caacagcagc tttcagcttc aggcagatgg 901 caagacctgc aaagattttg atgagtgctc agtgtacggc acctgcagcc agctatgcac 961 caacacagac ggctccttca tatgtggctg tgttgaagga tacctcctgc agccggataa 1021 ccgctcctgc aaggccaaga acgagccagt agaccggccc cctgtgctgt tgatagccaa 1081 ctcccagaac atcttggcca cgtacctgag tggggcccag gtgtctacca tcacacctac 1141 gagcacgcgg cagaccacag ccatggactt cagctatgcc aacgagaccg tatgctgggt 1201 gcatgttggg gacagtgctg ctcagacgca gctcaagtgt gcccgcatgc ctggcctaaa 1261 gggcttcgtg gatgagcaca ccatcaacat ctccctcagt ctgcaccacg tggaacagat 1321 ggccatcgac tggctgacag gcaacttcta ctttgtggat gacatcgatg ataggatctt 1381 tgtctgcaac agaaatgggg acacatgtgt cacattgcta gacctggaac tctacaaccc 1441 caagggcatt gccctggacc ctgccatggg gaaggtgttt ttcactgact atgggcagat 1501 cccaaaggtg gaacgctgtg acatggatgg gcagaaccgc accaagctcg tcgacagcaa 1561 gattgtgttt cctcatggca tcacgctgga cctggtcagc cgccttgtct actgggcaga 1621 tgcctatctg gactatattg aagtggtgga ctatgagggc aagggccgcc agaccatcat 1681 ccagggcatc ctgattgagc acctgtacgg cctgactgtg tttgagaatt atctctatgc 1741 caccaactcg gacaatgcca atgcccagca gaagacgagt gtgatccgtg tgaaccgctt 1801 taacagcacc gagtaccagg ttgtcacccg ggtggacaag ggtggtgccc tccacatcta 1861 ccaccagagg cgtcagcccc gagtgaggag ccatgcctgt gaaaacgacc agtatgggaa 1921 gccgggtggc tgctctgaca tctgcctgct ggccaacagc cacaaggcgc ggacctgccg 1981 ctgccgttcc ggcttcagcc tgggcagtga cgggaagtca tgcaagaagc cggagcatga 2041 gctgttcctc gtgtatggca agggccggcc aggcatcatc cggggcatgg atatgggggc 2101 caaggtcccg gatgagcaca tgatccccat tgaaaacctc atgaaccccc gagccctgga 2161 cttccacgct gagaccggct tcatctactt tgccgacacc accagctacc tcattggccg 2221 ccagaagatt gatggcactg agcgggagac catcctgaag gacggcatcc acaatgtgga 2281 gggtgtggcc gtggactgga tgggagacaa tctgtactgg acggacgatg ggcccaaaaa 2341 gacaatcagc gtggccaggc tggagaaagc tgctcagacc cgcaagactt taatcgaggg 2401 caaaatgaca caccccaggg ctattgtggt ggatccactc aatgggtgga tgtactggac 2461 agactgggag gaggacccca aggacagtcg gcgtgggcgg ctggagaggg cgtggatgga 2521 tggctcacac cgagacatct ttgtcacctc caagacagtg ctttggccca atgggctaag 2581 cctggacatc ccggctgggc gcctctactg ggtggatgcc ttctacgacc gcatcgagac 2641 gatactgctc aatggcacag accggaagat tgtgtatgaa ggtcctgagc tgaaccacgc 2701 ctttggcctg tgtcaccatg gcaactacct cttctggact gagtatcgga gtggcagtgt 2761 ctaccgcttg gaacggggtg taggaggcgc accccccact gtgacccttc tgcgcagtga 2821 gcggcccccc atctttgaga tccgaatgta tgatgcccag cagcagcaag ttggcaccaa 2881 caaatgccgg gtgaacaatg gcggctgcag cagcctgtgc ttggccaccc ctgggagccg 2941 ccagtgcgcc tgtgctgagg accaggtgtt ggacgcagac ggcgtcactt gcttggcgaa 3001 cccatcctac gtgcctccac cccagtgcca gccaggcgag tttgcctgtg ccaacagccg 3061 ctgcatccag gagcgctgga agtgtgacgg agacaacgat tgcctggaca acagtgatga 3121 ggccccagcc ctctgccatc agcacacctg cccctcggac cgattcaagt gcgagaacaa 3181 ccggtgcatc cccaaccgct ggctctgcga cggggacaat gactgtggga acagtgaaga 3241 tgagtccaat gccacttgtt cagcccgcac ctgccccccc aaccagttct cctgtgccag 3301 tggccgctgc atccccatct cctggacgtg tgatctggat gacgactgtg gggaccgctc 3361 tgatgagtct gcttcgtgtg cctatcccac ctgcttcccc ctgactcagt ttacctgcaa 3421 caatggcaga tgtatcaaca tcaactggag atgcgacaat gacaatgact gtggggacaa 3481 cagtgacgaa gccggctgca gccactcctg ttctagcacc cagttcaagt gcaacagcgg 3541 gcgttgcatc cccgagcact ggacctgcga tggggacaat gactgcggag actacagtga 3601 tgagacacac gccaactgca ccaaccaggc cacgaggccc cctggtggct gccacactga 3661 tgagttccag tgccggctgg atggactatg catccccctg cggtggcgct gcgatgggga 3721 cactgactgc atggactcca gcgatgagaa gagctgtgag ggagtgaccc acgtctgcga 3781 tcccagtgtc aagtttggct gcaaggactc agctcggtgc atcagcaaag cgtgggtgtg 3841 tgatggcgac aatgactgtg aggataactc ggacgaggag aactgcgagt ccctggcctg 3901 caggccaccc tcgcaccctt gtgccaacaa cacctcagtc tgcctgcccc ctgacaagct 3961 gtgtgatggc aacgacgact gtggcgacgg ctcagatgag ggcgagctct gcgaccagtg 4021 ctctctgaat aacggtggct gcagccacaa ctgctcagtg gcacctggcg aaggcattgt 4081 gtgttcctgc cctctgggca tggagctggg gcccgacaac cacacctgcc agatccagag 4141 ctactgtgcc aagcatctca aatgcagcca aaagtgcgac cagaacaagt tcagcgtgaa 4201 gtgctcctgc tacgagggct gggtcctgga acctgacggc gagagctgcc gcagcctgga 4261 ccccttcaag ccgttcatca ttttctccaa ccgccatgaa atccggcgca tcgatcttca 4321 caaaggagac tacagcgtcc tggtgcccgg cctgcgcaac accatcgccc tggacttcca 4381 cctcagccag agcgccctct actggaccga cgtggtggag gacaagatct accgcgggaa 4441 gctgctggac aacggagccc tgactagttt cgaggtggtg attcagtatg gcctggccac 4501 acccgagggc ctggctgtag actggattgc aggcaacatc tactgggtgg agagtaacct 4561 ggatcagatc gaggtggcca agctggatgg gaccctccgg accaccctgc tggccggtga 4621 cattgagcac ccaagggcaa tcgcactgga tccccgggat gggatcctgt tttggacaga 4681 ctgggatgcc agcctgcccc gcattgaggc agcctccatg agtggggctg ggcgccgcac 4741 cgtgcaccgg gagaccggct ctgggggctg gcccaacggg ctcaccgtgg actacctgga 4801 gaagcgcatc ctttggattg acgccaggtc agatgccatt tactcagccc gttacgacgg 4861 ctctggccac atggaggtgc ttcggggaca cgagttcctg tcgcacccgt ttgcagtgac 4921 gctgtacggg ggggaggtct actggactga ctggcgaaca aacacactgg ctaaggccaa 4981 caagtggacc ggccacaatg tcaccgtggt acagaggacc aacacccagc cctttgacct 5041 gcaggtgtac cacccctccc gccagcccat ggctcccaat ccctgtgagg ccaatggggg 5101 ccagggcccc tgctcccacc tgtgtctcat caactacaac cggaccgtgt cctgcgcctg 5161 cccccacctc atgaagctcc acaaggacaa caccacctgc tatgagttta agaagttcct 5221 gctgtacgca cgtcagatgg agatccgagg tgtggacctg gatgctccct actacaacta 5281 catcatctcc ttcacggtgc ccgacatcga caacgtcaca gtgctagact acgatgcccg 5341 cgagcagcgt gtgtactggt ctgacgtgcg gacacaggcc atcaagcggg ccttcatcaa 5401 cggcacaggc gtggagacag tcgtctctgc agacttgcca aatgcccacg ggctggctgt 5461 ggactgggtc tcccgaaacc tgttctggac aagctatgac accaataaga agcagatcaa 5521 tgtggcccgg ctggatggct ccttcaagaa cgcagtggtg cagggcctgg agcagcccca 5581 tggccttgtc gtccaccctc tgcgtgggaa gctctactgg accgatggtg acaacatcag 5641 catggccaac atggatggca gcaatcgcac cctgctcttc agtggccaga agggccccgt 5701 gggcctggct attgacttcc ctgaaagcaa actctactgg atcagctccg ggaaccatac 5761 catcaaccgc tgcaacctgg atgggagtgg gctggaggtc atcgatgcca tgcggagcca 5821 gctgggcaag gccaccgccc tggccatcat gggggacaag ctgtggtggg ctgatcaggt 5881 gtcggaaaag atgggcacat gcagcaaggc tgacggctcg ggctccgtgg tccttcggaa 5941 cagcaccacc ctggtgatgc acatgaaggt ctatgacgag agcatccagc tggaccataa 6001 gggcaccaac ccctgcagtg tcaacaacgg tgactgctcc cagctctgcc tgcccacgtc 6061 agagacgacc cgctcctgca tgtgcacagc cggctatagc ctccggagtg gccagcaggc 6121 ctgcgagggc gtaggttcct ttctcctgta ctctgtgcat gagggaatca ggggaattcc 6181 cctggatccc aatgacaagt cagatgccct ggtcccagtg tccgggacct cgctggctgt 6241 cggcatcgac ttccacgctg aaaatgacac catctactgg gtggacatgg gcctgagcac 6301 gatcagccgg gccaagcggg accagacgtg gcgtgaagac gtggtgacca atggcattgg 6361 ccgtgtggag ggcattgcag tggactggat cgcaggcaac atctactgga cagaccaggg 6421 ctttgatgtc atcgaggtcg cccggctcaa tggctccttc cgctacgtgg tgatctccca 6481 gggtctagac aagccccggg ccatcaccgt ccacccggag aaagggtact tgttctggac 6541 tgagtggggt cagtatccgc gtattgagcg gtctcggcta gatggcacgg agcgtgtggt 6601 gctggtcaac gtcagcatca gctggcccaa cggcatctca gtggactacc aggatgggaa 6661 gctgtactgg tgcgatgcac ggacagacaa gattgaacgg atcgacctgg agacaggtga 6721 gaaccgcgag gtggttctgt ccagcaacaa catggacatg ttttcagtgt ctgtgtttga 6781 ggatttcatc tactggagtg acaggactca tgccaacggc tctatcaagc gcgggagcaa 6841 agacaatgcc acagactccg tgcccctgcg aaccggcatc ggcgtccagc ttaaagacat 6901 caaagtcttc aaccgggacc ggcagaaagg caccaacgtg tgcgcggtgg ccaatggcgg 6961 gtgccagcag ctgtgcctgt accggggccg tgggcagcgg gcctgcgcct gtgcccacgg 7021 gatgctggct gaagacggag catcgtgccg cgagtatgcc ggctacctgc tctactcaga 7081 gcgcaccatt ctcaagagta tccacctgtc ggatgagcgc aacctcaatg cgcccgtgca 7141 gcccttcgag gaccctgagc acatgaagaa cgtcatcgcc ctggcctttg actaccgggc 7201 aggcacctct ccgggcaccc ccaatcgcat cttcttcagc gacatccact ttgggaacat 7261 ccaacagatc aacgacgatg gctccaggag gatcaccatt gtggaaaacg tgggctccgt 7321 ggaaggcctg gcctatcacc gtggctggga cactctctat tggacaagct acacgacatc 7381 caccatcacg cgccacacag tggaccagac ccgcccaggg gccttcgagc gtgagaccgt 7441 catcactatg tctggagatg accacccacg ggccttcgtt ttggacgagt gccagaacct 7501 catgttctgg accaactgga atgagcagca tcccagcatc atgcgggcgg cgctctcggg 7561 agccaatgtc ctgaccctta tcgagaagga catccgtacc cccaatggcc tggccatcga 7621 ccaccgtgcc gagaagctct acttctctga cgccaccctg gacaagatcg agcggtgcga 7681 gtatgacggc tcccaccgct atgtgatcct aaagtcagag cctgtccacc ccttcgggct 7741 ggccgtgtat ggggagcaca ttttctggac tgactgggtg cggcgggcag tgcagcgggc 7801 caacaagcac gtgggcagca acatgaagct gctgcgcgtg gacatccccc agcagcccat 7861 gggcatcatc gccgtggcca acgacaccaa cagctgtgaa ctctctccat gccgaatcaa 7921 caacggtggc tgccaggacc tgtgtctgct cactcaccag ggccatgtca actgctcatg 7981 ccgagggggc cgaatcctcc aggatgacct cacctgccga gcggtgaatt cctcttgccg 8041 agcacaagat gagtttgagt gtgccaatgg cgagtgcatc aacttcagcc tgacctgcga 8101 cggcgtcccc cactgcaagg acaagtccga tgagaagcca tcctactgca actcccgccg 8161 ctgcaagaag actttccggc agtgcagcaa tgggcgctgt gtgtccaaca tgctgtggtg 8221 caacggggcc gacgactgtg gggatggctc tgacgagatc ccttgcaaca agacagcctg 8281 tggtgtgggc gagttccgct gccgggacgg gacctgcatc gggaactcca gccgctgcaa 8341 ccagtttgtg gattgtgagg acgcctcaga tgagatgaac tgcagtgcca ccgactgcag 8401 cagctacttc cgcctgggcg tgaagggcgt gctcttccag ccctgcgagc ggacctcact 8461 ctgctacgca cccagctggg tgtgtgatgg cgccaatgac tgtggggact acagtgatga 8521 gcgcgactgc ccaggtgtga aacgccccag atgccctctg aattacttcg cctgccctag 8581 tgggcgctgc atccccatga gctggacgtg tgacaaagag gatgactgtg aacatggcga 8641 ggacgagacc cactgcaaca agttctgctc agaggcccag tttgagtgcc agaaccatcg 8701 ctgcatctcc aagcagtggc tgtgtgacgg cagcgatgac tgtggggatg gctcagacga 8761 ggctgctcac tgtgaaggca agacgtgcgg cccctcctcc ttctcctgcc ctggcaccca 8821 cgtgtgcgtc cccgagcgct ggctctgtga cggtgacaaa gactgtgctg atggtgcaga 8881 cgagagcatc gcagctggtt gcttgtacaa cagcacttgt gacgaccgtg agttcatgtg 8941 ccagaaccgc cagtgcatcc ccaagcactt cgtgtgtgac cacgaccgtg actgtgcaga 9001 tggctctgat gagtcccccg agtgtgagta cccgacctgc ggccccagtg agttccgctg 9061 tgccaatggg cgctgtctga gctcccgcca gtgggagtgt gatggcgaga atgactgcca 9121 cgaccagagt gacgaggctc ccaagaaccc acactgcacc agcccagagc acaagtgcaa 9181 tgcctcgtca cagttcctgt gcagcagtgg gcgctgtgtg gctgaggcac tgctctgcaa 9241 cggccaggat gactgtggcg acagctcgga cgagcgtggc tgccacatca atgagtgtct 9301 cagccgcaag ctcagtggct gcagccagga ctgtgaggac ctcaagatcg gcttcaagtg 9361 ccgctgtcgc cctggcttcc ggctgaagga tgacggccgg acgtgtgctg atgtggacga 9421 gtgcagcacc accttcccct gcagccagcg ctgcatcaac acccatggca gctataagtg 9481 tctgtgtgtg gagggctatg caccccgcgg cggcgacccc cacagctgca aggctgtgac 9541 tgacgaggaa ccgtttctga tcttcgccaa ccggtactac ctgcgcaagc tcaacctgga 9601 cgggtccaac tacacgttac ttaagcaggg cctgaacaac gccgttgcct tggattttga 9661 ctaccgagag cagatgatct actggacaga tgtgaccacc cagggcagca tgatccgaag 9721 gatgcacctt aacgggagca atgtgcaggt cctacaccgt acaggcctca gcaaccccga 9781 tgggctggct gtggactggg tgggtggcaa cctgtactgg tgcgacaaag gccgggacac 9841 catcgaggtg tccaagctca atggggccta tcggacggtg ctggtcagct ctggcctccg 9901 tgagcccagg gctctggtgg tggatgtgca gaatgggtac ctgtactgga cagactgggg 9961 tgaccattca ctgatcggcc gcatcggcat ggatgggtcc agccgcagcg tcatcgtgga 10021 caccaagatc acatggccca atggcctgac gctggactat gtcactgagc gcatctactg 10081 ggccgacgcc cgcgaggact acattgaatt tgccagcctg gatggctcca atcgccacgt 10141 tgtgctgagc caggacatcc cgcacatctt tgcactgacc ctgtttgagg actacgtcta 10201 ctggaccgac tgggaaacaa agtccattaa ccgagcccac aagaccacgg gcaccaacaa 10261 aacgctcctc atcagcacgc tgcaccggcc catggacctg catgtcttcc atgccctgcg 10321 ccagccagac gtgcccaatc acccctgcaa ggtcaacaat ggtggctgca gcaacctgtg 10381 cctgctgtcc cccgggggag ggcacaaatg tgcctgcccc accaacttct acctgggcag 10441 cgatgggcgc acctgtgtgt ccaactgcac ggctagccag tttgtatgca agaacgacaa 10501 gtgcatcccc ttctggtgga agtgtgacac cgaggacgac tgcggggacc actcagacga 10561 gcccccggac tgccctgagt tcaagtgccg gcccggacag ttccagtgct ccacaggtat 10621 ctgcacaaac cctgccttca tctgcgatgg cgacaatgac tgccaggaca acagtgacga 10681 ggccaactgt gacatccacg tctgcttgcc cagtcagttc aaatgcacca acaccaaccg 10741 ctgtattccc ggcatcttcc gctgcaatgg gcaggacaac tgcggagatg gggaggatga 10801 gagggactgc cccgaggtga cctgcgcccc caaccagttc cagtgctcca ttaccaaacg 10861 gtgcatcccc cgggtctggg tctgcgaccg ggacaatgac tgtgtggatg gcagtgatga 10921 gcccgccaac tgcacccaga tgacctgtgg tgtggacgag ttccgctgca aggattcggg 10981 ccgctgcatc ccagcgcgtt ggaagtgtga cggagaggat gactgtgggg atggctcgga 11041 tgagcccaag gaagagtgtg atgaacgcac ctgtgagcca taccagttcc gctgcaagaa 11101 caaccgctgc gtgcccggcc gctggcagtg cgactacgac aacgattgcg gtgacaactc 11161 cgatgaagag agctgcaccc ctcggccctg ctccgagagt gagttctcct gtgccaacgg 11221 ccgctgcatc gcggggcgct ggaaatgcga tggagaccac gactgcgcgg acggctcgga 11281 cgagaaagac tgcacccccc gctgtgacat ggaccagttc cagtgcaaga gcggccactg 11341 catccccctg cgctggcgct gtgacgcaga cgccgactgc atggacggca gcgacgagga 11401 ggcctgcggc actggcgtgc ggacctgccc cctggacgag ttccagtgca acaacacctt 11461 gtgcaagccg ctggcctgga agtgcgatgg cgaggatgac tgtggggaca actcagatga 11521 gaaccccgag gagtgtgccc ggttcgtgtg ccctcccaac cggcccttcc gttgcaagaa 11581 tgaccgcgtc tgtctgtgga tcgggcgcca atgcgatggc acggacaact gtggggatgg 11641 gactgatgaa gaggactgtg agccccccac agcccacacc acccactgca aagacaagaa 11701 ggagtttctg tgccggaacc agcgctgcct ctcctcctcc ctgcgctgca acatgttcga 11761 tgactgcggg gacggctctg acgaggagga ctgcagcatc gaccccaagc tgaccagctg 11821 cgccaccaat gccagcatct gtggggacga ggcacgctgc gtgcgcaccg agaaagcggc 11881 ctactgtgcc tgccgctcgg gcttccacac cgtgcccggc cagcccggat gccaagacat 11941 caacgagtgc ctgcgcttcg gcacctgctc ccagctctgc aacaacacca agggcggcca 12001 cctctgcagc tgcgctcgga acttcatgaa gacgcacaac acctgcaagg ccgaaggctc 12061 tgagtaccag gtcctgtaca tcgctgatga caatgagatc cgcagcctgt tccccggcca 12121 cccccattcg gcttacgagc aggcattcca gggtgacgag agtgtccgca ttgatgctat 12181 ggatgtccat gtcaaggctg gccgtgtcta ttggaccaac tggcacacgg gcaccatctc 12241 ctaccgcagc ctgccacctg ctgcgcctcc taccacttcc aaccgccacc ggcgacagat 12301 tgaccggggt gtcacccacc tcaacatttc agggctgaag atgcccagag gcatcgccat 12361 cgactgggtg gccggaaacg tgtactggac cgactcgggc cgagatgtga ttgaggtggc 12421 gcagatgaag ggcgagaacc gcaagacgct catctcgggc atgattgacg agccccacgc 12481 cattgtggtg gacccactga gggggaccat gtactggtca gactggggca accaccccaa 12541 gattgagacg gcagcgatgg atgggacgct tcgggagaca ctggtgcagg acaacattca 12601 gtggcccaca ggcctggccg tggattatca caatgagcgg ctgtactggg cagacgccaa 12661 gctttcagtc atcggcagca tccggctcaa tggcacggac cccattgtgg ctgctgacag 12721 caaacgaggc ctaagtcacc ccttcagcat cgacgtcttt gaggattaca tctatggtgt 12781 cacctacatc aataatcgtg tcttcaagat ccataagttt ggccacagcc ccttggtcaa 12841 cctgacaggg ggcctgagcc acgcctctga cgtggtcctt taccatcagc acaagcagcc 12901 cgaagtgacc aacccatgtg accgcaagaa atgcgagtgg ctctgcctgc tgagccccag 12961 tgggcctgtc tgcacctgtc ccaatgggaa gcggctggac aacggcacat gcgtgcctgt 13021 gccctctcca acgccccccc cagatgctcc ccggcctgga acctgtaacc tgcagtgctt 13081 caacggtggc agctgtttcc tcaatgcacg gaggcagccc aagtgccgct gccaaccccg 13141 ctacacgggt gacaagtgtg aactggacca gtgctgggag cactgtcgca atgggggcac 13201 ctgtgctgcc tccccctctg gcatgcccac gtgccggtgc cccacgggct tcacgggccc 13261 caaatgcacc cagcaggtgt gtgcgggcta ctgtgccaac aacagcacct gcactgtcaa 13321 ccagggcaac cagccccagt gccgatgcct acccggcttc ctgggcgacc gctgccagta 13381 ccggcagtgc tctggctact gtgagaactt tggcacatgc cagatggctg ctgatggctc 13441 ccgacaatgc cgctgcactg cctactttga gggatcgagg tgtgaggtga acaagtgcag 13501 ccgctgtctc gaaggggcct gtgtggtcaa caagcagagt ggggatgtca cctgcaactg 13561 cacggatggc cgggtggccc ccagctgtct gacctgcgtc ggccactgca gcaatggcgg 13621 ctcctgtacc atgaacagca aaatgatgcc tgagtgccag tgcccacccc acatgacagg 13681 gccccggtgt gaggagcacg tcttcagcca gcagcagcca ggacatatag cctccatcct 13741 aatccctctg ctgttgctgc tgctgctggt tctggtggcc ggagtggtat tctggtataa 13801 gcggcgagtc caaggggcta agggcttcca gcaccaacgg atgaccaacg gggccatgaa 13861 cgtggagatt ggaaacccca cctacaagat gtacgaaggc ggagagcctg atgatgtggg 13921 aggcctactg gacgctgact ttgccctgga ccctgacaag cccaccaact tcaccaaccc 13981 cgtgtatgcc acactctaca tggggggcca tggcagtcgc cactccctgg ccagcacgga 14041 cgagaagcga gaactcctgg gccggggccc tgaggacgag ataggggacc ccttggcata 14101 gggccctgcc ccgtcggact gcccccagaa agcctcctgc cccctgccgg tgaagtcctt 14161 cagtgagccc ctccccagcc agcccttccc tggccccgcc ggatgtataa atgtaaaaat 14221 gaaggaatta cattttatat gtgagcgagc aagccggcaa gcgagcacag tattatttct 14281 ccatcccctc cctgcctgct ccttggcacc cccatgctgc cttcagggag acaggcaggg 14341 agggcttggg gctgcacctc ctaccctccc accagaacgc accccactgg gagagctggt 14401 ggtgcagcct tcccctccct gtataagaca ctttgccaag gctctcccct ctcgccccat 14461 ccctgcttgc ccgctcccac agcttcctga gggctaattc tgggaaggga gagttctttg 14521 ctgcccctgt ctggaagacg tggctctggg tgaggtaggc gggaaaggat ggagtgtttt 14581 agttcttggg ggaggccacc ccaaacccca gccccaactc caggggcacc tatgagatgg 14641 ccatgctcaa cccccctccc agacaggccc tccctgtctc cagggccccc accgaggttc 14701 ccagggctgg agacttcctc tggtaaacat tcctccagcc tcccctcccc tggggacgcc 14761 aaggaggtgg gccacaccca ggaagggaaa gcgggcagcc ccgttttggg gacgtgaacg 14821 ttttaataat ttttgctgaa ttctttacaa ctaaataaca cagatattct tataaataaa 14881 attgtaaaaa aaaaaa // LOCUS HSLFA3 1040 bp RNA PRI 23-MAR-1995 DEFINITION Human mRNA for lymphocyte function associated antigen-3 (LFA-3). ACCESSION Y00636 NID g34346 KEYWORDS cell surface glycoprotein; lymphocyte function associated antigen-3. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1040) AUTHORS Wallner,B.P. TITLE Direct Submission JOURNAL Submitted (31-AUG-1987) B.P. Wallner, Biogen Research Corp., 14 Cambridge Center, Cambridge, MA 02143 REFERENCE 2 (bases 1 to 1040) AUTHORS Wallner,B.P., Frey,A.Z., Tizard,R., Mattaliano,R.J., Hession,C., Sanders,M.E., Dustin,M.L. and Springer,T.A. TITLE Primary structure of lymphocyte function-associated antigen 3 (LFA-3). The ligand of the T lymphocyte CD2 glycoprotein JOURNAL J. Exp. Med. 166 (4), 923-932 (1987) MEDLINE 88009714 FEATURES Location/Qualifiers source 1..1040 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="human tonsil cDNA." /clone="lambdaLFA3HT16" misc_feature 1..9 /note="5' UT-region" sig_peptide 10..91 /note="signal peptide (AA -28 to -1)" CDS 10..762 /note="precursor polypeptide (AA -28 to 222)" /codon_start=1 /db_xref="PID:g34347" /db_xref="SWISS-PROT:P19256" /translation="MVAGSDAGRALGVLSVVCLLHCFGFISCFSQQIYGVVYGNVTFH VPSNVPLKEVLWKKQKDKVAELENSEFRAFSSFKNRVYLDTVSGSLTIYNLTSSDEDE YEMESPNITDTMKFFLYVLESLPSPTLTCALTNGSIEVQCMIPEHYNSHRGLIMYSWD CPMEQCKRNSTSIYFKMENDLPQKIQCTLSNPLFNTTSSIILTTCIPSSGHSRHRYAL IPIPLAVITTCIVLYMNGILKCDRKPDRTNSN" mat_peptide 92..759 /note="mature LFA-3 (AA 1-222)" misc_feature 760..1040 /note="3' UT-region" BASE COUNT 342 a 192 c 193 g 313 t ORIGIN 1 cgacgagcca tggttgctgg gagcgacgcg gggcgggccc tgggggtcct cagcgtggtc 61 tgcctgctgc actgctttgg tttcatcagc tgtttttccc aacaaatata tggtgttgtg 121 tatgggaatg taactttcca tgtaccaagc aatgtgcctt taaaagaggt cctatggaaa 181 aaacaaaagg ataaagttgc agaactggaa aattctgaat tcagagcttt ctcatctttt 241 aaaaataggg tttatttaga cactgtgtca ggtagcctca ctatctacaa cttaacatca 301 tcagatgaag atgagtatga aatggaatcg ccaaatatta ctgataccat gaagttcttt 361 ctttatgtgc ttgagtctct tccatctccc acactaactt gtgcattgac taatggaagc 421 attgaagtcc aatgcatgat accagagcat tacaacagcc atcgaggact tataatgtac 481 tcatgggatt gtcctatgga gcaatgtaaa cgtaactcaa ccagtatata ttttaagatg 541 gaaaatgatc ttccacaaaa aatacagtgt actcttagca atccattatt taatacaaca 601 tcatcaatca ttttgacaac ctgtatccca agcagcggtc attcaagaca cagatatgca 661 cttataccca taccattagc agtaattaca acatgtattg tgctgtatat gaatggtatt 721 ctgaaatgtg acagaaaacc agacagaacc aactccaatt gattggtaac agaagatgaa 781 gacaacagca taactaaatt attttaaaaa ctaaaaagcc atctgatttc tcatttgagt 841 attacaattt ttgaacaact gttggaaatg taacttgaag cagctgcttt aagaagaaat 901 acccactaac aaagaacaag cattagtttt ggctgtcatc aacttattat atgactaggt 961 gcttgctttt tttgtcagta aattgttttt actgatgatg tagatacttt tgtaaataaa 1021 tgtaaatatg tacacaagtg // LOCUS HSLH2MR 1300 bp RNA PRI 27-MAY-1997 DEFINITION Human L-H2 mRNA coding for an asialoglycoprotein receptor. ACCESSION X55283 NID g34354 KEYWORDS asialoglycoprotein receptor; cytoplasmic protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1300) AUTHORS Paietta,E. TITLE Direct Submission JOURNAL Submitted (25-OCT-1990) Paietta E., Montefiore Medical Centre, Department of Oncology, 111 East 210th Street, Bronx, NY 10467, U S A REFERENCE 2 (bases 1 to 1300) AUTHORS Paietta,E., Stockert,R.J. and Racevskis,J. TITLE Differences in the abundance of variably spliced transcripts for the second asialoglycoprotein receptor polypeptide, H2, in normal and transformed human liver JOURNAL Hepatology 15 (3), 395-402 (1992) MEDLINE 92184202 COMMENT See also for variant asialoglycoprotein receptor mRNA. Related sequence M11025. FEATURES Location/Qualifiers source 1..1300 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /cell_type="hepatocyte" /clone_lib="lambda-gt11" mRNA <1..>1300 /gene="L-H2" gene 1..1300 /gene="L-H2" mat_peptide 191..1051 /gene="L-H2" /product="asialoglycoprotein receptor" CDS 191..1054 /gene="L-H2" /codon_start=1 /product="asialoglycoprotein receptor" /db_xref="PID:g34355" /db_xref="SWISS-PROT:Q03969" /translation="MAKDFQDIQQLSSEENDHPFHQGPPPAQPLAQRLCSMVCFSLLA LSFNILLLVVICVTGSQSAQLQAELRSLKEAFSNFSSSTLTEVQAISTHGGSVGDKIT SLGAKLEKQQQDLKADHDALLFHLKHFPVDLRFVACQMELLHSNGSQRTCCPVNWVEH QGSCYWFSHSGKAWAEAEKYCQLENAHLVVINSWEEQKFIVQHTNPFNTWIGLTDSDG SWKWVDGTDYRHNYKNWAVTQPDNWHGHELGGSEDCVEVQPDGRWNDDFCLQVYRWVC EKRRNATGEVA" BASE COUNT 302 a 395 c 350 g 253 t ORIGIN 1 ctgcactctc ctctcccctg tgagctccac ctgccccagt tctcctggct ttaacccctc 61 cttggccaag gccagggttg cctgcgggag ccaggcgtcc gctctccaca cctttcacag 121 ccccagccct cagagcaacc tcagcccagc ccagcccagc tccagctcca gctccagccc 181 gggccccatc atggccaagg actttcaaga tatccagcag ctgagctcgg aggaaaatga 241 ccatcctttc catcaagggc cacctcctgc ccagcccctg gcacagcgtc tctgctccat 301 ggtctgcttc agtctgcttg ccctgagctt caacatcctg ctgctggtgg tcatctgtgt 361 gactgggtcc caaagtgcac agctgcaagc cgagctgcgg agcctgaagg aagctttcag 421 caacttctcc tcgagcaccc tgacggaggt ccaggcaatc agcacccacg gaggcagcgt 481 gggtgacaag atcacatccc taggagccaa gctggagaaa cagcagcagg acctgaaagc 541 agatcacgat gccctgctct tccatctgaa gcacttcccc gtggacctgc gcttcgtggc 601 ctgccagatg gagctcctcc acagcaacgg ctcccaaagg acctgctgcc ccgtcaactg 661 ggtggagcac caaggcagct gctactggtt ctctcactcc gggaaggcct gggctgaggc 721 ggagaagtac tgccagctgg agaacgcaca cctggtggtc atcaactcct gggaggagca 781 gaaattcatt gtacaacaca cgaacccctt caatacctgg ataggtctca cggacagtga 841 tggctcttgg aaatgggtgg atggcacaga ctataggcac aactacaaga actgggctgt 901 cactcagcca gataattggc acgggcacga gctgggtgga agtgaagact gtgttgaagt 961 ccagccggat ggccgctgga acgatgactt ctgcctgcag gtgtaccgct gggtgtgtga 1021 gaaaaggcgg aatgccaccg gcgaggtggc ctgaccccag cacacctctg gctaacccat 1081 accccacacc tgcccagctc tggcttctct gttgaggatt ttgaggaaag gaagaaacac 1141 tgagacaggg gtatggggaa gagctgagca aagagagaaa ggaggtagtt taagagtccc 1201 tgaccctgga ggactgagat cccacctcct tctgtaattc attgtaatta ttataatcgt 1261 cagcctcttc aatggcgtag gaaagaagaa acaaatgctt // LOCUS HSLIFR 3591 bp RNA PRI 20-APR-1994 DEFINITION H.sapiens mRNA for leukemia inhibitory factor (LIF) receptor. ACCESSION X61615 NID g34365 KEYWORDS leukemia-inhibitory factor receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3591) AUTHORS Gearing,D.P., Thut,C.J., VandeBos,T., Gimpel,S.D., Delaney,P.B., King,J., Price,V., Cosman,D. and Beckmann,M.P. TITLE Leukemia inhibitory factor receptor is structurally related to the IL-6 signal transducer, gp130 JOURNAL EMBO J. 10 (10), 2839-2848 (1991) MEDLINE 92007727 FEATURES Location/Qualifiers source 1..3591 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 179..310 /note="leukemia inhibitory factor receptor" CDS 179..3472 /codon_start=1 /product="leukemia inhibitory factor receptor" /db_xref="PID:g34366" /db_xref="SWISS-PROT:P42702" /translation="MMDIYVCLKRPSWMVDNKRMRTASNFQWLLSTFILLYLMNQVNS QKKGAPHDLKCVTNNLQVWNCSWKAPSGTGRGTDYEVCIENRSRSCYQLEKTSIKIPA LSHGDYEITINSLHDFGSSTSKFTLNEQNVSLIPDTPEILNLSADFSTSTLYLKWNDR GSVFPHRSNVIWEIKVLRKESMELVKLVTHNTTLNGKDTLHHWSWASDMPLECAIHFV EIRCYIDNLHFSGLEEWSDWSPVKNISWIPDSQTKVFPQDKVILVGSDITFCCVSQEK VLSALIGHTNCPLIHLDGENVAIKIRNISVSASSGTNVVFTTEDNIFGTVIFAGYPPD TPQQLNCETHDLKEIICSWNPGRVTALVGPRATSYTLVESFSGKYVRLKRAEAPTNES YQLLFQMLPNQEIYNFTLNAHNPLGRSQSTILVNITEKVYPHTPTSFKVKDINSTAVK LSWHLPGNFAKINFLCEIEIKKSNSVQEQRNVTIKGVENSSYLVALDKLNPYTLYTFR IRCSTETFWKWSKWSNKKQHLTTEASPSKGPDTWREWSSDGKNLIIYWKPLPINEANG KILSYNVSCSSDEETQSLSEIPDPQHKAEIRLDKNDYIISVVAKNSVGSSPPSKIASM EIPNDDLKIEQVVGMGKGILLTWHYDPNMTCDYVIKWCNSSRSEPCLMDWRKVPSNST ETVIESDEFRPGIRYNFFLYGCRNQGYQLLRSMIGYIEELAPIVAPNFTVEDTSADSI LVKWEDIPVEELRGFLRGYLFYFGKGERDTSKMRVLESGRSDIKVKNITDISQKTLRI ADLQGKTSYHLVLRAYTDGGVGPEKSMYVVTKENSVGLIIAILIPVAVAVIVGVVTSI LCYRKREWIKETFYPDIPNPENCKALQFQKSVCEGSSALKTLEMNPCTPNNVEVLETR SAFPKIEDTEIISPVAERPEDRSDAEPENHVVVSYCPPIIEEEIPNPAADEAGGTAQV IYIDVQSMYQPQAKPEEEQENDPVGGAGYKPQMHLPINSTVEDIAAEEDLDKTAGYRP QANVNTWNLVSPDSPRSIDSNSEIVSFGSPCSINSRQFLIPPKDEDSPKSNGGGWSFT NFFQNKPND" mat_peptide 311..3469 /product="leukemia inhibitory factor receptor" BASE COUNT 1145 a 696 c 736 g 1014 t ORIGIN 1 agatcttgga acgagacgac ctgctctctc tcccagaacg tgtctctgct gcaaggcacc 61 gggccctttc gctctgcaga actgcacttg caagaccatt atcaactcct aatcccagct 121 cagaaaggga gcctctgcga ctcattcatc gccctccagg actgactgca ttgcacagat 181 gatggatatt tacgtatgtt tgaaacgacc atcctggatg gtggacaata aaagaatgag 241 gactgcttca aatttccagt ggctgttatc aacatttatt cttctatatc taatgaatca 301 agtaaatagc cagaaaaagg gggctcctca tgatttgaag tgtgtaacta acaatttgca 361 agtgtggaac tgttcttgga aagcaccctc tggaacaggc cgtggtactg attatgaagt 421 ttgcattgaa aacaggtccc gttcttgtta tcagttggag aaaaccagta ttaaaattcc 481 agctctttca catggtgatt atgaaataac aataaattct ctacatgatt ttggaagttc 541 tacaagtaaa ttcacactaa atgaacaaaa cgtttcctta attccagata ctccagagat 601 cttgaatttg tctgctgatt tctcaacctc tacattatac ctaaagtgga acgacagggg 661 ttcagttttt ccacaccgct caaatgttat ctgggaaatt aaagttctac gtaaagagag 721 tatggagctc gtaaaattag tgacccacaa cacaactctg aatggcaaag atacacttca 781 tcactggagt tgggcctcag atatgccctt ggaatgtgcc attcattttg tggaaattag 841 atgctacatt gacaatcttc atttttctgg tctcgaagag tggagtgact ggagccctgt 901 gaagaacatt tcttggatac ctgattctca gactaaggtt tttcctcaag ataaagtgat 961 acttgtaggc tcagacataa cattttgttg tgtgagtcaa gaaaaagtgt tatcagcact 1021 gattggccat acaaactgcc ccttgatcca tcttgatggg gaaaatgttg caatcaagat 1081 tcgtaatatt tctgtttctg caagtagtgg aacaaatgta gtttttacaa ccgaagataa 1141 catatttgga accgttattt ttgctggata tccaccagat actcctcaac aactgaattg 1201 tgagacacat gatttaaaag aaattatatg tagttggaat ccaggaaggg tgacagcgtt 1261 ggtgggccca cgtgctacaa gctacacttt agttgaaagt ttttcaggaa aatatgttag 1321 acttaaaaga gctgaagcac ctacaaacga aagctatcaa ttattatttc aaatgcttcc 1381 aaatcaagaa atatataatt ttactttgaa tgctcacaat ccgctgggtc gatcacaatc 1441 aacaatttta gttaatataa ctgaaaaagt ttatccccat actcctactt cattcaaagt 1501 gaaggatatt aattcaacag ctgttaaact ttcttggcat ttaccaggca actttgcaaa 1561 gattaatttt ttatgtgaaa ttgaaattaa gaaatctaat tcagtacaag agcagcggaa 1621 tgtcacaatc aaaggagtag aaaattcaag ttatcttgtt gctctggaca agttaaatcc 1681 atacactcta tatacttttc ggattcgttg ttctactgaa actttctgga aatggagcaa 1741 atggagcaat aaaaaacaac atttaacaac agaagccagt ccttcaaagg ggcctgatac 1801 ttggagagag tggagttctg atggaaaaaa tttaataatc tattggaagc ctttacccat 1861 taatgaagct aatggaaaaa tactttccta caatgtatcg tgttcatcag atgaggaaac 1921 acagtccctt tctgaaatcc ctgatcctca gcacaaagca gagatacgac ttgataagaa 1981 tgactacatc atcagcgtag tggctaaaaa ttctgtgggc tcatcaccac cttccaaaat 2041 agcgagtatg gaaattccaa atgatgatct caaaatagaa caagttgttg ggatgggaaa 2101 ggggattctc ctcacctggc attacgaccc caacatgact tgcgactacg tcattaagtg 2161 gtgtaactcg tctcggtcgg aaccatgcct tatggactgg agaaaagttc cctcaaacag 2221 cactgaaact gtaatagaat ctgatgagtt tcgaccaggt ataagatata attttttcct 2281 gtatggatgc agaaatcaag gatatcaatt attacgctcc atgattggat atatagaaga 2341 attggctccc attgttgcac caaattttac tgttgaggat acttctgcag attcgatatt 2401 agtaaaatgg gaagacattc ctgtggaaga acttagaggc tttttaagag gatatttgtt 2461 ttactttgga aaaggagaaa gagacacatc taagatgagg gttttagaat caggtcgttc 2521 tgacataaaa gttaagaata ttactgacat atcccagaag acactgagaa ttgctgatct 2581 tcaaggtaaa acaagttacc acctggtctt gcgagcctat acagatggtg gagtgggccc 2641 ggagaagagt atgtatgtgg tgacaaagga aaattctgtg ggattaatta ttgccattct 2701 catcccagtg gcagtggctg tcattgttgg agtggtgaca agtatccttt gctatcggaa 2761 acgagaatgg attaaagaaa ccttctaccc tgatattcca aatccagaaa actgtaaagc 2821 attacagttt caaaagagtg tctgtgaggg aagcagtgct cttaaaacat tggaaatgaa 2881 tccttgtacc ccaaataatg ttgaggttct ggaaactcga tcagcatttc ctaaaataga 2941 agatacagaa ataatttccc cagtagctga gcgtcctgaa gatcgctctg atgcagagcc 3001 tgaaaaccat gtggttgtgt cctattgtcc acccatcatt gaggaagaaa taccaaaccc 3061 agccgcagat gaagctggag ggactgcaca ggttatttac attgatgttc agtcgatgta 3121 tcagcctcaa gcaaaaccag aagaagaaca agaaaatgac cctgtaggag gggcaggcta 3181 taagccacag atgcacctcc ccattaattc tactgtggaa gatatagctg cagaagagga 3241 cttagataaa actgcgggtt acagacctca ggccaatgta aatacatgga atttagtgtc 3301 tccagactct cctagatcca tagacagcaa cagtgagatt gtctcatttg gaagtccatg 3361 ctccattaat tcccgacaat ttttgattcc tcctaaagat gaagactctc ctaaatctaa 3421 tggaggaggg tggtccttta caaacttttt tcagaacaaa ccaaacgatt aacagtgtca 3481 ccgtgtcact tcagtcagcc atctcaataa gctcttactg ctagtgttgc tacatcagca 3541 ctgggcattc ttggagggat cctgtgaagt attgttagga ggtgaacttc a // LOCUS HSLIGX1 2284 bp RNA PRI 13-JAN-1997 DEFINITION H.sapiens mRNA for ligase like protein, X-1. ACCESSION X98266 NID g1770450 KEYWORDS ligase; X-1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2284) AUTHORS Matsumoto-Taniura,N., Pirollet,F., Monroe,R., Gerace,L. and Westendorf,J.M. TITLE Identification of novel M phase phosphoproteins by expression cloning JOURNAL Mol. Biol. Cell 7 (9), 1455-1469 (1996) MEDLINE 97039687 REFERENCE 2 (bases 1 to 2284) AUTHORS Westendorf,J.M. TITLE Direct Submission JOURNAL Submitted (03-JUN-1996) J.M. Westendorf, INSERM U366, DBMS/CS-CENG, 17 rue des Martyrs, F- 38054 Grenoble Cedex 9, FRANCE FEATURES Location/Qualifiers source 1..2284 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="female" /dev_stage="31 years old" /tissue_type="cervix" /cell_type="epithelial" /cell_line="HeLa" /clone_lib="lambda gt11" /clone="4" /clone="3" /clone="16" gene 1..681 /gene="X-1" CDS <1..681 /gene="X-1" /codon_start=1 /db_xref="PID:e248484" /db_xref="PID:g1770451" /translation="DAKYQPDDPEAIDPMTDFLGTTALMSDLDDFEEGDGAVTLMTLH AAKGLEFPVVFLVGMEEGIFPLSRALMAEDQLEEERRLAYVGITRAMKKLFLTNAFSR LLYGRTQSNEASRFINEISPALLETEYGQNTRVTPRRDLPFDRKNQRAQATTYRATPV TKTTSGTTGGDQITWAPGDKVSHKKWGIGTVVAVSGDRSDQELKVAFPSEGVKQLLAA FAPITKVN" CDS 722..2086 /codon_start=1 /product="ligase-like protein" /db_xref="PID:e248508" /db_xref="PID:g1770452" /translation="MLPEFTPIEQLSTSAAQQEVAQLQADLTEYGVAYYEQDAPLVED HVYDALYARLVALESAFPQFVTQDSPTQNVGGALTKSGLAKVEHPAPMLSLGDVFSLE ELAAWDERTTKSLGFQSPYNLELKIDGLAVALTYVDGRLVQASTRGNGTIGEDVTRNV KTIKAIPQQLTEPLTIEVRGEIYMPKKSFAALNVQREADGLEPFANPRNAAAGSLRQL DAQVTKQRELSAFVYYTAEPEVLGVTTQSGALARFAELGLPTDTHNQVIEKMADIADF IATYTAKRDTLAYGIDGVVVKVNALDNQVELGNTVKIPRWAIAYKFPPEEALTVVRDI EWTVGRTGAVTPTAIMDPVQLAGTTVQRASLHNPDYLQQKDIRIGDTVTLHKAGDIIP EVGQVILSQRPADSIAYPIPVACPACGSELVHISGEVALRCINPFCAAQIQEGLTHLL HVMR" BASE COUNT 619 a 484 c 585 g 596 t ORIGIN 1 gatgccaagt atcaaccaga tgatccagaa gccattgatc cgatgacgga cttcttgggc 61 accactgctt taatgagtga tttggatgac tttgaagaag gcgacggcgc ggtgacattg 121 atgacgttac atgcagccaa aggcttggag ttccctgttg tctttttggt gggcatggaa 181 gaaggcatct tccccttgtc acgcgcgttg atggctgaag atcagcttga agaagaacga 241 cgcttggcgt atgttgggat tacccgcgcc atgaaaaaac tgttcttaac caatgccttt 301 tcgcgtttgt tgtatggtcg aacacaatct aatgaggctt ctcgctttat taatgaaatt 361 tctccagcct tacttgagac cgaatatggt caaaatacac gggtaacacc acgtcgtgat 421 ttgccgtttg atcggaaaaa ccagcgcgcg caagcaacaa cttatcgtgc gaccccagtg 481 actaaaacaa ctagtggcac gactggtggg gatcaaatca cgtgggcgcc aggcgataaa 541 gtctctcaca aaaagtgggg gattgggaca gtggttgccg ttagtggtga tcgttcggac 601 caagaactta aagtggcctt cccatcggaa ggggtgaagc agttactggc cgcctttgca 661 ccgattacaa aggttaacta accaagtaag tgcgacttta atatgagatt aggaaaatag 721 gatgttacca gaatttacac caattgaaca attgtctacg tcagcagcac aacaagaagt 781 tgcccagttg caagctgatc tcactgaata tggggtcgcc tactatgaac aagatgcccc 841 gctggttgaa gatcatgtct atgatgcgct ttatgcgcga ttagtggcgc tagaatcagc 901 ttttccacag tttgtgacgc aagattcccc aacccaaaat gttggtggtg ccttgacaaa 961 atctggctta gccaaagtgg aacatccggc accaatgtta tctttggggg atgtttttag 1021 cttagaagag ttagcagctt gggatgaacg gacgaccaaa agtctcggtt ttcaatcgcc 1081 ttataacctt gaattaaaaa ttgatgggtt agcggtggcg ttgacttatg ttgatggccg 1141 gcttgtccaa gcgtcaaccc gtgggaatgg gacaattggc gaggatgtca cgcgtaacgt 1201 caaaacgatt aaggctattc cacaacaatt aacggaaccc ttaaccattg aagtacgtgg 1261 cgaaatttat atgcctaaaa agagttttgc cgctttgaat gtacaacgtg aagccgatgg 1321 cttggaaccg tttgctaatc ctagaaatgc ggcggctggc tcattgcggc aacttgacgc 1381 acaggtgacg aaacaacgcg aattatcagc ctttgtgtac tatacggctg aaccagaggt 1441 acttggtgtt actacccaaa gtggcgcgtt ggcccgtttt gccgagcttg gtttgccgac 1501 tgatacgcat aatcaagtga ttgaaaaaat ggctgatatt gctgatttta tcgccaccta 1561 tacagctaaa cgtgacacgt tggcctatgg aattgacggg gtggtggtga aagtcaatgc 1621 gctggataat caagtggagc ttggtaatac ggtgaaaatt ccgcgctggg cgattgccta 1681 taagttccca ccagaagaag ccttaacggt ggttcgcgat attgaatgga ccgtgggtcg 1741 gacaggtgcc gtcacaccaa cggctatcat ggatcctgtt caactagctg gaaccacagt 1801 gcaacgtgcg agtttgcaca atcctgatta tttgcaacaa aaagatattc gcatcgggga 1861 tacggtcacg ttgcataagg ctggggatat tattccggaa gttggccaag tgattttatc 1921 tcagcgacca gccgatagta tcgcatatcc gataccagtt gcctgtccag cgtgtgggtc 1981 agaattagtc cacatctcag gcgaagtggc gttacgttgt attaaccctt tctgtgcggc 2041 acaaatccaa gaagggttga cgcatttgct tcacgtaatg cgatgaatat cgccggtatg 2101 gggccacgcg tgattgacca attattaaaa gccaactata tcaaggatgt ggcgtcaatt 2161 tatcgcttag atatgacgca attattgtct ctggataagt ttcaagagaa atcagcggct 2221 aatttgttag ccgctattac gcaatcaaaa gaaaattcgt tagaacggtt attatttggt 2281 ctag // LOCUS HSLIMDP 3614 bp RNA PRI 04-SEP-1997 DEFINITION H.sapiens mRNA for ZNF185 gene. ACCESSION Y09538 NID g2370125 KEYWORDS LIM-domain protein; ZNF185 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3614) AUTHORS Heiss,N.S., Gloeckner,G., Bachner,D., Kioschis,P., Klauck,S.M., Hinzmann,B., Rosenthal,A., Herman,G.E. and Poustka,A. TITLE Genomic structure of a novel LIM domain gene (ZNF185) in Xq28 and comparisons with the orthologous murine transcript JOURNAL Genomics 43 (3), 329-338 (1997) MEDLINE 97422610 REFERENCE 2 (bases 1 to 3614) AUTHORS Heiss,N.S. TITLE Direct Submission JOURNAL Submitted (20-NOV-1996) N.S. Heiss, Deutsches Krebsforschungszentrum (DKFZ), Department of Molecular Genome Analysis, Im Neuenheimer Feld 280, 69120 Heidelberg, 69120, FRG FEATURES Location/Qualifiers source 1..3614 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Xq28" gene 41..1399 /gene="ZNF185" CDS 41..1399 /gene="ZNF185" /codon_start=1 /product="LIM-domain protein" /db_xref="PID:e315426" /db_xref="PID:g2370126" /translation="MQRQAPYNIRRSSTSGDTEEEEEEEVVPFSSDEQKRRSEAASGV LRRTAPREHSYVLSAAKKSTGSPTQETQAPFIAKRVEVVEEDGPSEKSQDPPALARST PGSNSSRGEEIVRLQILTPRAGLRLVAPDVEGMSSSATSVSAVPADRKSNSTAAQEDA KADPKGALADCEGKDVPTRVGEAWQERPGAPRGGQGDPAVPAQQPADPSTPERQSSPS GSEQLVRRESCGSSVLTDFEGKDVATKVGEAWQDRPRAPRGGQGDPAVPTQQPADPST PEQQNSPSGSEQFVRRESCTSRVRSPSSCMVTVTVTATSEQPHIYIPAPASELDSSST TKGILFVKEYVNASEVSSGKPVSARYSNVSSIEDSFAMEKKPPCGSTPYSERTTGGIC TYCNREIRDCPKITLEHLGICCHEYCFKCGICSKPMGDLLDQIFIHRDTIHCGKCYEK LF" BASE COUNT 941 a 888 c 900 g 885 t ORIGIN 1 gctacaagat gaccactgag gattacaaga agctgtgagt atgcaacgcc aggcacccta 61 caatatcagg cgcagctcta catcagggga caccgaggag gaggaggagg aggaggtggt 121 gccattctcc tcagatgaac agaaacggag gtcagaggct gcaagcggtg ttctgaggag 181 gacagctccc cgggagcact cctacgtcct gtcagcggcc aagaagagca ctggcagtcc 241 tacccaggag acacaggcac cgtttatcgc gaagagggtg gaggtggtgg aagaggacgg 301 gccttctgag aagagccagg acccacctgc tctggcaaga tccactcctg gctcaaacag 361 ctcaagaggt gaggaaattg tccgcctgca gatcctgaca cccagggcag gactccgcct 421 ggtggcccca gacgtggaag gcatgagctc cagtgccact tcagtctctg ctgtccctgc 481 tgataggaag agcaacagca cagcagccca ggaggatgca aaggcagacc caaagggggc 541 cttggctgat tgtgagggga aggatgtacc caccagggtc ggagaggcct ggcaggagag 601 gcctggagct ccaagaggtg gccaaggaga cccagctgta cccgctcagc aacctgcaga 661 tcccagcacc ccagagcggc agagcagccc cagcggatct gagcaacttg tcagacgaga 721 gagttgtggc agtagcgtgt tgactgattt tgaggggaag gatgtggcca ccaaggtcgg 781 agaggcctgg caggacaggc ctagagcccc aagaggtggc caaggagacc cagctgtacc 841 cactcagcaa cctgcagatc ccagtacccc agaacagcag aacagcccca gcggatctga 901 gcaattcgtc agacgagaga gctgcaccag cagggtgagg agcccctcga gctgcatggt 961 cactgttact gtcactgcca catctgagca gcctcacatt tatattccag cccccgcaag 1021 tgaattggac tccagctcta ccaccaaagg gattctcttc gtgaaggagt acgtgaatgc 1081 tagtgaagtg tcttctggga agccagtatc tgcacgctat agcaacgtca gcagcattga 1141 ggactcattc gccatggaga agaagcctcc atgtggcagc actccatact ctgagaggac 1201 aactggaggg atctgtactt actgcaaccg tgagatccga gactgtccaa agattaccct 1261 agaacatctt ggtatctgct gccatgaata ttgctttaag tgtgggattt gcagtaaacc 1321 gatgggcgat ctcctggatc agatcttcat tcaccgtgac accattcact gtgggaaatg 1381 ctatgagaag ctcttctagc gaccccccac cgccaggctg atcagaagct gatgactcgt 1441 ggacaaattt ggctgtcccc agttttgccc caagttgctg tctccccttc cctcacctcc 1501 tccctccctg tttgatttct tcatgctttt gcccttctca agttgaagtt gcatacatcc 1561 aatatcgtat cttaatgatg ctatgataat tgcttgtgtg tgtagcttct tgtagcttag 1621 aaagcgcttt atgcccatga tgtcatttca ggctcaacca aagaggatca aacaggaatt 1681 ccatcttggc ttccctaaga cagattggct ttctaatgag tttaagtggg cagaagtgta 1741 gggttcagtg tgtcctgact cccttgaggc ttataatggg ccaagttgaa gactgttgat 1801 gatccctggt gggtaaattg cagacatcaa atgctaggga ttggcatagg ctagtgttta 1861 gcttgtctat ttgccatatc tattttttta aatttccata cacttgtaaa agtagttagt 1921 tgcttttgat tgagttatat agcagttttt catttggtct tccactcacc gttcactata 1981 tttgagtgtt cccttacagg tatgttggca tgtgttggaa aatttacaca attaggttta 2041 aattcagtag gatgtgattt tgggggtgga ctgatcaaag tgatatctgt gtctgttgga 2101 atcttgatag ctgattaatt tgccctcaat tctgctccct gaacttcaca cataaatctt 2161 cccaagtggg ttttagggtg tatagatccc agcaggatta aggaagtgga aaagcagcta 2221 acatttcttg aggctctacc acatagcagg cactgtcaca gagtaatggc attaatcccc 2281 ataataatcc tgtgaaggtg atattctcat cccatcttag acatgaggat attggaactc 2341 agagaggtgg ctattgcatt gcgcagaacg ctacagagcc catgctcttc ccagagcagc 2401 acccacaaaa gcaagcattg attttgtgct cagtgtgtgc caagcactgt gcagagggta 2461 cacagttcct gccaggttaa caccctccct tcaggcctcc caaaggcata ggcttgcaaa 2521 gagcagaagg tgtgaaatca cactcttcct ctgggcatcc tggatccctg aattatcccc 2581 ccccccatga agtacttcaa gggccaagct gcccctttcc ctcctctccg cccatgaaaa 2641 tgcctccaaa ctgagatgct ttcagctgag aacagatttg actcacagac attaccaaag 2701 aggagcttgt gaatccagga aaagctccag ggggctagct gatctgagca gagagctttc 2761 agtgacccat tttcctgtct agactctgcc ttaagctagt ggcaactgct ggggccccag 2821 gtacttggga catggaaact cgttggatgg ctgggcagat gtaagcctgt ccatgcagtc 2881 agccgatcct ctgctcaggt tcagctggac tctgccatct gtgggcccag catcactctg 2941 taagttcctt gaaaggaaga acaaccttag agtatttctg atacaaaatg agggcctctg 3001 ctcttgattt aattataaaa tgtctacgtc tttctccagt ttctgagccc tatgcacatt 3061 ggcttgtggg cttgttcttc ctgccaaatg atcagagagg gaacattcca tttatttgta 3121 gtggatttcc tctggagggc atgtacccac actaaatacc aactgctctt cctcagctgt 3181 agtccccaac atcagacttg gcacgtggtg gacactaaca cacaggcact caatgaatga 3241 gtgaaggaaa taaaagtcac cccccgttgg tgagaaggtg cctatccccc tgagtcctca 3301 gtgcaggacc agtggatgaa aggcaaggta aagaggccca agataggctg gcttcccccg 3361 ttcaaggtat agtctgcctt taagggagtt ttagaaccaa catgcaagac attgaaagaa 3421 atcttgcaag agccattatt gacttagatc caaaacagcc tctctcatgt ctaaaaaggc 3481 acagaatttt gcagatctga ggaagaggga tgcattacct ttttgcttct tttcaattgc 3541 ttagtgtttc taatcatact taatccacac taatgtgcgc aattataata aatgctaaaa 3601 tatcaaaaaa aaaa // LOCUS HSLINE1O 6065 bp DNA PRI 15-JUN-1992 DEFINITION Human DNA for LINE-1 transposable element ORFI and II. ACCESSION X52235 NID g34372 KEYWORDS LINE repetitive sequence; poly(A)-type retrotransposon; retrotransposon. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6065) AUTHORS Hohjoh,H. TITLE Direct Submission JOURNAL Submitted (19-MAR-1990) Hohjoh H., Research Laboratory for Genetic Information, Kyushi University, 18 Fukuoka 812, Japan REFERENCE 2 (bases 1 to 6065) AUTHORS Hohjoh,H., Minakami,R. and Sakaki,Y. TITLE Selective cloning and sequence analysis of the human L1 (LINE-1) sequences which transposed in the relatively recent past JOURNAL Nucleic Acids Res. 18 (14), 4099-4104 (1990) MEDLINE 90332398 REFERENCE 3 (bases 1 to 6065) AUTHORS Minakami,R., Kurose,K., Etoh,K., Furuhata,Y., Hattori,M. and Sakaki,Y. TITLE Identification of an internal cis-element essential for the human L1 transcription and a nuclear factor(s) binding to the element JOURNAL Nucleic Acids Res. 20 (12), 3139-3145 (1992) MEDLINE 92319645 FEATURES Location/Qualifiers source 1..6065 /organism="Homo sapiens" /strain="Japanese" /db_xref="taxon:9606" /tissue_type="placenta" CDS 904..1050 /note="ORFI" /codon_start=1 /db_xref="PID:g34373" /translation="MGKKQNRKTGNCKMHSASPPPKEHSSSPATEQSWMENDFDELRE EGFR" CDS 3668..5806 /note="ORFII; Author-given protein sequence is in conflict with the conceptual translation" /codon_start=1 /db_xref="PID:e33293" /db_xref="PID:g1335205" /translation="VGFIPGMQDWFNMHKSINVIQHINRTKDKNHMIVSIDAEKAFDK IQQPFMLKTLNKLGIDGTYFKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSP LLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKPIS NFSKVSGYKINVQKSQAFLYTNNRQTESQIMNELPFTIASKRIKYLGIQITRDVKDLF KENYKPLLKEIKEDTNKWKNIPCSWVGRINIMKMAILPKVIYRFNTIPIKLPMTFFTE LEKTTLKFIWNQKRARIAKAIRSQKNKSGGITLPDFKLYYKATVTKTAWYWYQNRDID QWNRTEPSEITPHIYNYLIFDKPEKNEQWGKDSLFNKWCWENWLAICRKLKLDPFLTP YTKINSRWIKDLIVRPKTIKTLEENLGITIQDIGMGKDFMSKTPKAMATKAKIDKWDL IKLKSFCTAKETTIRVNRQPTKWEKTFATYSSDKGLISRIYNELKQIYKKKTNNPIKK WAKDMNRHFSKEDIYAAKKHMKKCSPSLAIREMQIKTTMRYHLTPVRMAIIKKSGNNR CWRGCGEIGTLLHCWWDCKLVQPLWKSVWRFLRDLEPEIPFDPAIPLLGIYPKDSKSC CYKDTCTRMFIAALFTIAKTWNQPKCPTMIDWIKKMWHIYTMEYYAAIKNDEFVSFVG TWMKLEIIILSKLSQEQKTKHCIFSLIGGN" BASE COUNT 2355 a 1307 c 1225 g 1178 t ORIGIN 1 gagaggagcc aagatggccg aatagcaaca gctccggtct acagctccca gcgtgagcga 61 cgcagaagac gggtgatttc tgcatttcca tctgaggtac tgggttcatc tcactaggga 121 gtgccagaca gtgggcgcag gtcagtgggt gcgcgcaccg tgcacgagcc gaagcagggc 181 gaggcattgt ctcacttggg aagcgcaagg ggtcagggag ttccctttcc gagtcaaaga 241 aaggggtgac ggacgcacct ggaaaatcgg gtctctccca cccgaatatt gcgctttcgg 301 accggcttaa aaaacggcgc accgcgagat tatatcttgc acctggctag gagggtccta 361 cgcccacgga gtctcgctga ttgctagcac agcagtctga gatcaaactg caaggccgca 421 gcaaggctgg gggaggggcg cccgccattg cccaggcttg cttgggtaaa caaagcagcc 481 tggcagctcg aactgggtgg agcccaccac agctcaagga ggcctgcctg cctttgtagg 541 ctccacctct gggggcaggg cacagacaaa taaaaagaca gcagtaacct ctgcagactt 601 aaatgtccct gtctgacagc tttgaagaga gcagtggttc tcccagcatg cagctggaga 661 tctgagaacg ggcagactgc ctcctcaagt gggtccctga cccctgaccc ccgagcagcc 721 taactgggag gcacccccca gcaggggcac actgacacct cacacggcag ggtattccaa 781 cagacctgca gctgagggtc ctgtctgtta gaaggaaaac taacaaacag aaaggacatc 841 cacaccaaaa acccatctgt acatcaccat catcaaagat caaaagtaga taaaaccacg 901 aagatgggga aaaaacagaa cagaaaaact ggaaactgta aaatgcatag tgcctctcct 961 cctccaaagg aacacagttc ctcaccagca acggaacaaa gctggatgga gaatgacttt 1021 gacgagctga gagaagaagg cttcagatga tcaaattact ctgagctatg ggaggacatt 1081 caaaccaaag gcaaagaagt tgaaaccttt gaaaaaaatt tagaagaatg tataactaga 1141 ataaccaata cagagaagtg cttaaaggag ctgatggagc tgaaaaccaa ggctcgagaa 1201 ctacgtgaag aatgcagaag cctcaggagc cgatgcgatc aactggaaga aagggtatca 1261 gcaatggaag atgaaatgaa tgaaatgaag caagaaggga agtttagaga caaaagaata 1321 aaaagaaatg agcaaagcct ccaagaaata tgggactatg tgaaaagacc aaatctatgt 1381 ctgatcggtg tacctgaaag tgatggggac aatggaacca agttggaaaa cacgctgcag 1441 gatattatcc aggagaactt ccccaatcta gcaaggaagg ccaacgttca gattcaggaa 1501 atacagagaa cgccacaaag atactcctcg agaagagcaa ctccaagaca cataattgtc 1561 agattcacca aagttgaaat gaaggaaaaa atgttaaggg cagccagaga gaaaggtcgg 1621 gttaccctca aagggaaacc catcagacta acagcggatc tctcagcaga aaccctacaa 1681 gccagaagag agtgggggcc aatattcaac attcttaaag aaaagaattt tcaacccaga 1741 atttcatatc cagccaaact aagcttcata agtgaaggag aaataaaatc ctttacagac 1801 aagaaatgct gagagatttt gtcaccacca ggcctaccct aaaagagctc ctgaaggaag 1861 cgccaaacat ggaaaggaac aaccggtacc agccgctgca aaatcatgcc aaaatgtaaa 1921 gaccatcgag actaggaaga aactgcatca actaacgagc aaaatcacca gctaacatca 1981 taatgacagg atcaaattca cacataacaa tattaacttt aaatgtaaat ggattaaatg 2041 ctccaattaa aagacacaga ctggcaaatt ggataaagag tcaagaccca tcagtgtgct 2101 gtattcagga aacccatctc atgtgcagag acacacatag gctcaaaata aaaggatgga 2161 ggaagatcta ccaagcaaat gtaaaacaaa aaaaggcagg ggttgcaatc ctagtctctg 2221 ataaaacaga ctttaaacca acaaagatca aaagagacaa agaaggccat tacataatgg 2281 taaagggatc aattcaacaa gaagagctaa ctatcctaaa tatatatgca cccaatacag 2341 gagcacccag attcataaag aaagtcatga gtgacctaca aagagactta gactcccaca 2401 cattaataat gggagacttt aacaccccac tgtcaacatt agacagatca acgagacaga 2461 aagtcaacaa ggatacccag gaattgaacc tctgcaccaa gcagacctaa tagacatcta 2521 cagaactctc caccccaaat caacagaatc tacatttttt tcagcaccac accacaccta 2581 ttccaaaatt gaccacatac ttggaagtaa agctctcctc agcaaatgta aaagaacaga 2641 aattataaca aactatctct cagaccacag tgcaatcaaa ctagaactca ggattaagaa 2701 tctcattcaa aaccgctcaa ctacatggaa actgaacaac ctgctcctga atgactactg 2761 ggtacataac gaaatgaagg cagaaataaa gatgttcttt gaaaccaacg agaacaaaga 2821 cacaacatac cagaatctct gggacgcatt caaagcagtg tgtagaggga aatttatagc 2881 actaaatggc tacaagagaa agcaggaaag atccaaaatt gacacccgaa catcacaatt 2941 aaaagaacta gaaaagcaag agcaaacaca ttcaaaagct agcagaaggc aagaaataac 3001 taaaatcaga gcagaactga aggaaataga gacacaaaaa acccttcaaa aaattaatga 3061 atccaggagc ttgttttttg aaaggatcaa caaaattgat agaccgctag caagactaat 3121 aaagaaaaaa agagagaaga atctaataga cacaataaaa aatgataaag gggatatcac 3181 caccgatccc acagaaatac aaactaccat cagagaatac tacaaacacc tctactcaaa 3241 taaactagaa aatctagaag aaatggataa attcctcgac acatacactc tcccaagact 3301 aaaccaggaa gaagttgaat ctctgaatag accaaaaaca ggatctgaaa ttgtggcaat 3361 aatcaatagt ttaccaacca aaaagagtcc aggaccagat ggattcacag ccgaattcta 3421 ccagaggtac aaggaggaac tggtaccatt ccttctgaaa ctattccaat caatagaaaa 3481 agagggaatc ctccctaact cattttatga ggccagcatc attctgatac caaagccggg 3541 cagagacaca accaaaaaag agaattttag accaatatcc ttgatgaaca ttgatgcaaa 3601 aatcctcaat aaaatactgg caaaatgaat ccagcagcac atcaaaaagt ttatccaccg 3661 tgattaagtg ggcttcatcc ctgggatgca agactggttc aatatgcaca aatcaataaa 3721 tgtaatccag catataaaca gaaccaaaga caaaaaccac atgattgtct caatagatgc 3781 agaaaaagcc tttgacaaaa ttcaacaacc cttcatgcta aaaactctca ataaattagg 3841 tattgatggg acgtatttca aaataataag agctatctat gacaaaccca cagccaatat 3901 catactgaat ggtcaaaaac tggaagcatt ccctttgaaa actggcacaa gacagggatg 3961 ccctctctca ccactcctat tcaacatagt gttggaagtt ctggccaggg caattaggca 4021 ggagaaggaa ataaagggta ttcaattagg aaaagaggaa gtcaaattgt ccctgtttgc 4081 agacgacatg attgtatatc tagaaaaccc cattgtctca gcccaaaatc tccttaagcc 4141 gataagcaac ttcagcaaag tctcaggata caaaatcaat gtacaaaaat cacaagcatt 4201 cttatacacc aacaacagac aaacagagag ccaaatcatg aatgaactac cattcacaat 4261 tgcttcaaag agaataaaat acttaggaat ccaaattaca agggatgtga aggacctctt 4321 caaggagaac tacaaaccac tgctcaagga aataaaagag gatacaaaca aatggaagaa 4381 cattccatgc tcatgggtag gaagaatcaa tatcatgaaa atggccatac tgcccaaggt 4441 aatttacaga ttcaatacca tccccataaa gctaccaatg actttcttca cagaattgga 4501 aaaaactact ttaaagttca tatggaacca aaaaagggcc cgcattgcca aggcaatccg 4561 aagccaaaag aacaaatctg gaggcatcac actacctgac ttcaaactat actacaaggc 4621 tacagtaacc aaaacagcat ggtactggta ccaaaacaga gatatagatc aatggaacag 4681 aacagagccc tcagaaataa cgccgcatat ctacaactat ctgatctttg acaaacctga 4741 gaaaaatgag caatggggaa aggattccct atttaataaa tggtgctggg aaaactggct 4801 agccatatgt agaaagctga aactggatcc tttccttaca ccttatacaa aaatcaattc 4861 aagatggatt aaagacttaa tcgttagacc taaaaccata aaaaccctag aagaaaacct 4921 aggcattacc attcaggaca taggcatggg caaggacttc atgtctaaaa caccaaaagc 4981 aatggcaaca aaagccaaaa ttgacaaatg ggatctaatt aaactaaaga gcttctgcac 5041 agcaaaagaa actaccatca gagtgaacag gcaacctaca aaatgggaga aaactttcgc 5101 aacctactca tctgacaaag ggctaatatc cagaatctac aatgaactca aacaaattta 5161 caagaaaaaa acaaacaacc ccatcaaaaa gtgggcgaag gacatgaaca gacacttctc 5221 aaaagaagac atttatgcag ccaaaaaaca catgaaaaaa tgctcaccat cactggccat 5281 cagagaaatg caaatcaaaa ccacaatgag ataccatctc acaccagtta gaatggcaat 5341 cattaaaaag tcaggaaaca acaggtgctg gagaggatgt ggagaaatag gaacactttt 5401 acactgttgg tgggactgta aactagttca accgttgtgg aagtcagtgt ggcgattcct 5461 cagggatcta gaaccagaaa taccatttga cccagccatc ccattactgg gtatataccc 5521 aaaggactct aaatcatgct gctataaaga cacatgcaca cgtatgttta ttgcggcatt 5581 attcacaata gcaaagactt ggaaccaacc caaatgtcca acaatgatag actggattaa 5641 gaaaatgtgg cacatataca ccatggaata ctatgcagcc ataaaaaatg atgagttcgt 5701 gtcctttgta gggacatgga tgaaattgga aatcatcatt cttagtaaac tatcgcaaga 5761 acaaaaaacc aaacactgca tattctcact cataggtggg aattgaacaa tgagatcaca 5821 tggacacagg aaggggaata tcacactctg gggactgtgg tggggtgggg ggagagggga 5881 gggatagcat tgggagatat acctaatgct agatgacgag ttagtgggtg cagtgcacca 5941 gcatggcaca tttatacata tgtaactaac ctgcacaatg tgcacatgta ccctaaaact 6001 taaagtataa taaaaaaaaa taaaaaaaat aaaaataaaa aagaaaatat gggaaaggta 6061 aaaaa // LOCUS HSLINKC 1492 bp RNA PRI 21-MAR-1994 DEFINITION Human mRNA for cartilage link protein. ACCESSION X17405 Y00166 NID g463246 KEYWORDS cartilage link protein; CRTL1 gene; extracellular matrix protein; link protein; proteoglycan. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1400) AUTHORS Dudhia,J. TITLE Direct Submission JOURNAL Submitted (19-JAN-1990) Dudhia J., Kennedy Institute of Rheumatology, 6 Bute Gardens, Hammersmith, London W6 7DW, UK REFERENCE 2 (bases 1 to 288) AUTHORS Dudhia,J. and Hardingham,T.E. TITLE The primary structure of human cartilage link protein JOURNAL Nucleic Acids Res. 18 (5), 1292 (1990) MEDLINE 90206798 REMARK Erratum:[Nucleic Acids Res 1990 Apr 25;18(8):2214]] REFERENCE 3 (bases 1 to 1400) AUTHORS Dudhia,J. TITLE Direct Submission JOURNAL Submitted (23-MAR-1990) to the EMBL/GenBank/DDBJ databases REFERENCE 4 (bases 1 to 500) AUTHORS Dudhia,J. TITLE Direct Submission JOURNAL Submitted (27-FEB-1989) Dudhia J., Kennedy Institute of Rheumatology, 6 Bute Gardens, Hammersmith, London W6 7DW, UK REMARK revised by [6] MAT REFERENCE 5 (bases 1 to 1400) AUTHORS Perkins,S.J., Nealis,A.S., Dudhia,J. and Hardingham,T.E. TITLE Immunoglobulin fold and tandem repeat structures in proteoglycan N-terminal domains and link protein JOURNAL J. Mol. Biol. 206 (4), 737-753 (1989) MEDLINE 89293837 REFERENCE 6 (bases 1 to 1492) AUTHORS Dudhia,J. TITLE Direct Submission JOURNAL Submitted (18-MAR-1994) Dudhia J., Kennedy Institute of Rheumatology, 6 Bute Gardens, Hammersmith, London W6 7DW, UK COMMENT See also for link protein 3' mRNA. Related sequence: Y00166. FEATURES Location/Qualifiers source 1..1492 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /cell_type="chondrocyte" /clone_lib="cDNA lambda gt11" /clone="lambda 8.1D3" /chromosome="5" /map="5q13-q14.1" mRNA 1..>1492 /gene="CRTL1" 5'UTR 1..315 /gene="CRTL1" gene 1..1492 /gene="CRTL1" CDS 316..1380 /gene="CRTL1" /codon_start=1 /product="cartilage link protein" /db_xref="PID:g34378" /db_xref="SWISS-PROT:P10915" /translation="MKSLLLLVLISICWADHLSDNYTLDHDRAIHIQAENGPHLLVEA EQAKVFSHRGGNVTLPCKFYRDPTAFGSGIHKIRIKWTKLTSDYLKEVDVFVSMGYHK KTYGGYQGRVFLKGGSDSDASLVITDLTLEDYGRYKCEVIEGLEDDTVVVALDLQGVV FPYFPRLGRYNLNFHEAQQACLDQDAVIASFDQLYDAWRGGLDWCNAGWLSDGSVQYP ITKPREPCGGQNTVPGVRNYGFWDKDKSRYDVFCFTSNFNGRFYYLIHPTKLTYDEAV QACLNDGAQIAKVGQIFAAWKILGYDRCDAGWLADGSVRYPISRPRRRCSPTEAAVRF VGFPDKKHKLYGVYCFRAYN" sig_peptide 316..360 /gene="CRTL1" mat_peptide 361..1377 /gene="CRTL1" /product="cartilage link protein" 3'UTR 1381..>1492 /gene="CRTL1" BASE COUNT 394 a 342 c 366 g 390 t ORIGIN 1 ttaggctgta attaggggat ttgggaggag aactttcctg gtgacgcttt gcttttcttc 61 tgctcttggt gagaaagtgc ctccttcttc ccaggatcag gacctctgcc atccagcgcc 121 acaaagagac attctgcaca cacactcaca cacacacaca cacacacact ctcacactcg 181 cccagagaca aacttaaggt gaggagaaag agcgctagct tcacttgatc tccagcttcc 241 aacttaagca gaacttgaga gcatccgaac tcctggattt caggacaagt gaagaagatt 301 ctttgggcta taaagatgaa gagtctactt cttctggtgc tgatttcaat ctgctgggct 361 gatcatcttt cagacaacta tactctggat catgacagag ctattcacat ccaagcagaa 421 aatggccccc atctacttgt ggaagcagag caagccaagg tgttttcaca cagaggtggc 481 aatgttacac tgccatgtaa attttatcga gaccctacag catttggctc aggaatccat 541 aaaatccgaa ttaagtggac caagctaact tcggattacc tcaaggaagt ggatgttttt 601 gtttccatgg gataccacaa aaaaacctat ggaggctacc agggtagagt gtttctgaag 661 ggaggcagtg atagtgatgc ttctctggtc atcacagacc tcactctgga agattatggg 721 agatataagt gtgaggtgat tgaaggatta gaagatgata ctgttgtggt agcactggac 781 ttacaaggtg tggtattccc ttactttcca cgactggggc gctacaatct caattttcac 841 gaggcgcagc aggcgtgtct ggaccaggat gctgtgatcg cctccttcga ccagctgtac 901 gacgcctggc ggggcgggct ggactggtgc aatgccggct ggctcagtga tggctctgtg 961 caatatccca tcacaaagcc cagagagccc tgtgggggcc agaacacagt gcccggagtc 1021 aggaactacg gattttggga taaagataaa agcagatatg atgttttctg ttttacatcc 1081 aatttcaatg gccgttttta ctatctgatc caccccacca aactgaccta tgatgaagcg 1141 gtgcaagctt gtctcaatga tggtgctcag attgcaaaag tgggccagat atttgctgcc 1201 tggaaaattc tcggatatga ccgctgtgat gcgggctggt tggcggatgg cagcgtccgc 1261 taccccatct ctaggccaag aaggcgctgc agtcctactg aggctgcagt gcgcttcgtg 1321 ggttttccag ataaaaagca taagctgtat ggtgtctact gcttcagagc atacaactga 1381 atgtgccctt agagcgcact agttttaaag tcattaagaa catgtgaaag gtgttttttt 1441 tttccaatat gaactcatgc aagttaccaa aactgtgata accctttttt ac // LOCUS HSLIPAS 1612 bp RNA PRI 23-MAR-1995 DEFINITION Human mRNA for lipoprotein lipase (EC 3.1.1.34). ACCESSION X54516 NID g34382 KEYWORDS lipase; lipoprotein lipase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1612) AUTHORS Takagi,A. TITLE Direct Submission JOURNAL Submitted (28-AUG-1990) Takagi A., National Cardiovascular Center Research Institute, Dept of Etiology and Pathophysiology, 5-7-1 Fujishirodai Suita, Osaka 565, Japan REFERENCE 2 (bases 1 to 1612) AUTHORS Takagi,A., Ikeda,Y. and Yamamoto,A. TITLE DNA sequence of lipoprotein lipase cDNA cloned from human monocytic leukemia THP-1 cells JOURNAL Nucleic Acids Res. 18 (21), 6436 (1990) MEDLINE 91057142 COMMENT Data kindly reviewed (04-DEC-1990) by Takagi A. FEATURES Location/Qualifiers source 1..1612 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="monocytic leukemia THP-1 cells" /clone="HLC 601" /chromosome="8p22" sig_peptide 144..224 /note="signal peptide" CDS 144..1571 /note="precursor protein" /codon_start=1 /db_xref="PID:g34383" /db_xref="SWISS-PROT:P06858" /translation="MESKALLVLTLAVWLQSLTASRGGVAAADQRRDFIDIESKFALR TPEDTAEDTCHLIPGVAESVATCHFNHSSKTFMVIHGWTVTGMYESWVPKLVAALYKR EPDSNVIVVDWLSRAQEHYPVSAGYTKLVGQDVARFINWMEEEFNYPLDNVHLLGYSL GAHAAGIAGSLTNKKVNRITGLDPAGPNFEYAEAPSRLSPDDADFVDVLHTFTRGSPG RSIGIQKPVGHVDIYPNGGTFQPGCNIGEAIRVIAERGLGDVDQLVKCSHERSIHLFI DSLLNEENPSKAYRCSSKEAFEKGLCLSCRKNRCNNLGYEINKVRAKRSSKMYLKTRS QMPYKVFHYQVKIHFSGTESETHTNQAFEISLYGTVAESENIPFTLPEVSTNKTYSFL IYTEVDIGELLMLKLKWKSDSYFSWSDWWSSPGFAIQKIRVKAGETQKKVIFCSREKV SHLQKGKAPAVFVKCHDKSLNKKSG" mat_peptide 225..1568 /note="mature lipoprotein lipase" BASE COUNT 433 a 403 c 412 g 364 t ORIGIN 1 ccacttctag ctgccctgcc atccccttta aagggcgact tgctcagcgc caaaccgcgg 61 ctccagccct ctccagcctc cggctcagcc ggctcatcag tcggtccgcg ccttgcagct 121 cctccagagg gacgcgcccc gagatggaga gcaaagccct gctcgtgctg actctggccg 181 tgtggctcca gagtctgacc gcctcccgcg gaggggtggc cgccgccgac caaagaagag 241 attttatcga catcgaaagt aaatttgccc taaggacccc tgaagacaca gctgaggaca 301 cttgccacct cattcccgga gtagcagagt ccgtggctac ctgtcatttc aatcacagca 361 gcaaaacctt catggtgatc catggctgga cggtaacagg aatgtatgag agttgggtgc 421 caaaacttgt ggccgccctg tacaagagag aaccagactc caatgtcatt gtggtggact 481 ggctgtcacg ggctcaggag cattacccag tgtccgcggg ctacaccaaa ctggtgggac 541 aggatgtggc ccggtttatc aactggatgg aggaggagtt taactaccct ctggacaatg 601 tccatctctt gggatacagc cttggagccc atgctgctgg cattgcagga agtctgacca 661 ataagaaagt caacagaatt actggcctcg atccagctgg acctaacttt gagtatgcag 721 aagccccgag tcgtctttct cctgatgatg cagattttgt agacgtctta cacacattca 781 ccagagggtc ccctggtcga agcattggaa tccagaaacc agttgggcat gttgacattt 841 acccgaatgg aggtactttt cagccaggat gtaacattgg agaagctatc cgcgtgattg 901 cagagagagg acttggagat gtggaccagc tagtgaagtg ctcccacgag cgctccattc 961 atctcttcat cgactctctg ttgaatgaag aaaatccaag taaggcctac aggtgcagtt 1021 ccaaggaagc ctttgagaaa gggctctgct tgagttgtag aaagaaccgc tgcaacaatc 1081 tgggctatga gatcaataaa gtcagagcca aaagaagcag caaaatgtac ctgaagactc 1141 gttctcagat gccctacaaa gtcttccatt accaagtaaa gattcatttt tctgggactg 1201 agagtgaaac ccataccaat caggcctttg agatttctct gtatggcacc gtggccgaga 1261 gtgagaacat cccattcact ctgcctgaag tttccacaaa taagacctac tccttcctaa 1321 tttacacaga ggtagatatt ggagaactac tcatgttgaa gctcaaatgg aagagtgatt 1381 catactttag ctggtcagac tggtggagca gtcccggctt cgccattcag aagatcagag 1441 taaaagcagg agagactcag aaaaaggtga tcttctgttc tagggagaaa gtgtctcatt 1501 tgcagaaagg aaaggcacct gcggtatttg tgaaatgcca tgacaagtct ctgaataaga 1561 agtcaggctg aaactgggcg aatctacaga acaaagaacg gcatgtgaat tc // LOCUS HSLIPCR 1399 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for lipocortin. ACCESSION X05908 NID g34387 KEYWORDS lipocortin; phospholipase a2 inhibitor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1399) AUTHORS Wallner,B.P., Mattaliano,R.J., Hession,C., Cate,R.L., Tizard,R., Sinclair,L.K., Foeller,C., Chow,E.P., Browning,J.L., Ramachandran,K.L. and Pepinsky,R.B. TITLE Cloning and expression of human lipocortin, a phospholipase A2 inhibitor with potential anti-inflammatory activity JOURNAL Nature 320 (6057), 77-81 (1986) MEDLINE 86146879 FEATURES Location/Qualifiers source 1..1399 /organism="Homo sapiens" /db_xref="taxon:9606" precursor_RNA 24..1399 /note="primary transcript" CDS 75..1115 /note="lipocortin (AA 1-346)" /codon_start=1 /db_xref="PID:g34388" /db_xref="SWISS-PROT:P04083" /translation="MAMVSEFLKQAWFIENEEQEYVQTVKSSKGGPGSAVSPYPTFNP SSDVAALHKAIMVKGVDEATIIDILTKRNNAQRQQIKAAYLQETGKPLDETLKKALTG HLEEVVLALLKTPAQFDADELRAAMKGLGTDEDTLIEILASRTNKEIRDINRVYREEL KRDLAKDITSDTSGDFRNALLSLAKGDRSEDFGVNEDLADSDARALYEAGERRKGTDV NVFNTILTTRSYPQLRRVFQKYTKYSKHDMNKVLDLELKGDIEKCLTAIVKCATSKPA FFAEKLHQAMKGVGTRHKALIRIMVSRSEIDMNDIKAFYQKMYGISLCQAILDETKGD YEKILVALCGGN" misc_feature 1379..1384 /note="pot. polyA signal" BASE COUNT 464 a 273 c 298 g 364 t ORIGIN 1 agtgtgaaat cttcagagaa gaatttctct ttagttcttt gcaagaaggt agagataaag 61 acactttttc aaaaatggca atggtatcag aattcctcaa gcaggcctgg tttattgaaa 121 atgaagagca ggaatatgtt caaactgtga agtcatccaa aggtggtccc ggatcagcgg 181 tgagccccta tcctaccttc aatccatcct cggatgtcgc tgccttgcat aaggccataa 241 tggttaaagg tgtggatgaa gcaaccatca ttgacattct aactaagcga aacaatgcac 301 agcgtcaaca gatcaaagca gcatatctcc aggaaacagg aaagcccctg gatgaaacac 361 ttaagaaagc ccttacaggt caccttgagg aggttgtttt agctctgcta aaaactccag 421 cgcaatttga tgctgatgaa cttcgtgctg ccatgaaggg ccttggaact gatgaagata 481 ctctaattga gattttggca tcaagaacta acaaagaaat cagagacatt aacagggtct 541 acagagagga actgaagaga gatctggcca aagacataac ctcagacaca tctggagatt 601 ttcggaacgc tttgctttct cttgctaagg gtgaccgatc tgaggacttt ggtgtgaatg 661 aagacttggc tgattcagat gccagggcct tgtatgaagc aggagaaagg agaaagggga 721 cagacgtaaa cgtgttcaat accatcctta ccaccagaag ctatccacaa cttcgcagag 781 tgtttcagaa atacaccaag tacagtaagc atgacatgaa caaagttctg gacctggagt 841 tgaaaggtga cattgagaaa tgcctcacag ctatcgtgaa gtgcgccaca agcaaaccag 901 ctttctttgc agagaagctt catcaagcca tgaaaggtgt tggaactcgc cataaggcat 961 tgatcaggat tatggtttcc cgttctgaaa ttgacatgaa tgatatcaaa gcattctatc 1021 agaagatgta tggtatctcc ctttgccaag ccatcctgga tgaaaccaaa ggagattatg 1081 agaaaatcct ggtggctctt tgtggaggaa actaaacatt cccttgatgg tctcaagcta 1141 tgatcagaag actttaatta tatattttca tcctataagc ttaaatagga aagtttcttc 1201 aacaggatta cagtgtagct acctacatgc tgaaaaatat agcctttaaa tcatttttat 1261 attataactc tgtataatag agataagtcc attttttaaa aatgttttcc ccaaaccata 1321 aaaccctata caagttgttc tagtaacaat acatgagaaa gatgtctatg tagctgaaaa 1381 taaaatgacg tcacaagac // LOCUS HSLLREP3 934 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for LLRep3. ACCESSION X17206 NID g34391 KEYWORDS LLrep3; repetitive DNA; unidentified reading frame. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 934) AUTHORS Jenner,D.E. TITLE Direct Submission JOURNAL Submitted (04-DEC-1989) Jenner D.E., ICI PHARMACEUTICALS, Biotechnology Department, Alderley Park, Macclesfield, Cheshire SK10 4TG, U K REFERENCE 2 (bases 1 to 934) AUTHORS Slynn,G., Jenner,D., Potts,W., Elvin,P., Morten,J.E. and Markham,A.F. TITLE Human cDNA sequence homologous to the mouse LLRep3 gene family JOURNAL Nucleic Acids Res. 18 (3), 681 (1990) MEDLINE 90175019 COMMENT Data kindly reviewed (28-MAR-1990) by Jenner D. FEATURES Location/Qualifiers source 1..934 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="colon" /clone_lib="lambda-gt11" /clone="lambda-15c" CDS 241..906 /note="put. LLRep3 protein (AA 1-221)" /codon_start=1 /db_xref="PID:g34392" /db_xref="SWISS-PROT:P15880" /translation="MKIKSLEEIYLFSLPIKESEIIDFFLGASLKDEVLKIMPVQKQT RAGQRTRFKAFVAIGDYNGHVGLGVKCSKEVATAIRGAIILAKLSIVPVRRGYWGNKI GKPHTVPCKVTGRCGSVLVRLIPAPRGTGIVSAPVPKKLLMMAGIDDCYTSARGCTAT LGNFAKATFDAISKTYSYLTPDLWKETVFTKSPYQEFTDHLVKTHTRVSVQRTQAPAV ATT" misc_feature 924..929 /note="put. polyA signal" BASE COUNT 193 a 275 c 282 g 184 t ORIGIN 1 ggcatccact tcttttccga caaaacacca aatggcggat gacgccggtg cagcgggggg 61 gcccggaggc tggtggccct gggatgggga accgcggtgg cttccgcgag gtttcggcag 121 tggcattcgg ggccggggtc gcggcgtgga cggggccggg gcgaggcgcg gagctcgcga 181 ggcaaggccg aggataagga gtggatgccc gtcaccaagt tgggccgctt ggtcaaggac 241 atgaagatca agtccctgga ggagatctat ctcttctccc tgcccattaa ggaatcagag 301 atcattgatt tcttcctggg ggcctctctc aaggatgagg ttttgaagat tatgccagtg 361 cagaagcaga cccgtgccgg ccagcgcacc aggttcaagg catttgttgc tatcggggac 421 tacaatggcc acgtcggtct gggtgttaag tgctccaagg aggtggccac cgccatccgt 481 ggggccatca tcctggccaa gctctccatc gtccccgtgc gcagaggcta ctgggggaac 541 aagatcggca agccccacac tgtcccctgc aaggtgacag gccgctgcgg ctctgtgctg 601 gtacgcctca tccctgcacc caggggcact ggcatcgtct ccgcacctgt gcctaagaag 661 ctgctcatga tggctggtat cgatgactgc tacacctcag cccggggctg cactgccacc 721 ctgggcaact tcgccaaggc cacctttgat gccatttcta agacctacag ctacctgacc 781 cccgacctct ggaaggagac tgtattcacc aagtctccct atcaggagtt cactgaccac 841 ctcgtcaaga cccacaccag agtctccgtg cagcggactc aggctccagc tgtggctaca 901 acatagggtt tttatacaag aaaaataaag tgaa // LOCUS HSLON 3103 bp RNA PRI 15-MAR-1994 DEFINITION H.sapiens mRNA for Lon protease-like protein. ACCESSION X74215 NID g414045 KEYWORDS ATP-dependent protease; lon protease. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3103) AUTHORS Amerik,A.Y. TITLE Direct Submission JOURNAL Submitted (21-JUL-1993) A.Y. Amerik, Shemyakin and Ovchinnikov Institute of Bioorganic Chem. Russian Academy of Sciences, 16/10 Miklukho-Maklaya str, 117871 Moscow V-437, Russia REFERENCE 2 (bases 1 to 3103) AUTHORS Amerik,A.Yu., Petukhova,G.V., Grigorenko,V.G., Lykov,I.P., Yarovoi,S.V., Lipkin,V.M. and Gorbalenya,A.E. TITLE Cloning and sequence analysis of cDNA for a human homolog of eubacterial ATP-dependent Lon proteases JOURNAL FEBS Lett. 340 (1-2), 25-28 (1994) MEDLINE 94164302 FEATURES Location/Qualifiers source 1..3103 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /clone_lib="human brain cDNA" /clone="lhs1.6, lhs28, lhs37. lhs64, lhs7, lhs5, lhs20, lhs29, lhs51" CDS 34..2571 /codon_start=1 /product="Lon protease-like protein" /db_xref="PID:g414046" /db_xref="SWISS-PROT:P36777" /translation="MTIPDVFPHLPLIAITRNPVFPRFIKIIEVKNKKLVELLRRKVR LAQPYVGVFLKRDDSNESDVVESLDEIYHTGTFAQIHEMQDLGDKLRMIVMGHRRVHI SRQLEVEPEEPEAENKHKPRRKSKRGKKEAEDELSARHPAELAMEPTPELPAEVLMVE VENVVHEDFQVTEEVKALTAEIVKTIRDIIALNPLYRESVLQMMQAGQRVVDNPIYLS DMGAALTGAESHELQDVLEETNIPKRLYKALSLLKKEFELSKLQQRLGREVEEKIKQT HRKYLLQEQLKIIKKELGLEKDDKDAIEEKFRERLKELVVPKHVMDVVDEELSKLGLL DNHSSEFNVTRNYLDWLTSIPWGKYSNENLDLARAQAVLEEDHYGMEDVKKRILEFIA VSQLRGSTQGKILCFYGPPGVGKTSIARSIARALNREYFRFSVGGMTDVAEIKGHRRT YVGAMPGKIIQCLKKTKTENPLILIDEVDKIGRGYQGDPSSALLELLDPEQNANFLDH YLDVPVDLSKVLFICTANVTDTIPEPLRDRMEMINVSGYVAQEKLAIAERYLVPQARA LCGLDESKAKLSSDVLTLLIKQYCRESGVRNLQKQVEKVLRKSAYKIVSGEAESVEVT PENLQDFVGKPVFTVERMYDVTPPGVVMGLAWTAMGGSTLFVETSLRRPQDKDAKGDK DGSLEVTGQLGEVMKESARIAYTFARAFLMQHAPANDYLVTSHIHLHVPEGATPKDGP SAGCTIVTALLSLAMGRPVRQNLAMTGEVSLTGKILPVGGIKEKTIAAKRAGVTCIIL PAENKKDFYDLAAFITEGLEVHFVEHYREIFDIAFPDEQAEALAVER" polyA_signal 3067..3072 BASE COUNT 734 a 853 c 956 g 560 t ORIGIN 1 ggggaaggcc cggtcataac ggcgctcacg cccatgacga tccccgatgt gtttccgcac 61 ctgccgctca tcgccatcac ccgcaacccg gtgttcccgc gctttatcaa gattatcgag 121 gttaaaaata agaagttggt tgagctgctg agaaggaaag ttcgtctcgc ccagccttat 181 gtcggcgtct ttctaaagag agatgacagc aatgagtcgg atgtggtcga gagcctggat 241 gaaatctacc acacggggac gtttgcccag atccatgaga tgcaggacct tggggacaag 301 ctgcgcatga tcgtcatggg acacagaaga gtccatatca gcagacagct ggaggtggag 361 cccgaggagc cggaggcgga gaacaagcac aagccccgca ggaagtcaaa gcggggcaag 421 aaggaggcgg aggacgagct gagcgccagg cacccggcgg agctggcgat ggagcccacc 481 cctgagctcc cggctgaggt gctcatggtg gaggtagaga acgttgtcca cgaggacttc 541 caggtcacgg aggaggtgaa agccctgact gcagagatcg tgaagaccat ccgggacatc 601 attgccttga accctctcta cagggagtca gtgctgcaga tgatgcaggc tggccagcgg 661 gtggtggaca accccatcta cctgagcgac atgggcgccg cgctcaccgg ggccgagtcc 721 catgagctgc aggacgtcct ggaagagacc aatattccta agcggctgta caaggccctc 781 tccctgctga agaaggaatt tgaactgagc aagctgcagc agcgcctggg gcgggaggtg 841 gaggagaaga tcaagcagac ccaccgtaag tacctgctgc aggagcagct aaagatcatc 901 aagaaggagc tgggcctgga gaaggacgac aaggatgcca tcgaggagaa gttccgggag 961 cgcctgaagg agctcgtggt ccccaagcac gtcatggatg ttgtggacga ggagctgagc 1021 aagctgggcc tgctggacaa ccactcctcg gagttcaatg tcacccgcaa ctacctagac 1081 tggctcacgt ccatcccttg gggcaagtac agcaacgaga acctggacct ggcgcgggca 1141 caggcagtgc tggaggaaga ccactacggc atggaggacg tcaagaaacg catcctggag 1201 ttcattgccg ttagccagct ccgcggctcc acccagggca agatcctctg cttctatggc 1261 ccccctggcg tgggtaagac cagcattgct cgctccatcg cccgcgccct gaaccgagag 1321 tacttccgct tcagcgtcgg gggcatgact gacgtggctg agatcaaggg ccacaggcgg 1381 acctacgtgg gcgccatgcc cgggaagatc atccagtgtt tgaagaagac caagacggag 1441 aaccccctga tcctcatcga cgaggtggac aagatcggcc gaggctacca gggggacccg 1501 tcgtcggcac tgctggagct gctggaccca gagcagaatg ccaacttcct ggaccactac 1561 ctggacgtgc ccgtggactt gtccaaggtg ctgttcatct gcacggccaa cgtcacggac 1621 accatccccg agccgctgcg agaccgtatg gagatgatca acgtgtcagg ctacgtggcc 1681 caggagaagc tggccattgc ggagcgctac ctggtgcccc aggctcgcgc cctgtgtggc 1741 ttggatgaga gcaaggccaa gctgtcatcg gacgtgctga cgctgctcat caagcagtac 1801 tgccgcgaga gcggtgtccg caacctgcag aagcaagtgg agaaggtgtt acggaaatcg 1861 gcctacaaga ttgtcagcgg cgaggccgag tccgtggagg tgacgcccga gaacctgcag 1921 gacttcgtgg ggaagcccgt gttcaccgtg gagcgcatgt atgacgtgac accgcccggc 1981 gtggtcatgg ggctggcctg gaccgcaatg ggaggctcca cgctgtttgt ggagacatcc 2041 ctgagacggc cacaggacaa ggatgccaag ggtgacaagg atggcagcct ggaggtgaca 2101 ggccagctgg gggaggtgat gaaggagagc gcccgcatag cctacacctt cgccagagcc 2161 ttcctcatgc agcacgcccc cgccaatgac tacctggtga cctcacacat ccacctgcat 2221 gtgcccgagg gcgccacccc caaggacggc ccaagcgcag gctgcaccat cgtcacggcc 2281 ctgctgtccc tggccatggg caggcctgtc cggcagaatc tggccatgac tggcgaagtc 2341 tccctcacgg gcaagatcct gcctgttggt ggcatcaagg agaagaccat tgcggccaag 2401 cgcgcagggg tgacgtgcat catcctgcca gccgagaaca agaaggactt ctacgacctg 2461 gcagccttca tcaccgaggg cctggaggtg cacttcgtgg aacactaccg ggagatcttc 2521 gacatcgcct tcccggacga gcaggcagag gcgctggccg tggaacggtg acggccaccc 2581 cgggactgca ggcggcggat gtcaggccct gtctgggcca gaactgagcg ctgtggggag 2641 cgcgcccgga cctggcagtg gagccaccga gcgagcagct cggtccagtg acccagatcc 2701 cagggacctc agtcggctta atcagagtgt ggcatagaag ctatttaatg attaaagtca 2761 tttgcagtgg gagttagcat cactaacctg acagttgttg ccaggaattt gctttgttta 2821 ctgctagtat attagaaatc ctagatctca gaatcacaat agtaataaac aacaggggtc 2881 attttttcct aacttactct gtgttcaggt gtggaatttc tgtctcccaa gaggaaatgt 2941 gacttcactt tggtgccaat ggacagaaaa ttctacctgt gctacatagg agaagtttgg 3001 aatgcactta atagctggtt tttacacctt gatttcgagg tggaaagaaa ttgatcatga 3061 atctctaata aatttaaatc tcttaaacca aaaaaaaaaa aaa // LOCUS HSLPH 6274 bp RNA PRI 21-MAR-1995 DEFINITION Human mRNA for lactase-phlorizin hydrolase LPH (EC 3.2.1.23-62). ACCESSION X07994 NID g34399 KEYWORDS glycosylceramidase; hydrolase; phlorizin hydrolase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6274) AUTHORS Mantei,N. TITLE Direct Submission JOURNAL Submitted (28-JUN-1988) Mantei N., Swiss Federal Institute of Technology Zurich, Universitaetstrasse 16, ETH-Zentrum, CH-8092 Zurich, Switzerland REFERENCE 2 (bases 1 to 6274) AUTHORS Mantei,N., Villa,M., Enzler,T., Wacker,H., Boll,W., James,P., Hunziker,W. and Semenza,G. TITLE Complete primary structure of human and rabbit lactase-phlorizin hydrolase: implications for biosynthesis, membrane anchoring and evolution of the enzyme JOURNAL EMBO J. 7 (9), 2705-2713 (1988) MEDLINE 89030634 COMMENT Data kindly reviewed (12-DEC-1988) by Mantei N. FEATURES Location/Qualifiers source 1..6274 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="intestinal mucosa" /clone="pHLac-61, pHLac-5, pHLac-1" sig_peptide 12..68 /note="signal peptide (AA -19 to -1)" CDS 12..5795 /note="LPH prepro-polypeptide (AA -19 to 1908)" /codon_start=1 /db_xref="PID:g34400" /db_xref="SWISS-PROT:P09848" /translation="MELSWHVVFIALLSFSCWGSDWESDRNFISTAGPLTNDLLHNLS GLLGDQSSNFVAGDKDMYVCHQPLPTFLPEYFSSLHASQITHYKVFLSWAQLLPAGST QNPDEKTVQCYRRLLKALKTARLQPMVILHHQTLPASTLRRTEAFADLFADYATFAFH SFGDLVGIWFTFSDLEEVIKELPHQESRASQLQTLSDAHRKAYEIYHESYAFQGGKLS VVLRAEDIPELLLEPPISALAQDTVDFLSLDLSYECQNEASLRQKLSKLQTIEPKVKV FIFNLKLPDCPSTMKNPASLLFSLFEAINKDQVLTIGFDINEFLSCSSSSKKSMSCSL TGSLALQPDQQQDHETTDSSPASAYQRVWEAFANQSRAERDAFLQDTFPEGFLWGAST GAFNVEGGWAEGGRGVSIWDPRRPLNTTEGQATLEVASDSYHKVASDVALLCGLRAQV YKFSISWSRIFPMGHGSSPSLPGVAYYNKLIDRLQDAGIEPMATLFHWDLPQALQDHG GWQNESVVDAFLDYAAFCFSTFGDRVKLWVTFHEPWVMSYAGYGTGQHPPGISDPGVA SFKVAHLVLKAHARTWHHYNSHHRPQQQGHVGIVLNSDWAEPLSPERPEDLRASERFL HFMLGWFAHPVFVDGDYPATLRTQIQQMNRQCSHPVAQLPEFTEAEKQLLKGSADFLG LSHYTSRLISNAPQNTCIPSYDTIGGFSQHVNHVWPQTSSSWIRVVPWGIRRLLQFVS LEYTRGKVPIYLAGNGMPIGESENLFDDSLRVDYFNQYINEVLKAIKEDSVDVRSYIA RSLIDGFEGPSGYSQRFGLHHVNFSDSSKSRTPRKSAYFFTSIIEKNGFLTKGAKRLL PPNTVNLPSKVRAFTFPSEVPSKAKVVWEKFSSQPKFERDLFYHGTFRDDFLWGVSSS AYQIEGAWDADGKGPSIWDNFTHTPGSNVKDNATGDIACDSYHQLDADLNMLRALKVK AYRFSISWSRIFPTGRNSSINSHGVDYYNRLINGLVASNIFPMVTLFHWDLPQALQDI GGWENPALIDLFDSYADFCFQTFGDRVKFWMTFNEPMYLAWLGYGSGEFPPGVKDPGW APYRIAHTVIKAHARVYHTYDEKYRQEQKGVISLSLSTHWAEPKSPGVPRDVEAADRM LQFSLGWFAHPIFRNGDYPDTMKWKVGNRSELQHLATSRLPSFTEEEKRFIRATADVF CLNTYYSRIVQHKTPRLNPPSYEDDQEMAEEEDPSWPSTAMNRAAPWGTRRLLNWIKE EYGDIPIYITENGVGLTNPNTEDTDRIFYHKTYINEALKAYRLDGIDLRGYVAWSLMD NFEWLNGYTVKFGLYHVDFNNTNRPRTARASARYYTEVITNNGMPLAREDEFLYGRFP EGFIWSAASAAYQIEGAWRADGKGLSIWDTFSHTPLRVENDAIGDVACDSYHKIAEDL VTLQNLGVSHYRFSISWSRILPDGTTRYINEAGLNYYVRLIDTLLAASIQPQVTIYHW DLPQTLQDVGGWENETIVQRFKEYADVLFQRLGDKVKFWITLNEPFVIAYQGYGYGTA APGVSNRPGTAPYIVGHNLIKAHAEAWHLYNDVYRASQGGVISITISSDWAEPRDPSN QEDVEAARRYVQFMGGWFAHPIFKNGDYNEVMKTRIRDRSLAAGLNKSRLPEFTESEK RRINGTYDFFGFNHYTTVLAYNLNYATAISSFDADRGVASIADRSWPDSGSFWLKMTP FGFRRILNWLKEEYNDPPIYVTENGVSQREETDLNDTARIYYLRTYINEALKAVQDKV DLRGYTVWSAMDNFEWATGFSERFGLHFVNYSDPSLPRIPKASAKFYASVVRCNGFPD PATGPHACLHQPDAGPTISPVRQEEVQFLGLMLGTTEAQTALYVLFSLVLLGVCGLAF LSYKYCKRSKQGKTQRSQQELSPVSSF" misc_feature 69..5792 /note="LPH propeptide (AA 1 to 1908)" mat_peptide 2616..5792 /note="put. mature LPH (AA 850 to 1908)" misc_feature 6250..6255 /note="polyA signal" polyA_site 6274 /note="polyA site" BASE COUNT 1536 a 1692 c 1582 g 1464 t ORIGIN 1 gttcctagaa aatggagctg tcttggcatg tagtctttat tgccctgcta agtttttcat 61 gctgggggtc agactgggag tctgatagaa atttcatttc caccgctggt cctctaacca 121 atgacttgct gcacaacctg agtggtctcc tgggagacca gagttctaac tttgtagcag 181 gggacaaaga catgtatgtt tgtcaccagc cactgcccac tttcctgcca gaatacttca 241 gcagtctcca tgccagtcag atcacccatt ataaggtatt tctgtcatgg gcacagctcc 301 tcccagcagg aagcacccag aatccagacg agaaaacagt gcagtgctac cggcgactcc 361 tcaaggccct caagactgca cggcttcagc ccatggtcat cctgcaccac cagaccctcc 421 ctgccagcac cctccggaga accgaagcct ttgctgacct cttcgccgac tatgccacat 481 tcgccttcca ctccttcggg gacctagttg ggatctggtt caccttcagt gacttggagg 541 aagtgatcaa ggagcttccc caccaggaat caagagcgtc acaactccag accctcagtg 601 atgcccacag aaaagcctat gagatttacc acgaaagcta tgcttttcag ggcggaaaac 661 tctctgttgt cctgcgagct gaagatatcc cggagctcct gctagaacca cccatatctg 721 cgcttgccca ggacacggtc gatttcctct ctcttgattt gtcttatgaa tgccaaaatg 781 aggcaagtct gcggcagaag ctgagtaaat tgcagaccat tgagccaaaa gtgaaagttt 841 tcatcttcaa cctaaaactc ccagactgcc cctccaccat gaagaaccca gccagtctgc 901 tcttcagcct ttttgaagcc ataaataaag accaagtgct caccattggg tttgatatta 961 atgagtttct gagttgttca tcaagttcca agaaaagcat gtcttgttct ctgactggca 1021 gcctggccct tcagcctgac cagcagcagg accacgagac cacggactcc tctcctgcct 1081 ctgcctatca gagagtctgg gaagcatttg ccaatcagtc cagagcggaa agggatgcct 1141 tcctgcagga tactttccct gaaggcttcc tctggggtgc ctccacagga gcctttaacg 1201 tggaaggagg ctgggccgag ggtgggagag gggtgagcat ctgggatcca cgcaggcccc 1261 tgaacaccac tgagggccaa gcgacgctgg aggtggccag cgacagttac cacaaggtag 1321 cctctgacgt cgccctgctt tgcggcctcc gggctcaggt gtacaagttc tccatctcct 1381 ggtcccggat cttccccatg gggcacggga gcagccccag cctcccaggc gttgcctact 1441 acaacaagct gattgacagg ctacaggatg cgggcatcga gcccatggcc acgctgttcc 1501 actgggacct gcctcaggcc ctgcaggatc atggtggatg gcagaatgag agcgtggtgg 1561 atgccttcct ggactatgcg gccttctgct tctccacatt tggggaccgt gtgaagctgt 1621 gggtgacctt ccatgagccg tgggtgatga gctacgcagg ctatggcacc ggccagcacc 1681 ctcccggcat ctctgaccca ggagtggcct cttttaaggt ggctcacttg gtcctcaagg 1741 ctcatgccag aacttggcac cactacaaca gccatcatcg cccacagcag caggggcacg 1801 tgggcattgt gctgaactca gactgggcag aacccctgtc tccagagagg cctgaggacc 1861 tgagagcctc tgagcgcttc ttgcacttca tgctgggctg gtttgcacac cccgtctttg 1921 tggatggaga ctacccagcc accctgagga cccagatcca acagatgaac agacagtgct 1981 cccatcctgt ggctcaactc cccgagttca cagaggcaga gaagcagctc ctgaaaggct 2041 ctgctgattt tctgggtctg tcgcattaca cctcccgcct catcagcaac gccccacaaa 2101 acacctgcat ccctagctat gataccattg gaggcttctc ccaacacgtg aaccatgtgt 2161 ggccccagac ctcatcctct tggattcgtg tggtgccctg ggggataagg aggctgttgc 2221 agtttgtatc cctggaatac acaagaggaa aagttccaat ataccttgcc gggaatggca 2281 tgcccatagg ggaaagtgaa aatctctttg atgattcctt aagagtagac tacttcaatc 2341 aatatatcaa tgaggtgctc aaggctatca aggaagactc tgtggatgtt cgttcctaca 2401 ttgctcgttc cctcattgat ggcttcgaag gcccttctgg ttacagccag cggtttggcc 2461 tgcaccacgt caacttcagc gacagcagca agtcaaggac tcccaggaaa tctgcctact 2521 ttttcactag catcatagaa aagaacggtt tcctcaccaa gggggcaaaa agactgctac 2581 cacctaatac agtaaacctc ccctccaaag tcagagcctt cacttttcca tctgaggtgc 2641 cctccaaggc taaagtcgtt tgggaaaagt tctccagcca acccaagttc gaaagagatt 2701 tgttctacca cgggacgttt cgggatgact ttctgtgggg cgtgtcctct tccgcttatc 2761 agattgaagg cgcgtgggat gccgatggca aaggccccag catctgggat aactttaccc 2821 acacaccagg gagcaatgtg aaagacaatg ccactggaga catcgcctgt gacagctatc 2881 accagctgga tgccgatctg aatatgctcc gagctttgaa ggtgaaggcc taccgcttct 2941 ctatctcctg gtctcggatt ttcccaactg ggagaaacag ctctatcaac agtcatgggg 3001 ttgattatta caacaggctg atcaatggct tggtggcaag caacatcttt cccatggtga 3061 cattgttcca ttgggacctg ccccaggccc tccaggatat cggaggctgg gagaatcctg 3121 ccttgattga cttgtttgac agctacgcag acttttgttt ccagaccttt ggtgatagag 3181 tcaagttttg gatgactttt aatgagccca tgtacctggc atggctaggt tatggctcag 3241 gggaatttcc cccaggggtg aaggacccag gctgggcacc atataggata gcccacaccg 3301 tcatcaaagc ccatgccaga gtctatcaca cgtacgatga gaaatacagg caggagcaga 3361 agggggtcat ctcgctgagc ctcagtacac actgggcaga gcccaagtca ccaggggtcc 3421 ccagagatgt ggaagccgct gaccgaatgc tgcagttctc cctgggctgg tttgctcacc 3481 ccatttttag aaacggagac tatcctgaca ccatgaagtg gaaagtgggg aacaggagtg 3541 aactgcagca cttagccacc tcccgcctgc caagcttcac tgaggaagag aagaggttca 3601 tcagggcgac ggccgacgtc ttctgcctca acacgtacta ctccagaatc gtgcagcaca 3661 aaacacccag gctaaaccca ccctcctacg aagacgacca ggagatggct gaggaggagg 3721 acccttcgtg gccttccacg gcaatgaaca gagctgcgcc ctgggggacg cgaaggctgc 3781 tgaactggat caaggaagag tatggtgaca tccccattta catcaccgaa aacggagtgg 3841 ggctgaccaa tccgaacacg gaggatactg ataggatatt ttaccacaaa acctacatca 3901 atgaggcttt gaaagcctac aggctcgatg gtatagacct tcgagggtat gtcgcctggt 3961 ctctgatgga caactttgag tggctaaatg gctacacggt caagtttgga ctgtaccatg 4021 ttgatttcaa caacacgaac aggcctcgca cagcaagagc ctccgccagg tactacacag 4081 aggtcattac caacaacggc atgccactgg ccagggagga tgagtttctg tacggacggt 4141 ttcctgaggg cttcatctgg agtgcagctt ctgctgcata tcagattgaa ggtgcgtgga 4201 gagcagatgg caaaggactc agcatttggg acacgttttc tcacacacca ctgagggttg 4261 agaacgatgc cattggagac gtggcctgtg acagttatca caagattgct gaggatctgg 4321 tcaccctgca gaacctgggt gtgtcccact accgtttttc catctcctgg tctcgcatcc 4381 tccctgatgg aaccaccagg tacatcaatg aagcgggcct gaactactac gtgaggctca 4441 tcgatacact gctggccgcc agcatccagc cccaggtgac catttaccac tgggacctac 4501 cacagacgct ccaagatgta ggaggctggg agaatgagac catcgtgcag cggtttaagg 4561 agtatgcaga tgtgctcttc cagaggctgg gagacaaggt gaagttttgg atcacgttga 4621 atgagccctt tgtcattgct taccagggct atggctacgg aacagcagct ccaggagtct 4681 ccaataggcc tggcactgcc ccctacattg ttggccacaa tctaataaag gctcatgctg 4741 aggcctggca tctgtacaac gatgtgtacc gcgccagtca aggtggcgtg atttccatca 4801 ccatcagcag tgactgggct gaacccagag atccctctaa ccaggaggat gtggaggcag 4861 ccaggagata tgttcagttc atgggaggct ggtttgcaca tcctattttc aagaatggag 4921 attacaatga ggtgatgaag acgcggatcc gtgacaggag cttggctgca ggcctcaaca 4981 agtctcggct gccagaattt acagagagtg agaagaggag gatcaacggc acctatgact 5041 tttttgggtt caatcactac accactgtcc tcgcctacaa cctcaactat gccactgcca 5101 tctcttcttt tgatgcagac agaggagttg cttccatcgc agatcgctcg tggccagact 5161 ctggctcctt ctggctgaag atgacgcctt ttggcttcag gaggatcctg aactggttaa 5221 aggaggaata caatgaccct ccaatttatg tcacagagaa tggagtgtcc cagcgggaag 5281 aaacagacct caatgacact gcaaggatct actaccttcg gacttacatc aatgaggccc 5341 tcaaagctgt gcaggacaag gtggaccttc gaggatacac agtttggagt gcgatggaca 5401 attttgagtg ggccacaggc ttttcagaga gatttggtct gcattttgtg aactacagtg 5461 acccttctct gccaaggatc cccaaagcat cagcgaagtt ctacgcctct gtggtccgat 5521 gcaatggctt ccctgacccc gctacagggc ctcacgcttg tctccaccag ccagatgctg 5581 gacccaccat cagccccgtg agacaggagg aggtgcagtt cctggggcta atgctcggca 5641 ccacagaagc acagacagct ttgtacgttc tcttttctct tgtgcttctt ggagtctgtg 5701 gcttggcatt tctgtcatac aagtactgca agcgctctaa gcaagggaaa acacaacgaa 5761 gccaacagga attgagcccg gtgtcttcat tctgatgagt taccacctca agttctatga 5821 agcaggccta gtttcttcat ctatctttac cggccaccaa acaccttagg gtcttagact 5881 ctgctgatac tggacttctc cataaagtcc tgctgcaccg ttagagatga ctttaatctt 5941 gaatgatttc gacttgctga gtaaaatgga aatatctcca tcttgctcca gtatcagagt 6001 tcatttgggc atttgagaag caagtagctc ttgcggaaac gtgtagatac tggtctagtg 6061 ggtctgtgaa ccacttaatt gaacttaaca gggctgtttt aagtttcaga gttgttaagg 6121 gttgttaagg gagcaaaaac cgtaaaaatc cttcctataa gaagaaatca actccattgc 6181 atagactgca atatcatctc ctgcccttct gcaagctctc cctagcttca catcttgtgt 6241 tttccagaaa ataaaaacag aagactgtcc tttc // LOCUS HSLR11 6840 bp RNA PRI 05-JUN-1997 DEFINITION H.sapiens mRNA for mosaic protein LR11. ACCESSION Y08110 NID g1552323 KEYWORDS LR11 gene; mosaic protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6840) AUTHORS Morwald,S., Yamazaki,H., Bujo,H., Kusunoki,J., Kanaki,T., Seimiya,K., Morisaki,N., Nimpf,J., Schneider,W.J. and Saito,Y. TITLE A novel mosaic protein containing LDL receptor elements is highly conserved in humans and chickens JOURNAL Arterioscler. Thromb. Vasc. Biol. 17 (5), 996-1002 (1997) MEDLINE 97301565 REFERENCE 2 (bases 1 to 6840) AUTHORS Morwald,S. TITLE Direct Submission JOURNAL Submitted (16-SEP-1996) S. Morwald, University and Biocenter Vienna, Department of Molecular Genetics, Dr. Bohrg. 9/2, A- 1030 Vienna, AUSTRIA FEATURES Location/Qualifiers source 1..6840 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /clone_lib="lambda gt11" gene 81..6725 /gene="LR11" CDS 81..6725 /gene="LR11" /codon_start=1 /product="mosaic protein LR11" /db_xref="PID:e266648" /db_xref="PID:g1552324" /translation="MATRSSRRESRLPFLFTLVALLPPGALCEVWTQRLHGGSAPLPQ DRGFLVVQGDPRELRLWARGDARGASRADEKPLRRKRSAALQPEPIKVYGQVSLNDSH NQMVVHWAGEKSNVIVALARDSLALARPKSSDVYVSYDYGKSFKKISDKLNFGLGNRS EAVIAQFYHSPADNKRYIFADAYAQYLWITFDFCNTLQGFSIPFRAADLLLHSKASNL LLGFDRSHPNKQLWKSDDFGQTWIMIQEHVKSFSWGIDPYDKPNTIYIERHEPSGYST VFRSTDFFQSRENQEVILEEVRDFQLRDKYMFATKVVHLLGSEQQSSVQLWVSFGRKP MRAAQFVTRHPINEYYIADASEDQVFVCVSHSNNRTNLYISEAEGLKFSLSLENVLYY SPGGAGSDTLVRYFANEPFADFHRVEGLQGVYIATLINGSMNEENMRSVITFDKGGTW EFLQAPAFTGYGEKINCELSQGCSLHLAQRLSQLLNLQLRRMPILSKESAPGLIIATG SVGKNLASKTNVYISSSAGARWREALPGPHYYTWGDHGGIITAIAQGMETNELKYSTN EGETWKTFIFSEKPVFVYGLLTEPGEKSTVFTIFGSNKENVHSWLILQVNATDALGVP CTENDYKLWSPSDERGNECLLGHKTVFKRRTPHATCFNGEDFDRPVVVSNCSCTREDY ECDFGFKMSEDLSLEVCVPDPEFSGKSYSPPVPCPVGSTYRRTRGYRKISGDTCSGGD VEARLEGELVPCPLAEENEFILYAVRKSIYRYDLASGATEQLPLTGLRAAVALDFDYE HNCLYWSDLALDVIQRLCLNGSTGQEVIINSGLETVEALAFEPLSQLLYWVDAGFKKI EVANPDGDFRLTIVNSSVLDRPRALVLVPQEGVMFWTDWGDLKPGIYRSNMDGSAAYH LVSEDVKWPNGISVDDQWIYWTDAYLECIERITFSGQQRSVILDNLPHPYAIAVFKNE IYWDDWSQLSIFRASKYSGSQMEILANQLTGLMDMKIFYKGKNTGSNACVPRPCSLLC LPKANNSRSCRCPEDVSSSVLPSGDLMCDCPQGYQLKNNTCVKEENTCLRNQYRCSNG NCINSIWWCDFDNDCGDMSDERNCPTTICDLDTQFRCQESGTCIPLSYKCDLEDDCGD NSDESHCEMHQCRSDEYNCSSGMCIRSSWVCDGDNDCRDWSDEANCTAIYHTCEASNF QCRNGHCIPQRWACDGDTDCQDGSDEDPVNCEKKCNGFRCPNGTCIPSSKHCDGLRDC SDGSDEQHCEPLCTHFMDFVCKNRQQCLFHSMVCDGIIQCRDGSDEDAAFAGCSQDPE FHKVCDEFGFQCQNGVCISLIWKCDGMDDCGDYSDEANCENPTEAPNCSRYFQFRCEN GHCIPNRWKCDRENDCGDWSDEKDCGDSHILPFSTPGPSTCLPNYYRCSSGTCVMDTW VCDGYRDCADGSDEEACPLLANVTAASTPTQLGRCDRFEFECHQPKTCIPNWKRCDGH QDCQDGRDEANCPTHSTLTCMSREFQCEDGEACIVLSERCDGFLDCSDESDEKACSDE LTVYKVQNLQWTADFSGDVTLTWMRPKKMPSASCVYNVYYRVVGESIWKTLETHSNKT NTVLKVLKPDTTYQVKVQVQCLSKAHNTNDFVTLRTPEGLPDAPRNLQLSLPREAEGV IVGHWAPPIHTHGLIREYIVEYSRSGSKMWASQRAASNFTEIKNLLVNTLYTVRVAAV TSRGIGNWSDSKSITTIKGKVIPPPDIHIDSYGENYLSFTLTMESDIKVNGYVVNLFW AFDTHKQERRTLNFRGSILSHKVGNLTAHTSYEISAWAKTDLGDSPLAFEHVMTRGVR PPAPSLKAKAINQTAVECTWTGPRNVVYGIFYATSFLDLYRNPKSLTTSLHNKTVIVS KDEQYLFLVRVVVPYQGPSSDYVVVKMIPDSRLPPRHLHVVHTGKTSVVIKWESPYDS PDQDLLYAIAVKDLIRKTDRSYKVKSRNSTVEYTLNKLEPGGKYHIIVQLGNMSKDSS IKITTVSLSAPDALKIITENDHVLLFWKSLALKEKHFNESRGYEIHMFDSAMNITAYL GNTTDNFFKISNLKMGHNYTFTVQARCLFGNQICGEPAILLYDELGSGADASATQAAR STDVAAVVVPILFLILLSLGVGFAILYTKHRRLQSSFTAFANSHYSSRLGSAIFSSGD DLGEDDEDAPMITGFSDDVPMVIA" BASE COUNT 1679 a 1744 c 1831 g 1586 t ORIGIN 1 ccggcccagc ggctctcctg gcctcgcgct gcacattctc tcctggcggc ggcgccacct 61 gcagtagcgt tcgcccgaac atggcgacac ggagcagcag gagggagtcg cgactcccgt 121 tcctattcac cctggtcgca ctgctgccgc ccggagctct ctgcgaagtc tggacgcaga 181 ggctgcacgg cggcagcgcg cccttgcccc aggaccgggg cttcctcgtg gtgcagggcg 241 acccgcgcga gctgcggctg tgggcgcgcg gggatgccag gggggcgagc cgcgcggacg 301 agaagccgct ccggaggaaa cggagcgctg ccctgcagcc cgagcccatc aaggtgtacg 361 gacaggttag tctgaatgat tcccacaatc agatggtggt gcactgggct ggagagaaaa 421 gcaacgtgat cgtggccttg gcccgagata gcctggcatt ggcgaggccc aagagcagtg 481 atgtgtacgt gtcttacgac tatggaaaat cattcaagaa aatttcagac aagttaaact 541 ttggcttggg aaataggagt gaagctgtta tcgcccagtt ctaccacagc cctgcggaca 601 acaagcggta catctttgca gacgcttatg cccagtacct ctggatcacg tttgacttct 661 gcaacactct tcaaggcttt tccatcccat ttcgggcagc tgatctcctc ctacacagta 721 aggcctccaa ccttctcttg ggctttgaca ggtcccaccc caacaagcag ctgtggaagt 781 cagatgactt tggccagacc tggatcatga ttcaggaaca tgtcaagtcc ttttcttggg 841 gaattgatcc ctatgacaaa ccaaatacca tctacattga acgacacgaa ccctctggct 901 actccactgt cttccgaagt acagatttct tccagtcccg ggaaaaccag gaagtgatcc 961 ttgaggaagt gagagatttt cagcttcggg acaagtacat gtttgctaca aaggtggtgc 1021 atctcttggg cagtgaacag cagtcttctg tccagctctg ggtctccttt ggccggaagc 1081 ccatgagagc agcccagttt gtcacaagac atcctattaa tgaatattac atcgcagatg 1141 cctccgagga ccaggtgttt gtgtgtgtca gccacagtaa caaccgcacc aatttataca 1201 tctcagaggc agaggggctg aagttctccc tgtccttgga gaacgtgctc tattacagcc 1261 caggaggggc cggcagtgac accttggtga ggtattttgc aaatgaacca tttgctgact 1321 tccaccgagt ggaaggattg caaggagtct acattgctac tctgattaat ggttctatga 1381 atgaggagaa catgagatcg gtcatcacct ttgacaaagg gggaacctgg gagtttcttc 1441 aggctccagc cttcacggga tatggagaga aaatcaattg tgagctttcc cagggctgtt 1501 cccttcatct ggctcagcgc ctcagtcagc tcctcaacct ccagctccgg agaatgccca 1561 tcctgtccaa ggagtcggct ccaggcctca tcatcgccac tggctcagtg ggaaagaact 1621 tggctagcaa gacaaacgtg tacatctcta gcagtgctgg agccaggtgg cgagaggcac 1681 ttcctggacc tcactactac acatggggag accacggcgg aatcatcacg gccattgccc 1741 agggcatgga aaccaacgag ctaaaataca gtaccaatga aggggagacc tggaaaacat 1801 tcatcttctc tgagaagcca gtgtttgtgt atggcctcct cacagaacct ggggagaaga 1861 gcactgtctt caccatcttt ggctcgaaca aagagaatgt ccacagctgg ctgatcctcc 1921 aggtcaatgc cacggatgcc ttgggagttc cctgcacaga gaatgactac aagctgtggt 1981 caccatctga tgagcggggg aatgagtgtt tgctgggaca caagactgtt ttcaaacggc 2041 ggacccccca tgccacatgc ttcaatggag aggactttga caggccggtg gtcgtgtcca 2101 actgctcctg cacccgggag gactatgagt gtgacttcgg tttcaagatg agtgaagatt 2161 tgtcattaga ggtttgtgtt ccagatccgg aattttctgg aaagtcatac tcccctcctg 2221 tgccttgccc tgtgggttct acttacagga gaacgagagg ctaccggaag atttctgggg 2281 acacttgtag cggaggagat gttgaagcgc gactggaagg agagctggtc ccctgtcccc 2341 tggcagaaga gaacgagttc attctgtatg ctgtgaggaa atccatctac cgctatgacc 2401 tggcctcggg agccaccgag cagttgcctc tcaccgggct acgggcagca gtggccctgg 2461 actttgacta tgagcacaac tgtttgtatt ggtccgacct ggccttggac gtcatccagc 2521 gcctctgttt gaatggaagc acagggcaag aggtgatcat caattctggc ctggagacag 2581 tagaagcttt ggcttttgaa cccctcagcc agctgcttta ctgggtagat gcaggcttca 2641 aaaagattga ggtagctaat ccagatggcg acttccgact cacaatcgtc aattcctctg 2701 tgcttgatcg tcccagggct ctggtcctcg tgccccaaga gggggtgatg ttctggacag 2761 actggggaga cctgaagcct gggatttatc ggagcaatat ggatggttct gctgcctatc 2821 acctggtgtc tgaggatgtg aagtggccca atggcatctc tgtggacgac cagtggattt 2881 actggacgga tgcctacctg gagtgcatag agcggatcac gttcagtggc cagcagcgct 2941 ctgtcattct ggacaacctc ccgcacccct atgccattgc tgtctttaag aatgaaatct 3001 actgggatga ctggtcacag ctcagcatat tccgagcttc caaatacagt gggtcccaga 3061 tggagattct ggcaaaccag ctcacggggc tcatggacat gaagattttc tacaagggga 3121 agaacactgg aagcaatgcc tgtgtgccca ggccatgcag cctgctgtgc ctgcccaagg 3181 ccaacaacag tagaagctgc aggtgtccag aggatgtgtc cagcagtgtg cttccatcag 3241 gggacctgat gtgtgactgc cctcagggct atcagctcaa gaacaatacc tgtgtcaaag 3301 aagagaacac ctgtcttcgc aaccagtatc gctgcagcaa cgggaactgt atcaacagca 3361 tttggtggtg tgactttgac aacgactgtg gagacatgag cgatgagaga aactgcccta 3421 ccaccatctg tgacctggac acccagtttc gttgccagga gtctgggact tgtatcccac 3481 tgtcctataa atgtgacctt gaggatgact gtggagacaa cagtgatgaa agtcattgtg 3541 aaatgcacca gtgccggagt gacgagtaca actgcagttc cggcatgtgc atccgctcct 3601 cctgggtatg tgacggggac aacgactgca gggactggtc tgatgaagcc aactgtaccg 3661 ccatctatca cacctgtgag gcctccaact tccagtgccg aaacgggcac tgcatccccc 3721 agcggtgggc gtgtgacggg gatacggact gccaggatgg ttccgatgag gatccagtca 3781 actgtgagaa gaagtgcaat ggattccgct gcccaaacgg cacttgcatc ccatccagca 3841 aacattgtga tggtctgcgt gattgctctg atggctccga tgaacagcac tgcgagcccc 3901 tctgtacgca cttcatggac tttgtgtgta agaaccgcca gcagtgcctg ttccactcca 3961 tggtctgtga cggaatcatc cagtgccgcg acgggtccga tgaggatgcg gcgtttgcag 4021 gatgctccca agatcctgag ttccacaagg tatgtgatga gttcggtttc cagtgtcaga 4081 atggagtgtg catcagtttg atttggaagt gcgacgggat ggatgattgc ggcgattatt 4141 ctgatgaagc caactgcgaa aaccccacag aagccccaaa ctgctcccgc tacttccagt 4201 ttcggtgtga gaatggccac tgcatcccca acagatggaa atgtgacagg gagaacgact 4261 gtggggactg gtctgatgag aaggattgtg gagattcaca tattcttccc ttctcgactc 4321 ctgggccctc cacgtgtctg cccaattact accgctgcag cagtgggacc tgcgtgatgg 4381 acacctgggt gtgcgacggg taccgagatt gtgcagatgg ctctgacgag gaagcctgcc 4441 ccttgcttgc aaacgtcact gctgcctcca ctcccaccca acttgggcga tgtgaccgat 4501 ttgagttcga atgccaccaa ccgaagacgt gtattcccaa ctggaagcgc tgtgacggcc 4561 accaagattg ccaggatggc cgggacgagg ccaattgccc cacacacagc accttgactt 4621 gcatgagcag ggagttccag tgcgaggacg gggaggcctg cattgtgctc tcggagcgct 4681 gcgacggctt cctggactgc tcggacgaga gcgatgaaaa ggcctgcagt gatgagttga 4741 ctgtgtacaa agtacagaat cttcagtgga cagctgactt ctctggggat gtgactttga 4801 cctggatgag gcccaaaaaa atgccctctg catcttgtgt atataatgtc tactacaggg 4861 tggttggaga gagcatatgg aagactctgg agacccacag caataagaca aacactgtat 4921 taaaagtctt gaaaccagat accacgtatc aggttaaagt acaggttcag tgtctcagca 4981 aggcacacaa caccaatgac tttgtgaccc tgaggacccc agagggattg ccagatgccc 5041 ctcgaaatct ccagctgtca ctccccaggg aagcagaagg tgtgattgta ggccactggg 5101 ctcctcccat ccacacccat ggcctcatcc gtgagtacat tgtagaatac agcaggagtg 5161 gttccaagat gtgggcctcc cagagggctg ctagtaactt tacagaaatc aagaacttat 5221 tggtcaacac tctatacacc gtcagagtgg ctgcggtgac tagtcgtgga ataggaaact 5281 ggagcgattc taaatccatt accaccataa aaggaaaagt gatcccacca ccagatatcc 5341 acattgacag ctatggtgaa aattatctaa gcttcaccct gaccatggag agtgatatca 5401 aggtgaatgg ctatgtggtg aaccttttct gggcatttga cacccacaag caagagagga 5461 gaactttgaa cttccgagga agcatattgt cacacaaagt tggcaatctg acagctcata 5521 catcctatga gatttctgcc tgggccaaga ctgacttggg ggatagccct ctggcatttg 5581 agcatgttat gaccagaggg gttcgcccac ctgcacctag cctcaaggcc aaagccatca 5641 accagactgc agtggaatgt acctggaccg gcccccggaa tgtggtttat ggtattttct 5701 atgccacgtc ctttcttgac ctctatcgca acccgaagag cttgactact tcactccaca 5761 acaagacggt cattgtcagt aaggatgagc agtatttgtt tctggtccgt gtagtggtac 5821 cctaccaggg gccatcctct gactacgttg tagtgaagat gatcccggac agcaggcttc 5881 caccccgtca cctgcatgtg gttcatacgg gcaaaacctc cgtggtcatc aagtgggaat 5941 caccgtatga ctctcctgac caggacttgt tgtatgcaat tgcagtcaaa gatctcataa 6001 gaaagactga caggagctac aaagtaaaat cccgtaacag cactgtggaa tacaccctta 6061 acaagttgga gcctggcggg aaataccaca tcattgtcca actggggaac atgagcaaag 6121 attccagcat aaaaattacc acagtttcat tatcagcacc tgatgcctta aaaatcataa 6181 cagaaaatga tcatgttctt ctgttttgga aaagcctggc tttaaaggaa aagcatttta 6241 atgaaagcag gggctatgag atacacatgt ttgatagtgc catgaatatc acagcttacc 6301 ttgggaatac tactgacaat ttctttaaaa tttccaacct gaagatgggt cataattaca 6361 cgttcaccgt ccaagcaaga tgcctttttg gcaaccagat ctgtggggag cctgccatcc 6421 tgctgtacga tgagctgggg tctggtgcag atgcatctgc aacgcaggct gccagatcta 6481 cggatgttgc tgctgtggtg gtgcccatct tattcctgat actgctgagc ctgggggtgg 6541 ggtttgccat cctgtacacg aagcaccgga ggctgcagag cagcttcacc gccttcgcca 6601 acagccacta cagctccagg ctggggtccg caatcttctc ctctggggat gacctggggg 6661 aagatgatga agatgcccct atgataactg gattttcaga tgacgtcccc atggtgatag 6721 cctgaaagag ctttcctcac tagaaaccaa atggtgtaaa tattttattt gataaagata 6781 gttgatggtt tattttaaaa gatgcacttt gagttgcaat atgttatttt tatatgggcc // LOCUS HSLRPGENE 2840 bp RNA PRI 15-APR-1996 DEFINITION H.sapiens lrp mRNA. ACCESSION X79882 NID g895839 KEYWORDS LRP gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2840) AUTHORS Scheffer,G.L., Wijngaard,P.L., Flens,M.J., Izquierdo,M.A., Slovak,M.L., Pinedo,H.M., Meijer,C.J., Clevers,H.C. and Scheper,R.J. TITLE The drug resistance-related protein LRP is the human major vault protein JOURNAL Nat. Med. 1 (6), 578-582 (1995) MEDLINE 96071506 REFERENCE 2 (bases 1 to 2840) AUTHORS Scheffer,G.L. TITLE Direct Submission JOURNAL Submitted (27-JUN-1994) G.L. Scheffer, VUA Pathologie, Immunology/Cell Biology, Free University Hospital, De Boelelaan 1117, 1081 HV Amsterdam, NETHERLANDS FEATURES Location/Qualifiers source 1..2840 /organism="Homo sapiens" /db_xref="taxon:9606" /haplotype="aneuploid" /cell_line="HT1080/Dr4" /clone_lib="pCDM8" /clone="lrp" gene 106..2796 /gene="lrp" CDS 106..2796 /gene="lrp" /codon_start=1 /db_xref="PID:g895840" /translation="MATEEFIIRIPPYHYIHVLDQNSNVSRVEVGPKTYIRQDNERVL FAPMRMVTVPPRHYCTVANPVSRDAQGLVLFDVTGQVRLRHADLEIRLAQDPFPLYPG EVLEKDITPLQVVLPNTALHLKALLDFEDKDGDKVVAGDEWLFEGPGTYIPRKEVEVV EIIQATIIRQNQALRLRARKECWDRDGKERVTGEEWLVTTVGAYLPAVFEEVLDLVDA VILTEKTALHLRARRNFRDFRGVSRRTGEEWLVTVQDTEAHVPDVHEEVLGVVPITTL GPHNYCVILDPVGPDGKNQLGQKRVVKGEKSFFLQPGEQLEQGIQDVYVLSEQQGLLL RALQPLEEGEDEEKVSHQAGDHWLIRGPLEYVPSAKVEVVEERQAIPLDENEGIYVQD VKTGKVRAVIGSTYMLTQDEVLWEKELPPGVEELLNKGQDPLADRGEKDTAKSLQPLA PRNKTRVVSYRVPHNAAVQVYDYREKRARVVFGPELVSLGPEEQFTVLSLSAGRPKRP HARRALCLLLGPDFFTDVITIETADHARLQLQLAYNWHFEVNDRKDPQETAKLFSVPD FVGDACKAIASRVRGAVASVTFDDFHKNSARIIRTAVFGFETSEAKGPDGMALPRPRD QAVFPQNGLVVSSVDVQSVEPVDQRTRDALQRSVQLAIEITTNSQEAAAKHEAQRLEQ EARGRLERQKILDQSEAEKARKELLELEALSMAVESTGTAKAEAESRAEAARIEGEGS VLQAKLKAQALAIETEAELQRVQKVRELELVYARAQLELEVSKAQQLAEVEVKKFKQM TEAIGPSTIRDLAVAGPEMQVKLLQSLGLKSTLITDGSTPINLFNTAFGLLGMGPEGQ PLGRRVPVAQPWGGDIPPVCSGPSSSWRQPRGACTALTPD" BASE COUNT 610 a 807 c 919 g 504 t ORIGIN 1 cctcgagatc cattgtgctg gaaaggttcc ccatctgagg cgtttgttgc agctacctgc 61 acttctagat tcatcttctt gtgagccctg ggcttaggag tcaccatggc aactgaagag 121 ttcatcatcc gcatcccccc ataccactat atccatgtgc tggaccagaa cagcaacgtg 181 tcccgtgtgg aggtcgggcc aaagacctac atccggcagg acaatgagag ggtactgttt 241 gcccccatgc gcatggtgac cgtcccccca cgtcactact gcacagtggc caaccctgtg 301 tctcgggatg cccagggctt ggtgctgttt gatgtcacag ggcaagttcg gcttcgccac 361 gctgacctcg agatccggct ggcccaggac cccttccccc tgtacccagg ggaggtgctg 421 gaaaaggaca tcacacccct gcaggtggtt ctgcccaaca ctgccctcca tctaaaggcg 481 ctgcttgatt ttgaggataa agatggagac aaggtggtgg caggagatga gtggcttttc 541 gagggacctg gcacgtacat cccccggaag gaagtggagg tcgtggagat cattcaggcc 601 accatcatca ggcagaacca ggctctgcgg ctcagggccc gcaaggagtg ctgggaccgg 661 gacggcaagg agagggtgac aggggaagaa tggctggtca ccacagtagg ggcgtacctc 721 ccagcggtgt ttgaggaggt tctggatttg gtggacgccg tcatccttac ggaaaagaca 781 gccctgcacc tccgggctcg gcggaacttc cgggacttca ggggagtgtc ccgccgcact 841 ggggaggagt ggctggtaac agtgcaggac acagaggccc acgtgccaga tgtccacgag 901 gaggtgctgg gggttgtgcc catcaccacc ctgggccccc acaactactg cgtgattctc 961 gaccctgtcg gaccggatgg caagaatcag ctggggcaga agcgcgtggt caagggagag 1021 aagtcttttt tcctccagcc aggagagcag ctggaacaag gcatccagga tgtgtatgtg 1081 ctgtcggagc agcaggggct gctgctgagg gccctgcagc ccctggagga gggggaggat 1141 gaggagaagg tctcacacca ggctggggac cactggctca tccgcggacc cctggagtat 1201 gtgccatctg ccaaagtgga ggtggtggag gagcgccagg ccatccctct agacgagaac 1261 gagggcatct atgtgcagga tgtcaagacc ggaaaggtgc gcgctgtgat tggaagcacc 1321 tacatgctga cccaggacga agtcctgtgg gagaaagagc tgcctcccgg ggtggaggag 1381 ctgctgaaca aggggcagga ccctctggca gacaggggtg agaaggacac agctaagagc 1441 ctccagccct tggcgccccg gaacaagacc cgtgtggtca gctaccgcgt gccccacaac 1501 gctgcggtgc aggtgtacga ctaccgagag aagcgagccc gcgtggtctt cgggcctgag 1561 ctggtgtcgc tgggtcctga ggagcagttc acagtgttgt ccctctcagc tgggcggccc 1621 aagcgtcccc atgcccgccg tgcgctctgc ctgctgctgg ggcctgactt cttcacagac 1681 gtcatcacca tcgaaacggc ggatcatgcc aggctgcaac tgcagctggc ctacaactgg 1741 cactttgagg tgaatgaccg gaaggacccc caagagacgg ccaagctctt ttcagtgcca 1801 gactttgtag gtgatgcctg caaagccatc gcatcccggg tgcggggggc cgtggcctct 1861 gtcactttcg atgacttcca taagaactca gcccgcatca ttcgcactgc tgtctttggc 1921 tttgagacct cggaagcgaa gggccccgat ggcatggccc tgcccaggcc ccgggaccag 1981 gctgtcttcc cccaaaacgg gctggtggtc agcagtgtgg acgtgcagtc agtggagcct 2041 gtggatcaga ggacccggga cgccctgcaa cgcagcgtcc agctggccat cgagatcacc 2101 accaactccc aggaagcggc ggccaagcat gaggctcaga gactggagca ggaagcccgc 2161 ggccggcttg agcggcagaa gatcctggac cagtcagaag ccgagaaagc tcgcaaggaa 2221 cttttggagc tggaggctct gagcatggcc gtggagagca ccgggactgc caaggcggag 2281 gccgagtccc gtgcggaggc agcccggatt gagggagaag ggtccgtgct gcaggccaag 2341 ctaaaagcac aggccttggc cattgaaacg gaggctgagc tccagagggt ccagaaggtc 2401 cgagagctgg aactggtcta tgcccgggcc cagctggagc tggaggtgag caaggctcag 2461 cagctggctg aggtggaggt gaagaagttc aagcagatga cagaggccat aggccccagc 2521 accatcaggg accttgctgt ggctgggcct gagatgcagg taaaactgct ccagtccctg 2581 ggcctgaaat caaccctcat caccgatggc tccactccca tcaacctctt caacacagcc 2641 tttgggctgc tggggatggg gcccgagggt cagcccctgg gcagaagggt gccagtggcc 2701 cagccctggg gaggggatat ccccccagtc tgctcaggcc cctcaagctc ctggagacaa 2761 ccacgtggtg cctgtactgc gctaactcct gattaataca atggaagttt ctgggcaaaa 2821 aaaaaaaaaa aaagtttcca // LOCUS HSLTFRG 2619 bp RNA PRI 31-MAR-1995 DEFINITION Human mRNA for lactoferrin. ACCESSION X53961 NID g34415 KEYWORDS lactoferrin; lactotransferrin; secreted protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2619) AUTHORS Rey,M.W. TITLE Direct Submission JOURNAL Submitted (16-JUL-1990) Rey M.W., Genencor International, 180 Kimball Way, South San Francisco, CA 94080, USA REFERENCE 2 (bases 1 to 2619) AUTHORS Rey,M.W., Woloshuk,S.L., deBoer,H.A. and Pieper,F.R. TITLE Complete nucleotide sequence of human mammary gland lactoferrin JOURNAL Nucleic Acids Res. 18 (17), 5288 (1990) MEDLINE 90384839 COMMENT sequence is both genomic and cDNA see for conflicting sequence. FEATURES Location/Qualifiers source 1..2619 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="mammary gland" /cell_type="epithelial" /clone_lib="lambda gt11" /clone="HLF-1" promoter 227..232 /note="pot. TATA box" sig_peptide 295..351 /note="signal peptide (AA -19 to -1)" CDS 295..2430 /codon_start=1 /product="precursor (AA -19 to 692)" /db_xref="PID:g34416" /db_xref="SWISS-PROT:P02788" /translation="MKLVFLVLLFLGALGLCLAGRRRRSVQWCAVSQPEATKCFQWQR NMRKVRGPPVSCIKRDSPIQCIQAIAENRADAVTLDGGFIYEAGLAPYKLRPVAAEVY GTERQPRTHYYAVAVVKKGGSFQLNELQGLKSCHTGLRRTAGWNVPTGTLRPFLNWTG PPEPIEAAVARFFSASCVPGADKGQFPNLCRLCAGTGENKCAFSSQEPYFSYSGAFKC LRDGAGDVAFIRESTVFEDLSDEAERDEYELLCPDNTRKPVDKFKDCHLARVPSHAVV ARSVNGKEDAIWNLLRQAQEKFGKDKSPKFQLFGSPSGQKDLLFKDSAIGFSRVPPRI DSGLYLGSGYFTAIQNLRKSEEEVAARRARVVWCAVGEQELRKCNQWSGLSEGSVTCS SASTTEDCIALVLKGEADAMSLDGGYVYTACKCGLVPVLAENYKSQQSSDPDPNCVDR PVEGYLAVAVVRRSDTSLTWNSVKGKKSCHTAVDRTAGWNIPMGLLFNQTGSCKFDEY FSQSCAPGSDPRSNLCALCIGDEQGENKCVPNSNERYYGYTGAFRCLAENAGDVAFVK DVTVLQNTDGNNNEAWAKDLKLADFALLCLDGKRKPVTEARSCHLAMAPNHAVVSRMD KVERLKQVLLHQQAKFGRNGSDCPDKFCLFQSETKNLLFNDNTECLARLHGKTTYEKY LGPQYVAGITNLKKCSTSPLLEACEFLRK" mat_peptide 352..2427 /note="lactoferrin (AA 1-692)" BASE COUNT 626 a 661 c 767 g 565 t ORIGIN 1 gactcctagg ggcttgcaga cctagtggga gagaaagaac atcgcagcag ccaggcagaa 61 ccaggacagg tgaggtgcag gctggctttc ctctcgcagc gcggtgtgga gtcctgtcct 121 gcctcagggc ttttcggagc ctggatcctc aaggaacaag tagacctggc cgcggggagt 181 ggggagggaa ggggtgtcta ttgggcaaca gggcggcaaa gccctgaata aaggggcgca 241 gggcaggcgc aagtgcagag ccttcgtttg ccaagtcgcc tccagaccgc agacatgaaa 301 cttgtcttcc tcgtcctgct gttcctcggg gccctcggac tgtgtctggc tggccgtagg 361 agaaggagtg ttcagtggtg cgccgtatcc caacccgagg ccacaaaatg cttccaatgg 421 caaaggaata tgagaaaagt gcgtggccct cctgtcagct gcataaagag agactccccc 481 atccagtgta tccaggccat tgcggaaaac agggccgatg ctgtgaccct tgatggtggt 541 ttcatatacg aggcaggcct ggccccctac aaactgcgac ctgtagcggc ggaagtctac 601 gggaccgaaa gacagccacg aactcactat tatgccgtgg ctgtggtgaa gaagggcggc 661 agctttcagc tgaacgaact gcaaggtctg aagtcctgcc acacaggcct tcgcaggacc 721 gctggatgga atgtccctac agggacactt cgtccattct tgaattggac gggtccacct 781 gagcccattg aggcagctgt ggccaggttc ttctcagcca gctgtgttcc cggtgcagat 841 aaaggacagt tccccaacct gtgtcgcctg tgtgcgggga caggggaaaa caaatgtgcc 901 ttctcctccc aggaaccgta cttcagctac tctggtgcct tcaagtgtct gagagacggg 961 gctggagacg tggcttttat cagagagagc acagtgtttg aggacctgtc agacgaggct 1021 gaaagggacg agtatgagtt actctgccca gacaacactc ggaagccagt ggacaagttc 1081 aaagactgcc atctggcccg ggtcccttct catgccgttg tggcacgaag tgtgaatggc 1141 aaggaggatg ccatctggaa tcttctccgc caggcacagg aaaagtttgg aaaggacaag 1201 tcaccgaaat tccagctctt tggctcccct agtgggcaga aagatctgct gttcaaggac 1261 tctgccattg ggttttcgag ggtgcccccg aggatagatt ctgggctgta ccttggctcc 1321 ggctacttca ctgccatcca gaacttgagg aaaagtgagg aggaagtggc tgcccggcgt 1381 gcgcgggtcg tgtggtgtgc ggtgggcgag caggagctgc gcaagtgtaa ccagtggagt 1441 ggcttgagcg aaggcagcgt gacctgctcc tcggcctcca ccacagagga ctgcatcgcc 1501 ctggtgctga aaggagaagc tgatgccatg agtttggatg gaggatatgt gtacactgca 1561 tgcaaatgtg gtttggtgcc tgtcctggca gagaactaca aatcccaaca aagcagtgac 1621 cctgatccta actgtgtgga tagacctgtg gaaggatatc ttgctgtggc ggtggttagg 1681 agatcagaca ctagccttac ctggaactct gtgaaaggca agaagtcctg ccacaccgcc 1741 gtggacagga ctgcaggctg gaatatcccc atgggcctgc tcttcaacca gacgggctcc 1801 tgcaaatttg atgaatattt cagtcaaagc tgtgcccctg ggtctgaccc gagatctaat 1861 ctctgtgctc tgtgtattgg cgacgagcag ggtgagaata agtgcgtgcc caacagcaac 1921 gagagatact acggctacac tggggctttc cggtgcctgg ctgagaatgc tggagacgtt 1981 gcatttgtga aagatgtcac tgtcttgcag aacactgatg gaaataacaa tgaggcatgg 2041 gctaaggatt tgaagctggc agactttgcg ctgctgtgcc tcgatggcaa acggaagcct 2101 gtgactgagg ctagaagctg ccatcttgcc atggccccga atcatgccgt ggtgtctcgg 2161 atggataagg tggaacgcct gaaacaggtg ctgctccacc aacaggctaa atttgggaga 2221 aatggatctg actgcccgga caagttttgc ttattccagt ctgaaaccaa aaaccttctg 2281 ttcaatgaca acactgagtg tctggccaga ctccatggca aaacaacata tgaaaaatat 2341 ttgggaccac agtatgtcgc aggcattact aatctgaaaa agtgctcaac ctcccccctc 2401 ctggaagcct gtgaattcct caggaagtaa aaccgaagaa gatggcccag ctccccaaga 2461 aagcctcagc cattcactgc ccccagctct tctccccagg tgtgttgggg ccttggctcc 2521 cctgctgaag gtggggattg cccatccatc tgcttacaat tccctgctgt cgtcttagca 2581 agaagtaaaa tgagaaattt tgttgatatt caaaaaaaa // LOCUS HSLTGFBP4 5054 bp RNA PRI 04-AUG-1997 DEFINITION Homo sapiens mRNA for latent transforming growth factor-beta binding protein-4. ACCESSION Y13622 NID g2190401 KEYWORDS TGF beta binding protein; TGF-beta. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5054) AUTHORS Giltay,R., Kostka,G. and Timpl,R. TITLE Sequence and expression of a novel member (LTBP-4) of the family of latent transforming growth factor-beta binding proteins JOURNAL FEBS Lett. 411 (2-3), 164-168 (1997) MEDLINE 97415399 REFERENCE 2 (bases 1 to 5054) AUTHORS Kostka,G. TITLE Direct Submission JOURNAL Submitted (19-MAY-1997) G. Kostka, MPI fuer Biochemie, 82152 Martinsried, FRG FEATURES Location/Qualifiers source 1..5054 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1..4764 /codon_start=1 /product="latent TGF-beta binding protein-4" /db_xref="PID:e321550" /db_xref="PID:g2190402" /translation="MGDVKALLFVVAARARRLGGAAASESLAVSEAFCRVRSCQPKKC AGPQRCLNPVPAVPSPSPSVRKRQVSLNWQPLTLQEARALLKRRRPRGPGGRGLLRRR PPQRAPAGKAPVLCPLICHNGGVCVKPDRCFCPPDFAGKFCQLHSSGARPPAPAVPGL TRSVYTMPLANHRDDEHGVASMVSVHVEHPQEASVVVHQVERVSGPWEEADAEAVARA EAAARAEAAAPYTVLAQSAPREDGYSDASGFGYCFRELRGGECASPLPGLRTQEVCCR GAGLAWGVHDCQLCSERLGNSERVSAPDGPCPTGFERVNGSCEDVDECATGGRCQHGE CANTRGGYTCVCPDGFLLDSSRSSCISQHVISEAKGPCFRVLRDGGCSLPILRNITKQ ICCCSRVGKAWGRGCQLCPPFGSEGFREICPAGPGYHYSASDLRYNTRPLGQEPPRVS LSQPRTLPATSRPSAGFLPTHRLEPRPEPRPDPRPGPELPLPSIPAWTGPEIPESGPS SGMCQRNPQVCGPGRCISRPSGYTCACDSGFRLSPQGTRCIDVDECRRVPPPCAPGRC ENSPGSFRCVCGPGFRAGPRAAECLDVDECHRVPPPCDLGRCENTPGSFLCVCPAGYQ AAPHGASCQDVDECTQSPGLCGRGGCKNLPGSFRCVCPAGFRGSACEEDVDECAQEPP PCGPGRCDNTAGSFHCACPAGFRSRGPGAPCQDVDECARSPPPCTYGRCENTEGSFQC VCPMGFQPNTAGSECEDVDECENHLACPGQECVNSPGSFQCRTCPSGHHLHRGRCTDV DECSSGAPPCGPHGHCTNTEGSFRCSCAPGYRAPSGRPGPCADVNECLEGDFCFPHGE CLNTDGSFACTCAPGYRPGPRGASCLDVDECSEEDLCQSGICTNTDGSFECICPPGHR AGPDLASCLDVDECRERGPALCGSQRCENSPGSYRCVRDCDPGYHAGPEGTCDDVDEC QEYGPEICGAQRCENTPGSYRCTPACDPGYQPTPGGGCQDVDECRNRSFCGAHAVCQN LPGSFQCLCDQGYEGARDGRHCVDVNECETLQGVCGAALCENVEGSFLCVCPNSPEEF DPMTGRCVPPRTSVGMSPGSQPQAPVSPVLPARPPPPPLSRRPRKPRKGPVGSGCREC YFDTAAPDACDNILARNVTWQECCCTVGEGWGSGCRIQQCPGTETAEYQSLCPHGRGY LAPSGDLSLRRDVDECQLFRDQVCKSGVCVNTAPGYSCYCSNGYYYHTQRLECIDNDE CADEEPACEGGRCVNTVGSYHCTCEPPLVLDGSQRRCVSNESQSLDDNLGVCWQEVGA DLVCSHPRLDRQATYTECCCLYGEAWGMDCALCPAQDSDDFEALCNVLRPPAYSPPRP GGFGLPYEYGPDLGPPYQGLPYGPELYPPPALPYDPYPPPPGPFARREAPYGAPRFDM PDFEDDGGPYGESEAPAPPGPGTRWPYRSRDTRRSFPEPEEPPEGGSYAGSLAEPYEE LEAEECGILDGCTNDRCVRVPEGFTCRCFDGYRLDMTRMACVDINECDEAEAASPLCV NARCLNTDGSFRCICRPGFAPTHQPHHCAPARPRA" BASE COUNT 824 a 1740 c 1590 g 900 t ORIGIN 1 atgggagacg taaaagcgtt gctgtttgtc gttgctgccc gggccagacg tttaggagga 61 gccgctgcat ccgagtccct ggctgtctcc gaagccttct gcagggtccg aagctgccag 121 cccaaaaagt gtgcaggccc ccagcggtgc ctgaacccag tgcctgcagt gcccagtccc 181 agccccagcg tgaggaagag acaggtgtcc ctcaactggc agccactgac gctccaggag 241 gccagagctc tactgaagcg gcggcggccc cgggggccag ggggccgggg actactgaga 301 aggaggcccc cacagcgtgc ccccgctggc aaggccccgg tcctgtgtcc cttgatctgt 361 cacaatggcg gtgtgtgcgt gaagcctgac cgctgcttct gtcccccgga cttcgctggc 421 aagttctgcc agttgcactc ctcgggcgcc cggcccccgg ccccggctgt accaggcctc 481 acccgctccg tgtacactat gccactggcc aaccaccgcg acgacgagca cggcgtggca 541 tctatggtga gcgtccacgt ggagcacccg caggaggcgt cggtggtggt gcaccaggtg 601 gagcgtgtgt ctggcccttg ggaggaggcg gacgctgagg cggtggcgcg ggcggaagcg 661 gcggcgcggg cggaggcggc agcgccctac acggtgttgg cacagagcgc gccgcgggag 721 gacggctact cagatgcctc gggcttcggt tactgctttc gggagctgcg cggaggcgaa 781 tgcgcgtccc cgctgcccgg gctccggacg caggaggtct gctgccgagg ggccggcttg 841 gcctggggcg ttcacgactg tcagctgtgc tccgagcgcc tggggaactc cgaaagagtg 901 agcgccccag atggaccttg tccaaccggc tttgaaagag ttaatgggtc ctgcgaagat 961 gtggatgagt gcgcgactgg cgggcgctgc cagcacggcg agtgtgcaaa cacgcgcggc 1021 gggtacacgt gtgtgtgccc cgacggcttt ctgctcgact cgtcccgcag cagctgcatc 1081 tcccaacacg tgatctcaga ggccaaaggg ccctgcttcc gcgtgctccg cgacggcggc 1141 tgttcgctgc ccattctgcg gaacatcact aaacagatct gctgctgcag ccgcgtaggc 1201 aaggcctggg gccggggctg ccagctctgc ccacccttcg gctcagaggg tttccgggag 1261 atctgcccgg ctggtcctgg ttaccactac tcggcctccg acctccgcta caacaccaga 1321 cccctgggcc aggagccacc ccgagtgtca ctcagccagc ctcgtaccct gccagccacc 1381 tctcggccat ctgcaggctt tctgcccacc catcgcctgg agccccggcc tgaaccccgg 1441 cccgatcccc ggcccggccc tgagcttccc ttgcccagca tccctgcctg gactggtcct 1501 gagattcctg aatcaggtcc ctcctccggc atgtgtcagc gcaaccccca ggtctgcggc 1561 ccaggacgct gcatttcccg gcccagcggc tacacctgcg cttgcgactc tggcttccgg 1621 ctcagccccc agggcacccg atgcattgat gtggacgaat gtcgccgcgt gcccccgccc 1681 tgtgctcccg ggcgctgcga gaactcacca ggcagcttcc gctgcgtgtg cggcccgggc 1741 ttccgagccg gcccacgggc tgcggaatgc ctggatgtgg acgagtgcca ccgcgtgccg 1801 ccgccgtgtg acctcgggcg ctgcgagaac acgccaggca gcttcctgtg cgtgtgcccc 1861 gccgggtacc aggctgcacc gcacggagcc agctgccagg atgtggatga atgcacccag 1921 agcccaggcc tgtgtggccg agggggctgc aagaacctgc ctggctcttt ccgctgtgtt 1981 tgcccggctg gcttccgggg ctcggcgtgt gaagaggatg tggatgagtg tgcccaggaa 2041 ccgccgccct gtgggcccgg ccgctgtgac aacacggcag gctcctttca ctgtgcctgc 2101 cctgctggct tccgctcccg agggcccggg gccccctgcc aagatgtgga tgagtgtgcc 2161 cgaagccccc caccctgcac ctacgggcgg tgtgagaaca cagaaggcag cttccagtgt 2221 gtctgcccca tgggcttcca acccaacact gctggctccg agtgcgaaga tgtggatgaa 2281 tgtgaaaacc acctcgcatg ccctgggcag gaatgtgtga actcgcccgg ctccttccag 2341 tgcaggacct gtccttctgg ccaccacctg caccgtggca gatgcactga tgtggacgaa 2401 tgcagttcgg gtgcccctcc ctgtggtccc cacggccact gcactaacac cgaaggctcc 2461 ttccgctgca gctgcgcgcc aggctaccgg gcgccgtcgg gtcggcccgg gccctgcgca 2521 gacgtgaacg agtgcctgga gggcgatttc tgcttccctc acggcgagtg cctcaacact 2581 gacggctcct ttgcctgtac ttgtgcccct ggctaccgac ccggaccccg cggagcctct 2641 tgcctcgacg ttgacgagtg cagcgaggag gacctttgcc agagcggcat ctgtaccaac 2701 accgacggct ccttcgagtg catctgtcct ccgggacacc gcgctggtcc ggacctcgcc 2761 tcctgtctcg acgtggacga atgtcgcgag cgaggtccag ccctgtgcgg gtcgcagcgt 2821 tgtgagaact ctcccggctc ctaccgctgt gtccgggact gcgatcctgg gtaccacgcg 2881 ggccccgagg gcacctgtga cgatgtggat gagtgccaag aatatggtcc cgagatttgt 2941 ggagcccagc gttgtgagaa cacccctggc tcctaccgct gtacaccagc ctgtgaccct 3001 ggctatcagc ccacgccagg gggcggatgc caggatgtgg acgaatgccg gaaccggtcc 3061 ttctgcggtg cccacgccgt gtgccagaac ctgcccggct ccttccagtg cctctgtgac 3121 cagggttacg agggggcacg ggatgggcgt cactgcgtgg atgtgaacga gtgtgaaaca 3181 ctacagggtg tatgtggagc tgccctgtgt gaaaatgtcg aaggctcctt cctctgtgtc 3241 tgccccaaca gcccggaaga gtttgacccc atgactggac gctgtgttcc cccacgaact 3301 tctgttggca tgtccccagg ctcgcagccc caggcacctg ttagccccgt tctgcccgcc 3361 aggccacctc cgccacccct gtcccgccga cccagaaaac ctaggaaggg ccctgtgggg 3421 agtgggtgcc gggagtgcta ttttgacaca gcggccccgg atgcatgtga caacatcctg 3481 gctcggaatg tgacatggca ggagtgctgc tgtactgtgg gtgagggctg gggcagcggc 3541 tgccgcatcc agcagtgccc gggcaccgag acagctgagt accagtcatt gtgccctcac 3601 ggccggggct acctggcgcc cagtggagac ctgagcctcc ggagagacgt ggacgaatgt 3661 cagctcttcc gagaccaggt gtgcaagagt ggcgtgtgtg tgaacacggc cccgggctac 3721 tcatgctatt gcagcaacgg ctactactac cacacacagc ggctggagtg catcgataat 3781 gacgagtgcg ccgatgagga accggcctgt gagggcggcc gctgtgtcaa cactgtgggc 3841 tcttatcact gtacctgcga gcccccactg gtgctggatg gctcgcagcg ccgctgcgtc 3901 tccaacgaga gccagagcct cgatgacaat ctgggagtgt gctggcagga agtgggggct 3961 gacctcgtgt gcagccaccc tcggctggac cgtcaggcca cctacacaga gtgctgctgc 4021 ctgtatggag aggcctgggg catggactgc gccctctgcc ctgcgcagga ctcagatgac 4081 ttcgaggccc tgtgcaatgt gctacgcccc cccgcatata gccccccgcg accaggtggc 4141 tttggactcc cctacgagta cggcccagac ttaggtccac cttaccaggg cctcccatat 4201 gggcctgagt tgtacccacc acctgcgcta ccctacgacc cctacccacc gccacctggg 4261 cccttcgccc gccgggaggc tccttatggg gcaccccgct tcgacatgcc agactttgag 4321 gacgatggtg gcccctatgg cgaatctgag gctcctgcgc cacctggccc gggcacccgc 4381 tggccctatc ggtcccggga cacccgccgc tccttcccag agcccgagga gcctcctgaa 4441 ggtggaagct atgctggttc cctggctgag ccctacgagg agctggaggc ggaggagtgc 4501 gggatcctgg acggctgcac caacgaccgc tgcgtgcgcg tccccgaagg cttcacctgc 4561 cgttgcttcg acggctaccg cctggacatg acccgcatgg cctgcgttga catcaacgag 4621 tgtgatgagg ccgaggctgc ctccccgctg tgcgtcaacg cgcgttgcct caacacggat 4681 ggctccttcc gctgcatctg ccgcccggga ttcgcaccca cgcaccagcc gcaccactgt 4741 gcgcccgcac ggccccgggc ctgagccctg gcacccgctg gccacccacc cgcgcccgcc 4801 actcggggcc cctgccgcgc atcctgcagc ccgcttatgc gtatgtgcac ggggccgccc 4861 gcctggacct ggagaaggga cctacggacg cctggaagct gcgacgccct gcactgctcc 4921 cgcctccacc agcgcctccc actgatgtcg tggtcccggg cctggcccag gggccccttt 4981 acatgccctc tcccttttat aaaattttcc attaaaaacc acctattttc taaaaaaaaa 5041 aaaaaaaaaa aaaa // LOCUS HSLUBGRGP 2402 bp RNA PRI 19-SEP-1995 DEFINITION H.sapiens LU gene for Lutheran blood group glycoprotein. ACCESSION X83425 NID g603559 KEYWORDS Lutheran blood group glycoprotein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2402) AUTHORS Parsons,S.F., Mawby,W.J. and Anstee,D.J. TITLE Lutheran blood group glycoprotein is a new member of the immunoglobulin superfamily of proteins (abstract) JOURNAL Vox Sang. 67, 1-1 (1994) REMARK (sites) REFERENCE 2 (bases 1 to 2402) AUTHORS Parsons,S.F., Mallinson,G., Holmes,C.H., Houlihan,J.M., Simpson,K.L., Mawby,W.J., Spurr,N.K., Warne,D., Barclay,A.N. and Anstee,D.J. TITLE The Lutheran blood group glycoprotein, another member of the immunoglobulin superfamily, is widely expressed in human tissues and is developmentally regulated in human liver JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (12), 5496-5500 (1995) MEDLINE 95296337 REFERENCE 3 (bases 1 to 2402) AUTHORS Parsons,S.F. TITLE Direct Submission JOURNAL Submitted (13-DEC-1994) S.F. Parsons, International Blood Group Ref. Lab., Southmead Road, Bristol BS10 5ND, UK FEATURES Location/Qualifiers source 1..2402 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="Lutheran" /tissue_type="placenta" /clone_lib="lambda gt11" /chromosome="19" gene 23..1909 /gene="LU" CDS 23..1909 /gene="LU" /function="unknown" /citation=[2] /codon_start=1 /evidence=experimental /product="Lutheran blood group glycoprotein" /db_xref="PID:g603560" /translation="MEPPDAPAQARGAPRLLLLAVLLAAHPDAQAEVRLSVPPLVEVM RGKSVILDCTPTGTHDHYMLEWFLTDRSGARPRLASAEMQGSELQVTMHDTRGRSPPY QLDSQGRLVLAEAQVGDERDYVCVVRAGAAGTAEATARLNVFAKPEATEVSPNKGTLS VMEDSAQEIATCNSRNGNPAPKITWYRNGQRLEVPVEMNPEGYMTSRTVREASGLLSL TSTLYLRLRKDDRDASFHCAAHYSLPEGRHGRLDSPTFHLTLHYPTEHVQFWVGSPST PAGWVREGDTVQLLCRGDGSPSPEYTLFRLQDEQEEVLNVNLEGNLTLEGVTRGQSGT YGCRVEDYDAADDVQLSKTLELRVAYLDPLELSEGKVLSLPLNSSAVVNCSVHGLPTP ALRWTKDSTPLGDGPMLSLSSITFDSNGTYVCEASLPTVPVLSRTQNFTLLVQGSPEL KTAEIEPKADGSWREGDEVTLICSARGHPDPKLSWSQLGGSPAEPIPGRQGWVSSSLT LKVTSALSRDGISCEASNPHGNKRHVFHFGAVSPQTSQAGVAVMAVAVSVGLLLLVVA VFYCVRRKGGPCCRQRREKGAPPPGEPGLSHSGSEQPEQTGLLMGGASGGARGGSGGF GDEC" sig_peptide 23..115 /gene="LU" /citation=[2] /evidence=experimental mat_peptide 116..1906 /gene="LU" /citation=[2] /function="unknown" /evidence=experimental /product="Lutheran blood group glycoprotein" 3'UTR 1910..2402 /citation=[2] /evidence=experimental polyA_site 2402 /citation=[2] /evidence=experimental BASE COUNT 426 a 844 c 742 g 390 t ORIGIN 1 agtctccgcc gccgccgtga acatggagcc cccggacgca ccggcccagg cgcgcggggc 61 cccgcggctg ctgttgctcg cagtcctgct ggcggcgcac ccagatgccc aggcggaggt 121 gcgcttgtct gtacccccgc tggtggaggt gatgcgagga aagtctgtca ttctggactg 181 cacccctacg ggaacccacg accattatat gctggaatgg ttccttaccg accgctcggg 241 agctcgcccc cgcctagcct cggctgagat gcagggctct gagctccagg tcacaatgca 301 cgacacccgg ggccgcagtc ccccatacca gctggactcc caggggcgcc tggtgctggc 361 tgaggcccag gtgggcgacg agcgagacta cgtgtgcgtg gtgagggcag gggcggcagg 421 cactgctgag gccactgcgc ggctcaacgt gtttgcaaag ccagaggcca ctgaggtctc 481 ccccaacaaa gggacactgt ctgtgatgga ggactctgcc caggagatcg ccacctgcaa 541 cagccggaac gggaacccgg cccccaagat cacgtggtat cgcaacgggc agcgcctgga 601 ggtgcccgta gagatgaacc cagagggcta catgaccagc cgcacggtcc gggaggcctc 661 gggcctgctc tccctcacca gcaccctcta cctgcggctc cgcaaggatg accgagacgc 721 cagcttccac tgcgccgccc actacagcct gcccgagggc cgccacggcc gcctggacag 781 ccccaccttc cacctcaccc tgcactatcc cacggagcac gtgcagttct gggtgggcag 841 cccgtccacc ccagcaggct gggtacgcga gggtgacact gtccagctgc tctgccgggg 901 ggacggcagc cccagcccgg agtatacgct tttccgcctt caggatgagc aggaggaagt 961 gctgaatgtg aatctcgagg ggaacttgac cctggaggga gtgacccggg gccagagcgg 1021 gacctatggc tgcagagtgg aggattacga cgcggcagat gacgtgcagc tctccaagac 1081 gctggagctg cgcgtggcct atctggaccc cctggagctc agcgagggga aggtgctttc 1141 cttacctcta aacagcagtg cagtcgtgaa ctgctccgtg cacggcctgc ccacccctgc 1201 cctacgctgg accaaggact ccactcccct gggcgatggc cccatgctgt cgctcagttc 1261 tatcaccttc gattccaatg gcacctacgt atgtgaggcc tccctgccca cagtcccggt 1321 cctcagccgc acccagaact tcacgctgct ggtccaaggc tcgccagagc taaagacagc 1381 ggaaatagag cccaaggcag atggcagctg gagggaagga gacgaagtca cactcatctg 1441 ctctgcccgc ggccatccag accccaaact cagctggagc caattggggg gcagccccgc 1501 agagccaatc cccggacggc agggttgggt gagcagctct ctgaccctga aagtgaccag 1561 cgccctgagc cgcgatggca tctcctgtga agcctccaac ccccacggga acaagcgcca 1621 tgtcttccac ttcggcgccg tgagccccca gacctcccag gctggagtgg ccgtcatggc 1681 cgtggccgtc agcgtgggcc tcctgctcct cgtcgttgct gtcttctact gcgtgagacg 1741 caaagggggc ccctgctgcc gccagcggcg ggagaagggg gctccgccgc caggggagcc 1801 agggctgagc cactcggggt cggagcaacc agagcagacc ggccttctca tgggaggtgc 1861 ctccggagga gccaggggtg gcagcggggg cttcggagac gagtgctgag ccaagaacct 1921 cctagaggct gtccctggac ctggagctgc aggcatcaga gaaccagccc tgctcacgcc 1981 atgcccgccc ccgccttccc tcttccctct tccctctccc tgcccagccc tcccttcctt 2041 cctctgccgg caaggcaggg acccacagtg gctgcctgcc tccgggaggg aaggagaggg 2101 agggtgggtg ggtgggaggg ggccttcctc cagggaatgt gactctccca ggccccagaa 2161 tagctcctgg acccaagccc aaggcccagc ctgggacaag gctccgaggg tcggctggcc 2221 ggagctattt ttacctcccg cctcccctgc tggtcccccc acctgacgtc ttgctgcaga 2281 gtctgacact ggattccccc ccctcacccc gcccctggtc ccactcctgc ccccgcccta 2341 cctccgcccc accccatcat ctgtggacac tggagtctgg aataaatgct gtttgtcaca 2401 tc // LOCUS HSLYAM1 2330 bp RNA PRI 22-MAR-1995 DEFINITION Human Lyam-1 mRNA for leukocyte adhesion molecule-1. ACCESSION X16150 NID g34428 KEYWORDS cell surface protein; leukocyte adhesion protein; transmembrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2330) AUTHORS Tedder,T.F. TITLE Direct Submission JOURNAL Submitted (09-NOV-1989) Tedder T.F REFERENCE 2 (bases 1 to 2330) AUTHORS Tedder,T.F., Isaacs,C.M., Ernst,T.J., Demetri,G.D., Adler,D.A. and Disteche,C.M. TITLE Isolation and chromosomal localization of cDNAs encoding a novel human lymphocyte cell surface molecule, LAM-1. Homology with the mouse lymphocyte homing receptor and other human adhesion proteins JOURNAL J. Exp. Med. 170 (1), 123-133 (1989) MEDLINE 89310350 FEATURES Location/Qualifiers source 1..2330 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="tonsil" /clone_lib="lambda gt11" /clone="pLAM-1" /chromosome="1q23-25" sig_peptide 92..175 /note="signal peptide (AA -28 to -1)" CDS 92..1210 /note="prepro-polypeptide (AA -28 to 344)" /codon_start=1 /db_xref="PID:g34429" /db_xref="SWISS-PROT:P14151" /translation="MIFPWKCQSTQRDLWNIFKLWGWTMLCCDFLAHHGTDCWTYHYS EKPMNWQRARRFCRDNYTDLVAIQNKAEIEYLEKTLPFSRSYYWIGIRKIGGIWTWVG TNKSLTEEAENWGDGEPNNKKNKEDCVEIYIKRNKDAGKWNDDACHKLKAALCYTASC QPWSCSGHGECVEIINNYTCNCDVGYYGPQCQFVIQCEPLEAPELGTMDCTHPLGNFN FNSQCAFSCSEGTNLTGIEETTCEPFGNWSSPEPTCQVIQCEPLSAPDLGIMNCSHPL ASFSFTSACTFICSEGTELIGKKKTICESSGIWSNPSPICQKLDKSFSMIKEGDYNPL FIPVAVMVTAFSGLAFIIWLARRLKKGKKSKRSMNDPY" misc_feature 176..1207 /note="propeptide (AA 1 to 344)" mat_peptide 206..1207 /note="mature leukocyte adhesion protein (AA 11 to 344)" misc_feature 2296..2301 /note="polyA attachment site" BASE COUNT 661 a 522 c 487 g 660 t ORIGIN 1 gaattccctt tgggcaagga cctgagaccc ttgtgctaag tcaagaggct caatgggctg 61 cagaagaact agagaaggac caagcaaagc catgatattt ccatggaaat gtcagagcac 121 ccagagggac ttatggaaca tcttcaagtt gtgggggtgg acaatgctct gttgtgattt 181 cctggcacat catggaaccg actgctggac ttaccattat tctgaaaaac ccatgaactg 241 gcaaagggct agaagattct gccgagacaa ttacacagat ttagttgcca tacaaaacaa 301 ggcggaaatt gagtatctgg agaagactct gcctttcagt cgttcttact actggatagg 361 aatccggaag ataggaggaa tatggacgtg ggtgggaacc aacaaatctc tcactgaaga 421 agcagagaac tggggagatg gtgagcccaa caacaagaag aacaaggagg actgcgtgga 481 gatctatatc aagagaaaca aagatgcagg caaatggaac gatgacgcct gccacaaact 541 aaaggcagcc ctctgttaca cagcttcttg ccagccctgg tcatgcagtg gccatggaga 601 atgtgtagaa atcatcaata attacacctg caactgtgat gtggggtact atgggcccca 661 gtgtcagttt gtgattcagt gtgagccttt ggaggcccca gagctgggta ccatggactg 721 tactcaccct ttgggaaact tcaacttcaa ctcacagtgt gccttcagct gctctgaagg 781 aacaaactta actgggattg aagaaaccac ctgtgaacca tttggaaact ggtcatctcc 841 agaaccaacc tgtcaagtga ttcagtgtga gcctctatca gcaccagatt tggggatcat 901 gaactgtagc catcccctgg ccagcttcag ctttacctct gcatgtacct tcatctgctc 961 agaaggaact gagttaattg ggaagaagaa aaccatttgt gaatcatctg gaatctggtc 1021 aaatcctagt ccaatatgtc aaaaattgga caaaagtttc tcaatgatta aggagggtga 1081 ttataacccc ctcttcattc cagtggcagt catggttact gcattctctg ggttggcatt 1141 tatcatttgg ctggcaagga gattaaaaaa aggcaagaaa tccaagagaa gtatgaatga 1201 cccatattaa atcgcccttg gtgaaagaaa attcttggaa tactaaaaat catgagatcc 1261 tttaaatcct tccatgaaac gttttgtgtg gtggcacctc ctacgtcaaa catgaagtgt 1321 gtttccttca gtgcatctgg gaagatttct acctgaccaa cagttccttc agcttccatt 1381 tcacccctca tttatccctc aacccccagc ccacaggtgt ttatacagct cagctttttg 1441 tcttttctga ggagaaacaa ataagaccat aaagggaaag gattcatgtg gaatataaag 1501 atggctgact ttgctctttc ttgactcttg ttttcagttt caattcagtg ctgtacttga 1561 tgacagacac ttctaaatga agtgcaaatt tgatacatat gtgaatatgg actcagtttt 1621 cttgcagatc aaatttcgcg tcgtcttctg tatacgtgga ggtacactct atgaagtcaa 1681 aagtctacgc tctcctttct ttctaactcc agtgaagtaa tggggtcctg ctcaagttga 1741 aagagtccta tttgcactgt agcctcgccg tctgtgaatt ggaccatcct atttaactgg 1801 cttcagcctc cccaccttct tcagccacct ctctttttca gttggctgac ttccacacct 1861 agcatctcat gagtgccaag caaaaggaga gaagagagaa atagcctgcg ctgtttttta 1921 gtttgggggt tttgctgttt ccttttatga gacccattcc tatttcttat agtcaatgtt 1981 tcttttatca cgatattatt agtaagaaaa catcactgaa atgctagctg caactgacat 2041 ctctttgatg tcatatggaa gagttaaaac aggtggagaa attccttgat tcacaatgaa 2101 atgctctcct ttcccctgcc cccagacctt ttatccactt acctagattc tacatattct 2161 ttaaatttca tctcaggcct ccctcaaccc caccacttct tttataacta gtcctttact 2221 aatccaaccc atgatgagct cctcttcctg gcttcttact gaaaggttac cctgtaacat 2281 gcaattttgc atttgaataa agcctgcttt ttaagtgtta aaaagaattc // LOCUS HSLYT3 1382 bp RNA PRI 22-MAR-1995 DEFINITION Human mRNA for Lyt-3 protein. ACCESSION X13452 NID g34440 KEYWORDS antigen; cell surface antigen; cell surface glycoprotein; cell-surface marker; glycoprotein; Ly-3 protein; transmembrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1382) AUTHORS DiSanto,J.P., Knowles,R.W. and Flomenberg,N. TITLE The human Lyt-3 molecule requires CD8 for cell surface expression JOURNAL EMBO J. 7 (11), 3465-3470 (1988) MEDLINE 89091089 COMMENT Data kindly reviewed (20-Jul-1989) by Flommenberg N. FEATURES Location/Qualifiers source 1..1382 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HPB-ALL" /clone_lib="lambda gt10" CDS 15..647 /note="Lyt-3 preprotein (AA -21 to 189)" /codon_start=1 /db_xref="PID:g34441" /db_xref="SWISS-PROT:P10966" /translation="MRPRLWLLLAAQLTVLHGNSVLQQTPAYIKVQTNKMVMLSCEAK ISLSNMRIYWLRQRQAPSSDSHHEFLALWDSAKGTIHGEEVEQEKIAVFRDASRFILN LTSVKPEDSGIYFCMIVGSPELTFGKGTQLSVVDFLPTTAQPTKKSTLKKRVCRLPRP ETQKGPLCSPITLGLLVAGVLVLLVSLGVAIHLCCRRRRARLRFMKQFYK" sig_peptide 15..77 /note="signal peptide (AA -21 to -1)" mat_peptide 78..644 /note="mature Lyt-3 protein (AA 1 - 189)" misc_feature 1345..1350 /note="pot. polyA signal" BASE COUNT 298 a 361 c 380 g 343 t ORIGIN 1 gaattccggc cacgatgcgg ccgcggctgt ggctcctcct ggccgcgcag ctgacagttc 61 tccatggcaa ctcagtcctc cagcagaccc ctgcatacat aaaggtgcaa accaacaaga 121 tggtgatgct gtcctgcgag gctaaaatct ccctcagtaa catgcgcatc tactggctga 181 gacagcgcca ggcaccgagc agtgacagtc accacgagtt cctggccctc tgggattccg 241 caaaagggac tatccacggt gaagaggtgg aacaggagaa gatagctgtg tttcgggatg 301 caagccggtt cattctcaat ctcacaagcg tgaagccgga agacagtggc atctacttct 361 gcatgatcgt cgggagcccc gagctgacct tcgggaaggg aactcagctg agtgtggttg 421 atttccttcc caccactgcc cagcccacca agaagtccac ccttaagaag agagtgtgcc 481 ggttacccag gccagagacc cagaagggcc cactttgtag ccccatcacc cttggcctgc 541 tggtggctgg cgtcctggtt ctgctggttt ccctgggagt ggccatccac ctgtgctgcc 601 ggcggaggag agcccggctt cgtttcatga aacaatttta caaataagca gagaatacgg 661 ttttggtgtc ctgctacaaa aagacatcgg tcagtaatga gcacgatgtg gaaaaatgag 721 agaagggaca cattcaaccc tggagagttc aatggctgct gaagctgcct gcttttcact 781 gctgcaaggc ctttctgtgt gtgatgtgca tgggagcaac ttgttcgtgg gtcatcggga 841 atactaggga gaaggtttca ttgcccccag ggcacttcac agagtgtgct ggaggactga 901 gtaagaaatg ctgcccatgc caccgcttcc ggctcctgtg ctttccctga actgggacct 961 ttagtggtgg ccatttagcc accatctttg caggttgctt tgccctggta gggcagtaac 1021 attgggtcct gggtctttca tggggtgatg ctgggctggc tccctcttgg tcttcccagg 1081 ctggggctga ccttcctcgc agagaggcca ggtgcaggtt gggaatgagg cttgctgaga 1141 ggggctgtcc agttcccaga aggcatatca gtctctgagg gcttcctttg gggccgggaa 1201 cttgcgggtt tgaggatagg agttcacttc atcttctcag ctcccatttc tactcttaag 1261 tttctcagct cccatttcta ctctcccatg gcttcatgct tctttcattt tctgtttgtt 1321 ttatacaaat gtcttagttg tacaaataaa gtcccaggtt aaagataaaa aaaccggaat 1381 tc // LOCUS HSM6 1638 bp RNA PRI 20-JUL-1993 DEFINITION H.sapiens mRNA for M6 antigen. ACCESSION X64364 S40605 NID g34448 KEYWORDS M6 antigen. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1638) AUTHORS Stockinger,H. TITLE Direct Submission JOURNAL Submitted (05-FEB-1992) H. Stockinger, Inst of Immunology-Vircc, Brunner Strasse 59, A-1235 Vienna, AUSTRIA REFERENCE 2 (bases 1 to 1638) AUTHORS Kasinrerk,W., Fiebiger,E., Stefanova,I., Baumruker,T., Knapp,W. and Stockinger,H. TITLE Human leukocyte activation antigen M6, a member of the Ig superfamily, is the species homologue of rat OX-47, mouse basigin, and chicken HT7 molecule JOURNAL J. Immunol. 149 (3), 847-854 (1992) MEDLINE 92340888 FEATURES Location/Qualifiers source 1..1638 /organism="Homo sapiens" /db_xref="taxon:9606" gene 58..867 /gene="H34" CDS 58..867 /gene="H34" /codon_start=1 /product="M6 antigen" /db_xref="PID:g34449" /db_xref="SWISS-PROT:P35613" /translation="MAAALFVLLGFALLGTHGASGAAGTVFTTVEDLGSKILLTCSLN DSATEVTGHRWLKGGVVLKEDALPGQKTEFKVDSDDQWGEYSCVFLPEPMGTANIQLH GPPRVKAVKSSEHINEGETAMLVCKSESVPPVTDWAWYKITDSEDKALMNGSESRFFV SSSQGRSELHIENLNMEADPGQYRCNGTSSKGSDQAIITLRVRSHLAALWPFLGIVAE VLVLVTIIFIYEKRRKPEDVLDDDDAGSAPLKSSGQHQNDKGKNVRQRNSS" BASE COUNT 329 a 525 c 470 g 314 t ORIGIN 1 gccgcgggcg gcggcggcag cggttggagg ttgtaggacc ggcgaggaat aggaatcatg 61 gcggctgcgc tgttcgtgct gctgggattc gcgctgctgg gcacccacgg agcctccggg 121 gctgccggca cagtcttcac taccgtagaa gaccttggct ccaagatact cctcacctgc 181 tccttgaatg acagcgccac agaggtcaca gggcaccgct ggctgaaggg gggcgtggtg 241 ctgaaggagg acgcgctgcc cggccagaaa acggagttca aggtggactc cgacgaccag 301 tggggagagt actcctgcgt cttcctcccc gagcccatgg gcacggccaa catccagctc 361 cacgggcctc ccagagtgaa ggccgtgaag tcgtcagaac acatcaacga gggggagacg 421 gccatgctgg tctgcaagtc agagtccgtg ccacctgtca ctgactgggc ctggtacaag 481 atcactgact ctgaggacaa ggccctcatg aacggctccg agagcaggtt cttcgtgagt 541 tcctcgcagg gccggtcaga gctacacatt gagaacctga acatggaggc cgaccccggc 601 cagtaccggt gcaacggcac cagctccaag ggctccgacc aggccatcat cacgctccgc 661 gtgcgcagcc acctggccgc cctctggccc ttcctgggca tcgtggctga ggtgctggtg 721 ctggtcacca tcatcttcat ctacgagaag cgccggaagc ccgaggacgt cctggatgat 781 gacgacgccg gctctgcacc cctgaagagc agcgggcagc accagaatga caaaggcaag 841 aacgtccgcc agaggaactc ttcctgaggc aggtggcccg aggacgctcc ctgctccgcg 901 tctgcgccgc cgccggagtc cactcccagt gcttgcaaga ttccaagttc tcacctctta 961 aagaaaaccc accccgtaga ttcccatcat acacttcctt cttttttaaa aaagttgggt 1021 tttctccatt caggattctg ttccttagga ttttttcctt ctgaagtgtt tcacgagagc 1081 ccgggagctg ctgccctgcg gccccgtctg tggctttcag cctctgggtc tgagtcatgg 1141 ccgggtgggc ggcacagcct tctccactgg ccggagtcag tgccaggtcc ttgccctttg 1201 tggaaagtca caggtcacac gaggggcccc gtgtcctgcc tgtctgaagc caatgctgtc 1261 tggttgcgcc atttttgtgc ttttatgttt aattttatga gggccacggg tctgtgttcg 1321 actcagcctc agggacgact ctgacctctt ggccacagag gactcacttg cccacaccga 1381 gggcgacccc gtcacagcct caagtcactc ccaagccccc tccttgtctg tgcatccggg 1441 ggcagctctg gagggggttt gctggggaac tggcgccatc gccgggactc cagaaccgca 1501 gaagcctccc cagctcaccc ctggaggacg gccggctctc tatagcacca gggctcacgt 1561 gggaaccccc ctcccaccca ccgccacaat aaagatcgcc cccacctcca ccctcaaaaa 1621 aaaaaaaaaa aaaaaaaa // LOCUS HSMAC 10300 bp RNA PRI 21-SEP-1994 DEFINITION H.sapiens giantin mRNA. ACCESSION X75304 NID g405714 KEYWORDS giantin; Golgi apparatus; outer membrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 10300) AUTHORS Seelig,H.P. TITLE Direct Submission JOURNAL Submitted (30-SEP-1993) H.P. Seelig, Institute of Immunology & Molec.Genetics, Kriegsstrasse 99, 76133 Karlsruhe, FRG REFERENCE 2 (bases 1 to 10300) AUTHORS Seelig,H.P., Schranz,P., Schroter,H., Wiemann,C., Griffiths,G. and Renz,M. TITLE Molecular genetic analyses of a 376-kilodalton Golgi complex membrane protein (giantin) JOURNAL Mol. Cell. Biol. 14 (4), 2564-2576 (1994) MEDLINE 94187728 REFERENCE 3 (bases 1 to 10300) AUTHORS Seelig,H.P., Schranz,P., Schroter,H., Wiemann,C. and Renz,M. TITLE Macrogolgin--a new 376 kD Golgi complex outer membrane protein as target of antibodies in patients with rheumatic diseases and HIV infections JOURNAL J. Autoimmun. 7 (1), 67-91 (1994) MEDLINE 94257116 FEATURES Location/Qualifiers source 1..10300 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /clone_lib="lambda gtII cDNA" mat_peptide 127..9903 /note="a new 376kD Golgi complex outher membrane protein" /product="giantin" CDS 127..9906 /note="a new 376kD Golgi complex outher membrane protein" /codon_start=1 /product="giantin" /db_xref="PID:g405715" /translation="MLSRLSGLANVVLHELSGDDDTDQNMRAPLDPELHQESDMEFNN TTQEDVQERLAYAEQLVVELKDIIRQKDVQLQQKDEALQEERKAADNKIKKLKLHAKA KLTSLNKYIEEMKAQGGTVLPTEPQSEEQLSKHDKSSTEEEMEIEKIKHKLQEKEELI STLQAQLTQAQAEQPAQSSTEMEEFVMMKQQLQEKEEFISTLQAQLSQTQAEQAAQQV VREKDARFETQVRLHEDELLQLVTQADVETEMQQKLRVLQRKLEEHEESLVGRAQVVD LLQQELTAAEQRNQILSQQLQQMEAEHNTLRNTVETEREESKILLEKMELEVAERKLS FHNLQEEMHHLLEQFEQAGQAQAELESRYSALEQKHKAEMEEKTSHILSLQKTGQELQ SACDALKDQNSKLLQDKNEQAVQSAQTIQQLEDQLQQKSKEISQFLNRLPLQQHETAS QTSFPDVYNEGTQAVTEENIASLQKRVVELENEKGALLLSSIELEELKAENEKLSSQI TLLEAQNRTGEADREVSEISIVDIANKRSSSAEESGQDVLENTFSQKHKELSVLLLEM KEAQEEIAFLKLQLQGKRAEEADHEVLDQKEMKQMEGEGIAPIKMKVFLEDTGQDFPL MPNEESSLPAVEKEQASTEHQSRTSEEISLNDAGVELKSTKQDGDKSLSAVPDIGQCH QDELERLKSQILELELNFHKAQEIYEKNLDEKAKEISNLNQLIEEFKKNADNNSSAFT ALSEERDQLLSQVKELSMVTELRAQVKQLEMNLAEAERQRRLDYESQTAHDNLLTEQI HSLSIEAKSKDVKIEVLQNELDDVQLQFSEQSTLIRSLQSQLQNKESEVLEGAERVRH ISSKVEELSQALSQKELEITKMDQLLLEKKRDVETLQQTIEEKDQQVTEISFSMTEKM VQLNEEKFSLGVEIKTLKEQLNLLSRAEEAKKEQVEEDNEVSSGLKQNYDEMSPAGQI SKEELQHEFDLLKKENEQRKRKLQAALINRKELLQRVSRLEEELANLKDESKKEIPLS ETERGEVEEDKENKEYSEKCVTSKCQEIEIYLKQTISEKEVELQHIRKDLEEKLAAEE QFQALVKQMNQTLQDKTNQIDLLQAEISENQAIIQKLITSNTDASDGDSVALVKETVV ISPPCTGSSEHWKPELEEKILALEKEKEQLQKKLQEALTSRKAILKKAQEKERHLREE LKQQKDDYNRLQEQFDEQSKENENIGDQLRQLQIQVRESIDGKLPSTDQQESCSSTPG LEEPLFKATEQHHTQPVLESNLCPDWPSHSEDASALQGGTSVAQIKAQLKEIEAEKVE LELKVSSTTSELTKKSEEVFQLQEQINKQGLEIESLKTVSHEAEVHAESLQQKLESSQ LQIAGLEHLRELQPKLDELQKLISKKEEDVSYLSGQLSEKEAALTKIQTEIIEQEDLI KALHTQLEMQAKEHDERIKQLQVELCEMKQKPEEIGEESRAKQQIQRKLQAALISRKE ALKENKSLQEELSLARGTIERLTKSLADVESQVSAQNKEKDTVLGRLALLQEERDKLI TEMDRSLLENQSLSSSCESLKLALEGLTEDKEKLVKEIESLKSSKIAESTEWQEKHKE LQKEYEILLQSYENVSNEAERIQHVVEAVRQEKQELYGKLRSTEANKKETEKQLQEAE QEMEEMKEKMRKFAKSKQQKILELEEENDRLRAEVHPAGDTAKECMETLLSSNASMKE ELERVKMEYETLSKKFQSLMSEKDSLSEEVQDLKHQIEDNVSKQANLEATEKHDNQTN VTEEGTQSIPGETEEQDSLSMSTRPTCSESVPSAKSANPAVSKDFSSHDEINNYLQQI DQLKERIAGLEEEKQKNKEFSQTLENEKNTLLSQISTKDGELKMLQEEVTKMNLLNQQ IQEELSRVTKLKETAEEEKDDLEERLMNQLAELNGSIGNYCQDVTDAQIKNELLESEM KNLKKCVSELEEEKQQLVKEKTKVESEIRKEYLEKIQGAQKEPGNKSHAKELQELLKE KQQEVKQLQKDCIRYQEKISALERTVKALEFVQTESQKDLEITKENLAQAVEHRKKAQ AELASFKVLLDDTQSEAARVLADNLKLKKELQSNKESVKSQMKQKDEDLERRLEQAEE KHLKEKKNMQEKLDALRREKVHLEETIGEIQVTLNKKDKEVQQLQENLDSTVTQLAAF TKSMSSLQDDRDRVIDEAKKWERKFSDAIQSKEEEIRLKEDNCSVLKDQLRQMSIHME ELKINISRLEHDKQIWESKAQTEVQLQQKVCDTLQGENKELLSQLEETRHLYHSSQNE LAKLESELKSLKDQLTDLSNSLEKCKEQKGNLEGIIRQQEADIQNSKFSYEQLETDLQ ASRELTSRLHEEINMKEQKIISLLSGKEEAIQVAIAELRQQHDKEIKELENLLSQEEE ENIVLEEENKKAVDKTNQLMETLKTIKKENIQQKAQLDSFVKSMSSLQNDRDRIVGDY QQLEERHLSIILEKDQLIQEAAAENNKLKEEIRGLRSHMDDLNSENAKLDAELIQYRE DLNQVITIKDSQQKQLLEVQLQQNKELENKYAKLEEKLKESEEANEDLRRSFNALQEE KQDLSKEIESLKVSISQLTRQVTALQEEGTLGLYHAQLKVKEEEVHRLSALFSSSQKR IAELEEELVCVQKEAAKKVGEIEDKLKKELKHLHHDAGIMRNETETAEERVAELARDL VEMEQKLLMVTKENKGLTAQIQSFGRSMSSLQNSRDHANEELDELKRKYDASLKELAQ LKEQGLLNRERDALLSETAFSMNSTEENSLSHLEKLNQQLLSKDEQLLHLSSQLEDSY NQVQSFSKAMASLQNERDHLWNELEKFRKSEEGKQRSAAQPSTSPAEVQSLKKAMSSL QNDRDRLLKELKNLQQQYLQINQEITELHPLKAQLQEYQDKTKAFQIMQEELRQENLS WQHELHQLRMEKSSWEIHERRMKEQYLMAISDKDQQLSHLQNLIRELRSSSSQTQPLK VQYQRQASPETSASPDGSQNLVYETELLRTQLNDSLKEIHQKELRIQQLNSNFSQLLE EKNTLSIQLCDTSQSLRENQQHYGDLLNHCAVLEKQVQELQAGPLNIDVAPGAPQEKN GVHRKSDPEELREPQQSFSEAQQQLCNTRQEVNELRKLLEEERDQRVAAENALSVAEE QIRRLEHSEWDSSRTPIIGSCGTQEQALLIDLTSNSCRRTRSGVGWKRVLRSLCHSRT RVPLLAAIYFLMIHVLLILCFTGHL" BASE COUNT 3794 a 1869 c 2433 g 2204 t ORIGIN 1 aactgctagt ggctgagtcc ctggcggggc gcggcggtgg aaggtgtcgc gtacgggctt 61 cccgagctga cgtggcttga attgggaggg gggcagctgg agcctcaggc ggcagcgctt 121 ctagaaatgc tgagccgatt atcaggatta gcaaatgttg ttttgcatga attatcagga 181 gatgatgaca ctgatcagaa tatgagggct cccctagacc ctgaattaca ccaagaatct 241 gacatggaat ttaataatac tacacaagaa gatgttcagg agcgcctggc ttatgcagag 301 caattggtgg tggagctaaa agatattatt agacagaagg atgttcaact gcagcagaaa 361 gatgaagctc tacaggaaga gagaaaagct gctgataaca aaattaaaaa actaaaactt 421 catgcgaagg ccaaattaac ttctttgaat aaatacatag aagaaatgaa agcacaagga 481 gggactgttc tgcctacaga acctcagtca gaggagcaac tttccaagca tgacaagagt 541 tctacagagg aagagatgga aatagaaaag ataaaacata agctccagga gaaggaggaa 601 ctaatcagca ctttgcaagc ccagcttact caggcacagg cagaacaacc tgcacagagt 661 tctacagaga tggaagaatt tgtaatgatg aagcaacagc tccaggagaa ggaagaattc 721 attagcactt tacaagccca gctcagccag acacaggcag agcaagctgc acagcaggtg 781 gtccgagaga aagatgcccg ctttgaaaca caagttcgtc ttcatgaaga tgagcttctt 841 cagttagtaa cccaggcaga tgtggaaaca gagatgcaac agaaattgag ggtgctgcaa 901 aggaagcttg aggaacacga agaatccttg gtgggccgtg ctcaggtcgt tgacttgctg 961 caacaggagc tgactgctgc tgagcagaga aaccagattc tctctcagca gttacagcag 1021 atggaagctg agcataatac tttgaggaac actgtggaaa cagaaagaga ggagtccaag 1081 attctactgg aaaagatgga acttgaagtg gcagagagaa aattatcctt ccataatctg 1141 caggaagaaa tgcatcatct tttagaacag tttgagcaag caggccaagc ccaggctgaa 1201 ctagagtctc ggtatagtgc tttggagcag aagcacaaag cagaaatgga agagaagacc 1261 tctcatattt tgagtcttca aaagactgga caagagctgc agtctgcctg tgatgctcta 1321 aaggatcaaa attcaaagct tctccaagat aagaatgaac aggcagttca gtcagcccag 1381 accattcagc aactggaaga tcagctccag caaaaatcca aagaaattag ccaatttcta 1441 aatagactgc ccttgcaaca acatgaaaca gcatctcaga cttctttccc agatgtttat 1501 aatgagggca cacaggcagt cactgaggag aatattgctt ctttgcagaa gagagtggta 1561 gaactagaga atgaaaaggg agccttgctc cttagttcta tagagctgga ggagctgaaa 1621 gctgagaatg aaaaactgtc ttctcagatt actctcctag aggctcagaa tagaactggg 1681 gaggcagaca gagaagtcag tgagatcagc attgttgata ttgccaacaa gaggagctct 1741 tctgctgagg aaagtggaca agatgttcta gaaaacacat tttctcagaa acataaagaa 1801 ttatcagttt tattgttgga aatgaaagaa gctcaagagg aaattgcatt tcttaaatta 1861 cagctccagg gaaaaagggc tgaggaagca gatcatgagg tccttgacca gaaagaaatg 1921 aaacagatgg agggtgaggg aatagctcca attaaaatga aagtatttct tgaagataca 1981 gggcaagatt ttcccttaat gccaaatgaa gagagcagtc ttccagcagt tgaaaaagaa 2041 caggcgagca ctgaacatca aagtagaaca tctgaggaaa tatctttaaa tgatgctgga 2101 gtagaattga aatcaacaaa gcaggatggt gataaatccc tttctgctgt accagatatt 2161 ggtcagtgtc atcaggatga gttggaaagg ttaaaaagtc aaattttgga gctcgagcta 2221 aactttcata aagcacaaga aatctatgag aaaaatttag atgagaaagc taaggaaatt 2281 agcaacctaa accagttgat tgaggagttt aagaaaaatg ctgacaacaa cagcagtgca 2341 ttcactgctt tgtctgaaga aagagaccag cttctctctc aggtgaagga acttagcatg 2401 gtaacagaat tgagggctca ggtaaagcaa ctggaaatga accttgcaga agcagaaagg 2461 caaagaagac ttgattatga aagccaaact gcccatgaca acctgctcac tgaacagatc 2521 catagtctca gcatagaagc caaatctaaa gatgtgaaaa ttgaagtttt acagaatgaa 2581 ctggatgatg tgcagcttca gttttctgag cagagtaccc tgataagaag cctgcaaagc 2641 cagctgcaaa ataaggaaag tgaagtgctt gagggggcag aacgtgtaag gcatatctca 2701 agtaaagtgg aagaactgtc ccaggctctt tcacagaagg aacttgaaat aacaaaaatg 2761 gatcagctct tactagagaa aaagagagat gtggaaaccc tccaacaaac catcgaggag 2821 aaggatcaac aagtgacaga aatcagcttt agtatgactg agaaaatggt tcagcttaat 2881 gaagagaagt tttctcttgg ggttgaaatt aagactctta aagaacagct aaatttatta 2941 tccagagctg aggaagcaaa aaaagagcag gtggaagaag ataatgaagt ttcttctggc 3001 cttaaacaaa attatgatga gatgagccca gcaggacaaa taagtaagga agaacttcag 3061 catgaatttg accttctgaa gaaagaaaat gagcagagaa agagaaagct ccaggcagct 3121 cttattaaca gaaaggagct tctgcaaaga gtcagtagat tggaagaaga attagccaac 3181 ttgaaagatg aatctaagaa agaaatccca ctcagtgaga ctgagagggg agaagtggaa 3241 gaagataaag aaaacaaaga atactcagaa aaatgtgtga cttctaagtg ccaagaaata 3301 gaaatttatt taaaacagac aatatctgag aaagaagtgg aactacagca tataaggaag 3361 gatttggaag aaaagctggc agctgaagag caattccagg ctctggtcaa acagatgaat 3421 cagaccttgc aagataaaac aaaccaaata gatttgctcc aagcagaaat cagtgaaaac 3481 caagcaatta tccagaagtt aatcacaagt aacacggatg caagtgatgg ggactccgta 3541 gcacttgtaa aggaaacagt ggtgataagt ccaccttgta caggtagtag tgaacactgg 3601 aaaccagaac tagaagaaaa gatactggcc cttgaaaaag aaaaggagca acttcaaaag 3661 aagctacagg aagccttaac ctcccgcaag gcaattctta aaaaggcaca ggagaaagaa 3721 agacatctca gggaggagct aaagcaacag aaagatgact ataatcgctt gcaagaacag 3781 tttgatgagc aaagcaagga aaatgagaat attggagacc agctaaggca actccagatt 3841 caagtaaggg aatccataga cggaaaactc ccaagcacag accagcagga atcgtgttct 3901 tccactccag gtttagaaga acctttattc aaagccacag aacagcatca cactcaacct 3961 gttttagagt ccaacttgtg cccagactgg ccttctcatt ctgaagatgc gagtgctctg 4021 cagggcggaa cttctgttgc ccagattaag gcccagctga aggaaataga ggctgagaaa 4081 gtagagttag aattgaaagt tagttctaca acaagtgagc ttactaaaaa atcagaagag 4141 gtatttcagt tacaagagca gataaataaa cagggtttag aaatcgagag tctaaagaca 4201 gtatcccatg aagctgaagt ccatgccgaa agcctgcagc agaaattgga aagcagccaa 4261 ctacaaattg ctggcctaga acatctaaga gaattgcaac ctaaactgga tgaactgcaa 4321 aaactcataa gcaaaaagga agaagacgtt agctaccttt ctggacaact tagtgagaaa 4381 gaagcagctc tcactaaaat acagacagag ataatagaac aagaagattt aattaaggct 4441 ctgcatacac agctagaaat gcaagccaaa gagcatgatg agaggataaa gcagctacag 4501 gtggaacttt gtgaaatgaa gcaaaaacca gaagagattg gagaagaaag tagagcaaag 4561 caacaaatac aaaggaaact gcaagctgcc cttatttccc gaaaagaagc actaaaagaa 4621 aacaaaagtc tccaagagga attgtctttg gccagaggta ccattgaacg tctcaccaag 4681 tctctggcag atgtggaaag ccaagtttct gctcaaaata aagaaaaaga tacggtctta 4741 ggaaggttag ctcttcttca agaagaaaga gacaaactca ttacagaaat ggacaggtct 4801 ttattggaaa atcagagtct cagcagctcc tgtgaaagtc taaaactagc tctagagggt 4861 cttactgaag acaaggaaaa gttagtgaag gaaattgaat ctttgaaatc ttctaagatt 4921 gcagaaagta ctgagtggca agagaaacac aaggagctac aaaaagagta tgaaattctt 4981 ctgcagtcct atgagaatgt tagtaatgaa gcagaaagga ttcagcatgt ggtggaagct 5041 gtgaggcaag agaaacaaga actgtatggc aagttaagaa gcacagaggc aaacaagaag 5101 gagacagaaa agcagttgca ggaagctgag caagaaatgg aggaaatgaa agaaaagatg 5161 agaaagtttg ctaaatctaa acagcagaaa atcctagagc tggaagaaga gaatgaccgg 5221 cttagggcag aggtgcaccc tgcaggagat acagctaaag agtgtatgga aacacttctt 5281 tcttccaatg ccagcatgaa ggaagaactt gaaagggtca aaatggagta tgaaaccctt 5341 tctaagaagt ttcagtcttt aatgtctgag aaagactctc taagtgaaga ggttcaagat 5401 ttaaagcatc agatagaaga taatgtatct aaacaagcta acctagaggc caccgagaaa 5461 catgataacc aaacgaatgt cactgaagag ggaacacagt ctataccagg tgagactgaa 5521 gagcaagact ctctgagtat gagcacaaga cctacatgtt cagaatcggt tccatcagcg 5581 aagagtgcca accctgctgt aagtaaggat ttcagctcac atgatgaaat taataactac 5641 ctacagcaga ttgatcagct caaagaaaga attgctggat tagaggagga gaagcagaaa 5701 aacaaggaat ttagccagac tttagaaaat gagaaaaata ccttactgag tcagatatca 5761 acaaaggatg gtgaactaaa aatgcttcag gaggaagtaa ccaaaatgaa cctgttaaat 5821 cagcaaatcc aagaagaact ctccagagtt accaaactaa aggagacagc agaagaagag 5881 aaagatgatt tggaagagag gcttatgaat caattagcag aacttaatgg aagcattggg 5941 aattactgtc aggatgttac agatgcccaa ataaaaaatg agctattgga atctgaaatg 6001 aagaacctta aaaagtgtgt gagtgaattg gaagaagaaa agcagcagtt agtcaaggaa 6061 aaaactaagg tggaatcaga aatacgaaag gaatatttgg agaaaataca aggtgctcag 6121 aaagaacccg gaaataaaag ccatgcaaag gaacttcagg aactgttaaa agaaaaacaa 6181 caagaagtaa agcagctaca gaaggactgc atcaggtatc aagagaaaat tagtgctctg 6241 gagagaactg ttaaagctct agaatttgtt caaactgaat ctcaaaaaga tttggaaata 6301 accaaagaaa atctggctca agcagttgaa caccgcaaaa aggcacaagc agaattagct 6361 agcttcaaag tcctgctaga tgacactcaa agtgaagcag caagggtcct agcagacaat 6421 ctcaagttga aaaaggaact tcagtcaaat aaagaatcag ttaaaagcca gatgaaacaa 6481 aaggatgaag atcttgagcg aagactggaa caggcagaag agaagcacct gaaagagaag 6541 aagaatatgc aagagaaact ggatgctttg cgcagagaaa aagtccactt ggaagagaca 6601 attggagaga ttcaggttac tttgaacaag aaagacaagg aagttcagca acttcaggaa 6661 aacttggaca gtactgtgac ccagcttgca gcctttacta agagcatgtc ttccctccag 6721 gatgatcgtg acagggtgat agatgaagct aagaaatggg agaggaagtt tagtgatgcg 6781 attcaaagca aagaagaaga aattagactc aaagaagata attgcagtgt tctaaaggat 6841 caacttagac agatgtccat ccatatggaa gaattaaaga ttaacatttc caggcttgaa 6901 catgacaagc agatttggga gtccaaggcc cagacagagg tccagcttca gcagaaggtc 6961 tgtgatactc tacaggggga aaacaaagaa cttttgtccc agctagaaga gacacgccac 7021 ctataccaca gttctcagaa tgaattagct aagttggaat cagaacttaa gagtctcaaa 7081 gaccagttga ctgatttaag taactcttta gaaaaatgta aggaacaaaa aggaaacttg 7141 gaagggatca taaggcagca agaggctgat attcaaaatt ctaagttcag ttatgaacaa 7201 ctggagactg atcttcaggc ctccagagaa ctgaccagta ggctgcatga agaaataaat 7261 atgaaagagc aaaagattat aagcctgctt tctggcaagg aagaggcaat ccaagtagct 7321 attgctgaac tgcgtcagca acatgataaa gaaattaaag agctggaaaa cctgctgtcc 7381 caggaggaag aggagaatat tgttttagaa gaggagaaca aaaaggctgt tgataaaacc 7441 aatcagctta tggaaacact gaaaaccatc aaaaaggaaa acattcagca aaaggcacag 7501 ttggattcct ttgttaaatc catgtcttct ctccaaaatg atcgagaccg catagtgggt 7561 gactatcaac agctggaaga gcgacatctc tctataatct tggaaaaaga ccaactcatc 7621 caagaggctg ctgcagagaa taataagctt aaagaagaaa tacgaggctt gagaagtcat 7681 atggatgatc tcaattctga gaatgccaag ctagatgcag aactgatcca atatagagaa 7741 gacctgaacc aagtgataac aataaaggac agccaacaaa agcagcttct tgaagttcaa 7801 cttcagcaaa ataaggagct ggaaaataaa tatgctaaat tagaagaaaa gctgaaggaa 7861 tctgaggaag caaatgagga tctgcggagg tcctttaatg ccctacaaga agagaaacaa 7921 gatttatcta aagagattga gagtttgaaa gtatctatat cccagctaac aagacaagta 7981 acagccttgc aagaagaagg tactttagga ctctatcatg cccagttaaa agtaaaagaa 8041 gaagaggtac acaggttaag tgctttgttt tcctcctctc aaaagagaat tgcagaactg 8101 gaagaagaat tggtttgtgt tcaaaaggaa gctgccaaga aggtaggtga aattgaagat 8161 aaactgaaga aagaattaaa gcatcttcat catgatgcag ggataatgag aaatgaaact 8221 gaaacagcag aagagagagt ggcagagcta gcaagagatt tggtggagat ggaacagaaa 8281 ttactcatgg tcaccaaaga aaataaaggt ctcacagcac aaattcagtc ttttggaagg 8341 tctatgagtt ccttgcaaaa tagtagagat catgccaatg aggaacttga tgaactgaaa 8401 aggaaatatg atgccagtct gaaggaattg gcacagttga aagaacaggg actcttaaac 8461 agagagagag atgctcttct ttctgaaacc gccttttcaa tgaactccac tgaggagaat 8521 agcttgtctc accttgagaa acttaaccaa cagctcctat ccaaagatga gcaattgctt 8581 cacttgtcct cacaactaga agattcttat aaccaagtgc agtccttttc caaggctatg 8641 gccagtctgc agaatgagag agatcacctg tggaatgagc tggagaaatt tcgaaagtca 8701 gaggaaggga agcagaggtc tgcagctcag ccttccacca gcccagctga agtacagagt 8761 ttaaaaaaag ctatgtcttc actccaaaat gacagagaca gactactgaa ggaattgaag 8821 aatctgcagc agcaatactt acagattaat caagagatca ctgagttaca tccactgaag 8881 gctcaacttc aggagtatca agataagaca aaagcatttc agattatgca agaagagctc 8941 aggcaggaaa acctctcctg gcagcatgag ctgcatcagc tcaggatgga gaagagttcc 9001 tgggaaatac atgagaggag aatgaaggaa cagtacctta tggctatctc agataaagat 9061 cagcagctca gtcatctgca gaatcttata agggaattga ggtcttcttc ctcccagact 9121 cagcctctca aagtgcaata ccaaagacag gcatccccag agacatcagc ttccccagat 9181 gggtcacaaa atctggttta tgagacagaa cttctcagga cccagctcaa tgacagctta 9241 aaggaaattc accaaaagga gttaagaatt cagcaactga acagcaactt ctctcagcta 9301 ctggaagaga aaaacaccct ttccattcag ctctgcgata ccagtcagag tcttcgtgag 9361 aaccagcagc actatggtga ccttttaaat cactgtgcag tcttggagaa gcaggttcaa 9421 gagctgcagg cggggccact aaatatagat gttgctccag gagctcccca ggaaaagaat 9481 ggagttcaca gaaagagtga ccctgaggaa ctaagggaac cgcagcaaag cttttctgaa 9541 gctcagcagc agctatgcaa caccagacag gaagtgaatg aattaaggaa gctgctggaa 9601 gaagaacgag accaaagagt ggctgctgag aatgctctct ctgtggccga ggagcagatc 9661 agacggttag agcacagtga atgggactct tcccggactc ctatcattgg ctcctgtggc 9721 actcaggagc aggcactgtt aatagatctt acaagcaaca gttgtcgaag gacccggagt 9781 ggcgttggat ggaagcgagt cctgcgttca ctctgtcatt cacggacccg agtgccactt 9841 ctagcagcca tctactttct aatgattcat gtcctgctca ttctgtgttt tacgggccat 9901 ctatagactt agttgttact ctttggacca ctcccttcaa aacttggaat tctctcacct 9961 ctaacatcag aacatcaatt ccagtggaac agtcttccca tttacaggtc ttctctccaa 10021 ctcttcacgg aaagtgcctg caaaaacaga ggtggatacg aggacaggtt ggagctgcag 10081 ggactggcga gtctgctttc ttctactgcc ctgagcctga acgcttctgc ttaatctgag 10141 aatcacattt ggtttgttga gcctaatatt tgttgagatt ttgcaggacc ctgatctttt 10201 gtggtcctgt aaaagatact gaggaatgtc tttcagccaa gccaagagga tggtttcaat 10261 aaacctaata atctgaagtt cagctttttt tttttttttt // LOCUS HSMACHR 1386 bp DNA PRI 12-SEP-1993 DEFINITION Human gene for M1 muscarinic acetylcholine receptor. ACCESSION X52068 NID g34450 KEYWORDS acetylcholine receptor; G protein-coupled receptor; M1 muscarinic acetylcholine receptor; muscarinic acetylcholine receptor; receptor; transmembrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1386) AUTHORS Chapman,C.G. TITLE Direct Submission JOURNAL Submitted (08-MAR-1990) Chapman C.G., Smithkline Beecham Pharmaceuticals, Biosciences Research Centre, Great Burgh, Yew Tree Bottom Road, Epsom Surrey KT18 5XQ, U K REFERENCE 2 (bases 1 to 1386) AUTHORS Chapman,C.G. and Browne,M.J. TITLE Isolation of the human ml (Hml) muscarinic acetylcholine receptor gene by PCR amplification JOURNAL Nucleic Acids Res. 18 (8), 2191 (1990) MEDLINE 90245684 COMMENT Data kindly reviewed (05-APR-1991) by Chapman C.G. FEATURES Location/Qualifiers source 1..1386 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lymphocyte" CDS 1..1383 /note="M1 muscarinic acetylcholine receptor (AA 1-460)" /codon_start=1 /db_xref="PID:g34451" /db_xref="SWISS-PROT:P11229" /translation="MNTSAPPAVSPNITVLAPGKGPWQVAFIGITTGLLSLATVTGNL LVLISFKVNTELKTVNNYFLLSLACADLIIGTFSMNLYTTYLLMGHWALGTLACDLWL ALDYVASNASVMNLLLISFDRYFSVTRPLSYRAKRTPRRAALMIGLAWLVSFVLWAPA ILFWQYLVGERTVLAGQCYIQFLSQPIITFGTAMAAFYLPVTVMCTLYWRIYRETENR ARELAALQGSETPGKGGGSSSSSERSQPGAEGSPETPPGRCCRCCRAPRLLQAYSWKE EEEEDEGSMESLTSSEGEEPGSEVVIKMPMVDPEAQAPTKQPPRSSPNTVKRPTKKGR DRAGKGQKPRGKEQLAKRKTFSLVKEKKAARTLSAILLAFILTWTPYNIMVLVSTFCK DCVPETLWELGYWLCYVNSTINPMCYALCNKAFRDTFRLLLLCRWDKRRWRKIPKRPG SVHRTPSRQC" BASE COUNT 280 a 462 c 384 g 260 t ORIGIN 1 atgaacactt cagccccacc tgctgtcagc cccaacatca ccgtcctggc accaggaaag 61 ggtccctggc aagtggcctt cattgggatc accacgggcc tcctgtcgct agccacagtg 121 acaggcaacc tgctggtact catctctttc aaggtcaaca cggagctcaa gacagtcaat 181 aactacttcc tgctgagcct ggcctgtgct gacctcatca tcggtacctt ctccatgaac 241 ctctatacca cgtacctgct catgggccac tgggctctgg gcacgctggc ttgtgacctc 301 tggctggccc tggactatgt ggccagcaat gcctccgtca tgaatctgct gctcatcagc 361 tttgaccgct acttctccgt gactcggccc ctgagctacc gtgccaagcg cacaccccgc 421 cgggcagctc tgatgatcgg cctggcctgg ctggtttcct ttgtgctctg ggccccagcc 481 atcctcttct ggcagtacct ggtaggggag cggacagtgc tagctgggca gtgctacatc 541 cagttcctct cccagcccat catcaccttt ggcacagcca tggctgcctt ctacctccct 601 gtcacagtca tgtgcacgct ctactggcgc atctaccggg agacagagaa ccgagcacgg 661 gagctggcag cccttcaggg ctccgagacg ccaggcaaag ggggtggcag cagcagcagc 721 tcagagaggt ctcagccagg ggctgagggc tcaccagaga ctcctccagg ccgctgctgt 781 cgctgctgcc gggcccccag gctgctgcag gcctacagct ggaaggaaga agaggaagag 841 gacgaaggct ccatggagtc cctcacatcc tcagagggag aggagcctgg ctccgaagtg 901 gtgatcaaga tgccaatggt ggaccccgag gcacaggccc ccaccaagca gcccccacgg 961 agctccccaa atacagtcaa gaggccgact aagaaagggc gtgatcgagc tggcaagggc 1021 cagaagcccc gtggaaagga gcagctggcc aagcggaaga ccttctcgct ggtcaaggag 1081 aagaaggcgg ctcggaccct gagtgccatc ctcctggcct tcatcctcac ctggacaccg 1141 tacaacatca tggtgctggt gtccaccttc tgcaaggact gtgttcccga gaccctgtgg 1201 gagctgggct actggctgtg ctacgtcaac agcaccatca accccatgtg ctacgcactc 1261 tgcaacaaag ccttccggga cacctttcgc ctgctgctgc tttgccgctg ggacaagaga 1321 cgctggcgca agatccccaa gcgccctggc tccgtgcacc gcactccctc ccgccaatgc 1381 tgatag // LOCUS HSMAFTRAN 1631 bp RNA PRI 08-SEP-1997 DEFINITION Homo sapiens mRNA for transcription factor, Maf. ACCESSION Y11514 NID g2344815 KEYWORDS Maf gene; transcription factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1631) AUTHORS Marini,M.G., Chan,K., Casula,L., Kan,Y.W., Cao,A. and Moi,P. TITLE hMAF, a small human transcription factor that heterodimerizes specifically with Nrf1 and Nrf2 JOURNAL J. Biol. Chem. 272 (26), 16490-16497 (1997) MEDLINE 97341189 REFERENCE 2 (bases 1 to 1631) AUTHORS Moi,P. TITLE Direct Submission JOURNAL Submitted (25-FEB-1997) P. Moi, Universit di Cagliari, Ist. di Clin. Biologia dellEt Evol., Ospedale Regionale per le Microcitemie, Via Jenner S/N, 09121, ITALY FEATURES Location/Qualifiers source 1..1631 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="K562" /dev_stage="adult" mRNA 1..1631 /evidence=experimental /product="transcription factor" gene 107..595 /gene="Maf" CDS 107..595 /gene="Maf" /note="heterodimerizes specifically with Nrf1 and Nrf2" /codon_start=1 /product="transcription factor" /db_xref="PID:e325387" /db_xref="PID:g2344816" /translation="MTTPNKGNKALKVKREPGENGTSLTDEELVTMSVRELNQHLRGL SKEEIVQLKQRRRTLKNRGYAASCRVKRVTQKEELEKQKAELQQEVEKLASENASMKL ELDALRSKYEALQTFARTVARSPVAPARGPLAAGLGPLVPGKVAATSVITIVKSKTDA RS" BASE COUNT 394 a 415 c 485 g 337 t ORIGIN 1 cctggttctc taggggcagg gtggacagga agcagctcag cccagcctgg gagaggccaa 61 gggctgcctc ctatcagaga gcacctgctc gctgtgcccc cgggttatga cgacccccaa 121 taaaggaaac aaggccttga aggtgaagcg ggagccgggt gagaatggca ccagcctgac 181 ggatgaggag ctggtgacca tgtcggtgcg ggagctgaac cagcacctgc ggggcctgtc 241 caaggaggag atcgtccagc tgaagcagcg ccggcgcacg ctcaagaacc gcggctacgc 301 tgccagctgc cgcgtgaagc gggtgacgca gaaggaggag ctggagaagc agaaggcgga 361 gctgcagcag gaggtggaga agctggcctc agagaacgcc agcatgaagc tggagctcga 421 cgcgctgcgc tccaagtacg aggcgctgca gaccttcgcc cggacggtgg cccgcagccc 481 cgtggcgcca gcccggggcc cccttgccgc cggcctgggg cccctcgtcc caggcaaggt 541 ggccgccacc agcgtcatca caatagtaaa gtccaagacg gatgcccgat cgtagggacg 601 cgcgtctgcc caggcgggtc tttgcggggc cactaggcac atggcgaatt tagctgccct 661 gtccctctgt ttccttctct tctctttcct ccctctcttc cccacccttc tctcttccct 721 gaaagcacaa cctgtacccc aggggcgccg ggctgagccc ctttgatctc gtcatgtcgt 781 cgtgtgtttg tatgttggat tggtcagttc ggcggtgacg tgggtcgccc caaccccttt 841 tgtccagggc catgcaggct tggagtccag agttggtgct gtggaacgga ctagagagag 901 ttgcggagag agaaggagag gcacgctggg cctcgcgtgt ccccgagcag tgagggtccc 961 agtgttccct ccactcccga gtggccacag gctcgcgggc tgggaaggct tcactctctt 1021 tagccccagg ggagcagctc agcttagccc agcatgaaga gatgggctct gctctgagag 1081 tagggcgggc ttgaaggccc tgatgggtgg accaccagcc tgggcgcagt ggtgctgggg 1141 cgtgcagctg ggcccagggg ctgtgactca ggcctgacac cgttgcactg aacaagacca 1201 aatcgctggt tgtgcgctta acgtgagggt gggtccagtg tgccctgcga tgggtcccgt 1261 gtcactgttt acatgaccta tttgtgtggt tatatagccc tttatttaaa agagagaagt 1321 tccttttaca aagttattaa attaattata tgtttaaaag ttaaagaaaa aagagctgca 1381 gagtatttat aaaactgtct tttagaaaaa aaacaagcaa gaagaccatt tgaccatatg 1441 aatggaaaag ggaagaaagt ataatagaaa ctttgctagt taaaaaaaaa aagaaaaaaa 1501 aagaaaaaaa atccctttct tgtaaactta cggacacctc tttgtggctg ttggagttta 1561 gtttttatat acacagagtt atcagacatt atttataaaa cttagtttaa aaaaaaaaaa 1621 aaaaaaaaaa a // LOCUS HSMAGEXP 1866 bp RNA PRI 13-SEP-1995 DEFINITION H.sapiens mRNA for MAGE-Xp. ACCESSION X82539 NID g608992 KEYWORDS MAGE-Xp gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1866) AUTHORS Muscatelli,F., Walker,A.P., De Plaen,E., Stafford,A.N. and Monaco,A.P. TITLE Isolation and characterization of a MAGE gene family in the Xp21.3 region JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (11), 4987-4991 (1995) MEDLINE 95281581 REFERENCE 2 (bases 1 to 1866) AUTHORS Monaco,A.P. TITLE Direct Submission JOURNAL Submitted (07-NOV-1994) A.P. Monaco, Imperial Cancer Research Fund Lab., Inst.of Molecular Medicine, John Radcliffe Hospital, Headington, Oxford OX3 9DU, UK FEATURES Location/Qualifiers source 1..1866 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="testes" /clone_lib="testes cDNA (Clontech)" /clone="5H-4, 6H-1, 7H-3" /chromosome="X" /map="Xp21.3" mRNA join(1..73,74..176,177..260,261..1864) /gene="MAGE-Xp" gene 1..1864 /gene="MAGE-Xp" exon 1..73 /gene="MAGE-Xp" /number=1 exon 74..176 /gene="MAGE-Xp" /number=2 exon 177..260 /gene="MAGE-Xp" /number=3 exon 261..1864 /gene="MAGE-Xp" /number=4 CDS 324..1367 /gene="MAGE-Xp" /codon_start=1 /db_xref="PID:g608993" /db_xref="SWISS-PROT:P43366" /translation="MPRGQKSKLRAREKRRKAREETQGLKVRHATAAEKEECPSSSPV LGDTPTSSPAAGIPQKPQGAPPTTTAAAAVSCTESDEGAKCQGEENASFSQATTSTES SVKDPVAWEAGMLMHFILRKYKMREPIMKADMLKVVDEKYKDHFTEILNGASRRLELV FGLDLKEDNPSSHTYTLVSKLNLTNDGNLSNDWDFPRNGLLMPLLGVIFLKGNSATEE EIWKFMNVLGAYDGEEHLIYGEPRKFITQDLVQEKYLKYEQVPNSDPPRYQFLWGPRA YAETTKMKVLEFLAKMNGATPRDFPSHYEEALRDEEERAQVRSSVRARRRTTATTFRA RSRAPFSRSSHPM" BASE COUNT 504 a 424 c 457 g 481 t ORIGIN 1 gagtgttgca actgggcctg gcatgtttca gcgtggtgtc cagcagtgtc tcccactcct 61 tgtgaagtct gaggttgcaa aaggactgtg atcatatgaa gatcatccag gagtacaact 121 cgaaattctc agaaaacagg accttgatgt gagaggagca ggttcaggta aacaaagggc 181 gaggacccga gcgagcttaa ggccagtggg gtgcagcgtc tggtcagccg agggtgaatt 241 ctcaggactg gtcgggagtc aaggtgccac atctcctgcc tttctgctca ctttcctgcc 301 tgttttgcct gaccacagcc atcatgcctc ggggtcagaa gagtaagctc cgtgctcgtg 361 agaaacgccg caaggcgcga gaggagaccc agggtctcaa ggttcgtcac gccactgcag 421 cagagaaaga ggagtgcccc tcctcctctc ctgttttagg ggatactccc acaagctccc 481 ctgctgctgg cattccccag aagcctcagg gagctccacc caccaccact gctgctgcag 541 ctgtgtcatg taccgaatct gacgaaggtg ccaaatgcca aggtgaggaa aatgcaagtt 601 tctcccaggc cacaacatcc actgagagct cagtcaaaga tcctgtagcc tgggaggcag 661 gaatgctgat gcacttcatt ctacgtaagt ataaaatgag agagcccatt atgaaggcag 721 atatgctgaa ggttgttgat gaaaagtaca aggatcactt cactgagatc ctcaatggag 781 cctctcgccg cttggagctc gtctttggcc ttgatttgaa ggaagacaac cctagtagcc 841 acacctacac cctcgtcagt aagctaaacc tcaccaatga tggaaacctg agcaatgatt 901 gggactttcc caggaatggg cttctgatgc ctctcctggg tgtgatcttc ttaaagggca 961 actctgccac cgaggaagag atctggaaat tcatgaatgt gttgggagcc tatgatggag 1021 aggagcactt aatctatggg gaaccccgta agttcatcac ccaagatctg gtgcaggaaa 1081 aatatctgaa gtacgagcag gtgcccaaca gtgatccccc acgctatcaa ttcctatggg 1141 gtccgagagc ctatgctgaa accaccaaga tgaaagtcct cgagtttttg gccaagatga 1201 atggtgccac tccccgtgac ttcccatccc attatgaaga ggctttgaga gatgaggaag 1261 agagagccca agtccgatcc agtgttagag ccaggcgtcg cactactgcc acgactttta 1321 gagcgcgttc tagagcccca ttcagcaggt cctcccaccc catgtgagaa ctcaggcaga 1381 ttgttcactt tgtttttgtg gcaagatgcc aaccttttga agtagtgagc agccaagata 1441 tggctagaga gatcatcata tatatctcct ttgtgttcct gttaaacatt agtatctttc 1501 aagtgttttt cttttaatag aatgtttatt tagagttggg atctatgtct atgagcgaca 1561 tggatcacac atttattggt gctgccagct ttaagcataa gagttttgat attctatatt 1621 tttcaaatcc ttgaatcttt tttgggttga agaagaagaa agcatagctt tagaatagag 1681 attttctcag aaatgtgtga agaacctcac acaacataat tggagtctta aaatagagga 1741 agagtaagca aagcatgtca agtttttgtt ttctgcattc agttttgttt ttgtaaaatc 1801 caaagataca tacctggttg tttttagcct tttcaagaat gcagataaaa taaatagtaa 1861 taaatt // LOCUS HSMALA 462 bp RNA PRI 18-APR-1995 DEFINITION H.sapiens MAL-a mRNA. ACCESSION X76678 NID g435477 KEYWORDS Mal gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 462) AUTHORS Alonso,M. TITLE Direct Submission JOURNAL Submitted (09-DEC-1993) M. Alonso, Centro de Biologia Molecular, Universidad Autonoma, Cantoblanco, 28049 Madrid, SPAIN REFERENCE 2 (bases 1 to 462) AUTHORS Rancano,C., Rubio,T. and Alonso,M.A. TITLE Alternative splicing of human T-cell-specific MAL mRNA and its correlation with the exon/intron organization of the gene JOURNAL Genomics 21 (2), 447-450 (1994) MEDLINE 94375076 FEATURES Location/Qualifiers source 1..462 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T-cells" /cell_line="human thymocytes" /chromosome="2" gene 1..462 /gene="MAL-a" CDS 1..462 /gene="MAL-a" /codon_start=1 /db_xref="PID:g435478" /db_xref="SWISS-PROT:P21145" /translation="MAPAAATGGSTLPSGFSVFTTLPDLLFIFEFIFGGLVWILVASS LVPWPLVQGWVMFVSVFCFVATTTLIILYIIGAHGGETSWVTLDAAYHCTAALFYLSA SVLEALATITMQDGFTYRHYHENIAAVVFSYIATLLYVVHAVFSLIRWKSS" BASE COUNT 72 a 151 c 119 g 120 t ORIGIN 1 atggcccccg cagcggcgac ggggggcagc accctgccca gtggcttctc ggtcttcacc 61 accttgcccg acttgctctt catctttgag tttatcttcg ggggcctggt gtggatcctg 121 gtggcctcct ccctggtgcc ctggcccctg gtccagggct gggtgatgtt cgtgtctgtg 181 ttctgcttcg tggccaccac caccttgatc atcctgtaca taattggagc ccacggtgga 241 gagacttcct gggtcacctt ggacgcagcc taccactgca ccgctgccct cttttacctc 301 agcgcctcag tcctggaggc cctggccacc atcacgatgc aagacggctt cacctacagg 361 cactaccatg aaaacattgc tgccgtggtg ttctcctaca tagccactct gctctacgtg 421 gtccatgcgg tgttctcttt aatcagatgg aagtcttcat aa // LOCUS HSMAPKP4 2303 bp RNA PRI 06-MAR-1997 DEFINITION H.sapiens mRNA for MAP kinase phosphatase 4. ACCESSION Y08302 NID g1871538 KEYWORDS dual specificity phosphatase; MAP kinase phosphatase; mitogen-activated protein kinase phosphatase; MPK4 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2303) AUTHORS Muda,M., Boschert,U., Smith,A., Antonsson,B., Gillieron,C., Chabert,C., Camps,M., Martinou,I., Ashworth,A. and Arkinstall,S. TITLE Molecular cloning and functional characterization of a novel mitogen-activated protein kinase phosphatase, MKP-4 JOURNAL J. Biol. Chem. 272 (8), 5141-5151 (1997) MEDLINE 97184169 REFERENCE 2 (bases 1 to 2303) AUTHORS Muda,M. TITLE Direct Submission JOURNAL Submitted (23-SEP-1996) M. Muda, Geneva Biomedical Research Institute, 14 chemin des Aulx, CH-1228 Plan-les-Ouates, Geneva, SWITZERLAND FEATURES Location/Qualifiers source 1..2303 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="lambda gt10" /clone="MKP-4" gene 114..1268 /gene="MKP4" CDS 114..1268 /gene="MKP4" /note="MAP kinase phosphatase; dual specificity phosphatase" /codon_start=1 /product="mitogen-activated protein kinase phosphatase 4" /db_xref="PID:e274633" /db_xref="PID:g1871539" /translation="MEGLGRSCLWLRRELSPPRPRLLLLDCRSRELYESARIGGALSV ALPALLLRRLRRGSLSVRALLPGPPLQPPPPAPVLLYDQGGGRRRRGEAEAEAEEWEA ESVLGTLLQKLREEGYLAYYLQGGFSRFQAECPHLCETSLAGRAGSSMAPVPGPVPVV GLGSLCLGSDCSDAESEADRDSMSCGLDSEGATPPPVGLRASFPVQILPNLYLGSARD SANLESLAKLGIRYILNVTPNLPNFFEKNGDFHYKQIPISDHWSQNLSRFFPEAIEFI DEALSQNCGVLVHCLAGVSRSVTVTVAYLMQKLHLSLNDAYDLVKRKKSNISPNFNFM GQLLDFERSLRLEERHSQEQGSGGQASAASNPPSFFTTPTSDGAFELAPT" polyA_signal 2264..2269 polyA_site 2284 BASE COUNT 360 a 733 c 748 g 462 t ORIGIN 1 cgcttcccgc cgcccgagct tcggaaactt cccggccgcg acgcagggaa ccggcgcgga 61 gaaccgagca gagcggagcg cccgtggtcc agcgtgtagg gagccgatcg cccatggagg 121 gtctgggccg ctcgtgcctg tggctgcgtc gggagctgtc gcccccgcgg ccgcggctcc 181 tgctcctgga ctgccgcagc cgcgagctgt acgagtcggc gcgcatcggt ggggcgctga 241 gcgtggccct gccggcgctc ctgctgcgcc gcctgcggag gggcagcctg tcggtgcgcg 301 cgctcctgcc tgggccgccg ctgcagccgc ccccgcctgc ccccgtgctc ctgtacgacc 361 agggcggggg ccggcgccgg cgcggggagg ccgaggccga ggccgaggag tgggaggccg 421 agtcggtgct gggcaccctg ctgcagaagc tgcgagagga aggctacctg gcctactacc 481 tccagggagg cttcagcaga ttccaggccg agtgccctca cctgtgtgag accagccttg 541 ctggccgtgc cggctccagc atggcgccgg tgcccggtcc agtgcccgtg gtggggttgg 601 gcagcctgtg cctgggctcc gactgctctg atgcggaatc cgaggctgac cgcgactcca 661 tgagctgtgg cctggattcg gagggtgcca cacccccacc agtggggctg cgggcatcct 721 tccctgtcca gatcctgccc aacctctatc tgggcagtgc ccgggattcc gccaatttgg 781 agagcctggc caaactgggc atccgctaca tcctcaatgt cacccccaac ctcccaaact 841 tcttcgagaa gaatggtgac tttcactaca agcagatccc catctccgac cactggagcc 901 agaacctgtc gcggttcttt ccggaggcca ttgagttcat tgatgaggcc ttgtcccaga 961 actgcggggt gctcgtccac tgcttggcgg gggtcagccg ttctgtcacc gtcactgtgg 1021 cctacctcat gcagaagctc cacctctctc tcaacgatgc ctatgacctg gtcaagagga 1081 agaagtctaa catctccccc aacttcaact tcatggggca gttgctggac tttgagcgca 1141 gcttgcggct ggaggagcgc cactcgcagg agcagggcag tggggggcag gcatctgcgg 1201 cctccaaccc gccctccttc ttcaccaccc ccaccagtga tggcgccttc gagctggccc 1261 ccacctaggg ccccgtggcc ggcaggccgg cccctgcccc acccccaccc acgggtgtcc 1321 ctgcccactc gtgtggcaag ggaggggagg gcaggagggc tcggcctgag cagggtgctg 1381 gggggagagc gcaatacctc acgcgggctg ccgtcctaat caacgtgcct atggcgggac 1441 cacgctcgga gcctgcctct tctgcgactg ttactttttc tttgcgggat gggggtgggg 1501 gttccctctc caggtggttg tccaagccca ggtcccggcc ctgggtgctc agccagctcg 1561 gctaggccct gcgcctccct gcgcttcccc cttcaggaag ggtgtgtgcc acctcgttgc 1621 actggatccc agtggctgct tgggggagag gcgtttgcca tcactggtgt tgtcacctcc 1681 ctgtttctcc accaagggct tgggcctctc ggggctgggg cctcccaggg gatggggacc 1741 cagaggtgca gtggccgccc acatccatgg cctaggagct actgggcagg ttcccggcca 1801 cacatctggt gggctgtttt gttttttttt ttcctcttcc cccagatgtc ttgacgggat 1861 cactggggct ctttgtgagt gagggtggcc aaactaccgc cggaggagat ggggtctcag 1921 agcgagagct gcggaggggg aggggaagaa gaaggcctca cttttgctgc tgcggggccc 1981 acacagccgc tgctactttg gggggtgggg aaggggccaa gctgcagaca cacacagtca 2041 ttcatttctg tccacacccc tgtgggtggc gggtgtgcgt gtgtgtgctt gtgtgtgcgc 2101 acgtgtcggc gctcacacac acatgctagc ccactgatgc acccagccca gggctggcag 2161 tctttgcagc gtggggccgt ctcaccctgg agcctggaga ggatctatgc ttgtttgttt 2221 ttgtaatcca tatcatagtt gctttcttta attgttcctt ctgaataaac agtttattta 2281 agataaaaaa aaaaaaaaaa aaa // LOCUS HSMASP2PR 2475 bp RNA PRI 04-APR-1997 DEFINITION H.sapiens mRNA for MASP-2 protein. ACCESSION Y09926 NID g1929053 KEYWORDS MASP-2 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2475) AUTHORS Jensen,T.V. TITLE Direct Submission JOURNAL Submitted (06-DEC-1996) T.V. Jensen, University of Aarhus, Microbiology & Immunology, Bartholin Building, 8000 Aarhus C, DENMARK REMARK Revised by author on 04-APR-1997 REFERENCE 2 (bases 1 to 2475) AUTHORS Thiel,S., Jensen,T.V., Stover,C.M., Schwaeble,W., Laursen,S.B., Poulsen,K., Willis,A.C., Eggleton,P., Hansen,S., Holmskov,U., Reid,K.B.M. and Jensenius,J.C. TITLE A second serine protease associated with mannan-binding lectin that activates complement JOURNAL Nature 386 (6624), 506-510 (1997) MEDLINE 97242412 FEATURES Location/Qualifiers source 1..2475 /organism="Homo sapiens" /macronuclear /db_xref="taxon:9606" gene 37..2097 /gene="MASP-2" CDS 37..2097 /gene="MASP-2" /codon_start=1 /product="MASP-2 protein" /db_xref="PID:e311185" /db_xref="PID:g1929054" /translation="MRLLTLLGLLCGSVATPLGPKWPEPVFGRLASPGFPGEYANDQE RRWTLTAPPGYRLRLYFTHFDLELSHLCEYDFVKLSSGAKVLATLCGQESTDTERAPG KDTFYSLGSSLDITFRSDYSNEKPFTGFEAFYAAEDIDECQVAPGEAPTCDHHCHNHL GGFYCSCRAGYVLHRNKRTCSALCSGQVFTQRSGELSSPEYPRPYPKLSSCTYSISLE EGFSVILDFVESFDVETHPETLCPYDFLKIQTDREEHGPFCGKTLPHRIETKSNTVTI TFVTDESGDHTGWKIHYTSTAQPCPYPMAPPNGHVSPVQAKYILKDSFSIFCETGYEL LQGHLPLKSFTAVCQKDGSWDRPMPACSIVDCGPPDDLPSGRVEYITGPGVTTYKAVI QYSCEETFYTMKVNDGKYVCEADGFWTSSKGEKSLPVCEPVCGLSARTTGGRIYGGQK AKPGDFPWQVLILGGTTAAGALLYDNWVLTAAHAVYEQKHDASALDIRMGTLKRLSPH YTQAWSEAVFIHEGYTHDAGFDNDIALIKLNNKVVINSNITPICLPRKEAESFMRTDD IGTASGWGLTQRGFLARNLMYVDIPIVDHQKCTAAYEKPPYPRGSVTANMLCAGLESG GKDSCRGDSGGALVFLDSETERWFVGGIVSWGSMNCGEAGQYGVYTKVINYIPWIENI ISDF" BASE COUNT 626 a 645 c 619 g 585 t ORIGIN 1 ctcgtgcaat tcggcacgag gctggacggg cacaccatga ggctgctgac cctcctgggc 61 cttctgtgtg gctcggtggc cacccccttg ggcccgaagt ggcctgaacc tgtgttcggg 121 cgcctggcat cccccggctt tccaggggag tatgccaatg accaggagcg gcgctggacc 181 ctgactgcac cccccggcta ccgcctgcgc ctctacttca cccacttcga cctggagctc 241 tcccacctct gcgagtacga cttcgtcaag ctgagctcgg gggccaaggt gctggccacg 301 ctgtgcgggc aggagagcac agacacggag cgggcccctg gcaaggacac tttctactcg 361 ctgggctcca gcctggacat taccttccgc tccgactact ccaacgagaa gccgttcacg 421 gggttcgagg ccttctatgc agccgaggac attgacgagt gccaggtggc cccgggagag 481 gcgcccacct gcgaccacca ctgccacaac cacctgggcg gtttctactg ctcctgccgc 541 gcaggctacg tcctgcaccg taacaagcgc acctgctcag ccctgtgctc cggccaggtc 601 ttcacccaga ggtctgggga gctcagcagc cctgaatacc cacggccgta tcccaaactc 661 tccagttgca cttacagcat cagcctggag gaggggttca gtgtcattct ggactttgtg 721 gagtccttcg atgtggagac acaccctgaa accctgtgtc cctacgactt tctcaagatt 781 caaacagaca gagaagaaca tggcccattc tgtgggaaga cattgcccca caggattgaa 841 acaaaaagca acacggtgac catcaccttt gtcacagatg aatcaggaga ccacacaggc 901 tggaagatcc actacacgag cacagcgcag ccttgccctt atccgatggc gccacctaat 961 ggccacgttt cacctgtgca agccaaatac atcctgaaag acagcttctc catcttttgc 1021 gagactggct atgagcttct gcaaggtcac ttgcccctga aatcctttac tgcagtttgt 1081 cagaaagatg gatcttggga ccggccaatg cccgcgtgca gcattgttga ctgtggccct 1141 cctgatgatc tacccagtgg ccgagtggag tacatcacag gtcctggagt gaccacctac 1201 aaagctgtga ttcagtacag ctgtgaagag accttctaca caatgaaagt gaatgatggt 1261 aaatatgtgt gtgaggctga tggattctgg acgagctcca aaggagaaaa atcactccca 1321 gtctgtgagc ctgtttgtgg actatcagcc cgcacaacag gagggcgtat atatggaggg 1381 caaaaggcaa aacctggtga ttttccttgg caagtcctga tattaggtgg aaccacagca 1441 gcaggtgcac ttttatatga caactgggtc ctaacagctg ctcatgccgt ctatgagcaa 1501 aaacatgatg catccgccct ggacattcga atgggcaccc tgaaaagact atcacctcat 1561 tatacacaag cctggtctga agctgttttt atacatgaag gttatactca tgatgctggc 1621 tttgacaatg acatagcact gattaaattg aataacaaag ttgtaatcaa tagcaacatc 1681 acgcctattt gtctgccaag aaaagaagct gaatccttta tgaggacaga tgacattgga 1741 actgcatctg gatggggatt aacccaaagg ggttttcttg ctagaaatct aatgtatgtc 1801 gacataccga ttgttgacca tcaaaaatgt actgctgcat atgaaaagcc accctatcca 1861 aggggaagtg taactgctaa catgctttgt gctggcttag aaagtggggg caaggacagc 1921 tgcagaggtg acagcggagg ggcactggtg tttctagata gtgaaacaga gaggtggttt 1981 gtgggaggaa tagtgtcctg gggttccatg aattgtgggg aagcaggtca gtatggagtc 2041 tacacaaaag ttattaacta tattccctgg atcgagaaca taattagtga tttttaactt 2101 gcgtgtctgc agtcaaggat tcttcatttt tagaaatgcc tgtgaagacc ttggcagcga 2161 cgtggctcga gaagcattca tcattactgt ggacatggca gttgttgctc cacccaaaaa 2221 aacagactcc aggtgaggct gctgtcattt ctccacttgc cagtttaatt ccagccttac 2281 ccattgactc aaggggacat aaaccacgag agtgacagtc atctttgccc acccagtgta 2341 atgtcactgc tcaaattaca tttcattacc ttaaaaagcc agtctctttt catactggct 2401 gttggcattt ctgtaaactg cctgtccatg ctctttgttt ttaaacttgt tcttattgaa 2461 aaaaaaaaaa aaaaa // LOCUS HSMAT82 511 bp RNA PRI 28-NOV-1995 DEFINITION H.sapiens mRNA for MAT8 protein. ACCESSION X93036 S74645 NID g1085025 KEYWORDS mat8 gene; MAT8 protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 511) AUTHORS Morrison,B.W. TITLE Direct Submission JOURNAL Submitted (14-NOV-1995) B.W. Morrison, Dana-Farber Cancer Institute, 44 Binney Street, Boston, MA 02115, USA REFERENCE 2 (bases 1 to 511) AUTHORS Morrison,B.W., Moorman,J.R., Kowdley,G.C., Kobayashi,Y.M., Jones,L.R. and Leder,P. TITLE Mat-8, a novel phospholemman-like protein expressed in human breast tumors, induces a chloride conductance in Xenopus oocytes JOURNAL J. Biol. Chem. 270 (5), 2176-2182 (1995) MEDLINE 95138184 FEATURES Location/Qualifiers source 1..511 /organism="Homo sapiens" /note="tumor" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="breast" /cell_line="SKBR-3" /clone_lib="lambda ZAP II" gene 60..323 /gene="mat8" CDS 60..323 /gene="mat8" /function="cloride conductance" /codon_start=1 /product="MAT8 protein" /db_xref="PID:e211793" /db_xref="PID:g1085026" /translation="MQKVTLGLLVFLAGFPVLDANDLEDKNSPFYYDWHSLQVGGLIC AGVLCAMGIIIVMSAKCKCKFGQKSGHHPGETPPLITPGSAQS" misc_feature 60..119 /gene="mat8" /note="leader sequence" misc_feature 174..233 /gene="mat8" /note="transmembrane region" polyA_signal 496..503 /note="non-consensus" BASE COUNT 110 a 149 c 124 g 127 t 1 others ORIGIN 1 cccgatttct cccggaacct ctgctcagcc tggtgaacca cacaggccag cgctctgaca 61 tgcagaaggt gaccctgggc ctgcttgtgt tcctggcagg ctttcctgtc ctggacgcca 121 atgacctaga agataaaaac agtcctttct actatgactg gcacagcctc caggttggcg 181 ggctcatctg cgctggggtt ctgtgcgcca tgggcatcat catcgtcatg agtgcaaaat 241 gcaaatgcaa gtttggccag aagtccggtc accatccagg ggagactcca cctctcatca 301 ccccaggctc agcccaaagc tgatgaggac agaccagctg aaattgggtg gaggaccgtt 361 ctctgtcccc aggtcctgtc tctgcacaga aacttgaact ccaggatgga attcttcctc 421 ctctgctggg actcctttgc atggcagggc ctcatctcac ctctcgcaag agggtctctt 481 tgttcaattt tttttaatct aaaatgatta n // LOCUS HSMATUMN 1552 bp RNA PRI 10-OCT-1995 DEFINITION H.sapiens MaTu MN mRNA for p54/58N protein. ACCESSION X66839 NID g1000701 KEYWORDS transmembrane glycoprotein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1552) AUTHORS Pastorek,J. TITLE Direct Submission JOURNAL Submitted (11-JUN-1992) J. Pastorek, Institute of Virology, Slovak Academy of Sciences, Dubravska 9, 842 46 Bratislavia, SLOVAK REPUBLIC REMARK revised by [3] MAT REFERENCE 2 (bases 1 to 1552) AUTHORS Pastorek,J., Pastorekova,S., Callebaut,I., Mornon,J., Zelnik,V., Opavsky,R., Zatovicova,M., Liao,S., Portetelle,D., Stanbridge,E.J., Zavada,J. and Burny,A. TITLE Cloning and characterization of MN, a human tumor-associated protein with a domain homologous to carbonic anhydrase and a putative helix-loop-helix DNA binding segment JOURNAL Oncogene 9 (10), 2877-2888 (1994) MEDLINE 94366734 REFERENCE 3 (bases 1 to 1552) AUTHORS Pastorek,J. TITLE Direct Submission JOURNAL Submitted (19-JUL-1994) J. Pastorek, Institute of Virology, Slovak Academy of Sciences, Dubravska 9, 842 46 Bratislavia, SLOVAK REPUBLIC REMARK revised by [4] MAT REFERENCE 4 (bases 1 to 1552) AUTHORS Pastorek,J. TITLE Direct Submission JOURNAL Submitted (28-SEP-1995) J. Pastorek, Institute of Virology, Slovak Academy of Sciences, Dubravska 9, 842 46 Bratislavia, SLOVAK REPUBLIC FEATURES Location/Qualifiers source 1..1552 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="carcinoma" /cell_type="epithelial" /clone_lib="lambda gt11" /clone="MN1" gene 43..1422 /gene="MaTu MN" CDS 43..1422 /gene="MaTu MN" /codon_start=1 /product="p54/58N" /db_xref="PID:g1000702" /translation="MAPLCPSPWLPLLIPAPAPGLTVQLLLSLLLLMPVHPQRLPRMQ EDSPLGGGSSGEDDPLGEEDLPSEEDSPREEDPPGEEDLPGEEDLPGEEDLPEVKPKS EEEGSLKLEDLPTVEAPGDPQEPQNNAHRDKEGDDQSHWRYGGDPPWPRVSPACAGRF QSPVDIRPQLAAFCPALRPLELLGFQLPPLPELRLRNNGHSVQLTLPPGLEMALGPGR EYRALQLHLHWGAAGRPGSEHTVEGHRFPAEIHVVHLSTAFARVDEALGRPGGLAVLA AFLEEGPEENSAYEQLLSRLEEIAEEGSETQVPGLDISALLPSDFSRYFQYEGSLTTP PCAQGVIWTVFNQTVMLSAKQLHTLSDTLWGPGDSRLQLNFRATQPLNGRVIEASFPA GVDSSPRAAEPVQLNSCLAAGDILALVFGLLFAVTSVAFLVQMRRQHRRGTKGGVSYR PAEVAETGA" BASE COUNT 302 a 471 c 461 g 318 t ORIGIN 1 gcccgtacac accgtgtgct gggacacccc acagtcagcc gcatggctcc cctgtgcccc 61 agcccctggc tccctctgtt gatcccggcc cctgctccag gcctcactgt gcaactgctg 121 ctgtcactgc tgcttctgat gcctgtccat ccccagaggt tgccccggat gcaggaggat 181 tcccccttgg gaggaggctc ttctggggaa gatgacccac tgggcgagga ggatctgccc 241 agtgaagagg attcacccag agaggaggat ccacccggag aggaggatct acctggagag 301 gaggatctac ctggagagga ggatctacct gaagttaagc ctaaatcaga agaagagggc 361 tccctgaagt tagaggatct acctactgtt gaggctcctg gagatcctca agaaccccag 421 aataatgccc acagggacaa agaaggggat gaccagagtc attggcgcta tggaggcgac 481 ccgccctggc cccgggtgtc cccagcctgc gcgggccgct tccagtcccc ggtggatatc 541 cgcccccagc tcgccgcctt ctgcccggcc ctgcgccccc tggaactcct gggcttccag 601 ctcccgccgc tcccagaact gcgcctgcgc aacaatggcc acagtgtgca actgaccctg 661 cctcctgggc tagagatggc tctgggtccc gggcgggagt accgggctct gcagctgcat 721 ctgcactggg gggctgcagg tcgtccgggc tcggagcaca ctgtggaagg ccaccgtttc 781 cctgccgaga tccacgtggt tcacctcagc accgcctttg ccagagttga cgaggccttg 841 gggcgcccgg gaggcctggc cgtgttggcc gcctttctgg aggagggccc ggaagaaaac 901 agtgcctatg agcagttgct gtctcgcttg gaagaaatcg ctgaggaagg ctcagagact 961 caggtcccag gactggacat atctgcactc ctgccctctg acttcagccg ctacttccaa 1021 tatgaggggt ctctgactac accgccctgt gcccagggtg tcatctggac tgtgtttaac 1081 cagacagtga tgctgagtgc taagcagctc cacaccctct ctgacaccct gtggggacct 1141 ggtgactctc ggctacagct gaacttccga gcgacgcagc ctttgaatgg gcgagtgatt 1201 gaggcctcct tccctgctgg agtggacagc agtcctcggg ctgctgagcc agtccagctg 1261 aattcctgcc tggctgctgg tgacatccta gccctggttt ttggcctcct ttttgctgtc 1321 accagcgtcg cgttccttgt gcagatgaga aggcagcaca gaaggggaac caaagggggt 1381 gtgagctacc gcccagcaga ggtagccgag actggagcct agaggctgga tcttggagaa 1441 tgtgagaagc cagccagagg catctgaggg ggagccggta actgtcctgt cctgctcatt 1501 atgccacttc cttttaactg ccaagaaatt ttttaaaata aatatttata at // LOCUS HSMAXM 623 bp RNA PRI 04-DEC-1994 DEFINITION H.sapiens max mRNA. ACCESSION X60287 S95058 NID g599792 KEYWORDS max gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 623) AUTHORS Maekelae,T.P. TITLE Direct Submission JOURNAL Submitted (10-SEP-1991) T.P. Maekelae, Departments of Virolology & Pathology, Univ. of Helsinki, Haartmaninkatu 3, 00290 Helsinki, FINLAND REFERENCE 2 (bases 1 to 623) AUTHORS Makela,T.P., Koskinen,P.J., Vastrik,I. and Alitalo,K. TITLE Alternative forms of Max as enhancers or suppressors of Myc-ras cotransformation JOURNAL Science 256 (5055), 373-377 (1992) MEDLINE 92229468 COMMENT Overlap of sequenced fragments. FEATURES Location/Qualifiers source 1..623 /organism="Homo sapiens" /db_xref="taxon:9606" gene 22..333 /gene="max" CDS 22..333 /gene="max" /codon_start=1 /evidence=experimental /db_xref="PID:g599793" /translation="MSDNDDIEVESDEEQPRFQSAADKRAHHNALERKRRDHIKDSFH SLRDSVPSLQGEKASRAQILDKATEYIQYMRRKNHTHQQDIDDLKRQNALLEQQGESE S" exon 316..417 /note="internal alternative" /evidence=experimental BASE COUNT 185 a 172 c 163 g 103 t ORIGIN 1 ccgctccctg ggccgtagga aatgagcgat aacgatgaca tcgaggtgga gagcgacgaa 61 gagcaaccga ggtttcaatc tgcggctgac aaacgggctc atcataatgc actggaacga 121 aaacgtaggg accacatcaa agacagcttt cacagtttgc gggactcagt cccatcactc 181 caaggagaga aggcatcccg ggcccaaatc ctagacaaag ccacagaata tatccagtat 241 atgcgaagga aaaaccacac acaccagcaa gatattgacg acctcaagcg gcagaatgct 301 cttctggagc agcaagggga aagcgagagc tgatcaagtt ctttgttcct ggggaattca 361 cttctcttcc ttcctcatgg aagatgcaag taaaaggaaa tgcaagtaac cacctggtcc 421 gtgcactgga gaaggcgagg tcaagtgccc aactgcagac caactacccc tcctcagaca 481 acagcctcta caccaacgcc aagggcagca ccatttctgc cttcgatggg ggctcggact 541 ccagctcgga gtctgagcct gaagagcccc aaagcaggaa gaagctccgg atggaggcca 601 gctaagccac tcggggcagg cca // LOCUS HSMBPC 3605 bp RNA PRI 31-MAR-1995 DEFINITION Human mRNA for mannose-binding protein C. ACCESSION X15422 NID g34486 KEYWORDS mannose-binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3605) AUTHORS Ezekowitz,R.A.B. TITLE Direct Submission JOURNAL Submitted (17-AUG-1989) Ezekowitz R.A.B., The Children's Hospital, Enders Building 7th Floor, 300 Longwood Avenue, Boston MA 02115, U S A REFERENCE 2 (bases 1 to 3605) AUTHORS Sastry,K., Herman,G.A., Day,L., Deignan,E., Bruns,G., Morton,C.C. and Ezekowitz,R.A. TITLE The human mannose-binding protein gene. Exon structure reveals its evolutionary relationship to a human pulmonary surfactant gene and localization to chromosome 10 JOURNAL J. Exp. Med. 170 (4), 1175-1189 (1989) MEDLINE 90010778 COMMENT X15422 revises MBP cDNA seq published by: Ezekowitz et. al. J. Exp. Med. 167:1034-1046(1988). Data kindly reviewed (22-FEB-1990) by Ezckowitz R.A.B. FEATURES Location/Qualifiers source 1..3605 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /cell_type="heatocytes" /cell_line="Hep62" /clone_lib="lambda gt10" /clone="48-11" /chromosome="10q11.2-q21." sig_peptide 66..125 /note="signal peptide (AA -20 to -1)" CDS 66..812 /codon_start=1 /product="precursor protein" /db_xref="PID:g34487" /db_xref="SWISS-PROT:P11226" /translation="MSLFPSLPLLLLSMVAASYSETVTCEDAQKTCPAVIACSSPGIN GFPGKDGRDGTKGEKGEPGQGLRGLQGPPGKLGPPGNPGPSGSPGPKGQKGDPGKSPD GDSSLAASERKALQTEMARIKKWLTFSLGKQVGNKFFLTNGEIMTFEKVKALCVKFQA SVATPRNAAENGAIQNLIKEEAFLGITDEKTEGQFVDLTGNRLTYTNWNEGEPNNAGS DEDCVLLLKNGQWNDVPCSTSHLAVCEFPI" mat_peptide 126..809 /note="mat. mannose-binding protein C (AA 1-228)" BASE COUNT 1055 a 679 c 647 g 1224 t ORIGIN 1 ggtaaatatg tgttcattaa ctgagattaa ccttccctga gttttctcac accaaggtga 61 ggaccatgtc cctgtttcca tcactccctc tccttctcct gagtatggtg gcagcgtctt 121 actcagaaac tgtgacctgt gaggatgccc aaaagacctg ccctgcagtg attgcctgta 181 gctctccagg catcaacggc ttcccaggca aagatgggcg tgatggcacc aagggagaaa 241 agggggaacc aggccaaggg ctcagaggct tacagggccc ccctggaaag ttggggcctc 301 caggaaatcc agggccttct gggtcaccag gaccaaaggg ccaaaaagga gaccctggaa 361 aaagtccgga tggtgatagt agcctggctg cctcagaaag aaaagctctg caaacagaaa 421 tggcacgtat caaaaagtgg ctgaccttct ctctgggcaa acaagttggg aacaagttct 481 tcctgaccaa tggtgaaata atgacctttg aaaaagtgaa ggccttgtgt gtcaagttcc 541 aggcctctgt ggccaccccc aggaatgctg cagagaatgg agccattcag aatctcatca 601 aggaggaagc cttcctgggc atcactgatg agaagacaga agggcagttt gtggatctga 661 caggaaatag actgacctac acaaactgga acgagggtga acccaacaat gctggttctg 721 atgaagattg tgtattgcta ctgaaaaatg gccagtggaa tgacgtcccc tgctccacct 781 cccatctggc cgtctgtgag ttccctatct gaagggtcat atcactcagg ccctccttgt 841 ctttttactg caacccacag gcccacagta tgcttgaaaa gataaattat atcaatttcc 901 tcatatccag tattgttcct tttgtgggca atcactaaaa atgatcacta acagcaccaa 961 caaagcaata atagtagtag tagtagttag cagcagcagt agtagtcatg ctaattatat 1021 aatattttta atatatacta tgaggcccta tcttttgcat cctacattaa ttatctagtt 1081 taattaatct gtaatgcttt cgatagtgtt aacttgctgc agtatgaaaa taagacggat 1141 ttatttttcc atttacaaca aacacctgtg ctctgttgag ccttcctttc tgtttgggta 1201 gagggctccc ctaatgacat caccacagtt taataccaca gctttttacc aagtttcagg 1261 tattaagaaa atctattttg taactttctc tatgaactct gttttctttc taatgagata 1321 ttaaaccatg taaagaacat aaataacaaa tctcaagcaa acagcttcac aaattctcac 1381 acacatacat acctatatac tcactttcta gattaagata tgggacattt ttgactccct 1441 agaagccccg ttataactcc tcctagtact aactcctagg aaaatactat tctgacctcc 1501 atgactgcac agtaatttcg tctgtttata aacattgtat agttggaatc atattgtgtg 1561 taatgttgta tgtcttgctt actcagaatt aagtctgtga gattcattca tgtcatgtgt 1621 acaaaagttt catccttttc attgccatgt agggttccct tatattaata ttcctcagtt 1681 catccattct attgttaata ggcacttaag tggcttccaa tttttggcca tgaggaagag 1741 aacccacgaa cattcctgga cttgtctttt ggtggacatg gtgcactaat ttcactacct 1801 atccaggagt ggaactggta gaggatgagg aaagcatgta ttcagcttta gtagatatta 1861 ccagttttcc taagtgattg tatgaattta tgctcctacc ggcaatgtgt ggcagtccta 1921 gatgctctat gtgcttgtaa aaagtcaatg ttttcagttc tcttgatttt cattattcct 1981 gtggatgtaa agtgatattt ccccatggtt ttaatctgta tttccccaac atgtaataag 2041 gttgaacact tttttatatg cttattgggc acttgggtat cttcttctgt gaagtacccg 2101 ttcacatttt tgtattttgt ttaaattagt tagccaatat ttttcttact gatttttaag 2161 ttatttttac attctgaata tgtccttttt aatgtgtatt acaaatattt tgctagtttt 2221 tgacttgctc ctaatgttga attttgatga acaaaatttc ctaattttga gaaagtctta 2281 tttattcata ttttctttca aaattagtgc tttttgtgtc atgtttaaga aatttttgcc 2341 catcccaaaa tcataagata tttttcatga ttttgaaacc atgaagagat ttttcatgat 2401 tttgaaatca tgaagatatt tttccatttt tttctaatag ttttattaat aaacattcta 2461 tctattcctg gtagaataga tatccacttg agacagcact atgtaggaaa gaccattttt 2521 cctccactga actagggtgg tgcatttttg taagttaggt aactgtatgt gtgtgtgtct 2581 gtttctgggc tgtctattct agtctatttg ttgatgcttg tgtcaaacag tacactatct 2641 taattattgt acatttatag ttgtaactgt agtccagctt tgttcttctt caagtcaaga 2701 tttccatata aatattagaa acagtttctc aatttctaca aaatcctgat gaggtttcta 2761 ctgggaccac attgagtcta tcaatcaact tatgcagaac tggcaactta ctactgaatc 2821 tctaatcaat gttcatcatg tatcgcttca tttaactagg atttctctaa cttaattgct 2881 atgttttgag atttttagtt taaaaacctt gtatatcttg ttttggtggt tttagtgatt 2941 ttaataatat attttaaata ttttttcttt tctattgttg tacacagaaa tacagttaag 3001 ttttgtgtgt agtcttacga tgtttagtaa cctcaataag tttatttctt aaatctagta 3061 atttgtagat tcctctggat tttgtatatg catagtcatg taagctgaaa atatggcaat 3121 acttgcttct tcccaattgc tttacctttt ttcttacctt attgcactgg ttagcaaccc 3181 caatacagag accaccagag caggtataga ctcctgaaag acaatataat gaagtgctcc 3241 agtcaggcct atctaaactg gattcacagc tctgtcactt aattgctaca tgatctagag 3301 ccagttactt tgtgtttcag ccatgtattt gcagctgaga gaaaataatc attcttattt 3361 catgaaaatt gtggggatga tgaaataagt taacaccttt aaagtgtgta gtaaagtatc 3421 aggatactat attttaggtc ttaatacaca cagttatgcc gctagataca tgctttttaa 3481 tgagataatg tgatattata cataacacat atcgattttt aaaaattaaa tcaaccttgc 3541 tttgatggaa taaactccat ttagtcacaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3601 aaaaa // LOCUS HSMCP 1530 bp RNA PRI 23-MAR-1995 DEFINITION Human mRNA for membrane cofactor protein. ACCESSION Y00651 NID g34504 KEYWORDS complement protein; glycoprotein; membrane cofactor protein; membrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1530) AUTHORS Lublin,D.M. TITLE Direct Submission JOURNAL Submitted (27-JUN-1988) Lublin D.M., Dept. of Pathology, Washington Univ. School of Medicine, Box 8118, 660 South Euclid Avenue, St. Louis, MO 63110 REFERENCE 2 (bases 1 to 1530) AUTHORS Lublin,D.M., Liszewski,M.K., Post,T.W., Arce,M.A., Le Beau,M.M., Rebentisch,M.B., Lemons,L.S., Seya,T. and Atkinson,J.P. TITLE Molecular cloning and chromosomal localization of human membrane cofactor protein (MCP). Evidence for inclusion in the multigene family of complement-regulatory proteins JOURNAL J. Exp. Med. 168 (1), 181-194 (1988) MEDLINE 88286080 FEATURES Location/Qualifiers source 1..1530 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="U937" /clone_lib="lambda gt10" /clone="MCP-9" /chromosome="1q3.2" sig_peptide 44..145 /note="signal peptide (AA -34 to -1)" CDS 44..1198 /note="membrane cofactor preprotein (AA -34 to 350)" /codon_start=1 /db_xref="PID:g34505" /db_xref="SWISS-PROT:P15529" /translation="MEPPGRRECPFPSWRFPGLLLAAMVLLLYSFSDACEEPPTFEAM ELIGKPKPYYEIGERVDYKCKKGYFYIPPLATHTICDRNHTWLPVSDDACYRETCPYI RDPLNGQAVPANGTYEFGYQMHFICNEGYYLIGEEILYCELKGSVAIWSGKPPICEKV LCTPPPKIKNGKHTFSEVEVFEYLDAVTYSCDPAPGPDPFSLIGESTIYCGDNSVWSR AAPECKVVKCRFPVVENGKQISGFGKKFYYKATVMFECDKGFYLDGSDTIVCDSNSTW DPPVPKCLKVSTSSTTKSPASSASGPRPTYKPPVSNYPGYPKPEEGILDSLDVWVIAV IVIAIVVGVAVICVVPYRYLQRRKKKGKADGGAEYATYQTKSTTPAEQRG" mat_peptide 146..1195 /note="mature membrane cofactor protein (AA 1 - 350)" repeat_region 146..328 /note="consensus repeat" misc_feature 290..292 /note="pot. N-linked glycosylation site" repeat_region 329..517 /note="consensus repeat" misc_feature 383..385 /note="pot. N-linked glycosylation site" repeat_region 518..715 /note="consensus repeat" repeat_region 716..895 /note="consensus repeat" misc_feature 860..862 /note="pot. N-linked glycosylation site" misc_feature 1028..1096 /note="transmembrane domain" misc_feature 1497..1502 /note="pot. polyA signal" misc_feature 1509..1514 /note="pot. alt. polyA signal" polyA_site 1530 /note="polyA site" BASE COUNT 430 a 302 c 335 g 463 t ORIGIN 1 tctgctttcc tccggagaaa taacagcgtc ttccgcgccg cgcatggagc ctcccggccg 61 ccgcgagtgt ccctttcctt cctggcgctt tcctgggttg cttctggcgg ccatggtgtt 121 gctgctgtac tccttctccg atgcctgtga ggagccacca acatttgaag ctatggagct 181 cattggtaaa ccaaaaccct actatgagat tggtgaacga gtagattata agtgtaaaaa 241 aggatacttc tatatacctc ctcttgccac ccatactatt tgtgatcgga atcatacatg 301 gctacctgtc tcagatgacg cctgttatag agaaacatgt ccatatatac gggatccttt 361 aaatggccaa gcagtccctg caaatgggac ttacgagttt ggttatcaga tgcactttat 421 ttgtaatgag ggttattact taattggtga agaaattcta tattgtgaac ttaaaggatc 481 agtagcaatt tggagcggta agcccccaat atgtgaaaag gttttgtgta caccacctcc 541 aaaaataaaa aatggaaaac acacctttag tgaagtagaa gtatttgagt atcttgatgc 601 agtaacttat agttgtgatc ctgcacctgg accagatcca ttttcactta ttggagagag 661 cacgatttat tgtggtgaca attcagtgtg gagtcgtgct gctccagagt gtaaagtggt 721 caaatgtcga tttccagtag tcgaaaatgg aaaacagata tcaggatttg gaaaaaaatt 781 ttactacaaa gcaacagtta tgtttgaatg cgataagggt ttttacctcg atggcagcga 841 cacaattgtc tgtgacagta acagtacttg ggatccccca gttccaaagt gtcttaaagt 901 gtcgacttct tccactacaa aatctccagc gtccagtgcc tcaggtccta ggcctactta 961 caagcctcca gtctcaaatt atccaggata tcctaaacct gaggaaggaa tacttgacag 1021 tttggatgtt tgggtcattg ctgtgattgt tattgccata gttgttggag ttgcagtaat 1081 ttgtgttgtc ccgtacagat atcttcaaag gaggaagaag aaagggaaag cagatggtgg 1141 agctgaatat gccacttacc agactaaatc aaccactcca gcagagcaga gaggctgaat 1201 agattccaca acctggtttg ccagttcatc ttttgactct attaaaatct tcaatagttg 1261 ttattctgta gtttcactct catgagtgca actgtggctt agctaatatt gcaatgtggc 1321 ttgaatgtag gtagcatcct ttgatgcttc tttgaaactt gtatgaattt gggtatgaac 1381 agattgcctg ctttccctta aataacactt agatttattg gaccagtcag cacagcatgc 1441 ctggttgtat taaagcaggg atatgctgta ttttataaaa ttggcaaaat tagagaaata 1501 tagttcacaa tgaaattata ttttctttgt // LOCUS HSMCR30 1171 bp mRNA PRI 26-JAN-1998 DEFINITION H.sapiens mRNA for metaphase chromosmal protein. ACCESSION X94629 NID g1154802 KEYWORDS DNA-binding protein; mcp30 gene; metaphase chromosomal protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1171) AUTHORS Rocha,E.B., Catita,J.A. and Sunrel,C.F. TITLE Molecular cloning of metaphase chromosome protein 1 (MCP1), a novel human autoantigene that associates with condensed chromosomes during mitasis JOURNAL Chromosome Res. 6, 1-11 (1998) REFERENCE 2 (bases 1 to 1171) AUTHORS Rocha,E.M.R.B. TITLE Direct Submission JOURNAL Submitted (04-JAN-1996) E.M.R.B. Rocha, Centro de Citologia Experimental, Univ. Porto Lab. Gene Mol., Rua do Campo Aleore 823, Porto, 4150, PORTUGAL FEATURES Location/Qualifiers source 1..1171 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /clone_lib="S3 clontech" gene 105..977 /gene="mcp30" CDS 105..977 /gene="mcp30" /note="DNA binding protein" /codon_start=1 /product="metaphase chromosomal protein" /db_xref="PID:e1173582" /db_xref="PID:g1154803" /translation="MMVNHNFTSFTSNERAVVKLNEVMQAALVNSDDRDWRYFVMLVP VLYDMQQFLVKEGSMNERFVAQAPKFDINFWRMIITVMAINFFKWQGKDVAELMKTSS AIDDLQFKFLQVDDKDDHFNLPVIAETFRGLSPKMKPLKGADSVVALEPKLTEAQIQA ELEFADKRLAQFKAASVKDVVSDNVVNMLRGFHEGLATEYQATHDLWQPAMFNALATD KLFNYWSPAWDNLDGIGGEVKSYLTFLSQKQDISGLSEFVTGTAGIDRYIDVAALNHL LEQMPEDVLAERAL" BASE COUNT 336 a 201 c 283 g 351 t ORIGIN 1 aattcgctgg atttttaaga gcgcgttatt atttaacgca ccgacaaaag tatagccctg 61 aaacgtttga agtggcctca ttcttcttag atgatgtcat tgcgatgatg gttaaccata 121 actttaccag cttcacgagc aatgaacggg cggtcgttaa gttaaatgaa gtcatgcaag 181 ccgcattggt caatagtgat gatcgcgatt ggcgttactt tgtgatgctt gtgccagtgc 241 tttatgacat gcaacaattc ttggttaaag aaggtagcat gaatgaacgc tttgtggcac 301 aagcaccgaa gtttgacatt aacttctggc gcatgattat cacggtgatg gcgattaact 361 tcttcaagtg gcaaggtaaa gatgtggctg aattgatgaa gacatcttct gcaatcgatg 421 atttgcaatt caagttcttg caggtggacg ataaggatga tcactttaac ctaccagtca 481 ttgctgagac gtttagagga ttgtcaccaa agatgaagcc cttaaagggc gctgattcag 541 ttgttgcgct tgaacctaag ttgactgaag cgcaaattca agctgaactc gaatttgctg 601 acaagcgctt ggcgcaattc aaggcggctt ctgtaaaaga tgtggtcagt gataacgtgg 661 ttaacatgtt acgtgggttc catgaaggat tggcgactga atatcaagcg acacatgact 721 tatggcaacc agcgatgttt aatgcgttag caacggataa gttgtttaat tattggtcac 781 cggcttggga taaccttgat ggcattggtg gtgaggtgaa atcttatctc actttcttga 841 gtcaaaagca agatattagt ggcttaagtg aatttgtgac agggactgct ggtattgatc 901 gctacattga tgtggcagcc ttgaatcatc ttcttgagca aatgccagaa gatgttttgg 961 cagaacgtgc cttataaaaa taagtaaaaa ggacaatcac tattgaaata gcgattgtcc 1021 tttttataat gtaatatgac gaatcgtacc aggtaaaatg atggtcattt gatgggtttt 1081 gggggattgg attaaaattt ccccatgttg attctgataa acatagccag taaaattcgt 1141 cacttcttca gatagtagtg acacattgac t // LOCUS HSMCSGEN1 747 bp RNA PRI 20-NOV-1996 DEFINITION H.sapiens mRNA for mitochondrial capsule selenoprotein. ACCESSION X89960 NID g1050519 KEYWORDS capsule selenoprotein; MCS gene; mitochondrial. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 747) AUTHORS Aho,H., Schwemmer,M., Tessman,D., Murphy,D., Mattei,G., Engel,W. and Adham,I.M. TITLE Isolation, expression, and chromosomal localization of the human mitochondrial capsule selenoprotein gene (MCSP) JOURNAL Genomics 32 (2), 184-190 (1996) MEDLINE 96429994 REFERENCE 2 (bases 1 to 747) AUTHORS Adham,I. TITLE Direct Submission JOURNAL Submitted (26-JUL-1995) I. Adham, Inst f Humangenetik, Gosslerstr 12d, 37073 Goettingen, FRG FEATURES Location/Qualifiers source 1..747 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="q21" /sex="male" gene 126..476 /gene="mcs" CDS 126..476 /gene="mcs" /note="specific expression in spermatids" /codon_start=1 /product="mitochondrial capsule selenoprotein" /db_xref="PID:g1050520" /db_xref="SWISS-PROT:P49901" /translation="MCDQTKHSKCCPAKGNQCCPPQQNQCCQSKGNQCCPPKQNQCCQ PKGSQCCPPKHNHCCQPKAPCCIQARCCGLETKPEVSPLNMESEPNSPQTQDKGCQTQ QQPHSPQNESRPSK" BASE COUNT 238 a 194 c 175 g 140 t ORIGIN 1 gggcatggac tcactagact gctgaggaag atcaataata cctactggaa tcagtcatga 61 gaagtcaagc atggaaattg tgaattgtgt gtgtggccag accagtacct ccaagtgttc 121 agaagatgtg tgaccagaca aaacacagta aatgctgccc agcaaaaggc aatcaatgct 181 gcccaccaca gcagaaccag tgctgccagt caaaaggcaa tcaatgctgc ccaccaaaac 241 agaaccagtg ctgccagcca aaaggcagtc aatgctgccc accaaaacac aatcactgct 301 gccagccaaa agccccatgc tgcattcagg ccaggtgctg tggtttggag accaagcctg 361 aagtctcacc ccttaacatg gagtctgagc ccaactcacc gcaaactcag gacaagggct 421 gtcaaaccca gcagcagccc catagcccac aaaatgagtc caggccaagc aaatgagagc 481 agaagaagtc aaacaaagaa gaagtccctg gggccatgcc tttcactttg tagggtgggg 541 gattactgag agtcaggcta gacctgtgtt tagaggagca gttttcacag tgactaccat 601 ttccacccaa tgagaggctc ctatttccca tcatagctcc ctaccctagg gaggcctcca 661 tctggaaatg ggaggatgaa gaggctagaa tcatctttcc tagtgatcct gacatttaga 721 cagcacagaa ataaagagca ataaaaa // LOCUS HSMCSP 7918 bp RNA PRI 01-OCT-1996 DEFINITION H.sapiens mRNA for melanoma-associated chondroitin sulfate proteoglycan (MCSP). ACCESSION X96753 NID g1617313 KEYWORDS melanoma-associated chondroitin sulfate proteoglycan. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7918) AUTHORS Pluschke,G., Vanek,M., Evans,A., Dittmar,T., Schmid,P., Itin,P., Filardo,E.J. and Reisfeld,R.A. TITLE Molecular cloning of a human melanoma-associated chondroitin sulfate proteoglycan JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (18), 9710-9715 (1996) MEDLINE 96382532 REFERENCE 2 (bases 1 to 7918) AUTHORS Pluschke,G. TITLE Direct Submission JOURNAL Submitted (21-MAR-1996) G. Pluschke, Swiss Tropical Institute, Socinstr 57, CH-4002, Basel, SWITZERLAND FEATURES Location/Qualifiers source 1..7918 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /cell_type="melanoma" /cell_line="A375met" /chromosome="15" CDS 1..6969 /codon_start=1 /product="melanoma-associated chondroitin sulfate proteoglycan (MCSP)" /db_xref="PID:e257808" /db_xref="PID:g1617314" /translation="MQSGRGPPLPAPGLALALTLTMLARLASAASFFGENHLEVPVAT ALTDIDLQLQFSTSQPEALLLLAAGPADHLLLQLYSGRLQVRLVLGQEELRLQTPAET LLSDSIPHTVVLTVVEGWATLSVDGFLNASSAVPGAPLEVPYGLFVGGTGTLGLPYLR GTSRPLRGCLHAATLNGRSLLRPLTPDVHEGCAEEFSASDDVALGFSGPHSLAAFPAW GTQDEGTLEFTLTTQSRQAPLAFQAGGRRGDFIYVDIFEGHLRAVVEKGQGTVLLHNS VPVADGQPHEVSVHINAHRLEISVDQYPTHTSNRGVLSYLEPRGSLLLGGLDAEASRH LQEHRLGLTPEATNASLLGCMEDLSVNGQRRGLREALLTRNMAAGCRLEEEEYEDDAY GHYEAFSTLAPEAWPAMELPEPCVPEPGLPPVFANFTQLLTISPLVVAEGGTAWLEWR HVQPTLDLMEAELRKSQVLFSVTRGAHYGELELDILGAQARKMFTLLDVVNRKARFIH DGSEDTSDQLVLEVSVTARVPMPSCLRRGQTYLLPIQVNPVNDPPHIIFPHGSLMVIL EHTQKPLGPEVFQAYDPDSACEGLTFQVLGTSSGLPVERRDQPGEPATEFSCRELEAG SLVYVHCGGPAQDLTFRVSDGLQASPPATLKVVAIRPAIQIHRSTGLRLAQGSAMPIL PANLSVETNAVGQDVSVLFRVTGALQFGELQKHSTGGVEGAEWWATQAFHQRDVEQGR VRYLSTDPQHHAYDTVENLALEVQVGQEILSNLSFPVTIQRATVWMLRLEPLHTQNTQ QETLTTAHLEATLEEAGPSPPTFHYEVVQAPRKGNLQLQGTRLSDGQGFTQDDIQAGR VTYGATARASEAVEDTFRFRVTAPPYFSPLYTFPIHIGGDPDAPVLTNVLLVVPEGGE GVLSADHLFVKSLNSASYLYEVMERPRLGRLAWRGTQDKTTMVTSFTNEDLLRGRLVY QHDDSETTEDDIPFVATRQGESSGDMAWEEVRGVFRVAIQPVNDHAPVQTISRIFHVA RGGRRLLTTDDVAFSDADSGFADAQLVLTRKDLLFGSIVAVDEPTRPIYRFTQEDLRK RRVLFVHSGADRGWIQLQVSDGQHQATALLEVQASEPYLRVANGSSLVVPQGGQGTID TAVLHLDTNLDIRSGDEVHYHVTAGPRWGQLVRAGQPATAFSQQDLLDGAVLYSHNGS LSPEDTMAFSVEAGPVHTDATLQVTIALEGPLAPLKLVRHKKIYVFQGEAAEIRRDQL EAAQEAVPPADIVFSVKSPPSAGYLVMVSRGALADEPPSLDPVQSFSQEAVDTGRVLY LHSRPEAWSDAFSLDVASGLGAPLEGVLVELEVLPAAIPLEAQNFSVPEGGSLTLAPP LLRVSGPYFPTLLGLSLQVLEPPQHGPLQKEDGPQARTLSAFSWRMVEEQLIRYVHDG SETLTDSFVLMANASEMDRQSHPVAFTVTVLPVNDQPPILTTNTGLQMWEGATAPIPA EALRSTDGDSGSEDLVYTIEQPSNGRVVLRGAPGTEVRSFTQAQLDGGLVLFSHRGTL DGGFPFRLSDGEHTSPGHFFRVTAQKQVLLSLKGSQTLTVCPGSVQPLSSQTLRASSS AGTDPQLLLYRVVRGPQLGRLFHAQQDSTGEALVNFTQAEVYAGNILYEHEMPPEPFW EAHDTLELQLSSPPARDVAATLAVAVSFEAACPQRPSHLWKNKGLWVPEGQRARITVA ALDASNLLASVPSPQRSEHDVLFQVTQFPSRGQLLVSEEPLHAGQPHFLQSQLAAGQL VYAHGGGGTQQDGFHFRAHLQGPAGASVAGPQTSEAFAITVRDVNERPPQPQASVPLR LTRGSRAPISRAQLSVVDPDSAPGEIEYEVQRAPHNGFLSLVGGGLGPVTRFTQADVD SGRLAFVANGSSVAGIFQLSMSDGASPPLPMSLAVDILPSAIEVQLRAPLEVPQALGR SSLSQQQLRVVSDREEPEAAYRLIQGPQYGHLLVGGRPTSAFSQFQIDQGEVVFAFTN FSSSHDHFRVLALARGVNASAVVNVTVRALLHVWAGGPWPQGATLRLDPTVLDAGELA NRTGSVPRFRLLEGPRHGRVVRVPRARTEPGGSQLVEQFTQQDLEDGRLGLEVGRPEG RAPGPAGDSLTLELWAQGVPPAVASLDFATEPYNAARPYSVALLSVPEAARTEAGKPE SSTPTGEPGPMASSPEPAVAKGGFLSFLEANMFSVIIPMCLVLLLLALILPLLFYLRK RNKTGKHDVQVLTAKPRNGLAGDTETFRKVEPGQAIPLTAVPGQGPPPGGQPDPELLQ FCRTPNPALKNGQYWV" BASE COUNT 1451 a 2551 c 2455 g 1460 t 1 others ORIGIN 1 atgcagtccg gccgcggccc cccacttcca gcccccggcc tggccttggc tttgaccctg 61 actatgttgg ccagacttgc atccgcggct tccttcttcg gtgagaacca cctggaggtg 121 cctgtggcca cggctctgac cgacatagac ctgcagctgc agttctccac gtcccagccc 181 gaagccctcc ttctcctggc agcaggccca gctgaccacc tcctgctgca gctctactct 241 ggacgcctgc aggtcagact tgttctgggc caggaggagc tgaggctgca gactccagca 301 gagacgctgc tgagtgactc catcccccac actgtggtgc tgactgtcgt agagggctgg 361 gccacgttgt cagtcgatgg gtttctgaac gcctcctcag cagtcccagg agccccccta 421 gaggtcccct atgggctctt tgttgggggc actgggaccc ttggcctgcc ctacctgagg 481 ggaaccagcc gacccctgag gggttgcctc catgcagcca ccctcaatgg ccgcagcctc 541 ctccggcctc tgacccccga tgtgcatgag ggctgtgctg aagagttttc tgccagtgat 601 gatgtggccc tgggcttctc tgggccccac tctctggctg ccttccctgc ctggggcact 661 caggacgaag gaaccctaga gtttacactc accacacaga gccggcaggc acccttggcc 721 ttccaggcag ggggccggcg tggggacttc atctatgtgg acatatttga gggccacctg 781 cgggccgtgg tggagaaggg ccagggtacc gtattgctcc acaacagtgt gcctgtggcc 841 gatgggcagc cccatgaggt cagtgtccac atcaatgctc accggctgga aatctccgtg 901 gaccagtacc ctacgcatac ttcgaaccga ggagtcctca gctacctgga gccacggggc 961 agtctccttc tcggggggct ggatgcagag gcctctcgtc acctccagga acaccgcctg 1021 ggcctgacac cagaggccac caatgcctcc ctgctgggct gcatggaaga cctcagtgtc 1081 aatggccaga ggcgggggct gcgggaagct ttgctgacgc gcaacatggc agccggctgc 1141 aggctggagg aggaggagta tgaggacgat gcctatggcc attatgaagc tttctccacc 1201 ctggctcccg aggcttggcc agccatggag ctgcctgagc catgcgtgcc tgagccaggg 1261 ctgcctcctg tctttgccaa tttcacccag ctgctgacta tcagcccact ggtggtggcc 1321 gagggtggca cagcctggct tgagtggagg catgtgcagc ccacgctgga cctgatggag 1381 gctgagctgc gcaaatccca ggtgctgttc agcgtgaccc gaggggcaca ctatggcgag 1441 ctcgagctgg acatcctggg tgcccaggca cgaaaaatgt tcaccctcct ggacgtggtg 1501 aaccgcaagg cccgcttcat ccacgatggc tctgaggaca cctccgacca gctggtgctg 1561 gaggtgtcgg tgacggctcg ggtgcccatg ccctcatgcc ttcggagggg ccaaacatac 1621 ctcctgccca tccaggtcaa ccctgtcaat gacccacccc acatcatctt cccacatggc 1681 agcctcatgg tgatcctgga acacacgcag aagccgctgg ggcctgaggt tttccaggcc 1741 tatgacccgg actctgcctg tgagggcctc accttccagg tccttggcac ctcctctggc 1801 ctccccgtgg agcgccgaga ccagcctggg gagccggcga ccgagttctc ctgccgggag 1861 ttggaggccg gcagcctagt ctatgtccac tgcggtggtc ctgcacagga cttgacgttc 1921 cgggtcagcg atggactgca ggccagcccc ccggccacgc tgaaggtggt ggccatccgg 1981 ccggccatac agatccaccg cagcacaggg ttgcgactgg cccaaggctc tgccatgccc 2041 atcttgcccg ccaacctgtc ggtggagacc aatgccgtgg ggcaggatgt gagcgtgctg 2101 ttccgcgtca ctggggccct gcagtttggg gagctgcaga agcatagtac aggtggggtg 2161 gagggtgctg agtggtgggc cacacaggcg ttccaccagc gggatgtgga gcagggccgc 2221 gtgaggtacc tgagcactga cccacagcac cacgcttacg acaccgtgga gaacctggcc 2281 ctggaggtgc aggtgggcca ggagatcctg agcaatctgt ccttcccagt gaccatccag 2341 agagccactg tgtggatgct gcggctggag ccactgcaca ctcagaacac ccagcaggag 2401 accctcacca cagcccacct ggaggccacc ctggaggagg caggcccaag ccccccaacc 2461 ttccattatg aggtggttca ggctcccagg aaaggcaacc ttcaactaca gggcacaagg 2521 ctgtcagatg gccagggctt cacccaggat gacatacagg ctggccgggt gacctatggg 2581 gccacagctc gtgcctcaga ggcagtcgag gacaccttcc gtttccgtgt cacagctcca 2641 ccatatttct ccccactcta taccttcccc atccacattg gtggtgaccc agatgcgcct 2701 gtcctcacca atgtcctcct cgtggtgcct gagggtggtg agggtgtcct ctctgctgac 2761 cacctctttg tcaagagtct caacagtgcc agctacctct atgaggtcat ggagcggccc 2821 cgccttggga ggttggcttg gcgtgggaca caggacaaga ccactatggt gacatccttc 2881 accaatgaag acctgttgcg tggccggctg gtctaccagc atgatgactc cgagaccaca 2941 gaagatgata tcccatttgt tgctacccgc cagggcgaga gcagtggtga catggcctgg 3001 gaggaggtac ggggtgtctt ccgagtggcc atccagcccg tgaatgacca cgcccctgtg 3061 cagaccatca gccggatctt ccatgtggcc cggggtgggc ggcggctgct gactacagac 3121 gacgtggcct tcagcgatgc tgactcgggc tttgctgacg cccagctggt gcttacccgc 3181 aaggacctcc tctttggcag tatcgtggcc gtagatgagc ccacgcggcc catctaccgc 3241 ttcacccagg aggacctcag gaagaggcga gtactgttcg tgcactcagg ggctgaccgt 3301 ggctggatcc agctgcaggt gtccgacggg caacaccagg ccactgcgct gctggaggtg 3361 caggcctcgg aaccctacct ccgtgtggcc aacggctcca gccttgtggt ccctcaagga 3421 ggccagggca ccatcgacac ggccgtgctc cacctggaca ccaacctcga catccgcagt 3481 ggggatgagg tccactacca cgtcacagct ggccctcgct ggggacaact agtccgggct 3541 ggtcagccag ccacagcctt ctcccagcag gacctgctgg atggggccgt tctctatagc 3601 cacaatggca gcctcagccc cgaagacacc atggccttct ccgtggaagc agggccagtg 3661 cacacggatg ccaccctaca agtgaccatt gccctagagg gcccactggc cccactgaag 3721 ctggtccggc acaagaagat ctacgtcttc cagggagagg cagctgagat cagaagggac 3781 cagctggagg cagcccagga ggcagtgcca cctgcagaca tcgtattctc agtgaagagc 3841 ccaccgagtg ccggctacct ggtgatggtg tcgcgtggcg ccttggcaga tgagccaccc 3901 agcctggacc ctgtgcagag cttctcccag gaggcagtgg acacaggcag ggtcctgtac 3961 ctgcactccc gccctgaggc ctggagcgat gccttctcgc tggatgtggc ctcaggcctg 4021 ggtgctcccc tcgagggcgt ccttgtggag ctggaggtgc tgcccgctgc catcccacta 4081 gaggcgcaaa acttcagcgt ccctgagggt ggcagcctca ccctggcccc tccactgctc 4141 cgtgtctccg ggccctactt ccccactctc ctgggcctca gcctgcaggt gctggagcca 4201 ccccagcatg gacccctgca gaaggaggac ggacctcaag ccaggaccct cagcgccttc 4261 tcctggagaa tggtggaaga gcagctgatc cgctacgtgc atgacgggag cgagacactg 4321 acagacagtt ttgtcctgat ggctaatgcc tccgagatgg atcgccagag ccatcctgtg 4381 gccttcactg tcactgtcct gcctgtcaat gaccaacccc ccatcctcac tacaaacaca 4441 ggcctgcaga tgtgggaggg ggccactgcg cccatccctg cggaggctct gaggagcacg 4501 gacggcgact ctgggtctga ggatctggtc tacaccatcg agcagcccag caacgggcgg 4561 gtagtgctgc ggggggcgcc gggcactgag gtgcgcagct tcacgcaggc ccagctggac 4621 ggcgggctcg tgctgttctc acacagagga accctggatg gaggcttccc gttccgcctc 4681 tctgacggcg agcacacttc ccccggacac ttcttccgag tgacggccca gaagcaagtg 4741 ctcctctcgc tgaagggcag ccagacactg actgtctgcc cagggtccgt ccagccactc 4801 agcagtcaga ccctcagggc cagctccagc gcaggcactg acccccagct cctgctctac 4861 cgtgtggtgc ggggccccca gctaggccgg ctgttccacg cccagcagga cagcacaggg 4921 gaggccctgg tgaacttcac tcaggcagag gtctacgctg ggaatattct gtatgagcat 4981 gagatgcccc ccgagccctt ttgggaggcc catgataccc tagagctcca gctgtcctcg 5041 ccgcctgccc gggacgtggc cgccaccctt gctgtggctg tgtcttttga ggctgcctgt 5101 ccccagcgcc ccagccacct ctggaagaac aaaggtctct gggtccccga gggccagcgg 5161 gccaggatca ccgtggctgc tctggatgcc tccaatctct tggccagcgt tccatcaccc 5221 cagcgctcag agcatgatgt gctcttccag gtcacacagt tccccagccg gggccagctg 5281 ttggtgtccg aggagcccct ccatgctggg cagccccact tcctgcagtc ccagctggct 5341 gcagggcagc tagtgtatgc ccacggcggt gggggcaccc agcaggatgg cttccacttt 5401 cgtgcccacc tccaggggcc agcaggggcc tccgtggctg gaccccaaac ctcagaggcc 5461 tttgccatca cggtgaggga tgtaaatgag cggccccctc agccacaggc ctctgtccca 5521 ctccggctca cccgaggctc tcgtgccccc atctcccggg cccagctgag tgtggtggac 5581 ccagactcag ctcctgggga gattgagtac gaggtccagc gggcacccca caacggcttc 5641 ctcagcctgg tgggtggtgg cctggggccc gtgacccgct tcacgcaagc cgatgtggat 5701 tcagggcggc tggccttcgt ggccaacggg agcagcgtgg caggcatctt ccagctgagc 5761 atgtctgatg gggccagccc acccctgccc atgtccctgg ctgtggacat cctaccatcc 5821 gccatcgagg tgcagctgcg ggcacccctg gaggtgcccc aagctttggg gcgctcctca 5881 ctgagccagc agcagctccg ggtggtttca gatcgggagg agccagaggc agcataccgc 5941 ctcatccagg gaccccagta tgggcatctc ctggtgggcg ggcggcccac ctcggccttc 6001 agccaattcc agatagacca gggcgaggtg gtctttgcct tcaccaactt ctcctcctct 6061 catgaccact tcagagtcct ggcactggct aggggtgtca atgcatcagc cgtagtgaac 6121 gtcactgtga gggctctgct gcatgtgtgg gcaggtgggc catggcccca gggtgccacc 6181 ctgcgcctgg accccaccgt cctagatgct ggcgagctgg ccaaccgcac aggcagtgtg 6241 ccgcgcttcc gcctcctgga gggaccccgg catggccgcg tggtccgcgt gccccgagcc 6301 aggacggagc ccgggggcag ccagctggtg gagcagttca ctcagcagga ccttgaggac 6361 gggaggctgg ggctggaggt gggcaggcca gaggggaggg cccccggccc cgcaggtgac 6421 agtctcactc tggagctgtg ggcacagggc gtcccgcctg ctgtggcctc cctggacttt 6481 gccactgagc cttacaatgc tgcccggccc tacagcgtgg ccctgctcag tgtccccgag 6541 gccgcccgga cggaagcagg gaagccagag agcagcaccc ccacaggcga gccaggcccc 6601 atggcatcca gccctgagcc cgctgtggcc aagggaggct tcctgagctt ccttgaggcc 6661 aacatgttca gcgtcatcat ccccatgtgc ctggtacttc tgctcctggc gctcatcctg 6721 cccctgctct tctacctccg aaaacgcaac aagacgggca agcatgacgt ccaggtcctg 6781 actgccaagc cccgcaacgg cctggctggt gacaccgaga cctttcgcaa ggtggagcca 6841 ggccaggcca tcccgctcac agctgtgcct ggccaggggc cccctccagg aggccagcct 6901 gacccagagc tgctgcagtt ctgccggaca cccaaccctg cccttaagaa tggccagtac 6961 tgggtgtgaa ggcctggcct gggcccagat gctgatcggg ccagggacag gcttgcccat 7021 gtcccgggcc ccattgcttc catgcccggt gctgtctgag tatccccaga gcaagagaga 7081 cctggagaca ccaggggtgg agggtcctgg gagatagtcc caggggtccg ggacagagtg 7141 gagtcaagag ctggaacctc cctcagctca ctccgagcct ggagaactgc aggggccaag 7201 gtggaggcag gcttaagttc agtcctcctg ccctggagct ggtttgggct gtcaaaacca 7261 gggtaacctc ctacatgggt catgactctg ggtcctgggt ctgtgacctt gggtaagtcg 7321 cgcctgaccc aggctgctaa gagggcaagg agaaggaagt accctgggga gggaagggac 7381 agaggaagct attcctggct tttctactcc aacccaggcc accctttgtc tctnccccag 7441 agttgagaaa aaaacttcct cccctggttt tttagggaga tggtatcccc tggagtagag 7501 ggcaagagga gagagcgcct ccagtctaga aggcataagc caataggata atatattcag 7561 ggtgcagggt gggtaggttg ctctggggat gggtttattt aagggagatt gcaaggaagc 7621 tatttaacat ggtgctgagc tagccaggac tgatggagcc cctgggggtg tgggatggag 7681 gagggtctgc agccagttca ttcccagggc cccatcttga tgggccaagg gctaaacatg 7741 catgtgtcag tggctttgga gcaggctagg ctggggctca tcgagggtct caggccgagg 7801 ccactgtagt gccagtgccc ccctgaggac tagggcaggc agctgggggc acttggttcc 7861 atggagcctg gataaacagt gctttggagg ctctggaaaa aaaaaaaaaa aaaaaaaa // LOCUS HSMDCOADI 1017 bp RNA PRI 13-APR-1994 DEFINITION H.sapiens mRNA for mitochondrial dodecenoyl-CoA delta-isomerase. ACCESSION Z25820 NID g472986 KEYWORDS dodecenoyl-CoA delta-isomerase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1017) AUTHORS Janssen,U. and Stoffel,W. TITLE Mitochondrial 3,2-Trans-Enoyl-CoA Isomerase - cDNA cloning, mitochondrial import and functional expression of the human enzyme JOURNAL Unpublished REFERENCE 2 (bases 1 to 1017) AUTHORS Hofmann,K.O. TITLE Direct Submission JOURNAL Submitted (31-AUG-1993) Hofmann K. O., Universitaet Koeln, Institut fuer Biochemie, Joseph Stelzmann Str. 52, Koeln, Germany, D-50931 FEATURES Location/Qualifiers source 1..1017 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="Liver" /clone_lib="human liver cDNA" CDS 9..917 /EC_number="5.3.3.8" /citation=[1] /codon_start=1 /evidence=experimental /product="dodecenoyl-CoA delta-isomerase" /db_xref="PID:g472987" /db_xref="SWISS-PROT:P42126" /translation="MALVASVRVPARVLLRAGARLPGAALGRTERAAGGGDGARPFGS QRVLVEPDAAAGVAVMKFKNPPVNSLSLEFLTELVISLEKLENDKSFRGVILTSDRPG VFSAGLDLTEMCGRSPAHYAGYWKAVQELWLRLYQSNLVLVSAINGACPAGGCLVALT CDYRILADNPRYCIGLNETQLGIIAPFWLKDTLENTIGHRAAERALQLGLLFPPAEAL QVGIVDQVVPEEQVQSTALSAIAQWMAIPDHARQLTKAMMRKATASRLVTQRDADVQN FVSFISKDSIQKSLQMYLERLKEEKG" mat_peptide 132..914 /EC_number="5.3.3.8" /citation=[1] /product="dodecenoyl-CoA delta-isomerase" BASE COUNT 202 a 303 c 341 g 171 t ORIGIN 1 cggtcaagat ggcgctggtg gcttctgtgc gagtcccggc gcgcgttctg ctccgcgcgg 61 gggcccggct cccgggcgcg gccctcgggc ggacggagcg ggcggccggc ggcggagacg 121 gcgcgcggcc gttcgggagc cagcgggtgc tggtggagcc ggacgcggcc gcaggggtcg 181 ctgtgatgaa attcaagaac cccccagtga acagcctgag cctggagttt ctgacggagc 241 tggtcatcag cctggagaag ctggagaatg acaagagctt ccgcggtgtc attctgacct 301 cggaccgccc gggtgtcttc tcggccggcc tggacctgac ggagatgtgt gggaggagcc 361 ccgcccacta cgctgggtac tggaaggccg ttcaggagct gtggctgcgg ttgtaccagt 421 ccaacctggt gctggtctcc gccatcaacg gagcctgccc cgctggaggc tgcctggtgg 481 ccctgacctg tgactaccgc atcctggcgg acaaccccag gtactgcata ggactcaatg 541 agacccagct gggcatcatc gcccctttct ggttgaaaga caccctggag aacaccatcg 601 ggcaccgggc ggcggagcgt gccctgcagc tggggctgct cttcccgccg gcggaggccc 661 tgcaggtggg catagtggac caggtggtcc cggaggagca ggtgcagagc actgcgctgt 721 cagcgatagc ccagtggatg gccattccag accatgctcg acagctgacc aaggccatga 781 tgcgaaaggc cacggccagc cgcctggtca cgcagcgcga tgcggacgtg cagaacttcg 841 tcagcttcat ctccaaagac tccatccaga agtccctgca gatgtactta gagaggctca 901 aagaagaaaa aggctaacga ttgggctgcc acaggcttac ggccacacgt gcccctgtgg 961 gtcccaggga ggtcttaaac aaggtatttt tcaacttaaa aaaaaaaaaa aaaaaaa // LOCUS HSMDCR 2020 bp RNA PRI 18-OCT-1995 DEFINITION H.sapiens mRNA for MDR15 protein. ACCESSION X68829 NID g840783 KEYWORDS chemotaxis; G-protein coupled receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2020) AUTHORS Moser,B. TITLE Direct Submission JOURNAL Submitted (27-OCT-1992) B. Moser, University of Bern Theodor-Kocher Inst., Freistrasse 1, P.O. Box 99, CH-3000 Bern 9, SWITZERLAND REFERENCE 2 (bases 1 to 2020) AUTHORS Barella,L., Loetscher,M., Tobler,A., Baggiolini,M. and Moser,B. TITLE Sequence variation of a novel heptahelical leucocyte receptor through alternative transcript formation JOURNAL Biochem. J. 309 (Pt 3), 773-779 (1995) MEDLINE 95366951 FEATURES Location/Qualifiers source 1..2020 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="blood" /cell_type="mononuclear leukocytes" CDS 289..1272 /codon_start=1 /product="MDCR15 protein" /db_xref="PID:g840784" /translation="MASFKAVFVPVAYSLIFLLGVIGNVLVLVILERHRQTRSSTETF LFHLAVADLLLVFILPFAVAEGSVGWVLGTFLCKTVIALHKVNFYCSSLLLACIAVDR YLAIVHAVHAYRHRRLLSIHITCGTIWLVGFLLALPEILFAKVSQGHHNNSLPRCTFS QENQAETHAWFTSRFLYHVAGFLLPMLVMGWCYVGVVHRLRQAQRRPQRQKAVRVAIL VTSIFFLCWSPYHIVIFLDTLARLKAVDNTCKLNGSLPVAITMCEFLGLAHCCLNPML YTFAGVKFRSDLSRLLTKLGCTSPASLCQLFPSWRRSSLSESENATSLTTF" BASE COUNT 400 a 658 c 525 g 437 t ORIGIN 1 ccactctaag gaatgcggtc cctttgacag gcgaaaaact gaagttggaa aagacaaagt 61 gatttgttca aaattgaaat ttgaaacttg acatttggtc agtgggccct atgtaggaaa 121 aaacctccaa gagagctagg gttcctctca gagaggaaag acaggtcctt aggtcctcac 181 cctcccgtct ccttgccctt gcagttctgg gaactggaca gattggacaa ctataacgac 241 acctccctgg tggaaaatca tctctgccct gccacagagg ggcccctcat ggcctccttc 301 aaggccgtgt tcgtgcccgt ggcctacagc ctcatcttcc tcctgggcgt gatcggcaac 361 gtcctggtgc tggtgatcct ggagcggcac cggcagacac gcagttccac ggagaccttc 421 ctgttccacc tggccgtggc cgacctcctg ctggtcttca tcttgccctt tgccgtggcc 481 gagggctctg tgggctgggt cctggggacc ttcctctgca aaactgtgat tgccctgcac 541 aaagtcaact tctactgcag cagcctgctc ctggcctgca tcgccgtgga ccgctacctg 601 gccattgtcc acgccgtcca tgcctaccgc caccgccgcc tcctctccat ccacatcacc 661 tgtgggacca tctggctggt gggcttcctc cttgccttgc cagagattct cttcgccaaa 721 gtcagccaag gccatcacaa caactccctg ccacgttgca ccttctccca agagaaccaa 781 gcagaaacgc atgcctggtt cacctcccga ttcctctacc atgtggcggg attcctgctg 841 cccatgctgg tgatgggctg gtgctacgtg ggggtagtgc acaggttgcg ccaggcccag 901 cggcgccctc agcggcagaa ggcagtcagg gtggccatcc tggtgacaag catcttcttc 961 ctctgctggt caccctacca catcgtcatc ttcctggaca ccctggcgag gctgaaggcc 1021 gtggacaata cctgcaagct gaatggctct ctccccgtgg ccatcaccat gtgtgagttc 1081 ctgggcctgg cccactgctg cctcaacccc atgctctaca ctttcgccgg cgtgaagttc 1141 cgcagtgacc tgtcgcggct cctgaccaag ctgggctgta ccagccctgc ctccctgtgc 1201 cagctcttcc ctagctggcg caggagcagt ctctctgagt cagagaatgc cacctctctc 1261 accacgttct aggtcccagt gtcccctttt attgctgctt ttccttgggg caggcagtga 1321 tgctggatgc tccttccaac aggagctggg atcctaaggg ctcaccgtgg ctaagagtgt 1381 cctaggagta tcctcatttg gggtagctag aggaaccaac ccccatttct agaacatccc 1441 tgccagctct tctgccggcc ctggggctag gctggagccc agggagcgga aagcagctcg 1501 aaggcacagt gaaggctgtc cttacccatc tgcacccccc tgggctgaga gaacctcacg 1561 cacctcccat cctaatcatt caatgctgaa gaaacaactt ctacttctgc ccttgccaac 1621 ggagagcgcc tgcccctccc agaacacact ccatcagctt aggggctgct gacctccaca 1681 gcttcccctc tctcctcctg cccacctgtc aaacaaagcc agaagctgag caccagggga 1741 tgagtggagg ttaaggctga ggaaaggcca gctggcagca gagtgtggcc ttcggacaac 1801 tcagtcccta aaaacacaga cattctgcca ggcccccaag cctgcagtca tcttgaccaa 1861 gcaggaagct cagactggtt gagttcaggt agctgcccct ggctctgacc gaaacagcgc 1921 tgggtccacc ccatgtcacc ggatcctggg tggtctgcag gcagggctga ctctaggtgc 1981 ccttggaggc cagccagtga cctgaggaag cgtgaaggcc // LOCUS HSMEF2 2975 bp mRNA PRI 26-JAN-1998 DEFINITION H.sapiens mRNA for myocyte-specific enhancer factor 2 (MEF2). ACCESSION X68505 NID g34535 KEYWORDS alternative splicing; DNA-binding protein; MADS box; mef2 gene; muscle specific protein; transcription factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2975) AUTHORS Pollock,R. and Treisman,R. TITLE Human SRF-related proteins: DNA-binding properties and potential regulatory targets JOURNAL Genes Dev. 5 (12A), 2327-2341 (1991) MEDLINE 92084105 REFERENCE 2 (bases 1 to 2975) AUTHORS Yu,Y.T., Breitbart,R.E., Smoot,L.B., Lee,Y., Mahdavi,V. and Nadal-Ginard,B. TITLE Human myocyte-specific enhancer factor 2 comprises a group of tissue-restricted MADS box transcription factors JOURNAL Genes Dev. 6 (9), 1783-1798 (1992) MEDLINE 92387551 REFERENCE 3 (bases 1 to 2975) AUTHORS Breitbart,R. TITLE Direct Submission JOURNAL Submitted (25-SEP-1992) R. Breitbart, Children's Hospital, Harvard Medical School, Dept. of Cardiology, 300 Longwood Ave., Boston, MA 02115, USA COMMENT Related sequences:MEF2 and MEFa are isoforms of the same gene that also encodes the human SRF-related clones RSRFC4 and RSRFC9 (acc# x63381). FEATURES Location/Qualifiers source 1..2975 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="heart & skeletal muscle" /clone_lib="lambda gt11+ZAP II" 5'UTR <1..414 allele 55..273 /citation=[2] /replace="aa" repeat_region 56..272 /note="alternatively spliced Alu repeat" gene 415..1938 /gene="mef2" CDS 415..1938 /gene="mef2" /codon_start=1 /product="myocyte-specific enhancer factor 2 (MEF2)" /db_xref="PID:g34536" /db_xref="SWISS-PROT:Q02078" /translation="MGRKKIQITRIMDERNRQVTFTKRKFGLMKKAYELSVLCDCEIA LIIFNSSNKLFQYASTDMDKVLLKYTEYNEPHESRTNSDIVEALNKKEHRGCDSPDPD TSYVLTPHTEEKYKKINEEFDNMMRNHKIAPGLPPQNFSMSVTVPVTSPNALSYTNPG SSLVSPSLAASSTLTDSSMLSPPQTTLHRNVSPGAPQRPPSTGNAGGMLSTTDLTVPN GAGSSPVGNGFVNSRASPNLIGATGANSLGKVMPTKSPPPPGGGNLGMNSRKPDLRVV IPPSSKGMMPPLSEEEELELNTQRISSSQATQPLATPVVSVTTPSLPPQGLVYSAMPT AYNTDYSLTSADLSALQGFNSPGMLSLGQVSAWQQHHLGQAALSSLVAGGQLSQGSNL SINTNQNISIKSEPISPPRDRMTPSGFQQQQQQQQQQQPPPPPQPQPQPPQPQPRQEM GRSPVDSLSSSSSSYDGSDREDPRGDFHSPIVLGRPPNTEDRESPSVKRMRMDAWVT" misc_feature 673..810 /gene="mef2" /note="alternative coding exon" allele 1278..1303 /gene="mef2" /note="peptide absent in RSRFC4/9" /citation=[1] /replace="" 3'UTR 1939..>2975 BASE COUNT 914 a 659 c 628 g 774 t ORIGIN 1 gaattttctg caaggatcat atctaagtgc actttttgct gatacttcat ttctagacat 61 tgagtctcac tctacccccc aggctgaagt gcagtggtgt gatctcggtt cactgcaacc 121 tccgcctcca ggttcaagtg attctcgtac ctcagcctcc cgagtagctg ggattacagg 181 cgcctgccac catgcctggc tgatatttat atttttagta gagatggagt ttcaccatgt 241 tggccaggct ggtctcgaac tctggacctc agatcttgta gaaaatttca gctgtagccc 301 ttggactaga agctgaaata acagaagctg tgtacgatgc attagggtat tgaagaaaat 361 taacttttga attaaatatt tggaatataa ggaaataagg aaagttgact gaaaatgggg 421 cggaagaaaa tacaaatcac acgcataatg gatgaaagga accgacaggt cacttttaca 481 aagagaaagt ttggattaat gaagaaagcc tatgaactta gtgtgctctg tgactgtgaa 541 atagcactca tcattttcaa cagctctaac aaactgtttc aatatgctag cactgatatg 601 gacaaagttc ttctcaagta tacagaatat aatgaacctc atgaaagcag aaccaactcg 661 gatattgttg aggctctgaa caagaaggaa cacagagggt gcgacagccc agaccctgat 721 acttcatatg tgctaactcc acatacagaa gaaaaatata aaaaaattaa tgaggaattt 781 gataatatga tgcggaatca taaaatcgca cctggtctgc cacctcagaa cttttcaatg 841 tctgtcacag ttccagtgac cagccccaat gctttgtcct acactaaccc agggagttca 901 ctggtgtccc catctttggc agccagctca acgttaacag attcaagcat gctctctcca 961 cctcaaacca cattacatag aaatgtgtct cctggagctc ctcagagacc accaagtact 1021 ggcaatgcag gtgggatgtt gagcactaca gacctcacag tgccaaatgg agctggaagc 1081 agtccagtgg ggaatggatt tgtaaactca agagcttctc caaatttgat tggagctact 1141 ggtgcaaata gcttaggcaa agtcatgcct acaaagtctc cccctccacc aggtggtggt 1201 aatcttggaa tgaacagtag gaaaccagat cttcgagttg tcatcccccc ttcaagcaag 1261 ggcatgatgc ctccactatc ggaggaagag gaattggagt tgaacaccca aaggatcagt 1321 agttctcaag ccactcaacc tcttgctacc ccagtcgtgt ctgtgacaac cccaagcttg 1381 cctccgcaag gacttgtgta ctcagcaatg ccgactgcct acaacactga ttattcactg 1441 accagcgctg acctgtcagc ccttcaaggc ttcaactcgc caggaatgct gtcgctggga 1501 caggtgtcgg cctggcagca gcaccaccta ggacaagcag ccctcagctc tcttgttgct 1561 ggagggcagt tatctcaggg ttccaattta tccattaata ccaaccaaaa catcagcatc 1621 aagtccgaac cgatttcacc tcctcgggat cgtatgaccc catcgggctt ccagcagcag 1681 cagcagcagc agcagcagca gcagccgccg ccaccaccgc agccccagcc acaacccccg 1741 cagccccagc cccgacagga aatggggcgc tcccctgtgg acagtctgag cagctctagt 1801 agctcctatg atggcagtga tcgggaggat ccacggggcg acttccattc tccaattgtg 1861 cttggccgac ccccaaacac tgaggacaga gaaagccctt ctgtaaagcg aatgaggatg 1921 gacgcgtggg tgacctaagg cttccaagct gatgtttgta cttttgtgtt actgcagtga 1981 cctgccctac atatctaaat cggtaaataa ggacatgagt taaatatatt tatatgtaca 2041 tacatatata tatcccttta catatatatg tatgtgggtg tgagtgtgtg tgtatgtgtg 2101 ggtgtgtgtt acatacacag aatcaggcac ttacctgcaa actccttgta ggtctgcaga 2161 tgtgtgtccc atggcagaca aagcaccctg taggcacaga caagtctggc acttccttgg 2221 actacttgtt tcgtaaagat aaccagtttt tgcagagaaa cgtgtaccca tatataattc 2281 tcccacacta gcttgcagaa acctagaggg ccccctactt gttttattta actgtgcagt 2341 gactgtagtt acttaagaga aaatgctttg tagaacagag cagtagaaaa gcaggaacca 2401 agaaagcaat actgtacata aaatgtcatt tatattttcc aacctggcat gggtgtctgt 2461 tgcaaagggg tgcatgggaa agggctgttg atattaaaaa caaacaaaac aaaaaagccc 2521 cacacataac tgttttgcac gtgcaaaaat gtattgggtc aagaagtgat ctttagctaa 2581 taaagaaaga gaatagaaaa cacgcatgag atattcagaa aatactagcc tagaaatata 2641 gagcattaac aaaggaaaat taatatatta agttataatt ggaatatgtc agaagtttct 2701 ttttacattc atatcttaaa aattaaagaa actgatttta gctcatgtat attttatatg 2761 aaagaaaaca cccttatgaa ttgatgacta tatataaaat tatattcact acttttgaac 2821 acattctgct atgaattatt tatataagcc aaagctatat gttgtaactt ttttttagag 2881 aatagcttta tcttggttta actctttagt tttattttaa gaggggaaaa caaaaatatc 2941 ttgcaagcag aaccttgaaa aaaaaaaagg aattc // LOCUS HSMEORPRA 2095 bp RNA PRI 20-JUL-1993 DEFINITION H.sapiens membrane organizing protein gene. ACCESSION Z22664 NID g312042 KEYWORDS membrane organizing protein; Neurofibromatosis Type 2. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2095) AUTHORS Lutchman,M. TITLE Direct Submission JOURNAL Submitted (05-MAY-1993) Mohini Lutchman, Neurology/Neurosurgery, McGill University, 1650, Cedar Avenue, Montreal, Quebec, H3G 1A4, Canada REFERENCE 2 (bases 1 to 2095) AUTHORS Rouleau,G.A., Merel,P., Lutchman,M., Sanson,M., Zucman,J., Marineau,C., Hoang-Xuan,K., Demczuk,S., Desmaze,C., Plougastel,B., Pulst,S., Lenoir,G., Bijlsma,E., Fashold,R., Dumanski,J., de Jong,P., Parry,D., Eldrige,R., Aurias,A., Delattre,O. and Thomas,G. TITLE Alteration in a new gene encoding a putative membrane-organizing protein causes neuro-fibromatosis type 2 JOURNAL Nature 363 (6429), 515-521 (1993) MEDLINE 93281181 FEATURES Location/Qualifiers source 1..2095 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="N1.1" /dev_stage="fetus" /tissue_type="brain" /clone_lib="Stratagene Lambda Zap fetal brain library" /chromosome="chromosome 22" /germline 5'UTR 1..174 CDS 175..1962 /note="putative product" /codon_start=1 /product="membrane organizing protein" /db_xref="PID:g312043" /db_xref="SWISS-PROT:P35240" /translation="MAGAIASRMSFSSLKRKQPKTFTVRIVTMDAEMEFNCEMKWKGK DLFDLVCRTLGLRETWFFGLQYTIKDTVAWLKMDKKVLDHDVSKEEPVTFHFLAKFYP ENAEEELVQEITQHLFFLQVKKQILDEKIYCPPEASVLLASYAVQAKYGDYDPSVHKR GFLAQEELLPKRVINLYQMTPEMWEERITAWYAEHRGRARDEAEMEYLKIAQDLEMYG VNYFAIRNKKGTELLLGVDALGLHIYDPENRLTPKISFPWNEIRNISYSDKEFTIKPL DKKIDVFKFNSSKLRVNKLILQLCIGNHDLFMRRRKADSLEVQQMKAQAREEKARKQM ERQRLAREKQMREEAERTRDELERRLLQMKEEATMANEALMRSEETADLLAEKAQITE EEAKLLAQKAAEAEQEMQRIKATAIRTEEEKRLMEQKVLEAEVLALKMAEESERRAKE ADQLKQDLQEAREAERRAKQKLLEIATKPTYPPMNPIPAPLPPDIPSFNLIGDSLSFD FKDTDMKRLSMEIEKEKVEYMEKSKHLQEQLNELKTEIEALKLKERETALDILHNENS DRGGSSKHNTIKKLTLQSAKSRVAFFEEL" misc_feature join(198..200,204..206) /note="phosphorylation-2 site" misc_feature 219..230 /note="histone-methylation site" misc_feature 258..266 /note="phosphorylation site" misc_feature 408..419 /note="cytochrome c_methylation site" misc_feature 435..443 /note="phosphorylation site" misc_feature 440..451 /note="cytochrome c_methylation site" misc_feature 714..722 /note="phosphorylation site" misc_feature 852..863 /note="cytochrome c_methylation site" misc_feature join(933..935,939..941) /note="phosphorylation-2 site" misc_feature 933..1758 /note="alpha-helix" misc_feature join(960..962,969..971) /note="phosphorylation-2 site" misc_feature 963..971 /note="n-glycosylation site" misc_feature 1005..1016 /note="cytochrome c_methylation site" misc_feature join(1026..1028,1035..1037) /note="phosphorylation-2 site" misc_feature 1032..1040 /note="n-glycosylation site" misc_feature 1104..1115 /note="histone-methylation site" misc_feature join(1110..1112,1119..1121) /note="phosphorylation-2 site" misc_feature 1119..1127 /note="phosphorylation site" misc_feature 1164..1175 /note="histone-methylation site" misc_feature 1230..1238 /note="phosphorylation site" misc_feature 1305..1313 /note="phosphorylation site" misc_feature 1314..1322 /note="phosphorylation site" misc_feature 1347..1355 /note="phosphorylation site" misc_feature 1431..1439 /note="phosphorylation site" misc_feature 1533..1710 /note="alpha-helix" misc_feature 1692..1700 /note="phosphorylation site" misc_feature join(1719..1721,1728..1730) /note="phosphorylation-2 site" misc_feature 1720..1728 /note="phosphorylation site" misc_feature join(1728..1730,1772..1774) /note="phosphorylation-2 site" misc_feature join(1878..1880,1887..1889) /note="phosphorylation-2 site" misc_feature 1905..1916 /note="cytochrome c_methylation site" 3'UTR 1963..2095 BASE COUNT 580 a 510 c 617 g 388 t ORIGIN 1 gcagggtcct cgcggcccat gctggccgct ggggacccgc gcagcccaga ccgttcccgg 61 gccggccagc cgccaccatg gtggccctga ggcctgtgca gcaactccag gggggctaaa 121 gggctcagag tgcagcccgt ggggcgcgag ggtcccgggc ctgagccccg cgccatggcc 181 ggggccatcg cttcccgcat gagcttcagc tctctcaaga ggaagcaacc caagacgttc 241 accgtgagga tcgtcaccat ggacgccgag atggagttca attgcgagat gaagtggaaa 301 gggaaggacc tctttgattt ggtgtgccgg actctggggc tccgagaaac ctggttcttt 361 ggactgcagt acacaatcaa ggacacagtg gcctggctca aaatggacaa gaaggtactg 421 gatcatgatg tttcaaagga agaaccagtc acctttcact tcttggccaa attttatcct 481 gagaatgctg aagaggagct ggttcaggag atcacacaac atttattctt cttacaggta 541 aagaagcaga ttttagatga aaagatctac tgccctcctg aggcttctgt gctcctggct 601 tcttacgccg tccaggccaa gtatggtgac tacgacccca gtgttcacaa gcggggattt 661 ttggcccaag aggaattgct tccaaaaagg gtaataaatc tgtatcagat gactccggaa 721 atgtgggagg agagaattac tgcttggtac gcagagcacc gaggccgagc cagggatgaa 781 gctgaaatgg aatatctgaa gatagctcag gacctggaga tgtacggtgt gaactacttt 841 gcaatccgga ataaaaaggg cacagagctg ctgcttggag tggatgccct ggggcttcac 901 atttatgacc ctgagaacag actgaccccc aagatctcct tcccgtggaa tgaaatccga 961 aacatctcgt acagtgacaa ggagtttact attaaaccac tggataagaa aattgatgtc 1021 ttcaagttta actcctcaaa gcttcgtgtt aataagctga ttctccagct atgtatcggg 1081 aaccatgatc tatttatgag gagaaggaaa gccgattctt tggaagttca gcagatgaaa 1141 gcccaggcca gggaggagaa ggctagaaag cagatggagc ggcagcgcct cgctcgagag 1201 aagcagatga gggaggaggc tgaacgcacg agggatgagt tggagaggag gctgctgcag 1261 atgaaagaag aagcaacaat ggccaacgaa gcactgatgc ggtctgagga gacagctgac 1321 ctgttggctg aaaaggccca gatcaccgag gaggaggcaa aacttctggc ccagaaggcc 1381 gcagaggctg agcaggaaat gcagcgcatc aaggccacag cgattcgcac ggaggaggag 1441 aagcgcctga tggagcagaa ggtgctggaa gccgaggtgc tggcactgaa gatggctgag 1501 gagtcagaga ggagggccaa agaggcagat cagctgaagc aggacctgca ggaagcacgc 1561 gaggcggagc gaagagccaa gcagaagctc ctggagattg ccaccaagcc cacgtacccg 1621 cccatgaacc caattccagc accgttgcct cctgacatac caagcttcaa cctcattggt 1681 gacagcctgt ctttcgactt caaagatact gacatgaagc ggctttccat ggagatagag 1741 aaagaaaaag tggaatacat ggaaaagagc aagcatctgc aggagcagct caatgaactc 1801 aagacagaaa tcgaggcctt gaaactgaaa gagagggaga cagctctgga tattctgcac 1861 aatgagaact ccgacagggg tggcagcagc aagcacaata ccattaaaaa gctcaccttg 1921 cagagcgcca agtcccgagt ggccttcttt gaagagctct agcaggtgac ccagccaccc 1981 caggacctgc cacttctcct gctaccggga ccgcgggatg gaccagatat caagagagcc 2041 atccataggg agctggctgg gggtttccgt gggagctcca gaactttccc cagct // LOCUS HSMETTRAN 769 bp RNA PRI 17-JAN-1995 DEFINITION H.sapiens mRNA for methyltransferase. ACCESSION X54228 NID g34558 KEYWORDS methyltransferase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 769) AUTHORS Hayakawa,H., Koike,G. and Sekiguchi,M. TITLE Expression and cloning of complementary DNA for a human enzyme that repairs O6-methylguanine in DNA JOURNAL J. Mol. Biol. 213 (4), 739-747 (1990) MEDLINE 90294292 FEATURES Location/Qualifiers source 1..769 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Raji (Mer+)" /clone="1H3" CDS 41..664 /codon_start=1 /product="O-6-methylguanine-DNA methyltransferase" /db_xref="PID:g34559" /db_xref="SWISS-PROT:P16455" /translation="MDKDCEMKRTTLDSPLGKLELSGCEQGLHEIKLLGKGTSAADAV EVPAPAAVLGGPEPLMQCTAWLNAYFHQPEAIEEFPVPALHHPVFQQESFTRQVLWKL LKVVKFGEVISYQQLAALAGNPKAARAVGGAMRGNPVPILIPCHRVVCSSGAVGNYSG GLAVKEWLLAHEGHRLGKPGLGGSSGLAGAWLKGAGATSGSPPAGRN" polyA_site 769 BASE COUNT 162 a 216 c 249 g 142 t ORIGIN 1 ccccccccgc cccccccgcc gccccttggt acttggaaaa atggacaagg attgtgaaat 61 gaaacgcacc acactggaca gccctttggg gaagctggag ctgtctggtt gtgagcaggg 121 tctgcacgaa ataaagctcc tgggcaaggg gacgtctgca gctgatgccg tggaggtccc 181 agcccccgct gcggttctcg gaggtccgga gcccctgatg cagtgcacag cctggctgaa 241 tgcctatttc caccagcccg aggctatcga agagttcccc gtgccggcac ttcaccatcc 301 cgttttccag caagagtcgt tcaccagaca ggtgttatgg aagctgctga aggttgtgaa 361 attcggagaa gtgatttctt accagcaatt agcagccctg gcaggcaacc ccaaagccgc 421 gcgagcagtg ggaggagcaa tgagaggcaa tcctgtcccc atcctcatcc cgtgccacag 481 agtggtctgc agcagcggag ccgtgggcaa ctactccgga ggactggccg tgaaggaatg 541 gcttctggcc catgaaggcc accggttggg gaagccaggc ttgggaggga gctcaggtct 601 ggcaggggcc tggctcaagg gagcgggagc tacctcgggc tccccgcctg ctggccgaaa 661 ctgagtatgt gcagtaggat ggatgtttga gcgacacaca cgtgtaacac tgcatcggat 721 gcggggcgtg gaggcaccgc tgtattaaag gaagtggcag tgtcctggg // LOCUS HSMFH1 3289 bp DNA PRI 14-MAY-1997 DEFINITION H.sapiens MFH-1 gene. ACCESSION Y08223 NID g1869804 KEYWORDS mesenchyme fork head-1 protein; MFH-1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3289) AUTHORS Miura,N. TITLE Direct Submission JOURNAL Submitted (18-SEP-1996) N. Miura, Akita University School of Medicine, Department of Biochemistry, 1-1-1 Hondo, Akita 010, JAPAN FEATURES Location/Qualifiers source 1..3289 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1197..2702 /gene="MFH-1" CDS 1197..2702 /gene="MFH-1" /codon_start=1 /product="Mesenchyme Fork Head-1" /db_xref="PID:e303016" /db_xref="PID:g1869805" /translation="MQARYSVSDPNALGVVPYLSEQNYYRAAGSYGGMASPMGVYSGH PEQYSAGMGRSYAPYHHHQPAAPKDLVKPPYSYIALITMAIQNAPEKKITLNGIYQFI MDRFPFYRENKQGWQNSIRHNLSLNECFVKVPRDDKKPGKGSYWTLDPDSYNMFENGS FLRRRRRFKKKDVSKEKEERAHLKEPPPAASKGAPATPHLADAPKEAEKKVVIKSEAA SPALPVITKVETLSPESALQGSPRSAASTPAGSPDGSLPEHHAAAPNGLPGFSVENIM TLRTSPPGGELSPGAGRAGLVVPPLALPYAAAPPAAYGQPCAQGLEAGAAGGYQCSMR AMSLYTGAERPAHMCVPPALDEALSDHPSGPTSPLSALNLAAGQEGALAATGHHHQHH GHHHPQAPPPPPAPQPQPTPQPGAAAAQAASWYLNHSGDLNHLPGHTFAAQQQTFPNV REMFNSHRLGIENSTLGESQVSGNASCQLPYRSTPPLYRHAAPYSYDCTKY" BASE COUNT 639 a 1125 c 925 g 600 t ORIGIN 1 gaattcggag gattaagttg tcagtcagca cgttgctacc ttcccctcta tgcactccgc 61 tgcctggctc ctcggcgggg agcgagggaa actcagtttg tagggtttac ctctaaaacc 121 tcgataggtt atccttgacg accccgagcc tggaaactcc ctgttgatga ttaattattt 181 gattaaataa gtataacatc caggagaggc cctgccattc caatccagcg cgtttgcttt 241 tgaatccatt acacctgggc ccccataatt aggaaatcta attattcgct tcatcactca 301 ttaataagaa aaatgtccca ggatcattgc tacttacaag gtctttggga gagatatttt 361 actctattaa tccattctat tttatatttc aaattgattt tttttaacag aggaaagtgg 421 ctatcttttt gttttgggca tgtgggccca ttcaccaaaa tgtgatcata aaataaattt 481 taataagata taacttttta aaaagttttc aagtgaagac ggagtcgccg cggaggccgg 541 ggcggcgggg tcttagagcc gacggattcc tgcgctcctc gccccgattg gcgccggact 601 cctctcagct gccgggtgat tggctcaaag ttccgggagg gggcgtggcc cgaggaaagt 661 aaaaactcgc tttcagcaag aagacttttg aaacttttcc caatccctaa aagggacttg 721 gcctcttttt ctgggctcag cggggcagcc gctcggaccc cggcgcgctg accctcgggg 781 ctgccgattc gctgggggct tggagagcct cctgcgcccc tcctcgcgcg ggccgagggt 841 ccaccttggt ccccaggccg cggcgtctcc gctgggtccg cggccgcccg cctgcccgcg 901 ctgccgccgc cgggtcctgg agccagcgag gagcggggcc ggcgctgcgc ttgcccgggg 961 cgcgccctcc aggatgccga tccgcccggt ccgctgaaag cgcgcgcccc tgctcggccc 1021 gagcgacgac gaccgcgcac cctcgccccg gaggctgcca ggagaccggg gccgcccctc 1081 ccgctcccct cctctccccc tctggctctc tcgcgctctc tcgctctcag ggcccccctc 1141 gctcccccgg ccgcagtccg tgcgcgaggg cgccggcgag ccgtctcgga agcagcatgc 1201 aggcgcgcta ctccgtgtcc gaccccaacg ccctgggagt ggtgccctac ctgagcgagc 1261 agaattacta ccgggctgcg ggcagctacg gcggcatggc cagccccatg ggcgtctatt 1321 ccggccaccc ggagcagtac agcgcgggga tgggccgctc ctacgcgccc taccaccacc 1381 accagcccgc ggcgcctaag gacctggtga agccgcccta cagctacatc gcgctcatca 1441 ccatggccat ccagaacgcg cccgagaaga agatcacctt gaacggcatc taccagttca 1501 tcatggaccg cttccccttc taccgggaga acaagcaggg ctggcagaac agcatccgcc 1561 acaacctctc gctcaacgag tgcttcgtca aggtgccccg cgacgacaag aagcccggca 1621 agggcagtta ctggaccctg gacccggact cctacaacat gttcgagaac ggcagcttcc 1681 tgcggcgccg gcggcgcttc aaaaagaagg acgtgtccaa ggagaaggag gagcgggccc 1741 acctcaagga gccgcccccg gcggcgtcca agggcgcccc ggccaccccc cacctagcgg 1801 acgcccccaa ggaggccgag aagaaggtgg tgatcaagag cgaggcggcg tccccggcgc 1861 tgccggtcat caccaaggtg gagacgctga gccccgagag cgcgctgcag ggcagcccgc 1921 gcagcgcggc ctccacgccc gccggctccc ccgacggttc gctgccggag caccacgccg 1981 cggcgcccaa cgggctgcct ggcttcagcg tggagaacat catgaccctg cgaacgtcgc 2041 cgccgggcgg agagctgagc ccgggggccg gacgcgcggg cctggtggtg ccgccgctgg 2101 cgctgccata cgccgccgcg ccgcccgccg cctacggcca gccgtgcgct cagggcctgg 2161 aggccggggc cgccgggggc taccagtgca gcatgcgagc gatgagcctg tacaccgggg 2221 ccgagcggcc ggcgcacatg tgcgtcccgc ccgccctgga cgaggccctc tcggaccacc 2281 cgagcggccc cacgtcgccc ctgagcgctc tcaacctcgc cgccggccag gagggcgcgc 2341 tcgccgccac gggccaccac caccagcacc acggccacca ccacccgcag gcgccgccgc 2401 ccccgccggc tccccagccc cagccgacgc cgcagcccgg ggccgccgcg gcgcaggcgg 2461 cctcctggta tctcaaccac agcggggacc tgaaccacct ccccggccac acgttcgcgg 2521 cccagcagca aactttcccc aacgtgcggg agatgttcaa ctcccaccgg ctggggattg 2581 agaactcgac cctcggggag tcccaggtga gtggcaatgc cagctgccag ctgccctaca 2641 gatccacgcc gcctctctat cgccacgcag ccccctactc ctacgactgc acgaaatact 2701 gacgtgtccc gggacctccc ctccccggcc cgctccggct tcgcttccca gccccgaccc 2761 aaccagacaa ttaaggggct gcagagacgc aaaaaagaaa caaaacatgt ccaccaacct 2821 tttctcagac ccgggagcag agagcgggca cgctagcccc cagccgtctg tgaagagcgc 2881 aggtaacttt aattcgccgc cccgtttctg ggatcccagg aaacccctcc aaagggacgc 2941 agcccaacaa aatgagtatt ggtcttaaaa tccccctccc ctaccaggac ggctgtgctg 3001 tgctcgacct gagctttcaa aagttaagtt atggacccaa atcccatagc gagcccctag 3061 tgactttctg taggggtccc cataggtgta tgggggtctc tatagataat atatgtgctg 3121 tgtgtaattt taaatttctc caaccgtgct gtacaaatgt gtggatttgt aatcaggcta 3181 ttttgttgtt gttgttgttg ttcagagcca ttaatataat atttaaagtt gagttcactg 3241 gataagtttt tcatcttgcc caaccatttc taactgccaa attgaattc // LOCUS HSMGLU7 3021 bp RNA PRI 17-FEB-1997 DEFINITION H.sapiens mRNA for metabotropic glutamate receptor type 7. ACCESSION X94552 NID g1370110 KEYWORDS glutamate receptor; mGlu7 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3021) AUTHORS Makoff,A., Pilling,C., Harrington,K. and Emson,P. TITLE Human metabotropic glutamate receptor type 7: molecular cloning and mRNA distribution in the CNS JOURNAL Brain Res. Mol. Brain Res. 40 (1), 165-170 (1996) MEDLINE 96437220 REFERENCE 2 (bases 1 to 3021) AUTHORS Makoff,A.J. TITLE Direct Submission JOURNAL Submitted (28-DEC-1995) A.J. Makoff, Clinical Pharmacology Unit, Department of Neuroscience, Institute of Psychiatry, Denmark Hill, London SE5 8AF, UK FEATURES Location/Qualifiers source 1..3021 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" gene 239..2986 /gene="mGluR7" CDS 239..2986 /gene="mGluR7" /codon_start=1 /product="glutamate receptor" /db_xref="PID:e218967" /db_xref="PID:g1370111" /translation="MVQLRKLLRVLTLMKFPCCVLEVLLCALAAAARGQEMYAPHSIR IEGDVTLGGLFPVHAKGPSGVPCGDIKRENGIHRLEAMLYALDQINSDPNLLPNVTLG ARILDTCSRDTYALEQSLTFVQALIQKDTSDVRCTNGEPPVFVKPEKVVGVIGASGSS VSIMVANILRLFQIPQISYASTAPELSDDRRYDFFSRVVPPDSFQAQAMVDIVKALGW NYVSTLASEGSYGEKGVESFTQISKEAGGLCIAQSVRIPQERKDRTIDFDRIIKQLLD TPNSRAVVIFANDEDIKQILAAAKRADQVGHFLWVGSDSWGSKINPLHQHEDIAEGAI TIQPKRATVEGFDAYFTSRTLENNRRNVWFAEYWEENFNCKLTISGSKKEDTDRKCTG QERIGKDSNYEQEGKVQFVIDAVYAMAHALHHMNKDLCADYRGVCPEMEQAGGKKLLK YIRNVNFNGSAGTPVMFNKNGDAPGRYDIFQYQTTNTSNPGYRLIGQWTDELQLNIED MQWGKGVREIPASVCTLPCKPGQRKKTQKGTPCCWTCEPCDGYQYQFDEMTCQHCPYD QRPNENRTGCQDIPIIKLEWHSPWAVIPVFLAMLGIIATIFVMATFIRYNDTPIVRAS GRELSYVLLTGIFLCYIITFLMIAKPDVAVCSFRRVFLGLGMCISYAALLTKTNRIYR IFEQGKKSVTAPRLISPTSQLAITSSLISVQLLGVFIWFGVDPPNIIIDYDEHKTMNP EQARGVLKCDITDLQIICSLGYSILLMVTCTVYAIKTRGVPENFNEAKPIGFTMYTTC IVWLAFIPIFFGTAQSAEKLYIQTTTLTISMNLSASVALGMLYMPKVYIIIFHPELNV QKRKRSFKAVVTAATMSSRLSHKPSDRPNGEAKTELCENVDPNSPAAKKKYVSYNNLV I" variation 1536 /gene="mGluR7" /replace="t" BASE COUNT 765 a 827 c 782 g 647 t ORIGIN 1 gtgagcgcga gcgcggcgcg ccggccggct aacccgagag cgcgaggcgc cccaggctgg 61 caggcgccgc gggacccctc accctctctg gtcgcccctc cccggattcc cccaccctcc 121 gtgcctgcag gagcccctgg gctttcccgg aggagctcgc cctgaagggc ccggacctcg 181 gcgagcccac caccgttccc tccagcgccg ccgccgccac cgcagcagcc ggagcagcat 241 ggtccagctg aggaagctgc tccgcgtcct gactttgatg aagttcccct gctgcgtgct 301 ggaggtgctc ctgtgcgcgc tggcggcggc ggcgcgcggc caggagatgt acgccccgca 361 ctcaatccgg atcgaggggg acgtcaccct cggggggctg ttccccgtgc acgccaaggg 421 tcccagcgga gtgccctgcg gcgacatcaa gagggaaaac gggatccaca ggctggaagc 481 gatgctctac gccctggacc agatcaacag tgatcccaac ctactgccca acgtgacgct 541 gggcgcgcgg atcctggaca cttgttccag ggacacttac gcgctcgaac agtcgcttac 601 tttcgtccag gcgctcatcc agaaggacac ctccgacgtg cgctgcacca acggcgaacc 661 gccggttttc gtcaagccgg agaaagtagt tggagtgatt ggggcttcgg ggagttcggt 721 ctccatcatg gtagccaaca tcctgaggct cttccagatc ccccagatta gttatgcatc 781 aacggcaccc gagctaagtg atgaccggcg ctatgacttc ttctctcgcg tggtgccacc 841 cgattccttc caagcccagg ccatggtaga cattgtaaag gccctaggct ggaattatgt 901 gtctaccctc gcatcggaag gaagttatgg agagaaaggt gtggagtcct tcacgcagat 961 ttccaaagag gcaggtggac tctgcattgc ccagtccgtg agaatccccc aggaacgcaa 1021 agacaggacc attgactttg atagaattat caaacagctc ctggacaccc ccaactccag 1081 ggccgtcgtg atttttgcca acgatgagga tataaagcag atccttgcag cagccaaaag 1141 agctgaccaa gttggccatt ttctttgggt gggatcagac agctggggat ccaaaataaa 1201 cccactgcac cagcatgaag atatcgcaga aggggccatc accattcagc ccaagcgagc 1261 cacggtggaa gggtttgatg cctactttac gtcccgtaca cttgaaaaca acagaagaaa 1321 tgtatggttt gccgaatact gggaggaaaa cttcaactgc aagttgacga ttagtgggtc 1381 aaaaaaagaa gacacagatc gcaaatgcac aggacaggag agaattggaa aagattccaa 1441 ctatgagcag gagggtaaag tccagttcgt gattgacgca gtctatgcta tggctcacgc 1501 ccttcaccac atgaacaagg atctctgtgc tgactaccgg ggtgtctgcc cagagatgga 1561 gcaagctgga ggcaagaagt tgctgaagta tatacgcaat gttaatttca atggtagtgc 1621 tggcactcca gtgatgttta acaagaacgg ggatgcacct gggcgttatg acatctttca 1681 gtaccagacc acaaacacca gcaacccggg ttaccgtctg atcgggcagt ggacagacga 1741 acttcagctc aatatagaag acatgcagtg gggtaaagga gtccgagaga tacccgcctc 1801 agtgtgcaca ctaccatgta agccaggaca gagaaagaag acacagaaag gaactccttg 1861 ctgttggacc tgtgagcctt gcgatggtta ccagtaccag tttgatgaga tgacatgcca 1921 gcattgcccc tatgaccaga ggcccaatga aaatcgaacc ggatgccagg atattcccat 1981 catcaaactg gagtggcact ccccctgggc tgtgattcct gtcttcctgg caatgttggg 2041 gatcattgcc accatctttg tcatggccac tttcatccgc tacaatgaca cgcccattgt 2101 ccgggcatct gggcgggaac tcagctatgt tcttttgacg ggcatctttc tttgctacat 2161 catcactttc ctgatgattg ccaaaccaga tgtggcagtg tgttctttcc ggcgagtttt 2221 cttgggcttg ggtatgtgca tcagttatgc agccctcttg acgaaaacaa atcggattta 2281 tcgcatattt gagcagggca agaaatcagt aacagctccc agactcataa gcccaacatc 2341 acaactggca atcacttcca gtttaatatc agttcagctt ctaggggtgt tcatttggtt 2401 tggtgttgat ccacccaaca tcatcataga ctacgatgaa cacaagacaa tgaaccctga 2461 gcaagccaga ggggttctca agtgtgacat tacagatctc caaatcattt gctccttggg 2521 atatagcatt cttctcatgg tcacatgtac tgtgtatgcc atcaagactc ggggtgtacc 2581 cgagaatttt aacgaagcca agcccattgg attcactatg tacacgacat gtatagtatg 2641 gcttgccttc attccaattt tttttggcac cgctcaatca gcggaaaagc tctacataca 2701 aactaccacg cttacaatct ccatgaacct aagtgcatca gtggcgctgg ggatgctata 2761 catgccgaaa gtgtacatca tcattttcca ccctgaactc aatgtccaga aacggaagcg 2821 aagcttcaag gcggtagtca cagcagccac catgtcatcg aggctgtcac acaaacccag 2881 tgacagaccc aacggtgagg caaagaccga gctctgtgaa aacgtagacc caaacagccc 2941 tgctgcaaaa aagaagtatg tcagttataa taacctggtt atctaacctg ttccattcca 3001 tggaaccatg gaggaggaag a // LOCUS HSMGLUR3 3410 bp RNA PRI 23-JUL-1996 DEFINITION H.sapiens mRNA for metabotropic glutamate receptor type 3. ACCESSION X77748 NID g1171563 KEYWORDS metabotropic glutamate receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3410) AUTHORS Makoff,A., Volpe,F., Lelchuk,R., Harrington,K. and Emson,P. TITLE Molecular characterization and localization of human metabotropic glutamate receptor type 3 JOURNAL Brain Res. Mol. Brain Res. 40 (1), 55-63 (1996) MEDLINE 96437205 REFERENCE 2 (bases 1 to 3410) AUTHORS Makoff,A.J. TITLE Direct Submission JOURNAL Submitted (17-FEB-1994) A.J. Makoff, Wellcome Foundaton Ltd, Beckenham, Kent BR3 3BS, UK FEATURES Location/Qualifiers source 1..3410 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" CDS 259..2892 /codon_start=1 /product="metabotropic glutamate receptor type 3 (mGluR3)" /db_xref="PID:e99039" /db_xref="PID:g1171564" /translation="MLTRLQVLTLALFSKGFLLSLGDHNFLRREIKIEGDLVLGGLFP INEKGTGTEECGRINEDRGIQRLEAMLFAIDEINKDDYLLPGVKLGVHILDTCSRDTY ALEQSLEFVRASLTKVDEAEYMCPDGSYAIQENIPLLIAGVIGGSYSSVSIQVANLLR LFQIPQISYASTSAKLSDKSRYDYFARTVPPDFYQAKAMAEILRFFNWTYVSTVASEG DYGETGIEAFEQEARLRNICIATAEKVGRSNIRKSYDSVIRELLQKPNARVVVLFMRS DDSRELIAAASRANASFTWVASDGWGAQESIIKGSEHVAYGAITLELASQPVRQFDRY FQSLNPYNNHRNPWFRDFWEQKFQCSLQNKRNHRRVCDKHLAIDSSNYEQESKIMFVV NAVYAMAHALHKMQRTLCPNTTKLCDAMKILDGKKLYKDYLLKINFTAPFNPNKDADS IVKFDTFGDGMGRYNVFNFQNVGGKYSYLKVGHWAETLSLDVNSIHWSRNSVPTSQCS DPCAPNEMKNMQPGDVCCWICIPCEPYEYLADEFTCMDCGSGQWPTADLTGCYDLPED YIRWEDAWAIGPVTIACLGFMCTCMVVTVFIKHNNTPLVKASGRELCYILLFGVGLSY CMTFFFIAKPSPVICALRRLGLGSSFAICYSALLTKTNCIARIFDGVKNGAQRPKFIS PSSQVFICLGLILVQIVMVSVWLILEAPGTRRYTLAEKRETVILKCNVKDSSMLISLT YDVILVILCTVYAFKTRKCPENFNEAKFIGFTMYTTCIIWLAFLPIFYVTSSDYRVQT TTMCISVSLSGFVVLGCLFAPKVHIILFQPQKNVVTHRLHLNRFSVSGTGTTYSQSSA STYVPTVCNGREVLDSTTSSL" BASE COUNT 894 a 838 c 816 g 862 t ORIGIN 1 cttttgtgtc ggatgaggag gaccaaccat gagccagagc ccgggtgcag gctcaccgcc 61 gccgctgcca ccgcggtcag ctccagttcc tgccaggagt tgtcggtgcg aggaattttg 121 tgacaggctc tgttagtctg ttcctccctt atttgaagga caggccaaag atccagtttg 181 gaaatgagag aggactagca tgacacattg gctccaccat tgatatctcc cagaggtaca 241 gaaacaggat tcatgaagat gttgacaaga ctgcaagttc ttaccttagc tttgttttca 301 aagggatttt tactctcttt aggggaccat aactttctaa ggagagagat taaaatagaa 361 ggtgaccttg ttttaggggg cctgtttcct attaacgaaa aaggcactgg aactgaagaa 421 tgtgggcgaa tcaatgaaga ccgagggatt caacgcctgg aagccatgtt gtttgctatt 481 gatgaaatca acaaagatga ttacttgcta ccaggagtga agttgggtgt tcacattttg 541 gatacatgtt caagggatac ctatgcattg gagcaatcac tggagtttgt cagggcatct 601 ttgacaaaag tggatgaagc tgagtatatg tgtcctgatg gatcctatgc cattcaagaa 661 aacatcccac ttctcattgc aggggtcatt ggtggctctt atagcagtgt ttccatacag 721 gtggcaaacc tgctgcggct cttccagatc cctcagatca gctacgcatc caccagcgcc 781 aaactcagtg ataagtcgcg ctatgattac tttgccagga ccgtgccccc cgacttctac 841 caggccaaag ccatggctga gatcttgcgc ttcttcaact ggacctacgt gtccacagta 901 gcctccgagg gtgattacgg ggagacaggg atcgaggcct tcgagcagga agcccgcctg 961 cgcaacatct gcatcgctac ggcggagaag gtgggccgct ccaacatccg caagtcctac 1021 gacagcgtga tccgagaact gttgcagaag cccaacgcgc gcgtcgtggt cctcttcatg 1081 cgcagcgacg actcgcggga gctcattgca gccgccagcc gcgccaatgc ctccttcacc 1141 tgggtggcca gcgacggctg gggcgcgcag gagagcatca tcaagggcag cgagcatgtg 1201 gcctacggcg ccatcaccct ggagctggcc tcccagcctg tccgccagtt cgaccgctac 1261 ttccagagcc tcaaccccta caacaaccac cgcaacccct ggttccggga cttctgggag 1321 caaaagtttc agtgcagcct ccagaacaaa cgcaaccaca ggcgcgtctg cgacaagcac 1381 ctggccatcg acagcagcaa ctacgagcaa gagtccaaga tcatgtttgt ggtgaacgcg 1441 gtgtatgcca tggcccacgc tttgcacaaa atgcagcgca ccctctgtcc caacactacc 1501 aagctttgtg atgctatgaa gatcctggat gggaagaagt tgtacaagga ttacttgctg 1561 aaaatcaact tcacggctcc attcaaccca aataaagatg cagatagcat agtcaagttt 1621 gacacttttg gagatggaat ggggcgatac aacgtgttca atttccaaaa tgtaggtgga 1681 aagtattcct acttgaaagt tggtcactgg gcagaaacct tatcgctaga tgtcaactct 1741 atccactggt cccggaactc agtccccact tcccagtgca gcgacccctg tgcccccaat 1801 gaaatgaaga atatgcaacc aggggatgtc tgctgctgga tttgcatccc ctgtgaaccc 1861 tacgaatacc tggctgatga gtttacctgt atggattgtg ggtctggaca gtggcccact 1921 gcagacctaa ctggatgcta tgaccttcct gaggactaca tcaggtggga agacgcctgg 1981 gccattggcc cagtcaccat tgcctgtctg ggttttatgt gtacatgcat ggttgtaact 2041 gtttttatca agcacaacaa cacacccttg gtcaaagcat cgggccgaga actctgctac 2101 atcttattgt ttggggttgg cctgtcatac tgcatgacat tcttcttcat tgccaagcca 2161 tcaccagtca tctgtgcatt gcgccgactc gggctgggga gttccttcgc tatctgttac 2221 tcagccctgc tgaccaagac aaactgcatt gcccgcatct tcgatggggt caagaatggc 2281 gctcagaggc caaaattcat cagccccagt tctcaggttt tcatctgcct gggtctgatc 2341 ctggtgcaaa ttgtgatggt gtctgtgtgg ctcatcctgg aggccccagg caccaggagg 2401 tatacccttg cagagaagcg ggaaacagtc atcctaaaat gcaatgtcaa agattccagc 2461 atgttgatct ctcttaccta cgatgtgatc ctggtgatct tatgcactgt gtacgccttc 2521 aaaacgcgga agtgcccaga aaatttcaac gaagctaagt tcataggttt taccatgtac 2581 accacgtgca tcatctggtt ggccttcctc cctatatttt atgtgacatc aagtgactac 2641 agagtgcaga cgacaaccat gtgcatctct gtcagcctga gtggctttgt ggtcttgggc 2701 tgtttgtttg cacccaaggt tcacatcatc ctgtttcaac cccagaagaa tgttgtcaca 2761 cacagactgc acctcaacag gttcagtgtc agtggaactg ggaccacata ctctcagtcc 2821 tctgcaagca cgtatgtgcc aacggtgtgc aatgggcggg aagtcctcga ctccaccacc 2881 tcatctctgt gattgtgaat tgcagttcag ttcttgtgtt tttagactgt tagacaaaag 2941 tgctcacgtg cagctccaga atatggaaac agagcaaaag aacaacccta gtaccttttt 3001 ttagaaacag tacgataaat tatttttgag gactgtatat agtgatgtgc tagaactttc 3061 taggctgagt ctagtgcccc tattattaac aattccccca gaacatggaa ataaccattg 3121 tttacagagc tgagcattgg tgacagggtc tgacatggtc agtctactaa aaaacaaaaa 3181 aaaaaaacaa aaaaaaaaaa acaaaagaaa aaaataaaaa tacggtggca atattatgta 3241 accttttttc ctatgaagtt ttttgtaggt ccttgttgta actaatttag gatgagtttc 3301 tatgttgtat attaaagtta cattatgtgt aacagattga ttttctcagc acaaaataaa 3361 aagcatctgt attaatgtaa agatactgag aataaaacct tcaaggtttt // LOCUS HSMGLUR4 3884 bp RNA PRI 04-JUN-1996 DEFINITION H.sapiens mRNA for metabotropic glutamate receptor type 4. ACCESSION X80818 NID g1160182 KEYWORDS metabotropic glutamate receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3884) AUTHORS Makoff,A., Lelchuk,R., Oxer,M., Harrington,K. and Emson,P. TITLE Molecular characterization and localization of human metabotropic glutamate receptor type 4 JOURNAL Brain Res. Mol. Brain Res. 37 (1-2), 239-248 (1996) MEDLINE 96346635 REFERENCE 2 (bases 1 to 3884) AUTHORS Makoff,A.J. TITLE Direct Submission JOURNAL Submitted (02-AUG-1994) A.J. Makoff, Wellcome Foundation Ltd, Beckenham, Kent BR3 3BS, UK FEATURES Location/Qualifiers source 1..3884 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" CDS 171..2909 /codon_start=1 /product="metabotropic glutamate receptor type 4" /db_xref="PID:e114243" /db_xref="PID:g1160183" /translation="MPGKRGLGWWWARLPLCLLLSLYGPWMPSSLGKPKGHPHMNSIR IDGDITLGGLFPVHGRGSEGKPCGELKKEKGIHRLEAMLFALDRINNDPDLLPNITLG ARILDTCSRDTHALEQSLTFVQALIEKDGTEVRCGSGGPPIITKPERVVGVIGASGSS VSIMVANILRLFKIPQISYASTAPDLSDNSRYDFFSRVVPSDTYQAQAMVDIVRALKW NYVSTVASEGSYGESGVEAFIQKSREDGGVCIAQSVKIPREPKAGEFDKIIRRLLETS NARAVIIFANEDDIRRVLEAARRANQTGHFFWMGSDSWGSKIAPVLHLEEVAEGAVTI LPKRMSVRGFDRYFSSRTLDNNRRNIWFAEFWEDNFHCKLSRHALKKGSHVKKCTNRE RIGQDSAYEQEGKVQFVIDAVYAMGHALHAMHRDLCPGRVGLCPRMDPVDGTQLLKYI RNVNFSGIAGNPVTFNENGDAPGRYDIYQYQLRNDSAEYKVIGSWTDHLHLRIERMHW PGSGQQLPRSICSLPCQPGERKKTVKGMPCCWHCEPCTGYQYQVDRYTCKTCPYDMRP TENRTGCRPIPIIKLEWGSPWAVLPLFLAVVGIAATLFVVITFVRYNDTPIVKASGRE LSYVLLAGIFLCYATTFLMIAEPDLGTCSLRRIFLGLGMSISYAALLTKTNRIYRIFE QGKRSVSAPRFISPASQLAITFSLISLQLLGICVWFVVDPSHSVVDFQDQRTLDPRFA RGVLKCDISDLSLICLLGYSMLLMVTCTVYAIKTRGVPETFNEAKPIGFTMYTTCIVW LAFIPIFFGTSQSADKLYIQTTTLTVSVSLSASVSLGMLYMPKVYIILFHPEQNVPKR KRSLKAVVTAATMSNKFTQKGNFRPNGEAKSELCENLEAPALATKQTYVTYTNHAI" BASE COUNT 734 a 1265 c 1080 g 805 t ORIGIN 1 ccgagtgaca aggaggtggg agagggtagc agcatgggct acgcggttgg ctgccctcag 61 tccccctgct gctgaagctg ccctgcccat gcccacccag gccgtggggc caggggcctg 121 ccagggctag gagtgggcct gccgttcatg ggtctctagg gatttccgag atgcctggga 181 agagaggctt gggctggtgg tgggcccggc tgcccctttg cctgctcctc agcctttacg 241 gcccctggat gccttcctcc ctgggaaagc ccaaaggcca ccctcacatg aattccatcc 301 gcatagatgg ggacatcaca ctgggaggcc tgttcccggt gcatggccgg ggctcagagg 361 gcaagccctg tggagaactt aagaaggaaa agggcatcca ccggctggag gccatgctgt 421 tcgccctgga tcgcatcaac aacgacccgg acctgctgcc taacatcacg ctgggcgccc 481 gcattctgga cacctgctcc agggacaccc atgccctcga gcagtcgctg acctttgtgc 541 aggcgctcat cgagaaggat ggcacagagg tccgctgtgg cagtggcggc ccacccatca 601 tcaccaagcc tgaacgtgtg gtgggtgtca tcggtgcttc agggagctcg gtctccatca 661 tggtggccaa catccttcgc ctcttcaaga taccccagat cagctacgcc tccacagcgc 721 cagacctgag tgacaacagc cgctacgact tcttctcccg cgtggtgccc tcggacacgt 781 accaggccca ggccatggtg gacatcgtcc gtgccctcaa gtggaactat gtgtccacag 841 tggcctcgga gggcagctat ggtgagagcg gtgtggaggc cttcatccag aagtcccgtg 901 aggacggggg cgtgtgcatc gcccagtcgg tgaagatacc acgggagccc aaggcaggcg 961 agttcgacaa gatcatccgc cgcctcctgg agacttcgaa cgccagggca gtcatcatct 1021 ttgccaacga ggatgacatc aggcgtgtgc tggaggcagc acgaagggcc aaccagacag 1081 gccatttctt ctggatgggc tctgacagct ggggctccaa gattgcacct gtgctgcacc 1141 tggaggaggt ggctgagggt gctgtcacga tcctccccaa gaggatgtcc gtacgaggct 1201 tcgaccgcta cttctccagc cgcacgctgg acaacaaccg gcgcaacatc tggtttgccg 1261 agttctggga ggacaacttc cactgcaagc tgagccgcca cgccctcaag aagggcagcc 1321 acgtcaagaa gtgcaccaac cgtgagcgaa ttgggcagga ttcagcttat gagcaggagg 1381 ggaaggtgca gtttgtgatc gatgccgtgt acgccatggg ccacgcgctg cacgccatgc 1441 accgtgacct gtgtcccggc cgcgtggggc tctgcccgcg catggaccct gtagatggca 1501 cccagctgct taagtacatc cgaaacgtca acttctcagg catcgcaggg aaccctgtga 1561 ccttcaatga gaatggagat gcgcctgggc gctatgacat ctaccaatac cagctgcgca 1621 acgattctgc cgagtacaag gtcattggct cctggactga ccacctgcac cttagaatag 1681 agcggatgca ctggccgggg agcgggcagc agctgccccg ctccatctgc agcctgccct 1741 gccaaccggg tgagcggaag aagacagtga agggcatgcc ttgctgctgg cactgcgagc 1801 cttgcacagg gtaccagtac caggtggacc gctacacctg taagacgtgt ccctatgaca 1861 tgcggcccac agagaaccgc acgggctgcc ggcccatccc catcatcaag cttgagtggg 1921 gctcgccctg ggccgtgctg cccctcttcc tggccgtggt gggcatcgct gccacgttgt 1981 tcgtggtgat cacctttgtg cgctacaacg acacgcccat cgtcaaggcc tcgggccgtg 2041 aactgagcta cgtgctgctg gcaggcatct tcctgtgcta tgccaccacc ttcctcatga 2101 tcgctgagcc cgaccttggc acctgctcgc tgcgccgaat cttcctggga ctagggatga 2161 gcatcagcta tgcagccctg ctcaccaaga ccaaccgcat ctaccgcatc ttcgagcagg 2221 gcaagcgctc ggtcagtgcc ccacgcttca tcagccccgc ctcacagctg gccatcacct 2281 tcagcctcat ctcgctgcag ctgctgggca tctgtgtgtg gtttgtggtg gacccctccc 2341 actcggtggt ggacttccag gaccagcgga cactcgaccc ccgcttcgcc aggggtgtgc 2401 tcaagtgtga catctcggac ctgtcgctca tctgcctgct gggctacagc atgctgctca 2461 tggtcacgtg caccgtgtat gccatcaaga cacgcggcgt gcccgagacc ttcaatgagg 2521 ccaagcccat tggcttcacc atgtacacca cttgcatcgt ctggctggcc ttcatcccca 2581 tcttctttgg cacctcgcag tcggccgaca agctgtacat ccagacgacg acgctgacgg 2641 tctcggtgag tctgagcgcc tcggtgtccc tgggaatgct ctacatgccc aaagtctaca 2701 tcatcctctt ccacccggag cagaacgtgc ccaagcgcaa gcgcagcctc aaagccgtcg 2761 ttacggcggc caccatgtcc aacaagttca cgcagaaggg caacttccgg cccaacggag 2821 aggccaagtc tgagctctgc gagaaccttg aggccccagc gctggccacc aaacagactt 2881 acgtcactta caccaaccat gcaatctagc gagtccatgg agctgagcag caggaggagg 2941 agccgtgacc ctgtggaagg tgcgtcgggc cagggccaca cccaagggcc cagctgtctt 3001 gcctgcccgt gggcacccac ggacgtggct tggtgctgag gatagcagag cccccagcca 3061 tcactgctgg cagcctgggc aaaccgggtg agcaacagga ggacgagggg ccggggcggt 3121 gccaggctac cacaagaacc tgcgtcttgg accattgccc ctcccggccc caaaccacag 3181 gggctcaggt cgtgtgggcc ccagtgctag atctctccct cccttcgtct ctgtctgtgc 3241 tgttggcgac ccctctgtct gtctccagcc ctgtctttct gttctcttat ctctttgttt 3301 caccttttcc ctctctggcg tccccggctg cttgtactct tggccttttc tgtgtctcct 3361 ttctggctct tgcctccgcc tctctctctc atcctctttg tcctcagctc ctcctgcttt 3421 cttgggtccc accagtgtca cttttctgcc gttttctttc ctgttctcct ctgcttcatt 3481 ctcgtccagc cattgctccc ctctccctgc cacccttccc cagttcacca aaccttacat 3541 gttgcaaaag agaaaaaagg aaaaaaaatc aaaacacaaa aaagccaaaa cgaaaacaaa 3601 tctcgagtgt gttgccaagt gctgcgtcct cctggtggcc tctgtgtgtg tccctgtggc 3661 ccgcagcctg cccgcctgcc ccgcccatct gccgtgtgtc ttgcccgcct gccccgcccg 3721 tctgccgtct gtcttgcccg cctgcccgcc tgcccctcct gccgaccaca cggagttcag 3781 tgcctgggtg tttggtgatg gttattgacg acaatgtgta gcgcatgatt gtttttatac 3841 caagaacatt tctaataaaa ataaacacat ggttttgcaa aaaa // LOCUS HSMH3C2R 2609 bp RNA PRI 21-MAR-1995 DEFINITION Human mRNA for complement component C2. ACCESSION X04481 K01236 NID g34627 KEYWORDS complement protein C2; glycoprotein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2609) AUTHORS Bentley,D.R. TITLE Primary structure of human complement component C2. Homology to two unrelated protein families JOURNAL Biochem. J. 239 (2), 339-345 (1986) MEDLINE 87127920 REFERENCE 2 (bases 1797 to 2187) AUTHORS Bentley,D.R. and Porter,R.R. TITLE Isolation of cDNA clones for human complement component C2 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81 (4), 1212-1215 (1984) MEDLINE 84144868 FEATURES Location/Qualifiers source 1..2609 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" CDS 37..2295 /note="precursor polypeptide" /codon_start=1 /db_xref="PID:g34628" /db_xref="SWISS-PROT:P06681" /translation="MGPLMVLFCLLFLYPGLADSAPSCPQNVNISGGTFTLSHGWAPG SLLTYSCPQGLYPSPASRLCKSSGQWQTPGATRSLSKAVCKPVRCPAPVSFENGIYTP RLGSYPVGGNVSFECEDGFILRGSPVRQCRPNGMWDGETAVCDNGAGHCPNPGISLGA VRTGFRFGHGDKVRYRCSSNLVLTGSSERECQGNGVWSGTEPICRQPYSYDFPEDVAP ALGTSFSHMLGATNPTQKTKESLGRKIQIQRSGHLNLYLLLDCSQSVSENDFLIFKES ASLMVDRIFSFEINVSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANYKDHENG TGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTAVDHI REILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQVFE HMLDVSKLTDTICGVGNMSANASDQERTPWHVTIKPKSQETCRGALISDQWVLTAAHC FRDGNDHSLWRVNVGDPKSQWGKELLIEKAVISPGFDVFAKKNQGILEFYGDDIALLK LAQKVKMSTHARPICLPCTMEANLALRRPQGSTCRDHENELLNKQSVPAHFVALNGSK LNINLKMGVEWTSCAEVVSQEKTMFPNLTDVREVVTDQFLCSGTQEDESPCKGESGGA VFLERRFRFFQVGLVSWGLYNPCLGSADKNSRKRAPRSKVPPPRDFHINLFRMQPWLR QHLGDVLNFLPL" sig_peptide 37..96 /note="put. signal peptide (AA -20 to -1)" mat_peptide 97..2292 /note="mature polypeptide (AA 1-732)" misc_feature 121..129 /note="pot. N-glycosylation site" misc_feature 370..378 /note="pot. N-glycosylation site" misc_feature 765..766 /note="C1s cleavage site" misc_feature 904..912 /note="pot. N-glycosylation site" misc_feature 1033..1041 /note="pot. N-glycosylation site" misc_feature 1435..1443 /note="pot. N-glycosylation site" misc_feature 1447..1455 /note="pot. N-glycosylation site" conflict 1797 /note="g is a in [2]" /citation=[2] misc_feature 1897..1905 /note="pot. N-glycosylation site" conflict 1902 /note="g is a in [2]" /citation=[2] misc_feature 1987..1995 /note="pot. N-glycosylation site" conflict 2037 /note="g is a in [2]" /citation=[2] conflict 2187 /note="c is u in [2]" /citation=[2] misc_feature 2588..2593 /note="pot. polyA signal" polyA_site 2609 /note="polyA site" BASE COUNT 586 a 752 c 690 g 581 t ORIGIN 1 ggctctctac ctctcgccgc ccctagggag gacaccatgg gcccactgat ggttcttttt 61 tgcctgctgt tcctgtaccc aggtctggca gactcggctc cctcctgccc tcagaacgtg 121 aatatctcgg gtggcacctt caccctcagc catggctggg ctcctgggag ccttctcacc 181 tactcctgcc cccagggcct gtacccatcc ccagcatcac ggctgtgcaa gagcagcgga 241 cagtggcaga ccccaggagc cacccggtct ctgtctaagg cggtctgcaa acctgtgcgc 301 tgtccagccc ctgtctcctt tgagaatggc atttataccc cacggctggg gtcctatccc 361 gtgggtggca atgtgagctt cgagtgtgag gatggcttca tattgcgggg ctcgcctgtg 421 cgtcagtgtc gccccaacgg catgtgggat ggagaaacag ctgtgtgtga taatggggct 481 ggccactgcc ccaacccagg catttcactg ggcgcagtgc ggacaggctt ccgctttggt 541 catggggaca aggtccgcta tcgctgctcc tcgaatcttg tgctcacggg gtcttcggag 601 cgggagtgcc agggcaacgg ggtctggagt ggaacggagc ccatctgccg ccaaccctac 661 tcttatgact tccctgagga cgtggcccct gccctgggca cttccttctc ccacatgctt 721 ggggccacca atcccaccca gaagacaaag gaaagcctgg gccgtaaaat ccaaatccag 781 cgctctggtc atctgaacct ctacctgctc ctggactgtt cgcagagtgt gtcggaaaat 841 gactttctca tcttcaagga gagcgcctcc ctcatggtgg acaggatctt cagctttgag 901 atcaatgtga gcgttgccat tatcaccttt gcctcagagc ccaaagtcct catgtctgtc 961 ctgaacgaca actcccggga tatgactgag gtgatcagca gcctggaaaa tgccaactat 1021 aaagatcatg aaaatggaac tgggactaac acctatgcgg ccttaaacag tgtctatctc 1081 atgatgaaca accaaatgcg actcctcggc atggaaacga tggcctggca ggaaatccga 1141 catgccatca tccttctgac agatggaaag tccaatatgg gtggctctcc caagacagct 1201 gttgaccata tcagagagat cctgaacatc aaccagaaga ggaatgacta tctggacatc 1261 tatgccatcg gggtgggcaa gctggatgtg gactggagag aactgaatga gctagggtcc 1321 aagaaggatg gtgagaggca tgccttcatt ctgcaggaca caaaggctct gcaccaggtc 1381 tttgaacata tgctggatgt ctccaagctc acagacacca tctgcggggt ggggaacatg 1441 tcagcaaacg cctctgacca ggagaggaca ccctggcatg tcactattaa gcccaagagc 1501 caagagacct gccggggggc cctcatctcc gaccaatggg tcctgacagc agctcattgc 1561 ttccgcgatg gcaacgacca ctccctgtgg agggtcaatg tgggagaccc caaatcccag 1621 tggggcaaag aattgcttat tgagaaggcg gtgatctccc cagggtttga tgtctttgcc 1681 aaaaagaacc agggaatcct ggagttctat ggtgatgaca tagctctgct gaagctggcc 1741 cagaaagtaa agatgtccac ccatgccagg cccatctgcc ttccctgcac gatggaggcc 1801 aatctggctc tgcggagacc tcaaggcagc acctgtaggg accatgagaa tgaactgctg 1861 aacaaacaga gtgttcctgc tcattttgtc gccttgaatg ggagcaaact gaacattaac 1921 cttaagatgg gagtggagtg gacaagctgt gccgaggttg tctcccaaga aaaaaccatg 1981 ttccccaact tgacagatgt cagggaggtg gtgacagacc agttcctatg cagtgggacc 2041 caggaggatg agagtccctg caagggagaa tctgggggag cagttttcct tgagcggaga 2101 ttcaggtttt ttcaggtggg tctggtgagc tggggtcttt acaacccctg ccttggctct 2161 gctgacaaaa actcccgcaa aagggcccct cgtagcaagg tcccgccgcc acgagacttt 2221 cacatcaatc tcttccgcat gcagccctgg ctgaggcagc acctggggga tgtcctgaat 2281 tttttacccc tctagccatg gccactgagc cctctgctgc cctgccagaa tctgccgccc 2341 ctccatcttc tacctctgaa tggccaccct tagaccctgt gatccatcct ctctcctagc 2401 tgagtaaatc cgggtctcta ggatgccaga ggcagcgcac acaagctggg aaatcctcag 2461 ggctcctacc agcaggactg cctcgctgcc ccacctcccg ctccttggcc tgtccccaga 2521 ttccttccct ggttgacttg actcatgctt gtttcacttt cacatggaat ttcccagtta 2581 tgaaattaat aaaaatcaat ggtttccac // LOCUS HSMHC3A5 56827 bp DNA PRI 15-FEB-1997 DEFINITION Human HLA class III region containing notch4 (NOTCH4) gene, complete cds, complete sequence. ACCESSION U89335 NID g1841541 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 56827) AUTHORS Li,L., Huang,G., Banta,A., Deng,Y., Chen,L., Pham,Q., Rowen,L. and Hood,L. TITLE Cloning, characterization, and a complete 57-kilobase sequence of the human NOTCH4 gene JOURNAL Unpublished REFERENCE 2 (bases 1 to 56827) AUTHORS Rowen,L. TITLE Direct Submission JOURNAL Submitted (12-FEB-1997) Molecular Biotechnology, University of Washington, Box 357730, Seattle, WA 98195, USA COMMENT Cosmids A5 and W24A were obtained from Thomas Spies (Spies et al, Nature (1990) 348: 744-747). This contig overlaps the sequence of W5A by 12137 bp (see GenBank Accession Number U89336). Cosmid A5 spans base 1-29399; cosmid W24A spans 22566-56827. Where the two cosmids overlap, the sequence shown is that of cosmid A5. The variation feature was used to indicate these differences. Sequencing methodology: high redundancy shotgun. Interspersed repeats were identified with RepeatMasker (available from http://ftp.genome.washington.edu/RM/RepeatMasker.html). Microsatellites (n > 8 repeating units) were identified by sputnik (available from http://serac.mbt.washington.edu/chrisa/software/sputnik.html). FEATURES Location/Qualifiers source 1..56827 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6p21" source 1..29399 /organism="Homo sapiens" /chromosome="6" /map="6p21" /clone="cosmid A5" repeat_region complement(3..72) /note="LTR/MER4" /rpt_family="MER41_internal" repeat_region complement(142..337) /note="LTR/MER4" /rpt_family="MER41_internal" repeat_region 689..977 /note="SINE" /rpt_family="AluSx" repeat_region complement(1219..1514) /note="SINE" /rpt_family="AluSx" repeat_region complement(2248..2329) /note="SINE" /rpt_family="MIR" repeat_region 2919..2992 /note="LINE/L1" /rpt_family="L1ME2" repeat_region 2997..3244 /note="SINE" /rpt_family="AluSq/x" repeat_region complement(3247..3565) /note="LTR/MaLR" /rpt_family="MLT1A1" repeat_region complement(5148..5212) /note="SINE" /rpt_family="MIR" repeat_region 5680..5723 /note="SINE" /rpt_family="MIR" repeat_region 6162..6463 /note="SINE" /rpt_family="AluSx" repeat_region complement(6826..7412) /note="LINE/L2" /rpt_family="MIR2" repeat_region complement(7604..7902) /note="SINE" /rpt_family="AluSx" repeat_region complement(7922..8089) /note="LINE/L2" /rpt_family="MIR2" repeat_region 8170..8353 /note="LINE/L1" /rpt_family="L1MC4" repeat_region 8855..9033 /note="LINE/L1" /rpt_family="L1MC3" repeat_region 9035..9337 /note="SINE" /rpt_family="AluSq" repeat_region 9339..9419 /note="LINE/L1" /rpt_family="L1MC3" repeat_region complement(9420..9720) /note="SINE" /rpt_family="AluSx" repeat_region 9721..9883 /note="LINE/L1" /rpt_family="L1MC2" repeat_region complement(10233..10505) /note="SINE" /rpt_family="AluSx" repeat_region complement(10651..10723) /note="LINE/L2" /rpt_family="MIR2" repeat_region complement(10724..10772) /note="LINE/L1" /rpt_family="L1MB8" repeat_region 10774..11068 /note="SINE" /rpt_family="AluSq" repeat_region complement(11069..11410) /note="LINE/L1" /rpt_family="L1MB8" repeat_region complement(11411..11677) /note="SINE" /rpt_family="AluSx" repeat_region complement(11678..11810) /note="LINE/L1" /rpt_family="L1MB8" repeat_region complement(11811..12112) /note="SINE" /rpt_family="AluSp" repeat_region complement(12114..12212) /note="LINE/L1" /rpt_family="L1MB8" repeat_region complement(12222..12641) /note="other/LTR?" /rpt_family="MER102" repeat_region 13180..13473 /note="SINE" /rpt_family="AluSx" repeat_region 13482..13776 /note="SINE" /rpt_family="AluSx" repeat_region 13808..13890 /note="SINE" /rpt_family="AluJb" repeat_region 13817..13891 /note="microsatellite" /rpt_type=tandem /rpt_unit=taa repeat_region 13900..13963 /note="SINE" /rpt_family="AluYa5" repeat_region 13978..14139 /note="SINE" /rpt_family="AluJb" repeat_region 14233..14534 /note="SINE" /rpt_family="AluSq" repeat_region 14535..14839 /note="LINE/L1" /rpt_family="L1M4" repeat_region complement(14889..15109) /note="LINE/L1" /rpt_family="L1MB8" repeat_region complement(15152..15451) /note="SINE" /rpt_family="AluSx" repeat_region 15530..15750 /note="SINE" /rpt_family="AluJo" repeat_region complement(15796..15922) /note="LINE/L1" /rpt_family="L1MB8" repeat_region complement(15940..16316) /note="LINE/L1" /rpt_family="L1" repeat_region 16358..16525 /note="LTR/retroviral" /rpt_family="MER71" repeat_region 16686..16760 /note="LINE/L2" /rpt_family="MIR2" repeat_region complement(16854..17155) /note="SINE" /rpt_family="AluSx" repeat_region complement(17156..17435) /note="SINE" /rpt_family="AluSx" repeat_region complement(17547..17634) /note="LINE/L2" /rpt_family="MIR2" repeat_region 18044..18357 /note="LINE/L2" /rpt_family="MIR2" repeat_region 18407..18694 /note="SINE" /rpt_family="AluY" repeat_region 18696..18984 /note="LINE/L2" /rpt_family="MIR2" repeat_region complement(18994..19298) /note="SINE" /rpt_family="AluYb8" repeat_region 19306..19742 /note="LINE/L2" /rpt_family="MIR2" repeat_region complement(19819..20114) /note="SINE" /rpt_family="AluJb" repeat_region 21370..21663 /note="SINE" /rpt_family="AluSx" repeat_region 21717..21892 /note="SINE/Alu" /rpt_family="FRAM" repeat_region 22028..22095 /note="LINE/L1" /rpt_family="L1MC4" repeat_region 22101..22386 /note="SINE" /rpt_family="AluSq" repeat_region 22387..22864 /note="LINE/L1" /rpt_family="L1MC4" source 22566..56827 /organism="Homo sapiens" /chromosome="6" /map="6p21" /clone="cosmid W24A" gene 23910..53094 /gene="NOTCH4" exon 23910..24054 /gene="NOTCH4" /number=1 CDS join(23985..24054,24824..24905,25104..25399,26585..26932, 27032..27154,27269..27505,27626..27781,28126..28320, 29804..29917,30646..30759,30845..30967,32520..32679, 33677..33822,34092..34244,34680..34797,35021..35108, 35305..35458,36997..37181,43544..43796,44051..44163, 45334..45857,46433..46816,46927..47102,48793..49012, 49208..49280,49379..49517,50344..50639,50866..51013, 51517..51614,51788..52501) /gene="NOTCH4" /note="notch4 exons defined by comparison to cDNA sequences obtained from heart tissue and/or from comparison to mouse notch4 encoded by GenBank Accession Number U43691" /codon_start=1 /product="notch4" /db_xref="PID:g1841543" /translation="MQPPSLLLLLLLLLLCVSVVRPRGLLCGSFPEPCANGGTCLSLS LGQGTCQCAPGFLGETCQFPDPCQNAQLCQNGGSCQALLPAPLGLPSSPSPLTPSFLC TCLPGFTGERCQAKLEDPCPPSFCSKRGRCHIQASGRPQCSCMPGWTGEQCQLRDFCS ANPCVNGGVCLATYPQIQCHCPPGFEGHACERDVNECFQDPGPCPKGTSCHNTLGSFQ CLCPVGQEGPRCELRAGPCPPRGCSNGGTCQLMPEKDSTFHLCLCPPGFIGPGCEVNP DNCVSHQCQNGGTCQDGLDTYTCLCPETWTGWDCSEDVDECEAQGPPHCRNGGTCQNS AGSFHCVCVSGWGGTSCEENLDDCIAATCAPGSTCIDRVGSFSCLCPPGRTGLLCHLE DMCLSQPCHGDAQCSTNPLTGSTLCLCQPGYSGPTCHQDLDECLMAQQGPSPCEHGGS CLNTPGSFNCLCPPGYTGSRCEADHNECLSQPCHPGSTCLDLLATFHCLCPPGLEGQL CEVETNECASAPCLNHADCHDLLNGFQCICLPGFSGTRCEEDIDECRSSPCANGGQCQ DQPGAFHCKCLPGFEGPRCQTEVDECLSDPCPVGASCLDLPGAFFCLCPSGFTGQLCE VPLCAPNLCQPKQICKDQKDKANCLCPDGSPGCAPPEDNCTCHHGHCQRSSCVCDVGW TGPECEAELGGCISAPCAHGGTCYPQPSGYNCTCPTGYTGPTCSEEMTACHSGPCLNG GSCNPSPGGYYCTCPPSHTGPQCQTSTDYCVSAPCFNGGTCVNRPGTFSCLCAMGFQG PRCEGKLRPSCADSPCRNRATCQDSPQGPRCLCPTGYTGGSCQTLMDLCAQKPCPRNS HCLQTGPSFHCLCLQGWTGPLCNLPLSSCQKAALSQGIDVSSLCHNGGLCVDSGPSYF CHCPPGFQGSLCQDHVNPCESRPCQNGATCMAQPSGYLCQCAPGYDGQNCSKELDACQ SQPCHNHGTCTPKPGGFHCACPPGFVGLRCEGDVDECLDQPCHPTGTAACHSLANAFY CQCLPGHTGQWCEVEIDPCHSQPCFHGGTCEATAGSPLGFICHCPKGFEGPTCSHRAP SCGFHHCHHGGLCLPSPKPGFPPRCACLSGYGGPDCLTPPAPKGCGPPSPCLYNGSCS ETTGLGGPGFRCSCPHSSPGPRCQKPGAKGCEGRSGDGACDAGCSGPGGNWDGGDCSL GVPDPWKGCPSHSRCWLLFRDGQCHPQCDSEECLFDGYDCETPPACTPAYDQYCHDHF HNGHCEKGCNTAECGWDGGDCRPEDGDPEWGPSLALLVVLSPPALDQQLFALARVLSL TLRVGLWVRKDRDGRDMVYPYPGARAEEKLGGTRDPTYQERAAPQTQPLGKETDSLSA GFVVVMGVDLSRCGPDHPASRCPWDPGLLLRFLAAMAAVGALEPLLPGPLLAVHPHAG TAPPANQLPWPVLCSPVAGVILLALGALLVLQLIRRRRREHGALWLPPGFTRRPRTQS APHRRRPPLGEDSIGLKALKPKAEVDEDGVVMCSGPEEGEEAEETGPPSTCQLWSLSG GCGALPQAAMLTPPQESEMEAPDLDTRGPDGVTPLMSAVCCGEVQSGTFQGAWLGCPE PWEPLLDGGACPQAHTVGTGETPLHLAARFSRPTAARRLLEAGANPNQPDRAGRTPLH AAVAADAREVCQLLLRSRQTAVDARTEDGTTPLMLAARLAVEDLVEELIAAQADVGAR DKWGKTALHWAAAVNNARAARSLLQAGADKDAQDNREQTPLFLAAREGAVEVAQLLLG LGAARELRDQAGLAPADVAHQRNHWDLLTLLEGAGPPEARHKATPGREAGPFPRARTV SVSVPPHGGGALPRCRTLSAGAGPRGGGACLQARTWSVDLAARGGGAYSHCRSLSGVG AGGGPTPRGRRFSAGMRGPRPNPAIMRGRYGVAAGRGGRVSTDDWPCDWVALGACGSA SNIPIPPPCLTPSPERGSPQLDCGPPALQEMPINQGGEGKK" repeat_region 24000..24026 /note="microsatellite" /rpt_type=tandem /rpt_unit=ctg variation 24816 /gene="NOTCH4" /note="this variation present in cosmid W24A" /replace="c" exon 24824..24905 /gene="NOTCH4" /number=2 exon 25104..25399 /gene="NOTCH4" /number=3 variation 25297 /gene="NOTCH4" /note="this variation present in cosmid W24A" /replace="c" variation 25470 /gene="NOTCH4" /note="this variation present in cosmid W24A" /replace="g" variation 25694..25696 /gene="NOTCH4" /note="this variation present in cosmid W24A" /replace="" variation 25846 /gene="NOTCH4" /note="this variation present in cosmid W24A" /replace="c" repeat_region 25896..26115 /note="SINE" /rpt_family="MIR" repeat_region complement(26181..26258) /note="SINE" /rpt_family="MIR" exon 26585..26932 /gene="NOTCH4" /number=4 variation 26655 /gene="NOTCH4" /note="this variation present in cosmid W24A" /replace="a" exon 27032..27154 /gene="NOTCH4" /number=5 variation 27045 /gene="NOTCH4" /note="this variation present in cosmid W24A" /replace="a" variation 27047 /gene="NOTCH4" /note="this variation present in cosmid W24A" /replace="a" variation 27084 /gene="NOTCH4" /note="this variation present in cosmid W24A" /replace="g" exon 27269..27505 /gene="NOTCH4" /number=6 variation 27305 /gene="NOTCH4" /note="this variation present in cosmid W24A" /replace="a" variation 27390 /gene="NOTCH4" /note="this variation present in cosmid W24A" /replace="c" variation 27588 /gene="NOTCH4" /note="this variation present in cosmid W24A" /replace="g" exon 27626..27781 /gene="NOTCH4" /number=7 variation 27848..27849 /gene="NOTCH4" /note="this variation present in cosmid W24A" /replace="" variation 27968 /gene="NOTCH4" /note="this variation present in cosmid W24A" /replace="a" variation 28084 /gene="NOTCH4" /note="this variation present in cosmid W24A" /replace="c" exon 28126..28320 /gene="NOTCH4" /number=8 variation 28487 /gene="NOTCH4" /note="this variation present in cosmid W24A" /replace="a" variation 28666 /gene="NOTCH4" /note="this variation present in cosmid W24A" /replace="t" variation 28817 /gene="NOTCH4" /note="this variation present in cosmid W24A" /replace="c" repeat_region complement(28895..29078) /note="SINE" /rpt_family="MIR" variation 28963 /gene="NOTCH4" /note="this variation present in cosmid W24A" /replace="c" exon 29804..29917 /gene="NOTCH4" /number=9 repeat_region complement(30286..30483) /note="SINE" /rpt_family="MIR" exon 30646..30759 /gene="NOTCH4" /number=10 exon 30845..30967 /gene="NOTCH4" /number=11 repeat_region 31484..31619 /note="SINE" /rpt_family="AluSx" repeat_region 31620..31886 /note="SINE" /rpt_family="AluY" repeat_region 31887..32054 /note="SINE" /rpt_family="AluSx" exon 32520..32679 /gene="NOTCH4" /number=12 repeat_region 32995..33272 /note="SINE" /rpt_family="AluSx" repeat_region 33273..33393 /note="LINE/L2" /rpt_family="MIR2" repeat_region 33421..33464 /note="microsatellite" /rpt_type=tandem /rpt_unit=TA exon 33677..33822 /gene="NOTCH4" /number=13 exon 34092..34244 /gene="NOTCH4" /number=14 repeat_region 34341..34534 /note="SINE" /rpt_family="AluJb" exon 34680..34797 /gene="NOTCH4" /number=15 exon 35021..35108 /gene="NOTCH4" /number=16 exon 35305..35458 /gene="NOTCH4" /number=17 repeat_region 36288..36335 /note="microsatellite" /rpt_type=tandem /rpt_unit=ttat repeat_region complement(36320..36624) /note="SINE" /rpt_family="AluSx" repeat_region complement(36655..36954) /note="SINE" /rpt_family="AluSg" exon 36997..37181 /gene="NOTCH4" /number=18 repeat_region complement(37289..37761) /note="LINE/L2" /rpt_family="MIR2" repeat_region complement(37824..37895) /note="LINE/L1" /rpt_family="L1M1/2" repeat_region complement(37962..38022) /note="LINE/L1" /rpt_family="L1MA4A" repeat_region complement(38029..38330) /note="SINE" /rpt_family="AluSx" repeat_region complement(38333..39181) /note="LINE/L1" /rpt_family="L1MA3" repeat_region complement(39184..39482) /note="SINE" /rpt_family="AluSx" repeat_region complement(39484..39903) /note="LINE/L1" /rpt_family="L1MA3" repeat_region complement(39909..40184) /note="SINE" /rpt_family="AluSg" repeat_region complement(40273..40589) /note="SINE" /rpt_family="AluSq" repeat_region complement(40597..40675) /note="LINE/L1" /rpt_family="L1MA3" repeat_region 40703..41001 /note="SINE" /rpt_family="AluSg" repeat_region complement(41047..41871) /note="LINE/L1" /rpt_family="L1MA3" repeat_region complement(41884..42185) /note="SINE" /rpt_family="AluSx" repeat_region complement(42366..42664) /note="SINE" /rpt_family="AluSg" repeat_region 42899..43021 /note="SINE" /rpt_family="MIR" repeat_region complement(43166..43235) /note="LINE/L2" /rpt_family="MIR2" exon 43544..43796 /gene="NOTCH4" /number=19 exon 44051..44163 /gene="NOTCH4" /note="GC splice violates consensus, not a sequencing error" /number=20 repeat_region 44730..45030 /note="SINE" /rpt_family="AluSp" repeat_region complement(45088..45186) /note="LINE/L1" /rpt_family="L1MD2" exon 45334..45857 /gene="NOTCH4" /number=21 exon 46433..46816 /gene="NOTCH4" /number=22 exon 46927..47102 /gene="NOTCH4" /number=23 repeat_region 47425..47723 /note="SINE" /rpt_family="AluY" repeat_region complement(47738..47814) /note="SINE" /rpt_family="MIR" repeat_region 47818..48033 /note="DNA/MER2_type" /rpt_family="MER46" repeat_region complement(48035..48214) /note="SINE" /rpt_family="AluSx" repeat_region complement(48217..48515) /note="SINE" /rpt_family="AluY" repeat_region complement(48517..48653) /note="SINE" /rpt_family="AluSx" repeat_region complement(48662..48742) /note="SINE" /rpt_family="MIR" exon 48793..49012 /gene="NOTCH4" /number=24 exon 49208..49289 /gene="NOTCH4" /number=25 exon 49379..49517 /gene="NOTCH4" /number=26 repeat_region 49634..49814 /note="DNA/MER1_type" /rpt_family="MER5A" repeat_region 49818..50118 /note="SINE" /rpt_family="AluY" repeat_region complement(50166..50229) /note="SINE" /rpt_family="MIR" exon 50344..50639 /gene="NOTCH4" /number=27 exon 50866..51013 /gene="NOTCH4" /number=28 exon 51517..51614 /gene="NOTCH4" /number=29 exon 51788..53094 /gene="NOTCH4" /note="similar to ESTs with GenBank Accession Numbers R48606, R48709" /number=30 repeat_region 52887..52936 /note="LINE/L2" /rpt_family="MIR2" repeat_region 53314..53642 /note="DNA/MER1_type" /rpt_family="MER1B" repeat_region complement(53817..53978) /note="SINE" /rpt_family="MIR" repeat_region complement(54157..54281) /note="SINE" /rpt_family="AluJb" repeat_region complement(54419..54720) /note="SINE" /rpt_family="AluSx" repeat_region 54762..54814 /note="LINE/L2" /rpt_family="MIR2" exon 55425..55466 /note="Grail exon; similar to EST with GenBank Accession Number AA050074" /number=1 CDS join(55425..55466,56068..56229,56435..56572) /codon_start=1 /product="unknown" /db_xref="PID:g1841542" /translation="MEAERPQEEEDGEQTELLLDLVAEAQSRRLEEQRATFYTPQNPS SLAPAPLRPLEDREQLYSTILSHQCQRMEAQRSEPPLPPGGQELLELLLRVQGGGRME EQRSRPPTHTC" exon 56068..56229 /note="similar to ESTs with GenBank Accession Numbers W76064, R59617" /number=2 exon 56435..56827 /note="similar to ESTs with GenBank Accession Numbers W76064, R59617, W72507" /number=3 BASE COUNT 14876 a 14304 c 13616 g 14031 t ORIGIN 1 gatcctaggc ctgtgttcta tcctatggta tccttctcca tgacagaacg acacaaaaag 61 gcaaagacaa aggaagatgg caaaaaaggt gatttctgag aaggggccaa acaatacaaa 121 tatttatagt tcaaagtagc aaaagtacac aagattcact acaacctaag actagtcaca 181 caaatcttct tccctattaa tcaaaacttt gcagaggaga caaacagtga cttttattgt 241 tcactcaact catttgcaca gagagagaaa ggccagaggc tagctggtaa gaaattagct 301 ttttaccagc ttgtcaggtt tatggcttcc ctttctcagc tgcttccaga agagcagagt 361 ggcttttgat gaccctgctt gctgcaccgt agctgtgggg gccaagccac gttacaaaag 421 aaaaacattc cttttctttt catggaacca caggcaaaag gccccagttt tgcaagaggc 481 agcccaacag gttgtgtggg ggaactaaat taacatttcc cattccaaca ggagttatac 541 acacatgtca aaacacagac actagtcact ctgctcagtg cccaagtatc aacctggcaa 601 ggctcaaact tgcccccatt ggcccctatc tgagtctccc ccccaatata agtttcatgg 661 tggtagggca tattaagaaa gcctggcagg ccaggcgcgg tggctcctgc ctgtaatcct 721 agcactttgg gaggccaagg caggcagatc acctgaggtc aggagttcga gaccagcctg 781 gccaacatgg tgaaaccctg tctctactaa aaatacaaaa ttagccaggt gtggtggtgt 841 gcacctgtaa tcccagctac tcaggaggct gaggtaggaa aatcgtttga acctgggagg 901 cagaggtttc agtgagccga gatcacgcca ctgcactcca gcctgagtga cagagtgaga 961 ctccatctca aaaaaaagta ccagaactag aatgaagctg ttttctctgc tatgtcttgc 1021 agtattcctt ccagtgcaat ttactgtcaa agtctaacgt tgtgcccaat ggcaaagaag 1081 aaatgtttat agggtccagc tccagaacca caaggcagga caaagatgga gtggatttgg 1141 aacaaagatg caataaaata atattgagca ccagtcacac ctttggctgc acagttttca 1201 tattacacat atttgaactt tttttttttt ttgagacaga gtcttgccct gtcactcagg 1261 ccagagttca gtggcacaat ctcagctcac tgcaacctct gcctcctggg ttcaagcgat 1321 tctcctgcct cagcctccca agtagctggg attacaggca cccaccatca cacccagcta 1381 atttttgtat ttttagtaga aatggggttt tgctatgttg gccaggctgg tctggaactc 1441 ccgacctcag gtgatctgcc tgccttggcc tcccaaggtg ctgggattac aggcatgagc 1501 caccgtgccc ggcctgaact tttatacaac aatgaaataa ttctctattt tcaccaatca 1561 agacaaagcc accttataag tgaagaagtt ctcactctgt ccccaaaata aagagagccc 1621 tagtcattat atagctcaat acacagctgt attatttact cttcaaattc agtcacagta 1681 tccttgacca tatatactct taccaaaagg actatattac gacataactt ccaacaattt 1741 gtatataaaa tcataagtgt caggagaaga aacagttgcc gagagcagca tgtgtactgc 1801 ccaggtaggt gacccagtca gtggtgggga cgaatttact gaccctctca atcccgtgta 1861 ggtcaggccc atccatggtt aaggctgaaa cacacaagtc ctagtgggag gaatctgaaa 1921 accaagttag gtgaaaaggc ctaggggtct ccaaccttgg cctccgtccc tgatagaatc 1981 agaggagaga ggcctggggt gctggaggca gaagaggccc tgccctgtcc aaccctgaga 2041 ggcagattct cagtccccag gatgtcggac acttttttct ctcttgtcac tgagtctcct 2101 ctccacaggc agaactgtgg gctgaaccct gaggccagcg cctctgctct ctaatgacca 2161 ccaccaaata ctcaggccca ctgttcacta ggtggcatct tactctgagc attacatatt 2221 tatttccgtg aatgttttag cttaatacca tgagatggtc cttactgtcc tcactttaca 2281 gctgagagac agagcctcga tgatctgagg aaacctgccc agggtcacag gtcgtgtggc 2341 caaaccctaa caggggccag gtctacatca aagctcagtt gagctctaaa cagggggcca 2401 ccagattagg ggcaccagcc ctaggggtcc tccagggatc tggagagaca ggaaagaagg 2461 cagagaacat attagaacta ttataaaatc ttcaatctta tgccctttta atgtgatttg 2521 cgtttgttct ttcaacctgc attatgtttg cacatgtttg ttagctgcat gttcaaaata 2581 tattactgaa tggagtgtgt gatcaaaaga gtttggagat ccttgctcca aatctctgct 2641 ccagaaatct tctctaaagc agcgggtctc agaaaaggaa gcagaaccaa gaagcagcag 2701 cggcctggga ggcgcccgga cagaaggtgc tccgtgggcg ggggggagta actcgtgggc 2761 ccgggaaaag gcccccaacc tggtctcacc agattttcct cagcctctgc tgcccccttg 2821 tggccaccac gtggcagaca gaaagagcag ttcccagcag gaagacaccg agggaggggt 2881 acggaggatg cagaaaacaa tcttagaacc gtggaaggga gcgaaggaaa ccagacacta 2941 aagacctcaa gctgtagcaa tccaaacatg cgaagttgaa gaatagcaaa acgaggcggg 3001 cgcggtggct catgcctgta atcctagcac tttgggaggc tgaagcgggc ggatcacttg 3061 aggtcaagag tttgagacca gcctggccaa catggtgaaa ccccgtccct actaaaaata 3121 caaaaattag ccggaaatcg cttgaacccg gaaggcggag gttgcagtga gccaagatca 3181 tgccactaca ctccagcctg ggcaacagag taagactctg tcttaaaaaa aaaaaaaaga 3241 aaaaagaaaa gaaatgtctt ttctcacagt tctagaggcc agaaaacctg agatcaaggc 3301 actactgcct ggtgagggct gttctctgct tccaatgtgg catctgttgc ttcatcctct 3361 ggaggggagg aacactgtgt cctcacgtgg cagggagtgg aagggcaaaa gggatgaact 3421 ctttccatca agtcctttta taatgacatt aatccattcg tgccggctct gccaaaaggc 3481 cctcccaaaa ggccccacct tccaacagtt gcatgggaat taaatttcca acacatgaat 3541 tttgggagac acattcacag cgtagtccag cctaagcaat acgctgtgcc aggattgcgg 3601 ggagggaggg aatagctgca gaagcaggga ataaagaatt tggactgaga catgtttcag 3661 ttgacatcct ggtggggagt ggaggaggtg gctgatgtga agaatgtgaa gaaaacagtg 3721 attccactga aaacacatcc gacagtcctc tatccttacc aggtgctccg tatagtttgg 3781 cgagagaggc agaaaggtct gagttccaag gagaggtcag ggctggagac attccattgt 3841 gtggccacag cttcttccca gagcttgcag gagacatgtt cacaccacct cccaccaccc 3901 ccacgatgct atttgttctg tatgtctctc ccacctagac tgggagcctt ggagggcagg 3961 gcattggggg ccagagagcc ttccaggcag actctcctaa tgcccgaagc aaggctaggc 4021 tcagaagccc cgtgaggaag gcaggtgagg aacaggcagc ccttgaccca tgactgtgcc 4081 aggttgggga cctgggaccc agggaagcct tctcccgcct caccatcagc acaggccagg 4141 attggagagt ttacctcctt tggtgtttac caaacacaca gggaagacca gttgtgagta 4201 gaccaaggca gcccagctgg gtgatcagca agtctcacca cctcaggcta ccctcttcac 4261 acaccgcagc ctgtcccctg cctacactgc gcatcttgag tgctctgacc ccttcactac 4321 cccttcctcc acccccctgg acagcagccc tgagacaggt cctgaacccc tagccaagta 4381 cccactgccc acccccagtg ccccacccac ccttactagc tctcacccac cctcccatcc 4441 taaacctcag ccctcccacc cccccatgcc cacctgagcc cccgctttca gctgcctaag 4501 aacttccccc tgatgtgaag aaaacagcga ttcccctgaa aacacatctg acagtcctct 4561 atccttagcc agggctccat atagtttggc gagagaggca gagaaacaac aacgaaaaaa 4621 aaatatacac gcacacacac acacacagac atccacagat tcataaacat tttggatccc 4681 tggagcttgg atacattttc taaagggcaa gcagttgttt tgcttattta attttattga 4741 caatcatctt gtcattcact cttgtaatta taaatttttg gatttacctt tttaaatata 4801 ttttaataga ttctttcccc atactagacc ccagcacaca actaatttcc ttgtcatgaa 4861 aaaataaaaa taaaaaactc atttgtgata tctttaactt ttgcctgtag tttctaaatt 4921 taaaatggaa gattgtcaat ccataattat atgtgtccag taaaaatttt aaagactgtt 4981 tccagggtta attgtcaacc tccccaggaa agggataacc agtaatgccg ccccagccca 5041 gagtctccta ggacggcagc tgacactggc caagtccgcc aggtgcctcc tgtccatctc 5101 cttccctctt catccagcac ccacccactc acccacgcca gccccaacct ctttacagat 5161 gaaggaactg agtgaggctc agagaggtta gctagtctga ccaacatcac actgtgtcaa 5221 actcctaagc taagtgtttt tttcactatt atatactctt ctgctctacc caccaaaaaa 5281 tttcttatca tgcctttgtt gcgaaagaag gaaagaaaga aagaaggaag gaaggaagga 5341 aggaaggaag gaaagaagga aggaaggaag gaaggaaaga aggaaggaag gaaagaaaga 5401 aggaaggaag gaaggaagga aagaaggaat gaaggaagga aggaaggaaa gaaagaagga 5461 aggaaggaag gaaggaagga aggaaagaaa gaaagaaaga aagaaagaaa atcttttgtc 5521 cccaaagtta gaaaaacaag tgaaaaaggc ccaaggctat cagcgagggc tccagaacca 5581 aggcagtgcg gccctgccct ttcctcccac ctcacccctg ctctgctttc ctcagccact 5641 cctgaccagc aagcaggaca ctgggcatgg gtcccaagcc tgtgtcacct tgggcaggtc 5701 gcgtctcccc tctgggcctc agtaaaagaa gaaatgggac caagtgaaca gttccaggct 5761 gtgttcccca gcatcctaca tcccatggca accttgaagg gtcactctgg gtaggggtgg 5821 ggtctcagga ggggaaagac tcagccagag cgctgatcct atatctcttc tctattttgg 5881 gattctagat atgagtttga tggacaaaat tcctctacat ttgacatgat gaaaagaaag 5941 tttgaaagcc agaggcaaga ccatcctgaa gattcctttc aactccaagg tcctttcact 6001 tccacaggtg atctgacatc accacctctc tcaccacgct gacagcactt tcattttgac 6061 tcttgtgatg aggtcacctg gtatctacac aggatggggt ttttttgtgg aaaatgaagg 6121 atttccatat ctgcgaattt attacacaag agttttaata tggctgagca cagtggctca 6181 tgcctgtaat cccagcactt tgggaggcca aggcgggcag atcacttgag gtcaggagtt 6241 caagactagc ctggccaaca taacaaaacc ccatctctac taaaaataca aaaattagtc 6301 aggtgtggtg gtgtgcacct atagtcccag ctacttggga ggctagggca ggagaatcgc 6361 ttgaacctgg gaggcagagg ttgcagtgag ccaagatcgc accactgcac tccagcctgg 6421 gcaacacagc aagaccctat ctcaaaaaat ggaggaaaaa aaagagtttt aatcccagat 6481 gttttattca ttaaaattca ctgctaaaca aacaaccaaa aaaactgtca gttaactaga 6541 aggattgacg ggggtgtctg attaactgga agtgtcctga agatgtcctg actttataag 6601 gacaattttg taatgaatac atatacattc tgtgagggca gggagggttt tgctaagcac 6661 agggtagata gtcaagcttg gttaacttac tgcttgaata aatgaatgaa tacatgaact 6721 tttctctggt ggtcaagagt aatgcagggc cttagctgct ttgcagtaca tgtggcctcc 6781 aacgtcacct tcagctgctc ctgttgccag catttaacag ccttccagtc actcctcact 6841 cttcacaatt cactaaaatt ttcagtccct gggtcactgt ctttctcttc atcccactcc 6901 tgctaccatt ctttgtggct tctctatcaa ccaagacact ccacccaaca ccctggcctc 6961 tcaccttgac cttttgacct tctcacctcc agtgatcttg tcctccccct accacagcca 7021 ctcactccac agtcatatct agaccttgtc attacctgta cctgtacctg aaacctctcc 7081 ataatctcag ttttaagtat cccccttcct gtccttgcag atcactccct ctagtgtcca 7141 cctccaaaga tcaaacaata ttttcacttc tccagacctc caaccccttt gaccctgaca 7201 ttatgagatc ctgtgccctc ccaaccttat cccgcttagg tttcatgaag cattgttgta 7261 agagctatct tgcatctcaa ctcctatttc cctcattccc tccatcatgc ctgcctgtca 7321 aaagcctagc tctggctacc tccaacttcc caccgactcc atgcctgcat ccaagcagct 7381 gaagtggttg gataaaaata ctcaaccatg ctcatcaaac atttacaaac ttcaagttgg 7441 ccattagaaa cacccagcag ataacctgtg atggaattag agtaaagaaa agaaaaagaa 7501 gtatacggca ttttcctggc cactcactca cagaatttca aggcaactat ttcacacttt 7561 cttgtatttc attacacagc ctccatcctc ctcactttct tttttttttt tttttttttt 7621 tttgagacag agtctcactg tatggcccag gctggagtgc aatggcgcaa tctcggctca 7681 ccgcaacctc tgcctcctgg gttcaagtga ttctcctgct tcggcctccc aattagctag 7741 gattacaggc atgcaccacc acacccggct aattttgtat ttttagtaga gacgagattt 7801 caccatgttg cccaggctgg tctcaaactc ctgactcagg tgatccaccc tcctcagcct 7861 cccaaaatgt tgggattact ggcatgagcc aatgtgctca gctcatcttc ctcactttca 7921 actaacaacc aactcctcca cttgggcact agaccccact gcctctcacc tactcagggc 7981 cattcctcca gcaatacccc atctcctgca caccgccttt ccctctgcag tggctcactc 8041 ccatcagcaa gcaaatatgt tattcctcct atcttagaaa ataaaaaatg aataaataca 8101 aaccttcctc agcacccaaa ctgaggtctc ctagtaccat ttcctctaaa aagaaacata 8161 tagcagctta tgtacaagat gagcctgaac atcttatcac cagaaaacaa gaaaggcatc 8221 aaagatatta gtattttatc aacttgaggc tcgcactgcc caaatatggc ataaatggag 8281 cctcagtaaa aatagtgata gcaacggatt aaaacacatt caaatcaact gactcatagt 8341 gatactaaat aaagggtcat tggttacctt tggcagatgc tagagaacaa actcattatt 8401 ctgaaagctg ctaaataaag gggaaagaat ggcatcaatc tgcctttcct aaataagtca 8461 tgtcaaaata gtagatggga cataatctgt ataaatgaaa tcagtttaga cagaataata 8521 gacttagaat atctgcattt tataatccct aagaaaataa tggatctagc aatgctcatc 8581 atggctacca actagaattc tatgcctcct gatagaaaca cagcacaata ccacctatga 8641 cgtagccttg ccagaaaata gatcatgaat catataaagt ctttaaatct aactaccagt 8701 ttaagaagaa aatgggggag gcagaggaat atggtaaatg gtacagtgat tcaattagca 8761 aaattcagaa tgtggaaagt tctataggaa acaagccatt tcttcaacaa ataaatgtca 8821 agggggtaaa aaaagattta aagagactta agctccaagt ctaaccatga gaaaatcacc 8881 aaacaaattc caaaagaggg gcagcctgca taacacctga ctagtacacc tcaaaactat 8941 caaggtcacc aaaaacaagg aacactggca aaactgtcac aaccaagagg gcccaaagag 9001 acatgacaaa tacatgaaat atggtaacct ggaaggccag gcgtggtggc tcacgcttgc 9061 aatccagcac tttgggaggc cgaggcgggc agatcacttg aggccaggag ttcgagacca 9121 gcctgtccaa atggggaaac cccgtctcta ctcaaaatac aaaaaaatta gccgggcatg 9181 gtggtgggca cctgtagtcc caggtactcg ggaggctgaa gcagaagaat tgcatgaaac 9241 taggaggcgg aggttgcagt gagccaagat cacaccattg caatccagcc tgggcaacaa 9301 gagcaaaatt tcgtctcaaa aaaaaaagga aaagaaatat ggaatcctgg aacagacaaa 9361 aagacaccag gtgaaaacta agacaatctg aatgaatgaa gtatggacgt taataataat 9421 ttatcttttt ttttttttga gacagagtct tgctctgttg cccaggctgg agtgcagtgg 9481 catgatctca gcttactgca gcctccactt ccaaggttca agtgattctc ctgcctcagc 9541 ctcccaagta gctggaatta caggtgtgct ccaccacgcc tggctaattt ttgtattttt 9601 agtagagaca gggtttcacc atgttggcca ggctggtctc aaactcctgg cctcaggtga 9661 tccgcccacc tcagcctccc gaagtgctgg gattacaggc gtgagccacc atgcccgacc 9721 aatttatcaa tattgattca ttaattataa cacatatacc cacactcacg taagatgtta 9781 ttaataaggg acactgaatc cagggagggc acatgggaat actctgtact atcctctcag 9841 tttctctata aatctaaaac tgttctaaaa tgtgaattcc acttcaaaga agagaaagag 9901 agacttaaga cacatatcaa ctgaatacaa tgcacggata ttgttttgat attgattcaa 9961 actgtatata tatttaatgg agaatttggg aaaactaaac attgcatatt tgataatatt 10021 aagaaattat gtaaactttt cagaaataat actagcaaaa tggttatatt atttaagagt 10081 tcttatcttt taggatactg aaatatttgt gatagaaatg atacaatatc atatcatatc 10141 atgtcatatc atatcatatc atatctggga tttgttctaa aataatctgg tatggaggtt 10201 gggagtagag atggaaccag agtggtcctg aatttttttt ttctttagac agtgtctcgc 10261 tctgttgccc aggccggagt gccattttga ctcactgcaa cctccgcctt ttgagttcaa 10321 gtgattctcc tgcctcagcc accctagtag ctggaattac aggcatgcac caccatgtcc 10381 ggctaatttt tgtatttttt attagagacg gggtttcacc acgttggcca ggctggtctt 10441 gaactcctga cctcaggtga tccacctgcc tcagcctccc aaagtgctag gattacaggc 10501 atcagtggtc ctgaattaat gattattgca tctgggtcac gggtatatgg gagttctttt 10561 tcattatctc tatttatgtg tatgaatttt ctggaataat gagtttttta aaatttctca 10621 tgacctcata tgctgtctcg ctagacatac tttctctgca gctcttgaca gcaaaattcc 10681 ttaagaaatt ctacaacaca aaatgtctct acctcctctc ctctcacaga tttattgagg 10741 cataatttac ataccataaa attcacccac tttggtcggg cgcagtggct cacagctgta 10801 atcccagcac tttgggaggc cgaggtggat ggatcacttg aggtcaggag ttcgagacca 10861 gcctggccaa cacggtgaaa ccccgtctct actaaaaata caaaaattag ccaggtgtag 10921 tggtggcccc ctgtaatctc agctactcgg gaggctgagg caagagaatt gcttgaacct 10981 gggaggtgga gattgcagtg agccaagatg gcaccactgc actgtagcct gggcaacaga 11041 tcgagactcc atctcaaaga agaaaaaatc atccatttta agtgtacaat tcaatgattt 11101 tagtatattt atagagttgt aaaactatca ccacaatcta attttgaaac atttccatca 11161 caccaaaaag aaatttcata ttcctttgca cttagccccc attccaaacc tgagccctag 11221 acaatcacta atctttctgt ctctatagat ttgcctattt tagacatttc gtgtaagtgg 11281 attcctgcag ccttttgagt ctagcttctt tcacttagca taatgttttt gaggttcatt 11341 cattttgcag catgtatcca tatttcattc atttttattg ctgaatagta ttccattgta 11401 tggacacacc tttttttttt tttttttttg agatagagtc ttgttctgtc acccaggctg 11461 gagtgcagtg gtgtgatctc agctcactgc aacctctgcc tcccaggttc aagcaattct 11521 tctgcctcag cctcccaagt aactgggatt acaggggtgc accaccatgc ccagctaaat 11581 tttttgtatt tttagtagag acggggtttc accatgttgg ccagcctggt ctcaaactcc 11641 tgacctcaag tgatgcacct gcctcagcct cccaaaggga cacaccatat tttgttcacc 11701 aattcactga tcaatggata tttggttgtt tttacttcat acctattgtg aataacactg 11761 ctatgaattc ttatacaagt atttgtgtgg acaatacgct ttcatttttc tttctttttt 11821 tttttttttg agacagagtt ttgctcttgt tgcccatgct ggagtgcaac agcgcaatct 11881 cagctcaccg caacctccgc ctcctgggtt caagtgattc tcctgcctca gcctcccaag 11941 tagctgggat tacaggcatg caccaccatg cccagctaat ttttgtattt ttagtagaga 12001 tggggtttct ccatgttggt caggctggtc tcaaactcct gacctcgggt gatccgccca 12061 cctcggcctc ccaaattcct gggattacag gcgtgagcca ccacacctgg cctgctttca 12121 tttttcttag gtagatacct aggaacccaa ttgctgagtc acatggaaaa tctgtgttga 12181 atgttttaag gacttaacca gctgccttca agctccaaga caagatgacg tagatgctct 12241 tctgattcct cctgctaaat acagctacaa tcctggatgg tatatataaa acaaacataa 12301 gaagatcctg aaagatgcag agaaggcaga ctggctagga atctcaagac ccgaaaaaac 12361 aatatagtgg tgagttccct gggtttggct tttgcctcat atatacaaga ctgggtgctg 12421 gaaaagccag caaccaggaa actccaacag gaagatgaaa aaaatcccgg aaagtctctg 12481 gccaaaggac caacaaagga acatcctgga aagacaaaac ttttagacaa tgactactct 12541 atttcaggca aaaaaaaaaa aaaaaaaaaa aaacactctc acccccatat ctgccaataa 12601 aggatgagtg gggagtttag ccatcaactc ccacccagct gaggcacctc tccccagcaa 12661 gtaggaagct gggactctca gccccgcctg gtggtatgaa atccccctcc atcacaacca 12721 gtgtcactgg agaccccgtg gggaacagga atgaggtgct tcttactctc tcagccaggg 12781 tgtgtcagca gagacctagt gggaagcctg aacccccatc cacacctagc aataacaagg 12841 agcacggctc cctcaagtgt tcacagaggc caagtgagga acctgggcct ctcccccaac 12901 ctgacagcag tgaggcagca ccctctttgc tcaactagtg cggtgtcaaa ggatgatcac 12961 taaaacagac ttaaataaga tccagagtct cataacatac aatccaaaat gtgcaggatg 13021 caagcgaaaa tcacttgtca taccaagaat cgggaaaatc ttgaataaga aaagacaatt 13081 aatagacatc aacacccaag ttgacttaaa tgttggaatc acctgatgca tatttaaagc 13141 aagcaagtat tttaaaacaa gtattaaaaa tgctttacag gccaggcaca ttggctcact 13201 cctgtaatcc cagcactttg ggaggccaag gcgggtggat cactcaaggt caggagttcg 13261 agaccagcct gaccaacatg gcgaaacccc gtctccacta aacatacaaa tattagctgg 13321 gcatggtggt gcatgcctgt aatcccagct actagggatg ctgaggcagg agaatcacct 13381 gaacccagag gcagaggttg cagtgagctg agatcatgcc attgcactcc aacttgggtg 13441 acagagcaac actccatctc aaaaaaaaaa aaagtttcac aggccaggca cattggctca 13501 catctgtaat cccagcattc tgggaggccg aggtaggcag atcacttgag gtcaggagtt 13561 caaaaccagc ctggccaaca tggcaaaacc ccatctctac taaaaataca aaaattagcc 13621 agacatggtg gcagatgcct ataatcccag ctattcaaga gactgaggca ggagaatcac 13681 ttgaacctgg gaggtggagg ttgcagtgag ccaagattgc actgctgcac tccagcctgg 13741 gcaaccgagt gagactctgt ctcaaaaaaa aaaaaagctt tacagccaag tgcagtggct 13801 catgcctggt gggaggatca cttgagccca ggagttcatg accagcaaca tagggaaatg 13861 ctgtctctac aaaaaacaac aacaaaatag taataataat aataataata ataataatta 13921 cccaggaatg gtagtgtgta cttgtggtcc cagccccttg gaaaaaaaag tgtttttaat 13981 tagccagaca tggtggtgtg tacttgtggt accagctact tgaaggctga ggtgggagga 14041 tcacttgagc ctaggaggtc aaggctgcag tgagccatga ttgcacttca gcctgggcaa 14101 cagagcaaga ccttgtctca aaaagaaaag gaaaaaaaaa aaaacaacat ttaacaagca 14161 attacaaact ctcttgaaac aaatgaaaac cagaaatttt ggcaaagaaa tggaagatat 14221 aaagaagaat caggctgggt gaggtggctc acacctgtaa tcccagcact ttgagaggcg 14281 gaggtgggcg gatcatgagg tcaagagatc aagaccatcc tggccaacat ggtgaaaccc 14341 cgtctctact aaaaatacaa aaagttagcc aggcatggtg gcaggtgcct ggaattccag 14401 ctacttggga ggctgaggca ggagaatcac ttgaacctgg gaggcagagg ttgcagtgag 14461 ccaagatcgt accattgcac ttcagcttgg gcaaaaacag cgaaactcca tctccaaaaa 14521 aaaaaaaaag aagaagaaga atcaaataga aattttagaa cttagaaata taatcatgaa 14581 aattgaagac tcaatagatt ggcttaacag cagattggag aggacagagg aaagattcag 14641 cgaacttaaa ggtagaagag aaattactca atctggacaa cggagaaaaa tagactgaaa 14701 aaaatgaaca gaggttcatt tatgggactg taacaaaaga gctaactaac gttctgtcat 14761 cagcatttaa gaagaagaga gagagaatgg agcaggaaaa tgtactcaaa gaaacaatgc 14821 ctgaaaaact cccaaatttg gcaaaagata ttaactgatt gatgcaagaa gctaagtgaa 14881 ccccaaacag tatgttttca tgtatttgat ggccatttat atatcttctt tggtgaaatg 14941 tctattcaaa tctcttgccc attttttatt gttattattg agttataaaa gttcttttta 15001 tattctagat acaagtccct tatctgataa atgatttgca aatacttgat ctcattctat 15061 tgcttttgtt tatcttttca ctttcttatg gttgctgtta tttttatgtt tgtttttttc 15121 cttaatgaag gcaatttcca ctttcttttt tttttttttt tttttttttt tgagacagag 15181 tcttgctcta tcacccaggc gggagtgcag tggcatgatt tcggctcacc acaacttcca 15241 actcccaggt ttaagaaatt atcctgcctc agcctcccaa gtagctggga ctacatgcat 15301 gcaccaccat gcctggctaa tttttgcatt ttttagagac agggtttcac catgttggcc 15361 aggctggtct caaactcttg gccccaagtg atccacctgc ctcggcctct caaagtgcta 15421 ggattacagg tgtaagccac cacacccagc cttcactttc ttgatggtgc cttttggagt 15481 acaaaagttt tcaattttga tgaaatccaa ttatcaatat tttatctgaa gcatgggcaa 15541 tatagtgaga ccctgtctcc acaccaaaaa aaaaaaaaaa aaatttaaca attaaccaag 15601 cgtggttgta tgcacctgta gtcccagcta tccaggaagc tgaggcagga ggatcacttg 15661 agcccgggag ttgaggctgc agtgaactat gattgcacca tggcactcca gactgggtga 15721 cataccagga ccctgtctcc aaaaacaaaa tactatttta ttgcttgtgt agatgtaaga 15781 aagtatcata tgtaagaaat cattgcctag cccaacatca tgaatatcta cttttaggcc 15841 tttttctaat agttttatag ttttgcactt acatttaggt ctatgatcaa ttttaagttg 15901 gttttttgtg tatggtatga ggtaatatag gaattacaag ctataaatcc cctctaagca 15961 tagctttagc tacatgtcaa aaattaattc tgatatgttc tgtttcaatt ttcattcagc 16021 ttgaaatatt ttctggtttt ccttgtgatt tcttcttcca cccattggtt gtttagaagc 16081 atgctgttta attttcacac acttgtaatt ttctcaaatt tcctgttgtt gttgatttct 16141 aatttaattc tattgtgatc agagaacatc tttgtattat ttcagtcctt ttaaaattat 16201 tgagacgtgt tttatggcct agcatatggc ctatcctgga gaacgttcca tgtacacttg 16261 agaaaaatgg tatatcctgc tattgtttag cagagtgttt tatagatgtc cgttagactt 16321 agttagttga cagtgttacc caagtcttct gctcaaggac agccctcacc tctcagccct 16381 ctttgtggac tgaagataat tgcttcccca aggtcacact ctttctagga gcaacccacc 16441 ttccatgact gattaatgtg gggggcaaaa acccagctcc cttgccccag ttggggatgg 16501 ctctgaaggg ccatcccagc tccgggttaa atctcaaagt ccacttcctg gagacccaac 16561 ctgtgtaccc cttattcagt caaactatac ccactctagt tttaaaatat attccaaatc 16621 agactcacca tgccccaatc atctctcact ggtcaccaca tcgagaatgg cctgggccaa 16681 cagttggggg caagttggaa gcaagggtgc caggtaggat gctatcacaa tattcctaat 16741 tagagatggt aggggtgtgg tgatgagcag tttaatttgg tacttatttt tgaaactaca 16801 gttgccatct cttattgacc tctcagtctt tttttttttt tttttttttt tttttttttt 16861 tttttttttt tttgagacag agtctcactc tgtcacccag gttggagtgc agtggcacaa 16921 tctcagctca ctgcaacctc tacctcccgg gttcaagcga ttctcctgtc tcaatctccc 16981 gagtagctgg gactacaggc gtgcaccact atgcccggct aatttttgta tttttagtag 17041 agatggggtt tcaccatatt ggtcaggctg gtctcaaact cctgacctca ggtgatccac 17101 ctgccttggt ctcccaaagt gctgggatta caggtgtgag ccaccacgcc cggcctgagg 17161 cagagtcttg ctctgtcacc cacgctggag tgcaatggct taatttcagc tcagtgcaac 17221 ctccacctcc caggttcaag tggttctcct gcctcagcct ctggagtagc tgagactaca 17281 ggcgtgtgcc actatgcctg gctgattttt gcatttttgg tggagacacg gtttcaccat 17341 gttggctagg ctagtctcga actcctgacc tcagatgatc ccacctccac cacccaaagt 17401 gctgggatta caggtgtgag ccaccgcacc cgaccaacct cccaatctaa agtagccccc 17461 aagttcttct ctctcccctc accctgcctt atatcatctc atactcctta tcgctatctg 17521 atattatatt tcatatttac ttggtatctg tctattttgt tcaagaccat aaattcaggg 17581 cctaaaacat tgcggggtat aaagactgtg ttcaataaat actgtgtaaa tgaactgata 17641 agtaaataag caaatgacat gcatcagtac ttactgaatg ctgcactgaa tgtcagcaaa 17701 ggcataagag aatgtctgga tctgtagttt ctgatgtaat cgaagcagaa acttgtttcc 17761 cagccatgcc cacattagtt ttttaaatga caaaaaataa accctactaa gacagatggc 17821 gcctcagggt agaaagaaca tgggtttgga tgtgaataac tcacatctga aacacactta 17881 gtagctatat gaacttgtac aagtgactca acttctctga gctccacatc tcactgtggg 17941 tggaggtaat ggtaccctcc tcctgggggt attttaagtg agacagtgca cgctgagttg 18001 aggtcctgct ccacacactg aggcatggtc aagtccaaaa acaagtaaat gaaaaagaca 18061 aaaatccttg actttgtgga attggcagtc agtaaataag aaatataaat taaatatatg 18121 ttagttagat ggtgagaaat aataaggaga aaagccaatg ggggtgggga acatgagaga 18181 aggcttccag ttttgaaatg gggtagccaa ggaaggcctt gattaggtgc cttttgagat 18241 gagggacaga gccacgaaga cagctgggga aggaagcagt tcaggcagtg agaagaacaa 18301 ggctctaagg tgtgaatgtg cctgtttcaa gaacagcagg aagctaatgg ggctggatgg 18361 tgagaagtaa tcaaagatga ggttagagag ggaagggcct tggctggggg gcagtggctc 18421 atgcctgtaa tcccagcact ttgggaggcc gaggcaggta ggttaggaga tcgagaccat 18481 cctggctaac acggtgaaac cctatctcta ctaaaaatac aaaaaaatag cccggcgtgg 18541 tagcacgcac ctgtagtccc agctactcag gaggctgagg caggagaatc gcttgaaccg 18601 gggaggcgga ggttgcagtg agccgagatt gcaccactgc actccagcct gggccacaga 18661 gcaagactgc gtctcaccaa aaaaaaaaga gagagagaga aagagattga gagggaaagg 18721 ccttgtgcag agccttgcag gccctgtagg aaagttagct ttactctgag tgagtggaga 18781 agcgattgga gacttttgag cagaggagtg aggtggccta acttgtattt caactgactc 18841 actttggctg ctgtgtagag atatggacaa ggagagcaag gacagcagca ggaagacaag 18901 ttaggagatg caagagatga cataggcttg gactaggata ttggcaatag ggatgagaag 18961 agatgagaag tgctcagatt ctggatatac atattttttt tttttttttt gagatggagt 19021 ctcgctctgt cgcccaggct ggactgcgga ctgcagtggc gcaatctcgg ctcactgcaa 19081 gctccgcttc ccgggttcac gccattctcc tgcctcagcc tcccgagtag ctgggaccac 19141 aggcgcccgc caccgcgccc ggctaatttt ttgtattttt agtagagacg gggtttcacc 19201 ttgttagcca ggatggtctc gatctcctga cctcatgatc cacccgcctc agcctcccaa 19261 agtgctggga ttacaggcgt gagccaccgc gcccggcctg gatatacata ttttgaaggt 19321 agagtccaca ggatttaaga cagtttggat gtggagtatg agaaaaagag aggaatcagc 19381 aatatcttca aggatttggg cctaaacaat ggaaagaata gaactgcaaa actcaattct 19441 atcaggaagg aagagaaggt taagggagaa gatcaggtca gttctggaca tgttaagttt 19501 gtggtacttg tacttggaag tacaagtctg gagttccggg aagtgtatga tcttgagagt 19561 catcagcaca aaggtggtct tcatagatgg tatctgaatc tggtagggag ccaggagact 19621 ggatgagatc acctagggag acagaaaact agaagaggcc cagggaataa ccatggacac 19681 tacaatggta agaggttggc gagatgaaga gtaaccaaca gggaggcaga gagggagggg 19741 cctggggcaa cggggggctc catatctccc gaaccatata tttacaaata atctccaaga 19801 gtggattact cttgcttatt tcttattttt ttttttttct cactctggtc tcaccttgtc 19861 accctggctg gattgcagtg gcatgattat ggctcgttgt agcctcaacc tcctgggctc 19921 aagtgatcct cccacgtcag cctcccaagt agctggggca acgggtgtgc atcaccacgg 19981 cctgctaatt tttgtatttt ttcataggga tggggtttcc ctatgttgcc caggctgatc 20041 ttgaactcct gggatcaagt gattctccca ccttggcctc ccaaagtgct gggagtacag 20101 gcatgagcca ccgctgcacc tggcactctt gcttatctct atggctacag tggcctattg 20161 ccttctttgt gtttggacac attatcaggg ccaacctcgg agttcacctt ttaaggtcac 20221 aagtccctat gactgaagtg tgaatgactg acaggtttat ccttctcagg gggtatacaa 20281 tgaaagaaaa tccctttaat aaaatgaatc tctatttgat gaaatactgt aggaaaaagg 20341 gtcatttccc agagggaact atctgtatcc ctggcatgct gcagttcact gtagtgatga 20401 tggtaccacc ctggtcagta tcaacccttg ggaagccatt gggaaggaga aacaagctct 20461 tgggggagca tcaatactgc tttgggctgt aaggtcttag aggccaggaa aagtatctgg 20521 gacccaagca tagctcataa tgcctgagca ggtgcacact gccttacctt aagcaggata 20581 aagcaagaag tgggcaggca gcttcctgac actgtcttta aactcagctt ctgcccacca 20641 cacttctggt tcccttccac ctaaccacct ctctgtctcc gttgcaattt ctcctttttc 20701 tcatgctcca gcctagtgcc ccagcctcct tttccacaaa tggtgttaga ttgtcaacat 20761 tgcagaaatg gtgagttcag ttctttccac caaggtcttc gcggttccat gagaaacctg 20821 ttgtctcttc ctattttcct ttcactactc accagcacca aatccccagt cagcaaacca 20881 gagagtacaa aagcagggac ttttacacta gggttcctct tccccatacc cacagttgcc 20941 tcctcaacta aggaaggtga tgggaaaatg acaatgacac caataggaca atgggacaaa 21001 agcatggaca ggaaactcac aaaagcatgg acaggaaaca cgaatagcca ttaaacagtt 21061 aaaaagaatg ttgacactta acaacaaatg tgcagtttaa aggagcaata agagaccatt 21121 tcatctatca aagtggcgag gattctaatg tggagggttg gcgagagtaa ggacacaggc 21181 agacactact aactggagta gaaaggcctc actgctttct ggagaatggc ttggcaacac 21241 atagagacca agagctttaa aacaagtcta tgtcttcacc ctttcactca gggttttcac 21301 ttctagaatt tagcctaagg caataattag ccatgcaaat atgtaagttc aaggatattt 21361 accgaaggtc tgggcgcggt ggctcacacc tataatccta gcactttggg aggccgaaag 21421 aggtggtcag attacctgag gtcaggagtt cgagaccagc ctggataaca tgatgaaacc 21481 ccgtctctac taaaaataca aaaatcagcc aggcatgtgc ctgtaatccc agctactcag 21541 gaggctgagg caggagtagc ttgaacccag gaagcagagg ttgccgtgaa ccgagatcac 21601 accactgcac tccatcctgg acaacagagt gagactccat ctcaaaaaaa aaaaaaaaaa 21661 gaagaagaaa aagaaaaaaa aagaatattt accacagctc agtttattat atgtttaaaa 21721 attgaaggcc aggtgtggtg gtacacacct gtaatcccag agctttggga ggccgaggaa 21781 ggaggttgct gtgagcccag gaattcaagg ttatagtgag ctatgtgcgt gccactgcac 21841 tccagcctga gtgacagagc aagaccctgt ctctaaaaaa acagaaagca aattgagacc 21901 cacctaaaca tgctataata ggaaattggt ttaaatgaac tagtagaata ctgggtgacc 21961 attataaatg atgctggctc acgggcttcc aaatgtgata cactggtact tgtgcccaaa 22021 aatgcataaa caccagacat actcaaattt aggaacagtc tacaaaacaa ctgtcctgta 22081 tgcttaaaaa tgccagtatc ggccaggtgc ggtggctcac gcctataatc ccagcacttt 22141 gggaggccaa gatgggtgga ttgcctaagc tcaggaattt gagaccagcc tgggcaccat 22201 ggtgaaaccc tgtctctact aaaatacaaa aagtcagcca ggcgtggtgg tgggcgcctg 22261 taattccagc tactcaggag gctgaggcac gagaattgct tgaacccagg cggtggaggt 22321 tgcagtgagc caaggtcgtg ccactgcaat ccagcctgga ctgtctcaaa aaaaaaaaaa 22381 aaaaaagtca atataatgaa aggcaaagta catttaaggt actgtactca tattaaagga 22441 aactaaaaag actcgacagc taaatgcaac gcaggatgcc aaatgagatc ctagaccaaa 22501 ggaaaaaatt gtcatgaagg acattatggg gcaattggca ggacctgaat ttggactgtc 22561 aattagatca tagtattaca tcggtctaag tttcctgatt tggataattg tactatatta 22621 tgtaaaagaa tgttttgttc ttagaaaatt tgcactgatg gatttaaggg taaagggtat 22681 cttgtatgca acttactctc aaatggttca gtaaaaaata caaatgtgta tttatagaga 22741 aataatgata aagtaaatgt ggtcaaatga tattagtgga tgaatctgag tgaagggtat 22801 cctggaattc tttttagtat ttttgcccct ttcccatgag attaaaagta ttttaaaata 22861 aaaagctaaa aaaaggaaga aagtggtgct ggtgaagtat attccccggt aggggaaggc 22921 tctcaggtgc accagcagca gccatgagtg cctcaacacc agggagagca cagctgccac 22981 tgacaccttc tgccaccctg gactctcagt tccctgtgct actaaaggaa ctcagtgtgt 23041 ggttgacccc aaagttgtcc tgggttgact caagaaggta ggatgagcat tctgaggcaa 23101 agaattctct tttgtgattt tattgactcc aattttgcat tctgactggc attccctgca 23161 tcccaaggac cttgacagcg gagggaggca gagatggagg aagtgaaaac tacccaaatt 23221 cagtgtttgt tacagacaat tcagactgca aaatttaggg tagactatgt tcatttatca 23281 ctgataatga cagtcttaac attcccctac aacaggaaga ccaagatttc cccaaaaccg 23341 gccagcatct tgcccattcg ccagaaggag aaaaataagt cctggcaaga gccaagataa 23401 ggcccagaag cccctgggtt cctttagcca aggtgagtgg tttcaaatta tgacaagttg 23461 caggttctct gagaagcatc tgtaataacc tggcaaatta agcatcctct cctgggagga 23521 ggaatacaga actctgtaac cacccaatac ctgtttccag gtcctgcccc tcctggggca 23581 cacggcagcc accttgcaat tctcatccct agaaaggaga gaccagatca acaaacagca 23641 gggctgggac tgcccagggg gttccgagat tccttctccc ctcctatcac ctgccctcca 23701 ggcacaccgt cctacttccc cctacttccc caggggttgt cagggacaga aggcccctcc 23761 ttcatccccc ctagtgttcc tccactcttc ctccgccccc cattactagg gtgtccagga 23821 cattgtgtga ctcaggaaac agctcagacg tgaggcttgc agcaggccga ggaggaagaa 23881 gaggggcagt gggagcagag gaggtggctc ctgccccagt gagagctctg agggtccctg 23941 cctgaagagg gacagggact ggggcttgga gaaggggctg tggaatgcag cccccttcac 24001 tgctgctgct gctgctgctg ctgctgctat gtgtctcagt ggtcagaccc agaggtgagg 24061 catggcgtgg gtgaggtgag gggacccagc tcccttagga ggatgttcag tggggtgggg 24121 gaagagggcc aagccccagg ccgtgtgagg gatgctggat ggaggagatt ctcactgccc 24181 aaatagagac ggcctccagg gaaagacggc tctgcccatg gagctgcttt gggcctggtg 24241 ccaggggtgg tgactgctgg gggatgggtg agagggtgcc cacctccagg aagaacctcg 24301 tcagcactgg cactggagga ctcttgcagc catagggaag aggggaagag ggaacacact 24361 gaccacctgc ttggggagga gatgagaggg aagcaggaga tggggacatg aaaggtcagg 24421 cctactaagc cctttcttag tccagctgtc cccacccccc ggatggctca atgctcggcc 24481 tttccgggag gaaatctctt cgaagtctca gccattcacc tcccgggagc cacctccgcc 24541 cctcttctga cccctgttgt cttgcttccg agagatggag tccgaggctg gacttgggag 24601 gccagagaat aaacaggaaa ggggggtagg gattagtaac tgggacggag ggcactgggg 24661 ctggggctgg gtaccatgtg gagagtgggg acagatgtga agaagaggtg gtttagagta 24721 cctgtgggag ctgctgtggg caggtctctc aggagcacct agaagaggaa aggtggaggc 24781 acagcaccca gggcttccat tgcgcctgcc tctcctccct cagggctgct gtgtgggagt 24841 ttcccagaac cctgtgccaa tggaggcacc tgcctgagcc tgtctctggg acaagggacc 24901 tgccagtgag tgtgccttgc aggagtggga gactggagag aaagggggag ggagagcagg 24961 gggggagagg tgaggaagtg agaccaaaga agaaagagag gaagtgaagg agatgaaggg 25021 aaacaaatga aggcagagga gggagtgggc aagaatagga agaggggcca gtgatgtgag 25081 ttttcctctc ctcccctgcc caggtgtgcc cctggcttcc tgggtgagac gtgccagttt 25141 cctgacccct gccagaacgc ccagctctgc caaaatggag gcagctgcca agccctgctt 25201 cccgctcccc tagggctccc cagctctccc tctccattga cacccagctt cttgtgcact 25261 tgcctccctg gcttcactgg cgagagatgc caggccaagc ttgaagaccc ttgtcctccc 25321 tccttctgtt ccaaaagggg ccgctgccac atccaggcct cgggccgccc acagtgctcc 25381 tgcatgcctg gatggacagg taagcgctgc tgggggcagc caggagggga caggcaggag 25441 caatgggcta ggctgtgggt ggggaagata gaactggagc ctgagaaact gcaagccctt 25501 tgaagacaga agccatgaga atcaacatgc caattcttgg caatccactt acccacaacc 25561 aacattcacc agcatggttg tactgattgc taaaatgtta aaatatttcc aaattaaggg 25621 tgccatgagc cccctttgtg caccatcctg atgcctgtcc tagccccttt aatctcccca 25681 ttgcctagca gctagaagag ggtcattgct ctgcatacca ggggtcctcc agacttttgc 25741 attctgagca tctgaatggc tcccattctg agtggaggga gccattatat cacctgggaa 25801 gactgcagtg gtgggagggg caccgggaag ggaaggatgt gacccagaga gtggattggg 25861 ggccgcccca ggaggagggg tgtaaccctg gggcaagctt agtgcttcat tctaggggct 25921 ctgcaccagc ccctggatcc aaatgctagc tctgccactg atcagctaca tgacctcata 25981 taagatattt tagctttctg gtgttcagtt gtcagctgac aaacagggag agtaatggtc 26041 acacttcata aggttgctga gaggacagaa ggggccgatg ctcaggagat gcttgctcag 26101 ctcagcacct ggcacctcca ctgctgccgc cattaccact ggtgcacatg gactgtgaag 26161 tgagtctcca ggtgcctaaa cccacttaaa gattaggaaa tgaggttcag aaaggcaaag 26221 tggctcaccc aagggtatac aaccagttgt ggcacagcat ggtgccacct gagtctcctg 26281 cctgcagacg tggggtgctt ttcacctccc ccaagatcac ccacgtccca gattttctca 26341 ggcaaggcca atttgcaata ctctcatcat cactttagaa gatatggtca ctccagataa 26401 accctcccaa gccatgacat cgctcagagc aggggtgatg gaacagagca aagaaagtat 26461 ggtaataaag ggaaggaaat atgaaaatga gacccagaga taatccagag tgagcactgg 26521 gtaacctcag atgggctaga attcgtacaa tgctagaaac ggctccctct gtcctctgcc 26581 tcaggtgagc agtgccagct tcgggacttc tgttcagcca acccatgtgt taatggaggg 26641 gtgtgtctgg ccacgtaccc ccagatccag tgccactgcc caccgggctt cgagggccat 26701 gcctgtgaac gtgatgtcaa cgagtgcttc caggacccag gaccctgccc caaaggcacc 26761 tcctgccata acaccctggg ctccttccag tgcctctgcc ctgtggggca ggagggtcca 26821 cgttgtgagc tgcgggcagg accctgccct cctaggggct gttcgaatgg gggcacctgc 26881 cagctgatgc cagagaaaga ctccaccttt cacctctgcc tctgtccccc aggtgtgtcc 26941 tcacaggggc tctccggccg cccctctctc tgggcagggc aggatgtctc cgttggagcc 27001 tcctcccaca gctgatccat gaccctgtca ggtttcatag gcccgggctg tgaggtgaat 27061 ccagacaact gtgtcagcca ccaatgtcag aatgggggca cttgccagga tgggctggac 27121 acctacacct gcctctgccc agaaacctgg acaggtgagt tgtttaagcc acatccatga 27181 cacccatggc ccagagagtt ggcccctggc ctcccctact catagggctc ccagccttag 27241 ccctcgtccc ctccccaacc ccctgcaggc tgggactgct ccgaagatgt ggatgagtgt 27301 gaggcccagg gtccccctca ctgcagaaac gggggcacct gccagaactc tgctggtagc 27361 tttcactgcg tgtgtgtgag tggctggggg ggcacaagct gtgaggagaa cctggatgac 27421 tgtattgctg ccacctgtgc cccgggatcc acctgcattg accgggtggg ctctttctcc 27481 tgcctctgcc cacctggacg cacaggtatg ggggtagagg gtatcaggag gtgggaggta 27541 gagaaggagg gtgagagaag caccaggagg actgctagga gcttcaaatg gcctttgaga 27601 gcctcacccc ctcttacccc tccaggactc ctgtgccact tggaagacat gtgtctgagc 27661 cagccgtgcc atggggatgc ccaatgcagc accaaccccc tcacaggctc cacactctgc 27721 ctgtgtcagc ctggctattc ggggcccacc tgccaccagg acctggacga gtgtctgatg 27781 ggtgaggcca ctcccacttc agagcctctc tgagcctcag acaggcctct gcactgaaga 27841 cagaaaaggg ggcagattgc ttttccaatt aaaaaaccaa acatcttttt ccttgaattt 27901 gcccagattt ggcatctctt gcctacatga ccctctctcc aatgttcagc ccctcagtcc 27961 ccatgaattt ggtcccttat ttcctttcca tcttaaagac acaagcccct tccccaattt 28021 ggtctcgtct gccacacgca ggcccccaca ccttccctga cagtctcacc tccttgccct 28081 tcctgccctg acccctgtgg actcccagct cttctctcct cccagcccag caaggcccaa 28141 gtccctgtga acatggcggt tcctgcctca acactcctgg ctccttcaac tgcctctgtc 28201 cacctggcta cacaggctcc cgttgtgagg ctgatcacaa tgagtgcctc tcccagccct 28261 gccacccagg aagcacctgt ctggacctac ttgccacctt ccactgcctc tgcccgccag 28321 gtatcagctg gatggggcct tgggtgggga aaacagggaa ctagtcctga acccactagg 28381 aatgccccct ccagagtaag gacagcttca ggccaattgg cgtaagttac cacagatgct 28441 tctctctcta cccccagacg aaaactcagg gacacccaag acccctggga gaggggttac 28501 cacagatggt agtgaggtta tgcattcctc aacttggggg gaagctgcca ttcatttcat 28561 agtcatcata gaggctgcac aacctggtcc actgtacaca gcagcccagc aagagagggt 28621 agaagagcag ttcataaact ttctgtgctg cagcctttgc tcaggccaac ccagaatgct 28681 ccctctgatt atagaaactc tcccatgtag agattcaagg taatccctta aaatcccaaa 28741 agccctgtga tacaacagga aaatttggta caacaagaaa aaaattgctg caagacagca 28801 cccacctcca ggctagtttt aagggggaaa agtcgcccca gggagacagc aacagagcca 28861 acatcaagga gttgaatgaa atcagaaaaa taatcgccaa ctttatgcca ggtactgtct 28921 gagcatctta caggcattgt ctcatctact tattacaata actctatgag gtcagcactg 28981 cccattttat aactgaagaa actgaggcac agagagttta agtgacttgt ccaaggtcac 29041 ccagctagca agtggcagag ctgagattca aaccaagggc ttcaacaatt ataaccacta 29101 ccccatattg actttctaaa ctgagcggca cccaaagata ctggctcagg tcacccaaca 29161 gacaatcata gagaaataag agaaaacggt tcggtaaccc aagggacaac attgtagata 29221 tcaaggagct tcagaagcag actcctcagg caagaaaaga aaggaagcca aggtccagag 29281 gttgatccca cctcaattca ggatgaaaca gtggagacca ggatgaaccc aaagcaacgg 29341 gacaaatata ggagcaacaa gcttcccaga tgcacttcaa attcctccac tttgggatct 29401 ctgttctccc tagcatggag gcccgcccag ggagaacaag aagtgggact cattctccaa 29461 gccaatttgt tcctatttgt accttgaggt cctccgggct gatcagcctg ccctggtgag 29521 ccccgccctc tgtatacaca ggaatggcca ccagaaagcc ttgtagtcct cccgcaggtc 29581 cccagcaagc accctgttcc ctggcctttc acacctcaag gagcagggcc acacactgcg 29641 aagcagcagg gcctcagggt tcatcttatt caaccccatg cagacagcac ctcgggggag 29701 gaccgcctga gtggggcaag tcaggagcag ggccgattct agaacacagg tctcccaggc 29761 agacctggct gagccacagc cctcatggtc cccatgtccc caggcttaga agggcagctc 29821 tgtgaggtgg agaccaacga gtgtgcctca gctccctgcc tgaaccacgc ggattgccat 29881 gacctgctca acggcttcca gtgcatctgc ctgcctggtg agtacagatg cctctctggc 29941 caccctcaga ccccaggcct ctgaacctgc agagttcagg ctcagcaatc acccaaggcc 30001 acttgaagct ctctctagcc aagccaagga gtcctccaaa tctgtctttg ctccccaaaa 30061 tctctactct tacatcccca aatcttccct tgcttacttg cccattctca tctctgtcct 30121 accaaatcac ccaaagatcc ctcctttcca gtcctcctgc acagcctctg tgtatgcatg 30181 ttgaggtccc aggctggtct tggcactctc atcataaacc cagcaaaagc tgccccaagc 30241 ctttcttctc cagcctcacc ggacactcct ctgtcccctg ctattataat aactacactt 30301 attcaccact tactccaggc tagccatttg gctgagtact tttcaggcgt tatgtcattt 30361 aatcttttta acagtaccat gaggtaggtt ccattattat tcccctttta cacagaacag 30421 gaaactgggg cctagagggt tgaccagctt gcccaaagtc acacagctgg caggtggctg 30481 agcttcacct tttctgcatc atctcctgtc accccacgct cacctgcccc aggtgtcttc 30541 tctgggaagc tctgcaagtt cacctttcct ggcaagggaa ggcgccatgc tgtgcctccc 30601 tcgatgacct tggcctcctc tccccaccca cttccgcccc accaggattc tccggcaccc 30661 gatgtgagga ggatatcgat gagtgcagaa gctctccctg tgccaatggt gggcagtgcc 30721 aggaccagcc tggagccttc cactgcaagt gtctcccagg taaactgggg cacacactgt 30781 gggggacagc gggagcagga ggcagacatc cgtgcaggtc cctgaccttc ctgctgtgcc 30841 acaggctttg aagggccacg ctgtcaaaca gaggtggatg agtgcctgag tgacccatgt 30901 cccgttggag ccagctgcct tgatcttcca ggagccttct tttgcctctg cccctctggt 30961 ttcacaggtt cacaggggag gcattggaaa gaactggcag aatattttat tccatttggg 31021 ttggggcaga gttcattggt gggtgtttga tggttgggat gtgagaatag aatgagaatg 31081 gtatccttta aagttattta gtgtaaaacc tgcataattg tacaaccatg gggacaaggg 31141 caggggttac cgtagggccc acatgggccc agtgtaaatg gtgtgattgc ggcttgggag 31201 cagaggacgg ggctcaaaga aaaggcattc acttgcttat ttagcaagca cttaccaacg 31261 cctactatgc cagatacgga ggcaaatctg agtaagacag ttgccatctt catgtgactt 31321 taagtctaat acagagagac cagcaagtct cctgttgatc atagcccaca gaggtgctct 31381 gagagaaaaa cgtgcaggat attacaagag cacagaggac cagccaaccc agactaaaaa 31441 tggaggaggt gatggctgag atgagtcttg aaagataagc agaggctggg cgtggtggct 31501 cacgcctata gtcccagcac ttcgggaagc cgaggcgggt ggatcacctg agattaggag 31561 ttcgagatca gcctggccaa catggtgaaa cctcgtctct attaaaaata caaaaattac 31621 actttgggag gccgaggcgg gtggatcatg aggtcaggag atcaacacta tcctggctaa 31681 cacggtgaaa ccccatctca actaaaaata caaaaaaata actgggcgtg gtggcgggcg 31741 cctatagtcc cagctactcc agaggctgag gcaggagaat ggcgtgaacc cgggaggcgg 31801 agcttgcagt gagctgagat cacgccactg cactccagcc tgggtggcag agtaagactc 31861 cacctcaaaa aatacaaaaa tacaaaaatt agccaggcgt ggtggcgggc gcctgcaatc 31921 ccagctattg gggaggctga ggcaagagaa tcgcttgaac ctgggaagca gtggttgcag 31981 tgagccgaga tcactccact gcactccagc ctgggtgaca gagcaagatt ccatctcaaa 32041 aaaaaaagaa ggaaggaagg aaggaaagaa ggaaggaagg aaggagagaa gggaaggaaa 32101 ggaggaaatg aggaaagaga gaaagataga aagatggatg gtcaggagtc tgtctaaata 32161 gagtgccaga tagtgtgttt tagctggaga taactgcatg tgcaaagaca cagatggaag 32221 aaaagcccac cccatttaag gaactgtaag aaagtcagag ttaagggtac agcaaggcaa 32281 agatgagaaa cacagctgtt gtacaaatgt catgtcctgc aggactctgc atatcattct 32341 gagaaagtta aacaatatct taaaggcaat agggacccat tgaaggacag gttcatgggt 32401 tcatagggag tgagtaaggc aagcataaga agtggctttg gcccaatgaa ggatgtggct 32461 ttggcccaat gaaggatgtg gaggagctgt tttctttttg acccatcttc ccaccccagg 32521 ccagctctgt gaggttcccc tgtgtgctcc caacctgtgc cagcccaagc agatatgtaa 32581 ggaccagaaa gacaaggcca actgcctctg tcctgatgga agccctggct gtgccccacc 32641 tgaggacaac tgcacctgcc accacgggca ctgccagagg taacatcttc cagaccctcc 32701 ccatctgccc cctcctttgg gctcccttcg ctaggacagg agaagacagc cagtgagatg 32761 taggtctgtg agaaatgacc aatggggaaa aggaaggaga tggcaaagtt cttagggcaa 32821 ggcagtggga gggctcaact ggtaagtgtt atccaaggag aagagagtcc acaaaaactg 32881 gtggaaacag aggaccaggg ggtcagagca gaaagaagag cattaaatcc cagggcgaat 32941 taatcattca ttagaaaaat atctgctgag gccaggcgca gtgctgatta cggtctcatg 33001 ccggtaatcc cagcactttg ggaggccgag gtgggcggat cacctgagat cagcagttca 33061 acatcagcct ggccaacatg gtgaaaccct gtagctacta aaaatgcaaa aattagctgg 33121 gcatggtggc gcacctgtaa tcccagctac ttgggaggct gaagcggaag gatcacttga 33181 acccaagagg cggaggttgc agtcagccaa gatcatgcca ctgcactcca gcctgggtga 33241 cagagcaaga ctccgtctca aaaaaaaaaa aatctgctga gcacctactt tgtgtggcta 33301 ctgttccagg ccctggggga aacacaaagc aaaagagata aagcaactgc tctcgtagag 33361 ctttcattct aaagaaagac agaaaataag taagttacag aaagaatata tatgtgtgtg 33421 tatatatata tatatatata tatatatata tatatatata tatatctcca actagatata 33481 tagatgcata tatctagttg gagaaaatga gcaggtgttg gggaggatgg gggcggtgct 33541 gagagcaaag tcactgaaga aagaggccag aatctcaggg ccaaaagaag aggaagtcat 33601 aggggtccag gcagcaggga ggggaacata agcagtagga gaagagaaaa accctcccct 33661 ttctctttac aaccagatcc tcatgtgtgt gtgacgtggg ttggacgggg ccagagtgtg 33721 aggcagagct agggggctgc atctctgcac cctgtgccca tggggggacc tgctaccccc 33781 agccctctgg ctacaactgc acctgcccta caggctacac aggtgagacc ctccctaaac 33841 catatacacc ctgtgctggt caccccctat gtcaagggta aggcaggcga ccacgggccc 33901 tgagtctcag tcaccactaa ggagaactgg actgcagggg acacgggtga tgtgtgggag 33961 gtggtgagaa cggaggttga tgggaaagaa taatagggtt ggggatgagg aagggaagga 34021 agaggaaaga gaaggagagc ggagggaggg aaggaggaca ccctttctca gcccccacct 34081 gtttccttca ggacccacct gtagtgagga gatgacagct tgtcactcag ggccatgtct 34141 caatggcggc tcctgcaacc ctagccctgg aggctactac tgcacctgcc ctccaagcca 34201 cacagggccc cagtgccaaa ccagcactga ctactgtgtg tctggtgagt gcccactgtg 34261 tcatggggct ggggtccaca ggagaatgga agaactaagg agggtatgct tgtgtcatat 34321 tttttaaaat ttaattgaac agaccagcct gggaaacata atgaaactcc atctctaaat 34381 tagctgggca cacagtggct aaggcctgta gtctcaaata cttgggaggc taaggtggga 34441 ggatgacttg agcccagaag gttaagactg cagtgagctc tgatggcacc actgcctggg 34501 taacagagca agaccctatc taaaaataat aaaatatcat ataaataaaa attaattgga 34561 tggcagtgag ctgtcgtttt gatgggggtg gggggtgtat gtcctatgta cagtgtgact 34621 gaggtcacag gccaggttac aggcaaggaa cccaaaccct cacagcctcc tctctccagc 34681 cccgtgcttc aatgggggta cctgtgtgaa caggcctggc accttctcct gcctctgtgc 34741 catgggcttc cagggcccgc gctgtgaggg aaagctccgc cccagctgtg cagacaggtg 34801 agcagggccc aaagacccct agaagggaga aaacccctca gccttcccac ccctcctctc 34861 atctccccct gggttccagg ctgcctccca ccccacacct cgttccgcct tccctcctgg 34921 cttgccacct ccctgtggtt gcctcccgac aatatcacac actaccttcc ccttgttgcc 34981 cgaacttctt cctatcatgc cctgtctctg tcttgttcag cccctgtagg aatagggcaa 35041 cctgccagga cagccctcag ggtccccgct gcctctgccc cactggctac accggaggca 35101 gctgccaggt gagggccatt gaagtcaggc gtgctgagga gggaagtggc tgggagggaa 35161 accagggagg gtcacctggt cccaggccat tcaggagaag gtttttgaag taagggattt 35221 cgaggagctg gagtgggcag aggaccatct ctgggctgag aatctgatgc tatgtcctgc 35281 ctcacgctcc ctcccacccc tcagactctg atggacttat gtgcccagaa gccctgccca 35341 cgcaattccc actgcctcca gactgggccc tccttccact gcttgtgcct ccagggatgg 35401 accgggcctc tctgcaacct tccactgtcc tcctgccaga aggctgcact gagccaaggt 35461 aaccaacacc ggcactgact ggagagcaaa tgaagaaaat atgggtgttc tcacctgccc 35521 cgttcctgtg ggcttccaca cagcttttgg ccaaaacaac cacctgagaa tcaaaaccac 35581 acagacaaat cagttcttga ttgcagaggt gttgaattgt gcaatcagag aagcctaaca 35641 caattcctgt tacctcattt aacgcaattc aagcaacacc aaccagccca ttccacagga 35701 ttccagttta acccaaccat gagggacatg actcaacgtg gagcaactca actctgtcct 35761 gatcatcata acccaggcag agtggaccac acttaatcca gctacactca acgcatttca 35821 ccccacccca ctcacttcga tgtcacgcca ccctctttgt aaaagaacac taaaataagg 35881 tttgatatga gaagagctcg ggaaagagat tatgatggag ttaagggata gctgatggga 35941 atcacaggga gagaacatta aaggaatgca gaactcaggg gacagactta agccacttgt 36001 cctcacttga ggcatacctt gaattatttc ttgctcaagc cttgctttaa gccaagaaaa 36061 cagttgcttt aacattatga taagctttag gggtatgtct ccaatattct ggcagaaagg 36121 gtgaggagag aggtaaaact cgatggtcta atttcttcag tgccaggagt tattaactcc 36181 ttgggcccag accagtctcc acccaggaat gttagtggtc cccaaggtaa aggcctctac 36241 acccagagat tgggccctga atacaccctt cctcctcctc acatacctta tttatttatt 36301 tatttattta tttatttatt tatttattta tttatttttg agatggagtc ccagtctgtc 36361 acccagggtg gagtgcagtg gcacgatctc ggctcactgc aaccttcacc tcctaggttc 36421 aagtgattct cctgcctcag cttcccaagt agctgggatt cgtgtgccac catgcccagc 36481 taattttttt tttttttgta tttttagtag agatggggtt tcattatgtt ggccagactg 36541 gtctcgaact cctaacctca gtgatccacc cgtctcagcc tcccaaagtg ctggaattac 36601 aggtgtgagc caccacgccc ggccacatcc ccatttaccg tttctgtttg tttgtttgtt 36661 tgtttgtttg tttgagatgg agtcttgccc tatagcccag gctggagtgc aatggcacga 36721 ttttggctca ctacaacctg cgctgcccag gttcaagtga ttctcctgcc tcagcctccc 36781 aagtagctgg gattacaggt gcctgccaca acgcccgacc aattttttgt atttttagta 36841 gagacggggt ttccccatgt ttgccaggct ggtctcacac tcctaacctg gtgatccacc 36901 tgcctcggcc tcccaaagtg ctgggattac aggcgtaagc catggcgccc ggccccccac 36961 ttatctttgt atatccctgt ttgtctctct ccccaggcat agacgtctct tccctttgcc 37021 acaatggagg cctctgtgtc gacagcggcc cctcctattt ctgccactgc ccccctggat 37081 tccaaggcag cctgtgccag gatcacgtga acccatgtga gtccaggcct tgccagaacg 37141 gggccacctg catggcccag cccagtgggt atctctgcca ggtgagaggg tctgcaggag 37201 aagggggagg aaagacaagg gtgggctgga tgggagagac agtagtgact gagggaagac 37261 ccaatgtatc catttcacct ccttttattt tttttaaccc cacacaccca actaaccaca 37321 aagtcatgcc actttatctc ctgaagtttt ctcaagtctg tgctttcacc ttcacccttt 37381 gtcccactgc ctgagtcctg acctttgttg cgttccagat agatacttgc atcagcctcc 37441 taactacagt ctctgcctcc acacttgtcc ttctataggt tcagccttta ccccgtggcc 37501 agggtactct actgaaaatg tccagttgac ctgtctctcc cctgctgaac cttcggaggc 37561 tgcccgcctc attctgaata aagtccaagg cctgcatgac ccagccctgc ctgcctctca 37621 ggcctcagct ctccccatcc cacccttcta ctctctgctc cagcaacatg aaactgctgc 37681 tgactttgcc acgtacccca tactgattct tgcctctcag cctttatccc tgctattgac 37741 actacctgga atggccttcc caacccctct tccacaggct ggtgttcagg aggcatcttc 37801 ttcaggaaga tgtccctaac ttctccccag gctggattag ggcctcttct ttgtggtcca 37861 ggtcactaag ccaggagagg caaagctggc attcaagtct aagcagcctg attcatatgc 37921 tcagaaccac aacttttttg tgtgtgtggt tgccatttta ttttcttttg ttattgacaa 37981 acggtagtca tacgtatcta tggggtacat gtgagttttt tggtttcttt ttttttttct 38041 ctctcgtttt tggagacaga gtcgctctat cccccaggct gaagggcagt ggcataatct 38101 tggctcactg aaacccccgc ctcccaggtt caagcagttc tcatgtctca gcctcccgag 38161 tagctgggat tacaggaacg cgccaccacg cctggctaat ttttgtattt ttagtagaga 38221 tggggtttca ccatgttggg caggctggtc tcagaactcc tgacctaaag tgatccaccc 38281 gctcagcctg ccaaagtgat aagattacag gtatgagcca ccgcacctgg acacatgtga 38341 tattttgata catgcataca atgtatcatg atcaaatcag ggtaattggg atatctatca 38401 tctcaaacat catttctttg tgttgagaac atttcaaatc ttctcttcta gttattttga 38461 aatacaaatt gttaactatc accatccttc tctgctatct aacactagaa cttattcctt 38521 ctatatgacc ataattttgt atccattaac caacctctct tcatctcccc ctccctgcca 38581 ccctttctaa cctctggtaa ccatcattct accctacttc catgagacta acttttttag 38641 ctcccacata tgagtgagga gcaatattac acatgcaata ggaggctgat gcaagtatct 38701 acctggaatg caagaaaggt caggactaac taaggcagtg gcacaaaggt gaaggtgaga 38761 gacatgcaat atttgtcttt ccgtgcctgg cttatttcac ttaacataat gtccttcagt 38821 ttcatccatg ttgctgaaaa tgataggatt tcattctttt tctggctgaa taatattcca 38881 ttgtgtctat atgacacagt ttcttttttc cattcatttg ttgatggcac ttcagttaat 38941 tccatatgtt agctattgtg aacagtgctg caataaatat ggggatgcag atatctcttc 39001 aagatactga tttcctttgg atatgtaatg aacagtagat tgatcacatg gtagatctat 39061 ttttaatttt tgagaaactt ccatactttt ctccatactg gctgtgttag tttacattcc 39121 caccaacagt gtatgagggt tcccctttct ctgcatcctt gccagcatct gctatttttt 39181 gcattttttt tccttttctg agacagagtc ttgctgtgtt gcccaggctg gatcacagtg 39241 acttgatctc agctcactgc aacctctgcc tcccagattc aagcgattct tgtacttcag 39301 cctcccaagt aactgggatc acaggcgtgg accaccatgc ttggctaatt ttttgtattt 39361 ttagtagaga tggggtttca ccatgttggc caggatggtc tcgaactcct gacctcaaat 39421 gatcttcccg cctcaacctc tgaaagtgct gggattacag gcatgggcca ccacacttgg 39481 cccttgtgtt ttcaataata gccactttaa ctggtgtgag atgatatctc attgtggttt 39541 tgatttacat ttccctgatt tgcatccata tacctgttgg ccatttgtat gtcttctttt 39601 gagaaatgtc tgttcagata atttgctcat tttttaaacc acattatttg tcggtggtgg 39661 tagtggtggt gtttgctgtt gagttcctta tacgttctga ttattaatcc cttgtcagac 39721 agtttgcaaa tattttcttc tattctgtag gttgcctctt cactccatta attgtttcct 39781 ttgctgcaca gacacttttt agcttgatgt aatcacattt gtctgttgtt gcctttgttg 39841 cctggctgtt gaggtcttac ccaaaaaact tttgcccaga ccaatgtctt gaagcatttc 39901 ccaaatgatt tttttttttt ttttttttga gaaggagtct tgcaccgtcg cctgggctgg 39961 agtgcagtgg cgcaaatttg actcactgca acctttgcct cctgggttca agcgattctc 40021 ctgccttagc ctcccaaata gctggaattt acaggtgccc accaccacgc ccagctattt 40081 ttttgtattt ttagtagaga cggggtttca ccatgttggc caggctggtc tcaaactcct 40141 gaccttgtgt ttgaggatta caggtgtgac ccaccgtgcc cggctgaatt ttttttttag 40201 tggcttcaag tttccagctg tacatgtaag gctttaatct attttgtata tgatgacaga 40261 tagagattta gtttcttttt ttcttttttt ttgggggggg ggatagagtc ttgctctgtt 40321 gccctgttgc ccagtctgga gtgcagtggt atgatctcag ctcactgcaa cctccacctc 40381 ccaagttcaa ctgattctcc tgcctcagcc tcctgagtag ctggaactac aggtgcacac 40441 cactacgccc ggctaatttt tgtaatttta gtagagatgg ggtttcacca tattggtcag 40501 gctggtttca aactcctgac ctcaggtgat ccacccacct ccgcctccca aagtgctggg 40561 attacaggcg tgagccaccc cgcccggcct aggtttagtt tcttctgcat atggatatcc 40621 agttttccca gcacaattta ttgaagagag tgtcctttcc ccagtgtgtg tacttggtgc 40681 ctttgttgaa agtaagttgg ctggccggga gtggtggctc atgcctgtaa tcccagcatt 40741 ttgggagacc gaggtgggca gatcacaagg tcaggagttc gagaccagcc tgaccaacat 40801 agtgaaaccc ccgtctctac taaaaataca aaaattagcc aggcatggtg gtgcgcacct 40861 gtaatcccag ctactcagga ggctgaggca ggagaatcgc ttgaacccag gaggtggagg 40921 ttgcagtgag ccagatcgtg ccattgcact ccggcctggc aacagagcaa gactccatct 40981 caaaaaaaaa aaaaggaaag aaaagaaaaa agaaagtaag ttggctgtta aatgtttgga 41041 cttgtttttc tgggctctct attacattcc attggtctat gtgtctgttt tttatgccag 41101 caccatgctg ttttggttac tatagcttta tagtatattt ttaagttagg tagtgtggta 41161 cctctagctt tgttcttttt gctcaggact gctttggcta tttgggtctt ttacagttca 41221 gataaatttt agggttgttt tttctatttc tgtaaagaat attattggta ttttcatagg 41281 ggttgcatga ctctgtagat cactttggta agcacagaca ttttagcagt attcattctt 41341 ccaatccatg aacacaggat atctttccat ttttttgtgt cctcttcaat ttatttcatc 41401 aatgttttat agctgtcatt gcagtactct ttcacttctt tggttaaatt tattcatttg 41461 tttttatttt ttgtaactat tataaatggg attgctttct tgatttcttt ttctgattgt 41521 ttgctgttag cgtatagaaa tgctactact ttttctacat tgattttgta tcctacagct 41581 ttactgaatt tgtttataac cagtgttttc tttaggtttt tctaaatata ggattatgtc 41641 atctgtgaac atggataatt tgagttcttc ttttgcaatt tggatgccct ttatttcttt 41701 ctcctgccta attgctctgg ccaggacttc cagtattacc ttgaataaaa atagtgaaag 41761 tgagcatcct tgtcttgttc cagatcttag aggaaaggct ttcaactttt ccccattcaa 41821 tatgatgtta gctgtgggtt tgtcatatat ggcttttatt attttgagat atagaaccac 41881 agcttttttt ttttgagaca gagtcttgct ctgtcactca ggctggaatg tagtggtgca 41941 atctcagctc actgcaacct ctacctcccg ggctcaagca attcacctgc ctcagcctcc 42001 ccagtagctg ggattacagg tgcctgccac cacacctagc taattttgtg tatgtgtgta 42061 tttttagtag agatggggtt tcaccatgtt ggccaggctg gtctcaaaat cctgacctca 42121 agtgatccac ccgccttgac ctcccaaact gctgggatta caggcgtgag ccaccgtgcc 42181 cggccagaac cacaactttt gatagaaggc tcaagacaga taccctaacc taccctcttt 42241 tttcactttt ttattttatt ttttaacctt ttattatgaa cattttcaaa cataaacaaa 42301 agcagtatac tgatcagtag taaacctctg ggcacccatt actcagcttt acttattctt 42361 tttttttttt tttttttttt tttttgagac agcatctcac tctgttgccc cggctagagt 42421 acagtggcgc gatctcggtt cactgcaacc tccgcctccc gggttcaagc gattctcctg 42481 cctcagcctc ctgagtagct gggactacag ggacatgcca ccatgcccgg ctaatttttg 42541 tatttatagt agagatggag tttcaccata ttggccaggc tggtctcgaa ctcctgacct 42601 cgtgatctgc ccacctcagc ctaccaaagt gctgggatta caggcgtgag ccaccgcacc 42661 cggctattta cttcttcttt tatgaagctc cctcctccaa aacaccccca tcacctgttc 42721 cttccagctc tctgaccact ccttggattc tctgtgaatt cccttttctc tctttgaagc 42781 ctgccttcct ggtactgtac tcttgcacac tctctttcct cttgcaagaa gccagcacgt 42841 ggtacagatc ttgccaatga cccttctctc actagctgag tggcatgaag aagcagaaaa 42901 tggttaagag cattggtttg gagtcacaga ccttcattga ttcccagctc tgccacctat 42961 agctatttga cttgcacaag tcactaacct ttcagagact cagcttcctt acgtgcaaag 43021 taaaaatcga atgagataac ccaaataaaa tgtcattagg gggattttta ggttatgtat 43081 ataaatcatg caataaatgc tagtcatttc tttcctctgg ttgactgaga gcttccagga 43141 aataggaatg ggttctaact ttctttgtat tcctagtgcc taaaacggtg cctgacacaa 43201 agtaggcact caatagatgc ttatgaatta ataaagtatg agagagcctg gtaggtattt 43261 agcaggggag gaaggtttta ccaaaaatgg tgctgtgttt ggtggcagtg tgtcatagag 43321 attgtttggg actggggaag tttgagttgt gtgtcgccaa caattgtgtc tcatggggag 43381 ttgagataga aggattgtga cacatggcca tgatggatgg tgagttgagt gatgctgttg 43441 agctggaagg tgggggactg gacagactat cttgagctgg gtcccttgta gtgctgggtt 43501 gggctcatcc actggttccc tgtctaatcc tctttgtctg cagtgtgccc caggctacga 43561 tggacagaac tgctcaaagg aactcgatgc ttgtcagtcc caaccctgtc acaaccatgg 43621 aacctgtact cccaaacctg gaggcttcca ctgtgcctgc cctccaggct ttgtggggct 43681 acgctgtgag ggagacgtgg acgagtgtct ggaccagccc tgccacccca caggcactgc 43741 agcctgccac tctctggcca atgccttcta ctgccagtgt ctgcctggac acacaggtga 43801 ggccccaaga caaggggcac aagtgtgtct ggagcacagc caagcagacc atggagagcc 43861 agatagtctc cacccatgcg gcagccgtca cctggtccat cccctgcctc cacgcccacc 43921 cccgcccaga aaagatgccc caggatccct tcacctgcac atctagcact gggccaacat 43981 ccaggaatga gctaggatgg aggcagtgac tgatgcagtg tgtgacatct aatctccccc 44041 ataattacag gccagtggtg tgaggtggag atagacccct gccacagcca accctgcttt 44101 catggaggga cctgtgaggc cacagcagga tcacccctgg gtttcatctg ccactgcccc 44161 aaggcaagtg accacaaatc tgccttctct gttgccccct atgctgacaa ggcaagaata 44221 cctcagttgg aatcccagaa gggactgtgg gtgagcactg atgtggaaat tattggaaaa 44281 agccatgcca agctcacagt gggaagtgtc tctcagaagc agtcaaaggc aaggcaggat 44341 cagttgatag catgaatgga attttcaaaa atcacaggcg ttgcctaagg gaaggtcagg 44401 agctccccaa gctcaagctg cgtggtgggt ggcctcagat aggttatttt aactctgtgt 44461 gtgtttgtat atgtatttat ggacctcaga tgcatggaat tagactaatc ttaagctttg 44521 gttcctgata cactgacatt ggtttatgcc tggtcttctt ttattttatt attctaacaa 44581 tgtaacaccc atgaacctaa cccaagaatt tcaatattaa taataactta catctactta 44641 agtcctcctc ctgtatcctg ttccctctcc agaggaagag gaagacatat gatcctattt 44701 ctaaggagta agataataat ataacagccg gccgggcaca gtggctcacg cctgtaatcc 44761 cagcactttg ggaggccgag gcaggcggat cacctgaggt cgggcattcg agaccagcct 44821 gacaaacatg gagaaaccct gtctctacta aaaatacaaa ttagctgggc gtggtggtgc 44881 atggctgtaa tcccagctat tgggaaggct gaggcaggag aattgcttga acccgggagg 44941 cagaggttgc aatgagctga gattgcacca ttgcactcca gcctggacaa caagagcgaa 45001 actctgtctc aaaaataata ataataataa tataatagca ttctattaac tgtttagtct 45061 tctaggactt gcactgtaat gccacagtcc atcaggttgt tgcacacagc tgtgcttcat 45121 ccattttcaa cagaatgtaa tatgtcattg tgtgaaatta ccacaggaca tggtttcaac 45181 atccacaaaa tgattaactt gatgctctct gaggcgcctt ttagatatga gaatctagga 45241 ccctctgcac cgtcttaacc caagagtttg cttgatggag agcgggaaga ataatgcaag 45301 ttgcatctcc aatatctccc ctcccctcca cagggttttg aaggccccac ctgcagccac 45361 agggcccctt cctgcggctt ccatcactgc caccacggag gcctgtgtct gccctcccct 45421 aagccaggct tcccaccacg ctgtgcctgc ctcagtggct atgggggtcc tgactgcctg 45481 accccaccag ctcctaaagg ctgtggccct ccctccccat gcctatacaa tggcagctgc 45541 tcagagacca cgggcttggg gggcccaggc tttcgatgct cctgccctca cagctctcca 45601 gggccccggt gtcagaaacc cggagccaag gggtgtgagg gcagaagtgg agatggggcc 45661 tgcgatgctg gctgcagtgg cccgggagga aactgggatg gaggggactg ctctctggga 45721 gtcccagacc cctggaaggg ctgcccctcc cactctcggt gctggcttct cttccgggac 45781 gggcagtgcc acccacagtg tgactctgaa gagtgtctgt ttgatggcta cgactgtgag 45841 acccctccag cctgcacgtg agcctgaaat ccactggagc cagggaagga gaggggtggg 45901 tgagaggagg aggaaggacg tagatggctc tgagttacag tgtggccaca gccttgggct 45961 ccagggagtt tccaccctaa taaccatcac taaacagggg tcgaagactc tggactccaa 46021 cctagggtaa tggggtggca tcagtattta atgtggggcg tggcctttgg gctcctctct 46081 aagagttgaa ggaactcagg tctcaagcct ccttccctaa gccttgctgc catggagtat 46141 ttcccctagc agtcagcacc tcacagaggg aaaagggcct gggactctcc tttagaaaca 46201 gaggagagct tgggagggta cagagagggg acagtctagg gagacagggg tgttagcaga 46261 cattggggtg tctggactac catccaggac ttgactaagc tcattgctcc acagctgccc 46321 ccacttagca accaaagccc tagagggcac aaaatatggg gaattctttc tagggtgaag 46381 aaaagagtca ggttttaggg aggtcctgag tccccctctc cttaccccac agtccagcct 46441 atgaccagta ctgccatgat cacttccaca acgggcactg tgagaaaggc tgcaacactg 46501 cagagtgtgg ctgggatgga ggtgactgca ggcctgaaga tggggaccca gagtgggggc 46561 cctccctggc cctgctggtg gtactgagcc ccccagccct agaccagcag ctgtttgccc 46621 tggcccgggt gctgtccctg actctgaggg taggactctg ggtaaggaag gatcgtgatg 46681 gcagggacat ggtgtacccc tatcctgggg cccgggctga agaaaagcta ggaggaactc 46741 gggaccccac ctatcaggag agagcagccc ctcaaacaca gcccctgggc aaggagaccg 46801 actccctcag tgctgggtaa gaagctaggt ggagggaagg gccagacacc agttttttta 46861 agagggcaga gggaggaaag ggagccaggg accaatacag aggtctctga ggtgcctcct 46921 ctacaggttt gtggtggtca tgggtgtgga tttgtcccgc tgtggccctg accacccggc 46981 atcccgctgt ccctgggacc ctgggcttct actccgcttc cttgctgcga tggctgcagt 47041 gggagccctg gagcccctgc tgcctggacc actgctggct gtccaccctc atgcagggac 47101 cggtaggtga ccccttgcca ctttctctga cctctgttcc caggccagct ctcatgctag 47161 caacaggcaa tggaggctga atcaaacagg acagctgaga ctgaaaatgt tctttgtggg 47221 gacttacttt ccctaacccc gctttctcta actgaatctc ccactggccc atttgttcta 47281 cagtctcctt ccttatttcc ctaagcacat tatcctaacc tctgtcatag ccctccaaca 47341 aagggatggt ttatcttctc taccagactg agaataccta atagtctttg tatcagacaa 47401 ttcatagtac atgaaagaat aataggctgg gcgcagtggc tcatgcctat aatcccagca 47461 cgttgggaga ccaaggcagg tggatcacga ggtcaggaga ttgagaccat cctggctaat 47521 gcggtgaaac cctgtctcta ctaaaaataa aaaaattagc cggctgtggt ggcgggtgct 47581 tgtagtctca gctactcagg aggctgaggc aggagaatgg cgtgaacctg ggaggtggag 47641 cttgcagtga gccgagatcg cgccactgca ctccagcctg ggcgacagag ggagactcca 47701 tctcaaaaaa aaaaaaagaa aaataactgc tatatcgtac tttgtgcctt actctaagca 47761 ttttacattg ttacctcatt taatcctccc ccacaacccc atgaggcacg tactgctggt 47821 tgagtatccc ttatctgaaa tgcttgggaa caaaagtgtt tcagatttcg gatttatttt 47881 ggaatatttg cattatactt actggttcag catccctaat acaacatcca aatgctacaa 47941 tgagcatttc ctttgagcgt tatgttggta ctctaaaagt ttcagacttt ggaacatttc 48001 agatttggga ttggggttat ggatactcag cctttttttg tgtgtttgtt ttctgagaca 48061 gtcttactct gtcagccaca ctggagtaca gtgacgccat ctcagctcac tgcaacctct 48121 gcctcctggg tttaagcaat tctcttgctt cagactactg agtagctgga attacaatgg 48181 catgccacca tgccctgata attttttttg tttttgtttt gttttgtttt gtttgagaca 48241 gagtcttgcc ttgtcgccca ggccggagtg cagtggcgcg atctcggctc actgcaagct 48301 ccacctccca agttcacgcc attctcctgc ctcagcctcc caagtagctg ggactacagg 48361 tgcccgccac cacacctggc taattttttg tatttttagt agagacaggg tttcaccatg 48421 ttagccagga tggtctcgat ctcctgacct catgatccac ctgcctcagc ctcccaaagt 48481 gctgagatta taggagtaag ccactacacc cagccactaa tttttatatt tttagtagag 48541 agggggtttt gccatgttgg ccaggctggt ctcgaactcc tggcctcata tgatccacct 48601 gcctcagctt cccaaagtgc tgggattaca ggcatgagcc actgtgccca acctcaatct 48661 atattatcat ccccattttg cagataagga aaccgaggca aagacaggct actaaacttg 48721 tccaaaggtc tcccaatagt aatcagtctc accaggagtg gcctctcttt gtgactctgt 48781 ctctcccacc agcaccccct gccaaccagc ttccctggcc tgtgctgtgc tccccagtgg 48841 ccggggtgat tctcctggcc ctaggggctc ttctcgtcct ccagctcatc cggcgtcgac 48901 gccgagagca tggagctctc tggctgcccc ctggtttcac tcgacggcct cggactcagt 48961 cagctcccca ccgacgccgg cccccactag gcgaggacag cattggtctc aagtgagaat 49021 gaggagaaac ccaggctcag gaaggggagt ctctcctatg gcgatattta caatcagaaa 49081 agataagaaa tactattgca gaagtcaaag ataggggaag gagagagggg tgggaagcct 49141 gctggaaatt ttggagaccc tgatggtcat aattccgtgt aacctctacc cacccattcc 49201 tttccagggc actgaagcca aaggcagaag ttgatgagga tggagttgtg atgtgctcag 49261 gccctgagga gggagaggag gtgggccagg tgaaagggct ggggcaagaa tggtctggag 49321 gtgatggaag ggatgaaagg gcaaatcaac cttcactgat ccttgctgtt acccaaaggc 49381 tgaagaaaca ggcccaccct ccacgtgcca gctctggtct ctgagtggtg gctgtggggc 49441 gctccctcag gcagccatgc taactcctcc ccaggaatct gagatggaag cccctgacct 49501 ggacacccgt ggacctggta tgtgagtcaa cccagaccaa gaaaaaaaaa aaaagtcctt 49561 tgaccctatt agaatcagag agtcctttaa tatcagaact agaggaaata attttagact 49621 gagtgcctta gaacaatgat tctcaaagtg tggtcctcag acagcaaaat cagcatcacc 49681 tgggaatttg tcagaaatgc aaattattgg gctccactac agagctactg actcaggaat 49741 ttaaaatgtt aggcaatctg ttttaacaag cccttcaggt gaatctgatc cagactcgtt 49801 tgagaaaacc actgctaggc cgggcgtggt ggctcacgcc tgtaatccca gcactttggg 49861 aggccaaggc gggtggatca caaggtcagg agatcgagac catcctggct aacacagtga 49921 aaccccgtct ctactaaaaa tacaaaaaat tagccgggcg tggtggcggg agcctgtagt 49981 cccagctgct ctggaggcta aggcaggaga atggcgtgaa cctgggagga ggagcttgca 50041 gtgagccgag atcgcgccac tgcactccag cctgggtgac agggcgagac tccgtctcag 50101 aaaaaaaaaa aaaaaaaaga gaaaaccact gccctggaat gtcagagaat taagctgcag 50161 gttcctttta cagaggaaga aactgaagtc agagaaaagc agaaaagtca cttggctaaa 50221 gccacacaga gccagaactt agcttcccaa cacctcaggt tttgattctc tctgagctta 50281 catgttgtcc cttccccctt gttgtgtcct ttagattgac ccattactct gtcttaccaa 50341 cagatggggt gacacccctg atgtcagcag tttgctgtgg ggaagtacag tccgggacct 50401 tccaaggggc atggttggga tgtcctgagc cctgggaacc tctgctggat ggaggggcct 50461 gtccccaggc tcacaccgtg ggcactgggg agacccccct gcacctggct gcccgattct 50521 cccggccaac cgctgcccgc cgcctccttg aggctggagc caaccccaac cagccagacc 50581 gggcagggcg cacacccctt catgctgctg tggctgctga tgctcgggag gtctgccagg 50641 ttagcacaca ctgaggtccc tacagggaat ggggcgagct tacaagtaaa gctggacaga 50701 agcatcccct agagtttgac aaggaggaaa ttggtgtgat tgggaacctg acagggaaac 50761 tgcggaggat ggctgaatat ggattgcgag tggggttaat agtgtaagga actcgagttg 50821 gcagtccaag gtaccccagg ggtcactggc cctctgtctc cccagcttct gctccgtagc 50881 agacaaactg cagtggacgc tcgcacagag gacgggacca cacccttgat gctggctgcc 50941 aggctggcgg tggaagacct ggttgaagaa ctgattgcag cccaagcaga cgtgggggcc 51001 agagataaat ggggtatgta gaggaagggg tgatgtatgc tatagagaag ttgagcagat 51061 ggggtgggag atagcgtgca aaatataggt gcagcagagg ggcattccct ctcatcctgc 51121 tgttacggcg gtcaatctga gatgcggtgg aagtacgggc cgcgtgagtt tcccccccca 51181 actcccaccc tcaacaccac actggccctc cgctccagct tactggggaa ctggcatgga 51241 acacagtgtc tgtggaaagg ggggggaatc tcgtgggggg agactgtctc ccggtctcac 51301 cgaccccaga acaatgcccc attgtccctc ccgcgcactg gtgacgtcac cagggcaaca 51361 cttcctgcag gccggtggtc tcctgggcaa cgcttcccgc ctttgaggga ccagccggcc 51421 cgaatagccc ttcccccaag gccagaaccc gtgggaaacc ggaacccagg cgtctggccc 51481 ccaactgggg taacaacctc ccacgtcgtc ccctagggaa aactgcgctg cactgggctg 51541 ctgccgtgaa caacgcccga gccgcccgct cgcttctcca ggccggagcc gataaagatg 51601 cccaggacaa cagggttaga tgggacagag ggcttcccac aaaacagtca ggcgcacgag 51661 agatggaaag tgcggtaacc cgcaaagcct gaagggatag gggccagtgg tcgcgcaagt 51721 gaaggcagaa aggcccagtc ctgtgggcgt ggccttccct gatatcggcc ctggctcttc 51781 tgtacaggag cagacgccgc tattcctggc ggcgcgggaa ggagcggtgg aagtagccca 51841 gctactgctg gggctggggg cagcccgaga gctgcgggac caggctgggc tagcgccggc 51901 ggacgtcgct caccaacgta accactggga tctgctgacg ctgctggaag gggctgggcc 51961 accagaggcc cgtcacaaag ccacgccggg ccgcgaggct gggcccttcc cgcgcgcacg 52021 gacggtgtca gtaagcgtgc ccccgcatgg gggcggggct ctgccgcgct gccggacgct 52081 gtcagccgga gcaggccctc gtgggggcgg agcttgtctg caggctcgga cttggtccgt 52141 agacttggct gcgcgggggg gcggggccta ttctcattgc cggagcctct cgggagtagg 52201 agcaggagga ggcccgaccc ctcgcggccg taggttttct gcaggcatgc gcgggcctcg 52261 gcccaaccct gcgataatgc gaggaagata cggagtggct gccgggcgcg gaggcagggt 52321 ctcaacggat gactggccct gtgattgggt ggccctggga gcttgcggtt ctgcctccaa 52381 cattccgatc ccgcctcctt gccttactcc gtccccggag cggggatcac ctcaacttga 52441 ctgtggtccc ccagccctcc aagaaatgcc cataaaccaa ggaggagagg gtaaaaaata 52501 gaagaataca tggtagggag gaattccaaa aatgattacc cattaaaagg caggctggaa 52561 ggccttcctg gttttaagat ggatccccca aaatgaaggg ttgtgagttt agtttctctc 52621 ctaaaatgaa tgtatgccca ccagagcaga catcttccac gtggagaagc tgcagctctg 52681 gaaagagggt ttaagatgct aggatgaggc aggcccagtc ctcctccaga aaataagaca 52741 ggccacagga gggcagagtg gagtggaaat acccctaagt tggaaccaag aattgcaggc 52801 atatgggatg taagatgttc tttcctatat atggtttcca aagggtgccc ctatgatcca 52861 ttgtccccac tgcccacaaa tggctgacaa atatttattg ggcacctact atgtgccagg 52921 cactgtgtag gtgctgaaaa gtggccaagg gccacccccg ctgatgactc cttgcattcc 52981 ctcccctcac aacaaagaac tccactgtgg ggatgaagcg cttcttctag ccactgctat 53041 cgctatttaa gaaccctaaa tctgtcaccc ataataaagc tgatttgaag tgttaccttt 53101 ttttggagga attggggaga agaatgggaa aaaagatggg agtgactgca taatgtcagc 53161 attttgtgct tttggctcag catttggatt ggatggagga tgtaagtata gtttaaaagc 53221 aagaataagt atatttaggg gccctatgat aatttagggt attatctgaa agcaagaatc 53281 tagtagccaa gggagaaacc gcacacacta ggtcaggggt ccccaaccct tgggccacag 53341 actggtactg gtccatggcc tcttaggaac tgggccacac agcaggaggt gagcaagcat 53401 tactgcccaa gctccacctc ctgtcagatc agcataagca ttaaattctc ataggaactc 53461 gaaccctatt gtgaactgtg catgcaaggg atctaagttt ctcgcttctt acgggaatct 53521 aatgcctaat gatctggggt ggaacagttt catcctgaaa ccagccctcc gtcgccaccg 53581 accatggaat aattgtcttc cacgaaactc ttccctggtg ccaaaaaggt aggagaccac 53641 tgcactagat gatgcacaca ctttgtccct catcctaggg cttttactta tggccactta 53701 ggagattcct aaggccacaa gtcaagtaga tggagagagt atcttgaaac tttgtccacc 53761 ttgcagcaat atgttgctag gtttgaaaca tggagtcatg aggcattttg aaagccaata 53821 atatctacag tttattaagt attcactatg catcaagtgc tttattacat tattaataca 53881 tcaaccctat gaagtaggtg ctattaaaac cctatttcac tcagaaattg aggcacagag 53941 atctgcccaa gattgcaagg aataagcggc aggaccagat ctcttcatca catttcacat 54001 tccaaatcac tcagctataa actccctaac atgacaggtt gccatttaga ggtaccaaat 54061 ggttgtctgc cctgctcctt ccttgatgcc aaccagcctg attagcattg atcaaagacc 54121 aagccagaga ggtagtcctc tcccttttca atttcatttc atttctttct ttttctgaaa 54181 cagggtattg ctctgttgcc cagcctggag tgcagtggca caaatggctc actgcagcca 54241 caacctcctg ggctcaagca atcttcctac ctcagcctct catttttcca tatctatatc 54301 tcttatgccc aaaataaact ttcccctgcc ccttgtctgc actaaactgt aagtttccaa 54361 aatgaaccct tcccgtactc tatttggtac acatcttgtc tcctgaatag agtgtatttt 54421 ttattttatt ttattttgga gacggagtct cgctctgtca cctaggctgg agcgcagtgg 54481 cacaatctca gttcaccgca acctccgcct cccgggttca agcaattctc ctgcctcaac 54541 ctcctgagta gctgggatta caggcgcatg tggccacgcc cagctaattt tttgtatttt 54601 agtaaagatg gggtttcacc atgttcccca ggctggtctc caactcctga gctcaggcaa 54661 tccacccgcc tcagcctccc aaagtgctag gattacaggt gtgagccacc gcagccggcc 54721 atctcctgaa tagattttaa atacctagag gtcagggatg attattcaat acatatatat 54781 tgaatactta ctatgtgttg gacccggtgc tagggtttta tgtatatatt tgagagctcc 54841 acatccctgg atctgaatcc tccacttccc actggaacca tgcccctccc agtcccggta 54901 agtaagaggg aagatcggga gggccaaatc ctacaccagg gtctatctta gggagggaag 54961 ggacctggct ggggggaggg ggattctgag gagtgaaacc acttcctgtg tagctagttc 55021 ctgtgttgac agaaagagca gaagaggagg tggggtggag ggagcagagc cagggattag 55081 gggactactg aggctctgga gatgagaccg ccaggagtcc ttccccacca tgagccccct 55141 ccactcctgc agctggagga gtttttccca gtctcagtgc tgccctgggg cgagagagac 55201 tgaacaagct gtttgggtgg gaagagaatg gaggaagttg acagggatgg gcggggcccg 55261 tgggggggct gaccaggaac ccagcttcct gctcagtacc caggcatcca gcccccagct 55321 cacccccacc ccttccagcc cccactcccc tcaggaaccc aaggttccag ccctcctccc 55381 aaatcccagc cacccctccc ccaccagttt ctcccctcta ggggatggag gctgagagac 55441 cccaggaaga agaggatggt gagcaggtga gctgggcacg gggttgggga ggctgacact 55501 gggaaagaag ggaggtgaga ggacctgggg cagaaatgta gggacacagg ggccttgaaa 55561 ggcttgggca aactgaggca ggaacagaga cacacagaga ggaaacgggc cactggccta 55621 gccccctgtc cactcctccc gcttcaacca ccactgcttg actagaatgg acatattttg 55681 gcatcagggc ccccctcagg atgaggaagg ctggccccct ccaaactcca ccactcggcc 55741 ttggcgatct gctcctccat cccctcctcc tccagggacc cgccacacag gtacccctac 55801 ccacccaggg agagccccga ccctagtgcc cacatcctga ccccattacc aaggcccact 55861 ccattgtggg ccctctcccc acctcctcca gactcccctt gggattcccc attgcacccc 55921 ctctcctctg atccaaagtc cctaatcacg tcaccctgtc cacactcccc cacggctcct 55981 gtctgccaac ctctctgggt ctctgagccc tccacacccc tctccccagc cctgggaccc 56041 cgctcggcct ccctgctctc cctgcagact gaactccttc tggacctggt ggctgaagcc 56101 cagtcccgcc gcctggagga gcagagggcc accttctaca ccccccaaaa cccctcaagc 56161 ctagcccctg ccccactccg tcctctcgag gacagagaac agctttacag cactatcctc 56221 agtcaccagg taagacatcc ccccaggagg caaacccagg cctcctggtc tcttggcccc 56281 tgttctcttt ggggctctac tcctgtttct ccctaggcac cccatcgcct tcacaggttt 56341 cctatatgcc tccccatacc aacccttgat cctctcaaga acctcctcct ctcagaccct 56401 caccaaagct ctccctctcc ctccactcct ccagtgccag cggatggaag cccagcggtc 56461 agagcctccc ctccctccag gggggcaaga gctcctggag ttgctgctga gagttcaggg 56521 tgggggtcga atggaggagc aaaggtcccg gccccccaca cacacctgct gagacttgag 56581 ccccaaccag cccttccttg ccactggtct caaagctggg cagcccattg catgccctca 56641 actcttgctt ggcaggggta ccagagactg aaagacacgg cacaaatctc aatattcatc 56701 tcccacatca ccttccctgg gaactggaca gggtgaaagt cctcaaactc tgggaacagg 56761 cgagatggaa cagggattta actccccgcc cacaggtcca tgggagcttg aggcagtaag 56821 ggggatc // LOCUS HSMHC3W36A 100267 bp DNA PRI 15-FEB-1997 DEFINITION Human HLA class III region containing cAMP response element binding protein-related protein (CREB-RP) and tenascin X (tenascin-X) genes, complete cds, complete sequence. ACCESSION U89337 NID g1841544 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 100267) AUTHORS Rowen,L., Dankers,C., Baskin,D., Faust,J., Loretz,C., Ahearn,M.E., Banta,A., Schwartzell,S., Smith,T.M., Spies,T. and Hood,L. TITLE Sequence determination of 300 kilobases of the human class III MHC locus JOURNAL Unpublished REFERENCE 2 (bases 1 to 100267) AUTHORS Rowen,L. TITLE Direct Submission JOURNAL Submitted (12-FEB-1997) Department of Molecular Biotechnology, Box 357730 University of Washington, Seattle, WA 98195, USA COMMENT Cosmids W36A, W6A, W35A and T27B were obtained from Thomas Spies (Spies et al, Nature (1990) 348: 744-747). This contig overlaps U89336 by 363 bp. Cosmid W36A spans bases 1-375, cosmid W6A spans 376-38616, cosmid W35A spans 35806-69816 and cosmid T27B spans 66264-100267. There were no sequence variations where sequences overlap. Sequencing methodology: high redundancy shotgun. Interspersed Repeats were identified with RepeatMasker (available from http://ftp.genome.washington.edu/RM/RepeatMasker.html) Microsatellites (n > 8 repeat units) were identified with sputnik (available from http://serac.mbt.washington.edu/ chrisa/software/sputnik.html). FEATURES Location/Qualifiers source 1..100267 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6p21" source 1..375 /organism="Homo sapiens" /chromosome="6" /map="6p21" /clone="cosmid W36A" repeat_region 302..600 /note="SINE" /rpt_family="AluSc" source 376..38616 /organism="Homo sapiens" /chromosome="6" /map="6p21" /clone="cosmid W6A" repeat_region 602..752 /note="LTR/other" /rpt_family="PABL_B" repeat_region complement(776..853) /note="DNA/MER1_type" /rpt_family="MER20" repeat_region 886..1054 /note="DNA/MER1_type" /rpt_family="MER20" repeat_region 1082..1384 /note="SINE" /rpt_family="AluSq" repeat_region 1385..1470 /note="LINE/L1" /rpt_family="L1PA13" repeat_region 1474..1757 /note="SINE" /rpt_family="AluSx" repeat_region 1762..2060 /note="SINE" /rpt_family="AluSx" repeat_region 2062..2181 /note="LINE/L1" /rpt_family="L1PB3" repeat_region 2226..2518 /note="SINE" /rpt_family="AluJb" repeat_region complement(2522..2651) /note="LINE/L2" /rpt_family="MIR2" repeat_region complement(2634..3120) /note="LINE/L2" /rpt_family="MIR2" repeat_region 3124..3415 /note="SINE" /rpt_family="AluJo" repeat_region complement(3469..3671) /note="LINE/L2" /rpt_family="MIR2" repeat_region 3672..3974 /note="SINE" /rpt_family="AluSg" repeat_region 4028..4084 /note="SINE" /rpt_family="MIR" repeat_region complement(4634..4931) /note="SINE" /rpt_family="AluSc" repeat_region complement(5335..5543) /note="SINE" /rpt_family="AluJo" repeat_region complement(5864..6162) /note="SINE" /rpt_family="AluSq" repeat_region complement(6811..7075) /note="SINE" /rpt_family="AluSq" repeat_region complement(7320..7553) /note="SINE" /rpt_family="AluSg" repeat_region complement(7554..7749) /note="SINE" /rpt_family="AluSc" repeat_region 7751..8047 /note="SINE" /rpt_family="AluSx" repeat_region 8449..8581 /note="SINE/Alu" /rpt_family="FLAM_C" repeat_region 8603..8738 /note="DNA/MER1_type" /rpt_family="MER5A" repeat_region complement(9042..9293) /note="SINE" /rpt_family="AluSx" repeat_region complement(9387..9651) /note="SINE" /rpt_family="AluSc" repeat_region complement(9656..9897) /note="SINE" /rpt_family="AluSq" repeat_region complement(9913..10130) /note="SINE" /rpt_family="AluSx" exon 12385..12512 /gene="CREB-RP" /number=1 gene 12385..25368 /gene="CREB-RP" CDS join(12431..12512,12888..12967,13118..13196,13444..13535, 14386..14521,19289..19374,19513..19648,19735..19866, 20680..20813,21496..21681,21771..21862,22599..22778, 22943..23050,23233..23314,23530..23600,23822..23933, 24073..24157,24669..24898) /gene="CREB-RP" /note="similar to CREB-RP encoded by GenBank Accession Number U31903 and to G13 product encoded by GenBank Accession Numbers X98053 and X98054" /codon_start=1 /product="cAMP response element binding protein-related protein" /db_xref="PID:g1841545" /translation="MAELMLLSEIADPTRFFTDNLLSPEDWDSTLYSGLDEVAEEQTQ LFRCPEQDVPFDGSSLDVGMDVSPSEPPWELLPIFPDLQVKSEPSSPCSSSSLSSESS RLSTEPSSEALGVGEVLHVKTESLAPPLCLLGDDPTSSFETVQINVIPTSDDSSDVQT KIEPVSPCSSVNSEASLLSADSSSQAFIGEEVLEVKTESLSPSGCLLWDVPAPSLGAV QISMGPSLDGSSGKALPTRKPPLQPKPVVLTTVPMPSRAVPPSTTVLLQSLVQPPPVS PVVLIQGAIRVQPEGPAPSLPRPERKSIVPAPMPGNSCPPEVDAKLLKRQQRMIKNRE SACQSRRKKKEYLQGLEARLQAVLADNQQLRRENAALRRRLEALLAENSELKLGSGNR KVVCIMVFLLFIAFNFGPVSISEPPSAPISPRMNKGEPQPRRHLLGFSEQEPVQGVEP LQGSSQGPKEPQPSPTDQPSFSNLTAFPGGAKELLLRDLDQLFLSSDCRHFNRTESLR LADELSGWVQRHQRGRRKIPQRAQERQKSQPRKKSPPVKAVPIQPPGPPERDSVGQLQ LYRHPDRSQPAFLDAIDRREDTFYVVSFRRDHLLLPAISHNKTSRPKMSLVMPAMAPN ETLSGRGAPGDYEEMMQIECEVMDTRVIHIKTSTVPPSLRKQPSPTPGNATGGPLPVS AASQAHQASHQPLYLNHP" exon 12888..12967 /gene="CREB-RP" /number=2 exon 13118..13196 /gene="CREB-RP" /number=3 exon 13444..13535 /gene="CREB-RP" /number=4 repeat_region complement(13759..14058) /note="SINE" /rpt_family="AluJb" exon 14386..14521 /gene="CREB-RP" /number=5 repeat_region complement(14864..15158) /note="SINE" /rpt_family="AluSg" repeat_region 15331..15404 /note="LINE/L2" /rpt_family="MIR2" repeat_region 15511..15787 /note="SINE" /rpt_family="AluYa5" repeat_region 16009..16418 /note="LINE/L2" /rpt_family="MIR2" repeat_region complement(16419..16713) /note="SINE" /rpt_family="AluSp" repeat_region 16951..17195 /note="SINE" /rpt_family="AluY" repeat_region complement(17390..17699) /note="SINE" /rpt_family="AluSx" repeat_region complement(17772..18070) /note="SINE" /rpt_family="AluY" repeat_region 18408..18563 /note="LINE/L2" /rpt_family="MIR2" repeat_region complement(18920..19216) /note="SINE" /rpt_family="AluSp" exon 19513..19648 /gene="CREB-RP" /number=7 exon 19735..19866 /gene="CREB-RP" /number=8 repeat_region 19887..20188 /note="SINE" /rpt_family="AluSx" repeat_region 20240..20541 /note="SINE" /rpt_family="AluY" exon 20680..20813 /gene="CREB-RP" /number=9 exon 21496..21681 /gene="CREB-RP" /number=10 exon 21771..21862 /gene="CREB-RP" /number=11 repeat_region complement(22022..22326) /note="SINE" /rpt_family="AluSx" repeat_region complement(22445..22488) /note="SINE" /rpt_family="MIR" exon 22599..22778 /gene="CREB-RP" /number=12 exon 22943..23050 /gene="CREB-RP" /number=13 exon 23233..23314 /gene="CREB-RP" /number=14 exon 23530..23600 /gene="CREB-RP" /number=15 exon 23822..23933 /gene="CREB-RP" /number=16 exon 24073..24157 /gene="CREB-RP" /number=17 exon 24669..25368 /gene="CREB-RP" /number=18 repeat_region complement(25924..26191) /note="SINE" /rpt_family="AluSg" repeat_region 26646..26797 /note="SINE" /rpt_family="MIR" repeat_region complement(27530..27661) /note="SINE/Alu" /rpt_family="FLAM_C" repeat_region 28511..28641 /note="SINE" /rpt_family="MIR" exon 31263..31456 /gene="tenascin-X" /note="similar in sequence to GenBank Accession Number U52696." /number=1 gene join(31263..31456,42432..42842,43189..45027,51285..51413, 51588..51851,52759..52962,53131..53382,54566..54883, 55924..56220,58307..58597,58960..59265,61267..61599, 66692..66997,68361..68654,70230..70517,70783..71091, 71508..71822,71937..72254,72690..72983,75543..75839, 78206..78502,78945..79262,82269..82601,83756..84073, 84491..84808,86981..87271,87672..87989,90338..90661, 91073..91390,92010..92297,92648..92926,94204..94485, 95334..95654,95944..96279,96531..96653,96768..96911, 97104..97223,97342..97485,97579..97709,97829..97961, 98054..98205,98298..98394,98487..98648,98726..98889, 99210..99504) /gene="tenascin-X" repeat_region complement(34428..34838) /note="LINE/L1" /rpt_family="L1ME3" repeat_region complement(34796..34906) /note="LINE/L1" /rpt_family="L1ME2" repeat_region 35074..35556 /note="LINE/L2" /rpt_family="MIR2" repeat_region 35624..35700 /note="LINE/L2" /rpt_family="MIR2" repeat_region 35701..36003 /note="SINE" /rpt_family="AluJo" source 35806..69816 /organism="Homo sapiens" /chromosome="6" /map="6p21" /clone="cosmid W35A" repeat_region 36158..36400 /note="LINE/L2" /rpt_family="MIR2" repeat_region 36453..36494 /note="DNA/MER1_type" /rpt_family="MER5B" repeat_region complement(36497..36595) /note="DNA/MER1_type" /rpt_family="MER3" repeat_region 36607..36908 /note="SINE" /rpt_family="AluSx" repeat_region 36913..37187 /note="LINE/L2" /rpt_family="MIR2" repeat_region complement(37188..37498) /note="SINE" /rpt_family="AluSx" repeat_region complement(37506..37696) /note="LINE/L1" /rpt_family="L1ME3" repeat_region complement(37701..37997) /note="SINE" /rpt_family="AluSc" repeat_region complement(37998..38042) /note="LINE/L1" /rpt_family="L1ME3" repeat_region complement(38107..38613) /note="LINE/L1" /rpt_family="L1ME3" repeat_region 38616..38909 /note="SINE" /rpt_family="AluJo" repeat_region 38916..39041 /note="SINE" /rpt_family="AluSq" repeat_region 39049..39340 /note="SINE" /rpt_family="AluY" repeat_region 39343..39415 /note="SINE" /rpt_family="AluSp" repeat_region complement(39540..39836) /note="SINE" /rpt_family="AluSg" repeat_region complement(39891..40191) /note="SINE" /rpt_family="AluJo" repeat_region 40233..40533 /note="SINE" /rpt_family="AluSx" repeat_region 40976..41108 /note="SINE/Alu" /rpt_family="FLAM_A" repeat_region 41113..41402 /note="SINE" /rpt_family="AluSx" repeat_region 41764..41858 /note="LINE/L2" /rpt_family="MIR2" repeat_region 41860..42160 /note="SINE" /rpt_family="AluSp" exon 42432..42842 /gene="tenascin-X" /note="similar to sequences with GenBank Accession Numbers U52696, U52699, U52701, and X71923" /number=2 CDS join(42440..42842,43189..45027,51285..51413,51588..51851, 52759..52962,53131..53382,54566..54883,55924..56220, 58307..58597,58960..59265,61267..61599,66692..66997, 68361..68654,70230..70517,70783..71091,71508..71822, 71937..72254,72690..72983,75543..75839,78206..78502, 78945..79262,82269..82601,83756..84073,84491..84808, 86981..87271,87672..87989,90338..90661,91073..91390, 92010..92297,92648..92926,94204..94485,95334..95654, 95944..96279,96531..96653,96768..96911,97104..97223, 97342..97485,97579..97709,97829..97961,98054..98205, 98298..98394,98487..98648,98726..98889,99210..99311) /gene="tenascin-X" /note="Intron-exon boundaries were defined by a) alignment to mRNAs, b) grail, c) dot matrix alignment of repeating units d) low-level blastn hits to other species and other tenascin genes or e) blastx hits to tenascins or fibronectin; some exons could have been missed, and the boundaries of some could be wrong; a full-length transcript will resolve the issue" /codon_start=1 /product="tenascin X" /db_xref="PID:g1841546" /translation="MMPAQYALTSSLVLLVLLSTARAGPFSSRSNVTLPAPRPPPQPG GHTVGAGVGSPSSQLYEHTVEGGEKQVVFTHRINLPPSTGCGCPPGTEPPVLASEVQA LRVRLEILEELVKGLKEQCTGGCCPASAQAGTGQTDVRTLCSLHGVFDLSRCTCSCEP GWGGPTCSDPTDAEIPPSSPPSASGSCPDDCNDQGRCVRGRCVCFPGYTGPSCGWPSC PGDCQGRGRCVQGVCVCRAGFSGPDCSQRSCPRGCSQRGRCEGGRCVCDPGYTGDDCG MRSCPRGCSQRGRCENGRCVCNPGYTGEDCGVRSCPRGCSQRGRCKDGRCVCDPGYTG EDCGTRSCPWDCGEGGRCVDGRCVCWPGYTGEDCSTRTCPRDCRGRGRCEDGECICDT GYSGDDCGVRSCPGDCNQRGRCEDGRCVCWPGYTGTDCGSRACPRDCRGRGRCENGVC VCNAGYSGEDCGVRSCPGDCRGRGRCESGRCMCWPGYTGRDCGTRACPGDCRGRGRCV DGRCVCNPGFTGEDCGSRRCPGDCRGHGLCEDGVCVCDAGYSGEDCSTRSCPGGCRGR GQCLDGRCVCEDGYSGEDCGVRQCPNDCSQHGVCQDGVCICWEGYVSEDCSIRTCPSN CHGRGRCEEGRCLCDPGYTGPTCATRMCPADCRGRGRCVQGVCLCHVGYGGEDCGQEE PPASACPGGCGPRELCRAGQCVCVEGFRGPDCAIQTCPGDCRGRGECHDGSCVCKDGY AGEDCGEARVPSSASAYDQRGLAPGQEYQVTVRALRGTSWGLPASKTITTMIDGPQDL RVVAVTPTTLELGWLRPQAEVDRFVVSYVSAGNQRVRLEVPPEADGTLLTDLMPGVEY VVTVTAERGRAVSYPASVRANTEEREEESPPRPSLSQPPRRPWGNLTAELSRFRGTVQ DLERHLRAHGYPLRANQTYTSVARHIHEYLQRQVLGSSADGALLVSLDGLRGQFERVV LRWRPQPPAEGPGGELTVPGTTRTVSLPDLRPGTTYHVEVHGVRAGQTSKSYAFITTT GPSTTQGAQAPLLQQRPQELGELRVLGRDETGRLRVVWTAQPDTFAYFQLRMRVPEGP GAHEEVLPGDVRQALVPPPPPGTPYELSLHGVPPGGKPSDPIIYQGIMDKDEEKPGKS SGPPRLGELTVTDRTSDSLLLRWTVPEGEFDSFVIQYKDRDGQPQVVPVEGPQRSAVI TSLDPGRKYKFVLYGFVGKKRHGPLVAEAKILPQSDPSPGTPPHLGNLWVTDPTPDSL HLSWTVPEGQFDTFMVQYRDRDGRPQVVPVEGPERSFVVSSLDPDHKYRFTLFGIANK KRYGPLTADGTTAPERKEEPPRPEFLEQPLLGELTVTGVTPDSLRLSWTVAQGPFDSF MVQYKDAQGQPQAVPVAGDENEVTVPGLDPDRKYKMNLYGLRGRQRVGPESVVAKTAP QEDVDETPSPTELGTEAPESPEEPLLGELTVTGSSPDSLSLFWTVPQGSFDSFTVQYK DRDGRPRAVRVGGKESEVTVGGLEPGHKYKMHLYGLHEGQRVGPVSAVGVTAPQQEET PPATESPLEPRLGELTVTDVTPNSVGLSWTVPEGQFDSFIVQYKDKDGQPQVVPVAAD QREVTVYNLEPERKYKMNMYGLHDGQRMGPLSVVIVTAPATEASKPPLEPRLGELTVT DITPDSVGLSWTVPEGEFDSFVVQYKDRDGQPQVVPVAADQREVTIPDLEPSRKYKFL LFGIQDGKRRSPVSVEAKTVARGDASPGAPPRLGELWVTDPTPDSLRLSWTVPEGQFD SFVVQFKDKDGPQVVPVEGHERSVTVTPLDAGRKYRFLLYGLLGKKRHGPLTADGTTE ARSAMDDTGTKRPPKPRLGEELQVTTVTQNSVGLSWTVPEGQFDSFVVQYKDRDGQPQ VVPVEGSLREVSVPGLDPAHRYKLLLYGLHHGKRVGPISAVAITAGREETETETTAPT PPAPEPHLGELTVEEATSHTLHLSWMVTEGEFDSFEIQYTDRDGQLQMVRIGGDRNDI TLSGLESDHRYLVTLYGFSDGKHVGPVHVEALTVPEEEKPSEPPTATPEPPIKPRLGE LTVTDATPDSLSLSWTVPEGQFDHFLVQYRNGDGQPKAVRVPGHEEGVTISGLEPDHK YKMNLYGFHGGQRMGPVSVVGVTEPSMEAPEPAEEPLLGELTVTGSSPDSLSLSWTVP QGRFDSFTVQYKDRDGRPQVVRVGGEESEVTVGGLEPGRKYKMHLYGLHEGRRVGPVS AVGVTAPEEESPDAPLAKLRLGQMTVRDITSDSLSLSWTVPEGQFDHFLVQFKNGDGQ PKAVRVPGHEDGVTISGLEPDHKYKMNLYGFHGGQRVGPVSAVGLTASTEPPTPEPPI KPRLEELTVTDATPDSLSLSWTVPEGQFDHFLVQYKNGDGQPKATRVPGHEDRVTISG LEPDNKYKMNLYGFHGGQRVGPVSAIGVTEEETPSPTEPSMEAPEPPEEPLLGELTVT GSSPDSLSLSWTVPQGRFDSFTVQYKDRDGRPQVVRVGGEESEVTVGGLEPGRKYKMH LYGLHEGRRVGPVSTVGVTAPQEDVDETPSPTEPGTEAPEPPEEPLLGELTVTGSSPD SLSLSWTVPQGRFDSFTVQYKDRDGRPQAVRVGGQESKVTVRGLEPGRKYKMHLYGLH EGRRLGPVSAVGVTEDEAETTQAVPTMTPEPPIKPRLGELTMTDATPDSLSLSWTVPE GQFDHFLVQYRNGDGQPKAVRVPGHEDGVTISGLEPDHKYKMNLYGFHGGQRVGPISV IGVTEEETPSPTELSTEAREPPEEPLLGELTVTGSSPDSLSLSWTIPQGHFDSFTVQY KDRDGRPQVMRVRGEESEVTVGGLEPGRKYKMHLYGLHEGRRVGPVSTVGVTVPTTTP EPPNKPRLGELTVTDATPDSLSLSWMVPEGQFDHFLVQYRNGDGQPKVVRVPGHEDGV TISGLEPDHKYKMNLYGFHGGQRVGPISVIGVTEEETPAPTEPSTEAPEPPEEPLLGE LTVTGSSPDSLSLSWTIPQGRFDSFTVQYKDRDGRPQVVRVRGEESEVTVGGLEPGCK YKMHLYGLHEGQRVGPVSAVGVTAPKDEAETTQAVPTMTPEPPIKPRLGELTVTDATP DSLSLSWMVPEGQFDHFLVQYRNGDGQPKAVRVPGHEDGVTISGLEPDHKYKMNLYGF HGGQRVGPVSAIGVTEEETPSPTEPSTEAPEAPEEPLLGELTVTGSSPDSLSLSWTVP QGRFDSFTVQYKDRDGQPQVVRVRGEESEVTVGGLEPGRKYKMHLYGLHEGQRVGPVS TVGITAPLPTPLPVEPRLGELAVAAVTSDSVGLSWTVAQGPFDSFLVQYRDAQGQPQA VPVSGDLRAVAVSGLDPARKYKFLLFGLQNGKRHGPVPVEARTAPDTKPSPRLGELTV TDATPDSVGLSWTVPEGEFDSFVVQYKDKDGRLQVVPVAANQREVTVQGLEPSRKYRF LLYGLSGRKRLGPISADSTTAPLEKELPPHLGELTVAEETSSSLRLSWTVAQGPFDSF VVQYRDTDGQPRAVPVAADQRTVTVEDLEPGKKYKFLLYGLLGGKRLGPVSALGMTAP EEDTPAPELAPEAPEPPEEPRLGVLTVTDTTPDSMRLSWSVAQGPFDSFVVQYEDTNG QPQALLVDGDQSKILISGLEPSTPYRFLLYGLHEGKRLGPLSAEGTTGLAPAGQTSEE SRPRLSQLSVTDVTTSSLRLNWEAPPGAFDSFLLRFGVPSPSTLEPHPRPLLQRELMV PGTRHSAVLRDLRSGTLYSLTLYGLRGPHKADSIQGTARTLSPVLESPRDLQFSEIRE TSAKVNWMPPPSRADSFKVSYQLADGGEPQSVQVDGQARTQKLQGLIPGARYEVTVVS VRGFEESEPLTGFLTTVPDGPTQLRALNLTEGFAVLHWKPPQNPVDTYDVQVTAPGAP PLQAETPGSAVDYPLHDLVLHTNYTATVRGLRGPNLTSPASITFTTGLEAPRDLEAKE VTPRTALLTWTEPPVRPAGYLLSFHTPGGQNQEILLPGGITSHQLLGLFPSTSYNARL QAMWGQSLLPPVSTSFTTGGLRIPFPRDCGEEMQNGAGASRTSTIFLNGNRERPLNVF CDMETDGGGWLVFQRRMDGQTDFWRDWEDYAHGFGNISGEFWLGNEALHSLTQAGDYS MRVDLRAGDEAVFAQYDSFHVDSAAEYYRLHLEGYHGTAGDSMSYHSGSVFSARDRDP NSLLISCAVSYRGAWWYRNCHYANLNGLYGSTVDHQGVSWYHWKGFEFSVPFTEMKLR PRNFRSPAGGG" exon 43189..45027 /gene="tenascin-X" /note="similar in sequence to GenBank Accession Number X71923; defined also by reference to blastn and blastx hits to tenascin." /number=3 repeat_region complement(46522..46829) /note="SINE" /rpt_family="AluJb" repeat_region complement(47926..48123) /note="SINE" /rpt_family="AluSq" repeat_region 48338..48525 /note="DNA/MER1_type" /rpt_family="MER5A" repeat_region 48863..49155 /note="SINE" /rpt_family="AluSp" repeat_region 49227..49526 /note="SINE" /rpt_family="AluSp" repeat_region 50899..51119 /note="SINE" /rpt_family="MIR" exon 51285..51413 /gene="tenascin-X" /note="defined by blastx." /number=4 exon 51588..51851 /gene="tenascin-X" /note="defined by Grail, blastx" /number=5 exon 52759..52962 /gene="tenascin-X" /note="defined by blastn and blastx" /number=6 exon 53131..53382 /gene="tenascin-X" /note="defined by Grail, blastx" /number=7 exon 54566..54883 /gene="tenascin-X" /note="defined by Grail, blastx" /number=8 exon 55924..56220 /gene="tenascin-X" /note="repeating unit defined by dot matrix" /number=9 repeat_region 56682..56984 /note="SINE" /rpt_family="AluJo" repeat_region 57266..57796 /note="LINE/L2" /rpt_family="MIR2" exon 58307..58597 /gene="tenascin-X" /note="repeating unit defined by dot matrix; exon boundaries could be 58331-58597" /number=10 exon 58960..59265 /gene="tenascin-X" /note="repeating unit defined by dot matrix" /number=11 repeat_region 59923..60122 /note="SINE" /rpt_family="MIR" repeat_region 60124..60424 /note="SINE" /rpt_family="AluY" exon 61267..61599 /gene="tenascin-X" /note="repeating unit defined by dot matrix" /number=12 repeat_region 61751..61899 /note="SINE" /rpt_family="MIR" misc_feature 62005..62096 /note="similar to 5S rRNA" repeat_region complement(62165..62467) /note="SINE" /rpt_family="AluSx" repeat_region complement(62474..62598) /note="microsatellite" /rpt_type=tandem /rpt_unit=tttc repeat_region complement(62609..62899) /note="SINE" /rpt_family="AluSx" repeat_region complement(63645..63944) /note="SINE" /rpt_family="AluSx" repeat_region 63945..64055 /note="SINE" /rpt_family="MIR" repeat_region 64905..65657 /note="LINE/L2" /rpt_family="MIR2" repeat_region 65665..65957 /note="SINE" /rpt_family="AluSp" repeat_region 65969..66152 /note="LINE/L2" /rpt_family="MIR2" source 66264..100267 /organism="Homo sapiens" /chromosome="6" /map="6p21" /clone="cosmid T27B" repeat_region 66322..66618 /note="SINE" /rpt_family="AluSx" exon 66692..66997 /gene="tenascin-X" /note="repeating unit defined by dot matrix" /number=13 repeat_region complement(67677..67956) /note="SINE" /rpt_family="AluSq" exon 68361..68654 /gene="tenascin-X" /note="repeating unit defined by dot matrix" /number=14 repeat_region 68779..69064 /note="SINE" /rpt_family="AluSg" exon 70230..70517 /gene="tenascin-X" /note="repeating unit defined by dot matrix" /number=15 exon 70783..71091 /gene="tenascin-X" /note="repeating unit defined by dot matrix" /number=16 repeat_region complement(71187..71411) /note="LINE/L1" /rpt_family="L1ME1" exon 71508..71822 /gene="tenascin-X" /note="repeating unit defined by dot matrix, weak similarity to others." /number=17 exon 71937..72254 /gene="tenascin-X" /note="repeating unit defined by dot matrix" /number=18 exon 72690..72983 /gene="tenascin-X" /note="repeating unit defined by dot matrix" /number=19 repeat_region 73457..73583 /note="microsatellite" /rpt_type=tandem /rpt_unit=tttc repeat_region complement(73565..73868) /note="SINE" /rpt_family="AluJb" repeat_region complement(73887..74212) /note="LINE/L1" /rpt_family="L1MB8" repeat_region complement(74259..74540) /note="SINE" /rpt_family="AluJo" repeat_region complement(74550..74606) /note="LINE/L1" /rpt_family="L1MB5" repeat_region 74711..75040 /note="SINE" /rpt_family="AluSx" repeat_region 75214..75399 /note="LINE/L2" /rpt_family="MIR2" exon 75543..75839 /gene="tenascin-X" /note="repeating unit defined by dot matrix" /number=20 repeat_region 76149..76449 /note="SINE" /rpt_family="AluSx" repeat_region 76467..76514 /note="SINE" /rpt_family="MIR" repeat_region complement(77244..77553) /note="SINE" /rpt_family="AluSx" repeat_region complement(77589..77885) /note="SINE" /rpt_family="AluY" exon 78206..78502 /gene="tenascin-X" /note="repeating unit defined by dot matrix" /number=21 exon 78945..79262 /gene="tenascin-X" /note="repeating unit defined by dot matrix; exon boundaries could be 78969-79262" /number=22 repeat_region 79721..80015 /note="SINE" /rpt_family="AluSx" repeat_region complement(80307..80606) /note="SINE" /rpt_family="AluSp" repeat_region complement(81221..81522) /note="SINE" /rpt_family="AluSx" exon 82269..82601 /gene="tenascin-X" /note="repeating unit defined by dot matrix" /number=23 repeat_region 82941..82993 /note="LINE/L2" /rpt_family="MIR2" repeat_region 83180..83277 /note="LINE/L2" /rpt_family="MIR2" exon 83756..84073 /gene="tenascin-X" /note="repeating unit defined by dot matrix; exon boundaries could be 83783-84073" /number=24 exon 84491..84808 /gene="tenascin-X" /note="repeating unit defined by dot matrix" /number=25 repeat_region complement(84947..85041) /note="DNA/MER1_type" /rpt_family="MER5A" repeat_region 86139..86197 /note="LINE/L2" /rpt_family="MIR2" repeat_region 86378..86475 /note="LINE/L2" /rpt_family="MIR2" exon 86981..87271 /gene="tenascin-X" /note="repeating unit defined by dot matrix" /number=26 exon 87672..87989 /gene="tenascin-X" /note="repeating unit defined by dot matrix" /number=27 repeat_region complement(88189..88489) /note="SINE" /rpt_family="AluY" repeat_region complement(88497..88552) /note="DNA/MER1_type" /rpt_family="MER5A" repeat_region complement(88558..88885) /note="LINE/L1" /rpt_family="L1MB3" repeat_region complement(88904..89037) /note="SINE" /rpt_family="AluJo" repeat_region complement(89038..89187) /note="LINE/L1" /rpt_family="L1MB7" repeat_region 89767..89871 /note="LINE/L2" /rpt_family="MIR2" exon 90338..90661 /gene="tenascin-X" /note="repeating unit defined by dot matrix; exon boundaries could be 90371-90661" /number=28 exon 91073..91390 /gene="tenascin-X" /note="repeating units defined by dot matrix" /number=29 repeat_region 91677..91886 /note="SINE" /rpt_family="MIR" exon 92010..92297 /gene="tenascin-X" /note="repeating unit defined by dot matrix" /number=30 exon 92648..92926 /gene="tenascin-X" /note="repeating unit defined by dot matrix" /number=31 repeat_region 93132..93250 /note="SINE" /rpt_family="MIR" exon 94204..94485 /gene="tenascin-X" /note="repeating unit defined by dot matrix; similar in sequence to GenBank Accession Number M25813" /number=32 exon 95334..95654 /gene="tenascin-X" /note="repeating unit defined by dot matrix; similar in sequence to GenBank Accession Number M25813" /number=33 repeat_region complement(95834..95899) /note="SINE" /rpt_family="MIR" exon 95944..96279 /gene="tenascin-X" /note="similar in sequence to GenBank Accession Number M25813" /number=34 exon 96531..96653 /gene="tenascin-X" /note="similar in sequence to GenBank Accession Number M25813" /number=35 exon 96768..96911 /gene="tenascin-X" /note="similar in sequence to GenBank Accession Number M25813" /number=36 exon 97104..97223 /gene="tenascin-X" /note="similar in sequence to GenBank Accession Number M25813" /number=37 exon 97342..97485 /gene="tenascin-X" /note="similar in sequence to GenBank Accession Number M25813" /number=38 exon 97579..97709 /gene="tenascin-X" /note="similar in sequence to GenBank Accession Number M25813" /number=39 exon 97829..97961 /gene="tenascin-X" /note="similar in sequence to GenBank Accession Number M25813" /number=40 exon 98054..98205 /gene="tenascin-X" /note="similar in sequence to GenBank Accession Number M25813" /number=41 exon 98298..98394 /gene="tenascin-X" /note="similar in sequence to GenBank Accession Number M25813" /number=42 exon 98487..98648 /gene="tenascin-X" /note="similar in sequence to GenBank Accession Number M25813" /number=43 exon 98726..98889 /gene="tenascin-X" /note="similar in sequence to GenBank Accession Number M25813" /number=44 exon 99210..99504 /gene="tenascin-X" /note="similar in sequence to GenBank Accession Number M25813" /number=45 BASE COUNT 22715 a 26354 c 27135 g 24063 t ORIGIN 1 gatatctgag acaagtcaag gtaacagagg cagctgtttg aatagattca ctggacaatc 61 taaggcagct ctccgcacca agctgtaaag gagataagat agaaataatc actctggtac 121 cacagtaaac aggccttgaa ggtactgggg ccctcacagc ttaatcagac ttagcaagaa 181 tttttttgcc tctgaccctc tagttgaaac aaaattagtt actgatagac tttggtgaat 241 gccatactgc atgtaggcat ataacctaaa cctgtataaa cactaagaaa atagtaacac 301 tggccgggtg tggtggctca cacctgtaat tctagcactt tgggaggccg aggtgggtgg 361 atcacaaggt caagagatcg agaccatcct ggccaacatg gtgaaaccct gtctctacta 421 aaaatacaaa aattagctgg gcgtggtggc acgcgtctgt agtcccagct actcgggagc 481 ctgaggcagg agaatcattt aaacccagga gtcggaggtt gtggtgagcc aagatcgcac 541 cactacactc tagcctggcg acagagcgag actccgtctc aaaaaaaaga aaaaaagaaa 601 attgtaacac tttgagttgg tctggtggaa ttatctccga ccctttccct gtatccggtg 661 acagcaacaa attcccttct ttcctagttt gtctgcttct cgttattggg ccacgagaaa 721 acacagccaa acctggctta gttccaggaa taagttctca actagagtct aaactagggt 781 ttctcagttt cagggattat tgacattttg ggtcagatag ttctttgtca tgggggtctg 841 acctgtgcat tgtcttgcca agtatctcct gtgggacatc ttgaccagtg gttttcaaga 901 gtgggtgatc ctcccatccc acccccagga acatttggca atgtctagag ttgtttcagt 961 tatcacaact gaggaggtgc tcctagcgtc tagtgagcag cggccaggca tgctgttaaa 1021 ctccttgcaa ctggaggaca caataaataa ttatttcaca gccatagtaa agaatgaaat 1081 tggccggtcg cggtggctca cgcctgttat cccaccactt tgggaggccg aggtgggcag 1141 atcacctgag gtcaggagtt tgagaccagc ctggccaatg tggtgaaacc ccgtctctac 1201 taaaaatatc aaaattagcc aggcatggta gcacgtgcct gtaatcccag ctactcagga 1261 ggctgaggca ggagaatcgc ttgaacctgg gaggcggagg ttgcagtgag ccaagatcat 1321 gccattgcat tccagcctga gcgacaagag aaaaactcca tctcaaaaaa aaaaaaaaaa 1381 aaaaaatgaa atcatgtcct ttgcaacaac gtggatgaag ctggaggcca ttatcctaag 1441 tgaactaact caaacataga aaaccaaata tttggctggg tgtgacagct catgcctgta 1501 atcccagcac tttgggaggc cgaggcaggt ggatcactcg agctcagcag ttcgagacaa 1561 gcctagtctc taccaaaaac acaaaaaatt agctgggcat agtggtatgt gcctgtggtc 1621 ccagctactt gggaatctga agtgggagga tcacttgaac cagggaggca gaggttgcag 1681 tgagctggag tcacaccact gcactccagc ctgggttaca gagtgagacc ccatcacggg 1741 aaaaaaaaaa aaaaaaaaaa agtcgggcac agtggctcac gcctgtaatc ccagcacttt 1801 gggagacaga ggcgggcaga tcacctgagg tcaggagttc gagaccagcc tggccaacac 1861 ggcaaaaccc cgtctctact aaaaacatga aaattagcct gacatggtgg cgtgagcctg 1921 taattgcagc tactcaggag gctgaggcag gagaatcact tgaacctggg gcgaggagga 1981 ggttgcagtg agcagagata gcgccactgc actccagcct gggcaacaga gtgaggctcc 2041 atctcaaaaa aaaaaaaaaa tttacctatt gggtacaatg ttcactattt gggtaacagg 2101 tacaccacta gaagcccaat ccccaccagg atgcaatgta atcatgtaac aaccaaccag 2161 gtgtactgct tgggtttaaa attgtggggg aaaaaaaaga aaagaaagaa gaaaaaaatt 2221 atcctgccag gtgcagtggt tcatatctgt aatcccagca ctttgggagg ctgaggcaga 2281 tggatcactt gagttcagga gtttgagacc agcctggaca acataaggag tccccatctc 2341 tacaaaaaat taaaaaatta tctgggcata atggcacaca cctgtggtcc cagctatttg 2401 ggaggatgag gtaggaggat cacttgaacc tgggaagtcg aggatgcagt gagccatggt 2461 aaccccactg cactccagcc taggcaacag gctgtctcaa aaacaaaaaa gaaagaaagc 2521 attccctagt tccttattta ctgctccttc tcagtcttca ctgacagctc ctattccttt 2581 tccagacctt taaatattag agagcctctg gactctattc ttggccctct cctctaccct 2641 aatccaagct cccatgattc cattgtccct tcctttgtcc cttaattgga ttttctgctt 2701 ctatttttta ctgatgtggc ctaccctcta tgaagaaacc agagtgatct ttaaaattac 2761 cagatgatat catttacctg ctcaaaaccc tctgatgcct cctgatcaca tgtgggacaa 2821 aataaaaagt ccttcccatg gcctatgaag cctacctggc cttgtttctc cccacctctc 2881 tgtcttcatc ccttaccact ctttctcttg ctcactctgc tccactcact ctgacctcct 2941 ttctgaattt aacataaaaa tccaattcct ttctacctca gggcctttgc actctctttt 3001 tcttttcctg gacactctcc ctccagatac ttgcatgact gctcctttat gtcatttata 3061 tctgcacaaa catcacctcc acaaggaggc tttccctcat cacttgatta aatatagtac 3121 catggccagg cacagtggct cttgcttata atctcagcac tttgggaggc agaggcagga 3181 ggattgcttg aggcctggag attgagacca gcccaggcaa catagtgaaa cctcgtctct 3241 aaaaaaaaaa aaaaaaatta gccagacatg gtggcatgaa cctgcagtcc cagctactca 3301 tgaggctgcg gtgggaggat cacttaagcc caggaggtca aggctgcggt gagccatgac 3361 cacactactg cactccagct tgggcaacag agcaagactc tataaataat aataatcatc 3421 atcatcaata aatatagcac catgcatacc tgggagcact ttctatccca ttcccttacc 3481 cctcaccctg cttcatggtg ttcatagcac ttgttagtgc ctgacattgt attaaatttt 3541 attgattttt ttatcgtctg tcatcttcaa ttcaatgtaa gttaccagag atccaaggct 3601 ttttccactg ccttattccc agcatgtagc acagtgcttg gtacaaagga ggccctcagt 3661 aaagatttct tggctgggcg cagtggctca cgcctgtaat cccagcactt tgggaggctg 3721 aggtgggcgg atcgtgaggt cacaaatccg agaccagcct gaccaacatg gtgaaaccct 3781 gtccctacta aaaaaaatac aaaagttagc cgggcgtggg ggtgcacacc tgtagtccca 3841 gctactcagg aggctgaggc aggagaatca cttgaaccca ggaggcgtag gttgcagtga 3901 gccgagatca cgctactgca ctccagcctg ggtgacagag cgagacgtca tctcaaaaca 3961 gcaacaagaa caaaaaccaa ccaaccaaac aaacaaaatt ctttgaatca ctgaatgaat 4021 aagtagttca tagagttgtg tgagagaaaa atagctaata tatacaagac atttagaaca 4081 atgctgagta ttacggtcat aatgcatggc cagcgtttag taaggtgaag ctgttatcat 4141 taaacacttc actgccaaaa aggctgcctc cctcctgctt gaaaaccctg actataaggg 4201 aagcacagca aaatgctttg ggaccaaaaa ggagggagga acgattttga cacaaatccc 4261 agcaaaacta aaagccacag aaaccaccaa agctaaacag aaggggaaat ttatttagct 4321 tttggaaagc tagaattttt gtctctcatg cagataataa atggaagaga ataagtgcag 4381 catatggttc catgtggtga gtatagaacg caggggaatg tgagagtgta aaaacttcca 4441 accagctcaa tgcctggtga cagggcagca caaaggagta ttggaatagc ctgggaaaac 4501 tgagactcac agactggaga ggaataatcc tgggtgttaa gtggctttcc ttacttctcc 4561 taccttgtgc tgcttcttcc acctgcaata aatctcttcc atttcttttt ctttttcttt 4621 ttttctgttt tgttttgttt tgttttgttt tttgagatgg agtcttgcgc tggcgtcatg 4681 ctggagtgta gtggcgccat cttggctcac tgcaacctcc gccccctggg ttcaagcgat 4741 tctcctgcct caacctccca agtagctggg actacaggca catgcgacca cgcccagata 4801 atttttgtat ttttagtaga gacggggttt caccatgtta gccaggatag tctcgatctc 4861 ttgaccttgt gatccacccg cctcagcctc ccaaagtgct gggattacag gcatgagcca 4921 ccgtgcccgg cagtctcttc catttctgca tatcaaattt gcggccctct ttcaagtctc 4981 aactctctga agccatccat gaggtatttt cagatcaacc ccagttgaaa ataactcccc 5041 tgccacttat cactttttcc tttgcatctg gagcattggc taatctgact tttcactctc 5101 attagattac tggtggggat tctccgtatt tgtctccata tttcactggc aaattctggg 5161 tattcaataa gggctactga atgaataaga aaaaaattct acttcatcca tttagagggc 5221 aagcatcata tgctctgtgt gcaggaacaa gatgaaacaa ccactagctg gcaaactacc 5281 tcaagccagc tcctgaacct ctccttgggt tctgcctcag gttaactctt tttttttttt 5341 tttttttttt ttttaaaaga cagggtctca ctctgttgcc caggctgggg tacagtggtg 5401 tgatcctagc tcactgcagt ctcaactcct gggctcaagt aatccttccg ccttagcctc 5461 ctgaatagct aggactacag gtgcaccacc ctgcctagct aatttttatt ttgtagaggc 5521 agggtctcta tgttgcccag gcttcaggtt aggtcttgaa tgaaggcaaa caatgccctt 5581 ctgggataga acaaggagaa taacagagta tggccagttt ctctgagcca tcttagaagg 5641 ggaggaccca gaaaaaagga aggagggaag tgactccagg taagatgtaa atatatttgc 5701 aggtaaatcc ccagaaactt gtcaaatatg gtttccctca gtccaatttt cttcacttaa 5761 aggagtggag aattgttcca aaatggagtt aacattctgc aatcatagca gaattattgc 5821 cctcctccca agttcttgca ttctcctaga tgtctctttg accttttttt ttttttttga 5881 gacggagttt cacactattg cccaggttgg agtgcaatgg tgtgatcctg gctcactgca 5941 acctctgcct cccaggttca agcgattctc ctgcctcagc ctcccaagta gctgggatta 6001 caggcgcccg ctaccacacc tggctaattt ttatatttta ttagtagaga tggggtttta 6061 caatgttgac caggctggtc tcgaactcct gacctcatgt gatctgcccg ccttggcctc 6121 ccaaagtgct gggattacag gcgtgagcca ctgcacctgg ccctctttga ccttcttaac 6181 ccaaagtctc acttttcgag aggtttctgt gtagcaagac ggggtaagta tgtcccattt 6241 gcattcacac gatttctgga ggtattggtg atgttggtcc ctgcaacacc aaaggaagag 6301 agggagcaga ggacaatttg tagtgacaag aagagactgg atccagtccc ttggggaaga 6361 agattccttt tcatctcttc ctctcatgcc tgctcatctt ccatttcatt ctcaaatctt 6421 taggggaaag ggagaggaga gagacattgc tgtttcccca ctttccagtc caagcatcta 6481 tcccaacaga taagcaaatt tcacttgtca tcctctttat gtatctttct caaatgtctt 6541 gttctctagc caggcttaag gatataatct tcctggtttg tggctctctt tcgtcttgat 6601 tccttgatga ctgcctccaa agagctgagc tctggcacaa ttagacttga gggagatagt 6661 gtgagtcagt caacaagcat gtagtatcta cactttgggg gagagcctat actctttctt 6721 atttcttttt ttacaaatta ttattttgag acagaatttc tcaaattcct ttctacctca 6781 ggaaaggaaa ggaaattgga taaggcgact ttgtcgccag gctgaagtgc agtggtgtga 6841 tctcggctca ctgcaacctc tgcctcccgg gttcaagcga ttctcctgcc tcagcctcct 6901 aagtagctga gactacaggc acacacctcc acaccagcct aatttttgta tttttagtaa 6961 agacggggtt tcaccatgtt ggcccgactg gtctcaaact cctgacctcg agtgatccac 7021 ccaccttggc ctcccaaagt gctgggatga caggcatgag ccactgcgcc tggccactct 7081 ttcctatttc tgaatcttca caggttgttg gtcccttcct agtcctgtgt ttctcaagtt 7141 aagactaggg ggaatctaca ggtccatggc tatcttctct ggggaactct tctaaaactt 7201 taagtttgtg cttacctctt gatgggccat atttttaaaa agagaaatat tttcatacat 7261 gatgatttgg attaggtttt ttttaggggt ttcaatgcct atatttgtat ttgtgtttat 7321 ttatttattt attttttgag acagagtctt gctgtgtcgc ccaggctgga gtgcaatggt 7381 gtgatcttgg ctcactgcaa cctccgcctc ctgggttcaa gcgattcttc tgtctcagcc 7441 tcccaagtag ctgggactat aggcgtgtgc caccacgcca ggctaatttt tgaattttta 7501 atagagacgg agtttcacca tattgacctt gtgatccgct cgcctcggcc tccgtgcagt 7561 ggcgcgatct cagctcactg caacctccgc ctcccaggtt caagcgattc tcctgcctca 7621 gtctcctgag tagctgggac tacaggcatg cgccacacca gctaattttt gtatttttag 7681 tagagacggg gtttcaccat gttggccagg ctggtctcaa actcctgacc taaagtgatc 7741 cgcctgcctt ggcgccgtgg ctcacgcctg taatcccaac actttgggaa gccgaggtgg 7801 gcagatcact tctggtcggg agtttgagac cagcctggct gacatggtga aaccccatct 7861 tcactaaaaa tacaaaaatt agccaggcat ggtggcaggt gcctgtcatc cccgctactt 7921 gggaggctga ggcaggataa tcgcttgaac ctgggaggca gaggttgtaa tgagccgaga 7981 tctcgccagt gcactccagc ctgcgtgaca gagtgatact ccgtctcaaa aaaaacaaaa 8041 aacaaaaaaa caaaaacaaa caaacaaaac cagggagaga attgttttga cacaagtctt 8101 tggctaatga tgaaaccgtc tcccttcctg gctgtcctgg catggttttt atagtctgag 8161 gcacaatggt gagagcaaac ttaagaggga gccaaagtga ggtactcaag agagggatcc 8221 ttctttcctg gagaactgtg gtagcccagg ctgtgtaggt agctggacaa gcaagactga 8281 atcacaatag gtctctacat cttctattag gaggagaacc tgcaacatcc agtggagcag 8341 gtcacaccag ttatacatgt aaaatgtagg ataaactaaa ggggcacaga gccaggggag 8401 aagacaatgg gatgagactg ttctagaatc tcatcccatt ggttcattgc tgggtgtggt 8461 ggctcacacc tataatccca gcactttggg aggctgagat gggatgatgg cttgaggcca 8521 ggagtttgag accagcctgg taaacacagc agaccccatc tctctaaaaa aaaaaaaaaa 8581 aaaagaaaaa catacggttt atgaaccagc agcatctgca ttaaccagct tattgaaatg 8641 cagaatcaca ggccccacaa cagacttcct aaatcataat ctgcaactta acaagttccc 8701 taggtgatat gtatgcacac ttatgtttga aaagcactaa gatttcttga tgaaggagga 8761 cttgaaaggc aatgatggat gtgaaaggaa aggtaaagag aagcctcagg tagtcaccca 8821 agggacaggg ccggttggag agagagtccc gaggttttat cctggagaac accctgtact 8881 gaatgagctc tgaacataaa gatagttagc ataggagggc ctgaagtctc cagataaaag 8941 gctgctgcca ctatcattta ccacgacctc tgccattctc cactctattg tcatccgccc 9001 ccagtctcca ttccaggact tctctacact ttgacttttt gtttgtttgt ttgtttgttt 9061 gagacggagt cttgcgctgt cgcccaggct ggagcgcagt ggcacgatct tggctcaccg 9121 caagctccgc cttccgggtt catgccattc tcctgcctca gcctcccggg tagctgggac 9181 tataggtgcc cgccaccacg cccagctaat tttttgtatt tttagtagag acggggtttc 9241 accatgttgt ccaggctggt ctcgaacccc tgacctcaag tgatcccccc gccgccccgc 9301 cccctccccc ccgccccgcc cccccccgcc gcctcggcct cccaaattgc tgggattaca 9361 ggcgtgcgcg atgcccggct ttttatttat ttatttattt atttttgagg cgggaatctt 9421 gctctgtcgc caggctggat tgcagtggca ccatctcggc tcactgcaac ctccgactcc 9481 ctggttcaag cgattctccc acctcagcct cccaagtagc tgggattaca ggcacacgcc 9541 accatgccca gctaactttt tgtattttta gtagagacga gatttcacca tgttgccagg 9601 atggtctcga tcacctgacc tcgtgatccg cccacctcag cctcccagag tctcagttgc 9661 caaagctgga gtgcaatggc gcgatctcgg ctcactgcaa cctccgcttc ccaggtaagc 9721 cattctcctg cctcagcctc ctgggtagct gggatatagg cgcccgccat cacgccgagc 9781 tatttttgca tttttagtag agacggggtt tcaccatgtt ggccaggctg gtcttgaact 9841 cctgacctca acctcccaaa gtgctgggat tacaggcgtg agccaccgcg cccggcccac 9901 cttttctttt tttttttttt ttttttttgt ttgagacgga gtctctagtc tcgctctgtc 9961 gcccaggctg gagtgcaatg gtgtgatctc ggctcactgc aacgtctgtc tcccgggttc 10021 aagcgattct cctgtttcag ccttccgagt agttgggatt acaggcgcgc gccaccatga 10081 cctactaatt tttgtatttt tagtagagac agggtctcac catgttggcc cactttgact 10141 cttgagcagc ctggccagcc cgaccgcgcc aaattctgtt cgattctgcc tagttcggtt 10201 gctctggcct agttcagttg ctaaggcctg gagcttcatg gttgcggagg aaatgatgtc 10261 acgttcaata ggcgggctaa ccagattcct cccttctccc gattggctgc caggaatttg 10321 actagattcg gagtctcgcg ggctccaggg ttagttgtca gtatctttcc cagttgttcc 10381 gccccctacc cccgcctccc gcaccgcgcc cctctccggc tgccctctcc gcgtggggca 10441 aggctccgag ggcagcattc agtagccatt tagctttgga aggagaggtg attcgaatgg 10501 cccggctcct cctgtcacca tgccaggcac tttggccgcg caggtactta ttgacccgac 10561 cgggtgtccg tagttggcgc ggctacctta accgcaggga attgtggaat ttatagttct 10621 aaattatatg tgggtggaac ggggaagctg gagcagattt ttggaggaaa gcaaaactgg 10681 ggactttcag gactaggggc ctgggtctca gaagaatggg aaaggacgag aaaggagtct 10741 aaataagaac cctgctatta gcattgtttg gttttctttt caggtgctga cctgaacctg 10801 gttcatccct ttctgaccaa aactgttcac tcaccgtgga agggactaag catccatatg 10861 gagacgccac cagtcaatac aattggagaa aaggacacct ctcagccgca acaagagtgg 10921 gaaaagaacc ttcgggagaa ccttgattca gttattcaga ttaggcagca gccccgagac 10981 cctcctaccg aaacgcttga gctggaagta agcccagatc cagccagcca aattctagag 11041 catactcaag gagctgaaaa actggttgct gaacttgaag gagactctca taagtctcat 11101 ggatcaacca gtcagatgcc agaggccctt caagcttctg atctctggta ctgccccgat 11161 gggagctttg tcaagaagat cgtaatccgt ggccatggct tggacaaacc caaactaggc 11221 tcctgctgcc gggtactggc tttggggttt cctttcggat cagggccgcc agagggctgg 11281 acagagctaa ctatgggcgt agggccatgg agggaggaaa cttgggggga gctcatagag 11341 aaatgcttgg agtccatgtg tcaaggtgag gaagcagagc ttcagctgcc tgggcactct 11401 ggacctcctg tcaggctcac actggcatcc ttcactcaag gccgagactc ctgggagctg 11461 gagactagcg agaaggaagc cctggccagg gaagaacgtg caaggggcac agaactattt 11521 cgagctggga accctgaagg agctgcccga tgctatggac gggctcttcg gctgctcctg 11581 actttacccc cacctggccc tccagaacga actgtccttc atgccaatct ggctgcctgt 11641 cagttgttgc tagggcagcc tcagttggca gcccagagct gtgaccgggt gttggagcgg 11701 gagcctggcc atttaaaggc cttataccga aggggggttg cccaggctgc ccttgggaac 11761 ctggaaaaag caactgctga cctcaagaag gtgctggcga tagatcccaa aaaccgggca 11821 gcccaggagg aactggggaa ggtggtcatt caggggaaga accaggatgc agggctggct 11881 cagggtctgc gcaagatgtt tggctgatta aaagttaaac cttaaaagag acaggaactt 11941 gtgaattgtg gtctgtgctg tgagatttgg gggtgggggg ccagagggaa ggtttcagcc 12001 catatcccat tctcagcttt tggcttccta ggggaggtag cagtggtcac agtgcgcccc 12061 ttttgagggg cttatctatc tccgaggaca catagaagct gtctaaacta aacttagtag 12121 tggctggggg agatggtgga ccttaatact gggtactaca tctcccaggc gctttatcac 12181 tggagcctgt ttggcgcatg cgcttggcgg tatggcactg tctggaagtt acttcctgct 12241 gctgtgctcc cattgggccc gcctacacag tcatccgcat ctgttgattc acgtgctcag 12301 cacccaggtc tcttactggt cggtagagct tccgggacgc cccctttttt gaaagagtca 12361 actgattagt tggtgatggg gagaggcggg ccttgggaac cgtctcctgg ttggggggtg 12421 ggggggaaag atggcggagc tgatgctgct cagcgagatt gctgacccga cgcgtttctt 12481 caccgacaac ctgcttagcc cggaggactg gggtctgcag agtgaggcac cggggagggg 12541 agagggctgt ggcaaactgt ccctgaggca cgcgggaaag aagcgggggc ctaaagagga 12601 agtctgggca aatacggaat ccgccatctg aaggagagcg gagcaggggg ctcgttgtgc 12661 attaaggggg atacaaaagt cagggaagaa ctctgctccc tgctggaacg tcgagttgtt 12721 ccggggtcgg ggaaggaggg gctgaagcgc cctggttgta tcacttttca agtgggcggg 12781 cggtgggtct ggccgagcgg agccaatgag ggatggggcg ggggcttgtg tgcgatctgt 12841 aggctggggc caccgaacga ccccccgacg ctcagcattt cttgcagaca gcaccttgta 12901 ttctggccta gatgaagtgg ccgaggagca gacgcagctc ttccgttgcc cggagcagga 12961 tgtcccggta ttgtgtcccc agcctttctc taacccttgg tgggtggaaa gtgagcccag 13021 aacctggacc cattcacagg ccataactca tctggagtca cccctacttt ctcatccttt 13081 ctcacactcc tgccctcttg tatcctccct tatttagttt gacggcagct ccctggacgt 13141 ggggatggat gtcagcccct ctgagccccc atgggaactc ctgccgatct tcccaggtaa 13201 ggtagtcttt gtcgttccct cagatttcca gaaaccctca tcttcccact gtttcccaga 13261 aactacgtag cttcctatat cctcacgctt ctacataagt gcctatcctc tctccttcaa 13321 tcccctattt ttgtttcttc tcccttgctg cggtacagga atccccaccg acatgtcctt 13381 gggtactttt tggactcttc ttaagctctg cttcctgata atttctaact ccctttcctc 13441 cagatcttca ggtgaagtct gagccatctt ccccctgctc ttcctcctcc ctcagctccg 13501 agtcatcgcg tctctccaca gagccatcca gcgaggtgag agagccatat ccctcttact 13561 ctctgtgacg gggactctgc ttcattccct tcacccgtgg gtgacaaaga aatgatttaa 13621 atgatttgtg tgctatgagt catgaggtgt gtgaggtgtg gggggccttc tccactacct 13681 ttgccgtttc cttaatatct tcattttcca catccccttt ggttatctag cagtgaagat 13741 ttttttgttt gtttgttttt tgttttttgc ttttttttga gatagggcct ccctctgtca 13801 cccaaactac agtgtagtga cagggtcaca gctcactgta gccttgacct cctgagctca 13861 ggtgatcctc ctgcctcagc ctcctgagta gttgggacca cagacacatg ccaccatgtc 13921 tggctaattt tttttattgt ttgtaaaaat ggggtcttcc tatgttgccc agagtggact 13981 taaactcctg ggctcaagtg atcctcccac cttggcatcc caaagtgctt ggattacagg 14041 tataagccac cacgcctgat cttgggtctt tttaatataa agagttatgt ggggctggga 14101 agaagagagt tattgactgc tgagaattct ggaactgttc ttgcccaagt tgtccactct 14161 gaggccatat ttgaagtcat gagacttcat caattctccc agaaataaag tagaaagaat 14221 gaacaggtta gaaagatggt cagtaccatt cagaccagaa tgtcaaataa ttgatggctt 14281 gtaagcccta aacttctaaa gcttttgttc tcgctcttac tgtcaacatc tcccttgttt 14341 tttgcccttc tttctgcccc ctgccccaca ccaactctca cccaggctct tggggtaggg 14401 gaggtgctcc atgtgaagac agagtccttg gcacccccac tgtgtctcct gggagatgac 14461 ccaacatcct catttgaaac cgtccagatc aatgttatcc ccacctctga tgattcctca 14521 ggtaataaaa cagcctgccc ccaatccttg gctctactac gggagaatgt gttatgtgtg 14581 tgtgtgtatg tgtgtagatg gaggggaagt gcaggcagca ggacagggag aaaaaggtgg 14641 ggaggggaaa tggcagaaaa ccaggaggga aggcaggaag gagaggaaac atttcaggga 14701 accacagcaa gggcagtgct gaggagatgg atattatttt tctctctttt tcttgttcct 14761 agtattctgt catgccattc aacttacttt ttaaaaactt gtgagcagtt tagggcatat 14821 gtatctgtgt gattcccctc cacccaacat tttattataa acatttttgt tgttgttgag 14881 acagagtctc acactgtcgc tcgggctgaa gtgcagtggt gtggtctcag ctcactgcaa 14941 cctccacctc ccaggttcaa gtgattctcc tgtctcagcc tcccaagtag ctgggattac 15001 aggcacccac caccacgccc agctaatttt ttgtattttt agtagagatg gggtttcacc 15061 atgttggcca ggctggtctt gaactcctga cctcatgagt cgcctacctc cgcctcccaa 15121 agtgctgaga ttacaggcgt gagccatcgc gcccagccca taaacatttt taaacataca 15181 gaaaggctag aacaccatat actaaactag attctacaat taacattatt tgttctattt 15241 ttaaagtgtt cccaagttta tttactcgta gctgtgtatt tagtaaatgt ttattgatgt 15301 gatcatatct gaggtataat aattataaaa aagaaatatt tattaagcac ctactccatg 15361 ccagatactg agcagcatgc tgggaatgca aagatgaaca agacctgacc accatgctga 15421 ctttcaaacc ctgtgatttg gtaatgggag aaagacaagt accagtcaat acaggtgtgg 15481 tcattctagg atagaaatca ctagaggtgg ggctgggtgt ggtggctcat gcttgtaatc 15541 ccagcacttt gggaggccga ggtgggcgga tcacgaggtc aggagattga gaccatcctg 15601 gctaagatgg tgaaaccccg tgtctactaa aaatacaaaa aattagcctg tagtcccagc 15661 tacttgggag gctgaggcag gagaatggca tgaacccggg aggcggagct tgcagtgagc 15721 cgagatcacg ccactgcact ccatcctggg caaaagagca agactccgtc tggaaaaaaa 15781 aaaaaaatca ctagagggta ccatgaaagc tcataagagg ggacatttca tccacattgg 15841 cgctgaatct gggatgtgcc cctggaagaa gggtttcctg agagttttaa aggacacgta 15901 gaaattagcc aggaggacag tgtgggggtt gacttgtgtc tgaggaccct gcatttcttc 15961 agagtggatg taagaaggat ttggggagtg atgtggaact gcaggggcta cagggccagg 16021 tggggcccct gctaaggggc tcttagcttt tatcctgtgg ataggtgggg gcctttggag 16081 aattaagcag ggaagtaata tgattaggaa gatgattcta gtggcaaagt agagaatggg 16141 ttgcagaggg gcagggccag aggcaagaga accaaggaag aagcttatgc agttgatcaa 16201 gtaagatgtt cggatgtcat gaagaaagca gacttgagaa agaatgaagt ggaatagatg 16261 agttgtgacc tcctggacat tggagagagg taggagtcaa agaggtttgt ggctaggatg 16321 gctgtgggtg ggtgatgtta ctgttcactg gatggagact gcaaaagcag gggctttctg 16381 gtcagggaag atgatgaatt cagcgttgga catgttgatt ctgttgttgt tgttgttatg 16441 gagcttcact cttgttgccc aggctggagt gcaatagtgg tgtgatcttg gctcactgta 16501 atctctgcct cctgggttca agcgattctc ctgcctcagc ctcctgagta gctgggatta 16561 caggcatgct ccaccacgcc tggctaattt tgtattttta gtagagacag ggcttctttc 16621 atgttaggct ggtcttgaat tcccgacctc aggtgatcca cccactttgg ctcccaaagt 16681 gctgggatta caggcgtaag ccaccatgcc cagttgacat gttgattgtt gatttcaaag 16741 taccttctgg atctcctggc accatctctc actggctgaa ctccaaccag ggagcccgct 16801 gatgcagtcc attcagattc tccctggatc tcagggagca cagcagggag ggatggaaag 16861 tggatttgga gggacaaaca gattatatct agcagattaa tacatgttat aaaaaagata 16921 cacaaaataa ggattgaaat accattagca ggccaggcac agtggttcac acctgtaatc 16981 ccagcacttt gggaggccaa ggcgggcaga tcacgaggtc aggagatcaa gaccattcgg 17041 gctaacacag tgaagcgcca tctctactaa aaatacaaaa aattagccag gcatggtggc 17101 aggcacctgt agtctcagct actcgggagg ctgaggcagg agaatggggt gaacccggga 17161 ggcggagctt gcagtgagcc gagatcgccc actgctatta gcagcaaaag gattgtaagg 17221 attgttggtg gctttgcatt gcatagtgtt attagagtca gattctcctc catacttagg 17281 tattttttga gggcctgtac tttaccctca atggacatga gacatagttc ttgcttttgg 17341 agagttttgt gagggttata atacatcatg gttaaatagt tccctttttt tttttttttt 17401 tttttttttg agacagagtt ttgccctgtc acccaggcta gagtgcattg gcgtgatctc 17461 cactcactgc aacctccgtt tcctgggttt aagctattgt cctgcctcag cccccctaat 17521 agctaggact acaggcacgt gccaccatgt ccggctattt ttttgtattt ttggtagaga 17581 cagggttttg ccatgttgtt caggctggtg tcaaactcct ggcttcaagc aatccacctg 17641 cgttgggtgt cggcctccca aagtgctggg attacaggcg tgagccacca tgcccagcca 17701 tggttaaata gttctgatag tggttatctc tgtgtaggga atctgagtga ctggaggacc 17761 gggaagggaa gttttttttt tttttttttt tgagacagag tctcgctcta cccaggctgg 17821 agtgcagtgg cgcaatctcg gctcactgca agctccgccc cctgggttca tgccattctc 17881 ctccctcagt ctcccaagtt gctgggacta caggcgtccg ccaccacgcc tggctaattt 17941 ttctattttt ttagtagaga cggggtttca ccgtgttagc caggatggtc tcgatctcct 18001 gacctcgtga tccacctgcc tcggcctccc aaagtgctgg gattacaggc gtgagccacc 18061 gtgccccgcc aagatttttt ttttgactta attttccttt gttcgtttgg atttgtatca 18121 tgactgtatt atctattcaa aaatataaat tgaagtttta aaaagcaatc ttagtgatca 18181 tataattaag taccaaagtg aacggtatgg accagaggta gtggtggact gttgacctgt 18241 gggctgaatc tggccctcag aagggtttta tttcaccgac atggggccta gcattttctt 18301 tcatggagtg atcaacaagt cataaaggca gatttattgg aggccatgga ggctcagagg 18361 tgactgaggc tttgctgttt ggtgcctggg aggaaaactt gtgtgacggg gcaatcagga 18421 agcctcattt atggccagtt agatttgggt gactggtgag gcatgagaat ggaagggacc 18481 agcaggcagc ttggaagtca gaatctggtg caggtgagca gccaggggga gatgcacact 18541 tggcggttgt cagcatggag atgcattggg cagggataag gtggaaatcc agcaagatag 18601 gaatttgggc catgtgaatg ctggagaccc ccctgccact tcccctgctc aggcctattc 18661 tatgtgaata gacaagttct gtgaacaaga aagagggaaa cgcagaaggg caaagacagc 18721 tttgaggaac atggagcagg gatgagaaaa atcagaggag ccactaaagg agaaagggaa 18781 gcactaggcg aaaagcgggg cggagagcca gcaggtgagc agctctgtga ctgcccctcc 18841 tggatggcct gtaatctagg gccagaagag ttttattttc aggttcaaag agaaaggacc 18901 catttcccca tcttaaccct cttttttatt tttttgagat ggagttttgc tcttcttgcc 18961 caggctggag tgcaatggcg cgatcttggc tcactgcaac ctttgcctcc caggttcaag 19021 cgattctcct gcctcagcct cctgagtatc tgggacacag ggatgcgcca ccatgcccga 19081 ctaattttgt atttttagta gagacggggt ttctccatgt tggtcaggct ggtcttgaac 19141 tcctgacctc aggtgatctg cccgccttgg cctcccaaag tgctgagatt acaggcgtga 19201 gccaccatgc ccagcctccc atcttaaccc tcttaaccct tcagtgttta catgatatcc 19261 agcttttgcc tcatcctgcc tcccacagat gtccagacca agatagaacc tgtctctcca 19321 tgttcttccg tcaactctga ggcctccctg ctctcagccg actcctccag ccaggtgagc 19381 accaccttcc ttccttcctc caggttttcc agtgagttcc tcctgaccta gaggtatgag 19441 agctttccct tcttcagctc cttctcctcg acactcttcc tgactaatgt atctctggaa 19501 tgccccacat aggcttttat aggagaggag gtcctggaag tgaagacaga gtccctgtcc 19561 ccttcaggat gcctcctgtg ggatgtccca gccccctcac ttggagctgt ccagatcagc 19621 atgggcccat cccttgatgg ctcctcaggt aatggagtct cctgacactt ggaatcccta 19681 gtgagaggcc tttggtcctg acatttcttg ttcttttctc tcaccttcct tcaggcaaag 19741 ccctgcccac ccggaagccg ccactgcagc ccaaacctgt agtgctaacc actgtcccaa 19801 tgccatccag agctgtgcct cccagcacca cagtccttct gcagtccctc gtccagccac 19861 ccccaggtac tgaagaagga gaaaagggcc gggcatggtg gctcacgccg gtaatcccag 19921 cactttggga ggctgaggcg ggcgaatcac ctgaggtcag aagtttaaga ccagcctggc 19981 caacgtggtg aaaccttgtc tctactaaaa atacaaaaat tagcgtggag tggtggcagg 20041 cgcctgtaat cccagctcct cgggaggctg aggcaggaga atcacttgaa cccaggaggc 20101 ggaggttgca gtgagccgag atcatgccac tgcactccag cctgggcgac aaagcgagac 20161 tttgtctcaa aaaaaaaaaa aagaaaaaag aaaaaccaaa aaataaaaat aaaaattaat 20221 taattaatta aaaaaaaaag gccaggcaca gtggctcacg cctgtaatcc cagcactttg 20281 gcaggccaag gcaggcggat cacaaggtca ggagatcgag accatcctga ctaacacagt 20341 gaaaccccgt ctctactaaa aatacaaaaa aattagccag gcgtggtggc gtgcgtctgt 20401 agttccagct gctggggagg ctgaggcagg agaatggcgt gaacccggga ggcggagctt 20461 gcagtgagcc gagatcgcac cactgcactc cagcctggtc gacagagcaa gactgcatct 20521 caaaaaagaa aaaaaggaga aaaggaggcg ggctgcattg accctgtcag agagcttgca 20581 gccattgagg ctgcccaatc cttgggtgct cttgtgtggg tgtggacaca aggaccccac 20641 ccagcgcaca gccctctgac ctctgctcac ttgccccagt gtccccagtt gtcctcatcc 20701 agggtgctat tcgagtccag cctgaagggc cggctccctc tctaccacgg cctgagagga 20761 agagcatcgt tcccgctcct atgcctggaa actcctgccc gcctgaagtg gatgtaagtg 20821 ttggggagga gagaagtagg gcgagtgatg ggaagggaag agggaagtct tgccatagag 20881 agaactgtgg gatgaggagg gagtcgatac ctaggagaaa aaagtgagga aacggaccca 20941 ttagatcatt ctgagggtca gaatggactc ctgtttctat atgaatatca actccaggta 21001 gttttagggt cctttaggtt tgctgtcact gcagatggag aaatggggtt ggtctccata 21061 aattaaaggc aaggaagtcg tactgtacta tacgactacg gtcatggcca tagttgggga 21121 gggaggctgc ctcctagggg aaaatcaggt aaactgtata agcagtataa caaggttaaa 21181 ctttgaaccc tacaaatgtg ggctgacagg aagaaagagt tacgaaggaa agagtttcca 21241 ccagcatgga gaaagttctt cccacactct aacctttccc acctctgagg agcagtagca 21301 gggaatgttt gaaagaaggg ccccttccct agcgtgtttg agtagagagg aaaaacctct 21361 ctggctatgt aggtgggtca ggaaggcagg ggaatggcca tgtgagcttt gggtaaatca 21421 ttccttgggg tttgagagca tagtctcctt ctgccaactc atctgtcatt cttttttgac 21481 cttccgggtg cctaggcaaa gctgctgaag cggcagcagc gaatgatcaa gaaccgggag 21541 tcagcctgcc agtcccggag aaagaagaaa gagtatctgc agggactgga ggctcggctg 21601 caagcagtac tggctgacaa ccagcagctc cgccgagaga atgctgccct ccggcggcgg 21661 ctggaggccc tgctggctga agtaagacca gtctgtccct gggagaccaa caggaataca 21721 gcctccttgg aatgattccc ctttttcttg tctccttctt tgtaccttag aacagcgagc 21781 tcaagttagg gtctggaaac aggaaggtgg tctgcatcat ggtcttcctt ctcttcattg 21841 ccttcaactt tggacctgtc aggtgagacc ttccttgctt cactagaacc ttccaggtgg 21901 agcccacgtt acagcctcct agctgcctgc tttggcccag gccatttcct tctgcttata 21961 tacaaatggt tctggtgtcc ttggcaccga tgatgtgtct ggccagcccc ttcctgctcc 22021 attatttgtc tttttttctt ggagatggag ccttgttctg ttgcccaggc tggagtgcag 22081 tggcacgatc tcggctcact gcaacctctg cttcctgggt tcaagtgatt ctcctgcctc 22141 aacctcccaa gtagctggga ctacaggcgc acgccaccat gcccggctaa ttttttgtat 22201 tttttagtag agacagggtt tcactgttgt tgcccaggct ggtctcgaac tcctgagctc 22261 aggcagtctg cccacctcag cctcccaaag tgctaggatt acaggcttga gccacctcgc 22321 ctggcctcct gctccattat ttttccagtg gagagtagat gatggtgata atgctggtag 22381 aaatagtgac tattaagtgc ttactgagtt ctaggctcta ggtgatgcac tgtctttttt 22441 gtatgtaggg accatcctta ttcccatctt acagaggcag actcaggctt gtataacaga 22501 ttgcaactca agcagtccat ctgcagagtc tttattctta cctagatact tgatacagct 22561 gaaattcttc attgtgctca gtccttcttt atccacagca tcagtgagcc tccttcagct 22621 cccatctctc ctcggatgaa caagggggag cctcaacccc ggagacactt gctggggttc 22681 tcagagcaag agccagttca gggagttgaa cctctccagg ggtcctccca gggccctaag 22741 gagccccagc ccagccccac agaccagccc agtttcaggt gaggagagag agagagggcc 22801 ccctctcgtt tgaggatatg agttggtctt ctccccgctt tcagtgaggg ggacccaaag 22861 ctttctcttg ccgtcatcgc cttctttgtt gtgggagtgc tgggcagtct ccaagtgttg 22921 tctccttcct gcttcactcc agcaacctga cagccttccc tgggggcgcc aaggagctac 22981 tactaagaga cctagaccag ctcttcctct cctctgattg ccggcacttc aaccgcactg 23041 agtccctgag gtgtgggatt cattgctggg gcatccagct cccccggcct cccaaggcct 23101 tctgcagtca gcctggctgg cctgtgtact aaagcgccag tgctccctgc cctctcctca 23161 ctcctgtggg tgggcaagga gggtgggaat cttggcctgg catggaggtc ctgagctcct 23221 ccatttcccc aggcttgctg acgagttgag tggctgggtc cagcgccacc agagaggccg 23281 gaggaagatc cctcagaggg cccaggagag acaggtaggg cgggtggctg gttcaaggaa 23341 ggccacttga agacaagggc taaaggcctt gtctggggat ttctagtggg catctcgctg 23401 tggaagtttg agggatcgtt tgaaggaagt ggagggttga gtgggaggtg gcatcagctt 23461 ccaggggctg tttgtctctt tcccttttcc cttcaccctc tccaagtcct tagatacttc 23521 tccttccaga agtctcagcc acggaagaag tcacctccag ttaaggcagt ccccatccaa 23581 ccccctggac ccccagaaag gtgagtgtgg ggtggggtgc ttacttatta agtgaaattc 23641 cactttcaag agctgtaccc ccagtagctg tcctgtgcct gtcattactg tcaccagtgg 23701 gattatacct ccctcctcca ctgggggtct cctgtctttt tctctcatcc tacccgcttc 23761 cctggtagaa actgagcatt gggcttagtt cccctcaaat cctgtttccc cacctgccta 23821 gggattctgt gggccagctg caactatatc gccacccaga ccgttcgcag ccagcattct 23881 tggatgcaat tgaccgacgg gaagacacat tttatgttgt ctctttccga agggtgagtt 23941 tctcctgccc ttccatctct gtcaccccag gttcccagca gtctgtcttc agtgggggga 24001 tgtagagtag ggctggggag cttgttggca tctttgcctc cacccttctg gcctggccca 24061 tctgttcccc aggaccacct gctgctccca gccatcagcc acaacaagac ctcccggccc 24121 aagatgtccc tggtgatgcc tgccatggcc cccaatggta actctttccc tgctgggtat 24181 gggggcagtt ccaacaggga gatccagcct gggagctgag ggggaagggg gagaggagca 24241 gggcatccag agagctgctg gacccacagg tgagaagcag caggaagtag gtggagggaa 24301 gagcaccatg caggagactt gggagggaga caagggtgag ggtccttccc tggtgaaggt 24361 gcttgggtag atgactgggc agaggcgggg ggccgggaac ctgctagact ctcacttact 24421 tttttctctt gatgtcctct cctgtgtgac cccacactca atgccctttt cttttcctac 24481 cctctccttg cccctgcacc aaactctgtc cctgccatcc tgtttaactc ccgctatgga 24541 tctgcctcct cactattgcc ctttcttttt ctctctgctc acctctactt ttctgacctc 24601 ctcccctcct ccctgcaatc cctgtttgcc tccccatctc ctcttcctta actctcccct 24661 tctcacagag accctgtcag gccgtggggc cccgggggac tatgaggaga tgatgcagat 24721 cgagtgtgag gtcatggaca ccagggtgat tcacatcaag acctccacag tgcccccctc 24781 gctccgaaaa cagccatccc caaccccagg caatgccaca ggtggcccct tgccagtctc 24841 tgcagccagc caggcccacc aggcctccca ccagcccctc tacctcaatc atccctgacc 24901 tctgccattc acactgactt agaacggggg gagggggtac caggtggcca ggtgggactg 24961 tttcaaattt ccctgatccc caggcttggg gcaattggta aaggaaagag caggtgtggg 25021 ggttaagcac ttatttgagg tgggggtgtt cacctctctt ctcatccctt ttcagaatat 25081 agggctcctc tcattcctgt gaacccccag tcctggcttc tttgtttgag gggattgtgt 25141 gaggttcagt tgtggggtgg gtggtgagct gctgcatatt ttttattttg tttctctagt 25201 gttatggcag tggaggtggg aatttagtcc ccaggtggga caagggaagt tttttcattt 25261 tggagctagt tactgggagt aagggagggt ggggtggggg ggagttcagg tttatgtgtg 25321 tgcatttctt ttttattatt attaaataaa caacttggag ggagttgaag gaatggtggc 25381 tgccttcctg gctcgtgatg gggagtttgg gggaaagtgg cttttccaga gccagcatca 25441 aagaatcaag cactgaagac atagctggct ctgaggtggg gtgggtgcgg gatcacagac 25501 cgcatgtcat ccccttgccc ccacccctgc aactctgggg tatatctgtg tctagcaacg 25561 atgtctcatg gcgtgacaga ctagaggata cgcatccctg gaatgtgagt caacaggaaa 25621 gatggggccc cacgcccttc agggtgagca gcctctgggt ctcctgctct cgagcaggtt 25681 ctccccctct tcttgctgtc caggatgagc agctcaaaca aacaacaaga gctggggcca 25741 gaggctcaga gcccaggaag tgggggtctc ggtgaagaag gggtgatatt tggaagggaa 25801 gggggatgtg gcaggatatc ctctaagccc taaactggag gggactgaga cttgagagtc 25861 cagatgaaag acaaagacaa gagtggggca ggcagaaaaa gctgtacatt cttatgtttt 25921 taattttttt tttttttttg agacggagtc ttgctctctc acccaggctg gagtgcagtg 25981 gcgcgatctt ggctcactgc aacctccgcc tcttgggttc aagtgattct tctgcctcag 26041 cctctcaagt agctgggact acaggtgcac gccaccacac ccggctaatt tttgtatttt 26101 taatagagat agggtttcac catattggcc aggctgccca cttcggcctc ccaaagtgct 26161 gggattacag gtgtgagcca ccatgcctgg ctaggctgtg cattctaatt aggaggaagg 26221 ggagcaggac aaagtgaggg attgagtctg aggccaacct ggaagaatcc ccctccagct 26281 gctcctgttc agcagtgcct caggcctgac cctgcaggag tgtcctgggt cagctggagg 26341 gtgtgagctg ggcctgggct caatgcccag atgactggac aggggctgcc tgggccataa 26401 gcccaggcct gaaggaggga ggaagaacct catgtgtcca gccaggcaac ccccacaccc 26461 ctcccactgc cctcacccat tactgtgttt ggggtgaaaa cagaccaagt aactatggca 26521 acctggaacc agggatccca ggaactagac ctttcctccc tccagagggc agcccagctt 26581 ctcagctatg ctttctatcc cagagtcctt gagccttatc caccgtatct ctggaggggg 26641 tgcagtgaga acaaggattt tggaagcaca tcacctaagc tggattccct accctgcctt 26701 tcgttagtag atgtccttaa acattctgag ccttagtttg cttatctgta agatgggatg 26761 ttcctacctc ctaatcaggg ttgatgacag gatgaaagac gacgtcctca gtgcctggtc 26821 cataatagat acaccttttc ctcatccctt gcccctttct gtccccatct ctgttgtcct 26881 tctctcttcc acttttatcc catttgggtt cattactgtt tcgttgtccc attgtcttcc 26941 tccttcctgc ccccagttcg ggaaagtact tgaagtgctg gcatctcagg ggtcaccctg 27001 gggtgggaca tcctgctcct ctgctgattg gcaagggaca gctggcactt tggtgtggat 27061 gagacatcag acctctcttg gctgagggat tgtggtggca ggaaggattg agaaggacgg 27121 tgagtgggta cctgatgcag ggtgaaagcc gggcaggctc tgggctgctg ccttctaggc 27181 ccatgtggat ctccattcct ctggccacac ccccaaggcc actcagcatg ccaggtcgct 27241 gtcactcctg cacctcattg gcccctcctg ccctatgccc actttgtctt tctgttctgc 27301 tttccttgtg ccaccatctc cttcatttct tcctcaggat cccacctttt gttctctctc 27361 atccctgacc tcatcatccc tcccttcctc tttctagact tctttcctgc ctttctttgc 27421 tctctacctc attagcagtt atcacaacct ctttttgaac cattattcat cactgttttt 27481 ttcttctttt gtatcttccc caccccatcc actgcttcac tgtcttaaat ttttttttta 27541 atttaaaaat agaagtgggg tctcgctatg ttacccaggc tggtcttgaa ctcctggcct 27601 caagcagtcc tcccatgttg gcctcccaaa gtgctaggat tacaggcatg agccaccgtg 27661 ctggccttat ttaaaaatat atatatattt ttaaattcac tgtctttttc ctttgttttt 27721 cttggtcttc tcatctgctg ctgctgctga ccctgtgctt gtctcctccc tttctcatgc 27781 cgcccccttt ctatttccca ttctgctccc tcgtccacat ttcttgcccc tttccattat 27841 gtcctctctt ttcccccact ttggtcctct cctcctcctt ccttccccca ctccctttac 27901 ttcccacacc agttctccat cctcttccca gctgtggggg ccagcactgg ggagcctgat 27961 gtttgcctca tcgttctgct atggcttctg gatacagcac atctggattc cggggcccag 28021 atgtgtggtt gccacagcga cctgggtccc tctggtaaat acaggcgccc acccgcctca 28081 gagcagagaa caaaacaatt gtgtgacttt cttttgtcac gagatagaaa tgtccttcct 28141 cctcccttct ccccacaccc atgtctctag gcaaaggaag ttttaagggt ccaaaccctt 28201 ccttctgagg ctcccttgcc cccttttcag ctgtagcctc ttagttctct cccatctttc 28261 atcctgcgct ctgtctttac attctgtgtt ccccatcctc acccgacccc cagttctcaa 28321 taatggaatg ttgaccaccc ccttccccat ctgatgccat ttcttctaag tgaatctaca 28381 caataactgg atgcagaaaa gtttcaaaaa tctggattct ccacccttct ttgtacaagt 28441 gcttcaaaga agccacttca tttgttccct aagtcttcaa tcctttcctt tcctggagtt 28501 gtttggggag aagagcgaat acaggcttag gaatgagaac acctgggtga gacccacctt 28561 caccactact gacttagccc cttgaacaag acccttaatg tctatgagcc tccgtttcct 28621 cagccacaaa gtgggattaa tgcccgcact gtgtaggaaa ctgttttgta aacggaagcc 28681 cggcacaagt gttttctatc tttcccctct ctgggactgc tttcttggac cgcagcccct 28741 ggcatctctt gttcccctgt cttcatcctt caggtccttg ccctcaattg cccttactct 28801 tatgtgtctg ccttctctcc tccctatctt tcctttcctt ttctctatct ttcatccctt 28861 ttcctcctga ggtcaggaca ggaacttgga ggatcattca aggagggagg ctcagagctg 28921 aagacacagg gactgaggtt acagagaaca gccggagctt gtggacagtt ctggacaaca 28981 ctgacctttc tacccaccac cacccagaca cctcagtgaa gccaacccca gctctgacca 29041 ctgggcttcc ctgccttcac gcctgggctc cttacccttt ccccctcctc tctcccagtc 29101 ctttcacatt tcagtgttga cttgcattta ttggtcacct acactgtgca aggcttggtg 29161 cttttcttac aagatgtgaa catacaagac atgagcttct ccatgagcct ggaaaccaaa 29221 gggtgaatca aatggtaaac aaagataatt tcagaacctg aaactatatc tgctaagctt 29281 ccagtggctt ccccaaatgc cttataagct tgtgtacatc tggaatccca catttctctc 29341 tgagacccca acagtgggct tccagaacat cccaggacca tccttcttgc tgtcttctct 29401 gaggtccctg gccactctct gtgccaaatg tacttatgtt gaccattctt cctaaataat 29461 aaagccagga catagggcat aggcacacgc gcgtgcacac acacactaaa ggatatgacc 29521 atggaacaga aatctgaagg tgtcagagga aattcatacc ctgtgttctg ccctctgtgt 29581 gatactgtct ccagaccctc cctcttccta tcactgcagg gctccccacc tcctgccagg 29641 agatccttgg gaagctgcta cctactgctg gctctggggg cttcaggcca aggcagagga 29701 aggaagctga gcctggctct gtccaagatg ctgacttcat cctccccatc aaagccagga 29761 aaaaaacagc cacacacacc cagacacacg gacacacacg gggacaccca cgcaaaagcc 29821 catgtgtgcc tttgccctcg gccacatgga gcttactggg catctggcca cagccacatg 29881 cagtgcaggc acgccctgac ccaggaacac cacacacctg ccaccatggg gatctgacct 29941 ggtgtgcaga cacctgggcc gtcacccatg tggcagaatc aagcacatat gggctccctc 30001 acgtaccttt cacccgtgac ttggccaagt tacacctcag tggtgaggtc tgtagcagga 30061 tagagagacc cagtcctcac tgtgaagctg ggggtagtgg gttcccactc agctgggata 30121 tttctcctct gtagcaaagt ctctttttca gctccttgtc agggcagcac atggggagat 30181 gaagctaggg aaacacccaa cagacttatg tggggtgtgt ctgtctccct gtcccccctg 30241 tcctctggct gtcagtcttc tgcctgggac cagagacagc gtgtccgttt cctccagccc 30301 ctacctctcc ccgcctctgt tcctgctccc ccgcagcacc caggcaccag ccccccaccc 30361 tgctttccac cacttcaccc tctattcagg tgacccaccc ttatatttgc ctcctgtctc 30421 tagccctttc ttcaacccta aaccttcctc ctaaccttca ggctttcctt ccttgtccgc 30481 cccaacccca gcttcctccc caaggcttgt atgtcccagt gttcacccaa gcccttgccc 30541 tgccagtgcc caagcctcac ctgctctttc tctagggact tgctgaggta ggagaatgca 30601 aagggggcag tgacaggaga gggactgaac agaacagaca gcaggagtgg ctcggggacg 30661 ggataagccg agaagataga aatggtggga ctggagcagg gtggggactt ggggtcccag 30721 agacggtccc agttggagaa agaggttgcc tcttgcctcc tgccctctgc ccctgctccc 30781 tgccccagac tgccagggga gaactggtga gaggatgagg gagccagcac tgggtgatag 30841 acatggggtg cgatccagag atccaagcta gagcagtggc tgggaagaga tgggagagga 30901 gcatggggga gggaggggaa gggggctggc catgaggaag gatataggct gagaggaaac 30961 cactggggtc agagaggagg ggcagcaggt ggagaaatgg tggggaaaaa agaaatggca 31021 ggaagggtgg ggggtgtcaa gttgggcaaa aattagggtg agctaaaaat ggcataggga 31081 aaggacgtgg gaggtaagac cggggctggg ggttgagatg agggtatggt gagacagtga 31141 tgctagacag ggagtggggg ctggggtggg ggcaggagaa gaggaggggt ggcatgtggg 31201 agggcctggg aggggtggga ggacctgccc tatctccgct cccctccctg acctcatcct 31261 ggtcctcccc ttctcctccc ctgctcgctg cagactccct cctcactgtc gctgccgaga 31321 tccacagtcg gttgtggctc agcccctgtt gcaggggaca agtgagggag acttccctgt 31381 cctgccctga gacgccgccc tcccggggtt ggggacagag caggtgcaga ggcactgcag 31441 ctgctcggtt gcccaggtaa ggaacagagt tctggggaga cccagaagga gaggcaaggg 31501 atttctcgcc tatccccatg ccccaggtga aggtgtggca ggtgagctct cagcagtgct 31561 tccagagcaa ggacttctgt tcaggacaga ggacatagga gcattcgctg ggaaccaggc 31621 tccctgtgaa ggagtgtctg ggaggagagg gtggtatgtg tgtgttgggg tgggcagtat 31681 gtgtcagggt ttggatggga tgcagacaac tgagaggggc tgggaacaaa ccgagggact 31741 ccgggagtca gatggtggaa ccagggtaga ggtggttatt ggtgaggtga gcccctgggc 31801 cctggaggtg ctggggctag cgggaagata ggactaggct gaactggtgg ccagctgaac 31861 cttggaagga aagtgcttgc ggaatcccaa atgtcgggct agagcagctg agcccaggct 31921 ctgggggctc caggctccag gaggtgccag agggtgcttc tttgggcatc caaatgggga 31981 ctccctaggt ggcagcgtga gagttgcagg gagggaggtc tgggagatct ctttaggaaa 32041 tactgaaatg ggacagtcag gactcgggtc tgggacgtgc acagagcctg cagagaggat 32101 cctaaaactt gttcttggcc agagctctgc actcatgact cacggttcag cagggctggg 32161 atggctgaga gatgcctctg ggggcacagg ggctccgtct tcactctgtg ccccttccag 32221 gctccccttg cctctgtctg tctgaacccg cactctccct cacctcgccg cccccgcccc 32281 ccaccaccag tacacgtgac acgccccggg tgggaagggc gggcccctta tctcgcctgg 32341 aaagaattta atggcataga aacaactgga agtggaactg agaggaacta ctgggaagca 32401 actcggaaac accacagctg ctgcctgccc tgcccctgct tcccgcatct ccagcccctc 32461 ctcttttccc tgaccctgcc ttcaagtccc tggggaccca gcacctcaat tcctcaaact 32521 tcctttgctc actccagacc tagccactag gtgcaggacc agaaactgcc ttcctgtctc 32581 tcattgcacc cctaaccatt ttcagctggg gcctggaaat cctttctctt aatttttcca 32641 ttcccaggcc gattagctcc ccagaataag gggtatctgg attccccaac ctacccctcc 32701 ctgacacctc cttcctacct ggttccagcc caatatctga gacccccaca ttcccttttt 32761 agtctcactc ctagaatcct tctaccttca cctctcattg ccagattcaa ctgagggtgt 32821 agcaggaggg tcagagggag gcatgggggc aggaccagaa cccccatctg tggaacatct 32881 tttatggaag ccagcccctc tccccacccc ttgggaccta tacagtccct aatgagcaaa 32941 gtgtcaaagt tggctacaac acaaaatacc caggaagctg agtcacagtt tgggaagatg 33001 cccccatccc tgccagccct ccaactgtgg ggagggtgcc aagagggtga ggccatatct 33061 ctctccaaac acacactaca gagaaccacc ccagacatag tggcaggggg aatggtgaag 33121 gggagacaca agagacaccc aattctttgc ttgcccagta ccgagagaag agcagaatta 33181 tcccaggagc tggaatgggg agctggaggg ctgggtggtg gcaacagggc agggagggag 33241 attgggaagg agcctctggg tgtgtggtca atggaggctg ctctgctctt ctagaccaat 33301 ctcaccagct ttccagaaaa gctggtgagc tctcttcact ccttcctggt gcaaagaccc 33361 tctctgcttt cccccactgt gcagggaaag atgaggaatc cacttatttg gggaggggca 33421 ctggacaact cacagaggcc tccaggggta cggggaagta tggtggaggc catgtgtccg 33481 ctgcttccca ctgggaaggg cctcctgctt tgttcctctc acaccatcca ggttctttca 33541 ctcacagagc tgccctcgga gccaacaaga aatcagcctt cttaggcatc acccaatgac 33601 taggtgtgga ggaaacctga aggggagatt gggtgtgtgt gcacgtgtgt gcatgggcac 33661 ccatatgtgt acctacaagg atgcaggttt tgggggtgga tttgtgtgtc tgtgcaggta 33721 tgtttgtact tgtggatgag cacacattaa tatatttgta agtgtacatg tgtacacatc 33781 tttgtgcaag cattcccagg gtgtgggagt gattagcatg gccttagaca caaagtgaca 33841 ggatttgagg gtcggtcaga gcccatctct tacccaaggc tacactccag gctgacagaa 33901 gggctgggga gacaggaaag ggcagctctc tgtatgccag gcctctcggc ctgggctttc 33961 ttgggtgggg agcctcttgg tgtggaaaca ctctgtccct gttcagctcc ctttccgtct 34021 cttgctggct ggtagcagaa tgtgatggct ggtagcccca attactcctt tttctctgat 34081 ccaggagaaa agtttaagag tagaaagcca gaagatgtgt gcgtgttggg tggaggtgtg 34141 tgtgtgtcac agtcacctcc gcctaccagt gacacatccc ccactcctca ccggtcagca 34201 tttccacacc caccctgccc tcctcctttc cccagcctct ggtctggaag agctgggcac 34261 ctttcaacat ctctcctcaa ctcttcaccc tagctgaatc ctgactggcc ccctcacact 34321 caggtttgcc atagccctgt ctcctgtctt atcctgggac ccccctacca cctttcctcc 34381 cacctcttgt ctctgtgtgc tccccaatcc ctgcctcagt aaactccaca aacacataca 34441 actgggtaac cgtcatccag atcaagaaac aagacacctt tagcacccca gaagcccccc 34501 gccccatgcc cctactagtc actaccccct caagagtatc acacttctga cttggagcac 34561 tacagattaa tgttgcctct cctcaaactt cacataaata gaactgtgca gcatgctctt 34621 tcctgtgtct ggcttttgtt caacatcatc ttagtaagat ccatcacact actgcatata 34681 actatagttc attcctgctt tattttattc cacggtgtga ataaaccaca atttatttat 34741 tctactgatg tgaaatgttt gagtagttcc cagtttgggg ctattatgag ctatgttggg 34801 tatattccta ggagtggaat tgcggggtca tagggtttgc atgtgttcac ctttagtaca 34861 tttagtagat tctgtcaaac agttttccaa agtgcttgta ccatctggtc ctgccaggat 34921 cttccctggc ctggcctctg attctgcctc agccctggtg gctgtgtcta accaacttcc 34981 agtcccagac actgtccgtg gtcctagccc tgaccttggt cctgcccttg gttccagcct 35041 taatgctagt cctgcgttta cattgaactt ttcattcatt catatagtcg gtaaatattt 35101 attgagcacc agttatgtgc cagatgcagg gatacagcag ccaagaaaaa aaaataggtg 35161 ccatgcctgc cctcatggac tttatagtct agtgggcaag aaagtaataa aataatctca 35221 cagtatattt attattactt tggacttcaa aaaaatattt gtttatatat gattgcaaat 35281 tgttataaac gctacacatg agaatggcat gatgctgtga tatcacatga taggaggacc 35341 tgacctggtc aggaaaactt ccctgtgaaa aaaatggcgg gctgagacct gaaggatgag 35401 aaggccaaga gagaagggaa cagtgttcca cacggatgga acagagtata caaagatctg 35461 gggttgggga tgggcacaga gagtatgagg gacttgaagg tcagtggagc tggaatggag 35521 ggagggaggg aaatgttatg ggtggtgaag tgggagcagg ggatacatga tggtagggct 35581 gtttgtcagt gttctaagag aatttggaag ggagacactg aagcattttg aggataggga 35641 catcctgatt gcgtttgtga gactgtagag ggcaagagtg gaagcagagg gaagggtcag 35701 ggctggacac agtggctcac gcctgtaatc ccagcacatt gggaggctag ggtgggagga 35761 ttgcttgagt ccaagagttc aaggccagcc cgagcaacat agaaagatcc tgtctttgca 35821 aaaaaaatta aaaattagct gagcgtgatg gcacacacct gtagtcccaa ctactcagga 35881 ggctgaggtg ggaggattgc ttgagcaaca ggaggtcaat gctgcagcga gctatgatcg 35941 caccactgca caccagccta ggtgacagag tgagaccctg tctcaaaaaa aaagaaaaaa 36001 aaagcagcca ggaggcaatt gcagttgtca ggcaagaggc ggatggaagc ttggattaga 36061 atgaaggtaa gtgggagaga aggaaatgaa ttaaagagat gtttatgaga aaatacaggt 36121 agatacaagg aaatatggta gccttgaatg ggctattagg tttctggctt gtaaagctgg 36181 atgaatggtg gggtcctgag cttcaatggg gaattctggg aggggatggg atacaggaag 36241 gtgaccgtga gttcaggtta caggcatggt gaatttgagg tgcccaagtg gagatgccaa 36301 gtaggcaggt ggatatattg ctgtggagcg agaggaagat gactggggtg ttttacaaat 36361 tggtgtatag tttggaaatt gaagttataa gtctggatga aggtaccagg tggatgacta 36421 gagcagtagg tctattaatt tagtagaaat ttgttgcaca ttacaatcat ctggggagct 36481 tttaaaaata ctgagtaaaa ttcacttact tggtcgttct agtcacatct caagtgttgc 36541 agagcctcat atccctcctg gctaccatat tggtcagcgt ggatacagaa tgttttaaaa 36601 gagaagggcc aggtgtggtg gctcatgcct gtaaccccag cactttggga ggccgaggtg 36661 ggcagatcac ctgagctgag gagctcaaga ccagcctggt caagatggtg aaacctcgtc 36721 tctactaaaa ataaaaaatt agctaggtgt ggtagcgggt gcctgtagtc ccagctactc 36781 gggaggctga ggcaggagaa tcacttgaac cccaggaggc agaggttgca ttgagccaag 36841 atcgtgccac tgcactccag cctgggcgac agagtgagac tccatctcaa aaaaaaaaaa 36901 aaaaaaaaag atgcggtgaa ctgtgtctag tgattttgag agaagtcaaa ggtcaagcaa 36961 gatgaagaca aatgtccctt agattttgtg gcaccgaagt cattgggaag ccctagaggg 37021 agctgtttgg tgggttggag ggagtagagt ggattgaggc atgcataggc tataaggaaa 37081 tgtaaacatc aaatacaaat aaattttgac aactgtggcc atgaagtaga ggagagaaat 37141 atgtggtggc tggagggggt gtgagattgg ggataggttt tttaaggttt catttttttt 37201 tttttttgag atggagtctc gctctgttgc ccaggctgga gtgcagtggc acaatctcaa 37261 ctcactgcaa cctttgcctc ccaggttcaa gcgattctcc cgcctcagcc tcccaagtag 37321 ctgggattac aggcacctgc caccatgccc agctaatttt tttttatttt ttatttttag 37381 tagagacggg gtttcaccat gatcgccagg ctggttttga actcctgacc tcaagtgatc 37441 cgcctacctt ggcctcccaa agtactgaga ttacaggcat gagccaccac acccagccat 37501 ggttcaattt ttatacagca aaatgccaaa gtcttaagtg tgcagctcaa tgagtttttg 37561 aatatgtata tatctgtgta accacccaga ttaagataca aaatatttcc agaaatatga 37621 ctccttccgg tcaataatcc ctacaagtaa tctgatccta tcttctaaaa gcaaagatta 37681 actctgcctg tttttgtttg tttttttgtt tgttttttga catagagtct tgatctgttg 37741 ccaggctgga gtgcagtggc attatcttgg ttcactgcaa cctccgcctc tgggattcaa 37801 gcaattcttt tgccttagcc tcccaggtag ctgggactac aggtgtgtgc caccatgccc 37861 agctaatttt tgtattttta gtagagacag ggtttcaccg tgttggccag gatggtctcg 37921 atctcttgac ctcgtgatcc gcccgcctcg gcttcccaaa gtgccgggat tacaggcatg 37981 agccactgca cccagcctct gcctgttttt gaacctcata taaacggcat tattcagagt 38041 gttttctttt ttcttttttc tttttttggt agctcaggaa gggtatataa gagtataaga 38101 gcattctctt tctgtgtgtg atatttttta ctcagcacta tgtgtgtgaa attcacccat 38161 gttgcatgta gcaatagggt gttttttatt gctgtatagt attccattgt atgactctgg 38221 cacaatttat ttatccactt tatgaatgga cacttgagtt gctttcagtt ttagctaata 38281 taggaataaa tctcatgaac attatggtct acgtcttttg gaagaactat gcagtcattc 38341 ccaccggata tttagttata gatatctggg ttatagacta tatgttttag cagatactac 38401 caaacaactt tccagtgttt gcattaatat atgctcccac ttgcaacaga tgacaattcc 38461 agctgcttca cattcttgcc aatatcaagc aatgacagcc tctttgaaga gggacaagca 38521 ttctggagga tatttagtgg catcttattg tggttttaac gtacatttac ccgataacta 38581 gtgattttgt gcatctttta aaatgctgct tggatctggg tgcagtggct cacacctgta 38641 agcccaacta ctgtggaggg tgaggcagga ggattgcttg aggccaggag ttcaaggcca 38701 gcctgggcaa cacagcaaga tcccatctct acaaaaaatt taaaaattag ccaggcatgg 38761 tgatgcccat ctgtagttcc agctactctg gaggctgagg caggaggatc acttgagccc 38821 aggagtttga ggctgcagtg agctgtgatt gtgccactgt gcttcagcct gggtgacaga 38881 ggtagagttt atcttaaaat aaaatgaaat agggtgggtg tggtggccca tgcctgtaat 38941 cctaataact ttgggaggcc aaggcagatc acttgaggtc aggagtttga gaccagcctg 39001 gccaacttgg tgaaaccctg tctctactaa aaaaaaaaaa aaaaaaaagg ctgggtgcgg 39061 tggctcacgc ctgtaatccc agcactgtgg gaggccaagg tgggcggatc acgaggtcag 39121 gagatcgaga ccatcctggc taacacggtg aaacctcatc tctactaaaa atacaaaaaa 39181 atgagccaag tatggtggtg ggcgcctgta gtcccagcta ctcaggaggc tgaggcagga 39241 gaatggcatg aacctggtag gcagagcttg cagtgagcca agacgcacca ctgcactcca 39301 gcctgggcga aagagcaaga ctccatctct acaaaaaaaa tgagctgggc atgtggcgca 39361 tgcctgtatc ctagctactc cagaggctga ggtaggagaa tcacttgagc ccaggtaaaa 39421 taaataaaat gtttattggc tctttggcta tattcttttc tggagtacct gtttgtcttt 39481 tgacaattta aaaaactagg tttcctgatt ttggcttatt ttgttttgca gagatttttt 39541 ttttattttt tttttttttg agatggagtt tcactcttcc tctcaggctg aagtgaagtg 39601 gcatgatctc agctcactgc aacctccgcc tccagtttca agcgattctc ctcctcagcc 39661 tcccgagtag ctgggactac aggtgcccgc taccacgccc agctaatttt tgtattttta 39721 gtagagatgg ggtttcacca tgttggccag gctggtcttg aactcttgac ctcatgatcc 39781 gcccacttca gcctcccaaa gtgctgggat ttcaggtgtg aaccaccaca cccagctgcc 39841 ttcttattct cttaatgatg taatttgatg acctgaaatt ttgttttgtt tttttctttt 39901 tctttttttt gaaacagagt atcagctctg ttgcccaggc tggagtgcag tggcatgatc 39961 acagctcacc acagcctcga cctcccagac tcaagtaatc ctcccacctc agcccctgag 40021 tagctgggac tacaggtgta ccaccacgcc cagtgaattc tttttgtttt ttgtagagac 40081 agagtctcac tatcttaccc aggctggtct ccaactcctg ggctcatgca atcctcacac 40141 ttaagcctcc tgaagtgctg ggattacaga tacgagccac tgcacccggc ctatggttag 40201 gttttgtgtg tgttttactt aagaaatctt gcggctgggt gcagtgactc aagcctgtaa 40261 tcctagcact ttgggaagcc gaggcgggcg gattgcctga gttcaggagt ttgagaccag 40321 cctggccaac atagtgaagc cccgtctcta ctaaaaatac aaaaaatagc tgggtgtggg 40381 agcagatgcc tgtaatcccc gctactcagg aggctgaggt aggagaatca cttgaacctg 40441 ggagacggag gttgcagtga gccgagatag caccagtgta ctccactctg ggcgacagag 40501 caagactcag tcttaaaaaa aaaaaaaaag aaatcttgct tactccaagg tcagaaattt 40561 tctgtaagat attgctatga aagatattgc tttacttttc ttatttaggt cctgggttca 40621 tctcaaatta atttcggtat atcagacata aggtggagaa tgactgtttt tttttttttt 40681 tttttttttt tacttaggga tgttggattg acccagcacc aggaactaaa aagaccatcc 40741 tttctcctct taactgcagg ggcacttttg tcgttaatta gttgaccaca tatgtgtggg 40801 cctgtttctg ggccctattc tgtttcatct gtctatcctt gcacctttaa aataagtctt 40861 gatatttgat catgtaagtc ctgaggaaga ggatttttgt tttttttttt tcaattggag 40921 atatttgagc aaggatccag ttgagagtga ggagttgaat aaaaaagaga gagaagccaa 40981 gcatggtggc tcatgctttt aatcccagct acctgggagg ctgaggtggg aggatggctt 41041 gagcccagga gttcaagtcc agcttcaacc agggggttga gcaagatcct gtctctatat 41101 tagaaagagg tcgtgcggtg actcatgcct gtaatcccag cactttggga ggccaaggcg 41161 ggcacatcac ttgtggtcag gagttcgaga ccagcctgcc tacatggtga aacctcattt 41221 ctactaaaaa tacaaaaatt agctgggcat ggaggtgggc acctataatc ccagctactc 41281 aggaggccga ggcaggaaaa tcacttgagc taggagacag aggttgcagt gagccgagat 41341 cgcaccactg cactccagcc tgggcgacag agtaagattc catctcaaaa aaaaaaaaaa 41401 aagtgagaga gagagggaga agataatctg cacttttctt agaagacagt aggagctggg 41461 atccaggtgt ttggggaagg attggcccta ggtactcatc cactaattca ttcaacattg 41521 atgtgtcaga tagaacattt atgtgccaga cacaagtata cagcagtcct gggatagatg 41581 aggtccttgt tctcatggag ctcccattgt agcattgaga gaaaggtaat aagtaaactg 41641 ataactcagg cagcagtaaa tgccatgaag aacaggaaca ggatattgtg atagagagta 41701 atggaagact gccctaactg ggagttaaga tttgaagatg gagccagcct ttatgagcag 41761 agtgaagagc atcccagaca tagggaacac acaatgcaaa ggcccttgga gtagtgagag 41821 cttgtagtgc tcgtgggaaa aacaaaagac cagtgtagag ccgggtgcgg tggctcatgc 41881 ctgtaatccc agcactttgg gaggctgagg cgagcagatc acctgaggtt aggagtctga 41941 gaccagcctg accgacatgg agaaacccgg tctctactaa aagtacaaaa ttagccaggc 42001 atggtggggc atgcctgtaa tcccagctac tcaggaggct gaggcaggag aatcgcttgg 42061 acctgggagg cggaggttgt ggtgagctga gatcgcacca ctgcactcca gcctgggcaa 42121 taagagtgaa actccgtctc aaacaaacaa acaaaacaaa acaaaacaaa acaaaacaaa 42181 aaaaccagtg taggagaaca caatagaaaa gggggtgagg agcagggtaa ataaatgata 42241 tcagagaggt agagaacaga cttgcccagg gagggtatcc tttagggttg ggcggaatgt 42301 gaatttttac atgcatgtga gttgggtaga ttacttgtgg gtggtgggac cgtgggtatg 42361 tgctggagaa ggggggattc attttgtctc ctttttgaag ctgctctcat acctaccctt 42421 tctccctgca gcctcctgaa tgatgccagc ccagtatgct ctaacctcca gcctggttct 42481 cctggtgctg ctgagcacag ccagagcagg ccccttctct tcacggtcca atgtgacact 42541 gccagccccc cggccccctc cccagccagg gggccacaca gtgggggctg gagtgggaag 42601 cccctcttct cagctttacg agcacacagt ggaaggaggg gagaagcagg tggtattcac 42661 ccaccgcatt aacctgcccc cttccactgg ctgtggctgt cccccaggca ccgagccccc 42721 agtccttgct tcagaggtac aggccctgag ggtccgtcta gagatcctgg aggagttggt 42781 gaaggggctc aaggaacagt gcactggggg atgttgtcct gcctctgccc aagctggcac 42841 aggtgagcag gtgatcacag aagagggtgg agaggtgggg tggggtgggc attgctagtc 42901 cataaaggtc cttggtatga attagaagaa ggcactcctt cctcaccatg aggggtgtgg 42961 atgcagccca gaacacaact tggagagcag agctgggcta catgtcaacc aaagcatatg 43021 cgagggccct gagaggctgc atacattaca catactagca actgggagca gacatggcct 43081 tatgagatga ggctagcctg gctaggggcc tgcacatagg agcactacat aggctcagcc 43141 tctctcccag gagagagact gagacttgcc tctcccctcc tactccaggt cagacagatg 43201 tgcggaccct ctgcagtctc catggtgtgt ttgatctgag ccgctgcacc tgttcctgtg 43261 agccaggctg gggtgggccc acctgctcag accccacaga tgctgagatc cctccctctt 43321 ccccaccctc agcctcgggg tcctgcccag atgactgcaa tgatcagggt cgctgtgtcc 43381 gtggtcgttg cgtgtgcttt cccggctaca ctggccccag ctgtggctgg ccatcctgtc 43441 ccggggactg ccaaggccgt gggcgctgcg tgcagggcgt gtgtgtgtgc cgggcaggct 43501 tctcaggccc cgactgcagc cagcgctcct gccctcgagg ttgcagccag aggggacgct 43561 gtgagggtgg gcgctgcgtg tgtgacccag gctacactgg tgacgactgt ggcatgagga 43621 gctgccctcg cggttgcagt cagagggggc gctgtgagaa tgggcgctgc gtgtgtaacc 43681 ccggctacac tggcgaggac tgtggggtga ggagctgccc tcggggctgc agccagcggg 43741 gacgctgcaa ggacgggcgc tgcgtgtgtg accccggcta cactggcgag gactgtggta 43801 cgcggagctg cccctgggac tgtggcgagg gcgggcgctg cgtggacggc cgctgcgtgt 43861 gctggcccgg gtacacaggc gaggactgca gcacgcggac atgtccgagg gactgccggg 43921 gccgcgggcg ctgcgaggac ggcgaatgca tttgcgacac gggctacagc ggggacgact 43981 gcggcgtgcg cagctgccct ggcgactgca accaaagggg ccgctgcgag gacggccgct 44041 gcgtgtgctg gccggggtac actggaaccg attgcggctc gcgcgcctgc ccacgcgact 44101 gtagaggtcg cgggcgctgc gagaacggcg tgtgtgtttg caatgcgggc tacagcggcg 44161 aggactgcgg tgtgcgcagc tgtcctgggg actgtcgtgg ccggggccgc tgtgagagtg 44221 gccgctgcat gtgttggccg gggtacacag gccgggactg cggcacgcgc gcctgtcctg 44281 gcgactgtcg cgggcgcggg cgctgcgtgg atggccgctg cgtgtgcaac ccgggcttca 44341 ccggtgagga ctgtgggagc cgtcgctgtc ccggggactg ccgtgggcac ggcctttgcg 44401 aggatggcgt gtgcgtgtgt gacgcaggct actcagggga agactgcagc acgcgcagct 44461 gccccggggg ctgccgaggc cgcggccagt gcctagatgg gcggtgtgtg tgcgaggacg 44521 gctactctgg cgaggattgc ggtgtgaggc agtgcccgaa tgactgcagc cagcacggcg 44581 tgtgccagga cggtgtgtgc atctgttggg aaggctacgt gagtgaggac tgcagcatcc 44641 gcacctgccc ctccaactgc cacgggaggg gccgctgtga ggaagggcgc tgcctgtgcg 44701 acccaggcta caccggccct acctgtgcca cccgcatgtg cccggctgac tgccggggac 44761 gtgggcggtg tgtgcaagga gtgtgcctgt gccacgtggg ctatggcggt gaggactgcg 44821 ggcaggaaga gcctccagcc agcgcctgcc ctggaggctg cgggccccgg gaactgtgcc 44881 gggcaggcca gtgtgtgtgt gtagagggct tccgaggccc tgactgtgcc atccagacat 44941 gcccagggga ctgccgtggc cgaggagagt gtcacgatgg cagctgtgtc tgcaaagatg 45001 ggtatgctgg cgaagactgc ggagaaggtg agcaggcagc cttccccagt gtactctggg 45061 actgtgattc tgtgaacagg agccatgggg aagacctgag cctgaggaag agtggaggga 45121 gagcatgtca ttccaaggag ccagtgccca ccaggggcag cagaaccaca ggggtggggc 45181 tttccagtgg gcaggactgg ccttcagatc tctgaggagc aggttggggc ccatttagtt 45241 gcatttaggg agattctgtg ctcctgaaga tggagagaag gtgaaggcga tccaagcccc 45301 gttcagccct ctgtcagcta ttctccacta gcagagagga gtgggagtta ggatgggcct 45361 cttcgatgtc cctgagtaaa aggggctgtg ggtgggggtg gttgccttgt gctaaccagc 45421 tttccctaac ccattctctt gggcagaggt gccaaccatt gagggcatga ggatgcatct 45481 cttggaggag acaacagttc ggacagagtg gaccccggct cctggccccg tggatgccta 45541 tgaaattcag ttcatcccca cggtgagaga gatgccaggc tccaggaggg gactgagtgg 45601 gcaggacagg ggggcagggc tcacctcctg gaggaagttc caggtgatca ttggttgagt 45661 ccagatggct gggtacctga agctgctgta aaggctttct tagcctgctt gacactgggc 45721 actgaggcct ctgcagcatc tccagcaagc attggtccca cacagttgga acacctggta 45781 acagagagct cagtacttcc tgaggctgtc cagtccattg ttggtcatca ctgttagctg 45841 ttatcattgt tgcagaatcc gtctgcccct attctagcac ctgtccgttg gccctagctg 45901 ggtattcatc agcagctgga atgttttttg ctttccctgt gtcctacacc acaggccttc 45961 aggtatttgg agacaccagc ctgatgtctt cattcctcag gcaaaacctc ccagtttctt 46021 cgcccctttt ccctaagtct ttattctctg aatcctttac cttcctgtga ttcttgcttt 46081 tttcatgtgc ccccaaaatg tggggacaga gaggggccca ctgacttact ctgaatttat 46141 ggacactttt ctcagagctg gcgtatagca atttgtagtc acacaagcag gcagctgttt 46201 tggtcatatt tggtttttta agtaactttt aattataaaa gtaatatagg ccccgtgcag 46261 aaaacttaga aaatacagat tagcagggaa aaaagatgga aggaaagaag acaacattca 46321 actatataca cttttttttt tttcacctaa atataggcct ttgcatttgg ccctgttcaa 46381 tgttatcttt tttggtttgg gccaagctgt ctaggctgtt gcaataattt tgagtctgtg 46441 aacttcatct cccacctcct tgtcatcagt acatgcaaac cagttttttc tctgtctccc 46501 tccaacttgc tctttttttt tttttttttt tttttttttt tgagacagag agtctcactc 46561 tgtggcccag gctcgagtgc agtggtgcca tctcagctca ctgcaacctc tgcatcccag 46621 gcttaggtca tcctcttacc tcagcctccc aagtagctgg gactacaggt gtgcaccacc 46681 ttagcctgct aatgtttgta tttttagtag agacggggtt tctccatgtt gcatgcccag 46741 gctggtcctg aactcctggg ctcaagtgat ctgcccgcct cagcctccca aagtgctggg 46801 attacaggca tgagccacca ctcctggcca tccaacttgc tcttaaaaca ttagagggaa 46861 caccctaagg acaaagcctt gtggtcactt atttggggcc tccctttgtg cagactgggc 46921 ttttaagata tggtcgtttc cccagctgca aatcttctga catttctacc atccaactca 46981 tgttttcctg catcatccca caaatgccat atggggtgtg ggccatgtca tcctcatctt 47041 catcagctta gccaggactc tgtaaaaata aacaactccc tgggcctaaa gcctcggaag 47101 gcactaggac agaaggggac agctcagagg aactgtcatc ctcactccac tttgctttgt 47161 cttgtcatta tgctcctcag gccagccccc tactatgttc tgcctctcct cactctccag 47221 ccaaagcaga agcagaaatc tcaacttcgt ggcgcaacca agatagcttg aattttttct 47281 tctgtttaaa catctttttt tgttgttgtt ataaaaatac tttaatcacg agacaagaat 47341 ttggcaacta gagaaaagaa aaacaattcc tggctgctcc tcactaaaac aagcactgct 47401 aacattttaa agtgtttctc ccactctctt cttctctgta tatattattt acatagctct 47461 aatcttgaaa gggtgggact tcgaaaagga agtttcttaa gaaatttcag ggaaagaaga 47521 attaaaataa acacttgaga taagaggagc ttttagggca gcagttgtct cctgctctga 47581 gtaaataaat acaaaccgat ccttggtgtt gtcaagggcg ccgctttcca gttgaagccc 47641 tagactttct ccctcactca gagccaggat ggtggtgcca agttggcctg tgccaggctt 47701 ggccatgttc ttagctcatc cattctgctc tcggtatggg catctgtcac attctgggtc 47761 ttctctgagc ctgggtgtgg aaaacactcc tgtaaaattg gtccccagac actcctcttg 47821 ggcctgggct acccaccctc agaccatgct ttctgctgtc tgtagctggt ccctggcatc 47881 gtaccttctg ccttggcaag tgccatctat ttcttttctt ttctttcttt cttttttttt 47941 tttttgagac agagttttgc tcttgttgct caggctggag tgcaatggcg caatctcggc 48001 tcactgcaac ctctgcttcc caggttcaaa cgattctcct gcctcagcct cccaagtagc 48061 tgggattaca gacacctgcc acaatgccca gctacccttt gtccatttct gtgggtacag 48121 agtatattca atagctgtct gtcctctggc tgctgctgtc atcctagaag cctatctcta 48181 cctcttacct tccctcttcc cagaagcact tctctctcaa agcccaaact ctgctattcc 48241 agatcaaggg acatacctag ttattggaat tatttttcag aatgcagaag aagaaagttt 48301 ctcctctccc tgtatatgtg gcactgttac tctgactcag tggttctcaa agtgtgttcc 48361 ctggacctgc agcatcagca tcacctggga gtatgttagg catgcaaatt gtcagactcc 48421 cccactagat ctactgagtc agaaagtctg ggcacagagc ccagtaacct gtattgcaac 48481 aagccctcca ggtgattctg atgcccctcc agcctgaggc cccctaagtg aaggaatgtt 48541 gctattttgc aaagtgctag ctccactatg aatgtgagag aaagcatggg gaaactgagg 48601 gagatgagtt cctgaatgag aaagaggttc agatattcct tgtctgggtg tgtcctgcag 48661 cctggaaatc agggctaggc taagtgttca ctgggactca aaacctcaca gcctcagaga 48721 agtttttgct tgggtctagt ctaaaaacag caatgagggt gagataagct ccaaggacca 48781 gccaaggtag agttcatctg gagccacgag gttaatattg agataaactc taaaacttcc 48841 caagaacaaa atcagtgaag caggccaggc gcagtggctc atgcctgtaa tcccagcact 48901 ttgggaggcc aaagtgggtg gatcatctga ggtcaggagt ttgagaacag cctgaccaac 48961 atagtgaaac cccatctcta ctaaaaatac aaaattagct gagcatggtg gtatatgcct 49021 gtaatcccag ctacttggga ggctaaggaa gaatcccttg aatcttggag gcggaggttg 49081 tagtgagctg agatcgtgcc attgcactcc agcctgggag acaagagcga aattccgtct 49141 caaaaaaaaa aaaaatcagt gaagcagcca agctttacca aggggcagag ccagctgcca 49201 gcatctgttt agaaagactg cttcagggcc aggtgtggtg gctcacgcct gtaatcccag 49261 cactttggga ggctgaggtg ggtggatcac ctgaggtcgg gagttcgaga ccagcctgac 49321 caacatggag aaaccccgtc tctactaaaa atacaaaatt agccggacat ggtggcgcat 49381 gcctgtaatc ccagctactc gggaggctga ggcaggagaa tcgcttgaac ccgggaggcg 49441 gagctcgcga tgagctgaga tcgcgccatt gcactccagc ctgggcaaca agagcaaaac 49501 tccctctcaa aaaaagaaaa aagaaagcct gcttcagacc agtaagatgg tgctaggaga 49561 cgtaggagac atgggttagg ttgatggact ttcaggggac tgttcccaga tggatctcag 49621 gaggaaccaa gaagtgtcta aaggatatat atagggagaa ggagaactct ggggataggc 49681 ttgcctcact gaaagaacca gacatcatca gggagcatcc aaggtgtgtg tattgcggct 49741 gtggggatgg ggtgggtgtg gataaagaga aagagacata cacacacaca gagaaaatga 49801 acaaataaat tctgggaagg gtaaggattt catgggtctt gagttgaggc caacccaagg 49861 cttcctaggc tggggtattc taaggcaggg tgaataggag agggctggag gtcctatgtg 49921 cccaggaaaa cttttcaacc cagaaaactg acaaagcttt acttcatcct caacccagaa 49981 atctctacgt ggccttctca tctggtttta tacttcggta tttactatgt tgatttcccc 50041 acagttctct tggtttagga gagtaccttt tgaaatttcc aatcctactt ggctgaatgg 50101 aggtgtcccc atcacagcca acatagctag gcagaacaca gcatccttag cggctgtcct 50161 agctggacag cacagcaaaa tgtggggctg ttagaatgct cacagatgta cagttcctcc 50221 tgtttcccca gtggtgggca ttccaggaga ccctgcctgg agcagggtcc agtagattcc 50281 ctttggaggc accacacata atcataattc cttaaatgta cacttgtagg agtgttgttt 50341 tctcacaagc atttgctaag catagagatt gttgtaagta tctcatatat tcaagatgtg 50401 gccatacatt tagctattag agtctcagct acctgaggac agggaccata tattttattt 50461 ttgtgtctcc aatgcccagc atgatgccag gtacaaaatc aacaatcagg aaacatattt 50521 gtggaacttt gagcaggtgg ttcttgtttt gaggagccta caacccgggt gaagagacaa 50581 gaaaagtccc tcaagggaaa ggttgtgatg cgggtcccac tcctgttaca cagccatgag 50641 caggcatcta cgaggaggtt cagggtgagg cagacccttt aaggagaaag tgcaacttga 50701 aggggcatcc ttgatctact tcatttgttc tgacaggacc aagcttagga aaaggctcct 50761 catgcccagg tagtcagagt tgccagggca aagctggagg gagagtcctg gtgcatgttg 50821 ggtggaattc tggcagatga acacggggtg cccatgagag gggactctga cagataaata 50881 gggggtgccc atgagagggg actctgggat cacactgcta gggcctaaat tcttgccatt 50941 ttcttggtgt tggtgacgtt ggcaagatac ttgtccttct gtgtctcagt ttccttatct 51001 gtaaaatggg actaatagga cctgtttcgt agggttgttg tgagaattaa gtgagttcat 51061 gtgtgtacct cgcctagaac agtggctcac aaggagttag cgctctcagc atgtttgctc 51121 cccagcttgt acagggtcaa gttccacaca gtaccatgct gacactgttt agaattggag 51181 aaagaagaga taagggggat tgagcagcag aggcaagagt gccagcccct gagccacctg 51241 gtgctctctc tcacagacag agggggcgag ccccccattc acagcacggg ttccaagctc 51301 tgcctcagcc tatgaccaga gaggactggc ccctggacag gagtaccagg tcactgtccg 51361 agcccttcga gggaccagct ggggccttcc tgcctccaag accatcacca ccagtgagag 51421 ctggggctgt ggggaggggt gtcctgagtg gagatctgga ctagagaggg aatctgccct 51481 ccgggagggc agaaggaagg ggctggatgg gggctaggct cttggaagga aagatgatga 51541 ttgaagaacc agacacccac ctgagtcctc ctctttaacc tggctagtga tcgatgggcc 51601 ccaggacctc cgagtggtgg ctgtgacacc gacaacactg gagcttggct ggctgcgtcc 51661 ccaggctgag gtggaccgat ttgtggtgtc ctacgtcagt gccggcaacc agagggtgag 51721 gctggaagtg ccccctgaag cagacgggac gctgctgact gacctgatgc caggcgtaga 51781 atatgtggtg actgtcacag cggagcgggg ccgggcagtc agctacccag cttctgtcag 51841 ggccaacaca ggtatggctg gccagaggtt agggaagggc cctggtttcc cagccttgga 51901 tcctcctcct caggagccca ggctcagggc tcacagactc ctctgaatgc ttctggcagg 51961 tgtgtggcag tctcgaggca cctgctgggg gccttgctct gtcctgaggc cactggggag 52021 acagagccct cagatgcctg gagtcctgta ggaaggttac aggttagcct atgaaaggaa 52081 tggccccagg gagagaagtg aataggagat gctgcctgat ccagtcagtt aagggaggtt 52141 cttttttaaa tttaggtacc gggacaggcc ttgtgctgat agatcttgtg ggactggaat 52201 taggcagatt agagatcatc agtcctgttg acttttggat tgggattgct gggaggtggg 52261 tctcacagca gtgattttcc actaaacctc gaggtttctt aacagatacc tggatatttt 52321 tcttttcctg tgacaggggg acacccacct cgctgtagct catgttctct tccccgactc 52381 ccctttctct ccttcagcta cagggcccca cccacgggct tctctcttct ttctagggca 52441 ccaacagtgg tgggcttgga ggggaatggg ggggctgcgg gacactgacc gatttccctc 52501 cgttctcttt ccaggtgggg ctggggcaca gggaagaggc ctctggctct ggtgggaggg 52561 atcgagggag gggctgccgc gggaaggagt gccgggaggg agctgccact gacctgttct 52621 cccctttttt gcccctggca gcaccaggcc actacagtta cccggaggtg cgccccccag 52681 ccccgccccc caagtcccgg ccccggccag ccccagcccc gcggccccca cggccccctt 52741 ggccctcgag gccagcagag gaaagggagg aggagtcccc gcccaggcca agcctgtccc 52801 agcccccacg gcggccttgg ggcaacctga cggccgagct gagccgtttc cgcggcacgg 52861 tgcaggacct ggagcgccac ctgcgggctc acggctaccc actgcgggcc aaccagactt 52921 acacgtcggt ggcgcgccac atccatgaat acttgcagcg gcgtctgttg gccgccgccc 52981 cagccggctc ccccgcaccc ccgccccgcc acccccgccc caccgccagc cctgatcccg 53041 gcaccaggaa acgggactcc aaccagggaa tctacggcct ctcgcctgaa ggcgtcgacc 53101 gggtggctgc gtcccgccac cccaagccag aggtgctggg cagttccgcc gatggcgcgc 53161 ttctcgtgtc tctcgacggg ctccgcggcc agttcgagcg cgtggtgctg cgctggcggc 53221 ctcagccgcc tgcagagggc cccggcggtg agctgactgt gccgggcacc acgcgcaccg 53281 tcagcctgcc cgacctcagg cccggcacca cctaccacgt ggaggtccac ggggtgcggg 53341 cggggcagac ctccaagtcc tacgccttca tcaccaccac aggtagtgtg ggctggggcc 53401 acgggacacc tatcccttgt ccagcctcac ctgccgttgg agcctgcatt catgaattcc 53461 tcccactgcc accccccact gcccccttct accttcaggc cctacctggc cggggctcca 53521 gggggccccc aggacaaagc acgagtccat tgtccctaga agacgtcccc accccagggt 53581 cctggcatcc atcctgtgga tgcctaaggt caaggcagcg ccttctttgg ggtttcccgg 53641 gacacttcca ggattgttat tttaggcaat agagggtgat gtggttgggg cacctgcaaa 53701 gattcactgg agcagtttga gtgggctggg aattgaaaac agtgattccc gttctgactc 53761 actgtggtga gagtggctga agactgtggt tagtggttac tgttttccac aatttggcct 53821 tgaccttgaa gggacacatc agaactgcgt gggggtggcc tgggtgccat caggcagcaa 53881 ctgtatattg gggaggtgaa accactgctt cttaagctgg cttttcagac cccctggctc 53941 actttatgct ggtctctccc acaggaacga tgtctgggct tggggaaagg gcaccctgtg 54001 gctgctgtgg ggagggtggt ggacctcttc ttcccccata gggcccagat gccacctgac 54061 ctggatgtcc acccccacac cctctctctc ttcccactgc ccactctcct cttgcatcct 54121 ccctctctaa caccactgtc tttcttccag gcttctcact gcagcactaa tggaagccac 54181 ctgaccacca cccagtgggg gggatgtatg tggggaggga catggaagtc tccgggggct 54241 ggaggtcatg gaagccttgt ggggtgaagg atagcaggct ttattttacc tattacctgc 54301 ctgaagcaca ggcccgtctg ctcaggaatg tggtagatgg gggtgtttgg aggattggct 54361 gacctttggg aggagggagt ggcccaggtg agggaaccag aagtagaagt agaacaaggc 54421 tcttagaggc tggagggcag aacctgggga ctggggattc ctttctagtt ctcatactgg 54481 actctcctct ttttctcacc ctgccccttc ccagggtcct cacccttggg cctcttgggg 54541 actaccgatg agcctcctcc ctcaggcccc tcgacgacgc aaggggccca ggctcctctc 54601 ctgcagcagc gcccccagga gctgggagag ttgagggtgc tgggcagaga tgagacaggg 54661 cgcctccgtg tggtctggac cgcccagcct gacacctttg cctacttcca actgcgcatg 54721 cgggtgcccg aggggccggg ggcacatgag gaagtgctgc caggggacgt ccgccaggct 54781 ctggtgcctc caccccctcc tggaaccccg tatgagctgt cacttcatgg ggtccctcct 54841 gggggcaagc cctctgaccc catcatctac caaggcatta tgggtacatc ttggactctg 54901 ctcagatcaa tatcctgaga ttgggggagg agctggggtt attgggactt cagaggtagg 54961 atggatattg aggggccagt gaagggcagg gctgatgttt ggggctggat agaggagaaa 55021 aggagagaaa aggaattgca ggttcagcag gaagatggat cgagggggat cttaaaacaa 55081 tgataaggat gggtggggag aggggaaaga ggcagaggta aaaaggccga gggtgggaca 55141 gagattccag gagagaggct gaaggtccct ggccctccac ctgcagcttt ccttgccctg 55201 tcccctgccc ccactcttcg gttgctcctc tgcccttccc tgccctataa atctttcagg 55261 gtgggcatgg gaaggcctct ggacactgac cctcagcccc caatacctcg tccctgggag 55321 cctggtttgc agtggacagc agtccatggt cccatcagcc agtttctgct gggggtgaag 55381 gctccttggg gcagggggct ccacagccat caggacttca gacatggatg aggtttgcag 55441 gaccagtgaa agacaggtcc tggggtacct ggctttcaca tcccctcttt gttcctctct 55501 gtgggaccag ggtggctgac tcatccccca gctccaggtc aagctcacca ctggaagaga 55561 gggtggtaag agagacagca gctcaggtgg gcagaggagc tttcagactg ctctgatgct 55621 gcagcaggga ggcttgaaac tctcattcag aaccaggggg acctgaaacc cccgcaaaga 55681 tcctggtgtg cagagagacc aagaccttgt gagggatgga gaaactgtca caagggaaga 55741 agcagagatt gagagggaaa tgcccctctt ccagaagagc tagtcactga ggggtgggca 55801 gatagagtct tccggggagg gatctatcca ggatggagtg aggtgggcag gggagaggga 55861 aggagacaca gacaagagaa cggttgagtc cactatgctt tgcttttgcc tctcaaactc 55921 cagacaagga tgaggagaag cctgggaagt cctcaggccc accacgcctg ggtgagctga 55981 cggtgacaga caggacctcc gactccttgc tcctgcgctg gacggtcccc gagggcgagt 56041 ttgactcctt cgtgatccag tacaaagaca gggacgggca gccccaggtg gtgcccgtgg 56101 aaggacccca gcgctcggcc gtcatcacct ccctggatcc tggccgcaag tacaaatttg 56161 tcctgtatgg gtttgttggc aagaagaggc atggtccgct ggtggctgaa gccaagatct 56221 gtgagtgaca gcagtaacac cctgccctct gtactgccct gaagaggttt tctcagtgct 56281 ttgggaacct gctttgggag ctccgggaag gcttcctgga ggcagtggtg catgagctga 56341 attctgaagg gaaagtagtg gtggggcagg caaagagttt gagaagctca gtgcagcctg 56401 agggagtcac ggtgagtaag gtggggagag catggccagt aagagaatca ctgttacagt 56461 ttcctctggc tggagaacag ggcatgtggg gggtgccaaa gggcctggtc tcagggcttc 56521 ctgtgctgtg ctgaggggct tgggcctcat tcgaaggccc ctagagggcc tctgaagggt 56581 aaagtgggag agtgatgaga ctagatattt atcttgcaaa taaccgggtg gctgctgaga 56641 ctagctggag aagtgagtta agagctggta aaggctagca gggttgatcc tgcctataat 56701 cccagcacct tgggaggcca aggcaggagg atcgcttgag gctaggagtt tgagaccaac 56761 ctggcaacat agtgagaccc cccatctcta ctataacaaa agatattagc caggcgtggt 56821 ggtgcgtgcc tgtagtccca gctacttgag aggcttgagg tgggaggatc gcttgagccc 56881 aggagaccga ggctgcagag agccatgatt gtgccactgt actccagcct gggtgctatc 56941 cacactctgc acagagcaac accctgtctc aaaaaaaaga aaaatgatga tgggtcagct 57001 gggatggtgg cagtggggct ggatggggga ggcgaaggaa agcaaggtgt accaggcagc 57061 ccccagggtt ctggatgaga cagtcccatt ccctgaggca gggagcagca gaggtgggga 57121 acaacgcctt agataaactt agatcaatat ctaagagaca cttacataaa taccttttgt 57181 gcatttattg agagtcagtt gtgatcaaga tgttctctaa gtggatcatc acaccccata 57241 aagcacacct gggaggcatg cctactttat aggtcaggag aattgaggca ccagagtagt 57301 cgtgtggggt tggggaagag acagggccac tggaaccctg tcgacagcct acactcacag 57361 cggaggcaga ggaagaagag cccacagggt agcttgcgga gtgacttgca gggaggggca 57421 gaaccaggga gggtgctgtc tggactccag gaaggagggt gcggttggac ttgtcagatt 57481 ccggaacagt gattcaaggg ctacggagcg tccacagaat ttagcaatgg caaggccacc 57541 cctggcctgc atgagagctg ctgcctggtg gaggggtgga gctggatcgg gagggtggtg 57601 gaggtgagga catggctgca atcagtgtag acatcacttt ctggaatctg ggctgggaat 57661 gagagcaaag agggaagatg gaggctagag agagatgcac acagagtggg ggtgaggagg 57721 ggtgtgacag tagcttgttg aaatgctgat ggcacagctg ctggggagtg aagggtcagt 57781 gctgcaggcg agggagtcca tccatgggtc aaagttccca aagaggagga gaggcctgca 57841 gcatagatga gggaggctgt cccacattaa aacgcttcaa ggattgtgct ctcctctctt 57901 ggagaagaag ccagctagta gttaggcttg gaagcagtgg aggttgtact gggaagattt 57961 ccctggctat ggagatgtga aagggcctgg ctgggctccc ctctttctga agctccctgc 58021 tgctctggtt tgtggttgga ggcagagcca cactagtcca cgcagtgctt ccatgccgtt 58081 tgaaagaagg cacccattcc ttaaagtgga caccactcct caccagggca cacagctcgg 58141 caagcagagt gctggctgga tttcagtccc tctgtactcc ctcagcaaat gcacttttgt 58201 gcagtgaaca acctgcacag cctcacacag cattcctatg tgggatttgg cttccccatt 58261 ctccttggac ctaagtctct cccttctctc tacccactct tcctagtgcc tcagagtgac 58321 ccaagtccag ggactccacc ccacctggga aacctgtggg tgacagaccc taccccagat 58381 tcactgcacc tctcctggac tgtccctgag ggccagtttg acaccttcat ggtccagtac 58441 agggacaggg atggacggcc ccaggtggta cctgtggaag ggcccgagcg ttcatttgtt 58501 gtctcctcac tggaccctga ccacaagtac agattcactc tgtttggaat tgcgaacaag 58561 aagcggtatg gccccctcac ggccgatggc accactggtg agtagcagcc acctcagccc 58621 ccatgtgacc ccttctcagc cccatgcaca cttcccttca gcccaaaacc agctggggtc 58681 ccccagcaag acagctgact ccagcttccc ggccccactc cagggctgag gggcagccag 58741 actcccccgc actgctgggc cccttccatc ccaagcagca gctccccgtg gcccagaagg 58801 tgtgtcatac cctggctgtg tcaggcttcc cagaagttta gcactcgaat taatggacta 58861 gtgacccccc accccccata ggtgtgacat cccgtcgaaa ccccaagccc cagtcccagg 58921 ctgccagttc agcacctggc tggatctcct tgtttacagc tccagagagg aaagaggagc 58981 ccccccgccc tgagttcctg gagcagcccc tcctggggga actgacagtg accggcgtga 59041 ccccagactc cttgcgtctc tcatggacag tggcccaggg ccccttcgac tcattcatgg 59101 tccagtacaa ggatgcacag gggcagcccc aggcagtgcc tgttgcgggg gatgagaatg 59161 aggttactgt ccccggcctg gatcccgacc ggaagtataa gatgaacctc tacgggcttc 59221 gtggcaggca gcgtgtgggg cccgagtctg tggtggccaa gactggtgag tcatggctgc 59281 caggcctccc tcccctgcta gccccatcct gtgagcggga cttggctggg gctccttcca 59341 gcctccctcc atcttcgcct tctcagctca ttttgccaaa gcctccaccc acttcctccc 59401 gcactgctct cctgtcctct gcatgccagg tccccctcat ccagcctcca cctcctcatc 59461 tcttcctcag cctcacttct tcatctccaa gggcaggtcc tattttcccc cattccaaac 59521 acctggactc atccttccac ctggggctgt cctccccaca gatgtcctca ccattcggtc 59581 actgtgttct cacccctcag tttttactgg gctccttcat ggtctggttt gggccatggt 59641 cacttctggg cttctgtcac ctgctgccca caacctcctg ccatttctgc tgtgttcaga 59701 ggtggcagat ccaagtgaca agagcaccac gtttctctag cctttttgca gcatgaagag 59761 gactcctctc agctcccgac tccttgactc ccagccccct tggaacaggg caaagggatt 59821 gtgttcagcc cagcatccca gcctgttcct ccatgtcccc agcctttccc tgtccagtgc 59881 cattcccctg gcctgtgaat tatttgatga gagatagtgt ggatagtggt taggagcaaa 59941 gactccggag ccaactgccc gagtctcact accagccttg caacttactt cctgtaggac 60001 cttgtgcaaa ttgcttaacc tctctgtgcc tcagctttcc cttctgtcag gtggctggat 60061 ttcagtccct ctgcacagag ctgttggaat tacatgagtt aatatatgta taacacttag 60121 aaaggccagg cacagtggct catgcctgta atcccagcat tttgggaggc caaggcgggt 60181 ggatcacgag gtcaggagat cgagaccatc ctggctaaca cggtgaaacc gtgtctccac 60241 taaaaataca aaaaattagc caggcatggt ggcaggtgcc tgtagtccca gctactcaag 60301 aggatgaggc aggagaatgg cgtgaacccg ggaggcggag cttgcagtga gccgagatcg 60361 cgccactgca ttccagcctg ggcgacagag tgagactcca tctcaaaaaa aaaacaaaaa 60421 caaaaacaaa aaaagcccac acacttagaa tagtgcttgg agcagcaaag cattttcaga 60481 gtgagctgca atttctccct gactctctca cttcccagta atcttttttg ctccccacct 60541 ccctcccttc ctctgggttg tcattcttgc caattctttc ctcaatctgc tatcccaaag 60601 tagtccccca tgtagcaggt cacagaaggg ctggtttccc ttgtgccaca gtctccccac 60661 tccatggagc ctcagcagag acctaactgt agggggcggg taggtgccca cagggctgtg 60721 attccagagc cagccaggga aggacatgtc tctccttgga tcccactcac cccagaattt 60781 ttctcccact catgcctgga gccctgcctt agcaattcat tccttggggt taaaggggag 60841 ggaggggcag ggctgcctgg ggaggccaca gcaagagcac attccatggt cagatcctca 60901 tatgaggggt ggggcagcca gagtgccctg gggagggcac aggtaggaag aggcctcact 60961 gccctcctca ctgtctgagg tcagcactgt cagtgaacgt ggtttggaag aaccctcaga 61021 ggcttcctct cttcccatct cagctccttt ctcagacacg aagggagacc cctcaccccc 61081 acagaaccca ccagagcagg atggggcaag ggctgtctca gccttctcca tccgtgagtg 61141 accgaatggc aaatacaaag gctacagagc ccctgagccc acagcgtaag ggcatttcca 61201 tggctgtcat ctgtgggcag cagcagcaaa gccagcagcc ctttatgcct ctatctcttc 61261 tctcagctcc tcaggaggat gtggacgaga cccccagccc cacagaactg ggcacggagg 61321 ccccggagtc ccccgaggag ccgctcctgg gggagctgac agtgacagga tcctcccctg 61381 attcgctgag cctcttctgg accgtccccc agggcagctt cgactctttc accgtgcagt 61441 acaaggacag ggatgggcgg ccccgggcgg tgcgtgttgg gggcaaggag agtgaggtca 61501 ccgtgggagg cctagagccc gggcacaagt acaagatgca cctgtacggc ctccacgagg 61561 ggcagcgcgt gggcccggtg tccgccgtgg gcgtgacagg tgagtgaggg gcaggggcct 61621 gctttggttc cctcccactg ctggctccca gcttccctgt agccagttgc tctccttgct 61681 ccagtggaac cttgaggctg cctgccaggg gaagacctag aaagagcatg tcactgggga 61741 ctggcgtacc accagctttc atggccctga gcaagtttta aactttctgc acctctgttt 61801 ccttgtccgt aaaatgggga cacctacctc acagagttat tgtgagggtt aagtgagtta 61861 acagatgtaa agcatcaaga atgggcctgg caaaagtaaa tatgctgcca gtgtctgctg 61921 tgagaattgt tataaatatt cttactctga ggtctgcttt gtcccctttg gctggttcag 61981 ttacttggtc agaatgcaca tgaatctaag gccataccac cctgaaggtg tcctatcatc 62041 tcctctgatc tcagaagcta agcagggtca ggtctggtta gtacttggat gggagaatgc 62101 gcgtgagaca ttggcctctt catcagtaca gcttgaagca agctttcttt tctttcttgt 62161 cttttttttt tttttttttt ttttgagaca gagtctcact ctgttgccca gcctggagtg 62221 cagtggtgcg atctcagctc actgcaacct ctgcctccca agttcaagtg attctcctgc 62281 ctcagcctcc caaatagctg ggactacaga tgcgtgccac cacacctggt tacttttttg 62341 tatttttagt agagacgggg ttcaccgtgt tggccaggct ggtctcaaac tccagatcct 62401 caagtgagcc acctgcctcg gcctcccaaa gttctggtat tacaggcgtg agccaccacg 62461 cctgaccaca aagctttctc tttcttcttt ctctcttttt ctttctttct ttctttcttt 62521 ctttctttct ttctttcttt ctttctttct ttctttcttt ctttctttct ctttctttct 62581 gtctctctct ctctctttct ttcttttttt cctttctttc tgttttttga gctctgttgc 62641 ccaggctgga atgcagtggt gcaatcttgg ctcactgcaa cctttgcctc ccaggttcaa 62701 gtgattctcc tgcctcagcc tcccgagtag ctgggattac aggcatgcac caccacacct 62761 ggctaatttt tgtattttta atagagatgg ggttttacca tgtcagccag gctggtctca 62821 aactcttgac ctcaagtgat ccacccgcct tggccttcca aagtgctggg attataggca 62881 taagccacca cgcccggcca caaggctttc tttaatactc atccagtgag tgactttaca 62941 gcaaagaaga acatcccttc caccctttct ctcaggcctg gttacagggc cacatccacc 63001 accacacagg accagcctgg gcacctcctc tccagggcct cagcccaaga ggctgagtgg 63061 acagcagtgc ctctgggacc ctcgggctga cctgcccggt ttagtcacag tgaggggaat 63121 ctcttttttc ctttgctcac agaaacaaag acacaccttt ctcaaaaata acttttacta 63181 taaggggaaa gatttaaaaa ttaaaagtag agaaatagat agttttccag aacatggttt 63241 ctgtaattcc accaagctca gcctcatcat gacttggttg atttcatttc cagaaccttc 63301 caccctcaat ttattttctt tgcctcagtc agccctgaga agagggagac ggcaaagcca 63361 ccctgagtgg ctgtccttgg ggcctgaggt tgtagggtgg aggctggtgc caggcccaga 63421 catgctgcca agtctgctcc tggagtgacc ctccctctgc cgctctgtcc cagggaccca 63481 cactgggctc tgtgaggccc tgtgcctggt ccctggggta aggatgggct ctgtcaggca 63541 ctgaccaagt cggtgcagta gagataaatg gagtttggga atctgacaaa ctgagttcct 63601 tcctgtcttc ctctgttctc agctgtatga tttttatttt tttattttat tattattttt 63661 ttgagacaga gtcttgcttt gtcacccagg ctggagtgca gtggcatgat ctcagctcac 63721 tgcaacctct ggctcccagg ttcaagcagt cctcctgcct cagcccccca agtagctggg 63781 attacaggcg cccaccacca cacccggcta atttttgtat ttttagtaga gacagggttt 63841 caccatgtta gccaggctgg tctcgaactt ctgacctcag atgatccacc cgccttggcc 63901 tcccaaagtg ctgggattac aggcgtgagg caccgtgccc agccagctgt atgatcttga 63961 aaaacttacc tcctctcctg gaacctcagc ttcctcatat tcaggaggga gtaaacgctg 64021 cccagcttgt tgtcacggtt aaaggtggtg atgtaacatg tctggcatct tctcagctca 64081 ttcatctttt gctcagacct tcccgtgagg gaatccctgt atatccatgt tctaatgctg 64141 gatattttga aacatgtatt ttacacgtcc tggataaaat agatgaggcc ctttaggaaa 64201 gaaatgggca tcagtgcgca ttatcagagc agcaaaagga cgaagcagct ggagtccgcc 64261 aagctggggg ctgtgtctgg cagcacacac agtgcagttg ccagggggag aggtgcgtgg 64321 agctcattag acacacttct aaattggaaa aggactgtcc cacatgcagc atggggactg 64381 ccagagtcat gaatcgttat aattgtacct ttccagttaa tccaaaccac cagcaataaa 64441 gcttgatggt gaaggacatt ttcattccct catggaaaga agtgcagaga gggcaaaaag 64501 gagcagggaa tgggtgctgc aggaacaagt aaggttagaa ggcaaatgaa atgagagaaa 64561 atgcccagcc tggtacctgt acatagtaga tgttcaatac ttggtggtca ctctgctccc 64621 tcccaccacc tcagctgcct ccatgcctgc cctccccagc ctgactcatt tctgtctgcc 64681 cgaggcctgg tcaccagcct gttttaagca gttgtcactt gcttggagcc ttgtcttgcc 64741 ctccggagga agccagagaa acaagttctt gggagcccca cccatccagc caccttctct 64801 gccattttgc ctcttgggac caccttctag tttgctcact aatgagtgct tcctcatgcg 64861 ggagtggtgg ggatctgcag gtcctgagcc cagacctgta gaccactcat tattttacag 64921 acatctgtgg agcacctact gtgtgccagg gactgggaca cagcagtgaa taaagcagag 64981 ctcctgcctg aggagctcac attctagtgg agagggacag gcaacaaatt gcacaaatat 65041 gtcagtggca ccggatgtga cagtgatgag tggtggggaa aagcagagcc tagtagggag 65101 acagcaaggg cctcactgcg agggcagtgt ctgaggaaag ccctgaagga ggtgagagga 65161 cagccaccta ggcctttggg gaaagagcat cccagacaga aggaagagca aatgcagaga 65221 ctggcaggcg agaagcagcc ccagatgtct gcagacagtg agggatgcag tatgttttgt 65281 ggtggagttg gaggagggca ggcaataggg aatgatctaa gagatgaata tgggacctca 65341 gaggccagtg caggacttgg atttcaccct gagtgaggta gaagccactg cgggaaccag 65401 ctctggctaa tgtgttaaca ggatccttct ggctgcagac ccaacagact aaaggggggt 65461 gagggcagaa ggcgatgacc tgagagaatg ttgtgataat ccaggaggag ccagtggtgc 65521 ttggaggtgg caaggagggg tcagactctg aaggtagtga caacaggcgg atacccacag 65581 ggcggatgtg gagtttaaga gaggactcgg ggatgactct ggcggtttag atctgagcag 65641 ctggaagaat gaagttgtaa tcctggtcag gcgtggagga tcatgcctgt aatcccagca 65701 cttggggagg ccgaggcggg cggatcacct gaggtcggga gttcaagacc aacctgacta 65761 acgtggagaa accccatctc tactgaaaat acaaaattag ccggacgtgg tggcacatgc 65821 ctgtaatccc agctactcag gaggctaagg caggagaatc gcttgaaccc aggaggcgta 65881 ggttgcggtg agccgagagg gcgccatcgc acaccagcct gagcaacaac agcgaaactc 65941 cgtctcaaaa aaaaaaagtt gtaatcttac tgaggtgggg aggctgtggc gggagcaggt 66001 gtcaggaagc tcagaggctc agttttggac ttgttgagtt tgacacacaa gtaggcctcc 66061 aaggaagacg ttgcacgaac actagtctgg ggttcagggg aggtctggac agagggtgta 66121 aatctggagt cattggcata tagatggaat ttcagcgctc tcccagggtg tcagtgatgg 66181 ttgatactct tctcccccaa caaatgcagc ctcagacacc gagcagggac ctagtgaagg 66241 actgttgacc agacagatgt tcagatcaat gcgcttttac atcaaaacca tcatgcgggt 66301 tccacaaatg caacagacat cgctgggtgt ggtggctcac acctgtaatc ccagcacttt 66361 gggggctgag gcaggtggat cacctgaggt caggggtttg agaccagcct ggccaacatg 66421 gcaaatgccg tctccactaa aaatacacac acacacaaaa ttaggcaggc atggtggcgc 66481 acgcctgtag tcccagctac ttgggaggct aaggcaggag gatcgcttga acccgggagg 66541 cagaggctgc agtgagccga ggtcgcccac tgctctgggt gacagcgaga ctccatctca 66601 aaaataaaaa ataaaaaata aataactaga catcattact ataagaaaag ggggattctt 66661 tactaacagc cccctaccat ctctgtccca gccccacaac aagaagagac ccctccagcc 66721 actgagtccc cgctggagcc acgcctagga gagctgacag tgacagatgt gacccccaac 66781 tctgtgggcc tctcctggac agtccccgag ggccagtttg actccttcat agtccagtac 66841 aaggacaagg acgggcagcc ccaggtggtg ccggtggcgg cagaccagcg agaggtcaca 66901 gtctacaacc tggagcctga gagaaaatat aagatgaaca tgtatggact acatgatggg 66961 caacgcatgg gccccctgtc tgtggtcatc gtgacgggtg agtaatgggg ggactcagtc 67021 ctcatctctg gttacccaga gctcccccac cccatggccc ttcttccagc cctgccttgg 67081 taccaccgca ccctcactga ggggctagat cccctcaggc cctggctcgg tgaccctgcc 67141 agtctttctg ttccctgacc cctttactcc tcccagggct agagccttct cagccctcca 67201 gctccttccc catctccact ctgccccccg cccaggggaa tgtcttattt ccatttggcc 67261 taaatccagc cacagacttt ctctccccct ctccgactgc ccacagctgt ttccacccct 67321 tctctttaca accctgctgt gacttggaca cagctgccac ctcctgcttc tgcccttttt 67381 tgttgaggcg aagtgcttta catcctgagt cccttctcag gagtcctctg ccacctgcct 67441 gcactgcatt ccatcccagc cctggagagt tttcatctct ccaggaggct gcctcactct 67501 tgctctgggc tccttcctgc ccctcatggc ttcctgggca caggttctcc atgattgttg 67561 tgctgtggca tctggtgaaa cccctacccc acctgaaggc aatgttttta aatctgtaaa 67621 ataagctgta taggattaca aaggaaacat tatatttata gacagttatc aaaatatttc 67681 ttttttattt ttgagatgga gttttgctct tgtcatccag gctggagtgc aatggcgcga 67741 tctcggctca ctgcaacctc cgcttcccaa gttcaagcaa ttctcctgcc tcagcctccc 67801 gagtaactgg gattacgggc acgtgccacc acacccagct aattttttgt atttttagta 67861 gaaacagggt ttcgccatgt tggccaggct ggcctcaaac tcctgacctc aggtgatcca 67921 cccgcctcgg cctggcatga gccactgcac ccggcccaaa atatttctta aaaatccaaa 67981 gttacgattt agtaatactt ggacttattt actagttcat taaataacag gacttggtgg 68041 cgcgtttact aaactacaaa atttagaagc agtgatggat ataaaaggta ttttaaagtt 68101 tctgcaataa ctctaaggtg acatgaaaac atctgtggtt tctattggta acaaagtcac 68161 aataattcta ctctggtggg ttttctatat tagttttgga aagaaatgct gcatttcagt 68221 tagaggtctg gaagtaaagg tgtaactatt ccccatctca gttcacagcc cccctgaggc 68281 cctcaaagcc caggggttct cccctcagag atctcaacca tgttctttgt cttcccaaat 68341 cccagctccc ctcccaccag ccccagccac agaggcctcc aagcctcccc tggagccacg 68401 cctaggggag ctgacagtga cggatataac ccctgactct gtgggcctct catggacagt 68461 ccctgagggt gaattcgact cctttgtggt tcagtacaag gacagggacg ggcagcccca 68521 ggtggtgccc gtggctgcag atcagcggga ggtcactatc cctgacctgg aaccctcccg 68581 caagtacaag ttcctgctct ttgggatcca ggatgggaaa cgacgcagcc cagtctctgt 68641 ggaggcaaag acgggtgaga tgggccccta cacagctgag cctgaggcca cagccttccc 68701 acccttctct tcactccctc tgctgagtct tccctttgtc cacctgctct gtctcatttg 68761 aaaagccaca tggggctagc acagtgactc atgcctgtaa tcccagcact ttgggagtcc 68821 aaggcaggtc aggagtttga gaccagtctg gccaacatgg caaaaccctg tctctactaa 68881 aaagacaaaa attagctggg tgtggtgatg ggtgctagta attccagcta ctcgggagtc 68941 tgaggcacaa gaatcacttg aacccaggag gtggaggctg cagggagctg agattgcgcc 69001 actgcactcc agcctgggtg atagagccag actccgtctc aaaaaagacc aaaaaaaaaa 69061 aaaaagaaag aaaagccgca aagcagatga ggaccaggag agtggggcca gttcctgggt 69121 ctgctgtccc ctgctgactc tgcctctctt ctcgtaattt ccacttttgt tggatgaggg 69181 aagcccctac aggcctcttc ctgcagggtg gatgtgggag cctgggggtg ctgggagaag 69241 gggcgagggg agccaggaac ccttcctgcc ttgcgaatcc atcacctcct gagagtcttt 69301 gttgtctccc ctgtgactct cacccaggtt ctcaccctcc tagctcaggc ctgtctgggc 69361 ccatatgtgc ctcctacaga ggctgccctt ccttctctgc cctcctcctg aagtctcaga 69421 aattgggaac cagaggagcc cagcccagct gtgccccttt ccgagggttc agtccaagcc 69481 ctcccctgcc cctgggggct ccctattgcc agaaaacttg caataacaat gtcgtcagcg 69541 tcctcaccgt ccaggaaggc agcaggccag ctccttctca agccctggga tggctcctca 69601 ggcagacacc agatcccttc cctgcctgag ctccccggca tcatccccct caacattctc 69661 acaggcagcc cacgctcctt caccaggccc agtgggagcc ttagttctcc cgggccagcc 69721 ggtgcagaat aagaaaggga acggagaagg gtgagaacta cctgtgtcgt aacatcctgc 69781 ctccctgacc tgccgagtgc ccccatcagg gatcctgcgc tcctcggggc cggccaaatc 69841 acccctttcc tcaggggccg gccaaatcac ggccaaatcg tgtgctcctg ggtaagagga 69901 gatagagacc cagtctgtaa gaagctgcac cctgctgggg aagcagtctg tgagaggcgg 69961 aaagaggctg gacagaaagg ggagtaagat gaggcaggga gctagggaat ctcttttcct 70021 ggaggtgact caggacaggt gaccctcccc acttgggtca aggggagaga ggcatggacc 70081 agacggagat ggtggtgagg atgggaggtg atggaaatat tttggggaag cactgagcat 70141 ttgggcagtt ctgggttttt ccagccccag agggaggcag tcaaggagtc gttttggaaa 70201 aaataaacac aagaaccttt cctctccagt tgcccgaggt gacgccagcc caggggcccc 70261 accccgcctt ggggagctgt gggtgacaga ccccacccca gactcactgc gcctctcctg 70321 gacggttcct gagggccagt tcgactcttt tgtggtccag ttcaaggaca aagacgggcc 70381 ccaggtggtg cccgtggagg gccatgagcg ctctgtcact gtcacccctc tggatgccgg 70441 ccgcaagtac agattcctcc tctatggcct cctgggcaag aagcgccatg gccctctcac 70501 tgccgacggc accacgggtg aggggcattc cctgcaggtc cctgctctgc tcccctcagg 70561 ccaagcagca gacggtcact tgtggtggct gccattacca ttatttggcc ccagccactc 70621 ggctcagatc cagctcccca cgtccctgca caagccccta gcacagcccg gcaggactca 70681 gccacgcaaa gccctgtctc cagacctctc aatcctgccc gtcgaggtca cccagtcttc 70741 cagaaacagc tcagccgttt cctctccgtg tctccatctc agaagcccgg agtgctatgg 70801 atgatactgg aacaaagcgt cccccaaaac cccgtctggg ggaggagctg caggtgacca 70861 ccgtgaccca gaactccgtg ggcctctcct ggacagtccc tgagggccag tttgactcct 70921 ttgtggtcca gtacaaagac agggacgggc agccccaggt ggtgcccgtg gagggcagcc 70981 tcagggaggt cagcgtgccg ggcctggacc ctgcccacag gtacaagctg ctgctctacg 71041 ggctgcacca cggcaagcgt gtgggcccca tctcggccgt cgccattact ggtgagtgtg 71101 cggcagctgg aacacctgtg cctccttccc gcctggctct cctggtctga ctgagccata 71161 agatctctga gcttcccatt ttatatcatt tccattgatc cagaaagttt ccttgtgccc 71221 ctttagagtc agttgcccca gtcccagcct ccagcaacca ctgtctgctt tcgatcatcc 71281 tggagacagc attttgactc tccatgtaat ggaatcacaa tacgcagtct cttgtgtttc 71341 atttcttgca attagcacaa tgcttttgaa attcacccat gtgcagcaaa agctggtctt 71401 tcgattgctg actgcttgcc ctcactccct ctcctccctc actcctcctg agagtctggg 71461 tgcagcgaca caccagccat ctgtctctac ctgtctctgt gaaccagccg gcagggaaga 71521 aacggaaact gagaccacgg ccccgacccc tccagcgcct gagccccacc tcggggagtt 71581 gacagtggag gaggccacgt cacacaccct gcatctctcc tggatggtga ctgagggaga 71641 atttgactcc ttcgaaatcc agtacacaga tagagacggg caactccaaa tggtccgcat 71701 aggaggtgac cggaatgaca tcaccctctc tggcctggaa tccgaccaca gatacctggt 71761 gaccctgtat ggtttcagtg atgggaagca tgtaggtcct gtccatgtcg aggccctgac 71821 aggtgagaac tctgcccact atgcctcctt tcagatggct gggagagtcc agaggacagc 71881 agagtcccgc ggataccctg cccacctcag tcctctcttt ccatgtctct gtccagtccc 71941 ggaggaggag aagccttcag aacctcccac cgcaaccccc gagcccccca tcaagcctcg 72001 cctgggggag ctgaccgtga cagatgccac ccctgactcc ctcagcctgt cctggacagt 72061 tcccgaggga cagtttgacc acttcctggt ccagtacagg aatggagatg ggcagcccaa 72121 ggcagtgagg gtgccagggc acgaggaagg ggtcaccatc tcgggcctgg agccagacca 72181 taaatacaag atgaacctgt acggcttcca cggtggccag cgcatgggcc ctgtgtctgt 72241 cgtcggggtg acaggtgagt ggatgatggg agccccaggg tgggagccat gggagggtca 72301 ccctcttgct ctttggtgat gactggtggg gaatgggaca agggtctggt cagcaccaca 72361 gacctgcttg tggctggggc tggggctccc cttgggcctt cctgtgaggt tgaccactgg 72421 ctcctcctga acagagaggg gccatcggga attttgctgt gctggtggct gtcccaggtc 72481 ccccacagct gaccctggaa cttgtcatgt gtgttagctg tcagctgagc aggaccaccc 72541 agccccaaga gtaggcctct ctgaactgac ctcgggtccc ccagtcatag ccttggcttc 72601 tccctccttt tccccagtac ccaaggacat ccccctcact ctctcttcct ccttctcagc 72661 tgcagaggaa gagaccccca gccccacaga acccagcatg gaggccccgg agcccgctga 72721 ggagccgctc ctgggggagc taacagtgac aggatcctcc cctgactcgc tgagcctctc 72781 ctggaccgtc ccccagggcc gcttcgactc cttcaccgtg cagtacaagg acagggacgg 72841 gcggccccag gtggtgcgtg ttgggggcga ggagagtgaa gtcaccgtgg ggggcctgga 72901 gcctgggcgc aagtacaaga tgcacctgta cggcctccac gaggggcggc gcgtgggccc 72961 agtgtctgct gtgggcgtca cgggtgagtg tgcactgcag agccctctgg gttgggtctt 73021 agcaaagtac agcctccagc atctcctcca ctagggaccc agaaccccaa gacctcaaac 73081 ctgtaataca ccctgttcac taaaggctca ggacaggtgt gatctgggga cagagagagc 73141 aaatccagga gaagtgtcgg agccatatgg gaaaggcccc cagggactga ggcctcttgg 73201 gaggtgattc actggctggt ttgtggctct ccgcttttcc ttggaactct atacataact 73261 ctttttgtga gattggaaca catttttgaa atacaaatta attaaattaa taagtatttg 73321 ttaaataaaa tttcaaatgt ataaacagaa agaatagtac aaagaccctg ccatcctcct 73381 tcaatcaaat atcgacttct ggccagtctg gctcatctct gccccaccca cttcctccac 73441 tagaatcact agaatgtttt ctttctttct ttctttcttt ctttctttct ttctttcttt 73501 ctttctttct ttctttcttt cttccttcct tcctttcttt ctttctcttt ctttctttct 73561 ttcattcttt tctttctttc tttttagaca aggtctcact ctgtcgcctg ggctagagtg 73621 tactggcaca gtcacaattc actgcactgc agcctcaacc tcctgggctt aagggatcct 73681 cccacttcag cctcccgaat agctgggact acaagtgcac tccaccatgc ccagctaatt 73741 ttttgtattt tttgtagaga cagggtttag ccatgttgcc caggccagtc tcgaactcct 73801 gggctcaagt tatcctccca cctcagccaa agtgatagga ttccaggcgt gagccgccac 73861 acccggcctg aatgttttct ttattaacac ctttatggag atataattca tgtggcataa 73921 aatacacttg tttttagtgt acaacttact gattttagta tctttaccta cttgtgcagc 73981 catcaccacg atctaatttt agaacttttc catcaccccc aaaagaaatc ttgggtccag 74041 acccattcct acctcagccc tagtttataa ctaatctact tttttctctg tagattgctg 74101 ttccagaaca tttcgtataa atggactcaa acaatatgta ttcttttgtg tctagcttct 74161 ttcactgagc ataatgtttt tgaggttcat caatgtagca tgtagaagta ctatacttac 74221 tacattgcta ttgattgatt gattgattga ttgattgatt ttttttgaga caaattccca 74281 ctctattgcc caggctatag tgcagtggca tgatcatagc tcactgcagc cccagactcc 74341 tgggctcaag ccatcctccc atctcagccc cccaggtagc tggggccaca ggtgctcgcc 74401 accacactca gctaatttaa aaaaattttt tgtagagatg gggtctcaca gtgttaccca 74461 ggctgatatc aaactcctgg cttcaaggaa tcctcctgcc tgggccttac aaagtgctgg 74521 gatagcaggc atgagccacc tgctctctct cagccactac actgcttttt attgccaaac 74581 agtattgcat tgtgtggtta tactactgca gtattttaaa aacgaaccac agacatccta 74641 tgctttcatc actaaatatt taaaggtata tctcttttta aaaaaattat ttttaaaaaa 74701 tccacaatat ggccaggtga ggtggttcac gcttgtaatc ccagcacttt gggaggccaa 74761 ggcaggtgga tcacttgagg taaagagttc aagaccagcc tgaccaacat ggtgaaaccc 74821 cgtctctact aaaaataata ataataataa taaattttaa aaagtacaaa aaagcggagc 74881 gtggtggcac gtgcctgtag tcccagctac ttggcggggg ttgaggcaca agaaattgct 74941 tgcacccagg aggtggaggt tgcagtgagc caagatcgcg ccactgcact ccagcctggg 75001 tgacagagtg agactctgtg tcaaaaaata aataaataaa taataaaatt ccacaataca 75061 attagcatac cttgataagt gaacaatcat gccttgatat caagtatcca ggtagtgcta 75121 cagtctcccg agtgtctcat aattgttgtt tcccaatttg ttgctgaaat caggtctata 75181 gaaggaggat acattgcaat tgagaaaact gaaattttaa atagagtaat cagggaaggc 75241 ccacagaaga aggtggcttt tgaacaaaga cttaaaggaa gtgtggggat gagccatgaa 75301 gatatctggg gaaagaacat ttaggtagag ggaacagtca ttgtaaaagt actgaggtag 75361 gaggctgcct ggcatggttg agggaccata aggaagccac aaaatctgat tatgattata 75421 atcagaaatc gatcttaagt tagggccaga ctgggcccac cttctgacct gaattcaaga 75481 cagctggggc ctcaacacct ccttgcagca agagaaaaga ttctgtttct atctcttcct 75541 agcccccgaa gaggagtccc ctgatgctcc tcttgcaaag ctgcgcctag ggcagatgac 75601 agtgagagac atcacctccg actccctcag cctctcctgg acagtccccg agggccagtt 75661 tgaccatttc ttggtccagt ttaagaatgg ggacgggcag cccaaggcgg tgcgggtgcc 75721 gggacacgag gatggggtca ccatctcggg cctggagcca gaccacaagt acaagatgaa 75781 cctgtacggc ttccacggtg gccagcgcgt gggccccgtg tctgctgttg gtttaactgg 75841 tgagtgtgca gtagggcact gggccctgcc ctgaactaga ctcagtttcc cttttattgt 75901 cagtatctgg tggctttatt ttactttccc tgagaccaag cctcccaagc tcctgggaat 75961 ggttctgctg gtgccttcac tccgagactt tggtattgcc ccaggtcttc tgcccgttac 76021 acatgggctt tttgggttcc caaagaggtg ggggaccagg aaaataacac agactgaggc 76081 acacctgggg ttgaccatag cttcagccca tttgagctgt gtgtcctcaa gagactcaca 76141 caacctgtct gggcacggtg gctcatgcct gtaatcccag cactttggga ggccgaaggg 76201 ggcagatcac ttgaggtcag gagttcgaga ccagcctggc caacatcatg aaacccccat 76261 ctctactaaa aatacaaaaa ttagctgggt gtggtggcac atgcctgtag tcccagctac 76321 tcgggaggct aaggcgggag aatcgcttga acccaggagg cggaggttgc agtgagccga 76381 gatcgagcca ctgcactcca gcctgtgtga cagagtgaga ccctgtctca aaaaaacaaa 76441 acaagcaaac aaaaaacaga cacacgacct ctctgtacat ccttttcctc atctgtaaag 76501 aaggaataac catacctgtc gtgaagggtg ggcgggagtg tggcttggct ctcacacaca 76561 atgagagata actgcagtct ttcccttctg aggaaggtag gtagttcctg aaaaccgctt 76621 agggcaaatt ctcataatta tcactgattt cagtgggaaa aattgtatgt gttctctaag 76681 ctcccaaatt aatacccatt tatttttaaa acaacactga atcctgttaa catggatact 76741 attactttaa ttgtaaatat tggccatgtg cacaacttaa taaggcattt tcctttcatt 76801 accctgtaac tcggctcttt gatgcttttg taagctcttt tatagctgtt accccctaga 76861 tgcttaagac tcagttctca aaaagaacag acaaatgaaa tgtgatgaat ttgtaatacc 76921 actatcaagg cagaataaaa ccctatgaag agcagaaaag attaaagaaa tcaatgaaaa 76981 caaagcaatt gatttcacac gggccaaaag aactggcatc actggaggtg ctctctctgc 77041 cgtaagttag cagagctact gcaggtccag agacaactca gctgcaaaga tgccctcatg 77101 tggtctcatc tctctaccct cagctataaa atctaagttg gtttctaaaa atgctcagga 77161 ttaggtggaa tttccaagcc caggtggatg aacctcctga tacaacttgc ctgttgtgtt 77221 tggttggtgt tctttgtttt cttttttttt ttttctttat gttgagatgg agtctcactc 77281 tgtcacccag gctctcaggc tggagtgccg tggtgcgatc tcagctcatt gcaacctctg 77341 cctcccaggt ccaagcagtt ctcctgcctc agccttccga gtagctggga ttacaggcat 77401 gcaccagcac atctggctaa tttttgtatt tttagtacag acaggatttc gccatgttgg 77461 ccaggctggt ctcaagctcc tgacctcagg tgatcctccc gcctcggcct cccaaagtgc 77521 tgggattaca ggcctgagcc accacgcctg gccctgattg ctgttttttg tttgtttgtt 77581 tgtttttgtt tgtttgtttt tttcttgaga aggagtctcg ctgtctccca ggatggagtg 77641 cagtggcgtg atcttggctc accgcaagct ccgcctcctg ggttcatgcc attctcctgc 77701 ctcagcctcc tgagtagctg ggactacagg tgcccgccaa cacgcctggc taattttttg 77761 tatttttagt aaagacgggg tttcacagtg ttagccagga tggtctcaat ctcctgacct 77821 catgatccgc ccgcctcggc ctcccaaagt gctgagatta caggcctgag ccaccacacc 77881 cggccctggt tggtgttctt ataaagcatc aattctgtcc tgcaaaacaa atttgcttgg 77941 cagctcctaa gggtcccagg cctgcacact gagcatcaca gggatgccac ctgagcatca 78001 tgacttcacc acaggtgtgg tgttaactga acattcaaag cagaaaaatg atattctaag 78061 cttggggcac cagcatccag actgtgggcc cttggtgtgt ctaagagaat cccagagtcc 78121 cttggttaaa aggaggccca ccaggcatgc ccacccattc tattttctga tgcagcccca 78181 ggaaaggatg aagaaatggc cccagcctcg acagaacctc ccacccctga accccccatc 78241 aagcctcgcc tggaggagct gaccgtgaca gatgcgaccc ctgactccct cagcctgtcc 78301 tggacggttc ccgagggaca gtttgaccac ttcctggtcc agtacaagaa tggggatggg 78361 cagcccaagg caacacgggt gccaggacat gaggacaggg tcaccatctc cggcctggag 78421 ccagacaaca agtacaagat gaacctgtac ggcttccacg gtggccagcg tgtgggcccc 78481 gtgtctgcca tcggggtgac aggtgaatgg acgatgggag ccccagggtg ggagccatgg 78541 gagggtcacc ctcttgctct ttggtgatga ctggtgggga atgggacggg tctggtcagc 78601 accacagacc tacttgtggc tggggctggg gctcccattg tacctttttg tgtggttgac 78661 ccctggctcc ccctgagcag ggaggggcca ttgggagttt tgctgtgctg gtggctgtgc 78721 caggtccccc acagctgacc ctggaatttg tcatgtgtgt tagctgtcag ctgagcagga 78781 ccccaagaat gggcctctct gaactgacct caggtcccct agtcatagcc ttggctattt 78841 catatgtccc ctagtcatag ccttctccct ccttttcccc acgacgtaag cacatccccc 78901 agggaccctg ccatcctctc tgtgtccctt tttctcagct gcagaggaag agacccccag 78961 ccccacagaa cccagcatgg aggccccgga gccccctgag gagccgctcc tgggggagct 79021 aacagtgaca ggatcctccc ctgactcgct gagcctctcc tggaccgtcc cccagggccg 79081 cttcgactcc ttcaccgtgc agtacaagga cagggacggg cggccccagg tggtgcgtgt 79141 tgggggcgag gagagcgagg tcaccgtggg gggcctggag cctgggcgca aatacaagat 79201 gcacctgtat ggcctccacg aggggcggcg cgtgggcccg gtgtccaccg tgggcgtgac 79261 tggtgagtag tgcttggagt ctcggggtaa ccacctttcc ctcatgggta cctggtttac 79321 tgctgtgccc tttcaccaag ccctgtgggt ccagacttgt ctcctgtgtc ctgccctccc 79381 tctgtgccct gtggctgtgg cctatgctag cggtttgctt gttcactttg gcgtggcctc 79441 cctctaatgg tcaaccactg ataacctcca agagtgaaat atgtcaggcg cacaccaagc 79501 ctgacttacg agaattttcc tttctttcca tcttaataac agcattcttt gttataacca 79561 catacacaaa tgcacacaag aggaacaatc caacgtcaaa ccacgttggg atgactaaat 79621 aatttaggca acatctataa aaagagctat tatgccgcca ctgaaatgtt ttaacataag 79681 acccttactt tataatgtta cattgaaaaa agagagccta ggctgggcac agtggcttac 79741 gcctgtaatc ccagcacttt gggaggccga ggcgggctga tcgcctgagg tcaggagttc 79801 cagaccagtc tagccaacat ggtgaaaccc catctctact aaaaatacaa aaattagcca 79861 agcatggtgg cacatgcctg taatcccagt tacttgggag gtggaggcac gagaattgct 79921 tgaacccggg aggcagaggt tgcagtgagc cgagatcacc actgcactcc agcctggcag 79981 atacagccaa actcagtctc aaaaaagaag aaaaaggagc ccaaattttt gtaaggatgc 80041 taatcacaac catttgaaag taacaggcat ggcaagaaga ctggaaagaa acacatccca 80101 cgggttgcct gatttgttag atagaataga atattccagc acattttttt ctgtattatt 80161 ctattctgca tagtccaaat tttcttaaat ggcaaacatt tcttttacaa tcagggaaag 80221 ggaaaagaga aaatatataa tgtcttctcc ttctttacac tccatctgtc ctactcctct 80281 atcgctcttg tggcttattt catttatttc cttctttttt ttttttgaga tgaagtttcg 80341 ctcttgtcgc ccaggctgga gtgcaatggc atgatctcgg ctcactacaa cctctgcctc 80401 ctggattcaa gcgattctcc tgcctcagcc tcccgagtag ctgggattac aggcatgcgc 80461 caccacatct ggctaatttt gtatttttag tagagaggga gtttctccat gttggtcagg 80521 ctgctcaaac tcctgacctc aggtgatctg cccgcctcgg cctcccaaag tgctgggatt 80581 acaggcatga gccaccgcgc ccagccagac ttatttcatt tctaattatg aggaagttca 80641 aaatttctag tccataagaa attggagggg tggagatcat gaggttaatt gaattccaaa 80701 tgctcagtca ggctgccttt tctaagtcat cctcagccag gttgttatgc ctggcttaga 80761 gtatttttcg agaattccta ccgatgggga agaatcagcc ctgctgagag gcggtctctt 80821 gcagggcagg tcgccacctc tgtggcacca agtgaggtgc gagctcatct cccaaaggcc 80881 atccagacag aggaagcact ctcaatcggc atccccgatt cctgggggag aggcctcatt 80941 tccatatgca tgttcagagg accatctgag tgttgggaaa cagggggaat agggaaatta 81001 ttggaaaatg aacattagaa aaattcacat tccgggggta agctgtcctg gtccacgttc 81061 agttttgtgt tcctgtctcc actgtgggga accacagaac tgacagagga caggctgagg 81121 gacccacccc tgcccctcct gttctatgtg tattgctgcc ccacccccac cccacactct 81181 attattaatt gttgtagccc agaatttctt tctttctttt tctttttttt tttttttttt 81241 gagactgagt ctcactctgt tgcccaggct ggagtgcagt ggtgtgatct cagctcactg 81301 caacctctgc ctcccaggtt caagtgattg tcctgcctca gcctcctgag tagctgggat 81361 tacaggtgca tgccaccacg cctggctaat ttttgtattt tcagtagaga ccgggtttca 81421 tgatattggc caagctagac tcgaattccc aacctcaggt gatccacccg cctcggcctc 81481 ccaaaatgct aggattacag gcgtgaacca ctgctcccag cccagaattt cttttttagc 81541 ccacgttttt ctagtgaaaa taatacagca acattatgtg gaaagcttga aaaacagaaa 81601 acaagaatct catagtccta ctttctcccc cagctgccgg cattaataac aatgtgtgta 81661 gtgcacattc ccttcctgta ttttgcatgc agagcgtgtt ttaaagttgc aatcagtgtt 81721 cataaagttc ttggcacttc cttttttgta cacaagtaca ttgtaatcat tcacctcacg 81781 gctacacaac cagcattcat catcgtttca atggttattt gatgcgttgc ggtgaaagca 81841 ctataacaaa attaatcatc ttctacgggt catttgtgtt cctgacacat ctgctgtcat 81901 gaataacccc atcgtgagtt cttccatgct ataacttttt cctgctttaa taattttctt 81961 aggacagatg cccagaactg ggattattgg gtcaaaggaa atgagaatta ctttggctcc 82021 tgacactatg tctcagttgc cttctggagg tttctaacag tgcagctctg ccagcagtac 82081 aggccggggg ctcggggata cctcaccggc tcttattcca agagtcactg aacggcgaac 82141 acaaagctgc cccagccctc agcctgctct ggaggggcgc atttgatgta tgacctctgt 82201 tgacagcacc agcaaagcaa gttgccctta aacccttaaa ctctgtatcc ccctatatta 82261 cctttcagcc ccacaagagg atgtggacga gacccccagc cctacagaac caggcacaga 82321 ggccccagag ccccccgagg agcctctcct gggggagctg acagtgacag gatcctcccc 82381 tgactcgctg agcctttcct ggaccgtccc ccagggccgc tttgactcct tcaccgtgca 82441 gtacaaggac agggacgggc ggccccaggc ggtgcgtgtt gggggccagg agagcaaggt 82501 cactgtgagg ggcctggagc ctgggcgcaa gtacaagatg cacctgtacg gcctccacga 82561 ggggcggcgc ctgggcccgg tgtctgccgt gggcgtcaca ggtgagtgag tgtgggtggg 82621 gcagggttgg aagacagccc tagaaaatgt gcccttctct accattttcc tatacatatt 82681 tctgtcttga tggggctcac agtgaaagga atatagcaac attatggaaa gacatgtcat 82741 ggagagacag gctgcaatcc agcaaatgaa gcaaaggcgg gtgagcatgt gatagggagg 82801 cccagggctc aggtcaggac cagacaggga cgcctaagtc accctgccca tgggtaccca 82861 ggggacagcc aggacctgag gccaggcatg ccttagcttg gtgacagctt tagagagaag 82921 gtgaagtgtg tcagataatc acagctggtg caaaggccag gaggctagaa agagcatggc 82981 acgtgagagg cactgaggat tgagtggggt gtcctttacg gtgagtatct catcctgaag 83041 tgtgggagca gaggagggga ccactcacca ggcctggggt ctcccaggga tgaggatgtg 83101 gttgcccagg tctgtgctga gatggcccca gcagctgtgc ctgtgtgaag ctcatccgtg 83161 ggggctgaag atggggatgg ggtggcaggg agcctggagg cagcgaggcc agtaggcagt 83221 tggtggccct ggtgagaggt gacagtggct caaactagga tcggggactg gaggtggggt 83281 aggaaggtat ccaggggaat tcagggtaaa gatgctctga ggctgctggc agctggtgag 83341 gagctggatc caggaacaca ccccaggctc tggcctcggg aggagtgtgc tgagcttgtt 83401 gcggagcaaa gacagaagcc cagtgaacaa aagatggcga agagacccca gtgctgggag 83461 gccaggggtg cagaggccga gtggggctgt gctcaaaaga gaggcggtgc tggagggaca 83521 gggagaggtg gcctgggtgt tgggaggtgg gggtgaggtg ggggctgagg gcaggagggt 83581 cagggtgagg gataggaaag gccacaggag aggagaggat gaagagctgt gctggagggg 83641 ctgtgggcag catcgtcctg ctcttgggca ctttgtgttt tgtgacacat cctttctatg 83701 ctgaactgag gagccaggga cctcactgtc cccacacgtg tctgtccaac tccagaggat 83761 gaagccgaga ccacccaagc agtgcctacc atgacccctg agccccccat caagcctcgc 83821 ctgggggagc tgaccatgac agatgccacc cctgactccc tcagcctgtc ctggacggtt 83881 cccgagggcc agtttgacca cttcctggtc cagtacagga atggggatgg gcagcccaag 83941 gcggtgcggg tgccggggca cgaggacggg gtcaccatct caggcctgga gccagaccat 84001 aaatacaaga tgaacctgta cggcttccac ggtggccagc gcgtgggccc catctctgtc 84061 attggggtga cgggtgagtg gatgatggca gccccagggt gggagccgtg ggagggtcac 84121 cctcttgctc tttggtgatg actggtgggg aatgggccag gggtccggtc agcaccacag 84181 acctgcttgt ggctggggct ccccttgggc cttcctctga ggctgacccc tggctcctcc 84241 tgagcaggga ggggccatca ggagttctgc tgtgctggtg actgtcccag gtcccccaca 84301 gctgaccctg gaacttgtca tgtgtgttag ctgtcagttg agcaggacca cccagcccca 84361 agaatgggct tttctgaaat gacctcacat acccagtagt ggccatggtt tctccctcct 84421 tcccttgaag acctgagcac atcccccagg cacctggcat cctctctata tctccttttc 84481 tcagctgcag aggaagagac ccccagcccc acggaactca gcactgaggc ccgggagccc 84541 cctgaggagc cgctcctggg ggagctgaca gtgacaggat cctcccctga ctcgctgagc 84601 ctctcctgga ccatccccca gggccacttc gactccttca ccgtgcagta caaggacagg 84661 gacgggcggc cccaggtgat gcgtgtcagg ggcgaggaga gcgaggtcac cgtggggggc 84721 ctggagcccg ggcgcaaata caagatgcac ctgtacggcc tccacgaggg gcggcgtgtg 84781 ggcccggtgt ccaccgtggg tgtgacaggt gagtgtttgt gagtgaggaa gatggcccta 84841 gaagatgttg ctttctctgc aacttcatga aaacaaaaat attctcacca gccgggcttc 84901 ttttgcacgt ttccatgctt ggggatcttg ctaagaggca gattggggtt caacaggttt 84961 gggctagggc cagagattct gcatttccaa caagaaatga tgcggatgct gctggcccat 85021 gatcacgccc cttgagtagc aaagttcttc acgacaaagg aattggaccc tcttttgaaa 85081 tctgttgaag agaatttttc tgcgtccctg attcctggta gtgtgctttc tctgtggagt 85141 tgactagggg ccgtgaagga agacagaagg cagtgagggg cagcgcgtcg actgcacact 85201 ctggaagccc actattggaa taggaatagg aacagacctt tctcaccaat gggccaattt 85261 gttcattcag caaagaattc cttgccctga tccggacact gtaattttag ggtatcaatg 85321 tcctctggcc aacgacctcc agaaactcac ctaaaattga atggatggaa ttatgctcca 85381 tgcagtgcag gaggactgtg gggtgactta ggaagaggct ttagaaagag attgttaagg 85441 aactgggctt gtgtttggtg ttttaggaaa gcatttaagg aagcaaggtt ttgctcttta 85501 ttagacgctg tcaggaagaa gggataaccc tattaccggg cgtctcacta agtcttatct 85561 ttggggaggt caactagagc aaggccaaag ctgccattgg taaagaagca gctatcactc 85621 agattagctg gatgggtgat gtttggttat ttttgtgctt tggaaaatgt tagagttcat 85681 ccttactgag acatgatcac agactggcct tgttcttgtc ttgatccatc ctactgacaa 85741 atggcctagt ctgatgttga tgttccatga aattgtctgt tcaactggac cacgccaaga 85801 ccagactgtg ccaggccagc cccaagcagc cgtggcctgg cagagggaaa gggcagctca 85861 cgggtgtcag ggctgccttt ttctttcata ggcggaagct ggaaagtgac acacacaagt 85921 tcatgttttc atggggctca caattgtggg cacaagcaga aaaccggaaa gtaagcaaag 85981 aaggatgagc atgtggagcc caggagccaa gccaggattg ggaagggaca gtgaagtcac 86041 tctgcccaca agtacccagg ggacagccag gacctgaggc caggcaggcc ttagcttgat 86101 gacagctttg gagaggaagt gaagcatgtc aaatcatcac agctagtgca aaggccagga 86161 ggctagaaag agcatggtgt gtgagagaca ctgaggattg agtggggtgt cctttgcagt 86221 tggtggatca tcctgaagtg tgggagcaga ggaggggacc actcaccagg cctggggtct 86281 cccagggatg aggatgtggt tgcccaggtc tgtgctgaga tggccccagc agctgtgcct 86341 gtgtgaagtt catccgtggg agctaaagat ggggatgggg tggcagggag cctggaggca 86401 gggaggccag taggcagttg gtggccctgg tgagaggtga cagtggctca aactaggatt 86461 ggggactgga ggtgggggag gaaggtatcc agggaaactc gggggagagg tactctgagg 86521 ctgctggcag ctggtgaggg gctgggtcca ggaacacacc ccaagctctg gcctcgggag 86581 gagtgtgctg agcttgttgc ggagcaaaga cagaagccca gtgaacaaaa gatggcgagg 86641 agacaccagt gcagggaggc taagggtgca gaggccgagt ggggctgtgc tcaaaagaga 86701 ggcggtgctg gagggacagg gagaggtggc ctgggtgttg ggaggtgggg gtgaggtggg 86761 ggctgagggc aggggggtca gggtgaggga taggaaaggc cgcaggagag gagaggatga 86821 agagctgtgc tggaggcgct gtgggcagca tcgtcctgct cttgggcact ttgtgttttg 86881 tgacacatcc tttctatgct gaactgagaa cccagggacc tcactctccc cacacgtgtc 86941 tgtccagctc cagaggatga agcagagacc acccaagcag tgcccaccac aacccctgag 87001 ccccccaaca agcctcgcct cggggagctg accgtgacag atgccacccc tgactccctc 87061 agcctgtcct ggatggtccc cgagggccag tttgaccact tcctggtcca gtacaggaat 87121 ggggatgggc agcccaaggt ggtgcgggtg ccggggcacg aggacggggt caccatctca 87181 ggcctggagc cagaccacaa gtacaagatg aacctgtacg gcttccacgg tggccagcgc 87241 gtgggcccca tctctgtcat tggggtgaca ggtgagtgta cgatgggagc cccagagtgg 87301 ggcctgtggg agggtctccc tttctctggt gatgggtgaa ctggcccagg aagcccctct 87361 gctcttggct gagccatggt acttttttgt ctttccccac ttccctgagg actgacagat 87421 cttcctgggt ggagaagggc cctgtgagct ctgttggtgg ctgtcccaag ttccccagca 87481 ctgacctcag agcttgtcat gtgtgttgac tgtaaactga gcaagaccac ccagctccaa 87541 agatgggcct ctccaagctg accccaggac ccccactcat ggccacagct tcgctctcct 87601 tcctcacaag acccaaggac atcccccagg gaagctgcct caccttctct gtcccctctt 87661 ctcagctgca gaggaagaaa ctcccgcccc cacagaaccc agcacggagg ccccggagcc 87721 ccctgaggag ccgctcctgg gggagctgac agtgacagga tcctcccctg actcgctgag 87781 cctctcctgg accatccccc agggccgctt cgactccttc actgtgcagt acaaggacag 87841 ggacgggcgg ccccaggtgg tgcgtgtcag gggcgaggag agcgaggtca ccgtgggggg 87901 cctggagccc gggtgcaaat acaagatgca cctgtacggc ctccacgagg ggcagcgcgt 87961 gggcccagtg tccgctgtgg gtgtgacagg tgagtaagtg tgagtgaggc gaggtgggga 88021 agatggccct ggaagacact gctgtctctc cagtcttcgt gaaaacatac tctgaagagt 88081 attccttcct ttgtgtatcc aaatgcctgg agatttttgt taaaaagcag atttggattc 88141 agcaggcttg gcctagggca agagaccatg catttctttc tttttctttc tttctttttt 88201 ttttttttga gatagagtct cgctttgtca cccaggctgg agtacagtgg ctcaatctcg 88261 gctcactgga acctccgcct cctgggttca agtgattctc ctacctcagc ctcctgagta 88321 actgggacta caggcgtgcg ccaccacgcc cagctaattt tttcatattt ttagtagaga 88381 tgggtttcac cgtgttagcc aggatggtct caatctcctg acctcgtgat ctgccctcct 88441 cggcctccca aagtgctggg atcataggcg tgagccacgg cgcccagccg accatccatt 88501 tctaacaaga ttctcatcta tcttgatgcc actggtccaa ggaacacact ttttttgttg 88561 ttttttattg tagtaaagta cacttgacat aattcgccat tctaatcgct ttctagtttg 88621 cagtccagca tgaagtccac tcacattgct gtggactgtc accctccact tccagaactc 88681 ctcttcccac actgaaactt cctacccact agacacgaac tcccattctc cccttgcccc 88741 agcccctggc aacaccattc tactctctgt ctctatgaat ttgactctag gtacctcata 88801 taagtgcaat catacaatat ttgtcttttt tgaatggttc atttcattaa atacaatgtc 88861 tttaaggttc atctatgtcg tagcatcagt cagaatttcc tttttatttt attttctgtg 88921 gcaatggggg tctcggtatg ttgcccaggc tggtctcaaa ctcctggcct caagcgagtc 88981 tcccaccttg acctcccaaa gtgctgggat tataggcaag agccactgca cctggccaga 89041 acttccttcc ttttcaggct gaataatgtt tcgttttaca catacagacc acactttgct 89101 catcctttca tccattgatg gacatctggg ttgcttccac ctcttagcta tcaagaataa 89161 tgctgctatg aatatttgtg tgcaaatcac agaacacctt tgtagcaaag ctcctcccag 89221 taacagattt ggaaactctc ttagaatttg ttgaagcgaa tttttcttgc attcatgaat 89281 cctcacagag gttaggctca gagttagggt tcctgtccta atgggaaaag ttcctgtcct 89341 aatggggcta acgtcatggg ggacaggctg tggaccagta agaaagcaaa ggcgggtgag 89401 catgtgacaa gaagcccaga gccaggcagg aatacctaaa ccaccctacc tgtggatact 89461 caggggacag tcaggatctg aggccaggca ggccttagct tggtgacagc tttagagaga 89521 aggtgaagcg tgtcagatca gcacaggtgg tgcaaaggcc aggaggctag aaagagcatg 89581 gtgtgtgaga agtactgagg attgagtggg gtgtcctttg cagtgaatag atagtcctga 89641 agtgtgggag cagaggaggc ctggggtctc ccagggatga ggatgtgatt gcccaggtct 89701 gtgctgagat ggccccagca gctgtgcctg tgtgaagctc atctgtgggg gctaaagatg 89761 gggatggggt ggcagggagc ctagaggcag ggaagccaat aggcagttgg tggccctggt 89821 gagaggtgac agtggctcaa actaggatcg gggactggag gttggggagg aaggtatcca 89881 gggaaactcg ggggagaggt aactctgagg ctgctggcag ctggtgaggg gctgggtcca 89941 ggaacacacc ccaggctctg gcctcgggag gagtgtgctg agcttgttgc agagcaaaga 90001 cagaagccca gtgaacaaaa gatggcgagg agaccccagt gcagggaggg taagggtgca 90061 gaggccaagt ggagctgtgc tcaaaagaga ggcggtgctg gagggacagg gagaggtggc 90121 ctgggtgttg ggaggtgggg gtgaggtggg ggccgagggc aggggggtca gggtgaggga 90181 taggaaaggc cgcaggggag gagaggatga agagctgtgc tggaggcgct gtgggcagca 90241 tcgtcctgtt cttgggcact ttgtgttttg tgacacatcc tttctatgct gaactgagaa 90301 cccagggtcc tcactgtccc cacacgtgtc tgtccagctc caaaggatga agccgagacc 90361 acccaagcag tgcctaccat gacccctgag ccccccatca agcctcgcct gggggagctg 90421 accgtgacag atgccacccc cgactccctc agcctgtcct ggatggttcc cgagggccag 90481 tttgaccact tcctggtcca gtacaggaat ggggatgggc agcccaaggc ggtgcgggtg 90541 ccggggcacg aggacggggt caccatctca ggcctggagc cagaccataa atacaagatg 90601 aacctgtacg gcttccacgg tggccagcgc gtaggccctg tgtctgccat tggggtgacg 90661 ggtgagtgaa tgatgggagc cccagggtgg gagctgtggg agggccacct cttgctcttt 90721 ggtgatgact ggtggggaat gggacagggg tctggtcagc accacagaac tgcttgtggc 90781 tggggctggg actccccttg ggccttccta tgtggttgac ccctggctcc ccctgagcag 90841 ggaggggcca tcaggagttt tgctgtgctg gtggctgtgc caggtccccc cacagctgac 90901 cctggaactt gttacgtgtg ttagctgtca gctgagcagg accacccagc cccaagaatg 90961 gacttctctg aaatgacctc aggtccccca gtcatagcct tggcttctcc ctccttttcc 91021 ccaggaccca aggacatccc cctcactctc tctccctcct tctccactgc agaggaagag 91081 acccccagcc ccacagaacc cagcactgag gccccggagg cccctgagga gccgctcctg 91141 ggggagttga cagtgacagg atcctcccct gactcgctga gcctctcctg gaccgtcccc 91201 cagggccgct tcgactcctt caccgtgcag tacaaggaca gggacgggca gccccaggtg 91261 gtgcgtgtca ggggcgagga gagcgaggtc accgtggggg gcctggagcc cgggcgcaaa 91321 tacaagatgc atctgtacgg cctccacgag gggcagcgcg tgggcccagt gtccaccgtg 91381 ggcatcacgg gtgagtgggg ggacaggccc tcgtccccag gtttacctct gcagccccct 91441 tgtgtttctc ctttggatct tggcacctct tttgactggg cctctaggtt tctgtctttt 91501 ctcccatgtt gctaatgatc ctgcctcatc cttggattca tggagactgt gtggagtcag 91561 atgggcaggt agtccatgcc ctgtttgttg tagcttcctt ctcccattgt gatctggaac 91621 ctccatgttg ctcgtgtgct catttgttag tcttcaggct cccttaagga gtattttagt 91681 ggctcagaaa gtgcttcaga tccagctgcc cagatctgca tgtgccttcg aatgtagtga 91741 ttgtgcgact ttgcaggcgt ttcctaacct tgtgcttcag attcctgcaa agttggggtg 91801 atgataataa tggcacccac ttcatatgtt gtgcgagggt taaatgcacc actgtttgtg 91861 agctgcttac agcaatgcag ggcacagatt ctaaaacaag cgttttagag gaggccgcta 91921 agaaatgctc actccagtcc tgggagagca ctgcctccct tgcgcagggt cctgcctcct 91981 gacccatggg cctgctctgc tcttttcagc gcccctgccc acaccactgc cggtggagcc 92041 ccgcctgggg gagctggcgg tggcggccgt gacctcggac tcagtgggcc tctcatggac 92101 ggtggcccag ggcccctttg actccttcct ggtacagtac agggacgcgc aggggcagcc 92161 ccaggcagtg cctgtgagcg gagacctccg agcggtcgcc gtctcggggc tggacccggc 92221 ccgcaagtac aagttcctgc tctttggact ccagaatggg aaacgccacg gcccagtccc 92281 tgtggaggcc aggaccggtg agtgagggct ggaggcctcc cgcggccaga gccttcgccc 92341 ccttgtggca cctgttggaa tttcacgttc tgtgccccac actccagtcc tcagcaccca 92401 ctgatttatt gggtccagga aaacccaggc cctgagctct cctctcccac tcacactcat 92461 ccctctccct actttccctg caccccaggg gacacttgct ttcttgtctg gctcctcttt 92521 tattcctact cctggcccag tcaccctccc gttcctggga ccggctcaca gagccatagc 92581 agcccaggaa gctccgtggc ccctttgcct ccatcccact ctccatgcac ctcactgtct 92641 tttccagccc cagacaccaa accgtctccc cgcctggggg agctgactgt gacagatgcg 92701 acccctgact ccgtgggcct ctcgtggacg gtccctgagg gcgaattcga ctccttcgtg 92761 gtccagtaca aggataagga tggtcggctc caggtggtgc cggtggcagc caaccagcgg 92821 gaggtcacag tccagggcct ggagcccagt aggaaataca ggttcctgct ctatggtctg 92881 tcaggcagga aacgactggg ccccatctct gctgacagca ccacaggtga gtcccagtcc 92941 agcctccacc ttttccagag ctgcctctca tccgagccct cagagctggc cctgcagcct 93001 tcccgtgaga ttccctctat cagccctgac acaaccatct taactccgaa agtgagtccc 93061 tctgtgtgtc aggcacagtt ctgaagcata tgcattactt cctttcattc gctgactgct 93121 gaggaaataa aagggacacg ggctttgggg ccagcagatc tgtgttccag ttccagtcct 93181 agcctttgcc agggtgacca tggacaagcc gcttcatccc cgggacctca gtgtcctcac 93241 ctgcataatg ctgggaatgg cgctggactc agtagtggtt cccatcctcc tccctcctcc 93301 ctgactcgga gccgaggggt ggaagaaaag ggagggcttg gctatggaaa gacgatggaa 93361 aaacctggtg tcactatgcc tatgactcag ctccctgagg aaggggtgtg gtgggtcacc 93421 gaccctcacc ccactcccag gaccatgagt cactatgccc agaggaggac agagcagccg 93481 gccagtctgg gcctgggtcc ctgggactcc gtaattctct tcccaaaccc tcactgtggg 93541 gacggggtac agatccccac ccatcggccc cagccccacc cgggcaagcc tctgctctgc 93601 cctctgttct cccagctaat cccctgtctt ctccctgccc caccctggct gccccggccc 93661 agtggggtca gtgtggggct ggggaagcag gaggcacgga tgttcaggct gatggccaca 93721 aggggacaag ggggagatca cagcctggca gtgatgggag ccgtgcattg gccggctgcc 93781 cagaccagct ctcgatgttt ggggtgtctc cgtgacaacc cagagacctc cggggagtct 93841 ctctcagctc tggcctcatt cttgctcagg ggctgggggt cagggtaaac aaaggccctc 93901 tctgcacccc cagccaccca tccctcggga gatgatctgt aatgaatttg gcgcatcctc 93961 gatcacagca gggaaggggc ggggcaggaa ggagtttggg cagcagcctc cagaggagga 94021 gggggctgtt ctccctcatt cctgtggggc atggcgggag caggcctgtg tgtctcctca 94081 aggagctgtc cctggggcta cactggaggg accatttccc agaacctcac acctccggga 94141 ggctgccagg gcttaggcaa aggcagcatg tgactaagag ctttccctcc tccctctgca 94201 cagctcccct ggagaaggag ctacctcccc acctggggga actgaccgtg gctgaggaga 94261 cctccagctc tctgcgcctg tcctggacgg tagcccaggg cccctttgac tccttcgtgg 94321 tccagtacag ggacacggac gggcagccca gggcagtgcc tgtggccgca gaccagcgca 94381 cagtcaccgt agaggacctg gagcctggca agaaatacaa gtttctgctc tacgggctcc 94441 ttgggggaaa gcgcctgggc ccggtctctg ccctgggaat gacaggtgag gctgctgtgc 94501 ctggctatag caagccagct tgtgtgggtt tccttgtgca tttgggctga agacaaagat 94561 gactgcagga gtgggcaggc cggagtgggg cgccctggcc tgtccccagg aaggaggagg 94621 agtctgcagc cctgtgggct tcaacatcca tcaaggagtc cagagcagga gccaggccag 94681 gcgggaggga aaggccctgg gaggggctct ctaatctccc agccccgact ctgccccgtc 94741 actgccactg ctcctcatta ctcgctgggg ctgctgtcgc ctccccgaag ggtggccttg 94801 tccagatagc ggcaaacctc cctgccgtgg atgagtcagg agcattttct taagaggaac 94861 atcactggaa aacaaaatga gcggggacac agaaaccaac agcagtggct gcatttgtgg 94921 tacaggctcc tcttccagag ctcgctgatg cccacctcag acaggcctga ccacggcacg 94981 gctggtggga tttgccagtc acctcaacca gccagttcca ccctcagctt ctctcagaag 95041 ggagcaccac actcctcaag ctcagtgaat gtatcccggc atgggtgggg ccagagcctg 95101 tgatatctcg aggtgggctc ggcaggacac cggggtgtgg aagggggaag cgagcacctg 95161 actcagacag cgcgggagct cgcaggagtc acgaggccac agcgacttca ttgtctgact 95221 gggcctggac ctataaactt cccacctcag ccttgggcca agcctggaag ataaaaatgg 95281 agcaccccat ggcgcccctc actcagattc tcccctgggc ttctcccacg cagccccaga 95341 agaggacaca ccagccccag agttagcccc agaggcccct gagcctcctg aagagccccg 95401 cctaggagtg ctgaccgtga ccgacacaac cccagactcc atgcgcctct cgtggagcgt 95461 ggcccagggc ccctttgatt ccttcgtggt ccagtatgag gacacgaacg ggcagcccca 95521 ggccttgctc gtggacggcg accagagcaa gatcctcatc tcaggcctgg agcccagcac 95581 cccctacagg ttcctcctct atggcctcca tgaagggaag cgcctggggc ccctctcagc 95641 tgagggcacc acaggtacca ccaggcgtct ccggcctcta gcctaggact cagaagggag 95701 aaacgggggc tcagaagggg tggtcgcagg gaaagagcgt gaggcgggta ccagggagag 95761 aggatggatg ggctggatgc gagtggcctt tagctctgcc ccacaggacc cccctgtggc 95821 tgcaagtccc tggttacaga tagagaaaca ggggcaggga ggggggtgga agggacgtgc 95881 tctgggtcac caagctggtg tgcttctgtc tccaatccct tctcccccac ccactccgtg 95941 cagggctggc tcctgctggt cagacctcag aggagtcaag gccccgcctg tcccagctgt 96001 ctgtgactga cgtgaccacc agttcactga ggctcaactg ggaggcccca ccgggggcct 96061 tcgactcctt cctgctccgc tttggggttc catcaccaag cactctggag ccgcatccgc 96121 gtccactgct gcagcgcgag ctgatggtgc cggggacacg gcactcggcc gtgctccggg 96181 acctgcgttc cgggactctg tacagcctga cactgtatgg gctgcgagga ccccacaagg 96241 ccgacagcat ccagggaacc gcccgcaccc tcagcccagg taaggaccca cacacactct 96301 gccccaaagt gggggtcttt gtacttcacg ggggggacct agtgcctcag ccagcggtgg 96361 gggtgggcga gttggtggtg ggcctggagg aatctgcaga gcgacttcca ttcctgggga 96421 ctagaggaaa aggggtggtg agcctgtgct ggagcagagg cgaggggggg actcgcaggg 96481 agaagcctcc ctgcccctgc ctgcgtcatt gttccttgac ccctctgcag ttctggagag 96541 cccccgtgac ctccaattca gtgaaatcag ggagacctca gccaaggtca actggatgcc 96601 cccaccatcc cgggcggaca gcttcaaagt ctcctaccag ctggcggacg gaggtggtgc 96661 ctttgccatg tgctcatcgc ctcgcatttc ctctcccccc tgcactctgc ccaccctcca 96721 gccgccctgg ggttccctgg gtaaccctcg atccccaatg ttttcagggg agcctcagag 96781 tgtgcaggtg gatggccagg cccggaccca gaaactccag gggctgatcc caggcgctcg 96841 ctatgaggtg accgtggtct cggtccgagg ctttgaggag agtgagcctc tcacaggctt 96901 cctcaccacg ggtgagatgg actgggaccc ggggcaagag gtgggagcca agaaaacggc 96961 atgggtggga gttgagagag aacgaggagg gtgaaaggga ggtggtggag gctccgattg 97021 cggacgggag gccagtggag tctggggagg cacggagtag agagagccgc ggggaccctt 97081 ctgagcccct ccccttcccc cagttcctga cggtcccaca cagttgcgtg cactgaactt 97141 gaccgaggga ttcgccgtgc tgcactggaa gcccccccag aatcctgtgg acacctatga 97201 cgtccaggtc acagcccctg ggggtgagca ggcctgaggc ctctggaggg gacttgttca 97261 gggtggggat tgcagggggg aggctggact ctggccgagg atggaggggg caggccttga 97321 tgcccctctc tacactccca gccccgcctc tgcaggcgga gaccccaggc agcgcggtgg 97381 actaccccct gcatgacctt gtcctccaca ccaactacac cgccacagtg cgtggcctgc 97441 ggggccccaa cctcacttcc ccagccagca tcaccttcac cacaggtagg gtctgtgggg 97501 tgtgtgggac agggagagga ggtagaggga gccaggttgg gcctcatccc catctcctct 97561 tcctgctttc cctcctaggg ctagaggccc ctcgggactt ggaggccaag gaagtgaccc 97621 cccgcaccgc cctgctcact tggactgagc ccccagtccg gcccgcaggc tacctgctca 97681 gcttccacac ccctggtgga cagaaccagg tgccccggcc ccactgaccc aactcccctc 97741 cctgggtgat tccaggaggt gctgcctctg gccctcccgg agggtctcca cctccctctc 97801 ccctgacccc cccttgtctg tcccacagga gatcctgctc ccaggaggga tcacatctca 97861 ccagctcctt ggcctctttc cctccacctc ctacaatgca cggctccagg ccatgtgggg 97921 ccagagcctc ctgccgcccg tgtccacctc tttcaccacg ggtacctgga cgcacgggcc 97981 cggggccggg ggctgggtgg gcagccaggg cctaaggctt ggaaaaggac tggcccctgc 98041 tctcctctcc caggtgggct gcggatcccc ttccccaggg actgcgggga ggagatgcag 98101 aacggagccg gtgcctccag gaccagcacc atcttcctca acggcaaccg cgagcggccc 98161 ctgaacgtgt tttgcgacat ggagactgat gggggcggct ggctggtggg tggcattggg 98221 aagcccaggg gtctgtgcag ggcagggtct gttgccccgg gagccagagg ctgatggtgc 98281 ccccacttgc ttcccaggtg ttccagcgcc gcatggatgg acagacagac ttctggaggg 98341 actgggagga ctatgcccat ggttttggga acatctctgg agagttctgg ctgggtcagt 98401 gcctcacagg gactggggaa ctacggatgg ggatgggggc cctgtggaca ccaggaccct 98461 gatgagggca cgtatcccac ccccaggcaa tgaggccctg cacagcctga cacaggcagg 98521 tgactactcc atgcgcgtgg acctgcgggc tggggacgag gctgtgttcg cccagtacga 98581 ctccttccac gtagactcgg ctgcggagta ctaccgcctc cacttggagg gctaccacgg 98641 caccgcaggt aagcagaggc tgtgaggctg ggagggtgag gctgggaggg gaggccctca 98701 tggctccttc ctccaccctg cccaggggac tccatgagct accacagcgg cagtgtcttc 98761 tctgcccgtg atcgggaccc caacagcttg ctcatctcct gcgctgtctc ctaccgaggg 98821 gcctggtggt acaggaactg ccactacgcc aacctcaacg ggctctacgg gagcacagtg 98881 gaccatcagg tgaggggtgg ggaggcggct cagagctggg gtggctgggg ctcggcctgc 98941 ctaggtttca gccccacagt gtaacaggca agggactgag tggctgggtg aaatggaaca 99001 atcatgccag cctcgcagag ggagctggag ttgatttatt ggctggaaag ggccagctca 99061 gaattaagcc tcaatcctct gcagcggagg gtcaggaagg gagctctgcg gggaggttgg 99121 ttgagtgctg ggagctacct ccttaagggg aatgggagga gcagatggga catccggctt 99181 tgactctctc ttgacaaccc ctttcccagg gagtgagctg gtaccactgg aagggcttcg 99241 agttctcggt gcccttcacg gaaatgaagc tgagaccaag aaactttcgc tccccagcgg 99301 ggggaggctg agctgctgcc cacctctctc gcaccccagt atgactgccg agcactgagg 99361 ggtcgccccg agagaagagc cagggtcctt caccacccag ccgctggagg aagccttctc 99421 tgccagcgat ctcgcagcac tgtgtttaca ggggggaggg gaggggttcg tacaggagca 99481 ataaaggaga aactgaggta cccggctggc atcggtcctg ccccatcact ggctctggcc 99541 cgggctgtgg gcccccatcc cccggggctg cagccgcact tggaaaggct gcatcttgag 99601 gatgacactg cagtggggca ggggctgcag ggagggcagg gcgtccccgg agggcagcag 99661 cgtgaaggcc tgcagcagtc gggtcagcac cacgaagagc tccaggcgcg ccagcggctc 99721 gcccaggcac acgcgggcac cgcagccgaa ggccagagct ctggagttct tgcctggctc 99781 caggaagcga tctgcgggcg ggtggacagg tgggtgggga ggcgttcagc ggcagcgggg 99841 accagcctcc accacatttt cacggcaggc ccccggcccc ccacatacca ggccagaact 99901 catgtggcct ctcccagacc gtctcatcca ggtgggcgcc ttggaggttc ggaatgatga 99961 ctgtgccctc agggatgtcg tagccggaga tgctgaaggg ggctggagtt agaggctggc 100021 caggacctcc ctgggctcgg gctttcctca ctcatcccca accctcggga gtcacctgct 100081 gggccgtgtg gtgcggtggg gcaaggctaa gggcacaacg ggccgcaggc gcagcacctc 100141 ggcgatggtg gcattgagca agggcagccg tgcacggtcc ttgtagggga cccgggagct 100201 ggaggcacca gggcccagtt cgtggtctag ctcctcctgc agtcgctgct gaatctgggg 100261 aatgatc // LOCUS HSMHC3W5A 62944 bp DNA PRI 15-FEB-1997 DEFINITION Human HLA class III region containing NOTCH4 gene, partial sequence, homeobox PBX2 (HPBX) gene, receptor for advanced glycosylation end products (RAGE) gene, complete cds, and 6 unidentified cds, complete sequence. ACCESSION U89336 NID g1841547 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 62944) AUTHORS Rowen,L., Dankers,C., Baskin,D., Faust,J., Loretz,C., Ahearn,M.E., Banta,A., Spies,T. and Hood,L. TITLE Sequence determination of 300 kilobases of the human class III MHC locus JOURNAL Unpublished REFERENCE 2 (bases 1 to 62944) AUTHORS Rowen,L. TITLE Direct Submission JOURNAL Submitted (12-FEB-1997) Department of Molecular Biotechnology, Box 357730 University of Washington, Seattle, WA 98195, USA COMMENT Cosmids W5A and W12A were obtained from Thomas Spies (Spies et al (1990) Nature 348: 744-747. This contig overlaps U89335 by 12137 bp and U89337 by 363 bp. Cosmid W5A spans bases 1-40510 and cosmid W12A spans 28776-62944. There were no sequence differences where the cosmids overlapped. Sequencing methodology: high redundancy shotgun. Interspersed repeats were identified with RepeatMasker (available from http://ftp.genome.washington.edu/RM/RepeatMasker.html) Microsatellites (n >8 repeating units) were identified with sputnik (available from http://serac.mbt.washington.edu/ chrisa/software/sputnik.html). FEATURES Location/Qualifiers source 1..62944 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6p21" gene 1..7831 /gene="NOTCH4" source 1..40510 /organism="Homo sapiens" /chromosome="6" /map="6p21" /clone="cosmid W5A" repeat_region 40..340 /note="SINE" /rpt_family="AluSp" repeat_region complement(398..496) /note="LINE/L1" /rpt_family="L1MD2" exon 644..1167 /gene="NOTCH4" /number=21 exon 1743..2126 /gene="NOTCH4" /number=22 exon 2237..2412 /gene="NOTCH4" /number=23 repeat_region 2735..3033 /note="SINE" /rpt_family="AluY" repeat_region complement(3048..3124) /note="SINE" /rpt_family="MIR" repeat_region 3128..3343 /note="DNA/MER2_type" /rpt_family="MER46" repeat_region complement(3345..3524) /note="SINE" /rpt_family="AluSx" repeat_region complement(3527..3825) /note="SINE" /rpt_family="AluY" repeat_region complement(3827..3963) /note="SINE" /rpt_family="AluSq" repeat_region complement(3972..4052) /note="SINE" /rpt_family="MIR" exon 4103..4322 /gene="NOTCH4" /number=24 exon 4518..4599 /gene="NOTCH4" /number=25 exon 4689..4827 /gene="NOTCH4" /number=26 repeat_region 4944..5124 /note="DNA/MER1_type" /rpt_family="MER5A" repeat_region 5128..5428 /note="SINE" /rpt_family="AluY" repeat_region complement(5476..5539) /note="SINE" /rpt_family="MIR" exon 5654..5949 /gene="NOTCH4" /number=27 exon 6176..6323 /gene="NOTCH4" /number=28 exon 6827..6924 /gene="NOTCH4" /number=29 exon 7098..7831 /gene="NOTCH4" /number=30 repeat_region 8197..8246 /note="LINE/L2" /rpt_family="MIR2" repeat_region 8624..8952 /note="DNA/MER1_type" /rpt_family="MER1B" repeat_region complement(9127..9288) /note="SINE" /rpt_family="MIR" repeat_region complement(9467..9591) /note="SINE" /rpt_family="AluJb" repeat_region complement(9729..10030) /note="SINE" /rpt_family="AluSx" repeat_region 10072..10124 /note="LINE/L2" /rpt_family="MIR2" exon 10735..10776 /note="identified by Grail" /number=1 CDS join(10735..10776,11378..11539,11745..11882) /note="intron-exon boundaries identified by a contig of ESTs with GenBank Accession Numbers W76064, R59617, W72507" /codon_start=1 /product="unknown" /db_xref="PID:g1841548" /translation="MEAERPQEEEDGEQTELLLDLVAEAQSRRLEEQRATFYTPQNPS SLAPAPLRPLEDREQLYSTILSHQCQRMEAQRSEPPLPPGGQELLELLLRVQGGGRME EQRSRPPTHTC" exon 11378..11539 /note="identified by ESTs" /number=2 exon 11745..12482 /note="identified by ESTs" /number=3 gene 13062..18054 /gene="HBX2" exon 13062..13553 /gene="HBX2" /number=1 CDS join(13333..13553,14452..14525,14744..14991,15092..15282, 15466..15601,15853..16006,16347..16435,16558..16644, 16774..16866) /gene="HBX2" /note="similar to human PBX2 encoded by GenBank Accession Number X59842" /codon_start=1 /product="homeobox PBX2 gene" /db_xref="PID:g1841549" /translation="MDERLLGPPPPGGGRGGLGLVSGEPGGPGEPPGGGDPGGGSGGV PGGRGKQDIGDILQQIMTITDQSLDEAQAKKHALNCHRMKPALFSVLCEIKEKTGLSI RSSQEEEPVDPQLMRLDNMLLAEGVAGPEKGGGSAAAAAAAAASGGGVSPDNSIEHSD YRSKLAQIRHIYHSELEKYEQACNEFTTHVMNLLREQSRTRPVAPKEMERMVSIIHRK FSAIQMQLKQSTCEAVMILRSRFLDARRKRRNFSKQATEVLNEYFYSHLSNPYPSEEA KEELAKKCGITVSQVSNWFGNKRIRYKKNIGKFQEEANIYAVKTAVSVTQGGHSRTSS PTPPSSAGSGGSFNLSGSGDMFLGMPGLNGDSYSASQVESLRHSMGPGGYGDNLGGGQ MYSPREMRANGSWQEAVTPSSVTSPTEGPGSVHSDTSN" exon 14452..14525 /gene="HBX2" /number=2 exon 14744..14991 /gene="HBX2" /number=3 exon 15092..15282 /gene="HBX2" /number=4 exon 15466..15601 /gene="HBX2" /number=5 repeat_region 15652..15775 /note="SINE/Alu" /rpt_family="FLAM_A" exon 15853..16006 /gene="HBX2" /number=6 exon 16347..16435 /gene="HBX2" /number=7 exon 16558..16644 /gene="HBX2" /number=8 exon 16774..18504 /number=9 gene 18991..22277 /gene="RAGE" exon 18991..19074 /gene="RAGE" /number=1 CDS join(19023..19074,19258..19364,19495..19690,19857..19921, 20044..20131,20222..20404,20547..20677,20857..20998, 21613..21639,21768..21894,22006..22102) /gene="RAGE" /note="similar to sequence encoded by GenBank Accession Number M91211; exons identified also by similarity to EST with GenBank Accession Number T83326" /codon_start=1 /product="receptor for advanced glycosylation end products" /db_xref="PID:g1841550" /translation="MAAGTAVGAWVLVLSLWGAVVGAQNITARIGEPLVLKCKGAPKK PPQRLEWKLNTGRTEAWKVLSPQGGGPWDSVARVLPNGSLFLPAVGIQDEGIFRCQAM NRNGKETKSNYRVRVYQIPGKPEIVDSASELTAGVPNKVGTCVSEGSYPAGTLSWHLD GKPLVPNEKGVSVKEQTRRHPETGLFTLQSELMVTPARGGDPRPTFSCSFSPGLPRHR ALRTAPIQPRVWEPVPLEEVQLVVEPEGGAVAPGGTVTLTCEVPAQPSPQIHWMKDGV PLPLPPSPVLILPEIGPQDQGTYSCVATHSSHGPQESRAVSISIIEPGEEGPTAGSVG GSGLGTLALALGILGGLGTAALLIGVILWQRRQRRGEERKAPENQEEEEERAELNQSE EPEAGESSTGGP" exon 19258..19364 /gene="RAGE" /number=2 exon 19495..19690 /gene="RAGE" /number=3 exon 19857..19921 /gene="RAGE" /number=4 exon 20044..20131 /gene="RAGE" /number=5 exon 20222..20404 /gene="RAGE" /number=6 exon 20547..20677 /gene="RAGE" /number=7 exon 20857..20998 /gene="RAGE" /number=8 repeat_region 21213..21530 /note="SINE" /rpt_family="AluSp" exon 21613..21639 /gene="RAGE" /number=9 exon 21768..21894 /gene="RAGE" /number=10 exon 22006..22277 /gene="RAGE" /number=11 exon complement(22456..23016) /number=6 CDS complement(join(22919..23016,23119..23236,23314..23368, 23482..23594,23686..23704,24594..24733)) /note="intron-exon boundaries defined by a contig of ESTs with GenBank Accession Numbers AA205944, W73464, W73185, R97740, H87107, T84842; similar to zinc finger of C. elegans, encoded by GenBank Accession Number Z46787, and A. thaliana, encoded by GenBank Accession Number U81598" /codon_start=1 /product="unknown" /db_xref="PID:g1841551" /translation="MAAAEEEDGGPEGPNRERGGAGATFECNICLETAREAVVSVCGH LYCWPCLHQWLETRPERQECPVCKAGISREKVVPLYGRGSQKPQDPRLKTPPRPQGQR PAPESRGGFQPFGDTGGFHFSFGVGAFPFGFFTTVFNAHEPFRRGTGVDLGQGHPASS WQDSLFLFLAIFFFFWLLSI" exon complement(23119..23236) /number=5 exon complement(23314..23368) /number=4 exon complement(23482..23594) /number=3 exon complement(23686..23704) /number=2 exon complement(24594..24733) /number=1 source 28776..62944 /organism="Homo sapiens" /chromosome="6" /map="6p21" /clone="cosmid W12A" repeat_region 29024..29160 /note="LINE/L2" /rpt_family="MIR2" repeat_region complement(29170..29218) /note="DNA/MER1_type" /rpt_family="MER5A" repeat_region 29235..29340 /note="SINE/Alu" /rpt_family="FLAM_C" repeat_region complement(29346..29424) /note="DNA/Mer1_type" /rpt_family="MER5A" repeat_region 29525..29661 /note="LINE/L2" /rpt_family="MIR2" repeat_region 29785..29856 /note="LINE/L2" /rpt_family="MIR2" repeat_region 30643..30729 /note="SINE" /rpt_family="MIR" repeat_region complement(30779..30913) /note="DNA/MER1_type" /rpt_family="MER33" repeat_region complement(30916..31213) /note="SINE" /rpt_family="AluSg" repeat_region complement(31215..31393) /note="DNA/MER1_type" /rpt_family="MER33" CDS join(31773..31948,32175..32308,32645..32820,32985..33080, 33207..33279,33797..33969) /note="intron-exon boundaries defined by a contig of ESTs with GenBank Accession Numbers AA009402, T77083, T16458, W24266, AA009906, HSZ78316, W44905, HSC0EA051, R80159, W65345; similar to 1-acyl-SN-glycerol-3-phosphate acyltransferase in C. elegans (encoded by GenBank Accesion Number Z73975), E. coli (Swiss-Prot Accession Number P26647), S. cerevisiae (Swiss-Prot Accession Number P33333)" /codon_start=1 /product="unknown" /db_xref="PID:g1841552" /translation="MLLLLLFLLLLFLLPTLWFCSPSAKYFFKMAFYNGWILFLAVLA IPVCAVRGRNVENMKILRLMLLHIKYLYGIRVEVRGAHHFPPSQPYVVVSNHQSSLDL LGMMEVLPGRCVPIAKRELLWAGSAGLACWLAGVIFIDRKRTGDAISVMSEVAQTLLT QDVRVWVFPEGTRNHNGSMLPFKRGAFHLAVQAQVPIVPIVMSSYQDFYCKKERRFTS GQCQVRVLPPVPTEGLTPDDVPALADRVRHSMLTVFREISTDGRGGGDYLKKPGGGG" exon 31773..31948 /number=1 exon 32175..32308 /number=2 exon 32645..32820 /number=3 exon 32985..33080 /number=4 exon 33207..33279 /number=5 exon 33797..35021 /number=6 exon complement(34946..35332) /number=8 CDS complement(join(35286..35332,35586..35739,35813..35892, 35985..36155,36237..36332,36432..36541,36622..36744, 36976..37076)) /note="intron-exon boundaries defined by a contig of ESTs with GenBank Accession Numbers N21410, AA004456, W67119, N31366, N98620, R10584; similar to latent transforming growth factor beta binding protein precursor" /codon_start=1 /product="unknown" /db_xref="PID:g1841553" /translation="MGSRAELCTLLGGFSFLLLLIPGEGAKGGSLRESQGVCSKQTLV VPLHYNESYSQPVYKPYLTLCAGRRICSTYRTMYRVMWREVRREVQQTHAVCCQGWKK RHPGALTCEAICAKPCLNGGVCVRPDQCECAPGWGGKHCHVDVDECRTSITLCSHHCF NTAGSFTCGCPHDLVLGVDGRTCMEGSPEPPTSASILSVAVREAEKDERALKQEIHEL RGRLERLEQWAGQAGAWVRAVLPVPPEELQPEQVAELWGRGDRIESLSDQVLLLEERL GACSCEDNSLGLGVNHR" exon complement(35586..35739) /number=7 exon complement(35813..35892) /number=6 exon complement(35985..36155) /number=5 exon complement(36237..36332) /number=4 exon complement(36432..36541) /number=3 exon complement(36622..36744) /number=2 exon complement(36976..37076) /number=1 repeat_region complement(37436..37632) /note="SINE" /rpt_family="AluSp" repeat_region complement(37671..37974) /note="SINE" /rpt_family="AluSx" repeat_region 38031..38331 /note="SINE" /rpt_family="AluSg" exon complement(39567..40434) /number=9 CDS complement(join(40291..40434,40619..40673,45318..45402, 45522..45605,47264..47371,47459..47554,48041..48194, 48447..48637,49588..49597)) /note="intron-exon boundaries defined by contig of ESTs with GenBank Accession Numbers AA143270, AA143271, N28773, W26954, N24410, AA008747, R81931, D79029, N36592; similar to palmitoyl-protein thioesterase precurser in B. taurus (Swiss-Prot Accession Number P45478), C. elegans (encoded by GenBank Accession Number U50313), H. sapiens (Swiss Prot Accession Number P50897)" /codon_start=1 /product="unknown" /db_xref="PID:g1841554" /translation="MKSCGSMLGLWGQRLPAAWVLLLLPFLPLLLLAAPAPHRASYKP VIVVHGLFDSSYSFRHLLEYINETHPGTVVTVLDLFDGRESLRPLWEQVQGFREAVVP IMAKAPQGVHLICYSQGGLVCRALLSVMDDHNVDSFISLSSPQMGQYGDTDYLKWLFP TSMRSNLYRICYSPWGQEFSICNYWHDPHHDDLYLNASSFLALINGERDHPNATVWRK NFLRVGHLVLIGGPDDGVITPWQSSFFGFYDANETVLEMEEQLVYLRDSFGLKTLLAR GAIVRCPMAGISHTAWHSNRTLYETCIEPWLS" exon complement(40619..40673) /number=8 repeat_region 40722..41023 /note="SINE" /rpt_family="AluSx" repeat_region 41146..41222 /note="LINE/L1" /rpt_family="L1MC4" repeat_region 41224..41355 /note="SINE" /rpt_family="AluSx" repeat_region 41359..41652 /note="SINE" /rpt_family="AluSx" repeat_region 41653..41953 /note="SINE" /rpt_family="AluSq" repeat_region 42122..42237 /note="LINE/L1" /rpt_family="L1MC4" repeat_region complement(42238..42505) /note="SINE" /rpt_family="AluJb" repeat_region 42508..42544 /note="LINE/L1" /rpt_family="L1MC4" repeat_region complement(42558..42852) /note="SINE" /rpt_family="AluSx" repeat_region 42897..43185 /note="SINE" /rpt_family="AluY" repeat_region 43222..43522 /note="SINE" /rpt_family="AluY" repeat_region 43768..43789 /note="microsatellite" /rpt_type=tandem /rpt_unit=at repeat_region complement(43789..44089) /note="SINE" /rpt_family="AluSp" repeat_region 44090..44268 /note="LINE/L1" /rpt_family="L1MB3" repeat_region complement(44277..44635) /note="LINE/L2" /rpt_family="MIR2" repeat_region complement(44592..44732) /note="LINE/L2" /rpt_family="MIR2" repeat_region 44893..45069 /note="LINE/L1" /rpt_family="L1MC3" repeat_region 44931..45069 /note="LINE/L1" /rpt_family="L1MD3" exon complement(45318..45402) /number=7 exon complement(45522..45605) /number=6 repeat_region 45642..45700 /note="LINE/L2" /rpt_family="MIR2" repeat_region 45730..46013 /note="SINE" /rpt_family="AluSx" repeat_region complement(46051..46342) /note="SINE" /rpt_family="AluSx" repeat_region complement(46343..46562) /note="SINE" /rpt_family="MIR" repeat_region complement(46567..46783) /note="DNA/MER1_type" /rpt_family="MER58A" repeat_region 46837..47131 /note="SINE" /rpt_family="AluSx" exon complement(47264..47371) /number=5 exon complement(47459..47554) /number=4 repeat_region complement(47658..47771) /note="SINE" /rpt_family="MIR" exon complement(48041..49194) /number=3 exon complement(48447..48637) /number=2 exon complement(49588..49646) /number=1 exon 52616..52855 /number=1 CDS join(52616..52855,53501..53686,53825..54001) /note="intron-exon boundaries defined by Xgrail and similarities to ESTs with GenBank Accesion Numbers T10235, W30645; 5'end may be wrong" /codon_start=1 /product="unknown" /db_xref="PID:g1841555" /translation="MPPDPYLQETRFEGPLPPPPPAAAAPPPPAPAQTAQAPGFVVPT HAGTVGTLPLGGYVAPGYPLQLQPCTAYVPVYPVGTPYAGGTPGGTGVTSTLPPPPQG PGLALLEPRRPPHDYMPIAVLTTICCFWPTGIIAIFKAVQVRTALARGDMVSAEIASR EARNFSFISLAVGIAAMVLCTILTVVIIIAAQHHENYWDP" exon 53501..53686 /number=2 exon 53825..54131 /number=3 repeat_region 55555..55645 /note="LINE/L2" /rpt_family="MIR2" repeat_region 55779..56069 /note="SINE" /rpt_family="AluSq" repeat_region complement(56791..56951) /note="LINE/L2" /rpt_family="MIR2" repeat_region 57363..57465 /note="DNA/MER1_type" /rpt_family="MER5B" repeat_region 57474..57604 /note="SINE" /rpt_family="AluJb" repeat_region 57611..57838 /note="SINE" /rpt_family="AluY" repeat_region 57849..58150 /note="SINE" /rpt_family="AluSx" repeat_region 58151..58311 /note="SINE" /rpt_family="AluJb" repeat_region 58312..58390 /note="DNA/MER1_type" /rpt_family="MER5B" repeat_region 58804..59097 /note="SINE" /rpt_family="AluY" repeat_region complement(59348..59647) /note="SINE" /rpt_family="AluSq" repeat_region complement(59653..59764) /note="SINE" /rpt_family="AluJo" repeat_region complement(59784..59955) /note="SINE" /rpt_family="AluSx" repeat_region complement(59958..60268) /note="SINE" /rpt_family="AluY" repeat_region complement(60283..60581) /note="SINE" /rpt_family="AluSq" repeat_region complement(60589..60718) /note="SINE" /rpt_family="AluJo" repeat_region complement(60891..61555) /note="LINE/L2" /rpt_family="MIR2" repeat_region complement(61704..61994) /note="SINE" /rpt_family="AluSx" repeat_region complement(62110..62325) /note="LINE/L2" /rpt_family="MIR2" repeat_region 62327..62548 /note="LTR/other" /rpt_family="PABL_A" repeat_region 62883..62944 /note="SINE" /rpt_family="AluYb8" BASE COUNT 15145 a 17344 c 15900 g 14555 t ORIGIN 1 gatcctattt ctaaggagta agataataat ataacagccg gccgggcaca gtggctcacg 61 cctgtaatcc cagcactttg ggaggccgag gcaggcggat cacctgaggt cgggcattcg 121 agaccagcct gacaaacatg gagaaaccct gtctctacta aaaatacaaa ttagctgggc 181 gtggtggtgc atggctgtaa tcccagctat tgggaaggct gaggcaggag aattgcttga 241 acccgggagg cagaggttgc aatgagctga gattgcacca ttgcactcca gcctggacaa 301 caagagcgaa actctgtctc aaaaataata ataataataa tataatagca ttctattaac 361 tgtttagtct tctaggactt gcactgtaat gccacagtcc atcaggttgt tgcacacagc 421 tgtgcttcat ccattttcaa cagaatgtaa tatgtcattg tgtgaaatta ccacaggaca 481 tggtttcaac atccacaaaa tgattaactt gatgctctct gaggcgcctt ttagatatga 541 gaatctagga ccctctgcac cgtcttaacc caagagtttg cttgatggag agcgggaaga 601 ataatgcaag ttgcatctcc aatatctccc ctcccctcca cagggttttg aaggccccac 661 ctgcagccac agggcccctt cctgcggctt ccatcactgc caccacggag gcctgtgtct 721 gccctcccct aagccaggct tcccaccacg ctgtgcctgc ctcagtggct atgggggtcc 781 tgactgcctg accccaccag ctcctaaagg ctgtggccct ccctccccat gcctatacaa 841 tggcagctgc tcagagacca cgggcttggg gggcccaggc tttcgatgct cctgccctca 901 cagctctcca gggccccggt gtcagaaacc cggagccaag gggtgtgagg gcagaagtgg 961 agatggggcc tgcgatgctg gctgcagtgg cccgggagga aactgggatg gaggggactg 1021 ctctctggga gtcccagacc cctggaaggg ctgcccctcc cactctcggt gctggcttct 1081 cttccgggac gggcagtgcc acccacagtg tgactctgaa gagtgtctgt ttgatggcta 1141 cgactgtgag acccctccag cctgcacgtg agcctgaaat ccactggagc cagggaagga 1201 gaggggtggg tgagaggagg aggaaggacg tagatggctc tgagttacag tgtggccaca 1261 gccttgggct ccagggagtt tccaccctaa taaccatcac taaacagggg tcgaagactc 1321 tggactccaa cctagggtaa tggggtggca tcagtattta atgtggggcg tggcctttgg 1381 gctcctctct aagagttgaa ggaactcagg tctcaagcct ccttccctaa gccttgctgc 1441 catggagtat ttcccctagc agtcagcacc tcacagaggg aaaagggcct gggactctcc 1501 tttagaaaca gaggagagct tgggagggta cagagagggg acagtctagg gagacagggg 1561 tgttagcaga cattggggtg tctggactac catccaggac ttgactaagc tcattgctcc 1621 acagctgccc ccacttagca accaaagccc tagagggcac aaaatatggg gaattctttc 1681 tagggtgaag aaaagagtca ggttttaggg aggtcctgag tccccctctc cttaccccac 1741 agtccagcct atgaccagta ctgccatgat cacttccaca acgggcactg tgagaaaggc 1801 tgcaacactg cagagtgtgg ctgggatgga ggtgactgca ggcctgaaga tggggaccca 1861 gagtgggggc cctccctggc cctgctggtg gtactgagcc ccccagccct agaccagcag 1921 ctgtttgccc tggcccgggt gctgtccctg actctgaggg taggactctg ggtaaggaag 1981 gatcgtgatg gcagggacat ggtgtacccc tatcctgggg cccgggctga agaaaagcta 2041 ggaggaactc gggaccccac ctatcaggag agagcagccc ctcaaacaca gcccctgggc 2101 aaggagaccg actccctcag tgctgggtaa gaagctaggt ggagggaagg gccagacacc 2161 agttttttta agagggcaga gggaggaaag ggagccaggg accaatacag aggtctctga 2221 ggtgcctcct ctacaggttt gtggtggtca tgggtgtgga tttgtcccgc tgtggccctg 2281 accacccggc atcccgctgt ccctgggacc ctgggcttct actccgcttc cttgctgcga 2341 tggctgcagt gggagccctg gagcccctgc tgcctggacc actgctggct gtccaccctc 2401 atgcagggac cggtaggtga ccccttgcca ctttctctga cctctgttcc caggccagct 2461 ctcatgctag caacaggcaa tggaggctga atcaaacagg acagctgaga ctgaaaatgt 2521 tctttgtggg gacttacttt ccctaacccc gctttctcta actgaatctc ccactggccc 2581 atttgttcta cagtctcctt ccttatttcc ctaagcacat tatcctaacc tctgtcatag 2641 ccctccaaca aagggatggt ttatcttctc taccagactg agaataccta atagtctttg 2701 tatcagacaa ttcatagtac atgaaagaat aataggctgg gcgcagtggc tcatgcctat 2761 aatcccagca cgttgggaga ccaaggcagg tggatcacga ggtcaggaga ttgagaccat 2821 cctggctaat gcggtgaaac cctgtctcta ctaaaaataa aaaaattagc cggctgtggt 2881 ggcgggtgct tgtagtctca gctactcagg aggctgaggc aggagaatgg cgtgaacctg 2941 ggaggtggag cttgcagtga gccgagatcg cgccactgca ctccagcctg ggcgacagag 3001 ggagactcca tctcaaaaaa aaaaaaagaa aaataactgc tatatcgtac tttgtgcctt 3061 actctaagca ttttacattg ttacctcatt taatcctccc ccacaacccc atgaggcacg 3121 tactgctggt tgagtatccc ttatctgaaa tgcttgggaa caaaagtgtt tcagatttcg 3181 gatttatttt ggaatatttg cattatactt actggttcag catccctaat acaacatcca 3241 aatgctacaa tgagcatttc ctttgagcgt tatgttggta ctctaaaagt ttcagacttt 3301 ggaacatttc agatttggga ttggggttat ggatactcag cctttttttg tgtgtttgtt 3361 ttctgagaca gtcttactct gtcagccaca ctggagtaca gtgacgccat ctcagctcac 3421 tgcaacctct gcctcctggg tttaagcaat tctcttgctt cagactactg agtagctgga 3481 attacaatgg catgccacca tgccctgata attttttttg tttttgtttt gttttgtttt 3541 gtttgagaca gagtcttgcc ttgtcgccca ggccggagtg cagtggcgcg atctcggctc 3601 actgcaagct ccacctccca agttcacgcc attctcctgc ctcagcctcc caagtagctg 3661 ggactacagg tgcccgccac cacacctggc taattttttg tatttttagt agagacaggg 3721 tttcaccatg ttagccagga tggtctcgat ctcctgacct catgatccac ctgcctcagc 3781 ctcccaaagt gctgagatta taggagtaag ccactacacc cagccactaa tttttatatt 3841 tttagtagag agggggtttt gccatgttgg ccaggctggt ctcgaactcc tggcctcata 3901 tgatccacct gcctcagctt cccaaagtgc tgggattaca ggcatgagcc actgtgccca 3961 acctcaatct atattatcat ccccattttg cagataagga aaccgaggca aagacaggct 4021 actaaacttg tccaaaggtc tcccaatagt aatcagtctc accaggagtg gcctctcttt 4081 gtgactctgt ctctcccacc agcaccccct gccaaccagc ttccctggcc tgtgctgtgc 4141 tccccagtgg ccggggtgat tctcctggcc ctaggggctc ttctcgtcct ccagctcatc 4201 cggcgtcgac gccgagagca tggagctctc tggctgcccc ctggtttcac tcgacggcct 4261 cggactcagt cagctcccca ccgacgccgg cccccactag gcgaggacag cattggtctc 4321 aagtgagaat gaggagaaac ccaggctcag gaaggggagt ctctcctatg gcgatattta 4381 caatcagaaa agataagaaa tactattgca gaagtcaaag ataggggaag gagagagggg 4441 tgggaagcct gctggaaatt ttggagaccc tgatggtcat aattccgtgt aacctctacc 4501 cacccattcc tttccagggc actgaagcca aaggcagaag ttgatgagga tggagttgtg 4561 atgtgctcag gccctgagga gggagaggag gtgggccagg tgaaagggct ggggcaagaa 4621 tggtctggag gtgatggaag ggatgaaagg gcaaatcaac cttcactgat ccttgctgtt 4681 acccaaaggc tgaagaaaca ggcccaccct ccacgtgcca gctctggtct ctgagtggtg 4741 gctgtggggc gctccctcag gcagccatgc taactcctcc ccaggaatct gagatggaag 4801 cccctgacct ggacacccgt ggacctggta tgtgagtcaa cccagaccaa gaaaaaaaaa 4861 aaaagtcctt tgaccctatt agaatcagag agtcctttaa tatcagaact agaggaaata 4921 attttagact gagtgcctta gaacaatgat tctcaaagtg tggtcctcag acagcaaaat 4981 cagcatcacc tgggaatttg tcagaaatgc aaattattgg gctccactac agagctactg 5041 actcaggaat ttaaaatgtt aggcaatctg ttttaacaag cccttcaggt gaatctgatc 5101 cagactcgtt tgagaaaacc actgctaggc cgggcgtggt ggctcacgcc tgtaatccca 5161 gcactttggg aggccaaggc gggtggatca caaggtcagg agatcgagac catcctggct 5221 aacacagtga aaccccgtct ctactaaaaa tacaaaaaat tagccgggcg tggtggcggg 5281 agcctgtagt cccagctgct ctggaggcta aggcaggaga atggcgtgaa cctgggagga 5341 ggagcttgca gtgagccgag atcgcgccac tgcactccag cctgggtgac agggcgagac 5401 tccgtctcag aaaaaaaaaa aaaaaaaaga gaaaaccact gccctggaat gtcagagaat 5461 taagctgcag gttcctttta cagaggaaga aactgaagtc agagaaaagc agaaaagtca 5521 cttggctaaa gccacacaga gccagaactt agcttcccaa cacctcaggt tttgattctc 5581 tctgagctta catgttgtcc cttccccctt gttgtgtcct ttagattgac ccattactct 5641 gtcttaccaa cagatggggt gacacccctg atgtcagcag tttgctgtgg ggaagtacag 5701 tccgggacct tccaaggggc atggttggga tgtcctgagc cctgggaacc tctgctggat 5761 ggaggggcct gtccccaggc tcacaccgtg ggcactgggg agacccccct gcacctggct 5821 gcccgattct cccggccaac cgctgcccgc cgcctccttg aggctggagc caaccccaac 5881 cagccagacc gggcagggcg cacacccctt catgctgctg tggctgctga tgctcgggag 5941 gtctgccagg ttagcacaca ctgaggtccc tacagggaat ggggcgagct tacaagtaaa 6001 gctggacaga agcatcccct agagtttgac aaggaggaaa ttggtgtgat tgggaacctg 6061 acagggaaac tgcggaggat ggctgaatat ggattgcgag tggggttaat agtgtaagga 6121 actcgagttg gcagtccaag gtaccccagg ggtcactggc cctctgtctc cccagcttct 6181 gctccgtagc agacaaactg cagtggacgc tcgcacagag gacgggacca cacccttgat 6241 gctggctgcc aggctggcgg tggaagacct ggttgaagaa ctgattgcag cccaagcaga 6301 cgtgggggcc agagataaat ggggtatgta gaggaagggg tgatgtatgc tatagagaag 6361 ttgagcagat ggggtgggag atagcgtgca aaatataggt gcagcagagg ggcattccct 6421 ctcatcctgc tgttacggcg gtcaatctga gatgcggtgg aagtacgggc cgcgtgagtt 6481 tcccccccca actcccaccc tcaacaccac actggccctc cgctccagct tactggggaa 6541 ctggcatgga acacagtgtc tgtggaaagg ggggggaatc tcgtgggggg agactgtctc 6601 ccggtctcac cgaccccaga acaatgcccc attgtccctc ccgcgcactg gtgacgtcac 6661 cagggcaaca cttcctgcag gccggtggtc tcctgggcaa cgcttcccgc ctttgaggga 6721 ccagccggcc cgaatagccc ttcccccaag gccagaaccc gtgggaaacc ggaacccagg 6781 cgtctggccc ccaactgggg taacaacctc ccacgtcgtc ccctagggaa aactgcgctg 6841 cactgggctg ctgccgtgaa caacgcccga gccgcccgct cgcttctcca ggccggagcc 6901 gataaagatg cccaggacaa cagggttaga tgggacagag ggcttcccac aaaacagtca 6961 ggcgcacgag agatggaaag tgcggtaacc cgcaaagcct gaagggatag gggccagtgg 7021 tcgcgcaagt gaaggcagaa aggcccagtc ctgtgggcgt ggccttccct gatatcggcc 7081 ctggctcttc tgtacaggag cagacgccgc tattcctggc ggcgcgggaa ggagcggtgg 7141 aagtagccca gctactgctg gggctggggg cagcccgaga gctgcgggac caggctgggc 7201 tagcgccggc ggacgtcgct caccaacgta accactggga tctgctgacg ctgctggaag 7261 gggctgggcc accagaggcc cgtcacaaag ccacgccggg ccgcgaggct gggcccttcc 7321 cgcgcgcacg gacggtgtca gtaagcgtgc ccccgcatgg gggcggggct ctgccgcgct 7381 gccggacgct gtcagccgga gcaggccctc gtgggggcgg agcttgtctg caggctcgga 7441 cttggtccgt agacttggct gcgcgggggg gcggggccta ttctcattgc cggagcctct 7501 cgggagtagg agcaggagga ggcccgaccc ctcgcggccg taggttttct gcaggcatgc 7561 gcgggcctcg gcccaaccct gcgataatgc gaggaagata cggagtggct gccgggcgcg 7621 gaggcagggt ctcaacggat gactggccct gtgattgggt ggccctggga gcttgcggtt 7681 ctgcctccaa cattccgatc ccgcctcctt gccttactcc gtccccggag cggggatcac 7741 ctcaacttga ctgtggtccc ccagccctcc aagaaatgcc cataaaccaa ggaggagagg 7801 gtaaaaaata gaagaataca tggtagggag gaattccaaa aatgattacc cattaaaagg 7861 caggctggaa ggccttcctg gttttaagat ggatccccca aaatgaaggg ttgtgagttt 7921 agtttctctc ctaaaatgaa tgtatgccca ccagagcaga catcttccac gtggagaagc 7981 tgcagctctg gaaagagggt ttaagatgct aggatgaggc aggcccagtc ctcctccaga 8041 aaataagaca ggccacagga gggcagagtg gagtggaaat acccctaagt tggaaccaag 8101 aattgcaggc atatgggatg taagatgttc tttcctatat atggtttcca aagggtgccc 8161 ctatgatcca ttgtccccac tgcccacaaa tggctgacaa atatttattg ggcacctact 8221 atgtgccagg cactgtgtag gtgctgaaaa gtggccaagg gccacccccg ctgatgactc 8281 cttgcattcc ctcccctcac aacaaagaac tccactgtgg ggatgaagcg cttcttctag 8341 ccactgctat cgctatttaa gaaccctaaa tctgtcaccc ataataaagc tgatttgaag 8401 tgttaccttt ttttggagga attggggaga agaatgggaa aaaagatggg agtgactgca 8461 taatgtcagc attttgtgct tttggctcag catttggatt ggatggagga tgtaagtata 8521 gtttaaaagc aagaataagt atatttaggg gccctatgat aatttagggt attatctgaa 8581 agcaagaatc tagtagccaa gggagaaacc gcacacacta ggtcaggggt ccccaaccct 8641 tgggccacag actggtactg gtccatggcc tcttaggaac tgggccacac agcaggaggt 8701 gagcaagcat tactgcccaa gctccacctc ctgtcagatc agcataagca ttaaattctc 8761 ataggaactc gaaccctatt gtgaactgtg catgcaaggg atctaagttt ctcgcttctt 8821 acgggaatct aatgcctaat gatctggggt ggaacagttt catcctgaaa ccagccctcc 8881 gtcgccaccg accatggaat aattgtcttc cacgaaactc ttccctggtg ccaaaaaggt 8941 aggagaccac tgcactagat gatgcacaca ctttgtccct catcctaggg cttttactta 9001 tggccactta ggagattcct aaggccacaa gtcaagtaga tggagagagt atcttgaaac 9061 tttgtccacc ttgcagcaat atgttgctag gtttgaaaca tggagtcatg aggcattttg 9121 aaagccaata atatctacag tttattaagt attcactatg catcaagtgc tttattacat 9181 tattaataca tcaaccctat gaagtaggtg ctattaaaac cctatttcac tcagaaattg 9241 aggcacagag atctgcccaa gattgcaagg aataagcggc aggaccagat ctcttcatca 9301 catttcacat tccaaatcac tcagctataa actccctaac atgacaggtt gccatttaga 9361 ggtaccaaat ggttgtctgc cctgctcctt ccttgatgcc aaccagcctg attagcattg 9421 atcaaagacc aagccagaga ggtagtcctc tcccttttca atttcatttc atttctttct 9481 ttttctgaaa cagggtattg ctctgttgcc cagcctggag tgcagtggca caaatggctc 9541 actgcagcca caacctcctg ggctcaagca atcttcctac ctcagcctct catttttcca 9601 tatctatatc tcttatgccc aaaataaact ttcccctgcc ccttgtctgc actaaactgt 9661 aagtttccaa aatgaaccct tcccgtactc tatttggtac acatcttgtc tcctgaatag 9721 agtgtatttt ttattttatt ttattttgga gacggagtct cgctctgtca cctaggctgg 9781 agcgcagtgg cacaatctca gttcaccgca acctccgcct cccgggttca agcaattctc 9841 ctgcctcaac ctcctgagta gctgggatta caggcgcatg tggccacgcc cagctaattt 9901 tttgtatttt agtaaagatg gggtttcacc atgttcccca ggctggtctc caactcctga 9961 gctcaggcaa tccacccgcc tcagcctccc aaagtgctag gattacaggt gtgagccacc 10021 gcagccggcc atctcctgaa tagattttaa atacctagag gtcagggatg attattcaat 10081 acatatatat tgaatactta ctatgtgttg gacccggtgc tagggtttta tgtatatatt 10141 tgagagctcc acatccctgg atctgaatcc tccacttccc actggaacca tgcccctccc 10201 agtcccggta agtaagaggg aagatcggga gggccaaatc ctacaccagg gtctatctta 10261 gggagggaag ggacctggct ggggggaggg ggattctgag gagtgaaacc acttcctgtg 10321 tagctagttc ctgtgttgac agaaagagca gaagaggagg tggggtggag ggagcagagc 10381 cagggattag gggactactg aggctctgga gatgagaccg ccaggagtcc ttccccacca 10441 tgagccccct ccactcctgc agctggagga gtttttccca gtctcagtgc tgccctgggg 10501 cgagagagac tgaacaagct gtttgggtgg gaagagaatg gaggaagttg acagggatgg 10561 gcggggcccg tgggggggct gaccaggaac ccagcttcct gctcagtacc caggcatcca 10621 gcccccagct cacccccacc ccttccagcc cccactcccc tcaggaaccc aaggttccag 10681 ccctcctccc aaatcccagc cacccctccc ccaccagttt ctcccctcta ggggatggag 10741 gctgagagac cccaggaaga agaggatggt gagcaggtga gctgggcacg gggttgggga 10801 ggctgacact gggaaagaag ggaggtgaga ggacctgggg cagaaatgta gggacacagg 10861 ggccttgaaa ggcttgggca aactgaggca ggaacagaga cacacagaga ggaaacgggc 10921 cactggccta gccccctgtc cactcctccc gcttcaacca ccactgcttg actagaatgg 10981 acatattttg gcatcagggc ccccctcagg atgaggaagg ctggccccct ccaaactcca 11041 ccactcggcc ttggcgatct gctcctccat cccctcctcc tccagggacc cgccacacag 11101 gtacccctac ccacccaggg agagccccga ccctagtgcc cacatcctga ccccattacc 11161 aaggcccact ccattgtggg ccctctcccc acctcctcca gactcccctt gggattcccc 11221 attgcacccc ctctcctctg atccaaagtc cctaatcacg tcaccctgtc cacactcccc 11281 cacggctcct gtctgccaac ctctctgggt ctctgagccc tccacacccc tctccccagc 11341 cctgggaccc cgctcggcct ccctgctctc cctgcagact gaactccttc tggacctggt 11401 ggctgaagcc cagtcccgcc gcctggagga gcagagggcc accttctaca ccccccaaaa 11461 cccctcaagc ctagcccctg ccccactccg tcctctcgag gacagagaac agctttacag 11521 cactatcctc agtcaccagg taagacatcc ccccaggagg caaacccagg cctcctggtc 11581 tcttggcccc tgttctcttt ggggctctac tcctgtttct ccctaggcac cccatcgcct 11641 tcacaggttt cctatatgcc tccccatacc aacccttgat cctctcaaga acctcctcct 11701 ctcagaccct caccaaagct ctccctctcc ctccactcct ccagtgccag cggatggaag 11761 cccagcggtc agagcctccc ctccctccag gggggcaaga gctcctggag ttgctgctga 11821 gagttcaggg tgggggtcga atggaggagc aaaggtcccg gccccccaca cacacctgct 11881 gagacttgag ccccaaccag cccttccttg ccactggtct caaagctggg cagcccattg 11941 catgccctca actcttgctt ggcaggggta ccagagactg aaagacacgg cacaaatctc 12001 aatattcatc tcccacatca ccttccctgg gaactggaca gggtgaaagt cctcaaactc 12061 tgggaacagg cgagatggaa cagggattta actccccgcc cacaggtcca tgggagcttg 12121 aggcagtaag ggggatccca ggcacccatc tcaaggagtg gctgggagtc ttttccctaa 12181 cttgtgggga caccaccagt tgtcaagcta ctaggcagta gggtctgagg gctcaggcct 12241 ccacctgaga ggttataacc tgagagacag ctctaccctt cctcccagta agaagggaag 12301 gtgggtgggc acctgagaga ttaagactat tctcccagtc ccactaccag cacccccgat 12361 ccctgagact gaggggttta cgggctgtga atggaccttc agccctgccc accctccctc 12421 cccactgctg ctgagtctgt ctgatgtttt ggttgtgtga ataaatataa ttcccctctg 12481 gactgcagac tggtatctgg ggggcccagg cggggtgaaa ggtaggaagg tgaggccaga 12541 ggccttttct ctccccagtc tggccagagg ccagctcccc tccccggctg gttaattact 12601 ggctcattaa gcagcggctg gagacctccc taattatctc ccccagcccc cctcttcggt 12661 tttaattaag tagaacaggg aggggagtca ttagaacaag aaatacgaac tgagctgccg 12721 gtgaacccag gcattccagc ggcctgagtc cacatcgctt agatccctga ttcaggaccc 12781 aggtgacaga cgcccccagc cgccaacaca gccccactcc taggccgcgg aagtccagcc 12841 agggggcttt cccatatctt tcagatggcc cgtctcctcc cctcatcccc tcttccctct 12901 cccctcctcc actaggtctc agttcctctg tttctgtgtc tctctctccg cccccagctc 12961 ctccctgttc ctcctctctt ctcccctcct cttcctctcc ggctcccctc ccccagcctc 13021 cctccctcgc tcccccccct tctccctcct ccctcctccc tctctctcac acacaccccc 13081 gcttgggcct cctctctctc tccggctcca ttttctccgc cgccgggggc cggggtctcc 13141 tgtggggggc ccagccggta tcccaggtct cccttcagtg ccggggtgaa cccccggggg 13201 agccgggagc cgggggcaga cgggcggggg ttggggcgga gggagcagcg gccccagcga 13261 gtttgggggg agaagtaacc aggcgggggg aggggcggag cagggagggg gcctcagggc 13321 ccccccccag ctatggacga acggctactg gggccgcccc ctccaggcgg gggccggggg 13381 ggcctgggat tggtgagtgg ggagcctggg ggccctggcg agcctcccgg tggcggagac 13441 cccggtgggg gtagcggggg ggtcccggga ggccgaggga agcaagacat cggggacatt 13501 ctgcagcaga taatgaccat caccgaccag agcctggacg aggcccaggc caagtgagtg 13561 cccccactcc gggaccccac acagacccag caaaccccgt tcacatgttc tgaatcttct 13621 gggagccccc cccaactcca gggccctctc caggatccaa cagctctctt ctctccttat 13681 tcctgggagc ccatagaaaa gtgatccctc tcaaacctcc cttcaccccc aggccctgaa 13741 accttcacag agggaacccc cggtggcccg gctccccact cctaaccttt tgccgacccc 13801 tgcagtctcc tggaacagcc ccatccccgg gagccccctc tggctcccag actaagaaac 13861 tgttcttggg ctacgttatc ttctccccta actctccacc cagccccctc attctctcca 13921 gatgtggaga cctccacacc ctctccagag cccctaaagc tcctctccac tgctcagcca 13981 gacactaggt gcatcaaagc ctcccacctg ctcagcccca ggaccccttc acacacccta 14041 cactgatctc cccagttagc tcggcacccc cagccccact ctgccacctc aaactctgac 14101 tcttctcaac cccagcttct gtctctctcc ctctgaaacc taccaagtca ctttcctttc 14161 tccatccact cccagattcc tcctcctacc tttctagacc atctcccaaa gcccgcagcc 14221 tttaacctgc tgcctgcatc ttccctgtgt ctccctgaag ctgaggagct tccccatgct 14281 ctgggagctg atcttttccc aagaactcct cattccaccc ccaactcatt ccacccccaa 14341 tccgcttcct ccctccgcag actgaccctc ctccctcctt gttctcaggc cccctgctct 14401 gtttctctag ctcctcaact tttctctttc cccactccca ctcctcccaa ggaaacacgc 14461 cctaaactgc caccgaatga agcctgctct ctttagcgtc ctgtgtgaaa tcaaggagaa 14521 aactggtatg tgggcccccc ccggattgct caactctggg aacagaaccc tgttcattat 14581 agggctagag tgtgacaact tggggccctg aggaaagtaa ggagtcaggg ggactgggga 14641 aggaaccaaa gcctgggaac ttggctctcc aggaagcacc aggaggactg agcactgggt 14701 attggggtct ctgggtccct aagtccactc gcctgcatgc taggcctcag cattcggagc 14761 tcccaggagg aggagccggt ggacccacag ctgatgcgct tggacaacat gcttctggca 14821 gagggtgtgg ctgggcccga gaaagggggc ggctcagcag cagcagctgc agccgctgca 14881 gcctctggtg gtggtgtgtc ccctgacaac tccatcgaac actcggacta tcgcagcaaa 14941 cttgcccaga tccgtcacat ataccactcg gagctggaga agtatgagca ggtaaggaga 15001 ggaggcttgg gtgggtggag ggaagggctc ttgcagggga atcccatggt caaagggctc 15061 ctcctcacca gcccactggc ccccactaca ggcatgtaat gagttcacga cccatgtcat 15121 gaacctgctg agggagcaga gccgcaccag gcccgtggcc cccaaagaga tggaacgcat 15181 ggtgagcatc atccatcgaa agttcagcgc catccagatg cagctgaagc agagcacctg 15241 cgaggctgtg atgatcctgc gctcccgttt cctggatgcc aggtgggccc agggacccca 15301 ggctggcccc cagcactggg ctccttccca ttcctctcca agaccctgag ctgccatgct 15361 gcacaacatg gtactccatg acaatggtga ctctggggtc atgccatgtg acagccctgc 15421 caggacatca acatcctcct caccgctctt ctccctcctc tgtagacgaa agcgccgtaa 15481 cttcagcaaa caggccactg aggtcctaaa tgagtatttc tactcccacc tgagtaaccc 15541 atatcctagt gaggaggcca aggaggagct tgccaagaag tgtggcatca ccgtgtctca 15601 ggtattatgg aggttgcggg aggagttgtc aggcaaagtg cacgcatctc agctaggtgc 15661 agtggtgtgt tcctgtaatc ccagctacta gggaggctga agtgggagga tcacttgaat 15721 tggagaccag cctgggcaac agcatagtga gaccaggaag caaaaaaaaa aaaaatgctg 15781 tcactcacat cttattcagt gaaggacttc agaggcaaat gtttctacct gaccctcctt 15841 tctgccccac aggtctccaa ctggtttggc aacaagagga ttcgctataa gaaaaacatc 15901 ggaaagttcc aagaggaggc aaacatctat gctgtcaaga ccgccgtgtc agtcacccag 15961 gggggccaca gccgcaccag ctccccgaca cccccttcct ctgcaggtgg atcccactgt 16021 caccccagct gactgttttg cacacttcct gcttttgttc ccacttccta tctaggcagg 16081 atcatagcag agagggggcc ttttggggtg agagggaccg agctgagata ggctggagat 16141 gtcaggggac agaggccatt ccagtgatct tagttctgcc tttcttccca cgggtggcca 16201 aggaacagcc tgctctttct gtgtgttgga atgttatttt gtggataatt ggagtatagt 16261 agcatgtccc cacaagagtt gagagttgtg gttcatcctc taccatcacg ggctctatta 16321 cactcttccc tctctgcccc cacaaggctc tggcggctct ttcaatctct caggatctgg 16381 agacatgttt ctggggatgc ctgggctcaa cggagattcc tattctgctt cccaggtcag 16441 atgcccatct cctctcgaat agggctttcc ccaactccat ttcctctact ttaggataca 16501 agacctcttt cctctgaggc ttctcttcac tgtcatacct tcctctgctg cctgcaggtg 16561 gaatcactcc gacactcgat ggggccaggg ggctatgggg ataacctcgg gggaggccag 16621 atgtacagcc cacgggaaat gagggtgagt ggatcctgaa gctcctctct gtccagttct 16681 cacaggacag aggggcattt tccctagtaa tgttgtgccc acacagggtt cccagggctc 16741 tctctgtttt ctgtactctg tctctccttt caggcaaatg gcagctggca agaggctgtg 16801 accccctctt cagtgacatc cccaacggag ggaccaggga gtgttcactc tgatacctcc 16861 aactgatctt gcccctcagg gtcacagggg tgggggctct cacaaggcga cttgaagagg 16921 acgcaggctt ccagaggaca aaccccaata caggagaagc acaagacaga gaagggccaa 16981 tggggtcatc ccctccctaa cgagactctc tgtgctgggg gtgctaatta catggcagga 17041 agaatggggc ctctaagggg agtgtggggt ctgtctctcc cttttttcca tctttttcct 17101 ctctcgcttt ctttcttaca cagaaacata cacataccga gaaacctatt tctcagaccc 17161 ctttttctcc tctgtctttc tctctccctc tcccacacct cacacacaca tactcccact 17221 tgcaactatt ctgtttctct cctgggctcc cccactttcc cttccccacc ccacttgtat 17281 gctctggaat ctgtggagac gccagccctg cccaatcaga gatgccaaaa atggggacat 17341 gacttctgga cagaggacat gggccacgcc cccatgcatc cccacccccg cccctccgga 17401 cggcttactt acctcatacg cagctcatct taaaccaata gaatcgctcg gtggacgaga 17461 gtgtctgact cagatatcta cctcggaggg agtttctgct actttaggga attattgact 17521 gggctttggg gttgaacttt ttttttttta aagaaagaaa aagaaaccct gggatccatc 17581 tgtttttttt gttgttgttg ttgtttttgt tgttgttggt ggtggtggtg gtggtggttc 17641 ttaattttta atttagtttg gggaagtagc ttgttttttt ttttataaat atgttgattt 17701 cttgtctttt ttttttattt cttactttcc catattaggg gtgatagcca aaggggttct 17761 ggtaagagaa agggggacaa acagaactgg taaagaggcc cccctggctc caggcctgtc 17821 catcaggaag taaattttac agggcaccaa gctttgcccc ctaaaatccc ttaggtgttc 17881 tttgttcatg caggcaggtt tctgccgcat ttgatgtgga ggcagtgaag ggcttgccct 17941 gctggcctct catccccctt cttcccacaa cccttgggca gggctggact cagtaatttt 18001 gaggaaattg aagatgccat cttcccctgt gagtgacatg tctttaattt tttaaaaaac 18061 tactatttga aaattggagg gggaagaatg ggaagggagt tattgccaaa tatgttaaat 18121 atgggttggg gtgcttgtat atgtatcttc ctcaatttcc ccataaatga ggtatctttt 18181 tgtcacacca aaatcaaggg gtagggagag ggaggaggtt gcaaaaagcc agatgtgggg 18241 gaaaagtaac atcaacactg tcccatcctc agccctgaac tagctaccat ctgatcccct 18301 cagacattct caggatttta caagactgtc agagtgggga acccctccca ttaaagatcc 18361 gggcaggact ggggacaggt tggaagtgtg atgggtgggg gggtgggagg catgggccgg 18421 gggcagttct ctcctcactt gtaaacttgt gtagtttcac agaaaaaaaa caaaatgcag 18481 ttttaaataa agaaatttct tttttccctg ggtttagttg agaatttttt tcaaaaaaca 18541 tgagaaaccc cagaaaaaaa atgattttct ttcacgaagt tccaaacagg tttctctcct 18601 gttccccagc cttgccttca tgatgcaggc ccaattgcac ccttgcagac aacagtctgg 18661 cctgaaccct attgatgcaa ctttgcgcaa tcaagatggg gctccagtgg gtcaccaggc 18721 agccctgatg gactgatgga ataaatagga tcgggggctc tgagggaatg agaccctaga 18781 gggtacactc cccatccccc agggaagtga ctgtacccag aggctggtag tacccagggg 18841 tggggtgata attatttctc tagtacctga aggactcttg tcccaaaggc atgaattcct 18901 agcattccct gtgacaagac gactgaaaga tgggggctgg agagagggtg caggccccac 18961 ctagggcgga ggccacagca gggagagggg cagacagagc caggaccctg gaaggaagca 19021 ggatggcagc cggaacagca gttggagcct gggtgctggt cctcagtctg tggggtgagc 19081 cactccctca accccactga ccctccctgc agaaagcact ttaaccccac accccagtcg 19141 tcctagaact tttcccagaa cccgaggaag tgcctttcaa ggtccctcac ccaccctgtc 19201 caaattttgt tagccctcat tcccttccta cccctctacc atggtgctat ctcccagggg 19261 cagtagtagg tgctcaaaac atcacagccc ggattggcga gccactggtg ctgaagtgta 19321 agggggcccc caagaaacca ccccagcggc tggaatggaa actggtaagc ggggctcctg 19381 ttgcagcctc ccaacttcca gggagaccag caatgatttg gatccccgtc actctgcctc 19441 acagtccttt cccaaaggcc ttgcactgtt taggccctgc ttctctgctt ctagaacaca 19501 ggccggacag aagcttggaa ggtcctgtct ccccagggag gaggcccctg ggacagtgtg 19561 gctcgtgtcc ttcccaacgg ctccctcttc cttccggctg tcgggatcca ggatgagggg 19621 attttccggt gccaggcaat gaacaggaat ggaaaggaga ccaagtccaa ctaccgagtc 19681 cgtgtctacc gtaagaattc cagggtcttc tccaaggcct ccctcttacc taagaaaaag 19741 ccttcaaccc cagccttggc ccatgagggc ctctgacttc cactggcctc atttccacac 19801 acagagtttg agaaccttca caattacagc ctctgactgg atttttcctc cttcagagat 19861 tcctgggaag ccagaaattg tagattctgc ctctgaactc acggctggtg ttcccaataa 19921 ggtagtggaa gaaagcagga gaagtagaaa acggccctgt gaacaggagg cgagtgtgtg 19981 tgggtgtggg tgtgtggcat ctctcatttt caaaggattc tgaggtcacc actctttccc 20041 caggtgggga catgtgtgtc agagggaagc taccctgcag ggactcttag ctggcacttg 20101 gatgggaagc ccctggtgcc taatgagaag ggtgagtcct aaggtgcccc ccaagctgcc 20161 ttctccctga tctcactccc acacccaccc tgggataatt tgtcttatcc tcccatcata 20221 ggagtatctg tgaaggaaca gaccaggaga caccctgaga cagggctctt cacactgcag 20281 tcggagctaa tggtgacccc agcccgggga ggagatcccc gtcccacctt ctcctgtagc 20341 ttcagcccag gccttccccg acaccgggcc ttgcgcacag cccccatcca gccccgtgtc 20401 tggggtgagc ataggtgggg agggccccaa gctcacgtga gcacgttctg gaagtctgac 20461 ccttagggaa agagggagtc aagcccatgg ccactgggat cactcacaag tgtaactctc 20521 cacctcaaaa cccttccaac tcccagagcc tgtgcctctg gaggaggtcc aattggtggt 20581 ggagccagaa ggtggagcag tagctcctgg tggaaccgta accctgacct gtgaagtccc 20641 tgcccagccc tctcctcaaa tccactggat gaaggatgtg agtgacctgg agagaggggc 20701 tgggaggtag ggtgaaccat aactagcaac agggagggca gagggctaac gagggaaagg 20761 caggctagga gctgaggagg aagagagggt atctgaagat atggagacaa aaagacaagg 20821 gttttgaaat agtctcctct ccccttcccc caccagggtg tgcccttgcc ccttcccccc 20881 agccctgtgc tgatcctccc tgagataggg cctcaggacc agggaaccta cagctgtgtg 20941 gccacccatt ccagccacgg gccccaggaa agccgtgctg tcagcatcag catcatcggt 21001 gagacctctc cccaagccct acagaccctg ggactagggt gcaggacagc acaggctcta 21061 atttcctgcc ccattctggc cttatcccta acagccaccc cacctctccc tccatgcacc 21121 cacacccaag cctcccctac cccacccaaa ttctgccaag agagcagcca agcctctccc 21181 ttcttccctc tgagctaaaa aaaggaacag acggctgggc gcggtggctc acgcctgtaa 21241 tcccaacact ttgggaggct gaggcgggca gatcacctga ggtagggagt tcgagaccag 21301 cctgaccaac atggagaaac cccatttcta ctaaaaatac aaaattagcc aggcatggtg 21361 gcacatgcct gtaatcccag ctacctggga ggccagctac ttgagaggct gaggcaggag 21421 aattgcttga acccaggagg catagattgc gatgagccaa gatcgcacca ttgcatgcca 21481 gcctgggcaa caaaagtgaa actccatctc aaaaaaaaaa agaaagggaa agactccact 21541 ggggctccca ctaaataacc ctctctcaac ccgaagtctt cctttctgac tggatccaac 21601 tttgtcttcc agaaccaggc gaggaggggc caactgcagg tgaggggttt gataaagtca 21661 gggaagcaga agatagcccc caacacatgt gactgggggg atggtcaaca agaaaggaat 21721 ggtgagtggt ggtggctgtg ctctcaattt tccctgtctc cgtacaggct ctgtgggagg 21781 atcagggctg ggaactctag ccctggccct ggggatcctg ggaggcctgg ggacagccgc 21841 cctgctcatt ggggtcatct tgtggcaaag gcggcaacgc cgaggagagg agaggtgagt 21901 ggagaaagcc agacccctca gacctagggc ttccaggcag caagcgaaga ggggtcgggg 21961 ggtggaacga caacgtgccg cattcccccc aatctttctc ctcaggaagg ccccagaaaa 22021 ccaggaggaa gaggaggagc gtgcagaact gaatcagtcg gaggaacctg aggcaggcga 22081 gagtagtact ggagggcctt gaggggccca cagacagatc ccatccatca gctccctttt 22141 ctttttccct tgaactgttc tggcctcaga ccaactctct cctgtataat ctctctcctg 22201 tataacccca ccttgccaag ctttcttcta caaccagagc cccccacaat gatgattaaa 22261 cacctgacac atcttgctct tgtgtgtctg tgtgtgtgta tgagacacaa cctcacccct 22321 atacccttga gggccctgaa ggaaagggac tcacccccat acttcaccat actataccaa 22381 acatctactc aagttgggga gaagatgctt ctgtcggggg tgggggcgaa cttgggaaga 22441 gatcccatca atatatttca ccttttttat tgaatttgta ttaaaggagg tagtgagggg 22501 gcggaagcac ttaagagtca gaatccatat tagactctgg ggagtgaaaa attaaattaa 22561 atcagtaaga tggggagtgg gggaagagtc agagggaact ttgcccacct ttgaagatca 22621 aatcaagaaa tcagggaaag caaagactta ggagaggaga aagacattct ctcaatccat 22681 cctccttccc cagggcagag aattaaacaa cgttactgag tgagcctctg agcagaaggc 22741 tctcccatct atgcacagac ttcactcctc ctccccaggc cttcctggac aatgtccagg 22801 gctggcctta gccaacagaa atagaggggt caagggggtc caggagtacg gaagggtcag 22861 cagggaccct caatactgat tcttctctgg ctggaggtgg gcaggaagca gacatagctc 22921 aaatactgag cagccaaaaa aagaagaaga tggcgagaaa caggaagagg gaatcctgcc 22981 agctggaggc tgggtgaccc tgtcccagat ccacacctgt gggagagagg aaagctgtgg 23041 aagcatatgc tcctaggctg ggagggggcc tgaggggatt cacagggctc cctgatggga 23101 gctgagtgtg actcttacct gtaccccggc ggaaaggctc atgggcattg aagacggtgg 23161 tgaaaaagcc aaagggaaaa gcaccaacac caaatgagaa gtggaagccc ccggtatcac 23221 caaatggctg gaatccctag ggaggcagaa aaagtcagac gggaagccgg caaatctgtc 23281 aaggaaggga cacaactgga caagaagact cacccctctg ctctccggag ctggtctctg 23341 gccctggggg cggggtggag tttttaatct gaggaagtgg agagagaaag ttaacaggga 23401 tttttctcct cccatcttcc acaccgtttt ccaagggcag aagccttcaa tcttccctaa 23461 gcaacacctc cagtctctca cctgggatcc tggggcttct ggctccctcg cccataaagc 23521 gggacaacct tctctctgct gatcccagct ttacatactg gacactcttg ccgttctggc 23581 cgtgtctcca gccactgggg agaaaaaagg tggtttccag tatacaagag ggtcttacag 23641 ctcctcagac ctccccattt ccctcttcat ctcctgagta cgcacctgat gaagacatgg 23701 ccaactggat gggggagaaa aaaaaaaaaa ggtcaaacta gctacagaaa agagagacac 23761 agaccctaga cttcgcagaa tcccatctaa cccctcttcc caagcaacct gctgttgctt 23821 ttcagatttt ctgcaacctc taccatgcca gccaacttag ttagccttcc tgcttgtctg 23881 atcttccaac acctaaagct ctgtccatcc tcaacacact caaccctctc ctttctcctc 23941 tccccaacaa acacatacaa attttcgtgc cctctctttt ctgcctttca agttaatttc 24001 taatttcctt cagccacctc tttctgggtc tcctcttttc aaccccaacc ccatcactcc 24061 aaaccaaacc cctttactag cacattcccc cattactcac tttcaagctc aataatgtcc 24121 ctatctttat gaccctttaa cctttcaagt ctgcctctcc acagtgccct tataccagcc 24181 ccctcccaga tctcatctga atgtgatcca tatttcctgg ttctccccga ctcaactgat 24241 gcgtgcctcc cttaaccttt gtgtctcact tgtttccacc tgcacagcta agacccctca 24301 cttctctggg gtaaggtggc tcgggtctca cattgtcctg ccactccccg ccccaccttc 24361 tcttctcagc acatcacgtg cctcagctcc tggttcctaa gacctttctt tccacagatc 24421 tcgaccgtta tactcccacc cacacatacc agcaaagtct tatgtctcct gtcgggcttc 24481 acctatggga acgtgccctc cgattatctg tatgactgta tgattattcg ctcctagcct 24541 ctccagtata taagcgagac ccaccacctc ccgcccccct cctcgattct caccagtaca 24601 ggtggccaca cacactgacc acagcttccc gagcagtctc caaacatata ttacattcga 24661 aggtcgcgcc cgccccgccc cgctcgcgat ttggcccttc ggggcccccg tcctcctcct 24721 ccgctgctgc catggccggt tttgtttcgc cccacgtacc cttcagtccc cccaaataca 24781 catacacacg ccccaacaaa ccagaaacca cctcctgccc acgatcgttg ggcaggcttc 24841 aaggtttcct aatcactatt ggtctgaatg cctgccagtc acaaagaatt caaaagaaag 24901 gattggccca aagggtaggg gcgggaaaag gttagtgcag tcctgccttc gcacaatggc 24961 tattggctga tacggtctaa gtcaatgtgc aatgccaagg gattggtaat aactcgctac 25021 acgctgtcgc ctggccaagg agggctttat tcgtctgagt agttgtcagt cataaccaaa 25081 gccataagca atttgctcgg gactacctat agacctcgcc cactataagc ccctttcttt 25141 ccttcgcttc ctcttttaga gaatgtccgg attgctattg gactttggag cgtatggctc 25201 caaatcaact cattggctaa aacttgacgg aaaatggtgg ttaggtaaaa cgcgcctgcg 25261 cagcacgcgg cgggacgggg gtgggccaat cctgtgaggg tttaaccttc tcttgttcca 25321 cctcttcacc cctatcttgt cgccatggtg actgctctac aattggcgag gcttgcactt 25381 caaagtccta ggctcgcttc atccgggtcc ttcagctgtg gactttctgc tgattgggcc 25441 ttttcctttt cccctgattg gccgacatcg ggaaagacgg cgaagagcta ggaaaagagg 25501 gaaaacacta gggtcgcagg gttcaaaatg gctccaacct cctttggtga cgtagagagc 25561 agaacttggg tctgcccctc ccttttagtt aagggagcag aactgggatt agcccgacgt 25621 ttggatagtg ggaacatcga tctgcggcgc tggtgttaac ccaactcatt cggctggacg 25681 actcagccct ccccatatta ggtgatttac agagcaaaac tgaactaaag gcccacccct 25741 ttcttaatgt tgtacacaga gtagaacagg attgacttca actccgtttt aaaccttcag 25801 agcaggaaag ctctgggctc aacccctttg tgagtggtgc aaaagggaca aagcccgccc 25861 cttttaagga gacccgcgga ggctagaccc gccctttcct ctttataatt tgcccatcag 25921 aaatagggtc ttcttcccag gttggacccc ggggagtttg ggcttttcct acaatcactg 25981 accctcactg tgactaaagg agcagaatta ggtaacagtc ctcccactac caatcctctt 26041 cccgagggca tgtaaactaa tgcagggtaa aggtgtggct agagggggga ccttgataaa 26101 agatcccatg tgactcaaga gtaaggaaag atgagaagtt agcagttgcg taaagaagga 26161 ctggggcaga tgaggattca ggaagcttga ggtttaggaa ggaagatatt gagagggaaa 26221 ggtggaaatg aaggagagtg aagtgatgga atgatcctag taaagggata atgggagtgg 26281 aggaagagaa gagggggtgg aaaactagat acatggctac caaattaagg aggcacgcgc 26341 attccagagg aatcggcatt cttcctcact ttttattttt ctagaaagca cccctgaagc 26401 caaatttcca ttggaagaaa agatgtaccc atattgtatg ttgtgagaag gggttgtctc 26461 agcttgggca agtaaggaga ctgatacgaa ggaagtagga aagaaaaggt acagaggtaa 26521 aagagcatgg aaaaggaaag ggtcagggat aaggccaaag agatctcttc tctttaaagg 26581 ccagagaagg caggtggagg ggggagctgg actgctggga gatagtgagg gacaaagggc 26641 aaaggaaacc agaccagagg actggagagt gagatggagt gagatggagt cctggagaga 26701 aaaagaagag aggtgaactt aatgcttgtc atatggtagg tagatgcttg ataaatgttt 26761 agaattgaat gggtacggga aaaggggtcc ttaagaatag ttggggggaa taagcagcag 26821 ataaccggag ttgagaaaaa aagagaccaa gtaaaagtgg cagttaaaag agagctgatg 26881 gagaataaag gaaggaatgt gcggaaggag gaatacagca ccaggggatc cagagctgaa 26941 agggagttag agaaaaaaaa gatgcagctg gagccagaga tgggggcaaa gaccgaggga 27001 gagccctggg ccggggctcg caagaggaca ctggtagatg tggggaggag atgccagagt 27061 ttctgggaga cgattggcaa aacaggctgc ccatcaccgc cctccacttc ctggccggcc 27121 ccggaaacca gcaggcgttg gggaggggtg gcgggggaat agcggcggca gcagccccag 27181 ccctcagaga gacagcagaa agggagggag ggagggtgct ggggggacag ccccccacca 27241 ttcctaccgc tatgggccca acctcccact cccacctccc ctccatcggc cggggctagg 27301 acacccccaa atcccgtcgc ccccttggca ccgacacccc gacagagaca gagacacagc 27361 catccgccac caccgctgcc gcagcctggc tggggagggg gccagccccc caggccccct 27421 acccctctga ggtgtgggcg ggaaagggat gggaggagga gggaagaggg tgctgaaagc 27481 gactaggatg aggggaaggg gagagattgg gtctgggagg gccgactggg ggagagggtt 27541 gctggggaaa ggagaggggc cgactgggaa gagggttgct ggggatagga gaggggacct 27601 gagagggagg aaggatggaa gagacctggg agggaggaga aatggaaacc cttgtgaatt 27661 tgggactggg agcgtgcaca gggaatcctg gagagggaat tccctacacc ttccccaatt 27721 ccttttcttg ccctttgacc ccacatgact cttgaagggt catgagggga gaaggccagc 27781 agaatttgcc tcttaggaat acccttaggt gcctctgttt ccatctaggc acaggacctc 27841 ttgtttctca gtggccttcc acactgctag acccttactg acacacaaat gccttatggg 27901 agccatgttt tctacattga gtctgtgtgc ctttgacatg tttaatggct tgtgtgcaac 27961 taggttgtcc caatgctatc cataggctgt gtagaaatgg tgtgttattt tctatcagaa 28021 ttgcccattc ttcattcttg tgtccatgtc tcacatccag ttttgacatg ttttaagtac 28081 cgcatgtgtg tgagttttca tatattgcac ctgttctata atttcatgtt acttgcacat 28141 tttatgtttt ggcatgttta tttcagcatg tgaaggttat ataccttatt ttgctttggc 28201 tgacatgtcc atggtcctac catttgcagt agtcttcatg tgtgggatcc catggcttgg 28261 ctgaataccc cacactctga tgtctgactg aattggcctg tttgctgtgt tttcccagtc 28321 acagttcaca gaacacatgt gtatgcgcct ttgcatgata cactgatgta acaggaccat 28381 agaatgtgtg ttataaattt gtcatcagta tattttgtga gccgtatgct cattaaattt 28441 ggccctccat tgcatttcta aatccttgga cttttgttct ccaaagaggg tcacttaata 28501 tcaagtgtta agagaagaag gtaactgggt ctccaggtct gcaaagaacc atccctgcat 28561 gccttacctt ggtgacctcc ctggcccata cctctctaca caaacattat ctttccagtg 28621 gctgtgtaca gtctgtgtcc atgagctcaa tgcatgtcac agggtcaatc ctgctgtgaa 28681 ccccattgtt ggtatttatt tatggacatt atcctccatt ctttgcactg ttggcacaca 28741 tttgatgaga gcagcatctt tccctgtggc atcttgatcc cattcggtac atttctcttg 28801 tgagatgacc tcttcctgat tattgttact ctgccttcat tatggctatg tattgcatgt 28861 atctattcag agtcggttac cattaggctt ggtgtgttcg ttactttctc agtgacttct 28921 tttagtagtc acttctactc aagaggataa ctatctaatt tgtgatcaga accgccatct 28981 ctgtcattaa ctgtggctct atgggtgggt atacagcctt agaatctgtt cagcaagtgt 29041 ttattgagca cctactccat ctcctattgt cctggcactg gagataagac agagtccctg 29101 tccttaagct gcttacagcc taaggaggga aacaaaaacg ccagtcaaca catagtgtgt 29161 tgtcaagatc agtggttctt aaattcaggc gcacatcaga ctcaccagag ggcttgttga 29221 aatacagatt gctagccagc cacgatggct cacacctgta atcccaacag tttggaaggc 29281 tgaggcagga ggatcgcgtg agtccaggag ttcaaaacca gcctgagtga cagagtgaga 29341 aaaagaaaaa cagattgctg gacctaatgc ccagaatttc tgattcagta gatctggggt 29401 gaagtctaat aatttgcatt tctatacttc gagacccgct gatcaagata aaagtgtaag 29461 gaaagcacag gcctggaggg gttcaggcag ccctccaaaa ggtgacactg agctgtgtga 29521 ggcaggagaa gagacaggca ttccagccaa agggaacagc atgttcaagg tggggaagca 29581 tgaaagatca tggtgtctga ggaactgaag tgaatcagtt tgactggaac aaagagtttt 29641 gtgaggatgt ggtcgaagat gtaagcagaa gtcaacttat caagaagagc ctttaggcaa 29701 gacagggaaa tgctctgtgt ttcagaaaaa tctgtgctaa cagaaaaatc tctgtggttg 29761 ctacgtggaa gatggattgg agggagttgg gagcctactc cacatagttc aggggagaaa 29821 tgatgctgtt ctgaacttgt aggggcagtg ggatggagca gctcagaaag cctacggtat 29881 tccaactggc agggtctcct gtttcctcta ttgcctgtta ccttctcgct tggcaatagg 29941 cttacctttg agcatagccc ttcccatcat gggaagacag tgcctgtggc ctcagtagga 30001 atgacaggta tttgcctgaa cacccttttt gtgaattgtt accctgcccc caacactggg 30061 gcagagtgga ggaaggagga agaacctaga acacaggttc tgtgttcctg cctctcttcc 30121 tcttgagccc tttcctctcc cagggcaagt gctgttaggt cacctttact ccattccctc 30181 cttttttcac ttggtgaggc ctcacacact gtacctgccc acgcaaagtg tcactagaag 30241 gaagggaaag ggtagtagga ttcgtttgcc tgtctggagg taggattggt ctttgtagct 30301 attccaggta tgtccataag tttacctagg aataggggag ctgcctgggt ggagagggat 30361 ttttctagtt atgcatttac attcttttat ctgtcactgg gtgtaattat aaatttgtgt 30421 ctatgtgtga acatgttagt ctttgtatga ctgtgtgtct gttggcatta gtgacacgaa 30481 cttttaatct tgccatttgg cccttgggta tatggctgtg agtgttctgt cacaatcacc 30541 atatatgctg tgtgctgtgt tcgtatatat atatgcaata cataccagtg tcagcgtaat 30601 ggagtggtta ggaacacagg cggcttagat ttaacctaga aactgctgtt taggagctgt 30661 atgacctcag gtaagttatt tagcctccct gggcctattt cctataaaat gtaaatagta 30721 atagtacttt ctagactatc atatgcatca ttttaagagt ttaacttaat gtatagacca 30781 gtactgttct acagaaatat aatgcaagcc acaatgtaat ttttttatgg tagccacatt 30841 tttacaaggc aaaaagagtg aaattaattt tagtaatata ttttcttgaa tctgataaca 30901 tccaaaagat tataatttct tttttttttt ttggaaatgg agtctcactc cattgcccag 30961 gctagagtgc agtggcgtga tcttggctca ctgcaacctc cgcctcccgg attcaagcaa 31021 ttctcctgcc tcagcctccc gagtagctgg gattaaaggc atgcgccaac aggcccggct 31081 aatttttgta tttttagtag agacggggtt tcaccatgtt ggtcaggctg gtcctgaact 31141 cctgacctcg tgatctgccc acctcggctt cccaaagtgc tgggattaca ggcgtgagcc 31201 actgcgcccg gcccaagatt ataatttcaa aatgtagtca gcataaaaga ttagtaatgg 31261 atatctcaca ttttgttttt tattcagtct ttgaaatctg atgtgtattt tacatttcca 31321 gcacatctca gttcagacta gctgcatttc aagtagccac atgtaggtgg tggctacttt 31381 ctcggacagc acaagtatag accattataa gacccttacc agctacaagt gttagctatt 31441 attcttgttg tcatttatta tcaggtatct gtgaattgta gatgtctgtg tcttgtgtct 31501 cttgtctgaa tatatccgga gcctttggga agagtggtgg gagagcagtc ctgagctctt 31561 tctccaccac cctcatccta gagagccttc ctgggaaggt ttcaatgaga cccctgcccc 31621 agtttgtgtc tcaggccctt gtcctcatag caccagcccc cagccctgcc ttctgtgcct 31681 tgcctacccc actctcctcc agaaaccagg ctgattgtcc cttgccccat cccctgcagg 31741 tggccagaat ggatttgtgg ccaggggcat ggatgctgct gctgctgctc ttcctgctgc 31801 tgctcttcct gctgcccacc ctgtggttct gcagccccag tgccaagtac ttcttcaaga 31861 tggccttcta caatggctgg atcctcttcc tggctgtgct cgccatccct gtgtgtgccg 31921 tgcgaggacg caacgtcgag aacatgaagt gaggggcaag gggtcttggg caatgaggga 31981 acctaagggt acaaagtgag tagtggattg ggggaagggg gcatggtgtg tgtagaaaag 32041 actgagagag accagagaca gggaatgggg agaggactgc aaaggtggtc agaaagacag 32101 taaggtgggg ggagctgagg catgcagatg gacatcaatg gatcccactg ggaccccttg 32161 ccatgacccc acaggatctt gcgtctaatg ctgctccaca tcaaatacct gtacgggatc 32221 cgagtggagg tgcgaggggc tcaccacttc cctccctcgc agccctatgt tgttgtctcc 32281 aaccaccaga gctctctcga tctgcttggt gagaccccac cacagggcac acctccccca 32341 gccatgcctc ccctcctgaa accttcccta gaatatcttc tcctagagat cctcaattcc 32401 ccttcctctg ggacattgcc cccttgcctc ccactcaggc cttcattccc tgggtagaac 32461 tgccctcata agcagggtac atatactttt ggtcaccctt tccttcactt ggggcccccc 32521 tccctgccta gtctcctcct tcacctccag tccctaccag agggtgatga gctgggtgag 32581 gtgggttgcc ttctgtgaca ctctgcctcc accccgatcc tcacccactc ccaccctgcc 32641 caagggatga tggaggtact gccaggccgc tgtgtgccca ttgccaagcg cgagctactg 32701 tgggctggct ctgccgggct ggcctgctgg ctggcaggag tcatcttcat cgaccggaag 32761 cgcacggggg atgccatcag tgtcatgtct gaggtcgccc agaccctgct cacccaggac 32821 gtgagtcatc ctggggaaat gggggattgg agggatacag agtagaacag ttgtaaataa 32881 actgatatgc agggccagtg ggcctcaaag gtcccattat aacatcacac ctattctgac 32941 tcctccatat gtatttgtct tctttgaccc tctttctccc ccaggtgagg gtctgggtgt 33001 ttcctgaggg aacgagaaac cacaatggct ccatgctgcc cttcaaacgt ggcgccttcc 33061 atcttgcagt gcaggcccag gtgactactg ctcttcgttc tgctactcag ctgccaaccc 33121 ccaccattcc ctcatctctg ggcaggggct tattgtagga gtctctgaag agagctgtgg 33181 actgacctgc tttaaccctt ccccaggttc ccattgtccc catagtcatg tcctcctacc 33241 aagacttcta ctgcaagaag gagcgtcgct tcacctcggg tgagggcttt gagcagttct 33301 ggggtagggt gtgtccggag aggctgggag gacatccctg tgaggcaggg ggatcattca 33361 gtgtcagagc catgagatgt ctacacagtc atctagtcta accccacatc agccaataag 33421 tctttactaa gcacccacca taccctgcca gatgggtagc acttggtccc accaagagag 33481 gctgttacta atcttaacag gaaagataag gcctgtgtgc acaaagctgt aatgaataac 33541 actcattcag cagtaaatgc caaacccaga ggaggggggc tggaggggtg ctgaggagat 33601 gtctgaactg gggattggag aaggctttgt ataggagaag ggcctcagaa gtggcagctg 33661 gcaagcccag ggatggttgt ccagggttgg gggaagagaa ctgaaaggtt gaggaagagt 33721 atcactcgga agctgggccc cacctgtggg caaagacctg ggtggacagg ccatgatggt 33781 gctccccttg ccccaggaca atgtcaggtg cgggtgctgc ccccagtgcc cacggaaggg 33841 ctgacaccag atgacgtccc agctctggct gacagagtcc ggcactccat gctcactgtt 33901 ttccgggaaa tctccactga tggccggggt ggtggtgact atctgaagaa gcctgggggc 33961 ggtgggtgaa ccctggctct gagctctcct cccatctgtc cccatcttcc tccccacacc 34021 tacccaccca gtgggccctg aagcagggcc aaaccctctt ccttgtctcc cctctcccca 34081 cttattctcc tatttggaat cttcaacttc tgaagtgaat gtggatacag cgccactcct 34141 gccccctctt ggccccatcc atggactctt gcctcggtgc agtctccact cttgaccccc 34201 acctcctact gtcttgtctg tgggacagtt gcctccccct catctccagt gactcagcct 34261 acacaaggga ggggaacatt ccatccccag tggagtctct tcctatgtgg tcttctctac 34321 ccctctaccc cacattggcc agtggactca tccattcttt ggaacaaatc ccccccactc 34381 caaagtccat ggattcaatg gactcatcca tttgtgagga ggacttctcg ccctctggct 34441 ggaagctgat acctgaagca ctcccaggct catcctggga gctttcctca gcaccttcac 34501 cttccctccc agtgtagcct cctgtcagtg ggggctggac ccttctaatt cagaggtctc 34561 atgcctgccc ttgcccagat gcccagggtc gtgcactctc tgggatacca gttcagtctc 34621 cacatttctg gttttctgtc cccatagtac agttcttcag tggacatgac cccacccagc 34681 cccctgcagc cctgctgcac catctcacca gacacaaggg gaagaagcag acatcaggtg 34741 ctgcactcac ttctgccccc tggggagttg gggaaaggaa cgaaccctgg ctggagggga 34801 taggagggct tttaatttat ttctttttct gttgaggctt ccccctctct gagccagttt 34861 tcatttcttc ctggtggcat tagccactcc ctgcctctca ctccagacct gttcccacaa 34921 ctggggaggt aggctgggag caaaaggaga gggtgggacc cagttttgcg tggttggttt 34981 ttattaatta tctggataac agcaaaaaaa ctgaaaataa agagagagag agatctgggt 35041 gttggtggtt gcatttgtta aggaattgaa gcagttcttg cccaggcaac ctgcccccag 35101 ccagaagact caggggcagg ccaagaacac aggcctcccc ctttcttcag ctctctgaag 35161 tttccattgt tcattgctct ttggtggctg atagccttat ctgcagctca cagtcggcca 35221 atcccagagg attagtgggt ccggtttctg tataaattag ggggcagggg tgctgtagag 35281 gcttcttatc gatgattgac gccgaggccc aggctgttgt cctcacagga gcctggttaa 35341 tgacatggca gacacagtgg ctgtggtcag cctggagtgg actacactgc cactctcacc 35401 aaacaataag tgaaactgtt gggctgggga caggatttca gaagagaacg atggtaaagt 35461 ggagaggcat gaggatagtg aatgttggag aggggcttgg aggaaagagg gaatgcctga 35521 atggagaggg gtcttgggga aagttgggga atagaagtca aggcgggagg agtgtgagga 35581 ctcacaggca cctagcctct cctccagcag cagcacctgg tcgctgagag attcgatccg 35641 gtcaccccgg ccccacagct cagccacctg ttctggctgc agctcttcag gcggcacggg 35701 cagcaccgct ctgacccagg ccccagcctg accggcccac tagaaaggaa gagatgcctc 35761 agggtattga cagtgacgtc tggcctcgcc ccacccagca ggcttggctc acctgctcca 35821 gccgctccag gcgccctcgc agctcgtgaa tctcctgctt cagagcgcgc tcatcttttt 35881 ccgcctcccg aactagggac aaaggagacc aagagtgtga ccactacccg agcccggacg 35941 cccgtcccga gtccctcggg tggcccgtac tcctgcccac tcaccggcca cgctgagtat 36001 gctggcactg gttgggggct ctggggaccc ctccatgcag gtgcgcccgt ccacgcctag 36061 cactaggtca tgggggcagc cgcaggtgaa gctgcctgcc gtattaaaac aatggtgcga 36121 gcagagggtg atgctggtcc tacattcatc cacgtctagt taccgaaaaa aggaaggggc 36181 tgagagaggg ggcgggggca agcacctggg taggtgggga ggacaagctg actcacccac 36241 atgacagtgc ttccctcccc agccgggggc gcactcgcac tggtcaggcc taacgcagac 36301 gcctccgttc aggcaaggct tggcgcagat ggctgaggac agaggaagtg gggttcagac 36361 tcaaaccgac gacccagctc cccagctccg tggggcgcgc ctcccgcaag gcccggaaga 36421 cccagcctca ccttcacagg tgagcgcccc cgggtgccgc ttcttccagc cctggcagca 36481 cactgcatgg gtctgctgaa cctcccgcct cacctcccgc cacataacgc ggtacatggt 36541 cctggaacac agcgcccggc tcaggaccct gagtacgggt cctagttggg gttcttgggg 36601 tcccatctcc ccatccctca cctgtaagtg ctgcagatgc gcctcccagc gcacaaggtc 36661 aggtagggct tgtacactgg ttggctgtag gactcgttgt agtggagcgg gaccaccagt 36721 gtctgcttgg agcagactcc ctgactgcgt ccatgccaga ggatgaggtg ggagagattg 36781 agtaggccag gccctgggga gccaggagtc ccacttccct caagaggcta ctgaggcccc 36841 ccgctctccc tccagatgac agcctctcac ccccattcta aacttacagt tatgttttgc 36901 ctcctgtgaa gccccacccc cagcagaagg ctcctgagaa gagctcaccc cgggccctac 36961 cccctctgtt gtcacctctc tctgagggat ccacccttgg ccccctcgcc tggtatcagt 37021 agcaggagga aggagaatcc gcctaagaga gtgcacagct cagccctgga ccccatgatt 37081 cgctttgacg ctggacccta caggctgcag gcaagaaaag gttaatggat gcccgtcccc 37141 tctcctatta atttctccag cactagtccc tccaagggca ctctgcaggt accctctaag 37201 ggagtcagga cattcacttt tacatactag ccaccaggat tgcctacacc tgtgtgtaca 37261 acccaacact atcctgtcct tagcatatca tgatcctttc agcatcataa aagctcacac 37321 cccagcacac tccctccacc tcccctctaa cctacttact tctaatcccc tctgcacaac 37381 ctggagggac acacagtcaa ccctcccctt atgaccctcc tgtctttttt tgggtttttt 37441 tttgtttttg tttttgagaa ggagtttcgc tcttactacc caggctggaa tgcaatggca 37501 tgttcttgcc tcaccgcacg acctccgccc cccaggttca agtgattctc ctgcctcagc 37561 ctcccaaata gctgggatta caggcatgcg ccaccacgcc tggctaattt tgttttgttt 37621 tgttttgtag ggtgtgaggg tatatagcta gggttttttt tttttggttt tttttttgtt 37681 gttgtttttt gagacggagt ctcgctgtca ccctggctgg agtgcagtgg tgcgatctcg 37741 gcttgctgca agctccgcct cccgggttca tgccattctc ctgcctcagc ctcccgagta 37801 gctgggacta caggcgcctg ccaccacgcc cggctaattt tgttttgtat ttttagtaga 37861 gacggggttt ctccatgttg gtcaggctgg tctcgaactc ccgacttcag gtgatccgcc 37921 tgccttggcc tcccaaagtg ctgggattac aggtgtgagc caccatgcct ggccaaccct 37981 cctgtcttta acatgccctc ttataacttc ataccttcaa aaccctagct ggttgggcgc 38041 ggtggctcac acctgtaatc ccagcacttt gggaggctga ggtgggtgga tcatgaggtc 38101 aggagttcga gaccagcctg gccaagatgg tgaaacccat ctctactaaa aaatacaaaa 38161 aattagccag gcgcagtggt ggacgcctgt aatcccagct actcgggaag ctgaggcagg 38221 agaatccctt gaaccctgga ggcagaggtt gcagtgaacc aagatcatgc cactgcactc 38281 tagcctgggc gacagagcaa gactccgtct caaaaaaaca aacaaaacaa acaaacaaaa 38341 aaaaccccta gctatatacc ttcacaccgt acacacaaac caagcacctg gaaactccac 38401 acctttcaca cactgctact cccctcacat acccacaccg tcacataacg ccctaaatgc 38461 acatcccttg ctccaacaaa acaccccgca actcatgccc accctaaggc tctgagtaaa 38521 ccccactctt tccccatttg aaattctctc cccacttgcc ttcctctctc tctccattcc 38581 cacctggctt ctttctcctg ggaggcttca agcagaccag cctcagcaga agcagctcag 38641 actggtgggt gggcctggca ggctaagaag gagaggaggg gctgggccag agagtcctcc 38701 cattcctgcc ccctcccaca agcctcctcc ttagctccag cagggtcagc tcagtagggt 38761 caagtcccac taccctcatc cccaccccag caaagggctc cctagaagta tctttccaac 38821 cctctgaggc ccctatttct ggactcccca gatcagaagc tatgagctct gtaacaccac 38881 cagtaccccc ttgaacccaa aacagactag gggagagtta gggggcaggg agagaaccag 38941 ctgcagggaa caaagcagtt caggttatgg gagaaaaagc aagatcagct gaggaaagct 39001 agaagggcaa gtcgtcacaa aggggcaggg gggcagccca gggcaccaag gggaaaactg 39061 cccccctctc ttcatgacat ttgttagggc ttagggggaa cagaattgag tcagccacca 39121 ccccccatgc cagaacagac agggccctat tgtctcagcc aaaattcctt ctttcaagga 39181 agaggaggct cattgtccag ccctacaccc agctctggcc cacaaagctc aaaagcggca 39241 caacgaatgc ccaccctgac cctctgcccc ctcgtctagc ctgggggtgg caggcgcatt 39301 ccacccatga ggctgaggcc caaaccactg gagccctgag cttaaccccc cagtcttggg 39361 gactgggaga ggaagagaat tgtcttcagc aggaaagaac ccgcagagaa ccaggaaccc 39421 acaaagaatg ggcattgaga gagagcggaa acaccaaggg ggtccccacc ctagaccagg 39481 catctgggca cccaggcctc aggctccgcc cccaccctcc ttggggagcc aggtcccctc 39541 cacctggaaa tgagccaagt cacactgagg aaatggaact ttatttccat aaatacaggg 39601 ataacaccta ttcaaaggta gttaaaagag ggcctggggc ctcaaagaaa ctaggctctc 39661 ccaggggggt actccaacac tgatcatagg gactggggga tccccaaacc tgagatgggc 39721 ctcataggcc acagatattc cccaacactg acacttcaag aacggaactg tccccatagg 39781 ggagcctcag aaccccactc tcatgggtag tccctcttag gagttgggag ggctgatgtc 39841 aggggacttt agagaaaaaa gggaacatgg ggaggagaga agctaaaaat gtcctgagtg 39901 gcctggaagg agacccctgt ggtgggcagg gggtgggttc tccacccata cagccagata 39961 cggaggagca gcagcagcaa aagcagccac aagttaaaaa catggtttct cacttcccaa 40021 cttcggcctt gagagaaagg gacagcacgg agcaatcccc caaatgagag gacatgaggt 40081 aggggaggcc tggaattgtc attcatggag gagcagagga agggggttct gggaggccaa 40141 gtctctacta aaaccccgtc tctactaaaa atgggggata atatgggagc aatgaggtgg 40201 tcacaggcac accaaagcct gacatctgct ttccaaggcc accacttggt ctctggaccg 40261 aggagttcct ggggacccct gaatatatcc tcaggagagc caaggttcaa tgcaggtctc 40321 ataaagggta cggttggagt gccaggctgt gtgggagata ccggccattg gacacctcac 40381 tatggccccc cgggccaata gagtcttcaa cccaaaagaa tcccgcagat aaaccttcaa 40441 ggtggtcgaa ggggcgtgga agcatggaag agagacacaa ggagagacaa agtgagttac 40501 tgctgggatc ctggacctcc tccccacagg gtgaaccctt cagctcagga gtcacagaga 40561 gggctctgga ataaggtggg acagcggcta gaaggggaag taatcccagg gggctcacca 40621 gttgctcctc catctccagg acggtctcat ttgcatcata gaaaccaaag aagctacaaa 40681 gagatttggg gggaggttat cagaagagct ggagaaaatc tggccgggcg cagtggcaca 40741 cgcctgtaat tgcagcactt tgggaggcca aggagggcaa atcacctgag gccaggagtt 40801 caagaccagc ctgaccaaaa tggtgaaacc ccatctctac taaaaataca aaaattagct 40861 gggcatggtg gcagatgcct gcaagcccag ctacccaaga ggctgaggca ggataattac 40921 ttgaacccgg gaggtggagg ttgcattgag tcgagatcgc accactgcac ttcagcctgg 40981 gtgacagagc gggaccccat ctcaaaaagg aaaggaaagg aaaaaggaaa gaaaaaagaa 41041 aaggaaaaaa gaaaggaaga aatcaaggtg ggctaaggtc ccaaaggaac ccaaggccta 41101 ctggggagac aggtagcagg gaggacactc aaaactacct tactggatat aatgtacttc 41161 atgaggtgat acactgaaga tacgacctca cttctgtaga aaccccatca aaaatgcatt 41221 actggccggg cgaggtggct cacacctgta atcccagcac tttgggaggc caaggccggc 41281 ggatcacctg aggtcgggag ttcaagacca gcctggccaa cgtggtgaaa ccccatctct 41341 actaaaaata caaaattagc tgggcgtggt ggctcaagcc tgtaatccca gcactttggg 41401 aggccgagga gggtggatca tctgaggtca ggaattcgag accagcctgg ccaacacgga 41461 gaaaccctgt ctctactaaa aatacaaaat tagctgggcg tggtgggcgc ctgtaatccc 41521 agctactcag gaggcagagg caggagaatt gcttgattct gggacgcaaa ggttgcagtg 41581 agccgggatg gcgccactgc actccagcct ggcgacagag tgagactttg tctcaaaaaa 41641 aaaaaaaaaa gaggccaggt gtggtggctc atgcctgtaa tcccagcact ttgggagacc 41701 aaggagggtg gatcacctga ggtcaggagt tcaagaccag cctggccaac atggagaaac 41761 cccgtctcta ctaaaaatac aaaattagtt gggcatggtg gtgggcgcct ataatcccag 41821 ctactcagga ggctaagaca ggagaatcac ttgaacctgg caggcggagg ttgcagtggg 41881 ccgagatttg ccattgcact ccagcctggg caacaagagt gaaactccaa ctcaaacaaa 41941 caaacaaaca aaaagatact aaagagacgt aacaagatca tgcaactcaa gatcctgatt 42001 tggatcttcc actgtatatt tttttctgta aggacagttg gaaaaatttg aataatctgt 42061 gagcgcatat tcaggaaaat ttgaatctat gtttatattt aaatataaca ttaacgtata 42121 taaataaatg tatatatatt tagagaaaaa agatattaat gtaaacatga caaaatgtta 42181 acatttgcga aatctaggtg aggagtataa atgactgctt tttgctattt tggtaacttt 42241 tttttttttt tttgagacag ggtcttactc tgtcacccag gcaggagtgc aatggtgaga 42301 tctcggctca ctgcagcctt ggcctcctag gctcaagcaa ttctcgtacc tcagcctccc 42361 aagtagctgg gactacaagg gcacaccacc acgcccagct aatttttgta ttttaggtaa 42421 agacagggtt ttgccatgtt gcccaggctg gtctcaaacc cctgggctca tgcctcggcc 42481 tcccaaagtg ctaggattac aggcgcaaac ttttcttaag tatgaaatta tttcaaaata 42541 gaaaggtctt aaaatccttt ttttcttttt tttttgagac agagccttgc tctgtcaccc 42601 aggctggagt gcagtggcac cctgtcggct cattgcaacc tccgcctcct ggtttcaagt 42661 tctcctgcct cagcctcctg agtagctgga actacaggcg tgcgccacca ggcccactaa 42721 tttttgtatt tttagtagag atggggtttc tcaatgttag ccagctgttc tcgaactcct 42781 gacctcaggt gatccacccg cctcggcttc ccaaagtgct gggattatag gagtgagcca 42841 ccgcacccag ccagatggag ttaaaatctt ttaattaaaa aatattggcc aggcaaggcc 42901 gggtgcgtgg gctcacacct ataatcctag cactttggga ggccgaggcg ggtggatcac 42961 gaggtcagga gatcgaaacc atcctggcta acacagtgaa accccgtctc tactaaaaat 43021 acaaaaaaat tagccgggcg tggtggcggg tgcctgtagt cccagcgact caggaggctg 43081 aggcaggaga atggcgtgaa cctgggaggc agagcttgca gtgagccgag atcacgccac 43141 tgcactccag cctgggcgac cgagcgaaga ctccaactca aaaaatatat atctatctat 43201 atatagagag agatatatat tggccaggcg cagtggctca cgcctgtaat cccaacactt 43261 tgggaggccg aagcaggcgg atcacaaggt caggagatcg agaccatcct ggctaacaca 43321 gtgaaacccc gtctctacca gaaatactaa aaattagcca ggcatggtgg tgggcacctg 43381 tagtcccagc cacttgggag gtgaggcagg agaatggctt gaacccagga ggcggaggtt 43441 gcagtgagcc gagattgtgc tattacactc tagcctgggc gacaagaaca aaactctgtc 43501 tcaaaaaaaa aaaaaaggaa gaaacagtga cttggaacat taaaaatgtt atataaccat 43561 gagctatcac tgtcattcat agggttgtgg tagatgtgaa atgacatgat gtacataaaa 43621 ctcatcactt actattatat tattacaata ttttaagaga tgtcctgcct tcactgaaaa 43681 ctgtccagtg ccttcccatc tcactcagaa tttaaaaaaa aaaatcaaaa gcctggttac 43741 cgggactgtc gggaaatagg gatgaggata tatatatata tatatatatt tttttttttt 43801 ttttttttga gatggagttt cgctcttgtt gcccaggctg gagtgcaatg gcgcaatctc 43861 agctcactgc aacctctgcc tcccaggttc aagcgattct cctgcctcag cctcccaagt 43921 agctgggatt acatgcatgc atcaccacac ccagctagtt ttgtattttt agtacagaca 43981 gggtttctcc atgttggtcg ggctagtctc gaactcccga cctcaggtga tccactgcct 44041 cggcctccca aagtgctggg attacaggcg tgagccacca cacccagtct attttttaat 44101 gggtatacgg tttcagtttg gggaaaaaga agttctggag atggatggtg ctgatgggtg 44161 atggttttac aatgatgtga gtatacttaa tgccacaaaa ctgtacattt ttaaatggtt 44221 aaaatggcaa ttttatgtta tgtatatttt atcacaaaaa aaagaaaaaa aaatatcaag 44281 ggccttacct tgacctgcta aggtttgaca tgcctggtcc cctgctacca cttttagctc 44341 ctctcctgtc ctctccccag ctccttgtgc actagccatg ctggcctcct tattgctcac 44401 acgtgcttca gggcctctgc aggtgccaga ccttctccct ggggggttct tccacccaga 44461 gcacaactcc ctccttcact tccttcagtc tctaattgaa tggtgacttt tccaggagga 44521 cttcttcggc cactattgaa actaggcccc ggacatcctc taatcctttc ccctgcctta 44581 ttacctgaca tatatatttg tatgtatgta tcagctatct tacaaactag aatataagct 44641 acataacatt agggacttct cttttattta ccactgcatc cctagggccc agaacaagcc 44701 tgcgcccata atgttgaata aatatttgtt gagcaattca agtagctcag gtgacattac 44761 agatcacaca tggtgaccta taacacaggc aagcacatag taccatggag ccatggattt 44821 ttttctaagg aataggatgg aggggacaaa gctggaggct gttataatag tccaggtaag 44881 taaagaggtg gtaggaaatg tgataaaatg gtataaaact acacacacac attgaaccaa 44941 tgtgaatttc ctggttttga tactgtgcta taattacata ggatgcaacc actgggggaa 45001 gctgggtgaa gggcctcgct atactatctt tgcaatttcc tatgaatcta taagaatttc 45061 aaaattaaaa gtttttataa agtgggggag ggggtgatag ggatggagag gagaagagtc 45121 agcaggactt agtgactggc ttgatacaag gggttgggat atgactccca ggttttggtc 45181 ttgggtgaca tgatgaatgg ggggaggagc actaactgaa cagtggaagg agcaggccag 45241 tttaggcatg agataaagac caactggggt tgggggatgt ctttagccaa tcttcaggcc 45301 acaaaatccc ttattacctg gactgccagg gagtaataac accatcatca gggcccccaa 45361 tcagcaccag gtggcccaca cgcagaaagt tcttccgcca tactgtagga tgggaagaag 45421 agaggctgag tcagccacag gggtcaggcc aggttggaga gggagacata gggagtcaaa 45481 gaagcagaaa aagcaacaca ggtaggagcc tgaattctca cctgtggcat tgggatggtc 45541 tctttcccca ttgatcaggg ccaggaagct gctggcattg aggtacaagt catcgtggtg 45601 gggatctggg tacaaacagc agttagaaaa agaagcagaa aagggaggca agactggtgg 45661 tgaaaagccc aataaggaag ctgcagcaat agtcaaggtg gatagtaata aaatagtagt 45721 gataatgtag gccaggtgcg gtggctcaca cctgtaatcc cagcactttg ggaggtagag 45781 gcaggtggat cacttgaagt cgggagtttg agaccagcct ggctaacatg gcgaaaccct 45841 gtctctacta aaaatacaaa agttagccag gcgtggtggt gcatgcctgt aatcccagtt 45901 actcggggcg ctggaatcac ttgaacccag gaggtggagg ttacagtgag ccgagactgc 45961 accactgcac tccatcctgg gagacacagt gagactccat ctcaaaataa taatagtagt 46021 gataacaacc gtaaacatag taacaagtac tttttttttt ttagatgaaa tctcactccg 46081 tcacccaggc tgaagtgcag tggcaggatc tccgctcact gcaacatctg cctcccgggt 46141 tcaagcaatt ctcctgactc agcatcctga gtagctggga ttacaagcgt gtgcccacat 46201 tcagctaaat tttttttgta tttttagtag agatggggtt tcaccatgtt ggccaggctg 46261 gtctcgaacc cgacctcagg tgatgcgccc acctccccct gccaagatat tgggattaca 46321 ggtgtgagcc accacacctg gcaatagcaa gtacttctat ctagtatcta ctatatgagc 46381 caggtactat tcaaagtaca ttgcattcat ttatttattt aatccttaca accacctggt 46441 gaagtacatg ctataatatt ttacagataa ggaaaactga gtaacagaat ggttaagtaa 46501 cttgcccaaa ggcacccaat agggtcaaga ttcaaaccca agtattctgg ccccatggtc 46561 tggtctagag gttggcaaac tgtgaccaaa ggaccaaatc cagcctgcta cttgtttttg 46621 taaatgaagt tttactggaa cacaggcaca ttcattcaca tatggtacat ggctgctttc 46681 acactacaac agcagaattg aggagttgtg aaacagacta tatgtcctac aaagtcaaaa 46741 atatttactc tctggccctt tatagacaag gtttgctgac tcccacatca gactaaacct 46801 tctaaggcaa taaggtgaca cacttaaaac attctgggcc aggtgcaatg gttcacacct 46861 gtaatcccag cactttggga ggccgaggcg agtggatcac ctgaggtcag gagttcgaga 46921 ccagcctggc caacatggcg aaactccatc tctacttaaa atacaaaact tagccgggca 46981 tgatggcgcc tgcctgtaat cccagctact agggggactg aggcaggagg atcacttgaa 47041 cctgggaggc ggaggttgca gtgagccgag atggtgcact gcactccagc ctgggcaaca 47101 gaacaagact ccgtctcaaa aaaaaaaaaa attctgaaca gagcctgttc aaataactca 47161 ataaatgtaa gttatcttta ttgtcatcac tgctattggt tgtagcagag gtggaagcaa 47221 ctgacctgat ccatggaagc cccagttcag catccccact caccatgcca gtagttgcag 47281 atggagaatt cctggcccca ggggctatag cagatccgat agaggttaga ccgcatggag 47341 gtggggaaca gccacttcaa gtagtccgtg tctggcaagc gaatggcaat gctaagtgac 47401 cataaacctc tgttccccca aaactcaggg cattctatgg agtctagtgc ccactcacct 47461 ccatactgtc ccatctgtgg agaggagagg gagatgaaag aatccacgtt gtgatcatcc 47521 atgacagaaa gcagagcccg gcacacaagg ccccctgtaa gcagaacacc acattgggca 47581 ggcacttaag gacagcaaag ccagcagcac cccccacccc caccacacac acacacacag 47641 agacacacac agacatacac tgtacacaaa ggagactgag gcttacagaa ggcaggcaca 47701 ccctgtgcca aaaggtcaca catcatataa gtgactgagc ccgaattaga tgctgggcct 47761 cctgcctcta gttctaggat gtttttacct gccccattag cccttatgtc caagaaccat 47821 gggtaacagg aagcaaaagg gcagcagtgt agggagcctc cctcccattc aagacagaga 47881 tgagacccag gtgctgaggg aagtcagaaa ggaagggctc aggtaccagg gttgtgcctg 47941 gagcaatgtg gttcagaaaa agggacgcta ggaagtgtcc ctcagataag gatcaagcct 48001 cagatagggc ttaggagtta ggggcagggg agtcgcctac cctgcgagta gcagatgaga 48061 tgcacccctt gaggggcctt tgccatgatg gggaccacag cctctcggaa cccttgcacc 48121 tgttcccaca ggggtcgcaa gctctctctc ccatcgaaga gatcgagcac tgtcaccaca 48181 gtcccggggt gtgtctgtgg gaagggggca atgcagccac cggggatagg ctaagaagct 48241 cccacacgcc accccctggc ccgcgtccac aggtctattg taccctgcta gaaccaggga 48301 tcccgtcccc caactctccc ccacgccagc accagctccc tgaggaactg ggcaggccca 48361 gaggggtggc tttcagttcc ccgctctccc tcccctgcca cagtagacgc ctctaacgcc 48421 ctgcacccag gtgtcccctg ccagacctca ttgatgtatt ccagcaggtg gcggaagctg 48481 tacgagctgt cgaagagccc atgcaccacg atgaccggct tgtaggacgc gcggtggggc 48541 gcgggggctg caagcagcag cagcggcagg aaaggcaaca gaagcaggac ccacgccgcg 48601 gggagccgct gcccccagag ccccagcatg ctcccgcctg agaaaggggt gataagggca 48661 tgtgagggag atggcaacac cctcctccct cggaacagac gcccggcaat caagccacct 48721 cctctcgctc ccacaccagg tccgtgtcac aaaatagcct acttttaact tactccagcc 48781 ctctccctac aaacacaccc ccccaccacg ttgacgcacc aacgcgcacc cgaagtcccg 48841 cctccaactc agcgttcggg ggacttgttc tctaggtcca ggatcttcct aatgcatcgc 48901 ctcagccatg aacaacgcgg agttctttaa tacccgtgag cagcagccca ggccccttga 48961 agagtgcaga ctcccacctg gcctgggtcc gtaggcctcg ctccacccgc tgtttactta 49021 tcccaagtct ggaacccacg gtggcggggg gaagggtgag gaaagagaac gcaggggaat 49081 gacggatggg agggggaagg gcgggctgct tggggtcgcg caaatccgtc acgtccgggg 49141 ctttctctgg caaccgcgcg agcgttcccc gcaacacaga cccaggacag gaggggcaat 49201 ggaatattcc attgcgccct aggtgctggg gaggaaacag gcggagcgat ccatttaggc 49261 cagtggggag aggaaaaagc agcaaacata ttctgggaat ggaaagaagg cctctccagg 49321 cttcgttgcc cccagcgacc cgaaagtccg atttccccgc cttgattctc cccacttccc 49381 aatacaggcg tctggctccg cagcagaaca cgaagtttgc attccccaag gggcggccag 49441 ggggcggacc agggaaaggt agtcctctgc attttgccgt gtgctgggtg agtggcaggg 49501 tggcctggag ggctgatgcc agcccgggcg tgcccctcaa cacccacccc accccaccct 49561 ccagtccgcc ccaggtcagc gacttacaac tcttcattct gaagtgcgtg tagtgccctt 49621 gtctccagag acgcagagag tcctcgaggc cccttgagct aagtgcagcc tggcccagtt 49681 tctcctgccc tactctactc ccctccctat aagcgaccca ccctcaaggg gcggagggcg 49741 cgtagggatg cgctgactca tgcccgcgta atttcgacca gtctttcaac ctgaacgacc 49801 cccagaatct ggctgtctga gttatctgtg ggtggctccc taccgaaacc cccaggcgcc 49861 ccacctctcc gcctgtgacc cctaaccgac accctagtgc ctcagggctt ttacttgcta 49921 gggcttttac ttagcattta aagacgtttc tctagagata aggatttctc agcatgtctg 49981 agcccctctc tctgtttaca aggtcattgc ttggtctaaa tttgtctcaa tcaaaacatt 50041 tttgcgctca gaatggtgta gagtgacgat gaggtggcgg taaggggttg agcgctcacg 50101 cgaacgccta agtgaccaga acgactggtg tgaagccgtg atctgactct gtggagcctg 50161 ggactggttt cagcgagagc ctctgtactg ctctgtagtc tctgctagga catggacgaa 50221 aagggacgca gccgggagag cgactgcccc aggtgggggc tggggggacg taagggaaga 50281 ataatgatta tctggctgtg tttttgttta aaaaaaaaaa aaaactggtg taaatctcca 50341 tagacttcct atcctcgctc ccttcaccca cccacgcgct cccaagatgc aataagcaaa 50401 taaaaagact aataacgctg tactgcaggt cgactcagga gctggagatg cgtcttggtg 50461 ggaggtggtg ggtgggagta ggggggttta gggaatggat ctgaacattg accagcccag 50521 agactctaag cagcctcaaa agcaaaggga aagtggggga acaagcacac attaccaaca 50581 gagctgccag aaatgagcag ttaagtcata ccccctaccc cctccaaaag agcttcagct 50641 ccttagtcct ggacgaagaa ggtatgtatg cacaccgccc aaactctctc tcccttttcc 50701 ctccaaggcc cagctccccc tgatttacag acctgggcct ccctctttac tgctaggttg 50761 gtaggttcac caaaccctgg gaacttttca gaccatcgca gttctgaact ctgcacaatt 50821 ctctctcaca cacaagtatt tatctcttct aaagaggagg aaactggggc caaggatttg 50881 ggagaagacc ctggcacctt gcagggagct aagaggggga gacgacctgc ctctggaggc 50941 acctgggtta ttaactccac tgagaacctg ttcacttcct cccacaatac aatcactgag 51001 tcttggtggg ggagactcca gagagttctc cttccttcct ttccttagtc ccacccgcct 51061 cgcctggtct agcctccgtg tttccatgac aactccaaag gagcccaaac tgggggcttg 51121 aatgccgggg taagaggagg agagaggtgg tccgagagca gagagagacc gagtgggaaa 51181 catctgaagc gctccccctc cctcgcctcg gtccctttaa gctccccccc tccccgctct 51241 ccctccgccc gccccccccg cccccccccc ccgccgctgc cttcatctct ccatctctgc 51301 gctgctgccg ctgcgccatc cagcacccag actccagcac cggccgagga cccccactcc 51361 ggctgcaggg accctgtccc agcgagaccg caggcatgtc atccgaaaag tcaggtaaaa 51421 acaataacaa aacctcccac cccctccact gtctccagac tctccgtccc ccttgcccca 51481 accccctccc ttacccctcc tcagctgtgg ttctatttca ttccccttct ctccagctct 51541 caacactccc ccagtccccc tcctctttct gtctccccct ttctcttcct ttcctctttc 51601 cagtggcagc ctctgcccct tgccaacaac atggtcaggg gggtaggttg agagggtgaa 51661 ggaggtacag ccaggttttg cagggatggc atcattggga gtgacagatg gacaatcact 51721 ggctggcatg gagacatcct gtgaggaaat atggagacat gaccagatgg gggttgtcaa 51781 gggagcaaaa tccagagggc tcttcttaat ctgccctaaa agaggtcccg agattctcac 51841 agaggctggg gcactcctcc ccccactgaa ggaacagcag agtggaacac atgtcatccc 51901 acatgtgttt atacaactgt tgaattgagc acatattaac acagggttgc atgtctacgc 51961 atacgcacac acaggactag ctcggatagg ccagcccaaa ggcagctata gcaaaggaga 52021 ggggattagg tctgcaggtg agagctgggt gcatggtgat gaaaaagaca gaaaagaagc 52081 agaccagagt tgtgacctca aaactagatt ggaaggaaga aggagggggg cagatggcct 52141 agatacagcc cctctcttgc ccctcaaatt agagatggtt tctcacccgt ctctctctat 52201 gtgtctctcc cattatcttt ctccatccct gaccggctgt gtttcccctt accccctcct 52261 caactcatca ctgtgtcatc tttcctctta tactctcctc cactcacctc ccccaggact 52321 cccagactca gtccctcaca cttctccgcc gccctacaat gcccctcagc ctccagccga 52381 acccccagcc ccaccgccac aggcagcccc ttcctcacac catcaccacc accaccacta 52441 ccatcagtct ggcaccgcca ccctcccgcg cttaggggca gggggcctgg cctcttccgc 52501 ggccaccgct cagcgcggtc cctcctcctc tgccacgctg ccgaggcccc cccaccacgc 52561 ccctcccggc cctgctgccg gggcaccccc acccggctgc gctaccttgc cccgcatgcc 52621 acccgaccct tacctgcagg agactcgctt cgagggccca cttcccccgc cgccgcccgc 52681 tgccgccgcc ccgcccccgc cggcgccagc ccagactgcc caggcccctg gcttcgtggt 52741 gcccacgcac gcggggactg tgggcacgct gccgctgggg ggctacgtag cgcccggata 52801 ccccctgcag ctgcagcctt gcactgctta cgtgccggtc tacccggtgg gcacggtgag 52861 tgccgggcag acagggacat gggaaagagg gggacgcgat acaggacttg aaattgggga 52921 tacgctgggg gctggtagga tagaggaaca agggcaggga acaggtagtg ttcccgggac 52981 aagccctaga aagaagggag cctgagacag gaaggactag ggagagacac gggagtagga 53041 gtctactggt gcccagagtc agggcctggg agggggatcg gagcctagag gttcagagga 53101 ggtctgaaag taggaaaccg cctggcgggg gacgggggga atggaagctg ggaaccaaga 53161 gggatgtggg agaagcctgg gactaagggg gtaggggagg cctggtaggt gtctggaggg 53221 aagaaagaag gtctgacctg aggccaggac agccccagtg ggaccatacc ttgcgggaga 53281 gaatgtagaa agcccaagaa tatggtggtt aatgaagcaa ggaagggagg agaggggctt 53341 aggtggaatt tatgggtgtc ctggaagggt aatgggtgct ttattttgag aagccatagg 53401 taaaaattgt gcttttaaag ccactctgcc agccgcccaa cgctggtagg ctgggagagg 53461 gtcagagtga tgcccctgcc ccccaaattt ccttccacag ccatatgcag gcgggacccc 53521 ggggggaaca ggagtgacct ccactctccc cccgccgccc cagggcccag ggctggccct 53581 actggagccg aggcgcccgc cacacgacta catgcccatc gcggtgctga ccaccatctg 53641 ttgcttctgg cctactggca tcattgccat cttcaaggcc gtgcaggtag ggggcagggg 53701 catactcggt ttgggggcgg ggacagggag ttctgggcgt tctggggacc atcttagaga 53761 aaggctgagg cgttcgaacg aggccgcagc tctttgacct ccttccccca cccctcctcc 53821 gtaggtgcgc acggccttgg cccgcggaga catggtgtcg gccgagatcg cttcacgcga 53881 ggcccggaac ttctccttca tctccctggc cgtgggcatc gcggccatgg tgctctgtac 53941 catcctcacc gtagtcatca tcatcgccgc gcagcaccac gagaactact gggatcccta 54001 aaaacgcccc tggtccggcc ccactctgcg cccctcgatc tcccaggctc tttctgcagt 54061 cataccgcgg acccaatggg cgccctgcac acccgtttct ggggccgtca gacttggata 54121 catcgtaaac tccgcctcca cggaacgtct cgccttgcga gcaagctcgg aatccagttc 54181 ctcaggaacc cctccaaaac ccacaccccc agggacgccg ctttccggga tcccggccaa 54241 acgccggacc ctcagtcgct ccaggccccc tcaccctcaa agtgtagcgc ccccaaccga 54301 gcaacctcgg tttggtccct aaaaccccgc ctcctctata agcaccgccc cagctctgac 54361 aaaaccccgc ctccaggtcg gcaggctccg ccttcttttc ttctccgcgg ggtgattcag 54421 tccagtgatt gggtttgtgg ctccaggcct cgcccacaga cggacagacc cctccctttc 54481 ttccggcaaa aggaccgagc cctggggtag taaggccccc acactcctgt tttttgcaag 54541 tacatttttg tccctcctcc acccaggtat ctgcctattt tcttgctaat cccagaacct 54601 ttccttttgc tttttttaag gacatttggg aagttcctgg tgtaggaccc ttctccctgg 54661 gataagaaac ctgcctgtaa acgctctgta aatactccct tccacccatc ccagcccctg 54721 ggcagccggg cagaagggaa tccaggctat ggacctccca agtccccgct ccccgctccc 54781 ctcggcggcc ccgccttgtt ctgatctgtg tgtgagtgtg tgtgaacttc tgaaagacaa 54841 tattaaagag acttagttga tttatcgccc gcaattccaa agactgcggc cccgcaaaga 54901 ccctccccac ttttgattcc gcctttcact tcccttcatc tcctcttcca aggaaaaaaa 54961 aagaaaaccc gacagagact aacgtgaggg acacagattc ccagatcgcc agagagacac 55021 gtgaatatgg gggacggagg ggagctccct gggaatccac caaggaagac cttggggtcc 55081 attctcagtg aggctttatt ttcttagtga ggctttattt ccccagtacc ccttttccat 55141 tccctactat ccccagaact ccaggaagac aagagaacag agagggcaga ccatggtgaa 55201 gaagctggtc aaggatagag tgatgggggc caagaagaga tgccagctgc ccatagctgt 55261 tcctgactgt gggctggagg gtggcagata acttggatta gagccccaca tgctggactg 55321 tagggggtat aggaaaggca aagagagcag attgctgtgg gagcccaggg aggaggtcaa 55381 tggcctctca agtctccctg ggactagttg ccctctccta tcctgaggtc aaccaatagg 55441 cctcttttct gaggggagtg gtgattaggg gatgctgcca gcagtgggct tgggtctttg 55501 gttgtacacc cacaggacag ggtcctaacc taatttatgc atttatgcaa catgcagact 55561 ccatgctgga tgctggtgag acatccaccc agacagcagg tgtgacccct gacctcatgg 55621 agcttacaat ctagaaggga agccatgcac tgagatagga aatgtgatag gagatgtggg 55681 aggagggtgg ggtgaggtgt gaatatcatg ggttcgccct tggatgtttc gcatttaatg 55741 tgcctacctt accaaacttc ttgaaaaaca gaattcacgg ctgggcacgg tggctaacac 55801 ctgtaattcc agcactttgg gaggccaagg caggcggatc acttgaggtc aggagttcgg 55861 gaccagcctg gccaacatgg tgaaacccca tctgtactaa aaataataaa ttagccgggc 55921 atggtgacgg gcacctgcag tcccagctac ttggaaggct gaggcaggag aatcgcttga 55981 acttgggagg cggaggttgc agtgagccaa gatcaagtca ttgcactcca gcctgggcaa 56041 caagagcgaa actccatctc aaaaaaaaag gggcaggggg gtgcggaata ttgctatcag 56101 agatatgtct gtatctgtct ttttttcata cccactattt cttgacttat tggaatatgc 56161 atccacaccc tattctcctg aaacctccct ctcaaatggc aaccaattac ccagatccca 56221 aatcaatggc attttcacag tattctgcct ctctaagccc tcccatgttt gtcccttgac 56281 cctgaattcc ccacataccc aggttcaaga acgtattacc tcccttcccc cgacacacac 56341 acagacctct aaccattccg cctctgcctt ccctccttgc tttttctcct cccttcccct 56401 aagtgtccat ctctgatttt ttttccttct ctgagacttt cctttgaaaa gctcattcac 56461 tcaactcagc taatcaaatg cttttccaac ctgtatttta catgcagacg ttttttccag 56521 tgcctgaaga tcatagccac atggaagatg cttgctggtc ctctaaaatc aacacattcc 56581 aaaacaaact gatgaaatgc cacagaaaac ctgcttccct gtctctgtta atagaccatt 56641 ttcccaatca ctcttcctta tctagaaatt tgccgcctcc tccctctccc tcctgcctta 56701 tttttggtgt gctctcaagc ctagtcagtt ctgccaccac aatgtctttt gaatctgtaa 56761 tttttcccca gcccttgctg gaaaaattac ctcattagga ttattgcatc agctttccca 56821 acaggtctcc ctacttccta cttctcccta cctcaagtct ccctccacac tattgtcaga 56881 tcagcctccc taaacacact ttcataactc cactgctcaa aagcccttca aggttcccta 56941 ttgcctactc aattatttac aaaattttaa tctggccttt aacaccttcc ataatttaaa 57001 atcatccagc ttctctgttc attctaattt ctaaccatac tgaattactc atcattccca 57061 cctccaggta tttgcccagc tgttcccaaa tcttagacct ccttcccaaa tctctgccta 57121 ttataaacca aattgtttgc agcttgtcac aaatttcaga ggctttagac cacatctagg 57181 ctcttgctca tgatgttctc cccctcttga catctgcctc tctacttggt caaattttat 57241 ttacaatatc tttcaaagtc cggatcaatt catccctcac tgtagcaaaa aggtgtctct 57301 ctttcctctg ggttagatta aagcagtggg tttccaaaca gagcttcagg cattccttgg 57361 agcaacggtt cttaccccag gttacacatt agaatccact ggggagcact gaaaaatctg 57421 aaagcccatg tcgcatccta aaacaattaa atcagaatct ctgggctcag tgccagtggc 57481 tcacacctgt aatctcagca ctctgggaag tcgaggcagg aggatcgttt aagcccagga 57541 gttcgagact agcctgggca atatggtgag accccatctc tacaagaaag tttaaaaact 57601 agcctaggcc agatcgagac catcctggct aacatggtga aaccgcgtct ctactaaaaa 57661 tacaaaaaaa ttagctgggt gtggtggtag gtgcctgtag tcccagctac tcaggaggct 57721 gaggcaggag aacggcatga acccgggagg tggagcttgc agtgagccga gatcgtgcca 57781 tgcactccag cctgggcaac agagcgagac tccgtctcaa aaaaaaaaaa aaaaaaaaaa 57841 ttagcctagg ccgagcgcgg tggctcacac atataatctc agcactttgg gaggccgagg 57901 tgggtggatc acctgaggtc aggcgttcaa gaccagcctg gccaacgtgg tgaaaccccg 57961 tctccactaa aactgcaaaa atcagccggg tatggtggca catgcctgta atcccagcta 58021 ctcaggaggc tgaggcagaa gaatcgcttg aacctaggag gcggaggttg cagtgagccc 58081 agattgcacc actgcactcc agcctgggtg gcagagaggc actcagtctc aaaaagaaaa 58141 agaaaaaaaa aatttagccg agccccatgg ctcacctgta gtcttagcta cttgggaggg 58201 tgagacggga ggattgcttg ggcctgaggg gcagaggctt cagtgatcag aacacggtgc 58261 tccagcctgg gcaacagagt gagaccctat ctcaaaacaa acaaaaaaga atctctgggg 58321 gtggaactct ggcatcagta tttaagacat aaccaggtga ttccaaaggg cagccaaggt 58381 tgggaatcac aggtttacag acactttaag gaccacccag ggagaatgga aaggccaaaa 58441 aatgacttca acccgggcaa ttttctttag gagaaaagta tctagatcta agattcatgt 58501 tatctgcata ttttctcatc tcttcccacc tgatatcttg acatacacag tttaccttgt 58561 gagctacttg aaggcaagag tcaaatctgg cttatctctg tttttcagat cctcatcaga 58621 taaaacctcc tctaccaatc tttgaactca attgaattga aaggaatcag gcaaagggag 58681 agatgaataa aaaatgactt agctttcttt tgcttttcct taggaatgtt tgtttcttaa 58741 tctcagatct cttaacctca cccagactcc tccctcatga ggaaataaaa tgttacaatg 58801 tgtggccggg cgcggtggct cactcctgta atcctagcac tttgggaggc cgagatgggc 58861 ggatcacgag gtcaggagat caggaccatc ctggctaaca cggtgaaacc ccatttccac 58921 taaaaataca aaaaattagc tgggtgtggt ggcaggcacc tgtagtccca gccacttggg 58981 aggctgaggc aagagaatgg tgtgaatccc ggaggcagag cttgcagtga gccgagattg 59041 tgccactgca ctccagcctg ggcaacaggg cgagactccg tctcaaaaaa aaaaaaagtt 59101 acaatgtgat cctcttccta tctgactcac ctccctctgc aggtgatacc cttggctgtg 59161 gcacctctac tgacagagac tccatcctct aggagaggct ctgccctccc ataacatctg 59221 tatgggtgct ctcatacaga tgggggaaaa ccatgattag attctgggtc atcattcccc 59281 ctctacaatg gaagattcag cacctgatca ctctacttcc accgctcatt tcattttctt 59341 ctttttatta tttatttatt tatttttgag atggagtttt gctcttctcg cccaggctgg 59401 agtgcagtgg cgcaatctcg tctcactgca acctccgcct cctgagttca atagattctc 59461 ctgcctcagc cttagtaggt gtgattacag gcatctgcca ccacgcccag ctaatttttg 59521 tatttttact agagacaggg tttcaccatg ttggccaggc tggtcttgaa ctcctgatct 59581 taggttatct gcctacctcg gcctcccaaa gtgctgggat tacacacgtg agccaccgag 59641 cccagcctct ttttcttttt tctttttttt ctgaggcagg gtctcgctct gtcacccagg 59701 ctggagtgca gcagcaggat catagctcac tgagcttcga tctcccggtc tcaagtgatc 59761 ctcccagcta atttttttta tttttatttt attttttttt tttgagatgg agtctggctc 59821 tgtcgcccag actggagtgc agtggcacaa tctcagctca ctgcaacctt gcctcctgga 59881 ttcaagcaat tctctgcctc agcctctgag tagctaggat tacaggcgcc tgccaccacg 59941 cccggataaa ttttgggttt tttttttttt ttttttgaga cagagtctca ccctgtcgcc 60001 tagcctggag tgcagtggtg cgatctcggc tcactgcaag ctctgcctcc cgggttcatg 60061 ccattctccc gcctcccacc tcagcctccc aagtagctgg gactacaggc acccgccacc 60121 atgcccagtt aattttgttt ttgtattttt agtagagacg gggtttcacc gtgttagcca 60181 ggatggtctc aatctcctga cctggtgatc cgccctcctc accctcccaa agtgctagga 60241 ttacaggctt gaggcaccgc ccctggcctg tttttttgtt cgtttgtttt tgtttttttt 60301 ggacggaatt ttgctcttgt tgcccaagct ggagtgcaat ggtgccatct cagctcactg 60361 caacctctgc ctcccaggtt caagcgattc tcctgcctca gcctcctgag tagctcggat 60421 taccggcgtg tgccaccatg cccggctaat tttttgtatt tttagtagaa atggggtttc 60481 accatattgg tcaggctggt ctcaaactct tgacctcgtg atccacccgc ctcggccttc 60541 caaagtgctg ggattacagg cataaaccac tgcgcctggc ctctggcctt tgatatttaa 60601 tagagatgag gtcacactgt gttgcccagg acagtctcga actcctgaat tcacacaatc 60661 tgcctgcctc agcctcccaa agtgccggga ttataggcat gagctacagt gcctggcccc 60721 gttttattat ttttgttttc aatgttcatg cagattctct agcaccctgc ccttctcttg 60781 cagtgattta tttctccctg cttccctctc gccctatgtc ataccttcaa cttggttgcc 60841 accaatacct actgcccttc taaaatctca tcctcagtct tttcatgttg ctccccagta 60901 cccctatccc aaaaattctt caaccccatc tgggcctccc atccactaac ttacttttca 60961 ttgtttgatc tcaattcttc actttcctca ttacccagtt tagattccaa ggttcaacac 61021 ggtaatcgcc tctcagccct caccctgtac tcctggcccc ccttcctctg gcacatttta 61081 tccctggtca aatcccactc tgcctgctcc acacctgcat tcaacagctg gggagaaaca 61141 cacagtcata ctgactggtc ttgcttcctg cgggccctca gtgtggcatg ccagctcgcc 61201 tttccctact cagttcacat tccccaaatc tgagaccaca acttcacaag atgatgactt 61261 ttcttcctat ttcagtgaga aaacagaaac tttcagaagg gaacttcctc atgttcccac 61321 cccaaattca ccagtctacc tgcaactcta ccgcagagaa gctgacatgc acccacctgc 61381 cacctcatct gtaccccggg atcctgttcc ctctcacctg ctcagagatg ttattcctga 61441 aattatcccc tatctctctc tctctcactt aatcagttta cccctctgta ctgcaacact 61501 cccatcatta tgcaaacatg ctataaaatt atacttcatc tctgaaaaga aagaaaaaaa 61561 ctgtttcgct actcacctag gtagtaaaag cacaacattc agagaatggt gcttaccctt 61621 tggcagaaaa gcaatctgga aagggaaatt cagggagcct taactatact ggggaagatt 61681 ttacctcttg tttttgcttt cttttttttt tttttttttt tttgagatgg agtctccctc 61741 tgtcacccag gctggagtgc aatggcacga tctcagctca ctgcaacctc cgcctcccgg 61801 gttcaagcga ttctcctgcc tcagcctccc aagtagctgg gattataggc acgcaccacc 61861 acacctggct aatgttttag tagagacagg atttcaccat gttagccagg ctggtctcca 61921 actcctgacc tcaggtgatc cgcccacctc agcctcccaa agtgctggga ttacaggtgt 61981 gagccacctt gccctgatgg gaagatttta cttcataagc tgcctgatgt tcacaaaatt 62041 attctttgta atacatatat atttatgcat ataaattata attttatatg tataaaataa 62101 tgcatttatt tttaaattta aaaactcttc cctgacccag ctcactccct cagtgttcca 62161 cttctctctc cacttaacaa aacttctcac tgcctctact ttctcacctc tcattcactc 62221 ttgaatttct ctcaatcagg ctctcatccc caccacttta cttaaaccac atgtcagagt 62281 cacaaatgac ctccaagttt ctaaacccaa tggtcaaggg tcagtttgtg gcaggccaat 62341 tctccctgac agtcacacag acaggcctgc atagcacccc agttacacag acagatttcc 62401 acagcattgc cttaacattg agcaaatagt taaacctagg ggaattggtg tacagacatc 62461 aaagctagaa atgaaacaca tggtgagtaa gagccttgca tgggcttctc ccttgctgga 62521 gcaagtcaaa ataacagaga cagccttaca ttcctagtgc caggactcgt ctcgggtcga 62581 cgatatctga gacaagtcaa ggtaacagag gcagctgttt gaatagattc actggacaat 62641 ctaaggcagc tctccgcacc aagctgtaaa ggagataaga tagaaataat cactctggta 62701 ccacagtaaa caggccttga aggtactggg gccctcacag cttaatcaga cttagcaaga 62761 atttttttgc ctctgaccct ctagttgaaa caaaattagt tactgataga ctttggtgaa 62821 tgccatactg catgtaggca tataacctaa acctgtataa acactaagaa aatagtaaca 62881 ctggccgggt gtggtggctc acacctgtaa ttctagcact ttgggaggcc gaggtgggtg 62941 gatc // LOCUS HSMHCHHS 2917 bp DNA PRI 17-MAR-1993 DEFINITION H.sapiens Mahlavu hepatocellular carcinoma hhc(M) DNA. ACCESSION X55777 NID g288143 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2917) AUTHORS Yang,S.S., Zhang,K., Vieira,W., Taub,J.V., Zeilstra-Ryalls,J.H. and Somerville,R.L. TITLE A human hepatocellular carcinoma 3.0-kilobase DNA sequence transforms both rat liver cells and NIH3T3 fibroblasts and encodes a 52-kilodalton protein JOURNAL Cancer Res. 50, 5658-5667 (1990) FEATURES Location/Qualifiers source 1..2917 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 79..1482 /note="52kD protein" /codon_start=1 /db_xref="PID:g288144" /db_xref="SWISS-PROT:Q05877" /translation="MLPFTCGRNANENSPRDVDVGVAPAAEGNVQHVEGSTAKAGLSS RSGGGGSLSHLFCECSSKPCLKHVEKLSELPPGHMQMDTLIIKLSGRLRNKTKMEVPP NQWKFFPFSFLWHSLALTQGSPHSRSRHQGTGGELWGTLQAYSVNGLAAATGATMEPA GTHNTEGRDLASNQISCDSREGGVKATGLFLSTSSHVMTPEGRRGRKCEHRDIMSRSL LTRCPKEESQVTTQHQRNCRVMRNFGKQSIVLSVKPLAHSRAGHAWMVTLDGIDYEEP GEGIYLHRDVRVTCIPKHHEALKTELMWKPQPLQVALHLQHKPNHINCCKTKLQHSPY HLNKTQSLTTFKTPRTQSKITSTKNQENLNEQGKWQSVAASAEMTMRVGIINIFKVII ISILQQVMANTLEINGKIRRLREKVECTKNDQVGIAPLETNHQDKAVSGWANRRMEMK RERVVMAVVQFEQHKRH" CDS 1552..2142 /note="put. ORF" /codon_start=1 /db_xref="PID:g288145" /translation="MYHLRSGVQDYPGQHGKIPSLLKIQELAGHGGRCLQSQLLRRLR QENHLNSGGRGCSEPKSHLCIPAWVTEGDSVSKQNKTKNEQHLRNNTKKSNSCIIGGP EGEEKEWSTEMRSEELMTDNVSILKKDINLKIIDSKAQLNSNRINTDADILSLNCEIN WFCHKPALSLWEKRDQKYTRKEGNTEYYGHGKEVSV" BASE COUNT 987 a 594 c 666 g 670 t ORIGIN 1 aagcttaata gaaaatatga gcaacataca caaacattag caacaatgat ataaaatacc 61 acttaaacat aaggaaaaat gttgcccttc acttgtggaa gaaatgcaaa tgaaaacagc 121 cctagggatg ttgacgttgg ggtggcacct gctgcagagg gtaacgtgca gcatgtcgag 181 ggcagcactg ccaaggctgg tttgagctca aggtcaggtg gaggaggtag tctctcccat 241 ctcttctgcg agtgcagctc taaaccctgc ctgaaacacg tggagaagct atctgagctg 301 cctccaggac acatgcaaat ggacactctg atcataaaat tatcaggaag attgagaaat 361 aagacaaaaa tggaggtgcc accaaaccag tggaaatttt tccccttttc attcctctgg 421 cattccctgg ccttgactca aggcagccca cactctagga gcagacacca gggcacaggt 481 ggggagctct gggggaccct ccaggcttac tcagtgaatg ggttagcagc agccacagga 541 gccaccatgg agcctgcagg gacccacaac actgagggca gggatcttgc ctctaatcag 601 ataagctgtg attcccgaga gggtggggta aaggccacgg gtctttttct ctccacatct 661 tcccacgtca tgaccccaga gggtcgaaga gggagaaagt gtgagcaccg tgacataatg 721 agccgcagcc ttctgactag atgccccaaa gaagaatccc aggtgaccac acagcatcag 781 agaaactgca gggtaatgag gaactttgga aagcaatcca tcgtgttgtc agtaaaacct 841 ctggctcact cccgagctgg gcatgcatgg atggtgaccc tcgatggaat agactatgag 901 gaaccaggtg aggggatcta cctccaccga gacgtgagag tgacctgcat acccaaacac 961 catgaggctt taaagactga gctgatgtgg aagccacagc ctctgcaggt tgctctgcac 1021 ttgcaacata agcccaacca catcaattgc tgcaaaacaa aactacagca ttctccatac 1081 cacttaaata agacacagag tctcacaaca ttcaaaacgc ccaggacaca atccaaaatt 1141 acttctacaa aaaatcagga aaatctcaat gagcaaggaa aatggcaatc agtagctgcc 1201 agtgctgaga tgacaatgag ggttggaatc atcaacatct ttaaagtaat tatcataagc 1261 attctccagc aagtaatggc aaacactctt gagataaatg gaaagataag aaggctcagg 1321 gagaaagtgg aatgtacaaa gaatgaccaa gtgggaattg caccactgga aacaaatcac 1381 caggataaag cagtctctgg ctgggccaac aggagaatgg aaatgaaaag ggaaagagtt 1441 gttatggcag ttgtccaatt tgaacaacac aaaagacact gatttaaaaa aaaatgaggc 1501 agggctcagt ggctcacacc tataatccca ataccttggg aggccgaggc aatgtatcac 1561 ctgaggtcag gagttcaaga ctaccctggc caacatggca aaatcccatc tctactgaaa 1621 atacaagaat tagctgggca tggtggcagg tgcctgcaat cccagctact caggaggctg 1681 aggcaggaga atcacttgaa ctcgggaggt agagggtgca gtgagccaaa atcgcacctc 1741 tgcattccag cctgggtgac agagggagac tctgtctcaa aacaaaacaa aacaaaaaat 1801 gaacagcacc tcaggaacaa taccaaaaag tccaacagct gtataattgg tggcccagaa 1861 ggagaggaga aagagtggag tacagaaatg agatctgaag aactaatgac tgataatgtt 1921 tcaattttga aaaaggacat aaacctaaag attatagatt caaaagccca gctgaattca 1981 aataggataa atacagatgc agatatatta tcattaaact gtgaaataaa ttggttttgt 2041 cacaagccag cattgtcact gtgggagaaa agagatcaaa agtacacaag gaaggaagga 2101 aatacagaat attatggcca tgggaaagag gtgtcagtgt gaatacatag aacagcacac 2161 ttaagcaaca accccaaatg atggggcttc ctacaaaaca gttggccttt actcttcaaa 2221 agtgtcaggt cacgaaataa atccatgctg aggacccgtt ccaggttaaa gcagactaaa 2281 ggggctggac aacacagtga aacgtgtgag cttggattag atatatgctg gactagagaa 2341 ggctgtgagg gggacaatgg ctgaaatgtg aatgaggtta ctatagttat gaaaaatgtt 2401 aagacttgga aaatctatat aaagcagacg gcataattct tgtacttttt ttgcaacttt 2461 ttaataaacc tgaaactatt tcaaaatgaa aagttaatcc aagctgtctt gagtagaagt 2521 taaaacaaca acaacaaaag aaaattgaaa agttaaaaat gaacccccaa cagaatgttc 2581 ccctttattt ttctttcatg taaggacgca ggatatgcat tttgctcagc taccaccctt 2641 cactgcatcc cattttgaga agtggtattt tcttcattca tctgttctag gtttttaaaa 2701 aaatatttaa gatcttctct ttttaaagaa tctgttcatt tggaatgtac tttttgcatt 2761 tttacttgtg aaaatatgta tttatccttt ttgttatgaa tgtatgactt cactgtgtca 2821 gagaatatgg tcttaagaga tacagaaaac ttttgagaat gataagatct ggacatgcta 2881 gatgaaatca aagccctgga tgtccttgtt caagctt // LOCUS HSMHCT8S22 109646 bp DNA PRI 29-AUG-1997 DEFINITION Homo sapiens HLA class III region containing tenascin X (tenascin-X) gene, partial cds; cytochrome P450 21-hydroxylase (CYP21B), complement component C4 (C4B) G11, helicase (SKI2W), RD, complement factor B (Bf), and complement component C2 (C2) genes, complete cds. ACCESSION AF019413 NID g2347130 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 109646) AUTHORS Rowen,L., Dankers,C., Baskin,D., Faust,J., Loretz,C., Ahearn,M.E., Banta,A., Swartzell,S., Smith,T.M., Spies,T. and Hood,L. TITLE Sequence determination of 300 kilobases of the human class III MHC locus JOURNAL Unpublished REFERENCE 2 (bases 1 to 109646) AUTHORS Rowen,L. TITLE Direct Submission JOURNAL Submitted (15-AUG-1997) Department of Molecular Biotechnology, Box 357730 University of Washington, Seattle, Washington 98195, USA COMMENT Cosmids T8E, T5A, T29A, and S22A were obtained from Thomas Spies (Spies et al, Nature (1990) 348: 744-747). Cosmid T8E spans bases 1-37866, cosmid T5A spans 14456-54139, cosmid T29A spans 42399-75678, cosmid S22A spans 75069-109646. This entry overlaps GenBank Accession Number U89337 by 17897 bases. Sequencing methodology: high redundancy shotgun. Interspersed Repeats were identified with RepeatMasker (available from http://ftp.genome.washington.edu/RM/RepeatMasker.html) Simple sequence repeats were identified with sputnik (available from http://serac.mbt.washington.edu/ chrisa/software/sputnik.html). FEATURES Location/Qualifiers source 1..109646 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6p21" source 1..37866 /organism="Homo sapiens" /clone="cosmid T8E" /clone_lib="Tom Spies library" /map="6p21" gene <1..17135 /gene="tenascin-X" CDS join(<1..231,1386..1703,2121..2438,4611..4901,5302..5619, 7968..8291,8703..9020,9640..9927,10278..10556, 11834..12115,12964..13284,13574..13909,14161..14283, 14398..14541,14734..14853,14972..15115,15209..15339, 15459..15591,15684..15835,15928..16024,16117..16278, 16356..16519,16840..16941) /gene="tenascin-X" /note="Exons 23-30 are defined by dot matrix analysis of internal repeat sequences. Exons 31-45 are defined by a comparison with the cDNA sequence found in GenBank Accession Number M25813." /codon_start=3 /product="tenascin X" /db_xref="PID:g2347137" /translation="SSPDSLSLSWTVPQGRFDSFTVQYKDRDGRPQAVRVGGQESKVT VRGLEPGRKYKMHLYGLHEGRRLGPVSAVGVTEDEAETTQAVPTMTPEPPIKPRLGEL TMTDATPDSLSLSWTVPEGQFDHFLVQYRNGDGQPKAVRVPGHEDGVTISGLEPDHKY KMNLYGFHGGQRVGPISVIGVTEEETPSPTELSTEAREPPEEPLLGELTVTGSSPDSL SLSWTIPQGHFDSFTVQYKDRDGRPQVMRVRGEESEVTVGGLEPGRKYKMHLYGLHEG RRVGPVSTVGVTVPTTTPEPPNKPRLGELTVTDATPDSLSLSWMVPEGQFDHFLVQYR NGDGQPKVVRVPGHEDGVTISGLEPDHKYKMNLYGFHGGQRVGPISVIGVTEEETPAP TEPSTEAPEPPEEPLLGELTVTGSSPDSLSLSWTIPQGRFDSFTVQYKDRDGRPQVVR VRGEESEVTVGGLEPGCKYKMHLYGLHEGQRVGPVSAVGVTAPKDEAETTQAVPTMTP EPPIKPRLGELTVTDATPDSLSLSWMVPEGQFDHFLVQYRNGDGQPKAVRVPGHEDGV TISGLEPDHKYKMNLYGFHGGQRVGPVSAIGVTEEETPSPTEPSTEAPEAPEEPLLGE LTVTGSSPDSLSLSWTVPQGRFDSFTVQYKDRDGQPQVVRVRGEESEVTVGGLEPGRK YKMHLYGLHEGQRVGPVSTVGITAPLPTPLPVEPRLGELAVAAVTSDSVGLSWTVAQG PFDSFLVQYRDAQGQPQAVPVSGDLRAVAVSGLDPARKYKFLLFGLQNGKRHGPVPVE ARTAPDTKPSPRLGELTVTDATPDSVGLSWTVPEGEFDSFVVQYKDKDGRLQVVPVAA NQREVTVQGLEPSRKYRFLLYGLSGRKRLGPISADSTTAPLEKELPPHLGELTVAEET SSSLRLSWTVAQGPFDSFVVQYRDTDGQPRAVPVAADQRTVTVEDLEPGKKYKFLLYG LLGGKRLGPVSALGMTAPEEDTPAPELAPEAPEPPEEPRLGVLTVTDTTPDSMRLSWS VAQGPFDSFVVQYEDTNGQPQALLVDGDQSKILISGLEPSTPYRFLLYGLHEGKRLGP LSAEGTTGLAPAGQTSEESRPRLSQLSVTDVTTSSLRLNWEAPPGAFDSFLLRFGVPS PSTLEPHPRPLLQRELMVPGTRHSAVLRDLRSGTLYSLTLYGLRGPHKADSIQGTART LSPVLESPRDLQFSEIRETSAKVNWMPPPSRADSFKVSYQLADGGEPQSVQVDGQART QKLQGLIPGARYEVTVVSVRGFEESEPLTGFLTTVPDGPTQLRALNLTEGFAVLHWKP PQNPVDTYDVQVTAPGAPPLQAETPGSAVDYPLHDLVLHTNYTATVRGLRGPNLTSPA SITFTTGLEAPRDLEAKEVTPRTALLTWTEPPVRPAGYLLSFHTPGGQNQEILLPGGI TSHQLLGLFPSTSYNARLQAMWGQSLLPPVSTSFTTGGLRIPFPRDCGEEMQNGAGAS RTSTIFLNGNRERPLNVFCDMETDGGGWLVFQRRMDGQTDFWRDWEDYAHGFGNISGE FWLGNEALHSLTQAGDYSMRVDLRAGDEAVFAQYDSFHVDSAAEYYRLHLEGYHGTAG DSMSYHSGSVFSARDRDPNSLLISCAVSYRGAWWYRNCHYANLNGLYGSTVDHQGVSW YHWKGFEFSVPFTEMKLRPRNFRSPAGGG" misc_feature 1..17897 /note="Overlap with cosmid T27B, found in GenBank Accession Number U89337" exon 1..231 /gene="tenascin-X" /note="repeating unit defined by dot matrix" /number=23 repeat_region complement(571..623) /rpt_family="LINE2" repeat_region complement(810..907) /rpt_family="LINE2" exon 1386..1703 /gene="tenascin-X" /note="repeating unit defined by dot matrix" /number=24 exon 2121..2438 /gene="tenascin-X" /note="repeating unit defined by dot matrix" /number=25 repeat_region complement(2577..2671) /rpt_family="MER5A" repeat_region complement(3769..3827) /rpt_family="LINE2" repeat_region complement(4008..4105) /rpt_family="LINE2" exon 4611..4901 /gene="tenascin-X" /note="repeating unit defined by dot matrix" /number=26 exon 5302..5619 /gene="tenascin-X" /note="repeating unit defined by dot matrix" /number=27 repeat_region complement(5819..6119) /rpt_family="AluY" repeat_region complement(6127..6182) /rpt_family="MER5A" repeat_region complement(6188..6515) /rpt_family="L1MB3" repeat_region complement(6534..6667) /rpt_family="AluJo" repeat_region complement(6668..6817) /rpt_family="L1MB8" repeat_region complement(7397..7501) /rpt_family="LINE2" exon 7968..8291 /gene="tenascin-X" /note="repeating unit defined by dot matrix" /number=28 exon 8703..9020 /gene="tenascin-X" /note="repeating unit defined by dot matrix" /number=29 repeat_region 9307..9516 /rpt_family="MIR" exon 9640..9927 /gene="tenascin-X" /note="repeating unit defined by dot matrix" /number=30 exon 10278..10556 /gene="tenascin-X" /note="Defined by dot matrix to GenBank Accession Number M25813" /number=31 repeat_region 10762..10880 /rpt_family="MIR" exon 11834..12115 /gene="tenascin-X" /note="Defined by dot matrix to GenBank Accession Number M25813" /number=32 exon 12964..13284 /gene="tenascin-X" /note="Defined by dot matrix to GenBank Accession Number M25813" /number=33 repeat_region complement(13464..13529) /rpt_family="MIR" exon 13574..13909 /gene="tenascin-X" /note="Defined by dot matrix to GenBank Accession Number M25813" /number=34 exon 14161..14283 /gene="tenascin-X" /note="Defined by dot matrix to GenBank Accession Number M25813" /number=35 exon 14398..14541 /gene="tenascin-X" /note="Defined by dot matrix to GenBank Accession Number M25813" /number=36 source 14456..54139 /organism="Homo sapiens" /clone="cosmid T5A" /clone_lib="Tom Spies library" /map="6p21" exon 14734..14853 /gene="tenascin-X" /note="Defined by dot matrix to GenBank Accession Number M25813" /number=37 exon 14972..15115 /gene="tenascin-X" /note="Defined by dot matrix to GenBank Accession Number M25813" /number=38 exon 15209..15339 /gene="tenascin-X" /note="Defined by dot matrix to GenBank Accession Number M25813" /number=39 exon 15459..15591 /gene="tenascin-X" /note="Defined by dot matrix to GenBank Accession Number M25813" /number=40 exon 15684..15835 /gene="tenascin-X" /note="Defined by dot matrix to GenBank Accession Number M25813" /number=41 exon 15928..16024 /gene="tenascin-X" /note="Defined by dot matrix to GenBank Accession Number M25813" /number=42 exon 16117..16278 /gene="tenascin-X" /note="Defined by dot matrix to GenBank Accession Number M25813" /number=43 exon 16356..16519 /gene="tenascin-X" /note="Defined by dot matrix to GenBank Accession Number M25813" /number=44 exon 16840..17135 /gene="tenascin-X" /note="Defined by dot matrix to GenBank Accession Number M25813. The 3' UTR is defined by EST with GenBank Accession Number W51953" /number=45 gene complement(17156..19867) /gene="CYP21B" CDS complement(join(17156..17421,17519..17622,17706..17884, 18085..18285,18455..18541,18643..18744,18833..18934, 19042..19196,19479..19568,19666..19867)) /gene="CYP21B" /note="exon boundries based on annotation in GenBank Accession Number M26856" /codon_start=1 /product="cytochrome P450 21-hydroxylase" /db_xref="PID:g2347138" /translation="MLLLGLLLLLPLLAGARLLWNWWKLRSLHLPPLAPGFLHLLQPD LPIYLLGLTQKFGPIYRLHLGLQDVVVLNSKRTIEEAMVKKWADFAGRPEPLTYKLVS RNYPDLSLGDYSLLWKAHKKLTRSALLLGIRDSMEPVVEQLTQEFCERMRAQPGTPVA IEEEFSLLTCSIICYLTFGDKIKDDNLMPAYYKCIQEVLKTWSHWSIQIVDVIPFLRF FPNPGLRRLKQAIEKRDHIVEMQLRQHKESLVAGQWRDMMDYMLQGVAQPSMEEGSGQ LLEGHVHMAAVDLLIGGTETTANTLSWAVVFLLHHPEIQQRLQEELDHELGPGASSSR VPYKDRARLPLLNATIAEVLRLRPVVPLALPHRTTRPSSISGYDIPEGTVIIPNLQGA HLDETVWERPHEFWPDRFLEPGKNSRALAFGCGARVCLGEPLARLELFVVLTRLLQAF TLLPSGDALPSLQPLPHCSVILKMQPFQVRLQPRGMGAHSPGQSQ" exon complement(17156..17421) /gene="CYP21B" /number=10 exon complement(17519..17622) /gene="CYP21B" /number=9 exon complement(17706..17884) /gene="CYP21B" /number=8 exon complement(18085..18285) /gene="CYP21B" /number=7 exon complement(18455..18541) /gene="CYP21B" /number=6 exon complement(18643..18744) /gene="CYP21B" /number=5 exon complement(18833..18934) /gene="CYP21B" /number=4 exon complement(19042..19196) /gene="CYP21B" /number=3 repeat_region 19358..19442 /rpt_family="FLAM_A" exon complement(19479..19568) /gene="CYP21B" /number=2 exon complement(19666..19867) /gene="CYP21B" /number=1 repeat_region 19811..19866 /rpt_family="(CAG)n" repeat_region 21789..21920 /rpt_family="LINE2" repeat_region 22055..22128 /rpt_family="LINE2" exon complement(22873..23154) /gene="C4B" /number=41 gene complement(22873..43446) /gene="C4B" CDS complement(join(23013..23154,23298..23430,23693..23776, 23959..24057,24143..24232,24398..24500,25992..26066, 26157..26247,26362..26548,26646..26739,27789..27848, 28242..28409,28492..28724,28953..29124,29230..29346, 29442..29598,29759..29834,30014..30223,30359..30448, 30550..30601,30846..31055,31169..31308,31567..31637, 31728..31839,31931..32128,32389..32463,32631..32757, 33009..33167,33321..33506,33639..33821,33967..34146, 34246..34361,41146..41278,41376..41481,41620..41716, 41887..41969,42188..42276,42355..42425,42635..42836, 43051..43249,43382..43446)) /gene="C4B" /note="exon boundries defined by dot matrix to GenBank Accession Number K02403 (C4A mRNA) or annotation in the C4B genomic sequence found in GenBank Accession Number U24578" /codon_start=1 /product="complement component C4" /db_xref="PID:g2347136" /translation="MRLLWGLIWASSFFTLSLQKPRLLLFSPSVVHLGVPLSVGVQLQ DVPRGQVVKGSVFLRNPSRNNVPCSPKVDFTLSSERDFALLSLQVPLKDAKSCGLHQL LRGPEVQLVAHSPWLKDSLSRTTNIQGINLLFSSRRGHLFLQTDQPIYNPGQRVRYRV FALDQKMRPSTDTITVMVENSHGLRVRKKEVYMPSSIFQDDFVIPDISEPGTWKISAR FSDGLESNSSTQFEVKKYVLPNFEVKITPGKPYILTVPGHLDEMQLDIQARYIYGKPV QGVAYVRFGLLDEDGKKTFFRGLESQTKLVNGQSHISLSKAEFQDALEKLNMGITDLQ GLRLYVAAAIIEYPGGEMEEAELTSWYFVSSPFSLDLSKTKRHLVPGAPFLLQALVRE MSGSPASGIPVKVSATVSSPGSVPEVQDIQQNTDGSGQVSIPIIIPQTISELQLSVSA GSPHPAIARLTVAAPPSGGPGFLSIERPDSRPPRVGDTLNLNLRAVGSGATFSHYYYM ILSRGQIVFMNREPKRTLTSVSVFVDHHLAPSFYFVAFYYHGDHPVANSLRVDVQAGA CEGKLELSVDGAKQYRNGESVKLHLETDSLALVALGALDTALYAAGSKSHKPLNMGKV FEAMNSYDLGCGPGGGDSALQVFQAAGLAFSDGDQWTLSRKRLSCPKEKTTRKKRNVN FQKAINEKLGQYASPTAKRCCQDGVTRLPMMRSCEQRAARVQQPDCREPFLSCCQFAE SLRKKSRDKGQAGLQRALEILQEEDLIDEDDIPVRSFFPENWLWRVETVDRFQILTLW LPDSLTTWEIHGLSLSKTKGLCVATPVQLRVFREFHLHLRLPMSVRRFEQLELRPVLY NYLDKNLTVSVHVSPVEGLCLAGGGGLAQQVLVPAGSARPVAFSVVPTAAAAVSLKVV ARGSFEFPVGDAVSKVLQIEKEGAIHREELVYELNPLDHRGRTLEIPGNSDPNMIPDG DFNSYVRVTASDPLDTLGSEGALSPGGVASLLRLPRGCGEQTMIYLAPTLAASRYLDK TEQWSTLPPETKDHAVDLIQKGYMRIQQFRKADGSYAAWLSRGSSTWLTAFVLKVLSL AQEQVGGSPEKLQETSNWLLSQQQADGSFQDLSPVIHRSMQGGLVGNDETVALTAFVT IALHHGLAVFQDEGAEPLKQRVEASISKASSFLGEKASAGLLGAHAAAITAYALTLTK APADLRGVAHNNLMAMAQETGDNLYWGSVTGSQSNAVSPTPAPRNPSDPMPQAPALWI ETTAYALLHLLLHEGKAEMADQAAAWLTRQGSFQGGFRSTQDTVIALDALSAYWIASH TTEERGLNVTLSSTGRNGFKSHALQLNNRQIRGLEEELQFSLGSKINVKVGGNSKGTL KVLRTYNVLDMKNTTCQDLQIEVTVKGHVEYTMEANEDYEDYEYDELPAKDDPDAPLQ PVTPLQLFEGRRNRRRREAPKVVEEQESRVHYTVCIWRNGKVGLSGMAIADVTLLSGF HALRADLEKLTSLSDRYVSHFETEGPHVLLYFDSVPTSRECVGFEAVQEVPVGLVQPA SATLYDYYNPERRCSVFYGAPSKSRLLATLCSAEVCQCAEGKCPRQRRALERGLQDED GYRMKFACYYPRVEYGFQVKVLREDSRAAFRLFETKITQVLHFTKDVKAAANQMRNFL VRASCRLRLEPGKEYLIMGLDGATYDLEGHPQYLLDSNSWIEEMPSERLCRSTRQRAA CAQLNDFLQEYGTQGCQV" exon complement(23298..23430) /gene="C4B" /number=40 exon complement(23693..23776) /gene="C4B" /number=39 exon complement(23959..24057) /gene="C4B" /number=38 exon complement(24143..24232) /gene="C4B" /number=37 exon complement(24398..24500) /gene="C4B" /number=36 repeat_region 24866..24931 /rpt_family="LINE2" repeat_region 25426..25561 /rpt_family="FLAM_C" exon complement(25992..26066) /gene="C4B" /number=35 exon complement(26157..26247) /gene="C4B" /number=34 exon complement(26362..26548) /gene="C4B" /number=33 exon complement(26646..26739) /gene="C4B" /number=32 repeat_region complement(27531..27578) /rpt_family="LINE2" exon complement(27789..27848) /gene="C4B" /number=31 repeat_region complement(27987..28100) /rpt_family="MIR" exon complement(28242..28409) /gene="C4B" /number=30 exon complement(28492..28724) /gene="C4B" /function="29" exon complement(28953..29124) /gene="C4B" /number=28 exon complement(29230..29346) /gene="C4B" /number=27 exon complement(29442..29598) /gene="C4B" /number=26 exon complement(29759..29834) /gene="C4B" /number=25 exon complement(30014..30223) /gene="C4B" /number=24 exon complement(30359..30448) /gene="C4B" /number=23 exon complement(30550..30601) /gene="C4B" /number=22 exon complement(30846..31055) /gene="C4B" /number=21 exon complement(31169..31308) /gene="C4B" /number=20 exon complement(31567..31637) /gene="C4B" /number=19 exon complement(31728..31839) /gene="C4B" /number=18 exon complement(31931..32128) /gene="C4B" /number=17 exon complement(32389..32463) /gene="C4B" /number=16 repeat_region complement(32506..32621) /rpt_family="(GGA)n" exon complement(32631..32757) /gene="C4B" /number=15 exon complement(33009..33167) /gene="C4B" /number=14 exon complement(33321..33506) /gene="C4B" /number=13 exon complement(33639..33821) /gene="C4B" /number=12 variation 33952 /gene="C4B" /note="cosmid T8E: g; cosmid T5A: t" /replace="t" exon complement(33967..34146) /gene="C4B" /number=11 exon complement(34246..34361) /gene="C4B" /number=10 repeat_region 34495..40868 /rpt_family="HERVKC4" exon complement(41146..41278) /gene="C4B" /number=9 exon complement(41376..41481) /gene="C4B" /number=8 exon complement(41620..41716) /gene="C4B" /number=7 exon complement(41887..41969) /gene="C4B" /number=6 exon complement(42188..42276) /gene="C4B" /number=5 exon complement(42355..42425) /gene="C4B" /number=4 source 42399..75678 /organism="Homo sapiens" /clone="cosmid T29A" /clone_lib="Tom Spies library" /map="6p21" exon complement(42635..42836) /gene="C4B" /number=3 exon complement(43051..43249) /gene="C4B" /number=2 exon complement(43382..43446) /gene="C4B" /number=1 exon complement(44108..44550) /gene="G11" /number=7 gene complement(44108..53184) /gene="G11" CDS complement(join(44505..44550,44753..44900,45003..45100, 45998..46137,46541..46648,52787..52923,53033..53132)) /gene="G11" /note="exon boundaries defined by dot matrix to GenBank Accession Number X77386" /codon_start=1 /product="G11" /db_xref="PID:g2347132" /translation="MSWKRHHLIPETFGVKRRRKRGPVESDPLRGEPGSARAAVSELM QLFPRGLFEDALPPIVLRSQVYSLVPDRTVADRQLKELQEQGEIRIVQLGFDLDAHGI IFTEDYRTRVCDCVLKACDGRPYAGAVQKFLASVLPACGDLSFQQDQMTQTFGFRDSE ITHLVNAGVLTVRDAGSWWLAVPGAGRFIKYFVKGRQAVLSMVRKAKYRELLLSELLG RRAPVVVRLGLTYHVHDLIGAQLVDCISTTSGTLLRLPET" exon complement(44753..44900) /gene="G11" /number=6 exon complement(45003..45100) /gene="G11" /number=5 exon complement(45998..46137) /gene="G11" /number=4 repeat_region complement(46267..46388) /rpt_family="(GGA)n" exon complement(46541..46648) /gene="G11" /number=3 repeat_region 47115..47401 /rpt_family="AluJo" repeat_region 47464..47757 /rpt_family="AluSx" repeat_region 47765..47901 /rpt_family="AluSq" repeat_region 47903..48207 /rpt_family="AluY" repeat_region 48208..48371 /rpt_family="AluSc" repeat_region complement(48508..49395) /rpt_family="SVA" repeat_region 49891..49943 /rpt_family="POLY_A" repeat_region 49944..50042 /rpt_family="AluSq" repeat_region 50046..50343 /rpt_family="AluSg" repeat_region 50698..50753 /rpt_family="LINE2" repeat_region complement(50768..51109) /rpt_family="AluSx" repeat_region complement(51247..51541) /rpt_family="AluJb" repeat_region 51619..51748 /rpt_family="MER5B" repeat_region 51667..51781 /rpt_family="MER5A" repeat_region 52028..52230 /rpt_family="LINE2" exon complement(52787..52923) /gene="G11" /number=2 exon complement(53033..53184) /gene="G11" /number=1 gene 53552..55725 /gene="DOM3-like" exon 53552..54224 /gene="DOM3-like" /number=1 CDS join(53869..54224,54397..54632,54719..54828,55166..55201, 55341..55435,55520..55667) /gene="DOM3-like" /note="intron-exon boundaries defined by a contig of ESTs with GenBank Accession Numbers D82148, N43758, Z12952, AA348910, and 35002: similar to DOM-3 protein encoded by C. elegans, SwissProt Accession Number Q10660 and a hypothetical protein from A. thaliana, EMBL Accession Number Z97343" /codon_start=1 /product="unknown" /db_xref="PID:g2347139" /translation="MDPRGTKRGAEKTEVAEPRNKLPRPAPSLPTDPALYSGPFPFYR RPSELGCFSLDAQRQYHGDARALRYYSPPPTNGPGPNFDLRDGYPDRYQPRDEEVQER LDHLLCWLLEHRGRLEGGPGWLAEAIVTWRGHLTKLLTTPYERQEGWQLAASRFQGTL YLSEVETPNARAQRLARPPLLRELMYMGYKFEQYMCADKPGSSPDPSGEVNTNVAFCS VLRSRLGSHPLLFSGETFPTMKMFEYVRNDRDGWNPSVCMNFCAAFLSFAQSTVVQDD PRLVHLFSWEPGGPVTVSVHQDAPYAFLPIWYVEAMTQDLPSPPKTPSPK" exon 54397..54632 /gene="DOM3-like" /number=2 exon 54719..54879 /gene="DOM3-like" /number=3 exon 55166..55201 /gene="DOM3-like" /number=4 exon 55341..55435 /gene="DOM3-like" /number=5 exon 55520..55725 /gene="DOM3-like" /number=6 exon complement(55793..56029) /gene="SKI2W" /number=28 gene complement(55793..66385) /gene="SKI2W" CDS complement(join(55829..56029,56124..56264,56455..56673, 56748..56858,57006..57216,57462..57588,57682..57829, 58165..58269,58401..58538,58696..58833,59529..59759, 61200..61310,61417..61629,61809..61905,61983..62129, 62451..62557,62744..62828,62957..63103,63488..63633, 63867..63995,64156..64346,64448..64501,64802..64876, 64996..65110,65205..65322,65423..65532,66142..66245, 66328..66349)) /gene="SKI2W" /note="exon boundaries defined by dot matrix to GenBank Accession Number X98378 (SKI2W cDNA): 5' UTR defined using EST with GenBank Accession Number AA074489" /codon_start=1 /product="helicase" /db_xref="PID:g2347134" /translation="MMETERLVLPPPDPLDLPLRAVELGCTGHWELLNLPGAPESSLP HGLPPCAPDLQQEAEQLFLSSPAWLPLHGVEHSARKWQRKTDPWSLLAVLGAPVPSDL QAQRHPTTGQILGYKEVLLENTNLSATTSLSLRRPPGPASQSLWGNPTRYPFWPGGMD EPTITDLNTREEAEEEIDFEKDLLTIPPGFKKGMDFAPKDCPTPAPGLLSLSCLLEPL DLGGGDEDENEAVGQPGGPRGDTVSASPCSAPLARASSLEDLVLKEASTAVSTPEAPE PPSQEQWAIPVDATSPVGDFYRLIPQPAFQWAFEPDVFQKQAILHLERHDSVFVAAHT SAGKTVVAEYAIALAQKHMTRTIYTSPIKALSNQKFRDFRNTFGDVGLLTGDVQLHPE ASCLIMTTEILRSMLYSGSDVIRDLEWVIFDEVHYINDVERGVVWEEVLIMLPDHVSI ILLSATVPNALEFADWIGRLKRRQIYVISTVTRPVPLEHYLFTGNSSKTQGELFLLLD SRGAFHTKGYYAAVEAKKERMSKHAQTFGAKQPTHQGGPAQDRGVYLSLLASLRTRAQ LPVVVFTFSRGRCDEQASGLTSLDLTTSSEKSEIHLFLQRCLARLRGSDRQLPQVLHM SELLNRGLGVHHSGILPILKEIVEMLFSRGLVKVLFATETFAMGVNMPARTVVFDSMR KHDGSTFRDLLPGEYVQMAGRAGRRGLDPTGTVILLCKGRVPEMADLHRMMMGKPSQL QSQFRLTYTMILNLLRVDALRVEDMMKRSFSEFPSRKDSKAHEQALAELTKRLGALEE PDMTGQLVDLPEYYSWGEELTETQHMIQRRIMESVNGLKSLSAGRVVVVKNQEHHNAL GVILQVSSNSTSRVFTTLVLCDKPLSQDPQDRGPATAEVPYPDDLVGFKLFLPEGPCD HTMVKLQPGDMAAITTKVLRVNGEKILEDFSKRQQPKFKKDPPLAAVTTAVQELLRLA QAHPAGPPTLDPVNDLQLKDMSVVEGGLRARKLEELIQGAQCVHSPRFPAQYLKLRER MQIQKEMERLRFLLSDQSLLLLPEYHQRVEVLRTLGYVDEVGTVKLAGRVACAMSSHE LLLTELMFDNALSTLRPEEIAALLSGLVCQSPGDAGDQLPNTLKQGIERVRAVAKRIG EVQVACGLNQTVEEFVGELNFGLVEVVYEWARGMPFSELAGLSGTPEGLVVRCIQRLA EMCRSLRGAARLVGEPVLGAKMETAATLLRRDIVFAASLYTQ" exon complement(56124..56264) /gene="SKI2W" /number=27 exon complement(56455..56673) /gene="SKI2W" /number=26 exon complement(56748..56858) /gene="SKI2W" /number=25 exon complement(57006..57216) /gene="SKI2W" /number=24 exon complement(57462..57588) /gene="SKI2W" /number=23 exon complement(57682..57829) /gene="SKI2W" /number=22 exon complement(58165..58269) /gene="SKI2W" /number=21 exon complement(58401..58538) /gene="SKI2W" /number=20 exon complement(58696..58833) /gene="SKI2W" /number=19 repeat_region 59141..59448 /rpt_family="AluJb" exon complement(59529..59759) /gene="SKI2W" /number=18 repeat_region complement(59833..59942) /rpt_family="MIR" repeat_region complement(60148..60370) /rpt_family="L1PA9" repeat_region complement(60405..60704) /rpt_family="AluSc" repeat_region complement(60789..60943) /rpt_family="L1MC4" repeat_region complement(60963..61101) /rpt_family="MIR" exon complement(61200..61310) /gene="SKI2W" /number=17 exon complement(61417..61629) /gene="SKI2W" /number=16 exon complement(61809..61905) /gene="SKI2W" /number=15 exon complement(61983..62129) /gene="SKI2W" /number=14 exon complement(62451..62557) /gene="SKI2W" /number=13 exon complement(62744..62828) /gene="SKI2W" /number=12 exon complement(62957..63103) /gene="SKI2W" /number=11 exon complement(63488..63633) /gene="SKI2W" /number=10 exon complement(63867..63995) /gene="SKI2W" /number=9 exon complement(64156..64346) /gene="SKI2W" /number=8 exon complement(64448..64501) /gene="SKI2W" /number=7 exon complement(64802..64876) /gene="SKI2W" /number=6 exon complement(64996..65110) /gene="SKI2W" /number=5 exon complement(65205..65322) /gene="SKI2W" /number=4 exon complement(65423..65532) /gene="SKI2W" /number=3 exon complement(66142..66245) /gene="SKI2W" /number=2 exon complement(66328..66385) /gene="SKI2W" /note="does not include 5' UTR" /number=1 exon 66573..66650 /gene="RD" /note="part of 5' UTR" /number=1 gene 66573..73314 /gene="RD" repeat_region 66595..66646 /rpt_family="GC_rich" exon 67088..67170 /gene="RD" /number=2 CDS join(67096..67170,68532..68601,68704..68849,70250..70324, 70447..70484,70651..70988,71101..71245,71409..71463, 71712..71814,73145..73242) /gene="RD" /note="exon boundaries defined by dot matrix to GenBank Accession Number L03411 (RD mRNA)" /codon_start=1 /product="RD" /db_xref="PID:g2347135" /translation="MLVIPPGLSEEEEALQKKFNKLKKKKKALLALKKQSSSSTTSQG GVKRSLSEQPVMDTATATEQAKQLVKSGAISAIKAETKNSGFKRSRTLEGKLKDPEKG PVPTFQPFQRSISADDDLQESSRRPQRKSLYESFVSSSDRLRELGPDGEEAEGPGAGD GPPRSFDWGYEERSGAHSSASPPRSRSRDRSHERNRDRDRDRERDRDRDRDRDRERDR DRDRDRDRDRERDRDRERDRDRDREGPFRRSDSFPERRAPRKGNTLYVYGEDMTPTLL RGAFSPFGNIIDLSMDPPRNCAFVTYEKMESADQAVAELNGTQVESVQLKVNIARKQP MLDAATGKSVWGSLAVQNSPKGCHRDKRTQIVYSDDVYKENLVDGF" repeat_region 67319..67436 /rpt_family="LINE2" exon 68532..68601 /gene="RD" /number=3 exon 68704..68849 /gene="RD" /number=4 repeat_region 69168..69312 /rpt_family="AluSx" repeat_region 69314..69605 /rpt_family="AluY" repeat_region 69606..69772 /rpt_family="AluSg" repeat_region 69783..70094 /rpt_family="AluJb" exon 70250..70324 /gene="RD" /number=5 exon 70447..70484 /gene="RD" /number=6 exon 70651..70988 /gene="RD" /number=7 exon 71101..71245 /gene="RD" /number=8 exon 71409..71463 /gene="RD" /number=9 exon 71712..71814 /gene="RD" /number=10 exon 73145..73314 /gene="RD" /number=11 exon complement(73463..73668) /gene="Bf" /number=18 gene complement(73463..79417) /gene="Bf" CDS complement(join(73513..73668,73939..73988,74070..74202, 74299..74399,74599..74675,74771..74924,75140..75257, 75396..75493,75988..76125,76201..76302,76584..76715, 77033..77171,77464..77600,77702..77803,78024..78197, 78353..78538,78939..79172,79260..79323)) /gene="Bf" /note="exons defined by dot matrix to GenBank Accession Number X72875 (Bf mRNA): 5' UTR defined using EST with GenBank Accession Number AA399208" /codon_start=1 /product="complement factor B" /db_xref="PID:g2347133" /translation="MGSNLSPQLCLMPFILGLLSGGVTTTPWSLAQPQGSCSLEGVEI KGGSFRLLQEGQALEYVCPSGFYPYPVQTRTCRSTGSWSTLKTQDQKTVRKAECRAIH CPRPHDFENGEYWPRSPYYNVSDEISFHCYDGYTLRGSANRTCQVNGRWSGQTAICDN GAGYCSNPGIPIGTRKVGSQYRLEDSVTYHCSRGLTLRGSQRRTCQEGGSWSGTEPSC QDSFMYDTPQEVAEAFLSSLTETIEGVDAEDGHGPGEQQKRKIVLDPSGSMNIYLVLD GSDSIGASNFTGAKKCLVNLIEKVASYGVKPRYGLVTYATYPKIWVKVSEADSSNADW VTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVPPEGWNRTRHVIILMTDGLH NMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALASKKDNEQ HVFKVKDMENLEDVFYQMIDESQSLSLCGMVWEHRKGTDYHKQPWQAKISVIRPSKGH ESCMGAVVSEYFVLTAAHCFTVDDKEHSIKVSVGGEKRDLEIEVVLFHPNYNINGKKE AGIPEFYDYDVALIKLKNKLKYGQTIRPICLPCTEGTTRALRLPPTTTCQQQKEELLP AQDIKALFVSEEEKKLTRKEVYIKNGDKKGSCERDAQYAPGYDKVKDISEVVTPRFLC TGGVSPYADPNTCRGDSGGPLIVHKRSRFIQVGVISWGVVDVCKNQKRQKQVPAHARD FHINLFQVLPWLKEKLQDEDLGFL" exon complement(73939..73988) /gene="Bf" /number=17 exon complement(74070..74202) /gene="Bf" /number=16 exon complement(74299..74399) /gene="Bf" /number=15 exon complement(74599..74675) /gene="Bf" /number=14 exon complement(74771..74924) /gene="Bf" /number=13 source 75069..109646 /organism="Homo sapiens" /clone="cosmid S22A" /clone_lib="Tom Spies library" /map="6p21" exon complement(75140..75257) /gene="Bf" /number=12 exon complement(75396..75493) /gene="Bf" /number=11 exon complement(75988..76125) /gene="Bf" /number=10 exon complement(76201..76302) /gene="Bf" /number=9 exon complement(76584..76715) /gene="Bf" /number=8 exon complement(77033..77171) /gene="Bf" /number=7 exon complement(77464..77600) /gene="Bf" /number=6 exon complement(77702..77803) /gene="Bf" /number=5 exon complement(78024..78197) /gene="Bf" /number=4 exon complement(78353..78538) /gene="Bf" /number=3 exon complement(78939..79172) /gene="Bf" /number=2 exon complement(79260..79417) /gene="Bf" /number=1 gene complement(79874..97749) /gene="C2" exon complement(79874..80367) /gene="C2" /number=18 CDS complement(join(80188..80367,80516..80565,80692..80818, 81319..81410,81552..81628,81736..81901,82018..82129, 82226..82320,82446..82586,86235..86324,88102..88242, 89498..89636,91257..91390,91591..91689,91773..91946, 96390..96575,97143..97352,97509..97554)) /gene="C2" /note="exon boundaries defined by dot matrix to GenBank Accession Number X04481 K01236 (C2 mRNA): 5' UTR defined using EST with GenBank Accession Number AA002135" /codon_start=1 /product="complement component C2" /db_xref="PID:g2347131" /translation="MGPLMVLFCLLFLYPGLADSAPSCPQNVNISGGTFTLSHGWAPG SLLTYSCPQGLYPSPASRLCKSSGQWQTPGATRSLSKAVCKPVRCPAPVSFENGIYTP RLGSYPVGGNVSFECEDGFILRGSPVRQCRPNGMWDGETAVCDNGAGHCPNPGISLGA VRTGFRFGHGDKVRYRCSSNLVLTGSSERECQGNGVWSGTEPICRQPYSYDFPEDVAP ALGTSFSHMLGATNPTQKTKESLGRKIQIQRSGHLNLYLLLDCSQSVSENDFLIFKES ASLMVDRIFSFEINVSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANYKDHENG TGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTAVDHI REILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQVFE HMLDVSKLTDTICGVGNMSANASDQERTPWHVTIKPKSQETCRGALISDQWVLTAAHC FRDGNDHSLWRVNVGDPKSQWGKEFLIEKAVISPGFDVFAKKNQGILEFYGDDIALLK LAQKVKMSTHARPICLPCTMEANLALRRPQGSTCRDHENELLNKQSVPAHFVALNGSK LNINLKMGVEWTSCAEVVSQEKTMFPNLTDVREVVTDQFLCSGTQEDESPCKGESGGA VFLERRFRFFQVGLVSWGLYNPCLGSADKNSRKRAPRSKVPPPRDFHINLFRMQPWLR QHLGDVLNFLPL" exon complement(80516..80565) /gene="C2" /number=17 exon complement(80692..80818) /gene="C2" /number=16 repeat_region 80869..81164 /rpt_family="AluSx" exon complement(81319..81410) /gene="C2" /number=15 exon complement(81552..81628) /gene="C2" /number=14 exon complement(81736..81901) /gene="C2" /number=13 exon complement(82018..82129) /gene="C2" /number=12 exon complement(82226..82320) /gene="C2" /number=11 exon complement(82446..82586) /gene="C2" /number=10 repeat_region 82664..82963 /rpt_family="LINE2" repeat_region 82978..83063 /rpt_family="FLAM_C" repeat_region 83123..83293 /rpt_family="MLT1B" repeat_region 83330..83425 /rpt_family="AluSp" repeat_region 83431..83728 /rpt_family="AluSg" repeat_region 83748..84062 /rpt_family="AluSx" repeat_region 84133..84373 /rpt_family="MLT1C" repeat_region complement(85220..85271) /rpt_family="(CAT)n" repeat_region complement(85340..85389) /rpt_family="MIR" repeat_region complement(85445..85475) /rpt_family="AT_rich" repeat_region 85476..85776 /rpt_family="AluSx" repeat_region 85782..85920 /rpt_family="AluJo" repeat_region 86024..86047 /rpt_family="POLY_A" exon complement(86235..86324) /gene="C2" /number=9 repeat_region 86457..86602 /rpt_family="AluSq" repeat_region 86603..86896 /rpt_family="AluSp" repeat_region 86897..87056 /rpt_family="AluSg" repeat_region 87055..87122 /rpt_type=tandem /rpt_unit=AT repeat_region 87057..87126 /rpt_family="(TA)n" repeat_region complement(87129..87234) /rpt_family="MIR" repeat_region 87472..87521 /rpt_family="(CAT)n" repeat_region complement(87532..87834) /rpt_family="AluSx" repeat_region 87862..87993 /rpt_family="FLAM_C" exon complement(88102..88242) /gene="C2" /number=8 repeat_region complement(88339..88630) /rpt_family="AluSx" unsure 88525 /gene="C2" /note="unclear data" /replace="c" repeat_region 88753..89053 /rpt_family="AluSg" exon complement(89498..89636) /gene="C2" /number=7 repeat_region 89801..90087 /rpt_family="AluSx" repeat_region 90509..90539 /rpt_type=tandem /rpt_unit=TATTT repeat_region complement(90528..90828) /rpt_family="AluSp" repeat_region complement(90877..90917) /rpt_family="AT_rich" exon complement(91257..91390) /gene="C2" /number=6 exon complement(91591..91689) /gene="C2" /number=5 exon complement(91773..91946) /gene="C2" /number=4 repeat_region 92145..92400 /rpt_family="L1MD1" repeat_region complement(92403..94035) /rpt_family="SVA" repeat_region 94048..94343 /rpt_family="L1ME3A" repeat_region 94372..94473 /rpt_family="L1ME3" repeat_region 94798..94927 /rpt_family="FLAM_A" repeat_region 94931..95227 /rpt_family="AluSp" repeat_region complement(95277..95565) /rpt_family="AluSx" unsure 95344 /gene="C2" /note="possible missing base due to compression" /replace="ct" repeat_region complement(95570..95864) /rpt_family="AluSg" repeat_region complement(95865..96041) /rpt_family="MIR" exon complement(96390..96575) /gene="C2" /number=3 exon complement(97143..97352) /gene="C2" /number=2 exon complement(97509..97749) /gene="C2" /number=1 repeat_region 98562..98672 /rpt_family="AluSg1" repeat_region 98676..98973 /rpt_family="AluSq" repeat_region 99716..100012 /rpt_family="AluSx" repeat_region complement(100276..100452) /rpt_family="MLT1G" repeat_region 101208..101508 /rpt_family="AluY" repeat_region 101519..101824 /rpt_family="AluY" repeat_region complement(102446..102747) /rpt_family="AluSx" repeat_region complement(103524..103741) /rpt_family="MIR" repeat_region 105397..105518 /rpt_family="L1ME3A" repeat_region 105863..106165 /rpt_family="AluSq" repeat_region complement(106241..106326) /rpt_family="L1MC4" repeat_region complement(106405..106689) /rpt_family="AluSg1" repeat_region complement(106694..107002) /rpt_family="AluJo" repeat_region complement(107013..107288) /rpt_family="AluJo" repeat_region complement(107384..107683) /rpt_family="AluSx" repeat_region complement(107703..107763) /rpt_family="L1PA16" repeat_region complement(107741..108015) /rpt_family="L1PA13" repeat_region complement(108117..108417) /rpt_family="AluSx" repeat_region complement(108421..108539) /rpt_family="AluSg1" repeat_region complement(108728..109029) /rpt_family="AluSx" BASE COUNT 26631 a 29462 c 28810 g 24743 t ORIGIN 1 gatcctcccc tgactcgctg agcctttcct ggaccgtccc ccagggccgc tttgactcct 61 tcaccgtgca gtacaaggac agggacgggc ggccccaggc ggtgcgtgtt gggggccagg 121 agagcaaggt cactgtgagg ggcctggagc ctgggcgcaa gtacaagatg cacctgtacg 181 gcctccacga ggggcggcgc ctgggcccgg tgtctgccgt gggcgtcaca ggtgagtgag 241 tgtgggtggg gcagggttgg aagacagccc tagaaaatgt gcccttctct accattttcc 301 tatacatatt tctgtcttga tggggctcac agtgaaagga atatagcaac attatggaaa 361 gacatgtcat ggagagacag gctgcaatcc agcaaatgaa gcaaaggcgg gtgagcatgt 421 gatagggagg cccagggctc aggtcaggac cagacaggga cgcctaagtc accctgccca 481 tgggtaccca ggggacagcc aggacctgag gccaggcatg ccttagcttg gtgacagctt 541 tagagagaag gtgaagtgtg tcagataatc acagctggtg caaaggccag gaggctagaa 601 agagcatggc acgtgagagg cactgaggat tgagtggggt gtcctttacg gtgagtatct 661 catcctgaag tgtgggagca gaggagggga ccactcacca ggcctggggt ctcccaggga 721 tgaggatgtg gttgcccagg tctgtgctga gatggcccca gcagctgtgc ctgtgtgaag 781 ctcatccgtg ggggctgaag atggggatgg ggtggcaggg agcctggagg cagcgaggcc 841 agtaggcagt tggtggccct ggtgagaggt gacagtggct caaactagga tcggggactg 901 gaggtggggt aggaaggtat ccaggggaat tcagggtaaa gatgctctga ggctgctggc 961 agctggtgag gagctggatc caggaacaca ccccaggctc tggcctcggg aggagtgtgc 1021 tgagcttgtt gcggagcaaa gacagaagcc cagtgaacaa aagatggcga agagacccca 1081 gtgctgggag gccaggggtg cagaggccga gtggggctgt gctcaaaaga gaggcggtgc 1141 tggagggaca gggagaggtg gcctgggtgt tgggaggtgg gggtgaggtg ggggctgagg 1201 gcaggagggt cagggtgagg gataggaaag gccacaggag aggagaggat gaagagctgt 1261 gctggagggg ctgtgggcag catcgtcctg ctcttgggca ctttgtgttt tgtgacacat 1321 cctttctatg ctgaactgag gagccaggga cctcactgtc cccacacgtg tctgtccaac 1381 tccagaggat gaagccgaga ccacccaagc agtgcctacc atgacccctg agccccccat 1441 caagcctcgc ctgggggagc tgaccatgac agatgccacc cctgactccc tcagcctgtc 1501 ctggacggtt cccgagggcc agtttgacca cttcctggtc cagtacagga atggggatgg 1561 gcagcccaag gcggtgcggg tgccggggca cgaggacggg gtcaccatct caggcctgga 1621 gccagaccat aaatacaaga tgaacctgta cggcttccac ggtggccagc gcgtgggccc 1681 catctctgtc attggggtga cgggtgagtg gatgatggca gccccagggt gggagccgtg 1741 ggagggtcac cctcttgctc tttggtgatg actggtgggg aatgggccag gggtccggtc 1801 agcaccacag acctgcttgt ggctggggct ccccttgggc cttcctctga ggctgacccc 1861 tggctcctcc tgagcaggga ggggccatca ggagttctgc tgtgctggtg actgtcccag 1921 gtcccccaca gctgaccctg gaacttgtca tgtgtgttag ctgtcagttg agcaggacca 1981 cccagcccca agaatgggct tttctgaaat gacctcacat acccagtagt ggccatggtt 2041 tctccctcct tcccttgaag acctgagcac atcccccagg cacctggcat cctctctata 2101 tctccttttc tcagctgcag aggaagagac ccccagcccc acggaactca gcactgaggc 2161 ccgggagccc cctgaggagc cgctcctggg ggagctgaca gtgacaggat cctcccctga 2221 ctcgctgagc ctctcctgga ccatccccca gggccacttc gactccttca ccgtgcagta 2281 caaggacagg gacgggcggc cccaggtgat gcgtgtcagg ggcgaggaga gcgaggtcac 2341 cgtggggggc ctggagcccg ggcgcaaata caagatgcac ctgtacggcc tccacgaggg 2401 gcggcgtgtg ggcccggtgt ccaccgtggg tgtgacaggt gagtgtttgt gagtgaggaa 2461 gatggcccta gaagatgttg ctttctctgc aacttcatga aaacaaaaat attctcacca 2521 gccgggcttc ttttgcacgt ttccatgctt ggggatcttg ctaagaggca gattggggtt 2581 caacaggttt gggctagggc cagagattct gcatttccaa caagaaatga tgcggatgct 2641 gctggcccat gatcacgccc cttgagtagc aaagttcttc acgacaaagg aattggaccc 2701 tcttttgaaa tctgttgaag agaatttttc tgcgtccctg attcctggta gtgtgctttc 2761 tctgtggagt tgactagggg ccgtgaagga agacagaagg cagtgagggg cagcgcgtcg 2821 actgcacact ctggaagccc actattggaa taggaatagg aacagacctt tctcaccaat 2881 gggccaattt gttcattcag caaagaattc cttgccctga tccggacact gtaattttag 2941 ggtatcaatg tcctctggcc aacgacctcc agaaactcac ctaaaattga atggatggaa 3001 ttatgctcca tgcagtgcag gaggactgtg gggtgactta ggaagaggct ttagaaagag 3061 attgttaagg aactgggctt gtgtttggtg ttttaggaaa gcatttaagg aagcaaggtt 3121 ttgctcttta ttagacgctg tcaggaagaa gggataaccc tattaccggg cgtctcacta 3181 agtcttatct ttggggaggt caactagagc aaggccaaag ctgccattgg taaagaagca 3241 gctatcactc agattagctg gatgggtgat gtttggttat ttttgtgctt tggaaaatgt 3301 tagagttcat ccttactgag acatgatcac agactggcct tgttcttgtc ttgatccatc 3361 ctactgacaa atggcctagt ctgatgttga tgttccatga aattgtctgt tcaactggac 3421 cacgccaaga ccagactgtg ccaggccagc cccaagcagc cgtggcctgg cagagggaaa 3481 gggcagctca cgggtgtcag ggctgccttt ttctttcata ggcggaagct ggaaagtgac 3541 acacacaagt tcatgttttc atggggctca caattgtggg cacaagcaga aaaccggaaa 3601 gtaagcaaag aaggatgagc atgtggagcc caggagccaa gccaggattg ggaagggaca 3661 gtgaagtcac tctgcccaca agtacccagg ggacagccag gacctgaggc caggcaggcc 3721 ttagcttgat gacagctttg gagaggaagt gaagcatgtc aaatcatcac agctagtgca 3781 aaggccagga ggctagaaag agcatggtgt gtgagagaca ctgaggattg agtggggtgt 3841 cctttgcagt tggtggatca tcctgaagtg tgggagcaga ggaggggacc actcaccagg 3901 cctggggtct cccagggatg aggatgtggt tgcccaggtc tgtgctgaga tggccccagc 3961 agctgtgcct gtgtgaagtt catccgtggg agctaaagat ggggatgggg tggcagggag 4021 cctggaggca gggaggccag taggcagttg gtggccctgg tgagaggtga cagtggctca 4081 aactaggatt ggggactgga ggtgggggag gaaggtatcc agggaaactc gggggagagg 4141 tactctgagg ctgctggcag ctggtgaggg gctgggtcca ggaacacacc ccaagctctg 4201 gcctcgggag gagtgtgctg agcttgttgc ggagcaaaga cagaagccca gtgaacaaaa 4261 gatggcgagg agacaccagt gcagggaggc taagggtgca gaggccgagt ggggctgtgc 4321 tcaaaagaga ggcggtgctg gagggacagg gagaggtggc ctgggtgttg ggaggtgggg 4381 gtgaggtggg ggctgagggc aggggggtca gggtgaggga taggaaaggc cgcaggagag 4441 gagaggatga agagctgtgc tggaggcgct gtgggcagca tcgtcctgct cttgggcact 4501 ttgtgttttg tgacacatcc tttctatgct gaactgagaa cccagggacc tcactctccc 4561 cacacgtgtc tgtccagctc cagaggatga agcagagacc acccaagcag tgcccaccac 4621 aacccctgag ccccccaaca agcctcgcct cggggagctg accgtgacag atgccacccc 4681 tgactccctc agcctgtcct ggatggtccc cgagggccag tttgaccact tcctggtcca 4741 gtacaggaat ggggatgggc agcccaaggt ggtgcgggtg ccggggcacg aggacggggt 4801 caccatctca ggcctggagc cagaccacaa gtacaagatg aacctgtacg gcttccacgg 4861 tggccagcgc gtgggcccca tctctgtcat tggggtgaca ggtgagtgta cgatgggagc 4921 cccagagtgg ggcctgtggg agggtctccc tttctctggt gatgggtgaa ctggcccagg 4981 aagcccctct gctcttggct gagccatggt acttttttgt ctttccccac ttccctgagg 5041 actgacagat cttcctgggt ggagaagggc cctgtgagct ctgttggtgg ctgtcccaag 5101 ttccccagca ctgacctcag agcttgtcat gtgtgttgac tgtaaactga gcaagaccac 5161 ccagctccaa agatgggcct ctccaagctg accccaggac ccccactcat ggccacagct 5221 tcgctctcct tcctcacaag acccaaggac atcccccagg gaagctgcct caccttctct 5281 gtcccctctt ctcagctgca gaggaagaaa ctcccgcccc cacagaaccc agcacggagg 5341 ccccggagcc ccctgaggag ccgctcctgg gggagctgac agtgacagga tcctcccctg 5401 actcgctgag cctctcctgg accatccccc agggccgctt cgactccttc actgtgcagt 5461 acaaggacag ggacgggcgg ccccaggtgg tgcgtgtcag gggcgaggag agcgaggtca 5521 ccgtgggggg cctggagccc gggtgcaaat acaagatgca cctgtacggc ctccacgagg 5581 ggcagcgcgt gggcccagtg tccgctgtgg gtgtgacagg tgagtaagtg tgagtgaggc 5641 gaggtgggga agatggccct ggaagacact gctgtctctc cagtcttcgt gaaaacatac 5701 tctgaagagt attccttcct ttgtgtatcc aaatgcctgg agatttttgt taaaaagcag 5761 atttggattc agcaggcttg gcctagggca agagaccatg catttctttc tttttctttc 5821 tttctttttt ttttttttga gatagagtct cgctttgtca cccaggctgg agtacagtgg 5881 ctcaatctcg gctcactgga acctccgcct cctgggttca agtgattctc ctacctcagc 5941 ctcctgagta actgggacta caggcgtgcg ccaccacgcc cagctaattt tttcatattt 6001 ttagtagaga tgggtttcac cgtgttagcc aggatggtct caatctcctg acctcgtgat 6061 ctgccctcct cggcctccca aagtgctggg atcataggcg tgagccacgg cgcccagccg 6121 accatccatt tctaacaaga ttctcatcta tcttgatgcc actggtccaa ggaacacact 6181 ttttttgttg ttttttattg tagtaaagta cacttgacat aattcgccat tctaatcgct 6241 ttctagtttg cagtccagca tgaagtccac tcacattgct gtggactgtc accctccact 6301 tccagaactc ctcttcccac actgaaactt cctacccact agacacgaac tcccattctc 6361 cccttgcccc agcccctggc aacaccattc tactctctgt ctctatgaat ttgactctag 6421 gtacctcata taagtgcaat catacaatat ttgtcttttt tgaatggttc atttcattaa 6481 atacaatgtc tttaaggttc atctatgtcg tagcatcagt cagaatttcc tttttatttt 6541 attttctgtg gcaatggggg tctcggtatg ttgcccaggc tggtctcaaa ctcctggcct 6601 caagcgagtc tcccaccttg acctcccaaa gtgctgggat tataggcaag agccactgca 6661 cctggccaga acttccttcc ttttcaggct gaataatgtt tcgttttaca catacagacc 6721 acactttgct catcctttca tccattgatg gacatctggg ttgcttccac ctcttagcta 6781 tcaagaataa tgctgctatg aatatttgtg tgcaaatcac agaacacctt tgtagcaaag 6841 ctcctcccag taacagattt ggaaactctc ttagaatttg ttgaagcgaa tttttcttgc 6901 attcatgaat cctcacagag gttaggctca gagttagggt tcctgtccta atgggaaaag 6961 ttcctgtcct aatggggcta acgtcatggg ggacaggctg tggaccagta agaaagcaaa 7021 ggcgggtgag catgtgacaa gaagcccaga gccaggcagg aatacctaaa ccaccctacc 7081 tgtggatact caggggacag tcaggatctg aggccaggca ggccttagct tggtgacagc 7141 tttagagaga aggtgaagcg tgtcagatca gcacaggtgg tgcaaaggcc aggaggctag 7201 aaagagcatg gtgtgtgaga agtactgagg attgagtggg gtgtcctttg cagtgaatag 7261 atagtcctga agtgtgggag cagaggaggc ctggggtctc ccagggatga ggatgtgatt 7321 gcccaggtct gtgctgagat ggccccagca gctgtgcctg tgtgaagctc atctgtgggg 7381 gctaaagatg gggatggggt ggcagggagc ctagaggcag ggaagccaat aggcagttgg 7441 tggccctggt gagaggtgac agtggctcaa actaggatcg gggactggag gttggggagg 7501 aaggtatcca gggaaactcg ggggagaggt aactctgagg ctgctggcag ctggtgaggg 7561 gctgggtcca ggaacacacc ccaggctctg gcctcgggag gagtgtgctg agcttgttgc 7621 agagcaaaga cagaagccca gtgaacaaaa gatggcgagg agaccccagt gcagggaggg 7681 taagggtgca gaggccaagt ggagctgtgc tcaaaagaga ggcggtgctg gagggacagg 7741 gagaggtggc ctgggtgttg ggaggtgggg gtgaggtggg ggccgagggc aggggggtca 7801 gggtgaggga taggaaaggc cgcaggggag gagaggatga agagctgtgc tggaggcgct 7861 gtgggcagca tcgtcctgtt cttgggcact ttgtgttttg tgacacatcc tttctatgct 7921 gaactgagaa cccagggtcc tcactgtccc cacacgtgtc tgtccagctc caaaggatga 7981 agccgagacc acccaagcag tgcctaccat gacccctgag ccccccatca agcctcgcct 8041 gggggagctg accgtgacag atgccacccc cgactccctc agcctgtcct ggatggttcc 8101 cgagggccag tttgaccact tcctggtcca gtacaggaat ggggatgggc agcccaaggc 8161 ggtgcgggtg ccggggcacg aggacggggt caccatctca ggcctggagc cagaccataa 8221 atacaagatg aacctgtacg gcttccacgg tggccagcgc gtaggccctg tgtctgccat 8281 tggggtgacg ggtgagtgaa tgatgggagc cccagggtgg gagctgtggg agggccacct 8341 cttgctcttt ggtgatgact ggtggggaat gggacagggg tctggtcagc accacagaac 8401 tgcttgtggc tggggctggg actccccttg ggccttccta tgtggttgac ccctggctcc 8461 ccctgagcag ggaggggcca tcaggagttt tgctgtgctg gtggctgtgc caggtccccc 8521 cacagctgac cctggaactt gttacgtgtg ttagctgtca gctgagcagg accacccagc 8581 cccaagaatg gacttctctg aaatgacctc aggtccccca gtcatagcct tggcttctcc 8641 ctccttttcc ccaggaccca aggacatccc cctcactctc tctccctcct tctccactgc 8701 agaggaagag acccccagcc ccacagaacc cagcactgag gccccggagg cccctgagga 8761 gccgctcctg ggggagttga cagtgacagg atcctcccct gactcgctga gcctctcctg 8821 gaccgtcccc cagggccgct tcgactcctt caccgtgcag tacaaggaca gggacgggca 8881 gccccaggtg gtgcgtgtca ggggcgagga gagcgaggtc accgtggggg gcctggagcc 8941 cgggcgcaaa tacaagatgc atctgtacgg cctccacgag gggcagcgcg tgggcccagt 9001 gtccaccgtg ggcatcacgg gtgagtgggg ggacaggccc tcgtccccag gtttacctct 9061 gcagccccct tgtgtttctc ctttggatct tggcacctct tttgactggg cctctaggtt 9121 tctgtctttt ctcccatgtt gctaatgatc ctgcctcatc cttggattca tggagactgt 9181 gtggagtcag atgggcaggt agtccatgcc ctgtttgttg tagcttcctt ctcccattgt 9241 gatctggaac ctccatgttg ctcgtgtgct catttgttag tcttcaggct cccttaagga 9301 gtattttagt ggctcagaaa gtgcttcaga tccagctgcc cagatctgca tgtgccttcg 9361 aatgtagtga ttgtgcgact ttgcaggcgt ttcctaacct tgtgcttcag attcctgcaa 9421 agttggggtg atgataataa tggcacccac ttcatatgtt gtgcgagggt taaatgcacc 9481 actgtttgtg agctgcttac agcaatgcag ggcacagatt ctaaaacaag cgttttagag 9541 gaggccgcta agaaatgctc actccagtcc tgggagagca ctgcctccct tgcgcagggt 9601 cctgcctcct gacccatggg cctgctctgc tcttttcagc gcccctgccc acaccactgc 9661 cggtggagcc ccgcctgggg gagctggcgg tggcggccgt gacctcggac tcagtgggcc 9721 tctcatggac ggtggcccag ggcccctttg actccttcct ggtacagtac agggacgcgc 9781 aggggcagcc ccaggcagtg cctgtgagcg gagacctccg agcggtcgcc gtctcggggc 9841 tggacccggc ccgcaagtac aagttcctgc tctttggact ccagaatggg aaacgccacg 9901 gcccagtccc tgtggaggcc aggaccggtg agtgagggct ggaggcctcc cgcggccaga 9961 gccttcgccc ccttgtggca cctgttggaa tttcacgttc tgtgccccac actccagtcc 10021 tcagcaccca ctgatttatt gggtccagga aaacccaggc cctgagctct cctctcccac 10081 tcacactcat ccctctccct actttccctg caccccaggg gacacttgct ttcttgtctg 10141 gctcctcttt tattcctact cctggcccag tcaccctccc gttcctggga ccggctcaca 10201 gagccatagc agcccaggaa gctccgtggc ccctttgcct ccatcccact ctccatgcac 10261 ctcactgtct tttccagccc cagacaccaa accgtctccc cgcctggggg agctgactgt 10321 gacagatgcg acccctgact ccgtgggcct ctcgtggacg gtccctgagg gcgaattcga 10381 ctccttcgtg gtccagtaca aggataagga tggtcggctc caggtggtgc cggtggcagc 10441 caaccagcgg gaggtcacag tccagggcct ggagcccagt aggaaataca ggttcctgct 10501 ctatggtctg tcaggcagga aacgactggg ccccatctct gctgacagca ccacaggtga 10561 gtcccagtcc agcctccacc ttttccagag ctgcctctca tccgagccct cagagctggc 10621 cctgcagcct tcccgtgaga ttccctctat cagccctgac acaaccatct taactccgaa 10681 agtgagtccc tctgtgtgtc aggcacagtt ctgaagcata tgcattactt cctttcattc 10741 gctgactgct gaggaaataa aagggacacg ggctttgggg ccagcagatc tgtgttccag 10801 ttccagtcct agcctttgcc agggtgacca tggacaagcc gcttcatccc cgggacctca 10861 gtgtcctcac ctgcataatg ctgggaatgg cgctggactc agtagtggtt cccatcctcc 10921 tccctcctcc ctgactcgga gccgaggggt ggaagaaaag ggagggcttg gctatggaaa 10981 gacgatggaa aaacctggtg tcactatgcc tatgactcag ctccctgagg aaggggtgtg 11041 gtgggtcacc gaccctcacc ccactcccag gaccatgagt cactatgccc agaggaggac 11101 agagcagccg gccagtctgg gcctgggtcc ctgggactcc gtaattctct tcccaaaccc 11161 tcactgtggg gacggggtac agatccccac ccatcggccc cagccccacc cgggcaagcc 11221 tctgctctgc cctctgttct cccagctaat cccctgtctt ctccctgccc caccctggct 11281 gccccggccc agtggggtca gtgtggggct ggggaagcag gaggcacgga tgttcaggct 11341 gatggccaca aggggacaag ggggagatca cagcctggca gtgatgggag ccgtgcattg 11401 gccggctgcc cagaccagct ctcgatgttt ggggtgtctc cgtgacaacc cagagacctc 11461 cggggagtct ctctcagctc tggcctcatt cttgctcagg ggctgggggt cagggtaaac 11521 aaaggccctc tctgcacccc cagccaccca tccctcggga gatgatctgt aatgaatttg 11581 gcgcatcctc gatcacagca gggaaggggc ggggcaggaa ggagtttggg cagcagcctc 11641 cagaggagga gggggctgtt ctccctcatt cctgtggggc atggcgggag caggcctgtg 11701 tgtctcctca aggagctgtc cctggggcta cactggaggg accatttccc agaacctcac 11761 acctccggga ggctgccagg gcttaggcaa aggcagcatg tgactaagag ctttccctcc 11821 tccctctgca cagctcccct ggagaaggag ctacctcccc acctggggga actgaccgtg 11881 gctgaggaga cctccagctc tctgcgcctg tcctggacgg tagcccaggg cccctttgac 11941 tccttcgtgg tccagtacag ggacacggac gggcagccca gggcagtgcc tgtggccgca 12001 gaccagcgca cagtcaccgt agaggacctg gagcctggca agaaatacaa gtttctgctc 12061 tacgggctcc ttgggggaaa gcgcctgggc ccggtctctg ccctgggaat gacaggtgag 12121 gctgctgtgc ctggctatag caagccagct tgtgtgggtt tccttgtgca tttgggctga 12181 agacaaagat gactgcagga gtgggcaggc cggagtgggg cgccctggcc tgtccccagg 12241 aaggaggagg agtctgcagc cctgtgggct tcaacatcca tcaaggagtc cagagcagga 12301 gccaggccag gcgggaggga aaggccctgg gaggggctct ctaatctccc agccccgact 12361 ctgccccgtc actgccactg ctcctcatta ctcgctgggg ctgctgtcgc ctccccgaag 12421 ggtggccttg tccagatagc ggcaaacctc cctgccgtgg atgagtcagg agcattttct 12481 taagaggaac atcactggaa aacaaaatga gcggggacac agaaaccaac agcagtggct 12541 gcatttgtgg tacaggctcc tcttccagag ctcgctgatg cccacctcag acaggcctga 12601 ccacggcacg gctggtggga tttgccagtc acctcaacca gccagttcca ccctcagctt 12661 ctctcagaag ggagcaccac actcctcaag ctcagtgaat gtatcccggc atgggtgggg 12721 ccagagcctg tgatatctcg aggtgggctc ggcaggacac cggggtgtgg aagggggaag 12781 cgagcacctg actcagacag cgcgggagct cgcaggagtc acgaggccac agcgacttca 12841 ttgtctgact gggcctggac ctataaactt cccacctcag ccttgggcca agcctggaag 12901 ataaaaatgg agcaccccat ggcgcccctc actcagattc tcccctgggc ttctcccacg 12961 cagccccaga agaggacaca ccagccccag agttagcccc agaggcccct gagcctcctg 13021 aagagccccg cctaggagtg ctgaccgtga ccgacacaac cccagactcc atgcgcctct 13081 cgtggagcgt ggcccagggc ccctttgatt ccttcgtggt ccagtatgag gacacgaacg 13141 ggcagcccca ggccttgctc gtggacggcg accagagcaa gatcctcatc tcaggcctgg 13201 agcccagcac cccctacagg ttcctcctct atggcctcca tgaagggaag cgcctggggc 13261 ccctctcagc tgagggcacc acaggtacca ccaggcgtct ccggcctcta gcctaggact 13321 cagaagggag aaacgggggc tcagaagggg tggtcgcagg gaaagagcgt gaggcgggta 13381 ccagggagag aggatggatg ggctggatgc gagtggcctt tagctctgcc ccacaggacc 13441 cccctgtggc tgcaagtccc tggttacaga tagagaaaca ggggcaggga ggggggtgga 13501 agggacgtgc tctgggtcac caagctggtg tgcttctgtc tccaatccct tctcccccac 13561 ccactccgtg cagggctggc tcctgctggt cagacctcag aggagtcaag gccccgcctg 13621 tcccagctgt ctgtgactga cgtgaccacc agttcactga ggctcaactg ggaggcccca 13681 ccgggggcct tcgactcctt cctgctccgc tttggggttc catcaccaag cactctggag 13741 ccgcatccgc gtccactgct gcagcgcgag ctgatggtgc cggggacacg gcactcggcc 13801 gtgctccggg acctgcgttc cgggactctg tacagcctga cactgtatgg gctgcgagga 13861 ccccacaagg ccgacagcat ccagggaacc gcccgcaccc tcagcccagg taaggaccca 13921 cacacactct gccccaaagt gggggtcttt gtacttcacg ggggggacct agtgcctcag 13981 ccagcggtgg gggtgggcga gttggtggtg ggcctggagg aatctgcaga gcgacttcca 14041 ttcctgggga ctagaggaaa aggggtggtg agcctgtgct ggagcagagg cgaggggggg 14101 actcgcaggg agaagcctcc ctgcccctgc ctgcgtcatt gttccttgac ccctctgcag 14161 ttctggagag cccccgtgac ctccaattca gtgaaatcag ggagacctca gccaaggtca 14221 actggatgcc cccaccatcc cgggcggaca gcttcaaagt ctcctaccag ctggcggacg 14281 gaggtggtgc ctttgccatg tgctcatcgc ctcgcatttc ctctcccccc tgcactctgc 14341 ccaccctcca gccgccctgg ggttccctgg gtaaccctcg atccccaatg ttttcagggg 14401 agcctcagag tgtgcaggtg gatggccagg cccggaccca gaaactccag gggctgatcc 14461 caggcgctcg ctatgaggtg accgtggtct cggtccgagg ctttgaggag agtgagcctc 14521 tcacaggctt cctcaccacg ggtgagatgg actgggaccc ggggcaagag gtgggagcca 14581 agaaaacggc atgggtggga gttgagagag aacgaggagg gtgaaaggga ggtggtggag 14641 gctccgattg cggacgggag gccagtggag tctggggagg cacggagtag agagagccgc 14701 ggggaccctt ctgagcccct ccccttcccc cagttcctga cggtcccaca cagttgcgtg 14761 cactgaactt gaccgaggga ttcgccgtgc tgcactggaa gcccccccag aatcctgtgg 14821 acacctatga cgtccaggtc acagcccctg ggggtgagca ggcctgaggc ctctggaggg 14881 gacttgttca gggtggggat tgcagggggg aggctggact ctggccgagg atggaggggg 14941 caggccttga tgcccctctc tacactccca gccccgcctc tgcaggcgga gaccccaggc 15001 agcgcggtgg actaccccct gcatgacctt gtcctccaca ccaactacac cgccacagtg 15061 cgtggcctgc ggggccccaa cctcacttcc ccagccagca tcaccttcac cacaggtagg 15121 gtctgtgggg tgtgtgggac agggagagga ggtagaggga gccaggttgg gcctcatccc 15181 catctcctct tcctgctttc cctcctaggg ctagaggccc ctcgggactt ggaggccaag 15241 gaagtgaccc cccgcaccgc cctgctcact tggactgagc ccccagtccg gcccgcaggc 15301 tacctgctca gcttccacac ccctggtgga cagaaccagg tgccccggcc ccactgaccc 15361 aactcccctc cctgggtgat tccaggaggt gctgcctctg gccctcccgg agggtctcca 15421 cctccctctc ccctgacccc cccttgtctg tcccacagga gatcctgctc ccaggaggga 15481 tcacatctca ccagctcctt ggcctctttc cctccacctc ctacaatgca cggctccagg 15541 ccatgtgggg ccagagcctc ctgccgcccg tgtccacctc tttcaccacg ggtacctgga 15601 cgcacgggcc cggggccggg ggctgggtgg gcagccaggg cctaaggctt ggaaaaggac 15661 tggcccctgc tctcctctcc caggtgggct gcggatcccc ttccccaggg actgcgggga 15721 ggagatgcag aacggagccg gtgcctccag gaccagcacc atcttcctca acggcaaccg 15781 cgagcggccc ctgaacgtgt tttgcgacat ggagactgat gggggcggct ggctggtggg 15841 tggcattggg aagcccaggg gtctgtgcag ggcagggtct gttgccccgg gagccagagg 15901 ctgatggtgc ccccacttgc ttcccaggtg ttccagcgcc gcatggatgg acagacagac 15961 ttctggaggg actgggagga ctatgcccat ggttttggga acatctctgg agagttctgg 16021 ctgggtcagt gcctcacagg gactggggaa ctacggatgg ggatgggggc cctgtggaca 16081 ccaggaccct gatgagggca cgtatcccac ccccaggcaa tgaggccctg cacagcctga 16141 cacaggcagg tgactactcc atgcgcgtgg acctgcgggc tggggacgag gctgtgttcg 16201 cccagtacga ctccttccac gtagactcgg ctgcggagta ctaccgcctc cacttggagg 16261 gctaccacgg caccgcaggt aagcagaggc tgtgaggctg ggagggtgag gctgggaggg 16321 gaggccctca tggctccttc ctccaccctg cccaggggac tccatgagct accacagcgg 16381 cagtgtcttc tctgcccgtg atcgggaccc caacagcttg ctcatctcct gcgctgtctc 16441 ctaccgaggg gcctggtggt acaggaactg ccactacgcc aacctcaacg ggctctacgg 16501 gagcacagtg gaccatcagg tgaggggtgg ggaggcggct cagagctggg gtggctgggg 16561 ctcggcctgc ctaggtttca gccccacagt gtaacaggca agggactgag tggctgggtg 16621 aaatggaaca atcatgccag cctcgcagag ggagctggag ttgatttatt ggctggaaag 16681 ggccagctca gaattaagcc tcaatcctct gcagcggagg gtcaggaagg gagctctgcg 16741 gggaggttgg ttgagtgctg ggagctacct ccttaagggg aatgggagga gcagatggga 16801 catccggctt tgactctctc ttgacaaccc ctttcccagg gagtgagctg gtaccactgg 16861 aagggcttcg agttctcggt gcccttcacg gaaatgaagc tgagaccaag aaactttcgc 16921 tccccagcgg ggggaggctg agctgctgcc cacctctctc gcaccccagt atgactgccg 16981 agcactgagg ggtcgccccg agagaagagc cagggtcctt caccacccag ccgctggagg 17041 aagccttctc tgccagcgat ctcgcagcac tgtgtttaca ggggggaggg gaggggttcg 17101 tacaggagca ataaaggaga aactgaggta cccggctggc atcggtcctg ccccatcact 17161 ggctctggcc cgggctgtgg gcccccatcc cccggggctg cagccgcact tggaaaggct 17221 gcatcttgag gatgacactg cagtggggca ggggctgcag ggagggcagg gcgtccccgg 17281 agggcagcag cgtgaaggcc tgcagcagtc gggtcagcac cacgaagagc tccaggcgcg 17341 ccagcggctc gcccaggcac acgcgggcac cgcagccgaa ggccagagct ctggagttct 17401 tgcctggctc caggaagcga tctgcgggcg ggtggacagg tgggtgggga ggcgttcagc 17461 ggcagcgggg accagcctcc accacatttt cacggcaggc ccccggcccc ccacatacca 17521 ggccagaact catgtggcct ctcccagacc gtctcatcca ggtgggcgcc ttggaggttc 17581 ggaatgatga ctgtgccctc agggatgtcg tagccggaga tgctgaaggg ggctggagtt 17641 agaggctggc caggacctcc ctgggctcgg gctttcctca ctcatcccca accctcggga 17701 gtcacctgct gggccgtgtg gtgcggtggg gcaaggctaa gggcacaacg ggccgcaggc 17761 gcagcacctc ggcgatggtg gcattgagca agggcagccg tgcacggtcc ttgtagggga 17821 cccgggagct ggaggcacca gggcccagtt cgtggtctag ctcctcctgc agtcgctgct 17881 gaatctgggg aatgatcggg tggagtcctg ccccagcagc ccacagctgc ccagcctcca 17941 gccgctccct cagcaaccca gtgagcctga gtgccggtga ggcaagcaca gccccagccg 18001 cacagtgctc agagctgagt gagggtgccc accgccctgg ccaggttgct gggaaggagc 18061 cttttgcttg tccccaggac gcacctcagg gtggtgaagc aaaaaaacca cggcccagga 18121 gagggtgttt gctgtggtct cagtgccacc gatcaggagg tccactgcag ccatgtgcac 18181 gtgcccttcc aggagctgtc cagagccctc ttccatgctc ggctgcgcca ccccttggag 18241 catgtagtcc atcatgtccc tccactggcc tgccacgagg ctctcctgca gagggtgaaa 18301 ggagcgggct gagcggctgg cctggggaga ggagtacaga gtggcaacag gcccataact 18361 ggggtatgca aaagaacccg cctcatagca atgctgaggc cggtagcatc actggctgtg 18421 ggccgagggg aggccgtcca cgtacagtcc ccaccttgtg ctgcctcagc tgcatctcca 18481 cgatgtgatc cctcttctct atggcctgct tcagcctccg gagacctgga ttggggaaga 18541 actgcggcag gaagcatgag aatgcagctg tgggaaggag cctctccctc caccccagcc 18601 tctcccctac aacccagggg tgtctaggct ccaggtcctc accctgagaa agggaatcac 18661 gtccacaatt tggatggacc agtggctcca ggtttttaac acctcctgga tacatttgta 18721 ataggcaggc attaagttgt cgtcctgcca gaaaaggagg gagtactttc agttcaggac 18781 aaggagaggc tcagggaggg gctgggggtg ggcctgaggg gctgtgaggc accttgatct 18841 tgtctccgaa ggtgaggtaa cagatgatgc tgcaggtgag gagagagaat tcctcctcaa 18901 tggccacagg ggtgccgggc tgggctctca tgcgctgtgg agaaacagtg tgagttcagc 18961 aggccgctgt gcagcgggca gggcgggggc tactgtgaga ggcgaggctg acccgaggtg 19021 gcctcaggag cccagcctta cctcacagaa ctcctgggtc agctgctcca ccactggctc 19081 catggagtca cggatgccca gcagcagggc tgagcgggtg agcttcttgt gggctttcca 19141 gagcagggag tagtctccca aggacaggtc cgggtagttc ctagacacca gcttgtctgc 19201 aggaggagtt gggggctgga gggtgggaac tgatgaaggc agctgagggc ctgaccttct 19261 tcggcctccc caacccctgc tttctcccca ccagatatgc ccccccaaga gcttccaggg 19321 acctggattg gggatgcccc aaaggtggct cacacttgag gctgaggtgg gaggatcatt 19381 tgagactagg aatttaagac cagcctgggc agcatagcaa gaacccatct cttaaaaaaa 19441 aattttttta agaaagaaaa aatgccccca gcccttacag gtaagtggct caggtctgcc 19501 agcaaagtct gcccactttt tgaccatggc ttcctcaatg gtcctcttgg agttcagcac 19561 caccacatct gggagacagc caaagcagcg tcagcggaga gaggaccctc tccgtcacct 19621 ccgccccctc ctatggtgag ggccagagcg agatcagcct ctcaccttgc agcccaaggt 19681 ggagcctgta gatgggcccg aatttctgag tcaggccaag cagatagatt gggaggtcgg 19741 gctgcagcaa gtgcaagaag cccggggcaa gaggcgggag gtggaggctc cggagcttcc 19801 accagttcca cagcaggcgg gcgccagcca gcaggggcag cagcagcagc aggcccagga 19861 gcagcatggc gagacgcccg tcagggccct gaggtgccac ttatagctca agagccccag 19921 ccatccctcc tgctgtgtag actgttttgg ggcctccctt gaccccacct tcaggtaccc 19981 tcccaccgac ccgcccacag agtggccctt ttctggaatg acaccagtct cattggcctt 20041 gggacgtccg tatttcaaaa aaattgatca cccatcaaga agcaaggaag ggaaatgcaa 20101 ccctgacctt tttcctgcat ccagagtcag ctttctggtt ccacatcaac tgtgcaggca 20161 atagtgtccg cccccacctg ggccctctcc ctctgccacc ccaccgtggg ccagtttact 20221 ggcatgatgt ttgtctcaga gagaaaatgg cagtccctgc ctaataggcc ttcctgaatc 20281 gcctgaagaa caaaatgtga tttgtgtggg ttgtttgtta gtttaagcta cagatagagt 20341 aatacagttt aagcctggtt tgcctggggc attccagtat atacaggtat cccagtgtga 20401 ttgttattag tgcctccctt cactcacaaa agtgcccaga agacaaagga ggaagtggtt 20461 tcccgctgcg gagtaatagc atgtaccaga cactgctcta agtgctttgt gggcactcac 20521 tcaactcttc acagccatgc tctatatgca acacgggtat tgttcgcatg ttgtggaggg 20581 caggattcca gcctggaaat ctagctgtcg aatatgtacc tttagccacc aggagcacct 20641 ttctttggtt gaaaggaaaa cagcccacct tgagctggtt tgagataaaa aataagaatt 20701 ggggaatttc acagaactca ggaaaaagaa gcagaggagg ggaccccact ggtcttagaa 20761 agtcaaggat agggaacaga aaaagccccg gagacgggcg gtttcctcgc tctgttcctc 20821 tctcctcttt cttctctctg cagtcaagtt tcttgtgttt ccaggcacag gcagaacatg 20881 actgccctca ggactacagt ccacaggttc tactccagcc agggaaagta tggattccca 20941 gcacaactgg gcatgccccc ctcaaagtcc aaagtcccca ggaaggggct ggatagccac 21001 agctggatca ggatcaaggg tccagcccta tttggggaac tgaagccata aggggtgggg 21061 gtcccactgg actgacactc agagaccact tttgtgggta aggaggcagt ttggggattt 21121 gggaagaggc catgccaaag aggagtggca gggttcagtg agaggccagg gcctcctgga 21181 tggtacctga gaggtcagag gcgacccagc cccatacctt cccccccatt tctggctctg 21241 tgccagaccc tcggaccagt gggggtctct ctcccctcag agcttttcca gactgtgcta 21301 ctgcagagcc agagtcaaga ccctggctgt gccacccact acattggcga gttcgctgct 21361 gctgaatcca tgatcaggac tgagagggag aaaacaggat tgggatttct ggggttgggc 21421 aatgacagct actgccttct tcatggttga cctcccagac cgggccatgt cttgctgcaa 21481 ctgaatccct ggctccggaa aatgggctta tgccaccacc atgctgagca gctactggga 21541 gaaagatctg atgagcaacc atagggccat gtatgcaaga gcctcaggaa acctctgcca 21601 cagtgggaaa ggcccaaact tctcaaagtt acgtgggata agacagtgct catcggaaag 21661 gagcgcacct ggaaggctga cgtgcttact acttgtctca ccatctccat ctcttccacc 21721 acttccaaat ccaaactccg aatgtggtca attacatgca atgtccccaa aggtaccctg 21781 tcctgccccc ggcctctggg cctttgtata ttctccttct gccccagatg ccctcatcta 21841 cccctgctct ttgcctggct ccttctcagt cttcagagcc ccacttggag gccacctgct 21901 ccaggaagcc ttccctgtcc ctgcctggca atgcccctga cttggttaga tgccccacag 21961 gaccacactg ctccctgaca tcacatcttt cacagagggt ggggactgcc cgtcctctta 22021 tctgtctccc cagcgatgag ccctggagtg agcagctgta tctctggcac ctggcacagg 22081 gcatggcatg aagtgaaaga ttcccgaatt tctgtcgaat gactggatgg caggctgctg 22141 aactgtcatg acctctacgg aaactgatct caccaaactt ctttggacag acgcggagtc 22201 atctctgaag accccaggtg tgccactaaa tgggggaaag ttaggcaggt ccggtgggaa 22261 ggggagaccc caggagaaca gcggcttcct cagaggattc acaaacacac caaagtcaga 22321 cttttggctt gtttgaaacc agtaactgga agaagacact gccggacctg aggattgcac 22381 aactccggga aagtcacctc attccactga taacagaacc aagcgtcagc aggctttcaa 22441 cagcccggaa ctcagctaaa atctgcagac ctcagaaccg ccagcagcac gaggacagag 22501 tcggggaggt gtagctggac aacagctcat gaggcagaag agctgtgtca caggtgtcag 22561 ctgagcacaa gccctgtctg aggccgtggt gactggcgac agccgggcag tggagggcct 22621 tggaagctga agggtggtct tggcatggac tctggtcctt ggggtgcagg ctcttgggtc 22681 ccagttctgc tcggggtggt cctgtgaagc actagactcc tagcgggtca tctgggaggt 22741 tctagcagag ctgaacagcc caagggtcat cagggctcag agagtctaag gttatgtcaa 22801 cctggcctgc ccagaactgc atcgggcagg ggcactgctc tcagccctag caacacacac 22861 tgacacttcg ctgccaacac tgacactttg ctgccaaaag cctttaatat gccctggtcc 22921 caggctgtgt tcatgaaagc ggacacagca gtgcttccag cttcatggtt cccaggttca 22981 ggttcctccc agcggaggtg ggagggcagc cctcacacct ggcacccctg agtgccatac 23041 tcctggagga agtcgttgag ctgggcacag gctgcccgct ggcgggtgct ccggcacagg 23101 cgttcagagg gcatctcctc gatccagcta ttcgagtcca gcaggtactg ggggctgcag 23161 ggggcaaagg ggcagtcagc agggctcggg aggatggcag gtggaaacgg agagcacagg 23221 catctggctt ctgaggggca aggcctaggt ggcgaggcat ggggaggaca agagactgag 23281 gggaccagat gactcactgt ccctcgaggt cataggtggc cccatccaga cccatgatca 23341 aatattcttt cccaggttcc aagcgaaggc ggcaggaggc tcgaaccagg aagttgcgca 23401 tctgattagc agcggccttg acatccttgg ctgcggggat gacgtgcgca aaagtggtca 23461 gaggggaaga gaaggtgcag ggtgagccca ggctggggac tctgtgtaga tcctctcatt 23521 ccaccttcgc cacccccatg gagaggtgcc actgcctccc tatttatggc caagcccaag 23581 gcttctgaca gcccaagggg attttcacac ttccagatgg tcaggtcctc ggccacacct 23641 cagcctccct gtctcccccc agccctgccc gcctctccgg tttgcttcat actgaagtgc 23701 aggacttggg tgatcttggt ctcaaagagg cggaaagcag ctctgctgtc ttctcggaga 23761 accttaacct ggaagcctaa gagggggtga ggagaagggg gaaaggtgag ttactttgag 23821 gctgagaggg taggaagttg gtgtcagagc aaacaggctg cgtgcatgac ctgtaagagg 23881 agcaggctac acccagagag accaaaacag ccggtccccg agggagggtc aggccagggc 23941 ctcggtggga agactgaccg tactccacac gggggtagta gcaggcaaac ttcatcctgt 24001 agccatcctc gtcctgcaga ccccgctcca gggcgcgacg ctggcgaggg cacttccctg 24061 aagttgggga acccatcaga cagtgtgggg ggggccccgg ccatcccgcc tccactgccc 24121 cgccccaggc cctcagtctc accctcagca cactggcaga cttcagcaga acacaaggtg 24181 gccaagagtc tgctcttact tggtgccccg taaaacacag aacatctgcg ctctggagaa 24241 cagagaggag ttagggcaca ggcccctcca ttctgcctcc tcagtcccag ggagccccag 24301 ggctctctgc cccctcacta ccccggtgtc cattgtccca taggagggca cctatgccaa 24361 agttctcctg aatttcaggg tgtcctgcag tgctcaccgg ggttgtagta gtcgtacagg 24421 gttgcgctgg ccggctgcac cagccccacc ggcacttcct gcacagcctc aaagcccacg 24481 cactcccggg aggtggggac ctggccaagc gtggggagga gagatgaggg acccactccc 24541 tgggccctgc agccccctgt actgggtttc cttggcctgt ttttgtttgc ttcctattgg 24601 ccttctctcc agtgtccttc acattctgtt accttcctac tcagagaact ctcaaagctg 24661 ctccgcaagg tctctggtga cttcacttcc cagagggtga ccttgcccca gtcttcactg 24721 ctccaggccc tcagcagagt cttgcatcat ggacgtgttc tctgtgaaac tgtccctaag 24781 ctaagggtta gcttctggac cacccttggt tctgacctgg tcatttctta cgttccctcc 24841 ttcggaactt ccttcctcag agctcccctg tgggggtctc aaccactccc tggcttccac 24901 caaaccaatc caggctgatg attcccaaac tgaacttgca gctccatcct tgcattagga 24961 ttgtggcagg acctgtaagt tctccaaggc actctgcctg cccccaaacc cactcgccct 25021 cctcgcggcc tcatctttgt catggatacc actgggtcct cctttatttg ccacaaccta 25081 actgcaggtt ctgtcatcct gcctgacccc ccgactcagg tcccaggcct gacccctcct 25141 cgtgcccacg cgggcccagt ccacacggtg ccgtgccagt gtccctcctg agctaggctg 25201 ctgcaccgtc agctccctat ccgggaatct tgttggctct gtgttttcta ttgtgttcaa 25261 cccagatgtg tcagccaggc ttccccagct gatgggggct ggcccctctg cacacactgg 25321 gtaggggtct ccctgaccta caaacagctg gctaatgaca gccaccacac ctttctcaca 25381 ttttctccca gaggttacag taaatgttcc aaaacttttt ttgaaggccg ggcatggtgg 25441 ctctggcctg taaatccggt actttgaaag gcctaggcca gaggatcgct tgaggccggg 25501 agttcaagac cagcctgggc aacagagcga gaccctgtct ttactaaata aataaataaa 25561 aatgttttga gagccgtaag agggttgata actattttag ccaaataagg tcgtgaaaag 25621 acaaatacaa ctatcatcta tggccaccat cactttgtaa aggaaaatac gctttaatat 25681 taaaaaaaag ataggaagtt caacaaaaaa acaaaggcag cccctcaagc tgaataaaac 25741 cagcattagg aaagactccc tctcaaaccc tgagggtccc tgctgactgg catcggtccc 25801 tgccgcctgc aggtctgccc cttactctgc tgtgtctcac gggctcacat tcttgtcctt 25861 ctctctgatg acatcacaag ttctcaagag caccagccgc tgacacagct gtctcctgag 25921 aaacctccta acgcatactc agtaaacccg gtgccatcga gtcccttcct gcctcatctc 25981 tccccactca ccgagtcaaa atacagcagg acgtggggcc cctcggtctc aaagtgactc 26041 acgtaacggt cagagaggga ggtcagctgt ggaaaagggg agttggtcac aggccctgca 26101 catgacaggg ctcagtacct gggacagagg gggttgccct gggtggctga ccacaccttc 26161 tccaggtcag cacgcagggc gtggaatcca ctcaggaggg tgacgtccgc gatggccatg 26221 ccagacagcc ccaccttgcc gttccgcctg tggagacgtg tgagctgtcg tccaggttct 26281 gcctgcgcgg gctccagaag cccagcccca gcctgggtcc tgccctccct cccctggccc 26341 agggcagctc ccggcgccca ccagatgcac acggtgtagt gcaccctgga ctcctgctcc 26401 tccaccacct tgggcgcctc cctcctgcgg cggttcctcc gaccctcaaa cagctgcagg 26461 ggtgtcacgg gctgcagagg ggcatctggg tcatccttgg ctggaagctc atcgtactca 26521 tagtcctcat agtcctcgtt tgcttccact gcacagggaa gcattgtgag gagggctggg 26581 atggccaccc ggctccctgc gccagcccct gcctggcccc aaggcctccc aacccccaca 26641 ctcactcgtg tactcgacgt ggcctttgac tgtcacttct atctgtaggt cctggcaggt 26701 cgtgttcttc atgtccagga cattgtaggt acgaaggacc tggctcagca aggggcaggg 26761 agaggacgtg ggacatgtga ggccacagac ctctctggtg ccatacctaa gggcatctcc 26821 ctgggctacg ggatttgagg ccttcagccc cttttcctcc ccgtcactgc acagtccaca 26881 cacacgagtg gcaccacatt cgccacaggc gtcaggcgag acagagaggt aaagaagcag 26941 gggcctgccc ccagaggcca acactccaga ctgtgctggg aagaggctgg actgaaaagc 27001 tcaggaaagg cttctggcca aggcttgaag gagaagaaag agcttgtcct gtgggccaag 27061 aagcagggaa ggacaggaaa ggcaagggga tctgtgtaga caaggagagc tgtctggccg 27121 gaggggtggg cagaggccca gccatctgag ccgtgggcag cggggagtga cttttcccca 27181 aggctgatct ggctggggct tttggaaggt cagtgtcact gccgtgtgga cagtggatca 27241 gacaggggag acaggaggta gagagaagtt ccaggtaggg agaaggccgt ctcagcacag 27301 aatctaggat gcagcaccaa gggccacagg cagcagggga gcagaggagg agtcactagg 27361 gacataagga caggcttggg agaggggcca ctaacacggc ctctagagta gtggctgggc 27421 tcctggctgg gaggggctcc caccagccag acaagagaca cagaatgaag aagggctttg 27481 ggacatgcct cgtctgaggt gctgtgggag tgggacgagt ggggcccatg gttcagggaa 27541 gaggcctggg ctgaggagtg ggattcggag ccaccagcac tgaggcgggt aaaggaaaca 27601 acaggtaccc agagatgggg agagcaagcg aaagcaagca gggggaaaaa cacccaggac 27661 ggaagcacgg gacaccaacc ctgaggtgtc tgcctccttc cccggaccat gtgcctctgc 27721 ccacaggcct ctcactccac accctctcct ccaccagtgc ctggccccac cccttccctg 27781 gccctcacct tcagggttcc tttgctgttt cctcccacct tcacattgat cttgctgccc 27841 aaggaaaact tcagcaacag aaaagggaag gcgatgttag aagggcacaa gcaaacctgt 27901 ttttgcggct ccagtaactt atctccttcc tcaaaacagc cccacctcct ctccccacct 27961 cacatttcca caaagtttat accagcacat gaattatctc acttcagcct tctaacgacc 28021 ctgtagggtg aactctgtgt atgttactct acaagtgaga aaactgaagc tcagagctcc 28081 aggtcacgca gccaggaagt cagtcttccc ccgaagaccc caggaagaag tgctgcgggg 28141 attccacggg tgggaggttg gggacggctt ctggcctggc cacgaggccc aggtgtcctg 28201 gctacccagg cgagggagtg gttcaccagg gagtggttca cctgcagctc ctcctccagg 28261 ccgcgaatct ggcggttgtt cagctgcagc gcgtgggact tgaacccatt ccggcctgtg 28321 gagctgagag tcacattgag acccctctcc tcagtggtgt gggaggcaat ccagtaggca 28381 gacagggcat ccagggcaat caccgtgtcc tagggaggtt gagccaggac tcaagcaagc 28441 ccttggtctg aggactaccc acccccgcca gagcccgggg acggccccta cttgggtact 28501 gcggaatccc ccttggaagc tgccctgacg ggtgagccag gccgcagcct ggtctgccat 28561 ctctgctttg ccctcgtgaa gcaggaggtg cagcagggcg taggctgtgg tttcaatcca 28621 cagggctggg gcctggggca tggggtcgga tgggttgcga ggagccgggg tgggcgacac 28681 ggcattgctc tgagaaccag tgactgagcc ccagtacagg ttatctgaaa gtgaagggag 28741 accacgagta aacaggaagg cagggagaag agcctggtcc cctggccctt agccaccccc 28801 tgctgggctc tcctgagtct cccccaaccc acccctgctt ataacttcat tcctcctctg 28861 agtcttcatc cagcctctcc ctctgggcac actcagggat cctaaggtcc cctgggcctc 28921 aggctcactg ccaggagcgc ctcacccctc acctccagtc tcctgggcca ttgccatgag 28981 gttgttgtgg gcaacacccc gcaggtccgc aggggccttg gtcagtgtca gggcataggc 29041 cgtgatggca gctgcgtggg cacccaggag cccagcactt gctttctccc ccaaaaatga 29101 gcttgccttt gagatggagg cttcctggaa gaaaacggga ggagggtctt gggcctggac 29161 ccctgggttc ctgaggaaaa agggagagag ctgggggcca gcagagggca gaaacgccac 29221 tgaacttacc actctctgct tcaatggctc tgcaccctca tcctggaaga cggccagccc 29281 atgatgaagg gcgatggtca caaaggctgt gagtgccaca gtctcatcat tgcccaccaa 29341 acccccctag taaggggaga aaagatgtca aacaggaggg ggaaggggca aagagagtcc 29401 tccgacaggc gcttctcggg ccagccccag catgcccgca cctgcatgct cctatgtatc 29461 actggagaga ggtcctggaa cgagccgtca gcctgctgct gggacagaag ccagttagat 29521 gtctcctgca gtttctcagg cgatcctcct acctgctcct gggccaaact caggaccttc 29581 aacacaaagg ctgtgagcct ggagggcagg aaaggagggt tggggataag gacttgcttc 29641 tcattatgcc cacgccccca ggtgcctggt tatccaccct cccatcatac cagctctgtc 29701 ccacccctgc cccagccctg accccctcag aaccctggaa ccactctccc aagctcacca 29761 ggtgctgctg ccccgtgaca accaagccgc ataggaacca tccgccttcc gaaactgctg 29821 gatccgcatg tagccttgga gaggagaggt ggccgctcag gtgacactca cccttctgct 29881 gtttgagccc aaggccatgc tccccatcct catcaggggc cccctttctt tccctctgta 29941 gcctcctggg ctgtccatct tccagtaact gtcctttcct ggcccccctc ctgcttgccc 30001 ttgcacccag aacctttctg gatcagatcc acggcgtggt ccttggtctc gggaggcagt 30061 gtgctccact gctctgtctt gtccaggtag cgggaagcag ccagtgtcgg agccaagtag 30121 atcatggttt gctccccaca gcctcgagga agcctcaaga gggaggccac gcctcctggt 30181 gacaaggccc cctcagagcc taaagtgtcc aatggatctg aggctacatg ggagggagag 30241 ggtggaggtc tgaggactct gtgtcagagg ctcacgggga gtgggaccaa gcagggatcc 30301 acaggtccca catgaatccg aaggtggcca ctgggaaggg actaaagggc actcccacct 30361 gtaaccctga cgtagctgtt aaagtcccca tcagggatca tattgggatc agagttgcca 30421 ggtatttcca aggtccggcc tcggtggtct ggggaaatgg gggaagttgg cagcctgtcc 30481 tgctgtccac atcccccacc acctgtaccc acttaggaaa ccaatggctg gaggtagagg 30541 gtcactcacc caaggggttg agttcataga ccagctcctc tctatggatg gccccttcct 30601 tctgtctcag ggaaaacatg gttgtgaggt cacacaggac tacagcctgc cctgctagcg 30661 cagactcccc cagaactttg agtttcaact ccagggacag agttggatca gaattgtaga 30721 agatgggaac agaccaaggc attggttcgg gtgttgggcc cctggcctgg ccagcagaga 30781 gagtgctgag ggtggaggac aaagctaggg gcccggggac ttatattcag gggtgctcca 30841 ttcacctcaa tctgcagaac cttggacacc gcatctccca cagggaattc gaaggaccct 30901 cgagccacca ccttcagaga cacagcggcg gctgccgtgg gcaccacaga gaaggcaaca 30961 ggccgggcag agcccgcagg caccagcacc tgctgggcca gccctccgcc cccagccagg 31021 cacagcccct ccactgggga cacgtggacg ctcacctgag ggcaggaaaa cgaggatggc 31081 cagagtcctg gcccggttag cctccccacc cctcactggg ccctggctcc cccaactcct 31141 gtatgctcag gctcccatgg ggcctcacag tcaggttttt atccaggtag ttatagagga 31201 caggccgcag ctccagctgc tcaaagcggc ggacagacat gggcaggcgg aggtgcaggt 31261 ggaactcgcg gaacacccgg agctggactg gggtggccac acataggcct gagggaaagg 31321 aagcgtgggc acaggggcag agatcagagg ggccatcaaa gcttcagggc cacaggaggg 31381 aggggtgggg gtgacccagc tctgtctcag tccgcctctg ccctctggcc cacgccagct 31441 gcacgctggc acagacctcc ccagatcact ggaccctctt cctacactaa gagcaaaggg 31501 aacagggagc tggggtacag ggaaatggaa gcggggtcac ctgaggccca gacagggtga 31561 catcaccttt ggttttggac aggctcaggc catggatctc ccacgtggtc agagagtcgg 31621 ggagccacag tgtcaatctg tgtagggaaa ggcagagaag gcccgtctac cccggctggc 31681 cccgagacac agcacagaga aaaggccggg ccggcacaca ctctcacatt tgaaagcggt 31741 ccactgtttc cactctccag agccagttct ctgggaagaa gctgcgcacg ggaatgtcat 31801 cctcatcaat caggtcctcc tcctgcagga tctccagggc tgggggacca cggtggacgg 31861 gagtgaggag gggaccgttc tgcctttcca agcgccgcca cctgtgccct agccccaccc 31921 agcccctcac ctcgttggag gcccgcctgg cccttgtccc tgctcttctt gcgcagactc 31981 tcagcaaatt ggcagcagga caggaagggc tcccggcagt ccggctgctg cacgcgggct 32041 gcccgctgct cgcaggaacg catcatgggc agacgtgtca ccccatcctg gcagcagcgc 32101 ttggctgtcg gggaagcata ctgacccact gcagggccag gtgggggtga gcatgagagg 32161 acaaaaagga catacacctc agcccccgcc ccgacccctt gagaccagca acataagaga 32221 ggctgctgga gagagctgcc caccttccac tcgtaatccc ggaattcgga tctctggccc 32281 caaataccac acgtgggttc agcaaggagc cgaggggcag agaggcagac ccccaacccg 32341 gatcccaggt ggagagccca agctactgcc taggcacccg caactcacat ttctcattaa 32401 tcgccttttg gaagttcacg tttctctttt tccgggttgt cttctccttg ggacagctta 32461 gtcctggtag agagaaaggc tgcagtccag ccgtcaggca ctcggcctcc tcccctcctc 32521 cccttcccct gcccagccct tcctgcccgg actcttccag ctggtcccct caggcccttc 32581 ctccttcctt atcttcccgc cacccactcc ccttccttct ctgttctcac tctttctgga 32641 taaggtccac tggtctccat cagaaaaggc caggcccgct gcctggaaca cctgaagggc 32701 actgtcccca cccccaggac cacagccgag gtcatagctg ttcatagctt caaagacctg 32761 caagaaaggc aggaatgcta ggagccaagt gtggctgagg ggcaggactg agcactcggc 32821 acaggtgaga gggcagcaca tggggggata ggaaaggata cagagccagg agatggagac 32881 cacagggcca agtggggaag agactgtggg gagcctcagt gggatccagg ggctgcgccc 32941 aaggctcagg gaagcagggg gatgagccat ggaggggtga gagagctgtg gagagggtct 33001 ggacaaacct tgcccatgtt gaggggcttg tgggacttgc tgcctgcagc atacagagct 33061 gtgtccaagg ctcccagcgc caccagggct agggagtcgg tttctaagtg gagcttcacg 33121 gactccccgt tccggtactg cttggcaccg tccacgctga gctccagctg gcaggggcgg 33181 caggtggggg cggtcagagt gggagagctt ccttcagtcc cggtatcctc actgccccca 33241 agctaaatcc atgccctgtt ggcaatcacc ctgtcctcaa ccccctcggc acaagtgcca 33301 tctctcctga ccccggtcac cttgccctcg caggccccag cctggacatc cactcgcagg 33361 gagttggcca ctgggtggtc tccatggtag tagaaggcca caaagtagaa ggagggtgcc 33421 aggtgatggt ccacaaacac cgagaccgag gtcagggtcc tcttgggctc tcgattcatg 33481 aacacgatct gccctcggga taggatctgg gccaagaatg ggagggacaa gagtggttgc 33541 ctcttcatgg gacagcctca gccttgaacc cccccagccc cacccagagg gctcttccct 33601 gcaccccagc cctccttgac tccccagctc atgcacacca tgtagtagta atgagaaaag 33661 gtggccccac tgcccacggc tcgcaagttc aggttcagag tgtccccaac acgaggaggt 33721 cgagaatccg gccgctcaat agacagaaac ccggggcctc ctgaaggtgg ggctgccaca 33781 gtgagcctgg ctatcgctgg atgtggggag cctgcagata cctggagagg gggtcaggtg 33841 cgaatagggt agtagctcag agccaagtac cccaccttcc ccccaagtca ggccatggat 33901 ccttgggacc ccagctcacc ctcctcccct tcccccacca tctcccaggg ggccgaggaa 33961 tcctactgag agctgcagct ctgagatggt ctgagggata attattggaa tgctgacttg 34021 gccgctcccg tctgtgtttt gctgaatgtc ctggacttca ggaacagacc caggagaaga 34081 caccgtggca gaaactttga caggaatgcc agaagctggg gagcctgaca tctcacggac 34141 caaggcctag gcaggtaaag gagggcaggc aaaagagagt ggtcagaccc tgagccctcc 34201 taactaccac atcctcccta ctcatccttc ccctctggaa gaaacctgca gcaggaaggg 34261 ggccccaggc acaaggtgtc gcttggtctt gctaagatcc aaggagaagg gagatgacac 34321 aaaataccag gatgtgagct ctgcctcctc catctcccca cctgcaagac aaaggacaga 34381 gagaggtggg ggacagagcc caagaagaga gggacaggag gacaagtggg gagtggcttg 34441 agtggttccc tcccacaaga cagtgagctc ccagggcaca ggctgccgta ttcctgtctg 34501 tgttgggaaa aggacttgtg gggtgcctgt ataaactggc cataaaaata tgggacaata 34561 agttgtggaa agccacaaga ggcctctgag gagaaaagcc tcctaattgc catgctcaga 34621 gcgagacctg ctctctctta tctgtaaaca ctgtattcaa ggagaaagac cctcctttga 34681 agcattggaa tgtggacaga cgtgcaggct cctagttaag cccactccca ctagctactc 34741 tccgataagt taaagatatg ctgtttgagc acaaaggaga ttcatttaaa gcgcttctgc 34801 tgtagattat gcctgtgacg cactgctacc ctttcactgt tttgccctga acatctgctt 34861 cttagatcta agttattgta ctcaataaat agtgtggaga ccagagctct gagccttttg 34921 cagcctccat tttgcaattg gccccctggc ctccactctt tatgaactct taacctgtct 34981 cttctcattc ctttgtcacc accagacttc aggtacccta caggtggtgt tgaggctggt 35041 ccccaacatt ctggcgccca acgtggggcc caaaagaatc tggtgaggaa acgctcaagc 35101 atgtgaaaca gaggaccaac gaacaaagga ctcccaagga cataaaagtt ttaacctcta 35161 caggtaagcg gggcgcccag agaaagctag ggacacaatg ggaaaaactg aaagtaagta 35221 caccacgtat ttgagcttcc tacggcagct cttcaagcat gcatggtggg gtaaaagttg 35281 atacggaaaa tcttatggat ttgtttcatg ctatggaaca attttgccct tggttcccaa 35341 aacaggaaac tttgaaatta aaacattaag aaggagttgg aaaggacctt aaaagagcat 35401 atagagaagg aaaggaaatt cctttgcctg tttggtcgct ttggtcattg gtgcatgcag 35461 cactggagcc ttttcagaca gataatgagg ctgagtcaga ggaggagaga gaggagtttg 35521 ataatcagaa ctctgaacca cctctaccga gtactaacaa aaaggagagt ctgaagatga 35581 tttatgccaa tctccccagt ctccctaaac ctactcaaaa aattgttcag cccacggttc 35641 ctgtagagga atgtccagaa tggccacctc ctcctcagcc gagtgggtgc agggggaggg 35701 agcccgagac ttggctcacc gtgcccatta ttgcccgacc cacagttcat tatggagatg 35761 gggcaattca ggttcaccct acagttatta cagtgaagga gcaatttccc ttaaaatgga 35821 tgacccggcg ccccgtctgg gttgaacagt ggccgctccc taaggaaaag ttgggggtgc 35881 tttataaaat aaactactaa aaaaaggata tatttcaccc actttctctc cttggaattc 35941 cccagtattt gtaattaaga aaaagtccgg tagatggcgt caacgctgta attcaaccga 36001 tgggagcctt acaacctggg ctcccatccc ccactgtgct ccctaaagac tgaccgcttg 36061 ttattataga tttaaaagac tgctttttta caattccttt agcagaggca gatttcaaaa 36121 aatttgcctt taccattcct gccgttaata acaaaaaacc tgcagccaaa tatcattgga 36181 aagttttgcc ccagggtatg ttaaatagtc ccacagtttg tcaaactttt gtaggcagaa 36241 ctatccagcc tgttagagat cagtttccag atttgtgcag caaaaagtag agaccaactt 36301 attcaatatt attaatcttt gcaaaagaca attacaaatg ctgaattact tatagcacct 36361 gacaaaattc aaacaaccac tccttttcag tatttgaaaa tacaagtaca ggatagagcc 36421 attaagcctc aaaaggttca aattagaaga gattctttca aaaccttaaa taattttcaa 36481 aaattgttag aagatattaa ttggatttgg cccaatttag caattcctac ttatgctatg 36541 tctaatctct tctcaatatt gaggggaaat accaacttac gcagtaacag agaactaaca 36601 cccgaggcca tgaaagagtt atcagtaatt gaaaacaaaa ttcagcaagc ccaggtcagt 36661 aggattgact cagacttgcc tttataattc attgtgttcc ctacttcaca ctaaccacac 36721 aataatgggg gttattgttc aaaatgatga tttagttaaa tggtcctttt tgccacataa 36781 taccataaaa gcacttacag tatacttaaa tcagatggca attctaattg gacaggctca 36841 tatatgaatt attaaacttt gtggcactga gcccaataaa aattatagtt ccaataaata 36901 aaaatcaggt taaacaggca tttattaact cagttacatg acagattaat ttaacaaaat 36961 ttgttggatg tattaataat cattatccta aaaacttttc aattcttaaa attaactaca 37021 taggttcttc caaaaattac ttgtgatgcc cctttggaag gagccatagc tgtttttact 37081 ggtgggtctg gtaaacatga aaaagcaaca gtctggtgga gaccacataa tccaatcact 37141 tgatctgaat ttactaacat tcagagagct aaggttattc tgtgtattta tttaaaaact 37201 attacagcct taagtttgct ctggagccca ctctgtgtgg tctttttctt caacttcaac 37261 aattactaga ccaaggtaca catcctactt ttattacaca cattcgagcc cacagctctc 37321 tgcctggccc attggcttac ggcaataatc aagcagacct tcaggttatg acatcactgc 37381 ttgaccaagc cacccaatca catcgattat tccaccaaaa ttggagaaac ttatctaaat 37441 aatttcaact tacacagagg ctggctaaac aaattatccc acaatgccca gattaccagc 37501 tcacaggcac ataccctcct tcaataggtg ttaaccgtaa agaattggaa cctagtcagt 37561 tctggcaaac agatgttaaa cacatcccta aattttaaaa actaaaatat gtacatatat 37621 ccattgttac caacactcat ctaattatta cacatttaaa aaaataaaag taaaaaaaag 37681 actaagacaa aaatcaaaaa aatacaaaaa aagtaaaaaa atttaaaaag ttataaaaat 37741 gtacctttag taaaaaaatt ataaaacata aaaagttaag acatgttaaa aattgtctgt 37801 aaaagtcata aaaaaagtta taaaaaattt atacaaaaaa ggttgtttaa ttttgtttta 37861 aagatctaaa caagttttaa aatgataatt gtaaaaaatt ccgtgtgtaa acgtatttac 37921 taaagttaaa aagatatcat ccagttttct ataaactaaa cattaaaata aaacacaagt 37981 ttttcttaaa acactaacct gctctttaaa aattgtaaaa agtctcttaa cacagacgcc 38041 actcctaaaa tttccagtac cagcctaaag actacatcct catcaaagga taaaaaatta 38101 aaaaataaaa aaaaatttga accagcctaa aaaagaccct acaggaacta cagcctcaac 38161 aatgcgactt ccacaaacaa cacaggcctc agacattata ctaaaaaaac aaaagtctaa 38221 gccaaataat ttattcattt ttaattctct cactttgcct actacctata cctgctacac 38281 tgtattaagc tcgtatctta aatccgcctt tcttctgccc tgttacttta acaaacaccc 38341 ccttctcagc ttctaataac ataactgctt agctagaata aattaacata cccccagtgg 38401 ggttcctcat taataacata tagtaaacta agatgccaag taacactaca ggtcactctt 38461 ttactaaaaa aaaaagttac taattatact catgtttgtc ttctgttatt tactaatcct 38521 agaatacaaa gccaaaataa aaacagtgac ctcctcgcct aacaaacctg tggctacaac 38581 agcccaaaat tatacctatt gggcatgtgt cccattcctg cctttaatta ggcctgtcac 38641 atggttaaaa cccccagttg aagtttatgt taataatagc gtttggatcc ctaagcctac 38701 aaatactcat gggccctctc acccaaagga aaaaaaaaaa agttaataaa tgtgtccata 38761 ggttatcagt tcccccctct ttacataagg ccaactatcg gttgcctaaa aggctaccga 38821 caacattgac tagttaaaat tccaggtcat aatcaaagac cagtatccta tcatttattt 38881 tctggatgga gcccggatca ttcacagagt tcaattcaat taacagttta agcccccaaa 38941 aaagaggtgc caacaacctt aacaatggtc aaataattta aaaatattaa ttaaaaaaaa 39001 ttacatctct gatcacacta tggtactaca aaataattcc tatgaaattg tcattaattg 39061 gtcccctgag gggaccttta cagttaattg tacccatcaa aataataaat acaagacaaa 39121 actaaaacag aaactatact atcaaaaagg taacactact tacactgaaa aacgtgctca 39181 ttttcccata atttggacca attttagtac agctggccca catcccaaaa taattaatcc 39241 aataataggc cctaaacact ccaaattatg aaagttaata atggcccaat ctcatattta 39301 agtttagaaa aaaatatatt atctttaaaa aaaaaaggtt aaaaacttca atttgcgtat 39361 cagttttctt ccaacaaaac agtgcccatt cagagttgtg tcaaccctcc ttttatgtta 39421 atggtcaaaa atattgacat tcgacctaat tctcaaacta ttacttgtca aaactgtcac 39481 cttttcacct gtattaattc cacgttcggt gtaaaaacat ctgtgttact gataaaaact 39541 aagaaaggag tttggatact ggtttccctc aatagacctt agaaagcctc tccttccatt 39601 catattgtca caaaaatgtt ttaaaaaaaa gtgtttacca aaacaaagag atttattttt 39661 acccttataa cagtcttatg ggccttattg cagtcacagc tactgctgcg gctgctggaa 39721 ttgctttaca ctcctctgtt caaactacaa aatatataaa tagttaacaa aaaattcctc 39781 aaaattgtgg aattctcaga cccaaataga ccaacaattg acaaatcaaa caaatgatct 39841 tagacagact gttatttaaa tggaagatcg tacaataaac ttaaaacatc aattagaagt 39901 acaatgtaat tgaaatactt ccaatttcta cataactccc cattcgtata atactactaa 39961 acatcatttt taaaaagtta gacatcatct aaaagaaaaa aataaaaatt taacattaaa 40021 tataaccaaa ttttaaaaaa acaggttttt aaagcatctc aggctcattt aaccctcctg 40081 cctgagactg acattctcat tggagctact gacggacttt caaacataaa tcctcttaaa 40141 cagattaaga ccattaaatg atcaactatt acaaatttta ctttaatgtg tatctgttta 40201 tgctgtttac ttttagtcta cagatgcaaa agacacttct gaaaacagac caaacaccac 40261 aaataagcca taatagcaat agcggttaaa aaaaaaaaaa gggagggggg catgttggga 40321 aaaggacttg tggggtgcct gtataaactg gccataaaaa tatgagacaa taaattgtgg 40381 aaagccacaa gaggcctctg agaagaaaag cctcctaatt gccatcatgt tcccatgctc 40441 agagtgagac cccctgtctt atctgtaaac actgtgttca aggagaaaga ccctcctttg 40501 aagcattgga cagacatgca gtctcctagc taagcccact tccaccagct actctccgat 40561 aatttaaaga catgctgttt gagcacaaag gagattcatt taaaactcta ttgctataga 40621 ttacgcctat gacccactgc ctccctttca ctgtttctcc ctgaacatct gcttcttaga 40681 tctgagtgac tgtactcaaa aaatagtgtg gagaccagag ctctgagcct tttgcagcct 40741 ccattttgca attggccccc tggcccccac tctttatgaa ctcttaacct gtctcttctc 40801 attcctgtgt caccaatgga cttcaggaac cctacgggtg gcgttgaggc tggtccccaa 40861 catgtctgtg catgctgagg cctagcacgg ggcattgaac aacacatgtc cactggagga 40921 gtgaaggaat gagcagaacg agcaaagaaa taaataaact cagccaggga aaagggccga 40981 gtcacagaga cagagttgga gagaaaccag gtctcctggg tgtttcctgg ttttgagtct 41041 gagatgagat gtgagaagtg gggtggtgtt ggtgccggag gacagagggt tagctcagag 41101 gtcagaggca agggtctggg gttacaataa gggaaagtca cccacctgga tactcaatga 41161 tggctgcagc aacgtagagg cgcagcccct ggaggtcagt aatgcccata ttcagcttct 41221 ccagggcgtc ctggaactct gcctttgaga gggaaatgtg gctctgtcca ttcaccagct 41281 ggggcataga aagaaacagg acatagggtg agactgagtc tcccacctca cctcccttgc 41341 cccttcccct ccccagcccc tattctcctt cctaccttgg tctgactctc cagcccccga 41401 aagaaagtct tcttaccatc ctcatctagg agcccaaagc gcacatatgc caccccctgc 41461 actggcttcc catagatgta cctgtcgtgg cagagagaag agggtgggcc aagggctggg 41521 gggaataatg gcccgagggc agggaagcag gagcccattc atactgagta gggagcagga 41581 cccggtgctg gtgggcagag gtggggaggg aggtattacc tggcctggat gtctaactgc 41641 atttcatcaa gatggcctgg caccgtcagg atgtagggct ttccaggggt gatcttcacc 41701 tcaaagttgg gaaggactga cccagggtga agggataggc aggtcagact ccagctcaaa 41761 gactcccctc tgctcagggc tgtggcaccc atatctccct gccctgagct gctcccagta 41821 cctctccttc cacccttatt tccttcagga aagcagctgc ctgtccctcc agtttccagc 41881 tctcaccata tttcttcacc tcaaactggg tgctgctgtt ggattccagg ccatctgaga 41941 atcgggctga gatcttccag gtccctggcc tgagaatgga caaggaaggg gctcagccca 42001 tctgtacagt ggggcacgga gagccagcag cctgcttccc tgggaagagg actgtggggg 42061 ttaaccagag gctcaggagg ctgagggtca gggcatctgg ggacgtgcct ctgtgtggga 42121 ggtggagagc ctaacaggaa ttggggtggt gtagcttggg ggcagccccc acattgggag 42181 cgctcactct gagatgtctg ggatcacaaa gtcatcctgg aagatggacg agggcatgta 42241 cacctccttc ttccgcacgc ggaggccgtg agagttctgc aaggggagaa gtgctcacag 42301 gcaggaggtc acatcagtgg ccaggatcag gaaggccaga ggtcggggac tcacctccac 42361 catgactgtg atggtgtcag tgctcgggcg catcttctga tccagagcaa agacccggta 42421 ccgaactggg agtggaggag gagagaggtg agcaggggtc catgtgcaag gggagggtgg 42481 gtcaaactcc acagagggag caggggacaa atgtttccta agcacccctt ctgtgtggca 42541 ctttctttca ggttatctca cttagggggc accaaactca tcctgagagg gctcggaggg 42601 ggttaaaggt tgaggccctg gggctgagac tcaccccgct ggccagggtt gtaaatgggc 42661 tggtccgtct gcaaaaagag gtgcccccgg cgagaggaga agagcaggtt gataccctgg 42721 atgtttgtcg ttctggacag agagtccttt agccatggcg aatgggccac cagctggacc 42781 tcagggcctc tgaggagttg atggaggcca cagctcttcg catctttcaa gggcacctgt 42841 caggagaggg agagggagag ggagcgggtc acagagcaag agacagctga ccaaaaagga 42901 cagagaccaa gggagaaacg tggaaggaga atgccagggt gggaagacag gaggggagga 42961 ggccagtggg aagatgatga cacttacaag acagatggga acagggcagg aggcccccac 43021 aagcagcagg agggcatggg gtctggttac ctggagactg aggagtgcga agtctctttc 43081 tgagctaagg gtgaagtcca cctttgggga gcaggggaca ttattacgag atgggtttct 43141 caggaacact gatcctttca ctacctgtcc tcggggcaca tcctggagct gcacccccac 43201 cgataggggg acccccagat gaaccacaga aggagagaac aagagcaacc tggggagaac 43261 agacaggatc agcagtcaga cttcgctctg acacctccac ccctgctctc cctcactcct 43321 gaatcgggtc ccgatgccag ccctgcccca atccaagcac ccagcatccc gcctccagga 43381 cctgggcttc tgcagagata aggtgaagaa gctggatgcc cagatcagcc cccagagcag 43441 cctcatggct ggaggatcca agagaggtta gatccgtctg tctgtctgct accttctggc 43501 caagctaggc ctcggggcag agttgactct ggaccttgct cctcccccag cccagctaag 43561 ctgggaaacc acgtgacagc caagaagtgc aactggcctc aggcccagag ttgtgggggc 43621 accccggaca cctgggtgtc catgggaaac tctgaatctt ggcccagaaa taaccctgtc 43681 cttccccgtt aactcctctc attccagtgc ctgagcacct ccccctacca ggtatatctg 43741 tatcagggta taggtgcaca caagctcatg tatacctggg cgtacaaggg cccaacatgg 43801 cccacgagta caggatattt aaaggccctc acaaaacaat gacaggcttc taagacactt 43861 gtctcttact ttcattccaa cacaaattga actatactag gcttttgctt ttttaggccc 43921 taacagaaat tccgtggttt aggagattag gtacaccctc accactccac agggagaatg 43981 gcctgaacgt cagagtgaac ccctgacccc tttcctctct gaatgaggca aagctcagac 44041 ttcacctcta cccctaaaca aggcacccaa acacacgtca cagtaaataa aggacatcca 44101 gaaaatatca caggcagagg tactttattt ggcaatttta acatgacacg tagagaaaag 44161 aaccctgccc tccttcacca gcctccccag aaatcccacc ttcctatttc aagacagagt 44221 aataacagca ccattttaca cgaaagggaa cagccacagc cttggcacca tttctggttc 44281 cactttccat ggaagggcag agaagcattg ctcaaacccc accactgggt cagaaaccag 44341 gcaaacagca gcagtcacat ctgacctttt gcacacacga gagcccgaca ctcatcctgg 44401 ctcccaagcc ttcccaaggt gaccctgtct cagccaccat catgcagctc ttctagtccc 44461 ctcccggccc actctgagga gctgagcaat gatgagcaga atcttcatgt ctctggcagg 44521 cggaggaggg ttcctgaagt ggtagagatg ctgtgcagga gaaagacaag gctggagcca 44581 gacgcctagg ctcaagctgt gaggagaact tagaacttgg gaaggggtgc tcccgggttc 44641 cagagaagaa tgagagccac tggccttcac taggaaacta agcatgagct gggaggaaat 44701 cccactttgg gtcattgctc cataatctgc cagaggccag ggaaagactc accagtccac 44761 tagctgggcc ccaatgaggt cgtgcacatg gtaggtgagg ccaagccgca ccacgacagg 44821 cgcccgccgg cccaggagct ctgataggag cagttcccgg tactttgcct tccggaccat 44881 gctaaggaca gcctggcgcc ctagagaagt ggcacaggaa aggggaaagg tgaataaggc 44941 ctagaggcct gaggagccac caaaaggtga ggggctgcag gcttgagctg cagatgggat 45001 acctttaaca aagtacttga tgaatctccc agctccaggc acagctagcc accagctccc 45061 agcatctcgg acggtgagga ctccagcatt caccagatgc ctgagggcag tgggaagcca 45121 gcagagatga gggctgggct ctgagaacca tctctcctcc tcatccatta atgtttacac 45181 aagtctccag aaacacgggc aaccatgcaa gagagactgg ataacaaaat cccagcctgg 45241 ggtggttctg tgctcgcacc aactgcctct gttggtgcag ttgcttgttc caggccccct 45301 acctggactg ccctccctcc actttttctt ttcctgctac tattactggt tcagttaaat 45361 agcttctctt ccctgaccac atgtcctgac acacattcgg tcttaggacc gaggagagga 45421 atcagagtct tctgcgctcc tgccactgcc taggcagact gcgccatggc tcccaccaca 45481 tctctctgag agtgtgtctg cccatgtgag atcagtgcta tagtttttca aatgctttca 45541 catattccat gggttacaaa gtaggaggcc tgagccagat acagcctctc aaaattagat 45601 ttttatttag ctcctgccca gttgtttttt agctttgggt tttttttgtt ttaatgtgaa 45661 tgtttagggg tatcccagct cctaccattc tctatcctct tagtcccagt atgattaaca 45721 tcaagctttt acagttcctg ccaggcccct gtgtgctttt gataagccac caagaactga 45781 attctatttc agggctcttt ctaaggccca ggtaagatag gcctcctctc ttgtggggag 45841 gaatacaaac ttgggtcatt ttattaaaaa gaagagtcag ctttttggca gcttttctgg 45901 caggttcacc aaaaaaacag gaggacaggc tggatggaac tggagggagg cagggaagca 45961 ccagatgcct gactttggtt ggttccacaa gtctcacgtg atttctgagt ccctgaagcc 46021 aaaggtctgt gtcatttggt cctgctggaa actaaggtcc ccacaggctg gaagtactga 46081 agctagaaat ttctgcactg ccccagcata cggtcggcca tcacaggcct tgaggaccta 46141 caagcagaga tgagagacat ggtccatccc caccctcact ctctaccaac ctccaccccc 46201 aggacagcat ttctcttcct gtcttacctc ttgctcagat ctttccacct ctcataccct 46261 tcttcttcct cctcctcttc atcctcctcc ccctcttcct ccccccctcc tcctccccct 46321 ccttccttcc tcctcctccc cctcctctcc ccccctcttt accctctcac tcctccctct 46381 ttccctcctg tctctctccc tcatattact atttaccagg gccctgtgcc ctctccatgg 46441 gatgtggggg cagggaaatt ctttcttttg tttttatttc tgtttcccaa cttccctgtc 46501 tcccccaacc tgtgccctcc ccccagcacc cctgacgcac acagtcacat actctggtcc 46561 tgtagtcctc agtgaagata attccatggg catccaagtc gaagcccagc tggacgattc 46621 tgatctcccc ctgctcttga agctccttct gtctccacag agaagagagt gaaaagaggc 46681 gtatcagggt aagatgctct gctatactct gggcgctctc aggcagggag attttgtttg 46741 ccaatatggg ggcattttct cttagtaaag gaaaaactgt gtgagaaaat aatcaagcaa 46801 atgaccaagc agccttttcc aactagctac ccacattctc atccacctgc cttatatatg 46861 gtctatcaaa gagtcaatat aatttctaag accaggcact aagataaaat agtgattaag 46921 atgcaggctg tggcttcaaa acattaacag tctaaattct gggtagggta aggtggggag 46981 gagtatggga ttcaatgttt aaaacaaata aaaataaacc aggacttctg tttacgagaa 47041 tacagcaaga actaatccac ctgttgaaaa ttaactaaaa tcctggataa aaatgttatt 47101 ttaaaattgt ttaagaccag gtacagtggc tcatgcctgt aattcagcat tttgggaggc 47161 tgaggtagca ggatcaattg aggacaggag ttcgagacca gtctgggcaa cacagtgaga 47221 cccccccacc cgccacatct ctacaaaaaa tgtaaaaaca gccaggcgtg ggggcagaca 47281 cctatggtcc tagctacttg ggagactaag gcaagaggat cacttgaccc aggagtttga 47341 ggctgcagta agatatgatt gcaactgggc aacacagtga gaccccatct cttaaaaaaa 47401 attgtttaga cactaaagag ctcacaggat aacaggaatt accactaata taacatctaa 47461 gtaggccggg cgagatggct caagcctgta atcccggcac tttgggaggc tgacgtgggt 47521 ggatcacctg aggtcaggag ttcaatatca gcctgaccaa tatggtaaaa ccccgtctct 47581 actaaaacta caaaaattag ccgggcgtgg tgatgtgtgc ctgtagtccc agctactcag 47641 gaggctgaca caggagaact gcttgaaccc agaaagttgc tgtgagccga gatcatgcca 47701 ctgcgctcca gcctgggtga cagagtgaga ctctaactca aaaaataaaa ataaaaataa 47761 ctggggccgg gcacggtggc tcatgcctgt aatcccagca ctttgggacg ctaagggtgg 47821 atcacttgag gtcagaagtt caagttcaag accagcctgg ccaacatgat gaaacccggt 47881 ctccattaaa aatacaaaaa taggccagat gcggtggctc atgcctgtaa tcccagcact 47941 ttggcaggct gagacgggtg gatcacgagg tccggagatc aagaccatcc tggctaacac 48001 ggtgaaaccc catctctact aaaaatacaa aaaaaaaaac aattagccag gcatagtggc 48061 gggcacctgt agtcccagct actcgggagg ctgaggcggg agaatgggtg aacccaggag 48121 gcagagcttg cagtaagcca agatcatgcc actgaactcc agcctgggtg acagagcgag 48181 actccgtctc aaaaaaaaaa aaaaaaatta gctgggcgtg gtgtcaggtg cctgtaatgc 48241 cagctacttg ggaggctgag gcaggagaat cacttgaact caggaggcag aggttgcagt 48301 gagccaagat cataccactg cactccagct tggtgacagg gtaagacact gtctcaaaaa 48361 aaaaaaagta attggaacat ggagggcatt tattaatcta ggcaaatctg cccagaacat 48421 tttcaaaagt aactgggcgc ataaggaaac aaaactcaaa gagagaataa aaacaaaccc 48481 caaaagacaa gtaatagtgg aattatcctt tccacggtct ccctctgatg ccgagccgaa 48541 gctggactgt actgctgcca tctcagctca ctgcaacctc cctgcctgac tctcctgcct 48601 cagcctgccg agtgcctgcg attgcaggcg cgcgccgcca cgcctgactg gttttcgtat 48661 tttgttagtg gagacggggt ttcgctgtgt tggccgggct ggtctccagc tcctaaccgc 48721 gagtgatcca ccagcctcgg cctccggagg tgccgcgatt gcagacggtg tctggttcac 48781 tcagtgctca atggtgccca ggctggagtg cagtggcggg atctcggctt gctacaacct 48841 ccacctccca gccgcctgcc ttggcctccc aaagtgccca gagtgcagcc tctgcccggc 48901 cgccaccccg tctgggaagt gaggggcgtc tctgcctggc cgcccatcgt ctgggatgtg 48961 aggagcccct ctgcctggct gcccagtctg gaaagtgagg agcgtctctg cccggccgcc 49021 atcccatcta ggaagtgagg agcgtctctg cccggccgct catcgtctga gatgtgggga 49081 gcgcctttgc cccgccgccc cgtctgggat gtgaggagcg cctctgcccg gccgcgaccc 49141 cgtctgggag gtgaggaggg tctctgccca gcctccccgt ctgagaaggg aggagaccct 49201 ccgcccggca gccgccccgt ctgagaagtg aggagcatct ccgcccggca gccgccccct 49261 ccaggaggga ggtggggggg tcagccccct gcccggccag ccaccccgtc cgggaggtga 49321 ggggcgcccc tgcccagcca cccctactgg gaagtgagga gcccctctgc ccggccagcc 49381 gccctgtccg ggagggaggt gggggggtca gccccccacg cagccagccg ccccgtccgg 49441 gagggaggtg gggggtcagc cccccacccg gccagccgcc ccgtctggga gggaggtggt 49501 ggggtcagcc ccccccgccc ggccagccgc ctcgtccggg aggtgagggg cgcctctgcc 49561 cggccgcccc tactgggaag tgaggagccc ctctgcccgg ccagccgccc cgtccgggag 49621 ggaggtgggg gggtcagccc cccgcccggc cagccgccct gtccgggagg tgaggggcgc 49681 ctctgcccgg ccgcccctgc tgggaagtga ggagcccctc tgcccggcca ccaccccgtc 49741 tgggaggtgt gcccaacagc tcattgagaa cgggccacga tgacaatggt ggttttgtgg 49801 actagaaagc gggaaaaggt ggggaaaaga ttgagaaatc ggatggttgc cgtgtctgtg 49861 tggaaagagg tagacatggg agacttttca aaaaaaaaaa aaaatagtgg aattatcaaa 49921 aagacaatta aaaaaaaaaa aaacagcact ttgggaggcc gaggcaggca gatcatctga 49981 gtcaggagtt tgagaccagc ctggccaaca gggtgaaacc ccatccctat aaaaatacaa 50041 aattaggccg ggtgcggtgg atcatgcctg taatcccagc actttgggag gccgaggagg 50101 gcagatcacg aggtcaggag atcaagacca ttatggccaa catgatgaaa cccccatctc 50161 tactaaaaat accaaaatta gctgggcatg gtggcgcgtg cctgtagtct cagctaatca 50221 ggacagtgag gcaggagaat tgcttgaacc caggaggcag aggttgcagt gagttgagat 50281 cacaccactg cactccagcc tgggcaacag agcgacactc cacctcaaaa aaaaaaaaaa 50341 aaagggaaaa ctaaggcaac aatgaccaca gcagtcacct cccagggaca ttctgatcag 50401 agacattttg atcagagagg gatatatgtg ggatactggt catattctat agtttgacct 50461 agggagtggc tagataggta tctgctttat aaacatcatt taaattatat atgtttatat 50521 tttcattctg caacttttaa aggtttttaa aaacaaaagc taacaccaat acattaaaat 50581 caacaaatac tccttaagga gtggttccat ctggcatgct tcctcacggg agacctagtc 50641 agggagacta gctcacctca agtgagtgct agactgcctt ccttttcttt cctcttttct 50701 tgaaagatca ctattaacct tctacctgct aaatccaatg gacagatttc agttctgggt 50761 ttttttgttg ttgttttcat ttttttagag acagggtctc cctctgtcac ccagcctgga 50821 gtgcagtcgt gcaatcatag ctcactgcag cctcaaattc ctgggctcaa gcactccttc 50881 tgcctcaact tcccaaacag ctagaacaat aggtgcacac caccatgccc agctaagaat 50941 ttttgtgttt taatttattt atttagtaga gatgggatct cagttattta gtaaagatgg 51001 ggtctcactt tgttgcccag gctggtctca aacttttggt ttcaagctat actcctgcct 51061 cagtctccca aagtggtggg agtacaggca tgagccacca cactcagcct agatttcagt 51121 ttatatttta ctttacctct tccctcttct tagaccactg tcttgaatca caggttctaa 51181 tactgccaac tacttcactt tctcaaaaat ctctttactt tcacagtagg cctcttcctg 51241 ggttactttt tttttttttt ttttgagaca aagtctagct ctatcaccca gggtggggtg 51301 cagcggtgct atctcagctc actgcaacct ccgcctcccc agctcaagca atcctcccac 51361 ctcagcctcc ccagtagctc tataggcgcg taccaccacg cctggctaat ttttgtattt 51421 tttgtagaga tggggttatg tcatgttgcc catgctggta tcaaactcat gagctcaaga 51481 gatccaccca cctcggcctc ccaaagtgct gggattacag gtgtgagcca ccgcgccggc 51541 ctctgggttc ctcttaactc aggcaagtcc ctctctgcct ctgtcacaga ttcctcttct 51601 tctgctttcc ctttatgtca gtggatctca aattttaagg cacacttaaa aagcacttga 51661 ggcagttcaa aaatgtagag tcccaaatcc cgccccaaga gagcgtaatt cattaacctt 51721 ggatgagtcc caggaacctg tatttttttt aacaagcagc ccatgtaatt ttgacacagg 51781 cataataact cagctcatat attcagtaac actgccttaa atgatggtgt ttcccagcat 51841 tgtggatttg accctattaa cttccactct aaacctggag tccttaacca cggacctcgc 51901 aggattagta aaatatttcc cccctaacta cacatacacc ctatgtatat gtggcaaacg 51961 tgcatttttc tggagagaga tagccatagc ttttatcacc ttctaaagat tacaaaaagg 52021 ttagaaatca ctcctctacc cactcttcct actggatctc accacatcac agaattaacc 52081 acttgctatt cacacctcca tgacttgtaa ttctattatc actggcccag aattctctcc 52141 aactctcata tccaactgtc taataccttt gtctttggat gctccacatg caccatcaac 52201 tcaacttgta taaaattaaa cacattatct gtagtccaac aactaaatgt gctcctaata 52261 ccaccagcca cgcagtcatc tggaaagtct gagttgtcct ggactcctcc ctcttcctca 52321 ctctcctctc cctccctact gcactgacct aatttaggcc ttcaccactt ctcagatgca 52381 gtatcttctt agctgacctc cctgcctcat ccactatctc caatacacca ttccctctct 52441 ctatgatatt ctctaccagc ccatcgcctc agtgtaagat ggaaacaaca ctgatgtaca 52501 aggcctgtca tgatctgacg tctgactgca ctcccagcgc tgtatgttcg aaccatgccc 52561 catgactcgg tttccccaaa agcctcgttc ttctacctcc atgatttgaa atgccctgta 52621 ctacctgcgg gacttttaac agggtctctg tggcacccca gtgcctggca cagtgccctc 52681 gcctgaaagg cctaatgctt gctgaacggc tggcggagtg actgagtgac agggaattcc 52741 aggagttagg aaaggggctt ccggcggtcg caccgacgcc cctcaccagc tgccggtcgg 52801 ccacggtcct gtcaggcaca aggctgtaca cctggctcct cagcacgatg ggcggcagcg 52861 cgtcctcaaa caggcctcgc gggaacagct gcatgagttc tgagacagcc gcgcgcgccg 52921 accctgggca gataaatcgg ctcacggaca gggaactggg gcggggacgc ggttctaaca 52981 aaactctcgg ctaccggaag cgaggcccca cccccggggt tgccatggtt acctggctca 53041 ccccgaagag gatccgactc cacaggccct cgcttccgcc gcctcttaac tccaaaggtc 53101 tccgggatca ggtgatgcct cttccagctc atatcccggg attttatggt accggggaag 53161 gggtaggaat ggagggaaga gaacctgaaa atagggtctt ccggcgcaga gcagtgacgt 53221 acggtctccc cgggcgtccc tcctagacca ggtgatgacg aaaggagcgt caacttgtcg 53281 tccctcaggc ccgtcagtgc tgggaggggc ggtggcgacg cacataccag catcacctcc 53341 gccaggccgg gccccacgcc ggccgcggat tggctccctc caagggcacg cacgcccggg 53401 gactcgttgg cggcgtggag gggcgccggt ggccacgttg gtgtcaacct ccttcgtgaa 53461 gctcacacct cccccgcccc gggaggggtt tgcccgccac tgtcgctgaa tgattgcatc 53521 atcgaaagca gaaaaccact tttgcatcct tcggcctctg gcgtgcctgc catgacgtca 53581 tagctctgcg gaggtggaag ttggggagct ttgaggtaac tggattcttg atctgagcgc 53641 agacgtcctt cctaacctca ctgcatttgg aggacctgga gggagggtgg gtgaagggca 53701 aggaaagagg caggatgaga gcttggccgc ggtggcgtct gaggggcctg aatgtttcaa 53761 ggccagagcc tggcgatcag gtggctcgct tagtcctaaa ccagtcatcc ttctccaggc 53821 ctcctctgta gaatggaaac tctgtacccc tgcttgtctt aggacctcat ggatcccagg 53881 gggaccaaga gaggagctga gaagacagag gtagctgagc ctcggaacaa actacctcgt 53941 ccagcacctt ctctgcccac agaccctgcc ctctactctg ggccctttcc tttctaccgg 54001 cgcccttcgg aactgggctg cttctccctg gatgctcaac gccagtacca tggagatgcc 54061 cgagccctgc gctactatag cccacccccc actaacggtc caggccccaa ctttgacctc 54121 agagacggat acccggatcg ataccagccc cgggacgagg aggtccagga aaggctggac 54181 cacctgctgt gctggctcct ggaacaccga ggccggttgg aggggtgagc aaagcgtggt 54241 aggcagtata tctggagacc cgtacctacc ctctaaaatt aggagagcca aagccggggt 54301 gagaatcagt ccttagaaga aacaagacta ttcttcgagg ggagcatctc actaatgctt 54361 gtgtcagacc ttcagcctgt aactcctgcc tctcaggggt ccaggctggc tggcagaggc 54421 catagtgacg tggcgggggc acctgacaaa actgctgacg acaccgtatg agcggcagga 54481 gggctggcag ctggcagcct cccggttcca gggaacacta tacctgagtg aagtggagac 54541 accgaacgct cgggcccaga ggcttgctcg gccaccgctc ctccgggagc ttatgtacat 54601 gggatacaaa tttgagcagt acatgtgtgc aggtgagttg cccctgcttc atagccccct 54661 tccccttccc agaggttgag agcccgccac gcctgctgct gcttctctcc ttgtgcagac 54721 aaacctggaa gctccccaga cccctctggg gaggttaaca ccaacgtggc cttctgctct 54781 gtgctacgca gccgcctggg aagccaccct ctgctcttct caggggaggt agactgcaca 54841 gacccccaag ccccatccac acagccccca acctgctatg tggagctcaa gacctccaag 54901 gagatgcaca gccctggcca atggaggagt ttctacaggt tcaggatcgg ggtgggcagg 54961 gcgagagctt aggcttgaag gctgggaaag ggacttgggg aggagggtga aggcagaatg 55021 gagggtgcac gaggggtccc actgatccct tgcttttcct gtcagacaca agctcctgaa 55081 atggtgggct cagtcattcc tcccaggggt cccgaatgtt gttgctggct tccgtaaccc 55141 agacggtttt gtctcttccc tcaagacctt tcctaccatg aagatgtttg aatatgtcag 55201 ggtaagggaa cgatgttgca gctcccaccc gtatccccaa acaccaagac cacaggtcta 55261 gcatccaggg caacagcctg ccttctctcc tcccaccacc cccactgccc atcttctgcc 55321 tcctcctctg cctgctccag aatgaccgtg acggctggaa tccctctgtg tgcatgaact 55381 tctgtgccgc cttccttagc tttgcccaga gcacggttgt ccaggatgac cccaggtgag 55441 gcattcagct ctgtccctcc cctctggatc ccaggatcca gcctctggcc ctcaactgat 55501 gcctccatct gccccccagg ctcgttcatc tcttctcttg ggagcctggc ggcccagtca 55561 ccgtgtctgt acaccaagat gcaccttacg ccttcctgcc catatggtat gtggaagcta 55621 tgactcagga cctcccatca ccccccaaga ctccctctcc caaatagtaa tgctttagag 55681 ggaggcagtc atatctctgt gtgcagataa taaaagcata tttctaagag gttctctcgc 55741 tgtcttctta gctgagtcat ccctgtccag caactcaagc acacaacagt gctttgctgt 55801 tttatcatca tgtttttaca tggggcattc actgggtgta gaggctggcc gcaaatacga 55861 tgtcccgccg tagcaaggta gccgctgtct ccatcttggc acccagcaca ggctctccta 55921 ccaggcgggc tgccccccgc agtgagcgac acatctcagc caggcgctga atgcagcgga 55981 ccaccaggcc ctcaggggtc cctgagagcc ctgccaactc ggagaagggc tgtggggaaa 56041 gtagggtgag gactggatgc actcagcctg ggcaggttct ccccagccag ccgtctgcaa 56101 aatcccaaac ctcaggtact caccatgccc cgggcccact catatacaac ctcaaccagc 56161 ccaaaattca gctcccccac aaattcctcc accgtctggt tcaggccaca agccacctgg 56221 acctcaccaa tccgcttggc cacagcccgg acacgttcta ttccctgcag gaaagaagga 56281 gagattaagg cctgccttcc tgctccagga cagggaagga ccagatcaag aaggcagcca 56341 ttttccatcc ttggggcact tacagggctg gagggggtca cctgggggac tacctgaagg 56401 tgaaggtcag gaatgccaca gccctggcag ggagaaaggg gtggtgtccc ctacctgctt 56461 gagggtgttt gggagctgat ccccagcgtc cccagggctc tggcagacca ggccagagag 56521 caaggcagca atctcctcag gccgcagggt gctcagtgca ttgtcaaaca tgagctcagt 56581 gaggagcaac tcatggctgc tcatggcaca agccacccgc cctgccagct tcacagtgcc 56641 cacctcgtcc acgtaaccca gggttcggag cacctgcaag aaaaagggtg ggcattggac 56701 acactgctgt cccctagccc ccctgcccca accactgccc cacccacctc tactcgctga 56761 tggtactcag gaagcagcag caatgactga tccgacagta ggaagcgcag ccgctccatc 56821 tccttctgta tctgcattcg ctcccgcagc ttcaggtact gcagagagcc acccgtcagc 56881 cttctccttg ccccttagca ggggtgcact gaggacaagg gctgagatgg gggcggtcag 56941 tctacagggg accacagagg aaaagcccct actcccagct tgggagttac cacccagggt 57001 cctacctggg caggaaaacg ggggctgtgt acacactgag ccccctggat cagctcctcc 57061 agcttccggg cccggagccc accctctaca actgacatat ctttgagctg caggtcattg 57121 acagggtcga gggtgggagg tccggctggg tgggcctgag ccagacgcag cagttcctgg 57181 acagcagtgg tcacggctgc aaggggagga tccttcctga agagggagca gatcttaaga 57241 tctggggtaa gatcttctct cccccagaag cctccctcag gctgtggaag ccctcttact 57301 acaacctcca cttcacagtc tcccttaggc tgggggagcc ccctcactac aaccccacca 57361 cttcatggtc tccctcaggc tggaggaaac tcccagacca ctgcccctcc ctgcctctac 57421 gtgcccctct ggaggagaag ggcctcccta gcatctctga cttgaatttt ggctgctgcc 57481 tcttgctgaa gtcctccaag atcttctccc cattcacccg gagcaccttg gtggtgatgg 57541 cagccatatc tcctggctgg agcttgacca tggtgtggtc acaaggccct gtgaaaaggg 57601 gagagcaggg cagacattgc catggaccca ggcatcctgc ttatactgct ggcagaaaac 57661 agacatctgc cacactctca ccttcaggca ggaacagctt gaatcccacg aggtcatctg 57721 gatagggcac ctctgcagtg gctggccccc tgtcctgtgg gtcctgggac aagggcttat 57781 cacacaagac cagggttgtg aatactctgc tggtggagtt cgaggagacc tgggggtggt 57841 cagagaggcc aggagagggg ggtgtgatga gtagacatca aaggcagaga ccagataccc 57901 acccttgggc aggaatcagg aagcaaccat gcagagctca ccagtagaga ggggcaacca 57961 agtcaagcga tcagagcaga cttcttgggg tggtgggagg aaggagtgaa catgatgtgg 58021 tgatgggaag ggagggtgaa aaggagggca agaggggaaa atgagggccc tatgatgacg 58081 caacctgccc tggccaagtg gggaaaatgg agagagggct tgctccctcc caccctctgg 58141 agtccaaatt cccatcaccc tcacctgtag gatcactccc aatgcgttgt gatgctcctg 58201 attcttcaca accaccaccc ttcctgctga gagagacttc agcccgttca cagactccat 58261 gatgcgtcgc tgaagagcaa aggacagatg ggaggaggca tcacagggac agacacttgg 58321 actaagttta gtagcaccta gagtaacacg atcttctctt cccttctcct atcacctcct 58381 cagcactcac acttgctcac ctggatcatg tgctgggtct ctgtcagttc ctccccccag 58441 ctgtaatatt caggcaggtc gaccagttgg ccagtcatgt caggctcctc caaagctccc 58501 agcctcttgg tcagttcagc cagggcctgt tcatgggcct tgggtgggag caggggaagc 58561 tgtgaaaggg gagtctccag gcccaatctg tcacactccc caccctcaac acactggcct 58621 ctggatgcct acctattccc agcctgtctt tggccaacct cctgctccac acactggtta 58681 ccccaggctc cttaccttgc tgtctttgcg ggagggaaac tcagagaagc tcctcttcat 58741 catgtcctcc accctgaggg catccactcg cagcaagttg aggatcatag tgtacgtgag 58801 gcggaactgg gactgcagct gggacggctt cccctggagt caggtcacag aagtcactga 58861 gatcagggtg ggacctactg aaccccagcc agtcttctgg actgggccta ctcccgtcag 58921 atccggtcct agctctgcaa ccatggagcc ttggacaaac cacttgctca cctcttctgt 58981 aacataaggg agcaaactga aataaatgga ctcaaaggcc ccatctagct ccatattcac 59041 atgggagaaa actggaagag atgggctcca atctcatcct tggattcact cactcctacc 59101 tcagcttctc ccaggcaaaa caaaaaagtc ggaggaagtt gggcgctgtg gcccacacct 59161 ataatcccag tactttggga agctgaggca gacagactgc ttgagcccag aagttcaaga 59221 ccaacctgtg caacatagca aaacctcatc tctacaaagc ataaaaaaaa aaaaaaaatt 59281 agccaggcct gtgtggcatg tgcctggaat cccacctact tgggaggctg aggtgggagg 59341 atcccttgag cctggaaggc agaggttgcg gtgacccgag attgtgccac tgcactccag 59401 cctaggtaac agagcaagac cctgtctcca aaaaaaaaag gaagaagaaa aagaagagag 59461 gagaatcgaa gacagaatcc agcaaggtcc tggagctggg gccctgccga gcatgctggc 59521 ccgctcacca tcatcatgcg gtgcaggtct gccatctcgg gcactcggcc cttgcagagc 59581 aggataacgg tgcctgtggg gtccaggccc ctccgccctg cccggcctgc catctgcaca 59641 tactccccag ggagcaggtc ccggaaggtg gagccatcgt gtttgcgcat ggagtcaaac 59701 actactgtac gagcaggcat gtttactccc atggcaaagg tctctgtggc aaacaagacc 59761 tgggacagag gagaacagaa aggatcagca aaggctctac atacacacac ccccagccct 59821 ggccaagccc accttacttt gtaggcataa ggcaccatcc cagttgttac ttatgggatc 59881 tcactgaact ctccaggcaa atctataggg caggtattat tgttttcctc attttactta 59941 tgtagctact tatttctatt tcttaacttt taccttcata tcccattccc atccctgact 60001 gcaatcattc caatgtatcc aacaaatatc tttttatata aatatgatct tatagaacat 60061 gtcttctttt gtgagcttgt agtttataat tcatataaaa agtggttata tatctgattt 60121 ttttttgctt ctttttctcc acctagtgtt tcaacatttg tccatgttcc tgtggccaca 60181 tctaacccac tgcttccaac tgctgcagaa cactctatgg gtgcatcccc cacactaacc 60241 ttccctctct cccagtgaag ggcaccctgg tactaccaac accatgccac cacaaacaaa 60301 ggatgggtgt acatgttctc tcacagacca gggtgagaat ttctttgtga tatataccca 60361 ggaatgaaat ataagctcag agtatgatat agtttttgtt tgtttctttg tttgtttttg 60421 ttttgagatg gagtctccct ctgttgccca ggctggagtg cagcggtgct atcttggctc 60481 actgcaatct ctgcctcctg ggttcaagca attctcctgc ctcagcctcc caagtagctg 60541 ggactacagg cacctgccac catgcccagc taatttttgt atttttagta gagacagggt 60601 ttcaccatgt tggccaggat ggtctcgatc tcttgacctt gtgatccacc cacctcagcc 60661 tcccaaagtg ctgggattac aggtgtgagc caccatgccc ggccaatata gttattttgt 60721 ctaaatagtg ccagcgtgca ttccaaactg gctaggcatc ccgtaaagga ttcccagaac 60781 cctacaacct cctatatctc tggcaacact cggcatgaac tagtttccta aattttacca 60841 gtctaatagc tgtaaagtcg tatcttgtga gtacttcaat ttgtatattt ctgattacta 60901 ataacttttc gaatgcttgc tagcttcctg ggtttctttt ctgagaccta cgtattcata 60961 tcttatgcat ccccatttta agatgggaaa cctaaagttc aggaaggtta aataattggt 61021 ccaggatgat actgtgaata catggtgtac ctgggattca aacccaggta gtctgaatcc 61081 agagcccaga ttcttaacca cagcactggc ctgcctaaac ctctgcccct ctgccctctg 61141 gacagcccct aagtgggcaa caagcaccct gaggagtccc tttccaccac cacatgcacc 61201 ttgaccaggc cacggctgaa gagcatctcc acgatctcct tgaggatggg caggatgccg 61261 ctatggtgca cacccaggcc gcgattcagg agctctgaca tgtgcaggac ctttggtggg 61321 aggaggccca tggtcaggga tggagtctcg gaacacccta tccccaccag tctgccaaat 61381 gtgtgcatgc acgcacacag acgcacacag acgcacctgg ggcagctggc ggtcagagcc 61441 acggaggcga gcaaggcagc gctgcaggaa gaggtggatc tcgctcttct ccgaactggt 61501 ggtgaggtca agggaggtga ggcctgaggc ctgctcatca cagcggcccc gggagaaggt 61561 gaacaccacc acgggcaact gggcacgtgt gcggagggag gccaggaggg acaggtacac 61621 tccgcggtcc tggagaagga aggggaaggg gaaggagcag aggttgagtt cctgaaccaa 61681 tgagaggtga gctagtgtta actggggcag tgctcaaagc tcaagtcaaa acccctgggg 61741 gacaagggga aaaaaaagat aggaggaaaa acaggtgctg gcaggtacaa aaccctccca 61801 gttctcacct gtgcagggcc cccctgatgt gtgggctgct tggccccaaa ggtctgggcg 61861 tgtttgctca ttctctcctt cttggcctcc acagctgcat agtacctggg gcacagggag 61921 gggtagccac aatgtccagc tgggggccca gccctaactc tttcccccat ctcgaggctt 61981 acccttttgt atggaaggct cctcgggagt ccagcaacaa aaagagctcc ccctgggtct 62041 tggagctgtt ccctgtgaaa agatagtgct ccaggggcac ggggcgggtt acagtgctaa 62101 tcacatagat ctgacgacgc ttcagccgcc tgaagaaagg agagggaagc aggtcagggg 62161 tgggaacgtg ggaaggaacc cgcatcccca cccccttccc tacccctcca actttaccca 62221 gtcttttctg agacccagaa tggaaggcta tcccccaaca tctcatccca tttcaacatg 62281 caaagtaatc tggacccaaa ggggatccca cagccaccag gtgggacatg ttccccagca 62341 tcacccctaa ggcctgaccg ctctccttgc tgtggtagac ttagtccccc acccgaatca 62401 aggagtactg tagccccctt cacccaggca acccgggaca cacgtctcac ccaatccagt 62461 cagcaaactc aagggcgttg gggacggtgg cactcagaag gatgatagaa acgtggtcag 62521 gtagcatgat aagcacctcc tcccacacga ccccacgctg ggcacagaga gggaagggag 62581 gtcacatgag ggcaggggcc gcccttctgc ccataaagag gcacaggatt ccatatgggg 62641 gtaaagaaag cagtaagggg ccctgagtgt gtggaataag ggagacgctc aactggtccc 62701 aaaggagagg actgccgggt tctggggagc ccatggccct tacctcgaca tcgttgatat 62761 agtgaacctc atcaaagatg acccactcca ggtcccgaat aacatctgag ccactgtaca 62821 gcatggagct ggggagaaga ggccaaggat tgactctcca gtggctcgtc tccaccccct 62881 cctcagacag cacactctcc acaacggcca catcttccca gccaaaactc ccctgtattg 62941 agtgtccatc tctcaccgaa ggatctctgt ggtcatgatg aggcaggagg cctccggatg 63001 cagctgtaca tccccggtga gcagccccac atccccgaat gtgtttcgga agtcccggaa 63061 cttctggttg ctcagggcct tgatgggcga agtgtagatg gtgctgggaa gagagcatga 63121 agttagccag tccctccgca gactcatggc ccgcacttcc tctcccacat ggagccctcg 63181 tggaagcatg ccccagaagc tcaggtgctc ctcttcttca cccaaactgc ccttacattc 63241 tcctgcatga tataacccca gaaaaattct gtccccaacc ccttcctaaa ctaacccagg 63301 tcattcttct aaactttctc cccaccatgt cttttcattt ctctctgggc tcttgtctca 63361 gtcttagccc tgaacctctc acttaaagga tgagctagag aggtggggga agagatgaga 63421 ttttcaaagg atgcaggaga aaatggggct ggctggtgaa gggggaggtt ggcaaaggaa 63481 ctcataccgt gtcatgtgtt tctgggccag ggcaatggca tattcagcca caactgtttt 63541 tcctgcagat gtgtgagctg cgacaaagac agagtcatgc cgttccaagt gcaggatggc 63601 ctgtttctga aacacatctg gctcaaatgc ccactatggg agagagaaat agacaggagc 63661 tgaagaaagg agcggggcct tgctgcctcc tgctcatgga gcacgcaggg cgggcggatg 63721 aaggccggag ccagccgagg ctgggactga gtaccaagac tggcctggga tggatctgac 63781 ctctggccaa tagaggagac aaggggtcgg ctgggagtgt gacccagaaa gaggtagagg 63841 agcgtgtgaa gatggggcca aagtacctgg aaggctggct ggggaatgag gcgatagaaa 63901 tcaccaacag gggaggtggc gtccacaggg atggcccact gctcctgaga tggaggctct 63961 ggggcctctg gggtggatac agctgtggac gcttcctgga agagtcaggg gtagcagtga 64021 taaagataca accagagggc tctcaattag cactcctccc aaaagatgtc cctttcttcc 64081 cctcagaccc atcctagcca gcccacatcc tggtggcaag gctctttctt gcctccacta 64141 cacagaacca ccaaccttca acactaggtc ttccaagctg cttgctcggg ccaggggagc 64201 actgcaggga gaggctgaaa cagtgtcccc tctgggacct cctggctgtc ccactgcctc 64261 attctcatcc tcgtcacccc cacccaaatc cagaggctcc aacagacagc taaggcttag 64321 tagtccagga gctggagttg gacaatctga ggagcaagcc aacaaggtca accttgtcat 64381 gtccatctct gttccttagg agaaggacat gacttctcct acaccccact caaaaactaa 64441 aactaacctt ttggtgcaaa gtccatgcct ttcttgaaac caggtggaat agtaagaaga 64501 tctgtaggat agggacatgg aatcaggtca ctgcacactg gtgaacaaat tgtgtacatt 64561 atataaacct aaaagatacc atttacagga cagatgctgt agatagggat gtttgctatg 64621 acactttccc aacagatgac agtaaaggtt gttgtagaaa tttcccagca gatgacagta 64681 aaggttgtta tggacagaat atctttttct aattttctca aaaacatggg agggctagca 64741 gtagcccagt gatagcctgg gctcttcctc ctcaaggctc agactcagag ccccacctta 64801 cctttctcaa agtctatctc ctcctcagcc tcctcccgtg tgttcagatc tgttatggtg 64861 ggttcatcca tcccccctgg gaagagacag gacagacagg gattcatggg atgagggtaa 64921 tacgagatag gtggggaacc cttccctgga gattaaagac accctcttct accatcccat 64981 ctccacaaga gtcacctggc cagaagggat accgagttgg atttccccat aaggactggg 65041 aggctggccc tggaggccgg cgaagagaca aggaggttgt agccgagaga tttgtgttct 65101 ccagcaagac ctaaggcaag agatcagagc cccaatgagg ctgagcttgt ctctgaccac 65161 ctccccttct ccccaactgt ttctcatgac ccctgacctc ctacctcttt gtaacccagt 65221 atctggcctg tggttgggtg tctttgggcc tgtaggtcgg atgggactgg ggctcccagg 65281 acagccaaaa gagaccaggg atccgtcttc ctctgccatt ttctggagag aaaaatagta 65341 acttttgggt cttctttctt cctctgtccc agcttccttc aggaacccat caccctagtc 65401 taagcccctc cagactcctc accgggctga gtgctccaca ccatgcagag gcagccaggc 65461 tggggatgac agaaacaact gttctgcttc ttgctgcaga tctggggcac aaggagggag 65521 gccatgggga agctgaaaca agggcaagat gagaggcatt aggtaaggaa agaggataaa 65581 atgcagacaa aagagatgat gggacttgtg ttcagatcag aaatggcaga tgtgtcaaac 65641 agactaggac aatcaggctt ggccaaaaac tacctgctac tgaattacct ggggcagatt 65701 ttggatttca gaaacatgga ggctggacct aggaatgtgc atttttaaaa agctccctgg 65761 ctaatcatga agcacactga aattggagac tcactgattt tagaaaaagg ggtcccagac 65821 ctaaggatga aaggttggag gctcaaagcc tatcatctca gtcaccaagt gccctggtgc 65881 ttggatacag aagggagctc cacggagaaa tgccacattt gaaagagcag ctctcaaagc 65941 aaaaatatcc ccagcccttt ctgctctgtc ctcacacgta ctgcaggtac actgggctgt 66001 aatctgtcca gcaccccttc aggctgggat ggacaggagt acaggcacag atgaacaagg 66061 ttaagctaga tcctactgtg aggagggtga aggaggtttg gactcaatgc gggtcaaagg 66121 ttagggtcaa aagtcactca cgctactctc tggagctcca ggcaagttca gcagctccca 66181 gtgccccgtg catccgagct ccacggcccg aaggggtagg tccaggggat ctggaggggg 66241 tagcactacg gggaaaggtc aaaagtcagg ggtcaaagtt cattccatat cggctactcc 66301 gttccatttc cctcccctcc ccctcaccaa gtcgctctgt ctccatcatc ctggagccac 66361 agcagctgcc tggcagcccg gaagtgcggc aagtagtcgc tgcgaagtaa gccccgcccc 66421 ggaacgggcg gaagtagagg caacttccgg tacagccccg ccgagagcgt gaactatcgc 66481 tgcggagggg cagacctgga ccggtggaag gccgggcgga agtgcgcgcc tggggccgcc 66541 ttggttaccg cgttttccgc tcctcgctac gtcatcgttg tgagcccgct atcagcggcc 66601 agcgcgggcg cggccggaga ccgtggggcc cccggttgcc gccccctcgg gtaaggccct 66661 ctgcttctca ctcttcggcc ctttttctca tagccgtttc tctgttcttc ctctttgcgt 66721 tgctctgcgg gcgctacgct ggctccaccg ccttctctgc cgttcaaacc ccgcggttgt 66781 ccttacccta gcgagggtag gggggtcggg tgccatcgtc tttccgacgg agtggatatt 66841 tgtccctcct tgaaccacat ggtacctaaa ggtgctggtg tctgtgatcc ctggagacag 66901 gagggaatgc tggatgatcc cgaccagcgg gagcttggac agcagcctgg tttaagcaag 66961 gggtagggaa agccaaagac aagagtaggc agacctgaag gggtggggtt gggtacagtg 67021 tggacggcgt gtgaaccccg ggtggtaaca gtggagaaag atgtcttggg ccctgcccct 67081 gaactaggag ccaccatgtt ggtgataccc cccggactga gcgaggaaga ggaggctctg 67141 cagaagaaat tcaacaagct caagaaaaag gtgagggact gtgtgtggac atggcctaac 67201 ctttcacaga ctctgcactc tgagaagata tggggtgagt gtgagcgaat ggggaagttt 67261 ttctggccca tacctagaga catatcacct cacagtgtag cttcgagagg tgggctttcc 67321 caagccttgg tgtggacctt ctgtcttttc tatgcaattt gccatggatt gtcttccttt 67381 cccatggctt ttagtaccgc ttaagagctg atgatcccca agtttgtttt ttccagttct 67441 atccagtgtt atactgaaaa acttcagata ctcagtgttg aagctatcat ctttctattc 67501 ttgatcttcc aaataagatt gactacacca attctgccac atgtttggac tcaaaactct 67561 gacttcccca ctatttgcat agtcaaacct ttgccaagtc ttattgtttc ttcttttgaa 67621 gtaatttttc ctgtttattc ttcactgtgc attcccccga gcagggcttt attgacattc 67681 atgtgggcca ttgtaatggt ctctcaacaa atttctcagt cttttttttt gaacagactt 67741 cttcactcct gttcaaatga tccattgaaa acacaaaatt ctgctccctc tgtttctcaa 67801 aaaccataaa tggcttcaca ttgtctataa aaaagcctca aaaccttagt ttggcagcct 67861 tgtctagtaa agtagaacta ttcactactt ccaaaagatg cattcccaaa gccttcttat 67921 ctctgcatct ttgccaccct gttttctttt tctaaaatac ccttctcctc tctgtgttgt 67981 gcagagatac gggagagggc ctagtttaat agccattaaa catcagaaaa tcttactagc 68041 tttcagctat gaataatctg cacctcttat aattgtattg ctaactcaat gtattttatt 68101 tctcagactg taagcttctt gagggtagga acaatgcatt agcctcctgg atatccctgt 68161 tacgaagtca tggctgggcc tcattcaata tgcaggaaat agatggatga agcttccagg 68221 aaaggcacat aaacttatac ttacttgcag cttatatcct aggattctag tccatgattg 68281 agttatttgt cttcaacttc aaaaatgtaa cactctggaa aacctctggg ttccgtggtt 68341 tggttagatt aggaatttcc agcatcatac atgatctagg ggaccagagg caagaggcag 68401 agcacatcag tgtaccactc tgagggcatc aagaaagatg gtccctggag tgatgtgccc 68461 aggggcctta ggagaggaac cggctagggg ctggccctca cttacctctc tcttctcacc 68521 cttgttccca gaaaaaggca ttgctggctc tgaagaagca aagtagcagc agcacaacca 68581 gccaaggtgg tgtcaaacgc tgtgagtgac aggggaaatg gggatggact ggaagtgggc 68641 agcatggagc tgaccttcat catggcttgg ccaacataat gcctcttccc cttgtctctc 68701 cagcactatc agagcagcct gtcatggaca cagccacagc aacagagcag gcaaagcagc 68761 tggtgaagtc aggagccatc agtgccatca aggctgagac caagaactca ggcttcaagc 68821 gttctcgaac ccttgagggg aagttaaagg tgagcacaag cagaaagatt gtttaaaggg 68881 catccctcca agttggaatg tagatgggtt tggggaaaaa tatgcatact caagcactgg 68941 caaattgctg taccattctg agtttcacct ctcatgtata aaatgaaaaa actagacgac 69001 tcctgaattc tccattttac aaattctgtg gttctaatcc accttggatt ggagctcttg 69061 gaatatgttg ccctaaggga tggaaagggc tgaaaatgtt atcaggttca agagaggttc 69121 aaataaactg aatacataat atatcgagtc acttataaaa aataccagct gggcgtggtg 69181 gctcatgcct gtaatcccag cactttggga ggctgaggtg ggcagatcac ttgagattag 69241 gagtttaaga ccaacctggc caacatggta aaaccccgtc tctactaaaa atacaaaaat 69301 cagcagggtg tgaggccggg cacggtggct cacacctgta atcccagcac tttgggaggc 69361 cgaggcaggc ggatcacgag gtcaggagat cgagaccatc ctggctaaca aggtgaaacc 69421 ccatctctac taaaaataca aaaattagcc gggcgtggtg gcgggcgcct gtagtcccag 69481 ctacttggga ggctgaggca ggagaattgc ttgaacccag gaggcggagc ttgcagtgag 69541 tcgagattgt gccactgcac tccagcctgg gtgacagcga gactccgtct caaaaaaaaa 69601 aaaaattagc agaatgtggt ggtgcacact tatagtccca gttactgagg aggctgaggc 69661 cagagaatca cttgaaccca ggaggcagag gttgcagtga gtggagatca caccactgca 69721 ctccagcctg gacgacagag caagattctg tctcaaaaac aaaacaaaac aagccacaat 69781 ggcaggtgcg gcaatggctc attcctggtt aatttcagca ctttaggagg ccgaggtagg 69841 aggattgctt gagcccagga gtttgagacc agccctggca acatagtgaa accctatctc 69901 tacaaaaaaa aaaaaaaata cagaaattag ccaagcatgg tagagcacat ctctagtccc 69961 agctactagg gaggctgagg taggaggatt gcttgagcat gggaggtcaa ggctgcagca 70021 agctacgatt gctctattgc actccagcct ggacaacaga gcaggaccgt gtctcaaaaa 70081 aaaaaaaaaa aaaataagta aaaccacgtc aaataagaaa tgtatgtaga attaaaaggg 70141 aagaaaagat cagaggaaac ctagaagagg gtgagggaag aggaacacct ctggtccatg 70201 gttgggcaac ctcctgctcc acactgaggt aggtctctca ccactccagg accccgagaa 70261 gggaccagtc cccactttcc agccgttcca gaggagcata tctgctgatg atgacctgca 70321 agaggtaaag gctcctaatt tctgtccctg aggggttggg gatggaggaa tatgagagac 70381 aaatatattt tggaccacat gtcagcaact ggaaatcatc ttaactgttc ttggtctctt 70441 ttacagtcat ccagacgtcc ccagaggaaa tctctgtatg agaggttaga gagtagagat 70501 aaggctgggc ccatttggtg gagttgattg cctgagtcct tgggaggttt gggctgtagg 70561 agctgggcta tagagggaat attgagtctc cacagggtct gaggggagaa ggtaaaaccc 70621 ccctttgatc ttcagtctgc atctccccag ctttgtgtct tctagtgatc gacttcgaga 70681 actaggacca gatggagaag aggcagaggg cccaggggct ggtgatggtc cccctcgaag 70741 ctttgactgg ggctatgaag aacgcagtgg tgcccactcc tcagcctccc ctccccgaag 70801 ccgcagccgg gaccgcagcc atgagaggaa ccgggacaga gaccgagatc gggagcggga 70861 tcgagaccgg gatcgagaca gagacagaga gcgggacagg gatcgggatc gggatcgaga 70921 tcgagaccgg gaacgggaca gggatcggga gcgggatcga gaccgagacc gagagggtcc 70981 tttccgcagt gagtgatttt ggctggaggt caaggtgacc ttaactgagg tttatgtggg 71041 tcctactaag tgaaatgtgg catgggctat gtcttgtgac tatgatttgt gctcccaaag 71101 ggtcggattc attccctgaa cggcgagccc ctaggaaagg gaatactctc tatgtatatg 71161 gagaagacat gacacccacc cttctccgtg gggccttctc tccttttgga aacatcattg 71221 acctctccat ggacccaccc agaaagtaag gatgacaaca gggcatgatg agaagtcctg 71281 ggagaatcct ggggtgtgag acctgaggga agaagctgcc ctccctgcag ccgcctatta 71341 tcactgggtt tggttgggtc cagaagagcc ccttggctct ccactgaccg tgttttctct 71401 tatcccagct gtgccttcgt cacctatgaa aagatggagt cagcagatca ggccgttgct 71461 gaggttggaa cttcccaaat ctgttcttcc catcgcttct gctgttctca tgtgttctcc 71521 tcaaaatact agatgccagg aagaagttgg ttcctcttga taaagaaagc ctctccctat 71581 tcacacagaa aaactcacat tctcagattc cttattcagt ctcctcctag gatagcaact 71641 cctggaataa cttctttggt ttatctgtaa tgttttcctc cccatcctct gcctcccctc 71701 ttctgcttca gctcaacggg acccaggtgg agtctgtaca gctcaaagtc aacatagccc 71761 gaaaacagcc catgctggat gccgctactg gcaagtctgt ctggggctcc ctcggtaaga 71821 atgggattct tcctttccca tcctctcccc cacaggccat tccttttggt tcccccacac 71881 atccttgggt ttcctagaga tgatcgaggt cagagtgtgt gcggaggcta tgggaatggg 71941 aaggaaaatt tcagatcctc tgtcagttag aggtaaagaa aggagagaaa aggattgaaa 72001 acagcttcca aagcagtgac tggaacaggg aagaaagtgg gagctggagt gagattggtg 72061 aggaagatga gtgtctcatg gggcttgttc attttgagat ggagtagagt gagtttcata 72121 agtattcgaa atcagggatc tggaactcag gtgtgaggta aggaaggaaa acatagatta 72181 agtcatccac atcacagggt ggctgccaaa caataggatc atcagggaga ctctgatttc 72241 agagaaaaaa aaacgaccca ggactcagtt tgaggagtac tctgggacta tgttaggaaa 72301 caggactgtt cacctcacac cccacatgtc cccagctgtc ctgacatgtt aggctaacgt 72361 tgagcctaac gttagcaaag gagacaggag gaggggtcgg agggctggga gcagcaccgc 72421 aggggcctgg gccactgaag caaaaggaca cacactttga gaaagaggga gtagccagca 72481 gtgttgaggc acagagatca caggatggga gtgggtgaga gagattctct gtagctcaag 72541 ggtggtgggg atggaaccat tggatggggt gaagagagta acatgtctgg tggagggagg 72601 aaagaggaag gggaagaaac agctagaggc ttaagagaga atggtgaggg ccaaagctac 72661 accctgaatg agttctggtg gagctagtag catttcttag tgtgaataat ccattttccc 72721 tgaaagtaga ttttcctggg aaaggagtga gcagaaagaa gggctcagct acagtggccc 72781 ttcaggcaaa agaaaggaac tagaattgac cagcatgtca aaaaagggct acagaggttt 72841 tctagttttt agcttctgac atactgactg taagtagtgg gttaatatca ttcacgtgtc 72901 tcaacaatga catttgggat ttttctaaga acaaaacatt catcaaaatg tccacttata 72961 cttttcttgg ccatgggttg gcacaaagga gctaaggaag acaggccatc ctggccacca 73021 gagggcagcg tggaaactgg gcttccaggg gccagtggcc aggagtgagg tggtcaggag 73081 tcagcctcag ggtctgttct atgatctcct ttagacctta actgttcctc tcctccctcc 73141 ctagctgtcc agaacagccc taagggttgc caccgggaca agaggaccca gattgtctac 73201 agtgatgacg tctacaagga aaaccttgtg gatggcttct agggaacaga gctggattcc 73261 ttgtgcctca tatgccccaa tgctggtctc agtaaaacac tgaggtggaa gcttacacat 73321 ctccctcagc ctctggtttt tcagcacttg ggattggggt taagccttta aaaacggctg 73381 tcaggtttga tctcagtgta acgacatggc cagtgcctgt tccccactcc cttgccccaa 73441 aaggatctgg aacacaggtg ttgtcgcagc tgttttaatt caatcccacg cccctgtcca 73501 gcaggaaacc ccttatagaa aacccaaatc ctcatcttgg agtttctcct tcagccaggg 73561 cagcacttga aagaggttga tgtgaaagtc tcgggcgtga gcaggtacct gcttttgccg 73621 cttctggttt ttgcagacat ccactactcc ccagctgatt acaccaacct gcagaggcag 73681 tggggtgcca tggatcattc agtagaattc ctaatcctgg aagcatggct gttcctgctt 73741 gtgtcttagc tgacctaaag gaatcagact agggacccag ctcagcctcg ttcttgacac 73801 acgcaggcaa gacatgcggt cctaaggtga ggcaggctct gcctacctcg aattactagc 73861 cacatgcatt gagctttcct gctttggggc ccatgctgac cacttggcat ctccccagat 73921 aggaaaggga ggactcactt gaatgaaacg acttctcttg tgaactatca aggggccgcc 73981 agaatcacct gcaaggagag gagaagctgt agagaaaagg actgttgggc cttgggcact 74041 tgtagcacaa ccaaagagca ttctctcacc tctgcaagta ttggggtcag catagggact 74101 cactcctcca gtacaaagga accgaggggt gaccacctct gagatgtcct tgactttgtc 74161 atagcctggg gcatattgag catctctctc acagctgcct ttctgtaggg gaggtgggaa 74221 gcatggagaa gtaatgaaca gaagtggctt aggaaggatt ggggcctaga gtgcctcctt 74281 aggatgcccg tttctcacct tatccccatt cttgatgtag acctccttcc gagtcagctt 74341 tttctcctcc tcagacacaa acagagcttt gatatcctgt gcagggagca gctcttcctc 74401 tggacgaata gactgcgtca cttcagctgc tcccaccact gtcatctccc cattgccttt 74461 gtcactcagt agatgtcccc taagcccttc tagagctagg ttctgggcca ggcatcatct 74521 cttcctattg cacctccctc tcatgcccca cttgtctctt gggatctcat ccttatcctc 74581 ttgccaagta tgtcttactt tgttgctggc aagtggtagt tggaggaagc ctcaaagctc 74641 gagttgttcc ctcggtgcag gggagacaaa tgggcctata aaggacaagg agaacagcaa 74701 accaggcctg ctcctcaccc cagtcctcca gcctttccca gcctttcctc agggatctgg 74761 acgctctcac ctgatagtct ggccatattt cagcttattc ttgagcttga tcagggcaac 74821 gtcatagtca taaaattcag gaattcctgc ttcttttttc ccattaatgt tgtagttggg 74881 gtgaaatagg actacttcta tctccaggtc ccgcttctcc cctcctgaag taggagagta 74941 ggtaccacct ctttgtgggc agcttcctgc ctctggcccc gtgtcattcc taacttcacg 75001 tcttccccca tccctgactg gtctggggtg caaggaatgg ggctgcattg aggatgggtg 75061 gagtgtagga tctgtagaaa gtgggaggtg ttgcctggag agcataggtg cagcccagga 75121 ccttcagttg catccttacc tacgctgacc ttgattgagt gttccttgtc atccacagtg 75181 aaacaatgtg ctgctgtcag cacaaagtac tcagacacca cagcccccat acagctctcg 75241 tgtccctttg aagggcgctg gggacacaac agtagagagg gaaagctcaa ctttcacaaa 75301 ccaccatctc ttatggccat tttttccttc ccctactccc atttcacctt gacctcacct 75361 cccccaagtc cccactactg ggattctgtg cttacaatga ctgagatctt ggcctgccat 75421 ggttgcttgt ggtaatcggt acccttcctg tgttcccaaa ccatgccaca gagactcaga 75481 gactggcttt catctggcag agaaggagaa tgtgctgaaa actcggaaca cttcattccc 75541 aaatggcagc agccttttgg gtaggagtct atggggagca aaggcccaaa aggagaaagg 75601 gaaagaccac gggtatttgt ttcctcgtgt taggagaaaa ctgctatggt ggactgaaaa 75661 gggaaagaac ttgggatcaa gctgtcagag gcctggctgt tttcaagccc acccttgttg 75721 ctaacttgct gtcccttgac ctcctttggc ttctgtttcc acatgtgtca aatgtaggac 75781 tatagaaaga tctctgaagt gctcagagct ctgtgattct aaggttaagt gaacagtgcc 75841 aggaaacaag aatagtgaca ctgaggtaga gaaggaggaa tgaagaaggc tttccaggca 75901 actagagctt caggtgtaga ggaagaatga attacttcag gggaacctga ggagagttgt 75961 gttctttatt cccttgtatc tccctaccga tcatttggta gaaaacatct tccaggtttt 76021 ccatatcctt gactttgaac acatgttgct cattgtcttt cttggaagcc aaagcattga 76081 tgttcacttg gttcaccaaa ggcccgaccc caaacacata gacatctgag ggataaaaag 76141 gaaggatgag ggtccaagcc ctgaggaagt ggggtgctgg gtcctaggca ggttactcac 76201 ccagataatc ctcccttggg tttttgcgat ccttgccaat gtatagcaag tcccggatct 76261 catcaatgac agtaattggg tccccgccca tgttgtgcaa tcctgcagaa gagacaggac 76321 catgagggta ggagataagg aagataaact ggctaaaggc agggatacac gtggcagggt 76381 gagcaagttg aggaagggtt agagatagtt gatcacaggg cttaggaaga attccttatg 76441 aagggtcaca ggagagatga acagccagct atgagtcaca ttcagggccc caaccatggg 76501 tatagtgtta caagtggact taagggccac atgctggtct gagaaggtgg ggaggctggg 76561 acaggagaga ggtcccttct gaccatcagt catgaggatg atgacatggc gggtgcggtt 76621 ccagccttca ggagggacgt catctggcca gctcatcatg ctgtacactg cctggagggc 76681 cttcttggtg ttagtccctg acttcaactt gtggtctgtg gagagggaag agaccatcac 76741 ctcacctggt cttccaagcc atcttttaac cccagagacc aatctgcaac tgaaatccca 76801 catctttcaa ctttttaagt taatcatcat tacacggact tccctttgac cacaaagtgg 76861 cccttccagc ccccaacagg ttcccactaa cctccattgc ccaacgatcc tgctgttcaa 76921 cctttgacta caaagtggtc ctccctgtcg ccctcaaggt agtctcatga ccccctccac 76981 cctgaacctc ctgaccccaa agtgaacctc ccaccattcc ctaacctctg accttcataa 77041 ttgatttcat tgagctgctt cgtgacccag tctgcattac tgctgtctgc ttcagacact 77101 ttgacccaaa ttttggggta tgtggcatat gtcactagac catatcttgg cttcacacca 77161 taacttgcca cctgtgggtg aggagaacaa ggcgccatgg cattgaaaat agaatactgt 77221 gattgggaga tttcagcgac tttgctggga cagggaggcc ctcagtaaga ggctcaaggg 77281 ctgaggttca tggaggaatt accagtcaag aaactgcttc aagtaaaagg gaaggaggac 77341 agaataggac ctggagattt tcctggggcc ctgttgttca gaggggctgg aatgatcagg 77401 gagctagtcc tggaagatca gcgagattcc attcccccga gttcagggat aggaggattc 77461 caccttctca attaagttga ctagacactt tttggctcct gtgaagttgc tggccccaat 77521 gctgtctgat ccatctagca ccaggtagat gttcatggag cctgaagggt ccaggacgat 77581 cttccgcttc tgttgttccc ctgggtgcca ggagagtggc tcaggctcca gcattaacag 77641 ttctgtccct tctccatttt cccccagttc cctgccctgc ctcccttctc tgtcttcaaa 77701 cctgggccgt gcccatcctc agcatcgact ccttctatgg tctctgtcag ggaagacagg 77761 aaagcttcgg ccacctcttg aggggtgtcg tacatgaagg agtctgggag agtcagaaat 77821 gaggtcaaat gtctgggagt gtcagggata cagtgaccaa agagacgggg gatcatgggt 77881 ttccagggta taaaaggctc agaagtgaga tagttgtaca gggaggttta aacaaagtga 77941 ggaaagagca gggttgaggt ggggagagaa gacagtagga tggaagacca ggatctgacc 78001 tgggggtaca ggtcaaaggt caccttggca ggaaggctcc gtcccgctcc aagagccacc 78061 ttcctgacac gttcgccgct gggagccacg cagggtaagc ccccggctgc agtggtaggt 78121 gacgctgtct tcaaggcggt actggctgcc caccttcctt gtgccaatgg ggatgcccgg 78181 gttggagcag taccccgctg cagaggtatg agacatcgag gtaagcactg aagcctgagg 78241 ccccgtgagc aaggtagaga gcaagagtta cagtgtccgg agccgagtgc ccactcctcg 78301 ggctgggcgc cgtcagggag acagcaatgt agggggaggg gatgcttctc acctccgttg 78361 tcacagatcg ctgtctgccc actccaccgg ccattcactt ggcaggtgcg attggcagag 78421 ccccggagag tgtaaccgtc atagcagtgg aaagagatct catcactcac attgtagtag 78481 ggagaccggg gccagtattc cccgttctcg aagtcgtgtg gtcttggaca gtggattgct 78541 ttgagaaggg gggacaagta gaagtcatca agagggaaag gctgccttag gtgtatccct 78601 cctggtctcg gagaccatgt cactgagaaa cagcgcattc ccagtcccgc agaagcagca 78661 tcttacctac tcctcaaccc atatggattt ccactgcttc tccctcccca tttctgagtg 78721 ttctcttgac ttccagggct gcctggaagc ccagggtaaa tgcttagtaa gggttaactc 78781 cgctttttct tgcccccttt ccgcctgcca ccctaaaact gctcctactc ccggtcagcc 78841 caccttgtca ccctgcctag tctcatccta gtcctgacct tgctgccgcc tgccctgttt 78901 ctgccttagg ccactgccca cactcattgc cctcaaacct ctgcactctg ccttcctgac 78961 agtcttttgg tcttgagtct tcagggtgct ccaggacccc gtagatctgc aggtacgtgt 79021 ctgcacaggg tacgggtaga agccagaagg acacacgtac tccagtgcct ggccctcttg 79081 gagaagtcgg aaggagccgc ctttgatctc taccccctcc agagagcagg atccctgggg 79141 ctgggccaaa gaccatggag tggtggtcac acctgaagag aaaggctgat gaagcctggc 79201 cccaaaaggc caaggaggga tgctggagac agcaggaagg gaaggttacc ctcgcttacc 79261 tccagacaag aggcccaaga taaagggcat caggcagagt tgggggctga gattgctccc 79321 catggcgttg gaaggcagga gagaagctgg gcctggggca ggatggtgtg tcctggcttg 79381 ctttgcttgt ctgcttggct cagtgtccaa gctgaaactc cagacctaga cctggtcaca 79441 ttcccttccc ctgctcccca ccagccccca gccttttata caatctgtgt tctggcacct 79501 gcggctcgcc ccgcctgtcc tacccacatc actttcccgg aacatccaag cgggagggcc 79561 ccgctgagct gccagtcaag gaaacagaaa ctgcagaagt cccacccttt gctgccaaag 79621 gtccaggact ctccccttca gtacctcctc tccggcctta gctcctcccc agtaagccca 79681 aaccacccac ttagggacca gaaataagga tccagctcac tcccctgttg attgtgtgtt 79741 atggtgcaga gtccagccac tgtttgtcca gtggggtctc tgacctgcct tcctgtagct 79801 cttggagtca ttctggcctc cccctccccc aaggccagcc ctacctggcc tccagataga 79861 ggcactgaga gatgtggaaa ccattgattt ttattaattt cataactggg aaattccatg 79921 tgaaagtgaa acaagcatga gtcaagtcaa ccagggaagg aatctgggga caggccaagg 79981 agcgggaggt ggggcagcga ggcagtcctg ctggtaggag ccctgaggat ttcccagctt 80041 gtgtgcgctg cctctggcat cctagagacc cggatttact cagctaggag agaggatgga 80101 tcacagggtc taagggtggc cattcagagg tagaagatgg aggggcggca gattctggca 80161 gggcagcaga gggctcagtg gccatggcta gaggggtaaa aaattcagga catcccccag 80221 gtgctgcctc agccagggct gcatgcggaa gagattgatg tgaaagtctc gtggcggcgg 80281 gaccttgcta cgaggggccc ttttgcggga gtttttgtca gcagagccaa ggcaggggtt 80341 gtaaagaccc cagctcacca gacccacctg gagaaaagga gagaactggc tgggaggctg 80401 ccacagcccc aactgtaagc cccaacctcc ctagatccct caaaggcctc cctcatcccc 80461 ccaacaaggc tgagatcctg taacccctgg gtcctgcaag cttctacctt ctcacctgaa 80521 aaaacctgaa tctccgctca aggaaaactg ctcccccaga ttctcctgcc agcaaagggt 80581 gacaggagac agtgtcatct agggcagtgg actggtgttc tggcatgcat gctggccaca 80641 gagacacagg tggccttccc cttgggaatc caggcatggt gagggactca cccttgcagg 80701 gactctcatc ctcctgggtc ccactgcata ggaactggtc tgtcaccacc tccctgacat 80761 ctgtcaagtt ggggaacatg gttttttctt gggagacaac ctcggcacag cttgtccact 80821 gcagcagggg tggaaccaga gagaagggta gaataagcag acccgggtgg gtgcagtggc 80881 tcacgcctgt aatctcagca ctttgggagg ttgaggcagg cagatcactt gaggtcagga 80941 gttcaagacc atcctggcca acatggcgaa atcccgtctc tactaaaaat acaaaaatta 81001 gctggtgtgg tggtgggcac ctgtaatctc agctactcgg gaggctgaag taggagaatc 81061 gcttgaaccc aggaggcaga agttgcagtg agctgaggtc gtgccactgc actccagcct 81121 gggagacaga gcaagactcc atctcaataa caacaaaaaa agaataagca gacccaggga 81181 cacctagcag gtttgtgagg cttgggagga ggggtgaccc gcgtcagaga gagaaagcag 81241 ccagacctgg agaaggaaga gagacattct gggagctgtc acagggggat cccagcatcc 81301 ccaacctgag accctcacct ccactcccat cttaaggtta atgttcagtt tgctcccatt 81361 caaggcgaca aaatgagcag gaacactctg tttgttcagc agttcattct ctatagtcaa 81421 gagaagggga tgttggtgct ggtcctttta cccagaatcc aggttcctgg cttggggact 81481 gcagcctagt ccttgttatc acccccaaac cccggcccca gctctcaagc accataagtc 81541 ccagcactca ccatggtccc tacaggtgct gccttgaggt ctccgcagag ccagattggc 81601 ctccatcgtg cagggaaggc agatgggcct ggggagaaga gcaagggtca caccagcctg 81661 tccccagtat ctcttccagg gatctccaga gcactcttcc ctgcagggca ccctcccatc 81721 ccagactcca ggcacctggc atgggtggac atctttactt tctgggccag cttcagcaga 81781 gctatgtcat caccatagaa ctccaggatt ccctggttct ttttggcaaa gacatcaaac 81841 cctggggaga tcaccgcctt ctcaataagg aattctttgc cccactggga tttggggtct 81901 cctggaaatg atacactaga ttaggctaga ccagggctcc tgcaggggcc agaggctggg 81961 tgaggtggta ggatctgtgg cttcaggatc aggaggctgg tgcatcccct gccttaccca 82021 cattgaccct ccacagggag tggtcgttgc catcgcggaa gcaatgagct gctgtcagga 82081 cccattggtc ggagatgagg gccccccggc aggtctcttg gctcttgggc tgcaggggaa 82141 caggtgattt tcagagattg cagtatgtct ggcccatggc cgcttttacc tctggaatcc 82201 aagccctgcc cctccttcct ggtaccttaa tagtgacatg ccagggtgtc ctctcctggt 82261 cagaggcgtt tgctgacatg ttccccaccc cgcagatggt gtctgtgagc ttggagacat 82321 ctgtgggtgt gaggatcaga tggggaagga ggcaagtgag gggcactgtg tccaggttcc 82381 caaaacgggc ctttggcggg ctcctcacca tcctccccac accaaggagg gcaaagctca 82441 ctcacccagc atatgttcaa agacctggtg cagagccttt gtgtcctgca gaatgaaggc 82501 atgcctctca ccatccttct tggaccctag ctcattcagt tctctccagt ccacatccag 82561 cttgcccacc ccgatggcat agatgtctgg tggggaagag ggaaatcacc agactcctgt 82621 ggctttgggg ctaccccatg agacaggagg ctgtcatctg aaactcactg tgtccaatca 82681 agacctacat gagctggacc cctgcgtcct ccccactgct acctgtctgc cttcatttcc 82741 tgccactccc tgcccttcac tctcctgcag cacacagcct ctttgaagtt cctcaaatcc 82801 ataggcatgg tcacacctca ggccctttgc ccagctgtgc ctctgcctag ttcactcctc 82861 cccccaagac ttccacatgg ctcactttcg taccttttta agtcttggct caaatgtcac 82921 cttctcagtg aggccttccc tggtcttcct gtctaaaact gcaatgcccc agacaaactt 82981 tcatccccac tttgggaggc aaggtgggag gatcccttga agccagaagt ttgagaccag 83041 cctgggcaac atggcaacac cccttagctt gtgtcaccta ccacctgctg ggttctatgg 83101 ttttcttatc ctgtttattc cctgtaatgg tggaattgtg tcccccagaa agatgtgttt 83161 gagtcctaat ccccagtatc tgtgacttta tttggaaaaa gggtctttgc agatgtaatc 83221 aagttaagat taagtcatac tagattaggg tgagctctaa tccaatgact gaggtcctta 83281 taagaagagg taagccagag ccaggcgtgg tggctcacac ctgtaatcac caggaggcgg 83341 tggttgtggt gagccaagat cgcgccattg cactccagcc tgggcaacaa gagcaaaacc 83401 ccgtctcaaa aaaaaaaaaa gaagaggtga gccgggcacg gtggctcaca cctgtaatcc 83461 cagcactctg ggaggctgag gcgggcagat cacgaggtca ggaattcaag accagcctga 83521 ccaacatggt gaaaccctgt ctctactaaa aatacaaaaa ttagccagac atgctggcac 83581 acacctgtaa tcccagctac tcaggaggct gaggcaggag aatcgcttga accgggaggc 83641 ggatgttgca gtgagccgag attgcaccac tgcactccag cctgggcaac agagcaagac 83701 tccatctcaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa gtgaactggc tgggcatggt 83761 ggtgactcat gcctgtaatc ccggcagttt ttttgaggcg aaggcaggca gatcgccttg 83821 aggccaggag tttaagacca gcctagccaa catggcgaga ccatgtctct actaaaaata 83881 caaaaatttg ccgggcatgg tggcacatgc ctgtaatccc agcttcttgg gagactgagg 83941 cacgagaatc acctgaaccc aggaggcaga ggttacagtg agccgggatc ccgccactgc 84001 actgcagcct gggcttctgg gtgacagagc gagactctgt ctcaaacaaa tgaacagaaa 84061 aagaagaaag gaatttggac acaaagacac aggtagtggg tctcctatct atataagaga 84121 acagcatgta atgacacaga ggcacacaca gaaaagaagg cgagttgaag acagaggcag 84181 agaatgggtt tatgctgccg caagccaagg ttggagctgc cggcagccgg aaaaggcagg 84241 aaagaattct tcccaagagc cttctgagga agcacggccc tgccaacacc ttgatttcag 84301 acttctaacc tccagaactg taagaaaaag aaattctgtg ttctaagcca cccaggtttg 84361 tggtagtttg gtaagtactt ttaaatgact gaatgaatag aaagaactca gaacacaaca 84421 tggaaactaa acctcagatc tggtcttcct ctgtaaaagg tagcatctgg gagaagggcc 84481 taaagccacg ttttcccact ggaggccctg gacccacaca acaggccgcg cctgtcctcc 84541 gactgtggtg ccagtcagaa ctgccctcag gcagaccaca gagtctactc ctctcccagc 84601 ctttgcaccc cttgtggccc atttttgttc tcagagagcc ccgcgttgtg tgtaaaaagc 84661 aaggtgctta ggtggaccaa gagatcttca gccaggaggc aatctggcaa tggcccagat 84721 agaaacagtt acagttggag ctagacaacg atcgtgcctt gtgcgttttc tttgactatt 84781 ttcgtttggt ttcctgctta ttttcaataa aatgtttttg tagcatttga ccttggtcca 84841 gtgatgctaa ggaggaaggg agtgggcttt aaattttgtc ttccttgggc cagagtttta 84901 aagaggaatc agcttacccc cgagggccag gtgatggggg agggggcagt gtgaatggaa 84961 aggagaggag ctaaggggaa gatgaaaggt tggggctcaa aaccagctgc cctccctgtc 85021 tgagcatctc tctctctgaa attactttct gatcaaaggt cagtgcaatt tgtgtttagt 85081 ctgaacctga ggaatttttc ctttagaagc tgcaactgcc aaagccttag ttagccagct 85141 gtgctgctga aagagcaacc accagggaca ttccccagag ggaatcgagg tcccttcttc 85201 ctggaaagtt tcctgaaaca tgagatgcta tgatgatgat gatgatgata acgacgacgg 85261 tcgtggtcat ggctaacata tgctgggcat ttttctatgc caggcaggca gtgtttgtct 85321 gagcatcttc ccctgtcagc ttgtgagctg ggtcctagta ctactcccct tttgcagatg 85381 aagaaactgc ccatggccac agcccactga agatgaaacc aaggttctct gtctaaagct 85441 gttgattaaa aataataatt aaaaaattaa aaattggcca ggcacagtgg ctcacgccta 85501 tctgtaatcc cagcaccttg ggagaccaag gcaggcggat cacttgaggc caggggtttg 85561 acaccagact gccaacatgg tgaaactctg tctctattaa aaatgcaaaa atgagccggg 85621 catggtggca cacacctgta atcccagcta ctctggaggc tgaggcagga gaatcatttg 85681 aagccaggac gtggaggttg cagtgagttg agatcgagtc actgcactcc agcctgggca 85741 acacggcaag attctgtctc aaaaaaaatt taaaaattaa ttttaaaaaa tttgtgaaga 85801 cgggctgagc atagtggctc atacctgtaa tcctaccact ttgggaagct gagttgaggg 85861 gatcacttga aaccaggaat ttgaggctgc agtaagctat gattgcacca ctgcacttca 85921 tggtggcatg gtggacatcg ttactttctg gaccagcttc tgggccggca tgggtggaca 85981 tctttccagg aatatactct gagtgtcagt gagacactgt cttaaaaaaa aaaaaaaaaa 86041 aaaagaaggt aaggttgaga ccatgggcag tagtttgacc ccaggctctt caacatgagt 86101 tgcctttgta cagaacaacg cagaaataaa accatgtggc tctggaccat agctaagatg 86161 ctgagatccc tggctgaaga tctcgtgacc tctgcaggag cagaacaaat gtggtggcag 86221 tggcaggggc tcacccagat agtcattcct cttctggttg atgttcagga tctctctgat 86281 atggtcaaca gctgtcttgg gagagccacc catattggac tttcctagaa caaagagaat 86341 aaagaattct tctaggtaat atcaggattt ccccaggccc tggaagagta ggaaagcatt 86401 tagcctggga acctcaacat ttacattgcc tgtgagagct tcaaaatata cttttaggct 86461 gggcatagtg gctcacacct gtaatcctag cactttggga gaccgaagaa ggcggattgc 86521 ctgagctcag gagttccaga ccagcctggg caacatggtg aaaccctgtc tctactaaaa 86581 ccacaaaaaa ttagccaggt gtggccgggc atggtggctc atgcctgtaa tcccagcact 86641 ttgggaggcc aaggcaggtg gatcacctga ggttgggagt ttgagaccag cttgaccaac 86701 atggagaaac cgcacctcta ctaaatatac aaaattagcc gggtgtggtg gcacatgcct 86761 gtaatcccag ctactcagga ggcaggagaa tcgcttgaac ctgggaggcg gaggttgcgg 86821 tgagccaaaa tcgcgccatt gcactccagc ctgggcaaca agagtgaaac tccatctcaa 86881 aaaataaaaa taaaaattag ccaggcatgg tgacacatgc ctgtagtccc agctactctg 86941 gaggctgagg cacgagaatt gcttcagcct gggaggcaga ggttgcagtg agccgagttc 87001 gcaccactgt agtccagcct gggcaataga gtgagaccct gtctcaaaaa aataaatata 87061 tatatatata tatatatata tgtatgtata catatatata tatacataca tatatatata 87121 tacacacttt tactgaaatg ttctacattc caggcaccag gccaggtgct tcctatactt 87181 taactcattt aattctcatg aaaccccagg tagagtgggt atcctcattt tatatactaa 87241 gtcctggagg aactgtgtaa tttggacaat agcaagaggc aaatgacaag gccagatcca 87301 ctacagactg ttccttgcat ccgtgccctt cgacattcct gctcaagagc cacgtttcct 87361 catctatctc caagttacct cccagatggt aacaagagct aacacagagc actttcacac 87421 aggcaactta aatagctttc cataaaaatt gtgtaaggaa aggaggacag ccatcatgat 87481 catcattgct gtcattgtgg ttacggtcat catcatcaac actaattcta tttatttatt 87541 tattttattt tgagacaagg taggatcttg gtctgtcacc caggctggag tgcagtggca 87601 tgatcttggc tcactgcaac ctctgcctcc caggttcaag caattctcct gcctcagcct 87661 cccaagtagc tgggattaca ggcgcctgcc accacaccca gctaatggtt ttgtattttt 87721 agtagagacg gggtttcacc atgttagcca gtctgatctc aaactcctaa cctcaagtga 87781 tccacccgct ttggcctccc aaagtgctgg gactacaggc ataagccacc acactggccc 87841 tattttataa ataagaaact gggccaggtg tggtggctca tgcctgtaat cccagcactt 87901 tgggaggctg aggcaggagg attgcttgaa gccaggagtt tgagatcagc tgggcaacat 87961 agcaaggccc catctctaaa aacaaaaaca aaactggggc ccaaagtgga aggcagaggg 88021 agaatcagaa ttcagggctt ctgaccccat attggtgccc cttccactat tccagacaca 88081 ctcagagacc atgataccca ccatctgtca gaaggatgat ggcatgtcgg atttcctgcc 88141 aggccatcgt ttccatgccg aggagtcgca tttggttgtt catcatgaga tagacactgt 88201 ttaaggctgc ataggtgtta gtcccagttc cattttcatg atctggaata tgccaaaagg 88261 aaggactctc ttagaaactt cccacctacc acctaggggt agggaatcac tctattcccc 88321 caattattgg aaatgcaatt tttttttttt tttttgagag agtctcgctc tgccacccag 88381 gctagagtat agtggcgtga tcttggctga ctgcaacctc cacctcccca gttcaagtga 88441 ttctcatgcc tcagtctccc aagtagctgg gattacaggt gccagccacc atgcccagct 88501 aattttttgt atttttagta gagatggggt ttcaccatgt tggccaagct ggtttcctga 88561 cctcaagtga tccgcccatc tcagcctccc aaagtgctgg gattacaggc atgagccact 88621 gcacccggcc tggaaatgca atttatcgat tgttttgata aactcagaca tttgccattc 88681 agatagagat gtggcatcat tgcttgttca aactttaaaa gcctttccac cttagaaatg 88741 ttcattgcat agggccgggc aaggtggttc acgcctgtaa tcccagcact ttgggaggct 88801 gaggtgggcg gatcacaaag tcaggatcac gagaccagcc tgaccaacat agtgaaaccc 88861 catctcaact aaaaatataa aaaattagcc aggcatggtg gcaggtgccc gaaatctcag 88921 ctactcggga agctgaggca ggagaattgc ctgaaccctg gaggcagagg ttgcagtgag 88981 ccaagatcgt gccactgcac tcaagcctgg gtgacagtgc gagactccat ctcaaaaaaa 89041 aaaaaaagaa gaagaaaaaa gaaatgttca ttgcataacc ccaatccaag accacactaa 89101 agtcccaaga ttcaaagatg aggtttccaa aaccttgagt ggcttcagcc atttgctaag 89161 tgccaaccag gcactttaca tggtgtatgg gtggctaatg acatggggag atggcagtgt 89221 gcagccatgc aaaccagaca gtgtggtttt ggcaactgga aggaattgaa gacctgcaca 89281 gtgtctttgg acctgcagca tctcagaatg agctatgagc tgggggcagc tggactcccc 89341 caccataacc ctggacgttg aaactacccc agactcgtgg gactcaggtg ccaccaagag 89401 gcctcactct ctttttaact catccagaat ttgtttgcag gccctgagag ggtccatctt 89461 ctcctctctc atcaccatca cgtgatgaca cccgtacctt tatagttggc attttccagg 89521 ctgctgatca cctcagtcat atcccgggag ttgtcgttca ggacagacat gaggactttg 89581 ggctctgagg caaaggtgat aatggcaacg ctcacattga tctcaaagct gaagatctgt 89641 gcggggcaag tgagaggcag cgtaaagggc cctgaagcca aaggggagat ggtaagagag 89701 caaacaaagg gccctggagg aggcagagaa actggagtga gaccgacaga ggcaaggacc 89761 aggggagaca ctaggaaatg gctgttagga agaagctagg ggctgggcgc ggtggctcat 89821 gcctgaatcc caacacttta ggcaggtgga tcacctgagg tcaggagttc gagaccagcc 89881 tgaccaacat ggcaaaaccc atctctacta aaaatacaaa aattagcctg gtatggtggc 89941 aagcacctat aatcccagct actcaggagg ctgaggcaga agaatcactt gaacccagga 90001 ggcaaagttt gcagtgagct gagatcgcac cactgtactc cagcctgggt gacagagact 90061 ctatctcaaa aataataata ataataataa taaaaataag gaacagcagc agctaaggaa 90121 gaaacctttg ttgaacatct ttctctttga cagactctct tcactccacc ctgtgagggg 90181 catgatgacc cctattttat ggataaggaa attgaggtat gttgccttgt cccccttccc 90241 ccatcacacc atgaggagga ggtggaagtg ggatctgaat gtgtcacatg attctcaatc 90301 actgctccaa ctggtgtgtg ttgccgcctc gcagtgcaat gagttactta attttgagtc 90361 ccaagagagc acagaaactg ccctctttaa aatgtagctg ggagggaaaa ccatctataa 90421 tgttacatga agtattgtga aaagcacaat tacaaaacaa agttctggcg aggaacttaa 90481 accagtggct tccaaacttc tatcacacat attttatttt attttatttt attttattta 90541 tttttttgag acagagtttt gctctagttg cccaggctgg agtgcaatgg cgtgatctca 90601 gcccactgca acctccacct cccggatcca agcgattctc ctgcctcatc ctcccaagta 90661 gctgggttac aggcgtgcgc catcacacgt ggctaatttt gtatttttag tatagacagg 90721 gtttcatcat gttggtcagg ctggtctcaa actcctgacc tcaagtgatc cacctgcctc 90781 ggcctcccaa agtgttggga ttacaggcac tagccaccat gcccagtcta tcacacacat 90841 tttaaagcct gcacaaaatg ttgaagtttt aaaaggaaat tattaaatat taaattatta 90901 atctgattta aattattcca caggggcacc agtatttttc tgcttcactc ctgtgggtta 90961 tctcactacc tttctgtgga gaccactgtt ctagatgatg aattcctgga aagtaagacc 91021 tcagcctttc catctttgct gtcccccagc cccagcacag tgtcccaaca gtgcttgtgg 91081 aataaaaatt atgattgtat aaattgaatt catgattcct acatggcatg agtataatct 91141 gagaagactt cttggaggag gtgggctgtg agtggggttc tgaggagggg aaggagacag 91201 agagagatag tgagcacagg aaggcctctg ctgcaggcag actcctgatt cctgaccctg 91261 tccaccatga gggaggcgct ctccttgaag atgagaaagt cattttccga cacactctgc 91321 gaacagtcca ggagcaggta gaggttcaga tgaccagagc gctggatttg gattttacgg 91381 cccaggcttt ctggagaaat gtggaagggg aggattcaga tccccacctc cttcctgcgc 91441 tctcgccaag caggtccagc ttctctgcct ccctcccttt cagtggctga gcccaccaag 91501 gctcctggcc tccaggaagg ttgcatcctc ccttgctccg gcccagatcc agcaccctgc 91561 tcaaccagaa accccacctc aaacactcac cctttgtctt ctgggtggga ttggtggccc 91621 caagcatgtg ggagaaggaa gtgcccaggg caggggccac gtcctcaggg aagtcataag 91681 agtagggttc tgtggggaga gccacacagg agtcagccgg ggcagtggcc gggtgtgccg 91741 aggaatctca ggagggcagg gcagctactc acggcggcag atgggctccg ttccactcca 91801 gaccccgttg ccctggcact cccgctccga agaccccgtg agcacaagat tcgaggagca 91861 gcgatagcgg accttgtccc catgaccaaa gcggaagcct gtccgcactg cgcccagtga 91921 aatgcctggg ttggggcagt ggccagctgc gagtgaacaa aggaaggcag aggtgagtag 91981 ccccctccca tcaggctctg cctcctcagc ggcctcagtt gccagcagaa gcttcccagt 92041 tccaacccag ggatgcaccc atgtctgtcc cagagccccc agagacccag ggaagaagtc 92101 agctgatggt ccctccactc cttcagagaa catttatatt tcataaacca cagtgaggca 92161 tccctacaca cccatcaaaa cggctaacac tgaaacaaca ggtaattcca agtgttggag 92221 aggatgcaga gcgactggca ctctgttacc ttgctggcat gtacgtgggt gaaactttct 92281 cagcctctac tacaactaaa catatgccag cctgtgatgt gacaattcca catttagatc 92341 atttactcaa gagaagcaag tgcatacatg taccaaaatc acatgcaaga atgttcctag 92401 agctccctct ccctcaccct ctccccatgg tctccctctc cctctctttc cacggtctcc 92461 ctctgatgcc gagccgaagc tggacggtac tgctgccatc tcggctcact gcaacctccc 92521 tgcctgattc tcctgcctca gcttgccgag tgcctgcgat tgcaggcgcg cgccgccacg 92581 cctgactggt tttcgtattt tgttagtgga gacggggttt cgctgtgttg gccgggctgg 92641 tctccagctc ctaaccgcga gtgatccacc agcctcggcc tcccgaggtg ctgggattgc 92701 agacggagtc tcgttcactc agtgctcaat gatgcccagg ctggagtgca gtggcgtgat 92761 ctcggctcgc tacaacctcc acctcccagc agcctgcctt ggcctcccaa agtgccgaga 92821 ttgcagcctc tgcccggccg ccaccccgtc tgggaagtga ggagtgtctc cgcctggcca 92881 cccatcgtct gggatgtgag gagcgtctct gccctgccgc ccatcgtctg agatgtgggg 92941 agcacctctg cccggccgcc ccgtccggga tgtgaggagc gtcgctgccc ggccgccccg 93001 tctgagaagt gaggagaccc tctgcctggc aaccgctcca tctgagaagt gaggagcccc 93061 tccgcccggc agccgccctg tctgagaagt gaggagcccc tccgcccagc agccacctgg 93121 tccgggaggg aggtgggggg gtcagccccc cgcccggcca gccgccccgt ccgggaggga 93181 ggtggggggg tcagccccca gcccggccag ccgccccgtc cgggaagtga ggggcgcctc 93241 tgcccggccg cccctactgg gaagtgagga gccactttgc ccggccagcc actctgtccg 93301 ggagggaggt gggggggtca gccccccgcc cggccagccg ccccgtccgg gagggaggtg 93361 gggggatcag ccccccgccc agccagccgc cccgtccggg agggaggtgg gggggtcagc 93421 cccccgcccg gccagccgcc ctgtccggga ggtgaggggc gcctctgccc ggccgcgcct 93481 actggaaagt gaggagcccc tctgcccggc caccaccccg tctgggaggt gtgcccaaca 93541 gctcattgag aaggggccat gatgacaatg gcggttttgt ggaatagaaa ggggggaaag 93601 gtggggaaaa gattgagaaa tcggatggtt gccgtgtctg tgtagaaaga ggtagacctg 93661 ggagactttt cattttgttc tgtactaaga aaaattcttc tgccttggga tcctgttgat 93721 cggtgacctt acccccaacc ctgtgctctc tgaaacatgt gctgtatcca ctcagggttg 93781 aatggattaa gggcggtgca agatgtgctt tgttaaacag atgcttgaag gcagcatgct 93841 ccttaagagt catcaccact ccctaatctc aagtacccag ggacacaaac actgcggaag 93901 gccgcagggt cctctgccta ggaaaaccag agacctttgt tcacttgttt atctgctgac 93961 cttccctcca ctattgtcct gtgaccctgc caaatccccc tctgtgagaa acacccaaga 94021 atgatcaata aaaaaaaaaa aaaaaaaaag aatgttccta gagctttact cctaatggcc 94081 ccaaactgga aataatccaa atgtccatca ggaggggaat ggataaattg taatataccc 94141 atataatgga atactacaca gcaataaaaa agaacaaagt actgatacac acaacagcat 94201 ggataaatct cacaggcaga atatagaaca aaggatgcca gaagcaacag accacatccc 94261 gcatgattcc atttacagga aatgtaagca caaggtacct agtccatgga gacagaggtc 94321 agaagagcag ttaccttgtg ggaggggaca agagggaacc tcattctgtc ttgggtggtg 94381 attagatgag tggatgcatt gtaacaaatc attgagctga atagttaaga tttgtacatt 94441 ttacaatagg taaattatgc ctcaatttta aaataagtta gagtcacgtc taataattca 94501 agcattactt atcacagtct taccaggcat ctacttcacc ctcctacaga ataaatggcc 94561 atttgccaaa acccaagggc atgagtgggg atccatggtc tgggccacta ggtgaaggtg 94621 gtcctgccat cttgtggact ccgtgtcttc tgtatgatgt gctaagaacc atgatccaag 94681 aagctcccaa acccacaggc aaatgcagct ccagggaatc ctaggagccc tacttctgcc 94741 aagcagagcc cctcaccatt cacaggggca gcaataacta tgacaaaagt gcttatggcc 94801 aggcgtggtg gcacatgcct gcattcccag ctactgagga ggctgaggta ggaagatcac 94861 ttgagctcag gagtttgaag ccagccttgg caacacagcg agatcctgtc tcttaaagaa 94921 acaaaaacaa ggccggatgc cgtggctcac gcctgtaatc ccagcacttt gggaggccaa 94981 ggtgggcaga atcccctgag gtcaggagtt tgagaccagc ctggccaaca tggagaaacc 95041 ctgtccctac taagaataga aaattaccga ggcatggtgg cgcatgcctg taatcccagc 95101 ttcttgggag gctgaggcag gagagtcact tgaacccggg aggcagatgt tgcggtgagc 95161 tgagattgcg ccattgcact ccagcctggg caacaagagc tagactccgt ctcaaaaaaa 95221 aaaaaaagtg tacatggcat ttatcagatt ccaggggctg ttttttttgt tttttgtttg 95281 tttgtttgtt tgttttgaga cagagtctca ctctgtcacc caggctggag tgcagtggca 95341 cgatctcggc tcactgcaac ctctgcctcc caggttcaag tgattctcct gcctcagcct 95401 cctgagtagc tgggattaac cacgcccagc taatttttgt atttttagta tagacgaggt 95461 ttcaccacgt tgaccaggct ggtctcaatc tcctgacctc aggtgatcca cccacctcag 95521 cctcccaaag tgctgggatt ataggcgtga ggcaccgagg ctggcatttt tttttttttt 95581 tttttttttg agacagagtc ttactctgtc actcaggctg gagtgcaatg atgcaatctc 95641 ggctcactgc aatctccact tcctggttca aatgattctc ctgcctcagc ctcctgaata 95701 gctggtatta caggcatgcg ccaccgcacc cggctaattt ttgtattttt actagagacg 95761 gggtttcacc atgttggcca agctggtctc aaactcctga ccttgtgatc cacccccctt 95821 ggcctcccaa agtgctggga ttacaggcgt gagccaccgc actccagggc tgttgtatat 95881 gcttactaat gtcaactcat ttaatccaga tggcaatctc aggtcagcac tcactatcct 95941 tgtcttacag atgggaaaac tgagggctag aaaggttaaa tagccttccc aaagtctgtc 96001 aaagtcagag tagggcttgg aacccaaacc tggctttaaa ggcaaggcat taaagcacag 96061 tacttgactc tctcccagca tatcttccct acaacaatct gtggggcctg aatatgaagt 96121 cttgctttga ggcacttaga aaagccaggc agagtgtgtg gaggctttat gctacatctg 96181 aagcttagca ccagctgcct ggggcctggc ccttgcagac ttgtgatgtg tgacctgtag 96241 acctcaattt accaaaatcc aggaatatta ttatggcaca ctgaacaaca gactcctatc 96301 cccacagagg cttcctggat gcacattggg ccccagggtt ccccaggaga ccccagcccc 96361 ctgtgtagcc catcagccag agaactcacc cccattatca cacacagctg tttctccatc 96421 ccacatgccg ttggggcgac actgacgcac aggcgagccc cgcaatatga agccatcctc 96481 acactcgaag ctcacattgc cacccacggg ataggacccc agccgtgggg tataaatgcc 96541 attctcaaag gagacagggg ctggacagcg cacagctgga gagagaggaa gtgggtgggg 96601 aatataggac tggatggata acacctgggc tggggattaa gcaggtgatt ttatgctgga 96661 atgcatggga atggtcaact ggctgtcaaa acactgaaat attaggttgc aacatgtgaa 96721 attgctgttt ttggcaactg attgaatatt gccaatttca cagagtctaa gcgaactgat 96781 tgataaatag ccacttccct cagtctcctg agggccgtgg cctccccaga cactcccctt 96841 gggatcctca gatgtctaga ggactccagg accattcagc atctgttctg aatggtcact 96901 actaggcagt caattgtaca taaaaagtca tccctggctt tagtacccca cagtctccat 96961 caagacctgt tcccaattcc ttgctactgg caattgacat tttctggata ttttgaaaag 97021 ccctctgcaa aagcaacttt gccatctagt caggttaagt cccaccctga gcaactccta 97081 aacaaaagtt ctggggtggc ccctggtgta gcaccctgag caaagcccac agggagcctc 97141 acgtttgcag accgccttag acagagaccg ggtggctcct ggggtctgcc actgtccgct 97201 gctcttgcac agccgtgatg ctggggatgg gtacaggccc tgggggcagg agtaggtgag 97261 aaggctccca ggagcccagc catggctgag ggtgaaggtg ccacccgaga tattcacgtt 97321 ctgagggcag gagggagccg agtctgccag acctagagcc gtgggaacaa ggagacagca 97381 atggagaaag aaggaagaca cagatgagaa aaggaaagca aaagagactg tttcacacat 97441 gggaattcaa gctgggagca ccaacctcac acacaggacc ctgacgttcc cccttccctg 97501 cctcctacct gggtacagga acagcaggca aaaaagaacc atcagtgggc ccatggtgtc 97561 ctccctaggg gcggcgagag gtagagagcc gcgggaggga aaaggtcatc tgacttcccg 97621 aaaactagag ctgggtcagc agctgcttca ttgctgccag agaagaaaaa ccagggagtg 97681 gcaaggggga tggtcttctg ttagggaaat aataagcagg gattggggac tgaactgaag 97741 ggtgaaacct ttgccctgtc tccctgatgc taatatatct atagggtcaa tagatctccc 97801 ccacaggctt tgtgtactgc ccccacgctc cacccaccca tgcgtgcaca tgtgcaccat 97861 acacgcatgc tcacaaacac gcaggcaaac acatatttgg gctcgtgtgt gcacacatgc 97921 accttgctct tgtgtgaata gtcaatagtc taaagcacac aaacccacaa accatccaag 97981 atattgcaaa atatttcatt taagtaaagt tttagtgttc ttttgggttg taccaagtgg 98041 gttttacttc ctgagagcct agaacagttc tgtcttcttg aagataaact tccgaggggg 98101 gtgaaactaa tttttttaga acttcttgat aaataggaag attgacaata cttttttatt 98161 ccctacaatc cttccctaga gataatctag gtggaaaaaa tgaaatcaac acttagtttt 98221 ggagacagga aaaatgttta caacattttt acggaaagtc tagtagatga aattcccaac 98281 tggcaaagta aatacatttg gggttggtta ctcaaaggaa taaaactttt gcgggggctg 98341 ggatatgcct ttaaatcata acctacactt gtatgaaatg caggaagcaa agaaacttga 98401 agctagtgtg ttagtctccc tgacccacag aagtcttttc aatacctttt tgaacaatgt 98461 agggtaaaaa taaataagac caataaaaaa atgaacatga aacctctctg aagtgatcat 98521 ttaggttcgt aaaaaccctt ttgatttata agtgagtcct tagagaactg attgaaccta 98581 gaagatggag gttgcagtga gccaagattg tgccactgca ctccagcctg ggcaacagag 98641 taagactaca tctcaaaaag aaaaaaaaaa aataagaccg ggcacagtga ctcacgcctg 98701 taatcccagc attttgggag gctgaggcag gtggatcatg aggtcaggaa tttgagacca 98761 gcccggccaa catggtgaaa ccccgtctct actaaaatta caaaaaatta gccgggtgtg 98821 gtggcagggg cctgtaatcc cagctactcg ggaggctgag gcaggagaat cacttgaacc 98881 cgggaggcag aggttgcagt gagccaagat catgccactg cattccagcc tgggtgacaa 98941 gagcaagact ctgtctcaaa aaaaaaaaaa agagtacaca aattgagact ccccttggac 99001 agggctaagt gcatgaaaat caggacatcc tggtgttagg gactccaaag tggctgctgt 99061 ggcagagctg ctagtgcctc ccctgcccct caagtatctt ttctccctcc ttccacagta 99121 agagatggtt aggggaacac acagtcaccc agctaaagac catatttcct agcttccctt 99181 gcagctaggt agccaggtga cttggttctg gctaatggga tgttagtaga acttatgtat 99241 gtcatttcca agttatgtcc ctaaaataaa aaggcagtgt gccttttctt ttcatgggct 99301 ggaatataaa cctagcggag atccgtcttc caccaggagg atgagggcaa catgctgggg 99361 atgacagagc aaacagacag aggagcctgg gtgtctgaca cagtgttgat gtcataccag 99421 ccctgcacag ccacttagat tgttacaaga gagaaataaa ctatcgtaaa ggtggggggt 99481 atttaaggat gttcacaaat actgtagcct tgtcttcttt caatcacatg gtaggatagg 99541 atttccccag ccacttgttt tagccaataa gatgtgagta caagtgatat gtatcacttc 99601 ctgatggaag ctcaatgtag tcttcaccct gctcttcccc tctgtatggt aacgtctgag 99661 atggtggctg ctccaccagc caagatccct gggtgatatc gaaagcacag taccagccgg 99721 gcacagtggc tcacacctgt aatcccagca ctttgggagg ccgaggcagg gggatcacct 99781 gagttcagga gttagagacc agcctgacca acatggtgaa accccatctc tactaaaaat 99841 acaaaaatta gctgggcgtg gtggcatacg cctgtaatcc cagctgctca ggaggttgag 99901 gtaggagaat cgcttgaacc cgggaggcag agattgcagt gagccgagat cacaccactg 99961 cacttcagcc tgggtgacat agggagactc tgtctcaaaa aagaaaaaaa aagcgcaata 100021 ctcggctgac tctcagtgga taagtaatat gaccaaaaaa taaacctttg ttattttaat 100081 tcattgcatt tctactaatg caggaaaaaa agtacccacc caatatgtag gaaatttaaa 100141 aatgatttat tctttctcat attggccaga aggttcttct gctgatcttc catgcagcgg 100201 cagtcagcta ggggctgagt tcagttaaca gtaggacagc aggtccttcc tccctagccc 100261 tcatagcatg gcagtctcag gacaacttct gaggaatcca aaacataagc tgcaaggtct 100321 ctcgaggcct agccccaaaa gtcacataac atcacttctg ccacattcta ttggtcaaag 100381 caaatcacaa ggtcagccca gatttaagac tggaggcata gaatttacct cttgatagga 100441 ggaatggcaa agtcacacgg caaaggggca ggcacatcag aataggagag attgatggct 100501 atattttgta agccctccta tggcaacaaa tatttctagg ttgttggtta ccagaacgta 100561 atgaagccta acctgactga acattcaaat cactaaactt tgggtgcctt ttaaagccat 100621 caaactaata tgtaacagct gccttcaaaa gaaagggcct ttctttcatg agtttttccc 100681 agtatttgtt gagctacatc aacttggcac agtgtggcag aattcattct ctcttcctgc 100741 ttaagtaata aaactcccaa gttttctggg cccacagagt atgttcttaa ggagaagtgt 100801 ttactcctca atgtataaaa cttcttccct tcccaccagc tggaatgtgg acagaatggt 100861 ggactctgga acagccacct agatcacaaa atggaagtca catgtggatg atgataaaga 100921 aacaaaatta aagaggccct gctctccaat cctgtagagc taataggatt gacctggacc 100981 acttatgctc agaccattta agctgctgtt attctagctt ggttataaca aagccaaacc 101041 atgtctaact gatataccta gcaagaaatc aatgtgagag ccaaaggcaa cactcagaga 101101 agtgtgttta ttaggataca cgttaaactt ctgtaataaa gagaccctaa aatacagtaa 101161 cttaaataag atagaatttt gctgcacttt caaataagag cctcagaggc cgggcgcggt 101221 ggttcacgcc tgtaatccca gcactttggg aggccgaggc gggtggatca caaggtcagg 101281 agatcgagac aatcctggct aacacggtga aaccccgtct ctactaaaaa atacaaataa 101341 ttagccgggc gcggtggcag gcgcctgtag tcccagctac tcgggaggct gaggcaggag 101401 aatggcagga acccgggagg cggagtttgc agtgagctga gatcgcgcta ctgcactcca 101461 gcctgggaga cagagcgaga ctccatctca aaaaaaataa ataaataaga gcctcagagg 101521 ccgggtgcgg tggcttacgc ctgtaatccc agcactttgg gaggccgagg cgggcggatc 101581 acgaggtcag gagatccaga ccatcctggc taacatggtg aaaccctgtc tctactaaaa 101641 ctacaaaaaa aaaaaaatta gctgggtgtg gtggcggcgc ctgtagtccc agctactcgg 101701 gaggctgagg caggcgaatg gcgtgaaccc gggagatgga gcttgcagtg agcggagatc 101761 gtgccactgc actccagcct gggtgacaga gcaagactcc atctcaacaa aaaaaaaata 101821 aaaacaaaat aagaggctca gggtaagcag acaaggctga gatggcgtgt gtggccctgg 101881 ctggcagagc tgctcatcag cctccttcag gcctccagct tccttccatc tgtggttcat 101941 gcctttggaa attgtccttg tctgcatggt tgaaacaggg tctccatcaa gtctgcctcc 102001 caccccactc ttccattcat attccattgg caaaaagtca gttatagtcc actgtcagct 102061 ctccaggagg cccagtcgaa tgtggaagga ccatgacctg acctgtgggg ctaaagatgc 102121 cagaccttta ttcccctcgg cactcccgat tagcttttca gtagacccca ccagatgagg 102181 gcgaccttgg cttctctatt tcttgtacca ttcaaaagcc cagcaaatgt aaccgtgttt 102241 tcactccagg tagagggaag caggtgaata atatctaaaa aacagacatg cctctcctgc 102301 cagcttcttc catcttgtcc aaatttcttc ttatcaattc taggcttcta atatctgctt 102361 ccaagggtac aattccaaag agaaaaaacc aataccccaa ttctgtactt catctcctag 102421 tcagagttgg atttttgctt ttggtttttt tggttttttg tttttgagac aaagtcttgc 102481 tttgttgccc aggctggagt gaagcagtac gatctcggct cactgcaacc tccacctcct 102541 gggttcaagg gattctcctg cctcagactc ccaagtagct gggattacag gcatgtgcta 102601 ccatgcccag ctaatttttg tatttttagt agagatgggg tttcaccatg ctggtcaggc 102661 tggtctcaaa ctcctgacct caggtgatcc acctgcctct gcctcccaaa gtgctgggat 102721 tacaggtgtg agccactgtg ccaggccccc taatcagagt ttaaaaatca gaaatgatgc 102781 tctgatggcc tctcaggatg ccttattctc ccctggagag ctcagcacct tgcagctgaa 102841 aatttctgag gaaacgtggc atatagcaga ggccgacccc actctagtgg ccagtctact 102901 ccttatccag tctggccact tgggcctgga cttcgcggtc atcctaggat ctgccacccc 102961 tcaaccttct atggcaactg caggtcaacc cccaaattgg aaaactagtc tggcttggcc 103021 tggttcacag ccttaatgtg tgactttaga tggggattca gcattgctct gcaagcctct 103081 tggtggtttt agtggatctg tccttgcccc accatatccc ttgggcctta tcactggggt 103141 gctccactga cttccagctg cagcatctct gtctctaccc aaggattgtt tttggtgtcc 103201 tgcaccatct gtgcacagaa agctggaaat gacagagaat taaggccttc aaccaatgac 103261 tgattggtat aaatactggg ctccctcacc ctggaccatg ggataactct gcaccactcc 103321 agggttcctg agtgggattg acctccactc acccacaatg gtaaatttgc ctgataacac 103381 acccttcatt gactgccttc cttctctatc acactccact acccatgttt cttgcgatca 103441 cctccagaat gcactgtctg cactggaacc cttgtctcca ggtctgcttc tgggggagga 103501 agacttcata ttagcaggac atagtgccag acagtatttt taagtcttaa atgtaggtgc 103561 tcacttaatc ctcacagcaa gcttatgaca caggttctgt tgttactccc actttacaga 103621 tgagggaact gaggcacaca aaggttaagt gagctgcccc aggttcatac aggtactaag 103681 cagcagagcc gggattcaaa cccaggcagc atggctcctg ggtccacact catagccacc 103741 aagtggtagt gcctcttcct cacgtcttct ctttagtgac tgctgggagc tccttgcgca 103801 tccctgaagt ctaccccagc cccttgcatt tccaagccct ctctccccta ggctctttcc 103861 tcttggcacc ccctctccca taacaccttc ttttccatcc ttctcaaatc cctcatcttc 103921 cagactccta caagacctgt ccacgccatt tcccaccttc tagctacttg tgcacttgtc 103981 tcaagctccc cagaggaagg atccaggagt ttttttttta aacaatgagc tatccaagta 104041 aggtcaagcc aatgccctcc cccatcctat ttccacccca agtaaatagc atctttcagg 104101 tcagcaacag aattggcttt ggtttctcat ccatttcttt tttaataaaa atatttacat 104161 tgggtcagac cccacccatt tccacacaaa ggcctctgct aagttcctcg gtacccacac 104221 catcccccat cctagcaggc actgcctact aactttgaag tgattgccca actatgatct 104281 tgaggaatct ccacatacat cacccttaga gcctcagaaa gggttttgcc ctgccccatg 104341 gggctcctcc ccatgcccag gctcttccag ggccctgggc ctcagaggcc accctgcagg 104401 cccagacact gggttagaca ctgaacctcc tgtccttgtc catccattgc acaaacaatt 104461 cctgagaatg gaacgaggaa ctaaggggtg gggtagggcc tcccaagaaa cagaaggcct 104521 gtccctgacc tcctgtaggc gccatatctc tttcagacaa aaactcaacc tctaaagact 104581 cacaggcctg gggtgtacca gggtgtccat ctgcccacac cgcagctctt acctcagccc 104641 tctgaggtct ccactgtcct tgggctggtg gggggcatgg tgcatgttat cacccacttc 104701 ttgctaccca tcagggaagc tgccctgggt aacccaggta agagggtggt aaacacaact 104761 caggtgctca ggggtcagct gaggatgggc cagggggagg ggtggcctat ggctgaattg 104821 ccctggctcc ggtcctcacc accccaaccc cagctctggg cttagcattg gtggcagtgg 104881 gggcctcact agcctcctct gccctttcat tgaaaattcc tctctaatgt tttcctttat 104941 cctggggagt ggggagatat tcatcccctt cccagttctg ggtaccagta ccctcttgac 105001 aaaaggatag cctggggctc acatgggaga atcctcctgc cctcactcct ccagtgctgc 105061 caaggggtga agagggaact tgcccagtag aaagacatac gttcatgcct tgctttcctg 105121 cccaacacaa atgaaaggtt atacctgaga agagctcctc ctgccccgtc tctggcccca 105181 gccccagtgt ggtatgagga attcaggctt tgacttaaga cagcctggaa cctcaggttt 105241 ggcttggcaa attcattagc tatttcgtag ccttactcac cctgttttgt cacctgtcat 105301 ccacagaacc aacagcaaaa tacactcccc ttaaggttac ctttgagaat taggaccatc 105361 aaaaggagaa gatcggctac cctacaggta taaaaggtac caggatattt gctttagcat 105421 tcactgttac caagaggaaa tgtttggaaa ctcccacgtc cattaatagg ggacagacac 105481 agcacattgt ggtatgggga caactgaata ctacgcagtc tctgcaagct taaggcttat 105541 ctgtgtgtac agagctgaat gtctccagat atatatgtta tataatatac acttttaaac 105601 tagatagaca ttcagaactg tatgtatggg atgccaccat ttatgcaaaa aaaggaggga 105661 aagaggatag tagccacata tgctctctgt gtgtatataa tttcactgga aaagtttaca 105721 aaaaactgga taggagagtt gcctctgggg agaactgggg gctggaaaat gaggtgggag 105781 gtaaagaagg caacttaatt ttcaccaaat cccctttatt actccttgta gttttcactg 105841 ggtacatatg atacctattc aaggccaggc ccagtggctc acgcctgtaa tcccagcact 105901 ttgggtggcc aaggcaggtg gatcacctga ggtcaggagt ttgagaccag cctgaccaac 105961 atggtgaaat cccgtctcta ctaaaaatac aaaaattagc cgggtgtctt ggcaggcact 106021 tgtcatccca cctactcagg aggctgaggc aggagaattg cttgaacctg ggaggcagag 106081 gttgctgtga gctgagattg caccactgca ctccaacctg ggcaacaaga acaaaactcc 106141 gtctccagaa aaaaaaaaaa aagaaaagaa aagaaaagaa aagaaatacc ctactggact 106201 attaacaaga gaccctgata caatctctct gcagattaag ttttattttg taaaatttca 106261 aacatattgt aaagtagaga gaatagtaaa atgaactttc atgtatgtat ccatcatcta 106321 gcttcagttc atagacggtc tcggttcata gacggtctgg gttcatgtct atataccacc 106381 caccaaccat tccttccagg aatttttttt tttttttttt ttttgagaca gagtcttgct 106441 ctgtcaccca ggctggagtg cagtgatgca atctcggccc actgcaacct ctgcctccca 106501 ggttcaagcc tcctgagtag ctgggattac aggtgcccgc cactacaccc agctaatttt 106561 tttgtatttt tagtaaaaat ggggtttcac caacttggcc aggctggtct tgaactcctg 106621 acctcgtgat ccgcccacct cagcctccca aagtgctgga attacaggcg tgagccaccg 106681 ctcccagcca ggattttttt ttttttttaa gttaaagaca gggtctcact gtcacccagg 106741 ctggagttta gtggtgcaat cacagctcac tgtaaccaca aactcctggg ctcacatgat 106801 cctcccacct cagccttcca agaagctggg acaagaagca tgcatcacca tgcccagcta 106861 atttattcat ttatttactt ttgtaaagac aggggtctca gtatgttgcc cggactggcc 106921 tcaaactcct agcttcaagt gatcctcctg cctcagtctc ccaaagtgtt aggattacaa 106981 gtgtgagcca ctgtgcctgg cctcgaggaa tatttttttt ttgagatggg gtcttgcttt 107041 gttgcccagg ctggagtgca gagacctgat catagctcac ggcagcctca aactcctggg 107101 cttccacaat cctcccacct cagcctcctg agtagctgag attacaggca tatgccatca 107161 taccaggcta agtttttttt aattgtttac tttaattaga gatgaggtct tgctatgttt 107221 tccaggctgg tcttgaacta gcctcaagca atcctccaac ctaggcctcc caaagtgctg 107281 ggaattcatt aaaaattttt ttacggaaaa gctcaaattt ttcattttaa aatatttctc 107341 ttttttcttt tttcagaaca tttctcactt acataagagc acattttctt tctctttttt 107401 tcagatagag tctctttgtg tcacccaggc tggagtgcaa tggcgtgatc tcggctcact 107461 gcaacctccg cctcccgggt tcaagcaatt ctcctgcctc agcctcccaa gtagctggga 107521 ttacaaacgg ctgacaccac gcccagctaa tttttgcatt tttagtagag acggggtttc 107581 accatgttgg ccaggctggt ctccaactcc tggcctcagg tgatccgcct acctcagcct 107641 cccaaagtgc tgggattaca ggcatgagcc accgcgcctg gccttttttt ctttttttaa 107701 aattccaact tttattttag atacaggggg tacaggtgca ggtttgttac atgggtatat 107761 tgcactaggt agtcatagaa ccaattaggt aattttttaa cccacactcc ctccctctct 107821 ccccttctag tagtcttcag tgtctattgt tcctatattt atgtccatat gtgctcaata 107881 tttagttccg ttataagtga gaacattagg catttggttt tctgttccta cgctaattca 107941 tttaggattc tgtcctccag ctccatccat gttgctgcaa aggacatgat ttaatttttt 108001 ttttttatgg ctgcaaaact atttatcttt cacttcttgg cttttaccaa aataaaatgt 108061 ttcttttgaa ctataaattt ataaaacatc tcattgttct ctgtatcata atttttttct 108121 tttttttttt ttttctgaga cagagtctca ctctgtcacc caggctggag tgcagtggtg 108181 ccgtcttggc tcactgcaac ctccgcctcc tgggttcaag cgattctcct gcctcagcct 108241 cccaagtagc tcggattaca ggtgcccacc agcacgccgg ctaatttttg tatttttagt 108301 agagacgagg tttcaccatg tttgccaggc tggtctcaaa ttcatgatct caggtaatcc 108361 acccgcctcg gcctcccaaa gtgctcagat tacagacatg agccaccaca gccagccttt 108421 tttttttttt tttttttgtt gagatggagt ctcgccctgt cacccaggct ggagtacagt 108481 ggcgctatct cagctcacta caactccagc ctgggcaaaa ggagcaaaac tctgtctcaa 108541 aaaaaaaaaa aaaaagggca gagaactaat gcccagagtc aacctgcaat tattgcagag 108601 tgggaagcag tagacagatg ctcctgcctc ctgtccttca ggtggaaagc gtctggagac 108661 attcagctct ctcctcagga gggcctgggg gaatcgaact ccactgcaca caacagtgac 108721 atcactcttt tttttttttt tttttttgag acggagtctc actctgttgc ccaggctgga 108781 gtgcaatggt gcaatctctg ctcaccgcaa cctccgcctc ccaggttcaa gcaattctcc 108841 tgcctcagcc tcccaaatag ctgggatgac aggcacatgc caccacgcca ggctaatttt 108901 tgcaatttta gtagagacag ggtttcgtcg tgttggccag gctggtctcg aactcctgac 108961 ctcaggtgat ccacccacct cagcctccca aagtgctggg attacaggcg tgagccacca 109021 tacccggcca acatcattct cttaaactgg cctttcctcc ttcaggatct cacactccct 109081 atagcctcgc ttctgctttc tgggatcacc taaattaact acctatggcc aagtcctgtc 109141 tcaagctctg ctgtcagggt caccaaaatt aagatcatcc ctttccatcc tcctctccct 109201 ataaactact gcccttcttc cacaaactcc ttccacgcca gcaaacccag actgtaacat 109261 taacacagag ttataatcca tccatatact ggtctctcca cattcctgga gcacaaactg 109321 ctaaagggta ggaacgctgt gacacgtttg gttcccccac tgtccagtgg gagagagata 109381 tgtgagccag tcagcgctac accgagttga ggcagccatt gaggctcttc atgaattttc 109441 ccgttttctg ccttccaggc acatgatagg atggaattcc tcagccctcc tgaagttagg 109501 ccactgcaag aggcctaact ggcttgcttt ggccagtgaa ataagagcag aagtcacatg 109561 tgttgttact gtcaggcaca agtatttaac tgccaatgta acacaagaca ctccagcacc 109621 ctcttttgat ggagcctcct ttgatc // LOCUS HSMI2218 6417 bp RNA PRI 13-MAR-1996 DEFINITION H.sapiens mRNA for 218kD Mi-2 protein. ACCESSION X86691 NID g1107695 KEYWORDS helicase; Mi-2 gene; Mi-2 protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6417) AUTHORS Seelig,H.P. TITLE Direct Submission JOURNAL Submitted (26-APR-1995) H.P. Seelig, Institute of Immunology & Molecular, Genetics, Kriegsstrasse 99, D-76133 Karlsruhe, FRG REFERENCE 2 (bases 1 to 6417) AUTHORS Seelig,H.P., Moosbrugger,I., Ehrfeld,H., Fink,T., Renz,M. and Genth,E. TITLE The major dermatomyositis-specific Mi-2 autoantigen is a presumed helicase involved in transcriptional activation JOURNAL Arthritis Rheum. 38 (10), 1389-1399 (1995) MEDLINE 96017437 FEATURES Location/Qualifiers source 1..6417 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /clone_lib="lambda" /chromosome="12" /map="p13" gene 90..5828 /gene="Mi-2" CDS 90..5828 /gene="Mi-2" /codon_start=1 /product="Mi-2 protein" /db_xref="PID:e149079" /db_xref="PID:g1107696" /translation="MASGLGSPSPCSAGSEEEDMDALLNNSLPPPHPENEEDPEEDLS ETETPKLKKKKKPKKPRDPKIPKSKRQKKERMLLCRQLGDSSGEGPEFVEEEEEVALR SDSEGSDYTPGKKKKKKLGPKKEKKSKSKRKEEEEEDDDDDDSKEPKSSAQLLEDWGM EDIDHVFSEEDYRTLTNYKAFSQFVRPLIAAKNPKIAVSKMMMVLGAKWREFSTNNPF KGSSGASVAAAAAAAVAVVESMVTATEVAPPPPPVEVPIRKAKTKEGKGPNARRKPKG SPRVPDAKKPKPKKVAPLKIKLGGFGSKRKRSSSEDDDLDVESDFDDASINSYSVSDG STSRSSRSRKKLRTTKKKKKGEEEVTAVDGYETDHQDYCEVCQQGGEIILCDTCPRAY HMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEGEEILEEVGGDLEEEDDHHME FCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTCPALKGKVQKILIW KWGQPPSPTPVPRPPDADPNTPSPKPLEGRPERQFFVKWQGMSYWHCSWVSELQLELH CQVMFRNYQRKNDMDEPPSGDFGGDEEKSRKRKNKDPKFAEMEERFYRYGIKPEWMMI HRILNHSVDKKGHVHYLIKWRDLPYDQASWESEDVEIQDYDLFKQSYWNHRELMRGEE GRPGKKLKKVKLRKLERPPETPTVDPTVKYERQPEYLDATGGTLHPYQMEGLNWLRFS WAQGTDTILADEMGLGKTVQTAVFLYSLYKEGHSKGPFLVSAPLSTIINWEREFEMWA PDMYVVTYVGDKDSRAIIRENEFSFEDNAIRGGKKASRMKKEASVKFHVLLTSYELIT IDMAILGSIDWACLIVDEAHRLKNNQSKFFRVLNGYSLQHKLLLTGTPLQNNLEELFH LLNFLTPERFHNLEGFLEEFADIAKEDQIKKLHDMLGPHMLRRLKADVFKNMPSKTEL IVRVELSPMQKKYYKYILTRNFEALNARGGGNQVSLLNVVMDLKKCCNHPYLFPVAAM EAPKMPNGMYDGSALIRASGKLLLLQKMLKNLKEGGHRVLIFSQMTKMLDLLEDFLEH EGYKYERIDGGITGNMRQEAIDRFNAPGAQQFCFLLSTRAGGLGINLATADTVIIYDS DWNPHNDIQAFSRAHRIGQNKKVMIYRFVTRASVEERITQVAKKKMMLTHLVVRPGLG SKTGSMSKQELDDILKFGTEELFKDEATDGGGDNKEGEDSSVIHYDDKAIERLLDRNQ DETEDTELQGMNEYLSSFKVAQYVVREEEMGEEEEVEREIIKQEESVDPDYWEKLLRH HYEQQQEDLARNLGKGKRIRKQVNYNDGSQEDRDWQDDQSDNQSDYSVASEEGDEDFD ERSEAPRRPSRKGLRNDKDKPLPPLLARVGGNIEVLGFNARQRKAFLNAIMRYGMPPQ DAFTTQWLVRDLRGKSEKEFKAYVSLFMRHLCEPGADGAETFADGVPREGLSRQHVLT RIGVMSLIRKKVQEFEHVNGRWSMPELAEVEENKKMSQPGSPSPKTPTPSTPGDTQPN TPAPVPPAEDGIKIEENSLKEEESIEGEKEVKSTAPETAIECTQAPAPASEDEKVVVE PPEGEEKVEKAEVKERTEEPMETEPKGAADVEKVEEKSAIDLTPIVVEDKEEKKEEEE KKEVMLQNGETPKDLNDEKQKKNIKQRFMFNIADGGFTELHSLWQNEERAATVTKKTY EIWHRRHDYWLLAGIINHGYARWQDIQNDPRYAILNEPFKGEMNRGNFLEIKNKFLAR RFKLLEQALVIEEQLRRAAYLNMSEDPSHPSMALNTRFAEVECLAESHQHLSKESMAG NKPANAVLHKVLKQLEELLSDMKADVTRLPATIARIPPVAVRLQMSERNILSRLANRA PEPTPQQVAQQQ" BASE COUNT 1774 a 1494 c 1784 g 1365 t ORIGIN 1 gcggctccgg gtgactcggg ccagtgtaga ggtcctcagg ccgccggcag gagcagctgg 61 gccaattccc tggccgggag cggaagggga tggcgtcggg cctgggctcc ccgtccccct 121 gctcggcggg cagtgaggag gaggatatgg atgcactttt gaacaacagc ctgcccccac 181 cccacccaga aaatgaagag gacccagaag aggatttgtc agaaacagag actccaaagc 241 tcaagaagaa gaaaaagcct aagaaacctc gggaccctaa aatccctaag agcaagcgcc 301 aaaaaaagga gcgtatgctc ttatgccggc agctggggga cagctctggg gaggggccag 361 agtttgtgga ggaggaggaa gaggtggctc tgcgctcaga cagtgagggc agcgactata 421 ctcctggcaa gaagaagaag aagaagcttg gacctaagaa agagaagaag agcaaatcca 481 agcggaagga ggaggaggag gaggatgatg atgatgatga ttcaaaggag cctaaatcat 541 ctgctcagct cctggaagac tggggcatgg aagacattga ccacgtgttc tcagaggagg 601 attatcgaac cctcaccaac tacaaggcct tcagccagtt tgtcagaccc ctcattgctg 661 ccaaaaatcc caagattgct gtctccaaga tgatgatggt tttgggtgca aaatggcggg 721 agttcagtac caataacccc ttcaaaggca gttctggggc atcagtggca gctgcggcag 781 cagcagcggt agctgtggtg gagagcatgg tgacagccac tgaggttgca ccaccacctc 841 cccctgtgga ggtgcctatc cgcaaggcca agaccaagga gggcaaaggt cccaatgctc 901 ggaggaagcc caagggcagc cctcgtgtac ctgatgccaa gaagcctaaa cccaagaaag 961 tagctcccct gaaaatcaag ctgggaggtt ttggttccaa gcgtaagaga tcctcgagtg 1021 aggatgatga cttagatgtg gaatctgact tcgatgatgc cagtatcaat agctattctg 1081 tttctgatgg ttccaccagc cgtagtagcc gcagccgcaa gaaactccga accactaaaa 1141 agaaaaagaa aggcgaggag gaggtgactg ctgtggatgg ttatgagaca gaccaccagg 1201 actattgcga ggtgtgccag caaggcggtg agatcatcct gtgtgatacc tgtccccgtg 1261 cttaccacat ggtctgcctg gatcccgaca tggagaaggc tcccgagggc aagtggagct 1321 gcccacactg cgagaaggaa ggcatccagt gggaagctaa agaggacaat tcggagggtg 1381 aggagatcct ggaagaggtt gggggagacc tcgaagagga ggatgaccac catatggaat 1441 tctgtcgggt ctgcaaggat ggtggggaac tgctctgctg tgatacctgt ccttcttcct 1501 accacatcca ctgcctgaat cccccacttc cagagatccc caacggtgaa tggctctgtc 1561 cccgttgtac gtgtccagct ctgaagggca aagtgcagaa gatcctaatc tggaagtggg 1621 gtcagccacc atctcccaca ccagtgcctc ggcctccaga tgctgatccc aacacgccct 1681 ccccaaagcc cttggagggg cggccagagc ggcagttctt tgtgaaatgg caaggcatgt 1741 cttactggca ctgctcctgg gtttctgaac tgcagctgga gctgcactgt caggtgatgt 1801 tccgaaacta tcagcggaag aatgatatgg atgagccacc ttctggggac tttggtggtg 1861 atgaagagaa aagccgaaag cgaaagaaca aggaccctaa atttgcagag atggaggaac 1921 gcttctatcg ctatgggata aaacccgagt ggatgatgat ccaccgaatc ctcaaccaca 1981 gtgtggacaa gaagggccac gtccactact tgatcaagtg gcgggactta ccttacgatc 2041 aggcttcttg ggagagtgag gatgtggaga tccaggatta cgacctgttc aagcagagct 2101 attggaatca cagggagtta atgaggggtg aggaaggccg accaggcaag aagctcaaga 2161 aggtgaagct tcggaagttg gagaggcctc cagaaacgcc aacagttgat ccaacagtga 2221 agtatgagcg acagccagag tacctggatg ctacaggtgg aaccctgcac ccctatcaaa 2281 tggagggcct gaattggttg cgcttctcct gggctcaggg cactgacacc atcttggctg 2341 atgagatggg ccttgggaaa actgtacaga cagcagtctt cctgtattcc ctttacaagg 2401 agggtcattc caaaggcccc ttcctagtga gcgcccctct ttctaccatc atcaactggg 2461 agcgggagtt tgaaatgtgg gctccagaca tgtatgtcgt aacctatgtg ggtgacaagg 2521 acagccgtgc catcatccga gagaatgagt tctcctttga agacaatgcc attcgtggtg 2581 gcaagaaggc ctcccgcatg aagaaagagg catctgtgaa attccatgtg ctgctgacat 2641 cctatgaatt gatcaccatt gacatggcta ttttgggctc tattgattgg gcctgcctca 2701 tcgtggatga agcccatcgg ctgaagaaca atcagtctaa gttcttccgg gtattgaatg 2761 gttactcact ccagcacaag ctgttgctga ctgggacacc attacaaaac aatctggaag 2821 agttgtttca tctgctcaac tttctcaccc ccgagaggtt ccacaatttg gaaggttttt 2881 tggaggagtt tgctgacatt gccaaggagg accagataaa aaaactgcat gacatgctgg 2941 ggccgcacat gttgcggcgg ctcaaagccg atgtgttcaa gaacatgccc tccaagacag 3001 aactaattgt gcgtgtggag ctgagcccta tgcagaagaa atactacaag tacatcctca 3061 ctcgaaattt tgaagcactc aatgcccgag gtggtggcaa ccaggtgtct ctgctgaatg 3121 tggtgatgga tcttaagaag tgctgcaacc atccatacct cttccctgtg gctgcaatgg 3181 aagctcctaa gatgcctaat ggcatgtatg atggcagtgc cctaatcaga gcatctggga 3241 aattattgct gctgcagaaa atgctcaaga accttaagga gggtgggcat cgtgtactca 3301 tcttttccca gatgaccaag atgctagacc tgctagagga tttcttggaa catgaaggtt 3361 ataaatacga acgcatcgat ggtggaatca ctgggaacat gcggcaagag gccattgacc 3421 gcttcaatgc accgggtgct cagcagttct gcttcttgct ttccactcga gctgggggcc 3481 ttggaatcaa tctggccact gctgacacag ttattatcta tgactctgac tggaaccccc 3541 ataatgacat tcaggccttt agcagagctc accggattgg gcaaaataaa aaggtaatga 3601 tctaccggtt tgtgacccgt gcgtcagtgg aggagcgcat cacgcaggtg gcaaagaaga 3661 aaatgatgct gacgcatcta gtggtgcggc ctgggctggg ctccaagact ggatctatgt 3721 ccaaacagga gcttgatgat atcctcaaat ttggcactga ggaactattc aaggatgaag 3781 ccactgatgg aggaggagac aacaaagagg gagaagatag cagtgttatc cactacgatg 3841 ataaggccat tgaacggctg ctagaccgta accaggatga gactgaagac acagaattgc 3901 agggcatgaa tgaatatttg agctcattca aagtggccca gtatgtggta cgggaagaag 3961 aaatggggga ggaagaggag gtagaacggg aaatcattaa acaggaagaa agtgtggatc 4021 ctgactactg ggagaaattg ctgcggcacc attatgagca gcagcaagaa gatctagccc 4081 gaaatctggg caaaggaaaa agaatccgta aacaggtcaa ctacaatgat ggctcccagg 4141 aggaccgaga ttggcaggac gaccagtccg acaaccagtc cgattactca gtggcttcag 4201 aggaaggtga tgaagacttt gatgaacgtt cagaagctcc ccgtaggccc agtcgtaagg 4261 gcctgcggaa tgataaagat aagccattgc ctcctctgtt ggcccgtgtt ggtgggaata 4321 ttgaagtact tggttttaat gctcgtcagc gaaaagcctt tcttaatgca attatgcgat 4381 atggtatgcc acctcaggat gcttttacta cccagtggct tgtaagagac ctgcgaggca 4441 aatcagagaa agagttcaag gcatatgtct ctcttttcat gcggcattta tgtgagccgg 4501 gggcagatgg ggctgagacc tttgctgatg gtgtcccccg agaaggcctg tctcgccagc 4561 atgtccttac tagaattggt gttatgtctt tgattcgcaa gaaggttcag gagtttgaac 4621 atgttaatgg gcgctggagc atgcctgaac tggctgaggt ggaggaaaac aagaagatgt 4681 cccagccagg gtcaccctcc ccaaaaactc ctacaccctc cactccaggg gacacgcagc 4741 ccaacactcc tgcacctgtc ccacctgctg aagatgggat aaaaatagag gaaaatagcc 4801 tcaaagaaga agagagcata gaaggagaaa aggaggttaa atctacagcc cctgagactg 4861 ccattgagtg tacacaggcc cctgcccctg cctcagagga tgaaaaggtc gttgttgaac 4921 cccctgaggg agaggagaaa gtggaaaagg cagaggtgaa ggagagaaca gaggaaccta 4981 tggagacaga gcccaaaggt gctgctgatg tagagaaggt ggaggaaaag tcagcaatag 5041 atctgacccc tattgtggta gaagacaaag aagagaagaa agaagaagaa gagaaaaaag 5101 aggtgatgct tcagaatgga gagaccccca aggacctgaa tgatgagaaa cagaagaaaa 5161 atattaaaca acgtttcatg tttaacattg cagatggtgg ttttactgag ttgcactccc 5221 tttggcagaa tgaagagcgg gcagccacag ttaccaagaa gacttatgag atctggcatc 5281 gacggcatga ctactggctg ctagccggca ttataaacca tggctatgcc cggtggcaag 5341 acatccagaa tgacccacgc tatgccatcc tcaatgagcc tttcaagggt gaaatgaacc 5401 gtggcaattt cttagagatc aagaataaat ttctagctcg aaggtttaag ctcttagaac 5461 aagctctggt gattgaggaa cagctgcgcc gggctgctta cttgaacatg tcagaagacc 5521 cttctcaccc ttccatggcc ctcaacaccc gctttgctga ggtggagtgt ttggcggaaa 5581 gtcatcagca cctgtccaag gagtcaatgg caggaaacaa gccagccaat gcagtcctgc 5641 acaaagttct gaaacagctg gaagaactgc tgagtgacat gaaagctgat gtgactcgac 5701 tcccagctac cattgcccga attcccccag ttgctgtgag gttacagatg tcagagcgta 5761 acattctcag ccgcctggca aaccgggcac ccgaacctac cccacagcag gtagcccagc 5821 agcagtgaag atgcagactg ataccacctc caccgctgag cagtgacctt cctcactttc 5881 tcttgtccca gcttctcccc tgggggcctg agagaccctc accttccttc tgcccatctt 5941 ccatgttgta aaggaacagc cccagtgcac tgggggaggg gagggagtga ggggcagtgg 6001 tgcccttcct gcagaagaga catgcagcag tagcgctggc gccatctgca ggagctggcg 6061 ggctggcctt ctggaccctg gcttctcccc actgtaacgc ctgttacaca caaactgttg 6121 tgggttcctg ccaggcttga agaaaatgat ctgaattttt tcctcctttt ggttttattt 6181 tgttggttta ttttgtgttt tcttttctcc tttttggggg gtattcagag tgggctgggc 6241 ccctgggcga gacacagcta cctctgttgg catcttttta ataccaggaa cccagcggct 6301 ctagccactg agcggctaaa tgaaataaag tggaaaaaaa aaaaaaagga aaaaaccaaa 6361 agcataaaaa accacagcaa atttcttgat gaaaattgaa aataaaagtt tccttgt // LOCUS HSMIMPA 897 bp RNA PRI 16-NOV-1993 DEFINITION H.sapiens mRNA for myo-insositol monophosphatase. ACCESSION X66922 S38980 NID g395339 KEYWORDS myo-inositol-1(or 4)-monophosphatase; myo-insositol monophosphatase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 897) AUTHORS McAllister,G., Whiting,P., Hammond,E.A., Knowles,M.R., Atack,J.R., Bailey,F.J., Maigetter,R. and Ragan,C.I. TITLE cDNA cloning of human and rat brain myo-inositol monophosphatase. Expression and characterization of the human recombinant enzyme JOURNAL Biochem. J. 284 (Pt 3), 749-754 (1992) MEDLINE 92321996 FEATURES Location/Qualifiers source 1..897 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /clone_lib="hippocampla cDNA in lambda-ZAP" CDS 37..870 /standard_name="myo-insositol monophosphatase" /EC_number="3.1.3.25" /codon_start=1 /product="myo-inositol-1(or 4)-monophosphatase" /db_xref="PID:g395340" /db_xref="SWISS-PROT:P29218" /translation="MADPWQECMDYAVTLARQAGEVVCEAIKNEMNVMLKSSPVDLVT ATDQKVEKMLISSIKEKYPSHSFIGEESVAAGEKSILTDNPTWIIDPIDGTTNFVHRF PFVAVSIGFAVNKKIEFGVVYSCVEGKMYTARKGKGAFCNGQKLQVSQQEDITKSLLV TELGSSRTPETVRMVLSNMEKLFCIPVHGIRSVGTAAVNMCLVATGGADAYYEMGIHC WDVAGAGIIVTEAGGVLMDVTGGPFDLMSRRVIAANNRILAERIAKEIQVIPLQRDDE D" BASE COUNT 282 a 143 c 213 g 259 t ORIGIN 1 ctccgactca agatatttgt caaatatttt cagaagatgg ctgatccttg gcaggaatgc 61 atggattatg cagtaactct agcaagacaa gctggagagg tagtttgtga agctataaaa 121 aatgaaatga atgttatgct gaaaagttct ccagttgatt tggtaactgc tacggaccaa 181 aaagttgaaa aaatgcttat ctcttccata aaggaaaagt atccatctca cagtttcatt 241 ggtgaagaat ctgtggcagc tggggaaaaa agtatcttaa ccgacaaccc cacatggatc 301 attgacccta ttgatggaac aactaacttt gtacatagat ttccttttgt agctgtttca 361 attggctttg ctgtaaataa aaagatagaa tttggagttg tgtacagttg tgtggaaggc 421 aagatgtaca ctgccagaaa aggaaaaggg gccttttgta atggtcaaaa actacaagtt 481 tcacaacaag aagatattac caaatctctc ttggtgactg agttgggctc ttctagaaca 541 ccagagactg tgagaatggt tctttctaat atggaaaagc ttttttgcat tcctgttcat 601 gggatccgga gtgttggaac agcagctgtt aatatgtgcc ttgtggcaac tggcggagca 661 gatgcatatt atgaaatggg aattcactgc tgggatgttg caggagctgg cattattgtt 721 actgaagctg gtggcgtgct aatggatgtt acaggtggac catttgattt gatgtcacga 781 agagtaattg ctgcaaataa tagaatatta gcagaaagga tagctaaaga aattcaggtt 841 atacctttgc aacgagacga cgaagattaa ttaaggcagc tcatagtcat ccagttg // LOCUS HSMITFRN 1788 bp RNA PRI 24-MAR-1994 DEFINITION H.sapiens mitF mRNA. ACCESSION Z29678 NID g468496 KEYWORDS DNA-binding protein; mitF gene; MITF protein; transcription factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1788) AUTHORS Tachibana,M., Perez-Jurado,L.A., Nakayama,A., Hodgkinson,C.A., Li,X., Schneider,M., Miki,T., Fex,J., Francke,U. and Arnheiter,H. TITLE Cloning of MITF, the human homolog of the mouse microphthalmia gene and assignment to chromosome 3p14.1-p12.3 JOURNAL Hum. Mol. Genet. 3 (4), 553-557 (1994) MEDLINE 94348499 REFERENCE 2 (bases 1 to 1788) AUTHORS Arnheiter,H. TITLE Direct Submission JOURNAL Submitted (04-FEB-1994) Heinz Arnheiter, NINDS, NIH, Laboratory of Viral and Molecular, Pathogenesis, 9000 Rockville Pike, Bethesda, Maryland, MD 20892, USA FEATURES Location/Qualifiers source 1..1788 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="skin" /cell_type="melanocyte" /cell_line="normal adult human epidermal melanocyte" /chromosome="3p" CDS 121..1380 /note="basic-helix-loop-helix-zipper protein" /codon_start=1 /product="MITF protein" /db_xref="PID:g468497" /translation="MLEMLEYNHYQVQTHLENPTKYHIQQAQRQQVKQYLSTTLANKH ANQVLSLPCPNQPGDHVMPPVPGSSAPNSPMAMLTLNSNCEKEGFYKFEEQNRAESEC PGMNTHSRASCMQMDDVIDDIISLESSYNEEILGLMDPALQMANTLPVSGNLIDLYGN QGLPPPGLTISNSCPANLPNIKRELTACIFPTESEARALAKERQKKDNHNLIERRRRF NINDRIKELGTLIPKSNDPDMRWNKGTILKASVDYIRKLQREQQRAKELENRQKKLEH ANRHLLLRIQELEMQARAHGLSLIPSTGLCSPDLVNRIIKQEPVLENCSQDLLQHHAD LTCTTTLDLTDGTITFNNNLGTGTEANQAYSVPTKMGSKLEDILMDDTLSPVGVTDPL LSSVSPGASKTSSRRSSMSMEETEHTC" BASE COUNT 550 a 434 c 364 g 440 t ORIGIN 1 gggatacctt gtttatagta ccttctcttt gccagtccat cttcaaattg gaattataga 61 aagtagaggg agggatagtc taccgtctct cactggattg gtgccaccta aaacattgtt 121 atgctggaaa tgctagaata taatcactat caggtgcaga cccacctcga aaaccccacc 181 aagtaccaca tacagcaagc ccaacggcag caggtaaagc agtacctttc taccacttta 241 gcaaataaac atgccaacca agtcctgagc ttgccatgtc caaaccagcc tggcgatcat 301 gtcatgccac cggtgccggg gagcagcgca cccaacagcc ccatggctat gcttacgctt 361 aactccaact gtgaaaaaga gggattttat aagtttgaag agcaaaacag ggcagagagc 421 gagtgcccag gcatgaacac acattcacga gcgtcctgta tgcagatgga tgatgtaatc 481 gatgacatca ttagcctaga atcaagttat aatgaggaaa tcttgggctt gatggatcct 541 gctttgcaaa tggcaaatac gttgcctgtc tcgggaaact tgattgatct ttatggaaac 601 caaggtctgc ccccaccagg cctcaccatc agcaactcct gtccagccaa ccttcccaac 661 ataaaaaggg agctcacagc gtgtattttt cccacagagt ctgaagcaag agcactggcc 721 aaagagaggc agaaaaagga caatcacaac ctgattgaac gaagaagaag atttaacata 781 aatgaccgca ttaaagaact aggtactttg attcccaagt caaatgatcc agacatgcgc 841 tggaacaagg gaaccatctt aaaagcatcc gtggactata tccgaaagtt gcaacgagaa 901 cagcaacgcg caaaagaact tgaaaaccga cagaagaaac tggagcacgc caaccggcat 961 ttgttgctca gaatacagga acttgaaatg caggctcgag ctcatggact ttcccttatt 1021 ccatccacgg gtctctgctc tccagatttg gtgaatcgga tcatcaagca agaacccgtt 1081 cttgagaact gcagccaaga cctccttcag catcatgcag acctaacctg tacaacaact 1141 ctcgatctca cggatggcac catcaccttc aacaacaacc tcggaactgg gactgaggcc 1201 aaccaagcct atagtgtccc cacaaaaatg ggatccaaac tggaagacat cctgatggac 1261 gacacccttt ctcccgtcgg tgtcactgat ccactccttt cctcagtgtc ccccggagct 1321 tccaaaacaa gcagccggag gagcagtatg agcatggaag agacggagca cacttgttag 1381 cgaatcctcc ctgcactgca ttcgcacaaa ctgcttcctt tcttgattcg tagatttaat 1441 aacttacctg aaggggtttt cttgataatt ttcctttaat atgaaatttt ttttcatgct 1501 ttatcaatag cccaggatat attttatttt tagaattttg tgaaacagac ttgtatattc 1561 tattttacaa ctacaaatgc ctccaaagta ttgtacaaat aagtgtgcag tatctgtgaa 1621 ctgaattcac cacagacttt agctttctga gcaagaggat tttgcgtcag agaaatgtct 1681 gtccattttt attcagggga aacttgattt gagattttta tgcctgtgac ttccttggaa 1741 atcaaatgta aagtttaatt gaaagaatgt aaagcaaccc cccaaaaa // LOCUS HSMKI67 12515 bp RNA PRI 31-JAN-1994 DEFINITION H.sapiens mki67a mRNA (long type) for antigen of monoclonal antibody Ki-67. ACCESSION X65550 NID g415818 KEYWORDS antigen; monoclonal antibody. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 12515) AUTHORS Gerdes Fors,J. TITLE Direct Submission JOURNAL Submitted (11-APR-1992) J. Gerdes, Forschungsinstitut Borstel, Div. Molecular Immunology, Parkallee 22, 2061 Borstel, FRG REMARK sequence revised by author 13-JUL-93 and 08-OCT-93 REFERENCE 2 (bases 1 to 12515) AUTHORS Schluter,C., Duchrow,M., Wohlenberg,C., Becker,M.H., Key,G., Flad,H.D. and Gerdes,J. TITLE The cell proliferation-associated antigen of antibody Ki-67: a very large, ubiquitous nuclear protein with numerous repeated elements, representing a new kind of cell cycle-maintaining proteins JOURNAL J. Cell Biol. 123 (3), 513-522 (1993) MEDLINE 94043435 FEATURES Location/Qualifiers source 1..12515 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="IM9" /clone_lib="lambda gt11" /chromosome="10q25-qter" exon 1..107 /number=1 exon 108..288 /number=2 gene 197..9964 /gene="mki67" CDS 197..9967 /codon_start=1 /product="antigen of the monoclonal antibody Ki-67" /db_xref="PID:g415819" /db_xref="SWISS-PROT:P46013" /translation="MWPTRRLVTIKRSGVDGPHFPLSLSTCLFGRGIECDIRIQLPVV SKQHCKIEIHEQEAILHNFSSTNPTQVNGSVIDEPVRLKHGDVITIIDRSFRYENESL QNGRKSTEFPRKIREQEPARRVSRSSFSSDPDEKAQDSKAYSKITEGKVSGNPQVHIK NVKEDSTADDSKDSVAQGTTNVHSSEHAGRNGRNAADPISGDFKEISSVKLVSRYGEL KSVPTTQCLDNSKKNESPFWKLYESVKKELDVKSQKENVLQYCRKSGLQTDYATEKES ADGLQGETQLLVSRKSRPKSGGSGHAVAEPASPEQELDQNKGKGRDVESVQTPSKAVG ASFPLYEPAKMKTPVQYSQQQNSPQKHKNKDLYTTGRRESVNLGKSEGFKAGDKTLTP RKLSTRNRTPAKVEDAADSATKPENLSSKTRGSIPTDVEVLPTETEIHNEPFLTLWLT QVERKIQKDSLSKPEKLGTTAGQMCSGLPGLSSVDINNFGDSINESEGIPLKRRRVSF GGHLRPELFDENLPPNTPLKRGEAPTKRKSLVMHTPPVLKKIIKEQPQPSGKQESGSE IHVEVKAQSLVISPPAPSPRKTPVASDQRRRSCKTAPASSSKSQTEVPKRGGERVATC LQKRVSISRSQHDILQMICSKRRSGASEANLIVAKSWADVVKLGAKQTQTKVIKHGPQ RSMNKRQRRPATPKKPVGEVHSQFSTGHANSPCTIIIGKAHTEKVHVPARPYRVLNNF ISNQKMDFKEDLSGIAEMFKTPVKEQPQLTSTCHIAISNSENLLGKQFQGTDSGEEPL LPTSESFGGNVFFSAQNAAKQPSDKCSASPPLRRQCIRENGNVAKTPRNTYKMTSLET KTSDTETEPSKTVSTVNRSGRSTEFRNIQKLPVESKSEETNTEIVECILKRGQKATLL QQRREGEMKEIERPFETYKENIELKENDEKMKAMKRSRTWGQKCAPMSDLTDLKSLPD TELMKDTARGQNLLQTQDHAKAPKSEKGKITKMPCQSLQPEPINTPTHTKQQLKASLG KVGVKEELLAVGKFTRTSGETTHTHREPAGDGKSIRTFKESPKQILDPAARVTGMKKW PRTPKEEAQSLEDLAGFKELFQTPGPSEESMTDEKTTKIACKSPPPESVDTPTSTKQW PKRSLRKADVEEEFLALRKLTPSAGKAMLTPKPAGGDEKDIKAFMGTPVQKLDLAGTL PGSKRQLQTPKEKAQALEDLAGFKELFQTPGHTEELVAAGKTTKIPCDSPQSDPVDTP TSTKQRPKRSIRKADVEGELLACRNLMPSAGKAMHTPKPSVGEEKDIIIFVGTPVQKL DLTENLTGSKRRPQTPKEEAQALEDLTGFKELFQTPGHTEEAVAAGKTTKMPCESSPP ESADTPTSTRRQPKTPLEKRDVQKELSALKKLTQTSGETTHTDKVPGGEDKSINAFRE TAKQKLDPAASVTGSKRHPKTKEKAQPLEDLAGWKELFQTPVCTDKPTTHEKTTKIAC RSQPDPVDTPTSSKPQSKRSLRKVDVEEEFFALRKRTPSAGKAMHTPKPAVSGEKNIY AFMGTPVQKLDLTENLTGSKRRLQTPKEKAQALEDLAGFKELFQTRGHTEESMTNDKT AKVACKSSQPDLDKNPASSKRRLKTSLGKVGVKEELLAVGKLTQTSGETTHTHTEPTG DGKSMKAFMESPKQILDSAASLTGSKRQLRTPKGKSEVPEDLAGFIELFQTPSHTKES MTNEKTTKVSYRASQPDLVDTPTSSKPQPKRSLRKADTEEEFLAFRKQTPSAGKAMHT PKPAVGEEKDINTFLGTPVQKLDQPGNLPGSNRRLQTRKEKAQALEELTGFRELFQTP CTDNPTADEKTTKKILCKSPQSDPADTPTNTKQRPKRSLKKADVEEEFLAFRKLTPSA GKAMHTPKAAVGEEKDINTFVGTPVEKLDLLGNLPGSKRRPQTPKEKAKALEDLAGFK ELFQTPGHTEESMTDDKITEVSCKSPQPDPVKTPTSSKQRLKISLGKVGVKEEVLPVG KLTQTSGKTTQTHRETAGDGKSIKAFKESAKQMLDPANYGTGMERWPRTPKEEAQSLE DLAGFKELFQTPDHTEESTTDDKTTKIACKSPPPESMDTPTSTRRRPKTPLGKRDIVE ELSALKQLTQTTHTDKVPGDEDKGINVFRETAKQKLDPAASVTGSKRQPRTPKGKAQP LEDLAGLKELFQTPVCTDKPTTHEKTTKIACRSPQPDPVGTPTIFKPQSKRSLRKADV EEESLALRKRTPSVGKAMDTPKPAGGDEKDMKAFMGTPVQKLDLPGNLPGSKRWPQTP KEKAQALEDLAGFKELFQTPGTDKPTTDEKTTKIACKSPQPDPVDTPASTKQRPKRNL RKADVEEEFLALRKRTPSAGKAMDTPKPAVSDEKNINTFVETPVQKLDLLGNLPGSKR QPQTPKEKAEALEDLVGFKELFQTPGHTEESMTDDKITEVSCKSPQPESFKTSRSSKQ RLKIPLVKVDMKEEPLAVSKLTRTSGETTQTHTEPTGDSKSIKAFKESPKQILDPAAS VTGSRRQLRTRKEKARALEDLVDFKELFSAPGHTEESMTIDKNTKIPCKSPPPELTDT ATSTKRCPKTRPRKEVKEELSAVERLTQTSGQSTHTHKEPASGDEGIKVLKQRAKKKP NPVEEEPSRRRPRAPKEKAQPLEDLAGFTELSETSGHTQESLTAGKATKIPCESPPLE VVDTTASTKRHLRTRVQKVQVKEEPSAVKFTQTSGETTDADKEPAGEDKGIKALKESA KQTPAPAASVTGSRRRPRAPRESAQAIEDLAGFKDPAAGHTEESMTDDKTTKIPCKSS PELEDTATSSKRRPRTRAQKVEVKEELLAVGKLTQTSGETTHTDKEPVGEGKGTKAFK QPAKRNVDAEDVIGSRRQPRAPKEKAQPLEDLASFQELSQTPGHTEELANGAADSFTS APKQTPDSGKPLKISRRVLRAPKVEPVGDVVSTRDPVKSQSKSNTSLPPLPFKRGGGK DGSVTGTKRLRCMPAPEEIVEELPASKKQRVAPRARGKSSEPVVIMKRSLRTSAKRIE PAEELNSNDMKTNKEEHKLQDSVPENKGISLRSRRQDKTEAEQQITEVFVLAERIEIN RNEKKPMKTSPEMDIQNPDDGARKPIPRDKVTENKRCLRSARQNESSQPKVAEESGGQ KSAKVLMQNQKGKGEAGNSDSMCLRSRKTKSQPAASTLESKSVQRVTRSVKRCAENPK KAEDNVCVKKITTRSHRDSEDI" exon 289..367 /gene="mki67" /number=3 exon 368..483 /gene="mki67" /number=4 exon 484..550 /gene="mki67" /number=5 exon 561..596 /gene="mki67" /number=6 exon 597..1676 /gene="mki67" /note="partialy excluded by splicing" /number=7 exon 1677..1852 /gene="mki67" /number=8 exon 1853..2165 /gene="mki67" /number=9 exon 2166..2284 /gene="mki67" /number=10 exon 2285..2456 /gene="mki67" /number=11 exon 2457..2612 /gene="mki67" /number=12 exon 2613..9457 /gene="mki67" /number=13 exon 9458..9901 /gene="mki67" /number=14 exon 9902..12493 /number=15 polyA_signal 9994..9999 polyA_signal 10529..10534 polyA_signal 11237..11242 polyA_signal 12468..12473 polyA_site 12494 BASE COUNT 4166 a 3048 c 2928 g 2373 t ORIGIN 1 ctaccgggcg gaggtgagcg cggcgccggc tcctcctgcg gcggactttg ggtgcgactt 61 gacgagcggt ggttcgacaa gtggccttgc gggccggatc gtcccagtgg aagagttgta 121 aatttgcttc tggccttccc ctacggatta tacctggcct tcccctacgg attatactca 181 acttactgtt tagaaaatgt ggcccacgag acgcctggtt actatcaaaa ggagcggggt 241 cgacggtccc cactttcccc tgagcctcag cacctgcttg tttggaaggg gtattgaatg 301 tgacatccgt atccagcttc ctgttgtgtc aaaacaacat tgcaaaattg aaatccatga 361 gcaggaggca atattacata atttcagttc cacaaatcca acacaagtaa atgggtctgt 421 tattgatgag cctgtacggc taaaacatgg agatgtaata actattattg atcgttcctt 481 caggtatgaa aatgaaagtc ttcagaatgg aaggaagtca actgaatttc caagaaaaat 541 acgtgaacag gagccagcac gtcgtgtctc aagatctagc ttctcttctg accctgatga 601 gaaagctcaa gattccaagg cctattcaaa aatcactgaa ggaaaagttt caggaaatcc 661 tcaggtacat atcaagaatg tcaaagaaga cagtaccgca gatgactcaa aagacagtgt 721 tgctcaggga acaactaatg ttcattcctc agaacatgct ggacgtaatg gcagaaatgc 781 agctgatccc atttctgggg attttaaaga aatttccagc gttaaattag tgagccgtta 841 tggagaattg aagtctgttc ccactacaca atgtcttgac aatagcaaaa aaaatgaatc 901 tcccttttgg aagctttatg agtcagtgaa gaaagagttg gatgtaaaat cacaaaaaga 961 aaatgtccta cagtattgta gaaaatctgg attacaaact gattacgcaa cagagaaaga 1021 aagtgctgat ggtttacagg gggagaccca actgttggtc tcgcgtaagt caagaccaaa 1081 atctggtggg agcggccacg ctgtggcaga gcctgcttca cctgaacaag agcttgacca 1141 gaacaagggg aagggaagag acgtggagtc tgttcagact cccagcaagg ctgtgggcgc 1201 cagctttcct ctctatgagc cggctaaaat gaagacccct gtacaatatt cacagcaaca 1261 aaattctcca caaaaacata agaacaaaga cctgtatact actggtagaa gagaatctgt 1321 gaatctgggt aaaagtgaag gcttcaaggc tggtgataaa actcttactc ccaggaagct 1381 ttcaactaga aatcgaacac cagctaaagt tgaagatgca gctgactctg ccactaagcc 1441 agaaaatctc tcttccaaaa ccagaggaag tattcctaca gatgtggaag ttctgcctac 1501 ggaaactgaa attcacaatg agccattttt aactctgtgg ctcactcaag ttgagaggaa 1561 gatccaaaag gattccctca gcaagcctga gaaattgggc actacagctg gacagatgtg 1621 ctctgggtta cctggtctta gttcagttga tatcaacaac tttggtgatt ccattaatga 1681 gagtgaggga atacctttga aaagaaggcg tgtgtccttt ggtgggcacc taagacctga 1741 actatttgat gaaaacttgc ctcctaatac gcctctcaaa aggggagaag ccccaaccaa 1801 aagaaagtct ctggtaatgc acactccacc tgtcctgaag aaaatcatca aggaacagcc 1861 tcaaccatca ggaaaacaag agtcaggttc agaaatccat gtggaagtga aggcacaaag 1921 cttggttata agccctccag ctcctagtcc taggaaaact ccagttgcca gtgatcaacg 1981 ccgtaggtcc tgcaaaacag cccctgcttc cagcagcaaa tctcagacag aggttcctaa 2041 gagaggagga gaaagagtgg caacctgcct tcaaaagaga gtgtctatca gccgaagtca 2101 acatgatatt ttacagatga tatgttccaa aagaagaagt ggtgcttcgg aagcaaatct 2161 gattgttgca aaatcatggg cagatgtagt aaaacttggt gcaaaacaaa cacaaactaa 2221 agtcataaaa catggtcctc aaaggtcaat gaacaaaagg caaagaagac ctgctactcc 2281 aaagaagcct gtgggcgaag ttcacagtca atttagtaca ggccacgcaa actctccttg 2341 taccataata atagggaaag ctcatactga aaaagtacat gtgcctgctc gaccctacag 2401 agtgctcaac aacttcattt ccaaccaaaa aatggacttt aaggaagatc tttcaggaat 2461 agctgaaatg ttcaagaccc cagtgaagga gcaaccgcag ttgacaagca catgtcacat 2521 cgctatttca aattcagaga atttgcttgg aaaacagttt caaggaactg attcaggaga 2581 agaacctctg ctccccacct cagagagttt tggaggaaat gtgttcttca gtgcacagaa 2641 tgcagcaaaa cagccatctg ataaatgctc tgcaagccct cccttaagac ggcagtgtat 2701 tagagaaaat ggaaacgtag caaaaacgcc caggaacacc tacaaaatga cttctctgga 2761 gacaaaaact tcagatactg agacagagcc ttcaaaaaca gtatccactg taaacaggtc 2821 aggaaggtct acagagttca ggaatataca gaagctacct gtggaaagta agagtgaaga 2881 aacaaataca gaaattgttg agtgcatcct aaaaagaggt cagaaggcaa cactactaca 2941 acaaaggaga gaaggagaga tgaaggaaat agaaagacct tttgagacat ataaggaaaa 3001 tattgaatta aaagaaaacg atgaaaagat gaaagcaatg aagagatcaa gaacttgggg 3061 gcagaaatgt gcaccaatgt ctgacctgac agacctcaag agcttgcctg atacagaact 3121 catgaaagac acggcacgtg gccagaatct cctccaaacc caagatcatg ccaaggcacc 3181 aaagagtgag aaaggcaaaa tcactaaaat gccctgccag tcattacaac cagaaccaat 3241 aaacacccca acacacacaa aacaacagtt gaaggcatcc ctggggaaag taggtgtgaa 3301 agaagagctc ctagcagtcg gcaagttcac acggacgtca ggggagacca cgcacacgca 3361 cagagagcca gcaggagatg gcaagagcat cagaacgttt aaggagtctc caaagcagat 3421 cctggaccca gcagcccgtg taactggaat gaagaagtgg ccaagaacgc ctaaggaaga 3481 ggcccagtca ctagaagacc tggctggctt caaagagctc ttccagacac caggtccctc 3541 tgaggaatca atgactgatg agaaaactac caaaatagcc tgcaaatctc caccaccaga 3601 atcagtggac actccaacaa gcacaaagca atggcctaag agaagtctca ggaaagcaga 3661 tgtagaggaa gaattcttag cactcaggaa actaacacca tcagcaggga aagccatgct 3721 tacgcccaaa ccagcaggag gtgatgagaa agacattaaa gcatttatgg gaactccagt 3781 gcagaaactg gacctggcag gaactttacc tggcagcaaa agacagctac agactcctaa 3841 ggaaaaggcc caggctctag aagacctggc tggctttaaa gagctcttcc agactcctgg 3901 tcacaccgag gaattagtgg ctgctggtaa aaccactaaa ataccctgcg actctccaca 3961 gtcagaccca gtggacaccc caacaagcac aaagcaacga cccaagagaa gtatcaggaa 4021 agcagatgta gagggagaac tcttagcgtg caggaatcta atgccatcag caggcaaagc 4081 catgcacacg cctaaaccat cagtaggtga agagaaagac atcatcatat ttgtgggaac 4141 tccagtgcag aaactggacc tgacagagaa cttaaccggc agcaagagac ggccacaaac 4201 tcctaaggaa gaggcccagg ctctggaaga cctgactggc tttaaagagc tcttccagac 4261 ccctggtcat actgaagaag cagtggctgc tggcaaaact actaaaatgc cctgcgaatc 4321 ttctccacca gaatcagcag acaccccaac aagcacaaga aggcagccca agacaccttt 4381 ggagaaaagg gacgtacaga aggagctctc agccctgaag aagctcacac agacatcagg 4441 ggaaaccaca cacacagata aagtaccagg aggtgaggat aaaagcatca acgcgtttag 4501 ggaaactgca aaacagaaac tggacccagc agcaagtgta actggtagca agaggcaccc 4561 aaaaactaag gaaaaggccc aacccctaga agacctggct ggctggaaag agctcttcca 4621 gacaccagta tgcactgaca agcccacgac tcacgagaaa actaccaaaa tagcctgcag 4681 atcacaacca gacccagtgg acacaccaac aagctccaag ccacagtcca agagaagtct 4741 caggaaagtg gacgtagaag aagaattctt cgcactcagg aaacgaacac catcagcagg 4801 caaagccatg cacacaccca aaccagcagt aagtggtgag aaaaacatct acgcatttat 4861 gggaactcca gtgcagaaac tggacctgac agagaactta actggcagca agagacggct 4921 acaaactcct aaggaaaagg cccaggctct agaagacctg gctggcttta aagagctctt 4981 ccagacacga ggtcacactg aggaatcaat gactaacgat aaaactgcca aagtagcctg 5041 caaatcttca caaccagacc tagacaaaaa cccagcaagc tccaagcgac ggctcaagac 5101 atccctgggg aaagtgggcg tgaaagaaga gctcctagca gttggcaagc tcacacagac 5161 atcaggagag actacacaca cacacacaga gccaacagga gatggtaaga gcatgaaagc 5221 atttatggag tctccaaagc agatcttaga ctcagcagca agtctaactg gcagcaagag 5281 gcagctgaga actcctaagg gaaagtctga agtccctgaa gacctggccg gcttcatcga 5341 gctcttccag acaccaagtc acactaagga atcaatgact aatgaaaaaa ctaccaaagt 5401 atcctacaga gcttcacagc cagacctagt ggacacccca acaagctcca agccacagcc 5461 caagagaagt ctcaggaaag cagacactga agaagaattt ttagcattta ggaaacaaac 5521 gccatcagca ggcaaagcca tgcacacacc caaaccagca gtaggtgaag agaaagacat 5581 caacacgttt ttgggaactc cagtgcagaa actggaccag ccaggaaatt tacctggcag 5641 caatagacgg ctacaaactc gtaaggaaaa ggcccaggct ctagaagaac tgactggctt 5701 cagagagctt ttccagacac catgcactga taaccccaca gctgatgaga aaactaccaa 5761 aaaaatactc tgcaaatctc cgcaatcaga cccagcggac accccaacaa acacaaagca 5821 acggcccaag agaagcctca agaaagcaga cgtagaggaa gaatttttag cattcaggaa 5881 actaacacca tcagcaggca aagccatgca cacgcctaaa gcagcagtag gtgaagagaa 5941 agacatcaac acatttgtgg ggactccagt ggagaaactg gacctgctag gaaatttacc 6001 tggcagcaag agacggccac aaactcctaa agaaaaggcc aaggctctag aagatctggc 6061 tggcttcaaa gagctcttcc agacaccagg tcacactgag gaatcaatga ccgatgacaa 6121 aatcacagaa gtatcctgca aatctccaca accagaccca gtcaaaaccc caacaagctc 6181 caagcaacga ctcaagatat ccttggggaa agtaggtgtg aaagaagagg tcctaccagt 6241 cggcaagctc acacagacgt cagggaagac cacacagaca cacagagaga cagcaggaga 6301 tggaaagagc atcaaagcgt ttaaggaatc tgcaaagcag atgctggacc cagcaaacta 6361 tggaactggg atggagaggt ggccaagaac acctaaggaa gaggcccaat cactagaaga 6421 cctggccggc ttcaaagagc tcttccagac accagaccac actgaggaat caacaactga 6481 tgacaaaact accaaaatag cctgcaaatc tccaccacca gaatcaatgg acactccaac 6541 aagcacaagg aggcggccca aaacaccttt ggggaaaagg gatatagtgg aagagctctc 6601 agccctgaag cagctcacac agaccacaca cacagacaaa gtaccaggag atgaggataa 6661 aggcatcaac gtgttcaggg aaactgcaaa acagaaactg gacccagcag caagtgtaac 6721 tggtagcaag aggcagccaa gaactcctaa gggaaaagcc caacccctag aagacttggc 6781 tggcttgaaa gagctcttcc agacaccagt atgcactgac aagcccacga ctcacgagaa 6841 aactaccaaa atagcctgca gatctccaca accagaccca gtgggtaccc caacaatctt 6901 caagccacag tccaagagaa gtctcaggaa agcagacgta gaggaagaat ccttagcact 6961 caggaaacga acaccatcag tagggaaagc tatggacaca cccaaaccag caggaggtga 7021 tgagaaagac atgaaagcat ttatgggaac tccagtgcag aaattggacc tgccaggaaa 7081 tttacctggc agcaaaagat ggccacaaac tcctaaggaa aaggcccagg ctctagaaga 7141 cctggctggc ttcaaagagc tcttccagac accaggcact gacaagccca cgactgatga 7201 gaaaactacc aaaatagcct gcaaatctcc acaaccagac ccagtggaca ccccagcaag 7261 cacaaagcaa cggcccaaga gaaacctcag gaaagcagac gtagaggaag aatttttagc 7321 actcaggaaa cgaacaccat cagcaggcaa agccatggac accccaaaac cagcagtaag 7381 tgatgagaaa aatatcaaca catttgtgga aactccagtg cagaaactgg acctgctagg 7441 aaatttacct ggcagcaaga gacagccaca gactcctaag gaaaaggctg aggctctaga 7501 ggacctggtt ggcttcaaag aactcttcca gacaccaggt cacactgagg aatcaatgac 7561 tgatgacaaa atcacagaag tatcctgtaa atctccacag ccagagtcat tcaaaacctc 7621 aagaagctcc aagcaaaggc tcaagatacc cctggtgaaa gtggacatga aagaagagcc 7681 cctagcagtc agcaagctca cacggacatc aggggagact acgcaaacac acacagagcc 7741 aacaggagat agtaagagca tcaaagcgtt taaggagtct ccaaagcaga tcctggaccc 7801 agcagcaagt gtaactggta gcaggaggca gctgagaact cgtaaggaaa aggcccgtgc 7861 tctagaagac ctggttgact tcaaagagct cttctcagca ccaggtcaca ctgaagagtc 7921 aatgactatt gacaaaaaca caaaaattcc ctgcaaatct cccccaccag aactaacaga 7981 cactgccacg agcacaaaga gatgccccaa gacacgtccc aggaaagaag taaaagagga 8041 gctctcagca gttgagaggc tcacgcaaac atcagggcaa agcacacaca cacacaaaga 8101 accagcaagc ggtgatgagg gcatcaaagt attgaagcaa cgtgcaaaga agaaaccaaa 8161 cccagtagaa gaggaaccca gcaggagaag gccaagagca cctaaggaaa aggcccaacc 8221 cctggaagac ctggccggct tcacagagct ctctgaaaca tcaggtcaca ctcaggaatc 8281 actgactgct ggcaaagcca ctaaaatacc ctgcgaatct cccccactag aagtggtaga 8341 caccacagca agcacaaaga ggcatctcag gacacgtgtg cagaaggtac aagtaaaaga 8401 agagccttca gcagtcaagt tcacacaaac atcaggggaa accacggatg cagacaaaga 8461 accagcaggt gaagataaag gcatcaaagc attgaaggaa tctgcaaaac agacaccggc 8521 tccagcagca agtgtaactg gcagcaggag acggccaaga gcacccaggg aaagtgccca 8581 agccatagaa gacctagctg gcttcaaaga cccagcagca ggtcacactg aagaatcaat 8641 gactgatgac aaaaccacta aaataccctg caaatcatca ccagaactag aagacaccgc 8701 aacaagctca aagagacggc ccaggacacg tgcccagaaa gtagaagtga aggaggagct 8761 gttagcagtt ggcaagctca cacaaacctc aggggagacc acgcacaccg acaaagagcc 8821 ggtaggtgag ggcaaaggca cgaaagcatt taagcaacct gcaaagcgga acgtggacgc 8881 agaagatgta attggcagca ggagacagcc aagagcacct aaggaaaagg cccaacccct 8941 ggaagacctg gccagcttcc aagagctctc tcaaacacca ggccacactg aggaactggc 9001 aaatggtgct gctgatagct ttacaagcgc tccaaagcaa acacctgaca gtggaaaacc 9061 tctaaaaata tccagaagag ttcttcgggc ccctaaagta gaacccgtgg gagacgtggt 9121 aagcaccaga gaccctgtaa aatcacaaag caaaagcaac acttccctgc ccccactgcc 9181 cttcaagagg ggaggtggca aagatggaag cgtcacggga accaagaggc tgcgctgcat 9241 gccagcacca gaggaaattg tggaggagct gccagccagc aagaagcaga gggttgctcc 9301 cagggcaaga ggcaaatcat ccgaacccgt ggtcatcatg aagagaagtt tgaggacttc 9361 tgcaaaaaga attgaacctg cggaagagct gaacagcaac gacatgaaaa ccaacaaaga 9421 ggaacacaaa ttacaagact cggtccctga aaataaggga atatccctgc gctccagacg 9481 ccaagataag actgaggcag aacagcaaat aactgaggtc tttgtattag cagaaagaat 9541 agaaataaac agaaatgaaa agaagcccat gaagacctcc ccagagatgg acattcagaa 9601 tccagatgat ggagcccgga aacccatacc tagagacaaa gtcactgaga acaaaaggtg 9661 cttgaggtct gctagacaga atgagagctc ccagcctaag gtggcagagg agagcggagg 9721 gcagaagagt gcgaaggttc tcatgcagaa tcagaaaggg aaaggagaag caggaaattc 9781 agactccatg tgcctgagat caagaaagac aaaaagccag cctgcagcaa gcactttgga 9841 gagcaaatct gtgcagagag taacgcggag tgtcaagagg tgtgcagaaa atccaaagaa 9901 ggctgaggac aatgtgtgtg tcaagaaaat aacaaccaga agtcataggg acagtgaaga 9961 tatttgacag aaaaatcgaa ctgggaaaaa tataataaag ttagttttgt gataagttct 10021 agtgcagttt ttgtcataaa ttacaagtga attctgtaag taaggctgtc agtctgctta 10081 agggaagaaa actttggatt tgctgggtct gaatcggctt cataaactcc actgggagca 10141 ctgctgggct cctggactga gaatagttga acaccggggg ctttgtgaag gagtctgggc 10201 caaggtttgc cctcagcttt gcagaatgaa gccttgaggt ctgtcaccac ccacagccac 10261 cctacagcag ccttaactgt gacacttgcc acactgtgtc gtcgtttgtt tgcctatgtt 10321 ctccagggca cggtggcagg aacaactatc ctcgtctgtc ccaacactga gcaggcactc 10381 ggtaaacacg aatgaatgga taagcgcacg gatgaatgga gcttacaaga tctgtctttc 10441 caatggccgg gggcatttgg tccccaaatt aaggctattg gacatctgca caggacagtc 10501 ctatttttga tgtcctttcc tttctgaaaa taaagttttg tgctttggag aatgactcgt 10561 gagcacatct ttagggacca agagtgactt tctgtaagga gtgactcgtg gcttgccttg 10621 gtctcttggg aatacttttc taactagggt tgctctcacc tgagacattc tccacccgcg 10681 gaatctcagg gtcccaggct gtgggccatc acgacctcaa actggctcct aatctccagc 10741 tttcctgtca ttgaaagctt cggaagttta ctggctctgc tcccgcctgt tttctttctg 10801 actctatctg gcagcccgat gccacccagt acaggaagtg acaccagtac tctgtaaagc 10861 atcatcatcc ttggagagac tgagcactca gcaccttcag ccacgatttc aggatcgctt 10921 ccttgtgagc cgctgcctcc gaaatctcct ttgaagccca gacatctttc tccagcttca 10981 gacttgtaga tataactcgt tcatcttcat ttactttcca ctttgccccc tgtcctctct 11041 gtgttcccca aatcagagaa tagcccgcca tcccccagat cacctgtctg gattcctccc 11101 cattcaccca ccttgccagg tgcaggtgag gatggtgcac cagacagggt agctgtcccc 11161 caaaatgtgc cctgtgcggg cagtgccctg tctccacgtt tgtttcccca gtgtctggcg 11221 gggagccagg tgacatcata aatacttgct gaatgaatgc agaaatcagc ggtactgact 11281 tgtactatat tggctgccat gatagggttc tcacagcgtc atccatgatc gtaagggaga 11341 atgacattct gcttgaggga gggaatagaa aggggcaggg aggggacatc tgagggcttc 11401 acagggctgc aaagggtaca gggattgcac cagggcagaa caggggaggg tgttcaagga 11461 agagtggctc ttagcagagg cactttggaa ggtgtgaggc ataaatgctt ccttctacgt 11521 aggccaacct caaaactttc agtaggaatg ttgctatgat caagttgttc taacacttta 11581 gacttagtag taattatgaa cctcacatag aaaaatttca tccagccata tgcctgtgga 11641 gtggaatatt ctgtttagta gaaaaatcct ttagagttca gctctaacca gaaatcttgc 11701 tgaagtatgt cagcaccttt tctcaccctg gtaagtacag tatttcaaga gcacgctaag 11761 ggtggttttc attttacagg gctgttgatg atgggttaaa aatgttcatt taagggctac 11821 ccccgtgttt aatagatgaa caccacttct acacaaccct ccttggtact gggggaggga 11881 gagatctgac aaatactgcc cattccccta ggctgactgg atttgagaac aaatacccac 11941 ccatttccac catggtatgg taacttctct gagcttcagt ttccaagtga atttccatgt 12001 aataggacat tcccattaaa tacaagctgt ttttactttt tcgcctccca gggcctgtgc 12061 gatctggtcc cccagcctct cttgggcttt cttacactaa ctctgtacct accatctcct 12121 gcctccctta ggcaggcacc tccaaccacc acacactccc tgctgttttc cctgcctgga 12181 actttcccac cagccccacc aagatcattt catccagtcc tgagctcagc ttaagggagg 12241 cttcttgcct gtgggttccc tcacccccat gcctgtcctc caggctgggg caggttctta 12301 gtttgcctgg aattgttctg tacctctttg tagcacgtag tgttgtgaaa ctaagccact 12361 aattgagttt ctggctcccc tcctggggtt gtaagttttg ttcattcatg agggccgact 12421 gtatttcctg gttactgtat cccagtgacc agccacagga gatgtccaat aaagtatgtg 12481 atgaaatggt cttaaaaaaa aaaaaaaaaa aaaaa // LOCUS HSMKLP1 3258 bp RNA PRI 25-JUL-1993 DEFINITION H.sapiens mRNA for mitotic kinesin-like protein-1. ACCESSION X67155 S46300 NID g34671 KEYWORDS ATPase; kinesin-like protein; microtubule-associated protein; mitotic protein; MKLP-1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3258) AUTHORS Nislow,C.E. TITLE Direct Submission JOURNAL Submitted (03-JUL-1992) C.E. Nislow, University of Colorado at Boulder, Dept MCD Biology, Campus Box 347, Boulder CO 80309, USA REFERENCE 2 (bases 1 to 3258) AUTHORS Nislow,C., Lombillo,V.A., Kuriyama,R. and McIntosh,J.R. TITLE A plus-end-directed motor enzyme that moves antiparallel microtubules in vitro localizes to the interzone of mitotic spindles JOURNAL Nature 359 (6395), 543-547 (1992) MEDLINE 93024924 FEATURES Location/Qualifiers source 1..3258 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="HeLa cells" gene 77..2959 /gene="MKLP-1" CDS 77..2959 /gene="MKLP-1" /codon_start=1 /product="mitotic kinase-like protein-1" /db_xref="PID:g34672" /db_xref="SWISS-PROT:Q02241" /translation="MLSARAKTPRKPTVKKGPKRTLKTQLGYCRVRLGFPDQECCIEV INNTTVQLHTPEGYRLNRNGDYKETQYSFKQVFGTHTTQKELFDVVANPLVNDLIHGK NGLLFTYGVTGSGKTHTMTGSPGEGGLLPRCLDMIFNSIGSFQAKRYVFKSNDRNSMD IQCEVDALLERQKREAMPNPKTSSSKRQVDPEFADMITVQEFCKAEEVDEDSVYGVFV SYIEIYNNYIYDLLEEVPFDPINPNLHNLNCFVKIKNHNMYVAGCTEVEVKSTEEAFE VFWRGQKKRRIANTHLNRESSRSHSVFNIKLVQAPLDADGDNVLQEKEQITISQLSLV DLAGSERTNRTRAEGNRLREAGNINQSLMTLRTCMDVLRENQMYGTNKMVPYRDSKLT HLFKNYFDGEGKVRMIVCVNPKAEDYEENLQVMRFAEVTQEVEVARPVDKAICGLTPG RRYRNQPRGPVGNEPLVTDVVLQSFPPLPSCEILDINDEQTLPRLIEALEKRHNLRQM MIDEFNKQSNAFKALLQEFDNAVLSKENHMQGKLNEKEKMISGQKLEIERLEKKNKTL EYKIEILEKTTTIYEEDKRNLQQELETQNQKLQRQFSEKRRLEARLQGMVTETTMKWE KECERRVAAKQLEMQNKLWVKDEKLKQLKAIVTEPKTEKPERPSRERDREKVTQRSVS PSPVPLLFQPDQNAPPIRLRHRRSRSAGDRWVDHKPASNMQTETVMQPHVPHAITVSV ANEKALAKCEKYMLTHQELASDGEIETKLIKGDIYKTRGGGQSVQFTDIETLKQESPN GSRKRRSSTVAPAQPDGAESEWTRCRNKVFCGCEMRAGSQLDLISASRHNPSAKSHET DSPSTERTFSFVWMISRKPCQKQSSRSSCRTPALVENHGPQLHHTLTQNKAFPMVPKT TSIQQTLYSVCFAIFNINSRGRLLFSSLYEFFIMFFLKYISCILIN" BASE COUNT 1111 a 598 c 702 g 847 t ORIGIN 1 ttcgtgatgg attcagtact cctcaaccac tcttcctaat gattggaaca aaagaaaaaa 61 aaaagaaaaa aaagccatgt tgtcagcgag agctaagaca ccccggaaac ctaccgtgaa 121 aaagggtccc aaacgaacct taaagaccca gttgggatac tgtagggtgc gactgggctt 181 tcctgatcaa gagtgttgca tagaagtgat caataataca actgttcagc ttcatactcc 241 tgagggctac agactcaacc gaaatggaga ctataaggag actcagtatt catttaaaca 301 agtatttggc actcacacca cccagaagga actctttgat gttgtggcta atcccttggt 361 caatgacctc attcatggca aaaatggtct tctttttaca tatggtgtga cgggaagtgg 421 aaaaactcac acaatgactg gttctccagg ggaaggaggg ctgcttcctc gttgtttgga 481 catgatcttt aacagtatag ggtcatttca agctaaacga tatgttttca aatctaatga 541 taggaatagt atggatatac agtgtgaggt tgatgcctta ttagaacgtc agaaaagaga 601 agctatgccc aatccaaaga cttcttctag caaacgacaa gtagatccag agtttgcaga 661 tatgataact gtacaagaat tctgcaaagc agaagaggtt gatgaagata gtgtctatgg 721 tgtatttgtc tcttatattg aaatatataa taattacata tatgatctat tggaagaggt 781 gccgtttgat cccataaacc caaacctcca caatctaaat tgcttcgtga agattaagaa 841 ccataacatg tatgttgcag gatgtacaga agttgaagtg aaatctactg aggaggcttt 901 tgaagttttc tggagaggcc agaaaaagag acgtattgct aatacccatt tgaatcgtga 961 gtccagccgt tcccatagcg tgttcaacat taaattagtt caggctccct tggatgcaga 1021 tggagacaat gtcttacagg aaaaagaaca aatcactata agtcagttgt ccttggtaga 1081 tcttgctgga agtgaaagaa ctaaccggac cagagcagaa gggaacagat tacgtgaagc 1141 tggtaatatt aatcagtcac taatgacgct aagaacatgt atggatgtcc taagagagaa 1201 ccaaatgtat ggaactaaca agatggttcc atatcgagat tcaaagttaa cccatctgtt 1261 caagaactac tttgatgggg aaggaaaagt gcggatgatc gtgtgtgtga accccaaggc 1321 tgaagattat gaagaaaact tgcaagtcat gagatttgcg gaagtgactc aagaagttga 1381 agtagcaaga cctgtagaca aggcaatatg tggtttaacg cctgggagga gatacagaaa 1441 ccagcctcga ggtccagttg gaaatgaacc attggttact gacgtggttt tgcagagttt 1501 tccacctttg ccgtcatgcg aaattttgga tatcaacgat gagcagacac ttccaaggct 1561 gattgaagcc ttagagaaac gacataactt acgacaaatg atgattgatg agtttaacaa 1621 acaatctaat gcttttaaag ctttgttaca agaatttgac aatgctgttt taagtaaaga 1681 aaaccacatg caagggaaac taaatgaaaa ggagaagatg atctcaggac agaaattgga 1741 aatagaacga ctggaaaaga aaaacaaaac tttagaatat aagattgaga ttttagagaa 1801 aacaactact atctatgagg aagataaacg caatttgcaa caggaacttg aaactcagaa 1861 ccagaaactt cagcgacagt tttctgagaa acgcagatta gaagccaggt tgcaaggcat 1921 ggtgacagaa acgacaatga agtgggagaa agaatgtgag cgtagagtgg cagccaaaca 1981 gctggagatg cagaataaac tctgggttaa agatgaaaag ctgaaacaac tgaaggctat 2041 tgttactgaa cctaaaactg agaagccaga gagaccctct cgggagcgag atcgagaaaa 2101 agttactcaa agatctgttt ctccatcacc tgtgccttta ctctttcaac ctgatcagaa 2161 cgcaccacca attcgtctcc gacacagacg atcacgctct gcaggagaca gatgggtaga 2221 tcataagccc gcctctaaca tgcaaactga aacagtcatg cagccacatg tccctcatgc 2281 catcacagta tctgttgcaa atgaaaaggc actagctaag tgtgagaagt acatgctgac 2341 ccaccaggaa ctagcctccg atggggagat tgaaactaaa ctaattaagg gtgatattta 2401 taaaacaagg ggtggtggac aatctgttca gtttactgat attgagactt taaagcaaga 2461 atcaccaaat ggtagtcgaa aacgaagatc ttccacagta gcacctgccc aaccagatgg 2521 tgcagagtct gaatggacgc gatgtagaaa caaggtgttc tgtggctgtg agatgagagc 2581 aggatcccag ctggacctga tatcagcatc acggcacaac ccaagcgcaa aaagccatga 2641 aactgacagt cccagtactg aaagaacatt ttcatttgtg tggatgattt ctcgaaagcc 2701 atgccagaag cagtcttcca ggtcatcttg tagaactcca gctttggttg aaaatcacgg 2761 acctcagcta catcatacac tgacccagaa taaagctttc cctatggttc caaagacaac 2821 tagtattcaa caaaccttgt atagtgtatg ttttgccata tttaatatta atagcagagg 2881 aagactcctt ttttcatcac tgtatgaatt ttttataatg ttttttttaa aatatatttc 2941 atgtatactt ataaactaat tcacacaagt gtttgtctta gatgattaag gaagactata 3001 tctagatcat gtctgatttt ttattgtgac ttctccagcc ctggtctgaa tttcttaagg 3061 ttttataaac aaatgctgct atttattagc tgcaagaatg cactttagaa ctatttgaca 3121 attcagactt tcaaaataaa gatgtaaatg actggccaat aataaccatt ttaggaaggt 3181 gttttgaatt ctgtatgtat atattcactt tctgacattt agatatgcca aaagaattaa 3241 aatcaaaagc actaaggg // LOCUS HSMLC 836 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for myosin alkali light chain. ACCESSION X13955 NID g34673 KEYWORDS myosin; myosin alkali light chain; myosin light chain. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 836) AUTHORS Arnold,H.H., Lohse,P., Seidel,U. and Bober,E. TITLE A novel human myosin alkali light chain is developmentally regulated. Expression in fetal cardiac and skeletal muscle and in adult atria JOURNAL Eur. J. Biochem. 178 (1), 53-60 (1988) MEDLINE 89078413 FEATURES Location/Qualifiers source 1..836 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="skeletal muscle" /clone_lib="lambda gt11" /clone="GT14" CDS 57..650 /note="myosin alkali light chain (AA 1-197)" /codon_start=1 /db_xref="PID:g34674" /db_xref="SWISS-PROT:P12829" /translation="MAPKKPEPKKEAAKPAPAPAPAPAPAPAPAPEAPKEPAFDPKSV KIDFTADQIEEFKEAFSLFDRTPTGEMKITYGQCGDVLRALGQNPTNAEVLRVLGKPK PEEMNVKMLDFETFLPILQHISRNKEQGTYEDFVEGLRVFDKESNGTVMGAELRHVLA TLGEKMTEAEVEQLLAGQEDANGCINYEAFVKHIMSG" BASE COUNT 199 a 238 c 232 g 167 t ORIGIN 1 cagtctctcg gtttcttctc agatcactcc tctgccaaag atcccaacaa gacaacatgg 61 ctcccaagaa gcctgagcct aagaaggagg cagccaagcc agctccagct ccagctccag 121 cccctgcacc agcccctgcc ccagctcctg aggctcccaa ggaacctgcc tttgacccca 181 agagtgtaaa gatagacttc actgccgacc agattgaaga gttcaaagag gccttttcat 241 tgtttgaccg gaccccgact ggagagatga agatcaccta cggccagtgc ggggatgtac 301 tgcgggccct gggccagaac cctaccaatg ccgaggtgct gcgtgtgctg ggcaagccca 361 agcctgaaga gatgaatgtc aagatgctgg actttgagac gttcttgccc atcctgcagc 421 acatttcccg caacaaggag cagggcacct atgaggactt cgtggagggc ctgcgtgtct 481 ttgacaagga gagcaatggc acggtcatgg gtgctgagct tcggcacgtc cttgccaccc 541 tgggagagaa gatgactgag gctgaagtgg agcagctgtt agctgggcaa gaggatgcca 601 atggctgcat caattatgaa gcctttgtca agcacatcat gtcagggtga agcagagtct 661 tccaggtgcc tggcccttgg ctttagccat accagggtga gttaaagaga ggccccggct 721 gggtgagctg agatggagtc ctcgacttat caccacacca ctgccccaag gaccttacag 781 gccctccctg ttaataaaca gctctaacac ggccaggctg ggctctggga ttctga // LOCUS HSMLC1 742 bp RNA PRI 12-SEP-1993 DEFINITION Human myosin alkali light chain mRNA. ACCESSION X16434 NID g34675 KEYWORDS myosin; myosin alkali light chain; myosin light chain. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 742) AUTHORS Starzinski-Powitz,A. TITLE Direct Submission JOURNAL Submitted (14-SEP-1989) Starzinski-Powitz A., Institut f Genetik, Forschungszentrum, Universitaet zu Koeln, Zuelpiicher Str 47, D-5000 Koeln 1, F R G REFERENCE 2 (bases 1 to 742) AUTHORS Zimmermann,K. and Starzinski-Powitz,A. TITLE A novel isoform of myosin alkali light chain isolated from human muscle cells JOURNAL Nucleic Acids Res. 17 (24), 10496 (1989) MEDLINE 90098889 FEATURES Location/Qualifiers source 1..742 /organism="Homo sapiens" /db_xref="taxon:9606" /haplotype="22XX" /tissue_type="muscle" /cell_type="myotube" /cell_line="primary human skeletal muscle culture" /clone_lib="MEH5-lambda-gt11 cDNA" /clone="Hemp.2" CDS 12..638 /note="myosin alkali light chain (AA 1-208)" /codon_start=1 /db_xref="PID:g34676" /db_xref="SWISS-PROT:P14649" /translation="MPPKKDVPVKKPAGPSISKPAAKPAAAGAPPAKTKAEPAVPQAP QKTQEPPVDLSKVVIEFNKDQLEEFKEAFELFDRVGDGKILYSQCGDVMRALGQNPTN AEVLKVLGNPKSDELKSRRVDFETFLPMLQAVAKNRGQGTYEDYLEGFRVFDKEGNGK VMGAELRHVLTTLGEKMTEEEVETVLAGHEDSNGCINYEAFLKHILSV" BASE COUNT 185 a 203 c 228 g 126 t ORIGIN 1 cgccggacat catgcctccc aagaaggatg tgcccgtgaa gaaaccagca gggccctcca 61 tctccaaacc tgctgctaag ccagcagcag caggggctcc tccagccaag accaaagctg 121 agccagctgt cccccaggcc cctcagaaaa cccaggagcc tccagtcgat ctctccaaag 181 tggtgatcga gtttaacaag gaccagctgg aggagttcaa ggaggccttc gagctgtttg 241 accgagtggg ggatggcaag atcctgtaca gccagtgtgg ggacgtgatg agggccctgg 301 gccagaaccc caccaacgcc gaggtgctca aggtcctggg gaaccccaag agtgatgagc 361 tgaagtcgcg gcgtgtggac tttgagactt tcctgcccat gctccaggca gtggccaaga 421 accgaggcca aggcacatat gaggactact tggaggggtt tcgtgtgttt gacaaggagg 481 ggaacgggaa agtcatggga gcagagctga gacatgttct caccaccctt ggagagaaga 541 tgactgagga ggaggtggag accgttctgg caggacacga ggacagcaac ggctgcatca 601 actacgaggc cttcttgaaa cacatcctaa gcgtctgagt gctgcagatc cagtggggtc 661 cggacactgg gccccgcagg cgaaagcacg ttccagccac caggaggcca cctattgttt 721 caaaataaag actgggttcc tc // LOCUS HSMLC2 498 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for ventricular myosin light chain 2. ACCESSION X14332 NID g34686 KEYWORDS ATPase; myosin; ventricular myosin light chain 2. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 498) AUTHORS Dalla Libera,L. TITLE Direct Submission JOURNAL Submitted (03-FEB-1989) Dalla Libera L., CNR Unit for Muscle Biology and Physiopathology, Institute of General Pathology, Via Loredan 16, 35131 Padova, Italy REFERENCE 2 (bases 1 to 498) AUTHORS Dalla Libera,L., Hoffmann,E., Floroff,M. and Jackowski,G. TITLE Isolation and nucleotide sequence of the cDNA encoding human ventricular myosin light chain 2 JOURNAL Nucleic Acids Res. 17 (6), 2360 (1989) MEDLINE 89202052 COMMENT Data kindly reviewed (10th April 1989) by Dalla Libera L. FEATURES Location/Qualifiers source 1..498 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="heart" /cell_type="myocardial cell" /clone_lib="ventricular cDNA" /clone="PCD HVLC 2" CDS 1..498 /note="ventricular myosin light chain 2 (AA 1-165)" /codon_start=1 /db_xref="PID:g34687" /db_xref="SWISS-PROT:P10916" /translation="MAPKKAKKRAGGANSNVFSMFEQTQIQEFKEAFTIMDQNRDGFI DKNDLRDTFAALRVNVKNEEIDEMIKEAPGPINFTVFLTMFGEKLKGADPEETILNAF KVFDPEGKGVLKADYVREMLTTQAERFSKEEVDQMFAAFPPDVTGNLDYKNLVHIITH GEEKD" BASE COUNT 148 a 115 c 135 g 100 t ORIGIN 1 atggcaccta agaaagcaaa gaagagagcc gggggcgcca actccaacgt gttctccatg 61 ttcgaacaga cccaaatcca ggaatttaag gaggccttca ctatcatgga ccagaacagg 121 gatggcttca ttgacaagaa cgatctgaga gacacctttg ctgcccttcg agtgaacgtg 181 aaaaatgaag aaattgatga aatgatcaag gaggctccgg gtccaattaa ctttactgtg 241 ttcctcacaa tgtttgggga gaaacttaag ggagcggacc ctgaggaaac cattctcaac 301 gcattcaaag tgtttgaccc tgaaggcaaa ggggtgctga aggctgatta cgttcgggaa 361 atgctgacca cgcaggcgga gaggttttcc aaggaggagg ttgaccagat gttcgccgcc 421 ttcccccctg acgtgactgg caacttggac tacaagaacc tggtgcacat catcacccac 481 ggagaagaga aggactag // LOCUS HSMLC3F 835 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for myosin light chain 3 (MLC-3f). ACCESSION X05451 Y00352 NID g34688 KEYWORDS MLC-3f gene; myosin; myosin light chain 3. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 835) AUTHORS Arnold,H.H. TITLE Direct Submission JOURNAL Submitted (19-JUN-1987) Arnold H.H., Dept. of Toxicology, University of Hamburg, Grindelallee 117, D - 2000 Hamburg 13 REFERENCE 2 (bases 1 to 835) AUTHORS Seidel,U., Bober,E., Winter,B., Lenz,S., Lohse,P. and Arnold,H.H. TITLE The complete nucleotide sequences of cDNA clones coding for human myosin light chains 1 and 3 JOURNAL Nucleic Acids Res. 15 (12), 4989 (1987) MEDLINE 87259977 FEATURES Location/Qualifiers source 1..835 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="fetal muscle" CDS 54..506 /note="MLC-3 (AA 1 - 150)" /codon_start=1 /db_xref="PID:g34689" /db_xref="SWISS-PROT:P06741" /translation="MSFSADQIAEFKEAFLLFDRTGDSKITLSQVGDVLRALGTNPTN AEVRKVLGNPSNEELNAKKIEFEQFLPMMQAISNNKDQATYEDFVEGLRVFDKEGNGT VMGAELRHVLATLGEKMKEEEVEALMAGQEDSNGCINYEAFVKHIMSI" BASE COUNT 257 a 196 c 169 g 213 t ORIGIN 1 gagcctaact actcttagcc ttccctgctg tgttgctgcc cagccgctcc atcatgtcct 61 tcagtgctga ccagattgct gaattcaagg aggcatttct cctctttgac agaacaggtg 121 attccaagat caccttaagc caggtcggtg atgtccttcg agccctgggc acaaatccca 181 ccaatgcaga ggtcaggaaa gttctgggaa accccagcaa tgaagagctg aatgccaaga 241 aaattgagtt tgaacaattt ctgcctatga tgcaagccat ttccaacaac aaggaccagg 301 ccacctatga agactttgtt gagggtctgc gtgtctttga caaggaaggc aatggcacag 361 tcatgggtgc tgaactccgc catgttctag ccaccctggg tgaaaagatg aaagaggaag 421 aagtggaagc cctgatggca ggtcaagaag actccaatgg ctgcatcaac tacgaagctt 481 ttgtcaagca catcatgtct atctgaatgg agctctcaag aacaagcatt gtttaggaag 541 actggctgga aacttatttt aatcacaccc atgacaaact ctccagatct gtttaccatc 601 attcaggaaa acaaagcaat ctggacggtt caagactgag caactccctg aatttttata 661 catcttcagt ttttctctga attgaattca taccacacaa acaaatgtct gctgctctag 721 atgagaagaa taaaatattg acaatctcaa atccaagcag ccttctttat tatctaccat 781 gaatcaacga aacattctta aaacaataaa tcaataaaca attttggtca gtctg // LOCUS HSMLCK 3426 bp RNA PRI 11-APR-1996 DEFINITION H.sapiens mRNA for myosin light chain kinase. ACCESSION X85337 NID g1262344 KEYWORDS myosin light chain kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3426) AUTHORS Potier,M.C., Chelot,E., Pekarsky,Y., Gardiner,K., Rossier,J. and Turnell,W.G. TITLE The human myosin light chain kinase (MLCK) from hippocampus: cloning, sequencing, expression, and localization to 3qcen-q21 JOURNAL Genomics 29 (3), 562-570 (1995) MEDLINE 96121365 REFERENCE 2 (bases 1 to 3426) AUTHORS Potier,M. TITLE Direct Submission JOURNAL Submitted (13-MAR-1995) M. Potier, Centre National de la Recherche Scientifique, Institut Alfred Fessard, C.N.R.S., 91198 Gif-sur-Yvette Cedex, FRANCE FEATURES Location/Qualifiers source 1..3426 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="hippocampus" /clone_lib="Clontech human hippocampus cDNA" /chromosome="3" /map="qcen-q21" mRNA 1..3354 gene 382..3357 /gene="MLCK" CDS 382..3357 /gene="MLCK" /codon_start=1 /product="myosin light chain kinase" /db_xref="PID:e155057" /db_xref="PID:g1262345" /translation="MDFRANLQRQMKPKTVSEEERKVHSPQQVDFRSVLAKKGTSKTP VPEKVPPPKPATPDFRSVLGGKKKLPAENGSSSAETLNAKAVESSKPLSNAQPSGALK PVGNAKPAETLKPMGNAKPAETLEAHGNAKPDENLKSASKEELKKDVKNDVNCKRGHA GTTDNEKRSESQGTAPAFKQKLQDVHVAEGKKLLLQCQVSSDPPATIIWTLNGKTLKT TKFIILSQEGSLCSVSIEKALPEDRGLYKCVAKNDAGQAECSCQVTVDDAPASENTKA PEMKSRRPKSSPPPVLGTESDATVKKKPAPKTPPKAAMPPQIIQFPEDQKVRAGESVE LFGKVTGTQPITCTWMKFRKQIQESEHMKVENSENGSKLTILAGRQEHCGCYTLLVEN KSGSRQAQVNLSVVDKPDPPAGTPCASDIRSSSLTLSWYGSSYDGGSAVQSYSIEIWD SANKTWKELATCRSTSFNVQDLLPDHEYKFRVRAINVYGTSEPSQESELTTVGEKPEE PKDEVEVSDDDEKEPEVDYRTVTINTEQKVSDFYDIEERLGSGKFGQCFRLVEKKTRK VWAGKFFKAYSAKEKENIRQEISIMNCLHHPKLVQCVDAFEEKANIVMVLEIVSGGEL FERIIDEDFELTERECTKYMRQISEGVEYIHKQGIVHLDLKPENIMCVNKTGTRIKLI DFGLPRRLENAGSLKVLFGTPEFVAPEVINYEPIRYATDMWSIRVICYILVSGPFPFM GDNDNETLANVTSATWDFDDEAFDEISDDAKDFISNLLKKDMKNRLDLAQCLQHPWLM KDTKNMEAKKLSKDRMKKYMARRKWQKTGNAVRAIGRLSSMAMISGLSGRKSSTGSPT SPLNAEKLESEDVSQAFLEAVAEEKPHVKPYFSKTIRDLEVVEGSAARFDCKIEGYPD PEVVWFKDDQSIRESRHFQIDYDEDGNCSLIISDVCGDDDAKYTCKAVNSLGEATCTA ELIVETMEEGEGEGEEEEE" BASE COUNT 965 a 857 c 987 g 617 t ORIGIN 1 ttcaggaacc gggttggcga atcgagttgc caggtgtcac tgatgctaca gaacagctct 61 gccagcagcc ttccacgggg gagggagcct gccagctgcg aggacctctg tggtggagga 121 gttggtgctg atggtggtgg tagtgaccgc tatgggtccc tgaggcctgg ctggccagca 181 agagggcagg gttggctaga ggaggaagac ggcgaggacg tgcgaggggt gctgaagagg 241 cgcgtggaga cgaggcagcc aactgaggag gcgatccccg agcaggaggt ggagcagctg 301 gacttccgag acctcctggg gaagaaggtg agtacaaaga ccctatcgga agacgacctg 361 aaggagatcc cagccgagca gatggatttc cgcgccaacc tgcagcggca aatgaagcca 421 aagactgtgt ctgaggaaga gaggaaggtg cacagccccc agcaggtcga ttttcgctct 481 gtcctggcca agaaggggac ttccaagacc cccgtgcctg agaaggtgcc accgccaaaa 541 cctgccaccc cggattttcg ctcagtgctg ggtggcaaga agaaattacc agcagagaat 601 ggcagcagca gtgccgagac cctgaatgcc aaggcagtgg agagttccaa gcccctgagc 661 aatgcacagc cttcaggcgc cttgaaaccc gtgggcaacg ccaagcctgc tgagaccctg 721 aagccaatgg gcaacgcaaa gcctgccgag acccttgaag cccatggcaa tgccaagcct 781 gatgagaacc tgaaatccgc tagcaaagaa gaactcaaga aagacgttaa gaatgatgtg 841 aactgcaaga gaggccatgc agggaccaca gataatgaaa agagatcaga gagccagggg 901 acagccccag ccttcaagca gaagctgcaa gatgttcatg tggcagaggg caagaagctg 961 ctgctccagt gccaggtgtc ttctgacccc ccagccacca tcatctggac gctgaatgga 1021 aagaccctca agaccaccaa gttcatcatc ctctcccagg aaggctcact ctgctccgtc 1081 tccatcgaga aggcactgcc tgaggacaga ggcttataca agtgtgtagc caagaatgac 1141 gctggccagg cggagtgctc ctgccaagtc accgtggatg atgctccagc cagtgagaac 1201 accaaggccc cagagatgaa atcccggagg cccaagagct ctcctcctcc cgtgctagga 1261 actgagagtg atgcgactgt gaaaaagaaa cctgccccca agacacctcc gaaagcagca 1321 atgccccctc agatcatcca gttccctgag gaccagaagg tacgcgcagg agagtcagtg 1381 gagctgtttg gcaaagtgac aggcactcag cccatcacct gtacctggat gaagttccga 1441 aagcagatcc aggaaagcga gcacatgaag gtggagaaca gcgagaatgg cagcaagctc 1501 accatccttg ccgggcgcca ggagcactgc ggctgctaca cactgctggt ggagaacaag 1561 tcgggcagca ggcaggccca ggtcaaccta tctgtcgtgg ataagccaga ccccccagct 1621 ggcacacctt gtgcctctga cattcggagc tcctcactga ccctgtcctg gtatggctcc 1681 tcatatgatg ggggcagtgc tgtacagtcc tacagcatcg agatctggga ctcagccaac 1741 aagacgtgga aggaactagc cacatgccgc agcacctctt tcaacgttca ggacctgctg 1801 cctgaccacg aatataagtt ccgtgtacgt gcaatcaacg tgtatggaac cagtgagcca 1861 agccaggagt ctgaactcac aacggtagga gagaaacctg aagagccgaa ggatgaagtg 1921 gaggtgtcag acgatgatga gaaggagccc gaggttgatt accggacagt gacaatcaat 1981 actgaacaaa aagtatctga cttctacgac attgaggaga gattaggatc tgggaaattt 2041 ggacagtgct ttcgacttgt agaaaagaaa actcgaaaag tctgggcagg gaagttcttc 2101 aaggcatatt cagcaaaaga gaaagagaat atccggcagg agattagcat catgaactgc 2161 ctccaccacc ctaagctggt ccagtgtgtg gatgcctttg aagaaaaggc caacatcgtc 2221 atggtcctgg agatcgtgtc aggaggggag ctgtttgagc gcatcattga cgaggacttt 2281 gagctgacgg agcgtgagtg caccaagtac atgcggcaga tctcggaggg agtggagtac 2341 atccacaagc agggcatcgt gcacctggac ctcaagccgg agaacatcat gtgtgtcaac 2401 aagacgggca ccaggatcaa gctcatcgac tttggtctgc cgaggaggct ggagaacgcg 2461 gggtctctga aggtcctctt tggcacccca gaatttgtgg ctcctgaagt gatcaactat 2521 gagcccatcc ggtacgccac agacatgtgg agcatcaggg tcatctgcta catcctagtc 2581 agtggcccct tccccttcat gggagacaac gataacgaaa ccttggccaa cgttacctca 2641 gccacctggg acttcgacga cgaggcattc gatgagatct ccgacgatgc caaggatttc 2701 atcagcaatc tgctgaagaa agatatgaaa aaccgcctgg acctggcgca gtgccttcag 2761 catccatggc taatgaaaga taccaagaac atggaggcca agaaactctc caaggaccgg 2821 atgaagaagt acatggcaag aaggaaatgg cagaaaacgg gcaatgctgt gagagccatt 2881 ggaagactgt cctctatggc aatgatctca gggctcagtg gcaggaaatc ctcaacaggg 2941 tcaccaacca gcccgctcaa tgcagaaaaa ctagaatctg aagatgtgtc ccaagctttc 3001 cttgaggctg ttgctgagga aaagcctcat gtaaaaccct atttctctaa gaccattcgc 3061 gatttagaag ttgtggaggg aagtgctgct agatttgact gcaagattga aggataccca 3121 gaccccgagg ttgtctggtt caaagatgac cagtcaatca gggagtcccg ccacttccag 3181 atagactacg atgaggacgg gaactgctct ttaattatta gtgatgtttg cggggatgac 3241 gatgccaagt acacctgcaa ggctgtcaac agtcttggag aagccacctg cacagcagag 3301 ctcattgtgg aaacgatgga ggaaggtgaa ggggaagggg aagaggaaga agagtgaaac 3361 aaagccagag aaaagcagtt tctaagtcat attaaaagga ctatttctct aaaactcaaa 3421 aaaaaa // LOCUS HSMLN50 3846 bp RNA PRI 16-SEP-1997 DEFINITION H.sapiens MLN50 mRNA. ACCESSION X82456 NID g2407912 KEYWORDS Lasp-1; MLN 50 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3846) AUTHORS Tomasetto,C. TITLE Direct Submission JOURNAL Submitted (02-NOV-1994) C. Tomasetto, I.G.B.M.C., CNRS UPR 6520, INSERM U. 184-ULP, BP 163 67404 Illkirch, FRANCE REFERENCE 2 (bases 1 to 3846) AUTHORS Tomasetto,C., Regnier,C., Moog-Lutz,C., Mattei,M.G., Chenard,M.P., Lidereau,R., Basset,P. and Rio,M.C. TITLE Identification of four novel human genes amplified and overexpressed in breast carcinoma and localized to the q11-q21.3 region of chromosome 17 JOURNAL Genomics 28 (3), 367-376 (1995) MEDLINE 96039245 FEATURES Location/Qualifiers source 1..3846 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="breast derived metastatic lymph mode" /chromosome="17" gene 76..861 /gene="MLN 50, Lasp-1" CDS 76..861 /gene="MLN 50, Lasp-1" /codon_start=1 /product="LIM and SH3 domain protein" /db_xref="PID:e347901" /db_xref="PID:g2407913" /translation="MNPNCARCGKIVYPTEKVNCLDKFWHKACFHCETCKMTLNMKNY KGYEKKPYCNAHYPKQSFTMVADTPENLRLKQQSELQSQVRYKEEFEKNKGKGFSVVA DTPELQRIKKTQDQISNIKYHEEFEKSRMGPSGGEGMEPERRDSQDGSSYRRPLEQQQ PHHIPTSAPVYQQPQQQPVAQSYGGYKEPAAPVSIQRSAPGGGGKRYRAVYDYSAADE DEVSFQDGDTIVNVQQIDDGWMYGTVERTGDTGMLPANYVEAI" polyA_signal 3828..3833 BASE COUNT 793 a 1078 c 1076 g 899 t ORIGIN 1 gcctcccgcc agctcgcctc ggggaacagg acgcgcgtga gctcaggcgt ccccgcccca 61 gcttttctcg gaaccatgaa ccccaactgc gcccggtgcg gcaagatcgt gtatcccacg 121 gagaaggtga actgtctgga taagttctgg cataaagcat gcttccattg cgagacctgc 181 aagatgacac tgaacatgaa gaactacaag ggctacgaga agaagcccta ctgcaacgca 241 cactacccca agcagtcctt caccatggtg gcggacaccc cggaaaacct tcgcctcaag 301 caacagagtg agctccagag tcaggtgcgc tacaaggagg agtttgagaa gaacaagggc 361 aaaggtttca gcgtagtggc agacacgccc gagctccaga gaatcaagaa gacccaggac 421 cagatcagta atataaaata ccatgaggag tttgagaaga gccgcatggg ccctagcggg 481 ggcgagggca tggagccaga gcgtcgggat tcacaggacg gcagcagcta ccggcggccc 541 ctggagcagc agcagcctca ccacatcccg accagtgccc cggtttacca gcagccccag 601 cagcagccgg tggcccagtc ctatggtggc tacaaggagc ctgcagcccc agtctccata 661 cagcgcagcg ccccaggtgg tggcgggaag cggtaccgcg cggtgtatga ctacagcgcc 721 gccgacgagg acgaggtctc cttccaggac ggggacacca tcgtcaacgt gcagcagatc 781 gacgacggct ggatgtacgg gacggtggag cgcaccggcg acacggggat gctgccggcc 841 aactacgtgg aggccatctg aacccggagc gcccccatct gtcttcagca cattccacgg 901 catcgcatcc gtcctgggcg tgagccgtcc attcttcagt gtctctgttt tttaaaacct 961 gcgacagctt gtgattccta cccctcttcc agcttctttt gccaactgaa gccttcttct 1021 gccacttctg cgggctccct cctctggcag gcttcccccg tgatcgactt cttggttttc 1081 tctctggatg gaacgggtat gggcctctct gggggaggca gggctggaat gggagacctg 1141 ttggcctgtg ggcctcacct gcccctctgt tctctcccct cacatcctcc tgcccagctc 1201 ctcacatacc cacacattcc agggctgggg tgagcctgac tgccaggacc ccaggtcagg 1261 ggctccctac attccccaga gtgggatcca cttcttggtt cctgggatgg cgatggggac 1321 tctgccgctg tgtagggacc agtgggatgg gctctacctc tctttctcaa agagggggct 1381 ctgcccacct ggggtctctc tccctacctc cctcctcagg ggcaacaaca ggagaatggg 1441 gttcctgctg tggggcgaat tcatcccctc cccgcgcgtt ccttcgcaca ctgtgatttt 1501 gccctcctgc ccacgcagac ctgcagcggg caaagagctc ccgaggaagc acagcttggg 1561 tcaggttctt gcctttctta attttaggga cagctaccgg aaggagggga acaaggagtt 1621 ctcttccgca gcccctttcc ccacgcccac ccccagtctc cagggaccct tgcctgcctc 1681 ctaggctgga agccatggtc ccgaagtgta gggcaagggt gcctcaggac cttttggtct 1741 tcagcctccc tcagccccca ggatctgggt taggtggccg ctcctccctg ctcctcatgg 1801 gaagatgtct cagagccttc catgacctcc cctccccagc ccaatgccaa gtggacttgg 1861 agctgcacaa agtcagcagg gaccactaaa tctccaagac ctggtgtgcg gaggcaggag 1921 catgtatgtc tgcaggtgtc tgacacgcaa gtgtgtgagt gtgagtgtga gagatggggc 1981 gggggtgtgt ctgtaggtgt ctctgggcct gtgtgtgggt ggggttatgt gagggtatga 2041 agagctgtct tcccctgaga gtttcctcag aacccacagt gagaggggag ggctcctggg 2101 gcagagaagt tccttaggtt ttctttggaa tgaaattcct ccttcccccc atctctgagt 2161 ggaggaagcc caccaatctg ccctttgcag tgtgtcaggg tggaaggtaa gaggttggtg 2221 tggagttggg gctgccatag ggtctgcagc ctgctggggc taagcggtgg aggaaggctc 2281 tgtcactcca ggcatatgtt tccccatctc tgtctggggc tacagaatag ggtggcagaa 2341 gtgtcaccct gtgggtgtct ccctcggggg ctcttcccct agacctcccc ctcacttaca 2401 taaagctccc ttgaagcaag aaagagggtc ccagggctgc aaaactggaa gcacagcctc 2461 ggggatgggg agggaaagac ggtgctatat ccagttcctg ctctctgctc atgggtggct 2521 gtgacaaccc tggcctcact tgattcatct ctggttttct tgccaccctc tgggagtccc 2581 catcccattt tcatcctgag cccaaccagg ccctgccatt ggcctcttgt cccttggcac 2641 acttgtaccc acaggtgagg ggcaggacct gaaggtattg gcctgttcaa caatcagtca 2701 tcatgggtgt ttttgtcaac tgcttgttaa ttgatttggg gatgtttgcc ccgaatgaga 2761 ggttgaggaa aagactgtgg gtggggaggc cctgcctgac ccatcccttt tcctttctgg 2821 ccccagccta ggtggaggca agtggaatat cttatattgg gcgatttggg ggctcgggga 2881 ggcagagaat ctcttgggag tcttgggtgg cgctggtgca ttctgtttcc tcttgatctc 2941 aaagcacaat gtggatttgg ggaccaaagg tcagggacac atccccttag aggacctgag 3001 tttgggagag tggtgagtgg aagggaggag cagcaagaag cagcctgttt tcactcagct 3061 taattctcct tcccagataa ggcaagccag tcatggaatc ttgctgcagg ccctccctct 3121 actcttcctg tcctaaaaat aggggccgtt ttcttacaca cccccagaga gaggagggac 3181 tgtcacactg gtgctgagtg accgggggct gctgggcgtc tgttctttac caaaaccatc 3241 catccctaga agagcacaga gccctgaggg gctgggctgg gctgggctga gcccctggtc 3301 ttctctacag ttcacagagg tctttcagct catttaatcc caggaaagag gcatcaaagc 3361 tagaatgtga atataacttt tgtgggccaa tactaagaat aacaagaagc ccagtggtga 3421 ggaaagtgcg ttctcccagc actgcctcct gttttctccc tctcatgtcc ctccagggaa 3481 aatgacttta ttgcttaatt tctgcctttc ccccctcaca catgcacttt tgggcctttt 3541 tttatagctg gaaaaaacaa aataccaccc tacaaacctg tatttaaaaa gaaacagaaa 3601 tgaccacgtg aaatttgcct ctgtccaaac atttcatccg tgtgtatgtg tatgtgtgtg 3661 agtgtgtgaa gccgccagtt catcttttta tatggggttg ttgtctcatt ttggtctgtt 3721 ttggtcccct ccctcgtggg cttgtgctcg ggatcaaacc tttctggcct gttatgattc 3781 tgaacatttg acttgaacca caagtgaatc tttctcctgg tgactcaaat aaaagtataa 3841 ttttta // LOCUS HSMLN51 4253 bp RNA PRI 08-SEP-1997 DEFINITION H.sapiens MLN51 mRNA. ACCESSION X80199 NID g2385366 KEYWORDS MLN 51 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3168) AUTHORS Tomasetto,C. TITLE Direct Submission JOURNAL Submitted (25-JUL-1994) C. Tomasetto, IGBMC, BP 163, 67404 ILLKIRCH Cedex, FRANCE REFERENCE 2 (bases 1 to 3168) AUTHORS Tomasetto,C., Regnier,C., Moog-Lutz,C., Mattei,M.G., Chenard,M.P., Lidereau,R., Basset,P. and Rio,M.C. TITLE Identification of four novel human genes amplified and overexpressed in breast carcinoma and localized to the q11-q21.3 region of chromosome 17 JOURNAL Genomics 28 (3), 367-376 (1995) MEDLINE 96039245 REFERENCE 3 (bases 1 to 4253) AUTHORS Tomasetto,C. TITLE Direct Submission JOURNAL Submitted (25-JUL-1994) C. Tomasetto, IGBMC, BP 163, 67404 ILLKIRCH Cedex, FRANCE FEATURES Location/Qualifiers source 1..4253 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="breast derived metastatic lymph node" /chromosome="17" gene 234..3150 /gene="MLN 51" CDS 234..1838 /gene="MLN 51" /codon_start=1 /db_xref="PID:e339883" /db_xref="PID:g2385367" /translation="MADRRRQRASQDTEDEESGASGSDSGGSPLRGGGSCSGSAGGGG SGSLPSQRGGRTGALHLRRVESGGAKSAEESECESEDGIEGDAVLSDYESAEDSEGEE GEYSEEENSKVELKSEANDAVNSSTKEEKGEEKPDTKSTVTGERQSGDGQESTEPVEN KVGKKGPKHLDDDEDRKNPAYIPRKGLFFEHDLRGQTQEEEVRPKGRQRKLWKDEGRW EHDKFREDEQAPKSRQELIALYGYDIRSAHNPDDIKPRRIRKPRYGSPPQRDPNWNGE RLNKSHRHQGLGGTLPPRTFINRNAAGTGRMSAPRNYSRSGGFKEGRAGFRPVEAGGQ HGGRSGETVKHEISYRSRRLEQTSVRDPSPEADAPVLGSPEKEEAASEPPAAAPDAAP PPPDRPIEKKSYSRARRTRTKVGDAVKLAEEVPPPPEGLIPAPPVPETTPTPPTKTGT WEAPVDSSTSGLEQDVAQLNIAEQNWSPGQPSFLQPRELRGMPNHIHMGAGPPPQFNR MEEMLTLQISIKYLPCTKCFSTPKGR" polyA_site 3145..3150 /gene="MLN 51" BASE COUNT 1048 a 1118 c 1059 g 1027 t 1 others ORIGIN 1 gaattccgtt gctgtcgcac acacacacac acacacacac acaccccaac acacacacac 61 acaccccaac acacacacac acacacacac acacacacac acacacacac acacagcggg 121 atggccgagc gccgcacgcg tagcacgccg ggactagcta tccagcctcc cagcagcctc 181 tgcgacgggc gcggtgcgta ngtacctcgc cggtggtggc cgttctccgt aagatggcgg 241 accggcggcg gcagcgcgct tcgcaagaca ccgaggacga ggaatctggt gcttcgggct 301 ccgacagcgg cggctccccg ttgcggggag gcgggagctg cagcggtagc gccggaggcg 361 gcggcagcgg ctctctgcct tcacagcgcg gaggccgaac cggggccctt catctgcggc 421 gggtggagag cgggggcgcc aagagtgctg aggagtcgga gtgtgagagt gaagatggca 481 ttgaaggtga tgctgttctc tcggattatg aaagtgcaga agactcggaa ggtgaagaag 541 gtgaatacag tgaagaggaa aactccaaag tggagctgaa atcagaagct aatgatgctg 601 ttaattcttc aacaaaagaa gagaagggag aagaaaagcc tgacaccaaa agcactgtga 661 ctggagagag gcaaagtggg gacggacagg agagcacaga gcctgtggag aacaaagtgg 721 gtaaaaaggg ccctaagcat ttggatgatg atgaagatcg gaagaatcca gcatacatac 781 ctcggaaagg gctcttcttt gagcatgatc ttcgagggca aactcaggag gaggaagtca 841 gacccaaggg gcgtcagcga aagctatgga aggatgaggg tcgctgggag catgacaagt 901 tccgggaaga tgagcaggcc ccaaagtccc gacaggagct cattgctctt tatggttatg 961 acattcgctc agctcataat cctgatgaca tcaaacctcg aagaatccgg aaaccccgat 1021 atgggagtcc tccacaaaga gatccaaact ggaacggtga gcggctaaac aagtctcatc 1081 gccaccaggg tcttgggggc accctaccac caaggacatt tattaacagg aatgctgcag 1141 gtaccggccg tatgtctgca cccaggaatt attctcgatc tgggggcttc aaggaaggtc 1201 gtgctggttt taggcctgtg gaagctggtg ggcagcatgg tggccggtct ggtgagactg 1261 ttaagcatga gattagttac cggtcacggc gcctagagca gacttctgtg agggatccat 1321 ctccagaagc agatgctcca gtgcttggca gtcctgagaa ggaagaggca gcctcagagc 1381 caccagctgc tgctcctgat gctgcaccac caccccctga taggcccatt gagaagaaat 1441 cctattcccg ggcaagaaga actcgaacca aagttggaga tgcagtcaag cttgcagagg 1501 aggtgccccc tcctcctgaa ggactgattc cagcacctcc agtcccagaa accaccccaa 1561 ctccacctac taagactggg acctgggaag ctccggtgga ttctagtaca agtggacttg 1621 agcaagatgt ggcacaacta aatatagcag aacagaattg gagtccgggg cagccttctt 1681 tcctgcaacc acgggaactt cgaggtatgc ccaaccatat acacatggga gcaggacctc 1741 cacctcagtt taaccggatg gaagaaatgc tcactttgca aatatccatt aaatacctgc 1801 catgtaccaa gtgtttttca acacctaaag gaaggtagga cttgatatga gagccctcta 1861 gaattcttat tgtttaggcc tctttctttg tctcagggtg tccagggtgt ccagggtggt 1921 cgagccaaac gctattcatc ccagcggcaa agacctgtgc cagagccccc cgcccctcca 1981 gtgcatatca gtatcatgga gggacattac tatgatccac tgcagttcca gggaccaatc 2041 tatacccatg gtgacagccc tgccccgctg cctccacagg gcatgcttgt gcagccagga 2101 atgaaccttc cccacccagg tttacatccc catcagacac cagctcctct gcccaatcca 2161 ggcctctatc ccccaccagt gtccatgtct ccaggacagc caccacctca gcagttgctt 2221 gctcctactt acttttctgc tccaggcgtc atgaactttg gtaatcccag ttacccttat 2281 gctccagggg cactgcctcc cccaccaccg cctcatctgt atcctaatac acaggcccca 2341 tcacaggtat atggaggagt gacctactat aaccccgccc agcagcaggt gcagccaaag 2401 ccctccccac cccggaggac tccccagcca gtcaccatca agccccctcc acctgaggtt 2461 gtaagcaggg gttccagtta atacaagttt ctgaatattt taaatcttaa catcatataa 2521 aaagcagcag aggtgagaac tcagaagaga aatacagctg gctatctact accagaaggg 2581 cttcaaagat atagggtgtg gctcctacca gcaaacagct gaaagaggag gacccctgcc 2641 ttcctctgag gacaggctct agagagaggg agaaacaagt ggacctcgtc ccatcttcac 2701 tcttcacttg agttggctgt gttcggggga gcagagagag ccagacagcc ccaagcttct 2761 gagtctagat acagaagccc atgtcttctg ctgttcttca cttctgggaa attgaagtgt 2821 cttctgttcc caaggaagct ccttcctgtt tgttttgttt tctaagatgt tcatttttaa 2881 agcctggctt cttatcctta atattatttt aattttttct ctttgtttct gtttcttgct 2941 ctctctccct gcctttaaat gaaacaagtc tagtcttctg gttttctagc ccctctggat 3001 tcccttttga ctcttccgtg catcccagat aatggagaat gtatcagcca gccttcccca 3061 ccaagtctaa aaagacctgg cctttcactt ttagttggca tttgttatcc tcttgtatac 3121 ttgtattccc ttaactctaa ccctgtggaa gcatggctgt ctgcacagag ggtcccattg 3181 tgcagaaaag ctcagagtag gtgggtagga gcccttctct ttgacttagg tttttaggag 3241 tctgagcatc catcaatacc tgtactatga tgggcttctg ttctctgctg agggccaata 3301 ccctactgtg gggagagatg gcacaccaga tgcttttgtg agaaagggat ggtggagtga 3361 gagcctttgc ctttaggggt gtgtattcac atagtcctca gggctcagtc ttttgaggta 3421 agtggaatta gagggccttg cttctcttct ttccattctt cttgctacac cccttttcca 3481 gttgctgtgg accaatgcat ctctttaaag gcaaatatta tccagcaagc agtctaccct 3541 gtcctttgca attgctcttc tccacgtctt tcctgctaca agtgttttag atgttactac 3601 cttattttcc ccgaattcta tttttgtcct tgcagacaga atataaaaac tcctgggctt 3661 aaggcctaag gaagccagtc accttctggg caagggctcc tatctttcct ccctatccat 3721 ggcactaaac cacttctctg ctgcctctgt ggaagagatt cctattactg cagtacatac 3781 gtctgccagg ggtaacctgg ccactgtccc tgtccttcta cagaacctga gggcaaagat 3841 ggtggctgtg tctctccccg gtaatgtcac tgtttttatt ccttccatct agcagctggc 3901 ctaatcactc tgagtcacag gtgtgggatg gagagtgggg agaggcactt aatctgtaac 3961 ccccaaggag gaaataacta agagattctt ctaggggtag ctggtggttg tgccttttgt 4021 aggctgttcc ctttgcctta aacctgaaga tgtctcctca agcctgtggg cagcatgccc 4081 agattcccag accttaagac actgtgagag ttgtctctgt tggtccactg tgtttagttg 4141 caaggatttt tccatgtgtg gtggtgtttt ttgttactgt tttaaagggt gcccatttgt 4201 gatcagcatt gtgacttgga gataataaaa tttagactat aaacttgaaa aaa // LOCUS HSMLN62 1999 bp RNA PRI 26-JAN-1996 DEFINITION H.sapiens MLN62 mRNA. ACCESSION X80200 NID g951276 KEYWORDS MLN 62 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1999) AUTHORS Tomasetto,C. TITLE Direct Submission JOURNAL Submitted (25-JUL-1994) C. Tomasetto, IGBMC, BP 163, 67404 ILLKIRCH Cedex, FRANCE REFERENCE 2 (bases 1 to 1999) AUTHORS Tomasetto,C., Regnier,C., Moog-Lutz,C., Mattei,M.G., Chenard,M.P., Lidereau,R., Basset,P. and Rio,M.C. TITLE Identification of four novel human genes amplified and overexpressed in breast carcinoma and localized to the q11-q21.3 region of chromosome 17 JOURNAL Genomics 28 (3), 367-376 (1995) MEDLINE 96039245 REFERENCE 3 (bases 1 to 1999) AUTHORS Regnier,C.H., Tomasetto,C., Moog-Lutz,C., Chenard,M.P., Wendling,C., Basset,P. and Rio,M.C. TITLE Presence of a new conserved domain in CART1, a novel member of the tumor necrosis factor receptor-associated protein family, which is expressed in breast carcinoma JOURNAL J. Biol. Chem. 270 (43), 25715-25721 (1995) MEDLINE 96029665 COMMENT Related sequence X92346 ; Regnier C.H. et al, J.Biol.Chem. 270: 25715-25722, 1995. FEATURES Location/Qualifiers source 1..1999 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="breast derived metastatic lymph node" /chromosome="17" misc_feature 18..57 /note="RING finger domain" gene 86..1498 /gene="MLN 62, CART1" CDS 86..1498 /gene="MLN 62, CART1" /codon_start=1 /product="cystein rich domain associated to RING and TRAF protein" /db_xref="PID:g951277" /translation="MPGFDYKFLEKPKRRLLCPLCGKPMREPVQVSTCGHRFCDTCLQ EFLSEGVFKCPEDQLPLDYAKIYPDPELEVQVLGLPIRCIHSEEGCRWSGPLRHLQGH LNTCSFNVIPCPNRCPMKLSRRDLPAHLQHDCPKRRLKCEFCGCDFSGEAYESHEGMC PQESVYCENKCGARMMRGLLAQHATSECPKRTQPCTYCTKEFVFDTIQSHQYQCPRLP VACPNQCGVGTVAREDLPGHLKDSCNTALVLCPFKDSGCKHRCPKLAMARHVEESVKP HLAMMCALVSRQRQELQELRRELEELSVGSDGVLIWKIGSYGRRLQEAKAKPNLECFS PAFYTHKYGYKLQVSAFLNGNGSGEGTHLSLYIRVLPGAFDNLLEWPFARRVTFSLLD QSDPGLAKPQHVTETFHPDPNWKNFQKPGTWRGSLDESSLGFGYPKFISHQDIRKRNY VRDDAVFIRAAVELPRKILS" misc_feature 101..154 /gene="MLN 62, CART1" /note="CART domain 1" misc_feature 155..208 /gene="MLN 62, CART1" /note="CART domain 2" misc_feature 209..258 /gene="MLN 62, CART1" /note="CART domain 3" misc_feature 268..307 /gene="MLN 62, CART1" /note="alpha helix domain" misc_feature 308..470 /gene="MLN 62, CART1" /note="TRAF domain" polyA_site 1976..1981 BASE COUNT 387 a 591 c 594 g 427 t ORIGIN 1 gccgggagcg ccgctccagc gaggcgcggg ctgtggggcc gccgcgtgcc tggccccgct 61 cgcccgtgcc ggccgctcgc ccgccatgcc tggcttcgac tacaagttcc tggagaagcc 121 caagcgacgg ctgctgtgcc cactgtgcgg gaagcccatg cgcgagcctg tgcaggtttc 181 cacctgcggc caccgtttct gcgatacctg cctgcaggag ttcctcagtg aaggagtctt 241 caagtgccct gaggaccagc ttcctctgga ctatgccaag atctacccag acccggagct 301 ggaagtacaa gtattgggcc tgcctatccg ctgcatccac agtgaggagg gctgccgctg 361 gagtgggcca ctacgtcatc tacagggcca cctgaatacc tgcagcttca atgtcattcc 421 ctgccctaat cgctgcccca tgaagctgag ccgccgtgat ctacctgcac acttgcagca 481 tgactgcccc aagcggcgcc tcaagtgcga gttttgtggc tgtgacttca gtggggaggc 541 ctatgagagc catgagggta tgtgccccca ggagagtgtc tactgtgaga ataagtgtgg 601 tgcccgcatg atgcgggggc tgctggccca gcatgccacc tctgagtgcc ccaagcgcac 661 tcagccctgc acctactgca ctaaggagtt cgtctttgac accatccaga gccaccagta 721 ccagtgccca aggctgcctg ttgcctgccc caaccaatgt ggtgtgggca ctgtggctcg 781 ggaggacctg ccaggccatc tgaaggacag ctgtaacacc gccctggtgc tctgcccatt 841 caaagactcc ggctgcaagc acaggtgccc taagctggca atggcacggc atgtggagga 901 gagtgtgaag ccacatctgg ccatgatgtg tgccctggtg agccggcaac ggcaggagct 961 gcaggagctt cggcgagagc tggaggagct atcagtgggc agtgatggcg tgctcatctg 1021 gaagattggc agctatggac ggcggctaca ggaggccaag gccaagccca accttgagtg 1081 cttcagccca gccttctaca cacataagta tggttacaag ctgcaggtgt ctgcattcct 1141 caatggcaat ggcagtggtg agggcacaca cctctcactg tacattcgtg tgctgcctgg 1201 tgcctttgac aatctccttg agtggccctt tgcccgccgt gtcaccttct ccctgctgga 1261 tcagagcgac cctgggctgg ctaaaccaca gcacgtcact gagaccttcc accccgaccc 1321 aaactggaag aatttccaga agccaggcac gtggcggggc tccctggatg agagttctct 1381 gggctttggt tatcccaagt tcatctccca ccaggacatt cgaaagcgaa actatgtgcg 1441 ggatgatgca gtcttcatcc gtgctgctgt tgaactgccc cggaagatcc tcagctgagt 1501 gcaggtgggg ttcgagggga aaggacgatg gggcatgacc tcagtcaggc actggctgaa 1561 cttggagagg gggccggacc cccgtcagct gcttctgctg cctaggttct gttaccccat 1621 cctccctccc ccagccacca ccctcaggtg cctccaattg gtgcttcagc cctggcccct 1681 gtggggaaca ggtcttgggg tcatgaaggg ctggaaacaa gtgaccccag ggcctgtctc 1741 ccttcttggg tagggcagac atgccttggt gccggtcaca ctctacacgg actgaggtgc 1801 ctgctcaggt gctatgtccc aagagccata agggggtggg aattggggag ggagaaaggg 1861 tagttcaaag agtctgtctt gagatctgat tttttccccc tttacctagc tgtgccccct 1921 ctggttattt atttccttag tgccaggagg gcacagcagg ggagccctga tttttaataa 1981 atccggaatt gtatttatt // LOCUS HSMMAR 1334 bp RNA PRI 15-FEB-1993 DEFINITION H.sapiens MacMarcks mRNA. ACCESSION X70326 NID g38434 KEYWORDS actin binding protein; calmodulin binding protein; signal transduction. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1334) AUTHORS Blockx,H., Maertebs,C. and Fransen,L.M.L. TITLE cDNA and derived amino acid sequence of human maeMARCKS an LPS-inducible macrophage PKC substrate JOURNAL Unpublished REFERENCE 2 (bases 1 to 1334) AUTHORS Fransen,L.M.L. TITLE Direct Submission JOURNAL Submitted (05-FEB-1993) L.M.L. Fransen, Innogenetics N.V., Industriepark Zwijnaarde 7,, box 4, B-9052 Ghent, BELGIUM FEATURES Location/Qualifiers source 1..1334 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="monocyte" /cell_line="THP-1 (ATCC TiB 202)" gene 14..601 /gene="MacMARCKS" CDS 14..601 /gene="MacMARCKS" /codon_start=1 /db_xref="PID:g38435" /db_xref="SWISS-PROT:P49006" /translation="MGSQSSKAPRGDVTAEEAAGASPAKANGQENGHVKSNGDLSPKG EGESPPVNGTDEAAGATGDAIEPAPPSQGAEAKGEVPPKETPKKKKKFSFKKPFKLSG LSFKRNRKEGGGDSSASSPTEEEQEQGEIGACSDEGTAQEGKAAATPESQEPQAKGAE ASAASEEEAGPQATEPSTPSGPESGPTPASAEQNE" BASE COUNT 284 a 379 c 359 g 312 t ORIGIN 1 gcagaccccc atcatgggca gccagagctc caaggctccc cggggcgacg tgaccgccga 61 ggaggcagca ggcgcttccc ccgcgaaggc caacggccag gagaatggcc acgtgaaaag 121 caatggagac ttatccccca agggtgaagg ggagtcgccc cctgtgaacg gaacagatga 181 ggcagccggg gccactggcg atgccatcga gccagcaccc cctagccagg gtgctgaggc 241 caagggggag gtccccccca aggagacccc caagaagaag aagaaattct ctttcaagaa 301 gcctttcaaa ttgagcggcc tgtccttcaa gagaaatcgg aaggagggtg ggggtgattc 361 ttctgcctcc tcacccacag aggaagagca ggagcagggg gagatcggtg cctgcagcga 421 cgagggcact gctcaggaag ggaaggccgc agccacccct gagagccagg aaccccaggc 481 caagggggca gaggctagtg cagcctcaga agaagaggca gggccccagg ctacagagcc 541 atccactccc tcggggccgg agagtggccc tacaccagcc agcgctgagc agaatgagta 601 gctaggtagg ggcaggtggg tgatctctaa gctgcaaaaa ctgtgctgtc cttgtgaggt 661 cactgcctgg acctggtgcc ctggctgcct tcctgtgccc agaaaggaag gggctattgc 721 ctcctcccag ccacgttccc tttcctcctc tccctcctgt ggattctccc atcagccatc 781 tggttctcct cttaaggcca gttgaagatg gtcccttaca gcttcccaag ttaggttagt 841 gatgtgaaat gctcctgtcc ctggccctac ctccttccct gtccccaccc ctgcataagg 901 cagttgttgg ttttcttccc caattctttt ccaagtaggt tttgtttacc ctactcccca 961 aatccctgag ccagaagtgg ggtgcttata ctcccaaacc ttgagtgtcc agccttcccc 1021 tgttgttttt agtctcttgt gctgtgccta gtggcacctg ggctggggag gacactgccc 1081 cgtctaggtt tttataaatg tcttactcaa gttcaaacct ccagcctgtg aatcaactgt 1141 gtctcttttt tgacttggta agcaagtatt aggctttggg gtggggggag gtctgtaatg 1201 tgaaacaact tcttgtcttt ttttctccca ctgttgtaaa taacttttaa tggccaaacc 1261 ccagatttgt actttttttt tttttctaac tgctaaaacc attctcttcc acctggtttt 1321 actgtaacat ttgg // LOCUS HSMMP19 1811 bp RNA PRI 18-DEC-1996 DEFINITION H.sapiens mRNA for MMP-19 protein. ACCESSION X92521 NID g1731985 KEYWORDS matrix metalloproteinase; MMP-19 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1811) AUTHORS Pendas,A.M., Llano,E., Puente,X.A., Lopez-Otin,C., Knauper,V.V., Mattei,M.G., Apte,S. and Murphy,G. TITLE Identification and characterization of a novel human matrix metalloproteinase with unique structural characteristics, chromosomal location and tissue distribution JOURNAL Unpublished REFERENCE 2 (bases 1 to 1811) AUTHORS Lopez-Otin,C. TITLE Direct Submission JOURNAL Submitted (24-OCT-1995) C. Lopez-Otin, Universidad de Oviedo, Dept de Biologia Funcional, Area de Bioquimica, Facultad de Medicina, C/Julian Claveria S/N, 33006 Oviedo, SPAIN FEATURES Location/Qualifiers source 1..1811 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /clone_lib="lambda gt11" /clone="2.1" /chromosome="12" /map="12q14 (FISH mapping)" misc_feature 83..89 /note="activation locus" gene 102..1628 /gene="MMP-19" CDS 102..1628 /gene="MMP-19" /note="putative" /codon_start=1 /product="MMP-19 (matrix metalloproteinase)" /db_xref="PID:e208554" /db_xref="PID:g1731986" /translation="MNCQQLWLGFLLPMTVSGRVLGLAEVAPVDYLSQYGYLQKPLEG SNNFKPEDITEALRAFQEASELPVSGQLDDATRARMRQPRCGLEDPFNQKTLKYLLLG RWRKKHLTFRILNLPSTLPPHTARAALRQAFQDWSNVAPLTFQEVQAGAADIRLSFHG RQSSYCSNTFDGPGRVLAHADIPELGSVHFDEDEFWTEGTYRGVNLRIIAAHEVGHAL GLGHSRYSQALMAPVYEGYRPHFKLHPDDVAGIQALYGKKSPVIRDEEEEETELPTVP PVPTEPSPMPDPCSSELDAMMLGPRGKTYAFKGDYVWTVSDSGPGPLFRVSALWEGLP GNLDAAVYSPRTQWIHFFKGDKVWRYINFKMSPGFPKKLNRVEPNLDAALYWPLNQKV FLFKGSGYWQWDELARTDFSSYPKPIKGLFTGVPNQPSAAMSWQDGRVYFFKGKVYWR LNQQLRVEKGYPRNISHNWMHCRPRTIDTTPSGGNTTPSGTGITLDTTLSATETTFEY " misc_feature 212..223 /gene="MMP-19" /note="Zn binding site" BASE COUNT 417 a 546 c 467 g 381 t ORIGIN 1 gaattccggg agcccctctg cctagcactg ctcccccaag gctcccagaa atctcaggtc 61 agaggcacgg acagcctctg gagctctcgt ctggtgggac catgaactgc cagcagctgt 121 ggctgggctt cctactcccc atgacagtct caggccgggt cctggggctt gcagaggtgg 181 cgcccgtgga ctacctgtca caatatgggt acctacagaa gcctctagaa ggatctaata 241 acttcaagcc agaagatatc accgaggctc tgagagcttt tcaggaagca tctgaacttc 301 cagtctcagg tcagctggat gatgccacaa gggcccgcat gaggcagcct cgttgtggcc 361 tagaggatcc cttcaaccag aagaccctta aatacctgtt gctgggccgc tggagaaaga 421 agcacctgac tttccgcatc ttgaacctgc cctccaccct tccaccccac acagcccggg 481 cagccctgcg tcaagccttc caggactgga gcaatgtggc tcccttgacc ttccaagagg 541 tgcaggctgg tgcggctgac atccgcctct ccttccatgg ccgccaaagc tcgtactgtt 601 ccaatacttt tgatgggcct gggagagtcc tggcccatgc cgacatccca gagctgggca 661 gtgtgcactt cgacgaagac gagttctgga ctgaggggac ctaccgtggg gtgaacctgc 721 gcatcattgc agcccatgaa gtgggccatg ctctggggct tgggcactcc cgatattccc 781 aggccctcat ggccccagtc tacgagggct accggcccca ctttaagctg cacccagatg 841 atgtggcagg gatccaggct ctctatggca agaagagtcc agtgataagg gatgaggaag 901 aagaagagac agagctgccc actgtgcccc cagtgcccac agaacccagt cccatgccag 961 acccttgcag tagtgaactg gatgccatga tgctggggcc ccgtgggaag acctatgctt 1021 tcaaggggga ctatgtgtgg actgtatcag attcaggacc gggccccttg ttccgagtgt 1081 ctgccctttg ggaggggctc cccggaaacc tggatgctgc tgtctactcg cctcgaacac 1141 aatggattca cttctttaag ggagacaagg tgtggcgcta cattaatttc aagatgtctc 1201 ctggcttccc caagaagctg aatagggtag aacctaacct ggatgcagct ctctattggc 1261 ctctcaacca aaaggtgttc ctctttaagg gctccgggta ctggcagtgg gacgagctag 1321 cccgaactga cttcagcagc taccccaaac caatcaaggg tttgtttacg ggagtgccaa 1381 accagccctc ggctgctatg agttggcaag atggccgagt ctacttcttc aagggcaaag 1441 tctactggcg cctcaaccag cagcttcgag tagagaaagg ctatcccaga aatatttccc 1501 acaactggat gcactgtcgt ccccggacta tagacactac cccatcaggt gggaatacca 1561 ctccctcagg tacgggcata accttggata ccactctctc agccacagaa accacgtttg 1621 aatactgact gctcacccac agacacaatc ttggacatta acccctgagg ctccaccacc 1681 caccctttca tttccccccc agaagcctaa ggcctaatag ctgaatgaaa tacctgtctg 1741 ctcagtagaa ccttgcaggt gctgtagcag gcgcaagacc gtagatctca ggcctctaac 1801 acttccaact c // LOCUS HSMMPM1 3437 bp RNA PRI 24-AUG-1995 DEFINITION H.sapiens mRNA for membrane-type matrix metalloproteinase 1. ACCESSION Z48481 NID g963053 KEYWORDS matrix metalloproteinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3437) AUTHORS Will,H. and Hinzmann,B. TITLE cDNA sequence and mRNA tissue distribution of a novel human matrix metalloproteinase with a potential transmembrane segment JOURNAL Eur. J. Biochem. 231 (3), 602-608 (1995) MEDLINE 95377289 REFERENCE 2 (bases 1 to 3437) AUTHORS Will,H. TITLE Direct Submission JOURNAL Submitted (23-FEB-1995) Will H., InViTek GmbH, Robert-Roessle-Str. 10, Berlin, Germany, 13125 FEATURES Location/Qualifiers source 1..3437 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="mature" /tissue_type="lung" /clone_lib="lung cDNA library" /sex="Male" CDS 114..1862 /codon_start=1 /product="membrane-type matrix metalloproteinase 1" /db_xref="PID:g963054" /translation="MSPAPRPPRCLLLPLLTLGTALASLGSAQSSSFSPEAWLQQYGY LPPGDLRTHTQRSPQSLSAAIAAMQKFYGLQVTGKADADTMKAMRRPRCGVPDKFGAE IKANVRRKRYAIQGLKWQHNEITFCIQNYTPKVGEYATYEAIRKAFRVWESATPLRFR EVPYAYIREGHEKQADIMIFFAEGFHGDSTPFDGEGGFLAHAYFPGPNIGGDTHFDSA EPWTVRNEDLNGNDIFLVAVHELGHALGLEHSSDPSAIMAPFYQWMDTENFVLPDDDR RGIQQLYGGESGFPTKMPPQPRTTSRPSVPDKPKNPTYGPNICDGNFDTVAMLRGEMF VFKERWFWRVRNNQVMDGYPMPIGQFWRGLPASINTAYERKDGKFVFFKGDKHWVFDE ASLEPGYPKHIKELGRGLPTDKIDAALFWMPNGKTYFFRGNKYYRFNEELRAVDSEYP KNIKVWEGIPESPRGSFMGSDEVFTYFYKGNKYWKFNNQKLKVEPGYPKSALRDWMGC PSGGRPDEGTEEETEVIIIEVDEEGGGAVSAAAVVLPVLLLLLVLAVGLAVFFFRRHG TPRRLLYCQRSLLDKV" BASE COUNT 731 a 998 c 1037 g 671 t ORIGIN 1 aagttcagtg cctaccgaag acaaaggcgc cccgagggag tggcggtgcg accccagggc 61 gtgggcccgg ccgcggagcc cacactgccc ggctgacccg gtggtctcgg accatgtctc 121 ccgccccaag acccccccgt tgtctcctgc tccccctgct cacgctcggc accgcgctcg 181 cctccctcgg ctcggcccaa agcagcagct tcagccccga agcctggcta cagcaatatg 241 gctacctgcc tcccggggac ctacgtaccc acacacagcg ctcaccccag tcactctcag 301 cggccatcgc tgccatgcag aagttttacg gcttgcaagt aacaggcaaa gctgatgcag 361 acaccatgaa ggccatgagg cgcccccgat gtggtgttcc agacaagttt ggggctgaga 421 tcaaggccaa tgttcgaagg aagcgctacg ccatccaggg tctcaaatgg caacataatg 481 aaatcacttt ctgcatccag aattacaccc ccaaggtggg cgagtatgcc acatacgagg 541 ccattcgcaa ggcgttccgc gtgtgggaga gtgccacacc actgcgcttc cgcgaggtgc 601 cctatgccta catccgtgag ggccatgaga agcaggccga catcatgatc ttctttgccg 661 agggcttcca tggcgacagc acgcccttcg atggtgaggg cggcttcctg gcccatgcct 721 acttcccagg ccccaacatt ggaggagaca cccactttga ctctgccgag ccttggactg 781 tcaggaatga ggatctgaat ggaaatgaca tcttcctggt ggctgtgcac gagctgggcc 841 atgccctggg gctcgagcat tccagtgacc cctcggccat catggcaccc ttttaccagt 901 ggatggacac ggagaatttt gtgctgcccg atgatgaccg ccggggcatc cagcaacttt 961 atgggggtga gtcagggttc cccaccaaga tgccccctca acccaggact acctcccggc 1021 cttctgttcc tgataaaccc aaaaacccca cctatgggcc caacatctgt gacgggaact 1081 ttgacaccgt ggccatgctc cgaggggaga tgtttgtctt caaggagcgc tggttctggc 1141 gggtgaggaa taaccaagtg atggatggat acccaatgcc cattggccag ttctggcggg 1201 gcctgcctgc gtccatcaac actgcctacg agaggaagga tggcaaattc gtcttcttca 1261 aaggagacaa gcattgggtg tttgatgagg cgtccctgga acctggctac cccaagcaca 1321 ttaaggagct gggccgaggg ctgcctaccg acaagattga tgctgctctc ttctggatgc 1381 ccaatggaaa gacctacttc ttccgtggaa acaagtacta ccgtttcaac gaagagctca 1441 gggcagtgga tagcgagtac cccaagaaca tcaaagtctg ggaagggatc cctgagtctc 1501 ccagagggtc attcatgggc agcgatgaag tcttcactta cttctacaag gggaacaaat 1561 actggaaatt caacaaccag aagctgaagg tagaaccggg ctaccccaag tcagccctga 1621 gggactggat gggctgccca tcgggaggcc ggccggatga ggggactgag gaggagacgg 1681 aggtgatcat cattgaggtg gacgaggagg gcggcggggc ggtgagcgcg gctgccgtgg 1741 tgctgcccgt gctgctgctg ctcctggtgc tggcggtggg ccttgcagtc ttcttcttca 1801 gacgccatgg gacccccagg cgactgctct actgccagcg ttccctgctg gacaaggtct 1861 gacgcccacc gccggcccgc ccactcctac cacaaggact ttgcctctga aggccagtgg 1921 cagcaggtgg tggtgggtgg gctgctccca tcgtcccgag ccccctcccc gcagcctcct 1981 tgcttctctc tgtcccctgg ctggcctcct tcaccctgac cgcctccctc cctcctgccc 2041 cggcattgca tcttccctag ataggtcccc tgagggctga gtgggagggc ggccctttcc 2101 agcctctgcc cctcagggga accctgtagc tttgtgtctg tccagcccca tctgaatgtg 2161 ttgggggctc tgcacttgaa ggcaggaccc tcagacctcg ctggtaaagg tcaaatgggg 2221 tcatctgctc cttttccatc ccctgacata ccttaacctc tgaactctga cctcaggagg 2281 ctctgggcac tccagccctg aaagccccag gtgtacccaa ttggcagcct ctcactactc 2341 tttctggcta aaaggaatct aatcttgttg agggtagaga ccctgagaca gtgtgagggg 2401 gtggggactg ccaagccacc ctaagacctt gggaggaaaa ctcagagagg gtcttcgttg 2461 ctcagtcagt caagttcctc ggagatctgc ctctgcctca cctaccccag ggaacttcca 2521 aggaaggagc ctgagccact ggggactaag tgggcagaag aaacccttgg cagccctgtg 2581 cctctcgaat gttagccttg gatggggctt tcacagttag aagagctgaa accaggggtg 2641 cagctgtcag gtagggtggg gccggtggga gaggcccggg tcagagccct gggggtgagc 2701 ctgaaggcca cagagaaaga accttgccca aactcaggca gctggggctg aggcccaaag 2761 gcagaacagc cagagggggc aggaggggac caaaaaggaa aatgaggacg tgcagcagca 2821 ttggaaggct ggggccgggc aggccaggcc aagccaagca gggggccaca gggtgggctg 2881 tggagctctc aggaagggcc ctgaggaagg cacacttgct cctgttggtc cctgtccttg 2941 ctgcccaggc agcgtggagg ggaagggtag ggcagccaga gaaaggagca gagaaggcac 3001 acaaacgagg aatgaggggc ttcacgagag gccacagggc ctggctggcc acgctgtccc 3061 ggcctgctca ccatctcagt gaggggcagg agctggggct cgcttaggct gggtccacgc 3121 ttccctggtg ccagcacccc tcaagcctgt ctcaccagtg gcctgccctc tcgctccccc 3181 acccagccca cccattgaag tctccttggg ccaccaaagg tggtggccat ggtaccgggg 3241 acttgggaga gtgagaccca gtggagggag caagaggaga gggatgtcgg gggggtgggg 3301 cacggggtag gggaaatggg gtgaacggtg ctggcagttc ggctagattt ctgtcttgtt 3361 tgtttttttg ttttgtttaa tgtatatttt tattataatt attatatatg aattccaaaa 3421 aaaaaaaaaa aaaaaaa // LOCUS HSMMPM2 3530 bp RNA PRI 24-AUG-1995 DEFINITION H.sapiens mRNA for membrane-type matrix metalloproteinase 2. ACCESSION Z48482 NID g963055 KEYWORDS matrix metalloproteinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3530) AUTHORS Will,H. TITLE Direct Submission JOURNAL Submitted (23-FEB-1995) Will H., InViTek GmbH, Robert-Roessle-Str. 10, Berlin, Germany, 13125 REFERENCE 2 (bases 1 to 3530) AUTHORS Will,H. and Hinzmann,B. TITLE cDNA sequence and mRNA tissue distribution of a novel human matrix metalloproteinase with a potential transmembrane segment JOURNAL Eur. J. Biochem. 231 (3), 602-608 (1995) MEDLINE 95377289 FEATURES Location/Qualifiers source 1..3530 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="mature" /tissue_type="lung" /clone_lib="lung cDNA" /sex="Male" CDS 49..2058 /codon_start=1 /product="membrane-type matrix metalloproteinase 2" /db_xref="PID:g963056" /translation="MGSDPSAPGRPGWTGSLLGDREEAARPRLLPLLLVLLGCLGLGV AAEDAEVHAENWLRLYGYLPQPSRHMSTMRSAQILASALAEMQRFYGIPVTGVLDEET KEWMKRPRCGVPDQFGVRVKANLRRRRKRYALTGRKWNNHHLTFSIQNYTEKLGWYHS MEAVRRAFRVWEQATPLVFQEVPYEDIRLRRQKEADIMVLFASGFHGDSSPFDGTGGF LAHAYFPGPGLGGDTHFDADEPWTFSSTDLHGNNLFLVAVHELGHALGLEHSSNPNAI MAPFYQWKDVDNFKLPEDDLRGIQQLYGTPDGQPQPTQPLPTVTPRRPGRPDHRPPRP PQPPPPGGKPERPPKPGPPVQPRATERPDQYGPNICDGDFDTVAMLRGEMFVFKGRWF WRVRHNRVLDNYPMPIGHFWRGLPGDISAAYERQDGRFVFFKGDRYWLFREANLEPGY PQPLTSYGLGIPYDRIDTAIWWEPTGHTFFFQEDRYWRFNEETQRGDPGYPKPISVWQ GIPASPKGAFLSNDAAYTYFYKGTKYWKFDNERLRMEPGYPKSILRDFMGCQEHVEPG PRWPDVARPPFNPHGGAEPGADSAEGDVGDGDGDFGAGVNKDGGSRVVVQMEEVARTV NVVMVLVPLLLLLCVLGLTYALVQMQRKGAPRVLLYCKRSLQEWV" polyA_site 3491..3496 BASE COUNT 634 a 1160 c 1080 g 656 t ORIGIN 1 gcgaggatcc ggcgtgcagt gttccgagct gggctgggcg ccgagagcat gggcagcgac 61 ccgagcgcgc ccggacggcc gggctggacg ggcagcctcc tcggcgaccg ggaggaggcg 121 gcgcggccgc gactgctgcc gctgctcctg gtgcttctgg gctgcctggg ccttggcgta 181 gcggccgaag acgcggaggt ccatgccgag aactggctgc ggctttatgg ctacctgcct 241 cagcccagcc gccatatgtc caccatgcgt tccgcccaga tcttggcctc ggcccttgca 301 gagatgcagc gcttctacgg gatcccagtc accggtgtgc tcgacgaaga gaccaaggag 361 tggatgaagc ggccccgctg tggggtgcca gaccagttcg gggtacgagt gaaagccaac 421 ctgcggcggc gtcggaagcg ctacgccctc accgggagga agtggaacaa ccaccatctg 481 acctttagca tccagaacta cacggagaag ttgggctggt accactcgat ggaggcggtg 541 cgcagggcct tccgcgtgtg ggagcaggcc acgcccctgg tcttccagga ggtgccctat 601 gaggacatcc ggctgcggcg acagaaggag gccgacatca tggtactctt tgcctctggc 661 ttccacggcg acagctcgcc gtttgatggc accggtggct ttctggccca cgcctatttc 721 cctggccccg gcctaggcgg ggacacccat tttgacgcag atgagccctg gaccttctcc 781 agcactgacc tgcatggaaa caacctcttc ctggtggcag tgcatgagct gggccacgcg 841 ctggggctgg agcactccag caaccccaat gccatcatgg cgccgttcta ccagtggaag 901 gacgttgaca acttcaagct gcccgaggac gatctccgtg gcatccagca gctctacggt 961 accccagacg gtcagccaca gcctacccag cctctcccca ctgtgacgcc acggcggcca 1021 ggccggcctg accaccggcc gccccggcct ccccagccac cacccccagg tgggaagcca 1081 gagcggcccc caaagccggg ccccccagtc cagccccgag ccacagagcg gcccgaccag 1141 tatggcccca acatctgcga cggggacttt gacacagtgg ccatgcttcg cggggagatg 1201 ttcgtgttca agggccgctg gttctggcga gtccggcaca accgcgtcct ggacaactat 1261 cccatgccca tcgggcactt ctggcgtggt ctgcccggtg acatcagtgc tgcctacgag 1321 cgccaagacg gtcgttttgt ctttttcaaa ggtgaccgct actggctctt tcgagaagcg 1381 aacctggagc ccggctaccc acagccgctg accagctatg gcctgggcat cccctatgac 1441 cgcattgaca cggccatctg gtgggagccc acaggccaca ccttcttctt ccaagaggac 1501 aggtactggc gcttcaacga ggagacacag cgtggagacc ctgggtaccc caagcccatc 1561 agtgtctggc aggggatccc tgcctcccct aaaggggcct tcctgagcaa tgacgcagcc 1621 tacacctact tctacaaggg caccaaatac tggaaattcg acaatgagcg cctgcggatg 1681 gagcccggct accccaagtc catcctgcgg gacttcatgg gctgccagga gcacgtggag 1741 ccaggccccc gatggcccga cgtggcccgg ccgcccttca acccccacgg gggtgcagag 1801 cccggggcgg acagcgcaga gggcgacgtg ggggatgggg atggggactt tggggccggg 1861 gtcaacaagg acgggggcag ccgcgtggtg gtgcagatgg aggaggtggc acggacggtg 1921 aacgtggtga tggtgctggt gccactgctg ctgctgctct gcgtcctggg cctcacctac 1981 gcgctggtgc agatgcagcg caagggtgcg ccacgtgtcc tgctttactg caagcgctcg 2041 ctgcaggagt gggtctgacc acccagcgct cctgctaacg gtgctcaggg ggcgcctgtg 2101 gttctgagat ggctcccagg ggctccctcc gcccccaggt aggggcccct ctcagccctc 2161 acacaccctg tctgccccgc cctcattatt tatgtccagg tgtttgtttt gttttgtttt 2221 tggcacctta cttgaccatt tgtttctgtt tccccgactg gggcagggtg tttagaattt 2281 tctaaatgta gttctgctcc agacagggaa ttaggccccc atcatcctct ggcttggcca 2341 cagccagggg agcagagggg cagaggccca cattggaaga gcagcacctc ctcagcctga 2401 accccagggc tgtaactgcc aggctctctt tgcccagttg gagactgtct ggcccccctg 2461 gtcccctcct tcccaagtga gtctctctgg gccttaggaa gagccttcca cccaggggca 2521 gccccaggcc aaaggggacc tggaagggag gtgggccgtg gcccttgagt ccccattgag 2581 gcttggttcc ttcccaatcc agtggacttc gcagtccact tctgacagcc tcagtgaccc 2641 tggctccttg tgccagagaa cccagcccac ccccggcagc agcccccagc tcccacctcc 2701 ccttgggccc acaccttctt ccctctctgg agaaagggcc ctgggcctgc ctcaccacgg 2761 accaaaggga gtctgccagg gcccctctcc ccagggaagc agcagcctcg cccctggcag 2821 agatgcctcc ctgagctaga accctctgtt ccttccctgt gcctcctccc tccctcccga 2881 ctcacaccac tagcctcagg ggtctgagct ccagctcctt tgggcttcag ctgccagtgt 2941 cctgagcccc agggagaggg ggctggtggg tgcctaggcc tgggcagtgg atggccgtga 3001 atgggtgccc acagtgtcag gcactgggca tgaggggttc ctcccctcca gctccctgtg 3061 cccccagggt cctgggagga gagacactgg tggggatagg ccagccgcgc atcagactgt 3121 gaaccccacg aaggagccca ttgtggccta agaggctgcc ctcctgtgct cagccctgag 3181 gacagatgcc tccttcctct tttccttccc aaagcaagca agaggccgtg gctgctgtgg 3241 gaaatggtac tgtacagctg gctctacttc cccatggccc tgagcgagtg gagtctgcca 3301 cccaggatcc ccaaggcact tgagggggaa ggattctgct ggcctctgcg agtggtttct 3361 tgtgcactgg caccaagtgc gggtccggca gcttctgccc cctgcagaac cggagagcca 3421 gctaaggggt ggggctgcgg gggttccgtg tccaccccca tacatttatt tctgtaaata 3481 atgtgcactg aataaattgt acagccggca aaaaaaaaaa aaaaaaaaaa // LOCUS HSMNSOD 977 bp RNA PRI 12-NOV-1990 DEFINITION Human mRNA for mangano-superoxide dismutase (Mn-SOD). ACCESSION X14322 NID g34706 KEYWORDS dismutase; manganese superoxide dismutase; metalloenzyme; superoxide dismutase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 977) AUTHORS Wispe,J.R., Clark,J.C., Burhans,M.S., Kropp,K.E., Korfhagen,T.R. and Whitsett,J.A. TITLE Synthesis and processing of the precursor for human mangano-superoxide dismutase JOURNAL Biochim. Biophys. Acta 994 (1), 30-36 (1989) MEDLINE 89076921 FEATURES Location/Qualifiers source 1..977 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /clone_lib="lambda gt11" /clone="lambda 105" mRNA 1..977 /gene="Mn-SOD" /evidence=experimental gene 1..977 /gene="Mn-SOD" CDS 96..764 /gene="Mn-SOD" /codon_start=1 /product="Manganese superoxide dismutase" /db_xref="PID:g34707" /db_xref="SWISS-PROT:P04179" /translation="MLSRAVCGTSRQLAPALGYLGSRQKHSLPDLPYDYGALEPHINA QIMQLHHSKHHAAYVNNLNVTEEKYQEALAKGDVTAQTALQPALKFNGGGHINHSIFW TNLSPNGGGEPKGELLEAIKRDFGSFDKFKEKLTAASVGVQGSGWGWLGFNKERGHLQ IAACPNQDPLQGTTGLIPLLGIDVWEHAYYLQYKNVRPDYLKAIWNVINWENVTERYM ACKK" transit_peptide 96..167 /gene="Mn-SOD" /evidence=experimental mat_peptide 168..761 /gene="Mn-SOD" /evidence=experimental /product="Manganese superoxide dismutase" polyA_site 977 /gene="Mn-SOD" BASE COUNT 253 a 221 c 260 g 243 t ORIGIN 1 ccgccggcgc gcaggagcgg cactcgtggc tgtggtggct tcggcagcgg cttcagcaga 61 tcggcggcat cagcggtacg accagcacta gcagcatgtt gagccgggca gtgtgcggca 121 ccagcaggca gctggctccg gctttggggt atctgggctc caggcagaag cacagcctcc 181 ccgacctgcc ctacgactac ggcgccctgg aacctcacat caacgcgcag atcatgcagc 241 tgcaccacag caagcaccac gcggcctacg tgaacaacct gaacgtcacc gaggagaagt 301 accaggaggc gttggcaaag ggagatgtta cagcccagac agctcttcag cctgcactga 361 agttcaatgg tggtggtcat atcaatcata gcattttctg gacaaacctc agccctaacg 421 gtggtggaga acccaaaggg gagttgctgg aagccatcaa acgtgacttt ggttcctttg 481 acaagtttaa ggagaagctg acggctgcat ctgttggtgt ccaaggctca ggttggggtt 541 ggcttggttt caataaggaa cggggacact tacaaattgc tgcttgtcca aatcaggatc 601 cactgcaagg aacaacaggc cttattccac tgctggggat tgatgtgtgg gagcacgctt 661 actaccttca gtataaaaat gtcaggcctg attatctaaa agctatttgg aatgtaatca 721 actgggagaa tgtaactgaa agatacatgg cttgcaaaaa gtaaaccacg atcgttatgc 781 tgagtatgtt aagctcttta tgactgtttt tgtagtggta tagagtactg cagaatacag 841 taagctgctc tattgtagca tttcttgatg ttgcttagtc acttatttca taaacaactt 901 aatgttctga ataatttctt actaaacatt ttgttattgg gcaagtgatt gaaaatagta 961 aatgctttgt gtgattg // LOCUS HSMOT 555 bp RNA PRI 29-NOV-1994 DEFINITION Human mRNA for motilin precursor. ACCESSION Y00695 NID g34716 KEYWORDS hormone; motilin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 555) AUTHORS Seino,Y. TITLE Direct Submission JOURNAL Submitted (23-SEP-1987) Seino Y., Division of Metabolism and Clinical Nutrition, Kyoto University Hospital, Shogoin, Sakyo-ku, Kyoto 606, Japan REFERENCE 2 (bases 1 to 555) AUTHORS Seino,Y., Tanaka,K., Takeda,J., Takahashi,H., Mitani,T., Kurono,M., Kayano,T., Koh,G., Fukumoto,H., Yano,H., Fujita,J., Inagaki,N., Yamada,Y. and Imura,H. TITLE Sequence of an intestinal cDNA encoding human motilin precursor JOURNAL FEBS Lett. 223 (1), 74-76 (1987) MEDLINE 88030048 FEATURES Location/Qualifiers source 1..555 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 56..130 CDS 56..403 /note="motilin precursor" /codon_start=1 /db_xref="PID:g599798" /db_xref="SWISS-PROT:P12872" /translation="MVSRKAVAALLVVHVAAMLASQTEAFVPIFTYGELQRMQEKERN KGQKKSLSVWQRSGEEGPVDPAEPIREEENEMIKLTAPLEIGMRMNSRQLEKYPATLE GLLSEMLPQHAAK" mat_peptide 131..400 polyA_signal 535..540 /note="putative" polyA_signal 555 BASE COUNT 155 a 144 c 159 g 97 t ORIGIN 1 agacaagtag agagactcct ccagacccac tcagaccacg tgcacgccct ccaagatggt 61 atcccgtaag gctgtggctg ctctgctggt ggtgcatgta gctgccatgc tggcctccca 121 gacggaagcc ttcgtcccca tcttcaccta tggcgaactc cagaggatgc aggaaaagga 181 acggaataaa gggcaaaaga aatccctgag tgtatggcag aggtctgggg aggaaggtcc 241 tgtagaccct gcggagccca tcagggaaga agaaaacgaa atgatcaagc tgactgctcc 301 tctggaaatt ggaatgagga tgaactccag acagctggaa aagtacccgg ccaccctgga 361 agggctgctg agtgagatgc ttccccagca tgcagccaag tgatggccac gctggggaga 421 aggtggacag atttgggagg cccctcctgc ccaagtgagg ccctgggaat ttacagagcc 481 tgccagctgg gcttggaagg aaaacacctt tccaaagcaa attcccctcc agcaaataaa 541 gcatgaaata tacag // LOCUS HSMOX1 2330 bp mRNA PRI 23-JUL-1994 DEFINITION Human Mox1 protein (MOX1) mRNA, complete cds. ACCESSION U10492 NID g505653 KEYWORDS homeobox domain. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2330) AUTHORS Futreal,P.A., Cochran,C., Rosenthal,J., Miki,Y., Swenson,J., Hobbs,M., Bennett,L.M., Haugen-Strano,A., Marks,J., Barrett,J.C., Tavtigian,S.V., Shattuck-Eidens,D., Kamb,A., Skolnick,M. and Wiseman,R.W. TITLE Isolation of a diverged homeobox gene, MOX1, from the BRCA1 region on 17q21 by solution hybrid capture JOURNAL Hum. Mol. Genet. 3, 1359-1364 (1994) MEDLINE 95078841 REFERENCE 2 (bases 1 to 2330) AUTHORS Bennett,L.M. TITLE Direct Submission JOURNAL Submitted (09-JUN-1994) L. Michelle Bennett, Laboratory of Molecular Carcinogenesis, National Institute of Environmental Health Sciences, 111 Alexander Drive, Research Triangle Park, NC 27709, USA FEATURES Location/Qualifiers source 1..2330 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17" /map="17q21" /sex="female" /tissue_type="breast" gene 1..794 /gene="MOX1" exon 1..498 /gene="MOX1" /number=1 CDS 30..794 /gene="MOX1" /codon_start=1 /product="Mox1" /db_xref="PID:g514309" /translation="MDPAASSCMRSLQPPAPVWGCLRNPHSEGNGASGLPHYPPTPFS FHQKPDFLATATAAYPDFSASCLAATPHSLPQEEHIFTEQHPAFPQSPNWHFPVSDAR RRPNSGPAGGSKEMGTSSLGLVDTTGGPGDDYGVLGSTANETEKKSSRRRKESSDNQE NRGKPEGSSKARKERTAFTKEQLRELEAEFAHHNYLTRLRRYEIAVNLDLSERQVKVW FQNRRMKWKRVKGGQPISPNGQDPEDGDSTASPSSE" exon 499..671 /gene="MOX1" /number=2 misc_feature 540..722 /gene="MOX1" /note="homeobox domain" exon 672..794 /gene="MOX1" /number=3 polyA_signal 2296..2310 BASE COUNT 605 a 641 c 614 g 467 t 3 others ORIGIN 1 aaaggaccga ggcgtgcagc ggacagcaga tggatcccgc ggccagcagc tgcatgagga 61 gcctccagcc cccagcccct gtctggggct gccttcgaaa cccccactcg gaaggcaatg 121 gggcctcagg gctaccccac tacccgccca ccccgttctc cttccaccag aaaccagact 181 tcctggcgac agcgacggca gcgtaccctg acttctcagc ctcctgcctg gcagccaccc 241 cacacagcct gccccaggag gagcacatct tcactgagca gcaccccgct ttcccacagt 301 cccccaactg gcacttccct gtctcagacg cccggcgcag gcccaactca ggcccggcag 361 ggggttccaa ggaaatgggg accagcagcc tgggcctggt ggacaccaca ggaggcccag 421 gcgatgacta cggggtgctt gggagcactg ccaatgagac agagaagaaa tcatccaggc 481 ggagaaagga gagttcagac aaccaggaga acagagggaa gccggagggc agcagcaaag 541 cccgcaagga gaggacggcc ttcaccaagg agcagctgcg agagctggag gcagagtttg 601 cccatcataa ctacctgact cggctccgca gatatgagat tgcggtaaac ctggacctct 661 ctgagcgcca ggtcaaagtg tggttccaga accgaaggat gaagtggaag cgtgtgaagg 721 gaggtcagcc catctccccc aatgggcagg accctgagga tggggactcc acagcctctc 781 caagttcaga gtgagattct gcatggagga aaaatgacta aggactgagc cccctaccca 841 actaccccca ccccaatccc accttcaccc tcttccttcc ccagccaggg cagcctctcc 901 acatctttcc ctgactcttg gatatgaaac tgcccagcat tcctgggagt cttaggattt 961 tctaggaagt tctgtccagc ctcttagcag cctcttccct agggcctttg ctcccacact 1021 ctcatggaat cagacagaga tcctaccggg ccggatgaat ctggaaacag cttcagagat 1081 actgcttctc agcgtctctt ggctgccacc catgcctcct cctaccgctg ttctcctagg 1141 tcagccaggc ctcctcctgg tctggacacc acctggcctg gtgggagagg agctttggaa 1201 ccagctggcg actcggaaag taaatgcttc aaaaggaagg aaatgacaga gacacacgcc 1261 cttgcccacc ttcctctgta ggctgcacat ctgaggcttt ggggcccctt agttgtcccg 1321 aaaccccaag aaaaatcaga atgaggagag tcaaggacag caactcagct gctgcaagcc 1381 agaaacacat ccctgtctcc aaatttgttg gctaagtgga gacacttctg agaactgact 1441 agagaagaca agaaaatagc ccgatgtagg tttcggtgtc cccatatagg ccccgtccac 1501 acaggcttga ctgggtggac aagaatgaac ccatgacagc acctgctgct tcaaaatcaa 1561 aatcaattta gggatacagc aggggctgtt gggctgtgct ccagagaaaa ggagcagcta 1621 ctccttttaa atccacgatt tctggattga aaacctgtcc agatgctgag ttgttgggct 1681 gaacaactag gagctgaaaa caacgtagag gctggaaagt gtcccctgca ttctggaggg 1741 gaggggagat aataaggagg gctgctgggt gagggcctgg agatgtggaa ccctggagtg 1801 gaaggtttct ccagtgacag tgtcctgtga cwgcaaaagg grasaagaaa atccctcttc 1861 ctccatggga tggatttaag ctcttgctgt gtgttctaca aatgctgtta ttgtgggagg 1921 aaatgctagg tttttgtgtg tggactgccc agacctcagc caggtcttct ggagatgaca 1981 tttgaggact gatggccaaa gagcatgggg gactgaagcc ctggctgcct cagcgctctg 2041 tctcccaaca ccagctggtg ttgcagaggg aggtcaacgt gagtttggat ctcttgtacg 2101 cagatgtaat cattcacatg taaaaataac cccacctccc caccccaaaa agggcaagag 2161 ctgtggaaaa tgattgccaa atgagatggc tggttagagc atgatttttt ctaaagcata 2221 cttcatatat tttcttaaga ttacatcaag ctaattgtgc gagctcaatt cactttgtaa 2281 gaaaactctc ggagaaataa aatcaataaa aagccaaaaa aaaaaataag // LOCUS HSMOX2 2357 bp RNA PRI 21-SEP-1995 DEFINITION H.sapiens mRNA for Mox-2. ACCESSION X82629 NID g732790 KEYWORDS homeobox gene; MOX2 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2357) AUTHORS Grigoriou,M., Kastrinaki,M.C., Modi,W.S., Theodorakis,K., Mankoo,B., Pachnis,V. and Karagogeos,D. TITLE Isolation of the human MOX2 homeobox gene and localization to chromosome 7p22.1-p21.3 JOURNAL Genomics 26 (3), 550-555 (1995) MEDLINE 95331791 REFERENCE 2 (bases 1 to 2357) AUTHORS Karagogeos,D. TITLE Direct Submission JOURNAL Submitted (10-NOV-1994) D. Karagogeos, Inst. of Molecular Biology and Biotechnology, P.O.Box 1527, Heraklion 71110, GREECE FEATURES Location/Qualifiers source 1..2357 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="11-weeks-old embryo" /tissue_type="embryo" /chromosome="7" /map="p22.1-21.3" 5'UTR 1..265 /evidence=experimental gene 266..1177 /gene="MOX2" CDS 266..1177 /gene="MOX2" /codon_start=1 /evidence=experimental /product="Mox-2" /db_xref="PID:g732791" /translation="MEHPLFGCLRSPHATAQGLHPFSQSSLALHGRSDHMSYPELSTS SSSCIIAGYPNEEGMFASQHHRGHHHHHHHHHHHHQQQQHQALQTNWHLPQMSSPPSA ARHSLCLQPDSGGPPELGSSPPVLCSNSSSLGSSTPTGAACAPGDYGRQALSPAEAEK RSGGKRKSDSSDSQEGNYKSEVNSKPRKERTAFTKEQIRELEAEFAHHNYLTRLRRYE IAVNLDLTERQVKVWFQNRRMKWKRVKGGQQGAAAREKELVNVKKGTLLPSELSGIGA ATLQQTGDSIANEDSHDSDHSSEHAHL" misc_feature 824..1006 /gene="MOX2" /note="homeobox domain" 3'UTR 1179..2357 /evidence=experimental polyA_signal 2332..2337 BASE COUNT 700 a 544 c 511 g 601 t 1 others ORIGIN 1 gggaccacct tcttttggct tcaacctctc ccactcttga catctgagta gctcagggaa 61 gctcttccag gtccgactgt tcatatgtaa aggagactgg ccgctgggct caggaccggg 121 attatccgag ctctgcagaa gtgcaccgct attgctttgg gagggaaaaa aaaaaaatca 181 cacggtttcc agtgaaaaag tgacagaggg tggtggcctt tggaaccgtc gtcccgtctc 241 tccctgaacc cgaaacttgc atgctatgga acacccgctc tttggctgcc tgcgcagccc 301 tcacgccacg gcgcaaggct tgcacccgtt ctcccaatcc tctctcgccc tccatggaag 361 atctgaccat atgtcttacc ccgagctctc tacttcttcc tcatcttgca taatcgcggg 421 ataccccaac gaagagggca tgtttgccag ccagcatcac agggggcacc accaccacca 481 ccaccaccac catcaccacc atcagcagca gcagcaccag gctctgcaaa ccaactggca 541 cctcccgcag atgtcttccc caccgagtgc ggctcggcac agcctctgcc tccagcccga 601 ctctggaggg cccccagagt tggggagcag cccgcccgtc ctgtgctcca actcttccag 661 cttgggctcc agcaccccga ctggggccgc gtgcgcgccg ggggactacg gccgccaggc 721 actgtcacct gcggaggcgg agaagcgaag cggcggcaag aggaaaagcg acagctcaga 781 ctcccaggaa ggaaattaca agtcagaagt caacagcaaa cccaggaaag aaaggacagc 841 atttaccaaa gagcaaatca gagaacttga agcagaattt gcccatcata attatctcac 901 cagactgagg cgatacgaga tagcagtgaa tctggatctc actgaaagac aggtgaaagt 961 ctggttccaa aacaggcgga tgaagtggaa gagggtaaag ggtggacagc aaggagctgc 1021 ggctcgggaa aaggaactgg tgaatgtgaa aaagggaaca cttctcccat cagagctgtc 1081 gggaattggt gcagccaccc tccagcaaac aggggactct atagcaaatg aagacagtca 1141 cgacagtgac cacagctcag agcatgcgca cttatgatat aaacagagga ccagctccat 1201 tctcaggaaa gaaatgttgt ggatggcaag cctttaccca aatatcgttt acacagagag 1261 atgactatgg cagtgatgtt taatattatt aaatccaggc atttcgaatc tgtttttcat 1321 tgatttatta gagggtttac acaaagagct tccacagtga agatggagaa ggtgaacttg 1381 ctttgaatat nccagatttg tttggtcatg cgtatggcag tgagcaggta tgtgttttct 1441 tttcttcacg aaaattaaat tgctatcaag agcaaactat gaacattata ttcaagatgt 1501 ctccagagtg aagatgccga ggatgaactt gcattgaaca ttccagatgt gtgagatcat 1561 gtgtattgca gtgggcaggt atttgctttt gcttgcactg aaaattaaat tgctatcaag 1621 aataaaccat gaaacatttt atcctgaaca gccacagtgc ctgaattcac tcaagtggat 1681 aaaaaagtgt attttaactc tgtatattac ccttaagtca ttttcctgtc ttcactaatt 1741 tagcaatgca ttcatattag ctgatgaaat aggcactcac aatgacaacc agagccagtt 1801 tcttgtcttt ttatacattt tgtcatccca gagactcggt atttgcttac tgtgtttcaa 1861 gtagaggaaa tcgtggtctt gaactattct gtaccacagc aaacaatcta tgttgcttta 1921 ctatcaactg ctgtaatcgt ttataaaact tacctagctc cttcccttct tctatcatag 1981 ctttaaacat tagaattcat aggcaaatca gttaaaacat taggatcata ggcaaatcag 2041 ttaccttgca gaaagagctt tgtatgacag acattgtctt attttatttc tgtaaaatat 2101 tagctgtatg aatatgattt aattaacaag aaaacatttc ttcctgattg acaacagtgt 2161 tagcaaggtg caaagcgaaa ctggttgctc aagttgatag aaaacaaaat tctgaatatc 2221 ttcaaattaa ttcggtaaaa acacattatt ttttcatatg tgatgtattc atgcagaaca 2281 actatcttgt attttgtttt taaaatgtgt ttaataaatg atcctttgta aataaaaaaa 2341 aaaaaaaaaa aaaaaaa // LOCUS HSMPNU 1244 bp RNA PRI 26-APR-1993 DEFINITION H.sapiens mRNA for macropain subunit nu. ACCESSION X61969 NID g296737 KEYWORDS macropain. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1244) AUTHORS DeMartino,G.N., Orth,K., McCullough,M.L., Lee,L.W., Munn,T.Z., Moomaw,C.R., Dawson,P.A. and Slaughter,C.A. TITLE The primary structures of four subunits of the human, high-molecular-weight proteinase, macropain (proteasome), are distinct but homologous JOURNAL Biochim. Biophys. Acta 1079 (1), 29-38 (1991) MEDLINE 91363412 FEATURES Location/Qualifiers source 1..1244 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 106..897 /codon_start=1 /product="macropaine subunit nu" /db_xref="PID:g296738" /db_xref="SWISS-PROT:P25786" /translation="MFRNQYDNDVTVWSPQGRIHQIEYAMEAVKQGSATVGLKSKTHA VLVALKRAQSELAAHQKKILHVDNHIGISIAGLTADARLLCNFMRQECLDSRFVFDRP LPVSRLVSLIGSKTQIPTQRYGRRPYGVGLLIAGYDDMGPHIFQTCPSANYFDCRAMS IGARSQSARTYLERHMSEFMECNLNELVKHGLRALRETLPAEQDLTTKNVSIGIVGKD LEFTIYDDDDVSPFLEGLEERPQRKAQPAQPADEPAEKADEPMEH" BASE COUNT 379 a 243 c 264 g 358 t ORIGIN 1 ggaaactccc gcagacttct ctgtagatcg ctgagcgata ctttcggcag cacctccttg 61 attctcagtt ttgctggagg ccgcaaccag gcccgcgccg ccaccatgtt tcgaaatcag 121 tatgacaatg atgtcactgt ttggagcccc cagggcagga ttcatcaaat tgaatatgca 181 atggaagctg ttaaacaagg ttcagccaca gttggtctga aatcaaaaac tcatgcagtt 241 ttggttgcat tgaaaagggc gcaatcagag cttgcagctc atcagaaaaa aattctccat 301 gttgacaacc acattggtat ctcaattgcg gggcttactg ctgatgctag actgttatgt 361 aattttatgc gtcaggagtg tttggattcc agatttgtat tcgatagacc actgcctgtg 421 tctcgtcttg tatctctaat tggaagcaag acccagatac caacacaacg atatggccgg 481 agaccatatg gtgttggtct ccttattgct ggttatgatg atatgggccc tcacattttc 541 caaacctgtc catctgctaa ctattttgac tgcagagcca tgtccattgg agcccgttcc 601 caatcagctc gtacttactt ggagagacat atgtctgaat ttatggagtg taatttaaat 661 gaactagtta aacatggtct gcgtgcctta agagagacgc ttcctgcaga acaggacctg 721 actacaaaga atgtttccat tggaattgtt ggtaaagact tggagtttac aatctatgat 781 gatgatgatg tgtctccatt cctggaaggt cttgaagaaa gaccacagag aaaggcacag 841 cctgctcaac ctgctgatga acctgcagaa aaggctgatg aaccaatgga acattaagtg 901 ataagccagt ctatatatgt attatcaaat atgtaagaat acaggcacca catactgatg 961 acaataatct atactttgaa ccaaaagttg cagagtggtg gaatgctatg ctatgtttta 1021 ggaatcagtc cagatgtgag ttttttccaa gcaacctcac tgaaacctat ataatggaat 1081 acatttttct ttgaaagggt ctgtataatc attttctaga agtatgggta tctatactaa 1141 tgttttttat ataagaacat aggtgtcttt gtggttttaa agacaactgt gaaataaaat 1201 tgtttcaccg cctggtaaaa aaaaaaaaaa aaaaaaaaaa aaaa // LOCUS HSMPOR 3213 bp RNA PRI 27-MAR-1995 DEFINITION Human mRNA for myeloperoxidase (EC 1.11.1.7). ACCESSION X04876 NID g34720 KEYWORDS glycoprotein; myeloperoxidase; unidentified reading frame. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3213) AUTHORS Johnson,K.R., Nauseef,W.M., Care,A., Wheelock,M.J., Shane,S., Hudson,S., Koeffler,H.P., Selsted,M., Miller,C. and Rovera,G. TITLE Characterization of cDNA clones for human myeloperoxidase: predicted amino acid sequence and evidence for multiple mRNA species JOURNAL Nucleic Acids Res. 15 (5), 2013-2028 (1987) MEDLINE 87174733 FEATURES Location/Qualifiers source 1..3213 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HL-60 promyelocytic leukemia cells" CDS 31..66 /note="ORF (AA 1-11)" /codon_start=1 /db_xref="PID:g34721" /translation="MTAAGKGIREQ" sig_peptide 164..286 CDS 164..2401 /note="(AA -41 to 699)" /codon_start=1 /product="prepro-myeloperoxidase" /db_xref="PID:g34722" /db_xref="SWISS-PROT:P05164" /translation="MGVPFFSSLRCMVDLGPCWAGGLTAEMKLLLALAGVLAILATPQ PSEGAAPAVLGEVDTSLVLSSMEEAKQLVDKAYKERRESIKQRLRSGSASPMELLSYF KQPVAATRTAVRAADYLHVALDLLERKLRSLWRRPFNVTDVLTPAQLNVLSKSSGCAY QDVGVTCPEQDKYRTITGMCNNRRSPTLGASNRAFVRWLPAEYEDGFSLPYGWTPGVK RNGFPVALARAVSNEIVRFPTDQLTPDQERSLMFMQWGQLLDHDLDFTPEPAARASFV TGVNCETSCVQQPPCFPLKIPPNDPRIKNQADCIPFFRSCPACPGSNITIRNQINALT SFVDASMVYGSEEPLARNLRNMSNQLGLLAVNQRFQDNGRALLPFDNLHDDPCLLTNR SARIPCFLAGDTRSSEMPELTSMHTLLLREHNRLATELKSLNPRWDGERLYQEARKIV GAMVQIITYRDYLPLVLGPTAMRKYLPTYRSYNDSVDPRIANVFTNAFRYGHTLIQPF MFRLDNRYQPMEPNPRVPLSRVFFASWRVVLEGGIDPILRGLMATPAKLNRQNQIAVD EIRERLFEQVMRIGLDLPALNMQRSRDHGLPGYNAWRRFCGLPQPETVGQLGTVLRNL KLARKLMEQYGTPNNIDIWMGGVSEPLKRKGRVGPLLACIIGTQFRKLRDGDRFWWEN EGVFSMQQRQALAQISLPRIICDNTGITTVSKNNIFMSNSYPRDFVNCSTLPALNLAS WREAS" misc_feature 287..661 /note="pot. pro-piece of myeloperoxidase" misc_feature 578..586 /note="pot. N-glycosylation site (AA 139-141)" misc_feature 662..997 /note="(AA 126-237)" /product="light subunit of myeloperoxidase" mat_peptide 662..2398 /note="(AA 126-699)" /product="mature myeloperoxidase" misc_feature 998..2398 /note="(AA 238-699)" /product="heavy subunit of myeloperoxidase" misc_feature 1130..1138 /note="pot. N-glycosylation site (AA 323-325)" misc_feature 1226..1234 /note="pot. N-glycosylation site (AA 355-357)" misc_feature 1334..1342 /note="pot. N-glycosylation site (AA 391-393)" misc_feature 2348..2356 /note="pot. N-glycosylation site (AA 729-731)" misc_feature 3172..3177 /note="polyadenylation signal" misc_feature 3207..3213 /note="seq. pot. involved in 3' end formation" BASE COUNT 647 a 950 c 934 g 682 t ORIGIN 1 agctgtggag gtggggtcct tggaagctgg atgacagcag ctggcaaggg gataagagag 61 cagtgagccc ctccctcaag gaggtctggc tttatccata gacagggccc tctgaggtgg 121 ggctgaggta caaaggggga ttgagcagcc caggagaaga gagatggggg ttcccttctt 181 ctcttctctc agatgcatgg tggacttagg accttgctgg gctgggggtc tcactgcaga 241 gatgaagctg cttctggccc tagcaggcgt cctggccatt ctggccacgc cccagccctc 301 tgaaggtgct gctccagctg tcctggggga ggtggacacc tcgttggtgc tgagctccat 361 ggaggaggcc aagcagctgg tggacaaggc ctacaaggag cggcgggaaa gcatcaagca 421 gcggcttcgc agcggctcag ccagccccat ggaactccta tcctacttca agcagccggt 481 ggcagccacc aggacggcgg tgagggccgc tgactacctg cacgtggctc tagacctgct 541 ggagaggaag ctgcggtccc tgtggcgaag gccattcaat gtcactgatg tgctgacgcc 601 cgcccagctg aatgtgttgt ccaagtcaag cggctgcgcc taccaggacg tgggggtgac 661 ttgcccggag caggacaaat accgcaccat caccgggatg tgcaacaaca gacgcagccc 721 cacgctgggg gcctccaacc gtgcctttgt gcgctggctg ccggcggagt atgaggacgg 781 cttctctctt ccctacggct ggacgcccgg ggtcaagcgc aacggcttcc cggtggctct 841 ggctcgcgcg gtctccaacg agatcgtgcg cttccccact gatcagctga ctccggacca 901 ggagcgctca ctcatgttca tgcaatgggg ccagctgttg gaccacgacc tcgacttcac 961 ccctgagccg gccgcccggg cctccttcgt cactggcgtc aactgcgaga ccagctgcgt 1021 tcagcagccg ccctgcttcc cgctcaagat cccgcccaat gacccccgca tcaagaacca 1081 agccgactgc atcccgttct tccgctcctg cccggcttgc cccgggagca acatcaccat 1141 ccgcaaccag atcaacgcgc tcacttcctt cgtggacgcc agcatggtgt acggcagcga 1201 ggagcccctg gccaggaacc tgcgcaacat gtccaaccag ctggggctgc tggccgtcaa 1261 ccagcgcttc caagacaacg gccgggccct gctgcccttt gacaacctgc acgatgaccc 1321 ctgtctcctc accaaccgct cagcgcgcat cccctgcttc ctggcagggg acacccgttc 1381 cagtgagatg cccgagctca cctccatgca caccctctta cttcgggagc acaaccggct 1441 ggccacagag ctcaagagcc tgaaccctag gtgggatggg gagaggctct accaggaagc 1501 ccggaagatc gtgggggcca tggtccagat catcacttac cgggactacc tgcccctggt 1561 gctggggcca acggccatga ggaagtacct gcccacgtac cgttcctaca atgactcagt 1621 ggacccacgc atcgccaacg tcttcaccaa tgccttccgc tacggccaca ccctcatcca 1681 acccttcatg ttccgcctgg acaatcggta ccagcccatg gaacccaacc cccgtgtccc 1741 cctcagcagg gtcttttttg cctcctggag ggtcgtgctg gaaggtggca ttgaccccat 1801 cctccggggc ctcatggcca cccctgccaa gctgaatcgt cagaaccaaa ttgcagtgga 1861 tgagatccgg gagcgattgt ttgagcaggt catgaggatt gggctggacc tgcctgctct 1921 gaacatgcag cgcagcaggg accacggcct cccaggatac aatgcctgga ggcgcttctg 1981 tgggctcccg cagcctgaaa ctgtgggcca gctgggcacg gtgctgagga acctgaaatt 2041 ggcgaggaaa ctgatggagc agtatggcac gcccaacaac atcgacatct ggatgggcgg 2101 cgtgtccgag cctctgaagc gcaaaggccg cgtgggccca ctcctcgcct gcatcatcgg 2161 tacccagttc aggaagctcc gggatggtga tcggttttgg tgggagaacg agggtgtgtt 2221 cagcatgcag cagcgacagg ccctggccca gatctcattg ccccggatca tctgcgacaa 2281 cacaggcatc accaccgtgt ctaagaacaa catcttcatg tccaactcat atccccggga 2341 ctttgtcaac tgcagtacac ttcctgcatt gaacctggct tcctggaggg aagcctccta 2401 gaggccaggt aagggggtgc agcagtgagg ggtatatctg ggctggccag ttggaaccac 2461 ggagatctcc ttgccctaga tgagcccagc cctgttctgg gtgcagctga gaaaatgagt 2521 gactagacgt tcatttgtgt gctcatgtat gtgcgaagta tataaattgg cttttcatgc 2581 gtgtgtgttg tctgaacatg gggagtgttt catgggttat gtgtatgtgc catttatgtg 2641 agtgtgtgtt tgtgctgatg agaatactga gtatgtggaa ggcagcagag cggactggtg 2701 aggagcacag ctcaggaact agacgtgcct gggttccaat cctggctctg tggcttgcta 2761 gctatgtgac cttgagcaaa ttaccctcct taaacaagag ttttcttcct tgtaaattac 2821 atctgtcatg gtttcttgga gggcccactt gtatcctctg gttcttcatt tattgagcac 2881 ctactacatg caaggcactg tactaggcgt gagaagcata tagaggcaag aaagagatac 2941 caagatgcca tctgtgtcct ggttagcaga gctggaccag tggtgccttg gagggataag 3001 ccagctgcag ctgggctgtg tggttgactt atgggcccag ccagccaggc tcaggccatg 3061 gctccccttt ttcttcctca ccctgatttc ttgcttattc actgaagttc tcctgaagag 3121 gaactgggcc tgttgccctt tctgtaccat tagtgctccc atgtttatga taataaaggc 3181 accgtgatgg ggacctccac tctgtctgtg tct // LOCUS HSMPOUHOX 2182 bp RNA PRI 27-JUN-1994 DEFINITION H.sapiens mPOU homeobox protein mRNA. ACCESSION Z21966 NID g437806 KEYWORDS mPOU homeobox protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2182) AUTHORS Wey,E., Lyons,G.E. and Schafer,B.W. TITLE mPOU: A novel human POU domain gene expressed in specific adult tissues JOURNAL Unpublished REFERENCE 2 (bases 1 to 2182) AUTHORS Wey,E. TITLE Direct Submission JOURNAL Submitted (19-MAR-1993) Eva Wey, Pediatrics, University of Zurich, Division of Clinical Chemistry, Steinwiesstrasse 75, Zurich, 8032, Switzerland REFERENCE 3 (bases 1 to 2182) AUTHORS Wey,E., Lyons,G.E. and Schafer,B.W. TITLE A human POU domain gene, mPOU, is expressed in developing brain and specific adult tissues JOURNAL Eur. J. Biochem. 220 (3), 753-762 (1994) MEDLINE 94192665 FEATURES Location/Qualifiers source 1..2182 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="skeletal muscle" /clone="mPOU" /chromosome="12" CDS 193..1098 /codon_start=1 /product="mPOU homeobox protein" /db_xref="PID:g437807" /translation="MPGISSQILTNAQGQVIGTLPWVVNSASVAAPAPAQSLQVQAVT PQLLLNAQGQVIATLASSPLPPPVAVRKPSTPESPAKSEVQPIQPTPTVPQPAVVIAS PAPAAKPSASAPIPITCSETPTVSQLVSKPHTPSLDEDGINLEEIREFAKNFKIRRLS LGLTQTQVGQALTATEGPAYSQSAICRFEKLDITPKSAQKLKPVLEKWLNEAELRNQE GQQNLMEFVGGEPSKKRKRRTSFTPQAIEALNAYFEKNPLPTGQEITEIAKELNYDRE VVRVWFCNRRQTLKNTSKLNVFQIP" polyA_signal 2090..2095 /note="alternate polyA_signal" polyA_signal 2098..2103 /note="alternate polyA_signal" polyA_signal 2143..2148 polyA_site 2182 BASE COUNT 517 a 658 c 530 g 477 t ORIGIN 1 gaagcccaca ttcacatcta cccaccccaa tgccttctgt cctcttccct ctgcccgggg 61 ttcttctcac cccttggctt ctgagcaagg ctcttcctcc acccacagat cagtgctgct 121 tccctcgggg gacagaccca gatcctgggg tccctcacta cagctccagt cattaccagc 181 gccattccca gcatgccagg gatcagcagt cagatcctca ccaatgctca gggacaggtt 241 attggaaccc ttccatgggt agtgaactca gctagtgtgg cggccccagc accagcccaa 301 agcctgcagg tccaggccgt gaccccccag ctgttgttga acgcccaggg ccaggtgatt 361 gcgaccctgg ctagcagccc cctgcctcca cctgtggctg tccggaagcc aagcacacct 421 gagtcccctg ctaagagtga ggtgcagccc atccagccca caccaaccgt gccccagcct 481 gctgtggtca ttgccagccc agctccagcc gccaagccat ctgcctctgc tcctatccca 541 attacctgct cagagacccc caccgtcagc cagttggtgt ccaagccaca tactccaagt 601 ctggatgagg atgggatcaa cttagaagag atccgggagt ttgccaagaa ctttaagatc 661 cggcggctct cgctgggcct tacacagacc caggtgggtc aggctctgac tgcaacggaa 721 ggtccagcct acagccagtc agccatctgc cggttcgaga agctagacat cacacccaag 781 agtgcccaga agctaaagcc ggtgctggaa aagtggctaa acgaagctga actgcggaac 841 caggaaggcc agcagaacct gatggagttt gtgggaggcg agccctccaa gaaacgcaag 901 cgccgcacct ccttcacccc ccaggccata gaggctctca atgcctattt tgagaagaac 961 ccactgccca caggccagga gatcactgaa attgctaagg agctcaacta cgaccgtgag 1021 gtagtgcggg tctggttctg caatcggcgc cagacgctca agaacaccag caagctgaac 1081 gtctttcaga tcccttaggg ctcagccctg gccctgtgtt ctagcacttt gtccatttcc 1141 cgtggcatcc ggctgcagcc actgccatga cagcacctgt cattttgcca cgtgcagctg 1201 tgctcacccc aggtcatcag actccaccgt gtgcatgtgc atcaatgccc ctcttttctc 1261 ccacacatct cacatcatgg ggaggccaga gggggccaca cgagagctcc aggctctggg 1321 ctggtcactc cgaagaagag gatttgtgac gtcacttaga gaagcacctt gctagcatgg 1381 tttctgaagg gtgaattctg gtggggaacc agaaactccc tgtctttggg gcagggctaa 1441 agcagctcct aaggaccact ggccattagc tcttgctttt gatggcattc tctttccacc 1501 ttgtcttctc ctttgctcct ctgtgttagt gtggcaggta tgacaactca tccagtggaa 1561 acacagcctc acactgccct tccgcccccc acactttgcc tgcaggtgca ccgaaaggac 1621 ttgggggata aaattcaaaa aagtgtgatg tgctgctcag aaggtcagac tccatgtctg 1681 ccttggcctc aaggtcagaa ggttcccaaa cccctggggc tggaacatgg gatctcctct 1741 tccacctctt cctggttcct ttgcggggaa aattgcacta aaacagaacc ttttcttaat 1801 ccatgttgga aggaagcaac agtgaactct acctgttctg gagttctcct gggtctgcag 1861 aaggttggga atttagaaaa taaggctgtt ctttcatatt ttaatttaat ctctgtcaat 1921 ggccatccct cccacaaaaa aacgtgggtt aagagaactt gcagactgga tatgcaagca 1981 aacgggcaac tctggagaaa aataaggaaa ggaatgctga ctttctcttt ctttctcttg 2041 tccccacacc cattcccaac ccaatactgg ggccttctca aaaggagcaa attaaacaat 2101 aaaccagaca gcaaggccct gggggaaagg acaacatcct gaaataaatg atggagccca 2161 ggaaggtctc ttgtggaagt tg // LOCUS HSMPP6 1099 bp RNA PRI 08-JAN-1997 DEFINITION H.sapiens mRNA for M-phase phosphoprotein, mpp6. ACCESSION X98263 NID g1770461 KEYWORDS M phase phosphoprotein; MPP gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1099) AUTHORS Matsumoto-Taniura,N., Pirollet,F., Monroe,R., Gerace,L. and Westendorf,J.M. TITLE Identification of novel M phase phosphoproteins by expression cloning JOURNAL Mol. Biol. Cell 7 (9), 1455-1469 (1996) MEDLINE 97039687 REFERENCE 2 (bases 1 to 1099) AUTHORS Westendorf,J.M. TITLE Direct Submission JOURNAL Submitted (03-JUN-1996) J.M. Westendorf, INSERM U366, DBMS/CS-CENG, 17 rue des Martyrs, F- 38054 Grenoble Cedex 9, FRANCE FEATURES Location/Qualifiers source 1..1099 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /dev_stage="19 years old" /tissue_type="blood" /cell_type="lymphoblast-like" /cell_line="MOLT-4" /clone_lib="lambda gt11" /clone="6" /clone="13" gene 33..515 /gene="mpp6" CDS 33..515 /gene="mpp6" /note="putative" /codon_start=1 /product="M-phase phosphoprotein 6" /db_xref="PID:e248489" /db_xref="PID:g1770462" /translation="MAAERKTKLSKNLLRMKFMQRGLDSETKKQLEEEEKKIISEEHW YLDLPELKEKESFIIEEQSFLLCEDLLYGRMSFRGFNPEVEKLMLQMNAKHKAEEVED ETVELDVSDEEMARRYETLVGTIGKKFARKRDHANYEEDENGDITPIKAKKMFLKPQD " polyA_site 977 /note="clone 6" polyA_site 1079 /note="clone 13" BASE COUNT 371 a 170 c 241 g 317 t ORIGIN 1 cggccggagt gcggcgctgg gcggaagcta ccatggcggc cgagagaaag acaaagttgt 61 ccaagaatct gctgcgcatg aagtttatgc aaaggggact ggactcagaa accaagaaac 121 aactagaaga agaagaaaag aaaatcatta gtgaagagca ctggtacttg gatttgccag 181 agcttaaaga gaaagagagt ttcataatag aagagcagag tttcttacta tgtgaagatc 241 ttctctatgg aagaatgtca ttcagaggat ttaatcctga ggttgagaaa ttgatgcttc 301 agatgaatgc taagcacaaa gcagaagaag ttgaagatga aacagtagag cttgatgtgt 361 cagatgaaga gatggctaga agatatgaga ccttggtggg gacaattggg aaaaagtttg 421 ccagaaagag agaccatgcc aattatgaag aagatgaaaa tggagacatc acaccaatta 481 aagcaaagaa gatgttctta aagccccagg attaagatgg atgccttaag cgatggccca 541 ggggtgcttg gtggaagtca gcagggcatc tggagctcat cccaatggtg tctctatagt 601 tattaatact gtaacgttta cttgtaaaga gattatcatt ttagaaacat gctgtttttg 661 aaacagatgt gtgatggatg ttgtacatcc tttgcttctt ggtattcatt cagagtggat 721 ttttagcccc tgatctacaa atgtacattg ttacagggct gcttcctaaa gatttttttt 781 acctcaggtt tctcttaata tagttctcca gtcactgacc ttgaattgac ttacataaac 841 tactgccaat gtttaaattg cccttatgtt tatgtttatt atgtcaagcc aattcgtaca 901 tacaatttgg aatcaagtgt cataagaatt tattatataa atttatcaag aataaaaatg 961 cctctccagc cttaagtatt tacatgctcc caggtcattg tcagtttatg gtattatgtt 1021 gttttattta aagcattgaa ttgatagaaa aatttgctct gtaataaaaa tctactttca 1081 aaaaaaaaaa aaaaaaaaa // LOCUS HSMPZE 959 bp RNA PRI 26-APR-1993 DEFINITION H.sapiens mRNA for macropain subunit zeta. ACCESSION X61970 NID g296739 KEYWORDS macropain. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 959) AUTHORS DeMartino,G.N., Orth,K., McCullough,M.L., Lee,L.W., Munn,T.Z., Moomaw,C.R., Dawson,P.A. and Slaughter,C.A. TITLE The primary structures of four subunits of the human, high-molecular-weight proteinase, macropain (proteasome), are distinct but homologous JOURNAL Biochim. Biophys. Acta 1079 (1), 29-38 (1991) MEDLINE 91363412 FEATURES Location/Qualifiers source 1..959 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 22..747 /codon_start=1 /product="macropain subunit zeta" /db_xref="PID:g296740" /db_xref="SWISS-PROT:P28066" /translation="MFLTRSEYDRGVNTFSPEGRLFQVEYDIEAIKLGSTAIGIQTSE GVCLAVEKRITSPLMEPSSIEKIVEIDAHIGCAMSGLIADAKTLIDKARVETQNHWFT YNETMTVESVTQAVSNLALQFGEEDADPGAMSRPFGVALLFGGVDEKGPQLFHMDPSG TFVQCDARAIGSASEGAQSSLQELYHKSMTLKEAIKSSLIILKQVMEEKLNATNIELA TVQPGQNFHMFTKEELEEVIKDI" BASE COUNT 310 a 189 c 210 g 250 t ORIGIN 1 ctgcctcctc ctaccctcgc catgtttctt acccggtctg agtacgacag gggcgtgaat 61 actttttctc ccgaaggaag attatttcaa gtggaatatg acattgaggc tatcaagctt 121 ggttctacag ccattgggat ccagacatca gagggtgtgt gcctagctgt ggagaagaga 181 attacttccc cactgatgga gcccagcagc attgagaaaa ttgtagagat tgatgctcac 241 ataggttgtg ccatgagtgg gctaattgct gatgctaaga ctttaattga taaagccaga 301 gtggagacac agaaccactg gttcacctac aatgagacaa tgacagtgga gagtgtgacc 361 caagctgtgt ccaatctggc tttgcagttt ggagaagaag atgcagatcc aggtgccatg 421 tctcgtccct ttggagtagc attattattt ggaggagttg atgagaaagg accccagctg 481 tttcatatgg acccatctgg gacctttgta cagtgtgatg ctcgagcaat tggctctgct 541 tcagagggtg cccagagctc cttgcaagaa ctttaccaca agtctatgac tttgaaagaa 601 gccatcaagt cttcactcat catcctcaaa caagtaatgg aggagaagct gaatgcaaca 661 aacattgagc tagccacagt gcagcctggc cagaatttcc acatgttcac aaaggaagaa 721 cttgaagagg ttatcaagga catttaagga atcctgatcc tcagaacttc tctgggacaa 781 tttcagttct aataatgtcc ttaaatttta tttccagctc ctgttccttg gaaaatctcc 841 attgtatgtg cattttttaa atgatgtctg tacataaagg cagttctgaa ataaagaaaa 901 ttttaaaata aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaa // LOCUS HSMRAB11 850 bp RNA PRI 22-JUN-1994 DEFINITION H.sapiens mRNA for Rab11 gene. ACCESSION X56740 NID g505540 KEYWORDS rab-related GTP-binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 850) AUTHORS Zahraoui,A., Joberty,G. and Tavitian,A. TITLE Coding sequences of human Rab8 and Rab11 cDNAs JOURNAL Unpublished REFERENCE 2 (bases 1 to 850) AUTHORS Zahraoui,A. TITLE Direct Submission JOURNAL Submitted (26-NOV-1990) A. Zahraoui, INSERM-U 248, 10 AVENUE DE VERDUN, 750-10 PARIS, FRANCE FEATURES Location/Qualifiers source 1..850 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Human pheochromocytoma cDNA library" gene 54..704 /gene="H rab11" CDS 54..704 /gene="H rab11" /codon_start=1 /product="H rab11 small GTP binding protein" /db_xref="PID:g505541" /db_xref="SWISS-PROT:P24410" /translation="MGTRDDEYDYLFKVVLIGDSGVGKSNLLSRFTRNEFNLESKSTI GVEFATRSIQVDGKTIKAQIWDTAGQERYRAITSAYYRGAVGALLVYDIAKHLTYENV ERWLKELRDHADSNIVIMLVGNKSDLRHLRAVPTDEARAFAEKNGLSFIETSALDSTN VEAAFQTILTEIYRIVSQKQMSDRRENDMSPSNNVVPIHVPPTTENKPKVQCCQNI" BASE COUNT 248 a 177 c 178 g 247 t ORIGIN 1 gaattcccac agataccact gctgctcccg ccctttcgct cctcggccgc gcaatgggca 61 cccgcgacga cgagtacgac tacctcttta aagttgtcct tattggagat tctggtgttg 121 gaaagagtaa tctcctgtct cgatttactc gaaatgagtt taatctggaa agcaagagca 181 ccattggagt agagtttgca acaagaagca tccaggttga tggaaaaaca ataaaggcac 241 agatatggga cacagcaggg caagagcgat atcgagctat aacatcagca tattatcgtg 301 gagctgtagg tgccttattg gtttatgaca ttgctaaaca tctcacatat gaaaatgtag 361 agcgatggct gaaagaactg agagatcatg ctgatagtaa cattgttatc atgcttgtgg 421 gcaataagag tgatctacgt catctcaggg cagttcctac agatgaagca agagcttttg 481 cagaaaagaa tggtttgtca ttcattgaaa cttcggccct agactctaca aatgtagaag 541 ctgcttttca gacaatttta acagagattt accgcattgt ttctcagaag caaatgtcag 601 acagacgcga aaatgacatg tctccaagca acaatgtggt tcctattcat gttccaccaa 661 ccactgaaaa caagccaaag gtgcagtgct gtcagaacat ctaaggcatt tctcttctcc 721 cctagaaggc tgtgtatagt ccatttccca ggtctcacat ttaaatatat ttgtaattct 781 tgtgtcactt ttgtgtttta ttacttcata cttatgaatt tttccatgtc ctaagtcttt 841 tgattttgat // LOCUS HSMRAB8 660 bp RNA PRI 08-MAR-1994 DEFINITION H.sapiens mRNA for rab8 gene. ACCESSION X56741 NID g452317 KEYWORDS rab-related GTP-binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 660) AUTHORS Zahraoui,A. TITLE Direct Submission JOURNAL Submitted (26-NOV-1990) A. Zahraoui, INSERM-U 248, 10 AVENUE DE VERDUN, 750-10 PARIS, FRANCE REFERENCE 2 (bases 1 to 660) AUTHORS Zahraoui,A., Joberty,G., Arpin,M., Fontaine,J.J., Hellio,R., Tavitian,A. and Louvard,D. TITLE A small rab GTPase is distributed in cytoplasmic vesicles in non polarized cells but colocalizes with the tight junction marker ZO-1 in polarized epithelial cells JOURNAL J. Cell Biol. 124 (1-2), 101-115 (1994) MEDLINE 94124602 FEATURES Location/Qualifiers source 1..660 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Human pheochromocytoma cDNA library" gene 11..634 /gene="rab8" CDS 11..634 /gene="rab8" /codon_start=1 /product="rab8 small GTP binding protein" /db_xref="PID:g452318" /db_xref="SWISS-PROT:P24407" /translation="MAKTYDYLFKLLLIGDSGVGKTCVLFRFSEDAFNSTFISTIGID FKIRTIELDGKRIKLQIWDTAGQERFRTITTAYYRGAMGIMLVYDITNEKSFDNIRNW IRNIEEHASADVEKMILGNKCDVNDKRQVSKERGEKLALDYGIKFMETSAKANINVEN AFFTLARDIKAKMDKKLEGNSPQGSNQGVKITPDQQKRSSFFRCVLL" BASE COUNT 197 a 159 c 172 g 132 t ORIGIN 1 gaattccaat atggcgaaga cctacgatta cctgttcaag ctgctgctga tcggggactc 61 gggggtgggg aagacctgtg tcctgttccg cttctccgag gacgccttca actccacttt 121 tatctccacc ataggaattg actttaaaat taggaccata gagctcgatg gcaagagaat 181 taaactgcag atatgggaca cagccggtca ggaacggttt cggacgatca caacggccta 241 ctacaggggt gcaatgggca tcatgctggt ctacgacatc accaacgaga agtccttcga 301 caacatccgg aactggattc gcaacattga ggagcacgcc tctgcagacg tcgaaaagat 361 gatactcggg aacaagtgtg atgtgaatga caagagacaa gtttccaagg aacggggaga 421 aaagctggcc ctcgactatg gaatcaagtt catggagacc agcgcgaagg ccaacatcaa 481 tgtggaaaat gcatttttca ctctcgccag agatatcaaa gcaaaaatgg acaaaaaatt 541 ggaaggcaac agcccccagg ggagcaacca gggagtcaaa atcacaccgg accagcagaa 601 gaggagcagc tttttccgat gtgttcttct gtgaggaaca ccgccttact ctgagcctcg // LOCUS HSMRACP5 1359 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for acid phosphatase type 5 (EC 3.1.3.2). ACCESSION X14618 NID g34733 KEYWORDS acid phosphatase; acid phosphatase type 5; ACP5 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1359) AUTHORS Lord,D.K. TITLE Direct Submission JOURNAL Submitted (21-FEB-1989) Lord D.K., Royal Postgraduate Medical School, Hammersmith Hospital, Du Cane Road, London W12 0HS REFERENCE 2 (bases 1 to 1359) AUTHORS Lord,D.K., Cross,N.C., Bevilacqua,M.A., Rider,S.H., Gorman,P.A., Groves,A.V., Moss,D.W., Sheer,D. and Cox,T.M. TITLE Type 5 acid phosphatase. Sequence, expression and chromosomal localization of a differentiation-associated protein of the human macrophage JOURNAL Eur. J. Biochem. 189 (2), 287-293 (1990) MEDLINE 90249371 REMARK Erratum:[Eur J Biochem 1990 Aug 17;191(3):775]] COMMENT See also acc# J04430. FEATURES Location/Qualifiers source 1..1359 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="lambda gt11" CDS 36..1013 /note="acid phosphatase type 5 (AA 1 - 325)" /codon_start=1 /db_xref="PID:g34734" /db_xref="SWISS-PROT:P13686" /translation="MDMWTALLILQALLLPSLADGATPALRFVAVGDWGGVPNAPFHT AREMANAKEIARTVQILGADFILSLGDNFYFTGVQDINDKRFQETFEDVFSDRSLRKV PWYVLAGNHDHLGNVSAQIAYSKISKRWNFPSPFYRLHFKIPQTNVSVAIFMLDTVTL CGNSDDFLSQQPERPRDVKLARTQLSWLKKQLAAAREDYVLVAGHYPVWSIAEHGPTH CLVKQLRPLLATYGVTAYLCGHDHNLQYLQDENGVGYVLSGAGNFMDPSKRHQRKVPN GYLRFHYGTEDSLGGFAYVEISSKEMTVTYIEASGKSLFKTRLPRRARP" misc_feature 1335..1340 /note="pot. polyA signal" polyA_site 1359 /note="polyA site" BASE COUNT 282 a 403 c 383 g 291 t ORIGIN 1 agagcctccg gtgactggcc tgtgtctccc cctggatgga catgtggacg gcgctgctca 61 tcctgcaagc cttgttgcta ccctccctgg ctgatggtgc cacccctgcc ctgcgctttg 121 tagccgtggg tgactgggga ggggtcccca atgccccatt ccacacggcc cgggaaatgg 181 ccaatgccaa ggagatcgct cggactgtgc agatcctggg tgcagacttc atcctgtctc 241 taggggacaa tttttacttc actggtgtgc aagacatcaa tgacaagagg ttccaggaga 301 cctttgagga cgtattctct gaccgctccc ttcgcaaagt gccctggtac gtgctagccg 361 gaaaccatga ccaccttggc aatgtctctg cccagattgc atactctaag atctccaagc 421 gctggaactt ccccagccct ttctaccgcc tgcacttcaa gatcccacag accaatgtgt 481 ctgtggccat ttttatgctg gacacagtga cactatgtgg caactcagat gacttcctca 541 gccagcagcc tgagaggccc cgagacgtga agctggcccg cacacagctg tcctggctca 601 agaaacagct ggcggcggcc agggaggact acgtgctggt ggctggccac taccccgtgt 661 ggtccatagc cgagcacggg cctacccact gcctggtcaa gcagctacgg ccactgctgg 721 ccacatacgg ggtcactgcc tacctgtgcg gccacgatca caatctgcag tacctgcaag 781 atgagaatgg cgtgggctac gtgctgagtg gggctgggaa tttcatggac ccctcaaagc 841 ggcaccagcg caaggtcccc aacggctatc tgcgcttcca ctatgggact gaagactcac 901 tgggtggctt tgcctatgtg gagatcagct ccaaagagat gactgtcact tacatcgagg 961 cctcgggcaa gtccctcttt aagaccaggc tgccgaggcg agccaggccc tgaactccca 1021 tgactgccca gctctgaggc ccgatctcca ctgttgggtg ggtggcctgc cgggaccctg 1081 ctcacaggca ggcttttcct ccaacctgtg gcgctgcagc agggcaggaa ggggaaacac 1141 agctgatgaa ctgtggtgcc acatgaccct tgtggcacag atgcccacgt atgtgaaaca 1201 cacatggaca tgtgtcccag ccacagtgtt atgctctgtg gctggctcac ctttgctgag 1261 ttccggggtg caatggggga gggagggagg gaaagcttcc tcctaaatca agcatctttc 1321 tgttactgat gttcaataaa agaatagttg ccaaggctg // LOCUS HSMRINTX 1701 bp RNA PRI 27-APR-1995 DEFINITION H.sapiens mRNA for mediator of receptor-induced toxicity. ACCESSION X84709 NID g791037 KEYWORDS MORT1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1701) AUTHORS Boldin,M.P., Varfolomeev,E.E., Pancer,Z., Mett,I.L., Camonis,J.H. and Wallach,D. TITLE A novel protein that interacts with the death domain of Fas/APO1 contains a sequence motif related to the death domain JOURNAL J. Biol. Chem. 270 (14), 7795-7798 (1995) MEDLINE 95229578 REFERENCE 2 (bases 1 to 1701) AUTHORS Wallach,D. TITLE Direct Submission JOURNAL Submitted (10-FEB-1995) D. Wallach, The Weizmann Institute, Dept of Membrane Research & Biophysics, Rehovot 76100, ISRAEL FEATURES Location/Qualifiers source 1..1701 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="human monocyte, HeLa" gene 145..771 /gene="MORT1" CDS 145..771 /gene="MORT1" /note="putative start point" /codon_start=1 /product="mediator of receptor induced toxicity" /db_xref="PID:g791038" /translation="MDPFLVLLHSVSSSLSSSELTELKFLCLGRVVKRKLERVQSGLD LFSMLLEQNDLEPGHTELLRELLASLRRHDLLRRVDDFEAGAAAGAAPGEEDLCAAFN VICDNVGKDWRRLARQLKVSDTKIDSIEDRYPRNLTERVRESLRIWKNTEKENATVAH LVGALRSCQMNLVADLVQEVQQARDLQNRSGAMSPMSWNSDASTSEAS" BASE COUNT 382 a 459 c 517 g 343 t ORIGIN 1 gtgaatcagg caccggagtg caggttcggg ggtggaatcc ttgggccgct gggcaagcgg 61 cgagacctgg ccagggccag cgagccgagg acagagggcg cgcggagggc cgggccgcag 121 ccccggccgc ttgcagaccc cgccatggac ccgttcctgg tgctgctgca ctcggtgtcg 181 tccagcctgt cgagcagcga gctgaccgag ctcaagttcc tatgcctcgg gcgcgtggtc 241 aagcgcaagc tggagcgcgt gcagagcggc ctagacctct tctccatgct gctggagcag 301 aacgacctgg agcccgggca caccgagctc ctgcgcgagc tgctcgcctc cctgcggcgc 361 cacgacctgc tgcggcgcgt cgacgacttc gaggcggggg cggcggccgg ggccgcgcct 421 ggggaagaag acctgtgtgc agcatttaac gtcatatgtg ataatgtggg gaaagattgg 481 agaaggctgg ctcgtcagct caaagtctca gacaccaaga tcgacagcat cgaggacaga 541 tacccccgca acctgacaga gcgtgtgcgg gagtcactga gaatctggaa gaacacagag 601 aaggagaacg caacagtggc ccacctggtg ggggctctca ggtcctgcca gatgaacctg 661 gtggctgacc tggtacaaga ggttcagcag gcccgtgacc tccagaacag gagtggggcc 721 atgtccccga tgtcatggaa ctcagacgca tctacctccg aagcgtcctg atgggccgct 781 gctttgcgct ggtggaccac aggcatctac acagcctgga ctttggttct ctccaggaag 841 gtagcccagc actgtgaaga cccagcagga agccaggctg agtgagccac agaccacctg 901 cttctgaact caagctgcgt ttattaatgc ctctcccgca ccaggccggg cttgggccct 961 gcacagatat ttccatttct tcctcactat gacactgagc aagatcttgt ctccactaaa 1021 tgagctcctg cgggagtagt tggaaagttg gaaccgtgtc cagcacagaa ggaatctgtg 1081 cagatgagca gtcacactgt tactccacag cggaggagac cagctcagag gcccaggaat 1141 cggagcgaag cagagaggtg gagaactggg atttgaaccc ccgccatcct tcaccagagc 1201 ccatgctcaa ccactgtggc gttctgctgc ccctgcagtt ggcagaaagg atgtttttgt 1261 cccatttcct tggaggccac cgggacagac ctggacacta gggtcaggcg gggtgctgtg 1321 gtggggagag gcatggctgg ggtgggggtg gggagacctg gttggccgtg gtccagctct 1381 tggcccctgt gtgagttgag tctcctctct gagactgcta agtaggggca gtgatggttg 1441 ccaggacgaa ttgagataat atctgtgagg tgctgatgag tgattgacac acagcactct 1501 ctaaatcttc cttgtgagga ttatgggtcc tgcaattcta cagtttctta ctgttttgta 1561 tcaaaatcac tatctttctg ataacagaat tgccaaggca gcgggatctc gtatctttaa 1621 aaagcagtcc tcttattcct aaggtaatcc tattaaaaca cagctttaca acttccatat 1681 tacaaaaaaa aaaaaaaaaa a // LOCUS HSMRL3R 1634 bp RNA PRI 12-SEP-1993 DEFINITION Human MRL3 mRNA for ribosomal protein L3 homologue ( MRL3 = mammalian ribosome L3 ). ACCESSION X06323 NID g34753 KEYWORDS ribosomal protein; ribosomal protein L3. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1634) AUTHORS Ou,J.H., Yen,T.S., Wang,Y.F., Kam,W.K. and Rutter,W.J. TITLE Cloning and characterization of a human ribosomal protein gene with enhanced expression in fetal and neoplastic cells JOURNAL Nucleic Acids Res. 15 (21), 8919-8934 (1987) MEDLINE 88067705 REMARK Erratum:[Nucleic Acids Res 1988 May 11;16(9):4196]] COMMENT MRL3 expression is enhanced in HCC, neoplastic and fetal liver cells Data kindly reviewed (08-NOV-1989) by OU J.-H. FEATURES Location/Qualifiers source 1..1634 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver." /cell_line="PLC/PRF/5 (Alexander)" /clone_lib="lambda gt10" /clone="pGT1 and pGT2" CDS 77..1123 /note="put. ribosomal protein L3 (AA 1 - 348)" /codon_start=1 /db_xref="PID:g34754" /db_xref="SWISS-PROT:P09001" /translation="MPGWRLLTQVGAQVLGRLGDGLGAALGPGNRTHIWLFVRGLHGK SGTWWDEHLSEENVPFIKQLVSDEDKAQLASKLCPLKDEPWPIHPWEPGSFRVGLIAL KLGMMPLWTKDGQKHVVTLLQVQDCHVLKYTSKENCNGKMATLSVGGKTVSRFRKATS ILEFYRELGLPPKQTVKIFNITDNAAIKPGTPLYAAHFRPGQYVDVTAKTIGKGFQGV MKRWGFKGQPATHGQTKTHRRPGAVATGDIGRVWPGTKMPGKMGNIYRTEYGLKVWRI NTKHNIIYVNGSVPGHKNCLVKVKDSKLPAYKDLGKNLPFPTYFPDGDEEELPEDLYD ENVCQPGAPSITFA" polyA_site 1339 /note="polyA site of clone pGT1" polyA_site 1634 /note="polyA site of clone pGT2" BASE COUNT 510 a 305 c 371 g 448 t ORIGIN 1 ggtggcgtgg ggactccctg aaagcagagc ggcagggcgc ccggaagtcg tgagtcgagt 61 cttcccgggc taatccatgc cgggttggag gctgctgacg caggtcggcg cccaggtgct 121 gggtcgactc ggggacggcc tgggtgctgc cctgggcccg gggaacagaa cacacatctg 181 gctttttgtt agaggtcttc atggaaagag tggtacatgg tgggatgagc atctttctga 241 agaaaatgtc ccattcatta agcagttggt ctctgatgaa gataaagccc aattagcaag 301 taaactgtgt cctctgaaag atgaaccatg gcctatacat ccttgggaac caggttcctt 361 tagagttggt cttattgcct tgaagctggg catgatgcct ttatggacca aggatggtca 421 aaagcatgtg gtcacattac ttcaggtaca agactgtcat gtcttaaaat atacgtcaaa 481 ggaaaactgt aatggaaaaa tggcaaccct gtctgtagga ggaaaaactg tatcacgttt 541 tcgtaaagct acatccatat tggaatttta ccgggaactt ggattgccgc cgaaacagac 601 agttaaaatc tttaatataa cagataatgc tgcaattaaa ccaggcactc ctctttatgc 661 tgctcacttt cgtccaggac agtatgtgga tgtcacagcc aaaactattg gtaaaggttt 721 tcaaggtgtc atgaaaagat ggggatttaa aggccagcct gctacgcatg gtcaaacgaa 781 aacccacagg agacctggag ctgttgcaac tggtgatatt ggcagagtct ggcctggaac 841 taaaatgcct ggaaaaatgg gaaacatata caggacagaa tatggactga aagtgtggag 901 aataaacaca aagcacaaca taatctatgt aaatggctct gtacctggac ataaaaattg 961 cttagtaaag gtcaaagatt ctaaactgcc tgcatataag gatctcggta aaaatctacc 1021 attccctaca tattttcctg atggagatga agaggaactg ccagaagatt tgtatgatga 1081 aaacgtgtgt cagcccggtg cgccttctat tacatttgcc taacatcttt ggacgtggca 1141 gaaccttaca tattctgtga gcttcgatga gccagagtga tatcataacc accagaaatc 1201 atactctcct ttcttagtca caacaaaatc acacatgtca tctttgtcaa gggcataaat 1261 atatcattca tacccccatt aaattttgtt agaaaaatta ccacattaaa tatatgagtt 1321 aagtagattg gatttgctga aattggtgtt gggcatatta gcaaaatatt cttaatttgt 1381 ggactcgatt cttttttact acatatttcc caagttatct taagatgtct gtaaatttaa 1441 cttttattaa agttttgtca atctttgtga aatagtggtt gtggaacagt agaaaaccat 1501 atggggacta tagtgcaacc tatttgggta aagaaaccat ttgctaaaat ggagaaagta 1561 aatagatttt tatttaaatt acagaaacat gttaaaggcc ggacaaagga aagacaataa 1621 aatcataaat tatc // LOCUS HSMRLCM 944 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for myosin regulatory light chain. ACCESSION X54304 NID g34755 KEYWORDS myosin; myosin regulatory light chain. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 944) AUTHORS Grant,J.W. TITLE Direct Submission JOURNAL Submitted (08-AUG-1990) Grant J.W., Washington University School of Medicine, Dept. of Pediatrics, 400 S Kingshighway Blvd., St Louis, MO 63110, USA REFERENCE 2 (bases 1 to 944) AUTHORS Grant,J.W., Zhong,R.Q., McEwen,P.M. and Church,S.L. TITLE Human nonsarcomeric 20,000 Da myosin regulatory light chain cDNA JOURNAL Nucleic Acids Res. 18 (19), 5892 (1990) MEDLINE 91016942 COMMENT *source; tissue=placenta; library=lambda-ZAP; Data kindly reviewed (12-NOV-1990) by Grant J.W. FEATURES Location/Qualifiers source 1..944 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 115..630 /note="myosin regulatory light chain" /codon_start=1 /db_xref="PID:g34756" /db_xref="SWISS-PROT:P19105" /translation="MSSKRTKTKTKKRPQRATSNVFAMFDQSQIQEFKEAFNMIDQNR DGFIDKEDLHDMLASLGKNPTDEYLDAMMNEAPGPINFTMFLTMFGEKLNGTDPEDVI RNAFACFDEEATGTIQEDYLRELLTTMGDRFTDEEVDELYREAPIDKKGNFNYIEFTR ILKHGAKDKDD" misc_feature 929..936 /note="pot. polyA signal" BASE COUNT 270 a 194 c 216 g 264 t ORIGIN 1 gcggccagcg cgtggttttt agcggctctc tgggtagcag ggtggtgtga tagcggccga 61 gggctcggaa gggtgctcgg attctcgtag ctgtgccggg acttaaccac caccatgtcg 121 agcaaaagaa caaagaccaa gaccaagaag cgccctcagc gtgcaacatc caatgtgttt 181 gctatgtttg accagtcaca gattcaggag ttcaaagagg ccttcaacat gattgatcag 241 aacagagatg gtttcatcga caaggaagat ttgcatgata tgcttgcttc attggggaag 301 aatccaactg atgagtatct agatgccatg atgaatgagg ctccaggccc catcaatttc 361 accatgttcc tcaccatgtt tggtgagaag ttaaatggca cagatcctga agatgtcatc 421 agaaatgcct ttgcttgctt tgatgaagaa gcaactggca ccatacagga agattacttg 481 agagagctgc tgacaaccat gggggatcgg tttacagatg aggaagtgga tgagctgtac 541 agagaagcac ctattgataa aaaggggaat ttcaattaca tcgagttcac acgcatcctg 601 aaacatggag ccaaagacaa agatgactga aataacttca aattccagcc aacgtccttg 661 ttgcactttg ggtattctga gattttctct tgccattccc ttaggcttta gcagctttgc 721 atttcctgtt gtatttattc tcagccattt tgggcatatg tatctttata atcagactgg 781 aaacgggact ttctattaat atcattttca gaataaaaaa taggataatt taacctacca 841 gcccttctcc cccaataact gtgggtctat acagagtcaa tatatttttt cagagaaagt 901 tagttcggct cgattttttc tgaatcataa ttaaacttta ttgc // LOCUS HSMRNAEB 5682 bp DNA PRI 30-NOV-1997 DEFINITION H.sapiens genomic DNA, integration site for Epstein-Barr virus. ACCESSION X76785 NID g2193877 KEYWORDS M RNA. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5682) AUTHORS Gualandi,G., Frezza,D., Scotto d'Abusco,A., Bianchi,E., Gargano,S., Giorgi,S., Fruscalzo,A. and Calef,E. TITLE Integration of an Epstein-Barr virus episome 3' into the gene encoding immunoglobulin heavy-chain alpha 1 in a lymphoblastoid cell line JOURNAL Gene 166 (2), 221-226 (1995) MEDLINE 96125193 REFERENCE 2 (bases 1 to 5682) AUTHORS Gualandi,G. TITLE Direct Submission JOURNAL Submitted (23-NOV-1993) G. Gualandi, D.A.B.A.C. Universita' della Tuscia, Viterbo, ITALY REMARK revised by [3] MAT REFERENCE 3 (bases 1 to 5682) AUTHORS Gualandi,G. TITLE Direct Submission JOURNAL Submitted (14-APR-1994) G. Gualandi, D.A.B.A.C. Universita' della Tuscia, Viterbo, ITALY REMARK revised by [4] REFERENCE 4 (bases 1 to 5682) AUTHORS Frezza,D. TITLE Direct Submission JOURNAL Submitted (24-FEB-1997) D.Frezza, Dept.of Biology, Universita' di Tor Vergata, Viale della Ricerca scientifica 00133 Roma, Italia REMARK revised by [5] REFERENCE 5 (bases 1 to 5682) AUTHORS Frezza,D. TITLE Direct Submission JOURNAL Submitted (11-JUN-1997) D.Frezza, Dept.of Biology, Universita' di Tor Vergata, Viale della Ricerca scientifica 00133 Roma, Italia COMMENT Related sequence: S42329 Related to X54104. FEATURES Location/Qualifiers source 1..5682 /organism="Homo sapiens" /db_xref="taxon:9606" source 1..3991 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="14" /tissue_type="placenta and PBMC" /cell_type="B lymphocytes and placenta" /cell_line="RGN-1 (lymphoblastoid cell line, LCL)" /map="q32" /clone_lib="pJEH & lpl 8" CDS 853..1446 /codon_start=1 /product="hypothetical protein" /db_xref="PID:e1192243" /db_xref="PID:g2193878" /translation="MSPTPVFGPGPHHSALVLITQPRSWSLSPGPRHSALDLVTQPWT SSLSPGPPHSALVLVTQPWSSSLSPGPRHSALDLVTQPWSSSLSPGPRHSALVLITQP WTSSLSPGPPHSALDLITHPGPPHLALDLITQPWSWSLSPGPHHSALDLITQPWSSSL SPGPDHPALVLITQSWSRSHSPGPDHSILVLITQPWS" misc_feature 2377..2483 /note="matches EST AA215369" CDS 3487..4515 /codon_start=1 /product="hypothetical protein" /db_xref="PID:e321847" /db_xref="PID:g2193879" /translation="MGRDKRDRKGETKTDTEREMGMERDKRQGQGGTEARPVTETELQ ETERERDEMREIRRNGDTDGRKREEMGTGKRDMETDMQREHNRGREHTGRERGSKMGP GTQLQAPLTQSLSQTHTTTIIQHSHTNTQKHTHAHSHTLSHCHIHTHTHTQTHMAQEQ MGRPQPAAGSGRAGAEPESGPVCGWGAWQHGPTCTYKAWPPRSLGHHPALALFPVCAG RGRTLSLGETYRSTQDPEHRAGVGNNMGEAGAGGPRTLRTATQPWGLKPRTASGLPGR TGQWGMVREQDSQQGAARGQGWTLGGQQDRGRGRGVGEGPGGPGEPVGPCLWEGGQEQ PRVARLAG" BASE COUNT 1182 a 1738 c 1635 g 1127 t ORIGIN 1 gcgggagccg ggcaagggca caggtgggag cccaggaggg ggatgaggcc cacagtggat 61 gaggtgggct gcagtgcttg gctaagagga gagcaccacc tgctcccact gtggggggac 121 gtgctctcct ggggggccct tcacagacac tgaggacacg cgcaggccca gggtcagggc 181 tgagcttccc tccagtgcag taacgaggat tccgtccagg ctcccatgag ccaggccagg 241 gctgagacag agggcgttgg caaggatgct gctccttcag gctgtaaccc ctctgtcttt 301 gcagggagga agtgtggagg aacctcttgg agaagccagc tatgcttgcc agagctcagc 361 cctttcagac atcaccgacc cgcccttact cacgtggctt ccaggttgca ataaagtggc 421 cccaaggaaa atgttcacag actctgaatg aggagacggg ggtcagggaa agggtggtgg 481 ctttagactg gagaacgcct cgttcaaagt ccccctgggt gtcatggtgg tcatggtggg 541 catggacaga gggtacccct ggtcccaaaa tcaagaaatg acctgatctt gcatgaggct 601 gaggcccaag gatgaatgct ggattcacca gagaacatgg caaagaagcc tgctcctaag 661 aactacatgg gatccctgtt cctcataacc tagacagccc tggtcctcct cacagggtcc 721 tcatcctgat cacagggccc tggtcctgac cagtgggccc gtttaccgat cactgagctc 781 tggtcctact acttggtccc tgttcctgat cacttaattc tagccttgat cactgagcct 841 tagtcctgat cactgagtcc tactcctgtt tttggccctg gacctcatca ctcagccctg 901 gtcctcatca ctcagccccg gtcctggtca ctcagccctg gtcctcgtca ctcagccctg 961 gacctcgtca ctcagccctg gacctcctca ctcagccctg gacctcctca ctcagccctg 1021 gtcctcgtca ctcagccctg gtcctcgtca ctcagccctg gacctcgtca ctcagccctg 1081 gacctcgtca ctcagccctg gtcctcgtca ctcagccctg gacctcgtca ctcagccctg 1141 gtcctcatca ctcagccctg gacctcatca ctcagccctg gacctcctca ctcagccctg 1201 gacctcatca ctcaccctgg acctcctcac ttagccctgg acctcatcac tcagccctgg 1261 tcctggtcac tcagccctgg tcctcatcac tcagccctgg acctcatcac tcagccctgg 1321 tcctcatcac tcagccctgg tcctgatcac ccagccctgg tcctgatcac tcagtcctgg 1381 tccagatcac acagccctgg tcctgatcac tccatcctag tcctcatcac tcagccctgg 1441 tcctgatcac ccagccctgg tcctgatcac tcagtcctgg tccagatcac tcagccctgg 1501 tcctgatcac tcagctctgg tcctgattat tagatcctga tgctagtcac tgggcactgg 1561 gcactgggcc cggatctgat cctaatcact gaaccctgct cctgatcaat gaaccccggc 1621 cctgatcaaa gggcttgctc ctgatcactc cctggtcatg ctgtgttctg gtccttaccc 1681 ctgaactctg gtccgtgatt actgagtctt ggtcctgatc cctgagtcct agacttgatc 1741 actcaattcc agtcctcatc agtgggccct ggtcgtgatc actgagccct gatctggatc 1801 actaggctct ggtcctaact cagtcctggt ctggatcgct gagtctgatc ctgatcactg 1861 ggccatgttc tggatcaatg agccctggcc ctgatcactg ggccctggtc ctgctaacta 1921 tgctctggtc ctgaccactg agcccttgtc ccagtcaaca gtcaataact gagctctggt 1981 cctagtcaat agtcaatcac tgagccctgg tcctgatcac tgggtcctgg tcctgatcac 2041 tgggtcctgt tcttatccct gaatcttggt cctgatcact gagtcctagc cctggccctc 2101 atcactgggt cttgttccta atcactgggc tctggttctg accaatggcc cctggttctg 2161 gtccctgact cctggtcctg atcaatgggc tctgctcttg acttctgagt cctggtgcta 2221 tcatccagtc ctggtcctga tcactgggcc ctgattctaa tcactcagac ctccttttga 2281 tcactgaacc ctagtattta tcactgagcc ctgatcctga tcgttgggcc ctatttctgg 2341 taactgagct ctgatcctga ccactgacct ctgttcctaa tcactgagcc ctggaccaga 2401 tcactggccc tggtcctgat cactggcccc tggacctgat cactgaccct tgttcccgat 2461 tgctgagccc tggacctgat cactgagccc tgttcctgat cactaacccc tattcctgac 2521 cactgagccc tggtcctgat tactgagccc tggacccagt caatgacccc tgttcctgat 2581 cactgagccc tggaccaaat cactgagccc tggaccagat cactgggccc tggtcttgat 2641 cactgggccc tggtcctgat tactgagccc tgatcctgat cactgggccc tgttcctgac 2701 cactgagccc tggaccagac cacggatccc tgttcctgat cactgatctc tggttctttt 2761 attatgcata ttcattttga aatctgattc cttttctgag catgtatcag tctgactaga 2821 cactgagtcc tgtctgattt ctgagccttg gccctcatga gtaagtgacc tgcagtggtg 2881 gagggagctc caggggagcc gagaccctct cagtgcatgt actcactggt agatgaagaa 2941 atgaccccaa tgattgctcc attcttccag gctcagaggg gtgtgtaggc cccaggagga 3001 cttggtgggg agaagaccag cccaggccct gtgagtacac ccagccccag cccctaaggg 3061 gtcgccaggt ctcgacttag cactggggag ggggtacagt acaggagtgg ggacaggaag 3121 gtgaggggag gccatgccgt ttgtattctc ttgcttttct ctctctcctg aagcctcttg 3181 aatagacctg cagaaatacc caaaatagcc ctgtggggtg gctgagtcat tgtgaacaca 3241 gcccaggtca ggtgttccag ccagagaact gctgttctga gaaacatgcc ccaaaaccga 3301 gacctggcca ggtgtgcctg gggcctgagc gaggggctgc agccacaggt aggcccagcc 3361 ccaaccagcc cagagtcagc tagggctttc caggtccagg gttaggcaga ggtcagccag 3421 ggtcaaccac ggtctatctg aggggagaga caagagacac agagacatag agagagagag 3481 acagggatgg ggagagacaa gagagacagg aaaggagaga caaagacaga cacagagaga 3541 gagatgggga tggagagaga taagagacag ggacagggag ggacagaggc gaggccagtg 3601 acagagacag agttacaaga gacagaaaga gagagagatg agatgagaga gatcagaaga 3661 aacggagaca cagatgggag aaaaagagag gagatgggaa caggaaaaag agacatggag 3721 acagacatgc agagagaaca caacagaggc agagaacaca cagggagaga aagaggcagc 3781 aagatgggcc caggaactca gctgcaagcc cctctcacac agtcactctc acaaacacac 3841 accacgacta ttatacaaca ttcacacaca aacacacaga aacataccca cgctcacagc 3901 cacacactaa gtcactgtca catccatacc cacactcaca cccaaacaca catggcccag 3961 gagcaaatgg ggagacctca gcctgcagct gggagcggcc gagcgggcgc tgagccagag 4021 tcggggcctg tctgtgggtg gggggcatgg cagcacgggc ccacctgcac ctacaaggcc 4081 tggcccccga ggtcactggg ccaccaccca gccctcgccc tgttccccgt ctgtgctgga 4141 cggggcagga ctctgagcct cggggaaacc tacagatcca cacaggaccc cgaacatcgg 4201 gctggggtgg gtaacaacat gggagaagcg ggagcaggag gtcccaggac cctgcgcact 4261 gcgacccagc cctgggggct gaagcccagg acagcctcag gtctcccagg aaggactgga 4321 cagtggggga tggtcagaga acaggacagc cagcagggtg cagcccgagg acagggatgg 4381 acgctgggag gtcaacagga caggggcagg ggccgtggag tgggcgaagg tcctggaggg 4441 cctggagaac ctgtgggtcc gtgtttgtgg gaaggaggcc aggagcagcc cagagtggcc 4501 aggctggcag ggtgaggagg tgggggcagt gaggtgaggg tgaccgagac agtgaggcct 4561 ctggccaggg aggggacctt ggctgggctc tgactgaacc cagggctcct ggagaagggg 4621 ccccaggcgg ggatgaggat gtgggcatct gactccatca acaatggggc ttccaacacg 4681 cacagcctgg gcctcggaga cctgggccct gacccgcctc cccctggcac tgggccgggt 4741 gccgtgtgtg gtccccagtc cccgcagcac ctcccccaca ctggtcacgt tccagggccc 4801 ctctgaagca cctgctgtga ggggatgtgg ggaggggaca gggacttggg cctgagctgc 4861 cgggtcgggg gggagtcggg gacccaggct cagcgtgtgg ctgcggacca gacagatggg 4921 gatggaggag gacacgccct gtacccactg cctgccaagg ggctggaccc acgcccagta 4981 taggccatgt cacccagagg cctgtgaacc attcactctg agccactaaa acattcagga 5041 gctttgaaag cagcccccgt gccttgtcaa tatgcgatga ctctgagcat cacgctgtcc 5101 ctgctggatc caccctccag ccccagcgag ggaggctggg ccccgggcag caggtggtga 5161 gggcagcggg cacagccacc ctacagcaca cacagggtct cagggacgcg tccaccacag 5221 cccgtgcaca ggctcctcac ggcactgagt tcacccgggg cgcgggccgt ttgtcctcag 5281 gagtccggct gtgccctccg cccccagccc tgtcctgctg aggctgcagc tgggtcccgg 5341 ggcacagggc ggccctgagc gaccttgtca tgttggtctg tcgggtgggc tgctggctct 5401 ctgtggagct ggcagagccg cggttcagcc ttggaggccg gtcctggggc ccagcagccg 5461 tggggagcac tgcccagtcc cgtgcccaca gggaatcacc tgggctgagg aagggcccac 5521 acgccgacgg gatcggggtc aggcagcgca cgcctggcac cgagatccca cgtcccgaag 5581 tggggacacg gcccaggggc actgttccgg gagggtctca agatggggtc tcctatttca 5641 atcttcactc cttctgcacc tgttagctgg gaaccttcta ga // LOCUS HSMRNAEN 3181 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for enkephalinase (EC 3.4.24.11). ACCESSION X07166 NID g34757 KEYWORDS enkephalinase; metalloprotein; neutral endopeptidase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3181) AUTHORS Malfroy,B., Kuang,W.J., Seeburg,P.H., Mason,A.J. and Schofield,P.R. TITLE Molecular cloning and amino acid sequence of human enkephalinase (neutral endopeptidase) JOURNAL FEBS Lett. 229 (1), 206-210 (1988) MEDLINE 88152222 FEATURES Location/Qualifiers source 1..3181 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="lambda gt10" /clone="lambda H7" CDS 18..2249 /note="enkephalinase (AA 1-743)" /codon_start=1 /db_xref="PID:g34758" /db_xref="SWISS-PROT:P08473" /translation="MDITDINTPKPKKKQRWTPLEISLSVLVLLLTIIAVTMIALYAT YDDGICKSSDCIKSAARLIQNMDATTEPCTDFFKYACGGWLKRNVIPETSSRYGNFDI LRDELEVVLKDVLQEPKTEDIVAVQKAKALYRSCINESAIDSRGGEPLLKLLPDIYGW PVATENWEQKYGASWTAEKAIAQLNSKYGKKVLINLFVGTDDKNSVNHVIHIDQPRLG LPSRDYYECTGIYKEACTAYVDFMISVARLIRQEERLPIDENQLALEMNKVMELEKEI ANATAKPEDRNDPMLLYNKMTLAQIQNNFSLEINGKPFSWLNFTNEIMSTVNISITNE EDVVVYAPEYLTKLKPILTKYSARDLQNLMSWRFIMDLVSSLSRTYKESRNAFRKALY GTTSETATWRRCANYVNGNMENAVGRLYVEAAFAGESKHVVEDLIAQIREVFIQTLDD LTWMDAETKKRAEEKALAIKERIGYPDDIVSNDNKLNNEYLELNYKEDEYFENIIQNL KFSQSKQLKKLREKVDKDEWISGAAVVNAFYSSGRNQIVFPAGILQPPFFSAQQSNSL NYGGIGMVIGHEITHGFDDNGRNFNKDGDLVDWWTQQSASNFKEQSQCMVYQYGNFSW DLAGGQHLNGINTLGENIADNGGLGQAYRAYQNYIKKNGEEKLLPGLDLNHKQLFFLN FAQVWCGTYRPEYAVNSIKTDVHSPGNFRIIGTLQNSAEFSEAFHCRKNSYMNPEKKC RVW" misc_feature 3073..3078 /note="poly A signal" BASE COUNT 1055 a 582 c 657 g 887 t ORIGIN 1 gcaagtcaga aagtcagatg gatataactg atatcaacac tccaaagcca aagaagaaac 61 agcgatggac tccactggag atcagcctct cggtccttgt cctgctcctc accatcatag 121 ctgtgacaat gatcgcactc tatgcaacct acgatgatgg tatttgcaag tcatcagact 181 gcataaaatc agctgctcga ctgatccaaa acatggatgc caccactgag ccttgtacag 241 actttttcaa atatgcttgc ggaggctggt tgaaacgtaa tgtcattccc gagaccagct 301 cccgttacgg caactttgac attttaagag atgaactaga agtcgttttg aaagatgtcc 361 ttcaagaacc caaaactgaa gatatagtag cagtgcagaa agcaaaagca ttgtacaggt 421 cttgtataaa tgaatctgct attgatagca gaggtggaga acctctactc aaactgttac 481 cagacatata tgggtggcca gtagcaacag aaaactggga gcaaaaatat ggtgcttctt 541 ggacagctga aaaagctatt gcacaactga attctaaata tgggaaaaaa gtccttatta 601 atttgtttgt tggcactgat gataagaatt ctgtgaatca tgtaattcat attgaccaac 661 ctcgacttgg cctcccttct agagattact atgaatgcac tggaatctat aaagaggctt 721 gtacagcata tgtggatttt atgatttctg tggccagatt gattcgtcag gaagaaagat 781 tgcccatcga tgaaaaccag cttgctttgg aaatgaataa agttatggaa ttggaaaaag 841 aaattgccaa tgctacggct aaacctgaag atcgaaatga tccaatgctt ctgtataaca 901 agatgacatt ggcccagatc caaaataact tttcactaga gatcaatggg aagccattca 961 gctggttgaa tttcacaaat gaaatcatgt caactgtgaa tattagtatt acaaatgagg 1021 aagatgtggt tgtttatgct ccagaatatt taaccaaact taagcccatt cttaccaaat 1081 attctgccag agatcttcaa aatttaatgt cctggagatt cataatggat cttgtaagca 1141 gcctcagccg aacctacaag gagtccagaa atgctttccg caaggccctt tatggtacaa 1201 cctcagaaac agcaacttgg agacgttgtg caaactatgt caatgggaat atggaaaatg 1261 ctgtggggag gctttatgtg gaagcagcat ttgctggaga gagtaaacat gtggtcgagg 1321 atttgattgc acagatccga gaagttttta ttcagacttt agatgacctc acttggatgg 1381 atgccgagac aaaaaagaga gctgaagaaa aggccttagc aattaaagaa aggatcggct 1441 atcctgatga cattgtttca aatgataaca aactgaataa tgagtacctc gagttgaact 1501 acaaagaaga tgaatacttc gagaacataa ttcaaaattt gaaattcagc caaagtaaac 1561 aactgaagaa gctccgagaa aaggtggaca aagatgagtg gataagtgga gcagctgtag 1621 tcaatgcatt ttactcttca ggaagaaatc agatagtctt cccagccggc attctgcagc 1681 cccccttctt tagtgcccag cagtccaact cattgaacta tgggggcatc ggcatggtca 1741 taggacacga aatcacccat ggcttcgatg acaatggcag aaactttaac aaagatggag 1801 acctcgttga ctggtggact caacagtctg caagtaactt taaggagcaa tcccagtgca 1861 tggtgtatca gtatggaaac ttttcctggg acctggcagg tggacagcac cttaatggaa 1921 ttaatacact gggagaaaac attgctgata atggaggtct tggtcaagca tacagagcct 1981 atcagaatta tattaaaaag aatggcgaag aaaaattact tcctggactt gacctaaatc 2041 acaaacaact atttttcttg aactttgcac aggtgtggtg tggaacctat aggccagagt 2101 atgcggttaa ctccattaaa acagatgtgc acagtccagg caatttcagg attattggga 2161 ctttgcagaa ctctgcagag ttttcagaag cctttcactg ccgcaagaat tcatacatga 2221 atccagaaaa gaagtgccgg gtttggtgat cttcaaaaga agcattgcag cccttggcta 2281 gacttgccaa caccacagaa atggggaatt ctctaatcga aagaaaatgg gccctagggg 2341 tcactgtact gacttgaggg tgattaacag agagggcacc atcacaatac agataacatt 2401 aggttgtcct agaaagggtg tggagggagg aagggggtct aaggtctatc aagtcaatca 2461 tttctcactg tgtacataat gcttaatttc taaagataat attactgttt atttctgttt 2521 ctcatatggt ctaccagttt gctgatgtcc ctagaaaaca atgcaaaacc tttgaggtag 2581 accaggattt ctaatcaaaa gggaaaagaa gatgttgaag aatacagtta ggcaccagaa 2641 gaacagtagg tgacactata gtttaaaaca cattgcctaa ctactagttt ttacttttat 2701 ttgcaacatt tacagtcctt caaaatcctt ccaaagaatt cttatacaca ttggggcctt 2761 ggagcttaca tagttttaaa ctcatttttg ccatacatca gttattcatt ctgtgatcat 2821 ttattttaag cactcttaaa gcaaaaaatg aatgtctaaa attgtttttt gttgtacctg 2881 ctttgactga tgctgagatt cttcaggctt cctgcaattt tctaagcaat ttcttgctct 2941 atctctcaaa acttggtatt tttcagagat ttatataaat gtaaaaataa taatttttat 3001 atttaattat taactacatt tatgagtaac tattattata ggtaatcaat gaatattgaa 3061 gtttcagctt aaaataaaca gttgtgaacc aagatctata aagcgatata cagatgaaaa 3121 tttgagacta tttaaactta taaatcatat tgatgaaaag atttaagcac aaactttagg 3181 g // LOCUS HSMRNAG 1185 bp RNA PRI 23-MAR-1995 DEFINITION Human mRNA for glycosylasparaginase. ACCESSION X55762 S57449 NID g34759 KEYWORDS glyosylasparaginase; signal peptide. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1185) AUTHORS Fisher,K. TITLE Direct Submission JOURNAL Submitted (14-OCT-1990) to the EMBL/GenBank/DDBJ databases REFERENCE 2 (bases 1 to 1185) AUTHORS Fisher,K.J., Tollersrud,O.K. and Aronson,N.N. Jr. TITLE Cloning and sequence analysis of a cDNA for human glycosylasparaginase. A single gene encodes the subunits of this lysosomal amidase JOURNAL FEBS Lett. 269 (2), 440-444 (1990) MEDLINE 90382595 REMARK Erratum:[FEBS Lett 1990 Dec 10;276(1-2):232]] REFERENCE 3 (bases 1 to 1185) AUTHORS Park,H., Vettese,M.B., Fensom,A.H., Fisher,K.J. and Aronson,N.N. Jr. TITLE Characterization of three alleles causing aspartylglycosaminuria: two from a British family and one from an American patient JOURNAL Biochem. J. 290 (Pt 3), 735-741 (1993) MEDLINE 93207523 COMMENT Data kindly reviewed (29-APR-1991) by Fisher K.J. FEATURES Location/Qualifiers source 1..1185 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 28..1068 /note="glycosylasparaginase precursor (AA -23 to 323)" /codon_start=1 /db_xref="PID:g34760" /db_xref="SWISS-PROT:P20933" /translation="MARKSNLPVLLVPFLLCQALVRCSSPLPLVVNTWPFKNATEAAW RALASGGSALDAVESGCAMCEREQCDGSVGFGGSPDELGETTLDAMIMDGTTMDVGAV GDLRRIKNAIGVARKVLEHTTHTLLVGESATTFAQSMGFINEDLSTSASQALHSDWLA RNCQPNYWRNVIPDPSKYCGPYKPPGILKQDIPIHKETEDDRGHDTIGMVVIHKTGHI AAGTSTNGIKFKIHGRVGDSPIPGAGAYADDTAGAAAATGNGDILMRFLPSYQAVEYM RRGEDPTIACQKVISRIQKHFPEFFGAVICANVTGSYGAACNKLSTFTQFSFMVYNSE KNQPTEEKVDCI" sig_peptide 28..96 /note="signal peptide (AA -23 to -1)" mat_peptide 97..1065 /note="mature glycosylasparaginase (AA 1-323)" misc_feature 1179..1184 /note="polyA signal" BASE COUNT 338 a 256 c 284 g 307 t ORIGIN 1 tcgcgctggt ctcttcggtg gtcagggatg gcgcggaagt cgaacttgcc tgtgcttctc 61 gtgccgtttc tgctctgcca ggccctagtg cgctgctcca gccctctgcc cctggtcgtc 121 aacacttggc cctttaagaa tgcaaccgaa gcagcgtgga gggcattagc atctggaggc 181 tctgccctgg atgcagtgga gagcggctgt gccatgtgtg agagagagca gtgtgacggc 241 tctgtaggct ttggaggaag tcctgatgaa cttggagaaa ccacactaga tgccatgatc 301 atggatggca ctactatgga tgtaggagca gtaggagatc tcagacgaat taaaaatgct 361 attggtgtgg cacggaaagt actggaacat acaacacaca cacttttagt aggagagtca 421 gccaccacat ttgctcaaag tatggggttt atcaatgaag acttatctac cagtgcttct 481 caagctcttc attcagattg gcttgctcgg aattgccagc caaattattg gaggaatgtt 541 ataccagatc cctcaaaata ctgcggaccc tacaaaccac ctggtatctt aaagcaggat 601 attcctatcc ataaagaaac agaagatgat cgtggtcatg acactattgg catggttgta 661 atccataaga caggacatat tgctgctggt acatctacaa atggtataaa attcaaaata 721 catggccgtg taggagactc accaatacct ggagctggag cctatgctga cgatactgca 781 ggggcagccg cagccactgg gaatggtgat atattgatgc gcttcctgcc aagctaccaa 841 gctgtagaat acatgagaag aggagaagat ccaaccatag cttgccaaaa agtgatttca 901 agaatccaga agcattttcc agaattcttt ggggctgtta tatgtgccaa tgtgactgga 961 agttacggtg ctgcttgcaa taaactttca acatttactc agtttagttt catggtttat 1021 aattccgaaa aaaatcagcc aactgaggaa aaagtggact gcatctaatc catctttact 1081 gtcaacatct gtatttaaag aagaaagaaa caaaggctga aaaggctgct cactctcatc 1141 atctagtgtt cctcatgtgt tctaaagtct ttttgtaaaa taaac // LOCUS HSMRNAOXY 4103 bp RNA PRI 10-NOV-1993 DEFINITION H.sapiens mRNA for oxytocin receptor. ACCESSION X64878 NID g34764 KEYWORDS hormone receptor; oxytocin receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4103) AUTHORS Kimura,T. TITLE Direct Submission JOURNAL Submitted (28-FEB-1992) T. Kimura, Dept of Obstetrics and Gynecology, Osaka University Medical School, 2-2 Yamadaoka, Suita-shi, Osaka 565, JAPAN REMARK revised by [3] REFERENCE 2 (bases 1 to 4100) AUTHORS Kimura,T., Tanizawa,O., Mori,K., Brownstein,M.J. and Okayama,H. TITLE Structure and expression of a human oxytocin receptor JOURNAL Nature 356 (6369), 526-529 (1992) MEDLINE 92220166 REMARK Erratum:[Nature 1992 May 14;357(6374):176]] FEATURES Location/Qualifiers source 1..4103 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="uterus" CDS 368..1537 /codon_start=1 /product="oxytocin receptor" /db_xref="PID:g34765" /db_xref="SWISS-PROT:P30559" /translation="MEGALAANWSAEAANASAAPPGAEGNRTAGPPRRNEALARVEVA VLCLILLLALSGNACVLLALRTTRQKHSRLFFFMKHLSIADLVVAVFQVLPQLLWDIT FRFYGPDLLCRLVKYLQVVGMFASTYLLLLMSLDRCLAICQPLRSLRRRTDRLAVLAT WLGCLVASAPQVHIFSLREVADGVFDCWAVFIQPWGPKAYITWITLAVYIVPVIVLAT CYGLISFKIWQNLRLKTAAAAAAEAPEGAAAGDGGRVALARVSSVKLISKAKIRTVKM TFIIVLAFIVCWTPFFFVQMWSVWDANAPKEASAFIIVMLLASLNSCCNPWIYMLFTG HLFHELVQRFLCCSASYLKGRRLGETSASKKSNSSSFVLSHRSSSQRSCSQPSTA" BASE COUNT 1171 a 966 c 1035 g 931 t ORIGIN 1 atcacattag gtgcagccgg caggccatcc caactcgggc cgggagcgca cgcgtcactg 61 gggccgtcag tcgccgtgca acttccccgg ggggagtcaa ctttaggttc gcctgcggac 121 tcggtgcagt ggaagccgct gaacatcccg aggaactggc acgctggggg ctctgggctt 181 gtggccggta gaggattccc gctcatttgc agtggctcag aggagggtgg acccagcaga 241 tccgtccgtg gagtctccag gagtggagcc ccgggcgccc ctacaccctc cgacacgccg 301 gatccggccc agccgcgcca agccgtaaag ggctcgaagg ccggggcgca ccgctgccgc 361 cagggtcatg gagggcgcgc tcgcagccaa ctggagcgcc gaggcagcca acgccagcgc 421 cgcgccgccg ggggccgagg gcaaccgcac cgccggaccc ccgcggcgca acgaggccct 481 ggcgcgcgtg gaggtggcgg tgctgtgtct catcctgctc ctggcgctga gcgggaacgc 541 gtgtgtgctg ctggcgctgc gcaccacacg ccagaagcac tcgcgcctct tcttcttcat 601 gaagcaccta agcatcgccg acctggtggt ggcagtgttt caggtgctgc cgcagttgct 661 gtgggacatc accttccgct tctacgggcc cgacctgctg tgccgcctgg tcaagtactt 721 gcaggtggtg ggcatgttcg cctccaccta cctgctgctg ctcatgtccc tggaccgctg 781 cctggccatc tgccagccgc tgcgctcgct gcgccgccgc accgaccgcc tggcagtgct 841 cgccacgtgg ctcggctgcc tggtggccag cgcgccgcag gtgcacatct tctctctgcg 901 cgaggtggct gacggcgtct tcgactgctg ggccgtcttc atccagccct ggggacccaa 961 ggcctacatc acatggatca cgctagctgt ctacatcgtg ccggtcatcg tgctcgctac 1021 ctgctacggc cttatcagct tcaagatctg gcagaacttg cggctcaaga ccgctgcagc 1081 ggcggcggcc gaggcgccag agggcgcggc ggctggcgat ggggggcgcg tggccctggc 1141 gcgtgtcagc agcgtcaagc tcatctccaa ggccaagatc cgcacggtca agatgacttt 1201 catcatcgtg ctggccttca tcgtgtgctg gacgcctttc ttcttcgtgc agatgtggag 1261 cgtctgggat gccaacgcgc ccaaggaagc ctcggccttc atcatcgtca tgctcctggc 1321 cagcctcaac agctgctgca acccctggat ctacatgctg ttcacgggcc acctcttcca 1381 cgaactcgtg cagcgcttcc tgtgctgctc cgccagctac ctgaagggca gacgcctggg 1441 agagacgagt gccagcaaaa agagcaactc gtcctccttt gtcctgagcc atcgcagctc 1501 cagccagagg agctgctccc agccatccac ggcgtgaccc accagccagg gccagggctg 1561 cagcctgagg ctcaggctgt gctggcataa gtgctctgct cctaggtgat ggcgtatgtt 1621 tgtgtataag gtacctatca gtttgtatcc ctcccctcct tggggtggct tcagtggggt 1681 ggagagtggc ctccatgatg gaagatgata ggggactcag ccatcagaca acaccctggc 1741 ctcctacacg tacttctacc accctgaacc cactgctgcc ctgggcagtg agtggcttgt 1801 tttttctcct ggacttgtaa tttcactcca gtatattttt acttcttcat tctgggatat 1861 tgtgaaaagc ggtaaatata ggattggtga ccaattgggt caggaagtcc agtgttctgg 1921 acttggggta agcagtgggg ttgggacctc agatgggaag ggtggtgcta agatcctcct 1981 gacctcaaag tgtatttgcc tttaagcgaa caaatgctgg ggtccttggg gaccagcttg 2041 tcagagggta gccctaagag aaggggatta ccttgtaaga ccatctggcg cagtggacct 2101 attagaactt gggttaaaaa tgtttaagaa gctaatgttt aagaagcatt tgggaaagaa 2161 aaagaaataa atgtatccag ataggaaaag aagaagtaaa actatttgca gatgacacag 2221 ttttgtatat agaaaatcct aaggaactca cacacacaca cacacacaca cacgcacaca 2281 gctattagaa ctaataagca agttccgcaa ggtttcaaga tacaagatca atatacaaaa 2341 atgaattgta tttctttata ctagcaacaa acaatatgaa aacgaagtta aataattcca 2401 tttataatac catcagaaag aataaaatag gaatcaactt aacaaaacaa gtgcaagact 2461 gaaaactaca aaattggaaa gaaattaaag aaggcttaaa taaatggaaa gacatcctgt 2521 gttcatggat cagacttagt attgttaaga tggcaatact atcctaactg acatgcagat 2581 tcagtgcaat ccttatgaaa atcatagctg gcttttttac agaaattgat aagctagtcc 2641 caaaattcat aaagaaatgc aagggaccca gatatccaaa taagccttga aaaagaacaa 2701 agttggtgga ttcacacttc ctgatttcat aatttacgat aaaggtaatc agctcagtgt 2761 gttactggtt taaggataga catacggagc agaataaaga gtacagatat gaacacttat 2821 acttacggtc aattgatttt tgacaaggtt cccaagacaa ttcaatagag aaaggagagt 2881 cttttcaaca aatggcaccg agacaatgat atgcaagtgc aaaagaatga ggttggacct 2941 ttactcacac tatgtgcaaa aatcaactca aaacgcatcc aagatctaaa tataagagct 3001 gaaactataa aatcttagaa agaaacatag gcatagatct ttgtaacctt gaattaggca 3061 gtggtttctt agatatgata ccaaagacac aagcaaccaa tggaaaaata ggtaaattgg 3121 acttaatcaa gatttgaagc ttttgtgatt gaaaagaccc tatcaagaag gtgaaaagat 3181 aacctgcaga atgggagaaa atatttgcga gtcatatata tgataagggg cttgtatctg 3241 gaatatataa ataactctta taacacaaca ataaggagaa aaataaatca atttaaaaaa 3301 tgggctaacg gtttgaatag acatttctcc aaagaagata tgcaaatggc tactaagcac 3361 atgaaaaata ctcaacatta ttattcatta gggaaatgca agtcaaaatc acaatgagat 3421 tccagtttac aatcactagg atggctacaa taaaaagatg gacaagaacg agtgtcggtg 3481 aggatgtaga gaaactggta gaaatttaaa ttgttggtgg gaatgtaaat ggtgcacctg 3541 ctttgaaaaa cagtttggca gtacctcaaa aagttaaacg tagagtgacc atatgaccca 3601 ggaatgccac tcctaggtat ttacccaaga gaaatgaaaa cgtacataca cacaaaaact 3661 tgtacaccaa tgttcatagc aacattattt gtaatagcca aaaagtggaa acaacccaaa 3721 tgtctaccaa ctgatgaatg ggaaataaaa tgtggtctgt ccacgcaatg gaacattatt 3781 agactctaaa aagaaatgaa gtactcacac atgccacaac atggatgagc cttgaaaact 3841 tgctaagtga aagaagccag gtgcaaaagc ccacatattg tctgactgca ttgaaatgca 3901 atgtctaaaa tggacgaatc tatatagagt gaatatagat tagcgtttgc cagggcctgg 3961 aggctgtgag agatgaggca tgactactaa gggtttgggg tttctttttc gggtgatgaa 4021 aatgttcgaa attagtggtg attgtgcacg attttgagaa tgtactaaaa accaatgaac 4081 tttaaaaaat aaaaataaac aaa // LOCUS HSMRNAPD 1410 bp RNA PRI 10-APR-1995 DEFINITION H.sapiens mRNA for lung surfactant protein D. ACCESSION X65018 S38981 NID g34766 KEYWORDS lung surfactant protein D. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1410) AUTHORS Lu,J., Willis,A.C. and Reid,K.B. TITLE Purification, characterization and cDNA cloning of human lung surfactant protein D JOURNAL Biochem. J. 284 (Pt 3), 795-802 (1992) MEDLINE 92322003 REFERENCE 2 (bases 1 to 1410) AUTHORS Reid,K.B.M. TITLE Direct Submission JOURNAL Submitted (02-MAR-1992) K.B.M. Reid, MRC Immunochemistry Unit, Dept of Biochemistry, University of Oxford, South Parks, Oxford 0X1 3QU, UK FEATURES Location/Qualifiers source 1..1410 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="lung" /clone_lib="lambda gt11 cDNA" misc_feature 4..141 /note="cloning artefact" CDS 172..1299 /gene="hsp-D" /codon_start=1 /product="lung surfactant protein D" /db_xref="PID:g34767" /db_xref="SWISS-PROT:P35247" /translation="MLLFLLSALVLLTQPLGYLEAEMKTYSHRTTPSACTLVMCSSVE SGLPGRDGRDGREGPRGEKGDPGLPGAAGQAGMPGQAGPVGPKGDNGSVGEPGPKGDT GPSGPPGPPGVPGPAGREGPLGKQGNIGPQGKPGPKGEAGPKGEVGAPGMQGSAGARG LAGPKGERGVPGERGVPGNAGAAGSAGAMGPQGSPGARGPPGLKGDKGIPGDKGAKGE SGLPDVASLRQQVEALQGQVQHLQAAFSQYKKVELFPNGQSVGEKIFKTAGFVKPFTE AQLLCTQAGGQLASPRSAAENAALQQLVVAKNEAAFLSMTDSKTEGKFTYPTGESLVY SNWAPGEPNDDGGSEDCVEIFTNGKWNDRACGEKRLVVCEF" sig_peptide 172..231 /gene="hsp-D" /note="lung surfactant protein D precursor" gene 172..1299 /gene="hsp-D" mat_peptide 232..1296 /gene="hsp-D" /product="lung surfactant protein D" BASE COUNT 347 a 337 c 463 g 263 t ORIGIN 1 cggaattagg gagatagttg gtattaggat taggattgtt gtgaagtata gtacggatgc 61 tacttgtgca atgatggtaa aagggtagct tactggttgt cctccgattc aggttagaat 121 gaggaggtct gcggcttgga gctcctgggg cctaacaaaa agaaacctgc catgctgctc 181 ttcctcctct ctgcactggt cctactcaca cagcccctgg gctacctgga agcagaaatg 241 aagacctact cccacagaac aacgcccagt gcttgcaccc tggtcatgtg tagctcagtg 301 gagagtggcc tgcctggtcg cgatggacgg gatgggagag agggccctcg gggcgagaag 361 ggggacccag gtttgccagg agctgcaggg caagcaggga tgcctggaca agctggccca 421 gttgggccca aaggggacaa tggctctgtt ggagaacctg gaccaaaggg agacactggg 481 ccaagtggac ctccaggacc tcccggtgtg cctggtccag ctggaagaga aggtcccctg 541 gggaagcagg ggaacatagg acctcagggc aagccaggcc caaaaggaga agctgggccc 601 aaaggagaag taggtgcccc aggcatgcag ggctcggcag gggcaagagg cctcgcaggc 661 cctaagggag agcgaggtgt ccctggtgag cgtggagtcc ctggaaacgc aggggcagca 721 gggtctgctg gagccatggg tccccaggga agtccaggtg ccaggggacc cccgggattg 781 aagggggaca aaggcattcc tggagacaaa ggagcaaagg gagaaagtgg gcttccagat 841 gttgcttctc tgaggcagca ggttgaggcc ttacagggac aagtacagca cctccaggct 901 gctttctctc agtataagaa agttgagctc ttcccaaatg gccaaagtgt cggggagaag 961 attttcaaga cagcaggctt tgtaaaacca tttacggagg cacagctgct gtgcacacag 1021 gctggtggac agttggcctc tccacgctct gccgctgaga atgccgcctt gcaacagctg 1081 gtcgtagcta agaacgaggc tgctttcctg agcatgactg attccaagac agagggcaag 1141 ttcacctacc ccacaggaga gtccctggtc tattccaact gggccccagg ggagcccaac 1201 gatgatggcg ggtcagagga ctgtgtggag atcttcacca atggcaagtg gaatgacagg 1261 gcttgtggag aaaagcgtct tgtggtctgc gagttctgag ccaactgggg tgggtggggc 1321 agtgcttggc ccaggagttt ggccagaagt caaggcttag accctcatgc tgccaatatc 1381 ctaataaaaa ggtgaccatc aaaaaaaaaa // LOCUS HSMRP1 1120 bp RNA PRI 11-FEB-1992 DEFINITION H.sapiens mRNA for MRP-1. ACCESSION X60111 NID g34768 KEYWORDS motility related protein; MRP-1; transmembrane type cell surface protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1120) AUTHORS Seno,M. TITLE Direct Submission JOURNAL Submitted (14-JUN-1991) M. Seno, Biotechnology Research Labs, Research and Development Division, Tekeda Chemical Industries Ltd, 17-85 Jusohonmachi 2-chome, Yodogawa-ku Osaka, JAPAN REFERENCE 2 (bases 1 to 1120) AUTHORS Miyake,M., Koyama,M., Seno,M. and Ikeyama,S. TITLE Identification of the motility-related protein (MRP-1), recognized by monoclonal antibody M31-15, which inhibits cell motility JOURNAL J. Exp. Med. 174 (6), 1347-1354 (1991) MEDLINE 92078843 COMMENT High similarity with CD37, ME492, TAPA-1, CO-029 & Sm23. FEATURES Location/Qualifiers source 1..1120 /organism="Homo sapiens" /isolate="63 year old white female" /db_xref="taxon:9606" /tissue_type="malignant ascitic effusion" /cell_type="carcinoma cell" /cell_line="ZR-75-1" /clone_lib="lambda gt11" /clone="lambda MRP-1, pTB1352" /sex="Female" CDS 112..798 /codon_start=1 /product="MRP-1 (motility related protein)" /db_xref="PID:g34769" /db_xref="SWISS-PROT:P21926" /translation="MPVKGGTKCIKYLLFGFNFIFWLAGIAVLAIGLWLRFDSQTKSI FEQETNNNNSSFYTGVYILIGAGALMMLVGFLGCCGAVQESQCMLGLFFGFLLVIFAI EIAAAIWGYSHKDEVIKEVQEFYKDTYNKLKTKDEPQRETLKAIHYALNCCGLAGGVE QFISDICPKKDVLETFTVKSCPDAIKEVFDNKFHIIGAVGIGIAVVMIFGMIFSMILC CAIRRNREMV" BASE COUNT 250 a 257 c 275 g 338 t ORIGIN 1 gaccagccta cagccgcctg catctgtatc cagcgccagg tcctgccagt cccagctgcg 61 cgcgcccccc agtcccgcac ccgttcggcc caggctaagt tagccctcac catgccggtc 121 aaaggaggca ccaagtgcat caaatacctg ctgttcggat ttaacttcat cttctggctt 181 gccgggattg ctgtccttgc cattggacta tggctccgat tcgactctca gaccaagagc 241 atcttcgagc aagaaactaa taataataat tccagcttct acacaggagt ctatattctg 301 atcggagccg gcgccctcat gatgctggtg ggcttcctgg gctgctgcgg ggctgtgcag 361 gagtcccagt gcatgctggg actgttcttc ggcttcctct tggtgatatt cgccattgaa 421 atagctgcgg ccatctgggg atattcccac aaggatgagg tgattaagga agtccaggag 481 ttttacaagg acacctacaa caagctgaaa accaaggatg agccccagcg ggaaacgctg 541 aaagccatcc actatgcgtt gaactgctgt ggtttggctg ggggcgtgga acagtttatc 601 tcagacatct gccccaagaa ggacgtactc gaaaccttca ccgtgaagtc ctgtcctgat 661 gccatcaaag aggtcttcga caataaattc cacatcatcg gcgcagtggg catcggcatt 721 gccgtggtca tgatatttgg catgatcttc agtatgatct tgtgctgtgc tatccgcagg 781 aaccgcgaga tggtctagag tcagcttaca tccctgagca ggaaagttta cccatgaaga 841 ttggtgggat tttttgtttg tttgttttgt tttgtttgtt gtttgttgtt tgtttttttg 901 ccactaattt tagtattcat tctgcattgc tagataaaag ctgaagttac tttatgtttg 961 tcttttaatg cttcattcaa tattgacatt tgtagttgag cggggggttt ggtttgcttt 1021 ggtttatatt ttttcagttg tttgtttttg cttgttatat taagcagaaa tcctgcaatg 1081 aaaggtacta tatttgctag actctagaca agatattgta // LOCUS HSMRP17 1008 bp RNA PRI 10-MAY-1996 DEFINITION H.sapiens Mrp17 mRNA. ACCESSION X79865 NID g1313961 KEYWORDS mitochondrial ribosomal protein L7L12; MRP17 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1008) AUTHORS Marty,L. and Fort,P. TITLE A delayed-early response nuclear gene encoding MRPL12, the mitochondrial homologue to the bacterial translational regulator L7/L12 protein JOURNAL J. Biol. Chem. 271 (19), 11468-11476 (1996) MEDLINE 96212221 REFERENCE 2 (bases 1 to 1008) AUTHORS Fort,P.P. TITLE Direct Submission JOURNAL Submitted (27-JUN-1994) P.P. Fort, IGMM-CNRS, UMR9942-CNRS, 1919 Route de Mende, 34033 Montpellier Cedex 1, FRANCE FEATURES Location/Qualifiers source 1..1008 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /tissue_type="adenocarcinoma" /cell_line="HeLa" /clone="P2A1" /map="q23-qter" gene 138..734 /gene="Mrp17" CDS 138..734 /gene="Mrp17" /codon_start=1 /product="mitochondrial ribosomal protein L7L12" /db_xref="PID:e89852" /db_xref="PID:g1313962" /translation="MLPAAARPLWGPCLGLRAAAFRLARRQVPCVCTVRHMRSSGHQR CEALAGAPLDNAPKEYPPKIQQLVQDIASLTLLEISDLNELLKKTLKIQDVGLVPMGG VMSGAVPAAAAQEAVEEDIPIAKERTHFTVRLTEAKPVDKVKLIKEIKNYIQGINLVQ AKKLVESLPQEIKANVAKAEAEKIKAALEAVGGTVVLE" BASE COUNT 195 a 320 c 329 g 164 t ORIGIN 1 cccgaatttt ccggctcgaa tgcccggcag ccgtggcggc tagagcgttc ctccccagct 61 cgaatgcccg gcggcgaggc ggctagagcg tcgcctcctc ccggggaacc gcgtgtgacc 121 ttccagcccg cggaccgatg ctgccggcgg ccgctcgccc cctgtggggg ccttgccttg 181 ggcttcgggc cgctgcgttc cgccttgcca ggcgacaggt gccatgtgtc tgtaccgtgc 241 gacatatgag gagcagcggc catcagaggt gtgaggccct cgctggtgca cccctggata 301 acgcccccaa ggagtacccc cccaagatac agcagctggt ccaggacatc gccagcctca 361 ctctcttgga aatctcagac ctcaacgagc tcctgaagaa aacgttgaag atccaggatg 421 tcgggcttgt gccgatgggt ggtgtgatgt ctggggctgt ccctgctgca gcagcccagg 481 aggcggtgga agaagatatc cccatagcga aagaacggac acatttcacc gtccgcctga 541 ccgaggcgaa gcccgtggac aaagtgaagc tgatcaagga aatcaagaac tacatccaag 601 gcatcaacct cgtccaggca aagaagctgg tggagtccct gccccaggaa atcaaagcca 661 atgtcgccaa agctgaggcg gagaagatca aggcggccct ggaggcggtg ggcggcaccg 721 tggttctgga gtagcctcca gctcggagga cttgtgttca ggggtcctgg gccccggcga 781 ggtcccgccc tcccgtggtc actggctccg cccccagcac caggcgccca gtggagccgt 841 ttgggagaat tgcctgcgcc acgcagcggg ccggacaggc cgcacagacc tactgtggcg 901 ggagggaggg gcggctgctg cctggtgacg gcacccggaa gcccaccagg acgcgccacc 961 ggtcaatgtg cctctggtgg ctgctgagaa aaatacactg tgcagctc // LOCUS HSMRPS12 1094 bp RNA PRI 05-SEP-1997 DEFINITION Homo sapiens mRNA for mitochondrial ribosomal protein S12. ACCESSION Y11681 NID g2370129 KEYWORDS mitochondrial ribosomal protein S12. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1094) AUTHORS Shah,Z.H., ODell,K., Miller,S.C.M. and Jacobs,H.T. TITLE Metazoan sequences for mitoribosomal protein S12 JOURNAL Unpublished REFERENCE 2 (bases 1 to 1094) AUTHORS Jacobs,H.T. TITLE Direct Submission JOURNAL Submitted (06-MAR-1997) H.T. Jacobs, University of Tampere, Institute of Medical Technology, PO Box 607, 33101 Tampere, FINLAND FEATURES Location/Qualifiers source 1..1094 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="247801 (IMAGE)" /clone="213649 (IMAGE)" /dev_stage="foetus" /tissue_type="liver" /tissue_type="spleen" /chromosome="19" /map="q13" CDS 191..208 /note="short upstream ORF (5 amino acids) suggestive of translational regulation" /codon_start=1 /product="hypothetical protein" /db_xref="PID:e325322" /db_xref="PID:g2370130" /translation="MRACG" sig_peptide 342..428 /function="mitochondrial targeting" CDS 342..758 /codon_start=1 /evidence=not_experimental /product="mitochondrial ribosomal protein S12 precursor" /db_xref="PID:e325305" /db_xref="PID:g2370131" /translation="MSWSGLLHGLNTSLTCGPALVPRLWATCSMATLNQMHRLGPPKR PPRKLGPTEGRPQLKGVVLCTFTRKPKKPNSANRKCCRVRLSTGREAVCFIPGEGHTL QEHQIVLVEGGRTQDLPGVKLTVVRGKYDCGHVQKK" misc_feature 389^390 /note="intron location in genomic DNA" mat_peptide 429..755 /product="mitochondrial ribosomal protein S12" polyA_signal 1048..1053 polyA_site 1075 BASE COUNT 211 a 327 c 321 g 235 t ORIGIN 1 agctggattc agcgtgtccg cgacctcacc tttaggtcct gtgaggtcgg tggaatcctg 61 gggtcctcca aatctaccag gccatctccc cagtttccca gttcttcctg cgtgcgggcg 121 agagtggttg ggccctcggg aacccactca gagcgaggct aaatttacgg agggactttc 181 tgttagcagc atgagggcct gtggttagac ctatagaggt atttcctttg atttaagcca 241 gaaagtcctg agagcggatc ggggagcatt tgcggatcgg tcactttttc ctcctttctg 301 agtctcttat cccctaccac agggacggcc caggtggcag gatgtcctgg tctggccttc 361 tccatggcct caacacgtcc ctaacttgtg gcccagctct ggttccccgg ctctgggcta 421 cctgctccat ggctaccctg aaccagatgc accgcctggg gccccccaag cggccgcctc 481 ggaagctggg ccccacggaa ggccggccgc agctgaaggg tgtggtcctg tgcacgttta 541 cccgcaagcc gaagaagccc aactcagcca atcgcaagtg ctgtcgagtg cggctcagca 601 ctggccgcga ggccgtctgc ttcatccctg gggagggcca caccctgcag gagcaccaga 661 ttgtccttgt ggagggcggc cgcacccagg acctgccagg cgtcaagctc accgttgtgc 721 gtggcaagta cgactgtggc cacgtgcaga agaagtgacg gctgggggca cagtgggctg 781 ggcgcccctg cagaacatga accttccgct cctggctgcc acagggtcct ccgatgctgg 841 cctttgcgcc tctagaggca gccactcatg gattcaagtc ctggctccgc ctcttccatc 901 aggaccacta ttaagccata ggagtcctgg gggtgcaaag ggtgcccctc tgtcaacacc 961 cttggctcct gtgtttagag gggtggcctg aaggaccttt tctgctggga caagacactg 1021 tactgccctc tgctgggaag gggttttaat aaacagaccc tggcgcttgt gatgtaaaaa 1081 aaaaaaaaaa aaaa // LOCUS HSMRZFING 1785 bp RNA PRI 14-MAR-1996 DEFINITION H.sapiens mRNA for zinc finger gene. ACCESSION X84801 NID g683470 KEYWORDS zinc finger; ZNF165 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1785) AUTHORS Tirosvoutis,K.N., Divane,A., Jones,M. and Affara,N.A. TITLE Characterization of a novel zinc finger gene (ZNF165) mapping to 6p21 that is expressed specifically in testis JOURNAL Genomics 28 (3), 485-490 (1995) MEDLINE 96039260 REFERENCE 2 (bases 1 to 1785) AUTHORS Affara,N.A. TITLE Direct Submission JOURNAL Submitted (16-FEB-1995) N.A. Affara, Univ. of Cambridge, Dept. of Pathology, Tennis Court Road, Cambridge, CB2 1QP, UK FEATURES Location/Qualifiers source 1..1785 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="ZF388, ZF388-1, ZF388-4" /chromosome="6" /map="p21" gene 239..1763 /gene="ZNF165" CDS 239..1696 /gene="ZNF165" /codon_start=1 /db_xref="PID:g683471" /translation="MATEPKKAAAQNSPEDEGLLIVKIEEEEFIHGQDTCLQRSELLK QELCRQLFRQFCYQDSPGPREALSRLRELCCQWLKPEIHTKEQILELLVLEQFLTILP GDLQAWVHEHYPESGEEAVTILEDLERGTDEAVLQVQAHEHGQEIFQKKVSPPGPALN VKLQPVETKAHFDSSEPQLLWDCDNESENSRSMPKLEIFEKIESQRIISGRISGYISE ASGESQDICKSAGRVKRQWEKESGESQRLSSAQDEGFGKILTHKNTVRGEIISHDGCE RRLNLNSNEFTHQKSCKHGTCDQSFKWNSDFINHQIIYAGEKNHQYGKSFKSPKLAKH AAVFSGDKTHQCNECGKAFRHSSKLARHQRIHTGERCYECNECGKSFAESSDLTRHRR IHTGERPFGCKECGRAFNLNSHLIRHQRIHTREKPYECSECGKTFRVSSHLIRHFRIH TGEKPYECSECGRAFSQSSNLSQHQRIHMRENLLM" polyA_signal 1758..1763 /gene="ZNF165" BASE COUNT 596 a 370 c 425 g 394 t ORIGIN 1 gaacggggag gggtagccac atgtctcaga tctgccattg tctgcgaaaa gaaactgctg 61 cgaggaccat ccccaatccc ctgcttccct tgcccttggg aagagtaacc gccgttttgt 121 aggacacttg gggacaaccc cgcttgtcct gaaatttatt gacacggaaa atagtatttc 181 ctgtgtgccg aggatgcagt taaaccaaca ctgaccccct gcccttgaga aacacaagat 241 ggctacagaa ccaaagaaag ctgcagccca gaactctcca gaggatgaag gacttctgat 301 agtgaagata gaagaggaag aatttatcca tgggcaggac acttgcttac agagaagtga 361 actccttaag caggagctct gcaggcagct ttttaggcag ttctgctacc aggattctcc 421 tggacctcgc gaggcactga gccgcctccg ggagctctgc tgtcagtggc tgaagccaga 481 gatccatacc aaggaacaga ttctggaact gctggtgcta gagcagttcc tgaccatcct 541 gccaggagat ttgcaggcct gggtacatga acattaccca gagagtggag aggaggcagt 601 gaccatacta gaagatttgg agagaggcac tgatgaagca gtactccagg ttcaagccca 661 tgaacatgga caagaaatat tccagaaaaa agtgtcacct cctggaccag cacttaatgt 721 caagttacag ccagtggaga ccaaggccca ttttgattca tcagaacccc agctcctatg 781 ggactgtgat aatgagagtg aaaacagtag atccatgcca aagctggaaa tttttgaaaa 841 aattgaatca cagagaatta tatctggaag aatctcagga tacatatcag aagcatctgg 901 tgagtctcaa gacatctgta agtctgcagg cagggtaaag agacaatggg aaaaagaatc 961 aggggagtct cagagactct cgtctgccca ggatgaaggt tttggtaaaa tcctcaccca 1021 caaaaataca gtcagaggtg aaataataag ccacgatgga tgtgagagga gattaaatct 1081 gaactcaaat gaattcacac accagaaatc ttgtaaacat ggtacctgtg accagagctt 1141 caaatggaac tcagatttta ttaaccatca aataatttat gctggagaaa aaaatcacca 1201 atatggaaaa tctttcaaga gcccaaaact tgctaaacat gcagcagttt tcagtggaga 1261 taaaactcat cagtgtaatg aatgtgggaa agctttcagg cacagctcaa aacttgctag 1321 gcatcagaga atccacactg gagagagatg ctatgaatgt aatgaatgtg ggaaaagctt 1381 tgcagagagc tcagatctta ctagacatcg gcgaattcac actggggaaa gaccctttgg 1441 ttgcaaagaa tgtgggagag cattcaacct gaactcacat cttatcaggc atcagagaat 1501 tcacaccaga gagaaaccct acgagtgtag tgaatgtggg aaaaccttcc gagtgagctc 1561 acatcttatt cgacacttta gaattcacac tggagaaaaa ccctatgaat gcagtgagtg 1621 tggaagagcc ttcagtcaga gctcaaacct tagtcaacac cagagaattc acatgaggga 1681 aaacctatta atgtaaggaa cttaaatttg taagtaaatg ctgaggaaat ggcacaatat 1741 gaaaaatatt aaataaaaaa taaatattgg gcaagtggaa gactg // LOCUS HSMSHRECA 1270 bp RNA PRI 18-JUN-1996 DEFINITION H.sapiens mRNA for MSH receptor. ACCESSION X67594 S43709 NID g1405733 KEYWORDS MSH receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1270) AUTHORS Chhajlani,V. and Wikberg,J.E. TITLE Molecular cloning and expression of the human melanocyte stimulating hormone receptor cDNA JOURNAL FEBS Lett. 309 (3), 417-420 (1992) MEDLINE 92387402 REFERENCE 2 (bases 1 to 1270) AUTHORS Chhajlani,V. TITLE Direct Submission JOURNAL Submitted (15-DEC-1992) V. Chhajlani, Pharmaceutical Pharmacology, Biomedical Centrum, Box 591, Husargatan 3, 75124, SWEDEN REMARK Revised by [3] REFERENCE 3 (bases 1 to 1270) AUTHORS Chhajlani,V. TITLE Direct Submission JOURNAL Submitted (18-JUN-1996) V. Chhajlani, Pharmaceutical Pharmacology, Biomedical Centrum, Box 591, Husargatan 3, 75124, SWEDEN FEATURES Location/Qualifiers source 1..1270 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="11D" CDS 169..1122 /note="Author-given protein sequence is in conflict with the conceptual translation." /codon_start=1 /product="MSH receptor" /db_xref="PID:g38410" /db_xref="SWISS-PROT:Q01726" /translation="MAVQGSQRRLLGSLNSTPTAIPQLGLAANQTGARCLEVSISDGL FLSLGLVSLVENALVVATIAKNRNLHSPMYCFICCLALSDLLVSGSNVLETAVILLLE AGALVARAAVLQQLDNVIDVITCSSMLSSLCFLGAIAVDRYISIFYALRYHSIVTLPR ARRRVAAIWVASVVFSTLFIAYYDHVAVLLCLVVFFLAMLVLMAVLYVHMLARACQHA QGIARLHKRQRPVHQGFGLKGAVTLTILLGIFFLCWGPFFLHLTLIVLCPEHPTCGCI FKNFNLFLALIICNAIIDPLIYAFHSQELRRTLKEVLTCSW" BASE COUNT 210 a 416 c 374 g 270 t ORIGIN 1 ggagagggtg tgagggcaga tctgggggtg cccagatgga aggaggcagg catgggggac 61 acccaaggcc ccctggcagc accatgaact aagcaggaca cctggagggg aagaactgtg 121 gggacctgga ggcctccaac gactccttcc tgcttcctgg acaggactat ggctgtgcag 181 ggatcccaga gaagacttct gggctccctc aactccaccc ccacagccat cccccagctg 241 gggctggctg ccaaccagac aggagcccgg tgcctggagg tgtccatctc tgacgggctc 301 ttcctcagcc tggggctggt gagcttggtg gagaacgcgc tggtggtggc caccatcgcc 361 aagaaccgga acctgcactc acccatgtac tgcttcatct gctgcctggc cttgtcggac 421 ctgctggtga gcgggagcaa cgtgctggag acggccgtca tcctcctgct ggaggccggt 481 gcactggtgg cccgggctgc ggtgctgcag cagctggaca atgtcattga cgtgatcacc 541 tgcagctcca tgctgtccag cctctgcttc ctgggcgcca tcgccgtgga ccgctacatc 601 tccatcttct acgcactgcg ctaccacagc atcgtgaccc tgccgcgggc gcggcaagcc 661 gttgcggcca tctgggtggc cagtgtcgtc ttcagcacgc tcttcatcgc ctactacgac 721 cacgtggccg tcctgctgtg cctcgtggtc ttcttcctgg ctatgctggt gctcatggcc 781 gtgctgtacg tccacatgct ggcccgggcc tgccagcacg cccagggcat cgcccggctc 841 cacaagaggc agcgcccggt ccaccagggc tttggcctta aaggcgctgt caccctcacc 901 atcctgctgg gcattttctt cctctgctgg ggccccttct tcctgcatct cacactcatc 961 gtcctctgcc ccgagcaccc cacgtgcggc tgcatcttca agaacttcaa cctctttctc 1021 gccctcatca tctgcaatgc catcatcgac cccctcatct acgccttcca cagccaggag 1081 ctccgcagga cgctcaagga ggtgctgaca tgctcctggt gagcgcggtg cacgcgcttt 1141 aagtgtgctg ggcagaggga ggtggtgata ttgtgtggtc tggttcctgt gtgaccctgg 1201 gcagttcctt acctccctgg tccccgtttg tcaaagagga tggactaaat gatctctgaa 1261 agtgttgaag // LOCUS HSMSSP 1425 bp RNA PRI 12-APR-1994 DEFINITION H.sapiens MSSP-1 mRNA. ACCESSION X64652 NID g34792 KEYWORDS MSSP-1 mRNA. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1425) AUTHORS Ariga,H. TITLE Direct Submission JOURNAL Submitted (24-FEB-1992) H. Ariga, Faculty of Pharmaceutical Sciences, Hokkaido University, Kita 12 Nishi 6, Kita-ku, Sapporo 060, JAPAN REFERENCE 2 (bases 1 to 1425) AUTHORS Negishi,Y., Nishita,Y., Saegusa,Y., Kakizaki,I., Galli,I., Kihara,F., Tamai,K., Miyajima,N., Iguchi-Ariga,S.M. and Ariga,H. TITLE Identification and cDNA cloning of single-stranded DNA binding proteins that interact with the region upstream of the human c-myc gene JOURNAL Oncogene 9 (4), 1133-1143 (1994) MEDLINE 94181265 FEATURES Location/Qualifiers source 1..1425 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HL-60" /clone_lib="lambda gt11 HL-60 cDNA" gene 53..1171 /gene="MSSP-1" CDS 53..1171 /gene="MSSP-1" /codon_start=1 /product="348 aa protein" /db_xref="PID:g34793" /db_xref="SWISS-PROT:P29558" /translation="MAPPSPSTTSSNNNSSSSSNSGWDQLSKTNLYIRGLPPHTTDQD LVKLCQPYGKIVSTKAILDKTTNKCKGYGFVDFDSPAAAQKAVSALKASGVQAQKAKQ QEQDPTNLYISNLPLSMDEQELENMLKPFGQVISTRILRDSSGTSRGVGFARMESTEK CEAVIGHFNGKFIKTPPGVSAPTEPLLCKFADGGQKKRQNPNKYIPNGRPWHREGEVR LAGMTLTYDPTTAAIQNGFYPSPYSIATNRMITQTSITPYIASPVSAYQVQSPSWMQP QPYILQHPGGVNSLNGAHHVTTARINDQPSGPADESSVTRQHRNIHACNVAMQGAYLP QYAHMQTTAVPVEEASGQQQVAVETSNDHSPYTFQPNK" BASE COUNT 443 a 342 c 293 g 347 t ORIGIN 1 gaattcgggc ggaccgtatc gcaagcagca gtctctggtc ccagcccacc ccatggcccc 61 tcccagtccc agcaccacca gcagtaataa caacagtagc agcagtagca actcaggatg 121 ggatcagctc agcaaaacga acctctatat ccgaggactg cctccccaca ccaccgacca 181 ggacctggtg aagctctgtc aaccatatgg gaaaatagtc tccacaaagg caattttgga 241 taagacaacg aacaaatgca aaggttatgg ttttgtcgac tttgacagcc ctgcagcagc 301 tcaaaaagct gtgtctgccc tgaaggccag tggggttcaa gctcaaaagg caaagcaaca 361 ggaacaagat cctaccaacc tctacatttc taatttgcca ctctccatgg atgagcaaga 421 actagagaat atgctcaaac catttggaca agttatttct acaaggatac tacgtgattc 481 cagtggtaca agtcgtggtg ttggctttgc taggatggaa tcaacagaaa aatgtgaagc 541 tgttattggt cattttaatg gaaaatttat taagacacca ccaggagttt ctgcccccac 601 agaaccttta ttgtgtaagt ttgctgatgg aggacagaaa aagagacaga acccaaacaa 661 atacatccct aatggaagac catggcatag agaaggagag gtgagacttg ctggaatgac 721 acttacttac gacccaacta cagctgctat acagaacgga ttttatcctt caccatacag 781 tattgctaca aaccgaatga tcactcaaac ttctattaca ccctatattg catctcctgt 841 atctgcctac caggtgcaaa gtccttcgtg gatgcaacct caaccatata ttctacagca 901 ccctggtggt gttaactccc tcaatggagc acaccatgtc actacagccc gcatcaatga 961 tcagccctct ggcccagcag atgagtcatc tgtcactagg cagcaccgga acatacatgc 1021 ctgcaacgta gctatgcaag gagcctactt gccacagtat gcacatatgc agacgacagc 1081 ggttcctgtt gaggaggcaa gtggtcaaca gcaggtggct gtcgagacgt ctaatgacca 1141 ttctccatat acctttcaac ctaataagta actgtgagat gtacagaaag gtgttcttac 1201 atgaagaagg gtgtgaaggc tgaacaatca tggatttttc tgatcaattg tgctttagga 1261 aattattgac agttttgcac aggttcttga aaacgttatt tataatgaaa tcaactaaaa 1321 ctatttttgc tataagttct ataaggtgca taaaaccctt aaattcatct agtagctgtt 1381 cccccgaaca ggtttatttt agtaaaaaaa aaaaaccccg aattc // LOCUS HSMSX2 2605 bp RNA PRI 16-NOV-1993 DEFINITION H.sapiens MSX2 mRNA for transcription factor. ACCESSION X69295 S64120 NID g396173 KEYWORDS DNA-binding protein; homeobox gene; MSX2 gene; transcription factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2605) AUTHORS Hodgkinson,J.E. TITLE Direct Submission JOURNAL Submitted (16-NOV-1992) J.E. Hodgkinson, University of Manchester, Mol. Embryology, Dept of Cell and Structural Biology - Stopford Bldg., Oxford Rd., Manchester, M13 9pt, UK REFERENCE 2 (bases 1 to 2605) AUTHORS Hodgkinson,J.E., Davidson,C.L., Beresford,J. and Sharpe,P.T. TITLE Expression of a human homeobox-containing gene is regulated by 1,25(OH)2D3 in bone cells JOURNAL Biochim. Biophys. Acta 1174 (1), 11-16 (1993) MEDLINE 93326628 REMARK Erratum:[Biochim Biophys Acta 1993 Oct 19;1216(1):173]] FEATURES Location/Qualifiers source 1..2605 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="osteoblast" /clone_lib="lambda ZAPII" gene 20..823 /gene="MSX2" CDS 20..823 /gene="MSX2" /codon_start=1 /product="transcription factor" /db_xref="PID:g396174" /db_xref="SWISS-PROT:P35548" /translation="MASPSKGNDLFSPDEEGPAVVAGPGPGLGGAAGAAEERRVKVSS LPFSVEALMSDKKPPKESPAVPPEGASAGAHLRPLLLSGHRAREAHSPGPLVKPFETA SVKSGNSEDGAAWMQEPGRYSPPPRHMSPTTCTLRKHKTNRKPRTPFTTSQLLALERK FRQKQYLSIAERAEFSSSLNLTETQVKIWFQNRSAKAKRLQEAELEKLKMAAKPMLPS SFSLPFPISSPLQAASIYAASYPFHRPVLPIPPVGLYATPVGYGMYHLS" misc_feature 443..622 /gene="MSX2" /note="homeobox - DNA binding domain" BASE COUNT 673 a 696 c 601 g 635 t ORIGIN 1 tacgtagggc agagaagtca tggcttctcc gtccaaaggc aatgacttgt tttcgcccga 61 cgaggagggc ccagcagtgg tggccggacc aggcccgggg ctggggggcg ccgcgggggc 121 cgcggaggag cgccgcgtca aggtctccag cctgcccttc agcgtggagg cgctcatgtc 181 cgacaagaag ccgcccaagg agtcgcccgc tgtgcctccc gaaggcgcct cggccggggc 241 ccacctgcgg ccactgctgc tgtcggggca ccgcgctcgg gaagcgcaca gccccgggcc 301 gctggtgaag cccttcgaga ccgcctcggt caagtcggga aattcagaag atggagcggc 361 gtggatgcag gaacccggcc gatattcgcc gccgccaaga catatgagcc ctaccacctg 421 caccctgagg aaacacaaga ccaatcggaa gccgcgcacg ccctttacca catcccagct 481 cctcgccctg gagcgcaagt tccgtcagaa acagtacctc tccattgcag agcgtgcaga 541 gttctccagc tctctgaacc tcacagagac ccaggtcaaa atctggttcc agaaccgaag 601 cgccaaggcg aaaagactgc aggaggcgga actggaaaag ctgaaaatgg ctgcaaaacc 661 tatgctaccc tccagcttca gtctcccctt ccccatcagc tcgcccctgc aggcagcgtc 721 catatacgca gcatcctacc cgttccatag acctgtgctt cccatcccgc ccgtgggact 781 ctatgccacg ccagtgggat atggcatgta ccacctgtcc taaggaagac cagatcaata 841 gactccatga tggatgcttg tttcaaaggg tttcctctcc ctctccacaa aggcatagcc 901 agccagtact cctgcgctgc taagccctcg acgttgcacc ccaccccctc taacggctag 961 ctgacagggc cacaccacat agctgaaatt tcgttctgta ggcggaggca ccaagccctg 1021 cttttcttgg tgtaacttcc agagtccccc cttttttccc ttgcacaaaa gcttggctct 1081 gatggttttt ttggcatgat gtatatatat atatacgaaa aatactacag acccttttta 1141 tcagcagacg taaaaattca aattatttta aaaggcaaaa tttatataca tatgtgcttt 1201 ttttctatat ctcaccttcc caaaaagaca catgtgtaag tccatttgtt gtattttctt 1261 aaagagggag acaaattcgg aggagcgccg cgtcaaggtc tccagcctgc ccttcagcgt 1321 ggaggcgctc atgtccgatt tgcaaaaatg tgctaaagtc aatgattttt accgggatta 1381 ttgacttctg cttatacaag aagccgccca aggagtcgcc cgctgtgcct cccgaaggcg 1441 cctcggccgg cctgcggaaa aacaaaagaa aacagacaca atgcagcagc cagaaaatat 1501 tagatatgga gagattatgg ccactgctgc tgaccggcca cggcgtccgg gaagcgcaca 1561 gccccgggcc gctggtgatc aaagtgaacc cacatcatat ttctgcattt tacttgcatt 1621 aaaagaaacc tctttataag cccttcgaga ccgcctcggt caagtcggga aattcagaag 1681 atggagcggc gtggatgcta catacgttgt tcctatctcc cgcccacgcc cacacatatt 1741 tttaaagttt ttaggaaccc ggccgatatt cgccgccgcc aagacatatg agccctacca 1801 cctgcaccct gaccttttta agaatatttt tgtaagacca atacctggga tgagaagaat 1861 ccgtagactg ccggaaacac aagaccaatc ggaagccgcg cacgcccttt accacatccc 1921 agctcctcgc cctggaggtg aggtagaaaa attagaaata cttcctaatt cttctcaagg 1981 ctgttggtaa ctttggagcg caagttccgt cagaaacagt acctctccat tgcagagcgt 2041 gcagagttct ctatttcaga taattggaga gtaaaatgtt aaaacctgtg agaggattgt 2101 acagctctct gaacctcaca gacccaggtc aaaaggttct gagaaatact aggtacattc 2161 atcctcacag attgcaaagg tgctttgggt gggggtttag taattttctg cttaaaaaat 2221 gagtatcttg taaccattac ctatatctaa atattcttga acaattagta gatccagaaa 2281 gaaaaaaaaa atatgcttct ctgtgtgtgt acctgttgta tgtcctaact tattagaaaa 2341 attttatatc tttttacatg tggggggcag aaggtaaagc atgtttgact tgtgaaaatg 2401 ggatgtcaaa cagccataag ttccctggta ttcaccttcc tgtccatctg tcccctccat 2461 cggtatacct ttatcccttt gaaagggtgc ttgtacaatt tgatatattt tattgaagag 2521 ttatctctta ttctgaatta aattaagcat ttgttttatt ctgaattaaa ttaagcattt 2581 gttttattgc agtaaagttt gtcca // LOCUS HSMTCP12A 6165 bp DNA PRI 09-JUL-1997 DEFINITION H.sapiens MTCP1 gene, exons 2A to 7 (and joined mRNA). ACCESSION Z24459 Z24460 NID g2252491 KEYWORDS c6.1B gene; MTCP1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6165) AUTHORS Stern,M.H. TITLE Direct Submission JOURNAL Submitted (05-JUL-1993) Marc-Henri Stern, Laboratoire d'Hematologie Moleculaire, Hopital Saint-Louis, 2, place du Docteur Fournier, Paris, 75475, France REMARK Revised by [4] REFERENCE 2 (bases 1 to 6165) AUTHORS Stern,M.H., Soulier,J., Rosenzwajg,M., Nakahara,K., Canki-Klain,N., Aurias,A., Sigaux,F. and Kirsch,I.R. TITLE MTCP-1: a novel gene on the human chromosome Xq28 translocated to the T cell receptor alpha/delta locus in mature T cell proliferations JOURNAL Oncogene 8 (9), 2475-2483 (1993) MEDLINE 93368950 REFERENCE 3 (bases 1 to 6165) AUTHORS Stern,M.H. TITLE Direct Submission JOURNAL Submitted (09-JUL-1997) Marc-Henri Stern, Unite INSERM U462, Hopital Saint-Louis, 1 avenue Vellefaux, Paris, 75475, France REFERENCE 4 (bases 1 to 6165) AUTHORS Gritti,C., Choukroun,V., Soulier,J., Madani,A., Dastot,H., Leblond,V., Radford-Weiss,I., Valensi,F., Varet,B., Sigaux,F. and Stern,M.H. TITLE Alternative origin of p13MTCP1-encoding transcripts in mature T cell proliferations with t(X;14) translocations JOURNAL Unpublished FEATURES Location/Qualifiers source 1..6165 /organism="Homo sapiens" /isolate="patient Dol." /db_xref="taxon:9606" /map="Xq28" /germline exon 727..955 /note="exon2A" /number=2 misc_feature 727 /note="alternative transcription initiation site" exon 956..1107 /gene="MTCP1" /number=2 gene 956..5956 /gene="MTCP1" CDS join(1003..1107,1224..1394,1476..1523) /gene="MTCP1" /codon_start=1 /product="p13MTCP1 protein" /db_xref="PID:e328243" /db_xref="PID:g2252492" /translation="MAGEDVGAPPDHLWVHQEGIYRDEYQRTWVAVVEEETSFLRARV QQIQVPLGDAARPSHLLTSQLPLMWQLYPEERYMDNNSRLWQIQHHLMVRGVQELLLK LLPDD" exon 1224..1394 /gene="MTCP1" /number=3 exon 1476..1527 /gene="MTCP1" /number=4 exon 1614..2076 /gene="MTCP1" /number=5 exon 2878..2945 /gene="MTCP1" /number=6 CDS join(2888..2945,5587..5735) /gene="MTCP1" /codon_start=1 /product="p8MTCP1 protein" /db_xref="PID:e328244" /db_xref="PID:g2252493" /translation="MPQKDPCQKQACEIQKCLQANSYMESKCQAVIQELRKCCAQYPK GRSVVCSGFEKEEEENLTRKSASK" exon 5587..5956 /gene="MTCP1" /number=7 mRNA join(Z24458:28..328,2878..2945,5587..5956) /label=MTCP1A2_mRNA mRNA join(Z24458:28..328,Z24458:329..598,2878..2945,5587..5956) /label=MTCP1A1_mRNA mRNA join(Z24458:28..328,Z24458:329..598,956..1107,1224..1394, 1476..1527,1614..2076,2878..2945,5587..5956) /label=MTCP1B1_mRNA BASE COUNT 1203 a 813 c 815 g 1312 t 2022 others ORIGIN 1 tctagaagat tgactcatgc aaaatgcaca tcaagttatt gtgaggtcct gaaatgatat 61 ctggggcttg ggcagaggat cgttatagag tttgccctac cactcaaatg ctttgggagg 121 tatgcttcct ttacacctct caaatgtgct tctggtcttc tggcttagga agtcaaaatt 181 taaacctagt cctctatgga aaaatgggct cctgtaacat cctgcagaat tattaaaata 241 aaacaaggac ctagatggat aactttcatt tgcactcact cagctccaaa tcagcatgat 301 ccccaaagtc tcttgcagct cagaaattcc atgaagctag cttacttttc ctaacaagaa 361 aatccccttc tagaaatgcc catccgctat ttgtaaactt tcttagtaaa ctgataaata 421 gatcatttgc ccttaaaaca aacttttaat ttcaactctg aaagacatgc atatatgtcc 481 ttcagaatct taccttttaa aatggaaact gcttggaagt ggcttttacc tggggaatat 541 gacatgcatg tgtctgttat actgccttcc tgccaaaata agtgtgtctt taggcacgct 601 ctttgcatat tctattgcta actacatttt gccagacact gcttgtgaat gcagtatgtg 661 tgagaccact gcccagcttc ctgtagtgct agtcctacat ttccacacag aactcctcac 721 ctagccaaat tcttgagcgc tttgcagtca gcagcactac ctgaggctct gcctgagtgt 781 cactttagtt gtcttgcaga aagcttcaga tgtccttggt tctatttagg ttgtgcaaat 841 agacatatga ggtttggttc tggttagtgg ttgctagtac cagatcacct tgcttactgg 901 tgagttcatt caaacccaaa aagtcacacc tgtgtcctgt gcgcgggtgc tgcaggctta 961 ggtggagaaa agcagggcta gaattggaac ccaaagccca gaatggcagg agaggatgtg 1021 ggggctccac ccgatcacct ctgggttcac caagagggta tctaccgcga cgaataccag 1081 cgcacgtggg tggccgtcgt ggaagaggta actgtttaca ttttgcttat ttctttgatt 1141 ttgctttcaa gtaacttggt ttctattcca gtctcaatat tctgaagtct ttggttttat 1201 ttttgatcat tcctttccat taggagacga gtttcctaag ggcacgagtc cagcaaattc 1261 aggttccctt aggtgacgca gctaggccaa gtcaccttct tacctcccag ctacctctca 1321 tgtggcaact ctacccggag gagcgctaca tggataacaa ctctcgcttg tggcagatac 1381 agcatcattt aatggtacac atgttgctta attattttcc attgtactgg gactctcgta 1441 acagagtact ccataaactt ttctttttct gacaggtcag gggagtacag gagctgttgc 1501 ttaagctttt gcctgatgac taacctggta tgtatttcta tttctcctct gtcctccccc 1561 tctttgtttt ggctctattc agttgtggtg attttaatga tttttttttc caggtgctgg 1621 gattctacga agatatgctc ctttgttctg ttcagtattt ggcaatcact tcatccacta 1681 ctgcagtgca cccacccttg ggcctggggg gagggagtgg gttgaaaaac tgctcaagaa 1741 acagaagttt agcaagggtc atgaagaata ctgcaagtga aactgcagag agaggttacg 1801 taggcagaaa gcaagtcaac aaaagcactt agtcaggatc cgtaacttga aattgactcc 1861 tttggaaatt gccatagaac ccttaatgga catcatcggc tggacctggg atctgatgaa 1921 tcccacaaaa gtcagcacct tctacagaac agatgccctg atcaccaagg acttggtact 1981 gatttagaga gaagagagca gctcctagca gcatcaacat ctatttgtcg cttatttgcc 2041 ctgcagcaat tcacctgcct ttccttctcc catcctgtaa atatcctagg ttttgctcca 2101 gtgtttccca tcccagtact gacctaattt ggatctgctg agtatttaga gggctctgaa 2161 aagagatatg cattgatgaa attagctaca aaggcttgtg ggcctttttt tctatctgaa 2221 atcgtgtctt caaattactg gaagttgctg atcaaaattg tgggctcttg aattcaatca 2281 actcttttga atactgtaaa ccaacgttag gctagaggag ccgacgggca gtggctctat 2341 cattcgcaga agaaaccctt gataaatgat taaactcctt gagctttttt tattacgcat 2401 atataaatgg gaatcatacc tacctcaaag cattnnnnnn nnnnaaaacc ttacttttaa 2461 tgtatataaa ctgttataac tactttaact caggaaattc tttacaagat gtaaatgttt 2521 tcctaggaat cagcaatatt tggtgttatt acttcctgag aaaggtgcca gttaagcagc 2581 taaaatggtt tttcttgaag ttctatagaa accacgttat ttaaaatatc tgtaatcaga 2641 gaatgaattc aaggtttttt ttccatgtaa agctatttat atcttccaag cctaatgaaa 2701 tggatattat aaaaatacag ttgattcatc ctatgtttgg gaagagtatt tttagtgtaa 2761 ttattttcca gcaattgtaa gggcatatag tgcttgtctc agaagaaaat tccacgtaga 2821 ggggctttaa agtctttatg gaaaaaagaa aaataaaaat catttttctt gttttagttt 2881 tctggatatg ccgcagaagg atccgtgcca gaagcaagcc tgtgagatac agaaatgttt 2941 acaaggtagt atattgtaaa atgctttaaa aatatttttc cactgtgaac taactataag 3001 agaccaatta ttcttttatc tgtattaact cttttttaag aatgtaacgt atatgtaatt 3061 atcctgaaga ttctcttcat tgtatcaaag atgcagaaaa taaaaagtaa tgtgatcaaa 3121 gctaaaaatt tcactccagg tgattagcat cacctggaga tctttaaaca cttacctaga 3181 acaattcaga cttcctaggg ctagggctag aaaaaaaggg ttttttgttg ttgttgtttg 3241 tttttactaa aaggctagtt ttaaattggt tatacttcac tgcagctgac ataacacttt 3301 ttaaaagcaa tctggaagag gtgcggttga aataacaatt ggaattnnnn nnnnnnnnnn 3361 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3421 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3481 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3541 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3601 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3781 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3841 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3901 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3961 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4021 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4081 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4141 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4201 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4321 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4381 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4441 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4501 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4561 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4621 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4681 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4741 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4801 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4861 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4921 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4981 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5041 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5101 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5161 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5221 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5281 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5341 nnnnnnnnnn nnnnnnnnaa aattatatat taaaatagct ttcttccact ttgcaaatca 5401 tactttcccc taacgaacct gtcgacaggt ctctaaatta gaaagaattc agtggacttt 5461 ggtatatctt ctttcttcca gttccatagg tactgaaagt gaaaaggggt gggaagcact 5521 gatctaaacc tccctgtacc ttagtcctat gagatcatga ctagagtctg cctgtatgtt 5581 tttcagccaa cagctacatg gaatcaaagt gtcaggctgt catccaagaa ctgcgtaagt 5641 gttgtgctca gtatcccaag ggaagatctg tcgtctgttc aggatttgaa aaagaagagg 5701 aagaaaacct aacacggaag tctgcatcaa agtaaagttc ttctgaagtg ctgctccatg 5761 tttccaccaa atgaattttt tttatcctcc tgactcttca ggccaggtag cagcaaatag 5821 caaatgaaaa agtcagctac aaaagttaat gaatatgcca tctatgcaga acaggcagaa 5881 atataaacac attaaaagac aaatatgtag aatgtaatat actgagctgc taaaataaac 5941 ctgtttaaga gaaaaatttg ggttttgtac taaatgtatt ttaagactat tagatatagg 6001 gtattaatga attctcttga agttcaagcc actagtgcca ttcatcaaat ttaagtcata 6061 tgaggtatgt taggtgaaag gactcacttc cagcatcaca attttgacgc cttttttttt 6121 tttttttttt ttttttgaga tggagtctca ctctgtcatc caggc // LOCUS HSMTERF 1995 bp RNA PRI 01-APR-1997 DEFINITION H.sapiens mRNA for mitochondrial transcription termination factor. ACCESSION Y09615 NID g1707506 KEYWORDS mitochondrial transcription termination factor; mTERF gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1995) AUTHORS Fernandez-Silva,P., Martinez-Azorin,F., Micol,V. and Attardi,G. TITLE The human mitochondrial transcription termination factor (mTERF) is a multizipper protein but binds to DNA as a monomer, with evidence pointing to intramolecular leucine zipper interactions JOURNAL EMBO J. 16 (5), 1066-1079 (1997) MEDLINE 97224133 REFERENCE 2 (bases 1 to 1995) AUTHORS Attardi,G. TITLE Direct Submission JOURNAL Submitted (26-NOV-1996) G. Attardi, California Institute of Technology, Division of Biology, Pasadena, California 91125, USA FEATURES Location/Qualifiers source 1..1995 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /clone_lib="Clontech human HeLa S3" gene 78..1277 /gene="mTERF" CDS 78..1277 /gene="mTERF" /codon_start=1 /db_xref="PID:e283817" /db_xref="PID:g1707507" /translation="MQSLSLGQTSISKGLNYLTIMAPGNLWHMRNNFLFGSRCWMTRF SAENIFKSVSFRLFGVKCHNTDSEPLKNEDLLKNLLTMGVDIDMARKRQPGVFHRMIT NEQDLKMFLLSKGASKEVIASIISRYPRAITRTPENLSKRWDLWRKIVTSDLEIVNIL ERSPESFFRSNNNLNLENNIKFLYSVGLTRKCLCRLLTNAPRTFSNSLDLNKQMVEFL QAAGLSLGHNDPADFVRKIIFKNPFILIQSTKRVKANIEFLRSTFNLNSEELLVLICG PGAEILDLSNDYARRSYANIKEKLFSLGCTEEEVQKFVLSYPDVIFLAEKKFNDKIDC LMEENISISQIIENPRVLDSSISTLKSRIKELVNAGCNLSTLNITLLSWSKKRYEAKL KKLSRFA" sig_peptide 78..248 /gene="mTERF" mat_peptide 249..1274 /gene="mTERF" /product="mitochondrial transcription termination factor" polyA_signal 1958..1963 BASE COUNT 673 a 332 c 417 g 573 t ORIGIN 1 gagaggcgga agtaaaagcg aactacagtg gatgggtgca gttcaggaga tagctgttct 61 ccagcctttc tggagggatg cagagccttt ccttaggaca aacaagcatt tcaaaaggtt 121 tgaactacct aaccattatg gcaccaggaa acctctggca tatgagaaat aactttctct 181 ttggttcaag atgttggatg actcgatttt cagcagaaaa catcttcaaa tcagtttcat 241 ttaggctttt tggtgtgaag tgtcataata cagacagtga gcctttgaaa aatgaggacc 301 tactgaaaaa cttacttact atgggagtag atattgacat ggcaaggaaa cgacagcctg 361 gagtttttca taggatgatt accaatgagc aggacctgaa gatgttcctt ctttccaaag 421 gagctagcaa agaagtgatc gctagcatca tatcaagata tccacgagca ataacacgta 481 ctcccgagaa tctttcaaaa cggtgggatc tgtggagaaa gattgtgaca tcagaccttg 541 aaattgtaaa tattttggaa cgttctcctg aatccttttt tcggtccaat aacaacctaa 601 acttagagaa taatataaag ttcctctact cagttggatt gacccgtaaa tgcctttgtc 661 gattgttgac caatgcccct cgtaccttct ccaatagtct tgatctgaat aaacagatgg 721 ttgaattttt gcaggcagcc ggtttgtcat tgggtcacaa tgatcccgca gattttgtca 781 gaaagataat ttttaaaaac ccttttatct taattcagag caccaagcgg gtgaaagcta 841 acattgaatt cttacggtca actttcaatt tgaacagtga ggaactgctg gttctgatat 901 gtggtccagg agctgaaatc ctagaccttt ccaatgacta tgccagaaga agctacgcaa 961 acatcaaaga gaagctgttt tctcttggat gtactgaaga agaggtacag aagtttgtct 1021 taagctatcc agatgtgatc ttcttggcag agaaaaagtt taatgataaa atagactgcc 1081 tcatggaaga aaacattagc atttcacaaa taatcgaaaa tcctcgggtt ctggattcaa 1141 gcataagtac tttaaaaagt cgaatcaaag aattggtaaa tgctggctgt aacttgagta 1201 ctttaaacat cactcttcta tcttggagta aaaaaagata tgaagctaaa ttgaaaaagt 1261 taagcagatt tgcctaagga tgccaatgtt tttaattctc aggaactgtg aatatgttat 1321 gccacattcc aaaagaattt tgcagaagtg attaatttaa aatgttatta agggccttga 1381 gatggggaga ttatcctgaa ttactttgga gggcctaata taattgtagg gatctttaaa 1441 agttgaagag ggagccaaaa gaggaggtca gagtgatcct gtataagaaa gacaactact 1501 tttgaaggaa gttaaaaggt ggctttgaag gaggaagaca ccttgagcca aggaatgaag 1561 ggggcctgta aaaaaagatt gaaaaggcaa ggaaacagat tctccattag agtccccaga 1621 aaggaatgca gccttgctgg caccttttta gcccagtgag atcaatgtcc aacttctgac 1681 ccacagaacc ataagatggt aaatttgtgt tgttttaaac cactgagtca gtgataattt 1741 gctgaagaag caaatagatt aacatagcaa ctaagtagat gattagttgc attataggtg 1801 aagtccaaaa tacagtcctt tagtaaagtc tctatttctt ccttttataa gagtaattat 1861 atttaaaaaa tctacctacc cttctggaat tgctaatgtt agtgtttata aagtcttaaa 1921 tgtgtgcata aaaaactaaa tgtgaatata aaatctaaat aaaggaatat gtttgcctaa 1981 tatagtttaa aaaaa // LOCUS HSMTFMR 3302 bp RNA PRI 01-AUG-1994 DEFINITION H.sapiens MTF-1 mRNA for metal-regulatory transcription factor. ACCESSION X78710 NID g520933 KEYWORDS metal-regulatory transcription factor; MTF-1 gene; transcription factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3302) AUTHORS Brugnera,E., Georgiev,O., Radtke,F., Heuchel,R., Baker,E., Sutherland,G.R. and Schaffner,W. TITLE Cloning, chromosomal mapping and characterization of the human metal-regulatory transcription factor MTF-1 JOURNAL Nucleic Acids Res. 22 (15), 3167-3173 (1994) MEDLINE 94344782 REFERENCE 2 (bases 1 to 3302) AUTHORS Brugnera,E. TITLE Direct Submission JOURNAL Submitted (12-APR-1994) E. Brugnera, Inst. fr. Molekularbiologie II, der Universitaet, Winterthurerstr 190, 8057 Zurich, SWITZERLAND FEATURES Location/Qualifiers source 1..3302 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="55 years old" /sex="male" /tissue_type="liver" /cell_line="Namalwa line NM864, NM878" /clone_lib="lambda gt11 cDNA" /clone="17I,B3,J3" /chromosome="1" /map="1p32-34" mRNA 1..3302 /gene="MTF-1" gene 1..3302 /gene="MTF-1" CDS 84..2345 /gene="MTF-1" /codon_start=1 /product="metal-regulatory transcription factor" /db_xref="PID:g520934" /translation="MGEHSPDNNIIYFEAEEDELTPDDKMLRFVDKNGLVPSSSGTVY DRTTVLIEQDPGTLEDEDDDGQCGEHLPFLVGGEEGFHLIDHEAMSQGYVQHIISPDQ IHLTINPGSTPMPRNIEGATLTLQSECPETKRKEVKRYQCTFEGCPRTYSTAGNLRTH QKTHRGEYTFVCNQEGCGKAFLTSHSLRIHVRVHTKEKPFECDVQGCEKAFNTLYRLK AHQRLHTGKTFNCESEGCSKYFTTLSDLRKHIRTHTGEKPFRCDHDGCGKAFAASHHL KTHVRTHTGERPFFCPSNGCEKTFSTQYSLKSHMKGHDNKGHSYNALPQHNGSEDTNH SLCLSDLSLLSTDSELRENSSTTQGQDLSTISPAIIFESMFQNSDDTAIQEDPQQTAS LTESFNGDAESVSDVPPSTGNSASLSLPLVLQPGLSEPPQPLLPASAPSAPPPAPSLG PGSQQAAFGNPPALLQPPEVPVPHSTQFAANHQEFLPHPQAPQPIVPGLSVVAGASAS AAAVASAVAAPAPPQSTTEPLPAMVQTLPLGANSVLTNNPTITITPTPNTAILQSSLV MGEQNLQWILNGATSSPQNQEQIQQASKVEKVFFTTAVPVASSPGSSVQQIGLSVPVI IIKQEEACQCQCACRDSAKERASSRRKGCSSPPPPEPSPQAPDGPSLQLPAQTFSSAP VPGSSSSTLPSSCEQSRQAETPSDPQTETLSAMDVSEFLSLQSLDTPSNLIPIEALLQ GEEEMGLTSSFSK" BASE COUNT 852 a 865 c 788 g 797 t ORIGIN 1 cggtgctgcc gccgttgccg ggagccgcgg agacaagtca ttacgttttc atttctcaca 61 actgggctga gcacaactga accatggggg aacacagtcc agacaacaac atcatctact 121 ttgaggcaga ggaagatgag ctgacccccg atgataaaat gctcaggttt gtggataaaa 181 acggactggt gccttcctca tctggaactg tttatgatag gaccactgtt cttattgagc 241 aggaccctgg cactttggag gatgaagatg acgacggaca gtgcggagaa cacttgcctt 301 ttctagtagg gggtgaagag ggctttcacc tgatagatca tgaagcaatg tcccagggtt 361 atgtgcagca cattatctca ccagatcaga ttcatttgac aataaaccct ggttccacac 421 ccatgccaag aaatattgaa ggtgcaaccc tcactctgca gtcggaatgt ccggaaacaa 481 aacgtaaaga agtaaagcgg taccaatgta cctttgaggg ctgtccccgc acctacagca 541 cagcaggcaa cctgcgaacc caccagaaga ctcaccgagg agagtacacc tttgtctgta 601 atcaggaggg ctgtggcaaa gccttcctta cctctcacag cctcaggatc cacgtgcgag 661 tgcacacgaa ggagaagcca tttgagtgtg acgtgcaggg ctgtgagaag gcattcaaca 721 cactgtacag gctgaaagca catcagaggc ttcacacagg gaaaacgttt aactgtgaat 781 ctgaaggctg cagcaaatac ttcaccacac tcagtgatct gaggaagcac attcgaactc 841 atacagggga aaagccattt cggtgcgatc acgatggctg tggaaaagca tttgcagcaa 901 gccaccacct taaaactcac gttcgtacac atactggtga aagacccttc ttctgcccca 961 gtaatggctg tgagaaaaca ttcagcactc aatacagtct caaaagtcac atgaaaggtc 1021 atgataacaa aggacactca tacaatgcac ttccacaaca caatggatca gaggatacaa 1081 atcactcact ttgtctaagt gacttgagcc ttctgtccac agattctgaa ttgcgagaaa 1141 attccagtac gacccagggc caggacctca gcacaatttc accagcaatc atctttgaat 1201 caatgttcca gaattcagat gatacggcaa ttcaggaaga tcctcaacag acagcttcct 1261 tgactgaaag ttttaatggt gatgcagagt cagtcagtga tgttccgcca tccacaggaa 1321 attcagcatc tttatctctt ccacttgtac tgcaacctgg cctctccgag ccaccccagc 1381 ctctactacc tgcctcagct ccgtctgctc ctccgcctgc tccctcccta ggacctggct 1441 cccagcaagc tgcatttggc aacccccctg ctctcttaca acctccagaa gtgcctgttc 1501 cccacagcac acagtttgct gctaatcatc aagagtttct tccgcacccc caggcaccgc 1561 agcccattgt accaggactt tctgttgttg ctggggcttc tgcatcagca gcggcagtgg 1621 catcagctgt ggcagcacca gccccaccac aaagtactac tgagcccctg ccagccatgg 1681 tccagactct gcccctgggt gccaactctg tcctaactaa taatcccaca ataaccatca 1741 ccccaactcc caacacagct atcctgcagt ccagcctagt catgggagaa cagaacttac 1801 aatggatatt aaatggtgcc accagttctc cacaaaacca agaacaaatt cagcaagcat 1861 ctaaagttga gaaggtgttt tttaccactg cagtaccagt agccagtagc ccagggagct 1921 ctgtccagca gattggcctc agtgttcctg tgatcatcat caaacaagaa gaggcatgtc 1981 agtgtcagtg tgcatgccgg gactctgcaa aggagcgggc atccagcagg agaaagggct 2041 gctcctcccc accccctcca gagccgagcc cccaggctcc tgatgggccc agcctgcagc 2101 tcccagcgca gactttctct tcagcccctg ttcccgggtc atcatcctct accttgccct 2161 cctcctgtga gcaaagccga caagcagaga ctccttcaga ccctcagaca gaaacattaa 2221 gtgccatgga tgtgtcagag tttctatccc tccagagcct ggacaccccg tccaatctga 2281 ttcccattga agcactactg cagggggagg aggagatggg cctcaccagc agcttctcca 2341 agtgaagggc ccatgtgtgc tcacctctgg gaaaagcggg tgagcaggag gcatgaggta 2401 caatgcctgc catcatgggt cagaaatttg aaggatgaag aaatctactg tttgaaatcc 2461 tcacctttca gacgtatttt ctttattcac atcccaggag catccatttt aaggaactat 2521 tctttggaaa aaaacaaaaa acaaaaaaaa caacaaaaaa agctaagtta taagtgaact 2581 gtttggctgc actgtatgtc acttttgctt gttgtcatgt gaacttggaa actaaggtta 2641 ctcgtgtgca taaaaattct aaatgaaagg gtgtggtttc catcaatctg atgctgccca 2701 tcgcttgcac tggggtcttt gtggatcggg caggagtttt cagtgtgttg ggtgttgctc 2761 cttcctatgt gtcttttgaa tctgaggctg acatttgctt ggaaggccag acccttgctc 2821 catcagagag ggcagtggca aaggccagtg aggcagctgt gagttggaca gggttcaggt 2881 gagatggtgt tgtcatttgt gcttagtgtt ggtggtgctc agggtggata acacgggtcg 2941 ttctgcagcc cgcttcagca caaataggca gcttaaggcc tggctcacag gctgtggggt 3001 tgatctggct ctgcagaggc cctaggcagc ttgttgactg ctgtctgttg atgacgtgtg 3061 tgcaaagcag gctctagcaa catgatcact gtccttgcct tcctggttct ttctctcggt 3121 tggttgccag ggcttgcaga tcgcagtgaa ttttccttgg ggaacatcgc tgttttgtcc 3181 tagagtgaac ttgtggctta tggccagtgc tgtttggtgg tctgccttct ttttaatggt 3241 attttcttcc tcagagcaga agggctgcat tttgcttatc agaagaaggt gcagatttaa 3301 gg // LOCUS HSMTTP 3880 bp RNA PRI 13-MAR-1996 DEFINITION H.sapiens mRNA for microsomal triglyceride transfer protein. ACCESSION X91148 NID g1217638 KEYWORDS microsomal triglyceride transfer protein; MTP gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3880) AUTHORS Chester,S. TITLE Direct Submission JOURNAL Submitted (21-SEP-1995) S.Chester, MRC Molecular Medicine Group, Collier Building, R.P.M.S., Du Cane Road, London W12 0NN, UK REFERENCE 2 (bases 1 to 3880) AUTHORS Narcisi,T.M.E., Shoulders,C.C., Chester,S.A., Read,J., Brett,D.J., Harrison,G.B., Grantham,T.T., Fox,M.F., Povey,S., de Bruin,T.W.A., Erkelens,D.W., Muller,D.P.R., Lloyd,J.K. and Scott,J. TITLE Mutations of the microsomal triglyceride-transfer-protein gene in abetalipoproteinemia JOURNAL Am. J. Hum. Genet. 57 (6), 1298-1310 (1995) MEDLINE 96065017 COMMENT Related entry X75500. FEATURES Location/Qualifiers source 1..3880 /organism="Homo sapiens" /db_xref="taxon:9606" gene 25..1578 /gene="MTP" CDS 25..1578 /gene="MTP" /codon_start=1 /product="microsomal triglyceride transfer protein" /db_xref="PID:e198934" /db_xref="PID:g1217639" /translation="MILLAVLFLCFISSYSASVKGHTTGLSLNNDRLYKLTYSTEVLL DRGKGKLQDSVGYRISSNVDVALLWRNPDGDDDQLIQITMKDVNVENVNQQRGEKSIF KGKSPSKIMGKENLEALQRPTLLHLIHGKVKEFYSYQNEAVAIENIKRGLASLFQTQL SSGTTNEVDISGNCKVTYQAHQDKVIKIKALDSCKIARSGFTTPNQVLGVSSKATSVT TYKIEDSFVIAVLAEETHNFGLNFLQTIKGKIVSKQKLELKTTEAGPRLMSGKQAAAI IKAVDSKYTAIPIVGQVFQSHCKGCPSLSELWRSTRKYLQPDNLSKAEAVRNFLAFIQ HLRTAKKEEILQILKMENKEVLPQLVDAVTSAQTSDSLEAILDFLDFKSDSSIILQER FLYACGFASHPNEELLRALISKFKGSIGSSDIRETVMIITGTLVRKLCQNEGCKLKAV VEAKKLILGGLEKAEKKEDTRMYLLALKNALLPEGIPSLLKYAEAGEGPISHLATTAL QRYDAPFHN" polyA_signal (3859.3860)..>3880 BASE COUNT 1222 a 786 c 826 g 1046 t ORIGIN 1 tgcagttgag gattgctggt caatatgatt cttcttgctg tgctttttct ctgcttcatt 61 tcctcatatt cagcttctgt taaaggtcac acaactggtc tctcattaaa taatgaccgg 121 ctgtacaagc tcacgtactc cactgaagtt cttcttgatc ggggcaaagg aaaactgcaa 181 gacagcgtgg gctaccgcat ttcctccaac gtggatgtgg ccttactatg gaggaatcct 241 gatggtgatg atgaccagtt gatccaaata acgatgaagg atgtaaatgt tgaaaatgtg 301 aatcagcaga gaggagagaa gagcatcttc aaaggaaaaa gcccatctaa aataatggga 361 aaggaaaact tggaagctct gcaaagacct acgctccttc atctaatcca tggaaaggtc 421 aaagagttct actcatatca aaatgaggca gtggccatag aaaatatcaa gagaggtctg 481 gctagcctat ttcagacaca gttaagctct ggaaccacca atgaggtaga tatctctgga 541 aattgtaaag tgacctacca ggctcatcaa gacaaagtga tcaaaattaa ggccttggat 601 tcatgcaaaa tagcgaggtc tggatttacg accccaaatc aggtcttggg tgtcagttca 661 aaagctacat ctgtcaccac ctataagata gaagacagct ttgttatagc tgtgcttgct 721 gaagaaacac acaattttgg actgaatttc ctacaaacca ttaaggggaa aatagtatcg 781 aagcagaaat tagagctgaa gacaaccgaa gcaggcccaa gattgatgtc tggaaagcag 841 gctgcagcca taatcaaagc agttgattca aagtacacgg ccattcccat tgtggggcag 901 gtcttccaga gccactgtaa aggatgtcct tctctctcgg agctctggcg gtccaccagg 961 aaatacctgc agcctgacaa cctttccaag gctgaggctg tcagaaactt cctggccttc 1021 attcagcacc tcaggactgc gaagaaagaa gagatccttc aaatactaaa gatggaaaat 1081 aaggaagtat tacctcagct ggtggatgct gtcacctctg ctcagacctc agactcatta 1141 gaagccattt tggacttttt ggatttcaaa agtgacagca gcattatcct ccaggagagg 1201 tttctctatg cctgtggatt tgcttctcat cccaatgaag aactcctgag agccctcatt 1261 agtaagttca aaggttctat tggtagcagt gacatcagag aaactgttat gatcatcact 1321 gggacacttg tcagaaagtt gtgtcagaat gaaggctgca aactcaaagc agtagtggaa 1381 gctaagaagt taatcctggg aggacttgaa aaagcagaga aaaaagagga caccaggatg 1441 tatctgctgg ctttgaagaa tgccctgctt ccagaaggca tcccaagtct tctgaagtat 1501 gcagaagcag gagaagggcc catcagccac ctggctacca ctgctctcca gagatatgat 1561 gctccctttc ataactgatg aggtgaagaa gaccttaaac agaatatacc accaaaaccg 1621 taaagttcat gaaaagactg tgcgcactgc tgcagctgct atcattttaa ataacaatcc 1681 atcctacatg gacgtcaaga acatcctgct gtctattggg gagcttcccc aagaaatgaa 1741 taaatacatg ctcgccattg ttcaagacat cctacgtttt gaaatgcctg caagcaaaat 1801 tgtccgtcga gttctgaagg aaatggtcgc tcacaattat gaccgtttct ccaggagtgg 1861 atcttcttct gcctacactg gctacataga acgtagtccc cgttcggcat ctacttacag 1921 cctagacatt ctctactcgg gttctggcat tctaaggaga agtaacctga acatctttca 1981 gtacattggg aaggctggtc ttcacggtag ccaggtggtt attgaagccc aaggactgga 2041 agccttaatc gcagccaccc ctgacgaggg ggaggagaac cttgactcct atgctggtat 2101 gtcagccatc ctctttgatg ttcagctcag acctgtcacc tttttcaacg gatacagtga 2161 tttgatgtcc aaaatgctgt cagcatctgg cgaccctatc agtgtggtga aaggacttat 2221 tctgctaata gatcattctc aggaacttca gttacaatct ggactaaaag ccaatataga 2281 ggtccagggt ggtctagcta ttgatatttc aggtgcaatg gagtttagct tgtggtatcg 2341 tgagtctaaa acccgagtga aaaatagggt gactgtggta ataaccactg acatcacagt 2401 ggactcctct tttgtgaaag ctggcctgga aaccagtaca gaaacagaag caggcttgga 2461 gtttatctcc acagtgcagt tttctcagta cccattctta gtttgcatgc agatggacaa 2521 ggatgaagct ccattcaggc aatttgagaa aaagtacgaa aggctgtcca caggcagagg 2581 ttatgtctct cagaaaagaa aagaaagcgt attagcagga tgtgaattcc cgctccatca 2641 agagaactca gagatgtgca aagtggtgtt tgcccctcag ccggatagta cttccagcgg 2701 atggttttga aactgacctg tgatatttta cttgaatttg tctccccgaa agggacacaa 2761 tgtggcatga ctaagtactt gctctctgag agcacagcgt ttacatattt acctgtattt 2821 aagatttttg taaaaagcta caaaaaactg cagtttgatc aaatttgggt atatgcagta 2881 tgctacccac agcgtcattt tgaatcatca tgtgacgctt tcaacaacgt tcttagttta 2941 cttatacctc tctcaaatct catttggtac agtcagaata gttattctct aagaggaaac 3001 tagtgtttgt taaaaacaaa aataaaaaca aaaccacaca aggagaaccc aattttgttt 3061 caacaatttt tgatcaatgt atatgaagct cttgatagga cttccttaag catgacggga 3121 aaaccaaaca cgttccctaa tcaggaaaaa aaaaaaaaaa gaaaaagtaa gacacaaaca 3181 aaccattttt ttctcttttt ttggagttgg gggcccaggg agaagggaca aggcttttaa 3241 aagacttgtt agccaacttc aagaattaat atttatgtct ctgttattgt tagttttaag 3301 ccttaaggta gaaggcacat agaaataaca tctcatcttt ctgctgacca ttttagtgag 3361 gttgttccaa agagcattca ggtctctacc tccagccctg caaaaatatt ggacctagca 3421 cagaggaatc aggaaaatta atttcagaaa ctccatttga tttttctttt gctgtgtctt 3481 ttttgagact gtaatatggt acactgtcct ctaaggacat cctcatttta tctcaccttt 3541 ttgggggtga gagctctagt tcatttaact gtactctgca caatagctag gatgactaag 3601 agaacattgc ttcaagaaac tggtggattt ggatttccaa aatatgaaat aaggagaaaa 3661 atgtttttat ttgtatgaat taaaagatcc atgttgaaca tttgcaaata tttattaata 3721 aacagatgtg gtgataaacc caaaacaaat gacaggtgct tattttccac taaacacaga 3781 cacatgaaat gaaagtttag ctagcccact atttgttgta aattgaaaac gaagtgtgat 3841 aaaataaata tgtagaaatc aaaaaaaaaa aaaaaaaaaa // LOCUS HSMUARP2 1732 bp RNA PRI 07-APR-1997 DEFINITION H.sapiens mRNA for mu-ARP2 protein. ACCESSION Y08387 NID g1929346 KEYWORDS mu-adaptin-related protein; mu-ARP2. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1732) AUTHORS Wang,X. and Kilimann,M.W. TITLE Identification of two new mu-adaptin-related proteins, mu-ARP1 and mu-ARP2 JOURNAL FEBS Lett. 402 (1), 57-61 (1997) MEDLINE 97165966 REFERENCE 2 (bases 1 to 1732) AUTHORS Kilimann,M.W. TITLE Direct Submission JOURNAL Submitted (26-SEP-1996) M.W. Kilimann, Institut fuer Physiologische Chemie I, Ruhr-Universitaet Bochum, Universitaetsstr. 150, D-44780 Bochum, FRG FEATURES Location/Qualifiers source 1..1732 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" gene 55..1416 /gene="mu-ARP2" CDS 55..1416 /gene="mu-ARP2" /function="possible subunit of novel protein coat involved in formation of intracellular membrane transport vesicles" /note="medium-chain of clathrin coat adaptor complexes" /codon_start=1 /product="mu-adaptin-related protein 2" /db_xref="PID:e276509" /db_xref="PID:g1929347" /translation="MISQFFILSSKGDPLIYKDFRGDSGGRDVAELFYRKLTGLPGDE SPVVMHHHGRHFIHIRHSGLYLVVTTSENVSPFSLLELLSRLATLLGDYCGSLGEGTI SRNVALVYELLDEVLDYGYVQTTSTEMLRNFIQTEAVVSKPFSLFDLSSVGLFGAETQ QSKVAPSSAASRPVLSSRSDQSQKNEVFLDVVERLSVLIASNGSLLKVDVQGEIRLKS FLPSGSEMRIGLTEEFCVGKSELRGYGPGIRVDEVSFHSSVNLDEFESHRILRLQPPQ GELTVMRYQLSDDLPSPLPFRLFPSVQWDRGSGRLQVYLKLRCDLLSKSQALNVRLHL PLPRGVVSLSRELSSPEQKAELAEGALRWDLPRVQGGSQLSGLFQMDVPGPPGPPSHG LSTSASPLGLGPASLSFELPRHTCSGLQVRFLRLAFRPCGNANPHKWVRHLSHSDAYV IRI" misc_feature 1492..1715 /note="sequence shared with multiple ESTs; possible repetitive element or cloning artifact" BASE COUNT 371 a 507 c 472 g 382 t ORIGIN 1 agggcggggc aggcccgact ttcgccgtct tcttgtctac tctccagaac ggccatgatt 61 tcccaattct tcattctgtc ctccaagggg gacccgctca tctacaaaga cttccgcggg 121 gacagtggcg gccgggatgt ggccgagctc ttctaccgga agctgacggg actgccagga 181 gacgagtccc cggttgtcat gcatcaccat ggccgtcatt tcattcacat cagacacagc 241 ggcctctatt tggtggtcac aacttcagaa aacgtttctc ccttcagcct cctagaactg 301 ctctccaggt tggccaccct tctgggcgat tactgtggct ccctgggcga ggggaccatc 361 tcccgcaatg tggctctggt atacgaactc ctggatgaag tgctggacta tggctatgta 421 cagaccacat ccacggagat gctgaggaat ttcatccaga cggaagctgt ggtcagcaag 481 cccttcagcc tctttgacct cagcagcgtt ggcttgtttg gggctgagac acaacagagc 541 aaagtggccc ccagcagtgc agccagccgc cccgtcctgt ccagtcgctc tgaccagagc 601 caaaagaatg aagttttttt ggatgtggtc gagagattgt ctgtactgat agcatctaat 661 ggatccctgc tgaaggtgga tgtgcaggga gagattcggc tcaagagctt ccttcctagc 721 ggctctgaga tgcgcattgg cttgacggaa gagttttgtg tggggaagtc agagctgaga 781 ggttatgggc caggaatccg ggtcgatgaa gtctcgtttc acagctctgt gaatctggac 841 gaatttgagt ctcatcgaat cctccgcttg caaccacctc agggcgagct gactgtgatg 901 cggtaccaac tctccgatga cctcccctca ccgctcccct tccggctctt cccctctgtg 961 cagtgggacc gaggctcagg ccggctccag gtttatctaa agttgcgatg tgacctgctc 1021 tcaaagagcc aagccctcaa tgtcaggctg cacctccccc tgcctcgagg ggtggtcagc 1081 ctgtctcggg agctgagcag cccagagcag aaggctgagc tggcagaggg agcccttcgc 1141 tgggacctgc ctcgggtgca aggaggctct caactctcag gccttttcca gatggacgtc 1201 ccagggcccc caggacctcc cagccatggg ctctccacct cggcctctcc tctggggctg 1261 ggccctgcca gtctctcctt cgagcttccc cggcacacgt gctctggcct ccaggtccga 1321 ttcctcaggc tggccttcag gccatgcggc aatgccaacc cccacaagtg ggtgcgacac 1381 ctaagccaca gcgacgccta tgtcattcgg atctgaggct ccccaaacga ggacacgacg 1441 gccaaggtgg cagtttgtcc caagggagga cagtcgtttc ttttccagcc tcctggcctt 1501 cggactctga atctgggcag gaagagtcct cagtcccaag accaggaggg ggcaatgggc 1561 ccagcctttc tgtggtatct gatgcaggaa ggactgcagt ggatcagaac ttacaaacca 1621 aacttttatt ctgagaaact ggctgtacaa tatctaaaaa gaaagtgaca tgaaggaagc 1681 aatctacaac ttccttccgc ttagcgagca tgcaaaaaaa aaaaaaaaaa aa // LOCUS HSMUPS 10302 bp RNA PRI 10-JUL-1996 DEFINITION H.sapiens mRNA for utrophin. ACCESSION X69086 NID g34811 KEYWORDS dystrophin-related protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 10302) AUTHORS Tinsley,J.M. TITLE Direct Submission JOURNAL Submitted (02-NOV-1992) J.M. Tinsley, Molecular Genetics Group, Institute of Molecular Medicine, John Radcliffe Hospital, Headington, Oxford OX3 9DU, UK REFERENCE 2 (bases 1 to 10302) AUTHORS Tinsley,J.M., Blake,D.J., Roche,A., Fairbrother,U., Riss,J., Byth,B.C., Knight,A.E., Kendrick-Jones,J., Suthers,G.K., Love,D.R., Edwards,Y.H. and Davies,K.E. TITLE Primary structure of dystrophin-related protein JOURNAL Nature 360 (6404), 591-593 (1992) MEDLINE 93096045 FEATURES Location/Qualifiers source 1..10302 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="glioma" /cell_line="IN 157" /clone_lib="pcDNA II" /chromosome="6q 24" CDS 1..10302 /codon_start=1 /product="utrophin (dystrophin related protein)" /db_xref="PID:g34812" /db_xref="SWISS-PROT:P46939" /translation="MAKYGEHEASPDNGQNEFSDIIKSRSDEHNDVQKKTFTKWINAR FSKSGKPPINDMFTDLKDGRKLLDLLEGLTGTSLPKERGSTRVHALNNVNRVLQVLHQ NNVELVNIGGTDIVDGNHKLTLGLLWSIILHWQVKDVMKDVMSDLQQTNSEKILLSWV RQTTRPYSQVNVLNFTTSWTDGLAFNAVLHRHKPDLFSWDKVVKMSPIERLEHAFSKA QTYLGIEKLLDPEDVAVRLPDKKSIIMYLTSLFEVLPQQVTIDAIREVETLPRKYKKE CEEEAINIQSTAPEEEHESPRAETPSTVTEVDMDLDSYQIALEEVLTWLLSAEDTFQE QDDISDDVEEVKDQFATHEAFMMELTAHQSSVGSVLQAGNQLITQGTLSDEEEFEIQE QMTLLNARWEALRVESMDRQSRLHDVLMELQKKQLQQLSAWLTLTEERIQKMETCPLD DDVKSLQKLLEEHKSLQSDLEAEQVKVNSLTHMVVIVDENSGESATAILEDQLQKLGE RWTAVCRWTEERWNRLQEINILWQELLEEQCLLKAWLTEKEEALNKVQTSNFKDQKEL SVSVRRLAILKEDMEMKRQTLDQLSEIGQDVGQLLDNSKASKKINSDSEELTQRWDSL VQRLEDSSNQVTQAVAKLGMSQIPQKDLLETVRVREQAITKKSKQELPPPPPPKKRQI HVDIEAKKKFDAISAELLNWILKWKTAIQTTEIKEYMKMQDTSEMKKKLKALEKEQRE RIPRADELNQTGQILVEQMGKEGLPTEEIKNVLEKVSSEWKNVSQHLEDLERKIQLQE DINAYFKQLDELEKVIKTKEEWVKHTSISESSRQSLPSLKDSCQRELTNLLGLHPKIE MARASCSALMSQPSAPDFVQRGFDSFLGRYQAVQEAVEDRQQHLENELKGQPGHAYLE TLKTLKDVLNDSENKAQVSLNVLNDLAKVEKALQEKKTLDEILENQKPALHKLAEETK ALEKNVHPDVEKLYKQEFDDVQGKWNKLKVLVSKDLHLLEEIALTLRAFEADSTVIEK WMDGVKDFLMKQQAAQGDDAGLQRQLDQCSAFVNEIETIESSLKNMKEIETNLRSGPV AGIKTWVQTRLGDYQTQLEKLSKEIATQKSRLSESQEKAANLKKDLAEMQEWMTQAEE EYLERDFEYKSPEELESAVEEMKRAKEDVLQKEVRVKILKDNIKLLAAKVPSGGQELT SELNVVLENYQLLCNRIRGKCHTLEEVWSCWIELLHYLDLETTWLNTLEERMKSTEVL PEKTDAVNEALESLESVLRHPADNRTQIRELGQTLIDGGILDDIISEKLEAFNSRYED LSHLAESKQISLEKQLQVLRETDQMLQVLQESLGELDKQLTTYLTDRIDAFQVPQEAQ KIQAEISAHELTLEELRRNMRSQPLTSPESRTARGGSQMDVLQRKLREVSTKFQLFQK PANFEQRMLDCKRVLDGVKAELHVLDVKDVDPDVIQTHLDKCMKLYKTLSEVKLEVET VIKTGRHIVQKQQTDNPKGMDEQLTSLKVLYNDLGAQVTEGKQDLERASQLARKMKKE AASLSEWLSATETELVQKSTSEGLLGDLDTEISWAKNVLKDLEKRKADLNTITESSAA LQNLIEGSEPILEERLCVLNAGWSRVRTWTEDWCNTLMNHQNQLEIFDGNVAHISTWL YQAEALLDEIEKKPTSKQEEIVKRLVSELDDANLQVENVRDQALILMNARGSSSRELV EPKLAELNRNFEKVSQHIKSAKLLIAQEPLYQCLVTTETFETGVPFSDLEKLENDIEN MLKFVEKHLESSDEDEKMDEESAQIEEVLQRGEEMLHQPMEDNKKEKIRLQLLLLHTR YNKIKAIPIQQRKMGQLASGIRSSLLPTDYLVEINKILLCMDDVELSLNVPELNTAIY EDFSFQEDSLKNIKDQLDKLGEQIAVIHEKQPDVILEASGPEAIQIRDTLTQLNAKWD RINRMYSDRKGCFDRAMEEWRQFHCDLNDLTQWITEAEELLVDTCAPGGSLDLEKARI HQQELEVGISSHQPSFAALNRTGDGIVQKLSQADGSFLKEKLAGLNQRWDAIVAEVKD RQPRLKGESKQVMKYRHQLDEIICWLTKAEHAMQKRSTTELGENLQELRDLTQEMEVH AEKLKWLNRTELEMLSDKSLSLPERDKISESLRTVNMTWNKICREVPTTLKECIQEPS SVSQTRIAAHPNVQKVVLVSSASDIPVQSHRTSEISIPADLDKTITELADWLVLIDQM LKSNIVTVGDVEEINKTVSRMKITKADLEQRHPQLDYVFTLAQNLKNKASSSDMRTAI TEKLERVKNQWDGTQHGVELRQQQLEDMIIDSLQWDDHREETEELMRKYEARLYILQQ ARRDPLTKQISDNQILLQELGPGDGIVMAFDNVLQKLLEEYGSDDTRNVKETTEYLKT SWINLKQSIADRQNALEAEWRTVQASRRDLENFLKWIQEAETTVNVLVDASHRENALQ DSILARELKQQMQDIQAEIDAHNDIFKSIDGNRQKMVKALGNSEEATMLQHRLDDMNQ RWNDLKAKSASIRAHLEASAEKWNRLLMSLEELIKWLNMKDEELKKQMPIGGDVPALQ LQYDHCKALRRELKEKEYSVLNAVDQARVFLADQPIEAPEEPRRNLQSKTELTPEERA QKIAKAMRKQSSEVKEKWESLNAVTSNWQKQVDKALEKLRDLQGAMDDLDADMKEAES VRNGWKPVGDLLIDSLQDHIEKIMAFREEIAPINFKVKTVNDLSSQLSPLDLHPSLKM SRQLDDLNMRWKLLQVSVDDRLKQLQEAHRDFGPSSQHFLSTSVQLPWQRSISHNKVP YYINHQTQTTCWDHPKMTELFQSLADLNNVRFSAYRTAIKIRRLQKALCLDLLELSTT NEIFKQHKLNQNDQLLSVPDVINCLTTTYDGLEQMHKDLVNVPLCVDMCLNWLLNVYD TGRTGKIRVQSLKIGLMSLSKGLLEEKYRYLFKEVAGPTEMCDQRQLGLLLHDAIQIP RQLGEVAAFGGSNIEPSVRSCFQQNNNKPEISVKEFIDWMHLEPQSMVWLPVLHRVAA AETAKHQAKCNICKECPIVGFRYRSLKHFNYDVCQSCFFSGRTAKGHKLHYPMVEYCI PTTSGEDVRDFTKVLKNKFRSKKYFAKHPRLGYLPVQTVLEGDNLETPITLISMWPEH YDPSQSPQLFHDDTHSRIEQYATRLAQMERTNGSFLTDSSSTTGSVEDEHALIQQYCQ TLGGESPVSQPQSPAQILKSVEREERGELERIIADLEEEQRNLQVEYEQLKDQHLRRG LPVGSPPESIISPHHTSEDSELIAEAKLLRQHKGRLEARMQILEDHNKQLESQLHRLR QLLEQPESDSRINGVSPWASPQHSALSYSLDPDASGPQFHQAAGEDLLAPPHDTSTDL TEVMEQIHSTFPSCCPNVPSRPQAM" BASE COUNT 3328 a 2096 c 2526 g 2352 t ORIGIN 1 atggccaagt atggagaaca tgaagccagt cctgacaatg ggcagaacga attcagtgat 61 atcattaagt ccagatctga tgaacacaat gacgtacaga agaaaacctt taccaaatgg 121 ataaatgctc gattttcaaa gagtgggaaa ccacccatca atgatatgtt cacagacctc 181 aaagatggaa ggaagctatt ggatcttcta gaaggcctca caggaacatc actgccaaag 241 gaacgtggtt ccacaagggt acatgcctta aataacgtca acagagtgct gcaggtttta 301 catcagaaca atgtggaatt agtgaatata gggggaactg acattgtgga tggaaatcac 361 aaactgactt tggggttact ttggagcatc attttgcact ggcaggtgaa agatgtcatg 421 aaggatgtca tgtcggacct gcagcagacg aacagtgaga agatcctgct cagctgggtg 481 cgtcagacca ccaggcccta cagccaagtc aacgtcctca acttcaccac cagctggaca 541 gatggactcg cctttaatgc tgtcctccac cgacataaac ctgatctctt cagctgggat 601 aaagttgtca aaatgtcacc aattgagaga cttgaacatg ccttcagcaa ggctcaaact 661 tatttgggaa ttgaaaagct gttagatcct gaagatgttg ccgttcggct tcctgacaag 721 aaatccataa ttatgtattt aacatctttg tttgaggtgc tacctcagca agtcaccata 781 gacgccatcc gtgaggtaga gacactccca aggaaatata aaaaagaatg tgaagaagag 841 gcaattaata tacagagtac agcgcctgag gaggagcatg agagtccccg agctgaaact 901 cccagcactg tcactgaggt cgacatggat ctggacagct atcagattgc gttggaggaa 961 gtgctgacct ggttgctttc tgctgaggac actttccagg agcaggatga tatttctgat 1021 gatgttgaag aagtcaaaga ccagtttgca acccatgaag cttttatgat ggaactgact 1081 gcacaccaga gcagtgtggg cagcgtcctg caggcaggca accaactgat aacacaagga 1141 actctgtcag acgaagaaga atttgagatt caggaacaga tgaccctgct gaatgctaga 1201 tgggaggctc ttagggtgga gagtatggac agacagtccc ggctgcacga tgtgctgatg 1261 gaactgcaga agaagcaact gcagcagctc tccgcctggt taacactcac agaggagcgc 1321 attcagaaga tggaaacttg ccccctggat gatgatgtaa aatctctaca aaagctgcta 1381 gaagaacata aaagtttgca aagtgatctt gaggctgaac aggtgaaagt aaattcacta 1441 actcacatgg tggtcattgt tgatgaaaac agtggtgaga gcgctacagc tatcctagaa 1501 gaccagttac agaaacttgg tgagcgctgg acagcagtat gccgttggac tgaagaacgc 1561 tggaataggt tacaagaaat caatatattg tggcaggaat tattggaaga acagtgcttg 1621 ttgaaagctt ggttaaccga aaaagaagag gctttaaata aagtccagac aagcaacttc 1681 aaagaccaaa aggaactaag tgtcagtgtt cgacgtctgg ctattttgaa ggaagacatg 1741 gaaatgaagc gtcaaacatt ggatcagctg agtgagattg gccaggatgt gggacaatta 1801 cttgataatt ccaaggcatc taagaagatc aacagtgact cagaggaact gactcaaaga 1861 tgggattctt tggttcagag actagaagat tcctccaacc aggtgactca ggctgtagca 1921 aagctgggga tgtctcagat tcctcagaag gaccttttgg agactgttcg tgtaagagaa 1981 caagcaatta caaaaaaatc taagcaggaa ctgcctcctc ctcctccccc aaagaagaga 2041 cagatccatg tggatattga agctaagaaa aagtttgatg ctataagtgc agagctgttg 2101 aactggattt tgaaatggaa aactgccatt cagaccacag agataaaaga gtatatgaag 2161 atgcaagaca cttccgaaat gaaaaagaag ttgaaggcat tagaaaaaga acagagagaa 2221 agaatcccca gagcagatga attaaaccaa actggacaaa tccttgtgga gcaaatggga 2281 aaagaaggcc ttcctactga agaaataaaa aatgttctgg agaaggtttc atcagaatgg 2341 aagaatgtat ctcaacattt ggaagatcta gaaagaaaga ttcagctaca ggaagatata 2401 aatgcttatt tcaagcagct tgatgagctt gaaaaggtca tcaagacaaa ggaggagtgg 2461 gtaaaacaca cttccatttc tgaatcttcc cggcagtcct tgccaagctt gaaggattcc 2521 tgtcagcggg aattgacaaa tcttcttggc cttcacccca aaattgaaat ggctcgtgca 2581 agctgctcgg ccctgatgtc tcagccttct gccccagatt ttgtccagcg gggcttcgat 2641 agctttctgg gccgctacca agctgtacaa gaggctgtag aggatcgtca acaacatcta 2701 gagaatgaac tgaagggcca acctggacat gcatatctgg aaacattgaa aacactgaaa 2761 gatgtgctaa atgattcaga aaataaggcc caggtgtctc tgaatgtcct taatgatctt 2821 gccaaggtgg agaaggccct gcaagaaaaa aagacccttg atgaaatcct tgagaatcag 2881 aaacctgcat tacataaact tgcagaagaa acaaaggctc tggagaaaaa tgttcatcct 2941 gatgtagaaa aattatataa gcaagaattt gatgatgtgc aaggaaagtg gaacaagcta 3001 aaggtcttgg tttccaaaga tctacatttg cttgaggaaa ttgctctcac actcagagct 3061 tttgaggccg attcaacagt cattgagaag tggatggatg gcgtgaaaga cttcttaatg 3121 aaacagcagg ctgcccaagg agacgacgca ggtctacaga ggcagttaga ccagtgctct 3181 gcatttgtta atgaaataga aacaattgaa tcatctctga aaaacatgaa ggaaatagag 3241 actaatcttc gaagtggtcc agttgctgga ataaaaactt gggtgcagac aagactaggt 3301 gactaccaaa ctcaactgga gaaacttagc aaggagatcg ctactcaaaa aagtaggttg 3361 tctgaaagtc aagaaaaagc tgcgaacctg aagaaagact tggcagagat gcaggaatgg 3421 atgacccagg ccgaggaaga atatttggag cgggattttg agtacaagtc accagaagag 3481 cttgagagtg ctgtggaaga gatgaagagg gcaaaagagg atgtgttgca gaaggaggtg 3541 agagtgaaga ttctcaagga caacatcaag ttattagctg ccaaggtgcc ctctggtggc 3601 caggagttga cgtctgagct gaatgttgtg ctggagaatt accaacttct ttgtaataga 3661 attcgaggaa agtgccacac gctagaggag gtctggtctt gttggattga actgcttcac 3721 tatttggatc ttgaaactac ctggttaaac actttggaag agcggatgaa gagcacagag 3781 gtcctgcctg agaagacgga tgctgtcaac gaagccctgg agtctctgga atctgttctg 3841 cgccacccgg cagataatcg cacccagatt cgagagcttg gccagactct gattgatggg 3901 gggatcctgg atgatataat cagtgagaaa ctggaggctt tcaacagccg atatgaagat 3961 ctaagtcacc tggcagagag caagcagatt tctttggaaa agcaactcca ggtgctgcgg 4021 gaaactgacc agatgcttca agtcttgcaa gagagcttgg gggagctgga caaacagctc 4081 accacatacc tgactgacag gatagatgct ttccaagttc cacaggaagc tcagaaaatc 4141 caagcagaga tctcagccca tgagctaacc ctagaggagt tgagaagaaa tatgcgttct 4201 cagcccctga cctccccaga gagtaggact gccagaggag gaagtcagat ggatgtgcta 4261 cagaggaaac tccgagaggt gtccacaaag ttccagcttt tccagaagcc agctaacttc 4321 gagcagcgca tgctggactg caagcgtgtg ctggatggcg tgaaagcaga acttcacgtt 4381 ctggatgtga aggacgtaga ccctgacgtc atacagacgc acctggacaa gtgtatgaaa 4441 ctgtataaaa ctttgagtga agtcaaactt gaagtggaaa ctgtgattaa aacaggaaga 4501 catattgtcc agaaacagca aacggacaac ccaaaaggga tggatgagca gctgacttcc 4561 ctgaaggttc tttacaatga cctgggcgca caggtgacag aaggaaaaca ggatctggaa 4621 agagcatcac agttggcccg gaaaatgaag aaagaggctg cttctctctc tgaatggctt 4681 tctgctactg aaactgaatt ggtacagaag tccacttcag aaggtctgct tggtgacttg 4741 gatacagaaa tttcctgggc taaaaatgtt ctgaaggatc tggaaaagag aaaagctgat 4801 ttaaatacca tcacagagag tagtgctgcc ctgcaaaact tgattgaggg cagtgagcct 4861 attttagaag agaggctctg cgtccttaac gctgggtgga gccgagttcg tacctggact 4921 gaagattggt gcaatacctt gatgaaccat cagaaccagc tagaaatatt tgatgggaac 4981 gtggctcaca taagtacctg gctttatcaa gctgaagctc tattggatga aattgaaaag 5041 aaaccaacaa gtaaacagga agaaattgtg aagcgtttag tatctgagct ggatgatgcc 5101 aacctccagg ttgaaaatgt ccgcgatcaa gcccttattt tgatgaatgc ccgtggaagc 5161 tcaagcaggg agcttgtaga accaaagtta gctgagctga ataggaactt tgaaaaggtg 5221 tctcaacata tcaaaagtgc caaattgcta attgctcagg aaccattata ccaatgtttg 5281 gtcaccactg aaacatttga aactggtgtg cctttctctg acttggaaaa attagaaaat 5341 gacatagaaa atatgttaaa atttgtggaa aaacacttgg aatccagtga tgaagatgaa 5401 aagatggatg aggagagtgc ccagattgag gaagttctac aaagaggaga agaaatgtta 5461 catcaaccta tggaagataa taaaaaagaa aagatccgtt tgcaattatt acttttgcat 5521 actagataca acaaaattaa ggcaatccct attcaacaga ggaaaatggg tcaacttgct 5581 tctggaatta gatcatcact tcttcctaca gattatctgg ttgaaattaa caaaatttta 5641 ctttgcatgg atgatgttga attatcgctt aatgttccag agctcaacac tgctatttac 5701 gaagacttct cttttcagga agactctctg aagaatatca aagaccaact ggacaaactt 5761 ggagagcaga ttgcagtcat tcatgaaaaa cagccagatg tcatccttga agcctctgga 5821 cctgaagcca ttcagatcag agatacactt actcagctga atgcaaaatg ggacagaatt 5881 aatagaatgt acagtgatcg gaaaggttgt tttgacaggg caatggaaga atggagacag 5941 ttccattgtg accttaatga cctcacacag tggataacag aggctgaaga attactggtt 6001 gatacctgtg ctccaggtgg cagcctggac ttagagaaag ccaggataca tcagcaggaa 6061 cttgaggtgg gcatcagcag ccaccagccc agttttgcag cactaaaccg aactggggat 6121 gggattgtgc agaaactctc ccaggcagat ggaagcttct tgaaagaaaa actggcaggt 6181 ttaaaccaac gctgggatgc aattgttgca gaagtgaagg ataggcagcc aaggctaaaa 6241 ggagaaagta agcaggtgat gaagtacagg catcagctag atgagattat ctgttggtta 6301 acaaaggctg agcatgctat gcaaaagaga tcaaccaccg aattgggaga aaacctgcaa 6361 gaattaagag acttaactca agaaatggaa gtacatgctg aaaaactcaa atggctgaat 6421 agaactgaat tggagatgct ttcagataaa agtctgagtt tacctgaaag ggataaaatt 6481 tcagaaagct taaggactgt aaatatgaca tggaataaga tttgcagaga ggtgcctacc 6541 accctgaagg aatgcatcca ggagcccagt tctgtttcac agacaaggat tgctgctcat 6601 cctaatgtcc aaaaggtggt gctagtatca tctgcgtcag atattcctgt tcagtctcat 6661 cgtacttcgg aaatttcaat tcctgctgat cttgataaaa ctataacaga actagccgac 6721 tggctggtat taatcgacca gatgctgaag tccaacattg tcactgttgg ggatgtagaa 6781 gagatcaata agaccgtttc ccgaatgaaa attacaaagg ctgacttaga acagcgccat 6841 cctcagctgg attatgtttt tacattggca cagaatttga aaaataaagc ttccagttca 6901 gatatgagaa cagcaattac agaaaaattg gaaagggtca agaaccagtg ggatggcacc 6961 cagcatggcg ttgagctaag acagcagcag cttgaggaca tgattattga cagtcttcag 7021 tgggatgacc atagggagga gactgaagaa ctgatgagaa aatatgaggc tcgactctat 7081 attcttcagc aagcccgacg ggatccactc accaaacaaa tttctgataa ccaaatactg 7141 cttcaagaac tgggtcctgg agatggtatc gtcatggcgt tcgataacgt cctgcagaaa 7201 ctcctggagg aatatgggag tgatgacaca aggaatgtga aagaaaccac agagtactta 7261 aaaacatcat ggatcaatct caaacaaagt attgctgaca gacagaacgc cttggaggct 7321 gagtggagga cggtgcaggc ctctcgcaga gatctggaaa acttcctgaa gtggatccaa 7381 gaagcagaga ccacagtgaa tgtgcttgtg gatgcctctc atcgggagaa tgctcttcag 7441 gatagtatct tggccaggga actcaaacag cagatgcagg acatccaggc agaaattgat 7501 gcccacaatg acatatttaa aagcattgac ggaaacaggc agaagatggt aaaagctttg 7561 ggaaattctg aagaggctac tatgcttcaa catcgactgg atgatatgaa ccaaagatgg 7621 aatgacttaa aagcaaaatc tgctagcatc agggcccatt tggaggccag cgctgagaag 7681 tggaacaggt tgctgatgtc cttagaagaa ctgatcaaat ggctgaatat gaaagatgaa 7741 gagcttaaga aacaaatgcc tattggagga gatgttccag ccttacagct ccagtatgac 7801 cattgtaagg ccctgagacg ggagttaaag gagaaagaat attctgtcct gaatgctgtc 7861 gaccaggccc gagttttctt ggctgatcag ccaattgagg cccctgaaga gccaagaaga 7921 aacctacaat caaaaacaga attaactcct gaggagagag cccaaaagat tgccaaagcc 7981 atgcgcaaac agtcttctga agtcaaagaa aaatgggaaa gtctaaatgc tgtaactagc 8041 aattggcaaa agcaagtgga caaggcattg gagaaactca gagacctgca gggagctatg 8101 gatgacctgg acgctgacat gaaggaggca gagtccgtgc ggaatggctg gaagcccgtg 8161 ggagacttac tcattgactc gctgcaggat cacattgaaa aaatcatggc atttagagaa 8221 gaaattgcac caatcaactt taaagttaaa acggtgaatg atttatccag tcagctgtct 8281 ccacttgacc tgcatccctc tctaaagatg tctcgccagc tagatgacct taatatgcga 8341 tggaaacttt tacaggtttc tgtggatgat cgccttaaac agcttcagga agcccacaga 8401 gattttggac catcctctca gcattttctc tctacgtcag tccagctgcc gtggcaaaga 8461 tccatttcac ataataaagt gccctattac atcaaccatc aaacacagac cacctgttgg 8521 gaccatccta aaatgaccga actctttcaa tcccttgctg acctgaataa tgtacgtttt 8581 tctgcctacc gtacagcaat caaaatccga agactacaaa aagcactatg tttggatctc 8641 ttagagttga gtacaacaaa tgaaattttc aaacagcaca agttgaacca aaatgaccag 8701 ctcctcagtg ttccagatgt catcaactgt ctgacaacaa cttatgatgg acttgagcaa 8761 atgcataagg acctggtcaa cgttccactc tgtgttgata tgtgtctcaa ttggttgctc 8821 aatgtctatg acacgggtcg aactggaaaa attagagtgc agagtctgaa gattggatta 8881 atgtctctct ccaaaggtct cttggaagaa aaatacagat atctctttaa ggaagttgcg 8941 gggccgacag aaatgtgtga ccagaggcag ctgggcctgt tacttcatga tgccatccag 9001 atcccccggc agctaggtga agtagcagct tttggaggca gtaatattga gcctagtgtt 9061 cgcagctgct tccaacagaa taacaataaa ccagaaataa gtgtgaaaga gtttatagat 9121 tggatgcatt tggaaccaca gtccatggtt tggctcccag ttttacatcg agtggcagca 9181 gcggagactg caaaacatca ggccaaatgc aacatctgta aagaatgtcc aattgtcggg 9241 ttcaggtata gaagccttaa gcattttaac tatgatgtct gccagagttg tttcttttcg 9301 ggtcgaacag caaaaggtca caaattacat tacccaatgg tggaatattg tatacctaca 9361 acatctgggg aagatgtacg agacttcaca aaggtactta agaacaagtt caggtcgaag 9421 aagtactttg ccaaacaccc tcgacttggt tacctgcctg tccagacagt tcttgaaggt 9481 gacaacttag agactcctat cacactcatc agtatgtggc cagagcacta tgacccctca 9541 caatctcctc aactgtttca tgatgacacc cattcaagaa tagaacaata tgccacacga 9601 ctggcccaga tggaaaggac taatgggtct tttctcactg atagcagctc caccacagga 9661 agtgtggaag acgagcacgc cctcatccag cagtattgcc aaacactcgg aggagagtcc 9721 ccagtgagcc agccgcagag cccagctcag atcctgaagt cagtagagag ggaagaacgt 9781 ggagaactgg agaggatcat tgctgacctg gaggaagaac aaagaaatct acaggtggag 9841 tatgagcagc tgaaggacca gcacctccga agggggctcc ctgtcggttc accgccagag 9901 tcgattatat ctccccatca cacgtctgag gattcagaac ttatagcaga agcaaaactc 9961 ctcaggcagc acaaaggtcg gctggaggct aggatgcaga ttttagaaga tcacaataaa 10021 cagctggagt ctcagctcca ccgcctccga cagctgctgg agcagcctga atctgattcc 10081 cgaatcaatg gtgtttcccc atgggcttct cctcagcatt ctgcactgag ctactcgctt 10141 gatccagatg cctccggccc acagttccac caggcagcgg gagaggacct gctggcccca 10201 ccgcacgaca ccagcacgga tctcacggag gtcatggagc agattcacag cacgtttcca 10261 tcttgctgcc caaatgttcc cagcaggcca caggcaatgt ga // LOCUS HSMY1 2199 bp RNA PRI 16-FEB-1992 DEFINITION H.sapiens My1 (PML) mRNA. ACCESSION X63131 NID g34813 KEYWORDS My1 gene; nuclear protein; PML gene; retinoic acid receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2199) AUTHORS Kastner,P. TITLE Direct Submission JOURNAL Submitted (02-DEC-1991) P. Kastner, LGME - U. 184 - Faculte de Medicine, 11 Rue Humann, 67085 Strasbourg Cedex, FRANCE REFERENCE 2 (bases 1 to 2199) AUTHORS Kastner,P., Perez,A., Lutz,Y., Rochette-Egly,C., Gaub,M.P., Durand,B., Lanotte,M., Berger,R. and Chambon,P. TITLE Structure, localization and transcriptional properties of two classes of retinoic acid receptor alpha fusion proteins in acute promyelocytic leukemia (APL): structural similarities with a new family of oncoproteins JOURNAL EMBO J. 11 (2), 629-642 (1992) MEDLINE 92164652 COMMENT For related sequences see X61993, M73778-9 & De The et al., Cell 66:675-684(1991). FEATURES Location/Qualifiers source 1..2199 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="NB4" /chromosome="15" /map="q22" 5'UTR 1..141 gene 142..2043 /gene="My1 (PML)" CDS 142..2043 /gene="My1 (PML)" /codon_start=1 /db_xref="PID:g34814" /db_xref="SWISS-PROT:P29591" /translation="MEPAPARSPRPQQDPARPQEPTMPPPETPSEGRQPSPSPSPTER APASEEEFQFLRCQQCQAEAKCPKLLPCLHTLCSGCLEASGMQCPICQAPWPLGADTP ALDNVFFESLQRRLSVYRQIVDAQAVCTRCKESADFWCFECEQLLCAKCFEAHQWFLK HEARPLAELRNQSVREFLDGTRKTNNIFCSNPNHRTPTLTSIYCRGCSKPLCCSCALL DSSHSELKCDISAEIQQRQEELDAMTQALQEQDSAFGAVHAQMHAAVGQLGRARAETE ELIRERVRQVVAHVRAQERELLEAVDARYQRDYEEMASRLGRLDAVLQRIRTGSALVQ RMKCYASDQEVLDMHGFLRQALCRLRQEEPQSLQAAVRTDGFDEFKVRLQDLSSCITQ GKDAAVSKKASPEAASTPRDPIDVDLPEEAERVKAQVQALGLAEAQPMAVVQSVPGAH PVPVYAFSIKGPSYGEDVSNTTTAQKRKCSQTQCPRKVIKMESEEGKEARLARSSPEQ PRPSTSKAVSPPHLDGPPSPRSPVIGSEVFLPNSNHVASGAGEAEERVVVISSSEDSD AENSSSRELDDSSSESSDLQLEGPSTLRVLDENLADPQAEDRPLVFFDLKIDNESGFS WGYPHPFLI" misc_feature 1324 /gene="My1 (PML)" /note="fusion point with RAR-alpha in type B patients" /evidence=experimental misc_feature 1798 /gene="My1 (PML)" /note="fusion point with RAR-alpha in type A patients" /evidence=experimental 3'UTR 2044..2199 BASE COUNT 443 a 728 c 672 g 356 t ORIGIN 1 gctctccaga ggcgggccct gagccggcac ctcccctttc ggacagctca agggactcag 61 ccaactggct cacgcctccc cttcagcttc tcttcacgca ctccaagatc taaaccgaga 121 atcgaaacta agctggggtc catggagcct gcacccgccc gatctccgag gccccagcag 181 gaccccgccc ggccccagga gcccaccatg cctccccccg agaccccctc tgaaggccgc 241 cagcccagcc ccagccccag ccctacagag cgagcccccg cttcggagga ggagttccag 301 tttctgcgct gccagcaatg ccaggcggaa gccaagtgcc cgaagctgct gccttgtctg 361 cacacgctgt gctcaggatg cctggaggcg tcgggcatgc agtgccccat ctgccaggcg 421 ccctggcccc taggtgcaga cacacccgcc ctggataacg tctttttcga gagtctgcag 481 cggcgcctgt cggtgtaccg gcagattgtg gatgcgcagg ctgtgtgcac ccgctgcaaa 541 gagtcggccg acttctggtg ctttgagtgc gagcagctcc tctgcgccaa gtgcttcgag 601 gcacaccagt ggttcctcaa gcacgaggcc cggcccctag cagagctgcg caaccagtcg 661 gtgcgtgagt tcctggacgg cacccgcaag accaacaaca tcttctgctc caaccccaac 721 caccgcaccc ctacgctgac cagcatctac tgccgaggat gttccaagcc gctgtgctgc 781 tcgtgcgcgc tccttgacag cagccacagt gagctcaagt gcgacatcag cgcagagatc 841 cagcagcgac aggaggagct ggacgccatg acgcaggcgc tgcaggagca ggatagtgcc 901 tttggcgcgg ttcacgcgca gatgcacgcg gccgtcggcc agctgggccg cgcgcgtgcc 961 gagaccgagg agctgatccg cgagcgcgtg cgccaggtgg tagctcacgt gcgggctcag 1021 gagcgcgagc tgctggaggc tgtggacgcg cggtaccagc gcgactacga ggagatggcc 1081 agtcggctgg gccgcctgga tgctgtgctg cagcgcatcc gcacgggcag cgcgctggtg 1141 cagaggatga agtgctacgc ctcggaccag gaggtgctgg acatgcacgg tttcctgcgc 1201 caggcgctct gccgcctgcg ccaggaggag ccccagagcc tgcaagctgc cgtgcgcacc 1261 gatggcttcg acgagttcaa ggtgcgcctg caggacctca gctcttgcat cacccagggg 1321 aaagatgcag ctgtatccaa gaaagccagc ccagaggctg ccagcactcc cagggaccct 1381 attgacgttg acctgcccga ggaggcagag agagtgaagg cccaggttca ggccctgggg 1441 ctggctgaag cccagcctat ggctgtggta cagtcagtgc ccggggcaca ccccgtgcca 1501 gtgtacgcct tctccatcaa aggcccttcc tatggagagg atgtctccaa tacaacgaca 1561 gcccagaaga ggaagtgcag ccagacccag tgccccagga aggtcatcaa gatggagtct 1621 gaggagggga aggaggcaag gttggctcgg agctccccgg agcagcccag gcccagcacc 1681 tccaaggcag tctcaccacc ccacctggat ggaccgccta gccccaggag ccccgtcata 1741 ggaagtgagg tcttcctgcc caacagcaac cacgtggcca gtggcgccgg ggaggcagag 1801 gaacgcgttg tggtgatcag cagctcggaa gactcagatg ccgaaaactc gtcctcccga 1861 gagctggatg acagcagcag tgagtccagt gacctccagc tggaaggccc cagcaccctc 1921 agggtcctgg acgagaacct tgctgacccc caagcagaag acagacctct ggttttcttt 1981 gacctcaaga ttgacaatga aagtgggttc tcctggggct acccccaccc ctttctaatt 2041 tagtctctga gtcccaaaaa gaagtgcagg cagagcatct gccaggccca ggagagctct 2101 gagctctggc caacaactgc agccaggctg ggcagagcac tccggctcac ctgggctcct 2161 ggcgtgtcat ttgctggctt tgaataaaga tgtcgcctt // LOCUS HSMYELIN 757 bp DNA PRI 17-FEB-1997 DEFINITION H.sapiens gene for myelin protein zero. ACCESSION Z31718 NID g469516 KEYWORDS myelin; myelin protein zero. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 757) AUTHORS Rautenstrauss,B., Nelis,E., Grehl,H., Pfeiffer,R.A. and Van Broeckhoven,C. TITLE Identification of a de novo insertional mutation in P0 in a patient with a Dejerine-Sottas syndrome (DSS) phenotype JOURNAL Hum. Mol. Genet. 3 (9), 1701-1702 (1994) MEDLINE 95135435 REFERENCE 2 (bases 1 to 757) AUTHORS Rautenstrauss,B. TITLE Direct Submission JOURNAL Submitted (01-APR-1994) Rautenstrauss B., University of Erlangen, Institute of Human Genetics, Schwabachanlage 10, Erlangen, Bavaria, F.R.G., D-91045 FEATURES Location/Qualifiers source 1..757 /organism="Homo sapiens" /isolate="patient H7" /db_xref="taxon:9606" /dev_stage="infant" /tissue_type="muscle" /chromosome="1q22-23" /sex="Male" gene 1..756 /gene="P0" CDS 1..756 /gene="P0" /standard_name="Major Protein Zero MPZ, P0" /function="glycoprotein, transmembrane protein" /codon_start=1 /evidence=experimental /product="myelin protein zero" /db_xref="PID:g469517" /translation="MAPGAPSSSPSPILAVLLFSSLVLSPAQAIVVYTDREVHGAVGS RVTLHCSFWSSEWVSDDISFTWRYQPEGGRDAISIFHYAKGQPYIDEVGTFKERIQWV GDPRWKDGSIVIHNLDYSDNGTFTCDVKNPPDIVGKTSQVTLYVFEKVPTRYGVVLGA VIGGVLGVVLLLLLLFYVVRYCWLRRQAALQRRLSAMEKGKLHKPGKDASKRGRQTPV LYAQCWTTAEAPKLSVRRRPRGWGSLARIRNSG" mutation 663..664 /gene="P0" /note="cf. EMBL accession number D10537" /label=INS707GC /replace="" BASE COUNT 158 a 203 c 236 g 160 t ORIGIN 1 atggctcctg gggctccctc atccagcccc agccctatcc tggctgtgct gctcttctct 61 tctttggtgc tgtccccggc ccaggccatc gtggtttaca ccgacaggga ggtccatggt 121 gctgtgggct cccgggtgac cctgcactgc tccttctggt ccagtgagtg ggtctcagat 181 gacatctcct tcacctggcg ctaccagccc gaaggaggca gagatgccat ttcgatcttc 241 cactatgcca agggacaacc ctacattgac gaggtgggga ccttcaaaga gcgcatccag 301 tgggtagggg accctcgctg gaaggatggc tccattgtca tacacaacct agactacagt 361 gacaatggca cgttcacttg tgacgtcaaa aaccctccag acatagtggg caagacctct 421 caggtcacgc tgtatgtctt tgaaaaagtg ccaactaggt acggggtcgt tctgggagct 481 gtgatcgggg gtgtcctcgg ggtggtgctg ttgctgctgc tgcttttcta cgtggttcgg 541 tactgctggc tacgcaggca ggcggccctg cagaggaggc tcagtgctat ggagaagggg 601 aaattgcaca agccaggaaa ggacgcgtcg aagcgcgggc ggcagacgcc agtgctgtat 661 gcgcaatgct ggaccacagc agaagcacca aagctgtcag tgagaagaag gccaaggggc 721 tgggggagtc tcgcaaggat aagaaatagc ggttagc // LOCUS HSMYF4 1418 bp RNA PRI 27-SEP-1996 DEFINITION Human Myf-4 mRNA for myogenic determination factor. ACCESSION X17651 NID g34831 KEYWORDS developmental regulation; Myf gene; Myf-4 gene; myogenin; transcriptional activator. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1418) AUTHORS Braun,T., Bober,E., Buschhausen-Denker,G., Kohtz,S., Grzeschik,K.H. and Arnold,H.H. TITLE Differential expression of myogenic determination genes in muscle cells: possible autoactivation by the Myf gene products JOURNAL EMBO J. 8 (12), 3617-3625 (1989) MEDLINE 90059960 REMARK Erratum:[EMBO J 1989 Dec;8(13):4358] FEATURES Location/Qualifiers source 1..1418 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="skeletal muscle" /clone_lib="lambda gt11" /chromosome="12" misc_feature 1..170 /note="sequence derived from genomic DNA" CDS 53..793 /note="Myf-4 protein (AA 1-246)" /codon_start=1 /db_xref="PID:g34832" /db_xref="SWISS-PROT:P15173" /translation="MELYETSPYFYQEPRFYDGENYLPVHLQGFEPPGYERTELTLSP EAPGPLEDKGLGTPEHCPGQCLPWACKVCKRKSVSVDRRRAATLREKRRLKKVNEAFE ALKRSTLLNPNQRLPKVEILRSAIQYIERLQALLSSLNQEERDLRYRGGGGPSQGCPA NAALTAPPAVQSGAVHWSSAPTQGIICSRLTLQMPTTCTPSPPSWTASQWKMCLWPSQ MKPCPTEIVFQAGHPSSPPSWPQMPLLL" BASE COUNT 286 a 430 c 422 g 280 t ORIGIN 1 ggggctgctg gagcttgggg gctggtggca ggaacaagcc ctttccgacc ccatggagct 61 gtatgagaca tccccctact tctaccagga accccgcttc tatgatgggg aaaactacct 121 gcctgtccac ctccagggct tcgaaccacc aggctacgag cggacggagc tcaccctgag 181 ccccgaggcc ccagggcccc ttgaggacaa ggggctgggg acccccgagc actgtccagg 241 ccagtgcctg ccgtgggcgt gtaaggtgtg taagaggaag tcggtgtccg tggaccggcg 301 gcgggcggcc acactgaggg agaagcgcag gctcaagaag gtgaatgagg ccttcgaggc 361 cctgaagaga agcaccctgc tcaaccccaa ccagcggctg cccaaggtgg agatcctgcg 421 cagtgccatc cagtacatcg agcgcctcca ggccctgctc agctccctca accaggagga 481 gcgtgacctc cgctaccggg gcgggggcgg gcccagccag gggtgcccag cgaatgcagc 541 tctcacagcg cctcctgcag tccagagtgg ggcagtgcac tggagttcag cgccaaccca 601 ggggatcatc tgctcacggc tgaccctaca gatgcccaca acctgcactc cctcacctcc 661 atcgtggaca gcatcacagt ggaagatgtg tctgtggcct tcccagatga aaccatgccc 721 aactgagatt gtcttccaag ccgggcatcc ttcgagcccc ccaagctggc cacagatgcc 781 actacttctg tagcaggggc ctcctaagcc aggctgccct gatgctagga agccagctct 841 ggggtgccat aggccagact atccccttcc tcatccatgt aaggttaacc caccccccag 901 caagggactg gacgccctca ttcagctgcc tccttagagg agagggcatc cctttccagg 961 gaggtaaagc aggggaccag agcgccccct cgtgtatgcc ccagctcagg gggcaaactc 1021 aggagcttcc tttttatcat aacgcggcct ctaattccac cccccaagtg aaacggtttg 1081 agagacgccg tgccctgacc tggacaagct gtgcacgtct cctgttctgg tctcttcccg 1141 atgcagtggc tggctggcct gccctgaatt gagagagaag aagggggaga ggaacagccc 1201 tctgttccca agtcctgggg ggccaaactt ttgcagtgaa tattgggaac cttccagtgg 1261 ttttatgttt tgttttgttt cgtgtgttgt ttgtaaagct gccatccgac caaggtctcc 1321 tgtgctgaag ttgccgggga caggcaggga aaaggggttg gggcctcttg ggggtgattt 1381 cttttgttaa caaagcatcg tgtggttttg ccggaatt // LOCUS HSMYF5 1427 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for myogenic factor Myf-5. ACCESSION X14894 NID g34835 KEYWORDS Myf-5; myogenesis. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1427) AUTHORS Braun,T., Buschhausen-Denker,G., Bober,E., Tannich,E. and Arnold,H.H. TITLE A novel human muscle factor related to but distinct from MyoD1 induces myogenic conversion in 10T1/2 fibroblasts JOURNAL EMBO J. 8 (3), 701-709 (1989) MEDLINE 89251600 COMMENT library=lambda gt11; developmental stage=embryonic; tissue=skeletal muscle; Data kindly reviewed (02-NOV-1989) by Zerial M. FEATURES Location/Qualifiers source 1..1427 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 43..810 /note="Myf-5 (AA 1-255)" /codon_start=1 /db_xref="PID:g34836" /db_xref="SWISS-PROT:P13349" /translation="MDVMDGCQFSPSEYFYDGSCIPSPEGEFGDEFVPRVAAFGAHKA ELQGSDEDEHVRAPTGHHQAGHCLMWACKACKRKSTTMDRRKAATMRERRRLKKVNQA FETLKRCTTTNPNQRLPKVEILRNAIRYIESLQELLREQVENYYSLPGQSCSEPTSPT SNCSDGMPECNSPVWSRKSSTFDSIYCPDVSNVYATDKNSLSSLDCLSNIVDRITSSE QPGLPLQDLASLSPVASTDSQPRTPGASSSRLIYHVL" polyA_site 1427 /note="polyA site" BASE COUNT 397 a 331 c 296 g 403 t ORIGIN 1 cctctcgctg ccgtccaggt gcaccgcctg cctctcagca ggatggacgt gatggatggc 61 tgccagttct caccttctga gtacttctac gacggctcct gcataccgtc ccccgagggt 121 gaatttgggg acgagtttgt gccgcgagtg gctgccttcg gagcgcacaa agcagagctg 181 cagggctcag atgaggacga gcacgtgcga gcgcctaccg gccaccacca ggctggtcac 241 tgcctcatgt gggcctgcaa agcctgcaag aggaagtcca ccaccatgga tcggcggaag 301 gcagccacta tgcgcgagcg gaggcgcctg aagaaggtca accaggcttt cgaaaccctc 361 aagaggtgta ccacgaccaa ccccaaccag aggctgccca aggtggagat cctcaggaat 421 gccatccgct acatcgagag cctgcaggag ttgctgagag agcaggtgga gaactactat 481 agcctgccgg gacagagctg ctcggagccc accagcccca cctccaactg ctctgatggc 541 atgcccgaat gtaacagtcc tgtctggtcc agaaagagca gtacttttga cagcatctac 601 tgtcctgatg tatcaaatgt atatgccaca gataaaaact ccttatccag cttggattgc 661 ttatccaaca tagtggaccg gatcacctcc tcagagcaac ctgggttgcc tctccaggat 721 ctggcttctc tctctccagt tgccagcacc gattcacagc ctcgaactcc aggggcttct 781 agttccaggc ttatctatca tgtgctatga actaattttc tggtctatat gacttcttcc 841 aggagggcct aatacacagg acgaagaagg cttcaaaaag tcccaaacca agacaacatg 901 tacataaaga tttcttttca gttgtaaatt tgtaaagatt accttgccac tttataagaa 961 agtgtattta actaaaaagt catcattgca aataatactt tcttcttctt tattattctt 1021 tgcttagata ttaatacata gttccagtaa tactatttct gatagggggc cattgattga 1081 gggtagcttg ttcgaatgct taacttatat atacatatat atatattata aatattgctc 1141 atcaaaatgt ctctggtgtt tagagcttta tttttttctt taaaacatta aaacagctga 1201 gaatcagtta aatggaattt taaatatatt taactatttc ttttctcttt aatcctttag 1261 ttatattgta ttaaataaaa atataatact gcctaatgta tatattttga tcttttcttg 1321 taagaaatgt atcttttaaa tgtaagcaca aaatagtact ttgtggatca tttcaagata 1381 taagaaattt tggaaattcc accataaata aaatttttta ctacaag // LOCUS HSMYF6A 1294 bp RNA PRI 17-DEC-1992 DEFINITION H.sapiens MYF6 gene encoding a muscle determination factor. ACCESSION X52011 NID g34837 KEYWORDS muscle determination factor; myogenic determination factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1294) AUTHORS Braun,T., Bober,E., Winter,B., Rosenthal,N. and Arnold,H.H. TITLE Myf-6, a new member of the human gene family of myogenic determination factors: evidence for a gene cluster on chromosome 12 JOURNAL EMBO J. 9 (3), 821-831 (1990) MEDLINE 90183982 FEATURES Location/Qualifiers source 1..1294 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="skeletal muscle" /chromosome="12" gene 54..782 /gene="MYF6" CDS 54..782 /gene="MYF6" /codon_start=1 /product="muscle determination factor" /db_xref="PID:g34838" /db_xref="SWISS-PROT:P23409" /translation="MMMDLFETGSYFFYLDGENVTLQPLEVAEGSPLYPGSDGTLSPC QDQMPPEAGSDSSGEEHVLAPPGLQPPHCPGQCLIWACKTCKRKSAPTDRRKAATLRE RRRLKKINEAFEALKRRTVANPNQRLPKVEILRSAISYIERLQDLLHRLDQQEKMQEL GVDPFSYRPKQENLEGADFLRTCSSQWPSVSDHSRGLVITAKEGGASIDSSASSSLRC LSSIVDSISSEERKLPCVEEVVEK" BASE COUNT 332 a 294 c 338 g 330 t ORIGIN 1 gtttttgagt ccatcaccca gttcagatcg agtcagaggc caaggaggag aacatgatga 61 tggacctttt tgaaactggc tcctatttct tctacttgga tggggaaaat gttactctgc 121 agccattaga agtggcagaa ggctctcctt tgtatccagg gagtgatggt accttgtccc 181 cctgccagga ccaaatgccc ccggaagcgg ggagcgacag cagcggagag gaacatgtcc 241 tggcgccccc gggcctgcag cctccacact gccccggcca gtgtctgatc tgggcttgca 301 agacctgcaa gagaaaatct gcccccactg accggcgaaa agccgccacc ctgcgcgaaa 361 ggaggaggct aaagaaaatc aacgaggcct tcgaggcact gaagcggcga actgtggcca 421 accccaacca gaggctgccc aaggtggaga ttctgcggag cgccatcagc tatattgagc 481 ggctgcagga cctgctgcac cggctggatc agcaggagaa gatgcaggag ctgggggtgg 541 accccttcag ctacagaccc aaacaagaaa atcttgaggg tgcggatttc ctgcgcacct 601 gcagctccca gtggccaagt gtttccgatc attccagggg gctcgtgata acggctaagg 661 aaggaggagc aagtattgat tcgtcagcct cgagtagcct tcgatgcctt tcttccatcg 721 tggacagtat ttcctcggag gaacgcaaac tcccctgcgt ggaggaagtg gtggagaagt 781 aactgagcct gcgcttgaga ccttctccac gcagcaggaa gatcccaccg acccttcctg 841 gcctaatcct ttagattagg tcacattaca ttaacattta ggaacccaga ccgaaaagtt 901 gctgaaaggg aaggagacac attcacaaag aaaagttgcg aaaattgcga aatctgttgt 961 gcacgctcaa atgaaaacgc ctttcggctt tgggctttta tttttttgga actgcgagtg 1021 gcttaggtct agcctcattt tgtttttgtt tggttggttt tatactatat taacttttat 1081 tacggtgatc cttttgtgcc atgttcaaaa gaagttcatt cctgtctgaa gtgggaaagt 1141 tgcatttaat gttaggggta tttaatgtat ttttgtaaat agtttagcac tttctttttt 1201 tacgtaaacc tgaaatatat tttaaatgtg gaatgatgta tataaaatgt gcgaggatcc 1261 tggtattgta atattaaaaa gaagtttcta tatg // LOCUS HSMYHC 6032 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for embryonic myosin heavy chain. ACCESSION X13988 NID g34843 KEYWORDS myosin; myosin heavy chain. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6032) AUTHORS Eller,M.S. TITLE Direct Submission JOURNAL Submitted (11-JAN-1989) Eller M.S., Tufts University Health Sciences Center, Department of Anatomy and Cellular Biology, 136 Harrison Avenue, M&V 517, Boston, MA 02111 REFERENCE 2 (bases 1 to 6032) AUTHORS Eller,M., Stedman,H.H., Sylvester,J.E., Fertels,S.H., Rubinstein,N.A., Kelly,A.M. and Sarkar,S. TITLE Nucleotide sequence of full length human embryonic myosin heavy chain cDNA JOURNAL Nucleic Acids Res. 17 (9), 3591-3592 (1989) MEDLINE 89263803 FEATURES Location/Qualifiers source 1..6032 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="skeletal muscle" /clone_lib="lambda gt10" /clone="HEMHC-2" /chromosome="chromosome 17" CDS 85..5907 /note="embryonic myosin heavy chain (AA 1 - 1940)" /codon_start=1 /db_xref="PID:g34844" /db_xref="SWISS-PROT:P11055" /translation="MSSDTEMEVFGIAAPFLRKSEKERIEAQNQPFDAKTYCFVVDSK EEYAKGKIKSSQDGKVTVETEDNRTLVVKPEDVYAMNPPKFDRIEDMAMLTHLNEPAV LYNLKDRYTSWMIYTYSGLFCVTVNPYKWLPVYNPEVVEGYRGKKRQEAPPHIFSISD NAYQFMLTDRENQSILITGESGAGKTVNTKRVIQYFATIAATGDLAKKKDSKMKGTLE DQIISANPLLEAFGNAKTVRNDNSSRFGKFIRIHFGTTGKLASADIETYLLEKSRVTF QLKAERSYHIFYQILSNKKPELIELLLITTNPYDYPFISQGEILVASIDDREELLATD SAIDILGFTPEEKSGLYKLTGAVMHYGNMKFKQKQREEQAEPDGTEVADKTAYLMGLN SSDLLKALCFPRVKVGNEYVTKGQTVDQVHHAVNALSKSVYEKLFLWMVTRINQQLDT KLPRQHFIGVLDIAGFEIFEYNSLEQLCINFTNEKLQQFFNHHMFVLEQEEYKKEGIE WTFIDFGMDLAACIELIEKPMGIFSILEEECMFPKATDTSFKNKLYDQHLGKSNNFQK PKVVKGRAEAHFSLIHYAGTVDYSVSGWLEKNKDPLNETVVGLYQKSSNRLLAHLYAT FATADADSGKKKVAKKKGSSFQTVSALFRENLNKLMSNLRTTHPHFVRCIIPNETKTP GAMEHSLVLHQLRCNGVLEGIRICRKGFPNRILYGDFKQRYRVLNASAILEGQFIDSK KACEKLLASIDIDHTQYKFGHTKVFFKAGLLGTLEEMRDDRLAKLITRTQAVCRGFLM RVEFQKMVQRRESIFCIQYNIRSFMNVKHWPWMKLFFKIKPLLKSAETEKEMATMKEE FQKTKDELAKSEAKRKELEEKLVTLVQEKNDLQLQVQAESENLLDAEERCDQLIKAKF QLEAKIKEVTERAEDEEEINAELTAKKRKLEDECSELKKDIDDLELTLAKVEKEKHAT ENKVKNLTEELSGLDETIAKLTREKKALQEAHQQALDDLQAEEDKVNSLNKTKSKLEQ QVEDLESSLEQEKKLRVDLERNKRKLEGDLKLAQESILDLENDKQQLDERLKKKDFEY CQLQSKVEDEQTLGLQFQKKIKELQARIEELEEEIEAERATRAKTEKQRSDYARELEE LSERLEEAGGVTSTQIELNKKREAEFLKLRRDLEEATLQHEAMVATLRKKHADSVAEL GEQIDNLQRVKQKLEKEKSEFKLEIDDLSSSMESVSKSKANLEKICRTLEDQLSEARG KNEEIQRSLSELTTQKSRLQTEAGELSRQLEEKESIVSQLSRSKQAFTQQTEELKRQL EEENKAKNALAHALQSSRHDCDLLREQYEEEQEGKAELQRALSKANSEVAQWRTKYET DAIQRTEELEEAQEKLAQRLQDSEEQVEAVNAKCASLEKTKQRLQGEVEDLMVDVERA NSLAAALDKKQRNFDKVLAEWKTKCEESQAELEASLKESRSLSTELFKLKNAYEEALD QLETVKRENKNLEQEIADLTEQIAENGKTIHELEKSRKQIELEKADIQLALEEAEAAL EHEEAKILRIQLELTQVKSEIDRKIAEKDEEIEQLKRNYQRTVETMQSALDAEVRSRN EAIRLKKKMEGDLNEIEIQLSHANRQAAETLKHLRSVQGQLKDTQLHLDDALRGQEDL KEQLAIVERRANLLQAEVEELRATLEQTERARKLAEQELLDSNERVQLLHTQNTSLIH TKKKLETDLMQLQSEVEDASRDARNAEEKAKKAITDAAMMAEELKKEQDTSAHLERMK KNLEQTVKDLQHRLDEAEQLALKGGKKQIQKLETRIRELEFELEGEQKKNTESVKGLR KYERRVKELTYQSEEDRKNVLRLQDLVDKLQVKVKSYKRQAEEADEQANAHLTKFRKA QHELEEAEERADIAESQVNKLRAKTRDFTSSRMVVHESEE" misc_feature 6013..6018 /note="polyA signal" BASE COUNT 1815 a 1386 c 1733 g 1098 t ORIGIN 1 gaattccgtg ggcggaggtc tgggatctcc tggctgttgc tgtcttctgc tctcatcctg 61 caggtgggac tctcagctga caccatgagt agtgacactg aaatggaagt gttcggcata 121 gctgctcctt tcctccggaa gtcagaaaag gagaggatcg aggctcagaa ccagcccttt 181 gatgccaaga cgtattgctt cgtggtggac tcaaaggaag aatatgccaa ggggaaaatc 241 aagagttctc aggatgggaa ggtcactgtg gaaactgagg acaacaggac cctggtggtc 301 aaaccagagg atgtgtacgc catgaacccc cccaagttcg acaggatcga agacatggcc 361 atgctgacgc acctgaatga gccagccgtg ctgtacaacc tgaaggaccg ttacacatct 421 tggatgatct atacctactc aggcctcttc tgtgtcactg tcaaccccta caagtggctg 481 ccggtgtaca accccgaggt ggtggaaggc taccgaggca aaaagcgcca ggaggcccca 541 ccccacatct tctccatctc tgacaacgcc tatcagttca tgctgactga tcgtgaaaac 601 cagtccattc tgatcacggg agaatccggg gcaggaaaga ctgtgaacac caaacgggtc 661 atccagtact ttgcaacaat tgcagctact ggggacctgg ccaagaagaa ggactccaaa 721 atgaagggga ctctggaaga tcaaatcatc agtgccaatc ccctgctgga ggcctttggg 781 aacgccaaga ctgtgaggaa tgacaactcc tcccgttttg gcaagttcat ccgaatccat 841 tttggaacca ctgggaagct ggcctctgca gatattgaaa cttatcttct ggaaaaatca 901 agagtcactt tccagctgaa ggctgaaaga agctaccaca tcttctacca gattctttct 961 aacaagaagc ctgagctcat agagctgctg cttattacga ccaaccctta cgactacccg 1021 ttcattagcc agggggagat cctggtggcc agcatagatg atcgagagga gctgctggct 1081 acagacagcg ccattgacat cctgggcttc accccagaag agaaatctgg gctctacaag 1141 ctgacgggag ccgtgatgca ctacgggaac atgaagttca agcagaagca gcgagaggag 1201 caggccgagc cggatggcac agaagtggct gacaaaacag cctatctgat gggcctgaac 1261 tcttcggacc tcctaaaagc tttgtgcttt cctagagtga aagttgggaa tgagtacgtt 1321 accaaaggtc aaactgtgga tcaggttcac catgctgtga atgctctttc aaaatcagtt 1381 tatgaaaagt tgttcttgtg gatggtcact cgcattaacc agcaactgga tacgaagctt 1441 ccaagacaac acttcattgg tgttttggac attgcaggct ttgaaatctt tgagtataac 1501 agcctggagc agctgtgcat caacttcacc aatgagaaac tgcaacagtt tttcaaccac 1561 cacatgttcg tgctggagca ggaggagtac aagaaggaag gcatcgagtg gacgttcatt 1621 gacttcggga tggacctggc tgcctgcatc gagctcatcg agaagcctat gggcatcttc 1681 tccatcctgg aagaggagtg catgttcccc aaggcaacag acacctcctt caagaacaag 1741 ctgtatgacc agcatcttgg aaagtccaac aacttccaga agcccaaggt ggtcaaaggc 1801 agggcagagg ctcacttctc actgatccac tatgcgggca ccgtggacta cagtgtctca 1861 ggttggctgg agaagaacaa ggaccctctg aacgagactg tggttgggct gtaccagaag 1921 tcttccaaca ggctcctggc acacctctat gccacgtttg ccacggcgga tgctgacagt 1981 ggaaagaaga aagttgccaa gaagaagggt tcttccttcc aaactgtctc tgcccttttc 2041 agggaaaacc tgaacaagct gatgtcaaat ttaagaacta ctcaccctca ttttgtgcgt 2101 tgtataattc ccaatgaaac caaaactcca ggggctatgg aacacagcct tgttctgcac 2161 cagctgcggt gtaacggtgt cctggagggc atccgcatct gcaggaaagg gttcccaaac 2221 aggattctct atggagattt taaacaaaga taccgagtgc tgaatgccag tgcaatcctg 2281 gagggacaat tcattgacag caagaaagcc tgtgaaaagc ttctggcatc cattgatatt 2341 gaccacactc agtacaaatt tggacatacc aaggtgttct tcaaggctgg cttgctggga 2401 accctggaag agatgcggga tgaccgcctg gccaaactaa tcacccggac acaagctgtg 2461 tgcagagggt tcctcatgcg tgtggaattc cagaagatgg tgcagaggag ggagtccatc 2521 ttctgcatcc agtacaacat tcgctcattc atgaacgtca agcactggcc ctggatgaaa 2581 ctcttcttca agatcaagcc cctcctcaag agtgcggaga ctgagaaaga gatggccacc 2641 atgaaggaag aattccagaa aaccaaagat gaactcgcca agtcggaggc aaagaggaag 2701 gagctagagg aaaaactggt gactctggtc caagagaaaa atgacctgca gctccaagta 2761 caagctgaaa gcgaaaattt gttggatgct gaggaaagat gcgatcagct gatcaaagcc 2821 aaattccagc tcgaggccaa gatcaaggag gtgacagaga gagctgaaga tgaggaggag 2881 atcaatgctg agctgacggc caagaagagg aaactggagg atgaatgctc agagctcaag 2941 aaagacattg atgaccttga gttgaccctg gccaaggttg agaaggagaa gcatgccacg 3001 gagaacaagg ttaaaaacct tactgaggaa ctctccgggt tagatgaaac aattgcaaag 3061 ttaaccagag agaagaaggc cctccaagag gcgcaccagc aggccttgga tgacctccaa 3121 gctgaagaag acaaagtcaa ttctttgaac aaaaccaaga gcaaactgga acagcaagtg 3181 gaagacctgg aaagctccct agaacaagaa aagaagctcc gcgtagacct ggaaaggaac 3241 aaaaggaaat tggaaggaga cttgaagctt gctcaagagt ccatattaga tctggagaat 3301 gacaagcaac agctggacga aaggctcaag aagaaagatt ttgaatattg tcaacttcaa 3361 agcaaagtgg aagatgagca gacactgggc ctccagtttc agaagaaaat caaagagttg 3421 caggctcgaa ttgaggagct ggaagaggag atagaggcgg agagggccac ccgcgcgaag 3481 acagagaaac agcgcagcga ctatgcccgg gagctggagg agctgagcga gcggctggag 3541 gaggcgggag gcgtcacctc cacgcagata gagctcaaca agaagcggga ggctgagttc 3601 ctgaagctgc gcagggacct ggaggaggcc acactgcagc acgaagccat ggtggccacg 3661 ctgaggaaga agcatgcgga tagtgtggcc gagcttgggg agcagattga caacctgcag 3721 cgggtcaagc agaagctgga gaaggagaag agcgagttca agctggagat cgatgacctc 3781 tccagcagca tggagagtgt gtcgaaatct aaggcaaatc tggaaaaaat ctgccgaacc 3841 ctggaggatc agttaagtga ggccaggggc aagaatgagg aaattcagag gagcctgagc 3901 gagctgacca cacagaagtc tcgtttgcag accgaggctg gtgagctgag tcgtcagctg 3961 gaagaaaaag aaagcatagt atcccaactt tccaggagca agcaagcctt tacccagcaa 4021 acagaagagc tcaagaggca gctggaggaa gagaacaagg ccaagaacgc cctggcgcac 4081 gccctgcagt cctcccgcca cgactgtgac ctgctgcggg aacagtatga ggaggagcag 4141 gaaggcaaag ctgagctgca gagggcgctg tccaaggcca atagtgaggt tgcccagtgg 4201 agaaccaaat acgagacgga cgccatccag cgcacagaag agctggagga ggcccaagaa 4261 aaacttgctc agcgccttca agattccgag gaacaggttg aggcagtgaa tgctaaatgt 4321 gcttcactgg agaagaccaa gcagaggctg caaggagagg tggaggatct gatggttgat 4381 gttgaaagag ccaattcctt ggccgccgct ctggacaaga agcagaggaa ctttgacaag 4441 gtgttggcag agtggaagac aaagtgtgag gagagccaag cagagctgga ggcatccctg 4501 aaggagtccc gctccttgag cactgagctc ttcaaactga aaaatgccta cgaggaagcc 4561 ttagatcaac ttgaaactgt gaaacgggaa aataagaact tagagcagga gatagcagat 4621 ctcacagaac aaattgctga aaatggcaaa accatccatg aactggagaa atcaagaaag 4681 cagattgagc tggaaaaggc tgatatccag ctggctctcg aggaagcaga ggctgctctt 4741 gagcatgaag aagccaagat cctccgaatc cagcttgaat tgacacaagt gaaatcagaa 4801 attgatagaa agattgccga gaaggatgaa gagatcgagc agctgaagag gaactaccag 4861 agaacagtgg aaaccatgca gagcgccctg gacgccgagg tgcggagcag gaatgaagcc 4921 atccggctca agaagaagat ggagggggac ctgaatgaaa tcgagatcca gctgagccac 4981 gccaaccgcc aggcggcgga gaccctcaaa cacctcagga gtgtccaggg acagctgaag 5041 gatacgcagc tccacctgga tgatgccctc cgaggccagg aggacctgaa ggagcagctg 5101 gcgattgtgg agcgcagagc caacctgctg caggccgagg tggaggagct gcgggctact 5161 ctggagcaga cggagagggc ccggaaactg gcggaacagg agctcctgga ctccaacgag 5221 agggtgcagc tgctgcatac ccagaacacc agcctcatcc acaccaagaa gaagctggag 5281 acagacctca tgcagctcca gagtgaggta gaagatgcca gcagggatgc aaggaacgct 5341 gaggagaagg ccaagaaggc catcacggac gctgccatga tggcggagga gctgaagaag 5401 gagcaggaca ccagcgccca ccttgagcgg atgaagaaga acctggaaca gacggtgaag 5461 gacctgcagc atcgtctaga tgaggccgag cagctggcgc tgaagggcgg gaagaagcag 5521 atccagaaac tggagaccag gatccgagag ctggagtttg aacttgaggg agagcagaag 5581 aagaacacag agtctgttaa gggcctgagg aagtatgagc ggagggtcaa ggagctgacg 5641 taccagagtg aagaggacag gaagaatgtg ctgagattgc aggatctggt ggataaactg 5701 caagtgaaag tcaagtccta caagaggcag gcggaggagg ctgatgaaca agccaatgct 5761 catctcacca aattccggaa agctcagcat gagctggagg aggccgagga acgtgcggat 5821 atcgcagaat ctcaagtcaa caagctccgc gctaagactc gagacttcac ctccagcagg 5881 atggtggtcc acgagagtga agagtgagcc agcccttctg gagcaggagc aggacagaag 5941 atatgcaaaa tgtatatttt cttgattcct gaccattgat acttaatgtc catgtgactc 6001 tttttcacat gcaataaact ttgctttgtt tc // LOCUS HSMYOD 1692 bp RNA PRI 19-MAR-1991 DEFINITION Human MyoD mRNA. ACCESSION X56677 NID g34861 KEYWORDS MyoD gene; MyoD1 protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1692) AUTHORS Pearson-White,S.H. TITLE Direct Submission JOURNAL Submitted (19-NOV-1990) S.H. Pearson-White, UNIVERSITY OF VIRGINIA MEDICAL CENTRE, HSC MR-4 BLDG BOX 1131 ROOM 1127, CHARLOTTESVILLE, VA 22908, USA REFERENCE 2 (bases 1 to 1692) AUTHORS Pearson-White,S.H. TITLE Human MyoD: cDNA and deduced amino acid sequence JOURNAL Nucleic Acids Res. 19 (5), 1148 (1991) MEDLINE 91212198 FEATURES Location/Qualifiers source 1..1692 /organism="Homo sapiens" /isolate="patient HMP2" /db_xref="taxon:9606" /dev_stage="pediatric" /cell_type="myoblast" /cell_line="HMP2" /clone_lib="S. Pearson-White" /clone="cH431" /chromosome="11" gene 121..1080 /gene="MyoD" CDS 121..1080 /gene="MyoD" /codon_start=1 /db_xref="PID:g34862" /db_xref="SWISS-PROT:P15172" /translation="MELLSPPLRDVDLTAPDGSLCSFATTDDFYDDPCFDSPDLRFFE DLDPRLMHVGALLKPEEHSHFPAAVHPAPGAREDEHVRAPSGHHQAGRCLLWACKACK RKTTNADRRKAATMRERRRLSKVNEAFETLKRCTSSNPNQRLPKVEILRNAIRYIEGL QALLRDQDAAPPGAAAFYAPGPLPPGRGGEHYSGDSDASSPRSNCSDGMMDYSGPPSG ARRRNCYEGAYYNEAPSEPRPGKSAAVSSLDYLSSIVERISTESPAAPALLLADVPSE SPPRRQEAAAPSEGESSGDPTQSPDAAPQCPAGANPNPIYQVL" BASE COUNT 316 a 590 c 507 g 279 t ORIGIN 1 attcagactg ccagcacttt gctatctaca gccggggctc ccgagcggca gaaagttccg 61 gccactctct gccgcttggg ttgggcgaaa gccaggaccg tgccgcgcca ccgccaggat 121 atggagctac tgtcgccacc gctccgcgac gtagacctga cggcccccga cggctctctc 181 tgctcctttg ccacaacgga cgacttctat gacgacccgt gtttcgactc cccggacctg 241 cgcttcttcg aagacctgga cccgcgcctg atgcacgtgg gcgcgctcct gaaacccgaa 301 gagcactcgc acttccccgc ggcggtgcac ccggccccgg gcgcacgtga ggacgagcat 361 gtgcgcgcgc ccagcgggca ccaccaggcg ggccgctgcc tactgtgggc ctgcaaggcg 421 tgcaagcgca agaccaccaa cgccgaccgc cgcaaggccg ccaccatgcg cgagcggcgc 481 cgcctgagca aagtaaatga ggcctttgag acactcaagc gctgcacgtc gagcaatcca 541 aaccagcggt tgcccaaggt ggagatcctg cgcaacgcca tccgctatat cgagggcctg 601 caggctctgc tgcgcgacca ggacgccgcg ccccctggcg cagccgcctt ctatgcgccg 661 ggcccgctgc ccccgggccg cggcggcgag cactacagcg gcgactccga cgcgtccagc 721 ccgcgctcca actgctccga cggcatgatg gactacagcg gccccccgag cggcgcccgg 781 cggcggaact gctacgaagg cgcctactac aacgaggcgc ccagcgaacc caggcccggg 841 aagagtgcgg cggtgtcgag cctagactac ctgtccagca tcgtggagcg catctccacc 901 gagagccctg cggcgcccgc cctcctgctg gcggacgtgc cttctgagtc gcctccgcgc 961 aggcaagagg ctgccgcccc cagcgaggga gagagcagcg gcgaccccac ccagtcaccg 1021 gacgccgccc cgcagtgccc tgcgggtgcg aaccccaacc cgatatacca ggtgctctga 1081 gggggatgtg gccgcccaac cccgccaggg atggtgccct agggtccctc gcgcccaaaa 1141 gattgaactt aaatgccccc ctcccaacag cgctttaaaa gcgccatctc ttgaggtagg 1201 agaggcggag aactgaagtt tccgcccccc ccgacagggc aaggacacag cgcggttttt 1261 tccacgcagc acccttctcg gagacccatt gcgatggccg ctccgtgttc ctcggtgggc 1321 cagagctgaa ccttgagggg ctaggttcac gtttctcgcg ccctccatgg tgagaccctc 1381 gcagacctaa ccctgccccg ggatgcaccg gttatttggg ggggcgtgag acagtgcact 1441 ccggtcccaa atgtagcagg tgtaaccgta acccaccccc aacccgtttc ccggttcagg 1501 accacttttt gtaatacttt ttgtaatcta ttcctgtaaa taagagttcg tttgccagag 1561 aggagcccct ggggctgtat ttatctctga ggcagggtgt gtggtgctac agggaatttg 1621 tacgtttata ccgcaggcgg gcgagccgcg ggcgctcgct caggtgatca aaataaaggc 1681 gctaatttat aa // LOCUS HSMYOIB 3384 bp RNA PRI 03-APR-1997 DEFINITION H.sapiens mRNA for myosin-I beta. ACCESSION X98507 NID g1926310 KEYWORDS myosin I beta. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3384) AUTHORS Crozet,F. TITLE Direct Submission JOURNAL Submitted (17-JUN-1996) F. Crozet, Institut Pasteur, Unite de Genetique Moleculaire Humaine, 25 rue du Docteur Roux, F- 75724, PARIS, cedex 15, FRANCE REFERENCE 2 (bases 1 to 3384) AUTHORS Crozet,F., Amraoui,A.E., Blanchard,S., Lenoir,M., Ripoll,C., Vago,P., Hamel,C., Fizames,C., Levi-Acobas,F., Depetris,D., Mattei,M.G., Weil,D., Pujol,R. and Petit,C. TITLE Cloning of the genes encoding two murine and human cochlear unconventional type I myosins JOURNAL Genomics 40 (2), 332-341 (1997) MEDLINE 97237053 FEATURES Location/Qualifiers source 1..3384 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="kidney" /map="17p3.2-p13.3" CDS 66..3152 /codon_start=1 /product="myosin I beta" /db_xref="PID:e255559" /db_xref="PID:g1926311" /translation="MDSALTARDRVGVQDFVLLENFTSEAAFIENLRRRFRENLIYTY IGPVLVSVNPYRDLQIYSRQHMERYRGVSFYEVPPHLFAVADTVYRALRTERRDQAVM ISGESGAGKTEATKKLLQFYAETCPAPQRGGAVRDRLLQSNPVLEAFGNAKTLRNDNS SRFGKYMDVQFDFKGAPVGGHILSYLLEKSRVVHQNHGERNFHIFYQLLEGGEEETLR RLGLERNPQSYLYLVKGQCAKVSSINDKSDWKVVRKALTVIDFTEDEVEDLLSIVASV LHLGNIHFAANEDSNAQVTTENQLKYLTRLLSVEGSTLREALTHRKIIAKGEELLSPL NLEQAAYARNALAKAVYSRTFTWLVGKINRSLASKDVESPSWRSTTVLGLLDIYGFEV FQHNSFEQFCINYCNEKLQQLFIELPLKSEQEEYEAEGIAWEPVQYFNNKIICDLVEE KFKGIISILDEECLRPGEATDLTFLEKLEDTVKHHPHFLTHKLADQRTRKSLGRGEFR LLHYAGEVTYSVTGFLDKNNDLLFRNLKETMCSSKNPIMSQCFDRSELSDKKRPETVA TQFKMSLLQLVEILQSKEPAYVRCIKPNDAKQPGRFDEVLIRHQVKYLGLLENLRVRR AGFAYRRKYEAFLQRYKSLCPETWPTWAGRPQDGVAVLVRHLGYKPEEYKMGRTKIFI RFPKTLFATEDALEVRRQSLATKIQAAWRGFHWRQKFLRVKRSAICIQSWWRGTLGRR KAAKRKWAAQTIRRLIRGFILRHAPRCPENAFFLDHVRTSFLLNLRRQLPRNVLDTYW PTPPPALREASELLRELCIKNMVWKYCRSISPEWKQQLQQKAVASEIFKGKKDNYPQS VPRLFISTRLGTDEISPRVLQALGSEPIQYAVPVVKYDRKGYKPRSRQLLLTPNAVVI VEDAKVKQRIDYANLTGISVSSLSDSLFVLHVQRADIKQKGDVVLQSDHVIETLTKTA LSANRVNSININQGSITFAGGPGRDGTIDFTPGSELLITKAKNGHLAVVAPRLNYR" BASE COUNT 776 a 994 c 996 g 618 t ORIGIN 1 tccaagctga attcgcggcc gcgtcgacca cgccggccct gggcagtgac ggggttcggg 61 tgaccatgga cagtgcgctc accgcccgtg acagggtggg ggtgcaggat ttcgtgctgc 121 tggagaactt caccagcgag gccgccttca tcgagaacct acggcggcga tttcgggaga 181 atctcatcta cacctacatt ggccccgtcc tggtctctgt caatccctac cgggacctgc 241 agatctacag ccggcaacat atggagcgtt accgtggcgt cagcttctat gaagtgcccc 301 ctcacctgtt tgccgtggcg gacactgtgt accgagcact gcgcacggag cgtcgggacc 361 aggctgtgat gatctctggg gagagcgggg caggcaagac cgaagccacc aagaagctgc 421 tgcagttcta tgcagagacc tgcccagccc cccaacgcgg aggtgccgtg cgggaccggc 481 tgctacagag caacccggtg ctggaggcct ttggaaatgc caagaccctc cggaacgata 541 actccagcag gttcgggaag tacatggatg tgcagtttga cttcaagggt gcccccgtgg 601 gtggccacat cctcagttac ctcctggaaa agtcacgagt ggtgcaccag aatcatgggg 661 agcggaactt ccacatcttc taccagctgc tggagggggg cgaggaagaa actcttcgca 721 ggctgggctt ggaacggaac ccccagagct acctgtacct ggtgaagggc cagtgtgcca 781 aagtctcctc catcaacgac aagagtgact ggaaggtcgt caggaaggct ctgacagtca 841 ttgatttcac cgaggatgaa gtggaggacc tgctaagcat cgtggccagc gtccttcatt 901 tgggcaacat ccactttgct gccaacgagg acagcaatgc ccaggtcacc accgagaacc 961 agctcaagta tctgaccagg ctcctcagcg tggaaggctc gacgctgcga gaagccctga 1021 cacacaggaa gatcatcgcc aagggggaag agctcctgag cccgctgaac ctggaacagg 1081 ccgcgtacgc acgaaacgcc ctcgccaagg ctgtgtacag ccgcactttt acctggctcg 1141 tcgggaaaat caacaggtcg ctggcctcca aggacgtgga gagccccagc tggcggagca 1201 ccacggttct cgggctcctg gatatttatg gcttcgaagt gtttcagcat aacagctttg 1261 agcagttctg catcaattac tgcaacgaaa agctgcagca gctcttcatc gaactcccgc 1321 tcaagtcgga gcaggaggaa tacgaggcag agggcatcgc gtgggaaccc gtccagtatt 1381 tcaacaacaa aatcatctgt gatctggtgg aggagaagtt taagggcatc atctcgattt 1441 tggatgagga gtgtctgcgc ccgggggagg ccacagacct gaccttcctg gagaagctgg 1501 aggatactgt caagcaccat ccacacttcc tgacgcacaa gctggctgac cagaggacca 1561 ggaaatctct gggccgaggg gaattccgcc ttctgcacta tgcgggggag gtgacctaca 1621 gcgtgaccgg gtttctggac aaaaacaatg accttctctt ccggaacctt aaggagacca 1681 tgtgtagctc aaagaatccc attatgagcc agtgcttcga ccggagcgag ctcagtgaca 1741 agaagcggcc agagacggtc gccacccagt tcaagatgag cctcctgcag ctggtggaga 1801 tcctgcagtc taaggagccc gcctacgtcc gctgcatcaa acccaatgat gccaaacagc 1861 ccggccgctt tgacgaggtg ctgatccgcc accaggtgaa gtacctgggg ctgttggaaa 1921 acctgcgtgt gcgcagagct ggctttgcct atcgccgcaa atacgaagct ttcctgcaaa 1981 ggtacaagtc actgtgccca gagacgtggc ccacgtgggc aggacggccg caggatgggg 2041 tggctgtgct ggtccgacac ctgggctaca agccagaaga gtacaagatg ggcaggacca 2101 agatcttcat ccgcttcccc aagaccctgt ttgccacaga ggatgccctg gaggtccggc 2161 ggcagagcct ggccacaaag atccaagctg cctggagggg ctttcactgg cggcagaaat 2221 tcctccgggt gaagagatca gccatctgca tccagtcgtg gtggcgtgga acactgggcc 2281 ggaggaaggc agccaagagg aagtgggcgg cacagaccat ccggcggctc atccgaggct 2341 tcatcctgcg ccacgccccc cgctgccccg agaacgcctt cttcttggac catgtgcgca 2401 cgtctttttt gctaaacctg aggcggcagc tgccccggaa tgtcctggac acctactggc 2461 ccacgccccc acctgccctg cgagaggcct cagagcttct gcgggagttg tgcataaaga 2521 acatggtgtg gaaatactgc cggagtatca gccctgagtg gaagcagcag ctgcagcaga 2581 aggccgtggc tagtgagatc ttcaagggca agaaggataa ttaccctcag agtgtaccca 2641 ggctcttcat cagcactcgg cttggtacag atgagatcag cccccgagtg ctgcaggcct 2701 tgggctctga gcccattcag tatgcggtgc ctgttgtgaa atacgaccgc aagggctaca 2761 agcctcgctc ccggcagctg ctgctcacgc ccaacgccgt cgtcatcgtg gaggacgcca 2821 aagtcaagca gaggattgat tacgccaacc tgaccggaat ctctgtcagc agcctgagcg 2881 acagtctttt tgtgcttcat gtacagcgtg cggacataaa gcaaaaggga gatgtggtgc 2941 tgcagagtga ccacgtgatt gagacgctga ccaagacagc cctcagtgcc aaccgcgtga 3001 acagcatcaa catcaaccag ggcagcataa cgtttgcagg gggccccggc agggatggca 3061 ccattgactt cacacccggc tcggagctgc tcatcaccaa ggccaagaac gggcacctgg 3121 ctgtggtcgc cccacggctg aattatcggt gataaaggcg cccactggac catcccaacg 3181 cccaaagctt tgcttttctc ctcctcccct tcccagttac caaagagtcg aatttccaga 3241 cagggaccca gggacacccc gaagcccacc tgcaatttcc cacctcctgc ccatcccttt 3301 cttgagggag cagcaggggc caggagctac cccaggagtg ggccaggccg ggccacagca 3361 ataggaaagc cagggccaga gcga // LOCUS HSMYOL1 588 bp RNA PRI 26-JAN-1995 DEFINITION Human mRNA for ventricular myosin light chain 1. ACCESSION X07373 NID g34865 KEYWORDS myosin; myosin light chain. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 588) AUTHORS Hoffmann,E., Shi,Q.W., Floroff,M., Mickle,D.A., Wu,T.W., Olley,P.M. and Jackowski,G. TITLE Molecular cloning and complete nucleotide sequence of a human ventricular myosin light chain 1 JOURNAL Nucleic Acids Res. 16 (5), 2353 (1988) MEDLINE 88189843 REFERENCE 2 (bases 1 to 588) AUTHORS Jackowski,G. TITLE Direct Submission JOURNAL Submitted (30-MAY-1988) to the EMBL/GenBank/DDBJ databases COMMENT Data kindly reviewed (30-MAY-1988) by Jackowski G. FEATURES Location/Qualifiers source 1..588 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pCD HLVC1" CDS 1..588 /note="ventricular myosin light chain 1 (AA 1 - 195)" /codon_start=1 /db_xref="PID:g34866" /db_xref="SWISS-PROT:P08590" /translation="MAPKKPEPKKDDAKAAPKAAPAPEPPPEPERPKEVEFDASKIKI EFTPEQIEEFKEAFMLFDRTPKCEMKITYGQCGDVLRALGQNPTQAEVLRVLGKPRQE ELNTKMMDFETFLPMLQHISKNKDTGTYEDFVEGLRVFDKEGNGTVMGAELRHVLATL GERLTEDEVEKLMAGQEDSNGCINYEAFVKHIMSS" old_sequence 215 /note="c was a in [1]" /citation=[1] old_sequence 291..300 /note="gaagccaagac was aaaccgaagca in [1]" /citation=[1] old_sequence 512 /note="a was g in [1]" /citation=[1] BASE COUNT 156 a 153 c 173 g 106 t ORIGIN 1 atggccccca aaaagccaga gcccaagaag gatgatgcca aggcagcccc caaggcagct 61 ccagctcccg aacctccccc tgagcctgag cgccctaagg aggtcgagtt tgatgcttcc 121 aagatcaaga ttgagttcac acctgagcag attgaagagt tcaaggaagc cttcatgctg 181 ttcgaccgca cacccaagtg tgagatgaag atcacctacg ggcagtgtgg ggatgtcctg 241 cgggcgctgg gccagaaccc cacacaggca gaagtgctcc gtgtcctggg gaagccaaga 301 caggaagagc tcaataccaa gatgatggac tttgaaactt tcctgcctat gctccagcac 361 atttccaaga acaaggacac aggcacctat gaggacttcg tggaggggct gcgggtcttc 421 gacaaggagg gcaatggcac tgtcatgggt gctgagcttc gccacgtgct ggccacgctg 481 ggtgagaggc tgacagaaga cgaagtggag aagttgatgg ctgggcaaga ggactccaat 541 ggctgcatca actatgaagc atttgtgaag cacatcatgt ccagctaa // LOCUS HSMYOSIN 6010 bp RNA PRI 06-SEP-1995 DEFINITION H.sapiens mRNA for myosin. ACCESSION Z38133 NID g558668 KEYWORDS myosin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6010) AUTHORS Jullian,E.H., Kelly,A.M., Pompidou,A.J., Hoffman,R., Schiaffino,S., Stedman,H.H. and Rubinstein,N.A. TITLE Characterization of a human perinatal myosin heavy-chain transcript JOURNAL Eur. J. Biochem. 230 (3), 1001-1006 (1995) MEDLINE 95324556 REFERENCE 2 (bases 1 to 6010) AUTHORS Jullian,E.H. TITLE Direct Submission JOURNAL Submitted (14-OCT-1994) Eric H. Jullian, Department of Histology & Embryology, University, of Paris V - Rene Descartes, U.F.R.Medecine Cochin, 24, rue du faubourg Saint Jacques, Paris, 75014, France FEATURES Location/Qualifiers source 1..6010 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="clone HFMHC-1, clone HFMHC-2" /dev_stage="Fetus" /tissue_type="Muscle, skeletal" 5'UTR 1..73 CDS 74..5887 /codon_start=1 /product="Myosin" /db_xref="PID:g558669" /translation="MSASSDAEMAVFGERAPYLRKSEKERIEAQNKPFDAKTSVFVAE PKESYVKSTIQSKEGGKVTVKTEGGATLTVREDQVFPMNPPKYDKIEDMAMMTHLHEP GVLYNLKERYAAWMIYTYSGLFCVTVNPYKWLPVYKPEVVAAYRGKKRQEAPPHIFSI SDNAYQFMLTDRENQSILITGESGAGKTVNTKRVIQYFATIAVTGEKKKDESGKMQGT LEDQIISANPLLEAFGNAKTVRNDNSSRFGKFIRIHFGTTGKLASADIETYLLEKSRV TFQLKAERSYHIFYQITSNKKPDLIEMLLITTNPYDYAFVSQGEITVPSIDDQEELMA TDSAIDILGFTPEEKVSIYKLTGAVMHYGNMKFKQKQREEQAEPDGTEVADKAAYLQS LNSADLLKALCYPRVKVGNEYVTKGQTVQQVYNAVGALAKAVYEKMFLWMVTRINQQL DTKQPRQYFIGVLDIAGFEIFDFNSLEQLCINFTNEKLQQFFNHHMFVLEQEEYKKEG IEWTFIDFGMDLAACIELIEKPLGIFSILEEECMFPKATDTSFKNKLYDQHLGKSANF QKPKVVKGKAEAHFSLIHYAGTVDYNITGWLDKNKDPLNDTVVGLYQKSAMKTLASLF STYASAEADSSAKKGAKKKGSSFQTVSALFRENLNKLMTNLRSTHPHFVRCIIPNETK TPGAMEHELVLHQLRCNGVLEGIRICRKGFPSRILYGDFKQRYKVLNASAIPEGQFID SKKASEKLLASIDIDHTQYKFGHTKVFFKAGLLGLLEEMRDEKLAQIITRTQAVCRGF LMRVEYQKMLQRREALFCIQYNVRAFMNVKHWPWMKLFFKIKPLLKSAETEKEMATMK EEFQKTKDELAKSEAKRKELEEKMVTLLKEKNDLQLQVQSEADSLADAEERCEQLIKN KIQLEAKIKEVTERAEEEEEINAELTAKKRKLEDECSELKKDIDDLELTLAKVEKEKH ATENKVKNLTEEMAGLDETIAKLSKEKKALQETHQQTLDDLQAEEDKVNILTKAKTKL EQQVDDLEGSLEQEKKLRMDLERAKRKLEGDLKLAQESTMDMENDKQQLDEKLEKKEF EISNLISKIEDEQAVEIQLQKKIKELQARIEELGEEIEAERASRAKAEKQRSDLSREL EEISERLEEAGGATSAQVELNKKREAEFQKLRRDLEEATLQHEAMVAALRKKHADSMA ELGEQIDNLQRVKQKLEKEKSELKMETDDLSSNAEAISKAKGNLEKMCRSLEDQVSEL KTKEEEQQRLINDLTAQRARLQTEAGEYSRQLDEKDALVSQLSRSKQASTQQIEELKH QLEEETKAKNALAHALQSSRHDCDLLREQYEEEQEGKAELQRALSKANSEVAQWRTKY ETDAIQRTEELEEAKKKLAQRLQEAEEHVEAVNAKCASLEKTKQRLQNEVEDLMLDVE RSNAACAALDKKQRNFDKVLSEWKQKYEETQAELEASQKESRSLSTELFKVKNVYEES LDQLETLRRENKNLQQEISDLTEQIAEGGKQIHELEKIKKQVEQEKCEIQAALEEAEA SLEHEEGKILRIQLELNQVKSEVDRKIAEKDEEIDQLKRNHTRVVETMQSTLDAEIRS RNDALRVKKKMEGDLNEMEIQLNHANRLAAESLRNYRNTQGILKETQLHLDDALRGQE DLKEQLAIVERRANLLQAEIEELWATLEQTERSRKIAEQELLDASERVQLLHTQNTSL INTKKKLENDVSQLQSEVEEVIQESRNAEEKAKKAITDAAMMAEELKKEQDTSAHLER MKKNLEQTVKDLQHRLDEAEQLALKGGKKQIQKLEARVRELEGEVENEQKRNAEAVKG LRKHERRVKELTYQTEEDRKNVLRLQDLVDKLQAKVKSYKRQAEEAEEQSNANLSKFR KLQHELEEAEERAHIAESQVNKLRVKSREVHTKISAE" 3'UTR 5888..>6010 BASE COUNT 1956 a 1292 c 1587 g 1175 t ORIGIN 1 gtggaacact tctgaacctg catttttatc tggaactcca gaagcagaat cctttgctaa 61 ataaatcgca gccatgagtg cgagctcaga cgctgagatg gctgtttttg gcgaacgtgc 121 tccctacctt cgaaaatcag aaaaggagcg gattgaggcc caaaacaagc cgtttgatgc 181 taaaacatct gtctttgtgg cggagcccaa ggaatcctat gtgaagagca ctatacaaag 241 caaagaagga gggaaagtaa ccgtaaagac tgaaggtgga gcaactctaa ctgtcaggga 301 agaccaagtc ttccctatga accctccgaa atatgacaaa attgaggaca tggccatgat 361 gactcatcta cacgagcctg gagtgctgta caacctcaaa gagcgctatg cagcctggat 421 gatctacacc tactcaggcc tcttctgtgt caccgtcaac ccctacaagt ggctgccggt 481 gtacaagccc gaggtggtgg ctgcctacag aggcaaaaag cgccaggagg ccccgcccca 541 catcttctcc atctctgaca atgcctatca gttcatgttg actgatcgag agaatcagtc 601 catcctgatc accggagaat ctggtgccgg aaagactgtg aacaccaagc gtgtcatcca 661 atactttgca acaattgcag ttactggaga gaagaagaag gatgaatctg gcaaaatgca 721 ggggactctg gaagatcaaa tcatcagcgc caatccccta ctggaggcct ttggcaatgc 781 caaaaccgtg aggaatgaca actcctctcg ctttggtaaa ttcattagaa tccactttgg 841 tactacaggg aagctggcat ctgctgatat agaaacatat cttttagaaa agtccagagt 901 tactttccag ctaaaggcgg aaagaagcta ccatattttt tatcagatca cttccaataa 961 gaagccagat ctaattgaaa tgctcctgat caccaccaac ccatatgact atgccttcgt 1021 cagtcagggg gagatcacag ttcccagtat tgatgaccaa gaagagttga tggccactga 1081 tagtgccatt gacatcctgg gcttcactcc tgaagagaaa gtgtccatct ataaactcac 1141 aggggctgtg atgcattatg ggaacatgaa attcaagcaa aagcagcgtg aggagcaagc 1201 tgagccagat ggcacagaag tcgctgacaa ggcagcctat ctccagagtc tgaactctgc 1261 agacctactc aaagccctct gctaccctag ggtcaaggtt ggcaatgagt atgtcaccaa 1321 aggccagact gtgcagcagg tgtacaatgc ggtgggtgct ctggccaaag ccgtctacga 1381 gaagatgttc ctgtggatgg tcacccgcat caaccagcag ctggacacca agcagcccag 1441 gcagtacttc atcggggtct tggacattgc tggctttgaa atctttgatt ttaacagcct 1501 ggagcagctg tgcatcaact tcaccaacga gaaactgcaa cagtttttca accaccacat 1561 gtttgtgcta gagcaggagg agtacaagaa ggaaggcatc gagtggacgt tcattgactt 1621 tgggatggac ctggctgcct gcattgagct cattgagaag ccactgggca tcttctccat 1681 cctggaagag gagtgcatgt tccctaaggc aacggacacc tccttcaaga acaagctgta 1741 tgaccagcac ctgggcaagt ctgccaactt ccagaagccc aaggtggtca aaggcaaggc 1801 tgaggcccac ttctctctga ttcactatgc tggcactgtg gactacaaca ttactggctg 1861 gctggacaaa aataaggacc ccctgaatga tactgtggtt gggctgtacc agaagtctgc 1921 aatgaagact ctagccagtc tcttttccac gtatgctagt gctgaagcag atagcagcgc 1981 gaagaaaggt gctaagaaaa agggctcttc tttccagact gtgtctgccc ttttcaggga 2041 aaatttaaat aaattgatga cgaatctgag gagcacacac cctcacttcg tacggtgtat 2101 cattcccaat gaaaccaaaa ctcctggggc aatggaacat gaacttgtgt tgcaccagct 2161 gaggtgtaat ggtgtgctgg aaggcatccg catctgtagg aaaggattcc caagcagaat 2221 cttatatggt gatttcaaac aaagatacaa ggttttaaat gcaagtgcta ttccagaggg 2281 acagttcatt gacagcaaga aggcttctga gaaacttctt gcatctattg atattgatca 2341 tactcaatat aaatttggac ataccaaggt tttcttcaaa gctggacttc tgggtcttct 2401 ggaagaaatg agagatgaaa aattagccca aattataaca agaacacaag ctgtctgtag 2461 gggattccta atgagggtag aatatcagaa gatgttgcaa aggagagaag cacttttctg 2521 catccagtat aatgtccgtg ccttcatgaa cgtcaagcac tggccctgga tgaaactctt 2581 tttcaagatt aagcccctcc tcaagagtgc agagaccgag aaagagatgg ccaccatgaa 2641 ggaagaattc cagaaaacca aagatgaact cgccaagtca gaggcaaaac ggaaggagct 2701 agaggaaaaa atggtcactc tcttaaaaga gaaaaatgac ctgcaactcc aggttcaatc 2761 tgaagcagat agcttggctg atgcagagga aaggtgtgag caactgatta aaaacaaaat 2821 ccaacttgag gccaaaatca aagaggtgac tgaaagagct gaggaggagg aagagatcaa 2881 tgctgagctg acagccaaga agagaaaact ggaggatgaa tgttcagaac tcaagaaaga 2941 cattgatgac cttgagctga cactggccaa ggttgagaag gagaaacatg ccacggagaa 3001 caaggtgaaa aatcttacag aagagatggc aggcctggat gaaaccattg caaaactgtc 3061 caaggagaag aaggctctcc aagagaccca ccagcagacc ctggatgacc tgcaggcaga 3121 ggaggacaaa gtcaacatcc tgaccaaagc taaaaccaag ctagaacagc aagtggatga 3181 tcttgaaggg tctctggaac aagaaaagaa gcttcgaatg gatctagaaa gagcaaagcg 3241 gaaactggag ggtgacctca aattggccca agaatccaca atggatatgg aaaatgacaa 3301 acagcaactt gatgaaaagc ttgaaaagaa agaatttgaa atcagcaatt tgataagcaa 3361 aattgaagat gagcaagctg tagaaattca actacagaag aagatcaaag agttgcaggc 3421 ccgcattgag gagctggggg aagaaatcga ggcagagagg gcgtcccgag ccaaagcgga 3481 gaagcagcgc tctgacctct cccgggaact ggaggagatc agcgagaggc tggaagaagc 3541 cggtggggca acttctgctc aggtggaatt gaacaagaag cgggaggctg agtttcagaa 3601 actgcgcagg gacctggagg aggccaccct gcagcatgaa gctatggtgg ctgctcttcg 3661 gaagaagcac gcagacagta tggctgagct tggggagcag attgacaact tgcagcgggt 3721 caaacagaag ctggagaagg agaagagtga gctgaagatg gagactgatg acctcagcag 3781 taacgcagag gccatttcca aagccaaggg aaaccttgaa aagatgtgcc gctctctaga 3841 agatcaagtg agtgagctta agaccaagga agaggagcag cagcggctga tcaatgacct 3901 cacagcacag agagcgcgcc tgcagacaga agcgggtgaa tattctcgac aattagatga 3961 gaaagatgct ttagtctctc agctttcaag gagcaagcaa gcatctactc agcagattga 4021 agagctgaaa catcaactag aggaagaaac taaagccaag aacgccctgg cacacgccct 4081 gcagtcctcc cgccatgact gcgacctgct gcgggaacag tatgaggaag agcaggaagg 4141 caaagctgag ctgcagaggg cgctgtccaa ggccaacagt gaggttgccc agtggagaac 4201 caaatacgag acggatgcca tccagcgcac agaggagctg gaggaggcca agaaaaagtt 4261 ggcccagcgc ctgcaagaag ctgaggaaca tgtagaagct gtgaacgcca aatgtgcttc 4321 ccttgagaag acgaagcagc ggctccagaa tgaagttgaa gacctcatgc ttgatgtgga 4381 aaggtctaat gcagcctgtg cagcccttga taagaagcaa aggaactttg acaaggtcct 4441 atcagaatgg aagcagaagt atgaggaaac tcaggctgaa cttgaggcct cccagaagga 4501 gtcacgttct cttagcactg agctgttcaa ggtgaagaat gtctatgagg aatccctgga 4561 tcaactcgaa acgctaagaa gagaaaataa gaacttgcaa caggagattt ctgacctcac 4621 tgagcagatt gcagagggag gaaagcaaat tcatgaattg gagaaaataa agaagcaagt 4681 agaacaagag aaatgtgaaa ttcaggctgc tttagaggaa gcagaggcat ctcttgaaca 4741 tgaagaagga aagattctgc gtatccagct tgagttaaac caagtcaagt ctgaagttga 4801 tagaaaaatc gcagaaaagg atgaggaaat tgaccagctg aagagaaacc acactagagt 4861 cgtggagaca atgcagagca cgctggatgc agagattaga agcagaaatg atgctctgag 4921 agtcaagaag aaaatggaag gagatctgaa tgaaatggaa atccagctga accatgccaa 4981 tcgcttagct gcagagagtt taaggaacta caggaacacc caaggaatcc tgaaggaaac 5041 ccagctccac ctggatgatg ctctccgggg ccaggaggac ctcaaggaac agctggcaat 5101 tgtggagcgc agagccaacc tgctgcaggc tgagatcgag gagctgtggg ccactctgga 5161 acagacagag agaagcagga aaatcgccga acaggagctc ctggatgcca gtgagcgtgt 5221 ccagctcctc cacacccaga ataccagtct cattaacacc aagaagaaat tagaaaatga 5281 cgtttcccaa ctccaaagtg aagtggaaga agtaatccaa gaatcacgca atgcagaaga 5341 gaaagccaag aaggccatca ctgatgctgc catgatggct gaggagctga agaaggaaca 5401 ggacaccagc gcccacctgg agcggatgaa gaagaacctg gagcagacgg tgaaggacct 5461 gcagcatcgt ctagatgagg ccgagcagct ggcgctgaag ggtgggaaga agcagatcca 5521 gaaactggag gccagggtac gtgagcttga aggagaggtt gaaaatgaac agaaacgtaa 5581 tgcagaggct gttaaaggtt tacggaaaca tgagcgacga gtaaaagaac tcacctacca 5641 gactgaagaa gatcgcaaga atgttctcag gctgcaggac ttggtagata aattacaggc 5701 gaaggtgaaa tcatacaaga gacaagctga ggaggctgag gaacaatcca atgctaatct 5761 atctaaattc cgcaaactcc agcatgagct ggaggaggcc gaggaacggg ctcacattgc 5821 tgagtcccag gtcaacaaat tgcgagtgaa gagccgagag gttcacacaa aaatcagtgc 5881 agagtaaaca cacctgcctg atgctatcaa gaggctgaag aaagcgccaa atgtgctatt 5941 ttttggtcac ttgctttatg acgtttattt tcctgttaaa gctgaataaa taaaaactac 6001 agtaaatgta // LOCUS HSNACHR3B 1595 bp mRNA PRI 22-JAN-1998 DEFINITION H.sapiens mRNA for nicotinic acetylcholine receptor beta3 subunit precursor. ACCESSION Y08417 NID g1702909 KEYWORDS nAChR gene; nicotinic acetylcholine receptor beta-3 subunit. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1595) AUTHORS Groot Kormelink,P.J. and Luyten,W.H. TITLE Cloning and sequence of full-length cDNAs encoding the human neuronal nicotinic acetylcholine receptor (nAChR) subunits beta3 and beta4 and expression of seven nAChR subunits in the human neuroblastoma cell line SH-SY5Y and/or IMR-32 JOURNAL FEBS Lett. 400 (3), 309-314 (1997) MEDLINE 97162233 REFERENCE 2 (bases 1 to 1595) AUTHORS Groot Kormelink,P.J. TITLE Direct Submission JOURNAL Submitted (27-SEP-1996) P.J. Groot Kormelink, Janssen Research Foundation, Exp. Mol. Biol. Dept., Turnhoutseweg 30, B-2340 Beerse, BELGIUM FEATURES Location/Qualifiers source 1..1595 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="neuroblastoma" /tissue_type="pons" /cell_line="SH-SY5Y" 5'UTR 1..128 gene 129..1505 /gene="nAChRB3" CDS 129..1505 /gene="nAChRB3" /codon_start=1 /product="nicotinic acetylcholine receptor beta3 subunit precursor" /db_xref="PID:e274567" /db_xref="PID:g1702910" /db_xref="SWISS-PROT:Q05901" /translation="MLPDFMLVLIVLGIPSSATTGFNSIAENEDALLRHLFQGYQKWV RPVLHSNDTIKVYFGLKISQLVDVDEKNQLMTTNVWLKQEWTDHKLRWNPDDYGGIHS IKVPSESLWLPDIVLFENADGRFEGSLMTKVIVKSNGTVVWTPPASYKSSCTMDVTFF PFDRQNCSMKFGSWTYDGTMVDLILINENVDRKDFFDNGEWEILNAKGMKGNRRDGVY SYPFITYSFVLRRLPLFYTLFLIIPCLGLSFLTVLVFYLPSDEGEKLSLSTSVLVSLT VFLLVIEEIIPSSSKVIPLIGEYLLFIMIFVTLSIIVTVFVINVHHRSSSTYHPMAPW VKRLFLQKLPKLLCMKDHVDRYSSPEKEESQPVVKGKVLEKKKQKQLSDGEKVLVAFL EKAADSIRYISRHVKKEHFISQVVQDWKFVAQVLDRIFLWLFLIVSVTGSVLIFTPAL KMWLHSYH" sig_peptide 129..200 /gene="nAChRB3" mat_peptide 198..1502 /gene="nAChRB3" 3'UTR 1506..1595 BASE COUNT 433 a 374 c 319 g 469 t ORIGIN 1 tgttgctgtc ctcttgggtt ccacttcgga ttttgaaccc ctgtattttc ttttcaaaac 61 ccccttttcc aatggaaatg ctctgttgtt aaaaaggaag aaactgtctt tctgaaactg 121 acatcacgat gctcccagat tttatgctgg ttctcatcgt ccttggcatc ccttcctcag 181 ccaccacagg tttcaactca atcgccgaaa atgaagatgc cctcctcaga catttgttcc 241 aaggttatca gaaatgggtc cgccctgtat tacattctaa tgacaccata aaagtatatt 301 ttggattgaa aatatcccag cttgtagatg tggatgaaaa gaatcagctg atgacaacca 361 atgtgtggct caaacaggaa tggacagacc acaagttacg ctggaatcct gatgattatg 421 gtgggatcca ttccattaaa gttccatcag aatctctgtg gcttcctgac atagttctct 481 ttgaaaatgc tgacggccgc ttcgaaggct ccctgatgac caaggtcatc gtgaaatcaa 541 acggaactgt tgtctggacc cctcccgcca gctacaaaag ctcctgcacc atggacgtca 601 cgtttttccc gttcgaccga cagaactgct ccatgaagtt tggatcctgg acttatgatg 661 gcaccatggt tgacctcatt ttgatcaatg aaaatgtcga cagaaaagac ttcttcgata 721 acggagaatg ggaaatactg aacgcaaagg ggatgaaggg gaacagaagg gacggcgtgt 781 actcctatcc ctttatcacg tattccttcg tcctgagacg cctgccttta ttctataccc 841 tctttctcat catcccctgc ctggggctgt ctttcctaac agttcttgtg ttctatttac 901 cttcggatga aggagaaaaa ctttcattat ccacatcggt cttggtttct ctgacagttt 961 tccttttagt gattgaagaa atcatcccat cgtcttccaa agtcattcct ctcattggag 1021 agtacctgct gttcatcatg atttttgtga ccctgtccat cattgttacc gtgtttgtca 1081 ttaacgttca ccacagatct tcttccacgt accaccccat ggccccctgg gttaagaggc 1141 tctttctgca gaaacttcca aaattacttt gcatgaaaga tcatgtggat cgctactcat 1201 ccccagagaa agaggagagt caaccagtag tgaaaggcaa agtcctcgaa aaaaagaaac 1261 agaaacagct tagtgatgga gaaaaagttc tagttgcttt tttggaaaaa gctgctgatt 1321 ccattagata catttcgaga catgtgaaga aagaacattt tatcagccag gtagtacaag 1381 actggaaatt tgtagctcaa gttcttgacc gaatcttcct gtggctcttt ctgatagtgt 1441 cagtaacagg ctcggttctg atttttaccc ctgctttgaa gatgtggcta catagttacc 1501 attaggaatt taaaagacat aagactaaat tacaccttag acctgacatc tggctatcac 1561 acagacagaa tccaaatgca tgtgcttgtt ctacg // LOCUS HSNADCM 2058 bp RNA PRI 05-AUG-1994 DEFINITION H.sapiens NADP+ dependent cytoplasmic malic enzyme mRNA. ACCESSION X77244 NID g495122 KEYWORDS malic enzyme; NADP-dependent malic enzyme. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2058) AUTHORS Loeber,G., Dworkin,M.B., Infante,A. and Ahorn,H. TITLE Characterization of cytosolic malic enzyme in human tumor cells JOURNAL FEBS Lett. 344 (2-3), 181-186 (1994) MEDLINE 94244767 REFERENCE 2 (bases 1 to 2058) AUTHORS Loeber,G. TITLE Direct Submission JOURNAL Submitted (25-JAN-1994) G. Loeber, Ernst Boehringer Institut, Bender & Co., Dr. Boehringergasse 5 - 11, 1121 Vienna, AUSTRIA FEATURES Location/Qualifiers source 1..2058 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="white adipose tissue" CDS 1..1719 /EC_number="1.1.1.40" /codon_start=1 /product="malate dehydrogenase (oxaloacetate decarboxylating) (NADP+)" /db_xref="PID:g495123" /db_xref="SWISS-PROT:P48163" /translation="MEPEAPRRRHTHQRGYLLTRNPHLNKDLAFTLEERQQLNIHGLL PPSFNSQEIQVLRVVKNFEHLNSDFDRYLLLMDLQDRNEKLFYRVLTSDIEKFMPIVY TPTVGLACQQYSLVFRKPRGLFITIHDRGHIASVLNAWPEDVIKAIVVTDGERILGLG DLGCNGMGIPVGKLALYTACGGMNPQECLPVILDVGTENEELLKDPLYIGLRQRRVRG SEYDDFLDEFMEAVSSKYGMNCLIQFEDFANVNAFRLLNKYRNQYCTFNDDIQGTASV AVAGLLAALRITKNKLSDQTILFQGAGEAALGIAHLIVMALEKEGLPKEKAIKKIWLV DSKGLIVKGRASLTQEKEKFAHEHEEMKNLEAIVQEIKPTALIGVAAIGGAFSEQILK DMAAFNERPIIFALSNPTSKAECSAEQCYKITKGRAIFASGSPFDPVTLPNGQTLYPG QGNNSYVFPGVALGVVACGLRQITDNIFLTTAEVIAQQVSDKHLEEGRLYPPLNTIRD VSLKIAEKIVKDAYQEKTATVYPEPQNKEAFVRSQMYSTDYDQILPDCYSWPEEVQKI QTKVDQ" polyA_site 2045..2050 BASE COUNT 628 a 407 c 437 g 586 t ORIGIN 1 atggagcccg aagccccccg tcgccgccac acccatcagc gcggctacct gctgacacgg 61 aaccctcacc tcaacaagga cttggccttt accctggaag agagacagca attgaacatt 121 catggattgt tgccaccttc cttcaacagt caggagatcc aggttcttag agtagtaaaa 181 aatttcgagc atctgaactc tgactttgac aggtatcttc tcttaatgga tctccaagat 241 agaaatgaaa aactctttta tagagtgctg acatctgaca ttgagaaatt catgcctatt 301 gtttatactc ccactgtggg tctggcttgc caacaatata gtttggtgtt tcggaagcca 361 agaggtctct ttattactat ccacgatcga gggcatattg cttcagttct caatgcatgg 421 ccagaagatg tcatcaaggc cattgtggtg actgatggag agcgtattct tggcttggga 481 gaccttggct gtaatggaat gggcatccct gtgggtaaat tggctctata tacagcttgc 541 ggagggatga atcctcaaga atgtctgcct gtcattctgg atgtgggaac cgaaaatgag 601 gagttactta aagatccact ctacattgga ctacggcaga gaagagtaag aggttctgaa 661 tatgatgatt ttttggacga attcatggag gcagtttctt ccaagtatgg catgaattgc 721 cttattcagt ttgaagattt tgccaatgtg aatgcatttc gtctcctgaa caagtatcga 781 aaccagtatt gcacattcaa tgatgatatt caaggaacag catctgttgc agttgcaggt 841 ctccttgcag ctcttcgaat aaccaagaac aaactgtctg atcaaacaat actattccaa 901 ggagctggag aggctgccct agggattgca cacctgattg tgatggcctt ggaaaaagaa 961 ggtttaccaa aagagaaagc catcaaaaag atatggctgg ttgattcaaa aggattaata 1021 gttaagggac gtgcttcctt aacacaagag aaagagaagt ttgcccatga acatgaagaa 1081 atgaagaacc tagaagccat tgttcaagaa ataaaaccaa ctgccctcat aggagttgct 1141 gcaattggtg gtgcattctc agaacaaatt ctcaaagata tggctgcctt caatgaacgg 1201 cctattattt ttgctttgag taatccaact agcaaagcag aatgttctgc agagcagtgc 1261 tacaaaataa ccaagggacg tgcaattttt gccagtggca gtccttttga tccagtcact 1321 cttccaaatg gacagaccct atatcctggc caaggcaaca attcctacgt gttccctgga 1381 gttgctcttg gtgttgtggc gtgtggattg aggcagatca cagataatat tttcctcact 1441 actgctgagg ttatagctca gcaagtgtca gataaacact tggaagaggg tcggctttat 1501 cctcctttga ataccattag agatgtttct ctgaaaattg cagaaaagat tgtgaaagat 1561 gcataccaag aaaagacagc cacagtttat cctgaaccgc aaaacaaaga agcatttgtc 1621 cgctcccaga tgtatagtac tgattatgac cagattctac ctgattgtta ttcttggcct 1681 gaagaggtgc agaaaataca gaccaaagtt gaccagtagg ataatagcaa acatttctaa 1741 ctctattaat gaggtcttta aacctttcat aatttttaaa ggttggaatc ttttataatg 1801 attcataaga cacttagatt aagattttac tttaacagtc taaaaattga tagaagaata 1861 tcgatataaa ttgggataaa catcacatga gacaattttg cttcactttg ccttctggtt 1921 atttatggtt tctgtctgaa ttattctgcc tacgttctct ttaaaagctg ttgtacgtac 1981 tacggagaaa ctcatcattt ttatacagga cactaatggg aagaccaaaa ttactaataa 2041 attgaaataa ccaacatt // LOCUS HSNADHMF 352 bp RNA PRI 21-JUL-1997 DEFINITION H.sapiens mRNA for NADH dehydrogenase. ACCESSION X81900 NID g2274973 KEYWORDS NADH dehydrogenase; NADH oxidoreductase subunit MWFE. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 352) AUTHORS Frattini,A. TITLE Direct Submission JOURNAL Submitted (14-AUG-1996) A. Frattini, ITBA CNR, Via Ampere 56, I-20131 Milan, ITALY COMMENT Related sequence: X63222. FEATURES Location/Qualifiers source 1..352 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /map="q24-25" /tissue_type="liver" exon <1..119 /gene="ND" /number=1 mRNA <1..>352 /gene="ND" gene 1..352 /gene="ND" CDS 18..230 /gene="ND" /EC_number="1.6.99.3" /codon_start=1 /product="NADH oxidoreductase subunit MWFE" /db_xref="PID:e276472" /db_xref="PID:g2274974" /translation="MWFEILPGLSVMGVCLLIPGLATAYIHRFTNGGKEKRVAHFGYH WSLMERDRRISGVDRYYVSKGLENID" exon 120..352 /gene="ND" /number=2 3'UTR 231..352 /gene="ND" polyA_site 337..342 /gene="ND" BASE COUNT 98 a 59 c 95 g 100 t ORIGIN 1 taggtaacgg ggcagagatg tggttcgaga ttctccccgg actctccgtc atgggcgtgt 61 gcttgttgat tccaggactg gctactgcgt acatccacag gttcactaac gggggcaagg 121 aaaaaagggt tgctcatttt gggtatcact ggagtctgat ggaaagagat aggcgcatct 181 ctggagttga tcgttactat gtgtcaaagg gtttggagaa cattgattaa ggaagcattt 241 tcctgattga tgaaaaaaat aactcagtta tggccatcta cccctgctag aaggttacag 301 tgtattatgt agcatgcaat gtgttatgta gtgcttaata aaaataaaat ga // LOCUS HSNAPI1 1549 bp RNA PRI 25-JAN-1994 DEFINITION H.sapiens mRNA for sodium-phophate transport system 1. ACCESSION X71355 NID g450531 KEYWORDS brush border membrane; NPT1 gene; renal sodium-dependent phosphate transporter; transporter. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1549) AUTHORS Chong,S.S., Kristjansson,K., Zoghbi,H.Y. and Hughes,M.R. TITLE Molecular cloning of the cDNA encoding a human renal sodium phosphate transport protein and its assignment to chromosome 6p21.3-p23 JOURNAL Genomics 18 (2), 355-359 (1993) MEDLINE 94117004 REFERENCE 2 (bases 1 to 1549) AUTHORS Chong,S.S. TITLE Direct Submission JOURNAL Submitted (02-APR-1993) S.S. Chong, Baylor College of Medicine, Institute for Molecular Genetics, One Baylor Plaza, Houston, Texas 77030, USA COMMENT Related sequence: M76466. FEATURES Location/Qualifiers source 1..1549 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /clone_lib="lambdaMAX kidney-cortex cDNA (Clontech)" /chromosome="6" /map="6p21.3-p23" gene 13..1416 /gene="NPT1" CDS 13..1416 /gene="NPT1" /codon_start=1 /product="Sodium-Phosphate Transport System 1" /db_xref="PID:g450532" /translation="MQMDNRLPPKKVPGFCSFRYGLSFLVHCCNVIITAQRACLNLTM VVMVNSTDPHGLPNTSTKKLLDNIKNPMYNWSPDIQGIILSSTSYGVIIIQVPVGYFS GIYSTKKMIGFALCLSSVLSLLIPPAAGIGVAWVVVCRAVQGAAQGIVATAQFEIYVK WAPPLERGRLTSMSTSGFLLGPFIVLLVTGVICESLGWPMVFYIFGACGCAVCLLWFV LFYDDPKDHPCISISEKEYITSSLVQQVSSSRQSLPIKAILKSLPVWAISIGSFTFFW SHNIMTLYTPMFINSMLHVNIKENGFLSSLPYLFAWICGNLAGQLSDFFLTRNILSVI AVRKLFTAAGFLLPAIFGVCLPYLSSTFYSIVIFLILAGATGSFCLGGVFINGLDIAP RYFGFIKACSTLTGMIGGLIASTLTGLILKQDPESAWFKTFILMAAINVTGLIFYLIV ATAEIQDWAKEKQHTRL" BASE COUNT 381 a 356 c 345 g 467 t ORIGIN 1 cttcagccgt gtatgcaaat ggataaccgg ttgcctccca aaaaagttcc aggtttctgt 61 tcctttcgct atggattgtc tttccttgtg cactgttgta atgttataat aacagcacag 121 cgtgcgtgcc tgaacctcac aatggtagtc atggtgaata gcacagatcc acatggtttg 181 cccaacacct ccacaaagaa gctcctggat aatataaaga accctatgta taattggagc 241 ccagatatcc agggaatcat cttgagttcc acctcctatg gtgtcatcat catccaagtt 301 cctgttggat acttctctgg aatatattct acaaagaaaa tgattggctt tgcattatgc 361 ctcagctctg tgttaagcct gctcatccca ccagcagctg gaattggagt agcttgggtc 421 gttgtatgtc gagcagttca gggagcagcc caggggatag ttgcaacagc ccagtttgaa 481 atatatgtca aatgggctcc tcccctggaa cgaggccgac ttacttctat gagtacatca 541 gggtttttgc tgggaccctt tattgtccta cttgtgactg gagttatctg tgaatctctg 601 ggctggccca tggtcttcta tatttttggt gcttgtggct gtgccgtatg tcttctctgg 661 ttcgttctgt tttatgatga ccccaaagac cacccatgta taagcatcag tgaaaaggaa 721 tacatcacat cctccctggt ccagcaggtc agttcaagta gacaatctct gcctatcaag 781 gctatactta agtcgcttcc agtctgggct atttccattg gtagttttac gtttttctgg 841 tcacataaca tcatgacact atacactcca atgtttatca actccatgct tcatgttaat 901 ataaaagaga atgggttctt gtcttccctt ccctatttgt ttgcctggat ctgtggtaac 961 ctagcaggtc agttatcaga cttcttcctg accaggaata ttctcagcgt aattgctgtc 1021 cggaaactct tcacagcagc aggatttctc cttcctgcaa tctttggtgt ctgcctgcct 1081 tacctgagtt ccaccttcta cagcattgtc attttcctaa tacttgctgg tgcaacaggc 1141 agcttttgct tgggtggagt gtttataaat ggcttggata ttgctcccag atattttgga 1201 tttattaaag catgttcaac tttaactgga atgataggag gactaattgc ttccactttg 1261 actggattga tccttaagca ggatccggaa tccgcctggt ttaaaacctt catcctgatg 1321 gcagccatta atgtgactgg cctaattttc taccttatag ttgctacagc agaaattcag 1381 gactgggcta aagaaaaaca acacacacgt ctctgaagtg tgaaacagag cacttgcaga 1441 gcctgggaca acctccttat tgaagggaag agggaccagc acatgaggct gaggctgagg 1501 ggcagtcacc agcaccagga agaaggtggt aggaggagtc ctaggggct // LOCUS HSNBKGENE 1217 bp RNA PRI 31-JUL-1995 DEFINITION H.sapiens mRNA for NBK apoptotic inducer protein. ACCESSION X89986 NID g929654 KEYWORDS apoptotic inducer; nbk gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1217) AUTHORS Pun,K.T., Farrow,S.N., Raven,T., Wride,C.J., White,J.H.M. and Brown,R. TITLE E1B-19K interacts with a novel apoptotic inducer, NBK JOURNAL Unpublished REFERENCE 2 (bases 1 to 1217) AUTHORS Pun,K. TITLE Direct Submission JOURNAL Submitted (25-JUL-1995) K. Pun, Glaxo Research & Development Ltd., Greenford Road, Greenford, Middlesex UB6 0HE, UK FEATURES Location/Qualifiers source 1..1217 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="lymphoid" /cell_type="B-lymphocyte" /clone_lib="EBV-transformed B-cell DNA" /clone="3.1" 5'UTR 1..334 /note="contains" gene 335..817 /gene="nbk" CDS 335..817 /gene="nbk" /codon_start=1 /product="NBK" /db_xref="PID:g929655" /translation="MSEVRPLSRDILMETLLYEQLLEPPTMEVLGMTDSEEDLDPMED FDSLECMEGSDALALRLACIGDEMDVSLRAPRLAQLSEVAMHSLGLAFIYDQTEDIRD VLRSFMDGFTTLKENIMRFWRSPNPGSWVSCEQVLLALLLLLALLLPLLSGGLHLLLK " polyA_signal 1200..1205 BASE COUNT 261 a 321 c 347 g 287 t 1 others ORIGIN 1 gaattccgtc caccatctga gtaacagaaa ttccagaaag aaaaaccaca gacagccggg 61 cctggtggct cacgcctgta atcccagcac tttgggaggc caaggcaggc ggatcacccg 121 aggtcacgag tttgagacca gcctgaccaa caaggggaaa atctgcctct actaaaaata 181 caaaaattag ctgggcgtgg tggcgagcac ctgtagtccc agctactccg gagtctgagt 241 caggagaatc gcctgaacct gggaggcaga agttgttgtg agccanggtc aagggcttac 301 agacgctgcc agcatcgccg ccgccagagg agaaatgtct gaagtaagac ccctctccag 361 agacatcttg atggagaccc tcctgtatga gcagctcctg gaacccccga ccatggaggt 421 tcttggcatg actgactctg aagaggacct ggaccctatg gaggacttcg attctttgga 481 atgcatggag ggcagtgacg cattggccct gcggctggcc tgcatcgggg acgagatgga 541 cgtgagcctc agggccccgc gcctggccca gctctccgag gtggccatgc acagcctggg 601 tctggctttc atctacgacc agactgagga catcagggat gttcttagaa gtttcatgga 661 cggtttcacc acacttaagg agaacataat gaggttctgg agatccccga accccgggtc 721 ctgggtgtcc tgcgaacagg tgctgctggc gctgctgctg ctgctggcgc tgctgctgcc 781 gctgctcagc gggggcctgc acctgctgct caagtgaggc cccggcggct cagggcgggg 841 ctggccccac ccccatgacc actgccctgg aggtggcggc ctgctgctgt tatcttttta 901 actgttttct catgatgcct ttttatattt aaaccccgag atagtgctgg aacactgctg 961 aggttttata ctcaggtttt ttgttttttt tttattccag ttttcgtttt ttctaaaaga 1021 tgaattccta tggctctgca attgtcaccg gttaactgtg gcctgtgttt aggaagagcc 1081 attcactcct gccctgccac acggcaggta gcagggggag tgctggtacg cccctgtgtg 1141 atatgttgat ccctcggcaa agaatctact ggaatagatt ccgaggagca ggtgtgctca 1201 ataaaatgtt ggtttcc // LOCUS HSNC2ALPH 724 bp RNA PRI 14-AUG-1996 DEFINITION H.sapiens mRNA for NC2 alpha subunit. ACCESSION X96506 NID g1491709 KEYWORDS alpha subunit; NC2 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 724) AUTHORS Goppelt,A., Stelzer,G., Lottspeich,F. and Meisterernst,M. TITLE A mechanism for repression of class II gene transcription through specific binding of NC2 to TBP-promoter complexes via heterodimeric histone fold domains JOURNAL EMBO J. 15 (12), 3105-3116 (1996) MEDLINE 96272170 REFERENCE 2 (bases 1 to 724) AUTHORS Goppelt,A.R. TITLE Direct Submission JOURNAL Submitted (08-MAR-1996) A.R. Goppelt, Ludwig-Maximilians-Universitaet Muenchen, Laboratorium fuer Molekulare Biologie, Wuermtalstr. 221, D-81375 Muenchen, FRG FEATURES Location/Qualifiers source 1..724 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="B-cell" gene 1..618 /gene="NC2" CDS 1..618 /gene="NC2" /function="repressor of class II transcription" /note="alpha subunit; forms heterodimer with NC2 alpha/Dr1" /codon_start=1 /db_xref="PID:e229494" /db_xref="PID:g1491710" /translation="MPSKKKKYNARFPPARIKKIMQTDEEIGKVAAAVPVIISRALEL FLESLLKKACQVTQSRNAKTMTTSHLKQCIELEQQFDFLKDLVASVPDMQGDGEDNHM DGDKGARRGRKPGSGGRKNGGMGTKSKDKKLSGTDSEQEDESEDTDTDGEEETSQPPP QASHPSAHFQSPPTPFLPFASTLPLPPAPPGPSAPDEEDEEDYDS" BASE COUNT 186 a 205 c 218 g 115 t ORIGIN 1 atgccgagca agaagaagaa gtacaacgcg cggttcccgc cggcgcggat caagaagatc 61 atgcagacgg acgaagagat tgggaaggtg gcggcggcgg tgcctgtcat catctcccgg 121 gcgctcgagc tcttcctaga gtcgctgttg aagaaggcct gccaggtgac ccagtcgcgg 181 aacgcgaaga ccatgaccac atcccacctg aagcagtgca tcgagctgga gcagcagttt 241 gacttcttga aggacctggt ggcatctgtt cccgacatgc agggggacgg ggaagacaac 301 cacatggatg gggacaaggg cgcccgcagg ggccggaagc caggcagcgg cggccggaag 361 aacggtggga tgggaacgaa aagcaaggac aagaagctgt ccgggacaga ctcggagcag 421 gaggatgaat ctgaggacac agatactgat ggggaagagg agacatcaca acccccaccc 481 caggccagcc acccctctgc ccactttcag agccccccga cacccttcct gcccttcgcc 541 tctactctgc ctttgccccc agcgcccccg ggcccctcag cacctgatga agaggacgaa 601 gaagattacg actcctagcg ccttctgccc cccagaccat agcccctttt agttggtttt 661 agttgctctg gggggaggag agaaggtaga gctgttctta aatttattaa aaaaaaaaaa 721 aaaa // LOCUS HSNCAME 2799 bp RNA PRI 22-MAR-1995 DEFINITION Human mRNA for a nontransmembrane isoform of N-CAM from skeletal muscle. ACCESSION X16841 M36948 NID g35005 KEYWORDS developmental regulation; glycoprotein; neural cell adhesion molecule; phoshatidylinositol-linked glycoprotein; sialoglycoprotein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2799) AUTHORS Barton,C.H., Dickson,G., Gower,H.J., Rowett,L.H., Putt,W., Elsom,V., Moore,S.E., Goridis,C. and Walsh,F.S. TITLE Complete sequence and in vitro expression of a tissue-specific phosphatidylinositol-linked N-CAM isoform from skeletal muscle JOURNAL Development 104 (1), 165-173 (1988) MEDLINE 89305258 COMMENT Data kindly reviewed (15-FEB-1990) by Barton C.H. FEATURES Location/Qualifiers source 1..2799 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="skeletal muscle" /clone_lib="lambda gt 11" /clone="CHB1" sig_peptide 146..202 /note="signal peptide (AA -19 to -1)" CDS 146..2431 /note="precursor protein (-19 to 742)" /codon_start=1 /db_xref="PID:g35006" /db_xref="SWISS-PROT:P13592" /translation="MLQTKDLIWTLFFLGTAVSLQVDIVPSQGEISVGESKFFLCQVA GDAKDKDISWFSPNGEKLTPNQQRISVVWNDDSSSTLTIYNANIDDAGIYKCVVTGED GSESEATVNVKIFQKLMFKNAPTPQEFREGEDAVIVCDVVSSLPPTIIWKHKGRDVIL KKDVRFIVLSNNYLQIRGIKKTDEGTYRCEGRILARGEINFKDIQVIVNVPPTIQARQ NIVNATANLGQSVTLVCDAEGFPEPTMSWTKDGEQIEQEEDDEKYIFSDDSSQLTIKK VDKNDEAEYICIAENKAGEQDATIHLKVFAKPKITYVENQTAMELEEQVTLTCEASGD PIPSITWRTSTRNISSEEKTLDGHMVVRSHARVSSLTLKSIQYTDAGEYICTASNTIG QDSQSMYLEVQYAPKLQGPVAVYTWEGNQVNITCEVFAYPSATISWFRDGQLLPSSNY SNIKIYNTPSASYLEVTPDSENDFGNYNCTAVNRIGQESLEFILVQADTPSSPSIDQV EPYSSTAQVQFDEPEATGGVPILKYKAEWRAVGEEVWHSKWYDAKEASMEGIVTIVGL KPETTYAVRLAALNGKGLGEISAASEFKTQPVHSPPPPASASSSTPVPLSPPDTTWPL PALATTEPAKGEPSAPKLEGQMGEDGNSIKVNLIKQDDGGSPIRHYLVRYRALSSEWK PEIRLPSGSDHVMLKSLDWNAEYEVYVVAENQQGKSKAAHFVFRTSAQPTAIPATLGG NSASYTFVSLLFSAVTLLLLC" mat_peptide 203..2428 /note="mature phosphatidylinositol linked N-CAM (AA 1-742)" BASE COUNT 744 a 752 c 698 g 605 t ORIGIN 1 gaattccttt ccaaaaataa tcatactcag cctggcaatt gtctgcccct aggtctgtcg 61 ctcagccgcc gtccacactc gctgcagggg gggggggcac agaatttacc gcggcaagaa 121 catccctccc agccagcaga ttacaatgct gcaaactaag gatctcatct ggactttgtt 181 tttcctggga actgcagttt ctctgcaggt ggatattgtt cccagccagg gggagatcag 241 cgttggagag tccaaattct tcttatgcca agtggcagga gatgccaaag ataaagacat 301 ctcctggttc tcccccaatg gagaaaagct caccccaaac cagcagcgga tctcagtggt 361 gtggaatgat gattcctcct ccaccctcac catctataac gccaacatcg acgacgccgg 421 catttacaag tgtgtggtta caggcgagga tggcagtgag tcagaggcca ccgtcaacgt 481 gaagatcttt cagaagctca tgttcaagaa tgcgccaacc ccacaggagt tccgggaggg 541 ggaagatgcc gtgattgtgt gtgatgtggt cagctccctc ccaccaacca tcatctggaa 601 acacaaaggc cgagatgtca tcctgaaaaa agatgtccga ttcatagtcc tgtccaacaa 661 ctacctgcag atccggggca tcaagaaaac agatgaaggc acttatcgct gtgagggcag 721 aatcctggca cggggggaga tcaacttcaa ggacattcag gtcattgtga atgtgccacc 781 taccatccag gccaggcaga atattgtgaa tgccaccgcc aacctcggcc agtccgtcac 841 cctggtgtgc gatgccgaag gcttcccaga gcccaccatg agctggacaa aggatgggga 901 acagatagag caagaggaag acgatgagaa gtacatcttc agcgacgata gttcccagct 961 gaccatcaaa aaggtggata agaacgacga ggctgagtac atctgcattg ctgagaacaa 1021 ggctggcgag caggatgcga ccatccacct caaagtcttt gcaaaaccca aaatcacata 1081 tgtagagaac cagactgcca tggaattaga ggagcaggtc actcttacct gtgaagcctc 1141 cggagacccc attccctcca tcacctggag gacttctacc cggaacatca gcagcgaaga 1201 aaagactctg gatgggcaca tggtggtgcg tagccatgcc cgtgtgtcgt cgctgaccct 1261 gaagagcatc cagtacactg atgccggaga gtacatctgc accgccagca acaccatcgg 1321 ccaggactcc cagtccatgt accttgaagt gcaatatgcc ccaaagctac agggccctgt 1381 ggctgtgtac acttgggagg ggaaccaggt gaacatcacc tgcgaggtat ttgcctatcc 1441 cagtgccacg atctcatggt ttcgggatgg ccagctgctg ccaagctcca attacagcaa 1501 tatcaagatc tacaacaccc cctctgccag ctatctggag gtgaccccag actctgagaa 1561 tgattttggg aactacaact gtactgcagt gaaccgcatt gggcaggagt ccttggaatt 1621 catccttgtt caagcagaca ccccctcttc accatccatc gaccaggtgg agccatactc 1681 cagcacagcc caggtgcagt ttgatgaacc agaggccaca ggtggggtgc ccatcctcaa 1741 atacaaagct gagtggagag cagttggtga agaagtatgg cattccaagt ggtatgatgc 1801 caaggaagcc agcatggagg gcatcgtcac catcgtgggc ctgaagcccg aaacaacgta 1861 cgccgtaagg ctggcggcgc tcaatggcaa agggctgggt gagatcagcg cggcctccga 1921 gttcaagacg cagccagtcc atagccctcc tccaccggca tctgctagct cgtctacccc 1981 tgttccattg tctccaccag atacaacttg gcctcttcct gcccttgcaa ccacagaacc 2041 agctaaaggg gaacccagtg cacctaagct cgaagggcag atgggagagg atggaaactc 2101 tattaaagtg aacctgatca agcaggatga cggcggctcc cccatcagac actatctggt 2161 caggtaccga gcgctctcct ccgagtggaa accagagatc aggctcccgt ctggcagtga 2221 ccacgtcatg ctgaagtccc tggactggaa tgctgagtat gaggtctacg tggtggctga 2281 gaaccagcaa ggaaaatcca aggcggctca ttttgtgttc aggacctcgg cccagcccac 2341 agccatccca gcaaccttgg gaggcaattc tgcatcctac acctttgtct cattgctttt 2401 ctctgcagtg actcttcttt tgctctgtta ggaacttgaa cacaaaaatt aaatttgctt 2461 aaaagcccag ttcctatgaa aaagatcagt gccccctttg gaagaacctg gcaggaccac 2521 catggccaca gctgctgagc aaccattctg tgtggaagag aaggttttgt gattggaaaa 2581 agctttacct ccagacatgt caccactcac agatactttt gtgccacttc ataaggagtt 2641 tgcccccttt ttaatggcag taaaaagaat ttgagagctc tttctttaaa tgctattttt 2701 aaaaaccatc atgctagatt tacagagaag tttctgcata tctgctactt gttgcatttt 2761 gggttcaaac ctaaatatga tgtagcagag gaagaattc // LOCUS HSNCHIM 2167 bp RNA PRI 08-MAR-1994 DEFINITION Human mRNA for n-chimaerin. ACCESSION X51408 NID g35012 KEYWORDS n-chimaerin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2167) AUTHORS Hall,C. TITLE Direct Submission JOURNAL Submitted (19-JAN-1990) Hall C., Institute of Neurology, Department of Neurochemistry, 1 Wakefield Street, London WC1 N1PJ, UK REFERENCE 2 (bases 1 to 2167) AUTHORS Hall,C., Monfries,C., Smith,P., Lim,H.H., Kozma,R., Ahmed,S., Vanniasingham,V., Leung,T. and Lim,L. TITLE Novel human brain cDNA encoding a 34,000 Mr protein n-chimaerin, related to both the regulatory domain of protein kinase C and BCR, the product of the breakpoint cluster region gene JOURNAL J. Mol. Biol. 211 (1), 11-16 (1990) MEDLINE 90133942 COMMENT Data kindly reviewed (09-APR-1990) by Hall C. FEATURES Location/Qualifiers source 1..2167 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="retina" /clone_lib="lambda gt10" /clone="H631.2" CDS 553..1452 /note="n-chimaerin (AA 1-299)" /codon_start=1 /db_xref="PID:g35013" /db_xref="SWISS-PROT:P15882" /translation="MKLGSPKSSVTIWQPLKLFAYSQLTSLVRRATLKENEQIPKYEK IHNFKVHTFRGPHWCEYCANFMWGLIAQGVKCADCGLNVHKQCSKMVPNDCKPDLKHV KKVYSCDLTTLVKAHTTKRPMVVDMCIREIESRGLNSEGLYRVSGFSDLIEDVKMAFD RDGEKADISVNMYEDINIITGALKLYFRDLPIPLITYDAYPKFIESAKIMDPDEQLET LHEALKLLPPAHCETLRYLMAHLKRVTLHEKENLMNAENLGIVFGPTLMRSPELDAMA ALNDIRYQRLVVELLIKNEDILF" repeat_region 1456..1464 /note="direct repeat 1" misc_feature 1466..1474 /note="inverted DE1 albumin gene 5'-regulatory element" misc_feature 1495..1503 /note="inverted DE1 albumin gene 5'-regulatory element" repeat_region 1510..1518 /note="direct repeat 1" misc_feature 2036..2041 /note="pot. polyadenylation signal" misc_feature 2055..2060 /note="pot. polyadenylation signal" misc_feature 2147..2152 /note="pot. polyadenylation signal" polyA_site 2167 /note="polyadenylation site" BASE COUNT 666 a 425 c 446 g 630 t ORIGIN 1 acaaggaaag aaaacctata gtggtctatg tctgtcgatg atatctattc agctaacaca 61 tgagcattct gccaggcagc acagaaccct aacctacaga gagctgcaga gaaacaccac 121 ggagaggtgg aggaggagga tgaagcactt cttaaacaga ggtcgactaa ccagcaaatt 181 cttctttctt cctttttttt tcttaaaggg atctatattc tagcttctaa aaacttgagt 241 ctgaacagaa ataaaaagaa agagtcgatg ctaacataca aaacactcag catctctctt 301 tatcattttt taaaaggcat gcaattttga caaatgatac atttctaaaa gctttcttct 361 ctattcaaga tattaatgtc cattctgaat gaaagatgcc tacatactgc ctgcaatcag 421 tttctagcaa cagacaccct atgaagcccc tagcagggaa ggtgggggga agggagggaa 481 actaataggg ctgcagttca caaatcaaaa caagagggcc gtcagcaaga tttattgata 541 gcagccttgg gaatgaaact gggttctcca aagtcgtctg tgacaatctg gcaacctctg 601 aaactctttg cttattcgca gttgacatca cttgttagaa gagcaactct gaaagaaaac 661 gagcaaattc caaaatatga aaagattcac aatttcaagg tgcatacatt cagagggcca 721 cactggtgtg aatactgtgc caactttatg tggggtctca ttgctcaggg agtgaaatgt 781 gcagattgtg gtttgaatgt tcataagcag tgttccaaga tggtcccaaa tgactgtaag 841 ccagacttga agcatgtcaa aaaggtgtac agctgtgacc ttacgacgct cgtgaaagca 901 cataccacta agcggccaat ggtggtagac atgtgcatca gggagattga gtctagaggt 961 cttaattctg aaggactata ccgagtatca ggatttagtg acctaattga agatgtcaag 1021 atggctttcg acagagatgg tgagaaggca gatatttctg tgaacatgta tgaagatatc 1081 aacattatca ctggtgcact taaactgtac ttcagggatt tgccaattcc actcattaca 1141 tatgatgcct accctaagtt tatagaatct gccaaaatta tggatccgga tgagcaattg 1201 gaaacccttc atgaagcact gaaactactg ccacctgctc actgcgaaac cctccggtac 1261 ctcatggcac atctaaagag agtgaccctc cacgaaaagg agaatcttat gaatgcagag 1321 aaccttggaa tcgtctttgg acccaccctt atgagatctc cagaactaga cgccatggct 1381 gcattgaatg atatacggta tcagagactg gtggtggagc tgcttatcaa aaacgaagac 1441 attttatttt aaatttttaa tttgagggga aaagaaatgt tttacagatg aaggaatgtt 1501 ttatagtaat ttaatttgct cctgtagctg cattatttct tgattagagg tttgggcata 1561 taaccagatt aaagtgaagg aactttctgt tgtttttgta gcaccgctca gctgtcttgt 1621 aaaacagtga acacacgctt tctggttcta gtaatcctgg gtgtttatca cgttcagaga 1681 aactcaagct attgcatgat tagcccccta tctggcaagg aaaccccata cagaagaaac 1741 aacaaacctg cgcctgcacc gcctctgcgt cctgggtagt ctgtgcttgt aatccagcat 1801 gtttcacaga gtaagcctgt tgtgactttg cttttggggt ctatgtcatt ggtttctgat 1861 gcttgtacaa acacgcacac acaaatggat aaaacagcac ctctggctgt tacattacca 1921 taaaccatat cacatgccta cattttacaa atgatttctg gtttctctta gttcttctct 1981 aacatagtac tttctttcca gcaaaagcaa aatgtgtttt cagatttgtt actttaataa 2041 aggttatcca taccaataaa aagtgtacaa cacagcattt tctgttaaat tattattggt 2101 tttcagttgt aatttggtat tttttctggc atgcgtttat taatttatta aattggcttt 2161 tagaaat // LOCUS HSNCK 1414 bp RNA PRI 12-SEP-1993 DEFINITION Human melanoma mRNA for nck protein, showing homology to src. ACCESSION X17576 NID g35014 KEYWORDS src oncogene; src-related protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1414) AUTHORS Johnson,J.P. TITLE Direct Submission JOURNAL Submitted (21-NOV-1989) Johnson J.P., Institute for Immunology, University of Muenich, Goethestr 31, 8000 Muenchen 2, F R G REFERENCE 2 (bases 1 to 1414) AUTHORS Lehmann,J.M., Riethmuller,G. and Johnson,J.P. TITLE Nck, a melanoma cDNA encoding a cytoplasmic protein consisting of the src homology units SH2 and SH3 JOURNAL Nucleic Acids Res. 18 (4), 1048 (1990) MEDLINE 90192089 COMMENT Data kindly reviewed (23-APR-1990) by Johnson J.P. FEATURES Location/Qualifiers source 1..1414 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 102..1235 /note="nck protein (AA 1-377)" /codon_start=1 /db_xref="PID:g35015" /db_xref="SWISS-PROT:P16333" /translation="MAEEVVVVAKFDYVAQQEQELDIKKNERLWLLDDSKSWWRVRNS MNKTGFVPSNYVERKNSARKASIVKNLKDTLGIGKVKRKPSVPDSASPADDSFVDPGE RLYDLNMPAYVKFNYMAEREDELSLIKGTKVIVMEKCSDGWWRGSYNGQVGWFPSNYV TEEGDSPLGDHVGSLSEKLAAVVNNLNTGQVLHVVQALYPFSSSNDEELNFEKGDVMD VIEKPENDPEWWKCRKINGMVGLVPKNYVTVMQNNPLTSGLEPSPPQCDYIRPSLTGK FAGNPWYYGKVTRHQAEMALNERGHEGDFLIRDSESSPNDFSVSLKAQGKNKHFKVQL KETVYCIGQRKFSTMEELVEHYKKAPIFTSEQGEKLYLVKHLS" BASE COUNT 446 a 248 c 342 g 378 t ORIGIN 1 ggaattccgg aattccgcgg agcaggcctc gtgccgttac ggccatcacg gccgccgcag 61 tggcgtcctg gagccctcct cagtgctgaa gctgctgaaa gatggcagaa gaagtggtgg 121 tagtagccaa atttgattat gtggcccaac aagaacaaga gttggacatc aagaagaatg 181 agagattatg gcttctggat gattctaagt cctggtggcg agttcgaaat tccatgaata 241 aaacaggttt tgtgccttct aactatgtgg aaaggaaaaa cagtgctcgg aaagcatcta 301 ttgtgaaaaa cctaaaggat accttaggca ttggaaaagt gaaaagaaaa cctagtgtgc 361 cagattctgc atctcctgct gatgatagtt ttgttgaccc aggggaacgt ctctatgacc 421 tcaacatgcc cgcttatgtg aaatttaact acatggctga gagagaggat gaattatcat 481 tgataaaggg gacaaaggtg atcgtcatgg agaaatgcag tgatgggtgg tggcgtggta 541 gctacaatgg acaagttgga tggttccctt caaactatgt aactgaagaa ggtgacagtc 601 ctttgggtga ccatgtgggt tctctgtcag agaaattagc agcagtcgtc aataacctaa 661 atactgggca agtgttgcat gtggtacagg ctctttaccc attcagctca tctaatgatg 721 aagaacttaa tttcgagaaa ggagatgtaa tggatgttat tgaaaaacct gaaaatgacc 781 cagagtggtg gaaatgcagg aagatcaatg gtatggttgg tctagtacca aaaaactatg 841 ttaccgttat gcagaataat ccattaactt caggtttgga accatcacct ccacagtgtg 901 attacattag gccttcactc actggaaagt ttgctggcaa tccttggtat tatggcaaag 961 tcaccaggca tcaagcagaa atggcattaa atgaaagagg acatgaaggg gatttcctca 1021 ttcgtgatag tgaatcttcg ccaaatgatt tctcagtatc actaaaagca caagggaaaa 1081 acaagcattt taaagtccaa ctaaaagaga ctgtctactg cattgggcag cgtaaattca 1141 gcaccatgga agaacttgta gaacattaca aaaaggcacc aatttttaca agtgaacaag 1201 gagaaaaatt atatcttgtc aagcatttat catgatactg ctgaccagaa gtgactgctg 1261 tgtagctgta atttgtcatg taattgaatt gaagactgag aaaatgttgg gtccagtcgt 1321 gcttgattgg aaattgttgt ttctaaatct atatgagatt tgacatagta ttttattata 1381 ctcagccata catatatact atgtatgagc agtg // LOCUS HSNCL1 2466 bp RNA PRI 27-APR-1995 DEFINITION H.sapiens mRNA for skeletal muscle-specific calpain. ACCESSION X85030 NID g791039 KEYWORDS calcium-dependent protease; calpain; nCL1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2466) AUTHORS Richard,I. TITLE Direct Submission JOURNAL Submitted (28-FEB-1995) I. Richard, Genethon B.P. 60, F- 91002 Evry Cedex, FRANCE REFERENCE 2 (bases 1 to 2466) AUTHORS Richard,I., Broux,O., Allamand,V., Fougerousse,F., Chiannilkulchai,N., Bourg,N., Brenguier,L., Devaud,C., Pasturaud,P., Roudaut,C., Hillaire,D., Passos-Bueno,M.R., Zatz,M., Tishfield,J., Fardeau,M., Jackson,C.E., Cohen,D. and Beckmann,J.S. TITLE Mutations in the proteolytic enzyme calpain 3 cause limb-girdle muscular dystrophy type 2A JOURNAL Cell 81 (1), 27-40 (1995) MEDLINE 95236448 FEATURES Location/Qualifiers source 1..2466 /organism="Homo sapiens" /db_xref="taxon:9606" /map="15q15.1-q15.2" /chromosome="15" gene 1..2466 /gene="nCL1" exon 1..309 /gene="nCL1" /number=1 CDS 1..2466 /gene="nCL1" /standard_name="calcium activated neutral protease" /EC_number="3.4.22.17" /codon_start=1 /product="calpain" /db_xref="PID:g791040" /translation="MPTVISASVAPRTAAEPRSPGPVPHPAQSKATEAGGGNPSGIYS AIISRNFPIIGVKEKTFEQLHKKCLEKKVLYVDPEFPPDETSLFYSQKFPIQFVWKRP PEICENPRFIIDGANRTDICQGELGDCWFLAAIACLTLNQHLLFRVIPHDQSFIENYA GIFHFQFWRYGEWVDVVIDDCLPTYNNQLVFTKSNHRNEFWSALLEKAYAKLHGSYEA LKGGNTTEAMEDFTGGVAEFFEIRDAPSDMYKIMKKAIERGSLMGCSIDDGTNMTYGT SPSGLNMGELIARMVRNMDNSLLQDSDLDPRGSDERPTRTIIPVQYETRMACGLVRGH AYSVTGLDEVPFKGEKVKLVRLRNPWGQVEWNGSWSDRWKDWSFVDKDEKARLQHQVT EDGEFWMSYEDFIYHFTKLEICNLTADALQSDKLQTWTVSVNEGRWVRGCSAGGCRNF PDTFWTNPQYRLKLLEEDDDPDDSEVICSFLVALMQKNRRKDRKLGASLFTIGFAIYE VPKEMHGNKQHLQKDFFLYNASKARSKTYINMREVSQRFRLPPSEYVIVPSTYEPHQE GEFILRVFSEKRNLSEEVENTISVDRPVKKKKTKPIIFVSDRANSNKELGVDQESEEG KGKTSPDKQKQSPQPQPGSSDQESEEQQQFRNIFKQIAGDDMEICADELKKVLNTVVN KHKDLKTHGFTLESCRSMIALMDTDGSGKLNLQEFHHLWNKIKAWQKIFKHYDTDQSG TINSYEMRNAVNDAGFHLNNQLYDIITMRYADKHMNIDFDSFICCFVRLEGMFRAFHA FDKDGDGIIKLNVLEWLQLTMYA" exon 310..379 /gene="nCL1" /number=2 exon 380..498 /gene="nCL1" /number=3 exon 499..632 /gene="nCL1" /number=4 exon 633..801 /gene="nCL1" /number=5 exon 802..945 /gene="nCL1" /number=6 exon 946..1029 /gene="nCL1" /number=7 exon 1030..1115 /gene="nCL1" /number=8 exon 1116..1193 /gene="nCL1" /number=9 exon 1194..1354 /gene="nCL1" /number=10 exon 1355..1524 /gene="nCL1" /number=11 exon 1525..1536 /gene="nCL1" /number=12 exon 1537..1745 /gene="nCL1" /number=13 exon 1746..1782 /gene="nCL1" /number=14 exon 1783..1800 /gene="nCL1" /number=15 exon 1801..1914 /gene="nCL1" /number=16 exon 1915..1992 /gene="nCL1" /number=17 exon 1993..2050 /gene="nCL1" /number=18 exon 2051..2115 /gene="nCL1" /number=19 exon 2116..2184 /gene="nCL1" /number=20 exon 2185..2263 /gene="nCL1" /number=21 exon 2264..2380 /gene="nCL1" /number=22 exon 2381..2439 /gene="nCL1" /number=23 exon 2440..2466 /gene="nCL1" /number=24 BASE COUNT 657 a 636 c 662 g 511 t ORIGIN 1 atgccgaccg tcattagcgc atctgtggct ccaaggacag cggctgagcc ccggtcccca 61 gggccagttc ctcacccggc ccagagcaag gccactgagg ctgggggtgg aaacccaagt 121 ggcatctatt cagccatcat cagccgcaat tttcctatta tcggagtgaa agagaagaca 181 ttcgagcaac ttcacaagaa atgtctagaa aagaaagttc tttatgtgga ccctgagttc 241 ccaccggatg agacctctct cttttatagc cagaagttcc ccatccagtt cgtctggaag 301 agacctccgg aaatttgcga gaatccccga tttatcattg atggagccaa cagaactgac 361 atctgtcaag gagagctagg ggactgctgg tttctcgcag ccattgcctg cctgaccctg 421 aaccagcacc ttcttttccg agtcataccc catgatcaaa gtttcatcga aaactacgca 481 gggatcttcc acttccagtt ctggcgctat ggagagtggg tggacgtggt tatagatgac 541 tgcctgccaa cgtacaacaa tcaactggtt ttcaccaagt ccaaccaccg caatgagttc 601 tggagtgctc tgctggagaa ggcttatgct aagctccatg gttcctacga agctctgaaa 661 ggtgggaaca ccacagaggc catggaggac ttcacaggag gggtggcaga gttttttgag 721 atcagggatg ctcctagtga catgtacaag atcatgaaga aagccatcga gagaggctcc 781 ctcatgggct gctccattga tgatggcacg aacatgacct atggaacctc tccttctggt 841 ctgaacatgg gggagttgat tgcacggatg gtaaggaata tggataactc actgctccag 901 gactcagacc tcgaccccag aggctcagat gaaagaccga cccggacaat cattccggtt 961 cagtatgaga caagaatggc ctgcgggctg gtcagaggtc acgcctactc tgtcacgggg 1021 ctggatgagg tcccgttcaa aggtgagaaa gtgaagctgg tgcggctgcg gaatccgtgg 1081 ggccaggtgg agtggaacgg ttcttggagt gatagatgga aggactggag ctttgtggac 1141 aaagatgaga aggcccgtct gcagcaccag gtcactgagg atggagagtt ctggatgtcc 1201 tatgaggatt tcatctacca tttcacaaag ttggagatct gcaacctcac ggccgatgct 1261 ctgcagtctg acaagcttca gacctggaca gtgtctgtga acgagggccg ctgggtacgg 1321 ggttgctctg ccggaggctg ccgcaacttc ccagatactt tctggaccaa ccctcagtac 1381 cgtctgaagc tcctggagga ggacgatgac cctgatgact cggaggtgat ttgcagcttc 1441 ctggtggccc tgatgcagaa gaaccggcgg aaggaccgga agctaggggc cagtctcttc 1501 accattggct tcgccatcta cgaggttccc aaagagatgc acgggaacaa gcagcacctg 1561 cagaaggact tcttcctgta caacgcctcc aaggccagga gcaaaaccta catcaacatg 1621 cgggaggtgt cccagcgctt ccgcctgcct cccagcgagt acgtcatcgt gccctccacc 1681 tacgagcccc accaggaggg ggaattcatc ctccgggtct tctctgaaaa gaggaacctc 1741 tctgaggaag ttgaaaatac catctccgtg gatcggccag tgaaaaagaa aaaaaccaag 1801 cccatcatct tcgtttcgga cagagcaaac agcaacaagg agctgggtgt ggaccaggag 1861 tcagaggagg gcaaaggcaa aacaagccct gataagcaaa agcagtcccc acagccacag 1921 cctggcagct ctgatcagga aagtgaggaa cagcaacaat tccggaacat tttcaagcag 1981 atagcaggag atgacatgga gatctgtgca gatgagctca agaaggtcct taacacagtc 2041 gtgaacaaac acaaggacct gaagacacac gggttcacac tggagtcctg ccgtagcatg 2101 attgcgctca tggatacaga tggctctgga aagctcaacc tgcaggagtt ccaccacctc 2161 tggaacaaga ttaaggcctg gcagaaaatt ttcaaacact atgacacaga ccagtccggc 2221 accatcaaca gctacgagat gcgaaatgca gtcaacgacg caggattcca cctcaacaac 2281 cagctctatg acatcattac catgcggtac gcagacaaac acatgaacat cgactttgac 2341 agtttcatct gctgcttcgt taggctggag ggcatgttca gagcttttca tgcatttgac 2401 aaggatggag atggtatcat caagctcaac gttctggagt ggctgcagct caccatgtat 2461 gcctga // LOCUS HSNDMENZ 1930 bp RNA PRI 23-FEB-1995 DEFINITION H.sapiens mRNA for NADP+-dependent malic enzyme. ACCESSION X79440 NID g535011 KEYWORDS NADP-dependent malic enzyme. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1930) AUTHORS Loeber,G., Maurer-Fogy,I. and Schwendenwein,R. TITLE Purification, cDNA cloning and heterologous expression of the human mitochondrial NADP(+)-dependent malic enzyme JOURNAL Biochem. J. 304 (Pt 3), 687-692 (1994) MEDLINE 95118281 REFERENCE 2 (bases 1 to 1930) AUTHORS Loeber,G. TITLE Direct Submission JOURNAL Submitted (25-MAY-1994) G. Loeber, Dept of Cell Biology, Bender & Co., Dr. Boehringergasse 5-11, 1121 Vienna, AUSTRIA FEATURES Location/Qualifiers source 1..1930 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="hippocampus" /clone_lib="Stratagene" CDS 33..1847 /EC_number="1.1.1.40" /note="NADP+-dependent malic enzyme" /codon_start=1 /product="malate dehydrogenase (oxaloacetate decarboxylating) (NADP+)" /db_xref="PID:g535012" /translation="MGAALGTGTRLAPWPGRACGALPRWTPTAPAQGCHSKPGPARPV PLKKRGYDVTRNPHLNKGMAFTLEERLQLGIHGLIPPCFLSQDVQLLRIMRYYERQQS DLDKYIILMTLQDRNEKLFYRVLTSDVEKFMPIVYTPTVGLACQHYGLTFRRPRGLFI TIHDKGHLATMLNSWPEDNIKAVVVTDGERILGLGDLGCYGMGIPVGKLALYTACGGV NPQQCLPVLLDVGTNNEELLRDPLYIGLKHQRVHGKAYDDLLDEFMQAVTDKFGINCL IQFEDFANANAFRLLNKYRNKYCMFNDDIQGTASVAVAGILAALRITNNKLSNHVFVF QGAGEAAMGIAHLLVMALEKEGVPKAEATRKIWMVDSKGLIVKGRSHLNHEKEMFAQD HPEVNSLEEVVRLVKPTAIIGVAAIAGAFTEQILRDMASFHERPIIFALSNPTSKAEC TAEKCYRVTEGRGIFASGSPFKSVTLEDGKTFIPGQGNNAYVFPGVALGVIAGGIRHI PDEIFLLTAEQIAQEVSEQHLSQGRLYPPLSTIRDVSLRIAIKVLDYAYKHNLASYYP EPKDKEAFVRSLVYTPDYDSFTLDSYTWPKEAMNVQTV" BASE COUNT 459 a 549 c 539 g 383 t ORIGIN 1 cggaaggaga ggaccgaggt ctgccaagga ccatgggtgc cgcgctgggg acaggcacgc 61 ggctggctcc ctggccgggc cgggcctgcg gcgccctccc gcgctggaca cccaccgcgc 121 ccgcccaagg ctgccactcc aagcctggcc cggcgcgccc tgtgcccctg aagaagcgcg 181 gatacgatgt caccaggaac cctcatctca acaaggggat ggcctttacc cttgaagaaa 241 ggctgcagct tggaatccac ggcctaatcc cgccctgctt tctgagccag gacgtccagc 301 tcctccgaat catgagatat tacgagcggc agcagagtga cctggacaag tacatcattc 361 tcatgacact ccaagaccgg aacgagaagc tcttctaccg agtgctgact tcggatgtgg 421 agaagttcat gccaatcgtg tacacgccta ccgtggggct ggcctgtcag cactatggcc 481 tgactttccg caggccccgt ggactgttca tcaccattca tgacaaaggt catcttgcaa 541 caatgctgaa ttcttggcca gaagacaata ttaaggccgt ggtggtgact gatggggagc 601 gcatcctggg cctgggagac ctgggctgct acggcatggg catccctgtg ggcaagctgg 661 ccctgtacac ggcatgcgga ggggtgaacc cgcagcagtg cctccctgtg ctgctggacg 721 tcggcaccaa caatgaggag ctgctcagag accctctgta catcggcctg aaacaccagc 781 gcgtgcacgg gaaggcatac gatgacttgc tggatgagtt catgcaggct gtgacagaca 841 agtttggaat aaattgcctc atccaatttg aagacttcgc caatgccaat gccttccgcc 901 tgctcaacaa ataccgtaac aagtactgca tgttcaatga tgacatccaa ggcacagcct 961 ccgttgctgt ggcagggatc ttggctgctc tgcgaatcac caacaacaag ctttccaatc 1021 acgtgtttgt tttccaaggt gcaggcgagg cagctatggg cattgcccac ctccttgtca 1081 tggccctaga gaaagaaggt gtaccgaagg cagaggccac aagaaagatc tggatggtgg 1141 actctaaagg gctcattgtc aaggggagga gccacctgaa ccatgaaaag gagatgtttg 1201 cccaagacca tcctgaagtc aactccctgg aggaggtggt gaggctggtg aagcccacag 1261 ccatcatagg tgttgctgcc atcgcaggag ccttcacgga gcagattctg agggacatgg 1321 cctccttcca cgagcgccct atcatctttg ccctgagcaa ccccaccagc aaggccgagt 1381 gcacggctga gaagtgctac cgggtcaccg agggccgagg gatttttgcc agtggaagtc 1441 cttttaagag tgtgactctg gaagatggca agaccttcat tcctgggcag ggaaacaatg 1501 cttacgtgtt ccccggggtg gcactgggag tcatcgccgg cgggatccgg cacatcccag 1561 atgagatctt cctcctgaca gcagagcaaa ttgcccagga agtctctgag cagcatctgt 1621 cccaggggag actctatcca ccactcagca ccatccgaga cgtgtctttg agaattgcca 1681 tcaaagttct cgactacgcg tacaaacaca acctggcttc ctactaccca gagcctaagg 1741 acaaggaggc ttttgtaaga tccctggtct acactccaga ctatgactcc tttacactgg 1801 acagctacac ttggcccaag gaagccatga atgttcagac ggtctgaggc agtctcagag 1861 gctagtatgg ggctagatga agcccagagt aacacccaca atataaatgg gttccaaaat 1921 ggcccaagtg // LOCUS HSNEFAP 1586 bp RNA PRI 19-DEC-1997 DEFINITION H.sapiens mRNA for NEFA protein. ACCESSION X76732 NID g2706486 KEYWORDS acute lymphoblastic leukemia; calcium-binding protein; DNA-binding protein:nuclear localization signal; EF-hand; leucine zipper; NEFA protein; signal peptide. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1586) AUTHORS Barnikol-Watanabe,S., Gross,N.A., Gotz,H., Henkel,T., Karabinos,A., Kratzin,H., Barnikol,H.U. and Hilschmann,N. TITLE Human protein NEFA, a novel DNA binding/EF-hand/leucine zipper protein. Molecular cloning and sequence analysis of the cDNA, isolation and characterization of the protein JOURNAL Biol. Chem. Hoppe-Seyler 375 (8), 497-512 (1994) MEDLINE 95110446 REFERENCE 2 (bases 1 to 1586) AUTHORS Barnikol,H.U. TITLE Direct Submission JOURNAL Submitted (15-DEC-1993) H.U. Barnikol, Max-Planck-Inst. fuer Experimentelle Medizin, Hermann-Rein-Str. 3, 37075 Goettingen, FRG REMARK revised by [3] REFERENCE 3 (bases 1 to 1586) AUTHORS Barnikol,H.U. TITLE Direct Submission JOURNAL Submitted (18-DEC-1997) H.U. Barnikol, Max-Planck-Inst. fuer Experimentelle Medizin, Hermann-Rein-Str. 3, 37075 Goettingen, FRG COMMENT Name: NEFA (N=DNA-binding, EF=EF-hand, A=acidic region) Function: DNA-binding protein. The calcium-binding EF-hand structures together with the acidic region may be involved in regulation of the DNA-binding function of protein NEFA Disease: CALLA (CD10) positive human acute lymphoblastic leukemia (ALL) Similarity: human nucleobindin EMBL: M96824; HSNUCLEOB GB:M96824; HUMNUCLEOB PATCHX:M96824 SWISSPROT:Q02818; NUBN_HUMAN murine nucleobindin EMBL:M96823; MMNUCLEOB GB:M96823; MUSNUCLEOB PIR:JC1224; PC1116 SWISSPROT:Q02819; NUBN_MOUSE. FEATURES Location/Qualifiers source 1..1586 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="Male" /dev_stage="12 year old" /tissue_type="blood" /cell_type="pre-B lymphoblast" /cell_line=" KM-3" /clone_lib="lambda gt11" /clone="pBluescript (Stratagene)" gene 220..1482 /gene="NEFA" CDS 220..1482 /gene="NEFA" /function="DNA-binding protein" /note="NEFA (N=DNA-binding, EF=EF-hand, A=acidic region)" /codon_start=1 /evidence=experimental /product="NEFA protein" /db_xref="PID:g436418" /db_xref="SWISS-PROT:P80303" /translation="MRWRTILLQYCFLLITCLLTALEAVPIDIDKTKVQNIHPVESAK IEPPDTGLYYDEYLKQVIDVLETDKHFREKLQKADIEEIKSGRLSKELDLVSHHVRTK LDELKRQEVGRLRMLIKAKLDSLQDIGMDHQALLKQFDHLNHLNPDKFESTDLDMLIK AATSDLEHYDKTRHEEFKKYEMMKEHERREYLKTLNEEKRKEEESKFEEMKKKHENHP KVNHPGSKDQLKEVWEETDGLDPNDFDPKTFFKLHDVNSDGFLDEQELEALFTKELEK VYDPKNEEDDMVEMEEERLRMREHVMNEVDTNKDRLVTLEEFLKATEKKEFLEPDSWE TLDQQQFFTEEELKEYENIIALQENELKKKADELQKQKEELQRQHDQLEAQKLEYHQV IQQMEQKKLQQGIPPSGPAGELKFEPHI" sig_peptide 220..294 /gene="NEFA" /evidence=not_experimental misc_feature 232..300 /gene="NEFA" /note="hydrophobic region, HY1, AA 5-27" mat_peptide 295..1479 /gene="NEFA" /evidence=experimental /product="NEFA protein" misc_feature 703..888 /gene="NEFA" /note="DNA-binding domain, DNB, AA 162-223" /evidence=not_experimental repeat_unit 810..816 /gene="NEFA" /note="repeat 1.1" /rpt_type=DIRECT misc_feature 814..822 /gene="NEFA" /note="nuclear localization signal, NLS1, AA 199-201" /evidence=not_experimental repeat_unit 817..823 /gene="NEFA" /note="repeat 2.1" /rpt_type=DIRECT repeat_unit 840..847 /gene="NEFA" /note="repeat 2.1" /rpt_type=DIRECT repeat_unit 848..855 /gene="NEFA" /note="repeat 2.2" /rpt_type=DIRECT misc_feature 850..861 /gene="NEFA" /note="nuclear localization signal, NLS2, AA 211-214" /evidence=not_experimental misc_feature 952..1038 /gene="NEFA" /note="calcium-binding EF-hand, EF1, AA 245-273" /evidence=not_experimental misc_feature 1063..1098 /gene="NEFA" /note="acidic region, AR, AA 282-293" misc_feature 1108..1194 /gene="NEFA" /note="calcium-binding EF-hand, EF2, AA 297-325" /evidence=not_experimental misc_feature 1258..1374 /gene="NEFA" /note="leucine zipper, LEU, AA 347-382" /evidence=not_experimental misc_feature 1420..1455 /gene="NEFA" /note="hydrophobioc region, HY2, AA 401-412" polyA_signal 1568..1573 BASE COUNT 601 a 269 c 340 g 376 t ORIGIN 1 gaggacaggt ttgtgcgctg gacgcaagca ccaggcgcag cctcgctcgc cgacacccgg 61 ccagaacgtg ttacgagtca gtttttagtg aaaaaacatt gagctaggag ccaagaccca 121 tctcttcact attttggtat tgtgcaagtc atcttacctc tctggatctc agttgtctca 181 tctgtaaaaa ggagataaaa attatttacc tgcctgaaca tgaggtggag gaccatcctg 241 ctacagtatt gctttctctt gattacatgt ttacttactg ctcttgaagc tgtgcctatt 301 gacatagaca agacaaaagt acaaaatatt caccctgtgg aaagtgcgaa gatagaacca 361 ccagatactg gactttatta tgatgaatat ctcaagcaag tgattgatgt gctggaaaca 421 gataaacact tcagagaaaa gctccagaaa gcagacatag aggaaataaa gagtgggagg 481 ctaagcaaag aactggattt agtaagtcac catgtgagga caaaacttga tgaactgaaa 541 aggcaagaag taggaaggtt aagaatgtta attaaagcta agttggattc ccttcaagat 601 ataggcatgg accaccaagc tcttctaaaa caatttgatc acctaaacca cctgaatcct 661 gacaagtttg aatccacaga tttagatatg ctaatcaaag cggcaacaag tgatctggaa 721 cactatgaca agactcgtca tgaagaattt aaaaaatatg aaatgatgaa ggaacatgaa 781 aggagagaat atttaaaaac attgaatgaa gaaaagagaa aagaagaaga gtctaaattt 841 gaagaaatga agaaaaagca tgaaaatcac cctaaagtta atcacccagg aagcaaagat 901 caactaaaag aggtatggga agagactgat ggattggatc ctaatgactt tgaccccaag 961 acatttttca aattacatga tgtcaatagt gatggattcc tggatgaaca agaattagaa 1021 gccctattta ctaaagagtt ggagaaagta tatgacccta aaaatgaaga ggatgatatg 1081 gtagaaatgg aagaagaaag gcttagaatg agggaacatg taatgaatga ggttgatact 1141 aacaaagaca gattggtgac tctggaggag tttttgaaag ccacagaaaa aaaagaattc 1201 ttggagccag atagctggga gacattagat cagcaacagt tcttcacaga ggaagaacta 1261 aaagaatatg aaaatattat tgctttacaa gaaaatgaac ttaagaagaa ggcagatgag 1321 cttcagaaac aaaaagaaga gctacaacgt cagcatgatc aactggaggc tcagaagctg 1381 gaatatcatc aggtcataca gcagatggaa caaaaaaaat tacaacaagg aattcctcca 1441 tcagggccag ctggagaatt gaagtttgag ccacacattt aaagtctgaa gtccaccaga 1501 acttggaaga aagctgttaa ctcaacatct atttcatctt tttagctccc ttcccttttc 1561 tctgctcaat aaatatttta aaagca // LOCUS HSNEK2R 2051 bp RNA PRI 20-FEB-1995 DEFINITION H.sapiens nek2 mRNA for protein kinase. ACCESSION Z29066 NID g479170 KEYWORDS nek2 gene; protein kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2051) AUTHORS Schultz,S.J., Fry,A.M., Suetterlin,C., Ried,T. and Nigg,E.A. TITLE Molecular characterization of Nek2 and Nek3, two novel human protein kinases related to the cell cycle regulator NIMA of Aspergillus nidulans: Nek2 is maximally expressed at the onset of mitosis JOURNAL Unpublished REFERENCE 2 (bases 329 to 691) AUTHORS Schultz,S.J. and Nigg,E.A. TITLE Identification of 21 novel human protein kinases, including 3 members of a family related to the cell cycle regulator nimA of Aspergillus nidulans JOURNAL Cell Growth Differ. 4 (10), 821-830 (1993) MEDLINE 94100173 REFERENCE 3 (bases 1 to 2051) AUTHORS Schultz,S.J. TITLE Direct Submission JOURNAL Submitted (13-DEC-1993) Schultz S. J., University of Washington, Microbiology, SEATTLE, Washington, USA, 98195 REFERENCE 4 (bases 1 to 2051) AUTHORS Schultz,S.J., Fry,A.M., Sutterlin,C., Ried,T. and Nigg,E.A. TITLE Cell cycle-dependent expression of Nek2, a novel human protein kinase related to the NIMA mitotic regulator of Aspergillus nidulans JOURNAL Cell Growth Differ. 5 (6), 625-635 (1994) MEDLINE 94368699 FEATURES Location/Qualifiers source 1..2051 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HL-60" gene 83..1420 /gene="nek2" CDS 83..1420 /gene="nek2" /standard_name="NIMA-related protein kinase 1" /note="Partial PCR fragment of nek2 encoding nt 329 - 691 previously identified as HsPK 21 (Schultz and Nigg, Cell Growth & Differentiation vol 4, 821-830, October 1993); accession number Z25425" /citation=[1] /citation=[2] /codon_start=1 /product="protein kinase" /db_xref="PID:g479171" /translation="MPSRAEDYEVLYTIGTGSYGRCQKIRRKSDGKILVWKELDYGSM TEAEKQMLVSEVNLLRELKHPNIVRYYDRIIDRTNTTLYIVMEYCEGGDLASVITKGT KERQYLDEEFVLRVMTQLTLALKECHRRSDGGHTVLHRDLKPANVFLDGKQNVKLGDF GLARILNHDTSFAKTFVGTPYYMSPEQMNRMSYNEKSDIWSLGCLLYELCALMPPFTA FSQKELAGKIREGKFRRIPYRYSDELNEIITRMLNLKDYHRPSVEEILENPLIADLVA DEQRRNLERRGRQLGEPEKSQDSSPVLSELKLKEIQLQERERALKAREERLEQKEQEL CVRERLAEDKLARAENLLKNYSLLKERKFLSLASNPELLNLPSSVIKKKVHFSGESKE NIMRSENSESQLTSKSKCKDLKKRLHAAQLRAQALSDIEKNYQLKSRQILGMR" BASE COUNT 633 a 371 c 473 g 574 t ORIGIN 1 gggcggggtt cctggtccct ggagctccgc acttggcggc gcaacctgcg tgaggcagcg 61 cgactctggc gactggccgg ccatgccttc ccgggctgag gactatgaag tgttgtacac 121 cattggcaca ggctcctacg gccgctgcca gaagatccgg aggaagagtg atggcaagat 181 attagtttgg aaagaacttg actatggctc catgacagaa gctgagaaac agatgcttgt 241 ttctgaagtg aatttgcttc gtgaactgaa acatccaaac atcgttcgtt actatgatcg 301 gattattgac cggaccaata caacactgta cattgtaatg gaatattgtg aaggagggga 361 tctggctagt gtaattacaa agggaaccaa ggaaaggcaa tacttagatg aagagtttgt 421 tcttcgagtg atgactcagt tgactctggc cctgaaggaa tgccacagac gaagtgatgg 481 tggtcatacc gtattgcatc gggatctgaa accagccaat gttttcctgg atggcaagca 541 aaacgtcaag cttggagact ttgggctagc tagaatatta aaccacgaca cgagttttgc 601 aaaaacattt gttggcacac cttattacat gtctcctgaa caaatgaatc gcatgtccta 661 caatgagaaa tcagatatct ggtcattggg ctgcttgctg tatgagttat gtgcattaat 721 gcctccattt acagctttta gccagaaaga actcgctggg aaaatcagag aaggcaaatt 781 caggcgaatt ccataccgtt actctgatga attgaatgaa attattacga ggatgttaaa 841 cttaaaggat taccatcgac cttctgttga agaaattctt gagaaccctt taatagcaga 901 tttggttgca gacgagcaaa gaagaaatct tgagagaaga gggcgacaat taggagagcc 961 agaaaaatcg caggattcca gccctgtatt gagtgagctg aaactgaagg aaattcagtt 1021 acaggagcga gagcgagctc tcaaagcaag agaagaaaga ttggagcaga aagaacagga 1081 gctttgtgtt cgtgagagac tagcagagga caaactggct agagcagaaa atctgttgaa 1141 gaactacagc ttgctaaagg aacggaagtt cctgtctctg gcaagtaatc cagaacttct 1201 taatcttcca tcctcagtaa ttaagaagaa agttcatttc agtggggaaa gtaaagagaa 1261 catcatgagg agtgagaatt ctgagagtca gctcacatct aagtccaagt gcaaggacct 1321 gaagaaaagg cttcacgctg cccagctgcg ggctcaagcc ctgtcagata ttgagaaaaa 1381 ttaccaactg aaaagcagac agatcctggg catgcgctag ccaggtagag agacacagag 1441 ctgtgtacag gatgtaatat taccaacctt taaagactga tattcaaatg ctgtagtgtt 1501 gaatacttgg ttccatgagc catgcctttc tgtatagtac acatgatatt tcggaattgg 1561 ttttactgtt cttcagcaac tattgtacaa aatgttcaca tttaattttt ctttcttctt 1621 ttaagaacat attataaaaa gaatactttc ttggttgggc ttttaatcct gtgtgtgatt 1681 actagtagga acatgagatg tgacattcta aatcttggga gaaaaaataa tgttagaaaa 1741 aaaatattta tgcaggaagg tagcactcac tgaatagttt taaatgactg agtggtatgc 1801 ttacaattgt catgtctaga tttaaatttt aagtctgaga ttttaaatgt ttttgagctt 1861 agaaaaccca gttagatgca atttggtcat taataccatg acatcttgct tataaatatt 1921 ccattgctct gtagttcaaa tctgttagct ttgtgaaaat tcatcactgt gatgtttgta 1981 ttcttttttt ttttctgttt aacagatatg agctgtctgt catttaccta cttctttccc 2041 actaaataaa a // LOCUS HSNETTF 1600 bp RNA PRI 13-MAR-1995 DEFINITION H.sapiens mRNA for Net transcription factor. ACCESSION Z36715 NID g531522 KEYWORDS Net; transcription factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1600) AUTHORS Giovane,A., Pintzas,A., Maira,S.M., Sobieszczuk,P. and Wasylyk,B. TITLE Net, a new ets transcription factor that is activated by Ras JOURNAL Genes Dev. 8 (13), 1502-1513 (1994) MEDLINE 95047310 REFERENCE 2 (bases 1 to 1600) AUTHORS Giovane,A. TITLE Direct Submission JOURNAL Submitted (17-AUG-1994) Antoine Giovane, CNRS-LGME, INSERM-U184, Institut de Chimie, Biologique, 11 rue Humann, STRASBOURG Cedex, 67085, France FEATURES Location/Qualifiers source 1..1600 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HELA" CDS 280..1503 /codon_start=1 /product="Net" /db_xref="PID:g531523" /db_xref="SWISS-PROT:P41970" /translation="MESAITLWQFLLQLLLDQKHEHLICWTSNDGEFKLLKAEEVAKL WGLRKNKTNMNYDKLSRALRYYYDKNIIKKVIGQKFVYKFVSFPEILKMDPHAVEISR ESLLLQDSDCKVSPEGREAHKHGLAVLRSTSRNEYIHSGLYSSFTINSLENPPDAFKA IKREKLEEPPEDSPPVEEVRTVIRFVTNKTDKHVTRPVVSLPSTSEAAAASAFLASSV SAKISSLMLPNAASISSASPFSSRSPSLSPKSPLPSEHRSLFLEAACHDSDSLEPLNL SSGSKTKSPSLPPKAKKPKGLEISAPPLVLSGTDIGSIALNSPALPSGSLTPAFFTAQ TPNGLLLTPSPLLSSIHFWSSLSPVAPLSPARLQGPSTLFQFPTLLNGHMPVPIPSLD RAASPVLLSSNSQKS" BASE COUNT 397 a 498 c 385 g 320 t ORIGIN 1 gggcggaaaa gcctgtttac acagactgca caccgcctgg ggaataatgc agtaaaggaa 61 gtgagccggc tcggcctgac tgctccaact tcctgctctc acacacacca gaggggaaaa 121 aaaaagagga gcgagagaaa gaaaaaaagg gggaaaaatc aggatctcat tacaagagcc 181 acagaccgtc tgcagacgcc tgtcagcatg gaaagtcggg ggctttcgcc cgggtcctcc 241 tagaaattcc ccccgaagaa gactccccca catctgggta tggagagtgc aatcacgctg 301 tggcagttcc tgttgcagtt gctgctggat cagaaacatg agcatttgat ctgctggacc 361 tcgaacgatg gtgaattcaa gctcctcaaa gcagaagaag tggccaagct gtggggactc 421 cgaaaaaaca aaacaaatat gaactatgat aagctgagca gagccctgcg atactattat 481 gacaagaaca tcatcaagaa ggtgatcggg cagaagtttg tgtacaagtt tgtctctttc 541 ccggagatcc tgaagatgga tcctcacgcg gtggagatca gccgggagag ccttctgctg 601 caggacagcg actgcaaggt gtctccggag ggccgcgagg cccacaaaca cggcctggcc 661 gtcctcagaa gcacgagccg caacgaatac atccactcag gcctgtactc gtccttcacc 721 attaattccc tggagaaccc accagacgcc ttcaaggcca tcaagaggga gaagctggag 781 gagccgcccg aagacagccc ccccgtggaa gaagtcagga ctgtgatcag gtttgtgacc 841 aataaaaccg acaagcacgt caccaggccg gtggtgtccc tgccttccac gtcagaggct 901 gcggcggcgt ccgccttcct ggcctcgtcc gtctcggcca agatctcctc tttaatgttg 961 ccaaacgctg ccagtatttc atccgcctca cccttctcat ctcggtcccc gtccctgtcc 1021 cccaagtcac ccctcccttc tgaacacaga agcctcttcc tggaggccgc ctgccatgac 1081 tccgattccc tggagccctt gaacctgtca tcgggctcca agaccaagtc tccatctctt 1141 cccccaaagg ccaaaaaacc caaaggcttg gaaatctcag cgcccccgct ggtgctctcc 1201 ggcaccgaca tcggctccat cgccctcaac agcccagccc tcccctcggg atccctcacc 1261 ccagccttct tcaccgcaca gacaccaaat ggattgcttc tgactccgag tccactgctc 1321 tccagcatac atttctggag cagccttagt ccagttgctc cgctgagtcc tgccaggctg 1381 caagggccaa gcacgctgtt ccagttcccc acactgctta atggccacat gccagtgcca 1441 atccccagtc tggacagagc tgcttctcca gtactgcttt cttcaaactc tcagaaatcc 1501 tgatgacgtc tggccacaat taaggactca ttaactgatg aaacaaattt gtccccacgg 1561 gctagtttac ctgtgtcgtg agaaggacat tgtgaaactc // LOCUS HSNEURA 4131 bp RNA PRI 17-FEB-1997 DEFINITION H.sapiens mRNA for neurotensin receptor. ACCESSION X70070 S54181 NID g35020 KEYWORDS G-protein coupled receptor; neurotensin receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4131) AUTHORS Vita,N. TITLE Direct Submission JOURNAL Submitted (28-DEC-1992) N. Vita, Sanofi Elf Bio Recherches, Bp 137, 31676 Laberge Cedex, FRANCE REFERENCE 2 (bases 1 to 4131) AUTHORS Vita,N., Laurent,P., Lefort,S., Chalon,P., Dumont,X., Kaghad,M., Gully,D., Le Fur,G., Ferrara,P. and Caput,D. TITLE Cloning and expression of a complementary DNA encoding a high affinity human neurotensin receptor JOURNAL FEBS Lett. 317 (1-2), 139-142 (1993) MEDLINE 93154505 FEATURES Location/Qualifiers source 1..4131 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="adenocarcinome" /cell_line="HT29" /clone_lib="HT29 plasmids cDNA" gene 373..1629 /gene="NTRR" CDS 373..1629 /gene="NTRR" /codon_start=1 /product="neurotensin receptor" /db_xref="PID:g35021" /db_xref="SWISS-PROT:P30989" /translation="MRLNSSAPGTPGTPAADPFQRAQAGLEEALLAPGFGNASGNASE RVLAAPSSELDVNTDIYSKVLVTAVYLALFVVGTVGNTVTAFTLARKKSLQSLQSTVH YHLGSLALSDLLTLLLAMPVELYNFIWVHHPWAFGDAGCRGYYFLRDACTYATALNVA SLSVERYLAICHPFKAKTLMSRSRTKKFISAIWLASALLTVPMLFTMGEQNRSADGQH AGGLVCTPTIHTATVKVVIQVNTFMSFIFPMVVISVLNTIIANKLTVMVRQAAEQGQV CTVGGEHSTFSMAIEPGRVQALRHGVRVLRAVVIAFVVCWLPYHVRRLMFCYISDEQW TPFLYDFYHYFYMVTNALFYVSSTINPILYNLVSANFRHIFLATLACLCPVWRRRRKR PAFSRKADSVSSNHTLSSNATRETLY" variation 970 /gene="NTRR" /replace="g" BASE COUNT 753 a 1418 c 1231 g 729 t ORIGIN 1 tcaagctcgc cccgcgcagc ccgagccggg ctgggcgctg tcctcggggg cctggggaac 61 cgcgcggttt ggagatcgga ggcacctgga acccgtggca agcgccgagc cgggagacag 121 cccgaggaac cacgggttct ggagctagga gccggaagct gggagtccgg aggagagcgg 181 agcccggagc ccggagcccg gggcggcgcg tctgggtctg gcgcttcccg actggacggc 241 gcgcccgctg gtcttcgcca cgcgccctcc cctgggctcg cgttcatcgg tccccgcctg 301 agacgcgccc actcctgccc ggacttccag ccccggaggc gccggacaga gccgcggact 361 ccagcgccca ccatgcgcct caacagctcc gcgccgggaa ccccgggcac gccggccgcc 421 gaccccttcc agcgggcgca ggccggactg gaggaggcgc tgctggcccc gggcttcggc 481 aacgcttcgg gcaacgcgtc ggagcgcgtc ctggcggcac ccagcagcga gctggacgtg 541 aacaccgaca tctactccaa agtgctggtg accgccgtgt acctggcgct cttcgtggtg 601 ggcacggtgg gcaacacggt gacggcgttc acgctggcgc ggaagaagtc gctgcagagc 661 ctgcagagca cggtgcatta ccacctgggc agcctggcgc tgtccgacct gctcaccctg 721 ctgctggcca tgcccgtgga gctgtacaac ttcatctggg tgcaccaccc ctgggccttc 781 ggcgacgccg gctgccgcgg ctactacttc ctgcgcgacg cctgcaccta cgccacggcc 841 ctcaacgtgg ccagcctgag tgtggagcgc tacctggcca tctgccaccc cttcaaggcc 901 aagaccctca tgtcccgaag ccgcaccaag aagttcatca gcgccatctg gctcgcctcg 961 gccctgctga cggtgcctat gctgttcacc atgggcgagc agaaccgcag cgccgacggc 1021 cagcacgccg gcggcctggt gtgcaccccc accatccaca ctgccaccgt caaggtcgtc 1081 atacaggtca acaccttcat gtccttcata ttccccatgg tggtcatctc ggtcctgaac 1141 accatcatcg ccaacaagct gaccgtcatg gtacgccagg cggccgagca gggccaagtg 1201 tgcacggtcg ggggcgagca cagcacattc agcatggcca tcgagcctgg cagggtccag 1261 gccctgcggc acggcgtgcg cgtcctacgt gcagtggtca tcgcctttgt ggtctgctgg 1321 ctgccctacc acgtgcggcg cctcatgttc tgctacatct cggatgagca gtggactccg 1381 ttcctctatg acttctacca ctacttctac atggtgacca acgcactctt ctacgtcagc 1441 tccaccatca accccatcct gtacaacctc gtctctgcca acttccgcca catcttcctg 1501 gccacactgg cctgcctctg cccggtgtgg cggcgcagga ggaagaggcc agccttctcg 1561 aggaaggccg acagcgtgtc cagcaaccac accctctcca gcaatgccac ccgcgagacg 1621 ctgtactagg ctgtgcgccc cggaacgtgt ccaggaggag cctggccatg ggtccttgcc 1681 cccgacagac agagcagccc ccacccggga gccttgatgg gggtcaggca gaggccagcc 1741 tgcactggag tctgaggcct gggacccccc cctcccaccc cctaacccat gtttctcatt 1801 agtgtctccc gggcctgtcc ccaactcctc cccacccctc ccccatctcc tctttgaaag 1861 ccagaacaag agagcgctcc tctcccagat aggaaaaggg cctctaacaa ggagaaatta 1921 gtgtgcggca aaaggcagtt ttctttgttc tcagactaat ggatggttcc agagaaggaa 1981 atgaaatgtg ctgggtgggg ccgggcctcc ggcggcccgg ctgctgttcc catgtccaca 2041 tctctgaggc ctgcaccccc tctgtctagc tcggggagtc cagccccagt cccgcaggct 2101 ccgtggcttt gggcctcacg tgcagaccct gccatgcaga cccatgcccc cctcccccag 2161 gcagctccaa gaaagctccc tgactcgccc cttcaggcct ggcaagctgg gggcccatcg 2221 ccgtggggag tccctcccac caccctcgcc gcaggcagct gcagccccca gaggggacca 2281 caagcccaaa aaggacaaaa atgggctggc ctggaatggc ccagacccca gcctcccctc 2341 ctccctccca tcctcaccca ggccaaggcc caggggctct gccaggacac cacatgggag 2401 ggggctcagg cctcagcctc aagatcttca gctgtggcct ctcgggctcg gcagaaggga 2461 cgccggatca ggggcctggt ctccagcacc tgcccgagtg gccgtggcca ggatggggtg 2521 cgcattccgt gtgctttgct tgtagctgtg caggctgagg tctggagcca ggcccagagc 2581 tggcttcagg gtggggcctt gagaagggga atgtgggaca ggggcgatgg tgcctggtct 2641 ctgagtaaga tgccaggtcc caggaactca ggcttcaggt gagaaggagc ggtgtgtcca 2701 ggcaccgctg gccggcagcc ctgggctgag gcacagactc atttgtcacc ttctggcggc 2761 ggcagccctg gccccggcct ccaagcagtt gaaaaagctg gcgcctcctt ggtctctagg 2821 atccaggctc cacagagcac atgactagcc aggcccctgg cttaagaagg tcgcctaagc 2881 ctaagagaag acagtcccag gagaagctgg ccgggaccag ccaggagctg ggagccacag 2941 gaagcaaaag tcagcctttt cttcaaggga tttccctgtc tcagagcagc ctttgcccca 3001 gggaaatggg ctctgggctg gctgcctgca ccggccatgt cgacccagga cccggacacc 3061 tggtcttggg ctgtgttcag ccactttgcc ttctctggac tcagtttccc cgtctgagaa 3121 atgagagtcg aatgctacag tatctgcagt cgcttggatc tggctgttga gttgacgggt 3181 tccttgaacc ccacaaaatc cctctccaac cacaggaccc ttcggctcac caagaacggg 3241 gcccagggga gtcaggccta ttcgctgcac ttcctgccaa actttgcccc cacaagcctg 3301 gtcatcagcc aggcagccct cccagtgccc aagggccacc aaccccaggg aaacagggcc 3361 agcacagagg ggccttcctc ccccacagag ctcccatgac atagtctgct ctgggcggaa 3421 gagctttgct gccagccagg gatgtccaga ggtcggtgca gcccctatcc ctgctcagga 3481 gtgggctcag agtctagcaa atgctaaggc ccctcaggct gggctctgaa cgaggacctg 3541 gactcagagc cagacagggc agcctcagac ccttctctgg ggctcctgga ccttgggcca 3601 taatttctga gcctcggttt ccccatctaa ggaacagatg tggtcgttcc gccctctcag 3661 ctggatgaga ctgtcctgga ggatccaccc cggaacagac agaacggtgt ctctcaggat 3721 ggtgctctga gagagggcag agtggatgcc ccactgccct agaccctcgg tagacgtggg 3781 gtctctgggg cggggtctgt ggctgtgact gaagtcggct ttcccgttga tgtcttgatg 3841 ctcctatctg tgcacttacc gtaggtaggg acacgtgtcc atgcaccaca gacacaccca 3901 cgacacctga tctcgtatca ctagcttgcg gccaggtcat gatgtggccc cggaagctgg 3961 ccctgcgtgc catgagtgcg tcggtcatgg agtccggagc ccctgagccg gcccctggtg 4021 acggcacagc cctcacagct caaacgccca cccccactcc caccatctgc aggtggtgaa 4081 aacaaacccc gtgtatctct caataaaggt ggccgaaggg cctcgatgtg g // LOCUS HSNEUROTR 3344 bp RNA PRI 02-DEC-1997 DEFINITION Homo sapiens mRNA for neurotrypsin. ACCESSION AJ001531 NID g2661423 KEYWORDS neurotrypsin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3344) AUTHORS Proba,K., Gschwend,T. and Sonderegger,P. TITLE Cloning and sequencing of the cDNA encoding human neurotrypsin JOURNAL Unpublished REFERENCE 2 (bases 1 to 3344) AUTHORS Sonderegger,P. TITLE Direct Submission JOURNAL Submitted (11-SEP-1997) Sonderegger P., Institute of Biochemistry, University of Zurich, Winterthurerstrasse 190, CH-8057 Zurich, SWITZERLAND FEATURES Location/Qualifiers source 1..3344 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="cDNA library HL3003a, Clontech" /dev_stage="fetal" /tissue_type="brain" sig_peptide 41..100 CDS 41..2668 /codon_start=1 /product="neurotrypsin" /db_xref="PID:e1198527" /db_xref="PID:g2661424" /translation="MTLARFVLALMLGALPEVVGFDSVLNDSLHHSHRHSPPAGPHYP YYLPTQQRPPTTRPPPPLPRFPRPPRALPAQRPHALQAGHTPRPHPWGCPAGEPWVSV TDFGAPCLRWAEVPPFLERSPPASWAQLRGQRHNFCRSPDGAGRPWCFYGDARGKVDW GYCDCRHGSVRLRGGKNEFEGTVEVYASGVWGTVCSSHWDDSDASVICHQLQLGGKGI AKQTPFSGLGLIPIYWSNVRCRGDEENILLCEKDIWQGGVCPQKMAAAVTCSFSHGPT FPIIRLAGGSSVHEGRVELYHAGQWGTVCDDQWDDADAEVICRQLGLSGIAKAWHQAY FGEGSGPVMLDEVRCTGNELSIEQCPKSSWGEHNCGHKEDAGVSCTPLTDGVIRLAGG KGSHEGRLEVYYRGQWGTVCDDGWTELNTYVVCRQLGFKYGKQASANHFEESTGPIWL DDVSCSGKETRFLQCSRRQWGRHDCSHREDVSIACYPGGEGHRLSLGFPVRLMDGENK KEGRVEVFINGQWGTICDDGWTDKDAAVICRQLGYKGPARARTMAYFGEGKGPIHVDN VKCTGNERSLADCIKQDIGRHNCRHSEDAGVICDYFGKKASGNSNKESLSSVCGLRLL HRRQKRIIGGKNSLRGGWPWQVSLRLKSSHGDGRLLCGATLLSSCWVLTAAHCFKRYG NSTRSYAVRVGDYHTLVPEEFEEEIGVQQIVIHREYRPDRSDYDIALVRLQGPEEQCA RFSSHVLPACLPLWRERPQKTASNCYITGWGDTGRAYSRTLQQAAIPLLPKRFCEERY KGRFTGRMLCAGNLHEHKRVDSCQGDSGGPLMCERPGESWVVYGVTSWGYGCGVKDSP GVYTKVSAFVPWIKSVTKL" mat_peptide 101..2665 /product="neurotrypsin" BASE COUNT 844 a 787 c 912 g 801 t ORIGIN 1 aagctgggga gcatggacca gaccccgcag cgctggcacc atgacgctcg cccgcttcgt 61 gctagccctg atgttagggg cgctccccga agtggtcggc tttgattctg tcctcaatga 121 ttccctccac cacagccacc gccattcgcc ccctgcgggt ccgcactacc cctattacct 181 tcccacccag cagcggcccc cgacgacgcg tccgccgccg cctctcccgc gcttcccgcg 241 ccccccgcgg gcgctccctg cccagcgccc gcacgccctc caggccgggc acacgccccg 301 gccgcacccc tggggctgcc ccgccggcga gccatgggtc agcgtgacgg acttcggcgc 361 cccgtgtctg cggtgggcgg aggtgccacc cttcctggag cggtcgcccc cagcgagctg 421 ggctcagctg cgaggacagc gccacaactt ttgtcggagc cccgacggcg cgggcagacc 481 ctggtgtttc tacggagacg cccgtggcaa ggtggactgg ggctactgcg actgcagaca 541 cggatcagta cgacttcgtg gcggcaaaaa tgagtttgaa ggcacagtgg aagtatatgc 601 aagtggagtt tggggcactg tctgtagcag ccactgggat gattctgatg catcagtcat 661 ttgtcaccag ctgcagctgg gaggaaaagg aatagcaaaa caaaccccgt tttctggact 721 gggccttatt cccatttatt ggagcaatgt ccgttgccga ggagatgaag aaaatatact 781 gctttgtgaa aaagacatct ggcagggtgg ggtgtgtcct cagaagatgg cagctgctgt 841 cacgtgtagc ttttcccatg gcccaacgtt ccccatcatt cgccttgctg gaggcagcag 901 tgtgcatgaa ggccgggtgg agctctacca tgctggccag tggggaaccg tttgtgatga 961 ccaatgggat gatgccgatg cagaagtgat ctgcaggcag ctgggcctca gtggcattgc 1021 caaagcatgg catcaggcat attttgggga agggtctggc ccagttatgt tggatgaagt 1081 acgctgcact gggaatgagc tttcaattga gcagtgtcca aagagctcct ggggagagca 1141 taactgtggc cataaagaag atgctggagt gtcctgtacc cctctaacag atggggtcat 1201 cagacttgca ggtgggaaag gcagccatga gggtcgcttg gaggtatatt acagaggcca 1261 gtggggaact gtctgtgatg atggctggac tgagctgaat acatacgtgg tttgtcgaca 1321 gttgggattt aaatatggta aacaagcatc tgccaaccat tttgaagaaa gcacagggcc 1381 catatggttg gatgacgtca gctgctcagg aaaggaaacc agatttcttc agtgttccag 1441 gcgacagtgg ggaaggcatg actgcagcca ccgcgaagat gttagcattg cctgctaccc 1501 tggcggcgag ggacacaggc tctctctggg ttttcctgtc agactgatgg atggagaaaa 1561 taagaaagaa ggacgagtgg aggtttttat caatggccag tggggaacaa tctgtgatga 1621 tggatggact gataaggatg cagctgtgat ctgtcgtcag cttggctaca agggtcctgc 1681 cagagcaaga accatggctt actttggaga aggaaaagga cccatccatg tggataatgt 1741 gaagtgcaca ggaaatgaga ggtccttggc tgactgtatc aagcaagata ttggaagaca 1801 caactgccgc cacagtgaag atgcaggagt tatttgtgat tattttggca agaaggcctc 1861 aggtaacagt aataaagagt ccctctcatc tgtttgtggc ttgagattac tgcaccgtcg 1921 gcagaagcgg atcattggtg ggaaaaattc tttaaggggt ggttggcctt ggcaggtttc 1981 cctccggctg aagtcatccc atggagatgg caggctcctc tgcggggcta cgctcctgag 2041 tagctgctgg gtcctcacag cagcacactg tttcaagagg tatggcaaca gcactaggag 2101 ctatgctgtt agggttggag attatcatac tctggtacca gaggagtttg aggaagaaat 2161 tggagttcaa cagattgtga ttcatcggga gtatcgaccc gaccgcagtg attatgacat 2221 agccctggtt agattacaag gaccagaaga gcaatgtgcc agattcagca gccatgtttt 2281 gccagcctgt ttaccactct ggagagagag gccacagaaa acagcatcca actgttacat 2341 aacaggatgg ggtgacacag gacgagccta ttcaagaaca ctacaacaag cagccattcc 2401 cttacttcct aaaaggtttt gtgaagaacg ttataagggt cggtttacag ggagaatgct 2461 ttgtgctgga aacctccatg aacacaaacg cgtggacagc tgccagggag acagcggagg 2521 accactcatg tgtgaacggc ccggagagag ctgggtggtg tatggggtga cctcctgggg 2581 gtatggctgt ggagtcaagg attctcctgg tgtttatacc aaagtctcag cctttgtacc 2641 ttggataaaa agtgtcacca aactgtaatt cttcatggaa acttcaaagc agcatttaaa 2701 caaatggaaa actttgaacc cccactatta gcactcagca gagatgacaa caaatggcaa 2761 gatctgtttt tgctttgtgt tgtggtaaaa aattgtgtac cccctgctgc ttttgagaaa 2821 tttgtgaaca ttttcagagg cctcagtgta gtggaagtga taatccttaa atgaacattt 2881 tctaccctaa tttcactgga gtgacttatt ctaagcctca tctatcccct acctatttct 2941 caaaatcatt ctatgctgat tttacaaaag atcattttta catttgaact gagaacccct 3001 tttaattgaa tcagtggtgt ctgaaatcat attaaatacc cacatttgac ataaatgcgg 3061 taccctttac tacactcatg agtggcatat ttatgcttag gtcttttcaa aagacttgac 3121 aagaaatctt catattctct gtagcctttg tcaagtgagg aaatcagtgg ttaaagaatt 3181 ccactataaa cttttaggcc tgaataggag tagtaaagcc tcaaggacat ctgcctgtca 3241 caatatattc tcaaagtgat ctgatatttg gaaacaagta tccttgttga gtaccaagtg 3301 ctacagaaac cataagataa aaatactttc tacctacagc gtgc // LOCUS HSNEUROU 817 bp RNA PRI 05-OCT-1995 DEFINITION H.sapiens mRNA for neuromedin U. ACCESSION X76029 NID g609012 KEYWORDS neuromedin U; neuromedin U peptide. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 817) AUTHORS Austin,C., Lo,G., Nandha,K., Meleagros,L. and Bloom,S.R. TITLE Cloning and characterization of the cDNA encoding the human neuromedin U (NmU) precursor: NmU expression in the human gastrointestinal tract JOURNAL J. Mol. Endocrinol. 14, 57-69 (1995) REFERENCE 2 (bases 1 to 817) AUTHORS Austin,C. TITLE Direct Submission JOURNAL Submitted (02-NOV-1993) C. Austin, Royal Posgraduate Medical School, Dept. Medicine, Francis Fraser Lab. 1st Floor, Hammersmith Hospital, Ducane Rd. London W12 ONN, UK FEATURES Location/Qualifiers source 1..817 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="pituitary" CDS 106..630 /codon_start=1 /product="neuromedin U" /db_xref="PID:g609013" /db_xref="SWISS-PROT:P48645" /translation="MLRTESCRPRSPAGQVAAASPLLLLLLLLAWCAGACRGAPILPQ GLQPEQQLQLWNEIDDTCSSFLSIDSQPQASNALEELCFMIMGMLPKPQEQDEKDNTK RFLFHYSKTQKLGKSNVVSSVVHPLLQLVPHLHERRMKRFRVDEEFQSPFASQSRGYF LFRPRNGRRSAGFI" sig_peptide 106..207 mat_peptide 529..627 /product="neuromedin U" BASE COUNT 218 a 192 c 202 g 205 t ORIGIN 1 agtcctgcgt ccgggccccg aggcgcagca gggcaccagg tggagcacca gctacgcgtg 61 gcgcagcgca gcgtccctag caccgagcct cccgcagccg ccgagatgct gcgaacagag 121 agctgccgcc ccaggtcgcc cgccggacag gtggccgcgg cgtccccgct cctgctgctg 181 ctgctgctgc tcgcctggtg cgcgggcgcc tgccgaggtg ctccaatatt acctcaagga 241 ttacagcctg aacaacagct acagttgtgg aatgagatag atgatacttg ttcgtctttt 301 ctgtccattg attctcagcc tcaggcatcc aacgcactgg aggagctttg ctttatgatt 361 atgggaatgc taccaaagcc tcaggaacaa gatgaaaaag ataatactaa aaggttctta 421 tttcattatt cgaagacaca gaagttgggc aagtcaaatg ttgtgtcgtc agttgtgcat 481 ccgttgctgc agctcgttcc tcacctgcat gagagaagaa tgaagagatt cagagtggac 541 gaagaattcc aaagtccctt tgcaagtcaa agtcgaggat attttttatt caggccacgg 601 aatggaagaa ggtcagcagg gttcatttaa aatggatgcc agctaatttt ccacagagca 661 atgctatgga atacaaaatg tactgacatt ttgttttctt ctgaaaaaaa tccttgctaa 721 atgtactctg ttgaaaatcc ctgtgttgtc aatgttctca gttgtaacaa tgttgtaaat 781 gttcaatttg ttgaaaatta aaaaatctaa aaataaa // LOCUS HSNEURVGF 3038 bp DNA PRI 14-NOV-1997 DEFINITION H.sapiens vgf gene. ACCESSION Y12661 NID g2244658 KEYWORDS neuroendocrine-specific protein; vgf gene; VGF protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3038) AUTHORS Canu,N., Possenti,R., Ricco,A.S., Rocchi,M. and Levi,A. TITLE Cloning, structural organization analysis, and chromosomal assignment of the human gene for the neurosecretory protein VGF JOURNAL Genomics 45 (2), 443-446 (1997) MEDLINE 98008940 REFERENCE 2 (bases 1 to 3038) AUTHORS Canu,N. TITLE Direct Submission JOURNAL Submitted (03-APR-1997) N. Canu, Istituto di Neurobiologia, C.N.R, Viale Marx 43, 00137 Roma, ITALY FEATURES Location/Qualifiers source 1..3038 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="EMBL3/SP6/T7" /tissue_type="placenta" mRNA join(1..196,708..3038) exon 1..196 intron 197..707 exon 708..3038 gene 724..2574 /gene="vgf" CDS 724..2574 /gene="vgf" /codon_start=1 /product="neuro-endocrine specific protein VGF" /db_xref="PID:e315149" /db_xref="PID:g2244659" /translation="MKALRLSASALFCLLLINGLGAAPPGRPEAQPPPLSSEHKEPVA GDAVPGPKDGSAPEVRGARNSEPQDEGELFQGVDPRALAAVLLQALDRPASPPAPSGS QQGPEEEAAEALLTETVRSQTHSLPAAGEPEPAAPPRPQTPENGPEASDPSEELEALA SLLQELRDFSPSSAKRQQETAAAETETRTHTLTRVNLESPGPERVWRASWGEFQARVP ERAPLPPPAPSQFQARMPDSGPLPETHKFGEGVSSPKTHLGEALAPLSKAYQGVAAPF PKARRAESALLGGSEAGERLLQQGLAQVEAGRRQAEATRQAAAQEERLADLASDLLLQ YLLQGGARQRGLGGRGLQEAAEERESAREEEEAEQERRGGEERVGEEDEEAAEAAEAE ADEAERARQNALLFAEEEDGEAGAEDKRSQEETPGHRRKEAEGTEEGGEEEDDEEMDP QTIDSLIELSTKLHLPADDVVSIIEEVEEKRNRKKKAPPEPVPPPRAAPAPTHVRSPQ PPPPPPSARDELPDWNEVLPPWDREEDEVYPPGPYHPFPNYIRPRTLQPPSALRRRHY HHALPPSRHYPGREAQARHAQQEEAEAEERRLQEQEELENYIEHVLLRRP" polyA_signal 3019..3027 BASE COUNT 524 a 1047 c 1022 g 445 t ORIGIN 1 ccagcgtgct gaagccggag cgagctagcc gcccggagcc gcgccgaccc agctgagccc 61 agcccacggg acgccagacc tcgaccgtcg ctcctacccc ggccaccgct cggagccgag 121 gcggacgcgt cccgatcttc ccctgtcccc accctgcccc gaccctcctc tccacctctc 181 gcgtcgtgac accagctggt aaatactccg ctgttcgtcc ctcaaaccct cggcagccag 241 ccgtgggcgt gagggagggt tctctctcct ctcgatgggg gtgttgcaaa cacagcgggg 301 agccccctgg taagggtccc cggtaaacgg gggagtcgca gctttttctc ttgctgctga 361 agtcgcccac gcaccatccg gggagtccta cggggaggga gcagagattt ttttttcccc 421 catattgctg ctgcttagta cgtgggcgat ggcagtgaga tggctcaggg aaggccgagg 481 aggccctggg taagcgaggg cttcgggggt tattttccca tttacacggc tccagagatc 541 ggcacaacat cttcctcctt tgctcctaaa cgttcctctt ctgggtaagg tttgggggat 601 cagggaagcc ccgggtttcc tgctgaaagg tgggggaagg gaacgtagac ctagagaggg 661 gggaattctt acagaaatcc tctttttttg gtcccttcta tttttcagtc tccggcagcc 721 tccatgaaag ccctcagatt gtcggcttcc gccctcttct gccttctgct gatcaacggg 781 ttaggggcag caccccctgg tcgccctgag gcgcagcctc ctcctctcag ctctgagcat 841 aaagagccgg tagccgggga cgcagtgccc gggccaaagg atggcagcgc cccagaggtc 901 cgaggcgctc ggaattccga gccgcaggac gagggagagc ttttccaggg cgtggatccc 961 cgggcgctgg ccgcggtgct gctgcaggca ctcgaccgtc ccgcctcacc cccggcacca 1021 agcggctccc agcaggggcc ggaggaagaa gcagctgaag ctctgctgac cgagaccgtg 1081 cgcagccaga cccacagcct cccggcggcc ggagagcccg agcccgcggc gccccctcgc 1141 cctcagactc cggagaatgg gcccgaggcg agcgatccct ccgaggagct cgaggcgcta 1201 gcgtccctgc tccaggaact gcgagatttc agtccaagta gcgccaagcg ccagcaggag 1261 acggcggcag cagagacgga aacccgcacg cacacgctga cccgagtgaa tctggagagc 1321 ccggggccag agcgcgtatg gcgcgcttcc tggggagagt tccaggcgcg tgtcccggag 1381 cgcgcgcccc tgccgccccc ggccccctct caattccagg cgcgtatgcc cgacagcggg 1441 ccccttcccg aaacccacaa gttcggggaa ggagtgtcct cccccaaaac acacctaggc 1501 gaggcattgg cacccctgtc caaggcgtac caaggcgtgg ccgccccgtt ccccaaggcg 1561 cgccgggccg agagcgcact cctgggcggc tccgaggcgg gcgagcgcct tctccagcaa 1621 gggctggcgc aggtggaggc cgggcggcgg caggcggagg ccacgcggca ggccgcggcg 1681 caggaagagc ggctggccga cctcgcctcg gacctgctgc tccagtattt gctgcagggc 1741 ggggcccggc agcgcggcct cgggggtcgg gggctgcagg aggcggcgga ggagcgagag 1801 agtgcaaggg aggaggagga ggcggagcag gagagacgcg gcggggagga gagggtgggg 1861 gaagaggatg aggaggcggc cgaggcggcg gaggcagagg cggacgaggc ggagagggcg 1921 cggcagaacg cgctcctgtt cgcggaggag gaggacgggg aagccggcgc cgaggacaag 1981 cgctcccagg aggagacgcc gggccaccgg cggaaggagg ccgaggggac agaggagggc 2041 ggggaggagg aggacgacga ggagatggat ccgcagacga tcgacagcct cattgagctg 2101 tccaccaaac tccacctgcc agcggacgac gtggtcagca tcatcgagga ggtggaggag 2161 aagcggaacc gaaagaagaa agcccctccc gagcccgtgc cgcccccccg tgccgccccc 2221 gcccccaccc acgtccgctc cccgcagccc ccgcccccgc ccccgtccgc acgagacgag 2281 ctgccggact ggaacgaggt gctcccgccc tgggatcggg aggaggacga ggtgtacccg 2341 ccagggccgt accacccttt ccccaactac atccggccgc ggacactgca gccgccctcg 2401 gccttgcgcc gccgccacta ccaccacgcc ttgccgcctt cgcgccacta tcccggccgg 2461 gaggcccagg cccggcacgc gcagcaggag gaggcggagg cggaggagcg ccggctgcag 2521 gagcaggagg agctggagaa ttacatcgag cacgtgctgc tccggcgccc gtgactgccc 2581 ttcccggtcc cgcccccgcg cgcccccgcc gcgcgcgcgc gccggcgccc ccctccgtgt 2641 tgctccccct cggtgtttgc atgcgccccg ccctgcccct tgcccctgtc cccgggctgc 2701 gtcgggacct gccagacccc cctcccgggt cctgagcccg aactcccaga gctcacccgc 2761 gggtgaccgg ggccagccca ggagggcggg tggtttgtgc gagttccctt gccacgcgcc 2821 ccggccccat caagtcctct ggggacgtcc ccgtcggaaa ccggaaaaag cagttccagt 2881 taattgtgtg aagtgtgtct gtctgtcctc ccagtcgggc ctcccacgag cccctccagc 2941 ctctccaagt cgctgtgaat gaccccttct ttcctttctc tgttgtaaat accctcacgg 3001 aggaaatagt tttgctaaga aataaaagtg actatttt // LOCUS HSNFKBS 3001 bp RNA PRI 29-APR-1992 DEFINITION H.sapiens mRNA for NF-kB subunit. ACCESSION X61498 NID g35039 KEYWORDS NF-kb subunit. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3001) AUTHORS Schmid,R.M. TITLE Direct Submission JOURNAL Submitted (26-AUG-1991) R.M. Schmid, Howard Hughes Medical Inst, Dept of Internal Medicine and Biological Chemistry, 1150 West Medical Center Drive, Ann Harbor Michigan 48109, USA REFERENCE 2 (bases 1 to 3001) AUTHORS Schmid,R.M., Perkins,N.D., Duckett,C.S., Andrews,P.C. and Nabel,G.J. TITLE Cloning of an NF-kappa B subunit which stimulates HIV transcription in synergy with p65 JOURNAL Nature 352 (6337), 733-736 (1991) MEDLINE 91343004 FEATURES Location/Qualifiers source 1..3001 /organism="Homo sapiens" /strain="B-lymophoid leukaemia" /db_xref="taxon:9606" CDS 164..2965 /codon_start=1 /product="NF-kB subunit" /db_xref="PID:g35040" /db_xref="SWISS-PROT:Q00653" /translation="MESCYNPGLDGIIEYDDFKLNSSIVEPKEPAPETADGPYLVIVE QPKQRGFRFRYGCEGPSHGGLPGASSEKGRKTYPTVKICNYEGPAKIEVDLVTHSDPP RAHAHSLVGKQCSELGICAVSVGPKDMTAQFNNLGVLHVTKKNMMGTMIQKLQRQRLR SRPQGLTEAEQRELEQEAKELKKVMDLSIVRLRFSAFLRASDGSFSLPLKPVTSQPIH DSKSPGASNLKISRMDKTAGSVRGGDEVYLLCDKVQKDDIEVRFYEDDENGWQAFGDF SPTDVHKQYAIVFRTPPYHKMKIERPVTVFLQLKRKRGGDVSDSKQFTYYPLVEDKEE VQRKRRKALPTFSQPFGGGSHMGGGSGGAAGGYGGAGGGGSLGFFPSSLAYSPYQSGA GPMRCYPGGGGGAQMAATVPSRDSGEEAAEPSAPSRTPQCEPQAPEMLQRAREYNARL FGLAHAAPSPTRLLRHRGRRALLAGQRHLLTAQDENGDTPLHLAIIHGQTSVIEQIVY VIHHAQDLGVVNLTNHLHQTPLHLAVITGQTSVVSFLLRVGADPALLDRHGDSAMHLA LRAGAGAPELLRALLQSGAPAVPQLLHMPDFEGLYPVHLAVRARSPECLDLLVDSGAE VEATERQGGRTALHLATEMEELGLVTHLVTKLRANVNARTFAGNTPLHLAAGLGYPTL TRLLLKAGADIHAENEEPLCPLPSPPTSDSDSDSEGPEKDTRSSFRGHTPLDLTCSTL VKTLLLNAAQNTMEPPLTPPSPAGPGLSLGDTALQNLEQLLDGPEAQGSWAELAERLG LRSLVDTYRQTTSPSGSLLRSYELAGGDLAGLLEALSDMGLEEGVRLLRGPETRDKLP STEVKEDSAYGSQSVEQEAEKLGPPPEPPGGLSHGHPQPQVTDLLPAPSPLPGPPVQR PHLFQILFNTPHPPLSWDK" BASE COUNT 643 a 924 c 912 g 522 t ORIGIN 1 actttcctgc cccttccccg gccaagccca actccggatc tcgctctcca ccggatctca 61 cccgccacac ccggacaggc ggctggagga ggcgggcgtc taaaattctg ggaagcagaa 121 cctggccgga gccactagac agagccgggc ctagcccaga gacatggaga gttgctacaa 181 cccaggtctg gatggtatta ttgaatatga tgatttcaaa ttgaactcct ccattgtgga 241 acccaaggag ccagccccag aaacagctga tggcccctac ctggtgatcg tggaacagcc 301 taagcagaga ggcttccgat ttcgatatgg ctgtgaaggc ccctcccatg gaggactgcc 361 cggtgcctcc agtgagaagg gccgaaagac ctatcccact gtcaagatct gtaactacga 421 gggaccagcc aagatcgagg tggacctggt aacacacagt gacccacctc gtgctcatgc 481 ccacagtctg gtgggcaagc aatgctcgga gctggggatc tgcgccgttt ctgtggggcc 541 caaggacatg actgcccaat ttaacaacct gggtgtcctg catgtgacta agaagaacat 601 gatggggact atgatacaaa aacttcagag gcagcggctc cgctctaggc cccagggcct 661 tacggaggcc gagcagcggg agctggagca agaggccaaa gaactgaaga aggtgatgga 721 tctgagtata gtgcggctgc gcttctctgc cttccttaga gccagtgatg gctccttctc 781 cctgcccctg aagccagtca cctcccagcc catccatgat agcaaatctc cgggggcatc 841 aaacctgaag atttctcgaa tggacaagac agcaggctct gtgcggggtg gagatgaagt 901 ttatctgctt tgtgacaagg tgcagaaaga tgacattgag gttcggttct atgaggatga 961 tgagaatgga tggcaggcct ttggggactt ctctcccaca gatgtgcata aacagtatgc 1021 cattgtgttc cggacacccc cctatcacaa gatgaagatt gagcggcctg taacagtgtt 1081 tctgcaactg aaacgcaagc gaggagggga cgtgtctgat tccaaacagt tcacctatta 1141 ccctctggtg gaagacaagg aagaggtgca gcggaagcgg aggaaggcct tgcccacctt 1201 ctcccagccc ttcgggggtg gctcccacat gggtggaggc tctgggggtg cagccggggg 1261 ctacggagga gctggaggag gtggcagcct cggtttcttc ccctcctccc tggcctacag 1321 cccctaccag tccggcgcgg gccccatgcg gtgctacccg ggaggcgggg gcggggcgca 1381 gatggccgcc acggtgccca gcagggactc cggggaggaa gccgcggagc cgagcgcccc 1441 ctccaggacc ccccagtgcg agccgcaggc cccggagatg ctgcagcgag ctcgagagta 1501 caacgcgcgc ctgttcggcc tggcgcacgc agccccgagc cctactcgac tactgcgtca 1561 ccgcggacgc cgcgcgctgc tggcgggaca gcgccacctg ctgacggcgc aggacgagaa 1621 cggagacaca ccactgcacc tagccatcat ccacgggcag accagtgtca ttgagcagat 1681 agtctatgtc atccaccacg cccaggacct cggcgttgtc aacctcacca accacctgca 1741 ccagacgccc ctgcacctgg cggtgatcac ggggcagacg agtgtggtga gctttctgct 1801 gcgggtaggt gcagacccag ctctgctgga tcggcatgga gactcagcca tgcatctggc 1861 gctgcgggca ggcgctggtg ctcctgagct gctgcgtgca ctgcttcaga gtggagctcc 1921 tgctgtgccc cagctgttgc atatgcctga ctttgaggga ctgtatccag tacacctggc 1981 ggtccgagcc cgaagccctg agtgcctgga tctgctggtg gacagtgggg ctgaagtgga 2041 ggccacagag cggcaggggg gacgaacagc cttgcatcta gccacagaga tggaggagct 2101 ggggttggtc acccatctgg tcaccaagct ccgggccaac gtgaacgctc gcacctttgc 2161 gggaaacaca cccctgcacc tggcagctgg actggggtac ccgaccctca cccgcctcct 2221 tctgaaggct ggtgctgaca tccatgctga aaacgaggag cccctgtgcc cactgccttc 2281 accccctacc tctgatagcg actcggactc tgaagggcct gagaaggaca cccgaagcag 2341 cttccggggc cacacgcctc ttgacctcac ttgcagcacc ttggtgaaga ccttgctgct 2401 aaatgctgct cagaacacca tggagccacc cctgaccccg cccagcccag cagggccggg 2461 actgtcactt ggtgatacag ctctgcagaa cctggagcag ctgctagacg ggccagaagc 2521 ccagggcagc tgggcagagc tggcagagcg tctggggctg cgcagcctgg tagacacgta 2581 ccgacagaca acctcaccca gtggcagcct cctgcgcagc tacgagctgg ctggcgggga 2641 cctggcaggt ctactggagg ccctgtctga catgggccta gaggagggag tgaggctgct 2701 gaggggtcca gaaacccgag acaagctgcc cagcacagag gtgaaggaag acagtgcgta 2761 cgggagccag tcagtggagc aggaggcaga gaagctgggc ccaccccctg agccaccagg 2821 agggctctcg cacgggcacc cccagcctca ggtgactgac ctgctgcctg cccccagccc 2881 ccttcccgga ccccctgtac agcgtcccca cctatttcaa atcttattta acaccccaca 2941 cccacccctc agttgggaca aataaaggat tctcatggga aggggaggac cccgaattcc 3001 t // LOCUS HSNFYA 1383 bp RNA PRI 18-JAN-1995 DEFINITION H.sapiens mRNA for CAAT-box DNA binding protein subunit A. ACCESSION X59711 NID g35047 KEYWORDS CAAT-box DNA binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1383) AUTHORS Benoist,C. TITLE Direct Submission JOURNAL Submitted (21-JAN-1992) C. Benoist, L.G.M.E., Dept of Immunology, 11, Rue Humann, Strassbourg 67000, FRANCE REFERENCE 2 (bases 1 to 1383) AUTHORS Li,X.Y., Mantovani,R., Hooft van Huijsduijnen,R., Andre,I., Benoist,C. and Mathis,D. TITLE Evolutionary variation of the CCAAT-binding transcription factor NF-Y JOURNAL Nucleic Acids Res. 20 (5), 1087-1091 (1992) MEDLINE 92195809 REMARK Erratum:[Nucleic Acids Res 1992 Apr 11;20(7):1841]] COMMENT . FEATURES Location/Qualifiers source 1..1383 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 175..1218 /codon_start=1 /product="CAAT-box DNA binding protein subunit A" /db_xref="PID:g35048" /db_xref="SWISS-PROT:P23511" /translation="MEQYTANSNSSTEQIVVQAGQIQQQQQGGVTAVQLQTEAQVASA SGQQVQTLQVVQGQPLMVQVSGGQLITSTGQPIMVQAVPGGQGQTIMQVPVSGTQGLQ QIQLVPPGQIQIQGGQAVQVQGQQGQTQQIIIQQPQTAVTAGQTQTQQQIAVQGQQVA QTAEGQTIVYQPVNADGTILQQVTVPVSGMITIPAASLAGAQIVQTGANTNTTSSGQG TVTVTLPVAGNVVNSGGMVMMVPGAGSVPAIQRIPLPGAEMLEEEPLYVNAKQYHRIL KRRQARAKLEAEGKIPKERRKYLHESRHRHAMARKRGEGGRFFSPKEKDSPHMQDPNQ ADEEAMTQIIRVS" BASE COUNT 377 a 356 c 381 g 269 t ORIGIN 1 acggaattcc cggcagtggc ggcggcagcg gcggctggag cctcctgatt gggtttcgga 61 gtccggtact ggagccaatc agcgcgggca gcgaaccggg ggagcgaggc acggagtgta 121 cctcacagcc ttctaggatc tccagagtgg acaggaatct cacttggagg gaccatggag 181 cagtatacag caaacagcaa tagttcgaca gagcagattg ttgtccaggc aggacagatt 241 cagcagcagc agcagggtgg tgtcactgct gtgcagttgc agactgaggc ccaggtggca 301 tccgcctcag gccagcaagt ccagaccctc caggtagtcc aagggcagcc attaatggtg 361 caggtcagtg gaggccagct aatcacatca actggccaac ccatcatggt ccaggctgtc 421 cctggtggac aaggtcaaac catcatgcaa gtacctgttt ctggaacaca gggtttgcag 481 caaatacagt tggtcccacc tggacagatc cagatccagg gtggacaggc tgtgcaggtg 541 cagggccagc agggccagac ccagcagatc atcatccagc agccccagac ggctgtcact 601 gctggccaga ctcagacaca gcagcagatt gctgtccagg gacagcaagt ggcacagact 661 gctgaagggc agaccatcgt ctatcaacca gttaatgcag atggcaccat tctccagcaa 721 gttacagtcc ctgtttcagg catgatcact atcccagcag ccagtttggc aggagcacag 781 attgttcaaa caggagccaa taccaacaca accagcagtg ggcaagggac tgtcactgtg 841 acactaccag tggcaggcaa tgtggtcaat tcaggaggga tggtcatgat ggttcctggg 901 gctggctctg tgcctgctat ccaaagaatc cctctacctg gagcagagat gcttgaagaa 961 gagcctctct acgtgaatgc caaacaatac caccgtattc ttaagaggag gcaagcccga 1021 gctaaactag aggcagaagg gaaaattcca aaggagagaa ggaaatacct gcatgagtct 1081 cggcaccgtc atgccatggc acggaagcgt ggtgaaggtg gacgattttt ctctccaaag 1141 gaaaaggata gtccccatat gcaggatcca aaccaagccg atgaagaagc aatgacacag 1201 atcatccgag tgtcctaacc ccacgccatg tgatggagct gatcaaggtc atgtttctca 1261 ctgttccagg aaattgatca actcttccaa tgggacattg atgatcacat tctgcccttt 1321 actacaggac agaaaccact tagtttttaa taagtggctc agtattacaa ttgttaacaa 1381 acg // LOCUS HSNFYB 739 bp RNA PRI 18-JAN-1995 DEFINITION H.sapiens mRNA for CAAT-box DNA binding protein subunit B (NF-YB). ACCESSION X59710 NID g35049 KEYWORDS CAAT-box DNA binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 739) AUTHORS Benoist,C. TITLE Direct Submission JOURNAL Submitted (21-JAN-1992) C. Benoist, L.G.M.E., Dept of Immunology, 11, Rue Humann, Strassbourg 67000, FRANCE REFERENCE 2 (bases 1 to 739) AUTHORS Li,X.Y., Mantovani,R., Hooft van Huijsduijnen,R., Andre,I., Benoist,C. and Mathis,D. TITLE Evolutionary variation of the CCAAT-binding transcription factor NF-Y JOURNAL Nucleic Acids Res. 20 (5), 1087-1091 (1992) MEDLINE 92195809 REMARK Erratum:[Nucleic Acids Res 1992 Apr 11;20(7):1841]] COMMENT . FEATURES Location/Qualifiers source 1..739 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 112..729 /codon_start=1 /product="CAAT-box DNA binding protein subunit B (NF-YB)" /db_xref="PID:g35050" /db_xref="SWISS-PROT:P25208" /translation="MDGDSSTTDASQLGISADYIGGSHYVIQPHDDTEDSMNDHEDTN GSKESFREQDIYLPIANVARIMKNAIPQTGKIAKDAKECVQECVSEFISFITSEASER CHQEKRKTINGEDILFAMSTLGFDSYVEPLKLYLQKFREAMKGEKGIGGAVTATDGLS EELTEEAFTNQLPAGLITTDGQQQNVMVYTTSYQQISGVQQIQFS" BASE COUNT 264 a 127 c 159 g 189 t ORIGIN 1 gaattccaac cagggctgca ttggaggttg aaatcacaaa gattagacac ctttttagat 61 aggtgttctt cagcaccact gacaacacgg ttctgacagt atttcatgac aatggatggt 121 gacagttcta caacagatgc ttctcaacta ggaatctctg cagactatat tggaggaagt 181 cattatgtta tacagcctca tgatgatact gaggacagca tgaatgatca tgaagacaca 241 aatggttcaa aagaaagttt cagagaacaa gatatatatc ttccaatagc aaacgtggct 301 aggataatga aaaatgccat acctcaaacg ggaaagattg caaaagatgc caaagaatgt 361 gttcaagaat gtgtaagtga gttcatcagt tttataacat ctgaagcaag tgaaaggtgc 421 catcaagaga aacggaaaac aatcaatgga gaagatattc tctttgctat gtctacttta 481 ggctttgaca gttatgtgga acctctgaaa ttataccttc agaaattcag agaggctatg 541 aaaggagaaa agggaattgg tggagcagtc acagctacag atggactaag tgaagagctt 601 acagaggagg catttactaa ccagttacca gctggcttaa taaccacaga cggtcaacaa 661 caaaatgtta tggtttacac aacatcatat caacagattt ctggtgttca gcaaattcag 721 ttttcatgat ctgaagaaa // LOCUS HSNGF2 883 bp RNA PRI 11-MAR-1993 DEFINITION H.sapiens mRNA for NGF-2. ACCESSION X53655 NID g287794 KEYWORDS nerve growth factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 883) AUTHORS Kaisho,Y., Yoshimura,K. and Nakahama,K. TITLE Cloning and expression of a cDNA encoding a novel human neurotrophic factor JOURNAL FEBS Lett. 266 (1-2), 187-191 (1990) MEDLINE 90306351 FEATURES Location/Qualifiers source 1..883 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Lambda gt11 human glioma (Hs683) cDNA library" sig_peptide 44..457 /gene="NGF" /product="nerve growth factor" CDS 44..817 /gene="NGF" /codon_start=1 /product="nerve growth factor" /db_xref="PID:g287795" /db_xref="SWISS-PROT:P20783" /translation="MSILFYVIFLAYLRGIQGNNMDQRSLPEDSLNSLIIKLIQADIL KNKLSKQMVDVKENYQSTLPKAEAPREPERGGPAKSAFQPVIAMDTELLRQQRRYNSP RVLLSDSTPLEPPPLYLMEDYVGSPVVANRTSRRKRYAEHKSHRGEYSVCDSESLWVT DKSSAIDIRGHQVTVLGEIKTGNSPVKQYFYETRCKEARPVKNGCRGIDDKHWNSQCK TSQTYVRALTSENNKLVGWRWIRIDTSCVCALSRKIGRT" gene 44..817 /gene="NGF" mat_peptide 458..814 /gene="NGF" /product="nerve growth factor" BASE COUNT 248 a 214 c 229 g 192 t ORIGIN 1 tgccatggtt acttttgcca cgatcttaca ggtgaacaag gtgatgtcca tcttgtttta 61 tgtgatattt ctcgcttatc tccgtggcat ccaaggtaac aacatggatc aaaggagttt 121 gccagaagac tcgctcaatt ccctcattat taagctgatc caggcagata ttttgaaaaa 181 caagctctcc aagcagatgg tggacgttaa ggaaaattac cagagcaccc tgcccaaagc 241 tgaggctccc cgagagccgg agcggggagg gcccgccaag tcagcattcc agccagtgat 301 tgcaatggac accgaactgc tgcgacaaca gagacgctac aactcaccgc gggtcctgct 361 gagcgacagc acccccttgg agcccccgcc cttgtatctc atggaggatt acgtgggcag 421 ccccgtggtg gcgaacagaa catcacggcg gaaacggtac gcggagcata agagtcaccg 481 aggggagtac tcggtatgtg acagtgagag tctgtgggtg accgacaagt catcggccat 541 cgacattcgg ggacaccagg tcacggtgct gggggagatc aaaacgggca actctcccgt 601 caaacaatat ttttatgaaa cgcgatgtaa ggaagccagg ccggtcaaaa acggttgcag 661 gggtattgat gataaacact ggaactctca gtgcaaaaca tcccaaacct acgtccgagc 721 actgacttca gagaacaata aactcgtggg ctggcggtgg atacggatag acacgtcctg 781 tgtgtgtgcc ttgtcgagaa aaatcggaag aacatgaatt ggcatctctc cccatatata 841 aattattact ttaaattata tgatatgcat gtagcatata aat // LOCUS HSNGFIPC3 796 bp RNA PRI 12-FEB-1997 DEFINITION H.sapiens mRNA for NGF-inducible PC3 anti-proliferative protein. ACCESSION Y09943 NID g1841431 KEYWORDS nerve growth factor-inducible anti-proliferative gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 796) AUTHORS Buanne,P., Incerti,B., Ballabio,A., Guardavaccaro,D. and Tirone,F. TITLE Molecular cloning of the human homologue of nerve Growth factor-inducible PC3 (pheochromocytoma cell-3) gene JOURNAL Unpublished REFERENCE 2 (bases 1 to 796) AUTHORS Tirone,F. TITLE Direct Submission JOURNAL Submitted (10-DEC-1996) F. Tirone, Consiglio Nazionale Ricerche, Institute of Neurobiology, Viale Marx 43, 00137 Rome, ITALY REMARK revised by [3] REFERENCE 3 (bases 1 to 796) AUTHORS Tirone,F. TITLE Direct Submission JOURNAL Submitted (12-FEB-1917) F. Tirone, Consiglio Nazionale Ricerche, Institute of Neurobiology, Viale Marx 43, 00137 Rome, ITALY COMMENT Publication showing that PC3 is antiproliferative: Montagnoli, A., Guardavaccaro, D., Starace, G. and Tirone, F. Over expression of the nerve growth factor-inducible PC3 immediate early gene is associated to growth inhibition. Cell Growth & Differ., 7, 1327-1336, 1996. FEATURES Location/Qualifiers source 1..796 /organism="Homo sapiens" /isolate="Soares fetal liver spleen INFLS cDNA library" /db_xref="taxon:9606" /tissue_type="liver" /tissue_type="spleen" /clone="ye67b04" /clone_lib="dBEST IS: 169818" /dev_stage="20 week foetus" /lab_host="DH10B" gene 64..540 /gene="NGF-inducible PC3" CDS 64..540 /gene="NGF-inducible PC3" /note="nerve growth factor-inducible anti-proliferative" /codon_start=1 /db_xref="PID:e301195" /db_xref="PID:g1841432" /translation="MSHGKGTDMLPEIAAAVGFLSSLLRTRGCVSEQRLKVFSGALQE ALTEHYKHHWFPEKPSKGSGYRCIRINHKMDPIISRVASQIGLSQPQLHQLLPSELTL WVDPYEVSYRIGEDGSICVLYEEAPLAASCGLLTCKNQVLLGRSSPSKNYVMAVSS" BASE COUNT 172 a 251 c 217 g 156 t ORIGIN 1 gctgtcttgt ggacccgcac ttcccacccg agacctctca ctgagcccga gccgcgcgcg 61 acaatgagcc acgggaaggg aaccgacatg ctcccggaga tcgccgccgc cgtgggcttc 121 ctctccagcc tcctgaggac ccggggctgc gtgagcgagc agaggcttaa ggtcttcagc 181 ggggcgctcc aggaggcact cacagagcac tacaaacacc actggtttcc cgaaaagccg 241 tccaagggct ccggctaccg ctgcattcgc atcaaccaca agatggaccc catcatcagc 301 agggtggcca gccagatcgg actcagccag ccccagctgc accagctgct gcccagcgag 361 ctgaccctgt gggtggaccc ctatgaggtg tcctaccgca ttggggagga cggctccatc 421 tgcgtcttgt acgaggaggc cccactggcc gcctcctgtg ggctcctcac ctgcaagaac 481 caagtgctgc tgggccggag cagcccctcc aagaactacg tgatggcagt ctccagctag 541 gcccttccgc ccccgccctg ggcgccgccg tgctcatgct gccgtgacaa caggccacca 601 catacctcaa cctggggaac tgtattttta aatgaagagc tatttatata tattattttt 661 ttttaagaaa ggaggaaaag aaaccaaaag ttttttttaa gaaaaaaaat ccttcaaggg 721 agctgcttgg aagtggcctc cccaggtgcc tttggagaaa ctgttgcgtg cttgatctgt 781 gagccagtgt ctgcct // LOCUS HSNGIPC4 1791 bp RNA PRI 19-DEC-1997 DEFINITION H.sapiens mRNA for nerve growth factor-inducible PC4 homologue. ACCESSION Y10313 NID g2706510 KEYWORDS nerve growth factor-inducible protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1791) AUTHORS Buanne,P., Incerti,B., Guardavaccaro,D., Avvantaggiato,V., Simeone,A. and Tirone,F. TITLE Cloning of the human PC4 gene from chromosome 7q22-31 JOURNAL Unpublished REFERENCE 2 (bases 1 to 1791) AUTHORS Tirone,F. TITLE Direct Submission JOURNAL Submitted (03-JAN-1997) F. Tirone, National Research Council, Institute of Neurobiology, Viale Marx 43, 00137 Rome, ITALY REMARK revised by [3] REFERENCE 3 (bases 1 to 1791) AUTHORS Tirone,F. TITLE Direct Submission JOURNAL Submitted (17-DEC-1997) F. Tirone, National Research Council, Institute of Neurobiology, Viale Marx 43, 00137 Rome, ITALY FEATURES Location/Qualifiers source 1..1791 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" /tissue_type="whole brain" /dev_stage="73 days post natal" /lab_host="DH10B" /clone_lib="Soares 1NIB" /map="q22-31" CDS 220..1581 /function="putative growth factor-sensitive positive regulator of cell differentiation" /codon_start=1 /product="nerve growth factor-inducible PC4 homologue" /db_xref="PID:e1216872" /db_xref="PID:g2706511" /translation="MPKNKKRNTPHRGSSAGGGGSGAAAATAATAGGQHRNVQPFSDE DASIETMSHCSGYSDPSSFAEDGPEVLDEEGTQEDLEYKRKGLIDLTLDKSAKTRQAA LEGIKNALASKMLYEFILERRMTLTDSIERCLKKGKSDEQRAAAALASVLCIQLGPGI ESEEILKTLGPILKKIICDGSASMQARQTCATCFGVCCFIATDDITELYSTLECLENI FTKSYLKEKDTTVICSTPNTVLHISSLLAWTLLLTICPINEVKKKLEMHFHKLPSLLS CDDVNMRIAAGESLALLFELARGIESDFFYEDMESLTQMLRALATDGNKHRAKVDKRK QRSVFRDVLRAVEERDFPTETIKFGPERMYIDCWVKKHTYDTFKEVLGSGMQYPLAVK MEFLENVFETWTPSDALMLQRLKTMKISRFERHLYNSAAFKARTKARSKCRDKRADVG EFF" BASE COUNT 524 a 375 c 409 g 483 t ORIGIN 1 cctcgtgcca gagaaacatg tatcgttttc gatcacagct cttcacgggg atttctgctg 61 ccgccaccgc ccactcttac ccccgccgct tctcgactct gttgttagcc gaagactcgc 121 ctctcagccg cccgccgcac agacgcacga gtaaaaagtg cagctccatc ggctgatcct 181 cgctaagctc cgactctggg cggcaccggg cgtcccacga tgccgaagaa caagaagcgg 241 aacactcccc accgcggtag cagtgctggc ggcggcgggt caggagcagc cgcagcgacg 301 gcggcgacag caggtggcca gcatcgaaat gttcagcctt ttagtgatga agatgcatca 361 attgaaacaa tgagccattg cagtggttat agcgatcctt ccagttttgc tgaagatgga 421 ccagaagtcc ttgatgagga aggaactcaa gaagacctag agtacaagag aaagggatta 481 attgacctaa ccctggataa gagtgcgaag acaaggcaag cagctcttga aggtattaaa 541 aatgcactgg cttcaaaaat gctgtatgaa tttattctgg aaaggagaat gactttaact 601 gatagcattg aacgctgcct gaaaaaaggt aagagtgatg agcaacgtgc agctgcagcg 661 ttagcatctg ttctttgtat tcagctgggc cctggaattg aaagtgaaga gattttgaaa 721 actcttggac caatcctaaa gaaaatcatt tgtgatgggt cagctagtat gcaggctagg 781 caaacttgtg caacttgctt tggtgtttgc tgttttattg ccacagatga cattactgaa 841 ctatactcaa ctctggaatg tttggaaaat atcttcacta aatcctatct caaagagaaa 901 gacactactg ttatttgcag cactcctaat acagtgcttc atatcagctc tcttcttgca 961 tggacactac tgctgaccat atgcccaatc aatgaagtga agaaaaagct tgagatgcat 1021 ttccataagc ttccaagcct cctctcttgt gatgatgtaa acatgagaat agctgctggt 1081 gaatctttgg cacttctctt tgaattggcc agaggaatag agagtgactt tttttatgaa 1141 gacatggagt ccttgacgca gatgcttagg gccttggcaa cagatggaaa taaacaccgg 1201 gccaaagtgg acaagagaaa gcagcggtca gttttcagag atgtcctgag ggcagtggag 1261 gaacgggatt ttccaacaga aaccattaaa tttggtcctg aacgcatgta tattgattgc 1321 tgggtaaaaa aacacaccta tgacaccttt aaggaggttc ttggatcagg gatgcagtac 1381 ccacttgcag tcaaaatgga attcctcgaa aatgtatttg aaacttggac ccccagtgat 1441 gccttgatgc tgcaacgcct taaaacgatg aagatttctc gtttcgaaag gcatttatat 1501 aactctgcag ccttcaaagc tcgaaccaaa gctagaagca aatgtcgaga taagagagca 1561 gatgttggag aattcttcta gattttcaga acttgaagac tattttctaa tttctatttt 1621 tttttctatt tcaatgtatt taaactctag acacagtttt tatcttggat taacttagat 1681 aacttttgta gccagtggtt atattgctta taatttaatg tacaatacta ttgaaactgg 1741 tgagttctga ttattaaata ttctctgtaa atcagtaaac atgtataaag t // LOCUS HSNGRC3 1194 bp RNA PRI 14-JAN-1997 DEFINITION H.sapiens mRNA for neurogranin. ACCESSION Y09689 NID g1702921 KEYWORDS calmodulin binding protein; neurogranin; Ng/RC3 gene; protein kinase C substrate; RC3 protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1194) AUTHORS Gundelfinger,E.D. TITLE Direct Submission JOURNAL Submitted (28-NOV-1996) E.D. Gundelfinger, Institute for Neurobiology, Dept. of Neurochemistry and Molecular Biology, Postfach 1860, D- 39008 Magdeburg, FRG REFERENCE 2 (bases 1 to 1194) AUTHORS Mertsalov,I.B., Gundelfinger,E. and Tsetlin,V.I. TITLE Cloning cDNA for human neurogranin JOURNAL Bioorg. Khim. 22 (5), 366-369 (1996) MEDLINE 97041074 FEATURES Location/Qualifiers source 1..1194 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" gene 130..1176 /gene="Ng/RC3" CDS 130..366 /gene="Ng/RC3" /codon_start=1 /product="neurogranin" /db_xref="PID:e283789" /db_xref="PID:g1702922" /translation="MDCCTENACSKPDDDILDIPLDDPGANAAAAKIQASFRGHMARK KIKSGERGRKGPGPGGPGGAGVARGGAGGGPSGD" polyA_signal 1171..1176 /gene="Ng/RC3" BASE COUNT 209 a 429 c 336 g 220 t ORIGIN 1 agcagagctg ctgtttcggc gcgggtcggc tggcggccga ctgccccaga gcccccaccc 61 ggcaccacac agaccccacc cccgccctgc gccagccttc gtccccgcag aggacccccc 121 gacaccagca tggactgctg caccgagaac gcctgctcca agccggacga cgacattcta 181 gacatcccgc tggacgatcc cggcgccaac gcggccgccg ccaaaatcca ggcgagtttt 241 cggggccaca tggcgcggaa gaagataaag agcggagagc gcggccggaa gggcccgggc 301 cctggggggc ctggcggagc tggggtggcc cggggaggcg cgggcggcgg ccccagcgga 361 gactaggcca gaactgagca ttttcaaagt tcccgaggag agatggatgc cgcgtcccct 421 tcgcagcgac gagacttccc tgccgtgttt gtgaccccct cctgcccagc aacctgccag 481 ctacaggagc cccctgcgtc ccagagactc cctcacccag gcaggctccg tcgcggagtc 541 gctgagtccg tgccctttta gttagttctg cagtctagta tggtccccat ttgcccttcc 601 actccacccc accctaaacc atgcgctccc aatcttcctt cttttgcttc tcgcccacct 661 cttcccgcac ccagcatgca gctctgcctc cgcagcctca gtgcgctttc ctgcgcgcac 721 tgcggagggc gccctaagcg tcacccaagc acactcactt aaagaaaaaa cgagttcttt 781 cgttctgtgc gcagctaaaa ggggcgccct acatctccgt gccactcccg ccccagccta 841 gccccaagac tttggatccg gggcgagatg aagggaagag ggttgttttg gtttcggacg 901 acccttgctc tgaccggaag agaagtccct atcccacacc tgcctgtcac gttccctccc 961 ctttccccag cgcactgttg agggcagcct ctccagctct cttgtttatg caaacgccga 1021 gcgcctggga ggctcggtag gaggagtctt ccacggcccc gccccgcccc tgtcggtccc 1081 gccctccccc ccgccgggct cctggggctg tggccgaaag gtttctgatc tccgtgtgtg 1141 catgtgactg tgctgggttg gaatgtgaac aataaagagg aatgtccaag tgtt // LOCUS HSNIK 4596 bp mRNA PRI 13-JAN-1998 DEFINITION H.sapiens mRNA for serine/threonine protein kinase, NIK. ACCESSION Y10256 NID g1841433 KEYWORDS MAP kinase; NIK protein; serine/threonine protein kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4596) AUTHORS Malinin,N.L., Boldin,M.P., Kovalenko,A.V. and Wallach,D. TITLE MAP3K-related kinase involved in NF-kappaB induction by TNF, CD95 and IL-1 JOURNAL Nature 385 (6616), 540-544 (1997) MEDLINE 97172277 REFERENCE 2 (bases 1 to 4596) AUTHORS Wallach,D. TITLE Direct Submission JOURNAL Submitted (23-DEC-1996) D. Wallach, The Weizmann Institute, Dept of Membrane Research & Biophysics, Rehovot 76100, ISRAEL COMMENT NIK is a serine/threonine protein-kinase, resembling several MAP kinase kinase kinases (MAP3K), that binds specifically to TRAF2, an adapter proteins associated, either directly or through interaction with other adapter proteins, with several receptors of the TNF/NGF family. NIK overexpression in cells activates the transcription factor NF kappa B. Cellular expression of kinase-deficient NIK-mutants blocks NF kappa B induction by TNF, by either of the two TNF receptors, by CD95 (Fas/Apo-1) and by TRADD, RIP and MORT1/FADD, adapter proteins that bind to these receptors. It also blocks NF kappa B induction by IL-1. FEATURES Location/Qualifiers source 1..4596 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="B cells" misc_feature 137..149 /note="lysin-rich region" CDS 233..3076 /codon_start=1 /product="NIK, serine/threonine protein-kinase" /db_xref="PID:e290770" /db_xref="PID:g1841434" /translation="MAVMEMACPGAPGSAVGQQKELPKPKEKTPPLGKKQSSVYKLEA VEKSPVFCGKWEILNDVITKGTAKEGSEAGPAAISIIAQAECENSQEFSPTFSERIFI AGSKQYSQSESLDQIPNNVAHATEGKMARVCWKGKRRSKARKKRKKKSSKSLAHAGVA LAKPLPRTPEQESCTIPVQEDESPLGAPYVRNTPQFTKPLKEPGLGQLCFKQLGEGLR PALPRSELHKLISPLQCLNHVWKLHHPQDGGPLPLPTHPFPYSRLPHPFPFHPLQPWK PHPLESFLGKLACVDSQKPLPDPHLSKLACVDSPKPLPGPHLEPSCLSRGAHEKFSVE EYLVHALQGSVSSSQAHSLTSLAKTWAARGSRSREPSPKTEDNEGVLLTEKLKPVDYE YREEVHWATHQLRLGRGSFGEVHRMEDKQTGFQCAVKKVRLEVFRAEELMACAGLTSP RIVPLYGAVREGPWVNIFMELLEGGSLGQLVKEQGCLPEDRALYYLGQALEGLEYLHS RRILHGDVKADNVLLSSDGSHAALCDFGHAVCLQPDGLGKSLLTGDYIPGTETHMAPE VVLGRSCDAKVDVWSSCCMMLHMLNGCHPWTQFFRGPLCLKIASEPPPVREIPPSCAP LTAQAIQEGLRKEPIHRVSAAELGGKVNRALQQVGGLKSPWRGEYKEPRHPPPNQANY HQTLHAQPRELSPRAPGPRPAEETTGRAPKLQPPLPPEPPEPNKSPPLTLSKEESGMW EPLPLSSLEPAPARNPSSPERKATVPEQELQQLEIELFLNSLSQPFSLEEQEQILSCL SIDSLSLSDDSEKNPSKASQSSRDTLSSGVHSWSSQAEARSSSWNMVLARGRPTDTPS YFNGVKVQIQSLNGEHLHIREFHRVKVGDIATGISSQIPAAAFSLVTKDGQPVRYDME VPDSGIDLQCTLAPDGSFAWSWRVKHGQLENRP" misc_feature 401..653 /note="kinase domain" misc_feature 711..729 /note="proline-rich region" BASE COUNT 1055 a 1401 c 1299 g 841 t ORIGIN 1 aagcggggga ctgtgccgtg tggaacgtgt agctgttgag aggtggactc tgttaccatt 61 gaggatgttt ggaggatgag tatgtgtggc agaggcacac ataaacaggc agagaccctt 121 tgcccctgcc tttctccccc aacccaaggc tgacctgtgt tctcccaggt ctgggattct 181 aagtgacctg ctctgtgttt ggtctctctc aggatgagca caagcctggg agatggcagt 241 gatggaaatg gcctgcccag gtgcccctgg ctcagcagtg gggcagcaga aggaactccc 301 caagccaaag gagaagacgc cgccactggg gaagaaacag agctccgtct acaagcttga 361 ggccgtggag aagagccctg tgttctgcgg aaagtgggag atcctgaatg acgtgattac 421 caagggcaca gccaaggaag gctccgaggc agggccagct gccatctcta tcatcgccca 481 ggctgagtgt gagaatagcc aagagttcag ccccaccttt tcagaacgca ttttcatcgc 541 tgggtccaaa cagtacagcc agtccgagag tcttgatcag atccccaaca atgtggccca 601 tgctacagag ggcaaaatgg cccgtgtgtg ttggaaggga aagcgtcgca gcaaagcccg 661 gaagaaacgg aagaagaaga gctcaaagtc cctggctcat gcaggagtgg ccttggccaa 721 acccctcccc aggacccctg agcaggagag ctgcaccatc ccagtgcagg aggatgagtc 781 tccactcggc gccccatatg ttagaaacac cccgcagttc accaagcctc tgaaggaacc 841 aggccttggg caactctgtt ttaagcagct tggcgagggc ctacggccgg ctctgcctcg 901 atcagaactc cacaaactga tcagcccctt gcaatgtctg aaccacgtgt ggaaactgca 961 ccacccccag gacggaggcc ccctgcccct gcccacgcac cccttcccct atagcagact 1021 gcctcatccc ttcccattcc accctctcca gccctggaaa cctcaccctc tggagtcctt 1081 cctgggcaaa ctggcctgtg tagacagcca gaaacccttg cctgacccac acctgagcaa 1141 actggcctgt gtagacagtc caaagcccct gcctggccca cacctggagc ccagctgcct 1201 gtctcgtggt gcccatgaga agttttctgt ggaggaatac ctagtgcatg ctctgcaagg 1261 cagcgtgagc tcaagccagg cccacagcct gaccagcctg gccaagacct gggcagcacg 1321 gggctccaga tcccgggagc ccagccccaa aactgaggac aacgagggtg tcctgctcac 1381 tgagaaactc aagccagtgg attatgagta ccgagaagaa gtccactggg ccacgcacca 1441 gctccgcctg ggcagaggct ccttcggaga ggtgcacagg atggaggaca agcagactgg 1501 cttccagtgc gctgtcaaaa aggtgcggct ggaagtattt cgggcagagg agctgatggc 1561 atgtgcagga ttgacctcac ccagaattgt ccctttgtat ggagctgtga gagaagggcc 1621 ttgggtcaac atcttcatgg agctgctgga aggtggctcc ctgggccagc tggtcaagga 1681 gcagggctgt ctcccagagg accgggccct gtactacctg ggccaggccc tggagggtct 1741 ggaatacctc cactcacgaa ggattctgca tggggacgtc aaagctgaca acgtgctcct 1801 gtccagcgat gggagccacg cagccctctg tgactttggc catgctgtgt gtcttcaacc 1861 tgatggcctg ggaaagtcct tgctcacagg ggactacatc cctggcacag agacccacat 1921 ggctccggag gtggtgctgg gcaggagctg cgacgccaag gtggatgtct ggagcagctg 1981 ctgtatgatg ctgcacatgc tcaacggctg ccacccctgg actcagttct tccgagggcc 2041 gctctgcctc aagattgcca gcgagcctcc gcctgtgagg gagatcccac cctcctgcgc 2101 ccctctcaca gcccaggcca tccaagaggg gctgaggaaa gagcccatcc accgcgtgtc 2161 tgcagcggag ctgggaggga aggtgaaccg ggcactacag caagtgggag gtctgaagag 2221 cccttggagg ggagaatata aagaaccaag acatccaccg ccaaatcaag ccaattacca 2281 ccagaccctc catgcccagc cgagagagct ttcgccaagg gccccagggc cccggccagc 2341 tgaggagaca acaggcagag cccctaagct ccagcctcct ctcccaccag agcccccaga 2401 gccaaacaag tctcctccct tgactttgag caaggaggag tctgggatgt gggaaccctt 2461 acctctgtcc tccctggagc cagcccctgc cagaaacccc agctcaccag agcggaaagc 2521 aaccgtcccg gagcaggaac tgcagcagct ggaaatagaa ttattcctca acagcctgtc 2581 ccagccattt tctctggagg agcaggagca aattctctcg tgcctcagca tcgacagcct 2641 ctccctgtcg gatgacagtg agaagaaccc atcaaaggcc tctcaaagct cgcgggacac 2701 cctgagctca ggcgtacact cctggagcag ccaggccgag gctcgaagct ccagctggaa 2761 catggtgctg gcccgggggc ggcccaccga caccccaagc tatttcaatg gtgtgaaagt 2821 ccaaatacag tctcttaatg gtgaacacct gcacatccgg gagttccacc gggtcaaagt 2881 gggagacatc gccactggca tcagcagcca gatcccagct gcagccttca gcttggtcac 2941 caaagacggg cagcctgttc gctacgacat ggaggtgcca gactcgggca tcgacctgca 3001 gtgcacactg gcccctgatg gcagcttcgc ctggagctgg agggtcaagc atggccagct 3061 ggagaacagg ccctaaccct gccctccacc gccggctcca cactgccgga aagcagcctt 3121 cctgctcggt gcacgatgct gccctgaaaa cacaggctca gccgttccca ggggattgcc 3181 agccccccgg ctcacagtgg gaaccagggc ctcgcagcag caaggtgggg gcaagcagaa 3241 tgcctcccag gatttcacac ctgagccctg ccccaccctg ctgaaaaaac atccgccacg 3301 tgaagagaca gaaggaggat ggcaggagtt acctggggaa acaaaacagg gatctttttc 3361 tgcccctgct ccagtcgagt tggcctgacc cgcttggatc agtgaccatt tgttggcaga 3421 caggggagag cagcttccag cctgggtcag aaggggtggg cgagcccttc ggcccctcac 3481 cctccaggct gctgtgagag tgtcaagtgt gtaagggccc aaactcaggt tcagtgcaga 3541 accaggtcag caggtatgcc cgcccgtagg ttaagggggc cctctaaacc ccttgcctgg 3601 cctcacctgg ccagctcacc ccttttgggt gtaggggaaa agaatgcctg accctgggaa 3661 ggctccctgg tagaatacac cacacttttc aggttgttgc aacacaggtc ctgagttgac 3721 ctctggttca gccaaggacc aaagaaggtg tgtaagtgaa gtggttctca gtccccagac 3781 atgtgcccct ttgctgctgg ctaccactct tccccagagc agcaggcccc gagccccttc 3841 aggcccagca ctgccccaga ctcgctggca ctcagttccc tcatctgtaa aggtgaaggg 3901 tgatgcagga tatgcctgac aggaacagtc tgtggatgga catgatcagt gctaaggaaa 3961 gcagcagaga gagacgtccg gcgccccagc cccactatca gtgtccagcg tgctggttcc 4021 ccagagcaca gctcagcatc acactgacac tcaccctgcc ctgcccctgg ccagagggta 4081 ctgccgacgg cactttgcac tctgatgacc tcaaagcact ttcatggctg ccctctggca 4141 gggcagggca gggcagtgac actgtaggag catagcaagc caggagatgg ggtgaaggga 4201 cacagtcttg agctgtccac atgcatgtga ctcctcaaac ctcttccaga tttctctaag 4261 aatagcaccc ccttccccat tgccccagct tagcctcttc tcccagggga gctactcagg 4321 actcacgtag cattaaatca gctgtgaatc gtcagggggt gtctgctagc ctcaacctcc 4381 tggggcaggg gacgccgaga ctccgtggga gaagctcatt cccacatctt gccaagacag 4441 cctttgtcca gctgtccaca ttgagtcaga ctgctcccgg ggagagagcc ccggccccca 4501 gcacataaag aactgcagcc ttggtactgc agagtctggg ttgtagagaa ctctttgtaa 4561 gcaataaagt ttggggtgat gacaaatgtt aaaaaa // LOCUS HSNIPSNA1 2233 bp mRNA PRI 10-JAN-1998 DEFINITION Homo sapiens mRNA for NIPSNAP1 protein. ACCESSION AJ001258 NID g2769648 KEYWORDS NIPSNAP1. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2233) AUTHORS Seroussi,E., Pan,H.Q., Kedra,D., Roe,B. and Dumanski,J.P. TITLE Characterization of the human NIPSNAP gene from 22q12: A member of a novel gene family JOURNAL Unpublished REFERENCE 2 (bases 1 to 2233) AUTHORS Seroussi,E. TITLE Direct Submission JOURNAL Submitted (21-AUG-1997) Seroussi E., Karolinska Hospital, Department of Molecular Medicine, CMM Building-L8:00, S-171 76 Stockholm, SWEDEN FEATURES Location/Qualifiers source 1..2233 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="22" /map="q12" exon <1..352 /number=1 /evidence=experimental misc_feature 215 /note="putative transcription start" gene 255..1109 /gene="nipsnap1" CDS 255..1109 /gene="nipsnap1" /codon_start=1 /product="NIPSNAP1 protein" /db_xref="PID:e1231231" /db_xref="PID:g2769649" /translation="MAPRLCSISVTARRLLGGPGPRAGDVASAAAARFYSKDNEGSWF RSLFVHKVDPRKDAHSTLLSKKETSNLYKIQFHNVKPEYLDAYNSLTEAVLPKLHLDE DYPCSLVGNWNTWYGEQDQAVHLWRFSGGYPALMDCMNKLKNNKEYLEFRRERTQMLL SRRNQLLLEFSFWNEPQPRMGPNIYELRTYKLKPGTMIEWGNNWARAIKYRQENQEAV GGFFSQIGELYVVHHLWAYKDLQSREETRNAAWRKRGWDENVYYTVPLVRHMESRIMI PLKISPLQ" exon 353..480 /gene="nipsnap1" /number=2 /evidence=experimental exon 481..526 /gene="nipsnap1" /number=3 /evidence=experimental exon 527..621 /gene="nipsnap1" /number=4 /evidence=experimental exon 622..692 /gene="nipsnap1" /number=5 /evidence=experimental exon 693..833 /gene="nipsnap1" /number=6 /evidence=experimental variation 758 /gene="nipsnap1" /replace="c" exon 834..865 /gene="nipsnap1" /number=7 /evidence=experimental exon 861..1044 /gene="nipsnap1" /number=9 /evidence=experimental exon 866..960 /gene="nipsnap1" /number=8 /evidence=experimental exon 1045..2233 /number=10 /evidence=experimental polyA_signal 2218..2223 BASE COUNT 530 a 603 c 618 g 482 t ORIGIN 1 tcccacgctc ccattttacc gaggtggatc tgaggcccag agtgggcaaa gggcttgcca 61 aggtcgcaga gcgcgcagag gattagaacc ctgattccgt ccagcacctg cggagttgga 121 gacgcccaca ggaaacgccc acctgaccgg gtgagtgagg gggcgggcga gccggagggg 181 tggagtttat tcgcgccccg ctccgccacc agggcttggg ggcggggcct tcctgcaacc 241 tttgcggctc caacatggct ccgcggctgt gcagcatctc tgtgacggcg cggcggctgc 301 tggggggccc ggggcctcgc gctggggacg ttgcgtctgc agctgcggcg cgtttctatt 361 ccaaggacaa tgaaggcagc tggttccgct ccctctttgt tcacaaagtg gatccccgga 421 aggatgccca ctccaccctg ctgtccaaga aggaaaccag caacctctat aagatccagt 481 ttcacaatgt aaagcctgaa tacctggatg cctacaacag cctcacggag gctgtgctgc 541 ccaagcttca cctggatgag gactacccat gctcactcgt gggcaactgg aacacgtggt 601 atggggagca ggaccaggca gtgcacctgt ggcgattctc aggtggctac ccagccctca 661 tggactgcat gaacaagctc aaaaacaata aggagtacct ggagttccga agggagcgga 721 cgcagatgct gctgtccagg agaaaccagc tgctccttga gttcagcttc tggaatgagc 781 cacagcccag aatgggtccc aacatctatg agctgaggac atacaagctc aagccaggaa 841 ccatgatcga gtgggggaac aactgggctc gggccatcaa gtaccggcag gagaaccagg 901 aggcagtggg cggcttcttc tcacagatag gagagctcta cgtggtgcac catctctggg 961 cctataaaga cctgcagtct cgggaggaga ctcgaaacgc tgcctggagg aagagaggct 1021 gggatgaaaa tgtctactat acagtccccc tggtgcgaca catggagtct aggatcatga 1081 tccccttgaa gatctcgcct ctgcagtgat gctgcctaca cctccacctc cttccccttc 1141 tccctcaggc aaccccgaca gacagtggca ctcctggtcc tgctgtcttg gttttgtctt 1201 ggctctggga ggactctgag gggcagtgct cagttcagac aagggggaac tgaaggctga 1261 caagttctga ggattacagc tctgcccagg ccctttctac tttccctgcc tgcctccctc 1321 cccctgctag aagtagtttc tgatttccct gaatgaaaga tagtgatctt ttaggccttc 1381 atatatcttc taaaattcct catctcaagg ccaccagtcc ccagttatca cccttggaaa 1441 tcctctgttt agcttagttg cacaaaaggt agtatgtcca gtggctagag gtggggtggg 1501 aagagaggag gccaggccag gccctcagga attccctggt ttccactggg ccacccactt 1561 gctggacagt ctgagccaag tccctcctgc tggagaaaag cctggaactg ccaaaacaaa 1621 aaaacaaata caaacaaacc tcgtcaccac ggtcaggagg aggttcagac attctgccca 1681 gaggggctgg ggaaagggat gggagccagg aaagatttag gggcaggaga cagacacgta 1741 acaggaactc ggaaggtaga acttagaatt cagaacttgg gtttcaattg tagaatttgg 1801 aatctggaat acagaattgg gaagaagaac agagatgaca aaagaccact tgaagagagg 1861 ttcctaatac tagacagatg gagagtggct tggctgtttc cccctgcata ttggggttga 1921 aaacttaaat gtgtagttcc agtccttcta tgcagagtct ctcctgccct ccctccttcc 1981 tgcaggcaaa tatacaagct agaccctgtt ccctcaccct gtatcctgtc tcccctaatt 2041 gacattctat cagcccaacc ctcattctgc agggccacct ggtgtgtgga ggtggcggga 2101 ggggtcagtg actctggggg tctcctcagt gacccccata cctcatcatg aacccttggg 2161 tagcttgatt ctgcctacca tccaggaaaa cttgtgctgg aattgctgat ggaaacaatt 2221 aaatatttac tat // LOCUS HSNIPSNA2 1978 bp mRNA PRI 09-JAN-1998 DEFINITION Homo sapiens mRNA for NIPSNAP2 protein. ACCESSION AJ001259 NID g2769253 KEYWORDS NIPSNAP2. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1978) AUTHORS Seroussi,E., Pan,H.Q., Kedra,D., Roe,B. and Dumanski,J.P. TITLE Characterization of the human NIPSNAP gene from 22q12: A member of a novel gene family JOURNAL Unpublished REFERENCE 2 (bases 1 to 1978) AUTHORS Seroussi,E. TITLE Direct Submission JOURNAL Submitted (21-AUG-1997) Seroussi E., Karolinska Hospital, Department of Molecular Medicine, CMM Building-L8:00, S-171 76 Stockholm, SWEDEN FEATURES Location/Qualifiers source 1..1978 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" /map="near D7S499" gene 4..861 /gene="nipsnap2" CDS 4..861 /gene="nipsnap2" /codon_start=1 /product="NIPSNAP2 protein" /db_xref="PID:e1231233" /db_xref="PID:g2769254" /translation="MAARVLRAAEALGRRLLQRAAPCSLLPRLRTWTSSSNRSREDSW LKSLFVRKVDPRKDAHSNLLAKKETSNLYKLQFHNVKPECLEAYNKICQEVLPKIHED KHYPCTLVGTWNTWYGEQDQAVHLWRYEGGYPALTEVMNKLRENKEFLEFRKARSDML LSRKNQLLLEFSFWNEPVPRSGPNIYELRSYQLRPGTMIEWGNYWARAIRFRQDGNEA VGGFFSQIGQLYMVHHLWAYRDLQTREDIRNAAWHKHGWEELVYYTVPLIQEMESRIM IPLKTSPLQ" BASE COUNT 645 a 400 c 397 g 536 t ORIGIN 1 aagatggcgg cgcgagtgct gcgcgccgcg gaggccctgg gccggcggct cctgcagcgg 61 gcggccccct gcagcctcct gcccaggctc cggacatgga catcttccag caacagatct 121 cgagaagaca gctggctaaa atccttattt gtccggaaag ttgatccaag aaaagatgcc 181 cactccaatc tcctagccaa aaaggaaaca agcaatctat acaaattaca gtttcacaat 241 gttaaaccgg aatgcctaga agcatacaac aaaatttgtc aagaggtgtt gccaaagatt 301 cacgaagata aacactaccc ttgtactttg gtggggactt ggaacacgtg gtatggcgag 361 caggaccaag ctgtccacct ctggaggtat gaaggaggct atccagccct cacagaagtc 421 atgaataaac tcagagaaaa taaggaattt ttggaatttc gtaaggcaag aagtgacatg 481 cttctctcca ggaagaatca gctcctgttg gagttcagtt tctggaatga gcctgtgcca 541 agatccggac ctaatatata tgaactcagg tcttaccaac tccgaccagg aaccatgatt 601 gaatggggca attactgggc tcgtgcaatc cgcttcagac aggatggtaa cgaagccgtc 661 ggaggattct tctctcagat tgggcagctg tacatggtgc accatctttg ggcttacagg 721 gatcttcaga ccagggaaga catacggaat gcagcatggc acaaacatgg ctgggaggaa 781 ttggtatatt acacagttcc acttattcag gaaatggaat ccagaatcat gatcccactg 841 aagacctcgc ccctccagta aagctgtaga gtttctatgt gcctacatac atttctgtga 901 caagtatttg tcgtaaatta attttaattg tgtatcaagt gaaaaagaaa cactgaggtt 961 ttaagctgct gtatatagct tgtgagaaac ctcttttctt taaaatttac ataatcacaa 1021 gaaaggaaag aattacagtt ggactgattg tgacagtgcc ttgtcgtcct ctttgaaaca 1081 ccccgtgttg tccagtatac cttataacac ttagccactt ctccccaccc tccagaaggg 1141 gtccacgttg aattctgaat catcttgaaa ataagattcc aaccacaaaa aaaatttagc 1201 catttcttta ctaaaaaaaa ccaaaaaaca aatctgtttt ataatcacag atttttagac 1261 aaatttcttg tatcaggaag aaatacaaat tttgtcatgt ttctcaagca gtttttctga 1321 gtagtttctg aggaggaaca aattacaagt gtacccaata actgaaaatg ttttaactca 1381 ctctcatttg taagcagtcc acatagtaga caatgggttt tccaagctgg ccaagctaca 1441 caaaataaat aaaacaagta aacaacatgc acagcaagga ttcaaaggga gaaacaaaaa 1501 aaacaaaaac aaacgcgtga aacatatgta acatagaaaa ttttgaaaga acaccttgaa 1561 tagccactaa tttttatttg tggtattttt ctataacaaa acaagtagct ctaggaaaag 1621 aggttttatt ttgtaaacga tcatttgtga cctcagacac tctctggcta atattttaat 1681 aagctcacag cagataattc tgagatcatg ggtgaggggt ggtgcatgtt gagatttaaa 1741 ttggcataaa gctgcatact ttttgtctag ctgtttgatt tcatttttta atatagtatg 1801 ccaattttgt gactgttacc atgtgaaagt cctgttgaaa tgaacaattg tctgccccac 1861 aatcaagaat gtatgtgtaa agtgtgaata aatctcatat caaatgtcaa acttttacat 1921 gtgaatgatt ttctcaaaga acatagaaaa gtcaataaaa tcctcttaat ttccacat // LOCUS HSNKG2D 1770 bp RNA PRI 07-MAY-1991 DEFINITION Human mRNA for NKG2-D gene. ACCESSION X54870 NID g35062 KEYWORDS cell surface receptor; lectin; supergene family. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1770) AUTHORS Houchins,J.P. TITLE Direct Submission JOURNAL Submitted (16-OCT-1990) Houchins J.P., University of Minnesota, Immunobiology Research Center, Box 724 UMHC, Minneapolis MN 55455, U S A REFERENCE 2 (bases 1 to 1770) AUTHORS Houchins,J.P., Yabe,T., McSherry,C. and Bach,F.H. TITLE DNA sequence analysis of NKG2, a family of related cDNA clones encoding type II integral membrane proteins on human natural killer cells JOURNAL J. Exp. Med. 173 (4), 1017-1020 (1991) MEDLINE 91178434 COMMENT See - for associated sequences. FEATURES Location/Qualifiers source 1..1770 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="NK B22" /chromosome="12" mRNA 1..1770 /gene="NKG2-D gene" /note="Type II integral membrane protein" gene 1..1770 /gene="NKG2-D gene" CDS 339..989 /gene="NKG2-D gene" /note="Type II integral membrane protein" /codon_start=1 /db_xref="PID:g35063" /db_xref="SWISS-PROT:P26718" /translation="MGWIRGRRSRHSWEMSEFHNYNLDLKKSDFSTRWQKQRCPVVKS KCRENASPFFFCCFIAVAMGIRFIIMVAIWSAVFLNSLFNQEVQIPLTESYCGPCPKN WICYKNNCYQFFDESKNWYESQASCMSQNASLLKVYSKEDQDLLKLVKSYHWMGLVHI PTNGSWQWEDGSILSPNLLTIIEMQKGDCALYASSFKGYIENCSTPNTYICMQRTV" misc_feature 339..491 /gene="NKG2-D gene" /note="intracellular protein" misc_feature 492..584 /gene="NKG2-D gene" /note="transmembrane segment" misc_feature 585..986 /gene="NKG2-D gene" /note="extracellular segment" misc_feature 633..986 /gene="NKG2-D gene" /note="C-type lectin domain" polyA_signal 1720..1725 /gene="NKG2-D gene" BASE COUNT 556 a 358 c 388 g 468 t ORIGIN 1 gaggagtgga ttacatattc caacagttgt tattacattg gtaaggaaag aagaacttgg 61 gaagaaagag tttgctggcc tgtgcttcga agaactctga tctgctttct atagataatg 121 aggaagaaat ggtatgtgtg gggacttccc agttggctgt aagttgccat ttgaactaaa 181 cgaaatagat caggaactga ggacatatct aaattttcta gttttataga aggcttttat 241 ccacaagaat caagatcttc cctctctgag caggaatcct ttgtgcattg aagactttag 301 attcctctct gcggtagacg tgcacttata agtatttgat ggggtggatt cgtggtcgga 361 ggtctcgaca cagctgggag atgagtgaat ttcataatta taacttggat ctgaagaaga 421 gtgatttttc aacacgatgg caaaagcaaa gatgtccagt agtcaaaagc aaatgtagag 481 aaaatgcatc tccatttttt ttctgctgct tcatcgctgt agccatggga atccgtttca 541 ttattatggt agcaatatgg agtgctgtat tcctaaactc attattcaac caagaagttc 601 aaattccctt gaccgaaagt tactgtggcc catgtcctaa aaactggata tgttacaaaa 661 ataactgcta ccaatttttt gatgagagta aaaactggta tgagagccag gcttcttgta 721 tgtctcaaaa tgccagcctt ctgaaagtat acagcaaaga ggaccaggat ttacttaaac 781 tggtgaagtc atatcattgg atgggactag tacacattcc aacaaatgga tcttggcagt 841 gggaagatgg ctccattctc tcacccaacc tactaacaat aattgaaatg cagaagggag 901 actgtgcact ctatgcctcg agctttaaag gctatataga aaactgttca actccaaata 961 catacatctg catgcaaagg actgtgtaaa gatgatcaac catctcaata aaagccagga 1021 acagagaaga gattacacca gcggtaacac tgccaaccga gactaaagga aacaaacaaa 1081 aacaggacaa aatgaccaaa gactgtcaga tttcttagac tccacaggac caaaccatag 1141 aacaatttca ctgcaaacat gcatgattct ccaagacaaa agaagagaga tcctaaaggc 1201 aattcagata tccccaaggc tgcctctccc accacaagcc cagagtggat gggctggggg 1261 aggggtgctg ttttaatttc taaaggtagg accaacaccc aggggatcag tgaaggaaga 1321 gaaggccagc agatcagtga gagtgcaacc ccaccctcca caggaaattg cctcatgggc 1381 agggccacag cagagagaca cagcatgggc agtgccttcc ctgcctgtgg gggtcatgct 1441 gccactttta atgggtcctc cacccaacgg ggtcagggag gtggtgctgc cccagtgggc 1501 catgattatc ttaaaggcat tattctccag ccttaagatc ttaggacgtt tcctttgcta 1561 tgatttgtac ttgcttgagt cccatgactg tttctcttcc tctctttctt ccttttggaa 1621 tagtaatatc catcctatgt ttgtcccact attgtatttt ggaagcacat aacttgtttg 1681 gtttcacagg ttcacagtta agaaggaatt ttgcctctga ataaatagaa tcttgagtct 1741 catgcaaaaa aaaaaaaaaa aaaaaaaaaa // LOCUS HSNM23H1A 576 bp RNA PRI 30-JUN-1993 DEFINITION H.sapiens NM23-H1 mRNA. ACCESSION X73066 S54233 NID g312823 KEYWORDS metastasis-suppressing protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 576) AUTHORS Wang,L., Patel,U., Ghosh,L., Chen,H.C. and Banerjee,S. TITLE Mutation in the nm23 gene is associated with metastasis in colorectal cancer JOURNAL Cancer Res. 53 (4), 717-720 (1993) MEDLINE 93153759 REMARK Erratum:[Cancer Res 1993 Aug 1;53(15):3652]] FEATURES Location/Qualifiers source 1..576 /organism="Homo sapiens" /db_xref="taxon:9606" gene 85..543 /gene="NM23H1" CDS 85..543 /gene="NM23H1" /codon_start=1 /db_xref="PID:g312824" /db_xref="SWISS-PROT:P15531" /translation="MANCERTFIAIKPDGVQRGLVGEIIKRFEQKGFRLVGLKFMQAS EDLLKEHYVDLKDRPFFAGLVKYMHSGPVVAMVWEGLNVVKTGRVMLGETNPADSKPG TIRGDFCIQVGRNIIHGSDSVESAEKEIGLWFHPEELVDYTSCAQNWIYE" mutation 342..405 /gene="NM23H1" /note="60 bp deletion in mutant" BASE COUNT 136 a 134 c 177 g 129 t ORIGIN 1 tgctgcgaac cacgtgggtc ccgggcgcgt ttcgggtgct ggcggctgca gccggagttc 61 aaacctaagc agctggaagg aaccatggcc aactgtgagc gtaccttcat tgcgatcaaa 121 ccagatgggg tccagcgggg tcttgttgga gagattatca agcgttttga gcagaaagga 181 ttccgccttg tgggtctgaa attcatgcaa gcttccgaag atcttctcaa ggaacactac 241 gttgacctga aggaccgtcc attctttgcc ggcctggtga aatacatgca ctcagggccg 301 gtagttgcca tggtctggga ggggctgaat gtggtgaaga cgggccgagt catgctcggg 361 gagaccaacc ctgcagactc caagcctggg accatccgtg gagacttctg catacaagtt 421 ggcaggaaca ttatacatgg cagtgattct gtggagagtg cagagaagga gatcggcttg 481 tggtttcacc ctgaggaact ggtagattac acgagctgtg ctcagaactg gatctatgaa 541 tgacaggagg gcagaccaca ttgcttttca catcca // LOCUS HSNM23H4 879 bp RNA PRI 22-APR-1997 DEFINITION H.sapiens mRNA for nucleoside-diphosphate kinase. ACCESSION Y07604 NID g1945761 KEYWORDS nm23-H4 gene; nucleoside-diphosphate kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 879) AUTHORS Milon,L., Rousseau-Merck,M.F., Munier,A., Erent,M., Lascu,I., Capeau,J. and Lacombe,M.L. TITLE nm23-H4, a new member of the family of human nm23/nucleoside diphosphate kinase genes localised on chromosome 16p13 JOURNAL Hum. Genet. 99 (4), 550-557 (1997) MEDLINE 97254440 REFERENCE 2 (bases 1 to 879) AUTHORS Lacombe,M. TITLE Direct Submission JOURNAL Submitted (21-AUG-1996) M. Lacombe, INSERM U402, Faculte de Medecine Saint-Antoine, 27 rue Chaligny, F- 75012 Paris, FRANCE COMMENT Related sequences: T70747, R17745 & T50922. FEATURES Location/Qualifiers source 1..879 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="stomach" /clone_lib="lambda gt11" /clone="1.1" /map="16p13.3" gene 12..575 /gene="nm23-H4" CDS 12..575 /gene="nm23-H4" /EC_number="2.7.4.6" /codon_start=1 /product="nucleoside-diphosphate kinase" /db_xref="PID:e291906" /db_xref="PID:g1945762" /translation="MGGLFWRSALRGLRCGPRAPGPSLLVRHGSGGPSWTRERTLVAV KPDGVQRRLVGDVIQRFERRGFTLVGMKMLQAPESVLAEHYQDLRRKPFYPALIRYMS SGPVVAMVWEGYNVVRASRAMIGHTDSAEAAPGTIRGDFSVHISRNVIHASDSVEGAQ REIQLWFQSSELVSWADGGQHSSIHPA" BASE COUNT 168 a 291 c 267 g 153 t ORIGIN 1 ggccgggcgt catgggcggc ctcttctggc gctccgcgct gcgggggctg cgctgcggcc 61 cgcgggcccc gggcccgagc ctgctagtgc gccacggctc gggagggccc tcctggaccc 121 gggagcggac cctggtggcg gtgaagcccg atggcgtgca acggcggctc gttggggacg 181 tgatccagcg ctttgagagg cggggcttca cgctggtggg gatgaagatg ctgcaggcac 241 cagagagcgt ccttgccgag cactaccagg acctgcggag gaagcccttc taccctgccc 301 tcatccgcta catgagctct gggcctgtgg tggccatggt ctgggaaggg tacaatgtcg 361 tccgcgcctc aagggccatg attggacaca ccgactcggc tgaggctgcc ccaggaacca 421 taaggggtga cttcagcgtc cacatcagca ggaatgtcat ccacgccagc gactccgtgg 481 agggggccca gcgggagatc cagctgtggt tccagagcag tgagctggtg agctgggcag 541 acgggggcca gcacagcagc atccacccag cctgaggctc aagctgccct taccacccca 601 tcccccacgc aggaccaact acctccgtca gcaagaaccc aagcccacat ccaaacctgc 661 ctgtcccaaa ccacttactt ccctgttcac ctctgcccca ccccagccca gaggagtttg 721 agccaccaac ttcagtgcct ttctgtaccc caagccagca caagattgga ccaatccttt 781 ttgcaccaaa gtgccggaca acctttgtgg tggggggggg tcttcacatt atcataacct 841 ctcctctaaa ggggaggcat taaaattcac tgtgcccag // LOCUS HSNMB 2669 bp RNA PRI 09-FEB-1995 DEFINITION H.sapiens NMB mRNA. ACCESSION X76534 NID g666042 KEYWORDS NMB gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2669) AUTHORS Weterman,M. TITLE Direct Submission JOURNAL Submitted (03-DEC-1993) M. Weterman, University of Nijmegen, Dept of Biochemistry, PO Box 9101, 6500 HB Nijmegen, NETHERLANDS REFERENCE 2 (bases 1 to 2669) AUTHORS Weterman,M.A., Ajubi,N., van Dinter,I.M., Degen,W.G., van Muijen,G.N., Ruitter,D.J. and Bloemers,H.P. TITLE nmb, a novel gene, is expressed in low-metastatic human melanoma cell lines and xenografts JOURNAL Int. J. Cancer 60 (1), 73-81 (1995) MEDLINE 95113576 FEATURES Location/Qualifiers source 1..2669 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="MV1" /clone="NMB" /clone_lib="MV1" gene 92..1774 /gene="NMB" CDS 92..1774 /gene="NMB" /codon_start=1 /db_xref="PID:g666043" /translation="MECLYYFLGFLLLAARLPLDAAKRFHDVLGNERPSAYMREHNQL NGWSSDENDWNEKLYPVWKRGDMRWKNSWKGGRVQAVLTSDSPALVGSNITFAVNLIF PRCQKEDANGNIVYEKNCRNEAGLSADPYVYNWTAWSEDSDGENGTGQSHHNVFPDGK PFPHHPGWRRWNFIYVFHTLGQYFQKLGRCSVRVSVNTANVTLGPQLMEVTVYRRHGR AYVPIAQVKDVYVVTDQIPVFVTMFQKNDRNSSDETFLKDLPIMFDVLIHDPSHFLNY STINYKWSFGDNTGLFVSTNHTVNHTYVLNGTFSLNLTVKAAAPGPCPPPPPPPRPSK PTPSLGPAGDNPLELSRIPDENCQINRYGHFQATITIVEGILEVNIIQMTDVLMPVPW PESSLIDFVVTCQGSIPTEVCTIISDPTCEITQNTVCSPVDVDEMCLLTVRRTFNGSG TYCVNLTLGDDTSLALTSTLISVPDRDPASPLRMANSALISVGCLAIFVTVISLLVYK KHKEYNPIENSPGNVVRSKGLSVFLNRAKAVFFPGNQEKDPLLKNQEFKGVS" mat_peptide 155..1771 /gene="NMB" BASE COUNT 752 a 595 c 597 g 725 t ORIGIN 1 cagatgccag aagaacactg ttgctcttgg tggacgggcc cagaggaatt cagagttaaa 61 ccttgagtgc ctgcgtccgt gagaattcag catggaatgt ctctactatt tcctgggatt 121 tctgctcctg gctgcaagat tgccacttga tgccgccaaa cgatttcatg atgtgctggg 181 caatgaaaga ccttctgctt acatgaggga gcacaatcaa ttaaatggct ggtcttctga 241 tgaaaatgac tggaatgaaa aactctaccc agtgtggaag cggggagaca tgaggtggaa 301 aaactcctgg aagggaggcc gtgtgcaggc ggtcctgacc agtgactcac cagccctcgt 361 gggctcaaat ataacatttg cggtgaacct gatattccct agatgccaaa aggaagatgc 421 caatggcaac atagtctatg agaagaactg cagaaatgag gctggtttat ctgctgatcc 481 atatgtttac aactggacag catggtcaga ggacagtgac ggggaaaatg gcaccggcca 541 aagccatcat aacgtcttcc ctgatgggaa accttttcct caccaccccg gatggagaag 601 atggaatttc atctacgtct tccacacact tggtcagtat ttccagaaat tgggacgatg 661 ttcagtgaga gtttctgtga acacagccaa tgtgacactt gggcctcaac tcatggaagt 721 gactgtctac agaagacatg gacgggcata tgttcccatc gcacaagtga aagatgtgta 781 cgtggtaaca gatcagattc ctgtgtttgt gactatgttc cagaagaacg atcgaaattc 841 atccgacgaa accttcctca aagatctccc cattatgttt gatgtcctga ttcatgatcc 901 tagccacttc ctcaattatt ctaccattaa ctacaagtgg agcttcgggg ataatactgg 961 cctgtttgtt tccaccaatc atactgtgaa tcacacgtat gtgctcaatg gaaccttcag 1021 ccttaacctc actgtgaaag ctgcagcacc aggaccttgt ccgccaccgc caccaccacc 1081 cagaccttca aaacccaccc cttctttagg acctgctggt gacaaccccc tggagctgag 1141 taggattcct gatgaaaact gccagattaa cagatatggc cactttcaag ccaccatcac 1201 aattgtagag ggaatcttag aggttaacat catccagatg acagacgtcc tgatgccggt 1261 gccatggcct gaaagctccc taatagactt tgtcgtgacc tgccaaggga gcattcccac 1321 ggaggtctgt accatcattt ctgaccccac ctgcgagatc acccagaaca cagtctgcag 1381 ccctgtggat gtggatgaga tgtgtctgct gactgtgaga cgaaccttca atgggtctgg 1441 gacgtactgt gtgaacctca ccctggggga tgacacaagc ctggctctca cgagcaccct 1501 gatttctgtt cctgacagag acccagcctc gcctttaagg atggcaaaca gtgccctgat 1561 ctccgttggc tgcttggcca tatttgtcac tgtgatctcc ctcttggtgt acaaaaaaca 1621 caaggaatac aacccaatag aaaatagtcc tgggaatgtg gtcagaagca aaggcctgag 1681 tgtctttctc aaccgtgcaa aagccgtgtt cttcccggga aaccaggaaa aggatccgct 1741 actcaaaaac caagaattta aaggagtttc ttaaatttcg accttgtttc tgaagctcac 1801 ttttcagtgc cattgatgtg agatgtgctg gagtggctat taaccttttt ttcctaaaga 1861 ttattgttaa atagatattg tggtttgggg aagttgaatt ttttataggt taaatgtcat 1921 tttagagatg gggagaggga ttatactgca ggcagcttca gccatgttgt gaaactgata 1981 aaagcaactt agcaaggctt cttttcatta ttttttatgt ttcacttata aagtcttagg 2041 taactagtag gatagaaaca ctgtgtcccg agagtaagga gagaagctac tattgattag 2101 agcctaaccc aggttaactg caagaagagg cgggatactt tcagctttcc atgtaactgt 2161 atgcataaag ccaatgtagt ccagtttcta agatcatgtt ccaagctaac tgaatcccac 2221 ttcaatacac actcatgaac tcctgatgga acaataacag gcccaagcct gtggtatgat 2281 gtgcacactt gctagactca gaaaaaatac tactctcata aatgggtggg agtattttgg 2341 tgacaaccta ctttgcttgg ctgagtgaag gaatgatatt catatattca tttattccat 2401 ggacatttag ttagtgcttt ttatatacca ggcatgatgc tgagtgacac tcttgtgtat 2461 atttccaaat ttttgtatag tcgctgcaca tatttgaaat catatattaa gactttccaa 2521 agatgaggtc cctggttttt catggcaact tgatcagtaa ggatttcacc tctgtttgta 2581 actaaaacca tctactatat gttagacatg acattctttt tctctccttc ctgaaaaata 2641 aagtgtggga agagacaaaa aaaaaaaaa // LOCUS HSNMCFL1 1059 bp RNA PRI 03-DEC-1996 DEFINITION H.sapiens mRNA for non-muscle type cofilin. ACCESSION X95404 NID g1177470 KEYWORDS cfl1 gene; cofilin; non-muscle cofilin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1059) AUTHORS Gillett,G.T., Fox,M.F., Rowe,P.S., Casimir,C.M. and Povey,S. TITLE Mapping of human non-muscle type cofilin (CFL1) to chromosome 11q13 and muscle-type cofilin (CFL2) to chromosome 14 JOURNAL Ann. Hum. Genet. 60 (Pt 3), 201-211 (1996) MEDLINE 96393663 REFERENCE 2 (bases 1 to 1059) AUTHORS Gillett,G.T. TITLE Direct Submission JOURNAL Submitted (27-JAN-1996) G.T. Gillett, GOS Hospital NHS Trust, Clinical Biochemistry, Great Ormond Street, London, WC1N 3JH, UK FEATURES Location/Qualifiers source 1..1059 /organism="Homo sapiens" /note="induced with DMSO" /db_xref="taxon:9606" /cell_line="HL60" /cell_type="promyelocyte" /chromosome="11" /map="q13" gene 52..552 /gene="cfl1" CDS 52..552 /gene="cfl1" /note="non-muscle type" /codon_start=1 /product="cofilin" /db_xref="PID:e220309" /db_xref="PID:g1177471" /db_xref="SWISS-PROT:P23528" /translation="MASGVAVSDGVIKVFNDMKVRKSSTPEEVKKRKKAVLFCLSEDK KNIILEEGKEILVGDVGQTVDDPYATFVKMLPDKDCRYALYDATYETKESKKEDLVFI FWAPESAPLKSKMIYASSKDAIKKKLTGIKHELQANCYEEVKDRCTLAEKLGGSAVIS LEGKPL" BASE COUNT 220 a 311 c 282 g 246 t ORIGIN 1 gctctcgtct tctgcggctc tcggtgccct ctccttttcg tttccggaaa catggcctcc 61 ggtgtggctg tctctgatgg tgtcatcaag gtgttcaacg acatgaaggt gcgtaagtct 121 tcaacgccag aggaggtgaa gaagcgcaag aaggcggtgc tcttctgcct gagtgaggac 181 aagaagaaca tcatcctgga ggagggcaag gagatcctgg tgggcgatgt gggccagact 241 gtcgacgatc cctacgccac ctttgtcaag atgctgccag ataaggactg ccgctatgcc 301 ctctatgatg caacctatga gaccaaggag agcaagaagg aggatctggt gtttatcttc 361 tgggcccccg agtctgcgcc ccttaagagc aaaatgattt atgccagctc caaggacgcc 421 atcaagaaga agctgacagg gatcaagcat gaattgcaag caaactgcta cgaggaggtc 481 aaggaccgct gcaccctggc agagaagctg gggggcagtg cggtcatctc cctggagggc 541 aagcctttgt gagccccttc tggccccctg cctggagcat ctggcagccc cacacctgcc 601 cttgggggtt gcaggctgcc cccttcctgc cagaccggag gggctggggg gatcccagca 661 gggggaggca atcccttcac cccagttgcc aaacagaccc cccaccccct ggattttcct 721 tctccctcca tcccttgacg gttctggcct tcccaaactg cttttgatct tttgattcct 781 cttgggctga agcagaccaa gttcccccca ggcaccccag ttgtggggga gcctgtattt 841 tttttaacaa catccccatt ccccacctgg tcctccccct tcccatgctg ccaacttcta 901 accgcaatag tgactctgtg cttgtctgtt tagttctgtg tataaatgga atgttgtgga 961 gatgacccct ccctgtgccg gctggttcct ctcccttttc ccctggtcac ggctactcat 1021 ggaagcagga ccagtaaggg accttcgatt aaaaaaaaa // LOCUS HSNMTDC 2102 bp RNA PRI 27-MAR-1995 DEFINITION Human mRNA for NAD-dependent methylene tetrahydrofolate dehydrogenase cyclohydrolase (EC 1.5.1.15). ACCESSION X16396 NID g35070 KEYWORDS methylenetetrahydrofolate dehydrogenase; methylenetetrahydrofolate dehydrogenase (NAD+). SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2102) AUTHORS MacKenzie,R.E. TITLE Direct Submission JOURNAL Submitted (22-AUG-1989) Mackenzie R.E., McGill University, Dept of Biochemistry, 3655 Drummond St # 817, Montreal P Q, Canada H3G 1Y6 REFERENCE 2 (bases 1 to 2102) AUTHORS Peri,K.G., Belanger,C. and Mackenzie,R.E. TITLE Nucleotide sequence of the human NAD-dependent methylene tetrahydrofolate dehydrogenase-cyclohydrolase JOURNAL Nucleic Acids Res. 17 (21), 8853 (1989) MEDLINE 90067849 FEATURES Location/Qualifiers source 1..2102 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="LS 180" /clone_lib="LS 180 cDNA" /clone="30 EB 34." CDS 16..1050 /note="precursor polypeptide (AA -29 to 315)" /codon_start=1 /db_xref="PID:g35071" /db_xref="SWISS-PROT:P13995" /translation="MSALAARLLQPAHSCSLRLRPFHLAAVRNEAVVISGRKLAQQIK QEVRQEVEEWVASGNKRPHLSVILVGENPASHSYVLNKTRAAAVVGINSETIMKPASI SEEELLNLINKLNNDDNVDGLLVQLPLPEHIDERRICNAVSPDKDVDGFHVINVGRMC LDQYSMLPATPWGVWEIIKRTGIPTLGKNVVVAGRSKNVGMPIAMLLHTDGAHERPGG DATVTISHRYTPKEQLKKHTILADIVISAAGIPNLITADMIKEGAAVIDVGINRVHDP VTAKPKLVGDVDFEGVRQKAGYITPVPGGVGPMTVAMLMKNTIIAAKKVLRLEEREVL KSKELGVATN" transit_peptide 16..102 /note="transit peptide (AA -29 to -1)" mat_peptide 103..1047 /note="mature polypeptide (AA 1-315)" misc_feature 2067..2073 /note="polyA signal" BASE COUNT 604 a 391 c 471 g 636 t ORIGIN 1 cctgcgactt ctctaatgtc tgctttggct gcccggctgc tgcagcccgc gcacagctgc 61 tcccttcgcc ttcgcccttt ccacctcgcg gcagttcgaa atgaagctgt tgtcatttct 121 ggaaggaaac tggcccagca gatcaagcag gaagtgcggc aggaggtaga agagtgggtg 181 gcctcaggca acaaacggcc acacctgagt gtgatcctgg ttggcgagaa tcctgcaagt 241 cactcctatg tcctcaacaa aaccagggca gctgcagttg tgggaatcaa cagtgagaca 301 attatgaaac cagcttcaat ttcagaggaa gaattgttga atttaatcaa taaactgaat 361 aatgatgata atgtagatgg cctccttgtt cagttgcctc ttccagagca tattgatgag 421 agaaggatct gcaatgctgt ttctccagac aaggatgttg atggctttca tgtaattaat 481 gtaggacgaa tgtgtttgga tcagtattcc atgttaccgg ctactccatg gggtgtgtgg 541 gaaataatca agcgaactgg cattccaacc ctagggaaga atgtggttgt ggctggaagg 601 tcaaaaaacg ttggaatgcc cattgcaatg ttactgcaca cagatggggc gcatgaacgt 661 cccggaggtg atgccactgt tacaatatct catcgatata ctcccaaaga gcagttgaag 721 aaacatacaa ttcttgcaga tattgtaata tctgctgcag gtattccaaa tctgatcaca 781 gcagatatga tcaaggaagg agcagcagtc attgatgtgg gaataaatag agttcacgat 841 cctgtaactg ccaaacccaa gttggttgga gatgtggatt ttgaaggagt cagacaaaaa 901 gctgggtata tcactccagt tcctggaggt gttggcccca tgacagtggc aatgctaatg 961 aagaatacca ttattgctgc aaaaaaggtg ctgaggcttg aagagcgaga agtgctgaag 1021 tctaaagagc ttggggtagc cactaattaa ctactgtgtc ttctgtgtca caaacagcac 1081 tccaggccag ctcaagaagc aaagcaggcc aatagaaatg caatattttt aatttattct 1141 actgaaatgg tttaaaatga tgccttgtat ttattgaaag cttaaatggg tgggtgtttc 1201 tgcacatacc tctgcagtac ctcaccaggg agcattccag tatcatgcag ggtcctgtga 1261 tctagccagg agcagccatt aacctagtga ttaatatggg agacattacc atatggagga 1321 tggatgcttc actttgtcaa gcacctcagt tacacattcg ccttttctag gattgcattt 1381 cccaagtgct attgcaataa cagttgatac tcattttagg taccagacct tttgagttca 1441 actgatcaaa ccaaaggaaa agtgttgcta gagaaaattg gggaaaaggt gaaaaagaaa 1501 aaatggtagt aattgagcag aaaaaaatta atttatatat gtattgattg gcaaccagat 1561 ttatctaagt agaactgaat tggctaggaa aaaagaaaaa ctgcatgtta atcattttcc 1621 taagctgtcc ttttgaggct tagtcagttt attgggaaaa tgtttaggat tattccttgc 1681 tattagtact cattttatgt atgttaccct tcagtaagtt ctccccattt tagttttcta 1741 ggactgaaag gattcttttc tacattatac atgtgtgttg tcatatttgg cttttgctat 1801 atactttaac ttcattgtta aatttttgta ttgtatagtt tctttggtgt atcttaaaac 1861 ctatttttga aaaacaaagt tggcttgata atcatttggg cagcttgggt aagtacgcaa 1921 cttacttttc caccaaagaa ctgtcaccac ctgcctgctt ttctgtgatg tatgtatcct 1981 gttgactttt ccagaaattt tttaagagtt tgagttacta ttgaatttaa tcagactttc 2041 tgattaaagg gttttctttc ttttttaata aaacactctg tctggtgtgg tatgaatttc 2101 tg // LOCUS HSNMYC 8762 bp DNA PRI 24-APR-1993 DEFINITION Human germ line n-myc gene. ACCESSION Y00664 NID g35074 KEYWORDS amplification; n-myc cellular oncogene; oncogene; proto-oncogene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8762) AUTHORS Ibson,J.M. TITLE Direct Submission JOURNAL Submitted (28-APR-1988) Ibson J.M., Dept. of Microbiology and Immunology, Room HSE 453,, University of California at San Francisco,, 3rd and Parnassus, San Francisco,, California 94143, USA REFERENCE 2 (bases 1 to 8762) AUTHORS Ibson,J.M. and Rabbitts,P.H. TITLE Sequence of a germ-line N-myc gene and amplification as a mechanism of activation JOURNAL Oncogene 2 (4), 399-402 (1988) MEDLINE 88202932 COMMENT the sequence represents a non-activated N-myc gene; see X03293-95 and M13241 for activated amplyfied N-myc sequences Data kindly reviewed (09-JUN-1988)) by Ibson J.M. FEATURES Location/Qualifiers source 1..8762 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="lambda SUP-T1.4" /cell_line="T-cell line SUP-T1" /clone_lib="lambda gt10" misc_feature 1395..1399 /note="CTF binding site" misc_feature 1696..1700 /note="CTF binding site" GC_signal 1730..1735 GC_signal 1799..2004 TATA_signal 1828..1836 CDS 1894..2154 /note="open reading frame 1 (AA 1 - 86)" /codon_start=1 /db_xref="PID:g35075" /translation="MRGAPGNCVGAEQALARRKRAQTVAIRGHPRPPGPPGDTRAESP PDPLQSAGGKEQGLQTARRPGKRRAPGQGKPWTGLRRAHRAP" TATA_signal 2152..2157 /note="pot. alt. TATA-box" intron 2361..3254 /number=1 exon 3255..4161 /number=2 RBS 3390..3395 /note="pot. ribosome binding site" CDS join(3396..4161,6798..7402) /codon_start=1 /product="N-myc protein" /db_xref="PID:g35076" /db_xref="SWISS-PROT:P04198" /translation="MPGMICKNPDLEFDSLQPCFYPDEDDFYFGGPDSTPPGEDIWKK FELLPTPPLSPSRGFAEHSSEPPSWVTEMLLENELWGSPAEEDAFGLGGLGGLTPNPV ILQDCMWSGFSAREKLERAVSEKLQHGRGPPTAGSTAQSPGAGAASPAGRGHGGAAGA GRAGAALPAELAHPAAECVDPAVVFPFPVNKREPAPVPAAPASAPAAGPAVASGAGIA APAGAPGVAPPRPGGRQTSGGDHKALSTSGEDTLSDSDDEDDEEEDEEEEIDVVTVEK RRSSSNTKAVTTFTITVRPKNAALGPGRAQSSELILKRCLPIHQQHNYAAPSPYVESE DAPPQKKIKSEASPRPLKSVVPPKAKSLSPRNSDSEDSERRRNHNILERQRRNDLRSS FLTLRDHVPELVKNEKAAKVVILKKATEYVHSLQAEEHQLLLEKEKLQARQQQLLKKI EHARTC" intron 4162..6797 /number=2 exon 6798..8295 /number=3 polyA_signal 8289..8294 polyA_site 8295 BASE COUNT 1977 a 2256 c 2408 g 2121 t ORIGIN 1 ttcagcagga gggcttggtg gcgattaggg ggctagggaa aagcagaagt ggggctgtaa 61 aaggcttttg caaagatgaa ggcttttaca tttgtgaggg aaataacaat aataaaggtg 121 cttggaaggg ctgctcatcc gcattcatgg gcctcttcct agaactctgg gaatctttct 181 ccctagagct ctgtaggcgt ccagggtcag ggctggaaag gccttcacaa atgctaatct 241 gtgagcattt tatgggttgc cccctgaccc tcagcatcct cctccttaac tagatccaat 301 ggtgaggtga agaggtgggg gcgatggaga atggcatttc cagtttggag aaaaaagtag 361 ggagaggcgg ggaagccttc ccaccccaat cccagggcca ggttcccctc tctccaaaca 421 agcctaagga atgtttttca aggcaaaatc ccacctcact gtgtatttgg ttgtgaaggg 481 ggctgtccag gagatcaggg ttgtttttgg agagtttgga gggcaggggc aataatgtct 541 gaccccctgc aaacaaatac acctgtaaga aaagctagac cagagtgaag taaagccagg 601 atttgatggt ggcgggaagg gaaggggcca atgctgctgc tggacagagg ggtgttattt 661 tccccagaag aatagcatgt cgtggtcatc atcataataa tagctgacac aactattatg 721 ggctggggac ttataagagc tttcataact ctcaggtagg aggcactacg aactccattt 781 cacaattgag gaaactgagg cacaaagaaa cgaagtagtg acagcagctt tcctttttcc 841 cttcaacagt ttggaatcaa gctgtttgag ccgaggctgg gtctcaggag gtgtggacag 901 cacttccatc tcggtaaatc agattcgcct gcccataact ggggagaatg gtggctttga 961 aaaggttaac ttgggagccc tggggacggc tgcagggaac gtacgcccat tccccctagg 1021 aagcagagcc aggccgcctc ctcctccgca gtggtgagga cgcgtggaca gtcccgcggg 1081 ggcggcggga gacctgcgga gtggccgcac tccaggtccg cgccaaggcc gggcagctcc 1141 gctttctgct cagtctccgc gaggtgtcgc cttcggccga agaaaccacc gcggcgccac 1201 cctcgtagct cgcacttatt tatttattta ttttcaaaca aggggggcgc ccctcttctt 1261 tcaatttgaa actggaaaca tccagaggtc ttgttcctaa gggggcgcgt cctctccctg 1321 ctattttgca ccttcggact agccttcttt cgtaattaca caggagcaac ctccctgcaa 1381 ggccttgctc aacgttggcc tcgcgctcag ctgcacaaca cgcagtcaaa gcgggggctg 1441 ggttagaagc atcggtctcc cctccccaac acacaccccc ggagccctcc gtaatttttt 1501 tttcttttaa tgacaagcaa ttgccaggct cgcagggtgg gtgctgcatt gcaccgctcc 1561 gcgcgcagct ggttctcaga gtgcagccgg tgcaagcccg ggggtccaaa agggcgggag 1621 gagcacaccc tgggcttccc agctttgcag ccttctctct gcaaagaaaa gcaagtggct 1681 tttggcgcga aagccttggc gcctcccctg atttttatgg aaatcaggag ggcggggtaa 1741 agccgctttc ctctcctttc tccctccccc ttgtctgcgc cacagccccc ttctctcccc 1801 gccccccggg tgtgtcagat ttttcagtta ataatatccc ccgagcttca aagcgcaggc 1861 tgtgacagtc atctgtctgg acgcgctggg tggatgcggg gggctcctgg gaactgtgtt 1921 ggagccgagc aagcgctagc caggcgcaag cgcgcacaga ctgtagccat ccgaggacac 1981 ccccgccccc ccggcccacc cggagacacc cgcgcagaat cgcctccgga tcccctgcag 2041 tcggcgggag gtaaggagca gggcttgcaa accgcccggc gcccagggaa gcgacgagcg 2101 ccggggcaag gcaagccctg gacgggattg cgacgtgcgc accgggcgcc ctaatatgcc 2161 cgggggactg tttctgcttc cgaaacaaaa ccatctctgg gttttcccag aaaagccagt 2221 tccagccccg aaggcatcct ggctagagga gacccgccct aatccttttg cagcccttac 2281 cggggggagt aatggcttct gcgaaaagaa attccctcgg ctctagaaga tctgtctgtg 2341 tttgagctgt cggagagccg gtgcgtcccc accccaggct ggggttcttc tccaaagggt 2401 gcccctggag gaagaagagg gggggattag gcagggcgag gccgccgcgg tcgcaatctg 2461 ggtcacggct gctccagctt ggaggagagg cggctctccc ggcgaccctc ctcgcgcggg 2521 cgcccctgcc attcccggga acaggggctc agcctctccc tccctggaag aggacgttgt 2581 cgtgggtttg gaagagcagg ggtgggctta gagagcttcc aattaagcta ttggcaggag 2641 tatccctgca gcgggtgaat gccgaggggc gtttgctcaa atttggggag gggaaggatt 2701 tgtggatatg ggtgtctgtt gttggtctct gtctagagaa aggctttttt ttatttgcaa 2761 agttttctaa atcccctgct atcatttgca ctcctgaggt tgcattttta caaagggggt 2821 agaaggtact ccaaatacca ttcccggtag ctgggtcgga gagcctgggg cttcccctga 2881 gcagccggcc ccacaccgct gcgagtgcgg ttgtctgcgt gctcgtgaga gctagaattc 2941 tgcagccagg aacagccccc tcccccaggc agtgccttgt gtgaatgaaa tggcagtttc 3001 caaagttgcg gagcctcgcc accaccccct gcatctgcat gccccctccc accccctgtc 3061 gtagacagct tgtacacaaa aggagggcgg gagggaggga gcgagaggca caacttcctc 3121 caccttcggg agcagtgggc agagtggggg gcttggaggg aagattgggg aacctggtta 3181 gagggggcgc ccattgccta tcccctcggt ctgccccgtt tgcccaccct ctccggtgtg 3241 tctgtcggtt gcagtgttgg aggtcggcgc cggcccccgc cttccgcgcc ccccacggga 3301 aggaagcacc cccggtatta aaacgaacgg ggcggaaaga agccctcagt cgccggccgg 3361 gaggcgagcc gatgccgagc tgctccacgt ccaccatgcc gggcatgatc tgcaagaacc 3421 cagacctcga gtttgactcg ctacagccct gcttctaccc ggacgaagat gacttctact 3481 tcggcggccc cgactcgacc cccccggggg aggacatctg gaagaagttt gagctgctgc 3541 ccacgccccc gctgtcgccc agccgtggct tcgcggagca cagctccgag cccccgagct 3601 gggtcacgga gatgctgctt gagaacgagc tgtggggcag cccggccgag gaggacgcgt 3661 tcggcctggg gggactgggt ggcctcaccc ccaacccggt catcctccag gactgcatgt 3721 ggagcggctt ctccgcccgc gagaagctgg agcgcgccgt gagcgagaag ctgcagcacg 3781 gccgcgggcc gccaaccgcc ggttccaccg cccagtcccc gggagccggc gccgccagcc 3841 ctgcgggtcg cgggcacggc ggggctgcgg gagccggccg cgccggggcc gccctgcccg 3901 ccgagctcgc ccacccggcc gccgagtgcg tggatcccgc cgtggtcttc ccctttcccg 3961 tgaacaagcg cgagccagcg cccgtgcccg cagccccggc cagtgccccg gcggcgggcc 4021 ctgcggtcgc ctcgggggcg ggtattgccg ccccagccgg ggccccgggg gtcgcccctc 4081 cgcgcccagg cggccgccag accagcggcg gcgaccacaa ggccctcagt acctccggag 4141 aggacaccct gagcgattca ggtaaagacc gaactcgggt ccggctgcct ccctggggca 4201 ctggaccccg ggtcgcgtcc cctttgttag tgctcgtatg tcttggcctg gggagcattt 4261 tggaggcagt gctaggggca gagaggtcct gtttccccca agtctctcct cggggtaaag 4321 agaaggggct gagagaatgc cgttgcaaaa ggggtgctct ccaattctcg ccttcactaa 4381 agttccttcc accctctcct ggggagccct cctctaggcc atcacgggcc ctcacccggt 4441 cccccacctc tcttttgcag cgcagtctga ggaataaaat tggagaaagt tggtggctaa 4501 accgggtggg ggtttagggg gttgctgggt gcactgcctg gacagaaacc tgttagcgca 4561 ggggtgaaag ggactctctg gcccaggtca ggggagggaa agacatcccc gagaagattc 4621 aagggctgtg caaagccctg tttaaggcgc aggaacttat aggagggttg caacagatgg 4681 ctagagccga ttttctattc tttttctttt tttttttttt ttttcaaatg tcggtacctt 4741 tcccttcccc catcctcggt gggtggtggg ctatttgctc ctggtgcgtg gccagcaggc 4801 ggcgatatgc gaggccagca ggcgggcccg ggatctgaaa ggctgggggt ggtgggggca 4861 ccctccctcc ctccattcag cagctggctg caagtgcaac agcagttgtg tacattctca 4921 gggggcctcc tctttccagt gtgcagtgga aagtggctgt agttttgtct tccagcctga 4981 attccaggcc taatttgaga tgtgagttgt atctgtaacc cagtgccctt gaaggtgagg 5041 gcaggcactc agcagcctct ccaggaaggc tcacatcctg ggaggactca ctgattagtt 5101 ctattgtgtt catttgtctg tgtcttaagc tgaagggaag agttaaaacc aagcctttcc 5161 ctgggggtct ggatgaacag aactcaaccc aaagagtggc attgccttgt ccttggagca 5221 gggagctggg accccccttg gactttgaaa accagtgttt tcagaatgca ggtggataac 5281 aagcctaaat ttacttctgg gctgaggaga gatctttgag gctcctggaa ggaaacttgg 5341 tgataagcct ccagtttgaa acggctctgt ccctttaatg tctgtgcctt gacagctttt 5401 ggtgaggaag cacttccttc caacagctgt cttcttggca gaaaaccaaa aattggctta 5461 aagggaccca cagactggaa cagcctcaca tttcggcttt agaacaaatc ccacaattgt 5521 tcagctttcc ggtccccttc agatcaagca gaagatatgt tttgattttc atgcttgtat 5581 tttaaacaat aattttctac cccagcgtgg tagtcaatga ggagagaggg gaagaatgcg 5641 cacatgatgc tacacgtttc tgttgttgct gttattattg gtggctttga ggagagctgc 5701 tcccatttgg gggtttatac caactgtgga ttatggcttt gtcattaaga tttgatcttt 5761 gttaaatgaa aaactgttta ttgtataaaa ctcaggtttg tggacgaaaa gttgtttttt 5821 ttcttcagtt aattaaattg ttcctcaagt ttgtttaagg acttaaaatc aaacacaacc 5881 atgtgtaaac tgctaaatga ggctcctaaa atgagaggcc tcaactcttt aagtgtggag 5941 ctagaaatgt aaataagtcc acagggcaga ctggtgatta tgataaaagc taccatttac 6001 tgagcatctg tctactaggc tcagctctat gctaagtcta catgttatct gtcaaagtgg 6061 tatcatcccc atttaatagc tgaggaaaca gaggcttaga aaggctgggt aacttgacca 6121 gggtcatgca actagtctgc ggtggagcca ggattctgtc tgaccctaaa ggccaagttc 6181 tttatattta tttctaccac ctgctaaagt cttgaatgga ggctgaaagc acagttgggg 6241 tatggggaag aaaaatatat atacatacat atatgtatat gtatgtatgt atgtatgggg 6301 ggttgttttg tttttgtttt tgataaggag ttttgctctt gttgcccagg ctggagtgca 6361 gtggtatgat ctgggctcac tgcaacctcc gcctcccggg ttcaagtcat tctcctgcct 6421 cagcctcccg agtagctggg attaccggag catgccacca cacccagcaa agttttgtat 6481 ttttagtaga gacagggttt caccatgttg gccaggctga tcttgaactc ctcatctcag 6541 gtgatctgcc cgcctccgct tcccaaagtg ctgggattac aggtgtgagt caccgcgtcc 6601 ggcctacaga tatatttaat ttaaagagat ctaaaacaaa tacaaaactg tccacatcta 6661 tgttgatgga cccataaaaa tagcagtctg ccagggtctg ccggaagaga cagataagca 6721 tacatattaa catggatata tatgtgaatt tcattcaaat ggttctcaca tgagagtaac 6781 tagcatcttt ctctcagatg atgaagatga tgaagaggaa gatgaagagg aagaaatcga 6841 cgtggtcact gtggagaagc ggcgttcctc ctccaacacc aaggctgtca ccacattcac 6901 catcactgtg cgtcccaaga acgcagccct gggtcccggg agggctcagt ccagcgagct 6961 gatcctcaaa cgatgccttc ccatccacca gcagcacaac tatgccgccc cctctcccta 7021 cgtggagagt gaggatgcac ccccacagaa gaagataaag agcgaggcgt ccccacgtcc 7081 gctcaagagt gtcgtccccc caaaggctaa gagcttgagc ccccgaaact ctgactcgga 7141 ggacagtgag cgtcgcagaa accacaacat cctggagcgc cagcgccgca acgaccttcg 7201 gtccagcttt ctcacgctca gggaccacgt gccggagttg gtaaagaatg agaaggccgc 7261 caaggtggtc attttgaaaa aggccactga gtatgtccac tccctccagg ccgaggagca 7321 ccagcttttg ctggaaaagg aaaaattgca ggcaagacag cagcagttgc taaagaaaat 7381 tgaacacgct cggacttgct agacgcttct caaaactgga cagtcactgc cactttgcac 7441 attttgattt tttttttaaa caaacattgt gttgacatta agaatgttgg tttactttca 7501 aatcggtccc ctgtcgagtt cggctctggg tgggcagtag gaccaccagt gtggggttct 7561 gctgggacct tggagagcct gcatcccagg atgctgggtg gccctgcagc ctcctccacc 7621 tcacctccat gacagcgcta aacgttggtg acggttggga gcctctgggg ctgttgaagt 7681 caccttgtgt gttccaagtt tccaaacaac agaaagtcat tccttctttt taaaatggtg 7741 cttaagttcc agcagatgcc acataagggg tttgccattt gatacccctg gggaacattt 7801 ctgtaaatac cattgacaca tccgcctttt gtatacatcc tgggtaatga gaggtggctt 7861 ttgcggccag tattagactg gaagttcata cctaagtact gtaataatac ctcaatgttt 7921 gaggagcatg ttttgtatac aaatatattg ttaatctctg ttatgtactg tactaattct 7981 tacactgcct gtatacttta gtatgacgct gatacataac taaatttgat acttatattt 8041 tcgtatgaaa atgagttgtg aaagttttga gtagatatta ctttatcact ttttgaacta 8101 agaaactttt gtaaagaaat ttactatata tatatgcctt tttcctagcc tgtttcttcc 8161 tgttaatgta tttgttcatg tttggtgcat agaactgggt aaatgcaaag ttctgtgttt 8221 aatttcttca aaatgtatat atttagtgct gcatcttata gcactttgaa atacctcatg 8281 tttatgaaaa taaatagctt aaaattaaat gatgcaactc aaccttttcc ttaatggcat 8341 tacactctgt cccttaagga gcaaccataa ataatccata acctatagag ggaatttggt 8401 ttccgtaaat agcccttttt gcacctgtac aatcctggtt ggggggcatg agtcattgtc 8461 cccacttaag gtggagggaa ctgagtttgg ggaaattaag gcagttctcc aagattatgc 8521 agaatagaga tgttattagc gactattgtg tgcattgtag caatggcatt tgataattta 8581 cagagcactt ctgtatactg tggctcctta gtggaattaa gctgagacct cagatcagtc 8641 cctttaaaag aaaagtaaaa atagccacag ggttgtttaa ctcgcttgta ttgggctttg 8701 gtagtattcg tcccattggc agacagtctt ctattttaag gtagagcata gtttgtctcc 8761 aa // LOCUS HSNOP56 1973 bp RNA PRI 30-NOV-1997 DEFINITION Homo sapiens mRNA for nucleolar protein hNop56. ACCESSION Y12065 NID g2230877 KEYWORDS hNop56; nucleolar protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1973) AUTHORS Gautier,T., Berges,T., Tollervey,D. and Hurt,E. TITLE Nucleolar KKE/D repeat proteins Nop56p and Nop58p interact with Nop1p and are required for ribosome biogenesis JOURNAL Mol. Cell. Biol. 17 (12), 7088-7098 (1997) MEDLINE 98038777 REFERENCE 2 (bases 1 to 1973) AUTHORS Gautier,T. TITLE Direct Submission JOURNAL Submitted (24-MAR-1997) T. Gautier, Universitaet Heidelberg, Institut fuer Biochemie I, Im Neuenheimer Feld 328, 69120 Heidelberg, FRG COMMENT Related sequence: U14913. FEATURES Location/Qualifiers source 1..1973 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Hela" CDS 22..1830 /note="homologue to yeast nucleolar protein" /codon_start=1 /product="hNop56" /db_xref="PID:e1188703" /db_xref="PID:g2230878" /translation="MAGRGAMVLLHVLFEHAVGYALVALKEVEEISLLQPQVEESVLN LGKFHSIVRLVAFCPFASSQVALENANAVSEGVVHEDLRLLLETHLPSKKKKVLLGVG DPKIGAAIQEELGYNCQTGGVIAEILRGVRLHFHNLVKGLTDLSACKAQLGLGHSYSR AKVKFNVNRVDNMIIQSISLLDQLDKDINTFSMRVREWYGYHFPELVKIINDNATYCR LAQFIGNRRELNEDKLEKLEELTMDGAKAKAILDASRSSMGMDISAIDLINIESFSSR VVSLSEYRQSLHTYLRSKMSQVAPSLSALIGEAVGARLIAHAGSLTNLAKYPASTVQI LGAEKALFRALKTRGNTPKYGLIFHSTFIGRAAAKNKGRISRYLANKCSIASRIDCFS EVPTSVFGEKLREQVEERLSFYETGEIPRKNLDVMKEAMVQAEAEEAAAEITRKLEKQ EKKRLKKEKKRLAALALASSENSSSTPEECEETSEKPKKKKKQKPQEVPQENGMEDPS ISFSKPKKKKSFSKEELMSSDLEETAGSTSIPKRKKSTPKEETVNDPEEAGHRSRSKK KRKFSKEEPVSSGPEEAVGKSSSKKKKKFHKASQED" BASE COUNT 567 a 470 c 544 g 392 t ORIGIN 1 ggatccggca acgaaggtac catggcggga cgtggcgcca tggtgctgtt gcacgtgctg 61 tttgagcacg cggtcggcta cgcgctcgtg gcgctgaagg aagtggagga gatcagtctg 121 ctgcagccgc aggtggagga gtccgtgctc aacctgggca aattccacag catcgttcgt 181 ctggtggcct tttgtccctt tgcctcatcc caggttgcct tggaaaatgc caacgccgtg 241 tctgaagggg ttgttcatga ggacctccgc ctgctcttgg agacccacct gccgtccaaa 301 aagaagaaag tactcttggg agttggggat cccaagattg gtgccgcaat acaggaggag 361 ttagggtaca actgccagac tggaggagtc atagctgaga tcctgcgagg agttcgtctg 421 cacttccaca atctggtgaa gggtctgacc gatctgtcag cttgtaaagc acagctgggg 481 ctgggacaca gctattcccg tgccaaagtt aagtttaatg tgaaccgggt ggacaatatg 541 atcatccagt ccattagcct cctggaccag ctggataagg acatcaatac cttctctatg 601 cgtgtcaggg agtggtacgg gtatcacttt ccggagctgg tgaagatcat caacgacaat 661 gccacatact gccgtcttgc ccagtttatt ggaaaccgaa gggaactgaa tgaggacaag 721 ctggagaagc tggaggagct gacaatggat ggggccaagg ctaaggctat tctggatgcc 781 tcacggtcct ccatgggcat ggacatatct gccattgact tgataaacat cgagagcttc 841 tccagtcgtg tggtgtcttt atctgaatac cgccagagcc tacacactta cctgcgctcc 901 aagatgagcc aagtagcccc cagcctgtca gccctaattg gggaagcggt aggtgcacgt 961 ctcatcgcac atgctggcag cctcaccaac ctggccaagt atccagcatc cacagtgcag 1021 atccttgggg ctgaaaaggc cctgttcaga gccctgaaga caaggggtaa cactccaaaa 1081 tatggactca ttttccactc caccttcatt ggccgagcag ctgccaagaa caaaggccgc 1141 atctcccgat acctggcaaa caaatgcagt attgcctcac gaatcgattg cttctctgag 1201 gtgcccacga gtgtattcgg ggagaagctt cgagaacaag ttgaagagcg actgtccttc 1261 tatgagactg gagagatacc acgaaagaat ctggatgtca tgaaggaagc aatggttcag 1321 gcagaggcag aggaagcggc tgctgagatt actaggaagc tggagaaaca ggagaagaaa 1381 cgcttaaaga aggaaaagaa acggctggct gcacttgccc tcgcgtcttc agaaaacagc 1441 agtagtactc cagaggagtg tgaggagacg agtgaaaaac ccaaaaagaa gaaaaagcaa 1501 aagccccagg aggttcctca ggagaatgga atggaagacc catctatctc tttctccaaa 1561 cccaagaaaa agaaatcttt ttccaaggag gagttgatga gtagcgatct tgaagagacc 1621 gctggcagca ccagtattcc caagaggaag aagtctacac ccaaggagga aacagttaat 1681 gaccctgagg aggcaggcca cagaagtcgg tccaagaaaa agaggaaatt ctccaaagag 1741 gagccggtca gcagtgggcc tgaagaggcg gttggcaaga gcagctccaa gaagaagaaa 1801 aagttccata aagcatccca ggaagattag aatgcaaatg gacattctct gggaggtggg 1861 gcataccata gcccaaggtg acatttccca ccctgtgccc gtgttcccaa taaaaacaaa 1921 ttcacaagaa aaaaaaaaaa aaaaaaaaaa ttcctgaggc cgcaagggaa ttc // LOCUS HSNOT 3427 bp RNA PRI 06-JAN-1995 DEFINITION H.sapiens mRNA for NOT. ACCESSION X75918 NID g415822 KEYWORDS immediate early gene; steroid-thyroid hormone; transcription factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3427) AUTHORS Mages,H.W., Rilke,O., Bravo,R., Senger,G. and Kroczek,R.A. TITLE NOT, a human immediate-early response gene closely related to the steroid/thyroid hormone receptor NAK1/TR3 JOURNAL Mol. Endocrinol. 8 (11), 1583-1591 (1994) MEDLINE 95183071 REFERENCE 2 (bases 1 to 3427) AUTHORS Mages,H.W. TITLE Direct Submission JOURNAL Submitted (04-NOV-1993) H.W. Mages, Robert Koch-Institute, Molecular Immunology, Nordufer 20, 13353 Berlin, FRG FEATURES Location/Qualifiers source 1..3427 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="peripheral blood" /cell_type="T-lymphocyt" gene 318..2114 /gene="NOT" CDS 318..2114 /gene="NOT" /codon_start=1 /db_xref="PID:g415823" /db_xref="SWISS-PROT:P43354" /translation="MPCVQAQYGSSPQGASPASQSYSYHSSGEYSSDFLTPEFVKFSM DLTNTEITATTSLPSFSTFMDNYSTGYDVKPPCLYQMPLSGQQSSIKVEDIQMHNYQQ HSHLPPQSEEMMPHSGSVYYKPSSPPTPTTPGFQVQHSPMWDDPGSLHNFHQNYVATT HMIEQRKTPVSRLSLFSFKQSPPGTPVSSCQMRFDGPLHVPMNPEPAGSHHVVDGQTF AVPNPIRKPASMGFPGLQIGHASQLLDTQVPSPPSRGSPSNEGLCAVCGDNAACQHYG VRTCEGCKGFFKRTVQKNAKYVCLANKNCPVDKRRRNRCQYCRFQKCLAVGMVKEVVR TDSLKGRRGRLPSKPKSPQEPSPPSPPVSLISALVRAHVDSNPAMTSLDYSRFQANPD YQMSGDDTQHIQQFYDLLTGSMEIIRGWAEKIPGFADLPKADQDLLFESAFLELFVLR LAYRSNPVEGKLIFCNGVVLHRLQCVRGFGEWIDSIVEFSSNLQNMNIDISAFSCIAA LAMVTERHGLKEPKRVEELQNKIVNCLKDHVTFNNGGLNRPNYLSKLLGKLPELRTLC TQGLQRIFYLKLEDLVPPPAIIDKLFLDTLPF" polyA_signal 2719..2724 polyA_signal 2769..2774 polyA_signal 3401..3406 BASE COUNT 916 a 885 c 746 g 880 t ORIGIN 1 gctcgcgcac ggctccgcgg tcccttttgc ctgtccagcc ggccgcctgt ccctgctccc 61 tccctccgtg agtgtccggg ttcccttcgc ccagctctcc cacccctacc cgaccccggc 121 gcccgggctc ccagagggaa ctgcacttcg gcagagttga atgaatgaag agagacgcgg 181 agaactccta aggaggagat tggacaggct ggactcccca ttgcttttct aaaaatcttg 241 gaaactttgt ccttcattga attacgacac tgtccacctt taatttcctc gaaaacgcct 301 gtaactcggc tgaagccatg ccttgtgttc aggcgcagta tgggtcctcg cctcaaggag 361 ccagccccgc ttctcagagc tacagttacc actcttcggg agaatacagc tccgatttct 421 taactccaga gtttgtcaag tttagcatgg acctcaccaa cactgaaatc actgccacca 481 cttctctccc cagcttcagt acctttatgg acaactacag cacaggctac gacgtcaagc 541 caccttgctt gtaccaaatg cccctgtccg gacagcagtc ctccattaag gtagaagaca 601 ttcagatgca caactaccag caacacagcc acctgccccc ccagtctgag gagatgatgc 661 cgcactccgg gtcggtttac tacaagccct cctcgccccc gacgcccacc accccgggct 721 tccaggtgca gcacagcccc atgtgggacg acccgggatc tctccacaac ttccaccaga 781 actacgtggc cactacgcac atgatcgagc agaggaaaac gccagtctcc cgcctctccc 841 tcttctcctt taagcaatcg ccccctggca ccccggtgtc tagttgccag atgcgcttcg 901 acgggcccct gcacgtcccc atgaacccgg agcccgccgg cagccaccac gtggtggacg 961 ggcagacctt cgctgtgccc aaccccattc gcaagcccgc gtccatgggc ttcccgggcc 1021 tgcagatcgg ccacgcgtct cagctgctcg acacgcaggt gccctcaccg ccgtcgcggg 1081 gctccccctc caacgagggg ctgtgcgctg tgtgtgggga caacgcggcc tgccaacact 1141 acggcgtgcg cacctgtgag ggctgcaaag gcttctttaa gcgcacagtg caaaaaaatg 1201 caaaatacgt gtgtttagca aataaaaact gcccagtgga caagcgtcgc cggaatcgct 1261 gtcagtactg ccgatttcag aagtgcctgg ctgttgggat ggtcaaagaa gtggttcgca 1321 cagacagttt aaaaggccgg agaggtcgtt tgccctcgaa accgaagagc ccacaggagc 1381 cctctccccc ttcgcccccg gtgagtctga tcagtgccct cgtcagggcc catgtcgact 1441 ccaacccggc tatgaccagc ctggactatt ccaggttcca ggcgaaccct gactatcaaa 1501 tgagtggaga tgacacccag catatccagc aattctatga tctcctgact ggctccatgg 1561 agatcatccg gggctgggca gagaagatcc ctggcttcgc agacctgccc aaagccgacc 1621 aagacctgct ttttgaatca gctttcttag aactgtttgt ccttcgatta gcatacaggt 1681 ccaacccagt ggagggtaaa ctcatctttt gcaatggggt ggtcttgcac aggttgcaat 1741 gcgttcgtgg ctttggggaa tggattgatt ccattgttga attctcctcc aacttgcaga 1801 atatgaacat cgacatttct gccttctcct gcattgctgc cctggctatg gtcacagaga 1861 gacacgggct caaggaaccc aagagagtgg aagaactgca aaacaagatt gtaaattgtc 1921 tcaaagacca cgtgactttc aacaatgggg ggttgaaccg ccccaattat ttgtccaaac 1981 tgttggggaa gctcccagaa cttcgtaccc tttgcacaca ggggctacag cgcattttct 2041 acctgaaatt ggaagacttg gtgccaccgc cagcaataat tgacaaactt ttcctggaca 2101 ctttaccttt ctaagacctc ctcccaagca cttcaaagga actggaatga taatggaaac 2161 tgtcaagagg gggcaagtca catgggcaga gatagccgtg tgagcagtct cagctcaagc 2221 tgccccccat ttctgtaacc ctcctagccc ccttgatccc taaagaaaac aaacaaacaa 2281 acaaaaactg ttgctatttc ctaacctgca ggcagaacct gaaagggcat tttggctccg 2341 gggcatcctg gatttagaac atggactaca cacaatacag tggtataaac tttttattct 2401 cagtttaaaa atcagtttgt tgttcagaag aaagattgct ataaggtata atgggaaatg 2461 tttggccatg cttggttgtt gcagttcaga caaatgtaac acacacacac atacacacac 2521 acacacacac agagacacat cttaagggga cccacaagta ttgcccttta acaagacttc 2581 aaagttttct gctgtaaaga aagctgtaat atatagtaaa actaaatgtt gcgtgggtgg 2641 catgagttga agaaggcaaa ggcttgtaaa tttacccaat gcagtttggc tttttaaatt 2701 attttgtgcc tatttatgaa taaatattac aaattctaaa agataagtgt gtttgcaaaa 2761 aaaaagaaaa taaatacata aaaaagggac aagcatgttg attctaggtt gaaaatgtta 2821 taggcacttg ctacttcagt aatgtctata ttatataaat agtatttcag acactatgta 2881 gtctgttaga ttttataaag attggtagtt atctgagctt aaacattttc tcaattgtaa 2941 aataggtggg cacaagtatt acacatcaga aaatcctgac aaaagggaca catagtgttt 3001 gtaacaccgt ccaacattcc ttgtttgtaa gtgttgtatg taccgttgat gttgataaaa 3061 agaaagttta tatcttgatt attttgttgt ctaaagctaa acaaaacttg catgcagcag 3121 cttttgactg tttccagagt gcttataata tacataactc cctggaaata actgagcact 3181 ttgaattttt tttatgtcta aaattgtcag ttaatttatt attttgtttg agtaagaatt 3241 ttaatattgc catattctgt agtatttttc tttgtatatt tctagtatgg cacatgatat 3301 gagtcactgc ctttttttct atggtgtatg acagttagag atgctgattt tttttctgat 3361 aaattctttc tttgagaaag acaattttaa tgtttacaac aataaaccat gtaaatgaaa 3421 aaaaaaa // LOCUS HSNOT56LP 1463 bp RNA PRI 29-OCT-1996 DEFINITION H.sapiens mRNA for Not56-like protein. ACCESSION Y09022 NID g1653999 KEYWORDS not gene; Not56-like protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1463) AUTHORS Kurzik-Dumke,U. and Kaymer,M. TITLE Sequence of the human homologue of the Drosophila melanogaster Not56 protein and its expression in various tissues JOURNAL Unpublished REFERENCE 2 (bases 1 to 1463) AUTHORS Kurzik-Dumke,U. TITLE Direct Submission JOURNAL Submitted (24-OCT-1996) U. Kurzik-Dumke, Institut fuer Genetik, Johannes Gutenberg Universitaet, Saarstrasse 21, D- 55099 Mainz, FRG COMMENT Related sequences: X77820 and X79489. FEATURES Location/Qualifiers source 1..1463 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /dev_stage="fetal" /clone_lib="Uni-ZAP XR vector (Stratagene 937226)" /clone="KH21" gene 32..1348 /gene="not" CDS 32..1348 /gene="not" /codon_start=1 /product="Not56-like protein" /db_xref="PID:e276888" /db_xref="PID:g1654000" /translation="MAAGLRKRGRSGSAAQAEGLCKQWLQRAWQERRLLLREPRYTLL VAACLCLAEVGITFWVIHRVAYTEIDWKAYMAEVEGVINGTYDYTQLQGDTGPLVYPA GFVYIFMGLYYATSRGTDIRMAQNIFAVLYLATLLLVFLIYHQTCKVPPFVFFFMCCA SYRVHSIFVLRLFNDPVAMVLLFLSINLLLAQRWGWGCCFFSLAVSVKMNVLLFAPGL LFLLLTQFGFRGALPKLGICAGLQVVLGLPFLLENPSGYLSRSFDLGRQFLFHWTVNW RFLPEALFLHRAFHLALLTAHLTLLLLFALCRWHRTGESILSLLRDPSKRKVPPQPLT PNQIVSTLFTSNFIGICFSRSLHYQFYVWYFHTLPYLLWAMPARWLTHLLRLLVLGLI ELSWNTYPSTSCSSAALHICHAVILLQLWLGPQPFPKSTQHSKKAH" BASE COUNT 262 a 484 c 368 g 349 t ORIGIN 1 ggtgggccca cacaagcggc gcaccgttaa gatggcggct gggctgcgga aacgcggccg 61 gtccggttcc gcggcccagg cagagggact ctgcaagcaa tggctgcagc gcgcctggca 121 agagcggcgc ctgctgctgc gggagccgcg ctacacgctg ctggtggccg cctgcctctg 181 cctggcggag gtgggcatca ccttctgggt cattcacagg gtggcataca cagagattga 241 ctggaaggcc tacatggccg aggtagaagg cgtcatcaat ggtacctatg actataccca 301 actgcagggt gacaccggac cacttgtgta cccagctggt ttcgtgtaca tctttatggg 361 gttgtactat gccaccagcc gaggcactga catccgcatg gcccagaaca tctttgctgt 421 gctctacctg gctaccttgc tgcttgtctt cttgatctat caccagacct gcaaggtacc 481 tcccttcgtc tttttcttca tgtgctgcgc ctcttaccgt gtccactcca tctttgtgct 541 gcggctcttc aatgacccag tggccatggt gctgctcttc ctcagtatca acctcctgct 601 ggcccagcgc tggggctggg gctgctgctt tttcagcctg gcagtctctg tgaagatgaa 661 tgtgctgctc ttcgcccctg ggttactgtt tcttctcctc acacagtttg gcttccgtgg 721 ggccctcccc aagctgggaa tctgtgctgg ccttcaggtg gtgctggggc tgcccttcct 781 gctggagaac cccagcggct acctgtcccg ctcctttgac cttggccgcc agtttctgtt 841 ccactggaca gtgaactggc gcttcctccc agaggcgctc ttcctgcatc gagccttcca 901 cctggccctg ttgactgccc acctcaccct gctcctgctg tttgccctct gcaggtggca 961 caggacaggg gaaagtatct tgtcgctgct gagggatccc tccaaaagga aggttccacc 1021 ccagcccctt acacccaacc agatcgtttc taccctcttc acctccaact tcattggcat 1081 ctgcttcagc cgctccctcc actaccagtt ctacgtctgg tatttccaca cactgcccta 1141 cctcctgtgg gccatgcctg cacggtggct cacacacctg ctcaggttgt tggtgctggg 1201 gctcatcgag ctctcctgga acacataccc ttccacatcc tgcagctctg ctgccctgca 1261 catatgccat gccgtcatcc tgctgcagct ctggctgggc ccgcagcctt tccccaagag 1321 cacccaacac agcaagaaag cccactgaag tccacccctt tccctcagga cctgagtcta 1381 ccctcaggac ctggggtggg ttggactctg cccttccaaa taaaccttgc taagtccaaa 1441 aaaaaaaaaa aaaaaaaaaa aaa // LOCUS HSNOV 2588 bp RNA PRI 04-SEP-1996 DEFINITION H.sapiens mRNA for novel gene in Xq28 region. ACCESSION X92396 NID g1150415 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2588) AUTHORS D'Esposito,M., Ciccodicola,A., Gianfrancesco,F., Esposito,T., Flagiello,L., Mazzarella,R., Schlessinger,D. and D'Urso,M. TITLE A synaptobrevin-like gene in the Xq28 pseudoautosomal region undergoes X inactivation JOURNAL Nature Genet. 13 (2), 227-229 (1996) MEDLINE 96225453 REFERENCE 2 (bases 1 to 2588) AUTHORS D'Urso,M. TITLE Direct Submission JOURNAL Submitted (17-OCT-1995) M. D'Urso, International Institute of Genetics and, Biophysics,, CNR, Via Marconi, 10, I- 80125 Naples, ITALY FEATURES Location/Qualifiers source 1..2588 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="NTERA2" /clone="D1" /clone_lib="phage10" /chromosome="X" /map="q28" gene 115..777 /gene="ORF" CDS 115..777 /gene="ORF" /codon_start=1 /db_xref="PID:e206118" /db_xref="PID:g1150416" /db_xref="SWISS-PROT:P51809" /translation="MAILFAVVARGTTILAKHAWCGGNFLEVTEQILAKIPSENNKLT YSHGNYLFHYICQDRIVYLCITDDDFERSRAFNFLNEIKKRFQTTYGSRAQTALPYAM NSEFSSVLAAQLKHHSENKGLDKVMETQAQVDELKGIMVRNIDLVAQRGERLELLIDK TENLVDSSVTFKTTSRNLARAMCMKNLKLTIIIIIVSIVFIYIIVSPLCGGFTWPSCV KK" BASE COUNT 763 a 486 c 500 g 839 t ORIGIN 1 gaattcgccg gtccagcctc ctctgggagc gggcagttgg cgaccctgca ctgacccgcg 61 tccctccgtc ccgagcccgc gcgccctcag agggtgcccg gacagactga agccatggcg 121 attctttttg ctgttgttgc cagggggacc actatccttg ccaaacatgc ttggtgtgga 181 ggaaacttcc tggaggtgac agagcagatt ctggctaaga taccttctga aaataacaaa 241 ctaacgtact cacatggcaa ttatttgttt cattacatct gccaagacag gattgtatat 301 ctttgtatca ctgatgatga ttttgaacgt tcccgagcct ttaattttct gaatgagata 361 aagaagaggt tccagactac ttacggttca agagcacaga cagcacttcc atatgccatg 421 aatagcgagt tctcaagtgt cttagctgca cagctgaagc atcactctga gaataagggc 481 ctagacaaag tgatggagac tcaagcccaa gtggatgaac tgaaaggaat catggtcaga 541 aacatagatc tggtagctca gcgaggagaa agattggaat tattgattga caaaacagaa 601 aatcttgtgg attcttctgt caccttcaaa actaccagca gaaatcttgc tcgagccatg 661 tgtatgaaga acctcaagct cactattatc atcatcatcg tatcaattgt gttcatctat 721 atcattgttt cacctctctg tggtggattt acatggccaa gctgtgtgaa gaaataggaa 781 agaagaagtt accattaacc aaggatatga gagaacaagg agttaaaagc aatccatgtg 841 actcaagcct ttcacatact gacagatggt atctgccagt ctcttcaacc ctcttctcac 901 tttttaaaat cttgttccat gcctccaggt ttatctttgt cttatctacc agtttattcc 961 tgtgaacttc agattgaacc attcattgca gcagtagcct taaaaaggct tttgtttatt 1021 tctttggttt gttaactagt gtcatctatt tagagaaaca tttttgtttt taattgctca 1081 aagctgtcgc cgctagtctt atgagctatc tactaaaact atggagaaac tttgtatgtg 1141 cacacaaaag tattcaagag acagtattgc taacatctca tcttaatgtc ttttgttatt 1201 gagaagtttt aggtgcttca aaacaatata aatggataat agttgttatt tggggaattg 1261 taatgatgtt ggtgctgctt ccttctaaga gctcagacaa gtaaagtatg aaacattctt 1321 atttcagtta gatggggaac attttgctag cccattagaa gcacacagaa ttatccttgt 1381 cctcctaata ttgactttca ggaataaagt tcagtgtgct gatcattcac aatacagtgg 1441 atagcttgat atcttctgtt ttcccattgc agttgatttg agaagatgaa ggtttaaata 1501 ttgttgaaag ttgcagtttt ttaaatgtgt tcctttttct tctgtgaata tttagggcaa 1561 tcgtgtcgct aatagaatat gtagtagagg gggtggggag gtaaattcct ctgacttgcc 1621 aaagaaaaag aagggaacca cagtggatat gctagcattt tagctgtgca aagggaggta 1681 gtgtgggaaa agtgtttcca ttctgggaaa agcccaaacc gaatacggtc agcagtcaac 1741 tccagggttt gggcttgatt cctgttgaat aatagttttg agcattcttt gtggttaaat 1801 aaattcttaa atctgcctag ttttgatgaa ttcttttgtg aaacttgaaa gagaatagac 1861 agtatgacat atagaattaa tacaaaacag tttaacaacc atttaactgc agtgtaagaa 1921 aattggactg taatcatatc gctactggca tctgttatct agtatgcatt tctggtgtgt 1981 atctgaaagg aagacatttt ctaccctaga tccaattgca tttatttatc aataagtgcc 2041 attaaattga aattatatta cattttacac tttctcaatg aatgaacaaa ttagtctgta 2101 gaatctagcc acctgtttag cctagtcatg tgccttgaac atatatgtgt cccataatct 2161 ggctcatggt acctgttctt ctatccaaac ctttcaattc atgctacctg attcatttat 2221 ttgacataga tcttaggccc acttgaactc ttttcttgtt tatctagcat agcacaaacg 2281 tttttccagt cttctttatc aacactaatg cctcttaatt gcatcagtat ttcctattgg 2341 aaaatacatc tgttccagaa aaacatttgg cattcctgaa taatttccaa atgtttttaa 2401 tccaaagaaa aaggtttaaa gcttatttcc ctttcttata cacacctgaa taaaattgat 2461 gtgcatgttt tagggatcaa ttacctaact gttccttggt ctatttatgt ataagaatgc 2521 tttttaaagc acatgtctca ttttaaatga cgcacaaact gaagatgtta ataaaattta 2581 aggaattc // LOCUS HSNOVH 1973 bp RNA PRI 21-OCT-1996 DEFINITION H.sapiens mRNA for NOV protein. ACCESSION X96584 NID g1225993 KEYWORDS NOV gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1973) AUTHORS Perbal,B. TITLE Direct Submission JOURNAL Submitted (11-MAR-1996) B. Perbal, Institut Curie, Centre Universitaire, Bat 110, 91405 Orsay Cedex, FRANCE REFERENCE 2 (bases 1 to 1973) AUTHORS Martinerie,C., Chevalier,G., Rauscher,F.J. 3rd and Perbal,B. TITLE Regulation of nov by WT1: a potential role for nov in nephrogenesis JOURNAL Oncogene 12 (7), 1479-1492 (1996) MEDLINE 96204003 COMMENT Overlaps with X59284, X78351, X78352, X78353 and X78354. FEATURES Location/Qualifiers source 1..1973 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /clone_lib="lambda tap" /clone="LC82" /chromosome="8" /map="q24.1" gene 73..1146 /gene="nov" CDS 73..1146 /gene="nov" /codon_start=1 /db_xref="PID:e228691" /db_xref="PID:g1225994" /db_xref="SWISS-PROT:P48745" /translation="MQSVQSTSFCLRKQCLCLTFLLLHLLGQVAATQRCPPQCPGRCP ATPPTCAPGVRAVLDGCSCCLVCARQRGESCSDLEPCDESSGLYCDRSADPSNQTGIC TAVEGDNCVFDGVIYRSGEKFQPSCKFQCTCRDGQIGCVPRCQLDVLLPEPNCPAPRK VEVPGECCEKWICGPDEEDSLGGLTLAAYRPEATLGVEVSDSSVNCIEQTTEWTACSK SCGMGFSTRVTNRNRQCEMLKQTRLCMVRPCEQEPEQPTDKKGKKCLRTKKSLKAIHL QFKNCTSLHTYKPRFCGVCSDGRCCTPHNTKTIQAEFQCSPGQIVKKPVMVIGTCTCH TNCPKNNEAFLQELELKTTRGKM" BASE COUNT 548 a 484 c 471 g 470 t ORIGIN 1 gggaaggcga gcagtgccaa tctacagcga agaaagtctc gtttggtaaa agcgagaggg 61 gaaagcctga gcatgcagag tgtgcagagc acgagctttt gtctccgaaa gcagtgcctt 121 tgcctgacct tcctgcttct ccatctcctg ggacaggtcg ctgcgactca gcgctgccct 181 ccccagtgcc cgggccggtg ccctgcgacg ccgccgacct gcgcccccgg ggtgcgcgcg 241 gtgctggacg gctgctcatg ctgtctggtg tgtgcccgcc agcgtggcga gagctgctca 301 gatctggagc catgcgacga gagcagtggc ctctactgtg atcgcagcgc ggaccccagc 361 aaccagactg gcatctgcac ggcggtagag ggagataact gtgtgttcga tggggtcatc 421 taccgcagtg gagagaaatt tcagccaagc tgcaaattcc agtgcacctg cagagatggg 481 cagattggct gtgtgccccg ctgtcagctg gatgtgctac tgcctgagcc taactgccca 541 gctccaagaa aagttgaggt gcctggagag tgctgtgaaa agtggatctg tggcccagat 601 gaggaggatt cactgggagg ccttaccctt gcagcttaca ggccagaagc caccctagga 661 gtagaagtct ctgactcaag tgtcaactgc attgaacaga ccacagagtg gacagcatgc 721 tccaagagct gtggtatggg gttctccacc cgggtcacca ataggaaccg tcaatgtgag 781 atgctgaaac agactcggct ctgcatggtg cggccctgtg aacaagagcc agagcagcca 841 acagataaga aaggaaaaaa gtgtctccgc accaagaagt cactcaaagc catccacctg 901 cagttcaaga actgcaccag cctgcacacc tacaagccca ggttctgtgg ggtctgcagt 961 gatggccgct gctgcactcc ccacaatacc aaaaccatcc aggcagagtt tcagtgctcc 1021 ccagggcaaa tagtcaagaa gccagtgatg gtcattggga cctgcacctg tcacaccaac 1081 tgtcctaaga acaatgaggc cttcctccag gagctggagc tgaagactac cagagggaaa 1141 atgtaacctg tcactcaaga agcacaccta cagagcacct gtagctgctg cgccacccac 1201 catcaaagga atataagaaa agtaatgaag aatcacgatt tcatccttga atcctatgta 1261 ttttcctaat gtgatcatat gaggaccttt catatctgtc ttttatttaa caaaaaatgt 1321 aattaactgt aaacttggaa tcaaggtaag ctcaggatat ggcttaggaa tgacttactt 1381 tcctgtggtt ttattacaaa tgcaaatttc tataaattta agaaaacaag tatataattt 1441 actttgtaga ctgtttcaca ttgcactcat catattttgt tgtgcactag tgcaattcca 1501 agaaaatatc actgtaatga gtcagtgaag tctagaatca tacttaacat ttcattgtac 1561 aagtattaca accatatatt gaggttcatt gggaagattc tctattggct ccctttttgg 1621 gtaaaccagc tctgaacttc caagctccaa atccaaggaa acatgcagct cttcaacatg 1681 acatccagag atgactatta cttttctgtt tagttttaca ctaggaacgt gttgtatcta 1741 cagtaatgaa atgtttacta agtggactgg tgtcataact tctccattag acacatgact 1801 ccttccaata gaaagaaact aaacagaaaa ctcccaatac aaagatgact ggtccctcat 1861 agccctcaga catttatata ttggaagctg ctgaggcccc caagtttttt aattaagcag 1921 aaacagcata ttagcaggga ttctctcatc taactgatga gtaaactgag gcc // LOCUS HSNP62 1840 bp RNA PRI 29-NOV-1993 DEFINITION Human mRNA for p62 nucleoporin. ACCESSION X58521 S59346 NID g432653 KEYWORDS nucleoporin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1840) AUTHORS Hurt,E.C. TITLE Direct Submission JOURNAL Submitted (12-MAR-1991) E.C. Hurt, E M B L, Cell Biology Program, Meyerhofstr 1, Postfach 10.2209, 6900 Heidelberg, Germany REFERENCE 2 (bases 1 to 1840) AUTHORS Carmo-Fonseca,M., Kern,H. and Hurt,E.C. TITLE Human nucleoporin p62 and the essential yeast nuclear pore protein NSP1 show sequence homology and a similar domain organization JOURNAL Eur. J. Cell Biol. 55 (1), 17-30 (1991) MEDLINE 92007939 FEATURES Location/Qualifiers source 1..1840 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 152..1720 /codon_start=1 /product="nucleoporin p62" /db_xref="PID:g432654" /db_xref="SWISS-PROT:P37198" /translation="MSGFNFGGTGAPTGGFTFGTAKTATTTPATGFSFSTSGTGGFNF GAPFQPATSTPSTGLFSLATQTPATQTTGFTFGTATLASGGTGFSLGIGASKLNLSNT AATPAMANPSGFGLGSSNLTNAISSTVTSSQGTAPTGFVFGPSTTSVAPATTSGGFSF TGGSTAQPSGFNIGSAGNSAQPTAPATLPFTPATPAATTAGATQPAAPTPTATITSTG PSLFASIATAPTSSATTGLSLCTPVTTAGAPTAGTQGFSLKAPGAASGTSTTTSTAAT ATATTTTSSSTTGFALNLKPLAPAGIPSNTAAAVTAPPGPGAAAGAAASSAMTYAQLE SLINKWSLELEDQERHFLQQATQVNAWDRTLIENGEKITSLHREVEKVKLDQKRLDQE LDFILSQQKELEDLLSPLEELVKEQRATIYLQHADEERQKTYKLAENIDAQLKRMAQD LKDIIEHLNTSGAPADTSDPLQQICKILNAHMDSLQWIDQNSALLQRKVEEVTKVCVG RRKEQERSFRITFD" BASE COUNT 413 a 606 c 502 g 319 t ORIGIN 1 ggaagaggta agcggttact cactccatgg ctgcagcaag gagaggcggc ggcggcctcg 61 gctgaagaaa gaagaaatct tcccaaggct gcagacaccg acggatttgc tttgggagcc 121 agagtagctg ccgccaccag agtccggagc catgagcggc tttaattttg gaggcactgg 181 ggcccctaca ggcgggttca cgtttggcac tgcaaagacg gcaacaacca cacctgctac 241 agggttttct ttctccacct ctggcactgg agggtttaat tttggggctc ccttccaacc 301 agccacaagt accccttcca ccggcctgtt ctcacttgcc acccagactc cggccacaca 361 gacgacaggc ttcacttttg gaacagcgac tcttgcttcg gggggaactg gattttcttt 421 ggggatcggt gcttcaaagc tcaacttgag caacacagct gccaccccag ccatggcaaa 481 ccccagcggc tttgggctgg gcagcagcaa cctcactaat gccatatcga gcaccgtcac 541 ctccagccag ggcacagcac ccaccggctt tgtgtttggc ccctccacca cctctgtggc 601 tccagctacc acatctggag gcttctcatt cactggtgga agcacggccc aaccctccgg 661 tttcaacatt ggctcagcag ggaattcagc ccagcccacg gcacctgcca cgttgccctt 721 cactccggcc acgccagcag ccaccacagc aggtgccaca cagccagctg ctcccacacc 781 cacagccacc atcaccagta ctgggcccag cctctttgcg tcaatagcaa ctgctccaac 841 ctcatctgcc accactggac tctccctctg tacccctgtg accacagcgg gcgcccccac 901 tgctgggaca cagggattca gcttaaaggc acctggagca gcttccggca cctccacaac 961 aacatccacc gctgccaccg ccaccgccac caccaccacc agcagcagca ccaccggctt 1021 tgccttgaat ttaaaaccac tggcgccagc cgggatcccc agcaatacag cagctgccgt 1081 gaccgctcca cctggccctg gcgcagctgc aggggcggct gccagctccg ccatgaccta 1141 cgcgcagctg gagagcctga tcaacaaatg gagcctggag ctagaggacc aggagcggca 1201 cttcctccag caggccaccc aggtcaacgc ctgggaccgc acgctgatcg agaatggaga 1261 aaagatcacc agcctgcacc gcgaggtgga gaaggtgaag ctggaccaga agaggctgga 1321 ccaggagctc gacttcatcc tgtcccagca gaaggagctg gaagacctgc tgagcccact 1381 ggaggagttg gtcaaggagc agagggcgac catctacctg cagcacgcgg atgaggagcg 1441 tcagaaaacc tacaagctgg ctgagaacat cgacgcacag ctcaagcgca tggcccagga 1501 tctcaaggac atcatcgagc acctgaacac gtccggggcc cccgccgaca ccagtgaccc 1561 actgcagcag atctgcaaga tcctcaatgc gcacatggac tcactgcagt ggatcgacca 1621 gaactcggcc ctgctgcaga ggaaggtgga ggaggtgacc aaggtgtgcg tgggccggcg 1681 caaggagcag gagcgcagct tccggatcac ctttgactga gcgacagcag ccctggggcc 1741 cgcaggtccc tagggagttc atgaggggaa tgcgccctgt tgtctgtagt ttggggttgt 1801 ggcaagatac ttgtttgttt gtttctttct ttcacagacg // LOCUS HSNRA3 1584 bp RNA PRI 26-MAY-1992 DEFINITION H.sapiens mRNA for nicotinic receptor alpha-3 subunit. ACCESSION X52239 NID g35089 KEYWORDS nicotinic receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1584) AUTHORS Fornasari,D. TITLE Direct Submission JOURNAL Submitted (21-MAR-1990) Fornasari D., CNR Centre of Cytopharmacology, Dept of Medical Pharmacology, Via Vanvitelli 32, 20129 Milano, Italy REFERENCE 2 (bases 1 to 1584) AUTHORS Fornasari,D., Chini,B., Tarroni,P. and Clementi,F. TITLE Molecular cloning of human neuronal nicotinic receptor alpha e subunit JOURNAL Neurosci. Lett. (1990) In press FEATURES Location/Qualifiers source 1..1584 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="nervous system" /cell_type="neuron" /cell_line="neuroblastoma IMR 32" /clone_lib="lambda gt10" sig_peptide 43..128 /note="nicotinic receptor alpha-3 subunit" CDS 43..1551 /codon_start=1 /product="nicotinic receptor alpha-3 subunit" /db_xref="PID:g35090" /db_xref="SWISS-PROT:P32297" /translation="MALAVSLPLACRARLLLLLLSLLPVARASEAEHRLFERLFEDYN EIIRPVANVSDPVIIHFEVSMSQLVKVDEVNQIMETNLWLKQIWNDYKLKWNPSGYGG AEFMRVPAQKIWKPDIVLYNNAVGDFQVTTKTKALLKYTGEVTWIPPAIFKSSCKIDV TYFPFDYQNCTMKFGSWSYDKAKIDLVLIGSSMNLKDYWESGEWAIIKAPGYKHDIKY NCCEEIYPDITYSLYSRRLPLFYTINLIIPCLLISFLTVLVFYLPSDCGEKVTLCISV LLSLTVFLLVITETIPSTSLVIPLIGEYLLFTMIFVTLSIVITVFVLNVHYRTPTTHT MPSWVKTVFLNLLPRVMFMTRPTSNEGNAQKPRPLYGAELSNLNCFSRAESKGCKEGY PCQDGMCGYCHHRRIKISNFSANLTRSSSSESVDAVVSLSALSPEIKEAIQSVKYIAE NMKAQNEAKEIQDDWKYVAMVIDRIFLWVFTLVCILGTAGLFLQPLMAREDA" mat_peptide 127..1548 /product="nicotinic receptor alpha-3 subunit" BASE COUNT 362 a 456 c 384 g 382 t ORIGIN 1 cccactcccg accgtccggt ccggcccacc cggccaccag ccatggctct ggccgtctcg 61 ctgcccctgg cctgtcgcgc gcggctgctg ctgctgctgc tgtctctgct gccagtggcc 121 agggcctcag aggctgagca ccgtctattt gagcggctgt ttgaagatta caatgagatc 181 atccggcctg tagccaacgt gtctgaccca gtcatcatcc atttcgaggt gtccatgtct 241 cagctggtga aggtggatga agtaaaccag atcatggaga ccaacctgtg gctcaagcaa 301 atctggaatg actacaagct gaagtggaac ccctctggct atggtggggc agagttcatg 361 cgtgtccctg cacagaagat ctggaagcca gacattgtgc tgtataacaa tgctgttggg 421 gatttccagg tgacgaccaa gaccaaagcc ttactcaagt acactgggga ggtgacttgg 481 atacctccgg ccatctttaa gagctcctgt aaaatcgacg tgacctactt cccgtttgat 541 taccaaaact gtaccatgaa gttcggttcc tggtcctacg ataaggcgaa aatcgatctg 601 gtcctgatcg gctcttccat gaacctcaag gactattggg agagcggcga gtgggccatc 661 atcaaagccc caggctacaa acacgacatc aagtacaact gctgcgagga gatctacccc 721 gacatcacat actcgctgta cagtcggcgc ctgcccttgt tctacaccat caacctcatc 781 atcccctgcc tgctcatctc cttcctcact gtgctcgtct tctacctgcc ctccgactgc 841 ggtgagaagg tgaccctgtg catttctgtc ctcctctccc tgacggtgtt tctcctggtg 901 atcactgaga ccatcccttc cacctcgctg gtcatccccc tgattggaga gtacctcctg 961 ttcaccatga tttttgtaac cttgtccatc gtcatcaccg tcttcgtgct caacgtgcac 1021 tacagaaccc cgacgacaca cacaatgccc tcatgggtga agactgtatt cttgaacctg 1081 ctccccaggg tcatgttcat gaccaggcca acaagcaacg agggcaacgc tcagaagccg 1141 aggcccctct acggtgccga gctctcaaat ctgaattgct tcagccgcgc agagtccaaa 1201 ggctgcaagg agggctaccc ctgccaggac gggatgtgtg gttactgcca ccaccgcagg 1261 ataaaaatct ccaatttcag tgctaacctc acgagaagct ctagttctga atctgttgat 1321 gctgtggtgt ccctctctgc tttgtcacca gaaatcaaag aagccatcca aagtgtcaag 1381 tatattgctg aaaatatgaa agcacaaaat gaagccaaag agattcaaga tgattggaag 1441 tatgttgcca tggtgattga tcgtattttt ctgtgggttt tcaccctggt gtgcattcta 1501 gggacagcag gattgtttct gcaacccctg atggccaggg aagatgcata agcactaagc 1561 tgtgtgcctg cctgggaaga cttc // LOCUS HSNRASR 2436 bp RNA PRI 12-SEP-1993 DEFINITION Human N-ras mRNA and flanking regions. ACCESSION X02751 NID g35102 KEYWORDS oncogene; ras oncogene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2436) AUTHORS Hall,A. and Brown,R. TITLE Human N-ras: cDNA cloning and gene structure JOURNAL Nucleic Acids Res. 13 (14), 5255-5268 (1985) MEDLINE 85269641 COMMENT Data kindly reviewed (26-JUN-1986) by R. Brown. FEATURES Location/Qualifiers source 1..2436 /organism="Homo sapiens" /db_xref="taxon:9606" repeat_region 296..301 /note="GGGCGG with pot. regulatory function" misc_feature 464..477 /note="palindrome" repeat_unit 464..470 /note="imp. inverted repeat A" repeat_unit 471..477 /note="imp. inverted repeat A" misc_feature 474..478 /note="transcription start site" misc_feature 484..488 /note="alternative transcription start site" repeat_region 493..498 /note="GGGCGG with pot. regulatory function" repeat_region 549..554 /note="GGGCGG with pot. regulatory function" repeat_region 591..596 /note="GGGCGG with pot. regulatory function" misc_feature 709..710 /note="exon 1/exon 2 boundary" CDS 727..1296 /note="N-ras gene product (aa 1-189)" /codon_start=1 /db_xref="PID:g35103" /db_xref="SWISS-PROT:P01111" /translation="MTEYKLVVVGAGGVGKSALTIQLIQNHFVDEYDPTIEDSYRKQV VIDGETCLLDILDTAGQEEYSAMRDQYMRTGEGFLCVFAINNSKSFADINLYREQIKR VKDSDDVPMVLVGNKCDLPTRTVDTKQAHELAKSYGIPFIETSAKTRQGVEDAFYTLV REIRQYRMKKLNSSDDGTQGCMGLPCVVM" misc_feature 837..838 /note="exon 2/exon 3 boundary" misc_feature 1016..1017 /note="exon 3/exon 4 boundary" misc_feature 1176..1177 /note="exon 4/exon 5 boundary" misc_feature 1300..1301 /note="exon 5/exon 6 boundary" misc_feature 1339..1340 /note="exon 6/exon 7 boundary" misc_feature 2416..2421 /note="pot. polyadenylation signal" polyA_site 2436 /note="polyadenylation site" BASE COUNT 670 a 477 c 543 g 746 t ORIGIN 1 ctgcagcttc taggacccgg tttcttttac tgatttaaaa acaaaacaaa aaaaaataaa 61 aaagttgtgc ctgaaatgaa tcttgttttt tttttataag tagccgcctg gttactgtgt 121 cctgtaaaat acagacattg acccttggtg tagcttctgt tcaactttat atcacgggaa 181 tggatgggtc tgatttcttg gccctcttct tgaattggcc atatacaggg tccctggcca 241 gtggactgaa ggctttgtct aagatgacaa gggtcagctc aggggatgtg ggggagggcg 301 gttttatctt cccccttgtc gtttgaggtt ttgatctctg ggtaaagagg ccgtttatct 361 ttgtaaacac gaaacatttt tgctttctcc agttttctgt taatggcgaa agaatggaag 421 cgaataaagt tttactgatt tttgagacac tagcacctag cgctttcatt attgaaacgt 481 cccgtgtggg aggggcgggt ctgggtgcgg ctgccgcatg actcgtggtt cggaggccca 541 cgtggccggg gcggggactc aggcgcctgg cagccgactg attacgtagc gggcggggcc 601 ggaagtgccg ctccttggtg ggggctgttc atggcggttc cggggtctcc aacatttttc 661 ccggtctgtg gtcctaaatc tgtccaaagc agaggcagtg gagcttgagg ttcttgctgg 721 tgtgaaatga ctgagtacaa actggtggtg gttggagcag gtggtgttgg gaaaagcgca 781 ctgacaatcc agctaatcca gaaccacttt gtagatgaat atgatcccac catagaggat 841 tcttacagaa aacaagtggt tatagatggt gaaacctgtt tgttggacat actggataca 901 gctggacaag aagagtacag tgccatgaga gaccaataca tgaggacagg cgaaggcttc 961 ctctgtgtat ttgccatcaa taatagcaag tcatttgcgg atattaacct ctacagggag 1021 cagattaagc gagtaaaaga ctcggatgat gtacctatgg tgctagtggg aaacaagtgt 1081 gatttgccaa caaggacagt tgatacaaaa caagcccacg aactggccaa gagttacggg 1141 attccattca ttgaaacctc agccaagacc agacagggtg ttgaagatgc tttttacaca 1201 ctggtaagag aaatacgcca gtaccgaatg aaaaaactca acagcagtga tgatgggact 1261 cagggttgta tgggattgcc atgtgtggtg atgtaacaag atacttttaa agttttgtca 1321 gaaaagagcc actttcaagc tgcactgaca ccctggtcct gacttcctgg aggagaagta 1381 ttcctgttgc tgtcttcagt ctcacagaga agctcctgct acttccccag ctctcagtag 1441 tttagtacaa taatctctat ttgagaagtt ctcagaataa ctacctcctc acttggctgt 1501 ctgaccagag aatgcacctc ttgttactcc ctgttatttt tctgccctgg gttcttccac 1561 agcacaaaca cacctcaaca cacctctgcc accccaggtt tttcatctga aaagcagttc 1621 atgtctgaaa cagagaacca aaccgcaaac gtgaaattct attgaaaaca gtgtcttgag 1681 ctctaaagta gcaactgctg gtgatttttt ttttcttttt actgttgaac ttagaactat 1741 gcctaatttt tggagaaatg tcataaatta ctgttttgcc aagaatatag ttattattgc 1801 tgtttggttt gtttataatg ttatcggctc tattctctaa actggcatct gctctagatt 1861 cataaataca aaaatgaata ctgaattttg agtctatcct agtcttcaca actttgacgt 1921 aattaaatcc aacttttcac agtgaagtgc ctttttccta gaagtggttt gtagactcct 1981 ttataatatt tcagtggaat agatgtctca aaaatcctta tgcatgaaat gaatgtctga 2041 gatacgtctg tgacttatct accattgaag gaaagctata tctatttgag agcagatgcc 2101 attttgtaca tgtatgaaat tggttttcca gaggcctgtt ttggggcttt cccaggagaa 2161 agatgaaact gaaagcatat gaataatttc acttaataat ttttacctaa tctccacttt 2221 tttcataggt tactacctat acaatgtatg taatttgttt cccctagctt actgataaac 2281 ctaatattca atgaacttcc atttgtattc aaatttgtgt cataccagaa agctctacat 2341 ttgcagatgt tcaaatattg taaaactttg gtgcattgtt atttaatagc tgtgatcagt 2401 gattttcaaa cctcaaatat agtatattaa caaatt // LOCUS HSNRD1 3647 bp RNA PRI 25-NOV-1997 DEFINITION H.sapiens mRNA for NRD1 convertase. ACCESSION X93209 NID g2462483 KEYWORDS NRD convertase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3647) AUTHORS Hospital,V., Prat,A., Joulie,C., Cherif,D., Day,R. and Cohen,P. TITLE Human and rat testis express two mRNA species encoding varients of NRD convertase, a metalloendopeptidase of the insulinase family JOURNAL Biochem. J. 327, 773-779 (1997) REFERENCE 2 (bases 1 to 3647) AUTHORS Cohen,P. TITLE Direct Submission JOURNAL Submitted (20-NOV-1995) P. Cohen, Universite P. & M. Curie CNRS, Lab de Biochimie des Signaux Regulateurs Cellulaires et Moleculaires, 96 Bd Raspail, 75006 Paris, FRANCE REMARK Revised by submittor 05-MAR-96 FEATURES Location/Qualifiers source 1..3647 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /tissue_type="testis" /map="p32" 5'UTR 1..108 gene 109..135 /gene="ORF1" CDS 109..135 /gene="ORF1" /codon_start=1 /db_xref="PID:e1187378" /db_xref="PID:g2462484" /translation="MSIAPAWW" CDS 136..3591 /EC_number="3.4.24.61" /codon_start=1 /product="NRD1 convertase" /db_xref="PID:e1187379" /db_xref="PID:g2462485" /translation="MLRRVTVAAVCATRRKLCEAGRDVAALWGIETRGRCEDSAAARP FPILAMPGRNKAKSTCSCPDLQPNGQDLGENSRVARLGADESEEEGRRGSLSNAGDPE IVKSPSDPKQYRYIKLQNGLQALLISDLSNMEGKTGNTTDDEEEEEVEEEEEDDDEDS GAEIEDDDEEGFDDEDEFDDEHDDDLDTEDNELEELEERAEARKKTTEKQSAAALCVG VGSFADPDDLPGLAHFLEHMVFMGSLKYPDENGFDAFLKKHGGSDNASTDCERTVFQF DVQRKYFKEALDRWAQFFIHPLMIRDAIDREVEAVDSEYQLARPSDANRKEMLFGSLA RPGHPMGKFFWGNAETLKHEPRKNNIDTHARLREFWMRYYSSHYMTLVVQSKETLDTL EKWVTEIFSQIPNNGLPRPNFGHLTDPFDTPAFNKLYRVVPIRKIHALTITWALPPQQ QHYRVKPLHYISWLVGHEGKGSILSFLRKKCWALALFGGNGETGFEQNSTYSVFSISI TLTDEGYEHFYEVAYTVFLYLKMLQKLGPEKRIFEEIRKIEDNEFHYQEQTDPVEYVE NMCENMQLYPLQDILTGDQLLFEYKPEVIGEALNQLVPQKANLVLLSGANEGKCDLKE KWFGTQYSIEDIENSWAELWNSNFELNPDLHLPAENKYIATDFTLKAFDCPETEYPVK IVNTPQGCLWYKKDNKFKIPKAYIRFHLISPLIQKSAANVVLFDIFVNILTHNLAEPA YEADVAQLEYKLAAGEHGLIIRVKGFNHKLPLLFQLIIDYLAEFNSTPAVFTMITEQL KKTYFNILIKPETLAKDVRLLILEYARWSMIDKYQALMDGLSLESLLSFVKEFKSQLF VEGLVQGNVTSTESMDFLKYVVDKLNFKPLEQEMPVQFQVVELPSGHHLCKVKALNKG DANSEVTVYYQSGTRSLREYTLMELLVMHMEEPCFDFLRTKQTLGYHVYPTCRNTSGI LGFSVTVGTQATKYNSEVVDKKIEEFLSSFEEKIENLTEEAFNTQVTALIKLKECEDT HLGEEVDRNWNEVVTQQYLFDRLAHEIEALKSFSKSDLVNWFKAHRGPGSKMLSVHVV GYGKYELEEDGSPSSEDSNSSCEVMQLTYLPTSPLLADCIIPITDIRAFTTTLNLLPY HKIVK" polyA_signal 3587..3592 /note="putative" polyA_signal 3591..3596 /note="putative" 3'UTR 3592..3647 BASE COUNT 1085 a 736 c 895 g 931 t ORIGIN 1 agactggggt gggggagggg ttcaggcctg ttccccgcgg ctgcggcagc accagggccg 61 gccgccaccg cctctagaac gcggaggagg tgggtcctgg gaagcgggat gtccatcgct 121 ccagcttggt ggtgaatgct gaggagagtc actgttgctg cagtctgtgc cacccggagg 181 aagttgtgtg aggccgggcg ggacgtcgcg gcgctctggg gaatcgaaac gcggggtcgg 241 tgcgaagact ctgctgctgc cagacccttt cctattctgg ccatgcctgg aaggaacaag 301 gcgaagtcta cctgcagctg ccctgacctg cagcccaatg gacaggatct gggcgagaac 361 agccgggttg cccgtctagg agcggatgaa tctgaggaag agggacggag ggggtctctc 421 agtaatgctg gggaccctga gatcgtcaag tctcccagcg accccaagca ataccgatac 481 atcaaattac agaatggcct acaggcactt ctgatttcag acctaagtaa tatggaaggt 541 aaaacaggaa atacaacaga tgatgaagaa gaagaggagg tggaggaaga agaagaagat 601 gatgatgaag attctggagc tgaaatagaa gatgacgatg aagagggttt tgatgatgaa 661 gatgagtttg atgatgaaca tgatgatgat cttgatactg aggataatga attggaagaa 721 ttagaagaga gagcagaagc tagaaaaaaa actactgaaa aacagtctgc agcggctctt 781 tgtgttggag ttgggagttt cgctgatcca gatgacctgc cggggctggc acactttttg 841 gagcacatgg tattcatggg tagtttgaaa tatccagatg agaatggatt tgatgccttc 901 ctgaagaagc atgggggtag tgataatgcc tcaactgatt gtgaacgcac tgtctttcag 961 tttgatgtcc agaggaagta cttcaaggaa gctcttgata gatgggcgca gttcttcatc 1021 cacccactaa tgatcagaga tgcaattgac cgtgaagttg aagctgttga tagtgaatat 1081 caacttgcaa ggccttctga tgcaaacaga aaggaaatgt tgtttggaag ccttgctaga 1141 cctggccatc ctatgggaaa atttttttgg ggaaatgctg agacgctcaa gcatgagcca 1201 agaaagaata atattgatac acatgctaga ttgagagaat tctggatgcg ttactactct 1261 tctcattaca tgactttagt ggttcaatcc aaagaaacac tggatacttt ggaaaagtgg 1321 gtgactgaaa tcttctctca gataccaaac aatgggttac ccagaccaaa ctttggccat 1381 ttaacggatc catttgacac accagcattt aacaaacttt atagagttgt tccaatcaga 1441 aaaattcatg ctctgaccat cacatgggca cttcctcctc aacagcaaca ttacagggtg 1501 aagccacttc attatatatc ctggctggtt ggacatgaag gcaaaggcag cattctttct 1561 ttccttagga aaaaatgctg ggctcttgca ctgtttggtg gaaatggtga gacaggattt 1621 gagcaaaatt ctacttattc agtgttcagc atttctatta cattgactga tgagggttat 1681 gaacattttt atgaggttgc ttacactgtc tttctgtatt taaaaatgct gcagaagcta 1741 ggcccagaaa aaagaatttt tgaagagatt cggaaaattg aggataatga atttcattac 1801 caagaacaga cagatccagt tgagtatgtg gaaaacatgt gtgagaacat gcagctgtac 1861 ccattgcagg acattctcac tggagatcag cttctttttg aatacaagcc agaagtcatt 1921 ggtgaagcct tgaatcagct agttcctcaa aaagcaaatc ttgttttact gtctggtgct 1981 aatgagggaa aatgtgacct caaggagaaa tggtttggaa ctcaatatag tatagaagat 2041 attgaaaact cttgggctga actgtggaat agtaatttcg aattaaatcc agatcttcat 2101 cttccagctg aaaacaagta catagccacg gactttacgt tgaaggcttt cgattgcccg 2161 gaaacagaat acccagttaa aattgtgaat actccacaag gttgcctgtg gtataagaaa 2221 gacaacaaat tcaaaatccc caaagcatat atacgtttcc atctaatttc accgttgata 2281 cagaaatctg cagcaaatgt ggtcctcttt gatatctttg tcaatatcct tacgcataac 2341 cttgcggaac cagcttatga agcagatgtg gcacagctgg agtataaact ggcagctgga 2401 gaacatggtt taattattcg agtgaaagga tttaaccaca aactacctct actgtttcag 2461 ctcattattg actacttagc tgagttcaat tccacaccag ctgtctttac aatgataact 2521 gagcagttga agaagaccta ctttaacatc ctcatcaagc ctgagacttt ggccaaagat 2581 gtacggcttt taatcttgga atatgcccgt tggtctatga ttgacaagta ccaggctttg 2641 atggacggcc tttcccttga gtctctgctg agcttcgtca aagaattcaa atcccagctc 2701 tttgtggagg gcctggtaca agggaatgtc acaagcacag aatctatgga tttcctgaaa 2761 tatgttgttg acaaactaaa cttcaagcct ctggagcagg agatgcctgt gcagttccag 2821 gtggtagagc tgcccagtgg ccaccatcta tgcaaagtga aagctctgaa caagggtgat 2881 gccaactctg aagtcactgt gtactaccag tcaggtacca ggagtctaag agaatatacg 2941 cttatggagc tgcttgtgat gcacatggaa gaaccttgtt ttgacttcct tcgaaccaag 3001 cagacccttg ggtaccatgt ctaccctacc tgtaggaaca catccgggat tctaggattt 3061 tctgtcactg tggggactca ggcaaccaaa tacaattctg aagttgttga taagaagata 3121 gaagagtttc tttctagctt tgaggagaag attgagaacc tcactgaaga ggcattcaac 3181 acccaggtca cagctctcat caagctgaag gagtgtgagg atacccacct tggggaggag 3241 gtggatagga actggaatga agtggttaca cagcagtacc tctttgaccg ccttgcccac 3301 gagattgaag cactgaagtc attctcaaaa tcagacctgg tcaactggtt caaggctcat 3361 agagggccag gaagtaaaat gctcagcgtt catgttgttg ggtatgggaa gtatgaactg 3421 gaagaggatg gatccccttc tagtgaggat tcaaattctt cttgtgaagt gatgcagctg 3481 acctacctgc caacctctcc tctgctggca gattgtatca tccccattac tgatatcagg 3541 gctttcacaa caacactcaa ccttctcccc taccataaaa tagtcaaata aataaactgc 3601 agtcacgttg gcctgaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaa // LOCUS HSNRD2 3851 bp RNA PRI 25-NOV-1997 DEFINITION H.sapiens mRNA for NRD2 convertase. ACCESSION X93207 NID g2462486 KEYWORDS NRD convertase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3851) AUTHORS Hospital,V., Prat,A., Joulie,C., Cherif,D., Day,R. and Cohen,P. TITLE Human and rat testis express two mRNA species encoding varients of NRD convertase, a metalloendopeptidase of the insulinase family JOURNAL Biochem. J. 327, 773-779 (1997) REFERENCE 2 (bases 1 to 3851) AUTHORS Cohen,P. TITLE Direct Submission JOURNAL Submitted (20-NOV-1995) P. Cohen, Universite P. & M. Curie CNRS, Lab de Biochimie des Signaux Regulateurs Cellulaires et Moleculaires, 96 Bd Raspail, 75006 Paris, FRANCE REMARK Revised by submittor 05-MAR-96 FEATURES Location/Qualifiers source 1..3851 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /tissue_type="testis" /map="p32" 5'UTR 1..108 gene 109..135 /gene="ORF1" CDS 109..135 /gene="ORF1" /codon_start=1 /db_xref="PID:e1187380" /db_xref="PID:g2462487" /translation="MSIAPAWW" CDS 136..3795 /EC_number="3.4.24.61" /codon_start=1 /product="NRD2 convertase" /db_xref="PID:e1187381" /db_xref="PID:g2462488" /translation="MLRRVTVAAVCATRRKLCEAGRDVAALWGIETRGRCEDSAAARP FPILAMPGRNKAKSTCSCPDLQPNGQDLGENSRVARLGADESEEEGRRGSLSNAGDPE IVKSPSDPKQYRYIKLQNGLQALLISDLSNMEGKTGNTTDDEEEEEVEEEEEDDDEDS GAEIEDDDEEGFDDEDEFDDEHDDDLDTEDNELEELEERAEARKKTTEKQQLQSLFLL WSKLTDRLWFKSTYSKMSSTLLVETRNLYGVVGAESRSAPVQHLAGWQAEEQQGETDT VLSAAALCVGVGSFADPDDLPGLAHFLEHMVFMGSLKYPDENGFDAFLKKHGGSDNAS TDCERTVFQFDVQRKYFKEALDRWAQFFIHPLMIRDAIDREVEAVDSEYQLARPSDAN RKEMLFGSLARPGHPMGKFFWGNAETLKHEPRKNNIDTHARLREFWMRYYSSHYMTLV VQSKETLDTLEKWVTEIFSQIPNNGLPRPNFGHLTDPFDTPAFNKLYRVVPIRKIHAL TITWALPPQQQHYRVKPLHYISWLVGHEGKGSILSFLRKKCWALALFGGNGETGFEQN STYSVFSISITLTDEGYEHFYEVAYTVFLYLKMLQKLGPEKRIFEEIRKIEDNEFHYQ EQTDPVEYVENMCENMQLYPLQDILTGDQLLFEYKPEVIGEALNQLVPQKANLVLLSG ANEGKCDLKEKWFGTQYSIEDIENSWAELWNSNFELNPDLHLPAENKYIATDFTLKAF DCPETEYPVKIVNTPQGCLWYKKDNKFKIPKAYIRFHLISPLIQKSAANVVLFDIFVN ILTHNLAEPAYEADVAQLEYKLAAGEHGLIIRVKGFNHKLPLLFQLIIDYLAEFNSTP AVFTMITEQLKKTYFNILIKPETLAKDVRLLILEYARWSMIDKYQALMDGLSLESLLS FVKEFKSQLFVEGLVQGNVTSTESMDFLKYVVDKLNFKPLEQEMPVQFQVVELPSGHH LCKVKALNKGDANSEVTVYYQSGTRSLREYTLMELLVMHMEEPCFDFLRTKQTLGYHV YPTCRNTSGILGFSVTVGTQATKYNSEVVDKKIEEFLSSFEEKIENLTEEAFNTQVTA LIKLKECEDTHLGEEVDRNWNEVVTQQYLFDRLAHEIEALKSFSKSDLVNWFKAHRGP GSKMLSVHVVGYGKYELEEDGSPSSEDSNSSCEVMQLTYLPTSPLLADCIIPITDIRA FTTTLNLLPYHKIVK" polyA_signal 3791..3796 /note="putative" 3'UTR 3793..3851 polyA_signal 3795..3800 /note="putative" BASE COUNT 1139 a 774 c 953 g 985 t ORIGIN 1 agactggggt gggggagggg ttcaggcctg ttccccgcgg ctgcggcagc accagggccg 61 gccgccaccg cctctagaac gcggaggagg tgggtcctgg gaagcgggat gtccatcgct 121 ccagcttggt ggtgaatgct gaggagagtc actgttgctg cagtctgtgc cacccggagg 181 aagttgtgtg aggccgggcg ggacgtcgcg gcgctctggg gaatcgaaac gcggggtcgg 241 tgcgaagact ctgctgctgc cagacccttt cctattctgg ccatgcctgg aaggaacaag 301 gcgaagtcta cctgcagctg ccctgacctg cagcccaatg gacaggatct gggcgagaac 361 agccgggttg cccgtctagg agcggatgaa tctgaggaag agggacggag ggggtctctc 421 agtaatgctg gggaccctga gatcgtcaag tctcccagcg accccaagca ataccgatac 481 atcaaattac agaatggcct acaggcactt ctgatttcag acctaagtaa tatggaaggt 541 aaaacaggaa atacaacaga tgatgaagaa gaagaggagg tggaggaaga agaagaagat 601 gatgatgaag attctggagc tgaaatagaa gatgacgatg aagagggttt tgatgatgaa 661 gatgagtttg atgatgaaca tgatgatgat cttgatactg aggataatga attggaagaa 721 ttagaagaga gagcagaagc tagaaaaaaa actactgaaa aacagcaatt gcagagcctg 781 tttttgctgt ggtcaaagct gactgataga ctgtggttta agtcaactta ttcaaaaatg 841 tcttcaaccc tgctggtcga gacaagaaat ctttatgggg tagttggagc tgaaagcagg 901 tctgcacctg ttcagcattt ggcaggatgg caagcggagg agcagcaggg tgaaactgac 961 acagttctgt ctgcagcggc tctttgtgtt ggagttggga gtttcgctga tccagatgac 1021 ctgccggggc tggcacactt tttggagcac atggtattca tgggtagttt gaaatatcca 1081 gatgagaatg gatttgatgc cttcctgaag aagcatgggg gtagtgataa tgcctcaact 1141 gattgtgaac gcactgtctt tcagtttgat gtccagagga agtacttcaa ggaagctctt 1201 gatagatggg cgcagttctt catccaccca ctaatgatca gagatgcaat tgaccgtgaa 1261 gttgaagctg ttgatagtga atatcaactt gcaaggcctt ctgatgcaaa cagaaaggaa 1321 atgttgtttg gaagccttgc tagacctggc catcctatgg gaaaattttt ttggggaaat 1381 gctgagacgc tcaagcatga gccaagaaag aataatattg atacacatgc tagattgaga 1441 gaattctgga tgcgttacta ctcttctcat tacatgactt tagtggttca atccaaagaa 1501 acactggata ctttggaaaa gtgggtgact gaaatcttct ctcagatacc aaacaatggg 1561 ttacccagac caaactttgg ccatttaacg gatccatttg acacaccagc atttaacaaa 1621 ctttatagag ttgttccaat cagaaaaatt catgctctga ccatcacatg ggcacttcct 1681 cctcaacagc aacattacag ggtgaagcca cttcattata tatcctggct ggttggacat 1741 gaaggcaaag gcagcattct ttctttcctt aggaaaaaat gctgggctct tgcactgttt 1801 ggtggaaatg gtgagacagg atttgagcaa aattctactt attcagtgtt cagcatttct 1861 attacattga ctgatgaggg ttatgaacat ttttatgagg ttgcttacac tgtctttctg 1921 tatttaaaaa tgctgcagaa gctaggccca gaaaaaagaa tttttgaaga gattcggaaa 1981 attgaggata atgaatttca ttaccaagaa cagacagatc cagttgagta tgtggaaaac 2041 atgtgtgaga acatgcagct gtacccattg caggacattc tcactggaga tcagcttctt 2101 tttgaataca agccagaagt cattggtgaa gccttgaatc agctagttcc tcaaaaagca 2161 aatcttgttt tactgtctgg tgctaatgag ggaaaatgtg acctcaagga gaaatggttt 2221 ggaactcaat atagtataga agatattgaa aactcttggg ctgaactgtg gaatagtaat 2281 ttcgaattaa atccagatct tcatcttcca gctgaaaaca agtacatagc cacggacttt 2341 acgttgaagg ctttcgattg cccggaaaca gaatacccag ttaaaattgt gaatactcca 2401 caaggttgcc tgtggtataa gaaagacaac aaattcaaaa tccccaaagc atatatacgt 2461 ttccatctaa tttcaccgtt gatacagaaa tctgcagcaa atgtggtcct ctttgatatc 2521 tttgtcaata tccttacgca taaccttgcg gaaccagctt atgaagcaga tgtggcacag 2581 ctggagtata aactggcagc tggagaacat ggtttaatta ttcgagtgaa aggatttaac 2641 cacaaactac ctctactgtt tcagctcatt attgactact tagctgagtt caattccaca 2701 ccagctgtct ttacaatgat aactgagcag ttgaagaaga cctactttaa catcctcatc 2761 aagcctgaga ctttggccaa agatgtacgg cttttaatct tggaatatgc ccgttggtct 2821 atgattgaca agtaccaggc tttgatggac ggcctttccc ttgagtctct gctgagcttc 2881 gtcaaagaat tcaaatccca gctctttgtg gagggcctgg tacaagggaa tgtcacaagc 2941 acagaatcta tggatttcct gaaatatgtt gttgacaaac taaacttcaa gcctctggag 3001 caggagatgc ctgtgcagtt ccaggtggta gagctgccca gtggccacca tctatgcaaa 3061 gtgaaagctc tgaacaaggg tgatgccaac tctgaagtca ctgtgtacta ccagtcaggt 3121 accaggagtc taagagaata tacgcttatg gagctgcttg tgatgcacat ggaagaacct 3181 tgttttgact tccttcgaac caagcagacc cttgggtacc atgtctaccc tacctgtagg 3241 aacacatccg ggattctagg attttctgtc actgtgggga ctcaggcaac caaatacaat 3301 tctgaagttg ttgataagaa gatagaagag tttctttcta gctttgagga gaagattgag 3361 aacctcactg aagaggcatt caacacccag gtcacagctc tcatcaagct gaaggagtgt 3421 gaggataccc accttgggga ggaggtggat aggaactgga atgaagtggt tacacagcag 3481 tacctctttg accgccttgc ccacgagatt gaagcactga agtcattctc aaaatcagac 3541 ctggtcaact ggttcaaggc tcatagaggg ccaggaagta aaatgctcag cgttcatgtt 3601 gttgggtatg ggaagtatga actggaagag gatggatccc cttctagtga ggattcaaat 3661 tcttcttgtg aagtgatgca gctgacctac ctgccaacct ctcctctgct ggcagattgt 3721 atcatcccca ttactgatat cagggctttc acaacaacac tcaaccttct cccctaccat 3781 aaaatagtca aataaataaa ctgcagtcac gttggcctga aaaaaaaaaa aaaaaaaaaa 3841 aaaaaaaaaa a // LOCUS HSNRNPE 500 bp RNA PRI 28-JAN-1995 DEFINITION Human mRNA for snRNP E protein. ACCESSION X12466 X13772 NID g35104 KEYWORDS small nuclear ribonucleoprotein E; small nuclear RNA. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 500) AUTHORS Wieben,E.D. TITLE Direct Submission JOURNAL Submitted (25-JUL-1988) Wieben E.D., Mayo Clinic, 200 First Street South West, Rochester, MN 55905, USA REFERENCE 2 (bases 1 to 500) AUTHORS Stanford,D.R., Kehl,M., Perry,C.A., Holicky,E.L., Harvey,S.E., Rohleder,A.M., Rehder,K. Jr., Luhrmann,R. and Wieben,E.D. TITLE The complete primary structure of the human snRNP E protein JOURNAL Nucleic Acids Res. 16 (22), 10593-10605 (1988) MEDLINE 89083484 COMMENT The sequence overlaps with that reported by Stanford et al. in J.Biol.Chem. 262:9931-9934(1987), M15919. FEATURES Location/Qualifiers source 1..500 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="teratoma" /clone_lib="lambda gt10" /clone="p11HB1" CDS 46..324 /note="snRNP E protein (AA 1-92)" /codon_start=1 /db_xref="PID:g35105" /db_xref="SWISS-PROT:P08578" /translation="MAYRGQGQKVQKVMVQPINLIFRYLQNRSRIQVWLYEQVNMRIE GCIIGFDEYMNLVLDDAEEIHSKTKSRKQLGRIMLKGDNITLLQSVSN" misc_feature 444..449 /note="polyA signal" misc_feature 468..473 /note="alt. polyA signal" polyA_site 492 /note="polyA site" BASE COUNT 159 a 79 c 112 g 150 t ORIGIN 1 gctctcagag gcagcgtgcg ggtgtgctct ttgtgaaatt ccaccatggc gtaccgtggc 61 cagggtcaga aagtgcagaa ggttatggtg cagcccatca acctcatctt cagatactta 121 caaaatagat cgcggattca ggtgtggctc tatgagcaag tgaatatgcg gatagaaggc 181 tgtatcattg gttttgatga gtatatgaac cttgtattag atgatgcaga agagattcat 241 tctaaaacaa agtcaagaaa acaactgggt cggatcatgc taaaaggaga taatattact 301 ctgctacaaa gtgtctccaa ctagaaatga tcaatgaagt gagaaattgt tgagaaggat 361 acagtttgtt tttagatgtc ctttgtccaa tgtgaacatt tattcatatt gttttgatta 421 ccctcgtgtt actacaagat ggcaataaat actatgggat tgtttgtatt aaaaaattta 481 cattgcttct taaaaaaaaa // LOCUS HSNUACBIP 1558 bp RNA PRI 27-APR-1994 DEFINITION H.sapiens mRNA for nucleic acid binding protein sub2.3. ACCESSION Z29505 NID g444020 KEYWORDS nucleic acid binding protein; sub2.3 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1558) AUTHORS Aasheim,H.C., Loukianova,T., Deggerdal,A. and Smeland,E.B. TITLE Tissue specific expression and cDNA structure of a human transcript encoding a nucleic acid binding [oligo(dC)] protein related to the pre-mRNA binding protein K JOURNAL Nucleic Acids Res. 22 (6), 959-964 (1994) MEDLINE 94203810 REFERENCE 2 (bases 1 to 1558) AUTHORS Aasheim,H. TITLE Direct Submission JOURNAL Submitted (18-JAN-1994) Aasheim H., The Norwegian Radium Hospital, Department of Immunology, Ullernchausseen 70, Oslo, Norway, N-0310 FEATURES Location/Qualifiers source 1..1558 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="sub2.3" /dev_stage="adult" /tissue_type="peripheral blood leucocyte" /cell_type="TPA stimulated T lymphocytes" /germline gene 86..985 /gene="sub2.3" CDS 86..985 /gene="sub2.3" /function="nucleic acid binding protein, oligo(dC)" /codon_start=1 /product="sub2.3" /db_xref="PID:g444021" /translation="MDAGVTESGLNVTLTIRLLMHGKEVGSIIGKKGESVKRIREESG ARINISEGNCPERIITLTGPTNAIFKAFAMIIDKLEEDINSSMTNSTAASRPPVTLRL VVPATQCGSLIGKGGCKIKEIRESTGAQVQVAGDMLPNSTERAITIAGVPQSVTECVK QICLVMLETLSQSPQGRVMTIPYQPMPASSPVICAGGQDRCSDAAGYPHATHDLEGPP LDAYSIQGQHTISPLDLAKLNQVARQQSHFAMMHGGTGFAGIDSSSPEVKGYWASLDA STQTTHELTIPNNLIGCIIGRQH" BASE COUNT 400 a 405 c 384 g 369 t ORIGIN 1 cctacacctc ccctcccccc gccagccgcc aaagacttga ccacgtaacg agcccaactc 61 ccccgaacgc cgcccgccgc tcgccatgga tgccggtgtg actgaaagtg gactaaatgt 121 gactctcacc attcggcttc ttatgcacgg aaaggaagta ggaagcatca ttgggaagaa 181 aggggagtcg gttaagagga tccgcgagga gagtggcgcg cggatcaaca tctcggaggg 241 gaattgtccg gagagaatca tcactctgac cggccccacc aatgccatct ttaaggcttt 301 cgctatgatc atcgacaagc tggaggaaga tatcaacagc tccatgacca acagtaccgc 361 ggccagcagg cccccggtca ccctgaggct ggtggtgccg gccacccagt gcggctccct 421 gattgggaaa ggcgggtgta agatcaaaga gatccgcgag agtacggggg cgcaggtcca 481 ggtggcgggg gatatgctgc ccaactccac cgagcgggcc atcaccatcg ctggcgtgcc 541 gcagtctgtc accgagtgtg tcaagcagat ttgcctggtc atgctggaga cgctctccca 601 gtctccgcaa gggagagtca tgaccattcc gtaccagccc atgccggcca gctccccagt 661 catctgcgcg ggcggccaag atcggtgcag cgacgctgcg ggctaccccc atgccaccca 721 tgacctggag ggaccacctc tagatgccta ctcgattcaa ggacaacaca ccatttctcc 781 gctcgatctg gccaagctga accaggtggc aagacaacag tctcactttg ccatgatgca 841 cggcgggacc ggattcgccg gaattgactc cagctctcca gaggtgaaag gctattgggc 901 aagtttggat gcatctactc aaaccaccca tgaactcacc attccaaata acttaattgg 961 ctgcataatc gggcgccaac attaatgaga tccgccagat gtccggggcc cagatcaaaa 1021 ttgccaaccc agtggaaggc tcctctggta ggcaggttac tatcactggc tctgctgcca 1081 gtattagtct ggcccagtat ctaatcaatg ccaggctttc ctctgagaag ggcatggggt 1141 gcagctagaa cagtgtaggt tccctcaata acccctttct gctgttctcc catgatccaa 1201 ctgtgtaatt tctggtcagt gattccaggt tttaaataat ttgtaagtgt tcaagtttct 1261 acacaacttt atcatccgct aagaatttaa aaatcacatt ctctgttcag ctgttaatgc 1321 tgggatccat attagtttta taagcttttc cctgttttta gttttgtttt gggttttttg 1381 gctcatgaat tttatttctg tttgtcgata agaaatgtaa gagtggaatg ttaataaatt 1441 tcagtttagt tctgtaatgt caagaattta agaattaaaa aacggattgg ttaaaaaatg 1501 cttcatattt gaaaaagctg ggaattgctg tcttaaaaaa aaaaaaaaaa aaaaaaaa // LOCUS HSNUCPP 2502 bp RNA PRI 01-JUN-1995 DEFINITION H.sapiens mRNA for nucleolar phosphoprotein p130. ACCESSION Z34289 NID g663007 KEYWORDS nucleolar phosphoprotein; nucleolar phosphoprotein p130; nucleologenesis. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2502) AUTHORS Pai,C., Chen,H., Sheu,H. and Yeh,N. TITLE Cell cycle-dependent alterations of a highly phosphorylated nucleolar protein p130 are associated with nucleologenesis JOURNAL J. Cell Sci. 108, 1911-1920 (1995) REFERENCE 2 (bases 1 to 2502) AUTHORS Yeh,N. TITLE Direct Submission JOURNAL Submitted (08-JUN-1994) Ning-Hsing Yeh, Graduate Inst. Microbiol. and Immunol., National, Yang-Ming Medical College, 155 Li-Long Street, Section 2, Taipei, Taiwan, 11221, Republic of China FEATURES Location/Qualifiers source 1..2502 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="clone CP130-51" /tissue_type="leukemia" /cell_type="myeloblast" /cell_line="HL60" /clone_lib="HL60 cDNA 5'stretch in lambda gt11, Clontech cat#HL1119b" CDS 55..2154 /function="nucleologenesis" /citation=[1] /codon_start=1 /product="nucleolar phosphoprotein p130" /db_xref="PID:g663008" /translation="MADAGIRRVVPSDLYPLVLGFLRDNQLSEVANKFAKATGATQQD ANASSLLDIYSFWLKSAKVPERKLQANGPVAKKAKKKASSSDSEDSSEEEEEVQGPPA KKAAVPAKRVGLPPGKAAAKASESSSSEESRDDDDEEDQKKQPVQKGVKPQAKAAKAP PKKAKSSDSDSDSSSEDEPPKNQKPKITPVTVKAQTKAPPKPARAAPKIANGKAASSS SSSSSSSSSDDSEEEKAAATPKKTVPKKQVVAKAPVKAATTPTRKSSSSEDSSSDEEE EQKKPMKNKPGPYSYAPPPSAPPPKKSLGTQPPKKAVEKQQPVESSEDSSDESDSSSE EEKKPPTKAVVSKATTKPPPAKKAAESSSDSSDSDSSEDDEAPSKPAGTTKNSSNKPA VTTKSPAVKPAAAPKQPVGGGQKLLTRKADSSSSEEESSSSEEEKTKKMVATTKPKAT AKAALSLPAKQAPQGSRDSSSDSDSSSSEEEEEKTSKSAVKKKPQKVAGGAAPSKPAS AKKGKAESSNSSSSDDSSEEEEEKLKGKGSPRPQAPKANGTSALTAQNGKAAKNSEEE EEEKKKAAVVVSKSGSLKKRKQNEAAKEAETPQAKKIKLQTPNTFPKRKKGEKRASSP FRRVREEEIEVDSRVADNSFDAKRGAAGDWGERANQVLKFTKGKSFRHEKTKKKRGSY RGGSISVQVNSIKFDSE" BASE COUNT 772 a 607 c 669 g 454 t ORIGIN 1 gaattcgtgg gtcgtgctgc gtcgacaacg gtagtgacgc gtattgcctg gaggatggcg 61 gacgccggca ttcgccgcgt ggttcccagc gacctgtatc ccctcgtgct cggcttcctg 121 cgcgataacc aactctcaga ggtggccaat aagttcgcca aagcgacagg agctacacag 181 caggatgcca atgcctcttc cctcttagac atctatagct tctggctcaa gtctgccaag 241 gtcccagagc gaaagttaca ggcaaatgga ccagtggcta agaaagctaa gaagaaggcc 301 tcatccagtg acagtgagga cagcagcgag gaggaggagg aagttcaagg gcctccagca 361 aagaaggctg ctgtacctgc caagcgagtc ggtctgcctc ctgggaaggc tgcagccaaa 421 gcatcagaga gtagcagcag tgaagagtcc agagatgatg atgatgagga ggaccaaaag 481 aaacagcctg tccagaaggg agttaagccc caagccaagg cagccaaagc tcctcctaag 541 aaggccaaga gctctgattc tgattctgac tcaagctccg aggatgagcc accaaagaac 601 cagaagccaa agataacacc tgtgacagtt aaagctcaga ctaaagcccc tcccaaacca 661 gctcgagcag cacctaaaat agccaatggt aaagcagcca gtagcagcag tagcagcagc 721 agcagcagta gcagtgatga ctcagaggag gagaaggcag cagccacccc caagaagact 781 gtacctaaaa agcaagttgt ggccaaagcc ccagtgaaag cagctaccac ccctacccgg 841 aagagttcta gcagtgagga ttcctccagt gacgaggaag aggagcaaaa aaaacccatg 901 aaaaataaac caggtcccta cagttacgcc cccccgcctt ctgctccccc accaaagaag 961 tctctgggaa cccagcctcc caagaaggct gtggagaagc agcagcctgt ggaaagcagt 1021 gaagacagca gtgatgagtc tgattcaagt tctgaagaag agaagaaacc cccaactaag 1081 gcagtagtct ctaaagcaac cactaaacca cctccagcaa agaaagcagc agagagctct 1141 tcagacagct cagactctga cagctctgag gatgatgaag ctccttctaa gccagctggt 1201 accaccaaga attcttcaaa taagccagct gtcaccacca agtcacctgc agtgaagcca 1261 gctgcagccc ccaagcaacc tgtgggcggt ggccagaagc ttctgacgag aaaggctgac 1321 agcagctcca gcgaggaaga gagcagctcc agtgaggagg agaagacaaa gaagatggtg 1381 gccaccacta agcccaaggc gactgccaaa gcagctctat ctctgcctgc caagcaggct 1441 cctcagggta gtagggacag cagctctgat tcagacagct ccagcagtga ggaggaggaa 1501 gagaagacat ctaagtctgc agttaagaag aagccacaga aggtagcagg aggtgcagcc 1561 ccttccaagc cagcctctgc aaagaaagga aaggctgaga gcagcaacag ttcttcttct 1621 gatgactcca gtgaggaaga ggaagagaag ctcaagggca agggctctcc aagaccacaa 1681 gcccccaagg ccaatggcac ctctgcactg actgcccaga atggaaaagc agctaagaac 1741 agtgaggagg aggaagaaga aaagaaaaag gcggcagtgg tagtttccaa atcaggttca 1801 ttaaagaagc ggaagcagaa tgaggctgcc aaggaggcag agactcctca ggccaagaag 1861 ataaagcttc agacccctaa cacatttcca aaaaggaaga aaggagaaaa aagggcatca 1921 tccccattcc gaagggtcag ggaggaggaa attgaggtgg attcacgagt tgcggacaac 1981 tcctttgatg ccaagcgagg tgcagccgga gactggggag agcgagccaa tcaggttttg 2041 aagttcacca aaggcaagtc ctttcggcat gagaaaacca agaagaagcg gggcagctac 2101 cggggaggct caatctctgt ccaggtcaat tctattaagt ttgacagcga gtgacctgag 2161 gccatcttcg gtgaagcaag ggtgatgatc ggagactact tactttctcc agtggacctg 2221 ggaaccctca ggtctctagg tgagggtctt gatgaggaca gaagtttaga gtaggtccta 2281 agactttaca gtgtaacatc ctctctggtc cttttctgtg ttcctagttt tgtacagact 2341 tgtttttgag tgttgagtag cagggacaaa ataagggaat gttatttttt aagaaaattc 2401 attttcattg ttgtctcctt ccttttctgt gaaagtcctc atactgagaa atttgtatat 2461 tttatattaa atcacttact attgaaaaaa aaaaaggaat tc // LOCUS HSNUMAMRB 7217 bp RNA PRI 09-APR-1992 DEFINITION H.sapiens mRNA for NuMA protein. ACCESSION Z11584 NID g35120 KEYWORDS NuMA protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7217) AUTHORS Compton,D.A., Szilak,I. and Cleveland,D.W. TITLE Primary structure of NuMA, an intranuclear protein that defines a novel pathway for segregation of proteins at mitosis JOURNAL J. Cell Biol. 116 (6), 1395-1408 (1992) MEDLINE 92176238 REFERENCE 2 (bases 1 to 7217) AUTHORS Cleveland,D.W. TITLE Direct Submission JOURNAL Submitted (17-JAN-1992) Don W. Cleveland Ph.D., Biological Chemistry, Johns Hopkins, University School of Medicine, 725 N. Wolfe St., Baltimore, Maryland, 21205, USA FEATURES Location/Qualifiers source 1..7217 /organism="Homo sapiens" /db_xref="taxon:9606" 5'UTR 1..258 CDS 259..6564 /codon_start=1 /product="NuMA protein" /db_xref="PID:g35121" /translation="MTLHATRGAALLSWVNSLHVADPVEAVLQLQDCSIFIKIIDRIH GTEEGQQILKQPVSERLDFVCSFLQKNRKHPSSPECLVSAQKVLEGSELELAKMTMLL LYHSTMSSKSPRDWEQFEYKIQAELAVILKFVLDHEDGLNLNEDLENFLQKAPVPSTC SSTFPEELSPPSHQAKREIRFLELQKVASSSSGNNFLSGSPASPMGDILQTPQFQMRR LKKQLADERSNRDELELELAENRKLLTEKDAQIAMMQQRIDRLALLNEKQAASPLEPK ELEELRDKNESLTMRLHETLKQCQDLKTEKSQMDRKINQLSEENGDLSFKLREFASHL QQLQDALNELTEEHSKATQEWLEKQAQLEKELSAALQDKKCLEEKNEILQGKLSQLEE HLSQLQDNPPQEKGEVLGDVLQLETLKQEAATLAANNTQLQARVEMLETERGQQEAKL LAERGHFEEEKQQLSSLITDLQSSISNLSQAKEELEQASQAHGARLTAQVASLTSELT TLNATIQQQDQELAGLKQQAKEKQAQLAQTLQQQEQASQGLRHQVEQLSSSLKQKEQQ LKEVAEKQEATRQDHAQQLATAAEEREASLRERDAALKQLEALEKEKAAKLEILQQQL QVANEARDSAQTSVTQAQREKAELSRKVEELQACVETARQEQHEAQAQVAELELQLRS EQQKATEKERVAQEKDQLQEQLQALKESLKVTKGSLEEEKRRAADALEEQQRCISELK AETRSLVEQHKRERKELEEERAGRKGLEARLLQLGEAHQAETEVLRRELAEAMAAQHT AESECEQLVKEVAAWRDGYEDSQQEEAQYGAMFQEQLMTLKEECEKARQELQEAKEKV AGIESHSELQISRQQNKLAELHANLARALQQVQEKEVRAQKLADDLSTLQEKMAATSK EVARLETLVRKAGEQQETASRELVKEPARAGDRQPEWLEEQQGRQFCSTQAALQAMER EAEQMGNELERLRAALMESQGQQQEERGQQEREVARLTQERGRAQADLALEKAARAEL EMRLQNALNEQRVEFATLQEALAHALTEKEGKDQELAKLRGLEAAQIKELEELRQTVK QLKEQLAKKEKEHASGSGAQSEAAGRTEPTGPKLEALRAEVSKLEQQCQKQQEQADSL ERSLEAERASRAERDSALETLQGQLEEKAQELGHSQSALASAQRELAAFRTKVQDHSK AEDEWKAQVARGRQEAERKNSLISSLEEEVSILNRQVLEKEGESKELKRLVMAESEKS QKLEESCACCRQRQPATVPELQNAALLCGRRCRASGREAEKQRVASENLRQELTSQAE RAEELGQELKAWQEKFFQKEQALSTLQLEHTSTQALVSELLPAKHLCQQLQAEQAAAE KRHREELEQSKQAAGGLRAELLRAQRELGELIPLRQKVAEQERTAQQLRAEKASYAEQ LSMLKKAHGLLAEENRGLGERANLGRQFLEVELDQAREKYVQELAAVRADAETRLAEV QREAQSTARELEVMTAKYEGAKVKVLEERQRFQEERQKLTAQVEELSKKLADSDQASK VQQQKLKAVQAQGGESQQEAQRFQAQLNELQAQLSQKEQAAEHYKLQMEKAKTHYDAK KQQNQELQEQLRSLEQLQKENKELRAEAERLGHELQQAGLKTKEAEQTCRHLTAQVRS LEAQVAHADQQLRDLGKFQVATDALKSREPQAKPQLDLSIDSLDLSCEEGTPLSITSK LPRTQPDGTSVPGEPASPISQRLPPKVESLESLYFTPIPARSQAPLESSLDSLGDVFL DSGRKTRSARRRTTQIINITMTKKLDVEEPDSANSSFYSTRSAPASQASLRATSSTQS LARLGSPDYGNSALLSLPGYRPTTRSSARRSQAGVSSGAPPGRNSFYMGTCQDEPEQL DDWNRIAELQQRNRVCPPHLKTCYPLESRPSLSLGTITDEEMKTGDPQETLRRASMQP IQIAEGTGITTRQQRKRVSLEPHQGPGTPESKKATSCFPRPMTPRDRHEGRKQSTTEA QKKAAPASTKQADRRQSMAFSILNTPKKLGNSLLRRGASKKALSKASPNTRSGTRRSP RIATTTASAATAAAIGATPRAKGKAKH" 3'UTR 6562..7217 polyA_signal 7202..7207 BASE COUNT 1793 a 2012 c 2274 g 1138 t ORIGIN 1 gcccacgaag aggtacgatt ccggagaatc gcgaggcaga gcgggagcgc gcagccaggt 61 ggaaactaat tctaagccag actgctggag atcaccctgt tctagtgtgt ggaggcttcc 121 accaggagtc tggagtgcaa tggcacgatc tcggctcact gcaacctcca cctcccaggt 181 tcaagcgatt ctcctgcctc agcctcccaa gtagctggga ttacaggcgc attggagtga 241 ctgtctggca tcaccaagat gacactccac gccacccggg gggctgcact cctctcttgg 301 gtgaacagtc tacacgtggc tgaccctgtg gaggctgtgc tgcagctcca ggactgcagc 361 atcttcatca agatcattga cagaatccat ggcactgaag agggacagca aatcttgaag 421 cagccggtgt cagagagact ggactttgtg tgcagttttc tgcagaaaaa tcgaaaacat 481 ccctcttccc cagaatgcct ggtatctgca cagaaggtgc tagagggatc agagctggaa 541 ctggcgaaga tgaccatgct gctcttatac cactctacca tgagctccaa aagtcccagg 601 gactgggaac agtttgaata taaaattcag gctgagttgg ctgtcattct taaatttgtg 661 ctggaccatg aggacgggct aaaccttaat gaggacctag agaacttcct acagaaagct 721 cctgtgcctt ctacctgttc tagcacattc cctgaagagc tctccccacc tagccaccag 781 gccaagaggg agattcgctt cctagagcta cagaaggttg cctcctcttc cagtgggaac 841 aactttctct caggttctcc agcttctccc atgggtgata tcctgcagac cccacagttc 901 cagatgagac ggctgaagaa gcagcttgct gatgagagaa gtaataggga tgagctggag 961 ctggagctag ctgagaaccg caagctcctc accgagaagg atgcacagat agccatgatg 1021 cagcagcgca ttgaccgcct agccctgctg aatgagaagc aggcggccag cccactggag 1081 cccaaggagc ttgaggagct gcgtgacaag aatgagagcc ttaccatgcg gctgcatgaa 1141 accctgaagc agtgccagga cctgaagaca gagaagagcc agatggatcg caaaatcaac 1201 cagctttcgg aggagaatgg agacctttcc tttaagctgc gggagtttgc cagtcatctg 1261 cagcagctac aggatgccct caatgagctg acggaggagc acagcaaggc cactcaggag 1321 tggctagaga agcaggccca gctggagaag gagctcagcg cagccctgca ggacaagaaa 1381 tgccttgaag agaagaacga aatccttcag ggaaaacttt cacagctgga agaacacttg 1441 tcccagctgc aggataaccc accccaggag aagggcgagg tgctgggtga tgtcttgcag 1501 ctggaaacct tgaagcaaga ggcagccact cttgctgcaa acaacacaca gctccaagcc 1561 agggtagaga tgctggagac tgagcgaggc cagcaggaag ccaagctgct tgctgagcgg 1621 ggccacttcg aagaagaaaa gcagcagctg tctagcctga tcactgacct gcagagctcc 1681 atctccaacc tcagccaggc caaggaagag ctggagcagg cctcccaggc tcatggggcc 1741 cggttgactg cccaggtggc ctctctgacc tctgagctca ccacactcaa tgccaccatc 1801 cagcaacagg atcaagaact ggctggcctg aagcagcagg ccaaagagaa gcaggcccag 1861 ctagcacaga ccctccaaca gcaagaacag gcctcccagg gcctccgcca ccaggtggag 1921 cagctaagca gtagcctgaa gcagaaggag cagcagttga aggaggtagc ggagaagcag 1981 gaggcaacta ggcaggacca tgcccagcaa ctggccactg ctgcagagga gcgagaggcc 2041 tccttaaggg agcgggatgc ggctctcaag cagctggagg cactggagaa ggagaaggct 2101 gccaagctgg agattctgca gcagcaactt caggtggcta atgaagcccg ggacagtgcc 2161 cagacctcag tgacacaggc ccagcgggag aaggcagagc tgagccggaa ggtggaggaa 2221 ctccaggcct gtgttgagac agcccgccag gaacagcatg aggcccaggc ccaggttgca 2281 gagctagagt tgcagctgcg gtctgagcag caaaaagcaa ctgagaaaga aagggtggcc 2341 caggagaagg accagctcca ggagcagctc caggccctca aagagtcctt gaaggtcacc 2401 aagggcagcc ttgaagagga gaagcgcagg gctgcagatg ccctggaaga gcagcagcgt 2461 tgtatctctg agctgaaggc agagacccga agcctggtgg agcagcataa gcgggaacga 2521 aaggagctgg aagaagagag ggctgggcgc aaggggctgg aggctcgatt actgcagctt 2581 ggggaggccc atcaggctga gactgaagtc ctgcggcggg agctggcaga ggccatggct 2641 gcccagcaca cagctgagag tgagtgtgag cagctcgtca aagaagtagc tgcctggcgt 2701 gacgggtatg aggatagcca gcaagaggag gcacagtatg gcgccatgtt ccaggaacag 2761 ctgatgactt tgaaggagga atgtgagaag gcccgccagg agctgcagga ggcaaaggag 2821 aaggtggcag gcatagaatc ccacagcgag ctccagataa gccggcagca gaacaaacta 2881 gctgagctcc atgccaacct ggccagagca ctccagcagg tccaagagaa ggaagtcagg 2941 gcccagaagc ttgcagatga cctctccact ctgcaggaaa agatggctgc caccagcaaa 3001 gaggtggccc gcttggagac cttggtgcgc aaggcaggtg agcagcagga aacagcctcc 3061 cgggagttag tcaaggagcc tgcgagggca ggagacagac agcccgagtg gctggaagag 3121 caacagggac gccagttctg cagcacacag gcagcgctgc aggctatgga gcgggaggca 3181 gagcagatgg gcaatgagct ggaacggctg cgggccgcgc tgatggagag ccaggggcag 3241 cagcaggagg agcgtgggca gcaggaaagg gaggtggcgc ggctgaccca ggagcggggc 3301 cgtgcccagg ctgaccttgc cctggagaag gcggccagag cagagcttga gatgcggctg 3361 cagaacgccc tcaacgagca gcgtgtggag ttcgctaccc tgcaagaggc actggctcat 3421 gccctgacgg aaaaggaagg caaggaccag gagttggcca agcttcgtgg tctggaggca 3481 gcccagataa aagagctgga ggaacttcgg caaaccgtga agcaactgaa ggaacagctg 3541 gctaagaaag aaaaggagca cgcatctggc tcaggagccc aatctgaggc tgctggcagg 3601 acagagccaa caggccccaa gctggaagca ctgcgggcag aggtgagcaa gctggaacag 3661 caatgccaga agcagcagga gcaggctgac agcctggaac gcagcctcga ggctgagcgg 3721 gcctcccggg ctgagcggga cagtgctctg gagactctgc agggccagtt agaggagaag 3781 gcccaggagc tagggcacag tcagagtgcc ttagcctcgg cccaacggga gttggctgcc 3841 ttccgcacca aggtacaaga ccacagcaag gctgaagatg agtggaaggc ccaggtggcc 3901 cggggccggc aagaggctga gaggaaaaat agcctcatca gcagcttgga ggaggaggtg 3961 tccatcctga atcgccaggt cctggagaag gagggggaga gcaaggagtt gaagcggctg 4021 gtgatggccg agtcagagaa gagccagaag ctggaggaga gctgcgcctg ctgcaggcag 4081 agacagccag caacagtgcc agagctgcag aacgcagctc tgctctgcgg gaggaggtgc 4141 agagcctccg ggagggaggc tgagaaacag cgggtggctt cagagaacct gcggcaggag 4201 ctgacctcac aggctgagcg tgcggaggag ctgggccaag aattgaaggc gtggcaggag 4261 aagttcttcc agaaagagca ggccctctcc accctgcagc tcgagcacac cagcacacag 4321 gccctggtga gtgagctgct gccagctaag cacctctgcc agcagctgca ggccgagcag 4381 gccgctgccg agaaacgcca ccgtgaggag ctggagcaga gcaagcaggc cgctggggga 4441 ctgcgggcag agctgctgcg ggcccagcgg gagcttgggg agctgattcc tctgcggcag 4501 aaggtggcag agcaggagcg aacagctcag cagctgcggg cagagaaggc cagctatgca 4561 gagcagctga gcatgctgaa gaaggcgcat ggcctgctgg cagaggagaa ccgggggctg 4621 ggtgagcggg ccaaccttgg ccggcagttt ctggaagtgg agttggacca ggcccgggaa 4681 aagtatgtcc aagagttggc agccgtacgt gctgatgctg agacccgtct ggctgaggtg 4741 cagcgagaag cacagagcac tgcccgggag ctggaggtga tgactgccaa gtatgagggt 4801 gccaaggtca aggtcctgga ggagaggcag cggttccagg aagagaggca gaaactcact 4861 gcccaggtgg aagaactgag taagaaactg gctgactctg accaagccag caaggtgcag 4921 cagcagaagc tgaaggctgt ccaggctcag ggaggcgaga gccagcagga ggcccagcgc 4981 ttccaggccc agctgaatga actgcaagcc cagttgagcc agaaggagca ggcagctgag 5041 cactataagc tgcagatgga gaaagccaaa acacattatg atgccaagaa gcagcagaac 5101 caagagctgc aggagcagct gcggagcctg gagcagctgc agaaggaaaa caaagagctg 5161 cgagctgaag ctgaacggct gggccatgag ctacagcagg ctgggctgaa gaccaaggag 5221 gctgaacaga cctgccgcca ccttactgcc caggtgcgca gcctggaggc acaggttgcc 5281 catgcagacc agcagcttcg agacctgggc aaattccagg tggcaactga tgctttaaag 5341 agccgtgagc cccaggctaa gccccagctg gacttgagta ttgacagcct ggatctgagc 5401 tgcgaggagg ggaccccact cagtatcacc agcaagctgc ctcgtaccca gccagacggc 5461 accagcgtcc ctggagaacc agcctcacct atctcccagc gcctgccccc caaggtagaa 5521 tccctggaga gtctctactt cactcccatc cctgctcgga gtcaggcccc cctggagagc 5581 agcctggact ccctgggaga cgtcttcctg gactcgggtc gtaagacccg ctccgctcgt 5641 cggcgcacca cgcagatcat caacatcacc atgaccaaga agctagatgt ggaagagcca 5701 gacagcgcca actcatcgtt ctacagcacg cggtctgctc ctgcttccca ggctagcctg 5761 cgagccacct cctctactca gtctctagct cgcctgggtt ctcccgatta tggcaactca 5821 gccctgctca gcttgcctgg ctaccgcccc accactcgca gttctgctcg tcgttcccag 5881 gccggggtgt ccagtggggc ccctccagga aggaacagct tctacatggg cacttgccag 5941 gatgagcctg agcagctgga tgactggaac cgcattgcag agctgcagca gcgcaatcga 6001 gtgtgccccc cacatctgaa gacctgctat cccctggagt ccaggccttc cctgagcctg 6061 ggcaccatca cagatgagga gatgaaaact ggagaccccc aagagaccct gcgccgagcc 6121 agcatgcagc caatccagat agccgagggc actggcatca ccacccggca gcagcgcaaa 6181 cgggtctccc tagagcccca ccagggccct ggaactcctg agtctaagaa ggccaccagc 6241 tgtttcccac gccccatgac tccccgagac cgacatgaag ggcgcaaaca gagcactact 6301 gaggcccaga agaaagcagc tccagcttct actaaacagg ctgaccggcg ccagtcgatg 6361 gccttcagca tcctcaacac acccaagaag ctagggaaca gccttctgcg gcggggagcc 6421 tcaaagaagg ccctgtccaa ggcttccccc aacactcgca gtggaacccg ccgttctccg 6481 cgcattgcca ccaccacagc cagtgccgcc actgctgccg ccattggtgc cacccctcga 6541 gccaagggca aggcaaagca ctaaagggcc agtaccagtg agtggcccca cctgtgtccc 6601 cgatgctgcc gtcacctggt cctccgccta ctgtccctct cagtgccttc tctcagctcc 6661 caggccaaca gtagccaaac ccctagagac agtgatgcct gcccgcaccc tggcctggcc 6721 cctggtcctt cactggcgcc ttctcggagc tggcccaggg ggcctggagc atggacagtg 6781 tgggcgctct ccctaccttg cctccttttt tcttaaagca aagtcacttc tccatcacaa 6841 ccagatttga ggctggtttt gatggctggg tccttgggcc tggccagtct tcctcttagc 6901 ctctggatct agaagggacc ataagaggag taggccctgg ttcctgctgt cctggtggct 6961 gggccagcag gggccctcac tcttgaagtc caggactggg tctgacctgg tgggagcacc 7021 tgccagagga tgctctttcc caggacggat gggccctgtg tctcaggagt ggggttgggg 7081 gacagccttc agcagcagct cacaccctac cttccccaga cttgcactgg ggtgggattt 7141 ggagtgatgg gaaggttttt aagggccggg gatggatctt ttctaaatgt tattacttgt 7201 aaataaagtc tattttt // LOCUS HSNUP153 5487 bp RNA PRI 07-APR-1994 DEFINITION H.sapiens mRNA for nuclear pore complex protein hnup153. ACCESSION Z25535 NID g406224 KEYWORDS nuclear pore complex protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5487) AUTHORS McMorrow,I., Bastos,R., Horton,H. and Burke,B. TITLE Sequence analysis of a cDNA encoding a human nuclear pore complex protein, hnup153 JOURNAL Biochim. Biophys. Acta 1217 (2), 219-223 (1994) MEDLINE 94154002 REFERENCE 2 (bases 1 to 5487) AUTHORS Burke,B. TITLE Direct Submission JOURNAL Submitted (11-AUG-1993) Brian Burke, Cell Biology, Harvard Medical School, 25 Shattuck Street, Boston, MA, 02115, USA FEATURES Location/Qualifiers source 1..5487 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 1..4428 /codon_start=1 /product="nuclear pore complex protein hnup153" /db_xref="PID:g406225" /translation="MASGAGGVGGGGGGKIRTRRCHQGPIKPYQQGRQQHQGILSRVT ESVKNIVPGWLQRYFNKNEDVCSCSTDTSEVPRWPENKEDHLVYADEESSNITDGRIT PEPAVSNTEEPSTTSTASNYPDVLTRPSLHRSHLNFSMLESPALHCQPSTSSAFPIGS SGFSLVKEIKDSTSQHDDDNISTTSGFSSRASDKDITVSKNTSLPPLWSPEAERSHSL SQHTATSSKKPAFNLSAFGTLSPSLGNSSILKTSQLGDSPFYPGKTTYGGAAAAVRQS KLRNTPYQAPVRRQMKAKQLSAQSYGVTSSTARRILQSLEKMSSPLADAKRIPSIVSS PLNSPLDRSGIDITDFQAKREKVDSQYPPVQRLMTPKPVSIATNRSVYFKPSLTPSGE FRKTNQRIDNKCSTGYEKNMTPGQNREQRESGFSYPNFSLPAANGLSSGVGGGGGKMR RERHAFVASKPLEEEEMEVPVLPKISLPITSSSLPTFNFSSPEITTSSPSPINSSQAL TNKVQMTSPSSTGSPMFKFSSPIVKSTEANVLPPSSIGFTFSVPVAKTAELSGSSSTL EPIISSSAHHVTTVNSTNCKKTPPEDCEGPFRPAEILKEGSVLDILKSPGFASPKIDS VAAQPTATSPVVYTRPAISSFSSSGIGFGESLKAGSSWQCDTCLLQNKVTDNKCIACQ AAKLSPRDTAKQTGIETPNKSGKTTLSASGTGFGDKFKPVIGTWDCDTCLVQNKPEAI KCVACETPKPGTCVKRALTLTVVSESAETMTASSSSCTVTTGTLGFGDKFKRPIGSWE CSVCCVSNNAEDNKCVSCMSEKPGSSVPASSSSTVPVSLPSGGSLGLEKFKKPEGSWD CELCLVQNKADSTKCLACESAKPGTKSGFKGFDTSSSSSNSAASSSFKFGVSSSSSGP SQTLTSTGNFKFGDQGGFKIGVSSDSGSINPMSEGFKFSKPIGDFKFGVSSESKPEEV KKDSKNDNFKFGLSSGLSNPVSLTPFQFGVSNLGQEEKKEELPKSSSAGFSFGTGVIN STPAPANTIVTSENKSSFNLGTIETKSASVAPFTCKTSEAKKEEMPATKGGFSFGNVE PASLPSASVFVLGRTEEKQQEPVTSTSLVFGKKADNEEPKCQPVFSFGNSEQTKDENS SKSTFSFSMTKPSEKESEQPAKATFAFGAQTSTTADQGAAKPVFSFLNNSSSSSSTPA TSAGGGIFGSSTSSSNPPVATFVFGQSSNPVSSSAFGNTAESSTSQSLLFSQDSKLAT TSSTGTAVTPFVFGPGASSNNTTTSGFGFGATTTSSSAGSSFVFGTGPSAPSASPAFG ANQTPTFGQSQGASQPNPPGFGSISSSTALFPTGSQPAPPTFGTVSSSSQPPVFGQQP SQSAFGSGTTPNSSSAFQFGSSTTNFNFTNNSPSGVFTFGANSSTPAASAQPSGSGGF PFNQSPAAFTVGSNGKNVFSSSGTSFSGRKIKTAVRRRK" BASE COUNT 1593 a 1192 c 1117 g 1585 t ORIGIN 1 atggcctcag gagccggagg agtcggaggg ggcggtggcg gcaagatccg gacgcggcgt 61 tgccaccagg ggccaattaa gccttaccag caggggcgac aacagcatca gggcattctt 121 agcagggtta cagaatctgt taagaatatt gtgccagggt ggttacaaag atacttcaac 181 aagaatgaag atgtatgcag ctgttcaaca gacacaagcg aggttccacg ctggccagaa 241 aataaagagg accatctggt atatgccgat gaggagagct ctaatattac tgatgggaga 301 atcacacctg agccagcagt cagtaataca gaagaacctt caacaactag tactgcttca 361 aattatccag atgtgttaac aaggccttct cttcatcgga gccatctgaa tttttccatg 421 ttggaatccc ctgcattaca ctgtcagcca tctacatcct cggcattccc aattggcagt 481 tcgggatttt cccttgtaaa ggaaattaaa gattctacct ctcagcatga tgatgataac 541 atctcaacta ccagtggttt ttcttcaaga gcttctgata aagatataac tgtttcaaag 601 aacacttcat tgccacctct gtggtcccca gaagctgaac gttctcactc actctcacag 661 cacactgcca ccagctcaaa aaaaccagca ttcaacttgt ctgcctttgg aacactttcc 721 ccttcacttg ggaattcttc aatccttaaa accagtcagc ttggagattc tcctttttat 781 cctggaaaaa caacatacgg tggggcagca gctgctgtaa gacagtctaa actacgaaat 841 acaccttatc aggcaccagt tagaagacaa atgaaagcta agcaactcag tgcacaatct 901 tacggtgtga ccagttcaac agctcggcga atattgcagt ctttagagaa gatgtcaagc 961 cctttagcgg atgcaaaaag aattccatcc attgtttctt ctcctctgaa ttctcctctt 1021 gataggagtg ggatagatat cacagatttt caggccaaaa gagaaaaggt ggattctcaa 1081 tatcctcctg ttcagagact tatgacccca aagccagttt ccatagcaac aaatcgaagt 1141 gtttatttta aaccatctct gactccttct ggtgaattca ggaagactaa tcaaagaata 1201 gataacaagt gcagtactgg atatgaaaaa aatatgacac ccggacaaaa tagagaacaa 1261 cgagaaagtg gcttttcata tccaaatttc agtttgcctg cagccaatgg tttatcttct 1321 ggagtaggtg gtggaggtgg caagatgaga cgagaaagac acgcctttgt tgcttctaaa 1381 cctctggagg aggaggaaat ggaagttcca gtattaccga aaatctctct accgatcacc 1441 agttcttcac tgcctacctt taattttagt tcccctgaga tcacaacttc ctctccatca 1501 cccatcaatt cgtctcaagc attaacaaac aaggtacaaa tgacctctcc gagcagcact 1561 ggcagtccca tgtttaaatt ttcatctcca atcgtaaaat ctactgaggc aaatgtacta 1621 cctccatcat ctattggatt tacatttagt gtgcctgttg caaaaacagc agaactttct 1681 ggttctagta gtactttaga accaattata agtagttcag ctcatcatgt cactacagtg 1741 aacagtacaa attgtaagaa gacaccacct gaagattgtg agggtccttt tagacctgca 1801 gaaatcctga aagaaggaag tgttctagat attctgaaaa gccctggttt cgcatcgccg 1861 aagatagatt ctgttgctgc tcagcccacc gcaacaagcc cagtagttta tacaagacca 1921 gcaataagta gcttttcttc tagtggaatt gggtttgggg agagtttaaa agctgggtca 1981 tcatggcagt gtgatacatg tctactccag aacaaagtta cagacaacaa atgcatagcc 2041 tgtcaagcag caaaattgtc acccagagat actgctaaac agactggaat tgaaacacca 2101 aataaaagtg gcaaaacaac tctttctgca tcagggacag gctttggaga caaatttaaa 2161 ccagtgatag gcacttggga ttgtgatacc tgtttagtgc aaaataaacc tgaagcaata 2221 aaatgtgtag cctgtgaaac accgaaacct ggaacttgtg tgaagcgagc ccttacattg 2281 acagtggttt cggaaagtgc tgagactatg actgcttcat cttccagctg cactgtaacc 2341 actggtacct taggatttgg agataaattc aaaaggccca ttggatcttg ggagtgttca 2401 gtatgctgtg tttctaataa tgcagaagac aataagtgtg tgtcctgtat gtctgagaaa 2461 ccaggaagtt cagtacctgc ttcaagtagc agcactgtac ctgtctctct gccttctgga 2521 ggctctctag gattggaaaa gttcaagaaa cccgagggaa gctgggactg tgaattgtgc 2581 ctagtgcaga ataaggcaga ctctaccaaa tgtttggcat gtgaaagtgc aaagccaggc 2641 acaaaatctg ggtttaaagg ctttgacaca tcttcctcat cttcgaactc agcagcctcc 2701 tcatccttca aatttggtgt ctcatcatcc tcttctgggc cttctcagac tttaacaagc 2761 actggaaatt ttaaatttgg agatcaggga ggattcaaaa taggtgtgtc atctgattct 2821 gggtctataa accccatgag tgaaggcttt aaattttcta aaccaatagg agattttaaa 2881 tttggagttt catctgaatc taagcccgaa gaagttaaaa aagatagtaa gaatgataat 2941 tttaagtttg gactttcttc tggtttaagc aacccagttt ctttaactcc atttcaattt 3001 ggggtatcta atcttggaca ggaagaaaag aaagaggaac tgcccaaatc ttcctctgca 3061 ggttttagct ttggtacagg tgttattaac tccacccctg ctcctgctaa caccatagtg 3121 acctctgaga acaagagcag cttcaacctt ggaaccatag aaaccaagag tgcttcagtg 3181 gctcctttca catgtaagac atcagaagct aaaaaagaag aaatgcctgc caccaaagga 3241 ggattctctt ttggcaacgt ggagcctgcc tctctgccat ctgcctcagt gtttgttttg 3301 ggaaggacag aagagaaaca gcaagagcct gtcacttcta cttccctagt ttttgggaag 3361 aaagctgaca atgaagagcc aaagtgtcaa ccagtgtttt cctttgggaa ttcagagcaa 3421 accaaagatg agaattcttc aaagtccaca tttagtttta gtatgacaaa accatctgag 3481 aaggaatctg aacagccagc aaaagccact tttgcctttg gagctcaaac tagtactaca 3541 gctgatcaag gtgcagcaaa gccagttttt agtttcttga acaacagttc ctctagttca 3601 agtacaccag ccacttctgc tggtggtggc atatttggta gttccacctc ttcctccaat 3661 ccacctgtgg ctacctttgt gtttggacag tccagcaatc ctgtgagcag ctctgccttt 3721 ggtaacactg ctgaatccag cacctctcag tctttgctat tttctcaaga tagcaaacta 3781 gcaaccacat ccagcacagg tacagctgtc accccatttg tctttggtcc aggagccagc 3841 agtaataata ctaccacctc tggtttcggc tttggagcca caaccacatc tagctctgca 3901 ggatcctcct ttgtatttgg aactggaccc tcagcaccat ctgccagtcc agcatttggt 3961 gctaaccaga ccccaacatt tggacaaagt caaggtgcca gccagcccaa tcccccaggc 4021 tttggatcta tatcatcttc cacagcatta tttcccactg gttctcagcc tgcaccacct 4081 acttttggga cagtgtcaag cagtagccag ccccctgtgt ttggacagca acctagtcag 4141 tctgcatttg gctctggaac aactcctaat tctagttcgg ctttccagtt tggcagcagc 4201 actacaaatt tcaacttcac aaacaacagt ccatcaggag tgttcacatt tggtgcaaat 4261 tctagcacac ctgcagcctc agcccagcct tcaggctcgg ggggctttcc atttaaccag 4321 tctccagcag catttacagt ggggtcaaat gggaaaaatg tgttctcttc ttctggaact 4381 tcattctctg gtcgcaagat aaagactgct gttagacgca ggaaataaag gtcacattgg 4441 tgttgtactc aattttaaca acagctggtg ccctgctttc agatactgga ttgtactttg 4501 tgctggggtt atctgaagtc agatctgcct aaggacttct ttaattttgg aattttcctc 4561 ctttctcttt cgttacagaa gccccaccct gcctcaccca ccctttttta aataaataaa 4621 tagctagact ggtgactgat tcttcagcaa aaatatttta tgatccagca gattattcac 4681 tgatttgaca tagtctggct gtacccagga atggagcctg cacggtgaat ggctttgtat 4741 agaacctctt tgtctacacc attatgtgcg ctgataacgt tcatggaacg cgttgaaatt 4801 gtaattatat ctgaggaatt ctgtatagat tagaattctg tatagattag agagtgttga 4861 aacggatgat ttctatgctg agtttgtgct ggtgtatgtg tgaagtgagt gagttgggtg 4921 tattgtgcgc taaacttttc tgatagagga agcctgatta aagaatggtc cgtgctaagg 4981 acttgttaga tctagttcac tctccattta ataattatat gctatttcta tattttcatt 5041 ctcctatcac ctgtcttgcc tttttcatta ttttattatg aaacttgtgt aaatacaatt 5101 ttgtttctgt actttttggc ataacataaa tctgtgaact tgaaatttga attttgtgtt 5161 agagattttt ttgttgtttg tttagtcttg tctcagattt tattatgtaa atcccattat 5221 tcaaagttgc ctaaatccat ttggaaatct ttaaaaaaaa aattggggat tcttaaagtt 5281 gaatttattg gcttttctga tccagttttg tttggaccaa aaaccagtat tgtacaaagt 5341 attaagcata tatttttata tttactaaaa tggtctgtgg tgacttttgg ataataagga 5401 aaagtttaat attaaagcca tgtttattac agtataatta acatgttaaa ccatgggata 5461 aatgccatca ataaaaaatt atgacat // LOCUS HSNUP88 2390 bp RNA PRI 02-OCT-1997 DEFINITION H.sapiens mRNA for Nup88 protein. ACCESSION Y08612 NID g1707521 KEYWORDS nuclear pore complex protein; Nup88 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2390) AUTHORS Fornerod,M., van Deursen,J., van Baal,S., Reynolds,A., Davis,D., Murti,K.G., Fransen,J. and Grosveld,G. TITLE The human homologue of yeast CRM1 is in a dynamic subcomplex with CAN/Nup214 and a novel nuclear pore component Nup88 JOURNAL EMBO J. 16 (4), 807-816 (1997) MEDLINE 97201523 REFERENCE 2 (bases 1 to 2390) AUTHORS Fornerod,M. TITLE Direct Submission JOURNAL Submitted (02-OCT-1996) M. Fornerod, St. Jude Children's Research Hospital, Department of Genetics, 332 N. Lauderdale, Memphis, TN 38105, USA FEATURES Location/Qualifiers source 1..2390 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /germline /tissue_type="placenta" /clone_lib="CloneTech Hu2002B#29203" /chromosome="17" /map="17p13" gene 49..2274 /gene="Nup88" CDS 49..2274 /gene="Nup88" /codon_start=1 /product="88kDa nuclear pore complex protein" /db_xref="PID:e274558" /db_xref="PID:g1707522" /translation="MAAAEGPVGDGELWQTWLPNHVVFLRLREGLKNQSPTEAEKPAS SSLPSSPPPQLLTRNVVFGLGGELFLWDGEDSSFLVVRLRGPSGGGEEPALSQYQRLL CINPPLFEIYQVLLSPTQHHVALIGIKGLMVLELPKRWGKNSEFEGGKSTVNCSTTPV AERFFTSSTSLTLKHAAWYPSEILDPHVVLLTSDNVIRIYSLREPQTPTNVIILSEAE EESLVLNKGRAYTASLGETAVAFDFGPLDAVPKTLFGQNGKDEVVAYPLYILYENGET FLTYISLLHSPGNIWKAVGSIAHASAAEDNYGYDACAVLCLPCVPNILVIATESGMLY HCVVLEGEEEDDHTSEKSWDSRIDLIPSLYVFECVELELALKLASGEDDPFDSDFSCP VKLHRDPKCPSRYHCTHEAGVHSVGLTWIHKLHKFLGSDEEDKDSLQELSTEQKCFVE HILCTRPLPCRQPAPIRGFWIVPDILGPTMICITSTYECLIWPLLSTVHPASPPLLCT REDVEVAESSLRVLAETPDSFEKHIRSILQRSVANPAFLKASEKDIAPPPEECLQLLS RATQVFREQYILKQDLAKEEIQRRVKLLCDQKKKQLEDLSYCREERKSLREMAERLAD KYEEAKEKQEDIMNRMKKLLHSFHSELPVLSDSERDMKKELQLIPDQLRHLGNAIKQV TMKKDYQQQKMEKVLSLPKPTIILSAYQRKCIQSILKEEGEHIREMVKQINDIRNHVN F" polyA_signal 2360..2364 BASE COUNT 708 a 533 c 553 g 595 t 1 others ORIGIN 1 gataaaccca caagacacaa aacatacctt tcgagcagtt gggccaagat ggcggccgcc 61 gagggaccgg tgggcgacgg cgagctgtgg cagacctggc ttcctaacca cgtcgtgttc 121 ttgcggctcc gggagggact gaaaaaccag agtccaaccg aagctgagaa accagcttct 181 tcgtcgttgc cttcgtcgcc gccgccgcag ttgctgacga gaaacgtggt ctttggcctc 241 ggcggagagc ttttcctgtg ggacggagaa gacagctcct tcttagtcgt tcgccttcgg 301 ggccccagcg gcggcggcga agagcccgcc ctgtcccagt accagagatt gctttgcata 361 aatccacccc tgtttgaaat ctatcaagtc ttgttaagcc caacacaaca tcatgtagca 421 cttataggra taaaaggact tatggtatta gaattaccta aaagatgggg gaagaattct 481 gaatttgaag gtggaaaatc aacagtgaat tgtagtacca ctccagttgc ggagagattt 541 ttcaccagtt ccacctctct gactctaaag catgctgcat ggtatccaag tgaaatcctg 601 gatccccacg tagtgctgtt aacatcagac aacgtaatca gaatttactc tctacgtgag 661 ccgcagacac ccactaacgt gataatactt tcagaagccg aagaggaaag tctagtactc 721 aataaaggaa gggcgtatac cgcatctcta ggagagacag cagttgcatt tgactttggg 781 ccattggacg cagtcccaaa gactctattt ggacaaaacg gcaaagatga agtagtggca 841 tacccactgt acatcttata tgaaaatgga gagactttcc tgacatacat cagtctgtta 901 cacagccctg gaaatatttg gaaagctgtt gggtccattg cccatgcatc tgcggctgaa 961 gataactatg gttatgatgc gtgtgctgta ctctgcttac cctgtgtccc caatatctta 1021 gtgatcgcta ctgaatcagg aatgctgtat cactgtgtcg tgctagaagg ggaagaagaa 1081 gatgaccaca cgtcagaaaa gtcctgggat tccaggattg acctcattcc ttctctgtat 1141 gtgtttgaat gtgttgagtt ggagcttgct ttgaaactgg catctggaga ggatgaccct 1201 tttgattctg acttttcttg tccagtcaaa cttcatagag atcccaagtg tccttcaaga 1261 tatcactgta ctcatgaagc tggtgtacat agtgttgggc taacttggat tcataaactt 1321 cacaaatttc ttggatcaga tgaagaagat aaggatagtt tacaggaact ctctacagaa 1381 cagaaatgct ttgttgaaca catcctttgt acgaggccat tgccctgcag gcagccagct 1441 ccaattcgag gattttggat tgtacctgac attctgggac ccacgatgat ctgcatcacc 1501 agtacctatg aatgcctcat atggccgtta ttaagtacag tccatccagc gtctcctccc 1561 ctgctttgta ctcgagaaga tgttgaagtg gcagagtctt ccctccgtgt tctggctgaa 1621 accccagatt cctttgaaaa gcatattaga agcattttgc aacgtagtgt tgccaatcca 1681 gcatttttga aagcttctga aaaggacata gcccctcctc ctgaagaatg ccttcagctc 1741 ctcagcagag ccacccaggt gttcagagag cagtacattc tcaaacagga cttggcaaag 1801 gaggagattc agcggagggt caaattatta tgtgaccaaa aaaagaaaca actagaagat 1861 ctcagttatt gtcgagaaga gaggaaaagt ctgcgggaaa tggctgagcg tttagctgac 1921 aaatatgagg aagctaaaga aaaacaagag gatatcatga acaggatgaa aaaactactt 1981 cacagttttc actctgagct cccagttctc tctgatagtg agcgagacat gaagaaagaa 2041 ttacagctga tacctgatca acttcgacat ttgggcaatg ccatcaaaca ggttactatg 2101 aaaaaggatt atcaacagca aaagatggag aaggtgttga gtcttccaaa acccaccatt 2161 attctcagtg cctaccagcg aaagtgcatt cagtccatcc tgaaagagga gggtgaacat 2221 ataagggaaa tggtgaagca aatcaatgat atccgcaatc atgtaaactt ctgacaccac 2281 caggagctga ctcacacctg aactgaacac cattgaaggc ttaaacccat attgtaaaac 2341 aggtagaatt atctaattta taaaaaggtg ttttgatgaa aaaaaaaaaa // LOCUS HSOA1MRNA 1607 bp RNA PRI 10-DEC-1995 DEFINITION H.sapiens mRNA (ocular albinism type 1 related). ACCESSION Z48804 NID g886873 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1607) AUTHORS Bassi,M.T., Schiaffino,M.V., Renieri,A., De Nigris,F., Galli,L., Bruttini,M., Gebbia,M., Bergen,A.A., Lewis,R.A. and Ballabio,A. TITLE Cloning of the gene for ocular albinism type 1 from the distal short arm of the X chromosome JOURNAL Nature Genet. 10 (1), 13-19 (1995) MEDLINE 95375777 REFERENCE 2 (bases 1 to 1607) AUTHORS Schiaffino,M.V. TITLE Direct Submission JOURNAL Submitted (28-MAR-1995) Maria V. Schiaffino, TIGEM - Telethon Institute of Genetics and Medicine, Via Olgettina, 58, Milano, ITALY, 20132 FEATURES Location/Qualifiers source 1..1607 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="retina" /chromosome="X" /map="p22.3-22.2" mRNA 1..1607 gene 1..1275 /gene="OA1" CDS 1..1275 /gene="OA1" /function="defective potential cause of ocular albinism type 1" /codon_start=1 /db_xref="PID:e213550" /db_xref="PID:g1113125" /translation="MTQAGRRGPGTPEPRPRTQPMASPRLGTFCCPTRDAATQLVLSF QPRAFHALCLGSGGLRLALGLLQLLPGRRPAGPGSPATSPPASVRILRAAAACDLLGC LGMVIRSTVWLGFPNFVDSVSDMNHTEIWPAAFCVGSAMWIQLLYSACFWWLFCYAVD AYLVIRRSAGLSTILLYHIMAWGLATLLCVEGAAMLYYPSVSRCERGLDHAIPHYVTM YLPLLLVLVANPILFQKTVTAVASLLKGRQGIYTENERRMGAVIKIRFFKIMLVLIIC WLSNIINESLLFYLEMQTDINGGSLKPVRTAAKTTWFIMGILNPAQGFLLSLAFYGWT GCSLGFQSPRKEIQWESLTTSAAEGAHPSPLMPHENPASGKVSQVGGQTSDEALSMLS EGSDASTIEIHTASESCNKNEGDPALPTHGDL" polyA_signal 1589..1594 polyA_site 1607 BASE COUNT 325 a 466 c 439 g 377 t ORIGIN 1 atgacccagg caggccggcg gggtcctggc acacccgagc cgcgtccgcg aacacagccc 61 atggcctccc cgcgcctagg gaccttctgc tgccccacgc gggacgcagc cacgcagctc 121 gtgctgagct tccagccgcg ggccttccac gcgctctgcc tgggcagcgg cgggctccgc 181 ttggcgctgg gccttctgca gctgctgccc ggccgccggc ccgcgggccc cgggtccccc 241 gcgacgtccc cgccggcctc ggtccgcatc ctgcgcgctg ccgctgcctg cgaccttctc 301 ggctgcctgg gtatggtgat ccggtccacc gtgtggttag gattcccaaa ttttgttgac 361 agcgtctcgg atatgaacca cacggaaatt tggcctgctg ctttctgcgt ggggagtgcg 421 atgtggatcc agctgttgta cagtgcctgc ttctggtggc tgttttgcta tgcagtggat 481 gcttatctgg tgatccggag atcggcagga ctgagcacca tcctgctgta tcacatcatg 541 gcgtggggcc tggccaccct gctctgtgtg gagggagccg ccatgctcta ctacccttcc 601 gtgtccaggt gtgagcgggg cctggaccac gccatccccc actatgtcac catgtacctg 661 cccctgctgc tggttctcgt ggcgaacccc atcctgttcc aaaagacagt gactgcagtg 721 gcctctttac ttaaaggaag acaaggcatt tacacggaga acgagaggag gatgggagcc 781 gtgatcaaga tccgattttt caaaatcatg ctggttttaa ttatttgttg gttgtcgaat 841 atcatcaatg aaagcctttt attctatctt gagatgcaaa cagatatcaa tggaggttct 901 ttgaaacctg tcagaactgc agccaagacc acatggttta ttatgggaat cctgaatcca 961 gcccagggat ttctcttgtc tttggccttc tacggctgga caggatgcag cctgggtttt 1021 cagtctccca ggaaggagat ccagtgggaa tcactgacca cctcggctgc tgagggggct 1081 cacccatccc cactgatgcc ccatgaaaac cctgcttccg ggaaggtgtc tcaagtgggt 1141 gggcagactt ctgacgaagc cctgagcatg ctgtctgaag gttctgatgc cagcacaatt 1201 gaaattcaca ctgcaagtga atcctgcaac aaaaatgagg gtgaccctgc tctcccaacc 1261 catggagacc tatgaagggg atgtgctggg ggtccagacc ccatattcct cagactcaac 1321 aattcttgtt ctttagaact gtgttctcac cttcccaaca ctgcactgcc gaagtgtagc 1381 ggcccccaaa ccttgctctc atcaccagct agagcttctt cccgaagggc ctttaggata 1441 ggagaaaggg ttcatgcaca cacgtgtgag aatggaagag ccccctccag accactctac 1501 agctgctcta gccttagttg ccactaggaa gttttctgag gctggctgta aagtaagtgt 1561 aaggtccaca tccttgggga agtagttaaa taaaatagtt atgactg // LOCUS HSOA3MR 1285 bp RNA PRI 20-SEP-1993 DEFINITION H.sapiens mRNA for OA3 antigenic surface determinant. ACCESSION X69398 NID g396175 KEYWORDS surface glycoprotein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1285) AUTHORS Campbell,I.G., Freemont,P.S., Foulkes,W. and Trowsdale,J. TITLE An ovarian tumor marker with homology to vaccinia virus contains an IgV-like region and multiple transmembrane domains JOURNAL Cancer Res. 52 (19), 5416-5420 (1992) MEDLINE 93007897 REFERENCE 2 (bases 1 to 1285) AUTHORS Campbell,I.G. TITLE Direct Submission JOURNAL Submitted (20-SEP-1993) I.G. Campbell, Univ. of Southampton,, Dept of Obstetrics and Gynaecology, Princess Anne Hospital, Coxford Rd., Southampton, Hants, S09 4HA, UK FEATURES Location/Qualifiers source 1..1285 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 107..1078 /codon_start=1 /product="antigenic surface determinant OA3" /db_xref="PID:g396176" /db_xref="SWISS-PROT:Q08722" /translation="MWPLVAALLLGSACCGSAQLLFNKTKSVEFTFCNDTVVIPCFVT NMEAQNTTEVYVKWKFKGRDIYTFDGALNKSTVPTDFSSAKIEVSQLLKGDASLKMDK SDAVSHTGNYTCEVTELTREGETIIELKYRVVSWFSPNENILIVIFPIFAILLFWGQF GIKTLKYRSGGMDEKTIALLVAGLVITVIVIVGAILFVPGEYSLKNATGLGLIVTSTG ILILLHYYVFSTAIGLTSFVIAILVIQVIAYILAVVGLSLCIAACIPMHGPLLISGLS ILALAQLLGLVYMKFVASNQKTIQPPRKAVEEPLNAFKESKGMMNDE" BASE COUNT 358 a 244 c 297 g 386 t ORIGIN 1 gggctgcctg tgacgcgcgg cgcggtcggt cctgcctgta acggcggcgg cggctgctgc 61 tccagacacc tgcggcggcg gcggcgaccc cgcggcgggc gcggagatgt ggcccctggt 121 agcggcgctg ttgctgggct cggcgtgctg cggatcagct cagctactat ttaataaaac 181 aaaatctgta gaattcacgt tttgtaatga cactgtcgtc attccatgct ttgttactaa 241 tatggaggca caaaacacta ctgaagtata cgtaaagtgg aaatttaaag gaagagatat 301 ttacaccttt gatggagctc taaacaagtc cactgtcccc actgacttta gtagtgcaaa 361 aattgaagtc tcacaattac taaaaggaga tgcctctttg aagatggata agagtgatgc 421 tgtctcacac acaggaaact acacttgtga agtaacagaa ttaaccagag aaggtgaaac 481 gatcatcgag ctaaaatatc gtgttgtttc atggttttct ccaaatgaaa atattcttat 541 tgttattttc ccaatttttg ctatactcct gttctgggga cagtttggta ttaaaacact 601 taaatataga tccggtggta tggatgagaa aacaattgct ttacttgttg ctggactagt 661 gatcactgtc attgtcattg ttggagccat tcttttcgtc ccaggtgaat attcattaaa 721 gaatgctact ggccttggtt taattgtgac ttctacaggg atattaatat tacttcacta 781 ctatgtgttt agtacagcga ttggattaac ctccttcgtc attgccatat tggttattca 841 ggtgatagcc tatatcctcg ctgtggttgg actgagtctc tgtattgcgg cgtgtatacc 901 aatgcatggc cctcttctga tttcaggttt gagtatctta gctctagcac aattacttgg 961 actagtttat atgaaatttg tggcttccaa tcagaagact atacaacctc ctaggaaagc 1021 tgtagaggaa ccccttaatg cattcaaaga atcaaaagga atgatgaatg atgaataact 1081 gaagtgaagt gatggactcc gatttggaga gtagtaagac gtgaaaggaa tacacttctg 1141 tttaagcacc atggccttga tgattcactg ttggggagaa gaaacaagaa aagtaactgg 1201 ttgtcaccta tgagaccctt acgtgattgt tagttaagtt tttattcaaa gcagctgtaa 1261 tttagttaat aaaataatta tgatc // LOCUS HSOAS1 1322 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for (2'-5') oligo A synthetase E (1,6 kb RNA). ACCESSION X02874 K00006 NID g35122 KEYWORDS synthetase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1322) AUTHORS Benech,P., Mory,Y., Revel,M. and Chebath,J. TITLE Structure of two forms of the interferon-induced (2'-5') oligo A synthetase of human cells based on cDNAs and gene sequences JOURNAL EMBO J. 4 (9), 2249-2256 (1985) MEDLINE 86081732 REFERENCE 2 (bases 1 to 1322) AUTHORS Merlin,G., Chebath,J., Benech,P., Metz,R. and Revel,M. TITLE Molecular cloning and sequence of partial cDNA for interferon-induced (2'-5')oligo(A) synthetase mRNA from human cells JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80 (16), 4904-4908 (1983) MEDLINE 83273721 COMMENT Data kindly reviewed (14-NOV-1985) by Chebath J. FEATURES Location/Qualifiers source 1..1322 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 34..1128 /note="(2'-5') oligo A synthetase E16 (aa 1-364)" /codon_start=1 /db_xref="PID:g35123" /db_xref="SWISS-PROT:P00973" /translation="MMDLRNTPAKSLDKFIEDYLLPDTCFRMQIDHAIDIICGFLKER CFRGSSYPVCVSKVVKGGSSGKGTTLRGRSDADLVVFLSPLTTFQDQLNRRGEFIQEI RRQLEACQRERALSVKFEVQAPRWGNPRALSFVLSSLQLGEGVEFDVLPAFDALGQLT GSYKPNPQIYVKLIEECTDLQKEGEFSTCFTELQRDFLKQRPTKLKSLIRLVKHWYQN CKKKLGKLPPQYALELLTVYAWERGSMKTHFNTAQGFRTVLELVINYQQLCIYWTKYY DFKNPIIEKYLRRQLTKPRPVILDPADPTGNLGGGDPKGWRQLAQEAEAWLNYPCFKN WDGSPVSSWILLVRPPASSLPFIPAPLHEA" misc_feature 37..39 /note="pot. altern. translation start site" variation 376 /note="U is C in variant clone" variation 525 /note="A is G in variant clone" variation 807 /note="C is G in variant clone" variation 811 /note="G is A in variant clone" polyA_site 1317 /note="polyadenylation site" BASE COUNT 334 a 353 c 320 g 315 t ORIGIN 1 gaggcagttc tgttgccact ctctctcctg tcaatgatgg atctcagaaa taccccagcc 61 aaatctctgg acaagttcat tgaagactat ctcttgccag acacgtgttt ccgcatgcaa 121 atcgaccatg ccattgacat catctgtggg ttcctgaagg aaaggtgctt ccgaggtagc 181 tcctaccctg tgtgtgtgtc caaggtggta aagggtggct cctcaggcaa gggcaccacc 241 ctcagaggcc gatctgacgc tgacctggtt gtcttcctca gtcctctcac cacttttcag 301 gatcagttaa atcgccgggg agagttcatc caggaaatta ggagacagct ggaagcctgt 361 caaagagaga gagcactttc cgtgaagttt gaggtccagg ctccacgctg gggcaacccc 421 cgtgcgctca gcttcgtact gagttcgctc cagctcgggg agggggtgga gttcgatgtg 481 ctgcctgcct ttgatgccct gggtcagttg actggcagct ataaacctaa cccccaaatc 541 tatgtcaagc tcatcgagga gtgcaccgac ctgcagaaag agggcgagtt ctccacctgc 601 ttcacagaac tacagagaga cttcctgaag cagcgcccca ccaagctcaa gagcctcatc 661 cgcctagtca agcactggta ccaaaattgt aagaagaagc ttgggaagct gccacctcag 721 tatgccctgg agctcctgac ggtctatgct tgggagcgag ggagcatgaa aacacatttc 781 aacacagccc aaggatttcg gacggtcttg gaattagtca taaactacca gcaactctgc 841 atctactgga caaagtatta tgactttaaa aaccccatta ttgaaaagta cctgagaagg 901 cagctcacga aacccaggcc tgtgatcctg gacccggcgg accctacagg aaacttgggt 961 ggtggagacc caaagggttg gaggcagctg gcacaagagg ctgaggcctg gctgaattac 1021 ccatgcttta agaattggga tgggtcccca gtgagctcct ggattctgct ggtgagacct 1081 cctgcttcct ccctgccatt catccctgcc cctctccatg aagcttgaga catatagctg 1141 gagaccattc tttccaaaga acttacctct tgccaaaggc catttatatt catatagtga 1201 caggctgtgc tccatatttt acagtcattt tggtcacaat cgagggtttc tggaattttc 1261 acatcccttg tccagaattc attcccctaa gagtaataat aaataatctc taacaccaaa 1321 aa // LOCUS HSOBT1 909 bp RNA PRI 03-MAY-1995 DEFINITION H.sapiens OBF-1 mRNA for octamer binding factor 1. ACCESSION Z47550 NID g732792 KEYWORDS OBF-1; octamer binding factor 1. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 909) AUTHORS Strubin,M., Newell,J.W. and Matthias,P. TITLE OBF-1, a novel B cell-specific coactivator that stimulates immunoglobulin promoter activity through association with octamer-binding proteins JOURNAL Cell 80 (3), 497-506 (1995) MEDLINE 95163103 REFERENCE 2 (bases 1 to 909) AUTHORS Matthias,P. TITLE Direct Submission JOURNAL Submitted (11-JAN-1995) Patrick Matthias, Friedrich Miescher Institute, Mattenstrasse 22, Basel, 4002, Switzerland FEATURES Location/Qualifiers source 1..909 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pRS314/UNVP/clone9" /tissue_type="spleen" /cell_type="B cell" /cell_line="Namalwa" /clone_lib="pWR1/2" CDS 97..867 /codon_start=1 /product="OBF-1 (Oct Binding Factor 1)" /db_xref="PID:g732793" /translation="MLWQKPTAPEQAPAPARPYQGVRVKEPVKELLRRKRGHASSGAA PAPTAVVLPHQPLATYTTVGPSCLDMEGSVSAVTEEAALCAGWLSQPTPATLQPLAPW TPYTEYVPHEAVSCPYSADMYVQPVCPSYTVVGPSSVLTYASPPLITNVTTRSSATPA VGPPLEGPEHQAPLTYFPWPQPLSTLPTSTLQYQPPAPALPGPQFVQLPISIPEPVLQ DMEDPRRAASSLTIDKLLLEEEDSDAYALNHTLSVEGF" BASE COUNT 185 a 328 c 236 g 160 t ORIGIN 1 gcggtggctc cactggagga aaacacaccc cggtctcaca ttaaagaagc caaactgtcg 61 gcttcaaaga gaaaaggcaa catcctgtca caggccatgc tctggcaaaa acccacagct 121 ccggagcaag ccccagcccc ggcccggcca taccagggcg tccgtgtgaa ggagccagtg 181 aaggaactgc tgaggaggaa gcgaggccac gccagcagtg gggcagcacc tgcacctacg 241 gcggtggtgc tgccccatca gcccctggcg acctacacca cagtgggtcc ttcctgcctg 301 gacatggaag gttctgtgtc tgcagtgaca gaggaggctg ccctgtgtgc cggctggctc 361 tcccagccca ccccggccac cctgcagccc ctggccccat ggacacctta caccgagtat 421 gtgccccatg aagctgtcag ctgcccctac tcagctgaca tgtatgtgca gcccgtgtgc 481 cccagctaca cggtggtggg gccctcctca gtgttgacct atgcctctcc gccactcatc 541 accaatgtca cgacaagaag ctccgccacg cccgcagtgg ggcccccgct ggagggccca 601 gagcaccagg cacccctcac ctatttcccg tggcctcagc ccctttccac actacccacc 661 tccaccctgc agtaccagcc tccggcccca gccctacctg ggccccagtt tgtccagctc 721 cccatctcta tcccagagcc agtccttcag gacatggaag accccagaag agccgccagc 781 tcgttgacca tcgacaagct gcttttggag gaagaggata gcgacgccta tgcgcttaac 841 cacactctct ctgtggaagg cttttaggcg tggctcccac ctgagtcctg ttccctgaaa 901 ctgggattt // LOCUS HSOC2RNA 1669 bp RNA PRI 18-APR-1995 DEFINITION H.sapiens mRNA for cathepsin O. ACCESSION X82153 NID g562756 KEYWORDS cathepsin O; OC-2 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1669) AUTHORS Inaoka,T., Bilbe,G., Ishibashi,O., Tezuka,K., Kumegawa,M. and Kokubo,T. TITLE Molecular cloning of human cDNA for cathepsin K: novel cysteine proteinase predominantly expressed in bone JOURNAL Biochem. Biophys. Res. Commun. 206 (1), 89-96 (1995) MEDLINE 95118380 REFERENCE 2 (bases 1 to 1669) AUTHORS Inaoka,T. TITLE Direct Submission JOURNAL Submitted (10-OCT-1994) T. Inaoka, Bio-Organics Research Dept., International Research Laboratories, Ciba-Geigy, Japan Ltd., 10-66 Miyuki-cho, Takarazuka Hyogo 665, JAPAN FEATURES Location/Qualifiers source 1..1669 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="osteoarthritic hip joint bone" /dev_stage="adult" sig_peptide 130..174 /gene="hOC-2" CDS 130..1119 /gene="hOC-2" /codon_start=1 /product="Cathepsin O" /db_xref="PID:g562757" /db_xref="SWISS-PROT:P43235" /translation="MWGLKVLLLPVVSFALYPEEILDTHWELWKKTHRKQYNNKVDEI SRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPLSHSRS NDTLYIPEWEGRAPDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLL NLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKA AKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNH AVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASFPKM" gene 130..1119 /gene="hOC-2" polyA_signal 1655..1660 BASE COUNT 455 a 373 c 399 g 442 t ORIGIN 1 gaaacaagca ctggattcca tatcccactg ccaaaaccgc atggttcaga ttatcgctat 61 tgcagctttc atcataatac acacctttgc tgccgaaacg aagccagaca acagatttcc 121 atcagcagga tgtgggggct caaggttctg ctgctacctg tggtgagctt tgctctgtac 181 cctgaggaga tactggacac ccactgggag ctatggaaga agacccacag gaagcaatat 241 aacaacaagg tggatgaaat ctctcggcgt ttaatttggg aaaaaaacct gaagtatatt 301 tccatccata accttgaggc ttctcttggt gtccatacat atgaactggc tatgaaccac 361 ctgggggaca tgaccagtga agaggtggtt cagaagatga ctggactcaa agtacccctg 421 tctcattccc gcagtaatga caccctttat atcccagaat gggaaggtag agccccagac 481 tctgtcgact atcgaaagaa aggatatgtt actcctgtca aaaatcaggg tcagtgtggt 541 tcctgttggg cttttagctc tgtgggtgcc ctggagggcc aactcaagaa gaaaactggc 601 aaactcttaa atctgagtcc ccagaaccta gtggattgtg tgtctgagaa tgatggctgt 661 ggagggggct acatgaccaa tgccttccaa tatgtgcaga agaaccgggg tattgactct 721 gaagatgcct acccatatgt gggacaggaa gagagttgta tgtacaaccc aacaggcaag 781 gcagctaaat gcagagggta cagagagatc cccgagggga atgagaaagc cctgaagagg 841 gcagtggccc gagtgggacc tgtctctgtg gccattgatg caagcctgac ctccttccag 901 ttttacagca aaggtgtgta ttatgatgaa agctgcaata gcgataatct gaaccatgcg 961 gttttggcag tgggatatgg aatccagaag ggaaacaagc actggataat taaaaacagc 1021 tggggagaaa actggggaaa caaaggatat atcctcatgg ctcgaaataa gaacaacgcc 1081 tgtggcattg ccaacctggc cagcttcccc aagatgtgac tccagccagc caaatccatc 1141 ctgctcttcc atttcttcca cgatggtgca gtgtaacgat gcactttgga agggagttgg 1201 tgtgctattt ttgaagcaga tgtggtgata ctgagattgt ctgttcagtt tccccatttg 1261 tttgtgcttc aaatgatcct tcctactttg cttctctcca cccatgacct ttttcactgt 1321 ggccatcagg actttccctg acagctgtgt actcttaggc taagagatgt gactacagcc 1381 tgcccctgac tgtgttgtcc cagggctgat gctgtacagg tacaggctgg agattttcac 1441 ataggttaga ttctcattca cgggactagt tagctttaag caccctagag gactagggta 1501 atctgacttc tcacttccta agttcccttc tatatcctca aggtagaaat gtctatgttt 1561 tctactccaa ttcataaatc tattcataag tctttggtac aagtttacat gataaaaaga 1621 aatgtgattt gtcttccctt ctttgcactt ttgaaataaa gtatttatc // LOCUS HSOCT1 2584 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for octamer-binding protein Oct-1. ACCESSION X13403 NID g35126 KEYWORDS DNA-binding protein; homeobox; Oct-1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2584) AUTHORS Sturm,R.A. TITLE Direct Submission JOURNAL Submitted (28-OCT-1988) Sturm R.A., Cold Spring Harbor Lab, PO Box 100, Cold Spring Harbor, New York 11724, USA REFERENCE 2 (bases 1 to 2584) AUTHORS Sturm,R.A., Das,G. and Herr,W. TITLE The ubiquitous octamer-binding protein Oct-1 contains a POU domain with a homeo box subdomain JOURNAL Genes Dev. 2 (12A), 1582-1599 (1988) MEDLINE 89107993 FEATURES Location/Qualifiers source 1..2584 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="teratocarcinoma" /cell_line="NTera2D1" /clone_lib="lambda gt10 and 11" /clone="lambda C7 and lambda C5" CDS 60..2291 /note="Oct-1 protein (AA 1 - 743)" /codon_start=1 /db_xref="PID:g35127" /db_xref="SWISS-PROT:P14859" /translation="MNNPSETSKPSMESGDGNTGTQTNGLDFQKQPVPVGGAISTAQA QAFLGHLHQVQLAGTSLQAAAQSLNVQSKSNEESGDSQQPSQPSQQPSVQAAIPQTQL MLAGGQITGLTLTPAQQQLLLQQAQAQAQLLAAAVQQHSASQQHSAAGATISASAATP MTQIPLSQPIQIAQDLQQLQQLQQQNLNLQQFVLVHPTTNLQPAQFIISQTPQGQQGL LQAQNLQTQLPQQSQANLLQSQPSITLTSQPATPTRTIAATPIQTLPQSQSTPKRIDT PSLEEPSDLEELEQFAKTFKQRRIKLGFTQGDVGLAMGKLYGNDFSQTTISRFEALNL SFKNMCKLKPLLEKWLNDAENLSSDSSLSSPSALNSPGIEGLSRRRKKRTSIETNIRV ALEKSFLENQKPTSEEITMIADQLNMEKEVIRVWFCNRRQKEKRINPPSSGGTSSSPI KAIFPSPTSLVATTPSLVTSSAATTLTVSPVLPLTSAAVTNLSVTGTSDTTSNNTATV ISTAPPASSAVTSPSLSPSPSASASTSEASSASETSTTQTTSTPLSSPLGTSQVMVTA SGLQTAAAAALQGAAQLPANASLAAMAAAAGLNPSLMAPSQFAAGGALLSLNPGTLSG ALSPALMSNSTLATIQALASGGSLPITSLDATGNLVFANAGGAPNIVTAPLFLNPQNL SLLTSNPVSLVSAAAASAGNSAPVASLHATSTSAESIQNSLFTVASASGAASTTTTAS KAQ" misc_feature 866..1380 /note="DNA-binding domain" misc_feature 894..1121 /note="POU specific box (AA 279-354)" misc_feature 1194..1373 /note="homeo box (AA 379-438)" misc_feature 2359..2364 /note="pot. polyA signal" misc_feature 2509..2514 /note="pot.alt. polyA signal" BASE COUNT 704 a 782 c 578 g 520 t ORIGIN 1 gaggagcagc gagtcaagat gagagttcag ccgcggcggc agcagcagca gactcaagaa 61 tgaacaatcc gtcagaaacc agtaaaccat ctatggagag tggagatggc aacacaggca 121 cacaaaccaa tggtctggac tttcagaagc agcctgtgcc tgtaggagga gcaatctcaa 181 cagcccaggc gcaggctttc cttggacatc tccatcaggt ccaactcgct ggaacaagtt 241 tacaggctgc tgctcagtct ttaaatgtac agtctaaatc taatgaagaa tcgggggatt 301 cgcagcagcc aagccagcct tcccagcagc cttcagtgca ggcagccatt ccccagaccc 361 agcttatgct agctggagga cagataactg ggcttacttt gacgcctgcc cagcaacagt 421 tactactcca gcaggcacag gcacaggcac agctgctggc tgctgcagtg cagcagcact 481 ccgccagcca gcagcacagt gctgctggag ccaccatctc cgcctctgct gccacgccca 541 tgacgcagat ccccctgtct cagcccatac agatcgcaca ggatcttcaa caactgcaac 601 agcttcaaca gcagaatctc aacctgcaac agtttgtgtt ggtgcatcca accaccaatt 661 tgcagccagc gcagtttatc atctcacaga cgccccaggg ccagcagggt ctcctgcaag 721 cgcaaaatct tcaaacgcaa ctacctcagc aaagccaagc caacctccta cagtcgcagc 781 caagcatcac cctcacctcc cagccagcaa ccccaacacg cacaatagca gcaaccccaa 841 ttcagacact tccacagagc cagtcaacac caaagcgaat tgatactccc agcttggagg 901 agcccagtga ccttgaggag cttgagcagt ttgccaagac cttcaaacaa agacgaatca 961 aacttggatt cactcagggt gatgttgggc tcgctatggg gaaactatat ggaaatgact 1021 tcagccaaac taccatctct cgatttgaag ccttgaacct cagctttaag aacatgtgca 1081 agttgaagcc acttttagag aagtggctaa atgatgcaga gaacctctca tctgattcgt 1141 ccctctccag cccaagtgcc ctgaattctc caggaattga gggcttgagc cgtaggagga 1201 agaaacgcac cagcatagag accaacatcc gtgtggcctt agagaagagt ttcttggaga 1261 atcaaaagcc tacctcggaa gagatcacta tgattgctga tcagctcaat atggaaaaag 1321 aggtgattcg tgtttggttc tgtaaccgcc gccagaaaga aaaaagaatc aacccaccaa 1381 gcagtggtgg gaccagcagc tcacctatta aagcaatttt ccccagccca acttcactgg 1441 tggcgaccac accaagcctt gtgactagca gtgcagcaac taccctcaca gtcagccctg 1501 tcctccctct gaccagtgct gctgtgacga atctttcagt tacaggcact tcagacacca 1561 cctccaacaa cacagcaacc gtgatttcca cagcgcctcc agcttcctca gcagtcacgt 1621 ccccctctct gagtccctcc ccttctgcct cagcctccac ctccgaggca tccagtgcca 1681 gtgagaccag cacaacacag accacctcca ctcctttgtc ctcccctctt gggaccagcc 1741 aggtgatggt gacagcatca ggtttgcaaa cagcagcagc tgctgccctt caaggagctg 1801 cacagttgcc agcaaatgcc agtcttgctg ccatggcagc tgctgcagga ctaaacccaa 1861 gcctgatggc accctcacag tttgcggctg gaggtgcctt actcagtctg aatccaggga 1921 ccctgagcgg tgctctcagc ccagctctaa tgagcaacag tacactggca actattcaag 1981 ctcttgcttc tggtggctct cttccaataa catcacttga tgcaactggg aacctggtat 2041 ttgccaatgc gggaggagcc cccaacatcg tgactgcccc tctgttcctg aaccctcaga 2101 acctctctct gctcaccagc aaccctgtta gcttggtctc tgccgccgca gcatctgcag 2161 ggaactctgc acctgtagcc agccttcacg ccacctccac ctctgctgag tccatccaga 2221 actctctctt cacagtggcc tctgccagcg gggctgcgtc caccaccacc accgcctcca 2281 aggcacagtg agctgggcag agctgggctg ccagaagcct ttttcactct gcagtgtgat 2341 tggactgcca gccaggttaa taaactgaaa aatgtgattg gcttcctctc gccgtgttgt 2401 gagggcaaag gagagaaggg agaaaaaaaa aaaaaaaacc acacacaccc atacacaata 2461 taccagaaaa ggaaggaagg atggagacgg aacatttgcc taatttgtaa taaaacactg 2521 tcttttcagg gttgcttcat gggttggagg actttctaac caaaaattaa aaaaaaaaaa 2581 aaaa // LOCUS HSOCT2A 1717 bp RNA PRI 12-SEP-1993 DEFINITION Human oct-2 mRNA for B-cell specific transcription factor NF-A2. ACCESSION X53468 Y00227 NID g35128 KEYWORDS alternative splicing; B-cell specific transcription factor; DNA-binding protein; leucine zipper; Oct-2 gene; transcription factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1717) AUTHORS Clerc,R.G., Corcoran,L.M., LeBowitz,J.H., Baltimore,D. and Sharp,P.A. TITLE The B-cell-specific Oct-2 protein contains POU box- and homeo box-type domains JOURNAL Genes Dev. 2 (12A), 1570-1581 (1988) MEDLINE 89107992 COMMENT see x53469 for oct-2 cDNA clone showing different Oct-2 protein C-term. Alternative splicing of the oct-2 transcript is purposed. FEATURES Location/Qualifiers source 1..1717 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="B-cell" /cell_line="BJAB" /clone_lib="lambda gt11" /clone="pass-5.5, pass-3, 3-1." CDS 67..1470 /note="Oct-2 protein (AA 1-467)" /codon_start=1 /db_xref="PID:g35129" /translation="MVHSSMGAPEIRMSKPLEAEKQGLDSPSEHTDTERNGPDTNHQN PQNKTSPFSVSPTGPSTKIKAEDPSGDSAPAAPLPPQPAQPHLPQAQLMLTGSQLAGD IQQLLQLQQLVLVPGHHLQPPAQFLLPQAQQSQPGLLPTPNLFQLPQQTQGALLTSQP RAGLPTQAVTRPTLPDPHLSHPQPPKCLEPPSHPEEPSDLEELEQFARTFKQRRIKLG FTQGDVGLAMGKLYGNDFSQTTISRFEALNLSFKNMCKLKPLLEKWLNDAETMSVDSS LPSPNQLSSPSLGFDGLPGRRRKKRTSIETNVRFALEKSFLANQKPTSEEILLIAEQL HMEKEVIRVWFCNRRQKEKRINPCSAAPMLPSPGKPASYSPHMVTPQGGAGTLPLSQA SSSLSTTVTTLSSAVGTLHPSRTAGGGGGGGGAAPPLNSIPSVTPPPPATTNSTNPSP QGSHSAIGLSGLNPSTG" CDS 735..1568 /note="long overlapping ORF (no atg) (278 AA); Author-given protein sequence is in conflict with the conceptual translation" /codon_start=1 /db_xref="PID:e21891" /db_xref="PID:g1335237" /translation="CGPGHGQALRQRLQPDDHFPLRGPQPELQEHVQTQAPPGEVAQR CRDYVCGLKPAQPQPAEQPQPGFRRPARPETQEEDQHRDKRPLRLREEFSSEPEAYLR GDPADRRAAAHGEGSDPRLVLQPAPEGETHQPLQCGPHAAQPREAGQLQPPYGHTPRG RGDLTVVPSFQQSEHNSYYLILSCGDAPPQPDSWRGWGRGRGCAPPQFHPLCHSPTPG HHQQHKPQPSRQPLGYRLVRPEPQHGVSGCTWEAVGRSRVAAASRVGSGTPVMLAGPC PC" misc_feature 1466..1467 /note="site of divergence between analyzed cDNA clones (see X53469)" BASE COUNT 374 a 611 c 461 g 271 t ORIGIN 1 ctggggcccc cagagagggt ggggagatga cacagttgtt cccccagccc tggcggggcg 61 ggcagcatgg ttcactccag catgggggct ccagaaataa gaatgtctaa gcccctggag 121 gccgagaagc aaggtctgga ctccccatca gagcacacag acaccgaaag aaatggacca 181 gacactaatc atcagaaccc ccaaaataag acctccccat tctccgtgtc cccaactggc 241 cccagtacaa agatcaaggc tgaagacccc agtggcgatt cagccccagc agcacccctg 301 ccccctcagc cggcccagcc tcatctgccc caggcccaac tcatgttgac gggcagccag 361 ctagctgggg acatacagca gctcctccag ctccagcagc tggtgcttgt gccaggccac 421 cacctccagc cacctgctca gttcctgcta ccgcaggccc agcagagcca gccaggcctg 481 ctaccgacac caaatctatt ccagctacct cagcaaaccc agggagctct tctgacctcc 541 cagccccggg ccgggcttcc cacacaggcc gtgacccgcc ctacgctgcc cgacccgcac 601 ctctcgcacc cgcagccccc caaatgcttg gagccaccat cccaccccga ggagcccagt 661 gatctggagg agctggagca attcgcccgc accttcaagc aacgccgcat caagctgggc 721 ttcacgcagg gtgatgtggg cctggccatg ggcaagctct acggcaacga cttcagccag 781 acgaccattt cccgcttcga ggccctcaac ctgagcttca agaacatgtg caaactcaag 841 cccctcctgg agaagtggct caacgatgca gagactatgt ctgtggactc aagcctgccc 901 agccccaacc agctgagcag ccccagcctg ggtttcgacg gcctgcccgg ccggagacgc 961 aagaagagga ccagcatcga gacaaacgtc cgcttcgcct tagagaagag ttttctagcg 1021 aaccagaagc ctacctcaga ggagatcctg ctgatcgccg agcagctgca catggagaag 1081 gaagtgatcc gcgtctggtt ctgcaaccgg cgccagaagg agaaacgcat caacccctgc 1141 agtgcggccc ccatgctgcc cagcccaggg aagccggcca gctacagccc ccatatggtc 1201 acaccccaag ggggcgcggg gaccttaccg ttgtcccaag cttccagcag tctgagcaca 1261 acagttacta ccttatcctc agctgtgggg acgctccacc ccagccggac agctggaggg 1321 ggtgggggcg ggggcggggc tgcgcccccc ctcaattcca tcccctctgt cactccccca 1381 cccccggcca ccaccaacag cacaaacccc agccctcaag gcagccactc ggctatcggc 1441 ttgtcaggcc tgaaccccag cacggggtaa gtgggtgcac gtgggaagct gtggggagaa 1501 gcagggtcgc tgctgcttct agggtgggga gcggcacccc agttatgttg gcaggtccct 1561 gcccctgcta atgcctctgc tttgcctctt gcagaagcac aatggtgggg ttgagctccg 1621 gctgagtcca gccctcatga gcaacaaccc tttggccact atccaaggtg cgtgctgcct 1681 catgtcacac ccatcgtcac cagccccgga attcgag // LOCUS HSOCTK 2257 bp RNA PRI 25-JUL-1997 DEFINITION H.sapiens mRNA for organic cation transporter, kidney. ACCESSION X98333 NID g2281941 KEYWORDS ion transporter. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2257) AUTHORS Gorboulev,V., Ulzheimer,J.C., Akhoundova,A., Ulzheimer-Teuber,I., Karbach,U., Quester,S., Baumann,C., Lang,F., Busch,A.E. and Koepsell,H. TITLE Cloning and characterization of two human polyspecific organic cation transporters JOURNAL DNA Cell Biol. 16 (7), 871-881 (1997) MEDLINE 97405886 REFERENCE 2 (bases 1 to 2257) AUTHORS Gorboulev,V.G. TITLE Direct Submission JOURNAL Submitted (05-JUN-1996) V.G. Gorboulev, Anatomisches Institut, Bayerische Maximilians Universitaet, Koellikerstr. 6, D- 97070 Wuerzburg, FRG REMARK Revised by submittor 02-OCT-1996 FEATURES Location/Qualifiers source 1..2257 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" CDS 145..1812 /codon_start=1 /product="organic cation transporter" /db_xref="PID:e269560" /db_xref="PID:g2281942" /translation="MPTTVDDVLEHGGEFHFFQKQMFFLLALLSATFAPIYVGIVFLG FTPDHRCRSPGVAELSLRCGWSPAEELNYTVPGPGPAGEASPRQCRRYEVDWNQSTFD CVDPLASLDTNRSRLPLGPCRDGWVYETPGSSIVTEFNLVCANSWMLDLFQSSVNVGF FIGSMSIGYIADRFGRKLCLLTTVLINAAAGVLMAISPTYTWMLIFRLIQGLVSKAGW LIGYILITEFVGRRYRRTVGIFYQVAYTVGLLVLAGVAYALPHWRWLQFTVALPNFFF LLYYWCIPESPRWLISQNKNAEAMRIIKHIAKKNGKSLPASLQRLRLEEETGKKLNPS FLDLVRTPQIRKHTMILMYNWFTSSVLYQGLIMHMGLAGDNIYLDFFYSALVEFPAAF MIILTIDRIGRRYPWAASNMVAGAACLASVFIPGDLQWLKIIISCLGRMGITMAYEIV CLVNAELYPTFIRNLGVHICSSMCDIGGIITPFLVYRLTNIWLELPLMVFGVLGLVAG GLVLLLPETKGKALPETIEEAENMQRPRKNKEKMIYLQVQKLDIPLN" polyA_site 2235 BASE COUNT 556 a 581 c 540 g 578 t 2 others ORIGIN 1 ggccctgccc tgaaggctgg tcacttgcag aggtaaactc ccctctttga cttctggcca 61 gggtttgtgc tgagctggct gcagccgctc tcagcctcgc tccgggcacg tcgggcagcc 121 tcgggccctc ctgcctgcag gatcatgccc accaccgtgg acgatgtcct ggagcatgga 181 ggggagtttc actttttcca gaagcaaatg tttttcctct tggctctgct ctcggctacc 241 ttcgcgccca tctacgtggg catcgtcttc ctgggcttca cccctgacca ccgctgccgg 301 agccccggag tggccgagct gagtctgcgc tgcggctgga gtcctgcaga ggaactgaac 361 tacacggtgc cgggcccagg acctgcgggc gaagcctccc caagacagtg taggcgctac 421 gaggtggact ggaaccagag cacctttgac tgcgtggacc ccctggccag cctggacacc 481 aacaggagcc gcctgccact gggcccctgc cgggacggct gggtgtacga gacgcctggc 541 tcgtccatcg tcaccgagtt taacctggta tgtgccaact cctggatgtt ggacctattc 601 cagtcatcag tgaatgtagg attctttatt ggctctatga gtatcggcta catagcagac 661 aggtttggcc gtaagctctg cctcctaact acagtcctca taaatgctgc agctggagtt 721 ctcatggcca tttccccaac ctatacgtgg atgttaattt ttcgcttaat ccaaggactg 781 gtcagcaaag caggctggtt aataggctac atcctgatta cagaatttgt tgggcggaga 841 tatcggagaa cagtggggat tttttaccaa gttgcctata cagttgggct cctggtgcta 901 gctggggtgg cttacgcact tcctcactgg aggtggttgc agttcacagt tgctctgccc 961 aacttcttct tcttgctcta ttactggtgc atacctgagt ctcccaggtg gctgatctcc 1021 cagaataaga atgctgaagc catgagaatc attaagcaca tcgcaaagaa aaatggaaaa 1081 tctctacccg cctcccttca gcgcctgaga cttgaagagg aaactggcaa gaaattgaac 1141 ccttcatttc ttgacttggt cagaactcct cagataagga aacatactat gatattgatg 1201 tacaactggt tcacgagctc tgtgctctac cagggcctca tcatgcacat gggccttgca 1261 ggtgacaata tctacctgga tttcttctac tctgccctgg ttgaattccc agctgccttc 1321 atgatcatcc tcaccatcga ccgcatcgga cgccgttacc cttgggctgc atcaaatatg 1381 gttgcagggg cagcctgtct ggcctcagtt tttatacctg gtgatctaca atggctaaaa 1441 attattatct catgcttggg aagaatgggg atcacaatgg cctatgagat agtctgcctg 1501 gtcaatgctg agctgtaccc cacattcatt aggaatcttg gcgtccacat ctgttcctca 1561 atgtgtgaca ttggtggcat catcacgcca ttcctggtct accggctcac taacatctgg 1621 cttgagctcc cgctgatggt tttcggcgta cttggcttgg ttgctggagg tctggtgctg 1681 ttgcttccag aaactaaagg gaaagctttg cctgagacca tcgaggaagc cgaaaatatg 1741 caaagaccaa gaaaaaataa agaaaagatg atttacctcc aagttcagaa actagacatt 1801 ccattgaact aagaagagag accgttgctg ctgtcatgac ctagctttga tggcagcaag 1861 accaaaagta gaaatccctg cactcatcac aaagcccata caactcaacc aaacttaccc 1921 ctgagcccta tcaacctagg tctacagcca gtggagtcta ttgtacactg tggaaaaata 1981 cccatgggac cagatcctgc caaattcttc cagctcactt tattctcagc attcctagga 2041 cattggacat tggttttctg gagggttttt tttccgatct ttgtattttt ttaaatttga 2101 ttcttttctt tgcaatgcta gcaaccagaa tacatagggg aactgtgggc taggcaaana 2161 aaatagaaaa agtgtgaaaa acagtaaagt tgggagagga gcatctattt tcttaaagaa 2221 ataaaacacc naaaacaaaa aaaaaaaaaa aaaaaaa // LOCUS HSOCTL 1888 bp RNA PRI 10-OCT-1997 DEFINITION H.sapiens mRNA for organic cation transporter, liver. ACCESSION X98332 NID g2511669 KEYWORDS ion transporter. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1888) AUTHORS Gorboulev,V., Ulzheimer,J.C., Akhoundova,A., Ulzheimer-Teuber,I., Karbach,U., Quester,S., Baumann,C., Lang,F., Busch,A.E. and Koepsell,H. TITLE Cloning and characterization of two human polyspecific organic cation transporters JOURNAL DNA Cell Biol. 16 (7), 871-881 (1997) MEDLINE 97405886 REFERENCE 2 (bases 1 to 1888) AUTHORS Gorboulev,V.G. TITLE Direct Submission JOURNAL Submitted (05-JUN-1996) V.G. Gorboulev, Anatomisches Institut, Bayerische Maximilians Universitaet, Koellikerstr. 6, D- 97070 Wuerzburg, FRG REMARK revised by [3] REFERENCE 3 (bases 1 to 1888) AUTHORS Gorboulev,V.G. TITLE Direct Submission JOURNAL Submitted (31-JUL-1997) V.G. Gorboulev, Anatomisches Institut, Bayerische Maximilians Universitaet, Koellikerstr. 6, D- 97070 Wuerzburg, FRG REMARK revised by [4] REFERENCE 4 (bases 1 to 1888) AUTHORS Gorboulev,V.G. TITLE Direct Submission JOURNAL Submitted (10-OCT-1997) V.G. Gorboulev, Anatomisches Institut, Bayerische Maximilians Universitaet, Koellikerstr. 6, D- 97070 Wuerzburg, FRG FEATURES Location/Qualifiers source 1..1888 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" CDS 73..1737 /codon_start=1 /product="organic cation transporter" /db_xref="PID:e354303" /db_xref="PID:g2511670" /translation="MPTVDDILEQVGESGWFQKQAFLILCLLSAAFAPICVGIVFLGF TPDHHCQSPGVAELSQRCGWSPAEELNYTVPGLGPAGEAFLGQCRRYEVDWNQSALSC VDPLASLATNRSHLPLGPCQDGWVYDTPGSSIVTEFNLVCADSWKLDLFQSCLNAGFF FGSLGVGYFADRFGRKLCLLGTVLVNAVSGVLMAFSPNYMSMLLFRLLQGLVSKGNWM AGYTLITEFVGSGSRRTVAIMYQMAFTVGLVALTGLAYALPHWRWLQLAVSLPTFLFL LYYWCVPESPRWLLSQKRNTEAIKIMDHIAQKNGKLPPADLKMLSLEEDVTEKLSPSF ADLFRTPRLRKRTFILMYLWFTDSVLYQGLILHMGATSGNLYLDFLYSALVEIPGAFI ALITIDRVGRIYPMAMSNLLAGAACLVMIFISPDLHWLNIIIMCVGRMGITIAIQMIC LVNAELYPTFVRNLGVMVCSSLCDIGGIITPFIVFRLREVWQALPLILFAVLGLLAAG VTLLLPETKGVALPETMKDAENLGRKAKPKENTIYLKVQTSEPSGT" polyA_site 1868 BASE COUNT 379 a 541 c 518 g 450 t ORIGIN 1 gagggagaca ttgcacctgg ccactgcagc ccagagcagg tctggccacg gccatgagca 61 tgctgagcca tcatgcccac cgtggatgac attctggagc aggttgggga gtctggctgg 121 ttccagaagc aagccttcct catcttatgc ctgctgtcgg ctgcctttgc gcccatctgt 181 gtgggcatcg tcttcctggg tttcacacct gaccaccact gccagagccc tggggtggct 241 gagctgagcc agcgctgtgg ctggagccct gcggaggagc tgaactatac agtgccaggc 301 ctggggcccg cgggcgaggc cttccttggc cagtgcaggc gctatgaagt ggactggaac 361 cagagcgccc tcagctgtgt agaccccctg gctagcctgg ccaccaacag gagccacctg 421 ccgctgggtc cctgccagga tggctgggtg tatgacacgc ccggctcttc catcgtcact 481 gagttcaacc tggtgtgtgc tgactcctgg aagctggacc tctttcagtc ctgtttgaat 541 gcgggcttct tctttggctc tctcggtgtt ggctactttg cagacaggtt tggccgtaag 601 ctgtgtctcc tgggaactgt gctggtcaac gcggtgtcgg gcgtgctcat ggccttctcg 661 cccaactaca tgtccatgct gctcttccgc ctgctgcagg gcctggtcag caagggcaac 721 tggatggctg gctacaccct aatcacagaa tttgttggct cgggctccag aagaacggtg 781 gcgatcatgt accagatggc cttcacggtg gggctggtgg cgcttaccgg gctggcctac 841 gccctgcctc actggcgctg gctgcagctg gcagtctccc tgcccacctt cctcttcctg 901 ctctactact ggtgtgtgcc ggagtcccct cggtggctgt tatcacaaaa aagaaacact 961 gaagcaataa agataatgga ccacatcgct caaaagaatg ggaagttgcc tcctgctgat 1021 ttaaagatgc tttccctcga agaggatgtc accgaaaagc tgagcccttc atttgcagac 1081 ctgttccgca cgccgcgcct gaggaagcgc accttcatcc tgatgtacct gtggttcacg 1141 gactctgtgc tctatcaggg gctcatcctg cacatgggcg ccaccagcgg gaacctctac 1201 ctggatttcc tttactccgc tctggtcgaa atcccggggg ccttcatagc cctcatcacc 1261 attgaccgcg tgggccgcat ctaccccatg gccatgtcaa atttgttggc gggggcagcc 1321 tgcctcgtca tgatttttat ctcacctgac ctgcactggt taaacatcat aatcatgtgt 1381 gttggccgaa tgggaatcac cattgcaata caaatgatct gcctggtgaa tgctgagctg 1441 taccccacat tcgtcaggaa cctcggagtg atggtgtgtt cctccctgtg tgacataggt 1501 gggataatca cccccttcat agtcttcagg ctgagggagg tctggcaagc cttgcccctc 1561 attttgtttg cggtgttggg cctgcttgcc gcgggagtga cgctacttct tccagagacc 1621 aagggggtcg ctttgccaga gaccatgaag gacgccgaga accttgggag aaaagcaaag 1681 cccaaagaaa acacgattta ccttaaggtc caaacctcag aaccctcggg cacctgagag 1741 agatgttttg cggcgatgtc gtgttggagg gatgaagatg gagttatcct ctgcagaaat 1801 tcctagacgc cttcacttct ctgtattctt cctcatactt gcctaccccc aaattaatat 1861 cagtcctaaa gaaaaaaaaa aaaaaaaa // LOCUS HSONHORE 1450 bp RNA PRI 07-MAR-1995 DEFINITION H.sapiens mRNA for orphan nuclear hormone receptor. ACCESSION Z30425 L29263 NID g458541 KEYWORDS nuclear hormone receptor; orphan receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1450) AUTHORS Baes,M., Gulick,T., Choi,H.S., Martinoli,M.G., Simha,D. and Moore,D.D. TITLE A new orphan member of the nuclear hormone receptor superfamily that interacts with a subset of retinoic acid response elements JOURNAL Mol. Cell. Biol. 14 (3), 1544-1551 (1994) MEDLINE 94158827 REFERENCE 2 (bases 1 to 1450) AUTHORS Moore,D.D. TITLE Direct Submission JOURNAL Submitted (04-MAR-1994) David D. Moore, Molecular Biology, Massachusetts General Hospital, Boston, MA, O2114, USA FEATURES Location/Qualifiers source 1..1450 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="mb67" /tissue_type="liver" /clone_lib="cDNA" CDS 273..1319 /codon_start=1 /product="orphan nuclear hormone receptor" /db_xref="PID:g458542" /translation="MASREDELRNCVVCGDQATGYHFNALTCEGCKGFFRRTVSKSIG PTCPFAGSCEVSKTQRRHCPACRLQKCLDAGMRKDMILSAEALALRRAKQAQRRAQQT PVQLSKEQEELIRTLLGAHTRHMGTMFEQFVQFRPPAHLFIHHQPLPTLAPVLPLVTH FADINTFMVLQVIKFTKDLPVFRSLPIEDQISLLKGAAVEICHIVLNTTFCLQTQNFL CGPLRYTIEDGARVGFQVEFLELLFHFHGTLRKLQLQEPEYVLLAAMALFSPDRPGVT QRDEIDQLQEEMALTLQSYIKGQQRRPRDRFLYAKLLGLLAELRSINEAYGYQIQHIQ GLSAMMPLLQEICS" BASE COUNT 359 a 399 c 380 g 312 t ORIGIN 1 gtgagcttgc tccttaagtt acaggaactc tccttataat agacacttca ttttcctagt 61 ccatccctca tgaaaaatga ctgaccactg ctgggcagca ggagggatga taatcctaac 121 tccaatcact ggcaactcct gagatcagag gaaaaccagc aacagcgtgg gagtttgggg 181 agaggcattc cataccagat tctgtggcct gcaggtgaca tgctgcctaa gagaagcagg 241 agtctgtgac agccacccca acacgtgacg tcatggccag tagggaagat gagctgagga 301 actgtgtggt atgtggggac caagccacag gctaccactt taatgcgctg acttgtgagg 361 gctgcaaggg tttcttcagg agaacagtca gcaaaagcat tggtcccacc tgcccctttg 421 ctggaagctg tgaagtcagc aagactcaga ggcgccactg cccagcctgc aggttgcaga 481 agtgcttaga tgctggcatg aggaaagaca tgatactgtc ggcagaagcc ctggcattgc 541 ggcgagcaaa gcaggcccag cggcgggcac agcaaacacc tgtgcaactg agtaaggagc 601 aagaagagct gatccggaca ctcctggggg cccacacccg ccacatgggc accatgtttg 661 aacagtttgt gcagtttagg cctccagctc atctgttcat ccatcaccag cccttgccca 721 ccctggcccc tgtgctgcct ctggtcacac acttcgcaga catcaacact ttcatggtac 781 tgcaagtcat caagtttact aaggacctgc ccgtcttccg ttccctgccc attgaagacc 841 agatctccct tctcaaggga gcagctgtgg aaatctgtca catcgtactc aataccactt 901 tctgtctcca aacacaaaac ttcctctgcg ggcctcttcg ctacacaatt gaagatggag 961 cccgtgtggg gttccaggta gagtttttgg agttgctctt tcacttccat ggaacactac 1021 gaaaactgca gctccaagag cctgagtatg tgctcttggc tgccatggcc ctcttctctc 1081 ctgaccgacc tggagttacc cagagagatg agattgatca gctgcaagag gagatggcac 1141 tgactctgca aagctacatc aagggccagc agcgaaggcc ccgggatcgg tttctgtatg 1201 cgaagttgct aggcctgctg gctgagctcc ggagcattaa tgaggcctac gggtaccaaa 1261 tccagcacat ccagggcctg tctgccatga tgccgctgct ccaggagatc tgcagctgag 1321 gccatgctca cttccttccc cagctcacct ggaacaccct ggatacactg gagtgggaaa 1381 atgctgggac caaagattgg gccgggttca aagggagccc agtggttgca atgaaagact 1441 aaagcaaaac // LOCUS HSOP1 1878 bp RNA PRI 23-MAR-1995 DEFINITION Human OP-1 mRNA for osteogenic protein. ACCESSION X51801 NID g35151 KEYWORDS OP-1 gene; osteogenic protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1878) AUTHORS Oppermann,H. TITLE Direct Submission JOURNAL Submitted (04-FEB-1990) Oppermann H., Creative Biomolecules Inc, 35 South Street, Hopkinton MA 01748, U S A REFERENCE 2 (bases 1 to 1878) AUTHORS Ozkaynak,E., Rueger,D.C., Drier,E.A., Corbett,C., Ridge,R.J., Sampath,T.K. and Oppermann,H. TITLE OP-1 cDNA encodes an osteogenic protein in the TGF-beta family JOURNAL EMBO J. 9 (7), 2085-2093 (1990) MEDLINE 90291971 FEATURES Location/Qualifiers source 1..1878 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" CDS 123..1418 /note="pre-propolypeptide (AA -29 to 402)" /codon_start=1 /db_xref="PID:g35152" /db_xref="SWISS-PROT:P18075" /translation="MHVRSLRAAAPHSFVALWAPLFLLRSALADFSLDNEVHSSFIHR RLRSQERREMQREILSILGLPHRPRPHLQGKHNSAPMFMLDLYNAMAVEEGGGPGGQG FSYPYKAVFSTQGPPLASLQDSHFLTDADMVMSFVNLVEHDKEFFHPRYHHREFRFDL SKIPEGEAVTAAEFRIYKDYIRERFDNETFRISVYQVLQEHLGRESDLFLLDSRTLWA SEEGWLVFDITATSNHWVVNPRHNLGLQLSVETLDGQSINPKLAGLIGRHGPQNKQPF MVAFFKATEVHFRSIRSTGSKQRSQNRSKTPKNQEALRMANVAENSSSDQRQACKKHE LYVSFRDLGWQDWIIAPEGYAAYYCEGECAFPLNSYMNATNHAIVQTLVHFINPETVP KPCCAPTQLNAISVLYFDDSSNVILKKYRNMVVRACGCH" sig_peptide 123..209 /note="signal peptide (AA -29 to -1)" misc_feature 210..1037 /note="propeptide (AA 1-276)" misc_feature 210..1020 /note="propeptide (alt.) (AA 1-270)" mat_peptide 1021..1415 /note="mature osteogenic protein (AA 271-402)" mat_peptide 1038..1415 /note="mature osteogenic protein (AA 277-402)" misc_feature 1850..1855 /note="polyA signal" misc_feature 1862..1867 /note="polyA signal" polyA_site 1878 /note="polyA site" BASE COUNT 411 a 592 c 541 g 334 t ORIGIN 1 gggcgcagcg gggcccgtct gcagcaagtg accgacggcc gggacggccg cctgccccct 61 ctgccacctg gggcggtgcg ggcccggagc ccggagcccg ggtagcgcgt agagccggcg 121 cgatgcacgt gcgctcactg cgagctgcgg cgccgcacag cttcgtggcg ctctgggcac 181 ccctgttcct gctgcgctcc gccctggccg acttcagcct ggacaacgag gtgcactcga 241 gcttcatcca ccggcgcctc cgcagccagg agcggcggga gatgcagcgc gagatcctct 301 ccattttggg cttgccccac cgcccgcgcc cgcacctcca gggcaagcac aactcggcac 361 ccatgttcat gctggacctg tacaacgcca tggcggtgga ggagggcggc gggcccggcg 421 gccagggctt ctcctacccc tacaaggccg tcttcagtac ccagggcccc cctctggcca 481 gcctgcaaga tagccatttc ctcaccgacg ccgacatggt catgagcttc gtcaacctcg 541 tggaacatga caaggaattc ttccacccac gctaccacca tcgagagttc cggtttgatc 601 tttccaagat cccagaaggg gaagctgtca cggcagccga attccggatc tacaaggact 661 acatccggga acgcttcgac aatgagacgt tccggatcag cgtttatcag gtgctccagg 721 agcacttggg cagggaatcg gatctcttcc tgctcgacag ccgtaccctc tgggcctcgg 781 aggagggctg gctggtgttt gacatcacag ccaccagcaa ccactgggtg gtcaatccgc 841 ggcacaacct gggcctgcag ctctcggtgg agacgctgga tgggcagagc atcaacccca 901 agttggcggg cctgattggg cggcacgggc cccagaacaa gcagcccttc atggtggctt 961 tcttcaaggc cacggaggtc cacttccgca gcatccggtc cacggggagc aaacagcgca 1021 gccagaaccg ctccaagacg cccaagaacc aggaagccct gcggatggcc aacgtggcag 1081 agaacagcag cagcgaccag aggcaggcct gtaagaagca cgagctgtat gtcagcttcc 1141 gagacctggg ctggcaggac tggatcatcg cgcctgaagg ctacgccgcc tactactgtg 1201 agggggagtg tgccttccct ctgaactcct acatgaacgc caccaaccac gccatcgtgc 1261 agacgctggt ccacttcatc aacccggaaa cggtgcccaa gccctgctgt gcgcccacgc 1321 agctcaatgc catctccgtc ctctacttcg atgacagctc caacgtcatc ctgaagaaat 1381 acagaaacat ggtggtccgg gcctgtggct gccactagct cctccgagaa ttcagaccct 1441 ttggggccaa gtttttctgg atcctccatt gctcgccttg gccaggaacc agcagaccaa 1501 ctgccttttg tgagaccttc ccctccctat ccccaacttt aaaggtgtga gagtattagg 1561 aaacatgagc agcatatggc ttttgatcag tttttcagtg gcagcatcca atgaacaaga 1621 tcctacaagc tgtgcaggca aaacctagca ggaaaaaaaa acaacgcata aagaaaaatg 1681 gccgggccag gtcattggct gggaagtctc agccatgcac ggactcgttt ccagaggtaa 1741 ttatgagcgc ctaccagcca ggccacccag ccgtgggagg aagggggcgt ggcaaggggt 1801 gggcacattg gtgtctgtgc gaaaggaaaa ttgacccgga agttcctgta ataaatgtca 1861 caataaaacg aatgaatg // LOCUS HSORL1 1973 bp RNA PRI 12-APR-1994 DEFINITION H.sapiens mRNA for ORL1 receptor. ACCESSION X77130 NID g471316 KEYWORDS opioid receptor; ORL1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1973) AUTHORS Mollereau,C., Parmentier,M., Mailleux,P., Butour,J.L., Moisand,C., Chalon,P., Caput,D., Vassart,G. and Meunier,J.C. TITLE ORL1, a novel member of the opioid receptor family. Cloning, functional expression and localization JOURNAL FEBS Lett. 341 (1), 33-38 (1994) MEDLINE 94185768 REFERENCE 2 (bases 1 to 1973) AUTHORS Parmentier,M. TITLE Direct Submission JOURNAL Submitted (10-JAN-1994) M. Parmentier, Universite Libre de Bruxelles, I R I B H N ULB Campus Erasme, 808 Route de Lennik, 1070 Bruxelles, BELGIUM FEATURES Location/Qualifiers source 1..1973 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain stem" /clone_lib="pTZ18R vector" /clone="hORL1" gene 178..1290 /gene="ORL1" CDS 178..1290 /gene="ORL1" /codon_start=1 /db_xref="PID:g471317" /db_xref="SWISS-PROT:P41146" /translation="MEPLFPAPFWEVIYGSHLQGNLSLLSPNHSLLPPHLLLNASHGA FLPLGLKVTIVGLYLAVCVGGLLGNCLVMYVILRHTKMKTATNIYIFNLALADTLVLL TLPFQGTDILLGFWPFGNALCKTVIAIDYYNMFTSTFTLTAMSVDRYVAICHPIRALD VRTSSKAQAVNVAIWALASVVGVPVAIMGSAQVEDEEIECLVEIPTPQDYWGPVFAIC IFLFSFIVPVLVISVCYSLMIRRLRGVRLLSGSREKDRNLRRITRLVLVVVAVFVGCW TPVQVFVLAQGLGVQPSSETAVAILRFCTALGYVNSCLNPILYAFLDENFKACFRKFC CASALRRDVQVSDRVRSIAKDVALACKTSETVPRPA" BASE COUNT 315 a 650 c 587 g 421 t ORIGIN 1 ctgccggctc actcggctgc tgcgtctggt ctggcgtctg ctgagaagat cctcttctac 61 cctgctctgc acctgtgctc gactgccagc cggctgaggg cgggggtctc cacggtggtc 121 ccagctccca aggaggttgc agaagtaccg tacagagtgg atttgcaggg cagtggcatg 181 gagcccctct tccccgcgcc gttctgggag gttatctacg gcagccacct tcagggcaac 241 ctgtccctcc tgagccccaa ccacagtctg ctgcccccgc atctgctgct caatgccagc 301 cacggcgcct tcctgcccct cgggctcaag gtcaccatcg tggggctcta cctggccgtg 361 tgtgtcggag ggctcctggg gaactgcctt gtcatgtacg tcatcctcag gcacaccaaa 421 atgaagacag ccaccaatat ttacatcttt aacctggccc tggccgacac tctggtcctg 481 ctgacgctgc ccttccaggg cacggacatc ctcctgggct tctggccgtt tgggaatgcg 541 ctgtgcaaga cagtcattgc cattgactac tacaacatgt tcaccagcac cttcacccta 601 actgccatga gtgtggatcg ctatgtagcc atctgccacc ccatccgtgc cctcgacgtc 661 cgcacgtcca gcaaagccca ggctgtcaat gtggccatct gggccctggc ctctgttgtc 721 ggtgttcccg ttgccatcat gggctcggca caggtcgagg atgaagagat cgagtgcctg 781 gtggagatcc ctacccctca ggattactgg ggcccggtgt ttgccatctg catcttcctc 841 ttctccttca tcgtccccgt gctcgtcatc tctgtctgct acagcctcat gatccggcgg 901 ctccgtggag tccgcctgct ctcgggctcc cgagagaagg accggaacct gcggcgcatc 961 actcggctgg tgctggtggt agtggctgtg ttcgtgggct gctggacgcc tgtccaggtc 1021 ttcgtgctgg cccaagggct gggggttcag ccgagcagcg agactgccgt ggccattctg 1081 cgcttctgca cggccctggg ctacgtcaac agctgcctca accccatcct ctacgccttc 1141 ctggatgaga acttcaaggc ctgcttccgc aagttctgct gtgcatctgc cctgcgccgg 1201 gacgtgcagg tgtctgaccg cgtgcgcagc attgccaagg acgtggccct ggcctgcaag 1261 acctctgaga cggtaccgcg gcccgcatga ctaggcgtgg acctgcccat ggtgcctgtc 1321 agcccgcaga gcccatctac gcccaacaca gagctcacac aggtcactgc tctctaggcg 1381 gacacaccct gggccctgag catccagagc ctgggatggg cttttccctg tgggccaggg 1441 atgctcggtc ccagaggagg acctagtgac atcatgggac aggtcaaagc attagggcca 1501 cctccatggc cccagacaga ctaaagctgc cctcctggtg cagggccgag gggacacaag 1561 gacctacctg gaagcagctg acatgctggt ggacggccgt tactggagcc cgtgcccctc 1621 cctccccgtg cttcatgtga ctcttggcct ctctgctgct gcgttggcag aaccctgggt 1681 gggcaggcac ccggaggagg agcagcagct gtgtcatcct gtgcccccca tgtgctgtgt 1741 gctgtttgca tggcagggct ccagctgcct tcagccctgt gacgtctcct cagggcagct 1801 ggacaggctt ggcacggccc gggaagtgca gcaggcagct tttctttggg gtgggacttg 1861 ccctgagctt ggagctgcca cctggaggac ttgcctgttc cgactccacc tgtgcagccg 1921 gggccacccc aggagaaagt gtccaggtgg gggctggcag tccctggctg cag // LOCUS HSOTF3AMR 1371 bp RNA PRI 28-SEP-1992 DEFINITION H.sapiens OTF3 mRNA encoding octamer binding protein 3A. ACCESSION Z11898 NID g35168 KEYWORDS octamer-binding protein; octamer-binding protein 3A. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1371) AUTHORS Takeda,J., Seino,S. and Bell,G.I. TITLE Human Oct3 gene family: cDNA sequences, alternative splicing, gene organization, chromosomal location, and expression at low levels in adult tissues JOURNAL Nucleic Acids Res. 20 (17), 4613-4620 (1992) MEDLINE 93027160 REFERENCE 2 (bases 1 to 1371) AUTHORS Bell,G.I. TITLE Direct Submission JOURNAL Submitted (03-APR-1992) Graeme I Bell, Howard Hughes Medical Institute, University of, Chicago, 5841 S. Maryland Ave., Chicago, IL, 60637, USA FEATURES Location/Qualifiers source 1..1371 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /dev_stage="Adult" /tissue_type="Kidney" /chromosome="Chromosome 6" gene 43..1125 /gene="OTF3" CDS 43..1125 /gene="OTF3" /standard_name="oct3A" /codon_start=1 /product="octamer binding protein 3A" /db_xref="PID:g35169" /db_xref="SWISS-PROT:Q01860" /translation="MAGHLASDFAFSPPPGGGGDGPGGPEPGWVDPRTWLSFQGPPGG PGIGPGVGPGSEVWGIPPCPPPYEFCGGMAYCGPQVGVGLVPQGGLETSQPEGEAGVG VESNSDGASPEPCTVTPGAVKLEKEKLEQNPEESQDIKALQKELEQFAKLLKQKRITL GYTQADVGLTLGVLFGKVFSQTTICRFEALQLSFKNMCKLRPLLQKWVEEADNNENLQ EICKAETLVQARKRKRTSIENRVRGNLENLFLQCPKPTLQQISHIAQQLGLEKDVVRV WFCNRRQKGKRSSSDYAQREDFEAAGSPFSGGPVSFPLAPGPHFGTPGYGSPHFTALY SSVPFPEGEAFPPVSVTTLGSPMHSN" BASE COUNT 296 a 357 c 437 g 281 t ORIGIN 1 ctcatttcac caggcccccg gcttggggcg ccttccttcc ccatggcggg acacctggct 61 tcggatttcg ccttctcgcc ccctccaggt ggtggaggtg atgggccagg ggggccggag 121 ccgggctggg ttgatcctcg gacctggcta agcttccaag gccctcctgg agggccagga 181 atcgggccgg gggttgggcc aggctctgag gtgtggggga ttcccccatg ccccccgccg 241 tatgagttct gtggggggat ggcgtactgt gggccccagg ttggagtggg gctagtgccc 301 caaggcggct tggagacctc tcagcctgag ggcgaagcag gagtcggggt ggagagcaac 361 tccgatgggg cctccccgga gccctgcacc gtcacccctg gtgccgtgaa gctggagaag 421 gagaagctgg agcaaaaccc ggaggagtcc caggacatca aagctctgca gaaagaactc 481 gagcaatttg ccaagctcct gaagcagaag aggatcaccc tgggatatac acaggccgat 541 gtggggctca ccctgggggt tctatttggg aaggtattca gccaaacgac catctgccgc 601 tttgaggctc tgcagcttag cttcaagaac atgtgtaagc tgcggccctt gctgcagaag 661 tgggtggagg aagctgacaa caatgaaaat cttcaggaga tatgcaaagc agaaaccctc 721 gtgcaggccc gaaagagaaa gcgaaccagt atcgagaacc gagtgagagg caacctggag 781 aatttgttcc tgcagtgccc gaaacccaca ctgcagcaga tcagccacat cgcccagcag 841 cttgggctcg agaaggatgt ggtccgagtg tggttctgta accggcgcca gaagggcaag 901 cgatcaagca gcgactatgc acaacgagag gattttgagg ctgctgggtc tcctttctca 961 gggggaccag tgtcctttcc tctggcccca gggccccatt ttggtacccc aggctatggg 1021 agccctcact tcactgcact gtactcctcg gtccctttcc ctgaggggga agcctttccc 1081 cctgtctctg tcaccactct gggctctccc atgcattcaa actgaggtgc ctgcccttct 1141 aggaatgggg gacaggggga ggggaggagc tagggaaaga aaacctggag tttgtgccag 1201 ggtttttgga ttaagttctt cattcactaa ggaaggaatt gggaacacaa agggtggggg 1261 caggggagtt tggggcaact ggttggaggg aaggtgaagt tcaatgatgc tcttgatttt 1321 aatcccacat catgtatcac ttttttctta aataaagaag cttgggacac a // LOCUS HSOXA1HS 1551 bp RNA PRI 23-MAR-1995 DEFINITION H.sapiens OXA1Hs mRNA. ACCESSION X80695 NID g619490 KEYWORDS OXA1Hs gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1551) AUTHORS Bonnefoy,N., Kermorgant,M., Groudinsky,O., Minet,M., Slonimski,P.P. and Dujardin,G. TITLE Cloning of a human gene involved in cytochrome oxidase assembly by functional complementation of an oxa1- mutation in Saccharomyces cerevisiae JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91 (25), 11978-11982 (1994) MEDLINE 95083624 REFERENCE 2 (bases 1 to 1551) AUTHORS Bonnefoy,N. TITLE Direct Submission JOURNAL Submitted (28-JUL-1994) N. Bonnefoy, Centre de Genetique Moleculaire, Ave de la Terrasse, 91198 Gif-sur-Yvette, Cedex, FRANCE FEATURES Location/Qualifiers source 1..1551 /organism="Homo sapiens" /db_xref="taxon:9606" gene 7..1314 /gene="OXA1Hs" CDS 7..1314 /gene="OXA1Hs" /codon_start=1 /db_xref="PID:g619491" /translation="MAMGLMCGRRELLRLLQSGRRVHSVAGPSQWLGKPLTTRLLFPV APCCCRPHYLFLAASGPRSLSTSAISFAEVQVQAPPVVAATPSPTAVPEVASGETADV VQTAAEQSFAELGLGSYTPVGLIQNLLEFMHVDLGLPWWGAIAACTVFARCLIFPLIV TGQREAARIHNHLPEIQKFSSRIREAKLAGDHIEYYKASSEMALYQKKHGIKLYKPLI LPVTQAPIFISFFIALREMANLPVPSLQTGGLWWFQDLTVSDPIYILPLAVTATMWAV LELGAETGVQSSDLQWMRNVIRMMPLITLPITMHFPTAVFMYWLSSNLFSLVQVSCLR IPAVRTVLKIPQRVVHDLDKLPPREGFLESFKKGWKNAEMTRQLREREQRMRNQLELA ARGPLRQTFTHNPLLQPGKDNPPNIPSSSSKPKSKYPWHDTLG" BASE COUNT 376 a 433 c 360 g 382 t ORIGIN 1 ggcaaaatgg cgatgggact aatgtgcgga cgccgggagc ttctgcgctt gctacagtcc 61 gggcgtcggg tccacagcgt cgcagggccc tcgcaatggc ttgggaaacc gctgaccaca 121 cggctcctat tcccagtagc cccgtgctgc tgtcgcccac actacctctt ccttgcggct 181 tccggccccc gcagcctcag tacctctgct atctcttttg cagaagtcca ggttcaggcc 241 cctcctgttg ttgctgcaac tccctcaccc acagcagtac ctgaggtggc ttctggagag 301 actgcagatg tagtccaaac tgctgcagag cagagcttcg ctgaactggg gctggggtca 361 tacaccccag tgggactgat ccagaattta ctggaattta tgcatgttga tctgggccta 421 ccttggtggg gggccattgc tgcatgtaca gtctttgccc gctgcctgat ttttcctctc 481 atcgtgacgg gccagcgaga ggcagccagg atccacaatc acttgccaga gatccagaag 541 ttttccagtc gaatcagaga ggccaagtta gcaggagacc atattgagta ttacaaggct 601 tcctcggaga tggcacttta ccagaaaaaa catggtatta aactctataa acctctcatt 661 ctccctgtga ctcaggcccc aatcttcatc tccttcttca ttgctttgag agagatggcc 721 aaccttcctg tgcccagcct gcagacaggt ggcctctggt ggttccagga tctcacggta 781 tccgatccca tctacatatt accactggca gtcactgcta caatgtgggc tgttcttgag 841 ctaggtgctg agacaggtgt gcaaagttct gaccttcagt ggatgagaaa tgtcatcaga 901 atgatgcccc tgataacctt gcccataacc atgcatttcc ccacggcagt gtttatgtac 961 tggctctcct ccaatttgtt ttccctggtc caagtatcct gtctccggat tccagcagta 1021 cgcactgtac ttaaaatccc ccagcgtgtt gtacatgacc tggacaaatt acctccacgg 1081 gaaggcttcc tagagagctt caaaaaaggc tggaaaaatg ctgaaatgac gcgtcagctg 1141 cgagagcgtg aacaacgcat gcggaatcag ttggagctag cagccagggg tcctttacga 1201 cagaccttta cccacaaccc tctcctacaa cctggaaagg ataaccctcc caatatccct 1261 agcagcagca gcaaaccaaa gtcaaagtat ccctggcacg acacacttgg ctgacttatg 1321 ttctgtgcgc attctggcag gaattctgtc tcttcagaga ctcatcctca aaacaagact 1381 tgacactgtg tccttgcccc agtcctagga actgtggcac acagagatgt tcattttaaa 1441 aacggatttc atgaaacact cttgtactta tgtttataag agagcactgg gtagccaagt 1501 gatcttccca ttcacagagt tagtaaacct ctgtactaca aaaaaaaaaa a // LOCUS HSOXYGR 1550 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for heme oxygenase. ACCESSION X06985 NID g35172 KEYWORDS haem oxygenase; transmembrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1550) AUTHORS Yoshida,T., Biro,P., Cohen,T., Muller,R.M. and Shibahara,S. TITLE Human heme oxygenase cDNA and induction of its mRNA by hemin JOURNAL Eur. J. Biochem. 171 (3), 457-461 (1988) MEDLINE 88151939 COMMENT library=Okayama-Berg; clone=pHHO1. FEATURES Location/Qualifiers source 1..1550 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="histiocytic lymphoma cells" /cell_line="U937" mRNA <1..1550 /note="heme oxygenase mRNA" variation 1..2 /note="tc is aa in pHHO2" CDS 81..947 /note="heme oxygenase (AA 1 - 288)" /codon_start=1 /db_xref="PID:g35173" /db_xref="SWISS-PROT:P09601" /translation="MERPQPDSMPQDLSEALKEATKEVHTQAENAEFMRNFQKGQVTR DGFKLVMASLYHIYVALEEEIERNKESPVFAPVYFPEELHRKAALEQDLAFWYGPRWQ EVIPYTPAMQRYVKRLHEVGRTEPELLVAHAYTRYLGDLSGGQVLKKIAQKALDLPSS GEGLAFFTFPNIASATKFKQLYRSRMNSLEMTPAVRQRVIEEAKTAFLLNIQLFEELQ ELLTHDTKDQSPSRAPGLRQRASNKVQDSAPVETPRGKPPLNTRSQAPLLRWVLTLSF LVATVAVGLYAM" misc_feature 879..944 /note="transmembrane domain" misc_feature 1533..1538 /note="polyA signal" polyA_site 1550 /note="polyA site" BASE COUNT 322 a 465 c 433 g 330 t ORIGIN 1 tcaacgcctg cctcccctcg agcgtcctca gcgcagccgc cgcccgcgga gccagcacga 61 acgagcccag caccggccgg atggagcgtc cgcaacccga cagcatgccc caggatttgt 121 cagaggccct gaaggaggcc accaaggagg tgcacaccca ggcagagaat gctgagttca 181 tgaggaactt tcagaagggc caggtgaccc gagacggctt caagctggtg atggcctccc 241 tgtaccacat ctatgtggcc ctggaggagg agattgagcg caacaaggag agcccagtct 301 tcgcccctgt ctacttccca gaagagctgc accgcaaggc tgccctggag caggacctgg 361 ccttctggta cgggccccgc tggcaggagg tcatccccta cacaccagcc atgcagcgct 421 atgtgaagcg gctccacgag gtggggcgca cagagcccga gctgctggtg gcccacgcct 481 acacccgcta cctgggtgac ctgtctgggg gccaggtgct caaaaagatt gcccagaaag 541 ccctggacct gcccagctct ggcgagggcc tggccttctt caccttcccc aacattgcca 601 gtgccaccaa gttcaagcag ctctaccgct cccgcatgaa ctccctggag atgactcccg 661 cagtcaggca gagggtgata gaagaggcca agactgcgtt cctgctcaac atccagctct 721 ttgaggagtt gcaggagctg ctgacccatg acaccaagga ccagagcccc tcacgggcac 781 cagggcttcg ccagcgggcc agcaacaaag tgcaagattc tgcccccgtg gagactccca 841 gagggaagcc cccactcaac acccgctccc aggctccgct tctccgatgg gtccttacac 901 tcagctttct ggtggcgaca gttgctgtag ggctttatgc catgtgaatg caggcatgct 961 ggctcccagg gccatgaact ttgtccggtg gaaggccttc tttctagaga gggaattctc 1021 ttggctggct tccttaccgt gggcactgaa ggctttcagg gcctccagcc ctctcactgt 1081 gtccctctct ctggaaagga ggaaggagcc tatggcatct tccccaacga aaagcacatc 1141 caggcaatgg cctaaacttc agagggggcg aaggggtcag ccctgccctt cagcatcctc 1201 agttcctgca gcagagcctg gaagacaccc taatgtggca gctgtctcaa acctccaaaa 1261 gccctgagtt tcaagtatcc ttgttgacac ggccatgacc actttccccg tgggccatgg 1321 caatttttac acaaacctga aaagatgttg tgtcttgtgt ttttgtctta tttttgttgg 1381 agccactctg ttcctggctc agcctcaaat gcagtatttt tgttgtgttc tgttgttttt 1441 atagcagggt tggggtggtt tttgagccat gcgtgggtgg ggagggaggt gtttaacggc 1501 actgtggcct tggtctaact tttgtgtgaa ataataaaca acattgtctg // LOCUS HSOZF 3186 bp RNA PRI 12-APR-1994 DEFINITION H.sapiens OZF mRNA. ACCESSION X70394 NID g468707 KEYWORDS OZF gene; zinc finger protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3186) AUTHORS Le Chalony,C., Prosperi,M.T., Haluza,R., Apiou,F., Dutrillaux,B. and Goubin,G. TITLE The OZF gene encodes a protein consisting essentially of zinc finger motifs JOURNAL J. Mol. Biol. 236 (2), 399-404 (1994) MEDLINE 94149744 REFERENCE 2 (bases 1 to 3186) AUTHORS Goubin,G.J. TITLE Direct Submission JOURNAL Submitted (10-FEB-1993) G.J. Goubin, Institut Curie, Laboratoire d'Oncogenese, 26, rue d'Ulm, 75231 Paris, Cedex 05, FRANCE FEATURES Location/Qualifiers source 1..3186 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="breast" /cell_type="mammary cells" /cell_line="HBL100/RAS1" gene 857..1735 /gene="OZF" CDS 857..1735 /gene="OZF" /codon_start=1 /product="zinc finger protein" /db_xref="PID:g468708" /translation="MSHLSQQKIYSGENPFACKVCGKVFSHKSNLTEHEHFHTREKPF ECNECGKAFSQKQYVIKHQNTHTGEKLFECNECGKSFSQKENLLTHQKIHTGEKPFEC KDCGKAFIQKSNLIRHQRTHTGEKPFVCKECGKTFSGKSNLTEHEKIHIGEKPFKCSE CGTAFGQKKYLIKHQNIHTGEKPYECNECGKAFSQRTSLIVHVRIHSGDKPYECNVCG KAFSQSSSLTVHVRSHTGEKPYGCNECGKAFSQFSTLALHLRIHTGKKPYQCSECGKA FSQKSHHIRHQKIHTH" polyA_signal 3166..3171 BASE COUNT 1114 a 532 c 674 g 866 t ORIGIN 1 agttaatatg atcctctgtt gaagcaagaa gacaaactgt atggtggaga aagaagtggg 61 aaggatctgc gcggaagaag cctgagaaga tgatgcacag atagagaggc accaggactt 121 gagaggcacc aggacttggg aggcatgttg atccatctcc aggaaagact gagaaaaaga 181 gcgttgaata taagaaaaaa tacttcctct gttctcagat cgtatttgtt ttaaggccac 241 acctttttga agttttcagt ttgaaacaca acctggactg aaatcatgag ggaggttgtg 301 taggaaagaa tcatcaaggg acttagttgg gagcttctct accacagctt actccttatg 361 gtattaaccc cctttaagtg taaatgtctt tggtttaaaa cgtttgtacc tcatctgtta 421 ccagagtgtt catactggag agaaacccta tgaatttact gaatgtgggg gaactgttac 481 tcatatgttg catttcgtcg aacgtgcaag tcatgctgaa gagaaaccct gtgcttctaa 541 ggagtattga caagccctta gctagatgac aatctcattg agaagcagaa aattcatgct 601 ggagagaaac tagtgaaggt agcaatactt cattgaatat tagaaaattt ttctagagag 661 aaagcattga atatactgag tatgataaca tttcctctca aaccttatcc cttactctgc 721 atttgggaga tcatacacag agaagcctta taaatgtaag agatgtggaa atatcttcag 781 ccaaaagtaa atcctcactc atcaagaaat ttttactgga gagaaacctt gtgaatgtgg 841 gaaagcttcc attcagatgt cacacctcag ccagcagaaa atttacagtg gggaaaaccc 901 ctttgcctgt aaggtatgtg gaaaagtctt cagccacaaa tcaaacctca ctgagcatga 961 gcattttcac acgagagaga aaccttttga atgtaacgag tgtggaaaag cctttagcca 1021 aaagcagtat gtcattaaac atcagaacac ccatactggc gagaagcttt tcgaatgtaa 1081 tgaatgtgga aaatcattta gccagaagga aaacctcctt acgcaccaga aaattcacac 1141 tggagaaaaa ccttttgagt gtaaagattg cgggaaagct ttcattcaga agtcaaacct 1201 catcagacac cagagaactc acacaggaga gaagcccttt gtatgtaagg agtgtggaaa 1261 aaccttcagt ggcaaatcca accttactga gcatgagaaa atccatattg gagagaagcc 1321 ttttaaatgt agtgaatgtg gaacagcctt tggccagaag aagtacctca taaaacatca 1381 gaacattcac actggagaga aaccctatga atgtaacgaa tgtggaaaag ccttctctca 1441 gcgaacatca cttattgtac atgtgaggat tcattcaggt gataaacctt acgaatgcaa 1501 tgtttgtgga aaagccttct ctcagagctc atctctcact gtgcatgtga gaagccatac 1561 aggggagaag ccctatggtt gtaatgaatg tgggaaagct ttctctcagt tctcaaccct 1621 tgctctgcat ttgagaatac acacaggtaa gaagccttat cagtgcagtg aatgtgggaa 1681 agctttcagc cagaagtcac accacattag acaccagaaa attcatactc actaaaaacc 1741 ccatgaaagc cttgaaagtg ggaaagcttt cattagaaat ttgcacctca tcatgcccca 1801 gaaataatcc ttctgaagca aagcaccacg aatgaggtta actttaacaa gtactaaaaa 1861 cttaagggac accagaaaat ttgtactgaa gagaaagaca tgcatatgat taaaaccctg 1921 tgtccaacag agaaacctgc agcagagata atggtgaaag tttaggcaca ttttcactaa 1981 aagtgggaac agaaaatgga ggcctgtttt ttattgctac caccatagat ctggagatct 2041 tcgccagtaa caagaaaaat taagttgtaa atattggaaa ggaacagaca aaaatagatg 2101 acatggtcat ctacaactaa tattaagact cctgtggtat tgattgagca gagcagaaga 2161 gatttcaaaa aaagacctat gcatttatta gaatttggaa tatgatacaa gtggcatcag 2221 gaaatgagga aataatggaa ctattttttt taagtggagg tagtttgttc tccaaggggg 2281 aaaaatagac catgaactat acacaaaagt gaatccagag attaaaaaca gaaatatgga 2341 aaagctatgt actaaaatgc atatgacctt gggaataggg aaacattcct tggaacaatt 2401 tgagaaatac ttagcggttc tacttatgag tatgaatgct caggaaatac gcactaggat 2461 atttactgtg gcttgatttg tagtagccaa aaattagaaa caactgagaa agcccatcac 2521 gaacaatatg ggaaattctt atgtaatact atatatctgt aaaatgcaac aaatgaaatc 2581 cacatgtatt aacagaaaga tcccagaatt gttgaggggg gatcaagttg cagaatgata 2641 cagatagtac aatgactttt ttgtaaattt gaattaaaag tttcctcata aaactcagta 2701 ctatctattg gtaatggatg catatatgta tgtacatgta catgtttttg aaacggattg 2761 gaaggataca gaccaaactc ttgatagtgg tcacctgtga agagtggaga agggaaaata 2821 atgagtgtgg gaggtgggat gggtattggt taaaggggac ttcagctttt tatataaaca 2881 tccacttctc tttcaaaaga cttcaagtaa atataagaag atactgattg attctggatt 2941 gtggtatagg gatatcttgt cttatttttt gtacttttct gtatttttaa aatttctcaa 3001 aataggaatg ggagtgagga tgggaatgct gtatctgtgg aagtcatgtt atactggatt 3061 catttccaat taaatactaa acattttata gaaaatattt ctaaaatttt atcacggatg 3121 aatggatgga cttgctctct atgtaatatg aaattaatct attttattaa aatttaaaag 3181 gacagc // LOCUS HSP0071 3907 bp RNA PRI 18-APR-1997 DEFINITION H.sapiens mRNA for p0071 protein. ACCESSION X81889 NID g1702923 KEYWORDS armadillo protein; p0071 protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3907) AUTHORS Hatzfeld,M. and Nachtsheim,C. TITLE Cloning and characterization of a new armadillo family member, p0071, associated with the junctional plaque: evidence for a subfamily of closely related proteins JOURNAL J. Cell. Sci. 109 (Pt 11), 2767-2778 (1996) MEDLINE 97092329 REFERENCE 2 (bases 1 to 3907) AUTHORS Hatzfeld,M. TITLE Direct Submission JOURNAL Submitted (13-AUG-1996) M. Hatzfeld, Max Planck Inst. for Biophysical Chem., PO Box 2841, D- 37018 Goettingen, FRG COMMENT Related sequences: EST UXT00071, and M62015. FEATURES Location/Qualifiers source 1..3907 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="female" /dev_stage="85 years old" /tissue_type="brain, frontal cortex" /clone_lib="lambda ZAP II (Stratagene)" CDS 142..3777 /note="member of armadillo multigene family; component of the junctional plaque of desmosomes" /codon_start=1 /product="p0071 protein" /db_xref="PID:e259279" /db_xref="PID:g1702924" /translation="MPAPEQASLVEEGQPQTRQEAASTGPGMEPETTATTILASVKEQ ELQFQRLTRELEVERQIVASQLERCRLGAESPSIASTSSTEKSFPWRSTDVPNTGVSK PRVSDAVQPNNYLIRTEPEQGTLYSPEQTSLHESEGSLGNSRSSTQMNSYSDSGYQEA GSFHNSQNVSKADNRQQHSFIGSTNNHVVRNSRAEGQTLVQPSVANRAMRRVSSVPSR AQSPSYVISTGVSPSRGSLRTSLGSGFGSPSVTDPRPLNPSAYSSTTLPAARAASPYS QRPASPTAIRRIGSVTSRQTSNPNGPTPQYQTTARVGSPLTLTDAQTRVASPSQGQVG SSSPKRSGMTAVPQHLGPSLQRTVHDMEQFGQQQYDIYERMVPPRPDSLTGLRSSYAS QHSQLGQDLRSAVSPDLHITPIYEGRTYYSPVYRSPNHGTVELQGSQTALYRTGVSGI GNLQRTSSQRSTLTYQRNNYALNTTATYAEPYRPIQYRVQECNYNRLQHAVPADDGTT RSPSIDSIQKDPREFAWRDPELPEVIHMLEHQFPSVQANAAAYLQHLCFGDNKVKMEV CRLGGIKHLVDLLDHRVLEVQKNACGALRNLVFGKSTDENKIAMKNVGGIPALLRLLR KSIDAEVRELVTGVLWNLSSCDAVKMTIIRDALSTLTNTVIVPHSGWNNSSFDDDHKI KFQTSLVLRNTTGCLRNLTSAGEEARKQMRSCEGLVDSLLYVIHTCVNTSDYDSKTVE NCVCTLRNLSYRLELEVPQARLLGLNELDDLLGKESPSKDSEPSCWGKKKKKKKRTPQ EDQWDGVGPIPGLSKSPKGVEMLWHPSVVKPYLTLLAESSNPATLEGSAGSLQNLSAS NWKFAAYIRGGRPKRKGLPILVELLRMDNDRVVSSGATALRNMALDVRNKELIGKYAM RDLVNRLPGGNGPSVLSDETMAAICCALHEVTSKNMENAKALADSGGIEKLVNITKGR GDRSSLKVVKAAAQVLNTLWQYRDLRSIYKKDGWNQNHFITPVSTLERDRFKSHPSLS TTNQQMSPIIQSVGSTSSSPALLGIRDPRSEYDRTQPPMQYYNSQGDATHKGLYPGSS KPSPIYISSYSSPAREQNRRLQHQQLYYSQDDSNRKNFDAYRLYLQSPHSYEDPYFDD RVHFPASTDYSTQYGLKSTTNYVDFYSTKRPSYRAEQYPGSPDSWVYDQDAQQRNSFF LTLFRLR" BASE COUNT 1083 a 1001 c 955 g 868 t ORIGIN 1 ctactgttgt ttttgagggg cgggcagccg cgccgccgcg gcactttttt aattttttcg 61 ggtgccgcag cagcgacccc tcggcgccga tgtccctgat ccctggagcg acgacggccg 121 ctgcctaagc tgggaagagg aatgccagct cctgagcagg cctcattggt ggaggagggg 181 caaccacaga cccgccagga agctgcctcc actggcccag gcatggaacc cgagaccaca 241 gccaccacta ttctagcatc cgtgaaggag caggagcttc agtttcagcg actcacccga 301 gaactggaag tggaaaggca gattgttgcc agtcagctag aaagatgtag gcttggagca 361 gaatcaccaa gcatcgccag caccagctca actgagaagt catttccttg gagatcaaca 421 gacgtgccaa atactggtgt aagcaaacct agagtttctg acgctgtcca gcccaacaac 481 tatctcatca ggacagagcc agaacaagga accctctatt caccagaaca gacatctctc 541 catgaaagtg agggatcatt gggtaactca agaagttcaa cacaaatgaa ttcttattcc 601 gacagtggat accaggaagc agggagtttc cacaacagcc agaacgtgag caaggcagac 661 aacagacagc agcattcatt cataggatca actaacaacc atgtggtgag gaattcaaga 721 gctgaaggac aaacactggt tcagccatca gtagccaatc gggccatgag aagagttagt 781 tcagttccat ctagagcaca gtctccttct tatgttatca gcacaggcgt gtctccttca 841 agggggtctc tgagaacttc tctgggtagt ggatttggct ctccgtcagt gaccgacccc 901 cgacctctga accccagtgc atattcctcc accacattac ctgctgcacg ggcagcctct 961 ccgtactcac agagacccgc ctccccaaca gctatacggc ggattgggtc agtcacctcc 1021 cggcagacct ccaatcccaa cggaccaacc cctcaatacc aaaccaccgc cagagtgggg 1081 tccccactga ccctgacgga tgcacagact cgagtagctt ccccatccca aggccaggtg 1141 gggtcgtcgt cccccaaacg ctcagggatg accgccgtac cacagcatct gggaccttca 1201 ctgcaaagga ctgttcatga catggagcaa ttcggacagc agcagtatga catttatgag 1261 aggatggttc cacccaggcc agacagcctg acaggcttac ggagttccta tgctagtcag 1321 catagtcagc ttgggcaaga ccttcgttct gccgtgtctc ccgacttgca cattactcct 1381 atatatgagg ggaggaccta ttacagccca gtgtaccgca gcccaaacca tggaactgtg 1441 gagctccaag gatcgcagac ggcgttgtat cgcacaggtg tatcaggtat tggaaatcta 1501 caaaggacat ccagccaacg aagtaccctt acataccaaa gaaataatta tgctctgaac 1561 acaacagcta cctacgcgga gccctacagg cctatacaat accgagtgca agagtgcaat 1621 tataacaggc ttcagcatgc agtgccggct gatgatggca ccacaagatc cccatcaata 1681 gacagcattc agaaggaccc cagggagttt gcctggcgtg atcctgagtt gcctgaggtc 1741 attcacatgc ttgagcacca gttcccatct gttcaggcaa atgcagcggc ctacctgcag 1801 cacctgtgct ttggtgacaa caaagtgaag atggaggtgt gtaggttagg gggaatcaag 1861 catctggttg accttctgga ccacagagtt ttggaagttc agaagaatgc ttgtggtgcc 1921 cttcgaaacc tcgtttttgg caagtctaca gatgaaaata aaatagcaat gaagaatgtt 1981 ggtgggatac ctgccttgtt gcgactgttg agaaaatcta ttgatgcaga agtaagggag 2041 cttgttacag gagttctttg gaatttatcc tcatgtgatg ctgtaaaaat gacaatcatt 2101 cgagatgctc tctcaacctt aacaaacact gtgattgttc cacattctgg atggaataac 2161 tcttcttttg atgatgatca taaaattaaa tttcagactt cactagttct gcgtaacacg 2221 acaggttgcc taaggaacct cacgtccgcg ggggaagaag ctcggaagca aatgcggtcc 2281 tgcgaggggc tggtagactc actgttgtat gtgatccaca cgtgtgtgaa cacatccgat 2341 tacgacagca agacggtgga gaactgcgtg tgcaccctga ggaacctgtc ctatcggctg 2401 gagctggagg tgccccaggc ccggttactg ggactgaacg aattggatga cttactagga 2461 aaagagtctc ccagcaaaga ctctgagcca agttgctggg ggaagaagaa gaaaaagaaa 2521 aagaggactc cgcaagaaga tcaatgggat ggagttggtc ctatcccagg actgtcgaag 2581 tcccccaaag gggttgagat gctgtggcac ccatcggtgg taaaaccata tctgactctt 2641 ctagcagaaa gttccaaccc agccaccttg gaaggctctg cagggtctct ccagaacctc 2701 tctgctagca actggaagtt tgcagcatat atccggggcg gccgtccgaa aagaaaaggg 2761 ctccccatcc ttgtggagct tctgagaatg gataacgata gagttgtttc ttccggtgca 2821 acagccttga ggaatatggc actagatgtt cgcaacaagg agctcatagg caaatacgcc 2881 atgcgagacc tggtcaaccg gctccccggc ggcaatggcc ccagtgtctt gtctgatgag 2941 accatggcag ccatctgctg tgctctgcac gaggtcacca gcaaaaacat ggagaacgca 3001 aaagccctgg ccgactcagg aggcatagag aagctggtga acataaccaa aggcaggggc 3061 gacagatcat ctctgaaagt ggtgaaggca gcagcccagg tcttgaatac attatggcaa 3121 tatcgggacc tccggagcat ttataaaaag gatgggtgga atcagaacca ttttattaca 3181 cctgtgtcga cattggagcg agaccgattc aaatcacatc cttccttgtc taccaccaac 3241 caacagatgt cacccatcat tcagtcagtc ggcagcacct cttcctcacc agcactgtta 3301 ggaatcagag accctcgctc tgaatacgat aggacccagc cacctatgca gtattacaat 3361 agccaagggg atgccacaca taaaggcctg taccctggct ccagcaaacc ttcaccaatt 3421 tacatcagtt cctattcctc accagcaaga gaacaaaata gacggctaca gcatcaacag 3481 ctgtattata gtcaagatga ctccaacaga aagaactttg atgcatacag attgtatttg 3541 cagtctcctc atagctatga agatccttat tttgatgacc gagttcactt tccagcttct 3601 actgattact caacacagta tggactgaaa tcgaccacaa attatgtaga cttttattcc 3661 actaaacgac cttcttatag agcagaacag tacccagggt ccccagactc atgggtgtac 3721 gatcaagatg cccaacagag gaactctttc tttctaacct tgttcagatt gaggtgaaaa 3781 gtccatcttg ctgatttcat gattgaaatg tgaaagtgaa gtggaaggaa tgaatgaagt 3841 gtgttttttt ttcctttttg aggaattatc aggggaattc gatatcaagc ttatcgatac 3901 cgtcgac // LOCUS HSP110DEL 3868 bp RNA PRI 16-MAY-1997 DEFINITION H.sapiens mRNA for phosphoinositide 3-kinase. ACCESSION Y10055 NID g2104839 KEYWORDS p110delta; phosphoinositide 3-kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3868) AUTHORS Vanhaesebroeck,B., Welham,M.J., Kotani,K., Stein,R., Warne,P.H., Zvelebil,M.J., Higashi,K., Volinia,S., Downward,J. and Waterfield,M.D. TITLE P110delta, a novel phosphoinositide 3-kinase in leukocytes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (9), 4330-4335 (1997) MEDLINE 97272223 REFERENCE 2 (bases 1 to 3868) AUTHORS Vanhaesebroeck,B.A.M. TITLE Direct Submission JOURNAL Submitted (13-DEC-1996) B.A.M. Vanhaesebroeck, Ludwig Institute For Cancer Research, UCL/Middlesex Hospital Branch, 91 Riding House Street, London N3 2EJ, UK FEATURES Location/Qualifiers source 1..3868 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="U937" gene 197..3331 /gene="P110delta" CDS 197..3331 /gene="P110delta" /codon_start=1 /product="phosphoinositide 3-kinase" /db_xref="PID:e308053" /db_xref="PID:g2104840" /translation="MPPGVDCPMEFWTKEENQSVVVDFLLPTGVYLNFPVSRNANLST IKQLLWHRAQYEPLFHMLSGPEAYVFTCINQTAEQQELEDEQRRLCDVQPFLPVLRLV AREGDRVKKLINSQISLLIGKGLHEFDSLCDPEVNDFRAKMCQFCEEAAARRQQLGWE AWLQYSFPLQLEPSAQTWGPGTLRLPNRALLVNVKFEGSEESFTFQVSTKDVPLALMA CALRKKATVFRQPLVEQPEDYTLQVNGRHEYLYGSYPLCQFQYICSCLHSGLTPHLTM VHSSSILAMRDEQSNPAPQVQKPRAKPPPIPAKKPSSVSLWSLEQPFRIELIQGSKVN ADERMKLVVQAGLFHGNEMLCKTVSSSEVSVCSEPVWKQRLEFDINICDLPRMARLCF ALYAVIEKAKKARSTKKKSKKADCPIAWANLMLFDYKDQLKTGERCLYMWPSVPDEKG ELLNPTGTVRSNPNTDSAAALLICLPEVAPHPVYYPALEKILELGRHSECVHVTEEEQ LQLREILERRGSGELYEHEKDLVWKLRHEVQEHFPEALARLLLVTKWNKHEDVAQMLY LLCSWPELPVLSALELLDFSFPDCHVGSFAIKSLRKLTDDELFQYLLQLVQVLKYESY LDCELTKFLLDRALANRKIGHFLFWHLRSEMHVPSVALRFGLILEAYCRGRTHHMKVL MKQGEALSKLKALNDFVKLSSQKTPKPQTKELMHLCMRQEAYLEALSHLQSPLDPSTL LAEVCVEQCTFMDSKMKPLWIMYSNEEAGSGGSVGIIFKNGDDLRQDMLTLQMIQLMD VLWKQEGLDLRMTPYGCLPTGDRTGLIEVVLRSDTIANIQLNKSNMAATAAFNKDALL NWLKSKNPGEALDRAIEEFTLSCAGYCVATYVLGIGDRHSDNIMIRESGQLFHIDFGH FLGNFKTKFGINRERVPFILTYDFVHVIQQGKTNNSEKFERFRGYCERAYTILRRHGL LFLHLFALMRAAGLPELSCSKDIQYLKDSLALGKTEEEALKHFRVKFNEALRESWKTK VNWLAHNVSKDNRQ" BASE COUNT 800 a 1190 c 1141 g 737 t ORIGIN 1 gaattcggca cgagcggccg cgagcagagc cgcccagccc tgccagctgc gccgggacga 61 taaggagtca ggccagggcg ggatgacact cattgattct aaagcatctt taatctgcca 121 ggcggagggg gctttgctgg tctttcttgg actattccag agaggacaac tgtcatctgg 181 gaagtaacaa cgcaggatgc cccctggggt ggactgcccc atggaattct ggaccaagga 241 ggagaatcag agcgttgtgg ttgacttcct gctgcccaca ggggtctacc tgaacttccc 301 tgtgtcccgc aatgccaacc tcagcaccat caagcagctg ctgtggcacc gcgcccagta 361 tgagccgctc ttccacatgc tcagtggccc cgaggcctat gtgttcacct gcatcaacca 421 gacagcggag cagcaagagc tggaggacga gcaacggcgt ctgtgtgacg tgcagccctt 481 cctgcccgtc ctgcgcctgg tggcccgtga gggcgaccgc gtgaagaagc tcatcaactc 541 acagatcagc ctcctcatcg gcaaaggcct ccacgagttt gactccttgt gcgacccaga 601 agtgaacgac tttcgcgcca agatgtgcca attctgcgag gaggcggccg cccgccggca 661 gcagctgggc tgggaggcct ggctgcagta cagtttcccc ctgcagctgg agccctcggc 721 tcaaacctgg gggcctggta ccctgcggct cccgaaccgg gcccttctgg tcaacgttaa 781 gtttgagggc agcgaggaga gcttcacctt ccaggtgtcc accaaggacg tgccgctggc 841 gctgatggcc tgtgccctgc ggaagaaggc cacagtgttc cggcagccgc tggtggagca 901 gccggaagac tacacgctgc aggtgaacgg caggcatgag tacctgtatg gcagctaccc 961 gctctgccag ttccagtaca tctgcagctg cctgcacagt gggttgaccc ctcacctgac 1021 catggtccat tcctcctcca tcctcgccat gcgggatgag cagagcaacc ctgcccccca 1081 ggtccagaaa ccgcgtgcca aaccacctcc cattcctgcg aagaagcctt cctctgtgtc 1141 cctgtggtcc ctggagcagc cgttccgcat cgagctcatc cagggcagca aagtgaacgc 1201 cgacgagcgg atgaagctgg tggtgcaggc cgggcttttc cacggcaacg agatgctgtg 1261 caagacggtg tccagctcgg aggtgagcgt gtgctcggag cccgtgtgga agcagcggct 1321 ggagttcgac atcaacatct gcgacctgcc ccgcatggcc cgtctctgct ttgcgctgta 1381 cgccgtgatc gagaaagcca agaaggctcg ctccaccaag aagaagtcca agaaggcgga 1441 ctgccccatt gcctgggcca acctcatgct gtttgactac aaggaccagc ttaagaccgg 1501 ggaacgctgc ctctacatgt ggccctccgt cccagatgag aagggcgagc tgctgaaccc 1561 cacgggcact gtgcgcagta accccaacac ggatagcgcc gctgccctgc tcatctgcct 1621 gcccgaggtg gccccgcacc ccgtgtacta ccccgccctg gagaagatct tggagctggg 1681 gcgacacagc gagtgtgtgc atgtcaccga ggaggagcag ctgcagctgc gggaaatcct 1741 ggagcggcgg gggtctgggg agctgtatga gcacgagaag gacctggtgt ggaagctgcg 1801 gcatgaagtc caggagcact tcccggaggc gctagcccgg ctgctgctgg tcaccaagtg 1861 gaacaagcat gaggatgtgg cccagatgct ctacctgctg tgctcctggc cggagctgcc 1921 cgtcctgagc gccctggagc tgctagactt cagcttcccc gattgccacg taggctcctt 1981 cgccatcaag tcgctgcgga aactgacgga cgatgagctg ttccagtacc tgctgcagct 2041 ggtgcaggtg ctcaagtacg agtcctacct ggactgcgag ctgaccaaat tcctgctgga 2101 ccgggccctg gccaaccgca agatcggcca cttccttttc tggcacctcc gctccgagat 2161 gcacgtgccg tcggtggccc tgcgcttcgg cctcatcctg gaggcctact gcaggggcag 2221 gacccaccac atgaaggtgc tgatgaagca gggggaagca ctgagcaaac tgaaggccct 2281 gaatgacttc gtcaagctga gctctcagaa gacccccaag ccccagacca aggagctgat 2341 gcacttgtgc atgcggcagg aggcctacct agaggccctc tcccacctgc agtccccact 2401 cgaccccagc accctgctgg ctgaagtctg cgtggagcag tgcaccttca tggactccaa 2461 gatgaagccc ctgtggatca tgtacagcaa cgaggaggca ggcagcggcg gcagcgtggg 2521 catcatcttt aagaacgggg atgacctccg gcaggacatg ctgaccctgc agatgatcca 2581 gctcatggac gtcctgtgga agcaggaggg gctggacctg aggatgaccc cctatggctg 2641 cctccccacc ggggaccgca caggcctcat tgaggtggta ctccgttcag acaccatcgc 2701 caacatccaa ctcaacaaga gcaacatggc agccacagcc gccttcaaca aggatgccct 2761 gctcaactgg ctgaagtcca agaacccggg ggaggccctg gatcgagcca ttgaggagtt 2821 caccctctcc tgtgctggct attgtgtggc cacatatgtg ctgggcattg gcgatcggca 2881 cagcgacaac atcatgatcc gagagagtgg gcagctgttc cacattgatt ttggccactt 2941 tctggggaat ttcaagacca agtttggaat caaccgcgag cgtgtcccat tcatcctcac 3001 ctacgacttt gtccatgtga ttcagcaggg gaagactaat aatagtgaga aatttgaacg 3061 gttccggggc tactgtgaaa gggcctacac catcctgcgg cgccacgggc ttctcttcct 3121 ccacctcttt gccctgatgc gggcggcagg cctgcctgag ctcagctgct ccaaagacat 3181 ccagtatctc aaggactccc tggcactggg gaaaacagag gaggaggcac tgaagcactt 3241 ccgagtgaag tttaacgaag ccctccgtga gagctggaaa accaaagtga actggctggc 3301 ccacaacgtg tccaaagaca acaggcagta gtggctcctc ccagccctgg gcccaagagg 3361 aggcggctgc gggtcgtggg gaccaagcac attggtccta aaggggctga agagcctgaa 3421 ctgcacctaa cgggaaagaa ccgacatggc tgccttttgt ttacactggt tatttattta 3481 tgacttgaaa tagtttaagg agctaaacag ccataaacgg aaacgcctcc ttcatgcagc 3541 ggcggtgctg ggccccccga ggctgcacct ggctctcggc tgaggattgt caccccaagt 3601 cttccagctg gtggatctgg gcccagcaaa gactgttctc ctcccgaggg aaccttcttc 3661 ccaggcctcc cgccagactg cctgggtcct ggcgcctggc ggtcacctgg tgcctactgt 3721 ccgacaggat gccttgatcc tcgtgcgacc caccctgtgt atcctcccta gactgagttc 3781 tggcagctcc ccgaggcagc cggggtaccc tctagattca gggatgcttg ctctccactt 3841 ttcaagtggg tcttgggtac gagaattc // LOCUS HSP120A 2568 bp RNA PRI 10-MAR-1993 DEFINITION H.sapiens mRNA for P120 antigen. ACCESSION X55504 NID g287722 KEYWORDS P120 antigen. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2568) AUTHORS Busch,H. TITLE The final common pathway of cancer JOURNAL Cancer Res. 50 (16), 4830-4838 (1990) MEDLINE 90335772 REFERENCE 2 (bases 1 to 2568) AUTHORS Valdez,B.C., Perlaky,L., Saijo,Y., Henning,D., Zhu,C., Busch,R.K., Zhang,W.W. and Busch,H. TITLE A region of antisense RNA from human p120 cDNA with high homology to mouse p120 cDNA inhibits NIH 3T3 proliferation JOURNAL Cancer Res. 52 (20), 5681-5686 (1992) MEDLINE 93007949 FEATURES Location/Qualifiers source 1..2568 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1..2568 /codon_start=1 /product="P120 antigen" /db_xref="PID:g287723" /db_xref="SWISS-PROT:P46087" /translation="MGRKLDPTKEKRGPGRKARKQKGAETELVRFLPAVSDENSKRLS SRARKRAAKRRLGSVEAPKTNKSPEAKPSPGKLPKGISAGAVQTAGKKGPQSLFNAPR GKKRPAPGSDEEEEEEDSEEDGMVNHGDLWGSEDDADTVDDYGADSNSEDEEEGEALL PIERAARKQKAREAAAGIQWSEEETEDEEEEKEVTPESGPPKVEEADGGLQINVDEEP FVLPPAGEMEQDAQAPDLQRVHKRIQDIVGILRDFGAQREEGRSRSEYLNRLKKDLAI YYSYGDFLLGKLMDLFPLSELVEFLEANEVPRPVTLRTNTLKTRRRDLAQALINRGVN LDPLGKWSKTGLVVYDSSVPIGATPEYLAGHYMLQGASSMLPVMALAPQEHERILDMC CAPGGKTSYMAQLMKNTGVILANDANAERLKSVVGNLHRLGVTNTIISHYDGRQFPKV VGGFDRVLLDAPCSGTGVISKDPAVKTNKDEKDILRCAHLQKELLLSAIDSVNATSKT GGYLVYCTCSITVEENEWVVDYALKKRNVRLVPTGLDFGQEGFTRFRERRFHPSLRST RRFYPHTHNMDGFFIAKFKKFSNSIPQSQTGNSETATPTNVDLPQVIPKSENSSQPAK KAKGAAKTKQQLQKQQHPKKASFQKLNGISKGADSELSTVPSVTKTQASSSFQDSSQP AGKAEGIREPKVTGKLKQRSPKLQSSKKVAFLRQNAPPKGTDTQTPAVLSPSKTQATL KPKDHHQPLGRAKGVEKQQFAEQPFEKAAFQKQNDTPKGLSLPLCLPSVPAAPHQQRG RNLSPGATASCCYLRWLKTRRVAHCHCHQVGTLASVRMPSLLCIPMKFNTHFKTSGH" BASE COUNT 667 a 681 c 717 g 503 t ORIGIN 1 atggggcgca agttggaccc tacgaaggag aagcgggggc caggccgaaa ggcccggaag 61 cagaagggtg ccgagacaga actcgtcaga ttcttgcctg cagtaagtga cgaaaattcc 121 aagaggctgt ctagtcgtgc tcgaaagagg gcagccaaga ggagattggg ctctgttgaa 181 gcccctaaga caaataagtc tcctgaggcc aaaccatcgc ctggaaagct accaaaaggg 241 atctctgcag gagctgtcca gacagctggt aagaagggac cccagtccct atttaatgct 301 cctcgaggca agaagcgccc agcacctggc agtgatgagg aagaggagga ggaagactct 361 gaagaagatg gtatggtgaa ccacggggac ctctggggct ccgaggacga tgctgatacg 421 gtagatgact atggagctga ctccaactct gaggatgagg aggaaggtga agcgttgctg 481 cccattgaaa gagctgctcg gaagcagaag gcccgggaag ctgctgctgg gatccagtgg 541 agtgaagagg agaccgagga cgaggaggaa gagaaagaag tgacccctga gtcaggcccc 601 ccaaaggtgg aagaggcaga tgggggcctg cagatcaatg tggatgagga accatttgtg 661 ctgccccctg ctggggagat ggagcaggat gcccaggctc cagacctgca acgagttcac 721 aagcggatcc aggatattgt gggaattctg cgtgattttg gggctcagcg ggaggaaggg 781 cggtctcgtt ctgaatacct gaaccggctc aagaaggatc tggccattta ctactcctat 841 ggagacttcc tgcttggcaa gctcatggac ctcttccctc tgtctgagct ggtggagttc 901 ttagaagcta atgaggtgcc tcggcccgtc accctccgga ccaatacctt gaaaacccga 961 cgccgagacc ttgcacaggc tctaatcaat cgtggggtta acctggatcc cctgggcaag 1021 tggtcaaaga ctggactagt ggtgtatgat tcttctgtgc ccattggtgc tacccccgag 1081 tacctggctg ggcactacat gctgcaggga gcctccagca tgttgcccgt catggccttg 1141 gcaccccagg aacatgagcg gatcctggac atgtgttgtg cccctggagg aaagaccagc 1201 tacatggccc agctgatgaa gaacacgggt gtgatccttg ccaatgacgc caatgctgag 1261 cggctcaaga gtgttgtggg caacttgcat cggctgggag tcaccaacac cattatcagc 1321 cactatgatg ggcgccagtt ccccaaggtg gtggggggct ttgaccgagt actgctggat 1381 gctccctgca gtggcactgg ggtcatctcc aaggatccag ccgtgaagac taacaaggat 1441 gagaaggaca tcctgcgctg tgctcacctc cagaaggagt tgctcctgag tgctattgac 1501 tctgtcaatg cgacctccaa gacaggaggc tacctggttt actgcacctg ttctatcaca 1561 gtagaagaga atgagtgggt ggtagactat gctctgaaaa agaggaatgt gcgactggtg 1621 cccacgggcc tagactttgg ccaggaaggt tttacccgct ttcgagaaag gcgcttccac 1681 cccagtctgc gttctacccg acgcttctac cctcataccc acaatatgga tgggttcttc 1741 attgccaagt tcaagaaatt ttccaattct atccctcagt cccagacagg aaattctgaa 1801 acagccacac ctacaaatgt agacttgcct caggtcatcc ccaagtctga gaacagcagc 1861 cagccagcca agaaagccaa gggggctgca aagacaaagc agcagctgca gaaacagcaa 1921 catcccaaga aggcctcctt ccagaagctg aatggcatct ccaaaggggc agactcagaa 1981 ttgtccactg taccttctgt cacaaagacc caagcttcct ccagcttcca ggatagcagt 2041 cagccagctg gaaaagccga agggatcagg gagccaaagg tgactgggaa gctaaagcaa 2101 cgatcaccta aattacagtc ctccaagaaa gttgctttcc tcaggcagaa tgcccctccc 2161 aagggcacag acacacaaac accggctgtg ttatccccat ccaagactca ggccaccctg 2221 aaacctaagg accatcatca gccccttgga agggccaagg gggttgagaa gcagcagttc 2281 gcagagcagc cttttgagaa agctgccttc cagaaacaga atgatacccc caagggcctc 2341 agcctcccac tgtgtctccc atccgttcca gccgcccccc accagcaaag aggaagaaat 2401 ctcagtccag gggcaacagc cagctgctgc tatcttagat ggttgaaaac tagacgggtg 2461 gctcactgcc attgtcacca ggttggaact cttgcctctg tgaggatgcc ttctctactg 2521 tgcataccca tgaaatttaa tacacatttt aaaacctctg gccactga // LOCUS HSP130K 4853 bp RNA PRI 14-JAN-1994 DEFINITION H.sapiens p130 mRNA for 130K protein. ACCESSION X76061 NID g416030 KEYWORDS 130K protein; p130 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4853) AUTHORS Whyte,P.F.M. TITLE Direct Submission JOURNAL Submitted (05-NOV-1993) P.F.M. Whyte, Inst. for Molecular Biology & Biotechn., McMacter University, 1280 Main Street West, Hamilton, Ontario L8S 4K1, CANADA REFERENCE 2 (bases 1 to 4853) AUTHORS Li,Y., Graham,C., Lacy,S., Duncan,A.M. and Whyte,P. TITLE The adenovirus E1A-associated 130-kD protein is encoded by a member of the retinoblastoma gene family and physically interacts with cyclins A and E JOURNAL Genes Dev. 7 (12A), 2366-2377 (1993) MEDLINE 94074895 FEATURES Location/Qualifiers source 1..4853 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta /spleen" /chromosome="16" /map="q12.2-q13" gene 70..3489 /gene="p130" CDS 70..3489 /gene="p130" /codon_start=1 /product="130K protein" /db_xref="PID:g416031" /translation="MPSGGDQSPPPPPPPPAAAASDEEEEDDGEAEDAAPSAESPTPQ IQQRFDELCSRLNMDEAARPEAWDSYRSMSESYTLEGNDLHWLACALYVACRKSVPTV SKGTVEGNYVSLTRILKCSEQSLIEFFNKMKKWEDMANLPPHFRERTERLERNFTVSA VIFKKYEPIFQDIFKYPQEEQPRQQRGRKQRRQPCTVSEIFHFCWVLFIYAKGNFPMI SDDLVNSYHLLLCALDLVYGNALQCSNRKELVNPNFKGLSEDFHAKDSKPSSDPPCII EKLCSLHDGLVLEAKGIKEHFWKPYIRKLYEKKLLKGKEENLTGFLEPGNFGESFKAI NKAYEEYVLSVGNLDERIFLGEDAEEEIGTLSRCLNAGSGTETAERVQMKNILQQHFD KSKALRISTPLTGVRYIKENSPCVTPVSTATHSLSRLHTMLTGLRNAPSEKLEQILRT CSRDPTQAIANRLKEMFEIYSQHFQPDEDFSNCAKEIASKHFRFAEMLYYKVLESVIE QEQKRLGDMDLSGILEQDAFHRSLLACCLEVVTFSYKPPGNFPFITEIFDVPLYHFYK VIEVFIRAEDGLCREVVKHLNQIEEQILDHLAWKPESPLWEKIRDNENRVPTCEEVMP PQNLERADEICIAGSPLTPRRVTEVRADTGGLGRSITSPTTLYDRYSSPPASTTRRRL FVENDSPSDGGTPGRMPPQPLVNAVPVQNVSGETVSVTPVPGQTLVTMATATVTANNG QTVTIPVQGIANENGGITFFPVQVNVGGQAQAVTGSIQPLSAQALAGSLSSQQVTGTT LQVPGQVAIQQISPGGQQQKQGQSVTSSSNRPRKTSSLSLFFRKVYHLAAVRLRDLCA KLDISDELRKKIWTCFEFSIIQCPELMMDRHLDQLLMCAIYVMAKVTKEDKSFQNIMR CYRTQPQARSQVYRSVLIKGKRKRRNSGSSDSRSHQNSPTELNKDRTSRDSSPVMRSS STLPVPQPSSAPPTPTRLTGANSDMEEEERGDLIQFYNNIYIKQIKTFAMKYSQANMD APPLSPYPFVRTGSPRRIQLSQNHPVYISPHKNETMLSPREKIFYYFSNSPSKRLREI NSMIRTGETPTKKRGILLEDGSESPAKRICPENHSALLRRLQDVANDRGSH" BASE COUNT 1445 a 1014 c 1050 g 1344 t ORIGIN 1 ttcgccgttt gaattgctgc gggcccgggc cctcacctca cctgaggtcc ggccgcccag 61 gggtgcgcta tgccgtcggg aggtgaccag tcgccaccgc ccccgcctcc ccctccggcg 121 gcggcagcct cggatgagga ggaggaggac gacggcgagg cggaagacgc cgcgccgtct 181 gccgagtcgc ccacccctca gatccagcag cggttcgacg agctgtgcag ccgcctcaac 241 atggacgagg cggcgcggcc cgaggcctgg gacagctacc gcagcatgag cgaaagctac 301 acgctggagg gaaatgatct tcattggtta gcatgtgcct tatatgtggc ttgcagaaaa 361 tctgttccaa ctgtaagcaa agggacagtg gaaggaaact atgtatcttt aactagaatc 421 ctgaaatgtt cagagcagag cttaatcgaa ttttttaata agatgaagaa gtgggaagac 481 atggcaaatc tacccccaca tttcagagaa cgtactgaga gattagaaag aaacttcact 541 gtttctgctg taatttttaa gaaatatgaa cccatttttc aggacatctt taaataccct 601 caagaggagc aacctcgtca gcagcgagga aggaaacagc ggcgacagcc ctgtactgtg 661 tctgaaattt tccatttttg ttgggtgctt tttatatatg caaaaggtaa tttccccatg 721 attagtgatg atttggtcaa ttcttatcac ctgctgctgt gtgctttgga cttagtttat 781 ggaaatgcac ttcagtgttc taatcgtaaa gaacttgtga accctaattt taaaggctta 841 tctgaagatt ttcatgctaa agattctaaa ccttcctctg accccccttg tatcattgag 901 aaactgtgtt ccttacatga tggcctagtt ttggaagcaa aggggataaa ggaacatttc 961 tggaaaccct atattaggaa actttatgaa aaaaagctcc ttaagggaaa agaagaaaat 1021 ctcactgggt ttctagaacc tgggaacttt ggagagagtt ttaaagccat caataaggcc 1081 tatgaggagt atgttttatc tgttgggaat ttagatgagc ggatatttct tggagaggat 1141 gctgaggagg aaattgggac tctctcaagg tgtctgaacg ctggttcagg aacagagact 1201 gctgaaaggg tgcagatgaa aaacatctta cagcagcatt ttgacaagtc caaagcactt 1261 agaatctcca caccactaac tggtgttagg tacattaagg agaatagccc ttgtgtgact 1321 ccagtttcta cagctacgca tagcttgagt cgtcttcaca ccatgctgac aggcctcagg 1381 aatgcaccaa gtgagaaact ggaacagatt ctcaggacat gttccagaga tccaacccag 1441 gctattgcta acagactgaa agaaatgttt gaaatatatt ctcagcattt ccagccagac 1501 gaggatttca gtaattgtgc taaagaaatt gccagcaaac attttcgttt tgcggagatg 1561 ctttactata aagtattaga atctgttatt gagcaggaac aaaaaagact aggagacatg 1621 gatttatctg gtattctgga acaagatgca ttccacagat ctctcttggc ctgctgcctt 1681 gaggtcgtca ctttttctta taagcctcct gggaattttc catttattac tgaaatattt 1741 gatgtgcctc tttatcattt ttataaggtg atagaagtat tcattagagc agaagatggc 1801 ctttgtagag aggtggtaaa acaccttaat cagattgaag aacagatctt agatcatttg 1861 gcatggaaac cagagtctcc actctgggaa aaaattagag acaatgaaaa cagagttcct 1921 acatgtgaag aggtcatgcc acctcagaac ctggaaaggg cagatgaaat ttgcattgct 1981 ggctcccctt tgactcccag aagggtgact gaagttcgtg ctgatactgg aggacttgga 2041 aggagcataa catctccaac cacattatac gataggtaca gctccccacc agccagcact 2101 accagaaggc ggctatttgt tgagaatgat agcccctctg atggagggac gcctgggcgc 2161 atgcccccac agcccctagt caatgctgtc cctgtgcaga atgtatctgg ggagactgtt 2221 tctgtcacac cagttcctgg acagactttg gtcaccatgg caaccgccac tgtcacagcc 2281 aacaatgggc aaacggtaac cattcctgtg caaggtattg ccaatgaaaa tggagggata 2341 acattcttcc ctgtccaagt caatgttggg gggcaggcac aagctgtgac aggctccatc 2401 cagcccctca gtgctcaggc cctggctgga agtctgagct ctcaacaggt gacaggaaca 2461 actttgcaag tccctggtca agtggccatt caacagattt ccccaggtgg ccaacagcag 2521 aagcaaggcc agtctgtaac cagcagtagt aatagaccca ggaagaccag ctctttatcg 2581 cttttcttta gaaaggtata ccatttagca gctgtccgcc ttcgggatct ctgtgccaaa 2641 ctagatattt cagatgaatt gaggaaaaaa atctggacct gctttgaatt ctccataatt 2701 cagtgtcctg aacttatgat ggacagacat ctggaccagt tattaatgtg tgccatttat 2761 gtgatggcaa aggtcacaaa agaagataag tccttccaga acattatgcg ttgttatagg 2821 actcagccgc aggcccggag ccaggtgtat agaagtgttt tgataaaagg gaaaagaaaa 2881 agaagaaatt ctggcagcag tgatagcaga agccatcaga attctccaac agaactaaac 2941 aaagatagaa ccagtagaga ctccagtcca gttatgaggt caagcagcac cttgccagtt 3001 ccacagccca gcagtgctcc tcccacacct actcgcctca caggtgccaa cagtgacatg 3061 gaagaagagg agaggggaga cctcattcag ttctacaaca acatctacat caaacagatt 3121 aagacatttg ccatgaagta ctcacaggca aatatggatg ctcctccact ctctccctat 3181 ccatttgtaa gaacaggctc ccctcgccga atacagttgt ctcaaaatca tcctgtctac 3241 atttccccac ataaaaatga aacaatgctt tctcctcgag aaaagatttt ctattacttc 3301 agcaacagtc cttcaaagag actgagagaa attaatagta tgatacgcac aggagaaact 3361 cctactaaaa agagaggaat tcttttggaa gatggaagtg aatcacctgc aaaaagaatt 3421 tgcccagaaa atcattctgc cttattacgc cgtctccaag atgtagctaa tgaccgtggt 3481 tcccactgag gttagtctct tgtattaaac tcttcacaaa atctgtttag cagcagcctt 3541 taatgcatct agattatgga gcttttttcc ttaatccagc tgatgagtta cagcctgtta 3601 gtaacatgag gggacatttt ggtgagaaat gggacttaac tccttccagt gtccttagaa 3661 cattttaatt catcccaact gtcttttttt ccctaccact cagtgattac tgtcaaggct 3721 gcttacaatc caaacttggg tttttggctc tggcaaagct tttagaaata ctgcaagaaa 3781 tgatgtgtac ccaacgtgag cataggaggc ttctgttgac gtctccaaca gaagaactgt 3841 gtttcaagtt caatcctacc tgttttgtgg tcagctgtag tcctcataaa aagcaaaaca 3901 aaaattaggt attttgtcct aaaacacctg gtaggagtgt gtgatttttt gcattcctga 3961 caaaggagag cacacccagg tttggaggtc ctaggtcatt agccctcgtc tcccgttccc 4021 tttgtgcaca tcttccctct ccccattcgg tgtggtgcag tgtgaaaagt ccttgattgt 4081 tcgggtgtgc aatgtctgag tgaacctgta taagtggagg cactttaggg ctgtaaaatg 4141 catgattttg taacccagat tttgctgtat atttgtgata gcactttcta caatgtgaac 4201 tttattaaat acaaaacttc caggctaaac atccaatatt ttctttaatg cttttatatt 4261 tttttaaaat gttaaaaccc ctatagccac cttttgggaa tgttttaaat tctccagttt 4321 tttgttatat agggatcaac cagctaagaa aagattttaa gtcaagttga attgagggga 4381 ttaatatgaa aacttatgac ctcttccttt aggagggagt tatctaaaag aaatgtctat 4441 taaggtgata tatttaaaaa tatttttggg tgttcctggc agtttaaaaa aattggttgg 4501 agaatttagg tttttattag taccatagta ccatttatac aaattagaaa atgttattta 4561 acagctgaat tatctataca tatctttatt aatcactatt gttccagcag ttttcaagtc 4621 aaattaataa tcttattagg gagaaaattc aattgtaaat tgaatcagta taaacaaagt 4681 tactaggtaa cttcatattg ctgagagaaa tatggaactt acattgttca attagaatag 4741 tgttctcccc aaatatttat aaaacttctc aagatactgc tacgtgtaat tttatatgaa 4801 gataagtgta tttttcaata aagcatttat aaattaaaaa aaaaaaaaaa aaa // LOCUS HSP14PROT 865 bp RNA PRI 19-DEC-1997 DEFINITION Homo sapiens mRNA for NA14 protein. ACCESSION Z96932 NID g2706619 KEYWORDS NA14 gene; NA14 protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 865) AUTHORS Ramos-Morales,F., Infante,C., Fedriani,C., Bornens,M. and Rios,R.M. TITLE NA14 is a novel nuclear autoantigen with a coiled-coil domain JOURNAL J. Biol. Chem. In press REFERENCE 2 (bases 1 to 865) AUTHORS Ramos-Morales,F. TITLE Direct Submission JOURNAL Submitted (17-JUN-1997) Ramos-morales F., Departamento de Microbiologia, Universidad de Sevilla, Apdo 1095, Sevilla 41080 SPAIN FEATURES Location/Qualifiers source 1..865 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="Male" /tissue_type="testis" gene 1..865 /gene="na14" 5'UTR 1..46 /gene="na14" CDS 47..406 /gene="na14" /codon_start=1 /product="nuclear autoantigen fo 14 kDa" /db_xref="PID:e322419" /db_xref="PID:g2706620" /translation="MTQQGAALQNYNNELVKCIEELCQKREELCRQIQEEEDEKQRLQ NEVRQLTEKLARVNENLARKIASRNEFDRTIAETEAAYLKILESFQTLLSVLKREAGN LTKATAPDQKSSGGRDS" 3'UTR 407..865 /gene="na14" polyA_signal 837..842 /gene="na14" polyA_site 854 /gene="na14" BASE COUNT 191 a 260 c 271 g 143 t ORIGIN 1 tgcttccgcg gcggttgggg tggtggggcc ccgggcggcg ttgaccatga cccagcaggg 61 cgcggcgctg cagaactaca acaacgagct ggtcaagtgc atagaggagc tgtgccagaa 121 gcgggaggag ctgtgccggc agatccagga ggaggaggac gagaagcagc ggctgcagaa 181 tgaggtgagg cagctgacag agaagctggc ccgcgtcaac gagaacctgg cacgcaagat 241 tgcctctcgc aacgagttcg accggaccat cgcggagacg gaggccgcct acctcaagat 301 cctggagagc ttccagactt tgctcagcgt tctcaagagg gaagctggga acctgaccaa 361 ggctacagcc ccagaccaga aaagtagcgg cggcagggac agctgaccag accacgggca 421 gggcctgcct ccgtgtgccc ctcagctcag ccccagcaag tgtgtgctca gagcatcttt 481 gttcttcacg gcagcagcta ccttccctca ctgtctcagg tgccgagagg ggcaggtgcc 541 agcctccact ggcatcagtg acaagcccag gcacagccca cccgggggtc ctcgcttcat 601 gctcacacag gctatgggga tggtgggctc caggtcagct ctgcaagggg cttgtctctg 661 tggcacccac actcctgccc tgccagggag gctctggttg tctgagcacc atgggggccc 721 cctcaccttg tccctcctca gccagcagag gcccagggca agggacagga ggacaggggt 781 tctccttcac cacagaaccc aaacctcagg tctcacccct gtggcctgtg attatgaata 841 aagattatct ttgtaaaaaa aaaaa // LOCUS HSP150 5085 bp RNA PRI 04-FEB-1997 DEFINITION H.sapiens mRNA for adaptor protein p150. ACCESSION Y08991 NID g1817583 KEYWORDS adaptor protein; p150 gene; phosphatidylinositol 3-kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5085) AUTHORS Panaretou,C., Domin,J., Cockcroft,S. and Waterfield,M.D. TITLE Characterization of p150, an adaptor protein for the human phosphatidylinositol (PtdIns) 3-kinase. Substrate presentation by phosphatidylinositol transfer protein to the p150.Ptdins 3-kinase complex JOURNAL J. Biol. Chem. 272 (4), 2477-2485 (1997) MEDLINE 97153028 REFERENCE 2 (bases 1 to 5085) AUTHORS Panaretou,C. TITLE Direct Submission JOURNAL Submitted (23-OCT-1996) C. Panaretou, University College London, School of Medicine, Ludwig Inst., Courtauld Building, 91 Riding House Street, London, W1P 8BT, UK FEATURES Location/Qualifiers source 1..5085 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="U937" /clone="3" /clone_lib="lambda ZAP" 5'UTR 1..567 /gene="p150" gene 1..5085 /gene="p150" CDS 568..4644 /gene="p150" /codon_start=1 /product="adaptor protein" /db_xref="PID:e284057" /db_xref="PID:g1817584" /translation="MGNQLAGIAPSQILSVESYFSDIHDFEYDKSLGSTRFFKVARAK HREGLVVVKVFAIQDPTLPLTSYKQELEELKIRLNSAQNCLPFQKASEKASEKAAMLF RQYVRDNLYDRISTRPFLNNIEKRWIAFQILTAVDQAHKSGVRHGDIKTENVMVTSWN WVLLTDFASFKPTYLPEDNPADFNYFFDTSRRRTCYIAPERFVDGGMFATELEYMRDP STPLVDLNSNQRTRGELKRAMDIFSAGCVIAELFTEGVPLFDLSQLLAYRNGHFFPEQ VLNKIEDHSIRELVTQMIHREPDKRLEAEDYLKQQRGNAFPEIFYTFLQPYMAQFAKE TFLSADERILVIRKDLGNIIHNLCGHDLPEKAEGEPKENGLVILVSVITSCLQTLKYC DSKLAALELILHLAPRLSVEILLDRITPYLLHFSNDSVPRVRAEALRTLTKVLALVKE VPRNDINIYPEYILPGIAHLAQDDATIVRLAYAENIALLAETALRFLELVQLKNLNME NDPNNEEIDEVTHPNGNYDTELQALHEMVQQKVVTLLSDPENIVKQTLMENGITRLCV FFGRQKANDVLLSHMITFLNDKNDWHLRGAFFDSIVGVAAYVGWQSSSILKPLLQQGL SDAEEFVIVKALYALTCMCQLGLLQKPHVYEFASDIAPFLCHPNLWIRYGAVGFITVV ARQISTADVYCKLMPYLDPYITQPIIQIERKLVLLSVLKEPVSRSIFDYALRSKDITS LFRHLHMRQKKRNGSLPDCPPPEDPAIAQLLKKLLSQGMTEEEEDKLLALKDFMMKSN KAKANIVDQSHLHDSSQKGVIDLAALGITGRQVDLVKTKQEPDDKRARKHVKQDSNVN EEWKSMFGSLDPPNMPQALPKGSDQEVIQTGKPPRSESSAGICVPLSTSSQVPEVTTV QNKKPVIPVLSSTILPSTYQIRITTCKTELQQLIQQKREQCNAERIAKQMMENAEWES KPPPPGWRPKGLLVAHLHEHKSAVNRIRVSDEHSLFATCSNDGTVKIWNSQKMEGKTT TTRSILTYSRIGGRVKTLTFCQGSHYLAIASDNGAVQLLGIEASKLPKSPKIHPLQSR ILDQKEDGCVVDMHHFNSGAQSVLAYATVNGSLVGWDLRSSSNAWTLKHDLKSGLITS FAVDIHQCWLCIGTSSGTMACWDMRFQLPISSHCHPSRARIRRLSMHPLYQSWVIAAV QGNNEVSMWDMETGDRRFTLWASSAPPLSELQPSPHSVHGIYCSPADGNPILLTAGSD MKIRFWDLAYPERSYVVAGSTSSPSVSYYRKIIEGTEVVQEIQNKQKVGPSDDTPRRG PESLPVGHHDIITDVATFQTTQGFIVTASRDGIVKVWK" 3'UTR 4645..5085 /gene="p150" BASE COUNT 1467 a 1093 c 1132 g 1393 t ORIGIN 1 ggatcccccg ggctgcagga attcggcacg aggggagttc ggcgtttgct ggggctgcag 61 cagctgaagt gtagtgtttt cttgggactg gcggtctgca cttctctccc gggttccatc 121 tccccccgcc cggtggtgag gccctcgagg agggctcgga cgggtgtagc gatccgcgct 181 agaggaagac gaggcccggg aacgcatgtc ccccagggca ggttaggggg ctggaggggt 241 caaatcccgg ggtacttgtg gagactcttt agcgtggctt cttctctctg ctgagacccc 301 gagagctttc ccagttctcc tcccaggacc accggggttc ctgaagatcg ggacttttct 361 gcgcccctcc accaacagcc catctcctgt ctatgaagaa agacccttcg tagaaacaac 421 ttccccgctg ctgacgcgtt ttcccgtccc gtccccgaag tagtctacta tgacctcgtt 481 gtgagcctct gaacgatttt gacactttcc cgaggcctag ggtattatat cctaacctta 541 ctaaagacca cagaggtgct tgccattatg ggaaatcagc ttgctggcat tgctccctcc 601 cagatccttt ctgtagagag ttatttttca gatattcatg actttgaata tgataaaagc 661 ctggggagta ctcggttttt taaagttgct cgagccaagc accgagaagg cctggtcgtt 721 gtgaaggttt ttgcaattca ggatcccaca ttgcctttaa ccagctataa acaagagctg 781 gaggaactga aaatcaggct taattctgca cagaattgtc tacctttcca gaaagcatca 841 gaaaaagcat ctgagaaagc agctatgctc tttaggcagt atgtgcgaga caatctctat 901 gatcgcatca gtacccgtcc attcttgaat aacattgaga agcgctggat tgctttccag 961 atcctgacag ctgtggacca agcacacaaa tctggagttc gtcatgggga catcaagact 1021 gagaatgtga tggtcaccag ttggaattgg gttcttctaa ctgattttgc cagttttaag 1081 cccacttatc ttccagaaga caacccggca gatttcaatt atttctttga cacatcacgg 1141 aggagaactt gctatattgc tcctgaacgt tttgttgatg gtgggatgtt tgccactgag 1201 ttagaatata tgagagatcc ttcaactccg cttgtagact taaatagcaa tcagagaaca 1261 agaggagagt tgaagagagc aatggacatc ttttcagcag gttgtgtgat agctgagctt 1321 tttacagaag gtgtaccatt atttgatctc tctcaacttt tggcttatag aaatggacat 1381 tttttccctg aacaagtgct aaataaaatt gaagatcaca gtatcagaga attggtaact 1441 cagatgattc accgtgagcc agataaacgt ttagaggcag aagattactt aaaacagcag 1501 cgtggcaatg cctttcctga aatattttac acttttcttc agccctacat ggcccagttt 1561 gccaaggaaa cgtttctttc tgcagatgag cgtattctgg ttatacggaa ggatttgggc 1621 aacattattc acaatctctg tggacatgat ctgccagaaa aagccgaagg agagcctaag 1681 gaaaatgggc tggttatctt ggtatctgtt ataacatcct gcctacagac ccttaaatac 1741 tgtgattcca aactagctgc tttggaactg attcttcatt tggctccaag attaagtgtt 1801 gaaatccttt tggatcgtat tactccatat cttttgcatt tcagcaatga ctctgttcct 1861 agggtgaggg ctgaagcctt gaggacgttg accaaagttc ttgctctcgt caaagaggtt 1921 cctcgtaatg atatcaatat ttatccggaa tacattctgc caggcatagc ccacttagcc 1981 caagatgatg ctactatcgt tagactagcc tatgctgaaa acatagctct gctggcagaa 2041 acagctctga gattcctgga attagtacag ttaaaaaatc ttaatatgga aaatgacccc 2101 aataatgaag aaatagatga ggttacacat ccaaatggaa attatgacac agagctccaa 2161 gccttacatg aaatggtcca gcagaaagtt gttactttgc taagtgaccc tgaaaatatt 2221 gtaaaacaaa ccttgatgga aaatggaata acacggctgt gtgtattctt tggacgtcag 2281 aaagccaacg atgttttgtt gtcccacatg attactttcc taaatgataa gaatgattgg 2341 catctacgtg gagcattttt tgatagtata gttggtgttg ctgcctatgt tggctggcaa 2401 agctcctcaa ttctcaagcc tctgctgcaa caaggtctta gtgatgctga ggaatttgtc 2461 attgtgaaag ctctttatgc ccttacttgt atgtgccagt taggactgct acaaaaaccc 2521 catgtttacg aatttgccag tgatattgcc cccttcctgt gtcatcccaa tttatggata 2581 cgttatggtg ccgtgggatt tatcacagtg gtagctcgtc aaataagtac agctgatgtc 2641 tactgtaaac tgatgcctta tcttgaccca tatattaccc aaccaataat acagattgaa 2701 agaaaacttg ttctgctcag tgttttaaag gaaccagtaa gtcgttctat atttgattat 2761 gctttgaggt ctaaagatat tactagcttg ttcagacatc ttcacatgcg tcagaagaaa 2821 cgaaatggtt ctcttcccga ctgccctccg ccagaggatc ctgccatagc acagcttctg 2881 aagaagttgc tctcacaggg aatgacagag gaagaggaag acaaacttct ggcactgaaa 2941 gacttcatga tgaaatctaa taaagcaaag gccaatatag tggaccagag ccatcttcat 3001 gatagtagtc agaaaggtgt aattgacttg gcagctttag gcataactgg gagacaagtt 3061 gatcttgtta aaaccaaaca agaaccagat gacaaacggg ccagaaaaca tgtaaaacaa 3121 gactcaaatg taaatgaaga atggaaaagc atgtttgggt cactggaccc accaaacatg 3181 ccacaggccc tacctaaagg gagtgatcag gaggtgattc agactgggaa acctcctcgt 3241 tccgagtcct ctgctggcat ttgtgtccct ttgtcaactt cttcacaggt tccagaagtg 3301 acaactgtcc aaaataaaaa accagtaata ccggttttaa gtagtacaat cttaccatcc 3361 acctatcaga ttcgaattac aacttgtaaa actgaacttc agcaactcat ccagcaaaag 3421 cgggagcagt gcaatgctga gagaatagct aagcagatga tggaaaatgc tgaatgggag 3481 agtaaaccac caccacctgg atggcgtcct aaagggctgt tagttgccca tcttcatgag 3541 cataaatctg ctgtgaatcg aattagagtc tctgatgaac actcactttt tgcaacatgt 3601 tcaaatgatg gcacagtgaa aatctggaac agtcaaaaga tggaggggaa gaccaccact 3661 accagatcta ttcttacata cagccgaatt ggaggacgag tcaagacgct cacattctgc 3721 caaggctccc actatttagc catagcatct gataatggtg ctgtccagct tcttggaatt 3781 gaggcttcta agctgcccaa gtctcctaaa atccatcctc tacaaagcag aattctagat 3841 cagaaggagg acggttgtgt tgtggatatg catcacttca actctggagc acagtctgtt 3901 cttgcctatg ccactgtgaa tggctctctg gttggctggg accttaggtc ttcaagcaat 3961 gcgtggactt taaagcatga tttaaagtcg ggcctcatca cttcctttgc tgtggacatc 4021 caccaatgct ggctctgcat tggtacaagc agtggtacca tggcttgttg ggacatgagg 4081 ttccagttgc caatttcaag tcactgtcat ccttccaggg ctcgaatcag acgcctctca 4141 atgcaccctc tgtatcagtc ctgggtgatt gcagctgttc agggcaacaa cgaagtgtcc 4201 atgtgggaca tggagactgg tgacagaaga tttactctct gggccagcag tgcaccacca 4261 ctttctgaat tacagccttc tcctcatagc gtccatggta tctactgtag tcctgcagat 4321 ggaaatccta tcctactaac agctggctca gatatgaaaa taaggttttg ggacttggct 4381 tacccagaaa ggtcctatgt tgttgcagga agtactagtt ccccatctgt gtcctactac 4441 aggaaaataa ttgaaggcac tgaagttgtc caggaaattc agaataagca gaaagtagga 4501 ccaagtgatg acacccctcg aaggggccca gagtccctgc ccgtgggaca tcatgacatc 4561 atcactgatg tcgccacatt ccagaccaca cagggcttca tcgtaactgc ttctagagat 4621 gggattgtga aggtgtggaa ataaaaccta ctgatttgta taaattttaa tagttataaa 4681 tataatacta taactcgaga aaaggcattt ctagagaaca gattcatttg cttaattttc 4741 aaaattatgt ctccatatta ctgtttcatg actgactgac taaatgacac ccaaaatggt 4801 taagatgtac ttgactagtt tacttatgca tctctttgca agaatcagcc agccaacaat 4861 gtctgggatt tttattgtat atgttataga ggtgagaaat gtaaaatatg aaaatgaata 4921 tgtttatttt gtattgaaaa agatggttga aaagatggtt gtaagctatt atagtataaa 4981 cacatttttg ctattaaaaa tgctattcaa agcagttaaa ctgtaaaaaa aaaaaaaaaa 5041 aaaaaaaaaa aaaaaaaaaa aaaactcgag ggggggcccg gtacc // LOCUS HSP162 4780 bp RNA PRI 08-JAN-1996 DEFINITION H.sapiens p162 mRNA. ACCESSION X78998 X86691 NID g475933 KEYWORDS endosomal protein; P162 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4780) AUTHORS Seelig,H.P. JOURNAL Unpublished REFERENCE 2 (bases 1 to 4780) AUTHORS Seelig,H.P. TITLE Direct Submission JOURNAL Submitted (26-APR-1994) H.P. Seelig, Inst. of Immunology & Mol. Genetics, Kriegsstr. 99, 76133 Karlsruhe, FRG FEATURES Location/Qualifiers source 1..4780 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Hela" gene 72..4307 /gene="P162" CDS 72..4307 /gene="P162" /codon_start=1 /product="endosomal protein" /db_xref="PID:g475934" /translation="MLRRILQRTPGRVGSQGSDLDSSATPINTVDVNNESSSEGFICP QCMKSLGSADELFKHYEAVHDAGNDSGHGGESNLALKRDDVTLLRQEVQDLQASLKEE KWYSEELKKELEKYQGLQQQEAKPDGLVTDSSAELQSLEQQLEEAQTENFNIKQMKDL FEQKAAQLATEIADIKSKYDEERSLREAAEQKVTRLTEELNKEATVIQDLKTELLQRP GIEDVAVLKKELVQVQTLMDNMTLERERESEKLKDECKKLQSQYASSEATISQLRSEL AKGPQEVAVYVQELQKLKSSVNELTQKNQTLTENLLKKEQDYTKLEEKHNEESVSKKN IQATLHQKDLDCQQLQSRLSASETSLHRIHVELSEKGEATQKLKEELSEVETKYQHLK AEFKQLQQQREEKEQHGLQLQSEINQLHSKLLETERQLGEAHGRLKEQRQLSSEKLMD KEQQVADLQLKLSRLEEQLKEKVTNSTELQHQLDKTKQQHQEQQALQQSTTAKLREAQ NDLEQVLRQIGDKDQKIQNLEALLQKSKENISLLEKEREDLYAKIQAGEGETAVLNQL QEKNHTLQEQVTQLTEKLKNQSESHKQAQENLHDQVQEQKAHLRAAQDRVLSLETSVN ELNSQLNESKEKVSQLDIQIKAKTELLLSAEAAKTAQRADLQNHLDTAQNALQDKQQE LNKITTQLDQVTAKLQDKQEHCSQLESHLKEYKEKYLSLEQKTEELEGQIKKLEADSL EVKASKEQALQDLQQQRQLNTDLELRATELSKQLEMEKEIVSSTRLDLQKKSEALESI KQKLTKQEEEKQILKQDFETLSQETKIQHEELNNRIQTTVTELQKVKMEKEALMTELS TVKDKLSKVSDSLKNSKSEFEKENQKGKAAILDLEKTCKELKHQLQVQMENTLKEQKE LKKSLEKEKEASHQLKLELNSMQEQLIQAQNTLKQNEKEEQQLQGNINELKQSSEQKK KQIEALQGELKIAVLQKTELENKLQQQLTQAAQELAAEKEKISVLQNNYEKSQETFKQ LQSDFYGRESELLATRQDLKSVEEKLSLAQEDLISNRNQIGNQNKLIQELKTAKATLE QDSAKKEQQLQERCKALQDIQKEKSLKEKELVNEKSKLAEIEEIKCRQEKEITKLNEE LKSHKLESIKEITNLKDAKQLLIQQKLELQGKADSLKAAVEQEKRNQQILKDQVKKEE EELKKEFIEKEAKLHSEIKEKEVGMKKHEENEAKLTMQITALNENLGTVKKEWQSSQR RVSELEKQTDDLRGEIAVLEATVQNNQDERRALLERCLKGEGEIEKLQTKVLELQRKL DNTTAAVQELGRENQSLQIKHTQALNRKWAEDNEVQNCMACGKGFSVTVRRHHCRQCG NIFCAECSAKNALTPSSKKPVRVCDACFNDLQG" BASE COUNT 1916 a 801 c 1008 g 1055 t ORIGIN 1 gccccgagta gtgagtggcc ccgcgcaggg tctggagagt caccgcggcg gcgccgggtg 61 gtggttaaac catgttaagg aggattttac agaggactcc tgggagagtt ggctctcaag 121 gttctgattt agattcatca gcaactccta taaacacagt ggacgtcaat aatgaaagct 181 cttcagaggg tttcatatgt ccccagtgta tgaaatctct tggatctgct gatgaacttt 241 tcaaacatta tgaagctgtt catgatgctg gtaatgactc aggtcatgga ggagagtcta 301 atcttgcttt gaagcgagat gatgtaacac tgctcagaca agaggtccaa gacctacagg 361 cttcacttaa ggaagaaaaa tggtactcgg aagaattaaa gaaggaatta gaaaaatatc 421 aagggctgca gcagcaagag gccaaacctg atgggttggt gactgattca tcagcagaac 481 tacagtcttt ggaacagcaa ttagaagaag cccaaacaga aaattttaat attaagcaaa 541 tgaaagactt atttgaacag aaagcagccc aacttgctac tgaaattgca gatataaagt 601 caaagtatga tgaagaaagg agtcttcgag aagctgctga acaaaaagtg acacgtctga 661 cagaagaatt aaacaaagag gcaactgtaa ttcaagatct gaagacggaa ctgcttcaga 721 gacctggtat agaagatgtt gccgtgctaa agaaagaact ggtccaagtt caaacactaa 781 tggataacat gaccttggaa cgtgagcgag aatctgaaaa actcaaagat gaatgcaaaa 841 aattgcagtc acaatatgct agctcagagg ccacaataag ccagctaagg agtgaacttg 901 ccaaaggccc ccaggaagtt gctgtatatg tacaggaact acaaaaactg aaaagttcag 961 ttaatgaatt aacacaaaaa aatcagacct tgacagaaaa cttgctgaaa aaagaacaag 1021 actatactaa gttagaggag aaacataatg aagaatctgt gagtaaaaag aatattcagg 1081 caacccttca tcaaaaagac ctagattgtc aacagcttca gtcaagattg tctgcatctg 1141 aaacctcact gcatagaata catgtagaac taagtgaaaa aggagaagct actcaaaagc 1201 tcaaagaaga attatctgag gtagagacca agtaccagca tctaaaggcg gagtttaagc 1261 agctacaaca acagagagaa gaaaaggagc agcatgggtt acaactccaa agtgaaatta 1321 atcaattaca tagcaaactt ctggagacag agcgccaact aggggaagct catggtaggc 1381 tgaaggaaca gagacagctt tcaagtgaaa agttgatgga taaagaacaa caagtggctg 1441 atttacaact caaactttct cggttagaag agcagttgaa ggaaaaagtt acaaattcta 1501 cagaattgca gcatcaatta gataaaacaa agcaacagca tcaagaacaa caggctcttc 1561 agcaaagcac cacggcaaaa cttcgagaag ctcagaatga tttggaacaa gttctacgtc 1621 aaattggcga taaggaccaa aagatccaga accttgaagc tttattacag aagagtaaag 1681 aaaatatttc attactagaa aaagaaagag aagatcttta tgcaaaaatt caggctggtg 1741 aaggagagac tgctgttctt aaccagttac aagaaaaaaa ccatacacta caggagcaag 1801 taactcaact aacagagaag ctgaagaatc agtcagaaag tcataaacaa gcccaggaga 1861 atttgcatga ccaggtacaa gagcagaagg cacatcttag agctgcacaa gaccgtgtcc 1921 tttccctaga aactagtgtc aatgaattaa atagtcaatt aaatgaaagc aaggagaagg 1981 tctcccagct tgacatacag attaaagcca aaaccgaact attactatca gcagaagcag 2041 caaaaactgc tcaaagagct gatcttcaga atcatttgga cacagctcaa aatgcattac 2101 aagataaaca gcaggagtta aataagatta ctactcagtt ggatcaggtc actgcaaagt 2161 tacaagacaa gcaagaacat tgcagtcagc tggaaagtca tcttaaagaa tataaagaga 2221 aatacctctc tttagaacag aaaaccgaag agctagaagg tcaaattaag aaactagaag 2281 ctgatagtct tgaagttaaa gcaagcaagg agcaggcttt gcaagatcta caacagcaaa 2341 gacagctgaa cacagattta gagctcagag ccacagaatt gagtaaacaa cttgaaatgg 2401 agaaggaaat agtatccagt acaagattgg atctacagaa aaaatctgaa gcccttgaaa 2461 gtatcaagca aaagcttacc aagcaagagg aagaaaaaca aatcctgaaa caagattttg 2521 aaactttaag tcaagaaaca aagattcagc atgaggaatt gaataacaga attcaaacaa 2581 cagtaacaga actacaaaaa gtgaaaatgg agaaagaagc tttaatgaca gagctttcta 2641 cagtaaagga caaactatca aaagtttctg attctttgaa aaactctaaa agtgaatttg 2701 aaaaggagaa tcagaaagga aaagccgcta tattagactt ggaaaaaact tgcaaagaat 2761 taaagcatca acttcaagtg cagatggaaa acacacttaa ggaacagaag gaactgaaaa 2821 agtcacttga aaaagagaag gaggcttctc atcagttgaa attggaactc aattcaatgc 2881 aggaacaact tatacaggcc cagaatactt taaaacaaaa tgaaaaagaa gagcaacaac 2941 ttcaggggaa cataaatgag ctaaagcaat caagtgaaca gaagaaaaaa caaattgaag 3001 cactccaagg agagcttaaa attgctgttt tacagaagac agagcttgag aataaactac 3061 agcagcagtt aacacaggca gcccaggaac ttgcagcaga gaaagagaaa atatcagtat 3121 tacaaaacaa ctatgaaaaa agtcaggaaa ctttcaaaca gcttcaatct gatttctatg 3181 ggagggaatc tgaacttcta gccaccaggc aagatcttaa gtctgtagaa gagaagcttt 3241 ctctagcaca ggaggacttg atttcaaaca gaaatcaaat tggaaatcaa aataaattga 3301 ttcaagaact gaagactgcc aaggctacat tggagcagga ttcagcaaag aaagaacagc 3361 aattgcagga gcgatgtaaa gcactacaag acattcagaa agaaaagtca ctgaaagaaa 3421 aagaactggt aaatgagaag tctaaattgg cagagataga agaaattaaa tgtagacaag 3481 aaaaagaaat cactaaacta aacgaagaac tcaagtccca caaactagaa agcataaagg 3541 agataacaaa tcttaaagat gctaaacagc ttctaattca gcagaaatta gaacttcaag 3601 gaaaagcgga ctccctgaag gcagctgttg aacaggagaa gagaaatcag cagatactaa 3661 aagaccaggt gaaaaaggaa gaagaggagc tgaagaaaga atttattgag aaagaagcta 3721 agttgcattc cgaaataaaa gaaaaggaag taggaatgaa gaagcatgaa gaaaatgagg 3781 ctaaacttac catgcagatt acagcattaa atgaaaactt aggcactgtg aagaaggagt 3841 ggcaatctag tcaacggaga gttagtgagc ttgagaaaca aacggatgac ttacggggtg 3901 aaattgcagt attagaagca acggttcaga ataatcaaga tgaaaggaga gcactactgg 3961 aaagatgtct taaaggagaa ggtgaaatag aaaagcttca aaccaaagta ttagaattgc 4021 aaagaaagct ggataataca actgcagcag tgcaggagct gggcagagaa aaccaatcac 4081 ttcagatcaa acatacacaa gcgttgaata gaaagtgggc cgaagacaat gaagtacaaa 4141 actgtatggc ctgtgggaaa ggcttttcag taacagtgag acggcatcac tgccgacagt 4201 gtggaaatat cttctgtgct gaatgttcag ccaaaaatgc cttaactcct tcctccaaga 4261 agcctgttcg tgtctgtgat gcatgtttca atgacttgca aggataatgg gttatcacaa 4321 cttcagagta atattacact aacattagat ttttaataaa tgtacttaat agaggtcttg 4381 gactactatt tggtttggac actggttgca atactagacc aaataggaat tagtatattt 4441 tggaatgtta tggatatcaa gtaaaactct tctatttttg tgagacttgg gcttatcatc 4501 tttcattact tttttcccat ttagcccctg gaatagcttt catgccaagt ttagaattag 4561 ccaatgaaat gtaaataaac tttggacaga aaatagcaat gtattttttt aatataattt 4621 cattattcac aaatgaagaa tgcaattcca tagcttactt ctttctccta aatttaatgt 4681 acaaaaatgt acatggatga tattatcact tgttgctgtt tttgttgcac aatagcacat 4741 taattatatt aaatattctc tttgtgaagt taaaaaaaaa // LOCUS HSP1H 2575 bp RNA PRI 20-SEP-1996 DEFINITION H.sapiens mRNA for P1 protein (P1.h). ACCESSION X62153 NID g397871 KEYWORDS p1 protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2575) AUTHORS Knippers,R. TITLE Direct Submission JOURNAL Submitted (16-SEP-1991) R. Knippers, Universitaet Konstanz, Fakultaet fuer Biologie, D-775 Konstanz, FRG REMARK revised by [3] REFERENCE 2 (bases 1 to 2575) AUTHORS Thommes,P., Fett,R., Schray,B., Burkhart,R., Barnes,M., Kennedy,C., Brown,N.C. and Knippers,R. TITLE Properties of the nuclear P1 protein, a mammalian homologue of the yeast Mcm3 replication protein JOURNAL Nucleic Acids Res. 20 (5), 1069-1074 (1992) MEDLINE 92195806 REFERENCE 3 (bases 1 to 2575) AUTHORS Schulte,D. TITLE Direct Submission JOURNAL Submitted (03-SEP-1993) R. Knippers, Universitaet Konstanz, Fakultaet fuer Biologie, D-775 Konstanz, FRG REFERENCE 4 (bases 1 to 2575) AUTHORS Hu,B., Burkhart,R., Schulte,D., Musahl,C. and Knippers,R. TITLE The P1 family: a new class of nuclear mammalian proteins related to the yeast Mcm replication proteins JOURNAL Nucleic Acids Res. 21 (23), 5289-5293 (1993) MEDLINE 94089373 COMMENT High sequence similarity to yeast Mcm 3 sequence, X53540. FEATURES Location/Qualifiers source 1..2575 /organism="Homo sapiens" /isolate="P1.h" /db_xref="taxon:9606" /cell_type="epithelian" /haplotype="aneuploid" /tissue_type="cervix" /clone="DSzap10" /chromosome="6" /clone_lib="UNI-ZAP XR" /map="p12" /dev_stage="carcinoma" /cell_line="HeLa S3" CDS 47..2473 /codon_start=1 /product="P1.h protein" /db_xref="PID:g397872" /db_xref="SWISS-PROT:P25205" /translation="MAGTVVLDDVELREAQRDYLDFLDDEEDQGIYQSKVRELISDNQ YRLIVNVNDLRRKNEKRANRLLNNAFEEMVAFQRALKDFVASIDATYAKQYEEFYVGL EGSFGSKHVSPRTLTSCFLSCVVCVEGLCTKCSLVRSKVVRSVHYCPATKKTIERRYS DLTTLVAFPSSSVYPTKDEENNPLETEYGLSVYRYHQTITIQEMPEKAPAGQLPRSVD VILDDDLVDKAKPGDRVQVVGTYRCVPGKKGGYTSGTFRTVLIACNVKQMSKDAQPSF SAEDIAKIKKFSKTRSKDIFEQLAKSLAPSIHGHDYVKKAILCLLLGGVERDLENGSH IRGDINILLIGDPSVAKSQLLRYVLCTAPRAIPTTGRGSSGVGLTAAVTTDQETGERR LEAGAMVLADRGVVCIDEFDKMSDMDRTAIHEVMEQGRVTIAKAGIHARLNARCSVLA AANPVYGRYDQYKTPMENIGLQDSLLSRFDLLFIMLDQMDPEQDREISDHVLRIHGYR APGEQDGDAMPLGSAVDILATDDPNFSQEDQQDTQIYEKHDNLLHGTKKKKEKMVSAA FMKKYIHVAKIIKPVLTQESATYIAEEYSRLRSQDSMSSDTARTSPVTARTLETLIRL ATAHAKARMSKTVDLQDAEEAVELVQYAYFKKVLEKEKKRKKRSEDESETEDEEEKSQ EDQEQKRKRRKTRQPDAKDGDSYDPYDFSDTEEEMPQVHTPKTADSQETKESQKVELS ESRLKAFKVALLDVFREAHAQSIGMNRLTESINRDSEEPFSSVEIQAALSKMQDDNQV MVSEGIIFLI" BASE COUNT 660 a 635 c 722 g 558 t ORIGIN 1 cggcacgagg cacgactttg gtggaggtag ttctttggca gcgggcatgg cgggtaccgt 61 ggtgctggac gatgtggagc tgcgggaggc tcagagagat tacctggact tcctggacga 121 cgaggaagac cagggaattt atcagagcaa agttcgggag ctgatcagtg acaaccaata 181 ccggctgatt gtcaatgtga atgacctgcg caggaaaaac gagaagaggg ctaaccggct 241 tctgaacaat gcctttgagg agatggttgc cttccagcgg gccttaaagg attttgtggc 301 ctccattgat gctacctatg ccaagcagta tgaggagttc tacgtaggac tggaaggcag 361 ctttggctcc aagcacgtct ccccgcggac tcttacctcc tgcttcctca gctgtgtggt 421 ctgtgtggag ggcttatgca ctaaatgttc tctagttcgg tccaaagtcg ttcgcagtgt 481 ccactactgt cctgctacta agaagaccat agagcgacgt tattctgatc tcaccaccct 541 ggtggctttc ccatccagct ctgtctatcc taccaaggat gaggagaaca atcccttgga 601 gacagaatat ggcctttctg tctacaggta ccaccagacc atcaccatcc aggagatgcc 661 agagaaggcc ccagccggcc agctccctcg ctctgtggac gtcattctgg atgatgactt 721 ggtggataaa gcgaagcctg gtgaccgggt tcaggtggtg ggaacctacc gttgcgttcc 781 tggaaagaag ggaggctaca cctctgggac cttcaggact gtcctgattg cctgtaatgt 841 taagcagatg agcaaggatg ctcagccctc tttctctgct gaggatatag ccaagatcaa 901 gaagttcagt aaaacccgat ccaaggatat ctttgagcag ctggccaagt cattggcccc 961 aagtatccat gggcatgact atgtcaagaa agcaatcctc tgcttgctct tgggaggggt 1021 ggaacgagac ctagaaaatg gcagccacat ccgtggggac atcaatattc ttctaatagg 1081 agacccatcc gttgccaagt ctcagcttct gcggtatgtg ctttgcactg caccccgagc 1141 tatccccacc actggccggg gctcctctgg agtgggtctg acggctgctg tcaccacaga 1201 ccaggaaaca ggagagcgcc gtctggaagc aggggccatg gtcctggctg accgaggcgt 1261 ggtttgcatt gatgaatttg acaaaatgtc tgacatggat cgcacagcca tccatgaagt 1321 gatggagcag ggtcgagtga ccattgccaa ggctggcatc catgctcggc tgaatgcccg 1381 ctgcagtgtt ttggcagctg ccaatcctgt ctacggcagg tatgaccagt ataagactcc 1441 aatggagaac attgggctac aggactcact gctgtcacga tttgacttgc tcttcatcat 1501 gctggatcag atggatcctg agcaggatcg ggagatctca gaccatgtcc ttcggataca 1561 cggttacaga gcacctgggg agcaggatgg cgatgctatg ccattgggta gtgctgtgga 1621 tatcctggcc acagatgatc ccaactttag ccaggaagat cagcaggaca cccagattta 1681 tgagaagcat gacaaccttc tacatgggac caagaagaaa aaggagaaga tggtgagtgc 1741 agcattcatg aagaagtaca tccatgtggc caaaatcatc aagcctgtcc tgacacagga 1801 gtcggccacc tacattgcag aagagtattc acgcctgcgc agccaggata gcatgagctc 1861 agacaccgcc aggacatctc cagttacagc ccgaacactg gaaactctga ttcgactggc 1921 cacagcccat gcgaaggccc gcatgagcaa gactgtggac ctgcaggatg cagaggaagc 1981 tgtggagttg gtccagtatg cttactttaa gaaggttctg gagaaggaga agaaacgtaa 2041 gaagcgaagt gaggatgaat cagagacaga agatgaagag gagaaaagcc aagaggacca 2101 ggagcagaag aggaagagaa ggaagactcg ccagccagat gccaaagatg gggattcata 2161 cgacccctat gacttcagtg acacagagga ggaaatgcct caagtacaca ctccaaagac 2221 ggcagactca caggagacca aggaatccca gaaagtggag ttgagtgaat ccaggttgaa 2281 ggcattcaag gtggccctct tggatgtgtt ccgggaagct catgcgcagt caatcggcat 2341 gaatcgcctc acagaatcca tcaaccggga cagcgaagag cccttctctt cagttgagat 2401 ccaggctgct ctgagcaaga tgcaggatga caatcaggtc atggtgtctg agggcatcat 2461 cttcctcatc tgaggaggcc tcgtctctga acttgggttg tgccgagaga gtttgttctg 2521 tgtttcccac cctctccctg acccaagtct ttgcctctac tcccttaaca gtgtt // LOCUS HSP27 597 bp RNA PRI 04-NOV-1993 DEFINITION H.sapiens p27 mRNA. ACCESSION X67325 NID g35183 KEYWORDS hydrophobic protein; interferon-alpha inducible gene; p27 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 597) AUTHORS Rasmussen,U.B., Wolf,C., Mattei,M.G., Chenard,M.P., Bellocq,J.P., Chambon,P., Rio,M.C. and Basset,P. TITLE Identification of a new interferon-alpha-inducible gene (p27) on human chromosome 14q32 and its expression in breast carcinoma JOURNAL Cancer Res. 53 (17), 4096-4101 (1993) MEDLINE 93364913 REFERENCE 2 (bases 1 to 597) AUTHORS Rasmussen,U.B. TITLE Direct Submission JOURNAL Submitted (20-JUL-1992) U.B. Rasmussen, Transgene, Dept. of Mol. Cell Biology, 11, rue de Molsheim, F-67082 Strasbourg, FRANCE FEATURES Location/Qualifiers source 1..597 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="breast adenocarcinoma" /cell_line="MCF-7" /clone_lib="ATCC" /chromosome="14q32" gene 55..570 /gene="p27" CDS 55..423 /gene="p27" /codon_start=1 /db_xref="PID:g35184" /db_xref="SWISS-PROT:P40305" /translation="MEASALTSSAVTSVAKVVRVASGSAVVLPLARIATVVIGGVVAM AAVPMVLSAMGFTAAGIASSSIAAKMMSAAAIANGGGVASGSLVGTLQSLGATGLSGL TKFILGSIGSAIAAVIARFY" polyA_signal 565..570 /gene="p27" BASE COUNT 134 a 161 c 168 g 134 t ORIGIN 1 agctgaagtt gaggatctct tactctctaa gccacggaat taacccgagc aggcatggag 61 gcctctgctc tcacctcatc agcagtgacc agtgtggcca aagtggtcag ggtggcctct 121 ggctctgccg tagttttgcc cctggccagg attgctacag ttgtgattgg aggagttgtg 181 gccatggcgg ctgtgcccat ggtgctcagt gccatgggct tcactgcggc gggaatcgcc 241 tcgtcctcca tagcagccaa gatgatgtcc gcggcggcca ttgccaatgg gggtggagtt 301 gcctcgggca gccttgtggg tactctgcag tcactgggag caactggact ctccggattg 361 accaagttca tcctgggctc cattgggtct gccattgcgg ctgtcattgc gaggttctac 421 tagctccctg cccctcgccc tgcagagaag agaaccatgc caggggagaa ggcacccagc 481 catcctgacc cagcgaggag ccaactatcc caaatatacc tgggtgaaat ataccaaatt 541 ctgcatctcc agaggaaaat aagaaataaa gatgaattgt tgcaactctt aaaaaaa // LOCUS HSP2RNA 2150 bp RNA PRI 17-FEB-1997 DEFINITION H.sapiens mRNA for P2 protein of peripheral myelin. ACCESSION X62167 NID g35185 KEYWORDS myelin; myelin protein; P2 protein; peripheral myelin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2150) AUTHORS Hayasaka,K. TITLE Direct Submission JOURNAL Submitted (13-SEP-1991) K. Hayasaka, Dept of Pediatrics, Akita Univ School of Medicine, 1-1-1 Hondo, Akita 010, JAPAN REFERENCE 2 (bases 1 to 2150) AUTHORS Hayasaka,K., Nanao,K., Tahara,M., Sato,W., Takada,G., Miura,M. and Uyemura,K. TITLE Isolation and sequence determination of cDNA encoding P2 protein of human peripheral myelin JOURNAL Biochem. Biophys. Res. Commun. 181 (1), 204-207 (1991) MEDLINE 92068191 FEATURES Location/Qualifiers source 1..2150 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="foetus" /tissue_type="spinal cord" /clone_lib="lambda gt11 cDNA library of human foetus spinal cord" /clone="A2h" CDS 35..433 /codon_start=1 /product="P2 protein of peripheral myelin" /db_xref="PID:g35186" /db_xref="SWISS-PROT:P02689" /translation="MSNKFLGTWKLVSSENFDDYMKALGVGLATRKLGNLAKPTVIIS KKGDIITIRTESTFKNTEISFKLGQEFEETTADNRKTKSIVTLQRGSLNQVQRWDGKE TTIKRKLVNGKMVAECKMKGVVCTRIYEKV" variation 76 /note="polymorphism" /replace="t" BASE COUNT 771 a 344 c 404 g 631 t ORIGIN 1 cgcttagaac tgtgttgagc tctcacccat cacgatgagc aacaaattcc tgggcacctg 61 gaaacttgtc tctagcgaga actttgacga ttacatgaaa gctctgggtg tggggttagc 121 caccagaaaa ctgggaaatt tggccaaacc cactgtgatc atcagcaaga aaggagatat 181 tataactata cgaactgaaa gtacctttaa aaatacagaa atctccttca agctaggcca 241 ggaatttgaa gaaaccacag ctgacaatag aaagaccaag agcatcgtaa ccctgcagag 301 aggatcactg aatcaagtgc agagatggga tggcaaagag acaaccataa agagaaagct 361 agtgaatggg aaaatggtag cggaatgtaa aatgaagggc gtggtgtgca ccagaatcta 421 tgagaaggtc tgaaaaatca tttcttcatt gaagtggctt tttatcattt aatgatggaa 481 atcaattgct tccattgaca aaactgaata cactgcaaat atttgttttt gcttttgtct 541 taatatatca gatatgcaaa ggcctaaact gagaattaat ctaaaagtca gtgttattta 601 aacattttca atgtgcatgc atgtcattat tacatcaaag catatatatt ggccagacac 661 aaacagttga tgatgtcatt caattaacta caaaattcta atctatgttg aactttgtat 721 acttgaaatg ataataaaaa ggatataatt tcttagtaaa atgaaatcaa agtattgatc 781 agggtagcaa actcaaatgc tgacaggggc cagaggagat atggggaagg agcatcagaa 841 atgaggcaag ctaggagaat gggctattat aatgtaaaga attgtagtct cagttaaaag 901 gggtagcctc tactccagcc aacattttaa aattaatgga taatttatag acagttaaat 961 ttatagacag ttaagtaaaa atggataatt tatagacaga taatttatag acaggtaaat 1021 gtgagttaaa tataactcac atcccactca agacacaaaa cattttctta atcctagtac 1081 atttttttct gtcccttccc aatcagtgtc cttttctgtt ccacccctac caaaagcaag 1141 tagtggtttg gtttctatca tatagattaa ttttacctgc tcatatgaag ggaattgtac 1201 atcatgcatt cttttctgtt tgcctttttt aaattcagca tcatgttttt gtgatacatc 1261 cacattgttg catgcagctg tagtttgttt ctttttatta ccaagtacta tttcattgta 1321 tgaatatatc acagtttatc cattctacta ttaagacaat tgagctattt ctaattttcg 1381 gctgctatga ataaagctgc tacaaatatt tttgtacaag actttttgta aacataggag 1441 tccctttatt ttaaataaat aactagtcat atcattaggt ccagtaattg ttgacaggca 1501 ggaacggggg accattgcat tgtgcccaag taataataaa actatttcag atgtattata 1561 tgattgagca aatgagaaaa catgttgatg ttgatgggag tcaggatgtt cactatggaa 1621 aaacaaatat acaaatatga aatgagggaa ggcaagaaag aaccatgtgg aaatggaata 1681 gaattggtat aaattcataa tttctaaacc atgtatatgt acgtttatat gtattataat 1741 tgcatacaca tgcctccatg catatatgtg tgtgataata cacatgcatt tatgtgcgtg 1801 tgtgtataca catgcatata tttactaatc ctatctgcca aaatggctta gacacaaaaa 1861 cacctcagca gaaatgaata tacctagcac tcagatcttc gtgtctaata tagtttgcca 1921 ctaaaaggaa ccaaggctac ttggaaaaat ggatgattcc aaagcaaggg caaggtagga 1981 acaagatgag cttgaaatat cttgttatgc cagaaagtaa tgttaaaaaa aaaaaataga 2041 ggtatattgc caaaacatag agccagcttg aaggggctcc cactggccaa acttgagcca 2101 atctgagagc aaaataattg agaaaaataa ataacaagat aattgaaaaa // LOCUS HSP2X4PC 1389 bp RNA PRI 14-JAN-1997 DEFINITION H.sapiens mRNA for P2X4 purinoceptor. ACCESSION Y07684 NID g1781008 KEYWORDS P2X4 purinoceptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1389) AUTHORS Garcia-Guzman,M., Soto,F., Gomez-Hernandez,J.M., Lund,P.E. and Stuhmer,W. TITLE Characterization of recombinant human P2X4 receptor reveals pharmacological differences to the rat homologue JOURNAL Mol. Pharmacol. 51 (1), 109-118 (1997) MEDLINE 97168759 REFERENCE 2 (bases 1 to 1389) AUTHORS Garcia-Guzman,M. TITLE Direct Submission JOURNAL Submitted (27-AUG-1996) M. Garcia-Guzman, Max-Planck institut fuer Experimentelle Medizin, (Abt. XI), Hermann-Rein Strasse, 3, D- 37075-Goettingen, FRG FEATURES Location/Qualifiers source 1..1389 /organism="Homo sapiens" /note="caucasian" /db_xref="taxon:9606" /sex="male" /dev_stage="57 year old adult" /tissue_type="whole cerebral brain" CDS 28..1194 /codon_start=1 /product="P2X4 purinoceptor" /db_xref="PID:e276428" /db_xref="PID:g1781009" /translation="MAGCCSALAAFLFEYDTPRIVLIRSRKVGLMNRAVQLLILAYVI GWVFVWEKGYQETDSVVSSVTTKVKGVAVTNTSKLGFRIWDVADYVIPAQEENSLFVM TNVILTMNQTQGLCPEIPDATTVCKSDASCTAGSAGTHSNGVSTGRCVAFNGSVKTCE VAAWCPVEDDTHVPQPAFLKAAENFTLLVKNNIWYPKFNFSKRNILPNITTTYLKSCI YDAKTDPFCPIFRLGKIVENAGHSFQDMAVEGGIMGIQVNWDCNLDRAASLCLPRYSF RRLDTRDVEHNVSPGYNFRFAKYYRDLAGNEQRTLIKAYGIRFDIIVFGKAGKFDIIP TMINIGSGLALLGMATVLCDIIVLYCMKKRLYYREKKYKYVEDYEQGLASELDQ" BASE COUNT 328 a 389 c 376 g 296 t ORIGIN 1 gggggactgg gagcgggcgg cgcggccatg gcgggctgct gctccgcgct ggcggccttc 61 ctgttcgagt acgacacgcc gcgcatcgtg ctcatccgca gccgcaaagt ggggctcatg 121 aaccgcgccg tgcaactgct catcctggcc tacgtcatcg ggtgggtgtt tgtgtgggaa 181 aagggctacc aggaaactga ctccgtggtc agctccgtta cgaccaaggt caagggcgtg 241 gctgtgacca acacttctaa acttggattc cggatctggg atgtggcgga ttatgtgata 301 ccagctcagg aggaaaactc cctcttcgtc atgaccaacg tgatcctcac catgaaccag 361 acacagggcc tgtgccccga gattccagat gcgaccactg tgtgtaaatc agatgccagc 421 tgtactgccg gctctgccgg cacccacagc aacggagtct caacaggcag gtgcgtagct 481 ttcaacgggt ccgtcaagac gtgtgaggtg gcggcctggt gcccggtgga ggatgacaca 541 cacgtgccac aacctgcttt tttaaaggct gcagaaaact tcactctttt ggttaagaac 601 aacatctggt atcccaaatt taatttcagc aagaggaata tccttcccaa catcaccact 661 acttacctca agtcgtgcat ttatgatgct aaaacagatc ccttctgccc catattccgt 721 cttggcaaaa tagtggagaa cgcaggacac agtttccagg acatggccgt ggagggaggc 781 atcatgggca tccaggtcaa ctgggactgc aacctggaca gagccgcctc cctctgcttg 841 cccaggtact ccttccgccg cctcgataca cgggacgttg agcacaacgt atctcctggc 901 tacaatttca ggtttgccaa gtactacaga gacctggctg gcaacgagca gcgcacgctc 961 atcaaggcct atggcatccg cttcgacatc attgtgtttg ggaaggcagg gaaatttgac 1021 atcatcccca ctatgatcaa catcggctct ggcctggcac tgctaggcat ggcgaccgtg 1081 ctgtgtgaca tcatagtcct ctactgcatg aagaaaagac tctactatcg ggagaagaaa 1141 tataaatatg tggaagatta cgagcagggt cttgctagtg agctggacca gtgaggccta 1201 ccccacacct gggctctcca cagccccatc aaagaacaga gaggaggagg agggagaaat 1261 ggccaccaca tcaccccaga gaaatttctg gaatctgatt gagtctccac tccacaagca 1321 ctcagggttc cccagcagct cctgtgtgtt gtgtgcagga tctgtttgcc cactcggccc 1381 aggaggtca // LOCUS HSP2X7 1853 bp RNA PRI 14-MAR-1997 DEFINITION H.sapiens mRNA for P2X7 receptor. ACCESSION Y09561 NID g1854511 KEYWORDS ATP ligand gated cationic channel; extracellular; P2X7 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1853) AUTHORS Rassendren,F., Buell,G., Virginio,C., North,R.A. and Surprenant,A. TITLE The permeabilizing ATP receptor (P2X7) : Cloning and expression of human cDNA JOURNAL Unpublished REFERENCE 2 (bases 1 to 1853) AUTHORS Buell,G.N. TITLE Direct Submission JOURNAL Submitted (21-NOV-1996) G.N. Buell, Geneva Biomedical Research Institute, Molecular Biology, 14 chemin des Aulx, 1228 Plan-les-Ouates, Geneva, SWITZERLAND FEATURES Location/Qualifiers source 1..1853 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /dev_stage="adult" gene 27..1814 /gene="P2X7" CDS 27..1814 /gene="P2X7" /codon_start=1 /product="ATP receptor" /db_xref="PID:e303906" /db_xref="PID:g1854512" /translation="MPACCSCSDVFQYETNKVTRIQSMNYGTIKWFFHVIIFSYVCFA LVSDKLYQRKEPVISSVHTKVKGIAEVKEEIVENGVKKLVHSVFDTADYTFPLQGNSF FVMTNFLKTEGQEQRLCPEYPTRRTLCSSDRGCKKGWMDPQSKGIQTGRCVVHEGNQK TCEVSAWCPIEAVEEAPRPALLNSAENFTVLIKNNIDFPGHNYTTRNILPGLNITCTF HKTQNPQCPIFRLGDIFRETGDNFSDVAIQGGIMGIEIYWDCNLDRWFHHCHPKYSFR RLDDKTTNVSLYPGYNFRYAKYYKENNVEKRTLIKVFGIRFDILVFGTGGKFDIIQLV VYIGSTLSYFGLAAVFIDFLIDTYSSNCCRSHIYPWCKCCQPCVVNEYYYRKKCESIV EPKPTLKYVSFVDESHIRMVNQQLLGRSLQDVKGQEVPRPAMDFTDLSRLPLALHDTP PIPGQPEEIQLLRKEATPRSRDSPVWCQCGSCLPSQLPESHRCLEELCCRKKPGACIT TSELFRKLVLSRHVLQFLLLYQEPLLALDVDSTNSRLRHCAYRCYATWRFGSQDMADF AILPSCCRWRIRKEFPKSEGQYSGFKSPY" BASE COUNT 456 a 503 c 477 g 417 t ORIGIN 1 aaaacgcagg gagggaggct gtcaccatgc cggcctgctg cagctgcagt gatgttttcc 61 agtatgagac gaacaaagtc actcggatcc agagcatgaa ttatggcacc attaagtggt 121 tcttccacgt gatcatcttt tcctacgttt gctttgctct ggtgagtgac aagctgtacc 181 agcggaaaga gcctgtcatc agttctgtgc acaccaaggt gaaggggata gcagaggtga 241 aagaggagat cgtggagaat ggagtgaaga agttggtgca cagtgtcttt gacaccgcag 301 actacacctt ccctttgcag gggaactctt tcttcgtgat gacaaacttt ctcaaaacag 361 aaggccaaga gcagcggttg tgtcccgagt atcccacccg caggacgctc tgttcctctg 421 accgaggttg taaaaaggga tggatggacc cgcagagcaa aggaattcag accggaaggt 481 gtgtagtgca tgaagggaac cagaagacct gtgaagtctc tgcctggtgc cccatcgagg 541 cagtggaaga ggccccccgg cctgctctct tgaacagtgc cgaaaacttc actgtgctca 601 tcaagaacaa tatcgacttc cccggccaca actacaccac gagaaacatc ctgccaggtt 661 taaacatcac ttgtaccttc cacaagactc agaatccaca gtgtcccatt ttccgactag 721 gagacatctt ccgagaaaca ggcgataatt tttcagatgt ggcaattcag ggcggaataa 781 tgggcattga gatctactgg gactgcaacc tagaccgttg gttccatcac tgccatccca 841 aatacagttt ccgtcgcctt gacgacaaga ccaccaacgt gtccttgtac cctggctaca 901 acttcagata cgccaagtac tacaaggaaa acaatgttga gaaacggact ctgataaaag 961 tcttcgggat ccgttttgac atcctggttt ttggcaccgg aggaaaattt gacattatcc 1021 agctggttgt gtacatcggc tcaaccctct cctacttcgg tctggccgct gtgttcatcg 1081 acttcctcat cgacacttac tccagtaact gctgtcgctc ccatatttat ccctggtgca 1141 agtgctgtca gccctgtgtg gtcaacgaat actactacag gaagaagtgc gagtccattg 1201 tggagccaaa gccgacatta aagtatgtgt cctttgtgga tgaatcccac attaggatgg 1261 tgaaccagca gctactaggg agaagtctgc aagatgtcaa gggccaagaa gtcccaagac 1321 ctgcgatgga cttcacagat ttgtccaggc tgcccctggc cctccatgac acacccccga 1381 ttcctggaca accagaggag atacagctgc ttagaaagga ggcgactcct agatccaggg 1441 atagccccgt ctggtgccag tgtggaagct gcctcccatc tcaactccct gagagccaca 1501 ggtgcctgga ggagctgtgc tgccggaaaa agccgggggc ctgcatcacc acctcagagc 1561 tgttcaggaa gctggtcctg tccagacacg tcctgcagtt cctcctgctc taccaggagc 1621 ccttgctggc gctggatgtg gattccacca acagccggct gcggcactgt gcctacaggt 1681 gctacgccac ctggcgcttc ggctcccagg acatggctga ctttgccatc ctgcccagct 1741 gctgccgctg gaggatccgg aaagagtttc cgaagagtga agggcagtac agtggcttca 1801 agagtcctta ctgaagccag gcaccgtggc tcacgtctgt aatcccacct ttt // LOCUS HSP2XRCPR 2643 bp RNA PRI 25-JAN-1996 DEFINITION H.sapiens mRNA for ATP receptor. ACCESSION X83688 NID g1166437 KEYWORDS ATP receptor; P2X gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2643) AUTHORS Valera,S., Talabot,F., Evans,R.J., Gos,A., Antonarakis,S.E., Morris,M.A. and Buell,G.N. TITLE Characterization and chromosomal localization of a human P2X receptor from the urinary bladder JOURNAL Recept. Channels 3 (4), 283-289 (1995) MEDLINE 96430919 REFERENCE 2 (bases 1 to 2643) AUTHORS Talabot,F. TITLE Direct Submission JOURNAL Submitted (05-JAN-1995) F. Talabot, GLAXO Institute for Molecular Biology, 14 Chemin des Aulx, CH-1228 Plan-les-Ouates, Geneva, SWITZERLAND COMMENT Related sequence: X80477. FEATURES Location/Qualifiers source 1..2643 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="urinary bladder, smooth muscle" /clone_lib="lambda gt10" /chromosome="17" gene 174..1373 /gene="P2X" CDS 174..1373 /gene="P2X" /codon_start=1 /evidence=experimental /product="ATP receptor" /db_xref="PID:e133805" /db_xref="PID:g1166438" /translation="MARRFQEELAAFLFEYDTPRMVLVRNKKVGVIFRLIQLVVLVYV IGWVFLYEKGYQTSSGLISSVSVKLKGLAVTQLPGLGPQVWDVADYVFPAQGDNSFVV MTNFIVTPKQTQGYCAEHPEGGICKEDSGCTPGKAKRKAQGIRTGKCVAFNDTVKTCE IFGWCPVEVDDDIPRPALLREAENFTLFIKNSISFPRFKVNRRNLVEEVNAAHMKTCL FHKTLHPLCPVFQLGYVVQESGQNFSTLAEKGGVVGITIDWHCDLDWHVRHCRPIYEF HGLYEEKNLSPGFNFRFARHFVENGTNYRHLFKVFGIRFDILVDGKAGKFDIIPTMTT IGSGIGIFGVATVLCDLLLLHILPKRHYYKQKKFKYAEDMGPGAAERDLAATSSTLGL QENMRTS" BASE COUNT 609 a 792 c 701 g 541 t ORIGIN 1 gcctccagct gacctctggc tcctgtcctc tggctccacc tgcaccgccc tgctcttcct 61 aaggggccag gaagccccca gaagctctac catcgacgtg ggtggtggca cccggctcac 121 cctgagagca gagggcgtgc agggggctca gttctgagcc cagccggccc accatggcac 181 ggcggttcca ggaggagctg gccgccttcc tcttcgagta tgacaccccc cgcatggtgc 241 tggtgcgtaa taagaaggtg ggcgttatct tccgactgat ccagctggtg gtcctggtct 301 acgtcatcgg gtgggtgttt ctctatgaga agggctacca gacctcgagc ggcctcatca 361 gcagtgtctc tgtgaaactc aagggcctgg ccgtgaccca gctccctggc ctcggccccc 421 aggtctggga tgtggctgac tacgtcttcc cagcccaggg ggacaactcc ttcgtggtca 481 tgaccaattt catcgtgacc ccgaagcaga ctcaaggcta ctgcgcagag cacccagaag 541 ggggcatatg caaggaagac agtggctgta cccctgggaa ggccaagagg aaggcccaag 601 gcatccgcac gggcaagtgt gtggccttca acgacactgt gaagacgtgt gagatctttg 661 gctggtgccc cgtggaggtg gatgacgaca tcccgcgccc tgcccttctc cgagaggccg 721 agaacttcac tcttttcatc aagaacagca tcagctttcc acgcttcaag gtcaacaggc 781 gcaacctggt ggaggaggtg aatgctgccc acatgaagac ctgcctcttt cacaagaccc 841 tgcaccccct gtgcccagtc ttccagcttg gctacgtggt gcaagagtca ggccagaact 901 tcagcaccct ggctgagaag ggtggagtgg ttggcatcac catcgactgg cactgtgacc 961 tggactggca cgtacggcac tgcagaccca tctatgagtt ccatgggctg tacgaagaga 1021 aaaatctctc cccaggcttc aacttcaggt ttgccaggca ctttgtggag aacgggacca 1081 actaccgtca cctcttcaag gtgtttggga ttcgctttga catcctggtg gacggcaagg 1141 ccgggaagtt tgacatcatc cctacaatga ccaccatcgg ctctggaatt ggcatctttg 1201 gggtggccac agttctctgt gacctgctgc tgcttcacat cctgcctaag aggcactact 1261 acaagcagaa gaagttcaaa tacgctgagg acatggggcc aggggcggct gagcgtgacc 1321 tcgcagctac cagctccacc ctgggcctgc aggagaacat gaggacatcc tgatgctcgg 1381 gccccaactc ctgactgggt gcagcgtgag gcttcagcct ggagccctgg tgggtcccag 1441 ccagggcaga ggggcctccc caggaagtct cctaccctct cagccaggca gagagcagtt 1501 tgccagaagc tcagggtgca tagtaggaga gacctgtgca aatctgagct ccggctccga 1561 ccccacacac cctgagggag gcctacccta gcctcagccg ctcctggtgg gggaatggct 1621 gggggttggg caggaccctc ccacacacct gcaccctagc ttcgtgcttc tctctccgga 1681 ctctcattat ccaacccgct gcctccattt ctctagatct gtgctctccg atgtggcagt 1741 cagtaaccat aggtgactaa attaaactaa aataaaatag aatgaaacac aaaattcaat 1801 tcctcggctg aactagccac atttcaactg ctcagtagat acgtgtggtt agtggctgcc 1861 atactggaca gctcggggca ttttcactgt caaagaaagt tctattagac agccctgctt 1921 gagccctgtt tcttcctggc ttcggtttcc ctggggaact tatcgacaat gcaagctcct 1981 gggcccaccc ccagacctcc tgaaccaaaa gctccagggc tggccgtatg atctgtgtgg 2041 atggcaaact ccccaggcca ttctgggacc taagtttaag aagtgccgtc ctcgaacttt 2101 ctgactctaa gctcctgagc gggagtcaga cttagccctg agcctgcact tcctgttcag 2161 gtgcagacac tgaacagggt ctcaaacacc ttcagcatgt gtgttgtgtg ctcacgtgcc 2221 acacagtgtc tcatgcacac aacccagtgt acacaccacc tacgtgcaca cagcatcctt 2281 ccacactgtg tatgtgaaca gcttgggccc tgcaaacaca accatctaca cacatctaca 2341 cccccaagca cacacacatg gtccgtgcca tgtcacctcc atagggaaag gcttctctcc 2401 aagtgtgcca ggccaggaca gccctcccag ccatgaatcc ttactcagct acctcgggtt 2461 ggggtgggag ccccagccaa atcctgggct ccctgcctgt ggctcagccc cagctcccaa 2521 ggcctgcctg gctctgtctg aacagaaggt ctgggggaag cgaggggtgg agtacaataa 2581 agggaatgag gacaaacaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2641 aaa // LOCUS HSP2Y6 1571 bp RNA PRI 03-MAY-1996 DEFINITION H.sapiens mRNA for P2Y6 receptor. ACCESSION X97058 NID g1296659 KEYWORDS G-coupled nucleotide receptor; P2Y6 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1571) AUTHORS Communi,D., Parmentier,M. and Boeynaems,J.M. TITLE Cloning, functional expression and tissue distribution of the human P2Y6 receptor JOURNAL Unpublished REFERENCE 2 (bases 1 to 1571) AUTHORS Communi,D.B.C. TITLE Direct Submission JOURNAL Submitted (02-APR-1996) D.B.C. Communi, Institute of Interdisciplinary Research., U.L.B., Building C (local C5-145), Campus Erasme,Routede Lennik 808, B-1070 Brussels, BELGIUM REMARK Revised by submittor 26-APR-1996 FEATURES Location/Qualifiers source 1..1571 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda gt11" gene 277..1263 /gene="P2Y6" CDS 277..1263 /gene="P2Y6" /note="Author-given protein sequence is in conflict with the conceptual translation; expressed in placenta" /codon_start=1 /product="G-coupled nucleotide receptor" /db_xref="PID:e236011" /db_xref="PID:g1296660" /translation="MEWDNGTGQALGLPPTTCVYRENFKQLLLPPVYSAVLAAGLPLN ICVITQICTSRRALTRTAVYNLNLALADLLYACSLPLLIYNYAQGDHWPFGDFACRLV RFLFYANLHGSILFLTCISFQRYLGICHPLAPWHKRGGRRAAWLVCVAVWLAVTTQCL PTAIFAATGIQRNRTVCYDLSPPALATHYMPYGMALTVIGFLLPFAALLACYCLLACR LCRQDGPAEPVAQERRGKAARMAVVVAAAFAISFLPFHITKTAYLAVRSTPGVPCTVL EAFAAAYKGTRPFASANSVLDPILFYFTQKKFRRRPHELLQKLTAKWQRQGR" BASE COUNT 301 a 531 c 415 g 324 t ORIGIN 1 ctcagtttcc tcatctgctg cctctccaga cttctgccag aacattgcac gcgacagttt 61 caggcacaga actgactggc agcaggggct gctccacgag tgggaatttg ctccagcact 121 tcacggactg caagcgaggc acttgctaac tcttggataa caagacctct gccagaagaa 181 ccatggcttt ggaaggcgga gttcaggctg aggagatggg tgcggtcctc agtgagcccc 241 tgcctccctg aacataggaa acccacctgg gcagccatgg aatgggacaa tggcacaggc 301 caggctctgg gcttgccacc caccacctgt gtctaccgcg agaacttcaa gcaactgctg 361 ctgccacctg tgtattcggc ggtgctggcg gctggcctgc cgctgaacat ctgtgtcatt 421 acccagatct gcacgtcccg ccgggccctg acccgcacgg ccgtgtacac cctaaacctt 481 gctctggctg acctgctata tgcctgctcc ctgcccctgc tcatctacaa ctatgcccaa 541 ggtgatcact ggccctttgg cgacttcgcc tgccgcctgg tccgcttcct cttctatgcc 601 aacctgcacg gcagcatcct cttcctcacc tgcatcagct tccagcgcta cctgggcatc 661 tgccacccgc tggccccctg gcacaaacgt gggggccgcc gggctgcctg gctagtgtgt 721 gtagccgtgt ggctggccgt gacaacccag tgcctgccca cagccatctt cgctgccaca 781 ggcatccagc gtaaccgcac tgtctgctat gacctcagcc cgcctgccct ggccacccac 841 tatatgccct atggcatggc tctcactgtc atcggcttcc tgctgccctt tgctgccctg 901 ctggcctgct actgtctcct ggcctgccgc ctgtgccgcc aggatggccc ggcagagcct 961 gtggcccagg agcggcgtgg caaggcggcc cgcatggccg tggtggtggc tgctgccttt 1021 gccatcagct tcctgccttt tcacatcacc aagacagcct acctggcagt gcgctcgacg 1081 ccgggcgtcc cctgcactgt attggaggcc tttgcagcgg cctacaaagg cacgcggccg 1141 tttgccagtg ccaacagcgt gctggacccc atcctcttct acttcaccca gaagaagttc 1201 cgccggcgac cacatgagct cctacagaaa ctcacagcca aatggcagag gcagggtcgc 1261 tgagtcctcc aggtcctggg cagccttcat atttgccatt gtgtccgggg caccaggagc 1321 cccaccaacc ccaaaccatg cggagaatta gagttcagct cagctgggca tggagttaag 1381 atccctcaca ggacccagaa gctcaccaaa aactatttct tcagcccctt ctctggccca 1441 gaccctgtgg gcatggagat ggacagacct gggcctggct cttgagaggt cccagtcagc 1501 catggagagc tggggaaacc acattaaggt gctcacaaaa atacagtgtg acgtgtactg 1561 tcaaaaaaaa a // LOCUS HSP2YLG 2070 bp RNA PRI 06-JAN-1998 DEFINITION H.sapiens mRNA for P2Y-like G-protein coupled receptor. ACCESSION Y12546 NID g2687818 KEYWORDS G-protein coupled receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2070) AUTHORS Blaesius,R.H., Weber,R.G., Lichter,P. and Ogilvie,A. TITLE A novel orphan G-protein coupled receptor primarily expressed in the brain is localized on human chromosomal band 2q21 JOURNAL J. Neurochem. In press REFERENCE 2 (bases 1 to 2070) AUTHORS Blaesius,R.H. TITLE Direct Submission JOURNAL Submitted (14-APR-1997) R.H. Blaesius, Universitaet Erlangen- Nuernberg, Institut fuer Biochemie, Fahrstr. 17, Erlangen, D-91054, FRG FEATURES Location/Qualifiers source 1..2070 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="foetus" mRNA join(<1..>2070) exon <1..71 /number=1 exon 72..137 /number=2 CDS 75..1178 /codon_start=1 /product="P2Y-like G-protein coupled receptor" /db_xref="PID:e322207" /db_xref="PID:g2687819" /translation="MSKRSWWAGSRKPPREMLKLSGSDSSQSMNGLEVAPPGLITNFS LATAEQCGQETPLENMLFASFYLLDFILALVGNTLALWLFIRDHKSGTPANVFLMHLA VADLSCVLVLPTRLVYHFSGNHWPFGEIACRLTGFLFYLNMYASIYFLTCISADRFLA IVHPVKSLKLRRPLYAHLACAFLWVVVAVAMAPLLVSPQTVQTNHTVVCLQLYREKAS HHALVSLAVAFTFPFITTVTCYLLIIRSLRQGLRVEKRLKTKAVRMIAIVLAIFLVCF VPYHVNRSVYVLHYRSHGASCATQRILALANRITSCLTSLNGALDPIMYFFVAEKFRH ALCNLLCGKRLKGPPPSFEGKTNESSLSAKSEL" exon 138..>2070 /number=3 BASE COUNT 421 a 700 c 527 g 422 t ORIGIN 1 ccgacaccca cgggcggaga tcacctgctg ccccgcagac ccctgtccct tcctcccgga 61 ccagcagcta gaggatgtcc aaacggagtt ggtgggctgg atccagaaag cccccaagag 121 agatgctgaa actctcaggc tctgactcca gccaaagcat gaatggcctt gaagtggctc 181 ccccaggtct gatcaccaac ttctccctgg ccacggcaga gcaatgtggc caggagacgc 241 cactggagaa catgctgttc gcctccttct accttctgga ttttatcctg gctttagttg 301 gcaataccct ggctctgtgg cttttcatcc gagaccacaa gtccgggacc ccggccaacg 361 tgttcctgat gcatctggcc gtggccgact tgtcgtgcgt gctggtcctg cccacccgcc 421 tggtctacca cttctctggg aaccactggc catttgggga aatcgcatgc cgtctcaccg 481 gcttcctctt ctacctcaac atgtacgcca gcatctactt cctcacctgc atcagcgccg 541 accgtttcct ggccattgtg cacccggtca agtccctcaa gctccgcagg cccctctacg 601 cacacctggc ctgtgccttc ctgtgggtgg tggtggctgt ggccatggcc ccgctgctgg 661 tgagcccaca gaccgtgcag accaaccaca cggtggtctg cctgcagctg taccgggaga 721 aggcctccca ccatgccctg gtgtccctgg cagtggcctt caccttcccg ttcatcacca 781 cggtcacctg ctacctgctg atcatccgca gcctgcggca gggcctgcgt gtggagaagc 841 gcctcaagac caaggcagtg cgcatgatcg ccatagtgct ggccatcttc ctggtctgct 901 tcgtgcccta ccacgtcaac cgctccgtct acgtgctgca ctaccgcagc catggggcct 961 cctgcgccac ccagcgcatc ctggccctgg caaaccgcat cacctcctgc ctcaccagcc 1021 tcaacggggc actcgacccc atcatgtatt tcttcgtggc tgagaagttc cgccacgccc 1081 tgtgcaactt gctctgtggc aaaaggctca agggcccgcc ccccagcttc gaagggaaaa 1141 ccaacgagag ctcgctgagt gccaagtcag agctgtgagc ggggggcgcc gtccaggccg 1201 agcgcagact gtttaggact cagcagaccc agcaagaggc atctgccctt tccccagcca 1261 cctccccagc aagcaacctg aaatctcagc agatgcccac catttctcta gatcgcctag 1321 tctcaaccca taaaaaggaa gaactgacaa aggggatcca tcggccaccc ctctgcaggg 1381 gcttgtgatg gctacaatgg ctcctagaca ctcaacgact tcatctgtgg cagggagaga 1441 ggaggccgga agaacaaccc ctgaacaatg gaggcctttc tttcccgcta ggctcccagc 1501 ctccttcccg ctacagaatc gctcatcggc gaggctcagc agaaagaccc tgaaggcagg 1561 ctgcaaatga cccagaagag ggacctggga gtcctggtgg ggacggggag ggagtctcaa 1621 tactcctttg cagcgcaagg tactctgagt cccctctgta gtgcctctgc cagacacaca 1681 ctgcctgagt tgaagagaca caggccacac atttcaggct ggttgccagc ggacgtcagc 1741 actcacggcc tgcggggact cagcacagct ctggattctg gatctctcct gctgtaaccc 1801 cacgcacaag cctgcaaccc ccagagctct ttgacaggct cccaggcctc ccagtcctgg 1861 acaagcatgt gcagtcacgg gagctcagct caggccaggg ctgggctgtg cacctgcctc 1921 ccactgaccc agacccactt cctccagaga ggcctctctc cgcctgagct atttcccttg 1981 ctagtgtgca gatatttccc taacatgtcc ttttttgtat ttgtttgtac ggaccataaa 2041 tataactgta gctttaagac taaaaaaaaa // LOCUS HSP35R 1097 bp RNA PRI 09-JAN-1995 DEFINITION H.sapiens p35 mRNA for regulatory subunit of cdk5 kinase. ACCESSION X80343 NID g558670 KEYWORDS cdk5 kinase; P35 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1097) AUTHORS Tsai,L.H., Delalle,I., Caviness,V.S. Jr., Chae,T. and Harlow,E. TITLE p35 is a neural-specific regulatory subunit of cyclin-dependent kinase 5 JOURNAL Nature 371 (6496), 419-423 (1994) MEDLINE 94376895 REFERENCE 2 (bases 1 to 1097) AUTHORS Tsai,L. TITLE Direct Submission JOURNAL Submitted (15-JUL-1994) L. Tsai, Massachusetts General Hospital, Cancer Center, Building 149, 13th Street, Charlestown MA 02129, USA FEATURES Location/Qualifiers source 1..1097 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="foetal brain" gene 98..1021 /gene="p35" CDS 98..1021 /gene="p35" /note="regulatory partner for cdk5 kinase" /codon_start=1 /db_xref="PID:g558671" /translation="MGTVLSLSPSYRKATLFEDGAATVGHYTAVQNSKNAKDKNLKRH SIISVLPWKRIVAVSAKKKNSKKVQPNSSYQNNITHLNNENLKKSLSCANLSTFAQPP PAQPPAPPASQLSGSQTGGSSSVKKAPHPAVTSAGTPKRVIVQASTSELLRCLGEFLC RRCYRLKHLSPTDPVLWLRSVDRSLLLQGWQDQGFITPANVVFLYMLCRDVISSEVGS DHELQAVLLTCLYLSYSYMGNEISYPLKPFLVESCKEAFWDRCLSVINLMSSKMLQIN ADPHYFTQVFSDLKNESGQEDKKRLLLGLDR" BASE COUNT 225 a 368 c 296 g 208 t ORIGIN 1 aaactcagaa ttttcgcggg ctcggtgagc ggttttatcc ctccggccgg caggctgggc 61 gcagggggcg agcccccgcc cggcgcgcag cagcaccatg ggcacggtgc tgtccctgtc 121 tcccagctac cggaaggcca cgctgtttga ggatggcgcg gccaccgtgg gccactatac 181 ggccgtacag aacagcaaga acgccaagga caagaacctg aagcgccact ccatcatctc 241 cgtgctgcct tggaagagaa tcgtggccgt gtcggccaag aagaagaact ccaagaaggt 301 gcagcctaac agcagctacc agaacaacat cacgcacctc aacaatgaga acctgaagaa 361 gtcgctgtcg tgcgccaacc tgtccacatt cgcccagccc ccaccggccc agccgcctgc 421 acccccggcc agccagctct cgggttccca gaccgggggc tcctcctcag tcaagaaagc 481 ccctcaccct gccgtcacct ccgcagggac gcccaaacgg gtcatcgtcc aggcgtccac 541 cagtgagctg cttcgctgcc tgggtgagtt tctctgccgc cggtgctacc gcctgaagca 601 cctgtccccc acggaccccg tgctctggct gcgcagcgtg gaccgctcgc tgcttctgca 661 gggctggcag gaccagggct tcatcacgcc ggccaacgtg gtcttcctct acatgctctg 721 cagggatgtt atctcctccg aggtgggctc ggatcacgag ctccaggccg tcctgctgac 781 atgcctgtac ctctcctact cctacatggg caacgagatc tcctacccgc tcaagccctt 841 cctggtggag agctgcaagg aggccttttg ggaccgttgc ctctctgtca tcaacctcat 901 gagctcaaag atgctgcaga taaatgccga cccacactac ttcacacagg tcttctccga 961 cctgaagaac gagagcggcc aggaggacaa gaagcggctc ctcctaggcc tggatcggtg 1021 agcactgtag cctgcgtcat ggctcaagga ttcaatgcat ttttaagaat ttattattaa 1081 atcagttttg tgtacag // LOCUS HSP38A20 1876 bp RNA PRI 04-DEC-1997 DEFINITION Homo sapiens cDNA similar to RNA binding protein C. elegans, complete. ACCESSION AL009266 NID g2664428 KEYWORDS RNA binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1876) AUTHORS Collins,J.E. and Burton,J. TITLE Direct Submission JOURNAL Submitted (18-NOV-1997) E-mail contact: humquery@sanger.ac.uk Clone requests:clonerequest@sanger.ac.uk COMMENT This sequence was generated from a cDNA clone isolated using bacterial clone contigs and genomic sequence of human chromosome 22, generated by the Sanger Centre chromosome 22 mapping and sequencing groups. All matches to EMBL sequences shown 90% or more. Further information can be found at http://www.sanger.ac.uk/HGP/Chr22/. FEATURES Location/Qualifiers source 1..1876 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="22" /tissue_type="placental" /map="q12-13" exon 1..193 /note="match Z81369 genomic clone 106I20" /number=1 misc_feature join(12..223,221..271,270..293) /note="match: 3' EST D81973 clone c-2la05" misc_feature join(33..423,934..981) /note="match: 5' EST N56061 clone J6510" misc_feature 159..299 /note="match: 5' EST R75495 clone 3NHC5171" misc_feature 159..477 /note="match: 5' EST AA244540 clone 679245" CDS 171..1274 /codon_start=1 /product="hypothetical protein" /db_xref="PID:e1202792" /db_xref="PID:g2664429" /translation="MEKKKMVTQGNQEPTTTPDAMVQPFTTIPFPPPPQNGIPTEYGV PHTQDYAGQTGEHNLTLYGSTQAHGEQSSNSPSTQNGSLTQTEGGAQTDGQQSQTQSS ENSESKSTPKRLHVSNIPFRFRDPDLRQMFGQFGKILDVEIIFNERGSKGFGFVTFEN SADADRAREKLHGTVVEGRKIEVNNATARVMTNKKMVTPYANGWKLSPVVGAVYGPEL YAASSFQADVSLGNDAAVPLSGRGGINTYIPLIIPGFPYPTAATTAAAFRGAHLRGRG RTVYGAVRAVPPTAIPAYPGVDMQPTDMHSLLLQPQPPLLQPLQPLTVTVMAGCTQPT PTMPLPLPLAMELALWRVYTEVATADLPPTEVT" exon 194..418 /note="match Z81369 genomic clone 106I20" /number=2 misc_feature join(195..422,425..557) /note="match: 3' EST AA577482 clone IMAGE:1075648" misc_feature join(195..333,332..388,388..422,425..524) /note="match: 5' EST H66074 clone 210608" misc_feature join(195..379,375..422,425..569) /note="match: 3' EST N23176 clone 267670" misc_feature join(195..335,336..422,425..502) /note="match: 3' EST W61351 clone 341872" misc_feature 195..422 /note="match: 3' EST Z44044 clone c-1rc07" misc_feature join(195..287,279..301,297..379,375..412,418..442, 447..464) /note="match: 3' EST N72967 clone 291820" misc_feature join(195..422,425..459) /note="match: 5' EST R98096 clone 206827" misc_feature 195..455 /note="match: 3' EST AA280296 clone IMAGE:712715" misc_feature join(357..696,695..853) /note="match: 5' EST W75346 clone 390877" exon 419..565 /note="match Z81357 genomic clone 41P2" /number=3 misc_feature complement(join(425..551,552..927,925..1033)) /note="match: 5' EST AA402524 clone 741203" misc_feature 425..534 /note="match: 3' EST AA362936 clone IMAGE:1010377" misc_feature join(425..645,641..754) /note="match: 5' EST AA081998 clone 548575" misc_feature complement(join(530..551,552..927,925..1033)) /note="=" exon 566..619 /note="match Z81357 genomic clone 41P2" /number=4 exon 620..712 /note="match Z81357 genomic clone 41P2" /number=5 misc_feature 678..714 /note="match: 3' EST Z25303 clone B7F03" misc_feature complement(join(678..733,733..847,925..1033)) /note="match: 5' EST H53695 clone 236125" misc_feature 695..745 /note="match: 3' EST Z25293 clone B6B04" exon 713..773 /note="match Z81357 genomic clone 41P2" /number=6 misc_feature join(738..1070,675..738,1058..1113) /note="match: 5' EST AA253750 clone 669831" exon 774..827 /note="match Z81357 genomic clone 41P2" /number=7 misc_feature 800..1070 /note="match: 3' EST M85812 clone HFBCO27" exon 828..920 /note="match Z81357 genomic clone 41P2" /number=8 misc_feature 829..926 /note="match: 3' EST H12785 clone 148765" exon 921..1053 /note="match Z81357 genomic clone 41P2" /number=9 misc_feature complement(join(925..956,952..1070,1110..1238,1239..1330)) /note="match: 5' EST N32641 clone 267670" misc_feature complement(join(937..1070,1058..1099,1145..1238, 1239..1352)) /note="match: 5' EST W61350 clone 341872" misc_feature 975..1289 /note="match: 3' EST C16002 clone 341872" misc_feature complement(join(997..1025,1057..1395)) /note="match: 3' EST N95026 clone 306551" misc_feature complement(join(1000..1022,1032..1059,1053..1111, 1099..1356)) /note="match: 3' EST H66029 clone 210608" misc_feature complement(join(1043..1072,1099..1383)) /note="match: 3' EST R98097 clone 206827" misc_feature complement(join(1053..1094,1105..1238,1239..1363)) /note="match: 3' EST T97163 clone 121169" exon 1054..1142 /note="match Z81357 genomic clone 41P2" /number=10 misc_feature complement(join(1057..1092,1239..1394)) /note="match: 3' EST AA101104 clone 548575" misc_feature complement(join(1058..1238,1239..1409)) /note="match: 5' EST AA280574 clone IMAGE:712715" misc_feature join(1058..1185,891..1070) /note="match: 5' EST AA003706 clone 437454" misc_feature complement(join(1058..1093,1110..1212,1210..1238, 1239..1383)) /note="match: 3' EST AA180436 clone 612865" misc_feature complement(join(1110..1129,1201..1227,1231..1313, 1310..1535)) /note="match: 3' EST H17976 clone 50414" misc_feature complement(1119..1213) /note="match: 3' EST T86607 clone 115242" misc_feature join(1135..1191,1058..1136,1182..1372) /note="match: 5' EST AA179762 clone 612865" exon 1143..1215 /note="match Z81314 genomic clone 41P2" /number=11 exon 1216..1863 /note="match Z81314 genomic clone 41P2" /number=12 misc_feature complement(join(1219..1238,1239..1527)) /note="match: 3' EST AA451903 clone 786673" misc_feature complement(1230..1533) /note="match: 3' EST R45356 clone 35755" misc_feature complement(1232..1524) /note="match: 3' EST F04160 clone c-2lb07" misc_feature complement(1243..1524) /note="match: 3' EST F03257 clone c-1rc07" misc_feature complement(1251..1524) /note="match: 3' EST F04190 clone c-2lh05" misc_feature complement(1259..1524) /note="match: 3' EST F04155 clone c-2la05" misc_feature complement(1364..1533) /note="match: 3' EST AA230027 clone IMAGE:1010377" misc_feature 1384..1539 /note="match: 3' EST C05291 clone 3NHC5171" misc_feature 1624..1870 /note="match: 3' EST T07396 clone HFBEI40" misc_feature join(1802..1870,1774..1801) /note="match: 5' EST AA418165 clone 767460" BASE COUNT 555 a 425 c 455 g 441 t ORIGIN 1 ctctaaagaa gaaacattag aaagaaaaag gaaggaaaac ggtataaaga gagatcaatt 61 acccaccctt aaatagctag attggggggg gaggggggtg gaaaagaaag ctgtggaggt 121 gtgccccagc acggctgctt tgaaaggttt atcatctatc cgtttggttt atggagaaaa 181 agaaaatggt aactcagggt aaccaggagc cgacaacaac tcctgacgca atggttcagc 241 cttttactac catcccattt ccaccacctc cgcagaatgg aattcccaca gagtatgggg 301 tgccacacac tcaagactat gccggccaga ccggtgagca taacctgaca ctctacggaa 361 gtacgcaagc ccacggggag cagagcagca actcacccag cacacaaaat ggatctctta 421 cgcagacaga aggtggagca cagacagacg gccagcagtc acagacacaa agtagtgaaa 481 attcagagag taaatctacc ccgaaacggc tgcatgtctc taatattcct ttccgcttcc 541 gggaccctga cctccggcag atgtttgggc agtttggcaa aatcctagat gtagaaataa 601 tctttaatga acgtggctct aagggattcg ggttcgtaac tttcgagaat agtgctgatg 661 cagacagggc cagggagaaa ttacacggca ccgtggtaga gggccgtaaa atcgaggtga 721 ataatgctac agcacgtgta atgaccaata agaagatggt cacaccatat gcaaatggtt 781 ggaaattaag cccagtagtt ggagctgtat atggtccgga gttatatgca gcatccagct 841 ttcaagcaga tgtgtcccta ggcaatgatg cagcagtgcc cctatcagga agagggggta 901 tcaacactta cattccttta atcattcctg gcttccctta ccctactgca gccaccacgg 961 cagccgcttt cagaggagcc catttgaggg gcagagggcg gacagtatat ggtgcagtcc 1021 gagcggtacc tccaacagcc atccccgcct atccaggggt ggatatgcag cctacagata 1081 tgcacagcct gctactgcaa ccgcagccac cgctgctgca gccgctgcag ccgcttacag 1141 tgacggttat ggcagggtgt acacagccga cccctaccat gcccttgccc ctgccgctag 1201 ctatggagtt ggcgctgtgg cgagtttata ccgaggtggc tacagccgat ttgcccccta 1261 ctgaagtgac gtgagacccc tgcaaatggg acagcccccc agttcatgag gcctggctat 1321 tgcaatattt actagtagag gaactctata gcaagatgaa gaggaaaaac aaacaaacaa 1381 acaaaaaaaa acacaaaaaa agaaagaata cttttttata cctcactatg ttctttgaat 1441 atgtattttt cctttaaatt tctgccttta attcttttgt tccaaagatt gtgcattttt 1501 ttcttttttt ttttaaactg tggtaaaaaa aaaaaaaaat aatgcatttc catgtctgta 1561 tgtccgtgct tagcttattc tatcaatcac ggaagaggca gtcaaggagg aaggagagac 1621 attaggagcc gataaatgca tctgatcaga aatcagcaga cagaattacc aaagtgtatc 1681 tggtgctgaa tgactggggg acaagcagaa gtggaagaga tctttctgca acaggatatt 1741 cttctagtct tctgagtttc tggtctttga caggcaattc tggttggctg tggctggaat 1801 ccacatgctg atagatagga atttgtgctt acaaagcagg agaattaaaa agacgctttc 1861 ctctcctcct ttagag // LOCUS HSP40PHOX 1245 bp RNA PRI 07-MAR-1994 DEFINITION H.sapiens mRNA for p40phox. ACCESSION X77094 NID g458543 KEYWORDS p40phox gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1245) AUTHORS Wientjes,F.B., Hsuan,J.J., Totty,N.F. and Segal,A.W. TITLE p40phox, a third cytosolic component of the activation complex of the NADPH oxidase to contain src homology 3 domains JOURNAL Biochem. J. 296 (Pt 3), 557-561 (1993) MEDLINE 94107216 REFERENCE 2 (bases 1 to 1245) AUTHORS Wientjes,F.B. TITLE Direct Submission JOURNAL Submitted (23-FEB-1994) Department of Medicine,University College London,Rayne Institute, 5 University Street, London WC1E 6JJ, UK FEATURES Location/Qualifiers source 1..1245 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="retinoic acid-induced" /cell_type="HL60" /cell_line="HL60" /clone_lib="lambda gt10" gene 131..1150 /gene="p40phox" CDS 131..1150 /gene="p40phox" /codon_start=1 /product="p40phox protein" /db_xref="PID:g458544" /translation="MAVAQQLRAESDFEQLPDDVAISANIADIEEKRGFTSHFVFVIE VKTKGGSKYLIYRRYRQFHALQSKLEERFGPDSKSSALACTLPTLPAKVYVGVKQEIA EMRIPALNAYMKSLLSLPVWVLMDEDVRIFFYQSPYDSEQVPQAIRRLRPRTRKVKSV SPQGNSVDRMAAPRAEALFDFTGNSKLELNFKAGDVIFLLSRINKDWLEGTVRGATGI FPLSFVKILKDFPEEDDPTNWLRCYYYEDTISTIKDIAVEEDLSSTPLLKDLLELTRR EFQREDIALNYRDAEGDLVRLLSDEDVALMVRQARGLPSQKRLFPWKLHITQKDNYRV YNTMP" BASE COUNT 282 a 378 c 355 g 230 t ORIGIN 1 ggggagagtc tcgcccagcc agtcccaggc tgagttcacc tctcacttcc tccagccacc 61 ctgcgtctcg tctcagcctg ggactggctg ggcgagactc tccacctgct ccctgggacc 121 atcgcccacc atggctgtgg cccagcagct gcgggccgag agtgactttg aacagcttcc 181 ggatgatgtt gccatctcgg ccaacattgc tgacatcgag gagaagagag gcttcaccag 241 ccactttgtt ttcgtcatcg aggtgaagac aaaaggagga tccaagtacc tcatctaccg 301 ccgctaccgc cagttccatg ctttgcagag caagctggag gagcgcttcg ggccagacag 361 caagagcagt gccctggcct gtaccctgcc cacactccca gccaaagtct acgtgggtgt 421 gaaacaggag atcgccgaga tgcggatacc tgccctcaac gcctacatga agagcctgct 481 cagcctgccg gtctgggtgc tgatggatga ggacgtccgg atcttctttt accagtcgcc 541 ctatgactca gagcaggtgc cccaggccat ccgccggctc cgcccgcgca cccggaaagt 601 caagagcgtg tccccacagg gcaacagcgt tgaccgcatg gcagctccga gagcagaggc 661 tctatttgac ttcactggaa acagcaaact ggagctgaat ttcaaagctg gagatgtgat 721 cttcctcctc agtcggatca acaaagactg gctggagggc actgtccggg gagccacggg 781 catcttccct ctctccttcg tgaagatcct caaagacttc cctgaggagg acgaccccac 841 caactggctg cgttgctact actacgaaga caccatcagc accatcaagg acatcgcggt 901 ggaggaagat ctcagcagca ctcccctatt gaaagacctg ctggagctca caaggcggga 961 gttccagaga gaggacatag ctctgaatta ccgggacgct gagggggatc tggttcggct 1021 gctgtcggat gaggacgtag cgctcatggt gcggcaggct cgtggcctcc cctcccagaa 1081 gcgcctcttc ccctggaagc tgcacatcac gcagaaggac aactacaggg tctacaacac 1141 gatgccatga gctgacggtg tccctggagc agtgagggga caccagcaaa aaccttcagc 1201 tctcagagga gattgggacc aggaaaacct gggaggatgg gcaga // LOCUS HSP450P2 2130 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for cytochrome P-450HP. ACCESSION X16699 NID g35204 KEYWORDS cytochrome; cytochrome P450; haemprotein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2130) AUTHORS Yokotani,N. TITLE Direct Submission JOURNAL Submitted (17-OCT-1989) Yokotani N., Department of Chemistry, Faculty of Science, Tohoku University, Sendai 980, Japan REFERENCE 2 (bases 1 to 2130) AUTHORS Yokotani,N., Sogawa,K., Matsubara,S., Gotoh,O., Kusunose,E., Kusunose,M. and Fujii-Kuriyama,Y. TITLE cDNA cloning of cytochrome P-450 related to P-450p-2 from the cDNA library of human placenta. Gene structure and expression JOURNAL Eur. J. Biochem. 187 (1), 23-29 (1990) MEDLINE 90126824 COMMENT Data kindly reviewed (10-MAR-1990) by Yokotani N. FEATURES Location/Qualifiers source 1..2130 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="lambda gt11" CDS 1..1536 /note="cytochrome P-450HP (AA 1-511)" /codon_start=1 /db_xref="PID:g35205" /db_xref="SWISS-PROT:P13584" /translation="MVPSFLSLSFSSLGLWASGLILVLGFLKLIHLLLRRQTLAKAMD KFPGPPTHWLFGHALEIQETGSLDKVVSWAHQFPYAHPLWFGQFIGFLNIYEPDYAKA VYSRGDPKAPDVYDFFLQWIGRGLLVLEGPKWLQHRKLLTPGFHYDVLKPYVAVFTES TRIMLDKWEEKAREGKSFDIFCDVGHMALNTLMKCTFGRGDTGLGHRDSSYYLAVSDL TLLMQQRLVSFQYHNDFIYWLTPHGRRFLRACQVAHDHTDQVIRERKAALQDEKVRKK IQNRRHLDFLDILLGARDEDDIKLSDADLRAEVDTFMFEGHDTTTSGISWFLYCMALY PEHQHRCREEVREILGDQDFFQWDDLGKMTYLTMCIKESFRLYPPVPQVYRQLSKPVT FVDGRSLPAGSLISMHIYALHRNSAVWPDPEVFDSLRFSTENASKRHPFAFMPFSAGP RNCIGQQFAMSEMKVVTAMCLLRFEFSLDPSRLPIKMPQLVLRSKNGFHLHLKPLGPG SGK" misc_feature 180..181 /note="exon 1/2 splice junction" misc_feature 322..323 /note="exon 2/3 splice junction" misc_feature 367..368 /note="exon 3/4 splice junction" misc_feature 495..496 /note="exon 4/5 splice junction" misc_feature 772..773 /note="exon 5/6 splice junction" misc_feature 879..880 /note="exon 6/7 splice junction" misc_feature 1070..1071 /note="exon 7/8 splice junction" misc_feature 1204..1205 /note="exon 8/9 splice junction" misc_feature 1269..1270 /note="exon 9/10 splice junction" misc_feature 1352..1353 /note="exon 10/11 splice junction" misc_feature 2047..2052 /note="put.polyA signal" BASE COUNT 482 a 565 c 550 g 533 t ORIGIN 1 atggtgccca gcttcctctc cctgagcttc tcctccttgg gcctgtgggc ttctgggctg 61 atcttggtct taggctttct caagctcatc cacctgctgc tgcggaggca gacgttggct 121 aaggctatgg acaaattccc agggcctccc acccactggc tttttggaca tgccctcgag 181 atccaggaga cggggagcct ggacaaagtg gtgtcctggg cccaccagtt cccgtatgcc 241 cacccactct ggttcggaca gttcattggc ttcctgaaca tctatgagcc tgactatgcc 301 aaagctgtgt acagccgtgg ggaccctaag gcccctgatg tgtatgactt cttcctccag 361 tggattggga gaggcctgct ggttcttgag gggcccaagt ggttgcagca ccgcaagctg 421 ctcacacctg gctttcatta tgatgtgctg aagccctatg tggccgtgtt cactgagtct 481 acacgtatca tgctggacaa gtgggaagag aaagctcggg agggtaagtc ctttgacatc 541 ttctgcgatg tgggtcacat ggcgctgaac acactcatga agtgcacctt tggaagagga 601 gacaccggcc tgggccacag ggacagcagc tactaccttg cagtcagcga tctcactctg 661 ttgatgcagc agcgccttgt gtccttccag taccataatg acttcatcta ctggctcacc 721 ccacatggcc gccgcttcct gcgggcctgc caggtggccc atgaccatac agaccaggtc 781 atcagggagc ggaaggcagc cctgcaggat gagaaggtgc ggaagaagat ccagaaccgg 841 aggcacctgg acttcctgga cattctcctg ggtgcccggg atgaagatga catcaaactg 901 tcagatgcag acctccgggc tgaagtggac acattcatgt ttgaaggcca tgacaccacc 961 accagtggta tctcctggtt tctctactgc atggccctgt accctgagca ccagcatcgt 1021 tgtagagagg aggtccgcga gatcctaggg gaccaggact tcttccagtg ggatgatctg 1081 ggcaaaatga cttatctgac catgtgcatc aaggagagct tccgcctcta cccacctgtg 1141 ccccaggtgt accgccagct cagcaagcct gtcacctttg tggatggccg gtctctacct 1201 gcaggaagcc tgatctctat gcatatctat gccctccata ggaacagtgc tgtatggccc 1261 gaccctgagg tctttgactc tctgcgcttt tccactgaga atgcatccaa acgccatccc 1321 tttgccttta tgcccttctc tgctgggccc aggaactgca ttgggcagca gtttgccatg 1381 agtgagatga aggtggtcac agccatgtgc ttgctccgct ttgagttctc tctggacccc 1441 tcacggctgc ccatcaagat gccccagctt gtcctgcgct ccaagaatgg ctttcacctc 1501 cacctgaagc cactgggccc tgggtctggg aagtagctct gatgagaatg gggtcccaga 1561 tggctcaggc tgtgacctcc ctgggcacca ccctccccag gctgggtgtg gaggagttgg 1621 ggccccctgc cgttcagagc ttgtagttta gaagggaagt aggcattacc atagacgact 1681 cctagaggac agtgctatgt aaaaatgtgt gtctataatg tttatcatgc atgtattcta 1741 gagctcattc atttattcaa caaacatttg gtgagcacct atttcgttcg agaaacttca 1801 tttatctcct ataattggca aacttaaaaa tgcagcagaa acttacattc caaccttaga 1861 gactcatagt gagcacaagg aaagttttgc cctgagattc atggttatgg ctgggtacca 1921 ccaaatagaa gaatggctta ggggagtgcc ccttcacgag tgtgtttctt tgttgaactt 1981 tgtgtgtgtg tgtttagaat ataacagaca taagaaaaaa ttacctaaat gaagactgta 2041 caaaataata aataattctg aagcagactc tcttgtaacc atcactgaag tcaagaaata 2101 tggaatatgg tatcactgat ggtcttctgc // LOCUS HSP53ASSG 2372 bp RNA PRI 28-JUL-1992 DEFINITION H.sapiens mRNA for p53-associated gene. ACCESSION Z12020 NID g35211 KEYWORDS p53 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2372) AUTHORS Oliner,J.D., Kinzler,K.W., Meltzer,P.S., George,D.L. and Vogelstein,B. TITLE Amplification of a gene encoding a p53-associated protein in human sarcomas JOURNAL Nature 358 (6381), 80-83 (1992) MEDLINE 92310576 REFERENCE 2 (bases 1 to 2372) AUTHORS Kinzler,K. TITLE Direct Submission JOURNAL Submitted (29-APR-1992) Kinzler K., Johns Hopkins School of Medicine, 424 North Bond Street, Baltimore, MD, U.S.A COMMENT . FEATURES Location/Qualifiers source 1..2372 /organism="Homo sapiens" /db_xref="taxon:9606" gene 312..1787 /gene="p53 associated" CDS 312..1787 /gene="p53 associated" /codon_start=1 /db_xref="PID:g35212" /db_xref="SWISS-PROT:Q00987" /translation="MCNTNMSVPTDGAVTTSQIPASEQETLVRPKPLLLKLLKSVGAQ KDTYTMKEVLFYLGQYIMTKRLYDEKQQHIVYCSNDLLGDLFGVPSFSVKEHRKIYTM IYRNLVVVNQQESSDSGTSVSENRCHLEGGSDQKDLVQELQEEKPSSSHLVSRPSTSS RRRAISETEENSDELSGERQRKRHKSDSISLSFDESLALCVIREICCERSSSSESTGT PSNPDLDAGVSEHSGDWLDQDSVSDQFSVEFEVESLDSEDYSLSEEGQELSDEDDEVY QVTVYQAGESDTDSFEEDPEISLADYWKCTSCNEMNPPLPSHCNRCWALRENWLPEDK GKDKGEISEKAKLENSTQAEEGFDVPDCKKTIVNDSRESCVEENDDKITQASQSQESE DYSQPSTSSSIIYSSQEDVKEFEREETQDKEESVESSLPLNAIEPCVICQGRPKNGCI VHGKTGHLMACFTCAKKLKKRNKPCPVCRQPIQMIVLTYFP" BASE COUNT 698 a 491 c 541 g 642 t ORIGIN 1 gcaccgcgcg agcttggctg cttctggggc ctgtgtggcc ctgtgtgtcg gaaagatgga 61 gcaagaagcc gagcccgagg ggcggccgcg acccctctga ccgagatcct gctgctttcg 121 cagccaggag caccgtccct ccccggatta gtgcgtacga gcgcccagtg ccctggcccg 181 gagagtggaa tgatccccga ggcccagggc gtcgtgcttc cgcagtagtc agtccccgtg 241 aaggaaactg gggagtcttg agggaccccc gactccaagc gcgaaaaccc cggatggtga 301 ggagcaggca aatgtgcaat accaacatgt ctgtacctac tgatggtgct gtaaccacct 361 cacagattcc agcttcggaa caagagaccc tggttagacc aaagccattg cttttgaagt 421 tattaaagtc tgttggtgca caaaaagaca cttatactat gaaagaggtt cttttttatc 481 ttggccagta tattatgact aaacgattat atgatgagaa gcaacaacat attgtatatt 541 gttcaaatga tcttctagga gatttgtttg gcgtgccaag cttctctgtg aaagagcaca 601 ggaaaatata taccatgatc tacaggaact tggtagtagt caatcagcag gaatcatcgg 661 actcaggtac atctgtgagt gagaacaggt gtcaccttga aggtgggagt gatcaaaagg 721 accttgtaca agagcttcag gaagagaaac cttcatcttc acatttggtt tctagaccat 781 ctacctcatc tagaaggaga gcaattagtg agacagaaga aaattcagat gaattatctg 841 gtgaacgaca aagaaaacgc cacaaatctg atagtatttc cctttccttt gatgaaagcc 901 tggctctgtg tgtaataagg gagatatgtt gtgaaagaag cagtagcagt gaatctacag 961 ggacgccatc gaatccggat cttgatgctg gtgtaagtga acattcaggt gattggttgg 1021 atcaggattc agtttcagat cagtttagtg tagaatttga agttgaatct ctcgactcag 1081 aagattatag ccttagtgaa gaaggacaag aactctcaga tgaagatgat gaggtatatc 1141 aagttactgt gtatcaggca ggggagagtg atacagattc atttgaagaa gatcctgaaa 1201 tttccttagc tgactattgg aaatgcactt catgcaatga aatgaatccc ccccttccat 1261 cacattgcaa cagatgttgg gcccttcgtg agaattggct tcctgaagat aaagggaaag 1321 ataaagggga aatctctgag aaagccaaac tggaaaactc aacacaagct gaagagggct 1381 ttgatgttcc tgattgtaaa aaaactatag tgaatgattc cagagagtca tgtgttgagg 1441 aaaatgatga taaaattaca caagcttcac aatcacaaga aagtgaagac tattctcagc 1501 catcaacttc tagtagcatt atttatagca gccaagaaga tgtgaaagag tttgaaaggg 1561 aagaaaccca agacaaagaa gagagtgtgg aatctagttt gccccttaat gccattgaac 1621 cttgtgtgat ttgtcaaggt cgacctaaaa atggttgcat tgtccatggc aaaacaggac 1681 atcttatggc ctgctttaca tgtgcaaaga agctaaagaa aaggaataag ccctgcccag 1741 tatgtagaca accaattcaa atgattgtgc taacttattt cccctagttg acctgtctat 1801 aagagaatta tatatttcta actatataac cctaggaatt tagacaacct gaaatttatt 1861 cacatatatc aaagtgagaa aatgcctcaa ttcacataga tttcttctct ttagtataat 1921 tgacctactt tggtagtgga atagtgaata cttactataa tttgacttga atatgtagct 1981 catcctttac accaactcct aattttaaat aatttctact ctgtcttaaa tgagaagtac 2041 ttggtttttt ttttcttaaa tatgtatatg acatttaaat gtaacttatt attttttttg 2101 agaccgagtc ttgctctgtt acccaggctg gagtgcagtg ggtgatcttg gctcactgca 2161 agctctgccc tccccgggtt cgcaccattc tcctgcctca gcctcccaat tagcttggcc 2221 tacagtcatc tgccaccaca cctggctaat tttttgtact tttagtagag acagggtttc 2281 accgtgttag ccaggatggt ctcgatctcc tgacctcgtg atccgcccac ctcggcctcc 2341 caaagtgctg ggattacagg catgagccac cg // LOCUS HSP5CS 2907 bp RNA PRI 05-DEC-1996 DEFINITION H.sapiens mRNA for pyrroline 5-carboxylate synthetase. ACCESSION X94453 NID g1304313 KEYWORDS pyrroline-5-carboxlyate synthetase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2907) AUTHORS Aral,B., Schlenzig,J.S., Liu,G. and Kamoun,P. TITLE Database cloning human delta 1-pyrroline-5-carboxylate synthetase (P5CS) cDNA: a bifunctional enzyme catalyzing the first 2 steps in proline biosynthesis JOURNAL C. R. Acad. Sci. III, Sci. Vie 319 (3), 171-178 (1996) MEDLINE 96340872 REFERENCE 2 (bases 1 to 2907) AUTHORS Aral,B. TITLE Direct Submission JOURNAL Submitted (28-DEC-1995) B. Aral, CNRS URA1335, Biochimie Genetique, Hop. NECKER-EM, 149, RUE DE SEVRES, 75015 PARIS, FRANCE FEATURES Location/Qualifiers source 1..2907 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" CDS 139..2526 /codon_start=1 /product="pyrroline 5-carboxylate synthetase" /db_xref="PID:e218500" /db_xref="PID:g1304314" /db_xref="SWISS-PROT:P54886" /translation="MLSQVYRCGFQPFNQHLLPWVKCTTVFRSHCIQPSVIRHVRSWS NIPFITVPLSRTHGKSFAHRSELKHAKRIVVKLGSAVVTRGDECGLALGRLASIVEQV SVLQNQGREMMLVTSGAVAFGKQTLRHEILLSQSVRQALHSGQNQLKEMAIPVLEARA CAAAGQSGLMALYEAMFTQYSICAAQILVTNLDFHDEQKRRNLNGTLHELLRMNIVPI VNTNDAVVPPAEPNSDLQGVNVISVKDNDSLAARLAVEMKTDLLIVLPDVEGLFDSPP GSDDAKLIDIFYPGDQQSVTFGPKSRVGNGCMEAKVKSTLWALQGGTSVVIANGTHPK VSGHVITDIVEGKKVGTFFSEVKPAGPTVEQQGEMARSGGRMLATLEPEQRAEIIHHL ADLLTDQRDEILLANKKDLEEAEGRLAAPLLKRLSLSTSKLNSLAIGLRQIAASSQDS VGRVLRRTRIAKNLELEQVTVPIGVLLVIFESRPDCPTPGGSFAIASGNGLLLKGGKE AAHSNRILHLLTQEALSIHGVKEAVQLVNTREEVEDLCRLDKMIDLIIPRGSSQLVRD IQKAAKGIPVMGHSEGICHMYVDSEASVDKVTRLVRDSKCEYPAACNALETLLIHRDL LRTPLFDQIIDMLRVEQVKIHAGPKFASYLTFSPSEVKSLRTEYGDLELCIEVVDNVQ DAIDHIHKYGSSHTDVIVTEDENTAEFFLQHVDSACVFWNASTRFSDGYRFGLGAEVG ISTSRIHARGPVGLEGLLTTKWLLRGKDHVVSDFSEHGSLKYLHENLPIPQRNTN" BASE COUNT 732 a 702 c 782 g 691 t ORIGIN 1 gcgggacgtg gacggaagaa aaaagagagt gagcgagcgc cggaatcagc ccggcgtgga 61 gtgcggaccc gcggtagtgc cagggggcga aggcggcggt ggtgaggaag atactttggt 121 tagtgaccac atcgcagcat gttgagtcaa gtttaccgct gtgggttcca gcccttcaac 181 caacatcttc tgccctgggt caagtgtaca accgtcttca gatctcattg tatccagcct 241 tcagtcatca gacatgttcg ttcttggagc aacatcccgt ttatcactgt acccctcagt 301 cgtacacatg gcaagtcctt cgcccaccgc agtgagctga agcatgccaa gagaatcgtg 361 gtgaagctcg gcagtgccgt ggtgacccga ggggatgaat gtggcctggc cctggggcgc 421 ttggcatcta ttgttgagca ggtatcagtg ctgcagaatc agggcagaga gatgatgctg 481 gtgaccagtg gagccgtagc ctttggcaaa caaacgttgc gccatgagat ccttctgtct 541 cagagcgtgc ggcaggccct ccactcgggg cagaaccagc tgaaagaaat ggcaattcca 601 gtcttagagg cacgagcctg tgcagctgcc ggacagagtg ggctgatggc cttgtatgag 661 gctatgttta cccagtacag catctgtgct gcccagattt tggtgaccaa tttggatttc 721 catgatgagc agaagcgccg gaacctcaat ggaacacttc atgaactcct tagaatgaac 781 attgtcccca ttgtcaacac aaatgatgct gttgtccccc cagctgagcc caacagtgac 841 ctgcaggggg taaatgttat tagtgttaaa gataatgata gcctggctgc ccgactggct 901 gtggaaatga aaactgatct cttgattgtc cttccagatg tagaaggcct ttttgacagc 961 cccccaggtt cagatgatgc aaagcttatt gatatatttt atcccggaga tcagcagtct 1021 gtgacatttg gacccaagtc tagagtggga aatgggtgca tggaagccaa ggtgaaaagc 1081 accctctggg ctttgcaagg tggcacttct gttgttattg ccaatggaac ccacccaaag 1141 gtgtctgggc acgtcatcac agacattgtg gaggggaaga aagttggtac cttcttttca 1201 gaagtaaagc ctgcaggccc tactgttgag cagcagggag aaatggcgcg atctggagga 1261 aggatgttgg ccaccttgga acctgagcag agagcagaaa ttatccatca tctggctgat 1321 ctgttgacgg accagcgtga tgagatcctg ttagccaaca aaaaagactt ggaggaggca 1381 gaggggagac ttgcagctcc tctgctgaaa cgtttaagcc tctccacatc caaattgaac 1441 agcctggcca tcggtctgcg acagatcgca gcctcctccc aggacagcgt gggacgtgtt 1501 ttgcgccgca cccgaatcgc caaaaacttg gaactggaac aagtgactgt cccaattgga 1561 gttctgctgg tgatctttga atctcgtcct gactgtccta ccccaggtgg cagctttgct 1621 atcgcaagtg gcaatggctt gttactcaaa ggagggaagg aggctgcaca cagcaaccgg 1681 attctccacc tcctgaccca ggaggctctc tcaatccatg gagtcaagga ggccgtgcaa 1741 ctggtgaata ccagagaaga agttgaagat ctttgccgcc tagacaaaat gatagatctg 1801 atcattccac gtggctcttc ccagctggtc agagacatcc agaaagctgc taaggggatt 1861 ccagtgatgg ggcacagcga agggatctgt cacatgtatg tggattccga ggccagtgtt 1921 gataaggtca ccaggctagt cagagactct aaatgtgaat atccagctgc ctgtaatgct 1981 ttggagactt tgttaatcca ccgggatctg ctcaggacac cattatttga ccagatcatt 2041 gatatgctga gagtggaaca ggtaaaaatt catgcaggcc ccaaatttgc ctcctatctg 2101 accttcagcc cctccgaagt gaagtcactc cgaactgagt atggggacct ggaattatgc 2161 attgaagtag tggacaacgt tcaggatgcc attgaccaca tccacaagta tggcagctcc 2221 cacacggatg tcatcgtcac agaggacgaa aacacagcgg agttcttcct gcagcacgta 2281 gacagtgcct gtgtgttctg gaatgccagc actcgctttt ctgatggtta ccgctttgga 2341 ctgggagctg aagtgggaat cagtacatcg agaatccacg cccggggacc agtaggactt 2401 gagggactgc ttactactaa gtggctgctg cgagggaagg accacgtggt ctcagatttc 2461 tcagagcatg gaagtttaaa atatcttcat gagaacctcc ctattcctca gagaaacacc 2521 aactgaaaag agccaggaaa acccgggaat tttccaaaag gtcttcacgt taaacttgtc 2581 ttatctcagg agagagcccg ctcttgtctc ccagttcctg gtagggtctg cctgttggaa 2641 agtgtacctg gatgcttctg ggctccgttt ggcaatagca atcttggctg atgtgcacag 2701 tctggctccc agctcaccct ttttttttaa agtaagaaaa tagttgctac cgatagggac 2761 tttgccaagt ccaattatct tctaggattg aaaggtgcat tttccccata aaaaaggcga 2821 ggaaaaccca tggctgcttt gtgtcacctc agtgacttac agtccccctt ggcatttagt 2881 tggtactaga gccagtatcc ttaacaa // LOCUS HSP63 2910 bp RNA PRI 13-JUL-1994 DEFINITION H.sapiens p63 mRNA for transmembrane protein. ACCESSION X69910 NID g297407 KEYWORDS ER-golgi intermediate compartment; p63 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2910) AUTHORS Schweizer,A. TITLE Direct Submission JOURNAL Submitted (28-DEC-1992) A. Schweizer, Biocenter of the University of Basel, Department of Pharmacology, Klingelbergstrasse 70, CH-4056 Basel, SWITZERLAND REFERENCE 2 (bases 1 to 2910) AUTHORS Schweizer,A., Rohrer,J., Jeno,P., DeMaio,A., Buchman,T.G. and Hauri,H.P. TITLE A reversibly palmitoylated resident protein (p63) of an ER-Golgi intermediate compartment is related to a circulatory shock resuscitation protein JOURNAL J. Cell. Sci. 104 (Pt 3), 685-694 (1993) MEDLINE 93300949 FEATURES Location/Qualifiers source 1..2910 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="human placenta lambda gt10" gene 85..1890 /gene="p63" CDS 85..1890 /gene="p63" /codon_start=1 /product="P63 protein" /db_xref="PID:g297408" /translation="MPSAKQRGSKGGHGAASPSEKGAHPSGGADDVAKKPPPAPQQPP PPPAPHPQQHPQQHPQNQAHGKGGHRGGGGGGGKSSSSSSASAAAAAAAASSSASCSR RLGRALNFLFYLALVAAAAFSGWCVHHVLEEVQQVRRSHQDFSRQREELGQGLQGVEQ KVQSLQATFGTFESILRSSQHKQDLTEKAVKQGESEVSRISEVLQKLQNEILKDLSDG IHVVKDARERDFTSLENTVEERLTELTKSINDNIAIFTEVQKRSQKEINDMKAKVASL EESEGNKQDLKALKEAVKEIQTSAKSREWDMEALRSTLQTMESDIYTEVRELVSLKQE QQAFKEAADTERLALQALTEKLLRSEESVSRLPEEIRRLEEELRQLKSDSHGPKEDGG FRHSEAFEALQQKSQGLDSRLQHVEDGVLSMQVASARQTESLESLLSKSQEHEQRLAP AGALEGLGSSEADQDGLASTVRSLGETQLVLYGDVEELKRSVGELPSTVESLQKVQEQ VHTLLSQDQAQAARLPPQDFLDRLSSLDNLKASVSQVEADLKMLRTAVDSLVAYSVKI ETNENNLESAKGLLDDLRNDLDRLFVKVEKIHEKV" misc_feature 403..465 /gene="p63" /note="transmembrane domain" BASE COUNT 665 a 806 c 849 g 590 t ORIGIN 1 gggggagccc ctgcaagttt cccgggccgc gcgccgcgct cgctcgcctc ccagcccgcg 61 gcccgagccg ccgccgcgcc cgccatgccc tcggccaaac aaaggggctc caagggcggc 121 cacggcgccg cgagcccctc ggagaagggt gcccacccgt cgggcggcgc ggatgacgtg 181 gcgaagaagc cgccgccggc gccgcagcag ccgccgccgc cgcccgcgcc gcacccgcag 241 cagcacccgc agcagcaccc gcagaaccag gcgcacggca agggcggcca ccgcggcggc 301 ggcggcggcg gcggcaagtc ctcctcctcc tcctccgcct ccgccgccgc tgccgccgcc 361 gccgcctcgt cctcggcgtc ctgctcgcgc aggctcggca gggcgctcaa ctttctcttc 421 tacctcgccc tggtggcggc ggccgctttc tcgggctggt gcgtccacca cgtcctggag 481 gaggtccagc aggtccggcg cagccaccag gacttctccc ggcagaggga ggagctgggc 541 cagggcttgc agggcgtcga gcagaaggtg cagtctttgc aagccacatt tggaactttt 601 gagtccatct tgagaagctc ccaacataaa caagacctca cagagaaagc tgtgaagcaa 661 ggggagagtg aggtcagccg gatcagcgaa gtgctgcaga aactccagaa tgagattctc 721 aaagacctct cggatgggat ccatgtggtg aaggacgccc gggagcggga cttcacgtcc 781 ctggagaaca cggtggagga gcggctgacg gagctcacca aatccatcaa cgacaacatc 841 gccatcttca cagaagtcca gaagaggagc cagaaggaga tcaatgacat gaaggcaaag 901 gttgcctccc tggaagaatc tgaggggaac aagcaggatt tgaaagcctt aaaggaagct 961 gtgaaggaga tacagacctc agccaagtcc agagagtggg acatggaggc cctgagaagt 1021 acccttcaga ctatggagtc tgacatctac accgaggttc gcgagctggt gagcctcaag 1081 caggagcagc aggctttcaa ggaggcggcc gacacggagc ggctcgccct gcaggccctc 1141 acggagaagc ttctcaggtc tgaggagtcc gtctcccgcc tcccggagga gatccggaga 1201 ctggaggaag agctccgcca gctgaagtcc gattcccacg ggccgaagga ggacggaggc 1261 ttcagacact cggaagcctt tgaggcactc cagcaaaaga gtcagggact ggactccagg 1321 ctccagcacg tggaggatgg ggtgctctcc atgcaggtgg cttctgcgcg ccagaccgag 1381 agcctggagt ccctcctgtc caagagccag gagcacgagc agcgcctggc ccctgcaggg 1441 gccctggaag gcctcgggtc ctcagaggca gaccaggatg gcctggccag cacggtgagg 1501 agcctgggcg agacccagct ggtgctctac ggtgacgtgg aggagctgaa gaggagtgtg 1561 ggcgagctcc ccagcaccgt ggaatcactc cagaaggtgc aggagcaggt gcacacgctg 1621 ctcagtcagg accaagccca ggccgcccgt ctgcctcctc aggacttcct ggacagactt 1681 tcttctctag acaacctgaa agcctcagtc agccaagtgg aggcggactt gaaaatgctc 1741 aggactgctg tggacagttt ggttgcatac tcggtcaaaa tagaaaccaa cgagaacaat 1801 ctggaatcag ccaagggttt actagatgac ctgaggaatg atctggatag gttgtttgtg 1861 aaagtggaga agattcacga aaaggtctaa atgaattgcg tgtgcagggc gcggatttaa 1921 agtccaattt ctcatgacca aaaaatgtgt ggttttttcc catgtgtccc ctacccccca 1981 atttcttgtc ccctcttaaa gagcagttgt caccacctga acaccaaggc attgtatttt 2041 catgcccagt taacttattt acaatattta agttctctgc ttctgcattt ggttggtttc 2101 ctgaagcgca gcccctgtga ataacaggtg gcttttcatg gatgtctcta gtcagagaaa 2161 aatgataaag gcttaaattg aggattaaca gaagcagatt aacctcagaa atcctgtctg 2221 gctggcagat ttcaagtaaa aaaaaaaaaa aggtgggttg gggggaccct tttctttcta 2281 gttgtcttta aggaaaatta attttacttt tttttttgtt ctggccgaaa tttttatgag 2341 atatctctca cttgtcttcc actttgaacc ggttaaagct catagctgtc agctctgaat 2401 gaggagggga gaagcccctg ggtctttctt tgaaaggaat ccgctgcttg agggctgcct 2461 ccctcatggt gtgcgtgtcg ttctcttcct gacgcatctg tgatatcaga ggtaactatg 2521 caaagcatcc aggcggttct gaatgtgaag cactacaccc agcagagtcc cggtgccctc 2581 tgtccccact gccggcccat gtcctctctc cggaggtcac caaggaatgc acaggtttcg 2641 actaccagaa aggggagtcc ttgggttctt tcaaaaaatt cgtgaggaga gctgtctaca 2701 gtggaatagg gggtctccct ggggaatgca ggccaagtcc ttttatttta acatgatgtc 2761 catgaagagg tttgccgtct gggcagccct gtcggcaagg agcgtgcata ctgcgtttgt 2821 gtaattgttt gctgtatctc ccttccctct gagctgtatt gttctttaat ggctgtcttg 2881 cccttccaaa aaaaattgaa aaaaaaaaaa // LOCUS HSP64BCCP 1229 bp RNA PRI 01-NOV-1997 DEFINITION H.sapiens mRNA homologous to the p64 bovine chloride channel peptide. ACCESSION Y12696 NID g2584784 KEYWORDS chloride channel protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1229) AUTHORS Heiss,N.S. TITLE Direct Submission JOURNAL Submitted (21-APR-1997) N.S. Heiss, Deutsches Krebsforschungszentrum (DKFZ), Molecular Genome Analysis, Im Neuenheimer Feld 280, 69120 Heidelberg, FRG REFERENCE 2 (bases 1 to 1229) AUTHORS Heiss,N.S. and Poustka,A. TITLE Genomic structure of a novel chloride channel gene, CLIC2, in Xq28 JOURNAL Genomics 45 (1), 224-228 (1997) MEDLINE 97480736 FEATURES Location/Qualifiers source 1..1229 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /map="q28" CDS 222..953 /codon_start=1 /product="p64 bovine chloride channel-like protein" /db_xref="PID:e1169565" /db_xref="PID:g2584785" /translation="MSGLRPGTQVDPEIELFVKAGSDGESIGNCPFCQRLFMILWLKG VKFNVTTVDMTRKPEELKDLAPGTNPPFLVYNKELKTDFIKIEEFLEQTLAPPRYPHL SPKYKESFDVGCNLFAKFSAYIKNTQKEANKNFEKSLLKEFKRLDDYLNTPLLDEIDP DSAGEPPVSRRLFLDGDQLTLADCSLLPKLNIIKVAAKKYRDFDIPAEFSGVWRYLHN AYAREEFTHTCPEDKEIENTYANVA" BASE COUNT 391 a 251 c 249 g 338 t ORIGIN 1 ataacgattt caagagctgc acttaagcat ctagaatttt ctgcgtcaca cctcttgaga 61 gaagagactg gctccaggtc tgactcagtc cactacaagc tagacggtct tcttaaagca 121 ccaacattac ttgagtcttt ggataaaatt gagaaaagag tctacaagta ttgtggactc 181 tacaggaggc aggaggctga caactggcag taaagacaaa gatgtcaggc ctgcggcccg 241 gcactcaagt ggaccctgag attgagcttt ttgtaaaggc tggaagtgat ggagagagta 301 ttggaaactg tcccttttgc caacgccttt tcatgatcct ctggcttaaa ggagttaaat 361 ttaatgtgac aactgttgac atgaccagaa agcctgaaga actaaaggac ttagccccag 421 gtaccaatcc tccgttcctg gtgtataaca aggagttgaa aacagacttc attaaaattg 481 aggagttttt agaacaaacc ctggctcctc caaggtaccc tcacctgagt cccaagtaca 541 aggagtcttt tgatgtgggc tgtaacctct ttgccaagtt ttctgcatac attaagaata 601 cacaaaagga ggcaaataag aattttgaaa aatctctgct caaagaattc aagcgtctgg 661 atgactactt aaacacccca cttctggatg aaattgatcc agacagtgct ggggaacccc 721 cagtttccag aagactattc ttggatgggg accagctaac actggctgat tgtagcttgt 781 tacccaagct gaacattatt aaagttgctg ccaagaaata tcgtgacttt gacattccag 841 cagaattctc aggagtctgg cgttatctcc acaatgccta tgcccgtgaa gaatttaccc 901 acacgtgtcc tgaagacaaa gaaattgaaa atacttacgc aaatgtggct taaacagaag 961 agttaggaga gctcttacag gagaaaaggc tatatttgtg atcagatttt acttattgac 1021 atattagaaa ggtttttgca aataagaata tgaaaaatac tgtttcttct atccaactct 1081 cttatgaaaa ggaactctgt attttctatt agccataaat aatctgtcca ctgtatttta 1141 caggtcttca tacttttact taattttctt tatctgtatg gcaaaccact gcaatcctga 1201 atgacatgga aagcatcaca aaaaaaaaa // LOCUS HSP64CLCP 1193 bp RNA PRI 24-JAN-1997 DEFINITION H.sapiens mRNA for putative p64 CLCP protein. ACCESSION X87689 NID g895844 KEYWORDS p64 CLCP gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1193) AUTHORS Borsani,G. JOURNAL Unpublished REFERENCE 2 (bases 1 to 1193) AUTHORS Hillier,L., Clark,N., Dubuque,T., Elliston,K., Hawkins,M., Holman,M., Hultman,M., Kucaba,T., Le,M., Lennon,G., Marra,M., Parsons,J., Rifkin,L., Rohlfing,T., Soares,M., Tan,F., Trevaskis,E., Waterston,R., Williamson,A., Wohldmann,P. and Wilson,R. TITLE The WashU-Merck EST Project JOURNAL Unpublished REFERENCE 3 (bases 1 to 1193) AUTHORS Adams,M.D., Kerlavage,A.R., Fleischmann,R.D., Fuldner,R.A., Bult,C.J., Lee,N., Kirkness,E.F., Weinstock,K.G., Gocayne,J.D., White,O., Sutton,G., Blake,J.A., Brandon,R.C., Chiu,M.W., Clayton,R.A., Cline,R.T., Cotton,M.D., Earle-Hughes,J., Fine,L.D., FitzGerald,L.M., FitzHugh,W.M., Fritchman,J.L., Geoghagen,N.S.M., Glodek,A., Gnehm,C.L., Hanna,M.C., Hedblom,E., Hinkle Jr,P.S., Kelley,J.M., Klimek,K.M., Kelley,J.C., Liu,L.I., Marmaros,S.M., Merrick,J.M., MORENO-PALANQUES,R.F., McDonald,L.A., Nguyen,D.T., Pellegrino,S.M., Phillips,C.A., Ryder,S.E., Scott,J.L., Saudek,D.M., Shirley,R., Small,K.V., Spriggs,T.A., Utterback,T.R., Weidman,J.F., Li,Y., Bednarik,D.P., Cao,L., Cepeda,M.A., Coleman,T.A., Collins,E.J., Dimke,D., Feng,P., Ferrie,A., Fischer,C., Hastings,G.A., He,W.W., Hu,J.S., Greene,J.M., Gruber,J., Hudson,P., Kim,A., Kozak,D.L., Kunsch,C., Ji,H., Li,H., Meissner,P.S., Olsen,H., Raymond,L., Wei,Y.F., Wing,J., Xu,C., Yu,G.L., Ruben,S.M., Dillon,P.J., Fannon,M.R., Rosen,C.A., Haseltine,W.A., Fields,C., Fraser,C.M. and Venter,J.C. TITLE Initial Assessment of Human Gene Diversity and Expression Patterns Based Upon 52 Million Basepairs of cDNA Sequence JOURNAL Unpublished REFERENCE 4 (bases 1 to 1193) AUTHORS Stevens,T.J., Berry,R., Goold,R., Walter,N.A.R., Wilcox,A.S., Hopkins,J.A., Rubano,T., Weber,J., Soares,M.B. and Sikela,J.M. TITLE Gene-based STSs as the basis for a human gene map JOURNAL Unpublished REFERENCE 5 (bases 1 to 1193) AUTHORS Genexpress. TITLE The Genexpress cDNA program JOURNAL Unpublished REFERENCE 6 (bases 1 to 1193) AUTHORS Liew,C.C., Hwang,D.M., Fung,Y.W., Laurenssen,C., Cukerman,E., Tsui,S. and Lee,C.Y. TITLE A catalogue of genes in the cardiovascular system as identified by expressed sequence tags JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91 (22), 10645-10649 (1994) MEDLINE 95024171 REFERENCE 7 (bases 1 to 1193) AUTHORS Frigerio,J.M., Berthezene,P., Garrido,P., Ortiz,E., Barthellemy,S., Vasseur,S., Sastre,B., Seleznieff,I., Dagorn,J.C. and Iovanna,J.L. TITLE Analysis of 2166 clones from a human colorectal cancer cDNA library by partial sequencing JOURNAL Hum. Mol. Genet. 4 (1), 37-43 (1995) MEDLINE 95227175 REFERENCE 8 (bases 1 to 1193) AUTHORS Okubo,K., Hori,N., Matoba,R., Niiyama,T., Fukushima,A., Kojima,Y. and Matsubara,K. TITLE Large scale cDNA sequencing for analysis of quantitative and qualitative aspects of gene expression JOURNAL Nature Genet. 2 (3), 173-179 (1992) MEDLINE 94258199 REFERENCE 9 (bases 1 to 1193) AUTHORS Borsani,G. TITLE Direct Submission JOURNAL Submitted (31-MAY-1995) G. Borsani, Tigem, Telethon Inst. of Genet. & Med., Via Olgettina 58, 20132 Milan, ITALY COMMENT citation [2]: nt 1-401: overlaps with R15118 nt 1-441: overlaps with R24789 nt 5-296: overlaps with R12444 nt 122-528: overlaps with R11164 nt 218-458: overlaps with R01554 nt 230-398: overlaps with T86513 nt 446-800: overlaps with R21192 nt 573-876: overlaps with R25960 nt 723-1167: overlaps with T56746 nt 773-1193: overlaps with R37330 nt 782-1184: overlaps with R41542 nt 790-1167: overlaps with T56690 nt 827-1165: overlaps with T86423 nt 849-1167: overlaps with R26766 nt 871-1132: overlaps with R00891 nt 879-1182: overlaps with R45521 nt 918-1174: overlaps with R10741 citation [3]: nt 27-281: overlaps with T34255 nt 39-407: overlaps with T34116 nt 40-317: overlaps with T35224 nt 65-317: overlaps with T35910 nt 793-1180: overlaps with T34324 nt 859-1193: overlaps with T35544 citation [4]: nt 33-323: overlaps with T17495 nt 754-1187: overlaps with T17494 citation [5]: nt 214-551: overlaps with Z15318 nt 214-549: overlaps with Z15381 nt 861-1166: overlaps with Z15317 citation [6]: Overlaps with with T11522(252..538) & T11650 (626..838) citation [7]: nt 611-826: overlaps with T24611 citation [8]: nt 887-1170 and nt 1171-1177: overlap with D12467. FEATURES Location/Qualifiers source 1..1193 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA 214..930 gene 298..930 /gene="p64 CLCP" CDS 298..930 /gene="p64 CLCP" /note="putative start codon" /codon_start=1 /db_xref="PID:g895845" /translation="MVLWLKGVTFNVTTVDTKRRTETVQKLCPGGQLPFLLYGTEVHT DTNKIEEFLEAVLCPPRYPKLAALNPESNTAGLDIFAKFSAYIKNSNPALNDNLEKGL LKALKVLDNYLTSPLPEEVDETSAEDEGVSQRKFLDGNELTLADCNLLPKLHIVQVVC KKYRGFTIPEAFRGVHRYLSNAYAREEFASTCPDDEEIELAYEQVAKALK" BASE COUNT 299 a 297 c 341 g 256 t ORIGIN 1 gagctggagg agctgggtgt ggggtgcgtt gggctggtgg ggaggcctag tttgggtgca 61 agtaggtctg attgagcttg tgttgtgctg aagggacagc cctgggtcta ggggagagag 121 tccctgagtg tgagacccgc cttccccggt cccagcccct cccagttccc ccagggacgg 181 ccacttcctg gtccccgacg caaccatggc tgaagaacaa ccgcagtcga attgttcgtg 241 aaggctggca gtgatggggc caagattggg aactgcccat tctcccagag actgttcatg 301 gtactgtggc tcaagggagt caccttcaat gttaccaccg ttgacaccaa aaggcggacc 361 gagacagtgc agaagctgtg cccagggggg cagctcccat tcctgctgta tggcactgaa 421 gtgcacacag acaccaacaa gattgaggaa tttctggagg cagtgctgtg ccctcccagg 481 taccccaagc tggcagctct gaaccctgag tccaacacag ctgggctgga catatttgcc 541 aaattttctg cctacatcaa gaattcaaac ccagcactca atgacaatct ggagaaggga 601 ctcctgaaag ccctgaaggt tttagacaat tacttaacat cccccctccc agaagaagtg 661 gatgaaacca gtgctgaaga tgaaggtgtc tctcagagga agtttttgga tggcaacgag 721 ctcaccctgg ctgactgcaa cctgttgcca aagttacaca tagtacaggt ggtgtgtaag 781 aagtaccggg gattcaccat ccccgaggcc ttccggggag tgcatcggta cttgagcaat 841 gcctacgccc gggaagaatt cgcttccacc tgtccagatg atgaggagat cgagctcgcc 901 tatgagcaag tggcaaaggc cctcaaataa gcccctcctg ggactccctc aaccccctcc 961 attttctcca caaaggccct ggtggtttcc acattgctac ccaatggaca cactccaaaa 1021 tggccagtgg gcagggaatc ctggagcact tgttccggga tggtgtggtg gaagagggga 1081 tgagggaaag aaatgggggg cctgggtcag atttttattg tggggtggga tgagtaggac 1141 aacatatttc agtaataaaa tacagaataa aaatcaagtg tttttaaaaa aaa // LOCUS HSP68 2468 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for protein p68. ACCESSION Y00097 NID g35217 KEYWORDS calcium binding protein; membrane protein; protein p68. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2468) AUTHORS Crompton,M.R. TITLE Direct Submission JOURNAL Submitted (18-DEC-1987) Crompton M.R., Imperial Cancer Research Fund, P.O. Box 123, Loncoln's Inn Fields, London WC2A 3PX REFERENCE 2 (bases 1 to 2468) AUTHORS Crompton,M.R., Owens,R.J., Totty,N.F., Moss,S.E., Waterfield,M.D. and Crumpton,M.J. TITLE Primary structure of the human, membrane-associated Ca2+-binding protein p68 a novel member of a protein family JOURNAL EMBO J. 7 (1), 21-27 (1988) MEDLINE 88196081 REMARK Erratum:[EMBO J 1988 Jun;7(6):1914]] COMMENT Data kindly reviewed (21-APR-1988) by Crompton M. FEATURES Location/Qualifiers source 1..2468 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="J6 T-leukemia" /clone="A2, C2, 10-6" CDS 101..2122 /note="protein p68 (1 - 673)" /codon_start=1 /db_xref="PID:g35218" /db_xref="SWISS-PROT:P08133" /translation="MAKPAQGAKYRGSIHDFPGFDPNQDAEALYTAMKGFGSDKEAIL DIITSRSNRQRQEVCQSYKSLYGKDLIADLKYELTGKFERLIVGLMRPPAYCDAKEIK DAISGIGTDEKCLIEILASRTNEQMHQLVAAYKDAYERDLEADIIGDTSGHFQKMLVV LLQGTREEDDVVSEDLVQQDVQDLYEAGELKWGTDEAQFIYILGNRSKQHLRLVFDEY LKTTGKPIEASIRGELSGDFEKLMLAVVKCIRSTPEYFAERLFKAMKGLGTRDNTLIR IMVSRSELDMLDIREIFRTKYEKSLYSMIKNDTSGEYKKTLLKLSGGDDDAAGQFFPE AAQVAYQMWELSAVARVELKGTVRPANDFNPDADAKALRKAMKGLGTDEDTIIDIITH RSNVQRQQIRQTFKSHFGRDLMTDLKSEISGDLARLILGLMMPPAHYDAKQLKKAMEG AGTDEKALIEILATRTNAEIRAINEAYKEDYHKSLEDALSSDTSGHFRRILISLATGH REEGGENLDQAREDAQVAAEILEIADTPSGDKTSLETRFMTILCTRSYPHLRRVFQEF IKMTNYDVEHTIKKEMSGDVRDAFVAIVQSVKNKPLFFADKLYKSMKGAGTDDKTLTR IMVSRSEIDLLNIRREFIEKYDKSLHQAIEGDTSGDFLKALLALCGGED" BASE COUNT 613 a 646 c 710 g 499 t ORIGIN 1 ccgatccagc gagcgctgcg tcctcgagtc cctgcgcccg tgcgtccgtc tgcgacccga 61 ggcctccgct gcgcgtggat tctgctgcga accggagacc atggccaaac cagcacaggg 121 tgccaagtac cggggctcca tccatgactt cccaggcttt gaccccaacc aggatgccga 181 ggctctgtac actgccatga agggctttgg cagtgacaag gaggccatac tggacataat 241 cacctcacgg agcaacaggc agaggcagga ggtctgccag agctacaagt ccctctacgg 301 caaggacctc attgctgatt taaagtatga attgacgggc aagtttgaac ggttgattgt 361 gggcctgatg aggccacctg cctattgtga tgccaaagaa attaaagatg ccatctcggg 421 cattggcact gatgagaagt gcctcattga gatcttggct tcccggacca atgagcagat 481 gcaccagctg gtggcagcat acaaagatgc ctacgagcgg gacctggagg ctgacatcat 541 cggcgacacc tctggccact tccagaagat gcttgtggtc ctgctccagg gaaccaggga 601 ggaggatgac gtagtgagcg aggacctggt acaacaggat gtccaggacc tatacgaggc 661 aggggaactg aaatggggaa cagatgaagc ccagttcatt tacatcttgg gaaatcgcag 721 caagcagcat cttcggttgg tgttcgatga gtatctgaag accacaggga agccgattga 781 agccagcatc cgaggggagc tgtctgggga ctttgagaag ctaatgctgg ccgtagtgaa 841 gtgtatccgg agcaccccgg aatattttgc tgaaaggctc ttcaaggcta tgaagggcct 901 ggggactcgg gacaacaccc tgatccgcat catggtctcc cgtagtgagt tggacatgct 961 cgacattcgg gagatcttcc ggaccaagta tgagaagtcc ctctacagca tgatcaagaa 1021 tgacacctct ggcgagtaca agaagactct gctgaagctg tctgggggag atgatgatgc 1081 tgctggccag ttcttcccgg aggcagcgca ggtggcctat cagatgtggg aacttagtgc 1141 agtggcccga gtagagctga agggaactgt gcgcccagcc aatgacttca accctgacgc 1201 agatgccaaa gcgctgcgga aagccatgaa gggactcggg actgacgaag acacaatcat 1261 cgatatcatc acgcaccgca gcaatgtcca gcggcagcag atccggcaga ccttcaagtc 1321 tcactttggc cgggacttaa tgactgacct gaagtctgag atctctggag acctggcaag 1381 gctgattctg gggctcatga tgccaccggc ccattacgat gccaagcagt tgaagaaggc 1441 catggaggga gccggcacag atgaaaaggc tcttattgaa atcctggcca ctcggaccaa 1501 tgctgaaatc cgggccatca atgaggccta taaggaggac tatcacaagt ccctggagga 1561 tgctctgagc tcagacacat ctggccactt caggaggatc ctcatttctc tggccacggg 1621 gcatcgtgag gagggaggag aaaacctgga ccaggcacgg gaagatgccc aggtggctgc 1681 tgagatcttg gaaatagcag acacacctag tggagacaaa acttccttgg agacacgttt 1741 catgacgatc ctgtgtaccc ggagctatcc gcacctccgg agagtcttcc aggagttcat 1801 caagatgacc aactatgacg tggagcacac catcaagaag gagatgtctg gggatgtcag 1861 ggatgcattt gtggccattg ttcaaagtgt caagaacaag cctctcttct ttgccgacaa 1921 actttacaaa tccatgaagg gtgctggcac agatgacaag actctgacca ggatcatggt 1981 atcccgcagt gagattgacc tgctcaacat ccggagggaa ttcattgaga aatatgacaa 2041 gtctctccac caagccattg agggtgacac ctccggagac ttcctgaagg ccttgctggc 2101 tctctgtggt ggtgaggact agggccacag ctttggcggg cacttctgcc aagaaatggt 2161 tatcagcacc agccgccatg gccaagcctg attgttccag ctccagagac taaggaaggg 2221 gcaggggtgg ggggaggggt tgggttgggc tcttatcttc agtggagctt aggaaacgct 2281 cccactccca cgggccatcg agggcccagc acggctgagc ggctgaaaaa ccgtagccat 2341 agatcctgtc cacctccact cccctctgac cctcaggctt tcccagcttc ctccccttgc 2401 tagagcctct gccctggttt gggctatgtc agatccaaaa acatcctgaa cctctgtctg 2461 taaaaaaa // LOCUS HSPABPII 1569 bp RNA PRI 27-FEB-1995 DEFINITION H.sapiens mRNA for polyadenylate binding protein II. ACCESSION Z48501 NID g693936 KEYWORDS polyadenylate binding protein II. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1569) AUTHORS Murphy,E.P., McKenna,N.J. and Headon,D.R. TITLE Nucleotide sequence of a partial cDNA encoding a novel human polyadenylate-binding protein JOURNAL Unpublished REFERENCE 2 (bases 1 to 1569) AUTHORS McKenna,N.J. TITLE Direct Submission JOURNAL Submitted (27-FEB-1995) Neil J McKenna, Cell and Molecular Biology Group, Department of, Biochemistry, University College Galway, University Road, Galway, Ireland FEATURES Location/Qualifiers source 1..1569 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="breast cancer" /cell_line="T47D" CDS 1..1569 /codon_start=1 /product="polyadenylate binding protein II" /db_xref="PID:g693937" /translation="MLYEKFSPAGPILSIRVCRDMITRRSLGYAYVNFQQPADAERAL DTMNFDVIKGKPVRIMWSQRDPSLRKSGVGNIFIKNLDKSIDNKALYDTFSAFGNILS CKVVCDENGSKGYGFVHFETQEAAERAIEKMNGMLLNDRKVFVGRFKSRKEREAELGA RAKEFTNVYIKNFGEDMDDERLKDLFGKFGPALSVKVMTDESGKSKGFGFVSFERHED AQKAVDEMNGKELNGKQIYVGRAQKKVERQTELKRKFEQMKQDRITRYQGVNLYVKNL DDGIDDERLRKEFSPFGTITSAKVMMEGGRSKGFGFVCFSSPEEATKAVTEMNGRIVA TKPLYVALAQRKEERQAHLTNQYMQRMASVRAVPNPVINPYQPAPPSGYFIAAIPQTQ NRAAYYPPSQIAQLRPSPRWTAQGARPHPAVHVQGQEPLTASMLASAPPQEQKQMLGE RLFPLIQAMHPTLAGKITGMLLEIDNSELLHMLESPESLRSKVDEAVAVLQAHQAKEA AQKAVNSATGVPTV" misc_feature 79..102 /note="ribonucleoprotein consensus sequence-1" misc_feature 310..333 /note="ribonucleoprotein consensus sequence-1" misc_feature 337..360 /note="ribonucleoprotein consensus sequence-1" misc_feature 616..639 /note="ribonucleoprotein consensus sequence-1" BASE COUNT 457 a 338 c 376 g 398 t ORIGIN 1 atgctctacg agaagttcag cccggccggg cccatcctct ccatccgggt ctgcagggac 61 atgatcaccc gccgctcctt gggctacgcg tatgtgaact tccagcagcc ggcggacgcg 121 gagcgtgctt tggacaccat gaattttgat gttataaagg gcaagccagt acgcatcatg 181 tggtctcagc gtgatccatc acttcgcaaa agtggagtag gcaacatatt cattaaaaat 241 ctggacaaat ccattgataa taaagcactg tatgatacat tttctgcttt tggtaacatc 301 ctttcatgta aggtggtttg tgatgaaaat ggttccaagg gctatggatt tgtacacttt 361 gagacgcagg aagcagctga aagagctatt gaaaaaatga atggaatgct cctaaatgat 421 cgcaaagtat ttgttggacg atttaagtct cgtaaagaac gagaagctga acttggagct 481 agggcaaaag aattcaccaa tgtttacatc aagaattttg gagaagacat ggatgatgag 541 cgccttaagg atctctttgg caagtttggg cctgccttaa gtgtgaaagt aatgactgat 601 gaaagtggaa aatccaaagg atttggattt gtaagctttg aaaggcatga agatgcacag 661 aaagctgtgg atgagatgaa cggaaaggag ctcaatggaa aacaaattta tgttggtcga 721 gctcagaaaa aggtggaacg gcagacggaa cttaagcgca aatttgaaca gatgaaacaa 781 gataggatca ccagatacca gggtgttaat ctttatgtga aaaatcttga tgatggtatt 841 gatgatgaac gtctccggaa agagttttct ccatttggta caatcactag tgcaaaggtt 901 atgatggagg gtggtcgcag caaagggttt ggttttgtat gtttctcctc cccagaagaa 961 gccactaaag cagttacaga aatgaacggt agaattgtgg ccacaaagcc attgtatgta 1021 gctttagctc agcgcaaaga agagcgccag gctcacctca ctaaccagta tatgcagaga 1081 atggcaagtg tacgagctgt tcccaaccct gtaatcaacc cctaccagcc agcacctcct 1141 tcaggttact tcatcgcagc tatcccacag actcagaacc gtgctgcata ctatcctcct 1201 agccaaattg ctcaactaag accaagtcct cgctggactg ctcagggtgc cagacctcat 1261 cctgctgttc atgtacaagg tcaggaacct ttgactgctt ccatgttggc atctgcccct 1321 cctcaagagc aaaagcaaat gttgggtgaa cggctgtttc ctcttattca agccatgcac 1381 cctactcttg ctggtaaaat cactggcatg ttgttggaga ttgataattc agaacttctt 1441 catatgctcg agtctccaga gtcactccgt tctaaggttg atgaagctgt agctgtacta 1501 caagcccacc aagctaaaga ggctgcccag aaagcagtta acagtgccac cggtgttcca 1561 actgtttaa // LOCUS HSPACO 2070 bp RNA PRI 02-SEP-1994 DEFINITION H.sapiens mRNA for peroxisomal acyl-CoA oxidase. ACCESSION X71440 NID g535031 KEYWORDS peroxisomal acyl-CoA oxidase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2070) AUTHORS Fournier,B. TITLE Direct Submission JOURNAL Submitted (16-APR-1993) B. T. Poll-The, Whilhemina Children's Hospital, Nieuwe Gracht 137, 3512 LK Utrecht, NETHERLANDS REFERENCE 2 (bases 1 to 2070) AUTHORS Fournier,B., Saudubray,J.M., Benichou,B., Lyonnet,S., Munnich,A., Clevers,H. and Poll-The,B.T. TITLE Large deletion of the peroxisomal acyl-CoA oxidase gene in pseudoneonatal adrenoleukodystrophy JOURNAL J. Clin. Invest. 94 (2), 526-531 (1994) MEDLINE 94314953 COMMENT Related sequence: J02752. FEATURES Location/Qualifiers source 1..2070 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /cell_type="B cell" 5'UTR 1..13 CDS 14..1996 /codon_start=1 /product="peroxisomal acyl-CoA oxidase" /db_xref="PID:g535032" /translation="MNPDLRRERDSASFNPELLTHILDGSLEKTRRRREIENMILNDP DFQHEDLNFLTRSQRYEVAVRKSAIMVKKMREFGIRDPDEIMWFKNFVHRGRPEPLDL HLGMFLPTLLHQATAEQQERFFMPAWNLEIIGTYAQTEMGHGTHLRGLETTATYDPET QEFILNSPTVTSIKWWPGGLGKTSNHAIVLAQLITKGKCYGLHAFIVPIREIGTHKPL PGITVGDIGPKFGYDEIDNGYLKMDNHRIPRENMLMKYAQVKPDGTYVKPLSNKLTYG TMVFVRSFLVGEAARALSKACTIAIRYSAVRHQSEMKPGEPEPQILDFQTQQYKLFPL LATAYAFQFVGAYMKETYHRINEGIGQGDLSELPELHALTAGLKAFTSWTANTGIEAC RMACGGHGYSHCSGLPNIYVNFTPSCTFEGENTVMMLQTARFLMKSYDQVHSGKLVCG MVSYLNDLPSQRIQPQQVAVWPTMVDINSPESLTEAYKLRAARLVEIAAKNLQKEVIH RKSKEVAWNLTSVDLVRASEAHCHYVVVKLFSEKLLKIQDKAIQAVLRSLCLLYSLYG ISQNAGDFLQGSIMTEPQITQVNQRVKELLTLIRSDAVALVDAFDFQDATLGSVLGRY DGNVYENLFEWAKNSPLNKAEVHESYKHLKSLQSKL" 3'UTR 1994..2070 BASE COUNT 543 a 504 c 518 g 505 t ORIGIN 1 gctggtcgtc gccatgaacc cggacctgcg cagggagcgg gattccgcca gcttcaaccc 61 ggagctgctt acacacatcc tggacggcag cctcgagaaa acccggcgcc gccgagagat 121 cgagaacatg atcctgaacg acccagactt ccagcatgag gacttgaact tcctcactcg 181 cagccagcgt tatgaggtgg ctgtcaggaa aagtgccatc atggtgaaga agatgaggga 241 gtttggcatc cgtgaccctg atgaaattat gtggtttaaa aattttgtgc accgagggcg 301 gcctgagcct ctggatcttc acttgggcat gttcctgccc accttgcttc accaggcaac 361 tgcggagcag caggagcgct tcttcatgcc cgcctggaac ttggagatca ttggcactta 421 tgcccagaca gagatgggtc atggaactca ccttcgaggc ttggaaacca cagccacgta 481 tgaccctgaa acccaggagt tcattctcaa cagtcctact gtgacctcca ttaaatggtg 541 gcctggtggg cttggaaaga cttcaaatca tgcaatagtt cttgcccagc tcatcactaa 601 ggggaaatgc tatggattac atgcctttat cgtacctatt cgtgaaatcg ggacccataa 661 gcctttgcca ggaattaccg ttggtgacat cggccccaaa tttggttatg atgagataga 721 caatggctac ctcaaaatgg acaaccatcg tattcccaga gaaaacatgc tgatgaagta 781 tgcccaggtg aagcctgatg gcacatacgt gaaaccgctg agtaacaagc tgacttacgg 841 gaccatggtg tttgtcaggt ccttccttgt gggagaagct gctcgggctc tgtctaaggc 901 gtgcaccatt gccatccgat acagcgctgt gaggcaccag tctgaaatga agccaggtga 961 accagaacca cagattttgg attttcaaac ccagcagtat aaactctttc cactcctggc 1021 cactgcctat gccttccagt ttgtgggcgc atacatgaag gagacctatc accggattaa 1081 cgaaggcatt ggtcaagggg acctgagtga actgcctgag cttcatgccc tcaccgctgg 1141 actgaaggct ttcacctcct ggactgcaaa cactggcatt gaagcatgtc ggatggcttg 1201 tggtgggcat ggctattctc attgcagtgg tcttccaaat atttatgtca atttcacccc 1261 aagctgtacc tttgagggag aaaacactgt catgatgctc cagacggcta ggttcctgat 1321 gaaaagttat gatcaggtgc actcaggaaa gttggtgtgt ggcatggtgt cctatttgaa 1381 cgacctgccc agtcagcgca tccagccaca gcaggtagca gtctggccaa ccatggtgga 1441 tatcaacagc cccgaaagcc taaccgaagc atataaactc cgtgcagcca gattagtaga 1501 aattgctgca aaaaaccttc aaaaagaagt gattcacaga aaaagcaagg aggtagcttg 1561 gaacctaact tctgttgacc ttgttcgagc aagtgaggca cattgccact atgtggtagt 1621 taagctcttt tcagaaaaac tcctcaaaat tcaagataaa gccattcaag ctgtcttaag 1681 gagtttatgt ctgctgtatt ctctgtatgg aatcagtcag aacgcggggg atttccttca 1741 ggggagcatc atgacagagc ctcagattac acaagtaaac cagcgtgtaa aggagttact 1801 cactctgatt cgctcagatg ctgttgcttt ggttgatgca tttgattttc aggatgcgac 1861 acttggctct gtgcttggcc gctatgatgg gaatgtgtat gaaaacttgt ttgagtgggc 1921 taagaactcc ccactgaaca aagcagaggt ccacgaatct tacaagcacc tgaagtcact 1981 gcagtccaag ctctgaagtg tcacaaggac aagtttaatc tgcttcagaa agcgcctgtg 2041 tgcaactcaa attttgtgga atctttttcg // LOCUS HSPAG 937 bp RNA PRI 01-FEB-1994 DEFINITION H.sapiens mRNA for proliferation-associated gene (pag). ACCESSION X67951 NID g287640 KEYWORDS proliferation associated gene; serum inducible. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 937) AUTHORS Goubin,G.J. TITLE Direct Submission JOURNAL Submitted (10-JUL-1992) G.J. Goubin, Inst. Curie, 26 Rue D'Ulm, 75231 Paris Cedex 05, FRANCE REFERENCE 2 (bases 1 to 937) AUTHORS Prosperi,M.T., Ferbus,D., Karczinski,I. and Goubin,G. TITLE A human cDNA corresponding to a gene overexpressed during cell proliferation encodes a product sharing homology with amoebic and bacterial proteins JOURNAL J. Biol. Chem. 268 (15), 11050-11056 (1993) MEDLINE 93266552 FEATURES Location/Qualifiers source 1..937 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="mammary" /cell_line="HBL100" gene 61..660 /gene="proliferation associated gene (pag)" CDS 61..660 /gene="proliferation associated gene (pag)" /codon_start=1 /db_xref="PID:g287641" /db_xref="SWISS-PROT:Q06830" /translation="MSSGNAKIGHPAPNFKATAVMPDGQFKDISLSDYKGKYVVFFFY PLDFTFVCPTEIIAFSDRAEEFKKLNCQVIGASVDSHFCHLAWVNTPKKQGGLGPMNI PLVSDPKRTIAQDYGVLKADEGISFRGLFIIDDKGILRQITVNDLPVGRSVDETLRLV QAFQFTDKHGEVCPAGWKPGSDTIKPDVQKSKEYFSKQK" polyA_signal 921..926 BASE COUNT 230 a 197 c 236 g 274 t ORIGIN 1 gttcttgcct ggtgtcggtg gttagtttct gcgacttgtg ttgggactgc tgataggaag 61 atgtcttcag gaaatgctaa aattgggcac cctgccccca acttcaaagc cacagctgtt 121 atgccagatg gtcagtttaa agatatcagc ctgtctgact acaaaggaaa atatgttgtg 181 ttcttctttt accctcttga cttcaccttt gtgtgcccca cggagatcat tgctttcagt 241 gatagggcag aagaatttaa gaaactcaac tgccaagtga ttggtgcttc tgtggattct 301 cacttctgtc atctagcatg ggtcaataca cctaagaaac aaggaggact gggacccatg 361 aacattcctt tggtatcaga cccgaagcgc accattgctc aggattatgg ggtcttaaag 421 gctgatgaag gcatctcgtt caggggcctt tttatcattg atgataaggg tattcttcgg 481 cagatcactg taaatgacct ccctgttggc cgctctgtgg atgagacttt gagactagtt 541 caggccttcc agttcactga caaacatggg gaagtgtgcc cagctggctg gaaacctggc 601 agtgatacca tcaagcctga tgtccaaaag agcaaagaat atttctccaa gcagaagtga 661 gcgctgggct gttttagtgc caggctgcgg tgggcagcca tgagaacaaa acctcttctg 721 tatttttttt ttccattagt aaaacacaag acttcagatt cagccgaatt gtggtgtctt 781 acaaggcagg cctttcctac agggggtgga gagaccagcc tttcttcctt tggtaggaat 841 ggcctgagtt ggcgttgtgg gcaggctact ggtttgtatg atgtattagt agagcaaccc 901 attaatcttt tgtagtttgt attaaacttg aactgag // LOCUS HSPAI2R 1900 bp RNA PRI 31-MAR-1995 DEFINITION Human mRNA for Arg-Serpin (plasminogen activator-inhibitor 2, PAI-2). ACCESSION Y00630 NID g35267 KEYWORDS anti-urokinase; plasminogen activator-inhibitor type 2. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1900) AUTHORS Webb,A.C. TITLE Direct Submission JOURNAL Submitted (12-JUN-1987) Andrew C. Webb, Department of Biological Sciences, Wellesley College, Wellesley, MA 02181, USA REFERENCE 2 (bases 1 to 1900) AUTHORS Webb,A.C., Collins,K.L., Snyder,S.E., Alexander,S.J., Rosenwasser,L.J., Eddy,R.L., Shows,T.B. and Auron,P.E. TITLE Human monocyte Arg-Serpin cDNA. Sequence, chromosomal assignment, and homology to plasminogen activator-inhibitor JOURNAL J. Exp. Med. 166 (1), 77-94 (1987) MEDLINE 87252928 COMMENT *source=LPS-stimulated monocytes; clone=pcD-1214 PAI-2 is a member CC of the serine protease inhibitor (serpin) superfamily. It inhibits urokinase-type plasminogen activator. The monocyte derived PAI-2 is distinct from the endothelial cell-derived PAI-1. FEATURES Location/Qualifiers source 1..1900 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 73..138 /note="signal peptide (AA -22 to -1)" CDS 73..1320 /note="PAI-2 precursor (AA -22 to 393)" /codon_start=1 /db_xref="PID:g35268" /db_xref="SWISS-PROT:P05120" /translation="MEDLCVANTLFALNLFKHLAKASPTQNLFLSPWSISSTMAMVYM GSRGSTEDQMAKVLQFNEVGANAVTPMTPENFTSCGFMQQIQKGSYPDAILQAQAADK IHSSFRSLSSAINASTGNYLLESVNKLFGEKSASFREEYIRLCQKYYSSEPQAVDFLE CAEEARKKINSWVKTQTKGKIPNLLPEGSVDGDTRMVLVNAVYFKGKWKTPFEKKLNG LYPFRVNSAQRTPVQMMYLREKLNIGYIEDLKAQILELPYAGDVSMFLLLPDEIADVS TGLELLESEITYDKLNKWTSKDKMAEDEVEVYIPQFKLEEHYELRSILRSMGMEDAFN KGRANFSGMSERNDLFLSEVFHQAMVDVNEEGTEAAAGTGGVMTGRTGHGGPQFVADH PFLFLIMHKITNCILFFGRFSSP" mat_peptide 139..1317 /note="PAI-2 (AA 1 - 393)" misc_feature 295..297 /note="N-glycosylation site" misc_feature 415..417 /note="N-glycosylation site" misc_feature 1087..1089 /note="N-glycosylation site" misc_feature 1210..1212 /note="active site of PAI-2" misc_feature 1568..1577 /note="AT-rich sequence (pot. RNase target)" misc_feature 1597..1609 /note="AT-rich sequence (pot. RNase target)" misc_feature 1647..1703 /note="AT-rich sequence (pot. RNase target)" BASE COUNT 592 a 393 c 380 g 535 t ORIGIN 1 acaactctca gaggagcatt gcccgtcaga cagcaactca gagaataacc agagaacaac 61 cagattgaaa caatggagga tctttgtgtg gcaaacacac tctttgccct caatttattc 121 aagcatctgg caaaagcaag ccccacccag aacctcttcc tctccccatg gagcatctcg 181 tccaccatgg ccatggtcta catgggctcc aggggcagca ccgaagacca gatggccaag 241 gtgcttcagt ttaatgaagt gggagccaat gcagttaccc ccatgactcc agagaacttt 301 accagctgtg ggttcatgca gcagatccag aagggtagtt atcctgatgc gattttgcag 361 gcacaagctg cagataaaat ccattcatcc ttccgctctc tcagctctgc aatcaatgca 421 tccacaggga attatttact ggaaagtgtc aataagctgt ttggtgagaa gtctgcgagc 481 ttccgggaag aatatattcg actctgtcag aaatattact cctcagaacc ccaggcagta 541 gacttcctag aatgtgcaga agaagctaga aaaaagatta attcctgggt caagactcaa 601 accaaaggca aaatcccaaa cttgttacct gaaggttctg tagatgggga taccaggatg 661 gtcctggtga atgctgtcta cttcaaagga aagtggaaaa ctccatttga gaagaaacta 721 aatgggcttt atcctttccg tgtaaactcg gctcagcgca cacctgtaca gatgatgtac 781 ttgcgtgaaa agctaaacat tggatacata gaagacctaa aggctcagat tctagaactc 841 ccatatgctg gagatgttag catgttcttg ttgcttccag atgaaattgc cgatgtgtcc 901 actggcttgg agctgctgga aagtgaaata acctatgaca aactcaacaa gtggaccagc 961 aaagacaaaa tggctgaaga tgaagttgag gtatacatac cccagttcaa attagaagag 1021 cattatgaac tcagatccat tctgagaagc atgggcatgg aggacgcctt caacaaggga 1081 cgggccaatt tctcagggat gtcggagagg aatgacctgt ttctttctga agtgttccac 1141 caagccatgg tggatgtgaa tgaggagggc actgaagcag ccgctggcac aggaggtgtt 1201 atgacaggga gaactggaca tggaggccca cagtttgtgg cagatcatcc ttttcttttt 1261 cttattatgc ataagataac caactgcatt ttatttttcg gcagattttc ctcaccctaa 1321 aactaagcgt gctgcttctg caaaagattt ttgtagatga gctgtgtgcc tcagaattgc 1381 tatttcaaat tgccaaaaat ttagagatgt tttctacata tttctgctct tctgaacaac 1441 ttctgctacc cactaaataa aaacacagaa ataattagac aattgtctat tataacatga 1501 caaccctatt aatcatttgg tcttctaaaa tgggatcatg cccatttaga ttttccttac 1561 tatcagttta tttttataac attaactttt actttgttat ttattatttt atataatggt 1621 gagtttttaa attattgctc actgcctatt taatgtagct aataaagtta tagaagcaga 1681 tgatctgtta atttcctatc taataaatgc ctttaattgt tctcataatg aagaataagt 1741 aggtaccctc catgcccttc tgtaataaat atctggaaaa aacattaaac aataggcaaa 1801 tatatgttat gtgcatttct agaaatacat aacacatata tatgtctgta tcttatattc 1861 aattgcaagt atataataaa taaacctgct tccaaacaac // LOCUS HSPAIP 1587 bp RNA PRI 29-MAR-1996 DEFINITION H.sapiens mRNA for GAIP protein. ACCESSION X91809 NID g1107697 KEYWORDS GAIP protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1587) AUTHORS De Vries,L., Mousli,M., Wurmser,A. and Farquhar,M.G. TITLE GAIP, a protein that specifically interacts with the trimeric G protein G alpha i3, is a member of a protein family with a highly conserved core domain JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (25), 11916-11920 (1995) MEDLINE 96102226 REFERENCE 2 (bases 1 to 1587) AUTHORS De Vries,L. TITLE Direct Submission JOURNAL Submitted (25-SEP-1995) L. De Vries, Univ. of California San Diego, CMM West, 9500 Gilman Drive, La Jolla, CA 92093, USA COMMENT Overlaps with R43864. FEATURES Location/Qualifiers source 1..1587 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="epitelial" /cell_line="HeLa S3" CDS 289..942 /codon_start=1 /product="GAIP" /db_xref="PID:e202294" /db_xref="PID:g1107698" /translation="MPTPHEAEKQITGPEEADRPPSMSSHDTASPAAPSRNPCCLCWC CCCSCSWNQERRRAWQASRESKLQPLPSCEVCATPSPEEVQSWAQSFDKLMHSPAGRS VFRAFLRTEYSEENMLFWLACEELKAEANQHVVDEKARLIYEDYVSILSPKEVSLDSR VREGINKKMQEPSAHTFDDAQLQIYTLMHRDSYPRFLSSPTYRALLLQGPSQSSSEA" polyA_site 1570 BASE COUNT 309 a 524 c 466 g 288 t ORIGIN 1 tggcactagg ggcagccact cagcccttcg agctgtaggg gaggcgctgc gcccccggcc 61 ccagccgatc gggggctcct ggcgctactg ccctccagag tccctcctgc cgtctgactt 121 gagtccctgc tgctgcagtg cacgcccccc tttcggcagc cagggtgggg cccagaccca 181 acctggcccc tcctgccccc accccgcctt tgggtcgctg acccccaggc tgtggtgagg 241 gtctgagagc tggtaccacg gagcctgggc aacctcctcc gcccacccat gcccaccccg 301 catgaggctg agaagcagat cacagggcca gaggaggcgg accggccccc ttcaatgtcc 361 agtcatgata cagcctctcc agcggccccc agccgcaacc cctgctgcct gtgctggtgc 421 tgctgctgta gctgctcctg gaaccaagag cggcggcgcg cgtggcaggc ctcccgggag 481 agcaagctgc agcccctccc cagctgtgaa gtatgtgcca cgccaagtcc tgaggaggtg 541 cagagctggg cgcagtcttt tgacaagctg atgcacagcc cagcgggacg cagcgtgttc 601 cgggcgttcc tgcggacaga gtacagcgag gagaacatgc tcttctggtt ggcctgcgag 661 gagctgaagg ccgaggccaa ccagcatgtg gtagacgaga aggcgaggct catctacgag 721 gactacgtat ccatcctgtc ccccaaggag gtgagcctgg actcccgtgt gcgggagggc 781 atcaacaaga agatgcagga gccgtccgca cacacgttcg acgacgcgca gctgcagatc 841 tacacgctca tgcaccggga ctcctacccc cgcttcctca gctctcccac ctaccgtgcc 901 ctgctgctgc aggggccatc acagtcctcc tccgaggcct aggccgcccc cagcagcaca 961 gaccccgccg cctcctacgg ccgactctgg gttcccttca ggtgttggtg tcggcagggc 1021 tgtcagggca catgcctggc cggctggggt ccccacccgc ggagacggtc ctagacccag 1081 tggggagtgc tgtgtctcct gggtctgccc acccgtggcc aagcaggaac tccaggtgca 1141 gggcgggcct ccaggtgcag ggcgggccag aagccccctc atcggcccgc agctggctgg 1201 cccagtgctc ctggctgggg gctcttgcgt ggtgagagta ggggtccccc caggagccat 1261 gctggaccca tgagcctcgc tggggcctcc tccccaggca gccaatgggc cccagacaag 1321 cctttggtgg ggaacagaac ctccgcatcg tgtagttttg tgacataagg agacctccta 1381 cttgagctgt ctgtacccca gaatcaaaca cagaactcag aaccagattt aggccctcag 1441 aatcctgcac tcaaggtggc aggacccaat ttctgtttta tatgttcatg aactttaaac 1501 ctggaaacat gtccttacta ggtgttttat caaaaaaaag ttttttatta ttgaagatat 1561 ttttaaagca aaaaaaaaaa aaaaaaa // LOCUS HSPAPSSYN 2511 bp RNA PRI 08-DEC-1997 DEFINITION H.sapiens mRNA for PAPS synthetase. ACCESSION Y10387 NID g2673861 KEYWORDS APS kinase; ATP sulfurylase; bifunctional protein; PAPS synthetase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2511) AUTHORS Girard,J.P. and Amalric,F. TITLE cDNA Cloning of the PAPS synthetase from human high endothelial venules, an enzyme required for sulfation of L-selectin ligands JOURNAL Unpublished REFERENCE 2 (bases 1 to 2511) AUTHORS Girard,J.P. TITLE Direct Submission JOURNAL Submitted (08-JAN-1997) J.P. Girard, LBME-CNRS, Molecular Biology, 118 Route de Narbonne, Toulouse, 31062, FRANCE REMARK Revised by author 01-JUL-97 FEATURES Location/Qualifiers source 1..2511 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="HEV endothelial cells" /cell_line="purified primary HEV endothelial cells" /clone="18a1" /dev_stage="adult" /tissue_type="tonsil" CDS 37..1911 /function="bifunctional ATP sulfurylase /APS kinase" /codon_start=1 /product="PAPS sunthetase" /db_xref="PID:e1204135" /db_xref="PID:g2673862" /translation="MEIPGSLCKKVKLSNNAQNWGMQRATNVTYQAHHVSRNKRGQVV GTRGGFRGCTVWLTGLSGAGKTTVSMALEEYLVCHGIPCYTLDGDNIRQGLNKNLGFS PEDREENVRRIAEVAKLFADAGLVCITSFISPYTQDRNNARQIHEGASLPFFEVFVDA PLHVCEQRDVKGLYKKARAGEIKGFTGIDSEYEKPEAPELVLKTDSCDVNDCVQQVVE LLQERDIVPVDASYEVKELYVPENKLHLAKTDAETLPALKINKVDMQWVQVLAEGWAT PLNGFMREREYLQCLHFDCLLDGGVINLSVPIVLTATHEDKERLDGCTAFALMYEGRR VAILRNPEFFEHRKEERCARQWGTTCKNHPYIKMVMEQGDWLIGGDLQVLDRVYWNDG LDQYRLTPTELKQKFKDMNADAVFAFQLRNPVHNGHALLMQDTHKQLLERGYRRPVLL LHPLGAWTKDDDVPLMWRMKQHAAVLEEGVLNPETTVVAIFPSPMMYAGPTEVQWHCR ARMVAGANFYIVGRDPAGMPHPETGKDLYEPSHGAKVLTMAPGLITLEIVPFRVAAYN KKKKRMDYYDSEHHEDFEFISGTRMRKLAREGQKPPEGFMAPKAWTVLTEYYKSLEKA " polyA_signal 2469..2474 BASE COUNT 708 a 499 c 592 g 712 t ORIGIN 1 cgcagagaac cccggctgct cagcgcgctc cgggtcatgg agatccccgg gagcttgtgc 61 aagaaagtca agctgagcaa taacgcgcag aactggggaa tgcagagagc aaccaatgtc 121 acctaccaag cccatcatgt cagcaggaac aagagaggtc aggtggtggg gaccagaggt 181 ggctttcgtg gttgcacagt ttggctaaca ggcttgtctg gagcgggaaa gactactgtg 241 agcatggcct tggaggagta cctggtttgt catggtattc catgctacac tctggatggt 301 gacaatattc gtcaaggtct caataaaaat cttggcttta gtcctgaaga cagagaagag 361 aatgttcgac gcatcgcaga agttgctaaa ctgtttgcag atgctggctt agtgtgcatc 421 acaagtttca tatcacctta cactcaggat cgcaacaatg caaggcaaat tcatgaaggt 481 gcaagtttac cgttttttga agtatttgtt gatgctcctc tgcatgtttg tgaacagagg 541 gatgtcaaag gactctacaa aaaagcccgg gcaggagaaa ttaaaggttt cactgggatc 601 gattctgaat atgaaaagcc agaggcccct gagttggtgc tgaaaacaga ctcctgtgat 661 gtaaatgact gtgtccagca agttgtggaa cttctacagg aacgggatat tgtacctgtg 721 gatgcatctt atgaagtaaa agaactatat gtgccagaaa ataaacttca tttggcaaaa 781 acagatgcgg aaacattacc agcactgaaa attaataaag tggatatgca gtgggtgcag 841 gttttggcag aaggttgggc aaccccattg aatggcttta tgagagagag ggagtacttg 901 cagtgccttc attttgattg tcttctggat ggaggtgtca ttaacttgtc agtacctata 961 gttctgactg cgactcatga agataaagag aggctggacg gctgtacagc atttgctctg 1021 atgtatgagg gccgccgtgt ggccattctt cgcaatccag agttttttga gcacaggaaa 1081 gaggagcgct gtgccagaca gtggggaacg acatgcaaga accaccccta tattaagatg 1141 gtgatggaac aaggagattg gctgattgga ggagatcttc aagtcttgga tcgagtttat 1201 tggaatgatg gtcttgatca gtatcgtctt actcctactg agctaaagca gaaatttaaa 1261 gatatgaatg ctgatgctgt ctttgcattt caactacgca acccagtgca caatggacat 1321 gccctgttaa tgcaggatac ccataagcaa cttctagaga ggggctaccg gcgccctgtc 1381 ctcctcctcc accctctggg tgcttggaca aaggatgacg atgttccttt gatgtggcgt 1441 atgaagcagc atgctgcagt gttggaggaa ggagttctga atcctgagac gacagtggtg 1501 gccatcttcc catctcccat gatgtatgct ggaccaactg aggtccagtg gcattgcaga 1561 gcacggatgg ttgcaggagc caacttttac attgttggac gagaccctgc tggcatgcct 1621 catccagaaa cagggaagga tctttatgag ccaagtcatg gtgccaaagt gctgacgatg 1681 gcccctggtt taatcacttt ggaaatagtt ccctttcgag ttgcagctta caacaagaaa 1741 aagaagcgta tggactacta tgactctgaa caccatgaag actttgaatt tatttcagga 1801 acacgaatgc gcaaacttgc tcgagaaggc cagaaaccac ctgaaggttt catggctccc 1861 aaggcttgga ccgtgctgac agaatactac aaatccttgg agaaagctta ggctgttaac 1921 ccagtcactc cacctttgac acattactag taacaagagg ggaccacata gtctctgttg 1981 gcatttcttt gtggtgtctg tctggacatg cttcctaaaa acagaccatt ttccttaact 2041 tgcatcagtt ttggtctgcc ttatgagttc tgttttgaac aagtgtaaca cactgatggt 2101 tttaatgtat cttttccact tattatagtt atattcctac aatacaattt taaaattgtc 2161 tttttatatt atatttatgc ttctgtgtca tgattttttc aagctgttat attagttgta 2221 accagtagta ttcacattaa atcttgcttt ttttcccctt aaaaaaagaa aaaaattacc 2281 aaacaataaa cttggctaga ccttgttttg aggattttac aagacctttg tagcgattag 2341 attttttttc tacattgaaa atagaaactg cttcctttct tctttccagt cagctattgg 2401 tctttccagc tgttataatc taaagtattc ttatgatctg tgtaagctct gaatgaactt 2461 ctttactcaa taaaattaat tttttggctt cttaaaaaaa aaaaaaaaaa a // LOCUS HSPARTC1 3236 bp RNA PRI 07-MAY-1996 DEFINITION H.sapiens partial C1 mRNA. ACCESSION X78817 NID g840785 KEYWORDS c1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 349) AUTHORS Tribioli,C., Mancini,M., Plassart,E., Bione,S., Rivella,S., Sala,C., Torri,G. and Toniolo,D. TITLE Isolation of new genes in distal Xq28: transcriptional map and identification of a human homologue of the ARD1 N-acetyl transferase of Saccharomyces cerevisiae JOURNAL Hum. Mol. Genet. 3 (7), 1061-1067 (1994) MEDLINE 95072568 REFERENCE 2 (bases 1 to 3236) AUTHORS Tribioli,C. TITLE Direct Submission JOURNAL Submitted (15-APR-1994) D. Toniolo, Istituto di Gentica Biochimica, ed Evoluzioinistica, C.N.R., Via Abbiategrasso 207, 27100 Pavia, ITALY REFERENCE 3 (bases 1 to 3236) AUTHORS Tribioli,C., Droetto,S., Bione,S., Cesareni,G., Torrisi,M.R., Lotti,L.V., Lanfrancone,L., Toniolo,D. and Pelicci,P. TITLE An X chromosome-linked gene encoding a protein with characteristics of a rhoGAP predominantly expressed in hematopoietic cells JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (2), 695-699 (1996) MEDLINE 96149366 FEATURES Location/Qualifiers source 1..3236 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="human total embryo cDNA library" /clone="E1,E2" /chromosome="X" /map="q28" misc_feature 38..46 /note="Kozack consensus" gene 43..3216 /gene="C1" CDS 43..2883 /gene="C1" /codon_start=1 /product="p115" /db_xref="PID:g840786" /translation="MAAHGKLRRERGLQAEYETQVKEMRWQLSEQLRCLELQGELRRE LLQELAEFMRRRAEVELEYSRGLEKLAERFSSRGGRLGSSREHQSFRKEPSLLSPLHC WAVLLQHTRQQSRESAALSEVLAGPLAQRLSHIAEDVGRLVKKSRDLEQQLQDELLEV VSELQTAKKTYQAYHMESVNAEAKLREAERQEEKRAGRSVPTTTAGATEAGPLRKSSL KKGGRLVEKRQAKFMEHKLKCTKARNEYLLSLASVNAAVSNYYLHDVLDLMDCCDTGF HLALGQVLRSYTAAESRTQASQVQGLGSLEEAVEALDPPGDKAKVLEVHATVFCPPLR FDYHPHDGDEVAEICVEMELRDEILPRAQNIQSRLDRQTIETEEVNKTLKATLQALLE VVASDDGDVLDSFQTSPSTESLKSTSSDPGSRQAGRRRGQQQETETFYLTKLQEYLSG RSILAKLQAKHEKLQEALQRGDKEEQEVSWTQYTQRKFQKSRQPRPSSQYNQRLFGGD MEKFIQSSGQPVPLVVESCIRFINLNGLQHEGIFRVSGAQLRVSEIRDAFERGEDPLV EGCTAHDLDSVAGVLKLYFRSLEPPLFPPDLFGELLASSELEDTAERVEHVSRLLWRL PAPVLVVLRYLFTFLNHLAQYSDENMMDPYNLAVCFGPTLLPVPAGQDPVALQGRVNQ LVQTLIVQPDRVFPPLTSLPGPVYEKCMAPPSASCLGDAQLESLGADNDPELEAEMPA QEDDLEGVVEAVACFAYTGRTAQELSFRRGDVLRLHERASSDWWRGEHNGMRGLIPHK YITLPAGTEKQVVGAGLQTAGESGSSPEGLLASELVHRPEPCTSPEAMGPSGHRRRCL VPASPEQHVEVDKAVAQNMDSVFKELLGKTSVRQGLGPASTTSPSPGPRSPKAPPSSR LGRNKGFSRGPGAPASPSASHPQGLDTTPKPH" polyA_site 3212..3216 /gene="C1" BASE COUNT 611 a 1012 c 1103 g 510 t ORIGIN 1 cgtgggagca gtggggttcg acggcgcggc cgcgaggccg ccatggccgc tcacgggaag 61 ctgcggcggg agcgggggct gcaggctgag tatgagacgc aagtcaaaga gatgcgctgg 121 cagctgagcg agcagctgcg ctgcctggag ctgcagggcg agctgcggcg ggagttgctg 181 caggagctgg cagagttcat gcggcgccgc gctgaggtgg agctggaata ctcccggggc 241 ctggaaaagc tggccgagcg cttctccagc cgtggaggcc gcctggggag cagccgggag 301 caccaaagct tccggaagga gccgtccctc ctgtcgccct tgcactgctg ggcggtgctg 361 ctgcagcaca cgcggcagca gagccgggag agcgcggccc tgagtgaggt gctggccggg 421 cccctggccc agcgcctgag tcacattgca gaggacgtgg ggcgcctggt caagaagagc 481 agggatctgg agcagcagct gcaggatgag ctcctggagg tggtctcaga gctccagacg 541 gccaagaaga cgtaccaggc atatcacatg gagagcgtga atgccgaggc caagctccgg 601 gaggccgagc ggcaggagga gaagcgggca ggccggagtg tccccaccac caccgctggt 661 gccactgagg cagggcccct ccgcaagagc tccctcaaga agggagggag gctggtggag 721 aagcggcagg ccaagttcat ggagcacaaa ctcaagtgca caaaggcgcg caacgagtac 781 ctgcttagcc tggctagtgt caacgctgct gtcagtaact actacctgca tgacgtcttg 841 gacctcatgg actgctgtga cacagggttc cacctggccc tggggcaggt gctccggagc 901 tacacggccg ctgagagccg cacccaagcc tcccaagtgc agggcctggg cagcctggaa 961 gaagctgtgg aggccctgga tcctccaggg gacaaagcca aggttctcga ggtgcatgct 1021 accgtcttct gtcccccgct gcgctttgac taccaccccc atgatgggga tgaggtggct 1081 gagatctgcg ttgaaatgga gctgcgggac gagattctgc ccagagccca gaacatccag 1141 agccgcctgg accgacagac cattgagaca gaggaggtga acaagactct gaaggcgaca 1201 ctgcaggccc tgctggaggt ggtggcctcg gatgacgggg atgtgcttga ttccttccag 1261 accagcccct ccaccgagtc cctcaagtcc accagctcag acccaggcag ccggcaggcg 1321 ggccggaggc gcggccagca gcaggagacc gaaaccttct acctcacgaa gctccaggag 1381 tatctgagtg gacggagcat cctcgccaag ctgcaggcca agcacgagaa gctgcaggag 1441 gcccttcagc gaggtgacaa ggaggagcag gaggtgtctt ggacccagta cacacagaga 1501 aaattccaga agagccgcca gccccgcccc agctcccagt ataaccagag actctttggg 1561 ggagacatgg agaagtttat ccagagctca ggccagcctg tgcccctggt ggtggagagc 1621 tgcattcgct tcatcaacct caatggcctg cagcatgaag gcatcttccg ggtatcgggt 1681 gcccagctcc gggtctcaga gatccgtgat gccttcgaga gaggggagga cccactggtg 1741 gagggctgca ctgcccatga cctggactcg gtggccgggg tgctgaagct ctacttccgg 1801 agcctggagc ccccactctt ccccccagac ctgttcggcg agctgctggc ttcttcggag 1861 ctggaggaca cagcggagag ggtggagcac gtgagccgcc tgctgtggcg gctgcccgcg 1921 ccggtgctgg tggttctgcg ctacctcttc accttcctca accacctggc ccagtacagc 1981 gatgagaaca tgatggaccc ctacaacctg gccgtgtgct tcgggcccac gctgctaccg 2041 gtgcccgctg ggcaggaccc ggtggcgctg cagggccggg tgaaccagct ggtgcagacg 2101 ctcatagtgc agcccgatcg ggtcttcccg cccctgacct cgctgcctgg ccccgtctac 2161 gagaagtgca tggcaccgcc ttccgccagc tgcctggggg acgcccagct ggagagcctg 2221 ggggcggaca atgatccgga gctggaagcc gagatgcccg cacaggagga tgacctggag 2281 ggggtcgtgg aggctgtggc ctgctttgcc tacacgggcc gcacagccca ggagctgagc 2341 ttccggcggg gggacgtact gcggctgcac gagagggcct cgagcgactg gtggcggggg 2401 gagcacaacg gcatgcgggg cctcatcccc cacaagtata tcacgctgcc cgccgggacg 2461 gagaagcagg tggtgggcgc agggctgcag actgcagggg agtctgggag cagtcccgag 2521 ggcctcctgg catcggagct ggtccaccgg ccagagccat gcacctcacc tgaggccatg 2581 ggaccctctg gacacagacg acgctgcttg gtcccagcct ccccagagca acacgtggag 2641 gtggataagg ctgtggcaca gaacatggac tctgtgttta aggagctctt gggaaagacc 2701 tctgtccgcc agggccttgg gccagcatct accacctctc ccagtcctgg gccccgaagc 2761 ccaaaggccc cgcccagcag ccgcctgggc aggaacaaag gcttctcccg gggccctggg 2821 gccccagcct caccctcagc ttcccacccc cagggcctag acacgacccc caagccacac 2881 tgaggtgccg ctgctggaga tgcgtgcccc cggcggctac ccgctggacc ggccactctc 2941 cccagccccc ttgcttctct ccagccctgt ccagcaagtg cagggtgcct gcacttcacc 3001 ctgtgcagag aggtgggatg gggccgtgca cacagggtat gcccgctcca catcctgcct 3061 gcccctcagc cctggcccag gccccttgtg gaggcagctg aggaaggatg ctggggaaag 3121 ccctcttctg cagctttgtg gaaggctgat cagtggctgc tgggtggcgg gtacccttgc 3181 tcagatgcct ggcagggctg ggtggcgatt cataaagacc tcgtgttgat tccccg // LOCUS HSPARVAL 424 bp RNA PRI 05-OCT-1993 DEFINITION H.sapiens mRNA for parvalbumin. ACCESSION X63070 NID g35289 KEYWORDS calcium binding protein; parvalbumin gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 424) AUTHORS Heizmann,C.W. TITLE Direct Submission JOURNAL Submitted (25-OCT-1991) C.W. Heizmann, Division of Clinical Chemistry, Dept of Pediatrics, University of Zuerich, Steinwiesstr 75, CH-8032 Zuerich, SWITZERLAND REFERENCE 2 (bases 1 to 424) AUTHORS Fohr,U.G., Weber,B.R., Muntener,M., Staudenmann,W., Hughes,G.J., Frutiger,S., Banville,D., Schafer,B.W. and Heizmann,C.W. TITLE Human alpha and beta parvalbumins. Structure and tissue-specific expression JOURNAL Eur. J. Biochem. 215 (3), 719-727 (1993) MEDLINE 93358895 COMMENT Related sequence: Berchtold,M.W., J.Mol.Biol. 210:417 (1989). FEATURES Location/Qualifiers source 1..424 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="cerebellum" gene 57..389 /gene="parvalbumin" CDS 57..389 /gene="parvalbumin" /codon_start=1 /db_xref="PID:g35290" /db_xref="SWISS-PROT:P20472" /translation="MSMTDLLNAEDIKKAVGAFSATDSFDHKKFFQMVGLKKKSADDV KKVFHMLDKDKSGFIEEDELGFILKGFSPDARDLSAKETKMLMAAGDKDGDGKIGVDE FSTLVAES" BASE COUNT 113 a 109 c 116 g 86 t ORIGIN 1 accagcccag cctttcagtg caggctccag ccctccaccc ccacccgagt tgcaggatgt 61 cgatgacaga cttgctgaac gctgaggaca tcaagaaggc ggtgggagcc tttagcgcta 121 ccgactcctt cgaccacaaa aagttcttcc aaatggtcgg cctgaagaaa aagagtgcgg 181 atgatgtgaa gaaggtgttt cacatgctgg acaaggacaa aagtggcttc atcgaggagg 241 atgagctggg attcatccta aaaggcttct ccccagatgc cagagacctg tctgctaaag 301 aaaccaagat gctgatggct gctggagaca aagatgggga cggcaaaatt ggggttgacg 361 aattctccac tctggtggct gaaagctaag aagcactgac tgcccctggt cttccacctc 421 tctg // LOCUS HSPAX7M 2272 bp RNA PRI 27-OCT-1997 DEFINITION H.sapiens mRNA for paired box containing transcription factor, PAX7. ACCESSION X96743 NID g2570020 KEYWORDS developmental control gene; paired box; PAX7 gene; transcription factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2272) AUTHORS Vorobyov,E., Mertsalov,I., Dockhorn-Dworniczak,B., Dworniczak,B. and Horst,J. TITLE The genomic organization and the full coding region of the human PAX7 gene JOURNAL Genomics 45 (1), 168-174 (1997) MEDLINE 97480728 REFERENCE 2 (bases 1 to 2272) AUTHORS Vorobyov,E.V. TITLE Direct Submission JOURNAL Submitted (21-MAR-1996) E.V. Vorobyov, Westf. Wilhelms-Universitaet Muenster, Institiut fuer Humangenetic, Vesaliusweg 12-14, 48149, Muenster, FRG REMARK revised by submitter 05-MAR-1997 COMMENT Related entries: X15043, X15252, X15253, Z35141, Z63842-Z64432. FEATURES Location/Qualifiers source 1..2272 /organism="Homo sapiens" /isolate="patient 282A" /db_xref="taxon:9606" /tissue_type="alveolar rhabdomyosarcoma tumour, containing t(2;13)" /cell_line="ARMS282A" /clone="P7" /map="1p36.2" gene 600..2162 /gene="PAX7" CDS 600..2162 /gene="PAX7" /codon_start=1 /product="paired box containing transcription factor" /db_xref="PID:e304684" /db_xref="PID:g2570021" /translation="MAALPGTVPRMMRPAPGQNYPRTGFPLEVSTPLGQGRVNQLGGV FINGRPLPNHIRHKIVEMAHHGIRPCVISRQLRVSHGCVSKILCRYQETGSIRPGAIG GSKPRQVATPDVEKKIEEYKRENPGMFSWEIRDRLLKDGHCDRSTVPSGLVSSISRVL RIKFGKKEEEDEADKKEDDGEKKAKHSIDGILGDKGNRLDEGSDVESEPDLPLKRKQR RSRTTFTAEQLEELEKAFERTHYPDIYTREELAQRTKLTEARVQVWFSNRRARWRKQA GANQLAAFNHLLPGGFPPTGMPTLPPYQLPDSTYPTTTISQDGGSTVHRPQPLPPSTM HQGGLAAAAAAADTSSAYGARHSFSSYSDSFMNPAAPSNHMNPVSNGLSPQVMSILGN PSAVPPQPQADFSISPLHGGLDSATSISASCSQRADSIKPGDSLPTSQAYCPPTYSTT GYSVDPVAGYQYGQYGQSECLVPWASPVPIPSPTPRASCLFMESYKVVSGWGMSISQM EKLKSSQMEQFT" polyA_signal 2237..2242 BASE COUNT 512 a 715 c 686 g 359 t ORIGIN 1 gaaagctggt gtggagggag aagcgagtgt ggtccggaga aagaaggcgt ggagaagagg 61 gagggagcga gagcgagaga ataaatatat aaataaatac gagaacgaaa tccactccgc 121 agtctccggg ctcggaaact ttggccccga gcgccagagc gccagagcgc gagagcgcgg 181 cgctcgccac tctgaggctg gcggcctcga ttccggccgc gttcccccgg cccccctccg 241 tccgcggggc ctggtctccg ggttctgcca ggcgcatcag cccgcacaac ttctggccga 301 ggccagccgg cagaggcgga cttggggttg gagtgtttgt ttgtttgaac ttcctcgtcg 361 tcgccacctt ccctcccccc aacctccacc ccacctcacc cccctcccca gcttctggac 421 gcgtttgact gcagccaggg gtggggggtg ggggtaggga gtgtgtgtgg aggggaggga 481 gaagaggtta aaaaaaagaa gacgaagaag acggaaagaa agagatcgca gcaggggtga 541 agggagcgga cgggaagcga tttttgccga ctttggattc gtccccggcg tgcgcaagaa 601 tggcggccct tcccggcacg gtaccgagaa tgatgcggcc ggctccgggg cagaactacc 661 cccgcacggg attccctttg gaagtgtcca ccccgcttgg ccaaggccgg gtcaatcagc 721 tgggaggggt cttcatcaat gggcgacccc tgcctaacca catccgccac aagatagtgg 781 agatggccca ccatggcatc cggccctgtg tcatctcccg acagctgcgt gtctcccacg 841 gctgcgtctc caagattctt tgccgctacc aggagaccgg gtccatccgg cctggggcca 901 tcggcggcag caagcccaga caggtggcga ctccggatgt agagaaaaag attgaggagt 961 acaagaggga aaacccaggc atgttcagct gggagatccg ggacaggctg ctgaaggatg 1021 ggcactgtga ccgaagcact gtgccctcag gtttagtgag ttcgattagc cgcgtgctca 1081 gaatcaagtt cgggaagaaa gaggaggagg atgaagcgga caagaaggag gacgacggcg 1141 aaaagaaggc caaacacagc atcgacggca tcctgggcga caaagggaac cggctggacg 1201 agggctcgga tgtggagtcg gaacctgacc tcccactgaa gcgcaagcag cgacgcagtc 1261 ggaccacatt cacggccgag cagctggagg agctggagaa ggcctttgag aggacccact 1321 acccagacat atacacccgc gaggagctgg cgcagaggac caagctgaca gaggcgcgtg 1381 tgcaggtctg gttcagtaac cgccgcgccc gttggcgtaa gcaggcagga gccaaccagc 1441 tggcggcgtt caaccacctt ctgccaggag gcttcccacc caccggcatg cccacgctgc 1501 ccccctacca gctgccggac tccacctacc ccaccaccac catctcccaa gatgggggca 1561 gcactgtgca ccggcctcag cccctgccac cgtccaccat gcaccagggc gggctggctg 1621 cagcggctgc agccgccgac accagctctg cctacggagc ccgccacagc ttctccagct 1681 actctgacag cttcatgaat ccggcggcgc cctccaacca catgaacccg gtcagcaacg 1741 gcctgtctcc tcaggtgatg agcatcttgg gcaaccccag tgcggtgccc ccgcagccac 1801 aggctgactt ctccatctcc ccgctgcatg gcggcctgga ctcggccacc tccatctcag 1861 ccagctgcag ccagcgggcc gactccatca agccagggga cagcctgccc acctcccagg 1921 cctactgccc acccacctac agcaccaccg gctacagcgt ggaccccgtg gccggctatc 1981 agtacggcca gtacggccag agtgagtgcc tggtgccctg ggcgtccccc gtccccattc 2041 cttctcccac ccccagggcc tcctgcttgt ttatggagag ctacaaggtg gtgtcagggt 2101 ggggaatgtc catttcacag atggaaaaat tgaagtccag ccagatggaa cagttcacct 2161 aaaatgacac tgagttgggc aaaacccagg acatctcctg gctaagcctc tgcttccgta 2221 ctatggctcc aacagaaata aaatacacag cacaaatatc aaaaaaaaaa aa // LOCUS HSPBEF 2376 bp mRNA PRI 04-MAY-1994 DEFINITION Human pre-B cell enhancing factor (PBEF) mRNA, complete cds. ACCESSION U02020 NID g404012 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2376) AUTHORS Samal,B., Sun,Y., Stearns,G., Xie,C., Suggs,S. and McNiece,I. TITLE Cloning and characterization of the cDNA encoding a novel human pre-B-cell colony-enhancing factor JOURNAL Mol. Cell. Biol. 14, 1431-1437 (1994) MEDLINE 94119094 REFERENCE 2 (bases 1 to 2376) AUTHORS Samal,B.B. TITLE Direct Submission JOURNAL Submitted (21-SEP-1993) Samal B.B., Amgen Inc, Developmental Biology, Amgen Center, Thousand Oaks, CA 91320 USA FEATURES Location/Qualifiers source 1..2376 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="p64 cDNA" /clone_lib="human PWM activated lymphocyte oligo(dT) and random-primed cDNA" /cell_line="Hut 78" /cell_type="lymphocytes" /tissue_type="blood" /dev_stage="adult" mRNA 1..2376 /partial /gene="PBEF" /standard_name="PBEF" /evidence=experimental /product="pre-B cell enhancing factor" 5'UTR 1..27 /partial /gene="PBEF" /evidence=experimental gene 1..2376 /gene="PBEF" CDS 28..1503 /gene="PBEF" /standard_name="PBEF" /codon_start=1 /function="pre-B cell colony enhancement" /evidence=experimental /product="pre-B cell enhancing factor" /db_xref="PID:g404013" /translation="MNPAAEAEFNILLATDSYKVTHYKQYPPNTSKVYSYFECREKKT ENSKLRKVKYEETVFYGLQYILNKYLKGKVVTKEKIQEAKDVYKEHFQDDVFNEKGWN YILEKYDGHLPIEIKAVPEGFVIPRGNVLFTVENTDPECYWLTNWIETILVQSWYPIT VATNSREQKKILAKYLLETSGNLDGLEYKLHDFGYRGVSSQETAGIGASAHLVNFKGT DTVAGLALIKKYYGTKDPVPGYSVPAAEHSTITAWGKDHEKDAFEHIVTQFSSVPVSV VSDSYDIYNACEKIWGEDLRHLIVSRSTQAPLIIRPDSGNPLDTVLKVLEILGKKFPV TENSKGYKLLPPYLRVIQGDGVDINTLQEIVEGMKQKMWSIENIAFGSGGGLLQKLTR DLLNCSFKCSYVVTNGLGINVFKDPVADPNKRSKKGRLSLHRTPAGNFVTLEEGKGDL EEYGQDLLHTVFKNGKVTKSYSFDEIRKNAQLNIELEAAHH" sig_peptide 28..105 /gene="PBEF" /standard_name="PBEF" /evidence=experimental /product="pre-B cell enhancing factor" mat_peptide 106..1500 /gene="PBEF" /standard_name="PBEF" /evidence=experimental /product="pre-B cell enhancing factor" 3'UTR 1504..2376 /partial /gene="PBEF" /evidence=experimental polyA_signal 2013..2018 /gene="PBEF" /evidence=experimental polyA_site 2032..2076 /gene="PBEF" /evidence=experimental BASE COUNT 775 a 395 c 481 g 725 t ORIGIN 1 cgcgcggccc ctgtcctccg gcccgagatg aatcctgcgg cagaagccga gttcaacatc 61 ctcctggcca ccgactccta caaggttact cactataaac aatatccacc caacacaagc 121 aaagtttatt cctactttga atgccgtgaa aagaagacag aaaactccaa attaaggaag 181 gtgaaatatg aggaaacagt attttatggg ttgcagtaca ttcttaataa gtacttaaaa 241 ggtaaagtag taaccaaaga gaaaatccag gaagccaaag atgtctacaa agaacatttc 301 caagatgatg tctttaatga aaagggatgg aactacattc ttgagaagta tgatgggcat 361 cttccaatag aaataaaagc tgttcctgag ggctttgtca ttcccagagg aaatgttctc 421 ttcacggtgg aaaacacaga tccagagtgt tactggctta caaattggat tgagactatt 481 cttgttcagt cctggtatcc aatcacagtg gccacaaatt ctagagagca gaagaaaata 541 ttggccaaat atttgttaga aacttctggt aacttagatg gtctggaata caagttacat 601 gattttggct acagaggagt ctcttcccaa gagactgctg gcataggagc atctgctcac 661 ttggttaact tcaaaggaac agatacagta gcaggacttg ctctaattaa aaaatattat 721 ggaacgaaag atcctgttcc aggctattct gttccagcag cagaacacag taccataaca 781 gcttggggga aagaccatga aaaagatgct tttgaacata ttgtaacaca gttttcatca 841 gtgcctgtat ctgtggtcag cgatagctat gacatttata atgcgtgtga gaaaatatgg 901 ggtgaagatc taagacattt aatagtatcg agaagtacac aggcaccact aataatcaga 961 cctgattctg gaaaccctct tgacactgtg ttaaaggttt tggagatttt aggtaagaag 1021 tttcctgtta ctgagaactc aaagggttac aagttgctgc caccttatct tagagttatt 1081 caaggggatg gagtagatat taatacctta caagagattg tagaaggcat gaaacaaaaa 1141 atgtggagta ttgaaaatat tgccttcggt tctggtggag gtttgctaca gaagttgaca 1201 agagatctct tgaattgttc cttcaagtgt agctatgttg taactaatgg ccttgggatt 1261 aacgtcttca aggacccagt tgctgatccc aacaaaaggt ccaaaaaggg ccgattatct 1321 ttacatagga cgccagcagg gaattttgtt acactggagg aaggaaaagg agaccttgag 1381 gaatatggtc aggatcttct ccatactgtc ttcaagaatg gcaaggtgac aaaaagctat 1441 tcatttgatg aaataagaaa aaatgcacag ctgaatattg aactggaagc agcacatcat 1501 taggctttat gactgggtgt gtgttgtgtg tatgtaatac ataatgttta ttgtacagat 1561 gtgtggggtt tgtgttttat gatacattac agccaaatta tttgttggtt tatggacata 1621 ctgccctttc attttttttc ttttccagtg tttaggtgat ctcaaattag gaaatgcatt 1681 taaccatgta aaagatgagt gctaaagtaa gctttttagg gccctttgcc aataggtagt 1741 cattcaatct ggtattgatc ttttcacaaa taacagaact gagaaacttt tatatataac 1801 tgatgatcac ataaaacaga tttgcataaa attaccatga ttgctttatg tttatattta 1861 acttgtattt ttgtacaaac aagattgtgt aagatatatt tgaagtttca gtgatttaac 1921 agtctttcca acttttcatg atttttatga gcacagactt tcaagaaaat acttgaaaat 1981 aaattacatt gccttttgtc cattaatcag caaataaaac atggccttaa caaagttgtt 2041 tgtgttattg tacaatttga aaattatgtc gggacatacc ctatagaatt actaacctta 2101 ctgccccttg tagaatatgt attaatcatt ctacattaaa gaaaataatg gttcttactg 2161 gaatgtctag gcactgtaca gttattatat atcttggttg ttgtattgta ccagtgaaat 2221 gccaaatttg aaaggcctgt actgcaattt tatatgtcag agattgcctg tggctctaat 2281 atgcacctca agattttaag gagataatgt ttttagagag aatttctgct tccactatag 2341 aatatataca taaatgtaaa atacttacaa aagtgg // LOCUS HSPBGDR 1380 bp RNA PRI 06-SEP-1995 DEFINITION Human mRNA for porphobilinogen deaminase (PBG-D, EC 4.3.1.8). ACCESSION X04217 NID g35306 KEYWORDS deaminase; porphobilinogen deaminase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1380) AUTHORS Raich,N., Romeo,P.H., Dubart,A., Beaupain,D., Cohen-Solal,M. and Goossens,M. TITLE Molecular cloning and complete primary sequence of human erythrocyte porphobilinogen deaminase JOURNAL Nucleic Acids Res. 14 (15), 5955-5968 (1986) MEDLINE 86312872 COMMENT Porphobilinogen deaminase is the third enzyme of the heme biosynthetic pathway. Deficiency of this enzyme leads to the dominant hereditary disease Acute Intermittent Porphyria (AIP). FEATURES Location/Qualifiers source 1..1380 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 82..1116 /note="PBG-D (aa 1-344)" /codon_start=1 /db_xref="PID:g35307" /db_xref="SWISS-PROT:P08397" /translation="MRVIRVGTRKSQLARIQTDSVVATLKASYPGLQFEIIAMSTTGD KILDTALSKIGEKSLFTKELEHALEKNEVDLVVHSLKDLPTVLPPGFTIGAICKRENP HDAVVFHPKFVGKTLETLPEKSVVGTSSLRRAAQLQRKFPHLEFRSIRGNLNTRLRKL DEQQEFSAIILATAGLQRMGWHNRVGQILHPEECMYAVGQGALGVEVRAKDQDILDLV GVLHDPETLLRCIAERAFLRHLEGGCSVPVAVHTAMKDGQLYLTGGVWSLDGSDSIQE TMQATIHVPAQHEDGPEDDPQLVGITARNIPRGPQLAAQNLGISLANLLLSKGAKTIL DVARQLNDAH" misc_feature 1361..1366 /note="pot. polyA signal" polyA_site 1380 /note="polyA site" BASE COUNT 322 a 368 c 390 g 300 t ORIGIN 1 agcaggtcct actatcgcct ccctctagtc tctgcttctt tggatccctg aggagggcag 61 aaggaagaaa acagcccaaa gatgagagtg attcgcgtgg gtacccgcaa gagccagctt 121 gctcgcatac agacggacag tgtggtggca acattgaaag cctcgtaccc tggcctgcag 181 tttgaaatca ttgctatgtc caccacaggg gacaagattc ttgatactgc actctctaag 241 attggagaga aaagcctgtt taccaaggag cttgaacatg ccctggagaa gaatgaagtg 301 gacctggttg ttcactcctt gaaggacctg cccactgtgc ttcctcctgg cttcaccatc 361 ggagccatct gcaagcggga aaaccctcat gatgctgttg tctttcaccc aaaatttgtt 421 gggaagaccc tagaaaccct gccagagaag agtgtggtgg gaaccagctc cctgcgaaga 481 gcagcccagc tgcagagaaa gttcccgcat ctggagttca ggagtattcg gggaaacctc 541 aacacccggc ttcggaagct ggacgagcag caggagttca gtgccatcat cctagcaaca 601 gctggcctgc agcgcatggg ctggcacaac cgggttgggc agatcctgca ccctgaggaa 661 tgcatgtatg ctgtgggcca gggggccttg ggcgtggaag tgcgagccaa ggaccaggac 721 atcttggatc tggtgggtgt gctgcacgat cccgagactc tgcttcgctg catcgctgaa 781 agggccttcc tgaggcacct ggaaggaggc tgcagtgtgc cagtagccgt gcatacagct 841 atgaaggatg ggcaactgta cctgactgga ggagtctgga gtctagacgg ctcagatagc 901 atacaagaga ccatgcaggc taccatccat gtccctgccc agcatgaaga tggccctgag 961 gatgacccac agttggtagg catcactgct cgtaacattc cacgagggcc ccagttggct 1021 gcccagaact tgggcatcag cctggccaac ttgttgctga gcaaaggagc caaaaccatc 1081 ctggatgttg cacggcagct taacgatgcc cattaactgg tttgtggggc acagatgcct 1141 gggttgctgc tgtccagtgc ctacatcccg ggcctcagtg ccccattctc actgctatct 1201 ggggagtgat taccccggga gactgaactg cagggttcaa gccttccagg gatttgcctc 1261 accttggggc cttgatgact gccttgcctc ctcagtatgt gggggcttca tctctttaga 1321 gaagtccaag caacagcctt tgaatgtaac caatcctact aataaaccag ttctgaaggt // LOCUS HSPBX3 2591 bp RNA PRI 06-DEC-1991 DEFINITION Human PBX3 mRNA. ACCESSION X59841 NID g35314 KEYWORDS homeobox gene; PBX3 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2591) AUTHORS Monica,K. TITLE Direct Submission JOURNAL Submitted (23-MAY-1991) K. Monica, Stanford University, Dept of Pathology, Stanford, CA 94305, USA REFERENCE 2 (bases 1 to 2591) AUTHORS Monica,K., Galili,N., Nourse,J., Saltman,D. and Cleary,M.L. TITLE PBX2 and PBX3, new homeobox genes with extensive homology to the human proto-oncogene PBX1 JOURNAL Mol. Cell. Biol. 11 (12), 6149-6157 (1991) MEDLINE 92049345 FEATURES Location/Qualifiers source 1..2591 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /tissue_type="non-lymphoid" /cell_type="epithelial" /cell_line="HeLa" /clone_lib="HeLa cDNA" /clone="2C,3C,8A,2A,9A" /chromosome="9" /map="q33-34 (band)" mRNA <113..2582 /gene="PBX3" /evidence=experimental gene 113..2582 /gene="PBX3" CDS 113..1417 /gene="PBX3" /codon_start=1 /product="homeobox protein" /db_xref="PID:g35315" /db_xref="SWISS-PROT:P40426" /translation="MDDQSRMLQTLAGVNLAGHSVQGGMALPPPPHGHEGADGDGRKQ DIGDILHQIMTITDQSLDEAQAKKHALNCHRMKPALFSVLCEIKEKTGLSIRGAQEED PPDPQLMRLDNMLLAEGVSGPEKGGGSAAAAAAAAASGGSSDNSIEHSDYRAKLTQIR QIYHTELEKYEQACNEFTTHVMNLLREQSRTRPISPKEIERMVGIIHRKFSSIQMQLK QSTCEAVMILRSRFLDARRKRRNFSKQATEILNEYFYSHLSNPYPSEEAKEELAKKCS ITVSQVSNWFGNKRIRYKKNIGKFQEEANLYAAKTAVTAAHAVAAAVQNNQTNSPTTP NSGSSGSFNLPNSGDMFMNMQSLNGDSYQGSQVGANVQSQVDTLRHVINQTGGYSDGL GGNSLYSPHNLNANGGWQDATTPSSVTSPTEGPGSVHSDTSN" misc_feature 812..1003 /gene="PBX3" /note="homeobox" BASE COUNT 752 a 621 c 552 g 666 t ORIGIN 1 gcggccgcct ccccctcccc ctccccctct ttcttctcct ccctcgtcgc cgccgccgcc 61 gccgccgcct cagccttcgc ctcagcgccg cccgctcgcc gcgcgcggcg ggatggacga 121 tcaatccagg atgctgcaga ctctggccgg ggtgaacctg gctggccact cggtgcaggg 181 gggcatggcc ctgccgcctc ccccgcacgg ccacgaaggg gcggacggcg acggcaggaa 241 gcaggacatc ggcgacatcc tccaccagat catgaccatc accgaccaga gcttggacga 301 ggcgcaagca aagaaacatg ccctgaactg tcacagaatg aaaccagcgc tcttcagcgt 361 cctgtgtgag atcaaagaga aaacaggtct cagcatcaga ggagcccagg aggaggaccc 421 tcccgatccc cagctaatga gactggacaa tatgcttttg gcagaagggg tttcaggtcc 481 tgagaaaggt gggggatcgg cggcagcagc tgcagccgcg gcagcctctg gaggttcttc 541 agataactct attgaacact cagattacag agccaaattg acccagatca gacaaatcta 601 tcacacagaa ctggagaaat atgaacaggc atgtaatgaa tttactacac atgtgatgaa 661 ccttctccga gaacagagta gaacacgtcc catttctcca aaagagattg aaagaatggt 721 gggcatcatc catcgaaaat ttagttccat tcagatgcag ctcaaacaaa gcacttgtga 781 agcagttatg attttaagat caaggttcct tgatgccaga cggaaaaggc gtaacttcag 841 taaacaggcc acagaaatct tgaatgaata tttttactca cacctcagca acccctaccc 901 cagtgaagaa gccaaagagg agctggccaa gaaatgcagc atcacagtgt cacaggtatc 961 caattggttt ggcaacaaac gaatcaggta caagaagaac attggcaagt ttcaggaaga 1021 agccaacctc tatgctgcaa agacggccgt gacagctgca cacgcagtag cagcagctgt 1081 gcagaacaac cagaccaatt cgcccaccac accaaattcc ggttcttctg gttcttttaa 1141 cctcccaaat tctggggaca tgttcatgaa catgcagagt ctgaatgggg attcttacca 1201 agggtcccaa gtcggagcca atgtgcaatc acaggtggat accctccgtc atgttatcaa 1261 tcagacggga ggctacagtg atggccttgg aggaaattca ctgtacagtc cacataattt 1321 aaatgctaat ggaggctggc aggacgcaac aactccatct tctgtgactt ctcctacaga 1381 aggcccagga agtgtgcact cggatacctc taactaatct ctggccacac ttttcctgag 1441 ctacatgcct tgataagtgc attcagagca ataggaggaa aaggaaagcg tttttgtagc 1501 ccaccatcta cagctttact gtaaaacctt gtcttattcg agaacttggt aaatctgttt 1561 ttaaggaatc ataatcattt gtatttatac ttaaaaacac acaatgttaa aaaaaataaa 1621 gcactttatc caattaggcc aagatttaac attgttgaca gtcctgtagc tattttatca 1681 taatttatta tcaatatttt acattaatgg tttcacagtt gccaattact tggccttaag 1741 ggtaaaaagt acaatataca ctaaacctca accgttaaag cagatgcaaa aattcacctc 1801 acctaaattg aacttcttgc atatttccat tactgacttg gattgtcttt ctttcatatc 1861 actaatggag ttggaataaa gagctgtttg cctatccctg ttaatgatgg ttgtgtttaa 1921 gaatcttcct cgtcacgttt gtgttcagat ctcttatgtt ataattagat cagagactgg 1981 tagcatcgtt tctctctctg aaagcaccag tgcccagagt ctgctcggta ataaaattat 2041 ggatccagat tgttctgaga gacgaagata cttgctgctg atagaggtga aaacgagatt 2101 gatccgtctg gggttttacg gtgtgcactg ggtgctgcac agacttgtca aggtttgcta 2161 cgtcctctgg gcatctgcaa aaggccctgc tctctggagt gttgtatata gtgtagcaaa 2221 agagtattta tacatcccac caatcaaaac acagctttat tacctcatgc gaactcatac 2281 aaaccaatag aatttcaaca tgttctgtag cttagagtgc tcacttacta cctctgaaca 2341 atactcacgc tgtagtttgt ctctttctta tctttttgca tcttgtaatt aactctttgt 2401 ttcccttcat aaaatgtaat gtacattgta atcttttaaa agaaaaatca gggttgcact 2461 tgcaactttt aaaaaaccga gtgtggaaac attgggtctt aattcaacac aggatcggta 2521 aaactgttgt aaatactgag aaacattttg aatgttcttc aatcttatta ctaatccatg 2581 caaaaaaaaa a // LOCUS HSPC13M 5037 bp RNA PRI 23-NOV-1995 DEFINITION H.sapiens encoding PC1/PC3. ACCESSION X64810 S88573 NID g35317 KEYWORDS endoprotease; PC1/PC3 protein; subtilisin homologue. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5037) AUTHORS Creemers,J.W., Roebroek,A.J. and Van de Ven,W.J. TITLE Expression in human lung tumor cells of the proprotein processing enzyme PC1/PC3. Cloning and primary sequence of a 5 kb cDNA JOURNAL FEBS Lett. 300 (1), 82-88 (1992) MEDLINE 92192290 REFERENCE 2 (bases 1 to 5037) AUTHORS Roebroek,A.J.M. TITLE Direct Submission JOURNAL Submitted (12-JUN-1992) A.J.M. Roebroek, Universitaire Ziekenhuizen, Leuven, Centrum voor Menselijke Erfelijkheid, UZ Gasthuisberg, Herestraat 49 3000 Leuven, BELGIUM FEATURES Location/Qualifiers source 1..5037 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="human lung cancer: carcinoid tumor" 5'UTR 1..189 /citation=[1] CDS 190..2451 /function="proprotein processing enzyme" /citation=[1] /codon_start=1 /product="PC1/PC3" /db_xref="PID:g35318" /db_xref="SWISS-PROT:P29120" /translation="MERRAWSLQCTAFVLFCAWCALNSAKAKRQFVNEWAAEIPGGPE AASAIAEELGYDLLGQIGSLENHYLFKHKNHPRRSRRSAFHITKRLSDDDRVIWAEQQ YEKERSKRSALRDSALNLFNDPMWNQQWYLQDTRMTAALPKLDLHVIPVWQKGITGKG VVITVLDDGLEWNHTDIYANYDPEASYDFNDNDHDPFPRYDPTNENKHGTRCAGEIAM QANNHKCGVGVAYNSKVGGIRMLDGIVTDAIEASSIGFNPGHVDIYSASWGPNDDGKT VEGPGRLAQKAFEYGVKQGRQGKGSIFVWASGNGGRQGDNCDCDGYTDSIYTISISSA SQQGLSPWYAEKCSSTLATSYSGGDYTDQRITSADLHNDCTETHTGTSASAPLAAGIF ALALEANPNLTWRDMQHLVVWTSEYDPLANNPGWKKNGAGLMVNSRFGFGLLNAKALV DLADPRTWRSVPEKKECVVKDNDFEPRALKANGEVIIEIPTRACEGQENAIKSLEHVQ FEATIEYSRRGDLHVTLTSAAGTSTVLLAERERDTSPNGFKNWDFMSVHTWGENPIGT WTLRITDMSGRIQNEGRIVNWKLILHGTSSQPEHMKQPRVYTSYNTVQNDRRGVEKMV DPGEEQPTQENPKENTLVSKSPSSSSVGGRRDELEEGAPSQAMLRLLQSAFSKNSPPK QSPKKSPSAKLNIPYENFYEALEKLNKPSQLKDSEDSLYNDYVDVFYNTKPYKHRDDR LLQALVDILNEEN" 3'UTR 2449..5037 BASE COUNT 1458 a 1061 c 1106 g 1412 t ORIGIN 1 aagcgcttca ctgagcgctc gccgccgccc agcctctcct ctcgcgcctc ctagctcttc 61 gcagagcaac caggagccag gagtggtcta gagcccgagg gtgggaaggg ggagtctgtc 121 tggcttttct cctatcttgc ttctttttcc tcttcccttc ccactcttgt tcaagcgagt 181 gtgtgagcta tggagcgaag agcctggagt ctgcagtgca ctgctttcgt cctcttttgc 241 gcttggtgtg cactgaacag tgcaaaagcg aaaaggcaat ttgtcaatga atgggcagcg 301 gagatccccg ggggcccgga agcagcctcg gccatcgccg aggagctggg ctatgacctt 361 ttgggtcaga ttggttcact tgaaaatcac tacttattca aacataaaaa ccaccccaga 421 aggtctcgaa ggagtgcctt tcatatcact aagagattat ctgatgatga tcgtgtgata 481 tgggctgaac aacagtatga aaaagaaaga agtaaacgtt cagctctaag ggactcagca 541 ctaaatctct tcaatgatcc catgtggaat cagcaatggt acttgcaaga taccaggatg 601 acggcagccc tgcccaagct ggaccttcat gtgatacctg tttggcaaaa aggcattacg 661 ggcaaaggag ttgttatcac cgtactggat gatggtttgg agtggaatca cacggacatt 721 tatgccaact atgatccaga ggctagctat gattttaatg ataatgacca tgatccattt 781 ccccgatatg atcccacaaa cgagaacaaa cacgggacca gatgtgcagg agaaattgcc 841 atgcaagcaa ataatcacaa atgcggggtt ggagttgcat acaattccaa agttggaggc 901 ataagaatgc tggatggcat tgtgacggat gctattgagg ccagttcaat tggattcaat 961 cctggacacg tggatattta cagtgcaagc tggggcccta atgatgatgg gaaaactgtg 1021 gaggggcctg gccggctagc ccagaaggct tttgaatatg gtgtcaaaca ggggagacag 1081 gggaaggggt ccatcttcgt ctgggcttcg ggaaacgggg ggcgtcaggg agataattgt 1141 gactgtgatg gctacacaga cagcatctac accatctcca tcagcagtgc ctcccagcaa 1201 ggcctatccc cctggtacgc tgagaagtgc tcctccacac tggccacctc ttacagcggc 1261 ggagattaca ccgaccagag aatcacgagc gctgacctgc acaatgactg cacggagacg 1321 cacacaggca cctcggcctc tgcacctctg gctgctggca tcttcgctct ggccctggaa 1381 gcaaacccaa atctcacctg gcgagatatg cagcacctgg ttgtctggac ctctgagtat 1441 gacccgctgg ccaataaccc tggatggaaa aagaatggag caggcttgat ggtgaatagt 1501 cgatttggat ttggcttgct aaatgccaaa gctctggtgg atttagctga ccccaggacc 1561 tggaggagcg tgcctgagaa gaaagagtgt gttgtaaagg acaatgactt tgagcccaga 1621 gccctgaaag ctaatggaga agttatcatt gaaattccaa caagagcttg tgaaggacaa 1681 gaaaatgcta tcaagtccct ggagcatgta caatttgaag caacaattga atattcccga 1741 agaggagacc ttcatgtcac acttacttct gctgctggaa ctagcactgt gctcttggct 1801 gaaagagaac gggatacatc tcctaatggc tttaagaact gggacttcat gtctgttcac 1861 acatggggag agaaccctat aggtacttgg actttgagaa ttacagacat gtctggaaga 1921 attcaaaatg aaggaagaat tgtgaactgg aagctgattt tgcacgggac ctcttctcag 1981 ccagagcata tgaagcagcc tcgtgtgtac acgtcctaca acactgttca gaatgacaga 2041 agaggggtgg agaagatggt ggatccaggg gaggagcagc ccacacaaga gaaccctaag 2101 gagaacaccc tggtgtccaa aagccccagc agcagcagcg tagggggccg gagggatgag 2161 ttggaggagg gagccccttc ccaggccatg ctgcgactcc tgcaaagtgc tttcagtaaa 2221 aactcaccgc caaagcaatc accaaagaag tccccaagtg caaagctcaa catcccttat 2281 gaaaacttct acgaagccct ggaaaagctg aacaaacctt cccagcttaa agactctgaa 2341 gacagtctgt ataatgacta tgttgatgtt ttttataaca ctaaacctta caagcacaga 2401 gacgaccggc tgcttcaagc tctggtggac attctgaatg aggaaaatta aaataagtgt 2461 gtggtcccaa gttggaaata ttcatgcttc ttccttaccc tgcgattttg cctgtgtctg 2521 aagtggttgt tttgtcatga attcttatgc ttataatatc ctttgtggca ccttttcttt 2581 ttctccctaa actgtacatg tgaaggggat gagctcaagc aggaagttca acttccagaa 2641 ttgatcatag gtatttcaaa acacatcttt cctgtctgca caagtgaagt gttttgttct 2701 ttctggagtc acagttgaca aaaagctctt acactacatt agaacactgc attagagccc 2761 atttcaattc tcaaaagaaa aggcaaaacc tgggatatca attaatttga aaacataatc 2821 tgcaaagaat gagaaggagt cagaaactgt ttctgtagct tgttccctgt cttgtccatg 2881 tggttcttca aattttgatg ccaagaaagt atttggtagg cctaatgaag gagttcactg 2941 taagactcat tccctagatc tttctattcc aaagtgccac tcattcctgt agtcaaaatc 3001 tggtcatgtt ggtcaaaagc tggattattt agatctagaa acagatcttg aaatctgaat 3061 gctctggttt gagcaatttt cgaacattct ttgcctggtg cactgtgtct gtggtgccag 3121 aggcgtccgt ggatccagag gtggttatga ctcgtgctgc atgcctggtc tttcctctgt 3181 ttctccttct gaaagttttc tatacctgtc tcctttctca gccacaaaat aaatgttggg 3241 agaaatgata tataccactt tcccagaaaa aaaaaaactt acacttggga cttggcaaat 3301 tcctagtcac aatttttttc agcagtaaca ggaaaccact tatcacatgg agacctaatg 3361 taataataga aaaatactca taatagggag aaaccaagag aagttttgtt tttgtttttt 3421 tccaactgtg ttcattagaa cagcgtgttc taagtatttg aaactgaatg tttattcctt 3481 gatactaaaa gttcttctcc aatcctatca ctgatagtgt ccaaattctc accaaattgc 3541 tcctaagctt caaatcagaa gcagaaactg gcaggccatg gaccttaatt gtccctcagg 3601 tagattttgt ttggtatgca gaatgttttt aaaatatgag tggttattga aaatatgatg 3661 tttcacataa aacctcattc tcggacccat ctttgctcat ggcaacagtt agctggagct 3721 gagtagcagc tgcctgatta gatgactctc agtccccatg gcaccctgct ccatgttacc 3781 tagagcaggc acttgattcc ttgctgggca gtatccaata ggcatttgat tttgcccact 3841 cctacactaa gcgaatgtgt acaaagtgta aatgcattag gaaaaacaaa ctacccgcat 3901 cttctgttag gcaggatctg tacaataata attatgagtt tgcttatgta atctcacctc 3961 acctggatga tcactaatac taattcattt attactaacc ttctggcttc cttctctcaa 4021 tatgcttaca aagtctccag tcacctacaa tgctggcttt ctcccactga gtttgctgtt 4081 tgcaattttt ccatgaagtt tgaacttcat aaggtaattc atggcattga actggttcat 4141 gaaaagaaca ctagagtctg tcatttgctt tggcttgaag tatggttggt aacacaaatt 4201 ttcacctgct cttctaccat ttgaatttgt gtagagggtg tttgcagagc aatgcccgta 4261 atgcttagag aatgttctcc taaaagactt gcggaatcac tctgtccttg gaagtttcat 4321 atattgtttg atatgaagtg ttagatagaa tttccaatat tggagcatat caaaaagtat 4381 taaaactaaa aaggaccaga gaattcttag attggcccgg aaaggccaat aaagagttag 4441 aatgaaaact cattactttt ccattcccaa tctagtgcta gatgtataaa tctttctttt 4501 gattcttcct aacaaaatat tttctgggtt aaaaccccag ccaactcatt gggttgtagc 4561 caaaggttca ctctcaagaa gctttaatat ttaaataaaa tcatattgaa tgtttccaac 4621 ctggagtata atattcagat ataaaacagt tttgtcagtc tttcttagtg cctgtgtgga 4681 tttttgtgaa aatgtcaaag agaaaactta tatactattt cccttgaaat tttaaactat 4741 attttcttta caggtattta taatatacca atgcttttat caaacagaat tttaaagagc 4801 ataataaatt atattaaaga accaaaagtt ttcctgagaa taagaaagtt tcacccaata 4861 aaatattttt gaaaggcatg ttcctctgtc aatgaaaaaa agtacatgta tgtgttgtga 4921 tattaaaagt gacatttgtc taatagccta atacaacatg tagctgagtt taacatgtgt 4981 ggtcttggta ttcttaaggg aacttccaca ttatacattt gatgtattga ccagaat // LOCUS HSPCAD 3171 bp RNA PRI 06-FEB-1992 DEFINITION H.sapiens mRNA for p cadherin. ACCESSION X63629 NID g35322 KEYWORDS cadherin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3171) AUTHORS Shimoyama,Y., Yoshida,T., Terada,M., Shimosato,Y., Abe,O. and Hirohashi,S. TITLE Molecular cloning of a human Ca2+-dependent cell-cell adhesion molecule homologous to mouse placental cadherin: its low expression in human placental tissues JOURNAL J. Cell Biol. 109 (4 Pt 1), 1787-1794 (1989) MEDLINE 90009051 FEATURES Location/Qualifiers source 1..3171 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 54..2543 /codon_start=1 /product="p-cadherin" /db_xref="PID:g35323" /db_xref="SWISS-PROT:P22223" /translation="MGLPRGPLASLLLLQVCWLQCAASEPCRAVFREAEVTLEAGGAE QEPGQALGKVFMGCPGQEPALFSTDNDDFTVRNGETVQERRSLKERNPLKIFPSKRIL RRHKRDWVVAPISVPENGKGPFPQRLNQLKSNKDRDTKIFYSITGPGADSPPEGVFAV EKETGWLLLNKPLDREEIAKYELFGHAVSENGASVEDPMNISIIVTDQNDHKPKFTQD TFRGSVLEGVLPGTSVMQVTATDEDDAIYTYNGVVAYSIHSQEPKDPHDLMFTIHRST GTISVISSGLDREKVPEYTLTIQATDMDGDGSTTTAVAVVEILDANDNAPMFDPQKYE AHVPENAVGHEVQRLTVTDLDAPNSPAWRATYLIMGGDDGDHFTITTHPESNQGILTT RKGLDFEAKNQHTLYVEVTNEAPFVLKLPTSTATIVVHVEDVNEAPVFVPPSKVVEVQ EGIPTGEPVCVYTAEDPDKENQKISYRILRDPAGWLAMDPDSGQVTAVGTLDREDEQF VRNNIYEVMVLAMDNGSPPTTGTGTLLLTLIDVNDHGPVPEPRQITICNQSPVRHVLN ITDKDLSPHTSPFQAQLTDDSDIYWTAEVNEEGDTVVLSLKKFLKQDTYDVHLSLSDH GNKEQLTVIRATVCDCHGHVETCPGPWKGGFILPVLGAVLALLFLLLVLLLLVRKKRK IKEPLLLPEDDTRDNVFYYGEEGGGEEDQDYDITQLHRGLEARPEVVLRNDVAPTIIP TPMYRPRPANPDEIGNFIIENLKAANTDPTAPPYDTLLVFDYEGSGSDAASLSSLTSS ASDQDQDYDYLNEWGSRFKKLADMYGGGEDD" polyA_signal 3162..3167 BASE COUNT 740 a 903 c 864 g 664 t ORIGIN 1 gcggaacacc ggcccgccgt cgcggcagct gcttcacccc tctctctgca gccatggggc 61 tccctcgtgg acctctcgcg tctctcctcc ttctccaggt ttgctggctg cagtgcgcgg 121 cctccgagcc gtgccgggcg gtcttcaggg aggctgaagt gaccttggag gcgggaggcg 181 cggagcagga gcccggccag gcgctgggga aagtattcat gggctgccct gggcaagagc 241 cagctctgtt tagcactgat aatgatgact tcactgtgcg gaatggcgag acagtccagg 301 aaagaaggtc actgaaggaa aggaatccat tgaagatctt cccatccaaa cgtatcttac 361 gaagacacaa gagagattgg gtggttgctc caatatctgt ccctgaaaat ggcaagggtc 421 ccttccccca gagactgaat cagctcaagt ctaataaaga tagagacacc aagattttct 481 acagcatcac ggggccgggg gcagacagcc cccctgaggg tgtcttcgct gtagagaagg 541 agacaggctg gttgttgttg aataagccac tggaccggga ggagattgcc aagtatgagc 601 tctttggcca cgctgtgtca gagaatggtg cctcagtgga ggaccccatg aacatctcca 661 tcatcgtgac cgaccagaat gaccacaagc ccaagtttac ccaggacacc ttccgaggga 721 gtgtcttaga gggagtccta ccaggtactt ctgtgatgca ggtgacagcc acagatgagg 781 atgatgccat ctacacctac aatggggtgg ttgcttactc catccatagc caagaaccaa 841 aggacccaca cgacctcatg ttcacaattc accggagcac aggcaccatc agcgtcatct 901 ccagtggcct ggaccgggaa aaagtccctg agtacacact gaccatccag gccacagaca 961 tggatgggga cggctccacc accacggcag tggcagtagt ggagatcctt gatgccaatg 1021 acaatgctcc catgtttgac ccccagaagt acgaggccca tgtgcctgag aatgcagtgg 1081 gccatgaggt gcagaggctg acggtcactg atctggacgc ccccaactca ccagcgtggc 1141 gtgccaccta ccttatcatg ggcggtgacg acggggacca ttttaccatc accacccacc 1201 ctgagagcaa ccagggcatc ctgacaacca ggaagggttt ggattttgag gccaaaaacc 1261 agcacaccct gtacgttgaa gtgaccaacg aggccccttt tgtgctgaag ctcccaacct 1321 ccacagccac catagtggtc cacgtggagg atgtgaatga ggcacctgtg tttgtcccac 1381 cctccaaagt cgttgaggtc caggagggca tccccactgg ggagcctgtg tgtgtctaca 1441 ctgcagaaga ccctgacaag gagaatcaaa agatcagcta ccgcatcctg agagacccag 1501 cagggtggct agccatggac ccagacagtg ggcaggtcac agctgtgggc accctcgacc 1561 gtgaggatga gcagtttgtg aggaacaaca tctatgaagt catggtcttg gccatggaca 1621 atggaagccc tcccaccact ggcacgggaa cccttctgct aacactgatt gatgtcaacg 1681 accatggccc agtccctgag ccccgtcaga tcaccatctg caaccaaagc cctgtgcgcc 1741 acgtgctgaa catcacggac aaggacctgt ctccccacac ctcccctttc caggcccagc 1801 tcacagatga ctcagacatc tactggacgg cagaggtcaa cgaggaaggt gacacagtgg 1861 tcttgtccct gaagaagttc ctgaagcagg atacatatga cgtgcacctt tctctgtctg 1921 accatggcaa caaagagcag ctgacggtga tcagggccac tgtgtgcgac tgccatggcc 1981 atgtcgaaac ctgccctgga ccctggaaag gaggtttcat cctccctgtg ctgggggctg 2041 tcctggctct gctgttcctc ctgctggtgc tgcttttgtt ggtgagaaag aagcggaaga 2101 tcaaggagcc cctcctactc ccagaagatg acacccgtga caacgtcttc tactatggcg 2161 aagagggggg tggcgaagag gaccaggact atgacatcac ccagctccac cgaggtctgg 2221 aggccaggcc ggaggtggtt ctccgcaatg acgtggcacc aaccatcatc ccgacaccca 2281 tgtaccgtcc taggccagcc aacccagatg aaatcggcaa ctttataatt gagaacctga 2341 aggcggctaa cacagacccc acagccccgc cctacgacac cctcttggtg ttcgactatg 2401 agggcagcgg ctccgacgcc gcgtccctga gctccctcac ctcctccgcc tccgaccaag 2461 accaagatta cgattatctg aacgagtggg gcagccgctt caagaagctg gcagacatgt 2521 acggtggcgg ggaggacgac taggcggcct gcctgcaggg ctggggacca aacgtcaggc 2581 cacagagcat ctccaagggg tctcagttcc cccttcagct gaggacttcg gagcttgtca 2641 ggaagtggcc gtagcaactt ggcggagaca ggctatgagt ctgacgttag agtggttgct 2701 tccttagcct ttcaggatgg aggaatgtgg gcagtttgac ttcagcactg aaaacctctc 2761 cacctgggcc agggttgcct cagaggccaa gtttccagaa gcctcttacc tgccgtaaaa 2821 tgctcaaccc tgtgtcctgg gcctgggcct gctgtgactg acctacagtg gactttctct 2881 ctggaatgga accttcttag gcctcctggt gcaacttaat tttttttttt aatgctatct 2941 tcaaaacgtt agagaaagtt cttcaaaagt gcagcccaga gctgctgggc ccactggccg 3001 tcctgcattt ctggtttcca gaccccaatg cctcccattc ggatggatct ctgcgttttt 3061 atactgagtg tgcctaggtt gccccttatt ttttattttc cctgttgcgt tgctatagat 3121 gaagggtgag gacaatcgtg tatatgtact agaacttttt tattaaagaa a // LOCUS HSPCAP15 384 bp RNA PRI 04-JAN-1995 DEFINITION H.sapiens mRNA for PC4 and p15. ACCESSION X79805 NID g619160 KEYWORDS p15 protein; PC4 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 384) AUTHORS Kretzschmar,M., Kaiser,K., Lottspeich,F. and Meisterernst,M. TITLE A novel mediator of class II gene transcription with homology to viral immediate-early transcriptional regulators JOURNAL Cell 78 (3), 525-534 (1994) MEDLINE 94340741 REFERENCE 2 (bases 1 to 384) AUTHORS Meisterernst,M. TITLE Direct Submission JOURNAL Submitted (21-JUN-1994) M. Meisterernst, Laboratory for Molecular Biol. -Genzentrum, Am Klopferspitz 18A, 82152 Martinsried, FRG FEATURES Location/Qualifiers source 1..384 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HELA" CDS 1..384 /codon_start=1 /product="PC4, p15" /db_xref="PID:g619161" /translation="MPKSKELVSSSSSGSDSDSEVDKKLKRKKQVAPEKPVKKQKTGE TSRALSSSKQSSSSRDDNMFQIGKMRYVSVRDFKGKVLIDIREYWMDPEGEMKPGRKG ISLNPEQWSQLKEQISDIDDAVRKL" BASE COUNT 142 a 58 c 94 g 90 t ORIGIN 1 atgcctaaat caaaggaact tgtttcttca agctcttctg gcagtgattc tgacagtgag 61 gttgacaaaa agttaaagag gaaaaagcaa gttgctccag aaaaacctgt aaagaaacaa 121 aagacaggtg agacttcgag agccctgtca tcttctaaac agagcagcag cagcagagat 181 gataacatgt ttcagattgg gaaaatgagg tacgttagtg ttcgcgattt taaaggcaaa 241 gtgctaattg atattagaga atattggatg gatcctgaag gtgaaatgaa accaggaaga 301 aaaggtattt ctttaaatcc agaacaatgg agccagctga aggaacagat ctctgatata 361 gatgacgcag taagaaagct gtga // LOCUS HSPCBXA1 1380 bp RNA PRI 30-JUN-1993 DEFINITION H.sapiens mRNA for procarboxypeptidase A1. ACCESSION X67318 S46257 NID g35329 KEYWORDS carboxypeptidase; carboxypeptidase A; carboxypeptidase A1. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1380) AUTHORS Catasus,L., Villegas,V., Pascual,R., Aviles,F.X., Wicker-Planquart,C. and Puigserver,A. TITLE cDNA cloning and sequence analysis of human pancreatic procarboxypeptidase A1 JOURNAL Biochem. J. 287 (Pt 1), 299-303 (1992) MEDLINE 93038569 REFERENCE 2 (bases 1 to 1380) AUTHORS Puigserver,A. TITLE Direct Submission JOURNAL Submitted (21-JUL-1992) A. Puigserver, C.N.R.S. C.B.M.3, 31 Chemin Joseph Aiguier, B.P. 71, 13402 Marseille Cedex 9, FRANCE FEATURES Location/Qualifiers source 1..1380 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="pancreas" sig_peptide 8..55 CDS 8..1267 /EC_number="3.4.17.1" /codon_start=1 /product="carboxypeptidase a" /db_xref="PID:g35330" /db_xref="SWISS-PROT:P15085" /translation="MRGLLVLSVLLGAVFGKEDFVGHQVLRISVADEAQVQKVKELED LEHLQLDFWRGPAHPGSPIDVRVPFPSIQAVKIFLESHGISYETMIEDVQSLLDEEQE QMFAFRSRARSTDTFNYATYHTLEEIYDFLDLLVAENPHLVSKIQIGNTYEGRPIYVL KFSTGGSKRPAIWIDTGIHSREWVTQASGVWFAKKITQDYGQDAAFTAILDTLDIFLE IVTNPDGFAFTHSTNRMWRKTRSHTAGSLCIGVDPNRNWDAGFGLSGASSNPCSETYH GKFANSEVEVKSIVDFVKDHGNIKAFISIHSYSQLLMYPYGYKTEPVPDQDELDQLSK AAVTALASLYGTKFNYGSIIKAIYQASGSTIDWTYSQGIKYSFTFELRDTGRYGFLLP ASQIIPTAKETWLALLTIMEHTLNHPY" mat_peptide 56..1264 /EC_number="3.4.17.1" /product="carboxypeptidase a" polyA_signal 1322..1327 polyA_site 1363 BASE COUNT 311 a 413 c 369 g 287 t ORIGIN 1 cagcagcatg cgggggttgc tggtgttgag tgtcctgttg ggggctgtct ttggcaagga 61 ggactttgtg gggcatcagg tgctccgaat ctctgtagcc gatgaggccc aggtacagaa 121 ggtgaaggag ctggaggacc tggagcacct gcagctggac ttctggcggg ggcctgccca 181 ccctggctcc cccatcgacg tccgagtgcc cttccccagc atccaggcgg tcaagatctt 241 tctggagtcc cacggcatca gctatgagac catgatcgag gacgtgcagt cgctgctgga 301 cgaggagcag gagcagatgt tcgccttccg gtcccgggcg cgctccaccg acacttttaa 361 ctacgccacc taccacaccc tggaggagat ctatgacttc ctggacctgc tggtggcgga 421 gaacccgcac cttgtcagca agatccagat tggcaacacc tatgaagggc gtcccattta 481 tgtgctgaag ttcagcacgg ggggcagtaa gcgtccagcc atctggatcg acacgggcat 541 ccattcccgg gagtgggtca cccaggccag tggggtctgg tttgcaaaga agatcactca 601 agactatggg caggatgcag ctttcaccgc cattctcgac accttggaca tcttcctgga 661 gatcgtcacc aaccctgatg gctttgcctt cacgcacagc acgaatcgca tgtggcgcaa 721 gactcggtcc cacacagcag gctccctctg tattggcgtg gaccccaaca ggaactggga 781 cgctggcttt gggttgtccg gagccagcag taacccctgc tcggagactt accacggcaa 841 gtttgccaat tccgaagtgg aggtcaagtc cattgtagac tttgtgaagg accatgggaa 901 catcaaggcc ttcatctcca tccacagcta ctcccagctc ctcatgtatc cctatggcta 961 caaaacagaa ccagtccctg accaggatga gctggatcag ctttccaagg ctgctgtgac 1021 agccctggcc tctctctacg ggaccaagtt caactatggc agcatcatca aggcaattta 1081 tcaagccagt ggaagcacta ttgactggac ctacagccag ggcatcaagt actccttcac 1141 cttcgagctc cgggacactg ggcgctatgg cttcctgctg ccagcctccc agatcatccc 1201 cacagccaag gagacgtggc tggcgcttct gaccatcatg gagcacaccc tgaatcaccc 1261 ctactgagct gaccctttga cacccttctt gtcctcctct ctggccccat ccaggcaacc 1321 aaataaagtt tgagtgtacc aggaacagaa tcctggggct tgcaaaaaaa aaaaaaaaaa // LOCUS HSPCCAR 2423 bp RNA PRI 20-APR-1993 DEFINITION Human mRNA for propionyl-CoA carboxylase alpha-chain (EC 6.4.1.3). ACCESSION X14608 NID g296365 KEYWORDS propionyl-CoA carboxylase; propionyl-CoA carboxylase alpha. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2423) AUTHORS Lamhonwah,A.M. TITLE Direct Submission JOURNAL Submitted (08-MAR-1989) Lamhonwah A.-M., University-Montreal Children's Hospital Research Institute, 2300 Tupper Street, Montreal, Quebec, Canada H3H 1P3 REMARK revised by [3] REFERENCE 2 (bases 1 to 2423) AUTHORS Lamhonwah,A.M., Mahuran,D. and Gravel,R.A. TITLE Human mitochondrial propionyl-CoA carboxylase: localization of the N-terminus of the pro- and mature alpha chains in the deduced primary sequence of a full-length cDNA JOURNAL Nucleic Acids Res. 17 (11), 4396 (1989) MEDLINE 89296507 REFERENCE 3 (bases 1 to 2423) AUTHORS Gravel,R. TITLE Direct Submission JOURNAL Submitted (15-APR-1993) Gravel R., Hospital For Sick Children, 555 University Avenue, Toronto Ontario, CANADA M5G 1X8 FEATURES Location/Qualifiers source 1..2423 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pHA32" /chromosome="13." misc_feature 1..26 /note="mitochondrial leader peptide" sig_peptide 49..108 /gene="PCCA" gene 49..2160 /gene="PCCA" CDS 49..2160 /gene="PCCA" /standard_name="propionyl-CoA carboxylase alpha-chain" /EC_number="6.4.1.3" /codon_start=1 /product="propionyl-CoA carboxylase" /db_xref="PID:g296366" /db_xref="SWISS-PROT:P05165" /translation="MLSAALRTLKHVLYYSRQCLMVSRNLGSVGYDPNEKTFDKILVA NRGEIACRVIRTCKKMGIKTVAIHSDVDASSVHVKMADEAVCVGPAPTSKSYLNMDAI MEAIKKTRAQAVHPGYGFLSENKEFARCLAAEDVVFIGPDTHAIQAMGDKIESKLLAK KAEVNTIPGFDGVVKDAEEAVRIAREIGYPVMIKASAGGGGKGMRIAWDDEETRDGFR LSSQEAASSFGDDRLLIEKFIDNPRHIEIQVLGDKHGNALWLNERECSIQRRNQKVVE EAPSIFLDAETRRAMGEQAVALARAVKYSSAGTVEFLVDSKKNFYFLEMNTRLQVEHP VTECITGLDLVQEMIRVAKGYPLRHKQADIRINGWAVECRVYAEDPYKSFGLPSIGRL SQYQEPLHLPGVRVDSGIQPGSDISIYYDPMISKLITYGSDRTEALKRMADALDNYVI RGVTHNIALLREVIINSRFVKGDISTKFLSDVYPDGFKGHMLTKSEKNQLLAIASSLF VAFQLRAQHFQENSRMPVIKPDIANWELSVKLHDKVHTVVASNNGSVFSVEVDGSKLN VTSTWNLASPLLSVSVDGTQRTVQCLSREAGGNMSIQFLGTVYKVNILTRLAAELNKF MLEKVTEDTSSVLRSPMPGVVVAVSVKPGDAVAEGQEICVIEAMKMQNSMTAGKTGTV KSVHCQAGDTVGEGDLLVELE" mat_peptide 109..2157 /gene="PCCA" /standard_name="propionyl-CoA carboxylase alpha-chain" /EC_number="6.4.1.3" /product="propionyl-CoA carboxylase" misc_feature 340..366 /gene="PCCA" /note="conflicts with paper 1989" misc_feature 1066..1146 /gene="PCCA" /note="conflicts with paper 1989" polyA_signal 2386..2391 BASE COUNT 713 a 470 c 597 g 643 t ORIGIN 1 cccgctggtc gctgccggac ggcgtgggcg gtggccgcgc agcagctgat gctgagcgcg 61 gcgctgcgga ccctgaagca tgttctgtac tattcaagac agtgcttaat ggtgtcccgt 121 aatcttggtt cagtgggata tgatcctaat gaaaaaactt ttgataaaat tcttgttgct 181 aatagaggag aaattgcatg tcgggttatt agaacttgca agaagatggg cattaagaca 241 gttgccatcc acagtgatgt tgatgctagt tctgttcatg tgaaaatggc ggatgaggct 301 gtctgtgttg gcccagctcc caccagtaaa agctacctca acatggatgc catcatggaa 361 gccattaaga aaaccagggc ccaagctgta catccaggtt atggattcct ttcagaaaac 421 aaagaatttg ccagatgttt ggcagcagaa gatgtcgttt tcattggacc tgacacacat 481 gctattcaag ccatgggcga caagattgaa agcaaattat tagctaagaa agcagaggtt 541 aatacaatcc ctggctttga tggagtagtc aaggatgcag aagaagctgt cagaattgca 601 agggaaattg gctaccctgt catgatcaag gcctcagcag gtggtggtgg gaaaggcatg 661 cgcattgctt gggatgatga agagaccagg gatggtttta gattgtcatc tcaagaagct 721 gcttctagtt ttggcgatga tagactacta atagaaaaat ttattgataa tcctcgtcat 781 atagaaatcc aggttctagg tgataaacat gggaatgctt tatggcttaa tgaaagagag 841 tgctcaattc agagaagaaa tcagaaggtg gtggaggaag caccaagcat ttttttggat 901 gcggagactc gaagagcgat gggagaacaa gctgtagctc ttgccagagc agtaaaatat 961 tcctctgctg ggaccgtgga gttccttgtg gactctaaga agaattttta tttcttggaa 1021 atgaatacaa gactccaggt tgagcatcct gtcacagaat gcattactgg cctggaccta 1081 gtccaggaaa tgatccgtgt tgctaagggc taccctctca ggcacaaaca agctgatatt 1141 cgcatcaacg gctgggcagt tgaatgtcgg gtttatgctg aggaccccta caagtctttt 1201 ggtttaccat ctattgggag attgtctcag taccaagaac cgttacatct acctggtgtc 1261 cgagtggaca gtggcatcca accaggaagt gatattagca tttattatga tcctatgatt 1321 tcaaaactaa tcacatatgg ctctgataga actgaggcac tgaagagaat ggcagatgca 1381 ctggataact atgttattcg aggtgttaca cataatattg cattacttcg agaggtgata 1441 atcaactcac gctttgtaaa aggagacatc agcactaaat ttctctccga tgtgtatcct 1501 gatggcttca aaggacacat gctaaccaag agtgagaaga accagttatt ggcaatagca 1561 tcatcattgt ttgtggcatt ccagttaaga gcacaacatt ttcaagaaaa ttcaagaatg 1621 cctgttatta aaccagacat agccaactgg gagctctcag taaaattgca tgataaagtt 1681 cataccgtag tagcatcaaa caatgggtca gtgttctcgg tggaagttga tgggtcgaaa 1741 ctaaatgtga ccagcacgtg gaacctggct tcgcccttat tgtctgtcag cgttgatggc 1801 actcagagga ctgtccagtg tctttctcga gaagcaggtg gaaacatgag cattcagttt 1861 cttggtacag tgtacaaggt gaatatctta accagacttg ccgcagaatt gaacaaattt 1921 atgctggaaa aagtgactga ggacacaagc agtgttctgc gttccccgat gcccggagtg 1981 gtggtggccg tctctgtcaa gcctggagac gcggtagcag aaggtcaaga aatttgtgtg 2041 attgaagcca tgaaaatgca gaatagtatg acagctggga aaactggcac ggtgaaatct 2101 gtgcactgtc aagctggaga cacagttgga gaaggggatc tgctcgtgga gctggaatga 2161 aggatttata acctttcagt catcacccaa tttaattagc catttgcatg atgctttcac 2221 acacaattga ttcaagcatt atacaggaac acccctgtgc agctacgttt acgtcgtcat 2281 ttattccaca gagtcaagac caatattctg ccaaaaaatc accaatggaa attttcatga 2341 tataaatact tgtactatag atgtacttct gctgtgagat tccctagtgt caaaattaaa 2401 tcaataaaac tgagcatttg tct // LOCUS HSPCHDP7 3407 bp RNA PRI 16-NOV-1993 DEFINITION Human pcHDP7 mRNA for liver dipeptidyl peptidase IV. ACCESSION X60708 S40353 NID g35335 KEYWORDS dipeptidyl peptidase IV; pcHDP7. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3407) AUTHORS Misumi,Y. TITLE Direct Submission JOURNAL Submitted (30-JUN-1991) Y. Misumi, Second Dept of Biochemistry, School of Medicine, Fukuoka University, 7-45-1 Nanakuma Jonnan-ku, Fukuoka 814-01, JAPAN REFERENCE 2 (bases 1 to 3407) AUTHORS Misumi,Y., Hayashi,Y., Arakawa,F. and Ikehara,Y. TITLE Molecular cloning and sequence analysis of human dipeptidyl peptidase IV, a serine proteinase on the cell surface JOURNAL Biochim. Biophys. Acta 1131 (3), 333-336 (1992) MEDLINE 92329551 COMMENT See Ann. Hum. Genet. 54:191-197(1990) for overlapping sequence. FEATURES Location/Qualifiers source 1..3407 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /clone_lib="lambda ZAPII" mRNA 1..3407 /gene="pcHDP7" /evidence=experimental gene 1..3407 /gene="pcHDP7" CDS 76..2376 /gene="pcHDP7" /EC_number="3.4.14.5" /codon_start=1 /product="dipeptidyl peptidase iv" /db_xref="PID:g35336" /db_xref="SWISS-PROT:P27487" /translation="MKTPWKILLGLLGAAALVTIITVPVVLLNKGTDDATADSRKTYT LTDYLKNTYRLKLYSLRWISDHEYLYKQENNILVFNAEYGNSSVFLENSTFDEFGHSI NDYSISPDGQFILLEYNYVKQWRHSYTASYDIYDLNKRQLITEERIPNNTQWVTWSPV GHKLAYVWNNDIYVKIEPNLPSYRITWTGKEDIIYNGITDWVYEEEVFSAYSALWWSP NGTFLAYAQFNDTEVPLIEYSFYSDESLQYPKTVRVPYPKAGAVNPTVKFFVVNTDSL SSVTNATSIQITAPASMLIGDHYLCDVTWATQERISLQWLRRIQNYSVMDICDYDESS GRWNCLVARQHIEMSTTGWVGRFRPSEPHFTLDGNSFYKIISNEEGYRHICYFQIDKK DCTFITKGTWEVIGIEALTSDYLYYISNEYKGMPGGRNLYKIQLIDYTKVTCLSCELN PERCQYYSVSFSKEAKYYQLRCSGPGLPLYTLHSSVNDKGLRVLEDNSALDKMLQNVQ MPSKKLDFIILNETKFWYQMILPPHFDKSKKYPLLLDVYAGPCSQKADTVFRLNWATY LASTENIIVASFDGRGSGYQGDKIMHAINRRLGTFEVEDQIEAARQFSKMGFVDNKRI AIWGWSYGGYVTSMVLGSGSGVFKCGIAVAPVSRWEYYDSVYTERYMGLPTPEDNLDH YRNSTVMSRAENFKQVEYLLIHGTADDNVHFQQSAQISKALVDVGVDFQAMWYTDEDH GIASSTAHQHIYTHMSHFIKQCFSLP" BASE COUNT 1077 a 678 c 704 g 948 t ORIGIN 1 cgcgcgtctc cgccgcccgc gtgacttctg cctgcgctcc ttctctgaac gctcacttcc 61 gaggagacgc cgacgatgaa gacaccgtgg aagattcttc tgggactgct gggtgctgct 121 gcgcttgtca ccatcatcac cgtgcccgtg gttctgctga acaaaggcac agatgatgct 181 acagctgaca gtcgcaaaac ttacactcta actgattact taaaaaatac ttatagactg 241 aagttatact ccttaagatg gatttcagat catgaatatc tctacaaaca agaaaataat 301 atcttggtat tcaatgctga atatggaaac agctcagttt tcttggagaa cagtacattt 361 gatgagtttg gacattctat caatgattat tcaatatctc ctgatgggca gtttattctc 421 ttagaataca actacgtgaa gcaatggagg cattcctaca cagcttcata tgacatttat 481 gatttaaata aaaggcagct gattacagaa gagaggattc caaacaacac acagtgggtc 541 acatggtcac cagtgggtca taaattggca tatgtttgga acaatgacat ttatgttaaa 601 attgaaccaa atttaccaag ttacagaatc acatggacgg ggaaagaaga tataatatat 661 aatggaataa ctgactgggt ttatgaagag gaagtcttca gtgcctactc tgctctgtgg 721 tggtctccaa acggcacttt tttagcatat gcccaattta acgacacaga agtcccactt 781 attgaatact ccttctactc tgatgagtca ctgcagtacc caaagactgt acgggttcca 841 tatccaaagg caggagctgt gaatccaact gtaaagttct ttgttgtaaa tacagactct 901 ctcagctcag tcaccaatgc aacttccata caaatcactg ctcctgcttc tatgttgata 961 ggggatcact acttgtgtga tgtgacatgg gcaacacaag aaagaatttc tttgcagtgg 1021 ctcaggagga ttcagaacta ttcggtcatg gatatttgtg actatgatga atccagtgga 1081 agatggaact gcttagtggc acggcaacac attgaaatga gtactactgg ctgggttgga 1141 agatttaggc cttcagaacc tcattttacc cttgatggta atagcttcta caagatcatc 1201 agcaatgaag aaggttacag acacatttgc tatttccaaa tagataaaaa agactgcaca 1261 tttattacaa aaggcacctg ggaagtcatc gggatagaag ctctaaccag tgattatcta 1321 tactacatta gtaatgaata taaaggaatg ccaggaggaa ggaatcttta taaaatccaa 1381 cttattgact atacaaaagt gacatgcctc agttgtgagc tgaatccgga aaggtgtcag 1441 tactattctg tgtcattcag taaagaggcg aagtattatc agctgagatg ttccggtcct 1501 ggtctgcccc tctatactct acacagcagc gtgaatgata aagggctgag agtcctggaa 1561 gacaattcag ctttggataa aatgctgcag aatgtccaga tgccctccaa aaaactggac 1621 ttcattattt tgaatgaaac aaaattttgg tatcagatga tcttgcctcc tcattttgat 1681 aaatccaaga aatatcctct actattagat gtgtatgcag gcccatgtag tcaaaaagca 1741 gacactgtct tcagactgaa ctgggccact taccttgcaa gcacagaaaa cattatagta 1801 gctagctttg atggcagagg aagtggttac caaggagata agatcatgca tgcaatcaac 1861 agaagactgg gaacatttga agttgaagat caaattgaag cagccagaca attttcaaaa 1921 atgggatttg tggacaacaa acgaattgca atttggggct ggtcatatgg agggtacgta 1981 acctcaatgg tcctgggatc gggaagtggc gtgttcaagt gtggaatagc cgtggcgcct 2041 gtatcccggt gggagtacta tgactcagtg tacacagaac gttacatggg tctcccaact 2101 ccagaagaca accttgacca ttacagaaat tcaacagtca tgagcagagc tgaaaatttt 2161 aaacaagttg agtacctcct tattcatgga acagcagatg ataacgttca ctttcagcag 2221 tcagctcaga tctccaaagc cctggtcgat gttggagtgg atttccaggc aatgtggtat 2281 actgatgaag accatggaat agctagcagc acagcacacc aacatatata tacccacatg 2341 agccacttca taaaacaatg tttctcttta ccttagcacc tcaaaatacc atgccattta 2401 aagcttatta aaactcattt ttgttttcat tatctcaaaa ctgcactgtc aagatgatga 2461 tgatctttaa aatacacact caaatcaaga aacttaaggt tacctttgtt cccaaatttc 2521 atacctatca tcttaagtag ggacttctgt cttcacaaca gattattacc ttacagaagt 2581 ttgaattatc cggtcgggtt ttattgttta aaatcatttc tgcatcagct gctgaaacaa 2641 caaataggaa ttgtttttat ggaggctttg catagattcc ctgagcagga ttttaatctt 2701 tttctaactg gactggttca aatgttgttc tcttctttaa agggatggca agatgtgggc 2761 agtgatgtca ctagggcagg gacaggataa gagggattag ggagagaaga tagcagggca 2821 tggctgggaa cccaagtcca agcataccaa cacgagcagg ctactgtcag ctcccctcgg 2881 agaagagctg ttcaccacga gactggcaca gttttctgag aaagactatt caaacagtct 2941 caggaaatca aatatcgaaa gcactgactt ctaagtaaac cacagcagtt gaaagactcc 3001 aaagaaatgt aagggaaact gccagcaacg cagcccccag gtgccagtta tggctatagg 3061 tgctacaaaa acacagcaag ggtgatggga aagcattgta aatgtgcttt taaaaaaaaa 3121 tactgatgtt cctagtgaaa gaggcagctt gaaactgaga tgtgaacaca tcagcttgcc 3181 ctgttaaaag atgaaaatat ttgtatcaca aatcttaact tgaaggagtc cttgcatcaa 3241 tttttcttat ttcatttctt tgagtgtctt aattaaaaga atattttaac ttccttggac 3301 tcattttaaa aaatggaaca taaaatacaa tgttatgtat tattattccc attctacata 3361 ctatggaatt tctcccagtc atttaataaa tgtgccttca ttttttc // LOCUS HSPCTTSIR 1977 bp RNA PRI 12-OCT-1995 DEFINITION H.sapiens mRNA for peroxisomal C-terminal targeting signal import receptor. ACCESSION X84899 NID g695565 KEYWORDS peroxisomal targeting signal import receptor; PTS1-BP bene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1977) AUTHORS Fransen,M., Brees,C., Baumgart,E., Vanhooren,J.C., Baes,M., Mannaerts,G.P. and Van Veldhoven,P.P. TITLE Identification and characterization of the putative human peroxisomal C-terminal targeting signal import receptor JOURNAL J. Biol. Chem. 270 (13), 7731-7736 (1995) MEDLINE 95221441 REFERENCE 2 (bases 1 to 1977) AUTHORS Van Veldhoven,P.P. TITLE Direct Submission JOURNAL Submitted (22-FEB-1995) P.P. Van Veldhoven, Inst. K.U. Leuven, Farmakologie, Campus Gasthuisberg (O&N), Herestraat 49, B-3000 Leuven, BELGIUM FEATURES Location/Qualifiers source 1..1977 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="liver" /clone_lib="Human liver Matchmaker" gene 19..1938 /gene="PTS1-BP" CDS 19..1938 /gene="PTS1-BP" /note="putative" /codon_start=1 /product="peroxisomal C-terminal targeting signal import receptor" /db_xref="PID:g695566" /translation="MAMRELVEAECGGANPLMKLAGHFTQDKALRQEGLRPGPWPPGA PASEAASKPLGVASEDELVAEFLQDQNAPLVSRAPQTFKMDDLLAEMQQIEQSNFRQA PQRAPGVADLALSENWAQEFLAAGDAVDVTQDYNETDWSQEFISEVTDPLSVSPARWA EEYLEQSEEKLWLGEPEGTATDRWYDEYHPEEDLQHTASDFVAKVDDPKLANSEFLKF VRQIGEGQVSLESGAGSGRAQAEQWAAEFIQQQGTSDAWVDQFTRPVNTSALDMEFER AKSAIESDVDFWDKLQAELEEMAKRDAEAHPWLSDYDDLTSATYDKGYQFEEENPLRD HPQPFEEGLRRLQEGDLPNAVLLFEAAVQQDPKHMEAWQYLGTTQAENEQELLAISAL RRCLELKPDNQTALMALAVSFTNESLQRQACETLRDWLRYTPAYAHLVTPAEEGAGGA GLGPSKRILGSLLSDSLFLEVKELFLAAVRLDPTSIDPDVQCGLGVLFNLSGEYDKAV DCFTAALSVRPNDYLLWNKLGATLANGNQSEEAVAAYRRALELQPGYIRSRYNLGISC INLGAHREAVEHFLEALNMQRKSRGPRGEGGAMSENIWSTLRLALSMLGQSDAYGAAD ARDLSTLLTMFGLPQ" BASE COUNT 429 a 523 c 612 g 413 t ORIGIN 1 cgagagctgg cggtcaccat ggcaatgcgg gagctggtgg aggccgaatg cgggggtgcc 61 aacccgctca tgaagctcgc cgggcacttc acccaggaca aggcccttcg gcaggaggga 121 ttgaggcctg gcccctggcc ccccggagcc ccggcctctg aggcagcctc caagcctttg 181 ggagtagctt ctgaagatga gttggtggct gaattcctgc aggaccagaa tgcacccctt 241 gtgtcccgtg cccctcagac cttcaagatg gatgacctcc tggctgagat gcagcagatt 301 gagcagtcaa acttccgcca ggctccccag agagcccctg gtgtggcaga cttggccttg 361 tctgagaact gggcccagga gtttcttgca gctggagatg ctgtggatgt aactcaggat 421 tataatgaga ctgactggtc ccaagaattc atctctgaag ttacagaccc cttgtctgtg 481 tcccctgccc gctgggctga ggaatatttg gagcaatcag aggagaagct gtggctggga 541 gaacctgagg gaacagccac cgatcgctgg tatgatgaat atcatcctga ggaggatctg 601 cagcacacgg ccagtgactt tgtggccaaa gtggatgacc ccaaattggc taattctgag 661 ttcctgaaat tcgtgcggca gattggcgaa gggcaggtgt ccctggagtc cggtgcaggg 721 tcgggccgag ctcaggcaga acagtgggca gcagagttta tacagcagca gggtacatca 781 gatgcctggg ttgaccagtt cacaagacca gtaaacacat ctgcccttga tatggagttt 841 gaacgagcca agtcagctat agagtctgat gtcgatttct gggacaagtt gcaggcagag 901 ttggaggaga tggcaaaacg ggatgctgag gcccacccct ggctttctga ctatgatgac 961 cttacgtcag ctacctatga taaggggtac cagtttgagg aggagaaccc cttgcgtgat 1021 caccctcagc cttttgaaga agggctgcgg cgccttcagg agggggacct gccaaatgct 1081 gtgctgcttt ttgaggcagc tgtgcagcag gatcctaagc acatggaagc ttggcagtat 1141 ctgggtacca cccaggcaga gaatgaacaa gaactattag ccatcagtgc attgcggagg 1201 tgtctggagc taaagccaga taaccagaca gcactgatgg cgctggctgt gagcttcacc 1261 aacgagtccc tgcagcgaca ggcctgtgaa accctacgag actggctgcg gtacacacca 1321 gcctatgccc atctggtgac acctgctgaa gaaggggctg gtggggcagg actgggcccc 1381 agcaagcgta tcctgggatc tctcttgtct gactccctgt ttcttgaagt gaaagagctc 1441 ttcctggcag ctgtgcggct ggaccctacc tccattgacc ctgatgtgca gtgtggcttg 1501 ggagtccttt tcaacctgag tggggagtat gacaaggccg tggactgctt cacagctgcc 1561 ctcagcgttc gtcccaatga ctatttgctg tggaataagc taggcgccac cctggccaat 1621 ggaaaccaga gtgaagaagc agtagctgcg taccgccggg ccctcgagct ccagcctggc 1681 tatatccggt cccgctataa cctgggcatc agctgcatca acctcggggc tcaccgggag 1741 gctgtggagc actttctgga ggccctgaac atgcagagga aaagccgggg cccccggggt 1801 gaaggaggtg ccatgtcgga gaacatctgg agcaccctgc gtttggcatt gtctatgtta 1861 ggccagagcg atgcctatgg ggcagccgac gcgcgggatc tgtccaccct cctaactatg 1921 tttggcctgc cccagtgaca gtgggacggg ctgccctgtg agtgtccacc tggaggg // LOCUS HSPDE1A3A 2008 bp mRNA PRI 12-APR-1996 DEFINITION Human 3',5' cyclic nucleotide phosphodiesterase (HSPDE1A3A) mRNA, complete cds. ACCESSION U40370 NID g1151108 KEYWORDS calmodulin-stimulated phosphodiesterase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2008) AUTHORS Loughney,K., Martins,T.J., Harris,E.A., Sadhu,K., Hicks,J.B., Sonnenburg,W.K., Beavo,J.A. and Ferguson,K. TITLE Isolation and characterization of cDNAs corresponding to two human calcium, calmodulin-regulated, 3',5'-cyclic nucleotide phosphodiesterases JOURNAL J. Biol. Chem. 271 (2), 796-806 (1996) MEDLINE 96132810 REFERENCE 2 (bases 1 to 2008) AUTHORS Loughney,K., Martins,T.J., Harris,E.A.S., Sadhu,K., Hicks,J.B., Sonnenburg,W.K., Beavo,J.A. and Ferguson,K. TITLE Direct Submission JOURNAL Submitted (07-NOV-1995) Kate Loughney, ICOS, 22021 20th Ave. S.E., Bothell, WA 98021, USA FEATURES Location/Qualifiers source 1..2008 /organism="Homo sapiens" /db_xref="taxon:9606" gene 85..1692 /gene="PDE1A" CDS 85..1692 /gene="PDE1A" /note="PDE1A3; type I phosphodiesterase" /codon_start=1 /product="3',5' cyclic nucleotide phosphodiesterase" /db_xref="PID:g1151109" /translation="MGSSATEIEELENTTFKYLTGEQTEKMWQRLKGILRCLVKQLER GDVNVVDLKKNIEYAASVLEAVYIDETRRLLDTEDELSDIQTDSVPSEVRDWLASTFT RKMGMTKKKPEEKPKFRSIVHAVQAGIFVERMYRKTYHMVGLAYPAAVIVTLKDVDKW SFDVFALNEASGEHSLKFMIYELFTRYDLINRFKIPVSCLITFAEALEVGYSKYKNPY HNLIHAADVTQTVHYIMLHTGIMHWLTELEILAMVFAAAIHDYEHTGTTNNFHIQTRS DVAILYNDRSVLENHHVSAAYRLMQEEEMNILINLSKDDWRDLRNLVIEMVLSTDMSG HFQQIKNIRNSLQQPEGIDRAKTMSLILHAADISHPAKSWKLHYRWTMALMEEFFLQG DKEAELGLPFSPLCDRKSTMVAQSQIGFIDFIVEPTFSLLTDSTEKIVIPLIEEASKA ETSSYVASSSTTIVGLHIADALRRSNTKGSMSDGSYSPDYSLAAVDLKSFKNNLVDII QQNKERWKELAAQEARTSSQKCEFIHQ" BASE COUNT 627 a 400 c 437 g 544 t ORIGIN 1 gaattctgat gtgcttcagt gcacagaaca gtaacagatg agctgctttt ggggagagct 61 tgagtactca gtcggagcat catcatgggg tctagtgcca cagagattga agaattggaa 121 aacaccactt ttaagtatct tacaggagaa cagactgaaa aaatgtggca gcgcctgaaa 181 ggaatactaa gatgcttggt gaagcagctg gaaagaggtg atgttaacgt cgtcgactta 241 aagaagaata ttgaatatgc ggcatctgtg ctggaagcag tttatatcga tgaaacaaga 301 agacttctgg atactgaaga tgagctcagt gacattcaga ctgactcagt cccatctgaa 361 gtccgggact ggttggcttc tacctttaca cggaaaatgg ggatgacaaa aaagaaacct 421 gaggaaaaac caaaatttcg gagcattgtg catgctgttc aagctggaat ttttgtggaa 481 agaatgtacc gaaaaacata tcatatggtt ggtttggcat atccagcagc tgtcatcgta 541 acattaaagg atgttgataa atggtctttc gatgtatttg ccctaaatga agcaagtgga 601 gagcatagtc tgaagtttat gatttatgaa ctgtttacca gatatgatct tatcaaccgt 661 ttcaagattc ctgtttcttg cctaatcacc tttgcagaag ctttagaagt tggttacagc 721 aagtacaaaa atccatatca caatttgatt catgcagctg atgtcactca aactgtgcat 781 tacataatgc ttcatacagg tatcatgcac tggctcactg aactggaaat tttagcaatg 841 gtctttgctg ctgccattca tgattatgag catacaggga caacaaacaa ctttcacatt 901 cagacaaggt cagatgttgc cattttgtat aatgatcgct ctgtccttga gaatcaccac 961 gtgagtgcag cttatcgact tatgcaagaa gaagaaatga atatcttgat aaatttatcc 1021 aaagatgact ggagggatct tcggaaccta gtgattgaaa tggttttatc tacagacatg 1081 tcaggtcact tccagcaaat taaaaatata agaaacagtt tgcagcagcc tgaagggatt 1141 gacagagcca aaaccatgtc cctgattctc cacgcagcag acatcagcca cccagccaaa 1201 tcctggaagc tgcattatcg gtggaccatg gccctaatgg aggagttttt cctgcaggga 1261 gataaagaag ctgaattagg gcttccattt tccccacttt gtgatcggaa gtcaaccatg 1321 gtggcccagt cacaaatagg tttcatcgat ttcatagtag agccaacatt ttctcttctg 1381 acagactcaa cagagaaaat tgttattcct cttatagagg aagcctcaaa agccgaaact 1441 tcttcctatg tggcaagcag ctcaaccacc attgtggggt tacacattgc tgatgcacta 1501 agacgatcaa atacaaaagg ctccatgagt gatgggtcct attccccaga ctactccctt 1561 gcagcagtgg acctgaagag tttcaagaac aacctggtgg acatcattca gcagaacaaa 1621 gagaggtgga aagagttagc tgcacaagaa gcaagaacca gttcacagaa gtgtgagttt 1681 attcatcagt aaacaccttt aagtaaaacc tcgtgcatgg tggcagctct aatttgacca 1741 aaagacttgg agattttgat tatgcttgct ggaaatctac cctgtcctgt gtgagacagg 1801 aaatctattt ttgcagattg ctcaataagc atcatgagcc acataaataa cagctgtaaa 1861 ctccttaatt caccgggctc aactgctacc gaacagattc atctagtggc tacatcagca 1921 ccttgtgctt tcagatatct gtttcaatgg cattttgtgg catttgtctt taccgagtgc 1981 caataaattt tctttgagca aaaaaaaa // LOCUS HSPDE1C1A 2694 bp mRNA PRI 12-APR-1996 DEFINITION Human 3',5' cyclic nucleotide phosphodiesterase (HSPDE1C1A) mRNA, complete cds. ACCESSION U40371 NID g1151110 KEYWORDS calmodulin-stimulated phosphodiesterase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2694) AUTHORS Loughney,K., Martins,T.J., Harris,E.A., Sadhu,K., Hicks,J.B., Sonnenburg,W.K., Beavo,J.A. and Ferguson,K. TITLE Isolation and characterization of cDNAs corresponding to two human calcium, calmodulin-regulated, 3',5'-cyclic nucleotide phosphodiesterases JOURNAL J. Biol. Chem. 271 (2), 796-806 (1996) MEDLINE 96132810 REFERENCE 2 (bases 1 to 2694) AUTHORS Loughney,K., Martins,T.J., Harris,E.A.S., Sadhu,K., Hicks,J.B., Sonnenburg,W.K., Beavo,J.A. and Ferguson,K. TITLE Direct Submission JOURNAL Submitted (07-NOV-1995) Kate Loughney, ICOS, 22021 20th Ave. S.E., Bothell, WA 98021, USA FEATURES Location/Qualifiers source 1..2694 /organism="Homo sapiens" /db_xref="taxon:9606" gene 177..2081 /gene="PDE1C" CDS 177..2081 /gene="PDE1C" /note="PDE1C1; splice variant" /codon_start=1 /product="3',5' cyclic nucleotide phosphodiesterase" /db_xref="PID:g1151111" /translation="MESPTKEIEEFESNSLKYLQPEQIEKIWLRLRGLRKYKKTSQRL RSLVKQLERGEASVVDLKKNLEYAATVLESVYIDETRRLLDTEDELSDIQSDAVPSEV RDWLASTFTRQMGMMLRRSDEKPRFKSIVHAVQAGIFVERMYRRTSNMVGLSYPPAVI EALKDVDKWSFDVFSLNEASGDHALKFIFYELLTRYDLISRFKIPISALVSFVEALEV GYSKHKNPYHNLMHAADVTQTVHYLLYKTGVANWLTELEIFAIIFSAAIHDYEHTGTT NNFHIQTRSDPAILYNDRSVLENHHLSAAYRLLQDDEEMNILINLSKDDWREFRTLVI EMVMATDMSCHFQQIKAMKTALQQPEAIEKPKALSLMLHTADISHPAKAWDLHHRWTM SLLEEFFRQGDREAELGLPFSPLCDRKSTMVAQSQVGFIDFIVEPTFTVLTDMTEKIV SPLIDETSQTGGTGQRRSSLNSISSSDAKRSGVKTSGSEGSAPINNSVISVDYKSFKA TWTEVVHINRERWRAKVPKEEKAKKEAEEKARLAAEEQQKEMEAKSQAEEGASGKAEK KTSGETKNQVNGTRANKSDNPRGKNSKAEKSSGEQQQNGDFKDGKNKTDKKDHSNIGN DSKKTDDSQE" BASE COUNT 777 a 618 c 652 g 647 t ORIGIN 1 gtcgcttcaa tatttcaaaa tggatccggt tctgtggcgg gtgcgagagt gaggctgtgg 61 gggacctcca ggccgaacct ccgcgaagcc tcgcggcttc tgcgtgccct ggccccggga 121 ggataaggat ttcccttccc tcctacttgc gcgcggagcc gagctcttgt tgagctatgg 181 agtcgccaac caaggagatt gaagaatttg agagcaactc tctgaaatac ctgcaaccgg 241 aacagatcga gaaaatctgg cttcggctcc gcgggctgag gaaatataag aaaacgtccc 301 agagattacg gtctttggtc aaacaattag agagagggga agcttcagtg gtagatctta 361 agaagaattt ggaatatgca gccacagtgc ttgaatctgt gtatattgat gaaacaagga 421 gactcctgga tacagaggat gagctcagtg acattcagtc agatgctgtg ccttctgagg 481 tccgagactg gctggcctcc accttcacgc ggcagatggg gatgatgctc aggaggagcg 541 acgagaagcc ccggttcaag agcatcgttc acgcagtgca ggctgggata tttgtggaga 601 gaatgtatag acggacatca aacatggttg gactgagcta tccaccagct gttattgagg 661 cattaaagga tgtggacaag tggtcctttg acgtcttttc cctcaatgag gccagtgggg 721 atcatgcact gaaatttatt ttctatgaac tactcacacg ttatgatctg atcagccgtt 781 tcaagatccc catttctgca cttgtctcat ttgtggaggc cctggaagtg ggatacagca 841 agcacaaaaa tccttaccat aacttaatgc acgctgccga tgttacacag acagtgcatt 901 acctcctcta taagacagga gtggcgaact ggctgacgga gctggagatc tttgctataa 961 tcttctcagc tgccatccat gactacgagc ataccggaac caccaacaat ttccacattc 1021 agactcggtc tgatccagct attctgtata atgacagatc tgtactggag aatcaccatt 1081 taagtgcagc ttatcgcctt ctgcaagatg acgaggaaat gaatattttg attaacctct 1141 caaaggatga ctggagggag tttcgaacct tggtaattga aatggtgatg gccacagata 1201 tgtcttgtca cttccaacaa atcaaagcaa tgaagactgc tctgcagcag ccagaagcca 1261 ttgaaaagcc aaaagcctta tcccttatgc tgcatacagc agatattagc catccagcaa 1321 aagcatggga cctccatcat cgctggacaa tgtcactcct ggaggagttc ttcagacagg 1381 gtgacagaga agcagagctg gggctgcctt tttctcctct gtgtgaccga aagtccacta 1441 tggttgctca gtcacaagta ggtttcattg atttcatcgt ggaacccacc ttcactgtgc 1501 ttacggacat gaccgagaag attgtgagtc cattaatcga tgaaacctct caaactggtg 1561 ggacaggaca gaggcgttcg agtttgaata gcatcagctc gtcagatgcc aagcgatcag 1621 gtgtcaagac ctctggttca gagggaagtg ccccgatcaa caattctgtc atctccgttg 1681 actataagag ctttaaagct acttggacgg aagtggtgca catcaatcgg gagagatgga 1741 gggccaaggt acccaaagag gagaaggcca agaaggaagc agaggaaaag gctcgcctgg 1801 ccgcagagga gcagcaaaag gaaatggaag ccaaaagcca ggctgaagaa ggcgcatctg 1861 gcaaagctga gaaaaagacg tctggagaaa ctaagaatca agtcaatgga acacgggcaa 1921 acaaaagtga caaccctcgt gggaaaaatt ccaaagccga gaagtcatca ggagaacagc 1981 aacagaatgg tgacttcaaa gatggtaaaa ataagacaga caagaaggat cactctaaca 2041 tcggaaatga ttcaaagaaa acagatgatt cacaagagta aaaaagacct catagacaat 2101 aaaagaggct gccagtgtct tgcatcattc tagctgagct tcttcattct ccttcttctc 2161 cttcttccac aaagacccat atctggagaa ggtgtacaac tttcaaacac aagcccccca 2221 ccccctgacc cttggccttc cctcacacca tctccttcca ggggatgaat ctttgggggt 2281 tggtttgagg tcttagaact ctgggggata ttcccctgag caaaacaaac aacgtgagat 2341 ttttactcaa acagaaacaa aacatgaagg ggcatcctca aaatcctttg ctaatgacct 2401 ggctttcaag gcatctgtct ggcctgatga gaatggacat cctggatatg ctgggagagg 2461 cctgaaaaaa gccacacaca cagtaattgc cattttatga ctgtcaatgc cgttacttta 2521 aatgttgtca tttttgcact ggctactgat gatacagcca tgctgacatt catcaccgca 2581 aagatgatga ttccagtctc tggttccttt cctgagtcag gaacatttgt tttctccaat 2641 ttcctttcag acttaaaatt gttcttatgc tttttttccc acttctgtaa taca // LOCUS HSPDE4C1 3495 bp RNA PRI 06-MAR-1995 DEFINITION H.sapiens HSPDE4C1 gene for 3',5'-cyclic AMP phosphodiesterase. ACCESSION Z46632 NID g727222 KEYWORDS 3',5'-cyclic AMP phosphodiesterase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3495) AUTHORS Engels,P., Sullivan,M., Muller,T. and Lubbert,H. TITLE Molecular cloning and functional expression in yeast of a human cAMP-specific phosphodiesterase subtype (PDE IV-C) JOURNAL FEBS Lett. 358 (3), 305-310 (1995) MEDLINE 95145731 REFERENCE 2 (bases 1 to 3495) AUTHORS Luebbert,H. TITLE Direct Submission JOURNAL Submitted (09-NOV-1994) Luebbert H., Sandoz Pharma Ltd., Preclinical Research, 386-222, Basel, Switzerland, 4002 FEATURES Location/Qualifiers source 1..3495 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="substantia nigra" /clone_lib="lambda ZAP substantia nigra" gene 112..2250 /gene="HSPDE4C1" CDS 112..2250 /gene="HSPDE4C1" /function="cAMP hydrolysis" /citation=[1] /codon_start=1 /evidence=experimental /product="3',5'-cyclic AMP phosphodiesterase" /db_xref="PID:g727223" /translation="MENLGVGEGAEACSRLSRSRGRHSMTRAPKHLWRQPRRPIRIQQ RFYSDPDKSAGCRERDLSPRPELRKSRLSWPVSSCRRFDLENGLSCGRRALDPQSSPG LGRIMQAPVPHSQRRESFLYRSDSDYELSPKAMSRNSSVASDLHGEDMIVTPFAQVLA SLRTVRSNVAALARQQCLGAAKQGPVGNPSSSNQLPPAEDTGQKLALETLDELDWCLD QLETLQTRHSVGEMASNKFKRILNRELTHLSETSRSGNQVSEYISRTFLDQQTEVELP KVTAEEAPQPMSRISGLHGLCHSASLSSATVPRFGVQTDQEEQLAKELEDTNKWGLDV FKVADVSGNRPLTAIIFSIFQERDLLKTFQIPADTLATYLLMLEGHYHANVAYHNSLH AADVAQSTHVLLATPALEAVFTDLEILAALFASAIHDVDHPGVSNQFLINTNSDVALM YNDASVLENHHLAVGFKLLQAENCDIFQNLSAKQRLSLRRMVIDMVLATDMSKHMNLL ADLKTMVETKKVTSLGVLLLDNYSDRIQVLQNLVHCADLSNPTKPLPLYRQWTDRIMA EFFQQGDRERESGLDISPMCDKHTASVEKSQVGFIDYIAHPLWETWADLVHPDAQDLL DTLEDNREWYQSKIPRSPSDLTNPERDGPDRFQFELTLEEAEEEDEEEEEEGEETALA KEALELPDTELLSPEAGPDPGDLPLDNQRT" BASE COUNT 815 a 1014 c 1006 g 660 t ORIGIN 1 ccagtctgcg gaccgttcgg agcaacgtgg cggcccttgc ccgccagcaa tgcctaggag 61 cagccaagca gggacccgtc ggaaaccctt catccagcct ttctctggcg catggagaac 121 ctgggggtcg gcgaaggggc agaggcttgc agcaggttga gtcgctctcg cggccgccac 181 agcatgacca gagccccgaa gcacctgtgg cggcaacccc ggcgccccat ccgcatccaa 241 cagcgcttct attcggatcc ggacaagtcc gcgggctgcc gcgagaggga cctgagcccg 301 cggccggagc tcaggaagtc gcggctctcc tggcccgttt cctcctgcag gcgctttgac 361 ctggaaaatg ggctctcgtg tgggaggagg gccctggacc ctcagtccag ccctggcctg 421 ggccggatta tgcaggctcc agtcccgcac agccagcggc gcgagtcctt cctgtaccgc 481 tcagatagcg actatgaact ctcgcccaag gccatgtctc ggaactcctc tgtggccagc 541 gacctacatg gagaggacat gattgtgacg ccctttgccc aggtcctggc cagtctgcgg 601 accgttcgga gcaacgtggc ggcccttgcc cgccagcaat gcctaggagc agccaagcag 661 ggacccgtcg gaaacccttc atccagcaat cagctccctc ctgcagagga cacggggcag 721 aagctggcat tggagacgct agacgagctg gactggtgcc tggatcagtt ggagacgctg 781 cagacccggc actcggtggg ggagatggcc tccaacaagt tcaagcggat cctgaaccgg 841 gagttgaccc acctgtccga aaccagccgc tccgggaacc aggtgtccga gtacatctcc 901 cggaccttcc tggaccagca gaccgaggtg gagctgccca aggtgaccgc tgaggaggcc 961 ccacagccca tgtcccggat cagtggccta catgggctct gccacagtgc cagcctctcc 1021 tcagccactg tcccacgctt tggggtccag actgaccagg aggagcaact ggccaaggag 1081 ctagaagaca ccaacaagtg gggacttgat gtgttcaagg tggcggacgt aagtgggaac 1141 cggcccctca cagctatcat attcagcatt tttcaggagc gggacctgct gaagacattc 1201 cagatcccag cagacacact ggccacctac ctgctgatgc tggagggtca ctaccacgcc 1261 aatgtggcct accacaacag cctacatgcc gccgacgtgg cccagtccac gcatgtgctg 1321 ctggctacgc ccgccctcga ggctgtgttc acagacttgg aaatcctggc tgccctcttt 1381 gcaagcgcca tccacgacgt ggaccatcct ggggtctcca accagtttct gattaacacc 1441 aactcagacg tggcgcttat gtacaacgac gcctcggtgc tggagaacca tcacctggct 1501 gtgggcttca agctgctgca ggcagagaac tgcgatatct tccagaacct cagcgccaag 1561 cagcgactga gtctgcgcag gatggtcatt gacatggtgc tggccacaga catgtccaaa 1621 cacatgaacc tcctggccga cctcaagacc atggtggaga ccaagaaggt gacaagcctc 1681 ggtgtcctcc tcctggacaa ctattccgac cgaatccagg tcttgcagaa cctggtgcac 1741 tgtgctgatc tgagcaaccc caccaagccg ctgcccctgt accgccagtg gacggaccgc 1801 atcatggccg agttcttcca gcagggagac cgcgagcgtg agtcgggcct ggacatcagt 1861 cccatgtgtg acaagcatac ggcctcagtg gagaagtccc aggtgggttt cattgactac 1921 attgctcacc cactgtggga gacttgggct gacctggtcc acccagatgc acaggacctg 1981 ctggacacgc tggaggacaa tcgagagtgg taccagagca agatcccccg aagtccctca 2041 gacctcacca accccgagcg ggacgggcct gacagattcc agtttgaact gactctggag 2101 gaggcagagg aagaggatga ggaggaagaa gaggaggggg aagagacagc tttagccaaa 2161 gaggccttgg agttgcctga cactgaactc ctgtcccctg aagccggccc agaccctggg 2221 gacttacccc tcgacaacca gaggacttag ggccagccct gcgtgaactg caggggcaat 2281 ggatggtaaa gccctttggc tcttggcagg cagactttcc aggaagaggc tccatgtggc 2341 tcctgcttca ctttcccacc catttaggga gacaatcaag ctcttagtta taggtggctc 2401 ccagggtcta attggaggca cctggctggg gtccactctg accctagact tgcctaaaag 2461 agctctctaa ggggcagcct cttacgatgc cctggtgtct ttctcctggg cttctatccc 2521 tgtgaggaga ggtgctgtct gctggagcct ctagtccacc ctctccagtg gtcactcttg 2581 agtcacatct gtcacttaat tatttccttc tttatcaaat atttattgct catctacttc 2641 gggccagctt tctgcctctg tagtagccct gcacaaaggg tggggagtca ggagaccatc 2701 ccaaaggcat ctccctgtct tcctctacca agcggctctc tgcaagagca tggaaatgtg 2761 agtggggaaa attttcagca ccaaagcttc actcataccc agttttgttt ctgaaactac 2821 ggtagggggc aggaagagga gcagaaaaga agggctgggc aaggcatagt ggcttatgcc 2881 tgtaatcccg gtactttggg aggctgaggt gggaggactg cttaagctca ggagtttgag 2941 accagcctgg gcaacatagc aagaccccca ccatctctga aaaaaaaaat tagccaggca 3001 tggtggtgtg cacctgagaa tcccagctac tcagaaggtt gagacaaagg ggatcgcttg 3061 agcccaggag ttggaggctg aagagagcta tgactgcatc actgcactcc agcctgggca 3121 acacagcaag atcctgtcta aaaataaaaa gaaaagagaa ggaaaggaaa gagacggggc 3181 tctgaggccg agcacagtgg cccatgccta taatcccagc actttgggag gctgaggcag 3241 gtggatcacc tgaggttagg agttcgagac cagcctggcc aacatggtga aaccccatct 3301 ctactaaaaa tacaaaaatt ggctgggcat ggtggcgggt gcctgtaatc ccagctactg 3361 gggaggctga ggcaggagaa tcacttgaat tcaggaggtg gaggttgcag tgagccgaca 3421 tcatgccact gcactccagc ctggggctga cagagcaaga cactgtctca aaaaagaaaa 3481 aaaaaaaaaa aaaaa // LOCUS HSPDE7A 1739 bp mRNA PRI 06-AUG-1997 DEFINITION Human cAMP phosphodiesterase (Pde7A2) mRNA, complete cds. ACCESSION U67932 NID g2306763 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1739) AUTHORS Han,P., Zhu,X. and Michaeli,T. TITLE Alternative splicing of the high affinity cAMP-specific phosphodiesterase (PDE7A) mRNA in human skeletal muscle and heart JOURNAL J. Biol. Chem. 272 (26), 16152-16157 (1997) MEDLINE 97341143 REFERENCE 2 (bases 1 to 1739) AUTHORS Han,P. and Michaeli,T. TITLE Direct Submission JOURNAL Submitted (23-AUG-1996) Department of Developmental & Molecular Biology, Albert Einstein College of Medicine, 1300 Morris Park Ave., Bronx, NY 10461, USA FEATURES Location/Qualifiers source 1..1739 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="skeletal muscle and heart" /chromosome="8" /map="8q13-q21" /dev_stage="fetus" mRNA 1..1739 /gene="Pde7A2" gene 1..1739 /gene="Pde7A2" CDS 157..1527 /gene="Pde7A2" /codon_start=1 /product="cAMP phosphodiesterase" /db_xref="PID:g2306764" /translation="MGITLIWCLALVLIKWITSKRRGAISYDSSDQTALYIRMLGDVR VRSRAGFESERRGSHPYIDFRIFHSQSEIEVSVSARNIRRLLSFQRYLRSSRFFRGTA VSNSLNILDDDYNGQAKCMLEKVGNWNFDIFLFDRLTNGNSLVSLTFHLFSLHGLIEY FHLDMMKLRRFLVMIQEDYHSQNPYHNAVHAADVTQAMHCYLKEPKLANSVTPWDILL SLIAAATHDLDHPGVNQPFLIKTNHYLATLYKNTSVLENHHWRSAVGLLRESGLFSHL PLESRQQMETQIGALILATDISRQNEYLSLFRSHLDRGDLCLEDTRHRHLVLQMALKC ADICNPCRTWELSKQWSEKVTEEFFHQGDIEKKYHLGVSPLCDRHTESIANIQIGFMT YLVEPLFTEWARFSNTRLSQTMLGHVGLNKASWKGLQREQSSSEDTDAAFELNSQLLP QENRLS" BASE COUNT 499 a 338 c 384 g 518 t ORIGIN 1 ggggatcact gttggaaggc agctgcttga ggtccaaggc agtcagtgtc ccctctcttt 61 tgcctcggga cagctggtat ttatcagact cctaagaagt tttccttgct ccctagtaga 121 agagagagat tatgcagcgg gcttttgatt gatccaatgg gaattacatt gatctggtgt 181 ctggccttgg ttcttatcaa gtggatcacc tctaagaggc gtggagctat ttcctatgac 241 agttctgatc agactgcatt atacattcgt atgctaggag atgtacgtgt aaggagccga 301 gcaggatttg aatcagaaag aagaggttct cacccatata ttgattttcg tattttccac 361 tctcaatctg aaattgaagt gtctgtctct gcaaggaata tcagaaggct actaagtttc 421 cagcgatatc ttagatcttc acgctttttt cgtggtactg cggtttcaaa ttccctaaac 481 attttagatg atgattataa tggacaagcc aagtgtatgc tggaaaaagt tggaaattgg 541 aattttgata tctttctatt tgatagacta acaaatggaa atagtctagt aagcttaacc 601 tttcatttat ttagtcttca tggattaatt gagtacttcc atttagatat gatgaaactt 661 cgtagatttt tagttatgat tcaagaagat taccacagtc aaaatcctta ccataacgca 721 gtccacgctg cggatgttac tcaggccatg cactgttact taaaggaacc taagcttgcc 781 aattctgtaa ctccttggga tatcttgctg agcttaattg cagctgccac tcatgatctg 841 gatcatccag gtgttaatca acctttcctt attaaaacta accattactt ggcaacttta 901 tacaagaata cctcagtact ggaaaatcac cactggagat ctgcagtggg cttattgaga 961 gaatcaggct tattctcaca tctgccatta gaaagcaggc aacaaatgga gacacagata 1021 ggtgctctga tactagccac agacatcagt cgccagaatg agtatctgtc tttgtttagg 1081 tcccatttgg atagaggtga tttatgccta gaagacacca gacacagaca tttggtttta 1141 cagatggctt tgaaatgtgc tgatatttgt aacccatgtc ggacgtggga attaagcaag 1201 cagtggagtg aaaaagtaac ggaggaattc ttccatcaag gagatataga aaaaaaatat 1261 catttgggtg tgagtccact ttgcgatcgt cacactgaat ctattgccaa catccagatt 1321 ggttttatga cttacctagt ggagccttta tttacagaat gggccaggtt ttccaataca 1381 aggctatccc agacaatgct tggacacgtg gggctgaata aagccagctg gaagggactg 1441 cagagagaac agtcgagcag tgaggacact gatgctgcat ttgagttgaa ctcacagtta 1501 ttacctcagg aaaatcggtt atcataaccc ccagaaccag tgggacaaac tgcctcctgg 1561 aggtttttag aaatgtgaaa tggggtcttg aggtgagaga acttaactct tgactgccaa 1621 ggtttccaag tgagtgatgc cagccagcat tatttatttc caagatttcc tctgttggat 1681 catttgaacc cacttgttaa ttgcaagacc cgaacataca gcaatatgaa tttggcttt // LOCUS HSPDGFA 2305 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for platelet-derived growth factor PDGF-A. ACCESSION X06374 NID g35363 KEYWORDS growth factor; platelet-derived growth factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2305) AUTHORS Hoppe,J., Schumacher,L., Eichner,W. and Weich,H.A. TITLE The long 3'-untranslated regions of the PDGF-A and -B mRNAs are only distantly related JOURNAL FEBS Lett. 223 (2), 243-246 (1987) MEDLINE 88030061 FEATURES Location/Qualifiers source 1..2305 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="U-2OS osteosarcoma cells" /clone="pPG2-F." /chromosome="chromosome 7" CDS 404..994 /note="PDGF-A (AA 1 - 196)" /codon_start=1 /db_xref="PID:g35364" /db_xref="SWISS-PROT:P04085" /translation="MRTLACLLLLGCGYLAHVLAEEAEIPREVIERLARSQIHSIRDL QRLLEIDSVGSEDSLDTSLRAHGVHATKHVPEKRPLPIRRKRSIEEAVPAVCKTRTVI YEIPRSQVDPTSANFLIWPPCVEVKRCTGCCNTSSVKCQPSRVHHRSVKVAKVEYVRK KPKLKEVQVRLEEHLECACATTSLNPDYREEDTDVR" polyA_site 2305 /note="polyA site" BASE COUNT 500 a 617 c 595 g 593 t ORIGIN 1 ttcttggggc tgatgtccgc aaatatgcag aattaccggc cgggtcgctc ctgaagccag 61 cgcggggagc gagcgcggcg gcggccagca ccgggaacgc accgaggaag aagcccagcc 121 cccgccctcc gccccttccg tccccacccc ctacccggcg gcccaggagg ctccccggct 181 gcggcgcgca ctccctgttt ctcctcctcc tggctggcgc tgcctgcctc tccgcactca 241 ctgctcgccg ggcgccgtcc gccagctccg tgctccccgc gccaccctcc tccgggccgc 301 gctccctaag ggatggtact gaatttcgcc gccacaggag accggctgga gcgcccgccc 361 cgcgcctcgc ctctcctccg agcagccagc gcctcgggac gcgatgagga ccttggcttg 421 cctgctgctc ctcggctgcg gatacctcgc ccatgttctg gccgaggaag ccgagatccc 481 ccgcgaggtg atcgagaggc tggcccgcag tcagatccac agcatccggg acctccagcg 541 actcctggag atagactccg tagggagtga ggattctttg gacaccagcc tgagagctca 601 cggggtccac gccactaagc atgtgcccga gaagcggccc ctgcccattc ggaggaagag 661 aagcatcgag gaagctgtcc ccgctgtctg caagaccagg acggtcattt acgagattcc 721 tcggagtcag gtcgacccca cgtccgccaa cttcctgatc tggcccccgt gcgtggaggt 781 gaaacgctgc accggctgct gcaacacgag cagtgtcaag tgccagccct cccgcgtcca 841 ccaccgcagc gtcaaggtgg ccaaggtgga atacgtcagg aagaagccaa aattaaaaga 901 agtccaggtg aggttagagg agcatttgga gtgcgcctgc gcgaccacaa gcctgaatcc 961 ggattatcgg gaagaggaca cggatgtgag gtgaggatga gccgcagccc tttcctggga 1021 catggatgta catggcgtgt tacattcctg aacctactat gtacggtgct ttattgccag 1081 tgtgcggtct ttgttctcct ccgtgaaaaa ctgtgtccga gaacactcgg gagaacaaag 1141 agacagtgca catttgttta atgtgacatc aaagcaagta ttgtagcact cggtgaagca 1201 gtaagaagct tccttgtcaa aaagagagag agagagagag agagagaaaa caaaaccaca 1261 aatgacaaaa acaaaacgga ctcacaaaaa tatctaaact cgatgagatg gagggtcgcc 1321 ccgtgggatg gaagtgcaga ggtctcagca gactggattt ctgtccgggt ggtcacaggt 1381 gcttttttgc cgaggatgca gagcctgctt tgggaacgac tccagagggg tgctggtggg 1441 ctctgcaggg cccgcaggaa gcaggaatgt cttggaaacc gccacgcgaa ctttagaaac 1501 cacacctcct cgctgtagta tttaagccca tacagaaacc ttcctgagag ccttaagtgg 1561 tttttttttt tgtttttgtt ttgttttttt tttttttgtt tttttttttt tttttttttt 1621 ttacaccata aagtgattat taagcttcct tttactcttt ggctagcttt tttttttttt 1681 tttttttttt ttttttttaa ttatctcttg gatgacattt acaccgataa cacacaggct 1741 gctgtaactg tcaggacagt gcgacggtat ttttcctagc aagatgcaaa ctaatgagat 1801 gtattaaaat aaacatggta tacctaccta tgcatcattt cctaaatgtt tctggctttg 1861 tgtttctccc ttaccctgct ttatttgtta atttaagcca ttttgaaaga actatgcgtc 1921 aaccaatcgt acgccgtccc tgcggcacct gccccagagc ccgtttgtgg ctgagtgaca 1981 acttgttccc cgcagtgcac acctagaatg ctgtgttccc acgcggcacg tgagatgcat 2041 tgccgcttct gtctgtgttg ttggtgtgcc ctggtgccgt ggtggcggtc actccctctg 2101 ctgccagtgt ttggacagaa cccaaattct ttatttttgg taagatattg tgctttacct 2161 gtattaacag aaatgtgtgt gtgtggtttg tttttttgta aaggtgaagt ttgtatgttt 2221 acctaatatt acctgttttg tatacctgag agcctgctat gttcttcttt tgttgatcca 2281 aaattaaaaa aaaaatacca ccaac // LOCUS HSPDGFBB 514 bp RNA PRI 26-MAY-1993 DEFINITION H.sapiens synthetic gene for platelet-derived growth factor-BB. ACCESSION X63966 NID g311378 KEYWORDS platelet-derived growth factor; platelet-derived growth factor-BB. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 514) AUTHORS Cook,A.L., Kirwin,P.M., Craig,S., Bawden,L.J., Green,D.R., Price,M.J., Richardson,S.J., Fallon,A., Drummond,A.H., Edwards,R.M. and Clements,J.M. TITLE Purification and analysis of proteinase-resistant mutants of recombinant platelet-derived growth factor-BB exhibiting improved biological activity JOURNAL Biochem. J. 281 (Pt 1), 57-65 (1992) MEDLINE 92117992 REFERENCE 2 (bases 1 to 514) AUTHORS Clements,J.M. TITLE Direct Submission JOURNAL Submitted (11-MAY-1992) J.M. Clements, British Biotechnology, Watlington RD, Cowley, Oxford OX4 5LY, UK FEATURES Location/Qualifiers source 1..514 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 14..499 /codon_start=1 /product="platelet-derived growth factor-BB" /db_xref="PID:g35377" /translation="MSLGSLTIAEPAMIAECKTRTEVFEISRRLIDRTNANFLVWPPC VEVQRCSGCCNNRNVQCRPTQVQLRPVQVRKIEIVRKKPIFKKATVTLEDHLACKCET VAAARPVTRSPGGSQEQRAKTPQTRVTIRTVRVRRPPKGKHRKFKHTHDKTALKETLG A" BASE COUNT 142 a 141 c 113 g 118 t ORIGIN 1 aagcttacct gctatgtcct tgggttcgtt aaccatcgct gaaccggcta tgatcgccga 61 atgtaagacg cgtaccgaag ttttcgaaat ctcgagacgt ttgattgacc gcaccaacgc 121 caacttcctg gtttggccgc catgtgttga agtccaacgc tgcagtggtt gctgtaacaa 181 cagaaacgtt cagtgtcgac ctactcaggt tcaactgcgt cctgtccaag ttcgtaagat 241 cgaaattgta cgtaagaaac caatcttcaa gaaagccact gtaactctag aagaccacct 301 ggcatgcaag tgtgaaactg ttgcagctgc tcgccctgtt actagatctc cgggtggttc 361 ccaggaacaa cgcgctaaaa ccccacaaac ccgggttacc atcagaactg ttcgcgtccg 421 tagacctccc aagggtaaac accgcaaatt caagcacacc cacgacaaaa ccgctttaaa 481 ggaaacctta ggtgcttagt aaggatccga attc // LOCUS HSPDZDOMA 5350 bp RNA PRI 17-SEP-1997 DEFINITION Homo sapiens mRNA for PDZ domain protein. ACCESSION AJ001306 NID g2370148 KEYWORDS PDZ domain protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5350) AUTHORS Philipp,S. TITLE Direct Submission JOURNAL Submitted (28-AUG-1997) Philipp S., Institut fuer Pharmakologie und Toxikologie, Universitaet des Saarlandes, Gebaeude 46, 66421 Homburg/Saar, Saarland, 66421 Homburg, GERMANY REFERENCE 2 (bases 1 to 5350) AUTHORS Philipp,S. and Flockerzi,V. TITLE Molecular characterization of a novel human PDZ domain protein with homology to INAD from Drosophila melanogaster JOURNAL FEBS Lett. 413 (2), 243-248 (1997) MEDLINE 97424368 REFERENCE 3 (bases 1 to 5350) AUTHORS Lennon,G., Auffray,C., Polymeropoulos,M. and Soares,M.B. TITLE The I.M.A.G.E. Consortium: an integrated molecular analysis of genomes and their expression JOURNAL Genomics 33 (1), 151-152 (1996) MEDLINE 96224170 COMMENT Related sequence AA005420. FEATURES Location/Qualifiers source 1..5350 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /cell_type="epithelial" /clone="I.M.A.G.E. clone ID 428338" CDS 115..4689 /codon_start=1 /product="PDZ domain protein" /db_xref="PID:e339121" /db_xref="PID:g2370149" /translation="MPENPATDKLQVLQVLDRLKMKLQEKGDTSQNEKLSMFYETLKS PLFNQILTLQQSIKQLKGQLNHIPSDCSANFDFSRKGLLVFTDGSITNGNVHRPSNNS TVSGLFPWTPKLGNEDFNSVIQQMAQGRQIEYIDIERPSTGGLGFSVVALRSQNLGKV DIFVKDVQPGSVADRDQRLKENDQILAINHTPLDQNISHQQAIALLQQTTGSLRLIVA REPVHTKSSTSSSLNDTTLPETVCWGHVEEVELINDGSGLGFGIVGGKTSGVVVRTIV PGGLADRDGRLQTGDHILKIGGTNVQGMTSEQVAQVLRNCGNSVRMLVARDPAGDISV TPPAPAALPVALPTVASKGPGSDSSLFETYNVELVRKDGQSLGIRIVGYVGTSHTGEA SGIYVKSIIPGSAAYHNGHIQVNDKIVAVDGVNIQGFANHDVVEVLRNAGQVVHLTLV RRKTSSSTSPLEPPSDRGTVVEPLKPPALFLTGAVETETNVDGEDEEIKERIDTLKND NIQALEKLEKVPDSPENELKSRWENLLGPDYEVMVATLDTQIADDAELQKYSKLLPIH TLRLGVEVDSFDGHHYISSIVSGGPVDTLGLLQPEDELLEVNGMQLYGKSRREAVSFL KEVPPPFTLVCCRRLFDDEASVDEPRRTETSLPETEVDHNMDVNTEEDDDGELALWSP EVKIVELVKDCKGLGFSILDYQDPLDPTRSVIVIRSLVADGVAERSGGLLPGDRLVSV NEYRLDNTSLAEAVEILKAVPPGLVHLGICKPLVEDNEEESCYILHSSSNEDKTEFSG TIHDINSSLILEAPKGFRDEPYFKEELVDEPFLDLGKSFHSQQKEIEQSKEAWEMHEF LTPRLQEMDEEREMLVDEEYELYQDPSPSMELYPLSHIQEATPVPSVNELHFGTQWLH DNEPSESQEARTGRTVYSQEAQPYGYCPENVMKENFVMESLPSVPSTEGNSQQGRFDD LENLNSLAKTSLDLGMIPNDVQGPSLLIDLPVVAQRREQEDLPLYQHQATRVISKASA YTGMLSSRYATDTCELPEREEGEGEETPNFSHWGPPRIVEIFREPNVSLGISIVGGQT VIKRLKNGEELKGIFIKQVLEDSPAGKTNALKTGDKILEVSGVDLQNASHSEAVEAIK NAGNPVVFIVQSLSSTPRVIPNVHNKANKITGNQNQDTQEKKEKRQGTAPPPMKLPPP YKALTDDSDENEEEDAFTDQKIRQRYADLPGELHIIELEKDKNGLGLSLAGNKDRSRM SIFVVGINPEGPAAADGRMHIGDELLEINNQILYGRSHQNASAIIKTAPSKVKLVFIR NEDAVNQMAVTPFPVPSSSPSSIEDQSGTEPISSEEDGSLEVGIKQLPESESFKLAVS QMKQQKYPTKVSFSSQEIPLAPASSYHSTDADFTGYGGFQAPLSVDPATCPIVPGQEM IIEISKRRSGLGLSIVGGKDTPLVNGVDLRNSSHEEAITALRQTPQKVRLVVYRDEAH YRDEENLEIFPVDLQKKAGRGLGLSIVGKR" BASE COUNT 1648 a 1086 c 1266 g 1350 t ORIGIN 1 ctcacttccg cccaggtgag gcagggccga caccgagccc gcccgacccg ggctcccacc 61 tgctcctcca gcgcaccagg tgtctttaag agtgattgaa gagaataatt caaaatgcct 121 gaaaatcctg ctacagataa actgcaggtg ctgcaggtac ttgatcgcct gaaaatgaaa 181 ttgcaggaga agggtgacac gtcgcagaat gagaagttat ctatgtttta tgagacacta 241 aagagtcctc tcttcaacca gatactcaca cttcagcagt ccatcaagca actgaagggt 301 caactcaacc atataccctc agattgttca gccaactttg atttttctag gaaaggtttg 361 ttagtgttca cagatggttc cattactaat ggaaatgtcc acaggccctc taataactcg 421 actgtatctg ggttatttcc gtggaccccg aagttgggaa atgaagactt taactcagtc 481 attcaacaga tggctcaggg ccggcaaatt gaatatatag atatagaacg gccttcaact 541 ggaggccttg gattcagtgt ggtggccctc agaagtcaaa atctcggaaa agttgatatc 601 ttcgtgaagg atgtccagcc agggagtgta gcagacaggg atcaaagatt aaaggaaaat 661 gatcaaatat tggccattaa tcacacgcca ttggatcaga acatttccca tcagcaagca 721 attgcattat tacaacaaac cactggatct ttgagactga ttgtggccag ggaaccagtc 781 cacacaaaaa gcagtacttc tagcagccta aatgatacaa ctctgcctga aacagtttgt 841 tggggccatg ttgaagaggt tgagctcatt aatgatggct ctggactagg ttttggaata 901 gttggaggaa aaacaagtgg cgtggttgtg aggactatag ttcctggagg attagcagat 961 cgagatggaa gactccagac aggggaccac atcttgaaga ttggtggcac aaacgtgcag 1021 ggaatgacca gtgagcaagt tgcacaagtt ctaaggaact gtgggaattc agtcaggatg 1081 ctcgttgcta gagatccagc tggtgacatt tcagtcaccc cccctgcccc tgcagcctta 1141 cctgttgccc tgcctactgt agccagcaag ggccctggtt ctgacagttc tctttttgaa 1201 acttataatg ttgagcttgt gagaaaagat gggcagagtc ttggaattag aattgttggc 1261 tatgttggaa catctcatac aggggaagct tcagggattt atgtgaaaag tataatacct 1321 ggcagtgctg cgtaccacaa tggccacatt caagtgaatg acaaaatagt tgctgtcgat 1381 ggcgtgaaca ttcagggttt tgccaaccat gatgttgttg aagtattacg aaatgcaggg 1441 caggtggtac acctaaccct agttcgaagg aagacatcct catctacttc tccacttgaa 1501 ccaccttcag acagaggaac tgttgtagaa ccactgaaac caccagctct ctttctaact 1561 ggagcagtgg aaactgaaac taatgtggat ggtgaagatg aggaaattaa agaaagaatt 1621 gatactttaa aaaatgacaa catacaagcc ttagaaaaat tggaaaaagt cccagactct 1681 ccagaaaatg agctgaaatc cagatgggaa aacctgttgg gtcctgatta tgaagtaatg 1741 gttgctactt tggacacaca gattgcagat gatgctgagt tacagaaata ttcaaagctg 1801 ctgcctattc acactctgag gcttggtgtg gaagtggatt cctttgatgg gcaccattat 1861 atttcttcaa ttgtttctgg tggtcctgtt gatacattgg gtctcctaca gccagaagat 1921 gagctgcttg aggtcaatgg catgcagctt tatggaaaat ctcgccgaga agcagtctcc 1981 tttcttaaag aagtgccacc cccttttact ttggtttgct gtcggaggtt gtttgatgat 2041 gaagcttctg tagatgaacc aaggcgcact gaaacctctc ttcctgagac agaggttgac 2101 cacaatatgg atgtcaatac tgaagaagat gatgatgggg aattagcact gtggtcccct 2161 gaagtcaaga ttgttgaact agtaaaagat tgtaaaggtt tgggattcag cattttggat 2221 taccaggacc ctttagatcc tacaagatca gtgattgtga tccgctccct ggtagcagat 2281 ggtgtagcag aaagaagtgg gggactatta cctggagacc gcctggtctc agtcaatgaa 2341 taccgtttgg acaacacctc acttgctgaa gctgtggaaa tattgaaagc tgtgccacca 2401 ggcctagtac accttggcat ctgtaagcct ttggtggaag ataatgaaga agaaagttgt 2461 tatattttac attcaagcag taatgaagac aagactgaat tttcaggaac aattcatgat 2521 ataaattcat ctttaatact cgaagcaccc aagggattta gagatgaacc atattttaaa 2581 gaagaacttg tggatgaacc atttctagat ctgggaaagt ctttccattc ccaacaaaaa 2641 gagatagagc aaagcaagga ggcctgggag atgcatgaat ttctgactcc tagattgcag 2701 gaaatggatg aagaaagaga aatgcttgtt gatgaagaat atgagttata tcaagatccc 2761 tcaccatcca tggagttgta tcccttgtcg cacattcaag aggccactcc tgtgccctct 2821 gtgaatgaac ttcactttgg tacacagtgg ttgcatgata atgaaccatc cgagtctcaa 2881 gaggcaagaa ccgggaggac tgtctattcc caggaggcac agccgtatgg ctattgccct 2941 gaaaatgtga tgaaagaaaa ttttgtcatg gagtccctac catctgtacc atcaactgaa 3001 ggaaacagtc aacaaggcag atttgacgac ctggaaaatc ttaattcatt agcaaaaact 3061 agtctggatt taggcatgat cccgaatgat gtccaaggtc ctagcttgct cattgacctt 3121 cctgttgtgg ctcaaaggag ggagcaagaa gatttgcctt tatatcaaca ccaagcgaca 3181 cgagttattt ccaaggcctc agcatacaca ggaatgttgt cttctagata tgccactgat 3241 acatgtgagt tacctgagag agaagaaggc gaaggagaag aaactccaaa ttttagccac 3301 tggggtccac cgagaattgt tgagattttt agagaaccca atgtgtctct tgggatcagt 3361 attgttggtg gacaaactgt tataaaacgt ctaaagaatg gagaggagct taaaggtata 3421 ttcatcaaac aagttttaga agacagtcca gcagggaaga cgaacgcact taaaactgga 3481 gataaaatac ttgaggtgtc tggagtagat ttgcagaatg cctcacacag cgaagcagtt 3541 gaggccatta agaatgcagg aaaccctgtg gtgttcattg ttcagagttt gtcatccact 3601 ccacgagtca ttcctaacgt acataacaag gccaacaaaa tcaccggtaa ccagaaccag 3661 gacacccaag aaaagaaaga aaagaggcaa ggaactgctc caccgccaat gaaacttcct 3721 cctccttata aagctctgac tgatgacagt gatgaaaatg aagaagaaga tgcctttacc 3781 gaccaaaaaa tcagacaaag atatgcagat ctgcctggag aactgcacat tattgaactt 3841 gaaaaagata agaatggact tggactcagc cttgctggta ataaagaccg atcacgcatg 3901 agcatatttg tggtgggaat taacccggaa ggacctgctg ccgcagatgg acgaatgcat 3961 attggagatg aactcttaga gataaacaat cagattctgt atggaagaag tcaccaaaat 4021 gcatctgcca ttattaagac tgccccatca aaggtcaagc tggttttcat cagaaacgag 4081 gatgcagtca atcagatggc cgttactccc tttccagtgc catcaagttc tccatcttct 4141 attgaggatc agagcggcac cgaacctatt agtagtgagg aagatggcag cctcgaagtt 4201 ggtattaaac aattgcctga aagtgaaagc ttcaaactgg ctgtcagcca gatgaaacag 4261 caaaaatatc caacaaaagt ctccttcagt tcacaagaga taccattagc accagcttca 4321 tcataccatt caacagatgc agacttcaca ggctatggtg gtttccaggc tcctctgtca 4381 gtggaccccg caacgtgtcc cattgtccct ggacaggaaa tgattataga aatatccaag 4441 agacgttcag ggcttggtct cagcattgtg ggaggaaaag acacaccctt ggttaatggg 4501 gttgacctga ggaactccag ccacgaagaa gccatcacag ccctgaggca gaccccccag 4561 aaggtgcggc tggtggtgta tagagatgag gcacactacc gggatgagga gaacttggag 4621 attttccctg tggatctgca gaagaaagct ggccggggcc tgggcctgag catcgttggg 4681 aaacggtaaa gacgtgctgt gggagttggg atctgccttt ttcatccaga gctctgatgc 4741 ctgtgaacac tgaaagagaa agcctaatgt aaagtagtga tgggatttct aaaaataaga 4801 tatttatgaa aatttgacaa catgggtcat atttctgagc aaggtcttac cagaaaaaat 4861 tgtcatatca agatagaact ccaagtccaa tcaatccaga ctgatatatt tctgtacaga 4921 gtaagaacaa ctagcaaggt tctttcactt ggaattacta agatcggagt tttgcagagg 4981 ttgattaaag caaccatacc caagaaatag ctagcatcaa gaatgagatt tatccaatgt 5041 tgggtcaaga acattgcttc gacatggaaa ttaacatgga acattgcttt tcgtgatact 5101 gttaatttca tactatgttg aaactagttg agtagacata gttaagagat aaacataatt 5161 cttcacgata gtagttttct attaagaaaa atgtcctgct gggcacagtg gcatgtacct 5221 gttgtctcag ctatgtggga agatcacttg aggccaggag ttcaaggcta tagtgtgcta 5281 tgatcatgcc tgtgaatagc cactgcactt gagcctcttg ggaaacataa caagacccca 5341 tctgtaaatt // LOCUS HSPEA15 2385 bp RNA PRI 23-JUL-1996 DEFINITION H.sapiens mRNA for major astrocytic phosphoprotein PEA-15. ACCESSION X86809 NID g854166 KEYWORDS PEA-15 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2385) AUTHORS Estelles,A., Yokoyama,M., Nothias,F., Vincent,J.D., Glowinski,J., Vernier,P. and Chneiweiss,H. TITLE The major astrocytic phosphoprotein PEA-15 is encoded by two mRNAs conserved on their full length in mouse and human JOURNAL J. Biol. Chem. 271 (25), 14800-14806 (1996) MEDLINE 96278966 REFERENCE 2 (bases 1 to 2385) AUTHORS Chneiweiss,H.M. TITLE Direct Submission JOURNAL Submitted (02-MAY-1995) H. Chneiweiss, INSERM U114/Chaire de Neuropharmacologie, College de France, 11 Place Marcelin Berthelot, F- 75231 Paris cedex 05, FRANCE COMMENT Overlaps with L31958. FEATURES Location/Qualifiers source 1..2385 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="infant" /tissue_type="brain" gene 114..506 /gene="PEA-15" CDS 114..506 /gene="PEA-15" /codon_start=1 /db_xref="PID:g854167" /translation="MVEYGTLFQDLTNNITLEDLEQLKSACKEDIPSEKSEEITTGSA WFSFLESHNKLDKDNLSIIEHIFEISRRPDLLTMVVDYRTRVLKISEEDELDTKLTRI PSAKKYKDIIRQPSEEEIIKLGPPPKKA" BASE COUNT 610 a 608 c 604 g 563 t ORIGIN 1 gccagagcgc gcggggcagt gtgcgcggga gccgaggagg aggttccgga cgctgcttag 61 gaaccgggga ctcaggagtg cccgcgccct gagcgctcag ctccagaggc gtcatggttg 121 agtacgggac cctctttcaa gacctgacca acaacatcac ccttgaagat ctagaacagc 181 tcaagtcggc ctgcaaggaa gacatcccca gcgaaaagag tgaggagatc actactggca 241 gtgcctggtt tagcttcctg gagagccaca acaagctgga caaagacaac ctctccatca 301 ttgagcacat ctttgagatc tcccgccgtc ctgacctact cactatggtg gttgactaca 361 gaacccgtgt gctgaagatc tctgaggagg atgagctgga caccaagcta acccgtatcc 421 ccagtgccaa gaagtacaaa gacattatcc ggcagccctc tgaggaagag atcatcaaat 481 tgggtccccc accgaagaag gcctgagcaa gggggaggaa gaggaggaag gttggacctt 541 catcagacca cttccttccc ccatcctcca ggagaggggg caagggcaac ccaccatcta 601 cccacttact aacctggtcc taaccccctt actgtgcgcg tgtgtgtgcg tgtgcgcagc 661 tctggctgtt tgtctatatg tctagctcat ctagttcctc ttcttaaggg gatgggggtc 721 aggggctagg ggagggggct gagtttcccc actttaggag gaggtggggg ctatttctat 781 gcaaatagaa atcagcacat tcctcctact tccctttcct ccactccccc catatcttta 841 aagtgtggaa gcagaaagga cctgcatttt cctacattga ggagctgaca taggggtaag 901 gtatgggaga ggtaggtgga tccagggaaa agcagtgggg acggaaggca aagagaccac 961 ttaaccccca cctggaaggg gcaaagaaaa gccagagttc catgtttgta ctcctgtgct 1021 ggactgtttc ctgagtacca gcaggtccct ttttgtctct catgggccta gcataggtat 1081 gagccaggga tcctttcctg gtccctaaga tcaaacccca tggagcagcc agcgttagat 1141 gcccccaccc acctgtactc tggagagact gtgctgggaa catgtaccac tgagcctgag 1201 atggggatga gggcagagag aggggagccc cctcttccac tcagttgttc ctactcagac 1261 tgttgcactc taaaccttag ggaggttgaa agaattgaga cccttaggtt ttaacaacga 1321 atcctgacaa caccatctat tagggtccca aattggttat tgtaggcaac cttccctctt 1381 ttcttggtga agaacatccc aagccagaaa gaagttaact acagtgtttt cctttgcacc 1441 gatccccacc ccaattcaat cccggaaggg acttacttag gaaacccttc tttactagat 1501 atcctggccc cctgggcttg tgaacacctc ctagccacat cactacagta cagtgagtga 1561 ccccagcctc ctgcctaccc caagatgccc ctccccaccc tgaccgtgct aactgtgtgt 1621 acatatatat tctacatata tgtatattaa aactgcactg ccatgtctgc ccttttttgt 1681 ggtgtctagc attaacttat tgtctaggcc agagcggggg tgggagggga atgccacagt 1741 gaagggagtg gcagaatcaa attgctacat agtccaaaca aaaaagaagg ctttttcaaa 1801 aaacattaaa ttcacatgca gtctcagaga ctttttagac aaagttcaag ttaggagctt 1861 ttaggatgtg ggagtaaaac tttaatggga ggggagggct ggctgctgga agaaggaaga 1921 agccagactg gttagacagt actcttaact cctagcccag cctagcgtgc cctgcccctc 1981 tggccactgc tgcagacacc tgccttaaca cacacacctc taggactcca cagttttgcc 2041 ttaaaggacc ttcccaagtc tccctttccc tgtctggctt ctcccttaag aagagagaga 2101 tacttgtaga attgggtggg gggaatgagc atgaactgtc cttccatttg ggatatgtta 2161 cattagagtg agagagagaa taaggagcct ttcttatgga agaaatggga gaagagagac 2221 agggttcttt tcagcagagt ctagtagttt ctctgtaagg caaaataatc taaaaagact 2281 aacctgccca cccactcctt atattgctgt gagattgccc ctatcttgtg ctcttctgtc 2341 tgcagtgtgc acggccttgt tctaacccgg aataaaggtg attga // LOCUS HSPEABP 1444 bp RNA PRI 25-AUG-1995 DEFINITION H.sapiens phosphatidylethanolamine binding protein mRNA. ACCESSION X75252 NID g406289 KEYWORDS phosphatidylethanolamine-binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1444) AUTHORS Tohdoh,N., Tojo,S., Agui,H. and Ojika,K. TITLE Sequence homology of rat and human HCNP precursor proteins, bovine phosphatidylethanolamine-binding protein and rat 23-kDa protein associated with the opioid-binding protein JOURNAL Brain Res. Mol. Brain Res. 30 (2), 381-384 (1995) MEDLINE 95364631 REFERENCE 2 (bases 1 to 1444) AUTHORS Tohdoh,N. TITLE Direct Submission JOURNAL Submitted (08-SEP-1993) N. Tohdoh, Sumitomo Pharmaceuticals Co., Ltd., 1-98, Kasugade-Naka 3-chome, Konohanaku, Osaka, 554, JAPAN FEATURES Location/Qualifiers source 1..1444 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /tissue_type="placenta" /clone_lib="lambda gt11" /clone="pP1-3, pP1-8" CDS 111..674 /codon_start=1 /product="phosphatidylethanolamine binding protein" /db_xref="PID:g406290" /db_xref="SWISS-PROT:P30086" /translation="MPVDLSKWSGPLSLQEVDEQPQHPLHVTYAGAAVDELGKVLTPT QVKNRPTSISWDGLDSGKLYTLVLTDPDAPSRKDPKYREWHHFLVVNMKGNDISSGTV LSDYVGSGPPKGTGLHRYVWLVYEQDRPLKCDEPILSNRSGDHRGKFKVASFRKKYEL RAPVAGTCYQAEWDDYVPKLYEQLSGK" BASE COUNT 345 a 350 c 403 g 346 t ORIGIN 1 ggggggtctg cgtcttcccg agccagtgtg ctgagctctc cgcgtcgcct ctgtcgcccg 61 cgcctggcct accgcggcac tcccggctgc acgctctgct tggcctcgcc atgccggtgg 121 acctcagcaa gtggtccggg cccttgagcc tgcaagaagt ggacgagcag ccgcagcacc 181 cgctgcatgt cacctacgcc ggggcggcgg tggacgagct gggcaaagtg ctgacgccca 241 cccaggttaa gaatagaccc accagcattt cgtgggatgg tcttgattca gggaagctct 301 acaccttggt cctgacagac ccggatgctc ccagcaggaa ggatcccaaa tacagagaat 361 ggcatcattt cctggtggtc aacatgaagg gcaatgacat cagcagtggc acagtcctct 421 ccgattatgt gggctcgggg cctcccaagg gcacaggcct ccaccgctat gtctggctgg 481 tttacgagca ggacaggccg ctaaagtgtg acgagcccat cctcagcaac cgatctggag 541 accaccgtgg caaattcaag gtggcgtcct tccgtaaaaa gtatgagctc agggccccgg 601 tggctggcac gtgttaccag gccgagtggg atgactatgt gcccaaactg tacgagcagc 661 tgtctgggaa gtagggggtt agcttgggga cctgaactgt cctggaggcc ccaagccatg 721 ttccccagtt cagtgttgca tgtataatag atttctcctc ttcctgcccc ccttggcatg 781 ggtgagacct gaccagtcag atggtagttg agggtgactt ttcctgctgc ctggccttta 841 taattttact cactcactct gatttatgtt ttgatcaaat ttgaacttca ttttgggggg 901 tattttggta ctgtgatggg gtcatcaaat tattaatctg aaaatagcaa cccagaatgt 961 aaaaaagaaa aaactggggg gaaaaagacc aggtctacag tgatagagca aagcatcaaa 1021 gaatctttaa gggaggttta aaaaaaaaaa aaaaaaaaaa gattggttgc ctctgccttt 1081 gtgatcctga gtccagaatg gtacacaatg tgattttatg gtgatgtcac tcacctagac 1141 aaccagaggc tggcattgag gctaacctcc aacacagtgc atctcagatg cctcagtagg 1201 catcagtatg tcactctggt ccctttaaag agcaatcctg gaagaagcag gagggagggt 1261 ggctttgctg ttgttgggac atggcaatct agaccggtag cagcgcctcg ctgacagctt 1321 gggaggaaac ctgagatctg tgttttttaa attgatcgtt cttcatgggg gtaagaaaag 1381 ctggtctgga gttgctgaat gttgcattaa ttgtgctgtt tgcttgtagt tgaataaaaa 1441 cccg // LOCUS HSPEG1 1501 bp DNA PRI 29-JUL-1997 DEFINITION H.sapiens PEG1/MEST gene. ACCESSION Y10620 NID g2285949 KEYWORDS PEG1/MEST gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1501) AUTHORS Riesewijk,A.M., Hu,L., Schulz,U., Tariverdian,G., Hoglund,P., Kere,J., Ropers,H.H. and Kalscheuer,V.M. TITLE Monoallelic expression of human PEG1/MEST is paralleled by parent-specific methylation in fetuses JOURNAL Genomics 42 (2), 236-244 (1997) MEDLINE 97336048 REFERENCE 2 (bases 1 to 1501) AUTHORS Kalscheuer,V.M.M. TITLE Direct Submission JOURNAL Submitted (20-JAN-1997) V.M.M. Kalscheuer, Max-Planck Institut fuer Molekulare Genetik, Ihnestrasse 73, D- 14195 Berlin, FRG REMARK revised by submitter 29-JUL-1997 COMMENT Related sequences: D78611 & Y11534. FEATURES Location/Qualifiers source 1..1501 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="cosmid library 113" /clone="ICRfc113G0353Q4" /sub_clone="4.3HpT7T3" /chromosome="7" /map="q32" exon 479..722 /gene="PEG1/MEST" /number=1 gene 479..1501 /gene="PEG1/MEST" CDS 697..726 /gene="PEG1/MEST" /codon_start=1 /db_xref="PID:e332036" /db_xref="PID:g2285950" /translation="MVRRDRLRR" intron 723..>1501 /gene="PEG1/MEST" /number=1 BASE COUNT 242 a 414 c 527 g 318 t ORIGIN 1 gagggatggg agcaggcgcc acggccggca ccccagagcc ctgctgcccc ttagttcgag 61 cggccatcct cctgtggggc ttgtgggcag cctgtggggt ttgtgggcgg cctgtggggt 121 ttgtgggtgg tctaaggaaa gagttggggc actcaggggt ctgctgtttt tgcccgtggc 181 cttaactcat caggggaggg tttctgcagc agaatctcgg gctcagggtt ggcggttaac 241 gagggagcag cggggtcttg gggagggggc tcgacacccc tgaaggtgcc ccctaaagga 301 gccactgtta gaggggcacc ccatctttgt ggccatggcg gtggtagagc ggctgggagg 361 ggctctgcgg cgagcaaggg agcaggcggt aggggttttg cggcgatggg cgggctaggg 421 gcggggcgcg ggtgggctct aaaagtcggt gcccactcgc tccgcgctgc cgcggcaacc 481 agcacacccc ggcacctcct ctgcggcagc tgcgcctcgc aagcgcagtg ccgcagcgca 541 cgccggagtg gctgtagctg cccggcgcgg cgccgccctg cgcgggctgt gggctgcggg 601 ctgcgccccc gctgctggcc agctctgcac ggctgcgggc tctgcggcgc ccggtgctct 661 gcaacgctgc ggcgggcggc atgggataac gcggccatgg tgcgccgaga tcgcctccgc 721 aggtgagtgt gcggtgggaa cgagggggtg tggctggcgg ccctgggact agggcgcagg 781 cgagcggagg actgtgtgcc cgtgtccgag ctggggctgc ctctgggccg aaactctacc 841 gacaggcggc acgcattccg cgcccgctct gcctacttga ggagggggtg tcactcctgc 901 ccgcaatgga atgttcagaa cgcgggacct ccttgggtta ggatttctag accccgggat 961 cgtcgtggtg agatttagga tttctggacc ccagcgtcat cttgatatga cttaggatcc 1021 ataatgaccc tggtctcacc ctgatgcgaa ttgggatttt tagatcctgg catcaccctg 1081 gtgcgattta ggatttttat actcagtcat tgctgcagca tgatttagga tttctaaccc 1141 ccagcatcgc cctggtttga tttaggatat ttagactccg gcttccctct ggtgcgattc 1201 aggattctta gactccgccg ttgccgtggc gcgatttagg atttatagat cccggcaaag 1261 ccctggtgcg atgtaggatt tttagaaccc cagcatcgct ctggtgcgac ttaaaggata 1321 ggccccagca tcgccctggt gcgatgtagg atttttagaa ccccggtatc tccgtggcgc 1381 accttaggat ttcaagaacg ggataatcgc agtgccgaga tcgccgcggt gcagcttagg 1441 atttcaagac ccaggtatca cggtggcggg agtcaccgca gtgactagaa ctcgcagtgc 1501 c // LOCUS HSPEP19 540 bp RNA PRI 21-JAN-1997 DEFINITION H.sapiens mRNA for PEP-19. ACCESSION X93349 NID g1072377 KEYWORDS PEP-19. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 540) AUTHORS Chen,H., Bouras,C. and Antonarakis,S.E. TITLE Cloning of the cDNA for a human homolog of the rat PEP-19 gene and mapping to chromosome 21q22.2-q22.3 JOURNAL Hum. Genet. 98 (6), 672-677 (1996) MEDLINE 97085564 REFERENCE 2 (bases 1 to 540) AUTHORS Antonarakis,S.E. TITLE Direct Submission JOURNAL Submitted (22-NOV-1995) S.E. Antonarakis, Div.of Medical Genetics, Univ. and Cantonal Hospital of Geneva, CMU, 1 Rue Michel-Servet, 1211 Geneva, SWITZERLAND FEATURES Location/Qualifiers source 1..540 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="retina" /chromosome="21" /map="21q22.1-q22.2" CDS 59..247 /note="human homolog of rat PEP-19" /codon_start=1 /product="PEP-19" /db_xref="PID:g1072378" /db_xref="SWISS-PROT:P48539" /translation="MSERQGAGPTNGKDKTSGENDGQKKVQEEFDIDMDAPETERAAV AIQSQFRKFQKKKAGSQS" BASE COUNT 187 a 113 c 120 g 120 t ORIGIN 1 gaattccgag gggtcgctgt gctgagcggc gggactgagc tgttgagtta gagccaacat 61 gagtgagcga caaggtgctg ggccaaccaa tggaaaagac aagacatctg gtgaaaatga 121 tggacagaag aaagttcaag aagaatttga cattgacatg gatgcaccag agacagaacg 181 tgcagcggtg gccattcagt ctcagttcag aaaattccag aagaagaagg ctgggtctca 241 gtcctagtgg gagaaccccc tcctagtcca cctgaaagca ccaaattcaa ccatcatctg 301 tcaagaaatt aaaagaacaa caccctagag agaagtcatc cacacacaat ccacacacgc 361 atagcaaacc tccaatgcat gtacagaaac ctgtgatatt tatacccttg taggaaggta 421 tagacaatgg aattgtgagt agcttaatct ctatgtttct ctccattttc attcctcctg 481 caactatttt ccttgatgtt gtaataaaat gaagttacga tgagaaaaaa aaaaaaaaaa // LOCUS HSPEPPGEN 1872 bp RNA PRI 01-NOV-1997 DEFINITION H.sapiens mRNA for aminopeptidase P-like. ACCESSION X95762 NID g2584786 KEYWORDS aminopeptidase P; XPNPEPL gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1872) AUTHORS Vanhoof,G.C.P., Goossens,F., Juliano,M.A., Juliano,L., Schatteman,K., Lin,A.H. and Scharpe,S. TITLE Isolation and sequence analysis of a human cDNA clone (XPNPEPL) homologous to X-prolyl aminopeptidase (Aminopeptidase P) JOURNAL Unpublished REFERENCE 2 (bases 1 to 1872) AUTHORS Vanhoof,G.C.P. TITLE Direct Submission JOURNAL Submitted (19-FEB-1996) G.C.P. Vanhoof, University of Antwerp, Pharmacy S-6, Medical Biochemistry, Universiteitsplein 1, 2610 Wilrijk, BELGIUM REMARK Revised by F.Goossens 25-MAR-97 COMMENT Related sequences: T06411, T08900, H18713, T32023, H37788. FEATURES Location/Qualifiers source 1..1872 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="blood" /cell_type="PHA activated lymphocytes" mat_peptide 1..1869 /gene="XPNPEPL" /product="Aminopeptidase P-like" gene 1..1872 /gene="XPNPEPL" CDS 1..1872 /gene="XPNPEPL" /EC_number="3.4.11.9" /codon_start=1 /product="Aminopeptidase P-like" /db_xref="PID:e1169567" /db_xref="PID:g2584787" /translation="MPPKVTSELLRQLRQAMRNSEYVTEPIQAYIIPSGDAHQSEYIA PCDCRRAFVSGFDGSAGTAIITEEHAAMWTDGRYFLQAAKQMDSNWTLMKMGLKDTPT QEDWLVSVLPEGSRVGVDPLIIPTDYWKKMAKVLRSAGHHLIPVKENLVDKIWTDRPE RPCKPLLTLGLDYTGISWKDKVADLRLKMAERNVMWFVVTALDEIAWLFNLRGSDVEH NPVFFSYAIIGLETIMLFIDGDRIDAPSVKEHLLLDLGLEAEYRIQVHPYKSILSELK ALCADLSPREKVWVSDKASYAVSETIPKDHRCCMPYTPICIAKAVKNSAESEGMRPAH IKDAVALCELFNWLEKEVPKGGVTEISAADKAEEFRRQQADFVDLSFPTISSTGPNGA IIHYAPVPETNRTLSLDEVYLIDSGAQYKDGTTDVTRTMHFGTPTAYEKECFTYVLKG HIAVSAAVFPTGTKGHLLDSFARSALWDSGLDYLHGTGHGVGSFLNVHEGPCGISYKT FSDEPLEAGMIVTDEPGYYEDGAFGIRIENVVLVVPVKTKYNFNNRGSLTFEPLTLVP IQTKMIDVDSLTDKECDWLNNYHLTCRDVIGKELQKQGRQEALEWLIRETQPISKQH" BASE COUNT 468 a 465 c 497 g 442 t ORIGIN 1 atgcctccaa aggtgacttc agagctgctt cggcagctga gacaagccat gaggaactct 61 gagtatgtga ccgaaccgat ccaggcctac atcatcccat cgggagatgc tcatcagagt 121 gagtatattg ctccatgtga ctgtcggcgg gcttttgtct ctggattcga tggctctgcg 181 ggcacagcca tcatcacaga agagcatgca gccatgtgga ctgacgggcg ctactttctc 241 caggctgcca agcaaatgga cagcaactgg acacttatga agatgggtct gaaggacaca 301 ccaactcagg aagactggct ggtgagtgtg cttcctgaag gatccagggt tggtgtggac 361 cccttgatca ttcctacaga ttattggaag aaaatggcca aagttctgag aagtgccggc 421 catcacctca ttcctgtcaa ggagaacctc gttgacaaaa tctggacaga ccgtcctgag 481 cgcccttgca agcctctcct cacactgggc ctggattaca caggcatctc ctggaaggac 541 aaggttgcag accttcggtt gaaaatggct gagaggaacg tcatgtggtt tgtggtcact 601 gccttggatg agattgcgtg gctatttaat ctccgaggat cagatgtgga gcacaatcca 661 gtatttttct cctacgcaat cataggacta gagacgatca tgctcttcat tgatggtgac 721 cgcatagacg cccccagtgt gaaggagcac ctgcttcttg acttgggtct ggaagccgaa 781 tacaggatcc aggtgcatcc ctacaagtcc atcctgagcg agctcaaggc cctgtgtgct 841 gacctctccc caagggagaa ggtgtgggtc agtgacaagg ccagctatgc tgtgagcgag 901 accatcccca aggaccaccg ctgctgtatg ccttacaccc ccatctgcat cgccaaagct 961 gtgaagaatt cagctgagtc agaaggcatg aggccggctc acattaaaga tgctgttgct 1021 ctctgtgaac tctttaactg gctggagaaa gaggttccca aaggtggtgt gacagagatc 1081 tcagctgctg acaaagctga ggagtttcgc aggcaacagg cagactttgt ggacctgagc 1141 ttcccaacaa tttccagtac gggacccaac ggcgccatca ttcactacgc gccagtccct 1201 gagacgaata ggaccttgtc cctggatgag gtgtacctta ttgactcggg tgctcaatac 1261 aaggatggca ccacagatgt gacgcggaca atgcattttg ggacccctac agcctacgag 1321 aaggaatgct tcacatatgt cctcaagggc cacatagctg tgagtgcagc cgttttcccg 1381 actggaacca aaggtcacct tcttgactcc tttgcccgtt cagctttatg ggattcaggc 1441 ctagattact tgcacgggac tggacatggt gttgggtctt ttttgaatgt ccatgagggt 1501 ccttgcggca tcagttacaa aacattctct gatgagccct tggaggcagg catgattgtc 1561 actgatgagc ccgggtacta tgaagatggg gcttttggaa ttcgcattga gaatgttgtc 1621 cttgtggttc ctgtgaagac caagtataat tttaataacc ggggaagcct gacctttgaa 1681 cctctaacat tggttccaat tcagaccaaa atgatagatg tggattctct tacagacaaa 1741 gagtgcgact ggctcaacaa ttaccacctg acctgcaggg atgtgattgg gaaggaattg 1801 cagaaacagg gccgccagga agctctcgag tggctcatca gagagacgca acccatctcc 1861 aaacagcatt aa // LOCUS HSPFKLA 2914 bp RNA PRI 29-OCT-1992 DEFINITION Human liver-type 1-phosphofructokinase (PFKL) mRNA, complete cds. ACCESSION X15573 NID g35430 KEYWORDS 1-phosphofructokinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2914) AUTHORS Levanon,D., Danciger,E., Dafni,N., Bernstein,Y., Elson,A., Moens,W., Brandeis,M. and Groner,Y. TITLE The primary structure of human liver type phosphofructokinase and its comparison with other types of PFK JOURNAL DNA 8 (10), 733-743 (1989) MEDLINE 90126227 REFERENCE 2 (bases 1 to 2914) AUTHORS Elson,A. JOURNAL Unpublished COMMENT Draft entry and computer-readable sequence [DNA 8, 733-743 (1989)] kindly submitted by A.Elson, 13-JUN-1989. FEATURES Location/Qualifiers source 1..2914 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 56..2398 /note="1-phosphofructokinase" /codon_start=1 /db_xref="PID:g35431" /translation="MAAVDLEKLRASGAGKAIGVLTSGGDRQGMNAAVRAVTRMGIYV GAKVFLIYEGYEGLVEGGENIKQANWLSVSNIIQLGGTIIGSARSKAFTTREGRRAAA YNLVQHGITNLCVIGGDGSLTGANIFRSEWGSLLEELVAEGKISETTAWTYSHLNIAG LVGSIDNDFCGTDMTIGTDSALHRIMEVIDAITTTAQSHQRTFVLEVMGRHCGYLALV SALASGADWLFIPEAPPEDGWENFMCERLGETRSRGSRLNIIIIAEGAIDRNGKPISS SYVKDLVVQRLGFDTRVTVLGHVQRGGTPSAFDRILSSKMGMEAVMALLEATPDTPAC VVTLSGNQSVRLPLMECVQMTKEVQKAMDDKRFDEATQLRGGSFENNWNIYKLLTHQK PPKEKSNFSLAILNVGAPAAGMNAAVRSAVRTGISHGHTVYVVHDGFEGLAKGQVQEV GWHDVAGWLGRGGSMLGTKRTLPKGQLESIVENIRIYGIHALLVVGGFEAYEGVLQLV EARGRYEELCIVMCVIPATISNNVPGTDFSLGSDTAVNAAMESCDRIKQSASGTKRRV FIVETMGGYCGYLATVTGIAVGADAAYVFEDPFNIHDLKVNVEHMTEKMKTDIQRGLV LRNEKCHDYYTTEFLYNLYSSEGKGVFDCRTNVLGHLQQGGAPTPFDRNYGTKLGVKA MLWLSEKLREVYRKGRVFANAPDSACVIGLKKKAVAFSPVTELKKDTDFEHRMPREQW WLSLRLMLKMLAQYRISMAAYVSGELEHVTRRTLSMDKGF" BASE COUNT 543 a 891 c 957 g 523 t ORIGIN 1 gcgacgcggc gcaggcggcg ggagtgcgag ctgggcccgt gtttcggccg ccgccatggc 61 cgcggtggac ctggagaagc tgcgggcgtc gggcgcgggc aaggccatcg gcgtcctgac 121 cagcggcggc gaccggcaag gcatgaacgc tgctgtccgg gctgtgacgc gcatgggcat 181 ttatgtgggt gccaaagtct tcctcatcta cgagggctat gagggcctcg tggagggagg 241 tgagaacatc aagcaggcca actggctgag cgtctccaac atcatccagc tgggcggcac 301 tatcattggc agcgctcgct cgaaggcctt taccaccagg gaggggcgcc gggcagcggc 361 ctacaacctg gtccagcacg gcatcaccaa cctgtgcgtc atcggcgggg atggcagcct 421 cacaggtgcc aacatcttcc gcagcgagtg gggcagcctg ctggaggagc tggtggcgga 481 aggtaagatc tcagagacta cagcctggac ctactcgcac ctgaacatcg cgggcctagt 541 gggctccatc gataacgact tctgcggcac cgacatgacc atcggcacgg actcggccct 601 ccaccgcatc atggaggtca tcgatgccat caccaccact gcccagagcc accagaggac 661 cttcgtgctg gaagtgatgg gccggcactg cgggtacctg gcgctggtat ctgcactggc 721 ctcaggggcc gactggctgt tcatccccga ggctccaccc gaggacggct gggagaactt 781 catgtgtgag aggctgggtg agactcggag ccgtgggtcc cgactgaaca tcatcatcat 841 cgctgagggt gccattgacc gcaacgggaa gcccatctcg tccagctacg tgaaggacct 901 ggtggttcag aggctgggct tcgacacccg tgtaactgtg ctgggccacg tgcagcgggg 961 agggacgccc tctgccttcg accggatcct gagcagcaag atgggcatgg aggcggtgat 1021 ggcgctgctg gaagccacgc ctgacacgcc ggcctgcgtg gtcaccctct cggggaacca 1081 gtcagtgcgg ctgcccctca tggagtgcgt gcagatgacc aaggaagtgc agaaagccat 1141 ggatgacaag aggtttgacg aggccaccca gctccgtggt gggagcttcg agaacaactg 1201 gaacatttac aagctcctca cccaccagaa gccccccaag gagaagtcta acttctccct 1261 ggccatcctg aatgtggggg ccccggcggc tggcatgaat gcggccgtgc gctcggcggt 1321 gcggaccggc atctcccatg gacacacagt atacgtggtg cacgatggct tcgaaggcct 1381 agccaagggt caggtgcaag aagtaggctg gcacgacgtg gccggctggt tggggcgtgg 1441 tggctccatg ctggggacca agaggaccct gcccaagggc cagctggagt ccattgtgga 1501 gaacatccgc atctatggta ttcacgccct gctggtggtc ggtgggtttg aggcctatga 1561 aggggtgctg cagctggtgg aggctcgcgg gcgctacgag gagctctgca tcgtcatgtg 1621 tgtcatccca gccaccatca gcaacaacgt ccctggcacc gacttcagcc tgggctccga 1681 cactgctgta aatgccgcca tggagagctg tgaccgcatc aaacagtctg cctcggggac 1741 caagcgccgt gtgttcatcg tggagaccat ggggggttac tgtggctacc tggccaccgt 1801 gactggcatt gctgtggggg ccgacgccgc ctacgtcttc gaggaccctt tcaacatcca 1861 cgacttaaag gtcaacgtgg agcacatgac ggagaagatg aagacagaca ttcagagggg 1921 cctggtgctg cggaacgaga agtgccatga ctactacacc acggagttcc tgtacaacct 1981 gtactcatca gagggcaagg gcgtcttcga ctgcaggacc aatgtcctgg gccacctgca 2041 gcagggtggc gctccaaccc cctttgaccg gaactatggg accaagctgg gggtgaaggc 2101 catgctgtgg ttgtcggaga agctgcgcga ggtttaccgc aagggacggg tgttcgccaa 2161 tgccccagac tcggcctgcg tgatcggcct gaagaagaag gcggtggcct tcagccccgt 2221 cactgagctc aagaaagaca ctgatttcga gcaccgcatg ccacgggagc agtggtggct 2281 gagcctgcgg ctcatgctga agatgctggc acaataccgc atcagtatgg ccgcctacgt 2341 gtcaggggag ctggagcacg tgacccgccg caccctgagc atggacaagg gcttctgagg 2401 ccagccatgc ccgagctgcc cctccccagc ccccacccat gccagcgcac gcgccagggc 2461 tcagatgggg cctgggctgt tgtgtctgga gcctgcaggc aggtgggggc tgcgtccctg 2521 ctcagcccat cccctgcctc tactccctgg ccacctgcca ggcctccctc cggctggtgt 2581 cttgagacca gcctgccagg cctccagcag gaggacagag tgccctgggg catccacctt 2641 cctgcccagg ggacgtggcg ctgtcggtgt ttggaggctg ctgccccctg gctttggcgc 2701 cccatgggcc ctcagcgtct ccccatgctg ggctcactac atgggccagc ccttgctcta 2761 cctggccggt aggctgctgg cgcctaggtt gtgttgagag ggggatgccc ctggccctgc 2821 ctcactgtga cctgctcctg cccacgtgca gcacctgtca ccttttctag aaataaaatc 2881 accctgactg tggggtgcca tcggtctccg gaga // LOCUS HSPGK1 1767 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA encoding phosphoglycerate kinase. ACCESSION V00572 NID g35434 KEYWORDS complementary DNA; kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1767) AUTHORS Michelson,A.M., Markham,A.F. and Orkin,S.H. TITLE Isolation and DNA sequence of a full-length cDNA clone for human X chromosome-encoded phosphoglycerate kinase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80 (2), 472-476 (1983) MEDLINE 83169680 COMMENT Data kindly reviewed (22-AUG-1983) by Michelson A.M. This gene is encoded by the human X chromosome. FEATURES Location/Qualifiers source 1..1767 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 80..1333 /note="coding sequence" /codon_start=1 /db_xref="PID:g35435" /db_xref="SWISS-PROT:P00558" /translation="MSLSNKLTLDKLDVKGKRVVMRVDFNVPMKNNQITNNQRIKAAV PSIKFCLDNGAKSVVLMSHLGRPDGVPMPDKYSLEPVAVELKSLLGKDVLFLKDCVGP EVEKACANPAAGSVILLENLRFHVEEEGKGKDASGNKVKAEPAKIEAFRASLSKLGDV YVNDAFGTAHRAHSSMVGVNLPQKAGGFLMKKELNYFAKALESPERPFLAILGGAKVA DKIQLINNMLDKVNEMIIGGGMAFTFLKVLNNMEIGTSLFDEEGAKIVKDLMSKAEKN GVKITLPVDFVTADKFDENAKTGQATVASGIPAGWMGLDCGPESSKKYAEAVTRAKQI VWNGPVGVFEWEAFARGTKALMDEVVKATSRGCITIIGGGDTATCCAKWNTEDKVSHV STGGGASLELLEGKVLPGVDALSNI" BASE COUNT 461 a 389 c 455 g 462 t ORIGIN 1 aagcctccgg agcgcacgtc ggcagtcggc tccctcgttg accgaatcac cgacctctct 61 ccccagctgt atttccaaaa tgtcgctttc taacaagctg acgctggaca agctggacgt 121 taaagggaag cgggtcgtta tgagagtcga cttcaatgtt cctatgaaga acaaccagat 181 aacaaacaac cagaggatta aggctgctgt cccaagcatc aaattctgct tggacaatgg 241 agccaagtcg gtagtcctta tgagccacct aggccggcct gatggtgtgc ccatgcctga 301 caagtactcc ttagagccag ttgctgtaga actcaaatct ctgctgggca aggatgttct 361 gttcttgaag gactgtgtag gcccagaagt ggagaaagcc tgtgccaacc cagctgctgg 421 gtctgtcatc ctgctggaga acctccgctt tcatgtggag gaagaaggga agggaaaaga 481 tgcttctggg aacaaggtta aagccgagcc agccaaaata gaagctttcc gagcttcact 541 ttccaagcta ggggatgtct atgtcaatga tgcttttggc actgctcaca gagcccacag 601 ctccatggta ggagtcaatc tgccacagaa ggctggtggg tttttgatga agaaggagct 661 gaactacttt gcaaaggcct tggagagccc agagcgaccc ttcctggcca tcctgggcgg 721 agctaaagtt gcagacaaga tccagctcat caataatatg ctggacaaag tcaatgagat 781 gattattggt ggtggaatgg cttttacctt ccttaaggtg ctcaacaaca tggagattgg 841 cacttctctg tttgatgaag agggagccaa gattgtcaaa gacctaatgt ccaaagctga 901 gaagaatggt gtgaagatta ccttgcctgt tgactttgtc actgctgaca agtttgatga 961 gaatgccaag actggccaag ccactgtggc ttctggcata cctgctggct ggatgggctt 1021 ggactgtggt cctgaaagca gcaagaagta tgctgaggct gtcactcggg ctaagcagat 1081 tgtgtggaat ggtcctgtgg gggtatttga atgggaagct tttgcccggg gaaccaaagc 1141 tctcatggat gaggtggtga aagccacttc taggggctgc atcaccatca taggtggtgg 1201 agacactgcc acttgctgtg ccaaatggaa cacggaggat aaagtcagcc atgtgagcac 1261 tgggggtggt gccagtttgg agctcctgga aggtaaagtc cttcctgggg tggatgctct 1321 cagcaatatt tagtactttc ctgcctttta gttcctgtgc acagccccta agtcaactta 1381 gcattttctg catctccact tggcattagc taaaaccttc catgtcaaga ttcagctagt 1441 ggccaagaga tgcagtgcca ggaaccctta aacagttgca cagcatctca gctcatcttc 1501 actgcaccct ggatttgcat acattcttca agatcccatt tgaatttttt agtgactaaa 1561 ccattgtgca ttctagagtg catatattta tattttgcct gttaaaaaga aagtgagcag 1621 tgttagctta gttctctttt gatgtaggtt attatgatta gctttgtcac tgtttcacta 1681 ctcagcatgg aaacaagatg aaattccatt tgtaggtagt gagacaaaat tgatgatcca 1741 ttaagtaaac aataaaagtg tccattg // LOCUS HSPGP95 1014 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for protein gene product (PGP) 9.5. ACCESSION X04741 NID g35439 KEYWORDS neuroendocrine marker protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1014) AUTHORS Day,I.N. and Thompson,R.J. TITLE Molecular cloning of cDNA coding for human PGP 9.5 protein. A novel cytoplasmic marker for neurones and neuroendocrine cells JOURNAL FEBS Lett. 210 (2), 157-160 (1987) MEDLINE 87080796 COMMENT Data kindly reviewed (09-JUL-1987) by Day I.N.M. FEATURES Location/Qualifiers source 1..1014 /organism="Homo sapiens" /db_xref="taxon:9606" misc_feature 27..34 /note="translation initiation consensus sequence" CDS 32..670 /note="PGP 9.5 (AA 1-212)" /codon_start=1 /db_xref="PID:g35440" /db_xref="SWISS-PROT:P09936" /translation="MLNKVLSRLGVAGQWRFVDVLGLEEESLGSVPAPACALLLLFPL TAQHENFRKKQIEELKGQEVSPKVYFMKQTIGNSCGTIGLIHAVANNQDKLGFEDGSV LKQFLSETEKMSPEDRAKCFEKNEAIQAAHDAVAQEGQCRVDDKVNFHFILFNNVDGH LYELDGRMPFPVNHGASSEDTLLKDAAKVCREFTEREQGEVRFSAVALCKAA" misc_feature 993..998 /note="polyadenylation signal" polyA_site 1014 /note="polyA site" BASE COUNT 256 a 242 c 267 g 249 t ORIGIN 1 gcagaaatag cctagggaga tcaaccccga gatgctgaac aaagtgctgt cccggctggg 61 ggtcgccggc cagtggcgct tcgtggacgt gctggggctg gaagaggagt ctctgggctc 121 ggtgccagcg cctgcctgcg cgctgctgct gctgtttccc ctcacggccc agcatgagaa 181 cttcaggaaa aagcagattg aagagctgaa gggacaagaa gttagtccta aagtgtactt 241 catgaagcag accattggga attcctgtgg cacaatcgga cttattcacg cagtggccaa 301 taatcaagac aaactgggat ttgaggatgg atcagttctg aaacagtttc tttctgaaac 361 agagaaaatg tcccctgaag acagagcaaa atgctttgaa aagaatgagg ccatacaggc 421 agcccatgat gccgtggcac aggaaggcca atgtcgggta gatgacaagg tgaatttcca 481 ttttattctg tttaacaacg tggatggcca cctctatgaa cttgatggac gaatgccttt 541 tccggtgaac catggcgcca gttcagagga caccctgctg aaggacgctg ccaaggtgtg 601 cagagaattc accgagcgtg agcaaggaga agtccgcttc tctgccgtgg ctctctgcaa 661 ggcagcctaa tgctctgtgg gagggacttt gctgatttcc cctcttccct tcaacatgaa 721 aatatatacc ccccatgcag tctaaaatgc ttcagtactt gtgaaacaca gctgttcttc 781 tgttctgcag acacgccttc ccctcagcca cacccaggca cttaagcaca agcagagtgc 841 acagctgtcc actgggccat tgtggtgtga gcttcagatg gtgaagcatt ctccccagtg 901 tatgtcttgt atccgatatc taacgcttta aatggctact ttggtttctg tctgtaagtt 961 aagaccttgg atgtggttat gttgtcctaa agaataaatt ttgctgatag tagc // LOCUS HSPGPIX 888 bp RNA PRI 15-SEP-1997 DEFINITION Human mRNA for platelet glycoprotein IX. ACCESSION X52997 NID g2160045 KEYWORDS membrane glycoprotein; platelet glycoprotein; platelet glycoprotein IX; transmembrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 888) AUTHORS Hickey,M.J. TITLE Direct Submission JOURNAL Submitted (11-MAY-1990) Hickey M.J., Seattle VA Med. Center, GMR/Mail Stop 151, 1660 S. Columbian Way, Seattle, WA 98108, U.S.A REMARK revised by [3] REFERENCE 2 (bases 1 to 885) AUTHORS Hickey,M.J., Deaven,L.L. and Roth,G.J. TITLE Human platelet glycoprotein IX. Characterization of cDNA and localization of the gene to chromosome 3 JOURNAL FEBS Lett. 274 (1-2), 189-192 (1990) MEDLINE 91071429 REFERENCE 3 (bases 1 to 888) AUTHORS Roth,G. TITLE Direct Submission JOURNAL Submitted (02-JUN-1997) G.Roth, Seattle VA Med. Center, GMR/Mail Stop 151, 1660 S. Columbian Way, Seattle, WA 98108, U.S.A REFERENCE 4 (bases 1 to 888) AUTHORS Hollmann,C., Haag,F., Schlott,M., Damaske,A., Bertuleit,H., Matthes,M., Kuhl,M., Thiele,H.G. and Koch-Nolte,F. TITLE Molecular characterization of mouse T-cell ecto-ADP-ribosyltransferase Rt6: cloning of a second functional gene and identification of the Rt6 gene products JOURNAL Mol. Immunol. 33 (9), 807-817 (1996) MEDLINE 96406989 COMMENT See also . Data kindly reviewed (06-SEP-1990) by Hickey M.J. FEATURES Location/Qualifiers source 1..888 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HEL" /clone_lib="lambda gt11" /clone="lambda E, lambda L" /chromosome="chromosome 3" sig_peptide 223..270 /note="signal peptide (AA -16 to -1)" CDS 223..756 /note="precursor polypeptide (AA -16 to 160)" /codon_start=1 /db_xref="PID:e319503" /db_xref="PID:g2160046" /translation="MPAWGALFLLWATAEATKDCPSPCTCRALETMGLWVDCRGHGLT ALPALPARTRHLLLANNSLQSVPPGAFDHLPQLQTLDVTQNPWHCDCSLTYLRLWLED RTPEALLQVRCASPSLAAHGPLGRLTGYQLGSCGWQLQASWVRPGVLWDVALVAVAAL GLALLAGLLCATTEALD" mat_peptide 271..753 /note="mature platelet glycoprotein IX (AA 1 to 160)" misc_feature 856..861 /note="polyA signal" polyA_site 888 /note="polyA site" BASE COUNT 150 a 329 c 265 g 144 t ORIGIN 1 cagctgtatc ccatagagtt gccacccagg cctcagccag gacctttcag gccagacagg 61 agcacctgac caaaggcttc acagccgccc tcaccgcccg gccttctacg gtgtccagag 121 acagttagcc aggcctgggc tgggcacact ccaccttccc tagtcaccag ctggtttccc 181 agaggagaag gctgagaccc gagaagggag ccagcctgtc ccatgcctgc ctggggagcc 241 ctgttcctgc tctgggccac agcagaggcc accaaggact gccccagccc atgtacctgc 301 cgcgccctgg aaaccatggg gctgtgggtg gactgcaggg gccacggact cacggccctg 361 cctgccctgc cggcccgcac ccgccacctt ctgctggcca acaacagcct tcagtccgtg 421 cccccgggag cctttgacca cctgccccag ctgcagaccc tcgatgtgac gcagaacccc 481 tggcactgtg actgcagcct cacctatctg cgcctctggc tggaggaccg cacgcccgag 541 gccctgctgc aggtccgctg tgccagcccc agcctcgctg cccatggccc gctgggccgg 601 ctgacaggct accagctggg cagctgtggc tggcagctgc aggcgtcctg ggtgcgcccg 661 ggggtcttgt gggacgtggc gctggtcgcc gtggccgcgc tgggcctggc tcttctggct 721 ggcctgctgt gtgccaccac agaggccctg gattgagcca ggcccccaga acctggctcc 781 agccaggggg ccagtccctg aggcaggtcc ccagactcca ccaagcctgg tcagcccaaa 841 ccaccagaag cccagaataa actggcagct cagctgtttt atataagc // LOCUS HSPGSR 2602 bp RNA PRI 21-MAR-1995 DEFINITION Human mRNA for plasma gelsolin. ACCESSION X04412 NID g35447 KEYWORDS gelsolin; plasma protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2602) AUTHORS Kwiatkowski,D.J., Stossel,T.P., Orkin,S.H., Mole,J.E., Colten,H.R. and Yin,H.L. TITLE Plasma and cytoplasmic gelsolins are encoded by a single gene and contain a duplicated actin-binding domain JOURNAL Nature 323 (6087), 455-458 (1986) MEDLINE 87014807 COMMENT The tetrapeptide Asp-Glu-Ser-Gly (residues 96-99) is thought to participate in the actin contact between adjacent actins in the filament. FEATURES Location/Qualifiers source 1..2602 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 15..95 /note="put. signal peptide (-27 to -1)" CDS 15..2363 /codon_start=1 /product="plasma gelsolin" /db_xref="PID:g736249" /db_xref="SWISS-PROT:P06396" /translation="MAPHRPAPALLCALSLALCALSLPVRAATASRGASQAGAPQGRV PEARPNSMVVEHPEFLKAGKEPGLQIWRVEKFDLVPVPTNLYGDFFTGDAYVILKTVQ LRNGNLQYDLHYWLGNECSQDESGAAAIFTVQLDDYLNGRAVQHREVQGFESATFLGY FKSGLKYKKGGVASGFKHVVPNEVVVQRLFQVKGRRVVRATEVPVSWESFNNGDCFIL DLGNNIHQWCGSNSNRYERLKATQVSKGIRDNERSGRARVHVSEEGTEPEAMLQVLGP KPALPAGTEDTAKEDAANRKLAKLYKVSNGAGTMSVSLVADENPFAQGALKSEDCFIL DHGKDGKIFVWKGKQANTEERKAALKTASDFITKMDYPKQTQVSVLPEGGETPLFKQF FKNWRDPDQTDGLGLSYLSSHIANVERVPFDAATLHTSTAMAAQHGMDDDGTGQKQIW RIEGSNKVPVDPATYGQFYGGDSYIILYNYRHGGRQGQIIYNWQGAQSTQDEVAASAI LTAQLDEELGGTPVQSRVVQGKEPAHLMSLFGGKPMIIYKGGTSREGGQTAPASTRLF QVRANSAGATRAVEVLPKAGALNSNDAFVLKTPSAAYLWVGTGASEAEKTGAQELLRV LRAQPVQVAEGSEPDGFWEALGGKAAYRTSPRLKDKKMDAHPPRLFACSNKIGRFVIE EVPGELMQEDLATDDVMLLDTWDQVFVWVGKDSQEEEKTEALTSAKRYIETDPANRDR RTPITVVKQGFEPPSFVGWFLGWDDDYWSVDPLDRAMAELAA" mat_peptide 96..2360 /note="mature plasma gelsolin (aa 1-755)" misc_feature 381..392 /note="tetrapeptide (aa 96-99)" misc_feature 2581..2586 /note="polyadenylation signal" polyA_site 2602 /note="polyadenylation site" BASE COUNT 551 a 750 c 804 g 497 t ORIGIN 1 gccgtgtcgc caccatggct ccgcaccgcc ccgcgcccgc gctgctttgc gcgctgtccc 61 tggcgctgtg cgcgctgtcg ctgcccgtcc gcgcggccac tgcgtcgcgg ggggcgtccc 121 aggcgggggc gccccagggg cgggtgcccg aggcgcggcc caacagcatg gtggtggaac 181 accccgagtt cctcaaggca gggaaggagc ctggcctgca gatctggcgt gtggagaagt 241 tcgatctggt gcccgtgccc accaaccttt atggagactt cttcacgggc gacgcctacg 301 tcatcctgaa gacagtgcag ctgaggaacg gaaatctgca gtatgacctc cactactggc 361 tgggcaatga gtgcagccag gatgagagcg gggcggccgc catctttacc gtgcagctgg 421 atgactacct gaacggccgg gccgtgcagc accgtgaggt ccagggcttc gagtcggcca 481 ccttcctagg ctacttcaag tctggcctga agtacaagaa aggaggtgtg gcatcaggat 541 tcaagcacgt ggtacccaac gaggtggtgg tgcagagact cttccaggtc aaagggcggc 601 gtgtggtccg tgccaccgag gtacctgtgt cctgggagag cttcaacaat ggcgactgct 661 tcatcctgga cctgggcaac aacatccacc agtggtgtgg ttccaacagc aatcggtatg 721 aaagactgaa ggccacacag gtgtccaagg gcatccggga caacgagcgg agtggccggg 781 cccgagtgca cgtgtctgag gagggcactg agcccgaggc gatgctccag gtgctgggcc 841 ccaagccggc tctgcctgca ggtaccgagg acaccgccaa ggaggatgcg gccaaccgca 901 agctggccaa gctctacaag gtctccaatg gtgcagggac catgtccgtc tccctcgtgg 961 ctgatgagaa ccccttcgcc cagggggccc tgaagtcaga ggactgcttc atcctggacc 1021 acggcaaaga tgggaaaatc tttgtctgga aaggcaagca ggcaaacacg gaggagagga 1081 aggctgccct caaaacagcc tctgacttca tcaccaagat ggactacccc aagcagactc 1141 aggtctcggt ccttcctgag ggcggtgaga ccccactgtt caagcagttc ttcaagaact 1201 ggcgggaccc agaccagaca gatggcctgg gcttgtccta cctttccagc catatcgcca 1261 acgtggagcg ggtgcccttc gacgccgcca ccctgcacac ctccactgcc atggccgccc 1321 agcacggcat ggatgacgat ggcacaggcc agaaacagat ctggagaatc gaaggttcca 1381 acaaggtgcc cgtggaccct gccacatatg gacagttcta tggaggcgac agctacatca 1441 ttctgtacaa ctaccgccat ggtggccgcc aggggcagat aatctataac tggcagggtg 1501 cccagtctac ccaggatgag gtcgctgcat ctgccatcct gactgctcag ctggatgagg 1561 agctgggagg tacccctgtc cagagccgtg tggtccaagg caaggagccc gcccacctca 1621 tgagcctgtt tggtgggaag cccatgatca tctacaaggg cggcacctcc cgcgagggcg 1681 ggcagacagc ccctgccagc acccgcctct tccaggtccg cgccaacagc gctggagcca 1741 cccgggctgt tgaggtattg cctaaggctg gtgcactgaa ctccaacgat gcctttgttc 1801 tgaaaacccc ctcagccgcc tacctgtggg tgggtacagg agccagcgag gcagagaaga 1861 cgggggccca ggagctgctc agggtgctgc gggcccaacc tgtgcaggtg gcagaaggca 1921 gcgagccaga tggcttctgg gaggccctgg gcgggaaggc tgcctaccgc acatccccac 1981 ggctgaagga caagaagatg gatgcccatc ctcctcgcct ctttgcctgc tccaacaaga 2041 ttggacgttt tgtgatcgaa gaggttcctg gtgagctcat gcaggaagac ctggcaacgg 2101 atgacgtcat gcttctggac acctgggacc aggtctttgt ctgggttgga aaggattctc 2161 aagaagaaga aaagacagaa gccttgactt ctgctaagcg gtacatcgag acggacccag 2221 ccaatcggga tcggcggacg cccatcaccg tggtgaagca aggctttgag cctccctcct 2281 ttgtgggctg gttccttggc tgggatgatg attactggtc tgtggacccc ttggacaggg 2341 ccatggctga gctggctgcc tgaggagggg cagggcccac ccatgtcacc ggtcagtgcc 2401 ttttggaact gtccttccct caaagaggcc ttagagcgag cagagcagct ctgctatgag 2461 tgtgtgtgtg tgtgtgtgtt gtttcttttt ttttttttta cagtatccaa aaatagccct 2521 gcaaaaattc agagtccttg caaaattgtc taaaatgtca gtgtttggga aattaaatcc 2581 aataaaaaca ttttgaagtg tg // LOCUS HSPH1CAT 1374 bp RNA PRI 18-AUG-1993 DEFINITION H.sapiens mRNA for phosphatase 1 catalytic subunit. ACCESSION X70848 NID g287796 KEYWORDS phosphatase; phosphatase 1 catalytic subunit. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1374) AUTHORS Lavin,M. TITLE Direct Submission JOURNAL Submitted (24-DEC-1992) M. Lavin, Queensland Inst.of Medical Research, The Bancroft Centre, 300 Herston Road, Brisbane QLD 4029, AUSTRALIA REFERENCE 2 (bases 1 to 1374) AUTHORS Song,Q., Khanna,K.K., Lu,H. and Lavin,M.F. TITLE Cloning and characterization of a human protein phosphatase 1-encoding cDNA JOURNAL Gene 129 (2), 291-295 (1993) MEDLINE 93314976 FEATURES Location/Qualifiers source 1..1374 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="lung" /chromosome="11" /map="11q13" CDS 28..1020 /EC_number="3.1.3.16" /note="protein phosphatase 1" /codon_start=1 /product="serine/threonine specific protein phosphatase" /db_xref="PID:g35451" /db_xref="SWISS-PROT:P08129" /translation="MSDSEKLNLDSIIGRLLEVQGSRPGKNVQLTENEIRGLCLKSRE IFLSQPILLELEAPLKICGDIHGQYYDLLRLFEYGGFPPESNYLFLGDYVDRGKQSLE TICLLLAYKIKYPENFFLLRGNHECASINRIYGFYDECKRRYNIKLWKTFTDCFNCLP IAAIVDEKIFCCHGGLSPDLQSMEQIRRIMRPTDVPDQGLLCDLLWSDPDKDVQGWGE NDRGVSFTFGAEVVAKFLHKHDLDLICRAHQVVEDGYEFFAKRQLVTLFSAPNYCGEF DNAGAMMSVDETLMCSFQILKPADKNKGKYGQFSGLNPGGRPITPPRNSAKAKK" BASE COUNT 291 a 403 c 393 g 287 t ORIGIN 1 gcaaggagct gctggctgga cggcggcatg tccgacagcg agaagctcaa cctggactcg 61 atcatcgggc gcctgctgga agtgcagggc tcgcggcctg gcaagaatgt acagctgaca 121 gagaacgaga tccgcggtct gtgcctgaaa tcccgggaga tttttctgag ccagcccatt 181 cttctggagc tggaggcacc cctcaagatc tgcggtgaca tacacggcca gtactacgac 241 cttctgcgac tatttgagta tggcggtttc cctcccgaga gcaactacct ctttctgggg 301 gactatgtgg acaggggcaa gcagtccttg gagaccatct gcctgctgct ggcctataag 361 atcaagtacc ccgagaactt cttcctgctc cgtgggaacc acgagtgtgc cagcatcaac 421 cgcatctatg gtttctacga tgagtgcaag agacgctaca acatcaaact gtggaaaacc 481 ttcactgact gcttcaactg cctgcccatc gcggccatag tggacgaaaa gatcttctgc 541 tgccacggag gcctgtcccc ggacctgcag tctatggagc agattcggcg gatcatgcgg 601 cccacagatg tgcctgacca gggcctgctg tgtgacctgc tgtggtctga ccctgacaag 661 gacgtgcagg gctggggcga gaacgaccgt ggcgtctctt ttacctttgg agccgaggtg 721 gtggccaagt tcctccacaa gcacgacttg gacctcatct gccgagcaca ccaggtggta 781 gaagacggct acgagttctt tgccaagcgg cagctggtga cacttttctc agctcccaac 841 tactgtggcg agtttgacaa tgctggcgcc atgatgagtg tggacgagac cctcatgtgc 901 tctttccaga tcctcaagcc cgccgacaag aacaagggga agtacgggca gttcagtggc 961 ctgaaccctg gaggccgacc catcacccca ccccgcaatt ccgccaaagc caagaaatag 1021 cccccgcaca ccaccctgtg ccccagatga tggattgatt gtacagaaat catgctgcca 1081 tgctgggggg gggtcacccc gacccctaag gcccacctgt cacggggaac atggagcctt 1141 ggtgtatttt tcttttcttt ttttaatgaa tcaatagcag cgtccagtcc cccagggctg 1201 cttcctgcct gcacctgcgg tactgtgagc aggatcctgg ggccgaggct gcagctcagg 1261 gcaacggcag gccaggtcgt gggtctccag ccgtgcttgg cctcaggctg gcagcccgga 1321 tcctggggca acccatctgg tctcttgaat aaaggtcaaa gctggatcgg aatc // LOCUS HSPHAPI 916 bp RNA PRI 19-JUL-1994 DEFINITION H.sapiens mRNA for HLA-DR associated protein I (PHAPI). ACCESSION X75090 NID g403006 KEYWORDS HLA-DR associated protein I; PHAPI. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 916) AUTHORS Kratzin,H.D. TITLE Direct Submission JOURNAL Submitted (20-SEP-1993) H.D. Kratzin, Max Planck Inst. for Experimental Med., Dept. for Immunochemistry, Hermann-Rein-str. 3, 37075 Goettingen, FRG REFERENCE 2 (bases 1 to 916) AUTHORS Vaesen,M., Barnikol-Watanabe,S., Gotz,H., Awni,L.A., Cole,T., Zimmermann,B., Kratzin,H.D. and Hilschmann,N. TITLE Purification and characterization of two putative HLA class II associated proteins: PHAPI and PHAPII JOURNAL Biol. Chem. Hoppe-Seyler 375 (2), 113-126 (1994) MEDLINE 94250340 FEATURES Location/Qualifiers source 1..916 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="B-cell" /cell_line="EBV transformed cell line H2LCL typed (HLA-A3,3; B7,7; Cw7,7; DR2,2; Dw,2.2 DQwl,1; DPw4,4)" mat_peptide 104..850 /product="PHAPI (Putative HLA-DR Associated Protein I)" CDS 104..853 /codon_start=1 /product="PHAPI (Putative HLA DR Associated Protein I)" /db_xref="PID:g403007" /db_xref="SWISS-PROT:P39687" /translation="MEMGRRIHLELRNRTPSDVKELVLDNSRSNEGKLEGLTDEFEEL EFLSTINVGLTSIANLPKLNKLKKLELSDNRVSGGLEVLAEKCPNLTHLNLSGNKIKD LSTIEPLKKLENLKSLDLFNCEVTNLNDYRENVFKLLPQLTYLDGYDRDDKEAPDSDA EGYVEGLDDEEEDEDEEEYDEDAQVVEDEEDEDEEEEGEEEDVSGEEEEDEEGYNDGE VDDEEDEEELGEEERGQKRKREPEDEGEDDD" BASE COUNT 296 a 167 c 275 g 178 t ORIGIN 1 gctggttgag ccttcaaagt cctaaaacgc gcggccgtgg gttcggggtt tattgattga 61 attccgccgg cgcgggagcc tctgcagaga gagagcgcga gagatggaga tgggcagacg 121 gattcattta gagctgcgga acaggacgcc ctctgatgtg aaagaacttg tcctggacaa 181 cagtcggtcg aatgaaggca aactcgaagg cctcacagat gaatttgaag aactggaatt 241 cttaagtaca atcaacgtag gcctcacctc aatcgcaaac ttaccaaagt taaacaaact 301 taagaagctt gaactaagcg ataacagagt ctcagggggc ctggaagtat tggcagaaaa 361 gtgtccgaac ctcacgcatc taaatttaag tggcaacaaa attaaagacc tcagcacaat 421 agagccactg aaaaagttag aaaacctcaa gagcttagac cttttcaatt gcgaggtaac 481 caacctgaac gactaccgag aaaatgtgtt caagctcctc ccgcaactca catatctcga 541 cggctatgac cgggacgaca aggaggcccc tgactcggat gctgagggct acgtggaggg 601 cctggatgat gaggaggagg atgaggatga ggaggagtat gatgaagatg ctcaggtagt 661 ggaagacgag gaggacgagg atgaggagga ggaaggtgaa gaggaggacg tgagtggaga 721 ggaggaggag gatgaagaag gttataacga tggagaggta gatgacgagg aagatgaaga 781 agagcttggt gaagaagaaa ggggtcagaa gcgaaaacga gaacctgaag atgagggaga 841 agatgatgac taagtggaat aacctatttt gaaaaattcc tattgtgatt tgactgtttt 901 tacccatatc ccctct // LOCUS HSPHAPII 924 bp RNA PRI 19-JUL-1994 DEFINITION H.sapiens mRNA for HLA-DR associated protein II (PHAPII). ACCESSION X75091 NID g403008 KEYWORDS HLA-DR associated protein II; PHAPII. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 924) AUTHORS Kratzin,H.D. TITLE Direct Submission JOURNAL Submitted (20-SEP-1993) H.D. Kratzin, Max Planck Inst. for Experimental Med., Dept. for Immunochemistry, Hermann-Rein-str. 3, 37075 Goettingen, FRG REFERENCE 2 (bases 1 to 924) AUTHORS Vaesen,M., Barnikol-Watanabe,S., Gotz,H., Awni,L.A., Cole,T., Zimmermann,B., Kratzin,H.D. and Hilschmann,N. TITLE Purification and characterization of two putative HLA class II associated proteins: PHAPI and PHAPII JOURNAL Biol. Chem. Hoppe-Seyler 375 (2), 113-126 (1994) MEDLINE 94250340 FEATURES Location/Qualifiers source 1..924 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="B-cell" /cell_line="EBV transformed cell line H2LCL typed (HLA-A3,3; B7,7; Cw7,7 DR2,2; Dw2,2; DQw1,1; DPw4,4)" misc_feature 1..18 /note="conflict to AC M93651" CDS 19..852 /codon_start=1 /product="PHAPII (Putative HLA DR Associated Protein II)" /db_xref="PID:g403009" /db_xref="SWISS-PROT:Q01105" /translation="MSAPAAKVSKKELNSNHDGADETSEKEQQEAIEHIDEVQNEIDR LNEQASEEILKVEQKYNKLRQPFFQKRSELIAKIPNFWVTTFVNHPQVSALLGEEDEE ALHYLTRVEVTEFEDIKSGYRIDFYFDENPYFENKVLSKEFHLNESGDPSSKSTEIKW KSGKDLTKRSSQTQNKASRKRQHEEPESFFTWFTDHSDAGADELGEVIKDDIWPNPLQ YYLVPDMDDEEGEGEEDDDDDEEEEGLEDIDEEGDEDEGEEDEDDDEGEEGEEDEGED D" BASE COUNT 329 a 153 c 237 g 205 t ORIGIN 1 cgaccgcgga gcagcaccat gtcggcgccg gcggccaaag tcagtaaaaa ggagctcaac 61 tccaaccacg acggggccga cgagacctca gaaaaagaac agcaagaagc gattgaacac 121 attgatgaag tacaaaatga aatagacaga cttaatgaac aagccagtga ggagattttg 181 aaagtagaac agaaatataa caaactccgc caaccatttt ttcagaagag gtcagaattg 241 atcgccaaaa tcccaaattt ttgggtaaca acatttgtca accatccaca agtgtctgca 301 ctgcttgggg aggaagatga agaggcactg cattatttga ccagagttga agtgacagaa 361 tttgaagata ttaaatcagg ttacagaata gatttttatt ttgatgaaaa tccttacttt 421 gaaaataaag ttctctccaa agaatttcat ctgaatgaga gtggtgatcc atcttcgaag 481 tccaccgaaa tcaaatggaa atctggaaag gatttgacga aacgttcgag tcaaacgcag 541 aataaagcca gcaggaagag gcagcatgag gaaccagaga gcttctttac ctggtttact 601 gaccattctg atgcaggtgc tgatgagtta ggagaggtca tcaaagatga tatttggcca 661 aacccattac agtactactt ggttcccgat atggatgatg aagaaggaga aggagaagaa 721 gatgatgatg atgatgaaga ggaggaagga ttagaagata ttgacgaaga aggggatgag 781 gatgaaggtg aagaagatga agatgatgat gaaggggagg aaggagagga ggatgaagga 841 gaagatgact aaatagaaca ctgatggatt ccaaccttcc tttttttaaa ttttctccag 901 tccctgggag caagttgcag tctt // LOCUS HSPHBIPRM 1073 bp RNA PRI 20-APR-1995 DEFINITION H.sapiens mRNA for phenylalkylamine binding protein. ACCESSION Z37986 NID g780262 KEYWORDS phenylalkylamine binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1073) AUTHORS Hanner,M., Moebius,F.F., Weber,F., Grabner,M., Striessnig,J. and Glossmann,H. TITLE Phenylalkylamine Ca2+ antagonist binding protein. Molecular cloning, tissue distribution, and heterologous expression JOURNAL J. Biol. Chem. 270 (13), 7551-7557 (1995) MEDLINE 95221417 REFERENCE 2 (bases 1 to 1073) AUTHORS Grabner,M. TITLE Direct Submission JOURNAL Submitted (29-SEP-1994) Grabner M., Universitaet Innsbruck, Biochemische Pharmakologie, Peter Mayrstr. 1, Innsbruck, Austria, A-6020 FEATURES Location/Qualifiers source 1..1073 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="lambda-HS3" /dev_stage="Adult" /tissue_type="Liver" /clone_lib="lambda gt10" /sex="Male" CDS 112..804 /codon_start=1 /product="phenylalkylamine binding protein" /db_xref="PID:g780263" /translation="MTTNAGPLHPYWPQHLRLDNFVPNDRPTWHILAGLFSVTGVLVV TTWLLSGRAAVVPLGTWRRLSLCWFAVCGFIHLVIEGWFVLYYEDLLGDQAFLSQLWK EYAKGDSRYILGDNFTVCMETITACLWGPLSLWVVIAFLRQHPLRFILQLVVSVGQIY GDVLYFLTEHRDGFQHGELGHPLYFWFYFVFMNALWLVLPGVLVLDAVKHLTHAQSTL DAKATKAKSKKN" polyA_signal 1043..1048 polyA_site 1062 BASE COUNT 238 a 285 c 287 g 263 t ORIGIN 1 cggagccagc gtgggaggcc gctgccgtcg cgcgccttgg tttttctgtt cctttttttt 61 tttttttttt aacttcctgc ctatcacacg cagccatcag cccacaaaga catgactacc 121 aacgcgggcc ccttgcaccc atactggcct cagcacctaa gactggacaa ctttgtacct 181 aatgaccgcc ccacctggca tatactggct ggcctcttct ctgtcacagg ggtcttagtc 241 gtgaccacat ggctgttgtc aggtcgtgct gcggttgtcc cattggggac ttggcggcga 301 ctgtccctgt gctggtttgc agtgtgtggg ttcattcacc tggtgatcga gggctggttc 361 gttctctact acgaagacct gcttggagac caagccttct tatctcaact ctggaaagag 421 tatgccaagg gagacagccg atacatcctg ggtgacaact tcacagtgtg catggaaacc 481 atcacagctt gcctgtgggg accactcagc ctgtgggtgg tgatcgcctt tctccgccag 541 catcccctcc gcttcattct acagcttgtg gtctctgtgg gccagatcta tggggatgtg 601 ctctacttcc tgacagagca ccgcgacgga ttccagcacg gagagctggg ccaccctctc 661 tacttctggt tttactttgt cttcatgaat gccctgtggc tggtgctgcc tggagtcctt 721 gtgcttgatg ctgtgaagca cctcactcat gcccagagca cgctggatgc caaggccaca 781 aaagccaaga gcaagaagaa ctgaggagtg gtggaccagg ctcgaacact ggccgaggag 841 gagctctctg cctgccagaa gagtctagtc ctgctcccac agtttggagg gacaaagcta 901 attgatctgt cacactcagg ctcatgggca ggcacaagaa ggggaataaa ggggctgtgt 961 gaaggcactg ctgggagcca ttagaacaca gatacaagag aagccaggag gtctatgatg 1021 gtgacgattt ttaaaatcag gaaataaaag atcttgactc taaaaaaaaa aaa // LOCUS HSPHI3K 3424 bp RNA PRI 24-AUG-1995 DEFINITION H.sapiens mRNA for phosphatidylinositol 3-kinase. ACCESSION Z29090 NID g472990 KEYWORDS phosphatidylinositol 3-kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1068) AUTHORS Volinia,S., Hiles,I., Ormondroyd,E., Nizetic,D., Antonacci,R., Rocchi,M. and Waterfield,M.D. TITLE Molecular cloning, cDNA sequence, and chromosomal localization of the human phosphatidylinositol 3-kinase p110 alpha (PIK3CA) gene JOURNAL Genomics 24 (3), 472-477 (1994) MEDLINE 95229146 REFERENCE 2 (bases 1 to 3424) AUTHORS Volinia,S. TITLE Direct Submission JOURNAL Submitted (16-DEC-1993) Stefano Volinia, Receptor Studies, Ludwig Institute for Cancer, Research, 91 Riding House Street, London, W1P 8BT, UK FEATURES Location/Qualifiers source 1..3424 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG1a" /chromosome="3q26.3" CDS 13..3219 /codon_start=1 /product="phosphatidylinositol 3-kinase" /db_xref="PID:g472991" /db_xref="SWISS-PROT:P42336" /translation="MPPRPSSGELWGIHLMPPRILVECLLPNGMIVTLECLREATLVT IKHELFKEARKYPLHQLLQDESSYIFVSVTQEAEREEFFDETRRLCDLRLFQPFLKVI EPVGNREEKILNREIGFAIGMPVCEFDMVKDPEVQDFRRNILNVCKEAVDLRDLNSPH SRAMYVYPPHVESSPELPKHIYNKLDRGQIIVVIWVIVSPNNDKQKYTLKINHDCVPE QVIAEAIRKKTRSMLLSSEQLKLCVLEYQGKYILKVCGCDEYFLEKYPLSQYKYIRSC IMLGRMPNLKMMAKESLYSQLPMDCFTMPSYSRRISTATPYMNGETSTKSLWVINRAL RIKILCATYVNLNIRDIDKIYVRTGIYHGGEPLCDNVNTQRVPCSNPRWNEWLNYDIY IPDLPRAARLCLSICSVKGRKGAKEEHCPLAWGNINLFDYTDTLVSGKMALNLWPVPH GLEDLLNPIGVTGSNPNKETPCLELEFDWFSSVVKFPDMSVIEEHANWSVSREAGFSY SHAGLSNRLARDNELRENDKEQLKAISTRDPLSEITEQEKDFLWSHRHYCVTIPEILP KLLLSVKWNSRDEVAQMYCLVKDWPPIKPEQAMELLDCNYPDPMVRGFAVRCLEKYLT DDKLSQYLIQLVQVLKYEQYLDNLLVRFLLKKALTNQRIGHFFFWHLKSEMHNKTVSQ RFGLLLESYCRACGMYLKHLNRQVEAMEKLINLTDILKQERKDETQKVQMKFLVEQMR RPDFMDALQGLLSPLNPAHQLGNLRLKECRIMSSAKRPLWLNWENPDIMSELLFQNNE IIFKNGDDLRQDMLTLQIIRIMENIWQNQGLDLRMLPYGCLSIGDCVGLIEVVRNSHT IMQIQCKGGLKGALQFNSHTLHQWLKDKNKGEIYDAAIDLFTRSCAGYCVATFILGIG DRHNSNIMVKDDGQLFHIDFGHFLDHKKKKFGYKRERVPFVLTQDFLIVISKGAQECT KTREFERFQEMCYKAYLAIRQHANLFINLFSMMLGSGMPELQSFDDIAYIRKTLALDK TEQEALEYFMKQMNDAHHGGWTTKMDWIFHTIKQHALN" BASE COUNT 1134 a 618 c 709 g 963 t ORIGIN 1 aggatcagaa caatgcctcc aagaccatca tcaggtgaac tgtggggcat ccacttgatg 61 cccccaagaa tcctagtgga atgtttacta ccaaatggaa tgatagtgac tttagaatgc 121 ctccgtgagg ctacattagt aactataaag catgaactat ttaaagaagc aagaaaatac 181 cctctccatc aacttcttca agatgaatct tcttacattt tcgtaagtgt tacccaagaa 241 gcagaaaggg aagaattttt tgatgaaaca agacgacttt gtgatcttcg gctttttcaa 301 ccatttttaa aagtaattga accagtaggc aaccgtgaag aaaagatcct caatcgagaa 361 attggttttg ctatcggcat gccagtgtgc gaatttgata tggttaaaga tcctgaagta 421 caggacttcc gaagaaatat tcttaatgtt tgtaaagaag ctgtggatct tagggatctt 481 aattcacctc atagtagagc aatgtatgtc tatccgccac atgtagaatc ttcaccagag 541 ctgccaaagc acatatataa taaattggat agaggccaaa taatagtggt gatttgggta 601 atagtttctc caaataatga caagcagaag tatactctga aaatcaacca tgactgtgtg 661 ccagaacaag taattgctga agcaatcagg aaaaaaacta gaagtatgtt gctatcatct 721 gaacaattaa aactctgtgt tttagaatat cagggcaagt acattttaaa agtgtgtgga 781 tgtgatgaat acttcctaga aaaatatcct ctgagtcagt ataagtatat aagaagctgt 841 ataatgcttg ggaggatgcc caatttgaag atgatggcta aagaaagcct ttattctcaa 901 ctgccaatgg actgttttac aatgccatct tattccagac gcatttccac agctacacca 961 tatatgaatg gagaaacatc tacaaaatcc ctttgggtta taaatagagc actcagaata 1021 aaaattcttt gtgcaaccta cgtgaatcta aatattcgag acattgacaa gatttatgtt 1081 cgaacaggta tctaccatgg aggagaaccc ttatgtgaca atgtgaacac tcaaagagta 1141 ccttgttcca atcccaggtg gaatgaatgg ctgaattatg atatatacat tcctgatctt 1201 cctcgtgctg ctcgactttg cctttccatt tgctctgtta aaggccgaaa gggtgctaaa 1261 gaggaacact gtccattggc atggggaaat ataaacttgt ttgattacac agacactcta 1321 gtatctggaa aaatggcttt gaatctttgg ccagtacctc atggattaga agatttgctg 1381 aaccctattg gtgttactgg atcaaatcca aataaagaaa ctccatgctt agagttggag 1441 tttgactggt tcagcagtgt ggtaaagttc ccagatatgt cagtgattga agagcatgcc 1501 aattggtctg tatcccgaga agcaggattt agctattccc acgcaggact gagtaacaga 1561 ctagctagag acaatgaatt aagggaaaat gacaaagaac agctcaaagc aatttctaca 1621 cgagatcctc tctctgaaat cactgagcag gagaaagatt ttctatggag tcacagacac 1681 tattgtgtaa ctatccccga aattctaccc aaattgcttc tgtctgttaa atggaattct 1741 agagatgaag tagcccagat gtattgcttg gtaaaagatt ggcctccaat caaacctgaa 1801 caggctatgg aacttctgga ctgtaattac ccagatccta tggttcgagg ttttgctgtt 1861 cggtgcttgg aaaaatattt aacagatgac aaactttctc agtatttaat tcagctagta 1921 caggtcctaa aatatgaaca atatttggat aacttgcttg tgagattttt actgaagaaa 1981 gcattgacta atcaaaggat tgggcacttt ttcttttggc atttaaaatc tgagatgcac 2041 aataaaacag ttagccagag gtttggcctg cttttggagt cctattgtcg tgcatgtggg 2101 atgtatttga agcacctgaa taggcaagtc gaggcaatgg aaaagctcat taacttaact 2161 gacattctca aacaggagag gaaggatgaa acacaaaagg tacagatgaa gtttttagtt 2221 gagcaaatga ggcgaccaga tttcatggat gccctacagg gcttgctgtc tcctctaaac 2281 cctgctcatc aactaggaaa cctcaggctt aaagagtgtc gaattatgtc ttctgcaaaa 2341 aggccactgt ggttgaattg ggagaaccca gacatcatgt cagagttact gtttcagaac 2401 aatgagatca tctttaaaaa tggggatgat ttacggcaag atatgctaac acttcaaatt 2461 attcgtatta tggaaaatat ctggcaaaat caaggtcttg atcttcgaat gttaccttat 2521 ggttgtctgt caatcggtga ctgtgtggga cttattgagg tggtgcgaaa ttctcacact 2581 attatgcaaa ttcagtgcaa aggcggcttg aaaggtgcac tgcagttcaa cagccacaca 2641 ctacatcagt ggctcaaaga caagaacaaa ggagaaatat atgatgcagc cattgacctg 2701 tttacacgtt catgtgctgg atactgtgta gctaccttca ttttgggaat tggagatcgt 2761 cacaatagta acatcatggt gaaagacgat ggacaactgt ttcatataga ttttggacac 2821 tttttggatc acaagaagaa aaaatttggt tataaacgag aacgtgtgcc atttgttttg 2881 acacaggatt tcttaatagt gattagtaaa ggagcccaag aatgcacaaa gacaagagaa 2941 tttgagaggt ttcaggagat gtgttacaag gcttatctag ctattcgaca gcatgccaat 3001 ctcttcataa atcttttctc aatgatgctt ggctctggaa tgccagaact acaatctttt 3061 gatgacattg catacattcg aaagacccta gccttagata aaactgagca agaggctttg 3121 gagtatttca tgaaacaaat gaatgatgca catcatggtg gctggacaac aaaaatggat 3181 tggatcttcc acacaattaa acagcatgca ttgaactgaa agataactga gaaaatgaaa 3241 gctcactctg gattccacac tgcactgtta ataactctca gcaggcaaag accgattgca 3301 taggaattgc acaatccatg aacagcatta gatttacagc aagaacagaa ataaaatact 3361 atataattta aataatgtaa acgcaaacag ggtttgatag cacttaaact agttcatttc 3421 aaaa // LOCUS HSPHKA1 4215 bp RNA PRI 27-APR-1995 DEFINITION H.sapiens PHKA 1 mRNA. ACCESSION X73874 NID g439271 KEYWORDS muscle isoform; phosphorylase kinase; phosphorylase kinase alpha-subunit. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4215) AUTHORS Kilimann,M.W. TITLE Direct Submission JOURNAL Submitted (05-JUL-1993) M.W. Kilimann, Ruhr-Universitaet Bochum, Inst. fuer Physiologische Chemie I, Universitaetsstr. 150, 44780 Bochum, FRG REMARK revised by [3] REFERENCE 2 (bases 1 to 4215) AUTHORS Wullrich,A., Hamacher,C., Schneider,A. and Kilimann,M.W. TITLE The multiphosphorylation domain of the phosphorylase kinase alpha M and alpha L subunits is a hotspot of differential mRNA processing and of molecular evolution JOURNAL J. Biol. Chem. 268 (31), 23208-23214 (1993) MEDLINE 94043107 FEATURES Location/Qualifiers source 1..4215 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="muscle" /clone="h alphaM1, h alphac2/1, PCR3/4, PCRc9 /c10" /chromosome="Xq12-q13" gene 162..3833 /gene="PHKA 1" CDS 162..3833 /gene="PHKA 1" /EC_number="2.7.1.38" /codon_start=1 /product="phosphorylase kinase" /db_xref="PID:g791043" /db_xref="SWISS-PROT:P46020" /translation="MRSRSNSGVRLDGYARLVQQTILCHQNPVTGLLPASYDQKDAWV RDNVYSILAVWGLGLAYRKNADRDEDKAKAYELEQSVVKLMRGLLHCMIRQVDKVESF KYSQSTKDSLHAKYNTKTCATVVGDDQWGHLQLDATSVYLLFLAQMTASGLHIIHSLD EVNFIQNLVFYIEAAYKTADFGIWERGDKTNQGISELNASSVGMAKAALEALDELDLF GVKGGPQSVIHVLADEVQHCQSILNSLLPRASTSKEVDASLLSVVSFPAFAVEDSQLV ELTKQEIITKLQGRYGCCRFLRDGYKTPKEDPNRLYYEPAELKLFENIECEWPLFWTY FILDGVFSGNAEQVQEYKEALEAVLIKGKNGVPLLPELYSVPPDRVDEEYQNPHTVDR VPMGKLPHMWGQSLYILGSLMAEGFLAPGEIDPLNRRFSTVPKPDVVVQVSILAETEE IKTILKDKGIYVETIAEVYPIRVQPARILSHIYSSLGCNNRMKLSGRPYRHMGVLGTS KLYDIRKTIFTFTPQFIDQQQFYLALDNKMIVEMLRTDLSYLCSRWRMTGQPTITFPI SYSMLDEDGTSLNSSILAALRKMQDGYFGGARVQTGKLSEFLTTSCCTHLSFMDPGPE GKLYSEDYDDNYDYLESGNWMNDYDSTSHARCGDEVARYLDHLLAHTAPHPKLAPTSQ KGGLDRFQAAVQTTCDLMSLVTKAKELHVQNVHMYLPTKLFQASRPSFNLLDSPHPRQ ENQVPSVRVEIHLPRDQSGEVDFKALVLQLKETSSLQEQADILYMLYTMKGPDWNTEL YNERSATVRELLTELYGKVGEIRHWGLIRYISGILRKKVEALDEACTDLLSHQKHLTV GLPPEPREKTISAPLPYEALTQLIDEASEGDMSISILTQEIMVYLAMYMRTQPGLFAE MFRLRIGLIIQVMATELAHSLRCSAEEATEGLMNLSPSAMKNLLHHILSGKEFGVERS VRPTDSNVSPAISIHEIGAVGATKTERTGIMQLKSEIKQVEFRRLSISAESQSPGTSM TPSSGSFPSAYDQQSSKDSRQGQWQRRRRLDGALNRVPVGFYQKVWKVLQKCHGLSVE GFVLPSSTTREMTPGEIKFSVHVESVLNRVPQPEYRQLLVEAILVLTMLADIEIHSIG SIIAVEKIVHIANDLFLQEQKTLGADDTMLAKDPASGICTLLYDSAPSGRFGTMTYLS KAAATYVQEFLPHSICAMQ" BASE COUNT 1136 a 977 c 1014 g 1088 t ORIGIN 1 gccgccgggc gccaggcctg agcggtggga gggctctgcg gggcctggtg ttcaggcgtc 61 ccaccacgag ggtggagcag cgttggatac ttgttcctta gggaccgaag ctccggtggc 121 acccgggcta tttctcagag gacaattagt aacgtgtcgc catgaggagc cggagtaact 181 ccggggtccg gctggacggc tacgctcgac tggtgcaaca gaccatcctg tgccatcaga 241 atccagtgac tggcttgctt ccagccagct atgatcagaa agatgcttgg gtccgagata 301 atgtgtacag catcttggct gtgtggggtt tgggcctggc ctatcggaag aatgcagacc 361 gggatgagga taaggcaaag gcctatgaat tggagcagag tgtagtgaag ctgatgagag 421 gactactgca ctgcatgatc agacaggtgg ataaagtaga atccttcaaa tatagtcaga 481 gtactaagga tagcctccat gcaaagtaca acaccaaaac ctgtgccact gtagtgggtg 541 atgatcaatg gggacacctg cagttggatg ctacctctgt gtacctgctc ttcttagccc 601 aaatgactgc ctcaggactc catatcatcc acagcctaga tgaagtcaat ttcatacaga 661 accttgtgtt ttacattgaa gctgcatata aaactgctga cttcgggata tgggaacgtg 721 gagacaagac caaccaaggg atctcagagt tgaatgccag ttcagttgga atggcaaagg 781 cagccctgga agcattagat gaactggatc tgtttggtgt gaaaggtggg cctcaatcag 841 ttatccatgt cctggctgat gaagtacagc actgccagtc tatcctaaat tcactactgc 901 cccgtgcttc aacatcaaaa gaggttgatg ctagtctact ctcagtggtt tccttccctg 961 cctttgcagt agaggatagc cagttggtgg agctcacaaa acaggaaatc atcaccaagc 1021 ttcagggtcg ttatggttgc tgtcgctttc tacgagatgg atataaaact cctaaagagg 1081 atcccaatcg tctgtactat gaaccagctg agctgaagct atttgaaaac attgagtgtg 1141 aatggccatt gttctggaca tactttattc ttgatggggt cttcagtggc aatgcagaac 1201 aggttcaaga atataaagag gctcttgaag cagtcctcat caagggcaaa aatggagtcc 1261 cacttctgcc agagctgtac agtgttcctc ctgacagggt cgatgaagaa tatcagaatc 1321 ctcacactgt ggaccgagtc cccatgggga aattgcctca catgtggggt cagtctctat 1381 acattttagg aagcttgatg gcagagggat ttttagcccc tggagaaatt gatcccctga 1441 atcgcaggtt ttctactgta ccgaagcccg atgttgtggt tcaagtctcc attctagctg 1501 aaacagaaga aatcaagacc attttgaagg acaagggaat ttacgtggag accattgctg 1561 aggtataccc catcagagta caaccagctc gtattctcag ccacatttat tccagcctag 1621 gatgcaacaa tagaatgaaa ctcagtggac gaccctacag acacatggga gtgcttggaa 1681 cttcaaaact ctatgacatt cggaaaacta tctttacttt cactccacag tttatagacc 1741 agcaacagtt ctacctggct ctggacaaca agatgatagt ggaaatgctt agaacagacc 1801 tctcctacct ctgtagccgc tggcggatga caggccagcc caccatcacc ttccccatct 1861 catacagcat gcttgatgaa gatggaacaa gcttgaattc aagtatcctg gcagcactcc 1921 gaaaaatgca agatgggtat tttggtgggg caagggttca aacaggtaaa ttgtcagagt 1981 ttttgacaac atcttgttgc acacacttga gcttcatgga ccctggacct gagggtaagc 2041 tgtacagtga agattatgat gacaactatg attacctgga atctggcaac tggatgaatg 2101 attatgattc aaccagtcat gctcgctgtg gtgatgaagt tgctcgttat ttagatcacc 2161 ttttggcgca cactgctccc catcctaaac tagcccctac ctcacagaag ggagggctag 2221 atcggttcca agctgctgtg caaacaacct gcgacttaat gtccttggtg accaaggcca 2281 aggaactgca tgtacagaat gttcacatgt atcttcctac gaagttattt caggcttccc 2341 ggccttcatt caacttactt gattcacctc atccccgaca ggagaaccag gttccctctg 2401 ttcgtgtaga aatacatctt cctagagacc agtctgggga ggtggacttt aaagcactgg 2461 ttttacagtt gaaggagacc tcaagcttac aggaacaagc tgatatcctc tatatgctgt 2521 atactatgaa aggacctgac tggaacactg aattgtataa tgaacggagt gctacagtga 2581 gagagcttct taccgagctg tatggcaaag tgggagaaat tcgtcactgg ggcctgatcc 2641 gatacatttc tgggatctta aggaagaaag tggaagcact tgatgaggcc tgcacagacc 2701 ttctctccca ccagaaacat ttgacagtag gacttcctcc agaacctcga gaaaagacta 2761 tctctgcacc tctgccctat gaggcgctca ctcagctgat agatgaagcc agtgaagggg 2821 atatgagcat ttcaatcctt acacaggaaa taatggtata tctagccatg tatatgcgaa 2881 cccagcctgg cctctttgct gaaatgtttc gacttcgaat tggtctgatc atacaagtta 2941 tggcaacaga actggcccac tcccttcgat gctcagctga ggaagccaca gagggcctga 3001 tgaatctcag tccttcggcc atgaagaatc tcctgcatca cattctcagc ggcaaggagt 3061 ttggagtgga acgaagcgtt cgtcccactg attcaaatgt cagtcctgct atttctatcc 3121 acgagattgg tgctgttgga gcaaccaaaa cagaacgaac tgggatcatg cagttaaaaa 3181 gtgagataaa gcaggtggaa tttcgtagac tgtcaatctc agctgagagt cagtcacctg 3241 gaacctctat gactccaagt agtgggtcct ttcctagtgc atatgatcag cagtcatcta 3301 aagatagtcg tcaaggtcaa tggcaacgcc gaagaaggct ggatggggca ctgaatagag 3361 ttccagttgg attttatcag aaagtatgga aagttttgca gaagtgtcac ggactttctg 3421 ttgaagggtt tgtccttcct tcctctacca ctagagagat gactccaggt gagattaaat 3481 tctctgttca tgtggagtct gtcctgaatc gtgtacctca gccagagtac cgtcagctgc 3541 tggttgaagc catccttgtc ctcaccatgc tggcagatat tgaaattcat agcatcggaa 3601 gcatcattgc tgtggaaaaa atagtgcata ttgccaatga cttgttcctt caagaacaga 3661 aaacccttgg cgcagatgat accatgttgg caaaggatcc cgcatctggc atctgtactc 3721 ttctgtatga cagtgcaccc agtggcaggt ttggcaccat gacctacctc tccaaggcag 3781 ccgccaccta cgtgcaggag ttcctgcccc acagcatctg tgccatgcaa tgagggcttt 3841 ggttcctggc ttctgggagc cttttgacag ctggtccctg cctcggttga ttgtgcatgg 3901 aactaaaatg ttattgccta atcactccaa ccctgcccct ttctgtccca tccttcccaa 3961 gaagagagaa ctttttcgat aaactaacta ctgtagaaga agtgaacact tacctggagg 4021 ctcaccttgc agaaccagtg acaatcttat gagtataatg aacactcagc caggcctgtc 4081 atgattggct ttatttcttt catcattcat aaaagtttgc atgtgttttt attctctaga 4141 tctgttacca atatagtttt ctaactcctg tttggggagc aagtgttaat aataacttat 4201 tcctaaaaaa aaaaa // LOCUS HSPHKG1 1377 bp RNA PRI 29-JAN-1996 DEFINITION H.sapiens PHKG1 mRNA. ACCESSION X80590 NID g1147566 KEYWORDS PHKG1 gene; phosphorylase kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1377) AUTHORS Wehner,M. and Kilimann,M.W. TITLE Human cDNA encoding the muscle isoform of the phosphorylase kinase gamma subunit (PHKG1) JOURNAL Hum. Genet. 96 (5), 616-618 (1995) MEDLINE 96071000 REFERENCE 2 (bases 1 to 1377) AUTHORS Kilimann,M.W. TITLE Direct Submission JOURNAL Submitted (25-JUL-1994) M.W. Kilimann, Inst f Physiol. Chemie, Universitaet Bochum, 44780 Bochum, FRG FEATURES Location/Qualifiers source 1..1377 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="skeletal muscle" /chromosome="7" gene 120..1283 /gene="PHKG1" CDS 120..1283 /gene="PHKG1" /EC_number="2.7.1.38" /note="gamma subunit" /codon_start=1 /product="phosphorylase kinase" /db_xref="PID:e113562" /db_xref="PID:g1147567" /translation="MTRDEALPDSHSAQDFYENYEPKEILGRGVSSVVRRCIHKPTSQ EYAVKVIDVTGGGSFSPEEVRELREATLKEVDILRKVSGHPNIIQLKDTYETNTFFFL VFDLMKRGELFDYLTEKVTLSEKETRKIMRALLEVICTLHKLNIVHRDLKPENILLDD NMNIKLTDFGFSCQLEPGERLREVCGTPSYLAPEIIECSMNEDHPGYGKEVDMWSTGV IMYTLLAGSPPFWHRKQMLMLRMIMSGNYQFGSPEWDDYSDTVKDLVSRFLVVQPQNR YTAEEALAHPFFQQYLVEEVRHFSPRGKFKVIALTVLASVRIYYQYRRVKPVTREIVI RDPYALRPLRRLIDAYAFRIYGHWVKKGQQQNRAALFENTPKAVLLSLAEEDY" BASE COUNT 324 a 385 c 412 g 256 t ORIGIN 1 ggccttcagc cctctgtggt cccctctccc cggggggctt tgggattctt gtcaagctcc 61 ttcaagagcc tgcaagcact taaccagcca cccagagttc cctcactgaa gatctgagca 121 tgacccggga cgaggcactg ccggactctc attctgcaca ggacttctat gagaattatg 181 agcccaaaga gatcctgggc aggggcgtta gcagtgtggt caggcgatgc atccacaagc 241 ccacgagcca ggagtacgcc gtgaaggtca tcgacgtcac cggtggaggc agcttcagcc 301 cggaggaggt gcgggagctg cgagaagcca cgctgaagga ggtggacatc ctgcgcaagg 361 tctcagggca ccccaacatc atacagctga aggacactta tgagaccaac actttcttct 421 tcttggtgtt tgacctgatg aagagagggg agctctttga ctacctcact gagaaggtca 481 ccttgagtga gaaggaaacc agaaagatca tgcgagctct gctggaggtg atctgcacct 541 tgcacaaact caacatcgtg caccgggacc tgaagcccga gaacattctc ttggatgaca 601 acatgaacat caagctcaca gactttggct tttcctgcca gctggagccg ggagagaggc 661 tgcgagaggt ctgcgggacc cccagttacc tggcccctga gattatcgag tgctccatga 721 atgaggacca cccgggctac gggaaagagg tggacatgtg gagcactggc gtcatcatgt 781 acacgctgct ggccggctcc ccgcccttct ggcaccggaa gcagatgctg atgctgagga 841 tgatcatgag cggcaactac cagtttggct cgcccgagtg ggatgattac tcggacaccg 901 tgaaggacct ggtctcccga ttcctggtgg tgcaacccca gaaccgctac acagcggaag 961 aggccttggc acaccccttc ttccagcagt acttggtgga ggaagtgcgg cacttcagcc 1021 cccgggggaa gttcaaggtg atcgctctga ccgtgctggc ttcagtgcgg atctactacc 1081 agtaccgccg ggtgaagcct gtgacccggg agatcgtcat ccgagacccc tatgccctcc 1141 ggcctctgcg ccggctcatc gacgcctacg ctttccgaat ctatggccac tgggtgaaga 1201 aggggcagca gcagaaccgg gcagcccttt tcgagaacac acccaaggcc gtgctcctct 1261 ccctggccga ggaggactac tgaggggctg gccagtcagg gagggctagg gggcaggtgg 1321 ggaggggaag ccatggaaat acaagtcaaa ggggtaaaaa aaaaaaaaaa aaaaaaa // LOCUS HSPHKLA 4566 bp RNA PRI 07-JUL-1995 DEFINITION H.sapiens PHKLA mRNA. ACCESSION X80497 NID g663009 KEYWORDS PHKLA gene; phosphorylase kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4566) AUTHORS van den Berg,I.E., van Beurden,E.A., Malingre,H.E., van Amstel,H.K., Poll-The,B.T., Smeitink,J.A., Lamers,W.H. and Berger,R. TITLE X-linked liver phosphorylase kinase deficiency is associated with mutations in the human liver phosphorylase kinase alpha subunit JOURNAL Am. J. Hum. Genet. 56 (2), 381-387 (1995) MEDLINE 95150027 REFERENCE 2 (bases 1 to 4566) AUTHORS Van Den Berg,I.E.T. TITLE Direct Submission JOURNAL Submitted (25-JUL-1994) I.E.T. Van Den Berg, Wilhelmina Children's Hospital, Nieuwe Gracht 137, 3512 LK Utrecht, NETHERLANDS FEATURES Location/Qualifiers source 1..4566 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda ZAP" /clone="HLA1 and HLA2" /chromosome="x" /tissue_type="liver" gene 127..3834 /gene="PHKLA" CDS 127..3834 /gene="PHKLA" /EC_number="2.7.1.38" /note="alpha subunit" /codon_start=1 /product="phosphorylase kinase" /db_xref="PID:g663010" /db_xref="SWISS-PROT:P46019" /translation="MRSRSNSGVRLDGYARLVQQTILCYQNPVTGLLSASHEQKDAWV RDNIYSILAVWGLGMAYRKNADRDEDKAKAYELEQNVVKLMRGLLQCMMRQVAKVEKF KHTQSTKDSLHAKYNTATCGTVVGDDQWGHLQVDATSLFLLFLAQMTASGLRIIFTLD EVAFIQNLVFYIEAAYKVADYGMWERGDKTNQGIPELNASSVGMAKAALEAIDELDLF GAHGGRKSVIHVLPDEVEHCQSILFSMLPRASTSKEIDAGLLSIISFPAFAVEDVNLV NVTKNEIISKLQGRYGCCRFLRDGYKTPREDPNRLHYDPAELKLFENIECEWPVFWTY FIIDGVFSGDAVQVQEYREALEGILIRGKNGIRLVPELYAVPPNKVDEEYKNPHTVDR VPMGKVPHLWGQSLYILSSLLAEGFLAAGEIDPLNRRFSTSVKPDVVVQVTVLAENNH IKDLLRKHGVNVQSIADIHPIQVQPGRILSHIYAKLGRNKNMNLSGRPYRHIGVLGTS KLYVIRNQIFTFTPQFTDQHHFYLALDNEMIVEMLRIELAYLCTCWRMTGRPTLTFPI SRTMLTNDGSDIHSAVLSTIRKLEDGYFGGARVKLGNLSEFLTTSFYTYLTFLDPDCD EKLFDNASEGTFSPDSDSDLVGYLEDTCNQESQDELDHYINHLLQSTSLRSYLPPLCK NTEDRHVFSAIHSTRDILSVMAKAKGLEVPFVPMTLPTKVLSAHRKSLNLVDSPQPLL EKVPESDFQWPRDDHGDVDCEKLVEQLKDCSNLQDQADILYILYVIKGPSWDTNLSGQ HGVTVQNLLGELYGKAGLNQEWGLIRYISGLLRKKVEVLAEACTDLLSHQKQLTVGLP PEPREKIISAPLPPEELTKLIYEASGQDISIAVLTQEIVVYLAMYVRAQPSLFVEMLR LRIGLIIQVMATELARSLNCSGEEASESLMNLSPFDMKNLLHHILSGKEFGVERSVRP IHSSTSSPTISIHEVGHTGVTKTERSGINRLRSEMKQMTRRFSADEQFFSVGQAASSS AHSSKSARSSTPSSPTGTSSSDSGGHHIGWGERQGQWLRRRRLDGAINRVPVGFYQRV WKILQKCHGLSIDGYVLPSSTTREMTPHEIKFAVHVESVLNRVPQPEYRQLLVEAIMV LTLLSDTEMTSIGGIIHVDQIVQMASQLFLQDQVSIGAMDTLEKDQATGICHFFYDSA PSGAYGTMTYLTRAVASYLQELLPNSGCQMQ" BASE COUNT 1124 a 1167 c 1188 g 1087 t ORIGIN 1 cggtcccatc ccaagaaccg actaaggctg tgagtgtccg ggaaccagac ccgcttggag 61 gccacagccc cgacgtcccg cgcccacgcg gcagatcggg cgctgcggcc tgggagcctc 121 ggggagatgc ggagcaggag caattccggg gtccgcttgg acgggtacgc gcggctggtg 181 cagcaaacca tcctgtgtta ccagaatccc gtcacggggc tgctgtcagc cagccatgag 241 cagaaggatg cctgggtgcg ggataacatc tacagtatcc tggccgtgtg gggcctgggc 301 atggcctacc gtaagaatgc agaccgcgat gaggacaagg ccaaggccta cgagctggag 361 cagaacgtgg tgaagctgat gcgaggtctt ctccagtgca tgatgagaca ggtggccaaa 421 gtggagaagt tcaaacacac tcagagcacc aaggacagcc tgcacgccaa gtacaacacc 481 gccacctgtg gcacggtggt gggcgacgac cagtggggcc acctccaggt ggatgccacc 541 tctctcttcc tcctgttcct ggcccagatg accgcctcag gcttacgtat cattttcact 601 ctcgatgagg tggccttcat acagaatctt gtcttttaca tagaagctgc atataaagtc 661 gctgattatg gaatgtggga gcgtggagat aagactaatc agggcatccc ggaattgaat 721 gcaagctccg taggaatggc caaggcagct cttgaggcaa ttgatgaact ggaccttttt 781 ggagcccatg gaggacgcaa gtcagtgatt catgttctgc cagatgaggt cgagcactgc 841 cagtctattc tgttctccat gctgccaaga gcgtcgacat ctaaagaaat tgatgctgga 901 cttctttcca ttatttcctt cccggccttt gcagtggaag atgtaaacct tgtaaatgtg 961 accaaaaatg aaattatttc taagctccag gggcgttatg gatgctgtcg cttccttcga 1021 gatggttata aaactccaag agaggaccct aatcgactgc attatgaccc tgctgaactc 1081 aagctcttcg aaaacattga atgtgagtgg cctgtgtttt ggacatattt tataatagat 1141 ggagtcttca gtggtgatgc tgttcaggtc caagaatacc gagaggccct ggagggaata 1201 ctcatcagag gcaagaatgg gatccgcctg gtgcctgaac tctacgctgt cccgcctaac 1261 aaggtagatg aagagtacaa gaatcctcac acagtagacc gagttcctat ggggaaggtg 1321 cctcatctgt ggggccaatc cttgtacatc ctcagctcgc tgttggcaga gggattcctt 1381 gccgctggtg aaatcgatcc cttaaataga agattttcca cttcagtcaa acctgatgtt 1441 gtagtacaag ttactgtttt ggcagaaaac aatcacatta aggacttatt gaggaaacac 1501 ggggtgaacg tccagagtat cgcggacatt catccaattc aagtccagcc gggccggatt 1561 cttagtcaca tatatgccaa gcttggacgg aataagaata tgaatttgag tgggcgaccg 1621 tatcgacata ttggtgtcct tggaacctct aaactatatg tgattaggaa ccaaatcttt 1681 acttttacac cccagttcac cgaccagcat cacttctacc tggccctcga caatgagatg 1741 atcgtggaga tgctaaggat cgagctggcc tacctgtgca cctgctggag gatgacgggc 1801 agacccacac tcaccttccc catcagtcgc accatgctca caaatgatgg ctcagacatt 1861 cattctgctg tgctctccac aattagaaaa ctagaggatg gatattttgg aggagccaga 1921 gtaaaattag ggaacctttc ggaatttctc accacatcgt tctacacata tctgactttt 1981 ctggatccag actgtgatga gaagttgttt gacaatgcca gcgaagggac tttcagtcct 2041 gatagtgatt cagatttggt aggatatctg gaagacacct gtaatcaaga aagccaagac 2101 gaacttgacc attatatcaa ccaccttctg caaagcacat cgttgaggtc ctatctgcct 2161 cctctttgta agaacacaga agaccgccat gtcttcagtg ctatccactc cacgcgggac 2221 atactttctg tgatggcaaa agcaaagggt ttggaagttc catttgttcc catgactttg 2281 ccgactaaag ttctaagtgc ccaccgtaaa tcactgaatc ttgttgattc tcctcagcca 2341 ctcctagaaa aggttcctga aagtgacttt cagtggccca gagatgacca tggtgacgtg 2401 gactgtgaga agctggttga gcagctaaaa gattgttcga acctacagga ccaagcagac 2461 attctgtaca ttctttatgt cataaagggt cccagctggg acacaaatct ctctggacag 2521 cacggggtca ccgttcaaaa ccttcttggt gagctctatg ggaaagccgg cttgaaccag 2581 gagtggggtc tgattcgcta catctcaggc cttctcagga agaaagtgga ggtcctggct 2641 gaggcctgca cagacctgct ttcgcaccag aagcagctca ccgtgggcct gccgcccgag 2701 ccccgggaga agatcatctc tgcgcccctt cccccagagg agctcacaaa actcatctac 2761 gaggccagtg ggcaggacat cagcattgcc gtcctcacgc aggagattgt ggtttacctg 2821 gccatgtatg tcagggcgca gcccagcctc tttgtggaga tgctgagact ccggattgga 2881 ctgatcattc aggtgatggc cacggagctg gcacggagcc tgaactgctc aggagaagag 2941 gcttctgaaa gtttgatgaa cctcagccct ttcgatatga aaaatctcct gcaccatatt 3001 ctaagtggga aagagtttgg cgttgaaaga agtgtgcgcc ctatccactc ctccacatcc 3061 agccctacca tctccatcca cgaggtgggc cataccggag tcaccaaaac tgagaggagt 3121 ggcattaaca gactgaggag tgaaatgaaa cagatgacta ggcggtttag tgctgatgaa 3181 cagttctttt ctgtgggcca ggccgcgtcc agcagtgcgc attcctccaa gtctgcgagg 3241 tccagcaccc catcctcgcc cactggcacg tcatcctcag actcgggagg acatcacatc 3301 ggctggggtg agcggcaggg ccagtggctg cgcaggagaa ggctggatgg ggccatcaac 3361 agggtccccg tgggattcta ccagagggtg tggaagatcc tccagaagtg ccacggtctc 3421 tccatcgatg gttatgtcct cccatcctcg acgacccgag agatgacccc gcatgagatc 3481 aagtttgctg tccatgtcga atcggtgctg aaccgcgtgc cgcagcccga gtaccggcag 3541 ctgctggtgg aagccatcat ggtgctgacg ctgctctcgg acacggagat gaccagcatc 3601 gggggcatca tccacgtgga ccagatcgtg cagatggcca gtcagctgtt cttgcaggac 3661 caggtgtcaa ttggtgccat ggacaccctg gagaaagacc aagccacagg aatctgccac 3721 ttcttttatg acagcgctcc gagtggggct tatgggacga tgacctacct aacaagagca 3781 gtggcttctt atttgcagga attgttgccc aattcgggct gccagatgca atagggtctc 3841 acctggaaac atgatcacac tctcaatctg tcacgtgccc cctagcctta ctgggaacct 3901 tctgtccccc aagatcccct gtgctatcag gaaagcatgt cccatcagaa acactctcgg 3961 ggggcaatgg tagcactcac cctgaaactg atgtatgtta aagccacaga gatagagctg 4021 aggagtcctg tgttcccccg caaggagcac cccgggatca ttttctaggt tcatttctct 4081 ggaacatttg ctgtagcatc tggtctcacg gactctgagg aggaattgga aattggtctc 4141 ttttgagtgc agagggaact gagacgccag cttaaattgg ctcttgcaga gagttacaga 4201 aatagtttcg atgagctagt gacacatcct aaagatgcaa agatcctcct ggcggcagta 4261 gccttgacaa gggccacctc ttcacaggat gcagtctgtc tgtgcaccaa actcttcacc 4321 aaatagaaca cttgtgtctc tctgtggaat ggggggtttt cttgtgcctt gcttgctttc 4381 atagccttcc attttattgg catggctgcc ttgatgtaac ataattctct gtccccaaga 4441 tttagaaaat tcctcttcgt tcaccttggc tcatggtctt ccagggtttt tatcctggct 4501 gtctatgaac tagggttttc tcctgcctta ggaaaaatac tgcatcttct ggaatctaga 4561 aaaaaa // LOCUS HSPHOSCYC 1539 bp RNA PRI 27-MAY-1997 DEFINITION H.sapiens mRNA for phosphate cyclase. ACCESSION Y11651 NID g2125811 KEYWORDS RNA 3'-phosphate cyclase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1539) AUTHORS Genschik,P., Billy,E., Swianiewicz,M. and Filipowicz,W. TITLE The human RNA 3'-terminal phosphate cyclase is a member of a new family of proteins conserved in Eucarya, Bacteria and Archaea JOURNAL EMBO J. 16 (10), 2955-2967 (1997) MEDLINE 97327572 REFERENCE 2 (bases 1 to 1539) AUTHORS Filipowicz,W. TITLE Direct Submission JOURNAL Submitted (05-MAR-1997) W. Filipowicz, Friedrich Miescher Institut, PO Box 2543, 4002 Basel, SWITZERLAND FEATURES Location/Qualifiers source 1..1539 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /clone_lib="Lambda gt11" CDS 171..1271 /codon_start=1 /product="phosphate cyclase" /db_xref="PID:e311534" /db_xref="PID:g2125812" /translation="MAGPRVEVDGSIMEGGGQILRVSTALSCLLGLPLRVQKIRAGRS TPGLRPQHLSGLEMIRDLCDGQLEGAEIGSTEITFTPEKIKGGIHTADTKTAGSVCLL MQVSMPCVLFAASPSELHLKGGTNAEMAPQIDYTVMVFKPIVEKFGFIFNCDIKTRGY YPKGGGEVIVRMSPVKQLNPINLTERGCVTKIYGRAFVAGVLPFKVAKDMAAAAVRCI RKEIRDLYVNIQPVQEPKDQAFGNGNGIIIIAETSTGCLFAGSSLGKRGVNADKVGIE AAEMLLANLRHGGTVDEYLQDQLIVFMALANGVSRIKTGPVTLHTQTAIHFAEQIAKA KFIVKKSEDEEDAAKDTYIIECQGIGMTNPNL" polyA_signal 1516..1521 /note="putative" polyA_site 1539 /note="putative" BASE COUNT 447 a 299 c 380 g 413 t ORIGIN 1 gctgactcca gtgtcccgag aggcgccgct tcttccgctt tctcgtcagg ctcctgcaac 61 cccaggcatg aaccaaggtt tctgaactac tgggcgggag ccaacgtctc ttctttctcc 121 cgctctggcg gaggctttgt cgctgcgggc tgggccccag ggtgtccccc atggcggggc 181 cgcgggtgga ggtcgatggc agcatcatgg aagggggcgg ccagatcctg agagtctcta 241 cggccttgag ctgtctccta ggcctcccct tgcgggtgca gaagatccga gccggccgga 301 gcacgccagg cctgaggcct caacatttat ctggactgga aatgattcga gatttgtgtg 361 atgggcaact ggagggggca gaaattggct caacagaaat aacctttaca ccagagaaga 421 tcaaaggtgg aatccacaca gcagatacca agacagcagg gagtgtgtgc ctcttgatgc 481 aggtctcaat gccgtgtgtt ctctttgctg cttctccatc agaacttcat ttgaaaggtg 541 gaactaatgc tgaaatggca ccacagatcg attatacagt gatggtcttc aagccaattg 601 ttgaaaaatt tggtttcata tttaattgtg acattaaaac aaggggatat tacccaaaag 661 ggggtggtga agtgattgtt cgaatgtcac cagttaaaca attgaaccct ataaatttaa 721 ctgagcgtgg ctgtgtgact aagatatatg gaagagcttt cgttgctggt gttttgccat 781 ttaaagtagc aaaagatatg gcagcggcag cagttagatg catcagaaag gagatccggg 841 atttgtatgt taacatccag cctgttcaag aacctaaaga ccaagcattt ggcaatggaa 901 atggaataat aattattgct gagacctcca ctggctgttt gtttgctgga tcatcgcttg 961 gtaaacgagg tgtaaatgca gacaaagttg gaattgaagc tgccgaaatg ctattagcaa 1021 atcttagaca tggtggtact gtggatgagt atctgcaaga ccagctgatt gttttcatgg 1081 cattagccaa tggagtttcc agaataaaaa caggaccagt tacactccat acgcaaaccg 1141 cgatacattt tgctgaacaa atagcaaagg ctaaatttat tgtgaagaaa tcagaagatg 1201 aagaagacgc cgctaaagat acttatatta ttgaatgcca aggaattggg atgacaaatc 1261 caaatctata gagtatttgc ctcttaaatg atacctcatt gatatattgc actatttcat 1321 aaatactata aaataatgac taggaagtaa cttattaaag gctatgactt aaatttgaag 1381 atgaagtaca gtgttctagg tttgctgaga aggcttcatt aaattaatct cactttgaat 1441 atctcctgag agatggacaa tgaaatatca gttggtggat atgtgtgata gctgatttca 1501 atattgaagt attgaaataa aatattcttt acacctgag // LOCUS HSPHOSI3K 5061 bp RNA PRI 26-NOV-1997 DEFINITION H.sapiens mRNA for phosphoinositide 3-kinase. ACCESSION Y13367 NID g2143259 KEYWORDS phosphoinositide 3-kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5061) AUTHORS Domin,J., Pages,F., Volinia,S., Rittenhouse,S.E., Zvelebil,M.J., Stein,R.C. and Waterfield,M.D. TITLE Cloning of a human phosphoinositide 3-kinase with a C2 domain that displays reduced sensitivity to the inhibitor wortmannin JOURNAL Biochem. J. 326 (Pt 1), 139-147 (1997) MEDLINE 97479209 REFERENCE 2 (bases 1 to 5061) AUTHORS Domin,J. TITLE Direct Submission JOURNAL Submitted (23-MAY-1997) J. Domin, Ludwig Institute for Cancer Research Receptor Studies, 91 Riding House St, London, W1P 8BT, UK FEATURES Location/Qualifiers source 1..5061 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="U937" /cell_type="histiocytic lymphoma" CDS 1..5061 /codon_start=1 /product="phosphoinositide 3-kinase" /db_xref="PID:e1188595" /db_xref="PID:g2143260" /translation="MAQIFSNSGFKECPFSHPEPTRAKDVDKEEALQMEAEALAKLQK DRQVTDNQRGFELSSSTRKKAQVYNKQDYDLMVFPESDSQKRALDIDVEKLTQAELEK LLLDDSFETKKTPVLPVTPILSPSFSAQLYFRPTIQRGQWPPGLPGPSTYALPSIYPS TYSKQAAFQNGFNPRMPTFPSTEPIYLSLPGQSPYFSYPLTPATPFHPQGSLPIYRPV VSTDMAKLFDKIASTSEFLKNGKARTDLEITDSKVSNLQVSPKSEDISKFDWLDLDPL SKPKVDNVEVLDHEEEKNVSSLLAKDPWDAVLLEERSTANCHLERKVNGKSLSVATVT RSQSLNIRTTQLAKAQGHISQKDPNGTSSLPTGSSLLQEVEVQNEEMAAFCRSITKLK TKFPYTNHRTNPGYLLSPVTAQRNICGENASVKVSIDIEGFQLPVTFTCDVSSTVEII IMQALCWVHDDLNQVDVGSYVLKVCGQEEVLQNNHCLGSHEHIQNCRKWDTEIRLQLL TFSAMCQNLARTAEDDETPVDLNKHLYQIEKPCKEAMTRHPVEELLDSYHNQVELALQ IENQHRAVDQVIKAVRKICSALDGVETLAITESVKKLKRAVNLPRSKTADVTSLFGGE DTSRSSTRGSLNPENPVQVSINQLTAAIYDLLRLHANSGRSPTDCAQSSKSVKEAWTT TEQLQFTIFAAHGISSNWVSNYEKYYLICSLSHNGKDLFKPIQSKKVGTYKNFFYLIK WDELIIFPIQISQLPLESVLHLTLFGILNQSSGSSPDSNKQRKGPEALGKVSLPLCDF RRFLTCGTKLLYLWTSSHTNSVPGTVTKKGYVMERIVLQVDFPSPAFDIIYTTPQVDR SIIQQHNLETLENDIKGKLLDILHKDSSLGLSKEDKAFLWEKRYYCFKHPNCLPKILA SAPNWKWGNLAKTYSLLHQWPALYPLIALELLDSKFADQEVRSLAVTWIEAISDDELT DLLPQFVQALKYEIYLNSSLVQFLLSRALGNIQIAHNLYWLLKDALHDVQFSTRYEHV LGALLSVGGKRLREELLKQTKLVQLLGGVAEKVRQASGSARQVVLQRSMERVQSFFQK NKCRLPLKPSLVAKELNIKSCSFFSSNAVPLKVTMVNADPLGEEINVMFKVGEDLRQD MLALQMIKIMDKIWLKEGLDLRMVIFKCLSTGRDRGMVELVPASDTLRKIQVEYGVTG SFKDKPLAEWLRKYNPSEEEYEKASENFIYSCAGCCVATYVLGICDRHNDNIMLRSTG HMFHIDFGKFLGHAQMFGSFKRDRAPFVLTSDMAYVINGGEKPTIRFQLFVDLCCQAY NLIRKQTNLFLNLLSLMIPSGLPELTSIQDLKYVRDALQPQTTDAEATIFFTRLIESS LGSIATKFNFFIHNLAQLRFSGLPSNDEPILSFSPKTYSFRQDGRIKEVSVFTYHKKY NPDKHYIYVVRILWEGQIEPSFVFRTFVEFQELHNKLSIIFPLWKLPGFPNRMVLGRT HIKDVAAKRKIELNSYLQSLMNASTDVAECDLVCTFFHPLLRDEKAEGIARSADAGSF SPTPGQIGGAVKLSISYRNGTLFIMVMHIKDLVTEDGADPNPYVKTYLLPDNHKTSKR KTKISRKTRNPTFNEMLVYSGYSKETLRQRELQLSVLSAESLRENFFLGGVTLPLKDF NLSKETVKWYQLTAATYL" BASE COUNT 1634 a 991 c 1002 g 1434 t ORIGIN 1 atggctcaga tatttagcaa cagcggattt aaagaatgtc cattttcaca tccggaacca 61 acaagagcaa aagatgtgga caaagaagaa gcattacaga tggaagcaga ggctttagca 121 aaactgcaaa aggatagaca agtgactgac aatcagagag gctttgagtt gtcaagcagc 181 accagaaaaa aagcacaggt ttataacaag caggattatg atctcatggt gtttcctgaa 241 tcagattccc aaaaaagagc attagatatt gatgtagaaa agctcaccca agctgaactt 301 gagaaactat tgctggatga cagtttcgag actaaaaaaa cacctgtatt accagttact 361 cctattctga gcccttcctt ttcagcacag ctctatttta gacctactat tcagagagga 421 cagtggccac ctggattacc tgggccttcc acttatgctt taccttctat ttatccttct 481 acttacagta aacaggctgc attccaaaat ggcttcaatc caagaatgcc cacttttcca 541 tctacagaac ctatatattt aagtcttccg ggacaatctc catatttctc atatcctttg 601 acacctgcca caccctttca tccacaagga agcttaccta tctatcgtcc agtagtcagt 661 actgacatgg caaaactatt tgacaaaata gctagtacat cagaattttt aaaaaatggg 721 aaagcaagga ctgatttgga gataacagat tcaaaagtca gcaatctaca ggtatctcca 781 aagtctgagg atatcagtaa atttgactgg ttagacttgg atcctctaag taagcctaag 841 gtggataatg tggaggtatt agaccatgag gaagagaaaa atgtttcaag tttgctagca 901 aaggatcctt gggatgctgt tcttcttgaa gagagatcga cagcaaattg tcatcttgaa 961 agaaaggtga atggaaaatc cctttctgtg gcaactgtta caagaagcca gtctttaaat 1021 attcgaacaa ctcagcttgc aaaagcccag ggccatatat ctcagaaaga cccaaatggg 1081 accagtagtt tgccaactgg aagttctctt cttcaagaag ttgaagtaca gaatgaggag 1141 atggcagctt tttgtcgatc cattacaaaa ttgaagacca aatttccata taccaatcac 1201 cgcacaaacc caggctattt gttaagtcca gtcacagcgc aaagaaacat atgcggagaa 1261 aatgctagtg tgaaggtctc cattgacatt gaaggatttc agctaccagt tacttttacg 1321 tgtgatgtga gttctactgt agaaatcatt ataatgcaag ccctttgctg ggtacatgat 1381 gacttgaatc aagtagatgt tggcagctat gttctaaaag tttgtggtca agaggaagtg 1441 ctgcagaata atcattgcct tggaagtcat gagcatattc aaaactgtcg aaaatgggac 1501 acagaaatta gactacaact cttgaccttc agtgcaatgt gtcaaaatct ggcccgaaca 1561 gcagaagatg atgaaacacc cgtggattta aacaaacacc tgtatcaaat agaaaaacct 1621 tgcaaagaag ccatgacgag acaccctgtt gaagaactct tagattctta tcacaaccaa 1681 gtagaactgg ctcttcaaat tgaaaaccaa caccgagcag tagatcaagt aattaaagct 1741 gtaagaaaaa tctgtagtgc tttagatggt gtcgagactc ttgccattac agaatcagta 1801 aagaagctaa agagagcagt taatcttcca aggagtaaaa ctgctgatgt gacttctttg 1861 tttggaggag aagacactag caggagttca actaggggct cacttaatcc tgaaaatcct 1921 gttcaagtaa gcataaacca attaactgca gcaatttatg atcttctcag actccatgca 1981 aattctggta ggagtcctac agactgtgcc caaagtagca agagtgtcaa ggaagcatgg 2041 actacaacag agcagctcca gtttactatt tttgctgctc atggaatttc aagtaattgg 2101 gtatcaaatt atgaaaaata ctacttgata tgttcactgt ctcacaatgg aaaggatctt 2161 tttaaaccta ttcaatcaaa gaaggttggc acttacaaga atttcttcta tcttattaaa 2221 tgggatgaac taatcatttt tcctatccag atatcacaat tgccattaga atcagttctt 2281 caccttactc tttttggaat tttaaatcag agcagtggaa gttcccctga ttctaataag 2341 cagagaaagg gaccagaagc tttgggcaaa gtttctttac ctctttgtga ctttagacgg 2401 tttttaacat gtggaactaa acttctatat ctttggactt catcacatac aaattctgtt 2461 cctggaacag ttaccaaaaa aggatatgtc atggaaagaa tagtgctaca ggttgatttt 2521 ccttctcctg catttgatat tatttataca actcctcaag ttgacagaag cattatacag 2581 caacataact tagaaacact agagaatgat ataaaaggga aacttcttga tattcttcat 2641 aaagactcat cacttggact ttctaaagaa gataaagctt ttttatggga gaaacgttat 2701 tattgcttca aacacccaaa ttgtcttcct aaaatattag caagcgcccc aaactggaaa 2761 tggggtaatc ttgccaaaac ttactcattg cttcaccagt ggcctgcatt gtacccacta 2821 attgcattgg aacttcttga ttcaaaattt gctgatcagg aagtaagatc cctagctgtg 2881 acctggattg aggccattag tgatgatgag ctaacagatc ttcttccaca gtttgtacaa 2941 gctttgaaat atgaaattta cttgaatagt tcattagtgc aattcctttt gtccagggca 3001 ttgggaaata tccagatagc acacaattta tattggcttc tcaaagatgc cctgcatgat 3061 gtacagttta gtacccgata cgaacatgtt ttgggtgctc tcctgtcagt aggaggaaaa 3121 cgacttagag aagaacttct aaaacagacg aaacttgtac agcttttagg aggagtagca 3181 gaaaaagtaa ggcaggctag tggatcagcc agacaggttg ttctccaaag aagtatggaa 3241 cgagtacagt ccttttttca gaaaaataaa tgccgtctcc ctctcaagcc aagtctagtg 3301 gcaaaagaat taaatattaa gtcgtgttcc ttcttcagtt ctaatgctgt ccccctaaaa 3361 gtcacaatgg tgaatgctga ccctctggga gaagaaatta atgtcatgtt taaggttggt 3421 gaagatcttc ggcaagatat gttagcttta cagatgataa agattatgga taagatctgg 3481 cttaaagaag gactagatct gaggatggta attttcaaat gtctctcaac tggcagagat 3541 cgaggcatgg tggagctggt tcctgcttcc gataccctca ggaaaatcca agtggaatat 3601 ggtgtgacag gatcctttaa agataaacca cttgcagagt ggctaaggaa atacaatccc 3661 tctgaagaag aatatgaaaa ggcttcagag aactttatct attcctgtgc tggatgctgt 3721 gtagccacct atgttttagg catctgtgat cgacacaatg acaatataat gcttcgaagc 3781 acgggacaca tgtttcacat tgactttgga aagtttttgg gacatgcaca gatgtttggc 3841 agcttcaaaa gggatcgggc tccttttgtg ctgacctctg atatggcata tgtcattaat 3901 gggggtgaaa agcccaccat tcgttttcag ttgtttgtgg acctctgctg tcaggcctac 3961 aacttgataa gaaagcagac aaaccttttt cttaacctcc tttcactgat gattccttca 4021 gggttaccag aacttacaag tattcaagat ttgaaatacg ttagagatgc acttcaaccc 4081 caaactacag acgcagaagc tacaattttc tttactaggc ttattgaatc aagtttggga 4141 agcattgcca caaagtttaa cttcttcatt cacaaccttg ctcagcttcg tttttctggt 4201 cttccttcta atgatgagcc catcctttca ttttcaccta aaacatactc ctttagacaa 4261 gatggtcgaa tcaaggaagt ctctgttttt acatatcata agaaatacaa cccagataaa 4321 cattatattt atgtagtccg aattttgtgg gaaggacaga ttgaaccatc atttgtcttc 4381 cgaacatttg tcgaatttca ggaacttcac aataagctca gtattatttt tccactttgg 4441 aagttaccag gctttcctaa taggatggtt ctaggaagaa cacacataaa agatgtagca 4501 gccaaaagga aaattgagtt aaacagttac ttacagagtt tgatgaatgc ttcaacggat 4561 gtagcagagt gtgatcttgt ttgtactttc ttccaccctt tacttcgtga tgagaaagct 4621 gaagggatag ctaggtctgc agatgcaggt tccttcagtc ctactccagg ccaaatagga 4681 ggagctgtga aattatccat ctcttaccga aatggtactc ttttcatcat ggtgatgcat 4741 atcaaagatc ttgttactga agatggagct gacccaaatc catatgtcaa aacataccta 4801 cttccagata accacaaaac atccaaacgt aaaaccaaaa tttcacgaaa aacgaggaat 4861 ccgacattca atgaaatgct tgtatacagt ggatatagca aagaaaccct aagacagcga 4921 gaacttcaac taagtgtact cagtgcagaa tctctgcggg agaatttttt cttgggtgga 4981 gtaaccctgc ctttgaaaga tttcaacttg agcaaagaga cggttaaatg gtatcagctg 5041 actgcggcaa catacttgta a // LOCUS HSPHOSINK 3201 bp RNA PRI 06-DEC-1997 DEFINITION H.sapiens mRNA for p85 beta subunit of phosphatidyl-inositol-3-kinase. ACCESSION X80907 NID g2160047 KEYWORDS p85 gene; phosphatidylinositol 3-kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3201) AUTHORS Janssen,J.W.G., Schleithhoff,L., Bartram,C.R. and Schulz,A.S. TITLE An oncogene fusion product of the phosphatidylinositol 3-kinase P85beta subunit and HUMORF8, putative deudiquitinating enzyme JOURNAL Oncogene In press REFERENCE 2 (bases 1 to 3201) AUTHORS Schulz,A. TITLE Direct Submission JOURNAL Submitted (11-AUG-1994) A. Schulz, Section of Molecular Biology, Pediatrics 2, University of Ulm, Ulm, FRG FEATURES Location/Qualifiers source 1..3201 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="1-gt11-library" /clone_lib="fetal liver" gene 242..2428 /gene="p85-beta" CDS 242..2428 /gene="p85-beta" /codon_start=1 /product="p85 beta subunit of phosphatidyl-inositol-3-kinase" /db_xref="PID:e115624" /db_xref="PID:g2160048" /translation="MAGPEGFQYRALYPFRRERPEDLELLPGDVLVVSRAALQALGVA EGGERCPQSVGWMPGLNERTRQRGDFPGTYVEFLGPVALARPGPRPRGPRPLPARPRD GAPEPGLTLPDLPEQFSPPDVAPPLLVKLVEAIERTGLDSESHYRPELPAPRTDWSLS DVDQWDTAALADGIKSFLLALPAPLVTPEASAEARRALREAAGPVGPALEPPTLPLHR ALTLRFLLQHLGRVARRAPALGPAVRALGATFGPLLLRAPPPPSSPPPGGAPDGSEPS PDFPALLVEKLLQEHLEEQEVAPPALPPKPPKAKPAPTVLANGGSPPSLQDAEWYWGD ISREEVNEKLRDTPDGTFLVRDASSKIQGEYTLTLRKGGNNKLIKVFHRDGHYGFSEP LTFCSVVDLINHYRHESLAQYNAKLDTRLLYPVSKYQQDQIVKEDSVEAVGAQLKVYH QQYQDKSREYDQLYEEYTRTSQELQMKRTAIEAFNETIKIFEEQGQTQEKCSKEYLER FRREGNEKEMQRILLNSERLKSRIAEIHESRTKLEQQLRAQASDNREIDKRMNSLKPD LMQLRKIRDQYLVWLTQKGARQKKINEWLGIKNETEDQYALMEDEDDLPHHEERTWYV GKINRTQAEEMLSGKRDGTFLIRESSQRGCYACSVVVDGDTKHCVIYRTATGFGFAEP YNLYGSLKELVLHYQHASLVQHNDALTVTLAHPVRAPGPGPPPAAR" BASE COUNT 616 a 1113 c 957 g 515 t ORIGIN 1 caactccctc ccaccagctg acgaatggtg gacccagtga cgagtggccc ttgtaagggt 61 catggaataa tttgaagcga ggcatgagcg gcccctgtgg tcgcctgtga ctgctggaga 121 tagaggtccc agcaccccaa gccaacccag cggaccctcc cagccctgct tcaaccaatg 181 gggccagtgg ggctccaagc agccaaccta accatccaga ccccacccca ctcacgcggc 241 catggcgggc cctgagggct tccagtaccg cgctctgtac ccgttccgcc gggagcggcc 301 ggaggacctg gagctgctgc ccggcgacgt gctggtagtg agccgggcgg ccttgcaggc 361 gctgggcgtg gccgagggtg gcgagcgctg cccacagagc gtgggctgga tgcccggcct 421 caacgagcgc acacggcagc gaggtgactt ccctggcacc tatgtggagt tcctggggcc 481 cgtggccctg gcccggcccg gccctcgccc acggggcccc cgcccactgc ccgccaggcc 541 ccgtgatggg gcccctgagc caggcctcac actccccgac ttgcccgagc agttctcccc 601 acctgatgtg gctccccctc ttctggtgaa gcttgtggag gccattgaaa ggacagggct 661 ggacagcgaa tctcactacc gcccggagct gcccgcaccg cgtacagact ggtccctgag 721 cgacgtggat cagtgggaca cggcagccct ggctgacggc attaagagct tcctgctggc 781 actgcccgcg ccgctcgtga cccccgaggc ctcggccgag gcgcgccggg ccctgcggga 841 ggccgcgggg cccgtggggc cggcgctgga gccaccgacg ctgccgctgc accgcgcgct 901 cacgctgcgc ttcctgctcc agcacctggg ccgcgtggcc cgccgcgccc cggccctggg 961 tcccgcggtc cgggccctgg gcgccacctt tgggccgctg ctcctgcgcg cgccgccgcc 1021 gccgtcctcg ccgccgccag ggggcgctcc cgacgggagt gagcccagcc ctgacttccc 1081 ggcgctgctg gtggagaagc tgcttcagga acacttggaa gagcaggagg ttgcgccccc 1141 agcgctgccg cctaaacccc ccaaggcaaa gccggccccc acagtcctgg ccaatggagg 1201 gagcccaccc tccctgcagg atgctgagtg gtactggggg gacatttcaa gggaggaggt 1261 gaacgagaaa ctccgggaca ctcccgatgg caccttccta gtccgagatg cttctagcaa 1321 gatccagggc gagtacacgc tgaccctcag gaaaggcggg aacaataagc tgatcaaggt 1381 cttccaccga gatgggcact atggcttctc agagccactc accttctgct ccgttgtgga 1441 cctcatcaat cactaccgcc acgagtctct ggcccagtac aatgccaagc tggacacacg 1501 gctcctctac cctgtgtcca aataccagca ggaccagatt gtcaaggagg acagcgtgga 1561 ggcagtgggc gcccagctta aggtctatca ccagcagtac caggacaaga gccgcgagta 1621 tgaccagctt tatgaagagt acacacggac ctcccaggag ctgcagatga agcgtactgc 1681 aattgaggcc ttcaatgaga ctatcaagat ctttgaagag cagggccaga ctcaagagaa 1741 atgcagcaag gaatacctgg agcgcttccg gcgtgagggc aacgagaaag agatgcaaag 1801 gatcctgctg aactccgagc ggctcaagtc ccgcattgcc gagatccatg agagccgcac 1861 gaagctggag cagcagctgc gggcccaggc ctcggacaac agagagatcg acaagcgcat 1921 gaacagcctc aagccggacc tcatgcagct gcgcaagatc cgagaccagt acctcgtgtg 1981 gctcacccag aaaggcgccc ggcagaagaa aatcaacgag tggctgggga ttaaaaatga 2041 gactgaggac cagtacgcac tcatggagga cgaggacgat ctcccgcacc acgaggaacg 2101 cacttggtac gtgggcaaga tcaaccgcac gcaggcagag gagatgctga gcggcaagcg 2161 ggatggcacc ttcctcatcc gcgagagcag ccagcggggc tgctacgcct gctccgtggt 2221 agtggacggc gacaccaagc actgcgtcat ctaccgcacg gccaccggct tcggcttcgc 2281 ggagccctac aacctgtacg ggtcgctgaa ggagctggtg ctgcactacc agcacgcctc 2341 gctggtgcag cacaacgacg cgctcactgt caccctggcg cacccagtgc gcgccccggg 2401 ccccggcccg ccgcctgccg cccgctgagc accgaggacc cgccccaagc agagccgccc 2461 ctgggcccgt ctgcgccgga ggctgcggcg gcgggagcca cggaccaaga ccagccacat 2521 ccaggggtcc tcatttctcc ggctctggct cttgtttggg gttctctcac cctctttctc 2581 tttccttccc tcccccattc tccagatctc cctctgtctc cttttctctg tctttcttgg 2641 cccctgtctc tctccatgtt gggggtccta actcccccac cccatatcta cgtgtcctcc 2701 gggcattgcc ctctccatgg ctctggtcac cctgaccctc tgccctgccc accgcaggtc 2761 ccccggggtc ccggaagccc cttctggctg cacctgccat gtttacagag ggcccctggg 2821 ctgcgcggcc ccagcctggg caccctgatt tttaagccat agacctgggg tcagggcagg 2881 aaggaacttc actctgctgc ttccgagaac ctcggccgtg acattcgggg ccgggcggga 2941 cccgccccac agactccaac ttcccctcca aaccccgaag tgaaacccgc caccgggtta 3001 ccccacaagg gggccgctgc gagaagttca cccacccccg aaaaaataat taaactcgca 3061 ggccaggcac ggtggctcat gcctgtaatc ccagcacttt gggaggccaa gacgggcgga 3121 tcttttgagg tcgggagttg gaggccagcc tggccaaaat ggcaaaaccc cgcatctact 3181 aaaatacaaa aattagccgg g // LOCUS HSPHR 2095 bp RNA PRI 18-AUG-1993 DEFINITION H.sapiens mRNA for parathyroid hormone receptor. ACCESSION X68596 NID g396812 KEYWORDS G-protein coupled receptor; parathyroid hormone receptor; transmembrane domain. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2095) AUTHORS Schneider,H. TITLE Direct Submission JOURNAL Submitted (02-OCT-1992) H. Schneider, Preclinical Research, Sandoz Pharma AG, Bau 386 / 322, CH- 4002 Basel, SWITZERLAND REMARK sequence revised by author (17-AUG-1993) REFERENCE 2 (bases 1 to 2095) AUTHORS Schneider,H., Feyen,J.H., Seuwen,K. and Movva,N.R. TITLE Cloning and functional expression of a human parathyroid hormone receptor JOURNAL Eur. J. Pharmacol. 246 (2), 149-155 (1993) MEDLINE 93387403 FEATURES Location/Qualifiers source 1..2095 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="kidney cDNA" /clone="pB-5, pA-19" CDS 175..1956 /codon_start=1 /product="parathyroid hormone receptor" /db_xref="PID:g396813" /db_xref="SWISS-PROT:Q03431" /translation="MGTARIAPGLALLLCCPVLSSAYALVDADDVMTKEEQIFLLHRA QAQCEKRLKEVLQRPASIMESDKGWTSASTSGKPRKDKASGKLYPESEEDKEAPTGSR YRGRPCLPEWDHILCWPLGAPGEVVAVPCPDYIYDFNHKGHAYRRCDRNGSWELVPGH NRTWANYSECVKFLTNETREREVFDRLGMIYTVGYSVSLASLTVAVLILAYFRRLHCT RNYIHMHLFLSFMLRAVSIFVKDAVLYSGATLDEAERLTEEELRAIAQAPPPPATAAA GYAGCRVAVTFFLYFLATNYYWILVEGLYLHSLIFMAFFSEKKYLWGFTVFGWGLPAV FVAVWVSVRATLANTGCWDLSSGNKKWIIQVPILASIVLNFILFINIVRVLATKLRET NAGRCDTRQQYRKLLKSTLVLMPLFGVHYIVFMATPYTEVSGTLWQVQMHYEMLFNSF QGFFVAIIYCFCNGEVQAEIKKSWSRWTLALDFKRKARSGSSSYSYGPMVSHTSVTNV GPRVGLGLPLSPRLLPTATTNGHPQLPGHAKPGTPALETLETTPPAMAAPKDDGFLNG SCSGLDEEASGPERPPALLQEEWETVM" sig_peptide 175..249 mat_peptide 250..1953 /product="parathyroid hormone receptor" BASE COUNT 437 a 644 c 616 g 398 t ORIGIN 1 gcctccccgt ggccaacttg agtctgctct gcagctttag gcccgacttg gaaggcccat 61 gggctgcaga tgaggaaact gaggtccaga cagccgaaga gtggtagtgt ccaggacaca 121 caactgggcc ggcggcggcg gctgccccga gggacgcggc cctaggcggt ggcgatgggg 181 accgcccgga tcgcacccgg cctggcgctc ctgctctgct gccccgtgct cagctccgcg 241 tacgcgctgg tggatgcaga tgacgtcatg actaaagagg aacagatctt cctgctgcac 301 cgtgctcagg cccagtgcga aaaacggctc aaggaggtcc tgcagaggcc agccagcata 361 atggaatcag acaagggatg gacatctgcg tccacatcag ggaagcccag gaaagataag 421 gcatctggga agctctaccc tgagtctgag gaggacaagg aggcacccac tggcagcagg 481 taccgagggc gcccctgtct gccggaatgg gaccacatcc tgtgctggcc gctgggggca 541 ccaggtgagg tggtggctgt gccctgtccg gactacattt atgacttcaa tcacaaaggc 601 catgcctacc gacgctgtga ccgcaatggc agctgggagc tggtgcctgg gcacaacagg 661 acgtgggcca actacagcga gtgtgtcaaa tttctcacca atgagactcg tgaacgggag 721 gtgtttgacc gcctgggcat gatttacacc gtgggctact ccgtgtccct ggcgtccctc 781 accgtagctg tgctcatcct ggcctacttt aggcggctgc actgcacgcg caactacatc 841 cacatgcacc tgttcctgtc cttcatgctg cgcgccgtga gcatcttcgt caaggacgct 901 gtgctctact ctggcgccac gcttgatgag gctgagcgcc tcaccgagga ggagctgcgc 961 gccatcgccc aggcgccccc gccgcctgcc accgccgctg ccggctacgc gggctgcagg 1021 gtggctgtga ccttcttcct ttacttcctg gccaccaact actactggat tctggtggag 1081 gggctgtacc tgcacagcct catcttcatg gccttcttct cagagaagaa gtacctgtgg 1141 ggcttcacag tcttcggctg gggtctgccc gctgtcttcg tggctgtgtg ggtcagtgtc 1201 agagctaccc tggccaacac cgggtgctgg gacttgagct ccgggaacaa aaagtggatc 1261 atccaggtgc ccatcctggc ctccattgtg ctcaacttca tcctcttcat caatatcgtc 1321 cgggtgctcg ccaccaagct gcgggagacc aacgccggcc ggtgtgacac acggcagcag 1381 taccgcaagc tgctcaaatc cacgctggtg ctcatgcccc tctttggcgt ccactacatt 1441 gtcttcatgg ccacaccata caccgaggtc tcagggacgc tctggcaagt ccagatgcac 1501 tatgagatgc tcttcaactc cttccaggga ttttttgtcg caatcatata ctgtttctgc 1561 aatggcgagg tacaagctga gatcaagaaa tcttggagcc gctggacact ggcactggac 1621 ttcaagcgaa aggcacgcag cgggagcagc agctatagct acggccccat ggtgtcccac 1681 acaagtgtga ccaatgtcgg cccccgtgtg ggactcggcc tgcccctcag cccccgccta 1741 ctgcccactg ccaccaccaa cggccaccct cagctgcctg gccatgccaa gccagggacc 1801 ccagccctgg agaccctcga gaccacacca cctgccatgg ctgctcccaa ggacgatggg 1861 ttcctcaacg gctcctgctc aggcctggac gaggaggcct ctgggcctga gcggccacct 1921 gccctgctac aggaagagtg ggagacagtc atgtgaccag gcgctggggg ctggacctgc 1981 tgacatagtg gatggacaga tggaccaaaa gatgggttgg ttgaatgatt tcccactcag 2041 ggctggggcc aagaggaaaa acagggaaaa aaagaaaaaa aaaagaaaaa aaaaa // LOCUS HSPI12 1559 bp RNA PRI 05-MAR-1997 DEFINITION H.sapiens mRNA for protease inhibitor 12 (PI12; neuroserpin). ACCESSION Z81326 NID g1785653 KEYWORDS neuroserpin; PI12; protease inhibitor 12. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1559) AUTHORS Schrimpf,S.P., Bleiker,A.J., Brecevic,L., Kozlov,S.V., Berger,P., Osterwalder,T., Krueger,S.R., Schinzel,A. and Sonderegger,P. TITLE Human neuroserpin (PI12): cDNA cloning and chromosomal localization to 3q26 JOURNAL Genomics 40 (1), 55-62 (1997) MEDLINE 97224485 REFERENCE 2 (bases 1 to 1559) AUTHORS Sonderegger,P. TITLE Direct Submission JOURNAL Submitted (25-OCT-1996) Peter Sonderegger, Institute of Biochemistry, University of Zurich, Winterthurerstrasse 190, 8057 Zurich, CH-8057, Switzerland FEATURES Location/Qualifiers source 1..1559 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="retina" 5'UTR 1..81 sig_peptide 82..129 CDS 82..1314 /standard_name="protease inhibitor 12 (PI12)" /codon_start=1 /product="neuroserpin" /db_xref="PID:e276639" /db_xref="PID:g1785654" /translation="MAFLGLFSLLVLQSMATGATFPEEAIADLSVNMYNRLRATGEDE NILFSPLSIALAMGMMELGAQGSTQKEIRHSMGYDSLKNGEEFSFLKEFSNMVTAKES QYVMKIANSLFVQNGFHVNEEFLQMMKKYFNAAVNHVDFSQNVAVANYINKWVENNTN NLVKDLVSPRDFDAATYLALINAVYFKGNWKSQFRPENTRTFSFTKDDESEVQIPMMY QQGEFYYGEFSDGSNEAGGIYQVLEIPYEGDEISMMLVLSRQEVPLATLEPLVKAQLV EEWANSVKKQKVEVYLPRFTVEQEIDLKDVLKALGITEIFIKDANLTGLSDNKEIFLS KAIHKSFLEVNEEGSEAAAVSGMIAISRMAVLYPQVIVDHPFFFLIRNRRTGTILFMG RVMHPETMNTSGHDFEEL" mat_peptide 130..1311 /standard_name="protease inhibitor 12 (PI12)" /product="neuroserpin" 3'UTR 1315..1559 polyA_signal 1535..1540 BASE COUNT 489 a 264 c 344 g 462 t ORIGIN 1 gcggagcaca gtccgccgag cacaagctcc agcatcccgt caggggttgc aggtgtgtgg 61 gaggcttgaa actgttacaa tatggctttc cttggactct tctctttgct ggttctgcaa 121 agtatggcta caggggccac tttccctgag gaagccattg ctgacttgtc agtgaatatg 181 tataatcgtc ttagagccac tggtgaagat gaaaatattc tcttctctcc attgagtatt 241 gctcttgcaa tgggaatgat ggaacttggg gcccaaggat ctacccagaa agaaatccgc 301 cactcaatgg gatatgacag cctaaaaaat ggtgaagaat tttctttctt gaaggagttt 361 tcaaacatgg taactgctaa agagagccaa tatgtgatga aaattgccaa ttccttgttt 421 gtgcaaaatg gatttcatgt caatgaggag tttttgcaaa tgatgaaaaa atattttaat 481 gcagcagtaa atcatgtgga cttcagtcaa aatgtagccg tggccaacta catcaataag 541 tgggtggaga ataacacaaa caatctggtg aaagatttgg tatccccaag ggattttgat 601 gctgccactt atctggccct cattaatgct gtctatttca aggggaactg gaagtcgcag 661 tttaggcctg aaaatactag aaccttttct ttcactaaag atgatgaaag tgaagtccaa 721 attccaatga tgtatcagca aggagaattt tattatgggg aatttagtga tggctccaat 781 gaagctggtg gtatctacca agtcctagaa ataccatatg aaggagatga aataagcatg 841 atgctggtgc tgtccagaca ggaagttcct cttgctactc tggagccatt agtcaaagca 901 cagctggttg aagaatgggc aaactctgtg aagaagcaaa aagtagaagt atacctgccc 961 aggttcacag tggaacagga aattgattta aaagatgttt tgaaggctct tggaataact 1021 gaaattttca tcaaagatgc aaatttgaca ggcctctctg ataataagga gatttttctt 1081 tccaaagcaa ttcacaagtc cttcctagag gttaatgaag aaggctcaga agctgctgct 1141 gtctcaggaa tgattgcaat tagtaggatg gctgtgctgt atcctcaagt tattgtcgac 1201 catccatttt tctttcttat cagaaacagg agaactggta caattctatt catgggacga 1261 gtcatgcatc ctgaaacaat gaacacaagt ggacatgatt tcgaagaact ttaagttact 1321 ttatttgaat aacaaggaaa acagtaacta agcacattat gtttgcaact ggtatatatt 1381 taggatttgt gttttacagt atatcttaag ataatattta aaatagttcc agataaaaac 1441 aatatatgta aattataagt aacttgtcaa ggaatgttat cagtattaag ctaatggtcc 1501 tgttatgtca ttgtgtttgt gtgctgttgt ttaaaataaa agtacctatt gaacatgtg // LOCUS HSPIR 2919 bp RNA PRI 22-FEB-1994 DEFINITION Homo sapiens encoding Polymeric immunoglobulin receptor. ACCESSION X73079 NID g456345 KEYWORDS cytoplasmic region; polymeric immunoglobulin receptor; secretory component; transmembrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2919) AUTHORS Piskurich,J.F., France,J.A., Tamer,C.M., Willmer,C.A., Kaetzel,C.S. and Kaetzel,D.M. TITLE Interferon-gamma induces polymeric immunoglobulin receptor mRNA in human intestinal epithelial cells by a protein synthesis dependent mechanism JOURNAL Mol. Immunol. 30 (4), 413-421 (1993) MEDLINE 93205018 REFERENCE 2 (bases 1 to 2919) AUTHORS Piskurich,J.F. TITLE Direct Submission JOURNAL Submitted (10-FEB-1994) J.F. Piskurich, Case Western Reserve University, Dept of Pathology, Biomedical Research Building, Cleveland, OH 44060, USA REFERENCE 3 (bases 1 to 2919) AUTHORS Piskurich,J.F. TITLE Molecular Cloning and Regulation of the Polymeric Immunoglobulin Receptor - Thesis (1994) Pathology, Case Western Reserve University JOURNAL Unpublished FEATURES Location/Qualifiers source 1..2919 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="lactating" /tissue_type="breast" /clone="hpIgR-1,2" /clone_lib="catalog #HL1061b, Clontech Laboratories Inc., Palo Alto, CA" /sex="female" 5'UTR 1..180 sig_peptide 181..234 CDS 181..2475 /function="Binds and transports polymeric immunoglobulin" /codon_start=1 /product="Polymeric immunoglobulin receptor" /db_xref="PID:g456346" /translation="MLLFVLTCLLAVFPAISTKSPIFGPEEVNSVEGNSVSITCYYPP TSVNRHTRKYWCRQGARGGCITLISSEGYVSSKYAGRANLTNFPENGTFVVNIAQLSQ DDSGRYKCGLGINSRGLSFDVSLEVSQGPGLLNDTKVYTVDLGRTVTINCPFKTENAQ KRKSLYKQIGLYPVLVIDSSGYVNPNYTGRIRLDIQGTGQLLFSVVINQLRLSDAGQY LCQAGDDSNSNKKNADLQVLKPEPELVYEDLRGSVTFHCALGPEVANVAKFLCRQSSG ENCDVVVNTLGKRAPAFEGRILLNPQDKDGSFSVVITGLRKEDAGRYLCGAHSDGQLQ EGSPIQAWQLFVNEESTIPRSPTVVKGVAGSSVAVLCPYNRKESKSIKYWCLWEGAQN GRCPLLVDSEGWVKAQYEGRLSLLEEPGNGTFTVILNQLTSRDAGFYWCLTNGDTLWR TTVEIKIIEGEPNLKVPGNVTAVLGETLKVPCHFPCKFSSYEKYWCKWNNTGCQALPS QDEGPSKAFVNCDENSRLVSLTLNLVTRADEGWYWCGVKQGHFYGETAAVYVAVEERK AAGSRDVSLAKADAAPDEKVLDSGFREIENKAIQDPRLFAEEKAVADTRDQADGSRAS VDSGSSEEQGGSSRALVSTLVPLGLVLAVGAVAVGVARARHRKNVDRVSIRSYRTDIS MSDFENSREFGANDNMGASSITQETSLGGKEEFVATTESTTETKEPKKAKRSSKEEAE MAYKDFLLQSSTVAAEAQDGPQEA" misc_RNA 235..2094 /function="Binds polymeric immunoglobulin" mat_peptide 235..2472 /product="Polymeric immunoglobulin receptor" misc_RNA 2095..2163 /product="Transmembrane segment of receptor" misc_RNA 2164..2472 /product="Transmembrane segment of receptor" misc_RNA 2164..2472 /product="Cytoplasmic tail region of the receptor" 3'UTR 2476..2919 BASE COUNT 684 a 806 c 854 g 575 t ORIGIN 1 agagtttcag ttttggcagc agcgtccagt gccctgccag tagctcctag agaggcaggg 61 gttaccaact ggccagcagg ctgtgtccct gaagtcagat caacgggaga gaaggaagtg 121 gctaaaacat tgcacaggag aagtcggcct gagtggtgcg gcgctcggga cccaccagca 181 atgctgctct tcgtgctcac ctgcctgctg gcggtcttcc cagccatctc cacgaagagt 241 cccatatttg gtcccgagga ggtgaatagt gtggaaggta actcagtgtc catcacgtgc 301 tactacccac ccacctctgt caaccggcac acccggaagt actggtgccg gcagggagct 361 agaggtggct gcataaccct catctcctcg gagggctacg tctccagcaa atatgcaggc 421 agggctaacc tcaccaactt cccggagaac ggcacatttg tggtgaacat tgcccagctg 481 agccaggatg actccgggcg ctacaagtgt ggcctgggca tcaatagccg aggcctgtcc 541 tttgatgtca gcctggaggt cagccagggt cctgggctcc taaatgacac taaagtctac 601 acagtggacc tgggcagaac ggtgaccatc aactgccctt tcaagactga gaatgctcaa 661 aagaggaagt ccttgtacaa gcagataggc ctgtaccctg tgctggtcat cgactccagt 721 ggttatgtga atcccaacta tacaggaaga atacgccttg atattcaggg tactggccag 781 ttactgttca gcgttgtcat caaccaactc aggctcagcg atgctgggca gtatctctgc 841 caggctgggg atgattccaa tagtaataag aagaatgctg acctccaagt gctaaagccc 901 gagcccgagc tggtttatga agacctgagg ggctcagtga ccttccactg tgccctgggc 961 cctgaggtgg caaacgtggc caaatttctg tgccgacaga gcagtgggga aaactgtgac 1021 gtggtcgtca acaccctggg gaagagggcc ccagcctttg agggcaggat cctgctcaac 1081 ccccaggaca aggatggctc attcagtgtg gtgatcacag gcctgaggaa ggaggatgca 1141 gggcgctacc tgtgtggagc ccattcggat ggtcagctgc aggaaggctc gcctatccag 1201 gcctggcaac tcttcgtcaa tgaggagtcc acgattcccc gcagccccac tgtggtgaag 1261 ggggtggcag gaagctctgt ggccgtgctc tgcccctaca accgtaagga aagcaaaagc 1321 atcaagtact ggtgtctctg ggaaggggcc cagaatggcc gctgccccct gctggtggac 1381 agcgaggggt gggttaaggc ccagtacgag ggccgcctct ccctgctgga ggagccaggc 1441 aacggcacct tcactgtcat cctcaaccag ctcaccagcc gggacgccgg cttctactgg 1501 tgtctgacca acggcgatac tctctggagg accaccgtgg agatcaagat tatcgaagga 1561 gaaccaaacc tcaaggtacc agggaatgtc acggctgtgc tgggagagac tctcaaggtc 1621 ccctgtcact ttccatgcaa attctcctcg tacgagaaat actggtgcaa gtggaataac 1681 acgggctgcc aggccctgcc cagccaagac gaaggcccca gcaaggcctt cgtgaactgt 1741 gacgagaaca gccggcttgt ctccctgacc ctgaacctgg tgaccagggc tgatgagggc 1801 tggtactggt gtggagtgaa gcagggccac ttctatggag agactgcagc cgtctatgtg 1861 gcagttgaag agaggaaggc agcggggtcc cgcgatgtca gcctagcgaa ggcagacgct 1921 gctcctgatg agaaggtgct agactctggt tttcgggaga ttgagaacaa agccattcag 1981 gatcccaggc tttttgcaga ggaaaaggcg gtggcagata caagagatca agccgatggg 2041 agcagagcat ctgtggattc cggcagctct gaggaacaag gtggaagctc cagagcgctg 2101 gtctccaccc tggtgcccct gggcctggtg ctggcagtgg gagccgtggc tgtgggggtg 2161 gccagagccc ggcacaggaa gaacgtcgac cgagtttcaa tcagaagcta caggacagac 2221 attagcatgt cagacttcga gaactccagg gaatttggag ccaatgacaa catgggagcc 2281 tcttcgatca ctcaggagac atccctcgga ggaaaagaag agtttgttgc caccactgag 2341 agcaccacag agaccaaaga acccaagaag gcaaaaaggt catccaagga ggaagccgag 2401 atggcctaca aagacttcct gctccagtcc agcaccgtgg ccgccgaggc ccaggacggc 2461 ccccaggaag cctagacggt gtcgccgcct gctccctgca cccatgacaa tcaccttcag 2521 aatcatgtcg atcctggggg ccctcagctc ctggggaccc cactccctgc tctaacacct 2581 gcctaggttt ttcctactgt cctcagaggc gtgctggtcc cctcctcagt gacatcaaag 2641 cctggcctaa ttgttcctat tggggatgag ggtggcatga ggaggtccca cttgcaactt 2701 ctttctgttg agagaacctc aggtacggag aagaatagag gtcctcatgg gtcccttgaa 2761 ggaagaggga ccagggtggg agagctgatt gcagaaagga gagacgtgca gcgcccctct 2821 gcacccttat catgggatgt caacagaatt ttttccctcc actccatccc tccctcccgt 2881 ccttcccctc ttcttctttc cttaccatca aaagatgta // LOCUS HSPIRIN1 1292 bp RNA PRI 25-MAR-1997 DEFINITION H.sapiens mRNA for Pirin, isolate 1. ACCESSION Y07867 NID g1907075 KEYWORDS pirin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1292) AUTHORS Wendler,W.M.F., Kremmer,E., Forster,R. and Winnacker,E.L. TITLE Identification of pirin, a novel highly conserved nuclear protein JOURNAL J. Biol. Chem. 272 (13), 8482-8489 (1997) MEDLINE 97236804 REFERENCE 2 (bases 1 to 1292) AUTHORS Wendler,W.M.F. TITLE Direct Submission JOURNAL Submitted (10-SEP-1996) W.M.F. Wendler, Institut fuer Biochemie der Universitaet Muenchen, Genzentrum, Feodor-Lynen Strasse 25, D-81377 Muenchen, FRG FEATURES Location/Qualifiers source 1..1292 /organism="Homo sapiens" /isolate="pDR-Pirin #1" /db_xref="taxon:9606" /cell_type="HeLa" /clone_lib="lambda DR2" 5'UTR 1..204 CDS 205..1077 /codon_start=1 /product="pirin" /db_xref="PID:e274873" /db_xref="PID:g1907076" /translation="MGSSKKVTLSVLSREQSEGVGARVRRSIGRPELKNLDPFLLFDE FKGGRPGGFPDHPHRGFETVSYLLEGGSMAHEDFCGHTGKMNPGDLQWMTAGRGILHA EMPCSEEPAHGLQLWVNLRSSEKMVEPQYQELKSEEIPKPSKDGVTVAVISGEALGIK SKVYTRTPTLYLDFKLDPGAKHSQPIPKGWTSFIYTISGDVYIGPDDAQQKIEPHHTA VLGEGDSVQVENKDPKRSHFVLIAGEPLREPVIQHGPFVMNTNEEISQAILDFRNAKN GFERAKTWKSKIGN" 3'UTR 1078..1277 polyA_signal 1254..1259 polyA_site 1277 BASE COUNT 367 a 307 c 314 g 304 t ORIGIN 1 cctcccgcct cctctaggcc gccggccgcg aagcgctgag tcacggtgag gcgactggac 61 ccacactctc ttaacctgcc ctccctgcac tcgctcccgg cggctcttcg cgtcaccccc 121 gccgctaagg ctccaggtgc cgctaccgca gcccctccat cctctacagc tcagcatcag 181 aacactctct ttttagactc cgatatgggg tcctccaaga aagttactct ctcagtgctc 241 agccgggagc agtcggaagg ggttggagcg agggtccgga gaagcattgg cagacccgag 301 ttaaaaaatc tggatccgtt tttactgttt gatgaattta aaggaggtag accaggagga 361 tttcctgatc atccacatcg aggttttgaa acagtatcct acctcctgga agggggcagc 421 atggcccatg aagacttctg tggacacact ggtaaaatga acccaggaga tttgcagtgg 481 atgactgcgg gccggggcat tctgcacgct gagatgcctt gctcagagga gccagcccat 541 ggcctacaac tgtgggttaa tttgaggagc tcagagaaga tggtggagcc tcagtaccag 601 gaactgaaaa gtgaagaaat ccctaaaccc agtaaggatg gtgtgacagt tgctgtcatt 661 tctggagaag ccctgggaat aaagtccaag gtttacactc gcacaccaac cttatatttg 721 gacttcaaat tggacccagg agccaaacat tcccaaccta tccctaaagg gtggacaagc 781 ttcatttaca cgatatctgg agatgtgtat attgggcccg atgatgcaca acaaaaaata 841 gaacctcatc acacagcagt gcttggagaa ggtgacagtg tccaggtgga gaacaaggat 901 cccaagagaa gccactttgt cttaattgct ggggagccat taagagaacc agttatccaa 961 catggtccat ttgtgatgaa caccaatgaa gagatttctc aagctattct tgatttcaga 1021 aacgcaaaaa atgggtttga aagggccaaa acctggaaat caaagattgg gaactagtgg 1081 aaagcggaag agcaggtctt gatgtgtcct agaattttgc catttctgag attgagccat 1141 tgaaggcatt ccatttctaa agcttattta gccggtgctt ctaaagaatt ccacactaac 1201 gtgataacat ggtttttgta acaataaatg taggatattt cctggcacat gcaaataaac 1261 ctaatcattg tttctttaaa aaaaaaaaaa aa // LOCUS HSPISSLRE 1883 bp RNA PRI 17-OCT-1994 DEFINITION H.sapiens PISSLRE mRNA. ACCESSION X78342 NID g556650 KEYWORDS CDC2 kinase; serine/threonine kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1883) AUTHORS Brambilla,R. and Draetta,G. TITLE Molecular cloning of PISSLRE, a novel putative member of the cdk family of protein serine/threonine kinases JOURNAL Oncogene 9 (10), 3037-3041 (1994) MEDLINE 94366755 REFERENCE 2 (bases 1 to 1883) AUTHORS Brambilla,R. TITLE Direct Submission JOURNAL Submitted (22-MAR-1994) R. Brambilla, EMBL, Meyerhofstr. 1, 69012 Heidelberg, FRG FEATURES Location/Qualifiers source 1..1883 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="RT112 (bladder carcinoma)" gene 121..1203 /gene="PISSLRE" CDS 121..1203 /gene="PISSLRE" /codon_start=1 /db_xref="PID:g556651" /translation="MAEPDLECEQIRLKCIRKEGFFTVPPEHRLGRCRSVKEFEKLNR IGEGTYGIVYRARDTQTDEIVALKKVRMDKEKDGIPISSLREITLLLRLRHPNIVELK EVVVGNHLESIFLVMGYCEQDLASLLENMPTPFSEAQVKCIVLQVLRGLQYLHRNFII HRDLKVSNLLMTDKGCVKTADFGLARAYGVPVKPMTPKVVTLWYRAPELLLGTTTQTT SIDMWAVGCILAELLAHRPLLPGTSEIHQIDLIVQLLGTPSENIWPGFSKLPLVGQYS LRKQPYNNLKHKFPWLSEAGLRLLHFLFMYDPKKRATAGDCLESSYFKEKPLPCEPEL MPTFPHHRNKRAAPATSEGQSKRCKP" BASE COUNT 399 a 547 c 582 g 355 t ORIGIN 1 gtgagccacc gcccccagcc tggcctggca tttctttgag ttcaggaagt gtgacaagga 61 tttggacacc cagaaataag cgtgtcgaga agagcacaag cagaggatcc agcgctcggc 121 atggcggagc cagatctgga gtgcgagcag atccgtctga agtgtattcg taaggagggc 181 ttcttcacgg tgcctccgga acacaggctg ggacgatgcc ggagtgtgaa ggagtttgag 241 aagctgaacc gcattggaga gggtacctac ggcattgtgt atcgggcccg ggacacccag 301 acagatgaga ttgtcgcact gaagaaggtg cggatggaca aggagaagga tggcatcccc 361 atcagcagct tgcgggagat cacgctgctg ctccgcctgc gtcatccgaa catcgtggag 421 ctgaaggagg tggttgtggg gaaccacctg gagagcatct tcctggtgat gggttactgt 481 gagcaggacc tggccagcct cctggagaat atgccaacac ccttctcgga ggctcaggtc 541 aagtgcatcg tgctgcaggt gctccggggc ctccagtatc tgcacaggaa cttcattatc 601 cacagggacc tgaaggtttc caacttgctc atgaccgaca agggttgtgt gaagacagcg 661 gatttcggcc tggcccgggc ctatggtgtc ccagtaaagc caatgacccc caaggtggtc 721 actctctggt accgagcccc tgaactgctg ttgggaacca ccacgcagac caccagcatc 781 gacatgtggg ctgtgggctg catactggcc gagctgctgg cgcacaggcc tcttctcccc 841 ggcacttccg agatccacca gatcgacttg atcgtgcagc tgctgggcac gcccagtgag 901 aacatctggc cgggcttttc caagctgcca ctggtcggcc agtacagcct ccggaagcag 961 ccctacaaca acctgaagca caagttccca tggctgtcgg aggccgggct gcgcctgctg 1021 cacttcctgt tcatgtacga ccctaagaaa agggcgacgg ccggggactg cctggagagc 1081 tcctatttca aggagaagcc cctaccctgt gagccggagc tcatgccgac ctttccccac 1141 caccgcaaca agcgggccgc cccagccacc tccgagggcc agagcaagcg ctgtaaaccc 1201 tgacggtggg cctggcacac gcctgtattc ccacaccagg tcttccgatc agtggtgtct 1261 gtgaagggtg ccgcgagcca ggctgaccag gcgcccggga tccagctcat ccccttggct 1321 gggaacatcc tccactgact tcctcccact gtctgccctg aacccactgc tgcccccaga 1381 aaaaggccgg gtgacaccgg gggctcccag cccgtgcacc ctggaagggc aggtctggcg 1441 gctccatccg tggctgcagg ggtctcatgt ggtcctcctc gctatgttgg aaatgtgcaa 1501 ccactgcttc ttgggaggag tggtgggtgc agtccccccg ctgtctttga gttgtggtgg 1561 accgctggcc tgggatgaga gggcccagaa gaccttcgta tcccctctca gtcgcccggg 1621 gctgtcccgt gcatgggttg gctgtgggga ccccaggtgg gcctggcagg actccagatg 1681 aggacaagag ggacaaggta tggggtggga gccacaattg aggatacccc gagctaccag 1741 gagagccctg ggctggaggc tgagctggat ccctgctccc cacacggagg acccaacagg 1801 aggccgtggc tctgatgctg agcgaagcta taggctcttg ttggataaaa gcttttttaa 1861 cagaaaaaaa aaaaaaaaaa aaa // LOCUS HSPIT1 1050 bp RNA PRI 25-NOV-1991 DEFINITION H.sapiens mRNA for transcription factor Pit-1. ACCESSION X62429 NID g35474 KEYWORDS pit-1 gene; transcription factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1050) AUTHORS Lew,A.M. and Elsholtz,H.P. TITLE Cloning of the human cDNA for transcription factor Pit-1 JOURNAL Nucleic Acids Res. 19 (22), 6329 (1991) MEDLINE 92066490 REFERENCE 2 (bases 1 to 1050) AUTHORS Elsholtz,H. TITLE Direct Submission JOURNAL Submitted (01-OCT-1991) H. Elsholtz, University of Toronto, Clinical Biochemistry, 100 College Street, Toronto Ontario M5G 1L5, CANADA FEATURES Location/Qualifiers source 1..1050 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="anterior pituitary" /cell_type="somatomammotroph" /clone_lib="lambda gt10, Clontech" gene 20..895 /gene="pit-1" CDS 20..895 /gene="pit-1" /codon_start=1 /product="transcription factor" /db_xref="PID:g35475" /db_xref="SWISS-PROT:P28069" /translation="MSCQAFTSADTFIPLNSDASATLPLIMHHSAAECLPVSNHATNV MSTATGLHYSVPSCHYGNQPSTYGVMAGSLTPCLYKFPDHTLSHGFPPIHQPLLAEDP TAADFKQELRRKSKLVEEPIDMDSPEIRELEKFANEFKVRRIKLGYTQTNVGEALAAV HGSEFSQTTICRFENLQLSFKNACKLKAILSKWLEEAEQVGALYNEKVGANERKRKRR TTISIAAKDALERHFGEQNKPSSQEIMRMAEELNLEKEVVRVWFCNRRQREKRVKTSL NQSLFSISKEHLECR" BASE COUNT 336 a 212 c 214 g 288 t ORIGIN 1 tttctactct cttgtgggaa tgagttgcca agcatttact tcggctgata cctttatacc 61 tctgaattcg gacgcctctg caactctgcc tctgataatg catcacagtg ctgccgagtg 121 tctaccagtc tccaaccatg ccaccaatgt gatgtctaca gcaacaggac ttcattattc 181 tgttccttcc tgtcattatg gaaaccagcc atcaacctat ggagtgatgg caggtagttt 241 aaccccttgt ctttataaat ttcctgacca caccttgagt catggatttc ctcctataca 301 ccagcctctt ctggcagagg accccacagc tgctgatttc aagcaggaac tcaggcggaa 361 aagtaaattg gtggaagagc caatagacat ggattctcca gaaatcagag aacttgaaaa 421 gtttgccaat gaatttaaag tgagacgaat taaattagga tacacccaga caaatgttgg 481 ggaggccctg gcagctgtgc atggctctga attcagtcaa acaacaatct gccgatttga 541 aaatctgcag ctcagcttta aaaatgcatg caaactgaaa gcaatattat ccaaatggct 601 ggaggaagct gagcaagtag gagctttgta caatgaaaaa gtgggagcaa atgaaaggaa 661 aagaaaacga agaacaacta taagcattgc tgctaaagat gctctggaga gacactttgg 721 agaacagaat aaaccttctt ctcaagagat catgaggatg gctgaagaac tgaatctgga 781 gaaagaagta gtaagagttt ggttttgcaa ccggaggcag agagaaaaac gggtgaaaac 841 aagcctgaat cagagtttat tttctatttc gaaggaacat cttgagtgca gataagattt 901 ttctaattgg tataatacgg tttttctccc gtttcattcc tttctcttcc tcaacaaaaa 961 cagaaattac cttggttgac cttaaaatca ttttatatca atagctttta cagaagcttt 1021 acttttccac ttttttttaa aaaaaaaaaa // LOCUS HSPITR1 2990 bp RNA PRI 26-OCT-1995 DEFINITION H.sapiens mRNA for phosphatidylinositol 3-kinase. ACCESSION Z46973 NID g987947 KEYWORDS phosphatidylinositol 3-kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 887) AUTHORS Volinia,S., Dhand,R., Vanhaesebroeck,B., MacDougall,L.K., Stein,R., Zvelebil,M.J., Domin,J., Panaretou,C. and Waterfield,M.D. TITLE A human phosphatidylinositol 3-kinase complex related to the yeast Vps34p-Vps15p protein sorting system JOURNAL EMBO J. 14 (14), 3339-3348 (1995) MEDLINE 95354652 REFERENCE 2 (bases 1 to 2990) AUTHORS Volinia,S. TITLE Direct Submission JOURNAL Submitted (21-DEC-1994) Stefano Volinia PhD, Receptor Studies, Ludwig Institute For Cancer, Research, 91 Riding House Street, London, W1P 8BT, UK FEATURES Location/Qualifiers source 1..2990 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="PITR-1" /cell_line="TF-1, KG1a" /clone_lib="Lambda Zap TF1, KG1a" 5'UTR 1..47 CDS 48..2711 /codon_start=1 /product="phosphatidylinositol 3-kinase" /db_xref="PID:g987948" /translation="MGEAEKFHYIYSCDLDINVQLKIGSLEGKREQKSYNAVLEDPML KFSGLYQETCSDLYVTCQVFAEGKPSALPVRTSYKAFSTRWNWNEWLKLPVKYPDLPR NAQVALTIWDVYGPGKAVPVGGTTVSLFGKYGMSRQGMHDLKVWPNVEADGSEPTNTP GRTSSTLSEDQMSRLAKLTKAHRQGHMVKVDWLDRLTFREIEMINESVKRSSNFMYLM GGFRCVKCDDKEYGIVYYEKDGDESSPILTSFELVKVPDPQMSLENLVESKHHNLPRS LRSGPSDHDLKPYPSPRDQLKNIVSYPPSKPPTYEEQDLVWEFRYYLTNQDKALTKIL TSVIWDLPQGAKQALALLGKWNPMDVEDSLELISSHYTNPTVRRYAVARLRQADDEDL LMYLSQLVQALKYENFDDIKNGLEPTKKDSQSSVSGNVSNSGINSAEIDSSQIITSPL PSVSSPPPASKTKEVPDGENLEQDLCTFLISRASKNSTLANYLYWYVIVECEDQDTQQ RDPKTHEMYLNVMRRFSQALLKGDKSVRVMRSLLAAQQTFVDRLVHLMKAVQRESGNR KKKNERLQALLGDNEKMNLSDVELIPLPLEPQVKIRGIIPETATLFKSALMPAQLFFK TEDGGKYPVIFKHGDDLRQDQLILQIISLMDKLLRKENLDLKLTPYKVLATSTKHGFM QFIQSVPVAEVLDTEGSIQNFFRKYAPSENGPNGISAEVMDTYVKSCAGYCVITYILG VGDRHLDNLVLTKTGKLFHIDFGYILGRDPKPLPPPMKLNKEMVEGMGGTQSEQYQEF RKQCYTAFLHLRRYSNLILNLFSLMVDPNIPDIALEPDKTVKKVQDKFRLDLSDEEAV HYMQSLIDESVHALFAAVVEQIHKFAQYWRK" 3'UTR 2712..2990 BASE COUNT 949 a 566 c 678 g 797 t ORIGIN 1 cctgtaccta agttcccgct gtaggtggta cctttgcaga cggtgcgatg ggggaagcag 61 agaagtttca ctacatctat agttgtgacc tggatatcaa cgtccagctt aagataggaa 121 gcttggaagg gaagagagaa caaaagagtt ataacgctgt cctggaagac ccaatgttga 181 agttctcagg actatatcaa gagacatgct ctgatcttta tgttacttgt caagtttttg 241 cagaagggaa gccttcggcc ttgccagtga gaacatccta caaagcattt agtacaagat 301 ggaactggaa tgaatggctg aaactaccag taaaataccc tgacctgccc aggaatgccc 361 aagtggccct caccatatgg gatgtgtatg gtcccggaaa agcagtgcct gtaggaggaa 421 caacggtttc gctctttgga aaatacggca tgtctcgcca agggatgcat gacttgaaag 481 tctggcctaa tgtagaagca gatggatcag aacccacaaa tactcctggc agaacaagta 541 gcactctctc agaagatcag atgagccgtc ttgccaagct caccaaagct catcgacaag 601 gacacatggt gaaagtagat tggctggata gattgacatt tagagaaata gaaatgataa 661 atgagagtgt gaaacgaagt tctaatttca tgtacctgat gggtggattt cgatgtgtca 721 agtgtgatga taaggaatat ggtattgttt attatgaaaa ggacggtgat gaatcatctc 781 caattttaac aagttttgaa ttagtgaaag ttcctgaccc ccagatgtcc ctggagaatt 841 tagttgagag caaacaccac aaccttcccc ggagtttaag aagtggacct tctgaccacg 901 atctgaaacc ctatccttcc ccgagagatc agttaaaaaa tattgtgagt tatcctccat 961 ccaagccacc cacatatgaa gaacaagatc ttgtttggga gtttagatat tatcttacga 1021 atcaagataa agccttgacc aaaatcctga catctgttat ttgggatcta cctcaggggg 1081 ccaaacaggc cttggcactt ctggggaaat ggaacccgat ggatgtagag gactccttgg 1141 agctgatatc ctctcattac accaacccaa ctgtgaggcg ttatgctgtt gcccggttgc 1201 gacaggccga tgatgaggat ttgttgatgt acctatcaca attggtccag gctctcaaat 1261 atgaaaattt tgatgatata aagaatggat tggaacctac caagaaggat agtcagagtt 1321 cagtgtcagg aaatgtgtca aattctggaa taaattctgc agaaatagat agctcccaaa 1381 ttataaccag cccccttcct tcagtctctt cacctcctcc tgcatcaaaa acaaaagaag 1441 ttccagatgg cgaaaatctg gaacaagatc tctgtacctt cttgatatcg agagcctcca 1501 aaaactcaac actggctaat tatttatact ggtatgtgat agtggaatgt gaagatcaag 1561 atactcagca gagagatcca aagacccatg agatgtactt gaacgtaatg agaagattca 1621 gccaagcatt gttgaagggt gataagtctg tcagagttat gcgttctttg ctggctgcac 1681 aacagacatt tgtagatcgg ttggtgcatc taatgaaggc agtacaacgc gaaagtggaa 1741 atcgtaagaa aaagaatgag agactacagg cattgcttgg agataatgaa aagatgaatt 1801 tgtcagatgt ggaacttatc ccgttgcctt tagaacccca agtgaaaatt agaggaataa 1861 ttccggaaac agctacactg tttaaaagtg cccttatgcc tgcacagttg ttttttaaga 1921 cggaagatgg aggcaaatat ccagttatat ttaagcatgg agatgattta cgtcaagatc 1981 aacttattct tcaaatcatt tcactcatgg acaagctgtt acggaaagaa aatctggact 2041 tgaaattgac accttataag gtgttagcca ccagtacaaa acatggcttc atgcagttta 2101 tccagtcagt tcctgtggct gaagttcttg atacagaggg aagcattcag aactttttta 2161 gaaaatatgc accaagtgag aatgggccaa atgggattag tgctgaggtc atggacactt 2221 acgttaaaag ctgtgctgga tattgcgtga tcacctatat acttggagtt ggagacaggc 2281 acctggataa ccttgtgcta acaaaaacag gcaaactctt ccacatagac tttggatata 2341 ttttgggtcg ggatccaaag cctcttcctc caccaatgaa gctgaataaa gaaatggtag 2401 aaggaatggg gggcacacag agtgagcagt accaagagtt ccgtaaacag tgttacacgg 2461 ctttcctcca cctgcggagg tattctaatc tgattttgaa cttgttttcc ttgatggttg 2521 atccaaacat tccagatatt gcacttgaac cagataaaac tgtgaaaaag gttcaggata 2581 aattccgctt agacctgtcg gatgaagagg ctgtgcatta catgcagagt ctgattgatg 2641 agagtgtcca tgctcttttt gctgcagtgg tggaacagat tcacaagttt gcccagtact 2701 ggagaaaatg aaactgggat tgacccatca agatgcttgg ctcaataaga aaaccacgtt 2761 aggagcaacc tttgtatatt ggagacttca gagtaaccag caaggaagag aaatcttaat 2821 cttcaagtta ccatattttc caaatattac atggtacctg agttctgctt ccttggatgt 2881 cattgcttaa atatagtctt gaagggcttg ttttgaaata ttgtatatat tttttcaaat 2941 gtatacattg ttaataaatt aagaaatgag aaaaaaaaaa aaaaaaaaaa // LOCUS HSPKA 2549 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for cAMP-dependent protein kinase catalytic subunit type alpha (EC 2.7.1.37). ACCESSION X07767 M36872 NID g35478 KEYWORDS protein kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2549) AUTHORS Hanks,S.K. TITLE Direct Submission JOURNAL Submitted (30-MAY-1988) S.K. Hanks, The Salk Institute, P.O. Box 85800, San Diego, CA 92138, USA REFERENCE 2 (bases 1 to 2549) AUTHORS Maldonado,F. and Hanks,S.K. TITLE A cDNA clone encoding human cAMP-dependent protein kinase catalytic subunit C alpha JOURNAL Nucleic Acids Res. 16 (16), 8189-8190 (1988) MEDLINE 88335571 FEATURES Location/Qualifiers source 1..2549 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /clone="PSK-G4" CDS 81..1136 /note="protein kinase catalytic subunit type alpha (AA 1-351)" /codon_start=1 /db_xref="PID:g35479" /db_xref="SWISS-PROT:P17612" /translation="MGNAAAAKKGSEQESVKEFLAKAKEDFLKKWESPAQNTAHLDQF ERIKTLGTGSFGRVMLVKHKETGNHYAMKILDKQKVVKLKQIEHTLNEKRILQAVNFP FLVKLEFSFKDNSNLYMVMEYVPGGEMFSHLRRIGRFSEPHARFYAAQIVLTFEYLHS LDLIYRDLKPENLLIDQQGYIQVTDFGFAKRVKGRTWTLCGTPEYLAPEIILSKGYNK AVDWWALGVLIYEMAAGYPPFFADQPIQIYEKIVSGKVRFPSHFSSDLKDLLRNLLQV DLTKRFGNLKNGVNDIKNHKWFATTDWIAIYQRKVEAPFIPKFKGPGDTSNFDDYEEE EIRVSINEKCGKEFSEF" misc_feature 2537..2543 /note="pot polyA signal" BASE COUNT 568 a 795 c 618 g 567 t 1 others ORIGIN 1 cagtgngctc cgggccgccg gccgcagcca gcacccgccg cgccgcagct ccgggaccgg 61 ccccggccgc cgccgccgcg atgggcaacg ccgccgccgc caagaagggc agcgagcagg 121 agagcgtgaa agaattctta gccaaagcca aagaagattt tcttaaaaaa tgggaaagtc 181 ccgctcagaa cacagcccac ttggatcagt ttgaacgaat caagaccctc ggcacgggct 241 ccttcgggcg ggtgatgctg gtgaaacaca aggagaccgg gaaccactat gccatgaaga 301 tcctcgacaa acagaaggtg gtgaaactga aacagatcga acacaccctg aatgaaaagc 361 gcatcctgca agctgtcaac tttccgttcc tcgtcaaact cgagttctcc ttcaaggaca 421 actcaaactt atacatggtc atggagtacg tgcccggcgg ggagatgttc tcacacctac 481 ggcggatcgg aaggttcagt gagccccatg cccgtttcta cgcggcccag atcgtcctga 541 cctttgagta tctgcactcg ctggatctca tctacaggga cctgaagccg gagaatctgc 601 tcattgacca gcagggctac attcaggtga cagacttcgg tttcgccaag cgcgtgaagg 661 gccgcacttg gaccttgtgc ggcacccctg agtacctggc ccctgagatt atcctgagca 721 aaggctacaa caaggccgtg gactggtggg ccctgggggt tcttatctat gaaatggccg 781 ctggctaccc gcccttcttc gcagaccagc ccatccagat ctatgagaag atcgtctctg 841 ggaaggtgcg cttcccttcc cacttcagct ctgacttgaa ggacctgctg cggaacctcc 901 tgcaggtaga tctcaccaag cgctttggga acctcaagaa tggggtcaac gatatcaaga 961 accacaagtg gtttgccaca actgactgga ttgccatcta ccagaggaag gtggaagctc 1021 ccttcatacc aaagtttaaa ggccctgggg atacgagtaa ctttgacgac tatgaggaag 1081 aagaaatccg ggtctccatc aatgagaagt gtggcaagga gttttctgag ttttaggggc 1141 atgcctgtgc ccccatgggt tttctttttt cttttttctt ttttttggtc gggggggtgg 1201 gagggttgga ttgaacagcc agagggcccc agagttcctt gcatctaatt tcacccccac 1261 cccaccctcc agggttaggg ggagcaggaa gcccagataa tcagagggac agaaacacca 1321 gctgctcccc ctcatcccct tcaccctcct gccccctctc ccacttttcc cttcctcttt 1381 ccccacagcc ccccagcccc tcagccctcc cagcccactt ctgcctgttt taaacgagtt 1441 tctcaactcc agtcagacca ggtcttgctg gtgtatccag ggacagggta tggaaagagg 1501 ggctcacgct taactccagc ccccacccac acccccatcc cacccaacca caggccccac 1561 ttgctaaggg caaatgaacg aagcgccaac cttcctttcg gagtaatcct gcctgggaag 1621 gagagatttt tagtgacatg ttcagtgggt tgcttgctag aattttttta aaaaaacaac 1681 aatttaaaat cttatttaag ttccaccagt gcctccctcc ctccttcctc tactcccacc 1741 cctcccatgt ccccccattc ctcaaatcca ttttaaagag aagcagactg actttggaaa 1801 gggaggcgct ggggtttgaa cctccccgct gctaatctcc cctgggcccc tccccgggga 1861 atcctctctg ccaatcctgc gagggtctag gcccctttag gaagcctccg ctctcttttt 1921 ccccaacaga cctgtcttca cccttgggct ttgaaagcca gacaaagcag ctgcccctct 1981 ccctgccaaa gaggagtcat cccccaaaaa gacagagggg gagccccaag cccaagtctt 2041 tcctcccagc agcgtttccc cccaactcct taattttatt ctccgctaga ttttaacgtc 2101 cagccttccc tcagctgagt ggggagggca tccctgcaaa agggaacaga agaggccaag 2161 tccccccaag ccacggcccg gggttcaagg ctagagctgc tggggagggg ctgcctgttt 2221 tactcaccca ccagcttccg cctcccccat cctgggcgcc cctcctccag cttagctgtc 2281 agctgtccat cacctctccc ccactttctc atttgtgctt ttttctctcg taatagaaaa 2341 gtggggagcc gctggggagc caccccattc atccccgtat ttccccctct cataacttct 2401 ccccatccca ggaggagttc tcaggcctgg ggtggggccc cgggtgggtg cgggggcgat 2461 tcaacctgtg tgctgcgaag gacgagactt cctcttgaac agtgtgctgt tgtaaacata 2521 tttgaaaact attaccaata aagtttgtt // LOCUS HSPKCA1 2245 bp RNA PRI 12-SEP-1993 DEFINITION Human PKC alpha mRNA for protein kinase C alpha. ACCESSION X52479 NID g35482 KEYWORDS PKC alpha gene; protein kinase; protein kinase C. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2245) AUTHORS Hug,H. TITLE Direct Submission JOURNAL Submitted (05-MAR-1990) Hug H., Molecular Cell Biology, University of Freiburg, c/o Goedecke AG, Mooswaldallee 1-9, D 7800 Freiburg, F R G REFERENCE 2 (bases 1 to 2245) AUTHORS Finkenzeller,G., Marme,D. and Hug,H. TITLE Sequence of human protein kinase C alpha JOURNAL Nucleic Acids Res. 18 (8), 2183 (1990) MEDLINE 90245676 COMMENT See for overlapping sequence. FEATURES Location/Qualifiers source 1..2245 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="blood" /cell_type="T-cell" /cell_line="Jurkat" /clone_lib="lambda gt10" /clone="hPKC-alpha-GF29" /chromosome="17q21-17qter" CDS 28..2046 /note="protein kinase C alpha (AA 1-672)" /codon_start=1 /db_xref="PID:g35483" /db_xref="SWISS-PROT:P17252" /translation="MADVFPGNDSTASQDVANRFARKGALRQKNVHEVKDHKFIARFF KQPTFCSHCTDFIWGFGKQGFQCQVCCFVVHKRCHEFVTFSCPGADKGPDTDDPRSKH KFKIHTYGSPTFCDHCGSLLYGLIHQGMKCDTCDMNVHKQCVINVPSLCGMDHTEKRG RIYLKAEVADEKLHVTVRDAKNLIPMDPNGLSDPYVKLKLIPDPKNESKQKTKTIRST LNPQWNESFTFKLKPSDKDRRLSVEIWDWDRTTRNDFMGSLSFGVSELMKMPASGWYK LLNQEEGEYYNVPIPEGDEEGNMELRQKFEKAKLGPAGNKVISPSEDRKQPSNNLDRV KLTDFNFLMVLGKGSFGKVMLADRKGTEELYAIKILKKDVVIQDDDVECTMVEKRVLA LLDKPPFLTQLHSCFQTVDRLYFVMEYVNGGDLMYHIQQVGKFKEPQAVFYAAEISIG LFFLHKRGIIYRDLKLDNVMLDSEGHIKIADFGMCKEHMMDGVTTRTFCGTPDYIAPE IIAYQPYGKSVDWWAYGVLLYEMLAGQPPFDGEDEDELFQSIMEHNVSYPKSLSKEAV SICKGLMTKHPAKRLGCGPEGERDVREHAFFRRIDWEKLENREIQPPFKPKVCGKGAE NFDKFFTRGQPVLTPPDQLVIANIDQSDFEGFSYVNPQFVHPILQSAV" BASE COUNT 632 a 530 c 574 g 509 t ORIGIN 1 ggagcaagag gtggttgggg ggggaccatg gctgacgttt tcccgggcaa cgactccacg 61 gcgtctcagg acgtggccaa ccgcttcgcc cgcaaagggg cgctgaggca gaagaacgtg 121 cacgaggtga aggaccacaa attcatcgcg cgcttcttca agcagcccac cttctgcagc 181 cactgcaccg acttcatctg ggggtttggg aaacaaggct tccagtgcca agtttgctgt 241 tttgtggtcc acaagaggtg ccatgaattt gttacttttt cttgtccggg tgcggataag 301 ggacccgaca ctgatgaccc caggagcaag cacaagttca aaatccacac ttacggaagc 361 cccaccttct gcgatcactg tgggtcactg ctctatggac ttatccatca agggatgaaa 421 tgtgacacct gcgatatgaa cgttcacaag caatgcgtca tcaatgtccc cagcctctgc 481 ggaatggatc acactgagaa gagggggcgg atttacctaa aggctgaggt tgctgatgaa 541 aagctccatg tcacagtacg agatgcaaaa aatctaatcc ctatggatcc aaacgggctt 601 tcagatcctt atgtgaagct gaaacttatt cctgatccca agaatgaaag caagcaaaaa 661 accaaaacca tccgctccac actaaatccg cagtggaatg agtcctttac attcaaattg 721 aaaccttcag acaaagaccg acgactgtct gtagaaatct gggactggga tcgaacaaca 781 aggaatgact tcatgggatc cctttccttt ggagtttcgg agctgatgaa gatgccggcc 841 agtggatggt acaagttgct taaccaagaa gaaggtgagt actacaacgt acccattccg 901 gaaggggacg aggaaggaaa catggaactc aggcagaaat tcgagaaagc caaacttggc 961 cctgctggca acaaagtcat cagtccctct gaagacagga aacaaccttc caacaacctt 1021 gaccgagtga aactcacgga cttcaatttc ctcatggtgt tgggaaaggg gagttttgga 1081 aaggtgatgc ttgccgacag gaagggcaca gaagaactgt atgcaatcaa aatcctgaag 1141 aaggatgtgg tgattcagga tgatgacgtg gagtgcacca tggtagaaaa gcgagtcttg 1201 gccctgcttg acaaaccccc gttcttgacg cagctgcact cctgcttcca gacagtggat 1261 cggctgtact tcgtcatgga atatgtcaac ggtggggacc tcatgtacca cattcagcaa 1321 gtaggaaaat ttaaggaacc acaagcagta ttctatgcgg cagagatttc catcggattg 1381 ttctttcttc ataaaagagg aatcatttat agggatctga agttagataa cgtcatgttg 1441 gattcagaag gacatatcaa aattgctgac tttgggatgt gcaaggaaca catgatggat 1501 ggagtcacga ccaggacctt ctgtgggact ccagattata tcgccccaga gataatcgct 1561 tatcagccgt atggaaaatc tgtggactgg tgggcctatg gcgtcctgtt gtatgaaatg 1621 cttgccgggc agcctccatt tgatggtgaa gatgaagacg agctatttca gtctatcatg 1681 gagcacaacg tttcctatcc aaaatccttg tccaaggagg ctgtttctat ctgcaaagga 1741 ctgatgacca aacacccagc caagcggctg ggctgtgggc ctgaggggga gagggacgtg 1801 agagagcatg ccttcttccg gaggatcgac tgggaaaaac tggagaacag ggagatccag 1861 ccaccattca agcccaaagt gtgtggcaaa ggagcagaga actttgacaa gttcttcaca 1921 cgaggacagc ccgtcttaac accacctgat cagctggtta ttgctaacat agaccagtct 1981 gattttgaag ggttctcgta tgtcaacccc cagtttgtgc accccatctt acagagtgca 2041 gtatgaaact caccagcgag aacaaacacc tccccagccc ccagccctcc ccgcagtgga 2101 agtgaatcct taaccctaaa attttaaggc cacggcttgt gtctgattcc atatggaggc 2161 ctgaaaattg tagggttatt agtccaaatg tgatcaactg ttcagggtct ctctcttaca 2221 accaagaaca ttatcttagt ggaag // LOCUS HSPKCB1A 2574 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for protein kinase C (PKC) type beta I. ACCESSION X06318 M27545 NID g35488 KEYWORDS alternate splicing; protein kinase C; protein kinase C type beta. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2574) AUTHORS Kubo,K., Ohno,S. and Suzuki,K. TITLE Primary structures of human protein kinase C beta I and beta II differ only in their C-terminal sequences JOURNAL FEBS Lett. 223 (1), 138-142 (1987) MEDLINE 88030028 COMMENT see x07109; cloned PKC type betaI and PKC type beta II cDNA share identical nucleotide sequences from the 5' end to pos. 2000, suggesting that their mRNAs are derived from a single gene by an alternative splicing mechanism. Data kindly reviewed (6 June 1988) by K. Kubo. FEATURES Location/Qualifiers source 1..2574 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="spleen" /clone_lib="lambda gt10" /clone="H106" CDS 137..2152 /note="PKC beta 1 (AA 1-671)" /codon_start=1 /db_xref="PID:g35489" /db_xref="SWISS-PROT:P05771" /translation="MADPAAGPPPSEGEESTVRFARKGALRQKNVHEVKNHKFTARFF KQPTFCSHCTDFIWGFGKQGFQCQVCCFVVHKRCHEFVTFSCPGADKGPASDDPRSKH KFKIHTYSSPTFCDHCGSLLYGLIHQGMKCDTCMMNVHKRCVMNVPSLCGTDHTERRG RIYIQAHIDRDVLIVLVRDAKNLVPMDPNGLSDPYVKLKLIPDPKSESKQKTKTIKCS LNPEWNETFRFQLKESDKDRRLSVEIWDWDLTSRNDFMGSLSFGISELQKASVDGWFK LLSQEEGEYFNVPVPPEGSEANEELRQKFERAKISQGTKVPEEKTTNTVSKFDNNGNR DRMKLTDFNFLMVLGKGSFGKVMLSERKGTDELYAVKILKKDVVIQDDDVECTMVEKR VLALPGKPPFLTQLHSCFQTMDRLYFVMEYVNGGDLMYHIQQVGRFKEPHAVFYAAEI AIGLFFLQSKGIIYRDLKLDNVMLDSEGHIKIADFGMCKENIWDGVTTKTFCGTPDYI APEIIAYQPYGKSVDWWAFGVLLYEMLAGQAPFEGEDEDELFQSIMEHNVAYPKSMSK EAVAICKGLMTKHPGKRLGCGPEGERDIKEHAFFRYIDWEKLERKEIQPPYKPKARDK RDTSNFDKEFTRQPVELTPTDKLFIMNLDQNEFAGFSYTNPEFVINV" BASE COUNT 679 a 643 c 660 g 592 t ORIGIN 1 cagagccggc gcaggggaag cgcccggggc cccgggtgca gcagcgcccg ccgcctcccg 61 ggcctccccg gcccgcagcc cggggtcccg ggccccgggg ccggcacctc tcgggctccg 121 gctccccgcg cgcaagatgg ctgacccggc tgcggggccg ccgccgagcg agggcgagga 181 gagcaccgtg cgcttcgccc gcaaaggcgc cctccggcag aagaacgtgc atgaggtcaa 241 gaaccacaaa ttcaccgccc gcttcttcaa gcagcccacc ttctgcagcc actgcaccga 301 cttcatctgg ggcttcggga agcagggatt ccagtgccaa gtttgctgct ttgtggtgca 361 caagcggtgc catgaatttg tcacattctc ctgccctggc gctgacaagg gtccagcctc 421 cgatgacccc cgcagcaaac acaagtttaa gatccacacg tactccagcc ccacgttttg 481 tgaccactgt gggtcactgc tgtatggact catccaccag gggatgaaat gtgacacctg 541 catgatgaat gtgcacaagc gctgcgtgat gaatgttccc agcctgtgtg gcacggacca 601 cacggagcgc cgcggccgca tctacatcca ggcccacatc gacagggacg tcctcattgt 661 cctcgtaaga gatgctaaaa accttgtacc tatggacccc aatggcctgt cagatcccta 721 cgtaaaactg aaactgattc ccgatcccaa aagtgagagc aaacagaaga ccaaaaccat 781 caaatgctcc ctcaaccctg agtggaatga gacatttaga tttcagctga aagaatcgga 841 caaagacaga agactgtcag tagagatttg ggattgggat ttgaccagca ggaatgactt 901 catgggatct ttgtcctttg ggatttctga acttcagaag gccagtgttg atggctggtt 961 taagttactg agccaggagg aaggcgagta cttcaatgtg cctgtgccac cagaaggaag 1021 tgaggccaat gaagaactgc ggcagaaatt tgagagggcc aagatcagtc agggaaccaa 1081 ggtcccggaa gaaaagacga ccaacactgt ctccaaattt gacaacaatg gcaacagaga 1141 ccggatgaaa ctgaccgatt ttaacttcct aatggtgctg gggaaaggca gctttggcaa 1201 ggtcatgctt tcagaacgaa aaggcacaga tgagctctat gctgtgaaga tcctgaagaa 1261 ggacgttgtg atccaagatg atgacgtgga gtgcactatg gtggagaagc gggtgttggc 1321 cctgcctggg aagccgccct tcctgaccca gctccactcc tgcttccaga ccatggaccg 1381 cctgtacttt gtgatggagt acgtgaatgg gggcgacctc atgtatcaca tccagcaagt 1441 cggccggttc aaggagcccc atgctgtatt ttacgctgca gaaattgcca tcggtctgtt 1501 cttcttacag agtaagggca tcatttaccg tgacctaaaa cttgacaacg tgatgctcga 1561 ttctgaggga cacatcaaga ttgccgattt tggcatgtgt aaggaaaaca tctgggatgg 1621 ggtgacaacc aagacattct gtggcactcc agactacatc gcccccgaga taattgctta 1681 tcagccctat gggaagtccg tggattggtg ggcatttgga gtcctgctgt atgaaatgtt 1741 ggctgggcag gcaccctttg aaggggagga tgaagatgaa ctcttccaat ccatcatgga 1801 acacaacgta gcctatccca agtctatgtc caaggaagct gtggccatct gcaaagggct 1861 gatgaccaaa cacccaggca aacgtctggg ttgtggacct gaaggcgaac gtgatatcaa 1921 agagcatgca tttttccggt atattgattg ggagaaactt gaacgcaaag agatccagcc 1981 cccttataag ccaaaagcta gagacaagag agacacctcc aacttcgaca aagagttcac 2041 cagacagcct gtggaactga cccccactga taaactcttc atcatgaact tggaccaaaa 2101 tgaatttgct ggcttctctt atactaaccc agagtttgtc attaatgtgt aggtgaatgc 2161 aaactccatc gttgagcctg gggtgtaaga cttcaagcca agcgtatgta tcaattctag 2221 tcttccagga ttcacggtgc acatgctggc attcaacatg tggaaagctt gtcttagagg 2281 cctttcttgt atgtgtagct tgctagtttg ttttctacat ttgaaaatgt ttagtttaga 2341 ataagcgcat tatccaatta tagaggtaca attttccaaa cttccagaaa ctcatcaaat 2401 gaacagacaa tgtcaaaact actgtgtctg ataccaaaat gcttcagtat ttgtaatttt 2461 tcaagtcaga agctgatgtt cctggtaaaa gtttttacag ttattctata atatcttctt 2521 tgaatgctaa gcatgagcga tatttttaaa aattgtgagt aagcttcgga attc // LOCUS HSPKCE 2244 bp RNA PRI 25-JUL-1993 DEFINITION H.sapiens mRNA for protein kinase C-Epsilon. ACCESSION X65293 S46030 NID g35494 KEYWORDS protein kinase C epsilon. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2244) AUTHORS Burns,D.J. TITLE Direct Submission JOURNAL Submitted (18-MAR-1992) D.J. Burns, Sphinx Pharmaceutical, Dept of Molecular Biology, Two University Place, P.O.Box 52330, Durham, NC 27717, USA REFERENCE 2 (bases 1 to 2244) AUTHORS Basta,P., Strickland,M.B., Holmes,W., Loomis,C.R., Ballas,L.M. and Burns,D.J. TITLE Sequence and expression of human protein kinase C-epsilon JOURNAL Biochim. Biophys. Acta 1132 (2), 154-160 (1992) MEDLINE 93003318 FEATURES Location/Qualifiers source 1..2244 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 16..2229 /EC_number="2.7.1.37" /codon_start=1 /product="protein kinase C epsilon" /db_xref="PID:g35495" /db_xref="SWISS-PROT:Q02156" /translation="MVVFNGLLKIKICEAVSLKPTAWSLRHAVGPRPQTFLLDPYIAL NVDDSRIGQTATKQKTNSPAWHDEFVTDVCNGRKIELAVFHDAPIGYDDFVANCTIQF EELLQNGSRHFEDWIDLEPEGRVYVIIDLSGSSGEAPKDNEERVFRERMRPRKRQGAV RRRVHQVNGHKFMATYLRQPTYCSHCRDFIWGVIGKQGYQCQVCTCVVHKRCHELIIT KCAGLKKQETPDQVGSQRFSVNMPHKFGIHNYKVPTFCDHCGSLLWGLLRQGLQCKVC KMNVHRRCETNVAPNCGVDARGIAKVLADLGVTPDKITNSGQRRKKLIAGAESPQPAS GSSPSEEDRSKSAPTSPCDQEIKELENNIRKALSFDNRGEEHRAASSPDGQLMSPGEN GEVRQGQAKRLGLDEFNFIKVLGKGSFGKVMLAELKGKDEVYAVKVLKKDVILQDDDV DCTMTEKRILALARKHPYLTQLYCCFQTKDRLFFVMEYVNGGDLMFQIQRSRKFDEPR SRFYAAEVTSALMFLHQHGVIYRDLKLDNILLDAEGHCKLADFGMCKEGILNGVTTTT FCGTPDYIAPEILQELEYGPSVDWWALGVLMYEMMAGQPPFEADNEDDLFESILHDDV LYPVWLSKEAVSILKAFMTKNPHKRLGCVASQNGEDAIKQHPFFKEIDWVLLEQKKIK PPFKPRIKTKRDVNNFDQDFTREEPVLTLVDEAIVKQINQEEFKGFSYFGEDLMP" BASE COUNT 558 a 611 c 620 g 455 t ORIGIN 1 ctccccgccc cgaccatggt agtgttcaat ggccttctta agatcaaaat ctgcgaggcc 61 gtgagcttga agcccacagc ctggtcgctg cgccatgcgg tgggaccccg gccgcagact 121 ttccttctcg acccctacat tgccctcaat gtggacgact cgcgcatcgg ccaaacggcc 181 accaagcaga agaccaacag cccggcctgg cacgacgagt tcgtcaccga tgtgtgcaac 241 ggacgcaaga tcgagctggc tgtctttcac gatgccccca taggctacga cgacttcgtg 301 gccaactgca ccatccagtt tgaggagctg ctgcagaacg ggagccgcca cttcgaggac 361 tggattgatc tggagccaga aggaagagtg tatgtgatca tcgatctctc agggtcgtcg 421 ggtgaagccc ctaaagacaa tgaagagcgt gtgttcaggg aacgcatgcg gccgaggaag 481 cggcaggggg ccgtcaggcg cagggtccat caggtcaacg gccacaagtt catggccacc 541 tatcttcggc agcccaccta ctgctcccat tgcagagact tcatctgggg tgtcatagga 601 aagcagggat accagtgtca agtctgcacc tgcgtggtcc acaagcggtg ccacgagctc 661 ataatcacaa agtgtgctgg gttaaagaag caggagaccc ccgaccaggt gggctcccag 721 cggttcagcg tcaacatgcc ccacaagttc ggtatccaca actacaaggt ccctaccttc 781 tgcgatcact gtgggtccct gctctgggga ctcttgcggc agggtttgca gtgtaaagtc 841 tgcaaaatga atgttcaccg tcgatgtgag accaacgtgg ctcccaactg tggagtggat 901 gccagaggaa tcgccaaagt actggccgac ctgggcgtta ccccagacaa aatcaccaac 961 agcggccaga gaaggaaaaa gctcattgct ggtgccgagt ccccgcagcc tgcttctgga 1021 agctcaccat ctgaggaaga tcgatccaag tcagcaccca cctccccttg tgaccaggaa 1081 ataaaagaac ttgagaacaa cattcggaaa gccttgtcat ttgacaaccg aggagaggag 1141 caccgggcag catcgtctcc tgatggccag ctgatgagcc ccggtgagaa tggcgaagtc 1201 cggcaaggcc aggccaagcg cctgggcctg gatgagttca acttcatcaa ggtgttgggc 1261 aaaggcagct ttggcaaggt catgttggca gaactcaagg gcaaagatga agtatatgct 1321 gtgaaggtct taaagaagga cgtcatcctt caggatgatg acgtggactg cacaatgaca 1381 gagaagagga ttttggctct ggcacggaaa cacccgtacc ttacccaact ctactgctgc 1441 ttccagacca aggaccgcct ctttttcgtc atggaatatg taaatggtgg agacctcatg 1501 tttcagattc agcgctcccg aaaattcgac gagcctcgtt cacggttcta tgctgcagag 1561 gtcacatcgg ccctcatgtt cctccatcag catggagtca tctacaggga tttgaaactg 1621 gacaacatcc ttctggatgc agaaggtcac tgcaagctgg ctgacttcgg gatgtgcaag 1681 gaagggattc tgaatggtgt gacgaccacc acgttctgtg ggactcctga ctacatagct 1741 cctgagatcc tgcaggagtt ggagtatggc ccctccgtgg actggtgggc cctgggggtg 1801 ctgatgtacg agatgatggc tggacagcct ccctttgagg ccgacaatga ggacgaccta 1861 tttgagtcca tcctccatga cgacgtgctg tacccagtct ggctcagcaa ggaggctgtc 1921 agcatcttga aagctttcat gacgaagaat ccccacaagc gcctgggctg tgtggcatcg 1981 cagaatggcg aggacgccat caagcagcac ccattcttca aagagattga ctgggtgctc 2041 ctggagcaga agaagatcaa gccacccttc aaaccacgca ttaaaaccaa aagagacgtc 2101 aataattttg accaagactt tacccgggaa gagccggtac tcacccttgt ggacgaagca 2161 attgtaaagc agatcaacca ggaggaattc aaaggtttct cctactttgg tgaagacctg 2221 atgccctgag agcccactgc agtt // LOCUS HSPKCMU 3742 bp RNA PRI 14-APR-1994 DEFINITION H.sapiens mRNA for protein kinase C mu. ACCESSION X75756 NID g438372 KEYWORDS protein kinase C mu. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3742) AUTHORS Johannes,F.J., Prestle,J., Eis,S., Oberhagemann,P. and Pfizenmaier,K. TITLE PKCu is a novel, atypical member of the protein kinase C family JOURNAL J. Biol. Chem. 269 (8), 6140-6148 (1994) MEDLINE 94164979 REFERENCE 2 (bases 1 to 3742) AUTHORS Johannes,F. TITLE Direct Submission JOURNAL Submitted (05-NOV-1993) F. Johannes, Inst. of Cell Biology and Immunology, Allmandring 31, 70563 Stuttgart, FRG FEATURES Location/Qualifiers source 1..3742 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" CDS 236..2974 /codon_start=1 /product="protein kinase C mu" /db_xref="PID:g438373" /translation="MSAPPVLRPPSPLLPVAAAAAAAAAALVPGSGPGPAPFLAPVAA PVGGISFHLQIGLSREPVLLLQDSSGDYSLAHVREMACSIVDQKFPECGFYGMYDKIL LFRHDPTSENILQLVKAASDIQEGDLIEVVLSRSATFEDFQIRPHALFVHSYRAPAFC DHCGEMLWGLVRQGLKCEGCGLNYHKRCAFKIPNNCSGVRRRRLSNVSLTGVSTIRTS SAELSTSAPDEPLLQKSPSESFIGREKRSNSQSYIGRPIHLDKILMSKVKVPHTFVIH SYTRPTVCQYCKKLLKGLFRQGLQCKDCRFNCHKRCAPKVPNNCLGEVTINGDLLSPG AESDVVMEEGSDDNDSERNSGLMDDMEEAMVQDAEMAMAECQNDSGEMQDPDPDHEDA NRTISPSTSNNIPLMRVVQSVKHTKRKSSTVMKEGWMVHYTSKDTLRKRHYWRLDSKC ITLFQNDTGSRYYKEIPLSEILSLEPVKTSALIPNGANPHCFEITTANVVYYVGENVV NPSSPSPNNSVLTSGVGADVARMWEIAIQHALMPVIPKGSSVGTGTNLHRDISVSISV SNCQIQENVDISTVYQIFPDEVLGSGQFGIVYGGKHRKTGRDVAIKIIDKLRFPTKQE SQLRNEVAILQNLHHPGVVNLECMFETPERVFVVMEKLHGDMLEMILSSEKGRLPEHI TKFLITQILVALRHLHFKNIVHCDLKPENVLLASADPFPQVKLCDFGFARIIGEKSFR RSVVGTPAYLAPEVLRNKGYNRSLDMWSVGVIIYVSLSGTFPFNEDEDIHDQIQNAAF MYPPNPWKEISHEAIDLINNLLQVKMRKRYSVDKTLSHPWLQDYQTWLDLRELECKIG ERYITHESDDLRWEKYAGEQRLQYPTHLINPSASHSDTPETEETEMKALGERVSIL" BASE COUNT 1015 a 897 c 865 g 965 t ORIGIN 1 gaattccttc tctcctcctc ctcgcccttc tcctcgccct cctcctcctc ctcgccctcc 61 cctcccgatc ctcatcccct tgccctcccc cagcccaggg acttttccgg aaagttttta 121 ttttccgtct gggctctcgg agaaagaagc tcctggctca gcggctgcaa aactttcctg 181 ctgccgcgcc gccagccccc gccctccgct gcccggccct gcgccccgcc gagcgatgag 241 cgcccctccg gtcctgcggc cgcccagtcc gctgctgccc gtggcggcgg cagctgccgc 301 agcggccgcc gcactggtcc cagggtccgg gcccgggccc gcgccgttct tggctcctgt 361 cgcggccccg gtcgggggca tctcgttcca tctgcagatc ggcctgagcc gtgagccggt 421 gctgctgctg caggactcgt ccggggacta cagcctggcg cacgtccgcg agatggcttg 481 ctccattgtc gaccagaagt tccctgaatg tggtttctac ggaatgtatg ataagatcct 541 gctttttcgc catgacccta cctctgaaaa catccttcag ctggtgaaag cggccagtga 601 tatccaggaa ggcgatctta ttgaagtggt cttgtcacgt tccgccacct ttgaagactt 661 tcagattcgt ccccacgctc tctttgttca ttcatacaga gctccagctt tctgtgatca 721 ctgtggagaa atgctgtggg ggctggtacg tcaaggtctt aaatgtgaag ggtgtggtct 781 gaattaccat aagagatgtg catttaaaat acccaacaat tgcagcggtg tgaggcggag 841 aaggctctca aacgtttccc tcactggggt cagcaccatc cgcacatcat ctgctgaact 901 ctctacaagt gcccctgatg agccccttct gcaaaaatca ccatcagagt cgtttattgg 961 tcgagagaag aggtcaaatt ctcaatcata cattggacga ccaattcacc ttgacaagat 1021 tttgatgtct aaagttaaag tgccgcacac atttgtcatc cactcctaca cccggcccac 1081 agtgtgccag tactgcaaga agcttctgaa ggggcttttc aggcagggct tgcagtgcaa 1141 agattgcaga ttcaactgcc ataaacgttg tgcaccgaaa gtaccaaaca actgccttgg 1201 cgaagtgacc attaatggag atttgcttag ccctggggca gagtctgatg tggtcatgga 1261 agaagggagt gatgacaatg atagtgaaag gaacagtggg ctcatggatg atatggaaga 1321 agcaatggtc caagatgcag agatggcaat ggcagagtgc cagaacgaca gtggcgagat 1381 gcaagatcca gacccagacc acgaggacgc caacagaacc atcagtccat caacaagcaa 1441 caatatccca ctcatgaggg tagtgcagtc tgtcaaacac acgaagagga aaagcagcac 1501 agtcatgaaa gaaggatgga tggtccacta caccagcaag gacacgctgc ggaaacggca 1561 ctattggaga ttggatagca aatgtattac cctctttcag aatgacacag gaagcaggta 1621 ctacaaggaa attcctttat ctgaaatttt gtctctggaa ccagtaaaaa cttcagcttt 1681 aattcctaat ggggccaatc ctcattgttt cgaaatcact acggcaaatg tagtgtatta 1741 tgtgggagaa aatgtggtca atccttccag cccatcacca aataacagtg ttctcaccag 1801 tggcgttggt gcagatgtgg ccaggatgtg ggagatagcc atccagcatg cccttatgcc 1861 cgtcattccc aagggctcct ccgtgggtac aggaaccaac ttgcacagag atatctctgt 1921 gagtatttca gtatcaaatt gccagattca agaaaatgtg gacatcagca cagtatatca 1981 gatttttcct gatgaagtac tgggttctgg acagtttgga attgtttatg gaggaaaaca 2041 tcgtaaaaca ggaagagatg tagctattaa aatcattgac aaattacgat ttccaacaaa 2101 acaagaaagc cagcttcgta atgaggttgc aattctacag aaccttcatc accctggtgt 2161 tgtaaatttg gagtgtatgt ttgagacgcc tgaaagagtg tttgttgtta tggaaaaact 2221 ccatggagac atgctggaaa tgatcttgtc aagtgaaaag ggcaggttgc cagagcacat 2281 aacgaagttt ttaattactc agatactcgt ggctttgcgg caccttcatt ttaaaaatat 2341 cgttcactgt gacctcaaac cagaaaatgt gttgctagcc tcagctgatc cttttcctca 2401 ggtgaaactt tgtgattttg gttttgcccg gatcattgga gagaagtctt tccggaggtc 2461 agtggtgggt acccccgctt acctggctcc tgaggtccta aggaacaagg gctacaatcg 2521 ctctctagac atgtggtctg ttggggtcat catctatgta agcctaagcg gcacattccc 2581 atttaatgaa gatgaagaca tacacgacca aattcagaat gcagctttca tgtatccacc 2641 aaatccctgg aaggaaatat ctcatgaagc cattgatctt atcaacaatt tgctgcaagt 2701 aaaaatgaga aagcgctaca gtgtggataa gaccttgagc cacccttggc tacaggacta 2761 tcagacctgg ttagatttgc gagagctgga atgcaaaatc ggggagcgct acatcaccca 2821 tgaaagtgat gacctgaggt gggagaagta tgcaggcgag cagcggctgc agtaccccac 2881 acacctgatc aatccaagtg ctagccacag tgacactcct gagactgaag aaacagaaat 2941 gaaagccctc ggtgagcgtg tcagcatcct ctgagttcca tctcctataa tctgtcaaaa 3001 cactgtggaa ctaataaata catacggtca ggtttaacat ttgccttgca gaactgccat 3061 tattttctgt cagatgagaa caaagctgtt aaactgttag cactgttgat gtatctgagt 3121 tgccaagaca aatcaacaga agcatttgta ttttgtgtga ccaactgtgt tgtattaaca 3181 aaagttccct gaaacacgaa acttgttatt gtgaatgatt catgttatat ttaatgcatt 3241 aaacctgtct ccactgtgcc tttgcaaatc agtgtttttc ttactggagc ttcattttgg 3301 taagagacag aatgtatctg tgaagtagtt ctgtttggtg tgtcccattg gtgttgtcat 3361 tgtaaacaaa ctcttgaaga gtcgattatt tccagtgttc tatgaacaac tccaaaaccc 3421 atgtgggaaa aaaatgaatg aggagggtag ggaataaaat cctaagacac aaatgcatga 3481 acaagtttta atgtatagtt ttgaatcctt tgcctgcctg gtgtgcctca gtatatttaa 3541 actcaagaca atgcacctag ctgtgcaaga cctagtgctc ttaagcctaa atgccttaga 3601 aatgtaaact gccatatata acagatacat ttccctcttt cttataatac tctgttgtac 3661 tatggaaaat cagctgctca gcaacctttc acctttgtgt atttttcaat aataaaaaat 3721 attcttgtca aaaaaaaaaa aa // LOCUS HSPKCZ 2146 bp RNA PRI 27-SEP-1993 DEFINITION H.sapiens mRNA for protein kinase C zeta. ACCESSION Z15108 NID g35500 KEYWORDS protein kinase; protein kinase C; protein kinase C zeta. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2146) AUTHORS Hug,H.P. TITLE Direct Submission JOURNAL Submitted (07-SEP-1992) Hubert P. Hug, Deptartment of Molecular Biology, Osaka Bioscience, Institute, 6-2-4 Furuedai, Osaka, 565, Japan REFERENCE 2 (bases 1 to 2146) AUTHORS Kochs,G., Hummel,R., Meyer,D., Hug,H., Marme,D. and Sarre,T.F. TITLE Activation and substrate specificity of the human protein kinase C alpha and zeta isoenzymes JOURNAL Eur. J. Biochem. 216 (2), 597-606 (1993) MEDLINE 93387312 FEATURES Location/Qualifiers source 1..2146 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="2 years old" /tissue_type="Brain-Hippocampus" /clone_lib="LambdaZAPII vector (Stratagene, No.936205)" /clone="PKC z11" /sex="Female" CDS 7..1761 /codon_start=1 /product="protein kinase C zeta" /db_xref="PID:g35501" /db_xref="SWISS-PROT:Q05513" /translation="MEGSGGRVRLKAHYGGDIFITSVDAATTFEELCEEVRDMCRLHQ QHPLTLKWVDSEGDPCTVSSQMELEEAFRLARQCRDEGLIIHVFPSTPEQPGLPCPGE DKSIYRRGARRWRKLYRANGHLFQAKRFNRRAYCGQCSERIWGLARQGYRCINCKLLV HKRCHGLVPLTCRKHMDSVMPSQEPPVDDKNEDADLPSEETDGIAYISSSRKHDSIKD DSEDLKPVIDGMDGIKISQGLGLQDFDLIRVIGRGSYAKVLLVRLKKNDQIYAMKVVK KELVHDDEDIDWVQTEKHVFEQASSNPFLVGLHSCFQTTSRLFLVIEYVNGGDLMFHM QRQRKLPEEHARFYAAEICIALNFLHERGIIYRDLKLDNVLLDADGHIKLTDYGMCKE GLGPGDTTSTFCGTPNYIAPEILRGEEYGFSVDWWALGVLMFEMMAGRSPFDIITDNP DMNTEDYLFQVILEKPIRIPRFLSVKASHVLKGFLNKDPKERLGCRPQTGFSDIKSHA FFRSIDWDLLEKKQALPPFQPQITDDYGLDNFDTQFTSEPVQLTPDDEDAIKRIDQSE FEGFEYINPLLLSTEESV" BASE COUNT 506 a 602 c 633 g 405 t ORIGIN 1 cccaagatgg aagggagcgg cggccgcgtc cgcctcaagg cgcattacgg gggggacatc 61 ttcatcacca gcgtggacgc cgccacgacc ttcgaggagc tctgtgagga agtgagagac 121 atgtgtcgtc tgcaccagca gcacccgctc accctcaagt gggtggacag cgaaggtgac 181 ccttgcacgg tgtcctccca gatggagctg gaagaggctt tccgcctggc ccgtcagtgc 241 agggatgaag gcctcatcat tcatgttttc ccgagcaccc ctgagcagcc tggcctgcca 301 tgtccgggag aagacaaatc tatctaccgc cggggagcca gaagatggag gaagctgtac 361 cgtgccaacg gccacctctt ccaagccaag cgctttaaca ggagagcgta ctgcggtcag 421 tgcagcgaga ggatatgggg cctcgcgagg caaggctaca ggtgcatcaa ctgcaaactg 481 ctggtccata agcgctgcca cggcctcgtc ccgctgacct gcaggaagca tatggattct 541 gtcatgcctt cccaagagcc tccagtagac gacaagaacg aggacgccga ccttccttcc 601 gaggagacag atggaattgc ttacatttcc tcatcccgga agcatgacag cattaaagac 661 gactcggagg accttaagcc agttatcgat gggatggatg gaatcaaaat ctctcagggg 721 cttgggctgc aggactttga cctaatcaga gtcatcgggc gcgggagcta cgccaaggtt 781 ctcctggtgc ggttgaagaa gaatgaccaa atttacgcca tgaaagtggt gaagaaagag 841 ctggtgcatg atgacgagga tattgactgg gtacagacag agaagcacgt gtttgagcag 901 gcatccagca accccttcct ggtcggatta cactcctgct tccagacgac aagtcggttg 961 ttcctggtca ttgagtacgt caacggcggg gacctgatgt tccacatgca gaggcagagg 1021 aagctccctg aggagcacgc caggttctac gcggccgaga tctgcatcgc cctcaacttc 1081 ctgcacgaga gggggatcat ctacagggac ctgaagctgg acaacgtcct cctggatgcg 1141 gacgggcaca tcaagctcac agactacggc atgtgcaagg aaggcctggg ccctggtgac 1201 acaacgagca ctttctgcgg aaccccgaat tacatcgccc ccgaaatcct gcggggagag 1261 gagtacgggt tcagcgtgga ctggtgggcg ctgggagtcc tcatgtttga gatgatggcc 1321 gggcgctccc cgttcgacat catcaccgac aacccggaca tgaacacaga ggactacctt 1381 ttccaagtga tcctggagaa gcccatccgg atcccccggt tcctgtccgt caaagcctcc 1441 catgttttaa aaggattttt aaataaggac cccaaagaga ggctcggctg ccggccacag 1501 actggatttt ctgacatcaa gtcccacgcg ttcttccgca gcatagactg ggacttgctg 1561 gagaagaagc aggcgctccc tccattccag ccacagatca cagacgacta cggtctggac 1621 aactttgaca cacagttcac cagcgagccc gtgcagctga ccccagacga tgaggatgcc 1681 ataaagagga tcgaccagtc agagttcgaa ggctttgagt atatcaaccc attattgctg 1741 tccaccgagg agtcggtgtg aggccgcgtg cgtctctgtc gtggacacgc gtgattgacc 1801 ctttaactgt atccttaacc accgcatatg catgccaggc tgggcacggc tccgagggcg 1861 gccagggaca gacgcttgcg ccgagaccgc agagggaagc gtcagcgggc gctgctggga 1921 gcagaacagt ccctcacacc tggcccggca ggcagcttcg tgctggagga acttgctgct 1981 gtgcctgcgt cgcggcggat ccgcggggac cctgccgagg gggctgtcat gcggtttcca 2041 aggtgcacat tttccacgga aacagaactc gatgcactga cctgctccgc caggaaagtg 2101 agcgtgtagc gtcctgagga ataaaatgtt ccgatgaaaa aaaaaa // LOCUS HSPKFB 1741 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for 6-phosphofructo-2-kinase/fructose-2,6-bisphosphatase (EC 2.7.1.105, EC 3.1.3.46). ACCESSION X52638 NID g35502 KEYWORDS 6-phosphofructo-2-kinase; fructose-2,6-bisphosphatase; kinase; phosphatase; phosphofructokinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1741) AUTHORS Lange,A.J. TITLE Direct Submission JOURNAL Submitted (20-APR-1990) Lange A.J., SUNY at Stonybrook, Dept. of Physiology and Biophysics, Health Science Center, Basic Science Tower Level 6 Room 140, Stonybrook, New York 11794, USA REFERENCE 2 (bases 1 to 1741) AUTHORS Lange,A.J. and Pilkis,S.J. TITLE Sequence of human liver 6-phosphofructo-2-kinase/fructose-2,6-bisphosphatase JOURNAL Nucleic Acids Res. 18 (12), 3652 (1990) MEDLINE 90301497 COMMENT See for conflicting sequence. FEATURES Location/Qualifiers source 1..1741 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /clone_lib="lambda gt11" /clone="HL2K-1" CDS 80..1495 /note="6-phosphofructo-2-kinase/fructose-2,6- bisphosphatase (AA 1-471)" /codon_start=1 /db_xref="PID:g35503" /db_xref="SWISS-PROT:P16118" /translation="MSPEMGELTQTRLQKIWIPHSSGSSRLQRRRGSSIPQFTNSPTM VIMVGLPARGKTYISTKLTRYLNWIGTPTKVFNLGQYRREAVSYKNYEFFLPDNMEAL QIRKQCALAALKDVHNYLSHEEGHVAVFDATNTTRERRSLILQFAKEHGYKVFFIESI CNDPGIIAENIRQVKLGSPDYIDCDREKVLEDFLKRIECYEVNYQPLDEELDSHLSYI KIFDVGTRYMVNRVQDHIQSRTVYYLMNIHVTPRSIYLCRHGESELNIRGRIGGDSGL SVRGKQYAYALANFIQSQGISSLKVWTSRMKRTIQTAEALGVPYEQWKALNEIDAGVC EEMTYEEIQEHYPEEFALRDQDKYRYRYPKGESYEDLVQRLEPVIMELERQENVLVIC HQAVMRCLLAYFLDKSSDELPYLKCPLHTVLKLTPVAYGCKVESIYLNVEAVNTHREK PENVDITREPEEALDTVPAHY" misc_feature 474..476 /note="cca is caa in " misc_feature 879..881 /note="gag is ggg in " misc_feature 882..884 /note="gcc is gac in " misc_feature 992..994 /note="cgc is cac in " misc_feature 1266..1267 /note="gt is gtt in " misc_feature 1272..1274 /note="cgg is cg in " misc_feature 1479..1481 /note="tcc is tac in " misc_feature 1526..1527 /note="cc is ctc in " polyA_site 1741 /note="polyA signal" BASE COUNT 468 a 457 c 426 g 390 t ORIGIN 1 gaattccgga caggtagtaa gataggaagt gaggccaggt accttgtggg cagtgatgtc 61 attcggtgcg actcctaaga tgtctccaga gatgggagag ctcacccaaa ccaggttgca 121 gaagatctgg attccacaca gcagcggcag cagcaggctg caacggagaa ggggctcatc 181 cataccccag tttaccaatt cccccacaat ggtgatcatg gtgggtttac cagctcgagg 241 caagacctat atctccacaa agctcacacg atatctcaac tggataggaa caccaactaa 301 agtgtttaat ttaggccagt atcgacgaga ggcagtgagc tacaagaact atgaattctt 361 tcttccagac aacatggaag ccctgcaaat caggaagcag tgcgccctgg cagccctgaa 421 ggatgttcac aactatctca gccatgagga aggtcatgtt gcggtttttg atgccaccaa 481 cactaccaga gaacgacggt cactgatcct gcagtttgca aaagaacatg gttacaaggt 541 gtttttcatt gagtccattt gtaatgaccc tggcataatt gcagaaaaca tcaggcaagt 601 gaaacttggc agccctgatt atatagactg tgaccgggaa aaggttctgg aagactttct 661 aaagagaatt gagtgctatg aggtcaacta ccaacccttg gatgaggaac tggacagcca 721 cctgtcctac atcaagatct tcgacgtggg cacacgctac atggtgaacc gagtgcagga 781 tcacatccag agccgcacag tctactacct catgaatatc catgtcacac ctcgctccat 841 ctacctttgc cgacatggcg agagtgaact caacatcaga ggccgcatcg gaggtgactc 901 tggcctctca gttcgcggca agcagtatgc ctatgccctg gccaacttca ttcagtccca 961 gggcatcagc tccctgaagg tgtggaccag tcgcatgaag aggaccatcc agacagctga 1021 ggccctgggt gtcccctatg agcagtggaa ggccctgaat gagattgatg cgggtgtctg 1081 tgaggagatg acctatgaag aaatccagga acattaccct gaagaatttg cactgcgaga 1141 ccaagataaa tatcgctacc gctatcccaa gggagagtcc tatgaggatc tggttcagcg 1201 tctggagcca gtgataatgg agctagaacg acaggagaat gtactggtga tctgccacca 1261 ggctgtcatg cggtgcctcc tggcctattt cctggataaa agttcagatg agcttccata 1321 tctcaagtgc cctctgcaca cagtgctcaa actcactcct gtggcttatg gctgcaaagt 1381 ggaatccatc tacctgaatg tggaggccgt gaacacacac cgggagaagc ctgagaatgt 1441 ggacatcacc cgggaacctg aggaagccct ggatactgtc ccagcccact actgagccct 1501 ttccaagaag tcaaactgcc tgtgtcctca tcgccttcca cctttaggaa atgctatctt 1561 tgctcttctc ctactctgcc ttggcctcac tgaggcaccc cacttccagt gaagaagtcc 1621 tccgcaactc ccaaacaagc ctcgcttgct ggccgcaacc aaggagctat ctagctctgg 1681 aggaaacttt ctttcttaat tcctattctc tgacgaataa agacttactg cctacaagag 1741 g // LOCUS HSPKX1MR 6034 bp RNA PRI 05-JUN-1997 DEFINITION H.sapiens mRNA for protein kinase, PKX1. ACCESSION X85545 NID g1052736 KEYWORDS cAMP-dependent protein kinase; protein kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6034) AUTHORS Rappold,G.A. TITLE Direct Submission JOURNAL Submitted (10-MAR-1995) G.A. Rappold, Institut of human genetics, Heidelberg, Im Neuenheimer Feld 328, D- 69120 Heidelberg, FRG REFERENCE 2 (bases 1 to 6034) AUTHORS Klink,A., Schiebel,K., Winkelmann,M., Rao,E., Horsthemke,B., Ludecke,H.J., Claussen,U., Scherer,G. and Rappold,G. TITLE The human protein kinase gene PKX1 on Xp22.3 displays Xp/Yp homology and is a site of chromosomal instability JOURNAL Hum. Mol. Genet. 4 (5), 869-878 (1995) MEDLINE 95360006 FEATURES Location/Qualifiers source 1..6034 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /sex="female" /tissue_type="brain" /clone_lib="ICRFp507" /clone="B01166" /chromosome="X" /map="Xp22.3" gene 367..1443 /gene="PKX1" CDS 367..1443 /gene="PKX1" /codon_start=1 /product="protein kinase" /db_xref="PID:g1052737" /db_xref="SWISS-PROT:P51817" /translation="MEAPGLAQAAAAESDSRKVAEETPDGAPALCPSPEALSPEPPVY SLQDFDTLATVGTGTFGRVHLVKEKTAKHFFALKVMSIPDVIRLKQEQHVHNEKSVLK EVSHPFLIRLFWTWHDERFLYMLMEYVPGGELFSYLRNRGRFSSTTGLFYSAEIICAI EYLHSKEIVYRDLKPENILLDRDGHIKLTDFGFAKKLVDRTWTLCGTPEYLAPEVIQS KGHGRAVDWWALGILIFEMLSGFPPFFDDNPFGIYQKILAGKIDFPRHLDFHVKDLIK KLLVVDRTRRLGNMKNGANDVKHHRWFRSVDWEAVPQRKLKPPIVPKIAGDGDTSNFE TYPENDWDTAAPVPQKDLEIFKNF" repeat_region 1789..1828 /rpt_unit=1789..1790 repeat_region 2208..2527 /note="Alu type J" repeat_region 2598..2722 /note="Alu left monomer" repeat_region 3723..4025 /note="Alu type Sx" repeat_region 5340..5379 /rpt_unit=5340..5341 BASE COUNT 1619 a 1368 c 1493 g 1554 t ORIGIN 1 ccacgcgtcc ggagctggag gaggcggcgg gcgcgagacc cggaatgcgc agggcccccg 61 cctcgccccc cccagcccgg gccgcggccc ccgccttccc cgcagtcgtc ccgcactcgg 121 tgcccgcccc ccgaggccgg cggctgctcc cactcggggc cgttgctgct tgtgccgtga 181 gcgccgccca gccattgtcc ccgtcgctcc gtcagccgcg ccggaccgcg caccaggagg 241 cgagagcgcg catggggagc ctctgttgat gccgccgccg cgccgccctc cgaggctgcg 301 tcccgggaag cccggctccc cgagcgctcc ggcctggccc ggtgccccgg acctgagtgc 361 gtccccatgg aggcgcccgg gctggcccag gcggccgcgg cggagagcga ctcccgcaag 421 gtggcggagg agacccccga cggggcgccc gcgctctgcc ccagccctga ggcgctgtcg 481 ccggagccgc ctgtgtacag cctgcaggac tttgacacgc tggccaccgt gggcactggg 541 acgttcgggc gggtgcacct ggtgaaggag aagacagcca agcatttctt cgccctcaag 601 gtgatgagca ttcctgacgt catccgccta aagcaggagc aacacgtaca caatgagaag 661 tctgtcctga aggaagtcag ccacccgttc ctcatcaggc tgttctggac gtggcatgac 721 gagcgcttcc tctacatgct catggagtac gtgccgggcg gcgagctctt cagctacctg 781 cgcaaccggg ggcgcttctc cagcaccacg gggctcttct actctgcaga gatcatctgt 841 gccatcgagt acctgcactc caaagagatc gtctacaggg acttgaagcc agagaacatc 901 ctgctggata gggatggcca cattaagctc acggactttg ggttcgccaa gaagctggta 961 gacaggactt ggaccctctg tggaacaccc gagtacctag cccccgaagt cattcagagc 1021 aagggccacg gaagggccgt ggactggtgg gccctcggca tcctgatatt cgagatgctt 1081 tcggggtttc ctccgttttt tgatgacaac ccgtttggca tttatcagaa aattcttgca 1141 ggcaaaatag atttccccag acatttggat ttccatgtaa aagacctcat taagaaactg 1201 ctcgtggttg acagaacaag gcgattagga aacatgaaga acggggcgaa tgatgtgaag 1261 catcatcggt ggttccgctc cgtggactgg gaagctgttc cgcagagaaa actgaagcct 1321 cccatcgtgc ccaagatagc tggtgacggc gacacttcca acttcgaaac ttaccctgag 1381 aatgactggg acacagccgc gcccgtgccg cagaaggatt tagaaatctt caagaatttc 1441 tgaggacagg agctcacatc tggaagaaac aggaagattg gaatctgcct ggaacaaaga 1501 actgcaccta agcagaccag aagcaaaatg tcttcttcac ggcataagga catctccact 1561 tttctctgta cctgtgtgta tagaaataga ttagagcaca gttgaaattc atggaactgg 1621 cattatttaa gcaactggaa ttccacactg taggaaggtt ttgaaaattg tttggttgta 1681 gattttatct tatcctttag tgttgtgttc ctactgtgat gtcttggttt ttgtcataga 1741 cttaagttta taagtttgaa ctggacttgt tcgattataa ccacaaattg tgtgtgtgtg 1801 tgtgtgtgtg tgtgtgtgtg tgtgtatgcc tgtgtgtata tatagaagtc attatggcag 1861 atgcacagaa attgtgcagt gatgtaaatg ttcatacttt acagagccta taatttttat 1921 ttttcaattt gttttttcaa aaatctcttc tcggggacaa catctgaagg gtatgttgca 1981 tgcattaaaa aaaatcatct cacatgcatt ttatagtttt ggggaagaaa atatcatggg 2041 gaggtctacc ttcagtatct ttagttcttc ttaccgggta acttgagact ttaaaagaag 2101 aaacaaagag gggaagatat gggagcgaat ttattccaag aatctacaat gacattgaag 2161 ttgttggagg aatgtactgt atttaaaaaa accttctgtg acacattcaa aaatttcatc 2221 tgagctggat gcagtggctt gttcctatag tcccagcact ttgggaggct gaggtgggtg 2281 gattgcttga gcccaggagt tggagaccag tctgggaaac gtggtgagac ctcatctcta 2341 caaaatacaa aaaaattagc cgggcatggt ggcacgtgag ttaggtctca gctactcagg 2401 aggctgagat gagaggatca cttgagcctg gggaggtcca ggccgcagtg atccgagatc 2461 acaccactgc attccagcct gggtgacaga gtgagaccct gtccaaaaaa aaaaaaaaaa 2521 aaaagaaggc aatggctttg tgcttttgaa gatggattga agagaacgtc caccttaagg 2581 ctttaaaaga cagtgaagct gggtgcggtg gtgcactcct gtaaccctgg gactttggga 2641 agctgaggca ggaagattga gcctaggagt tcgagactga cctgggcagc atagcgagac 2701 cccatctcta tataaaaaaa aagatgttaa aacagtgaaa ttacagaaac agaacatagc 2761 cacccttttt catcagtgac cttttcttca cgtggaggtc aactcagtag gtgagaataa 2821 ttagtaggtt cagcagaata aactctttac aattaataac tcagtggaaa atgatgttca 2881 gatggtgaaa tgtggaaatg ttttagacaa ctctattttc agcctgtgct tctcactcac 2941 catcctgttg ggattaacag ttgagggcca tcgctgccta aacatttagc gtgcggtttc 3001 ccatcagttt taccgtgaat gtgaaaaggt gaaactggtg ctgacttcgg cagcaggtat 3061 actaaaattg gaatgacaca gaggagatta gcatggcccc ctgcgcgagg aagatctgca 3121 gattcatgca gcattccata tttttttaag aaaaggttac actgtgcacc gacatgtgga 3181 gctagaagga aaagctttta agtctcatgt ttgctgacta taaaaataaa tcatagtaat 3241 aatattgaac attctacttt aaaaattatt aataaaacca taaatgaaac acttgagcac 3301 actttaaaaa gtacataaat ccatgagtcc atatgcctat ttttaggttg tctgttttat 3361 tatccaggat ttcatactca gatgagaatt agcatcagga aagcctgaaa tgccttaatt 3421 gtgaactgat atgcaggaaa atgaaatatg caagcacatt ggtgaccttg ttctttacag 3481 acgtcgtgat cagcctccag ccatacagaa caaatgagaa attagaagcg attccaagaa 3541 cgtgtgatgg tgtgaaccca ctttggacca tcacacacat ttcagcattt gaagtgacac 3601 aagctacctg gtaactgccc atggcggggt catagtttaa atggcccaat gtcatgtaga 3661 ggttaattta atcaaaatga gctataaaca tggagcaatc actgtataat tttaaaatag 3721 tttacctggc caggcatggt cgtcacgcct gtaatcccag cactttggga ggccgaggca 3781 ggtggatcac ctgaggtcag gagttcgaga ccagcctggc caacatggca aaacccccgt 3841 ctctactaaa atacaaaaat taggtgtggt ggcgggcacc tgtaatcctg gctactcggg 3901 aggctgaggc aggagaatcg cttgaacccg ggaggcagag gctgcagtga gccgagatcg 3961 tgccactgca ctccagcctg ggccacagag caagactccg tctcaaaaca taaaaataaa 4021 aaaaatacaa atagctgacc cgggtgggca ttccaaaatg tccgcatgga cgatcacgaa 4081 acaatcacgg tgcgaaatgt ccgcatggat gatcacgaaa caatcacggt gcggaatgtc 4141 cgtatggatg atcatgacaa acaatcactg cgcaaaatgt ggaggaaaat tcattaccat 4201 aatcacaaca tttgtatcag aaaaacatta caaatttact ttgcttcatg taattttacg 4261 agttggcaat attttcttaa attggaatac tctaagagaa taggacatgg atatgttaca 4321 agaagaagtg tattggggca agtcttgttt ttctgaaggg aaatgccacc ggcctgcttt 4381 cataggcatt gataaagcca ggaaggaaat aaagtcacaa ccaaatagta tacactgtct 4441 ctactgaatt cttctaattc cttcgcaaca gaagtctaaa tcctgcttag cctaaacata 4501 tctatagcca gcgatgggca aattatattt gctgtcagca tgatagaata agaagcagaa 4561 accccccagc agtggggtaa gattgcgcgc ttcatcccct caaaatgcag ctcagcccca 4621 ctccctctcc atgcccatca aacaactgca caatcacttg ttactgaatt tcccattcag 4681 cgttttgttt ctttctgtcc atacttaaag accaaatcat ttgtttgctt tgaagaaaaa 4741 gcaaaaacaa gaacaaaaac accaaatcat ttgaaatcac tgtccattaa aaaaaatcac 4801 cagtggaact aaacagttga gtggtttggg atgtgttcgc catttctgtg tttgacctac 4861 atggcctgta attctttgga ccattgtaac tgtgaactat ggcttaatac tgttaaagac 4921 cccaaggctc cactctgtag ccaggtgcac ctcaggctgc acaaataccc caccagaatg 4981 tcacaatcat tcattgctcc agctactgag tgaacagggt gcaagacagc actggcttgg 5041 gttgtatttt tgcagccagt taactttgca tcaggtctac acagtccttg cagcctttga 5101 tttctccagt tgtagagatt tgaaatgggg tgcccgggtt gctctcgttt ttccttcaag 5161 ctacctagca tgcattttac acacattgga ggagtcgtat gcactgtttt tctgttgctg 5221 atggaggcaa gcaggccttc taattagtct aagttaaacg gatgggattc acttaagaaa 5281 agcaagaaga aaagcacacg aacaggtgtc ttaatgcatt tggatagttt ctttgcgggg 5341 tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtc ttggtgaagg tgaggggctg 5401 tattttcacc aagccttttc ataatcaagt ttgctttgca aaccttatgg ccttgcaccc 5461 cgcagaacag tcccttccta atagcaggga tgctctgtca tggtttttct gtaaattatc 5521 gtgtgttggg aagttctgtt gaggcttagt ttgatcttcc atggtggaca cgtctgttct 5581 gtatgtaaaa ggcattacaa ttgtgtttta gcagatgaga cttgaagcct ttccacagtc 5641 cttgtgctct gagatggctg ttgagctctg ctcaactgtc gtagctgaat tctttctttg 5701 cgctgaacac tgggcagcct caccatgttg ccaacgtgct tctggggccc ctgtcactgc 5761 cgctggatgc cgcgcacaca ggcagagtgc cttggcaggt ttgtgcaccc cttggcgagc 5821 cagaactggg aaccccgcgg ggtgacccct tctcatgggg ggtcggggag gggagtgcct 5881 atttttaatc ctgtctgttt gttgcacaat ggaaatcact gtgatttgta catatgccct 5941 aggaaaattt tactgctgtc taatttatgt aataatactg ttgattccag gtttatttaa 6001 taaaactttg tatcttttca gaaaaaaaaa aaaa // LOCUS HSPLE 2745 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for pleckstrin (P47). ACCESSION X07743 NID g35517 KEYWORDS P47 protein; pleckstrin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2745) AUTHORS Harley,C.B. TITLE Direct Submission JOURNAL Submitted (24-MAY-1988) Harley C.B., McMaster University, Dept. of Biochemistry, Health Sciences Center, 1200 Main St. W., Hamilton, Ontario L8N 3Z5 REFERENCE 2 (bases 1 to 2745) AUTHORS Tyers,M., Rachubinski,R.A., Stewart,M.I., Varrichio,A.M., Shorr,R.G., Haslam,R.J. and Harley,C.B. TITLE Molecular cloning and expression of the major protein kinase C substrate of platelets JOURNAL Nature 333 (6172), 470-473 (1988) MEDLINE 88232910 COMMENT Data kindly reviewed (23-JUN-1988) by Harvey C.B. FEATURES Location/Qualifiers source 1..2745 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 61..1113 /note="pleckstrin (AA 1-350)" /codon_start=1 /db_xref="PID:g35518" /db_xref="SWISS-PROT:P08567" /translation="MEPKRIREGYLVKKGSVFNTWKPMWVVLLEDGIEFYKKKSDNSP KGMIPLKGSTLTSPCQDFGKRMFVFKITTTKQQDHFFQAAFLEERDAWVRDINKAIKC IEGGQKFARKSTRRSIRLPETIDLGALYLSMKDTEKGIKELNLEKDKKIFNHCFTGNC VIDWLVSNQSVRNRQEGLMIASSLLNEGYLQPAGDMSKSAVDGTAENPFLDNPDAFYY FPDSGFFCEENSSDDDVILKEEFRGVIIKQGCLLKQGHRRKNWKVRKFILREDPAYLH YYDPAGAEDPLGAIHLRGCVVTSVESNSNGRKSEEENLFEIITADEVHYFLQAATPKE RTEWIKAIQMASRTGK" variation 334 /note="u is c in variant" variation 1434..1465 /note="multiple base changes" misc_feature 2722..2727 /note="pot. polyA signal" BASE COUNT 746 a 613 c 654 g 732 t ORIGIN 1 ggcccagctg ctgagaggag ttgcctgaga gtgacctttg catctgcctg tccagccagc 61 atggaaccaa agcggatcag agagggctac cttgtgaaga aggggagcgt gttcaatacg 121 tggaaaccca tgtgggttgt attgttagaa gatggaattg aattctataa gaagaaaagt 181 gacaacagcc ccaaaggaat gatcccgctg aaagggagca ctctgactag cccttgtcaa 241 gactttggca aaaggatgtt tgtgtttaag atcactacga ccaaacagca ggaccacttc 301 ttccaggcag ccttcctgga ggagagagat gcctgggttc gggatatcaa taaggccatt 361 aaatgcattg aaggaggcca gaaatttgcc aggaaatcta ccaggaggtc cattcgactg 421 ccagaaacca ttgacttagg tgccttatat ttgtccatga aagacactga aaaaggaata 481 aaagaactga atctagagaa ggacaagaag atttttaatc actgcttcac aggtaactgc 541 gtcattgatt ggctggtatc caaccagtct gttaggaatc gccaggaagg cctcatgatt 601 gcttcatcgc tgctcaatga ggggtatctg cagcctgctg gagacatgtc caagagtgca 661 gtggatggaa ctgctgaaaa ccctttcctg gacaaccctg atgccttcta ctactttcca 721 gacagtgggt tcttctgtga agagaattcc agtgatgatg atgtgattct gaaagaagaa 781 ttcagagggg tcattatcaa gcagggatgt ttactgaagc aggggcatag aaggaaaaac 841 tggaaagtga ggaagttcat cttgagagaa gaccctgcct acctgcacta ctatgaccct 901 gctggggcag aagatcccct gggagcaatt cacttgagag gctgtgtggt gacttcagtg 961 gagagcaact caaatggcag gaagagtgag gaagagaacc tttttgagat catcacagca 1021 gatgaagtgc actatttctt gcaagcagcc acccccaagg agcgcacaga gtggatcaaa 1081 gccatccaga tggcctcccg aactgggaag taaagagact cctgcattcc tcctcccctc 1141 ctgagggaag cccatggaca agctcagtcc aggacctgtc cacttctgtg acaaatcaac 1201 gggaaacagc ccaggggtgg gaagttttca tttgcagggg ggtctgaatg taactcacca 1261 tgtggtgtgc aaggttcccc tgcattgtat tgctcactgc agcccctctg cccctatcca 1321 tgacccccaa gcagatataa caagctgtgc agcctcagta ggctgcttgc cctctccagc 1381 ctcagggcct cttctggaaa atgaagaaat tcaactagta gattcctgag gtccccctag 1441 cttaaaaaaa aaaaaatctg ccccatgatt ctaacactcg cagtagtgat agtgtatcta 1501 gttgttctgc tggtgtcctt ccttggctaa gtcttggcct tcagttatct tcaaatgtac 1561 cagaacctga gccaacgcct ccctgtgaaa ctgttgctga tctgtagtac agtaccagga 1621 agaaacctct tttgttctct ttagacatct tctacttgct cttggccttg agatcgtgta 1681 acaaaatgaa ggagggctct cttctttctt cctcatccta ctcaaaaact tcccgagagc 1741 agtggtggtt ttgagggttt tgacttctat tacttttggc agcctggaaa gttgtgtctt 1801 ctgggaaaga gacccgggga ggccaggagt agctgagggt cctttctgtg cccttaaacc 1861 gcccagagga gccctattcc actctggttt taggctgatc tgagagggtc tccctttgtt 1921 cctttctgga gcatttctct aacgtttatt acaattagga gggggacccc acatctgtga 1981 gattctgttt catttgaggt ttacagaaaa aaaaaagtgg ccagatgtgt tccccccatg 2041 ggtgagaggc ctgggcaact gcctggtgaa tgtgtcttgc ggcagctgca gcaagtggag 2101 gggctgaact actggccagc tcactggatg atgggttaat acaacaactg cactgtaagg 2161 actcagagcc acacagaact tctgagaggg gctgttagca ttgcgcagca tcttcagttc 2221 tccagtaaat gatattgcgt tcgtgcctca gctttaagca caagtagcag cagctcctgc 2281 ttgagttctg agggcatcat ggccctatga ttaaccagag tgatctaacc tagactaaaa 2341 ttgggaactt atttgcaatt tttgaccctg accactaact agtgattctt ctccaaaatt 2401 gagaaagaca gcacccattg aaacagatat gtgtgtgaaa gtatattttt caattccaga 2461 tttttaattt taaggctcca ggaaagaaag gagagtagaa catttttcct cattttatca 2521 aatcctctct tgccctccct caattcccct gtaacattcc tgaagctgtt cccactccca 2581 gatggtttta tcaatagcct agaggtaaag aactgtcttt ttctctgatt ctttaataaa 2641 ttatctttat agaatatgca caagtttttc tacactcagt gttaaagtat ttattaatgg 2701 gaagtcaact taatgttttg aaataaatat atgactctgt ttaat // LOCUS HSPLGF 1645 bp RNA PRI 12-NOV-1991 DEFINITION H.sapiens mRNA for placenta growth factor (PlGF). ACCESSION X54936 NID g35521 KEYWORDS placenta growth factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1645) AUTHORS Persico,M.G. TITLE Direct Submission JOURNAL Submitted (22-OCT-1990) Persico M.G., I.I.G.B., CNR, Via Marconi 12, 80125 Napoli, ITALY REFERENCE 2 (bases 1 to 1645) AUTHORS Maglione,D., Guerriero,V., Viglietto,G., Delli-Bovi,P. and Persico,M.G. TITLE Isolation of a human placenta cDNA coding for a protein related to the vascular permeability factor JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88 (20), 9267-9271 (1991) MEDLINE 92021031 FEATURES Location/Qualifiers source 1..1645 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /chromosome="14" stem_loop 245..310 CDS 322..771 /codon_start=1 /product="placenta growth factor (PlGF)" /db_xref="PID:g35522" /translation="MPVMRLFPCFLQLLAGLALPAVPPQQWALSAGNGSSEVEVVPFQ EVWGRSYCRALERLVDVVSEYPSEVEHMFSPSCVSLLRCTGCCGDENLHCVPVETANV TMQLLKIRSGDRPSYVELTFSQHVRCECRPLREKMKPERCGDAVPRR" polyA_signal 1620..1625 BASE COUNT 294 a 521 c 517 g 313 t ORIGIN 1 gggattcggg ccgcccagct acgggaggac ctggagtggc actgggcgcc cgacggacca 61 tccccgggac ccgcctgccc ctcggcgccc cgccccgccg ggccgctccc cgtcgggttc 121 cccagccaca gccttaccta cgggctcctg actccgcaag gcttccagaa gatgctcgaa 181 ccaccggccg gggcctcggg gcagcagtga gggaggcgtc cagcccccca ctcagctctt 241 ctcctcctgt gccaggggct ccccggggga tgagcatggt ggttttccct cggagccccc 301 tggctcggga cgtctgagaa gatgccggtc atgaggctgt tcccttgctt cctgcagctc 361 ctggccgggc tggcgctgcc tgctgtgccc ccccagcagt gggccttgtc tgctgggaac 421 ggctcgtcag aggtggaagt ggtacccttc caggaagtgt ggggccgcag ctactgccgg 481 gcgctggaga ggctggtgga cgtcgtgtcc gagtacccca gcgaggtgga gcacatgttc 541 agcccatcct gtgtctccct gctgcgctgc accggctgct gcggcgatga gaatctgcac 601 tgtgtgccgg tggagacggc caatgtcacc atgcagctcc taaagatccg ttctggggac 661 cggccctcct acgtggagct gacgttctct cagcacgttc gctgcgaatg ccggcctctg 721 cgggagaaga tgaagccgga aaggtgcggc gatgctgttc cccggaggta acccacccct 781 tggaggagag agaccccgca cccggctcgt gtatttatta ccgtcacact cttcagtgac 841 tcctgctggt acctgccctc tatttattag ccaactgttt ccctgctgaa tgcctcgctc 901 ccttcaagac gaggggcagg gaaggacagg accctcagga attcagtgcc ttcaacaacg 961 tgagagaaag agagaagcca gccacagacc cctgggagct tccgctttga aagaagcaag 1021 acacgtggcc tcgtgagggg caagctaggc cccagaggcc ctggaggtct ccaggggcct 1081 gcagaaggaa agaagggggc cctgctacct gttcttgggc ctcaggctct gcacagacaa 1141 gcagcccttg ctttcggagc tcctgtccaa agtagggatg cggattctgc tggggccgcc 1201 acggcctggt ggtgggaagg ccggcagcgg gcggagggga ttcagccact tccccctctt 1261 cttctgaaga tcagaacatt cagctctgga gaacagtggt tgcctggggg cttttgccac 1321 tccttgtccc ccgtgatctc ccctcacact ttgccatttg cttgtactgg gacattgttc 1381 tttccggccg aggtgccacc accctgcccc cactaagaga cacatacaga gtgggccccg 1441 ggctggagaa agagctgcct ggatgagaaa cagctcagcc agtggggatg aggtcaccag 1501 gggaggagcc tgtgcgtccc agctgaaggc agtggcaggg gagcaggttc cccaagggcc 1561 ctggcacccc cacaagctgt ccctgcaggg ccatctgact gccaagccag attctcttga 1621 ataaagtatt ctagtgtgga aacgc // LOCUS HSPLGLN 3490 bp RNA PRI 14-DEC-1995 DEFINITION H.sapiens mRNA for plakoglobin. ACCESSION Z68228 NID g1122888 KEYWORDS plakoglobin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3490) AUTHORS Franke,W.W., Goldschmidt,M.D., Zimbelmann,R., Mueller,H.M., Schiller,D.L. and Cowin,P. TITLE Molecular cloning and amino acid sequence of human plakoglobin, the common junctional plaque protein JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86 (11), 4027-4031 (1989) MEDLINE 89264555 REFERENCE 2 (bases 1 to 3490) AUTHORS Zimbelmann,R. TITLE Direct Submission JOURNAL Submitted (14-DEC-1995) Zimbelmann R., German Cancer Research Center, Division for Cell Biology, Im Neuenheimer Feld 280, D-69120 Heidelberg, Federal Republik of Germany FEATURES Location/Qualifiers source 1..3490 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HPG Ca 5.1" /dev_stage="adult" CDS 120..2357 /codon_start=1 /product="plakoglobin" /db_xref="PID:e214034" /db_xref="PID:g1122889" /translation="MEVMNLMEQPIKVTEWQQTYTYDSGIHSGANTCVPSVSSKGIME EDEACGRQYTLKKTTTYTQGVPPSQGDLEYQMSTTARAKRVREAMCPGVSGEDSSLLL ATQVEGQATNLQRLAEPSQLLKSAIVHLINYQDDAELATRALPELTKLLNDEDPVVVT KAAMIVNQLSKKEASRRALMGSPQLVAAVVRTMQNTSDLDTARCTTSILHNLSHHREG LLAIFKSGGIPALVRMLSSPVESVLFYAITTLHNLLLYQEGAKMAVRLADGLQKMVPL LNKNNPKFLAITTDCLQLLAYGNQESKLIILANGGPQALVQIMRNYSYEKLLWTTSRV LKVLSVCPSNKPAIVEAGGMQALGKHLTSNSPRLVQNCLWTLRNLSDVATKQEGLESV LKILVNQLSVDDVNVLTCATGTLSNLTCNNSKNKTLVTQNSGVEALIHAILRAGDKDD ITEPAVCALRHLTSRHPEAEMAQNSVRLNYGIPAIVKLLNQPNQWPLVKATIGLIRNL ALCPANHAPLQEAAVIPRLVQLLVKAHQDAQRHVAAGTQQPYTDGVRMEEIVEGCTGA LHILARDPMNRMEIFRLNTIPLFVQLLYSSVENIQRVAAGVLCELAQDKEAADAIDAE GASAPLMELLHSRNEGTATYAAAVLFRISEDKNPDYRKRVSVELTNSLFKHDPAAWEA AQSMIPINEPYGDDMDATYRPMYSSDVPLDPLEMHMDMDGDYPIDTYSDGLRPPYPTA DHMLA" polyA_signal 3475..3480 BASE COUNT 672 a 1172 c 979 g 667 t ORIGIN 1 cgccagagtc cggagcagcc gccgcccgac cgcgccgagc tcagttcgct gtccgcgccg 61 gctcccaccc cggcccgacc ccgacccggc ccggtcaggc cccatactca gtagccacga 121 tggaggtgat gaacctgatg gagcagccta tcaaggtgac tgagtggcag cagacataca 181 cctacgactc gggtatccac tcgggcgcca acacctgcgt gccctccgtc agcagcaagg 241 gcatcatgga ggaggatgag gcctgcgggc gccagtacac gctcaagaaa accaccactt 301 acacccaggg ggtgcccccc agccaaggtg acctggagta ccagatgtcc acaacagcca 361 gggccaaacg ggtgcgggag gccatgtgcc ctggtgtgtc aggcgaggac agctcgcttc 421 tgctggccac ccaggtggag gggcaggcca ccaacctgca gcgactggcc gagccgtccc 481 agctgctcaa gtcggccatt gtgcatctca tcaactacca ggacgatgcc gagctggcca 541 ctcgcgccct gcccgagctc accaaactgc tcaacgacga ggacccggtg gtggtgacca 601 aggcggccat gattgtgaac cagctgtcga agaaggaggc gtcgcggcgg gccctgatgg 661 gctcgcccca gctggtggcc gctgtcgtgc gtaccatgca gaataccagc gacctggaca 721 cagcccgctg caccaccagc atcctgcaca acctctccca ccaccgggag gggctgctcg 781 ccatcttcaa gtcgggtggc atccctgctc tggtccgcat gctcagctcc cctgtggagt 841 cggtcctgtt ctatgccatc accacgctgc acaacctgct cctgtaccag gagggcgcca 901 agatggccgt gcgcctggcc gacgggctgc aaaagatggt gcccctgctc aacaagaaca 961 accccaagtt cctggccatc accaccgact gcctgcagct cctggcctac ggcaaccagg 1021 agagcaagct gatcatcctg gccaatggtg ggccccaggc cctcgtgcag atcatgcgta 1081 actacagtta tgaaaagctg ctctggacca ccagtcgtgt gctcaaggtg ctatccgtgt 1141 gtcccagcaa taagcctgcc attgtggagg ctggtgggat gcaggccctg ggcaagcacc 1201 tgaccagcaa cagcccccgc ctggtgcaga actgcctgtg gaccctgcgc aacctctcag 1261 atgtggccac caagcaggag ggcctggaga gtgtgctgaa gattctggtg aatcagctga 1321 gtgtggatga cgtcaacgtc ctcacctgtg ccacgggcac actctccaac ctgacatgca 1381 acaacagcaa gaacaagacg ctggtgacac agaacagcgg tgtggaggct ctcatccatg 1441 ccatcctgcg tgctggtgac aaggacgaca tcacggagcc tgccgtctgc gctctgcgcc 1501 acctcactag ccgccaccct gaggccgaga tggcccagaa ctctgtgcgt ctcaactatg 1561 gcatcccagc catcgtgaag ctgctcaacc agcccaacca gtggccactg gtcaaggcaa 1621 ccatcggctt gatcaggaat ctggccctgt gcccagccaa ccatgccccg ctgcaggagg 1681 cagcggtcat cccccgcctc gtccaactgc tggtgaaggc ccaccaggat gcccagcgcc 1741 acgtagctgc aggcacacag cagccctaca cggatggtgt gaggatggag gagattgtgg 1801 agggctgcac cggagcactg cacatcctcg cccgggaccc catgaaccgc atggagatct 1861 tccggctcaa caccattccc ctgtttgtgc agctcctgta ctcgtcggtg gagaacatcc 1921 agcgcgtggc tgccggggtg ctgtgtgagc tggcccagga caaggaggcg gccgacgcca 1981 ttgatgcaga gggggcctcg gccccactca tggagttgct gcactcccgc aacgagggca 2041 ctgccaccta cgctgctgcc gtcctgttcc gcatctccga ggacaagaac ccagactacc 2101 ggaagcgcgt gtccgtggag ctcaccaact ccctcttcaa gcatgacccg gctgcctggg 2161 aggctgccca gagcatgatt cccatcaatg agccctatgg agatgacatg gatgccacct 2221 accgccccat gtactccagc gatgtgcccc ttgacccgct ggagatgcac atggacatgg 2281 atggagacta ccccatcgac acctacagcg acggcctcag gcccccgtac cccactgcag 2341 accacatgct ggcctaggcg gcctggcccc agtgacggcc ccctctttgc aggcttttcc 2401 tcctctctag aacctccttc tgttggaggc cctcccatct ccccgctgaa acctgcgctc 2461 cttttttggg gggatccttt gctgctgagc ttccccaagc acggtgtgcc ctggcctgcc 2521 ttcttcttgt gtctttggtg gggatgggga ggcctattcc tgctggcccc ttctgggggt 2581 ggtgggcagg tgacacggag tggcttgagc ttctggggat gcaggtccac cgagcccctg 2641 acccctgtct gtccccgctc ccctaacagg tgcggttcct catctgagag gctctccgtg 2701 caggcgatgg ggcaagacag aaaagtgcct gagctgggga agccggggtg taacttcctg 2761 ctgcaccctg cgcctccaga ggtcctccgt agggtctttc ttgggatagt gttctgctcc 2821 tgcttttctg tcctgggcat gggtccaggg cctgacaccc cctccccgcc cctgtggccc 2881 tggccactaa agcttcagac tcaagtaccc attctgtttt cccccagcaa cgcccctcca 2941 aacctccagc ctccctgtct ccagctgcct gggcccggaa gggctttggt tccttctctg 3001 ggtctgattt tctcactgaa ctccaccgac caactgccct aagcccccag ggcctccagg 3061 gcccaggttc gagacccaaa cccccaaaat ccaaaacttc tcttgaaaag ttcagggacc 3121 gtccagggga gatggggagg agatatggag tgagtcacct gctccagaag atgccagctt 3181 ctctctccag ggtgcttagt tggctttgcc cacccctcac tccccaggga gctccgggga 3241 cagcttcctc acacccctgt cccacccaca cagctgccct agctgacccc gagaagtgct 3301 cttggctgac ccctctggtg tgtggtgagg ggctttctct tccccttcct gtttcagacc 3361 cccccatttc ccgcacatgg tgtggggggc tgggggaggt ccaagcagag tgttttatta 3421 ttatcgcttt atgtttttgg ttattggttt ttttgtatag accaaagcaa agaaaataaa 3481 aataacacag // LOCUS HSPLK1 2137 bp RNA PRI 03-AUG-1994 DEFINITION H.sapiens plk-1 mRNA. ACCESSION X73458 NID g312997 KEYWORDS plk-1 gene; protein serine/threonine kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2137) AUTHORS Golsteyn,R.M., Schultz,S.J., Bartek,J., Ziemiecki,A., Ried,T. and Nigg,E.A. TITLE Cell cycle analysis and chromosomal localization of human Plk1, a putative homologue of the mitotic kinases Drosophila polo and Saccharomyces cerevisiae Cdc5 JOURNAL J. Cell. Sci. 107 (Pt 6), 1509-1517 (1994) MEDLINE 95051109 REFERENCE 2 (bases 1 to 2137) AUTHORS Nigg,E.A. TITLE Direct Submission JOURNAL Submitted (21-JUN-1993) E.A. Nigg, Swiss Institute for Experimental Cancer Research (ISREC), Chemin des Boveresses 155, 1066 Epalinges, SWITZERLAND FEATURES Location/Qualifiers source 1..2137 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="nasopharyngeal carcinoma" /clone_lib="lambda gt10" /chromosome="16p12" gene 78..1889 /gene="plk-1" CDS 78..1889 /gene="plk-1" /codon_start=1 /product="protein kinase" /db_xref="PID:g312998" /translation="MSAAVTAGKLARAPADPGKAGVPGVAAPGAPAAAPPAKEIPEVL VDPRSRRRYVRGRFLGKGGFAKCFEISDADTKEVFAGKIVPKSLLLKPHQREKMSMEI SIHRSLAHQHVVGFHGFFEDNDFVFVVLELCRRRSLLELHKRRKALTEPEARYYLRQI VLGCQYLHRNRVIHRDLKLGNLFLNEDLEVKIGDFGLATKVEYDGERKKTLCGTPNYI APEVLSKKGHSFEVDVWSIGCIMYTLLVGKPPFETSCLKETYLRIKKNEYSIPKHINP VAASLIQKMLQTDPTARPTINELLNDEFFTSGYIPARLPITCLTIPPRFSIAPSSLDP SNRKPLTVLNKGLENPLPERPREKEEPVVRETGEVVDCHLSDMLQQLHSVNASKPSER GLVRQEEAEDPACIPIFWVSKWVDYSDKYGLGYQLCDNSVGVLFNDSTRLILYNDGDS LQYIERDGTESYLTVSSHPNSLMKKITLLKYFRNYMSEHLLKAGANITPREGDELARL PYLRTWFRTRSAIILHLSNGSVQINFFQDHTKLILCPLMAAVTYIDEKRDFRTYRLSL LEEYGCCKELASRLRYARTMVDKLLSSRSASNRLKAS" BASE COUNT 458 a 641 c 595 g 443 t ORIGIN 1 ctcgagagtt gccggggagg agcggagcgg tgcggaggct ctgctcggat cgaggtctgc 61 agcgcagctt cgggagcatg agtgctgcag tgactgcagg gaagctggca cgggcaccgg 121 ccgaccctgg gaaagccggg gtccccggag ttgcagctcc cggagctccg gcggcggctc 181 caccggcgaa agagatcccg gaggtcctag tggacccacg cagccggcgg cgctatgtgc 241 ggggccgctt tttgggcaag ggcggctttg ccaagtgctt cgagatctcg gacgcggaca 301 ccaaggaggt gttcgcgggc aagattgtgc ctaagtctct gctgctcaag ccgcaccaga 361 gggagaagat gtccatggaa atatccattc accgcagcct cgcccaccag cacgtcgtag 421 gattccacgg ctttttcgag gacaacgact tcgtgttcgt ggtgttggag ctctgccgcc 481 ggaggtctct cctggagctg cacaagagga ggaaagccct gactgagcct gaggcccgat 541 actacctacg gcaaattgtg cttggctgcc agtacctgca ccgaaaccga gttattcatc 601 gagacctcaa gctgggcaac cttttcctga atgaagatct ggaggtgaaa ataggggatt 661 ttggactggc aaccaaagtc gaatatgacg gggagaggaa gaagaccctg tgtgggactc 721 ctaattacat agctcccgag gtgctgagca agaaagggca cagtttcgag gtggatgtgt 781 ggtccattgg gtgtatcatg tataccttgt tagtgggcaa accacctttt gagacttctt 841 gcctaaaaga gacctacctc cggatcaaga agaatgaata cagtattccc aagcacatca 901 accccgtggc cgcctccctc atccagaaga tgcttcagac agatcccact gcccgcccaa 961 ccattaacga gctgcttaat gacgagttct ttacttctgg ctatatccct gcccgtctcc 1021 ccatcacctg cctgaccatt ccaccaaggt tttcgattgc tcccagcagc ctggacccca 1081 gcaaccggaa gcccctcaca gtcctcaata aaggcttgga gaaccccctg cctgagcgtc 1141 cccgggaaaa agaagaacca gtggttcgag agacaggtga ggtggtcgac tgccacctca 1201 gtgacatgct gcagcagctg cacagtgtca atgcctccaa gccctcggag cgtgggctgg 1261 tcaggcaaga ggaggctgag gatcctgcct gcatccccat cttctgggtc agcaagtggg 1321 tggactattc ggacaagtac ggccttgggt atcagctctg tgataacagc gtgggggtgc 1381 tcttcaatga ctcaacacgc ctcatcctct acaatgatgg tgacagcctg cagtacatag 1441 agcgtgacgg cactgagtcc tacctcaccg tgagttccca tcccaactcc ttgatgaaga 1501 agatcaccct ccttaaatat ttccgcaatt acatgagcga gcacttgctg aaggcaggtg 1561 ccaacatcac gccgcgcgaa ggtgatgagc tcgcccggct gccctaccta cggacctggt 1621 tccgcacccg cagcgccatc atcctgcacc tcagcaacgg cagcgtgcag atcaacttct 1681 tccaggatca caccaagctc atcttgtgcc cactgatggc agccgtgacc tacatcgacg 1741 agaagcggga cttccgcaca taccgcctga gtctcctgga ggagtacggc tgctgcaagg 1801 agctggccag ccggctccgc tacgcccgca ctatggtgga caagctgctg agctcacgct 1861 cggccagcaa ccgtctcaag gcctcctaat agctgccctc ccctccggac tggtgccctc 1921 ctcactccca cctgcatctg gggcccatac tggttggctc ccgcggtgcc atgtctgcag 1981 tgtgcccccc agccccggtg gctgggcaga gctgcatcat ccttgcaggt gggggttgct 2041 gtataagtta tttttgtaca tgttcgggtg tgggttctac agccttgtcc ccctccccct 2101 caaccccacc atatgaattg tacagaatat ttctatt // LOCUS HSPLKPHL 2680 bp RNA PRI 23-SEP-1994 DEFINITION H.sapiens mRNA for plakophilin (partial). ACCESSION Z34974 NID g550114 KEYWORDS plakophilin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2680) AUTHORS Schmidt,A., Hans,H.W., Schaefer,S., Nuber,U.A., Zimbelmann,R. and Franke,W.W. TITLE Cell type-specific desmosomal components and epithelial differentiation JOURNAL Unpublished REFERENCE 2 (bases 1 to 2680) AUTHORS Zimbelmann,R. TITLE Direct Submission JOURNAL Submitted (04-JUL-1994) Zimbelmann R., German Cancer Research Center, Institute for Cell Biology, Im Neuenheimer Feld 280, D-69120 Heidelberg, Federal Republic of Germany REMARK revised by [4] MAT REFERENCE 3 (bases 1 to 2680) AUTHORS Bosch,A. TITLE Direct Submission JOURNAL Submitted (23-SEP-1994) ASSUMPCIO BOSCH, MOLECULAR GENETICS, CANCER RESEARCH INSTITUTE, HOPITAL DURAN I REYNALS, CTRA. CASTELLDEFELS KM 2.7, L'HOPITALET, DE LLOBREGAT, BARCELONA, 08907, SPAIN FEATURES Location/Qualifiers source 1..2680 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="H16-2" CDS 253..2433 /codon_start=1 /product="plakophilin" /db_xref="PID:g550115" /translation="MNHSPLKTALAYECFQDQDNSTLALPSDQKMKTGTSGRQRVQEQ VMMTVKRQKSKSSQSSTLSHSNRGSMYDGLADNYNYGTTSRSSYYSKFQAGNGSWGYP IYNGTLKREPDNRRFSSYSQMENWSRHYPRGSCNTTGAGSDICFMQKIKASRSEPDLY CDPRGTLRKGTLGSKGQKTTQNRYSFYSTCSGQKAIKKCPVRPPSCASKQDPVYIPPI SCNKDLSFGHSRASSKICSEDIECSGLTIPKAVQYLSSQDEKYQAIGAYYIQHTCFQD ESAKQQVYQLGGICKLVDLLRSPNQNVQQAAAGALRNLVFRSTTNKLETRRQNGIREA VSLLRRTGNAEIQKQLTGLLWNLSSTDELKEELIADALPVLADRVIIPFSGWCDGNSN MSREVVDPEVFFNATGCLRNLSSADAGRQTMRNYSGLIDSLMAYVQNCVAASRCDDKS VENCMCVLHNLSYRLDAEVPTRYRQLEYNARNAYTEKSSTGCFSNKSDKMMNNNYDCP LPEEETNPKGSGWLYHSDAIRTYLNLMGKSKKDATLEACAGALQNLTASKGLMSSGMS QLIGLKEKGLPQIARLLQSGNSDVVRSGASLLSNMSRHPLLHRVMGNQVFPEVTRLLT SHTGNTSNSEDILSSACYTVRNLMASQPQLAKQYFSSSMLNNIINLCRSSASPKAAEA ARLLLSDMWSSKELQGVLRQQGFDRNMLGTLAGANSLRNFTSRF" BASE COUNT 618 a 818 c 763 g 481 t ORIGIN 1 ggggtggtgc agggcagggg tggtatatcc tgtctgacgg agggcgggcc tcgccagtgc 61 cagagaggga cgaaccaggg tggaagcgcc aggagcagct gcagggagcc ctcacgcgga 121 cctcgcactc tatggccgta gggagccgct gagagcgaga agagcacgct cctgcccgcc 181 cgctgcaccg cacctcgcct cgcctctctg ctctcctagg ccccggccgc gcgccacccg 241 cctcccgcca ccatgaacca ctcgccgctc aagaccgcct tggcgtacga atgcttccag 301 gaccaggaca actccacgtt ggctttgccg tcggaccaaa agatgaaaac aggcacgtct 361 ggcaggcagc gcgtgcagga gcaggtgatg atgaccgtca agcggcagaa gtccaagtct 421 tcccagtcgt ccaccctgag ccactccaat cgaggttcca tgtatgatgg cttggctgac 481 aattacaact atgggaccac cagcaggagc agctactact ccaagttcca ggcagggaat 541 ggctcatggg gatatccgat ctacaatgga accctcaagc gggagcctga caacaggcgc 601 ttcagctcct acagccagat ggagaactgg agccggcact acccccgggg cagctgtaac 661 accaccggcg caggcagcga catctgcttc atgcagaaaa tcaaggcgag ccgcagtgag 721 cccgacctct actgtgaccc acggggcacc ctgcgcaagg gcacgctggg cagcaagggc 781 cagaagacca cccagaaccg ctacagcttt tacagcacct gcagtggtca gaaggccata 841 aagaagtgcc ctgtgcgccc gccctcttgt gcctccaagc aggaccctgt gtatatcccg 901 cccatctcct gcaacaagga cctgtccttt ggccactcta gggccagctc caagatctgc 961 agtgaggaca tcgagtgcag tgggctgacc atccccaagg ctgtgcagta cctgagctcc 1021 caggatgaga agtaccaggc cattggggcc tattacatcc agcatacctg cttccaggat 1081 gaatctgcca agcaacaggt ctatcagctg ggaggcatct gcaagctggt ggacctcctc 1141 cgcagcccca accagaacgt ccagcaggcc gcggcagggg ccctgcgcaa cctggtgttc 1201 aggagcacca ccaacaagct ggagacccgg aggcagaatg ggatccgcga ggcagtcagc 1261 ctcctgagga gaaccgggaa cgccgagatc cagaagcagc tgactgggct gctctggaac 1321 ctgtcttcca ctgacgagct gaaggaggaa ctcattgccg acgccctgcc tgttctggcc 1381 gaccgcgtca tcattccctt ctctggctgg tgcgatggca atagcaacat gtcccgggaa 1441 gtggtggacc ctgaggtctt cttcaatgcc acaggctgct tgaggaacct gagctcggcc 1501 gatgcaggcc gccagaccat gcgtaactac tcagggctca ttgattccct catggcctat 1561 gtccagaact gtgtagcggc cagccgctgt gacgacaagt ctgtggaaaa ctgcatgtgt 1621 gttctgcaca acctctccta ccgcctggac gccgaggtgc ccacccgcta ccgccagctg 1681 gagtataacg cccgcaacgc ctacaccgag aagtcctcca ctggctgctt cagcaacaag 1741 agcgacaaga tgatgaacaa caactatgac tgccccctgc ctgaggaaga gaccaacccc 1801 aagggcagcg gctggttgta ccattcagat gccatccgca cctacctgaa cctcatgggc 1861 aagagcaaga aagatgctac cctggaggcc tgtgctggtg ccctgcagaa cctgacagcc 1921 agcaaggggc tgatgtccag tggcatgagc cagttgattg ggctgaagga aaagggcctg 1981 ccacaaattg cccgcctcct gcaatctggc aactctgatg tggtgcggtc cggagcctcc 2041 ctcctgagca acatgtcccg ccaccctctg ctgcacagag tgatggggaa ccaggtgttc 2101 ccggaggtga ccaggctcct caccagccac actggcaata ccagcaactc cgaagacatc 2161 ttgtcctcgg cctgctacac tgtgaggaac ctgatggcct cgcagccaca actggccaag 2221 cagtacttct ccagcagcat gctcaacaac atcatcaacc tgtgccgaag cagtgcctca 2281 cccaaggccg cagaagctgc ccggcttctc ctgtctgaca tgtggtccag caaggaactg 2341 cagggtgtcc tcagacagca aggtttcgat aggaacatgc tgggaacctt agctggggcc 2401 aacagcctca ggaacttcac ctcccgattc taagaagaga ctgtccaagc aagttaggct 2461 tgcaggaaga tatgacccag ctgagaagcc ctcaggcctc gctggatggg gttttctgtc 2521 catcctgtgc agtatttggg aaagttcaca agaaactgag aagaaaccta aaaactgtgg 2581 atagtggaaa gatttttaga tttttttttt ccttggggaa actggcaggc aatgggggtt 2641 agggaggttg gggcgggggg ggctttcttg agttaaaggg // LOCUS HSPLZFA 2197 bp RNA PRI 16-MAR-1993 DEFINITION H.sapiens of PLZF gene encoding kruppel-like zinc finger protein. ACCESSION Z19002 NID g38517 KEYWORDS kruppel-like zinc finger protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2197) AUTHORS Chen,Z., Brand,N.J., Chen,A., Chen,S.J., Tong,J.H., Wang,Z.Y., Waxman,S. and Zelent,A. TITLE Fusion between a novel Kruppel-like zinc finger gene and the retinoic acid receptor-alpha locus due to a variant t(11;17) translocation associated with acute promyelocytic leukaemia JOURNAL EMBO J. 12 (3), 1161-1167 (1993) MEDLINE 93209216 REFERENCE 2 (bases 1 to 2197) AUTHORS Zelent,A.Z. TITLE Direct Submission JOURNAL Submitted (09-DEC-1992) Zelent A. Z., Institute of Cancer Research, Leukaemia Research Fund Centre, Fulham Road, London, United Kingdom, SW3 6JB FEATURES Location/Qualifiers source 1..2197 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /dev_stage="adult" /tissue_type="heart" /clone_lib="human ventricular cDNA lambda ZapII library" /clone="PLZF" /chromosome="11q23.1" 5'UTR 1..75 gene 76..2097 /gene="PLZF" CDS 76..2097 /gene="PLZF" /function="transcription factor" /codon_start=1 /product="kruppel-like zinc finger protein" /db_xref="PID:g38518" /db_xref="SWISS-PROT:Q05516" /translation="MDLTKMGMIQLQNPSHPTGLLCKANQMRLAGTLCDVVIMVDSQE FHAHRTVLACTSKMFEILFHRNSQHYTLDFLSPKTFQQILEYAYTATLQAKAEDLDDL LYAAEILEIEYLEEQCLKMLETIQASDDNDTEATMADGGAEEEEDRKARYLKNIFISK HSSEESGYASVAGQSLPGPMVDQSPSVSTSFGLSAMSPTKAAVDSLMTIGQSLLQGTL QPPAGPEEPTLAGGGRHPGVAEVKTEMMQVDEVPSQDSPGAAESSISGGMGDKVEERG KEGPGTPTRSSVITSARELHYGREESAEQVPPPAEAGQAPTGRPEHPAPPPEKHLGIY SVLPNHKADAVLSMPSSVTSGLHVQPALAVSMDFSTYGGLLPQGFIQRELFSKLGELA VGMKSESRTIGEQCSVCGVELPDNEAVEQHRKLHSGMKTYGCELCGKRFLDSLRLRMH LLAHSAGAKAFVCDQCGAQFSKEDALETHRQTHTGTDMAVFCLLCGKRFQAQSALQQH MEVHAGVRSYICSECNRTFPSHTALKRHLRSHTGDHPYECEFCGSCFRDESTLKSHKR IHTGEKPYECNGCDKKFSLKHQLETHYRVHTGEKPFECKLCHQRSRDYSAMIKHLRTH NGASPYQCTICTEYCPSLSSMQKHMKGHKPEEIPPDWRIEKTYLYLCYV" 3'UTR 2098..2197 BASE COUNT 518 a 628 c 683 g 368 t ORIGIN 1 caggaagccc acccagcccc gccacgcaga gcccagaagg aaagaaagcc tcatgcctga 61 gccgagggga gcaccatgga tctgacaaaa atgggcatga tccagctgca gaaccctagc 121 caccccacgg ggctactgtg caaggccaac cagatgcggc tggccgggac tttgtgcgat 181 gtggtcatca tggtggacag ccaggagttc cacgcccacc ggacggtgct ggcctgcacc 241 agcaagatgt ttgagatcct cttccaccgc aatagtcaac actatacttt ggacttcctc 301 tcgccaaaga ccttccagca gattctggag tatgcatata cagccacgct gcaagccaag 361 gcggaggacc tggatgacct gctgtatgcg gccgagatcc tggagatcga gtacctggag 421 gaacagtgcc tgaagatgct ggagaccatc caggcctcag acgacaatga cacggaggcc 481 accatggccg atggcggggc cgaggaagaa gaggaccgca aggctcggta cctcaagaac 541 atcttcatct cgaagcattc cagcgaggag agtgggtatg ccagtgtggc tggacagagc 601 ctccctgggc ccatggtgga ccagagccct tcagtctcca cttcatttgg tctttcagcc 661 atgagtccca ccaaggctgc agtggacagt ttgatgacca taggacagtc tctcctgcag 721 ggaactcttc agccacctgc agggcccgag gagccaactc tggctggggg tgggcggcac 781 cctggggtgg ctgaggtgaa gacggagatg atgcaggtgg atgaggtgcc cagccaggac 841 agccctgggg cagccgagtc cagcatctca ggagggatgg gggacaaggt tgaggaaaga 901 ggcaaagagg ggcctgggac cccgactcga agcagcgtca tcaccagtgc tagggagcta 961 cactatgggc gagaggagag tgccgagcag gtgccacccc cagctgaggc tggccaggcc 1021 cccactggcc gacctgagca cccagcaccc ccgcctgaga agcatctggg catctactcc 1081 gtgttgccca accacaaggc tgacgctgta ttgagcatgc cgtcttccgt gacctctggc 1141 ctccacgtgc agcctgccct ggctgtctcc atggacttca gcacctatgg ggggctgctg 1201 ccccagggct tcatccagag ggagctgttc agcaagctgg gggagctggc tgtgggcatg 1261 aagtcagaga gccggaccat cggagagcag tgcagcgtgt gtggggtcga gcttcctgat 1321 aacgaggctg tggagcagca caggaagctg cacagtggga tgaagacgta cgggtgcgag 1381 ctctgcggga agcggttcct ggatagtttg cggctgagaa tgcacttact ggctcattca 1441 gcgggtgcca aagcctttgt ctgtgatcag tgcggtgcac agttttcgaa ggaggatgcc 1501 ctggagacac acaggcagac ccatactggc actgacatgg ccgtcttctg tctgctgtgt 1561 gggaagcgct tccaggcgca gagcgcactg cagcagcaca tggaggtcca cgcgggcgtg 1621 cgcagctaca tctgcagtga gtgcaaccgc accttcccca gccacacggc tctcaaacgc 1681 cacctgcgct cacatacagg cgaccacccc tacgagtgtg agttctgtgg cagctgcttc 1741 cgggatgaga gcacactcaa gagccacaaa cgcatccaca cgggtgagaa accctacgag 1801 tgcaatggct gtgacaagaa gttcagcctc aagcatcagc tggagacgca ctatagggtg 1861 cacacaggtg agaagccctt tgagtgtaag ctctgccacc agcgctcccg ggactactcg 1921 gccatgatca agcacctgag aacgcacaac ggcgcctcgc cctaccagtg caccatctgc 1981 acagagtact gccccagcct ctcctccatg cagaagcaca tgaagggcca caagcccgag 2041 gagatcccgc ccgactggag gatagagaag acgtacctct acctgtgcta tgtgtgaagg 2101 gaggcccgcg gcggtggagc cgagcgggga gccaggaaag aagagttgga gtgagatgaa 2161 ggaaggacta tgacaaataa aaaaaaaaaa ggaattc // LOCUS HSPMIPR 1211 bp RNA PRI 12-SEP-1993 DEFINITION Human PMI gene for a putative receptor protein. ACCESSION X51804 NID g35534 KEYWORDS receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1211) AUTHORS Murphy,P.M. TITLE Direct Submission JOURNAL Submitted (08-FEB-1990) Murphy P.M., National Institute of Health, Bldg 10 Room 11N 110, Bethesda Maryland 20892, U S A REFERENCE 2 (bases 1 to 1211) AUTHORS Murphy,P.M. and Malech,H.L. TITLE Nucleotide sequence of a cDNA encoding a protein with primary structural similarity to G-protein coupled receptors JOURNAL Nucleic Acids Res. 18 (7), 1896 (1990) MEDLINE 90245590 FEATURES Location/Qualifiers source 1..1211 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="HL60" /clone_lib="lambda gt10" CDS 259..837 /note="putative receptor (AA 1-192)" /codon_start=1 /db_xref="PID:g35535" /db_xref="SWISS-PROT:P17152" /translation="MAAWGRRRLGPGSSGGSARERVSLSATDCYIVHEIYNGENAQDQ FEYELEQALEAQYKYIVIEPTRIGDETARWITVGNCLHKTAVLAGTACLFTPLALPLD YSHYISLPAGVLSLACCTLYGISWQFDPCCKYQVEYDAYKLSRLPLHTLTSSTPVVLV RKDDLHRKRLHNTIALAALVYCVKKIYELYAV" BASE COUNT 254 a 354 c 357 g 246 t ORIGIN 1 ggcccccccc ccccctagaa atgctcgaac caggacggct cctggagtcc tcgcgccctc 61 gcagaaggac tacgggcccc ggcgaccccg ggggcggggc ttccggcgcg ctgccttgtg 121 ggcacggtag ttccgccggg tctggcttcc gcctgccgag cggccccgga ccgcaggccg 181 gactacactt cccgtcggcc cgcctgctct cccgatgccg ccttggcgcg agacgttggc 241 aagcagagtg tctccaagat ggccgcttgg ggaaggaggc gtcttggccc gggcagcagt 301 ggcggcagcg cccgagagag ggtgagcttg tcggccacag actgctacat tgtgcatgag 361 atctacaatg gggagaatgc ccaagaccag tttgagtacg agctggagca ggccctggaa 421 gcccagtaca agtacattgt gattgagccc actcgcattg gcgacgagac agcccgctgg 481 atcaccgtgg gcaactgcct gcacaagacg gccgtgctgg cgggcaccgc ctgcctcttc 541 accccgttgg cgctgccctt agattattcc cactacattt ccctgcccgc tggtgtgctg 601 agcctggcct gctgcaccct ctatgggatc tcctggcagt ttgacccttg ctgcaagtac 661 caagtggagt acgacgccta taaactgtcg cgcctgcctc tgcacacact cacctcctcc 721 accccggtgg tgctggtccg gaaggacgac ctgcacagaa agagactgca caacacgata 781 gcactggccg ccctggtgta ctgtgtaaag aagatttacg aactctatgc cgtatgattt 841 cagtagaaca gggagcgaag caaaaccacc cggcccacaa gagacaacag agtattcaga 901 tcgccacact ctgtgaggca gcagagcctg ggcaggtgtt tggcttagta tttgttattt 961 ttaaaaaata acagatcacg ggtgtaccca gggtttttca gctcattaca ctaagatgtg 1021 gatttccata acccaagagg ggggtctgag gctgtggaag tccgactggg cagtggaatg 1081 ctgatggagg cagacgctgc cgagggggtg tggacgtgct ttgggggagg tctttaagtc 1141 tattgtttaa ctgtaccatc cagagcccac cagaagctat tgatcattaa aattatgaga 1201 atttcaactc c // LOCUS HSPMP35HM 1838 bp mRNA PRI 21-JAN-1998 DEFINITION Homo sapiens mRNA for peroxisomal integral membrane protein. ACCESSION Y12860 NID g2808530 KEYWORDS peroxisomal integral membrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1838) AUTHORS Wylin,T., Fransen,M., Mannaerts,G.P. and Van Veldhoven,P.P. JOURNAL Unpublished REFERENCE 2 (bases 1 to 1838) AUTHORS Van Veldhoven,P.P. TITLE Direct Submission JOURNAL Submitted (30-APR-1997) P.P. Van Veldhoven, Katholieke Universiteit Leuven, Campus Gasthuisberg, Afd. Farmakologie, Herestraat, B-3000 Leuven, BELGIUM COMMENT Related sequences: AA356050, AA447027, AA326069, AA385739, AA330152, N87113, R54274, Z44749, T19546, AA092826, T19547. FEATURES Location/Qualifiers source 1..1838 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="22" /clone_lib="lambda gt11" /dev_stage="adult" /tissue_type="liver" /map="q13" CDS 122..1045 /note="Candida boidinii PMP47 homologue" /codon_start=1 /product="peroxisomal integral membrane protein" /db_xref="PID:e1245450" /db_xref="PID:g2808531" /translation="MASVLSYESLVHAVAGAVGSVTAMTVFFPLDTARLRLQVDEKRK SKTTHMVLLEIIKEEGLLAPYRGWFPVISSLCCSNFVYFYTFNSLKALWVKGQHSTTG KDLVVGFVAGVVNVLLTTPLWVVNTRLKLQGAKFRNEDIVPTNYKGIIDAFHQIIRDE GISALWNGTFPSLLLVFNPAIQFMFYEGLKRQLLKKRMKLSSLDVFIIGAVAKAIATT VTYPLQTVQSILRFGRHRLNPENRTLGSLRNILYLLHQRVRRFGIMGLYKGLEAKLLQ TVLTAALMFLVYEKLTAATFTVMGLKRAHQH" BASE COUNT 515 a 383 c 394 g 546 t ORIGIN 1 gcgggtttga gagggcgggg attgcgactc tcacaccctg agctccggtg ctcctttcct 61 aactccactg gctgcggcat ctgtgggaaa agtgtggctg ggtcttcgag gagccgaacc 121 aatggcttcc gtgctgtcct acgaaagcct ggtccacgcc gtggccggag ccgtgggaag 181 cgtgacagca atgacagtgt tttttcccct ggatacagct agacttcgac ttcaggttga 241 tgagaaaaga aaatccaaaa ctacacacat ggtgctcctg gagatcatta aagaagaagg 301 actcctggca ccatatcgag ggtggtttcc agtgatttcc agtctctgct gctccaattt 361 tgtctatttc tacactttta atagcctcaa agcactctgg gtcaaaggtc aacattctac 421 cactggaaaa gatctcgtag ttgggtttgt tgcaggagtg gttaatgtgt tgctaacaac 481 tccactctgg gtggtaaaca ccagactgaa gcttcaagga gcaaaattta ggaatgaaga 541 cattgtacca acaaactaca aaggtatcat tgatgctttt catcagatca ttcgcgatga 601 aggaatctcg gctttatgga atggcacatt tccctcattg ctgttggtct tcaatcctgc 661 catccagttc atgttttatg aaggtttaaa acggcagctt ttaaagaaac ggatgaagct 721 ttcttccttg gatgtgttca tcattggtgc agtagccaaa gcgattgcca ccacggtgac 781 ctatcccctg cagacggtac agtcaattct gaggtttggg cgtcatagac taaacccaga 841 aaacagaaca ttgggaagtc ttcggaatat tctctatctt cttcaccaac gagtaagacg 901 ttttggaata atgggactct acaaaggcct tgaagccaaa ctgctgcaga cagtcctcac 961 tgctgctctc atgttccttg tttatgagaa actgacagct gccaccttca cagttatggg 1021 gctgaagcgt gcacaccaac actgagacgc cttcccatga aaaattccga agatgctcaa 1081 gagggaggtt tcctcctgag tgaagagaag tgattctccc ttgactctgg ctcctgcacc 1141 acaaatgtta ccctcattgg cttgaaaagc atccaagggt gcacagggag tatggccaac 1201 tggacctgtt gtcaccttaa ttgtcatgct ggctggttgg attttggggt ggcagttgga 1261 ctaatgtgaa aaaaacattg ctgaaaacct aaaaatgaaa gtttgtgagt gtttattggt 1321 tttcttaaga gaaatggact attttgctct catgtgtaat gttttctatt taaatctttc 1381 ttaaatatac cagctgttct ctttccctga actctccccc aggttctagg acaaatttaa 1441 taacatgtaa ttctgctcaa atacttttgt atgtctcagt gttggtgttt tcctccctaa 1501 aactaacatt agggcttgtc cacgggcatg actttatttt tgttgggctt ttttttccct 1561 gcttaaggag aggtgtcttt tttggatatg agctatttat tttgtgaaat gaaaattgtt 1621 cacccaaatg attctcttat aaactatttg taaatgtcac ttattcatta gtgtttgaca 1681 taatttttag aatatttatt ttgaatcaat cctttcatta cgaaagactt gaagttttgt 1741 gtccattctt acaagccctg gtcagtcaag tcccaataaa tggtcagcac aaaaaaaaaa 1801 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa // LOCUS HSPMP70 3285 bp RNA PRI 23-OCT-1992 DEFINITION Human PMP70 mRNA for a peroxisomal membrane protein. ACCESSION X58528 NID g35552 KEYWORDS peroxisomal membrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3285) AUTHORS Kamijo,K. TITLE Direct Submission JOURNAL Submitted (15-MAR-1991) K. Kamijo, Dept of Biochemistry, Shinshu University School of Medicine, 3-1-1 Asahi, Matsumoto 390, JAPAN REFERENCE 2 (bases 1 to 3285) AUTHORS Kamijo,K., Kamijo,T., Ueno,I., Osumi,T. and Hashimoto,T. TITLE Nucleotide sequence of the human 70 kDa peroxisomal membrane protein: a member of ATP-binding cassette transporters JOURNAL Biochim. Biophys. Acta 1129 (3), 323-327 (1992) MEDLINE 92162752 COMMENT multidrug resistannt protein (P-glycoprotein) related. FEATURES Location/Qualifiers source 1..3285 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /clone_lib="cDNA in lambda-gt11" /clone="C2, C5, A5, A20, NC1" mRNA 1..3285 /gene="PMP70" /note="cDNA" gene 1..3285 /gene="PMP70" 5'UTR 1..24 /gene="PMP70" CDS 25..2004 /gene="PMP70" /codon_start=1 /product="70kDa peroxisomal membrane protein" /db_xref="PID:g35553" /db_xref="SWISS-PROT:P28288" /translation="MAAFSKYLTARNSSLAGAAFLLLCLLHKRRRALGLHGKKSGKPP LQNNEKEGKKERAVVDKVFFSRLIQILKIMVPRTFCKETGYLVLIAVMLVSRTYCDVW MIQNGTLIESGIIGRSRKDFKRYLLNFIAAMPLISLVNNFLKYGLNELKLCFRVRLTK YLYEEYLQAFTYYKKGNLDNRIANPDQLLTQDVEKFCNSVVDLYSNLSKPFLDIVLYI FKLTSAIGAQGPASMMAYLVVSGLFLTRLRRPIGKMTITEQKYEGEYRYVNSRLITNS EEIAFYNGNKREKQTVHSVFRKLVEHLHNFILFRFSMGFIDSIIAKYLATVVGYLVVS RPFLDLSHPRHLKSTHSELLEDYYQSGRMLLRMSQALGRIVLAGREMTRLAGFTARIT ELMQVLKDLNHGKYERTMVSQQEKGIEGVQVIPLIPGAGEIIIADNIIKFDHVPLATP NGDVLIRDLNFEVRSGANVLICGPNGCGKSSLFRVLGELWPLFGGRLTKPERRKLFYV PQRPYMTLGTLRDQVIYPDGREDQKRKGISDLVQKEYLDNVQLGHILEREGGWDSVQD WMDVLSGGEKQRMAMARLFYHKPQFAILDECTSAVSVDVEGYIYSHCRKVGITLFTVS HRKSLWKHHEYYLHMDGRGNYEFKQITEDTVEFGS" 3'UTR 2002..3285 /gene="PMP70" BASE COUNT 997 a 574 c 690 g 1024 t ORIGIN 1 cggctcgctg gtaccggcag tgccatggcg gccttcagca agtacttgac ggcgcgaaac 61 tcctcgctgg ctggtgccgc gttcctgctg ctctgcctgc tccacaagcg gcgccgcgcc 121 ctcggcctgc acggtaagaa aagtggaaaa ccaccattac agaataatga gaaagaagga 181 aaaaaagaac gagctgtggt ggacaaagtg tttttctcaa ggctcataca gatcctgaaa 241 atcatggtcc ctagaacatt ttgtaaagag acaggttact tggtacttat tgctgttatg 301 ctggtgtctc gaacatattg tgatgtttgg atgattcaaa atgggacact aattgaaagt 361 ggtatcattg gtcgtagcag gaaagatttc aagagatact tactcaactt catcgctgcc 421 atgcctctta tctctctggt taataacttc ttgaagtatg ggttaaatga gcttaaactg 481 tgcttccgag taaggctcac taaatacctc tatgaggagt atcttcaagc cttcacatat 541 tataaaaagg ggaatctgga caacagaata gctaatccag accagctgct tacacaagat 601 gtagaaaaat tttgtaacag tgtagtcgat ctgtattcaa atcttagtaa gccattttta 661 gacatagttt tgtatatctt taagttaacg agtgcaattg gagctcaggg cccagcgagc 721 atgatggcct acttggttgt ttctgggcta ttcctaactc gacttcgaag acccattggt 781 aagatgacaa taactgagca aaagtatgaa ggagaatata gatatgttaa ttctcggctc 841 atcacaaaca gtgaagaaat tgccttttac aatgggaata aaagagaaaa gcagacagtc 901 cactcagtct tccgaaaact ggtggaacac ctacataatt tcattttgtt tcggttttca 961 atgggcttca ttgatagtat tattgccaaa taccttgcca ctgttgttgg ttacctagtt 1021 gtcagtcgcc ctttcttaga tttgtctcat cctcgacatc tcaagagtac acattcggaa 1081 cttctagagg attactacca aagtggaaga atgcttttgc gaatgtctca agctctgggt 1141 cgaatagttt tggctgggcg tgaaatgact agattggccg gttttactgc tcggattaca 1201 gaattaatgc aagtactgaa ggatttaaat catggcaaat atgagcgcac aatggtctca 1261 caacaggaaa agggtattga aggagtacaa gtcattccct tgatacctgg tgctggagaa 1321 atcattattg cagataacat tataaagttt gatcatgttc ctttagcaac gccaaatgga 1381 gatgttttga tccgagacct taattttgaa gttcgatctg gggctaatgt tctaatttgt 1441 ggtccaaatg gctgcggaaa gagttcactt ttccgtgttc ttggtgaatt atggcctctt 1501 tttggaggac gtctaactaa acctgaaaga agaaaattat tttatgttcc tcagagacct 1561 tacatgaccc ttggaacact tcgagatcaa gtgatatatc cagatggacg agaagatcag 1621 aaaaggaagg gaatttctga cctagtacag aaggaatact tagacaatgt ccagttgggt 1681 catatccttg aacgtgaagg aggctgggac agtgttcagg attggatgga cgtactcagt 1741 ggtggagaaa agcaaagaat ggcgatggca agattatttt atcataaacc ccagtttgcc 1801 attttggatg aatgcacaag tgcagttagt gtcgacgtgg aaggctacat ttatagtcat 1861 tgtcgaaagg ttggcatcac tctcttcact gtgtctcata ggaaatctct ttggaaacat 1921 catgagtact acctgcatat ggatggcaga ggcaactatg aattcaaaca gataacagaa 1981 gatacagttg agtttggctc ttagagaaat ctggagaact atacctgctt cagtgaaata 2041 attacagaat atacttagaa aggcaaagta cattgtaaaa taaagttgag cttagttttt 2101 tttaaaaaaa aaaacaaagc caaccaaatt atattagata cagaataatg gagaacaagt 2161 tgttaaaaca tttaatatta tataggatat tgctaattgt gtatatgttg gtttaattaa 2221 taatatgtac taagaatgtc cttattcttg tggttaaaaa cctgcctaaa ttaaattggg 2281 cttcaatcat gtaacctgat tcatcctggg atgtaaacca ttcgaagtca gctaattgga 2341 cttttatggc tctatctttt ccttcatgaa gaaccctatt taaaactggg tcatcatttg 2401 tcctgttcta gcaagatagt cttcagtttc atttcctgtg ccctgtggta gttggaaacc 2461 atatcataat gtattattta aatgtttaac atcattgcat aacacgttta ttatacagtg 2521 gcagatttct ttagctgcca cagtaatact cattccttgt gtgtgtcttg gagtgcattt 2581 gactccagga aaagccattt tggttttcct taactaaatg ataaatgtac ccctctcagt 2641 ctgcagtatt gagttgttta aagtatatgt gcagtcttgc ttacaaggag gggttaccat 2701 gtatcacacc taatcttccc aatgtttggg aatattaaaa caccaacagt ccttaacatg 2761 ccaggctcaa ggtcttataa gagttctaga tttttaagag aattagacaa atttgtgtgt 2821 gttagaagcc cattcattag aagtgtggtg gttatttggt attaaactca aacagtgcca 2881 agcttgggaa ggcactacaa tgaaataatg cactgagtat gcaatgctat cactgtcttt 2941 gactgtgatt ttatgtttaa aaagtatgtt ctaaaattat tatatataca tgggtgaatt 3001 atgtttccga ggcactgttt tatctctgtg aatcttgaat aactttttta tatttgggtt 3061 atgatgtcaa acgatcctaa gcgaagatga tttcagttca tcaaatcatc attaatgact 3121 ttatgtatta tttgcacagg gagaattgaa actgagtata atcaataagc tagatacgaa 3181 atcagtttct caaactgagc ttcagaaagg ggcattttgt actcttgttt ttgcataact 3241 ggttttgttt ttttgcagaa ttaactataa caatcactgg ctacg // LOCUS HSPMSCL 2834 bp RNA PRI 25-JUL-1993 DEFINITION H.sapiens mRNA for PM/Scl 100kD nucleolar protein. ACCESSION X66113 S45703 NID g35554 KEYWORDS autoantigen; nucleolar protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2834) AUTHORS Bluthner,M. TITLE Direct Submission JOURNAL Submitted (19-MAY-1992) M. Bluthner, Institute of Molecular Genetics, INF230, 6900 Heidelberg, FRG REFERENCE 2 (bases 1 to 2834) AUTHORS Bluthner,M. and Bautz,F.A. TITLE Cloning and characterization of the cDNA coding for a polymyositis-scleroderma overlap syndrome-related nucleolar 100-kD protein JOURNAL J. Exp. Med. 176 (4), 973-980 (1992) MEDLINE 93018847 FEATURES Location/Qualifiers source 1..2834 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="lambda gt11" CDS 38..2695 /codon_start=1 /product="PM/Scl 100kD nucleolar protein" /db_xref="PID:g35555" /translation="MAPPSTREPRVLSATSATKSDGEMVLPGFPDADSFVKFALGSVV AVTKASGGLPQFGDEYDFYRSFPGFQAFCETQGDRLLQCMSRVMQYHGCRSNIKDRSK VTELEDKFDLLVDANDVILERVGILLDEASGVNKNQQPVLPAGLQVPKTVVSSWNRKA AEYGKKAKSETFRLLHAKNIIRPQLKFREKIDNSNTPFLPKIFIKPNAQKPLPQALSK ERRERPQDRPEDLDVPPALADFIHQQRTQQVEQDMFAHPYQYELNHFTPADAVLQKPQ PQLYRPIEETPCHFISSLDELVELNEKLLNCQEFAVDLEHHSYRSFLGLTCLMQISTR TEDFIIDTLELRSDMYILNESLTDPAIVKVFHGADSDIEWLQKDFGLYVVNMFDTHQA ARLLNLGRHSLDHLLKLYCNVDSNKQYQLADWRIRPLPEEMLSYARDDTHYLLYIYDK MRLEMWERGNGQPVQLQVVWQRSRDICLKKFIKPIFTDESYLELYRKQKKHLNTQQLT AFQLLFAWRDKTARREDESYGYVLPNHMMLKIAEELPKEPQGIIACCNPVPPLVRQQI NEMHLLIQQAREMPLLKSEVAAGVKKSGPLPSAERLENVLFGPHDCSHAPPDGYPIIP TSGSVPVQKQASLFPDEKEDNLLGTTCLIATAVITLFNEPSAEDSKKGPLTVAQKKAQ NIMESFENPFRMFLPSLGHRAPVSQAAKFDPSTKIYEISNRWKLAQVQVQKDSKEAVK KKAAEQTAAREQAKEACKAAAEQAISVRQQVVLENAAKKRERATSDPRTTEQKQEKKR LKISKKPKDPEPPEKEFTPYDYSQSDFKAFAGNSKSKVSSQFDPNKQTPSGKKCIAAK KIKQSVGNKSMSFPTGKSDRGFRYNWPQR" BASE COUNT 870 a 710 c 663 g 591 t ORIGIN 1 acaagctctc gcgagacgag ccgtgcaggc tgaaaaaatg gcgccaccca gtacccggga 61 gcccagggtc ctgtcggcga ccagcgcaac caaatccgac ggagagatgg tgctgccagg 121 cttcccggac gccgacagct ttgtgaagtt tgctcttggg tccgtggtgg cagtcaccaa 181 ggcatctggg ggcctaccac agtttggcga tgagtatgat ttttaccgaa gttttcctgg 241 cttccaagca ttttgcgaaa cacagggaga caggttgctt cagtgcatga gcagagtaat 301 gcagtaccat gggtgtcgca gcaacattaa ggatcgaagt aaagtgactg agctggaaga 361 caagtttgat ttactagttg atgccaatga tgtaattctg gagagagtgg gtattttact 421 ggatgaagcc tcaggtgtaa acaagaatca acagcctgtc ctccctgccg gcttgcaggt 481 ccccaaaacg gtagtgtcca gctggaaccg taaggcagca gaatatggca aaaaagcaaa 541 atctgaaact ttccggctgc ttcatgcaaa aaatatcatc cgacctcagc tcaagtttcg 601 agagaagatt gacaattcca acacaccatt tcttcctaaa atcttcatca aacccaatgc 661 tcagaaacct ctccctcaag ctctctctaa ggaaaggcgg gaacgcccac aggatcgtcc 721 tgaggacttg gacgtccccc ctgcactggc tgatttcatc catcagcaga gaacccagca 781 ggttgagcaa gacatgtttg cacatcctta tcaatatgaa ctaaatcact ttaccccagc 841 agatgcagtg cttcaaaagc cacaacccca gttatacaga cctatagaag agacaccatg 901 ccatttcata tcctccctgg atgaactcgt ggaactcaac gaaaagctct tgaattgtca 961 ggaatttgca gttgacttgg agcaccactc ttacaggagc ttcctgggac tgacctgcct 1021 gatgcaaatt tctactcgga cggaagactt catcattgac accctcgagc ttcgaagtga 1081 catgtacatt ctcaatgaga gcctcacaga cccagccatc gttaaggtct ttcatggtgc 1141 tgattcagac atagaatggc tacagaaaga ctttgggttg tatgtagtaa acatgtttga 1201 tactcatcag gcagcacgcc ttcttaacct gggcaggcac tcactcgatc atctcctgaa 1261 actctactgc aacgtggact caaacaagca atatcagctg gctgattgga gaatacgccc 1321 tctgcccgag gagatgctca gctacgcccg ggatgacacc cattacctgc tatatatcta 1381 tgacaaaatg aggctggaga tgtgggagcg cggcaacggg cagcccgtgc agctgcaggt 1441 ggtgtggcaa cggagcaggg acatctgcct caagaaattc atcaaaccta tcttcacgga 1501 tgagtcctac cttgaactct ataggaagca gaagaagcac cttaacacac agcagttgac 1561 agcctttcag ctgctgtttg cctggaggga taaaacagct cgcagggaag atgaaagtta 1621 cggatatgta ctgccaaacc acatgatgct gaaaatagct gaagaactgc ctaaggaacc 1681 tcagggcatc atagcttgct gcaacccagt accgcccctt gtgcggcagc agatcaacga 1741 aatgcacctt ttaatccagc aggcccgaga gatgcccctg ctcaagtctg aagttgcagc 1801 cggagtgaag aagagcggac cgctgcccag tgctgagaga ttggagaatg ttctctttgg 1861 acctcacgac tgctcccatg cccctccgga tggctatcca atcatcccaa ccagtggatc 1921 tgtgccagtt cagaagcagg cgagcctctt ccctgatgaa aaagaagata acttgctggg 1981 taccacatgc ctgattgcca cagctgtcat cacgttattt aatgaaccta gtgctgaaga 2041 cagtaaaaag ggtccattga cagttgcaca gaaaaaagcc cagaacatca tggagtcctt 2101 tgaaaatcca tttaggatgt ttctgccctc actgggacac cgtgctcccg tctctcaggc 2161 agcgaagttc gatccatcaa ccaaaatcta tgaaatcagc aaccgttgga agctggccca 2221 ggtacaagta caaaaagact ctaaagaagc tgtcaagaag aaggcagctg agcaaacagc 2281 tgcccgggaa caggcaaagg aggcgtgcaa agctgcagca gaacaggcca tctccgtccg 2341 acagcaggtc gtgctagaaa atgctgcaaa gaagagagag cgagcaacaa gcgacccaag 2401 gaccacagaa cagaaacaag agaagaaacg actcaaaatt tccaagaagc caaaggaccc 2461 agagccacca gaaaaagagt ttacgcctta cgactacagc cagtcagact tcaaggcttt 2521 tgctggaaac agcaaatcca aagtttcttc tcagtttgat ccaaataaac agaccccgtc 2581 tggcaagaaa tgcattgcag ccaaaaaaat taaacagtcg gtgggaaaca aaagcatgtc 2641 ctttccaact ggaaagtcag acagaggctt caggtacaac tggccacaga gatagtcctg 2701 gaagacacgt ggcgcctgtg gaccggaagc accaaatgct ggtgctgctt ttgtacatac 2761 atatttttaa accattaaaa ttcttcctga aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2821 aaaaaaaaaa aaaa // LOCUS HSPNP 1418 bp RNA PRI 19-JAN-1995 DEFINITION Human mRNA for purine nucleoside phosphorylase (PNP; EC 2.4.2.1). ACCESSION X00737 K02574 NID g35564 KEYWORDS phosphorylase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1418) AUTHORS Williams,S.R., Goddard,J.M. and Martin,D.W. Jr. TITLE Human purine nucleoside phosphorylase cDNA sequence and genomic clone characterization JOURNAL Nucleic Acids Res. 12 (14), 5779-5787 (1984) MEDLINE 84272252 COMMENT Data kindly reviewed (30-JAN-1986) by S.R. Williams. FEATURES Location/Qualifiers source 1..1418 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 110..979 /note="PNP" /codon_start=1 /db_xref="PID:g35565" /db_xref="SWISS-PROT:P00491" /translation="MENGYTYEDYKNTAEWLLSHTKHRPQVAIICGSGLGGLTDKLTQ AQIFDYSEIPNFPRSTVPGHAGRLVFGFLNGRACVMMQGRFHMYEGYPLWKVTFPVRV FHLLGVDTLVVTNAAGGLNPKFEVGDIMLIRDHINLPGFSGQNPLRGPNDERFGDRFP AMSDAYDRTMRQRALSTWKQMGEQRELQEGTYVMVAGPSFETVAECRVLQKLGADAVG MSTVPEVIVARHCGLRVFGFSLITNKVIMDYESLEKANHEEVLAAGKQAAQKLEQFVS ILMASIPLPDKAS" BASE COUNT 352 a 358 c 362 g 346 t ORIGIN 1 aactgtgcga accagacccg gcagccttgc tcagttcagc atagcggagc ggatccgatc 61 ggatcggagc acaccggagc aggctcatcg agaaggcgtc tgcgagacca tggagaacgg 121 atacacctat gaagattata agaacactgc agaatggctt ctgtctcata ctaagcaccg 181 acctcaagtt gcaataatct gtggttctgg attaggaggt ctgactgata aattaactca 241 ggcccagatc tttgactaca gtgaaatccc caactttcct cgaagtacag tgccaggtca 301 tgctggccga ctggtgtttg ggttcctgaa tggcagggcc tgtgtgatga tgcagggcag 361 gttccacatg tatgaagggt acccactctg gaaggtgaca ttcccagtga gggttttcca 421 ccttctgggt gtggacaccc tggtagtcac caatgcagca ggagggctga accccaagtt 481 tgaggttgga gatatcatgc tgatccgtga ccatatcaac ctacctggtt tcagtggtca 541 gaaccctctc agagggccca atgatgaaag gtttggagat cgtttccctg ccatgtctga 601 tgcctacgac cggactatga ggcagagggc tctcagtacc tggaaacaaa tgggggagca 661 acgtgagcta caggaaggca cctatgtgat ggtggcaggc cccagctttg agactgtggc 721 agaatgtcgt gtgctgcaga agctgggagc agacgctgtt ggcatgagta cagtaccaga 781 agttatcgtt gcacggcact gtggacttcg agtctttggc ttctcactca tcactaacaa 841 ggtcatcatg gattatgaaa gcctggagaa ggccaaccat gaagaagtct tagcagctgg 901 caaacaagct gcacagaaat tggaacagtt tgtctccatt cttatggcca gcattccact 961 ccctgacaaa gccagttgac ctgccttgga gtcgtctggc atctcccaca caagacccaa 1021 gtagctgcta ccttctttgg ccccttgctg gagtcatgtg cctctgtcct taggttgtag 1081 cagaaaggaa aagattcctg tccttcacct ttcccacttt cttctaccag acccttctgg 1141 tgccagatcc tcttctcaaa gctgggatta caggtgtgag catagtgaga ccttggcgct 1201 acaaaataaa gctgttctca ttcctgttct ttcttacaca agagctggag cccgtgccct 1261 accacacatc tgtggagatg cccaggattt gactcgggcc ttagaacttt gcatagcagc 1321 tgctactagc tctttgagat aatacattcc gaggggctca gttctgcctt atctaaatca 1381 ccagagacca aacaaggact aatccaatac ctcttgga // LOCUS HSPNUTL1 2000 bp RNA PRI 29-OCT-1997 DEFINITION Homo sapiens mRNA for peanut-like protein 1, PNUTL1 (hCDCrel-1). ACCESSION Y11593 NID g2370150 KEYWORDS cytokinesis; peanut-like protein 1; PNUTL1 gene; septin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2000) AUTHORS McKie,J.M., Sutherland,H.F., Harvey,E., Kim,U.J. and Scambler,P.J. TITLE A human gene similar to Drosophila melanogaster peanut maps to the DiGeorge syndrome region of 22q11 JOURNAL Hum. Genet. 101 (1), 6-12 (1997) MEDLINE 98046335 REFERENCE 2 (bases 1 to 2000) AUTHORS Scambler,P.J. TITLE Direct Submission JOURNAL Submitted (04-MAR-1997) P.J. Scambler, Institute of Child Health, Room 214, 30 Guilford St., London WC1N 1EH, UK REFERENCE 3 (bases 1 to 2000) AUTHORS Zieger,B., Hashimoto,Y. and Ware,J. TITLE Alternative expression of platelet glycoprotein Ib(beta) mRNA from an adjacent 5' gene with an imperfect polyadenylation signal sequence JOURNAL J. Clin. Invest. 99 (3), 520-525 (1997) MEDLINE 97174353 FEATURES Location/Qualifiers source 1..2000 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="22" /germline /dev_stage="adult" /tissue_type="heart" /map="q11" gene 46..1155 /gene="PNUTL1" CDS 46..1155 /gene="PNUTL1" /function="putative cytokinesis" /note="peanut-like protein 1" /codon_start=1 /product="putative septin" /db_xref="PID:e1167897" /db_xref="PID:g2370151" /translation="MSTGLRYKSKLATPEDKQDIDKQYVGFATLPNQVHRKSVKKGFD FTLMVAGESGLGKSTLVHSLFLTDLYKDRKLLSAEERISQTVEILKHTVDIEEKGVKL KLTIVDTPGFGDAVNNTECWKPITDYVDQQFEQYFRDESGLNRKNIQDNRVHCCLYFI SPFGHGLRPVDVGFMKALHEKVNIVPLIAKADCLVPSEIRKLKERIREEIDKFGIHVY QFPECNSDEDEDFKQQDRELKESAPFAVIGSNTVVEAKGQRVRGRLYPWGIVEVENQA HCDFVKLRNMLIRTHMHDLKDVTCDVHYENYRAHCIQQMTSKLTQDSRMESPIPILPL PTPDAETEKLIRMKDEELRRMQEMLQRMKQQMQDQ" polyA_signal 1972..1977 BASE COUNT 428 a 640 c 594 g 338 t ORIGIN 1 ccccgcccgc gagcccgccc cgcacgtccc ccgccggcgg ccaccatgag cacaggcctg 61 cggtacaaga gcaagctggc gaccccagag gacaagcagg acattgacaa gcagtacgtg 121 ggcttcgcca cactgcccaa ccaggtgcac cgcaagtcgg tgaagaaagg ctttgacttc 181 acactcatgg tggcaggtga gtcaggcctg gggaagtcca cactggtcca cagcctcttc 241 ctgacagact tgtacaagga ccggaagctg ctcagtgctg aggagcgcat cagccagacg 301 gtagagattc taaaacacac ggtggacatt gaggagaagg gagtcaagct gaagctcacc 361 atcgtggaca cgccgggatt cggggacgct gtcaacaaca ccgagtgctg gaagcccatc 421 accgactatg tggaccagca gtttgagcag tacttccgtg atgagagcgg cctcaaccga 481 aagaacatcc aagacaaccg agtgcactgc tgcctatact tcatctcccc cttcgggcat 541 gggctgcggc cagtggatgt gggtttcatg aaggcattgc atgagaaggt caacatcgtg 601 cctctcatcg ccaaagctga ctgtcttgtc cccagtgaga tccggaagct gaaggagcgg 661 atccgggagg agattgacaa gtttgggatc catgtatacc agttccctga gtgtaactcg 721 gacgaggatg aggacttcaa gcagcaggac cgggaactga aggagagcgc gcccttcgcc 781 gttataggca gcaacacggt ggtggaggcc aaggggcagc gggtccgggg ccgactgtac 841 ccctggggga tcgtggaggt ggagaaccag gcgcattgcg acttcgtgaa gctgcgcaac 901 atgctcatcc gcacgcatat gcacgacctc aaggacgtga cgtgcgacgt gcactacgag 961 aactaccgcg cgcactgcat ccagcagatg accagcaaac tgacccagga cagccgcatg 1021 gagagcccca tcccgatcct gccgctgccc accccggacg ccgagactga gaagcttatc 1081 aggatgaagg atgaggaact gaggcgcatg caggagatgc tgcagaggat gaagcagcag 1141 atgcaggacc agtgacgctc gccgcggaca caccgtccgt ctccgggacg ccctcgcacc 1201 cctggacacc agaccggact gttcccgacc cggagacgcg gggccacagc ccccagctga 1261 ccctaattta ttctcagcac caccccctcc caggtcattg tgtctgtttc cgaggggcct 1321 ggaccgtagc ccccgcccag ctggccctct ctgaccttgg gggatcagga gcgaagttgg 1381 gcgggacttc agagatccgc ctcccttgcc cttcccccgc ccccggacgg tcacagcacc 1441 caaaccgcag gccctgctct ggcaggcagg caaagctagg cagaagagga ttcccaggat 1501 cctgggtctg ttccctgccc cagtgctgca gaacggactt gggagccctc ctttgcctgc 1561 tcccgcgggt cacccagcga gtgctgagac cccattttct gtcgaggcgg gccgagtctt 1621 cccttatccc cagacgccta gcgggcaggg ttgggctgaa tcaaatggga gccctccaga 1681 cataaggagg ccagaggctg caaggagcgg ggtcgtgacc gcttacaccc cttctccaca 1741 gcccggcccg acctggaggg cccccggggc actgggcggt gagccacctc ctggcaactc 1801 tcggtgccgt cccctgccct cgctcgaggc ctcttctccc cagcaccgct gtggtgtgcc 1861 gggatcctga gcctaggcct cccgatgttc ccacccgcat gatcccttcc cgccacacga 1921 tgctccgttt tcttccgttg tgaatgccgc gtcctgtcct ggtgacagga gaacaatgtt 1981 ggtgaacgtc aaaaaaaaaa // LOCUS HSPOLAR 5433 bp RNA PRI 26-JUL-1995 DEFINITION Human mRNA for DNA polymerase alpha-subunit. ACCESSION X06745 NID g35567 KEYWORDS DNA polymerase; DNA polymerase alpha subunit; polymerase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5433) AUTHORS Wong,S.W., Wahl,A.F., Yuan,P.M., Arai,N., Pearson,B.E., Arai,K., Korn,D., Hunkapiller,M.W. and Wang,T.S. TITLE Human DNA polymerase alpha gene expression is cell proliferation dependent and its primary structure is similar to both prokaryotic and eukaryotic replicative DNA polymerases JOURNAL EMBO J. 7 (1), 37-47 (1988) MEDLINE 88196090 COMMENT Data kindly reviewed (16-Jan-1989) by Wang T.S.F. FEATURES Location/Qualifiers source 1..5433 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="pre-B-cell" /cell_line="KB" /clone_lib="E1 in lambda gt10" /clone="E1-14b8, E1-14a, E1-12, E1-19" mRNA <1..5433 /note="DNA polymerase alpha mRNA" CDS 17..4405 /note="DNA polymerase alpha-subunit (AA 1 - 1462)" /codon_start=1 /db_xref="PID:g35568" /db_xref="SWISS-PROT:P09884" /translation="MAPVHGDDSLSDSGSFVSSRARREKKSKKGRQEALERLKKAKAG EKYKYEVEDFTGVYEEVDEEQYSKLVQARQDDDWIVDDDGIGYVEDGREIFDDDLEDD ALDADEKGKDGKARNKDKRNVKKLAVTKPNNIKSMFIACAGKKTADKAVDLSKDGLLG DILQDLNTETPQITPPPVMILKKKRSIGASPNPFSVHTATAVPSGKIASPVSRKEPPL TPVPLKRAEFAGDDVQVESTEEEQESGAMEFEDGDFDEPMEVEEVDLEPMAAKAWDKE SEPAEEVKQEADSGKGTVSYLGSFLPDVSCWDIDQEGDSSFSVQEVQVDSSHLPLVKG ADEEQVFHFYWLDAYEDQYNQPGVVFLFGKVWIESAETHVSCCVMVKNIERTLYFLPR EMKIDLNTGKETGTPISMKDVYEEFDEKIATKYKIMKFKSKPVEKNYAFEIPDVPEKS EYLEVKYSAEMPQLPQDLKGETFSHVFGTNTSSLELFLMNRKIKGPCWLEVKKSTALN QPVSWCKVEAMALKPDLVNVIKDVSPPPLVVMAFSMKTMQNAKNHQNEIIAMAALVHH SFALDKAAPKPPFQSHFCVVSKPKDCIFPYAFKEVIEKKNVKVEVAATERTLLGFFLA KVHKIDPDIIVGHNIYGFELEVLLQRINVCKAPHWSKIGRLKRSNMPKLGGRSGFGER NATCGRMICDVEISAKELIRCKSYHLSELVQQILKTERVVIPMENIQNMYSESSQLLY LLEHTWKDAKFILQIMCELNVLPLALQITNIAGNIMSRTLMGGRSERNEFLLLHAFYE NNYIVPDKQIFRKPQQKLGDEDEEIDGDTNKYKKGRKKGAYAGGLVLDPKVGFYDKFI LLLDFNSLYPSIIQEFNICFTTVQRVASEAQKVTEDGEQEQIPELPDPSLEMGILPRE IRKLVERRKQVKQLMKQQDLNPDLILQYDIRQKALKLTANSMYGCLGFSYSRFYAKPL AALVTYKGREILMHTKEMVQKMNLEVIYGDTDSIMINTNSTNLEEVFKLGNKVKSEVN KLYKLLEIDIDGVFKSLLLLKKKKYAALVVEPTSDGNYVTKQELKGLDIVRRDWCDLA KDTGNFVIGQILSDQSRDTIVENIQKRLIEIGENVLNGSVPVSQFEINKALTKDPQDY PDKKSLPHVHVALWINSQGGRKVKAGDTVSYVICQDGSNLTASQRAYAPEQLQKQDNL TIDTQYYLAQQIHPVVARICEPIDGIDAVLIATWLGLDPTQFRVHHYHKDEENDALLG GPAQLTDEEKYRDCERFKCPCPTCGTENIYDNVFDGSGTDMEPSLYRCSNIDCKASPL TFTVQLSNKLIMDIRRFIKKYYDGWLICEEPTCRNRTRHLPLQFSRTGPLCPACMKAT LQPEYSDKSLYTQLCFYRYIFDAECALEKLTTDHEKDKLKKQFFTPKVLQDYRKLKNT AEQFLSRSGYSEVNLSKLFAGCAVKS" misc_feature 5415..5420 /note="polyA signal" polyA_site 5433 /note="polyA site" BASE COUNT 1673 a 1052 c 1271 g 1437 t ORIGIN 1 ggggagattc gggaccatgg cacctgtgca cggcgacgac tctctgtcag attcagggag 61 ttttgtatct tctcgagccc ggcgagaaaa aaaatcaaag aaggggcgcc aagaagccct 121 agaaagactg aaaaaggcta aagctggtga gaagtataaa tatgaagtcg aggacttcac 181 aggtgtttat gaagaagttg atgaagaaca gtattcgaag ctggttcagg cacgccagga 241 tgatgactgg attgtggatg atgatggtat tggctatgtg gaagatggcc gagagatttt 301 tgatgatgac cttgaagatg atgcccttga tgctgatgag aaaggaaaag atggtaaagc 361 acgcaataaa gacaagagga atgtaaagaa gctcgcagtg acaaaaccga acaacattaa 421 gtcaatgttc attgcttgtg ctggaaagaa aactgcagat aaagctgtag acttgtccaa 481 ggatggtctg ctaggtgaca ttctacagga tcttaacact gagacacctc aaataactcc 541 accacctgta atgatactga agaagaaaag atccattgga gcttcaccga atcctttctc 601 tgtgcacacc gccacggcag ttccttcagg aaaaattgct tcccctgtct ccagaaagga 661 gcctccatta actcctgttc ctcttaaacg tgctgaattt gctggcgatg atgtacaggt 721 cgagagtaca gaagaagagc aggagtcagg ggcaatggag tttgaagatg gtgactttga 781 tgagcccatg gaagttgaag aggtggacct ggagcctatg gctgccaagg cttgggacaa 841 agagagtgag ccagcagagg aagtgaaaca agaggcggat tctgggaaag ggaccgtgtc 901 ctacttagga agttttctcc cggatgtctc ttgttgggac attgatcaag aaggtgatag 961 cagtttctca gtgcaagaag ttcaagtgga ttccagtcac ctcccattgg taaaaggggc 1021 agatgaggaa caagtattcc acttttattg gttggatgct tatgaggatc agtacaacca 1081 accaggtgtg gtatttctgt ttgggaaagt ttggattgaa tcagccgaga cccatgtgag 1141 ctgttgtgtc atggtgaaaa atatcgagcg aacgctttac ttccttcccc gtgaaatgaa 1201 aattgatcta aatacgggga aagaaacagg aactccaatt tcaatgaagg atgtttatga 1261 ggaatttgat gagaaaatag caacaaaata taaaattatg aagttcaagt ctaagccagt 1321 ggaaaagaac tatgcttttg agatacctga tgttccagaa aaatctgagt acttggaagt 1381 taaatactcg gctgaaatgc cacagcttcc tcaagatttg aaaggagaaa ctttttctca 1441 tgtatttggg accaacacat ctagcctgga actgttcttg atgaacagaa agatcaaagg 1501 accttgttgg cttgaagtaa aaaagtccac agctcttaat cagccagtca gttggtgtaa 1561 agttgaggca atggctttga aaccagacct ggtgaatgta attaaggatg tcagtccacc 1621 accgcttgtc gtgatggctt tcagcatgaa gacaatgcag aatgcaaaga accatcaaaa 1681 tgagattatt gctatggcag ctttggtcca tcacagtttt gcattggata aagcagcccc 1741 aaagcctccc tttcagtcac acttctgtgt tgtgtctaaa ccaaaggact gtatttttcc 1801 atatgctttc aaagaagtca ttgagaaaaa gaatgtgaag gttgaggttg ctgcaacaga 1861 aagaacactg ctaggttttt tccttgcaaa agttcacaaa attgatcctg atatcattgt 1921 gggtcataat atttatgggt ttgaactgga agtactactg cagagaatta atgtgtgcaa 1981 agctcctcac tggtccaaga taggtcgact gaagcgatcc aacatgccaa agcttggggg 2041 ccggagtgga tttggtgaaa gaaatgctac ctgtggtcga atgatctgtg atgtggaaat 2101 ttcagcaaag gaattgattc gttgtaaaag ctaccatctg tctgaacttg ttcagcagat 2161 tctaaaaact gaaagggttg taatcccaat ggaaaatata caaaatatgt acagtgaatc 2221 ttctcaactg ttatacctgt tggaacacac ctggaaagat gccaagttca ttttgcagat 2281 catgtgtgag ctaaatgttc ttccattagc attgcagatc actaacatcg ctgggaacat 2341 tatgtccagg acgctgatgg gtggacgatc cgagcgtaac gagttcttgt tgcttcatgc 2401 attttacgaa aacaactata ttgtgcctga caagcagatt ttcagaaagc ctcagcaaaa 2461 actgggagat gaagatgaag aaattgatgg agataccaat aaatacaaga aaggacgtaa 2521 gaaaggagct tatgctggag gcttggtttt ggaccccaaa gttggttttt atgataagtt 2581 cattttgctt ctggacttca acagtctata tccttccatc attcaggaat ttaacatttg 2641 ttttacaaca gtacaaagag ttgcttcaga ggcacagaaa gttacagagg atggagaaca 2701 agaacagatc cctgagttgc cagatccaag cttagaaatg ggcattttgc ccagagagat 2761 ccggaaactg gtagaacgga gaaaacaagt caaacagcta atgaaacagc aagacttaaa 2821 tccagacctt attcttcagt atgacattcg acagaaggct ttgaagctca cagcgaacag 2881 tatgtatggt tgcctgggat tttcctatag cagattttac gccaaaccac tggctgcctt 2941 ggtgacatac aaaggaaggg agattttgat gcatacgaaa gagatggtac aaaagatgaa 3001 tcttgaagtt atttatggag atacagattc aattatgata aacaccaata gcaccaatct 3061 ggaagaagta tttaagttgg gaaacaaggt aaaaagtgaa gtgaataagt tgtacaaact 3121 gcttgaaata gacattgatg gggttttcaa gtctctgcta ctgctgaaaa aaaagaagta 3181 cgctgctctg gttgttgagc caacgtcgga tgggaattat gtcaccaaac aggagctcaa 3241 aggattagat atagttagaa gagattggtg tgatcttgct aaagacactg gaaactttgt 3301 gattggccag attctttctg atcaaagccg ggacactata gtggaaaaca ttcagaagag 3361 gctgatagaa attggagaaa atgtgctaaa tggcagtgtc ccagtgagcc agtttgaaat 3421 taacaaggca ttgacaaagg atccccagga ttaccctgat aaaaaaagcc tacctcatgt 3481 acatgttgcc ctctggataa attctcaagg aggcagaaag gtgaaagctg gagatactgt 3541 gtcatatgtc atctgtcagg atggatcaaa cctcactgca agtcagaggg cctatgcgcc 3601 tgagcagctg cagaaacagg ataatctaac cattgacacc cagtactacc tggcccagca 3661 gatccaccca gtcgtggctc ggatctgtga accaatagac ggaattgatg ctgtcctcat 3721 tgcaacgtgg ttgggacttg accccaccca atttagagtt catcattatc ataaagatga 3781 agagaatgat gctctacttg gtggcccagc acagctcact gatgaagaga aatacaggga 3841 ctgtgaaaga ttcaaatgtc catgccctac atgtggaact gagaatattt atgataatgt 3901 ctttgatggt tcgggaacag atatggagcc cagcttgtat cgttgcagta acatcgattg 3961 taaggcttca cctctgacct ttacagtaca actgagcaac aaattgatca tggacattag 4021 acgtttcatt aaaaagtact atgatggctg gttgatatgt gaagagccaa cctgtcgcaa 4081 tcgaactcgt caccttcccc ttcaattctc ccgaactggg cctctttgcc cagcctgcat 4141 gaaagctaca cttcaaccag agtattctga caagtccctg tacacccagc tgtgctttta 4201 ccggtacatt tttgatgcgg agtgtgcact ggagaaactt actaccgatc atgagaaaga 4261 taaattgaag aagcaatttt ttacccccaa agttctgcag gactacagaa aactcaagaa 4321 cacagcagag caattcttgt cccgaagtgg ctactccgaa gtgaatctga gcaaactctt 4381 cgctggttgt gccgtgaaat cctaagggaa tcccaggagt aaccaaggag ggggtagttg 4441 aaaaatccca gcttcctctg tgcctccact ctggccctaa atgctcctcc agcatctgtt 4501 tctcccttgg gactgtgtct catgtttgtg tgaatgtaga ccaggaaagg gggctgcaaa 4561 aatgttgagt ctaatgttcg taagcatcat agaaattcct gtcttcatat taagatgtac 4621 tgctttaaaa cacaactcca gagcccctcc ccaagctccc ctccccaagc tcctgaagac 4681 ccggtttctg agggagggaa attgctactt ggattgagag tagctggaat gtaagtgacc 4741 ccaggctttg ctcagggcct ttagcctatg tcccccccac ataaagagag cttctcagag 4801 cctgactgaa gagctgacgt tttgcttttt catatgccaa ttaaacccgg tctaaatcca 4861 aatgcttctc cagccatcca ggagtggctg tccttttcag tcttgtcttt tatataggta 4921 gctgaggggg aagatttaga agccttgcac tcactaaata gattaaacag agcaggcttg 4981 tttgttgaat tgctccaaag tccaacagac acacactgag caggtgtttt acactcacat 5041 tccctttttg ccccttaaat agaaagtgca ggtaaaggtt tatacaacaa gaaagcacat 5101 tgaaaataat ttgatactct aacaatccat taacatgtgt aggggttacg gtgaggatca 5161 tgtgttgtat tcgaaaaacg gggagaggga tgcttaattg gccctcgctt gctatttttt 5221 tctcatttct tcacaatagg accgtctttg gcagcagcaa aatgtatttc agtatggcag 5281 tctttcctct cttacattat tggtaagatt atactaacaa aatgtttccc cttgtacaat 5341 tatgctgtgt ttttaaaaaa cattgacctg tgtgttttta taaaagaaaa agtatgttgt 5401 gccttcttct taagaataaa gttttctaaa ggg // LOCUS HSPP15 894 bp RNA PRI 12-SEP-1993 DEFINITION Human gene for PP15 (placental protein 15). ACCESSION X07315 NID g35578 KEYWORDS placental protein 15. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 894) AUTHORS Grundmann,U. TITLE Direct Submission JOURNAL Submitted (30-MAR-1988) Grundmann U., Behringwerke AG, Forschung Molekularbiologie, Postfach 1140, 3550 Marburg, FRG REFERENCE 2 (bases 1 to 894) AUTHORS Grundmann,U., Nerlich,C., Rein,T., Lottspeich,F. and Kupper,H.A. TITLE Isolation of cDNA coding for the placental protein 15 (PP15) JOURNAL Nucleic Acids Res. 16 (10), 4721 (1988) MEDLINE 88247772 FEATURES Location/Qualifiers source 1..894 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" RBS 36..41 /note="ribosome binding site" CDS 100..483 /note="PP15 (AA 1-127)" /codon_start=1 /db_xref="PID:g35579" /db_xref="SWISS-PROT:P13662" /translation="MGDKPIWEQIGSSFIQHYYQLFDNDRTQLGAIYIDASCLTWEGQ QFQGKAAIVEKLSSLPFQKIQHSITAQDHQPTPDSCIISMVVGQLKADEDPIMGFHQM FLLKNINDAWVCTNDMFRLALHNFG" polyA_site 860..865 /note="polyA site" BASE COUNT 226 a 230 c 213 g 225 t ORIGIN 1 ggaagggaca gtcggccgca gaccgcgctg ggttgccgct gccgctgccg ccatcgtgcc 61 agcccctcgg gtctccgtga ggccgggtga cgctccagaa tgggagacaa gccaatttgg 121 gagcagattg gatccagctt cattcaacat tactaccagt tatttgataa tgatagaacc 181 caactaggcg caatttacat tgacgcgtca tgccttacgt gggaaggaca acagttccag 241 gggaaagctg ccattgtgga gaagttgtct agccttccgt tccagaaaat tcagcacagc 301 atcaccgcgc aggaccatca gcccactcca gatagctgca tcatcagcat ggttgtgggc 361 cagcttaagg cggatgaaga ccccatcatg gggttccacc agatgttcct attaaagaac 421 atcaacgatg cttgggtttg caccaatgac atgttcaggc tcgccctgca caactttggc 481 tgacctcctc tcagctaggc actcacgctg tttcctcctc cctcctcttc ccaatactat 541 tcccactcct ccagatgctc caaatatcat gcacaaatga gcagggccgc ggtgggagtg 601 ggcgcagtgc gctgctgcca ctgaggtgtt gtgcatgatg tttggatgct agactagttg 661 catctgacgg gagaagtttg tgttgtacca gcgcatgcct tggaaagact taagtaatgc 721 aaaaggttgt cctttttttt tttttttttt ttttaatcta ctgacaagtt gctctagtaa 781 cccaaagaag tgaaggagaa agcagctgcc tcaccgccca gacattgatt tgttcagatg 841 tttcaatgcc tcatgataca ataaaaccac aaaaattttc ttaacaaaaa aaaa // LOCUS HSPP2A 1541 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for protein phosphatase 2A (beta-type). ACCESSION X12656 NID g35580 KEYWORDS phosphatase; protein phosphatase 2A. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1541) AUTHORS Hemmings,B.A., Wernet,W., Mayer,R., Maurer,F., Hofsteenge,J. and Stone,S.R. TITLE The nucleotide sequence of the cDNA encoding the human lung protein phosphatase 2A beta catalytic subunit JOURNAL Nucleic Acids Res. 16 (23), 11366 (1988) MEDLINE 89083568 REFERENCE 2 (bases 1 to 1541) AUTHORS Stone,S. TITLE Direct Submission JOURNAL Submitted (22-AUG-1988) Stone S., Friedrich Miescher Institut, P.O. Box 2543, CH-4002 Basel, Switzerland COMMENT Data kindly reviewed (24-OCT-1988) by Stone S. FEATURES Location/Qualifiers source 1..1541 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="fibroblast" CDS 22..951 /note="protein phosphatase 2A (AA 1 - 309)" /codon_start=1 /db_xref="PID:g35581" /db_xref="SWISS-PROT:P11082" /translation="MDDKAFTKELDQWVEQLNECKQLNENQVRTLCEKAKEILTKESN VQEVRCPVTVCGDVHGQFHDLMELFRIGGKSPDTNYLFMGDYVDRGYYSVETVTLLVA LKVRYPERITILRGNHESRQITQVYGFYDECLRKYGNANVWKYFTDLFDYLPLTALVD GQIFCLHGGLSPSIDTLDHIRALDRLQEVPHEGPMCDLLWSDPDDRGGWGISPRGAGY TFGQDISETFNHANGLTLVSRAHQLVMEGYNWCHDRNVVTIFSAPNYCYRCGNQAAIM ELDDTLKYSFLQFDPAPRRGEPHVTRRTPDYFL" BASE COUNT 436 a 296 c 327 g 482 t ORIGIN 1 ccgagcccca gcccggccgc catggacgac aaggcgttca ccaaggagct ggaccagtgg 61 gtcgagcagc tgaacgagtg taagcagctg aacgagaacc aagtgcggac gctgtgcgag 121 aaggcaaagg aaattttaac aaaagaatca aatgtgcaag aggttcgttg ccctgttact 181 gtctgtggag atgtgcatgg tcaatttcat gatcttatgg aactctttag aattggtgga 241 aaatcaccgg atacaaacta cttattcatg ggtgactatg tagacagagg atattattca 301 gtggagactg tgactcttct tgtagcatta aaggtgcgtt atccagaacg cattacaata 361 ttgagaggaa atcacgaaag ccgacaaatt acccaagtat atggctttta tgatgaatgt 421 ctgcgaaagt atgggaatgc caacgtttgg aaatatttta cagatctctt tgattatctt 481 ccacttacag ctttagtaga tggacagata ttctgcctcc atggtggcct ctctccatcc 541 atagacacac tggatcatat aagagccctg gatcgtttac aggaagttcc acatgagggc 601 ccaatgtgtg atctgttatg gtcagatcca gatgatcgtg gtggatgggg tatttcacca 661 cgtggtgctg gctacacatt tggacaagac atttctgaaa cctttaacca tgccaatggt 721 ctcacactgg tttctcgtgc ccaccagctt gtaatggagg gatacaattg gtgtcatgat 781 cggaatgtgg ttaccatttt cagtgcaccc aattactgtt atcgttgtgg gaaccaggct 841 gctatcatgg aattagatga cactttaaaa tattccttcc ttcaatttga cccggcgcct 901 cgtcgtggtg agcctcatgt tacacggcgc accccagact acttcctata aatttctcct 961 gggaaacctg cctttgtatg tggaagtata cctggctttt taaaatatat gtatttaaaa 1021 acaaaaagca acagtaatct atgtgtttct gtaacaaatt gggatctgtc ttggcattaa 1081 accacatcat ggaccaaatg tgccatacta atgatgagca tttagcacaa tttgagactg 1141 aaatttagta cactatgttc tagataggtc agtctaacag tttgcctgct gtatttatag 1201 taaccatttt cctttggact gttcaagcaa aaaaggtaac taactgcttc atctcctttt 1261 gcgcttattt ggaaatttta gttatagtgt ttaactggca tggattaata gagttggagt 1321 tttattttta agaaaaattc acaagctaac ttccactaat ccattatcct ttattttatt 1381 gaaatgtata attaacttaa ctgaagaaaa ggttcttctt gggagtatgt tgtcataaca 1441 tttaaagaga tttcccttca tttaaactaa attactgttt tatgttgatc tgcatatttc 1501 tgtatatttg tcatgacagt gcttgcatcc tatttggtgt g // LOCUS HSPP6C 1342 bp RNA PRI 29-APR-1997 DEFINITION H.sapiens mRNA for protein phosphatase 6. ACCESSION X92972 NID g1945270 KEYWORDS PP6C gene; protein phosphatase 6. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1342) AUTHORS Bastians,H. and Ponstingl,H. TITLE The novel human protein serine/threonine phosphatase 6 is a functional homologue of budding yeast Sit4p and fission yeast ppe1, which are involved in cell cycle regulation JOURNAL J. Cell. Sci. 109 (Pt 12), 2865-2874 (1996) MEDLINE 97165573 REFERENCE 2 (bases 1 to 1342) AUTHORS Bastians,H. TITLE Direct Submission JOURNAL Submitted (13-NOV-1995) H. Bastians, German Cancer Research Center, Div. Molecular Biology of Mitosis, /0230, Im Neuenheimer Feld 280, D- 69120 Heidelberg, FRG REFERENCE 3 (bases 1 to 1342) AUTHORS Bastians,H., Krebber,H., Vetrie,D., Hoheisel,J., Lichter,P., Ponstingl,H. and Joos,S. TITLE Localization of the novel serine/threonine protein phosphatase 6 gene (PPP6C) to human chromosome Xq22.3 JOURNAL Genomics 41 (2), 296-297 (1997) MEDLINE 97288535 FEATURES Location/Qualifiers source 1..1342 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa S3" /clone_lib="lambda gt11" gene 72..989 /gene="PP6C" CDS 72..989 /gene="PP6C" /codon_start=1 /product="protein phosphatase 6" /db_xref="PID:e212188" /db_xref="PID:g1945271" /translation="MAPLDLDKYVEIARLCKYLPENDLKRLCDYVCDLLLEESNVQPV STPVTVCGDIHGQFYDLCELFRTGGQVPDTNYIFMGDFVDRGYYSLETFTYLLALKAK WPDRITLLRGNHESRQITQVYGFYDECQTKYGNANAWRYCTKVFDMLTVAALIDEQIL CVHGGLSPDIKTLDQIRTIERNQEIPHKGAFCDLVWSDPEDVDTWAISPRGAGWLFGA KVTNEFVHINNLKLICRAHQLVHEGYKFMFDEKLVTVWSAPNYCYRCGNIASIMVFKD VNTREPKLFRAVPDSERVIPPRTTTPYFL" BASE COUNT 361 a 284 c 278 g 419 t ORIGIN 1 ccatcctaat acgactcact atagggctcg agcggccgcc cgggcaggtg ccgcggcttg 61 ttcttcttaa aatggcgccg ctagacctgg acaagtatgt ggaaatagcg cggctgtgca 121 agtacctgcc agagaacgac ctgaagcggc tatgtgacta cgtttgtgac ctcctcttag 181 aagagtcaaa tgttcagcca gtatcaacac cagtaacagt gtgtggagat atccatggac 241 agttttatga cctttgtgaa ctgttcagaa ctggaggtca ggttcctgac acaaactaca 301 tatttatggg tgattttgta gacagaggtt actatagttt ggagaccttc acttaccttc 361 ttgcattaaa ggctaaatgg cctgatcgta ttacactttt gcgaggaaat catgagagta 421 gacagataac acaggtctat ggattttatg atgagtgcca aaccaaatat ggaaatgcta 481 atgcctggag atactgtacc aaagtttttg acatgctcac agtagcagct ttaatagatg 541 agcagatttt gtgtgtccat ggtggtttat ctcctgatat caaaacactg gatcaaattc 601 gaaccatcga acggaatcag gaaattcctc ataaaggagc attttgtgat ctggtttggt 661 cagatcctga agatgtggat acctgggcta tcagtccccg aggagcaggt tggctttttg 721 gagcaaaggt cacaaatgag tttgttcata tcaacaactt aaaactcatc tgcagagcac 781 atcaactagt gcacgaaggc tataaattta tgtttgatga gaagctggtg acagtatggt 841 ctgctcctaa ttactgctat cgttgtggaa atattgcttc gatcatggtc ttcaaagatg 901 taaatacaag agaaccaaag ttattccggg cagttccaga ttcagaacgt gttattcctc 961 ccagaacgac aacgccatat ttcctttgag gccttcgccc atcctgctga cccatttttc 1021 tgccctcttc ttaccccaat tttcttgtat taccctctac aatatacttt ttattgagca 1081 ctttgctgct gaaatgctgc ctcttgcctt tttttttttt taaattttta aattatctaa 1141 atttattgtt tgttgtggtg tctatagcaa agtttttcta tcaattttcc cccatcccat 1201 ccccaccctg gactcatttg agaagacttg agaaatgtct taatactcac actgctgcat 1261 gtagctcttg cttatttact ggtctgggaa acaggatgtg tttccttttt ttaaaagcca 1321 attgacagat tacacctaaa tc // LOCUS HSPPP1CB 3590 bp RNA PRI 16-AUG-1994 DEFINITION H.sapiens PPP1CB mRNA. ACCESSION X80910 NID g531475 KEYWORDS PPP1CB gene; protein phosphatase 1. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3590) AUTHORS Barker,H.M., Brewis,N.D., Street,A.J., Spurr,N.K. and Cohen,P.T. TITLE Three genes for protein phosphatase 1 map to different human chromosomes: sequence, expression and gene localisation of protein serine/threonine phosphatase 1 beta (PPP1CB) JOURNAL Biochim. Biophys. Acta 1220 (2), 212-218 (1994) MEDLINE 94146118 REFERENCE 2 (bases 1 to 3590) AUTHORS Cohen,P.T.W. TITLE Direct Submission JOURNAL Submitted (11-AUG-1994) P.T.W. Cohen, Medical Research Council Protein, Phosphorylation Unit, University of Dundee, Dept of Biochemistry, The University, Dundee, DD1 4HN Scotland, UK FEATURES Location/Qualifiers source 1..3590 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /cell_type="teratocarcinoma" /cell_line="NTERA2-cloneD1" /clone_lib="lgt10 cDNA" /chromosome="2" /map="q23" gene 259..1242 /gene="PPP1CB" CDS 259..1242 /gene="PPP1CB" /codon_start=1 /product="protein phosphotase 1 catyltic subunit beta isoform" /db_xref="PID:g531476" /db_xref="SWISS-PROT:P37140" /translation="MADGELNVDSLITRLLEVRGCRPGKIVQMTEAEVRGLCIKSREI FLSQPILLELEAPLKICGDIHGQYTDLLRLFEYGGFPPEANYLFLGDYVDRGKQSLET ICLLLAYKIKYPENFFLLRGNHECASINRIYGFYDECKRRFNIKLWKTFTDCFNCLPI AAIVDEKIFCCHGGLSPDLQSMEQIRRIMRPTDVPDTGLLCDLLWSDPDKDVQGWGEN DRGVSFTFGADVVSKFLNRHDLDLICRAHQVVEDGYEFFAKRQLVTLFSAPNYCGEFD NAGGMMSVDETLMCSFQILKPSEKKAKYQYGGLNSGRPVTPPRTANPPKKR" BASE COUNT 1030 a 580 c 730 g 1250 t ORIGIN 1 cctgggtctg acgcggccct gttcgagggg gcctctcttg tttatttatt tattttccgt 61 gggtgcctcc gagtgtgcgc gcgctctcgc tacccggcgg ggagggggtg gggggagggc 121 ccgggaaaag ggggagttgg agccggggtc gaaacgccgc gtgacttgta ggtgagagaa 181 cgccgagccg tcgccgcagc ctccgccgcc gagaagccct tgttcccgct gctgggaagg 241 agagtctgtg ccgacaagat ggcggacggg gagctgaacg tggacagcct catcacccgg 301 ctgctggagg tacgaggatg tcgtccagga aagattgtgc agatgactga agcagaagtt 361 cgaggcttat gtatcaagtc tcgggagatc tttctcagcc agcctattct tttggaattg 421 gaagcaccgc tgaaaatttg tggagatatt catggacaat atacagattt actgagatta 481 tttgaatatg gaggtttccc accagaagcc aactatcttt tcttaggaga ttatgtggac 541 agaggaaagc agtctttgga aaccatttgt ttgctattgg cttataaaat caaatatcca 601 gagaacttct ttctcttaag aggaaaccat gagtgtgcta gcatcaatcg catttatgga 661 ttctatgatg aatgcaaacg aagatttaat attaaattgt ggaagacctt cactgattgt 721 tttaactgtc tgcctatagc agccattgtg gatgagaaga tcttctgttg tcatggagga 781 ttgtcaccag acctgcaatc tatggagcag attcggagaa ttatgagacc tactgatgtc 841 cctgatacag gtttgctctg tgatttgcta tggtctgatc cagataagga tgtgcaaggc 901 tggggagaaa atgatcgtgg tgtttccttt acttttggag ctgatgtagt cagtaaattt 961 ctgaatcgtc atgatttaga tttgatttgt cgagctcatc aggtggtgga agatggatat 1021 gaattttttg ctaaacgaca gttggtaacc ttattttcag ccccaaatta ctgtggcgag 1081 tttgataatg ctggtggaat gatgagtgtg gatgaaactt tgatgtgttc atttcagata 1141 ttgaaaccat ctgaaaagaa agctaaatac cagtatggtg gactgaattc tggacgtcct 1201 gtcactccac ctcgaacagc taatccgccg aagaaaaggt gaagaaagga attctgtaaa 1261 gaaaccatca gatttgttaa ggacatactt cataatatat aagtgtgcac tgtaaaacca 1321 tccagccatt tgacaccctt tatgatgtca cacctttaac ttaaggagac gggtaaagga 1381 tcttaaattt ttttctaata gaaagatgtg ctacactgta ttgtaataag tatactctgt 1441 tatagtcaac aaagttaaat ccaaattcaa aattatccat taaagttaca tcttcatgta 1501 tcacaatttt taaagttgaa aagcatccca gttaaactag atgtgatagt taaaccagat 1561 gaaagcatga tgatccatct gtgtaatgtg gttttagtgt tgcttggttg tttaattatt 1621 ttgagcttgt tttgtttttg tttgttttca ctagaataat ggcaaatact tctaattttt 1681 ttccctaaac atttttaaaa gtgaaatatg ggaagagctt tacagacatt caccaactat 1741 tattttccct tgtttatcta cttagatatc tgtttaatct tactaagaaa actttcgcct 1801 cattacatta aaaaggaatt ttagagattg attgttttaa aaaaaaatac gcacattgtc 1861 caatccagtg attttaatca tacagtttga ctgggcaaac tttacagctg atagtgaata 1921 ttttgcttta tacaggaatt gacactgatt tggatttgtg cactctaatt tttaacttat 1981 tgatgctcta ttgtgcagta gcatttcatt taagataagg ctcatatagt attacccaac 2041 tagttggtaa tgtgattatg tggtaccttg gctttaggtt ttcattcgca cggaacacct 2101 tttggcatgc ttaacttcct ggtaacacct tcacctgcat tggttttctt tttctttttt 2161 ctttcttttt tttttttttt ttttttttga gttgttgttt gtttttagat ccacagtaca 2221 tgagaatcct tttttgacaa gccttggaaa gctgacactg tctctttttc ctccctctat 2281 acgaaggatg tatttaaatg aatgctggtc agtgggacat tttgtcaact atgggtattg 2341 ggtgcttaac tgtctaatat tgccatgtga atgttgtata cgattgtaag gcttatgtca 2401 ctaaagattt ttattctgat tttttcataa tcaaaggtca tatgatactg tatagacaag 2461 ctttgtagtg aagtatagta gcaataattt ctgtacctga tcaagtttat tgcagccttt 2521 cttttcctat ttcttttttt taagggttag tattaacaaa tggcaatgag tagaaaagtt 2581 aacatgaaga ttttagaagg agagaactta caggacacag atttgtgatt ctttgactgt 2641 gacactattg gatgtgattc taaaagcttt tattgagcat tgtcaaattt gtaagcttca 2701 tagggatgga catcatatct ataatgccct tctatatgtg ctaccataga tgtgacattt 2761 ttgaccttaa tatcgtcttt gaaaatgtta aattgagaaa cctgttaact tacattttat 2821 gaattggcac attgtattac ttactgcaag agatatttca ttttcagcac agtgcaaaag 2881 ttctttaaaa tgcatatgtc tttttttcta attccgtttt gttttaaagc acattttaaa 2941 tgtagttttc tcatttagta aaagttgtct aattgatatg aagcctgact gatttttttt 3001 ttccttacag tgagacattt aagcacacat tttattcaca tagatactat gtccttgaca 3061 tattgaaatg attcttttct gaaagtattc atgatctgca tatgatgtat taggttaggt 3121 cacaaaggtt ttatctgagg tgatttaaat aacttcctga ttggagtgtg taagctgagc 3181 gatttctaat aaaattttag ttgtacactt ttagtagtca tagtgaagca ggtctagaaa 3241 ataagccttt ggcagggaaa aagggcaatg ttgattaatc tcagtattaa accacattaa 3301 tctgtatccc attgtctggc ttttgtaaat tcatccaggt caagactaag tatgttggtt 3361 aataggaatc cttttttttt tttaaagact aaatgtgaaa aaataatcac tacttaagct 3421 aattaatatt ggtcattaaa tttaaaggat ggaaatttat catgtttaaa aattattcaa 3481 gcactcttaa aaccacttaa acagcctcca gtcataaaaa tgtgttcttt acaaatattt 3541 gcttggcaac acgacttgaa ataaataaaa ctttgtttct taggagaaaa // LOCUS HSPPPICC 2263 bp RNA PRI 16-NOV-1993 DEFINITION H.sapiens mRNA for protein phosphatase 1 gamma. ACCESSION X74008 S64371 NID g402777 KEYWORDS protein phosphatase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2263) AUTHORS Cohen,P.T.W. TITLE Direct Submission JOURNAL Submitted (07-JUL-1993) P.T.W. Cohen, University of Dundee, MRC Protein Phosphorylation Unit, Dept of Biochemistry, Dundee, DD1 4HN, Scotland, UK REFERENCE 2 (bases 1 to 2263) AUTHORS Barker,H.M., Craig,S.P., Spurr,N.K. and Cohen,P.T. TITLE Sequence of human protein serine/threonine phosphatase 1 gamma and localization of the gene (PPP1CC) encoding it to chromosome bands 12q24.1-q24.2 JOURNAL Biochim. Biophys. Acta 1178 (2), 228-233 (1993) MEDLINE 93349989 FEATURES Location/Qualifiers source 1..2263 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="teratocarcinoma" /cell_line="NTERA2" /clone_lib="cDNA in lambda gt10" /chromosome="12" /map="q24.1-q24.2" gene 155..1126 /gene="PPPICC" CDS 155..1126 /gene="PPPICC" /EC_number="3.1.3.16" /codon_start=1 /product="serine /threonine specific protein phosphatase" /db_xref="PID:g402778" /db_xref="SWISS-PROT:P36873" /translation="MADLDKLNIDSIIQRLLEVRGSKPGKNVQLQENEIRGLCLKSRE IFLSQPILLELEAPLKICGDIHGQYYDLLRLFEYGGFPPESNYLFLGDYVDRGKQSLE TICLLLAYKIKYPENFFLLRGNHECASINRIYGFYDECKRRYNIKLWKTFTDCFNCLP IAAIVDEKIFCCHGGLSPDLQSMEQIRRIMRPTDVPDQGLLCDLLWSDPDKDVLGWGE NDRGVSFTFGAEVVAKFLHKHDLDLICRAHQVVEDGYEFFAKRQLVTLFSAPNYCGEF DNAGAMMSVDETLMCSFQILKPAEKKKPNATRPVTPPRGMITKQAKK" BASE COUNT 648 a 427 c 498 g 690 t ORIGIN 1 aggaagtagg gagcggggtg gcaggggggg gacccgccgc ggctgctgcc accgccgcca 61 ccaccgcctc tgctcgtggc gtgggaaagg aggtgtgagt cccgggcgcg agccgcggcg 121 gcgccgctgc gggagggtcg gcggtgggaa ggcgatggcg gatttagata aactcaacat 181 cgacagcatt atccaacggc tgctggaagt gagagggtcc aagcctggta agaatgtcca 241 gcttcaggag aatgaaatca gaggactgtg cttaaagtct cgtgaaatct ttctcagtca 301 gcctatccta ctagaacttg aagcaccact caaaatatgt ggtgacatcc atggacaata 361 ctatgatttg ctgcgacttt ttgagtacgg tggtttccca ccagaaagca actacctgtt 421 tcttggggac tatgtggaca ggggaaagca gtcattggag acgatctgcc tcttactggc 481 ctacaaaata aaatatcctg agaatttttt tcttctcaga gggaaccatg aatgtgccag 541 catcaacaga atttatggat tttatgatga atgtaaaaga agatacaaca ttaaactatg 601 gaaaactttc acagactgtt ttaactgttt accgatagca gccatcgtgg atgagaagat 661 attctgctgt catggaggtt tatcaccaga tcttcaatct atggagcaga ttcggcgaat 721 tatgcgacca actgatgtac cagatcaagg tcttctttgt gatcttttgt ggtctgaccc 781 cgataaagat gtcttaggct ggggtgaaaa tgacagagga gtgtccttca catttggtgc 841 agaagtggtt gcaaaatttc tccataagca tgatttggat cttatatgta gagcccatca 901 ggtggttgaa gatggatatg aattttttgc aaagaggcag ttggtcactc tgttttctgc 961 gcccaattat tgcggagagt ttgacaatgc aggtgccatg atgagtgtgg atgaaacact 1021 aatgtgttct tttcagattt taaagcctgc agagaaaaag aagccaaatg ccacgagacc 1081 tgtaacgcct ccaaggggta tgatcacaaa gcaagcaaag aaatagatgt cgttttgaca 1141 ctgcctagtc gggacttgta acatagagta tataaccttc atttttaaga ctgtaatgtg 1201 tactggtcag cttgctcaga tagatctgtg tttgtggggg cccttccttc catttttgat 1261 ttagtgaatg gcatttgctg gttataacag caaatgaaag actcttcact ccaaaaagaa 1321 aagtgttttg ttttttaatt ctctgttcct tttgcaaaca attttaatga tggtgttaaa 1381 gctgtacacc ccaggacagt ttatcctgtc tgaggagtaa gtgtacaatt gatctttttt 1441 aattcagtac aacccataat catgtaaatg ctcattttct ttaggacata aagagagccc 1501 tagggtgctc tgaatctgta catgttcttg tcataaaatg catactgttg atacaaacca 1561 ctgtgaacat tttttatttg agaattttgt ttcaaaggga ttgctttttc ctctcattgt 1621 cttgttatgt acaaactagt ttttatagct atcaacatta ggagtaactt tcaaccttgc 1681 cagcatcact ggtatgatgt atatttaatt aaagcacact tttccccgac cgtatactta 1741 aaatgacaaa gccattcttt taaatatttg tgactctttc ctaaagccaa agtttctgtt 1801 gaattatgtt ttgacacacc cctaagtaca aggtggtatg gttgtataca catgctgcct 1861 tcttggggat tcaaaaacag gtttttgatt ttgaatagca attagtgata tagtgctgtt 1921 taagctacta acgataaaag gtaataacat tttatacaat ttccatatag tctattcatt 1981 aagtaatctt tttacagttg catcaggcct gaacccgtcc attcagaaag cttcaaatta 2041 tagaaacaat actgttctat acgagtgacc gattatgctt tctttggcct acattcttta 2101 ttctgcggtg aagttgaggc ttataagtta aaacaaagga actaacttac tgtccaccag 2161 tttatacaga actcacagta cctatgactt ttttaaacta agatctgtta aaaaagaaat 2221 ctgtttcaac agatgaccgt gtacaatacc gtgtggtgaa aat // LOCUS HSPPX 1382 bp RNA PRI 30-JUN-1993 DEFINITION H. sapiens mRNA for protein phosphatase X. ACCESSION X70218 S55208 NID g312813 KEYWORDS protein phosphatase X. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1382) AUTHORS Brewis,N.D. and Cohen,P.T. TITLE Protein phosphatase X has been highly conserved during mammalian evolution JOURNAL Biochim. Biophys. Acta 1171 (2), 231-233 (1992) MEDLINE 93129628 FEATURES Location/Qualifiers source 1..1382 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="teratocarcnoma" /clone_lib="teratocarcnoma cDNA lambda 10" gene 139..1062 /gene="PPX" CDS 139..1062 /gene="PPX" /codon_start=1 /product="protein phosphatase X" /db_xref="PID:g312814" /db_xref="SWISS-PROT:P33172" /translation="MAEISDLDRQIEQLRRCELIKESEVKALCAKAREILVEESNVQR VDSPVTVCGDIHGQFYDLKELFRVGGDVPERNYLFMGDFVDRGFYSVETFLLLLALKV RYPDRITLIRGNHESRQITQVYGFYDECLRKYGSVTVWRYCTEIFDYLSLSAIIDGKI FCVHGGLSPSIQTLDQIRTIDRKQEVPHDGPMCDLLWSDPEDTTGWGVSPRGAGYLFG SDVVAQFNAANDIDMICRAHQLVMEGYKWHFNETVLTVWSAPNYCYRCGNVAAILELD EHLQKDFIIFEAAPQETRGIPSKKPVADYFL" primer_bind 666..688 /gene="PPX" primer_bind 807..836 /gene="PPX" polyA_signal 1350..1355 BASE COUNT 302 a 386 c 400 g 294 t ORIGIN 1 cggcggcggc ggtcgaaagc ggagtgaaag agggaggcag ggagccggag agccggaacc 61 ggagtcgcag cggcggagac ccctgtgcgg tgcggagggg gcggcggccc cgactctgac 121 ccgcgccggg ggtgggccat ggcggagatc agcgacctgg accggcagat cgagcagctg 181 cgtcgctgcg agctcatcaa ggagagcgaa gtcaaggccc tgtgcgctaa ggccagagag 241 atcttggtag aggagagcaa cgtgcagagg gtggactcgc cagtcacagt gtgcggcgac 301 atccatggac aattctatga cctcaaagag ctgttcagag taggtggcga cgtccctgag 361 aggaactacc tcttcatggg ggactttgtg gaccgtggct tctatagcgt cgaaacgttc 421 ctcctgctgc tggcacttaa ggttcgctat cctgatcgca tcacactgat ccggggcaac 481 catgagagtc gccagatcac gcaggtctat ggcttctacg atgagtgcct gcgcaagtac 541 ggctcggtga ctgtgtggcg ctactgcact gagatctttg actacctcag cctgtcagcc 601 atcatcgatg gcaagatctt ctgcgtgcac gggggcctct ccccctccat ccagaccctg 661 gatcagattc ggacaatcga ccgaaagcaa gaggtgcctc atgatgggcc catgtgtgac 721 ctcctctggt ctgacccaga agacaccaca ggctggggcg tgagcccgcg cggagccggc 781 tacctatttg gcagtgacgt ggtggcccag ttcaacgcag ccaatgacat tgacatgatc 841 tgccgtgccc accaactggt gatggaaggt tacaagtggc acttcaatga gacggtgctc 901 actgtgtggt cggcacccaa ctactgctac cgctgtggga atgtggcagc catcttggag 961 ctggacgagc atctccagaa agatttcatc atctttgagg ctgctcccca agagacacgg 1021 ggcatcccct ccaagaagcc cgtggccgac tacttcctgt gaccccgccc ggcccctgcc 1081 ccctccaacc cttctggccc tcgcaccact gtgactctgc catcttcctc agacggaggc 1141 tgggggggct gtcctggctc tgctgtcccc caagagggtg ccttcgaggg tgaggacttc 1201 tctggagagg cctggagacc tagctccatg ttcctcctcc tctctcccca cttgaaccat 1261 gaagtttcca ataatttttt tttctttttt tccttctttt tctgtttgtt tttagataaa 1321 aatttttgag aaaaaaaatg aaaaattcta ataaaagaag aaaaatggta aaaaaaaaaa 1381 aa // LOCUS HSPQPROT 1412 bp RNA PRI 17-FEB-1997 DEFINITION H.sapiens mRNA for PQ-rich protein. ACCESSION Z50194 NID g929659 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1412) AUTHORS Wagner,F.F. and Flegel,W.A. TITLE A cDNA, which predicts a protein with PQ-rich repeats, isolated from a phage library of human fetal liver tissue JOURNAL Unpublished REFERENCE 2 (bases 1 to 1412) AUTHORS Flegel,W.A. TITLE Direct Submission JOURNAL Submitted (31-JUL-1995) Flegel W.A., Universitat Ulm, Transfusionsmedizin, Helmholtzstrasse 10, Ulm, Germany, D-89081 FEATURES Location/Qualifiers source 1..1412 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="liver" 5'UTR 1..159 /partial /citation=[1] CDS 160..1362 /function="unknown" /citation=[1] /codon_start=1 /product="PQ-rich protein" /db_xref="PID:g929660" /translation="MRRAPAAERLLELGFPPRCGRQEPPFPLGVTRGWGRWPIQKRRE GARPVPFSERSQEDGRGPAARSSGTLWRIRTRLSLCRDPEPPPPLCLLRVSLLCALRA GGRGSRWGEDGARLLLLPPARAAGNGEAEPSGGPSYAGRMLESSGCKALKEGVLEKRS DGLLQLWKKKCCILTEEGLLLIPPKQLQHQQQQQQQQQQQQQQPGQGPAEPSQPSGPA VASLEPPVKLKELHFSNMKTVDCVERKGKYMYFTVVMAEGKEIDFRCPQDQGWNAEIT LQMVQYKNRQAILAVKSTRQKQQHLVQQQPPSQPQPQPQLQPQPQPQPQPQPQPQSQP QPQPQPKPQPQQLHPYPHPHPHPHSHPHSHPHPHPHPHPHQIPHPHPQPHSQPHGHRL LRSTSNSA" variation 738 /replace="a" variation 757..759 /replace="" 3'UTR 1363..1412 BASE COUNT 278 a 489 c 457 g 188 t ORIGIN 1 ctccttgggc gatcgcctgg gtagaggaga ggagtttccg gggctcgggt ccgggtcgcc 61 ttccagggga acgagcgcgg aagcaagtgg gcggcgagag gcggagcaag agacgctgga 121 gggcgtggac gcagcgggct ttggaaaggc cccaagttaa tgaggcgtgc gccggctgcc 181 gagcgcctct tggagctggg ctttcccccg cggtgcgggc gccaggagcc gccttttccg 241 ctgggtgtca ctcgggggtg gggaagatgg cccattcaaa agcgccgcga gggggcccgg 301 ccagtgccct tcagtgagcg ctcgcaagag gacggcagag gcccggcagc tcggagctcc 361 gggaccttgt ggcgcatcag gacgcggctg tccctctgcc gggacccaga gccgccgccg 421 ccgctctgcc tcctgcgtgt tagcctcctc tgcgcgctcc gggcaggcgg ccgtgggagc 481 cgctggggcg aggacggcgc gaggctgctg ctgctgcccc cggcccgcgc ggctggaaac 541 ggagaggccg agccaagcgg cggcccctct tatgctggga ggatgctgga gagtagcggc 601 tgcaaagcgc tgaaggaggg cgtgctggag aagcgcagcg acgggttgtt gcagctctgg 661 aagaaaaagt gttgcatcct caccgaggaa gggctgctgc ttatcccgcc caagcagctg 721 caacaccagc agcagcagca acagcagcag cagcagcagc aacaacagcc cgggcagggg 781 ccggccgagc cgtcccaacc cagtggcccc gctgtcgcca gcctcgagcc gccggtcaag 841 ctcaaggaac tgcacttctc caacatgaag accgtggact gtgtggagcg caagggcaag 901 tacatgtact tcactgtggt gatggcagag ggcaaggaga tcgactttcg gtgcccgcaa 961 gaccagggct ggaacgccga gatcacgctg cagatggtgc agtacaagaa tcgtcaggcc 1021 atcctggcgg tcaaatccac gcggcagaag cagcagcacc tggtccagca gcagcccccc 1081 tcgcagccgc agccgcagcc gcagctccag ccccaacccc agcctcagcc tcagccgcaa 1141 ccccagcccc aatcacaacc ccagcctcag ccccaaccca agcctcagcc ccagcagctc 1201 cacccgtatc cgcatccaca tccacatcca cactctcatc ctcactcgca cccacaccct 1261 cacccgcacc cgcatccgca ccaaataccg cacccacacc cacagccgca ctcgcagccg 1321 cacgggcacc ggcttctccg cagcacctcc aactctgcct gaaaggggca gctcccgggc 1381 aagacaaggt tttgaggact tgaggaagtg gg // LOCUS HSPRAD1CY 4244 bp RNA PRI 18-JUN-1993 DEFINITION Human PRAD1 mRNA for cyclin. ACCESSION X59798 X59485 NID g35631 KEYWORDS cyclin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4244) AUTHORS Arnold,A. TITLE Direct Submission JOURNAL Submitted (13-MAY-1991) A. Arnold, Harvard Medical School, Massachusetts General Hospital, Boston MA 02114, USA REFERENCE 2 (bases 1 to 4244) AUTHORS Motokura,T., Bloom,T., Kim,H.G., Juppner,H., Ruderman,J.V., Kronenberg,H.M. and Arnold,A. TITLE A novel cyclin encoded by a bcl1-linked candidate oncogene JOURNAL Nature 350 (6318), 512-515 (1991) MEDLINE 91194766 REFERENCE 3 (bases 1 to 4244) AUTHORS Seto,M., Yamamoto,K., IIda,S., Akao,Y., Utsumi,K.R., Kubonishi,I., Miyoshi,I., Ohtsuki,T., Yawata,Y., Nanba,M., Motokura,T., Arnold,A., Takahashi,T. and Ueda,R. TITLE Gene rearrangement and overexpression of PRAD1 in lymphoid malignancy with t(11;14)(q13;q32) translocation JOURNAL Oncogene 7 (7), 1401-1406 (1992) MEDLINE 92319550 FEATURES Location/Qualifiers source 1..4244 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="human placental cDNA library (Clontech)" /chromosome="11" /map="11q13" 5'UTR 1..147 CDS 148..1035 /codon_start=1 /product="cyclin" /db_xref="PID:g35632" /db_xref="SWISS-PROT:P24385" /translation="MEHQLLCCEVETIRRAYPDANLLNDRVLRAMLKAEETCAPSVSY FKCVQKEVLPSMRKIVATWMLEVCEEQKCEEEVFPLAMNYLDRFLSLEPVKKSRLQLL GATCMFVASKMKETIPLTAEKLCIYTDNSIRPEELLQMELLLVNKLKWNLAAMTPHDF IEHFLSKMPEAEENKQIIRKHAQTFVALCATDVKFISNPPSMVAAGSVVAAVQGLNLR SPNNFLSYYRLTRFLSRVIKCDPDCLRACQEQIEALLESSLRQAQQNMDPKAAEEEEE EEEEVDLACTPTDVRDVDI" 3'UTR 1036..4244 polyA_signal 4209..4214 polyA_site 4228 BASE COUNT 1044 a 1023 c 1102 g 1073 t 2 others ORIGIN 1 ggcgcagtag cagcgagcag cagagtccgc acgctccggc gaggggcaga agagcgcgag 61 ggagcgcggg gcagcagaag cgagagccga gcgcggaccc agccaggacc cacagccctc 121 cccagctgcc caggaagagc cccagccatg gaacaccagc tcctgtgctg cgaagtggaa 181 accatccgcc gcgcgtaccc cgatgccaac ctcctcaacg accgggtgct gcgggccatg 241 ctgaaggcgg aggagacctg cgcgccctcg gtgtcctact tcaaatgtgt gcagaaggag 301 gtcctgccgt ccatgcggaa gatcgtcgcc acctggatgc tggaggtctg cgaggaacag 361 aagtgcgagg aggaggtctt cccgctggcc atgaactacc tggaccgctt cctgtcgctg 421 gagcccgtga aaaagagccg cctgcagctg ctgggggcca cttgcatgtt cgtggcctct 481 aagatgaagg agaccatccc cctgacggcc gagaagctgt gcatctacac cgacaactcc 541 atccggcccg aggagctgct gcaaatggag ctgctcctgg tgaacaagct caagtggaac 601 ctggccgcaa tgaccccgca cgatttcatt gaacacttcc tctccaaaat gccagaggcg 661 gaggagaaca aacagatcat ccgcaaacac gcgcagacct tcgttgccct ctgtgccaca 721 gatgtgaagt tcatttccaa tccgccctcc atggtggcag cggggagcgt ggtggccgca 781 gtgcaaggcc tgaacctgag gagccccaac aacttcctgt cctactaccg cctcacacgc 841 ttcctctcca gagtgatcaa gtgtgaccca gactgcctcc gggcctgcca ggagcagatc 901 gaagccctgc tggagtcaag cctgcgccag gcccagcaga acatggaccc caaggccgcc 961 gaggaggagg aagaggagga ggaggaggtg gacctggctt gcacacccac cgacgtgcgg 1021 gacgtggaca tctgagggcg ccaggcaggc gggcgccacc gccacccgca gcgagggcgg 1081 agccggcccc aggtgctcca ctgacagtcc ctcctctccg gagcattttg ataccagaag 1141 ggaaagcttc attctccttg ttgttggttg ttttttcctt tgctctttcc cccttccatc 1201 tctgacttaa gcaaaagaaa aagattaccc aaaaactgtc tttaaaagag agagagagaa 1261 aaaaaaaata gtatttgcat aaccctgagc ggtgggggag gagggttgtg ctacagatga 1321 tagaggattt tataccccaa taatcaactc gtttttatat taatgtactt gtttctctgt 1381 tgtaagaata ggcattaaca caaaggaggc gtctcgggag aggattaggt tccatccttt 1441 acgtgtttaa aaaaaagcat aaaaacattt taaaaacata gaaaaattca gcaaaccatt 1501 tttaaagtag aagagggttt taggtagaaa aacatattct tgtgcttttc ctgataaagc 1561 acagctgtag tggggttcta ggcatctctg tactttgctt gctcatatgc atgtagtcac 1621 tttataagtc attgtatgtt attatattcc gtaggtagat gtgtaacctc ttcaccttat 1681 tcatggctga agtcacctct tggttacagt agcgtagcgt ggccgtgtgc atgtcctttg 1741 cgcctgtgac caccacccca acaaaccatc cagtgacaaa ccatccagtg gaggtttgtc 1801 gggcaccagc cagcgtagca gggtcgggaa aggccacctg tcccactcct acgatacgct 1861 actataaaga gaagacgaaa tagtgacata atatattcta tttttatact cttcctattt 1921 ttgtagtgac ctgtttatga gatgctggtt ttctacccaa cggccctgca gccagctcac 1981 gtccaggttc aacccacagc tacttggttt gtgttcttct tcatattcta aaaccattcc 2041 atttccaagc actttcagtc caataggtgt aggaaatagc gctgtttttg ttgtgtgtgc 2101 agggagggca gttttctaat ggaatggttt gggaatatcc atgtacttgt ttgcaagcag 2161 gactttgagg caagtgtggg ccactgtggt ggcagtggag gtggggtgtt tgggaggctg 2221 cgtgccagtc aagaagaaaa aggtttgcat tctcacattg ccaggatgat aagttccttt 2281 ccttttcttt aaagaagttg aagtttagga atcctttggt gccaactggt gtttgaaagt 2341 agggacctca gaggtttacc tagagaacag gtggttttta agggttatct tagatgtttc 2401 acaccggaag gtttttaaac actaaaatat ataatttata gttaaggcta aaaagtatat 2461 ttattgcaga ggatgttcat aaggccagta tgatttataa atgcaatctc cccttgattt 2521 aaacacacag atacacacac acacacacac acacacacaa accttctgcc tttgatgtta 2581 cagatttaat acagtttatt tttaaagata gatcctttta taggtgagaa aaaaacaatc 2641 tggaagaaaa aaaccacaca aagacattga ttcagcctgt ttggcgtttc ccagagtcat 2701 ctgattggac aggcatgggt gcaaggaaaa ttagggtact caacctaagt tcggttccga 2761 tgaattctta tcccctgccc cttcctttaa aaaacttagt gacaaaatag acaatttgca 2821 catcttggct atgtaattct tgtaattttt atttaggaag tgttgaaggg aggtggcaag 2881 agtgtggagg ctgacgtgtg agggaggaca ggcgggagga ggtgtgagga ggaggctccc 2941 gaggggaagg ggcggtgccc acaccgggga caggccgcag ctccattttc ttattgcgct 3001 gctaccgttg acttccaggc acggtttgga aatattcaca tcgcttctgt gtatctcttt 3061 cacattgttt gctgctattg gaggatcagt tttttgtttt acaatgtcat atactgccat 3121 gtactagttt tagttttctc ttagaacatt gtattacaga tgcctttttt gtagtttttt 3181 ttttttttat gtgatcaatt ttgacttaat gtgattactg ctctattcca aaaaggttgc 3241 tgtttcacaa tacctcatgc ttcacttagc catggtggac ccagcgggca ggttctgcct 3301 gctttggcgg gcagacacgc gggcgcgatc ccacacaggc tggcgggggc cggccccgag 3361 gccgcgtgcg tgagaaccgc gccggtgtcc ccagagacca ggctgtgtcc ctcttctctt 3421 ccctgcgcct gtgatgctgg gcacttcatc tgatcggggg cgtagcatca tagtagtttt 3481 tacagctgtg ttatwctttg cgtgtagcta tggaagttgc ataattatta ttattattat 3541 tataacaagt gtgtcttacg tgccaccacg gcgttgtacc tgtaggactc tcattcggga 3601 tgattggaat agcttctgga atttgttcaa gttttgggta tgtttaatct gttatgtact 3661 agtgttctgt ttgttattgt tttgttaatt acaccataat gctaatttaa agagactcca 3721 aatctcaatg aagccagctc acagtgctgt gtgccccggt cacctagcaa gctgccgaac 3781 caaaagaatt tgcaccccgc tgcgggccca cgtggttggg gccctgccct ggcagggtca 3841 tcctgtgctc ggaggccatc tcgggcacag gcccaccccg ccccacccct ccagaacacg 3901 gctcacgctt acctcaacca tcctggctgc ggcgtctgtc tgaaccacgc gggggccttg 3961 agggacgctt tgtctgtcgt gatggggcaa gggcacaagt cctggatgtt gtgtgtrtcg 4021 agaggccaaa ggctggtggc aagtgcacgg ggcacagcgg agtctgtcct gtgacgcgca 4081 agtctgaggg tctgggcggc gggcggctgg gtctgtgcat ttctggttgc accgcggcgc 4141 ttcccagcac caacatgtaa ccggcatgtt tccagcagaa gacaaaaaga caaacatgaa 4201 agtctagaaa taaaactggt aaaaccccaa aaaaaaaaaa aaaa // LOCUS HSPRCOX 2415 bp RNA PRI 12-AUG-1997 DEFINITION H.sapiens mRNA for pristanoyl-CoA oxidase. ACCESSION Y11411 NID g2326548 KEYWORDS PRCOX gene; pristanoyl-CoA oxidase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2415) AUTHORS Vanhooren,J.C.T., Marynen,P., Mannaerts,G.P. and Van Veldhoven,P.P. TITLE Evidence for the existence of a pristanoyl-CoA oxidase gene in man JOURNAL Biochem. J. 325 (Pt 3), 593-599 (1997) MEDLINE 97373507 REFERENCE 2 (bases 1 to 2415) AUTHORS Van Veldhoven,P.P. TITLE Direct Submission JOURNAL Submitted (21-FEB-1997) P.P. Van Veldhoven, Catholic University Leuven, Faculty of Medicine, Dept. Pharmacology, Herestraat 49, 3000 Leuven, Belgium FEATURES Location/Qualifiers source 1..2415 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="4" /map="p15.3" /clone_lib="lambda gt11" /dev_stage="adult" gene 71..2173 /gene="PRCOX" CDS 71..2173 /gene="PRCOX" /note="peroxisomal beta-oxidation enzyme" /codon_start=1 /product="pristanoyl-CoA oxidase" /db_xref="PID:e321913" /db_xref="PID:g2326549" /translation="MASTVEGGDTALLPEFPRGPLDAYRARASFSWKDVALFTEGEGN VRFKKTIFSALENDPLFARSPGADLSLEKYRELNFLRCKRIFEYDFLSVEAMFKSPLK VPALIQCLGMYDSSLAAKYLLHSLVFGSAVYSSGSERHLTYIQKIFRMEIFGCFALTE LSHGSNTKAIRTTAHYDPATEEFIIHSPDFEAAKFWVGNMGKTATHAVVFAKLCVPGD QCHGLHPFIVQIRDPKTLLPMPGVMVGDIGKKLGQNGLDNGFAMFHKVRVPRQSLLNR MGDVTPEGTYVSPFKDVRQRFGASLGSLSSGRVSIVSLAILNLKLAVAIALRFSATRR QFGPTEEEEIPVLEYPMQQWRLLPYLAAVYGLDHFSKSLFLDLVELQRGLASGDRSAR QAELGREIHALASASKPLASWTTQQGIQECREACGGHGYLAMNRLGVLRDDNDPNCTY EGDNNILLQQTSNYLLGLLAHQVHDGACFRSPLKSVDFLDAYPGILDQKFEVSSVADC LDSAVALAAYKWLVCYLLRETYQKLNQEKRSGSSDFEARNKCQVSHGRPLALAFVELT VVQRFHEHVHQPSVPPSLRAVLGRLSALYALWSLKRHAALLYRGGYFSGEQAGEVLES AVLALCSQLKDDAVALVDVIAPPDFVLDSPIGRADGELYKNLWGAVLQESKVLERASW WPEFSVNKPVIGSLKSKL" BASE COUNT 514 a 671 c 683 g 547 t ORIGIN 1 cggatccttt cctgcttttg gtttccctgg caggggttga actgtggagt gtgtgggctc 61 ttatcacgcg atggcatcca ctgtggaagg aggcgacaca gctctgctcc cagaattccc 121 cagggggccc ctcgatgcct accgagcaag agcgtccttc agctggaagg acgtggcgct 181 gttcacggaa ggggagggca atgtccgctt taagaaaacc atcttctcag ctcttgagaa 241 tgaccctctt ttcgctcgtt cccctggagc cgacctgtcc ttggagaagt atcgcgagct 301 gaacttcctt cgatgcaagc ggatcttcga gtatgacttc ctcagtgtcg aagcaatgtt 361 caagagccct ctgaaggtcc ccgccttgat tcagtgcctg ggcatgtatg actcttctct 421 ggctgccaag tacctcctcc atagcttggt ttttggatca gcagtttaca gttctggttc 481 tgaaagacat ctcacatata ttcaaaagat cttcaggatg gagatttttg gatgttttgc 541 tctgaccgaa ttaagccacg gcagtaatac caaggccatt cgcacaactg cccactacga 601 tcctgccact gaggaattca tcatacattc ccctgatttc gaagctgcca agttttgggt 661 tggcaacatg ggcaagacag ccactcacgc ggtggtgttt gctaagctgt gtgtgccagg 721 ggaccagtgc catgggctgc atccctttat cgtgcagatc cgggacccga agacccttct 781 tcccatgcct ggagtgatgg ttggcgacat aggaaaaaaa ctcgggcaga acggtctaga 841 taatggtttc gccatgttcc acaaggtcag agttcctcgc cagagccttc tgaaccggat 901 gggagacgtc acccccgagg gcacctatgt cagccccttt aaggacgtca ggcagcgctt 961 tggagcgtcc ctggggagcc tgtcctcggg ccgggtctcc atcgtgagcc tggccatcct 1021 taacctaaag ctggccgtgg ccatcgctct tcgcttctca gccactcggc gtcagtttgg 1081 acccacagag gaggaggaaa taccagtgct tgagtatcca atgcagcaat ggcgcttgct 1141 tccatatctg gcagctgtct acggcttaga ccatttctcc aagtcgctct tcctggacct 1201 ggtggagctc cagcgaggac ttgcatcggg agaccgcagc gccagacagg cagagcttgg 1261 acgtgagatc cacgccctgg catcggccag caagcccctg gcctcgtgga ccacccagca 1321 aggaattcag gaatgccggg aggcgtgtgg aggacacggc tatctggcca tgaaccggtt 1381 gggtgtcctt agagatgaca acgatcccaa ctgcacatac gaaggtgaca acaacatcct 1441 gctgcagcag acaagcaact atttgctggg tctcctggca caccaggtcc acgatggagc 1501 ttgcttccgc agtccgctga agtcagtgga ctttctggac gcctatcccg gcatccttga 1561 ccagaagttt gaggtctcca gtgttgccga ctgcttggac tctgcagtcg ccctggcagc 1621 atacaagtgg ctggtttgct acctgctccg agagacttat caaaaattaa accaagagaa 1681 aagatcagga agcagtgact ttgaagcaag gaacaaatgc caggtgtccc acggccgtcc 1741 gttggcgctg gccttcgtgg agctcacggt ggtccagagg ttccacgagc acgtgcacca 1801 gccttccgtg ccgccctcgc tgcgggccgt gctggggcgg ctcagtgctc tgtacgccct 1861 gtggtccctg aagcgccacg cggccctgct ctaccgagga ggatacttct ccggtgagca 1921 ggcgggagaa gtgttggaga gcgccgtcct ggctttgtgt tcccagctga aagacgatgc 1981 agttgccctg gtagacgtga tcgctcctcc tgactttgtt ctggactcac cgattggcag 2041 agccgacggc gagctctaca aaaacctctg gggcgctgtc ctgcaggaaa gcaaggtgtt 2101 ggagcgggca tcctggtggc cagagttttc tgtgaacaaa cctgtcatag gaagtctgaa 2161 atcgaagctc tagtgggact ggcacattca gccaagtcta atgaaacgaa gggaactaat 2221 cagacgtgga cctcaacttc tgattccaga acacgccgga gattgctgct gctttctgag 2281 cccgcacctg tgcgcctaaa ctgctgattg gcctcaactt gcccaggcgg acgggaggga 2341 ggcacccggc cggctggacc taatctggga tcgcggtgat ttgcaccgtg gaaaagaaat 2401 tgcagattga tcatg // LOCUS HSPREC 5003 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA and promoter DNA for progesterone receptor. ACCESSION X51730 NID g35651 KEYWORDS hormone receptor; progesterone receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5003) AUTHORS Kastner,P. TITLE Direct Submission JOURNAL Submitted (16-FEB-1990) Kastner P., LGME/CNRS - Ul84/INSERM, 11 rue Humann, 67085 Strasbourg Cedex, France REFERENCE 2 (bases 1 to 5003) AUTHORS Kastner,P., Krust,A., Turcotte,B., Stropp,U., Tora,L., Gronemeyer,H. and Chambon,P. TITLE Two distinct estrogen-regulated promoters generate transcripts encoding the two functionally different human progesterone receptor forms A and B JOURNAL EMBO J. 9 (5), 1603-1614 (1990) MEDLINE 90228361 COMMENT See also . Bases 1-711 were derived from genomic DNA. FEATURES Location/Qualifiers source 1..5003 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T47D" /clone_lib="lambda-gt11" /chromosome="11" /map="q22" misc_feature 712 /note="beginning of mRNA sequence" CDS 1455..4256 /note="progesterone receptor (AA 1-933)" /codon_start=1 /db_xref="PID:g35652" /db_xref="SWISS-PROT:P06401" /translation="MTELKAKGPRAPHVAGGPPSPEVGSPLLCRPAAGPFPGSQTSDT LPEVSAIPISLDGLLFPRPCQGQDPSDEKTQDQQSLSDVEGAYSRAEATRGAGGSSSS PPEKDSGLLDSVLDTLLAPSGPGQSQPSPPACEVTSSWCLFGPELPEDPPAAPATQRV LSPLMSRSGCKVGDSSGTAAAHKVLPRGLSPARQLLLPASESPHWSGAPVKPSPQAAA VEVEEEDSSESEESAGPLLKGKPRALGGAAAGGGAAACPPGAAAGGVALVPKEDSRFS APRVALVEQDAPMAPGRSPLATTVMDFIHVPILPLNHALLAARTRQLLEDESYDGGAG AASAFAPPRTSPCASSTPVAVGDFPDCAYPPDAEPKDDAYPLYSDFQPPALKIKEEEE GAEASARSPRSYLVAGANPAAFPDFPLGPPPPLPPRATPSRPGEAAVTAAPASASVSS ASSSGSTLECILYKAEGAPPQQGPFAPPPCKAPGASGCLLPRDGLPSTSASAAAAGAA PALYPALGLNGLPQLGYQAAVLKEGLPQVYPPYLNYLRPDSEASQSPQYSFESLPQKI CLICGDEASGCHYGVLTCGSCKVFFKRAMEGQHNYLCAGRNDCIVDKIRRKNCPACRL RKCCQAGMVLGGRKFKKFNKVRVVRALDAVALPQPLGVPNESQALSQRFTFSPGQDIQ LIPPLINLLMSIEPDVIYAGHDNTKPDTSSSLLTSLNQLGERQLLSVVKWSKSLPGFR NLHIDDQITLIQYSWMSLMVFGLGWRSYKHVSGQMLYFAPDLILNEQRMKESSFYSLC LTMWQIPQEFVKLQVSQEEFLCMKVLLLLNTIPLEGLRSQTQFEEMRSSYIRELIKAI GLRQKGVVSSSQRFYQLTKLLDNLHDLVKQLHLYCLNTFIQSRALSVEFPEMMSEVIA AQLPKILAGMVKPLLFHKK" BASE COUNT 1233 a 1303 c 1240 g 1227 t ORIGIN 1 ggatccattt tataagctca aagataatta cttttcagac taagaatatt tagggtaaaa 61 agtactgttc aacatctcta ctgaggatgt tatgatgtag cacactctat aagctggagc 121 taaaggaaac tttccttaaa gtgctattta ctaaaaattg gaacacattc cttaagacaa 181 atcgaagtgt ggcacacaac atccaaactt ccatcataga tacagaggtg ttaccatctc 241 ccactcccaa atttctttgt cacgctgagg atactcaaga ggagcaggac atgttggtcg 301 cagcaggaga aacttgaaag cattcacttt tatggaactc ataagggaga gaatctctta 361 tttagtatcg tccttgatac atttattatt ttaaaagata atgtagccaa atgtcttcct 421 ctgtgttaaa tctttacaaa actgaaatct taaaatggtg acaaaaattc tacttctgat 481 agaatctatt catttttcca attagatagg gcataattct taatttgcaa aacaaaacgt 541 aatatgctta tgaggttcca tcccaaagaa cctgctattg agagtagcat tcagaataac 601 gggtggaaat gccaactcca gagtttcaga tcctaccggt aattggggta gggaggggct 661 ttgggcgggg cctccctaga ggaggaggcg ttgttagaaa gctgtctggc cagtccacag 721 ctgtcactaa tcggggtaag ccttgttgta tttgcgcgtg tgggtggcat tctcaatgag 781 aactagcttc acttgtcatt tgagtgaaat ctacaacccg aggcggctag tgctcccgca 841 ctactgggat ctgagatctt cggagatgac tgtcgcccgc agtacggagc cagcagaagt 901 ccgacccttc ctgggaatgg gctgtaccga gaggtccgac tagccccagg gttttagtga 961 gggggcagtg gaactcagcg agggactgag agcttcacag catgcacgag tttgatgcca 1021 gagaaaaagt cgggagataa aggagccgcg tgtcactaaa ttgccgtcgc agccgcagcc 1081 actcaagtgc cggacttgtg agtactctgc gtctccagtc ctcggacaga agttggagaa 1141 ctctcttgga gaactccccg agttaggaga cgagatctcc taacaattac tactttttct 1201 tgcgctcccc acttgccgct cgctgggaca aacgacagcc acagttcccc tgacgacagg 1261 atggaggcca agggcaggag ctgaccagcg ccgccctccc ccgcccccga cccaggaggt 1321 ggagatcctc cggtccagcc acattcaaca cccactttct cctccctctg cccctatatt 1381 cccgaaaccc cctcctcctt cccttttccc tcctccctgg agacggggga ggagaaaagg 1441 ggagtccagt cgtcatgact gagctgaagg caaagggtcc ccgggctccc cacgtggcgg 1501 gcggcccgcc ctcccccgag gtcggatccc cactgctgtg tcgcccagcc gcaggtccgt 1561 tcccggggag ccagacctcg gacaccttgc ctgaagtttc ggccatacct atctccctgg 1621 acgggctact cttccctcgg ccctgccagg gacaggaccc ctccgacgaa aagacgcagg 1681 accagcagtc gctgtcggac gtggagggcg catattccag agctgaagct acaaggggtg 1741 ctggaggcag cagttctagt cccccagaaa aggacagcgg actgctggac agtgtcttgg 1801 acactctgtt ggcgccctca ggtcccgggc agagccaacc cagccctccc gcctgcgagg 1861 tcaccagctc ttggtgcctg tttggccccg aacttcccga agatccaccg gctgcccccg 1921 ccacccagcg ggtgttgtcc ccgctcatga gccggtccgg gtgcaaggtt ggagacagct 1981 ccgggacggc agctgcccat aaagtgctgc cccggggcct gtcaccagcc cggcagctgc 2041 tgctcccggc ctctgagagc cctcactggt ccggggcccc agtgaagccg tctccgcagg 2101 ccgctgcggt ggaggttgag gaggaggata gctctgagtc cgaggagtct gcgggtccgc 2161 ttctgaaggg caaacctcgg gctctgggtg gcgcggcggc tggaggagga gccgcggctt 2221 gtccgccggg ggcggcagca ggaggcgtcg ccctggtccc caaggaagat tcccgcttct 2281 cagcgcccag ggtcgccctg gtggagcagg acgcgccgat ggcgcccggg cgctccccgc 2341 tggccaccac ggtgatggat ttcatccacg tgcctatcct gcctctcaat cacgccttat 2401 tggcagcccg cactcggcag ctgctggaag acgaaagtta cgacggcggg gccggggctg 2461 ccagcgcctt tgccccgccg cggacttcac cctgtgcctc gtccaccccg gtcgctgtag 2521 gcgacttccc cgactgcgcg tacccgcccg acgccgagcc caaggacgac gcgtaccctc 2581 tctatagcga cttccagccg cccgctctaa agataaagga ggaggaggaa ggcgcggagg 2641 cctccgcgcg ctccccgcgt tcctaccttg tggccggtgc caaccccgca gccttcccgg 2701 atttcccgtt ggggccaccg cccccgctgc cgccgcgagc gaccccatcc agacccgggg 2761 aagcggcggt gacggccgca cccgccagtg cctcagtctc gtctgcgtcc tcctcggggt 2821 cgaccctgga gtgcatcctg tacaaagcgg agggcgcgcc gccccagcag ggcccgttcg 2881 cgccgccgcc ctgcaaggcg ccgggcgcga gcggctgcct gctcccgcgg gacggcctgc 2941 cctccacctc cgcctctgcc gccgccgccg gggcggcccc cgcgctctac cctgcactcg 3001 gcctcaacgg gctcccgcag ctcggctacc aggccgccgt gctcaaggag ggcctgccgc 3061 aggtctaccc gccctatctc aactacctga ggccggattc agaagccagc cagagcccac 3121 aatacagctt cgagtcatta cctcagaaga tttgtttaat ctgtggggat gaagcatcag 3181 gctgtcatta tggtgtcctt acctgtggga gctgtaaggt cttctttaag agggcaatgg 3241 aagggcagca caactactta tgtgctggaa gaaatgactg catcgttgat aaaatccgca 3301 gaaaaaactg cccagcatgt cgccttagaa agtgctgtca ggctggcatg gtccttggag 3361 gtcgaaaatt taaaaagttc aataaagtca gagttgtgag agcactggat gctgttgctc 3421 tcccacagcc attgggcgtt ccaaatgaaa gccaagccct aagccagaga ttcacttttt 3481 caccaggtca agacatacag ttgattccac cactgatcaa cctgttaatg agcattgaac 3541 cagatgtgat ctatgcagga catgacaaca caaaacctga cacctccagt tctttgctga 3601 caagtcttaa tcaactaggc gagaggcaac ttctttcagt agtcaagtgg tctaaatcat 3661 tgccaggttt tcgaaactta catattgatg accagataac tctcattcag tattcttgga 3721 tgagcttaat ggtgtttggt ctaggatgga gatcctacaa acatgtcagt gggcagatgc 3781 tgtattttgc acctgatcta atactaaatg aacagcggat gaaagaatca tcattctatt 3841 cattatgcct taccatgtgg cagatcccac aggagtttgt caagcttcaa gttagccaag 3901 aagagttcct ctgtatgaaa gtattgttac ttcttaatac aattcctttg gaagggctac 3961 gaagtcaaac ccagtttgag gagatgaggt caagctacat tagagagctc atcaaggcaa 4021 ttggtttgag gcaaaaagga gttgtgtcga gctcacagcg tttctatcaa cttacaaaac 4081 ttcttgataa cttgcatgat cttgtcaaac agcttcatct gtactgcttg aatacattta 4141 tccagtcccg ggcactgagt gttgaatttc cagaaatgat gtctgaagtt attgctgcac 4201 aattacccaa gatattggca gggatggtga aaccccttct ctttcataaa aagtgaatgt 4261 catctttttc ttttaaagaa ttaaattttg tggcatgtct ttttgttttg gtcaggatta 4321 tgaggtcttg agtttttata atgttcttct gaaagcctta catttataac atcatagtgt 4381 gtaaatttaa aagaaaaatt gtgaggttct aattattttc ttttataaag tataattaga 4441 atgtttaact gttttgttta cccatatttt cttgaagaat ttacaagatt gaaaaagtac 4501 taaaattgtt aaagtaaact atatcttatc catattattt cataccatgt aggtgaggat 4561 ttttaacttt tgcatctaac aaatcatcga cttaagagaa aaaatcttac atgtaataac 4621 acaaagctat tatatgttat ttctaggtaa ctccctttgt gtcaattata tttccaaaaa 4681 tgaaccttta aaatggtatg caaaattttg tctatatata tttgtgtgag gaggaaattc 4741 ataactttcc tcagattttc aaaagtattt ttaatgcaaa aaatgtagaa agagtttaaa 4801 accactaaaa tagattgatg ttcttcaaac taggcaaaac aactcatatg ttaagaccat 4861 tttccagatt ggaaacacaa atctcttagg aagttaataa gtagattcat atcattatac 4921 aaatagtatt gtgggttttg taggttttta aaataacctt ttttggggag agaattgtcc 4981 tctaatgagg tattgcgagt ggc // LOCUS HSPRIM1 1399 bp RNA PRI 14-NOV-1994 DEFINITION H.sapiens mRNA for DNA primase (subunit p48). ACCESSION X74330 NID g510405 KEYWORDS DNA primase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1399) AUTHORS Nasheuer,H. TITLE Direct Submission JOURNAL Submitted (11-NOV-1994) H. Nasheuer, IMB d. LMU Muenchen, Wuermtalstr. 221, D-81375 Muenchen, Germany REFERENCE 2 (bases 1 to 1399) AUTHORS Stadlbauer,F., Brueckner,A., Rehfuess,C., Eckerskorn,C., Lottspeich,F., Forster,V., Tseng,B.Y. and Nasheuer,H.P. TITLE DNA replication in vitro by recombinant DNA-polymerase-alpha-primase JOURNAL Eur. J. Biochem. 222 (3), 781-793 (1994) MEDLINE 94298818 REFERENCE 3 (bases 1 to 1399) AUTHORS Nasheuer,H. TITLE Direct Submission JOURNAL Submitted (28-JUL-1993) H. Nasheuer, IMB d. LMU Muenchen, Karlstr.23, D-80333 Muenchen, Germany FEATURES Location/Qualifiers source 1..1399 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Human 293S from Dr. B.Stillman (NY,USA)" CDS 26..1288 /codon_start=1 /product="DNA primase (subunit p48)" /db_xref="PID:g510406" /translation="METFDPTELPELLKLYYRRLFPYSQYYRWLNYGGVIKNYFQHRE FSFTLKDDIYIRYQSFNNQSDLEKEMQKMNPYKIDIGAVYSHRPNQHNTVKLGAFQAQ EKELVFDIDMTDYDDVRRCCSSADICPKCWTLMTMAIRIIDRALKEDFGFKHRLWVYS GRRGVHCWVCDESVRKLSSAVRSGIVEYLSLVKGGQDVKKKVHLSEKIHPFIRKSINI IKKYFEEYALVNQDILENKESWDKILALVPETIHDELQQSFQKSHNSLQRWEHLKKVA SRYQNNIKNDKYGPWLEWEIMLQYCFPRLDINVSKGINHLLKSPFSVHPKTGRISVPI DLQKVDQFDPFTVPTISFICRELDAISTNEEEKEENEAESDVKHRTRDYKKTSLAPYV KVFEHFLENLDKSRKGELLKKSDLQKDF" BASE COUNT 469 a 269 c 289 g 372 t ORIGIN 1 cttaccgtgg cgagttccgc gctcaatgga gacgtttgac cccaccgagc tgcccgagct 61 gcttaaactt tattaccgga ggctctttcc ctactctcag tactatcgct ggctcaacta 121 cggtggagtg ataaagaatt actttcaaca ccgtgaattt tcattcacac tgaaagatga 181 tatttacatt cgctaccaat ccttcaacaa ccagagtgat ctggaaaagg agatgcagaa 241 aatgaatcca tacaagattg atataggcgc agtatattct cacagaccca atcaacacaa 301 tacagtgaag ctgggagctt tccaggctca ggaaaaagaa ctggtatttg acattgacat 361 gacagactac gacgatgtga ggagatgttg tagttctgca gacatatgtc ctaagtgctg 421 gaccctcatg acaatggcca tacgcatcat tgacagagca ttgaaggagg actttggatt 481 taagcatcgt ctctgggtat attctggaag gagaggtgtt cattgttggg tctgtgatga 541 atcagttaga aaactgtctt ctgcagtacg ttctgggata gttgagtatt tgagccttgt 601 aaagggtggt caagacgtta aaaagaaagt tcacctaagt gaaaaaattc acccttttat 661 cagaaaatct ataaacataa taaaaaaata ctttgaagaa tatgccttgg ttaatcaaga 721 tattctcgaa aataaagaaa gctgggataa gattttagcc cttgttcctg aaacaattca 781 tgatgaactt caacaaagct tccaaaagtc tcacaattca cttcagcgtt gggagcactt 841 gaagaaagta gccagcagat atcagaataa catcaaaaat gacaaatatg gaccctggct 901 ggagtgggag attatgctcc agtactgttt tccacggctg gatatcaatg tcagcaaagg 961 aatcaatcat ctactgaaga gcccttttag tgttcatcct aaaacaggtc gcatctctgt 1021 gcctattgat ttgcagaaag tggaccagtt tgatccattt actgttccga ccataagctt 1081 catctgccgt gaattggatg ccatttccac taatgaagag gaaaaagagg agaatgaagc 1141 tgaatctgat gtcaaacata gaaccagaga ttataagaag accagtctag caccttatgt 1201 gaaagttttt gaacattttc ttgaaaatct ggataaatcc cgaaaaggag aacttcttaa 1261 gaagagtgat ttacaaaaag atttctgaag acagagctcc tcaaaccatt gtggatatct 1321 tctgccttca accacagatc aaatacttca agagccattt aataaatatg gcagaactat 1381 aaagaaaaaa aaaaaaagg // LOCUS HSPRIM2 2306 bp RNA PRI 14-NOV-1994 DEFINITION H.sapiens mRNA for DNA primase (subunit p58). ACCESSION X74331 NID g510407 KEYWORDS DNA primase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2306) AUTHORS Nasheuer,H. TITLE Direct Submission JOURNAL Submitted (11-NOV-1994) H. Nasheuer, IMB d. LMU Muenchen, Wuermtalstr. 221, D-81375 Muenchen, Germany REFERENCE 2 (bases 1 to 2306) AUTHORS Stadlbauer,F., Brueckner,A., Rehfuess,C., Eckerskorn,C., Lottspeich,F., Forster,V., Tseng,B.Y. and Nasheuer,H.P. TITLE DNA replication in vitro by recombinant DNA-polymerase-alpha-primase JOURNAL Eur. J. Biochem. 222 (3), 781-793 (1994) MEDLINE 94298818 FEATURES Location/Qualifiers source 1..2306 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Human 293S from Dr. B.Stilliman (NY,USA)" CDS 88..1617 /codon_start=1 /product="DNA primase (p58 subunit)" /db_xref="PID:g510408" /translation="MEFSGRKRRKLRLAGDQRNASYPHCLQFYLQPPSENISLTEFEN LAIDRVKLLKSVENLGVSYVKGTEQYQSKLESELRKLKFSYREKLEDEYEPRRRDHIS HFILRLAYCQSEELRRWFIQQEMDLLRFRFSILPKDKIQDFLKDSQLQFEAISDEEKT LREQEIVASSPSLSGLKLGFESIYKIPFADALDLFRGRKVYLEDGFAYVPLKDIVAII LNEFRAKLSKALALTARSLPAVQSDERLQPLLNHLSHSYTGQDYSTQGNVGKISLDQI DLLSTKSFPPCMRQLHKALRENHHLRHGGRMQYGLFLKGIGLTLEQALQFWKQEFIKG KMDPDKFDKGYSYNIRHSFGKEGKRTDYTPFSCLKIILSNPPSQGDYHGCPFRHSDPE LLKQKLQSYKISPGGISQILDLVKGTHYQVACQKYFEMIHNVDDCGFSLNHPNQFFCE SQRILNGGKDIKKEPIQPETPQPKPSVQKTKDASSALASLNSSLEMDMEGLEDYFSED S" BASE COUNT 651 a 469 c 493 g 693 t ORIGIN 1 ggtttcatat gaactctccc gccacccggg aacagctggc tgccaccgtt tgtgttttcc 61 gagtttgtat tcttgcaggt gaccaagatg gagttttctg gaagaaagcg gaggaagctg 121 aggttggcag gtgaccagag gaatgcttcc taccctcatt gccttcagtt ttacttgcag 181 ccaccttctg aaaacatatc tttaacagaa tttgaaaact tggctattga tagagttaaa 241 ttgttaaaat cagttgaaaa tcttggagtg agctatgtga aaggaactga acaataccag 301 agtaagttgg agagtgagct tcggaagctc aagttttcct acagagagaa gctagaagat 361 gaatatgaac cacgaagaag agatcatatt tctcatttta ttttgcggct tgcttattgc 421 cagtctgaag aacttagacg ctggttcatt caacaagaaa tggatctcct tcgatttaga 481 tttagtattt tacccaagga taaaattcag gatttcttaa aggatagcca attgcagttt 541 gaggctataa gtgatgaaga gaagactctt cgagaacagg agattgttgc ctcatcacca 601 agtttaagtg gacttaagtt ggggttcgag tccatttata agatcccttt tgctgatgct 661 ctggatttgt ttcgaggaag gaaagtctat ttggaagatg gctttgctta cgtaccactt 721 aaggacattg tggcaatcat cctgaatgaa tttagagcca aactgtccaa ggctttggca 781 ttaacagcca ggtccttgcc tgctgtgcag tctgatgaaa gacttcagcc tctgctcaat 841 cacctcagtc attcctacac tggccaagat tacagtaccc agggaaatgt tgggaagatt 901 tctttagatc agattgattt gctttctacc aaatccttcc caccttgcat gcgtcagtta 961 cataaagcct tgcgggaaaa tcaccatctt cgtcatggag gccgaatgca gtatggccta 1021 tttctgaagg gcattggttt aactttggaa caggcattgc agttctggaa gcaagaattt 1081 atcaaaggaa agatggatcc agacaagttt gataaaggtt actcttacaa catccgtcac 1141 agctttggaa aggaaggcaa gaggacagac tatacacctt tcagttgcct gaagattatt 1201 ctgtccaatc caccaagcca aggggattat catgggtgcc cattccgtca cagtgatcca 1261 gagctgctga agcaaaagtt gcagtcatac aagatctctc ctggagggat aagccagatt 1321 ttggatttag taaaggggac acattaccag gtagcctgtc aaaaatactt tgagatgata 1381 cacaatgtgg atgattgtgg cttttctttg aatcatccta atcagttctt ttgtgagagc 1441 caacgtattc taaatggtgg taaagacata aagaaggaac ctatccaacc agaaactcct 1501 caacccaaac caagtgtcca gaaaaccaag gatgcatcat ctgctctggc ctctttaaat 1561 tcctctctgg aaatggatat ggaaggacta gaagattact ttagtgaaga ttcttaggca 1621 gttttataac cctttttcct caatagcctg tttcctgttt ttaagatttt gcctttgttg 1681 ttgaaaaagg gtttcactgt caccaaggct tagtgcagtg acacaattac agctgattgc 1741 agccttgacc ttcccagctc aagtgatcct cctacctcag cctcccaagt agttaggaca 1801 cacaggtgtg cacctcatat ccagataatt tttttcaatt tttttttgta gaggtggggg 1861 gtctccctat gttgcccagg cagatctcag actcctgggc tcaagcgatc ctcacacctc 1921 agcgtcccag agtgctggga ttacagttgt gagccactgt gcctggcctt tttttttttt 1981 taaccttttc gtttaacttc tctcttcact gcatcccaat ccatctacag gcatgcacac 2041 ttattaggaa aggaggtttg aggtaacaac agagactttc actatatttt gctttgacag 2101 aaggaaagag gaggagtttc tattaaaatc tgtcacttga gtgatgtcat ttaagtccta 2161 ttttaggaga taaaaacagc tttggggact ggttaaagtc ccccagaaac tacaataaag 2221 aacaactttt gttttaactc ttaatcactt tgtaattttg actcaatcct tttctggacc 2281 atttttgtta ataaatatca aagtgt // LOCUS HSPRO205 970 bp RNA PRI 12-AUG-1994 DEFINITION H.sapiens mRNA for prolactin (clone PRL205). ACCESSION X54393 NID g531102 KEYWORDS prolactin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 970) AUTHORS Hiraoka,Y. TITLE Direct Submission JOURNAL Submitted (26-JUL-1990) Hiraoka Y., Dept. of Microbiology, Keio University School of Medicine, 35 Shinanomachi, Shinijuku-ku, 160 Tokyo, Japan REFERENCE 2 (bases 1 to 970) AUTHORS Hiraoka,Y., Tatsumi,K., Shiozawa,M., Aiso,S., Fukasawa,T., Yasuda,K. and Miyai,K. TITLE A placenta-specific 5' non-coding exon of human prolactin JOURNAL Mol. Cell. Endocrinol. 75 (1), 71-80 (1991) MEDLINE 91267286 FEATURES Location/Qualifiers source 1..970 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="lambda gt11" /clone="lambda PRL205" CDS 143..166 /note="ORF 1" /codon_start=1 /db_xref="PID:g580411" /translation="MNIKGSP" CDS 163..825 /note="ORF 2" /codon_start=1 /product="prolactin" /db_xref="PID:g531103" /translation="MKGSLLLLLVSNLLLCQSVAPLPICPGGAARCQVTLRDLFDRAV VLSHYIHNLSSEMFSEFDKRYTHGRGFITKAINSCHTSSLATPEDKEQAQQMNQKDFL SLIVSILRSWNEPLYHLVTEVRGMQEAPEAILSKAVEIEEQTKRLLEGMELIVSQVHP ETKENEIYPVWSGLPSLQMADEESRLSAYYNLLHCLRRDSHKIDNYLKLLKCRIIHNN NC" mat_peptide 226..822 /product="prolactin" BASE COUNT 265 a 265 c 207 g 233 t ORIGIN 1 ccctcaaaga cagagacacc aagaagaatc ggaacataca ggctttgata tcaaaggttt 61 ataaagccaa tatctgggaa agagaaaacc gtgacttcca gatcttctct ggtgaagtgt 121 gtttcctgca acgatcacga acatgaacat caaaggatcg ccatgaaagg gtccctcctg 181 ctgctgctgg tgtcaaacct gctcctgtgc cagagcgtgg cccccttgcc catctgtccc 241 ggcggggctg cccgatgcca ggtgaccctt cgagacctgt ttgaccgcgc cgtcgtcctg 301 tcccactaca tccataacct ctcctcagaa atgttcagcg aattcgataa acggtatacc 361 catggccggg ggttcattac caaggccatc aacagctgcc acacttcttc ccttgccacc 421 cccgaagaca aggagcaagc ccaacagatg aatcaaaaag actttctgag cctgatagtc 481 agcatattgc gatcctggaa tgagcctctg tatcatctgg tcacggaagt acgtggtatg 541 caagaagccc cggaggctat cctatccaaa gctgtagaga ttgaggagca aaccaaacgg 601 cttctagagg gcatggagct gatagtcagc caggttcatc ctgaaaccaa agaaaatgag 661 atctaccctg tctggtcggg acttccatcc ctgcagatgg ctgatgaaga gtctcgcctt 721 tctgcttatt ataacctgct ccactgccta cgcagggatt cacataaaat cgacaattat 781 ctcaagctcc tgaagtgccg aatcatccac aacaacaact gctaagccca catccatttc 841 atctatttct gagaaggtcc ttaatgatcc gttccattgc aagcttcttt tagttgtatc 901 tcttttgaat ccatgcttgg gtgtaacagg tctcctctta aaaaataaaa actgactcct 961 tagagacatc // LOCUS HSPROGBIN 1941 bp RNA PRI 30-APR-1997 DEFINITION H.sapiens mRNA for putative progesterone binding protein. ACCESSION Y12711 NID g2062021 KEYWORDS progesterone binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1941) AUTHORS Falkenstein,E. JOURNAL Unpublished REFERENCE 2 (bases 1 to 1941) AUTHORS Falkenstein,E. TITLE Direct Submission JOURNAL Submitted (21-APR-1997) E. Falkenstein, University Heidelberg, Institute Clinical Pharmacology Mannheim, Klinikum Mannheim, Theodor-Kutzer-Ufer, 68165 Mannheim, FRG FEATURES Location/Qualifiers source 1..1941 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="human1" /tissue_type="liver" /clone_lib="pSPORT" CDS 52..639 /codon_start=1 /product="putative progesterone binding protein" /db_xref="PID:e314174" /db_xref="PID:g2062022" /translation="MAAEDVVATGADPSDLESGGLLHEIFTSPLNLLLLGLCIFLLYK IVRGDQPAASGDSDDDEPPPLPRLKRRDFTPAELRRFDGVQDPRILMAINGKVFDVTK GRKFYGPEGPYGVFAGRDASRGLATFCLDKEALKDEYDDLSDLTAAQQETLSDWESQF TFKYHHVGKLLKEGEEPTVYSDEEEPKDESARKND" polyA_site 1924 BASE COUNT 568 a 409 c 437 g 524 t 3 others ORIGIN 1 ggcgagttcc ggatccctgc ctagcgcggc ccaaccttta ctccagagat catggctgcc 61 gaggatgtgg tggcgactgg cgccgaccca agcgatctgg agagcggcgg gctgctgcat 121 gagattttca cgtcgccgct caacctgctg ctgcttggcc tctgcatctt cctgctctac 181 aagatcgtgc gcggggacca gccggcggcc agcggcgaca gcgacgacga cgagccgccc 241 cctctgcccc gcctcaagcg gcgcgacttc acccccgccg agctgcggcg cttcgacggc 301 gtccaggacc cgcgcatact catggccatc aacggcaagg tgttcgatgt gaccaaaggc 361 cgcaaattct acgggcccga ggggccgtat ggggtctttg ctggaagaga tgcatccagg 421 ggccttgcca cattttgcct ggataaggaa gcactgaagg atgagtacga tgacctttct 481 gacctcactg ctgcccagca ggagactctg agtgactggg agtctcagtt cactttcaag 541 tatcatcacg tgggcaaact gctgaaggag ggggaggagc ccactgtgta ctcagatgag 601 gaagaaccaa aagatgagag tgcccggaaa aatgattaaa gcattcagtg gaagtatatc 661 tatttttgta ttttgcaaaa tcatttgtaa cagtccactc tgtctttaaa acatagtgat 721 tacaatattt agaaagtttt gagcacttgc tataagtttt ttaattaaca tcactagtga 781 cactaataaa attaacttct tagaatgaaa aagaaaaaaa aaagggcggc cgctctagag 841 gatccctcga gggmcccaag cmmacgcgtg catgcgacgt cacatgatgt gtttgtgtgt 901 cacaaatcca gaaagtgaac tgcagtgctg taatacacat gttaatactg tttttcttct 961 atctgtagtt agtacaggat gaatttaaat gtgtttttcc tgagagacaa ggaagacttg 1021 ggtatttccc aaaacaggta aaaatcttaa atgtgcacca agagcaaagg atcaactttt 1081 agtcatgatg ttctgtaaag acaacaaatc cctttttttt tctcaattga cttaactgca 1141 tgatttctgt tttatctacc tctaaagcaa atctgcagtg ttccaaagac tttggtatgg 1201 attaagcgct gtccagtaac aaaatgaaat ctcaaaacag agctcagctg caaaaaagca 1261 tattttctgt gtttctggac tgcactgttg tccttgccct cacatagaca ctcagacacc 1321 ctcacaaaca cagtagtcta tagttaggat taaaatagga tctgaacatt caaaagaaag 1381 ctttggaaaa aaagagctgg ctggcctaaa aacctaaata tatgatgaag attgtaggac 1441 tgtcttccca agccccatgt tcatggtggg gcaatggtta tttggttatt ttactcaatt 1501 ggttactctc atttgaaatg agggagggac atacagaata ggaacaggtg tttgctctcc 1561 taagagcctt catgcacacc cctgaaccac gaggaaacag tacagtcgct agtcaagtgg 1621 tttttaaagt aaagtatatt cataaggtaa cagttattct gttgttataa aactataccc 1681 actgcaaaag tagtagtcaa gtgtctaggt ctttgatatt gctcttttgg ttaacactaa 1741 gcttaagtag actatacagt tgtatgaatt tgtaaaagta tatgaacacc tagtgagatt 1801 tcaaacttgt aattgtggtt aaatagtcat tgtattttct tgtgaactgt gttttatgat 1861 tttacctcaa atcagaaaac aaaatgatgt gctttggtca gttaataaaa atggttttac 1921 ccacaaaaaa aaaaaaaaaa a // LOCUS HSPROKINX 3018 bp RNA PRI 01-JUN-1995 DEFINITION H.sapiens mRNA for Ndr protein kinase. ACCESSION Z35102 NID g854169 KEYWORDS protein kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3018) AUTHORS Millward,T. TITLE Direct Submission JOURNAL Submitted (06-JUL-1994) Thomas Millward, Friedrich Miescher-Institut, Basel, Postfach, 2543, Switzerland REFERENCE 2 (bases 1 to 3018) AUTHORS Millward,T., Cron,P. and Hemmings,B.A. TITLE Molecular cloning and characterization of a conserved nuclear serine(threonine) protein kinase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (11), 5022-5026 (1995) MEDLINE 95281588 FEATURES Location/Qualifiers source 1..3018 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="human Ndr protein kinase" /clone_lib="Human fetal brain/Stratagene 936206" CDS 596..1993 /codon_start=1 /product="Ndr protein kinase" /db_xref="PID:g854170" /translation="MAMTGSTPCSSMSNHTKERVTMTKVTLENFYSNLIAQHEEREMR QKKLEKVMEEEGLKDEEKRLRRSAHARKETEFLRLKRTRLGLEDFESLKVIGRGAFGE VRLVQKKDTGHVYAMKILRKADMLEKEQVGHIRAERDILVEADSLWVVKMFYSFQDKL NLYLIMEFLPGGDMMTLLMKKDTLTEEETQFYIAETVLAIDSIHQLGFIHRDIKPDNL LLDSKGHVKLSDFGLCTGLKKAHRTEFYRNLNHSLPSDFTFQNMNSKRKAETWKRNRR QLAFSTVGTPDYIAPEVFMQTGYNKLCDWWSLGVIMYEMLIGYPPFCSETPQETYKKV MNWKETLTFPPEVPISEKAKDLILRFCCEWEHRIGAPGVEEIKSNSFFEGVDWEHIRE RPAAISIEIKSIDDTSNFDEFPESDILKPTVATSNHPETDYKNKDWVFINYTYKRFEG LTARGAIPSYMKAAK" BASE COUNT 908 a 640 c 672 g 798 t ORIGIN 1 gaattccggg ccaggcatgg tagcgcatcg ctgtaatccc agctactcgg gaaactgagg 61 tgggagaatc gattgaacct ggaagtggag gttgcggtga gccaagatca tcctgtcgca 121 ctccagcctg ggcaacaaga gcgaaactcc atctcaaaaa gaaaaaaaaa gatatatatg 181 tgtgacttac aggtacaggt aaagttgctt ctggttttct ggttgttgca tggtatttcc 241 tatgcagcca caggtcttta ttttcttact taagtgcctc caacttccca taacacaaat 301 taaggcatga tgaacatcct ctctgtgctg aacatcctgt gtatgtcact tcagaagcct 361 gtgtgacggt ttctttagtc tttataccta ggggtgggat ttctgggtca taggacagta 421 atttatattt atttcactaa gtattctctt tctctggctt ttgttacata ttacctgttt 481 gtcctccaga aaacttgcac caatttacat tcctaccaat agggtaggag agtgcacaat 541 gggtggattc taactccaaa tctaacacct cttcttttct ttgtttctag cagccatggc 601 aatgacaggc tcaacacctt gctcatccat gagtaaccac acaaaggaaa gggtgacaat 661 gaccaaagtg acactggaga atttttatag caaccttatc gctcaacatg aagaacgaga 721 aatgagacaa aagaagttag aaaaggtgat ggaagaagaa ggcctaaaag atgaggagaa 781 acgactccgg agatcagcac atgctcggaa ggaaacagag tttcttcgtt tgaagagaac 841 aagacttgga ttggaagatt ttgagtcctt aaaagtaata ggcagaggag catttggtga 901 ggtacggctt gttcagaaga aagatacggg acatgtgtat gcaatgaaaa tactccgtaa 961 agcagatatg cttgaaaaag agcaggttgg ccacattcgt gcggagcgtg acattctagt 1021 ggaggcagac agtttgtggg ttgtgaaaat gttctatagt tttcaggata agctaaacct 1081 ctacctaatc atggagttcc tgcctggagg ggacatgatg accttgttga tgaaaaaaga 1141 cactctgaca gaagaggaga ctcagtttta tatagcagaa acagtattag ccatagactc 1201 tattcaccaa cttggattca tccacagaga catcaaacca gacaaccttc ttttggacag 1261 caagggccat gtgaaacttt ctgactttgg tctttgcaca ggactgaaaa aagcacatag 1321 gacagaattt tataggaatc tgaaccacag cctccccagt gatttcactt tccagaacat 1381 gaattccaaa aggaaagcag aaacctggaa aagaaataga cgtcagctag ccttctccac 1441 agtaggcact cctgactaca ttgctcctga ggtgttcatg cagaccgggt acaacaagct 1501 ctgtgattgg tggtcgcttg gggtgatcat gtatgagatg ctcatcggct acccaccttt 1561 ctgttctgag acccctcaag agacatataa gaaggtgatg aactggaaag aaactttgac 1621 ttttcctcca gaagttccca tctctgagaa agccaaggat ctaattttga ggttctgctg 1681 tgaatgggaa catagaattg gagctcctgg agttgaggaa ataaaaagta actctttttt 1741 tgaaggcgtt gactgggaac atatcagaga gagacctgct gcaatatcta ttgaaatcaa 1801 aagcattgat gatacctcaa acttcgatga gtttccagaa tctgatattc ttaagccaac 1861 agtggccaca agtaatcatc ctgagactga ctacaagaac aaagactggg tcttcatcaa 1921 ttacacgtac aagcgctttg agggcctgac tgcaaggggg gcaatacctt cctacatgaa 1981 agcagcaaaa tagtactctt gccacggaat cctatgtgga gcagagttct ttgtataaca 2041 tcatgctttt cctctcacac tcttgaagag cttccaagaa gttgatggaa cccaccaata 2101 tgtcatagta aagtctcctg aaatgtggta gtaagaggat tttcttccat aatgcatctg 2161 aaaaactgta aacaaagaca accatttcta ctacgtcggc cataaacagc tatcctgctt 2221 tggaagagaa gcatcatgag ccaatttgat aggtgtttta aaaataactt gagttttcct 2281 aagttcatca gaatgaaggg gaaaaacagc catcatccaa cattattgag attgtcgtgt 2341 atagtcatcg aatatcagcc agttcctgta attttgtgac acgctctctg ccaagcccac 2401 caagtatttc ctttatagct aaaagttcca tagtactaag gaaataaagc aataaagaca 2461 gtctcagcag ccaggattct ggctgaagga aatgatccgc caccctgagg gtggtgatgg 2521 tagtttctac ccatacctca gcctcaggcg agtggcttat agcctccatt catggtgcac 2581 tttatttatg gtactaagat aaagactgtc aatccattga tttatctcct cctgtccccc 2641 atctaaaata cccatgctgc ttttctgagt gttgatgggg gttaccagct tgatccactg 2701 ttgctcttag aaggcccaga aagtctttgg gcattgcaag aaatcccgaa ttatgtggaa 2761 aaccctcact ttctcttcac ggctgtacca gaaaatccct aagacagatc ttgccgtgga 2821 ctagcaatac ctgcaagtgc tgccaatggg aactcaattt attcctggga acctaacgag 2881 gagagcccag gcctaggcag gaggcctgga accctcttgg ctaaggtgct gttcctgttc 2941 ctgcaaggtc tccagaaccc ctttggaaat ggtgaaggaa ccagcccaat agaagtacag 3001 agccagctga cggaattc // LOCUS HSPROOLIG 2562 bp RNA PRI 16-DEC-1994 DEFINITION H.sapiens mRNA for prolyl oligopeptidase. ACCESSION X74496 NID g558595 KEYWORDS prolyl oligopeptidase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2562) AUTHORS Vanhoof,G. TITLE Direct Submission JOURNAL Submitted (09-AUG-1993) G. Vanhoof, University of Antwerp, Universiteitsplein 1, 2610 Wilrijk, BELGIUM REFERENCE 2 (bases 1 to 2291) AUTHORS Vanhoof,G., Goossens,F., Hendriks,L., De Meester,I., Hendriks,D., Vriend,G., Van Broeckhoven,C. and Scharpe,S. TITLE Cloning and sequence analysis of the gene encoding human lymphocyte prolyl endopeptidase JOURNAL Gene 149 (2), 363-366 (1994) MEDLINE 95047504 FEATURES Location/Qualifiers source 1..2562 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lymphocytes" /tissue_type="blood" CDS 1..2133 /codon_start=1 /product="prolyl oligopeptidase" /db_xref="PID:g558596" /db_xref="SWISS-PROT:P48147" /translation="MLSFQYPDVYRDETAVQDYHGHKICDPYAWLEDPDSEQTKAFVE AQNKITVPFLEQCPIRGLYKERMTELYDYPKYSCHFKKGKRYFYFYNTGLQNQRVLYV QDSLEGEARVFLDPNILSDDGTVALRGYAFSEDGEYFAYGLSASGSDWVTIKFMKVDG AKELPDVLERVKFSCMAWTHDGKGMFYNSYPQQDGKSDGTETSTNLHQKLYYHVLGTD QSEDILCAEFPDEPKWMGGAELSDDGRYVLLSIREGCDPVNRLWYCDLQQESSGIAGI LKWVKLIDNFEGEYDYVTNEGTVFTFKTNRQSPNYRVINIDFWDPEESKWKVLVPEHE KDVLEWIACVRSNFLVLCYLHDVKNILQLHDLTTGALLKTFPLDVGSIVGYSGQKKDT EIFYQFTSFLSPGIIYHCDLTKEELEPRVFREVTVKGIDASDYQTVQIFYPSKDGTKI PMFIVHKKGIKLDGSHPAFLYGYGGFNISITPNYSVSRLIFVRHMGGILAVANIRGGG EYGETWHKGGILANKQNCFDDFQCAAEYLIKEGYTSPKRLTINGGSNGGLLVAACANQ RPDLFGCVIAQVGVMDMLKFHKYTIGHAWTTDYGCSDSKQHFEWLVKYSPLHNVKLPE ADDIQYPSMLLLTADHDDRVVPLHSLKFIATLQYIVGRSRKQSNPLLIHVDTKAGHGA GKPTAKVIEEVSDMFAFIARCLNVDWIP" misc_feature 1660..1662 /note="active site serine" polyA_signal 2490..2495 polyA_signal 2537..2542 BASE COUNT 701 a 560 c 609 g 692 t ORIGIN 1 atgctgtcct tccagtaccc cgacgtgtac cgcgacgaga ccgccgtaca ggattatcat 61 ggtcataaaa tttgtgaccc ttacgcctgg cttgaagacc ccgacagtga acagactaag 121 gcctttgtgg aggcccagaa taagattact gtgccatttc ttgagcagtg tcccatcaga 181 ggtttataca aagagagaat gactgaacta tatgattatc ccaagtatag ttgccacttc 241 aagaaaggaa aacggtattt ttatttttac aatacaggtt tgcagaacca gcgagtatta 301 tatgtacagg attccttaga gggggaggcc agagtgttcc tggaccccaa catactgtct 361 gacgatggca cagtggcact ccgaggttat gcgttcagcg aagatggtga atattttgcc 421 tatggtctga gtgccagtgg ctcagactgg gtgacaatca agttcatgaa agttgatggt 481 gccaaagagc ttccagatgt gcttgaaaga gtcaagttca gctgtatggc ctggacccat 541 gatgggaagg gaatgttcta caactcatac cctcaacagg atggaaaaag tgatggcaca 601 gagacatcta ccaatctcca ccaaaagctc tactaccatg tcttgggaac cgatcagtca 661 gaagatattt tgtgtgctga gtttcctgat gaacctaaat ggatgggtgg agctgagtta 721 tctgatgatg gccgctatgt cttgttatca ataagggaag gatgtgatcc agtaaaccga 781 ctctggtact gtgacctaca gcaggaatcc agtggcatcg cgggaatcct gaagtgggta 841 aaactgattg acaactttga aggggaatat gactacgtga ccaatgaggg gacggtgttc 901 acattcaaga cgaatcgcca gtctcccaac tatcgcgtga tcaacattga cttctgggat 961 cctgaagagt ctaagtggaa agtacttgtt cctgagcatg agaaagatgt cttagaatgg 1021 atagcttgtg tcaggtccaa cttcttggtc ttatgctacc tccatgacgt caagaacatt 1081 ctgcagctcc atgacctgac tactggtgct ctccttaaga ccttcccgct cgatgtcggc 1141 agcattgtag ggtacagcgg tcagaagaag gacactgaaa tcttctatca gtttacttcc 1201 tttttatctc caggtatcat ttatcactgt gatcttacca aagaggagct ggagccaaga 1261 gttttccgag aggtgaccgt aaaaggaatt gatgcttctg attaccagac agtccagatt 1321 ttctacccta gcaaggatgg tacgaagatt ccaatgttca ttgtgcataa aaaaggcata 1381 aaattggatg gctctcatcc agctttctta tatggctatg gcggcttcaa catatccatc 1441 acacccaact acagtgtttc caggcttatt tttgtgagac acatgggtgg tatcctggca 1501 gtggccaaca tcagaggagg tggcgaatat ggagagacgt ggcataaagg tggtatcttg 1561 gccaacaaac aaaactgctt tgatgacttt cagtgtgctg ctgagtatct gatcaaggaa 1621 ggttacacat ctcccaagag gctgactatt aatggaggtt caaatggagg cctcttagtg 1681 gctgcttgtg caaatcagag acctgacctc tttggttgtg ttattgccca agttggagta 1741 atggacatgc tgaagtttca taaatatacc atcggccatg cttggaccac tgattatggg 1801 tgctcggaca gcaaacaaca ctttgaatgg cttgtcaaat actctccatt gcataatgtg 1861 aagttaccag aagcagatga catccagtac ccgtccatgc tgctcctcac tgctgaccat 1921 gatgaccgcg tggtcccgct tcactccctg aagttcattg ccacccttca gtacatcgtg 1981 ggccgcagca ggaagcaaag caaccccctg cttatccacg tggacaccaa ggcgggccac 2041 ggggcgggga agcccacagc caaagtgata gaggaagtct cagacatgtt tgcgttcatc 2101 gcgcggtgcc tgaatgtcga ctggattcca taaacagttt tcgtgcttcc tcctgacagc 2161 gacagaaaac ctcaagggct ttcccacgtt gacaccaaga aaccactggg cataatgctt 2221 ccccacggga acattattcc tgcactcaca ggctacagtt gaacagaact gccgtgggaa 2281 ttttatcttt tttaggcttc tcctttttag caaggccttg gtgtttcttt ttccaccctg 2341 tctaggcaca tgtggttttt tggtgttttt tttaagggca tgttgggata aatagctaaa 2401 tggcaacaaa cacattgtga atattagatt gctgaattaa ggatcatagt cgggcatact 2461 tatttatatc cataacctct atatctttaa ataaatgtga gaactgttct catggagaag 2521 acttctttgc aacaataata aatgttattt aagaatgaaa aa // LOCUS HSPROS27 979 bp RNA PRI 23-JUL-1993 DEFINITION H.sapiens PROS-27 mRNA. ACCESSION X59417 S56931 NID g35681 KEYWORDS PROS-27 gene; prosomal consensus; prosomal protein; RNA-binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 979) AUTHORS Scherrer,K. TITLE Direct Submission JOURNAL Submitted (08-MAY-1991) K. Scherrer, Inst Jacques Monod-CNRS, Universite' Paris 7, 2 Place Jussieu, 75251 Paris Cedex 05, FRANCE REFERENCE 2 (bases 1 to 979) AUTHORS Bey,F., Silva Pereira,I., Coux,O., Viegas-Pequignot,E., Recillas Targa,F., Nothwang,H.G., Dutrillaux,B. and Scherrer,K. TITLE The prosomal RNA-binding protein p27K is a member of the alpha-type human prosomal gene family JOURNAL Mol. Gen. Genet. 237 (1-2), 193-205 (1993) MEDLINE 93204895 FEATURES Location/Qualifiers source 1..979 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HELA" /clone_lib="LAMBDA gt11" /chromosome="14" /map="14q13" gene 63..803 /gene="pros-27" CDS 63..803 /gene="pros-27" /codon_start=1 /product="prosomal P27K protein" /db_xref="PID:g35682" /db_xref="SWISS-PROT:P34062" /translation="MSRGSSAGFDRHITIFSPEGRLYQVEYAFKAINQGGLTSVAVRG KDCAVIVTQKKVPDKLLDSSTVTHLFKITENIGCVMTGMTADSRSQVQRARYEAANWK YKYGYEIPVDMLCKRIADISQVYTQNAEMRPLGCCMILIGIDEEQGPQVYKCDPAGYY CGFKATAAGVKQTESTSFLEKKVKKKFDWTFEQTVETAITCLSTVLSIDFKPSEIEVG VVTVENPKFRILTEAEIDAHLVALAERD" BASE COUNT 303 a 192 c 229 g 255 t ORIGIN 1 cggtgcctgg tgcgggagct acggggccca gggattgtgt ttaaagtagt gcttctacca 61 acatgtcccg tggttccagc gccggttttg accgccacat taccattttt tcacccgagg 121 gtcggctcta ccaagtagaa tatgctttta aggctattaa ccagggtggc cttacatcag 181 tagctgtcag agggaaagac tgtgcagtaa ttgtcacaca gaagaaagta cctgacaaat 241 tattggattc cagcacagtg actcacttat tcaagataac tgaaaacatt ggttgtgtga 301 tgaccggaat gacagctgac agcagatccc aggtacagag ggcacgctat gaggcagcta 361 actggaaata caagtatggc tatgagattc ctgtggacat gctgtgtaaa agaattgccg 421 atatttctca ggtctacaca cagaatgctg aaatgaggcc tcttggttgt tgtatgattt 481 taattggtat agatgaagag caaggccctc aggtatataa gtgtgatcct gcaggttact 541 actgtgggtt taaagccact gcagcgggag ttaaacaaac tgagtcaacc agcttccttg 601 aaaaaaaagt gaagaagaaa tttgattgga catttgaaca gacagtggaa actgcaatta 661 catgcctgtc tactgttcta tcaattgatt tcaaaccttc agaaatagaa gttggagtag 721 tgacagttga aaatcctaaa ttcaggattc ttacagaagc agagattgat gctcaccttg 781 ttgctctagc agagagagac taaacattgt cgttagttta ccagatccgt gatgccactt 841 acctgtgtgt ttggtaacaa caaacaaaca tcatggaggt ccctggattg aaaaaggagc 901 ctctcccact cctcctacca ccgaagtggt taggactcta tataaataaa aacaaggctt 961 ttggaaaaaa aaaaaaaaa // LOCUS HSPROSAP 2403 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for prostatic acid phosphatase (EC 3.1.3.2). ACCESSION X53605 NID g35683 KEYWORDS acid phosphatase; prostatic acid phosphatase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2403) AUTHORS Patel,P.C. TITLE Direct Submission JOURNAL Submitted (22-JUN-1990) Patel P.C., Institut Armand-Frappier, 531, boulevard des Prairies, Laval, Quebec H7V 1B7, Canada REFERENCE 2 (bases 1 to 2403) AUTHORS Tailor,P.G., Govindan,M.V. and Patel,P.C. TITLE Nucleotide sequence of human prostatic acid phosphatase determined from a full-length cDNA clone JOURNAL Nucleic Acids Res. 18 (16), 4928 (1990) MEDLINE 90370491 COMMENT See also and . FEATURES Location/Qualifiers source 1..2403 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="prostate" /clone_lib="human prostate lambda gt11" /clone="PAP1-1B" /chromosome="chromosome 3" CDS 37..1197 /note="acid phosphatase (AA 1 - 386)" /codon_start=1 /db_xref="PID:g35684" /db_xref="SWISS-PROT:P15309" /translation="MRAAPLLLARAASLAFASCFCFFCWLDRSVLAKELKFVTLVFRH GHRSPIDTFPTDPIKESSWPQRIWPTHPAGMEQHYELGEYIRKRYRKFLNESYKHEQV YIRSTDVDRTLMSAMTNLAALFPPEGVSIWNPILLWEPIPVHTVPLSEDQLLYLRFRN CPRFQELESETLKSEEFQKRLHPYKDFIATLGKLSGLHGQDLFGIWSKVYDPLYCESV HNFTLPSWATEDTMTKLRELSELSLLSLYGIHKQKEKSRLQGGVLVNEILNHMKRATQ IPSYKKLIMYSAHDTTVSGLQMALDVYNGLLPPYASCHLTELYFEKGEYFVEMYYRNE TQHEPYPLMLPGCSPSCPLERFAELVGPVIPQDWSTECMTTNSHQGTENSTD" misc_feature 2058..2403 /note="alu sequence" misc_feature 2353..2358 /note="polyA signal" misc_feature 2357..2362 /note="alt. polyA signal" misc_feature 2361..2366 /note="alt. polyA signal" misc_feature 2365..2370 /note="alt. polyA signal" misc_feature 2369..2374 /note="alt. polyA signal" misc_feature 2373..2378 /note="alt. polyA signal" misc_feature 2377..2382 /note="alt. polyA signal" misc_feature 2381..2386 /note="alt. polyA signal" misc_feature 2385..2390 /note="alt. polyA signal" misc_feature 2389..2394 /note="alt. polyA signal" BASE COUNT 710 a 522 c 544 g 627 t ORIGIN 1 gctcctaact cctggccaga aacagctctc ctcaacatga gagctgcacc cctcctcctg 61 gccagggcag caagcttagc ctttgcttct tgtttctgct ttttttgctg gctagaccga 121 agtgtactag ccaaggagtt gaagtttgtg actttggtgt ttcggcatgg acaccgaagt 181 cccattgaca cctttcccac tgaccccata aaggaatcct catggccaca aaggatttgg 241 ccaactcacc cagctggcat ggagcagcat tatgaacttg gagagtatat aagaaagaga 301 tatagaaaat tcttgaatga gtcctataaa catgaacagg tttatattcg aagcacagac 361 gttgaccgga ctttgatgag tgctatgaca aacctggcag ccctgtttcc cccagaaggt 421 gtcagcatct ggaatcctat cctactctgg gagcccatcc cggtgcacac agttcctctt 481 tctgaagatc agttgctata cctgcgtttc aggaactgcc ctcgttttca agaacttgag 541 agtgagactt tgaaatcaga ggaattccag aagaggctgc acccttataa ggattttata 601 gctaccttgg gaaaactttc aggattacat ggccaggacc tttttggaat ttggagtaaa 661 gtctacgacc ctttatattg tgagagtgtt cacaatttca ctttaccctc ctgggccact 721 gaggacacca tgactaagtt gagagaattg tcagaattgt ccctcctgtc cctctatgga 781 attcacaagc agaaagagaa atctaggctc caagggggtg tcctggtcaa tgaaatcctc 841 aatcacatga agagagcaac tcagatacca agctacaaaa aactcatcat gtattctgcg 901 catgacacta ctgtgagtgg cctacagatg gcgctagatg tttacaacgg actccttcct 961 ccctatgctt cttgccactt gacggaattg tactttgaga agggggagta ctttgtggag 1021 atgtactacc ggaatgagac gcagcacgag ccgtatcccc tcatgctacc tggatgcagc 1081 cccagctgtc ctctggagag gtttgctgag ctggttggcc ctgtgatccc tcaagactgg 1141 tccacggagt gtatgaccac aaacagccat caaggtactg agaacagtac agattagtgt 1201 gcacagagat ctctgtagaa ggagtagctg ccctttctca gggcagatga tgctttgaga 1261 acgtactttg gccattaccc cccagctttg aggaaaatgg gctttggatg attattttat 1321 gttttaggga cccccaacct caggcaattc ctacctcttc acctgaccct gcccccactt 1381 gccataaaac ttagctaagt tttgttttgt ttttcagcgt taatgtaaag gggcagcagt 1441 gccaaaatat aatcagagat aaagcttagg tcaaagttca tagagttccc atgaactata 1501 tgactggcca cacaggatct tttgtattta aggattctga gattttgctt gagcaggatt 1561 agacaagcct gttctttaaa ttgctgaaat ggaacagatt tcaaaaaaaa cgcccacaat 1621 ctagggtggg aacaaggaag gaaagatgtg aataggctga tgggcaaaaa accaatttac 1681 ccatcagttc cagccttctc tcaaggagag gcaaagaaag gagatacagt ggagacatct 1741 ggaaagtttt ctccactgga aaactgctac tatctgtttt tatatttctg ttaaaatata 1801 tgaggctaca gaactaaaaa ttaaaacctc ttggtgtccc ttggtcctgg aacatttatg 1861 ttccttttaa agaaacaaaa atcaaacttt acagaaagat ttgatgtatg taatacatat 1921 agcagctctt gaagtatata tatcatagca aataagtcat ctgatgagaa caagctattt 1981 gggcacaaca catcaggaaa gagagcacca cgtgatggag tttctccaga agctccagtg 2041 ataagaagtg ttgactctaa agttgattta agggcaggca tggtggttta cgcctataat 2101 cccagcattt tgggagtccg aggtgggcag atcacttgag ctcaggaggt caagatcagc 2161 ctgggcaaca tggtgaaacc ttggctctac ataaaataca aaaacttaga tgggcatggt 2221 ggtgtgtgcc tatagtccac tacttgtggg gctaaggcag gaggatcact tgagccccgg 2281 aggtcgaggc tacagtgagc caagagtgca ctactgtact ccagccaggg caagagagcg 2341 agaccctgtc tcaataaata aataaataaa taaataaata aataaataaa taaaaaaaaa 2401 aaa // LOCUS HSPROSCHY 13863 bp DNA PRI 21-JUL-1997 DEFINITION H.sapiens genes for proteasome-like subunit (MECL-1), chymotrypsin-like protease (CTRL-1) and protein serine kinase (PSK-H1) last exon. ACCESSION X71874 NID g406226 KEYWORDS CpG island; gene cluster; proteasome subunit; serine protease; serine protein kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 13863) AUTHORS Larsen,F. TITLE Direct Submission JOURNAL Submitted (10-MAY-1993) F. Larsen, Biotechnology Centre of Oslo, University of Oslo, PO Box 1125 Blindern, N0317 Oslo, NORWAY REFERENCE 2 (bases 1 to 13863) AUTHORS Larsen,F., Solheim,J., Kristensen,T., Kolsto,A.B. and Prydz,H. TITLE A tight cluster of five unrelated human genes on chromosome 16q22.1 JOURNAL Hum. Mol. Genet. 2 (10), 1589-1595 (1993) MEDLINE 94093544 REFERENCE 3 (bases 1 to 13863) AUTHORS Mastroianni,N., De Fusco,M., Zollo,M., Arrigo,G., Zuffardi,O., Bettinelli,A., Ballabio,A. and Casari,G. TITLE Molecular cloning, expression pattern, and chromosomal localization of the human Na-Cl thiazide-sensitive cotransporter (SLC12A3) JOURNAL Genomics 35 (3), 486-493 (1996) MEDLINE 97001149 FEATURES Location/Qualifiers source 1..13863 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="cosmid COS202" /clone="cosODIN" /chromosome="16" /map="q22.1" /cell_type="leukocytes" repeat_unit 123..423 /rpt_family="Alu" repeat_unit 513..805 /rpt_family="Alu" RBS 566..571 /note="putative Sp1" repeat_unit 820..1131 /rpt_family="Alu" repeat_unit 1134..1296 /rpt_family="Alu" repeat_unit 1350..1482 /rpt_family="Alu" RBS 1576..1581 /note="putative Sp1" misc_feature 2251..3970 /note="CpG island" TATA_signal 2267..2272 RBS 2308..2313 /note="putative Sp1" exon <2310..2400 /number=1 CDS join(2345..2400,2609..2696,2782..2879,2991..3131, 3395..3510,3615..3673,4145..4296,4422..4533) /codon_start=1 /product="proteasome-like subunit MECL-1" /db_xref="PID:g406227" /db_xref="SWISS-PROT:P40306" /translation="MLKPALEPRGGFSFENCQRNASLERVLPGLKVPHARKTGTTIAG LVFQDGVILGADTRATNDSVVADKSCEKIHFIAPKIYCCGAGVAADAEMTTRMVASKM ELHALSTGREPRVATVTRILRQTLFRYQGHVGASLIVGGVDLTGPQLYGVHPHGSYSR LPFTALGSGQDAALAVLEDRFQPNMTLEAAQGLLVEAVTAGILGDLGSGGNVDACVIT KTGAKLLRTLSSPTEPVKRSGRYHFVPGTTAVLTQTVKPLTLELVEETVQAMEVE" intron 2401..2608 /number=1 RBS 2419..2424 /note="putative Sp1" exon 2609..2696 /number=2 intron 2697..2781 /number=2 exon 2782..2879 /number=3 intron 2880..2990 /number=3 exon 2991..3131 /number=4 intron 3132..3394 /number=4 RBS 3208..3212 /note="putative Sp1" RBS 3376..3381 /note="putative Sp1" exon 3395..3510 /number=5 RBS 3428..3433 /note="putative Sp1" intron 3511..3614 /number=5 exon 3615..3673 /number=6 intron 3674..4144 /number=6 exon 4145..4296 /number=7 intron 4297..4421 /number=7 polyA_signal 4363..4568 exon 4422..>4568 /number=8 repeat_unit 5560..6000 /rpt_family="Alu" RBS 5942..5948 /note="putative Sp1" repeat_unit 6037..6326 /rpt_family="Alu" TATA_signal 7202..7207 prim_transcript 7232..9523 /note="C" prim_transcript 7232..9484 /note="B" prim_transcript 7232..9220 /note="A2" prim_transcript 7232..9216 /note="A1" prim_transcript 7232..9210 /note="A3" exon 7232..7293 /number=1 sig_peptide join(7242..7293,7893..7894) CDS join(7242..7293,7893..7996,8095..8174,8287..8368, 8506..8686,8774..8907,8998..9159) /codon_start=1 /product="chymotrypsin-like protease CTRL-1" /db_xref="PID:g406228" /db_xref="SWISS-PROT:P40313" /translation="MLLLSLTLSLVLLGSSWGCGIPAIKPALSFSQRIVNGENAVLGS WPWQVSLQDSSGFHFCGGSLISQSWVVTAAHCNVSPGRHFVVLGEYDRSSNAEPLQVL SVSRAITHPSWNSTTMNNDVTLLKLASPAQYTTRISPVCLASSNEALTEGLTCVTTGW GRLSGVGNVTPAHLQQVALPLVTVNQCRQYWGSSITDSMICAGGAGASSCQGDSGGPL VCQKGNTWVLIGIVSWGTKNCNVRAPAVYTRVSKFSTWINQVIAYN" sig_peptide join(7293,7893..7894) intron 7294..7892 exon 7893..7996 /number=2 misc_feature 7895..7939 /note="activation peptide" mat_peptide join(7940..7996,8095..8174,8287..8368,8506..8686, 8774..8907,8998..9156) /product="chymotrypsin-like protease CTRL-1" misc_feature 7940..7996 /label=st1 intron 7997..8094 /number=2 exon 8095..8174 /number=3 intron 8175..8286 /number=3 exon 8287..8368 /number=4 intron 8369..8505 /number=4 exon 8506..8686 /number=5 intron 8687..8773 /number=5 exon 8774..8907 /number=6 intron 8908..8997 /number=6 exon 8998..9484 /note="number 7B" exon 8998..9220 /note="number 7A2" exon 8998..9216 /note="number 7A1" exon 8998..9210 /note="number 7A3" exon 8998..9523 /note="number 7C" polyA_signal 9190..9195 /note="number A" precursor_RNA complement(9415..13863) misc_feature 9415..9523 /note="overlapping transcription units" polyA_signal 9462..9467 /note="number B" polyA_signal 9507..9512 /note="number C" RBS 10295..10300 /note="putative Sp1" RBS 10704..10709 /note="putative Sp1" gene complement(11448..11763) /gene="PSKH1" exon complement(11448..11763) /gene="PSKH1" /number=2 /product="protein serine kinase" RBS 11899..11904 /note="putative Sp1" repeat_unit 13574..>13863 /rpt_family="Alu" BASE COUNT 3188 a 3846 c 4065 g 2761 t 3 others ORIGIN 1 aagcttagga agcacaagag gctgagcctt tcaggtcagc aaagacttcc cagaggaggc 61 agtgcctaca ctgaggtcag agtgacaaga agagtaatgg accactgtaa agacttgggt 121 tcggccgggc gcggtggctc acgcctgtaa tcccagcact ttgggaggcc gaggcgggtg 181 gatcatgagg tcaggagatc gagaccatcc tggctaacaa ggtgaaaccc cgtctctact 241 aaaaatacag aaaattagcc gggcgcggtg gcgggcgcct gtggtcccag ctactcggga 301 ggctgaggca ggagaatggc gtgaacccgg gaagcggagc ttgcagtgag ccgagattgc 361 gccactgcag tccgcagtcc ggcctgggcg acagagcgag actccgtctc aaaaaaaaaa 421 aaagacttgg gtttgacttg attgagccca ggagttcgag acaagcctgg gcaatatagt 481 gagacctcat ctctacaaaa attttaaaaa ttagcctggt gcggtggctc atgcctgtaa 541 tcccagcact ctgggaggcc gaggtgggcg gatcacttga ggtcagaagt ttgagaccac 601 cctgaccaac atggagaaac cccgtctcta ctaaaaatac aaaattagcc gggcatggtg 661 gcgcatgcct gtaatcccag ctactcggga ggctgaggca ggagaattgt ttgaacctgg 721 gaggtggacg ttgcggtgag ccaagatcac actattgcac tccagcctgg gcaacaagag 781 caaaactccg tctcaaaaaa aaaaatttat ttttaaatta gccaggtgta gccacagctg 841 tagtcaaatc tactaggcag gctgaggtgg gaggattgct tgaacctggg aggcagaggt 901 tgcagtgagc caagatggtg ccacggcatt ccagcctgag caacagcaag accctgtgtc 961 caaaaaaaaa aaaaaaaaaa accgtaaaat aggccaggca cagtggttca tggttataag 1021 cctagcactt tggaaggctg aggagggtgg atcgcctgag ctcaggagtt caagaccagc 1081 ctgggcaaca cggtgaaacc ccatctctac caaaaaaaaa aaaaaaaaaa attagccagg 1141 catggtggtg tgtgcctgtg gtcccagcta ctcaggaggc tgaggtggaa gagtgcttgt 1201 gcctgggagg cagaggttcc agtgaaccga gatcacacca ttgtactcca gcctgggcaa 1261 cagagtaaga ccccatctca aaaaaaaaaa aaaaaattaa gataaaccct ttggcagctg 1321 cgtgctgctc ttagcctcaa acccaagtct tttttttccc cctttgagac ggggtctatt 1381 gcccaggctg gagtgcaatg gtatgatcca tactcactgc agccccgaac tcctgggctt 1441 ccaaagtgct gggattacag gtgtgagcca ccaggcccag actgctgaag ggtttaaacc 1501 agagaaagaa tgtgaccaga tttccaattt agaaagaccc gctctctgca gggtaaggag 1561 agcctggggg tccgggggcg gggggcaaga attgcaaggt aaccagggag gccagtgcaa 1621 tgtccaggtg ggagaggatg ctagctgaga ctagaagtgc taggaaaagg atgtgtgcag 1681 acaagaggtc actggggagg tgaaataaca aggcttggcc atgagtggaa cccaacaccc 1741 atggtgccct cttgagagag ggaagatggc acctgagatg gaagatggaa agaccagggt 1801 ccctgtgact gaggactgag cctctgtttg aggtttttgc agaggagtaa aggcaacaaa 1861 agaggcaaga gttggaagaa aggtgacaag gaacaaaagt cagctatgcc tgatgctact 1921 gggtggccag caacaatgct gacttggcca aggctctgag agctttacta tgctgggact 1981 ggaggtcaga gttgaggcta gggtaagagc aaggggctca gagatggagg gggaggagga 2041 cctgaacaag tccagaaggg aagagatttg tccctctatc caacagagta cccagtgagc 2101 agcacagagg gcacagcaag ggacatcacc cggttcccca aatgctcaga gccacaagtg 2161 aagccaaaag tgaaagacaa gatgcagaaa accgccacgg gcctttgagg aagggtaaag 2221 gcgaaagcga aagcaggaag tacagacgtg aagcctagca gaggactttt tagctgctca 2281 ctggccccgc ttgtctggcc gactcatccg cccgcgaccc ctaatcccct ctgcctgccc 2341 caagatgctg aagccagccc tggagccccg agggggcttc tccttcgaga actgccaaag 2401 gtgaagcggg ggcgcggggg gcggtcactc ctgagccgcc tctgcttgct cgtggccttt 2461 tttcctggct gggggtgggg gagggtgtgt tggtcgactt gggttccagg cttaccccgg 2521 aagatgaggg agacggggac caggttaggg gaagcaacag gggtcttgaa agcagagccg 2581 aaacatgggc gccctcctcc gtttccagaa atgcatcatt ggaacgcgtc ctcccggggc 2641 tcaaggtccc tcacgcacgc aagaccggga ccaccatcgc gggcctggtg ttccaagtga 2701 gcagcgggga gggacgggga gctggagggg agccgagagt atcgagcagg cactgaagct 2761 gcggtccctc cctctcctca ggacggggtc attctgggcg ccgatacgcg agccactaac 2821 gattcggtcg tggcggacaa gagctgcgag aagatccact tcatcgcccc caaaatctag 2881 tgagactccc gagcccagtt cccgtacgca aaaaagaacg gccccctcgt tcccactccg 2941 gtccccgcac gtcccagccc tgcccacacc gatcctccct tttgcctcag ctgctgtggg 3001 gctggagtag ccgcggacgc cgagatgacc acacggatgg tggcgtccaa gatggagcta 3061 cacgcgttat ctacgggccg cgagccccgc gtggccacgg tcactcgcat cctgcgccag 3121 acgctcttca ggtgcggggg cagggctaac aggaccccgg caggtagttt acggggttgg 3181 ggccattgga aggcgggaca gaaagaaggg cgggaccgcg acgggccagg tgaccggaag 3241 aggccggccc aagagaacct gggctacagg aaaaggcgat gtcagtcatc gggcgccagc 3301 ccacaggaag gagcggggat agcacctagg agctgggcat agagaggtgg gcctaggccc 3361 cagcttgtgg ccgaccccgc ccatcctcga gcaggtacca gggccacgtg ggtgcatcgc 3421 tgatcgtggg cggcgtagac ctgactggac cgcagctcta cggtgtgcat ccccatggct 3481 cctacagccg tctgcccttc acagccctgg gtgagcgctt ctgtcccttc tcctcgaact 3541 ctgcccctgg tgaccttggc ctcactccaa acggcgtcgc agcggttgac ttcagatgct 3601 tctcctgcct tcaggctctg gtcaggacgc ggccctggcg gtgctagaag accggttcca 3661 gccgaacatg acggtgagcg gcctctgtcc ccgactttgt ggtcgctggt gggatgtgca 3721 cccgggagct gggggagcac aggaccctgg cccagtgcgg gtggctaagg cttgtcggag 3781 gaggtgacca ctgaagggtg agtggagtaa gggcagagaa gtgcggtccc gacataacac 3841 cgtccaatac caaagcctgc acggctggga gaagtcgaag ctcacagagg atctttagga 3901 gccgagggcg gagagaagga ccagtagggt cctacttata tcaacgtctg gagcctagat 3961 tttgtttggg gtgggatgga agcaggtgat gttgcctcag aggtggctaa ggctcagagg 4021 gagaaacaca gtgggggttt ggagggcaag accagattgg gtaagtggac aggcaagtcc 4081 ccaggctgta gcctaagtta acagcagaga gagcccgtta ggtctcacac acccatcacc 4141 gcagctggag gctgctcagg ggctgctggt ggaagccgtc accgccggga tcttgggtga 4201 cctgggctcc gggggcaatg tggacgcatg tgtgatcaca aagactggcg ccaagctgct 4261 gcggacactg agctcaccca cagagcccgt gaagaggtga gagctggaga tcggggacca 4321 cagggatgtg tggggctata gcaggggaga tagggggctg caaaaagggg atgggccaca 4381 tgacaggccc atgttcagag gctgtccctc ctccctccca ggtctggccg ctaccacttt 4441 gtgcctggaa ccacagctgt cctgacccag acagtgaagc cactaaccct ggagctagtg 4501 gaggaaactg tgcaggctat ggaggtggag taagctgagg cttagagctt ggaacaaggg 4561 ggaataaacc cagaaaatac agttaaacag atggctgtgt cattcttgag tggaatgggg 4621 tgggcaggca gccagcaggg ctctgtagct aaggcgtccc tgcaggggcc attacctacc 4681 atagctctag tgtctggcct aagagatgcc cttcacccat aacctcaggc acctacaact 4741 ccagaacccc agccctggcc agcattgcag gcttggtctc cacccaaacc ttccttctga 4801 ctccacactt gaaggctccc ccaccactcc actgtcttgc tcttgccctc tagtccactg 4861 ggagacttgt aaattatgaa ataccccatg tactaccccc tcctagagac tttccatggc 4921 tcctcagtgg cccaggacaa gctcatacct ttcaatcagg cccccacagg ccccactgag 4981 ggctaaagtg ctgacaagag gagccgctcc ctgactccaa ggcaagttct caccaagcac 5041 tcctcaacct cgcaacatct ttacctgtga caccccttag atgacgaggc atgcctgcac 5101 tgctcacgtg aagctcgtct tctgtctgca catgctgggc ttgtgactcc aagttttcca 5161 ggctaataag ggtcacagga ctcacatggg gagagatgac acgtttctcc aacaaacctt 5221 tgctgggccc ctgctgagtc tcaggcctgg ctgctgggtg ccagcaagag catcctgtcc 5281 tcagcgagaa cggctgaact ccgctggagc ttcagaaatg tcagggagag tctacccagg 5341 gcccagggag ggtctatgcc gggctgcaca tccccaggct gctgagtgtg ctccctgcac 5401 cccaacattc tattaatgaa catttgtaaa tgtaacagaa aagtagaaag agttgtatat 5461 tgaataccct tatactgtca ggtcaccaca gacctgacag tattttgtta tatttgtttt 5521 atcatctatt catccctcta tccattaatt catcgctcct tttttttttt tttttttttt 5581 tttgagacgg cgtctcgctc tgtcacccag gctctggagt gcaaatattt tgttatattt 5641 gttttatcat ctattcatcc ctctatccat taattcatcg ctcctttttt tttttttttt 5701 tttgagacgg agtctcgctc tgtcacccag gctctggagt gcagtggcgc aatctcagct 5761 cactggaagc tccgcctccc aggttcacgc cattctcctg cctcagcctc ccgagtagct 5821 gggactacag gtgcccgcca ccacgcgcgg ctaatttttt tttttttttg tatttttagt 5881 agagacgagg ttctactgaa cctgttagcc aggatggtct ttgatctcct gacctcatga 5941 tccgcccgcg tcggcctccc aaagtgctgg gattacaggc gtgagccacc gtgcccagcc 6001 aattcatctc attttttggc tgatgctgtt tctttgagat ggggtctagc tccatcgccc 6061 aggccggaat gcagtggtgc actcatggct cactgcagcc ttgaacttaa gggctcaagt 6121 gatccctcct gcctcagcct tctgagttgc tgggactaca ggtgtgtacc atcataccca 6181 gcacatttct taatttaaaa aaattttttt tgtagagaca gggtttcatg atgttgctca 6241 ggctggtctc gaactcctgg aatcaagcct cctacgtctg cctcccaaag ttttgggatt 6301 acaggtgtga gccaccacac ccagccctga tctgttcttg aatcagttaa agccctcaca 6361 ctcccagaag gccgccagcc aatgcacctg ttggaacttt gcacacaggg tgtcttctcc 6421 cttcaagctt ggtctgcagc tcagtaacaa atgggctaca gacaccaggg gcttgcccat 6481 gggagcccca aggcctaaag agggtggcag agatttgatg tctgtcactc tccacctgca 6541 gcctcagtcc acggtcggcc aggcaccaag agctcacact ttgccctcct aaatgccagg 6601 cccttcataa gtatcatctc attgttaaga gcggaggctt cagcgccaga caaatgcgag 6661 tttgcgtaca actcaaccac gtgctggtgg gagagtcacc atctctgagc agacctgtga 6721 ctcctgttcc aaatggacga ggaaccactg cgatgatgtg ttaggactcc cagcctgcca 6781 gaacctcaca gcccctggcc cttcacagca aagttgaccg cagtgagcat tccatccacc 6841 agtcagaaca ccctggacgc tgagcggacc ttctctgaaa gcctggtgcc tttgttagcc 6901 ctgggtgact cctgtgatcc cagccaccag gttgtcacta tagacctaat ttaaccatct 6961 gtcctcagta ccgagggctc aacatttgga atgggaggtg gttctgggag ccaattagag 7021 gccaggcttt gggaggtggc agaggtgagt ctcacacctt gggctctgtc tgataagtct 7081 aggtctcggt caggggacct tggcctaaag ggcctgtctt gcctggagcg tgggaggggg 7141 ctgagtctac acagctggcc tggcctcagg cctggagctt tagctcaagg acgagaagac 7201 ccataaagcc agacccagct cccaacctca catctgccac gatgttgctg ctcagcctga 7261 ccctaagcct ggttctcctc ggctcctcct ggggtgagtg ggccaggacc agccctgatt 7321 cagccctggg agcaactcag ctcccagcaa cagcccaggg aaggagctag gctggctgga 7381 agggacgaag gtggacagag tgggtaaaag aaacaggata tgccagggca gtggagcagg 7441 gaacagtcct gcagggctgg gagggggcaa gaggtggggt ggtctcacaa ataggaccag 7501 agattgagcc aggccctgga gcccgggagg gtttaggaag ctgagacagg aagacctgtc 7561 catgtctttt agaaagaacc ttctggctgc atgaagggta tgaactgttc aggtcgggag 7621 ggggcagaga gaccaggggt agagatgggg aacagcgggg actaggctgg agacagatgt 7681 aggagaacag cagggctggg ggactgggtg gatagggata accaagatag ctgtggggcc 7741 cgaaggtgct tgcatgtacc ctgttgggga aggggtagtg ctgtaccctc tcgacagacc 7801 tctctggggt gcacagcctg gggcacccaa aaggaggtgg ggaaagatgg gctgaggcat 7861 gggaagcagg tcctcattag cccaatggcc aggctgcggc attcctgcca tcaaaccggc 7921 actgagcttc agccagagga ttgtcaacgg ggagaatgca gtgttgggct cctggccctg 7981 gcaggtgtcc ctgcaggtac accaccagag gggtgggcag ggtcctgggt acgtcatgcc 8041 taggggcagc ctcagcagcc catccccact ctgacctctg agccctgacc acaggacagc 8101 agcggcttcc acttctgcgg tggttctctc atcagccagt cctgggtggt cactgctgcc 8161 cactgcaatg tcaggtgagt gcctgcattc cacctgcccc gcccctcgcc tcttcctgcc 8221 tcctcccctg gctgtccccc tctcgcgctg gcctccctgc agctgcctaa tcccaccccc 8281 ttgcagccct ggccgccatt ttgttgtcct gggcgagtat gaccgatcat caaacgcaga 8341 gcccttgcag gttctgtccg tctctcgggt gagtgcctgg gctgcagaca cggaggaaaa 8401 gtgggcagtg caggtgggtg ggtgctggga acgaggaatt caggacatgc cctggcctac 8461 cctgctcagc acccatcaga acatggactg tttctgaccc cacaggccat tacacaccct 8521 agctggaact ctaccaccat gaacaatgac gtgacgctgc tgaagctcgc ctcgccagcc 8581 cagtacacaa cacgcatctc gccagtttgc ctggcatcct caaacgaggc tctgactgaa 8641 ggcctcacgt gtgtcaccac cggctggggt cgcctcagtg gcgtgggtag ggactcaggc 8701 caaagctcag ggtgggagga ctggggtggg gacagtgttc tgggccccat gtgaccaccc 8761 ctcctggcca caggcaatgt gacaccagca catctgcagc aggtggcttt gcccctggtc 8821 actgtgaatc agtgccggca gtactggggc tcaagtatca ctgactccat gatctgtgca 8881 ggtggcgcag gtgcctcctc gtgccaggta agccccagca cccgctcctc tgcgctgtcc 8941 tagtggtata cctccccaac cccccctact caattctccc tccctcttcc ctctcagggt 9001 gactccggag gccctcttgt ctgccagaag ggaaacacat gggtgcttat tggtattgtc 9061 tcctggggca ccaaaaactg caatgtgcgc gcacctgctg tgtatactcg agttagcaag 9121 ttcagcacct ggatcaacca ggtcatagcc tacaactgag ctcaccacag gccctcccca 9181 gctcaaccca ttaaagaccc aggccctgtc ccatcatgca ttcatgtctg tcttcctggc 9241 tcaggagaaa gaagaggctg ttgagggtcc gactccctac ttggacttct ggcacagaag 9301 gggctgagtg actccttgag tagcagtggc tcttcctaga gtagccatgc cgaggccggg 9361 gcccccaccc ctcctccagg gcaacccctt ggtcctacag caagaagcca gaactgttgg 9421 aatgaatggc agccctccct ggagaggcag cctgtttact gaatacagag gatacgttta 9481 caaactgaat acgcataata aataactgca cattctccat ccacaggcca tggcatgaag 9541 gcccaagtgg gtctatcaaa ggcccacatc tccaaacccc tgtcctgccc tcaggaccag 9601 gcccaccctg ggcaagagag aacgtaagcc ccagggcttc aggtccccag agacacttgg 9661 ggaactgggg ggaaattctg aggccatggg gcttggttct ccactgcctc ctgcccaggg 9721 ggatttgggg acggtaggag gatgtgtcta aggcatagtc gacttggcac agagtggtct 9781 ctttagtttt gtttcccact ggaggtggca catgcaggaa aagggcctgg cccaggctgc 9841 cgaccggcag aagctgagtg ggaaccaaac cctcctgcaa ttggcagggc cctgccgtca 9901 agctaaggcc aaagctgggc cctgggccca ttctacccac tgaaggcagc tgtggaggaa 9961 ggggcttggg ttccagcctg gtttgtggta gggggagata ccacaaaaga aatggggatg 10021 gttctggctc aggcctctgg gaaagcagcc acccaacccc acccacctcc cgcaggggct 10081 ccttccagct tgaggctcag tgggacccag actggaaggt taatgctgtg aagggaagca 10141 gcacagggtg gacggggcaa ggccagctgt gagaaggcag tgcccctggc accctggttt 10201 cagaggcagg tcacacagta tggctaagtt ccagggaggg gtgcgcagaa gctcagcaga 10261 aggggagagg tgagcagccc gggaccctcc cccagggcgg caactcctac cttcccatgt 10321 cctcatggag gactacaggt gtgcaccatg ggtgggtgtg cacgatgggc aggtgtgcac 10381 gatgggcgtg cagtgatcac tcccaggctg ccaacaccca tgcagacacc agatggcgcc 10441 ttcgtgcagc tgcagaggag ggagcaacag agcctgaagg gaaaaggcaa tggggctgca 10501 ccaaaggata gaacccaggc tgacactcga ccctaatcgg gaggaccccc ttccctctgc 10561 cttggccccc aggtgcccca ttccccaggt agcagcagtg gggctccctt taaccacccc 10621 cagttgggaa ggaggcacct ggggaatgga atggacatca acggggagag ggaggtagcg 10681 gtgctctaca aagaaggcac caagggcggt gggctgagac ccctcagaat cttggagagg 10741 ctggagcctg ggcaagccga tgaccagcat ggccacacag tccagaaggg tgaaggtcca 10801 cgccatggcc ctccaccaga ggtcctggga ccaggaaggc tccctggagg caccatgaag 10861 gaagacagat cttggctggg aggtggaggg ctgtttcgac ctagccaggg gctacgggtc 10921 cagtcaaggc acaagctttg tgcctaccag ggtctcccac tggagcataa tcttaaggat 10981 caggatgcat gggaatgtgt gaaaccaggg agaagggctc tgtggaggaa agggggtccc 11041 agaagtaact gtcccaaagg gtcctgaggc cacaggacac tccacccagc actgcagttc 11101 cctttgattg gggaaaagtc aaagggcaag ggagacagtg aaggccaggt cctatccctt 11161 cccaactcca ccagagcagc tgcccaccaa gaggggtatc agtgccagcc aggctcccag 11221 ttcaggggga gtcacagccc cctgtgctac ctctactctg tcacacctgg cccaggccat 11281 ggtgaggaca ggggctgctg aaggcacaga gaaagggctg gagccagaca ttcttcacct 11341 actgtgggcc acataggcct atctccagag agggcatcgg acccagatgg caccacagtg 11401 tgtggccagg ctgggtcgtg ctgcatgtgt gcacagccag gcggctcagc cattgtattg 11461 ctgctggtag cgcaggttga gctcccgcag ctcccgttcc cgcacacggc gtgacttatt 11521 ggagcgtgtg gagcggctgg aacgcgtgga ctgggcagat ttggtgctct ggcagcgcga 11581 ggaggcacgt ttaaggaggt tctgggatat ggagcggtgc aggttcttca tggatgaaga 11641 ggcagccatg ctcaccaccc acgggtgcct cagggcctgc agtgcagtca tacgggctcc 11701 agggtccact gtcagcaggc ggtcaatgaa gtccttggcc aggttggaca cactaggcca 11761 gggctagaga ccaaggacaa gcattagagt gagagcatct gacactgccc accccatctg 11821 gatgaggcca ctactcagca accctcccct ttccagagag aggtgctgcc cctcctctca 11881 tgtagcactt ggggcctccc cgcccaacgc tggctcaggc tgaacaaggg ctgctctcca 11941 ggtgatggag tctggcaagg aaggaaagga cctgtgcact ctcccaggga gcaaattcta 12001 tggtgcactg gacccgaagc ctggctccag ggagatggcc tctgccaaga ccccccggaa 12061 cgtgtcccag gagtatcata actcagggga ctgttagaga atgattcaaa ctttcccacc 12121 acatcctaag tcagattgaa gctccaatct ctggatgacc aggatcaggc tacttaaagg 12181 ggaacttcct agtccttaca gagaagatcc aacctctctc caactgccga agcagtggca 12241 gaagaccact gctccctgcc tctcctcccg gcatggggag gaaaggaaac aattcaaggc 12301 aactagattt cccagtcggc tgagggcagg cgatcccggg ccaggaagga accaggaccc 12361 ttctcagtgg caccctctgg cccgcattac ttctctaagc cacaaagggc tcctggcagt 12421 gctgtgcgcc agcctcattt tagtacattc tgtcccctgg gaggaactcc ataaagccca 12481 ctctgccaca tgcaccccgg gctgcctcat ctcagccccg aacccagcag ctgtctgtct 12541 cagggcctca ggttgtacgg ctgtcttcac ctgactggat cctcaggttc tcagggtaaa 12601 ggacacttgc tcagactccc tcttagcccc cagtgcttcc agcaattatt ccagctgtaa 12661 cgtgagactg caatttcatg ttcgtttagt attcccatga gatcatgctg agctggatga 12721 gcccggcctg gtgctgcgca tacaggaagc actcagtagg cacaggctca gacagtaaac 12781 aacccacggt gctgccggat gggtgccctt tcctggagct gcttccaggc cttggggctc 12841 agccaggtga gtccttgcgt ccctgcatct cctaggaaca cttctggcac gggctctgag 12901 gctcccccaa ggataggcag ctaggacctt tcctgagcct gctgcagatg actcaacagg 12961 gatgctaacg atcccctcat cttccttcct gccaggtgag gtctgcctgt tccacccatg 13021 gtacccttca ccttgaggaa cccctgaaca tgccctccag ggggttcagg aggatctgag 13081 agaccacctt cagggcaggt gcacagccat ctagcagaca cacacactca ctgactactg 13141 ctactcccag tctggctcgc ctgacctcca actctttccc tacccccttc cccactgcca 13201 cagagggatg aggcanngag aacacgcttc caccgtcctg aggaaggcnt ggggctacct 13261 gcagctgctg tcttcaccca ctctttggaa ggttattcca agttttactg agctgaagtg 13321 ggagcaacag gggaaccata ttcccaaaca cacctaacag ggtcatcctc atcagtgggc 13381 cagcagcaca cagtgactcc tggggagatg ctggccccag gaggaggaag tcagggtcca 13441 ggagcatgca gccaacgaag gcccatagat gccttactat ccaagggctg tgggtgggcg 13501 cagagagcaa cagccctccc cgacaggcag gtaagtctcc tgggggcttg tgtagttcaa 13561 gattcatatt gagggccagg cgtggtggct catgcctgta atcccagcac tttggggagg 13621 ctgaggcagg tggatcacaa ggtcatgaga tcaagaccat cctggccaac atggtgaaac 13681 cccgtctcta ctaaaaatac aaaaattagt cgggcgtggt ggcgtgcctg tagtccagct 13741 actcaggaag ctgaggcagg agaattgctt gaacctgaga ggcggaggtt gcagtgagcc 13801 aagatcgcac cactgcactc caggctggga aagagggggg ttccgtttcc aaaaaaaaaa 13861 aaa // LOCUS HSPRPL2 2578 bp RNA PRI 06-JAN-1998 DEFINITION H.sapiens mRNA for PRPL-2 protein. ACCESSION X86019 NID g2760482 KEYWORDS prpL_2 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2578) AUTHORS Kreideweiss,S., Delany-Heiken,P., Nordheim,A. and Ruhlmann,A. JOURNAL Unpublished REFERENCE 2 (bases 1 to 2578) AUTHORS Ruhlmann,A.C.C. TITLE Direct Submission JOURNAL Submitted (30-MAR-1995) A.C.C. Ruhlmann, Medizinische Hochschule Hannover, Institut fur Molekularbiologie, OE 5250, D- 30623 Hannover, FRG REMARK revised by [3] REFERENCE 3 (bases 1 to 2578) AUTHORS Ruhlmann,A.C.C. TITLE Direct Submission JOURNAL Submitted (06-JAN-1998) A.C.C. Ruhlmann, Medizinische Hochschule Hannover, Institut fur Molekularbiologie, OE 5250, D- 30623 Hannover, FRG COMMENT Includes sequence Z16137. FEATURES Location/Qualifiers source 1..2578 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda gt11" /tissue_type="tonsils" /tissue_type="peripheral blood" gene 205..1689 /gene="prpL_2" CDS 205..1689 /gene="prpL_2" /codon_start=1 /product="SH3-domain interacting protein" /db_xref="PID:e1226443" /db_xref="PID:g2760483" /translation="MPVPPPPAPPPPPTFALANTEKPTLNKTEQAGRNALLSDISKGK KLKKTVTNDRSAPILDKPKGAGAGGGGGGFGGGGGFGGGGGGGGGGSFGGGGPPGLGG LFQAGMPKLRSTANRDNDSGGSRPPLLPPGGRSTSAKPFSPPSGPGRFPVPSPGHRSG PPEPQRNRMPPPRPDVGSKPDSIPPPVPSTPRPIQSSLHNRGSPPVPGGPRQPSPGPT PPPFPGNRGTALGGGSIRQSPLSSSSPFSNRPPLPPTPSRALDDKPPPPPPPVGNRPS IHREAVPPPPPQNNKPPVPSTPRPSAPHRPHLRPPPPSRPGPPPLPPSSSGNDETPRL PQRNLSLSSSTPPLPSPGRSGPLPPPVPSERPPPPVRDPPGRSGPLPPPPPVSRNGST SRALPATPQLPSRSGVDSPRSGPRPPLPPDRPSAGAPPPPPPSTSIRNGFQDSPCEDE WESRFYFHPISDLPPPEPYVQTTKSYPSKLARNESRSEYFCQGF" misc_feature 2448..2487 /note="U7 snRNA similarity" BASE COUNT 639 a 748 c 563 g 628 t ORIGIN 1 gaacaagagg cctgagcttg tccaaaatgg ctgtccaaaa acgaccccac acttgcgtta 61 gaagatgata cctttttcag ggacagtttt ctctcttggc tcctgtcccg ctggccctct 121 gtctgcctgt gtccctgacc atggctccct gcagtaccct ttaacgattt atcagcaaga 181 ctgttgaacg cataactgcc caagatgcct gtccctcccc ctccagcacc cccgccgccc 241 ccgacgtttg cactggccaa tacagagaag cctaccttga ataagacaga gcaggctggg 301 agaaatgctc tcctttctga tatcagcaaa gggaagaaac taaagaagac ggtcaccaat 361 gacagaagtg caccaatact ggacaaacct aaaggagctg gtgctggagg cggtggtggt 421 ggctttggtg gaggcggcgg atttggcgga ggaggtggtg gcggaggcgg tggaagtttt 481 ggagggggcg gacctccagg tctgggagga ttgttccagg ctggaatgcc gaagctgaga 541 tccacggcca acagggataa tgattctgga ggaagccgac caccattgtt gccaccggga 601 ggaagatcca catctgcgaa acccttttca cccccaagtg gcccagggag gtttcctgtg 661 ccttctccag gccacagaag tggtccccca gagcctcaga ggaaccgaat gccgccccca 721 aggcccgacg tgggctcaaa gcctgatagc attcctcctc cagtacctag tactccaaga 781 cccattcaat caagtctgca caaccggggg tccccaccag tgcccggagg ccccaggcag 841 cccagccccg ggcccactcc tccccctttc cctggaaacc gcggcactgc tttgggagga 901 ggctcaatac gtcagtcccc cttgagctcc tcctcgccct tctccaaccg gcctcccctg 961 ccgcctaccc ccagcagggc cttggatgac aaaccccctc caccacctcc tccagtgggc 1021 aacaggccct ccatccacag ggaagcggtt ccccctcctc ctcctcagaa caacaagcct 1081 ccagtgcctt ccactccgcg gccttcggct cctcacaggc cccacctccg cccgccacct 1141 cccagcaggc ccgggccgcc tcctctgcct ccaagttcca gcggcaatga cgaaacccca 1201 agactcccac agcggaatct gtccctcagt tcgtccacgc ccccgttacc ttcgccagga 1261 cgttcaggtc ctcttcctcc cccagtgccc agtgagagac ccccacctcc agtgagggac 1321 ccgccaggcc gatcaggccc cctcccacca cctcctccag taagcagaaa cggcagcaca 1381 tctcgggccc tgcctgctac ccctcagttg ccatccagga gtggagtaga cagtcccagg 1441 agtggaccca ggcctcccct tcctcctgat aggcccagtg ctggggcacc tcccccacct 1501 ccaccatcaa catctattag aaatggcttc caagactctc catgtgaaga tgagtgggaa 1561 agcagattct acttccatcc gatttccgat ttgccacctc cagagccata tgtacaaacg 1621 accaaaagtt atcccagcaa actggcaaga aacgaaagcc ggagtgagta tttctgccaa 1681 ggtttttgaa gattcacgct tgagtagcct gagtgacact cgggctatat agaaagtcca 1741 ggcttcttgc ctgcagtggg gtagagaacc ctttggagcc atctacattc tagtaaattc 1801 cccttccaca ggcagtgacg aaattcgaga tgcccagaat gcatatgaga aacagttatt 1861 ttaatgtgtc attacaagct caaaataccg tggtttgcta gtattaaatt ttattatttt 1921 acttctttgg acgtcttcct ctgttttcat cttcctgaga cagcacaagt agagccatct 1981 tctattcctt ctagttttga aattggcata atgtctgatt aattataaca aaaactccct 2041 ttaactgcct tgatctttta aacgtattgt aatcatagct ttacagagca ttacatgaaa 2101 atgtaaatat atccataatc tctccacata atagatattt tcattttacc agattctcat 2161 ccaatttttt atccatataa ttctgttttt ttatgtagtt ataatgacga tgtacagttt 2221 tttggcgggg tttacataac attgcttagt aaatagtgcc catgttatca catggttttt 2281 atttcagcat atagtaattt accattacca tggacattta aattatttct gttgttctgc 2341 taatgtaggt aatgctacca taaagaacta aattgctttt cctccattga agaagaatgt 2401 caacaagaaa ggaaaaatag acaaactgga attttataac agtaaatgta taggaagtgt 2461 tacagctctt tgagaatttg tctagcaggc tttccagggt tttgctggaa agcctctgaa 2521 atttactttt aaaaaaacac aatagctggg gccaattggg gttcgtcacg cctgtaat // LOCUS HSPRPS2 2457 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for phosphoriobosyl pyrophosphate synthetase subunit II (EC 2.7.6.1). ACCESSION Y00971 NID g35699 KEYWORDS phosphoribosylpyrophosphate synthetase; ribose-phosphate pyrophosphokinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2457) AUTHORS Iizasa,T. TITLE Direct Submission JOURNAL Submitted (24-JAN-1989) Iizasa T., Department of Biochemistry, Chiba University School of Medicine, Inohana, Chiba 280, Japan REFERENCE 2 (bases 1 to 2457) AUTHORS Iizasa,T., Taira,M., Shimada,H., Ishijima,S. and Tatibana,M. TITLE Molecular cloning and sequencing of human cDNA for phosphoribosyl pyrophosphate synthetase subunit II JOURNAL FEBS Lett. 244 (1), 47-50 (1989) MEDLINE 89171273 COMMENT Data kindly reviewed (10-APR-1989) by Iizasa T. FEATURES Location/Qualifiers source 1..2457 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="testis." /clone_lib="lambda gt10" /clone="lambda hRPSII-5, -9 -15" /chromosome="Xpter-q21." mRNA <1..2457 /note="PRPS2 mRNA" CDS 61..1017 /note="PPRibP synthetase (AA 1 - 318)" /codon_start=1 /db_xref="PID:g35700" /db_xref="SWISS-PROT:P11908" /translation="MPNIVLFSGSSHQDLSQRVADRLGLELGKVVTKKFSNQETSVEI GESVRGEDVYIIQSGCGEINDNLMELLIMINACKIASSSRVTAVIPCFPYARQDKKDK SRAPISAKLVANMLSVAGADHIITMDLHASQIQGFFDIPVDNLYAEPAVLQWIRENIA EWKNCIIVSPDAGGAKRVTSIADRLNVEFALIHKERKKANEVDRMVLVGDVKDRVAIL VDDMADTCGTICHAADKLLSAGATKVYAILTHGIFSGPAISRINNAAFEAVVVTNTIP QEDKMKHCTKIQVIDISMILAEAIRRTHNGESVSYLFSHVPL" variation 1538..1555 /note="u stretch in lambda hPRSII-9 consists of 17 u" polyA_site 2445..2457 /note="polyA site" BASE COUNT 693 a 513 c 519 g 732 t ORIGIN 1 cgctgttgcc tccgccacct cctccgccgc cgcgcgcccc tcggagttcc gcgccccacc 61 atgcccaaca tcgtgctgtt cagcggcagc tcgcatcagg acctatccca gcgcgtggcc 121 gaccgcctgg gcctggagct gggcaaggtg gtcacgaaga agttcagcaa ccaggagacc 181 agcgtggaga ttggtgaaag cgtgagaggg gaagatgtct acatcatcca gagcggctgc 241 ggggaaatta acgacaacct gatggaactc ctcatcatga tcaatgcctg caagattgcg 301 tcatcatcca gagtaactgc cgtgatcccg tgtttcccat acgcccgaca agataaaaag 361 gacaagagtc gtgccccaat ttctgcaaaa cttgtggcca atatgctgtc ggtggctggg 421 gcggatcaca tcatcaccat ggacctgcat gcttctcaga tacagggatt ctttgatatt 481 cctgtggata atttgtatgc ggagcccgca gtcctgcagt ggattcggga aaacattgcc 541 gagtggaaga actgtatcat tgtttcacct gacgcagggg gagccaaaag ggttacatca 601 attgcagaca ggttgaatgt ggaatttgct ttgatccaca aagagaggaa gaaggcgaat 661 gaagtggacc ggatggtcct ggtgggcgac gtgaaggacc gtgtggccat cctcgtggat 721 gacatggctg acacttgcgg caccatctgc catgctgcgg acaagctgct gtcagctgga 781 gccaccaaag tgtatgctat ccttacccat gggatcttct ctggaccagc tatttccaga 841 ataaataatg ccgcctttga ggctgttgtc gtcacaaaca caattccgca agaggacaaa 901 atgaaacact gcaccaagat tcaggtcatt gacatttcca tgatcttggc cgaagcaatc 961 cgaaggacac acaatgggga atccgtgtcc tacctgttca gccatgtccc gctataaatc 1021 cagaatggga agtgtccagc aagcctactc tgacttctga cttgtttttg ttttctggat 1081 ttttagctgt aggtattcag caatgatagg ttaatcactg gcaaaagcat cagatctttg 1141 tatatgctaa gatttattgt ttccccttct aaagctcaag atcatttctt tccagttttt 1201 ggggaaatgg tggtggttat ttggtcttta agtgaactgt cttaaatgag aaacgttttt 1261 gtcattttga cttttaacag gtacaggtga tctcttcctt tgttctttca gtactttgag 1321 gcgacaactt tcaagtatat aatttcattg tggaagtcat agtttatata tttcgaggtt 1381 gccaaaggtg acttcacatt aaagccttct gtgtaaatat atactgataa tgcctatgga 1441 catttgggta aaaccctgta tagaattaat tatcctttta ctttggagtg aaccttggaa 1501 aatttataat tataatacca tggattttga attttccttt tttttttttt tttttggata 1561 actcagtttc agataaacca tcttggttac tgtgcttaat ttggaccaaa ttttatttag 1621 cttaatatgg acactgacac attttggggg gtatacatta gacatatcag agcagtgtat 1681 ttctggatca ttttttaaat gacctcttct aaaacataac tgtcacttac ctgaaatgct 1741 gcatcctaaa attccaaaat tatattgagc aatcgccaag gcctaaagcc aactgactta 1801 aaggtaatca tttcagctaa gattaaattt aaagcctaag aatgtataga gctagtttta 1861 aaataatgat ctcagatttt taaaaaggat ataggaacct gcattgtcat tctctgaatt 1921 aagaactgat ggtttctatc attatttagc cccacctttg tattttaaaa tccttcagaa 1981 tacatttatg aaccaatgcg actggactta gccacacaca atggaaattc agaccttgac 2041 tatttggtgt ttccagttca caaaggtgat gaagactgtc ttgggagcag cttaatccca 2101 aaatttgtac atttcttgct gctcctggcg tggaaactta agtgagacca ccaaatacat 2161 tggtcctgtc caattctact gaatgggggt ggacctggca tttatctggc caaaaacagg 2221 agccagagaa atatgaatat accaaagttg tttgtttagc ctccaactta aattacatta 2281 gtcaacttat agatactcat atgatcactt ttctttttag atactacatc aactagattc 2341 aggagtatat catttgcagt gcttgtattg gtttaaaatg taagatttta agatcctcta 2401 acactgtact aaaacatttc aataaaatca ttctgactgc gttcaaaaaa aaaaaaa // LOCUS HSPRR 1557 bp RNA PRI 10-MAR-1995 DEFINITION H.sapiens PRR1 mRNA. ACCESSION X76400 NID g732795 KEYWORDS poliovirus receptor related. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1557) AUTHORS Lopez,M. TITLE Direct Submission JOURNAL Submitted (26-NOV-1993) M. Lopez, INSERM U.119, 27, Boulevard Lei-Roure, 13009 Marseille, FRANCE REFERENCE 2 (bases 1 to 1557) AUTHORS Lopez,M., Eberle,F., Mattei,M.G., Gabert,J., Birg,F., Bardin,F., Maroc,C. and Dubreuil,P. TITLE Complementary DNA characterization and chromosomal localization of a human gene related to the poliovirus receptor-encoding gene JOURNAL Gene 155 (2), 261-265 (1995) MEDLINE 95237621 FEATURES Location/Qualifiers source 1..1557 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="TF-1" /cell_type="erythroleukemic" /clone_lib="cDNA in pSPORT" /clone="12.5" /chromosome="11q23-11q24" gene 1..1557 /gene="PRR1" CDS 1..1557 /gene="PRR1" /codon_start=1 /db_xref="PID:g732796" /translation="MARMGLAGAAGRWWGLALGLTAFFLPGVHSQVVQVNDSMYGFIG TDVVLHCSFANPLPSVKITQVTWQKSTNGSKQNVAIYNPSMGVSVLAPYRERVEFLRP SFTDGTIRLSRLELEDEGVYICEFATFPTGNRESQLNLTVMAKPTNWIEGTQAVLRAK KGQDDKVLVATCTSANGKPPSVVSWETRLKGEARVPGDSGTPMAPVTVISRYRLVPSR EAHQQSLACIVNYHMDRFKESLTLNVQYEPEVTIEGFDGNWYLQRMDVKLTCKADANP PATEYHWTTLNGSLPKGVEAQNRTLFFKGPINYSLAGTYICEATNPIGTRSGQVEVNI TEFPYTPSPPEHGRRAGPVPTAIIGGVAGSILLVLIVVGGIVVALRRRRHTFKGDYST KKHVYGNGYSKAGIPQHHPPMAQNLQYPDDSDDEKKAGPLGGSSYEEEEEEEEGGGGG ERKVGGPHPKYDEDAKRPYFTVDEAEARQDGYGDRTLGYQYDPEQLDLAENMVSQNDG SFISKKEWYV" BASE COUNT 341 a 479 c 469 g 268 t ORIGIN 1 atggctcgga tggggcttgc gggcgccgct ggacgctggt ggggactcgc tctcggcttg 61 accgcattct tcctcccagg cgtccactcc caggtggtcc aggtgaacga ctccatgtat 121 ggcttcatcg gcacagacgt ggttctgcac tgcagctttg ccaacccgct tcccagcgtg 181 aagatcaccc aggtcacatg gcagaagtcc accaatggct ccaagcagaa cgtggccatc 241 tacaacccat ccatgggcgt gtccgtgctg gctccctacc gcgagcgtgt ggaattcctg 301 cggccctcct tcaccgatgg cactatccgc ctctcccgcc tggagctgga ggatgagggt 361 gtctacatct gcgagtttgc taccttccct acgggcaatc gagaaagcca gctcaatctc 421 acggtgatgg ccaaacccac caattggata gagggtaccc aggcagtgct tcgagccaag 481 aaggggcagg atgacaaggt cctggtggcc acctgcacct cagccaatgg gaagcctccc 541 agtgtggtat cctgggaaac tcggttaaaa ggtgaggcca gagtaccagg agactccgga 601 accccaatgg caccagtgac ggtcatcagc cgctaccgcc tggtgcccag cagggaagcc 661 caccagcagt ccttggcctg catcgtcaac taccacatgg accgcttcaa ggaaagcctc 721 actctcaacg tgcagtatga gcctgaggta accattgagg ggtttgatgg caactggtac 781 ctgcagcgga tggacgtgaa gctcacctgc aaagctgatg ctaacccccc agccactgag 841 taccactgga ccacgctaaa tggctctctc cccaagggtg tggaggccca gaacagaacc 901 ctcttcttca agggacccat caactacagc ctggcaggga cctacatctg tgaggccacc 961 aaccccatcg gtacacgctc aggccaggtg gaggtcaata tcacagaatt cccctacacc 1021 ccgtctcctc ccgaacatgg gcggcgcgcc gggccggtgc ccacggccat cattgggggc 1081 gtggcgggga gcatcctgct ggtgttgatt gtggtcggcg ggatcgtggt cgccctgcgt 1141 cggcgccggc acaccttcaa gggtgactac agcaccaaga agcacgtgta tggcaacggc 1201 tacagcaagg caggcatccc ccagcaccac ccaccaatgg cacagaacct gcagtacccc 1261 gacgactcag acgacgagaa gaaggccggc ccactgggtg gaagcagcta tgaggaggag 1321 gaggaggagg aggagggcgg tggagggggc gagcgcaagg tgggcggccc ccaccccaaa 1381 tatgacgagg acgccaagcg gccctacttc accgtggatg aggccgaggc ccgtcaggac 1441 ggctacgggg accggactct gggctaccag tacgaccctg agcagctgga cttggctgag 1501 aacatggttt ctcagaacga cgggtctttc atttccaaga aggagtggta cgtgtag // LOCUS HSPRS1 1196 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for phosphoribosylpyrophosphate synthetase subunit one. ACCESSION X15331 M25042 NID g35701 KEYWORDS phosphoribosylpyrophosphate synthetase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1196) AUTHORS Roessler,B.J. TITLE Direct Submission JOURNAL Submitted (24-MAY-1989) Roessler B.J., University of Michigan, 5520 MSRBI box 6080, Medical Centre Drive, Ann Arbor MI 48109, U S A REFERENCE 2 (bases 1 to 1196) AUTHORS Roessler,B.J., Bell,G., Heidler,S., Seino,S., Becker,M. and Palella,T.D. TITLE Cloning of two distinct copies of human phosphoribosylpyrophosphate synthetase cDNA JOURNAL Nucleic Acids Res. 18 (1), 193 (1990) MEDLINE 90174926 FEATURES Location/Qualifiers source 1..1196 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lymphoblast" /clone_lib="lambda gt10" /clone="1" CDS 67..1023 /note="phosphoribosylpyrophosphate synthetase (AA 1-319)" /codon_start=1 /db_xref="PID:g35702" /db_xref="SWISS-PROT:P09329" /translation="MPNIKIFSGSSHQDLSQKIADRLGLELGKVVTKKFSNQETCVEI GESVRGEDVYIVQSGCGEINDNLMELLIMINACKIASASRVTAVIPCFPYARQDKKDK SRAPISAKLVANMLSVAGADHIITMDLHASQIQGFFDIPVDNLYAEPAVLKWIRENIS EWRNCTIVSPDAGGAKRVTSIADRLNVDFALIHKERKKANEVDRMVLVGDVKDRVAIL VDDMADTCGTICHAADKLLSAGATRVYAILTHGIFSGPAISRINNACFEAVVVTNTIP QEDKMKHCSKIQVIDISMILAEAIRRTHNGESVSYLFSHVPL" BASE COUNT 312 a 269 c 302 g 313 t ORIGIN 1 gacttcggtt ccggtctctg cagcagccgt gatcgcttag tggagtgctt agggtagttg 61 gccaggatgc cgaatatcaa aatcttcagc ggcagttccc accaggactt atctcagaaa 121 attgctgacc gcctgggcct ggagctaggc aaggtggtga ctaagaagtt cagcaaccag 181 gagacctgtg tggaaatcgg tgaaagtgta cgtggagagg atgtctacat tgttcagagt 241 ggttgtggcg aaatcaatga caatttaatg gagcttttga tcatgattaa tgcctgcaag 301 attgcttcag ccagccgggt tactgcagtc atcccatgct tcccttatgc ccggcaggat 361 aagaaggata agagccgggc gccaatctca gccaagcttg ttgcaaatat gctatctgta 421 gcaggtgcag atcatattat caccatggac ctacatgctt ctcaaattca gggctttttt 481 gatatcccag tagacaattt gtatgcagag ccggctgtcc taaagtggat aagggagaat 541 atctctgagt ggaggaactg cactattgtc tcacctgatg ctggtggagc taagagagtg 601 acctccattg cagacaggct gaatgtggac tttgccttga ttcacaaaga acggaagaag 661 gccaatgaag tggaccgcat ggtgcttgtg ggagatgtga aggatcgggt ggccatcctt 721 gtggatgaca tggctgacac ttgtggcaca atctgccatg cagctgacaa acttctctca 781 gctggcgcca ccagagttta tgccatcttg actcatggaa tcttctccgg tcctgctatt 841 tctcgcatca acaacgcatg ctttgaggca gtagtagtca ccaataccat acctcaggag 901 gacaagatga agcattgctc caaaatacag gtgattgaca tctctatgat ccttgcagaa 961 gccatcagga gaactcacaa tggagaatcc gtttcttacc tattcagcca tgtcccttta 1021 taatagagta aggtattgat gacaaattca gcagaagacc cggcttgctc cagtgtagct 1081 ttctacatcc cacatcagga tattagaggt tatccgaact ggggaaagac ggattgagat 1141 taactgctgg acctcctacc tgcattatct cattctggct tccttgataa ttctgt // LOCUS HSPRTXPRS 1291 bp RNA PRI 10-NOV-1995 DEFINITION H.sapiens mRNA for TX protease precursor. ACCESSION Z48810 S78281 NID g999453 KEYWORDS TX protease. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1291) AUTHORS Faucheu,C., Diu,A., Chan,A.W., Blanchet,A.M., Miossec,C., Herve,F., Collard-Dutilleul,V., Gu,Y., Aldape,R.A., Lippke,J.A. et,al. TITLE A novel human protease similar to the interleukin-1 beta converting enzyme induces apoptosis in transfected cells JOURNAL EMBO J. 14 (9), 1914-1922 (1995) MEDLINE 95262631 REFERENCE 2 (bases 1 to 1291) AUTHORS Lalanne,J. TITLE Direct Submission JOURNAL Submitted (29-MAR-1995) Jean-Louis Lalanne, Roussel-Uclaf, 102 route de Noisy, Romainville, 93235 Cedex, France FEATURES Location/Qualifiers source 1..1291 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" CDS 42..1175 /codon_start=1 /product="TX protease precursor" /db_xref="PID:g999454" /translation="MAEGNHRKKPLKVLESLGKDFLTGVLDNLVEQNVLNWKEEEKKK YYDAKTEDKVRVMADSMQEKQRMAGQMLLQTFFNIDQISPNKKAHPNMEAGPPESGES TDALKLCPHEEFLRLCKERAEEIYPIKERNNRTRLALIICNTEFDHLPPRNGADFDIT GMKELLEGLDYSVDVEENLTARDMESALRAFATRPEHKSSDSTFLVLMSHGILEGICG TVHDEKKPDVLLYDTIFQIFNNRNCLSLKDKPKVIIVQACRGANRGELWVRDSPASLE VASSQSSENLEEDAVYKTHVEKDFIAFCSSTPHNVSWRDSTMGSIFITQLITCFQKYS WCCHLEEVFRKVQQSFETPRAKAQMPTIERLSMTRYFYLFPGN" polyA_signal 1267..1272 polyA_site 1281 BASE COUNT 413 a 297 c 288 g 293 t ORIGIN 1 gctctttcca acgctgtaaa aaaggacaga ggctgttccc tatggcagaa ggcaaccaca 61 gaaaaaagcc acttaaggtg ttggaatccc tgggcaaaga tttcctcact ggtgttttgg 121 ataacttggt ggaacaaaat gtactgaact ggaaggaaga ggaaaaaaag aaatattacg 181 atgctaaaac tgaagacaaa gttcgggtca tggcagactc tatgcaagag aagcaacgta 241 tggcaggaca aatgcttctt caaacctttt ttaacataga ccaaatatcc cccaataaaa 301 aagctcatcc gaatatggag gctggaccac ctgagtcagg agaatctaca gatgccctca 361 agctttgtcc tcatgaagaa ttcctgagac tatgtaaaga aagagctgaa gagatctatc 421 caataaagga gagaaacaac cgcacacgcc tggctctcat catatgcaat acagagtttg 481 accatctgcc tccgaggaat ggagctgact ttgacatcac agggatgaag gagctacttg 541 agggtctgga ctatagtgta gatgtagaag agaatctgac agccagggat atggagtcag 601 cgctgagggc atttgctacc agaccagagc acaagtcctc tgacagcaca ttcttggtac 661 tcatgtctca tggcatcctg gagggaatct gcggaactgt gcatgatgag aaaaaaccag 721 atgtgctgct ttatgacacc atcttccaga tattcaacaa ccgcaactgc ctcagtctga 781 aggacaaacc caaggtcatc attgtccagg cctgcagagg tgcaaaccgt ggggaactgt 841 gggtcagaga ctctccagca tccttggaag tggcctcttc acagtcatct gagaacctgg 901 aggaagatgc tgtttacaag acccacgtgg agaaggactt cattgctttc tgctcttcaa 961 cgccacacaa cgtgtcctgg agagacagca caatgggctc tatcttcatc acacaactca 1021 tcacatgctt ccagaaatat tcttggtgct gccacctaga ggaagtattt cggaaggtac 1081 agcaatcatt tgaaactcca agggccaaag ctcaaatgcc caccatagaa cgactgtcca 1141 tgacaagata tttctacctc tttcctggca attgaaaatg gaagccacaa gcagcccagc 1201 cctccttaat caacttcaag gagcaccttc attagtacag cttgcatatt taacattttg 1261 tatttcaata aaagtgaaga caaaaaaaaa a // LOCUS HSPRTYKIN 2586 bp RNA PRI 25-JUL-1994 DEFINITION H.sapiens mRNA for protein tyrosin kinase. ACCESSION X73568 NID g515870 KEYWORDS protein tyrosine kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2586) AUTHORS Muller,B., Cooper,L. and Terhorst,C. TITLE Molecular cloning of the human homologue to the pig protein-tyrosine kinase syk JOURNAL Immunogenetics 39 (5), 359-362 (1994) MEDLINE 94222446 REFERENCE 2 (bases 1 to 2586) AUTHORS Mueller,B. TITLE Direct Submission JOURNAL Submitted (24-MAY-1993) B. Mueller, Deutsches Rheuma Forschungszentrum, Forschungslaboratorium, Haus 11, Nordufer 20,, D-1000 Berlin 65, FRG FEATURES Location/Qualifiers source 1..2586 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="tonsil" CDS 155..2047 /codon_start=1 /product="protein tyrosin kinase" /db_xref="PID:g515871" /db_xref="SWISS-PROT:P43405" /translation="MADSANHLPFFFGNITREEAEDYLVQGGMSDGLYLLRQSRNYLG GFALSVAHGRKAHHYTIERELNGTYAIAGGRTHASPADLCHYHSQESDGLVCLLKKPF NRPQGVQPKTGAFEDLKENLIREYVKQTWNLQGQALEQAIISQKPQLEKLIATTAHEK MPWFHGKISREESEQIVLIGSKTNGKFLIRARDNNGSYALCLLHEGKVLHYRIDKDKT GKLSIPEGKKFDTLWQLVEHYSYKADPLLRVLTVPCQKIGTQGNVNFGGRPQLPGSHP ATWSAGGIISRIKSYSFPKPGHRKSSPAQGNRQESTVSFNPYEPELAPWAADKGPQRE ALPMDTEVYESPYADPEEIRPKEVYLDRKLLTLEDKELGSGNFGTVKKGYYQMKKVVK TVAVKILKNEANDPALKDELLAEANVMQQLDNPYIVRMIGICEAESWMLVMEMAELGP LNKYLQQNRHVKDKNIIELVHQVSMGMKYLEESNFVHRDLAARNVLLVTQHYAKISDF GLSKALRADENYYKAQTHGKWPVKWYAPECINYYKFSSKSDVWSFGVLMWEAFSYGQK PYRGMKGSEVTAMLEKGERMGCPAGCPREMYDLMNLCWTYDVENRPGFAAVELRLRNY YYDVVN" polyA_signal 2463..2468 BASE COUNT 684 a 662 c 687 g 553 t ORIGIN 1 aagatatcct ctcagatttg tgtgcttctt cccttcccac attctatact gcccagcgcc 61 tgggattcca gatctgcgtt tgaatccagg aaagggaggt ggacacctgc gcaggtgtgt 121 gccctccggc ccctgaagca tggccagcag cggcatggct gacagcgcca accacctgcc 181 cttctttttc ggcaacatca cccgggagga ggcagaagat tacctggtcc aggggggcat 241 gagtgatggg ctttatttgc tgcgccagag ccgcaactac ctgggtggct tcgccctgtc 301 cgtggcccac gggaggaagg cacaccacta caccatcgag cgggagctga atggcaccta 361 cgccatcgcc ggtggcagga cccatgccag ccccgccgac ctctgtcact accactccca 421 ggagtctgat ggcctggtct gcctcctcaa gaagcccttc aaccggcccc aaggggtgca 481 gcctaagact ggggcctttg aggatttgaa ggaaaacctc atcagggaat atgtgaagca 541 gacatggaac ctgcagggtc aggctctgga gcaggccatc atcagtcaga agcctcagct 601 ggagaagctg atcgctacca cagcccatga aaaaatgcct tggttccatg gaaaaatctc 661 tcgggaagaa tctgagcaaa ttgtcctgat aggatcaaag acaaatggaa agttcctgat 721 ccgagccaga gacaacaacg gctcctacgc cctgtgcctg ctgcacgaag ggaaggtgct 781 gcactatcgc atcgacaaag acaagacagg gaagctctcc atccccgagg gaaagaagtt 841 cgacacgctc tggcagctag tcgagcatta ttcttataaa gcagatcctt tgttaagagt 901 tcttactgtc ccatgtcaaa aaatcggcac acagggaaat gttaattttg gaggccgtcc 961 acaacttcca ggttcccatc ctgcgacttg gtcagcgggt ggaataatct caagaatcaa 1021 atcatactcc ttcccaaagc ctggccacag aaagtcctcc cctgcccaag ggaaccggca 1081 agagagtact gtgtcattca atccgtatga gccagaactt gcaccctggg ctgcagacaa 1141 aggcccccag agagaagccc tacccatgga cacagaggtg tacgagagcc cctacgcgga 1201 ccccgaggag atcaggccca aggaggttta cctggaccga aagctgctga cgctggaaga 1261 caaagaactg ggctctggta attttggaac tgtgaaaaag ggctactacc aaatgaaaaa 1321 agttgtgaaa accgtggctg tgaaaatact gaaaaacgag gccaatgacc ccgctcttaa 1381 agatgagtta ttagcagaag caaatgtcat gcagcagctg gacaacccgt acatcgtgcg 1441 gatgatcggg atatgcgagg ccgagtcctg gatgctggtt atggagatgg cagaacttgg 1501 tcccctcaat aagtatttgc agcagaacag acatgtcaag gataagaaca tcatagaact 1561 ggttcatcag gtttccatgg gcatgaagta cttggaggag agcaattttg tgcacagaga 1621 tctggctgca agaaatgtgt tgctagttac ccaacattac gccaagatca gtgatttcgg 1681 actttccaaa gcactgcgtg ctgatgaaaa ctactacaag gcccagaccc atggaaagtg 1741 gcctgtcaag tggtacgctc cggaatgcat caactactac aagttctcca gcaaaagcga 1801 tgtctggagc tttggagtgt tgatgtggga agcattctcc tatgggcaga agccatatcg 1861 agggatgaaa ggaagtgaag tcaccgctat gttagagaaa ggagagcgga tggggtgccc 1921 tgcaggttgt ccaagagaga tgtacgatct catgaatctg tgctggacat acgatgtgga 1981 aaacaggccc ggattcgcag cagtggaact gcggctgcgc aattactact atgacgtggt 2041 gaactaaccg ctcccgcacc tgtcggtggc tgcctttgat cacaggagca atcacaggaa 2101 aatgtatcca gaggaattga ttgtcagcca cctccctctg ccagtcggga gagccaggct 2161 tggatggaac atgcccacaa cttgtcaccc aaagcctgtc ccaggactca ccctccacaa 2221 agcaaaggca gtcccgggag aaaagacgga tggcaggatc caaggggcta gctggatttg 2281 tttgttttct tgtctgtgtg attttcatac aggttatttt tacgatctgt ttccaaatcc 2341 ctttcatgtc tttccacttc tctgggtccc ggggtgcatt tgttactcat cgggcccagg 2401 gacattgcag agtggcctag agcactctca ccccaagcgg ccttttccaa atgcccaagg 2461 atgccttagc atgtgactcc tgaagggaag gcaaaggcag aggaatttgg ctgcttctac 2521 ggccatgaga ctgatccctg gccactgaaa agctttcctg acaataaaaa tgttttgagg 2581 ctgtaa // LOCUS HSPS2MKN 490 bp RNA PRI 26-MAY-1993 DEFINITION H.sapiens pS2 protein gene. ACCESSION X52003 NID g311379 KEYWORDS pS2 protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 490) AUTHORS Takahashi,H., Kida,N., Fujii,R., Tanaka,K., Ohta,M., Mori,K. and Hayashi,K. TITLE Expression of the pS2 gene in human gastric cancer cells derived from poorly differentiated adenocarcinoma JOURNAL FEBS Lett. 261 (2), 283-286 (1990) MEDLINE 90184461 FEATURES Location/Qualifiers source 1..490 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="gastric cancer" /cell_line="MKN-45" gene 41..295 /gene="pS2" CDS 41..295 /gene="pS2" /codon_start=1 /product="pS2 protein" /db_xref="PID:g35718" /db_xref="SWISS-PROT:P04155" /translation="MATMENKVICALVLVSMLALGTLAEAQTETCTVAPRERQNCGFP GVTPSQCANKGCCFDDTVRGVPWCFYPNTIDVPPEEECEF" BASE COUNT 105 a 146 c 130 g 109 t ORIGIN 1 atccctgact cggggtcgcc tttggagcag agaggaggca atggccacca tggagaacaa 61 ggtgatctgc gccctggtcc tggtgtccat gctggccctc ggcaccctgg ccgaggccca 121 gacagagacg tgtacagtgg ccccccgtga aagacagaat tgtggttttc ctggtgtcac 181 gccctcccag tgtgcaaata agggctgctg tttcgacgac accgttcgtg gggtcccctg 241 gtgcttctat cctaatacca tcgacgtccc tccagaagag gagtgtgaat tttagacact 301 tctgcaggga tctgcctgca tcctgacgcg gtgccgtccc cagcacggtg attagtccca 361 gagctcggct gccacctcca ccggacacct cagacacgct tctgcagctg tgcctcggct 421 cacaacacag attgactgct ctgactttga ctactcaaaa ttggcctaaa aattaaaaga 481 gatcgatatt // LOCUS HSPSANLZP 420 bp RNA PRI 06-FEB-1997 DEFINITION H.sapiens mRNA for leucine zipper protein. ACCESSION Z50781 NID g1834506 KEYWORDS leucine zipper; leucine zipper protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 420) AUTHORS Vogel,P., Magert,H.J., Cieslak,A., Adermann,K. and Forssmann,W.G. TITLE hDIP--a potential transcriptional regulator related to murine TSC-22 and Drosophila shortsighted (shs)--is expressed in a large number of human tissues JOURNAL Biochim. Biophys. Acta 1309 (3), 200-204 (1996) MEDLINE 97136879 REFERENCE 2 (bases 1 to 420) AUTHORS Vogel,P. TITLE Direct Submission JOURNAL Submitted (14-AUG-1995) Petra Vogel, Molecular Biology, Lower Saxony Institute for Peptide Research, Feodor-Lynen-Strasse 31, Hannover, Lower Saxon, 30625, Germany FEATURES Location/Qualifiers source 1..420 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pSAN" /dev_stage="fetus" /tissue_type="brain" /clone_lib="cDNA in lambda ZAP II, Stratagene, cat.no. 936206" /sex="Female" 5'UTR 1..135 /note="determined by consensus rules" CDS 136..369 /note="putative" /codon_start=1 /evidence=experimental /product="leucine zipper protein" /db_xref="PID:e208115" /db_xref="PID:g1834507" /translation="MDLVKNHLMYAVREEVEILKEQIRELVEKNSQLERENTLLKTLA SPEQLEKFQSCLSPEEPAPESPQVPEAPGGSAV" 3'UTR 370..420 /partial BASE COUNT 97 a 108 c 139 g 76 t ORIGIN 1 gggggctggc cgagcgccgt gcgcgcttgg gagaaggccg gaagcttacc agccgagaag 61 gaattcctag ctagcttcag agccggtgcc tccggagcca gcgtggtggc catagacaac 121 aagttcgaac aggccatgga tctggtgaag aatcatctga tgtatgctgt gagagaggag 181 gtggagatcc tgaaggagca gatccgagag ctggtggaga agaactccca gctagagcgt 241 gagaacaccc tgttgaagac cctggcaagc ccagagcagc tggagaagtt ccagtcctgt 301 ctgagccctg aagagccagc tcccgaatcc ccacaagtgc ccgaggcccc tggtggttct 361 gcggtgtaag tcgctctgtc ctcagggtgg gcagagccac taaacttgtt ttacctaggg // LOCUS HSPSG10 1591 bp RNA PRI 22-MAR-1995 DEFINITION Human PSG10 mRNA for pregnancy specific glycoprotein 10. ACCESSION X17098 NID g35747 KEYWORDS CEA gene family; immunoglobulin superfamily; PSG10 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1591) AUTHORS Barnett,T.R. TITLE Direct Submission JOURNAL Submitted (03-NOV-1989) Barnett T. R., Molecular Diagnostics, Inc., 400 Morgan Lane, West Haven, CT. 06516, USA REFERENCE 2 (bases 1 to 1591) AUTHORS Barnett,T.R., Pickle,W. II. and Elting,J.J. TITLE Characterization of two new members of the pregnancy-specific beta 1-glycoprotein family from the myeloid cell line KG-1 and suggestion of two distinct classes of transcription unit JOURNAL Biochemistry 29 (44), 10213-10218 (1990) MEDLINE 91104939 FEATURES Location/Qualifiers source 1..1591 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1 (ATCC CCL 229)" /clone_lib="lambda gt10" /chromosome="chromosome 19" sig_peptide 54..155 /note="signal peptide (AA -34 to -1)" CDS 54..1328 /note="precursor (AA -34 to -390)" /codon_start=1 /db_xref="PID:g35748" /translation="MGPLSAPPCTQHITWKGLLLTASLLNFWNLPTTAQVIIEAQPPK VSEGKDVLLLVHNLPQNLTGYIWYKGQMTDLYHYITSYVVDGQIIYGPAYSGRETVYS NASLLIQNVTQEDAGSYTLHIIKRGDGTGGVTGYFTVTLYSETPKRSISSSNLNPREV MEAVRLICDPETPDASYLWLLNGQNLPMTHRLQLSKTNRTLYLFGVTKYIAGPYECEI RRGVSASRSDPVTLNLLPKLPMPYITINNLNPREKKDVLAFTCEPKSRNYTYIWWLNG QSLPVSPRVKRPIENRILILPSVTRNETGPYQCEIRDRYGGIRSNPVTLNVLYGPDLP RIYPYFTYYRSGENLDLSCFADSNPPAEYFWTINGKFQLSGQKLFIPQITTNHSGLYA CSVRNSATGKEISKSMIVKVSGPCHGNQTESH" mat_peptide 156..1325 /note="mature PSG10 (AA 1-390)" BASE COUNT 460 a 408 c 330 g 393 t ORIGIN 1 gggtggatcc taggctcatc tccatagggg agaacacaca tacagcagag accatgggac 61 ccctctcagc ccctccctgc actcagcaca tcacctggaa ggggctcctg ctcacagcat 121 cacttttaaa cttctggaac ctgcccacca ctgcccaagt aataattgaa gcccagccac 181 ccaaagtttc tgaggggaag gatgttcttc tacttgtcca caatttgccc cagaatctta 241 ctggctacat ctggtacaaa gggcaaatga cggacctcta ccattacatt acatcatatg 301 tagtagacgg tcaaattata tatgggcctg cctacagtgg acgagaaaca gtatattcca 361 atgcatccct gctgatccag aatgtcacac aggaggatgc aggatcctac accttacaca 421 tcataaagcg aggcgatggg actggaggag taactggata tttcactgtc accttatact 481 cggagactcc caagcgctcc atctccagca gcaacttaaa ccccagggag gtcatggagg 541 ctgtgcgctt aatctgtgat cctgagactc cggatgcaag ctacctgtgg ttgctgaatg 601 gtcagaacct ccctatgact cacaggttgc agctgtccaa aaccaacagg accctctatc 661 tatttggtgt cacaaagtat attgcagggc cctatgaatg tgaaatacgg aggggagtga 721 gtgccagccg cagtgaccca gtcaccctga atctcctccc gaagctgccc atgccttaca 781 tcaccatcaa caacttaaac cccagggaga agaaggatgt gttagccttc acctgtgaac 841 ctaagagtcg gaactacacc tacatttggt ggctaaatgg tcagagcctc ccggtcagtc 901 cgagggtaaa gcgacccatt gaaaacagga tactcattct acccagtgtc acgagaaatg 961 aaacaggacc ctatcaatgt gaaatacggg accgatatgg tggcatccgc agtaacccag 1021 tcaccctgaa tgtcctctat ggtccagacc tccccagaat ttacccttac ttcacctatt 1081 accgttcagg agaaaacctc gacttgtcct gctttgcgga ctctaaccca ccggcagagt 1141 atttttggac aattaatggg aagtttcagc tatcaggaca aaagctcttt atcccccaaa 1201 ttactacaaa tcatagcggg ctctatgctt gctctgttcg taactcagcc actggcaagg 1261 aaatctccaa atccatgata gtcaaagtct ctggtccctg ccatggaaac cagacagagt 1321 ctcattaatg gctgccacaa tagagacact gagaaaaaga acaggttgat accttcatga 1381 aattcaagac aaagaagaaa aaggctcaat gttattggac taaataatca aaaggataat 1441 gttttcataa tttttattgg aaaatgtgct gattcttgga atgttttatt ctccagattt 1501 atgaactttt tttcttcagc aattggtaaa gtatactttt gtaaacaaaa attgaaacat 1561 ttgcttttgc tctctatctg agtgcccccc c // LOCUS HSPSTI 368 bp RNA PRI 12-SEP-1993 DEFINITION Homo sapiens pstI mRNA for pancreatic secretory inhibitor (expressed in neoplastic tissue). ACCESSION Y00705 NID g35765 KEYWORDS pst1 gene; trypsin inhibitor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 368) AUTHORS Tomita,N. TITLE Direct Submission JOURNAL Submitted (16-NOV-1987) Tomita N., Institute of Molecular and Cellular Biology, Osaka University, Yamada-oka, Suita 565, Japan REFERENCE 2 (bases 1 to 368) AUTHORS Tomita,N., Horii,A., Yamamoto,T. and Ogawa,M. TITLE Expression of pancreatic secretory trypsin inhibitor (PSTI) gene in neoplastic tissues JOURNAL FEBS Lett. (1987) In press COMMENT This sequence is identical with that of PSTI RNA isolated from human pancreatic cDNA library. FEATURES Location/Qualifiers source 1..368 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="sigmoid colon" /clone="lambda TI-C1" CDS 61..300 /note="inhibitor (AA 1-79)" /codon_start=1 /db_xref="PID:g35766" /db_xref="SWISS-PROT:P00995" /translation="MKVTGIFLLSALALLSLSGNTGADSLGREAKCYNELNGCTKIYD PVCGTDGNTYPNECVLCFEGRKRQTSILIQKSGPC" misc_feature 349..354 /note="polyA signal" polyA_site 368 /note="polyA site" BASE COUNT 100 a 79 c 89 g 100 t ORIGIN 1 gaagagacgt ggtaagtgcg gtgcagtttt caactgacct ctggacgcag aacttcagcc 61 atgaaggtaa caggcatctt tcttctcagt gccttggccc tgttgagtct atctggtaac 121 actggagctg actccctggg aagagaggcc aaatgttaca atgaacttaa tggatgcacc 181 aagatatatg accctgtctg tgggactgat ggaaatactt atcccaatga atgcgtgtta 241 tgttttgaag gtcggaaacg ccagacttct atcctcattc aaaaatctgg gccttgctga 301 gaaccaaggt tttgaaatcc catcaggtca ccgcgaggcc tattgttgaa taaatgtatc 361 tgaatatc // LOCUS HSPTBASF 3071 bp RNA PRI 22-NOV-1993 DEFINITION H.sapiens mRNA for PTB-associated splicing factor. ACCESSION X70944 S56626 NID g38457 KEYWORDS splicing factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3071) AUTHORS Patton,J.G. TITLE Direct Submission JOURNAL Submitted (16-FEB-1993) J.G. Patton, Vanderbilt University, Dept of Molecular Biology, Box 1820 Station B, Nashville TN 31235, USA REFERENCE 2 (bases 1 to 3071) AUTHORS Patton,J.G., Porro,E.B., Galceran,J., Tempst,P. and Nadal-Ginard,B. TITLE Cloning and characterization of PSF, a novel pre-mRNA splicing factor JOURNAL Genes Dev. 7 (3), 393-406 (1993) MEDLINE 93194059 FEATURES Location/Qualifiers source 1..3071 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="brain" /clone="B" CDS 86..2209 /codon_start=1 /product="PTB-associated splicing factor" /db_xref="PID:g38458" /translation="MSRDRFRSRGGGGGGFHRRGGGGGRGGLHDFRSPPPGMGLNQNR GPMGPGPGQSGPKPPIPPPPPHQQQQQPPPQQPPPQQPPPHQPPPHPQPHQQQQPPPP PQDSSKPVVAQGPGPAPGVGSAPPASSSAPPATPPTSGAPPGSGPGPTPTPPPAVTSA PPGAPPPTPPSSGVPTTPPQAGGPPPPPAAVPGPGPGPKQGPGPGGPKGGKMPGGPKP GGGPGLSTPGGHPKPPHRGGGEPRGGRQHHPPYHQQHHQGPPPGGPGGRSEEKISDSE GFKANLSLLRRPGEKTYTQRCRLFVGNLPADITEDEFKRLFAKYGEPGEVFINKGKGF GFIKLESRALAEIAKAELDDTPMRGRQLRVRFATHAAALSVRNLSPYVSNELLEEAFS QFGPIERAVVIVDDRGRSTGKGIVEFASKPAARKAFERCSEGVFLLTTTPRPVIVEPL EQLDDEDGLPEKLAQKNPMYQKERETPPRFAQHGTFEYEYSQRWKSLDEMEKQQREQV EKNMKDAKDKLESEMEDAYHEHQANLLRQDLMRRQEELRRMEELHNQEMQKRKEMQLR QEEERRRREEEMMIRQREMEEQMRRQREESYSRMGYMDPRERDMRMGGGGAMNMGDPY GSGGQKFPPLGGGGGIGYEANPGVPPATMSGSMMGSDMRTERFGQGGAGPVGGQGPRG MGPGTPAGYGRGREEYEGPNKKPRF" BASE COUNT 801 a 735 c 804 g 731 t ORIGIN 1 ccgccatttt gtgagaagca aggtggcctc cacgtttcct gagcgtcttc ttcgcttttg 61 cctcgaccgc cccttgacca cagacatgtc tcgggatcgg ttccggagtc gtggcggtgg 121 cggtggtggc ttccacaggc gtggaggagg cggcggccgc ggcggcctcc acgacttccg 181 ttctccgccg cccggcatgg gcctcaatca gaatcgcggc cccatgggtc ctggcccggg 241 ccagagcggc cctaagcctc cgatcccgcc accgcctcca caccaacagc agcaacagcc 301 accaccgcag cagccaccgc cgcagcagcc gccaccgcat cagccgccgc cgcatccaca 361 gccgcatcag cagcagcagc cgccgccacc gccgcaggac tcttccaagc ccgtcgttgc 421 tcagggaccc ggccccgctc ccggagtagg cagcgcacca ccagcctcca gctcggcccc 481 gcccgccact ccaccaacct cgggggcccc gccagggtcc gggccaggcc cgactccgac 541 cccgccgcct gcagtcacct cggcccctcc cggggcgccg ccacccaccc cgccaagcag 601 cggggtccct accacacctc ctcaggccgg aggcccgccg cctccgcccg cggcagtccc 661 gggcccgggt ccagggccta agcagggccc aggtccgggt ggtcccaaag gcggcaaaat 721 gcctggcggg ccgaagccag gtggcggccc gggcctaagt acgcctggcg gccaccccaa 781 gccgccgcat cgaggcggcg gggagccccg cgggggccgc cagcaccacc cgccctacca 841 ccagcagcat caccaggggc ccccgcccgg cgggcccggc ggccgcagcg aggagaagat 901 ctcggactcg gaggggttta aagccaattt gtctctcttg aggaggcctg gagagaaaac 961 ttacacacag cgatgtcggt tgtttgttgg gaatctacct gctgatatca cggaggatga 1021 attcaaaaga ctatttgcta aatatggaga accaggagaa gtttttatca acaaaggcaa 1081 aggattcgga tttattaagc ttgaatctag agctttggct gaaattgcca aagccgaact 1141 ggatgataca cccatgagag gtagacagct tcgagttcgc tttgccacac atgctgctgc 1201 cctttctgtt cgtaatcttt caccttatgt ttccaatgaa ctgttggaag aagcctttag 1261 ccaatttggt cctattgaaa gggctgttgt aatagtggat gatcgtggaa gatctacagg 1321 gaaaggcatt gttgaatttg cttctaagcc agcagcaaga aaggcatttg aacgatgcag 1381 tgaaggtgtt ttcttactga cgacaactcc tcgtccagtc attgtggaac cacttgaaca 1441 actagatgat gaagatggtc ttcctgaaaa acttgcccag aagaatccaa tgtatcaaaa 1501 ggagagagaa acccctcctc gttttgccca gcatggcacg tttgagtacg aatattctca 1561 gcgatggaag tctttggatg aaatggaaaa acagcaaagg gaacaagttg aaaaaaacat 1621 gaaagatgca aaagacaaat tggaaagtga aatggaagat gcctatcatg aacatcaggc 1681 aaatcttttg cgccaagatc tgatgagacg acaggaagaa ttaagacgca tggaagaact 1741 tcacaatcaa gaaatgcaga aacgtaaaga aatgcaattg aggcaagagg aggaacgacg 1801 tagaagagag gaagagatga tgattcgtca acgtgagatg gaagaacaaa tgaggcgcca 1861 aagagaggaa agttacagcc gaatgggcta catggatcca cgggaaagag acatgcgaat 1921 gggtggcgga ggagcaatga acatgggaga tccctatggt tcaggaggcc agaaatttcc 1981 acctctagga ggtggtggtg gcataggtta tgaagctaat cctggcgttc caccagcaac 2041 catgagtggt tccatgatgg gaagtgacat gcgtactgag cgctttgggc agggaggtgc 2101 ggggcctgtg ggtggacagg gtcctagagg aatggggcct ggaactccag caggatatgg 2161 tagagggaga gaagagtacg aaggcccaaa caaaaaaccc cgattttaga tgtgatattt 2221 aggctttcat tccagtttgt tttgtttttt tgtttagata ccaatctttt aaattcttgc 2281 attttagtaa gaaagctatc tttttatgga tgttagcagt ttattgacct aatatttgta 2341 aatggtctgt ttgggcaggt aaaattatgt aatgcagtgt ttggaacagg agaatttttt 2401 tttccttttt atttctttat tttttctttt ttactgtata atgtccctca agtttatggc 2461 agtgtacctt gtgccactga atttccaaag tgtaccaatt tttttttttt tactgtgctt 2521 caaataaata gaaaaatagt tataatattg gatcttcaac tttgccattc atgcttctat 2581 gcatattagg ctacgtattc cacattgaaa gcatgagagt gtctaggcct ttgaatggca 2641 tatgccattt ctgggaaatg catctggagg ctaagtattg ctttctacaa ataattgccc 2701 cctttgtttt aaaaagaaga aatgcatatt gaagtagttt gatgatttgt ttggcatata 2761 ggaagcacgc tggtgctaag tattttttaa atggttatgt aagcaaagct gaactgtaaa 2821 tcttcaggaa tatgtattaa gattgtggaa tgggtgtaag acaattggta gggggtgaaa 2881 gtgggtttga ttaaatggat cttttatggc cctatgatct atcctttact tgaaagcttt 2941 tgaaaagtgg aaaggtcatt ttgttgcatt tccccatttc ttgtttttaa aagaccaaca 3001 aatctcaagc cctataaatg gcttgtattg aacttttaca tttgaattaa agatgttaaa 3061 catgaaaaaa a // LOCUS HSPTKR 3805 bp RNA PRI 06-DEC-1993 DEFINITION H.sapiens HEK2 mRNA for protein tyrosine kinase receptor. ACCESSION X75208 NID g406867 KEYWORDS protein tyrosine kinase receptor; receptor protein tyrosine kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3805) AUTHORS Bohme,B., Holtrich,U., Wolf,G., Luzius,H., Grzeschik,K.H., Strebhardt,K. and Rubsamen-Waigmann,H. TITLE PCR mediated detection of a new human receptor-tyrosine-kinase, HEK 2 JOURNAL Oncogene 8 (10), 2857-2862 (1993) MEDLINE 93390963 REFERENCE 2 (bases 1 to 3805) AUTHORS Boehme,B. TITLE Direct Submission JOURNAL Submitted (23-SEP-1993) B. Boehme, Georg-Speyer-Haus, Paul-Ehrlich-Str. 42-44, D60596 Frankfurt, FRG FEATURES Location/Qualifiers source 1..3805 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="3" gene 28..3024 /gene="HEK2" CDS 28..3024 /gene="HEK2" /EC_number="2.7.1.112" /codon_start=1 /product="protein tyrosine kinase-receptor" /db_xref="PID:g406868" /translation="MARARPPPPPSPPPGLLPLLPPLLLLPLLLLPAGCRALEETLMD TKWVTSELAWTSHPESGWEEVSGYDEAMNPIRTYQVCNVRESSQNNWLRTGFIWRRDV QRVYVELKFTVRDCNSIPNIPGSCKETFNLFYYEADSDVASASSPFWMENPYVKVDTI APDESFSRLDAGRVNTKVRSFGPLSKAGFYLAFQDQGACMSLISVRAFYKKCASTTAG FALFPETLTGAEPTSLVIAPGTCIPNAVEVSVPLKLYCNGDGEWMVPVGACTCATGHE PAAKESQCRPCPPGSYKAKQGEGPCLPCPPNSRTTSPAASICTCHNNFYRADSDSADS ACTTVPSPPRGVISNVNETSLILEWSEPRDLGVRDDLLYNVICKKCHGAGGASACSRC DDNVEFVPRQLGLSEPRVHTSHLLAHTRYTFEVQAVNGVSGKSPLPPRYAAVNITTNQ AAPSEVPTLRLHSSSGSSLTLSWAPPERPNGVILDYEMKYFEKSEGIASTVTSQMNSV QLDGLRPDARYVVQVRARTVAGYGQYSRPAEFETTSERGSGAQQLQEQLPLIVGSATA GLVFVVAVVVIAIVCLRKQRHGSDSEYTEKLQQYIAPGMKVYIDPFTYEDPNEAVREF AKEIDVSCVKIEEVIGAGEFGEVCRGRLKQPGRREVFVAIKTLKVGYTERQRRDFLSE ASIMGQFDHPNIIRLEGVVTKSRPVMILTEFMENCALDSFLRLNDGQFTVIQLVGMLR GIAAGMKYLSEMNYVHRDLAARNILVNSNLVCKVSDFGLSRFLEDDPSDPTYTSSLGG KIPIRWTAPEAIAYRKFTSASDVWSYGIVMWEVMSYGERPYWDMSNQDVINAVEQDYR LPPPMDCPTALHQLMLDCWVRDRNLRPKFSQIVNTLDKLIRNAASLKVIASAQSGMSQ PLLDRTVPDYTTFTTVGDWLDAIKMGRYKESFVSAGFASFDLVAQMTAEDLLRIGVTL AGHQKKILSSIQDMRLQMNQTLPVQV" sig_peptide 28..126 /gene="HEK2" /note="putative" mat_peptide 127..3021 /gene="HEK2" /EC_number="2.7.1.112" /product="protein tyrosine kinase-receptor" misc_feature 1695..1773 /gene="HEK2" /note="transmembrane region" BASE COUNT 743 a 1175 c 1143 g 744 t ORIGIN 1 ggctcggctc ctagagctgc cacggccatg gccagagccc gcccgccgcc gccgccgtcg 61 ccgccgccgg ggcttctgcc gctgctccct ccgctgctgc tgctgccgct gctgctgctg 121 cccgccggct gccgggcgct ggaagagacc ctcatggaca caaaatgggt aacatctgag 181 ttggcgtgga catctcatcc agaaagtggg tgggaagagg tgagtggcta cgatgaggcc 241 atgaatccca tccgcacata ccaggtgtgt aatgtgcgcg agtcaagcca gaacaactgg 301 cttcgcacgg ggttcatctg gcggcgggat gtgcagcggg tctacgtgga gctcaagttc 361 actgtgcgtg actgcaacag catccccaac atccccggct cctgcaagga gaccttcaac 421 ctcttctact acgaggctga cagcgatgtg gcctcagcct cctccccctt ctggatggag 481 aacccctacg tgaaagtgga caccattgca cccgatgaga gcttctcgcg gctggatgcc 541 ggccgtgtca acaccaaggt gcgcagcttt gggccacttt ccaaggctgg cttctacctg 601 gccttccagg accagggcgc ctgcatgtcg ctcatctccg tgcgcgcctt ctacaagaag 661 tgtgcatcca ccaccgcagg cttcgcactc ttccccgaga ccctcactgg ggcggagccc 721 acctcgctgg tcattgctcc tggcacctgc atccctaacg ccgtggaggt gtcggtgcca 781 ctcaagctct actgcaacgg cgatggggag tggatggtgc ctgtgggtgc ctgcacctgt 841 gccaccggcc atgagccagc tgccaaggag tcccagtgcc gcccctgtcc ccctgggagc 901 tacaaggcga agcagggaga ggggccctgc ctcccatgtc cccccaacag ccgtaccacc 961 tccccagccg ccagcatctg cacctgccac aataacttct accgtgcaga ctcggactct 1021 gcggacagtg cctgtaccac cgtgccatct ccaccccgag gtgtgatctc caatgtgaat 1081 gaaacctcac tgatcctcga gtggagtgag ccccgggacc tgggtgtccg ggatgacctc 1141 ctgtacaatg tcatctgcaa gaagtgccat ggggctggag gggcctcagc ctgctcacgc 1201 tgtgatgaca acgtggagtt tgtgcctcgg cagctgggcc tgtcggagcc ccgggtccac 1261 accagccatc tgctggccca cacgcgctac acctttgagg tgcaggcggt caacggtgtc 1321 tcgggcaaga gccctctgcc gcctcgttat gcggccgtga atatcaccac aaaccaggct 1381 gccccgtctg aagtgcccac actacgcctg cacagcagct caggcagcag cctcacccta 1441 tcctgggcac ccccagagcg gcccaacgga gtcatcctgg actacgagat gaagtacttt 1501 gagaagagcg agggcatcgc ctccacagtg accagccaga tgaactccgt gcagctggac 1561 gggcttcggc ctgacgcccg ctatgtggtc caggtccgtg cccgcacagt agctggctat 1621 gggcagtaca gccgccctgc cgagtttgag accacaagtg agagaggctc tggggcccag 1681 cagctccagg agcagcttcc cctcatcgtg ggctccgcta cagctgggct tgtcttcgtg 1741 gtggctgtcg tggtcatcgc tatcgtctgc ctcaggaagc agcgacacgg ctctgattcg 1801 gagtacacgg agaagctgca gcagtacatt gctcctggaa tgaaggttta tattgaccct 1861 tttacctacg aggaccctaa tgaggctgtt cgggagtttg ccaaggagat cgacgtgtcc 1921 tgcgtcaaga tcgaggaggt gatcggagct ggggaatttg gggaagtgtg ccgtggtcga 1981 ctgaaacagc ctggccgccg agaggtgttt gtggccatca agacgctgaa ggtgggctac 2041 accgagaggc agcggcggga cttcctaagc gaggcctcca tcatgggtca gtttgatcac 2101 cccaatataa tccggctcga gggcgtggtc accaaaagtc ggccagttat gatcctcact 2161 gagttcatgg aaaactgcgc cctggactcc ttcctccggc tcaacgatgg gcagttcacg 2221 gtcatccagc tggtgggcat gttgcggggc attgctgccg gcatgaagta cctgtccgag 2281 atgaactatg tgcaccgcga cctggctgct cgcaacatcc ttgtcaacag caacctggtc 2341 tgcaaagtct cagactttgg cctctcccgc ttcctggagg atgacccctc cgatcctacc 2401 tacaccagtt ccctgggcgg gaagatcccc atccgctgga ctgccccaga ggccatagcc 2461 tatcggaagt tcacttctgc tagtgatgtc tggagctacg gaattgtcat gtgggaggtc 2521 atgagctatg gagagcgacc ctactgggac atgagcaacc aggatgtcat caatgccgtg 2581 gagcaggatt accggctgcc accacccatg gactgtccca cagcactgca ccagctcatg 2641 ctggactgct gggtgcggga ccggaacctc aggcccaaat tctcccagat tgtcaatacc 2701 ctggacaagc tcatccgcaa tgctgccagc ctcaaggtca ttgccagcgc tcagtctggc 2761 atgtcacagc ccctcctgga ccgcacggtc ccagattaca caaccttcac gacagttggt 2821 gattggctgg atgccatcaa gatggggcgg tacaaggaga gcttcgtcag tgcggggttt 2881 gcatcttttg acctggtggc ccagatgacg gcagaagacc tgctccgtat tggggtcacc 2941 ctggccggcc accagaagaa gatcctgagc agtatccagg acatgcggct gcagatgaac 3001 cagacgctgc ctgtgcaggt ctgacaccgg ctcccacggg gaccctgagg accgtgcagg 3061 gatgccaagc agccggctgg actttcggac tcttggactt ttggatgcct ggccttaggc 3121 tgtggcccag aagctggaag tttgggaaag gcccaagctg ggacttctcc aggcctgtgt 3181 tccctcccca ggaagtgcgc cccaaacctc ttcatattga agatggatta ggagaggggg 3241 tgatgacccc tccccaagcc cctcagggcc cagaccttcc tgctctccag caggggatcc 3301 ccacaacctc acacttgtct gttcttcagt gctggaggtc ctggcagggt caggctgggg 3361 taagccgggg ttccacaggg cccagccctg gcaggggtct ggccccccag gtaggcggag 3421 agcagtccct ccctcaggaa ctggaggagg ggactccagg aatggggaaa tgtgacacca 3481 ccatcctgaa gccagcttgc acctccagtt tgcacaggga tttgtcctgg gggctgaggg 3541 ccctgtcccc acccccgccc ttggtgctgt cataaaaggg caggcagggg caggctgagg 3601 agttgcccgt tgccccccag agactgactc tcagagccag agatgggatg tgtgagtgtg 3661 tgtgtgtgtg tgtgcgcgcg cgcgcgcgtg tgtgtgtgca cgcactggcc tgcacagaga 3721 gcatgggtga gcgtgtaaaa gcttggccct gtgccctaca gtggggacag ctgggccgac 3781 agcagaataa aggcaataag atgaa // LOCUS HSPTP1D 1980 bp RNA PRI 03-AUG-1994 DEFINITION H.sapiens mRNA for phosphotyrosine phosphatase. ACCESSION X70766 NID g35783 KEYWORDS phosphotyrosine phosphatase; SH2 domain; src homology 2. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1980) AUTHORS Ullrich,A. TITLE Direct Submission JOURNAL Submitted (20-JAN-1993) A. Ullrich, Max-Planck-Inst fuer Biochemie, Dept of Molecular Biology, Am Klopferspitz 18A, 8033 Martinsried, FRG REFERENCE 2 (bases 1 to 1980) AUTHORS Vogel,W., Lammers,R., Huang,J. and Ullrich,A. TITLE Activation of a phosphotyrosine phosphatase by tyrosine phosphorylation JOURNAL Science 259 (5101), 1611-1614 (1993) MEDLINE 93206095 COMMENT Related sequences: L03535 & D13540 Related sequences: L03535 & D13540. FEATURES Location/Qualifiers source 1..1980 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="SK-BR-3" /clone_lib="lambda zap II" gene 130..1911 /gene="PTP 1D" CDS 130..1911 /gene="PTP 1D" /EC_number="3.1.3.48" /codon_start=1 /product="protein-tyrosine phosphatase" /db_xref="PID:g35784" /db_xref="SWISS-PROT:Q06124" /translation="MTSRRWFHPNITGVEAENLLLTRGVDGSFLARPSKSNPGDFTLS VRRNGAVTHIKIQNTGDYYDLYGGEKFATLAELVQYYMEHHGQLKEKNGDVIELKYPL NCADPTSERWFHGHLSGKEAEKLLTEKGKHGSFLVRESQSHPGDFVLSVRTGDDKGES NDGKSKVTHVMIRCQELKYDVGGGERFDSLTDLVEHYKKNPMVETLGTVLQLKQPLNT TRINAAEIESRVRELSKLAETTDKVKQGFWEEFETLQQQECKLLYSRKEGQRQENKNK NRYKNILPFDHTRVVLHDGDPNEPVSDYINANIIMPEFETKCNNSKPKKSYIATQGCL QNTVNDFWRMVFQENSRVIVMTTKEVERGKSKCVKYWPDEYALKEYGVMRVRNVKESA AHDYTLRELKLSKVGQGNTERTVWQYHFRTWPDHGVPSDPGGVLDFLEEVHHKQESIM DAGPVVVHCSAGIGRTGTFIVIDILIDIIREKGVDCDIDVPKTIQMVRSQRSGMVQTE AQYRFIYMAVQHYIETLQRRIEEEQKSKRKGHEYTNIKYSLADQTSGDQSPLPPCTPT PPCAEMREDSARVYENVGLMQQQKSFR" misc_feature 145..432 /gene="PTP 1D" /note="SH2 domain" misc_feature 463..774 /gene="PTP 1D" /note="SH2 domain" misc_feature 955..1680 /gene="PTP 1D" /note="PTP domain" BASE COUNT 620 a 416 c 521 g 423 t ORIGIN 1 ggcacgagcg gctggctctg cccgcgtccg gtcccgagcg ggcctccctc gggccagccc 61 gatgtgaccg agcccagcgg agcctgagca aggagcgggt ccgtcgcgga gccggagggc 121 gggaggaaca tgacatcgcg gagatggttt cacccaaata tcactggtgt ggaggcagaa 181 aacctactgt tgacaagagg agttgatggc agttttttgg caaggcctag taaaagtaac 241 cctggagact tcacactttc cgttagaaga aatggagctg tcacccacat caagattcag 301 aacactggtg attactatga cctgtatgga ggggagaaat ttgccacttt ggctgagttg 361 gtccagtatt acatggaaca tcacgggcaa ttaaaagaga agaatggaga tgtcattgag 421 cttaaatatc ctctgaactg tgcagatcct acctctgaaa ggtggtttca tggacatctc 481 tctgggaaag aagcagagaa attattaact gaaaaaggaa aacatggtag ttttcttgta 541 cgagagagcc agagccaccc tggagatttt gttctttctg tgcgcactgg tgatgacaaa 601 ggggagagca atgacggcaa gtctaaagtg acccatgtta tgattcgctg tcaggaactg 661 aaatacgacg ttggtggagg agaacggttt gattctttga cagatcttgt ggaacattat 721 aagaagaatc ctatggtgga aacattgggt acagtactac aactcaagca gccccttaac 781 acgactcgta taaatgctgc tgaaatagaa agcagagttc gagaactaag caaattagct 841 gagaccacag ataaagtcaa acaaggcttt tgggaagaat ttgagacact acaacaacag 901 gagtgcaaac ttctctacag ccgaaaagag ggtcaaaggc aagaaaacaa aaacaaaaat 961 agatataaaa acatcctgcc ctttgatcat accagggttg tcctacacga tggtgatccc 1021 aatgagcctg tttcagatta catcaatgca aatatcatca tgcctgaatt tgaaaccaag 1081 tgcaacaatt caaagcccaa aaagagttac attgccacac aaggctgcct gcaaaacacg 1141 gtgaatgact tttggcggat ggtgttccaa gaaaactccc gagtgattgt catgacaacg 1201 aaagaagtgg agagaggaaa gagtaaatgt gtcaaatact ggcctgatga gtatgctcta 1261 aaagaatatg gcgtcatgcg tgttaggaac gtcaaagaaa gcgccgctca tgactatacg 1321 ctaagagaac ttaaactttc aaaggttgga caagggaata cggagagaac ggtctggcaa 1381 taccactttc ggacctggcc ggaccacggc gtgcccagcg accctggggg cgtgctggac 1441 ttcctggagg aggtgcacca taagcaggag agcatcatgg atgcagggcc ggtcgtggtg 1501 cactgcagtg ctggaattgg ccggacaggg acgttcattg tgattgatat tcttattgac 1561 atcatcagag agaaaggtgt tgactgcgat attgacgttc ccaaaaccat ccagatggtg 1621 cggtctcaga ggtcagggat ggtccagaca gaagcacagt accgatttat ctatatggcg 1681 gtccagcatt atattgaaac actacagcgc aggattgaag aagagcagaa aagcaagagg 1741 aaagggcacg aatatacaaa tattaagtat tctctagcgg accagacgag tggagatcag 1801 agccctctcc cgccttgtac tccaacgcca ccctgtgcag aaatgagaga agacagtgct 1861 agagtctatg aaaacgtggg cctgatgcaa cagcagaaaa gtttcagatg agaaaacctg 1921 ccaaaacttc agcacagaaa tagatgtgga ctttcacctc tccctaaaaa gatcaggacc // LOCUS HSPTPAA 2661 bp RNA PRI 28-JUN-1994 DEFINITION H.sapiens hPTPA mRNA. ACCESSION X73478 NID g509242 KEYWORDS phosphotyrosyl phosphatase activator. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2661) AUTHORS Cayla,X., Van Hoof,C., Bosch,M., Waelkens,E., Vandekerckhove,J., Peeters,B., Merlevede,W. and Goris,J. TITLE Molecular cloning, expression, and characterization of PTPA, a protein that activates the tyrosyl phosphatase activity of protein phosphatase 2A JOURNAL J. Biol. Chem. 269 (22), 15668-15675 (1994) MEDLINE 94253154 REFERENCE 2 (bases 1 to 2661) AUTHORS Goris,J. TITLE Direct Submission JOURNAL Submitted (21-JUN-1993) J. Goris, Katholieke Universiteit Leuven, Faculteit der Geneeskunde, Campus Gasthuisberg Herestraat, B-3000 Leuven, BELGIUM FEATURES Location/Qualifiers source 1..2661 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="heart" /clone="H6" gene 190..2638 /gene="PTPA" CDS 190..1161 /gene="PTPA" /codon_start=1 /product="phosphotyrosyl phosphatase activator" /db_xref="PID:g509243" /translation="MAEGERQPPPDSSEEAPPATQNFIIPKKEIHTVPDMGKWKRSQA YADYIGFILTLNEGVKGKKLTFEYRVSEAIEKLLALLNTLDRWIDETPPVDQPSRFGN KAYRTWYAKLDEEAENLVATVVPTHLAAAVPEVAVYLKESVGNSTRIDYGTGHEAAFA AFLCCLCKIGVLRVDDQIAIVFKVFNRYLEVMRKLQKTYRMEPAGSQGVWGLDDFQFL PFIWGSSQLIDHPYLEPRHFVDEKAVNENHKDYMFLECILFITEMKTGPFAEHSNQLW NISAVPSWSKVNQGLIRMYKAECLEKFPVIQHFKFGSLLPIHPVTSG" polyA_signal 2613..2618 /gene="PTPA" polyA_site 2638 /gene="PTPA" BASE COUNT 509 a 764 c 789 g 599 t ORIGIN 1 ccggcaccga catggcggcc gtcttcgctg tggtgacttt aactctcggt tttcggttct 61 agccggccgg cgctcacttg tcttcaggaa gctcggagcc tttggtggag ccggggagag 121 gaagggtggg tgcaagagtg aaaggcgaga ggggactgca agcatccggg tcgctgctgg 181 ccggagcaga tggctgaggg cgagcggcag ccgccgccag attcttcaga ggaggcccct 241 ccagccactc agaacttcat cattccaaaa aaggagatcc acacagttcc agacatgggc 301 aaatggaagc gttctcaggc atacgctgac tacatcggat tcatccttac cctcaacgaa 361 ggtgtgaagg ggaagaagct gaccttcgag tacagagtct ccgaggccat tgagaaacta 421 ctcgctcttc tcaacacgct ggacaggtgg attgatgaga ctcctccagt ggaccagccc 481 tctcggtttg ggaataaggc atacaggacc tggtatgcca aacttgatga ggaagcagaa 541 aacttggtgg ccacagtggt ccctacccat ctggcagctg ctgtgcctga ggtggctgtt 601 tacctaaagg agtcagtggg gaactccacg cgcattgact acggcacagg gcatgaggca 661 gccttcgctg ctttcctctg ctgtctctgc aagattgggg tgctccgggt ggatgaccaa 721 atagctattg tcttcaaggt gttcaatcgg taccttgagg ttatgcggaa actccagaaa 781 acatacagga tggagccagc cggcagccag ggagtgtggg gtctggatga cttccagttt 841 ctgcccttca tctggggcag ttcgcagctg atagaccacc catacctgga gcccagacac 901 tttgtggatg agaaggccgt gaatgagaac cacaaggact acatgttcct ggagtgtatc 961 ctgtttatta ccgagatgaa gactggccca tttgcagagc actccaacca gctgtggaac 1021 atcagcgccg tgccttcctg gtccaaagtg aaccagggtc tcatccgcat gtataaggcc 1081 gagtgcctgg agaagttccc tgtgatccag cacttcaagt tcgggagcct gctgcccatc 1141 catcctgtca cgtcgggcta ggagggccaa gccgaagagc cacccaggcc acagttcctg 1201 tgcctgcctt ccccacccca gcagtggccc ctcccccatc ccctccctct gttcgtcccg 1261 tttgatgaga ggctgtttac tggggtgggg tggcgagatg ggcttgaggg ggctcagagc 1321 ataaggcttc agggcccaag ttgggagaag tgaccaaagt gtagccagtt ttctgagttc 1381 ccgtgtgcta gactggccag aagagagggt ctggggcctg gtcactcggc cactctctcc 1441 tgtttctggc ctcttctccc ttcactcccg gtccagtctg gttttgagag caggggctgt 1501 tctgcagcac ctcagggaag ggaggagaga tacctgctgc ttccattgct tttcccttcc 1561 tggagtcgat gcctttctaa gggttggagc tgctccttgc aggggcgggt cagtttccca 1621 ggccatgccg gggtggccat ctatgctagg gctggaagct gagctggccg ccagctgtgg 1681 gctggggtgg ggtgggtggg gtcgggtggt ggagaggcct tagctgtcct gctggtgccc 1741 ctcccaggct ccttttcacc ctgccccctg cctgaggccc cctgtgtcca agcctccccc 1801 tggctcttca gttctctagc ccttggcttt gctgggtttc ctgactgtag ccacatctct 1861 cccgctccct aagggtaacc tagccaatgg aagctggcct ttgggtaggt gctgggctcc 1921 tgggagggcc cagatgatgg gtgaggcatg tctttccaga actttcctgg cagggagggg 1981 atggcagaaa ctcagggagg cttggggccc attgtatctg gagagcctgg attcctcttg 2041 gcagtcttag cccagccact tctgctacct ttgcgctgct gtgagcctca ccctgcccct 2101 gggccctgct tctctgctcc cctgggtgat gggtgggccc agaaggtggc agtcccacac 2161 cttgtcctcc cacctccctg aactgtccat tgcttttata gggtgaggta agtgacagcc 2221 tcccaagccc aggctttggc actcagaatg ggcccagtgg gggctgggca gcccattgag 2281 ggccaccgcc gaggcgcgag gtttctccta gggctgttcc tgggcctggc tcttacaggc 2341 ttggtcagga gggctggcct tcttcactgc cccctcctgt gtctgggtcc acacaccctt 2401 cagtaaccaa cggcactgag aagcacagca caggggctca gcctgggatc cggtgatggt 2461 ctgggcagag gctgggtcag gagtcccaaa ggtcagtgac agtttctcag aagaggccca 2521 gcgtccacct ctctcccagg gccagacacc ccttcctggc tcccccatcc ccctatggct 2581 cccagcccct tgcaccctca ttgctgttca gattaaagcc tctgttttgc acctgtcaaa 2641 aaaaaaaaaa aaaaaaaaaa a // LOCUS HSPTPB 6075 bp RNA PRI 03-APR-1991 DEFINITION Human HPTP beta mRNA for protein tyrosine phosphatase beta. ACCESSION X54131 NID g35787 KEYWORDS HPTP gene; protein tyrosine phosphatase; transmembrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6075) AUTHORS Saito,H. TITLE Direct Submission JOURNAL Submitted (24-JUL-1990) Saito H., Dana Farber Cancer Institute, Harvard Medical School, 44 Binney Street, Boston MA 02115, U S A REFERENCE 2 (bases 1 to 6075) AUTHORS Krueger,N.X., Streuli,M. and Saito,H. TITLE Structural diversity and evolution of human receptor-like protein tyrosine phosphatases JOURNAL EMBO J. 9 (10), 3241-3252 (1990) MEDLINE 91006018 COMMENT See also X54130-X54135. FEATURES Location/Qualifiers source 1..6075 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="human placental cDNA" /clone="HPTP beta 28, 65, 68, 74, 126 & 132" mRNA 1..6075 /gene="HPTP beta" gene 1..6075 /gene="HPTP beta" sig_peptide 31..96 /gene="HPTP beta" /note="product is human protein tyrosine phosphatse beta" CDS 31..6024 /gene="HPTP beta" /EC_number="3.1.3.48" /codon_start=1 /product="protein-tyrosine phosphatase" /db_xref="PID:g35788" /db_xref="SWISS-PROT:P23467" /translation="MLSHGAGLALWITLSLLQTGLAEPERCNFTLAESKASSHSVSIQ WRILGSPCNFSLIYSSDTLGAALCPTFRIDNTTYGCNLQDLQAGTIYNFKIISLDEER TVVLQTDPLPPARFGVSKEKTTSTGLHVWWTPSSGKVTSYEVQLFDENNQKIQGVQIQ ESTSWNEYTFFNLTAGSKYNIAITAVSGGKRSFSVYTNGSTVPSPVKDIGISTKANSL LISWSHGSGNVERYRLMLMDKGILVHGGVVDKHATSYAFHGLSPGYLYNLTVMTEAAG LQNYRWKLVRTAPMEVSNLKVTNDGSLTSLKVKWQRPPGNVDSYNITLSHKGTIKESR VLAPWITETHFKELVPGRLYQVTVSCVSGELSAQKMAVGRTFPDKVANLEANNNGRMR SLVVSWSPPAGDWEQYRILLFNDSVVLLNITVGKEETQYVMDDTGLVPGRQYEVEVIV ESGNLKNSERCQGRTVPLAVLQLRVKHANETSLSIMWQTPVAEWEKYIISLADRDLLL IHKSLSKDAKEFTFTDLVPGRKYMATVTSISGDLKNSSSVKGRTVPAQVTDLHVANQG MTSSLFTNWTQAQGDVEFYQVLLIHENVVIKNESISSETSRYSFHSLKSGSLYSVVVT TVSGGISSRQVVVEGRTVPSSVSGVTVNNSGRNDYLSVSWLVAPGDVDNYEVTLSHDG KVVQSLVIAKSVRECSFSSLTPGRLYTVTITTRSGKYENHSFSQERTVPDKVQGVSVS NSARSDYLRVSWVHATGDFDHYEVTIKNKNNFIQTKSIPKSENECVFVQLVPGRLYSV TVTTKSGQYEANEQGNGRTIPEPVKDLTLRNRSTEDLHVTWSGANGDVDQYEIQLLFN DMKVFPPFHLVNTATEYRFTSLTPGRQYKILVLTISGDVQQSAFIEGFTVPSAVKNIH ISPNGATDSLTVNWTPGGGDVDSYTVSAFRHSQKVDSQTIPKHVFEHTFHRLEAGEQY QIMIASVSGSLKNQINVVGRTVPASVQGVIADNAYSSYSLIVSWQKAAGVAERYDILL LTENGILLRNTSEPATTKQHKFEDLTPGKKYKIQILTVSGGLFSKEAQTEGRTVPAAV TDLRITENSTRHLSFRWTASEGELSWYNIFLYNPDGNLQERAQVDPLVQSFSFQNLLQ GRMYKMVIVTHSGELSNESFIFGRTVPASVSHLRGSNRNTTDSLWFNWSPASGDFDFY ELILYNPNGTKKENWKDKDLTEWRFQGLVPGRKYVLWVVTHSGDLSNKVTAESRTAPS PPSLMSFADIANTSLAITWKGPPDWTDYNDFELQWLPRDALTVFNPYNNRKSEGRIVY GLRPGRSYQFNVKTVSGDSWKTYSKPIFGSVRTKPDKIQNLHCRPQNSTAIACSWIPP DSDFDGYSIECRKMDTQEVEFSRKLEKEKSLLNIMMLVPHKRYLVSIKVQSAGMTSEV VEDSTITMIDRPPPPPPHIRVNEKDVLISKSSINFTVNCSWFSDTNGAVKYFTVVVRE ADGSDELKPEQQHPLPSYLEYRHNASIRVYQTNYFASKCAENPNSNSKSFNIKLGAEM ESLGGKRDPTQQKFCDGPLKPHTAYRISIRAFTQLFDEDLKEFTKPLYSDTFFSLPIT TESEPLFGAIEGVSAGLFLIGMLVAVVALLICRQKVSHGRERPSARLSIRRDRPLSVH LNLGQKGNRKTSCPIKINQFEGHFMKLQADSNYLLSKEYEELKDVGRNQSCDIALLPE NRGKNRYNNILPYDATRVKLSNVDDDPCSDYINASYIPGNNFRREYIVTQGPLPGTKD DFWKMVWEQNVHNIVMVTQCVEKGRVKCDHYWPADQDSLYYGDLILQMLSESVLPEWT IREFKICGEEQLDAHRLIRHFHYTVWPDHGVPETTQSLIQFVRTVRDYINRSPGAGPT VVHCSAGVGRTGTFIALDRILQQLDSKDSVDIYGAVHDLRLHRVHMVQTECQYVYLHQ CVRDVLRARKLRSEQENPLFPIYENVNPEYHRDPVYSRH" mat_peptide 97..6021 /gene="HPTP beta" /EC_number="3.1.3.48" /product="protein-tyrosine phosphatase" misc_feature 4894..4956 /gene="HPTP beta" /note="transmembrane region" BASE COUNT 1717 a 1441 c 1447 g 1470 t ORIGIN 1 gtctcctctg gatcttaact actgagcgca atgctgagcc atggagccgg gttggccttg 61 tggatcacac tgagcctgct gcagactgga ctggcggagc cagagagatg taacttcacc 121 ctggcggagt ccaaggcctc cagccattct gtgtctatcc agtggagaat tttgggctca 181 ccctgtaact ttagcctcat ctatagcagt gacaccctgg gggccgcgtt gtgccctacc 241 tttcggatag acaacaccac atacggatgt aaccttcaag atttacaagc aggaaccatc 301 tataacttca agattatttc tctggatgaa gagagaactg tggtcttgca aacagatcct 361 ttacctcctg ctaggtttgg agtcagtaaa gagaagacga cttcaaccgg cttgcatgtt 421 tggtggactc cttcttccgg aaaagtcacc tcatatgagg tgcaattatt tgatgaaaat 481 aaccaaaaga tacagggggt tcaaattcaa gaaagtactt catggaatga atacactttt 541 ttcaatctca ctgctggtag taaatacaat attgccatca cagctgtttc tggaggaaaa 601 cgttcttttt cagtttatac caatggatca acagtgccat ctccagtgaa agatattggt 661 atttccacaa aagccaattc tctcctgatt tcctggtccc atggttctgg gaatgtggaa 721 cgataccggc tgatgctaat ggataaaggg atcctagttc atggcggtgt tgtggacaaa 781 catgctactt cctatgcttt tcacgggctg tcccctggct acctctacaa cctcactgtt 841 atgactgagg ctgcagggct gcaaaactac aggtggaaac tagtcaggac agcccccatg 901 gaagtctcaa atctgaaggt gacaaatgat ggcagtttga cctctctaaa agtcaaatgg 961 caaagacctc ctggaaatgt ggattcttac aatatcaccc tgtctcacaa agggaccatc 1021 aaggaatcca gagtattagc accttggatt actgaaactc actttaaaga gttagtcccc 1081 ggtcgacttt atcaagttac tgtcagctgt gtctctggtg aactgtctgc tcagaagatg 1141 gcagtgggca gaacatttcc agacaaagtt gcaaacctgg aggcaaacaa taatggcagg 1201 atgaggtctc ttgtagtgag ctggtcgccc cctgctggag actgggagca gtatcggatc 1261 ctactcttca atgattctgt ggtgctgctc aacatcactg tgggaaagga agaaacacag 1321 tatgtcatgg atgacacggg gctcgtaccg ggaagacagt atgaggtgga agtcattgtt 1381 gagagtggaa atttgaagaa ttctgagcgt tgccaaggca ggacagtccc cctggctgtc 1441 ctccagcttc gtgtcaaaca tgccaatgaa acctcactga gtatcatgtg gcagacccct 1501 gtagcagaat gggagaaata catcatttcc ctagctgaca gagacctctt actgatccac 1561 aagtcactct ccaaagatgc caaagaattc acttttactg acctggtgcc tggacgaaaa 1621 tacatggcta cagtcaccag tattagtgga gacttaaaaa attcctcttc agtaaaagga 1681 agaacagtgc ctgcccaagt gactgacttg catgtggcca accaaggaat gaccagtagt 1741 ctgtttacta actggaccca ggcacaagga gacgtagaat tttaccaagt cttactgatc 1801 catgaaaatg tggtcattaa aaatgaaagc atctccagtg agaccagcag atacagcttc 1861 cactctctca agtccggcag cctgtactcc gtggtggtaa caacagtgag tggagggatc 1921 tcttcccgac aagtggttgt ggagggaaga acagtccctt ccagtgtgag tggagtaacg 1981 gtgaacaatt ccggtcgtaa tgactacctc agcgtttcct ggctcgtggc gcccggagat 2041 gtggataact atgaggtaac attgtctcat gacggcaagg tggttcagtc ccttgtcatt 2101 gccaagtctg tcagagaatg ttccttcagc tccctcaccc caggccgcct ctacaccgtg 2161 accataacta caaggagtgg caagtatgaa aatcactcct tcagccaaga gcggacagtg 2221 cctgacaaag tccagggagt cagtgttagc aactcagcca ggagtgacta tttaagggta 2281 tcctgggtgc atgccactgg agactttgat cactatgaag tcaccattaa aaacaaaaac 2341 aacttcattc aaactaaaag cattcccaag tcagaaaacg aatgtgtatt tgttcagcta 2401 gtccctggac ggttgtacag tgtcactgtt actacaaaaa gtggacaata tgaagccaat 2461 gaacaaggga atgggagaac aattccagag cctgttaagg atctaacatt gcgcaacagg 2521 agcactgagg acttgcatgt gacttggtca ggagctaatg gggatgtcga ccaatatgag 2581 atccagctgc tcttcaatga catgaaagta tttcctcctt ttcaccttgt aaataccgca 2641 accgagtatc gatttacttc cctaacacca ggccgccaat acaaaattct tgtcttgacg 2701 attagcgggg atgtacagca gtcagccttc attgagggct tcacagttcc tagtgctgtc 2761 aaaaatattc acatttctcc caatggagca acagatagcc tgacggtgaa ctggactcct 2821 ggtgggggag acgttgattc ctacacggtg tcggcattca ggcacagtca aaaggttgac 2881 tctcagacta ttcccaagca cgtctttgag cacacgttcc acagactgga ggccggggag 2941 cagtaccaga tcatgattgc ctcagtcagc gggtccctga agaatcagat aaatgtggtt 3001 gggcggacag ttccagcatc tgtccaagga gtaattgcag acaatgcata cagcagttat 3061 tccttaatag taagttggca aaaagctgct ggtgtggcag aaagatatga tatcctgctt 3121 ctaactgaaa atggaatcct tctgcgcaac acatcagagc cagccaccac taagcaacac 3181 aaatttgaag atctaacacc aggcaagaaa tacaagatac agatcctaac tgtcagtgga 3241 ggcctcttta gcaaggaagc ccagactgaa ggccgaacag tcccagcagc tgtcaccgac 3301 ctgaggatca cagagaactc caccaggcac ctgtccttcc gctggaccgc ctcagagggg 3361 gagctcagct ggtacaacat ctttttgtac aacccagatg ggaatctcca ggagagagct 3421 caagttgacc cactagtcca gagcttctct ttccagaact tgctacaagg cagaatgtac 3481 aagatggtga ttgtaactca cagtggggag ctgtctaatg agtctttcat atttggtaga 3541 acagtcccag cctctgtgag tcatctcagg gggtccaatc ggaacacgac agacagcctt 3601 tggttcaact ggagtccagc ctctggggac tttgactttt atgagctgat tctctataat 3661 cccaatggca caaagaagga aaactggaaa gacaaggacc tgacggagtg gcggtttcaa 3721 ggccttgttc ctggaaggaa gtacgtgctg tgggtggtaa ctcacagtgg agatctcagc 3781 aataaagtca cagcggagag cagaacagct ccaagtcctc ccagtcttat gtcatttgct 3841 gacattgcaa acacatcctt ggccatcacg tggaaagggc ccccagactg gacagactac 3901 aacgactttg agctgcagtg gttgcccaga gatgcactta ctgtcttcaa cccctacaac 3961 aacagaaaat cagaaggacg cattgtgtat ggtcttcgtc cagggagatc ctatcaattc 4021 aacgtcaaga ctgtcagtgg tgattcctgg aaaacttaca gcaaaccaat ttttggatct 4081 gtgaggacaa agcctgacaa gatacaaaac ctgcattgcc ggcctcagaa ctccacggcc 4141 attgcctgtt cttggatccc tcctgattct gactttgatg gttatagtat tgaatgccgg 4201 aaaatggaca cccaagaagt tgagttttcc agaaagctgg agaaagaaaa atctctgctc 4261 aacatcatga tgctagtgcc ccataagagg tacctggtgt ccatcaaagt gcagtcggcc 4321 ggcatgacca gcgaggtggt tgaagacagc actatcacaa tgatagaccg cccccctcct 4381 ccacccccac acattcgtgt gaatgaaaag gatgtgctaa ttagcaagtc ttccatcaac 4441 tttactgtca actgcagctg gttcagcgac accaatggag ctgtgaaata cttcacagtg 4501 gtggtgagag aggctgatgg cagtgatgag ctgaagccag aacagcagca ccctctccct 4561 tcctacctgg agtacaggca caatgcctcc attcgggtgt atcagactaa ttattttgcc 4621 agcaaatgtg ccgaaaatcc taacagcaac tccaagagtt ttaacattaa gcttggagca 4681 gagatggaga gcttaggtgg aaaacgcgat cccactcagc aaaaattctg tgatggacca 4741 ctgaagccac acactgccta cagaatcagc attcgagctt ttacacagct ctttgatgag 4801 gacctgaagg aattcacaaa gccactctat tcagacacat ttttttcttt acccatcact 4861 actgaatcag agcccttgtt tggagctatt gaaggtgtga gtgctggtct gtttttaatt 4921 ggcatgctag tggctgttgt tgccttattg atctgcagac agaaagtgag ccatggtcga 4981 gaaagaccct ctgcccgtct gagcattcgt agggatcgac cattatctgt ccacttaaac 5041 ctgggccaga aaggtaaccg gaaaacttct tgtccaataa aaataaatca gtttgaaggg 5101 catttcatga agctacaggc tgactccaac taccttctat ccaaggaata cgaggagtta 5161 aaagacgtgg gccgaaacca gtcatgtgac attgcactct tgccggagaa tagagggaaa 5221 aatcgataca acaatatatt gccctatgat gccacgcgag tgaagctctc caatgtagat 5281 gatgatcctt gctctgacta catcaatgcc agctacatcc ctggcaacaa cttcagaaga 5341 gaatacattg tcactcaggg accgcttcct ggcaccaagg atgacttctg gaaaatggtg 5401 tgggaacaaa acgttcacaa catcgtcatg gtgacccagt gtgttgagaa gggccgagta 5461 aagtgtgacc attactggcc agcggaccag gattccctct actatgggga cctcatcctg 5521 cagatgctct cagagtccgt cctgcctgag tggaccatcc gggagtttaa gatatgcggt 5581 gaggaacagc ttgatgcaca cagactcatc cgccactttc actatacggt gtggccagac 5641 catggagtcc cagaaaccac ccagtctctg atccagtttg tgagaactgt cagggactac 5701 atcaacagaa gcccgggtgc tgggcccact gtggtgcact gcagtgctgg tgtgggtagg 5761 actggaacct ttattgcatt ggaccgaatc ctccagcagt tagactccaa agactctgtg 5821 gacatttatg gagcagtgca cgacctaaga cttcacaggg ttcacatggt ccagactgag 5881 tgtcagtatg tctacctaca tcagtgtgta agagatgtcc tcagagcaag aaagctacgg 5941 agtgaacaag aaaacccctt gtttccaatc tatgaaaatg tgaatccaga gtatcacaga 6001 gatccagtct attcaaggca ttgagaatgt acctgaagag ctcctggata aaaattattc 6061 actgtgtgat ttgtt // LOCUS HSPTPD1 4080 bp RNA PRI 19-AUG-1994 DEFINITION H.sapiens mRNA for protein-tyrosine-phosphatase D1. ACCESSION X79510 NID g532055 KEYWORDS protein-tyrosine-phosphatase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4080) AUTHORS Moller,N.P., Moller,K.B., Lammers,R., Kharitonenkov,A., Sures,I. and Ullrich,A. TITLE Src kinase associates with a member of a distinct subfamily of protein-tyrosine phosphatases containing an ezrin-like domain JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91 (16), 7477-7481 (1994) MEDLINE 94329538 REFERENCE 2 (bases 1 to 4080) AUTHORS Moller,N.P.H. TITLE Direct Submission JOURNAL Submitted (31-MAY-1994) N.P.H. Moller, Max Planck Inst. fuer Biochemie, Dept of Mol. Biology, Am Klopferspitz 18 A, 82152 Martinsried, FRG FEATURES Location/Qualifiers source 1..4080 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="skeletal muscle" CDS 333..3857 /EC_number="3.1.3.48" /codon_start=1 /product="protein-tyrosine-phosphatase" /db_xref="PID:g532056" /translation="MPLPFGLKLKRTRRYTVSSKSCLVARIQLLNNEFVEFTLSVEST GQESLEAVAQRLELREVTYFSLWYYNKQNQRRWVDLEKPLKKQLDKYALEPTVYFGVV FYVPSVSQLQQEITRYQYYLQLKKDILEGSIPCTLEQAIQLAGLAVQADFGDFDQYES QDFLQKFALFPVGWLQDEKVLEEATQKVALLHQKYRGLTAPDAEMLYMQEVERMDGYG EESYPAKDSQGSDISIGACLEGIFVKHKNGRHPVVFRWHDIANMSHNKSFFALELANK EETIQFQTEDMETAKYIWRLCVARHKFYRLNQCNLQTQTVTVNPIRRRSSSRMSLPKP QPYVMPPPPQLHYNGHYTEPYASSQDNLFVPNQNGYYCHSQTSLDRAQIDFNGRIRNG SVYSAHSTNSLNNPQPYLQPSPMSSNPSITGSDVMRPDYLPSHRHSAVIPPSYRPTPD YETVMKQLNRGLVHAERQSHSLRNLNIGSSYAYSRPAALVYSQPEIREHAQLPSPAAA HCPFSLSYSFHSPSPYPYPAERRPVVGAVSVPELTNAQLQAQDYPSPNIMRTQVYRPP PPYPPPRPANSTPDLSRHLYISSSNPDLITRRVHHSVQTFQEDSLPVAHSLQEVSEPL TAARHAQLHKRNSIEVAGLSHGLEGLRLKERTLSASAAEVAPRAVSVGSQPSVFTERT QREGPEEAEGLRYGHKKSLSDATMLIHSSEEEEDEDFEEESGARAPPARAREPRPGLA QDPPGCPRVLLAGPLHILEPKAHVPDAEKRMMDSSPVRTTAEAQRPWRDGLLMPSMSE SDLTTSGRYRARRDSLKKRPVSDLLSGKKNIVEGLPPLGGMKKTRVDAKKIGPLKLAA LNGLSLSRVPLPDEGKEVATRATNDERCKILEQRLEQGMVFTEYERILKKRLVDGECS TARLPENAERNRFQDVLPYDDARVELVPTKENNTGYINASHIKVSVSGIEWDYIATQG PLQNTCQDFWQMVWEQGIAIIAMVTAEEEGGREKSFRYWPRLGSRHNTVTYGRFKITT RFRTDSGCYATTGLKMKHLLTGQERTVWHLQYTDWPEHGCPEDLKGFLSYLEEIQSVR RHTNSTSDPQSPNPPLLVHCSAGVGRTGVVILSEIMIACLEHNEVLDIPRVLDMLRQQ RMMLVQTLCQYTFVYRVLIQFLKSSRLI" BASE COUNT 1001 a 1137 c 1145 g 797 t ORIGIN 1 gcccatgagc gcgccgcggc ccgggctggc gtgcgggtgc ggctgcggcg gccgcgcggc 61 ggggccccgg gaggcgggtc gctgagcggg gcgcgcggcc ccgaggatgc gggagcggga 121 gcgggagcag cgctggcgtc aatgctccct tcctcgggcc attggagact ccgttgcttt 181 ttaatggcgg cagcggctgc tgggtgagca gctggaggcc ggacagtgtt cgtcccatcc 241 ggagaggatc gctttctcct ggcgtcacca gcgctgggtt ggtgggggta gcttttccct 301 ctttgctcct ccattcttga agaaagaaga agatgccact gccatttggg ttgaaactga 361 aacgcacccg gcgctacacg gtgtccagca agagttgcct ggttgcccgg atccaactgc 421 ttaataacga gtttgtggag ttcaccctgt ccgtggagag cactggccag gaaagcctcg 481 aggccgtggc ccagaggctg gagctgcggg aggtcactta cttcagcctc tggtactaca 541 acaagcaaaa tcagcgccgg tgggtagatt tggaaaaacc tttgaagaag cagctggata 601 aatatgcatt ggaacctacc gtctattttg gagtggtgtt ttatgtgcct tcagtttctc 661 agctgcagca ggagattacc aggtatcagt attatctgca actgaagaaa gatatcttgg 721 aaggaagtat tccttgtacc ttagaacaag caattcagct agcaggctta gctgttcaag 781 cggattttgg tgactttgat cagtatgaat cccaggactt tcttcagaaa tttgccttgt 841 ttcctgtggg atggttacaa gatgaaaaag tattggaaga agcaacccaa aaagtggcct 901 tactacatca gaaatacaga gggctcacag ctcctgatgc tgaaatgctg tacatgcagg 961 aggtagagag aatggatggc tatggagaag agagctaccc tgctaaggat agccaaggaa 1021 gtgacatatc cattggagcg tgtcttgaag gtatctttgt gaaacacaag aatggaaggc 1081 atcctgtggt atttaggtgg catgacattg ccaacatgtc ccacaacaag tccttttttg 1141 cattagagct ggcaaataaa gaggagacca ttcaatttca aactgaagac atggaaacag 1201 caaaatacat ttggagactc tgtgttgcgc gacacaagtt ttacagacta aaccagtgta 1261 acctgcaaac tcagactgtc acagtgaacc caatcaggag gaggtcttct tcaaggatgt 1321 ctctgcctaa accccagccc tacgtgatgc ctcccccacc gcagttgcac tataatggac 1381 attatacaga accatatgct tcttcccaag ataacctctt tgtgcccaac cagaacggat 1441 actactgtca ctctcagaca agcttggata gagcccagat tgacttcaac ggtcggatcc 1501 gtaatggcag tgtctacagt gcacacagca ccaactcctt aaataatcct cagccctact 1561 tgcagccctc gccgatgtcg tccaacccta gcatcaccgg gagtgacgtc atgaggcctg 1621 actacctccc gtcccatcgg cacagcgccg tgataccccc gtcctaccgc cccaccccag 1681 actatgagac tgtgatgaag cagctcaaca ggggcctggt gcatgcggaa cggcagagcc 1741 actcgctgcg aaacctcaac atcggcagct cgtacgccta cagcaggccc gcggcgctgg 1801 tctacagcca gcccgagatc cgcgagcacg cacagctccc ctcgccagcg gccgcacact 1861 gcccgttcag cctgagctac agcttccaca gcccgtctcc ctacccctac cctgccgagc 1921 ggcggcccgt ggtgggcgcg gtcagcgtgc cggagctgac caatgcgcag ctgcaggcgc 1981 aggactaccc gtctcccaac atcatgcgga cgcaggtgta ccggccaccc ccaccctacc 2041 cgccccccag gcccgccaac agcacgccag acctgtcccg ccacctttac atcagcagca 2101 gcaaccccga cctcatcacg cggcgcgtgc accactcggt gcaaacgttc caggaggaca 2161 gcctgcccgt ggcgcactcg ctgcaggagg tcagcgagcc cctcaccgcc gcgcgccacg 2221 cgcagctgca caaacggaac agcatcgagg tggccgggct cagccacggc ctggagggcc 2281 tgcggctcaa ggagcgcacc ctatccgcgt cggcggcaga ggtggcgccg cgagccgtct 2341 cggtgggctc ccagcccagc gttttcaccg agaggacaca gcgagaaggg ccggaggagg 2401 cggagggctt gaggtacggc cataagaagt ccctgtcgga cgccaccatg ctaatccaca 2461 gcagcgagga ggaggaggac gaggacttcg aggaggagag cggggcccgg gcgccccctg 2521 cacgtgcgcg cgagcctcgg cccggcctgg cccaggaccc acctggctgc cctcgcgtcc 2581 tgctcgccgg gcccctgcac atcctggagc ccaaggccca cgtcccagac gcggagaaga 2641 ggatgatgga cagcagcccc gtccgcacga ccgcagaggc ccagcggccc tggagagacg 2701 ggctgctgat gccctccatg tcggagtccg acctcaccac gtcaggccgc taccgagccc 2761 ggagggactc tctgaagaaa aggccggtgt cggaccttct ctctgggaag aagaacatcg 2821 tggaagggct cccgcctcta gggggaatga aaaagactcg agtagatgca aaaaaaattg 2881 gtcctcttaa actggctgcc ctaaatggac tctccctatc tcgagtgcct ctgcctgatg 2941 aaggaaagga agtggctacc agagcaacga atgatgaaag gtgtaaaatt ctggaacaac 3001 gattagaaca aggaatggta ttcacagaat atgaaagaat tcttaagaaa cggctagttg 3061 atggggagtg ctcaacagca cgactccctg aaaatgcaga aagaaatcga ttccaagatg 3121 ttcttcctta tgatgatgcg agagtggagt tggtcccaac taaagaaaac aacactggtt 3181 acatcaacgc atcacatatt aaggtctctg tcagtggaat cgaatgggat tatattgcca 3241 cacagggacc attacagaat acctgtcaag atttttggca gatggtatgg gaacagggaa 3301 ttgcaattat agcaatggtg acagcagaag aggagggtgg aagggagaag agctttaggt 3361 actggccacg acttggttcc aggcacaaca ctgtcaccta tggaaggttt aagatcacga 3421 cccggttccg cacagactct ggctgctatg ccaccacagg cctgaagatg aagcacctcc 3481 ttaccgggca agagaggacc gtctggcacc tccaatacac agactggcct gaacatggct 3541 gtccagaaga cctcaaggga tttttatcat atcttgaaga gatccagtct gttcgacgcc 3601 atacaaatag cacaagtgat ccccaaagcc ccaaccctcc gttgttggtc cactgcagtg 3661 ctggggtagg aaggactggc gtggtgattt tgtcggagat catgatcgcc tgcctggaac 3721 acaatgaggt gctggacatc ccgagagtgc tggacatgct gaggcaacag agaatgatgc 3781 tggtgcagac tctctgccag tacacatttg tgtacagagt cctcatccag ttcctgaaaa 3841 gctccaggct catctaagct cccacaattt cttacggggc cagtcatgtg aagcgtttac 3901 agcttaaaaa aaaagcgctt gcctaactca tactttcccg ttgacacttg atccacgcag 3961 cgtggcactg ggacgtaagt ggcgcagtct gaatggcggc acgctgaagg aaacgtgcga 4021 agcacaggct gaagaggggt ttctaacctg ggaaaggtgc tcaaggagga cttggtttca // LOCUS HSPTPE 2160 bp RNA PRI 18-APR-1997 DEFINITION Human HPTP epsilon mRNA for protein tyrosine phosphatase epsilon. ACCESSION X54134 NID g35791 KEYWORDS HPTP gene; protein tyrosine phosphatase; transmembrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2160) AUTHORS Saito,H. TITLE Direct Submission JOURNAL Submitted (24-JUL-1990) Saito H., Dana Farber Cancer Institute, Harvard Medical School, 44 Binney Street, Boston MA 02115, U S A REFERENCE 2 (bases 1 to 2160) AUTHORS Krueger,N.X., Streuli,M. and Saito,H. TITLE Structural diversity and evolution of human receptor-like protein tyrosine phosphatases JOURNAL EMBO J. 9 (10), 3241-3252 (1990) MEDLINE 91006018 REFERENCE 3 (bases 1 to 2160) AUTHORS Gastier,J.M., Brody,T., Pulido,J.C., Businga,T., Sunden,S., Hu,X., Maitra,S., Buetow,K.H., Murray,J.C., Sheffield,V.C., Boguski,M., Duyk,G.M. and Hudson,T.J. TITLE Development of a screening set for new (CAG/CTG)n dynamic mutations JOURNAL Genomics 32 (1), 75-85 (1996) MEDLINE 96230328 COMMENT See also X54130-X54135. FEATURES Location/Qualifiers source 1..2160 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="human placental cDNA" /clone="HPTP epsilon 43, 89 & 125" mRNA 1..2160 /gene="HPTP epsilon" gene 1..2160 /gene="HPTP epsilon" CDS 52..2154 /gene="HPTP epsilon" /EC_number="3.1.3.48" /codon_start=1 /product="protein-tyrosine phosphatase" /db_xref="PID:g35792" /db_xref="SWISS-PROT:P23469" /translation="MEPLCPLLLVGFSLPLARALRGNETTADSNETTTTSGPPDPGAS QPLLAWLLLPLLLLLLVLLLAAYFFRFRKQRKAVVSTSDKKMPNGILEEQEQQRVMLL SRSPSGPKKYFPIPVEHLEEEIRIRSADDCKQFREEFNSLPSGHIQGTFELANKEENR EKNRYPNILPNDHSRVILSQLDGIPCSDYINASYIDGYKEKNKFIAAQGPKQETVNDF WRMVWEQKSATIVMLTNLKERKEEKCHQYWPDQGCWTYGNIRVCVEDCVVLVDYTIRK FCIQPQLPDGCKAPRLVSQLHFTSWPDFGVPFTPIGMLKFLKKVKTLNPVHAGPIVVH CSAGVGRTGTFIVIDAMMAMMHAEQKVDVFEFVSRIRNQRPQMVQTDMQYTFIYQALL EYYLYGDTELDVSSLEKHLQTMHGTTTHFDKIGLEEEFRKLTNVRIMKENMRTGNLPA NMKKARVIQIIPYDFNRVILSMKRGQEYTDYINASFIDGYRQKDYFIATQGPLAHTVE DFWRMIWEWKSHTIVMLTEVQEREQDKCYQYWPTEGSVTHGEITIEIKNDTLSEAISI RDFLVTLNQPQARQEEQVRVVRQFHFHGWPEIGIPAEGKGMIDLIAAVQKQQQQTGNH PITVHCSAGAGRTGTFIALSNILERVKAEGLLDVFQAVKSLRLQRPHMVQTLEQYEFC YKVVQDFIDIFSDYANFK" sig_peptide 52..108 /gene="HPTP epsilon" /note="product is human protein tyrosine phosphatase epsilon" mat_peptide 109..2151 /gene="HPTP epsilon" /EC_number="3.1.3.48" /product="protein-tyrosine phosphatase" misc_feature 190..258 /gene="HPTP epsilon" /note="transmembrane region" BASE COUNT 578 a 584 c 561 g 437 t ORIGIN 1 gaccagaccg gcccccccga gactatagcc ttcactttcc ctcggtccac catggagccc 61 ttgtgtccac tcctgctggt gggttttagc ttgccgctcg ccagggctct caggggcaac 121 gagaccactg ccgacagcaa cgagacaacc acgacctcag gccctccgga cccgggcgcc 181 tcccagccgc tgctggcctg gctgctactg ccgctgctgc tcctcctcct cgtgctcctt 241 ctcgccgcct acttcttcag gttcaggaag cagaggaaag ctgtggtcag caccagcgac 301 aagaagatgc ccaacggaat cttggaggag caagagcagc aaagggtgat gctgctcagc 361 aggtcaccct cagggcccaa gaagtatttt cccatccccg tggagcacct ggaggaggag 421 atccgtatca gatccgccga cgactgcaag cagtttcggg aggagttcaa ctcattgcca 481 tctggacaca tacaaggaac ttttgaactg gcaaataaag aagaaaacag agaaaaaaac 541 agatatccca acatccttcc caatgaccat tctagggtga ttctgagcca actggatgga 601 attccctgtt cagactacat caatgcttcc tacatagatg gttacaaaga gaagaataaa 661 ttcatagcag ctcaaggtcc caaacaggaa acggttaacg acttctggag aatggtctgg 721 gagcaaaagt ctgcgaccat cgtcatgtta acaaacttga aagaaaggaa agaggaaaag 781 tgccatcagt actggcccga ccaaggctgc tggacctatg gaaacatccg ggtgtgcgtg 841 gaggactgcg tggttttggt cgactacacc atccggaagt tctgcataca gccacagctc 901 cccgacggct gcaaagcccc caggctggtc tcacagctgc acttcaccag ctggcccgac 961 ttcggagtgc cttttacccc cattgggatg ctgaagttcc tcaagaaagt aaagacgctc 1021 aaccccgtgc acgctgggcc catcgtggtc cactgtagcg cgggcgtggg ccggacgggc 1081 accttcattg tgatcgatgc catgatggcc atgatgcacg cggagcagaa ggtggatgtg 1141 tttgaatttg tgtctcgaat ccgtaatcag cgccctcaga tggttcaaac ggatatgcag 1201 tacacgttca tctaccaagc cttactcgag tactacctct acggggacac agagctggac 1261 gtgtcctccc tggagaagca cctgcagacc atgcacggca ccaccaccca cttcgacaag 1321 atcgggctgg aggaggagtt caggaaattg acaaatgtcc ggatcatgaa ggagaacatg 1381 aggacgggca acttgccggc aaacatgaag aaggccaggg tcatccagat catcccgtat 1441 gacttcaacc gagtgatcct ttccatgaaa aggggtcaag aatacacaga ctacatcaac 1501 gcatccttca tagacggcta ccgacagaag gactatttca tcgccaccca ggggccactg 1561 gcacacacgg ttgaggactt ctggaggatg atctgggaat ggaaatccca cactatcgtg 1621 atgctgacgg aggtgcagga gagagagcag gataaatgct accagtattg gccaaccgag 1681 ggctcagtta ctcatggaga aataacgatt gagataaaga atgataccct ttcagaagcc 1741 atcagtatac gagactttct ggtcactctc aatcagcccc aggcccgcca ggaggagcag 1801 gtccgagtag tgcgccagtt tcacttccac ggctggcctg agatcgggat tcccgccgag 1861 ggcaaaggca tgattgacct catcgcagcc gtgcagaagc agcagcagca gacaggcaac 1921 caccccatca ccgtgcactg cagtgccgga gctgggcgaa caggtacatt catagccctc 1981 agcaacattt tggagcgagt aaaagccgag ggacttttag atgtatttca agctgtgaag 2041 agtttacgac ttcagagacc acatatggtg caaaccctgg aacagtatga attctgctac 2101 aaagtggtac aagattttat tgatatattt tctgattatg ctaatttcaa atgaagattc // LOCUS HSPTPL1 8043 bp RNA PRI 31-OCT-1994 DEFINITION H.sapiens PTPL1 mRNA for protein tyrosine phosphatase. ACCESSION X80289 NID g515030 KEYWORDS protein tyrosine phophatase; protein tyrosine phophatase 1; PTPL1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8043) AUTHORS Saras,J., Claesson-Welsh,L., Heldin,C.H. and Gonez,L.J. TITLE Cloning and characterization of PTPL1, a protein tyrosine phosphatase with similarities to cytoskeletal-associated proteins JOURNAL J. Biol. Chem. 269 (39), 24082-24089 (1994) MEDLINE 95014139 REFERENCE 2 (bases 1 to 8043) AUTHORS Saras,J. TITLE Direct Submission JOURNAL Submitted (14-JUL-1994) J. Saras, Ludwig Institute for Cancer, Research, Box 595 BMC, Husargatan 3, S-75124 Uppsala, SWEDEN FEATURES Location/Qualifiers source 1..8043 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="AG1518" /cell_type="foreskin fibroblast" gene 78..7478 /gene="PTPL1" CDS 78..7478 /gene="PTPL1" /EC_number="3.1.3.48" /codon_start=1 /product="protein-tyrosine-phosphatase" /db_xref="PID:g515031" /translation="MHVSLAEALEVRGGPLQEEEIWAVLNQSAESLQELFRKVSLADP AALGFIISPWSLLLLPSGSVSFTDENISNQDLRAFTAPEVLQNQSLTSLSDVEKIHIY SLGMTLYWGADYEVPQSQPIKLGDHLNSILLGMCEDVIYARVSVRTVLDACSAHIRNS NCAPSFSYVKHLVKLVLGNLSGTDQLSCNSEQKPDRSQAIRDRLRGKGLPTGRSSTSD VLDIQKPPLSHQTFLNKGLSKSMGFLSIKDTQDENYFKDILSDNSGREDSENTFSPYQ FKTSGPEKKPIPGIDVLSKKKIWASSMDLLCTADRDFSSGETATYRRCHPEAVTVRTS TTPRKKEARYSDGSIALDIFGPQKMDPIYHTRELPTSSAISSALDRIRERQKKLQVLR EAMNVEEPVRRYKTYHGDVFSTSSESPSIISSESDFRQVRRSEASKRFESSSGLPGVD ETLSQGQSQRPSRQYETPFEGNLINQEIMLKRQEEELMQLQAKMALRQSRLSLYPGDT IKASMLDITRDPLREIALETAMTQRKLRNFFGPEFVKMTIEPFISLDLPRSILTKKGK NEDNRRKVNIMLLNGQRLELTCDTKTICKDVFDMVVAHIGLVEHHLFALATLKDNEYF FVDPDLKLTKVAPEGWKEEPKKKTKATVNFTLFFRIKFFMDDVSLIQHTLTCHQYYLQ LRKDILEERMHCDDETSLLLASLALQAEYGDYQPEVHGVSYFRMEHYLPARVMEKLDL SYIKEELPKLHNTYVGASEKETELEFLKVCQRLTEYGVHFHRVHPEKKSQTGILLGVC SKGVLVFEVHNGVRTLVLRFPWRETKKISFSKKKITLQNTSDGIKHGFQTDNSKICQY LLHLCSYQHKFQLQMRARQSNQDAQDIERASFRSLNLQAESVRGFNMGRAISTGSLAS STLNKLAVRPLSVQAEILKRLSCSELSLYQPLQNSSKEKNDKASWEEKPREMSKSYHD LSQASLYPHRKNVIVNMEPPPQTVAELVGKPSHQMSRSDAESLAGVTKLNNSKSVASL NRSPERRKHESDSSSIEDPGQAYVLDVLHKRWSIVSSPEREITLVNLKKDAKYGLGFQ IIGGEKMGRLDLGIFISSVAPGGPADFHGCLKPGDRLISVNSVSLEGVSHHAAIEILQ NAPEDVTLVISQPKEKISKVPSTPVHLTNEMKNYMKKSSYMQDSAIDSSSKDHHWSRG TLRHISENSFGPSGGLREGSLSSQDSRTESASLSQSQVNGFFASHLGDQTWQESQHGS PSPSVISKATEKETFTDSNQSKTKKPGISDVTDYSDRGDSDMDEATYSSSQDHQTPKQ ESSSSVNTSNKMNFKTFSSSPPKPGDIFEVELAKNDNSLGISVTGGVNTSVRHGGIYV KAVIPQGAAESDGRIHKGDRVLAVNGVSLEGATHKQAVETLRNTGQVVHLLLEKGQSP TSKEHVPVTPQCTLSDQNAQGQGPEKVKKTTQVKDYSFVTEENTFEVKLFKNSSGLGF SFSREDNLIPEQINASIVRVKKLFAGQPAAESGKIDVGDVILKVNGASLKGLSQQEVI SALRGTAPEVFLLLCRPPPGVLPEIDTALLTPLQSPAQVLPNSSKDSSQPSCVEQSTS SDENEMSDKSKKQCKSPSRRDSYSDSSGSGEDDLVTAPANISNSTWSSALHQTLSNMV SQAQSHHEAPKSQEDTICTMFYYPQKIPNKPEFEDSNPSPLPPDMAPGQSYQPQSESA SSSSMDKYHIHHISEPTRQENWTPLKNDLENHLEDFELEVELLITLIKSEKASLGFTV TKGNQRIGCYVHDVIQDPAKSDGRLKPGDRLIKVNDTDVTNMTHTDAVNLLRAASKTV RLVIGRVLELPRIPMLPHLLPDITLTCNKEELGFSLCGGHDSLYQVVYISDINPRSVA AIEGNLQLLDVIHYVNGVSTQGMTLEEVNRALDMSLPSLVLKATRNDLPVVPSSKRSA VSAPKSTKGNGSYSVGSCSQPALTPNDSFSTVAGEEINEISYPKGKCSTYQIKGSPNL TLPKESYIQEDDIYDDSQEAEVIQSLLDVVDEEAQNLLNENNAAGYSCGPGTLKMNGK LSEERTEDTDCDGSPLPEYFTEATKMNGCEEYCEEKVKSESLIQKPQEKKTDDDEITW GNDELPIERTNHEDSDKDHSFLTNDELAVLPVVKVLPSGKYTGANLKSVIRVLRGLLD QGIPSKELENLQELKPLDQCLIGQTKENRRKNRYKNILPYDATRVPLGDEGGYINASF IKIPVGKEEFVYIACQGPLPTTVGDFWQMIWEQKSTVIAMMTQEVEGEKIKCQRYWPN ILGKTTMVSNRLRLALVRMQQLKGFVVRAMTLEDIQTREVRHISHLNFTAWPDHDTPS QPDDLLTFISYMRHIHRSGPIITHCSAGIGRSGTLICIDVVLGLISQDLDFDISDLVR CMRLQRHGMVQTEDQYIFCYQVILYVLTRLQAEEEQKQQPQLLK" misc_feature 1493..1582 /gene="PTPL1" /note="leucine zipper motif" misc_feature 1790..2699 /gene="PTPL1" /note="band 4,1 homology region" repeat_unit 3315..3545 /gene="PTPL1" /note="GLGF repeat1" repeat_unit 4137..4367 /gene="PTPL1" /note="GLGF repeat2" repeat_unit 4533..4775 /gene="PTPL1" /note="GLGF repeat 3" repeat_unit 5401..5615 /gene="PTPL1" /note="GLGF repeat 4" repeat_unit 5682..5906 /gene="PTPL1" /note="GLGF repeat 5" BASE COUNT 2560 a 1668 c 1732 g 2082 t 1 others ORIGIN 1 cccgccccga cgccgcgtcc ctgcagccct gcccggcgct ccagtagcag gacccggtct 61 cgggaccagc cggtaatatg cacgtgtcac tagctgaggc cctggaggtt cggggtggac 121 cacttcagga ggaagaaata tgggctgtat taaatcaaag tgctgaaagt ctccaagaat 181 tattcagaaa agtaagccta gctgatcctg ctgcccttgg cttcatcatt tctccatggt 241 ctctgctgtt gctgccatct ggtagtgtgt catttacaga tgaaaatatt tccaatcagg 301 atcttcgagc attcactgca ccagaggttc ttcaaaatca gtcactaact tctctctcag 361 atgttgaaaa gatccacatt tattctcttg gaatgacact gtattggggg gctgattatg 421 aagtgcctca gagccaacct attaagcttg gagatcatct caacagcata ctgcttggaa 481 tgtgtgagga tgttatttac gctcgagttt ctgttcggac tgtgctggat gcttgcagtg 541 cccacattag gaatagcaat tgtgcaccct cattttccta cgtgaaacac ttggtaaaac 601 tggttctggg aaatctttct gggacagatc agctttcctg taacagtgaa caaaagcctg 661 atcgaagcca ggctattcga gatcgattgc gaggaaaagg attaccaaca ggaagaagct 721 ctacttctga tgtactagac atacaaaagc ctccactctc tcatcagacc tttcttaaca 781 aagggcttag taaatctatg ggatttctgt ccatcaaaga tacacaagat gagaattatt 841 tcaaggacat tttatcagat aattctggac gtgaagattc tgaaaataca ttctcccctt 901 accagttcaa aactagtggc ccagaaaaaa aacccatccc tggcattgat gtgctttcta 961 agaagaagat ctgggcttca tccatggact tgctttgtac agctgacaga gacttctctt 1021 caggagagac tgccacatat cgtcgttgtc accctgaggc agtaacagtg cggacttcaa 1081 ctacgcctag aaaaaaggag gcaagatact cagatggaag tatagccttg gatatctttg 1141 gccctcagaa aatggatcca atatatcaca ctcgagaatt gcccacctcc tcagcaatat 1201 caagtgcttt ggaccgaatc cgagagagac aaaagaaact tcaggttctg agggaagcca 1261 tgaatgtaga agaaccagtt cgaagataca aaacttatca tggtgatgtc tttagtacct 1321 ccagtgaaag tccatctatt atttcctctg aatcagattt cagacaagtg agaagaagtg 1381 aagcctcaaa gaggtttgaa tccagcagtg gtctcccagg ggtagatgaa accttaagtc 1441 aaggccagtc acagagaccg agcagacaat atgaaacacc ctttgaaggc aacttaatta 1501 atcaagagat catgctaaaa cggcaagagg aagaactgat gcagctacaa gccaaaatgg 1561 cccttagaca gtctcggttg agcctatatc caggagacac aatcaaagcg tccatgcttg 1621 acatcaccag ggatccgtta agagaaattg ccctagaaac agccatgact caaagaaaac 1681 tgaggaattt ctttggccct gagtttgtga aaatgacaat tgaaccattt atatctttgg 1741 atttgccacg gtctattctt actaagaaag ggaagaatga ggataaccga aggaaagtaa 1801 acataatgct tctgaacggg caaagactgg aactgacctg tgataccaaa actatatgta 1861 aagatgtgtt tgatatggtt gtggcacata ttggcttagt agagcatcat ttgtttgctt 1921 tagctaccct caaagataat gaatatttct ttgttgatcc tgacttaaaa ttaaccaaag 1981 tggccccaga gggatggaaa gaagaaccaa agaaaaagac caaagccact gttaatttta 2041 ctttgttttt cagaattaaa ttttttatgg atgatgttag tctaatacaa catactctga 2101 cgtgtcatca gtattacctt cagcttcgaa aagatatttt ggaggaaagg atgcactgtg 2161 atgatgagac ttccttattg ctggcatcct tggctctcca ggctgagtat ggagattatc 2221 aaccagaggt tcatggtgtg tcttacttta gaatggagca ctatttgccc gccagagtga 2281 tggagaaact tgatttatcc tatatcaaag aagagttacc caaattgcat aatacctatg 2341 tgggagcttc tgaaaaagag acagagttag aatttttaaa ggtctgccaa agactgacag 2401 aatatggagt tcattttcac cgagtgcacc ctgagaagaa gtcacaaaca ggaatattgc 2461 ttggagtctg ttctaaaggt gtccttgtgt ttgaagttca caatggagtg cgcacattgg 2521 tccttcgctt tccatggagg gaaaccaaga aaatatcttt ttctaaaaag aaaatcacat 2581 tgcaaaatac atcagatgga ataaaacatg gcttccagac agacaacagt aagatatgcc 2641 agtacctgct gcacctctgc tcttaccagc ataagttcca gctacagatg agagcaagac 2701 agagcaacca agatgcccaa gatattgaga gagcttcgtt taggagcctg aatctccaag 2761 cagagtctgt tagaggattt aatatgggac gagcaatcag cactggcagt ctggccagca 2821 gcaccctcaa caaacttgct gttcgacctt tatcagttca agctgagatt ctgaagaggc 2881 tatcctgctc agagctgtcg ctttaccagc cattgcaaaa cagttcaaaa gagaagaatg 2941 acaaagcttc atgggaggaa aagcctagag agatgagtaa atcataccat gatctcagtc 3001 aggcctctct ctatccacat cggaaaaatg tcattgttaa catggaaccc ccaccacaaa 3061 ccgttgcaga gttggtggga aaaccttctc accagatgtc aagatctgat gcagaatctt 3121 tggcaggagt gacaaaactt aataattcaa agtctgttgc gagtttaaat agaagtcctg 3181 aaaggaggaa acatgaatca gactcctcat ccattgaaga ccctgggcaa gcatatgttc 3241 tagatgtgct acacaaaaga tggagcatag tatcttcacc agaaagggag atcaccttag 3301 tgaacctgaa aaaagatgca aagtatggct tgggatttca aattattggt ggggagaaga 3361 tgggaagact ggacctaggc atatttatca gctcagttgc ccctggagga ccagctgact 3421 tccatggatg cttgaagcca ggagaccgtt tgatatctgt gaatagtgtg agtctggagg 3481 gagtcagcca ccatgctgca attgaaattt tgcaaaatgc acctgaagat gtgacacttg 3541 ttatctctca gccaaaagaa aagatatcca aagtgccttc tactcctgtg catctcacca 3601 atgagatgaa aaactacatg aagaaatctt cctacatgca agacagtgct atagattctt 3661 cttccaagga tcaccactgg tcacgtggta ccctgaggca catctcggag aactcctttg 3721 ggccgtctgg gggcctgcgg gaaggaagcc tgagttctca agattccagg actgagagtg 3781 ccagcttgtc tcaaagccag gtcaatggtt tctttgccag ccatttaggt gaccaaacct 3841 ggcaggaatc acagcatggc agcccttccc catctgtaat atccaaagcc accgagaaag 3901 agactttcac tgatagtaac caaagcaaaa ctaaaaagcc aggcatttct gatgtaactg 3961 attactcaga ccgtggagat tcagacatgg atgaagccac ttactccagc agtcaggatc 4021 atcaaacacc aaaacaggaa tcttcctctt cagtgaatac atccaacaag atgaatttta 4081 aaactttttc ttcatcacct cctaagcctg gagatatctt tgaggttgaa ctggctaaaa 4141 atgataacag cttggggata agtgtcacgg gaggtgtgaa tacgagtgtc agacatggtg 4201 gcatttatgt gaaagctgtt attccccagg gagcagcaga gtctgatggt agaattcaca 4261 aaggtgatcg cgtcctagct gtcaatggag ttagtctaga aggagccacc cataagcaag 4321 ctgtggaaac actgagaaat acaggacagg tggttcatct gttattagaa aagggacaat 4381 ctccaacatc taaagaacat gtcccggtaa ccccacagtg taccctttca gatcagaatg 4441 cccaaggtca aggcccagaa aaagtgaaga aaacaactca ggtcaaagac tacagctttg 4501 tcactgaaga aaatacattt gaggtaaaat tatttaaaaa tagctcaggt ctaggattca 4561 gtttttctcg agaagataat cttataccgg agcaaattaa tgccagcata gtaagggtta 4621 aaaagctctt tgctggacag ccagcagcag aaagtggaaa aattgatgta ggagatgtta 4681 tcttgaaagt gaatggagcc tctttgaaag gactatctca gcaggaagtc atatctgctc 4741 tcaggggaac tgctccagaa gtattcttgc ttctctgcag acctccacct ggtgtgctac 4801 cggaaattga tactgcgctt ttgaccccac ttcagtctcc agcacaagta cttccaaaca 4861 gcagtaaaga ctcttctcag ccatcatgtg tggagcaaag caccagctca gatgaaaatg 4921 aaatgtcaga caaaagcaaa aaacagtgca agtccccatc cagaagagac agttacagtg 4981 acagcagtgg gagtggagaa gatgacttag tcacagctcc agcaaacata tcaaattcga 5041 cctggagttc agctttgcat cagactctaa gcaacatggt atcacaggca cagagtcatc 5101 atgaagcacc caagagtcaa gaagatacca tttgtaccat gttttactat cctcagaaaa 5161 ttcccaataa accagagttt gaggacagta atccttcccc tctaccaccg gatatggctc 5221 ctgggcagag ttatcaaccc caatcagaat ctgcttcctc tagttcgatg gataagtatc 5281 atatacatca catttctgaa ccaactagac aagaaaactg gacacctttg aaaaatgact 5341 tggaaaatca ccttgaagac tttgaactgg aagtagaact cctcattacc ctaattaaat 5401 cagaaaaagc aagcctgggt tttacagtaa ccaaaggcaa tcagagaatt ggttgttatg 5461 ttcatgatgt catacaggat ccagccaaaa gtgatggaag gctaaaacct ggggaccggc 5521 tcataaaggt taatgataca gatgttacta atatgactca tacagatgca gttaatctgc 5581 tccgggctgc atccaaaaca gtcagattag ttattggacg agttctagaa ttacccagaa 5641 taccaatgtt gcctcatttg ctaccggaca taacactaac gtgcaacaaa gaggagttgg 5701 gtttttcctt atgtggaggt catgacagcc tttatcaagt ggtatatatt agtgatatta 5761 atccaaggtc cgtcgcagcc attgagggta atctccagct attagatgtc atccattatg 5821 tgaacggagt cagcacacaa ggaatgacct tggaggaagt taacagagca ttagacatgt 5881 cacttccttc attggtattg aaagcaacaa gaaatgatct tccagtggtt cccagctcaa 5941 agaggtctgc tgtttcagct ccaaagtcaa ccaaaggcaa tggttcctac agtgtggggt 6001 cttgcagcca gcctgccctc actcctaatg attcattctc cacggttgct ggggaagaaa 6061 taaatgaaat atcgtacccc aaaggaaaat gttctactta tcagataaag ggatcaccaa 6121 acttgactct gcccaaagaa tcttatatac aagaagatga catttatgat gattcccaag 6181 aagctgaagt tatccagtct ctgctggatg ttgttgatga ggaagcccag aatcttttaa 6241 acgaaaataa tgcagcagga tactcctgtg gtccaggtac attaaagatg aatgggaagt 6301 tatcagaaga gagaacagaa gatacagact gcgatggttc acctttacct gagtatttta 6361 ctgaggccac caaaatgaat ggctgtgaag aatattgtga agaaaaagta aaaagtgaaa 6421 gcttaattca gaagccacaa gaaaagaaga ctgatgatga tgaaataaca tggggaaatg 6481 atgagttgcc aatagagaga acaaaccatg aagattctga taaagatcat tcctttctga 6541 caaacgatga gctcgctgta ctccctgtcg tcaaagtgct tccctctggt aaatacacgg 6601 gtgccaactt aaaatcagtc attcgagtcc tgcggggttt gctagatcaa ggaattcctt 6661 ctaaggagct ggagaatctt caagaattaa aacctttgga tcagtgtcta attgggcaaa 6721 ctaaggaaaa cagaaggaag aacagatata aaaatatact tccctatgat gctacaagag 6781 tgcctcttgg agatgaaggt ggctatatca atgccagctt cattaagata ccagttggga 6841 aagaagagtt cgtttacatt gcctgccaag gaccactgcc tacaactgtt ggagacttct 6901 ggcagatgat ttgggagcaa aaatccacag tgatagccat gatgactcaa gaagtagaag 6961 gagaaaaaat caaatgccag cgctattggc ccaacatcct aggcaaaaca acaatggtca 7021 gcaacagact tcgactggct cttgtgagaa tgcagcagct gaagggcttt gtggtgaggg 7081 caatgaccct tgaagatatt cagaccagag aggtgcgcca tatttctcat ctgaatttca 7141 ctgcctggcc agaccatgat acaccttctc aaccagatga tctgcttact tttatctcct 7201 acatgagaca catccacaga tcaggcccaa tcattacgca ctgcagtgct ggcattggac 7261 gttcagggac cctgatttgc atagatgtgg ttctgggatt aatcagtcag gatcttgatt 7321 ttgacatctc tgatttggtg cgctgcatga gactacaaag acacggaatg gttcagacag 7381 aggatcaata tattttctgc tatcaagtca tcctttatgt cctgacacgt cttcaagcag 7441 aagaagagca aaaacagcag cctcagcttc tgaagtgaca tgaaaagagc ctctggatgc 7501 atttccattt ctctccttaa cctccagcag actcctgctc tctatccaaa taaagatcac 7561 agagcagnaa gttcatacaa catgcatgtt ctcctctatc ttagaggggt attcttcttg 7621 aaaataaaaa atattgaaat gctgtatttt tacagctact ttaacctatg ataattattt 7681 acaaaatttt aacactaacc aaacaatgca gatcttaggg atgattaaag gcagcattga 7741 tgatagcaag acattgttac aaggacatgg tgagtctatt tttaatgcac caatcttgtt 7801 tatagcaaaa atgttttcca atattttaat aaagtagtta ttttataggg catacttgaa 7861 accagtattt aagctttaaa tgacagtaat attggcatag aaaaaagtag caaatgttta 7921 ctgtatcaat ttctaatgtt tactatatag aatttcctgt aatatattta tatacttttt 7981 catgaaaatg gagttatcag ttatctgttt gttactgcat catctgtttg taatcattat 8041 ctc // LOCUS HSPTPU2GN 5100 bp RNA PRI 24-AUG-1995 DEFINITION H.sapiens mRNA for protein tyrosine phosphatase. ACCESSION Z48541 NID g963058 KEYWORDS protein tyrosine phosphatase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5100) AUTHORS Seimiya,H., Sawabe,T., Inazawa,J. and Tsuruo,T. TITLE Cloning, expression and chromosomal localization of a novel gene for protein tyrosine phosphatase (PTP-U2) induced by various differentiation-inducing agents JOURNAL Oncogene 10 (9), 1731-1738 (1995) MEDLINE 95273089 REFERENCE 2 (bases 1 to 5100) AUTHORS Tsuruo,T. TITLE Direct Submission JOURNAL Submitted (28-FEB-1995) Takashi Tsuruo, Inst. of Mol. & Cell. Biosciences, University of Tokyo, 1-1-1 Yayoi, Bunkyo-ku, Tokyo, 113, JAPAN FEATURES Location/Qualifiers source 1..5100 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" 5'UTR 1..120 CDS 121..3771 /EC_number="3.1.3.48" /codon_start=1 /product="protein tyrosine phosphatase" /db_xref="PID:g963059" /translation="MGHLPTGIHGARRLLPLLWLFVLFKNATAFHVTVQDDNNIVVSL EASDVISPASVYVVKITGESKNYFFEFEEFNSTLPPPVIFKASYHGLYYIITLVVVNG NVVTKPSRSITVLTKPLPVTSVSIYDYKPSPETGVLFEIHYPEKYNVFTRVNISYWGG KDFRTMLYKDFFKGKTVFNHWLPGMCYSNITFQLVCEATFNKSTVVEYSGVSHEPKQH RTAPYPPQNISVRIVNLNKNNWEEQSGNFPEESFMRSQDTIGKEKLFHFTEETPEIPS GNISSGWPDFNSSDYETTSQPYWWDSASAAPESEDEFFSVLPMEYENNSTLSETEKST SGSFSFFPVQMILTWLPPKPPTAFDGFHIHIEREENFTEYLMVDEEAHEFVAELKEPG KYKLSVTTFSSSGSCETRKSQSAKSLSFYISPSGEWIEELTEKPQHVSFHVLSSTTAL MSWTSSQENYNSTIVSVVSLTCQKQKESQRLEKQYCTQVNSSKPIIENLVPGAQYQVV IYLRKGPLIGPPSDPVTFAIVPTGIKDLMLYPLGPTAVVLSWTRPYLGVFRKYVVEMF YFNPATMTSEWTTYYEIAATVSLTASVRIANLLPAWYYNFRVTMVTWGDPELSCCDSS TISFITAPVAPEITSVEYFNSLLYISWTYGDDTTDLSHSRMLHWMVVTEGKKKIKKSV TRNVMTAILSLPPGDTYNLSVTTCTERGSNTSMLRLVKLEPAPPKSLFAVNKTQTSVT LLWVEEGVADFFKVFFQHVGSSQKTKLQEPVAVSPHVVTISSLLPATAYSCSVTSFSH DSPSVPTFIAVSTMVTEMNPNVVVISVLAILSTLLIGLLLVTLIILRKKHLQMARECG AGTFANCASLERDGKLPYNCRRSIFAFLTLLPSCLWTDYPLAFYINPWSKNGLKKRKL TNPVQLDDFDAYIKDMAKDSDYKFSLQFEELKLIGLDIPHFAADLPLNRCKNRYTNIL PYDFSRVRLVSMNEEEGADYINANYIPGYNSPQEYIATQGPLPETRNDFWKMVLQQKS QIIVMLTQCNEKRRVKCDHYWPFTEEPIAYGDITVEMISEEEQDDWACRHFRINYADE MQDVMHFNYTAWPDHGVPTANAAESILQFVHMVRQQATKSKGPMIIHCSAGVGRTGTF IALDRLLQHIRDHEFVDILGLVSEMRSYRMSMVQTEEQYIFIHQCVQLMWMKKKQQFC ISDVIYENVSKS" 3'UTR 3772..5100 polyA_signal 5007..5012 /note="putative" polyA_signal 5066..5071 /note="putative" BASE COUNT 1429 a 1111 c 1114 g 1446 t ORIGIN 1 gctcgccagg agcaacctcg gcgcccaggg tctgaggctg cagccccagt tcgacattgt 61 gagccgccgc cgggggagtc cctagcgcaa ccgtgccccc aagtccccgt ccgcgcagcg 121 atggggcacc tgcccacggg gatacacggc gcccgccgcc tcctgcctct gctctggctc 181 tttgtgctgt tcaagaatgc tacagctttc catgtaactg tccaagatga taataacatc 241 gttgtctcat tagaagcttc agacgtcatc agtccagcat ctgtgtatgt tgtgaagata 301 actggtgaat ccaaaaatta tttcttcgaa tttgaggaat tcaacagcac tttgcctcct 361 cctgttattt tcaaggccag ttatcatggc ctttattata taatcactct ggtagtggta 421 aatggaaatg tggtgaccaa gccatccaga tcaatcactg tgttaacaaa acctctacct 481 gtaaccagtg tttccatata tgactataaa ccttctcctg aaacaggagt cctgtttgaa 541 atacattatc cagaaaaata taacgttttc acaagagtga acattagcta ctggggaggt 601 aaagacttcc ggacaatgct atataaagat ttctttaagg gaaaaacagt atttaatcac 661 tggctgccag gaatgtgtta tagtaatatc acctttcagc tggtatgtga ggcaactttt 721 aataaaagta ccgttgttga gtacagtggt gtcagtcacg aacccaaaca gcacagaact 781 gccccttatc cacctcaaaa catttccgtt cgtatcgtaa acttgaacaa aaacaactgg 841 gaagaacaga gtggcaattt cccagaagaa tccttcatga gatcacaaga tacaatagga 901 aaagaaaaac tcttccattt tacagaagaa acccctgaaa ttccctcggg caacatttct 961 tccggttggc ctgattttaa tagcagtgac tatgaaacta cgtctcagcc atattggtgg 1021 gacagtgcat ctgcagctcc tgaaagtgaa gatgaatttt tcagcgtact tcccatggaa 1081 tacgaaaata acagtacact cagtgagaca gagaagtcaa catcaggctc tttctccttt 1141 ttccctgtgc aaatgatatt gacctggtta ccacccaaac cacccactgc ttttgatggg 1201 ttccatatcc atattgaacg agaagagaac tttactgaat atttgatggt ggatgaagaa 1261 gcacatgaat ttgttgcaga actgaaggaa cctgggaaat ataagttatc tgtgacaacc 1321 tttagttcct caggatcttg tgaaactcga aaaagtcagt cagcaaaatc actcagcttt 1381 tatatcagtc cttcaggaga gtggattgaa gaactgaccg agaagccgca gcacgtgagt 1441 ttccacgttt taagctcaac cactgccttg atgtcctgga catcttccca agagaactac 1501 aacagcacca ttgtgtctgt ggtgtcgctg acctgccaga aacaaaagga gagccagagg 1561 cttgaaaagc agtactgcac tcaggtgaac tcaagcaaac ctattattga aaatctggtt 1621 cctggtgccc agtaccaggt tgtaatatac ctaaggaaag gccctttgat tggaccacct 1681 tcagatcctg tgacatttgc tattgttccc acaggaataa aggatttaat gctctatcct 1741 ttgggtccta cggccgtggt tctgagctgg accagacctt atttaggcgt gttcagaaaa 1801 tacgtggttg aaatgtttta tttcaaccct gctacaatga catcagagtg gaccacctac 1861 tatgaaatag cagcaactgt ttccttaact gcatccgtga gaatagctaa tctgctgcca 1921 gcatggtact acaacttccg ggttaccatg gtgacgtggg gagatccaga actgagctgc 1981 tgtgacagct ctaccatcag cttcataaca gccccagtgg ctccggaaat cacttctgtg 2041 gaatatttca acagtctgtt atatatcagt tggacatatg gggatgatac aacggacttg 2101 tcccattcta gaatgcttca ctggatggtg gttacagaag gaaaaaagaa aattaaaaag 2161 agtgtaacac gcaatgtcat gactgcaatt ctcagcttgc ctccaggcga cacctataac 2221 ctctcagtaa ctacttgtac tgaaagagga agtaatacct ccatgctccg ccttgtcaag 2281 ctagaaccag ctccacccaa atcactcttc gcagtgaaca aaacccagac ttcagtgact 2341 ttgctgtggg tggaagaggg agtagctgat ttctttaaag ttttctttca acacgttggc 2401 tccagtcaga aaaccaaact tcaggaacca gttgctgttt ctccccatgt ggtgaccatc 2461 tccagccttc ttcctgccac tgcctacagt tgtagtgtca ccagctttag ccatgacagc 2521 cccagtgtcc ctacgttcat agccgtctca acaatggtta cagagatgaa tcccaatgtg 2581 gtagtgatct ccgtgctggc catccttagc acacttttaa ttggactgtt gcttgttacc 2641 ctcattattc ttaggaaaaa gcatctgcag atggctaggg agtgtggagc tggtacattt 2701 gccaattgtg catccttaga gagggatgga aagcttccat acaactgccg taggagtata 2761 tttgctttct taaccctgct accatcatgt ctttggactg attatccttt ggcattttat 2821 attaatcctt ggagtaaaaa tggtttaaag aagaggaaac tgacaaaccc ggttcaactg 2881 gatgactttg atgcctatat taaggatatg gccaaagact ctgactataa attttctctt 2941 cagtttgagg agttgaaatt gattggactg gatatcccac actttgctgc agatcttcca 3001 ctgaatcgat gtaaaaaccg ttacacaaac atcctaccat atgacttcag ccgtgtgaga 3061 ttagtctcca tgaatgaaga ggaaggtgca gactacatca atgccaacta tattcctgga 3121 tacaactcac cccaggagta tattgccacc caggggccac tgcctgaaac cagaaatgac 3181 ttctggaaga tggtcctgca acaaaagtct cagattattg tcatgctcac tcagtgtaat 3241 gagaaaagga gggtgaaatg tgaccattac tggccattca cggaagaacc tatagcctat 3301 ggagacatca ctgtggagat gatttcagag gaagagcagg acgactgggc ctgtagacac 3361 ttccggatca actatgctga cgagatgcag gatgtgatgc attttaacta cactgcatgg 3421 cctgatcatg gtgtgcccac agcaaatgct gcagaaagta tcctgcagtt tgtacacatg 3481 gtccgacagc aagctaccaa gagcaaaggt cccatgatca ttcactgcag tgctggcgtg 3541 ggacggacag gaacattcat tgccctggac aggctcttgc agcacattcg ggatcatgag 3601 tttgttgaca tcttagggct ggtgtcagaa atgaggtcat accggatgtc tatggtacag 3661 acagaggagc agtacatttt tatccatcag tgtgtgcaac tgatgtggat gaagaagaag 3721 cagcagttct gcatcagtga tgtcatatac gagaatgtta gcaagtccta gttcagaatc 3781 cggagcagag aggacatgat gtgcgcccat cctcccttgc ttccagattg ttttagtggg 3841 ccctgatggt catttttcta aacagaggcc ctgctttgta atatgtggcc aaggagataa 3901 tttatctcac agaagcaccg ggaagactta gccttaaaga gcctacagtg tccttttgga 3961 ctctttcact tcgggacatt taataatgga ccaaattcaa cagaacacca ggaaggtcaa 4021 gacgctctcc aaagggcagg aagtacagca cttccgaaga gtttagttgg ccctttgctg 4081 gttgggctga gttttttatt tttaagtgtt tgtttttcag tgcaataatt tttgtgtgtg 4141 tgtgattctt atcagaaagt tgaattgttt tctgcctaca ccgttcatca gccccataac 4201 ccaggaagga acaggcattg ttagcatcag attatacctc attattaaaa ggaggcatgg 4261 ccacacatga agaaatggtc attctacttc aaagaaattg agccagcact atctgtactc 4321 caacattacc ggatctggat tgggggaggt tggtcaggga agagaggggt tctacccaca 4381 gatcaactgt gtaatctttt actattcaag ctataattca gcttcaaagt agagtagaaa 4441 aaaaattgtc ttaactgttc tagttcttga tggttttctt ccttattaac agttggtgtt 4501 tcttccttgg cccttttgga ctaatgttac tgtccaagtt ctttctcaag aaaccacatc 4561 tggttcagaa gagtgtcaag ttggactctt tgaactctgt tgctgtctga gcaatcgtgg 4621 tgcctagact ttgcattcct tgttctgttg acctgcatac atgtgagagc tatttcttta 4681 agaactatat aggctgtgaa aacgcacttt ctttccccca aagagctggg aatttatgaa 4741 gttatggcaa tgaactgcag catgctggga caattatttg actacttttt tttgtaatat 4801 tgtcaaatgt ctctatggat tctgacagag atttcttttt gttttgttat tcttttggtt 4861 gtcagtttca ttttaacgag tgtaactagt aacattttat tctttggatt ttgtataatt 4921 acagtacatg attgtgtatt gtgacatgaa tgctgtcaaa atgacattga tggcattgtg 4981 aagcctgtta ctttgtgtca cttcctgata aataagaggt gatgacatgg atatacaaca 5041 gaaaacactt tgagttgaaa gtaaacacaa gctggctgct tccctgtggc aactgtggct // LOCUS HSPTX3 1775 bp RNA PRI 10-OCT-1993 DEFINITION H.sapiens PTX3 mRNA. ACCESSION X63053 NID g407079 KEYWORDS PTX3. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1775) AUTHORS Breviario,F., Dejana,E., Mantovani,A. and Introna,M. TITLE CLONING OF A NEW MEMBER OF THE PENTAXIN GENE FAMILY FROM INTERLEUKIN-1 STIMULATED HUMAN ENDOTHELIAL CELLS JOURNAL Unpublished REFERENCE 2 (bases 1 to 1775) AUTHORS Breviario,F. TITLE Direct Submission JOURNAL Submitted (21-OCT-1991) F. Breviario, Istituto Ricerche Farmacol. Mario Negri, Via Eritrea 62, 20157 Milano, ITALY FEATURES Location/Qualifiers source 1..1775 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="Neonate" /tissue_type="Umbilical vein" /cell_type="Endothelial cell" /clone_lib="lambda-ZAP-FB1" /clone="PTX3/D" sig_peptide 6..56 CDS 6..1151 /codon_start=1 /product="PTX3" /db_xref="PID:g407080" /translation="MHLLAILFCALWSAVLAENSDDYDLMYVNLDNEIDNGLHPTEDP TPCDCGQEHSEWDKLFIMLENSQMRERMLLQATDDVLRGELQRLREELGRLAESLARP CAPGAPAEARLTSALDELLQATRDAGRRLARMEGAEAQRPEEAGRALAAVLEELRQTR ADLHAVQGWAARSWLPAGCETAILFPMRSKKIFGSVHPVRPMRLESFSACIWVKATDV LNKTILFSYGTKRNPYEIQLYLSYQSIVFVVGGEENKLVAEAMVSLGRWTHLCGTWNS EEGLTSLWVNGELAATTVEMATGHIVPEGGILQIGQEKNGCCVGGGFDETLAFSGRLT GFNIWDSVLSNEEIRETGGAESCHIRGNIVGWGVTEIQPHGGAQYVS" mat_peptide 57..1148 /product="PTX3" 3'UTR 1152..1760 polyA_signal 1740..1745 polyA_site 1761..1775 BASE COUNT 494 a 349 c 488 g 444 t ORIGIN 1 cagcaatgca tctccttgcg attctgtttt gtgctctctg gtctgcagtg ttggccgaga 61 actcggatga ttatgatctc atgtatgtga atttggacaa cgaaatagac aatggactcc 121 atcccactga ggaccccacg ccgtgcgact gcggtcagga gcactcggaa tgggacaagc 181 tcttcatcat gctggagaac tcgcagatga gagagcgcat gctgctgcaa gccacggacg 241 acgtcctgcg gggcgagctg cagaggctgc gggaggagct gggccggctc gcggaaagcc 301 tggcgaggcc gtgcgcgccg ggggctcccg cagaggccag gctgaccagt gctctggacg 361 agctgctgca ggcgacccgc gacgcgggcc gcaggctggc gcgtatggag ggcgcggagg 421 cgcagcgccc agaggaggcg gggcgcgccc tggccgcggt gctagaggag ctgcggcaga 481 cgcgagccga cctgcacgcg gtgcagggct gggctgcccg gagctggctg ccggcaggtt 541 gtgaaacagc tattttattc ccaatgcgtt ccaagaagat ttttggaagc gtgcatccag 601 tgagaccaat gaggcttgag tcttttagtg cctgcatttg ggtcaaagcc acagatgtat 661 taaacaaaac catcctgttt tcctatggca caaagaggaa tccatatgaa atccagctgt 721 atctcagcta ccaatccata gtgtttgtgg tgggtggaga ggagaacaaa ctggttgctg 781 aagccatggt ttccctggga aggtggaccc acctgtgcgg cacctggaat tcagaggaag 841 ggctcacatc cttgtgggta aatggtgaac tggcggctac cactgttgag atggccacag 901 gtcacattgt tcctgaggga ggaatcctgc agattggcca agaaaagaat ggctgctgtg 961 tgggtggtgg ctttgatgaa acattagcct tctctgggag actcacaggc ttcaatatct 1021 gggatagtgt tcttagcaat gaagagataa gagagaccgg aggagcagag tcttgtcaca 1081 tccgggggaa tattgttggg tggggagtca cagagatcca gccacatgga ggagctcagt 1141 atgtttcata aatgttgtga aactccactt gaagccaaag aaagaaactc acacttaaaa 1201 cacatgccag ttgggaaggt ctgaaaactc agtgcataat aggaacactt gagactaatg 1261 aaagagagag ttgagaccaa tctttatttg tactggccaa atactgaata aacagttgaa 1321 ggaaagacat tggaaaaagc ttttgaggat aatgttacta gactttatgc catggtgctt 1381 tcagtttaat gctgtgtctc tgtcagataa actctcaaat aattaaaaag gactgtattg 1441 ttgaacagag ggacaattgt tttacttttc tttggttaat tttgttttgg ccagagatga 1501 attttacatt ggaagaataa caaaataaga tttgttgtcc attgttcatt gttattggta 1561 tgtaccttat tacaaaaaaa atgatgaaaa catatttata ctacaaggtg acttaacaac 1621 tataaatgta gtttatgtgt tataatcgaa tgtcacgttt ttgagaagat agtcatataa 1681 gttatattgc aaaagggatt tgtattaatt taagactatt tttgtaaagc tctactgtaa 1741 ataaaatatt ttataaaact aaaaaaaaaa aaaaa // LOCUS HSPUMP1 1078 bp RNA PRI 21-MAR-1995 DEFINITION Human pump-1 mRNA homolog. to metalloproteinase, collagenase and stromelysin. ACCESSION X07819 Y00728 NID g35798 KEYWORDS metalloproteinase; pump-1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1078) AUTHORS Breathnach,R. TITLE Direct Submission JOURNAL Submitted (06-JUN-1988) Breathnach R., Laboratoire de Genetique Moleculaire des Eucaryots de CNRS, Unite 184 de Biologie Moleculaire et de Genie Genetique de l'INSE RM, Faculte de Medecine, 11 rue Humann, 67085 STRASBOURG CEDEX, France REFERENCE 2 (bases 1 to 836) AUTHORS Muller,D., Quantin,B., Gesnel,M.C., Millon-Collard,R., Abecassis,J. and Breathnach,R. TITLE The collagenase gene family in humans consists of at least four members JOURNAL Biochem. J. 253 (1), 187-192 (1988) MEDLINE 88339885 FEATURES Location/Qualifiers source 1..1078 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda gt10" sig_peptide 28..78 /note="pot. signal peptide (AA -17 to -1)" CDS 28..831 /note="pre-pump-1 proteinase (AA -17 to 250)" /codon_start=1 /db_xref="PID:g35799" /db_xref="SWISS-PROT:P09237" /translation="MRLTVLCAVCLLPGSLALPLPQEAGGMSELQWEQAQDYLKRFYL YDSETKNANSLEAKLKEMQKFFGLPITGMLNSRVIEIMQKPRCGVPDVAEYSLFPNSP KWTSKVVTYRIVSYTRDLPHITVDRLVSKALNMWGKEIPLHFRKVVWGTADIMIGFAR GAHGDSYPFDGPGNTLAHAFAPGTGLGGDAHFDEDERWTDGSSLGINFLYAATHELGH SLGMGHSSDPNAVMYPTYGNGDPQNFKLSQDDIKGIQKLYGKRSNSRKK" mat_peptide 79..828 /note="mature pump-1 proteinase (AA 1 - 250)" misc_feature 1056..1061 /note="polyA signal" polyA_site 1078 /note="polyA site" BASE COUNT 303 a 230 c 246 g 299 t ORIGIN 1 aagaacaatt gtctctggac ggcagctatg cgactcaccg tgctgtgtgc tgtgtgcctg 61 ctgcctggca gcctggccct gccgctgcct caggaggcgg gaggcatgag tgagctacag 121 tgggaacagg ctcaggacta tctcaagaga ttttatctct atgactcaga aacaaaaaat 181 gccaacagtt tagaagccaa actcaaggag atgcaaaaat tctttggcct acctataact 241 ggaatgttaa actcccgcgt catagaaata atgcagaagc ccagatgtgg agtgccagat 301 gttgcagaat actcactatt tccaaatagc ccaaaatgga cttccaaagt ggtcacctac 361 aggatcgtat catatactcg agacttaccg catattacag tggatcgatt agtgtcaaag 421 gctttaaaca tgtggggcaa agagatcccc ctgcatttca ggaaagttgt atggggaact 481 gctgacatca tgattggctt tgcgcgagga gctcatgggg actcctaccc atttgatggg 541 ccaggaaaca cgctggctca tgcctttgcg cctgggacag gtctcggagg agatgctcac 601 ttcgatgagg atgaacgctg gacggatggt agcagtctag ggattaactt cctgtatgct 661 gcaactcatg aacttggcca ttctttgggt atgggacatt cctctgatcc taatgcagtg 721 atgtatccaa cctatggaaa tggagatccc caaaatttta aactttccca ggatgatatt 781 aaaggcattc agaaactata tggaaagaga agtaattcaa gaaagaaata gaaacttcag 841 gcagaacatc cattcattca ttcattggat tgtatatcat tgttgcacaa tcagaattga 901 taagcactgt tcctccactc catttagcaa ttatgtcacc cttttttatt gcagttggtt 961 tttgaatgtc tttcactcct tttattggtt aaactccttt atggtgtgac tgtgtcttat 1021 tccatctatg agctttgtca gtgcgcgtag atgtcaataa atgttacata cacaaata // LOCUS HSPYST1 2109 bp RNA PRI 25-JUL-1996 DEFINITION H.sapiens mRNA for protein-tyrosine-phosphatase (tissue type: foreskin). ACCESSION X93920 NID g1418933 KEYWORDS protein-tyrosine-phosphatase; pyst1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2109) AUTHORS Groom,L.A., Sneddon,A.A., Alessi,D.R., Dowd,S. and Keyse,S.M. TITLE Differential regulation of the MAP, SAP and RK/p38 kinases by Pyst1, a novel cytosolic dual-specificity phosphatase JOURNAL EMBO J. 15 (14), 3621-3632 (1996) MEDLINE 96312959 REFERENCE 2 (bases 1 to 2109) AUTHORS Keyse,S.M. TITLE Direct Submission JOURNAL Submitted (04-DEC-1995) S.M. Keyse, I.C.R.F., Molecular Pharmacology Unit, Biomedical Research Centre, Level 5, Ninewells Hospital, Dundee, DD1 9SY, UK FEATURES Location/Qualifiers source 1..2109 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="foreskin" /cell_type="fibroblast" /cell_line="EK4" /clone_lib="cDNA lambda gt10" gene 352..1497 /gene="pyst1" CDS 352..1497 /gene="pyst1" /EC_number="3.1.3.48" /codon_start=1 /product="protein-tyrosine-phosphatase" /db_xref="PID:e218263" /db_xref="PID:g1418934" /translation="MIDTLRPVPFASEMAISKTVAWLNEQLELGNERLLLMDCRPQEL YESSHIESAINVAIPGIMLRRLQKGNLPVRALFTRGEDRDRFTRRCGTDTVVLYDESS SDWNENTGGESLLGLLLKKLKDEGCRAFYLEGGFSKFQAEFSLHCETNLDGSCSSSSP PLPVLGLGGLRISSDSSSDIESDLDRDPNSATDSDGSPLSNSQPSFPVEILPFLYLGC AKDSTNLDVLEEFGIKYILNVTPNLPNLFENAGEFKYKQIPISDHWSQNLSQFFPEAI SFIDEARGKNCGVLVHCLAGISRSVTVTVAYLMQKLNLSMNDAYDIVKMKKSNISPNF NFMGQLLDFERTLGLSSPCDNRVPAQQLYFTTPSNQNVYQVDSLQST" BASE COUNT 484 a 555 c 561 g 509 t ORIGIN 1 ccagcctcgg agggagggat tagaagccgc tagacttttt ttcctcccct ctcagtagca 61 cggagtccga attaattgga tttcattcac tggggaggaa caaaaactat ctgggcagct 121 tcattgagag agattcattg acactaagag ccagcgctgc agctggtgca gagagaacct 181 ccggctttga cttctgtctc gtctgcccca aggccgctag cctcggcttg ggaaggcgag 241 gcggaattaa accccgctcc gagagcgcac gttcgcgcgc ggtgcgtcgg ccattgcctg 301 ccccgagggg cgtctggtag gcaccccgcc ctctcccgca gctcgacccc catgatagat 361 acgctcagac ccgtgccctt cgcgtcggaa atggcgatca gcaagacggt ggcgtggctc 421 aacgagcagc tggagctggg caacgagcgg ctgctgctga tggactgccg gccgcaggag 481 ctatacgagt cgtcgcacat cgagtcggcc atcaacgtgg ccatcccggg catcatgctg 541 cggcgcctgc agaagggtaa cctgccggtg cgcgcgctct tcacgcgcgg cgaggaccgg 601 gaccgcttca cccggcgctg tggcaccgac acagtggtgc tctacgacga gagcagcagc 661 gactggaacg agaatacggg cggcgagtcg ttgctcgggc tgctgctcaa gaagctcaag 721 gacgagggct gccgggcgtt ctacctggaa ggtggcttca gtaagttcca agccgagttc 781 tccctgcatt gcgagaccaa tctagacggc tcgtgtagca gcagctcgcc gccgttgcca 841 gtgctggggc tcgggggcct gcggatcagc tctgactctt cctcggacat cgagtctgac 901 cttgaccgag accccaatag tgcaacagac tcggatggta gtccgctgtc caacagccag 961 ccttccttcc cagtggagat cttgcccttc ctctacttgg gctgtgccaa agactccacc 1021 aacttggacg tgttggagga attcggcatc aagtacatct tgaacgtcac ccccaatttg 1081 ccgaatctct ttgagaacgc aggagagttt aaatacaagc aaatccccat ctcggatcac 1141 tggagccaaa acctgtccca gtttttccct gaggccattt ctttcataga tgaagcccgg 1201 ggcaagaact gtggtgtctt ggtacattgc ttggctggca ttagccgctc agtcactgtg 1261 actgtggctt accttatgca gaagctcaat ctgtcgatga acgatgccta tgacattgtc 1321 aaaatgaaaa aatccaacat atcccctaac ttcaacttca tgggtcagct gctggacttc 1381 gagaggacgc tgggactcag cagcccatgt gacaacaggg ttccagcaca gcagctgtat 1441 tttaccaccc cttccaacca gaatgtatac caggtggact ctctgcaatc tacgtgaaag 1501 accccacacc cctccttgct ggaatgtgtc tggcccttca gcagtttctc ttggcagcat 1561 cagctgggct gctttctttg tgtgtggccc caggtgtcaa aatgacacca gctgtctgta 1621 ctagacaagg ttaccaagtg cggaattggt taatactaac agagagattt gctccattct 1681 ctttggaata acaggacatg ctgtatagat acaggcagta ggtttgctct gtacccatgt 1741 gtacagccta cccatgcagg gactgggatt cgaggacttc caggcgcata gggtagaacc 1801 aaatgatagg gtaggagcat gtgttcttta gggccttgta aggctgtttc cttttgcatc 1861 tggaactgac tatataattg tcttcaatga agactaattc aattttgcat atagaggagc 1921 caaagagaga tttcagctct gtatttgtgg tatcagtttg gaaaaaaaaa tctgatactc 1981 catttgatta ttgtaaatat ttgatcttga atcacttgac agtgtttgtt tgaattgtgt 2041 ttgttttttc ctttgatggg cttaaaagaa attatccaaa gggagaaaga gcagtatgcc 2101 acttcttaa // LOCUS HSQAE1 3419 bp RNA PRI 10-AUG-1992 DEFINITION Human mRNA for ubiquitin activating enzyme E1. ACCESSION X56976 NID g35829 KEYWORDS ubiquitin activating enzyme E1. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3419) AUTHORS Kaneda,S. TITLE Direct Submission JOURNAL Submitted (13-DEC-1990) S. Kaneda, NATIONAL INSTITUTE OF GENETICS, 1111 YATA, MISHIMA, SHIZUOKA 411, JAPAN REMARK revised by [3] REFERENCE 2 (bases 1 to 3419) AUTHORS Ayusawa,D., Kaneda,S., Itoh,Y., Yasuda,H., Murakami,Y., Sugasawa,K., Hanaoka,F. and Seno,T. TITLE Complementation by a cloned human ubiquitin-activating enzyme E1 of the S-phase-arrested mouse FM3A cell mutant with thermolabile E1 JOURNAL Cell Struct. Funct. 17 (2), 113-122 (1992) MEDLINE 92298399 REFERENCE 3 (bases 1 to 3419) AUTHORS Kaneda,S. TITLE Direct Submission JOURNAL Submitted (01-JUL-1992) to the EMBL/GenBank/DDBJ databases FEATURES Location/Qualifiers source 1..3419 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="breast carcinoma ZR-75-1" /clone="pcHUBA-1" /chromosome="X Xp11.23" CDS 33..3209 /codon_start=1 /evidence=experimental /product="ubiquitin activating enzyme E1" /db_xref="PID:g35830" /db_xref="SWISS-PROT:P22314" /translation="MSSSPLSKKRRVSGPDPKPGSNCSPAQSVLSEVPSVPTNGMAKN GSEADIDEGLYSRQLYVLGHEAMKRLQTSSVLVSGLRGLGVEIAKNIILGGVKAVTLH DQGTAQWADLSSQFYLREEDIGKNRAEVSQPRLAELNSYVPVTAYTGPLVEDFLSGFQ VVVLTNTPLEDQLRVGEFCHNRGIKLVVAGTRGLFGQLFCDFGEEMILTDSNGEQPLS AMVSMVTKDNPGVVTCLDEARHGFESGDFVSFSEVQGMVELNGNQPMEIKVLGPYTFS ICDTSNFSDYIRGGIVSQVKVPKKISFKSLVASLAEPDFVVTDFAKFSRPAQLHIGFQ ALHQFCAQHGRPPRPRNEEDAAELVALAQAVNARALPAVQQNNLDEDLIRKLAYVAAG DLAPINAFIGGLAAQEVMKACSGKFMPIMQWLYFDALECLPQDKEVLTEDKCLQRQNR YDGQVAVFGSDLQEKLGKQKYFLVGAGAIGCELLKNFAMIGLGCGEGGEIIVTDMDTI EKSNLNRQFLFRPWDVTKLKSDTAAAAVRQMNPHIRVTSHQNRVGPDTERIYDDDFFQ NLDGVANALDNVDARMYMDRRCVYYRKPLLESGTLGTKGNVQVVIPFLTESYSSSQDP PEKSIPICTLKNFPNAIEHTLQWARDEFEGLFKQPAENVNQYLTDPKFVERTLRLAGT QPLEVLEAVQRSLVLQRPQTWADCVTWACHHWHTQYSNNIRQLLHNFPPDQLTSSGAP FWSGPKRCPHPLTFDVNNPLHLDYVMAAANLFAQTYGLTGSQDRAAVATFLQSVQVPE FTPKSGVKIHVSDQELQSANASVDDSRLEELKATLPSPDKLPGFKMYPIDFEKDDDSN FHMDFIVAASNLRAENYDIPSADRHKSKLIAGKIIPAIATTTAAVVGLVCLELYKVVQ GHRQLDSYKNGFLNLALPFFGFSEPLAAPRHQYYNQEWTLWDRFEVQGLQPNGEEMTL KQFLDYFKTEHKLEITMLSQGVSMLYSFFMPAAKLKERLDQPMTEIVSRVSKRKLGRH VRALVLELCCNDESGEDVEVPYVRYTIR" polyA_signal 3387..3392 BASE COUNT 753 a 995 c 928 g 743 t ORIGIN 1 tggcggcggc gcgacccggg gaaccggcat tgatgtccag ctcgccgctg tccaagaaac 61 gtcgcgtgtc cgggcctgat ccaaagccgg gttctaactg ctcccctgcc cagtccgtgt 121 tgtccgaagt gccctcggtg ccaaccaacg gaatggccaa gaacggcagt gaagcagaca 181 tagacgaggg cctttactcc cggcagctgt atgtgttggg ccatgaggca atgaagcggc 241 tccagacatc cagtgtcctg gtatcaggcc tgcggggcct gggcgtggag atcgctaaga 301 acatcatcct tggtggggtc aaggctgtta ccctacatga ccagggcact gcccagtggg 361 ctgatctttc ctcccagttc tacctgcggg aggaggacat cggtaaaaac cgggccgagg 421 tatcacagcc ccgcctcgct gagctcaaca gctatgtgcc tgtcactgcc tacactggac 481 ccctcgttga ggacttcctt agtggtttcc aggtggtggt gctcaccaac acccccctgg 541 aggaccagct gcgagtgggt gagttctgtc acaaccgtgg catcaagctg gtggtggcag 601 gcacgcgggg cctgtttggg cagctcttct gtgactttgg agaggaaatg atcctcacag 661 attccaatgg ggagcagcca ctcagtgcta tggtttctat ggttaccaag gacaaccccg 721 gtgtggttac ctgcctggat gaggcccgac acgggtttga gagcggggac tttgtctcct 781 tttcagaagt acagggcatg gttgaactca acggaaatca gcccatggag atcaaagtcc 841 tgggtcctta tacctttagc atctgtgaca cctccaactt ctccgactac atccgtggag 901 gcatcgtcag tcaggtcaaa gtacctaaga agattagctt taaatccttg gtggcctcac 961 tggcagaacc tgactttgtg gtgacggact tcgccaagtt ttctcgccct gcccagctgc 1021 acattggctt ccaggccctg caccagttct gtgctcagca tggccggcca cctcggcccc 1081 gcaatgagga ggatgcagca gaactggtag ccttagcaca ggctgtgaat gctcgagccc 1141 tgccagcagt gcagcaaaat aacctggacg aggacctcat ccggaagctg gcatatgtgg 1201 ctgctgggga tctggcaccc ataaacgcct tcattggggg cctggctgcc caggaagtca 1261 tgaaggcctg ctccgggaag ttcatgccca tcatgcagtg gctatacttt gatgcccttg 1321 agtgtctccc tcaggacaaa gaggtcctca cagaggacaa gtgcctccag cgccagaacc 1381 gttatgacgg gcaagtggct gtgtttggct cagacctgca agagaagctg ggcaagcaga 1441 agtatttcct ggtgggtgcg ggggccattg gctgtgagct gctcaagaac tttgccatga 1501 ttgggctggg ctgcggggag ggtggagaaa tcatcgttac agacatggac accattgaga 1561 agtcaaatct gaatcgacag tttcttttcc ggccctggga tgtcacgaag ttaaagtctg 1621 acacggctgc tgcagctgtg cgccaaatga atccacatat ccgggtgaca agccaccaga 1681 accgtgtggg tcctgacacg gagcgcatct atgatgacga ttttttccaa aacctagatg 1741 gcgtggccaa tgccctggac aacgtggatg cccgcatgta catggaccgc cgctgtgtct 1801 actaccggaa gccactgctg gagtcaggca cactgggcac caaaggcaat gtgcaggtgg 1861 tgatcccctt cctgacagag tcgtacagtt ccagccagga cccacctgag aagtccatcc 1921 ccatctgtac cctgaagaac ttccctaatg ccatcgagca caccctgcag tgggctcggg 1981 atgagtttga aggcctcttc aagcagccag cagaaaatgt caaccagtac ctcacagacc 2041 ccaagtttgt ggagcgaaca ctgcggctgg caggcactca gcccttggag gtgctggagg 2101 ctgtgcagcg cagcctggtg ctgcagcgac cacagacctg ggctgactgc gtgacctggg 2161 cctgccacca ctggcacacc cagtactcga acaacatccg gcagctgctg cacaacttcc 2221 ctcctgacca gctcacaagc tcaggagcgc cgttctggtc tgggcccaaa cgctgtccac 2281 acccgctcac ctttgatgtc aacaatcccc tgcatctgga ctatgtgatg gctgctgcca 2341 acctgtttgc ccagacctac gggctgacag gctctcagga ccgagctgct gtggccacat 2401 tcctgcagtc tgtgcaggtc cccgaattca cccccaagtc tggcgtcaag atccatgttt 2461 ctgaccagga gctgcagagc gccaatgcct ctgttgatga cagtcgtcta gaggagctca 2521 aagccactct gcccagccca gacaagctcc ctggattcaa gatgtacccc attgactttg 2581 agaaggatga tgacagcaac tttcatatgg atttcatcgt ggctgcatcc aacctccggg 2641 cagaaaacta tgacattcct tctgcagacc ggcacaagag caagctgatt gcagggaaga 2701 tcatcccagc cattgccacg accacagcag ccgtggttgg ccttgtgtgt ctggagctgt 2761 acaaggttgt gcaggggcac cgacagcttg actcctacaa gaatggtttc ctcaacttgg 2821 ccctgccttt ctttggtttc tctgaacccc ttgccgcacc acgtcaccag tactataacc 2881 aagagtggac attgtgggat cgctttgagg tacaagggct gcagcctaat ggtgaggaga 2941 tgaccctcaa acagttcctc gactatttta agacagagca caaattagag atcaccatgc 3001 tgtcccaggg cgtgtccatg ctctattcct tcttcatgcc agctgccaag ctcaaggaac 3061 ggttggatca gccgatgaca gagattgtga gccgtgtgtc gaagcgaaag ctgggccgcc 3121 acgtgcgggc gctggtgctt gagctgtgct gtaacgacga gagcggcgag gatgtcgagg 3181 ttccctatgt ccgatacacc atccgctgac cccgtctgct cctctaggct ggccccttgt 3241 ccacccctct ccacacccct tccagcccag ggttcccatt tggcttctgg cagtggccca 3301 actagccaag tctggtgttc cctcatcatc cccctacctg aacccctctt gccactgcct 3361 tctaccttgt ttgaaacctg aatcctaata aagaattaat aactcccaaa aaaaaaaaa // LOCUS HSR2IMP 2049 bp RNA PRI 17-JUN-1991 DEFINITION Human R2 mRNA for an inducible membrane protein. ACCESSION X53795 NID g35832 KEYWORDS membrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2049) AUTHORS Baumruker,T. TITLE Direct Submission JOURNAL Submitted (11-JUN-1990) Baumruker T., Sandoz Forschungsinstitut Ges.m.b.H., Brunnerstrasse 59, A 1235 Vienna, Austria REFERENCE 2 (bases 1 to 2049) AUTHORS Gaugitsch,H.W., Hofer,E., Huber,N.E., Schnabl,E. and Baumruker,T. TITLE A new superfamily of lymphoid and melanoma cell proteins with extensive homology to Schistosoma mansoni antigen Sm23 JOURNAL Eur. J. Immunol. 21 (2), 377-383 (1991) MEDLINE 91153380 FEATURES Location/Qualifiers source 1..2049 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="PBL" /clone_lib="cDNA" /clone="lambda ZAP" mRNA 1..2049 /note="R2; cDNA" CDS 157..960 /note="R2" /codon_start=1 /product="inducible membrane protein" /db_xref="PID:g35833" /db_xref="SWISS-PROT:P27701" /translation="MGSACIKVTKYFLFLFNLIFFILGAVILGFGVWILADKSSFISV LQTSSSSLRMGAYVFIGVGAVTMLMGFLGCIGAVNEVRCLLGLYFAFLLLILIAQVTA GALFYFNMGKLKQEMGGIVTELIRDYNSSREDSLQDAWDYVQAQVKCCGWVSFYNWTD NAELMNRPEVTYPCSCEVKGEEDNSLSVRKGFCEAPGNRTQSGNHPEDWPVYQEGCME KVQAWLQENLGIILGVGVGVAIIELLGMVLSICLCRHVHSEDYSKVPKY" repeat_region 1317..1597 /rpt_family="Alu, consensus" BASE COUNT 434 a 539 c 601 g 475 t ORIGIN 1 gcacgagcgg gtgacgctgg gcctgcagcg cggagcagaa agcagaaccc gcagagtcct 61 ccctgctgct gtgtggacga cacgtgggca caggcagaag tgggccctgt gaccagctgc 121 actggtttcg tggaaggaag ctccaggact ggcgggatgg gctcagcctg tatcaaagtc 181 accaaatact ttctcttcct cttcaacttg atcttcttta tcctgggcgc agtgatcctg 241 ggcttcgggg tgtggatcct ggccgacaag agcagtttca tctctgtcct gcaaacctcc 301 tccagctcgc ttaggatggg ggcctatgtc ttcatcggcg tgggggcagt cactatgctc 361 atgggcttcc tgggctgcat cggcgccgtc aacgaggtcc gctgcctgct ggggctgtac 421 tttgctttcc tgctcctgat cctcattgcc caggtgacgg ccggggccct cttctacttc 481 aacatgggca agctgaagca ggagatgggc ggcatcgtga ctgagctcat tcgagactac 541 aacagcagtc gcgaggacag cctgcaggat gcctgggact acgtgcaggc tcaggtgaag 601 tgctgcggct gggtcagctt ctacaactgg acagacaacg ctgagctcat gaatcgccct 661 gaggtcacct acccctgttc ctgcgaagtc aagggggaag aggacaacag cctttctgtg 721 aggaagggct tctgcgaggc ccccggcaac aggacccaga gtggcaacca ccctgaggac 781 tggcctgtgt accaggaggg ctgcatggag aaggtgcagg cgtggctgca ggagaacctg 841 ggcatcatcc tcggcgtggg cgtgggtgtg gccatcatcg agctcctggg gatggtcctg 901 tccatctgct tgtgccggca cgtccattcc gaagactaca gcaaggtccc caagtactga 961 ggcagctgct atccccatct ccctgcctgg cccccaacct cagggctccc aggggtctcc 1021 ctggctccct cctccaggcc tgcctcccac ttcactgcga agaccctctt gcccaccctg 1081 actgaaagta gggggctttc tggggcctag cgatctctcc tggcctatcc gctgccagcc 1141 ttgagccctg gctgttctgt ggttcctctg ctcaccgccc atcagggttc tcttagcaac 1201 tcagagaaaa atgctcccca cagcgtccct ggcgcaggtg ggctggactt ctacctgccc 1261 tcaagggtgt gtatattgta taggggcaac tgtatgaaaa attggggagg agggggccgg 1321 gcgcggtggc tcacgcctgt aatcccagca ctttgggagg ccgaggcggg tggatcacga 1381 ggtcaggaga tcgagaccat cctggctaac atggtgaaac cccgtctcta ctaaaaatac 1441 aaaaaaaatt tagccgggcg cggtggcggg cacctgtagt cccagctact tgggaggctg 1501 aggcaggaga atggtgtgaa cccgggagcg gaggttgcag tgagctgaga tcgtgctact 1561 gcactccagc ctgggggaca gaaagagact ccgtctcaga aattgaaaca ctcagaaaaa 1621 cagttgagga ctatttctgc ttttgctatg ggaaagcttt aggcaaatcc acagtggtac 1681 ctgtaccata tgagaagatg ctgcgagacc agtcggctgt ggtagtgcag gggcttacgg 1741 aaggtgttgc ctttaaacac cccgagaact atgatcttgc aaccctgaaa tggattttgg 1801 agaacaaagc agggatttca ttcatcatta agagaccttt tttagagcca aagaagcatg 1861 taggtaagta agtgctttgc ttccttgata gctggctggc ctccgttttg ctagattttc 1921 atacacttta atggtttctg ttttattgtc tttgagaata tgatgtcaga cattttcgga 1981 tgggctgttt agatgttata taatccacaa aaggttcatt gagctaaaaa agtggagact 2041 tgttttttt // LOCUS HSRAB13 1238 bp RNA PRI 02-FEB-1994 DEFINITION H.sapiens mRNA for rab 13. ACCESSION X75593 NID g452319 KEYWORDS rab13 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1238) AUTHORS Zahraoui,A., Joberty,G., Arpin,M., Fontaine,J.J., Hellio,R., Tavitian,A. and Louvard,D. TITLE A small rab GTPase is distributed in cytoplasmic vesicles in non polarized cells but colocalizes with the tight junction marker ZO-1 in polarized epithelial cells JOURNAL J. Cell Biol. 124 (1-2), 101-115 (1994) MEDLINE 94124602 REFERENCE 2 (bases 1 to 1238) AUTHORS Zahraoui,A. TITLE Direct Submission JOURNAL Submitted (03-NOV-1993) A. Zahraoui, INSERM U.248, 10 Avenue de Verdun, 75010 Paris, FRANCE FEATURES Location/Qualifiers source 1..1238 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="CaCo2" gene 140..751 /gene="rab 13" CDS 140..751 /gene="rab 13" /codon_start=1 /product="rab 13" /db_xref="PID:g452320" /translation="MAKAYDHLFKLLLIGDSGVGKTCLIIRFAEDNFNNTYISTIGID FKIRTVDIEGKKIKLQVWDTAGQERFKTITTAYYRGAMGIILVYDITDEKSFENIQNW MKSIKENASAGVERLLLGNKCDMEAKRKVQKEQADKLAREHGIRFFETSAKSSMNVDE AFSSLARDILLKSGGRRSGNGNKPPSTDLKTCDKKNTNKCSLG" BASE COUNT 364 a 275 c 355 g 244 t ORIGIN 1 gaattcgagg atccgggtac catgggagga aaacttcttc ctggcctggg ctccgtgccg 61 ctctgtttgc caaccgtcca gtcccgccta ccagtgccgg gcgctcccca cccctccccc 121 ggctcccccg gtgtccgcca tggccaaagc ctacgaccac ctcttcaagt tgctgctgat 181 cggggactcg ggggtgggca agacttgtct gatcattcgc tttgcagagg acaacttcaa 241 caacacttac atctccacca tcggaattga tttcaagatc cgcactgtgg atatagaggg 301 gaagaagatc aaactacaag tctgggacac ggctggccaa gagcggttca agacaataac 361 tactgcctac taccgtggag ccatgggcat tatcctagta tacgacatca cggatgagaa 421 atctttcgag aatattcaga actggatgaa aagcatcaag gagaatgcct cggctggggt 481 ggagcgcctc ttgctgggga acaaatgtga catggaggcc aagaggaagg tgcagaagga 541 gcaggccgat aagttggctc gagagcatgg aatccgattt ttcgaaacta gtgctaaatc 601 cagtatgaat gtggatgagg cttttagttc cctggcccgg gacatcttgc tcaagtcagg 661 aggccggaga tcaggaaacg gcaacaagcc tcccagtact gacctgaaaa cttgtgacaa 721 gaagaacacc aacaagtgct ccctgggctg aggacccttt cttgcctccc caccccggaa 781 gctgaacctg agggagacaa cggcagaggg agtgagcagg ggagaaatag cagaggggct 841 tggagggtca cataggtaga tggtaaagag aatgaggaga aaaaggagaa aagggaaaag 901 cagaaaggaa aaaaaggaag agagaggaag ggagaaggga gaggaatgaa ttgaggaagt 961 gaaagaaggc aaggaggtag gaagagaggg aggaggaaag gaaggagaga gatgcctcag 1021 gcttcagacc ttacctgggt tttcagggca aacataaatg taaatacact gatttattct 1081 gttactagat caggttttag ggtcctgcaa aaggctagct cggcactaca ctagggaatt 1141 tgctcctgtt ctgtcacttg tcatggtctt tcttggtatt aaaggccacc atttgcacaa 1201 aaaaaaaaaa aaaccatggt acccggatcc tcgaattc // LOCUS HSRAB2 1148 bp RNA PRI 12-SEP-1993 DEFINITION Human rab2 mRNA, YPT1-related and member of ras family. ACCESSION X12953 NID g35836 KEYWORDS GTP-binding protein; ras gene family; ras oncogene; ras-related protein; YPT1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1148) AUTHORS Takano,T. TITLE Direct Submission JOURNAL Submitted (20-SEP-1988) Takano T., Dept. of Microbiology, Keio University School of Medicine, 35 Shinanomachi, Shinjuku-ku, Tokyo 160, Japan REFERENCE 2 (bases 1 to 1148) AUTHORS Tachibana,K., Umezawa,A., Kato,S. and Takano,T. TITLE Nucleotide sequence of a new YPT1-related human cDNA which belongs to the ras gene superfamily JOURNAL Nucleic Acids Res. 16 (21), 10368 (1988) MEDLINE 89057482 COMMENT The nucleotide sequence and the deduced amino acid sequence of Hrab2 have homologies of 92.9% and 98.1 % with those of a YPT1- related rat cDNA, rab2 ( J02999 ), respectively. Data kindly reviewed (18-Oct-1988) by Takano T. FEATURES Location/Qualifiers source 1..1148 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="histiocytic lymphoma" /cell_line="U937" /clone="U14" CDS 209..847 /note="rab2 gene product (AA 1 - 212)" /codon_start=1 /db_xref="PID:g35837" /db_xref="SWISS-PROT:P08886" /translation="MAYAYLFKYIIIGDTGVGKSCLLLQFTDKRFQPVHDLTIGVEFG ARMITIDGKQIKLQIWDTAGQESFRSITRSYYRGAAGALLVYDITRRDTFNHLTTWLE DARQHSNSNMVIMLIGNKSDLESRREVKKEEGEAFAREHGLIFMETSAKTASNVEEAF INTAKEIYEKIQEGVFDINNEANGIKIGPQHAATNATHAGNQGGQQAGGGCC" misc_feature 245..268 /note="GTP-binding domain" misc_feature 389..406 /note="GTP-binding domain" misc_feature 554..577 /note="GTP-binding domain" misc_feature 647..660 /note="GTP-binding domain" misc_feature 839..844 /note="palmitoylation site" misc_feature 1131..1136 /note="polyA signal" polyA_site 1148 /note="polyA site" BASE COUNT 313 a 245 c 285 g 305 t ORIGIN 1 gctcggtcgg gcgctgtctc cctcggctct gcgggtgtca gttcgtccgg cttcctcaca 61 gcccctcact cccggcggct gacagcagca gcggcggcgg cgggcggcgc ctggcgtttc 121 gaggctgagc ggcaccgggg ttggggcgcg gaggaggagc agcagcggga ggaggagccg 181 tgtgccctgg cactgagcgg ccgcggccat ggcgtacgcc tatctcttca agtacatcat 241 aatcggcgac acaggtgttg gtaaatcatg cttattgcta cagtttacag acaagaggtt 301 tcagccagtg catgacctta ctattggtgt agagttcggt gctcgaatga taactattga 361 tgggaaacag ataaaacttc agatatggga tacggcaggg caagaatcct ttcgttccat 421 cacaaggtcg tattacagag gtgcagcagg agctttacta gtttacgata ttacacggag 481 agatacattc aaccacttga caacctggtt agaagatgcc cgccagcatt ccaattccaa 541 catggtcatt atgcttattg gaaataaaag tgatttagaa tctagaagag aagtaaaaaa 601 agaagaaggt gaagcttttg cacgagaaca tggactcatc ttcatggaaa cgtctgctaa 661 gactgcttcc aatgtagaag aggcatttat taatacagca aaagaaattt atgaaaaaat 721 tcaagaagga gtctttgaca ttaataatga ggcaaatggc attaaaattg gccctcagca 781 tgctgctacc aatgcaacac atgcaggcaa tcagggagga cagcaggctg ggggcggctg 841 ctgttgagtc tgtttttact gtctagctgc ccaacggggc ctactcactt attctttcac 901 cccctctcct cctgctcagc tgagacatga aactatttga aatggcttta tgtcacagaa 961 gactttaatc cgtcaaattc ttgtataact ttgaataaat ggttaatgtt cacttaaaag 1021 acagattttg gagattgtat tcatatctat ttgcatttga tttctaggtc aattgatgtg 1081 attatttttg ttaaatgttg tcttgtgccc ttaactacga actgaattgt attaaacact 1141 acaaagtc // LOCUS HSRAB28 773 bp RNA PRI 25-JUN-1996 DEFINITION H.sapiens rab28 mRNA. ACCESSION X94703 NID g1154851 KEYWORDS GTP-binding protein; rab28 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 773) AUTHORS Brauers,A., Schurmann,A., Massmann,S., Muhl-Zurbes,P., Becker,W., Kainulainen,H., Lie,C. and Joost,H.G. TITLE Alternative mRNA splicing of the novel GTPase Rab28 generates isoforms with different C-termini JOURNAL Eur. J. Biochem. 237 (3), 833-840 (1996) MEDLINE 96235252 REFERENCE 2 (bases 1 to 773) AUTHORS Joost,H. TITLE Direct Submission JOURNAL Submitted (08-JAN-1996) H. Joost, Inst.f.Pharmakologie und Toxikologie, der RWTH, Wendlingweg 2, D- 52057 Aachen, FRG FEATURES Location/Qualifiers source 1..773 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="testis" gene 1..663 /gene="rab28" CDS 1..663 /gene="rab28" /function="GTP-binding protein" /codon_start=1 /db_xref="PID:e218039" /db_xref="PID:g1154852" /translation="MSDSEEESQDRQLKIVVLGDGTSGKTSLTTCFAQETFGKQYKQT IGLDFFLRRITLPGNLNVTLQIWDIGGQTIGGKMLDKYIYGAQGVLLVYDITNYQSFE NLEDWYTVVKKVSEESETQPLVALVGNKIDLEHMRTIKPEKHLRFCQENGFSSHFVSA KTGDSVFLCFQKVAAEILGIKLNKAEIEQSQRIVRAEIVKYPEEENQHTTSTQSRICS VQ" misc_feature 574..668 /note="alternatively spliced insert" BASE COUNT 250 a 139 c 191 g 193 t ORIGIN 1 atgtcggatt cggaggagga gagccaggac cggcaactga aaatcgtcgt gctgggggac 61 ggcacctccg ggaagacctc cttaactacg tgttttgctc aagaaacttt tgggaaacag 121 tacaaacaaa ctataggact ggatttcttt ttgagaagga taacattgcc aggaaacttg 181 aatgttaccc ttcaaatttg ggatatagga gggcagacaa taggaggcaa aatgttggat 241 aaatatatct atggagcaca gggagtcctc ttggtatatg atattacaaa ttatcaaagc 301 tttgagaatt tagaagattg gtatactgtg gtgaagaaag tgagcgagga gtcagaaact 361 cagccactgg ttgccttggt aggcaataaa attgatttgg agcatatgcg aacaataaaa 421 cctgaaaaac acttacggtt ttgccaggaa aatggtttta gtagccactt tgtctcagcc 481 aagacaggag actctgtctt cctgtgcttt cagaaagttg ctgctgaaat ccttgggatc 541 aaattaaaca aagcagaaat agaacagtca cagcgtattg tcagggcaga aatagtgaag 601 tacccggaag aagaaaatca acataccacc tctactcaga gtagaatctg ttcagtacag 661 tagtgcagag ggtggtgaag gcagatattg taaactacaa ccaggaacct atgtcaagga 721 ctgttaaccc tcctagaagc tctatgtgtg cagttcagtg agcgcatttt tcc // LOCUS HSRAB5B 1630 bp RNA PRI 28-APR-1992 DEFINITION H.sapiens mRNA for ras-related protein Rab5b. ACCESSION X54871 NID g35838 KEYWORDS ras-related protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1630) AUTHORS Wilson,D. TITLE Direct Submission JOURNAL Submitted (02-OCT-1990) Wilson D., Boston Childrens Hospital-Hematology Division, 300 Longwood Ave., Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..1630 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="endothelial" CDS 21..668 /codon_start=1 /product="ras related protein Rab5b" /db_xref="PID:g35839" /db_xref="SWISS-PROT:P35239" /translation="MTSRSTARPNGQPQASKICQFKLVLLGESAVGKSSLVLRFVKGQ FHEYQESTIGAAFLTQSVCLDDTTVKFEIWDTAGQERYHSLAPMYYRGAQAAIVVYDI TNQETFARAKTWVKELQRQASPSIVIALAGNKADLANKRMVEYEEAQAYADDNSLLFM ETSAKTAMNVNDLFLAIAKKLPKSEPQNLGGAAGRSRGVDLHEQSQQNKSQCCSN" BASE COUNT 395 a 424 c 411 g 400 t ORIGIN 1 cccattctga taatctggcc atgactagca gaagcacagc taggcccaat gggcaacccc 61 aggccagcaa aatttgccag ttcaaattgg tcctgctggg agaatctgca gtgggaaagt 121 caagcctggt attacgtttt gtcaaagggc agttccatga gtaccaggag agcaccattg 181 gagcggcctt cctcacccag tccgtttgtc tagatgacac aacagtgaag tttgagatct 241 gggacacagc tgggcaggag cgatatcaca gcttagcccc catgtactac aggggtgccc 301 aagctgcaat cgtggtttac gacattacta atcaggaaac ctttgcccga gcaaagacat 361 gggtgaagga actacagcga caggccagtc ctagcatcgt tattgccctg gcagggaaca 421 aagctgacct ggccaacaaa cgtatggtgg agtatgaaga ggcccaggca tatgcagatg 481 acaacagctt attgttcatg gagacttcag ccaagacagc tatgaacgtg aatgatctct 541 tcctggcaat agctaagaag ttgccaaaga gtgaacccca gaatctggga ggtgcagcag 601 gccgaagccg gggtgtggat ctccatgaac agtcccagca gaacaagagc cagtgttgta 661 gcaactgagg gggtggctag cagcaaacaa gtatggagct agcacaagag ctaagaaata 721 accgccatcc ctacccctcg acacacaacc cctacggtac agcacactag ccctggctcc 781 aagggctgcc tcctgacagc tccgtcatgg cactttttaa cgcttcagca acaaacacca 841 ggcagctgtt ccgactggcc tcctaccccc tactctgggg cttgggggtc aactcccccc 901 aggacttacc ttccaaaaca aactttcttc acttgtatta taggtacaag acagcgactt 961 acgtatcttt tctcctcctc cctagtgttc ctccccgatt ttttcagaaa acacttctga 1021 ctcctgtccc ttccccttct gcttttggtc agtccctgtt cttgagcctc ttttctcctc 1081 tccccaggat gctgtttgtg gtgaacccag gaactgagaa ggaggtttcc agttcattta 1141 cattaagggc ctgggggaga taaagctcga gcaggaggga gtaaggaaac attccttttt 1201 gtttttattt ggttggagtt tctcatattt gaaaacattg cggtatccat gatttggcct 1261 tgtggagggt gttcctaggt agaggtgaga atggggaggc aagatctcag ggacaccaag 1321 caggaggtgc cgggtaagct aactgggcgg aggtggaggt gcagggtcaa ctgtggctct 1381 gtaactcttc aaaggccagt ttcccctcac gcagcctctt aggtagcgtt tcccctaatg 1441 gtgtttcccc taatcgtggg gttggacccc agagtcttcc aaagaatttt cactggttgc 1501 ctacgtcttt ggctctgctg tagtctgatt ggaggaggga cagtttctgg tacccatcct 1561 ctgatttata catatgcgtt ttttcccctc tggcctttag atggcctcag ccccagccac 1621 catatacccg // LOCUS HSRAB9P40 1297 bp RNA PRI 23-JUN-1997 DEFINITION Homo sapiens mRNA for Rab9 effector p40, complete cds. ACCESSION Z97074 NID g2217969 KEYWORDS p40 gene; Rab9 effector. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1297) AUTHORS Diaz,E.D. TITLE Direct Submission JOURNAL Submitted (20-JUN-1997) Diaz E.D., Biochemistry, Stanford University, Stanford University School of Medicine Stanford, CA 94305 USA REFERENCE 2 (bases 1 to 1297) AUTHORS Diaz,E., Schimmoeller,F. and Pfeffer,S.R. TITLE A novel Rab9 effector required for endosome to TGN transport JOURNAL Unpublished FEATURES Location/Qualifiers source 1..1297 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Jurkat" /cell_type="lymphoma" gene 150..1268 /gene="p40" CDS 150..1268 /gene="p40" /function="Rab9 effector" /codon_start=1 /evidence=experimental /product="p40" /db_xref="PID:e323546" /db_xref="PID:g2217970" /translation="MKQLPVLEPGDKPRKATWYTLTVPGDSPCARVGHSCSYLPPVGN AKRGKVFIVGGANPNRSFSDVHTMDLGKHQWDLDTCKGLLPRYEHASFIPSCTPDRIW VFGGANQSGNRNCLQVLNPETRTWTTPEVTSPPPSPRTFHTSSAAIGNQLYVFGGGER GAQPVQDTKLHVFDANTLTWSQPETLGNPPSPRHGHVMVAAGTKLFIHGGLAGDRFYD DLHCIDISDMKWQKLNPTGAAPAGCAAHSAVAMGKHVYIFGGMTPAGALDTMYQYHTE EQHWTLLKFDTLLPPGRLDHSMCIIPWPVTCASEKEDSNSLTLNHEAEKEDSVDKVMS HSGDSHEESQTATLLCLVFGGMNTEGEIYDDCIVTVVD" BASE COUNT 345 a 328 c 340 g 284 t ORIGIN 1 cctaagtcgc cgcagaactg ccacgtgggg atgagatttg ctgggctggt agcggcggct 61 gctgcgggga ggtcccgccc acgtgaagcc agcctaactg agctctggac tttggggaca 121 gctgtcagtg gcctaggccg caggacacca tgaagcaact gccagtcttg gaacctggag 181 acaagcccag gaaagcaaca tggtacacct tgactgtccc tggagacagc ccctgtgctc 241 gagttggcca cagctgttca tatttacccc cagttggtaa tgccaagaga gggaaggtct 301 tcattgttgg gggagcaaat ccaaacagaa gcttctcaga cgtgcacacc atggatctgg 361 gaaaacacca gtgggactta gatacctgca agggcctctt gccccggtat gaacatgcta 421 gcttcattcc ctcctgcaca cctgaccgta tttgggtatt tggaggtgcc aaccaatcag 481 gaaatcgaaa ttgtctacaa gtcctgaatc ctgaaaccag gacgtggacc acgccagaag 541 tgaccagccc cccaccatcc ccaagaacat tccacacatc atcggcagcc attggaaacc 601 agctatatgt ctttgggggt ggagagagag gtgcccagcc cgtgcaggac acgaagctgc 661 atgtgtttga cgcaaacact ctgacctggt cacagccaga gacacttgga aatcctccat 721 ctccccggca tggtcatgtg atggtggcag cagggacaaa gctcttcatc cacggaggct 781 tggcggggga cagattctat gatgacctcc actgcattga tataagtgac atgaaatggc 841 agaagctaaa tcccactggg gctgctccag caggctgtgc tgcccactca gctgtggcca 901 tgggaaaaca tgtgtacatc tttggtggaa tgactcctgc aggagcactg gacacaatgt 961 accagtatca cacagaagag cagcattgga ccttgcttaa atttgatact cttctacccc 1021 ctggacgatt ggaccattcc atgtgtatca ttccatggcc agtgacgtgt gcttctgaga 1081 aagaagattc caactctctc actctgaacc atgaagctga gaaagaggat tcagttgaca 1141 aagtaatgag ccacagtggt gactcacatg aggaaagcca gactgctaca ctgctctgtt 1201 tggtgtttgg tgggatgaat acagaagggg aaatctatga cgattgtatt gtgactgtag 1261 tggactaata aaacccacat ttttattaaa aaaaaaa // LOCUS HSRABAPTI 2965 bp RNA PRI 18-MAR-1996 DEFINITION H.sapiens mRNA for RABAPTIN-5 protein. ACCESSION X91141 NID g1050522 KEYWORDS effector; rabaptin-5 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2965) AUTHORS Stenmark,H., Vitale,G., Ullrich,O. and Zerial,M. TITLE Rabaptin-5 is a direct effector of the small GTPase Rab5 in endocytic membrane fusion JOURNAL Cell 83 (3), 423-432 (1995) MEDLINE 96067640 REFERENCE 2 (bases 1 to 2965) AUTHORS Vitale,G. TITLE Direct Submission JOURNAL Submitted (30-AUG-1995) G. Vitale, EUROPEAN MOLECULAR BIOLOGY LABORATORY, Cell Biology, Meyerhofstrasse 1, Postfach 10.2209, D-69012 Heidelberg, FRG FEATURES Location/Qualifiers source 1..2965 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..>2965 CDS 189..2777 /function="effector of small GTPase RAB5 in endocytic membrane fusion" /codon_start=1 /evidence=experimental /product="rabaptin-5" /db_xref="PID:g1050523" /translation="MAQPGPASQPDVSLQQRVAELEKINAEFLRAQQQLEQEFNQKRA KFKELYLAKEEDLKRQNAVLQAAQDDLGHLRTQLWEAQAEMENIKAIATVSENTKQEA IDEVKRQWREEVASLQAVMKETVRDYEHQFHLRLEQERTQWAQYREYAEREIADLRRR LSEGQEEENLENEMKKAQEDAEKLRSVVMPMEKEIAALKDKLTEAEDKIKELEASKVK ELNHYLEAEKSCRTDLEMYVAVLNTQKSVLQEDAEKLRKELHEVCHLLEQERQQHNQL KHTWQKANDQFLESQRLLMRDMQRMEIVLTSEQLRQVEELKKKDQEDDEQQRLNKRKD HKKADVEEEIKIPVVCALTQEESSAQLSNEEEHLDSTRGSVHSLDAGLLLPSGDPFSK SDNDMFKDGLRRAQSTDSLGTSGSLQSKALGYNYKAKSAGNLDESDFGPLVGADSVSE NFDTASLGSLQMPSGFMLTKDQERAIKAMTPEQEETASLLSSVTQGMESAYVSPSGYR LVSETEWNLLQKEVHNAGNKLGRRCDMCSNYEKQLQGIQIQEAETRDQVKKLQLMLRQ ANDQLEKTMKDKQELEDFIKQSSEDSSHQISALVLRAQASEILLEELQQGLSQAKRDV QEQMAVLMQSREQVSEELVRLQKDNDSLQGKHSLHVSLQQAEDFILPDTTEALRELVL KYREDIINVRTAADHVEEKLKAEILFLKEQIQAEQCLKENLEETLQLEIENCKEEIAS ISSLKAELERIKVEKGQLESTLREKSQQLESLQEIKISLEEQLKKETAAKATVEQLMF EEKNKAQRLQTELDVSEQVQRDFVKLSQTLQVQLERIRQADSLERIRAILNDTKLTDI NQLPET" BASE COUNT 983 a 586 c 774 g 622 t ORIGIN 1 gcggaggtcg gcggtcgggt ccgtctctgc ccgcggctgt ggcggcgccg gcggatccag 61 ccttagcgtt cctctctggg cggcggcggc ggcggctcgg ttgacgcctc ctccgccagc 121 tgagcccgcg ggagcccagg acgccgcttc cccgcccatc cccgctcccc gaggccggcc 181 gcctggtcat ggcgcagccg ggcccggctt cccagcctga cgtttctctt cagcaacggg 241 tagcagaatt ggaaaaaatt aatgcagaat ttttacgtgc acaacagcag cttgaacaag 301 aatttaatca aaagagagca aaatttaagg agttatattt ggctaaagag gaggatctga 361 agaggcaaaa tgcagtatta caagctgcac aagatgattt gggacacctt cgaacccagc 421 tgtgggaagc tcaagcagag atggagaata ttaaggcgat tgccacagtc tctgagaaca 481 ccaagcaaga agctatagat gaagtgaaaa gacagtggag agaagaagtt gcttcacttc 541 aggctgttat gaaagaaaca gttcgtgact atgagcacca gttccacctt aggctggagc 601 aggagcgaac acagtgggca cagtatagag aatacgcaga gagggaaata gctgatttaa 661 gaagaaggct gtctgaaggt caagaggagg aaaatttaga aaatgaaatg aaaaaggccc 721 aagaggatgc tgagaaactt cggtccgttg tgatgccaat ggaaaaggaa attgcagctt 781 tgaaggataa actgacagag gctgaagaca aaattaaaga gctggaggcc tcaaaggtta 841 aagaactgaa tcattatctg gaagctgaga aatcttgtag gactgatcta gagatgtatg 901 tagctgtttt gaatactcag aaatctgttc tacaggaaga tgctgagaaa ctgcggaaag 961 aattgcatga agtttgccat ctcttggagc aagagcgaca acaacacaac cagttaaaac 1021 atacgtggca gaaggccaat gaccagtttc tggaatctca gcgtttactg atgagagaca 1081 tgcagcgaat ggagattgtg ctaacttcag aacagctccg acaagttgaa gaactgaaga 1141 agaaagatca ggaggatgat gaacaacaaa gactcaataa gagaaaggat cacaaaaaag 1201 cagatgttga ggaagaaata aaaataccag tagtgtgtgc tttaactcaa gaagaatctt 1261 cagcccagtt atcaaatgaa gaggagcatt tagacagcac ccgtggctca gttcattcct 1321 tagatgcagg cttgctgttg ccatctggag atcctttcag taaatcggac aatgacatgt 1381 ttaaagatgg actcaggaga gcacagtcta cagacagctt gggaacctcg ggctcattgc 1441 aatccaaagc tttaggctat aactacaaag caaaatctgc tggaaacctg gacgagtcag 1501 attttggacc actggtagga gcagattcag tgtctgagaa ctttgatact gcatcccttg 1561 ggtcactcca gatgccaagt gggtttatgt taaccaaaga tcaggaaaga gcaatcaagg 1621 cgatgacacc agaacaagaa gagacagcgt ccctcctctc cagcgttacc cagggcatgg 1681 agagtgccta tgtgtcccct agtggttatc gtttagttag tgaaacagaa tggaatctct 1741 tgcagaaaga ggtacataat gctggaaata aacttggtag acgttgtgat atgtgttcca 1801 attacgaaaa acagttacaa ggaattcaga ttcaggaggc tgaaacgaga gaccaggtga 1861 aaaaactaca gctgatgcta aggcaagcta atgaccagtt agagaagaca atgaaagata 1921 agcaggagct ggaagacttc ataaagcaaa gcagcgaaga ttcgagtcac cagatctctg 1981 cactcgtcct aagagcccag gcctccgaga tcttacttga agagttacag caggggcttt 2041 cccaggcaaa gagggatgtt caggaacaga tggcggtgct gatgcagtca cgggaacagg 2101 tttcagaaga gctggtgagg ttacagaaag ataatgacag tctccaggga aagcacagcc 2161 tgcatgtgtc attacagcaa gcagaagact tcatcctccc agacactaca gaggcactgc 2221 gggagttggt attaaaatac cgtgaggaca tcattaatgt gcggacagca gcagaccacg 2281 tagaagaaaa gctgaaggct gagatacttt tcctaaaaga gcagatccaa gcagaacagt 2341 gtttaaaaga aaatcttgaa gaaactctgc aactagaaat agaaaactgc aaggaggaaa 2401 tagcttctat ttctagccta aaagctgaat tagaaagaat aaaagtggaa aaaggacagt 2461 tggagtccac attaagagag aagtctcaac agcttgagag tcttcaggaa ataaagatca 2521 gtttggaaga gcagttaaag aaagagactg ctgctaaggc taccgttgaa cagctaatgt 2581 ttgaagagaa gaacaaagct cagagattac agacagaatt agatgtcagt gagcaagtcc 2641 agagagattt tgtaaagctt tcacagaccc ttcaggtgca gttagagcgg atccggcaag 2701 ctgactcctt ggagagaatc cgggcaattc tgaatgatac taaactgaca gacattaacc 2761 agcttcctga gacatgacac cctcatggca ggattctagc ctgcactttg ggtttttaac 2821 tcatctttag agcaacagta attattattt aactcttaac tgaagaaaga gaagtcacaa 2881 caaaaggaag actggagaaa tgcttacttc tagagggaga agactgtgcg gcacaggaaa 2941 cagcaaacag tggggtgatc tgcag // LOCUS HSRABGTRA 2067 bp RNA PRI 14-JAN-1997 DEFINITION H.sapiens mRNA for rab geranylgeranyl transferase, alpha-subunit. ACCESSION Y08200 NID g1552546 KEYWORDS alpha-subunit; rab geranylgeranyl transferase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2067) AUTHORS van Bokhoven,H., Rawson,R.B., Merkx,G.F., Cremers,F.P. and Seabra,M.C. TITLE cDNA cloning and chromosomal localization of the genes encoding the alpha- and beta-subunits of human rab geranylgeranyl transferase: the 3' end of the alpha-subunit gene overlaps with the transglutaminase 1 gene promoter JOURNAL Genomics 38 (2), 133-140 (1996) MEDLINE 97127587 REFERENCE 2 (bases 1 to 2067) AUTHORS van Bokhoven,H. TITLE Direct Submission JOURNAL Submitted (18-SEP-1996) H. van Bokhoven, University Hospital Nijmegen, Human Genetics 417, Box 9101, 6500 HB Nijmegen, NETHERLANDS FEATURES Location/Qualifiers source 1..2067 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="fetal brain" /chromosome="14" /map="q11.2" CDS 275..1978 /note="alpha-subunit" /codon_start=1 /product="rab geranylgeranyl transferase" /db_xref="PID:e266737" /db_xref="PID:g1552547" /translation="MHGRLKVKTSEEQAEAKRLEREQKLKLYQSATQAVFQKRQAGEL DESVLELTSQILGANPDFATLWNCRREVLQQLETQKSPEELAALVKAELGFLESCLRV NPKSYGTWHHRCWLLGRLPEPNWTRELELCARFLEVDERNFHCWDYRRFVATQAAVPP AEELAFTDSLITRNFSNYSSWHYRSCLLPQLHPQPDSGPQGRLPEDVLLKELELVQNA FFTDPNDQSAWFYHRWLLGRADPQDALRCLHVSRDEACLTVSFSRPLLVGSRMEILLL MVDDSPLIVEWRTPDGRNRPSHVWLCDLPAASLNDQLPQHTFRVIWTAGDVQKECVLL KGRQEGWCRDSTTDEQLFRCELSVEKSTVLQSELESCKELQELEPENKWCLLTIILLM RALDPLLYEKERLQYFQTLKAVDPMRATYLDDLRSKFLLENSVLKMEYAEVRVLHLAH KDLTVLCHLEQLLLVTHLDLSHNRLRTLPPALAALRCLEVLQASDNAIESLEGVTNLP RLQELLLCNNRLQQPAVLQPLASCPRLVLLNLQGNPLCQAVGILEQLAELLPSVSSVL T" BASE COUNT 420 a 625 c 590 g 430 t 2 others ORIGIN 1 gaattccctc gcgctctggn ccgggcgaat cgggntatag gaagggccac acggatggaa 61 gtcctagtcc gggtgctcac ctcttgtgga acgtgcaaag cctgtcccag gacctctcta 121 cactctgggg gtctctgccc aggcacgctt gctgcttccg gacacagctg tgggcggagc 181 tagtaggggc gggctacgtg attgacactt ctctcctcag acttcaaggg ctaccactgg 241 acccttcccc tgtcttgaac cctgagccgg caccatgcac ggacgcctga aggtgaagac 301 gtcagaagag caggcggagg ccaaaaggct agagcgagag cagaagctga agctatacca 361 gtcagccacc caggccgtat tccagaagcg ccaggctggt gagctggatg agtccgtgct 421 ggaactgaca agccagattc tgggagccaa ccctgatttt gccaccctct ggaactgccg 481 acgagaggtg ctccagcagc tggagactca gaagtctcct gaagagttgg ctgctctggt 541 gaaggcagaa ctgggcttcc tggagagctg cctgcgggtg aaccccaagt cttatggtac 601 ctggcaccac cgatgctggc tgctaggccg cctgcctgag cccaactgga cccgagagct 661 ggagctctgt gcccgtttcc tggaggtgga tgagcggaac tttcactgct gggactatcg 721 gcggtttgtg gccacacagg cagccgtgcc ccctgcagaa gagctagcct tcactgacag 781 cctcatcacc cgaaacttct ccaactactc ttcctggcat taccgctcct gtctcttgcc 841 ccagttgcac ccccagccgg attctggacc acaggggcgc ctccctgagg atgtgctgct 901 caaagagctg gagctggtgc agaatgcctt cttcactgac cccaatgacc agagtgcctg 961 gttttatcac cggtggctcc taggtcgagc tgacccccag gatgcactgc gctgtctgca 1021 tgtgagccgg gacgaggcct gtctgactgt ctccttctct cggcccctct tagtgggctc 1081 caggatggag atcttgctgc tcatggttga tgattctccc ctgattgtgg agtggaggac 1141 cccagatggc aggaaccggc ccagccatgt ctggctctgt gacctgcctg ctgcctccct 1201 caacgaccag ttgccccaac atacatttcg cgtcatttgg acagcaggcg atgtccagaa 1261 agaatgcgtg cttttaaaag gccgccagga gggctggtgc cgggactcca cgacagacga 1321 gcagctattc aggtgtgagc tgtcagtgga gaagtccaca gtgctgcagt ctgagctgga 1381 atcctgtaag gagctgcagg agctggagcc tgagaataaa tggtgcctgc ttaccatcat 1441 cctgctgatg cgggcactgg accccctgct gtatgagaag gagaggctgc agtacttcca 1501 gaccctcaag gccgtggacc ccatgcgggc aacgtatctg gatgacctgc gcagcaagtt 1561 cttgctggag aatagcgtgc tcaagatgga gtatgccgag gtgcgtgtgc tgcacctggc 1621 tcacaaggat ctgacagtgc tctgccatct ggaacagctg ctcttggtca cccatcttga 1681 cttgtcacac aatcgcctcc gaaccctgcc acctgcactg gctgccctgc gctgccttga 1741 ggtgctgcag gccagtgata atgcaataga gtccctggag ggcgtcacca acctaccccg 1801 gctgcaggag ctgctactgt gcaacaaccg cctccagcag cctgcagtgc tccagcctct 1861 tgcctcctgc cccaggctgg tcctcctcaa cctgcagggt aacccgctgt gccaagcggt 1921 gggcatcttg gagcaactgg ctgaactgct gccttcagtt agcagcgtcc tcacctaaga 1981 ggccctgccc cctacccttg ccctttaact tattgggact gaataaagaa tggagaggcc 2041 cctctcaggc taccaaaaaa aaaaaaa // LOCUS HSRAD50 4123 bp RNA PRI 11-DEC-1997 DEFINITION H.sapiens mRNA for RAD50. ACCESSION Z75311 NID g2687852 KEYWORDS RAD50. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4123) AUTHORS Offenberg,H.H. JOURNAL Unpublished REFERENCE 2 (bases 1 to 4123) AUTHORS Offenberg,H.H. TITLE Direct Submission JOURNAL Submitted (10-JUL-1996) Offenberg H.H., Agricultural University, Genetics, Dreyenlaan 2, Wageningen, 6703 HA The Netherlands FEATURES Location/Qualifiers source 1..4123 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="testis" /clone_lib="library HL1161a in lambda gt10 (Clontech)" gene 43..3999 /gene="RAD50" CDS 43..3999 /gene="RAD50" /function="DNA repair and recombindation protein" /codon_start=1 /product="RAD50 homologue hsRAD50" /db_xref="PID:e252505" /db_xref="PID:g2687853" /translation="MLIFSVRDMFAKMSILGVRSFGIEDKDKQIITFFSPLTILVGPN GAGKTTIIECLKYICTGDFPPGTKGNTFVHDPKVAQETDVRAQIRLQFRDVNGELIAV QRSMVCTQKSKKTEFKTLEGVITRTKHGEKVSLSSKCAEIDREMISSLGVSKAVLNNV IFCHQEDSNWPLSEGKALKQKFDEIFSATRYIKALETLRQVRQTQGQKVEEYQMELKY LKQYKEKACEIRDQITSKEAQLTSSKEIVKSYENELDPLKNRLKEIEHNLSKIMKLDN EIKALDSRKKQMEKDNSELEEKMEKVFQGTDEQLNDLYHNHQRTVREKERKLVDCHRE LEKLNKESRLLNQEKSELLVEQGRLQLQADRHQEHIRARDSLIQSLATQLELDGFERG PFSERQIKNFHKLVRERQEGEAKTANQLMNDFAEKETLKQKQIDEIRDKKTGLGRIIE LKSEILSKKQNELKNVKYELQQLEGSSDRILELDQELIKAERELSKAEKNSNVETLKM EVISLQNEKADLDRTLRKLDQEMEQLNHHTTTRTQMEMLTKDKADKDEQIRKIKSRHS DELTSLLGYFPNKKQLEDWLHSKSKEINQTRDRLAKLNKELASSEQNKNHINNELERK EEQLSSYEDKLFDVCGSQDFESDLDRLKEEIEKSSKQRAMLAGATAVYSQFITQLTDE NQSCCPVCQRVFQTEAELQEAISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAP MRQSIIDLKEKEIPELRNKLQNVNRDIQRLKNDIEEQETLLGTIMPEEESAKVCLTDV TIMERFQMELKDVERKIAQQAAKLQGIDLDRTVQQVNQEKQEKQHKLDTVSSKIELNR KLIQDQQEQIQHLKSTTNELKSEKLQISTNLQRRQQLEEQTVELSTEVQSLYREIKDA KEQVSPLETTLEKFQQEKEELINKKNTSNKIAQDKLNDIKEKVKNIHGYMKDIENHIQ DGKDDYMKQKETELNKVIAQLSECEKHKEKINEDMRLMRQDIDTQKIQERWLQDNLTL RKRNEELKEVEEEGKQHLKEMGQMQVLQMKSEHQKLEENIDNIKRNHNLALGRQKGYE EEIIHFKKELREPQFRDAEEKYREMMIVMRTTELVNKDLDIYYKTLDQAIMKFHSMKM EEINKIIRDLWRSTYRGQDIEYIEIRSDADENVSASDKRRNYNYRVVMLKGDTALDMR GRCSAGQKVLASLIIRLALAETFCLNCGIIALDEPTTNLDRENIESLAHALVEIIKSR SQQRNFQLLVITHDEDFVELLGRSEYVEKFYRIKKNIDQCSEIVKCSVSSLGFNVH" BASE COUNT 1630 a 661 c 901 g 931 t ORIGIN 1 tgcggagttt tggaatagag gacaaagata agcacgccca gaatgctcat cttttcggtc 61 cgggacatgt ttgcaaagat gagcattctg ggcgtgcgga gttttggaat agaggacaaa 121 gataagcaaa ttatcacttt cttcagcccc cttacaattt tggttggacc caatggggcg 181 ggaaagacga ccatcattga atgtctaaaa tatatttgta ctggagattt ccctcctgga 241 accaaaggaa atacatttgt acacgatccc aaggttgctc aagaaacaga tgtgagagcc 301 cagattcgtc tgcaatttcg tgatgtcaat ggagaactta tagctgtgca aagatctatg 361 gtgtgtactc agaaaagcaa aaagacagaa tttaaaaccc tggaaggagt cattactaga 421 acaaagcatg gtgaaaaggt cagtctgagc tctaagtgtg cagaaattga ccgagaaatg 481 atcagttctc ttggggtttc caaggctgtg ctaaataatg tcattttctg tcatcaagaa 541 gattctaatt ggcctttaag tgagggaaag gctttgaagc aaaagtttga tgagattttt 601 tcagcaacaa gatacattaa agccctagaa acacttcggc aggtacgtca gacacaaggt 661 cagaaagtag aagaatatca aatggaacta aaatatctga agcaatataa ggaaaaagct 721 tgtgagattc gtgatcagat tacaagtaag gaagcccagt taacatcttc aaaggaaatt 781 gtcaaatcct atgagaatga acttgatcca ttgaagaatc gtctaaaaga aattgaacat 841 aatctctcta aaataatgaa acttgacaat gaaattaaag ccttggatag ccgaaagaag 901 caaatggaga aagataatag tgaactggaa gagaaaatgg aaaaggtttt tcaagggact 961 gatgagcaac taaatgactt atatcacaat caccagagaa cagtaaggga gaaagaaagg 1021 aaattggtag actgtcatcg tgaactggaa aaactaaata aagaatctag gcttctcaat 1081 caggaaaaat cagaactgct tgttgaacag ggtcgtctac agctgcaagc agatcgccat 1141 caagaacata tccgagctag agattcatta attcagtctt tggcaacaca gctagaattg 1201 gatggctttg agcgtggacc attcagtgaa agacagatta aaaattttca caaacttgtg 1261 agagagagac aagaagggga agcaaaaact gccaaccaac tgatgaatga ctttgcagaa 1321 aaagagactc tgaaacaaaa acagatagat gagataagag ataagaaaac tggactggga 1381 agaataattg agttaaaatc agaaatccta agtaagaagc agaatgagct gaaaaatgtg 1441 aagtatgaat tacagcagtt ggaaggatct tcagacagga ttcttgaact ggaccaggag 1501 ctcataaaag ctgaacgtga gttaagcaag gctgagaaaa acagcaatgt agaaacctta 1561 aaaatggaag taataagtct ccaaaatgaa aaagcagact tagacaggac cctgcgtaaa 1621 cttgaccagg agatggagca gttaaaccat catacaacaa cacgtaccca aatggagatg 1681 ctgaccaaag acaaagctga caaagatgaa caaatcagaa aaataaaatc taggcacagt 1741 gatgaattaa cctcactgtt gggatatttt cccaacaaaa aacagcttga agactggcta 1801 catagtaaat caaaagaaat taatcagacc agggacagac ttgccaaatt gaacaaggaa 1861 ctagcttcat ctgagcagaa taaaaatcat ataaataatg aactagaaag aaaggaagag 1921 cagttgtcca gttacgaaga caagctgttt gatgtttgtg gtagccagga ttttgaaagt 1981 gatttagaca ggcttaaaga ggaaattgaa aaatcatcaa aacagcgagc catgctggct 2041 ggagccacag cagtttactc ccagttcatt actcagctaa cagacgaaaa ccagtcatgt 2101 tgccccgttt gtcagagagt ttttcagaca gaggctgagt tacaagaagc catcagtgat 2161 ttgcagtcta aactgcgact tgctccagat aaactcaagt caacagaatc agagctaaaa 2221 aaaaaggaaa agcggcgtga tgaaatgctg ggacttgcgc ccatgaggca aagcataatt 2281 gatttgaagg agaaggaaat accagaatta agaaacaaac tgcagaatgt caatagagac 2341 atacagcgcc taaagaacga catagaagaa caagaaacac tcttgggtac aataatgcct 2401 gaagaagaaa gtgccaaagt atgcctgaca gatgttacaa ttatggagag gttccagatg 2461 gaacttaaag atgttgaaag aaaaattgca caacaagcag ctaagctaca aggaatagac 2521 ttagatcgaa ctgtccaaca agtcaaccag gagaaacaag agaaacagca caagttagac 2581 acagtttcta gtaagattga attgaatcgt aagcttatac aggaccagca ggaacagatt 2641 caacatctaa aaagtacaac aaatgagcta aaatctgaga aacttcagat atccactaat 2701 ttgcaacgtc gtcagcaact ggaggagcag actgtggaat tatccactga agttcagtct 2761 ttgtacagag agataaagga tgctaaagag caggtaagcc ctttggaaac aacattggaa 2821 aagttccagc aagaaaaaga agaattaatc aacaaaaaaa atacaagcaa caaaatagca 2881 caggataaac tgaatgatat taaagagaag gttaaaaata ttcatggcta tatgaaagac 2941 attgagaatc atattcaaga tgggaaagac gactatatga agcaaaaaga aactgaactt 3001 aataaagtaa tagctcaact aagtgaatgc gagaaacaca aagaaaagat aaatgaagat 3061 atgagactca tgagacaaga tattgataca cagaagatac aagaaaggtg gctacaagat 3121 aaccttactt taagaaaaag aaatgaggaa ctaaaagaag ttgaagaaga aggaaaacaa 3181 catttgaagg aaatgggtca aatgcaggtt ttgcaaatga aaagtgaaca tcagaagttg 3241 gaagagaaca tagacaatat aaaaagaaat cataatttgg cattagggcg acagaaaggt 3301 tatgaagaag aaattattca ttttaagaaa gaacttcgag aaccacaatt tcgggatgct 3361 gaggaaaagt atagagaaat gatgattgtt atgaggacaa cagaacttgt gaacaaggat 3421 ctggatattt attataagac tcttgaccaa gcaataatga aatttcacag tatgaagatg 3481 gaagaaatca ataaaattat acgtgacctg tggcgaagta cctatcgtgg acaagatatt 3541 gaatacatag aaatacggtc tgatgccgat gaaaatgtat cagcttctga taaaaggcgg 3601 aattataact accgagtggt gatgctgaag ggagacacag ccttggatat gcgaggacga 3661 tgcagtgctg gacaaaaggt attagcctca ctcatcattc gcctggccct ggctgaaacg 3721 ttctgcctca actgtggcat cattgccttg gatgagccaa caacaaatct tgaccgagaa 3781 aacattgaat ctcttgcaca tgctctggtt gagataataa aaagtcgctc acagcagcgt 3841 aacttccagc ttctggtaat cactcatgat gaagattttg tggagctttt aggacgttct 3901 gaatatgtgg agaaattcta caggattaaa aagaacatcg atcagtgctc agagattgtg 3961 aaatgcagtg ttagctccct gggattcaat gttcattaaa aatatccaag atttaaatgc 4021 catagaaatg taggtcctca gaaagtgtat aataagaaac ttatttctca tatcaactta 4081 gtcaataaga aaatatattc tttcaaagga aaaaaaaaaa aaa // LOCUS HSRAD54 2607 bp RNA PRI 17-AUG-1996 DEFINITION H.sapiens mRNA homologous to S. cerevisiae RAD54. ACCESSION X97795 NID g1495482 KEYWORDS DNA repair protein; RAD54 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2607) AUTHORS Kanaar,R., Troelstra,C., Swagemakers,S.M.A., Essers,J., Smit,B., Franssen,J.H., Pastink,A., Bezzubova,O.Y., Buerstedde,J.M., Clever,B., Heyer,W.D. and Hoeijmakers,J.H.J. TITLE Human and mouse homologs of the Saccharomyces cerevisiae RAD54 DNA repair gene: Evidence for functional conservation JOURNAL Unpublished REFERENCE 2 (bases 1 to 2607) AUTHORS Kanaar,R. TITLE Direct Submission JOURNAL Submitted (14-MAY-1996) R. Kanaar, Erasmus University Rotterdam, Cell Biology and Genetics, PO Box 1738, 3000 DR Rotterdam, Netherlands FEATURES Location/Qualifiers source 1..2607 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="p32" /clone_lib="lambda gt11" /tissue_type="testis" gene 101..2344 /gene="RAD54" CDS 101..2344 /gene="RAD54" /function="DNA repair" /codon_start=1 /db_xref="PID:e246056" /db_xref="PID:g1495483" /translation="MRRSLAPSQLAKRKPEGRSCDDEDWQPGLVTPRKRKSSSETQIQ ECFLSPFRKPLSQLTNQPPCLDSSQHEAFIRSILSKPFKVPIPNYQGPLGSRALGLKR AGVRRALHDPLEKDALVLYEPPPLSAHDQLKLDKEKLPVHVVVDPILSKVLRPHQREG VKFLWECVTSRRIPGSHGCIMADEMGLGKTLQCITLMWTLLRQSPECKPEIDKAVVVS PSSLVKNWYNEVGKWLGGRIQPLAIDGGSKDEIDQKLEGFMNQRGARVSSPILIISYE TFRLHVGVLQKGSVGLVICDEGHRLKNSENQTYQALDSLNTSRRVLISGTPIQNDLLE YFSLVHFVNSGILGTAHEFKKHFELPILKGRDAAASEADRQLGEERLRELTSIVNRCL IRRTSDILSKYLPVKIEQVVCCRLTPLQTELYKRFLRQAKPAEELLEGKMSVSSLSSI TSLKKLCNHPALIYDKCVEEEDGFVGALDLFPPGYSSKALEPQLSGKMLVLDYILAVT RSRSSDKVVLVSNYTQTLDLFEKLCRARRYLYVRLDGTMSIKKRAKVVERFNSPSSPD FVFMLSSKAGGCGLNLIGANRLVMFDPDWNPANDEQAMARVWRDGQKKTCYIYRLLSA GTIEEKIFQRQSHKKALSSCVVDEEQDVERHFSLGELKELFILDEASLSDTHDRLHCR RCVNSRQIRPPPDGSDCTSDLAGWNHCTDKWGLRDEVLQAAWDAASTAITFVFHQHSH EEQRGLR" BASE COUNT 632 a 630 c 724 g 621 t ORIGIN 1 gaattcgggc agattagacc ctggtcctac actcttagcc gctgcctgct tttgaccttt 61 ggctcatggg tacttgacgt tttaaactcc taggcccagg atgaggagga gcttggctcc 121 cagccagctg gccaagagaa aacctgaagg caggtcctgt gatgatgaag actggcaacc 181 tggcctagtg actcctagga aacggaaatc cagcagtgag acccagatcc aggagtgttt 241 cctgtctcct tttcggaaac ctttgagtca gctaaccaat caaccacctt gtctggacag 301 cagtcagcat gaagcattta ttcgaagcat tttgtcaaag cctttcaaag tccccattcc 361 aaattatcaa ggtcctctgg gctctcgagc attgggcctg aaaagggctg gggtccgccg 421 ggccctccat gaccccctgg aaaaagatgc cttggttctg tatgagcctc ccccgctgag 481 cgctcatgac cagctgaagc ttgacaagga gaaactccct gtccatgtgg ttgttgaccc 541 tattctcagt aaggttttgc ggcctcatca gagagaggga gtgaaattcc tgtgggagtg 601 tgtcaccagt cggcgcatcc ctggcagcca tggctgcatc atggctgatg agatgggcct 661 aggaaagacg ctgcagtgca tcacattgat gtggacactt ttacgccaga gtccagagtg 721 caagccagaa attgacaagg cagtggtggt gtcgccttcc agcctggtga agaactggta 781 caatgaggtt gggaaatggc tcggagggag gatccaacct ctggccatcg atggaggatc 841 taaggatgaa atagaccaaa agctggaagg attcatgaac cagcgtggag ccagggtgtc 901 ttctcccatc ctcatcattt cctatgagac cttccgcctt catgttggag tcctccagaa 961 aggaagtgtt ggtctggtca tatgtgacga gggacacagg ctcaagaact ctgagaatca 1021 gacttaccaa gccctggaca gcttgaacac cagccggcgg gtgctcatct ccggaactcc 1081 catccagaat gatctgcttg agtatttcag cttggtacat tttgttaatt ccggcatcct 1141 agggactgcc catgaattca agaagcattt tgaattgcca attttgaagg gtcgagacgc 1201 tgctgctagt gaggcagaca ggcagctagg agaggagcgg ctgcgggagc tcaccagcat 1261 tgtgaataga tgcctgatac ggaggacttc tgatatcctt tctaaatatc tgcctgtgaa 1321 gattgagcag gtcgtttgtt gtaggctgac accccttcag actgagttat acaagaggtt 1381 tctgagacaa gccaaaccgg cagaagaatt gcttgagggc aagatgagtg tgtcttccct 1441 ttcttccatc acctcgctaa agaagctttg taatcatcca gctctaatct atgataagtg 1501 tgtggaagag gaggatggct ttgtgggtgc cttggacctc ttccctcctg gttacagctc 1561 taaggccctg gagccccagc tgtcaggtaa gatgctggtc ctggattata ttctggcggt 1621 gacccgaagc cgtagcagtg acaaagtagt gctggtgtcg aattacaccc agactttgga 1681 tctctttgag aagctgtgcc gtgcccgaag gtacttatac gtccgcctgg atggcacgat 1741 gtccattaag aagcgagcca aggttgtaga acgcttcaat agtccatcga gccctgactt 1801 tgtcttcatg ctgagcagca aagctggggg ctgtggcctc aatctcattg gggctaaccg 1861 gctggtcatg tttgaccctg actggaaccc agccaatgat gaacaagcca tggcccgggt 1921 ctggcgagat ggtcaaaaga agacttgcta tatctaccgc ctgctgtctg cagggaccat 1981 tgaggagaag atcttccagc gtcagagcca caagaaggca ctgagcagct gtgtggtgga 2041 tgaggagcag gatgtagagc gccacttctc tctgggcgag ttgaaggagc tgtttatcct 2101 ggatgaagct agcctcagtg acacacatga caggttgcac tgccgacgtt gtgtcaacag 2161 ccgtcagatc cggccacccc ctgatggttc tgactgcact tcagacctgg cagggtggaa 2221 ccactgcact gataagtggg ggctccggga tgaggtactc caggctgcct gggatgctgc 2281 ctccactgct atcaccttcg tcttccacca gcattctcat gaggaacagc ggggcctccg 2341 ctgataacca gctggtctgg gtgtagctct tagaggaagg agatagggaa aaggggctcc 2401 ttgctccaca gggccctgtt gaattttgtt ctctgggaga aaatcatcaa gaagggctgc 2461 atgatgtttg cccaaaattt attttataag aaaaactttt ttggttaaaa aaaagaataa 2521 aggtatgaaa gggctggtga cagtcaggga tgcccccggc acacagggac taggtctagt 2581 gagaacatca ggagcagcca gggatcc // LOCUS HSRAF1P1 4257 bp RNA PRI 10-OCT-1994 DEFINITION H.sapiens AF-1p mRNA. ACCESSION Z29064 NID g470034 KEYWORDS AF-1p gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4257) AUTHORS Bernard,O. TITLE Direct Submission JOURNAL Submitted (09-DEC-1993) Olivier Bernard, U 301 INSERM, 27, rue Juliette Dodu, Paris, 75010, France REFERENCE 2 (bases 1 to 4257) AUTHORS Bernard,O.A., Mauchauffe,M., Mecucci,C., Van den Berghe,H. and Berger,R. TITLE A novel gene, AF-1p, fused to HRX in t(1;11)(p32;q23), is not related to AF-4, AF-9 nor ENL JOURNAL Oncogene 9 (4), 1039-1045 (1994) MEDLINE 94181254 FEATURES Location/Qualifiers source 1..4257 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="AF-1p cDNA" /cell_line="RPMI 8226" /chromosome="1p32" gene 93..2783 /gene="AF-1p" CDS 93..2783 /gene="AF-1p" /note="Highly similar to murine eps 15 GB A.N. L221768" /codon_start=1 /db_xref="PID:g470035" /db_xref="SWISS-PROT:P42566" /translation="MAAAAQLSLTQLSSGNPVYEKYYRQVDTGNTGRVLASDAAAFLK KSGLPDLILGKIWDLADTDGKGILNKQEFFVALRLVACAQNGLEVSLSSLNLAVPPPR FHDTSSPLLISGTSAAELPWAVKPEDKAKYDAIFDSLSPVNGFLSGDKVKPVLLNSKL PVDILGRVWELSDIDHDGMLDRDEFAVAMFLVYCALEKEPVPMSLPPALVPPSKRKTW VVSPAEKAKYDEIFLKTDKDMDGFVSGLEVREIFLKTGLPSTLLAHIWSLCDTKDCGK LSKDQFALAFHLISQKLIKGIDPPHVLTPEMIPPSDRASLQKNIIGSSPVADFSAIKE LDTLNNEIVDLQREKNNVEQDLKEKEDTIKQRTSEVQDLQDEVQRENTNLQKLQAQKQ QVQELLDELDEQKAQLEEQLKEVRKKCAEEAQLISSLKAELTSQESQISTYEEELAKA REELSRLQQETAELEESVESGKAQLEPLQQHLQDSQQEISSMQMKLMEMKDLENHNSQ LNWCSSPHSILVNGATDYCSLSTSSSETANLNEHVEGQSNLESEPIHQESPARSSPEL LPSGVTDENEVTTAVTEKVCSELDNNRHSKEEDPFNVDSSSLTGPVADTNLDFFQSDP FVGSDPFKDDPFGKIDPFGGDPFKGSDPFASDCFFRQSTDPFATSSTDPFSAANNSSI TSVETLKHNDPFAPGGTVVAASDSATDPFASVFGNESFGGGFADFSTLSKVNNEDPFR SATSSSVSNVVITKNVFEETSVKSEDEPPALPPKIGTPTRPCPLPPGKRSINKLDSPD PFKLNDPFQPFPGNDSPKEKDPEIFCDPFTSATTTTNKEADPSNFANFSAYPSEEDMI EWAKRESEREEEQRLARLNQQEQEDLELAIALSKSEISEA" misc_feature 135..136 /gene="AF-1p" /note="fusion site with HRX in t(1;11)(p32;q23) chromosomal translocation" misc_feature 742..744 /gene="AF-1p" /note="111 nucleotide insertion in 1 cDNA" polyA_signal 4223..4228 BASE COUNT 1303 a 836 c 871 g 1247 t ORIGIN 1 ggcctcgcct gcggccgctc cctccgcctc ctccccgccc cgagccccag tcagcccgtc 61 ttccttcccc tcccttgcat gatggaaaca ccatggctgc ggcggcccag ctctctctga 121 cacagttatc aagtgggaat cctgtatatg aaaaatacta tagacaggtt gatacaggca 181 atactggaag ggtgttggct tctgatgctg ctgctttcct gaaaaaatca gggcttccag 241 acttgatact tggaaagatt tgggatttag ccgacacaga tggcaaaggt atcctgaaca 301 aacaagaatt ctttgttgct ttgcgtcttg tggcatgtgc ccagaatgga ttggaagttt 361 cactaagtag tttgaacctg gctgttcctc caccaagatt tcatgatacc agtagtcctt 421 tgctaatcag tggaacctct gcagctgagc tcccatgggc tgtaaaacct gaagataagg 481 ccaaatatga tgcaatattt gatagtttaa gcccagtgaa tggatttctg tctggtgata 541 aagtgaaacc agtgttgctc aactctaagt tacctgtgga tatccttgga agagtttggg 601 agttgagtga tattgaccat gatggaatgc ttgacagaga tgagtttgca gttgccatgt 661 ttttggtata ctgtgcactg gagaaagaac ctgtgccaat gtccttgcct ccagccttgg 721 tgccaccatc taagagaaaa acgtgggttg tatcccctgc agaaaaagct aaatatgatg 781 aaatcttcct gaaaactgat aaagatatgg acggatttgt gtctggattg gaggtccgtg 841 aaatattctt gaaaacaggt ttaccttcta ccttactagc ccatatatgg tcattatgcg 901 acacaaagga ctgtgggaag ctttcaaagg atcagtttgc cttggctttt cacttaatca 961 gtcagaagtt aatcaagggc attgatcctc ctcacgttct tactcctgaa atgattccac 1021 catcagacag ggccagttta caaaagaaca tcataggatc aagtcctgtt gcagatttct 1081 ctgctattaa ggaactagat actcttaaca atgaaatagt tgacctacag agggaaaaga 1141 ataatgtgga acaggacctt aaggagaagg aagatactat taaacagagg acaagtgagg 1201 ttcaggatct tcaagatgaa gttcaaaggg agaatactaa tctgcaaaaa ctacaggccc 1261 agaaacagca ggtacaggaa ctccttgatg aactggatga gcagaaagcc cagctggagg 1321 agcaactcaa ggaagtcaga aagaaatgtg ctgaggaggc ccaactgatc tcttctctga 1381 aagctgaatt aactagtcag gaatcgcaga tctccactta tgaagaagaa ttggcaaaag 1441 ctagagaaga gctgagccgt ctacagcaag aaacagcaga attggaggag agtgtagagt 1501 cagggaaggc tcagttggaa cctcttcagc agcacctaca agattcacaa caggaaatta 1561 gttcaatgca aatgaaactg atggaaatga aagatttgga aaatcataat agtcagttaa 1621 attggtgcag tagcccacac agcattcttg taaacggagc tacagattat tgcagcctca 1681 gcaccagcag cagtgaaaca gccaacctta atgaacatgt tgaaggccag agcaacctag 1741 agtctgagcc catacaccag gaatctccag caagaagtag tcctgaacta ctgccttctg 1801 gtgtgactga tgaaaatgag gtgactacag ctgttactga aaaagtttgt tctgaactcg 1861 acaataatag acattcaaaa gaggaagatc catttaatgt agactcaagt tcgctgacag 1921 gtccagttgc agatacaaac ttggattttt tccagtctga tccttttgtt ggcagtgatc 1981 ctttcaagga tgatcctttt ggaaaaatcg atccatttgg tggtgatcct ttcaaaggtt 2041 cagatccatt tgcatcagac tgtttcttca ggcaatctac tgatcctttt gccacttcaa 2101 gcactgaccc tttcagtgca gccaacaata gcagtattac atcggtagaa acgttgaagc 2161 acaatgatcc ttttgctcct ggtggaacag ttgttgcagc aagcgattca gccacagacc 2221 cctttgcttc tgtttttggg aatgaatcat ttggaggtgg atttgctgac ttcagcacat 2281 tgtcaaaggt caacaatgaa gatccttttc gttcagccac atcgagctct gtcagcaacg 2341 tagtgattac aaaaaatgta tttgaggaaa catcggtcaa aagtgaagat gaacccccag 2401 cactgccacc aaagatcgga actccaacaa gaccctgccc tctaccacct gggaaaagat 2461 ccatcaacaa attggattct cctgatccct ttaaactgaa tgatccattt cagcctttcc 2521 caggcaacga tagccccaaa gaaaaagatc ctgaaatatt ttgtgatcca ttcacttctg 2581 ctactaccac taccaataaa gaggctgatc caagcaattt tgccaacttc agtgcttatc 2641 cctctgaaga agatatgatc gaatgggcca agagggaaag tgagagagag gaagagcaga 2701 ggcttgcccg actaaatcag caggaacaag aagacttaga actggctatt gcactcagca 2761 aatctgagat atcagaagca tgaagaattc tcttgttctt tggcaacaat atagtattct 2821 tcttcctgaa tactgaaact atttacaatg tgtatcaaaa ctacctgtga gcatgggaat 2881 acaaaaggtt tgagattcct gtaaatgtga caaaatttta ggattttttt tttttcttca 2941 ttacagattc gtcttttttt tttttcttat aaaagccgta acccagtcag acaaattcac 3001 cttcacttag gcccctgttc tgggatacat ttactgtgag cttttgcctg cctgtgctat 3061 tttacttgta aagctagagc acccaagctt ctgccttctg gaatatagag aaatagtttc 3121 accctgcact accctgttct gtagttattc tgatgatagc cagtgaggtt cttaaagttt 3181 gcagtattct cccctgattg gaatggttga gtgagggtaa gggaaagaat atcttatttc 3241 ttttatgatt ggtgcaaatt ggctaaagtg catttttaaa tttcctctac ttaatttgtt 3301 tttcagagat aaggaaaaat attttgcaca gatttactcc actatggaaa agggatgctg 3361 taggttgaac cattatagcc tcagattcga tcttttccta actaaaaata ttaaagcctc 3421 atgtgtgaaa taaattttta aaaagattta tctggattta gagaatttta gatcaacaga 3481 tacctctcag tgtgtttgct aattaataaa aatcagtttc ttacaaataa agtttgtaag 3541 aaaatgttca ttttaagtga tagatagtgg agaaaattta tcacctaaaa tatacccatc 3601 agtataaggc aagcaaaagt cttaacatgg cagccattct gcctttgccg tggccctgtc 3661 ctgtttagtt cttagtgggt taatttttgt acttttgcag aagaaacttc agcaagctag 3721 aactggaagg tactttaatt tttcatatat atttgttttt tttttttttt aatgaaggct 3781 catttacttg aaatgtaaaa actttcactg aatacaaata gaaaaagtga tgtgttttat 3841 atcatattgc tttttgtcca tctttgtggt ttagtttatt tactcacttc atgtttttca 3901 cctataaaat tgtcaagcta gcaaaaaaac tcttgttttt ttaattggga gagaagagac 3961 ctgccagatt atcagacctc ttcatgttaa aagaccatct cctgtaaaac tgacctagtg 4021 gacaagctga atttgaaata gactgtgaag taagctgtaa cttgtcattt taattttgtt 4081 taacacggtt actgacttag atgatgtatt aaataccaag ataaagaaaa atgcacctaa 4141 aatctaatta gaattctctg ggtcaacaag tcaaggtggt attgatctgt gttaatctga 4201 gtaacttatt gcctagccta taaataaatt ccaaaatatc caattcattt cttcttg // LOCUS HSRAFR 2977 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for raf oncogene. ACCESSION X03484 NID g35841 KEYWORDS oncogene; raf oncogene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2977) AUTHORS Bonner,T.I., Oppermann,H., Seeburg,P., Kerby,S.B., Gunnell,M.A., Young,A.C. and Rapp,U.R. TITLE The complete coding sequence of the human raf oncogene and the corresponding structure of the c-raf-1 gene JOURNAL Nucleic Acids Res. 14 (2), 1009-1015 (1986) MEDLINE 86120351 FEATURES Location/Qualifiers source 1..2977 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 130..2076 /note="raf protein (aa 1-648)" /codon_start=1 /db_xref="PID:g35842" /db_xref="SWISS-PROT:P04049" /translation="MEHIQGAWKTISNGFGFKDAVFDGSSCISPTIVQQFGYQRRASD DGKLTDPSKTSNTIRVFLPNKQRTVVNVRNGMSLHDCLMKALKVRGLQPECCAVFRLL HEHKGKKARLDWNTDAASLIGEELQVDFLDHVPLTTHNFARKTFLKLAFCDICQKFLL NGFRCQTCGYKFHEHCSTKVPTMCVDWSNIRQLLLFPNSTIGDSGVPALPSLTMRRMR ESVSRMPVSSQHRYSTPHAFTFNTSSPSSEGSLSQRQRSTSTPNVHMVSTTLPVDSRM IEDAIRSHSESASPSALSSSPNNLSPTGWSQPKTPVPAQRERAPVSGTQEKNKIRPRG QRDSSYYWEIEASEVMLSTRIGSGSFGTVYKGKWHGDVAVKILKVVDPTPEQFQAFRN EVAVLRKTRHVNILLFMGYMTKDNLAIVTQWCEGSSLYKHLHVQETKFQMFQLIDIAR QTAQGMDYLHAKNIIHRDMKSNNIFLHEGLTVKIGDFGLATVKSRWSGSQQVEQPTGS VLWMAPEVIRMQDNNPFSFQSDVYSYGIVLYELMTGELPYSHINNRDQIIFMVGRGYA SPDLSKLYKNCPKAMKRLVADCVKKVKEERPLFPQILSSIELLQHSLPKINRSASEPS LHRAAHTEDINACTLTTSPRLPVF" misc_feature 2959..2964 /note="pot. polyA signal" misc_feature 2964..2969 /note="pot. polyA signal" polyA_site 2977 /note="polyA site" BASE COUNT 767 a 751 c 725 g 734 t ORIGIN 1 ccgaatgtga ccgcctcccg ctccctcacc cgccgcgggg aggaggagcg ggcgagaagc 61 tgccgccgaa cgacaggacg ttggggcggc ctggctccct caggtttaag aattgtttaa 121 gctgcatcaa tggagcacat acagggagct tggaagacga tcagcaatgg ttttggattc 181 aaagatgccg tgtttgatgg ctccagctgc atctctccta caatagttca gcagtttggc 241 tatcagcgcc gggcatcaga tgatggcaaa ctcacagatc cttctaagac aagcaacact 301 atccgtgttt tcttgccgaa caagcaaaga acagtggtca atgtgcgaaa tggaatgagc 361 ttgcatgact gccttatgaa agcactcaag gtgaggggcc tgcaaccaga gtgctgtgca 421 gtgttcagac ttctccacga acacaaaggt aaaaaagcac gcttagattg gaatactgat 481 gctgcgtctt tgattggaga agaacttcaa gtagatttcc tggatcatgt tcccctcaca 541 acacacaact ttgctcggaa gacgttcctg aagcttgcct tctgtgacat ctgtcagaaa 601 ttcctgctca atggatttcg atgtcagact tgtggctaca aatttcatga gcactgtagc 661 accaaagtac ctactatgtg tgtggactgg agtaacatca gacaactctt attgtttcca 721 aattccacta ttggtgatag tggagtccca gcactacctt ctttgactat gcgtcgtatg 781 cgagagtctg tttccaggat gcctgttagt tctcagcaca gatattctac acctcacgcc 841 ttcaccttta acacctccag tccctcatct gaaggttccc tctcccagag gcagaggtcg 901 acatccacac ctaatgtcca catggtcagc accacgctgc ctgtggacag caggatgatt 961 gaggatgcaa ttcgaagtca cagcgaatca gcctcacctt cagccctgtc cagtagcccc 1021 aacaatctga gcccaacagg ctggtcacag ccgaaaaccc ccgtgccagc acaaagagag 1081 cgggcaccag tatctgggac ccaggagaaa aacaaaatta ggcctcgtgg acagagagat 1141 tcaagctatt attgggaaat agaagccagt gaagtgatgc tgtccactcg gattgggtca 1201 ggctcttttg gaactgttta taagggtaaa tggcacggag atgttgcagt aaagatccta 1261 aaggttgtcg acccaacccc agagcaattc caggccttca ggaatgaggt ggctgttctg 1321 cgcaaaacac ggcatgtgaa cattctgctt ttcatggggt acatgacaaa ggacaacctg 1381 gcaattgtga cccagtggtg cgagggcagc agcctctaca aacacctgca tgtccaggag 1441 accaagtttc agatgttcca gctaattgac attgcccggc agacggctca gggaatggac 1501 tatttgcatg caaagaacat catccataga gacatgaaat ccaacaatat atttctccat 1561 gaaggcttaa cagtgaaaat tggagatttt ggtttggcaa cagtaaagtc acgctggagt 1621 ggttctcagc aggttgaaca acctactggc tctgtcctct ggatggcccc agaggtgatc 1681 cgaatgcagg ataacaaccc attcagtttc cagtcggatg tctactccta tggcatcgta 1741 ttgtatgaac tgatgacggg ggagcttcct tattctcaca tcaacaaccg agatcagatc 1801 atcttcatgg tgggccgagg atatgcctcc ccagatctta gtaagctata taagaactgc 1861 cccaaagcaa tgaagaggct ggtagctgac tgtgtgaaga aagtaaagga agagaggcct 1921 ctttttcccc agatcctgtc ttccattgag ctgctccaac actctctacc gaagatcaac 1981 cggagcgctt ccgagccatc cttgcatcgg gcagcccaca ctgaggatat caatgcttgc 2041 acgctgacca cgtccccgag gctgcctgtc ttctagttga ctttgcacct gtcttcaggc 2101 tgccagggga ggaggagaag ccagcaggca ccacttttct gctccctttc tccagaggca 2161 gaacacatgt tttcagagaa gctctgctaa ggaccttcta gactgctcac agggccttaa 2221 cttcatgttg ccttcttttc tatccctttg ggccctggga gaaggaagcc atttgcagtg 2281 ctggtgtgtc ctgctccctc cccacattcc ccatgctcaa ggcccagcct tctgtagatg 2341 cgcaagtgga tgttgatggt agtacaaaaa gcaggggccc agccccagct gttggctaca 2401 tgagtattta gaggaagtaa ggtagcaggc agtccagccc tgatgtggag acacatggga 2461 ttttggaaat cagcttctgg aggaatgcat gtcacaggcg ggactttctt cagagagtgg 2521 tgcagcgcca gacattttgc acataaggca ccaaacagcc caggactgcc gagactctgg 2581 ccgcccgaag gagcctgctt tggtactatg gaacttttct taggggacac gtcctccttt 2641 cacagcttct aaggtgtcca gtgcattggg atggttttcc aggcaaggca ctcggccaat 2701 ccgcatctca gccctctcag gagcagtctt ccatcatgct gaattttgtc ttccaggagc 2761 tgcccctatg gggcgggccg cagggccagc ctgtttctct aacaaacaaa caaacaaaca 2821 gccttgtttc tctagtcaca tcatgtgtat acaaggaagc caggaataca ggttttcttg 2881 atgatttggg ttttaatttt gtttttattg cacctgacaa aatacagtta tctgatggtc 2941 cctcaattat gttattttaa taaaataaat taaattt // LOCUS HSRAI 1921 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for ribonuclease/angiogenin inhibitor (RAI). ACCESSION X13973 NID g35843 KEYWORDS angiogenin inhibitor; glycoprotein; ribonuclease inhibitor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1921) AUTHORS Schneider,R., Schneider-Scherzer,E., Thurnher,M., Auer,B. and Schweiger,M. TITLE The primary structure of human ribonuclease/angiogenin inhibitor (RAI) discloses a novel highly diversified protein superfamily with a common repetitive module JOURNAL EMBO J. 7 (13), 4151-4156 (1988) MEDLINE 89210799 FEATURES Location/Qualifiers source 1..1921 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /clone_lib="lambda gt11" /clone="OBR2.1" CDS 361..1746 /note="ribonuclease/angiogenin inhibitor (AA 1-461)" /codon_start=1 /db_xref="PID:g35844" /db_xref="SWISS-PROT:P13489" /translation="MSLDIQSLDIQCEELSDARWAELLPLLQQCQVVRLDDCGLTEAR CKDISSALRVNPALAELNLRSNELGDVGVHCVLQGLQTPSCKIQKLSLQNCCLTGAGC GVLSSTLRTLPTLQELHLSDNLLGDAGLQLLCEGLLDPQCRLEKLQLEYCSLSAASCE PLASVLRAKPDFKELTVSNNDINEAGVRVLCQGLKDSPCQLEALKLESCGVTSDNCRD LCGIVASKASLRELALGSNKLGDVGMAELCPGLLHPSSRLRTLWIWECGITAKGCGDL CRVLRAKESLKELSLAGNELGDEGARLLCETLLEPGCQLESLWVKSCSFTAACCSHFS SVLAQNRFLLELQISNNRLEDAGVRELCQGLGQPGSVLRVLWLADCDVSDSSCSSLAA TLLANHSLRELDLSNNCLGDAGILQLVESVSEPGCLLEQLVLYDIYWSEEMEDRLQAL EKDKPSLRVIS" misc_feature 1903..1908 /note="pot.polyA signal" polyA_site 1921 /note="polyA site" BASE COUNT 369 a 605 c 591 g 356 t ORIGIN 1 agtgccttag attccagcga gctacgaagc aatcctggcc cagccgagct tgcttcccca 61 aatcccgtaa tccttgacct tattccccca aagaagcggc ctcccgggaa ggagcgccct 121 ggcggagaag actcgaacgg ctcccacagc cgggcgttgg gggtaaaggc atgaagaact 181 cttgactgac agaaacggag ggtgtgtcca aagttttgag gacggccgag cggcgctcca 241 aaacccgtcc tcacagcctc gccccgttcg cctcagctac aacaaatcat cgtcaacctg 301 ttccaccttc tccagtctgg tagcaaaaag gggtgtctca ggccactctt cacctccacc 361 atgagcctgg acatccagag cctggacatc cagtgtgagg agctgagcga cgctagatgg 421 gccgagctcc tccctctgct ccagcagtgc caagtggtca ggctggacga ctgtggcctc 481 acggaagcac ggtgcaagga catcagctct gcacttcgag tcaaccctgc actggcagag 541 ctcaacctgc gcagcaacga gctgggcgat gtcggcgtgc attgcgtgct ccagggcctg 601 cagaccccct cctgcaagat ccagaagctg agcctccaga actgctgcct gacgggggcc 661 ggctgcgggg tcctgtccag cacactacgc accctgccca ccctgcagga gctgcacctc 721 agcgacaacc tcttggggga tgcgggcctg cagctgctct gcgaaggact cctggacccc 781 cagtgccgcc tggaaaagct gcagctggag tattgcagcc tctcggctgc cagctgcgag 841 cccctggcct ccgtgctcag ggccaagccg gacttcaagg agctcacggt tagcaacaac 901 gacatcaatg aggctggcgt ccgtgtgctg tgccagggcc tgaaggactc cccctgccag 961 ctggaggcgc tcaagctgga gagctgcggt gtgacatcag acaactgccg ggacctgtgc 1021 ggcattgtgg cctccaaggc ctcgctgcgg gagctggccc tgggcagcaa caagctgggt 1081 gatgtgggca tggcggagct gtgcccaggg ctgctccacc ccagctccag gctcaggacc 1141 ctgtggatct gggagtgtgg catcactgcc aagggctgcg gggatctgtg ccgtgtcctc 1201 agggccaagg agagcctgaa ggagctcagc ctggccggca acgagctggg ggatgagggt 1261 gcccgactgc tgtgtgagac cctgctggaa cctggctgcc agctggagtc gctgtgggtg 1321 aagtcctgca gcttcacagc cgcctgctgc tcccacttca gctcagtgct ggcccagaac 1381 aggtttctcc tggagctaca gataagcaac aacaggctgg aggatgcggg cgtgcgggag 1441 ctgtgccagg gcctgggcca gcctggctct gtgctgcggg tgctctggtt ggccgactgc 1501 gatgtgagtg acagcagctg cagcagcctc gccgcaaccc tgttggccaa ccacagcctg 1561 cgtgagctgg acctcagcaa caactgcctg ggggacgcgg gcatcctgca gctggtggag 1621 agcgtatccg agccgggctg cctcctggag cagctggtcc tgtacgacat ttactggtct 1681 gaggagatgg aggaccggct gcaggccctg gagaaggaca agccatccct gagggtcatc 1741 tcctgaagct cttcctgctg ctgctctccc tggacgaccg gcctcgaggc aaccctgggg 1801 cccaccagcc cctgccatgc tctcaccctg catatcctag gtttgaagag aaacgctcag 1861 atccgcttat ttctgccagt atattttgga cactttataa tcattaaagc actttcttgg 1921 c // LOCUS HSRALA 621 bp RNA PRI 12-SEP-1993 DEFINITION Human RAL A gene. ACCESSION X15014 NID g35845 KEYWORDS GTP binding protein; membrane protein; RAL A gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 621) AUTHORS Chardin,P. TITLE Direct Submission JOURNAL Submitted (07-APR-1989) Chardin P., Inserm Unit 248, 10 Avenue de Verdun, 75010 Paris, France REFERENCE 2 (bases 1 to 621) AUTHORS Chardin,P. and Tavitian,A. TITLE Coding sequences of human ralA and ralB cDNAs JOURNAL Nucleic Acids Res. 17 (11), 4380 (1989) MEDLINE 89296492 FEATURES Location/Qualifiers source 1..621 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="pheochromocytoma" CDS 1..621 /note="RAL A polypeptide (AA 1-206)" /codon_start=1 /db_xref="PID:g35846" /db_xref="SWISS-PROT:P11233" /translation="MAANKPKGQNSLALHKVIMVGSGGVGKSALTLQFMYDEFVEDYE PTKADSYRKKVVLDGEEVQIDILDTAGQEDYAAIRDNYFRSGEGFLCVFSITEMESFA ATADFREQILRVKEDENVPFLLVGNKSDLEDKRQVSVEEAKNRAEQWNVNYVETSAKT RANVDKVFFDLMREIRARKMEDSKEKNGKKKRKSLAKRIRERCCIL" BASE COUNT 213 a 94 c 167 g 147 t ORIGIN 1 atggctgcaa ataagcccaa gggtcagaat tctttggctt tacacaaagt catcatggtg 61 ggcagtggtg gcgtgggcaa gtcagctctg actctacagt tcatgtacga tgagtttgtg 121 gaggactatg agcctaccaa agcagacagc tatcggaaga aggtagtgct agatggggag 181 gaagtccaga tcgatatctt agatacagct gggcaggagg actacgctgc aattagagac 241 aactacttcc gaagtgggga ggggttcctc tgtgttttct ctattacaga aatggaatcc 301 tttgcagcta cagctgactt cagggagcag attttaagag taaaagaaga tgagaatgtt 361 ccatttctac tggttggtaa caaatcagat ttagaagata aaagacaggt ttctgtagaa 421 gaggcaaaaa acagagctga gcagtggaat gttaactacg tggaaacatc tgctaaaaca 481 cgagctaatg ttgacaaggt attttttgat ttaatgagag aaattcgagc gagaaagatg 541 gaagacagca aagaaaagaa tggaaaaaag aagaggaaaa gtttagccaa gagaatcaga 601 gaaagatgct gcattttata a // LOCUS HSRALB 621 bp RNA PRI 12-SEP-1993 DEFINITION Human RAL B gene. ACCESSION X15015 NID g35847 KEYWORDS RAL B gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 621) AUTHORS Chardin,P. TITLE Direct Submission JOURNAL Submitted (07-APR-1989) Chardin P., Inserm Unit 248, 10 Avenue de Verdun, 75010 Paris, France REFERENCE 2 (bases 1 to 621) AUTHORS Chardin,P. and Tavitian,A. TITLE Coding sequences of human ralA and ralB cDNAs JOURNAL Nucleic Acids Res. 17 (11), 4380 (1989) MEDLINE 89296492 FEATURES Location/Qualifiers source 1..621 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="pheochromocytoma" CDS 1..621 /note="RAL B polypeptide (AA 1-206)" /codon_start=1 /db_xref="PID:g35848" /db_xref="SWISS-PROT:P11234" /translation="MAANKSKGQSSLALHKVIMVGSGGVGKSALTLQFMYDEFVEDYE PTKADSYRKKVVLDGEEVQIDILDTAGQEDYAAIRDNYFRSGEGFLLVFSITEHESFT ATAEFREQILRVKAEEDKIPLLVVGNKSDLEERRQVPVEEARSKAEEWGVQYVETSAK TRANVDKVFFDLMREIRTKKMSENKDKNGKKSSKNKKSFKERCCLL" BASE COUNT 198 a 118 c 180 g 125 t ORIGIN 1 atggctgcca acaagagtaa gggccagagc tccttggccc tccacaaggt gatcatggtt 61 ggcagcggag gcgttggcaa gtcagccctg acgcttcagt tcatgtatga cgagtttgta 121 gaagactatg aacctaccaa agctgacagt tatagaaaga aagtggttct tgatggggaa 181 gaagttcaga tagatattct ggacaccgct gggcaagagg actacgcagc cattcgagat 241 aactactttc ggagtgggga agggtttctt cttgtgttct caatcacaga acatgaatcc 301 tttacagcaa cagccgaatt cagggaacag attctccgtg tgaaggctga agaagataaa 361 attccactgc tcgtcgtggg aaacaagtct gacctagagg agcggaggca ggtgcctgtg 421 gaggaggcca ggagtaaagc cgaagagtgg ggcgtgcagt acgtggagac gtcagcgaag 481 acccgggcca acgtggacaa ggtgttcttt gacctaatga gagaaatcag aacaaagaag 541 atgtcagaaa acaaagacaa gaatggcaag aaaagcagca agaacaagaa aagttttaaa 601 gaaagatgtt gcttactatg a // LOCUS HSRANBP1 837 bp RNA PRI 27-JAN-1996 DEFINITION H.sapiens mRNA for RanBP1. ACCESSION X83617 NID g620082 KEYWORDS binding protein; RanBP1. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 837) AUTHORS Bischoff,F.R., Krebber,H., Smirnova,E., Dong,W. and Ponstingl,H. TITLE Co-activation of RanGTPase and inhibition of GTP dissociation by Ran-GTP binding protein RanBP1 JOURNAL EMBO J. 14 (4), 705-715 (1995) MEDLINE 95188875 REFERENCE 2 (bases 1 to 837) AUTHORS Krebber,H. TITLE Direct Submission JOURNAL Submitted (23-DEC-1994) H. Krebber, German Cancer Research Center (dkfz), Division for Molecular Biology of Mitosis, Im Neuenheimer Feld 280, D-69120 Heidelberg, FRG COMMENT Related sequence: D38076. Additional codon at position 606-608 in X83617. FEATURES Location/Qualifiers source 1..837 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa S3" /clone_lib="lambda gt11" CDS 103..708 /function="product binds to RanGTP" /codon_start=1 /product="RanBP1" /db_xref="PID:g620083" /db_xref="SWISS-PROT:P43487" /translation="MAAAKDTHEDHDTSTENTDESNHDPQFEPIVSLPEQEIKTLEED EEELFKMRAKLFRFASENDLPEWKERGTGDVKLLKHKEKGAIRLLMRRDKTLKICANH YITPMMELKPNAGSDRAWVWNTHADFADECPKPELLAIRFLNAENAQKFKTKFEECRK EIEEREKKAGSGKNDHAEKVAEKLEALSVKEETKEDAEEKQ" BASE COUNT 241 a 213 c 215 g 168 t ORIGIN 1 gccggcgcca gacgcggagg gaaggagcta cgagtagccg ccgagaggcc gcggagccag 61 cgacgaccga cccagccgag ccgccgccgc cgccgcgccc ccatggcggc cgccaaggac 121 actcatgagg accatgatac ttccactgag aatacagacg agtccaacca tgaccctcag 181 tttgagccaa tagtttctct tcctgagcaa gaaattaaaa cactggaaga agatgaagag 241 gaacttttta aaatgcgggc aaaactgttc cgatttgcct ctgagaacga tctcccagaa 301 tggaaggagc gaggcactgg tgacgtcaag ctcctgaagc acaaggagaa aggggccatc 361 cgcctcctca tgcggaggga caagaccctg aagatctgtg ccaaccacta catcacgccg 421 atgatggagc tgaagcccaa cgcaggtagc gaccgtgcct gggtctggaa cacccacgct 481 gacttcgccg acgagtgccc caagccagag ctgctggcca tccgcttcct gaatgctgag 541 aatgcacaga aattcaaaac aaagtttgaa gaatgcagga aagagatcga agagagagaa 601 aagaaagcag gatcaggcaa aaatgatcat gccgaaaaag tggcggaaaa gctagaagct 661 ctctcggtga aggaggagac caaggaggat gctgaggaga agcaataaat cgtcttattt 721 tattttcttt tcctctcttt cctttccttt ttttaaaaaa ttttaccctg cccctctttt 781 tcggtttgtt tttattcttt catttttaca agggacgtta tataaagaac tgaactc // LOCUS HSRANBP5 4826 bp RNA PRI 09-SEP-1997 DEFINITION H.sapiens mRNA for Ran_GTP binding protein 5. ACCESSION Y08890 NID g2253155 KEYWORDS RanBP5; Ran_GTP binding protein 5. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4826) AUTHORS Deane,R., Schafer,W., Zimmermann,H.P., Mueller,L., Gorlich,D., Prehn,S., Ponstingl,H. and Bischoff,F.R. TITLE Ran-binding protein 5 (RanBP5) is related to the nuclear transport factor importin-beta but interacts differently with RanBP1 JOURNAL Mol. Cell. Biol. 17 (9), 5087-5096 (1997) MEDLINE 97415587 REFERENCE 2 (bases 1 to 4822) AUTHORS Deane,R. TITLE Direct Submission JOURNAL Submitted (16-OCT-1996) R. Deane, German Cancer Research Center, Molekulare Biologie der Mitosis, Abt. 0230, Im Neuenheimer Feld 280, 69120 Heidelberg, FRG FEATURES Location/Qualifiers source 1..4826 /organism="Homo sapiens" /db_xref="taxon:9606" /lab_host="AM1" /lab_host="DH5a" /clone_lib="HeLa 5' strech (lDR2, Clontech)" gene 237..4777 /gene="RanBP5" CDS 237..3530 /gene="RanBP5" /note="RanBP5 is a predominantly cytoplasmic protein that can bind to nuclear pore complexes" /codon_start=1 /product="Ran_GTP binding protein 5" /db_xref="PID:e328731" /db_xref="PID:g2253156" /translation="MAAAAAEQQQFYLLLGNLLSPDNVVRKQAEETYENIPGQSKITF LLQAIRNTTAAEEARQMAAVLLRRLLSSAFDEVYPALPSDVQTAIKSELLMIIQMETQ SSMRKKVCDIAAELARNLIDEDGNNQWPEGLKFLFDSVSSQNVGLREAALHIFWNFPG IFGNQQQHYLDVIKRMLVQCMQDQEHPSIRTLSARATAAFILANEHNVALFKHFADLL PGFLQAVNDSCYQNDDSVLKSLVEIADTVPKYLRPHLEATLQLSLKLCGDTSLNNMQR QLALEVIVTLSETAAAMLRKHTNIVAQTIPQMLAMMVDLEEDEDWANADELEDDDFDS NAVAGESALDRMACGLGGKLVLPMIKEHIMQMLQNPDWKYRHAGLMALSAIGEGCHQQ MEGILNEIVNFVLLFLQDPHPRVRYAACNAVGQMATDFAPGFQKKFHEKVIAALLQTM EDQGNQRVQAHAAAALINFTEDCPKSLLIPYLDNLVKHLHSIMVLKLQELIQKGTKLV LEQVVTSIASVADTAEEKFVPYYDLFMPSLKHIVENAVQKELRLLRGKTIECISLIGL AVGKEKFMQDASDVMQLLLKTQTDFNDMEDDDPQISYMISAWARMCKILGKEFQQYLP VVMGPLMKTASIKPEVALLDTQDMENMSDDDGWEFVNLGDQQSFGIKTAGLEEKSTAC QMLVCYAKELKEGFVEYTEQVVKLMVPLLKFYFHDGVRVAAAESMPLLLECARVRGPE YLTQMWHFMCDALIKAIGTEPDSDVLSEIMHSFAKCIEVMGDGCLNNEHFEELGGILK AKLEEHFKNQELRQVKRQDEDYDEQVEESLQDEDDNDVYILTKVSDILHSIFSSYKEK VLPWFEQLLPLIVNLICPHRPWPDRQWGLCIFDDVIEHCSPASFKYAEYFLRPMLQYV CDNSPEVRQAAAYGLGVMAQYGGDNYRPFCTEALPLLVRVIQSADSKTKENVNATENC ISAVGKIMKFKPDCVNVEEVLPHWLSWLPLHEDKEEAVQTFNYLCDLIESNHPIVLGP NNTNLPKIFSIIAEGEMHEAIKHEDPCAKRLANVVRQVQTSGGLWTECIAQLSPEQQA AIQELLNSA" polyA_signal 4772..4777 /gene="RanBP5" BASE COUNT 1471 a 952 c 1055 g 1348 t ORIGIN 1 tctcactata gggctcgagc ggccgcccgg gcaggtctga ccacagtggt tccggggaga 61 agccttccag gacccatgtg taggcacaac tgttttccct gatcaggata cttccggcac 121 tcaacagagg aaagaaattc ctaagggaac actgctcaga aagtactgca gcatgtcttc 181 aaatgcctga ggatcaagtt ggaaaactag aagcaacaga aaacacaata agcgcaatgg 241 cggcggccgc ggcggagcag caacagttct acctgctgct gggaaacctg ctcagccccg 301 acaatgtggt ccggaaacag gcagaggaaa cctatgagaa tatcccaggc cagtcaaaga 361 tcacattcct cttacaagcc atcagaaata caacagctgc tgaagaggct agacaaatgg 421 ccgccgttct cctaagacgt ctcttgtcct ctgcatttga tgaagtctat ccagcacttc 481 cctctgatgt tcagactgcc atcaagagtg agctactcat gattattcag atggaaacac 541 aatctagcat gaggaaaaaa gtttgtgata ttgcggcaga actggccagg aatttaatag 601 atgaggatgg caataaccag tggcccgaag gtttgaagtt cctttttgat tcagtcagct 661 ctcaaaatgt gggactgcgg gaagctgccc ttcacatttt ctggaacttt cctggaattt 721 ttgggaacca gcaacaacac tatttagatg tcatcaaacg aatgttagtt cagtgtatgc 781 aagatcagga acacccgtcg atcaggacgt tatctgctag agctacagct gcatttatac 841 ttgcaaatga gcataatgtt gctctgttca aacattttgc agacttgcta ccgggattcc 901 tacaggcggt aaatgactcg tgctaccaga atgatgattc tgtcctaaaa tccctcgttg 961 agattgcaga tactgttcca aagtatttgc gtcctcactt ggaagcaact ctacagctaa 1021 gtctaaagtt gtgtggagac actagcctca acaatatgca acgccagctt gcccttgaag 1081 tgatcgtcac cctctctgag actgcagctg ctatgttaag aaaacatacc aatattgttg 1141 cacagactat tcctcagatg ttagcaatga tggttgattt ggaagaagat gaggactggg 1201 caaatgcaga tgaactagaa gatgatgatt ttgacagcaa tgcagttgca ggcgagagtg 1261 ctctagatcg aatggcttgc ggacttggtg gaaagctcgt tctgccgatg atcaaggaac 1321 acattatgca aatgcttcaa aatcctgact ggaaataccg gcatgcagga ttgatggcct 1381 tatctgccat tggtgaaggg tgccaccagc aaatggaagg aattctaaat gagatcgtaa 1441 attttgtttt actttttctc caggatcctc atccaagagt aaggtatgca gcctgtaatg 1501 ccgtgggaca gatggctaca gattttgcac ctggtttcca aaagaaattt catgagaagg 1561 tgattgcagc tctgctgcag accatggaag accaaggcaa tcaacgtgtg caggcccatg 1621 cagctgctgc cctcattaac tttactgaag actgtcccaa gtcactactt attccatact 1681 tggataattt ggtgaaacat ctgcattcca ttatggtact gaagcttcaa gagctgattc 1741 agaaaggcac caagttagtt ttggaacaag ttgtgacatc cattgcatca gttgccgata 1801 ctgcagaaga aaaatttgtc ccctactatg atttatttat gccatcactg aagcacatcg 1861 ttgagaatgc ggttcaaaaa gaactgagac ttctgagagg aaaaactatt gaatgcatta 1921 gcctcattgg tctggctgtt gggaaggaaa aattcatgca ggatgcatca gatgtgatgc 1981 agcttttgtt aaagacccag acagacttca atgatatgga agatgatgat cctcagatct 2041 cttacatgat ctcagcatgg gccagaatgt gcaaaatcct tggaaaagaa tttcagcaat 2101 accttccagt ggttatgggg cctttaatga agacggcttc aattaagccc gaagtagccc 2161 ttttagatac ccaagacatg gagaatatga gtgatgatga tggttgggaa tttgtgaacc 2221 ttggagatca gcaaagcttt ggtattaaaa ctgcaggact agaagaaaaa tcaactgctt 2281 gccagatgtt ggtttgctat gctaaggagt taaaggaagg ctttgtggag tacaccgaac 2341 aggttgtcaa actgatggtc cctttactga aattttattt ccacgatggt gttcgagtgg 2401 cagcagcgga atccatgcct cttctcctgg agtgtgcaag agtccgtggt cctgagtatc 2461 tcacacagat gtggcatttt atgtgtgatg ctctaattaa ggccattggt acagaaccag 2521 attcagacgt cctctcagaa ataatgcatt cttttgcaaa gtgcattgaa gtaatgggag 2581 atggatgcct taataatgaa cactttgaag aactgggagg tatattgaaa gcaaagcttg 2641 aagaacattt taaaaatcaa gaattacgac aagttaaaag acaagatgaa gactatgatg 2701 aacaggtcga agagtcacta caagatgagg atgataatga tgtttatatt ctgaccaaag 2761 tgtcagatat tttacactca atattcagta gctacaaaga aaaggtgtta ccatggtttg 2821 aacagctgct tccattaatt gtcaacctca tttgtccaca tagaccatgg ccagacagac 2881 aatggggatt atgcatcttt gatgatgtca tagaacactg tagtccagcc tcatttaaat 2941 acgcagaata tttcttaaga ccaatgctcc aatatgtatg tgacaacagc ccagaagtca 3001 ggcaagcagc tgcatatggc ctgggagtca tggcacagta cggtggagat aattatcgcc 3061 ctttttgtac agaagcactt cccctgctgg taagagttat tcagtctgcg gattctaaga 3121 ccaaagaaaa tgtcaatgct acagagaact gcatctcagc agtagggaaa atcatgaagt 3181 tcaagcctga ctgtgtaaac gttgaagagg tccttccaca ctggttgtct tggcttccac 3241 tacatgaaga taaagaagaa gctgttcaga ctttcaatta tctgtgtgac ctgattgaaa 3301 gtaatcatcc aattgttctt ggcccaaaca ataccaatct gcccaaaata tttagtataa 3361 ttgcggaagg agaaatgcac gaggcaatta aacatgaaga tccttgtgcc aaacgtctgg 3421 ccaatgtcgt tcgccaagta cagacttctg gaggactgtg gactgagtgc atagcacagc 3481 tcagtcctga gcagcaggcc gccattcagg agctcctgaa ctctgcgtga agggccttaa 3541 tgtcacccac cagaaaacta actccaaata aacgcttacc ctttccttta ggtttctttg 3601 ttttgttttt gagcaaaaga gatcggtagt gttgtgtgta ggccattctt ctggagagcc 3661 acaagcagga agagcagcgc tgtgttgcag aatggagttt ccatggattt ctaccagacc 3721 actgaaggag ttcctggaag ccctgcgtac gtagcactga agactatttt tctattggta 3781 taacccgccc acctgaaggg gaaagggaaa tcaaattaat ttttctcgtt agacataagg 3841 aaatttaagg aaaaacagct ttaagaacag ttactcagcg tagatgtgtg ttcacacaaa 3901 ttgccttgca ttcagtgttc attgtgaatt gggagtgtga gtctttctgt agggtacaaa 3961 gaagcctcct acccagcaaa ccagtagacc caaaagttga aaaaaactgg atgacagaca 4021 acaagcatga agatggcata tttgatgtca ctttggttct ttttcccaga aggcttatac 4081 agtgactcag tcgggaagct ttccagcttc agcccttgaa tgtgaagtgt cattggcatg 4141 tctggcagta gtctctcatt cactcccaat aaacaacatt gaatacaaaa gaggcttgtg 4201 taaaaactca gtactgtctg gcttggattc atttcatgtt ttttaatata agaatgatct 4261 aatatttttt taaagtaata gctatcagta atagctgagt gttttttccc ctaatatttt 4321 ccttgtgcaa ttcagactta agcatcgagt ttttaccatc ttccacttta agctaagtta 4381 tgatacctat tccattcaca attggtgttc tttttaaggt ttgcaaattt cagccaattt 4441 tgtagctaag attgttctga tcagctcaaa aagatttggc ttagtgtttt cattgcaaat 4501 tataattgct gtagagccac acacaacttt tgaactttta attataagtg ttatggctaa 4561 agttatttac tgaaaatttc agtaaaatgt gtgaatgttt ctttatgtat taacctcata 4621 gcagtaaatg acttgctgtt gtttaatttt tctaaggcat cttaatagac ttctcttgga 4681 aaaacctttc caaggtgtta acatttttat agtttgtact aaatttaacc gtgatataaa 4741 aatgaatttt atgcatagat cagaatttta aattaaaggt tttttcttta aaaaaaaaaa 4801 aaaaaaaaaa aaaaaaaaaa aaaaaa // LOCUS HSRAP1A 558 bp RNA PRI 12-SEP-1993 DEFINITION Human rap1A mRNA for ras-related protein. ACCESSION X12533 NID g35858 KEYWORDS GTP-binding protein; oncogene; rap gene; ras-related protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 558) AUTHORS Pizon,V. TITLE Direct Submission JOURNAL Submitted (01-AUG-1988) Pizon V., Inserm U-248, Faculte de Medecine Lariboisiere Saint Louis, 10 Avenue de Verdun, 75010 Paris, France REFERENCE 2 (bases 1 to 558) AUTHORS Pizon,V., Chardin,P., Lerosey,I., Olofsson,B. and Tavitian,A. TITLE Human cDNAs rap1 and rap2 homologous to the Drosophila gene Dras3 encode proteins closely related to ras in the 'effector' region JOURNAL Oncogene 3 (2), 201-204 (1988) MEDLINE 88319657 REFERENCE 3 (bases 1 to 558) AUTHORS Pizon,V. TITLE Direct Submission JOURNAL Submitted (02-FEB-1989) to the EMBL/GenBank/DDBJ databases COMMENT The reported sequence is homologous to Drosophila Dras3 gene. Data kindly reviewed (02-FEB-1989) by Pizon V. FEATURES Location/Qualifiers source 1..558 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="RAJI human lymphoma" /clone_lib="lambda gt10." CDS 4..558 /note="rap1A protein (AA 1-184)" /codon_start=1 /db_xref="PID:g35859" /db_xref="SWISS-PROT:P10113" /translation="MREYKLVVLGSGGVGKSALTVQFVQGIFVEKYDPTIEDSYRKQV EVDCQQCMLEILDTAGTEQFTAMRDLYMKNGQGFALVYSITAQSTFNDLQDLREQILR VKDTEDVPMILVGNKCDLEDERVVGKEQGQNLARQWCNCAFLESSAKSKINVNEIFYD LVRQINRKTPVEKKKPKKKSCLLL" BASE COUNT 179 a 91 c 142 g 146 t ORIGIN 1 atcatgcgtg agtacaagct agtggtcctt ggttcaggag gcgttgggaa gtctgctctg 61 acagttcagt ttgttcaggg aatttttgtt gaaaaatatg acccaacgat agaagattcc 121 tacagaaagc aagttgaagt cgattgccaa cagtgtatgc tcgaaatcct ggatactgca 181 gggacagagc aatttacagc aatgagggat ttgtatatga agaacggcca aggttttgca 241 ctagtatatt ctattacagc tcagtccacg tttaacgact tacaggacct gagggaacag 301 attttacggg ttaaggacac ggaagatgtt ccaatgattt tggttggcaa taaatgtgac 361 ctggaagatg agcgagtagt tggcaaagag cagggccaga atttagcaag acagtggtgt 421 aactgtgcct ttttagaatc ttctgcaaag tcaaagatca atgttaatga gatattttat 481 gacctggtca gacagataaa taggaaaaca ccagtggaaa agaagaagcc taaaaagaaa 541 tcatgtctgc tgctctag // LOCUS HSRAP2 555 bp RNA PRI 12-SEP-1993 DEFINITION Human rap2 mRNA for ras-related protein. ACCESSION X12534 NID g35860 KEYWORDS GTP-binding protein; oncogene; rap gene; ras-related protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 555) AUTHORS Pizon,V. TITLE Direct Submission JOURNAL Submitted (01-AUG-1988) Pizon V., Inserm U-248, Faculte de Medecine Lariboisiere Saint Louis, 10 Avenue de Verdun, 75010 Paris, France REFERENCE 2 (bases 1 to 555) AUTHORS Pizon,V., Chardin,P., Lerosey,I., Olofsson,B. and Tavitian,A. TITLE Human cDNAs rap1 and rap2 homologous to the Drosophila gene Dras3 encode proteins closely related to ras in the 'effector' region JOURNAL Oncogene 3 (2), 201-204 (1988) MEDLINE 88319657 COMMENT The reported sequence is homologous to Drosophila Dras3 gene. Data kindly reviewed (16-FEB-1989) by Pizon V. FEATURES Location/Qualifiers source 1..555 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="RAJI human lymphoma" CDS 4..555 /note="rap2 protein (AA 1-183)" /codon_start=1 /db_xref="PID:g35861" /db_xref="SWISS-PROT:P10114" /translation="MREYKVVVLGSGGVGKSALTVQFVTGTFIEKYDPTIEDFYRKEI EVDSSPSVLEILDTAGTEQFASMRDLYIKNGQGFILVYSLVNQQSFQDIKPMRDQIIR VKRYEKVPVILVGNKVDLESEREVSSSEGRALAEEWGCPFMETSAKSKTMVDELFAEI VRQMNYAAQPDKDDPCCSACNIQ" BASE COUNT 141 a 142 c 165 g 107 t ORIGIN 1 acgatgcgcg agtacaaagt ggtggtgctg ggctcgggcg gggtaggcaa atccgccctg 61 accgtgcagt tcgtgaccgg caccttcatc gagaaatacg accccaccat cgaggacttc 121 taccgcaagg agatcgaggt ggattcgtcg ccgtcggtgc tggagatcct ggacacggcg 181 ggcaccgagc agttcgcgtc catgcgggac ctgtacatca agaacggcca gggcttcatc 241 ctcgtctaca gcctcgtcaa ccagcagagc ttccaggaca tcaagcccat gcgggaccag 301 atcatccgcg tgaagcggta tgagaaagtg ccagtcatct tggttgggaa caaagtggac 361 ctggaaagtg agagagaagt atcgtccagc gaaggcagag cccttgctga agagtggggc 421 tgccccttta tggaaacttc cgctaagagt aaaacaatgg tggacgaact ctttgcagaa 481 attgtgaggc agatgaacta tgctgctcag cctgacaaag atgacccatg ctgttctgca 541 tgtaacatac aatag // LOCUS HSRAP30M 750 bp RNA PRI 11-OCT-1991 DEFINITION H.sapiens RAP30 mRNA encoding RAP30. ACCESSION X59745 NID g35866 KEYWORDS RAP30. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 750) AUTHORS Horikoshi,M. TITLE Direct Submission JOURNAL Submitted (29-JUL-1991) M. Horikoshi, The Rockefeller University, 1230 York Avenue, New York, NY 10021, USA REFERENCE 2 (bases 1 to 750) AUTHORS Horikoshi,M., Fujita,H., Wang,J., Takada,R. and Roeder,R.G. TITLE Nucleotide and amino acid sequence of RAP30 JOURNAL Nucleic Acids Res. 19 (19), 5436 (1991) MEDLINE 92020241 COMMENT RAP30 sequence data conflicts with that previously reported; see entry x16901. FEATURES Location/Qualifiers source 1..750 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Namalwa" gene 1..750 /gene="RAP30" CDS 1..750 /gene="RAP30" /note="final product consists of two (RAP30 /RAP74) subunits" /codon_start=1 /product="RAP30" /db_xref="PID:g35867" /db_xref="SWISS-PROT:P13984" /translation="MAERGELDLTGAKQNTGVWLVKVPKYLSQQWAKASGRGEVGKLR IAKTQGRTEVSFTLNEDLANIHDIGGKPASVSAPREHPFVLQSVGGQTLTVFTESSSD KLSLEGIVVQRAECRPAASENYMRLKRLQIEESSKPVRLSQQLDKVVTTNYKPVANHQ YNIEYERKKKEDGKRARADKQHVLDMLFSAFEKHQYYNLKDLVDITKQPVVYLKEILK EIGVQNVKGIHKNTWELKPEYRHYQGEEKSD" BASE COUNT 271 a 135 c 178 g 166 t ORIGIN 1 atggccgagc gcggggaact cgacttgacc ggcgccaaac agaacacagg agtgtggcta 61 gtcaaggttc ctaaatattt gtcacagcaa tgggctaaag cctctggaag aggtgaagtt 121 gggaaactgc ggattgccaa gactcaagga aggactgagg tgtcatttac tttgaatgag 181 gatcttgcaa atattcatga tattggtgga aaaccagctt cagtcagtgc tcctagagaa 241 catccatttg tcttgcaaag tgttggagga cagacattaa cagtatttac tgagagctca 301 tcagataagc tgtcattgga aggaatagtg gtacaaagag ctgaatgccg accagctgcc 361 agtgaaaact acatgcgatt aaaaagattg caaatagaag agtcttccaa accagtgagg 421 ctatcacaac agctggacaa agttgtaaca accaattaca aacctgttgc taatcatcaa 481 tacaatatcg aatatgaaag gaaaaagaaa gaagacggaa agcgagctcg agctgataaa 541 caacatgttt tagacatgct attttcagcc tttgagaaac atcaatacta taatcttaag 601 gacttggtgg acatcacaaa acaacctgtg gtgtacctga aggaaatctt aaaagaaatt 661 ggtgttcaga atgtaaaagg gatccacaaa aacacatggg agctgaagcc agagtacaga 721 cactatcaag gagaagaaaa gagtgactaa // LOCUS HSRAP74 2440 bp RNA PRI 25-JAN-1994 DEFINITION H.sapiens mRNA for RNA polymerase II associated protein RAP74. ACCESSION X64037 S78216 NID g35868 KEYWORDS RAP74 gene; transcription initiation factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2440) AUTHORS Aso,T., Vasavada,H.A., Kawaguchi,T., Germino,F.J., Ganguly,S., Kitajima,S., Weissman,S.M. and Yasukochi,Y. TITLE Characterization of cDNA for the large subunit of the transcription initiation factor TFIIF JOURNAL Nature 335, 461-464 (1992) REFERENCE 2 (bases 1 to 2440) AUTHORS Aso,T. TITLE Direct Submission JOURNAL Submitted (27-MAR-1992) T. Aso, Yale Univ School of Medicine, Boyer Center for Mol Medicine, 295 Congress Avenue, New Haven CT 06536-0812, USA REFERENCE 3 (bases 1 to 2440) AUTHORS Aso,T., Vasavada,H.A., Kawaguchi,T., Germino,F.J., Ganguly,S., Kitajima,S., Weissman,S.M. and Yasukochi,Y. TITLE Characterization of cDNA for the large subunit of the transcription initiation factor TFIIF JOURNAL Nature 355 (6359), 461-464 (1992) MEDLINE 92131135 FEATURES Location/Qualifiers source 1..2440 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="B-lymphocyte" CDS 179..1732 /codon_start=1 /product="RNA polymerase II associated protein RAP74" /db_xref="PID:g35869" /db_xref="SWISS-PROT:P35269" /translation="MAALGPSSQNVTEYVVRVPKNTTKKYNIMAFNAADKVNFATWNQ ARLERDLSNKKIYQEEEMPESGAGSEFNRKLREEARRKKYGIVLKEFRPEDQPWLLRV NGKSGRKFKGIKKGGVTENTSYYIFTQCPDGAFEAFPVHNWYNFTPLARHRTLTAEEA EEEWERRNKVLNHFSIMQQRRLKDQDQDEDEEEKEKRGRRKASELRIHDLEDDLEMSS DASDASGEEGGRVPKAKKKAPLAKGGRKKKKKKGSDDEAFEDSDDGDFEGQEVDYMSD GSSSSQEEPESKAKAPQQEEGPKGVDEQSDSSEESEEEKPPEEDKEEEEEKKAPTPQE KKRRKDSSEESDSSEESDIDSEASSAFFMAKKKTPPKRERKPSGGSSRGNSRPGTPSA EGGSTSSTLRAAASKLEQGKRVSEMPAAKRLRLDTGPQSLSGKSTPQPPSGKTTPNSG DVQVTEDAVRRYLTRKPMTTKDLLKKFQTKKTGLSSEQTVNVLAQILKRLNPERKMIN DKMHFSLKE" BASE COUNT 622 a 686 c 741 g 391 t ORIGIN 1 cttgagcgcc tcttccggtt accttttccc agcgccagag gcgcctaggg ttggggtcct 61 cgctcaggca cagagacccg acaccgagcg gcggcttccc cgggatcgag ggacgcgcac 121 gccagaggag acgaaaggaa cccgggtcgg accagatcgg aaccactgac cattgcccat 181 ggcggcccta ggccctagca gccagaatgt cactgaatac gtcgttcgag ttcctaagaa 241 tacaaccaaa aaatataaca tcatggcttt taatgcagcc gacaaagtca actttgctac 301 gtggaatcag gctcggctgg agcgggactt gagcaacaag aaaatctacc aagaggagga 361 gatgcccgaa tcgggcgcgg gcagtgagtt caaccgcaag cttcgggagg aggctcggag 421 gaagaagtac ggcatcgtcc tcaaggagtt ccggcccgag gaccagccct ggctgctccg 481 ggtcaacggc aaatcaggca ggaagttcaa gggcatcaag aagggaggcg taacagagaa 541 cacgtcctac tacatcttca cccagtgccc cgacggggcc ttcgaggcct tccccgtgca 601 caactggtac aatttcacac cgctggcccg gcatcgcacg ctcactgccg aggaggccga 661 ggaggagtgg gagaggagga acaaggtgct gaaccacttc agcatcatgc agcagcggcg 721 gctcaaggat caggaccagg acgaggatga ggaggagaag gagaaacgtg gccgcaggaa 781 ggcgagcgag ctgcgcatcc acgacctgga ggacgacctg gagatgtcgt ccgatgccag 841 tgatgccagt ggtgaggagg ggggcagagt ccccaaggcc aagaagaagg cgccgctggc 901 caagggcggc aggaaaaaga agaagaagaa gggttcagac gacgaggcct tcgaggacag 961 cgatgatggg gacttcgagg gccaagaggt ggactacatg tcagacggct ccagtagctc 1021 ccaagaagag cctgagagca aggccaaggc gccgcagcag gaggaggggc ccaagggtgt 1081 cgatgagcag agcgacagta gtgaggagag tgaggaggag aagccgcctg aggaggacaa 1141 ggaggaggag gaggagaaga aggcacccac cccgcaggag aagaagcgca ggaaagacag 1201 cagcgaggag tcggacagct cagaggagag cgacattgac agcgaggcct cctcagcctt 1261 cttcatggcg aagaagaaga cgccacccaa gagagagcgg aagccgtcgg gagggagctc 1321 aaggggcaac agccgcccag gcacgcccag cgcagagggt ggcagcacct cctccaccct 1381 gcgggcggct gccagcaaac tcgagcaagg gaagcgggtg agcgagatgc ctgcagccaa 1441 gcggttgcgg ctggacacgg gaccccagag cctgtctggg aagtcgacac cccagccacc 1501 atcaggcaag acaacaccca acagcggcga cgtgcaggtg actgaggatg ccgtgcgccg 1561 ctacctgaca cggaagccca tgaccactaa ggacctgctg aaaaagttcc agaccaagaa 1621 gacagggctg agcagcgagc agacagtgaa cgtgttggcc cagatcctca agcgactcaa 1681 ccccgagcgc aagatgatca acgacaaaat gcacttctcc ctcaaggagt gaggcttggt 1741 ccaatacatg gctctgcccc ccagaactta aggctctcac tgccccttcg ccatcctaga 1801 gtgaggctct gtccaataca tggctctgcc ctccagaact tcaggctctc agtgaccctt 1861 cgacatcctg cttgctccct gactcccagg gccccgtagt tagcaattct ggaaaagtta 1921 agccatctcc tcctctggcc cttccttctg gaatcttcag atgcctgtta ggccttctta 1981 ttgtcctcct cctcctggct cggcctccct cacactgacc aagggcctgt gctgcccact 2041 gggtaacttc tacagttctc ccttccactt ccctaagtct ctcttcagct gtgacttatc 2101 caccacatag cccagtaagt cttcagtttt ggctgggcat gatggtgagc gcctgcagtc 2161 ccagctactt gggaggctaa gcccagttca aggctgcagt gaactatgat ggtgccactg 2221 cattccagcc tgggtgacag aatgaaatcc tggcacaaaa aaaaaaaagt agccaggcat 2281 ggtggcggga gcctgttgtc ccagctgttc cgtaggctga ggcacgagat tcacttgaac 2341 ctgggaggtg gaggttgctg tgagctgaca ccacgccact gcactccagc ctgggtgaca 2401 gtgagactct gtctcaataa ataaaaaata ataataaatt // LOCUS HSRAR 1920 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for retinoic acid receptor. ACCESSION X06538 NID g35873 KEYWORDS DNA binding protein; hormone receptor; receptor; retinoic acid receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 260 to 1920) AUTHORS Petkovich,M., Brand,N.J., Krust,A. and Chambon,P. TITLE A human retinoic acid receptor which belongs to the family of nuclear receptors JOURNAL Nature 330 (6147), 444-450 (1987) MEDLINE 88065872 REFERENCE 2 (bases 1 to 420) AUTHORS Chambon,P. TITLE Direct Submission JOURNAL Submitted (22-DEC-1988) to the EMBL/GenBank/DDBJ databases COMMENT cell line=MCF-7; library=lambda gt10; clone=p63. FEATURES Location/Qualifiers source 1..1920 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 317..1615 /note="retinoic acid receptor (AA 1 - 432)" /codon_start=1 /db_xref="PID:g35874" /db_xref="SWISS-PROT:P10276" /translation="MLGGLSPPGALTTLQHQLPVSGYSTPSPATIETQSSSSEEIVPS PPSPPPLPRIYKPCFVCQDKSSGYHYGVSACEGCKGFFRRSIQKNMVYTCHRDKNCII NKVTRNRCQYCRLQKCFEVGMSKESVRNDRNKKKKEVPKPECSESYTLTPEVGELIEK VRKAHQETFPALCQLGKYTTNNSSEQRVSLDIDLWDKFSELSTKCIIKTVEFAKQLPG FTTLTIADQITLLKAACLDILILRICTRYTPEQDTMTFSDGLTLNRTQMHNAGFGPLT DLVFAFANQLLPLEMDDAETGLLSAICLICGDRQDLEQPDRVDMLQEPLLEALKVYVR KRRPSRPHMFPKMLMKITDLRSISAKGAERVITLKMEIPGSMPPLIQEMLENSEGLDT LSGQPGGGGRDGGGLAPPPGSCSPSLSPSSNRSSPATHSP" misc_feature 488..685 /note="put. DNA-binding domain" old_sequence 763..764 /note="gc was cg in [1]" /citation=[1] misc_feature 824..1483 /note="put. ligand-binding domain" old_sequence 1396 /note="g was c in [1]" /citation=[1] old_sequence 1733 /note="g was c in [1]" /citation=[1] BASE COUNT 386 a 638 c 566 g 330 t ORIGIN 1 aattccctcc tggagccccc accctagctg ggggccaggt gggttggggg atgtctcaag 61 agccggtcct ttggtcaagc agttctgtga gctggcactt ttcctgggaa ggtccagagc 121 tggccttcag ggcaccaaaa aagatgccac tcctagatgg gccccagcta ggtggctgag 181 ggggccatgt cctgtgatgc tcgcagcggg ggaagcaacc cgttcggggt gccagcgagc 241 acagcgtctg ccctaacccg ggggcgggca cctcaatggg tacccggtgc ctccctacgc 301 cttcttcttc ccccctatgc tgggtggact ctccccgcca ggcgctctga ccactctcca 361 gcaccagctt ccagttagtg gatatagcac accatcccca gccaccattg agacccagag 421 cagcagttct gaagagatag tgcccagccc tccctcgcca ccccctctac cccgcatcta 481 caagccttgc tttgtctgtc aggacaagtc ctcaggctac cactatgggg tcagcgcctg 541 tgagggctgc aagggcttct tccgccgcag catccagaag aacatggtgt acacgtgtca 601 ccgggacaag aactgcatca tcaacaaggt gacccggaac cgctgccagt actgccgact 661 gcagaagtgc tttgaagtgg gcatgtccaa ggagtctgtg agaaacgacc gaaacaagaa 721 gaagaaggag gtgcccaagc ccgagtgctc tgagagctac acgctgacgc cggaggtggg 781 ggagctcatt gagaaggtgc gtaaagcgca ccaggaaacc ttccctgccc tctgccagct 841 gggcaaatac actacgaaca acagctcaga acaacgtgtc tctctggaca ttgacctctg 901 ggacaagttc agtgaactct ccaccaagtg catcattaag actgtggagt tcgccaagca 961 gctgcccggc ttcaccaccc tcaccatcgc cgaccagatc accctcctca aggctgcctg 1021 cctggacatc ctgatcctgc ggatctgcac gcggtacacg cccgagcagg acaccatgac 1081 cttctcggac gggctgaccc tgaaccggac ccagatgcac aacgctggct tcggccccct 1141 caccgacctg gtctttgcct tcgccaacca gctgctgccc ctggagatgg atgatgcgga 1201 gacggggctg ctcagcgcca tctgcctcat ctgcggagac cgccaggacc tggagcagcc 1261 ggaccgggtg gacatgctgc aggagccgct gctggaggcg ctaaaggtct acgtgcggaa 1321 gcggaggccc agccgccccc acatgttccc caagatgcta atgaagatta ctgacctgcg 1381 aagcatcagc gccaaggggg ctgagcgggt gatcacgctg aagatggaga tcccgggctc 1441 catgccgcct ctcatccagg aaatgttgga gaactcagag ggcctggaca ctctgagcgg 1501 acagccgggg ggtggggggc gggacggggg tggcctggcc cccccgccag gcagctgtag 1561 ccccagcctc agccccagct ccaacagaag cagcccggcc acccactccc cgtgaccgcc 1621 cacgccacat ggacacagcc ctcgccctcc gccccggctt ttctctgcct ttctaccgac 1681 catgtgaccc cgcaccagcc ctgcccccac ctgccctgcc cgggagtact ggggaccttc 1741 cctgggggac ggggagggag gaggcagcga ctccttggac agaggcctgg gccctcagtg 1801 gactgcctgc tcccacagcc tgggctgacg tcagaggccg aggccaggaa ctgagtgagg 1861 cccctggtcc tgggtctcag gatgggtcct gggggcctcg tgttcatcaa gacggaattc // LOCUS HSRARLP 1866 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for retinoic acid receptor-like protein. ACCESSION X52773 NID g35884 KEYWORDS receptor; retinoic acid-like receptor; retinoic acid-responsive protein; retinoid X receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1866) AUTHORS Mangelsdorf,D.J., Ong,E.S., Dyck,J.A. and Evans,R.M. TITLE Nuclear receptor that identifies a novel retinoic acid response pathway JOURNAL Nature 345 (6272), 224-229 (1990) MEDLINE 90238542 COMMENT Data kindly reviewed (13-DEC-1990) by Mangelsdorf D.J. Data kindly reviewed (10-DEC-1990) by Goddard J.P. FEATURES Location/Qualifiers source 1..1866 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" /clone="lambda XR3-1." CDS 76..1464 /note="receptor protein (AA 1 - 462)" /codon_start=1 /db_xref="PID:g35885" /db_xref="SWISS-PROT:P19793" /translation="MDTKHFLPLDFSTQVNSSLTSPTGRGSMAAPSLHPSLGPGIGSP GQLHSPISTLSSPINGMGPPFSVISSPMGPHSMSVPTTPTLGFSTGSPQLSSPMNPVS SSEDIKPPLGLNGVLKVPAHPSGNMASFTKHICAICGDRSSGKHYGVYSCEGCKGFFK RTVRKDLTYTCRDNKDCLIDKRQRNRCQYCRYQKCLAMGMKREAVQEERQRGKDRNEN EVESTSSANEDMPVERILEAELAVEPKTETYVEANMGLNPSSPNDPVTNICQAADKQL FTLVEWAKRIPHFSELPLDDQVILLRAGWNELLIASFSHRSIAVKDGILLATGLHVHR NSAHSAGVGAIFDRVLTELVSKMRDMQMDKTELGCLRAIVLFNPDSKGLSNPAEVEAL REKVYASLEAYCKHKYPEQPGRFAKLLLRLPALRSIGLKCLEHLFFFKLIGDTPIDTF LMEMLEAPHQMT" misc_feature 478..675 /note="cysteine-rich DNA-binding domain" BASE COUNT 339 a 646 c 537 g 344 t ORIGIN 1 gaattccggc gccgggggcc gcccgcccgc cgcccgctgc ctgcgccgcc ggccgggcat 61 gagttagtcg cagacatgga caccaaacat ttcctgccgc tcgatttctc cacccaggtg 121 aactcctccc tcacctcccc gacggggcga ggctccatgg ctgccccctc gctgcacccg 181 tccctggggc ctggcatcgg ctccccggga cagctgcatt ctcccatcag caccctgagc 241 tcccccatca acggcatggg cccgcctttc tcggtcatca gctcccccat gggcccccac 301 tccatgtcgg tgcccaccac acccaccctg ggcttcagca ctggcagccc ccagctcagc 361 tcacctatga accccgtcag cagcagcgag gacatcaagc cccccctggg cctcaatggc 421 gtcctcaagg tccccgccca cccctcagga aacatggctt ccttcaccaa gcacatctgc 481 gccatctgcg gggaccgctc ctcaggcaag cactatggag tgtacagctg cgaggggtgc 541 aagggcttct tcaagcggac ggtgcgcaag gacctgacct acacctgccg cgacaacaag 601 gactgcctga ttgacaagcg gcagcggaac cggtgccagt actgccgcta ccagaagtgc 661 ctggccatgg gcatgaagcg ggaagccgtg caggaggagc ggcagcgtgg caaggaccgg 721 aacgagaatg aggtggagtc gaccagcagc gccaacgagg acatgccggt ggagaggatc 781 ctggaggctg agctggccgt ggagcccaag accgagacct acgtggaggc aaacatgggg 841 ctgaacccca gctcgccgaa cgaccctgtc accaacattt gccaagcagc cgacaaacag 901 cttttcaccc tggtggagtg ggccaagcgg atcccacact tctcagagct gcccctggac 961 gaccaggtca tcctgctgcg ggcaggctgg aatgagctgc tcatcgcctc cttctcccac 1021 cgctccatcg ccgtgaagga cgggatcctc ctggccaccg ggctgcacgt ccaccggaac 1081 agcgcccaca gcgcaggggt gggcgccatc tttgacaggg tgctgacgga gcttgtgtcc 1141 aagatgcggg acatgcagat ggacaagacg gagctgggct gcctgcgcgc catcgtcctc 1201 tttaaccctg actccaaggg gctctcgaac ccggccgagg tggaggcgct gagggagaag 1261 gtctatgcgt ccttggaggc ctactgcaag cacaagtacc cagagcagcc gggaaggttc 1321 gctaagctct tgctccgcct gccggctctg cgctccatcg ggctcaaatg cctggaacat 1381 ctcttcttct tcaagctcat cggggacaca cccattgaca ccttccttat ggagatgctg 1441 gaggcgccgc accaaatgac ttaggcctgc gggcccatcc tttgtgccca cccgttctgg 1501 ccaccctgcc tggacgccag ctgttcttct cagcctgagc cctgtccctg cccttctctg 1561 cctggcctgt ttggactttg gggcacagcc tgtcactgct ctgcctaaga gatgtgttgt 1621 caccctcctt atttctgtta ctacttgtct gtggcccagg gcagtggctt tcctgagcag 1681 cagccttcgt ggcaagaact agcgtgagcc cagccaggcg cctccccacc gggctctcag 1741 gacgccctgc cacacccacg gggcttgggc gactacaggg tcttcggccc cagccctgga 1801 gctgcaggag ttgggaacgg ggcttttgtt tccgttgctg tttatcgatg ctggttttca 1861 gaattc // LOCUS HSRAY1 730 bp RNA PRI 06-APR-1995 DEFINITION H.sapiens ray mRNA. ACCESSION X79781 NID g763121 KEYWORDS ray gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 730) AUTHORS Zhu,A.X., Zhao,Y. and Flier,J.S. TITLE Molecular cloning of two small GTP-binding proteins from human skeletal muscle JOURNAL Biochem. Biophys. Res. Commun. 205 (3), 1875-1882 (1994) MEDLINE 95110337 REFERENCE 2 (bases 1 to 730) AUTHORS Zhu,A.X. TITLE Direct Submission JOURNAL Submitted (21-JUN-1994) A.X. Zhu, Beth Israel Hospital, Harvard Medical School, Div. of Endocrinology and Metabolism, 330 Brookline Ave, Boston, MA 02215, USA FEATURES Location/Qualifiers source 1..730 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="muscle" /dev_stage="fetal" /cell_line="human fetal muscle" gene 82..687 /gene="ray" CDS 82..687 /gene="ray" /codon_start=1 /db_xref="PID:g763122" /translation="MARDYDHLFKLLIIGDSGVGKSSLLLRFADNTFSGSYITTIGVD FKIRTVEINGEKVKLQIWDTAGQERFRTITSTYYRGTHGVIVVYDVTSAESFVNVKRW LHEINQNCDDVCRILVGNKNDDPERKVVETEDAYKFAGQMGIQLFETSAKENVNVEEM FNCITELVLRAKKDNLAKQQQQQQNDVVKLTKNSKRKKRCC" BASE COUNT 187 a 206 c 213 g 124 t ORIGIN 1 gctgccggag cagcccgaag agctgcggat cgcgaggcca gtaccgaccc cgcccgcccg 61 cgcgctccgc ccccgcccgc catggcccgg gactacgacc acctcttcaa gctgctcatc 121 atcggcgaca gcggtgtggg caagagcagt ttactgttgc gttttgcaga caacactttc 181 tcaggcagct acatcaccac gatcggagtg gatttcaaga tccggaccgt ggagatcaac 241 ggggagaagg tgaagctgca gatctgggac acagcggggc aggagcgctt ccgcaccatc 301 acctccacgt attatcgggg gacccacggg gtcattgtgg tttacgacgt caccagtgcc 361 gagtcctttg tcaacgtcaa gcggtggctt cacgaaatca accagaactg tgatgatgtg 421 tgccgaatat tagtgggtaa taagaatgac gaccctgagc ggaaggtggt ggagacggaa 481 gatgcctaca aattcgccgg gcagatgggc atccagttgt tcgagaccag cgccaaggag 541 aatgtcaacg tggaagagat gttcaactgc atcacggagc tggtcctccg agcaaagaaa 601 gacaacctgg caaaacagca gcagcaacaa cagaacgatg tggtgaagct cacgaagaac 661 agtaaacgaa agaaacgctg ctgctaatgg cacccagtcc actgcagaga ctgcactgcg 721 gtccctcccc // LOCUS HSRB18A 5810 bp mRNA PRI 23-JAN-1998 DEFINITION Homo sapiens mRNA for RB18A protein. ACCESSION Y13467 NID g2765321 KEYWORDS p53 regulatory protein; RB18A protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5810) AUTHORS Drane,P., Barel,M., Balbo,M. and Frade,R. TITLE Identification of RB18A, a 205 kDa new p53 regulatory protein which shares antigenic and functional properties with p53 JOURNAL Oncogene 15 (25), 3013-3024 (1997) MEDLINE 98105695 REFERENCE 2 (bases 1 to 5810) AUTHORS Frade,R. TITLE Direct Submission JOURNAL Submitted (29-MAY-1997) R. Frade, Inserm U.354, Centre Inserm, Hopital Saint-Antoine, 184, rue du Faubourg Saint-Antoine, 75012 - Paris, FRANCE FEATURES Location/Qualifiers source 1..5810 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Raji" /cell_type="B lymphoma" /tissue_type="heart" CDS 236..4936 /function="p53 regulatory protein" /codon_start=1 /product="RB18A protein" /db_xref="PID:e1227605" /db_xref="PID:g2765322" /translation="MSSLLERLHAKFNQNRPWSETIKLVRQVMEKRVVMSSGGHQHLV SCLETLQKALKVTSLPAMTDRLESIAGQNGLGSHLSASGTECYITSDMFYVEVQLDPA GQLCDVKVAHHGENPVSCPELVQQLREKNSDEFSKHLKGLVNLYNLPGDNKLKTKMYL ALQSLEQDLSKMAIMYWKATNAGPLDKILHGSVGYLTPRSGGHLMNLKYYVSPSDLLD DKTASPIILHENNVSRSLGMNASVTIEGTSAVYKLPIAPLIMGSHPVDNKWTPSFSSI TSANSVDLPACFFLKFPQPIPVSRAFVQKLQNCTGIPLFETQPTYAPLYELITQFELS KDPDPIPLNHNMRFYAALPGQQHCYFLNKDAPLPDGRSLQGTLVSKITFQHPGRVPLI LNLIRHQVAYNTLIGSCVKRTILKEDSPGLLQFEVCPLSESRFSVSFQHPVNDSLVCV VMDVQGLTHVSCKLYKGLSDALICTDDFIAKVVQRCMSIPVTMRAIRRKAETIQADTP ALSLIAETVEDMVKKNLPPASSPGYGMTTGNNPMSGTTTSTNTFPGGPIATLFNMSMS IKDRHESVGHGEDFSKVSQNPILTSLLQITGNGGSTIGSSPTPPHHTPPPVSSMAGNT KNHPMLMNLLKDNPAQDFSTLYGSSPLERQNSSSGSPRMEICSGSNKTKKKKSSRLPP EKPKHQTEDDFQRELFSMDVDSQNPIFDVNMTADTLDTPHITPAPSQCSTPPTTYPQP VPHPQPSIQRMVRLSSSDSIGPDVTDILSDIAEEASKLPSTSDDCPAIGTPLRDSSSS GHSQSTLFDSDVFQTNNNENPYTDPADLIADAAGSPSSDSPTNHFFHDGVDFNPDLLN SQSQSGFGEEYFDESSQSGDNDDFKGFASQALNTLGVPMLGGDNGETKFKGNNQADTV DFSIISVAGKALAPADLMEHHSGSQGPLLTTGDLGKEKTQKRVKEGNGTSNSTLSGPG LDSKPGKRSRTPSNDGKSKDKPPKRKKADTEGKSPSHSSSNRPFTPPTSTGGSKSPGS AGRSQTPPGVATPPIPKITIQIPKGTVMVGKPSSHSQYTSSGSVSSSGSKSHHSHSSS SSSSASTSGKMKSSKSEGSSSSKLSSSMYSSQGSSGSSQSKNSSQSGGKPGSSPITKH GLSSGSSSTKMKPQGKPSSLMNPSLSKPNISPSHSRPPGGSDKLASPMKPVPGTPPSS KAKSPISSGSGGSHMSGTSSSSGMKSSSGLGSSGSLSQKTPPSSNSCTASSSSFSSSG SSMSSSQNQHGSSKGKSPSRNKKPSLTAVIDKLKHGVVTSGPGGEDPLDGQMGVSTNS SSHPMSSKHNMSGGEFQGKREKSDKDKSKVSTSGSSVDSSKKTSESKNVGSTGVAKII ISKHDGGSPSIKAKVTLQKPGESSGEGLRPQMASSKNYGSPLISGSTPKHERGSPSHS KSPAYTPQNLDSESESGSSIAEKSYQNSPSSDDGIRPLPEYSTEKHKKHKKEKKKVKD KDRDRDRDKDRDKKKSHSIKPESWSKSPISSDQSLSMTSNTILSADRPSRLSPDFMIG EEDDDLMDVALIGN" protein_bind 1234..1406 /bound_moiety="p53" BASE COUNT 1661 a 1358 c 1326 g 1465 t ORIGIN 1 gggaagatgg cggcggcctc gagcaccctc ctcttcttgc cgccggggac ttcagattga 61 tccttcccgg gaagagtagg gactgctggt gccctgcgtc ccgggatccc gagccaactt 121 gtttcctccg ttagtggtgg ggaagggctt atccttttgt ggcggatcta gcttctcctc 181 gccttcagga tgaaagctca ggggggaaac cgaggagtca gaaaagctga gtaagatgag 241 ttctctcctg gaacggctcc atgcaaaatt taaccaaaat agaccctgga gtgaaaccat 301 taagcttgtg cgtcaagtca tggagaagag ggttgtgatg agttctggag ggcatcaaca 361 tttggtcagc tgtttggaga cattgcagaa ggctctcaaa gtaacatctt taccagcaat 421 gactgatcgt ttggagtcca tagcaggaca gaatggactg ggctctcatc tcagtgccag 481 tggcactgaa tgttacatca cgtcagatat gttctatgtg gaagtgcagt tagatcctgc 541 aggacagctt tgtgatgtaa aagtggctca ccatggggag aatcctgtga gctgtccgga 601 gcttgtacag cagctaaggg aaaaaaattc tgatgaattt tctaagcacc ttaagggcct 661 tgttaatctg tataaccttc caggggacaa caaactgaag actaaaatgt acttggctct 721 ccaatcctta gaacaagatc tttctaaaat ggcaattatg tactggaaag caactaatgc 781 tggtcccttg gataagattc ttcatggaag tgttggctat ctcacaccaa ggagtggggg 841 tcatttaatg aacctgaagt actatgtctc tccttctgac ctactggatg acaagactgc 901 atctcccatc attttgcatg agaataatgt ttctcgatct ttgggcatga atgcatcagt 961 gacaattgaa ggaacatctg ctgtgtacaa actcccaatt gcaccattaa ttatggggtc 1021 acatccagtt gacaataaat ggaccccttc cttctcctca atcaccagtg ccaacagtgt 1081 tgatcttcct gcctgtttct tcttgaaatt tccccagcca atcccagtat ctagagcatt 1141 tgttcagaaa ctgcagaact gcacaggaat tccattgttt gaaactcaac caacttatgc 1201 acccctgtat gaactgatca ctcagtttga gctatcaaag gaccctgacc ccataccttt 1261 gaatcacaac atgagatttt atgctgctct tcctggtcag cagcactgct atttcctcaa 1321 caaggatgct cctcttccag atggccgaag tctacaggga acccttgtta gcaaaatcac 1381 ctttcagcac cctggccgag ttcctcttat cctaaatctg atcagacacc aagtggccta 1441 taacaccctc attggaagct gtgtcaaaag aactattctg aaagaagatt ctcctgggct 1501 tctccaattt gaagtgtgtc ctctctcaga gtctcgtttc agcgtatctt ttcagcaccc 1561 tgtgaatgac tccctggtgt gtgtggtaat ggatgtgcag ggcttaacac atgtgagctg 1621 taaactctac aaagggctgt cggatgcact gatctgcaca gatgacttca ttgccaaagt 1681 tgttcaaaga tgtatgtcca tccctgtgac gatgagggct attcggagga aagctgaaac 1741 cattcaagcc gacaccccag cactgtccct cattgcagag acagttgaag acatggtgaa 1801 aaagaacctg cccccggcta gcagcccagg gtatggcatg accacaggca acaacccaat 1861 gagtggtacc actacatcaa ccaacacctt tccggggggt cccattgcca ccttgtttaa 1921 tatgagcatg agcatcaaag atcggcatga gtcggtgggc catggggagg acttcagcaa 1981 ggtgtctcag aacccaattc ttaccagttt gttgcaaatc acagggaacg gggggtctac 2041 cattggctcg agtccgaccc ctcctcatca cacgccgcca cctgtctctt cgatggccgg 2101 caacaccaag aaccacccga tgctcatgaa ccttctcaaa gataatcctg cccaggattt 2161 ctcaaccctt tatggaagca gccctttaga aaggcagaac tcctcttccg gctcaccccg 2221 catggaaata tgctcgggga gcaacaagac caagaaaaag aagtcatcaa gattaccacc 2281 tgagaaacca aagcaccaga ctgaagatga ctttcagagg gagctatttt caatggatgt 2341 tgactcacag aaccctatct ttgatgtcaa catgacagct gacacgctgg atacgccaca 2401 catcactcca gctccaagcc agtgtagcac tcccccaaca acttacccac aaccagtacc 2461 tcacccccaa cccagtattc aaaggatggt ccgactatcc agttcagaca gcattggccc 2521 agatgtaact gacatccttt cagacattgc agaagaagct tctaaacttc ccagcactag 2581 tgatgattgc ccagccattg gcacccctct tcgagattct tcaagctctg ggcattctca 2641 gagtaccctg tttgactctg atgtctttca aactaacaat aatgaaaatc catacactga 2701 tccagctgat cttattgcag atgctgctgg aagccccagt agtgactctc ctaccaatca 2761 tttttttcat gatggagtag atttcaatcc tgatttattg aacagccaga gccaaagtgg 2821 ttttggagaa gaatattttg atgaaagcag ccaaagtggg gataatgatg atttcaaagg 2881 atttgcatct caggcactaa atactttggg ggtgccaatg cttggaggtg ataatgggga 2941 gaccaagttt aagggcaata accaagccga cacagttgat ttcagtatta tttcagtagc 3001 cggcaaagct ttagctcctg cagatcttat ggagcatcac agtggtagtc agggtccttt 3061 actgaccact ggggacttag ggaaagaaaa gactcaaaag agggtaaagg aaggcaatgg 3121 caccagtaat agtactctct cggggcccgg attagacagc aaaccaggga agcgcagtcg 3181 gaccccttct aatgatggga aaagcaaaga taagcctcca aagcggaaga aggcagacac 3241 tgagggaaag tctccatctc atagttcttc taacagacct tttaccccac ctaccagtac 3301 aggtggatct aaatcgccag gcagtgcagg aagatctcag actcccccag gtgttgccac 3361 accacccatt cccaaaatca ctattcagat tcctaaggga acagtgatgg tgggcaagcc 3421 ttcctctcac agtcagtata ccagcagtgg ttctgtgtct tcctcaggca gcaaaagcca 3481 ccatagccat tcttcctcct cttcctcatc tgcttccacc tcagggaaga tgaaaagcag 3541 taaatcagaa ggttcatcaa gttccaagtt aagtagcagt atgtattcta gccaggggtc 3601 ttctggatct agccagtcca aaaattcatc ccagtctggg gggaagccag gctcctctcc 3661 cataaccaag catggactga gcagtggctc tagcagcacc aagatgaaac ctcaaggaaa 3721 gccatcatca cttatgaatc cttctttaag taaaccaaac atatcccctt ctcattcaag 3781 gccacctgga ggctctgaca agcttgcctc tccaatgaag cctgttcctg gaactcctcc 3841 atcctctaaa gccaagtccc ctatcagttc aggttctggt ggttctcata tgtctggaac 3901 tagttcaagc tctggcatga agtcatcttc agggttagga tcctcaggct cgttgtccca 3961 gaaaactccc ccatcatcta attcctgtac ggcatcttcc tcctcctttt cctcaagtgg 4021 ctcttccatg tcatcctctc agaaccagca tgggagttct aaaggaaaat ctcccagcag 4081 aaacaagaag ccgtccttga cagctgtcat agataaactg aagcatgggg ttgtcaccag 4141 tggccctggg ggtgaagacc cactggacgg ccagatgggg gtgagcacaa attcttccag 4201 ccatcctatg tcctccaaac ataacatgtc aggaggagag tttcagggca agcgtgagaa 4261 aagtgataaa gacaaatcaa aggtttccac ctccgggagt tcagtggatt cttctaagaa 4321 gacctcagag tcaaaaaatg tggggagcac aggtgtggca aaaattatca tcagtaagca 4381 tgatggaggc tcccctagca ttaaagccaa agtgactttg cagaaacctg gggaaagtag 4441 tggagaaggg cttaggcctc aaatggcttc ttctaaaaac tatggctctc cactcatcag 4501 tggttccact ccaaagcatg agcgtggctc tcccagccat agtaagtcac cagcatatac 4561 cccccagaat ctggacagtg aaagtgagtc aggctcctcc atagcagaga aatcttatca 4621 gaatagtccc agctcagacg atggtatccg accacttcca gaatacagca cagagaaaca 4681 taagaagcac aaaaaggaaa agaagaaagt aaaagacaaa gatagggacc gagaccggga 4741 caaagaccga gacaagaaaa aatctcatag catcaagcca gagagttggt ccaaatcacc 4801 catctcttca gaccagtcct tgtctatgac aagtaacaca atcttatctg cagacagacc 4861 ctcaaggctc agcccagact ttatgattgg ggaggaagat gatgatctta tggatgtggc 4921 cctgattggg aattaggaac cttatttcct aaaagaaaca gggccagagg aaaaaaaact 4981 attgataagt ttataggcaa accaccataa ggggtgagtc agacaggtct gatttggtta 5041 agaatcctaa atggcatggc tttgacatca agctgggtga attagaaagg catatccaga 5101 ccctattaaa gaaaccacag ggtttgattc tggttaccag gaagtcttct ttgttcctgt 5161 gccagaaaga aagttaaaat acttgcttaa gaaagggagg ggggtgggag gggtgtaggg 5221 agagggaagg gagggaaaca gttttgtggg aaatattcat atatattttc ttctcccttt 5281 ttccattttt aggccatgtt ttaaactcat tttagtgcat gtatatgaag ggctgggcag 5341 aaaatgaaaa agcaatacat tccttgatgc atttgcatga aggttgttca actttgtttg 5401 aggtagttgt ccgtttgagt catgggcaaa tgaaggactt tggtcatttt ggacacttaa 5461 gtaatgtttg gtgtctgttt cttaggagtg actgggggag ggaagattat tttagctatt 5521 tatttgtaat attttaaccc tttatctgtt tgtttttata cagtgtttcg ttctaaatct 5581 atgaggttta gggttcaaaa tgatggaagg ccgaagagca aggcttatat ggtggtaggg 5641 agcttatagc ttgtgctaat actgtagcat caagcccaag caaattagtc agagcccgcc 5701 tttagagtta aatataatag aaaaaccaaa atgatatttt tattttagga gggtttaaat 5761 agggttcaga gatcatagga atattaggag ttacctctct gtggaggtat // LOCUS HSRBP1 882 bp RNA PRI 03-APR-1995 DEFINITION Human mRNA for retinol binding protein (RBP). ACCESSION X00129 NID g35896 KEYWORDS retinol binding. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 882) AUTHORS Colantuoni,V., Romano,V., Bensi,G., Santoro,C., Costanzo,F., Raugei,G. and Cortese,R. TITLE Cloning and sequencing of a full length cDNA coding for human retinol-binding protein JOURNAL Nucleic Acids Res. 11 (22), 7769-7776 (1983) MEDLINE 84069802 FEATURES Location/Qualifiers source 1..882 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 52..99 /note="pot. leader peptide" CDS 52..651 /note="precursor RBP" /codon_start=1 /db_xref="PID:g35897" /db_xref="SWISS-PROT:P02753" /translation="MKWVWALLLLAAWAAAERDCRVSSFRVKENFDKARFSGTWYAMA KKDPEGLFLQDNIVAEFSVDETGQMSATAKGRVRLLNNWDVCADMVGTFTDTEDPAKF KMKYWGVASFLQKGNDDHWIVDTDYDTYAVQYSCRLLNLDGTCADSYSFVFSRDPNGL PPEAQKIVRQRQEELCLARQYRLIVHNGYCDGRSERNLL" mat_peptide 100..648 /note="retinol binding protein" misc_feature 862..867 /note="polyadenylation signal" polyA_site 882 /note="polyadenylation site" BASE COUNT 195 a 244 c 241 g 202 t ORIGIN 1 cggccaggct tgcgcgtggt tcccctcccg gtgggcggat tcctgggcaa gatgaagtgg 61 gtgtgggcgc tcttgctgtt ggcggcgtgg gcagcggccg agcgcgactg ccgagtgagc 121 agcttccgag tcaaggagaa cttcgacaag gctcgcttct ctgggacctg gtacgccatg 181 gccaagaagg accccgaggg cctctttctg caggacaaca tcgtcgcgga gttctcggtg 241 gacgagaccg gccagatgag cgccacagcc aagggccgag tccgtctttt gaataactgg 301 gacgtgtgcg cagacatggt gggcaccttc acagacaccg aggaccctgc caagttcaag 361 atgaagtact ggggcgtagc ctcctttctg cagaaaggaa atgatgacca ctggatcgtc 421 gacacagact acgacacgta tgccgtacag tactcctgcc gcctcctgaa cctcgatggc 481 acctgtgctg acagctactc cttcgtgttt tcccgggacc ccaacggcct gcccccagaa 541 gcgcagaaga ttgtaaggca gcggcaggag gagctgtgcc tggccaggca gtacaggctg 601 atcgtccaca acggttactg cgatggcaga tcagaaagaa accttttgta gcaatatcaa 661 gaatctagtt tcatctgaga acttctgatt agctctcagt cttcagctct atttatctta 721 ggagtttaat ttgcccttct ctccccatct tccctcagtt cccataaaac cttcattaca 781 cataaagata cacgtggggg tcagtgaatc tgcttgcctt tcctgaaagt ttctggggct 841 taagattcca gactctgatt cattaaacta tagtcacccg tg // LOCUS HSRBPL8 852 bp RNA PRI 14-MAR-1994 DEFINITION H.sapiens mRNA for ribosomal protein L8. ACCESSION Z28407 NID g433898 KEYWORDS ribosomal protein L8. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 852) AUTHORS Hanes,J., Klaudiny,J., von der Kammer,H. and Scheit,K.H. TITLE Characterization by cDNA cloning of the mRNA of human ribosomal protein L8 JOURNAL Biochem. Biophys. Res. Commun. 197 (3), 1223-1228 (1993) MEDLINE 94107320 REFERENCE 2 (bases 1 to 852) AUTHORS von der Kammer,H. TITLE Direct Submission JOURNAL Submitted (03-DEC-1993) Heinz von der Kammer, Abteilung fuer molekulare Biologie, Max-Planck-Institut fuer biophysikalische Chemie, Am Fassberg 11, Goettingen, D-37077, Germany FEATURES Location/Qualifiers source 1..852 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="ovary" CDS 44..817 /note="putative" /codon_start=1 /product="ribosomal protein L8" /db_xref="PID:g433899" /db_xref="SWISS-PROT:P25120" /translation="MGRVIRGQRKGAGSVFRAHVKHRKGAARLRAVDFAERHGYIKGI VKDIIHDPGRGAPLAKVVFRDPYRFKKRTELFIAAEGIHTGQFVYCGKKAQLNIGNVL PVGTMPEGTIVCCLEEKPGDRGKLARASGNYATVISHNPETKKTRVKLPSGSKKVISS ANRAVVGVVAGGGRIDKPILKAGRAYHKYKAKRNCWPRVRGVAMNPVEHPFGGGNHQH IGKPSTIRRDAPAGRKVGLIAARRTGRLRGTKTVQEKEN" polyA_signal 831..836 polyA_site 852 BASE COUNT 177 a 246 c 265 g 164 t ORIGIN 1 tctctctctc tctctctctc tctggtgaac aggacccgtc gccatgggcc gtgtgatccg 61 tggacagagg aagggcgccg ggtctgtgtt ccgcgcgcac gtgaagcacc gtaaaggcgc 121 tgcgcgcctg cgcgccgtgg atttcgctga gcggcacggc tacatcaagg gcatcgtcaa 181 ggacatcatc cacgacccgg gccgcggcgc gcccctcgcc aaggtggtct tccgggatcc 241 gtatcggttt aagaagcgga cggagctgtt cattgccgcc gagggcattc acacgggcca 301 gtttgtgtat tgcggcaaga aggcccagct caacattggc aatgtgctcc ctgtgggcac 361 catgcctgag ggtacaatcg tgtgctgcct ggaggagaag cctggagacc gtggcaagct 421 ggcccgggca tcagggaact atgccaccgt tatctcccac aaccctgaga ccaagaagac 481 ccgtgtgaag ctgccctccg gctccaagaa ggttatctcc tcagccaaca gagctgtggt 541 tggtgtggtg gctggaggtg gccgaattga caaacccatc ttgaaggctg gccgggcgta 601 ccacaaatat aaggcaaaga ggaactgctg gccacgagta cggggtgtgg ccatgaatcc 661 tgtggagcat ccttttggag gtggcaacca ccagcacatc ggcaagccct ccaccatccg 721 cagagatgcc cctgctggcc gcaaagtggg tctcattgct gcccgccgga ctggacgtct 781 ccggggaacc aagactgtgc aggagaaaga gaactagtgc tgagggcctc aataaagttt 841 gtgtttatgc ca // LOCUS HSRBPRL7A 827 bp RNA PRI 19-JAN-1993 DEFINITION H.sapiens mRNA for ribosomal protein L7. ACCESSION X57959 NID g35902 KEYWORDS leucine-zipper; ribosomal protein L7. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 827) AUTHORS Krawinkel,U. TITLE Direct Submission JOURNAL Submitted (14-FEB-1991) U. Krawinkel, Klinische Forschergruppe fuer Rheumatologie, Mooswaldallee 1-9, 7800 Freiburg REFERENCE 2 (bases 1 to 827) AUTHORS Hemmerich,P., von Mikecz,A., Neumann,F., Sozeri,O., Wolff-Vorbeck,G., Zoebelein,R. and Krawinkel,U. TITLE Structural and functional properties of ribosomal protein L7 from humans and rodents JOURNAL Nucleic Acids Res. 21 (2), 223-231 (1993) MEDLINE 93181195 FEATURES Location/Qualifiers source 1..827 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="T-cell" /cell_type="GWK1" /clone_lib="lambda ZAP" gene 11..757 /gene="humL7-14" CDS 11..757 /gene="humL7-14" /codon_start=1 /product="ribosomal protein L7" /db_xref="PID:g35903" /db_xref="SWISS-PROT:P18124" /translation="MEGVEEKKKEVPAVPETLKKKRRNFAELKIKRLRKKFAQKMLRK ARRKLIYEKAKHYHKEYRQMYRTEIRMARMARKAGNFYVPAEPKLAFVIRIRGINGVS PKVRKVLQLLRLRQIFNGTFVKLNKASINMLRIVEPYIAWGYPNLKSVNELIYKLGYG KINKKRIALTDNALIARSLGKYGIICMEDLIHEIYTVGKRFKEANNFLWPFKLSSPRG GMKKKTTHFVEGGDAGNREDQINRLIRRMN" BASE COUNT 276 a 149 c 203 g 199 t ORIGIN 1 ggctggaacc atggagggtg tagaagagaa gaagaaggag gttcctgctg tgccagaaac 61 ccttaagaaa aagcgaagga atttcgcaga gctgaagatc aagcgcctga gaaagaagtt 121 tgcccaaaag atgcttcgaa aggcaaggag gaagcttatc tatgaaaaag caaagcacta 181 tcacaaggaa tataggcaga tgtacagaac tgaaattcga atggcgagga tggcaagaaa 241 agctggcaac ttctatgtac ctgcagaacc caaattggcg tttgtcatca gaatcagagg 301 tatcaatgga gtgagcccaa aggttcgaaa ggtgttgcag cttcttcgcc ttcgtcaaat 361 cttcaatgga acctttgtga agctcaacaa ggcttcgatt aacatgctga ggattgtaga 421 gccatatatt gcatgggggt accccaatct gaagtcagta aatgaactaa tctacaagct 481 tggttatggc aaaatcaata agaagcgaat tgctttgaca gataacgctt tgattgctcg 541 atctcttggt aaatacggca tcatctgcat ggaggatttg attcatgaga tctatactgt 601 tggaaaacgc ttcaaagagg caaataactt cctgtggccc ttcaaattgt cttctccacg 661 aggtggaatg aagaaaaaga ccacccattt tgtagaaggt ggagatgctg gcaacaggga 721 ggaccagatc aacaggctta ttagaagaat gaactaaggt gtctaccatg attatttttc 781 taagctggtt ggttaataaa cagtacctgc tctcaaattg aaaaaaa // LOCUS HSRBQ1 3011 bp RNA PRI 29-MAY-1996 DEFINITION H.sapiens RBQ-1 mRNA. ACCESSION X85133 NID g728590 KEYWORDS RB protein binding protein; RBQ-1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3011) AUTHORS Sakai,Y. TITLE Direct Submission JOURNAL Submitted (06-MAR-1995) Y. Sakai, Biology Div., National Cancer Center Research Inst., Tsukiji 5-1-1, Chuo-ku, Tokyo 104, JAPAN REFERENCE 2 (bases 1 to 3011) AUTHORS Sakai,Y., Saijo,M., Coelho,K., Kishino,T., Niikawa,N. and Taya,Y. TITLE cDNA sequence and chromosomal localization of a novel human protein, RBQ-1 (RBBP6), that binds to the retinoblastoma gene product JOURNAL Genomics 30 (1), 98-101 (1995) MEDLINE 96129310 FEATURES Location/Qualifiers source 1..3011 /organism="Homo sapiens" /strain="NCI-H69" /db_xref="taxon:9606" /cell_line="NEC" /cell_type="small cell lung carcinoma" /clone_lib="lambda ZAPII" /clone="1" /chromosome="16" /map="16p11.2-12" mRNA 1..2994 /gene="RBQ-1" /evidence=experimental /product="RB protein binding protein" gene 1..2994 /gene="RBQ-1" CDS 92..2938 /gene="RBQ-1" /codon_start=1 /product="RB protein binding protein" /db_xref="PID:g755748" /translation="MGIKTLNLVLGLKRALEFPEVFMMEVKDPNMKGAMLTNTGKYAI PTIDAEAYAIGKKEKPPFLPEEPSSSSEEDDPIPDELLCLICKDIMTDAVVIPCCGNS YCDECIRTALLESDEHTCPTCHQNDVSPDALIANKFLRQAVNNFKNETGYTKRLRKQL PPPPPPIPPPRPLIQRNLQPLMRSPISRQQDPLMIPVTSSSTHPAPSISSLTSNQSSL APPVSGNPSSAPAPVPDITATVSISVHSEKSDGPFRDSDNKILPAAALASEHSKGTSS IAITALMEEKGYQVPVLGTPSLLGQSLLHGQLIPTTGPVRINTARPGGGRPGWEHSNK LGYLVSPPQQIRRGERSCYRSINRGRHHSERSQRTQGPSLPATPVFVPVPPPPLYPPP PHTLPLPPGVPPPQFSPQFPPGQPPPAGYSVPPPGFPPAPANLSTPWVSSGVQTAHSN TIPTTQAPPLSREEFYREQRRLKEEEKKKSKLDEFTNDFAKELMEYKKIQKERRRSFS RSKSPYSGSSYSRSSYTYSKSRSGSTRSRSYSRSFSRSHSRSYSRSPPYPRRGRGKSR NYRSRSRSHGYHRSRSRSPPYRRYHSRSRSPQAFRGQSPNKRNVPQGETEREYFNRYR EVPPPYDMKAYYGRSVDFRDPFEKERYREWERKYREWYEKYYKGYAAGAQPRPSANRE NFSPERFLPLNIRNSPFTRGRREDYVGGQSHRSRNIGSNYPEKLSARDGHNQKDNTKS KEKESENAPGDGKGNKHKKHRKRRKGEESEGFLNPELLETSRKSREPTGVEENKTDSL FVLPSRDDATPVRDEPMDAESITFKSVSEKDKRERDKPKAKGDKTKRKNDGSAVSKKE NIVKPAKGPQEKVDGDVRDLLDLNLQLKKPKRRLRRLTILNHHLPLRRMKKSLEPPEK LTLNQQKTPRNKTSQRGKSEEGLFQRCQIRKANN" BASE COUNT 1034 a 682 c 600 g 695 t ORIGIN 1 ggcacgagga aacctctagg tccaccacct ccatcttaca cgtgtttccg ttgtggtaaa 61 cctggacatt atattaagaa ttgcccaaca aatggggata aaaactttga atctggtcct 121 aggattaaaa agagcactgg aattcccaga agttttcatg atggaagtga aagatcctaa 181 tatgaaaggt gcaatgctta ccaacactgg aaaatatgca ataccaacta tagatgcaga 241 agcatatgca attgggaaga aagagaaacc tcccttctta ccagaggagc catcttcttc 301 ctcagaagaa gatgatccta tcccagatga attgttgtgt ctcatctgca aggatattat 361 gactgatgct gttgtgattc cctgctgtgg aaacagttac tgtgatgaat gtataagaac 421 agcactcctg gaatcagatg agcacacatg tccgacgtgt catcaaaatg atgtttctcc 481 tgatgcttta attgccaata aatttttacg acaggctgta aataacttca aaaatgaaac 541 tggctataca aaaagactac gaaaacagtt acctcctcca ccacccccaa taccacctcc 601 gagaccactg attcagagga acctacaacc tctgatgaga tctccgatat caagacaaca 661 agatcctctt atgattccag tgacatcttc atcaactcac ccagctccgt ctatatcttc 721 attaacttct aatcagtctt ccttggcccc tcctgtgtct ggaaatccgt cttctgctcc 781 agctcctgta cctgatataa ctgcaacagt atccatatca gttcattcag aaaaatcaga 841 tggacctttt cgggattctg ataataaaat attgccagct gcagctcttg catcagagca 901 ctcaaaggga acctcctcaa ttgcaattac cgctcttatg gaagagaagg gttaccaggt 961 gcctgttctt ggaaccccat ctttgcttgg acagtcatta ttgcatggac agttgatccc 1021 cacaactggt ccagtaagaa taaatactgc tcgtccaggt ggtggtcgac caggctggga 1081 acattccaac aaacttggct atctggtttc tccaccacaa caaattagaa gaggggagag 1141 gagctgctac agaagtataa accgtgggcg acaccacagc gaaagatcac agaggactca 1201 aggcccgtca ctaccagcaa ctccagtctt tgtacctgtt ccaccacctc ctttgtatcc 1261 gcctcctccc catacacttc ctctccctcc gggtgttcct cctccacagt tttctcctca 1321 gtttcctcct ggccagccac cacccgctgg gtatagtgtc cctcctccag ggtttcctcc 1381 agctcctgcc aatttatcaa caccttgggt atcatcagga gtgcagacag ctcattcaaa 1441 taccatccca acaacacaag caccaccttt gtccagggaa gaattctata gagagcagcg 1501 acgactaaaa gaagaggaaa agaaaaagtc caagctagat gagtttacaa atgattttgc 1561 taaggaattg atggaataca aaaagattca aaaggagcgt aggcgctcat tttccaggtc 1621 taaatctccc tatagtggtt cttcgtattc aagaagttca tatacttatt ctaaatcaag 1681 atctgggtca acacgttcac gctcttattc tcgatcattc agccgctcac attctcgttc 1741 ctattcacgg tcacctccat accccagaag aggcagaggc aagagccgca attaccgttc 1801 acggtctaga tctcatggat atcatcgatc taggtcaagg tcaccccctt acagacgcta 1861 tcattcacga tcaagatctc ctcaagcgtt taggggacag tctcctaata aacgtaatgt 1921 acctcaaggg gaaacagaac gtgaatattt taatagatac agagaagttc caccaccata 1981 tgacatgaaa gcatattatg ggagaagtgt tgactttaga gacccatttg aaaaagaacg 2041 ctaccgagaa tgggagagaa aatatagaga gtggtatgaa aaatattata aaggttatgc 2101 tgctggagca cagcctagac cctcagcaaa tagagagaac ttttctccag agagattttt 2161 gccacttaac atcaggaatt ctcccttcac aagaggccgc agagaagact atgttggtgg 2221 gcaaagtcat agaagtcgaa acataggtag caactatcca gaaaagcttt cagcaagaga 2281 tggtcacaat cagaaggata atacaaagtc aaaagagaag gagagtgaaa acgctccagg 2341 agatggtaaa ggaaataagc ataagaaaca cagaaaaaga agaaaagggg aggaaagtga 2401 gggttttctg aacccagagt tattagagac ttctaggaaa tcaagagaac ctacaggtgt 2461 tgaagaaaat aaaacagact cattgtttgt tctcccaagt agagatgatg ccacacctgt 2521 tagagatgaa ccaatggatg cagaatcaat cacttttaaa tcagtgtctg aaaaagacaa 2581 gagagaaagg gataaaccaa aagcaaaggg tgataaaacc aaacggaaga atgatggatc 2641 tgctgtgtcc aaaaaagaaa atattgtaaa acctgctaaa ggaccccaag aaaaagtaga 2701 tggagacgtg agagatctcc tcgatctgaa cctccaatta aaaaagccaa agaggagact 2761 ccgaagactg acaatactaa atcatcatct tcctctcaga aggatgaaaa aatcactgga 2821 acccccagaa aagctcactc taaatcagca aaagacacca agaaacaaaa ccagtcaaag 2881 aggaaaaagt gaagaaggac tattccaaag atgtcaaatc agaaaagcta acaactaagg 2941 aagaaaaggc caagaagcct aatgagaaaa acaaaccact tgataataag ggagaaaaaa 3001 aaaaaaaaaa a // LOCUS HSRBQ3 3318 bp RNA PRI 29-FEB-1996 DEFINITION H.sapiens RBQ-3 mRNA. ACCESSION X85134 NID g755749 KEYWORDS RB protein binding protein; RBQ-3 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3318) AUTHORS Yoichi,T. TITLE Direct Submission JOURNAL Submitted (06-MAR-1995) T. Yoichi, Biology Division, National Cancer Center, Research Institute, Tsukiji5-1-1, chuo-ku, Tokyo 104, JAPAN REFERENCE 2 (bases 1 to 3317) AUTHORS Saijo,M., Sakai,Y., Kishino,T., Niikawa,N., Matsuura,Y., Morino,K., Tamai,K. and Taya,Y. TITLE Molecular cloning of a human protein that binds to the retinoblastoma protein and chromosomal mapping JOURNAL Genomics 27 (3), 511-519 (1995) MEDLINE 96047338 FEATURES Location/Qualifiers source 1..3318 /organism="Homo sapiens" /strain="NCI-H69" /db_xref="taxon:9606" /cell_line="NEC" /cell_type="small cell lung carcinoma" /chromosome="1" /map="1q32" /clone_lib="lambda ZAPII" /clone="1" mRNA 1..3302 /gene="RBQ-3" /evidence=experimental /product="RB protein binding protein" gene 1..3302 /gene="RBQ-3" CDS 116..1732 /gene="RBQ-3" /codon_start=1 /product="RB protein binding protein" /db_xref="PID:g755750" /translation="MNLELLESFGQNYPEEADGTLDCISMALTCTFNRWGTLLAVGCN DGRIVIWDFLTRGIAKIISAHIHPVCSLCWSRDGHKLVSASTDNIVSQWDVLSGDCDQ RFRFPSPILKVQYHPRDQNKVLVCPMKSAPVMLTLSDSKHVVLPVDDDSDLNVVASFD RRGEYIYTGNAKGKILVLKTDSQDLVASFRVTTGTSNTTAIKSIEFARKGSCFLINTA DRIIRVYDGREILTCGRDGEPEPMQELQDLVNRTPWKKCCFSGDGEYIVAGSARQHAL YIWEKSIGNLVKILHGTRGELLLDVAWHPVRPIIASISSGVVSIWAQNQVENWSAFAP DFKELDENVEYEERESGFDIEDEDKSEPEQTGADAAEDEEVDVTSVDPIAAFCSSDEE LEDSKALLYLPIAPEVEDPEENPYGPPPDAVQTSLMDEGASSEKKRQSSADGSQPPKK KPKTTNIELQGVPNDEVHPLLGVKGDGKSKKKQAGRPKGSKGKEKDSPFKPKLYKGDR GLPLEGSAKGKVQAELSQPLTAGGAISELL" BASE COUNT 922 a 675 c 802 g 919 t ORIGIN 1 ggcacgaggc ggcagtctct tcgcggcgtc caccacttag acgcaagttg ctgaagccgg 61 ccggggagaa ggtgttgttg ccggagctga gaccgggcgg ccacagtccg cagggatgaa 121 cctcgagttg ctggagtcct ttgggcagaa ctatccagag gaagctgatg gaactttgga 181 ttgtatcagc atggctttga cttgcacctt taacaggtgg ggcacactgc ttgcagttgg 241 ctgtaatgat ggccgaattg tcatctggga tttcttgaca agaggcattg ctaaaataat 301 tagtgcacac atccatccag tgtgttcttt atgctggagc cgagatggtc ataaactcgt 361 gagtgcttcc actgataaca tagtgtcaca gtgggatgtt ctttcaggcg actgtgacca 421 gaggtttcga ttcccttcac ccatcttaaa agtccaatat catccacgag atcagaacaa 481 ggttctcgtg tgtcccatga aatctgctcc tgtcatgttg accctttcag attccaaaca 541 tgttgttctg ccggtggacg atgactccga tttgaacgtt gtggcatctt ttgataggcg 601 aggggaatat atttatacgg gaaacgcaaa aggcaagatt ttggtcctaa aaacagattc 661 tcaggatctt gttgcttcct tcagagtgac aactggaaca agcaatacca cagccattaa 721 gtcaattgag tttgcccgga aggggagttg ctttttaatt aacacggcag atcgaataat 781 cagagtttat gatggcagag aaatcttaac atgtggaaga gatggagagc ctgaacctat 841 gcaggaattg caggatttgg tgaataggac cccatggaag aaatgttgtt tctctgggga 901 tggggaatac atcgtggcag gttctgcccg gcagcatgcc ctgtacatct gggagaagag 961 cattggcaac ctggtgaaga ttctccatgg gacgagagga gaactcctct tggatgtagc 1021 ttggcatcct gttcgaccca tcatagcatc catttccagt ggagtggtat ctatctgggc 1081 acagaatcaa gtagaaaact ggagtgcatt tgcaccagac ttcaaagaat tggatgaaaa 1141 tgtagaatac gaagaaaggg aatcagggtt tgatattgaa gatgaagata agagtgagcc 1201 tgagcagaca ggggctgatg ctgcagaaga tgaggaagtg gatgtcacca gcgtggaccc 1261 tattgctgcc ttctgtagca gtgatgaaga gctggaagat tcaaaggctc tattgtattt 1321 acccattgcc cctgaggtag aagacccaga agaaaatcct tacggccccc caccggatgc 1381 agtccaaacc tccttgatgg atgaaggggc tagttcagag aagaagaggc agtcctcagc 1441 agatgggtcc cagccaccta agaagaaacc caaaacaacc aatatagaac ttcaaggagt 1501 accaaatgat gaagtccatc cactactggg tgtgaagggg gatggcaaat ccaagaagaa 1561 gcaagcaggc cggcctaaag gatcaaaagg taaagagaaa gattctccat ttaaaccgaa 1621 actctacaaa ggggacagag gtttacctct ggaaggatca gcgaagggta aagtgcaggc 1681 ggaactcagc cagcccttga cagcaggagg agcaatctca gaactgttat gaagaccttc 1741 gaagttcttc attctttctc actttgccat catgtggcct ctggacactg tggtcagtca 1801 tttgaaaatt gactttaatt taaaacaaag gcctgtgctc cacccaggag gtgggaggtg 1861 aattttatgt ttaaatgaag aagtgaatta tggaagaagg tatacgacct tcccttcctt 1921 ttcaagcata agtccaaata gactctcagg aatgaagatt tgtgaagaca tcagatagga 1981 attttggact catttaaact ttgatgctta gttatgttgc tggagaaaag atacttatgt 2041 tttgctcatc taacttcatt gtacccagcg tcattttgac atgtcatttc ctatctccca 2101 tttgccttcg gtcctcaatg catgtctttg agtgacttct tatctgaaat tttgctactg 2161 gtatcctagg aaagcttttg ttggatactc tcattttaaa cttctcctct ccccagatac 2221 ctcctatatt tccatattgt gtgcaaagga tgggcagaaa agaaagtgct tgaaagattt 2281 caaattttca gaaagggaac aacgaaggcc ctctcttcct ctcataccac gttttgctca 2341 agaagctggg ctgtaacaat tcagggtttt cccttgtttt cctctcattg catgtttccc 2401 tccaatattg gttcattgtc atcaatcatg gtttttgaag atagctagtt ttatccatct 2461 ccagcaaaga atcatcaata gtttatattg ctttacctgt gctggcttcc agagatggaa 2521 acaaacccag gtgtctctca acaagctact tttttactgg ggtgggggaa tctatgcaag 2581 gagtaaagta aaaccatcca gaatcaaagc agcaaccaca tagttcaaat caaagatcaa 2641 ggtgaatttt ttgtatcact gcctgtggaa atctatcctc atcagtcatt gcatttttcc 2701 ctgcctatac ctgtgctcct ttttcttact gtgttttcag tcacttcctt tctgtgaaag 2761 gttgcttagc tttttttttg acatttgttg ttctttatat aaaaataaca gattggatag 2821 atgtgtacat ttggtgtttg aaattctctg aaaatcccat taggaaacca ggtgtgaaaa 2881 gggctcagta gcttctctga gtggcgtttt tagctgactg gaagtgctta atctggatcg 2941 tctttttttt tttttttttt ttcaatattt taaaaggaga atttaaatac tgtgcttact 3001 gtgaaatata tcagttggtg agccgggcgt ggtgggtcac gcctgtaatc ccagcacttt 3061 gggaggccaa ggcgggtgga tcacccgagg tcaggagttc aagaccagcc tggccaacgt 3121 ggtgaaagcc tgtatctatt aaaagacaaa aattagctgg gcgtggtggt acatgcctgt 3181 aatcccagct acactggagg ctgagtcagg agaatcactt gaacgtggga ggcagaggtt 3241 gcagtgagtg gagatcgcac cactgccctc cagcctaggt gacagaatga gactctatct 3301 caaaaaaaaa aaaaaaaa // LOCUS HSRCAER 1431 bp RNA PRI 27-FEB-1995 DEFINITION H.sapiens mRNA for red cell anion exchanger (EPB3, AE1, Band 3) 3' non-coding region. ACCESSION X77737 NID g535096 KEYWORDS Alu repeat; red cell anion exchanger. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1431) AUTHORS Schofield,A.E., Martin,P.G., Spillett,D. and Tanner,M.J. TITLE The structure of the human red blood cell anion exchanger (EPB3, AE1, band 3) gene JOURNAL Blood 84 (6), 2000-2012 (1994) MEDLINE 94362252 REFERENCE 2 (bases 1 to 1431) AUTHORS Tanner,M.J., Martin,P.G. and High,S. TITLE The complete amino acid sequence of the human erythrocyte membrane anion-transport protein deduced from the cDNA sequence JOURNAL Biochem. J. 256 (3), 703-712 (1988) MEDLINE 89134172 REFERENCE 3 (bases 1 to 1431) AUTHORS Tanner,M.J.A. TITLE Direct Submission JOURNAL Submitted (14-FEB-1994) M.J.A. Tanner, Bristol University, Department of Biochemistry, Med. School, University Walk, Bristol, BS8 1TD, UK FEATURES Location/Qualifiers source 1..1431 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="reticulocyte" /chromosome="17" /map="17q21qter" misc_signal 334..343 /note="GC box" misc_signal 339..350 /note="GC box" misc_signal 392..398 /note="GC box" GC_signal 405..412 misc_difference 441..445 /note="cDNA has 46 bp deletion" CAAT_signal 514..518 TATA_signal 569..574 CDS 618..728 /codon_start=1 /product="anion exchanger" /db_xref="PID:g535097" /translation="MEETESGSGRKERRWLGAVAHVCNPSTLGGQVGRII" repeat_region 661..971 /rpt_family="Alu" misc_difference 981 /note="A (cDNA) to T (genomic)" misc_feature 1124..1169 /note="Donehower" polyA_signal 1411..1416 BASE COUNT 379 a 338 c 382 g 332 t ORIGIN 1 gggatttgaa ccaggccttc tgatttcaag gtccgagctc tgtcctctgt cagtcatgcg 61 tccactttcc cttcccctgt gactcctccc ttccccactc tgctcccagc ccctaccttg 121 agaccctctt ctctgggccc agagagaggc gtcctggtga ggacaaggta caggcaagga 181 tgatccaggg attgggcctg ggactcaggc ctcctaagtg tttggttcct ccctccaaac 241 actcattagt tcactcattc attcattcca caaacattta ctgagggccc cggaatcagt 301 ggactccgag gggactgaga caagccctgc cctggggtgg gggtgggggg caaggtacag 361 ttgattctac atttggatag ggagtggggg agggtgggaa ggtaggggcg ggagagtgag 421 ggggtttgta atttattaat tttcaggccc ctcatttgag agccatttcc tcaactccat 481 ctaaactgaa tcttggggag aacccagatc tgaccaattg gggtaggaga cagcaggctc 541 tccaagaaca tgggcaaatt tattttttta taaaacaaaa agataaaaag agttgaaaga 601 cgtgaaagtg gtgagagatg gaggaaacag aatcaggaag tggtagaaaa gagaggaggt 661 ggctgggcgc agtggctcac gtttgtaatc ccagcacttt gggaggccaa gttgggcgga 721 tcatttgagg tcaggagttt gagaccagcc tggccaacat ggtgaaaccc cgttactact 781 aaaaatacaa aaattagctg ggtgtctcgt ggcaggcacc tgtaatccca gctacttaga 841 aggctgaggc aagagaatca cctgaaccca ggaggtggag gttgcagtga gccaagattg 901 caccactgca ctccagcctg ggcaacagag caagaccctg tctcaaaaaa aaaaaaaaaa 961 aaaaaaaaaa acggaaggaa acatcagcct tgggggccac agactcaaca tgtgtgtgtg 1021 gtggggttcc agcccaacat agagtaacat tatttgtacc tcccaggcta gctcagtcca 1081 tgggaggctc tcctgtccct gaaagctgac acccaccttt caccacttcg cccatgctac 1141 agttcagttt cctcgtctgt aaaatgggga tgataatggt acctaccttg cagtgttgtt 1201 ataaggatta aaggagacag tgcaagaaaa ggccttggtt ggtgaagagc ccaacctcgg 1261 aggggagctg ctgggatcct ccttatcttg actgggatgt ccctgtctcc ccctcccctt 1321 gctccttgaa catggccaag gaaagtgaaa aacaaaaatt attcactctg ctagcaccct 1381 tccccttgat gcctgggaat aggttttgcc aataaacgta tctgtgttgg a // LOCUS HSRCC1A 1724 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for cell cycle gene RCC1. ACCESSION X06130 NID g35906 KEYWORDS cell cycle control; RCC1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1724) AUTHORS Ohtsubo,M., Kai,R., Furuno,N., Sekiguchi,T., Sekiguchi,M., Hayashida,H., Kuma,K., Miyata,T., Fukushige,S., Murotsu,T., Matsubara,K. and Nishimoto,T. TITLE Isolation and characterization of the active cDNA of the human cell cycle gene (RCC1) involved in the regulation of onset of chromosome condensation JOURNAL Genes Dev. 1 (6), 585-593 (1987) MEDLINE 88056300 COMMENT see x12654 for RCC1 pcD51 cDNA sequence. FEATURES Location/Qualifiers source 1..1724 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="fibroblast cDNA" /clone="pcD40" /chromosome="1" CDS 303..1568 /note="RCC1 protein (AA 1-421)" /codon_start=1 /db_xref="PID:g35907" /db_xref="SWISS-PROT:P18754" /translation="MSPKRIAKRRSPPADAIPKSKKVKVSHRSHSTEPGLVLTLGQGD VGQLGLGENVMERKKPALVSIPEDVVQAEAGGMHTVCLSKSGQVYSFGCNDEGALGRD TSVEGSEMVPGKVELQEKVVQVSAGDSHTAALTDDGRVFLWGSFRDNNGVIGLLEPMK KSMVPVQVQLDVPVVKVASGNDHLVMLTADGDLYTLGCGEQGQLGRVPELFANRGGRQ GLERLLVPKCVMLKSRGSRGHVRFQDAFCGAYFTFAISHEGHVYGFGLSNYHQLGTPG TESCFIPQNLTSFKNSTKSWVGFSGGQHHTVCMDSEGKAYSLGRAEYGRLGLGEGAEE KSIPTLISRLPAVSSVACGASVGYAVTKDGRVFAWGMGTNYQLGTGQDEDAWSPVEMM GKQLENRVVLSVSSGGQHTVLLVKDKEQS" BASE COUNT 401 a 419 c 532 g 372 t ORIGIN 1 ctttttggag acagattcgc agtggtcgct tcttctcctt ggatttgtta aggattccaa 61 gtaactctta tttggagaga agacgatctg cacttcgcat tttggcattg acatttaatt 121 ttagggtcct ttatatagaa gggagagtag ctacatgaat gtgtaagatc ttggaggaag 181 acagcagaga gagagagaga gatcagagat cccagggtta aaagttggag aaatttcaca 241 gtacatcatc caaaagagga gtccatgatg gaggcagagg taaacttgga gaggacagga 301 agatgtcacc caagcgcata gctaaaagaa ggtccccccc agcagatgcc atccccaaaa 361 gcaagaaggt gaaggtctca cacaggtccc acagcacaga acccggcttg gtgctgacac 421 taggccaggg cgacgtgggc cagctggggc tgggtgagaa tgtgatggag aggaagaagc 481 cggccctggt atccattccg gaggatgttg tgcaggctga ggctgggggc atgcacaccg 541 tgtgtctaag caaaagtggc caggtctatt ccttcggctg caatgatgag ggtgccctgg 601 gaagggacac atcagtggag ggctcggaga tggtccctgg gaaagtggag ctgcaagaga 661 aggtggtaca ggtgtcagca ggagacagtc acacagcagc cctcaccgat gatggccgtg 721 tcttcctctg gggctccttc cgggacaata acggtgtgat tggactgttg gagcccatga 781 agaagagcat ggtgcctgtg caggtgcagc tggatgtgcc tgtggtaaag gtggcctcag 841 gaaacgacca cttggtgatg ctgacagctg atggtgacct ctacaccttg ggctgcgggg 901 aacagggcca gctaggccgt gtgcctgagt tatttgccaa ccgtggtggc cggcaaggcc 961 tcgaacgact cctggtcccc aagtgtgtga tgctgaaatc caggggaagc cggggccacg 1021 tgagattcca ggatgccttt tgtggtgcct atttcacctt tgccatctcc catgagggcc 1081 acgtgtacgg cttcggcctc tccaactacc atcagcttgg aactccgggc acagaatctt 1141 gcttcatacc ccagaaccta acatccttca agaattccac caagtcctgg gtgggcttct 1201 ctggtggcca gcaccataca gtctgcatgg attcggaagg aaaagcatac agcctgggcc 1261 gggctgagta tgggcggctg ggccttggag agggtgctga ggagaagagc atacccaccc 1321 tcatctccag gctgcctgct gtctcctcgg tggcttgtgg ggcctctgtg gggtatgctg 1381 tgaccaagga tggtcgtgtt ttcgcctggg gcatgggcac caactaccag ctgggcacag 1441 ggcaggatga ggacgcctgg agccctgtgg agatgatggg caaacagctg gagaaccgtg 1501 tggtcttatc tgtgtccagc gggggccagc atacagtctt attagtcaag gacaaagaac 1561 agagctgatg aagcctctga gggcctggct tctgtcctgc acaacctccc tcacagaaca 1621 gggaagcagt gacagctgca gatggcagcg ggcctctccc cagccctgag cactgtgtca 1681 gttcctgcct tttctcatca gcagaacaga atccttttcc tctt // LOCUS HSRCYP3 2759 bp RNA PRI 24-AUG-1995 DEFINITION Human mRNA for cytochrome P-450 (cyp3 locus). ACCESSION X12387 NID g35910 KEYWORDS cyp3 locus; cytochrome; cytochrome P450. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2759) AUTHORS Spurr,N.K. TITLE Direct Submission JOURNAL Submitted (15-JUL-1988) Spurr, N.K., Imperial Cancer Research Fund, Blanche Lane, South mimms, Potters bar, Herts EN6 3LD, United Kingdom REFERENCE 2 (bases 1 to 2759) AUTHORS Spurr,N.K., Gough,A.C., Stevenson,K. and Wolf,C.R. TITLE The human cytochrome P450 CYP3 locus: assignment to chromosome 7q22-qter JOURNAL Hum. Genet. 81 (2), 171-174 (1989) MEDLINE 89108438 FEATURES Location/Qualifiers source 1..2759 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="adult liver" /clone_lib="lambda gt11" /chromosome="7q22-qter" CDS 70..1581 /note="cytochrome P-450 (AA 1-503)" /codon_start=1 /db_xref="PID:g35911" /db_xref="SWISS-PROT:P08684" /translation="MALIPDLAMETWLLLAVSLVLLYLYGTHSHGLFKKLGIPGPTPL PFLGNILSYHKGFCMFDMECHKKYGKVWGFYDGQQPVLAITDPDMIKTVLVKECYSVF TNRRPFGPVGFMKSAISIAEDEEWKRLRSLLSPTFTSGKLKEMVPIIAQYGDVLVRNL RREAETGKPVTLKDVFGAYSMDVITSTSFGVNIDSLNNPQDPFVENTKKLLRFDFLDP FFLSITVFPFLIPILEVLNICVFPREVTNFLRKSVKRMKESRLEDTQKHRVDFLQLMI DSQNSKETESHKALSDLELVAQSIIFIFAGYETTSSVLSFIMYELATHPDVQQKLQEE IDAVLPNKAPPTYDTVLQMEYLDMVVNETLRLFPIAMRLERVCKKDVEINGMFIPKGW VVMIPSYALHRDPKYWTEPEKFLPERFSKKNKDNIDPYIYTPFGSGPRNCIGMRFALM NMKLALIRVLQNFSFKPCKETQIPLKLSLGGLLQPEKPVVLKVESRDGTVSGA" BASE COUNT 840 a 572 c 578 g 769 t ORIGIN 1 gaattcccaa agagcaacac agagctgaaa ggaagactca gaggagagag ataagtaagg 61 aaagtagtga tggctctcat cccagacttg gccatggaaa cctggcttct cctggctgtc 121 agcctggtgc tcctctatct atatggaacc cattcacatg gactttttaa gaagcttgga 181 attccagggc ccacacctct gccttttttg ggaaatattt tgtcctacca taagggcttt 241 tgtatgtttg acatggaatg tcataaaaag tatggaaaag tgtggggctt ttatgatggt 301 caacagcctg tgctggctat cacagatcct gacatgatca aaacagtgct agtgaaagaa 361 tgttattctg tcttcacaaa ccggaggcct tttggtccag tgggatttat gaaaagtgcc 421 atctctatag ctgaggatga agaatggaag agattacgat cattgctgtc tccaaccttc 481 accagtggaa aactcaagga gatggtccct atcattgccc agtatggaga tgtgttggtg 541 agaaatctga ggcgggaagc agagacaggc aagcctgtca ccttgaaaga cgtctttggg 601 gcctacagca tggatgtgat cactagcaca tcatttggag tgaacatcga ctctctcaac 661 aatccacaag acccctttgt ggaaaacacc aagaagcttt taagatttga ttttttggat 721 ccattctttc tctcaataac agtctttcca ttcctcatcc caattcttga agtattaaat 781 atctgtgtgt ttccaagaga agttacaaat tttttaagaa aatctgtaaa aaggatgaaa 841 gaaagtcgcc tcgaagatac acaaaagcac cgagtggatt tccttcagct gatgattgac 901 tctcagaatt caaaagaaac tgagtcccac aaagctctgt ccgatctgga gctcgtggcc 961 caatcaatta tctttatttt tgctggctat gaaaccacga gcagtgttct ctccttcatt 1021 atgtatgaac tggccactca ccctgatgtc cagcagaaac tgcaggagga aattgatgca 1081 gttttaccca ataaggcacc acccacctat gatactgtgc tacagatgga gtatcttgac 1141 atggtggtga atgaaacgct cagattattc ccaattgcta tgagacttga gagggtctgc 1201 aaaaaagatg ttgagatcaa tgggatgttc attcccaaag ggtgggtggt gatgattcca 1261 agctatgctc ttcaccgtga cccaaagtac tggacagagc ctgagaagtt cctccctgaa 1321 agattcagca agaagaacaa ggacaacata gatccttaca tatacacacc ctttggaagt 1381 ggacccagaa actgcattgg catgaggttt gctctcatga acatgaaact tgctctaatc 1441 agagtccttc agaacttctc cttcaaacct tgtaaagaaa cacagatccc cctgaaatta 1501 agcttaggag gacttcttca accagaaaaa cccgttgttc taaaggttga gtcaagggat 1561 ggcaccgtaa gtggagcctg aattttccta aggacttctg ctttgctctt caagaaatct 1621 gtgcctgaga acaccagaga cctcaaatta ctttgtgaat agaactctga aatgaagatg 1681 ggcttcatcc aatggactgc ataaataacc ggggattctg tacatgcatt gagctctctc 1741 attgtctgtg tagagtgtta tacttgggaa tataaaggag gtgaccaaat cagtgtgagg 1801 aggtagattt ggctcctctg cttctcacgg gactatttcc accaccccca gttagcacca 1861 ttaactcctc ctgagctctg ataagagaat caacatttct caataatttc ctccacaaat 1921 tattaatgaa aataagaatt attttgatgg ctctaacaat gacatttata tcacatgttt 1981 tctctggagt attctatagt tttatgttaa atcaataaag accactttac aaaagtatta 2041 tcagatgctt tcctgcacat taaggagaat ctatagaact gaatgagaac caacaagtaa 2101 atatttttgg tcattgtaat cactgttggc gtggggcctt tgtcagaact agaatttgat 2161 tattaacata ggtgaaagtt aatccactgt gactttgccc attgtttaga aagaatattc 2221 atagtttaat tatgcctttt ttgatcaggc acatggctca cgcctgtaat cctagcagtt 2281 tgggaggctg agccgggtgg atcgcctgag gtcaggagtt caagacaagc ctggcctaca 2341 tggtgaaacc ccatctctac taaaaataca caaattagct aggcatggtg gactcgcctg 2401 taatctcact acacaggagg ctgaggcagg agaatcactt gaacctggga ggcggatgtt 2461 gaagtgagct gagattgcac cactgcactc cagtctgggt gagagtgaga ctcagtctta 2521 aaaaaatatg cctttttgaa gcacgtacat tttgtaacaa agaactgaag ctcttattat 2581 attattagtt ttgatttaat gttttcagcc catctccttt catatttctg ggagacagaa 2641 aacatgtttc cctacacctc ttgcttccat cctcaacacc caactgtctc gatgcaatga 2701 acacttaata aaaaacagtc gattggtcaa aaaaaaaaaa aaaaaaaaaa aaagaattc // LOCUS HSRDC1MR 3492 bp RNA PRI 23-SEP-1992 DEFINITION H.sapiens mRNA for RDC-1 POU domain containing protein. ACCESSION X64624 NID g35914 KEYWORDS POU domain; POU domain protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3492) AUTHORS Collum,R.G., Fisher,P.E., Datta,M., Mellis,S., Thiele,C., Huebner,K., Croce,C.M., Israel,M.A., Theil,T., Moroy,T. et,al. TITLE A novel POU homeodomain gene specifically expressed in cells of the developing mammalian nervous system JOURNAL Nucleic Acids Res. 20 (18), 4919-4925 (1992) MEDLINE 93027214 REFERENCE 2 (bases 1 to 3492) AUTHORS Alt,F.W. TITLE Direct Submission JOURNAL Submitted (25-FEB-1992) F.W. Alt, Howard Hughes Medical Institute, The Children's Hospital, 300 Longwood Avenue, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..3492 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa, CHP100" /clone_lib="HeLa genomic lib., CHP100 genomic lib." /chromosome="13" /map="p14-22" gene 278..1273 /gene="RDC-1" CDS 278..1273 /gene="RDC-1" /codon_start=1 /product="RDC-1" /db_xref="PID:g35915" /db_xref="SWISS-PROT:Q01851" /translation="MNSVPCHTSTVPLAHHHHHHHHHQALEPGDLLDHISSPSLALMA GAGARRGAGGGGAHDAAGGGGPRGGGGGPGGGGPGGGGGGAAGGGGGGPGGGLLGASA HPHPHMHSLGHLSHPAAAAAMNMPSGLPHPGLVAAAAHHGAAAAAAAAAAGQVAAASA AAVVGRAGLASICDSDTDPRELEAFGSFKQRRIKLGVTQADVGSALANLKIPGVGSLS QSTICRFESLTLSHNNMIALKPILQAWLEEAEGPSEKMNKPELFNGGEKKRKRTSIAA PEKRSLEAYFAVQPRPSSEKIAAIAEKLDLKKNVVRVWFCNQRQKQKRMKFSATY" misc_feature 796..1252 /gene="RDC-1" /note="POU domain" BASE COUNT 807 a 884 c 953 g 848 t ORIGIN 1 tccactgccc ccaaacccgt tgttattttg gttgctttgt gtttgccctt cgcatgctgt 61 gctttccggc ctgtgtgtgt gtttcttctg ttgttttgtt ttgctctttt ggttttgtgg 121 tttttttggt ttggtttgtg tcgcctgcag ctgcaccgag caacctcttc gccagcctgg 181 acgagacgct gctggccggg ccgaggcctg gcggccgtgg acatcgccgt gtcccagggc 241 aagagcatcc ttcaagccgg acgccacgta ccacacgatg aacagcgtgc cgtgtcacac 301 ttccacggtg cctctggcgc accaccacca ccaccaccac caccaccagg cgctcgaacc 361 cggcgatctg ctggaccaca tctcctcgcc gtcgctcgcg ctcatggccg gcgcgggcgc 421 gcggcgcggc gccggcggcg gcggcgccca cgacgccgcg gggggcggtg gcccgcgggg 481 cggcggcggc ggcccgggcg gcggcggccc cgggggaggc ggcggtggcg ccgcgggggg 541 cggcggcggc ggcccgggcg gcgggctcct gggcgcgtcc gcgcaccctc acccgcatat 601 gcacagcctg ggccacctgt cgcaccccgc ggcggcggcc gccatgaaca tgccgtccgg 661 gctgccgcac cccgggctgg tggcggcggc ggcgcaccac ggcgcggcag cggcagcggc 721 ggcggcggcg gccgggcagg tggcagcggc atcggcggcg gccgtggtgg gccgagcggg 781 cctggcgtcc atctgcgact cggacacgga cccgcgcgag ctcgaggcgt tcgggagctt 841 caagcagcgg cgcatcaagc tgggcgtgac gcaggccgac gtgggctcgg cgctggccaa 901 cctcaagatc ccgggcgtgg gctcactcag ccagagcacc atctgcaggt tcgagtcgct 961 cacgctctcg cacaacaaca tgatcgcgct caagcccatc ctgcaggcgt ggctcgagga 1021 ggccgagggg cccagcgaga aaatgaacaa gcctgagctc ttcaacggcg gcgagaagaa 1081 gcgcaagcgg acttccatcg ccgcgcccga gaagcgctcc ctcgaggcct acttcgccgt 1141 gcagccccgg ccctcgtccg agaagatcgc cgccatcgcc gagaaactgg acctcaaaaa 1201 gaacgtggtg cgggtgtggt tttgcaacca gagacagaag cagaagcgga tgaaattctc 1261 tgccacttac tgagggggct gggaggtgtc gggcgggaca gaatggggag ctgaggaggc 1321 atttttgggg ggctttcctc tgcttgcctc ccctcggatt tggagtgtcc gttatcctgc 1381 ctgcatttgg ggagtccctt ctcgctctct ttcctccacc cattctctga ttttcctgcc 1441 tttgctgtcc cctagcctta gaggactggg gtgctgggtg tggggattgg agtatagggt 1501 aggggagaag ggggggagca ttcgggggag tggggagtgg ggggaaggaa agcggagacc 1561 cgagcagggg ttttaaggag caggatggtt ctggggtttg ggtgggggga gacgcgggaa 1621 gggtaggaaa atggactgtt ctgaccagag acacttacct aatatcctgg gacaagaact 1681 atgtacaaaa caaacctacc aaccaccaaa aactagacaa ataaagacaa actaaaacaa 1741 aacagaacaa aagcaaagga aaatgcttta gaaattttaa ctccggggag ccataatctg 1801 caacttcatt ttcccccata gaagagaaaa aagagcacca ccattattac cacctcccca 1861 accctacacg cacgaactga gtcgaaaaac gaaaaccaaa cgagcgagaa gttgaagttc 1921 tgggtatcaa agctagttgt tctgtctgcg tgtttaattt ttccctctct cacctccacc 1981 ccatccatat cctctttatt tcctccgttc caatgagacc tatggctgct ctccaatccc 2041 gggaagtgag tgggagacag ctgaaaagag agggtcaggg ggaggctggc tgcttgctta 2101 ggtggaatcc aagttttccc gtggccctgc ctatactctg gtggcctggt cctgttgggg 2161 tgggggtctt tggagagaag ggcatagtct ttgagctact aaaaagcaga attccggagc 2221 ttcgagatat cttattctag gaaaatgaaa caattttaac aacagttttt tttcctctta 2281 tgtcgaagat ctagttttag acaatttcaa aataagcttt tcccactcat agaactttaa 2341 cttgcccttt cagttttatc ttttttttag agagaggttt aaactactga ttttggctgt 2401 tgattcaaat agactaatgg ggtgaaagtt attaggagag atactctctc ctgtttctcc 2461 actgaacgag actcatcttg ctcttctagg tcccgtttct tcctctcttg gacatgaaat 2521 tatagaaatg ttgagaagtc tgcctgcttt cttttgcggt aggacttggc tgtgagaaaa 2581 tcacctaaat cccagaaaag aggaagacag atttaaagtg cccccacccc catttgtttc 2641 aaagaggtct gcatgttggg cgaaaacaga acaactgtgt ttccttttac ttgttcttat 2701 tattcaagag tcatttatta caggggataa atgttgggta gcaagaactt taatttgcac 2761 taccagtctc ccaaatagaa aatcatgtat agtatttcat agtaataatc aggtacctta 2821 caagctgcgg tggattttaa aaaattaaga tagttgaagg tggttaggta aaatgcctgc 2881 tttgtgtaca agatactctt tggatctctc gtaggaatgg tttgttacca tcctttaatc 2941 ataactaaaa cattgaaaac agaacaaatg agaaaagaaa aaaaacctgc cgattaacaa 3001 tgacgaaaat catgcatgat ctgaaaggtg tggaaagaaa cacaattagg tctcactctg 3061 gttaggcatt atttatttaa ttatgttgta tatcattgtt tgcagggcaa cattctatgc 3121 attgaactga gcactaactg ggctagcttc tggtagacgt ttgtggctag tgcgattcac 3181 agtctactgc ctgttccact gaaacatttt gtcatattct tgtattcaaa gaaaaaagga 3241 aaaaaagatt attgtaaata ttttatttaa tgcacacatt cacacagtgg taacagactg 3301 ccagtgttca tcctgaaatg tctcacggat tgatctacct gtccatgtat gtctgctgag 3361 ctttctcctt ggttatgttt tttctctttt acctttctcc tcccttactt ctatcagaac 3421 caattctatg cgccaaaata caacaggggg atgtgtccca gtacacttac aaataaaacc 3481 tcgtgccgaa tt // LOCUS HSREC1L 2161 bp RNA PRI 20-OCT-1992 DEFINITION H.sapiens REC1L mRNA. ACCESSION X57303 NID g35919 KEYWORDS rec1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2161) AUTHORS Abritton,M. TITLE Direct Submission JOURNAL Submitted (16-JAN-1991) M. Abritton, howard Hughes Medical Institute, Brigham & Women's Hospital, thorn Building-Room 928, 75 Francis Street, Boston MA 02115, USA REFERENCE 2 (bases 1 to 2161) AUTHORS Abritton,M., Tseng,L. and Cunningham,J.M. TITLE Nucleotide sequence of the human gene similar to the murine ecotropic retrovirus receptor JOURNAL Unpublished FEATURES Location/Qualifiers source 1..2161 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="bladder" /cell_line="EJ bladder carcinoma" /clone_lib="Lambda gt10LT" /clone="LT3 and LT40" /chromosome="13" /map="q12-q14" gene 151..2040 /gene="REC1L" CDS 151..2040 /gene="REC1L" /codon_start=1 /db_xref="PID:g35920" /translation="MGCKVLLNIGQQMLRRKVVDCSPEETRLSRCLNTFDLVALGVGS TLGAGVYVLAGAVARENAGPAIVISFLIAALASVLAGLCYGEFGARVPKTGSAYLYSY VTVGELWAFITGWNLILSYIIGTSSVARAWSATFDELIGRPIGEFSRTHMTLNAPGVL AENPDIFAVIIILILTGLLTLGVKESAMVNKIFTCINVLVLGFIMVSGFVKGSVKNWQ LTEEDFGNTSGRLCLNNDTKEGKPGVGGFMPFGFSGVLSGAATCFYAFVGFDCIATTG EEVKNPQKAIPVGIVASLLICFIAYFGVSAALTLMMPYFCLDNNSPLPDAFKHVGWEG AKYAVAVGSLCALSASLLGSMFPMPRVIYAMAEDGLLFKFLANVNDRTKTPIIATLAS GAVAAVMAFLFDLKDLVDLMSIGTLLAYSLVAACVLVLRYQPEQPNLVYQMASTSDEL DPADQNELASTNDSQLGFLPEAEMFSLKTILSPKNMEPSKISGLIVNISTSLIAVLII TFCIVTVLGREALTKGALWAVFLLAGSALLCAVVTGVIWRQPESKTKLSFKVPFLPVL PILSIFVNVYLMMQLDQGTWVRFAVWMLIGFIIYFGYGLWHSEEASLDADQARTPDGN LDQCK" BASE COUNT 428 a 623 c 594 g 516 t ORIGIN 1 gggcgatcct gccggagccc cgccgccgcc ggcttggatt ctgaaacctt ccttgtatcc 61 ctcctgagac atctttgctg caagatcgag gctgtcctct ggtgagaagg tggtgaggct 121 tcccgtcata ttccagctct gaacagcaac atggggtgca aagtcctgct caacattggg 181 cagcagatgc tgcggcggaa ggtggtggac tgtagcccgg aggagacgcg gctgtctcgc 241 tgcctgaaca cttttgatct ggtggccctc ggggtgggca gcacactggg tgctggtgtc 301 tacgtcctgg ctggagctgt ggcccgtgag aatgcaggcc ctgccattgt catctccttc 361 ctgatcgctg cgctggcctc agtgctggct ggcctgtgct atggcgagtt tggtgctcgg 421 gtccccaaga cgggctcagc ttacctctac agctatgtca ccgttggaga gctctgggcc 481 ttcatcaccg gctggaactt aatcctctcc tacatcatcg gtacttcaag cgtagcgagg 541 gcctggagcg ccaccttcga cgagctgata ggcagaccca tcggggagtt ctcacggaca 601 cacatgactc tgaacgcccc cggcgtgctg gctgaaaacc ccgacatatt cgcagtgatc 661 ataattctca tcttgacagg acttttaact cttggtgtga aagagtcggc catggtcaac 721 aaaatattca cttgtattaa cgtcctggtc ctgggcttca taatggtgtc aggatttgtg 781 aaaggatcgg ttaaaaactg gcagctcacg gaggaggatt ttgggaacac atcaggccgt 841 ctctgtttga acaatgacac aaaagaaggg aagcccggtg ttggtggatt catgcccttc 901 gggttctctg gtgtcctgtc gggggcagcg acttgcttct atgccttcgt gggctttgac 961 tgcatcgcca ccacaggtga agaggtgaag aacccacaga aggccatccc cgtggggatc 1021 gtggcgtccc tcttgatctg cttcatcgcc tactttgggg tgtcggctgc cctcacgctc 1081 atgatgccct acttctgcct ggacaataac agccccctgc ccgacgcctt taagcacgtg 1141 ggctgggaag gtgccaagta cgcagtggcc gtgggctccc tctgtgctct ttccgccagt 1201 cttctaggtt ccatgtttcc catgcctcgg gttatctatg ccatggctga ggatggactg 1261 ctatttaaat tcttagccaa cgtcaatgat aggaccaaaa caccaataat cgccacatta 1321 gcctcgggtg ccgttgctgc tgtgatggcc ttcctctttg acctgaagga cttggtggac 1381 ctcatgtcca ttggcactct cctggcttac tcgttggtgg ctgcctgtgt gttggtctta 1441 cggtaccagc cagagcagcc taacctggta taccagatgg ccagtacttc cgacgagtta 1501 gatccagcag accaaaatga attggcaagc accaatgatt cccagctggg gtttttacca 1561 gaggcagaga tgttctcttt gaaaaccata ctctcaccca aaaacatgga gccttccaaa 1621 atctctgggc taattgtgaa catttcaacc agccttatag ctgttctcat catcaccttc 1681 tgcattgtga ccgtgcttgg aagggaggct ctcaccaaag gggcgctgtg ggcagtcttt 1741 ctgctcgcag ggtctgccct cctctgtgcc gtggtcacgg gcgtcatctg gaggcagccc 1801 gagagcaaga ccaagctctc atttaaggtt cccttcctgc cagtgctccc catcctgagc 1861 atcttcgtga acgtctatct catgatgcag ctggaccagg gcacctgggt ccggtttgct 1921 gtgtggatgc tgataggctt catcatctac tttggctatg gcctgtggca cagcgaggag 1981 gcgtccctgg atgccgacca agcaaggact cctgacggca acttggacca gtgcaagtga 2041 cgcacagccc cgccccccgg aggtggcagc agccccgagg gacgccccca gaggaccggg 2101 aggcacccca ccctccccac cagtgcaaca gaaaccacct gcgtccacac cctcactgca 2161 g // LOCUS HSREL3 558 bp RNA PRI 30-MAR-1995 DEFINITION Human mRNA for prepro-relaxin H2. ACCESSION X00948 NID g35926 KEYWORDS hormone; relaxin; signal peptide. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 558) AUTHORS Hudson,P., John,M., Crawford,R., Haralambidis,J., Scanlon,D., Gorman,J., Tregear,G., Shine,J. and Niall,H. TITLE Relaxin gene expression in human ovaries and the predicted structure of a human preprorelaxin by analysis of cDNA clones JOURNAL EMBO J. 3 (10), 2333-2339 (1984) MEDLINE 85051298 COMMENT Data kindly reviewed (15-MAY-1985) by R. Crawford. FEATURES Location/Qualifiers source 1..558 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 1..75 /note="signal peptide" CDS 1..558 /note="prepro-relaxin H2" /codon_start=1 /db_xref="PID:g35927" /db_xref="SWISS-PROT:P04090" /translation="MPRLFFFHLLGVCLLLNQFSRAVADSWMEEVIKLCGRELVRAQI AICGMSTWSKRSLSQEDAPQTPRPVAEIVPSFINKDTETINMMSEFVANLPQELKLTL SEMQPALPQLQQHVPVLKDSSLLFEEFKKLIRNRQSEAADSSPSELKYLGLDTHSRKK RQLYSALANKCCHVGCTKRSLARFC" misc_feature 76..171 /note="B-chain" misc_feature 172..483 /note="C-peptide" misc_feature 484..555 /note="A-chain" BASE COUNT 169 a 122 c 116 g 151 t ORIGIN 1 atgcctcgcc tgtttttttt ccacctgcta ggagtctgtt tactactgaa ccaattttcc 61 agagcagtcg cggactcatg gatggaggaa gttattaaat tatgcggccg cgaattagtt 121 cgcgcgcaga ttgccatttg cggcatgagc acctggagca aaaggtctct gagccaggaa 181 gatgctcctc agacacctag accagtggca gaaattgtgc catccttcat caacaaagat 241 acagaaacca taaatatgat gtcagaattt gttgctaatt tgccacagga gctgaagtta 301 accctgtctg agatgcagcc agcattacca cagctacaac aacatgtacc tgtattaaaa 361 gattccagtc ttctctttga agaatttaag aaacttattc gcaatagaca aagtgaagcc 421 gcagacagca gtccttcaga attaaaatac ttaggcttgg atactcattc tcgaaaaaag 481 agacaactct acagtgcatt ggctaataaa tgttgccatg ttggttgtac caaaagatct 541 cttgctagat tttgctga // LOCUS HSRER1 646 bp RNA PRI 08-SEP-1997 DEFINITION Homo sapiens mRNA for Rer1 protein. ACCESSION AJ001421 NID g2385368 KEYWORDS Rer1 protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 646) AUTHORS Fuellekrug,J. TITLE Direct Submission JOURNAL Submitted (05-SEP-1997) Fuellekrug J., Cell Biology, European Molecular Biology Laboratory, Meyerhofstr. 1, D-69117 Heidelberg, GERMANY REFERENCE 2 (bases 1 to 646) AUTHORS Fullekrug,J., Boehm,J., Rottger,S., Nilsson,T., Mieskes,G. and Schmitt,H.D. TITLE Human Rer1 is localized to the Golgi apparatus and complements the deletion of the homologous Rer1 protein of Saccharomyces cerevisiae JOURNAL Eur. J. Cell Biol. 74 (1), 31-40 (1997) MEDLINE 97454982 FEATURES Location/Qualifiers source 1..646 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 1..591 /codon_start=1 /product="Rer1 protein" /db_xref="PID:e339854" /db_xref="PID:g2385369" /translation="MSEGDSVGESVHGKPSVVYRFFTRLGQIYQSWLDKSTPYTAVRW VVTLGLSFVYMIRVYLLQGWYIVTYALGIYHLNLFIAFLSPKVDPSLMEDSDDGPSLP TKQNEEFRPFIRRLPEFKFWHAATKGILVAMVCTFFDAFNVPVFWPILVMYFIMLFCI TMKRQIKHMIKYRYIPFTHGKRRYRGKEDAGKAFAS" BASE COUNT 152 a 156 c 164 g 174 t ORIGIN 1 atgtctgaag gggacagtgt gggagaatcc gtccatggga aaccttcggt ggtgtacaga 61 tttttcacaa gacttggaca gatttatcag tcctggctag acaagtccac accctacacg 121 gctgtgcgat gggtcgtgac actgggcctg agctttgtct acatgattcg agtttacctg 181 ctgcagggtt ggtacattgt gacctatgcc ttggggatct accatctaaa tcttttcata 241 gcttttcttt ctcccaaagt ggatccttcc ttaatggaag actcagatga cggtccttcg 301 ctacccacca aacagaacga ggaattccgc cccttcattc gaaggctccc agagtttaaa 361 ttttggcatg cggctaccaa gggcatcctt gtggctatgg tctgtacttt cttcgacgct 421 ttcaacgtcc cggtgttctg gccgattctg gtgatgtact tcatcatgct cttctgtatc 481 acgatgaaga ggcaaatcaa gcacatgatt aagtaccggt acatcccgtt cacacatggg 541 aagagaaggt acagaggcaa ggaggatgcc ggcaaggcct tcgccagcta gaagcgggac 601 tgaggctgcc tcacgtgttg caagaacagt tttgagccat tgttaa // LOCUS HSRESTIN 5857 bp RNA PRI 30-JUN-1993 DEFINITION H.sapiens mRNA for restin. ACCESSION X64838 S38010 NID g35998 KEYWORDS cytoplasmic protein; intermediate filament associated protein; restin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5857) AUTHORS Bilbe,G. TITLE Direct Submission JOURNAL Submitted (09-MAR-1992) G. Bilbe, Biotechnology Department, Pharma Research, Ciba-Geigy AG, K-681-4-42, CH-4002 Basel, SWITZERLAND REFERENCE 2 (bases 1 to 5857) AUTHORS Bilbe,G., Delabie,J., Bruggen,J., Richener,H., Asselbergs,F.A., Cerletti,N., Sorg,C., Odink,K., Tarcsay,L., Wiesendanger,W. et,al. TITLE Restin: a novel intermediate filament-associated protein highly expressed in the Reed-Sternberg cells of Hodgkin's disease JOURNAL EMBO J. 11 (6), 2103-2113 (1992) MEDLINE 92289675 FEATURES Location/Qualifiers source 1..5857 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="U937 and peripheral blood mononuclear leukocytes" /clone_lib="lambda-gt11" CDS 133..4416 /codon_start=1 /product="restin" /db_xref="PID:g35999" /db_xref="SWISS-PROT:P30622" /translation="MSMLKPSGLKAPTKILKPGSTALKTPTAVVAPVEKTISSEKASS TPSSETQEEFVDDFRVGERVWVNGNKPGFIQFLGETQFAPGQWAGIVLDEPIGKNDGS VAGVRYFQCEPLKGIFTRPSKLTRKVQAEDEANGLQTTPASRATSPLCTSTASMVSSS PSTPSNIPQKPSQPAAKEPSATPPISNLTKTASESISNLSEAGSIKKGERELKIGDRV LVGGTKAGVVRFLGETDFAKGEWCGVELDEPLGKNDGAVAGTRYFQCQPKYGLFAPVH KVTKIGFPSTTPAKAKANAVRRVMATTSASLKRSPSASSLSSMSSVASSVSSRPSRTG LLTETSSRYARKISGTTALQEALKEKQQHIEQLLAERDLERAEVAKATSHVGEIEQEL ALARDGHDQHVLELEAKMDQLRTMVEAADREKVELLNQLEEEKRKVEDLQFRVEEESI TKGDLETQTKLEHARIKELEQSLLFEKTKADKLQRELEDTRVATVSEKSRIMELEKDL ALRVQEVAELRRRLESNKPAGDVDMSLSLLQEISSLQEKLEVTRTDHQREITSLKEHF GAREETHQKEIKALYTATEKLSKENESLKSKLEHANKENSDVIALWKSKLETAIASHQ QAMEELKVSFSKGLGTETAEFAELKTQIEKMRLDYQHEIENLQNQQDSERAAHAKEME ALRAKLMKVIKEKENSLEAIRSKLDKAEDQHLVEMEDTLNKLQEAEIKVKELEVLQAK CNEQTKVIDNFTSQLKATEEKLLDLDALRKASSEGKSEMKKLRQQLEAAEKQIKHLEI EKNAESSKASSITRELQGRELKLTNLQENLSEVSQVKETLEKELQILKEKFAEASEEA VSVQRSMQETVNKLHQKEEQFNMLSSDLEKLRENLADMEAKFREKDEREEQLIKAKEK LENDIAEIMKMSGDNSSQLTKMNDELRLKERDVEELQLKLTKANENASFLQKSIEDMT VKAEQSQQEAAKKHEEEKKELERKLSDLEKKMETSHNQCQELKARYERATSETKTKHE EILQNLQKTLLDTEDKLKGAREENSGLLQELEELRKQADKAKAAQTAEDAMQIMEQMT KEKTETLASLEDTKQTNAKLQNELDTLKENNLKNVEELNKSKELLTVENQKMEEFRKE IETLKQAAAQKSQQLSALQEENVKLAEELGRSRDEVTSHQKLEEERSVLNNQLLEMKK RESKFIKDADEEKASLQKSISITSALLTEKDAELEKLRNEVTVLRGENASAKSLHSVV QTLESDKVKLELKVKNLELQLKENKRQLSSSSGNTDTQADEDERAQESQIDFLNSVIV DLQRKNQDLKMKVEMMSEAALNGNGDDLNNYDSDDQEKQSKKKPRLFCDICDCFDLHD TEDCPTQAQMSEDPPHSTHHGSRGEERPYCEICEMFGHWATNCNDDETF" polyA_signal 4464..4470 polyA_signal 5707..5713 BASE COUNT 1961 a 1172 c 1371 g 1353 t ORIGIN 1 cggcgcaggc ggcggcgtcc gaggagattt aatccagaga ctgacttcac tatagaaccc 61 acagttgtat caatggttgg ggaaagatag tggcaacagg caaaggagaa acagctctga 121 catacaaaga aaatgagtat gctaaagcca agtgggctta aggcccccac caagatcctg 181 aagcctggaa gcacagctct gaagacacct acggctgttg tagctccagt agaaaaaacc 241 atatccagtg aaaaagcatc aagcactcca tcatctgaga ctcaggagga atttgtggat 301 gactttcgag ttggggagcg agtttgggtg aatggaaata agcctggatt tatccagttt 361 cttggagaaa cccagtttgc accaggccag tgggctggaa ttgttttaga tgaacccata 421 ggcaagaacg atggttcggt ggcaggagtt cggtatttcc agtgtgaacc tttaaagggc 481 atatttaccc gaccttcaaa gttaacaagg aaggtgcaag cagaagatga agctaatggc 541 ctgcagacaa cgcccgcctc ccgagctact tcaccgctgt gcacttctac ggccagcatg 601 gtgtcttcct ccccctccac cccttcaaac atccctcaga aaccatcaca gccagcagca 661 aaggaacctt cagctacgcc tccgatcagc aaccttacaa aaactgccag tgaatctatc 721 tccaaccttt cagaggctgg ctcaatcaag aaaggagaaa gagagctcaa aatcggagac 781 agagtattgg ttggtggcac taaggctggt gtagtccggt ttcttgggga gaccgacttt 841 gccaaggggg agtggtgtgg cgtggagtta gatgagccac ttgggaagaa tgatggcgct 901 gttgctggaa caaggtattt tcagtgtcaa cccaaatatg gcttgttcgc tcctgtccac 961 aaagttacca agattggctt cccttccact acaccagcca aagccaaggc caacgcagtg 1021 aggcgagtga tggcgaccac gtccgccagc ctgaagcgca gcccttctgc ctcttccctc 1081 agctccatga gctcagtggc ctcctctgtg agcagcaggc ccagtcggac aggactattg 1141 actgaaacct cctcccgtta cgccaggaag atctccggta ccactgccct ccaggaggcc 1201 ctgaaggaga agcagcagca cattgagcag ctgctggcgg aacgggatct ggagagggcg 1261 gaggtggcca aggccacgag ccacgtgggg gagatagagc aggagctagc tctggcccgg 1321 gacggacatg accagcatgt cctggaattg gaagccaaaa tggaccagct gcgaacaatg 1381 gtggaagctg ctgacaggga gaaggtggag cttctcaacc agcttgaaga ggagaaaagg 1441 aaggttgagg accttcagtt ccgggttgaa gaagaatcaa ttaccaaagg tgatcttgag 1501 acgcagacca aactggagca tgcccgcatt aaggagcttg aacagagcct gctctttgaa 1561 aagaccaaag ctgacaaact ccagagggag ttagaagaca ctagggtggc tacagtttca 1621 gaaaagtcac gtataatgga actggagaaa gacctagcat tgagagtaca ggaagtagct 1681 gagctccgaa gaaggctaga gtccaataag cctgctgggg atgtggacat gtcactttcc 1741 cttttgcaag agataagctc tttgcaagaa aagttagaag tcacccgtac tgaccaccag 1801 agagaaataa cttctctgaa ggagcatttt ggagcccggg aagaaactca tcagaaggag 1861 ataaaggctc tgtataccgc cacggaaaag ctttccaaag agaacgagtc attgaaaagc 1921 aagctggagc atgccaacaa agagaactca gatgtgatag ctctatggaa gtccaaactg 1981 gagactgcca tcgcatccca ccagcaggcg atggaagaac tgaaggtatc tttcagcaaa 2041 gggcttggaa cagagacggc agaatttgct gaactaaaaa cacaaataga gaaaatgaga 2101 ctagattacc aacacgaaat agaaaatttg cagaatcaac aagactctga acgggctgcc 2161 catgctaaag agatggaagc cttgagggct aaactgatga aagttattaa agaaaaggaa 2221 aacagtctgg aagccatcag gtcgaaactg gacaaagcag aagaccagca tctcgtagaa 2281 atggaagaca cgttaaacaa attacaggaa gctgaaataa aggtaaagga gctagaggta 2341 ctgcaagcca aatgcaatga acaaaccaag gttattgata attttacatc acagctcaag 2401 gctactgaag aaaagctctt ggatcttgat gcacttcgga aagccagttc cgaaggtaaa 2461 tcggaaatga agaaacttag acagcagctt gaggcagctg agaaacagat taaacattta 2521 gagattgaaa agaatgctga aagtagcaag gctagtagca ttaccagaga gctccagggg 2581 agagagctaa agcttactaa ccttcaggaa aatttgagtg aagtcagtca agtgaaagag 2641 actttggaaa aagaacttca gattttgaaa gaaaagtttg ctgaagcttc agaggaggca 2701 gtctctgttc agagaagtat gcaagaaact gtaaataagt tacaccaaaa ggaggaacag 2761 tttaacatgc tgtcttctga cttggagaag ctgagagaaa acttagcaga tatggaggca 2821 aaatttagag agaaagatga gagagaagag cagctgataa aggcaaagga aaaactggaa 2881 aatgacattg cagaaataat gaagatgtca ggagataact cttctcagct gacaaaaatg 2941 aacgatgaat tacgtctgaa agaaagagat gtagaagaat tacagctaaa acttacaaag 3001 gctaatgaaa atgcaagttt tctgcaaaaa agtattgagg acatgactgt caaagctgaa 3061 cagagccagc aagaagcagc taaaaagcat gaggaagaaa agaaagaatt ggagaggaaa 3121 ttgtcggacc tggaaaagaa aatggaaaca agccacaacc agtgtcagga gctgaaagcc 3181 aggtatgaga gagccacttc tgagacaaaa accaagcatg aagaaatcct acagaacctc 3241 cagaagacgc tgctggacac agaggacaag ctgaagggcg cacgggagga gaacagtggc 3301 ttgctgcagg agctggagga gctgagaaag caagccgaca aagccaaagc tgctcaaaca 3361 gcggaagatg ccatgcagat aatggaacag atgaccaaag agaagactga gactctggcc 3421 tccttggagg acaccaagca aacaaatgca aaactacaga atgaattgga cacacttaaa 3481 gaaaacaact tgaaaaatgt ggaagagctg aacaaatcaa aagaactcct gactgtagag 3541 aatcaaaaaa tggaagaatt taggaaagaa atagaaaccc taaagcaggc agcagctcag 3601 aagtcccagc agctttcagc gttgcaagaa gagaacgtta aacttgctga ggagctgggg 3661 agaagcaggg acgaagtcac aagtcatcaa aagctggaag aagaaagatc tgtgctcaat 3721 aatcagttgt tagaaatgaa aaaaagagaa tccaagttca taaaagacgc agatgaagag 3781 aaagcttcct tgcagaaatc catcagtata actagtgcct tactcacaga aaaggatgcc 3841 gagctggaga aactgagaaa tgaggtcaca gtgctcaggg gagaaaacgc ctctgccaag 3901 tccttgcatt cagttgttca gactctagag tctgataagg tgaagctcga gctcaaggta 3961 aagaacttgg agcttcaact caaagaaaac aagaggcagc tcagcagctc ctcaggtaat 4021 acagacactc aggcagacga ggatgaaaga gcccaggaga gtcagattga tttcctaaat 4081 tcagtaatag tggaccttca aaggaagaat caagacctca agatgaaggt ggagatgatg 4141 tcagaagcag ccctgaatgg gaacggggat gacctaaaca attatgacag tgatgatcag 4201 gagaaacagt ccaagaagaa acctcgcctc ttctgtgaca tttgtgactg ctttgatctc 4261 cacgacacag aggattgtcc tacccaggca cagatgtcag aggaccctcc ccattccaca 4321 caccatggca gtcggggtga ggaacgccca tactgtgaaa tctgtgagat gtttggacac 4381 tgggccacca actgcaatga cgacgaaacc ttctgatgaa gcctccagtg gagaactggg 4441 cttgctcaga cgcactcgca ttgacacaac gtaacaccag cattgtgtgt gcagacttca 4501 ggagaactca tgttattttt taaccccgtc aacaaatcta ggaaaatatt ttgatcttca 4561 acaaattgcc ctttagtctc cccgtatgag ttagaataat aaatatttag taggtgagtt 4621 ttcacctcga attttgtttt cttgattttt acgtttgaag acattgcacc agatgccatt 4681 acatttattg gccccccgac cttgtagaaa aacccctacc ctcacaatac cttatttaag 4741 taactttaaa ttatgccgtt acttttcata tttgcaccta agatatttcc aggctgcatt 4801 tgtatattta gattttttgg ttaagctttg acactggaat gagttgaaaa aatgtgccat 4861 tttgcatttt catctactca tttaaagtat tttattctta ttcaaagaaa tatctgagct 4921 ctttgcacta cctgttatca gtagtgcctt tacttcaggc ttgataatac ttaggtgtga 4981 ttataaaatc atgaagcagg taaagggagg ggcaagcccc aaactgctgt ggggacattt 5041 tataatctat atgctgcacc cacttaatct actgtggtgt tttgtttatt agttttgcat 5101 aatttcagct tctatatatt gtatgtatat attttttaaa aatctatatt ttgggaaaaa 5161 aacatacaca atgtgtcttt ctttttggac atttaccttt ttgaaaaaga aaacacttaa 5221 aatgatcatt aggacataac agactaggcc agacatagca tcttgtggct ttgcaaccat 5281 tttcatttgt ttgttttcct tttatttctt caccagattt aaataaaagg aggaattttc 5341 tccaattttt ttttccttct ctggcaggta tccccagcag tcaattaaca ataagccagt 5401 ataaaacacc taaataacca atctacaatc tcccttcaca agttttttta ctgtttttag 5461 atgaatgtac gatgagaaat tcaacgttaa taattctgga ttttcttatc acaaaaaaga 5521 aaatgaagga cctcaaagca cctgaacagt ttatcgacca gtttgaatct atttatcttc 5581 atttgaatgt cttctagata tgtaaaaagt cataaaatgt atcttccatg ctacatgtac 5641 aataagaact tctataattg tatatatgcc tttgatgtat tttcccctca agattatcaa 5701 ctgtgtgttc gacagtgaat attcaatctg gtaccagttg aaatttttgg ttataaatgt 5761 aatacgaatt gtttcacaaa cagaaaacat gtaaagcagt attaaaattt ggccaaacaa 5821 gtgttctgta tctactttta ataaatggtt attcttt // LOCUS HSRETPON 4508 bp RNA PRI 12-SEP-1993 DEFINITION Human ret proto-oncogene mRNA for tyrosine kinase. ACCESSION X12949 M57464 NID g38274 KEYWORDS glycoprotein; oncogene; receptor; ret proto-oncogene; transforming capacity; transmembrane protein; tyrosine kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4508) AUTHORS Takahashi,M. TITLE Direct Submission JOURNAL Submitted (20-SEP-1988) Takahashi M., Aichi Cancer Center Research Institute, Lab. of Exp. Pathology, Chikusa-ku, Nagoya 464, Japan REFERENCE 2 (bases 1 to 4508) AUTHORS Takahashi,M., Buma,Y., Iwamoto,T., Inaguma,Y., Ikeda,H. and Hiai,H. TITLE Cloning and expression of the ret proto-oncogene encoding a tyrosine kinase with two potential transmembrane domains JOURNAL Oncogene 3 (5), 571-578 (1988) MEDLINE 90272230 COMMENT for overlapping sequence see M16029 The 5' terminal 112 nucleotides are thought to represent a cloning artefact, see x15262 for revised seq. FEATURES Location/Qualifiers source 1..4508 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="THP-1" CDS 694..3276 /note="ret tyrosine kinase (AA 1 - 860)" /codon_start=1 /db_xref="PID:g38275" /db_xref="SWISS-PROT:P07949" /translation="MVPFPVTVYDEDDSAPTFPAGVDTASAVVEFKRKEDTVVATLRV FDADVVPASGELVRRYTSTLLPGDTWAQQTFRVEHWPNETSVQANGSFVRATVHDYRL VLNRNLSISENRTMQLAVLVNDSDFQGPGAGVLLLHFNVSVLPVSLHLPSTYSLSVSR RARRFAQIGKVCVENCQAFSGINVQYKLHSSGANCSTLGVVTSAEDTSGILFVNDTKA LRRPKCAELHYMVVATDQQTSRQAQAQLLVTVEGSYVAEEAGCPLSCAVSKRRLECEE CGGLGSPTGRCEWRQGDGKGITRNFSTCSPSTKTCPDGHCDVVETQDINICPQDCLRG SIVGGHEPGEPRGIKAGYGTCNCFPEEEKCFCEPEDIQDPLCDELCRTVIAAAVLFSF IVSVLLSAFCIHCYHKFAHKPPISSAEMTFRRPAQAFPVSYSSSGARRPSLDSMENQV SVDAFKILEDPKWEFPRKNLVLGKTLGEGEFGKVVKATAFHLKGRAGYTTVAVKMLKE NASPSELRDLLSEFNVLKQVNHPHVIKLYGACSQDGPLLLIVEYAKYGSLRGFLRESR KVGPGYLGSGGSRNSSSLDHPDERALTMGDLISFAWQISQGMQYLAEMKLVHRDLAAR NILVAEGRKMKISDFGLSRDVYEEDSYVKRSQGRIPVKWMAIESLFDHIYTTQSDVWS FGVLLWEIVTLGGNPYPGIPPERLFNLLKTGHRMERPDNCSEEMYRLMLQCWKQEPDK RPVFADISKDLEKMMVKRRDYLDLAASTPSDSLIYDDGLSEEETPLVDCNNAPLPRAL PSTWIENKLYGMSDPNWPGESPVPLTRADGTNTGFPRYPNDSVYANWMLSPSAAKLMD TFDS" misc_feature 1087..1166 /note="transmembrane domain" misc_feature 1837..1902 /note="transmembrane domain" misc_feature 2107..2928 /note="tyrosine kinase domain" BASE COUNT 983 a 1274 c 1265 g 986 t ORIGIN 1 cagatccagt tgttctcatg cagccgtgtg cggtacgtgc cgtagagatg ctggcccagg 61 cggaagctgg gcacctcctt actgtacttc cataccctat ggaagtacag taaggaggtg 121 cccagcttcc gcctgggcca gcatctctac ggcacgtacc gcacacggct gcatgagaac 181 aactggatct gcatccagga ggacaccggc ctcctctacc ttaaccggag cctggaccat 241 agctcctggg agaagctcag tgtccgcaac cgcggctttc ccctgctcac cgtctacctc 301 aaggtcttcc tgtcacccac atcccttcgt gagggcgagt gccagtggcc aggctgtgcc 361 cgcgtatact tctccttctt caacacctcc tttccagcct gcagctccct caagccccgg 421 gagctctgct tcccagagac aaggccctcc ttccgcattc gggagaaccg acccccaggc 481 accttccacc agttccgcct gctgcctgtg cagttcttgt gccccaacat cagcgtggcc 541 tacaggctcc tggagggtga gggtctgccc ttccgctgcg ccccggacag cctggaggtg 601 agcacgcgct gggccctgga ccgcgagcag cgggagaagt acgagctggt ggccgtgtgc 661 accgtgcacg ccggcgcgcg cgaggaggtg gtgatggtgc ccttcccggt gaccgtgtac 721 gacgaggacg actcggcgcc caccttcccc gcgggcgtcg acaccgccag cgccgtggtg 781 gagttcaagc ggaaggagga caccgtggtg gccacgctgc gtgtcttcga tgcagacgtg 841 gtacctgcat caggggagct ggtgaggcgg tacacaagca cgctgctccc cggggacacc 901 tgggcccagc agaccttccg ggtggaacac tggcccaacg agacctcggt ccaggccaac 961 ggcagcttcg tgcgggcgac cgtacatgac tataggctgg ttctcaaccg gaacctctcc 1021 atctcggaga accgcaccat gcagctggcg gtgctggtca atgactcaga cttccagggc 1081 ccaggagcgg gcgtcctctt gctccacttc aacgtgtcgg tgctgccggt cagcctgcac 1141 ctgcccagta cctactccct ctccgtgagc aggagggctc gccgatttgc ccagatcggg 1201 aaagtctgtg tggaaaactg ccaggcgttc agtggcatca acgtccagta caagctgcat 1261 tcctctggtg ccaactgcag cacgctaggg gtggtcacct cagccgagga cacctcgggg 1321 atcctgtttg tgaatgacac caaggccctg cggcggccca agtgtgccga acttcactac 1381 atggtggtgg ccaccgacca gcagacctct aggcaggccc aggcccagct gcttgtaaca 1441 gtggaggggt catatgtggc cgaggaggcg ggctgccccc tgtcctgtgc agtcagcaag 1501 agacggctgg agtgtgagga gtgtggcggc ctgggctccc caacaggcag gtgtgagtgg 1561 aggcaaggag atggcaaagg gatcaccagg aacttctcca cctgctctcc cagcaccaag 1621 acctgccccg acggccactg cgatgttgtg gagacccaag acatcaacat ttgccctcag 1681 gactgcctcc ggggcagcat tgttggggga cacgagcctg gggagccccg ggggattaaa 1741 gctggctatg gcacctgcaa ctgcttccct gaggaggaga agtgcttctg cgagcccgaa 1801 gacatccagg atccactgtg cgacgagctg tgccgcacgg tgatcgcagc cgctgtcctc 1861 ttctccttca tcgtctcggt gctgctgtct gccttctgca tccactgcta ccacaagttt 1921 gcccacaagc cacccatctc ctcagctgag atgaccttcc ggaggcccgc ccaggccttc 1981 ccggtcagct actcctcttc cggtgcccgc cggccctcgc tggactccat ggagaaccag 2041 gtctccgtgg atgccttcaa gatcctggag gatccaaagt gggaattccc tcggaagaac 2101 ttggttcttg gaaaaactct aggagaaggc gaatttggaa aagtggtcaa ggcaacggcc 2161 ttccatctga aaggcagagc agggtacacc acggtggccg tgaagatgct gaaagagaac 2221 gcctccccga gtgagcttcg agacctgctg tcagagttca acgtcctgaa gcaggtcaac 2281 cacccacatg tcatcaaatt gtatggggcc tgcagccagg atggcccgct cctcctcatc 2341 gtggagtacg ccaaatacgg ctccctgcgg ggcttcctcc gcgagagccg caaagtgggg 2401 cctggctacc tgggcagtgg aggcagccgc aactccagct ccctggacca cccggatgag 2461 cgggccctca ccatgggcga cctcatctca tttgcctggc agatctcaca ggggatgcag 2521 tatctggccg agatgaagct cgttcatcgg gacttggcag ccagaaacat cctggtagct 2581 gaggggcgga agatgaagat ttcggatttc ggcttgtccc gagatgttta tgaagaggat 2641 tcctacgtga agaggagcca gggtcggatt ccagttaaat ggatggcaat tgaatccctt 2701 tttgatcata tctacaccac gcaaagtgat gtatggtctt ttggtgtcct gctgtgggag 2761 atcgtgaccc tagggggaaa cccctatcct gggattcctc ctgagcggct cttcaacctt 2821 ctgaagaccg gccaccggat ggagaggcca gacaactgca gcgaggagat gtaccgcctg 2881 atgctgcaat gctggaagca ggagccggac aaaaggccgg tgtttgcgga catcagcaaa 2941 gacctggaga agatgatggt taagaggaga gactacttgg accttgcggc gtccactcca 3001 tctgactccc tgatttatga cgacggcctc tcagaggagg agacaccgct ggtggactgt 3061 aataatgccc ccctccctcg agccctccct tccacatgga ttgaaaacaa actctatggc 3121 atgtcagacc cgaactggcc tggagagagt cctgtaccac tcacgagagc tgatggcact 3181 aacactgggt ttccaagata tccaaatgat agtgtatatg ctaactggat gctttcaccc 3241 tcagcggcaa aattaatgga cacgtttgat agttaacatt tctttgtgaa aggtaatgga 3301 ctcacaaggg gaagaaacat gctgagaatg gaaagtctac cggccctttc tttgtgaacg 3361 tcacattggc cgagccgtgt tcagttccca ggtggcagac tcgtttttgg tagtttgttt 3421 taacttccaa ggtggtttta cttctgatag ccggtgattt tccctcctag cagacatgcc 3481 acaccgggta agagctctga gtcttagtgg ttaagcattc ctttctcttc agtgcccagc 3541 agcacccagt gttggtctgt gtccatcagt gaccaccaac attctgtgtt cacatgtgtg 3601 ggtccaacac ttactacctg gtgtatgaaa ttggacctga actgttggat ttttctagtt 3661 gccgccaaac aaggcaaaaa aatttaaaca tgaagcacac acacaaaaaa ggcagtagga 3721 aaaatgctgg ccctgatgac ctgtccttat tcagaatgag agactgcggg gggggcctgg 3781 gggtagtgtc aatgcccctc cagggctgga ggggaagagg ggccccgagg atgggcctgg 3841 gctcagcatt cgagatcttg agaatgattt ttttttaatc atgcaacctt tccttaggaa 3901 gacatttggt tttcatcatg attaagatga ttcctagatt tagcacaatg gagagattcc 3961 atgccatctt tactatgtgg atggtggtat cagggaagag ggctcacaag acacatttgt 4021 cccccgggcc caccacatca tcctcacgtg ttcggtactg agcagccact acccctgatg 4081 agaacagtat gaagaaaggg ggctgttgga gtcccagaat tgctgacagc agaggctttg 4141 ctgctgtgaa tcccacctgc caccagcctg cagcacaccc cacagccaag tagaggcgaa 4201 agcagtggct catcctacct gttaggagca ggtagggctt gtactcactt taatttgaat 4261 cttatcaact tactcataaa gggacaggct agctagctgt gttagaagta gcaatgacaa 4321 tgaccaagga ctgctacacc tctgattaca attctgatgt gaaaaagatg gtgtttggct 4381 cttatagagc ctgtgtgaaa ggcccatgga tcagctcttc ctgtgtttgt aatttaatgc 4441 tgctacaagg tgtttctgtt tcttagattc tgaccatgac tcataagctt cttgtcattc 4501 ttcattgc // LOCUS HSRETSA 1582 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for retinal S-antigen (48 KDa protein). ACCESSION X12453 NID g36005 KEYWORDS antigen; GTP-binding protein; S-antigen; unidentified reading frame. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1582) AUTHORS Yamaki,K., Tsuda,M. and Shinohara,T. TITLE The sequence of human retinal S-antigen reveals similarities with alpha-transducin JOURNAL FEBS Lett. 234 (1), 39-43 (1988) MEDLINE 88271621 REMARK Erratum:[FEBS Lett 1988 Aug 29;236(2):507]] REFERENCE 2 (bases 1 to 1582) AUTHORS Yamaki,K., Tsuda,M. and Shinohara,T. TITLE Errata: FEBS Letters 234:39-43(1988) JOURNAL FEBS Lett. 236, 507-507 (1988) FEATURES Location/Qualifiers source 1..1582 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="retina." mRNA <1..1582 /note="retinal S-antigen mRNA" CDS 15..200 /note="pot. ORF (AA 1 - 61)" /codon_start=1 /db_xref="PID:g36006" /translation="MPPTNSPHHVGQEPQRCPFRLIWQDGTSLLRTGAGYSSSQSIET LSLPPGPSHLVGDNHKV" CDS 222..1439 /note="retinal S-antigen (AA 1 - 405)" /codon_start=1 /db_xref="PID:g36007" /db_xref="SWISS-PROT:P10523" /translation="MAASGKTSKSEPNHVIFKKISRDKSVTIYLGNRDYIDHVSQVQP VDGVVLVDPDLVKGKKVYVTLTCAFRYGQEDVDVIGLTFRRDLYFSRVQVYPPVGAAS TPTKLQESLLKKLGSNTYPFLLTFPDYLPCSVMLQPAPQDSGKSCGVDFEVKAFATDS TDAEEDKIPKKSSVRYLIRSVQHAPLEMGPQPRAEATWQFFMSDKPLHLAVSLNREIY FHGEPIPVTVTVTNNTEKTVKKIKACVEQVANVVLYSSDYYVKPVAMEEAQEKVPPNS TLTKTLTLLPLLANNRERRGIALDGKIKHEDTNLASSTIIKEGIDRTVLGILVSYQIK VKLTVSGFLGELTSSEVATEVPFRLMHPQPEDPAKESIQDANLVFEEFARHNLKDAGE AEEGKRDKNDADE" polyA_site 1582 /note="polyA site" BASE COUNT 405 a 421 c 421 g 335 t ORIGIN 1 gggatctagc gaggatgccc cctacaaatt ccccacatca cgtaggccag gagcctcagc 61 gctgcccctt caggctcatc tggcaagacg gtaccagctt gctcagaaca ggggctggct 121 attcatcatc tcagagcata gagaccctct ccttgccacc cggcccttcc cacctggttg 181 gtgacaatca caaggtgtag aagttgccag ggacagataa catggcagcc agcgggaaga 241 ccagcaagtc cgaaccgaac catgttatct tcaagaagat ctcccgggac aaatcggtga 301 ccatctacct ggggaacaga gactacatag accatgtcag ccaagtccag cctgtggatg 361 gtgtcgtgtt ggttgatcct gatcttgtga agggaaagaa agtgtatgtc actctgacct 421 gcgccttccg ctatggccaa gaggacgttg acgtgatcgg cttgaccttc cgcagggacc 481 tgtacttctc ccgggtccag gtgtatcctc ctgtgggggc cgcgagcacc cccacaaaac 541 tgcaagagag cctgcttaaa aagctgggga gcaacacgta cccctttctc ctgacgtttc 601 ctgactactt gccctgttca gtgatgttgc agccagctcc acaagattca gggaagtcct 661 gtggggttga ctttgaggtc aaagcattcg ccacagacag caccgatgcc gaagaggaca 721 aaatccccaa gaagagctcc gtgcgatatc tgatccgtag tgtacagcat gccccacttg 781 agatgggtcc ccagccccga gctgaggcga cctggcagtt cttcatgtct gacaagcccc 841 tgcaccttgc ggtctctctc aacagagaga tctatttcca tggggagccc atccctgtga 901 ccgtgactgt caccaataac acagagaaga ccgtgaagaa gattaaagca tgcgtggaac 961 aggtggccaa tgtggttctc tactcgagtg attattacgt caagcccgtg gctatggagg 1021 aagcgcaaga aaaagtgcca ccaaacagca ctttgaccaa gacgttgacg ctgctgccct 1081 tgctggctaa caatcgagaa aggagaggca ttgccctgga tgggaaaatc aagcacgagg 1141 acacaaacct tgcctccagc accatcatta aggagggcat agaccggacc gtcctgggaa 1201 tcctggtgtc ttaccagatc aaggtgaagc tcacagtgtc aggctttctg ggagagctca 1261 cctccagtga agtcgccact gaggtcccat tccgcctcat gcaccctcag cctgaggacc 1321 cagctaagga aagtattcag gatgcaaatt tagtttttga ggagtttgct cgccataatc 1381 tgaaagatgc aggagaagct gaggagggga agagagacaa gaatgacgct gatgagtgaa 1441 gatgtcggct caggatgccg gaaaatgacc tgtagttacc agtgcaacga gcaaagcccc 1501 acagtttagt cctttggagt tatgctgcgt atgaaaggat gagtcttctt ccgagaaata 1561 aagcttgttt gttctcccct gg // LOCUS HSRETYK1 3841 bp RNA PRI 20-APR-1995 DEFINITION H.sapiens EDDR1 gene for receptor tyrosine kinase. ACCESSION Z29093 NID g732799 KEYWORDS receptor tyrosine kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3841) AUTHORS Laval,S., Butler,R., Shelling,A.N., Hanby,A.M., Poulsom,R. and Ganesan,T.S. TITLE Isolation and characterization of an epithelial-specific receptor tyrosine kinase from an ovarian cancer cell line JOURNAL Cell Growth Differ. 5 (11), 1173-1183 (1994) MEDLINE 95151638 REFERENCE 2 (bases 1 to 3841) AUTHORS Shelling,A.N., Butler,R., Jones,T., Laval,S., Boyle,J.M. and Ganesan,T.S. TITLE Localization of an epithelial-specific receptor kinase (EDDR1) to chromosome 6q16 JOURNAL Genomics 25 (2), 584-587 (1995) MEDLINE 95309932 REFERENCE 3 (bases 1 to 3841) AUTHORS Kedinger,C. TITLE Direct Submission JOURNAL Submitted (17-DEC-1993) Claude Kedinger, CNRS Laboratoire de genetique moleculaire-U184, INSERM, 11, rue Humann, Strasbourg, Alsace, 67085 cedex, FRANCE FEATURES Location/Qualifiers source 1..3841 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="RTK6" /dev_stage="Adult" /tissue_type="Ovary" /cell_type="Epithelial cell" /cell_line="SKOV-3" /clone_lib="pKS" /chromosome="6" sig_peptide 336..423 /note="putatitve signal peptide" gene 337..2967 /gene="EDDR1" CDS 337..2967 /gene="EDDR1" /standard_name="epithelial discoidin domain receptor" /codon_start=1 /product="receptor tyrosine kinase" /db_xref="PID:g732800" /translation="MGPEALSSLLLLLLVASGDADMKGHFDPAKCRYALGMQDRTIPD SDISASSSWSDSTAARHSRLESSDGDGAWCPAGSVFPKEEEYLQVDLQRLHLVALVGT QGRHAGGLGKEFSRSYRLRYSRDGRRWMGWKDRWGQEVISGNEDPEGVVLKDLGPPMV ARLVRFYPRADRVMSVCLRVELYGCLWRDGLLSYTAPVGQTMYLSEAVYLNDSTYDGH TVGGLQYGGLGQLADGVVGLDDFRKSQELRVWPGYDYVGWSNHSFSSGYVEMEFEFDR LRAFQAMQVHCNNMHTLGARLPGGVECRFRRGPAMAWEGEPMRHNLGGNLGDPRARAV SVPLGGRVARFLQCRFLFAGPWLLFSEISFISDVVNNSSPALGGTFPPAPWWPPGPPP TNFSSLELEPRGQQPVAKAEGSPTAILIGCLVAIILLLLLIIALMLWRLHWRRLLSKA ERRVLEEELTVHLSVPGDTILINNRPGPREPPPYQEPRPRGNPPHSAPCVPNGSAYSG DYMEPEKPGAPLLPPPPQNSVPHYAEADIVTLQGVTGGNTYAVPALPPGAVGDGPPRV DFPRSRLRFKEKLGEGQFGEVHLCEVDSPQDLVSLDFPLNVRKGHPLLVAVKILRPDA TKNARNDFLKEVKIMSRLKDPNIIRLLGVCVQDDPLCMITDYMENGDLNQFLSAHQLE DKAAEGAPGDGQAAQGPTISYPMLLHVAAQIASGMRYLATLNFVHRDLATRNCLVGEN FTIKIADFGMSRNLYAGDYYRVQGRAVLPIRWMAWECILMGKFTTASDVWAFGVTLWE VLMLCRAQPFGQLTDEQVIENAGEFFRDQGRQVYLSRPPACPQGLYELMLRCWSRESE QRPPFSQLHRFLAEDALNTV" polyA_signal 3807 polyA_site 3828 BASE COUNT 750 a 1123 c 1126 g 842 t ORIGIN 1 ggcttaggaa gtattaactg atctctgccc tagttctcat gtgttaaata tggatagtaa 61 tagtatctac cttatgaagt gactgtgaag ataaaattat ggattctgtt taagggttta 121 ggccagtgtc tggcacaggg gaagcattct aaaaatatag ctgatgctgt taaacaatga 181 ctgttgttgt tgttttactg ttattatccc caaagcggcc cattctgtct gttgctgtca 241 gctatgactc agtcccctga ttaacttacg caccacccat tttatcccct gcagagatgc 301 tgcccccacc cccttaggcc cgagggatca ggagctatgg gaccagaggc cctgtcatct 361 ttactgctgc tgctcttggt ggcaagtgga gatgctgaca tgaagggaca ttttgatcct 421 gccaagtgcc gctatgccct gggcatgcag gaccggacca tcccagacag tgacatctct 481 gcttccagct cctggtcaga ttccactgcc gcccgccaca gcaggttgga gagcagtgac 541 ggggatgggg cctggtgccc cgcagggtcg gtgtttccca aggaggagga gtacttgcag 601 gtggatctac aacgactgca cctggtggct ctggtgggca cccagggacg gcatgccggg 661 ggcctgggca aggagttctc ccggagctac cggctgcgtt actcccggga tggtcgccgc 721 tggatgggct ggaaggaccg ctggggtcag gaggtgatct caggcaatga ggaccctgag 781 ggagtggtgc tgaaggacct tgggcccccc atggttgccc gactggttcg cttctacccc 841 cgggctgacc gggtcatgag cgtctgtctg cgggtagagc tctatggctg cctctggagg 901 gatggactcc tgtcttacac cgcccctgtg gggcagacaa tgtatttatc tgaggccgtg 961 tacctcaacg actccaccta tgacggacat accgtgggcg gactgcagta tgggggtctg 1021 ggccagctgg cagatggtgt ggtggggctg gatgacttta ggaagagtca ggagctgcgg 1081 gtctggccag gctatgacta tgtgggatgg agcaaccaca gcttctccag tggctatgtg 1141 gagatggagt ttgagtttga ccggctgagg gccttccagg ctatgcaggt ccactgtaac 1201 aacatgcaca cgctgggagc ccgtctgcct ggcggggtgg aatgtcgctt ccggcgtggc 1261 cctgccatgg cctgggaggg ggagcccatg cgccacaacc tagggggcaa cctgggggac 1321 cccagagccc gggctgtctc agtgcccctt ggcggccgtg tggctcgctt tctgcagtgc 1381 cgcttcctct ttgcggggcc ctggttactc ttcagcgaaa tctccttcat ctctgatgtg 1441 gtgaacaatt cctctccggc actgggaggc accttcccgc cagccccctg gtggccgcct 1501 ggcccacctc ccaccaactt cagcagcttg gagctggagc ccagaggcca gcagcccgtg 1561 gccaaggccg aggggagccc gaccgccatc ctcatcggct gcctggtggc catcatcctg 1621 ctcctgctgc tcatcattgc cctcatgctc tggcggctgc actggcgcag gctcctcagc 1681 aaggctgaac ggagggtgtt ggaagaggag ctgacggttc acctctctgt ccctggggac 1741 actatcctca tcaacaaccg cccaggtcct agagagccac ccccgtacca ggagccccgg 1801 cctcgtggga atccgcccca ctccgctccc tgtgtcccca atggctctgc ctacagtggg 1861 gactatatgg agcctgagaa gccaggcgcc ccgcttctgc ccccacctcc ccagaacagc 1921 gtcccccatt atgccgaggc tgacattgtt accctgcagg gcgtcaccgg gggcaacacc 1981 tatgctgtgc ctgcactgcc cccaggggca gtcggggatg ggccccccag agtggatttc 2041 cctcgatctc gactccgctt caaggagaag cttggcgagg gccagtttgg ggaggtgcac 2101 ctgtgtgagg tcgacagccc tcaagatctg gttagtcttg atttccccct taatgtgcgt 2161 aagggacacc ctttgctggt agctgtcaag atcttacggc cagatgccac caagaatgcc 2221 aggaatgatt tcctgaaaga ggtgaagatc atgtcgaggc tcaaggaccc aaacatcatt 2281 cggctgctgg gcgtgtgtgt gcaggacgac cccctctgca tgattactga ctacatggag 2341 aacggcgacc tcaaccagtt cctcagtgcc caccagctgg aggacaaggc agccgagggg 2401 gcccctgggg acgggcaggc tgcgcagggg cccaccatca gctacccaat gctgctgcat 2461 gtggcagccc agatcgcctc cggcatgcgc tatctggcca cactcaactt tgtacatcgg 2521 gacctggcca cgcggaactg cctagttggg gaaaatttca ccatcaaaat cgcagacttt 2581 ggcatgagcc ggaacctcta tgctggggac tattaccgtg tgcagggccg ggcagtgctg 2641 cccatccgct ggatggcctg ggagtgcatc ctcatgggga agttcacgac tgcgagtgac 2701 gtgtgggcct ttggtgtgac cctgtgggag gtgctgatgc tctgtagggc ccagcccttt 2761 gggcagctca ccgacgagca ggtcatcgag aacgcggggg agttcttccg ggaccagggc 2821 cggcaggtgt acctgtcccg gccgcctgcc tgcccgcagg gcctatatga gctgatgctt 2881 cggtgctgga gccgggagtc tgagcagcga ccaccctttt cccagctgca tcggttcctg 2941 gcagaggatg cactcaacac ggtgtgaatc acacatccag ctgcccctcc ctcagggagc 3001 gatccagggg aagccagtga cactaaaaca agaggacaca atggcacctc tgcccttccc 3061 ctcccgacag cccatcacct ctaatagagg cagtgagact gcaggtgggc tgggcccacc 3121 cagggagctg atgccccttc tccccttcct ggacacactc tcatgtcccc ttcctgttct 3181 tccttcctag aagcccccct gtcgcccacc cagctggtcc tgtggatggg atcctctcca 3241 ccctcctcta gccatccctt ggggaagggt ggggagaaat ataggataga cactggacat 3301 ggcccattgg agcacctggg ccccactgga caacactgat tcctggagag gtggctgcgc 3361 ccccagcttc tctctccctg tcacacactg gaccccactg gctgagaatc tgggggtgag 3421 gaggacaaga aggagaggaa aatgtttcct tgtgcctgct cctgtacttg tcctcagctt 3481 gggcttcttc ctcctccatc acctgaaaca ctggacctgg gggtagcccc gccccagccc 3541 tcagtcaccc ccacttccca cttgcagtct tgtagctaga acttctctaa gcctatacgt 3601 ttctgtggag taaatattgg gattgggggg aaagagggag caacggccca tagccttggg 3661 gttggacatc tctagtgtag ctgccacatt gatttttcta taatcacttg gggtttgtac 3721 atttttgggg ggagagacac agatttttac actaatatat ggacctagct tgaggcaatt 3781 ttaatcccct gcactaggca ggtaataata aaggttgagt tttccacaaa aaaaaaaaaa 3841 a // LOCUS HSREVERB2 2355 bp RNA PRI 15-MAR-1995 DEFINITION H.sapiens mRNA encoding Rev-ErbAalpha (internal fragment). ACCESSION X72632 X53327 NID g732802 KEYWORDS Rev-ErbAalpha; thyroid hormone receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 285) AUTHORS Lazar,M.A., Jones,K.E. and Chin,W.W. TITLE Isolation of a cDNA encoding human Rev-ErbA alpha: transcription from the noncoding DNA strand of a thyroid hormone receptor gene results in a related protein that does not bind thyroid hormone JOURNAL DNA Cell Biol. 9 (2), 77-83 (1990) MEDLINE 90262650 FEATURES Location/Qualifiers source 1..2355 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="Embryo" /clone_lib="Lambda gt10 cDNA library." /tissue_lib="Human fetal skeletal muscle library." gene 717..875 /gene="hRev" CDS 717..875 /partial /gene="hRev" /note="putitive coding region" /codon_start=1 /db_xref="PID:g732803" /db_xref="SWISS-PROT:P20393" /translation="MRIAPSSASIATAASNVASRSVSLWACLETLCVLGASPNERSSG CLLRCRVP" BASE COUNT 496 a 810 c 567 g 482 t ORIGIN 1 ccgttgcctc aacgtccaac ccttctgcag ggctgcagtc cggccacccc aagaccttgc 61 tgcagggtgc ttcggatcct gatcgtgagt cgcggggtcc actccccgcc cttagccagt 121 gcccaggggg caacagcggc gatcgcaacc tctagtttga gtcaaggtcc agtttgaatg 181 accgctctca gctggtgaag acatgaccac cctggactcc aacaacaaca caggtggcgt 241 catcacctac attggctcca gtggctcctc cccaagccgc accagccctg aatccctcta 301 tagtgacaac tccaatggca gcttccagtc cctgacccaa ggctgtccca cctacttccc 361 accatccccc actggctccc tcacccaaga cccggctcgc tcctttggga gcattccacc 421 cagcctgagt gatgacggct ccccttcttc ctcatcttcc tcgtcgtcat cctcctcctc 481 cttctataat gggagccccc ctgggagtct acaagtggcc atggaggaca gcagccgagt 541 gtcccccagc aagagcacca gcaacatcac caagctgaat ggcatggtgt tactgtgtaa 601 agtgtgtggg gacgttgcct cgggcttcca ctacggtgtg ctcgcctgcg agggctgcaa 661 gggctttttc cgtcggagca tccagcagaa catccagtac aaaaggtgtc tgaagaatga 721 gaattgctcc atcgtccgca tcaatcgcaa ccgctgccag caatgtcgct tcaagaagtg 781 tctctctgtg ggcatgtctc gagacgctgt gcgttttggg cgcatcccca aacgagagaa 841 gcagcggatg cttgctgaga tgcagagtgc catgaacctg gccaacaacc agttgagcag 901 ccagtgcccg ctggagactt cacccaccca gcaccccacc ccaggcccca tgggcccctc 961 gccaccccct gctccggtcc cctcacccct ggtgggcttc tcccagtttc cacaacagct 1021 gacgcctccc agatccccaa gccctgagcc cacagtggag gatgtgatat cccaggtggc 1081 ccgggcccat cgagagatct tcacctacgc ccatgacaag ctgggcagct cacctggcaa 1141 cttcaatgcc aaccatgcat caggtagccc tccagccacc accccacatc gctgggaaaa 1201 tcagggctgc ccacctgccc ccaatgacaa caacaccttg gctgcccagc gtcataacga 1261 ggccctaaat ggtctgcgcc aggctccctc ctcctaccct cccacctggc ctcctggccc 1321 tgcacaccac agctgccacc agtccaacag caacgggcac cgtctatgcc ccacccacgt 1381 gtatgcagcc ccagaaggca aggcacctgc caacagtccc cggcagggca actcaaagaa 1441 tgttctgctg gcatgtccta tgaacatgta cccgcatgga cgcagtgggc gaacggtgca 1501 ggagatctgg gaggatttct ccatgagctt cacgcccgct gtgcgggagg tggtagagtt 1561 tgccaaacac atcccgggct tccgtgacct ttctcagcat gaccaagtca ccctgcttaa 1621 ggctggcacc tttgaggtgc tgatggtgcg ctttgcttcg ttgttcaacg tgaaggacca 1681 gacagtgatg ttcctaagcc ggaccaccta cagcctgcag gagcttggtg ccatgggcat 1741 gggagacctg ctcagtgcca tgttcgactt cagcgagaag ctcaactccc tggcgcttac 1801 cgaggaggag ctgggcctct tcaccgcggt ggtgcttgtc tctgcagacc gctcgggcat 1861 ggagaattcc gcttcggtgg agcagctcca ggagacgctg ctgcgggctc ttcgggctct 1921 ggtgctgaag aaccggccct tggagacttc ccgcttcacc aagctgctgc tcaagctgcc 1981 ggacctgcgg accctgaaca acatgcattc cgagaagctg ctgtccttcc gggtggacgc 2041 ccagtgaccc gcccggccgg ccttctgccg ctgccccctt gtacagaatc gaactctgca 2101 cttctctctc ctttacgaga cgaaaaggaa aagcaaacca gaatcttatt tatattgtta 2161 taaaatattc caagatgagc ctctggcccc ctgagccttc ttgtaaatac ctgcctccct 2221 cccccatcac cgaacttccc ctcctcccct atttaaacca ctctgtctcc cccacaaccc 2281 tcccctggcc ctctgatttg ttctgttcct gtctcaaatc caatagttca cagctaaaaa 2341 aaaaaaaaaa aaaag // LOCUS HSRFG1 3436 bp RNA PRI 06-APR-1994 DEFINITION H. sapiens cDNA for RFG. ACCESSION X77548 NID g469145 KEYWORDS ret fused gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3436) AUTHORS Santoro,M., Dathan,N.A., Berlingieri,M.T., Bongarzone,I., Paulin,C., Grieco,M., Pierotti,M.A., Vecchio,G. and Fusco,A. TITLE Molecular characterization of RET/PTC3; a novel rearranged version of the RETproto-oncogene in a human thyroid papillary carcinoma JOURNAL Oncogene 9 (2), 509-516 (1994) MEDLINE 94119592 REFERENCE 2 (bases 1 to 3436) AUTHORS Santoro,M. TITLE Direct Submission JOURNAL Submitted (06-APR-1994) M. Santoro, Centro di Endocrinologia Ed Oncologia, Sperimentale del CNR, Facolta di Medicina e Chirurgia, Universita di Napoli, via S. Pansini 5, 80131 Napoli, ITALY FEATURES Location/Qualifiers source 1..3436 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="human thyroid cDNA" misc_feature 53..790 /note="region identical to the sequence HSECE1 X71413 (nucleotides 8-790)" gene 77..1921 /gene="RFG" CDS 77..1921 /gene="RFG" /codon_start=1 /product="Ret fused gene" /db_xref="PID:g469146" /translation="MNTFQDQSGSSSNREPLLRCSDARRDLELAIGGVLRAEQQIKDN LREVKAQIHSCISRHLECLRSREVWLYEQVDLIYQLKEETLQQQAQQLYSLLGQFNCL THQLECTQNKDLANQVSVCLERLGSLTLKPEDSTVLLFEADTITLRQTITTFGSLKTI QIPEHLMAHASSANIGPFLEKRGCISMPEQKSASGIVAVPFSEWLLGSKPASGYQAPY IPSTDPQDWLTQKQTLENSQTSSRACNFFNNVGGNLKGLENWLLKSEKSSYQKCNSHS TTSSFSIEMEKVGDQELPDQDEMDLSDWLVTPQESHKLRNAENGSRETSEKFKLLFQS YNVNDWLVKTDSCTNCQGNQPKGVEIENLANLKCLNDHLEAKKPLSTPSMVTEDWLVQ NHQDPCKVEEVCRANEPCTSFAECVCDENCEKEALYKWLLKKEGKDKNGMPVEPKPEP EKHKDSLNMWLCPRKEVIEQTKAPKAMTPSRIADSFQVIKNSPLSEWLIRPPYKEGSP KEVPGTEDRAGKQKFKSPMNTSWCSFNTADWVLPGKKMGNLSQLSSGEDKWLLRKKAQ EVLLNSPLQEEHNFPPDHYGLPAVCDLFACMQLKVDKEKWLYRTPLQM" BASE COUNT 1012 a 729 c 732 g 963 t ORIGIN 1 caatcgcgac cctcagtcca cccaaggtct cctcggatcg cctggagagg cactcggacc 61 tggagcagtg aggagaatga ataccttcca agaccagagt ggcagctcca gtaatagaga 121 accccttttg aggtgtagtg atgcacggag ggacttggag cttgctattg gtggagttct 181 ccgggctgaa cagcaaatta aagataactt gcgagaggtc aaagctcaga ttcacagttg 241 cataagccgt cacctggaat gtcttagaag ccgtgaggta tggctgtatg aacaggtgga 301 ccttatttat cagcttaaag aggagacact tcaacagcag gctcagcagc tctactcgtt 361 attgggccag ttcaattgtc ttactcatca actggagtgt acccaaaaca aagatctagc 421 caatcaagtc tctgtgtgcc tggagagact gggcagtttg acccttaagc ctgaagattc 481 aactgtcctg ctctttgaag ctgacacaat tactctgcgc cagaccatca ccacatttgg 541 gtctctcaaa accattcaaa ttcctgagca cttgatggct catgctagtt cagcaaatat 601 tgggcccttc ctggagaaga gaggctgtat ctccatgcca gagcagaagt cagcatccgg 661 tattgtagct gtccctttca gcgaatggct ccttggaagc aaacctgcca gtggttatca 721 agctccttac atacccagca ccgaccccca ggactggctt acccaaaagc agaccttgga 781 gaacagtcag acttcttcca gagcctgcaa tttcttcaat aatgtcgggg gaaacctaaa 841 gggcttagaa aactggctcc tcaagagtga aaaatcaagt tatcaaaagt gtaacagcca 901 ttccactact agttctttct ccattgaaat ggaaaaggtt ggagatcaag agcttcctga 961 tcaagatgag atggacctat cagattggct agtgactccc caggaatccc ataagctgcg 1021 gaacgctgag aatggcagtc gtgaaaccag tgagaagttt aagctcttat tccagtccta 1081 taatgtgaat gattggcttg tcaagactga ctcctgtacc aactgtcagg gaaaccagcc 1141 caaaggtgtg gagattgaaa acctggccaa tctgaagtgc ctgaatgacc acttggaggc 1201 caagaaacca ttgtccaccc ccagcatggt tacagaggat tggcttgtcc agaaccatca 1261 ggacccatgt aaggtagagg aggtgtgcag agccaatgag ccctgcacaa gctttgcaga 1321 gtgtgtgtgt gatgagaatt gtgagaagga ggctctgtat aagtggcttc tgaagaaaga 1381 aggaaaggat aaaaatggga tgcctgtgga acccaaacct gagcctgaga agcataaaga 1441 ttccctgaat atgtggctct gtcctagaaa agaagtaata gaacaaacta aagcaccaaa 1501 ggcaatgact ccttctagaa ttgctgattc cttccaagtc ataaagaaca gccccttgtc 1561 ggagtggctt atcaggcccc catacaaaga aggaagtccc aaggaagtgc ctggtactga 1621 agacagagct ggcaaacaga agtttaaaag ccccatgaat acttcctggt gttcctttaa 1681 cacagctgac tgggtcctgc caggaaagaa gatgggcaac ctcagccagt tatcttctgg 1741 agaagacaag tggctgcttc gaaagaaggc ccaggaagta ttacttaatt cacctctaca 1801 ggaggaacat aacttccccc cagaccatta tggcctccct gcagtttgtg atctctttgc 1861 ctgtatgcag cttaaagttg ataaagagaa gtggttatat cgaactcctc tacagatgtg 1921 aaggaatgga caagagttga gcagcctttc tgctgattat cacacatcat gagctgagtg 1981 actgcagctt gccaaatctt tgtgtttctg ggtctgacca attagcttag ttcttctcct 2041 gcctaatttt gaactagtaa agcaaagtga gtcatcagat tatgagttac tgtttaaaag 2101 aaaaatgctg tttattcatg ctgaggtgat tcagttccct ccttcttaca gaagtatttt 2161 aattcacccc acactagaaa tgcagcatct ttgtggacgt ctttttcaca agcctccaag 2221 gctccttaga ttgggtcgtt actaaaagta cattaaaaca ctcttgttta tcgaagtata 2281 ttgatgtatt ctaaagctag taaacttccc taacgtttaa ttgccctaca gatgcttctc 2341 ttgctgtggg ttttcttttg ttagtggtct gaaataatta ttttcctgtt ctattaatac 2401 atagtgtatt ttgcacaaaa aaattaacct ggtcaatagt gattaccaaa atatatatta 2461 ataatcttgg caatttttga cattaattat gaaacatttt agcccacgtt agttctacat 2521 tattcttcac ttaaactcag ctactgcaaa ttttgtcttt ctgtaaatgt tattaaaata 2581 tccagtgagc tctttagaag gactcagtat tatttcaaga ctatttttga ggtaattcta 2641 gccttttaaa atattctaca gacctacggg gcttaaaaga accccagtac cgactaagca 2701 aataggcaaa agacatgttg gaaatgtagt atagtacttg aaacagtcac tatcataggg 2761 ataattggtg catcctgtgt aaatggaagc tgagcttgac acctggtgct tttaagtagg 2821 gataaagtca tcctctcact gcaagcacag catacctgta cctccaaaag tgacgtttta 2881 gtgaacaggc cgttttcaac acttgtgcct tggggtgttc attgaagctt tgtgaaaact 2941 actgatgttt tctcagtctc cttaaagtta cgtccatgct ttaaaatgtc tgtgtaggag 3001 agaagtgggg tttataatgt tttctctaag atatctttgc tgctttccag actttgaaac 3061 tattaagctt tccaactgcc tcttaccgga aatacttctg ggggaacttc atggtcccaa 3121 aatgtcattg ccatacagct tcaccagagt tctttgaacc acagctgaaa agagctttgt 3181 attatttttt aattccctcc ccagatatca tttaggagta ttatataaag gtggtgggca 3241 aaaacaatgt aaggagcctt tccagttatc ttgagttgca gctctgtagt ttcttgaggc 3301 caaacacact gtatttgtca aaatataatt tcccttaatc actatgttaa tgagtatgta 3361 aaacattctt ttgcattgat gaattttgta tctgcttccc ttaaagcata acagccataa 3421 aaaaaaaaaa aaaaaa // LOCUS HSRFX 2262 bp RNA PRI 01-JUN-1995 DEFINITION H.sapiens mRNA for DNA binding regulatory factor. ACCESSION X85786 NID g840788 KEYWORDS binding regulatory protein); RFX5 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2262) AUTHORS Steimle,V., Durand,B., Barras,E., Zufferey,M., Hadam,M.R., Mach,B. and Reith,W. TITLE A novel DNA-binding regulatory factor is mutated in primary MHC class II deficiency (bare lymphocyte syndrome) JOURNAL Genes Dev. 9 (9), 1021-1032 (1995) MEDLINE 95262896 REFERENCE 2 (bases 1 to 2262) AUTHORS Reith,W. TITLE Direct Submission JOURNAL Submitted (23-MAR-1995) W. Reith, Jeantet Laboratory of Molecular Genetics, Dept of Genetics & Microbiology, University of Geneva Medical School, Centre Medical Universitaire, 9 Avenue de Champel, 1211 Geneva 4, SWITZERLAND FEATURES Location/Qualifiers source 1..2262 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="B cell line" /cell_line="RAJI" gene 162..2012 /gene="RFX5" CDS 162..2012 /gene="RFX5" /codon_start=1 /product="binding regulatory factor" /db_xref="PID:g840789" /db_xref="SWISS-PROT:P48382" /translation="MAEDEPDAKSPKTGGRAPPGGAEAGEPTTLLQRLRGTISKAVQN KVEGILQDVQKFSDNDKLYLYLQLPSGPTTGDKSSEPSTLSNEEYMYAYRWIRNHLEE HTDTCLPKQSVYDAYRKYCESLACCRPLSTANFGKIIREIFPDIKARRLGGRGQSKYC YSGIRRKTLVSMPPLPGLDLKGSESPEMGPEVTPAPRDELVEAACALTCDWAERILKR SFSSIVEVARFLLQQHLISARSAHAHVLKAMGLAEEDEHAPRERSSKPKNGLENPEGG AHKKPERLAQPPKDLEARTGAGPLARGERKKSVVESSAPGANNLQVNALVARLPLLLP RAPRSLIPPIPVSPPILAPRLSSGALKVATLPLSSRAGAPPAAVPIINMILPTVPALP GPGPGPGRAPPGGLTQPRGTENREVGIGGDQGPHDKGVKRTAEVPVSEASGQAPPAKA AKQDIEDTASDAKRKRGRPRKKSGGSGERNSTPLKSAAAMESAQSSRLPWETWGSGGE GNSAGGAERPGPMGEAEKGAVLAQGQGDGTVSKGGRGPGSQHTKEAEDKIPLVPSKVS VIKGSRSQKEAFPLAKGEVDTAPQGNKDLKEHVLQSSLSQEHKDPKATPP" BASE COUNT 580 a 587 c 627 g 468 t ORIGIN 1 gtcgactaca aggtggctac gagttttcca gatttaggag acttcagaaa ggtggggcag 61 atagaatgga gatggcaaag atctctttgg gcatatatgg gcctggcgaa gtaatggaat 121 aatttctaat tttcggagaa ggcaagtgcc ctcatgccgg gatggcagaa gatgagcctg 181 atgctaagag ccccaagact gggggaaggg cccccccagg tggtgctgag gctggggaac 241 ctaccaccct tcttcagagg ctccgaggta ccatttccaa ggccgtgcag aacaaagtag 301 aggggatcct gcaagatgta cagaaatttt ctgacaatga caagctgtat ctctaccttc 361 agctcccctc aggacccacc actggagaca aaagctcaga gccaagtaca ctgagcaatg 421 aggagtacat gtatgcctat aggtggatcc gcaaccacct ggaagagcac actgacacct 481 gtctgccaaa gcaaagtgtt tatgatgcct atcggaagta ctgtgagagt cttgcctgtt 541 gccgcccact cagcacagcc aactttggca agatcatcag agagatcttc cctgacatca 601 aagctcgaag gcttggtggc cggggccagt ccaaatattg ctacagtggc ataaggagga 661 agaccttggt gtctatgcca cccctgcctg gacttgacct aaagggttct gagagtccag 721 aaatgggccc agaagtaacc ccagcacctc gagatgaact ggtggaggca gcgtgtgccc 781 tgacctgtga ctgggcagag cggatcctga aacggtcctt cagttccatc gttgaggtcg 841 cccgcttcct gctacagcag catctcatct ctgcccgatc tgcacatgcc catgtgctta 901 aggccatggg gcttgctgaa gaggacgaac atgcacctcg ggaacggtca tctaaaccaa 961 agaatggttt agagaaccca gagggtggag cccacaagaa gccagagaga ctggcccagc 1021 ctcctaagga tctggaagcc cgaactgggg ccggtcctct cgcacgtgga gagcggaaga 1081 agagtgtagt tgagagctcg gccccaggag ccaataacct gcaggttaat gccctagtgg 1141 ctcggctgcc tctgctcctt ccccgggccc ctcgctcact aattccgcca atcccagtct 1201 ctccacctat tctggccccc aggctttctt caggtgccct gaaagtggct acactgcctc 1261 tgtctagtag ggccggggca cccccagcag ctgtgcccat cattaacatg atcttaccaa 1321 ctgttcctgc tttgcctgga cctggacctg ggcctgggcg agctccacct gggggactca 1381 ctcagccccg gggcacagag aacagagagg taggcatagg tggtgaccaa ggaccacatg 1441 acaagggtgt caagaggaca gctgaagtac ctgtgagtga ggccagtggg caggctccac 1501 cagctaaagc agcaaagcag gatatagagg atacagcaag tgatgccaaa aggaaacggg 1561 ggcgccctcg aaaaaagtca ggtggaagtg gggaaaggaa ttctacccct ctcaagtcag 1621 cagctgccat ggaatctgcc cagtcctcaa ggttaccatg ggagacatgg ggctcaggag 1681 gggaaggcaa ctcagctgga ggggcagaga ggccagggcc aatgggagag gctgaaaagg 1741 gggcagtact tgcccagggt cagggagatg gtactgtttc caaaggagga aggggccccg 1801 gttcccagca taccaaagaa gcagaagata aaattccctt ggtcccctca aaagtgagtg 1861 tcatcaaggg cagcagaagc caaaaggagg cttttccttt ggcaaaggga gaggtagaca 1921 ctgcaccaca gggtaataaa gacttaaagg agcatgtgct tcaaagttcc ttatcccagg 1981 agcataaaga cccaaaagca acacccccat gatacaggtc tgtggggaag agtgtttata 2041 tccctacgtt aactttgcct agtagaggcc cttctttgca cttgcttctc atttggctat 2101 tcttttccta aggaagtcca ttctcctctg tacagacagc tgagtcaccc agtctactag 2161 tacctggttg ctgcctctga ccttttcaga ttgataccct gggcctttag tgtaaccaat 2221 aaatctgtag tgaccttacc tgtattccct gtgctatcct gt // LOCUS HSRFX3 2514 bp RNA PRI 28-FEB-1994 DEFINITION H.sapiens HRFX3 mRNA. ACCESSION X76092 NID g452403 KEYWORDS DNA-binding protein RFX3. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2514) AUTHORS Reith,W.C.A. TITLE Direct Submission JOURNAL Submitted (11-NOV-1993) W.C.A. Reith, Department of Genetics & Microbiology, University of Geneva Medical School, Centre Medical Universitaire (C.M.U.), 9 ave de Champel, CH-1211 Geneva 4, SWITZERLAND REFERENCE 2 (bases 1 to 2514) AUTHORS Reith,W., Ucla,C., Barras,E., Gaud,A., Durand,B., Herrero-Sanchez,C., Kobr,M. and Mach,B. TITLE RFX1, a transactivator of hepatitis B virus enhancer I, belongs to a novel family of homodimeric and heterodimeric DNA-binding proteins JOURNAL Mol. Cell. Biol. 14 (2), 1230-1244 (1994) MEDLINE 94119075 FEATURES Location/Qualifiers source 1..2514 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="B cell" /cell_line="RAJI" /clone_lib="lambda gt10" gene 9..2132 /gene="HRFX3" CDS 9..2132 /gene="HRFX3" /codon_start=1 /evidence=experimental /product="DNA binding protein RFX3" /db_xref="PID:g452404" /db_xref="SWISS-PROT:P48380" /translation="MQTSETGSDTGSTVTLQTSVASQAAVPTQVVQQVPVQQQVQQVQ TVQQVQHVYPAQVQYVEGSDTVYTNGAIRTTTYPYTETQMYSQNTGGNYFDTQGSSAQ VTTVVSSHSMVGTGGIQMGVTGGQLISSSGGTYLIGNSMENSGHSVTHTTRASPATIE MAIETLQKSDGLSTHRSSLLNSHLQWLLDNYETAEGVSLPRSTLYNHYLRHCQEHKLD PVNAASFGKLIRSIFMGLRTRRLGTRGNSKYHYYGIRVKPDSPLNRLQEDMQYMAMRQ QPMQQKQRYKPMQKVDGVADGFTGSGQQTGTSVEQTVIAQSQHHQQFLDASRALPEFG EVEISSLPDGTTFEDIKSLQSLYREHCEAILDVVVNLQFSLIEKLWQTFWRYSPSTPT DGTTITESSNLSEIESRLPKAKLITLCKHESILKWMCNCDHGMYQALVEILIPDVLRP IPSALTQAIRNFAKSLEGWLSNAMNNIPQRMIQTKVAAVSAFAQTLRRYTSLNHLAQA ARAVLQNTSQINQMLSDLNRVDFANVQEQASWVCQCDDNMVQRLETDFKMTLQQQSTL EQWAAWLDNVMMQALKPYEGRPSFPKAARQFLLKWSFYSSMVIRDLTLRSAASFGSFH LIRLLYDEYMFYLVEHRVAQATGETPIAVMGEVREAERAVTHWVIKNKPELHFSLNTL LIKTMVPNQVSLRARRDCGVIARVP" BASE COUNT 766 a 558 c 566 g 624 t ORIGIN 1 agaccatcat gcagacatca gagactgggt cggacacagg ctcgacagtg accttacaaa 61 catctgtggc tagtcaagca gcagtgccta cgcaggtggt acagcaagta ccagtacaac 121 aacaggtaca gcaggtacag actgtgcagc aggtacaaca tgtctatccc gctcaggtgc 181 agtatgtgga aggaagcgat actgtctata ccaatggagc aatccgaaca acaacgtatc 241 cttacacaga gacacagatg tacagccaaa atactggagg gaattacttt gatactcaag 301 ggagttccgc ccaggtgact accgtggtct catcccacag tatggtgggc actggtggga 361 ttcagatggg cgtcacagga ggacaactca tcagcagctc tggaggaacc tatctgatcg 421 gcaactcaat ggagaattct ggtcactcag tgacacacac aactcgggcc tccccagcga 481 caattgaaat ggcgattgag acgctgcaaa agtctgacgg tctgtccact cacagaagct 541 ctcttctcaa cagccatctc cagtggctgt tggacaatta tgagacagca gaaggagtga 601 gccttcccag aagcactctg tacaaccact accttcgaca ctgtcaggaa cacaaactgg 661 acccagtcaa tgctgcctct tttggaaaat taataagatc aatttttatg gggctacgaa 721 ccaggagatt gggcactaga ggaaactcca aataccacta ctatgggatt cgtgtcaagc 781 cagattcccc tcttaatcgt ctgcaagaag acatgcagta tatggctatg agacaacaac 841 ccatgcaaca gaaacaaagg tacaagccta tgcagaaagt ggatggggtt gcagatggtt 901 tcacaggaag tggtcaacag acaggcacat ctgttgagca aactgtaatt gcccaaagcc 961 aacatcatca acagttttta gatgcatctc gagcacttcc agagtttgga gaagttgaaa 1021 tctcttctct gccagatggt actacctttg aggatatcaa gtcactgcag agtctttata 1081 gagagcactg tgaggcaata ttggacgttg ttgtgaatct tcaatttagc ctgatagaaa 1141 aattgtggca aacattctgg cgctattctc cctctactcc aactgatggc actaccatta 1201 ccgaatcgag caatctgagt gaaatagaaa gtcgacttcc gaaagcaaag ctgataactc 1261 tgtgcaaaca tgagtctatc ctgaaatgga tgtgtaactg tgaccatggg atgtaccagg 1321 ctttggtgga gattctcatc cccgacgtcc ttagacctat tcctagtgcc ttgacccaag 1381 ccattcgaaa ttttgcaaaa agccttgaag gttggctttc caatgccatg aacaatattc 1441 cacagagaat gatacaaacc aaggttgccg ctgtaagtgc ctttgcccag actctgcgaa 1501 gatacacgtc gcttaatcac ctggcccagg cagctcgtgc agtgcttcag aacacttccc 1561 aaatcaacca gatgcttagt gacctcaacc gtgtcgactt tgccaatgtc caggagcagg 1621 cttcctgggt gtgccagtgt gatgacaaca tggttcagag actagaaaca gacttcaaga 1681 tgactcttca gcagcagagc accctggagc agtgggctgc gtggcttgac aatgtgatga 1741 tgcaagcact gaaaccctat gaaggaagac ccagttttcc taaagccgcc aggcagtttc 1801 tgctaaaatg gtctttctac agctcaatgg ttattcggga cttaacctta cgcagtgctg 1861 ctagctttgg ctccttccac ctgatccgtc tactctacga cgaatatatg ttttacttag 1921 tagaacatcg tgttgctcag gcaacaggag agactcctat agcagtcatg ggcgaggtaa 1981 gagaggctga aagagctgtg acccactggg ttattaaaaa taagccagaa ttacacttca 2041 gtctaaatac attattgatt aaaaccatgg ttcctaacca agtcagcctc agagccagga 2101 gggactgtgg agttattgca agagttcctt aaattcctct gttcttatgt gcttaatcaa 2161 aagattctaa aattgtggat tatatcatgt gaaagttcat ggaatgtgtc tctatattat 2221 acaggtacct atgtaaaaga aaaacttgag aaccactgga atatgaaaaa tattttaagt 2281 gggaaaaaga ttggtgctcc tgataaagca aagggctagg aatacaatgg aaaggattaa 2341 atatgtattt tgtagactct ttcccggtct ccaagaacaa agtaaatgat tctattttag 2401 aaggataaaa atggtaatga tgataactaa catttttgac acttactatg cgaggcacag 2461 ttctatgttt tttacatgta ataaagcact taatgcaagt ctttgaaaaa aaaa // LOCUS HSRFXAP 2785 bp DNA PRI 18-SEP-1997 DEFINITION H.sapiens RFXAP mRNA. ACCESSION Y12812 NID g2073409 KEYWORDS 36kD subunit; RFX DNA-binding complex; RFXAP gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2785) AUTHORS Durand,B., Sperisen,P., Emery,P., Barras,E., Zufferey,M., Mach,B. and Reith,W. TITLE RFXAP, a novel subunit of the RFX DNA binding complex is mutated in MHC class II deficiency JOURNAL EMBO J. 16 (5), 1045-1055 (1997) MEDLINE 97224131 REFERENCE 2 (bases 1 to 2785) AUTHORS Reith,W. TITLE Direct Submission JOURNAL Submitted (29-APR-1997) W. Reith, Department of Genetics & Microbiology, University of Geneva Medical School, 1 rue Michel-Servet, CH-1211 Geneva 4, SWITZERLAND FEATURES Location/Qualifiers source 1..2785 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="B cell" /cell_line="Namalwa" gene 117..935 /gene="RFXAP" CDS 117..935 /gene="RFXAP" /codon_start=1 /product="36kD subunit of RFX DNA-binding complex" /db_xref="PID:e314739" /db_xref="PID:g2073410" /translation="MEAQGVAEGAGPGAASGVPHPAALAPAAAPTLAPASVAAAASQF TLLVMQPCAGQDEAAAPGGSVGAGKPVRYLCEGAGDGEEEAGEDEADLLDTSDPPGGG ESAASLEDLEDEETHSGGEGSSGGARRRGSGGGSMSKTCTYEGCSETTSQVAKQRKPW MCKKHRNKMYKDKYKKKKSDQALNCGGTASTGSAGNVKLEESADNILSIVKQRTGSFG DRPARPTLLEQVLNQKRLSLLRSPEVVQFLQKQQQLLNQQVLEQRQQQFPGTSM" BASE COUNT 807 a 512 c 662 g 804 t ORIGIN 1 cccggtatag gcgcctttta ccccagcgtg tcctgagtct ttggttcgcg aagtgccgtt 61 aggccaagca ggtgctaaaa gcccggggtc gtggaccccg gccaggtctt agcagcatgg 121 aggcgcaggg tgtagcggag ggcgcggggc cgggcgccgc cagcggcgtg ccccaccccg 181 cggccctagc cccggctgcg gctcccacct tggcgccagc ctcggtggcg gccgcggcct 241 ctcaattcac cctgctagtg atgcaaccct gtgctgggca ggacgaggct gcggcccccg 301 ggggcagcgt tggggcgggc aagcccgtta ggtacctgtg cgaaggggcc ggggatggcg 361 aagaggaggc tggggaggac gaggcggacc tgttagacac ttcggaccct ccggggggag 421 gcgagagcgc ggctagtttg gaggatctag aggacgagga gactcactcg gggggcgagg 481 gcagcagcgg gggcgcccgg aggcggggca gcggtggggg cagcatgagc aagacctgca 541 cctacgaagg ctgcagcgag accacgagcc aggtggccaa gcagcgcaaa ccgtggatgt 601 gcaagaaaca ccgcaacaag atgtacaagg acaagtataa aaagaagaag agcgaccagg 661 ccctgaactg cggtgggact gcctcgactg gcagcgcggg aaacgtcaaa ctcgaggaaa 721 gtgcagataa catactctcc attgttaaac aaagaacagg atcttttggg gatcgtcctg 781 caagacctac tcttttagaa caagtgttaa atcaaaaaag actgtcgtta ctaagaagtc 841 cagaagtagt gcaattttta cagaaacagc aacagctatt aaatcagcaa gttttggagc 901 aaagacaaca gcagtttcca ggaacatcaa tgtgagggaa cttaccaaga acatctacat 961 ggtttttatc ttattgtaat agatgagcat atttttttac cagacataaa tggggtaata 1021 atctatgcct gtagaacata aacattttcc tgtaaatgta tgtgtgcatt tggggataag 1081 taagtattgc actttgtgca tctaatcttt cagattactg tgagtttgaa gaagtcagct 1141 tatctttcca aataacattt aattataatg ttttttaaaa aatatattcc tcttcagtca 1201 ttgttactga gggtaatgaa gcagttactt tctgtgggag tcataaagtt aatagatatt 1261 aatcttgact catctagctc agtggttctc atcaagggtc aatttgattg tcatagtgac 1321 cttgaaaacc actggctttt agtgagtggc caggaaatgc taaatgttct gcagtgtcag 1381 gggtagtccc acatactaaa gattgtctca cccgcagtgc caataacact cctaagaaat 1441 gttgatggct attttgtggt gctaacatgt agttggggca cctacaattg ggttctctta 1501 ataacctttc tttgcagtta agactgaagc tgtcaaagag gtaagcacat tttatataga 1561 cgtaaggaaa gtgattattg tttaatatct gtgaatttag gatgtgcatc tcttttcaga 1621 ggtgtgttag taaaacctga cggattaact aagcacactg ggatgtgtct cctacagttg 1681 gcttctctct ttgatgttac ctgttagtgc tgatctctta aagcagacat ttcttgtttg 1741 ttgaatttgt gaacagtata gatctcagcc caccaatgcc aagacaaaat tatttttctt 1801 atacttattt tttattaaac aaaatgaaaa agatcctttt caaaaaggtg atcctgaaaa 1861 taaaactaac actccagtat tttgtcattg tttttcgcaa ttgagctatc tgaaaactgt 1921 tattcctaag taatgttcaa aaatgataag taatctggat acctttttct tatactttct 1981 cctaggaaaa ctttaaaact ttaaaaaggc aaacctacca ataggaataa caaattaaat 2041 gtcaagagag tatatccaat attaggatat aaatgtatgt gtctcaagtt taactctaca 2101 aaaatttgtt acttgttttt taaactctat atataaagtt cgacttaatc atggctgttc 2161 taagaagtac ttatggagag caagaacatt tttgttcatt tcttaatgtg tgtgttttta 2221 cttgcatatc tgttcaaaac acttttaaca aaattaattc attaaagtcc agttgttgac 2281 ctttgagtta gccgatttct ttattctgtt ctttagttta ttcttactag atgcagagga 2341 attcatctac tgtctgttat taactgttag tttattctca tacttacgat gttgagagtt 2401 tttttgaagc ttaagttacc ctttatggtg gaaaacatta gcttatgctt ctttagatgg 2461 aataatggga aaggagggaa atgggaaatg gatggaaatg ggaaaggagg gaaaataata 2521 gcccagtgag agctgaatga aaagggactg aatttaaata tttgtaagaa ctttgtgatg 2581 atgagtaatt gtcagacgtg ggatagataa ctgagaggct cagaatcttt accaaggata 2641 ttttttagga taaggtagct gcctgttcat gaatttggat aagaatagta ggacaatatt 2701 caacacaatt taatttttgt ctgccacatt agacattttt ttaccttata aaatgatcaa 2761 taaagcaata aggtttattt tgggt // LOCUS HSRH30A 1471 bp RNA PRI 24-JAN-1991 DEFINITION Human mRNA for erythrocyte membrane protein Rh30A (Rhesus antigen). ACCESSION X54534 NID g36017 KEYWORDS blood-group antigen; membrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1471) AUTHORS Avent,N.D. TITLE Direct Submission JOURNAL Submitted (20-AUG-1990) Avent N.D., Dept of Biochemistry, University of Bristol, University Walk, Bristol BS8 1TD, U K REFERENCE 2 (bases 1 to 1471) AUTHORS Avent,N.D., Ridgwell,K., Tanner,M.J. and Anstee,D.J. TITLE cDNA cloning of a 30 kDa erythrocyte membrane protein associated with Rh (Rhesus)-blood-group-antigen expression JOURNAL Biochem. J. 271 (3), 821-825 (1990) MEDLINE 91058522 COMMENT The cloned protein is associated with the Rhesus blood group antigens. See also M34015. FEATURES Location/Qualifiers source 1..1471 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="bone marrow" /clone_lib="lambda gt10" mRNA 1..1455 /note="Rh30A polypeptide" /evidence=experimental CDS 45..1298 /codon_start=1 /product="Rh30A polypeptide" /db_xref="PID:g36018" /db_xref="SWISS-PROT:P18577" /translation="MSSKYPRSVRRCLPLWALTLEAALILLFYFFTHYDASLEDQKGL VASYQVGQDLTVMAALGLGFLTSNFRRHSWSSVAFNLFMLALGVQWAILLDGFLSQFP PGKVVITLFSIRLATMSAMSVLISAGAVLGKVNLAQLVVMVLVEVTALGTLRMVISNI FNTDYHMNLRHFYVFAAYFGLTVAWCLPKPLPKGTEDNDQRATIPSLSAMLGALFLWM FWPSVNSPLLRSPIQRKNAMFNTYYALAVSVVTAISGSSLAHPQRKISMTYVHSAVLA GGVAVGTSCHLIPSPWLAMVLGLVAGLISIGGAKCLPVCCNRVLGIHHISVMHSIFSL LGLLGEITYIVLLVLHTVWNGNGMIGFQVLLSIGELSLAIVIALTSGLLTGLLLNLKI WKAPHVAKYFDDQVFWKFPHLAVGF" mat_peptide 45..1295 /product="Rh30A polypeptide" polyA_site 1455 BASE COUNT 329 a 375 c 384 g 383 t ORIGIN 1 gatgcctggt gctggtggaa cccctgcaca gagacggaca caggatgagc tctaagtacc 61 cgcggtctgt ccggcgctgc ctgcccctct gggccctaac actggaagca gctctcattc 121 tcctcttcta tttttttacc cactatgacg cttccttaga ggatcaaaag gggctcgtgg 181 catcctatca agtcggccaa gatctgaccg tgatggcggc ccttggcttg ggcttcctca 241 cctcaaattt ccggagacac agctggagca gtgtggcctt caacctcttc atgctggcgc 301 ttggtgtgca gtgggcaatc ctgctggacg gcttcctgag ccagttccct cctgggaagg 361 tggtcatcac actgttcagt attcggctgg ccaccatgag tgctatgtcg gtgctgatct 421 cagcgggtgc tgtcttgggg aaggtcaact tggcgcagtt ggtggtgatg gtgctggtgg 481 aggtgacagc tttaggcacc ctgaggatgg tcatcagtaa tatcttcaac acagactacc 541 acatgaacct gaggcacttc tacgtgttcg cagcctattt tgggctgact gtggcctggt 601 gcctgccaaa gcctctaccc aagggaacgg aggataatga tcagagagca acgataccca 661 gtttgtctgc catgctgggc gccctcttct tgtggatgtt ctggccaagt gtcaactctc 721 ctctgctgag aagtccaatc caaaggaaga atgccatgtt caacacctac tatgctctag 781 cagtcagtgt ggtgacagcc atctcagggt catccttggc tcacccccaa aggaagatca 841 gcatgactta tgtgcacagt gcggtgttgg caggaggcgt ggctgtgggt acctcgtgtc 901 acctgatccc ttctccgtgg cttgccatgg tgctgggtct tgtggctggg ctgatctcca 961 tcgggggagc caagtgcctg ccggtgtgtt gtaaccgagt gctggggatt caccacatct 1021 ccgtcatgca ctccatcttc agcttgctgg gtctgcttgg agagatcacc tacattgtgc 1081 tgctggtgct tcatactgtc tggaacggca atggcatgat tggcttccag gtcctcctca 1141 gcattgggga actcagcttg gccatcgtga tagctctcac gtctggtctc ctgacaggtt 1201 tgctcctaaa tctcaaaata tggaaagcac ctcatgtggc taaatatttt gatgaccaag 1261 ttttctggaa gtttcctcat ttggctgttg gattttaagc aaaacaaaag catccaagaa 1321 aaacaaggcc tgttcaaaaa caagacaact tcctctcact gttgcctgca tttgtacgtg 1381 agaaacgctc atgacagcaa agtctcctta tgtataatga aacaaggtca gagacagatt 1441 tgatattaaa aaattaaaaa aaaaaaaaaa a // LOCUS HSRHO1 1819 bp RNA PRI 18-FEB-1994 DEFINITION H.sapiens mRNA for rho GDP-dissociation Inhibitor 1. ACCESSION X69550 NID g456190 KEYWORDS actin polymerization; cytoplasmic; rho GDP dissociation inhibitor; rho protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1819) AUTHORS Leffers,H. TITLE Direct Submission JOURNAL Submitted (03-DEC-1992) H. Leffers, Inst of Medical Biochemistry & Danish Centre for Human Genome Research, Ole Worms Alle 170, Aarhus University, DK-8000 Aarhus C, DENMARK REFERENCE 2 (bases 1 to 1819) AUTHORS Leffers,H., Nielsen,M.S., Andersen,A.H., Honore,B., Madsen,P., Vandekerckhove,J. and Celis,J.E. TITLE Identification of two human Rho GDP dissociation inhibitor proteins whose overexpression leads to disruption of the actin cytoskeleton JOURNAL Exp. Cell Res. 209 (2), 165-174 (1993) MEDLINE 94085490 FEATURES Location/Qualifiers source 1..1819 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="AMA cells" /cell_line="AMA cells" /clone_lib="cDNA/lambda ZAP II" /clone="8118" CDS 54..668 /codon_start=1 /product="Human rho GDP-dissociation Inhibitor 1(IEF 8118)" /db_xref="PID:g456191" /translation="MAEQEPTAEQLAQIAAENEEDEHSVNYKPPAQKSIQEIQELDKD DESLRKYKEALLGRVAVSADPNVPNVVVTGLTLVCSSAPGPLELDLTGDLESFKKQSF VLKEGVEYRIKISFRVNREIVSGMKYIQHTYRKGVKIDKTDYMVGSYGPRAEEYEFLT PVEEAPKGMLARGSYSIKSRFTDDDKTDHLSWEWNLTIKKDWKD" polyA_signal 1802..1807 BASE COUNT 325 a 634 c 540 g 320 t ORIGIN 1 cctgaaccgc gcggccgaac cctccggtgt cccgacccag gctaagcttg agcatggctg 61 agcaggagcc cacagccgag cagctggccc agattgcagc ggagaacgag gaggatgagc 121 actcggtcaa ctacaagccc ccggcccaga agagcatcca ggagatccag gagctggaca 181 aggacgacga gagcctgcga aagtacaagg aggccctgct gggccgcgtg gccgtttccg 241 cagaccccaa cgtccccaac gtcgtggtga ctggcctgac cctggtgtgc agctcggccc 301 cgggccccct ggagctggac ctgacgggcg acctggagag cttcaagaag cagtcgtttg 361 tgctgaagga gggtgtggag taccggataa aaatctcttt ccgggttaac cgagagatag 421 tgtccggcat gaagtacatc cagcatacgt acaggaaagg cgtcaagatt gacaagactg 481 actacatggt aggcagctat gggccccggg ccgaggagta cgagttcctg acccccgtgg 541 aggaggcacc caagggtatg ctggcccggg gcagctacag catcaagtcc cgcttcacag 601 acgacgacaa gaccgaccac ctgtcctggg agtggaatct caccatcaag aaggactgga 661 aggactgagc ccagccagag gcgggcaggg cagagtgatg gacggaagac ggacaggcgg 721 atgtgtcccc cccagcccct cccctcccca taccaaggtg ctgagcaggc cctccgtgcc 781 cctccaccct ggtccgcctc cctggcctgg ctcaaccgag tgcctccgac ccccctcctc 841 agccctcccc cacccacagg cccagcctcc tcggtctcct gtctcgttgc tgcttctgcc 901 tgtgctgtgg gggagagagg ccgcagccag gcctctgctg ccctttctgt gccccccagg 961 ttctatctcc ccgtcacacc cgaggcctgg cttcaggagg gagcggagca gccattctcc 1021 aggccccgtg gttgcccctg gacgtgtgcg tctgctgctc cggggtggag ctggggtgtg 1081 ggatgcacgg cctcgtgggg gccgggccgt cctccagccc cgctgctccc tggccagccc 1141 ccttgtcgct gtcggtcccg tctaaccatg atgccttaac atgtggagtg taccgtgggg 1201 cctcactagc ctctactccc tgtgtctgca tgagcatgtg gcctccccgt cccttccccg 1261 gtggcgaacc cagtgaccca gggacacgtg gggtgtgctg gtgctgctcc ccagcccacc 1321 aatgcctggc cagcctgccc ccttccctgg acagggctgt ggagatggct ccggcggctt 1381 ggggaaagcg aaattgccaa cactcaagtc acctcagtac catccaggag gctgggtatt 1441 gtcctgcctc tgccttttct gtctcagcgg cagtgcccag agcccacacc cccccaagag 1501 ccctcgatgg acaggcctga cccaccccac ctggggccag ccaggagccc cgcctgggcc 1561 atcagtattt attgcctccg tccgtgccgt ccctgggcca ctggctggcg cctcttcccc 1621 cagcctctca gtgccaccac ccccggcagc cttccctgac ccagccagga caaacaaggg 1681 accaagtgca cacattgctg agagccgtct cctataggtc ccccgcccca tccccggtgt 1741 tggtgttgtg tctgccaggc tcaggcagag gcgcctgtcc ctgcttcttt tctgaccggg 1801 aaataaatgc ccctgaagg // LOCUS HSRHO2 1150 bp RNA PRI 13-JAN-1994 DEFINITION H.sapiens mRNA for rho GDP-dissociation Inhibitor 2. ACCESSION X69549 NID g441454 KEYWORDS actin polymerization; cytoplasmic; rho GDP dissociation inhibitor; rho protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1150) AUTHORS Leffers,H. TITLE Direct Submission JOURNAL Submitted (03-DEC-1992) H. Leffers, Inst of Medical Biochemistry & Danish Centre for Human Genome Research, Ole Worms Alle 170, Aarhus University, DK-8000 Aarhus C, DENMARK REFERENCE 2 (bases 1 to 1150) AUTHORS Leffers,H., Nielsen,M.S., Andersen,A.H., Honore,B., Madsen,P., Vandekerckhove,J. and Celis,J.E. TITLE Identification of two human Rho GDP dissociation inhibitor proteins whose overexpression leads to disruption of the actin cytoskeleton JOURNAL Exp. Cell Res. 209 (2), 165-174 (1993) MEDLINE 94085490 FEATURES Location/Qualifiers source 1..1150 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="AMA cells" /cell_line="AMA cells" /clone_lib="cDNA/lambda ZAP II" /clone="8120" CDS 55..660 /codon_start=1 /product="Human rho GDP-dissociation Inhibitor 2(IEF 8120)" /db_xref="PID:g441455" /translation="MTEKAPEPHVEEDDDDELDSKLNYKPPPQKSLKELQEMDKDDES LIKYKKTLLGDGPVVTDPKAPNVVVTRLTLVCESAPGPITMDLTGDLEALKKETIVLK EGSEYRVKIHFKVNRDIVSGLKYVQHTYRTGVKVDKATFMVGSYGPRPEEYEFLTPVE EAPKGMLARGTYHNKSFFTDDDKQDHLSWEWNLSIKKEWTE" polyA_signal 1133..1138 BASE COUNT 325 a 298 c 268 g 259 t ORIGIN 1 gagagacaga ggcaccccgg acagagacgt gaagcactga ataaatagat cagaatgact 61 gaaaaagccc cagagccaca tgtggaggag gatgacgatg atgagctgga cagcaagctc 121 aattataagc ctccaccaca gaagtccctg aaagagctgc aggaaatgga caaagatgat 181 gagagtctaa ttaagtacaa gaaaacgctg ctgggagatg gtcctgtggt gacagatccg 241 aaagccccca atgtcgttgt cacccggctc accctggttt gtgagagtgc cccgggacca 301 atcaccatgg accttactgg agatctggaa gccctcaaaa aggaaaccat tgtgttaaag 361 gaaggttctg aatatagagt caagattcac ttcaaagtga acagggatat tgtgtcaggc 421 ctgaaatacg ttcagcacac ctacaggact ggggtgaaag tggataaggc aacatttatg 481 gttggcagct atggacctcg gcctgaggag tatgagttcc tcactccagt tgaggaggct 541 cccaagggca tgctggcgcg aggcacgtac cacaacaagt ccttcttcac cgacgatgac 601 aagcaagacc acctcagctg ggagtggaac ctgtcgatta agaaggagtg gacagaatga 661 atgcatccac ccctttccca cccttgccac ctggaagaat tctctcaggc gtgttcagca 721 ccctgtccct cctccctgtc cacagctggg tccctcttca acactgccac atttccttat 781 tgatgcatct tttcccaccc tgtcactcaa cgtggtccct agaacaagag gcttaaaacc 841 gggctttcac ccgaacctgc tccctctgat cctccatcag ggccagatct tccacgtctc 901 catctcagta cacaatcatt taatatttcc ctgtcttacc cctattcaag caactagagg 961 ccagaaaatg ggaaattatc actaacaggt ctttgactca ggttccagta gttcattcta 1021 atgcctagat tcttttgtgg ttgttgctgg cccaatgagt ccctagtcac atcccctgcc 1081 agagggagtt cttcttttgt gagagacact gtaaacgaca caagagaaca agaataaaac 1141 aataactgtg // LOCUS HSRHO6 699 bp RNA PRI 16-SEP-1996 DEFINITION H.sapiens mRNA for Rho6 protein. ACCESSION Y07923 NID g1546901 KEYWORDS GTP-binding protein; rho6 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 699) AUTHORS Chardin,P. TITLE Characterization of a new family of rho-related proteins JOURNAL Unpublished REFERENCE 2 (bases 1 to 699) AUTHORS Chardin,P. TITLE Direct Submission JOURNAL Submitted (12-SEP-1996) P. Chardin, CNRS UPR 411, Institut de Pharmacologie, 660 Route des Lucioles, F- 06560 Valbonne France, FRANCE FEATURES Location/Qualifiers source 1..699 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /dev_stage="fetal" gene 1..699 /gene="rho6" CDS 1..699 /gene="rho6" /codon_start=1 /product="GTP-binding protein" /db_xref="PID:e266092" /db_xref="PID:g1546902" /translation="MKERRAPQPVVARCKLVLVGDVQCGKTAMLQVLAKDCYPETYVP TVFENYTACLETEEQRVELSLWDTSGSPYYDNVRPLCYSDSDAVLLCFDISRPETVDS ALKKWRTEILDYCPSTRVLLIGCKTDLRTDLSTLMELSHQKQAPISYEQGCAIAKQLG AEIYLEGSAFTSEKSIHSIFRTASMLCLNKPSPLPQKSPVRSLSKRLLHLPSRSELIS STFKKEKAKSCSIM" BASE COUNT 179 a 191 c 180 g 149 t ORIGIN 1 atgaaggaga gacgggcccc ccagccagtc gtggccagat gtaagctcgt tctggtcggg 61 gacgtgcagt gtgggaagac cgcgatgttg caagtgttag cgaaggattg ctatccagag 121 acctatgtgc ccaccgtgtt cgaaaattac acagcctgtt tggagacaga ggaacagagg 181 gtggagctta gtctctggga tacctcagga tctccctact acgataatgt ccgtccactc 241 tgctacagcg actcggatgc agtattacta tgttttgaca tcagccgtcc agagacagtg 301 gacagcgcac tgaagaagtg gaggacagaa atcctagatt attgtcccag cacccgcgtt 361 ttgctcattg gctgcaagac agacctgcga acagacctga gtactctgat ggagctgtcc 421 caccagaagc aggcgcccat ctcctatgag cagggttgtg caatagcaaa gcagctgggt 481 gcagaaatct acctggaagg ctcagctttc acctcagaaa agagcatcca cagcatcttt 541 cggacggcat ccatgctgtg tctgaacaag cctagcccac tgccccagaa gagccctgtc 601 cgaagcctct ccaaacgact gctccacctc cccagtcgct ctgaactcat ctcttctacc 661 ttcaagaagg aaaaggccaa aagctgttcc attatgtga // LOCUS HSRHO7GEN 684 bp RNA PRI 03-JUN-1997 DEFINITION H.sapiens mRNA for Rho7 protein. ACCESSION X95456 NID g2168148 KEYWORDS rho7 gene; Rho7 protein; small GTP binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 684) AUTHORS Chardin,P. TITLE Characterization of a new family of rho-related proteins JOURNAL Unpublished REFERENCE 2 (bases 1 to 684) AUTHORS Chardin,P. TITLE Direct Submission JOURNAL Submitted (29-JAN-1996) P. Chardin, CNRS UPR 411, Institut de Pharmacologie, 660 Route des Lucioles, F- 06560 Valbonne, FRANCE REMARK Revised by author 03-JUN-97 FEATURES Location/Qualifiers source 1..684 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="brain" /clone_lib="Clontech #HL 1065a" gene 1..684 /gene="rho7" CDS 1..684 /gene="rho7" /note="small GTP binding protein" /codon_start=1 /product="Rho7 protein" /db_xref="PID:e320410" /db_xref="PID:g2168149" /translation="MEGQSGRCKIVVVGDAECGKTALLQVFAKDAYPGSYVPTVFENY TASFEIDKRRIELNMWDTSGSSYYDNVRPLAYPDSDAVLICFDISRPETLDSVLKKWQ GETQEFCPNAKVVLVGCKLDMRTDLATLRELSKQRLIPVTHEQGTVLAKQVGAVSYVE CSSRSSERSVRDVFHVATVASLGRGHRQLRRTDSRRGMQRSAQLSGRPDRGNEGEIHK DRAKSCNLM" BASE COUNT 146 a 182 c 214 g 142 t ORIGIN 1 atggaggggc agagcggccg ctgcaagatc gtggtggtgg gagacgcaga gtgcggcaag 61 acggcgctgc tgcaggtgtt cgccaaggac gcctatcccg ggagttatgt ccccaccgtg 121 tttgagaact acactgcgag ctttgagatc gacaagcgcc gcattgagct caacatgtgg 181 gacacttcag gttcctctta ctatgataat gtccggcctc tggcctatcc tgattctgat 241 gctgtgctca tctgcttcga cattagccga ccagaaacac tggacagtgt tctcaagaag 301 tggcaaggag agactcaaga gttctgcccc aatgccaagg ttgtgctggt tggctgtaaa 361 ctggacatgc ggactgacct ggccacactg agggagctgt ccaagcagag gcttatccct 421 gttacacatg agcagggcac tgtgctggcc aagcaggtgg gggctgtgtc ctatgttgag 481 tgctcctccc ggtcctctga gcgcagcgtc agggatgtct tccatgtggc tacagtggcc 541 tcccttggcc gtggccatag gcagctgcgc cgaactgact cacgccgggg aatgcagcga 601 tccgctcagc tgtcaggacg gccagaccgg gggaatgagg gcgagataca caaggatcga 661 gccaaaagct gcaacctcat gtga // LOCUS HSRHO8GEN 735 bp RNA PRI 29-JAN-1996 DEFINITION H.sapiens mRNA for Rho8 protein. ACCESSION X95282 NID g1171565 KEYWORDS rho8 gene; Rho8 protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 735) AUTHORS Chardin,P. TITLE Characterization of a new family of rho-related proteins JOURNAL Unpublished REFERENCE 2 (bases 1 to 735) AUTHORS Chardin,P. TITLE Direct Submission JOURNAL Submitted (23-JAN-1996) P. Chardin, CNRS UPR 411, Institut de Pharmacologie, 660 Route des Lucioles, F- 06560 Valbonne, FRANCE COMMENT Related sequence T58202. FEATURES Location/Qualifiers source 1..735 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /germline /tissue_type="spleen" /clone_lib="Stratagene #937205" gene 1..735 /gene="rho8" CDS 1..735 /gene="rho8" /codon_start=1 /product="Rho8 protein" /db_xref="PID:e220185" /db_xref="PID:g1171566" /translation="MKERRASQKLSSKSIMDPNQNVKCKIVVVGDSQCGKTALLHVFA KDCFPENYVPTVFENYTASFEIDTQRIELSLWDTSGSPYYDNVRPLSYPDSDAVLICF DISRPETLDSVLKKWKGEIQEFCPNTKMLLVGCKSDLRTDVSTLVELSNHRQTPVSYD QGANMAKQIGAATYIECSALQSENSVRDIFHVATLACVNKTNKNVKRNKSQRATKRIS HMPSRPELSAVATDLRKDKAKSCTVM" BASE COUNT 224 a 165 c 176 g 170 t ORIGIN 1 atgaaggaga gaagagccag ccagaaatta tccagcaaat ctatcatgga tcctaatcag 61 aacgtgaaat gcaagatagt tgtggtggga gacagtcagt gtggaaaaac tgcgctgctc 121 catgtcttcg ccaaggactg cttccccgag aattacgttc ctacagtgtt tgagaattac 181 acggccagtt ttgaaatcga cacacaaaga atagagttga gcctgtggga cacttcgggt 241 tctccttact atgacaatgt ccgccccctc tcttaccctg attcggatgc tgtgctgatt 301 tgctttgaca tcagtagacc agagaccctg gacagtgtcc tcaaaaagtg gaaaggtgaa 361 atccaggaat tttgtccaaa taccaaaatg ctcttggtcg gctgcaagtc tgatctgcgg 421 acagatgtta gtacattagt agagctctcc aatcacaggc agacgccagt gtcctatgac 481 cagggggcaa atatggccaa acagattgga gcagctactt atatcgaatg ctcagcttta 541 cagtcggaaa atagcgtcag agacattttt cacgttgcca ccttggcatg tgtaaataag 601 acaaataaaa acgttaagcg gaacaaatca cagagagcca caaagcggat ttcacacatg 661 cctagcagac cagaactctc ggcagttgct acggacttac gaaaggacaa agcgaagagc 721 tgcactgtga tgtga // LOCUS HSRHOB 1074 bp RNA PRI 12-SEP-1993 DEFINITION Human rho mRNA (clone 12). ACCESSION X05026 NID g36029 KEYWORDS rho gene; unidentified reading frame. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1074) AUTHORS Yeramian,P., Chardin,P., Madaule,P. and Tavitian,A. TITLE Nucleotide sequence of human rho cDNA clone 12 JOURNAL Nucleic Acids Res. 15 (4), 1869 (1987) MEDLINE 87146500 FEATURES Location/Qualifiers source 1..1074 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="12" CDS 159..740 /note="ORF (AA 1-193)" /codon_start=1 /db_xref="PID:g36030" /db_xref="SWISS-PROT:P06749" /translation="MAAIRKKLVIVGDGACGKTCLLIVFSKDQFPEVYVPTVFENYVA DIEVDGKQVELALWDTAGQEDYDRLRPLSYPDTDVILMCFSIDSPDSLENIPEKWTPE VKHFCPNVPIILVGNKKDLRNDEHTRRELAKMKQEPVKPEEGRDMANRIGAFGYMECS AKTKDGVREVFEMATRAALQARRGKKKSGCLVL" BASE COUNT 269 a 251 c 272 g 282 t ORIGIN 1 gaattcgggc taccctcgcc ccgcccgcgg tcctccgtcg gttctctcat tagtccacgg 61 tctggtcttc agctacccgc cttcgtctcc gagtttgcga ctcgcgggac cggcgtcccc 121 ggcgcgaaga ggctggactc ggattcgttg cctgagcaat ggctgccatc cggaagaaac 181 tggtgattgt tggtgatgga gcctgtggaa agacatgctt gctcatagtc ttcagcaagg 241 accagttccc agaggtgtat gtgcccacag tgtttgagaa ctatgtggca gatatcgagg 301 tggatggaaa gcaggtagag ttggctttgt gggacacagc tgggcaggaa gattatgatc 361 gcctgaggcc cctctcctac ccagataccg atgttatact gatgtgtttt tccatcgaca 421 gccctgatag tttagaaaac atcccagaaa agtggacccc agaagtcaag catttctgtc 481 ccaacgtgcc catcatcctg gttgggaata agaaggatct tcggaatgat gagcacacaa 541 ggcgggagct agccaagatg aagcaggagc cggtgaaacc tgaagaaggc agagatatgg 601 caaacaggat tggcgctttt gggtacatgg agtgttcagc aaagaccaaa gatggagtga 661 gagaggtttt tgaaatggct acgagagctg ctctgcaagc tagacgtggg aagaaaaaat 721 ctggttgcct tgtcttgtga aaccttgctg caagcacagc ccttatgcgg ttaattttga 781 agtgctgttt attaatctta gtgtatgatt actggccttt ttcatttatc tataatttac 841 ctaagattac aaatcagaag tcatcttgct accagtattt agaagccaac tatgattatt 901 aacgatgtcc aacccgtctg gcccaccagg gtccttttga cactgctcta acagccctcc 961 tctgcactcc cacctgacac accaggcgct aattcaagga atttcttaac ttcttgcttc 1021 tttctagaaa gagaaacagt tggtaacttt tgtcaattag gctgtaacta cttt // LOCUS HSRHOB6 591 bp RNA PRI 24-OCT-1996 DEFINITION H.sapiens rhoB gene mRNA. ACCESSION X06820 NID g36031 KEYWORDS GTP-binding protein; ras-related protein; rhoB gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 591) AUTHORS Chardin,P. TITLE Direct Submission JOURNAL Submitted (12-FEB-1988) Chardin P., INSERM U-248, Faculte de Medecine, Lariboisiere Saint-Louis, 10 av. de Verdun 75010 Paris, France REFERENCE 2 (bases 1 to 591) AUTHORS Chardin,P., Madaule,P. and Tavitian,A. TITLE Coding sequence of human rho cDNAs clone 6 and clone 9 JOURNAL Nucleic Acids Res. 16 (6), 2717 (1988) MEDLINE 88203210 FEATURES Location/Qualifiers source 1..591 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="tumour" /tissue_type="pheochromocytoma" /clone="6" gene 1..591 /gene="rhoB" CDS 1..591 /gene="rhoB" /codon_start=1 /db_xref="PID:g36032" /db_xref="SWISS-PROT:P01121" /translation="MAAIRKKLVVVGDGACGKTCLLIVFSKDEFPEVYVPTVFENYVA DIEVDGKQVELALWDTAGQEDYDRLRPLSYPDTDVILMCFSVDSPDSLENIPEKWVPE VKHFCPNVPIILVANKKDLRSDEHVRTELARMKQEPVRTDDGRAMAVRIQAYDYLECS AKTKEGVREVFETATRAALQKRYGSQNGCINCCKVL" BASE COUNT 120 a 182 c 194 g 95 t ORIGIN 1 atggcggcca tccgcaagaa gctggtggtg gtgggcgacg gcgcgtgtgg caagacgtgc 61 ctgctgatcg tgttcagtaa ggacgagttc cccgaggtgt acgtgcccac cgtcttcgag 121 aactatgtgg ccgacattga ggtggacggc aagcaggtgg agctggcgct gtgggacacg 181 gcgggccagg aggactacga ccgcctgcgg ccgctctcct acccggacac cgacgtcatt 241 ctcatgtgct tctcggtgga cagcccggac tcgctggaga acatccccga gaagtgggtc 301 cccgaggtga agcacttctg tcccaatgtg cccatcatcc tggtggccaa caaaaaagac 361 ctgcgcagcg acgagcatgt ccgcacagag ctggcccgca tgaagcagga acccgtgcgc 421 acggatgacg gccgcgccat ggccgtgcgc atccaagcct acgactacct cgagtgctct 481 gccaagacca aggaaggcgt gcgcgaggtc ttcgagacgg ccacgcgcgc cgcgctgcag 541 aagcgctacg gctcccagaa cggctgcatc aactgctgca aggtgctatg a // LOCUS HSRHOG 1284 bp RNA PRI 30-JUN-1993 DEFINITION H.sapiens rhoG mRNA for GTPase. ACCESSION X61587 S38935 NID g36035 KEYWORDS GTP binding protein; GTPase; ras-related; rhoG gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1284) AUTHORS Fort,P.P. TITLE Direct Submission JOURNAL Submitted (25-SEP-1991) P.P. Fort, Lab de Biologie Moleculaire, URACRNS 1191 Genetique Moleculaire, Universite Montpellier II CPO12, Ple E. Bataillon, 34095 Montpellier Cedex5, FRANCE REFERENCE 2 (bases 1 to 1284) AUTHORS Vincent,S., Jeanteur,P. and Fort,P. TITLE Growth-regulated expression of rhoG, a new member of the ras homolog gene family JOURNAL Mol. Cell. Biol. 12 (7), 3138-3148 (1992) MEDLINE 92318931 FEATURES Location/Qualifiers source 1..1284 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="adenocarcinoma" /cell_type="HeLa" mRNA 1..1284 /gene="rhoG" /evidence=experimental gene 1..1284 /gene="rhoG" 5'UTR 1..129 /gene="rhoG" CDS 130..705 /gene="rhoG" /codon_start=1 /product="GTPase" /db_xref="PID:g36036" /db_xref="SWISS-PROT:P35238" /translation="MQSIKCVVVGDGAVGKTCLLICYTTNAFPKEYIPTVFDNYSAQS AVDGRTVNLNLWDTAGQEEYDRLRTLSYPQTNVFVICFSIASPPSYENVRHKWHPEVC HHCPDVPILLVGTKKDLRAQPDTLRRLKEQSQAPITPQQGQALAKQIHAVRYLECSAL QQDGVKEVFAEAVRAVLNPTPIKRGRSCILL" 3'UTR 706..1284 /gene="rhoG" BASE COUNT 243 a 437 c 331 g 273 t ORIGIN 1 gcttctcgag cccggagccg ctgccgccgc ccccagctcc cccgcctcgg gaggggcacc 61 aggtcactgc agccagaggg gtccagaaga gagaggaggc actgcctcac tacagcaact 121 gcacccacga tgcagagcat caagtgcgtg gtggtgggtg atggggctgt gggcaagacg 181 tgcctgctca tctgctacac aactaacgct ttccccaaag agtacatccc caccgtgttc 241 gacaattaca gcgcgcagag cgcagttgac gggcgcacag tgaacctgaa cctgtgggac 301 actgcgggcc aggaggagta tgaccgcctc cgtacactct cctaccctca gaccaacgtt 361 ttcgtcatct gtttctccat tgccagtccg ccgtcctatg agaacgtgcg gcacaagtgg 421 catccagagg tgtgccacca ctgccctgat gtgcccatcc tgctggtggg caccaagaag 481 gacctgagag cccagcctga caccctacgg cgcctcaagg agcagagcca ggcgcccatc 541 acaccgcagc agggccaggc actcgcgaaa cagatccacg ctgtgcgcta cctcgaatgc 601 tcagccctgc aacaggatgg tgtcaaggaa gtgttcgccg aggctgtccg ggctgtgctc 661 aaccccacgc cgatcaagcg tgggcggtcc tgcatcctct tgtgaccctg gcacttggct 721 tggaggctgc ccctgccctc cccccaccag ttgtgccttg gtgccttgtc cgcctcagct 781 gtgccttaag gactaattct ggcacccctt tccaggggtt ccctgaatgc ctttttctct 841 gagtgccttt ttctccttaa ggaggcctgc agagaaaggg gctttgggct ctgcccctct 901 ggcttgggaa cactgggtat tctcatgagc tcatccaagc caaggttgga cccctcccca 961 agaggccaac ccagtgcccc ctcccatttt ccgctactga ccagttcatc cagctttcca 1021 cacagttgtt gctgcctatt gtggtgccgc ctcaggttag gggctctcag ccatctctaa 1081 cctctgccct cgctgctctt ggaattgcgc ccccaagatg ctctctccct tctccaatga 1141 gggagccaca gaatcctgag aaggtgaatg taccctaacc tgctcctctg tgcctaggcc 1201 ttacgcattt gctgactgac tcagccccca tgcttctggg gacctttcct acccccatca 1261 gcatcaataa aacctcctgt ctcc // LOCUS HSRHOGAPX 1430 bp RNA PRI 25-AUG-1997 DEFINITION H.sapiens rhoGAP protein. ACCESSION Z23024 NID g312211 KEYWORDS rhoGAP protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1430) AUTHORS Lancaster,C.A., Taylor-Harris,P.M., Self,A.J., Brill,S., van Erp,H.E. and Hall,A. TITLE Characterization of rhoGAP. A GTPase-activating protein for rho-related small GTPases JOURNAL J. Biol. Chem. 269 (2), 1137-1142 (1994) MEDLINE 94117418 REFERENCE 2 (bases 1 to 1430) AUTHORS Hall,A. TITLE Direct Submission JOURNAL Submitted (08-JUN-1993) Alan Hall Dr., Cell and Molecular Biology, Institute of Cancer, Research, Chester Beatty Laboratories, 239 Fulham Road, London, SW3 6JB, England FEATURES Location/Qualifiers source 1..1430 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="fibrosarcoma" /cell_line="HT1080" /sex="Male" 5'UTR 1..98 CDS 99..1418 /codon_start=1 /product="rhoGAP protein" /db_xref="PID:g312212" /db_xref="SWISS-PROT:Q07960" /translation="MDPLSELQDDLTLDDTSEALNQLKLASIDEKNWPSDEMPDFPKS DDSKSSSPELVTHLKWDDPYYDIARHQIVEVAGDDKYGRKIIVFSACRMPPSHQLDHS KLLGYLKHTLDQYVESDYTLLYLHHGLTSDNKPSLSWLRDAYREFDRKYKKNIKALYI VHPTMFIKTLLILFKPLISFKFGQKIFYVNYLSELSEHVKLEQLGIPRQVLKYDDFLK STQKSPATAPKPMPPRPPLPNQQFGVSLQHLQEKNPEQEPIPIVLRETVAYLQAHALT TEGIFRRSANTQVVREVQQKYNMGLPVDFDQYNELHLPAVILKTFLRELPEPLLTFDL YPHVVGFLNIDESQRVPATLQVLQTLPEENYQVLRFLTAFLVQISAHSDQNKMTNTNL AVVFGPNLLWAKDAAITLKAINPINTFTKFLLDHQGELFPSPDPSGL" BASE COUNT 332 a 453 c 365 g 280 t ORIGIN 1 gtttgccgag ccttgacagg cgtggcagag ggagcagtgc ccgagcgcgg tttctcttaa 61 ggttctgcag ggcaaggctg tctgggacag gcttggccat ggatccgctc tcagagctgc 121 aggatgatct gaccttggat gacaccagcg aggctctgaa ccagctgaag ctggcctcca 181 tcgatgagaa gaactggccc tcggatgaaa tgcctgactt ccccaagtca gatgactcca 241 aaagcagctc cccggaactt gtcacacacc tgaagtggga tgacccatac tatgacatcg 301 cccggcacca gatcgtggag gtggcaggag atgacaagta tgggcggaag atcattgtgt 361 ttagtgcctg tcgaatgccc cccagccacc agctcgacca cagcaagctc ctggggtacc 421 tgaagcacac cctggaccag tacgtggaga gtgactacac acttctgtat ctgcaccacg 481 gcctgaccag cgacaacaag ccctccctca gctggctccg tgatgcctac cgggagtttg 541 accgcaagta caagaagaac atcaaggcct tgtacatcgt gcatccaacc atgttcatca 601 aaactctgct catcctcttc aagcccctca tcagcttcaa gttcgggcag aagatcttct 661 atgtgaatta cctgagcgag ctgagcgagc acgtgaagct ggagcagctg gggatccctc 721 gccaagtgct caaatatgac gacttcctga aatccacaca gaagagcccc gcgacagccc 781 ccaagcccat gcccccacgg ccccccctgc ccaaccagca gtttggagtc tcgctgcagc 841 acctccagga gaagaatcca gagcaggagc ccattcccat tgtactcagg gagactgttg 901 cctacttaca ggcccacgct ctcaccaccg agggcatctt ccggaggtcg gccaacaccc 961 aagtggtccg ggaagtgcag cagaagtaca acatggggct gcctgtggat ttcgaccagt 1021 acaatgagct gcacctgcca gcagtcatcc tcaagacctt cctccgggag cttcctgagc 1081 ccctgctcac ctttgacctc tacccccatg tggtgggctt cctcaacatt gatgaaagcc 1141 agagggtgcc agcgacactg caggtcctcc agacgctgcc cgaggagaac taccaggtgc 1201 ttcgtttcct gactgctttc ctggtgcaga tttctgcaca cagtgaccag aacaagatga 1261 ccaacactaa cctggctgtt gttttcggcc ctaacctgct gtgggccaag gatgcggcca 1321 tcaccctcaa ggccattaat cccatcaaca ccttcaccaa gttccttctg gatcaccaag 1381 gggagctgtt cccaagcccg gaccccagcg ggctctgaac ctggcccctg // LOCUS HSRIBIIR 2509 bp RNA PRI 23-MAR-1995 DEFINITION Human mRNA for ribophorin II. ACCESSION Y00282 NID g36048 KEYWORDS glycoprotein; integral membrane protein; ribophorin II. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2509) AUTHORS Meyer,D.I. TITLE Direct Submission JOURNAL Submitted (05-DEC-1986) to the EMBL/GenBank/DDBJ databases REFERENCE 2 (bases 1 to 2509) AUTHORS Crimaudo,C., Hortsch,M., Gausepohl,H. and Meyer,D.I. TITLE Human ribophorins I and II: the primary structure and membrane topology of two highly conserved rough endoplasmic reticulum-specific glycoproteins JOURNAL EMBO J. 6 (1), 75-82 (1987) MEDLINE 87218477 COMMENT *source cell=Hela. FEATURES Location/Qualifiers source 1..2509 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 289..2184 /note="precursor" /codon_start=1 /db_xref="PID:g36049" /db_xref="SWISS-PROT:P04844" /translation="MAPPGSSTVFLLALTIIASTWALTPTHYLTKHDVERLKASLDRP FTNLESAFYSIVGLSSLGAQVPDAKKACTYIRSNLDPSNVDSLFYAAQASQALSGCEI SISNETKDLLLAAVSEDSSVTQIYHAVAALSGFGLPLASQEALSALTARLSKEETVLA TVQALQTASHLSQQADLRSIVEEIEDLVARLDELGGLYLQCEEGLETTALFVAATYKL MDHVGTEPSIKEDQVIQLMNAIFSKKNFESLSEAFSVASAASVLSHNRYHVPVVVVPE GSASDTHEQAILRLQVTNVLSQPLTQATVKLEHAKSVASRATVLQKTSFTPVGDVFEL NFMNVKFSSGYYDFLVEVEGDNRYIANTVELRVKISTEVGITNVDLSTVDKDQSIAPK TTRVTYPAKAKGTFIADSHQNFALFFQLVDMNTGAELTPHQTFVRLHNQKTGQEVVFV AEPDNKNVYKFELDTSERKIEFDSASGTYTLYLIIGDATLKNPILWNVADVVIKFPEE EAPSTVLSQNLFTPKQEIQHLFREPEKRPPTVVSNTFTALILSPLLLLFALWIRIGAN VSNFTFAPSTIIFHLGHAAMLGLMYVYWTQLNMFQTLKYLAILGSVTFLAGNRMLAQQ AVKRTAH" sig_peptide 289..354 /note="signal peptide" mat_peptide 355..2181 /note="mature ribophorin II" misc_feature 604..606 /note="pot. glycosylation site" BASE COUNT 630 a 663 c 608 g 607 t 1 others ORIGIN 1 ttccagcgtt gcgagacggt cggttccaag tgggcctggg cgcgggggag aggcgggtct 61 gtcctcggga actgcaaggc cctgtgagcg ggaggactgg gatcccggcc gcggctgctg 121 gaagcgtcga agctcagcgg gccgcgcaca tgacctgtgc ttagaactca tcctggcccg 181 cagagcctgc cgcgagtccc tggcgtcccc tgtggcgggc tcttggagcc actttcccca 241 gcggaactca gcccgcggct cggactccgg cgggacctgc tgggaggaat ggcgccgccg 301 ggttcaagca ctgtcttcct gttggccctg acaatcatag ccagcacctg ggctctgacg 361 cccactcact acctcaccaa gcatgacgtg gagagactaa aagcctcgct ggatcgccct 421 ttcacaaatt tggaatctgc cttctactcc atcgtgggac tcagcagcct tggtgctcag 481 gtgccagatg caaagaaagc atgtacctac atcagatcta accttgatcc cagcaatgtg 541 gattccctct tctacgctgc ccaggccagc caggccctct caggatgtga gatctctatt 601 tcaaatgaga ccaaagatct gcttctggca gctgtcagtg aggactcatc tgttacccag 661 atctaccatg cagttgcagc tctaagtggc tttggccttc ccttggcatc ccaagaagca 721 ctcagtgccc ttactgctcg tctcagcaag gaggagactg tgctggcaac agtccaggct 781 ctgcagacag catcccacct rtcccagcag gctgacctga ggagcatcgt ggaggagatt 841 gaggaccttg ttgctcgcct ggatgaactc gggggcctgt atctccagtg tgaagaagga 901 ctggaaacaa cagcgttatt tgtggctgcc acctacaagc tcatggatca tgtggggact 961 gagccatcca ttaaggagga tcaggtcatc cagctgatga acgcgatctt cagcaagaag 1021 aactttgagt ccctctccga agccttcagc gtggcctctg cagcttctgt gctctcgcat 1081 aatcgctacc acgtgccagt tgtggttgtg cctgagggct ctgcttccga cactcatgaa 1141 caggctatct tgcggttgca agtcaccaat gttctgtctc agcctctgac tcaggccact 1201 gttaaactag aacatgctaa atctgttgct tccagagcca ctgtcctcca gaagacatcc 1261 ttcacccctg taggggatgt ttttgaacta aatttcatga acgtcaaatt ttccagtggt 1321 tattatgact tccttgtcga agttgaaggt gacaaccggt atattgcaaa taccgtagag 1381 ctcagagtca agatctccac tgaagttggc atcacaaatg ttgatctttc caccgtggat 1441 aaggatcaga gcattgcacc caaaactacc cgggtgacat acccagccaa agccaagggc 1501 acattcatcg cagacagcca ccagaacttc gccttgttct tccagctggt agatatgaac 1561 actggtgctg aactcactcc tcaccagaca tttgtccgac tccataacca gaagactggc 1621 caggaagtgg tgtttgttgc cgagccagac aacaagaacg tgtacaagtt tgaactggat 1681 acctctgaaa gaaagattga atttgactct gcctctggca cctacactct ctacttaatc 1741 attggagatg ccactttgaa gaacccaatc ctctggaatg tggctgatgt ggtcatcaag 1801 ttccctgagg aagaagctcc ctcgactgtc ttgtcccaga accttttcac tccaaaacag 1861 gaaattcagc acctgttccg cgagcctgag aagaggcccc ccaccgtggt gtccaataca 1921 ttcactgccc tgatcctctc gccgttgctt ctgctcttcg ctctgtggat ccggattggt 1981 gccaatgtct ccaacttcac ttttgctcct agcacgatta tatttcacct gggacatgct 2041 gctatgctgg gactcatgta tgtctactgg actcagctca acatgttcca gaccttgaag 2101 tacctggcca tcttgggcag tgtgacgttt ctggctggca atcggatgct ggcccagcag 2161 gcagtcaaga gaacagcaca ttagttccag aagaaagatg gaaattctga aaactgaatg 2221 tcaagaaaag gagtcaagaa caattcacag tatgagaaga aaaatggaaa aaaaaaactt 2281 tatttaaaaa agaaaaaagt ccagattgta gttatacttt tgcttgtttt tcagtttccc 2341 caacacacag cagatacctg gtgagctcag atagtctctt tctctgacac tgtgtaagaa 2401 gctgtgaata ttcctaactt acccagatgt tgcttttgaa aagttgaaat gtgtaattgt 2461 tttggaataa agagggtaac aataggaaaa aaaaaaaaaa aaaaaaaaa // LOCUS HSRIBIR 2397 bp RNA PRI 23-MAR-1995 DEFINITION Human mRNA for ribophorin I. ACCESSION Y00281 NID g36052 KEYWORDS glycoprotein; integral membrane protein; ribophorin I. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2397) AUTHORS Meyer,D.I. TITLE Direct Submission JOURNAL Submitted (05-DEC-1986) to the EMBL/GenBank/DDBJ databases REFERENCE 2 (bases 1 to 2397) AUTHORS Crimaudo,C., Hortsch,M., Gausepohl,H. and Meyer,D.I. TITLE Human ribophorins I and II: the primary structure and membrane topology of two highly conserved rough endoplasmic reticulum-specific glycoproteins JOURNAL EMBO J. 6 (1), 75-82 (1987) MEDLINE 87218477 COMMENT *source cell=Hela. FEATURES Location/Qualifiers source 1..2397 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 138..1961 /note="precursor" /codon_start=1 /db_xref="PID:g36053" /db_xref="SWISS-PROT:P04843" /translation="MEAPAAGLFLLLLLGTWAPAPGSASSEAPPLINEDVKRTVDLSS HLAKVTAEVVLAHLGGGSTSRATSFLLALEPELEARLAHLGVQVKGEDEEENNLEVRE TKIKGKSGRFFTVKLPVALDPGAKISVIVETVYTHVLHPYPTQITQSEKQFVVFEGNH YFYSPYPTKTQTMRVKLASRNVESYTKLGNPTRSEDLLDYGPFRDVPAYSQDTFKVHY ENNSPFLTITSMTRVIEVSHWGNIAVEENVDLKHTGAVLKGPFSRYDYQRQPDSGISS IRSFKTILPAAAQDVYYRDEIGNVSTSHLLILDDSVEMEIRPRFPLFGGWKTHYIVGY NLPSYEYLYNLGDQYALKMRFVDHVFDEQVIDSLTVKIILPEGAKNIEIDSPYEISRA PDELHYTYLDTFGRPVIVAYKKNLVEQHIQDIVVHYTFNKVLMLQEPLLVVAAFYILF FTVIIYVRLDFSITKDPAAEARMKVACITEQVLTLVNKRIGLYRHFDETVNRYKQSRD ISTLNSGKKSLETEHKALTSEIALLQSRLKTEGSDLCDRVSEMQKLDAQVKELVLKSA VEAERLVAGKLKKDTYIENEKLISGKRQELVTKIDHILDAL" sig_peptide 138..206 /note="signal peptide" mat_peptide 207..1958 /note="mature ribophorin I" misc_feature 1032..1034 /note="pot. glycosylation site" BASE COUNT 589 a 585 c 632 g 591 t ORIGIN 1 ttagctaggc atggtggcac atgcctgtaa tcccagctac tcgggaggct gaggcaggag 61 aattgcttga acccgggagg cagaggttgc agccggctga gatggcgcca tcacactcca 121 gcctgctctt cccggtcatg gaggcgccag ccgccggctt gtttctgctc ctgttgcttg 181 ggacttgggc cccggcgccg ggcagcgcct cctccgaggc accgccgctg atcaatgagg 241 acgtgaagcg cacagtggac ctaagcagcc acctggctaa ggtgacggcc gaggtggtcc 301 tggcgcacct gggcggcggc tccacgtccc gagctacctc tttcctgctg gctttggagc 361 ctgagctcga ggcccggctg gcgcacctgg gcgtgcaggt aaagggagaa gatgaggaag 421 agaacaattt ggaagtacgt gaaaccaaaa ttaagggtaa aagtgggaga ttcttcacag 481 tcaagctccc agttgctctt gatcctgggg ccaagatttc agtcattgtg gaaacagtct 541 acacccatgt gcttcatcca tatccaaccc agatcaccca gtcagagaaa cagtttgtgg 601 tgtttgaggg gaaccattat ttctactctc cctatccaac gaagacacaa accatgcgtg 661 tgaagcttgc ctctcgaaat gtggagagct acaccaagct ggggaacccc acgcgctctg 721 aggacctact ggattatggg cctttcagag atgtgcctgc ctatagtcag gatactttta 781 aagtacatta tgagaacaac agccctttcc tgaccatcac cagcatgacc cgagtcattg 841 aagtctctca ctggggtaat attgctgtgg aagaaaatgt ggacttaaag cacacaggag 901 ctgtgcttaa ggggcctttc tcacgctatg attaccagag acagccagat agtggaatat 961 cctccatccg ttcttttaag accatccttc ctgctgctgc ccaggatgtt tattaccggg 1021 atgagattgg caatgtttct accagccacc tccttatttt ggatgactct gtagagatgg 1081 aaatccggcc tcgcttccct ctctttggcg ggtggaagac ccattacatc gttggctaca 1141 acctcccaag ctatgagtac ctctataatt tgggtgacca gtatgcactg aagatgaggt 1201 ttgtggacca tgtgtttgat gaacaagtga tagattctct gactgtgaag atcatcctgc 1261 ctgaaggagc caagaacatt gaaattgata gtccctatga aatcagccgt gccccagatg 1321 agctgcacta cacctatctg gatacatttg gccgccctgt gattgttgcc tacaagaaaa 1381 atctggtaga acagcacatt caggacattg tggtccacta cacgttcaac aaggtgctca 1441 tgctgcagga gcccctgctg gtggtggcgg ccttctacat cctgttcttc accgttatca 1501 tctatgttcg gctggacttc tccatcacca aggatccagc cgcagaagcc aggatgaagg 1561 tagcctgcat cacagagcag gtcttgaccc tggtcaacaa gagaataggc ctttaccgtc 1621 actttgacga gaccgtcaat aggtacaagc aatcccggga catctccacc ctcaacagtg 1681 gcaagaagag cctggagact gaacacaagg ccttgaccag tgagattgca ctgctgcagt 1741 ccaggctgaa gacagagggc tctgatctgt gcgacagagt gagcgaaatg cagaagctgg 1801 atgcacaggt caaggagctg gtgctgaagt cggcggtgga ggctgagcgc ctggtggctg 1861 gcaagctcaa gaaagacacg tacattgaga atgagaagct catctcagga aagcgccagg 1921 agctggtcac caagatcgac cacatcctgg atgccctgta gcccctgccc gcatcctcca 1981 gggggcccag ggtgcctgca ctttgctgtg gcaggcagat tgggtggtag tgggaggttg 2041 tgcatggagg ccagtgaaag ctgacatctg taaaaggcct tcaaggaaga gaaaccaggc 2101 cctgcgtcag gcagtgtgag tttgccgttt gtccttaact ttcttttttt ttttttttaa 2161 aaaagaaaac tttaaaaaaa ctcccattaa aaacaaaaca tctttgtgtt gtgaacaaag 2221 gaattttcaa tatttgattg gtattctgtt ctgaagtcta ggatattttt cagcctataa 2281 agccccctgt tttatgccct tctaattctg atgtttgggt attgtgtgag tgcatgtgtt 2341 tttttttttt tttttttaaa gcgtgtgtga acaaatggaa ataaagcagg gactgtg // LOCUS HSRING1 1529 bp RNA PRI 30-JUN-1995 DEFINITION H.sapiens RING1 gene. ACCESSION Z14000 NID g296063 KEYWORDS RING1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1529) AUTHORS Lovering,R., Hanson,I.M., Borden,K.L., Martin,S., O'Reilly,N.J., Evan,G.I., Rahman,D., Pappin,D.J., Trowsdale,J. and Freemont,P.S. TITLE Identification and preliminary characterization of a protein motif related to the zinc finger JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90 (6), 2112-2116 (1993) MEDLINE 93211912 REFERENCE 2 (bases 1 to 1529) AUTHORS Lovering,R.C. TITLE Direct Submission JOURNAL Submitted (19-JUN-1992) Ruth C. Lovering, Human Immunogenetics Laboratory, Imperial Cancer Research Fund, 44 Lincoln's Inn Fields, London, WC2A 3PX, UK FEATURES Location/Qualifiers source 1..1529 /organism="Homo sapiens" /db_xref="taxon:9606" /haplotype="A1,31(19),B8,40,Cw3,w7" /cell_type="T lymphoblastoid" /cell_line="CEM" /clone_lib="CEM" /clone="RING1" /chromosome="Chromosome 6" gene 76..1209 /gene="RING1" CDS 76..1209 /gene="RING1" /codon_start=1 /db_xref="PID:g296064" /db_xref="SWISS-PROT:Q06587" /translation="MDGTEIAVSPRSLHSELMCPICLDMLKNTMTTKECLHRFCSDCI VTALRSGNKECPTCRKKLVSKRSLRPDPNFDALISKIYPSREEYEAHQDRVLIRLSRL HNQQALSSSIEEGLRMQAMHRAQRVRRPIPGSDQTTTMSGGEGEPGEGEGDGEDVSSD SAPDSAPGPAPKRPRGGGAGGSSVGTGGGGTGGVGGGAGSEDSGDRGGTLGGGTLGPP SPPGAPSPPEPGGEIELVFRPHPLLVEKGEYCQTRYVKTTGNATVDHLSKYLALRIAL ERRQQQEAGEPGGPGGGASDTGGPDGCGGEGGGAGGGDGPEEPALPSLEGVSEKQYTI YIAPGGGAFTTLNGSLTLELVNEKFWKVSRPLELCYAPTKDPK" BASE COUNT 344 a 436 c 472 g 277 t ORIGIN 1 ggctgctgtt tctaaaaccc ctttccctct aacccacacc acctttctac tcactgatgc 61 cttcaggaag ccataatgga tggcacagag attgctgttt cccctcggtc actgcattca 121 gaactcatgt gccctatctg cctggacatg ctgaagaata cgatgaccac caaggagtgc 181 ctccacagat tctgctctga ctgcattgtc acagccctac ggagcgggaa caaggagtgt 241 cctacctgcc gaaagaagct ggtgtccaag cgatccctac ggccagaccc caactttgat 301 gccctgatct ctaagatcta tcctagccgg gaggaatacg aggcccatca agaccgagtg 361 cttatccgcc tgagccgcct gcacaaccag caggcattga gctccagcat tgaggagggg 421 ctacgcatgc aggccatgca cagggcccag cgtgtgaggc ggccgatacc agggtcagat 481 cagaccacaa cgatgagtgg gggggaagga gagcccgggg agggagaagg ggatggagaa 541 gatgtgagct cagactccgc ccctgactct gccccaggcc ctgctcccaa gcgaccccgt 601 ggagggggcg caggggggag cagtgtaggg acggggggag gcggcactgg tggggtgggt 661 gggggtgccg gttcggaaga ctctggtgac cggggaggga ctctgggagg gggaacgctg 721 ggccccccaa gccctcctgg ggcccccagc cccccagagc caggtggaga aattgagctc 781 gtgttccggc cccaccccct gctcgtggag aagggagaat actgccagac gaggtatgtg 841 aagacaactg ggaatgccac agtggaccac ctctccaagt acttggccct gcgcattgcc 901 ctcgagcgga ggcaacagca ggaagcaggg gagccaggag ggcctggagg gggcgcctct 961 gacaccggag gacctgatgg gtgtggcggg gagggtgggg gtgccggagg aggtgacggt 1021 cctgaggagc ctgctttgcc cagcctggag ggcgtcagtg aaaagcagta caccatctac 1081 atcgcacctg gaggcggggc gttcacgacg ttgaatggct cgctgaccct ggagctggtg 1141 aatgagaaat tctggaaggt gtcccggcca ctggagctgt gctatgctcc caccaaggat 1201 ccaaagtgac cccaccaggg gacagccaga ggaaggggac catggggtat ccctgtgtcc 1261 tggtctatca ccccagcttc tttgtccccc agtaccccca gcccagccag ccaataagag 1321 gacacaaatg aggacacgtg gcttttatac aaagtatcta tatgagattc ttctatattg 1381 tacagagtgg ggcaaaacac gcccccatct gctgcctttt ccattgccct gcaacgtccc 1441 atctatacga ggtgttggag aaggtgaaga accctcccat tcacgcccgc ctaccaacaa 1501 caaacgtgct tttttcctct ttgaaaaaa // LOCUS HSRING10 1155 bp RNA PRI 09-JUL-1992 DEFINITION H.sapiens RING10 mRNA. ACCESSION X62598 NID g36056 KEYWORDS major histocompatibility complex class II; proteasome-related gene; RING10 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1155) AUTHORS Glynne,R., Powis,S.H., Beck,S., Kelly,A., Kerr,L.A. and Trowsdale,J. TITLE A proteasome-related gene between the two ABC transporter loci in the class II region of the human MHC JOURNAL Nature 353 (6342), 357-360 (1991) MEDLINE 92018193 REFERENCE 2 (bases 1 to 1155) AUTHORS Glynne,R.J. TITLE Direct Submission JOURNAL Submitted (10-FEB-1992) R.J. Glynne, Imperial Cancer Research Fund, P.O.Box 123, Lincoln's Inn Fields, London WC2A 3PX, UK FEATURES Location/Qualifiers source 1..1155 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="blood" /cell_type="T cell" /cell_line="CEM, U937 (gamma interferon induced)" /clone_lib="cDNA in CDM8" /clone="S6C1.1, U1OU3.1" /chromosome="6p21.3" /map="MHC class II" gene 221..1039 /gene="RING10" CDS 221..1039 /gene="RING10" /codon_start=1 /db_xref="PID:g36057" /db_xref="SWISS-PROT:P28062" /translation="MLIGTPTPRDTTPSSWLTSSLLVEAAPLDDTTLPTPVSSGCPGL EPTEFFQSLGGDGERNVQIEMAHGTTTLAFKFQHGVIAAVDSRASAGSYISALRVNKV IEINPYLLGTMSGCAADCQYWERLLAKECRLYYLRNGERISVSAASKLLSNMMCQYRG MGLSMGSMICGWDKKGPGLYYVDEHGTRLSGNMFSTGSGNTYAYGVMDSGYRPNLSPE EAYDLGRRAIAYATHRDSYSGGVVNMYHMKEDGWVKVESTDVSDLLHQYREANQ" BASE COUNT 253 a 306 c 338 g 258 t ORIGIN 1 gggcagaaag ggcacgctct tgtgggtgac tacaggttag gagaccgttg aacctggagg 61 ggccctagga tggaccccgt ggaaagattc agagactgcg ccctctccct ggcgccgcct 121 tcccctacac gcggcgggta tattctgttg cagttggccc aggacctgtt tccaagactc 181 tgccccctcg cacttccgtc cctcctggtt ttgtaaagtg atgctcatag gaacccccac 241 cccgcgtgac actactccca gctcctggct gacttctagt cttctggttg aagctgcgcc 301 tttagatgac acgaccctac ccacccctgt ttccagcgga tgcccgggcc tggagcccac 361 agaattcttc cagtccctgg gtggggacgg agaaaggaac gttcagattg agatggccca 421 tggcaccacc acgctcgcct tcaagttcca gcatggagtg attgcagcag tggattctcg 481 ggcctcagct gggtcctaca ttagtgcctt acgggtgaac aaggtgattg agattaaccc 541 ttacctgctt ggcaccatgt ctggctgtgc agcagactgt cagtactggg agcgcctgct 601 ggccaaggaa tgcaggctgt actatctgcg aaatggagaa cgtatttcag tgtcggcagc 661 ctccaagctg ctgtccaaca tgatgtgcca gtaccggggc atgggcctct ctatgggcag 721 tatgatctgt ggctgggata agaagggtcc tggactctac tacgtggatg aacatgggac 781 tcggctctca ggaaatatgt tctccacggg tagtgggaac acttatgcct acggggtcat 841 ggacagtggc tatcggccta atcttagccc tgaagaggcc tatgaccttg gccgcagggc 901 tattgcttat gccactcaca gagacagcta ttctggaggc gttgtcaata tgtaccacat 961 gaaggaagat ggttgggtga aagtagaaag tacagatgtc agtgacctgc tgcaccagta 1021 ccgggaagcc aatcaataat ggtggtggtg gcagctgggc aggtctcctc tgggaggtct 1081 tggccgactc agggacctaa gccacgttaa gtccaaggag aagaagaggc ctagcctgag 1141 ccaaagagag agtac // LOCUS HSRING4 2824 bp RNA PRI 01-JUL-1992 DEFINITION H.sapiens RING4 cDNA. ACCESSION X57522 NID g36060 KEYWORDS RING4 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2824) AUTHORS Trowsdale,J., Hanson,I., Mockridge,I., Beck,S., Townsend,A. and Kelly,A. TITLE Sequences encoded in the class II region of the MHC related to the 'ABC' superfamily of transporters JOURNAL Nature 348 (6303), 741-744 (1990) MEDLINE 91080927 FEATURES Location/Qualifiers source 1..2824 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="cDNA U937, gamma-interferon induced" /clone="p21U" gene 31..2457 /gene="RING4" CDS 31..2457 /gene="RING4" /note="class II region of NHC" /codon_start=1 /product="peptide transporter" /db_xref="PID:g36061" /translation="MAELLASAGSACSWDFPRAPPSFPPPAASRGGLGGTRSFRPHRG AESPRPGRDRDGVRVPMASSRCPAPRGCRCLPGASLAWLGTVLLLLADWVLLRTALPR IFSLLVPTALPLLRVWAVGLSRWAVLWLGACGVLRATVGSKSENAGAQGWLAALKPLA AALGLALPGLALFRELISWGAPGSADSTRLLHWGSHPTAFVVSYAAALPAAALWHKLG SLWVPGGQGGSGNPVRRLLGCLGSETRRLSLFLVLVVLSSLGEMAIPFFTGRLTDWIL QDGSADTFTRNLTLMSILTIASAVLEFVGDGIYNNTMGHVHSHLQGEVFGAVLRQETE FFQQNQTGNIMSRVTEDTSTLSDSLSENLSLFLWYLVRGLCLLGIMLWGSVSLTMVTL ITLPLLFLLPKKVGKWYQLLEVQVRESLAKSSQVAIEALSAMPTVRSFANEEGEAQKF REKLQEIKTLNQKEAVAYAVNSWTTSISGMLLKVGILYIGGQLVTSGAVSSGNLVTFV LYQMQFTQAVEVLLSIYPRVQKAVGSSEKIFEYLDRTPRCPPSGLLTPLHLEGLVQFQ DVSFAYPNRPDVLVLQGLTFTLRPGEVTALVGPNGSGKSTVAALLQNLYQPTGGQLLL DGKPLPQYEHRYLHRQVAAVGQEPQVFGRSLQENIAYGLTQKPTMEEITAAAVKSGAH SFISGLPQGYDTEVDEAGSQLSGGQRQAVALARALIRKPCVLILDDATSALDANSQLQ VEQLLYESPERYSRSVLLITQHLSLVEQADHILFLEGGAIREGGTHQQLMEKKGCYWA MVQAPADAPE" polyA_signal 2806..2811 polyA_site 2824 BASE COUNT 563 a 811 c 821 g 629 t ORIGIN 1 gcggccgctt tcgatttcgc tttcccctaa atggctgagc ttctcgccag cgcaggatca 61 gcctgttcct gggactttcc gagagccccg ccctcgttcc ctcccccagc cgccagtagg 121 ggaggactcg gcggtacccg gagcttcagg ccccaccggg gcgcggagag tcccagaccc 181 ggccgggacc gggacggcgt ccgagtgcca atggctagct ctaggtgtcc cgctccccgc 241 gggtgccgct gcctccccgg agcttctctc gcatggctgg ggacagtact gctacttctc 301 gccgactggg tgctgctccg gaccgcgctg ccccgcatat tctccctgct ggtgcccacc 361 gcgctgccac tgctccgggt ctgggcggtg ggcctgagcc gctgggccgt gctctggctg 421 ggggcctgcg gggtcctcag ggcaacggtt ggctccaaga gcgaaaacgc aggtgcccag 481 ggctggctgg ctgctttgaa gccattagct gcggcactgg gcttggccct gccgggactt 541 gccttgttcc gagagctgat ctcatgggga gcccccgggt ccgcggatag caccaggcta 601 ctgcactggg gaagtcaccc taccgccttc gttgtcagtt atgcagcggc actgcccgca 661 gcagccctgt ggcacaaact cgggagcctc tgggtgcccg gcggtcaggg cggctctgga 721 aaccctgtgc gtcggcttct aggctgcctg ggctcggaga cgcgccgcct ctcgctgttc 781 ctggtcctgg tggtcctctc ctctcttggg gagatggcca ttccattctt tacgggccgc 841 ctcactgact ggattctaca agatggctca gccgatacct tcactcgaaa cttaactctc 901 atgtccattc tcaccatagc cagtgcagtg ctggagttcg tgggtgacgg gatctataac 961 aacaccatgg gccacgtgca cagccacttg cagggagagg tgtttggggc tgtcctgcgc 1021 caggagacgg agtttttcca acagaaccag acaggtaaca tcatgtctcg ggtaacagag 1081 gacacgtcca ccctgagtga ttctctgagt gagaatctga gcttatttct gtggtacctg 1141 gtgcgaggcc tatgtctctt ggggatcatg ctctggggat cagtgtccct caccatggtc 1201 accctgatca ccctgcctct gcttttcctt ctgcccaaga aggtgggaaa atggtaccag 1261 ttgctggaag tgcaggtgcg ggaatctctg gcaaagtcca gccaggtggc cattgaggct 1321 ctgtcggcca tgcctacagt tcgaagcttt gccaacgagg agggcgaagc ccagaagttt 1381 agggaaaagc tgcaagaaat aaagacactc aaccagaagg aggctgtggc ctatgcagtc 1441 aactcctgga ccactagtat ttcaggtatg ctgctgaaag tgggaatcct ctacattggt 1501 gggcagctgg tgaccagtgg ggctgtaagc agtgggaacc ttgtcacatt tgttctctac 1561 cagatgcagt tcacccaggc tgtggaggta ctgctctcca tctaccccag agtacagaag 1621 gctgtgggct cctcagagaa aatatttgag tacctggacc gcacccctcg ctgcccaccc 1681 agtggtctgt tgactccctt acacttggag ggccttgtcc agttccaaga tgtctccttt 1741 gcctacccaa accgcccaga tgtcttagtg ctacaggggc tgacattcac cctacgccct 1801 ggcgaggtga cggcgctggt gggacccaat gggtctggga agagcacagt ggctgccctg 1861 ctgcagaatc tgtaccagcc caccggggga cagctgctgt tggatgggaa gccccttccc 1921 caatatgagc accgctacct gcacaggcag gtggctgcag tgggacaaga gccacaggta 1981 tttggaagaa gtcttcaaga aaatattgcc tatggcctga cccagaagcc aactatggag 2041 gaaatcacag ctgctgcagt aaagtctggg gcccatagtt tcatctctgg actccctcag 2101 ggctatgaca cagaggtaga cgaggctggg agccagctgt cagggggtca gcgacaggca 2161 gtggcgttgg cccgagcatt gatccggaaa ccgtgtgtac ttatcctgga tgatgccacc 2221 agtgccctgg atgcaaacag ccagttacag gtggagcagc tcctgtacga aagccctgag 2281 cggtactccc gctcagtgct tctcatcacc cagcacctca gcctggtgga gcaggctgac 2341 cacatcctct ttctggaagg aggcgctatc cgggaggggg gaacccacca gcagctcatg 2401 gagaaaaagg ggtgctactg ggccatggtg caggctcctg cagatgctcc agaatgaaag 2461 ccttctcaga cctgcgcact ccatctccct cccttttctt ctctctgtgg tggagaacca 2521 cagctgcaga gtagcagctg cctccaggat gagttacttg aaatttgcct tgagtgtgtt 2581 acctcctttc caagctcctc gtgataatgc agacttcctg gagtacaaac acaggatttg 2641 taattcctac tgtaacggag tttagagcca gggctgatgc tttggtgtgg ccagcactct 2701 gaaactgaga aatgttcaga atgtacggaa agatgatcag ctattttcaa cataactgaa 2761 ggcatatgct ggcccataaa caccctgtag gttcttgata tttataataa aattggtgtt 2821 ttgt // LOCUS HSRING6 1100 bp RNA PRI 14-OCT-1994 DEFINITION Human RING6 mRNA for HLA class II alpha chain-like product. ACCESSION X62744 NID g36062 KEYWORDS HLA class II antigen. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1100) AUTHORS Kelly,A.P., Monaco,J.J., Cho,S.G. and Trowsdale,J. TITLE A new human HLA class II-related locus, DM JOURNAL Nature 353 (6344), 571-573 (1991) MEDLINE 92018223 REFERENCE 2 (bases 1 to 1100) AUTHORS Kelly,A.P. TITLE Direct Submission JOURNAL Submitted (11-FEB-1992) Adrian P Kelly, Human Immunogenetics, Imperial Cancer Research Fund, 44 Lincoln's Inn Fields, London, WC2A 3PX, United Kingdom FEATURES Location/Qualifiers source 1..1100 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="B lymphoblastoid" /clone="RING6" /chromosome="6" gene 46..831 /gene="RING6" CDS 46..831 /gene="RING6" /note="HLA class II alpha chain-like" /codon_start=1 /product="RING6" /db_xref="PID:g36063" /db_xref="SWISS-PROT:P28067" /translation="MGHEQNQGAALLQMLPLLWLLPHSWAVPEAPTPMWPDDLQNHTF LHTVYCQDGSPSVGLSEAYDEDQLFFFDFSQNTRVPRLPEFADWAQEQGDAPAILFDK EFCEWMIQQIGPKLDGKIPVSRGFPIAEVFTLKPLEFGKPNTLVCFVSNLFPPMLTVN WHDHSVPVEGFGPTFVSAVDGLSFQAFSYLNFTPEPSDIFSCIVTHEIDRYTAIAYWV PRNALPSDLLENVLCGVAFGLGVLGIIVGIVLIIYFRKPCSGD" BASE COUNT 247 a 305 c 264 g 284 t ORIGIN 1 ctaaagctgg gttggtagct cctacctact gtgtggcaag aaggtatggg tcatgaacag 61 aaccaaggag ctgcgctgct acagatgtta ccacttctgt ggctgctacc ccactcctgg 121 gccgtccctg aagctcctac tccaatgtgg ccagatgacc tgcaaaacca cacattcctg 181 cacacagtgt actgccagga tgggagtccc agtgtgggac tctctgaggc ctacgacgag 241 gaccagcttt tcttcttcga cttttcccag aacactcggg tgcctcgcct gcccgaattt 301 gctgactggg ctcaggaaca gggagatgct cctgccattt tatttgacaa agagttctgc 361 gagtggatga tccagcaaat agggccaaaa cttgatggga aaatcccggt gtccagaggg 421 tttcctatcg ctgaagtgtt cacgctgaag cccctggagt ttggcaagcc caacactttg 481 gtctgttttg tcagtaatct cttcccaccc atgctgacag tgaactggca cgatcattcc 541 gtccctgtgg aaggatttgg gcctactttt gtctcagctg tcgatggact cagcttccag 601 gccttttctt acttaaactt cacaccagaa ccttctgaca ttttctcctg cattgtgact 661 cacgaaattg accgctacac agcaattgcc tattgggtac cccggaacgc actgccctca 721 gatctgctgg agaatgtgct gtgtggcgtg gcctttggcc tgggtgtgct gggcatcatc 781 gtgggcattg ttctcatcat ctacttccgg aagccttgct caggtgactg attcttccag 841 accagagttt gatgccagca gcttcggcca tccaaacaga ggatgctcag atttctcaca 901 tcctgcccag gatctcctct tagggtagaa gaagtctctg ggacatccct ggggtgtgtg 961 tgtagatttc ccacctgggg actctgctgt ccctgggctt gcatcccagg gatcccagag 1021 tggcctgcct atcacaacca catcccttcc ccccacaagg caataaatct catttcttta 1081 aaaaaaaaaa aaaaaaaaaa // LOCUS HSRINGENE 1113 bp RNA PRI 02-DEC-1996 DEFINITION H.sapiens mRNA for RIN protein. ACCESSION Y07565 NID g1702925 KEYWORDS calmodulin; ras-like GTPase; Rin gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1113) AUTHORS Wes,P.D., Yu,M. and Montell,C. TITLE RIC, a calmodulin-binding Ras-like GTPase JOURNAL EMBO J. 15 (21), 5839-5848 (1996) MEDLINE 97076145 REFERENCE 2 (bases 1 to 1113) AUTHORS Wes,P.D. TITLE Direct Submission JOURNAL Submitted (16-AUG-1996) P.D. Wes, John Hopkins University School Medicine, Department of Biological Chemistry, 725 N.Wolfe Street, Baltimore, MD 21205, USA FEATURES Location/Qualifiers source 1..1113 /organism="Homo sapiens" /db_xref="taxon:9606" gene 175..828 /gene="RIN" CDS 175..828 /gene="RIN" /codon_start=1 /product="RIN (Ric-related gene expressed in neurons)" /db_xref="PID:e276465" /db_xref="PID:g1702926" /translation="MEVENEASCSPGSASGGSREYKVVMLGAGGVGKSAMTMQFISHQ FPDYHDPTIEDAYKTQVRIDNEPAYLDILDTAGQAEFTAMREQYMRGGEGFIICYSVT DRQSFQEAAKFKELIFQVRHTYEIPLVLVGNKIDLEQFRQVSTEEGLSLAQEYNCGFF ETSAALRFCIDDAFHGLVREIRKKESMPSLMEKKLKRKDSLWKKLKGSLKKKRENMT" BASE COUNT 307 a 232 c 258 g 316 t ORIGIN 1 gggttgctat gagacccgct cgagcggagg accagcagcc agcccgacgc tgatggttct 61 tacctcgtac taaaaccttt gctttgacac agttttagag ttgcttaata ttcgagcaag 121 cacctgacac gggtgacttt ctccttcttt ttttcctccg gtccctcggg taagatggag 181 gtagaaaatg aagccagctg ctccccgggc agcgcatcag gcgggtccag agagtacaag 241 gtggtaatgc tgggagcagg gggagttggt aaaagcgcaa tgacaatgca gtttattagt 301 catcagttcc ctgattatca tgaccctact atagaagatg cttataagac ccaggtcagg 361 attgacaatg agccagctta cttggacatc ttggacactg ctggccaggc agaattcaca 421 gccatgcggg agcagtacat gcgaggtggg gaaggcttca tcatctgcta ctccgtcact 481 gaccgtcaat catttcagga ggctgccaag tttaaagagc tcatttttca ggtccgccac 541 acctatgaaa ttcccctggt gctggtgggt aacaaaattg atctggaaca gttccgccag 601 gtttctacag aagaaggctt gagtcttgcc caagaatata attgtggttt ttttgagacc 661 tctgcagccc tcagattctg tattgatgat gcttttcatg gcttagtgag ggaaattcgc 721 aagaaggagt ccatgccatc cttgatggaa aagaaactga agagaaaaga cagcctgtgg 781 aagaagctca aaggttcttt gaagaagaag agagaaaata tgacatgata tctttgcttt 841 tgagttcctc acgctctctg aattttatta gttggacaat tccatatgta gcattctgct 901 tcaatattat ctctctatgt gtctctctct ctttaaatat ctgcctgtag gtaaaagcaa 961 gctctgcata tctgtacctc ttgagatagt tttgttttgc ctttaacagt tggatggatt 1021 ttgtcaatca gctggatatg ctgtttagtt tttacaacat agtactaaat aaaattgaca 1081 ttccaattgc ctcaaaaaaa aaaaaaaaaa aaa // LOCUS HSRINGR 1037 bp RNA PRI 08-JAN-1997 DEFINITION H.sapiens mRNA for RING protein. ACCESSION Y07828 NID g1770498 KEYWORDS major histocompatibility complex; RING protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1037) AUTHORS Henry,J., Ribouchon,M.T., Depetris,D., Mattei,M.G., Offer,C., Tazi-Ahnini,R. and Pantarotti,P. TITLE Cloning, structural analysis and mapping of B30 and B7 family members, to the MHC and other chromosomal regions. Toward the identification of the ancestral major histocompatability complex JOURNAL Unpublished REFERENCE 2 (bases 1 to 1037) AUTHORS Pontarotti,P. TITLE Direct Submission JOURNAL Submitted (06-SEP-1996) P. Pontarotti, Unite 119 INSERM, 27 bd.Lei Roure, 13009 Marseille, FRANCE FEATURES Location/Qualifiers source 1..1037 /organism="Homo sapiens" /db_xref="taxon:9606" /map="6p21.3" /chromosome="6" CDS 110..820 /note="expression:colon" /codon_start=1 /product="put. ring protein" /db_xref="PID:e283119" /db_xref="PID:g1770499" /translation="MASGQFVNKLQEEVICPICLDILQKPVTIDCGHNFCPQCITQIG ETSCGFFKCPLCKTSVRRDAIRFNSLLRNLVEKIQALQASEVQSKRKEATCPRHQEMF HYFCEDDGKFLCFVCRESKDHKSHNVSLIEEAAQNYQGQIQEQIQVLQQKEKETVQVK AQGVHRVDVFTDQVEHEKQRILTEFELLHQVLEEEKNFLLSRIYWLGHEGTEAGKHYV EIPLMPTVERSQEARCYP" BASE COUNT 323 a 241 c 256 g 217 t ORIGIN 1 tctagaggat ccgccaagag gtagaagaaa cagtatccac agtggactcc ggggctccta 61 gacttggcac agcttcctac agtcttgaaa cagccctgtt gttcctgtca tggccagtgg 121 gcagtttgtg aacaaactgc aagaggaagt gatctgcccc atctgcctgg acattctgca 181 gaaacctgtc accatcgact gtgggcacaa tttctgccct caatgcatca ctcagattgg 241 ggaaacatca tgtggatttt tcaaatgtcc cctctgcaaa acttccgtaa gaagagacgc 301 aatcaggttc aactcgctgt tgcggaatct ggtggagaaa atccaagctc tacaagcctc 361 tgaggtgcag tccaaaagga aagaggctac atgcccgagg caccaggaga tgttccacta 421 tttctgcgag gatgatggga agttcctctg ttttgtgtgt cgtgaatcca aggaccacaa 481 atcccataat gtcagcttga tcgaagaagc tgcccagaat tatcaggggc agattcaaga 541 gcagatccaa gtcttgcagc aaaaggagaa ggagacagta caagtgaagg cacaaggtgt 601 acacagggtc gatgtcttca cggaccaggt agaacatgag aagcaaagga tcctcacaga 661 atttgaactc ctgcatcaag tcctagagga ggagaagaat ttcctgctat cacggattta 721 ctggctgggt catgagggaa cggaagcggg gaaacactat gttgaaattc cactgatgcc 781 cacagttgaa cgatctcaag aagctcgttg ttacccctga agaccaatgg cagaactagc 841 cacccaggca gctgctggag gtatacaaag tcgtcttgtg gcagaagtga agagtttcag 901 tttctcaacc caacccctgt tcctctggaa ctggagaaaa actcagtgaa ggcaaaatca 961 agacacgact ccatcacagg gagcctgaaa aaattcaaag accaactcca ggcagataga 1021 aaaaaaaaaa aaaaaaa // LOCUS HSRIP140 7247 bp RNA PRI 09-AUG-1995 DEFINITION H.sapiens mRNA for nuclear factor RIP140. ACCESSION X84373 NID g940538 KEYWORDS nuclear factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7247) AUTHORS Cavailles,V., Dauvois,S., L'Horset,F., Lopez,G., Hoare,S., Kushner,P.J. and Parker,M.G. TITLE Nuclear factor RIP140 modulates transcriptional activation by the estrogen receptor JOURNAL EMBO J. 14 (15), 3741-3751 (1995) MEDLINE 95369246 REFERENCE 2 (bases 1 to 7247) AUTHORS Parker,M.G. TITLE Direct Submission JOURNAL Submitted (02-FEB-1995) M.G. Parker, Imperial Cancer Research Fund, 44 Lincoln's Inn Fields, PO Box 123, London WC2A 3PX, UK COMMENT Related sequences:T07281 and HUMGS01390. FEATURES Location/Qualifiers source 1..7247 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="breast" /cell_type="epithelium" /cell_line="ZR75-1" /chromosome="21" /map="21q11.2" CDS 288..3764 /codon_start=1 /product="nuclear factor RIP140" /db_xref="PID:g940539" /db_xref="SWISS-PROT:P48552" /translation="MTHGEELGSDVHQDSIVLTYLEGLLMHQAAGGSGTAVDKKSAGH NEEDQNFNISGSAFPTCQSNGPVLNTHTYQGSGMLHLKKARLLQSSEDWNAAKRKRLS DSIMNLNVKKEALLAGMVDSVRKGKQDSTLLASLLQSFSSRLQTVALSQQIRQSLKEQ GYALSHDSLKVEKDLRCYGVASSHLKTLLKKSKVKDQKPDTNLPDVTKNLIRDRFAES PHHVGQSGTKVMSEPLSCAARLQAVASMVEKRASPATSPKPSVACSQLALLLSSEAHL QQYSREHALKTQNANQAASERLAAMARLQENGQKDVGSYQLPKGMSSHLNGQARTSSS KLMASKSSATVFQNPMGIIPSSPKNAGYKNSLERNNIKQAANNSLLLHLLKSQTIPKP MNGHSHSERGSIFEESSTPTTIDEYSDNNPSFTDDSSGDESSYSNCVPIDLSCKHGTE KSESDQPVSLDNFTQSLLNTWDPKVPDVDIKEDQDTSKNSKLNSHQKVTLLQLLLGHK NEENVEKNTSPQGVHNDVSKFNTQNYARTSVIESPSTNRTTPVSTPPLLTSSKAGSPI NLSQHSLVIKWNSPPYVCSTQSEKLTNTASNHSMDLTKSKDPPGEKPAQNEGAQNSAT FSASKLLQNLAQCGMQSSMSVEEQRPSKQLLTGNTDKPIGMIDRLNSPLLSNKTNAVE ENKAFSSQPTGPEPGLSGSEIENLLERRTVLQLLLGNPTKGRVKKKEKTPLRDESTQE HSERALSEQILMVKIKSEPCDDLQIPNTNVHLSHDAKSAPFLGMAPAVQRSAPALPVS EDFKSEPVSPQDFSFSKNGLLSRLLRQNQDSYLADDSDRSHRNNEMALLESKNLCMVP KKRKLYTEPLENPFKKMKNNIVDAANNHSAPEVLYGSLLNQEELKFSRNDLEFKYPAG HGSASESEHRSWARESKSFNVLKQLLLSENCVRDLSPHRSNSVADSKKKGHKNNVTNS KPEFSISSLNGLMYSSTQPSSCMDNRTFSYPGVVKTPVSPTFPEHLGCAGSRPESGLL NGCSMPSEKGPIKWVITDAEKNEYEKDSPRLTKTNPILYYMLQKGGNSVASRETQDKD IWREASSAESVSQVTAKEELLPTAETKASFFNLRSPYNSHMGNNASRPHSANGEVYGL LGSVLTIKKESE" BASE COUNT 2409 a 1316 c 1374 g 2148 t ORIGIN 1 gggaatatat tcactaagga ttctatctgc ttactgctac agacctatgt gttaaggaat 61 tcttctcctc ctccttgcgt agaagttgat cagcactgtg gtcagactgc atttatcttg 121 tcattgccag aagaaatctt ggacagaatg taacagtacg tctctctctg attgcgatgg 181 aaggtgataa actgatactc ctttattaaa gttacatcgc actcaccaca gaaaaccatt 241 ctttaaagtg aatagaaacc aagcccttgt gaacacttct attgaacatg actcatggag 301 aagagcttgg ctctgatgtg caccaggatt ctattgtttt aacttaccta gaaggattac 361 taatgcatca ggcagcaggg ggatcaggta ctgccgttga caaaaagtct gctgggcata 421 atgaagagga tcagaacttt aacatttctg gcagtgcatt tcccacctgt caaagtaatg 481 gtccagttct caatacacat acatatcagg gatctggcat gctgcacctc aaaaaagcca 541 gactgttgca gtcttctgag gactggaatg cagcaaagcg gaagaggctg tctgattcta 601 tcatgaattt aaacgtaaag aaggaagctt tgctagctgg catggttgac agtgtccgta 661 aaggcaaaca ggatagcaca ttactggcct ctttgcttca gtcattcagc tctaggctgc 721 agactgttgc tctgtcacaa caaatcaggc agagcctcaa ggagcaagga tatgccctca 781 gtcatgattc tttaaaagtg gagaaggatt taaggtgcta tggtgttgca tcaagtcact 841 taaaaacttt gttgaagaaa agtaaagtta aagatcaaaa gcctgatacg aatcttcctg 901 atgtgactaa aaacctcatc agagataggt ttgcagagtc tcctcatcat gttggacaaa 961 gtggaacaaa ggtcatgagt gaaccgttgt catgtgctgc aagattacag gctgttgcaa 1021 gcatggtgga aaaaagggct agtcctgcca cctcacctaa acctagtgtt gcttgtagcc 1081 agttagcatt acttctgtca agcgaagccc atttgcagca gtattctcga gaacacgctt 1141 taaaaacgca aaatgcaaat caagcagcaa gtgaaagact tgctgctatg gccagattgc 1201 aagaaaatgg ccagaaggat gttggcagtt accagctccc aaaaggaatg tcaagccatc 1261 ttaatggtca ggcaagaaca tcatcaagca aactgatggc tagcaaaagt agtgctacag 1321 tgtttcaaaa tccaatgggt atcattcctt cttcccctaa aaatgcaggt tataagaact 1381 cactggaaag aaacaatata aaacaagctg ctaacaatag tttgctttta catcttctta 1441 aaagccagac tatacctaag ccaatgaatg gacacagtca cagtgagaga ggaagcattt 1501 ttgaggaaag tagtacacct acaactattg atgaatattc agataacaat cctagtttta 1561 cagatgacag cagtggtgat gaaagttctt attccaactg tgttcccata gacttgtctt 1621 gcaaacacgg aactgaaaaa tcagaatctg accaacctgt ttccctggat aacttcactc 1681 aatccttgct aaacacttgg gatccaaaag tcccagatgt agatatcaaa gaagatcaag 1741 atacctcaaa gaattctaag ctaaactcac accagaaagt aacacttctt caattgctac 1801 ttggccataa gaatgaagaa aatgtagaaa aaaacaccag ccctcaggga gtacacaatg 1861 atgtgagcaa gttcaataca caaaattatg caaggacttc tgtgatagaa agccccagta 1921 caaatcggac tactccagtg agcactccac ctttacttac atcaagcaaa gcagggtctc 1981 ccatcaatct ctctcaacac tctctggtca tcaaatggaa ttccccacca tatgtctgca 2041 gtactcagtc tgaaaagcta acaaatactg catctaacca ctcaatggac cttacaaaaa 2101 gcaaagaccc accaggagag aaaccagccc aaaatgaagg tgcacagaac tctgcaacgt 2161 ttagtgccag taagctgtta caaaatttag cacaatgtgg aatgcagtca tccatgtcag 2221 tggaagagca gagacccagc aaacagctgt taactggaaa cacagataaa ccgataggta 2281 tgattgatag attaaatagc cctttgctct caaataaaac aaatgcagtt gaagaaaata 2341 aagcatttag tagtcaacca acaggtcctg aaccagggct ttctggttct gaaatagaaa 2401 atctgcttga aagacgtact gtcctccagt tgctcctggg gaacccaaca aagggaagag 2461 tgaaaaaaaa agagaaaact cccttaagag atgaaagtac tcaggaacac tcagagagag 2521 ctttaagtga acaaatactg atggtgaaaa taaaatctga gccttgtgat gacttacaaa 2581 ttcctaacac aaatgtgcac ttgagccatg atgctaagag tgccccattc ttgggtatgg 2641 ctcctgctgt gcagagaagc gcacctgcct taccagtgtc cgaagacttt aaatcggagc 2701 ctgtttcacc tcaggatttt tctttctcca agaatggtct gctaagtcga ttgctaagac 2761 aaaatcaaga tagttacctg gcagatgatt cagacaggag tcacagaaat aatgaaatgg 2821 cacttctaga atcaaagaat ctttgcatgg tccctaagaa aaggaagctt tatactgagc 2881 cattagaaaa tccatttaaa aagatgaaaa acaacattgt tgatgctgca aacaatcaca 2941 gtgccccaga agtactgtat gggtccttgc ttaaccagga agagctgaaa tttagcagaa 3001 atgatcttga atttaaatat cctgctggtc atggctcagc cagcgaaagt gaacacagga 3061 gttgggccag agagagcaaa agctttaatg ttctgaaaca gctgcttctc tcagaaaact 3121 gtgtgcgaga tttgtccccg cacagaagta actctgtggc tgacagtaaa aagaaaggac 3181 acaaaaataa tgtgaccaac agcaaacctg aatttagcat ttcttcttta aatggactga 3241 tgtacagttc cactcagccc agcagttgca tggataacag gacattttca tacccaggtg 3301 tagtaaaaac tcctgtgagt cctactttcc ctgagcactt gggctgtgca gggtctagac 3361 cagaatctgg gcttttgaat gggtgttcca tgcccagtga gaaaggaccc attaagtggg 3421 ttatcactga tgcggagaag aatgagtatg aaaaagactc tccaagattg accaaaacca 3481 acccaatact atattacatg cttcaaaaag gaggcaattc tgttgccagt cgagaaacac 3541 aagacaagga catttggagg gaggcttcat ctgctgaaag tgtctcacag gtcacagcca 3601 aagaagagtt acttcctact gcagaaacga aagcttcttt ctttaattta agaagccctt 3661 acaatagcca tatgggaaat aatgcttctc gcccacacag cgcaaatgga gaagtttatg 3721 gacttctggg aagcgtgcta acgataaaga aagaatcaga ataaaatgta cctgccatcc 3781 agttttggat ctttttaaaa ctaatgagta tgaacttgag atctgtataa ataagagcat 3841 gatttgaaaa aagcatggta taattgaaac ttttttcatt ttgaaaagta ttggttactg 3901 gtgatgttga aatatgcata ctaatttttg cttaacatta gatgtcatga ggaaactact 3961 gaactagcaa ttggttgttt aacacttctg tatgcgtcag ataacaactg tgagtagcct 4021 atgaatgaaa ttcttttata aatattaggc ataaattaaa atgtaaaact ccattcatag 4081 tggattaatg cattttgctg cctttattag ggtactttat tttgcttttc agaagtcagc 4141 ctacataaca catttttaaa gtctaaactg ttaaacaact ctttaaagga taattatcca 4201 ataaaaaaaa acctagtgct gattcacagc ttattatcca attcaaaaat aaattagaaa 4261 aatatatgct tacatttttc acttttgcta aaaagaaaaa aaaaggtgtt tatttttaac 4321 tcttggaaga ggttttgtgg ttcccaatgt gtctgtccca ccctgagcct tttcaatata 4381 tatttcttta aaccttgtgc tacttagtaa aaattgatta caattgaggg aagtttgata 4441 gatcctttaa aaaaaaggca gatttccatt ttttgtattt taactacttt actaaattaa 4501 tactcctcct tttacaaatt agaaaagtta acatttatct ttaggtggtt tcctgaaaag 4561 ttgaatattt aagaaattgt ttttaacaga agcaaaatgg cttttctttg gacagttttc 4621 accatctctt gtaaaagtta attctcacca ttcctgtggt acctgcgagt gttatgacca 4681 ggattcctta aacctgaact cagaccactt gcattagaac catctggagc acttgtttta 4741 aaatgcagat tcataggcag catctcagat ctacagaaca agaatctctg ctaagtggac 4801 ctggaatctt ccatctgcat cttaacatgc tctctaggtg tttcttgtgt ttgagaacca 4861 tgacttatga ctttcctcag aacatgagac tgtaaaacaa aaacaaaaaa ctatgtgatg 4921 cctctatttt ccccaataca gtcacacatc agctcaaaat ttgcaatatt gtagttatat 4981 attaccgtta tatctttgga aatccgggtt cagaacactt tttatgacaa aaattgggtg 5041 gaggggataa ctttcatatc tggctcaaca tctcaggaaa atctgtgatt atttgtgtgt 5101 tctaatgagt aacatctact tagttagcct tagggatgga aaaacagggc cacttaccaa 5161 actcaggtga ttccaggatg gtttggaaac ttctcctgaa tgcatcctta acctttatta 5221 aaaccattgt cctaagaaca atgccaacaa agcttacaac atttagttta aacccaagaa 5281 ggacactaaa ctagattgac taataaaagt acaaggcaca tatacgtgac agaattggta 5341 ctcaatcact ccattggatc ttttacttta aagtagtgat gaaaagtaca tgttgatact 5401 gtcttagaag aaattaatat attagtgaag ccacatgggg tttcagttgc gaaacaggtc 5461 tgtttttatg ttcagtttgt acaatccaca attcattcac cagatatttt gttcttaatt 5521 gtgttccagg ttagcaaatg acctatcaaa aattattcta taatcactac tagttaggat 5581 attgatttaa aattgttcta cttgaagtgg tttctaagat ttttatattt aaaaataggt 5641 gtgatttcct aatatgatct aaaaccctaa atggttattt ttcctcagaa tgatttgtaa 5701 atagctactg gaaatattat acagtaatag gagtgggtat tatgcaacat catggagagt 5761 gaaggcatag gcttattctg acataaaatt ccactggcca gttgaatata ttctattcca 5821 tgtccatact ctgacaatct tattgtcaac actatataaa taagctttta aacaagtcat 5881 ttttcttgat cgttgtggaa ggtttggagc cttagaggta tgtcagaaaa aatatgttgg 5941 tattctccct tgggtagggg gaaatgacct ttttacaaga gagatgaaat ttaggtcagg 6001 gaaaagacca agggccagca tgctactttt gtgtgtgtgt gtgtgggttt tgttttgttt 6061 tttttggttg gctggttgtt ttcattatta ttaacaaagg aatgagaata tgtaatactt 6121 aaataaacat gaccacgaag aatgctgttc tgatttacta gagaatgttc ccaatttgaa 6181 tttagggtga ttttaaagaa cagtgagaaa gggcatacat ccacagattc actttgttta 6241 tgcatatgta gatacaagga tgcacatata cacattttca aggactattt tagatatcta 6301 gacaatttct tctaataaag tcatttgtga aagggtacta cagcttattg acatcagtaa 6361 ggtagcattc attacctgtt tattctctgc tgcatcttac agaagagtaa actggtgaga 6421 gtatatattt tatatatata tatatatata atatgtatat atatatatat tgacttgtta 6481 catgaagatg ttaaaatcgg tttttaaagg tgatgtaaat agtgatttcc ttaatgaaaa 6541 atacatattt tgtattgttc taatgcaaca gaaaagcctt ttaatctctt tggttcctgt 6601 atattccatg tataagtgta aatataatca gacaggttta aaagttgtgc atgtatgtat 6661 acagttgcaa gtctggacaa atgtatagaa taaacctttt atttaagttg tgattacctg 6721 ctgcatgaaa agtggcatgg gggaccctgt gcatctgtgc atttggcaaa atgtcttaac 6781 aaatcagatc agatgttcat cctaacatga cagtattcca tttctggaca tgacgtctgt 6841 ggtttaagct ttgtgaaaga atgtgctttg attcgaaggg tcttaaagaa tttttttaat 6901 cgtcaaccac ttttaaacat aaagaattca cacaactact ttcatgaatt ttttaatccc 6961 attgcaaaca ttattccaag agtatcccag tattagcaat actggaatat aggcacatta 7021 ccattcatag taagaattct ggtgtttaca caaccaaatt tgatgcgatc tgctcagtaa 7081 tataatttgc catttttatt agaaatttaa tttcttcatg tgatgtcatg aaactgtaca 7141 tactgcagtg tgaatttttt tgttttgttt tttaatcttt tagtgtttac ttcctgcagt 7201 gaatttgaat aaatgagaaa aaatgcaaaa aaaaaaaaaa aaaaaaa // LOCUS HSRIREM1 3075 bp RNA PRI 09-JUL-1991 DEFINITION Human mRNA for M1 subunit of ribonucleotide reductase. ACCESSION X59543 NID g36064 KEYWORDS ribonucleotide reductase; ribonucleotide reductase M1 subunit. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3075) AUTHORS Parker,N.J. TITLE Direct Submission JOURNAL Submitted (20-MAY-1991) N.J. Parker, The Royal Melbourne Hospital, P.O. Royal Melbourne Hospital, 3050, AUSTRALIA REFERENCE 2 (bases 1 to 3075) AUTHORS Parker,N.J., Begley,C.G. and Fox,R.M. TITLE Human M1 subunit of ribonucleotide reductase: cDNA sequence and expression in stimulated lymphocytes JOURNAL Nucleic Acids Res. 19 (13), 3741 (1991) MEDLINE 91305124 FEATURES Location/Qualifiers source 1..3075 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="bone marrow" /chromosome="11" mRNA 1..3059 CDS 188..2566 /codon_start=1 /product="M1 subunit of ribonucleotide reductase" /db_xref="PID:g36065" /db_xref="SWISS-PROT:P23921" /translation="MHVIKRDGRQERVMFDKITSRIQKLCYGLNMDFVDPAQITMKVI QGLYSGVTTVELDTLAAETAATLTTKHPDYAILAARIAVSNLHKETKKVFSDVMEDLY NYINPHNGKHSPMVAKSTLDIVLANKDRLNSAIIYDRDFSYNYFGFKTLERSYLLKIN GKVAERPQHMLMRVSVGIHKEDIDAAIETYNLLSERWFTHASPTLFNAGTNRPQLSSC FLLSMKDDSIEGIYDTLKQCALISKSAGGIGVAVSCIRATGSYIAGTNGNSNGLVPML RVYNNTARYVDQGGNKRPGAFAIYLEPWHLDIFEFLDLKKNTGKEEQRARDLFFALWI PDLFMKRVETNQDWSLMCPNECPGLDEVWGEEFEKLYASYEKQGRVRKVVKAQQLWYA IIESQTETGTPYMLYKDSCNRKSNQQNLGTIKCSNLCTEIVEYTSKDEVAVCNLASLA LNMYVTSEHTYDFKKLAEVTKVVVRNLNKIIDINYYPVPEACLSNKRHRPIGIGVQGL ADAFILMRYPFESAEAQLLNKQIFETIYYGALEASCDLAKEQGPYETYEGSPVSKGIL QYDMWNVTPTDLWDWKVLKEKIAKYGIRNSLLIAPMPTASTAQILGNNESIEPYTSNI YTRRVLSGEFQIVNPHLLKDLTERGLWHEEMKNQIIACNGSIQSIPEIPDDLKQLYKT VWEISQKTVLKMAAERGAFIDQSQSLNIHIAEPNYGKLTSMHFYGWKQGLKTGMYYLR TRPAANPIQFTLNKEKLKDKEKVSKEEEEKERNTAAMVCSLENRDECLMCGS" misc_feature 1037 /note="C/A polymorphism" polyA_signal 3036..3041 polyA_site 3059 BASE COUNT 929 a 603 c 705 g 838 t ORIGIN 1 ggggaagggg atttggattg ttgcgcctct gctctgaaga aagtgctgtc tggctccaac 61 tccagttctt tcccctgagc aacgcctgga acctaaccct tcccactctg tcaccttctc 121 gatcccgccg gcgctttaga gcggcagtcc agtcttggat ccttcagagc ctcagccact 181 agctgcgatg catgtgatca agcgagatgg ccgccaagaa cgagtcatgt ttgacaaaat 241 tacatctcga atccagaagc tttgttatgg actcaatatg gattttgttg atcctgctca 301 gatcaccatg aaagtaatcc aaggcttgta cagtggggtc accacagtgg aactagatac 361 tttggctgct gaaacagctg caaccttgac tactaagcac cctgactatg ctatcctggc 421 agccaggatc gctgtctcta acttgcacaa agaaacaaag aaagtgttca gtgatgtgat 481 ggaagacctc tataactaca taaatccaca taatggcaaa cactctccca tggtggccaa 541 gtcaacattg gatattgttc tggccaataa agatcgcctg aattctgcta ttatctatga 601 ccgagatttc tcttacaatt acttcggctt taagacgcta gagcggtctt atttgttgaa 661 gatcaatgga aaagtggctg aaagaccaca acatatgttg atgagagtat ctgttgggat 721 ccacaaagaa gacattgatg cagcaattga aacatataat cttctttctg agaggtggtt 781 tactcatgct tcgcccactc tcttcaatgc tggtaccaac cgcccacaac tttctagctg 841 ttttcttctg agtatgaaag atgacagcat tgaaggcatt tatgacactc taaagcaatg 901 tgcattgatt tctaagtctg ctggaggaat tggtgttgct gtgagttgta ttcgggctac 961 tggcagctac attgctggga ctaatggcaa ttccaatggc cttgtaccga tgctgagagt 1021 atataacaac acagctcgat atgtggatca aggtgggaac aagcgtcctg gggcatttgc 1081 tatttacctg gagccttggc atttagacat ctttgaattc cttgatttaa agaagaacac 1141 aggaaaggaa gagcagcgtg ccagagatct tttctttgct ctttggattc cggatctctt 1201 catgaaacga gtggagacta atcaggactg gtctttgatg tgtccaaatg agtgtcctgg 1261 tctggatgag gtttggggag aggaatttga gaaactatat gcaagttatg agaaacaagg 1321 tcgtgtccgc aaagttgtaa aagctcagca gctttggtat gccatcattg agtctcagac 1381 ggaaacaggc accccgtata tgctctacaa agattcctgt aatcgaaaga gcaaccagca 1441 gaacctggga accatcaaat gcagcaacct gtgcacagaa atagtggagt acaccagcaa 1501 agatgaggtt gctgtttgta atttggcttc cctggccctg aatatgtatg tcacatcaga 1561 acacacatac gactttaaga agttggctga agtcactaaa gtcgttgtcc gaaacttgaa 1621 taaaattatt gatataaact actatcctgt accagaggca tgcctatcaa ataaacgcca 1681 tcgccccatt ggaattgggg tacaaggtct ggcagatgct tttatcctga tgagataccc 1741 ttttgagagt gcagaagccc agttactgaa taagcagatc tttgaaacta tttattatgg 1801 tgctctggaa gccagctgtg accttgccaa ggagcagggc ccatacgaaa cctatgaggg 1861 ctctccagtt agcaaaggaa ttcttcagta tgatatgtgg aatgttactc ctacagacct 1921 atgggactgg aaggttctca aggagaagat tgcaaagtat ggtataagaa acagtttact 1981 tattgccccg atgcctacag cttccactgc tcagatcctg gggaataatg agtccattga 2041 accttacacc agcaacatct atactcgcag agtcttgtca ggagaatttc agattgtaaa 2101 tcctcactta ttgaaagatc ttaccgagcg gggcctatgg catgaagaga tgaaaaacca 2161 gattattgca tgcaatggct ctattcagag cataccagaa attcctgatg acctgaagca 2221 actttataaa actgtgtggg aaatctctca gaaaactgtt ctcaagatgg cagctgagag 2281 aggtgctttc attgatcaaa gccaatcttt gaacatccac attgctgagc ctaactatgg 2341 caaactcact agtatgcact tctacggctg gaagcagggt ttgaagactg ggatgtatta 2401 tttaaggacg agaccagcag ctaatccaat ccagttcact ctaaataagg agaagctaaa 2461 agataaagaa aaggtatcaa aagaggaaga agagaaggag aggaacacag cagccatggt 2521 gtgctctttg gagaatagag atgaatgtct gatgtgtgga tcctgaggaa agacttggaa 2581 gagaccagca tgtcttcagt agccaaacta cttcttgagc atagataggt atagtgggtt 2641 tgcttgaggt ggtaaggctt tgctggaccc tgttgcaggc aaaaggagta attgatttaa 2701 agtactgtta atgatgttaa tgattttttt ttaaactcat atattgggat tttcaccaaa 2761 ataatgcttt tgaaaaaaag aaaaaaaaaa cggatatatt gagaatcaaa gtagaagctt 2821 ttaggaatgc aaaataagtc atcttgcata cagggagtgg ttaagtaagg tttcatcacc 2881 catttagcac tgcttttctg aagacttcag ttttgttaag gagatttagt tttactgctt 2941 tgactggtgg gtcctctaga tgcaaaactg agtgataact catgagaagt actgatagga 3001 cctttatctg gatatggtcc tataggttat tctgaaataa agataaacat ttctaagtga 3061 aaaaaaaaaa aaaaa // LOCUS HSRITGENE 1112 bp RNA PRI 02-DEC-1996 DEFINITION H.sapiens mRNA for RIT protein. ACCESSION Y07566 NID g1702927 KEYWORDS calmodulin; ras-like GTPase; Rit gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1112) AUTHORS Wes,P.D., Yu,M. and Montell,C. TITLE RIC, a calmodulin-binding Ras-like GTPase JOURNAL EMBO J. 15 (21), 5839-5848 (1996) MEDLINE 97076145 REFERENCE 2 (bases 1 to 1112) AUTHORS Wes,P.D. TITLE Direct Submission JOURNAL Submitted (16-AUG-1996) P.D. Wes, John Hopkins University School Medicine, Department of Biological Chemistry, 725 N.Wolfe Street, Baltimore, MD 21205, USA FEATURES Location/Qualifiers source 1..1112 /organism="Homo sapiens" /db_xref="taxon:9606" gene 146..805 /gene="RIT (Ric-related gene expressed in many tissues)" CDS 146..805 /gene="RIT (Ric-related gene expressed in many tissues)" /codon_start=1 /db_xref="PID:e276470" /db_xref="PID:g1702928" /translation="MDSGTRPVGSCCSSPAGLSREYKLVMLGAGGVGKSAMTMQFISH RFPEDHDPTIEDAYKIRIRIDDEPANLDILDTAGQAEFTAMRDQYMRAGEGFIICYSI TDRRSFHEVREFKQLIYRVRRTDDTPVVLVGNKSDLKQLRQVTKEEGLALAREFSCPF FETSAAYRYYIDDVFHALVREIRRKEKEAVLAMEKKSKPKNSVWKRLKSPFRKKKDSV T" BASE COUNT 315 a 220 c 290 g 286 t 1 others ORIGIN 1 ggcacgaggc gcgagtgaag gaagacgaag tgcgtgaccc gaccggctgt ggtgttccag 61 tccccactga ccagtaggag cagcagggcg tcggcttgtg aggtggcttt tcctcggggc 121 aacccaggaa ggccccaaga ggacaatgga ttctggaact cgcccagttg gtagctgctg 181 tagcagcccc gctgggctct cacgggagta caaactagtg atgctgggtg ctggtggtgt 241 agggaagagt gccatgacca tgcagttcat cagccaccga ttcccagaag atcatgatcc 301 caccattgaa gatgcttata agatcaggat ccgtattgat gatgagcctg ccaatctgga 361 cattttggat acagctggac aggcagagtt tacagccatg cgggaccagt atatgagggc 421 aggagaaggg tttatcatct gttactctat cacggatcgt cgaagtttcc atgaagttcg 481 wgagtttaaa cagcttattt atcgagtccg acgtactgac gatacacctg tggttcttgt 541 gggaaacaag tcagacctca aacagctaag acaggtcacc aaggaagaag gattggcctt 601 ggcccgagaa ttcagctgtc ccttttttga gacatctgct gcataccgct actatattga 661 tgatgttttc catgcccttg tacgggagat acgtaggaaa gaaaaggagg cagtactggc 721 catggagaaa aaatctaagc ccaaaaacag tgtatggaag aggctaaaat caccattccg 781 gaagaagaaa gattcagtaa cttgaagaga agatgtgaag tgtttatctg tgaactgcag 841 tgctgtatca aagcagtcca gtaacctgca gtactgagta tggtgcttgc tctttcactt 901 aactgataag agggacatgc ctactaggag tttttaatga tgtggtattt aaagtattgt 961 ctcttagtta agtatgattt attaacccag tggagcactg tctgctttta aattgtcaca 1021 ttagaatttg ttctaccaat gttttgggtt ctgttgcgct attaattaat gtaaatttgt 1081 ttatacccag gagaaaaaaa aaaaaaaaaa aa // LOCUS HSRNAAHK 1616 bp RNA PRI 19-DEC-1995 DEFINITION H.sapiens mRNA for acidic hair keratin 1. ACCESSION X86570 NID g1134841 KEYWORDS keratin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1616) AUTHORS Fink,P., Rogers,M.A., Korge,B., Winter,H. and Schweizer,J. TITLE A cDNA encoding the human type I hair keratin hHal JOURNAL Biochim. Biophys. Acta 1264 (1), 12-14 (1995) MEDLINE 96038811 REFERENCE 2 (bases 1 to 1616) AUTHORS Winter,H. TITLE Direct Submission JOURNAL Submitted (25-APR-1995) H. Winter, German Cancer Research Center, Research Program 2, Im Neuenheimer Feld 280, 69120 Heidelberg, FRG FEATURES Location/Qualifiers source 1..1616 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda Zap II" /chromosome="17" /map="q12-21" gene 54..1304 /gene="HHa1" CDS 54..1304 /gene="HHa1" /codon_start=1 /product="type I keratin" /db_xref="PID:e148885" /db_xref="PID:g1134842" /translation="MPYNFCLPSLSCRTSCSSRPCVPPSCHSCTLPGACNIPANVSNC NWFCEGSFNGSEKETMQFLNDRLASYLEKVRHVERDNAELENLIRERSQQQEPLLCPS YQWYFKTIEELQQKILCTKSENARLVVQIDNAKLAADDFRTKYQTELSLRHVVESDIN GLRRILDELTLCKSDLEAQVESLKEELLCLKSHHEQEVNTLRCQLGDRLNVEVDAAPT VDLNRVLNETRSQYEALVETNRREVEQWFTTQTEELNKQVVSSSEQLQSYQAEIIELR RTVNALEIELQAQHNLRDSLENTLTESEARYSSQLSQVQSLITNVESQLAEIRSDLER QNQEYQVLLDVRARLECEINTYRSLLESEDCNLPSNPCATTNACSKPIGPCLSNPCTS CVPPAPCTPCAPRPRCGPCNSFVR" BASE COUNT 370 a 506 c 445 g 295 t ORIGIN 1 gagaatttag actctgtctt cagccaggca ctccctccct ccctcccagc actatgccct 61 acaacttctg cctgcccagc ctgagctgcc gcaccagctg ctcctcccgg ccctgcgtgc 121 cccccagctg ccacagctgc accctgcccg gggcctgcaa catccccgcc aatgtgagca 181 actgcaactg gttctgcgag ggctccttca atggtagcga gaaggagact atgcagttcc 241 tgaacgaccg cctggccagc tacctggaga aagtgcgtca cgtggagcgg gacaacgcgg 301 agctggagaa cctcatccgg gagcggtctc agcagcagga gcccttgctg tgccccagtt 361 accagtggta ttttaagacc attgaggagc tccagcagaa gatcctgtgt accaagtctg 421 agaatgccag gcttgtggtg cagatcgaca acgccaagct ggctgcggat gatttcagaa 481 ccaagtacca gaccgagctg tccctgcggc acgtggtgga gtcggacatc aacggtctgc 541 gcaggatcct ggatgagctg accctgtgca agtccgacct ggaggcccag gtggagtccc 601 tgaaggagga gctgctctgc ctcaagagcc accatgagca ggaggtcaat accctgcgct 661 gccagcttgg agaccgcctc aatgtggagg tggatgctgc tcccactgtg gacctgaatc 721 gggtgctgaa cgagaccagg agtcagtatg aggccctggt ggaaaccaac cgcagggaag 781 tggagcaatg gttcaccacg cagaccgagg agctgaacaa gcaggtggta tccagctcag 841 agcagctgca gtcctaccag gcggagatca tcgagctgag acgcacagtc aacgccctgg 901 agatcgagct gcaggcccag cacaacctgc gagactctct ggaaaacacg ctgacagaga 961 gtgaggcccg ctacagctcc cagctgtccc aggtgcagag cctgatcacc aacgtggagt 1021 cccagctggc ggagatccgc agtgacctgg agcggcagaa ccaggagtac caggtgctgc 1081 tggatgtgcg tgcccggctg gagtgtgaga tcaacacata ccggagcctg ctggagagcg 1141 aggactgcaa tctgcccagc aatccctgtg ccacgaccaa cgcgtgcagc aagcccatcg 1201 gaccctgtct ctccaatccc tgtacctctt gtgtccctcc tgccccctgc acaccctgtg 1261 ccccacgccc ccgctgtggg ccctgcaatt ccttcgtgcg ctagaaccta gggaatgcca 1321 gaggagcaag gatgcagggc ccaggactcc agagctgtga cctggctctg gttcaacaaa 1381 aggggcctga aaacatcatt tgcatggctg gagttgcccg cgtaaggcag ccaagaaact 1441 cacccaaagc ctgtagcctc cccaactact ccagactgtc ctgctcaccc tttccttcct 1501 gggggtctgt tccttcctat gctcacccag agaactctct gatgtgccag tggccctccc 1561 ttttaacctc ctaataaata tcatttcctt ggcaaaaaaa aaaaaaaaaa aaaaaa // LOCUS HSRNAAM 2096 bp RNA PRI 31-OCT-1996 DEFINITION H.sapiens mRNA for arginine methyltransferase. ACCESSION X99209 NID g1655624 KEYWORDS arginine methyltransferase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2096) AUTHORS Scott,H.S., Lalioti,M.D., Rossier,C. and Antonarakis,S.E. TITLE Isolation and mapping of two human genes (hHMT1 and hHMT2) homologous to a yeast arginine methyltransferase JOURNAL Unpublished REFERENCE 2 (bases 1 to 2096) AUTHORS Scott,H.S. TITLE Direct Submission JOURNAL Submitted (28-JUN-1996) H.S. Scott, University of Geneva Medical School, Department of Genetics and Microbiology, 1 Rue Michel-Servet, 1211 Geneva, Switzerland FEATURES Location/Qualifiers source 1..2096 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="21" /map="q22.3" CDS 177..1478 /codon_start=1 /product="arginine methyltransferase" /db_xref="PID:e255350" /db_xref="PID:g1655625" /translation="MATSGDCPRSESQGEEPAECSEAGLLQEGVQPEEFVAIADYAAT DETQLSFLRGEKILILRQTTADWWWGERAGCCGYIPANHVGKHVDEYDPEDTWQDEEY FGSYGTLKLHLEMLADQPRTTKYHSVILQNKESLTDKVILDVGCGTGIISLFCAHYAR PRAVYAVEASEMAQHTGQLVLQNGFADIITVYQQKVEDVVLPEKVDVLVSEWMGTCLL FEFMIESILYARDAWLKEDGVIWPTMAALHLVPCSADKDYRSKVLFWDNAYEFNLSAL KSLAVKEFFSKPKYNHILKPEDCLSEPCTILQLDMRTVQISDLETLRGELRFDIRKAG TLHGFTAWFSVHFQSLQEGQPPQVLSTGPFHPTTHWKQTLFMMDDPVPVHTGDVVTGS VVLQRNPVWRRHMSVALSWAVTSRQDPTSQKVGEKVFPIWR" BASE COUNT 484 a 531 c 625 g 456 t ORIGIN 1 ggcacgagct gcactgcgct tgcgcgggtt gagggcggtg gctcagtctc ctggaaagga 61 ccgtccaccc ctccgcgctg gcggtgtgga cgcggaactc agcggagaaa cgcgattgag 121 aaatggaaaa gaaaatgaaa taaatcagca gttatgaggc agagcctaag agaactatgg 181 caacatcagg tgactgtccc agaagtgaat cgcagggaga agagcctgct gagtgcagtg 241 aggcgggtct cctgcaggag ggagtacagc cagaggagtt tgtggccatc gcggactacg 301 ctgccaccga tgagacccag ctcagttttt tgagaggaga aaaaattctt atcctgagac 361 aaaccactgc agattggtgg tggggtgagc gtgcgggctg ctgtgggtac attccggcaa 421 accatgtggg gaagcacgtg gatgagtacg accccgagga cacgtggcag gatgaagagt 481 acttcggcag ctatggaact ctgaaactcc acttggagat gttggcagac cagccacgaa 541 caactaaata ccacagtgtc atcctgcaga ataaagaatc cctgacggat aaagtcatcc 601 tggacgtggg ctgtgggact gggatcatca gtctcttctg tgcacactat gcgcggccta 661 gagcggtgta cgcggtggag gccagtgaga tggcacagca cacggggcag ctggtcctgc 721 agaacggctt tgctgacatc atcaccgtgt accagcagaa ggtggaggat gtggtgctgc 781 ccgagaaggt ggacgtgctg gtgtctgagt ggatggggac ctgcctgctg tttgagttca 841 tgatcgagtc catcctgtat gcccgggatg cctggctgaa agaggacggg gtcatttggc 901 ccaccatggc tgcgttgcac cttgtgccct gcagtgctga taaggattat cgtagcaagg 961 tgctcttctg ggacaacgcg tacgagttca acctcagcgc tctgaaatct ttagcagtta 1021 aggagttttt ttcaaagccc aagtataacc acattttgaa accagaagac tgtctctctg 1081 aaccgtgcac tatattgcag ttggacatga gaaccgtgca aatttctgat ctagagaccc 1141 tgaggggcga gctgcgcttc gacatcagga aggcggggac cctgcacggc ttcacggcct 1201 ggtttagcgt ccacttccag agcctgcagg aggggcagcc gccgcaggtg ctcagcaccg 1261 ggcccttcca ccccaccaca cactggaagc agacgctgtt catgatggac gacccagtcc 1321 ctgtccatac aggagacgtg gtcacgggtt cagttgtgtt gcagagaaac ccagtgtgga 1381 gaaggcacat gtctgtggct ctgagctggg ctgtcacttc cagacaagac cccacatctc 1441 aaaaagttgg agaaaaagtc ttccccatct ggagatgaca gttgatgctt tatttggaaa 1501 gcagtgtgca tatcttgagg ggtgatgaac acaagcaaac caagttgcac ctggcttctg 1561 cacactcctg cgaaagtcgg tgaacattca ctccacattg acccctccct agcctggcag 1621 gtgacgtcag ggtccttcac agacaaacac gcttgggctc ggcaggagct gccgtggcca 1681 cccccgctgc ccagtgtctg ccctctagaa gtaggctgtg tttccaggtg ttcacccgtg 1741 gtgcccacag tgccgacccg tggctgggtc ggagctccat gttcctaagc taggtctagg 1801 tctacactcc taggacgcac gcatatcagc ccgtgtaccc tgtgacagtg actgtcccca 1861 cctcctgtgt tagtggtgcc cttactgccg tcgctcatcc actcgtgtgg gacgtaggat 1921 tgcacagggc tgtgccagtg gcgtgtaggg aacactgccc tggctcagcg tgcgagctaa 1981 ggtggcgatg tatgcgatgg gactctgcat gggatagtac agttgtgtag acgtcttcca 2041 aataaattat gtgttggtgc atcgcacatg ctcaataaat atttttaaat gagtga // LOCUS HSRNABCL9 6267 bp RNA PRI 27-OCT-1997 DEFINITION Homo sapiens mRNA for BCL9 gene. ACCESSION Y13620 NID g2181877 KEYWORDS BCL9 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6267) AUTHORS Willis,T.G. JOURNAL Unpublished REFERENCE 2 (bases 1 to 6267) AUTHORS Willis,T.G. TITLE Direct Submission JOURNAL Submitted (05-JUN-1997) T.G. Willis, Institute of Cancer Research, Academic Haematology and Cytogenetics, Haddow Laboratories, Sutton, Surrey SM2 5NG, UK FEATURES Location/Qualifiers source 1..6267 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /germline /map="q21" gene 740..4924 /gene="BCL9" CDS 740..4924 /gene="BCL9" /codon_start=1 /db_xref="PID:e1154538" /db_xref="PID:g2570024" /translation="MHSSNPKVRSSPSGNTQSSPKSKQEVMVRPPTVMSPSGNPQLDS KFSNQGKQGGSASQSQPSPCDSKSGGHTPKALPGPGGSMGLKNGAGNGAKGKGKRERS ISADSFDQRDPGTPNDDSDIKECNSADHIKSQDSQHTPHSMTPSNATAPRSSTPSHGQ TTATEPTPAQKTPAKVVYVFSTEMANKAAEAVLKGQVETIVSFHIQNISNNKTERSTA PLNTQISALRNDPKPLPQQPPVPANQDQNSSQNTRLQPTPPIPAPAPKPAAPPRPLDR ESPGVENKLIPSVGSPASSTPLPPDGTGPNSTPNNRAVTPVSQGSNSSSADPKAPPPP PVSSGEPPTLGENPDGLSQEQLEHRERSLQTLRDIQRMLFPDEKEFTGAQSGGPQQNP GVLDGPQKKPEGPIQAMMAQSQSLGKGPGPRTDVGAPFGPQGHRDVPFSPDEMVPPSM NSQSGTIGPDHLDHMTPEQIAWLKLQQEFYEEKRRKPEQVVVQQCSLQDMMVHQHGPR GVVRGPPPPYQMTPSEGWAPGGTEPFSDGINMPHSLPPRGMAPHPNMPGSQMRLPGFA GMINSEMEGPNVPNPASRPGLSGVSWPDDVPKIPDGRNFPPGRGIFSGPGRGERFPNP QGLSEEMFQQQLAEKQLGLPPGMAMEGIRPSMEMNRMIPGSQRHMEPGNNPIFPRIPV EGPLSPSRGDFPKGIPPQMGPGRELEFGMVPSGMKGDVNLNVNMGSNSQMIPQKMREA GAGPEEMLKLRPGGSDMLPAQQKMVPLPFGEHPQQEYGMGPRPFLPMSQGPGSNSGLR NLREPIGPDQRTNSRLSHMPPLPLNPSSNPTSLNTAPPVQRGLGRKPLDISVAGSQVH SPGINPLKSPTMHQVQSPMLGSPSGNLKSPQTPSQLAGMLAGPAAAASIKSPPVLGSA AASPVHLKSPSLPAPSPGWTSSPKPPLQSPGIPPNHKAPLTMASPAMLGNVESGGPPP PTASQPASVNIPGSLPSSTPYTMPPEPTLSQNPLSIMMSRMSKFAMPSSTPLYHDAIK TVASSDDDSPPARSPNLPSMNNMPGMGINTQNPRISGPNPVVPMPTLSPMGMTQPLSH SNQMPSPNAVGPNIPPHGVPMGPGLMSHNPIMGHGSQEPPMVPQGRMGFPQGFPPVQS PPQQVPFPHNGPSGGQGSFPGGMGFPGEGPLGRPSNLPQSSADAALCKPGGPGGPDSF TVLGNSMPSVFTDPDLQEVIRPGATGIPEFDLSRIIPSEKPSQTLQYFPRGEVPGRKQ PQGPGPGFSHMQGMMGEQAPRMGLALPGMGGPGPVGTPDIPLGTAPSMPGHNPMRPPA FLQQGMMGPHHRMMSPAQSTMPGQPTLMSNPAAAVGMIPGKDRGPAGLYTHPGPVGSP GMMMSMQGMMGPNRTS" BASE COUNT 1509 a 1789 c 1532 g 1437 t ORIGIN 1 cttccatgtg tgaaagctac ttggcatgaa tgcctgggcc gtaccaagtg tcaccgcagc 61 aagaggggga gtcagaagaa aaacagatca gaccaagcag acaataggcc cctaaagtgt 121 tccccctaag ttgctttgat gttgtcctgg tgtcttgata ccaggaggcc agggattgcg 181 ggaaaagggt cttttttgtc ttcattcact ttcccccctc agtttctgaa atgattctcc 241 agaatttctc ctcataaaaa aggactgaat gtgggcccag ttggcgtcat tctgctttga 301 cctaaacatt cccatctgat tgggtggcag agatcatttt tggaaagttc ttccgtgtcc 361 cgatgtagaa gaaatagcaa attggacata ttgaaagaca agggtcatct ttgagaaggg 421 ggttcctgga ctcctcacct ccaggatgag cactgcagtg tcgtgaccct tggggttttg 481 tatgccctgg agatgcgaga ttttcctctg gcagcaggag gcacgcaccc agagaatgct 541 ggagctgcaa ggggaaagga cccacttcca cagcagagaa aaacaaagag gaaaaaggca 601 tacaggcagc gagcgctaag ggacgcaccc agcaagcagt gggccagtgc cactgccccc 661 agcagctgtt tctgctgcaa cccgagagga actcggtgag cctgtcccgt ttgtgactgc 721 aagctcagga tttcaatcaa tgcattccag taaccctaaa gtgaggagct ctccatcagg 781 aaacacacag agtagcccta agtcaaagca ggaggtgatg gtccgtcccc ctacagtgat 841 gtccccatct ggaaaccccc agctggattc caaattctcc aatcagggta aacagggggg 901 ctcagccagc caatcccagc catccccctg tgactccaag agtgggggcc atacccctaa 961 agcactccct ggcccaggtg ggagcatggg gctgaagaat ggggctggaa atggtgccaa 1021 gggcaagggg aaaagggagc gaagtatttc cgccgactcc tttgatcaga gagatcctgg 1081 gactccaaac gatgactctg acattaaaga atgtaattct gctgaccaca taaagtccca 1141 ggattcccag cacacaccac actcgatgac cccatcaaat gctacagccc ccaggtcttc 1201 taccccctcc catggccaaa ctactgccac agagcccaca cctgctcaga agactccagc 1261 caaagtggtg tacgtgtttt ctactgagat ggccaataaa gctgcagaag ctgttttgaa 1321 gggccaggtt gaaactatcg tctctttcca catccagaac atttctaaca acaagacaga 1381 gagaagcaca gcgcctctga acacacagat atctgccctt cggaatgatc cgaaacctct 1441 cccacaacag cccccagttc cggccaacca ggaccagaat tcttcccaga ataccagact 1501 gcagccaact ccacccattc cggcaccagc acccaagcct gccgcacccc cacgtcccct 1561 ggaccgggag agtcctgggg tagaaaacaa actgattcct tctgtaggaa gtcctgccag 1621 ctccactcca ctgcccccag atggtactgg gcccaactca actcccaaca atagggcagt 1681 gacccctgtc tcccagggga gcaatagctc ttcagcagat cccaaagccc ctccgcctcc 1741 accagtgtcc agtggcgagc cccccacact gggagagaat cccgatggct tatctcagga 1801 gcagctggag caccgggagc gctccttaca aactctcaga gatatccagc gcatgctttt 1861 tcctgatgag aaagaattca caggagcaca aagtggggga ccgcagcaga atcctggggt 1921 attagatggg cctcagaaaa aaccagaagg gccaatacag gccatgatgg cccaatccca 1981 aagcctaggt aagggacctg ggccccggac agacgtggga gctccatttg gccctcaagg 2041 acatagagat gtaccctttt ctccagatga aatggttcca ccttctatga actcccagtc 2101 tgggaccata ggacccgacc accttgacca tatgactccc gagcagatag cgtggctgaa 2161 actgcagcag gagttttatg aagagaagag gaggaagccg gaacaagtgg ttgtccagca 2221 gtgttccctc caggacatga tggtccatca gcacgggcct cggggagtgg tccgaggacc 2281 cccccctcca taccagatga cccctagtga aggctgggca cctgggggta cagagccatt 2341 ttctgatggt atcaacatgc cacattctct gcccccgagg ggcatggctc cccaccccaa 2401 catgccaggg agccagatgc gcctccctgg atttgcaggc atgataaact ctgaaatgga 2461 agggccgaat gtccccaacc ctgcatctag accaggtctt tctggagtca gttggccaga 2521 tgatgtgcca aaaatcccag atggtcgaaa ttttcctcct ggccggggca ttttcagcgg 2581 ccctggccga ggggaacgct tcccaaaccc ccaaggattg tctgaagaga tgtttcagca 2641 gcagctggca gagaaacagc tgggtctccc cccagggatg gccatggaag gcatcaggcc 2701 cagcatggag atgaacagga tgattccagg ctcccagcgc cacatggagc ctgggaataa 2761 ccccattttc cctcgaatac cagttgaggg ccctctgagt ccttctaggg gtgactttcc 2821 aaaaggaatt cccccacaga tgggccctgg tcgggaactt gagtttggga tggttcctag 2881 tgggatgaag ggagatgtca atctaaatgt caacatggga tccaactctc agatgatacc 2941 tcagaagatg agagaggctg gggcgggccc tgaggagatg ctgaaattac gcccaggtgg 3001 ctcagacatg ctgcctgctc agcagaagat ggtgccactg ccatttggtg agcaccccca 3061 gcaggagtat ggcatgggcc ccagaccatt ccttcccatg tctcagggtc caggcagcaa 3121 cagtggcttg cggaatctca gagaaccaat tgggcccgac cagaggacta acagccggct 3181 cagtcatatg ccaccactac ctctcaaccc ttccagtaac cccaccagcc tcaacacagc 3241 tcctccagtt cagcgcggcc tggggcggaa gcccttggat atatctgtgg caggcagcca 3301 ggtgcattcc ccaggcatta accctctgaa gtctcccacg atgcaccaag tccagtcacc 3361 aatgctgggc tcgccctcgg ggaacctcaa gtccccccag actccatcgc agctggcagg 3421 catgctggcg ggcccagctg ctgctgcttc cattaagtcc ccccctgttt tggggtctgc 3481 tgctgcttca cctgtccacc tcaagtctcc atcacttcct gccccgtcac ctggatggac 3541 ctcttctcca aaacctcccc ttcagagtcc tgggatccct ccaaaccata aagcacccct 3601 caccatggcc tccccagcca tgctgggaaa tgtagagtca ggtggccccc cacctcctac 3661 agccagccag cctgcctctg tgaatatccc tggaagtctt ccctctagta caccttatac 3721 catgcctcca gagccaaccc tttcccagaa cccactctct attatgatgt ctcgaatgtc 3781 caagtttgca atgcccagtt ccaccccgtt ataccatgat gctatcaaga ctgtggccag 3841 ctcagatgac gactcccctc cagctcgttc tcccaacttg ccatcaatga ataatatgcc 3901 aggaatgggc attaatacac agaatcctcg aatttcaggt ccaaaccccg tggttccgat 3961 gccaaccctc agcccaatgg gaatgaccca gccactttct cactccaatc agatgccctc 4021 tccaaatgcc gtgggaccca acatacctcc tcatggggtc ccaatggggc ctggcttgat 4081 gtcacacaat cctatcatgg ggcatgggtc ccaagagcca ccgatggtac ctcaaggacg 4141 gatgggcttc ccccagggct tccctccagt acagtctccc ccacagcagg ttccattccc 4201 tcacaatggc cccagtgggg ggcagggcag cttcccagga gggatgggtt tcccaggaga 4261 aggccccctt ggccgcccca gcaacctgcc ccaaagttca gcagatgcag cactttgcaa 4321 gcctggaggc cccgggggtc ctgactcctt cactgtcctg gggaacagca tgccttcggt 4381 gtttacagac ccagatctgc aggaggtcat ccgacctgga gccaccggaa tacctgagtt 4441 tgatctatcc cgcattattc catctgagaa gcccagccag acgctgcaat atttccctcg 4501 aggggaagtt ccaggccgta aacagcccca gggtcctgga cctgggtttt cacacatgca 4561 ggggatgatg ggcgaacaag cccccagaat gggactagca ttacctggca tgggaggtcc 4621 agggccagtg ggaactccgg acatccctct tggtacagct ccatccatgc caggccacaa 4681 ccccatgaga ccaccagcct ttctccaaca aggcatgatg ggacctcacc atcggatgat 4741 gtcaccagca caatctacaa tgcccggcca gcccaccctg atgagcaatc cagctgctgc 4801 cgtgggcatg attcctggca aggatcgggg gcctgccggg ctctacaccc accctgggcc 4861 tgtgggctct ccaggcatga tgatgtccat gcagggcatg atgggaccca acagaacatc 4921 atgatccccc cacagatgag gccccggggc atggctgctg acgtgggcat gggtggattt 4981 agccaaggac ctggcaaccc aggaaacatg atgttttaag ctgctaagat gggatgtgcc 5041 gatccttgtc aaaatgagat tccaggtcct gagagctgct ttgagggagt tccaggagta 5101 cttactattg gtcatgcaat aggagaacag agacccgagg gctgctttgg gggagggggg 5161 aactcgagaa tgtatggatt tacctgaaaa caaattattc atttaatcaa caggtgtgtt 5221 ttttttaaga tttatttttt aaaaattatt tttgtggact tgggtatcaa tgatggcacc 5281 tacttttggg aatctgtagc tgtgctttga gaattgccat cggtcatgtg ttgcaccgtt 5341 ctctgtatgt ttacgtcctt tggactggct tctcccagga ttcttttctg tttttgtttt 5401 tttgatttgg gctttatttt tttctgtgta ctgtactata ttgtaaaagg gattttagca 5461 gagactttag tctttggggc aagaggagaa caggaatgct gggctgttta ctttaggtgg 5521 agaatccatc ttcagacctt tggactattt tctttcaact gcagtgtata gaaaaaccaa 5581 actacgacct cagagcagag tattaatgaa aagcacaaaa aaaggaacta agttcagcga 5641 ggggtggggg gaggggggag atttttcttt tgaaaaataa tgactcttag gacatttgtt 5701 tttaagttca agtgctcttc agcactgtct tgtctcccaa tataccaacc cactggcaca 5761 ttttctctgt ttctctctcc gattttgctc tgtctcctca gttaagtgtt tccttccttt 5821 gtgccccccg ctggtgaccc tctgcttccc tctctctttc cctttggcag ctgcaataca 5881 cagtgttatt ttggggaaat aaatctagca aagcctcgcc ttccatgccg agcgtcctct 5941 tggctctgag agggaaagtc tgtcctggga tgcttctctg tcttttttcc ccctaagtct 6001 ttctctttcc catcataccc ttccctgccc caccttgttt tctgttcccc ttttattagg 6061 aattcccaag tgaattttat taatgtggga gtggaacaga tgctaaaagc tatccaggat 6121 tttgtttctg tttgttttaa attttgtggt tccttccctt tcctccccct cccatgcgta 6181 agacgttctg tgtaacctcc attaaatttg gtacaaaacc actcgccaga gctgtggtgt 6241 cagaaaaata aaatatattg tttctta // LOCUS HSRNABS69 2722 bp RNA PRI 11-JUL-1995 DEFINITION H.sapiens mRNA for BS69 protein. ACCESSION X86098 NID g899293 KEYWORDS BS69 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2722) AUTHORS Hateboer,G., Gennissen,A., Ramos,Y.F., Kerkhoven,R.M., Sonntag-Buck,V., Stunnenberg,H.G. and Bernards,R. TITLE BS69, a novel adenovirus E1A-associated protein that inhibits E1A transactivation JOURNAL EMBO J. 14 (13), 3159-3169 (1995) MEDLINE 95347342 REFERENCE 2 (bases 1 to 2722) AUTHORS Bernards,R. TITLE Direct Submission JOURNAL Submitted (06-APR-1995) R. Bernards, Netherlands Cancer Institute, Divisions of Molecular Carcinogenesis, Plesmanlaan 121, 1066 CX Amsterdam, NETHERLANDS FEATURES Location/Qualifiers source 1..2722 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="colon carcinoma" /clone_lib="T84" gene 245..1933 /gene="BS69" CDS 245..1933 /gene="BS69" /function="inhibits E1A mediated transactivation" /note="binds directly to adenovirus type 5 E1A protein" /codon_start=1 /db_xref="PID:g899294" /translation="MSRVHGMHPKETTRQLSLAVKDGLIVETLTVGCKGSKAGIEQEG YWLPGDEIDWETENHDWYCFECHLPGEVLICDLCFRVYHSKCLSDEFRLRDSSSPWQC PVCRSIKKKNTNKQEMGTYLRFIVSRMKERAIDLNKKGKDNKHPMYRRLVHSAVDVPT IQEKVNEGKYRSYEEFKADAQLLLHNTVIFYGADSEQADIARMLYKDTCHELDELQLC KNCFYLSNARPDNWFCYPCIPNHELVWAKMKGFGFWPAKVMQKEDNQVDVRFFGHHHQ RAWIPSENIQDITVNIHRLHVKRSMGWKKACDELELHQRFLREGRFWKSKNEDRGEEE AESSISSTSNEQLKVTQEPRAKKGRRNQSVEPKKEEPEPETEAVSSSQEIPTMPQPIE KVSVSTQTKKLSASSPRMLHRSTQTTNDGVCQSMCHDKYTKIFNDFKDRMKSDHKRET ERVVREALEKLRSEMEEEKRQAVNKAVANMQGEMDRKCKQVKEKCKEEFVEEIKKLAT QHKQLISQTKKKQWCYNCEEEAMYHCCWNTSYCSIKCQQEHWHAEHKRTCRRKR" BASE COUNT 879 a 515 c 617 g 711 t ORIGIN 1 ggagcataat gctaaagaag taaacaggtc atggcacgtt taacaaaaag acgacaggcg 61 atacaaaagc tatccagcat ctttgggcag ccattgagat tatacggaac cagaagcaga 121 ttgccaacat tgaccgtatt acaaaatgtg aaacaactac attattcttg aacctatggt 181 gatttttaca tcattacaca gatatgtcat tttcattagt tgtatcattg ttataaactg 241 gtatatgtct cgagtccacg gtatgcaccc taaagagacc acccgtcagc tgagcttagc 301 tgtgaaagat ggtcttattg tcgaaactct aacagtgggc tgcaaaggtt caaaagctgg 361 tattgaacaa gaaggatatt ggttgccagg agatgagatt gactgggaaa cagaaaatca 421 tgactggtat tgttttgaat gccatttgcc tggagaggtg ttgatatgtg acctgtgttt 481 tcgtgtgtat cattccaagt gtttgtctga tgagttcagg cttagagaca gcagtagtcc 541 ctggcagtgc ccagtttgca ggagcattaa gaagaagaat acaaacaaac aggagatggg 601 cacatacctc agattcattg tctcccgcat gaaggagagg gctatagatc ttaataaaaa 661 ggggaaggac aataaacacc cgatgtacag gaggctggtg cactcagctg tggacgttcc 721 caccattcaa gagaaagtga atgaagggaa ataccgaagt tatgaagagt tcaaagctga 781 tgcccaattg cttctccaca ataccgtgat tttctatgga gcagacagtg agcaagctga 841 cattgcgagg atgctatata aagacacatg tcatgagctg gatgaactgc agctttgcaa 901 gaattgcttt tacttgtcaa atgctcgtcc tgacaactgg ttctgttatc cttgtatacc 961 taatcatgag ctggtttggg ctaaaatgaa aggttttggg ttttggccag ccaaagtcat 1021 gcagaaagaa gacaatcaag tcgacgttcg cttctttggc caccaccacc agagggcctg 1081 gattccttct gaaaacattc aagatatcac agtcaacatt catcggctgc acgtgaagcg 1141 cagtatgggt tggaaaaagg cctgtgatga gctggagctg catcagcgtt tcctacgaga 1201 agggagattt tggaaatcta agaatgagga ccgaggtgag gaagaggcag aatccagtat 1261 ctcctccacc agtaatgagc agctaaaggt cactcaagaa ccaagagcaa agaaaggacg 1321 acgtaatcaa agtgtggagc ccaaaaagga agaaccagag cctgaaacag aagcagtaag 1381 ttctagccag gaaataccca cgatgcctca gcccatcgaa aaagtctccg tgtcaactca 1441 gacaaagaag ttaagtgcct cttcaccaag aatgctgcat cggagcaccc agaccacaaa 1501 cgacggcgtg tgtcagagca tgtgccatga caaatacacc aagatcttca atgacttcaa 1561 agaccggatg aagtcggacc acaagcggga gacagagcgt gttgtccgag aagctctgga 1621 gaagctgcgt tctgaaatgg aagaagaaaa gagacaagct gtaaataaag ctgtagccaa 1681 catgcagggt gagatggaca gaaaatgtaa gcaagtaaag gaaaagtgta aggaggaatt 1741 tgtagaagaa atcaagaagc tggcaacaca gcacaagcaa ctgatttctc agaccaagaa 1801 gaagcagtgg tgctacaact gtgaggagga ggccatgtac cactgctgct ggaacacatc 1861 ctactgctcc atcaagtgcc agcaggagca ctggcacgcg gagcacaagc gcacctgccg 1921 ccggaaaaga tgaagctggc ccttcccgga gtcaccccga tgattactct tttcagacac 1981 agcggttttt gtttccaaga agccaaaatt gtttagaatt tgcttcccat tttgcaccag 2041 cctttaaaca cttttcgtga agaaattttg cacagtagtt taaatctttt gttaatgctc 2101 ctccgaagtt tttcaggggg taaaagtaac atcagtggag ggtattattt taaataaatt 2161 ttaattgaga atttgttgca ttttcagcaa attttaaaac atttttaggt tttacagaga 2221 ttttaacctt taaacaacag atctttaaaa aacaggtgaa tacaagtgag tttaacaaag 2281 aaacatttag aatagatctg aatgtaagaa ctacagaact gtttcagaaa taaaacatac 2341 taccttgatg tgacattttt ttcttaacct tgttgagctg gttttgttca gcttaattta 2401 ctgttcaaag gcattatctg ttggtcacac cagtgggtat atgattgaat ttagggaaca 2461 gggttgacac agcagggcta gtcctgcata ttttttctta aatatttccc aattgtgttt 2521 ttcattattt cttttcaata tataactttt ataacaaatt attagctttg atcttgtagt 2581 ttaaaattgc agggaactgg ggtaatcttt tactgagctg gatcttagag aaaatgaata 2641 tttaaatttt aaagtttgcc acatttcatc tttgtcctaa catgagtgct tgtaacaaaa 2701 taaaacaaca aaaacaaagc ct // LOCUS HSRNAC61A 2407 bp RNA PRI 20-JUL-1992 DEFINITION H.sapiens c6.1A mRNA. ACCESSION X64643 NID g36087 KEYWORDS CpG island associated sequence. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2407) AUTHORS Kenwrick,S.J. TITLE Direct Submission JOURNAL Submitted (20-FEB-1992) S.J. Kenwrick, Cambridge University, Dept of Medicine, Level 5-Addenbrooke's Hospital, Hills Road, Camridge CB2 2QQ, UK REFERENCE 2 (bases 1 to 2407) AUTHORS Kenwrick,S., Levinson,B., Taylor,S., Shapiro,A. and Gitschier,J. TITLE Isolation and sequence of two genes associated with a CpG island 5' of the factor VIII gene JOURNAL Hum. Mol. Genet. 1 (3), 179-186 (1992) MEDLINE 93265009 FEATURES Location/Qualifiers source 1..2407 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="cDNA" /chromosome="Xq28, upstream of the factor VIII gene" gene 7..882 /gene="C6.1A" CDS 7..882 /gene="C6.1A" /codon_start=1 /db_xref="PID:g36088" /db_xref="SWISS-PROT:P46736" /translation="MAVQVVQAVQAVHLESDAFLVCLNHALSTEKEEVMGLCIGELND DTRSDSKFAYTGTEMRTVAEKVDAVRIVHIHSVIILRRSDKRKDRVEISPEQLSAAST EAERLAELTGRPMRVVGWYHSHPHITVWPSHVDVRTQAMYQMMDQGFVGLIFSCFIED KNTKTGRVLYTCFQSIQAQKSSEYERIEIPIHIVPHVTIGKVCLESAVELPKILCQEE QDAYRRIHSLTHLDSVTKIHNGSVFTKNLCSQMSAVSGPLLQWLEDRLEQNQQHLQEL QQEKEELMQELSSLE" misc_feature complement(500..900) /note="Alu homology" misc_feature 1400..2000 /note="line sequence homology; c6.1A sequence" BASE COUNT 838 a 463 c 522 g 584 t ORIGIN 1 gccaagatgg cggtgcaggt ggtgcaggcg gtgcaggcgg ttcatctcga gtctgacgct 61 ttcctcgttt gtctcaacca cgctctgagc acagagaagg aggaagtaat ggggctgtgc 121 ataggggagt tgaacgatga tacaaggagt gactccaaat ttgcatatac tggaactgaa 181 atgcgcacag ttgctgaaaa ggttgatgcc gtcagaattg ttcacattca ttctgtcatc 241 atcttacgac gttctgataa gaggaaggac cgagtagaaa tttctccaga gcagctgtct 301 gcagcttcaa cagaggcaga gaggttggct gaactgacag gccgccccat gagagttgtg 361 ggctggtatc attcccatcc tcatataact gtttggcctt cacatgttga tgttcgcaca 421 caagccatgt accagatgat ggatcaaggc tttgtaggac ttattttttc ctgtttcata 481 gaagataaga acacaaagac tggccgggta ctctacactt gcttccaatc catacaggcc 541 caaaagagtt cagagtatga gagaatcgaa atcccaatcc atattgtacc tcatgtcact 601 atcgggaaag tgtgccttga atcagcagta gagctgccca agatcctgtg ccaggaggag 661 caggatgcgt ataggaggat ccacagcctt acacatctgg actcagtaac caagatccat 721 aatggctcag tgtttaccaa gaatctgtgc agtcagatgt cggcagtcag cgggcctctc 781 ctacagtggt tggaggacag actggagcaa aaccaacagc atttgcagga attacaacaa 841 gaaaaggaag agcttatgca agaactttct tctctagaat aaatcaggag acaaaatggg 901 gaaagatgaa aatatccagt gtaaagttac ttaagctaaa tcaatttcaa agaagaaaaa 961 cttggaggac tcattttacc tgacttcaag acttactata aagctatagt aatcaagata 1021 gatggtattg gcagaggaac agacacatac gtcaatggaa cagatgagag aacccagaaa 1081 taaacccata taaatatgct cagctgattt tgaaaaagtg aaaaagcaat tcaatggagg 1141 aagaatagcc tttctgacaa attatgctag agcaattaga cacccatggc gaggagaaaa 1201 aagaacctct acttaaacct cacatcttat ataaaattta actcaaaatg tataacggac 1261 ttaaatgtga tacataaaac tagataactt tgaaaaaagc cacaggagaa aaatcttcag 1321 gatcttgggc taggtgacaa gttcttggac tttgccccga aagcacatcc ataaaagaca 1381 aaatctgata tattggactt cttcaaaatt taaaaacttg tgatttaaga agaggaaaag 1441 ataagctaca gattgagatg aatttgcaaa ccatatatct gatcaatttg gaatatataa 1501 agtgtactaa aaactcaact gaagtcaggc atggtagctc atgcttgtaa tctcaccact 1561 ttgggaggcc aagatgggag gagtgcttga ggttaggcgt tccagaccag cctgggcaac 1621 atagtgagac tcttgtctct acaaaaagtt tttttaaaaa attaactggg caccatgaca 1681 cacaccagta gtcccagcta ctagggaggc aggaggatca cttgagccca ggagtttgag 1741 gcctgcggtg agctgtgatc acaccaccac actccaacct gtgtaacaga gtgaggcctc 1801 atctcaaaaa aaaaaaaaag gccacaaaac tcaacaataa aaacaaacag tccaattaga 1861 aaatgggcaa aagacatgaa tagatgtttc actgaagagg atctatagat ggcaagtaag 1921 catatgaaaa gctgttaaac tccataagtc atcagggaat gcaaattgaa accacagcga 1981 ggctatgact tacttatctc aatggctaaa gaaaaaatag tgaaaatacc aaatactgat 2041 gaggatacaa actggatatt ttatacattg ctgacaggaa tgtaaaatgg tacagccact 2101 ctgggaaaga gtttatgaat ttcttatcaa gttaaacata attttttaat caagttaaac 2161 ataagaccca gcagttgtgc tcctggacat tcattccaga gaaatgaaaa cctatattgt 2221 acttgtactc aaatattcat aggagcttta tttgtaatag ccccaaactg gaaacaaccc 2281 agatgtccta caacaggtac atggttaaac aaaccatcca taacttggaa tactgctctg 2341 gaatgaaaag gaactaactg ttgatacaag aacttggttg tacctcaggg gtattatggt 2401 gaacaaa // LOCUS HSRNACINP 1901 bp RNA PRI 05-SEP-1995 DEFINITION H.sapiens mRNA for cytokine inducible nuclear protein. ACCESSION X83703 NID g793840 KEYWORDS ankyrin-like repeat; nuclear localisation signal; nuclear protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1901) AUTHORS Chu,W., Burns,D.K., Swerlick,R.A. and Presky,D.H. TITLE Identification and characterization of a novel cytokine-inducible nuclear protein from human endothelial cells JOURNAL J. Biol. Chem. 270 (17), 10236-10245 (1995) MEDLINE 95247734 REFERENCE 2 (bases 1 to 1901) AUTHORS Chu,W. TITLE Direct Submission JOURNAL Submitted (05-JAN-1995) W. Chu, Hoffmann-La Roche, 340 Kingsland Street, Dept. of Inflammation/Autoimmune Disease, Hoffmann-La Roche, Nutley, NJ 07110, USA FEATURES Location/Qualifiers source 1..1901 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="skin" /cell_type="endothelial" /clone_lib="HDMEC cDNA" /clone="C-193" /chromosome="10" mRNA 1..1901 misc_feature 94..98 /note="nuclear localization signal" repeat_unit 152..283 /note="ankyrin-like repeats" CDS 250..1209 /note="cytokine-inducible expression" /codon_start=1 /product="nuclear protein" /db_xref="PID:g793841" /translation="MMVLKVEELVTGKKNGNGEAGEFLPEDFRDGEYEAAVTLEKQED LKTLLAHPVTLGEQQWKSEKQREAELPKKKLEQRSKLENLEDLEIIIQLKKRKKYRKT KVPVVKEPEPEIITEPVDVPTFLKAALENKLPVVEKFLSDKNNPDVCDEYKRTALHRA CLEGHLAIVEKLMEAGAQIEFRDMLESTAIHWASRGGNLDVLKLLLNKGAKISARDKL LSTALHVAVRTGHYECAEHLIACEADLNAKDREGDTPLHDAVRLNRYKMIRLLIMYGA DLNIKNCAGKTPMDLVLHWQNGTKAIFDSLRENSYKTSRIATF" BASE COUNT 592 a 378 c 460 g 471 t ORIGIN 1 aaaaaacagc agggttagct tgtccctccc ctccctcttc agcttcccag acactgattc 61 tggaatgaaa attcacctgc ctctgagttg gctcctaatg ggggtgggag tgttacttcg 121 gttcccaggt tggaagatta tctcacccgg ccccagctat ataagctgac cggtgtggag 181 gggcccagca gggccaactc cagggattcc ttccacgaca gaaaaacata caagactcct 241 tcagccaaca tgatggtact gaaagtagag gaactggtca ctggaaagaa gaatggcaat 301 ggggaggcag gggaattcct tcctgaggat ttcagagatg gagagtatga agctgctgtt 361 actttagaga agcaggagga tctgaagaca cttctagccc accctgtgac cctgggggag 421 caacagtgga aaagcgagaa acaacgagag gcagagctcc caaagaaaaa actagaacaa 481 agatccaagc ttgaaaattt agaagacctt gaaataatca ttcaactgaa gaaaaggaaa 541 aaatacagga aaactaaagt tccagttgta aaggaaccag aacctgaaat cattacggaa 601 cctgtggatg tgcctacgtt tctgaaggct gctctggaga ataaactgcc agtagtagaa 661 aaattcttgt cagacaagaa caatccagat gtttgtgatg agtataaacg gacagctctt 721 catagagcat gcttggaagg acatttggca attgtggaga agttaatgga agctggagcc 781 cagatcgaat tccgtgatat gcttgaatcc acagccatcc actgggcaag ccgtggagga 841 aacctggatg ttttaaaatt gttgctgaat aaaggagcaa aaattagcgc ccgagataag 901 ttgctcagca cagcgctgca tgtggcggtg aggactggcc actatgagtg cgcggagcat 961 cttatcgcct gtgaggcaga cctcaacgcc aaagacagag aaggagatac cccgttgcat 1021 gatgcggtga gactgaaccg ctataagatg atccgactcc tgattatgta tggcgcggat 1081 ctcaacatca agaactgtgc tgggaagacg ccgatggatc tggtgctaca ctggcagaat 1141 ggaaccaaag caatattcga cagcctcaga gagaactcct acaagacctc tcgcatagct 1201 acattctgag gcaaacgaca gactcttaat cagtaaatgt tcactggcat tttgaaggca 1261 tggcccagga gaagagacac tagccataaa atctagtttc tatttatcaa cgtgttgtga 1321 agatgtacct aatgaagttt tgagaaagca cagggttata ggtgtttaaa tttcctttag 1381 tgaaactctt atttattttt atgtattcct gtttatttat ttactgccac gctactgata 1441 ttcagacctt catgatcatc catctggtga gcagagcttc atttgtatat aacactttca 1501 gagccttccc acccataggt agttcttaaa ccaggtgaaa gagcaaagtt caagtgccta 1561 cttatgtgtc attcgctcat gtaagagttt ttaagagagg gctgattatc acagccctct 1621 tttctcctga atttttaatg cagaagtttg aatgaagcaa gggaaggcat gtagggacag 1681 gaaaggaaac aatggaagga aagtgattct gtgaaaagga cagtgaagcc agctatttta 1741 cccccaggct ggattttttt tttttttttt tttttttttt tttttaccga gtacacagag 1801 tacccaagtg aagagaacgt catgagtgta agtgcaaatc agtggaagga gcggcaaact 1861 gggacatgca gaattgaatt tgctcaaaaa aaaaaaaaaa a // LOCUS HSRNAE2F5 1748 bp RNA PRI 31-MAY-1995 DEFINITION H.sapiens mRNA for E2F-5 protein. ACCESSION X86097 NID g854171 KEYWORDS E2F-5 gene; transcription factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1748) AUTHORS Bernards,R. TITLE Direct Submission JOURNAL Submitted (06-APR-1995) R. Bernards, Netherlands Cancer Institute, Divisions of Molecular Carcinogenesis, Plesmanlaan 121, 1066 CX Amsterdam, NETHERLANDS REFERENCE 2 (bases 1 to 1748) AUTHORS Hijmans,E.M., Voorhoeve,P.M., Beijersbergen,R.L., van 't Veer,L.J. and Bernards,R. TITLE E2F-5, a new E2F family member that interacts with p130 in vivo JOURNAL Mol. Cell. Biol. 15 (6), 3082-3089 (1995) MEDLINE 95280906 FEATURES Location/Qualifiers source 1..1748 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="colon carcinoma" /clone_lib="T84" gene 31..1071 /gene="E2F-5" CDS 31..1071 /gene="E2F-5" /codon_start=1 /product="transcription factor" /db_xref="PID:g854172" /translation="MAAAEPASSGQQAPAGQGQGQRPPPQPPQAQAPQPPPPPQLGGA GGGSSRHEKSLGLLTTKFVSLLQEAKDGVLDLKAAADTLAVRQKRRIYDITNVLEGID LIEKKSKNSIQWKGVGAGCNTKEVIDRLRYLKAEIEDLELKERELDQQKLWLQQSIKN VMDDSINNRFSYVTHEDICNCFNGDTLLAIQAPSGTQLEVPIPEMGQNGQKKYQINLK SHSGPIHVLLINKESSSSKPVVFPVPPPDDLTQPSSQSLTPVTPQKSSMATQNLPEQH VSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNEG VCDLFDVQILNY" BASE COUNT 524 a 373 c 371 g 480 t ORIGIN 1 ggggcccgac caccgcgggg ccgggacgcg atggcggcgg cagagcccgc gagctcgggc 61 cagcaggcgc cggcagggca ggggcagggc cagcggccgc cgccgcagcc tccgcaggcg 121 caagccccgc agccgccccc gccgccgcag ctcgggggcg cggggggcgg cagcagcagg 181 cacgagaaga gcctggggct gctcactacc aagttcgtgt cgctgctgca ggaggccaag 241 gacggcgttc tggatctcaa agcggctgct gatactttgg ctgtgaggca aaaaaggaga 301 atttatgata tcaccaatgt cttagaggga attgacttga ttgaaaaaaa gtcaaaaaac 361 agtatccagt ggaaaggtgt aggtgctggc tgtaatacta aagaagtcat agatagatta 421 agatatctta aagctgaaat tgaagatcta gaactgaagg aaagagaact tgatcagcag 481 aagttgtggc tacagcaaag catcaaaaat gtgatggacg attccattaa taatagattt 541 tcctatgtaa ctcatgaaga catctgtaat tgctttaatg gtgatacact tttggccatt 601 caggcacctt ctggtacaca actggaggta cccattccag aaatgggtca gaatggacaa 661 aagaaatacc agatcaatct aaagagtcat tcaggaccta tccatgtgct gcttataaat 721 aaagagtcga gttcatctaa gcccgtggtt tttcctgttc ccccacctga tgacctcaca 781 cagccttcct cccagtcctt gactccagtg actccacaga aatccagcat ggcaactcaa 841 aatctgcctg agcaacatgt ctctgaaaga agccaggctc tgcagcagac atcagctaca 901 gatatatctt cagcaggatc tattagtgga gatatcattg atgagttaat gtcttctgac 961 gtgtttcctc tcttaaggct ttctcctacc ccggcagatg actacaactt taatttagat 1021 gataacgaag gagtttgtga tctgtttgat gtccagatac taaattatta gattccatgg 1081 aaacttggga ctgttatcta cctctaactg tgtaacattt tagacttctt aataacctaa 1141 atatttaaaa taatgaatgt aacacctttt ttagttcact gattctgaag tgttcttccc 1201 taatactttc tttacttcac aaaacttcaa ccataaaaac aaagggctct gattgcttta 1261 ggggataagt gatttaatat tcacaaacgt ccccactccc aaaagtaact atattctgga 1321 tttcaacttt tcttctaatt gtgaatcctt ccgttttttc ttcttaagga ggaaagttaa 1381 aggacactac aggtcatcaa aaacaagttg gccaaggact cattacttgt cttatatttt 1441 tactgccact aaactgcctg tatttctgta tgtccttcta tccaaacaga cgttcactgc 1501 cacttgtaaa gtgaaggatg taaacgagga tatataactg tttcagtgaa cagattttgt 1561 gaagtgcctt ctgttttagc actttaagtt tatcacattt tgttgacttc tgacattcca 1621 ctttcctagg ttataggaaa gatctgttta tgtagtttgt ttttaaaatg tgccaatgcc 1681 tgtacattaa caagattttt aaaaataaaa ttgtataaaa cattaaaaaa aaaaaaaaaa 1741 aaaaaaaa // LOCUS HSRNAEMP2 520 bp RNA PRI 22-NOV-1996 DEFINITION H.sapiens mRNA for epithelial membrane protein-2. ACCESSION X94770 NID g1359880 KEYWORDS epithelial membrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 520) AUTHORS Taylor,V. TITLE Direct Submission JOURNAL Submitted (11-JAN-1996) V. Taylor, INSTITUTE OF CELL BIOLOGY, SWISS FERERAL INSTITUTE OF TECHNOLOGY, ETH-HONGGERBERG, CH-8093 ZURICH, SWITZERLAND REFERENCE 2 (bases 1 to 520) AUTHORS Taylor,V. and Suter,U. TITLE Epithelial membrane protein-2 and epithelial membrane protein-3: two novel members of the peripheral myelin protein 22 gene family JOURNAL Gene 175 (1-2), 115-120 (1996) MEDLINE 97074659 COMMENT Overlaps with T56151. FEATURES Location/Qualifiers source 1..520 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="foetus" /tissue_lib="liver spleen" sig_peptide 11..76 CDS 11..514 /codon_start=1 /product="epithelial membrane protein" /db_xref="PID:e218670" /db_xref="PID:g1359881" /db_xref="SWISS-PROT:P54851" /translation="MLVLLAFIIAFHITSAALLLIATVDNAWWVGDEFFADVWRICTN NTNCTVINDSFQEYSTLQAFQATMILSTILCCIAFFIFVLQLFRLKQGERFVLTSIIQ LMSCLCVMIAASIYTDRREDIHDKNAKFYPVTREGSYGYSYILAWVAFACTFISGMMY LILRKRK" mat_peptide 77..511 BASE COUNT 118 a 150 c 115 g 137 t ORIGIN 1 ccctgtgaaa atgttggtgc ttcttgcttt catcatcgcc ttccacatca cctctgcagc 61 cttgcttctc attgccaccg tcgacaatgc ctggtgggta ggagatgagt tttttgcaga 121 tgtctggaga atatgtacca acaacacgaa ttgcacagtc atcaatgaca gctttcaaga 181 gtactccacg ctgcaggcgt tccaggccac catgatcctc tccaccattc tctgctgcat 241 cgccttcttc atcttcgtgc tccagctctt ccgcctgaag cagggagaga ggtttgtcct 301 aacctccatc atccagctaa tgtcatgtct gtgtgtcatg attgcggcct ccatttatac 361 agacaggcgt gaagacattc acgacaaaaa cgcgaaattc tatcccgtga ccagagaagg 421 cagctacggc tactcctaca tcctggcgtg ggtggccttc gcctgcacct tcatcagcgg 481 catgatgtac ctgatactga ggaagcgcaa atagagttcc // LOCUS HSRNAEMP3 500 bp RNA PRI 22-NOV-1996 DEFINITION H.sapiens mRNA for epithelial membrane protein-3. ACCESSION X94771 NID g1359882 KEYWORDS epithelial membrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 500) AUTHORS Taylor,V. TITLE Direct Submission JOURNAL Submitted (11-JAN-1996) V. Taylor, INSTITUTE OF CELL BIOLOGY, SWISS FERERAL INSTITUTE OF TECHNOLOGY, ETH-HONGGERBERG, CH-8093 ZURICH, SWITZERLAND REMARK Revised by author 02-FEB-96 REFERENCE 2 (bases 1 to 500) AUTHORS Taylor,V. and Suter,U. TITLE Epithelial membrane protein-2 and epithelial membrane protein-3: two novel members of the peripheral myelin protein 22 gene family JOURNAL Gene 175 (1-2), 115-120 (1996) MEDLINE 97074659 COMMENT Overlaps with T56151. FEATURES Location/Qualifiers source 1..500 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="foetus" /tissue_type="spleen" /clone_lib="Stratagene" sig_peptide 4..69 CDS 4..495 /codon_start=1 /product="epithelial membrane protein-3" /db_xref="PID:e218671" /db_xref="PID:g1359883" /db_xref="SWISS-PROT:P54852" /translation="MSLLLLVVSALHILILILLFVATLDKSWWTLPGKESLNLWYDCT WNNDTKTWACSNVSENGWLKAVQVLMVLSLILCCLSFILFMFQLYTMRRGGLFYAHRL CQLCTSVAVFTGALIYAIHAEEILEKHPRGGSFGYCFALAWVAFPLALVSGIIYIHLR KRE" mat_peptide 70..492 BASE COUNT 86 a 159 c 131 g 124 t ORIGIN 1 gccatgtcac tcctcttgct ggtggtctca gcccttcaca tcctcattct tatactgctt 61 ttcgtggcca ctttggacaa gtcctggtgg actctccctg ggaaagagtc cctgaatctc 121 tggtacgact gcacgtggaa caacgacacc aaaacatggg cctgcagtaa tgtcagcgag 181 aatggctggc tgaaggcggt gcaggtcctc atggtgctct ccctcattct ctgctgtctc 241 tccttcatcc tgttcatgtt ccagctctac accatgcgac gaggaggtct cttctatgcc 301 caccggctct gccagctttg caccagcgtg gcggtgttta ctggcgcctt gatctatgcc 361 attcacgccg aggagatcct ggagaagcac ccgcgagggg gcagcttcgg atactgcttc 421 gccctggcct gggtggcctt ccccctcgcc ctggtcagcg gcatcatcta catccaccta 481 cggaagcggg agtgagcgcc // LOCUS HSRNAERB 1560 bp RNA PRI 05-SEP-1996 DEFINITION H.sapiens mRNA for estrogen receptor. ACCESSION X99101 NID g1518262 KEYWORDS estrogen receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1560) AUTHORS Mosselman,S., Polman,J. and Dijkema,R. TITLE ER beta: identification and characterization of a novel human estrogen receptor JOURNAL FEBS Lett. 392 (1), 49-53 (1996) MEDLINE 96354875 REFERENCE 2 (bases 1 to 1560) AUTHORS Mosselman,S. TITLE Direct Submission JOURNAL Submitted (04-JUL-1996) S. Mosselman, N.V.Organon, Biotechnology and Biochemistry, PO box 20 Oss, Molenstraat 110 Oss, 5340 BH, NETHERLANDS FEATURES Location/Qualifiers source 1..1560 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="testis" /dev_stage="adult" /clone_lib="Clontech; HL1010B" /clone_lib="Clontech; 7414-1" CDS 19..1452 /codon_start=1 /product="estrogen receptor beta" /db_xref="PID:e255390" /db_xref="PID:g1518263" /translation="MNYSIPSNVTNLEGGPGRQTTSPNVLWPTPGHLSPLVVHRQLSH LYAEPQKSPWCEARSLEHTLPVNRETLKRKVSGNRCASPVTGPGSKRDAHFCAVCSDY ASGYHYGVWSCEGCKAFFKRSIQGHNDYICPATNQCTIDKNRRKSCQACRLRKCYEVG MVKCGSRRERCGYRLVRRQRSADEQLHCAGKAKRSGGHAPRVRELLLDALSPEQLVLT LLEAEPPHVLISRPSAPFTEASMMMSLTKLADKELVHMISWAKKIPGFVELSLFDQVR LLESCWMEVLMMGLMWRSIDHPGKLIFAPDLVLDRDEGKCVEGILEIFDMLLATTSRF RELKLQHKEYLCVKAMILLNSSMYPLVTATQDADSSRKLAHLLNAVTDALVWVIAKSG ISSQQQSMRLANLLMLLSHVRHASNKGMEHLLNMKCKNVVPVYDLLLEMLNAHVLRGC KSSITGSECSPAEDSKSKEGSQNPQSQ" BASE COUNT 367 a 408 c 445 g 340 t ORIGIN 1 ggctatagcc ctgctgtgat gaattacagc attcccagca atgtcactaa cttggaaggt 61 gggcctggtc ggcagaccac aagcccaaat gtgttgtggc caacacctgg gcacctttct 121 cctttagtgg tccatcgcca gttatcacat ctgtatgcgg aacctcaaaa gagtccctgg 181 tgtgaagcaa gatcgctaga acacacctta cctgtaaaca gagagacact gaaaaggaag 241 gttagtggga accgttgcgc cagccctgtt actggtccag gttcaaagag ggatgctcac 301 ttctgcgctg tctgcagcga ttacgcatcg ggatatcact atggagtctg gtcgtgtgaa 361 ggatgtaagg ccttttttaa aagaagcatt caaggacata atgattatat ttgtccagct 421 acaaatcagt gtacaatcga taaaaaccgg cgcaagagct gccaggcctg ccgacttcgg 481 aagtgttacg aagtgggaat ggtgaagtgt ggctcccgga gagagagatg tgggtaccgc 541 cttgtgcgga gacagagaag tgccgacgag cagctgcact gtgccggcaa ggccaagaga 601 agtggcggcc acgcgccccg agtgcgggag ctgctgctgg acgccctgag ccccgagcag 661 ctagtgctca ccctcctgga ggctgagccg ccccatgtgc tgatcagccg ccccagtgcg 721 cccttcaccg aggcctccat gatgatgtcc ctgaccaagt tggccgacaa ggagttggta 781 cacatgatca gctgggccaa gaagattccc ggctttgtgg agctcagcct gttcgaccaa 841 gtgcggctct tggagagctg ttggatggag gtgttaatga tggggctgat gtggcgctca 901 attgaccacc ccggcaagct catctttgct ccagatcttg ttctggacag ggatgagggg 961 aaatgcgtag aaggaattct ggaaatcttt gacatgctcc tggcaactac ttcaaggttt 1021 cgagagttaa aactccaaca caaagaatat ctctgtgtca aggccatgat cctgctcaat 1081 tccagtatgt accctctggt cacagcgacc caggatgctg acagcagccg gaagctggct 1141 cacttgctga acgccgtgac cgatgctttg gtttgggtga ttgccaagag cggcatctcc 1201 tcccagcagc aatccatgcg cctggctaac ctcctgatgc tcctgtccca cgtcaggcat 1261 gcgagtaaca agggcatgga acatctgctc aacatgaagt gcaaaaatgt ggtcccagtg 1321 tatgacctgc tgctggagat gctgaatgcc cacgtgcttc gcgggtgcaa gtcctccatc 1381 acggggtccg agtgcagccc ggcagaggac agtaaaagca aagagggctc ccagaaccca 1441 cagtctcagt gacgcctggc cctgaggtga actggcccac agaggtcaca agctgaagcg 1501 tgaactccag tgtgtcagga gcctgggctt catctttctg ctgtgtggtc cctcatttgg // LOCUS HSRNAESM1 2006 bp RNA PRI 07-OCT-1996 DEFINITION H.sapiens mRNA for ESM-1 protein. ACCESSION X89426 NID g1150418 KEYWORDS ESM-1 protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2006) AUTHORS Lassalle,P., Molet,S., Janin,A., Heyden,J.V., Tavernier,J., Fiers,W., Devos,R. and Tonnel,A.B. TITLE ESM-1 is a novel human endothelial cell-specific molecule expressed in lung and regulated by cytokines JOURNAL J. Biol. Chem. 271 (34), 20458-20464 (1996) MEDLINE 96355375 REFERENCE 2 (bases 1 to 2006) AUTHORS Lassalle,P.M. TITLE Direct Submission JOURNAL Submitted (06-JUL-1995) P.M. Lassalle, INSERM, Unite 416, 1, bd du Prof. CALMETTE, LILLE 59019, FRANCE FEATURES Location/Qualifiers source 1..2006 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HUVEC" /clone="A11.1" sig_peptide 56..112 CDS 56..610 /codon_start=1 /product="ESM-1 secretory protein" /db_xref="PID:e189266" /db_xref="PID:g1150419" /translation="MKSVLLLTTLLVPAHLVAAWSNNYAVDCPQHCDSSECKSSPRCK RTVLDDCGCCRVCAAGRGETCYRTVSGMDGMKCGPGLRCQPSNGEDPFGEEFGICKDC PYGTFGMDCRETCNCQSGICDRGTGKCLKFPFFQYSVTKSSNRFVSLTEHDMASGDGN IVREEVVKENAAGSPVMRKWLNPR" BASE COUNT 623 a 333 c 475 g 575 t ORIGIN 1 cttcccacca gcaaagacca cgactggaga gccgagccgg aggcagctgg gaaacatgaa 61 gagcgtcttg ctgctgacca cgctcctcgt gcctgcacac ctggtggccg cctggagcaa 121 taattatgcg gtggactgcc ctcaacactg tgacagcagt gagtgcaaaa gcagcccgcg 181 ctgcaagagg acagtgctcg acgactgtgg ctgctgccga gtgtgcgctg cagggcgggg 241 agaaacttgc taccgcacag tctcaggcat ggatggcatg aagtgtggcc cggggctgag 301 gtgtcagcct tctaatgggg aggatccttt tggtgaagag tttggtatct gcaaagactg 361 tccctacggc accttcggga tggattgcag agagacctgc aactgccagt caggcatctg 421 tgacaggggg acgggaaaat gcctgaaatt ccccttcttc caatattcag taaccaagtc 481 ttccaacaga tttgtttctc tcacggagca tgacatggca tctggagatg gcaatattgt 541 gagagaagaa gttgtgaaag agaatgctgc cgggtctccc gtaatgagga aatggttaaa 601 tccacgctga tcccggctgt gatttctgag agaaggctct attttcgtga ttgttcaaca 661 cacagccaac attttaggaa ctttctagat atagcataag tacatgtaat ttttgaagat 721 ccaaattgtg atgcatggtg gatccagaaa acaaaaagta ggatacttac aatccataac 781 atccatatga ctgaacactt gtatgtgttt gttaaatatt cgaatgcatg tagatttgtt 841 aaatgtgtgt gtatagtaac actgaagaac taaaaatgca atttaggtaa tcttacatgg 901 agacaggtca accaaagagg gagctaggca aagctgaaga ccgcagtgag tcaaattagt 961 tctttgactt tgatgtacat taatgttggg atatggaatg aagacttaag agcaggagaa 1021 gatggggagg gggtgggagt gggaaataaa atatttagcc cttccttggt aggtagcttc 1081 tctagaattt aattgtgctt tttttttttt tttggctttg ggaaaagtca aaataaaaca 1141 accagaaaac ccctgaagga agtaagatgt ttgaagctta tggaaatttg agtaacaaac 1201 agctttgaac tgagagcaat ttcaaaaggc tgctgatgta gttcccgggt tacctgtatc 1261 tgaaggacgg ttctggggca taggaaacac atacacttcc ataaatagct ttaacgtatg 1321 ccacctcaga gataaatcta agaagtattt tacccactgg tggtttgtgt gtgtatgaag 1381 gtaaatattt atatattttt ataaataaat gtgttagtgc aagtcatctt ccctacccat 1441 atttatcatc ctcttgagga aagaaatcta gtattatttg ttgaaaatgg ttagaataaa 1501 aacctatgac tctataaggt tttcaaacat ctgaggcatg ataaatttat tatccataat 1561 tataggagtc actctggatt tcaaaaaatg tcaaaaaatg agcaacagag ggaccttatt 1621 taaacataag tgctgtgact tcggtgaatt ttcaatttaa ggtatgaaaa taagttttta 1681 ggaggtttgt aaaagaagaa tcaattttca gcagaaaaca tgtcaacttt aaaatatagg 1741 tggaattagg agtatatttg aaagaatctt agcacaaaca ggactgttgt actagatgtt 1801 cttaggaaat atctcagaag tattttattt gaagtgaaga acttatttaa gaattatttc 1861 agtatttacc tgtattttat tcttgaagtt ggccaacaga gttgtgaatg tgtgtggaag 1921 gcctttgaat gtaaagctgc ataagctgtt aggttttgtt ttaaaaggac atgtttatta 1981 ttgttcaata aaaaagaaca agatac // LOCUS HSRNAFIB 1247 bp RNA PRI 03-FEB-1994 DEFINITION H.sapiens mRNA for fibromodulin. ACCESSION X75546 NID g453156 KEYWORDS fibromodulin; proteoglycan. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1247) AUTHORS Hildebrand,A., Twardzig, Border,W.A. and Ruoslahti,E. TITLE Interaction of the small interstitial proteoglycans biglycan decorin and fibromodulin with TGF JOURNAL Unpublished REFERENCE 2 (bases 1 to 1247) AUTHORS Hildebrand,A. TITLE Direct Submission JOURNAL Submitted (02-NOV-1993) A. Hildebrand, University of Muenster, Dept of Dermatology, Von-Esmarch-Str. 56, 48127 Muenster/Westf., FRG FEATURES Location/Qualifiers source 1..1247 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="human lung fibroblast" /cell_line="WI-38 human lung fibroblast" gene 1..1247 /gene="hFM" primer_bind 1..35 /gene="hFM" /note="plus" CDS 21..1151 /gene="hFM" /codon_start=1 /product="fibromodulin" /db_xref="PID:g453157" /translation="MQWASLLLLAGLFSLSQAQYEDDPHWWFHYLRSQQSTYYDPYDP YPYETYEPYPYGVDEGPAYTYGSPSPPDPRDCPQECDCPPNFLTAMYCDNRNLKYLPF VPSRMKYVYFQNNQITSIQEGVFDNATGLLWIALHGNQITSDKVGRKVFSKLRHLERL YLDHNNLTRMPGPLPRSLRELHLDHNQISRVPNNALEGLENLTALYLQHDEIQEVGSS MRGLRSLILLDLSYNHLRKVPDGLPSALEQLYMEHNNVYTVPDSYFRGAPKLLYVRLS HNSLTNNGLASNTFNSSSLLELDLSYNQLQKIPPVNTNLENLYLQGNRINEFSISSFC TVVDVVNFSKLQVVRLDGNEIKRSAMPADAPLCLRLASLIEI" prim_transcript 21..1151 /gene="hFM" sig_peptide 21..74 /gene="hFM" /product="fibromomdulin" mat_peptide 75..1151 /gene="hFM" /product="fibromodulin" primer_bind 1210..1247 /gene="hFM" /note="minus" BASE COUNT 272 a 418 c 307 g 250 t ORIGIN 1 cggaattcaa gaaacacaaa atgcagtggg cgtccctcct gctgctggca gggctcttct 61 ccctctccca ggcccagtat gaagatgacc ctcattggtg gttccactac ctccgcagcc 121 agcagtccac ctactacgat ccctatgacc cttacccgta tgagacctac gagccttacc 181 cctatggggt ggatgaaggg ccagcctaca cctacggctc tccatcccct ccagatcccc 241 gcgactgccc ccaggaatgc gactgcccac ccaacttcct cacggccatg tactgtgaca 301 atcgcaacct caagtacctg cccttcgttc cctcccgcat gaagtatgtg tacttccaga 361 acaaccagat cacctccatc caggaaggcg tctttgacaa tgccacaggg ctgctctgga 421 ttgctctcca cggcaaccag atcaccagtg ataaggtggg caggaaggtc ttctccaagc 481 tgaggcacct ggagaggctg tacctggacc acaacaacct gacccggatg cccggtcccc 541 tgcctcgatc cctgagagag ctccatctcg accacaacca gatctcacgg gtccccaaca 601 atgctctgga ggggctggag aacctcacgg ccttgtacct ccaacacgat gagatccagg 661 aagtgggcag ttccatgagg ggcctccggt cactgatctt gctggacctg agttataacc 721 accttcggaa ggtgcctgat gggctgccct cagctcttga gcagctgtac atggagcaca 781 acaatgtcta caccgtcccc gatagctact tccggggggc gcccaagctg ctgtatgtgc 841 ggctgtccca caacagtcta accaacaatg gcctggcctc caacaccttc aattccagca 901 gcctccttga gctagacctc tcctacaacc agctgcagaa gatcccccca gtcaacacca 961 acctggagaa cctctacctc caaggcaata ggatcaatga gttctccatc agcagcttct 1021 gcaccgtggt ggacgtcgtg aacttctcca agctgcaggt cgtgcgcctg gacgggaacg 1081 agatcaagcg cagcgccatg cctgccgacg cgcccctctg cctgcgcctt gccagcctca 1141 tcgagatctg agcagccctg gcaccgggta ctgggcggag agcccccgtg gcatttggct 1201 tgatggtttg gtttggctta tggaagatct gggacagacc gtgtgac // LOCUS HSRNAGSSP 1155 bp RNA PRI 02-JUN-1995 DEFINITION H.sapiens mRNA for gamma subunit of sodium potassium ATPase. ACCESSION X86400 NID g791046 KEYWORDS gamma subunit; sodium/potassium ATPase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1155) AUTHORS Austruy,E., Belley,L., Millasot,P., Junien,C. and Jeanpierre,C. TITLE Characterization of the human cDNA with partial homology with the gamma subunit of sodium potassium ATPase of rat, mouse, rabbit and sheep JOURNAL Unpublished REFERENCE 2 (bases 1 to 1155) AUTHORS Austruy,E. TITLE Direct Submission JOURNAL Submitted (18-APR-1995) E. Austruy, INSERM 383, Hopital Necker-Enfants Malades, Clinique M. Lamy, 149 rue de Sevres, 75015 Paris, FRANCE COMMENT Sequence overlapping with that under the acc#X65705. FEATURES Location/Qualifiers source 1..1155 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" /clone_lib="lambda gt10" /chromosome="11" /map="q23.2-24.2" CDS 153..533 /codon_start=1 /product="gamma subunit of sodium potassium ATPase like" /db_xref="PID:g791047" /translation="MPPGGHIESGASFSPSAGSDGDKGLKPREEGEAPEAGPCLASQL APVQWVAPPRLLLRLIFIDLPALLIAPTAESSAEEDEEPHDEGQSSEDQAPIANGLIV IVERVHVPLGAAATVHRQPSHFPR" BASE COUNT 294 a 245 c 306 g 310 t ORIGIN 1 gcaaaagaat tccgggcccg gctgtatttg tacttgagct tggagaggct ggtggtgaca 61 gaggagggag acaggcaggc aggtgaggga agcagcggtc tctgacgccg gctttattat 121 aagcatgagg tggcacgagg caggagttgg cgatgccacc tgggggtcac attgagtctg 181 gagcttcctt cagcccaagc gctggcagtg acggggacaa aggtctaaag cccagggaag 241 aaggggaggc gccagaggca gggccatgct tggcttccca gctggcccca gtgcagtggg 301 tggcaccgcc gaggctgctg ttacggctca tcttcattga tttgcctgcg cttcttattg 361 cccccacagc ggaatcttct gctgaggagg atgaggagcc ccacgatgaa ggccagtcca 421 gcgaagatca ggcccccatt gcgaacggtc tcatagtcat agtagaacgg gtccacgtcc 481 cccttggggc tgccgccacc gtccatcgac aacccagtca tttccccagg tgaatgggct 541 gcctccactc ccctcttcct gcaagaagtt ctcaagcctt tttgattttt gtgcaataaa 601 gtacagcttt gcataagagt gaaattgggc tagcttaaat ggatccataa actttcttct 661 aattttaagt gagaatcttt taaacacctg ttaaatttaa tgtagcagtc tgagaatcta 721 aaattatgta ccactcgttt atttgttcat tcatccatcc cttttcccat gaatatttca 781 tttgtttatc cagctagttt tattaactgt tttaatccag gcattgggat ataatgataa 841 gcaagatatc attcctgtgg aaacttcttc caggcactaa agatatagag gatgaattag 901 aggaacataa aattggagga gattgaatac ttaaaggcag tggccatagg gatggcaaag 961 aggaagtaga gatagagtca ttaggactta gtgccaatta aatacggggt gagtgagaga 1021 agatggcttt gggatgtcct caagttccta gcttatttat tcattatgct tttattggag 1081 gactatcatg ttttaggcat tgttatgttt caggcattgc cggaattcag cttggactta 1141 accaggctga actgg // LOCUS HSRNAHELC 3130 bp RNA PRI 07-OCT-1996 DEFINITION H.sapiens mRNA for RNA helicase (Myc-regulated dead box protein). ACCESSION X98743 NID g1498228 KEYWORDS DEAD-box; Myc-Max heterodimer; RNA helicase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3130) AUTHORS Grandori,C. TITLE Direct Submission JOURNAL Submitted (21-JUN-1996) C. Grandori, Fred Hutchinson Cancer Research Center, Basic Sciences, 1124 Columbia Street, Seattle, Washington, USA REFERENCE 2 (bases 1 to 3130) AUTHORS Grandori,C., Mac,J., Siebelt,F., Ayer,D.E. and Eisenman,R.N. TITLE Myc-Max heterodimers activate a DEAD box gene and interact with multiple E box-related sites in vivo JOURNAL EMBO J. 15 (16), 4344-4357 (1996) MEDLINE 97015134 FEATURES Location/Qualifiers source 1..3130 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Manca B" /cell_type="B" CDS 18..1850 /note="Myc regulated dead box protein, MrDb" /codon_start=1 /evidence=not_experimental /product="RNA helicase" /db_xref="PID:e254454" /db_xref="PID:g1498229" /translation="MNVGLSETQNGGMSQEAVGNIKVTKSPQKSTVLTNGEAAMQSSN SESKKKKKKKRKMVNDAEPDTKKAKTENKGKSEEESAETTKETENNVEKPDNDEDESE VPSLPLGLTGAFEDTSFASLCNLVNENTLKAIKEMGFTNMTEIQHKSIRPLLEGRDLL AAAKTGSGKTLAFLIPAVELIVKLRFMPRNGTGVLILSPTRELAMQTFGVLKELMTHH VHTYGLIMGGSNRSAEAQKLGNGINIIVATPGRLLDHMQNTPGFMYKNLQCLVIDEAD RILDVGFEEELKQIIKLLPTRRQTMLFSATQTRKVEDLARISLKKEPLYVGVDDDKAN ATVDGLEQGYVVCPSEKRFLLLFTFLKKNRKKKLMVFFSSCMSVKYHYELLNYIDLPV LAIHGKQKQNKRTTTFFQFCNADSGTLLCTDVAARGLDIPEVDWIVQYDPPDDPKEYI HRVGRTARGLNGRGHALLILRPEELGFLRYLKQSKVPLSEFDFSWSKISDIQSQLEKL IEKNYFLHKSAQEAYKSYIRAYDSHSLKQIFNVNNLNLPQVALSFGFKVPPFVDLNVN SNEGKQKKRGGGGGFGYQKTKKVEKSKIFKHISKKSSDSRQFSH" BASE COUNT 982 a 572 c 642 g 933 t 1 others ORIGIN 1 aatcaaaaca aaagcccatg aatgtgggct tatcagaaac tcaaaatgga ggcatgtctc 61 aagaagcagt gggaaatata aaagttacaa agtctcccca gaaatccact gtattaacca 121 atggagaagc agcaatgcag tcttccaatt cagaatcaaa aaagaaaaag aagaaaaaga 181 gaaaaatggt gaatgatgct gagcctgata cgaaaaaagc aaaaactgaa aacaaaggga 241 aatctgaaga agaaagtgcc gagactacta aagaaacaga aaataatgtg gagaagccag 301 ataatgatga agatgagagt gaggtgccca gtctgcccct gggactgaca ggagcttttg 361 aggatacttc gtttgcttct ctatgtaatc ttgtcaatga aaacactctg aaggcaataa 421 aagaaatggg ttttacaaac atgactgaaa ttcagcataa aagtatcaga ccacttctgg 481 aaggcaggga tcttctagca gctgcaaaaa caggcagtgg taaaaccctg gcttttctca 541 tccctgcagt tgaactcatt gttaagttaa ggttcatgcc caggaatgga acaggagtcc 601 ttattctctc acctactaga gaactagcca tgcaaacctt tggtgttctt aaggagctga 661 tgactcacca cgtgcatacc tatggcttga taatgggtgg cagtaacaga tctgctgaag 721 cacagaaact tggtaatggg atcaacatca ttgtggccac accaggccgt ctgctggacc 781 atatgcagaa taccccagga tttatgtata aaaacctgca gtgtctggtt attgatgaag 841 ctgatcgtat cttggatgtg gggtttgaag aggaattaaa gcaaattatt aaacttttgc 901 caacacgtag acagactatg ctcttttctg ccacccaaac tcgaaaagtt gaagacctgg 961 caaggatttc tctgaaaaag gagccattgt atgttggcgt tgatgatgat aaagcgaatg 1021 caacagtgga tggtcttgaa cagggatatg ttgtttgtcc ttctgaaaag agattccttc 1081 tgctctttac attccttaag aagaaccgaa agaagaagct tatggtcttc ttttcatctt 1141 gtatgtctgt gaaataccac tatgagttgc tgaactacat tgatttgccc gtcttggcca 1201 ttcatggaaa gcaaaagcaa aataagcgta caaccacatt cttccagttc tgcaatgcag 1261 attcgggaac actattgtgt acggatgtgg cagcgagagg actagacatt cctgaagtcg 1321 actggattgt tcagtatgac cctccggatg accctaagga atatattcat cgtgtgggta 1381 gaacagccag aggcctaaat gggagagggc atgccttgct cattttgcgc ccagaagaat 1441 tgggttttct tcgttactta aaacaatcca aggttccatt aagtgaattt gacttttcct 1501 ggtctaaaat ttctgacatt cagtctcagc ttgagaaatt gattgaaaag aattactttc 1561 ttcataagtc agcccaggaa gcatataagt catacatacg agcctatgat tcccattctc 1621 tgaaacagat ctttaatgtt aataacctaa atttgcctca ggttgctctg tcatttggtt 1681 tcaaggtgcc tcccttcgtt gatctgaacg tcaacagtaa tgaaggcaag cagaaaaagc 1741 gaggaggtgg tggtggattt ggctaccaga aaaccaagaa agttgagaaa tccaaaatct 1801 ttaaacacat tagcaagaaa tcatctgaca gcaggcagtt ctctcactga acacatgcct 1861 tcctttcatc ttgaataact ttgtcctaaa atgaattttt tttccccttg atttaacagg 1921 atttttgtag actttagaat ttggacttac ctaacaagag tataaattga cttgggttgc 1981 aagcactgag cactgttact tctatcacgt ctctctttta tttctgggat ataaaacagg 2041 ctttaagttt cttggttgcc caagggcaga gcaaggaata tctggtgttt cttgtgatga 2101 taatatttta attttaaata tccctccctc atacaagtgt atgttaccat tttaatataa 2161 ttctttttgt acctttcctt cttgttttgt gaagattttt gtggcatgga ttgctgtgct 2221 cactgctgta aaaggtgacc tagtgtactg ggcagctggt ggcggtgcag aaaagagtct 2281 caggttattt tagatttgtt taattcaagg tggtttggat ttggtaagcc tttgcactct 2341 gtagagtact tagaagacaa gggcaactta cttggagtta gagccaagct gtcagacggt 2401 gcccagcaca cattaatgtt agcttctttc tgagaaaaaa atacctcttc caggccctga 2461 aacaaaaaat acatttgctg tgaagattga aaatgaacaa agttagaaaa aaaaacagca 2521 aaatcagtga tttagtcasa tgagtttttc gttgtaggag cacttgattt ctagtgtgtt 2581 ttgtacagta tataactaca agatagtaca ttttgtagca gttcaaagcc aaagttgcta 2641 gcatcatttt gctgttgtgc cagttaatca taggatccca ttaaataagt gtgctaacat 2701 cgaatataga gaaaactggt aaagaacatt ccagtaggaa aagaaaagaa caatcttcca 2761 tttctgggct tggccaccat caccctggtc ggacctgtcc tggacttcca accttgactg 2821 ctgagctcct ggcttagctt cttgggttcc taattcctgg tgtttaataa ttctctccac 2881 gatcatgttt ttctgatttt ttttttcaga aataatgttt tttaaaagac aaaaacaaag 2941 ggaagaatat ttaattactg agcagaagta aatactgttg gtattttgta cataaaccta 3001 atttttatat gcatgtttat gctttttaat ttttttatca aaaattaagt catctaccta 3061 ctacttgtaa ccagcttgtt tcataacatg ttattttcct gtgtcattaa ataattactt 3121 caaaaaaaaa // LOCUS HSRNAHTRH 1197 bp RNA PRI 27-SEP-1993 DEFINITION H.sapiens mRNA for human thyrotropin-releasing hormone receptor. ACCESSION X75071 NID g404157 KEYWORDS thyrotropin-releasing hormone receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1197) AUTHORS Matre,V. TITLE Direct Submission JOURNAL Submitted (16-SEP-1993) V. Matre, Institute of Medical Bichemistry, University of Oslo, 0317 Oslo, NORWAY REFERENCE 2 (bases 1 to 1197) AUTHORS Matre,V., Karlsen,H.E., Wright,M.S., Lundell,I., Fjeldheim,A.K., Gabrielsen,O.S., Larhammar,D. and Gautvik,K.M. TITLE Molecular cloning of a functional human thyrotropin-releasing hormone receptor JOURNAL Biochem. Biophys. Res. Commun. 195 (1), 179-185 (1993) MEDLINE 93371401 FEATURES Location/Qualifiers source 1..1197 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="foetal" /sex="female" CDS 1..1197 /codon_start=1 /product="thyrotropin-releasing hormone receptor" /db_xref="PID:g404158" /db_xref="SWISS-PROT:P34981" /translation="MENETVSELNQTQLQPRAVVALEYQVVTILLVLIICGLGIVGNI MVVLVVMRTKHMRTPTNCYLVSLAVADLMVLVAAGLPNITDSIYGSWVYGYVGCLCIT YLQYLGINASSCSITAFTIERYIAICHPIKAQFLCTFSRAKKIIIFVWAFTSLYCMLW FFLLDLNISTYKDAIVISCGYKISRNYYSPIYLMDFGVFYVVPMILATVLYGFIARIL FLNPIPSDPKENSKTWKNDSTHQNTNLNVNTSNRCFNSTVSSRKQVTKMLAVVVILFA LLWMPYRTLVVVNSFLSSPFQENWFLLFCRICIYLNSAINPVIYNLMSQKFRAAFRKL CNCKQKPTEKPANYSVALNYSVIKESDHFSTELDDITVTDTYLSATKVSFDDTCLASE VSFSQS" BASE COUNT 320 a 294 c 231 g 352 t ORIGIN 1 atggaaaacg agacagtcag tgaactgaac caaacacagc ttcagccacg agcagtggtg 61 gccttagaat accaggtggt caccatctta cttgtactca ttatttgtgg cctgggtatt 121 gtaggcaaca tcatggtagt cctggttgtc atgagaacca agcacatgag gacccccaca 181 aactgctacc tggtgagcct ggcagtagct gatctcatgg tcttggtggc cgcaggcctc 241 cccaacataa cagacagtat ctacggttcc tgggtctatg gctatgttgg atgcctctgc 301 attacttacc tccagtattt gggaattaat gcatcctctt gttcaataac agcctttacc 361 attgagaggt acatagcaat ctgtcacccc atcaaagccc agtttctctg cacattttcc 421 agagccaaaa agattatcat ctttgtctgg gctttcacat ctctttactg tatgctctgg 481 ttcttcttgc tggatctcaa tattagcacc tacaaagatg ctattgtgat atcctgtggc 541 tacaagatct ccaggaatta ctactcacct atttacctaa tggactttgg tgtcttttat 601 gttgtgccaa tgatcctggc taccgtcctc tatggattca tagctagaat ccttttctta 661 aatcccattc cttcagatcc taaagaaaac tctaagacat ggaaaaatga ttcaacccat 721 cagaacacaa atctgaatgt aaatacctct aatagatgtt tcaacagcac agtatcttca 781 aggaagcagg tcaccaagat gctggcagtg gttgtaattc tgtttgccct tttatggatg 841 ccctacagga ctctagtggt tgtcaactca tttctctcca gtcctttcca agaaaattgg 901 tttttgctct tttgcagaat ttgcatttat ctcaacagtg ccatcaaccc ggtgatttac 961 aatctcatgt cccagaaatt ccgtgcagcc ttcagaaagc tctgcaactg caagcagaag 1021 ccaacagaga aacctgctaa ctacagtgtg gccctaaatt acagcgtcat caaggagtca 1081 gaccatttca gcacagagct tgatgatatc actgtcactg acacttacct gtctgccaca 1141 aaagtgtctt ttgatgacac ctgcttggct tctgaggtat cctttagcca aagttga // LOCUS HSRNAHUGL 3228 bp RNA PRI 10-OCT-1995 DEFINITION H.sapiens mRNA for tumour suppressor protein, HUGL. ACCESSION X86371 NID g784996 KEYWORDS hugl gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3228) AUTHORS Strand,D.J., Unger,S., Corvi,R., Hartenstein,K., Schenkel,H., Kalmes,A., Merdes,G., Neumann,B., Kreig-Schneider,F., Coy,J.F., Poustka,A., Schwab,M. and Mechler,B. TITLE A human homologue of the Drosophila tumour suppressor gene l(2)gl maps to 17p11.2-12 and codes for a cytoskeletal protein that associates with nonmuscle myosin II heavy chain JOURNAL Oncogene 11 (2), 291-301 (1995) MEDLINE 95349952 REFERENCE 2 (bases 1 to 3228) AUTHORS Strand,D.J. TITLE Direct Submission JOURNAL Submitted (18-APR-1995) D.J. Strand, Devel. Genetics, DKFZ, Dept. of Developmental Genetics, Im Neuenheimer Feld 280, 69120 Heidelberg, FRG FEATURES Location/Qualifiers source 1..3228 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /chromosome="17" /map="p11,2-12" gene 7..3180 /gene="hugl" CDS 7..3180 /gene="hugl" /function="associated with nonmuscle myosinII heavy chain" /note="homologue to Drosophila tumour supressor gene" /codon_start=1 /db_xref="PID:g784997" /translation="MMKFPFRRQGADPQREKLKQELFAFNKTVEHGFPNQPSALAFDP ELRIMAIGTRSGAVKIYGAPGVEFTGLHRDAATVTQMHFLTGQGRLLSLLDDSSLHLW EIVHHNGCAHLEEALSFQLPSRPGFDGASAPLSLTRVTVVLLVAAGDIAALGTEGSSS VFFLDVTTLTLLEGQTLAPGEVLRSVPDDYRCGKDLGPVESLQGHLQDPTKILIGYSR GLLVIRNQASQCVDHIFLGNQQLESLCWGRDSSTVVSSHSDGSYAVWSVDAGSFPTLQ PTVATTPYGPFPCKAINKILWRNCESGGHFIIFSGGMPRASYGDRHCVSVLRAETLVT LDFHFRIIDFFTVHSTRPEDEFDDPQALAVLLEEELVVLDLQTPGWPAVPAPYLAPLH SSAITCSAHVASVPAKLWARIVSAGEQQSPQPVSSALSWPITGGRNLAQEPSQRGLLL TGHEDGTVRFWDASGVALRPLYKLSTAGLFQTDCEHSDSLAQAAEDDWPPFRKVGCFD PYSDDPRLGVQKVALCKYTAQMVVAGTAGQVLVLELSDVPVEHAVSVAIIDLLQDREG FTWKGHERLSPRTGLLPWPAGFQPCVLVQCLPPAAVTAVTLHTEWSLVAFGTSHGFGL LSPVLARCTLHPNDSLAMEGPLSRVKSLKKSLRQSFRRIRKSRVSGKKRAANASSKLQ EANAQLAEQACPHDVEMTPVQRRIEPRSADDSLSGVVRCLYFADTFLRDGAHHGPTMW AGTNSGSVFAYALEVPAAAVGGEKRPEQAVEAVLGKELQLMHRAPVVAIAVLDGGRPL PEPYEASRDLAQAPHMQGGHAVLIASEEQFKVFTLPKVSAKTKFKLTAHEGCRVRKVV ALATFASVACEDYAETCLACLTNLGDVHVFSVPGLRPEVHYSCIRKEDISGIASCVFT RHGQGFYLISPSEFERFSLSARNITEGLCSLDINWPRDATQASYRIRESPKLSQANGT PSILLAPQSLDGSPDPAHSMGPDTPEPPEAALSPMSIDSATSADTTLDTTGDVTVEDV KDFLGSSEESEKNLRNLAEDEAHACCI" BASE COUNT 596 a 1026 c 984 g 622 t ORIGIN 1 cgcaagatga tgaagtttcc gttccggcgg cagggcgccg acccgcagcg cgagaagctc 61 aagcaggagc ttttcgcctt caacaagact gtggagcatg gcttccccaa tcagcccagc 121 gccctggctt tcgacccgga acttcgcatc atggccatcg gcaccaggtc tggggctgtc 181 aagatctatg gtgcacctgg cgtggagttc acaggcctgc accgggatgc agccactgtc 241 acacagatgc acttcttgac cggccagggc cgcctcctgt ccctgcttga tgacagcagt 301 ctgcatctct gggagattgt ccaccataat ggctgtgccc acctggaaga agcactcagt 361 ttccagctgc ccagccggcc cggctttgat ggtgccagtg ctccgctcag ccttacccga 421 gtcacagtgg tcctgctggt ggctgccggc gacatagcag ccctgggcac tgagggcagc 481 agcagtgtct tcttcctgga tgttaccacc ctgaccctgc tcgaggggca gacgcttgcc 541 ccaggcgagg ttctgcgcag cgtgccagac gactaccgct gtgggaagga cctgggcccc 601 gtggagtcac tccagggaca cctgcaagac cccacaaaga ttctcattgg ctacagccgg 661 ggcctgctgg tcatcaggaa ccaggcctcg cagtgtgtgg accacatctt cctggggaac 721 cagcagctgg agagcctatg ctggggccgt gatagcagca ctgtggtcag ctcacacagc 781 gatggcagct atgctgtctg gtctgtggat gccggcagct tcccaacgct gcagcccacg 841 gtagccacca caccttacgg cccctttccc tgcaaggcca ttaacaagat tctgtggcgg 901 aactgtgaat ctgggggcca ctttatcatc ttcagcggtg gcatgccccg tgccagctat 961 ggtgaccgcc actgtgtaag tgtgcttcga gccgagacat tggtgacgct ggacttccac 1021 ttccgcatca tcgacttctt cacagtgcac agcacacggc ccgaggatga atttgatgac 1081 ccccaggccc tggctgtgct gctggaagag gagctggtgg tgctggacct gcagactcct 1141 ggctggccag ctgtgcctgc cccatacctg gccccgctgc actcctctgc aatcacttgc 1201 tcggcccacg tggccagtgt ccccgccaag ctgtgggccc gcattgtgag cgctggcgag 1261 cagcagagcc cccagcctgt ctccagtgcc ttgagctggc ccatcactgg gggccgaaac 1321 ctggcccagg agccgtcaca acgagggctg ctgctgacgg gccatgagga cgggaccgtg 1381 aggttctggg atgcctcggg tgtggcgctg cggccgctct ataagctgag cacagctggc 1441 ctcttccaga cagactgtga gcactctgac agcctggccc aggctgccga ggacgactgg 1501 ccacccttcc gcaaggtggg ctgctttgat ccctacagtg acgatccccg gcttggcgtg 1561 cagaaggttg ctctctgcaa gtatacagcc cagatggtgg tggctggcac tgcaggccag 1621 gtgctggtac tggagcttag tgatgtgccg gtggagcacg cggtcagcgt ggctatcata 1681 gacctcctcc aggaccgcga gggcttcaca tggaagggcc acgagcggct gagcccacgc 1741 acggggctgc tgccctggcc tgctggcttc cagccctgtg tcctggtgca gtgcctgccg 1801 ccagctgctg taaccgctgt cacactccac accgagtgga gcctcgtggc ttttggcacc 1861 agtcatggct ttggcctctt gagccctgtg ctggccaggt gcactcttca ccccaatgac 1921 tccctggcca tggagggtcc gctctcccgg gtgaagtctc tcaagaagtc actgcgccag 1981 tctttccggc gcattcgcaa gagtcgtgtc tctggcaaga agcgggctgc taatgccagc 2041 agcaagttgc aggaagccaa tgcacagctg gctgagcagg cctgccccca cgacgtggag 2101 atgacacccg tgcagcgccg cattgagccc cgctctgccg atgactcctt gtcgggtgtc 2161 gtgcgttgcc tatactttgc cgacacattc cttcgagatg gggcccacca cgggcccacc 2221 atgtgggctg gcaccaactc aggctctgtg ttcgcctatg cactggaggt cccggcagca 2281 gcagtgggtg gtgagaagcg ccctgagcaa gcggtggagg ccgtgctggg caaggagctg 2341 cagctgatgc accgggcgcc tgtggtggcc attgccgtgt tggacggtgg ccgcccactg 2401 cccgagccct acgaggcctc acgggacctg gcgcaggcac ctcacatgca gggtgggcac 2461 gctgtgctga tcgcatctga ggagcagttc aaggtgttca cactgcccaa ggtgagcgcg 2521 aagaccaagt tcaagctgac ggcccatgag ggctgtcgtg tgcgcaaggt ggtggcactg 2581 gccacgtttg ccagtgtggc ctgcgaggac tatgctgaga cctgcctggc ctgcctcacc 2641 aacctgggtg acgtccacgt cttctcggtg cctggcctgc ggcccgaggt gcactattcc 2701 tgcatccgga aggaggacat cagcggcatc gcttcgtgcg tctttacgcg ccatggccag 2761 ggcttttacc tgatatcccc atcagaattt gaacgcttct ccctaagtgc ccggaacatc 2821 acagagggcc tctgctctct ggacattaac tggccccgcg atgccaccca ggccagttac 2881 aggatccgag agtcacccaa gctgagccag gctaacggga ccccaagcat cctgctggcc 2941 ccacagagcc ttgatggaag ccctgatcca gcccacagca tgggacctga caccccggag 3001 ccacccgagg ctgcactctc acccatgtcc atcgactcag ccaccagtgc tgacaccacg 3061 ctggacacga caggggacgt cacagtggaa gatgtgaagg atttcctggg ctcctctgag 3121 gagtcagaga agaacctgag gaacctggca gaagacgagg cccacgcctg ttgcatctga 3181 tcaatgagga aggcagaact atccgctgag gaacctggca gaagacga // LOCUS HSRNAIDH 1370 bp RNA PRI 02-OCT-1997 DEFINITION H.sapiens mRNA for NAD (H)-specific isocitrate dehydrogenase gamma subunit precursor. ACCESSION Z68907 NID g1167848 KEYWORDS IDH gene; isocitrate dehydrogenase gamma subunit precursor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1370) AUTHORS Brenner,V., Nyakatura,G., Rosenthal,A. and Platzer,M. TITLE Genomic organization of two novel genes on human Xq28: compact head to head arrangement of IDH gamma and TRAP delta is conserved in rat and mouse JOURNAL Genomics 44 (1), 8-14 (1997) MEDLINE 97432815 REFERENCE 2 (bases 1 to 1370) AUTHORS Brenner,V. TITLE Direct Submission JOURNAL Submitted (23-JAN-1996) Volker Brenner, Genome Analysis, Institiute of Molecular, Biotechnology, Beutenbergstrasse 11, Jena, Thueringen, 07745, Germany FEATURES Location/Qualifiers source 1..1370 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="I.M.A.A.G.E. Consortium ID 42942" /dev_stage="infant" /tissue_type="brain" /clone_lib="Soares infant brain 1NIB" /chromosome="X" /map="q28" gene 80..1261 /gene="H-IDH gamma" CDS 80..1261 /gene="H-IDH gamma" /codon_start=1 /product="NAD (H)-specific isocitrate dehydrogenase gamma subunit precursor" /db_xref="PID:e219959" /db_xref="PID:g1167849" /db_xref="SWISS-PROT:P51553" /translation="MALKVATVAGSAAKAVLGPALLCRPWEVLGAHEVPSRNIFSEQT IPPSAKYGGRHTVTMIPGDGIGPELMLHVKSVFRHACVPVDFEEVHVSSNADEEDIRN AIMAIRRNRVALKGNIETNHNLPPSHKSRNNILRTSLDLYANVIHCKSLPGVVTRHKD IDILIVRENTEGEYSSLEHESVAGVVESLKIITKAKSLRIAEYAFKLAQESGRKKVTA VHKANIMKLGDGLFLQCCREVAARYPQITFENMIVDNTTMQLVSRPQQFDVMVMPNLY GNIVNNVCAGLVGGPGLVAGANYGHVYAVFETATRNTGKSIANKNIANPTATLLASCM MLDHLKLHSYATSIRKAVLASMDNENMHTPDIGGQGTTSEAIQDVIRHIRVINGRAVE A" polyA_signal 1346 polyA_site 1364 BASE COUNT 311 a 426 c 381 g 252 t ORIGIN 1 ccgaaacttc gcaccccgtc gaactctcgc gagagcggta tctgcgtgtc gggacgtgcg 61 gaggctctca ctttccgtca tggcgctgaa ggtagcgacc gtcgccggca gcgccgcgaa 121 ggcggtgctc gggccagccc ttctctgccg tccctgggag gttctaggcg cccacgaggt 181 cccctcgagg aacatctttt cagaacaaac aattcctccg tccgctaagt atggcgggcg 241 gcacacggtg accatgatcc caggggatgg catcgggcca gagctcatgc tgcatgtcaa 301 gtccgtcttc aggcacgcat gtgtaccagt ggactttgaa gaggtgcacg tgagttccaa 361 tgctgatgaa gaggacattc gcaatgccat catggccatc cgccggaacc gcgtggccct 421 gaagggcaac atcgaaacca accataacct gccaccgtcg cacaaatctc gaaacaacat 481 ccttcgcacc agcctggacc tctatgccaa cgtcatccac tgtaagagcc ttccaggcgt 541 ggtgacccgg cacaaggaca tagacatcct cattgtccgg gagaacacag agggcgagta 601 cagcagcctg gagcatgaga gtgtggcggg agtggtggag agcctgaaga tcatcaccaa 661 ggccaagtcc ctgcgcattg ccgagtatgc cttcaagctg gcgcaggaga gcgggcgcaa 721 gaaagtgacg gccgtgcaca aggccaacat catgaaactg ggcgatgggc ttttcctcca 781 gtgctgcagg gaggtggcag cccgctaccc tcagatcacc ttcgagaaca tgattgtgga 841 taacaccacc atgcagctgg tgtcccggcc ccagcagttt gatgtcatgg tgatgcccaa 901 tctctatggc aacatcgtca acaatgtctg cgcgggactg gtcgggggcc caggccttgt 961 ggctggggcc aactatggcc atgtgtacgc ggtgtttgaa acagctacga ggaacaccgg 1021 caagagtatc gccaataaga acatcgccaa ccccacggcc accctgctgg ccagctgcat 1081 gatgctggac cacctcaagc tgcactccta tgccacctcc atccgtaagg ctgtcctggc 1141 atccatggac aatgagaata tgcacactcc ggacatcggg ggccagggca caacatctga 1201 agccatccag gacgtcatcc gccacatccg cgtcatcaac ggccgggccg tggaggccta 1261 ggctggccct aggaccttct tggtttgctc cttggattcc ccttcccact ccagcacccc 1321 agccagcctg gtacgcagat cccagaataa agcaccttct ccctaaaaaa // LOCUS HSRNAIFMH 1510 bp RNA PRI 13-APR-1994 DEFINITION H.sapiens IFMH mRNA. ACCESSION X76562 NID g472992 KEYWORDS IFMH gene; intrinsic factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1510) AUTHORS Hannappel,M. TITLE Direct Submission JOURNAL Submitted (06-DEC-1993) M. Hannappel, Lab fuer Molekulare Biologie, Am Klopferspitz, 82152 Martinsried, FRG REFERENCE 2 (bases 1 to 1510) AUTHORS Hannappel,M., Kehl,M. and Winnacker,E.L. TITLE A cDNA sequence of the human intrinsic factor JOURNAL Unpublished FEATURES Location/Qualifiers source 1..1510 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="corpus region of the stomach" /chromosome="11" gene 1..1510 /gene="IFHM" sig_peptide 1..18 /gene="IFHM" gene 48..1301 /gene="IFMH" CDS 48..1301 /gene="IFMH" /codon_start=1 /product="intrinsic factor" /db_xref="PID:g472993" /translation="MAWFALYLLSLLWATAGTSTQTQSSCSVPSAQEPLVNGIQVLME NSVTSSAYPNPSILIAMNLAGAYNLKAQKLLTYQLMSSDNNDLTIGQLGLTIMALTSS CRDPGDKVSILQRQMENWAPSSPNAEASAFYGPSLAILALCQKNSEATLPIAVRFAKT LLANSSPFNVDTGAMATLALTCMYNKIPVGSEEGYRSLFGQVLKDIVEKISMKIKDNG IIGDIYSTGLAMQALSVTPEPSKKEWNCKKTTDMILNEIKQGKFHNPMSIAQILPSLK GKTYLDVPQVTCSPDHEVQPTLPSNPGPGPTSASNITVIYTINNQLRGVELLFNETIN VSVKSGSVLLVVLEEAQRKNPMFKFETTMTSWGLVVSSINNIAENVNHKTYWQFLSGV TPLNEGVADYIPFNHEHITANFTQY" misc_feature 81..118 /gene="IFMH" /note="palindromic sequence" polyA_site 1460 /gene="IFHM" /note="addition site 1" polyA_site 1510 /gene="IFHM" /note="addition site 2" BASE COUNT 434 a 401 c 320 g 355 t ORIGIN 1 ggcaaatcgc ggtacctgtg gatgagagac atagacgaga gagtgagatg gcctggtttg 61 ccctctacct cctgagcctt ctctgggcta cagctgggac tagtacccag acccagagtt 121 catgctccgt tccctcagca caggagccct tggtcaatgg aatacaagta ctcatggaga 181 actcggtgac ttcatcagcc tacccaaacc ccagcatcct gattgccatg aatctggccg 241 gagcctacaa cttgaaggcc cagaagctcc tgacttacca gctcatgtcc agcgacaaca 301 acgatctaac cattgggcag ctcggcctca ccatcatggc cctcacctcc tcctgccgag 361 accctgggga taaagtatcc attctacaaa gacaaatgga gaactgggca ccttccagcc 421 ccaacgctga agcatcagcc ttctatgggc ccagtctagc gatcttggca ctgtgccaga 481 agaactctga ggcgaccttg ccgatagccg tccgctttgc caagaccctg ctggccaact 541 cctctccctt caatgtagac acaggagcaa tggcaacctt ggctctgacc tgtatgtaca 601 acaagatccc tgtaggttca gaggaaggtt acagatccct gtttggtcag gtactaaagg 661 atattgtgga gaaaatcagc atgaagatca aagataatgg catcattgga gacatctaca 721 gtactggcct cgccatgcag gctctctctg taacacctga gccatctaaa aaggaatgga 781 actgcaagaa gactacggat atgatactca atgagattaa gcaggggaaa ttccacaacc 841 ccatgtccat tgctcaaatc ctcccttccc tgaaaggcaa gacataccta gatgtgcccc 901 aggtcacttg tagtcctgat catgaggtac aaccaactct acccagcaac cctggccctg 961 gccccacctc tgcatctaac atcactgtca tatacaccat aaataaccag ctgagggggg 1021 ttgagctgct cttcaacgag accatcaatg ttagtgtgaa aagtgggtca gtgttacttg 1081 ttgtcctaga ggaagcacag cgcaaaaatc ctatgttcaa atttgaaacc acaatgacat 1141 cttggggcct tgtcgtctct tctatcaaca atatcgcgga aaatgttaat cacaagacat 1201 actggcagtt tcttagtggt gtaacacctt tgaatgaagg ggttgctgac tacataccct 1261 tcaaccacga gcacatcaca gccaatttca cacagtacta acgaagaggt gggttcagct 1321 tctatcaaac atctccaaag gatgggtgaa attttttcca cttcatttta aatctatgca 1381 aaaaagcgaa tgcctgtgat gctaccatat tcctggtaaa aacatggaga accactatgt 1441 agaataaaaa tgcaaagttc actggagtct caacatctat gactcatgaa aataaaattt 1501 tcatcttctc // LOCUS HSRNALA4P 6064 bp RNA PRI 17-SEP-1997 DEFINITION H.sapiens mRNA for laminin alpha 4 protein. ACCESSION X91171 NID g1212962 KEYWORDS LAMA4 gene; laminin alpha 4 protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6064) AUTHORS Richards,A., Al-Imara,L. and Pope,F.M. TITLE The complete cDNA sequence of laminin alpha 4 and its relationship to the other human laminin alpha chains JOURNAL Eur. J. Biochem. 238 (3), 813-821 (1996) MEDLINE 96300249 REFERENCE 2 (bases 1 to 6064) AUTHORS Richards,A.J. TITLE Direct Submission JOURNAL Submitted (31-AUG-1995) A.J. Richards, University of Cambridge, MRC Connective Tissue Genetics Group, Strangeways Research Lab., Worts Causeway, Cambridge CB2 4RN, UK REFERENCE 3 (bases 1 to 6064) AUTHORS Durkin,M.E., Loechel,F., Mattei,M.G., Gilpin,B.J., Albrechtsen,R. and Wewer,U.M. TITLE Tissue-specific expression of the human laminin alpha5-chain, and mapping of the gene to human chromosome 20q13.2-13.3 and to distal mouse chromosome 2 near the locus for the ragged (Ra) mutation JOURNAL FEBS Lett. 411 (2-3), 296-300 (1997) MEDLINE 97415425 COMMENT Overlaps with X76939 and X70904. FEATURES Location/Qualifiers source 1..6064 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="heart" /clone_lib="lambda ZAP II" /chromosome="6" /map="q21" gene 284..5734 /gene="LAMA4" CDS 284..5734 /gene="LAMA4" /codon_start=1 /product="laminin alpha 4" /db_xref="PID:e198045" /db_xref="PID:g1212963" /db_xref="SWISS-PROT:Q16363" /translation="MALSSAWRSVLPLWLLWSAACSRAASGDDNAFPFDIEGSSAVGR QDPPETSEPRVALGRLPPAAEKCNAGFFHTLSGECVPCDCNGNSNECLDGSGYCVHCQ RNTTGEHCEKCLDGYIGDSIRGAPQFCQPCPCPLPHLANFAESCYRKNGAVRCICNEN YAGPNCERCAPGYYGNPLLIGSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTG FKCERCAPGYYGDARIAKNCAVCNCGGGPCDSVTGECLEEGFEPPTGCDKCVWDLTDD LRLAALSIEEGKSGVLSVSSGAAAHRHVNEINATIYLLKTKLSERENQYALRKIQINN AENTMKSLLSDVEELVEKENQASRKGQLVQKESMDTINHASQLVEQAHDMRDKIQEIN NKMLYYGEEHELSPKEISEKLVLAQKMLEEIRSRQPFFTQRELVDEEADEAYELLSQA ESWQRLHNETRTLFPVVLEQLDDYNAKLSDLQEALDQALNHVRDAEDMNRATAARQRD HEKQQERVREQMEVVNMSLSTSADSLTTPRLTLSELDDIIKNASGIYAEIDGAKSELQ VKLSNLSNLSHDLVQEAIDHAQDLQQEANELSRKLHSSDMNGLVQKALDASNVYENIV NYVSEANETAEFALNTTDRIYDAVSGIDTQIIYHKDESENLLNQARELQAKAESSSDE AVADTSRRVGGALARKSALKTRLSDAVKQLQAAERGDAQQRLGQSRLITEEANRTTME VQQATAPMANNLTNWSQNLQHFDSSAYNTAVNSARDAVRNLTEVVPQLLDQLRTVEQK RPASNVSASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVHSRTSMDDLKAFTSLSL YMKPPVKRPELTETADQFILYLGSKNAKKEYMGLAIKNDNLVYVYNLGTKDVEIPLDS KPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEFSGDDSLLDLDPED TVFYVGGVPSNFKLPTSLNLPGFVGCLELATLNNDVISLYNFKHIYNMDPSTSVPCAR DKLAFTQSRAASYFFDGSGYAVVRDITRRGKFGQVTRFDIEVRTPADNGLILLMVNGS MFFRLEMRNGYLHVFYDFGFSSGRVHLEDTLKKAQINDAKYHEISIIYHNDKKMILVV DRRHVKSMDNEKMKIPFTDIYIGGAPPEILQSRALRAHLPLDINFRGCMKGFQFQKKD FNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEGGFNFRTLQPN GLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSVDKQYNDGLSHFVISSVSPTRYELI VDKSRVGSKNPTKGKIEQTQASEKKFYFGGSPISAQYANFTGCISNAYFTRVDRDVEV EDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPKASQNKKGGKSKDAPSWDPVA LKLPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLRT RSSHGMIFYVSDQEENDFMTLFLAHGRLVYMFNVGHKKLKIRSQEKYNDGLWHDVIFI RERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFSGC LSNLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFE VRPRSSSGTLVHGHSVNGEYLNVHMKNGQVIVKVNNGIRDFSTSVTPKQSLCDGRWHR ITVIRDSNVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLLTPRLAPSKPFTG CIRHFVIDGHPVSFSKAALVSGAVSINSCPAA" polyA_signal 6016..6021 polyA_signal 6037..6042 polyA_site 6043 BASE COUNT 1786 a 1370 c 1496 g 1412 t ORIGIN 1 agaaggtaaa aagggagtgg tgagaatgaa tgtgagaagg aagccaggac agcgcagtcc 61 ccagtcccga acggccaggg agaggaggtg gcctagcgct ggcggggctc accccaatcc 121 gtctgccttt tgatgccgta ctctgctggt tgcgcagcca cctcgggata ctgcacacgg 181 agaggaggga aaataagcga ggcaccgccg caccacgcgg gagacctacg gagacccaca 241 gcgcccgagc cctggaagag cactactgga tgtcagcgga gaaatggctt tgagctcagc 301 ctggcgctcg gttctgcctc tgtggctcct ctggagcgct gcctgctccc gcgccgcgtc 361 cggggacgac aacgcttttc cttttgacat tgaagggagc tcagcggttg gcaggcaaga 421 cccgcctgag acgagcgaac cccgcgtggc tctgggacgc ctgccgcctg cggccgagaa 481 atgcaatgct ggattctttc acaccctgtc gggagaatgt gtgccctgcg actgtaatgg 541 caattccaac gagtgtttgg acggctcagg atactgtgtg cactgccagc ggaacacaac 601 aggagagcac tgtgaaaagt gtctggatgg ttatatcgga gattccatca ggggagcacc 661 ccaattctgc cagccgtgcc cctgtcccct gccccacttg gccaattttg cagaatcctg 721 ctataggaaa aatggagctg ttcggtgcat ttgtaacgaa aattatgctg gacctaactg 781 tgaaagatgt gctcccggtt actatggaaa ccccttactc attggaagca cctgtaagaa 841 atgtgactgc agtggaaatt cagatcccaa cctgatcttt gaagattgtg atgaagtcac 901 tggccagtgt aggaattgct tacgcaacac caccggattc aagtgtgaac gttgcgctcc 961 tggctactat ggggacgcca ggatagccaa gaactgtgca gtgtgcaact gcgggggagg 1021 cccatgtgac agtgtaaccg gagaatgctt ggaagaaggt tttgaacccc ctacaggctg 1081 tgataagtgc gtctgggacc tgactgatga cctgcggtta gcagcgctct ccatcgagga 1141 aggcaaatcc ggggtgctga gcgtatcctc tggggccgcc gctcataggc acgtgaatga 1201 aatcaacgcc accatctacc tcctcaaaac aaaattgtca gaaagagaaa accaatacgc 1261 cctaagaaag atacaaatca acaatgctga gaacacgatg aaaagccttc tgtctgacgt 1321 agaggaatta gttgaaaagg aaaatcaagc ctccagaaaa ggacaacttg ttcagaagga 1381 aagcatggac accattaacc acgcaagtca gctggtagag caagcccatg atatgaggga 1441 taaaatccaa gagatcaaca acaagatgct ctattatggg gaagagcatg aacttagccc 1501 caaggaaatc tctgagaagc tggtgttggc ccagaagatg cttgaagaga ttagaagccg 1561 tcaaccattt ttcacccaac gggagctcgt ggatgaggag gcagatgagg cttacgaact 1621 actgagccag gctgagagct ggcagcggct gcacaatgag acccgcactc tgtttcctgt 1681 cgtcctggag cagctggatg actacaatgc taagttgtca gatctccagg aagcacttga 1741 ccaggccctt aaccatgtca gggatgccga agacatgaac agggccacag cagccaggca 1801 gcgggaccat gagaaacaac aggaaagagt gagggaacaa atggaagtgg tgaacatgtc 1861 tctgagcaca tctgcggact ctctgacaac acctcgtcta actctttcag aacttgatga 1921 tataataaag aatgcgtcag ggatttatgc agaaatagat ggagccaaaa gtgaactaca 1981 agtaaaacta tctaacctaa gtaacctcag ccatgattta gtccaagaag ctattgacca 2041 tgcacaggac cttcaacaag aagctaatga attgagcagg aagttgcaca gttcagatat 2101 gaacgggctg gtacagaagg ctttggatgc atcaaatgtc tatgaaaata ttgttaatta 2161 tgttagtgaa gccaatgaaa cagcagaatt tgctttgaac accactgacc gaatttatga 2221 tgcggtgagt gggattgata ctcaaatcat ttaccataaa gatgaaagtg agaacctcct 2281 caatcaagcc agagaactgc aagcaaaggc agagtctagc agtgatgaag cagtggctga 2341 cactagcagg cgtgtgggtg gagccctagc aaggaaaagt gcccttaaaa ccagactcag 2401 tgatgccgtt aagcaactac aagcagcaga gagaggggat gcccagcagc gcctggggca 2461 gtctagactg atcaccgagg aagccaacag gacgacgatg gaggtgcagc aggccactgc 2521 ccccatggcc aacaatctaa ccaactggtc acagaatctt caacattttg actcttctgc 2581 ttacaacact gcagtgaact ctgctaggga tgcagtaaga aatctgaccg aggttgtccc 2641 tcagctcctg gatcagcttc gtacggttga gcagaagcga cctgcaagca acgtttctgc 2701 cagcatccag aggatccgag agctcattgc tcagaccaga agtgttgcca gcaagatcca 2761 agtctccatg atgtttgatg gccagtcagc tgtggaagtg cactcgagaa ccagtatgga 2821 tgacttaaag gccttcacgt ctctgagcct gtacatgaaa ccccctgtga agcggccgga 2881 actgaccgag actgcagatc agtttatcct gtacctcgga agcaaaaacg ccaaaaaaga 2941 gtatatgggt cttgcaatca aaaatgataa tctggtatac gtctataatt tgggaactaa 3001 agatgtggag attcccctgg actccaagcc cgtcagttcc tggcctgctt acttcagcat 3061 tgtcaagatt gaaagggtgg gaaaacatgg aaaggtgttt ttaacagtcc cgagtctaag 3121 tagcacagca gaggaaaagt tcattaaaaa gggggaattt tcgggagatg actctctgct 3181 ggacctggac cctgaggaca cagtgtttta tgttggtgga gtgccttcca acttcaagct 3241 ccctaccagc ttaaacctgc ctggctttgt tggctgcctg gaactggcca ctttgaataa 3301 tgatgtgatc agcttgtaca actttaagca catctataat atggacccct ccacatcagt 3361 gccatgtgcc cgagataagc tggccttcac tcagagtcgg gctgccagtt acttcttcga 3421 tggctccggt tatgccgtgg tgagagacat cacaaggaga gggaaatttg gtcaggtgac 3481 tcgctttgac atagaagttc gaacaccagc tgacaacggc cttattctcc tgatggtcaa 3541 tggaagtatg tttttcagac tggaaatgcg caatggttac ctacatgtgt tctatgattt 3601 tggattcagc agtggccgtg tgcatcttga agatacgtta aagaaagctc aaattaatga 3661 tgcaaaatac catgagatct caatcattta ccacaatgat aagaaaatga tcttggtagt 3721 tgacagaagg catgtcaaga gcatggataa tgaaaagatg aaaatacctt ttacagatat 3781 atacattgga ggagctcctc cagaaatctt acaatccagg gccctcagag cacaccttcc 3841 cctagatatc aacttcagag gatgcatgaa gggcttccag ttccaaaaga aggacttcaa 3901 tttactggag cagacagaaa ccctgggagt tggttatgga tgcccagaag actcacttat 3961 atctcgcaga gcatatttca atggacagag cttcattgct tcaattcaga aaatatcttt 4021 ctttgatggc tttgaaggag gttttaattt ccgaacatta caaccaaatg ggttactatt 4081 ctattatgct tcagggtcag acgtgttctc catctcactg gataatggta ctgtcatcat 4141 ggatgtaaag ggaatcaaag ttcagtcagt agataagcag tacaatgatg ggctgtccca 4201 cttcgtcatt agctctgtct cacccacaag atatgaactg atagtagata aaagcagagt 4261 tgggagtaag aatcctacca aagggaaaat agaacagaca caagcaagtg aaaagaagtt 4321 ttacttcggt ggctcaccaa tcagtgctca gtatgctaat ttcactggct gcataagtaa 4381 tgcctacttt accagggtgg atagagatgt ggaggttgaa gatttccaac ggtatactga 4441 aaaggtccac acttctcttt atgagtgtcc cattgagtct tcaccattgt ttctcctcca 4501 taaaaaagga aaaaatttat ccaagcctaa agcaagtcag aataaaaagg gagggaaaag 4561 taaagatgca ccttcatggg atcctgttgc tctgaaactc ccagagcgga atactccaag 4621 aaactctcat tgccaccttt ccaacagccc tagagcaata gagcacgcct atcaatatgg 4681 aggaacagcc aacagccgcc aagagtttga acacttaaaa ggagattttg gtgccaaatc 4741 tcagttttcc attcgtctga gaactcgttc ctcccatggc atgatcttct atgtctcaga 4801 tcaagaagag aatgacttca tgactctatt tttggcccat ggccgcttgg tttacatgtt 4861 taatgttggt cacaaaaaac tgaagattag aagccaggag aaatacaatg atggcctgtg 4921 gcatgatgtg atatttattc gagaaaggag cagtggccga ctggtaattg atggtctccg 4981 agtcctagaa gaaagtcttc ctcctactga agctacctgg aaaatcaagg gtcccattta 5041 tttgggaggt gtggctcctg gaaaggctgt gaaaaatgtt cagattaact ccatctacag 5101 ttttagtggc tgtctcagca atctccagct caatggggcc tccatcacct ctgcttctca 5161 gacattcagt gtgacccctt gctttgaagg ccccatggaa acaggaactt acttttcaac 5221 agaaggagga tacgtggttc tagatgaatc tttcaatatt ggattgaagt ttgaaattgc 5281 atttgaagtc cgtcccagaa gcagttccgg aaccctggtc cacggccaca gtgtcaatgg 5341 ggagtaccta aatgttcaca tgaaaaatgg acaggtcata gtgaaagtca ataatggcat 5401 cagagatttt tccacctcag taacacccaa gcagagtctc tgtgatggca gatggcacag 5461 aattacagtt attagagatt ctaatgtggt tcagttggat gtggactctg aagtgaacca 5521 tgtggttgga cccctgaatc caaaaccaat tgatcacagg gagcctgtgt ttgttggagg 5581 tgttccagaa tctctactga caccacgctt ggcccccagc aaacccttca caggctgcat 5641 acgccacttt gtgattgatg gacacccagt gagcttcagt aaagcagccc tggtcagcgg 5701 cgccgtaagc atcaactcct gtccagcagc ctgacatgac agagcacagc tgcccaaata 5761 caaagttctt tagagcactg aaagaaacac aaagccagcc aggaggaaca gtaactcttc 5821 cttcgggtgg aagctttcat cgagttgaac aggacttaaa cgaatcatca gggaccggat 5881 atttcttatt tctcatttgg attcttaacc ttgaatccaa agtgtctgca atggacaaca 5941 attgaaggag aggcaaactt acttgtattg agagcacacg caattcctac tggtgaaatt 6001 actgtttctg tttctaataa aatagaaggg attccaaata aaaaaaaaaa aaaaaaaaaa 6061 aaaa // LOCUS HSRNALAGA 2296 bp RNA PRI 02-JUN-1995 DEFINITION H.sapiens mRNA for L-arginine:glycine amidinotransferase. ACCESSION X86401 NID g791048 KEYWORDS L-arginine: glycine amidinotransferase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2296) AUTHORS Austruy,E., Belley,L., Millasot,P., Junien,C. and Jeanpierre,C. TITLE Characterization of the human cDNA with partial homology with the gamma subunit of sodium potassium ATPase of rat, mouse, rabbit and sheep JOURNAL Unpublished REFERENCE 2 (bases 1 to 2296) AUTHORS Austruy,E. TITLE Direct Submission JOURNAL Submitted (18-APR-1995) E. Austruy, INSERM 383, Hopital Necker-Enfants Malades, Clinique M. Lamy, 149 rue de Sevres, 75015 Paris, FRANCE COMMENT Sequence overlapping with that under the acc#X65706. FEATURES Location/Qualifiers source 1..2296 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" /clone_lib="lambda gt10" /chromosome="15" /map="q15-21.3" CDS 506..1681 /codon_start=1 /product="L-arginine: glycine amidinotransferase" /db_xref="PID:g791049" /translation="MNILKSTQAATASSRNSCAADDKATEPLPKDCPVSSYNEWDPLE EVIVGRAENACVPPFTIEVKANTYEKYWPFYQKQGGHYFPKDHLKKAVAEIEEMCNIL KTEGVTVRRPDPIDWSLKYKTPDFESTGLYSAMPRDILIVVGNEIIEAPMAWRSRFFE YRAYRSIIKDYFHRGAKWTTAPKPTMADELYNQDYPIHSVEDRHKLAAQGKFVTTEFE PCFDAADFIRAGRDIFAQRSQVTNYLGIEWMRRHLAPDYRVHIISFKDPNPMHIDATF NIIGPGIVLSNPDRPCHQIDLFKKAGWTIITPPTPIIPDDHPLWMSSKWLSMNVLMLD EKRVMVDANEVPIQKMFEKLGITTIKVNIRNANSLGGGFHCWTCDVRRRGTLQSYLD" BASE COUNT 697 a 450 c 482 g 667 t ORIGIN 1 atgataaatt taatcccgtt tcagaccttg aaggaagtaa ttttgaagtt attggcaaca 61 tacatcaata ttctaattta ttagttgatg acaccgaccg tatcagtcct gatgacatcg 121 gcaatgatat tcatgagttt ttacacggag gagataagtg atggctgttg aatatgaaaa 181 tggtggtcat ggttacgtgg aacatgcatc tataggaggc gttattgtac acgtgtttta 241 tgcagacgat aaggaagagg aggataaata atgttagata actttgttga attgaaaaat 301 aagtatgccg atgaacttgg aaaccgtggt agaaagtgtg atgagatgaa attcaccagc 361 gaaaaagtta atgaattgat cggtgttgat gaagcgttta aagttccaaa caagttaatg 421 tcaatcatga tgaatcgtga acaacgtgag caaacgttta aagcattttt ggaagttgag 481 cgtgatacat cattcgattg gttccatgaa tattttgaag agcacccagg cagctacggc 541 ttcctcccgg aactcctgtg cagctgacga caaagccact gagcctctgc ccaaggactg 601 ccctgtctct tcttacaacg aatgggaccc cttagaggaa gtgatagtgg gcagagcaga 661 aaacgcctgt gttccaccgt tcaccatcga ggtgaaggcc aacacatatg aaaagtactg 721 gccattttac cagaagcaag gagggcatta ttttcccaaa gatcatttga aaaaggctgt 781 tgctgaaatt gaagaaatgt gcaatatttt aaaaacggaa ggagtgacag taaggaggcc 841 tgaccccatt gactggtcat tgaagtataa aactcctgat tttgagtcta cgggtttata 901 cagtgcaatg cctcgagaca tcctgatagt tgtgggcaat gagattatcg aggctcccat 961 ggcatggcgt tcacgcttct ttgagtaccg agcgtacagg tcaattatca aagactactt 1021 ccaccgtggc gccaagtgga caacagctcc taagcccaca atggctgatg agctttataa 1081 ccaggattat cccatccact ctgtagaaga cagacacaaa ttggctgctc agggaaaatt 1141 tgtgacaact gagtttgagc catgctttga tgctgctgac ttcattcgag ctggaagaga 1201 tatttttgca cagagaagcc aggttacaaa ctacctaggc attgaatgga tgcgtaggca 1261 tcttgctcca gactacagag tgcatatcat ctcctttaaa gatcccaatc ccatgcatat 1321 tgatgctacc ttcaacatca ttggacctgg tattgtgctt tccaaccctg accgaccatg 1381 tcaccagatt gatcttttca agaaagcagg atggactatc attactcctc caacaccaat 1441 catcccagac gatcatccac tctggatgtc atccaaatgg ctttccatga atgtcttaat 1501 gctagatgaa aaacgtgtta tggtggatgc caatgaagtt ccaattcaaa agatgtttga 1561 aaagctgggt atcactacca ttaaagttaa cattcgtaat gccaattccc tgggaggagg 1621 cttccattgc tggacctgcg atgtccggcg ccgaggcacc ttacagtcct acttggactg 1681 aacaggcctg atggagcttg tggctggcct cagatacacc taagaagctt aggggcaagg 1741 ttcattctcc tgctttaaaa agtgcatgaa ctgtagtgct ttaaacaatc atctccttaa 1801 caggggtcgt aagcctggtt tgcttctatt acttttcttt gacataaaga aaataacttc 1861 tgctaggtat tactctctac tcctaaagtt atttactatt tggcttcaag tataaaattt 1921 tggtgaatgt gtaccaagaa aaaattagtc acctgagtaa cttggccact aataattaac 1981 catctacctc tgtttttaat tttctttcca aaaggcagct tgaaatgttg gtcctaatct 2041 taattttttt tcctcttcta tagacttgag aatgtttttc tctaaatgag agaaagactt 2101 agaatgtaca cagatccaaa atagaatcag attatctctt tttttctaaa ggagagaaag 2161 acttagaaca tacacagatc ctaagtagaa ccaggtaatt gtctcttttt ctaataagga 2221 atttgggtaa tttttaattt tttgtttttt aaaaaataac ctagactatg caaaacatca 2281 aagccggaat tctttt // LOCUS HSRNALICA 3654 bp RNA PRI 01-JUN-1995 DEFINITION H.sapiens mRNA for LI-cadherin. ACCESSION X83228 NID g854174 KEYWORDS LI-cadherin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3654) AUTHORS Boettinger,A., Kreft,B., Fieger,C., Dlouhy,B., Berndorff,D., Goessner,R. and Tauber,R. TITLE Molecular cloning of human LI-cadherin:evidence for a novel type of cadherin within the cadherin superfamily JOURNAL Unpublished REFERENCE 2 (bases 1 to 3654) AUTHORS Boettinger,A.M. TITLE Direct Submission JOURNAL Submitted (05-DEC-1994) A.M. Boettinger, Institut fuer Klinische Chemie & Bioch., Universitaetsklinikum Rudolf Virchow, Freie Universitaet Berlin, Spandauer Damm 130, 14050 Berlin, FRG COMMENT Sequence overlapping with that unde the accession number U07969. FEATURES Location/Qualifiers source 1..3654 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="uni-ZAP TM vector" CDS 93..2591 /function="cell adhesion molecule" /codon_start=1 /product="LI-cadherin" /db_xref="PID:g854175" /translation="MILQAHLHSLCLLMLYLATGYGQEGKFSGPLKPMTFSIYEGQEP SQIIFQFKANPPAVTFELTGETDNIFVIEREGLLYYNRALDRETRSTHNLQVAALDAN GIIVEGPVPITIKVKDINDNRPTFLQSKYEGSVRQNSRPGKPFLYVNATDLDDPATPN GQLYYQIVIQLPMINNVMYFQINNKTGAISLTREGSQELNPAKNPSYNLVISVKDMGG QSENSFSDTTSVDIIVTENIWKAPKPVEMVENSTDPHPIKITQVRWNDPGAQYSLVDK EKLPRFPFSIDQEGDIYVTQPLDREEKDAYVFYAVAKDEYGKPLSYPLEIHVKVKDIN DNPPTCPSPVTVFEVQENERLGNSIGTLTAHDRDEENTANSFLNYRIVEQTPKLPMDG LFLIQTYAGMLQLAKQSLKKQDTPQYNLTIEVSDKDFKTLCFVQINVIDINDQTPIFE KSDYGNLTLAEDTNIGSTILTIQATDADEPFTGSSKILYHIIKGDSEGRLGVDTDPHT NTGYVIIKKPLDFETAAVSNIVFKAENPEPLVFGVKYNASSFAKFTLIVTDVNEAPQF SQHVFQAKVSEDVAIGTKVGNVTAKDPEGLDISYSLRGDTRGWLKIDHVTGEIFSVAP LDREAGSPYRVQVVATEVGGSSLSSVSEFHLILMDVNDNPPRLAKDYTGLFFCHPLSA PGSLIFEATDDDQHLFRGPHFTFSLGSGSLQNDWEVSKINGTHARLSTRHTEFEEREY VVLIRINDGGRPPLEGIVSLPVTFCSCVEGSCFRPAGHQTGIPTVGMAVGILLTTLLV IGIILAVVFIRIKKDKGKDNVESAQASEVKPLRS" BASE COUNT 1052 a 804 c 777 g 1021 t ORIGIN 1 gtcgtagcaa gagtctcgac cactgaatgg aagaaaagga cttttaacca ccattttgtg 61 acttacagaa aggaatttga ataaagaaaa ctatgatact tcaggcccat cttcactccc 121 tgtgtcttct tatgctttat ttggcaactg gatatggcca agaggggaag tttagtggac 181 ccctgaaacc catgacattt tctatttatg aaggccaaga accgagtcaa attatattcc 241 agtttaaggc caatcctcct gctgtgactt ttgaactaac tggggagaca gacaacatat 301 ttgtgataga acgggaggga cttctgtatt acaacagagc cttggacagg gaaacaagat 361 ctactcacaa tctccaggtt gcagccctgg acgctaatgg aattatagtg gagggtccag 421 tccctatcac cataaaagtg aaggacatca acgacaatcg acccacgttt ctccagtcaa 481 agtacgaagg ctcagtaagg cagaactctc gcccaggaaa gcccttcttg tatgtcaatg 541 ccacagacct ggatgatccg gccactccca atggccagct ttattaccag attgtcatcc 601 agcttcccat gatcaacaat gtcatgtact ttcagatcaa caacaaaacg ggagccatct 661 ctcttacccg agagggatct caggaattga atcctgctaa gaatccttcc tataatctgg 721 tgatctcagt gaaggacatg ggaggccaga gtgagaattc cttcagtgat accacatctg 781 tggatatcat agtgacagag aatatttgga aagcaccaaa acctgtggag atggtggaaa 841 actcaactga tcctcacccc atcaaaatca ctcaggtgcg gtggaatgat cccggtgcac 901 aatattcctt agttgacaaa gagaagctgc caagattccc attttcaatt gaccaggaag 961 gagatattta cgtgactcag cccttggacc gagaagaaaa ggatgcatat gttttttatg 1021 cagttgcaaa ggatgagtac ggaaaaccac tttcatatcc gctggaaatt catgtaaaag 1081 ttaaagatat taatgataat ccacctacat gtccgtcacc agtaaccgta tttgaggtcc 1141 aggagaatga acgactgggt aacagtatcg ggacccttac tgcacatgac agggatgaag 1201 aaaatactgc caacagtttt ctaaactaca ggattgtgga gcaaactccc aaacttccca 1261 tggatggact cttcctaatc caaacctatg ctggaatgtt acagttagct aaacagtcct 1321 tgaagaagca agatactcct cagtacaact taacgataga ggtgtctgac aaagatttca 1381 agaccctttg ttttgtgcaa atcaacgtta ttgatatcaa tgatcagacc cccatctttg 1441 aaaaatcaga ttatggaaac ctgactcttg ctgaagacac aaacattggg tccaccatct 1501 taaccatcca ggccactgat gctgatgagc catttactgg gagttctaaa attctgtatc 1561 atatcataaa gggagacagt gagggacgcc tgggggttga cacagatccc cataccaaca 1621 ccggatatgt cataattaaa aagcctcttg attttgaaac agcagctgtt tccaacattg 1681 tgttcaaagc agaaaatcct gagcctctag tgtttggtgt gaagtacaat gcaagttctt 1741 ttgccaagtt cacgcttatt gtgacagatg tgaatgaagc acctcaattt tcccaacacg 1801 tattccaagc gaaagtcagt gaggatgtag ctataggcac taaagtgggc aatgtgactg 1861 ccaaggatcc agaaggtctg gacataagct attcactgag gggagacaca agaggttggc 1921 ttaaaattga ccacgtgact ggtgagatct ttagtgtggc tccattggac agagaagccg 1981 gaagtccata tcgggtacaa gtggtggcca cagaagtagg ggggtcttcc ttgagctctg 2041 tgtcagagtt ccacctgatc cttatggatg tgaatgacaa ccctcccagg ctagccaagg 2101 actacacggg cttgttcttc tgccatcccc tcagtgcacc tggaagtctc attttcgagg 2161 ctactgatga tgatcagcac ttatttcggg gtccccattt tacattttcc ctcggcagtg 2221 gaagcttaca aaacgactgg gaagtttcca aaatcaatgg tactcatgcc cgactgtcta 2281 ccaggcacac agagtttgag gagagggagt atgtcgtctt gatccgcatc aatgatgggg 2341 gtcggccacc cttggaaggc attgtttctt taccagttac attctgcagt tgtgtggaag 2401 gaagttgttt ccggccagca ggtcaccaga ctgggatacc cactgtgggc atggcagttg 2461 gtatactgct gaccaccctt ctggtgattg gtataatttt agcagttgtg tttatccgca 2521 taaagaagga taaaggcaaa gataatgttg aaagtgctca agcatctgaa gtcaaacctc 2581 tgagaagctg aatttgaaaa ggaatgtttg aatttatata gcaagtgcta tttcagcaac 2641 aaccatctca tcctattact tttcatctaa cgtgcattat aattttttaa acagatattc 2701 cctcttgtcc tttaatattt gctaaatatt tcttttttga ggtggagtct tgctctgtcg 2761 cccaggctgg agtacagtgg tgtgatccca gctcactgca acctccgcct cctgggttca 2821 catgattctc ctgcctcagc ttcctaagta gctgggttta caggcaccca ccaccatgcc 2881 cagctaattt ttgtattttt aatagagacg gggtttcgcc atttggccag gctggtcttg 2941 aactcctgac gtcaagtgat ctgcctgcct tggtctccca atacaggcat gaaccactgc 3001 acccacctac ttagatattt catgtgctat agacattaga gagatttttc atttttccat 3061 gacatttttc ctctctgcaa atggcttagc tacttgtgtt tttccctttt ggggcaagac 3121 agactcatta aatattctgt acattttttc tttatcaagg agatatatca gtgttgtctc 3181 atagaactgc ctggattcca tttatgtttt ttctgattcc atcctgtgtc cccttcatcc 3241 ttgactcctt tggtatttca ctgaatttca aacatttgtc agagaagaaa aacgtgagga 3301 ctcaggaaaa ataaataaat aaaagaacag ccttttccct tagtattaac agaaatgttt 3361 ctgtgtcatt aaccatcttt aatcaatgtg acatgttgct ctttggctga aattcttcaa 3421 cttggaaatg acacagaccc acagaaggtg ttcaaacaca acctactctg caaaccttgg 3481 taaaggaacc agtcagctgg ccagatttcc tcactacctg ccatgcatac atgctgcgca 3541 tgttttcttc attcgtatgt tagtaaagtt ttggttatta tatatttaac atgtggaaga 3601 aaacaagaca tgaaaagagt ggtgacaaat caagaataaa cactggttgt agtc // LOCUS HSRNAMUF1 2165 bp RNA PRI 06-JAN-1998 DEFINITION H.sapiens mRNA for MUF1 protein. ACCESSION X86018 NID g762952 KEYWORDS muf1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2165) AUTHORS Kreideweiss,S., Delany-Heiken,P., Nordheim,A. and Ruhlmann,A. JOURNAL Unpublished REFERENCE 2 (bases 1 to 2165) AUTHORS Ruhlmann,A.C.C. TITLE Direct Submission JOURNAL Submitted (30-MAR-1995) A.C.C. Ruhlmann, Medizinische Hochschule Hannover, Institut fur Molekularbiologie, OE 5250, D- 30623 Hannover, FRG COMMENT Sequence overlapping with those under the acc#T78608, T79088, T82209, T82210, T84170, T86616, T86923, T91814, T91903. FEATURES Location/Qualifiers source 1..2165 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda gt11" /tissue_type="tonsils" gene 1..1854 /gene="muf1" CDS 1..1854 /gene="muf1" /codon_start=1 /db_xref="PID:g762953" /translation="MSAGFWQPGPGGPPCRLCGEASRGRAPSRDEGSLLLGSRRPRRD AAERCAAALMASRRKSEAKQMPRAAPATRVTRRSTQESLTAGGTDLKRELHPPATSHE APGTKRSPSAPAATSSASSSTSSYKRAPASSAPQPKPLKRFKRAAGKKGARTRQGPGA ESEDLYDFVFIVAGEKEDGEEMEIGEVACGALDGSDPSCLGLPALEASQRFRSISTLE LFTVPLSTEAALTLCHLLSSWVSLESLTLSYNGLGSNIFRLLDSLRALSGQAGCRLRA LHLSDLFSPLPILELTRAIVRALPLLRVLSIRVDHPSQRDNPGVPGNAGPPSHIIGDE EIPENCLEQLEMGFPRGAQPAPLLCSVLKASGSLQQLSLDSATFASPQDFGLVLQTLK EYNLALKRLSFHDMNLADCQSEVLFLLQNLTLQEITFSFCRLFEKRPAQFLPEMVAAM KGNSTLKGLRLPGNRLGNAGLLALADVFSEDSSSSLCQLDISSNCIKPDGLLEFAKRL ERWGRGAFGHLRLFQNWLDQDAVTAREAIRRLRATCHVVSDSWTHPRPSQIMLAPCDG ARTSQSHARYHQLAGAEAWAAQNPNHQFYLSLSVTFFLFFPSSLALRSWRP" polyA_signal 2138..2143 polyA_site 2159..>2165 BASE COUNT 437 a 665 c 566 g 497 t ORIGIN 1 atgagtgctg gcttctggca accagggcct ggtggcccac cctgccgcct ctgtggagag 61 gcctcccgag gccgggcccc atcccgagat gaagggtccc tcttattggg ctcacgtcgg 121 ccccgccggg atgctgctga gcgatgtgct gcagccctga tggccagccg gcgtaagagt 181 gaagccaagc agatgcccag agctgcacct gccactcggg taacacgccg gagcacacag 241 gagagcctga cagcaggcgg aacagacctt aagagggagc tgcacccccc agccacctcc 301 catgaggctc ctggcaccaa gcggtcacct tctgctccag cagccacctc ctctgcctct 361 tcttctacat cctcatacaa acgggcacca gctagctcag ccccacagcc taagccccta 421 aagcgtttca agcgagctgc agggaagaag ggtgctcgca cccgtcaggg gcctggtgca 481 gagtctgaag acctgtatga cttcgttttt attgtggctg gcgagaagga ggatggcgaa 541 gagatggaga ttggggaagt ggcttgtgga gctttggatg gatcagatcc cagctgcctg 601 gggcttccag cactggaagc ttcacaaaga ttccgcagca tctccacctt ggagctattc 661 acagttccac tctccacaga ggcagccctg acactatgcc acctgctgag ctcctgggtg 721 tcactggaga gcctcacact ctcctacaat ggcctgggct ctaacatctt ccgcctgcta 781 gacagcctgc gggccctgtc aggccaggct ggatgtcgcc tccgtgccct gcatctcagt 841 gacctgttct caccactgcc catcctggag ctgacacgtg ctatcgtgcg agcactgccc 901 ctgctacggg tcctctctat tcgtgttgac cacccaagcc agcgggacaa ccctggtgtg 961 ccagggaatg cagggccccc tagccacata ataggcgatg aggagatacc agaaaactgc 1021 ctggagcagt tggagatggg atttccacgg ggagcccagc cagccccact gctgtgctcc 1081 gttctgaagg cctcgggttc tctgcagcag ctgtccctgg atagtgccac ctttgcctct 1141 ccccaggatt ttgggcttgt tttgcaaaca ctcaaagagt acaacctagc cctgaaaaga 1201 ctgagcttcc atgacatgaa tctcgctgac tgtcagagcg aggtgctctt tttgctacag 1261 aatctgactc tgcaagagat taccttctcc ttctgccgtc tgtttgagaa gcgcccagcc 1321 caatttctgc ctgagatggt tgctgctatg aagggcaact ccacactgaa gggcctccgg 1381 ctgccaggga accgcctggg gaatgctggc ctgctggcct tggcagatgt tttctcagag 1441 gattcatcct cctctctctg tcagctggac atcagttcca actgcatcaa gccagatggg 1501 cttctggagt tcgccaagcg gctggagcgc tggggccgtg gagcctttgg tcacctgcgc 1561 ctcttccaaa actggctgga ccaggatgca gtcacagcca gggaagccat ccggcggctc 1621 cgggctacct gccatgtggt tagcgactca tggactcatc ccaggccttc gcagattatg 1681 ttagcaccat gtgatggggc ccgtacctca cagtctcatg ctcggtacca tcagcttgca 1741 ggggctgaag catgggctgc ccagaacccc aaccaccagt tctatctttc tctttctgtc 1801 accttttttc tcttttttcc ttcttccctt gcactgaggt cctggaggcc ttgatgaggc 1861 ccagcaaaca ggcattctca cagctgggtt tatagtcttt gggcccctta ctcagtatcc 1921 tgggaaccct gggccaggag gttacagtgg tcatcataat tgctgaagag atcccctccc 1981 ctgcccctgg gttcctgcct tccctcctca agcaggcacc caggctttag agaagtatag 2041 ggggcttctt ccctgctggg cttaccacac tgctctcagg cctcaaaccc tttcatacct 2101 ttattctttt ttttaaccaa aaaagttttt cttataaaat aaattttggg caaacatcaa 2161 aaaaa // LOCUS HSRNANEB 20881 bp RNA PRI 11-MAY-1995 DEFINITION H.sapiens mRNA for nebulin. ACCESSION X83957 NID g806561 KEYWORDS nebulin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 20881) AUTHORS Labeit,S. and Kolmerer,B. TITLE The complete primary structure of human nebulin and its correlation to muscle structure JOURNAL J. Mol. Biol. 248 (2), 308-315 (1995) MEDLINE 95257391 REFERENCE 2 (bases 1 to 20881) AUTHORS Labeit,S. TITLE Direct Submission JOURNAL Submitted (17-JAN-1995) S. Labeit, EMBL Heidelberg, Meyerhofstr. 1, D- 69012 Heidelberg, FRG COMMENT Sequence overlapping with thos under the accession numbers X58122, X58123, X70032. FEATURES Location/Qualifiers source 1..20881 /organism="Homo sapiens" /variety="Caucasian" /db_xref="taxon:9606" /dev_stage="adult" /clone_lib="Clontech HL1124a" /chromosome="2" /map="q24" CDS 441..20450 /function="skeletal muscle structural protein" /codon_start=1 /product="nebulin" /db_xref="PID:g806562" /translation="MADDEDYEEVVEYYTEEVVYEEVPGETITKIYETTTTRTSDYEQ SETSKPALAQPALAQPASAKPVERRKVIRKKVDPSKFMTPYIAHSQKMQDLFSPNKYK EKFEKTKGQPYASTTDTPELRRIKKVQDQLSEVKYRMDGDVAKTICHVDEKAKDIEHA KKVSQQVSKVLYKQNWEDTKDKYLLPPDAPELVQAVKNTAMFSKKLYTEDWEADKSLF YPYNDSPELRRVAQAQKALSDVAYKKGLAEQQAQFTPLADPPDIEFAKKVTNQVSKQK YKEDYENKIKGKWSETPCFEVANARMNADNISTRKYQEDFENMKDQIYFMQTETPEYK MNKKAGVAASKVKYKEDYEKNKGKADYNVLPASENPQLRQLKAAGDALSDKLYKENYE KTKAKSINYCETPKFKLDTVLQNFSSDKKYKDSYLKDILGHYVGSFEDPYHSHCMKVT AQNSDKNYKAEYEEDRGKGFFPQTITQEYEAIKKLDQCKDHTYKVHPDKTKFTQVTDS PVLLQAQVNSKQLSDLNYKAKHESEKFKCHIPPDTPAFIQHKVNAYNLSDNLYKQDWE KSKAKKFDIKVDAIPLLAAKANTKNTSDVMYKKDYEKNKGKMIGVLSINDDPKMLHSL KVAKNQSDRLYKENYEKTKAKSMNYCETPKYQLDTQLKNFSEARYKDLYVKDVLGHYV GSMEDPYHTHCMKVAAQNSDKSYKAEYEEDKGKCYFPQTITQEYDAIKKLDQCKDHTY KVHPDKTKFTAVTDSPVLLQAQLNTKQLSDLNYKAKHEGERFKCHIPADAPQFIQHRV NAYNLSDNVYKQDWEKSKAKKFDIKVDAIPLLAAKANTKNTSDVMYKKDYEKSKGKMI GALSINDDPKMLHSLKTAKNQSDREYRKDYEKSKTIYTAPLDMLQVTQAKKSQAIASD VDYKHILHSYSYPPDSINVDLAKKAYALQSDVEYKADYNSWMKGCGWVPFGSLEMEKA KRASDILNEKKYRQHPDTLKFTSIEDAPITVQSKINQAQRSDIAYKAKGEEIIHNYNL PPDLPQFIQAKVNAYNISENMYKADLKDLSKKGYDLRTDAIPIRAAKAARQAASDVQY KKDYEKAKGKMVGFQSLQDDPKLVHYMNVAKIQSDREYKKDYEKTKSKYNTPHDMFNV VAAKKAQDVVSNVNYKHSLHHYTYLPDAMDLELSKNMMQIQSDNVYKEDYNNWMKGIG WIPIGSLDVEKVKKAGDALNEKKYRQHPDTLKFTSIVDSPVMVQAKQNTKQVSDILYK AKGEDVKHKYTMSPDLPQFLQAKCNAYSISDVCYKRDWHDLIRKGNNVLGDAIPITAA KASRNIASDYKYKEAYEKSKGKHVGFRSLQDDPKLVHYMNVAKLQSDREYKKNYENTK TSYHTPGDMVTITAAKMAQDVATNVNYKQPLHHYTYLPDAMSLEHTRNVNQIQSDNVY KDEYNSFLKGIGWIPIGSLEVEKVKKAGDALNERKYRQHPDTVKFTSVPDSMGMMLAQ HNTKQLSDLNYKVEGEKLKHKYTIDPELPQFIQAKVNALNMSDAHYKADWKKTIRKGY DLRPDAIPIVAAKSSRNIASDCKYKEAYEKAKGKQVGFLSLQDDPKLVHYMNVAKIQS DREYKKGYEASKTKYHTPLDMVSVTAAKKSQEVATNANYRQSYHHYTLLPDALNVEHS RNAMQIQSDNLYKSDFTNWMKGIGWVPIESLEVEKAKKAGEILSEKKYRQHPEKLKFT YAMDTMEQALNKSNKLNMDKRLYTEKWNKDKTTIHVMPDTPDILLSRVNQITMSDKLY KAGWEEEKKKGYDLRPDAIAIKAARASRDIASDYKYKKAYEQAKGKHIGFRSLEDDPK LVHFMQVAKMQSDREYKKGYEKSKTSFHTPVDMLSVVAAKKSQEVATNANYRNVIHTY NMLPDAMSFELAKNMMQIQSDNQYKADYADFMKGIGWLPLGSLEAEKNKKAMEIISEK KYRQHPDTLKYSTLMDSMNMVLAQNNAKIMNEHLYKQAWEADKTKVHIMPDIPQIILA KANAINISDKLYKLSLEESKKKGYDLRPDAIPIKAAKASRDIASDYKYKYNYEKGKGK MVGFRSLEDDPKLVHSMQVAKMQSDREYKKNYENTKTSYHTPADMLSVTAAKDAQANI TNTNYKHLIHKYILLPDAMNIELTRNMNRIQSDNEYKQDYNEWYKGLGWSPAGSLEVE KAKKATEYASDQKYRQHPSNFQFKKLTDSMDMVLAKQNAHTMNKHLYTIDWNKDKTKI HVMPDTPDILQAKQNQTLYSQKLYKLGWEEALKKGYDLPVDAISVQLAKASRDIASDY KYKQGYRKQLGHHVGFRSLQDDPKLVLSMNVAKMQSEREYKKDFEKWKTKFSSPVDML GVVLAKKCQELVSDVDYKNYLHQWTCLPDQNDVVQAKKVYELQSENLYKSDLEWLRGI GWSPLGSLEAEKNKRASEIISEKKYRQPPDRNKFTSIPDAMDIVLAKTNAKNRSDRLY REAWDKDKTQIHIMPDTPDIVLAKANLINTSDKLYRMGYEELKRKGYDLPVDAIPIKA AKASREIASEYKYKEGFRKQLGHHIGARNIEDDPKMMWSMHVAKIQSDREYKKDFEKW KTKFSSPVDMLGVVLAYKCQTLVSDVDYKNYLHQWTCLPDQSDVIHARQAYDLQSDNL YKSDLQWLKGIGWMTSGSLEDEKNKRATQILSDHVYRQHPDQFKFSSLMDSIPMVLAK NNAITMNHRLYTEAWDKDKTTVHIMPDTPEVLLAKQNKVNYSEKLYKLGLEEAKRKGY DMRVDAIPIKAAKASRDIASEFKYKEGYRKQLGHHIGARAIRDDPKMMWSMHVAKIQS DREYKKDFEKWKTKFSSPVDMLGVVLAKKCQTLVSDVDYKNYLHQWTCLPDQSDVIHA RQAYDLQSDNMYKSDLQWMRGIGWVSIGSLDVEKCKRATEILSDKIYRQPPDRFKFTS VTDSLEQVLAKNNALNMNKRLYTEAWDKDKTQIHIMPDTPEIMLARQNKINYSETLYK LANEEAKKKGYDLRSDAIPIVAAKASRDVISDYKYKDGYRKQLGHHIGARNIEDDPKM MWSMHVAKIQSDREYKKDFEKWKTKFSSPVDMLGVVLAKKCQTLVSDVDYKNYLHEWT CLPDQNDVIHARQAYDLQSDNIYKSDLQWLRGIGWVPIGSMDVVKCKRAAEILSDNIY RQPPDKLKFTSVTDSLEQVLAKNNALNMNKRLYTEAWDKDKTQVHIMPDTPEIMLARQ NKINYSESLYRQAMEEAKKEGYDLRSDAIPIVAAKASRDIASDYKYKEAYRKQLGHHI GARAVHDDPKIMWSLHIAKVQSDREYKKDFEKYKTRYSSPVDMLGIVLAKKCQTLVSD VDYKHPLHECICLPDQNDIIHARKAYDLQSDNLYKSDLEWMKGIGWVPIDSLEVVRAK RAGELLSDTIYRQRPETLKFTSITDTPEQVLAKNNALNMNKRLYTEAWDNDKKTIHVM PDTPEIMLAKLNRINYSDKLYKLALEESKKEGYDLRLDAIPIQAAKASRDIASDYKYK EGYRKQLGHHIGARNIKDDPKMMWSIHVAKIQSDREYKKEFEKWKTKFSSPVDMLGVV LAKKCQILVSDIDYKHPLHEWTCLPDQNDVIQARKAYDLQSDAIYKSDLEWLRGIGWV PIGSVEVEKVKRAGEILSDRKYRQPADQLKFTCITDTPEIVLAKNNALTMSKHLYTEA WDADKTSIHVMPDTPDILLAKSNSANISQKLYTKGWDESKMKDYDLRADAISIKSAKA SRDIASDYKYKEAYEKQKGHHIGAQSIEDDPKIMCAIHAEKIQSEREYKKEFQKWKTK FSSPVDMLSILLAKKCQTLVTDIYYRNYLHEWTCMPDQNDIIQAKKAYDLQSDALYKA DLEWLRGIGWMPQGSPEVLRVKNAQNIFCDSVYRTPVVNLKYTSIVDTPEVVLAKSNA ENISIPKYREVWDKDKTSIHIMPDTPEINLARANALNVSNKLYREGWDEMKAGCDVRL DAIPIQAAKASREIASDYKYKLDHEKQKGHYVGTLTARDDNKIRWALIADKLQNEREY RLDWAKWKAKIQSPVDMLSILHSKNSQALVSDMDYRNYLHQWTCMPDQNDVIQAKKAY ELQSDNVYKADLEWLRGIGWMPNDSVSVNHAKHAADIFSEKKYRTKIETLNFTPVDDR VDYVTAKQSGEILDDIKYRKDWNATKSKYTLTETPLLHTAQEAARILDQYLYKEGWER QKATGYILPPDAVPFVHAHHCNDVQSELKYKAEHVKQKGHYVGVPTMRDDPKLVWFEH AGQIQNERLYKEDYHKTKAKINIPADMVSVLAAKQGQTLVSDIDYRNYLHQWMCHPDQ NDVIQARKAYDLQSDNVYRADLEWLRGIGWIPLDSVDHVRVTKNQEMMSQIKYKKNAL ENYPNFTSVVDPPEIVLAKINSVNQSDVKYKETFNKAKGKYTFSPDTPHISHSKDMGK LYSTILYKGAWEGTKAYGYTLDERYIPIVGAKHADLVNSELKYKETYEKQKGHYLAGK VIGEFPGVVHCLDFQKMRSALNYRKHYEDTKANVHIPNDMMNHVLAKRCQYILSDLEY RHYFHQWTSLLEEPNVIRVRNAQEILSDNVYKDDLNWLKGIGCYVWDTPQILHAKKSY DLQSQLQYTAAGKENLQNYNLVTDTPLYVTAVQSGINASEVKYKENYHQIKDKYTTVL ETVDYDRTRNLKNLYSSNLYKEAWDRVKATSYILPSSTLSLTHAKNQKHLASHIKYRE EYEKFKALYTLPRSVDDDPNTARCLRVGKLNIDRLYRSVYEKNKMKIHIVPDMVEMVT AKDSQKKVSEIDYRLRLHEWICHPDLQVNDHVRKVTDQISDIVYKDDLNWLKGIGCYV WDTPEILHAKHAYDLRDDIKYKAHMLKTRNDYKLVTDTPVYVQAVKSGKQLSDAVYHY DYVHSVRGKVAPTTKTVDLDRALHAYKLQSSNLYKTSLRTLPTGYRLPGDTPHFKHIK DTRYMSSYFKYKEAYEHTKAYGYTLGPKDVPFVHVRRVNNVTSERLYRELYHKLKDKI HTTPDPPEIRQVKKTQEAVSELIYKSDFFKMQGHMISLPYTPQVIHCRYVGDITSDIK YKEDLQVLKGFGCFLYDTPDMVRSRHLRKLWSNYLYTDKAREMRDKYKVVLDTPEYRK VQELKTHLSELVYRAAGKKQKSIFTSVPDTPDLLRAKRGQKLQSQYLYVELATKERPH HHAGNQTTALKHAKDVKDMVSEKKYKIQYEKMKDKYTPVPDTPILIRAKRAYWNASDL RYKETFQKTKGKYHTVKDALDIVYHRKVTDDISKIKYKENYMSQLGIWRSIPDRPEHF HHRAVTDTVSDVKYKEDLTWLKGIGCYAYDTPDFTLAEKNKTLYSKYKYKEVFERTKS DFKYVADSPINRHFKYATQLMNEKKYRADYEQRKDKYHLVVDEPRHLLAKTRSDQISQ IKYRKNYEKSKDKFTSIVDTPEHLRTTKVNKQISDILYKLEYNKAKPRGYTTIHDTPM LLHVRKVKDEVSDLKYKEVYQRNKSNCTIEPDAVHIKAAKDAYKVNTNLDYKKQYEAN KAHWKWTPDRPDFLQAAKSSLQQSDFEYKLDREFLKGCKLSVTDDKNTVLALRNTLIE SDLKYKEKHVKERGTCHAVPDTPQILLAKTVSNLVSENKYKDHVKKHLAQGSYTTLPE TRDTVHVKEVTKHVSDTNYKKKFVKEKGKSNYSIMLEPPEVKHAMEVAKKQSDVAYRK DAKENLHYTTVADRPDIKKATQAAKQASEVEYRAKHRKEGSHGLSMLGRPDIEMAKKA AKLSSQVKYRENFDKEKGKTPKYNPKDSQLYKVMKDANNLASEVKYKADLKKLHKPVT DMKESLIMNHVLNTSQLASSYQYKKKYEKSKGHYHTIPDNLEQLHLKEATELQSIVKY KEKYEKERGKPMLDFETPTYITAKESQQMQSGKEYRKDYEESIKGRNLTGLEVTPALL HVKYATKIASEKEYRKDLEESIRGKGLTEMEDTPDMLRAKNATQILNEKEYKRDLELE VKGRGLNAMANETPDFMRARNATDIASQIKYKQSAEMEKANFTSVVDTPEIIHAQQVK NLSSQKKYKEDAEKSMSYYETVLDTPEIQRVRENQKNFSLLQYQCDLKNSKGKITVVQ DTPEILRVKENQKNFSSVLYKEDVSPGTAIGKTPEMMRVKQTQDHISSVKYKEAIGQG TPIPDLPEVKRVKETQKHISSVMYKENLGTGIPTTVTPEIERVKRNQENFSSVLYKEN LGKGIPTPITPEMERVKRNQENFSSVLYKENMGKGTPLPVTPEMERVKHNQENISSVL YKENVGKATATPVTPEMQRVKRNQENISSVLYKENLGKATPTPFTPEMERVKRNQENF SSVLYKENMRKATPTPVTPEMERAKRNQENISSVLYSDSFRKQIQGKAAYVLDTPEMR RVRETQRHISTVKYHEDFEKHKGCFTPVVTDPITERVKKNMQDFSDINYRGIQRKVVE MEQKRNDQDQETITGLRVWRTNPGSVFDYDPAEDNIQSRSLHMINVQAQRRSREQSRS ASALSVSGGEEKSEHSEAPDHHLSTYSDGGVFAVSTAYKHAKTTELPQQRSSSVATQQ TTVSSIPSHPSTAGKIFRAMYDYMAADADEVSFKDGDAIINVQAIDEGWMYGTVQRTG RTGMLPANYVEAI" BASE COUNT 7071 a 4578 c 4754 g 4478 t ORIGIN 1 ccccaccttt tgagcaagtt cagcctggtt aagtccaagc tggtgataaa actacaaagc 61 agaatacgaa gaagacagag gcaaaggctt cttccctcag accataactc aagaatatgg 121 gggtctcgca gtaatttatg ctctttgctt ttgtcttttc atagttttcc ttgtatagtt 181 tgtcacttag ggcatctcct gctgccttca gctgcctaag ctgtgggttc tctgaagcag 241 gaagcacatt ataatctgct tttcctttat tcttttcata gtcttctttg tattttgctg 301 ctgaggaaat ttatttggta gattgaaggt ttgaacgaga gctacagaaa cgaaagaaaa 361 agtctgtata agccaatggt gttcgggaag aaaataaccc cattgccttg agtttgtagg 421 tgccactact actctgaaaa atggcagatg acgaagacta tgaggaggtg gtggagtact 481 acacagaaga agtggtttac gaagaggtgc cgggagagac aataacaaaa atttatgaga 541 ctacgacaac aaggacatct gactatgagc aatcagaaac ttccaaacca gctctggcac 601 agccagcact ggcacagcca gcatcagcaa agccggtgga gaggaggaag gtcatccgga 661 agaaagtgga tccttcaaag ttcatgaccc cctacattgc acacagtcag aaaatgcagg 721 atctttttag cccaaataaa tacaaggaga agtttgagaa aacaaaagga cagccatacg 781 ccagcacaac agatactcca gaacttcgca gaatcaaaaa agtacaagat caactcagtg 841 aggttaagta tcgaatggat ggtgatgttg ctaagactat atgtcacgta gatgaaaaag 901 caaaggatat tgaacatgca aagaaagtgt cgcagcaagt cagtaaggtt ttatacaagc 961 agaactggga agacaccaag gataagtacc tgcttcctcc tgatgcccct gaacttgtcc 1021 aggccgttaa gaacaccgcc atgttcagca agaaactgta cactgaagac tgggaagcag 1081 acaaaagttt gttttacccc tataatgata gcccggaact gaggagagtt gcccaggccc 1141 agaaagctct cagtgatgtt gcctacaaaa aaggtctcgc tgaacagcaa gctcaattca 1201 cgcctctggc cgatcctcca gatatagaat ttgccaagaa agtaaccaat caagtgagca 1261 agcaaaaata caaagaagac tatgaaaata aaatcaaagg caaatggagt gagacacctt 1321 gctttgaagt tgcaaatgcc agaatgaatg ctgataacat tagcacaagg aaataccagg 1381 aagattttga aaacatgaaa gaccagatct acttcatgca gaccgaaaca ccagagtata 1441 aaatgaataa aaaagctggt gtggcagcta gcaaggtaaa atacaaagaa gactatgaaa 1501 agaataaagg aaaagcagat tataatgtgc ttcctgcttc agagaaccca cagcttaggc 1561 agctgaaggc agcaggagat gccctaagtg acaaactata caaggaaaac tatgaaaaga 1621 caaaagcaaa gagcataaat tactgcgaga cccccaaatt caagctcgat actgttctgc 1681 agaacttcag tagtgataaa aaatataaag attcctactt aaaagatatt ttgggacatt 1741 atgtaggcag cttcgaggat ccataccatt cacactgcat gaaagtcaca gctcaaaaca 1801 gtgataaaaa ctacaaagca gaatacgaag aagacagagg caaaggcttc ttccctcaga 1861 ccataactca agaatatgaa gcaattaaga aactagatca gtgtaaagac cacacctaca 1921 aagtccatcc agataagaca aaattcaccc aagttacaga ctctcctgtt ctgctacaag 1981 cccaagtcaa ttccaaacaa ctgagtgact taaattacaa agcaaaacat gaaagtgaaa 2041 agttcaagtg ccatatcccc cctgatactc ctgcttttat ccagcacaaa gtcaatgcct 2101 ataacttgag tgataatctt tataagcaag actgggagaa gagcaaagcc aaaaagtttg 2161 acattaaagt ggatgccatt cccctgctgg cagccaaagc caacaccaag aacaccagcg 2221 atgtgatgta caagaaagac tatgaaaaaa acaaagggaa aatgattgga gtcctcagca 2281 ttaatgacga tcccaagatg ctgcactcct tgaaggtggc caaaaaccag agtgatagat 2341 tatacaagga aaactatgag aagacaaagg caaagagtat gaattactgt gagaccccaa 2401 aatatcaact tgatactcag ctgaagaact tcagtgaggc tagatataaa gacttatatg 2461 taaaggatgt tttgggacat tatgtaggca gcatggagga cccatatcac acacactgca 2521 tgaaagttgc agctcaaaac agtgataaaa gttacaaagc agaatatgaa gaagataaag 2581 gaaaatgcta tttccctcag acaataacac aagaatatga cgcaatcaag aagctggacc 2641 agtgtaaaga tcatacctac aaagttcatc cagataagac caaattcacg gcagtcactg 2701 attctcctgt actgttgcaa gcccagctca acacgaaaca gcttagtgat ctgaattaca 2761 aagcaaaaca tgaaggtgag aggttcaagt gccatatacc agcagatgct ccacagttta 2821 tccaacacag agtcaatgcc tataatctga gtgataatgt ttataagcaa gactgggaga 2881 agagcaaagc caagaagttt gacattaaag tggacgccat tcccctgttg gcagccaaag 2941 ccaacaccaa gaacaccagc gatgtgatgt acaagaaaga ctatgaaaag agcaaaggga 3001 aaatgattgg agccctcagc attaatgacg atccaaagat gctgcactcc ttgaagacag 3061 ccaaaaacca gagtgatcgc gaatatcgaa aagattatga aaagtcaaaa actatctaca 3121 cggcacctct tgatatgctc caagtcactc aagctaagaa atctcaggca attgccagcg 3181 acgttgatta taagcacatc ttacacagtt acagctaccc ccctgatagc atcaatgtgg 3241 accttgccaa gaaggcatat gcgctgcaga gcgatgttga atacaaagct gactacaata 3301 gctggatgaa aggttgtggc tgggtgcctt ttgggtcctt agaaatggaa aaggcaaagc 3361 gagcttcaga catcctcaat gagaaaaaat atcgccaaca tccagacacc ctcaagttta 3421 cctcgattga agatgctcca attacagtac agtctaaaat taaccaggcc cagaggagtg 3481 atatcgctta caaagccaaa ggagaggaaa ttattcacaa ttacaacctg ccaccagacc 3541 tgccccagtt catccaggct aaagttaatg cctacaatat cagtgagaat atgtacaaag 3601 cagacttgaa agacttgagc aagaagggat atgacctgag aactgatgcg attcccatca 3661 gagctgccaa agctgccagg caggcggcga gtgacgttca gtacaaaaaa gactatgaaa 3721 aggctaaagg gaaaatggtt ggcttccaaa gtcttcaaga tgaccctaaa ctggttcatt 3781 atatgaacgt ggccaagata caatcagatc gggagtataa aaaagactat gagaagacaa 3841 agtccaaata caacacgccc catgatatgt tcaatgtcgt ggcggctaag aaagcccagg 3901 atgtggtcag caatgtcaac tataagcatt ctctccatca ttacacctac ttgcctgacg 3961 ccatggacct ggagctgtct aagaacatga tgcagataca gagtgataac gtctacaagg 4021 aagactacaa caactggatg aaaggcattg gctggattcc tattggcagt ctcgacgtcg 4081 aaaaagttaa aaaggccggt gatgctctga atgaaaagaa gtacaggcaa catccagaca 4141 ccctcaaatt taccagcatt gtggactccc cagttatggt ccaggcaaaa cagaacacga 4201 agcaagtcag tgatatctta tacaaggcta aaggagaaga tgtgaaacat aaatacacca 4261 tgagtcctga tcttcctcag tttctccagg ccaagtgcaa tgcttacagt ataagtgacg 4321 tctgttataa acgggattgg catgacttaa tacgcaaggg caacaatgtg ctgggcgatg 4381 ctattcccat cactgcagcc aaggcatcga gaaacattgc cagtgattat aaatacaagg 4441 aagcttatga gaagtcaaag ggaaagcatg tgggtttcag aagcctccag gatgatccca 4501 agctggtcca ctatatgaat gtggcaaagc tgcagtctga tcgtgaatac aagaagaact 4561 atgagaacac caaaaccagc taccataccc ctggggacat ggttacgatc acagctgcaa 4621 agatggccca ggatgtcgct accaatgtca actacaaaca gccattgcat cattacacat 4681 acctacctga cgccatgagt cttgagcata cgaggaatgt caatcaaatt cagagtgata 4741 atgtgtataa agacgagtat aacagcttct tgaagggcat cggatggatc cctattggtt 4801 ccctggaggt ggagaaggtc aagaaagcag gcgatgcatt aaatgagagg aagtatcgac 4861 agcacccaga taccgtcaag ttcacaagtg tgcctgattc catgggcatg atgttggctc 4921 agcataacac aaagcagcta agtgatttga actacaaggt agagggagag aaactgaagc 4981 acaagtatac tattgaccct gaattgcctc agtttattca agccaaagtc aacgccctca 5041 acatgagtga tgctcattat aaagcagatt ggaagaaaac cattcgcaag ggctatgatt 5101 tgagaccaga tgccatccca attgttgctg caaaaagttc aaggaatatt gctagtgatt 5161 gcaaatataa ggaggcctac gagaaagcca aaggcaagca agttggattt ctcagtcttc 5221 aggatgatcc taaactggtt cactacatga atgtggccaa aatccagtct gatcgtgagt 5281 acaaaaaggg ctatgaagcc agcaagacca agtaccacac acctctggat atggtcagtg 5341 tgacagctgc aaagaaatct caggaggttg ccaccaacgc caactacaga cagtcatacc 5401 accactacac tctcctgccc gatgccttga atgtggagca ctccaggaat gccatgcaga 5461 ttcagagtga taatctgtac aaatctgact tcaccaattg gatgaaaggg atcggctggg 5521 tgcccataga gtccctggag gtggagaagg caaagaaagc aggagagatt cttagtgaga 5581 agaagtatcg ccagcacccc gagaagctga agttcactta cgccatggac acaatggaac 5641 aggcacttaa caagagtaac aaactgaaca tggacaagag gctctacact gaaaaatgga 5701 acaaggacaa gaccaccatt catgtcatgc ctgacacacc ggatatttta ctctccagag 5761 taaaccaaat caccatgagt gataaactgt acaaagctgg ctgggaagag gaaaagaaga 5821 aaggatatga cctgaggcct gatgccattg caataaaggc tgcaagagcc tctagagaca 5881 ttgccagtga ttacaaatac aagaaagcct atgaacaagc caaagggaaa cacattggct 5941 tccggagcct ggaagatgac cccaagctgg tgcacttcat gcaagtggcc aagatgcagt 6001 cagaccggga atacaagaag ggatatgaga aatccaagac ctccttccac accccggtgg 6061 acatgctcag tgtggtggca gccaagaagt ctcaggaagt ggccaccaat gccaactaca 6121 ggaacgtgat ccatacctac aacatgcttc ctgatgccat gagctttgaa ttggccaaaa 6181 atatgatgca gattcaaagt gataatcagt acaaggctga ctatgctgac ttcatgaagg 6241 gcattggatg gctccctctg ggctccctgg aagcagagaa aaacaagaaa gccatggaga 6301 ttattagtga aaagaagtac cgccagcacc cagacacttt gaagtattcc acactcatgg 6361 actcgatgaa catggttttg gcccagaata atgcaaaaat tatgaacgaa catctctaca 6421 aacaagcatg ggaggctgac aaaaccaaag tccacatcat gcctgatatc ccccagatta 6481 ttttggcaaa ggcaaatgca attaatataa gtgataaact ctacaaactt tccttggaag 6541 agtctaaaaa gaaaggctat gatctcagac ctgatgcaat tcctatcaaa gctgccaagg 6601 cttccagaga tattgcaagt gattataaat acaagtacaa ttatgaaaaa gggaagggga 6661 aaatggttgg tttccgcagt ctcgaggatg atcccaaatt agtccattcc atgcaagtgg 6721 ctaagatgca atctgatcgg gagtacaaga aaaactatga gaacacaaag accagctacc 6781 acacccctgc cgacatgctc agtgtcacgg ctgcaaagga tgcccaagcc aacatcacca 6841 acactaacta caagcacctg attcacaagt acatcctcct tccagatgca atgaacattg 6901 agctgaccag gaatatgaat cgcatacaga gtgataatga atataagcaa gattacaatg 6961 aatggtacaa agggcttggc tggagtccag caggttctct ggaagtggag aaggccaaga 7021 aagcaactga atatgccagt gatcagaaat accgccagca cccgagcaac ttccagttta 7081 agaagctgac tgattccatg gacatggtgc ttgccaagca gaatgcacat accatgaaca 7141 agcatttata caccattgat tggaataaag ataagaccaa gattcatgtg atgcctgata 7201 caccagatat tttacaagcc aagcagaatc aaacactgta tagtcagaaa ctctataaac 7261 ttggatggga agaagctttg aagaaaggct atgatctccc agttgatgca atttctgtac 7321 agctagctaa agcttcaaga gacattgcta gtgattataa atacaaacaa ggctaccgaa 7381 agcaacttgg ccaccatgtt ggattccgga gtctgcaaga tgacccaaaa cttgtgttgt 7441 ccatgaatgt agccaaaatg cagagtgaaa gagaatacaa gaaggacttt gagaagtgga 7501 aaactaagtt ctccagccca gtggacatgt tgggagtggt actggccaag aagtgtcagg 7561 agttggttag tgacgtggac tacaagaact acctgcatca gtggacatgt ctgcctgatc 7621 agaacgatgt tgtgcaagct aagaaagttt atgaactgca aagtgagaat ctatataaat 7681 ctgaccttga gtggctgaga ggcataggat ggagtccctt gggttcttta gaggcagaaa 7741 agaacaagcg ggcttcggaa atcatcagtg agaagaaata tcgtcagcct ccagacagaa 7801 acaagttcac cagcattcct gatgccatgg atatagttct ggcaaagaca aatgccaaaa 7861 ataggagtga tagactttat agagaagctt gggacaaaga caagactcag atccacatca 7921 tgcctgatac acctgacatt gttctggcta aagcaaactt aatcaacaca agtgataaac 7981 tctaccgaat gggttatgag gagctgaaga gaaaaggtta cgatcttcct gttgatgcca 8041 taccaatcaa agcagcaaaa gcctcccggg aaattgccag tgaatacaag tacaaggaag 8101 gctttcgcaa gcagctcggc caccacattg gtgcccggaa cattgaagat gaccccaaga 8161 tgatgtggtc catgcatgtg gccaagatcc agagtgacag ggagtacaag aaggactttg 8221 agaagtggaa gaccaagttc agcagcccag tggacatgct gggggtggtg ttggcctata 8281 agtgccagac cttagtcagc gacgtggact acaagaacta cctgcaccag tggacatgcc 8341 tgcccgacca gagcgatgtc atccatgctc ggcaggccta tgacctccag agcgataatt 8401 tgtacaagtc agaccttcag tggctaaaag gcattggctg gatgactagt ggttctctcg 8461 aggatgagaa aaataaacga gccacccaga ttttgagtga ccatgtttac cgtcagcacc 8521 cagatcaatt taagttttcc agccttatgg attccatacc aatggttttg gcaaaaaaca 8581 atgctattac catgaatcat cgcctctata cagaagcttg ggataaagat aaaaccactg 8641 tccacattat gccagatacc cctgaagttt tattagctaa acaaaacaaa gtaaattaca 8701 gtgagaaatt gtataagctt ggcctagaag aagccaagag gaaaggttat gacatgcggg 8761 tagatgccat tcctatcaag gcagccaagg cctccagaga tattgcaagt gaattcaagt 8821 acaaagaagg ctatcgtaag cagctcggcc accacattgg tgcccgagct atacgtgatg 8881 accccaagat gatgtggtcc atgcacgtgg ccaagatcca gagtgacagg gagtacaaga 8941 aggactttga gaagtggaag accaagttca gcagcccagt ggacatgctg ggggtggtgc 9001 tggccaagaa gtgccagacc ttagtcagcg atgtggacta caagaactac ctgcaccagt 9061 ggacatgcct gcccgaccag agcgacgtca tccatgctcg gcaggcctat gacctccaga 9121 gcgataatat gtacaagtct gatctccagt ggatgagagg cattggctgg gtgtccattg 9181 gctctttgga tgtggaaaaa tgcaaaaggg caactgaaat tttgagtgat aaaatctatc 9241 gccagcctcc agacagattc aaatttacca gtgtgactga ctctctggaa caagtgctgg 9301 ccaagaacaa tgctctcaac atgaataagc gtttatacac agaggcctgg gacaaagaca 9361 agactcaaat tcacataatg cctgatacac cagagattat gttggcaagg cagaacaaaa 9421 tcaactacag tgagactcta tacaaacttg ccaatgaaga agcaaaaaag aaaggctacg 9481 acttgcgaag tgacgccatc cccatcgtgg ctgccaaggc ctccagggac gttatcagtg 9541 attacaaata caaagatggt taccgcaagc agctcggcca ccacattgga gcccggaaca 9601 ttgaagatga ccccaagatg atgtggtcca tgcatgtggc caagatccag agtgacaggg 9661 agtataagaa ggactttgag aagtggaaga ccaagttcag cagcccagtg gacatgctgg 9721 gagtggtgtt agccaagaag tgccagacct tagtcagcga tgtggactac aagaactacc 9781 tgcacgagtg gacgtgcctg cccgaccaga atgatgtcat ccatgctcgg caggcctatg 9841 acctccagag cgataacatt tacaaatctg atctccagtg gctgagaggc attggctggg 9901 tccccattgg gtctatggat gtggtcaagt gcaagagagc tgctgaaata ctgagtgata 9961 acatctaccg ccagcctccg gacaagctga aatttaccag tgtgactgac tctctagagc 10021 aggtgctggc caagaacaat gctctcaata tgaacaagcg cttatacaca gaagcctggg 10081 acaaagacaa gacccaagtc catattatgc ctgatacacc tgaaatcatg ttggcaagac 10141 aaaataaaat aaattatagt gagagcctct atcgtcaggc catggaagaa gccaagaaag 10201 aaggctatga cttgagaagt gatgccattc ccattgtggc tgccaaggcc tctcgggata 10261 ttgccagtga ttacaaatac aaagaagcat atcgtaagca gttgggtcac cacattggcg 10321 cccgagcagt acacgatgac cccaagataa tgtggtccct ccacattgcc aaagtgcaga 10381 gtgaccgtga gtacaagaaa gattttgaga aatacaagac aaggtacagc agcccagtgg 10441 acatgcttgg tatcgttttg gccaagaagt gtcagacctt ggtcagcgat gtggactata 10501 aacatcctct gcatgaatgc atctgcctgc ccgaccagaa tgacatcatt catgcacgga 10561 aagcctatga cctccagagt gacaatttgt ataagtcaga ccttgaatgg atgaaaggca 10621 ttggctgggt tccgattgat tccttggaag ttgttagggc caagagagct ggagaattac 10681 ttagtgatac tatctaccgt cagcgtccag aaacgctgaa atttaccagt ataacggaca 10741 ctccggagca ggtgctggca aaaaacaatg ctttaaacat gaataagcgc ttatatactg 10801 aagcctggga caatgacaag aaaactattc atgtcatgcc tgatacacca gaaatcatgt 10861 tagccaaact caaccgaata aactacagtg ataaactcta taaacttgct ttggaagagt 10921 ccaagaagga aggctatgac ttgcgtctgg atgccattcc aatccaagca gccaaggctt 10981 caagagatat tgctagtgat tacaagtaca aggaaggcta ccgcaaacag cttggccacc 11041 atattggggc ccggaacatt aaggatgacc cgaagatgat gtggtccatc catgtggcca 11101 agatccagag tgacagggag tacaagaagg agtttgagaa gtggaagacc aagttcagca 11161 gcccagtgga catgctgggg gtggtgctgg ccaagaagtg tcagatcctt gtaagcgaca 11221 tagactacaa gcatcccctg catgaatgga cctgcctgcc tgatcagaat gacgtcattc 11281 aggctcggaa ggcctatgac ctgcagagtg atgctattta caaatctgat cttgagtggc 11341 tgagaggcat aggatgggtt cccattggct ctgtagaggt cgagaaagtg aagagagctg 11401 gagaaatcct gagtgacagg aagtatcgcc agcctgcaga ccagctcaaa ttcacatgca 11461 ttaccgacac tccggaaatt gtcctagcaa agaataatgc cctgacaatg agcaagcatt 11521 tatacacaga agcttgggat gctgacaaaa cctccatcca cgtgatgcca gacaccccag 11581 atatcctgct ggccaagagt aattctgcca atatcagcca aaaactttac accaagggat 11641 gggatgaatc aaagatgaag gactatgatc tgagagcaga tgctatttcc atcaaaagtg 11701 ccaaggcctc cagggacatc gccagtgact acaaatacaa ggaagcctat gagaaacaga 11761 aaggccacca cattggagcc cagagcattg aagatgatcc caagattatg tgtgccatac 11821 atgcagaaaa aattcaaagt gaaagggagt acaagaagga attccaaaag tggaaaacca 11881 agttctctag cccagtggac atgttaagca tcttgctggc caagaaatgt cagactttgg 11941 tcactgacat ttattatcgc aattacctgc atgaatggac atgcatgccg gatcaaaacg 12001 acattatcca agcaaaaaag gcctatgacc tgcagagtga tgccctctac aaggctgact 12061 tggagtggtt gcgtggcatt ggctggatgc cccaagggtc tcctgaagtg ttgagagtca 12121 aaaacgccca gaatatcttt tgtgacagtg tctatcggac gcctgtggtg aaccttaagt 12181 acacaagcat tgttgacaca cctgaagtgg tccttgctaa atcaaatgct gaaaatatta 12241 gtattccaaa gtacagagag gtttgggaca aggataaaac ttcaatacac ataatgccag 12301 atactccaga aattaatctc gctagagcaa atgctcttaa tgtgagcaat aaactttacc 12361 gtgagggctg ggatgaaatg aaggcgggct gtgatgtccg gctggatgcc atccccatcc 12421 aggctgccaa ggcctccagg gagattgcca gtgactataa atataagctt gaccatgaga 12481 agcagaaggg acactacgtg ggcaccctca cagccaggga tgacaacaag atccgctggg 12541 ccctcatagc tgacaagctc cagaatgaac gagagtaccg gctggactgg gccaaatgga 12601 aggccaagat ccagagccct gtggacatgc tttccatcct gcactctaaa aattcccagg 12661 ctctggtcag tgacatggat taccgcaatt acctgcacca gtggacctgc atgcccgacc 12721 agaacgatgt gattcaggcc aagaaggcct acgaactgca gagcgataat gtttacaagg 12781 ctgacttgga atggttgcgt ggaattgggt ggatgccaaa tgactccgtg tccgtcaatc 12841 atgccaaaca tgccgcggac atcttcagtg agaaaaaata tcgcacaaaa atagaaactc 12901 tcaactttac gcctgtggat gacagagttg attatgtgac agcgaaacaa agtggcgaga 12961 tcctcgatga tattaaatac cggaaagact ggaatgccac caaatcaaag tacaccctca 13021 cagaaacccc cctgctgcac actgcccagg aggctgctag gatactggac cagtatctct 13081 acaaggaagg ctgggagaga caaaaagcca caggttacat tttgcctcca gatgctgtgc 13141 catttgttca tgcccatcac tgcaatgacg ttcagagtga gctgaaatac aaagctgaac 13201 atgtgaagca aaaaggtcat tatgttggtg tcccgacgat gagagatgat cctaagctgg 13261 tttggtttga gcatgcaggc cagattcaga atgagagact atacaaagag gactatcaca 13321 aaacaaaggc caaaatcaat atacctgctg atatggtgtc agtcttggcc gccaagcagg 13381 ggcagaccct tgtcagtgat attgattatc gtaattactt gcaccaatgg atgtgtcatc 13441 ctgaccagaa cgatgttatt caggcaagaa aggcctatga cctacagagt gataatgtct 13501 acagagctga cctggagtgg ctccgaggca ttggctggat cccactggat tctgtggacc 13561 atgtaagggt tactaagaac caggaaatga tgagtcagat caaatataag aaaaatgccc 13621 ttgaaaacta tcctaacttt acaagtgtgg tggatcctcc agagattgtt ttagccaaga 13681 ttaattctgt caatcaaagt gatgtaaaat ataaagaaac atttaataaa gcaaagggca 13741 aatatacgtt ttcaccagat acaccacata tctcccactc caaagacatg ggaaaactct 13801 acagtactat actgtataaa ggggcgtggg agggcaccaa ggcctatggc tacaccctgg 13861 atgagcgcta cattcccatt gttggagcca agcatgctga tctggtgaac agtgagctta 13921 aatacaaaga gacatatgag aagcagaaag gtcactacct ggctggaaaa gtgatcggtg 13981 aattccctgg tgtggttcac tgtctggatt tccaaaagat gaggagtgcg ttgaactaca 14041 gaaaacatta tgaggatacc aaagcaaatg ttcatatccc caatgacatg atgaatcacg 14101 tgctggctaa aaggtgccag tacatcctca gtgacctgga gtatcgacac tatttccacc 14161 agtggacgtc tcttctggaa gaacccaatg ttatacgcgt ccgaaacgcc caggagatct 14221 tgagtgataa tgtgtataaa gatgacctga attggttgaa aggcattggt tgctacgttt 14281 gggatacacc ccaaatcctc catgccaaga aatcatacga ccttcagagt cagctacaat 14341 atacagcagc aggtaaagaa aatctacaaa actataatct ggtcacagac acgcccctct 14401 atgtgactgc tgttcagagt ggcattaatg ccagtgaggt aaaatataaa gaaaattatc 14461 atcagattaa ggacaaatac acaacagttc tagaaacagt ggattatgac agaaccagaa 14521 acctgaagaa tctttacagc agtaacctgt acaaggaggc ctgggataga gtgaaagcca 14581 ccagctacat cctgccttcc agcaccttgt ccctgacaca cgccaagaac cagaagcatc 14641 tggccagcca tatcaaatat cgggaagaat atgaaaagtt caaagctctt tatacgttac 14701 caagaagtgt tgacgatgat ccgaacacag cacggtgcct ccgagttggc aagcttaaca 14761 tcgatcgcct gtacagatca gtttatgaaa agaacaagat gaaaatccac atcgtgcccg 14821 acatggtaga gatggttact gccaaggatt cccagaagaa agtcagtgag attgattacc 14881 gcctgcgcct ccacgaatgg atttgccacc ccgacttgca agtcaatgat cacgtcagga 14941 aagtcacaga tcagatcagc gatattgtat acaaggatga cctcaactgg ctgaaaggca 15001 ttggttgcta cgtctgggac actcctgaaa tcctccatgc caagcatgct tatgatctac 15061 gtgatgatat caagtataaa gctcacatgt tgaaaacaag gaatgactac aagcttgtca 15121 cagatacacc agtctacgtg caggctgtca aaagtgggaa acagctaagt gacgctgtct 15181 accactatga ctatgtgcac agtgtcagag gcaaagtggc tccaactacc aaaaccgtgg 15241 atctggaccg ggcccttcat gcatacaagc tccagagttc gaatctatac aaaaccagcc 15301 tgcgcaccct gcccactgga tatagacttc caggtgacac tcctcacttc aaacacatca 15361 aggacacccg ttacatgagc agttatttca agtacaaaga agcctatgaa cacaccaagg 15421 catatgggta tacacttggc cccaaagatg ttccatttgt ccacgtccgg agagtcaaca 15481 atgttaccag cgagagactg tatcgggaat tgtaccacaa actgaaagac aagatccata 15541 caactcccga tccccctgag atccgccaag tcaagaagac acaagaggct gtcagtgagt 15601 tgatctacaa atcagacttc ttcaagatgc agggccacat gatctctctg ccatacacac 15661 cccaagtgat ccattgccgc tatgtgggag acatcaccag tgatattaaa tacaaagagg 15721 acttgcaggt cctgaaggga tttggctgct tcctgtatga cactcctgac atggtccgct 15781 cccggcacct gcggaagctc tggtctaatt acctatacac tgataaggca agggagatgc 15841 gagacaaata caaagtggtg cttgacactc cagaatacag aaaagtgcaa gaactgaaga 15901 cacatctgag tgagctggtc tacagagctg caggcaagaa gcagaagtca atctttactt 15961 cagttcctga tactcctgat cttttaagag ccaagcgagg gcagaagctt cagagtcagt 16021 atctgtatgt tgaacttgcc accaaagaga gaccccatca tcacgctgga aaccagacca 16081 cagccttgaa gcatgctaaa gacgtgaagg acatggtcag tgagaaaaag tacaagattc 16141 aatatgaaaa gatgaaagac aagtacactc cggttccaga tacgccaatc ctcatcagag 16201 ccaagagggc ttactggaat gccagtgatc tacgctacaa agaaacattt caaaagacca 16261 aagggaaata ccacacggtg aaagatgccc tagacattgt ctatcatcgc aaagtcacag 16321 atgacatcag taaaataaaa tacaaggaga actacatgag ccagttgggt atctggaggt 16381 ccattcctga tcgtccagag catttccacc accgagcagt cactgacaca gtcagtgatg 16441 taaaatataa agaagacttg acttggctta aaggcattgg ttgctatgcc tatgataccc 16501 ctgatttcac tctggctgaa aagaacaaga ctctctacag caagtataag tataaagaag 16561 tatttgaaag gacaaagtca gatttcaagt atgttgccga ctctccgatc aataggcatt 16621 tcaagtatgc aactcaattg atgaatgaga aaaaatacag agctgattat gagcagcgga 16681 aagataaata ccacctggta gtcgatgagc ctagacatct gctggctaag acccgcagcg 16741 accagatcag tcagatcaaa tacaggaaaa actatgaaaa atcaaaggac aaatttacct 16801 caattgtgga tactccagaa cacctgcgta ctacaaaagt caacaaacaa atcagcgata 16861 tcctttataa attggaatac aacaaggcca aacccagagg ctacaccaca atccacgaca 16921 cgcccatgtt gctgcatgtc cgcaaggtta aagatgaagt cagtgatctg aaatacaaag 16981 aagtatacca aagaaataaa tccaactgca ccattgagcc agatgctgtt catatcaaag 17041 cagccaagga cgcctacaaa gtcaacacca atctggacta taagaaacag tacgaagcca 17101 acaaagccca ctggaagtgg actcctgacc gaccggactt cctccaggct gccaagtcat 17161 ccctgcagca aagcgatttt gaatataagc tggaccggga gttcctcaag ggttgcaagc 17221 tttctgtcac tgatgacaaa aacacggtgc tcgccctcag gaatacttta atagaaagtg 17281 atctgaaata caaagagaaa catgtcaagg aaagaggaac ctgccatgcc gtacctgaca 17341 cgcctcagat cctgctggcg aagactgtca gcaacctggt gtctgagaac aagtacaagg 17401 accatgtcaa gaagcacttg gcacagggct catacacaac actaccagag acccgggaca 17461 ctgttcacgt caaggaagtg accaagcatg tcagtgatac aaattacaaa aagaagtttg 17521 tcaaggagaa aggaaaatcc aactactcca tcatgctgga gccaccagag gtgaaacatg 17581 ctatggaagt ggccaagaag caaagtgatg tcgcttacag aaaagatgcc aaagagaacc 17641 tgcattacac cacagtggct gatcgaccag acatcaagaa ggccacacag gcagccaaac 17701 aggccagtga ggtggagtac agagccaagc accgcaagga aggcagccat ggcttaagca 17761 tgctcggtcg cccagacata gaaatggcca agaaggcagc caagctgagc agccaggtta 17821 aataccgaga aaatttcgat aaagaaaagg gcaagacacc aaaatacaat ccaaaagaca 17881 gccagctcta caaagtcatg aaagatgcta ataatcttgc aagtgaggtt aaatacaagg 17941 ctgacctgaa gaaacttcac aaacccgtga ctgacatgaa ggagtctctg atcatgaatc 18001 atgtcctgaa tacaagccaa cttgccagtt cttaccagta caagaagaag tatgagaaga 18061 gtaaaggcca ctaccacacc atacccgata atctggagca gcttcaccta aaagaggcca 18121 cagaattaca gagtatagtg aaatacaaag aaaagtatga aaaggaacga ggaaaaccca 18181 tgctggactt tgaaacacca acgtacatca ctgccaaaga gtctcagcag atgcagagtg 18241 ggaaagaata taggaaagat tatgaagagt ccattaaagg cagaaacctg actggcctgg 18301 aggtcacgcc agctttgtta catgtcaaat atgcaactaa aatagcaagc gagaaagagt 18361 acaggaaaga tctagaggaa agcatccgtg ggaagggcct cactgaaatg gaagatacac 18421 ctgacatgct aagagcaaag aatgccactc aaatcctcaa tgagaaagaa tataagcgag 18481 acctggaact ggaagtcaaa ggaagaggcc tgaatgccat ggccaatgaa actccggatt 18541 ttatgagggc caggaatgct actgatattg ccagtcagat taagtataag caatcagcag 18601 aaatggagaa agccaatttc acttctgtgg ttgatactcc agagatcatt catgcccaac 18661 aagtcaagaa tctttcaagc cagaaaaagt acaaggaaga tgctgagaag tccatgtcgt 18721 attatgagac tgttttggac accccagaga tacagagagt ccgggagaac caaaagaact 18781 tcagccttct ccaataccag tgtgacctta aaaacagtaa aggaaaaatt acagttgttc 18841 aagacacgcc agaaatactg cgtgtaaaag aaaatcagaa gaatttcagc tcggttttat 18901 ataaagagga tgtctcacca ggaacggcta tcggaaagac acctgagatg atgagagtga 18961 aacaaacaca ggaccacatt agctcggtga agtataagga agcaatagga caaggaactc 19021 caatccctga cctgcctgaa gtgaaacgtg tgaaggagac gcagaagcac attagctcgg 19081 ttatgtacaa agaaaacttg ggaacaggca ttccaaccac tgtgactcca gagattgaga 19141 gagtcaaacg caatcaagag aactttagct cggttttgta caaagaaaat ttggggaaag 19201 gaatcccaac acctatcact ccagagatgg agagagtcaa acgcaatcaa gagaacttta 19261 gctcggtgtt atacaaagaa aacatgggca agggaactcc tttacctgtc actcccgaga 19321 tggagcgagt caaacacaat caagaaaata ttagctcggt tttgtacaaa gaaaatgtgg 19381 ggaaagccac cgcaacccct gtcactcctg agatgcagag agtcaaacgc aatcaagaaa 19441 acattagctc ggtgttatac aaagagaacc tggggaaagc aacccccaca ccctttactc 19501 ctgagatgga aagagtgaaa cgcaatcaag aaaactttag ctcggtattg tacaaagaga 19561 acatgagaaa agcaactccg acacctgtta ctccagagat ggagagagct aagcgcaacc 19621 aagaaaacat tagctcggtt ctttattctg atagtttccg gaaacaaata caaggcaaag 19681 ctgcctatgt attggatacc cccgagatga gacgggtgag ggagacccaa cggcacatct 19741 caacggtgaa atatcatgaa gactttgaga aacacaaggg ttgcttcaca ccagtggtga 19801 cagatcctat cactgaacga gtaaagaaga acatgcagga cttcagtgac attaactacc 19861 gaggtattca gaggaaagtg gtagaaatgg aacaaaaacg gaatgaccaa gatcaggaga 19921 ctattacagg tttacgtgtc tggcgtacta atcctggttc ggtttttgac tatgatccag 19981 cagaagacaa catccagtcc cgaagcttac acatgattaa tgtccaagct cagcgccgga 20041 gccgggagca gtcacgatct gccagtgcac taagcgtcag tgggggtgag gagaagtctg 20101 agcattcaga agcaccagac caccaccttt cgacttacag cgacgggggt gtctttgcag 20161 tctcaacagc ttacaaacat gcaaaaacca cagagctccc acaacaacga tcatcttcag 20221 ttgctaccca acagacaacg gtatcttcca tcccatctca tccatctact gctggaaaaa 20281 tcttccgtgc catgtatgac tatatggctg ctgatgcaga tgaggtgtcc ttcaaggatg 20341 gagatgccat cataaatgtt caagcaattg atgaaggctg gatgtatggc actgtgcaga 20401 ggactggcag gaccggaatg ctcccagcca actacgttga agctatttag gcatttcaaa 20461 gcatcacact tgtctgcagg acttacagat cctgcagtca atgtttcggt ttagactctc 20521 cactgttacc taagttctca agctgcctat ggtttttctg tgtcaatgtg atttatggta 20581 gtaccatcct ttctcctttg ggttttaaaa taagttgcag aacagacact ttaaaagctt 20641 ctgcaatatt atttctgtgc ctagagtctt tctccattat aaacatgttt taacattatt 20701 tcttttctaa aacagggatt ttgaatatgc caaacacatt aaaggaaaaa tagcagagat 20761 gttcaccttt tccttgctga ttgctaatgc ttattatttc taattcagtt ctgaagttat 20821 aaacttataa tcaatacaaa ccagcaacta ataaaacctc taattctgca aaaaaaaaaa 20881 a // LOCUS HSRNANLP 2414 bp RNA PRI 17-AUG-1995 DEFINITION H.sapiens mRNA for nucleoporin-like protein. ACCESSION X89478 NID g950050 KEYWORDS nucleoporin-like protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2414) AUTHORS Fritz,C.C., Zapp,M.L. and Green,M.R. TITLE A human nucleoporin-like protein that specifically interacts with HIV Rev JOURNAL Nature 376 (6540), 530-533 (1995) MEDLINE 95364930 REFERENCE 2 (bases 1 to 2414) AUTHORS Green,M.R. TITLE Direct Submission JOURNAL Submitted (07-JUL-1995) M.R. Green, Univ. of Massachusetts Medical Center, Program in Molecular Medicine, 373 Plantation Street, Worcester, MA 01605, USA FEATURES Location/Qualifiers source 1..2414 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 67..1755 /codon_start=1 /product="nucleoporin-like protein" /db_xref="PID:g950051" /translation="MAASAKRKQEEKHLKMLRDMTGLPRNRKCFDCDQRGPTYVNMTV GSFVCTSCSGSLRGLNPPHRVKSISMTTFTQQEIEFLQKHGNEVCKQIWLGLFDDRSS AIPDFRDPQKVKEFLQEKYEKKRWYVPPEQAKVVASVHASISGSSASSTSSTPEVKPL KSLLGDSAPTLHLNKGTPSQSPVVGRSQGQQQEKKQFDLLSDLGSDIFAAPAPQSTAT ANFANFAHFNSHAAQNSANADFANFDAFGQSSGSSNFGGFPTASHSPFQPQTTGGSAA SVNANFAHFDNFPKSSSADFGTFNTSQSHQTASAVSKVSTNKAGLQTADKYAALANLD NIFSAGQGGDQGSGFGTTGKAPVGSVVSVPSQSSASSDKYAALAELDSVFSSAATSSN AYTSTSNASSNVFGTVPVVASAQTQPASSSVPAPFGRTPSTNPFVAAAGPSVASSTNP FQTNARGATAATFGTASMSMPTGFGTPAPYSLPTSFSGSFQQPAFPAQAAFPQQTAFS QQPNGAGFAAFGQTKPVVTPFGQVAAAGVSSNPFMTGAPTGQFPTGSSSTNPFL" BASE COUNT 687 a 577 c 496 g 654 t ORIGIN 1 gcgggccccc ggcgcagcgc tgcccggctc ccggccctgc cggcctcctc ccttggcgcc 61 gcggccatgg cggccagcgc gaagcggaag caggaggaga agcacctgaa gatgctgcgg 121 gacatgaccg gcctcccgcg caaccgaaag tgcttcgact gcgaccagcg cggccccacc 181 tacgttaaca tgacggtcgg ctccttcgtg tgtacctcct gctccggcag cctgcgagga 241 ttaaatccac cacacagggt gaaatctatc tccatgacaa cattcacaca acaggaaatt 301 gaattcttac aaaaacatgg aaatgaagtc tgtaaacaga tttggctagg attatttgat 361 gatagatctt cagcaattcc agacttcagg gatccacaaa aagtgaaaga gtttctacaa 421 gaaaagtatg aaaagaaaag atggtatgtc ccgccagaac aagccaaagt cgtggcatca 481 gttcatgcat ctatttcagg gtcctctgcc agtagcacaa gcagcacacc tgaggtcaaa 541 ccactgaaat ctcttttagg ggattctgca ccaacactgc acttaaataa gggcacacct 601 agtcagtccc cagttgtagg tcgttctcaa gggcagcagc aggagaagaa gcaatttgac 661 cttttaagtg atctcggctc agacatcttt gctgctccag ctcctcagtc aacagctaca 721 gccaattttg ctaactttgc acatttcaac agtcatgcag ctcagaattc tgcaaatgca 781 gattttgcaa actttgatgc atttggacag tctagtggtt cgagtaattt tggaggtttc 841 cccacagcaa gtcactctcc ttttcagccc caaactacag gtggaagtgc tgcatcagta 901 aatgctaatt ttgctcattt tgataacttc cccaaatcct ccagtgctga ttttggaacc 961 ttcaatactt cccagagtca tcaaacagca tcagctgtta gtaaagtttc aacgaacaaa 1021 gctggtttac agactgcaga caaatatgca gcacttgcta atttagacaa tatcttcagt 1081 gccgggcaag gtggtgatca gggaagtggc tttgggacca caggtaaagc tcctgttggt 1141 tctgtggttt cagttcccag tcagtcaagt gcatcttcag acaagtatgc agctctggca 1201 gaactagaca gcgttttcag ttctgcagcc acctccagta atgcgtatac ttccacaagt 1261 aatgctagca gcaatgtttt tggaacagtg ccagtggttg cttctgcaca gacacagcct 1321 gcttcatcaa gtgtgcctgc tccatttgga cgtacgcctt ccacaaatcc atttgttgct 1381 gctgctggtc cttctgtggc atcttctaca aacccatttc agaccaatgc cagaggagca 1441 acagcggcaa cctttggcac tgcatccatg agcatgccca cgggattcgg cactcctgct 1501 ccctacagtc ttcccaccag ctttagtggc agctttcagc agcctgcctt tccagcccaa 1561 gcagctttcc ctcaacagac agctttttct caacagccca atggtgcagg ttttgcagca 1621 tttggacaaa caaagccagt agtaacccct tttggtcaag ttgcagctgc tggagtatct 1681 agtaatcctt ttatgactgg tgcaccaaca ggacaatttc caacaggaag ctcatcaacc 1741 aatcctttct tatagcctta tatagacaat ttactggaac gaacttttat gtggtcacat 1801 tacatctctc cacctcttgc actgttgtct tgtttcactg atcttagctt taaacacaag 1861 agaagtcttt aaaaagcctg cattgtgtat taaacaccag gtaatatgtg caaaaccgag 1921 ggctccagta acaccttcta acctgtgaat tggcagaaaa gggtagcggt atcatgtata 1981 ttaaaattgg ctaatattaa gttattgcag ataccacatt cattatgctg cagtactgta 2041 catatttttc ttagaaatta gctatttgtg catatcagta tttgtaactt taacacattg 2101 ttatgtgaga aatgttactg gggaaataga tcagccactt ttaaggtgct gtcatatatc 2161 ttggaatgaa tgacctaaaa tcattttaac cattgctact ggaaagtaac agagtcaaaa 2221 ttggaaggtt ttattcattc ttgaattttt cctttctaaa gagctcttct atttatacat 2281 gcctaaattc ttttaaaatg tagagggata cctgtctgca taataaagct gatcatgttt 2341 tgctacagtt tgcaggtgaa aaaaaataaa tattataaaa taaaaaaaaa aaaaaagaaa 2401 aaaaaaagga attc // LOCUS HSRNANPH1 2695 bp RNA PRI 28-DEC-1997 DEFINITION Homo sapiens mRNA for NPH1 candidate protein. ACCESSION AJ001815 NID g2570025 KEYWORDS NPH1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2695) AUTHORS Saunier,S., Calado,J., Heilig,R., Silbermann,F., Benessy,F., Morin,G., Konrad,M., Broyer,M., Gubler,M.C., Weissenbach,J. and Antignac,C. TITLE The article (A novel gene that encodes a protein with a putative src homology 3 domain is a candidate gene for familial juvenile nephronophthisis) JOURNAL Hum. Mol. Genet. 6, 2317-2323 (1997) REFERENCE 2 (bases 1 to 2695) AUTHORS Antignac,C. TITLE Direct Submission JOURNAL Submitted (26-SEP-1997) Antignac, C., U423, INSERM, Hopital Necker Tour Lavoisier 149 rue de Sevres, 75015 PARIS, FRANCE FEATURES Location/Qualifiers source 1..2695 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="2" /map="q13" /tissue_lib="kidney" /dev_stage="foetus" gene 34..2232 /gene="NPH1" CDS 34..2232 /gene="NPH1" /codon_start=1 /db_xref="PID:e1172408" /db_xref="PID:g2570026" /translation="MLARRQRDPLQALRRRNQELKQQVDSLLSESQLKEALEPNKRQH IYQRCIQLKQAIDENKNALQKLSKADESAPVANYNQRKEEEHTLLDKLTQQLQGLAVT ISRENITEVGAPTEEEEESESEDSEDSGGEEEDAEEEEEEKEENESHKWSTGEEYIAV GDFTAQQVGDLTFKKGEILLVIEKKPDGWWIAKDAKGNEGLVPRTYLEPYSEEEEGQE SSEEGSEEDVEAVDETADGAEVKQRTDPHWSAVQKAISEAGIFCLVNHVSFCYLIVLM RNRMETVEDTNGSETGFRAWNVQSRGRIFLVSKPVLQINTVDVLTTMGAIPAGFRPST LSQLLEEGNQFRANYFLQPELMPSQLAFRDLMWDATEGTIRSRPSRISLILTLWSCKM IPLPGMSIQVLSRHVRLCLFDGNKVLSNIHTVRATWQPKKPKTWTFSPQVTRILPCLL DGDCFIRSNSASPDLGILFELGISYIRNSTGERGELSCGWVFLKLFDASGVPIPAKTY ELFLNGGTPYEKGIEVDPSISRRAHGSVFYQIMTMRRQPQLLVKLRSLNRRSRNVLSL LPETLIGNMCSIHLLIFYRQILGDVLLKDRMSLQSTDLISHPMLATFPMLLEQPDVMD ALRSSWAGKESTLKRSEKRDKEFLKSTFLLVYHDCVLPLLHSTRLPPFRWAEEETETA RWKVITDFLKQNQENQGALQALLSPDGVHEPFDLSEQTYDFLGEMRKNAV" exon 34..102 /gene="NPH1" /number=1 exon 103..176 /gene="NPH1" /number=2 exon 177..237 /gene="NPH1" /number=3 exon 238..362 /gene="NPH1" /number=4 exon 363..555 /gene="NPH1" /number=5 exon 556..657 /gene="NPH1" /number=6 exon 658..761 /gene="NPH1" /number=7 exon 762..972 /gene="NPH1" /number=8 exon 973..1057 /gene="NPH1" /number=9 exon 1058..1152 /gene="NPH1" /number=10 exon 1153..1281 /gene="NPH1" /number=11 exon 1282..1356 /gene="NPH1" /number=12 exon 1357..1467 /gene="NPH1" /number=13 exon 1468..1550 /gene="NPH1" /number=14 exon 1551..1627 /gene="NPH1" /number=15 exon 1628..1727 /gene="NPH1" /number=16 exon 1728..1840 /gene="NPH1" /number=17 exon 1841..1914 /gene="NPH1" /number=18 exon 1915..1959 /gene="NPH1" /number=19 exon 1960..2232 /gene="NPH1" /number=20 BASE COUNT 886 a 506 c 597 g 706 t ORIGIN 1 aactggagca atcagagcac cgcagccagg gagatgctgg cgagacgaca gcgagatcct 61 ctccaggccc tgcggcgccg caatcaggag ctgaagcaac aggttgatag tttgctttct 121 gagagccaac tgaaagaagc tctagaaccc aataaaagac aacatattta tcaaagatgt 181 atccagttaa agcaggcaat agatgaaaat aaaaatgctc ttcaaaaatt aagcaaagct 241 gatgaatctg cacctgttgc aaactataat cagagaaaag aagaggagca tactcttttg 301 gacaagctta cccaacaact gcagggcctt gctgtgacaa taagcagaga aaatataact 361 gaagttgggg cacctactga agaagaggaa gaaagtgaaa gtgaagatag tgaagacagt 421 ggtggggagg aagaagatgc agaggaggaa gaggaagaga aagaggaaaa tgaatctcac 481 aaatggtcaa ccggtgaaga atacatcgct gttggagatt ttactgctca gcaagttgga 541 gatcttacat ttaagaaagg ggaaattctc cttgtaattg aaaaaaaacc tgatggttgg 601 tggatagcta aggatgccaa aggaaatgaa ggtcttgttc ccagaaccta cctagagcct 661 tatagtgaag aagaagaagg ccaagaatca agtgaagagg gcagtgaaga agatgtagag 721 gcggtggatg aaacagcaga tggagcagaa gttaagcaaa gaactgatcc ccactggagt 781 gctgttcaga aagcgatttc agaggcgggc atcttctgtc ttgttaatca tgtctcgttt 841 tgctacctaa tagttctgat gcgaaatagg atggagactg tggaagacac caatggatct 901 gaaacagggt tcagggcatg gaatgtacag agcagaggac gtatatttct ggtttctaag 961 cctgtgctcc aaataaacac tgttgatgtg ttaactacga tgggagctat tcctgcaggg 1021 ttcaggcctt ccacgctctc acagcttctg gaggaaggga atcaatttcg agcaaattac 1081 ttcttacaac cagagctcat gccttcacaa ctggccttca gagatctgat gtgggatgct 1141 acagaaggca ctattaggtc gagaccaagt cgtatttcat tgattctgac attatggagc 1201 tgtaaaatga ttcctcttcc aggaatgagc atacaggttc tcagcagaca tgtacgcctc 1261 tgtctatttg atggtaataa ggttctgagc aacattcata cagtcagagc cacatggcaa 1321 cctaaaaagc ccaaaacatg gaccttttct ccccaggtta ctcgcatctt accatgtttg 1381 cttgatggtg attgctttat caggtctaat tctgcatctc cagatcttgg aatattattt 1441 gaacttggaa tttcttatat tcgcaattca actggtgaaa gaggagagtt aagctgtggc 1501 tgggtgtttc ttaaactttt tgatgccagt ggagttccta ttccagcaaa aacttatgag 1561 cttttcttga atggtggtac tccttatgaa aaaggtattg aagtggaccc ttcaatatcc 1621 agaagagcac acggcagtgt tttctaccag attatgacaa tgagaaggca gcctcaactt 1681 ctagtgaaac tgagatcctt gaacagaaga tcaagaaatg tactaagtct actgccagaa 1741 acattaattg gaaatatgtg ttctattcac ttgttgatat tttatcgaca aattcttgga 1801 gatgtgctcc tgaaagacag gatgagcttg caaagtactg atttaattag ccatcccatg 1861 ctggccacct tccccatgct cttggagcag cctgatgtga tggatgctct caggagttcg 1921 tgggctggaa aagaaagcac attaaaaaga tcagagaaga gagacaaaga gttcctgaag 1981 tccacgtttc tcctggttta ccatgactgc gtgctcccac ttctccactc cacacgccta 2041 cccccattca ggtgggcaga agaagagact gagactgcac ggtggaaagt tatcactgac 2101 ttccttaagc aaaaccaaga aaaccagggc gccctccaag ctctgctgtc accagacgga 2161 gttcatgaac cttttgacct ttcagagcag acctatgact tcttgggtga aatgagaaag 2221 aatgcagtgt gacagtggca gcctctagcc ctcagcttcc cacggaatca gatggatcct 2281 ccacgattac gtgaataaaa tgatggaacc caaaaatcac tgtcacttta caacttaggt 2341 tttactcttt tctttctaca gaccatattt ttaaagaaat gtttatacaa taatttaaat 2401 attttttaaa accataaaat aaatttttat aaggaatact gttatatcta aatttaaaca 2461 gtatttattt tttcaaaaac agctacttaa gttaatggta tagatttcta taaaagcaag 2521 attttgtcaa aaactaaatt tatgattatt caagaaagtg aaaaaaacaa cctacagaat 2581 gggaaaacat atttgcaaat catctaactg ataaaggtct agtatccaaa atatttaaat 2641 ttatgagtgt taataaaatt tatcttgttc aatgaagagg aagttaaaaa aaaaa // LOCUS HSRNAP2EF 1456 bp RNA PRI 07-JUN-1995 DEFINITION H.sapiens mRNA for RNA polymerase II elongation factor-like protein. ACCESSION Z47087 NID g860989 KEYWORDS RNA polymerase II elongation factor-like protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1456) AUTHORS Sowden,J.J. and Edwards,Y.Y. TITLE A conserved mammalian embryonic mRNA with homology to RNA polymeraseII elongation factor JOURNAL Unpublished REFERENCE 2 (bases 1 to 1456) AUTHORS Edwards,Y.Y. TITLE Direct Submission JOURNAL Submitted (21-DEC-1994) Yvonne Y.H. Edwards Dr, Wolfson House, MRC Human Biochemical Genetics Unit,UCL, 4 Stephenson Way, London, NW1 2HE, England FEATURES Location/Qualifiers source 1..1456 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="cl19" /dev_stage="6.5 week gestation" /tissue_type="whole embryo" /clone_lib="lambda ZAP embryo cDNA library" /chromosome="5" CDS 110..601 /codon_start=1 /product="RNA polymerase II elongation factor-like protein" /db_xref="PID:g860990" /db_xref="SWISS-PROT:P34991" /translation="MPSIKLQSSDGEIFEVDVEIAKQSVTIKTMLEDLGMDDEGDDDP VPLPNVNAAILKKVIQWCTHHKDDPPPPEDDENKEKRTDDIPVWDQEFLKVDQGTLFE LILAANYLDIKGLLDVTCKTVANMIKGKTPEEIRKTFNIKNDFTEEEEAQVRKENQWC EEK" BASE COUNT 435 a 275 c 296 g 450 t ORIGIN 1 gcggccgctc gacgctgtag tggcttcgtc ttcggttttt ctcttccttc gctaacgcct 61 cccggctctc gtcagcctcc cgccggccgt ctccttaaca ccgaacacca tgccttcaat 121 taagttgcag agttctgatg gagagatatt tgaagttgat gtggaaattg ccaaacaatc 181 tgtgactatt aagaccatgt tggaagattt gggaatggat gatgaaggag atgatgaccc 241 agttcctcta ccaaatgtga atgcagcaat attaaaaaag gtcattcagt ggtgcaccca 301 ccacaaggat gaccctcctc ctcctgaaga tgatgagaac aaagaaaagc gaacagatga 361 tatccctgtt tgggaccaag aattcctgaa agttgaccaa ggaacacttt ttgaactcat 421 tctggctgca aactacttag acatcaaagg tttgcttgat gttacatgca agactgttgc 481 caatatgatc aaggggaaaa ctcctgagga gattcgcaag accttcaata tcaaaaatga 541 ctttactgaa gaggaggaag cccaggtacg caaagagaac cagtggtgtg aagagaagtg 601 aaatgttgtg cctgacactg taacactgta aggattgttc caaatactag ttgcactgct 661 ctgtttataa ttgttaatat tagacaaaca gtagacaaat gcagcagcaa gtcaattgta 721 ttagcagaat attgtcctca ttgcatgtgt agttgtagct cgagtcccaa accttacggc 781 caagtttctt ctagtatgat ggaaagtttc ttttttcttt gctctgaata aaactgaact 841 gtgggttctc tataagtggc attttgggct ttcctccctt ttttgtaaag caatgtctgc 901 ctagtttatt gtccaagtta actttaggtg accttttaaa agttggcatt gaaaataaaa 961 caacttgcaa aaaagttttc tggaatagaa ttaacaaaat attatcttta tcatgagttg 1021 gaaactggaa aaaggcttct tgaagtaaat gttctgagtg gagctactag gatgtcttcc 1081 agcctcctgc agtcaaggag taccactgta ttgattagcc tgtatgtagc agggctccct 1141 tcattgcatc tgaggacttg ttttcttttt ctttattttt aatcctctta gttttaaata 1201 tattgcctag agactcagtt actacccagt ttgtggtttt ttgggagaaa tgtaactgga 1261 cagttagctt ttcaattaaa aagacactta acccatgtgg gatgtcatct ttttataatt 1321 agtgttccca tgtggagaaa attattcaca ctacttgcat gtaaagaata atttaacttt 1381 taacattaaa atatgtggta aaacccagaa agcatccatc atgaatgcaa gatactttca 1441 ataaagtaag ttatat // LOCUS HSRNAPHKB 4284 bp RNA PRI 21-AUG-1996 DEFINITION H.sapiens mRNA for phosphorylase-kinase, beta subunit. ACCESSION X84908 NID g1502344 KEYWORDS beta subunit; PHKB gene; phosphorylase kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4284) AUTHORS Wullrich-Schmoll,A. and Kilimann,M.W. TITLE Structure of the human gene encoding the phosphorylase kinase beta subunit (PHKB) JOURNAL Eur. J. Biochem. 238 (2), 374-380 (1996) MEDLINE 96283831 REFERENCE 2 (bases 1 to 4284) AUTHORS Kilimann,M.W. TITLE Direct Submission JOURNAL Submitted (22-FEB-1995) M.W. Kilimann, Institut fuer Physiologische Chemie I, Ruhr-Universitaet Bochum, Universitaetsstrasse 150, 44801 Bochum, FRG FEATURES Location/Qualifiers source 1..4284 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="muscle" /clone="MM6" /chromosome="16" /map="q12-q13" gene 25..3306 /gene="PHKB" CDS 25..3306 /gene="PHKB" /EC_number="2.7.1.38" /note="beta subunit" /codon_start=1 /product="phosphorylase kinase" /db_xref="PID:e139819" /db_xref="PID:g1502345" /translation="MAGAAGLTAEVSWKVLERRARTKRSGSVYEPLKSINLPRPDNET LWDKLDHYYRIVKSTLLLYQSPTTGLFPTKTCGGDQKAKIQDSLYCAAGAWALALAYR RIDDDKGRTHELEHSAIKCMRGILYCYMRQADKVQQFKQDPRPTTCLHSVFNVHTGDE LLSYEEYGHLQINAVSLYLLYLVEMISSGLQIIYNTDEVSFIQNLVFCVERVYRVPDF GVWERGSKYNNGSTELHSSSVGLAKAALEAINGFNLFGNQGCSWSVIFVDLDAHNRNR QTLCSLLPRESRSHNTDAALLPCISYPAFALDDEVLFSQTLDKVVRKLKGKYGFKRFL RDGYRTSLEDPNRCYYKPAEIKLFDGIECEFPIFFLYMMIDGVFRGNPKQVQEYQDLL TPVLHHTTEGYPVVPKYYYVPADFVEYEKNNPGSQKRFPSNCGRDGKLFLWGQALYII AKLLADELISPKDIDPVQRYVPLKDQRNVSMRFSNQGPLENDLVVHVALIAESQRLQV FLNTYGIQTQTPQQVEPIQIWPQQELVKAYLQLGINEKLGLSGRPDRPIGCLGTSKIY RILGKTVVCYPIIFDLSDFYMSQDVFLLIDDIKNALQFIKQYWKMHGRPLFLVLIRED NIRGSRFNPILDMLAALKKGIIGGVKVHVDRLQTLISGAVVEQLDFLRISDTEELPEF KSFEELEPPKHSKVKRQSSTPSAPELGQQPDVNISEWKDKPTHEILQKLNDCSCLASQ AILLGILLKREGPNFITKEGTVSDHIERVYRRAGSQKLWLAVRYGAAFTQKFSSSIAP HITTFLVHGKQVTLGAFGHEEEVISNPLSPRVIQNIIYYKCNTHDEREAVIQQELVIH IGWIISNNPELFSGMLKIRIGWIIHAMEYELQIRGGDKPALDLYQLSPSEVKQLLLDI LQPQQNGRCWLNRRQIDGSLNRTPTGFYDRVWQILERTPNGIIVAGKHLPQQPTLSDM TMYEMNFSLLVEDTLGNIDQPQYRQIVVELLMVVSIVLERNPELEFQDKVDLDRLVKE AFNEFQKDQSRLKEIEKQDDMTSFYNTPPLGKRGTCSYLTKAVMNLLLEGEVKPNNDD PCLIS" BASE COUNT 1293 a 871 c 966 g 1154 t ORIGIN 1 ggccaaggcg gcgaccggag cgcgatggcg ggggcggcgg gactcacggc agaagtgagc 61 tggaaggtct tggagcgaag agctcggacc aagcgctcag gctcagttta tgaacctctt 121 aaaagcatta atcttccaag acctgataat gaaactctct gggataagtt ggaccattat 181 tacagaattg tcaagtcaac attgctgctg tatcaaagtc caactaccgg tctctttccc 241 actaaaacat gcggtggtga ccagaaggcc aagatccagg acagcctata ctgcgctgct 301 ggggcctggg ctttggctct tgcatacagg cgaattgatg atgacaaggg aaggacccat 361 gagctggagc actcagctat aaaatgcatg agaggaattc tctactgcta tatgcgtcag 421 gccgataagg tccagcagtt taagcaggat ccacgcccaa caacatgtct tcactctgtt 481 ttcaatgtgc atacaggaga tgagttgctt tcctatgagg aatatggtca tcttcagata 541 aatgcagtgt cactttatct cctttacctt gtggaaatga tttcctcagg actccagatt 601 atctacaaca ctgatgaggt ctcttttatt caaaaccttg tattttgtgt ggaaagagtt 661 taccgtgtgc ctgactttgg tgtctgggaa agaggaagca aatataataa tggcagcaca 721 gagctacatt cgagctcggt tggtttagca aaagcagctc tagaagcaat taatggattc 781 aacctttttg gcaaccaggg ctgttcgtgg tcagttatat ttgtggatct cgatgctcac 841 aatcgcaaca ggcaaacttt gtgctcgctg ttacccagag aatcaagatc acataataca 901 gatgctgccc tgctcccctg catcagttat cctgcatttg ccctggatga tgaagttctt 961 tttagccaga cacttgataa agtggttaga aaattaaaag gaaaatatgg atttaaacgt 1021 ttcttgagag atgggtatag aacatcattg gaagatccca acagatgcta ctacaagcca 1081 gctgaaatta agctatttga tggcattgaa tgtgaatttc ccatattttt cctttatatg 1141 atgattgatg gagtttttag aggcaatcct aagcaagtac aggaatatca ggatcttttg 1201 actccagtac ttcatcatac cacagaagga tatcctgttg taccaaagta ctattatgtg 1261 ccagctgact ttgtagaata tgaaaaaaat aaccctggta gtcaaaaacg atttcctagc 1321 aactgtggcc gtgatggaaa actgtttctt tggggacaag cactttatat catcgcaaaa 1381 ctcctggctg atgaacttat tagtcctaaa gacattgatc ctgtccagcg ctatgtccca 1441 ctaaaggatc aacgtaacgt gagcatgagg ttttccaatc agggcccact ggaaaatgac 1501 ttggtagttc atgtggcact tatagcagaa agccaaagac ttcaagtttt tctgaacaca 1561 tatggtattc aaactcaaac tcctcaacaa gtagaaccca ttcagatatg gcctcagcag 1621 gagcttgtga aagcttattt gcagctgggt atcaatgaaa agttaggact ctctggaagg 1681 ccagacaggc ccattggctg cctcgggaca tcaaagattt atcgcattct aggaaagact 1741 gtggtttgtt acccgattat tttcgaccta agtgatttct acatgtctca ggatgttttc 1801 ctgctgatag atgacataaa gaatgcgctg cagttcatta aacaatattg gaaaatgcat 1861 ggacgtccac ttttccttgt tctcatccgg gaagacaata taagaggtag ccggttcaac 1921 cccatattag atatgctggc agcccttaaa aaaggaataa ttggaggagt caaagttcat 1981 gtggatcgtc tacagacact aatatctgga gctgtggtag aacaacttga tttcctacga 2041 atcagtgaca cagaagagct tccagaattt aagagttttg aggaactaga acctcccaaa 2101 cattcaaaag tcaaacggca aagcagcacc cctagtgctc ctgaactggg acagcagccg 2161 gatgtcaaca ttagtgaatg gaaggacaaa cccacccacg aaattcttca aaaactgaat 2221 gattgcagtt gtctggctag ccaagccatc ctgctgggta tactgctcaa aagagaaggc 2281 cccaacttca tcacaaagga aggtaccgtt tctgatcaca ttgagagagt ctatagaaga 2341 gctggcagcc aaaaactttg gttggcggtg cgctacgggg ctgcatttac ccagaaattt 2401 tcttcctcta tagccccaca cattactact tttctggtac atgggaaaca ggtaactctg 2461 ggtgcctttg ggcatgaaga agaagttatc tctaatcctt tgtctccaag agtgattcaa 2521 aacatcatct attataagtg taacacccat gatgagaggg aagcggtcat tcagcaagaa 2581 ctggtcatcc atattggctg gatcatctcc aataaccctg agttattcag tggcatgctg 2641 aaaatacgaa tcgggtggat catccatgcc atggagtatg aacttcagat ccgtggcgga 2701 gacaagccag ccttggactt gtatcagctg tcacctagtg aagttaaaca gcttctgctg 2761 gatattctgc agcctcaaca gaatggaaga tgttggctga acaggcgtca gatcgatggg 2821 tctttgaata gaactcccac cgggttctat gaccgagtgt ggcagattct ggagcgcacg 2881 cccaatggga tcattgttgc tgggaagcat ttgcctcagc aaccaaccct gtcagatatg 2941 accatgtatg agatgaattt ctctctcctt gttgaagaca cgttgggaaa tattgaccag 3001 ccacagtaca gacagatcgt tgtagagtta cttatggttg tatccattgt actggaaaga 3061 aaccccgagc tagaatttca agacaaagta gatctagaca gactggtcaa agaagcattt 3121 aatgaatttc aaaaagatca gagtcggcta aaggaaattg aaaaacaaga tgacatgact 3181 tccttttaca acactcctcc cctgggaaaa agaggaacat gcagctattt gacaaaggcg 3241 gtgatgaatc tgctgctgga aggagaagtc aagccaaaca atgatgaccc gtgtctgatt 3301 agctagtggg gaaggtgtag gaagctctgt tgagacacat gttctgaagt gtgttgtgtt 3361 tcatgttcaa gcttaatcaa ggcagccatt aatatacgaa ctgagcatgc tggggaggtg 3421 aatgccacat ccttggcggg gttatggacc tcttgcatgt catagccaat ctaacggtaa 3481 tggtaaatgc ttttaatcaa gcaggaaaaa gttctcatga ttatgccaac tataatagta 3541 atcctcactg agtgataaaa atagtttatg aattgaaaat ttgccgctgc atgttgtatg 3601 atcaaatagt tcatcaaaat gaatctttgc tctttggact gaattcttac catactgcca 3661 ttaaaataaa tttgccaact agtaatgcat actggaaatc aaaagatact gaaagaatgg 3721 tgaacttctc ttagtggtat tgtcatgcta aaagatgtta atatacatca taaaagcaaa 3781 gtcagccagc tgatattttg gttctcaaaa actgcattat taataatatt ttagtataca 3841 gagctattct acagttttta cattgtaaac atgactgtgg ttttgtattt gctaaatata 3901 ggggttggac taaaatataa taaatctgta ccttatcaaa cattttcttt gagctcctgc 3961 taaaaatagg acatgtctat gattgttcaa aaatatgtta aatttaggct cagcacagta 4021 gctcacacct gaaatcttag cacttcggga ggctgaggca ggtggatcac ttgaggttag 4081 gagttcaaga ccagcccagc caacatggtg aaaaccctgt ctctactaaa aatacaaaaa 4141 ttagccaggc atgatggtgc atgcctttta aacccagcta ctgaggaggc tgaggcatga 4201 gaattgcttg aaccaggaga cggaggttgc agtgagctga aatcctgcca ctgcacacca 4261 gcctgggtga cagagcgaga ctcc // LOCUS HSRNAPRCC 1989 bp RNA PRI 17-SEP-1996 DEFINITION H.sapiens mRNA for prcc protein. ACCESSION X97124 NID g1518264 KEYWORDS prcc gene; translocation associated gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1989) AUTHORS Sidhar,S.K., Clark,J., Gill,S., Hamoudi,R., Crew,A.J., Gwilliam,R., Ross,M., Linehan,W.M., Birdsall,S., Shipley,J. and Cooper,C.S. TITLE The t(X;1)(p11.2;q21.2) translocation in papillary renal cell carcinoma fuses a novel gene PRCC to the TFE3 transcription factor gene JOURNAL Hum. Mol. Genet. 5 (9), 1333-1338 (1996) MEDLINE 97026295 REFERENCE 2 (bases 1 to 1989) AUTHORS Sidhar,S.K. TITLE Direct Submission JOURNAL Submitted (04-APR-1996) S.K. Sidhar, Institute of Cancer Research, Molecular Carcinogenesis, 15 Cotswold Road, Sutton, Surrey, SM2 5NG, UK REMARK Revised by author 11-JUN-96 FEATURES Location/Qualifiers source 1..1989 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="monocyte" /chromosome="1" /map="q21.2" /clone="77B12" /clone="75M18" /cell_line="U937" /dev_stage="adult" gene 191..1666 /gene="prcc" CDS 191..1666 /gene="prcc" /function="translocation sith TFE3 gene in papillary renal cell cancer" /codon_start=1 /db_xref="PID:e242908" /db_xref="PID:g1518265" /translation="MSLVAYASSDESEPDEAEPEPEEEEAVAPTSGPALGGLFASLPA PKGPALLPPPPQMLAPAFPPPLLLPPPTGDPRLQPPPPLPFGLGGFPPPPGVSPAEAA GVGEGLGLGLPSPRGPGLNLPPPIGGAGPPLGLPKPKKRKEPVKIAAPELHKGDSDSE EDEPTKKKTILQGSSEGTGLSALLPQPKNLTVKETNRLLLPHAFSRKPSDGSPDTKPS RLASKTKTSSLAPVVGTTTTTPSPSAIKAAAKSAALQVTKQITQEEDDSDEEVAPENF FSLPEKAEPPGVEPYPYPIPTVPEELPPGTEPEPAFQDDAANAPLEFKMAAGSSGAPW MPKPGDDYSYNQFSTYGDANAAGAYYQDYYSGGYYPAQDPALVPPQEIAPDASFIDDE AFKRLQGKRNRGREEINFVEIKGDDQLSGAQQWMTKSLTEEKTMKSFSKKKGEQPTGQ QRRKHQITYLIHQAKERELELKNTWSENKLSRRQTQAKYGF" BASE COUNT 440 a 612 c 538 g 399 t ORIGIN 1 ctaaaggcct tgttcggtgg aaatcagccg tagccatgag tttctgccgg ggctagccct 61 agagtacgga gcaggcggac ttttcggttc cccgccccgc caggtggcgg ggcctactag 121 gcctccgggc atccccggtc tcaagtaggc ctcatctgcc ggcaagggcg cccgaaacgc 181 gggaggcgcc atgtcgctgg ttgcttacgc cagcagcgat gagagcgagc cggatgaggc 241 tgagcccgag ccggaggaag aggaggcggt ggctcctaca tctgggcccg ctttaggggg 301 cttgttcgct tctctccctg cgcccaaggg tccggccttg ctgcctccgc cccctcagat 361 gctggcgcca gcctttcccc cgccgctgtt gcttccccca cccaccggag accccaggct 421 tcagcctcct ccccccttgc ccttcggcct gggaggcttc cccccacctc caggcgtgag 481 cccggctgaa gcggcgggag ttggggaggg actgggattg gggttgccct cgccccgagg 541 ccctggcctc aatctgcccc ctccaattgg cggtgccggt cccccgctgg ggcttcccaa 601 gccaaagaag aggaaagagc ccgtgaagat cgcggcgccg gagttgcata agggagattc 661 agattctgag gaagatgaac ccacaaagaa gaaaactatc cttcagggat ccagtgaggg 721 gactggtttg tctgccttgc ttccccaacc taaaaacctg actgtgaaag agactaacag 781 gttgctcctg ccccatgcct tctcccgcaa accctcggat ggctcccctg atactaagcc 841 ctccagactg gcttctaaga ccaagacttc ctctcttgcc cctgttgtgg gcaccacaac 901 caccactccg tcgccctctg ctatcaaggc tgctgccaag agtgctgccc tgcaggtgac 961 aaagcagatc acgcaggaag aagacgacag tgatgaggaa gtagcccccg aaaacttttt 1021 ctccctccct gaaaaggctg agccacctgg agttgagcca tacccttacc ccatccccac 1081 tgtccctgaa gagctgcctc caggcacgga accagagccg gctttccagg acgatgcagc 1141 caatgccccc cttgaattca agatggcagc aggttcaagt ggggcccctt ggatgcctaa 1201 gcctggggac gactacagct acaatcagtt ttccacatat ggcgatgcca atgccgctgg 1261 tgcttattat caggattatt acagtggtgg ctactatcct gcacaggacc cggccctggt 1321 ccccccccag gaaattgccc cagatgcctc cttcatcgat gacgaagcat ttaagcggct 1381 gcagggcaag aggaaccgag ggagagaaga aatcaacttt gtggagatca aaggtgatga 1441 ccagctcagt ggggcccagc aatggatgac taagtcattg acagaagaga aaaccatgaa 1501 gtcattcagc aaaaagaaag gtgagcagcc aacaggccag cagcggcgga aacaccagat 1561 cacatatctt attcatcagg ccaaggagcg ggagctggaa ctgaagaaca cctggtcaga 1621 gaacaagctc agccgccgtc agacccaagc caaatatgga ttctagggct ctggaactga 1681 ttgctcccag gatctcctgc cagcccagct ggcctggccc ccagcttcac ctctgggacc 1741 ccagctgctc taagcccagg atctctttcc ccaaggaccc agccctcgcc tctgcgagaa 1801 tgaacatatt tgatagattt ttcttaacaa gttagaaaat tcagctcctt tctgtcctgg 1861 agctagcaaa gacttgtgtg atgcctccga aggggctctg agttctgggg tgggagtttt 1921 gctctctgtc aggtgtgata aaatgttgaa ccctccccac caccactttt ttttttttaa 1981 accagggat // LOCUS HSRNARAGA 1350 bp RNA PRI 13-NOV-1995 DEFINITION H.sapiens mRNA for ragA protein. ACCESSION X90529 NID g1063395 KEYWORDS GTP-binding protein; RagA. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1350) AUTHORS Joost,H. TITLE Direct Submission JOURNAL Submitted (03-AUG-1995) H. Joost, Inst.f.Pharmakologie und Toxikologie, der RWTH, Wendlingweg 2, D- 52057 Aachen, FRG REFERENCE 2 (bases 1 to 1350) AUTHORS Schurmann,A., Brauers,A., Massmann,S., Becker,W. and Joost,H.G. TITLE Cloning of a novel family of mammalian GTP-binding proteins (RagA, RagBs, RagB1) with remote similarity to the Ras-related GTPases JOURNAL J. Biol. Chem. 270 (48), 28982-28988 (1995) MEDLINE 96081972 FEATURES Location/Qualifiers source 1..1350 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="brain" /clone_lib="lambda Zap" CDS 32..973 /codon_start=1 /product="ragA" /db_xref="PID:g1063396" /translation="MPNTAMKKKVLLMGKSGSGKTSMRSIIFANYIARDTRRLGATID VEHSHVRFLGNLVLNLWDCGGQDTFMENYFTSQRDNIFRNVEVLIYVFDVESRELEKD MHYYQSCLEAILQNSPDAKIFCLVHKMDLVQEDQRDLIFKEREEDLRRLSRPLECACF RTSIWDETLYKAWSSIVYQLIPNVQQLEMNLRNFAQIIEADEVLLFERATFLVISHYQ CKEQRDVHRFEKISNIIKQFKLSCSKLAASFQSMEVRNSNFAAFIDIFTSNTYVMVVM SDPSIPSAATLINIRNARKHFEKLERVDGPKHSLLMR" BASE COUNT 332 a 328 c 326 g 364 t ORIGIN 1 cctgcgagtc cccggcagcc cccggcgggt gatgccaaat acagccatga agaaaaaggt 61 gctgctgatg gggaagagcg ggtcggggaa gaccagcatg aggtcgataa tcttcgccaa 121 ttacattgct cgcgacaccc ggcgcctggg ggccaccatt gacgtggaac actcccacgt 181 ccgattccta gggaacctgg tgctgaacct gtgggactgt ggcggtcagg acaccttcat 241 ggaaaattac ttcaccagcc agcgagacaa tatcttccgt aacgtggaag ttttgattta 301 cgtgtttgac gtggagagcc gcgaactgga aaaggacatg cattattacc agtcgtgtct 361 ggaggccatc ctccagaact ctcctgacgc caaaatcttc tgcctggtgc acaaaatgga 421 tctggttcag gaggatcagc gtgacctgat ttttaaagag cgagaggaag acctgaggcg 481 tctgtctcgc ccgctggagt gtgcttgttt tcgaacgtcc atctgggatg agacgctcta 541 caaagcctgg tccagcatcg tctaccagct gatccccaac gttcagcagc tggagatgaa 601 cctcaggaat tttgcccaaa tcattgaggc cgatgaagtt ctgctgttcg aaagagctac 661 attcttggtt atttcccact accagtgcaa agagcagcgc gacgtccacc ggtttgagaa 721 gatcagcaac atcatcaaac agttcaagct gagctgcagt aaattggccg cttccttcca 781 gagcatggaa gttaggaatt ccaacttcgc tgctttcatc gacatcttca cctcaaatac 841 gtacgtgatg gtggtcatgt cagatccgtc gatcccttct gcggccactc tgatcaacat 901 tcgcaatgcc cggaaacact ttgagaagct ggagagagtg gatggcccca agcacagtct 961 ccttatgcgt tgaatattgc caaatgctct ttctgaaaat gctgaattgc cttttttgtt 1021 tgcatccttt atttttaata ttcataatgt cgtgtgctta aaagtgggct ttgaagtgtg 1081 tgctgcttac tcctttcatc tttctccccg cttccccagt ctttaaacat tggacgctat 1141 ttactcagct acccagtaga gcttgaagct gacctttctg agaagttggt atggtgtaac 1201 actaaagtag gtggttcgtg tgtgttctca ttacctggtt atgatagata tgcacatcaa 1261 agcctttacc agtatcttcc tgtattccgt atcagattgc aaagacggaa tgttactatt 1321 ttatcagcaa ggtattaaaa tgcatttata // LOCUS HSRNAREL 2337 bp RNA PRI 21-SEP-1993 DEFINITION H.sapiens rel proto-oncogene mRNA. ACCESSION X75042 NID g402648 KEYWORDS rel oncogene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2337) AUTHORS Brownell,E., Mittereder,N. and Rice,N.R. TITLE A human rel proto-oncogene cDNA containing an Alu fragment as a potential coding exon JOURNAL Oncogene 4 (7), 935-942 (1989) MEDLINE 89330980 REFERENCE 2 (bases 1 to 2337) AUTHORS Rice,N.R. TITLE Direct Submission JOURNAL Submitted (15-SEP-1993) N.R. Rice, Lab of Molec Virology & Carcinogenesis, BRI-Basic Research Program, NCI-Frederick Cancer Research Facility, PO Box B, Frederick, Maryland 21701, USA FEATURES Location/Qualifiers source 1..2337 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Daudi Burkitt lymphoma cell line" /clone="1" gene 178..2037 /gene="c-rel" CDS 178..2037 /gene="c-rel" /codon_start=1 /db_xref="PID:g402649" /db_xref="SWISS-PROT:Q04864" /translation="MASGAYNPYIEIIEQPRQRGMRFRYKCEGRSAGSIPGEHSTDNN RTYPSIQIMNYYGKGKVRITLVTKNDPYKPHPHDLVGKDCRDGYYEAEFGQERRPLFF QNLGIRCVKKKEVKEAIITRIKAGINPFNVPEKQLNDIEDCDLNVVRLCFQVFLPDEH GNLTTALPPVVSNPIYDNRAPNTAELRICRVNKNCGSVRGGDEIFLLCDKVQKDDIEV RFVLNDWEAKGIFSQADVHRQVAIVFKTPPYCKAITEPVTVKMQLRRPSDQEVSESMD FRYLPDEKDTYGNKAKKQKTTLLFQKLCQDHVETGFRHVDQDGLELLTSGDPPTLASQ SAGITVNFPERPRPGLLGSIGEGRYFKKEPNLFSHDAVVREMPTGVSSQAESYYPSPG PISSGLSHHASMAPLPSSSWSSVAHPTPRSGNTNPLSSFSTRTLPSNSQGIPPFLRIP VGNDLNASNACIYNNADDIVGMEASSMPSADLYGISDPNMLSNCSVNMMTTSSDSMGE TDNPRLLSMNLENPSCNSVLDPRDLRQLHQMSSSSMSAGANSNTTVFVSQSDAFEGSD FSCADNSMINESGPSNSTNPNSHGFVQDSQYSGIGSMQNEQLSDSFPYEFFQV" BASE COUNT 722 a 491 c 524 g 598 t 2 others ORIGIN 1 cggaaggtgt gagccgcaaa cccagcggag ggcgggaaga aggaggaggc ctctagggtg 61 ntcgggggac tgggggcccc gccggcagag gtccctcggc ctcctgactg actgactgcg 121 gccgcctccg gccaggacgc tgggagctgc ctgcgggaag gtgcggggag cggagccatg 181 gcctccggtg cgtataaccc gtatatagag ataattgaac aacccaggca gaggggaatg 241 cgttttagat acaaatgtga agggcgatca gcaggcagca ttccagggga gcacagcaca 301 gacaacaacc gaacataccc ttctatccag attatgaact attatggaaa aggaaaagtg 361 agaattacat tagtaacaaa gaatgaccca tataaacctc atcctcatga tttagttgga 421 aaagactgca gagacggcta ctatgaagca gaatttggac aagaacgcag acctttgttt 481 ttccaaaatt tgggtattcg atgtgtgaag aaaaaagaag taaaagaagc tattattaca 541 agaataaagg caggaatcaa tccattcaat gtccctgaaa aacagctgaa tgatattgaa 601 gattgtgacc tcaatgtggt gagactgtgt tttcaagttt ttctccctga tgaacatggt 661 aatttgacga ctgctcttcc tcctgttgtc tcgaacccaa tttatgacaa ccgtgctcca 721 aatactgcag aattaaggat ttgtcgtgta aacaagaatt gtggaagtgt cagaggagga 781 gatgaaatat ttctactttg tgacaaagtt cagaaagatg acatagaagt tcgttttgtg 841 ttgaacgatt gggaagcaaa aggcatcttt tcacaagctg atgtacaccg tcaagtagcc 901 attgttttca aaactccacc atattgcaaa gctatcacag aacccgtaac agtaaaaatg 961 cagttgcgga gaccttctga ccaggaagtt agtgaatcta tggattttag atatctgcca 1021 gatgaaaaag atacttacgg caataaagca aagaaacaaa agacaactct gcttttccag 1081 aaactgtgcc aggatcacgt agaaacaggg tttcgccatg ttgaccagga tggtcttgaa 1141 ctcctgacat caggtgatcc acccaccttg gcctcccaaa gtgctgggat tacagttaat 1201 tttcctgaga gaccaagacc tggtctcctc ggttcaattg gagaaggaag atacttcaaa 1261 aaagaaccaa acttgttttc tcatgatgca gttgtgagag aaatgcctac aggggtttca 1321 agtcaagcag aatcctacta tccctcacct gggcccatct caagtggatt gtcacatcat 1381 gcctcaatgg cacctctgcc ttcttcaagc tggtcatcag tggcccaccc caccccacgc 1441 tcaggcaata caaacccact gagtagtttt tcaacaagga cacttccttc taattcgcaa 1501 ggtatcccac cattcctgag aatacctgtt gggaatgatt taaatgcttc taatgcttgc 1561 atttacaaca atgccgatga catagtcgga atggaagcgt catccatgcc atcagcagat 1621 ttatatggta tttctgatcc caacatgctg tctaattgtt ctgtgaatat gatgacaacc 1681 agcagtgaca gcatgggaga gactgataat ccaagacttc tgagcatgaa tcttgaaaac 1741 ccctcatgta attcagtgtt agacccaaga gacttgagac agctccatca gatgtcctct 1801 tccagtatgt cagcaggcgc caattccaat actactgttt ttgtttcaca atcagatgca 1861 tttgagggat ctgacttcag ttgtgcagat aacagcatga taaatgagtc gggaccatca 1921 aacagtacta atccaaacag tcatggtttt gttcaagata gtcagtattc aggtattggc 1981 agtatgcaaa atgagcaatt gagtgactcc tttccatatg aattttttca agtataactt 2041 gcaagattta aatcctttta aatcttgata ccacctatat agatgcagca ttttgtattt 2101 gtctaactgg ggatataata ctatatttat actgtatata taatactgac tgagaatata 2161 atactgtatt tgagaatata aaaaactttt ttcagggaag aagcatacaa ctttggacat 2221 agcgaataca aaattggaag ctgtcataaa aagacaactc agaggccagg cgcaggngct 2281 cacacctgta atcctagcac tttgggaggc caaggcgggt ggatcacttg agaccag // LOCUS HSRNARS6K 2791 bp RNA PRI 19-OCT-1995 DEFINITION H.sapiens mRNA for ribosomal S6 kinase. ACCESSION X85106 NID g1033032 KEYWORDS ribosomal protein S6 kinase; RSK3 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2791) AUTHORS Zhao,Y., Bjorbaek,C., Weremowicz,S., Morton,C.C. and Moller,D.E. TITLE RSK3 encodes a novel pp90rsk isoform with a unique N-terminal sequence: growth factor-stimulated kinase function and nuclear translocation JOURNAL Mol. Cell. Biol. 15 (8), 4353-4363 (1995) MEDLINE 95349602 REFERENCE 2 (bases 1 to 2791) AUTHORS Zhao,Y. TITLE Direct Submission JOURNAL Submitted (02-MAR-1995) Y. Zhao, Beth Israel Hospital/Harvard Med.School, Div. of Endocrinology and Metabolism, 330 Brookline Ave., Boston, MA 02215, USA FEATURES Location/Qualifiers source 1..2791 /organism="Homo sapiens" /db_xref="taxon:9606" gene 175..2376 /gene="pp90RSK3" CDS 175..2376 /gene="pp90RSK3" /codon_start=1 /product="ribosomal S6 kinase" /db_xref="PID:g1033033" /translation="MDLSMKKFAVRRFFSVYLRRKSRSKSSSLSRLEEEGVVKEIDIS HHVKEGFEKADPSQFELLKVLGQGSYGKVFLVRKVKGSDAGQLYAMKVLKKATLKVRD RVRSKMERDILAEVNHPFIVKLHYAFQTEGKLYLILDFLRGGDLFTRLSKEVMFTEED VKFYLAELALALDHLHSLGIIYRDLKPENILLDEEGHIKITDFGLSKEAIDHDKRAYS FCGTIEYMAPEVVNRRGHTQSADWWSFGVLMFEMLTGSLPFQGKDRKETMALILKAKL GMPQFLSGEAQSLLRALFKRNPCNRLGAGIDGVEEIKRHPFFVTIDWNTLYRKEIKPP FKPALGRPEDTFHFDPEFTARTPTDSPGVPPSANAHHLFRGFSFVASSLIQEPSQQDL HKVPVHPIVQQLHGNNIHFTDGYEIKEDIGVGSYSVCKRCVHKATDTEYAVKIIDKSK RDPSEEIEILLRYGQHPNIITLKDVYDDGKFVYLVMELMRGGELLDRILRQRYFSERE ASDVLCTITKTMDYLHSQGVVHRDLKPSNILYRDESGSPESIRVCDFGFAKQLRAGNG LLMTPCYTANFVAPEVLKRQGYDAACDIWSLGILLYTMLAGFTPFANGPDDTPEEILA RIGSGKYALSGGNWDSISDAAKDVVSKMLHVDPHQRLTAMQVLKHPWVVNREYLSPNQ LSRQDVHLVKGAMAATYFALNRTPQAPRLEPVLSSNLAQRRGMKRLTSTRL" BASE COUNT 590 a 829 c 831 g 541 t ORIGIN 1 cccggcgcgg cctgcccttt gtgaccgcag ctcgcgcccc acgccccgcg cccatggccg 61 ccgtgccggg ctccctggcc acgcgtgccc gcccgcggac ctgagccccg cgcctgggat 121 gccggggatg cgcgtccccc ggccctgcgg ctgctccggg ctgggcgcgg ggcgatggac 181 ctgagcatga agaagttcgc cgtgcgcagg ttcttctctg tgtacctgcg caggaagtcg 241 cgctccaaga gctccagcct gagccggctc gaggaagaag gtgtcgtgaa ggagatagac 301 atcagccatc atgtgaagga gggctttgag aaggcagatc cttcccagtt tgagctgctg 361 aaggttttag gacaaggatc ctatggaaag gtgttcctgg tgaggaaggt gaaggggtcc 421 gacgctgggc agctctacgc catgaaggtc cttaagaaag ccaccctaaa agttcgggac 481 cgagtgagat cgaagatgga gagagacatc ttggcagaag tgaatcaccc cttcattgtg 541 aagcttcatt atgcctttca gacggaagga aagctctacc tgatcctgga cttcctgcgg 601 ggaggggacc tcttcacccg gctctccaaa gaggtcatgt tcacggagga ggatgtcaag 661 ttctacctgg ctgagctggc cttggcttta gaccatctcc acagcctggg gatcatctac 721 agagatctga agcctgagaa catcctcctg gatgaagagg ggcacattaa gatcacagat 781 ttcggcctga gtaaggaggc cattgaccac gacaagagag cgtactcctt ctgcgggacg 841 atcgagtaca tggcacccga ggtggtgaac cggcgaggac acacgcagag tgccgactgg 901 tggtccttcg gcgtgctcat gtttgagatg ctcacggggt ccctgccgtt ccaggggaag 961 gacaggaagg agaccatggc tctcatcctc aaagccaagc tggggatgcc gcagttcctc 1021 agtggggagg cacagagttt gctgcgagct ctcttcaaac ggaacccctg caaccggctg 1081 ggtgctggca ttgacggagt ggaggaaatt aagcgccatc ccttctttgt gaccatagac 1141 tggaacacgc tgtaccggaa ggagatcaag ccaccgttca aaccagcatt gggcaggcct 1201 gaggacacct tccactttga ccccgagttc acagcgcgga cgcccacaga ctctcctggc 1261 gtccccccga gtgcaaacgc tcatcacctg tttagaggat tcagctttgt ggcctcaagc 1321 ctgatccagg agccctcaca gcaagatctg cacaaagtcc cagttcaccc aatcgtgcag 1381 cagttacacg ggaacaacat ccacttcacc gatggctacg agatcaagga ggacatcggg 1441 gtgggctcct actcagtgtg caagcgatgt gtgcataaag ccacagacac cgagtatgcc 1501 gtgaagatca ttgataagag caagagagac ccctcggaag agattgagat cctcctgcgg 1561 tacggccagc acccgaacat catcaccctc aaggatgtct atgatgatgg caagtttgtg 1621 tacctggtaa tggagctgat gcgtggtggg gagctcctgg accgcatcct ccggcagaga 1681 tacttctcgg agcgcgaagc cagtgacgtc ctgtgcacca tcaccaagac catggactac 1741 ctccattccc agggggttgt tcatcgagac ctgaagccga gtaacatcct gtacagggat 1801 gagtcgggga gcccagaatc catccgagtc tgcgacttcg gctttgccaa gcagctgcgc 1861 gcggggaacg ggctgctcat gacaccctgc tacacggcca atttcgtggc cccggaggtc 1921 ctgaagcgtc aaggctatga tgcggcgtgt gacatctgga gtttggggat cctgttgtac 1981 accatgctgg caggatttac cccttttgca aatgggccag acgatacccc tgaggagatt 2041 ctggcgcgga tcggcagtgg gaagtatgcc ctttctgggg gaaactggga ctcgatatct 2101 gacgcagcta aagacgtcgt gtccaagatg ctccacgtgg accctcatca gcgcctgacg 2161 gcgatgcaag tgctcaaaca cccgtgggtg gtcaacagag agtacctgtc cccaaaccag 2221 ctcagccgac aggacgtgca cctggtgaag ggcgcgatgg ccgccaccta ctttgctcta 2281 aacagaacac ctcaggcccc gcggctggag cccgtgctgt catccaacct ggctcagcgc 2341 agaggcatga agagactcac gtccacgcgg ctgtagcggg tgggaccctg gccccagcgt 2401 cccctgccag catcctcgtg ggctcacaga ccccggcctc ggagcccgtc tggcacccag 2461 agtgaccaca agtccagcag ggaggcggcg ccgcctcgcc gtgtccgtgt tttctttttc 2521 agccccggag aggtcctgac ctgggggctt ctccaagcct cactgcgcca cgctccccgc 2581 ccgctctctt ttctcccaag cgaaaccaaa tgcgcccctt cacctcgcgt gcccgtgcga 2641 ggccgggggc ttctttcaga gcccgcgggt cctctcatac atggcttctg tgtctgccga 2701 gagatctgtt ttccaattat gaagccggtc ggtttggtca gactcccgac acccacgtcc 2761 aggtacccgg tggaaagtgg cagtgcgagg g // LOCUS HSRNASIS 593 bp RNA PRI 07-SEP-1995 DEFINITION H.sapiens mRNA for c-sis proto-oncogene. ACCESSION X83705 NID g951023 KEYWORDS choriocarcinoma; sis oncogene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 593) AUTHORS Dirks,R.P., Onnekink,C., Jansen,H.J., de Jong,A. and Bloemers,H.P. TITLE A novel human c-sis mRNA species is transcribed from a promoter in c-sis intron 1 and contains the code for an alternative PDGF B-like protein JOURNAL Nucleic Acids Res. 23 (15), 2815-2822 (1995) MEDLINE 95388493 REFERENCE 2 (bases 1 to 593) AUTHORS Dirks,R.P.H. TITLE Direct Submission JOURNAL Submitted (05-JAN-1995) R.P.H. Dirks, Department of Molecular Biology, Faculty of Science, University of Nijmegen, Toernooiveld 1, 6525 ED Nijmegen, NETHERLANDS FEATURES Location/Qualifiers source 1..593 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="choriocarcinoma" /cell_line="JEG-3" /clone="pCORD1" mRNA 1..593 /gene="c-sis proto-oncogene" /note="alternative promoter located within intron-1 predicts two CDSs" gene 1..593 /gene="c-sis proto-oncogene" exon 1..55 /gene="c-sis proto-oncogene" /note="alternative exon-1a" gene 30..92 /gene="orf 1" CDS 30..92 /gene="orf 1" /codon_start=1 /db_xref="PID:g951024" /translation="MVRCLSWASGTPFPRSFMRC" gene 38..593 /gene="orf 2" CDS 38..>593 /gene="orf 2" /codon_start=1 /db_xref="PID:g951025" /translation="MFIMGLGDPIPEELYEMLSDHSIRSFDDLQRLLHGDPGEEDGAE LDLNMTRSHSGGELESLARGRRSLGSLTIAEPAMIAECKTRTEVFEISRRLIDRTNAN FLVWPPCVEVQRCSGCCNNRNVQCRPTQVQLRPVQVRKIEIVRKKPIFKKATVTLEDH LACKCETVAAARPVTRSPGGSQEQR" BASE COUNT 129 a 168 c 195 g 101 t ORIGIN 1 agagagagag agagactgac tgagcaggaa tggtgagatg tttatcatgg gcctcgggga 61 ccccattccc gaggagcttt atgagatgct gagtgaccac tcgatccgct cctttgatga 121 tctccaacgc ctgctgcacg gagaccccgg agaggaagat ggggccgagt tggacctgaa 181 catgacccgc tcccactctg gaggcgagct ggagagcttg gctcgtggaa gaaggagcct 241 gggttccctg accattgctg agccggccat gatcgccgag tgcaagacgc gcaccgaggt 301 gttcgagatc tcccggcgcc tcatagaccg caccaacgcc aacttcctgg tgtggccgcc 361 ctgtgtggag gtgcagcgct gctccggctg ctgcaacaac cgcaacgtgc agtgccgccc 421 cacccaggtg cagctgcgac ctgtccaggt gagaaagatc gagattgtgc ggaagaagcc 481 aatctttaag aaggccacgg tgacgctgga agaccacctg gcatgcaagt gtgagacagt 541 ggcagctgca cggcctgtga cccgaagccc ggggggttcc caggagcagc gag // LOCUS HSRNASMF 462 bp RNA PRI 11-MAY-1995 DEFINITION H.sapiens mRNA for Sm protein F. ACCESSION X85372 NID g806563 KEYWORDS pBSCF gene; Sm protein F. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 462) AUTHORS Hermann,H., Fabrizio,P., Raker,V.A., Foulaki,K., Hornig,H., Brahms,H. and Luhrmann,R. TITLE snRNP Sm proteins share two evolutionarily conserved sequence motifs which are involved in Sm protein-protein interactions JOURNAL EMBO J. 14 (9), 2076-2088 (1995) MEDLINE 95262647 REFERENCE 2 (bases 1 to 462) AUTHORS Raker,V.A. TITLE Direct Submission JOURNAL Submitted (15-MAR-1995) V.A. Raker, Institut fuer Molekularbiologie & Tumorforschung, Emil-Mannkopff-Str. 2, 35037 Marburg, FRG FEATURES Location/Qualifiers source 1..462 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Hela S3" /clone_lib="lambda gt10" gene 116..376 /gene="pBSCF" CDS 116..376 /gene="pBSCF" /codon_start=1 /evidence=experimental /product="Sm protein F" /db_xref="PID:g806564" /translation="MSLPLNPKPFLNGLTGKPVMVKLKWGMEYKGYLVSVDGYMNMQL ANTEEYIDGALSGHLGEVLIRCNNVLYIRGVEEEEEDGEMRE" BASE COUNT 140 a 74 c 122 g 126 t ORIGIN 1 tctggccatt tctcttgaaa ctgcggctcg ggacctgcgg tacctgctgt agtcacgagg 61 gacgggcggc ggctggtcgg cagagagtag cctgcaacat tcggccgtgg tttacatgag 121 tttacccctc aatcccaaac ctttcctcaa tggactaaca ggaaagccag tgatggtgaa 181 acttaagtgg ggaatggagt acaagggcta tctggtatct gtagatggct acatgaacat 241 gcagcttgca aatacagaag aatacataga tggagctttg tctggacatc tgggtgaagt 301 tttaataagg tgtaataatg tcctttatat cagaggtgtg gaagaagagg aagaagatgg 361 ggaaatgaga gaatagcatc ttttgtgggg gatttttttt atatatattt ctagacaata 421 aagatttgtt tgtttttcaa cttgaaaaaa aaaaaaaaaa aa // LOCUS HSRNASMG 455 bp RNA PRI 11-MAY-1995 DEFINITION H.sapiens mRNA for Sm protein G. ACCESSION X85373 NID g806565 KEYWORDS pBSCF gene; Sm protein G. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 455) AUTHORS Hermann,H., Fabrizio,P., Raker,V.A., Foulaki,K., Hornig,H., Brahms,H. and Luhrmann,R. TITLE snRNP Sm proteins share two evolutionarily conserved sequence motifs which are involved in Sm protein-protein interactions JOURNAL EMBO J. 14 (9), 2076-2088 (1995) MEDLINE 95262647 REFERENCE 2 (bases 1 to 455) AUTHORS Raker,V.A. TITLE Direct Submission JOURNAL Submitted (15-MAR-1995) V.A. Raker, Institut fuer Molekularbiologie & Tumorforschung, Emil-Mannkopff-Str. 2, 35037 Marburg, FRG FEATURES Location/Qualifiers source 1..455 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Hela" /clone_lib="lambda gt10" gene 84..314 /gene="pBSCF" CDS 84..314 /gene="pBSCF" /codon_start=1 /evidence=experimental /product="Sm protein G" /db_xref="PID:g806566" /translation="MSKAHPPELKKFMDKKLSLKLNGGRHVQGILRGFDPFMNLVIDE CVEMATSGQQNNIGMVVIRGNSIIMLEALERV" BASE COUNT 152 a 76 c 108 g 119 t ORIGIN 1 tagacgccgg gcctacagcg ggaggctgag gaaagccgtg cgttgcgttc caaggacatc 61 tgtgagcccg cggagtatac accatgagca aagctcaccc tcccgagttg aaaaaattta 121 tggacaagaa gttatcattg aaattaaatg gtggcagaca tgtccaagga atattgcggg 181 gatttgatcc ctttatgaac cttgtgatag atgaatgtgt ggagatggcg actagtggac 241 aacagaacaa tattggaatg gtggtaatac gaggaaatag tatcatcatg ttagaagcct 301 tggaacgagt ataaataatg gctgttcagc agagaaaccc atgtcctctc tccatagggc 361 ctgtttacta tgatgtaaaa attaggtcat gtacattttc atattagact ttttgttaaa 421 taaacttttg taatagtcaa aaaaaaaaaa aaaaa // LOCUS HSRNASPIB 1444 bp RNA PRI 09-OCT-1996 DEFINITION H.sapiens mRNA for Spi-B transcription factor. ACCESSION X96998 NID g1403052 KEYWORDS Spi-B gene; transcription factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1444) AUTHORS Ray-Gallet,D., Tavitian,A. and Moreau-Gachelin,F. TITLE An alternatively spliced isoform of the Spi-B transcription factor JOURNAL Biochem. Biophys. Res. Commun. 223 (2), 257-263 (1996) MEDLINE 96264628 REFERENCE 2 (bases 1 to 1444) AUTHORS Ray-Gallet,D. TITLE Direct Submission JOURNAL Submitted (29-MAR-1996) D. Ray-Gallet, INSERM U.248, Institut Curie Section Recherche, 26 rue dUlm 75231 Paris cedex 05, 75231 Paris cedex 05, France FEATURES Location/Qualifiers source 1..1444 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Raji lymphoma" /chromosome="19" /map="q13.3-q13.4" CDS 6..539 /note="alternatively spliced isoform" /codon_start=1 /product="Spi-B transcription factor" /db_xref="PID:e241800" /db_xref="PID:g1403053" /translation="MLALEAAQLDGPHFSCLYPDGVFYDLDSCKHSSYPDSEGAPDSL WDWTVAPPVPATPYEAFDPAAAAFSHPQAAQLCYEPPTYSPAGNLELAPSLEAPGPGL PAYPTENFASQTLVPPAYAPYPSPVLSEEEDLPLDSPALEVSDSESDEALVAGPEGKG SEGLARSCACTSSCWGY" BASE COUNT 257 a 483 c 400 g 304 t ORIGIN 1 ccaccatgct cgccctggag gctgcacagc tcgacgggcc acacttcagc tgtctgtacc 61 cagatggcgt cttctatgac ctggacagct gcaagcattc cagctaccct gattcagagg 121 gggctcctga ctccctgtgg gactggactg tggccccacc tgtcccagcc accccctatg 181 aagccttcga cccggcagca gccgctttta gccaccccca ggctgcccag ctctgctacg 241 aaccccccac ctacagccct gcagggaacc tcgaactggc ccccagcctg gaggccccgg 301 ggcctggcct ccccgcatac cccacggaga acttcgctag ccagaccctg gttcccccgg 361 catatgcccc gtaccccagc cctgtgctat cagaggagga agacttaccg ttggacagcc 421 ctgccctgga ggtctcggac agcgagtcgg atgaggccct cgtggctggc cccgagggga 481 agggatccga gggactcgca agaagctgcg cctgtaccag ttcctgctgg ggctactgac 541 gcgcggggac atgcgtgagt gcgtgtggtg ggtggagcca ggcgccggcg tcttccagtt 601 ctcctccaag cacaaggaac tcctggcgcg ccgctggggc cagcagaagg ggaaccgcaa 661 gcgcatgacc taccagaagc tggcgcgcgc cctccgaaac tacgccaaga ccggcgagat 721 ccgcaaggtc aagcgcaagc tcacctacca gttcgacagc gcgctgctgc ctgcagtccg 781 ccgggcctga gcacacccga ggctcccacc tgcggagccg ctgggggacc tcacgtccca 841 gccaggatcc ccctggaaga aaaagggcgt ccccacactc taggtgatag gacttacgca 901 tccccacctt ttggggtaag gggagtgctg ccctgccata atccccaagc ccagcccggg 961 cctgtctggg attccccact tgtgcctggg gtccctctgg gatttctttg tcatgtacag 1021 actccctggg atcctcatgt tttgggtgac aggacctatg gaccactata ctcggggagg 1081 cagggtagca gtgcttccag agtcccaaga gcttctctgg gattttcttg tgatatctga 1141 ttccccagtg aggcctggga cctttttaag atcgctgtgt gtctgtaaac cctgaatctc 1201 atctggggtg ggggccctgc tggcaaccct gagccctgtc caaggttccc tcttgtcaga 1261 tctgagattt cctagttatg tctggggccc tctgggagct gttatcatct cagatctctt 1321 cgcccatcta tggctgtgtt gtcacatctg tcccctcatt tttgagatcc cccaattctc 1381 tggaactatt ctgctgcccc tttttatgtg tctggagttc cccaatcaca tctagggctc 1441 ctcc // LOCUS HSRNASYNT 840 bp RNA PRI 31-DEC-1995 DEFINITION H.sapiens mRNA for syntaxin. ACCESSION X90581 NID g1143493 KEYWORDS syntaxin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 840) AUTHORS Le Bivic,A. TITLE Direct Submission JOURNAL Submitted (03-AUG-1995) A. Le Bivic, CNRS, lgpd, campus de Luminy case 907, Marseille cedex 9, 13288, FRANCE REFERENCE 2 (bases 1 to 840) AUTHORS Delgrossi,M.H. and Le Bivic,A. JOURNAL Unpublished FEATURES Location/Qualifiers source 1..840 /organism="Homo sapiens" /sub_species="caucasian" /db_xref="taxon:9606" /clone="hsyn3-c12" /clone_lib="5'-strech cDNA, Clontech" /dev_stage="adult" /sex="female" /tissue_type="duodenum" CDS 1..840 /codon_start=1 /product="syntaxin" /db_xref="PID:e194870" /db_xref="PID:g1143494" /translation="MQLTQDDDTDAVEIAIDNTAFMDEFFSEIEETRLNIDKISEHVE EAKKLYSIILSAPIPEPKTKDDLEQLSTEIKKRANNVRNKLKSMEKHIEEDEVRSSAD LRIRKSQHSVLSRKFVEVMTKYNEAQVDFRERSKGRIQRQLEITGKKTTDEELEEMLE SGNPAIFTSGIIDSQISKQALSEIEGRHKDIVRLESSIKELHDMFMDIAMLVENQGEM LDNIELNVMHTVDHVEKARDESKKAVKYQSQARKKLIIIIVLVVVLLGILALIIGLSV GLN" BASE COUNT 262 a 166 c 226 g 186 t ORIGIN 1 atgcaactga cacaggatga tgatactgat gcggttgaga ttgctatcga caacacggct 61 tttatggacg agttcttttc tgagattgag gaaactcggc ttaacattga caagatctca 121 gaacatgtag aggaggctaa gaaactctac agtatcattc tctctgcacc gattccagag 181 ccaaaaacca aggatgacct agagcagctc agcactgaga ttaagaaaag ggccaacaac 241 gtccggaaca aactgaagag catggagaag catattgaag aagatgaggt caggtcatcg 301 gcagaccttc ggattcggaa atcccagcac tctgtccttt ctcggaagtt tgtggaggtg 361 atgaccaaat acaatgaagc tcaagtggac ttccgagaac gcagcaaagg gcgaatccag 421 cggcagctcg aaattactgg caaaaagaca accgatgagg agctggagga gatgttggag 481 agtggcaacc cggccatctt cacttctggg atcattgact cacagatttc caagcaagcc 541 ctcagtgaga ttgagggacg acacaaggac attgtgaggc tggagagcag catcaaggag 601 cttcacgaca tgtttatgga catcgccatg ctggtggaga atcagggtga gatgttagat 661 aacatagagt tgaatgtcat gcacacagtg gaccacgtgg agaaggcacg agatgaaagc 721 aaaaaagctg tgaaatacca gagtcaggcc cggaagaaat tgataattat cattgtgcta 781 gtagttgtgt tgctgggcat tttagcattg attattggac tttccgttgg cctgaattaa // LOCUS HSRNATAF 1883 bp RNA PRI 15-APR-1996 DEFINITION H.sapiens mRNA for tafazzins protein. ACCESSION X92762 NID g1263131 KEYWORDS G4.5 gene; tafazzins. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1883) AUTHORS Bione,S., D'Adamo,P., Maestrini,E., Gedeon,A.K., Bolhuis,P.A. and Toniolo,D. TITLE A novel X-linked gene, G4.5. is responsible for Barth syndrome JOURNAL Nature Genet. 12 (4), 385-389 (1996) MEDLINE 96224398 REFERENCE 2 (bases 1 to 1883) AUTHORS Toniolo,D. TITLE Direct Submission JOURNAL Submitted (03-NOV-1995) D. Toniolo, Instituto di Genetica Biochimica ed, Evoluzionistica CNR, Via Abbiategrasso 207, 27100 Pavia, ITALY FEATURES Location/Qualifiers source 1..1883 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /map="q28" gene 289..1167 /gene="G4.5" CDS 289..1167 /gene="G4.5" /note="responsible for Barth syndrome" /codon_start=1 /product="tafazzins" /db_xref="PID:e209046" /db_xref="PID:g1263132" /translation="MPLHVKWPFPAVPPLTWTLASSVVMGLVGTYSCFWTKYMNHLTV HNREVLYELIEKRGPATPLITVSNHQSCMDDPHLWGILKLRHIWNLKLMRWTPAAADI CFTKELHSHFFSLGKCVPVCRGAEFFQAENEGKGVLDTGRHMPGAGKRREKGDGVYQK GMDFILEKLNHGDWVHIFPEGKVNMSSEFLRFKWGIGRLIAECHLNPIILPLWHVGMN DVLPNSPPYFPRFGQKITVLIGKPFSALPVLERLRAENKSAVEMRKALTDFIQEEFQH LKTQAEQLHNHLQPGR" polyA_signal 1860..1864 BASE COUNT 365 a 584 c 555 g 379 t ORIGIN 1 ccgggccggg gtgccagcgc ccgccttccc gtttcctccc gttccgcagc gcgcccacgg 61 cctgtgaccc cggcgaccgc tccccagtga cgagagagcg gggccgggcg ctgctccggc 121 ctgacctgcg aagggacctc ggtccagtcc cctgttgcgc cgcgcccccg tccgtccgtg 181 cgcgggccag tcaggggcca gtgtctcgag cggtcgaggt cgcagaccta gaggcgcccc 241 acaggccggc ccggggcgct gggagcgccg gccgcgggcc gggtggggat gcctctgcac 301 gtgaagtggc cgttccccgc ggtgccgccg ctcacctgga ccctggccag cagcgtcgtc 361 atgggcttgg tgggcaccta cagctgcttc tggaccaagt acatgaacca cctgaccgtg 421 cacaacaggg aggtgctgta cgagctcatc gagaagcgag gcccggccac gcccctcatc 481 accgtgtcca atcaccagtc ctgcatggac gaccctcatc tctgggggat cctgaaactc 541 cgccacatct ggaacctgaa gttgatgcgt tggacccctg cagctgcaga catctgcttc 601 accaaggagc tacactccca cttcttcagc ttgggcaagt gtgtgcctgt gtgccgagga 661 gcagaatttt tccaagcaga gaatgagggg aaaggtgttc tagacacagg caggcacatg 721 ccaggtgctg gaaaaagaag agagaaagga gatggcgtct accagaaggg gatggacttc 781 attttggaga agctcaacca tggggactgg gtgcatatct tcccagaagg gaaagtgaac 841 atgagttccg aattcctgcg tttcaagtgg ggaatcgggc gcctgattgc tgagtgtcat 901 ctcaacccca tcatcctgcc cctgtggcat gtcggaatga atgacgtcct tcctaacagt 961 ccgccctact tcccccgctt tggacagaaa atcactgtgc tgatcgggaa gcccttcagt 1021 gccctgcctg tactcgagcg gctccgggcg gagaacaagt cggctgtgga gatgcggaaa 1081 gccctgacgg acttcattca agaggaattc cagcatctga agactcaggc agagcagctc 1141 cacaaccacc tccagcctgg gagataggcc ttgcttgctg ccttctggat tcttggcccg 1201 cacagagctg gggctgaggg atggactgat gcttttagct caaacgtggc ttttagacag 1261 atttgttcat agaccctctc aagtgccctc tccgagctgg taggcattcc agctcctccg 1321 tgcttcctca gttacacaaa ggacctcagc tgcttctccc acttggccaa gcagggagga 1381 agaagcttag gcagggctct ctttccttct tgccttcaga tgttctctcc caggggctgg 1441 cttcaggagg gagcatagaa ggcaggtgag caaccagttg gctaggggag cagggggccc 1501 accagagctg tggagagggg accctaagac tcctcggcct ggctcctacc caccgccctt 1561 gccgaaccag gagctgctca ctacctcctc agggatggcc gttggccacg tcttccttct 1621 gcctgagctt cccccccacc acaggccctt tcctcaggca aggtctggcc tcaggtgggc 1681 cgcaggcggg aaaagcagcc cttggccaga agtcaagccc agccacgtgg agcctagagt 1741 gagggcctga ggtctggctg cttgccccca tgctggcgcc aacaacttct ccatcctttc 1801 tgcctctcaa catcacttga atcctagggc ctgggttttc atgtttttga aacagaacca 1861 taaagcatat gtgttggctt gtt // LOCUS HSRNATFG 1677 bp RNA PRI 13-MAY-1997 DEFINITION H.sapiens mRNA for TFG protein. ACCESSION Y07968 NID g1552327 KEYWORDS TFG gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1677) AUTHORS Mencinger,M., Panagopoulos,I., Andreasson,P., Lassen,C., Mitelman,F. and Aman,P. TITLE Characterization and chromosomal mapping of the human TFG gene involved in thyroid carcinoma JOURNAL Genomics 41 (3), 327-331 (1997) MEDLINE 97312688 REFERENCE 2 (bases 1 to 1677) AUTHORS Mencinger,M. TITLE Direct Submission JOURNAL Submitted (11-SEP-1996) M. Mencinger, University Hospital Of Lund, Department Of Clinical Genetics, Lund, S-221 85, SWEDEN FEATURES Location/Qualifiers source 1..1677 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="pancreas" /chromosome="3" /map="q11-12" gene 19..1221 /gene="TFG" CDS 19..1221 /gene="TFG" /codon_start=1 /db_xref="PID:e266646" /db_xref="PID:g1552328" /translation="MNGQLDLSGKLIVKAQLGEDIRRIPIHNEDITYDELVLMMQRVF RGKLLSNDEVTIKYKDEDGDLITIFDSSDLSFAIQCSRILKLTLFVNGQPRPLESSQV KYLRRELIELRNKVNRLLDSLEPPGEPGPSTNIPENDTVDGREEKSASDSSGKQSTQV MAASMSAFDPLKNQDEINKNVMSAFGLTDDQVSGPPSAPAEDRSGTPDSIASSSSAAH PPGVQPQQPPYTGAQTQAGQIEGQMYQQYQQQAGYGAQQPQAPPQQPQQYGIQYSASY SQQTGPQQPQQFQGYGQQPTSQAPAPAFSGQPQQLPAQPPQQYQASNYPAQTYTAQTS QPTNYTVAPASQPGMAPSQPGAYQPRPGFTSLPGSTMTPPPSGPNPYARNRPPFGQGY TQPGPGYR" polyA_signal 1640..1645 polyA_site 1658 BASE COUNT 513 a 380 c 338 g 446 t ORIGIN 1 aacatcctgg agtccaccat gaacggacag ttggatctaa gtgggaagct aatcgtcaaa 61 gctcaacttg gggaggatat tcggcgaatt cctattcata atgaagatat tacttatgat 121 gaattagtgc taatgatgca acgagttttc agaggaaaac ttctgagtaa tgatgaagta 181 acaataaagt ataaagatga agatggagat cttataacaa tttttgatag ttctgacctt 241 tcctttgcaa ttcagtgcag taggatactg aaactgacat tatttgttaa tggccagcca 301 agaccccttg aatcaagtca ggtgaaatat ctccgtcgag aactgataga acttcgaaat 361 aaagtgaatc gtttattgga tagcttggaa ccacctggag aaccaggacc ttccaccaat 421 attcctgaaa atgatactgt ggatggtagg gaagaaaagt ctgcttctga ttcttctgga 481 aaacagtcta ctcaggttat ggcagcaagt atgtctgctt ttgatccttt aaaaaaccaa 541 gatgaaatca ataaaaatgt tatgtcagcg tttggcttaa cagatgatca ggtttcaggg 601 ccacccagtg ctcctgcaga agatcgttca ggaacacccg acagcattgc ttcctcctcc 661 tcagcagctc acccaccagg cgttcagcca cagcagccac catatacagg agctcagact 721 caagcaggtc agattgaagg tcagatgtac caacagtacc agcaacaggc cggctatggt 781 gcacagcagc cgcaggctcc acctcagcag cctcaacagt atggtattca gtattcagca 841 agctatagtc agcagactgg acctcaacaa cctcagcagt tccagggata tggccagcaa 901 ccaacttccc aggcaccagc tcctgccttt tctggtcagc ctcaacaact gcctgctcag 961 ccgccacagc agtaccaggc gagcaattat cctgcacaaa cttacactgc ccaaacttct 1021 cagcctacta attatactgt ggctcctgcc tctcaacctg gaatggctcc aagccaacct 1081 ggggcctatc aaccaagacc aggttttact tcacttcctg gaagtaccat gacccctcct 1141 ccaagtgggc ctaatcctta tgcgcgtaac cgtcctccct ttggtcaggg ctatacccaa 1201 cctggacctg gttatcgata aggaggctcc tctacaccaa ttaatgttag ctgctagcta 1261 ttggcctccc aaaagactcc agtactattt taatttgtat tgaagaagtt cagaaattta 1321 aaagcagagc attttttatg atatcattgt tggtgttaat tggaaagtat aatttgctgg 1381 ggaacacaaa gaccaaaatg gaaagttttt tcctccctgc ttaaaaatgt agcagcttct 1441 tagttacttt ggaacactac tcttacatgt ataaagtgat tgacttgact ttctagcttc 1501 ccttgtccgg aggatattaa aatgctaggg tgaggtttag ccatcttact tggcttttta 1561 ctattaacat gatgtactaa agtagagccc tttgagaata caagatatta tgtataaaat 1621 gtaacactga tgataggtta ataaagatga ttgaatccaa aaaaaaaaaa aaaaaaa // LOCUS HSRNATFII 1272 bp RNA PRI 30-DEC-1993 DEFINITION H.sapiens mRNA for TFIIA-alpha. ACCESSION X75383 NID g433499 KEYWORDS TFIIA gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1272) AUTHORS Ma,D., Watanabe,H., Mermelstein,F., Admon,A., Oguri,K., Sun,X., Wada,T., Imai,T., Shiroya,T., Reinberg,D. and Handa,H. TITLE Isolation of a cDNA encoding the largest subunit of TFIIA reveals functions important for activated transcription JOURNAL Genes Dev. 7 (11), 2246-2257 (1993) MEDLINE 94040744 REFERENCE 2 (bases 1 to 1272) AUTHORS Reinberg,D. TITLE Direct Submission JOURNAL Submitted (08-OCT-1993) D. Reinberg, UMDNJ Robert Wood Johnson Medical School, Dept of Biochemistry, 663 Hoes Lane, Piscataway, New Jersey 08854-5635, USA FEATURES Location/Qualifiers source 1..1272 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /dev_stage="rearranged" /clone_lib="human cDNA library" gene 142..1272 /gene="TFIIA-alpha" CDS 142..1272 /gene="TFIIA-alpha" /note="major translational start codon" /codon_start=1 /product="TFIIA" /db_xref="PID:g433500" /translation="MANSANTNTVPKLYRSVIEDVINDVRDIFLDDGVDEQVLMELKT LWENKLMQSRAVDGFHSEEQQLLLQVQQQHQPQQQQHHHHHHHQQAQPQQTVPQQAQT QQVLIPASQQATAPQVIVPDSKLIQHMNASNMSAAATAATLALPAGVTPVQQILTNSG QLLQVVRAANGAQYIFQPQQSVVLQQQVIPQMQPGGVQAPVIQQVLAPLPGGISPQTG VIIQPQQILFTGNKTQVIPTTVAAPTPAQAQITATGQQQPQAQPAQTQAPLVLQVDGT GDTSSEEDEDEEEDYDDDEEEDKEKDGAEDGQVEEEPLNSEDDVSDEEGQELFDTENV VVCQYDKIHRSKNKWKFHLKDGIMNLNGRDYIFSKAIGDAEW" CDS 259..1272 /gene="TFIIA-alpha" /note="minor translational start codon" /codon_start=1 /product="TFIIA" /db_xref="PID:g433501" /translation="MELKTLWENKLMQSRAVDGFHSEEQQLLLQVQQQHQPQQQQHHH HHHHQQAQPQQTVPQQAQTQQVLIPASQQATAPQVIVPDSKLIQHMNASNMSAAATAA TLALPAGVTPVQQILTNSGQLLQVVRAANGAQYIFQPQQSVVLQQQVIPQMQPGGVQA PVIQQVLAPLPGGISPQTGVIIQPQQILFTGNKTQVIPTTVAAPTPAQAQITATGQQQ PQAQPAQTQAPLVLQVDGTGDTSSEEDEDEEEDYDDDEEEDKEKDGAEDGQVEEEPLN SEDDVSDEEGQELFDTENVVVCQYDKIHRSKNKWKFHLKDGIMNLNGRDYIFSKAIGD AEW" BASE COUNT 389 a 270 c 290 g 323 t ORIGIN 1 gattgatttt gtttaaaatt ttttccccaa tcttgcggtg atttgggtca ccctccgggt 61 gttaatagtt tttttttttt tggttttgtt tttatcttgt tttcttgggg ttgccccctc 121 ttgtttgtgt tgtgtgtgga aatggcgaac tcggcaaata caaacaccgt gcctaaatta 181 tacagatctg tgattgaaga tgtcattaat gatgtgagag acatctttct ggatgatgga 241 gtggatgaac aagtactgat ggaactaaaa actttatggg aaaacaaact aatgcagtcc 301 agggcagtag atggatttca ttcagaagag cagcagcttc tactgcaagt tcaacagcag 361 catcaacccc agcagcagca gcatcaccac catcaccatc atcagcaagc tcagcctcag 421 cagacagtac ctcagcaagc gcagacccag caggttctta ttcctgcatc acagcaagcc 481 acagcaccac aagttattgt tccagattct aagttgatac agcatatgaa tgcatcaaac 541 atgagtgctg ctgctacagc tgctacctta gcactccctg caggtgtgac tcctgttcag 601 cagatattaa caaattcagg ccagcttctt caggtggtca gagcagccaa tggtgcccaa 661 tatatctttc agcctcagca gtcagtggtt ctacaacaac aggttatacc acaaatgcag 721 cctggtggag tacaagctcc tgttatacag caggtgctgg ctcctcttcc tggagggatt 781 tcaccacaga caggtgtcat catccagcct cagcaaatct tatttacagg aaataagact 841 caagttatac ctacgacagt ggcagcacct acaccagccc aagcacagat aactgcaact 901 ggccagcagc aaccgcaggc ccagcctgct caaacacaag ctccattggt cttacaagtt 961 gatggaactg gggatacatc atctgaagaa gatgaagatg aagaagaaga ctatgatgat 1021 gatgaggagg aagacaaaga gaaagatgga gctgaagatg ggcaggtgga agaagagccc 1081 ctcaatagtg aagatgatgt gagtgatgag gaaggacagg aactctttga cacagaaaat 1141 gttgttgtat gccaatatga taagatacac agaagtaaaa acaaatggaa atttcatctc 1201 aaggatggca ttatgaatct taatggaaga gattatatat tttccaaagc cattggagat 1261 gcagaatggt ga // LOCUS HSRNATRAP 621 bp RNA PRI 22-NOV-1995 DEFINITION H.sapiens mRNA for rat translocon-associated protein delta homolog. ACCESSION X90583 NID g1071680 KEYWORDS proopiomelanocortin; translocon-associated protein; TRAP like. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 621) AUTHORS Holthuis,J.C.M. TITLE Direct Submission JOURNAL Submitted (01-JUN-1995) J.C.M. Holthuis, KUN, Dept. Animal Physiology, Univ.of Nijmegen, Toernooiveld, 6525 ED Nijmegen, NETHERLANDS REFERENCE 2 (bases 1 to 621) AUTHORS Holthuis,J.C., van Riel,M.C. and Martens,G.J. TITLE Translocon-associated protein TRAP delta and a novel TRAP-like protein are coordinately expressed with pro-opiomelanocortin in Xenopus intermediate pituitary JOURNAL Biochem. J. 312 (Pt 1), 205-213 (1995) MEDLINE 96077146 FEATURES Location/Qualifiers source 1..621 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /dev_stage="fetal" /clone_lib="uniZAP XR" /clone="HO286" CDS 31..552 /note="unnamed protein product" /codon_start=1 /db_xref="PID:g1071681" /translation="MAAMASLGALALLLLSSLSRCSAEACLEPQITPSYYTTSDAVIS TETVFIVEISLTCKNRVQNMALYADVGGKQFPVTRGQDVGRYQVSWSLDHKSAHAGTY EVRFFDEESYSLLRKAQRNNEDISIIPPLFTVSVDHRGTWNGPWVSTEVLAAAIGLVI YYLAFSAKSHIQA" mat_peptide 100..549 /product="translocon-associated protein TRAPdelta" BASE COUNT 138 a 188 c 165 g 130 t ORIGIN 1 gaattccttc ctctaggcag agaagaggcg atggcggcga tggcatctct cggcgccctg 61 gcgctgctcc tgctgtccag cctctcccgc tgctcagccg aggcctgcct ggagccccag 121 atcacccctt cctactacac cacttctgac gctgtcattt ccactgagac cgtcttcatt 181 gtggagatct ccctgacatg caagaacagg gtccagaaca tggctctcta tgctgacgtc 241 ggtggaaaac aattccctgt cactcgaggc caggatgtgg ggcgttatca ggtgtcctgg 301 agcctggacc acaagagcgc ccacgcaggc acctatgagg ttagattctt cgacgaggag 361 tcctacagcc tcctcaggaa ggctcagagg aataacgagg acatttccat catcccgcct 421 ctgtttacag tcagcgtgga ccatcggggc acttggaacg ggccctgggt gtccactgag 481 gtgctggctg cggcgatcgg ccttgtgatc tactacttgg ccttcagtgc gaagagccac 541 atccaggcct gaggcggcac ccagcctgcc cttgattcct tcaataaaca tcacaggacc 601 tgggaaaaaa aaaaaaaaaa a // LOCUS HSRNATTFI 2847 bp RNA PRI 10-AUG-1995 DEFINITION H.sapiens mRNA for TTF-I. ACCESSION X83973 NID g639692 KEYWORDS transcription termination factor; TTF-I gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2847) AUTHORS Evers,R. and Grummt,I. TITLE Molecular coevolution of mammalian ribosomal gene terminator sequences and the transcription termination factor TTF-I JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (13), 5827-5831 (1995) MEDLINE 95320168 REFERENCE 2 (bases 1 to 2847) AUTHORS Evers,R. TITLE Direct Submission JOURNAL Submitted (17-JAN-1995) R. Evers, German Cancer Research Center, Im Neuenheimer Feld 280, 69120 Heidelberg, FRG FEATURES Location/Qualifiers source 1..2847 /organism="Homo sapiens" /strain="Hela" /db_xref="taxon:9606" gene 45..2705 /gene="TTF-I" CDS 45..2705 /gene="TTF-I" /function="transcriptional termination of RNA-Pol I" /codon_start=1 /product="transcription factor" /db_xref="PID:g639693" /translation="MEGESSRFEIHTPVSDKKKKKCSIHKERPQKHSHEIFRDSSLVN EQSQITRRKKRKKDFQHLISSPLKKSRICDETANATSTLKKRKKRRYSALEVDEEAGV TVVLVDKENINNTPKHFRKDVDVVCVDMSIEQKLPRKPKTDKFQVLAKSHAHKSEALH SKVREKKNKKHQRKAASWESQRARDTLPQSESHQEESWLSVGPGGEITELPASAHKNK SKKKKKKSSNREYETLAMPEGSQAGREAGTDMQESQPTVGLDDETPQLLGPTHKKKSK KKKKKKSNHQEFEALAMPEGSQVGSEVGADMQESRPAVGLHGETAGIPAPAYKNKSKK KKKKSNHQEFEAVAMPESLESAYPEGSQVGSEVGTVEGSTALKGFKESNSTKKKSKKR KLTSVKRARVSGDDFSVPSKNSESTLFDSVEGDGAMMEEGVKSRPRQKKTQACLASKH VQEAPRLEPANEEHNVETAEDSEIRYLSADSGDADDSDADLGSAVKQLQEFIPNIKDR ATSTIKRMYRDDLERFKEFKAQGVAIKFGKFSVKENKQLEKNVEDFLALTGIESADKL LYTDRYPEEKSVITNLKRRYSFRLHIGRNIARPWKLIYYRAKKMFDVNNYKGRYSEGD TEKLKMYHSLLGNDWKTIGEMVARRSLSVALKFSQISSQRNRGAWSKSETRKLIKAVE EVILKKMSPQELKEVDSKLQENPESCLSIVREKLYKGISWVEVEAKVQTRNWMQCKSK WTEILTKRMTNGRRIYYGMNALRAKVSLIERLYEINVEDTNEIDWEDLASAIGDVPPS YVQTKFSRLKAVYVPFWQKKTFPEIIDYLYETTLPLLKEKLEKMMEKKGTKIQTPAAP KQVFPFRDIFYYEDDSEGGGHRKRKRRGIP" BASE COUNT 995 a 542 c 710 g 600 t ORIGIN 1 ttgggggttg ggagaaaggt ggcggtgctt tcggagggaa taaaatggaa ggagaatcaa 61 gcagatttga aatccacact ccagtttctg acaagaaaaa gaaaaagtgt tctatacata 121 aggaaagacc tcagaaacat tcccacgaaa ttttcagaga ctcctccctg gtgaatgaac 181 agtctcaaat aactaggagg aaaaagagga aaaaagattt ccagcatctc atttcttctc 241 ctttgaaaaa atccagaatc tgtgatgaga ctgcaaatgc cacttccaca ctcaaaaaga 301 gaaaaaagag aagatatagt gctttggagg tggacgagga agcaggtgtt acagttgtcc 361 ttgtggataa agaaaatatt aacaacacac caaagcattt tagaaaggat gttgatgttg 421 tttgtgttga tatgagcata gaacagaagt taccaagaaa gcctaaaaca gacaaatttc 481 aggtacttgc taagtcacat gcacataaat cagaagccct gcacagtaaa gttagggaga 541 aaaagaataa aaagcatcag aggaaagctg catcctggga gagccagcgg gcaagggaca 601 ccctgcctca gtcagaatcc caccaggagg agtcctggct ttctgtgggt ccagggggtg 661 aaattacaga actaccagca tctgctcata aaaacaagtc taagaaaaaa aagaaaaagt 721 ccagtaaccg ggaatatgag acactggcca tgcctgaagg atcgcaagca ggcagagagg 781 ccgggactga tatgcaggaa tcccagccta ctgtgggctt ggatgatgaa actccacaac 841 tactaggacc tactcacaaa aaaaagtcta agaaaaaaaa gaagaaaaag tccaatcacc 901 aggaatttga ggcattggcc atgcctgaag gatcacaagt gggcagtgag gttggggctg 961 atatgcagga atcccggcct gctgtgggcc tgcatggtga aactgcagga ataccagcac 1021 ctgcttataa aaacaagtct aagaaaaaaa agaaaaagtc caatcaccag gaatttgagg 1081 cagtggccat gcctgagagc ctcgagagtg cataccctga aggatcacag gtgggcagtg 1141 aggttgggac tgtggaaggc agtacagctc ttaaagggtt caaggaatcc aacagtacaa 1201 agaagaagtc taagaaaagg aagcttacgt ctgtcaaaag ggcacgagtg tctggtgatg 1261 atttttcagt gcccagtaag aactctgaga gcacactctt tgattcagta gaaggtgatg 1321 gcgccatgat ggaagaaggt gtgaaatcta ggccccgaca aaagaaaacc caggcctgtt 1381 tggcaagcaa gcacgtgcaa gaggcgccaa ggttagaacc tgcaaatgaa gaacacaatg 1441 tggaaacagc tgaagattcc gaaataagat acttatctgc agattcagga gatgccgatg 1501 attcagatgc ggatttgggt tctgccgtga aacagcttca ggagttcatt cctaacatca 1561 aggacagggc caccagcaca atcaagcgga tgtaccggga cgacttggaa cggtttaagg 1621 aatttaaagc acaaggtgtc gctattaaat ttggcaagtt ttctgtaaag gaaaataagc 1681 agttagagaa aaatgtggaa gactttctag ccctgacagg cattgagagt gcagacaagc 1741 tcctgtacac ggacagatat cctgaggaaa aatctgtgat caccaactta aaaaggagat 1801 actcgtttag attacacatt ggtaggaaca ttgcccggcc ctggaaactt atatactatc 1861 gagcaaagaa gatgttcgat gtcaacaatt acaaaggcag gtatagcgaa ggagatactg 1921 agaagttaaa gatgtaccat tctctccttg ggaatgactg gaagacgatt ggtgagatgg 1981 tggcccgacg tagcctctcc gtggccctca agttctcaca gatcagcagt caaagaaatc 2041 gtggtgcttg gagtaagtct gaaacccgga aactaatcaa ggctgtcgaa gaagtgattc 2101 tgaagaagat gtctccccag gagttaaaag aggtggattc caaactccaa gaaaatcctg 2161 aaagttgcct atcaattgtt cgggaaaaac tctacaaggg catatcttgg gtagaagtag 2221 aagctaaagt gcaaaccaga aattggatgc agtgtaaaag taagtggaca gaaattctaa 2281 ccaagaggat gactaatggt cggcgtatat actatggcat gaatgccctg cgggccaagg 2341 tcagccttat tgaaaggttg tatgaaataa atgtggaaga tactaatgaa atagactggg 2401 aagatcttgc tagtgccata ggtgatgttc ctccatctta cgttcaaact aaattttcta 2461 ggctgaaagc tgtctatgtt ccattttggc agaaaaagac ttttccagag atcatcgact 2521 acctttatga gacgactcta cctttgctga aggaaaagtt agaaaaaatg atggagaaaa 2581 aaggcactaa aatccagact cctgcagcac ccaagcaagt tttcccattt cgagacatct 2641 tttattatga agacgatagt gaaggaggag gacatagaaa aagaaagcga aggggaattc 2701 cgtaaagcct agaatcaaaa gaaaacaaaa cccatagtca agccacagac aagcccagaa 2761 taatatggcc aggggatcaa tccgattagc cgactggccc agatccagca ggcaaaaaag 2821 gagaaggagc cagagtacac gctcctc // LOCUS HSRNAU 4237 bp RNA PRI 05-MAR-1997 DEFINITION H.sapiens mRNA for plakophilin 2a and b. ACCESSION X97675 NID g1834512 KEYWORDS plakophilin 2. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4237) AUTHORS Mertens,C., Kuhn,C. and Franke,W.W. TITLE Plakophilins 2a and 2b: constitutive proteins of dual location in the karyoplasm and the desmosomal plaque JOURNAL J. Cell Biol. 135 (4), 1009-1025 (1996) MEDLINE 97081101 REFERENCE 2 (bases 1 to 4237) AUTHORS Mertens,C. TITLE Direct Submission JOURNAL Submitted (29-APR-1996) C. Mertens, German Cancer Research Center, Cellbiology, Im Neuenheimer Feld 280, Heidelberg, 69120, FRG REMARK Revised by [3] REFERENCE 3 (bases 1 to 4237) AUTHORS Mertens,C. TITLE Direct Submission JOURNAL Submitted (09-JAN-1997) C. Mertens, German Cancer Research Center, Cellbiology, Im Neuenheimer Feld 280, Heidelberg, 69120, FRG FEATURES Location/Qualifiers source 1..4237 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..>4237 CDS join(26..1402,1535..2671) /codon_start=1 /product="plakophilin 2a" /db_xref="PID:e304952" /db_xref="PID:g1871541" /translation="MAAPGAPAEYGYIRTVLGQQILGQLDSSSLALPSEAKLKLAGSS GRGGQTVKSLRIQEQVQQTLARKGRSSVGNGNLHRTSSVPEYVYNLHLVENDFVGGRS PVPKTYDMLKAGTTATYEGRWGRGTAQYSSQKSVEERSLRHPLRRLEISPDSSPERAH YTHSDYQYSQRSQAGHTLHHQESRRAALLVPPRYARSEIVGVSRAGTTSRQRHFDTYH RQYQHGSVSDTVFDSIPANPALLTYPRPGTSRSMGNLLEKENYLTAGLTVGQVRPLVP LQPVTQNRASRSSWHQSSFHSTRTLREAGPSVAVDSSGRRAHLTVGQAAAGGSGNLLT ERSTFTDSQLGNADMEMTLERAVSMLEADHMPPSRISAAATFIQHECFQKSEARKRVN QLRGILKLLQLLKVQNEDVQRAVCGALRNLVFEDNDNKLEVAELNGVPRLLQVLKQTR DLETKKQITGLLWNLSSNDKLKNLMITEALLTLTENIIIPFSGWPEGDYPKANGLLDF DIFYNVTGCLRNMSSAGADGRKAMRRCDGLIDSLVHYVRGTIADYQPDDKATENCVCI LHNLSYQLEAELPEKYSQNIYIQNRNIQTDNNKSIGCFGSRSRKVKEQYQDVPMPEEK SNPKGVEWLWHSIVIRMYLSLIAKSVRNYTQEASLGALQNLTAGSGPMPTSVAQTVVQ KESGLQHTRKMLHVGDPSVKKTAISLLRNLSRNLSLQNEIAKETLPDLVSIIPDTVPS TDLLIETTASACYTLNNIIQNSYQNARDLLNTGGIQKIMAISAGDAYASNKASKAASV LLYSLWAHTELHHAYKKAQFKKTDFVNSRTAKAYHSLKD" CDS 26..2671 /codon_start=1 /product="plakophilin 2b" /db_xref="PID:e305051" /db_xref="PID:g1871540" /translation="MAAPGAPAEYGYIRTVLGQQILGQLDSSSLALPSEAKLKLAGSS GRGGQTVKSLRIQEQVQQTLARKGRSSVGNGNLHRTSSVPEYVYNLHLVENDFVGGRS PVPKTYDMLKAGTTATYEGRWGRGTAQYSSQKSVEERSLRHPLRRLEISPDSSPERAH YTHSDYQYSQRSQAGHTLHHQESRRAALLVPPRYARSEIVGVSRAGTTSRQRHFDTYH RQYQHGSVSDTVFDSIPANPALLTYPRPGTSRSMGNLLEKENYLTAGLTVGQVRPLVP LQPVTQNRASRSSWHQSSFHSTRTLREAGPSVAVDSSGRRAHLTVGQAAAGGSGNLLT ERSTFTDSQLGNADMEMTLERAVSMLEADHMPPSRISAAATFIQHECFQKSEARKRVN QLRGILKLLQLLKVQNEDVQRAVCGALRNLVFEDNDNKLEVAELNGVPRLLQVLKQTR DLETKKQITDHTVNLRSRNGWPGAVAHACNPSTLGGQGGRITRSGVRDQPDQHGLLWN LSSNDKLKNLMITEALLTLTENIIIPFSGWPEGDYPKANGLLDFDIFYNVTGCLRNMS SAGADGRKAMRRCDGLIDSLVHYVRGTIADYQPDDKATENCVCILHNLSYQLEAELPE KYSQNIYIQNRNIQTDNNKSIGCFGSRSRKVKEQYQDVPMPEEKSNPKGVEWLWHSIV IRMYLSLIAKSVRNYTQEASLGALQNLTAGSGPMPTSVAQTVVQKESGLQHTRKMLHV GDPSVKKTAISLLRNLSRNLSLQNEIAKETLPDLVSIIPDTVPSTDLLIETTASACYT LNNIIQNSYQNARDLLNTGGIQKIMAISAGDAYASNKASKAASVLLYSLWAHTELHHA YKKAQFKKTDFVNSRTAKAYHSLKD" BASE COUNT 1234 a 1005 c 976 g 1022 t ORIGIN 1 cagctcggtc gcccccaccg gccccatggc agcccccggc gccccagctg agtacggcta 61 catccggacc gtcctgggcc agcagatcct gggacaactg gacagctcca gcctggcgct 121 gccctccgag gccaagctga agctggcggg gagcagcggc cgcggcggcc agacagtcaa 181 gagcctgcgg atccaggagc aggtgcagca gaccctcgcc cggaagggcc gcagctccgt 241 gggcaacgga aatcttcacc gaaccagcag tgttcctgag tatgtctaca acctacactt 301 ggttgaaaat gattttgttg gaggccgttc ccctgttcct aaaacctatg acatgctaaa 361 ggctggcaca actgccactt atgaaggtcg ctggggaaga ggaacagcac agtacagctc 421 ccagaagtcc gtggaagaaa ggtccttgag gcatcctctg aggagactgg agatttctcc 481 tgacagcagc ccggagaggg ctcactacac gcacagcgat taccagtaca gccagagaag 541 ccaggctggg cacaccctgc accaccaaga aagcaggcgg gccgccctcc tagtgccacc 601 gagatatgct cgttccgaga tcgtgggggt cagccgtgct ggcaccacaa gcaggcagcg 661 ccactttgac acataccaca gacagtacca gcatggctct gttagcgaca ccgtttttga 721 cagcatccct gccaacccgg ccctgctcac gtaccccagg ccagggacca gccgcagcat 781 gggcaacctc ttggagaagg agaactacct gacggcaggg ctcactgtcg ggcaggtcag 841 gccgctggtg cccctgcagc ccgtcactca gaacagggct tccaggtcct cctggcatca 901 gagctccttc cacagcaccc gcacgctgag ggaagctggg cccagtgtcg ccgtggattc 961 cagcgggagg agagcgcact tgactgtcgg ccaggcggcc gcagggggaa gtgggaatct 1021 gctcactgag agaagcactt tcactgactc ccagctgggg aatgcagaca tggagatgac 1081 tctggagcga gcagtgagta tgctcgaggc agaccacatg ccgccatcca ggatttctgc 1141 tgcagctact ttcatacagc acgagtgctt ccagaaatct gaagctcgga agagggttaa 1201 ccagcttcgt ggcatcctca agcttctgca gctcctaaaa gttcagaatg aagacgttca 1261 gcgagctgtg tgtggggcct tgagaaactt agtatttgaa gacaatgaca acaaattgga 1321 ggtggctgaa ctaaatgggg tacctcggct gctccaggtg ctgaagcaaa ccagagactt 1381 ggagactaaa aaacaaataa cagaccatac agtcaattta agaagtagga atggctggcc 1441 gggcgcggtg gctcacgcct gtaatcccag cactttggga ggccaaggcg ggcggatcac 1501 gaggtcagga gttcgagacc agcctgacca acatggtttg ctgtggaatt tgtcatctaa 1561 tgacaaactc aagaatctca tgataacaga agcattgctt acgctgacgg agaatatcat 1621 catccccttt tctgggtggc ctgaaggaga ctacccaaaa gcaaatggtt tgctcgattt 1681 tgacatattc tacaacgtca ctggatgcct aagaaacatg agttctgctg gcgctgatgg 1741 gagaaaagcg atgagaagat gtgacggact cattgactca ctggtccatt atgtcagagg 1801 aaccattgca gattaccagc cagatgacaa ggccacggag aattgtgtgt gcattcttca 1861 taacctctcc taccagctgg aggcagagct cccagagaaa tattcccaga atatctatat 1921 tcaaaaccgg aatatccaga ctgacaacaa caaaagtatt ggatgttttg gcagtcgaag 1981 caggaaagta aaagagcaat accaggacgt gccgatgccg gaggaaaaga gcaaccccaa 2041 gggcgtggag tggctgtggc attccattgt tataaggatg tatctgtcct tgatcgccaa 2101 aagtgtccgc aactacacac aagaagcatc cttaggagct ctgcagaacc tcacggccgg 2161 aagtggacca atgccgacat cagtggctca gacagttgtc cagaaggaaa gtggcctgca 2221 gcacacccga aagatgctgc atgttggtga cccaagtgtg aaaaagacag ccatctcgct 2281 gctgaggaat ctgtcccgga atctttctct gcagaatgaa attgccaaag aaactctccc 2341 tgatttggtt tccatcattc ctgacacagt cccgagtact gaccttctca ttgaaactac 2401 agcctctgcc tgttacacat tgaacaacat aatccaaaac agttaccaga atgcacgcga 2461 ccttctaaac accgggggca tccagaaaat tatggccatt agtgcaggcg atgcctatgc 2521 ctccaacaaa gcaagtaaag ctgcttccgt ccttctgtat tctctgtggg cacacacgga 2581 actgcatcat gcctacaaga aggctcagtt taagaagaca gattttgtca acagccggac 2641 tgccaaagcc taccactccc ttaaagactg aggaaaatga caaagtattc tcggctgcaa 2701 aaatccccaa aggaaaacac ctatttttct actacccagc ccaagaaacc tcaaaagcat 2761 gccttgtttc tatctttctc tatttccgtg gtcccctgaa tccagaaaac aaatagaaca 2821 taattttatg agtcttccag aagacctttg caagtttgcc accagtagat accggccaca 2881 ggctcgacaa atagtggtct ttgttattag ggcttatggt agatggcttc ctggaatcaa 2941 aatgtgaatt catgtggaag ggacattaat ccaataaata aggaaagaag ctgttgcatt 3001 actgggattt taaaagtttg atttacattt atattccttt tctggttccc atgttttgtc 3061 actcatgtgc acattgcttc gccattgggc ctccagtgta ttgttctgca gtgttgaaac 3121 agaatggaaa tgacaagaaa tatctgcagt tatccaggag aaagtataat ggcaaaatta 3181 ttggtttctt tctttacttt gtgcttgttt ttatcccctt ggttgttttt ctctgatttt 3241 taaataaact taagaaattt agattacaga gtatgcatga ctgtaagaaa aagaaattga 3301 gaggaagtga tcatagcaaa ttaaagaagt cttttcctcc cagaacttaa agtaaaataa 3361 aaaataaata aataaataaa atcttttcca cagagaaagg caactgtgat gataaaattt 3421 aagctccccc aacactgagt caatgagatt tttctcagga gatactttac ctataacaac 3481 gccgttaaat ccaaatctct tctaaacgat ggcattctat gtaatgcctt tcctggactt 3541 ttttggccac tgcctggact agtgaaagaa tggactctat ctttatctgc aagaggaact 3601 aaggccttct atcagactgc ctggccagcc tggggcactg aaaatacggc tcatgttaat 3661 gagttacatt atcagccagc ccagccttgc ccaccattta agaaatatca cagagccact 3721 agatctcata tgatcttctt caagccatta ttttaactca agaaaactct agagaagaaa 3781 agtgaagaag tcatgttgaa gaagatgtaa gaatgtgtca agaccatcca gaaatgatat 3841 gagaaatact gatattttaa atggttgaca tcatccagcg aaatgaatct acattaaatg 3901 ttgttttaac tgcgctatga ttaaaaccat tcatatagag ttagtcttta caactactat 3961 tctgttattt ttttttttaa tctgacaaca tttgtcctaa gtaagataag caaaaaaatt 4021 cttcaactcc ttttggcaag aaaactgtaa cagaaaataa attttgaatg tgtacttaag 4081 tctttattat atttgaagca attttttttc aattttaaaa gctgaatgaa gacaacttag 4141 gttgctaacc tagttcaaaa tgaaattatt tagataccaa tttttaaaat actggagaga 4201 atttatatgt ctttttccag agttctgatg ataagca // LOCUS HSRNAURPH 1349 bp RNA PRI 04-MAR-1996 DEFINITION H.sapiens mRNA for uridine phosphorylase. ACCESSION X90858 NID g1050524 KEYWORDS Uridine phosphorylase gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1349) AUTHORS Watanabe,S. TITLE Direct Submission JOURNAL Submitted (16-AUG-1995) S. Watanabe, Nippon Roche Research Center, Oncology, Kajiwara 200, Kamakura, Kanagawa, JAPAN REFERENCE 2 (bases 1 to 1349) AUTHORS Watanabe,S. and Uchida,T. TITLE Cloning and expression of human uridine phosphorylase JOURNAL Biochem. Biophys. Res. Commun. 216 (1), 265-272 (1995) MEDLINE 96067560 FEATURES Location/Qualifiers source 1..1349 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="HCT116" /cell_line="HCT116" /tissue_type="colon tumor" CDS join(353..396,789..855) /note="truncated; non-active" /codon_start=1 /product="uridine phosphorylase" /db_xref="PID:e225470" /db_xref="PID:g1213611" /translation="MAATGANAEKAESHKSGARHCGHNRAGSGYLLQGRV" CDS 353..1285 /EC_number="2.4.2.3" /codon_start=1 /product="uridine phosphorylase" /db_xref="PID:g1050525" /translation="MAATGANAEKAESHNDCPVRLLNPNIAKMKEDILYHFNLTTSRH NFPALFGDVKFVCVGGSPSRMKAFIRCVGAELGLDCPGRDYPNICAGTDRYAMYKVGP VLSVSHGMGIPSISIMLHELIKLLYYARCSNVTIIRIGTSGGIGLEPGTVVITEQAVD TCFKAEFEQIVLGKRVIRKTDLNKKLVQELLLCSAELSEFTTVVGNTMCTLDFYEGQG RLDGALCSYTEKDKQAYLEAAYAAGVRNIEMESSVFAAMCSACGLQAAVVCVTLLNRL EGDQISSPRNVLSEYQQRPQRLVSYFIKKKLSKA" misc_feature 397..788 /note="spliced out in truncated mRNA" BASE COUNT 283 a 383 c 399 g 284 t ORIGIN 1 attcgagtgc tccggagaac agacccgcgc cccgccgtcc gcgagcctcc cgagagccgt 61 cccttcgtcc ggccctggag cattgcgttt gtcgccggtg tcgcagtgcg aggatggcgc 121 cgcgggtgta gcggctctct gcgcaggccg agtgggccca gagaagcgag gaactcccca 181 gatcgccgac acgtctcgtc tcctgtccca attcagggct tggtgaggtg actcgcggtc 241 gcgggtgact cgccggcagg acactgcctg gaacgcctgg agcgcctccc actgcagacg 301 tctgtccgcc tccagccgct ctcctctgac gggtcctgcc tcagttggcg gaatggcggc 361 cacgggagcc aatgcagaga aagctgaaag tcacaatgat tgccccgtca gacttttaaa 421 tccaaacata gcaaaaatga aagaagatat tctctatcat ttcaatctca ccactagcag 481 acacaatttc ccagccttgt ttggagatgt gaagtttgtg tgtgttggtg gaagcccctc 541 ccggatgaaa gccttcatca ggtgcgttgg tgcagagctg ggccttgact gcccaggtag 601 agactatccc aacatctgtg cgggaactga ccgctatgcc atgtataaag taggaccggt 661 gctgtctgtc agtcatggta tgggcattcc ttctatctca atcatgttgc atgagctcat 721 aaagctgctg tactatgccc ggtgctccaa cgtcactatc atccgcattg gcacttctgg 781 tgggataggt ctggagcccg gcactgtggt cataacagag caggcagtgg atacctgctt 841 caaggcagag tttgagcaga ttgtcctggg gaagcgggtc atccggaaaa cggaccttaa 901 caagaagctg gtgcaggagc tgttgctgtg ttctgcagag ctgagcgagt tcaccacagt 961 ggtggggaac accatgtgca ccttggactt ctatgaaggg caaggccgtc tggatggggc 1021 tctctgctcc tacacggaga aggacaagca ggcgtatctg gaggcagcct atgcagccgg 1081 cgtccgcaat atcgagatgg agtcctcggt gtttgccgcc atgtgcagcg cctgcggcct 1141 ccaagcggcc gtggtgtgtg tcaccctcct gaaccgcctg gaaggggacc agatcagcag 1201 ccctcgcaat gtgctcagcg agtaccagca gaggccgcag cggctggtga gctacttcat 1261 caagaagaaa ctgagcaagg cctgagcgct gccctgcacc tccgcagacc tgctgtgatg 1321 acttgccatt aaaagcattg tccaaaccc // LOCUS HSRNF3A 995 bp RNA PRI 24-SEP-1997 DEFINITION Homo sapiens mRNA for RNF3A (DONG1) ring finger protein. ACCESSION AJ001019 NID g2437832 KEYWORDS RNF3A gene, ring finger protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 995) AUTHORS Dyer,M.J. TITLE Direct Submission JOURNAL Submitted (19-SEP-1997) Dyer M.J., Academic Haematology and Cytogenetics, Institute of Cancer Research, Haddow Laboratories, Sutton, Surrey SM2 5NG, UK REMARK revised by [3] REFERENCE 2 (bases 1 to 995) AUTHORS Abdul-Rauf,M. and Dyer,M.J. TITLE Interactions of the BCL7A protein with novel ring finger proteins JOURNAL Unpublished REFERENCE 3 (bases 1 to 995) AUTHORS Dyer,M.J. TITLE Direct Submission JOURNAL Submitted (22-SEP-1997) Dyer M.J., Academic Haematology and Cytogenetics, Institute of Cancer Research, Haddow Laboratories, Sutton, Surrey SM2 5NG, UK FEATURES Location/Qualifiers source 1..995 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /cell_type="brain" /clone_lib="Stratagene fetal brain 18-19wk oligo(dT) primed" /chromosome="4" /map="p16.3" gene 115..858 /gene="RNF3A" CDS 115..858 /gene="RNF3A" /codon_start=1 /product="ring finger protein" /db_xref="PID:e351238" /db_xref="PID:g2440074" /translation="MEFPKMLTRKIKLWDINAHITCRLCSGYLIDATTVTECLHTFCR SCLVKYLEENNTCPTCRIVIHQSHPLQYIGHDRTMQDIVYKLVPGLQEAEMRKQREFY HKLGMEVPGDIKGETCSAKQHLDSHRNGETKADDSSNKEAAEEKPEEDNDYHRSDEQV SICLECNSSKLRGLKRKWIRCSAQATVLHLKKFIAKKLNLSSFNELDILCNEEILGKD HTLKFVVVTRWRFKKAPLLLHYRPKMDLL" misc_feature 178..295 /gene="RNF3A" /note="ring finger domain" BASE COUNT 269 a 247 c 277 g 202 t ORIGIN 1 tttgggggtg ataaaaaggg gggcccaaaa aacgggggag cggagatttt tttgggaaat 61 tttttttttt ttcctttgga tatatgacca gcagtgggat tgctggatct tacgatggaa 121 ttcccaaaga tgttgaccag gaagatcaag ctgtgggaca tcaacgccca catcacctgc 181 cgcctgtgca gcgggtacct catcgacgcc accacggtga ccgagtgtct gcacaccttc 241 tgcaggagct gcctggtgaa gtacctggag gagaacaaca cctgccccac ctgcaggatt 301 gtgatccacc agagccaccc cctgcagtac atcggtcatg acagaaccat gcaagatatt 361 gtttacaaat tggtaccagg cctccaagaa gcggaaatga gaaagcagag ggagttctat 421 cacaaattgg gcatggaggt gccgggagac atcaaggggg agacctgctc tgcaaaacag 481 cacttagatt cccatcggaa tggtgaaacc aaagcagacg acagttcaaa caaagaggcc 541 gcggaggaga agccggagga ggacaacgac taccaccgca gcgacgagca ggtgagcatc 601 tgcttggagt gtaacagcag caaactgcgc gggctgaagc ggaagtggat ccgctgctca 661 gcccaggcga ccgtcttgca tctgaagaag ttcatcgcca aaaaactcaa cctttcatcc 721 tttaacgagc tggacatttt atgcaacgag gagatcctgg gcaaggacca cacactcaag 781 ttcgtggttg tcactaggtg gagattcaag aaggcgccgc tcctgctgca ctacagaccc 841 aagatggact tgctgtgaat ggtgccacac agcgcccaca gactgggctc gcacccttgg 901 gtgctcccgg ccgccgcgct taagaacatt gcctctgggt gtcatgtgga ccagacttct 961 gaatagagaa tatttataac ttttgtatga gagag // LOCUS HSRNP24PT 780 bp RNA PRI 09-SEP-1996 DEFINITION H.sapiens mRNA for transmembrane protein rnp24. ACCESSION X92098 NID g1212964 KEYWORDS rnp24 gene; transmembrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 780) AUTHORS Blum,R. TITLE Direct Submission JOURNAL Submitted (09-OCT-1995) R. Blum, Universitat des Saarlandes, 2. Physiologisches Institut, Geb.58, Homburg, D-66421, FRG REFERENCE 2 (bases 1 to 780) AUTHORS Blum,R., Feick,P., Puype,M., Vandekerckhove,J., Klengel,R., Nastainczyk,W. and Schulz,I. TITLE Tmp21 and p24A, two type I proteins enriched in pancreatic microsomal membranes, are members of a protein family involved in vesicular trafficking JOURNAL J. Biol. Chem. 271 (29), 17183-17189 (1996) MEDLINE 96291865 FEATURES Location/Qualifiers source 1..780 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="brain" /clone_lib="ZAPII, R1.1" gene 28..633 /gene="rnp24" CDS 28..633 /gene="rnp24" /note="microsomal fraction" /codon_start=1 /product="transmembrane protein" /db_xref="PID:e205529" /db_xref="PID:g1212965" /translation="MVTLAELLVLLAALLATVSGYFVSIDAHAEECFFERVTSGTKMG LIFEVAEGGFLDIDVEITGPDNKGIYKGDRESSGKYTFAAHMDGTYKFCFSNRMSTMT PKIVMFTIDIGEAPKGQDMETEAHQNKLEEMINELAVAMTAVKHEQEYMEVRERIHRA INDNTNSRVVLWSFFEALVLVAMTLGQIYYLKRFFEVRRVV" BASE COUNT 220 a 175 c 186 g 199 t ORIGIN 1 cgtcctggct tcggcctcag ccccaccatg gtgacgcttg ctgaactgct ggtgctcctg 61 gccgctctcc tggccacggt ctcgggctat ttcgttagca tcgacgccca tgctgaagag 121 tgcttctttg agcgggtcac ctcgggcacc aagatgggcc tcatcttcga ggtggcggag 181 ggcggcttcc tggacatcga cgtggagatt acaggaccag ataacaaagg aatttacaaa 241 ggagacagag aatccagtgg gaaatacaca tttgctgctc acatggatgg aacatacaaa 301 ttttgtttta gtaaccggat gtccaccatg actccaaaaa tagtgatgtt caccattgat 361 attggggagg ctccaaaagg acaagatatg gaaacagaag ctcaccagaa caagctagaa 421 gaaatgatca atgagctagc agtggcgatg acagctgtaa agcacgaaca ggaatacatg 481 gaagtccggg agagaataca cagagccatc aacgacaaca caaacagcag agtggtcctt 541 tggtccttct ttgaagctct tgttctagtt gccatgacat tgggacagat ctactacctg 601 aagagatttt ttgaagtccg gagagttgtt taaaaagcct cttcctgatg atcccaactc 661 agaattcact gtttaccaaa caccttggtc ataataatgt cattagtttc tccattttta 721 ttttctgaac tgtacattcc caacttatgt ttctttgaga ttaatagata ttgggggaaa // LOCUS HSRNP70K 2693 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for U1 RNA-associated 70K protein. ACCESSION X04654 NID g36099 KEYWORDS ribonucleoprotein; small nuclear ribonucleoprotein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2693) AUTHORS Theissen,H., Etzerodt,M., Reuter,R., Schneider,C., Lottspeich,F., Argos,P., Luhrmann,R. and Philipson,L. TITLE Cloning of the human cDNA for the U1 RNA-associated 70K protein JOURNAL EMBO J. 5 (12), 3209-3217 (1986) MEDLINE 87133480 FEATURES Location/Qualifiers source 1..2693 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 681..2525 /note="70 K protein (AA 1-614)" /codon_start=1 /db_xref="PID:g36100" /db_xref="SWISS-PROT:P08621" /translation="MGTISGGGGSNAATRQVGCAPSGRPSTRPSGTAIRARPVASVKP IDEGLAEVRVIEDEAIGIEGERLDRRKERRRQEALIEDQQQRQRRWPGLPAARPGRAA SSAGIGGRQGLLARGTLWWLSSGLVRSSSGRRNQTDVDAPGVEAEAGVVVAEGLPQPP RASGQTPERGGATRLGKMTQFLPPNLLALFAPRDPIPYLPPLEKLPHEKHHNQPYCGI APYIREFEDPRDAPPPTRAETREERMERKRREKIERRQQEVETELKMWDPHNDPNAQG DAFKTLFVARVNYDTTESKLRREFEVYGPIKRIHMVYSKRSGKPRGYAFIEYEHERDM HSAYKHADGKKIDGRRVLVDVERGRTVKGWRPRRLGGGLGGTRRGGADVNIRHSGRDD TSRYDERPGPSPLPHRDRDRDRERERRERSRERDKERERRRSRSRDRRRRSRSRDKEE RRRSRERSKDKDRDRKRRSSRSRERARRERERKEELRGGGGDMAEPSEAGDAPPDDGP PGELGPDGPDGPEEKGRDRDRERRRSHRSERERRRDRDRDRDRDREHKRGERGSERGR DEARGGGGGQDNGLEGLGNDSRDMYMESEGGDGYLAPENGYLMEAAPE" misc_feature 1035..1049 /note="ctgcagggggcgggg in p70.1" misc_feature 1035..2356 /note="homologous region of clone p70.1" misc_feature 1068..1069 /note="tc is ct in p70.1" misc_feature 1791..1792 /note="gg is cc in p70.1" misc_feature 1876..1902 /note="sequence is missing in p70.1" misc_feature 2352..2356 /note="cgccc in p70.1" misc_feature 2665..2670 /note="pot. poly A signal" polyA_site 2693 /note="poly A site" BASE COUNT 591 a 780 c 949 g 373 t ORIGIN 1 gcgagacgaa ggtgcgcagg ccgatgcccg agagcgggat gatgaccgtg aagacgagcc 61 agccggcgat gagggtgaag gccacccagc gccacttgcc cagcggcacc gcttctggcg 121 cgcgcccttg cccttgatgg cgacgaactt gttggccgag cgcacgagcc agcgctgcag 181 catcaccagc ggcatggtca ccgccaccag gcacacggcc accgccgcca tcaggtgata 241 cgaaggcgta cccagcttgt tggtgagctt gtagagatag gtcggcagca ccaggtggcc 301 ttccggatca cccagcacca gcaccaggcc gaacacttca aagccgagga agaacaccag 361 cacgccggag taggccagcg ccggggtgat catcggcagc gacacgttca acgccacctg 421 cagcggcgaa cgaccggcca cgcgggccgc ttcttccaca tccgaaccca ggctgcgcag 481 ggccgccgag gcatacaggt agacgtgcgg cacgtgggtc aggccggcga tgatgacgat 541 gctggtgaag gaatagatgt tccacgggtc gccctcgaaa ccgacgaccg acagcaggtt 601 cttgacccac accgtgtaga agccgaccgg ccccatcgag accacgtagc cgaagccgat 661 caccagtggg cgagacgaag atgggcacca tcagtggcgg gggcggatcg aacgctgcga 721 cccggcaggt cggctgcgca ccatcaggaa ggccaagcac tcggccgagc ggcacggcga 781 tcagggccag gccggtcgcc agcgtcaagc cgattgacga aggcctggcg gaagtccggg 841 tcatcgaaga tgaagcgata ggaatcgaag gtgagcgtct tgaccggcgc aaagaacggc 901 gccgacagga agctttgata gaagatcagc agcagcggca gcgaagatgg ccagggctgc 961 cagcagcacg accaggccgc gcggccagtt cagccggaat cggcgggcgg cagggcttgc 1021 tcgcgcgcgg cacgttgtgg tggctgagca gcggcttggt gcgctcgtct agcgggcgac 1081 ggaatcagac ggacgtggac gcccccggag tggaagccga agcaggagtt gttgttgctg 1141 aggggctgcc gcagccgccg cgagcctccg gacagacgcc agagcgagga ggcgctacgc 1201 gacttggcaa gatgacccag ttcctgccgc ccaaccttct ggccctcttt gccccccgtg 1261 accctattcc atacctgcca cccctggaga aactgccaca tgaaaaacac cacaatcaac 1321 cttattgtgg cattgcgccg tacattcgag agtttgagga ccctcgagat gcccctcctc 1381 caactcgtgc tgaaacccga gaggagcgca tggagaggaa aagacgggaa aagattgagc 1441 ggcgacagca agaagtggag acagagctta aaatgtggga ccctcacaat gatcccaatg 1501 ctcaggggga tgccttcaag actctcttcg tggcgagagt gaattatgac acaacagaat 1561 ccaagctccg gagagagttt gaggtgtacg gacctatcaa aagaatacac atggtctaca 1621 gtaagcggtc aggaaagccc cgtggctatg ccttcatcga gtacgaacac gagcgagaca 1681 tgcactccgc ttacaaacac gcagatggca agaagattga tggcaggagg gtccttgtgg 1741 acgtggagag gggccgaacc gtgaagggct ggaggccccg gcggctagga ggaggcctcg 1801 gtggtaccag aagaggaggg gctgatgtga acatccggca ttcaggccgc gatgacacct 1861 cccgctacga tgagaggccc ggcccctccc cgcttccgca cagggaccgg gaccgggacc 1921 gtgagcggga gcgcagagag cggagccggg agcgagacaa ggagcgagaa cggcgacgct 1981 cccgctcccg ggaccggcgg aggcgctcac ggagtcgcga caaggaggag cggaggcgct 2041 ccagggagcg gagcaaggac aaggaccggg accggaagcg gcgaagcagc cggagtcggg 2101 agcgggcccg gcgggagcgg gagcgcaagg aggagctgcg tggcggcggt ggcgacatgg 2161 cggagccctc cgaggcgggt gacgcgcccc ctgatgatgg gcctccaggg gagctcgggc 2221 ctgacggccc tgacggtcca gaggaaaagg gccgggatcg tgaccgggag cgacggcgga 2281 gccaccggag cgagcgcgag cggcgccggg accgggatcg tgaccgtgac cgtgaccgcg 2341 agcacaaacg gggggagcgg ggcagtgagc ggggcaggga tgaggcccga ggtgggggcg 2401 gtggccagga caacgggctg gagggtctgg gcaacgacag ccgagacatg tacatggagt 2461 ctgagggcgg cgacggctac ctggctccgg agaatgggta tttgatggag gctgcgccgg 2521 agtgaagagg tcgtcctctc catctgctgt gtttggacgc gttcctgccc agccccttgc 2581 tgtcatcccc tcccccaacc ttggccactt gagtttgtcc tccaagggta ggtgtctcat 2641 ttgttctggc cccttggatt taaaaataaa attaatttcc tgttgatagt ggg // LOCUS HSROHU 1253 bp RNA PRI 28-NOV-1993 DEFINITION Human rohu mRNA for rhodanese. ACCESSION X59434 S61764 NID g432375 KEYWORDS rhodanese; rohu gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1253) AUTHORS Pallini,R. TITLE Direct Submission JOURNAL Submitted (14-MAY-1991) R. Pallini, Dept of Molecular Biology, University of Siena, Via G Mameli 21, 53100 Siena, Italy REFERENCE 2 (bases 1 to 1253) AUTHORS Pallini,R., Guazzi,G.C., Cannella,C. and Cacace,M.G. TITLE Cloning and sequence analysis of the human liver rhodanese: comparison with the bovine and chicken enzymes JOURNAL Biochem. Biophys. Res. Commun. 180 (2), 887-893 (1991) MEDLINE 92062122 FEATURES Location/Qualifiers source 1..1253 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="liver" /clone_lib="lambda gt11 fetal liver" /clone="1" mRNA 1..1253 /gene="rohu" gene 1..1253 /gene="rohu" CDS 35..925 /gene="rohu" /EC_number="2.8.1.1" /note="rhodanese" /codon_start=1 /product="thiosulfate sulfurtransferase" /db_xref="PID:g432376" /db_xref="SWISS-PROT:P25325" /translation="MASPQLCRALVSAQWVAEALRAPRAGQPLQLLDASWYLPKLGRD ATQFEERHIPGAAFFDIDQCSDRTSPYDHMLPGAEHFAEYAGRLGVGAATHVVIYDAS DQGLYSAPRVWWMFRAFGHHAVSLLDGGLRHWLRQNLPLSSGKSQPAPAEFRAQLDPA FIKTYEDIKENLESRRFQVVDSRATGRFRGTEPEPRDGIEPGHIPGTVNIPFTDFLSQ EGLEKSPEEIRHLFQEKKVDLSKPLVATCGSGVTACHVALGAYLCGKPDVPIYDGSWV EWYMRARPEDVISEGRGKTH" BASE COUNT 232 a 414 c 393 g 214 t ORIGIN 1 gaattccggc ccccgcggcg cgagtgtcgc cgccatggct tcgccgcagc tctgccgcgc 61 gctggtgtcg gcgcaatggg tggcggaggc gctgcgggcc ccgcgcgctg ggcagcctct 121 gcagctgctg gacgcctcct ggtacctgcc gaagctgggg cgcgacgcga cgcagttcga 181 ggagcgccac atcccgggcg ccgctttctt cgacatcgac cagtgcagcg accgcacctc 241 gccctacgac cacatgctgc ccggggccga gcatttcgcg gagtacgcag gccgcctggg 301 cgtgggcgcg gccacccacg tcgtgatcta cgacgccagc gaccagggcc tctactccgc 361 cccgcgcgtc tggtggatgt tccgcgcctt cggccaccac gccgtgtcac tgcttgatgg 421 cggcctccgc cactggctgc gccagaacct cccgctcagc tccggcaaga gccaacctgc 481 tcccgccgag ttccgcgctc agctcgaccc cgccttcatc aagacctacg aggacatcaa 541 ggagaacctg gaatcccggc gcttccaggt ggtggactcc cgagccactg gcaggttccg 601 cggcaccgag cccgagcccc gagacggcat tgaacctggc cacatcccag gtaccgtgaa 661 catccccttc acagacttcc tgagccagga ggggctggag aagagccctg aggagatccg 721 ccatctgttc caggagaaga aagtggacct gtctaagcca ctggtggcca cgtgtggctc 781 tggcgtcaca gcctgccacg tggcactagg ggcctacctc tgcggcaagc cagacgtgcc 841 catctacgat ggctcctggg tggagtggta catgcgcgcc cggcccgagg atgtcatctc 901 agagggccgg gggaagaccc actgaagctg ggcaggacac aggcgagctc aggtgatgcc 961 ggccaccagc aatgcctggc ctggtagctc cgcttctgct ttcaccaaga gagtgtttct 1021 tcactcaact caggtggcat ttggggtgac atctcaaagg ccaggaattc cgttgacttg 1081 ttggctgcca gtaggggcgg gaggaaaggc ggaggcgagc cctggaggag ggaggccaca 1141 acaccgagct gcccacctgg tgctgagctg gggccccgcc tcctttctgt tttatttttg 1201 aggaaataaa ataaccaagt gctaaatctt gtaaaaaaaa aaaaaggaat tcc // LOCUS HSROXPROT 4811 bp RNA PRI 20-JUN-1997 DEFINITION H.sapiens mRNA for ROX protein. ACCESSION X96401 NID g1841919 KEYWORDS CA repeat; polymorphic dinucleotide repeat; rox gene; ROX protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4811) AUTHORS Meroni,G., Reymond,A., Alcalay,M., Borsani,G., Tanigami,A., Tonlorenzi,R., Lo Nigro,C., Messali,S., Zollo,M., Ledbetter,D.H., Brent,R., Ballabio,A. and Carrozzo,R. TITLE Rox, a novel bHLHZip protein expressed in quiescent cells that heterodimerizes with Max, binds a non-canonical E box and acts as a transcriptional repressor JOURNAL EMBO J. 16 (10), 2892-2906 (1997) MEDLINE 97327566 REFERENCE 2 (bases 1 to 4811) AUTHORS Carrozzo,R. TITLE Direct Submission JOURNAL Submitted (01-MAR-1996) R. Carrozzo, Hospital San Raffaele, Via Olgettina 60, I 20132 Milano, ITALY COMMENT Related sequence X69879. FEATURES Location/Qualifiers source 1..4811 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="foetus" /tissue_type="brain" /chromosome="17" /map="p13.3" gene 213..1961 /gene="rox" CDS 213..1961 /gene="rox" /codon_start=1 /product="ROX protein" /db_xref="PID:e228622" /db_xref="PID:g1841920" /translation="MSIETLLEAARFLEWQAQQQQRAREEQERLRLEQEREQEQKKAN SLARLAHTLPVEEPRMEAPPLPLSPPAPPPAPPPPLATPAPLTVIPIPVVTNSPQPLP PPPPLPAAAQPLPLAPRQPALVGAPGLSIKEPAPLPSRPQVPTPAPLLPDSKATIPPN GSPKPLQPLPTPVLTIAPHPGVQPQLAPQQPPPPTLGTLKLAPAEEVKSSEQKKRPGG IGTREVHNKLEKNRRAHLKECFETLKRNIPNVDDKKTSNLSVLRTALRYIQSLKRKEK EYEHEMERLAREKIATQQRLAELKHELSQWMDVLEIDRVLRQTGQPEDDQASTSTASE GEDNIDEDMEEDRAGLGPPKLSHRPQPELLKSTLPPPSTTPAPLPPHPHPHPHSVALP PAHLPVQQQQPQQKTPLPAPPPPPAAPAQTLVPAPAHLVATAGGGSTVIAHTATTHAS VIQTVNHVLQGPGGKHIAHIAPSAPSPAVQLAPATPPIGHITVHPATLNHVAHLGSQL PLYPQPVAVSHIAHTLSHQQVNGTAGLGPPATVMAKPAVGAQVVHHPQLVGQTVLNPV TMVTMPSFPVSTLKLA" repeat_region 4217..4254 /note="polymorphic dinucleotide repeat" /rpt_unit=ca polyA_signal 4753..4758 BASE COUNT 974 a 1630 c 1304 g 903 t ORIGIN 1 aaatttgcaa ttttatattt tgcaaatatt ttgagagaca ttgatttttc tccccgtgct 61 cccccgttct tccctgcgga gtgcgctgcg ccgcccagcc ctgtcgcccc ccggaggtga 121 tccctccctc ctgcctgccc gccagcctga cctgtgcccg gctcgcgggc cgcagcctcg 181 gccccggcgc gcccccggca gctctcggcg cgatgagcat agagacgcta ctggaggcgg 241 cccgcttcct ggaatggcaa gcgcagcaac aacagagagc acgtgaggag caggagcggc 301 ttcgcttgga gcaggagcga gagcaggaac agaagaaggc caatagcctg gccaggctgg 361 cacataccct tcctgtggag gaaccccgca tggaggcgcc acccctgcct ctgtctccac 421 cggctccccc gccggcaccc ccaccaccac ttgccacccc tgccccactg actgtcatcc 481 ctatccctgt ggtgaccaac tcccctcagc ctctaccccc acccccaccc ttgcccgcgg 541 cagcccagcc tctgcccctg gcgcctcgtc agccggccct ggttggcgcc cccggactca 601 gcattaagga gcctgccccc ctgcccagca ggccgcaggt gcccacccct gctcccctac 661 tgccggactc gaaggccacc attccaccca atggcagccc caagcctttg cagcccctcc 721 ccacgcctgt cctgaccata gcgccacacc ctggagtcca gcctcagctg gccccccagc 781 agccgccccc acccacgctg gggaccctga agttggcacc agctgaagaa gtcaaatcca 841 gtgaacagaa gaagaggccc ggggggatcg gaaccagaga agtccacaac aaattggaga 901 agaacaggag ggcccatctg aaagagtgct ttgagaccct gaagcggaac atccccaacg 961 tggatgacaa gaagacgtcc aatctgagcg tgctgcggac ggcgctgcgg tacatccagt 1021 ccctgaagag gaaggagaag gaatatgagc atgaaatgga gcggctggca cgtgagaaga 1081 ttgccacgca gcagcggctg gcagagctca agcacgagct gagccagtgg atggacgtac 1141 tggagattga ccgcgtgctg cggcagacgg gccagcccga ggatgaccag gcctccacct 1201 ccaccgcctc tgagggtgag gacaacatag acgaggatat ggaggaggac cgggcgggcc 1261 tgggcccacc taagctgagc catcgtcccc agccggagct gctgaagtcc accctgccac 1321 cccccagcac cacccctgcg cctctgcctc cacacccaca ccctcacccc cactccgtgg 1381 ccctacctcc tgcccacctc cccgtgcagc agcagcagcc acagcagaag acccctctgc 1441 cagcccctcc tcccccaccg gctgcccctg cccagacact ggtgccagct ccagcccatc 1501 tggtggcgac ggctgggggt ggctccacgg tcatcgccca cacagctacc actcacgctt 1561 cagtcatcca gactgtgaac cacgttctgc aggggccagg cggcaagcac atcgcccaca 1621 tcgccccctc ggcccccagc cctgcggtgc aactggcgcc tgccacaccc cccattgggc 1681 acatcactgt gcaccctgcc accctcaacc atgtggccca cctgggctcc cagctgccct 1741 tgtacccgca gcccgtggca gtgagccaca tcgcccacac cctctcgcac cagcaagtca 1801 acggcacggc cggcctgggg cccccggcta ctgtcatggc aaagccggcc gtgggggctc 1861 aggtggtgca ccacccccag ctggtgggcc agaccgtgct caaccctgtg accatggtca 1921 ccatgccctc cttcccagtc agcacactca agctggcttg aggacgaggc cactcagagg 1981 cccccagtgg ggacagggag ggggacctgt cccccactct ctcacccacc agctccacac 2041 attccagcca ggcccaggcc agccccccca cccaccccca ggcctcctag gggaaggggg 2101 tgcaaagact ctgagccaag ggagggaagg gccaccctgc tgcactagga cttggtaaga 2161 tgactctgag aaaatgcgag actctgatgg aatgtgccac ctgtccggcc cagtgccagc 2221 tccagtgccg ctcctgcttc ccctccctac cctcggaaat cagtgcgatg tggacgtcac 2281 gctccctgac ttcttccccc ggccctgccc cggccttcgt gtgctgctgc tgctattgct 2341 gtctggtgaa ggtggcccag gcccccggct ttcctccgga gcctcatgtt ctcttcccag 2401 gcctttggag gggaaatggg gaaagcagaa ctgaagccac ttggcccaga aagctgcgga 2461 ttggggtgat aaggggcctt gctctgagca caggtgacag atcataggaa gtggctggtc 2521 tggagtccca cccggacagg tggggcctca gcctggggct ctctgacccg gttgcagtca 2581 ctgtgattcg ttaccgtaga tactacttaa aatgattctc taccaacaat aaaccaaacc 2641 cagccactgc caagcttcct gtgccctcac cccaatccct gccactgggc tctggacctc 2701 aggagggcag gctgaggtgg ggaaggaggg acttggctgt ccttcccctt ccccgtcccc 2761 tgcagcctgg gtctggatgg gaggagagcc actgggcccc tgtccccagt ccccagccct 2821 gggcttggct cttggcttcc agaaggcagc aaagaggggc gctgtcctgc gttcagcccc 2881 tgatctctga cctctgctga gtgctgggca ctgctccaag ggacaggtgg gcctggcggc 2941 ctgtgggttt gggtgcctcc tgcagtttgg gagacatgga ccagcatctg gtcttgtttc 3001 caggagcata gaagccacat cgttgagaca tcaggaaggt aaaaacccag cggcttagcc 3061 aagccctaag cctgtcccca gaccaaccct gggacctata cagaacagag ggccagagct 3121 agggctgctg cttctgctcc agcccctttg cctctgtcct cccatcccct caacaccctg 3181 cttctcccgg ggacgctttt gagtgggccc tgcccgggga gctgcagagc agcagcacct 3241 ttctctgaga agaggtcctt ggttgggtca aggacagggc tgagcgtgga agggggagga 3301 gtcaggggct ctgtgttagg atgcggcttt ctctgcctct gggcagcctg ctttggcctt 3361 tccttgtatg tgggtgttta ttacaagtgg gccccccccc cctccccacc tggacacaca 3421 cacaccagtg gccttcagtg tagcgggagg aagccggtgt ccctggatgt gaagctcaca 3481 ctgatgggct ggggcagggg ctgggcggcg agggggcccg gggaggggac agagctgagg 3541 atttcctgga gtgccctgca ggccaagagg aggtcagcaa aggtcttgaa tagattttct 3601 ctggaaataa agaatcctta gatcctaaaa atcccttcct gttccctcct ggtcctggac 3661 acctcccagg ggactgttcc ttatttctct ctcctggtgt gggtaaaggg acagttacaa 3721 accaggtcac catcctcaga ggctgagcct gtacccaccc cagcacagcc acctgccaga 3781 cccgtggctc tgggagagag cccttttgct attccattgt gacagcgatg caggcctgac 3841 cggacggggc agttagagca gcctcgtgga gctcgcccct cccctcagcc tgcccgcctc 3901 tccctgcatc ctgagccagt gcccccgggc ctgcacagat acctcatcat ccacccgctg 3961 cccctccccg tccctgtccc cagcatcggg ggcctgcaaa tctagtgccg aaatctagtg 4021 ccgaatgact atgtccagat tggtgacgat gggtgttgct gtttttcttg tgtttgtgaa 4081 cgcatgtgat gctgtggaag tgggcagtca ccacccagcc cttccttggg cgtctccccc 4141 agagtgctgt cgggaccaca tctgtcctca cctgtgggag cgcctggccc tccctcccta 4201 gcccttccag cctgggacac acacacacac acacacacac acacacacac acacgcacgc 4261 acacgcacac atcttacctc tcatgcgtgt tttacctttt gatgttcaga gtggctcact 4321 ggctgggagt ccttacctcg gggaggaggg ggaggttggt tccttggggg gccaaagaag 4381 gcagggaatg cctggagggt aactggggcc accatgaacc ccttttctcc agaaaagctg 4441 cttctccccc catcccgggt cccaccccca aacccccaga ggtggccctt gtttacagtg 4501 aggactcggc cactgtgtct ctgtttcctg aaatataaac tgtagcgacc ccagactgta 4561 gagattttta tgtgtttgga acatctgctg tgtggaaaaa aaaaaaaact acaaaaaccc 4621 taattttgta catactgtat ttttactatt gaactgtatt ctagtggctg ttcatgctcc 4681 aagactttag ttaccgagac atgaatacta tccatgtaat aagcacttgc ctggaataaa 4741 atataaaact gaaataaacc tgcactgaaa cctgagatgg agctgccaaa aaaaaaaaaa 4801 aaaaaaaaaa a // LOCUS HSRP19 894 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for 19kD protein of signal recognition particle (SRP). ACCESSION X12791 NID g36112 KEYWORDS signal recognition particle. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 894) AUTHORS Lingelbach,K. TITLE Direct Submission JOURNAL Submitted (05-SEP-1988) Lingelbach K., EMBL, Postfach 10.2209, D-6900 Heidelberg, FRG REFERENCE 2 (bases 1 to 894) AUTHORS Lingelbach,K., Zwieb,C., Webb,J.R., Marshallsay,C., Hoben,P.J., Walter,P. and Dobberstein,B. TITLE Isolation and characterization of a cDNA clone encoding the 19 kDa protein of signal recognition particle (SRP): expression and binding to 7SL RNA JOURNAL Nucleic Acids Res. 16 (20), 9431-9442 (1988) MEDLINE 89041541 FEATURES Location/Qualifiers source 1..894 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /clone_lib="lambda NM1149" CDS 82..516 /note="19kD SRP-protein (AA 1 - 144)" /codon_start=1 /db_xref="PID:g36113" /db_xref="SWISS-PROT:P09132" /translation="MACAAARSPADQDRFICIYPAYLNNKKTIAEGRRIPISKAVENP TATEIQDVCSAVGLNVFLEKNKMYSREWNRDVQYRGRVRVQLKQEDGSLCLVQFPSRK SVMLYAAEMIPKLKTRTQKTGGADQSLQQGEGSKKGKGKKKK" BASE COUNT 297 a 157 c 196 g 244 t ORIGIN 1 cggaaactca gagccgggtt cctcccgggt ttctgccggg tttctccctg cggctcctgg 61 gttgttgaga ctcttgtgaa gatggcttgc gctgccgcgc ggtccccggc cgaccaggac 121 aggtttattt gtatctatcc tgcttattta aataataaga agaccatcgc agagggaagg 181 cgaatcccca taagtaaggc tgttgaaaat cctacagcta cagagattca agatgtatgt 241 tcagcagttg gacttaacgt atttcttgag aaaaataaaa tgtactctag agaatggaat 301 cgtgatgtcc aatacagagg cagagtccgg gtccagctca aacaggaaga tgggagcctc 361 tgccttgtac agttcccatc acgtaagtca gtaatgttgt atgcagcaga aatgatacct 421 aaactaaaaa caaggacaca aaaaacagga ggtgctgacc aaagtcttca acaaggagag 481 ggaagtaaaa aagggaaagg aaagaaaaag aagtaaccta gtatcagcat caagtatgtg 541 gtactactgt aagagacatg aatggagact tctaatttgt atcggaggga aacagaagct 601 ttttgtttgc atcatttaac tgaactgtga acccttgtgc ctctcatctt tatcatcgga 661 gttgacagtg aaacaaattt acatcagaag tttgcatctc gcgtatatgc cgtataaaag 721 aatttttttg tctttcaatg cagttttttg gaagaaaata tttttaaatg gacaatggac 781 tgtacaataa gttacttgaa ataagttgtt tcagataaat ttcaattaga tttaaaataa 841 acattatgtc caccttttaa gttaatgaaa taaaatttga aactgaaaaa aaaa // LOCUS HSRP26AA 773 bp RNA PRI 31-MAR-1993 DEFINITION H.sapiens mRNA for ribosomal protein L26. ACCESSION X69392 NID g36114 KEYWORDS ribosomal protein L26. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 773) AUTHORS Zaman,G.J. TITLE Sequence of a cDNA encoding human ribosomal protein L26 and of a cDNA probably encoding human ribosomal protein L6 JOURNAL Nucleic Acids Res. 21 (7), 1673 (1993) MEDLINE 93241958 REFERENCE 2 (bases 1 to 773) AUTHORS Zaman,G.J.R. TITLE Direct Submission JOURNAL Submitted (24-NOV-1992) G.J.R. Zaman, The Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, THE NETHERLANDS FEATURES Location/Qualifiers source 1..773 /organism="Homo sapiens" /db_xref="taxon:9606" /haplotype="polyploid" /tissue_type="lung" /cell_type="non-small cell lung cancer cell" /cell_line="SW-1573/1R50b" /clone="pR29" gene 7..444 /gene="RPL26" CDS 7..444 /gene="RPL26" /codon_start=1 /product="ribosomal protein L26" /db_xref="PID:g36115" /db_xref="SWISS-PROT:Q02877" /translation="MKFNPFVTSDRSKNRKRHFNAPSHIRRKIMSSPLSKELRQKYNV RSMPIRKDDEVQVVRGHYKGQQIGKVVQVYRKKYVIYIERVQREKANGTTVHVGIHPS KVVITRLKLDKDRKKILERKAKSRQVGKEKGKYKEETIEKMQE" BASE COUNT 240 a 177 c 184 g 172 t ORIGIN 1 gccaaaatga agtttaatcc ctttgtgact tccgaccgaa gcaagaatcg caaaaggcat 61 ttcaatgcac cttcccacat tcgaaggaag attatgtctt cccctctttc caaagagctg 121 agacagaagt acaacgtgcg atccatgccc atccgaaagg atgatgaagt tcaggttgta 181 cgtggacact ataaaggtca gcaaattggc aaagtagtcc aggtttacag gaagaaatat 241 gttatctaca ttgaacgggt gcagcgggaa aaggctaatg gcacaactgt ccacgtaggc 301 attcacccca gcaaggtggt tatcactagg ctaaaactgg acaaagaccg caaaaagatc 361 ctcgaacgga aagccaaatc tcgccaagta ggaaaggaaa agggcaaata caaggaagaa 421 accattgaga agatgcagga ataaagtaat cttatataca agctttgatt aaaacttgaa 481 acaaaaaaaa aggggaagaa acgacagcct cacttctgta tggactgctg atgtggcctg 541 ccatcctgtt cagcgggcat tgtctttgga gcagcaggag actaggatgc ctctcactca 601 catcgagttc ctggctggcc agctgctcag ggctcaggct ggggcctccc attgacatcc 661 tccccctaca ctccctctct gagcctccgt cgcccctcct gttgggtaag ggtgttgagt 721 gtgacttgtg ctgaaaacct ggttcatata taataaataa tggtgatgaa aag // LOCUS HSRP3GEN 625 bp RNA PRI 27-AUG-1997 DEFINITION H.sapiens mRNA for RP3 gene. ACCESSION Y11174 NID g2065172 KEYWORDS RP3 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 625) AUTHORS Pfitzenmeier,J. TITLE Direct Submission JOURNAL Submitted (11-FEB-1997) J. Pfitzenmeier, University Of The Saarland, Medical Dept. I, Oscar-Orth-Str., 66421 Homburg, FRG REFERENCE 2 (bases 1 to 625) AUTHORS Renner,C., Pfitzenmeier,J.P., Gerlach,K., Held,G., Ohnesorge,S., Sahin,U., Bauer,S. and Pfreundschuh,M. TITLE RP1, a new member of the adenomatous polyposis coli-binding EB1-like gene family, is differentially expressed in activated T cells JOURNAL J. Immunol. 159 (3), 1276-1283 (1997) MEDLINE 97376852 FEATURES Location/Qualifiers source 1..625 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T-cell" gene 25..621 /gene="RP3" CDS 25..621 /gene="RP3" /note="similarities with RP1 and EB1" /codon_start=1 /db_xref="PID:e311336" /db_xref="PID:g2065173" /translation="MPVNVYSTSVTSVNLSRHDMLAWVNDSLHLNYTKIEQLCSGAAY CQFMDMLFPGCVHLRKVKFQAKLEHEYIHNFKVLQAAFKKMGVDKIIPVEKLVKGKFQ DNFEFIQWFKKFFDANYDGKGYNPLLARQGQDVAPPPNPGDQIFNKSKKLIGTAVPQR TSPTGPKNMQTSGRLSNVAPPCILRKNPPSARNGGHET" BASE COUNT 171 a 167 c 148 g 139 t ORIGIN 1 gagccgcctc gtgcactctg gggtatgccc gtcaatgtgt actccacatc tgtgaccagt 61 gtaaatctga gtcgccatga tatgcttgca tgggtcaacg actccctgca cctcaactat 121 accaagatag aacagctttg ttcaggggca gcctactgcc agttcatgga catgctcttc 181 cccggctgtg tgcacttgag gaaagtgaag ttccaggcca aactagagca tgaatacatc 241 cacaacttca aggtgctgca agcagctttc aagaagatgg gtgttgacaa aatcattcct 301 gtagagaaat tagtgaaagg aaaattccaa gataattttg agtttattca gtggtttaag 361 aaattctttg acgcaaacta tgatggaaag ggttacaacc ctctgctggc gcggcagggc 421 caggacgtag cgccacctcc taacccaggt gatcagatct tcaacaaatc caagaaactc 481 attggcacag cagttccaca gaggacgtcc cccacaggcc caaaaaacat gcagacctct 541 ggccggctga gcaatgtggc ccccccctgc attctccgga agaatcctcc gtcagcccga 601 aatggcggcc atgagacttg atgcc // LOCUS HSRPII140 3748 bp RNA PRI 07-FEB-1992 DEFINITION H.sapiens mRNA for RNA polymerase II 140 kDa subunit. ACCESSION X63563 NID g36121 KEYWORDS RNA polymerase II; RNA polymerase II 140 kDa subunit. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3748) AUTHORS Acker,J., WINTZERITH,M., Vigneron,M. and Kedinger,C. TITLE Sequence of the human RNA polymerase II 140 kDa subunit JOURNAL Unpublished REFERENCE 2 (bases 1 to 3748) AUTHORS Kedinger,C. TITLE Direct Submission JOURNAL Submitted (27-DEC-1991) C. Kedinger, Lab. de Gen. Molec. des Eucaryotes, 11 Rue Humann, Strassbourg 67000, FRANCE FEATURES Location/Qualifiers source 1..3748 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HELA" CDS 44..3568 /codon_start=1 /product="RNA polymerase II 140 kDa subunit" /db_xref="PID:g36122" /db_xref="SWISS-PROT:P30876" /translation="MYDADEDMQYDEDDDEITPDLWQEACWIVISSYFDEKGLVRQQL DSFDEFIQMSVQRIVEDAPPIDLQAEAQHASGEVEEPPRYLLKFEQIYLSKPTHWERD GAPSPMMPNEARLRNLTYSAPLYVDITKTVIKEGEEQLQTQHQKTFIGKIPIMLRSTY CLLNGLTDRDLCELNECPLDPGGYFIINGSEKVLIAQEKMATNTVYVFAKKDSKYAYT GECRSCLENSSRPTSTIWVSMLARGGQGAKKSAIGQRIVATLPYIKQEVPIIIVFRAL GFVSDRDILEHIIYDFEDPEMMEMVKPSLDEAFVIQEQNVALNFIGSRGAKPGVTKEK RIKYAKEVLQKEMLPHVGVSDFCETKKAYFLGYMVHRLLLAALGRRELDDRDHYGNKR LDLAGPLLAFLFRGMFKNLLKEVRIYAQKFIDRGKDFNLELAIKTRIISDGLKYSLAT GNWGDQKKAHQARAGVSQVLNRLTFASTLSHLRRLNSPIGRDGKLAKPRQLHNTLWGM VCPAETPEGHAVGLVKNLALMAYISVGSQPSPILEFLEEWSMENLEEISPAAIADATK IFVNGCWVGIHKDPEQLMNTLRKLRRQMDIIVSEVSMIRDIREREIRIYTDAGRICRP LLIVEKQKLLLKKRHIDQLKEREYNNYSWQDLVASGVVEYIDTLEEETVMLAMTPDDL QEKEVAYCSTYTHCEIHPSMILGVCASIIPFPDHNQSPRNTYQSAMGKQAMGVYITNF HVRMDTLAHVLYYPQKPLVTTRSMEYLRFRELPAGINSIVAIASYTGYNQEDSVIMNR SAVDRGFFRSVFYRSYKEQESKKGFDQEEVFEKPTRETCQGMRHAIYDKLDDDGLIAP GVRVSGDDVIIGKTVTLPENEDELESTNRRYTKRDCSTFLRTSETGIVDQVMVTLNQE GYKFCKIRVRSVRIPQIGDKFASRHGQKGTCGIQYRQEDMPFTCEGITPDIIINPHAI PSRMTIGHLIECLQGKVSANKGEIGDATPFNDAVNVQKISNLLSDYGYHLRGNEVLYN GFTGRKITSQIFIGPTYYQRLKHMVDDKIHSRARGPIQILNRQPMEGRSRDGGLRFGE MERDCQIAHGAAQFLRERLFEASDPYQVHVCNLCGIMAIANTRTHTYECRGCRNKTQI SLVRMPYACKLLFQELMSMSIAPRMMSV" BASE COUNT 1123 a 670 c 860 g 1095 t ORIGIN 1 cttttgattt caagagttag gagctcgaga accgtttggc aatatgtacg acgcggatga 61 ggatatgcaa tatgatgagg atgatgatga aatcaccccg gatttgtggc aagaagcatg 121 ctggattgta atcagttcct attttgacga gaaaggcttg gttagacaac agctggattc 181 ttttgatgag tttattcaga tgtctgttca aagaattgtg gaagacgctc ctcctataga 241 cctacaggct gaagctcagc atgctagtgg agaagttgaa gaaccgccac gatatttgct 301 gaagtttgaa caaatttatc tttccaagcc tacccattgg gaaagagatg gtgctccttc 361 accaatgatg cccaatgaag ctagattaag gaatctcacg tattctgctc cgctttatgt 421 tgatataaca aaaacagtca ttaaagaagg tgaagaacaa cttcagactc agcatcagaa 481 aacttttata ggaaaaattc caattatgtt gcggtcaact tactgccttt tgaatggctt 541 gacagatcgt gatctttgtg agttaaatga atgccctttg gatcctggtg gctatttcat 601 tattaatgga tcagaaaagg ttctgattgc ccaagagaaa atggcaacaa acacagttta 661 tgtgtttgcc aaaaaggatt ctaaatatgc ctacacagga gagtgtagat catgtcttga 721 gaattcttcc cgacccacca gtactatatg ggttagcatg ctggcaagag gaggacaggg 781 tgccaagaag agtgctattg gtcagcgcat tgtggcaact ctaccatata tcaagcaaga 841 agttcccatc attattgtgt tcagagcatt aggttttgtg tccgacagag atattttaga 901 acatattatt tatgattttg aagatccaga gatgatggaa atggttaaac cttctctcga 961 tgaagctttt gtcatccaag aacagaatgt tgcactaaat ttcattggtt cacgaggagc 1021 aaagcctggt gttactaaag agaaaagaat taaatatgca aaggaagttt tacaaaaaga 1081 aatgctccct catgttggtg tcagtgattt ttgtgagacc aaaaaagcct atttcttggg 1141 atacatggtt cataggttac ttctggcagc tttgggtaga agagaactag atgacagaga 1201 tcactatgga aacaagagat tggatcttgc tgggccgctg cttgcattct tatttagagg 1261 tatgtttaag aatttgctta aagaagtgcg gatctatgca cagaaattta ttgatcgagg 1321 aaaggatttt aacttggagt tggcaattaa aacacggatc atatctgatg gcctaaaata 1381 ctctttagct actggaaact ggggtgatca aaagaaagct catcaagcca gagctggagt 1441 atctcaggtg ttaaaccgcc tgacttttgc gtctactctt tctcacctgc gtcgtttaaa 1501 ttctcctatt ggtagagacg gcaagctagc aaaaccaaga cagttgcata atacgttgtg 1561 gggaatggtg tgtcctgccg agaccccaga gggccatgct gtaggacttg tgaagaattt 1621 agccttgatg gcgtatattt cagttggatc tcaaccatct ccaattctgg aatttttaga 1681 agaatggagt atggaaaatt tagaagaaat ttctcctgca gctattgctg atgcaaccaa 1741 gatttttgtt aatggctgct gggttggaat acataaagat cccgaacaac ttatgaacac 1801 cctaaggaaa ttgagacgtc agatggacat cattgtgtct gaagtttcta tgatcagaga 1861 tattcgagag agggagattc ggatctatac ggatgcaggc cgtatttgta gaccacttct 1921 gattgtggaa aaacaaaagc tacttttgaa gaagaggcat attgaccaat tgaaagagag 1981 agaatataac aactatagtt ggcaggatct tgtggccagt ggggtagtgg agtatattga 2041 taccctggaa gaagaaacag tgatgcttgc aatgactcca gatgatttac aggagaaaga 2101 agtagcttat tgttccacat atacacactg tgagattcat ccctcaatga tccttggtgt 2161 ctgtgcatct attattccct ttcctgatca taaccagtcc cctagaaaca cataccagtc 2221 tgctatgggt aagcaggcta tgggagttta catcaccaac ttccatgttc gcatggacac 2281 attggcccat gttctctatt atcctcaaaa gccacttgtg actacacggt ctatggaata 2341 tctacgattt agagagctgc cagcaggcat caactcaatt gtggccattg catcatacac 2401 tggatataat caggaagact ctgttatcat gaatcgttca gctgtagacc gcggcttctt 2461 caggtctgtt ttctatcgct catacaaaga acaggagtct aaaaaaggat ttgatcaaga 2521 agaagttttt gagaagccta cacgtgaaac atgccagggc atgaggcatg ccatttacga 2581 caagctggat gatgatggtt tgatagctcc aggggttcgt gtatcaggag atgatgttat 2641 tataggcaaa acagtcacct tgcctgaaaa tgaagatgaa ttggagagca ccaatagacg 2701 ctataccaag agagactgta gcacttttct cagaactagt gagacgggca ttgtggatca 2761 ggttatggta actctcaatc aggaaggata taaattttgt aaaataaggg tacgctctgt 2821 taggattcca cagattggag acaaatttgc tagtcgacat ggtcaaaagg gtacttgtgg 2881 tattcagtat agacaagagg atatgccttt cacctgtgaa ggtatcaccc ctgatatcat 2941 catcaatccc catgccatcc cctctcgtat gactattggt cacttaattg aatgccttca 3001 agggaaggta tcggctaaca agggtgaaat tggtgatgcc actccattta atgatgctgt 3061 taacgtgcag aagatttcta atcttttatc tgattatggc tatcatctca gaggaaatga 3121 ggtcctgtac aatgggttca ctggtcgaaa aatcacatca caaatattta ttggccccac 3181 ttattaccag cgtttgaagc atatggtgga tgataagatt cactctcgtg ctaggggacc 3241 tattcagatc ctcaatagac agcccatgga gggtagatct cgtgatggtg gcctgcgttt 3301 tggagaaatg gaacgagatt gtcagattgc ccatggagca gcccagtttt taagggaaag 3361 attgtttgag gcatcagatc catatcaggt tcatgtttgc aatctttgtg gaataatggc 3421 gattgccaac accaggaccc atacatatga atgcaggggc tgccgcaata aaacccagat 3481 ttctttggtg cgaatgcctt acgcatgcaa actattgttt caggaactta tgtctatgag 3541 tattgcaccg cgaatgatga gtgtttagct attttacagg agtcaacaag ataattaaat 3601 atcttggtgt cttgtttcta ttgtgtggct ttataaaaat gacaaatatg tactgtgttg 3661 tgataaaaag tattttattt gtttaatgat atgcatgctt ttcttctgta aatatataat 3721 aaatttttgt agatagtctt gatgtgtg // LOCUS HSRPIILS 6732 bp RNA PRI 13-FEB-1992 DEFINITION H.sapiens mRNA for RNA polymerase II largest subunit. ACCESSION X63564 NID g36123 KEYWORDS RNA polymerase II; RNA polymerase II largest subunit. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6732) AUTHORS Wintzerith,M., Acker,J., Vicaire,S., Vigneron,M. and Kedinger,C. TITLE Complete sequence of the human RNA polymerase II largest subunit JOURNAL Nucleic Acids Res. 20 (4), 910 (1992) MEDLINE 92178992 REFERENCE 2 (bases 1 to 6732) AUTHORS Kedinger,C. TITLE Direct Submission JOURNAL Submitted (27-DEC-1991) C. Kedinger, Lab. de Gen. Molec. des Eucaryotes, 11 Rue Humann, Strassbourg 67000, FRANCE FEATURES Location/Qualifiers source 1..6732 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 387..6299 /codon_start=1 /product="RNA polymerase II largest subunit" /db_xref="PID:g36124" /db_xref="SWISS-PROT:P24928" /translation="MHGGGPPSGDSACPLRTIKRVQFGVLSPDELKRMSVTEGGIKYP ETTEGGRPKLGGLMDPRQGVIERTGRCQTCAGNMTECPGHFGHIELAKPVFHVGFLVK TMKVLRCVCFFCSKLLVDSNNPKIKDILAKSKGQPKKRLTHVYDLCKGKNICEGGEEM DNKFGVEQPEGDEDLTKEKGHGGCGRYQPRIRRSGLELYAEWKHVNEDSQEKKILLSP ERVHEIFKRISDEECFVLGMEPRYARPEWMIVTVLPVPPLSVRPAVVMQGSARNQDDL THKLADIVKINNQLRRNEQNGAAAHVIAEDVKLLQFHVATMVDNELPGLPRAMQKSGR PLKSLKQRLKGKEGRVRGNLMGKRVDFSARTVITPDPNLSIDQVGVPRSIAANMTFAE IVTPFNIDRLQELVRRGNSQYPGAKYIIRDNGDRIDLRFHPKPSDLHLQTGYKVERHM CDGDIVIFNRQPTLHKMSMMGHRVRILPWSTFRLNLSVTTPYNADFDGDEMNLHLPQS LETRAEIQELAMVPRMIVTPQSNRPVMGIVQDTLTAVRKFTKRDVFLERGEVMNLLMF LSTWDGKVPQPAILKPRPLWTGKQIFSLIIPGHINCIRTHSTHPDDEDSGPYKHISPG DTKVVVENGELIMGILCKKSLGTSAGSLVHISYLEMGHDITRLFYSNIQTVINNWLLI EGHTIGIGDSIADSKTYQDIQNTIKKAKQDVIEVIEKAHNNELEPTPGNTLRQTFENQ VNRILNDARDKTGSSAQKSLSEYNNFKSMVVSGAKGSKINISQVIAVVGQQNVEGKRI PFGFKHRTLPHFIKDDYGPESRGFVENSYLAGLTPTEFFFHAMGGREGLIDTAVKTAE TGYIQRRLIKSMESVMVKYDATVRNSINQVVQLRYGEDGLAGESVEFQNLATLKPSNK AFEKKFRFDYTNERALRRTLQEDLVKDVLSNAHIQNELEREFERMREDREVLRVIFPT GDSKVVLPCNLLRMIWNAQKIFHINPRLPSDLHPIKVVEGVKELSKKLVIVNGDDPLS RQAQENATLLFNIHLRSTLCSRRMAEEFRLSGEAFDWLLGEIESKFNQAIAHPGEMVG ALAAQSLGEPATQMTLNTFHYAGVSAKNVTLGVPRLKELINISKKPKTPSLTVFLLGQ SARDAERAKDILCRLEHTTLRKVTANTAIYYDPNPQSTVVAEDQEWVNVYYEMPDFDV ARISPWLLRVELDRKHMTDRKLTMEQIAEKINAGFGDDLNCIFNDDNAEKLVLRIRIM NSDENKMQEEEEVVDKMDDDVFLRCIESNMLTDMTLQGIEQISKVYMHLPQTDNKKKI IITEDGEFKALQEWILETDGVSLMRVLSEKDVDPVRTTSNDIVEIFTVLGIEAVRKAL ERELYHVISFDGSYVNYRHLALLCDTMTCRGHLMAITRHGVNRQDTGPLMKCSFEETV DVLMEAAAHGESDPMKGVSENIMLGQLAPAGTGCFDLLLDAEKCKYGMEIPTNIPGLG AAGPTGMFFGSAPSPMGGISPAMTPWNQGATPAYGAWSPSVGSGMTPGAAGFSPSAAS DASGFSPGYSPAWSPTPGSPGSPGPSSPYIPSPGGAMSPSYSPTSPAYEPRSPGGYTP QSPSYSPTSPSYSPTSPSYSPTSPNYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTS PSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPS YSPTSPSYSPTSPSYSPTSPNYSPTSPNYTPTSPSYSPTSPSYSPTSPNYTPTSPNYS PTSPSYSPTSPSYSPTSPSYSPSSPRYTPQSPTYTPSSPSYSPSSPSYSPTSPKYTPT SPSYSPSSPEYTPTSPKYSPTSPKYSPTSPKYSPTSPTYSPTTPKYSPTSPTYSPTSP VYTPTSPKYSPTSPTYSPTSPKYSPTSPTYSPTSPKGSTYSPTSPGYSPTSPTYSLTS PAISPDDSDEEN" BASE COUNT 1512 a 2090 c 1685 g 1445 t ORIGIN 1 gagagcgcgg ccgggacggt tggagaagaa ggcggctccc cggaaggggg agagacaaac 61 tgccgtaacc tctgccgttc aggaacccgg ttacttattt attcgttacc ctttttcttc 121 ttcctccccc aaaaaccttt tccttttccc ttcttttttt ttcctttttg ggagctgaaa 181 aatttccggt aagggaaaga agggctcctt tcgctcctta tttcgccgcc tccttccctc 241 cgccaccttc ccctcctccg gctttttcct cccaactcgg ggaggtcctt cccggtggcc 301 gccctgacga ggtctgagca cctaggcgga ggcggcgcag gctttttgta gtgaggtttg 361 cgcctgcgca ggcgcctgcc tccgccatgc acgggggtgg ccccccctcg ggggacagcg 421 catgcccgct gcgcaccatc aagagagtcc agttcggagt cctgagtccg gatgaactga 481 agcgaatgtc tgtgacggag ggtggcatca aatacccaga gacgactgag ggaggccgcc 541 ccaagcttgg ggggctgatg gacccgaggc agggggtgat tgagcggact ggccgctgcc 601 aaacatgtgc aggaaacatg acagagtgtc ctggccactt tggccacatt gaactggcca 661 agcctgtgtt tcacgtgggc ttcctggtga agacaatgaa agttttgcgc tgtgtctgct 721 tcttctgctc caaactgctt gtggactcta acaacccaaa gatcaaggat atcctggcta 781 agtccaaggg acagcccaag aagcggctca cacatgtcta cgacctttgc aagggcaaaa 841 acatatgcga gggtggggag gagatggaca acaagttcgg tgtggaacaa cctgagggtg 901 acgaggatct gaccaaagaa aagggccatg gtggctgtgg gcggtaccag cccaggatcc 961 ggcgttctgg cctagagctg tatgcggaat ggaagcacgt taatgaggac tctcaggaga 1021 agaagatcct gctgagtcca gagcgagtgc atgagatctt caaacgcatc tcagatgagg 1081 agtgttttgt gctgggcatg gagccccgct atgcacggcc agagtggatg attgtcacag 1141 tgctgcctgt gcccccgctc tccgtgcggc ctgctgttgt gatgcagggc tctgcccgta 1201 accaggatga cctgactcac aaactggctg acatcgtgaa gatcaacaat cagctgcggc 1261 gcaatgagca gaacggcgca gcggcccatg tcattgcaga ggatgtgaag ctcctccagt 1321 tccatgtggc caccatggtg gacaatgagc tgcctggctt gccccgtgcc atgcagaagt 1381 ctgggcgtcc cctcaagtcc ctgaagcagc ggttgaaggg caaggaaggc cgggtgcgag 1441 ggaacctgat gggcaaaaga gtggacttct cggcccgtac tgtcatcacc cccgacccca 1501 acctctccat tgaccaggtt ggcgtgcccc gctccattgc tgccaacatg acctttgcgg 1561 agattgtcac ccccttcaac attgacagac ttcaagaact agtgcgcagg gggaacagtc 1621 agtacccagg cgccaagtac atcatccgag acaatggtga tcgcattgac ttgcgtttcc 1681 accccaagcc cagtgacctt cacctgcaga ccggctataa ggtggaacgg cacatgtgtg 1741 atggggacat tgttatcttc aaccggcagc caactctgca caaaatgtcc atgatggggc 1801 atcgggtccg cattctccca tggtctacct ttcgcttgaa tcttagcgtg acaactccgt 1861 acaatgcaga ctttgacggg gatgagatga acttgcacct gccacagtct ctggagacgc 1921 gagcagagat ccaggagctg gccatggttc ctcgcatgat tgtcaccccc cagagcaatc 1981 ggcctgtcat gggtattgtg caggacacac tcacagcagt gcgcaaattc accaagagag 2041 acgtcttcct ggagcggggt gaagtgatga acctcctgat gttcctgtcg acgtgggatg 2101 ggaaggtccc acagccggcc atcctaaagc cccggcccct gtggacaggc aagcaaatct 2161 tctccctcat catacctggt cacatcaatt gtatccgtac ccacagcacc catcccgatg 2221 atgaagacag tggcccttac aagcacatct ctcctgggga caccaaggtg gtggtggaga 2281 atggggagct gatcatgggc atcctgtgta agaagtctct gggcacgtca gctggctccc 2341 tggtccacat ctcctaccta gagatgggtc atgacatcac tcgcctcttc tactccaaca 2401 ttcagactgt cattaacaac tggctcctca tcgagggtca tactattggc attggggact 2461 ccattgctga ttctaagact taccaggaca ttcagaacac tattaagaag gccaagcagg 2521 acgtaataga ggtcatcgag aaggcacaca acaatgagct ggagcccacc ccagggaaca 2581 ctctgcggca gacgtttgag aatcaggtga accgcattct taacgatgcc cgagacaaga 2641 ctggctcctc tgctcagaaa tccctgtctg aatacaacaa cttcaagtct atggtcgtgt 2701 ccggagctaa aggttccaag attaacatct cccaggtcat tgctgtcgtt ggacagcaga 2761 acgtcgaggg caagcggatt ccatttggct tcaagcaccg gactctgcct cacttcatca 2821 aggatgacta cgggcctgag agccgtggct ttgtggagaa ctcctaccta gccggcctca 2881 cacccactga gttctttttc cacgccatgg ggggtcgtga ggggctcatt gacacggctg 2941 tcaagactgc tgagactgga tacatccagc ggcggctgat caagtccatg gagtcagtga 3001 tggtgaagta cgacgcgact gtgcggaact ccatcaacca ggtggtgcag ctgcgctacg 3061 gcgaagacgg cctggcaggc gagagcgttg agttccagaa cctggctacg cttaagcctt 3121 ccaacaaggc ttttgagaag aagttccgct ttgattatac caatgagagg gccctgcggc 3181 gcactctgca ggaggacctg gtgaaggacg tgctgagcaa cgcacacatc cagaacgagt 3241 tggagcggga atttgagcgg atgcgggagg atcgggaggt gctcagggtc atcttcccaa 3301 ctggagacag caaggtcgtc ctcccctgta acctgctgcg gatgatctgg aatgctcaga 3361 aaatcttcca catcaaccca cgccttccct ccgacctgca ccccatcaaa gtggtggagg 3421 gagtcaagga attgagcaag aagctggtga ttgtgaatgg ggatgaccca ctaagtcgac 3481 aggcccagga aaatgccacg ctgctcttca acatccacct gcggtccacg ttgtgttccc 3541 gccgcatggc agaggagttt cggctcagtg gggaggcctt cgactggctg cttggggaga 3601 ttgagtccaa gttcaaccaa gccattgcgc atcccgggga aatggtgggg gctctggctg 3661 cgcagtccct tggagaacct gccacccaga tgaccttgaa taccttccac tatgctggtg 3721 tgtctgccaa gaatgtgacg ctgggtgtgc cccgacttaa ggagctcatc aacatttcca 3781 agaagccaaa gactccttcg cttactgtct tcctgttggg ccagtccgct cgagatgctg 3841 agagagccaa ggatattctg tgccgtctgg agcatacaac gttgaggaag gtgactgcca 3901 acacagccat ctactatgac cccaaccccc agagcacggt ggtggcagag gatcaggaat 3961 gggtgaatgt ctactatgaa atgcctgact ttgatgtggc ccgaatctcc ccctggctgt 4021 tgcgggtgga gctggatcgg aagcacatga ctgaccggaa gctcaccatg gagcagattg 4081 ctgaaaagat caatgctggt tttggtgacg acttgaactg catctttaat gatgacaatg 4141 cagagaagct ggtgctccgt attcgcatca tgaacagcga tgagaacaag atgcaagagg 4201 aggaagaggt ggtggacaag atggatgatg atgtcttcct gcgctgcatc gagtccaaca 4261 tgctgacaga tatgaccctg cagggcatcg agcagatcag caaggtgtac atgcacttgc 4321 cacagacaga caacaagaag aagatcatca tcacggagga tggggaattc aaggccctgc 4381 aggagtggat cctggagacg gacggcgtga gcttgatgcg ggtgctgagt gagaaggacg 4441 tggaccccgt acgcaccacg tccaatgaca ttgtggagat cttcacggtg ctgggcattg 4501 aagccgtgcg gaaggccctg gagcgggagc tgtaccacgt catctccttt gatggctcct 4561 atgtcaatta ccgacacttg gctctcttgt gtgataccat gacctgtcgt ggccacttga 4621 tggccatcac ccgacacgga gtcaaccgcc aggacacagg accactcatg aagtgttcct 4681 ttgaggaaac ggtggacgtg cttatggaag cagccgcaca cggtgagagt gaccccatga 4741 agggggtctc tgagaatatc atgctgggcc agctggctcc ggccggcact ggctgctttg 4801 acctcctgct tgatgcagag aagtgcaagt atggcatgga gatccccacc aatatccccg 4861 gcctgggggc tgctggaccc accggcatgt tctttggttc agcacccagt cccatgggtg 4921 gaatctctcc tgccatgaca ccttggaacc agggtgcaac ccctgcctat ggcgcctggt 4981 cccccagtgt tgggagtgga atgaccccag gggcagccgg tttctctccc agtgctgcgt 5041 cagatgccag cggcttcagc ccaggttact cccctgcctg gtctcccaca ccgggctccc 5101 cggggtcccc aggtccctca agcccctaca tcccttcacc aggtggcgcc atgtctccca 5161 gctactcgcc aacgtcacct gcctacgagc cccgctctcc tgggggctac acaccccaga 5221 gtccctctta ttcccccact tcaccctcct actcccctac ctctccatcc tattctccaa 5281 ccagtcccaa ctatagtccc acatcaccca gctattcgcc aacgtcaccc agctactcac 5341 cgacctctcc cagctactca cccacctctc ccagctactc gcccacctct cccagctatt 5401 cgcccacctc tcccagctac tcacccactt cccctagcta ttcgcccact tcccctagct 5461 actcgccaac gtctcccagc tactcgccga catctcccag ctactcgcca acttcaccca 5521 gctattctcc cacttctccc agctactcac ctacctctcc aagctattca cccacctccc 5581 ccagctactc acccacttcc ccaagttact cacccaccag cccgaactat tctccaacca 5641 gtcccaatta caccccaaca tcacccagct acagcccgac atcacccagc tattccccta 5701 ctagtcccaa ctacacacct accagcccta actacagccc aacctctcca agctactctc 5761 caacatcacc cagctattcc ccgacctcac caagttactc cccttccagc ccacgataca 5821 caccacagtc tccaacctat accccaagct cacccagcta cagccccagt tcgcccagct 5881 acagcccaac ctcacccaag tacaccccaa ccagtccttc ttatagtccc agctccccag 5941 agtatacccc aacctctccc aagtactcac ctaccagtcc caaatattca cccacctctc 6001 ccaagtactc gcctaccagt cccacctatt cacccaccac cccaaaatac tccccaacat 6061 ctcctactta ttccccaacc tctccagtct acaccccaac ctctcccaag tactcaccta 6121 ctagccccac ttactcgccc acttccccca agtactcgcc caccagcccc acctactcgc 6181 ccacctcccc caaaggctca acctactctc ccacttcccc tggttactcg cccaccagcc 6241 ccacctacag tctcacaagc ccggctatca gcccggatga cagtgacgag gagaactgag 6301 ggcacgtggg gtgcggcagc gggctagggc ccagggcagc ttgcccgtgc tgccgtgcag 6361 ttcttgcctc cctcacgggg cgtcaccccc agcccagctc cgttgtacat aaataccttg 6421 tgacagagct cccggtgaac ttctggatcc cgtttctgat gcagattctt gtcttgttct 6481 ccacttgtgc tgttagaact cactggccca gtggtgttct acctcctacc ccacccaccc 6541 cctgcctgtc cccaaattga agatccttcc ttgcctgtgg cttgatgcgg ggcgggtaaa 6601 gggtatttta acttaggggt agttcctgct gtgagtggtt acagctgatc ctcgggaaga 6661 acaaagctaa agctgccttt tgtctgttat tttatttttt tgaagtttaa ataaagttta 6721 ctaattttga cc // LOCUS HSRPL11 591 bp RNA PRI 29-AUG-1995 DEFINITION H.sapiens mRNA for ribosomal protein L11. ACCESSION X79234 NID g495125 KEYWORDS ribosomal protein L11. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 591) AUTHORS Mishin,V.P., Filipenko,M.L., Muravlev,A.I., Karpova,G.G. and Mertvetsov,N.P. TITLE Cloning and determination of the primary structure of DNA complementary to the mRNA of human ribosomal protein L11 (letter) JOURNAL Bioorg. Khim. 21 (2), 158-160 (1995) MEDLINE 95267091 REFERENCE 2 (bases 1 to 591) AUTHORS Filipenko,M.L. TITLE Direct Submission JOURNAL Submitted (13-MAY-1994) M.L. Filipenko, Institute of Bioorganic Chemistry, Lavrentjeva 8, Novosibirsk, 630090 Russia, USSR FEATURES Location/Qualifiers source 1..591 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /sex="female" gene 1..537 /gene="rpl11" CDS 1..537 /gene="rpl11" /codon_start=1 /product="ribosomal protein L11" /db_xref="PID:g495126" /db_xref="SWISS-PROT:P39026" /translation="MAQDQGEKENPMRELRIRKLCLNICVGESGGRLTRAAKVLEQLT GQTPVFSKARYTVRSFGIRRNEKIAVHCAVRGAKAEEILEKGLKVRELELRKNNFSDT GNFGFGIQEHIDLGIEYDPSIGIYGLDFYVVLGRPGFSIADKKRRTGCIGAKHRISKE EAMRWFQQKYDGIILPGK" polyA_signal 562..567 BASE COUNT 174 a 124 c 168 g 125 t ORIGIN 1 atggcgcagg atcaaggtga aaaggagaac cccatgcggg aacttcgcat ccgcaaactc 61 tgtctcaaca tctgtgttgg ggagagtgga ggcagactga cgcgagcagc caaggtgttg 121 gagcagctca cagggcagac ccctgtgttt tccaaagcta gatacactgt cagatccttt 181 ggcatccgga gaaatgaaaa gattgctgtc cactgcgcag ttcgaggggc caaggcagaa 241 gaaatcttgg agaagggtct aaaggtgcgg gagttggagt taagaaaaaa caacttctca 301 gatactggaa actttggttt tgggatccag gaacacattg atctgggtat cgaatatgac 361 ccaagcattg gtatctacgg cctggacttc tatgtggtgc tgggtaggcc aggtttcagc 421 atcgcagaca agaagcgcag gacaggctgc attggggcca aacacagaat cagcaaagag 481 gaggccatgc gctggttcca gcagaagtat gatgggatca tccttcctgg caaataaatt 541 cccgtttcca tccaaaagag caataaaaag ttttcagtga aatgtgcaaa a // LOCUS HSRPL19 698 bp RNA PRI 11-MAY-1992 DEFINITION H.sapiens mRNA for ribosomal protein L19. ACCESSION X63527 NID g36127 KEYWORDS ribosomal protein L19. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 698) AUTHORS Kumabe,T. TITLE Direct Submission JOURNAL Submitted (06-DEC-1991) T. Kumabe, Tohoku Univ Gene Research Center, 1-1 Tsutsumidori-Amamiyamachi, Aobaku, Sendai, 981, JAPAN REFERENCE 2 (bases 1 to 698) AUTHORS Kumabe,T., Sohma,Y. and Yamamoto,T. TITLE Human cDNAs encoding elongation factor 1 gamma and the ribosomal protein L19 JOURNAL Nucleic Acids Res. 20 (10), 2598 (1992) MEDLINE 92285147 FEATURES Location/Qualifiers source 1..698 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="monocyte" CDS 29..619 /codon_start=1 /product="ribosomal protein L19" /db_xref="PID:g36128" /db_xref="SWISS-PROT:P14118" /translation="MSMLRLQKRLASSVLRCGKKKVWLDPNETNEIANANSRQQIRKL IKDGLIIRKPVTVHSRARCRKNTLARRKGRHMGIGKRKGTANARMPEKVTWMRRMRIL RRLLRRYRESKKIDRHMYHSLYLKVKGNVFKNKRILMEHIHKLKADKARKKLLADQAE ARRSKTKEARKRREERLQAKKEEIIKTLSKEEETKK" BASE COUNT 204 a 181 c 187 g 126 t ORIGIN 1 ttttcctttc gctgctgcgg ccgcagccat gagtatgctc aggcttcaga agaggctcgc 61 ctctagtgtc ctccgctgtg gcaagaagaa ggtctggtta gaccccaatg agaccaatga 121 aatcgccaat gccaactccc gtcagcagat ccggaagctc atcaaagatg ggctgatcat 181 ccgcaagcct gtgacggtcc attcccgggc tcgatgccgg aaaaacacct tggcccgccg 241 gaagggcagg cacatgggca taggtaagcg gaagggtaca gccaatgccc gaatgccaga 301 gaaggtcaca tggatgagga gaatgaggat tttgcgccgg ctgctcagaa gataccgtga 361 atctaagaag atcgatcgcc acatgtatca cagcctgtac ctgaaggtga aggggaatgt 421 gttcaaaaac aagcggattc tcatggaaca catccacaag ctgaaggcag acaaggcccg 481 caagaagctc ctggctgacc aggctgaggc ccgcaggtct aagaccaagg aagcacgcaa 541 gcgccgtgaa gagcgcctcc aggccaagaa ggaggagatc atcaagactt tatccaagga 601 ggaagagacc aagaaataaa acctcccact ttgtctgtac atactggcct ctgtgattac 661 atagatcagc cattaaaata aaacaagcct taatctgc // LOCUS HSRPL29 630 bp RNA PRI 03-AUG-1996 DEFINITION H.sapiens mRNA for ribosomal protein L29. ACCESSION Z49148 NID g793842 KEYWORDS ribosomal protein L29. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 630) AUTHORS Liew,C. TITLE Direct Submission JOURNAL Submitted (28-APR-1995) Liew C., University of Toronto, Clinical Biochemistry, Banting Institute, 100 College Street, Toronto, Ontario, Canada, M5G 1L5 REFERENCE 2 (bases 1 to 630) AUTHORS Law,P.T., Tsui,S.K., Lam,W.Y., Luk,S.C., Hwang,D.M., Liew,C.C., Lee,C.Y., Fung,K.P. and Waye,M.M. TITLE A novel cDNA encoding a human homologue of ribosomal protein L29 JOURNAL Biochim. Biophys. Acta, Gene Struct. Expr. 1305 (3), 105-108 (1996) MEDLINE 96180309 FEATURES Location/Qualifiers source 1..630 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="Expressed sequence tags" /clone_lib="dbEST" misc_feature 1..630 /note="Sequence derived from ESTs, gb R30698, R30705, T55369, T48831, T34653, T57508, T32287, T32887, T40672, T54341, T35147, T48830, T40846, T58263, T39577, T12242, and dbj D19752." CDS 30..509 /codon_start=1 /product="ribosomal protein L29" /db_xref="PID:g793843" /db_xref="SWISS-PROT:P47914" /translation="MAKSKNHTTHNQSRKWHRNGIKKPRSQRYESLKGVDPKFLRNMR FAKKHNKKGLKKMQANNAKAMSARAEAIKALVKPKEVKPKIPKGVSRKLDRLAYIAHP KLGKRARARIAKGLRLCAPKAKAKAKAKDQTKAQAAAPASVPAQAPKRTQAPTKASE" polyA_site 613..618 BASE COUNT 176 a 192 c 165 g 97 t ORIGIN 1 gttcgggagc cgcggcttat ggtgcagaca tggccaagtc caagaaccac accacacaca 61 accagtcccg aaaatggcac agaaatggta tcaagaaacc ccgatcacaa agatacgaat 121 ctcttaaggg ggtggacccc aagttcctga ggaacatgcg ctttgccaag aagcacaaca 181 aaaagggcct aaagaagatg caggccaaca atgccaaggc catgagtgca cgtgccgagg 241 ctatcaaggc cctcgtaaag cccaaggagg ttaagcccaa gatcccaaag ggtgtcagcc 301 gcaagctcga tcgacttgcc tacattgccc accccaagct tgggaagcgt gctcgtgccc 361 gtattgccaa ggggctcagg ctgtgcgcgc caaaggccaa ggccaaggcc aaggccaagg 421 atcaaaccaa ggcccaggct gcagccccag cttcagttcc agctcaggct cccaaacgta 481 cccaggcccc tacaaaggct tcagagtaga tatctctgcc aacatgagga cagaaggact 541 ggtgcgaccc cccacccccg cccctgggct accatctgca tggggctggg gtcctcctgt 601 gctatttgta caaataaacc tgaggcagga // LOCUS HSRPL30 357 bp RNA PRI 17-MAY-1994 DEFINITION H.sapiens mRNA for ribosomal protein L30. ACCESSION X79238 NID g488414 KEYWORDS ribosomal protein L30. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 357) AUTHORS Filipenko,M.L. TITLE Direct Submission JOURNAL Submitted (13-MAY-1994) M.L. Filipenko, Institute of Bioorganic Chemistry, Lavrentjeva 8, Novosibirsk, 630090 Russia, USSR REFERENCE 2 (bases 1 to 357) AUTHORS Filipenko,M.L. and Karpova,G.G. JOURNAL Unpublished FEATURES Location/Qualifiers source 1..357 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /sex="female" gene 1..348 /gene="rpl30" CDS 1..348 /gene="rpl30" /codon_start=1 /product="ribosomal protein L30" /db_xref="PID:g488415" /db_xref="SWISS-PROT:P04645" /translation="MVAAKKTKKSLESINSRLQLVMKSGKYVLGYKQTLKMIRQGKAK LVILANNCPALRKSEIEYYAMLAKTGVHHYSGNNIELGTACGKYYRVCTLAIIDPGDS DIIRSMPEQTGEK" BASE COUNT 120 a 70 c 90 g 77 t ORIGIN 1 atggtggctg caaagaagac gaaaaagtcg ctggagtcga tcaactctag gctccaactc 61 gttatgaaaa gtgggaagta cgtcctgggg tacaagcaga ctctgaagat gatcagacaa 121 ggcaaagcga aattggtcat tctcgctaac aactgcccag ctttgaggaa atctgaaata 181 gagtactatg ctatgttggc taaaactggt gtccatcact acagtggcaa taatattgaa 241 ctgggcacag catgcggaaa atactacaga gtgtgcacac tggctatcat tgatccaggt 301 gactctgaca tcattagaag catgccagaa cagactggtg agaagtaaac aagaaag // LOCUS HSRPL31 414 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for ribosomal protein L31. ACCESSION X15940 X16953 NID g36129 KEYWORDS ribosomal protein; ribosomal protein L31. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 414) AUTHORS Nobori,T. TITLE Direct Submission JOURNAL Submitted (02-AUG-1989) Nobori T., Scripps Clinic Research Foundation, 10666 North Torreey Pines Rd, La Jolla CA 92037, U S A REFERENCE 2 (bases 1 to 414) AUTHORS Nobori,T., Hexdall,L.E. and Carson,D.A. TITLE cDNA sequence of human ribosomal protein L31 JOURNAL Nucleic Acids Res. 17 (17), 7105 (1989) MEDLINE 89386063 REFERENCE 3 (bases 137 to 259) AUTHORS Chester,K.A., Robson,L., Begent,R.H., Talbot,I.C., Pringle,J.H., Primrose,L., Macpherson,A.J., Boxer,G., Southall,P. and Malcolm,A.D. TITLE Identification of a human ribosomal protein mRNA with increased expression in colorectal tumours JOURNAL Biochim. Biophys. Acta 1009 (3), 297-300 (1989) MEDLINE 90089407 FEATURES Location/Qualifiers source 1..414 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 8..385 /note="ribosomal protein L31 (AA 1-125)" /codon_start=1 /db_xref="PID:g36130" /db_xref="SWISS-PROT:P12947" /translation="MAPAKKGGEKKKGRSAINEVVTREYTINIHKRIHGVGFKKRAPR ALKEIRKFAMKEMGTPDVRIDTRLNKAVWAKGIRNVPYRIRVRLSRKRNEDEDSPNKL YTLVTYVPVTTFKNLQTVNVDEN" BASE COUNT 137 a 92 c 100 g 85 t ORIGIN 1 ccgcagaatg gctcccgcaa agaagggtgg cgagaagaaa aagggccgtt ctgccatcaa 61 cgaagtggta acccgagaat acaccatcaa cattcacaag cgcatccatg gagtgggctt 121 caagaagcgt gcacctcggg cactcaaaga gattcggaaa tttgccatga aggagatggg 181 aactccagat gtgcgcattg acaccaggct caacaaagct gtctgggcca aaggaataag 241 gaatgtgcca taccgaatcc gtgtgcggct gtccagaaaa cgtaatgagg atgaagattc 301 accaaataag ctatatactt tggttaccta tgtacctgtt accactttca aaaatctaca 361 gacagtcaat gtggatgaga actaatcgct gatcaaataa cgttataaaa ttgc // LOCUS HSRPL32 505 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for ribosomal protein L32. ACCESSION X03342 NID g36131 KEYWORDS ribosomal protein; ribosomal protein L32. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 505) AUTHORS Young,J.A. and Trowsdale,J. TITLE A processed pseudogene in an intron of the HLA-DP beta 1 chain gene is a member of the ribosomal protein L32 gene family JOURNAL Nucleic Acids Res. 13 (24), 8883-8891 (1985) MEDLINE 86093685 COMMENT Data kindly reviewed (02-JUN-1986) by J. Trowsdale. FEATURES Location/Qualifiers source 1..505 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 35..442 /note="rpL32 (aa 1-135)" /codon_start=1 /db_xref="PID:g36132" /db_xref="SWISS-PROT:P02433" /translation="MAALRPLVKPKIVKKRTKKFIRHQSDRYVKIKRNWRKPRGIDNR VRRRFKGQILMPNIGYGSNKKTKHMLPSGFRKFLVHNVKELEVLLMCNKSYCAEIAHN VSSKNRKAIVERAAQLAIRVTNPNARLRSEENE" BASE COUNT 150 a 134 c 119 g 102 t ORIGIN 1 ccgaggaggt ggcagccatc tccttctcgg catcatggcc gccctcagac cccttgtgaa 61 gcccaagatc gtcaaaaaga gaaccaagaa gttcatccgg caccagtcag accgatatgt 121 caaaattaag cgtaactggc ggaaacccag aggcattgac aacagggttc gtagaagatt 181 caagggccag atcttgatgc ccaacattgg ttatggaagc aacaaaaaaa caaagcacat 241 gctgcccagt ggcttccgga agttcctggt ccacaacgtc aaggagctgg aagtgctgct 301 gatgtgcaac aaatcttact gtgccgagat cgctcacaat gtttcctcca agaaccgcaa 361 agccatcgtg gaaagagctg cccaactggc catcagagtc accaacccca atgccaggct 421 gcgcagtgaa gaaaatgagt aggcagctca tgtgcacgtt ttctgtttaa ataaatgtaa 481 aaactgccat ctggcatctt ccttc // LOCUS HSRPL37A 349 bp RNA PRI 13-OCT-1992 DEFINITION H.sapiens mRNA for ribosomal protein L37a. ACCESSION X66699 NID g36133 KEYWORDS ribosomal protein; ribosomal protein L37a. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 349) AUTHORS Hoof,T., Fislage,R. and Tummler,B. TITLE Primary sequence of the human ribosomal protein L37a JOURNAL Nucleic Acids Res. 20 (20), 5475 (1992) MEDLINE 93065220 REFERENCE 2 (bases 1 to 349) AUTHORS Hoof,T. TITLE Direct Submission JOURNAL Submitted (08-JUL-1992) T. Hoof, Medizinische Hochschule Hannover, OE 4350 Institut f Biophysikalische Chemie, Konstanty Gutschow Str 8, 3000 Hannover 61, FRG FEATURES Location/Qualifiers source 1..349 /organism="Homo sapiens" /isolate="patient NH" /db_xref="taxon:9606" /tissue_type="nasal polyps" /cell_type="resp. epith." /clone_lib="lambda gt10/WB12_87" /clone="RFTHSC11" CDS 5..283 /codon_start=1 /product="ribosomal protein L37a" /db_xref="PID:g36134" /db_xref="SWISS-PROT:P12751" /translation="MAKRTKKVGIVGKYGTRYGASLRKMVKKIEISQHAKYTCSFCGK TKMKRRAVGIWHCGSCMKTVAGGAWTYNTTSAVTVKSAIRRLKELKDQ" BASE COUNT 105 a 81 c 90 g 73 t ORIGIN 1 cgacatggcc aaacgtacca agaaagtcgg gatcgtcggt aaatacggga cccgctatgg 61 ggcctccctc cggaaaatgg tgaagaaaat tgaaatcagc cagcacgcca agtacacttg 121 ctctttctgt ggcaaaacca agatgaagag acgagctgtg gggatctggc actgtggttc 181 ctgcatgaag acagtggctg gcggtgcctg gacgtacaat accacttccg ctgtcacggt 241 aaagtccgcc atcagaagac tgaaggagtt gaaagaccag tagacgctcc tctactcttt 301 gagacatcac tggcctataa taaatgggtt aatttatgta acaaaaaaa // LOCUS HSRPL38 372 bp mRNA PRI 16-JAN-1998 DEFINITION H.sapiens gene for ribosomal protein L38. ACCESSION Z26876 NID g407422 KEYWORDS ribosomal protein; ribosomal protein L38. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 372) AUTHORS Espinosa,L., Martin,M., Nicolas,A., Fabre,M. and Navarro,E. TITLE Primary sequence of the human, lysine-rich, ribosomal protein RPL38 and detection of an unusual RPL38 processed pseudogene in the promoter region of the type-1 angiotensin II receptor gene JOURNAL Biochim. Biophys. Acta 1354 (1), 58-64 (1997) MEDLINE 98041641 REFERENCE 2 (bases 1 to 372) AUTHORS Navarra,E. TITLE Direct Submission JOURNAL Submitted (08-OCT-1993) Navarro E., Institut Municipal Investigacio Medica, Immubology, Dr Aiguader 80, Barcelona, SPAIN, E-08003 FEATURES Location/Qualifiers source 1..372 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="Female" /clone="HCoE30" /cell_type="Mucin producing intestinal epithelial cell line" /cell_line="HT29-M6" /clone_lib="Lambda ZAP II cDNA library" misc_feature 1..21 /note="poly-T run, present in the 5'UTR of different r-protein cDNAs" /function="putative regulatory region" CDS 111..323 /standard_name="human ribosomal protein L38" /codon_start=1 /product="ribosomal protein" /db_xref="PID:g407423" /db_xref="SWISS-PROT:P23411" /translation="MPRKIEEIKDFLLTARRKDAKSVKIKKNKDNVKFKVRCSRYLYT LVITDKEKAEKLKQSLPPGLAVKELK" polyA_signal 353..358 /note="Non-canonical form (AUUAAA)" polyA_site 372 BASE COUNT 110 a 78 c 91 g 93 t ORIGIN 1 tttttttttt tttttttttt tatggggtgt gataggtgtg agtgtctcta gggtgatacg 61 tgggtgagaa aggtcctggt ccgcgccaga gcccagcgcg cctcgtcgcc atgcctcgga 121 aaattgagga aatcaaggac ttcctgctca cagcccgacg aaaggatgcc aaatctgtca 181 agatcaagaa aaataaggac aacgtgaagt ttaaagttcg atgcagcaga tacctttaca 241 ccctggtcat cactgacaaa gagaaggcag agaaactgaa gcagtccctg ccccccggtt 301 tggcagtgaa ggaactgaaa tgaaccagac acactgattg gaactgtatt atattaaaat 361 actaaaaatc ct // LOCUS HSRPL3A 1272 bp RNA PRI 07-JUL-1993 DEFINITION H.sapiens mRNA for ribosomal protein L3. ACCESSION X73460 NID g313658 KEYWORDS ribosomal protein L3. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1272) AUTHORS Leffers,H. TITLE Complete coding sequence of human ribosomal protein L3 mRNA JOURNAL Unpublished REFERENCE 2 (bases 1 to 1272) AUTHORS Leffers,H. TITLE Direct Submission JOURNAL Submitted (21-JUN-1993) H. Leffers, Institute of Medical Biochemistry and Danish Centre for Human Genome Research, Ole Worms Alle 170, Aarhus University, 8000 Aarhus C, DENMARK FEATURES Location/Qualifiers source 1..1272 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="AMA" /clone="28-5-r19" /clone_lib="lambda ZapII /AMA cDNA" CDS 7..1218 /codon_start=1 /product="ribosomal protein L3" /db_xref="PID:g313659" /db_xref="SWISS-PROT:P39023" /translation="MSHRKFSAPRHGSLGFLPRKRSSRHRGKVKSFPKDDPSKPVHLT AFLGYKAGMTHIVREVDRPGSKVNKKEVVEAVTIVETPPMVVVGIVGYVETPRGLRTF KTVFAEHISDECKRRFYKNWHKSKKKAFTKYCKKWQDEDGKKQLEKDFSSMKKYCQVI RVIAHTQMRLLPLRQKKAHLMEIQVNGGTVAEKLDWARERLEQQVPVNQVFGQDEMID VIGVTKGKGYKGVTSRWHTKKLPRKTHRGLRKVACIGAWHPARVAFSVARAGQKGYHH RTEINKKIYKIGQGYLIKDGKLIKNNASTDYDLSDKSINPLGGFVHYGEVTNDFVMLK GCVVGTKKRVLTLRKSLLVQTKRRALEKIDLKFIDTTSKFGHGRFQTMEEKKAFMGPL KKDRIAKEEGA" polyA_signal 1253..1258 BASE COUNT 332 a 316 c 374 g 250 t ORIGIN 1 ggcgtgatgt ctcacagaaa gttctccgct cccagacatg ggtccctcgg cttcctgcct 61 cggaagcgca gcagcaggca tcgtgggaag gtgaagagct tccctaagga tgacccatcc 121 aagccggtcc acctcacagc cttcctggga tacaaggctg gcatgactca catcgtgcgg 181 gaagtcgaca ggccgggatc caaggtgaac aagaaggagg tggtggaggc tgtgaccatt 241 gtagagacac cacccatggt ggttgtgggc attgtgggct acgtggaaac ccctcgaggc 301 ctccggacct tcaagactgt ctttgctgag cacatcagtg atgaatgcaa gaggcgtttc 361 tataagaatt ggcataaatc taagaagaag gcctttacca agtactgcaa gaaatggcag 421 gatgaggatg gcaagaagca gctggagaag gacttcagca gcatgaagaa gtactgccaa 481 gtcatccgtg tcattgccca cacccagatg cgcctgcttc ctctgcgcca gaagaaggcc 541 cacctgatgg agatccaggt gaacggaggc actgtggccg agaagctgga ctgggcccgc 601 gagaggcttg agcagcaggt acctgtgaac caagtgtttg ggcaggatga gatgatcgac 661 gtcatcgggg tgaccaaggg caaaggctac aaaggggtca ccagtcgttg gcacaccaag 721 aagctgcccc gcaagaccca ccgaggcctg cgcaaggtgg cctgtattgg ggcatggcat 781 cctgctcgtg tagccttctc tgtggcacgc gctgggcaga aaggctacca tcaccgcact 841 gagatcaaca agaagattta taagattggc cagggctacc ttatcaagga cggcaagctg 901 atcaagaaca atgcctccac tgactatgac ctatctgaca agagcatcaa ccctctgggt 961 ggctttgtcc actatggtga agtgaccaat gactttgtca tgctgaaagg ctgtgtggtg 1021 ggaaccaaga agcgggtgct caccctccgc aagtccttgc tggtgcagac gaagcggcgg 1081 gctctggaga agattgacct taagttcatt gacaccacct ccaagtttgg ccatggccgc 1141 ttccagacca tggaggagaa gaaagcattc atgggaccac tgaagaaaga ccgaattgca 1201 aaggaagaag gagcttaatg ccaggaacag attttgcagt tggtggggtc tcaataaaat 1261 tattttccac tg // LOCUS HSRPL41 478 bp RNA PRI 18-JAN-1994 DEFINITION H.sapiens mRNA for homologue to yeast ribosomal protein L41. ACCESSION Z12962 S45214 NID g36135 KEYWORDS ribosomal protein L41. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 478) AUTHORS Scheit,K.K. TITLE Direct Submission JOURNAL Submitted (15-JUN-1992) Karl Heinz K.H. Scheit, Molecular Biology, Max-Planck-Institut fuer, Biophysikalische Chemie, Am Fassberg, Goettingen, 3400, Germany REFERENCE 2 (bases 1 to 478) AUTHORS Klaudiny,J., von der Kammer,H. and Scheit,K.H. TITLE Characterization by cDNA cloning of the mRNA of a highly basic human protein homologous to the yeast ribosomal protein YL41 JOURNAL Biochem. Biophys. Res. Commun. 187 (2), 901-906 (1992) MEDLINE 92412140 FEATURES Location/Qualifiers source 1..478 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="ovary" /clone_lib="ovarian granulosa cell cDNA in lamdaZAP (Dr.Scheit)" /clone="pHGP12" mat_peptide 84..158 /product="human homologue to yeast ribosomal protein YL41" CDS 84..161 /codon_start=1 /product="human homologue to yeast ribosomal protein YL41" /db_xref="PID:g36136" /db_xref="SWISS-PROT:P28751" /translation="MRAKWRKKRMRRLKRKRRKMRQRSK" polyA_signal 427..432 BASE COUNT 131 a 117 c 115 g 115 t ORIGIN 1 acccggcgct ccattaaata gccgtagacg gaacttcgcc tttctctcgg ccttagcgcc 61 atttttttgg aaacctctgc gccatgagag ccaagtggag gaagaagcga atgcgcaggc 121 tgaagcgcaa aagaagaaag atgaggcaga ggtccaagta aaccgctagc ttgttgcacc 181 gtggaggcca caggagcaga aacatggaat gccagacgct ggggatgctg gtacaagttg 241 tgggactgca tgctactgtc tagagcttgt ctcaatggat ctagaacttc atcgccctct 301 gatcgccgat cacctctgag acccaccttg ctcataaaca aaatgcccat gttggtcctc 361 tgccctggac ctgtgacatt ctggactatt tctgtgttta tttgtggccg agtgtaacaa 421 ccatataata aatcacctct tccgctgttt tagctgaaga attaaatcaa aaaaaaaa // LOCUS HSRPL6AA 926 bp RNA PRI 31-MAR-1993 DEFINITION H.sapiens mRNA for ribosomal protein L6. ACCESSION X69391 NID g36137 KEYWORDS ribosomal protein L6. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 926) AUTHORS Zaman,G.J. TITLE Sequence of a cDNA encoding human ribosomal protein L26 and of a cDNA probably encoding human ribosomal protein L6 JOURNAL Nucleic Acids Res. 21 (7), 1673 (1993) MEDLINE 93241958 REFERENCE 2 (bases 1 to 926) AUTHORS Zaman,G.J.R. TITLE Direct Submission JOURNAL Submitted (24-NOV-1992) G.J.R. Zaman, The Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, THE NETHERLANDS FEATURES Location/Qualifiers source 1..926 /organism="Homo sapiens" /db_xref="taxon:9606" /haplotype="aneuploid" /tissue_type="lung" /cell_type="non-small cell lung cancer cell" /cell_line="SW-1573/1R50b" /clone="pR6" gene 27..893 /gene="RPL6" CDS 27..893 /gene="RPL6" /codon_start=1 /product="ribosomal protein L6" /db_xref="PID:g36138" /db_xref="SWISS-PROT:Q02878" /translation="MAGEKVEKPDTKEKKPEAKKVDAGGKVKKGNLKAKKPKKGKPPL QPQPCPSQRNWQVFPICHVSRKAMYKRKYSAAKSKVEKKKKEKVLATVTKPVGGDKNG GTRVVKLRKMPRYYPTEDVPRKLLSHGKKPFSQHVRKLRASITPGTILIILTGRHRGK RVVFLKQLASGLLLVTDLWSSIEVPLRRTHQKFVIATSTKIDISNVKIPKHLTDAYFK KKKLRKPRHQEGEIFDTEKEKYEITEQRKIDQKAVDSQILPKIKAIPQLQGYLRSVFA LTNGIYPHKLVF" BASE COUNT 300 a 212 c 213 g 201 t ORIGIN 1 cttaattctc tttcccatct tgcaagatgg cgggtgaaaa agttgagaag ccagatacta 61 aagagaagaa acccgaagcc aagaaggttg atgctggtgg caaggtgaaa aagggtaacc 121 tcaaagctaa aaagcccaag aaggggaagc ccccattgca gccgcaaccc tgtccttctc 181 agaggaattg gcaggtattc ccgatctgcc atgtatccag aaaggccatg tacaagagga 241 agtactcagc cgctaaatcc aaggttgaaa agaaaaagaa ggagaaggtt ctcgcaactg 301 ttacaaaacc agttggtggt gacaagaacg gcggtacccg ggtggttaaa cttcgcaaaa 361 tgcctagata ttatcctact gaagatgtgc ctcgaaagct gttgagccac ggcaaaaaac 421 ccttcagtca gcacgtgaga aaactgcgag ccagcattac ccccgggacc attctgatca 481 tcctcactgg acgccacagg ggcaagaggg tggttttcct gaagcagctg gctagtggct 541 tattacttgt gactgacctc tggtcctcaa tcgaggttcc tctacgaaga acacaccaga 601 aatttgtcat tgccacttca accaaaatcg atatcagcaa tgtaaaaatc ccaaaacatc 661 ttactgatgc ttacttcaag aagaagaagc tgcggaagcc cagacaccag gaaggtgaga 721 tcttcgacac agaaaaagag aaatatgaga ttacggagca gcgcaagatt gatcagaaag 781 ctgtggactc acaaatttta ccaaaaatca aagctattcc tcagctccag ggctacctgc 841 gatctgtgtt tgctctgacg aatggaattt atcctcacaa attggtgttc taaatgtctt 901 aagaacctaa ttaaatagct gactac // LOCUS HSRPMI 1771 bp RNA PRI 02-FEB-1994 DEFINITION H.sapiens PMI1 mRNA for phosphomannose isomerase. ACCESSION X76057 NID g416016 KEYWORDS phosphomannose isomerase; PMI1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1771) AUTHORS Smith,D.J. TITLE Direct Submission JOURNAL Submitted (05-NOV-1993) D.J. Smith, University of Turku, Medicity, Tykistokatu 6A, Turku, 20520, FINLAND REFERENCE 2 (bases 1 to 1771) AUTHORS Proudfoot,A.E., Turcatti,G., Wells,T.N., Payton,M.A. and Smith,D.J. TITLE Purification, cDNA cloning and heterologous expression of human phosphomannose isomerase JOURNAL Eur. J. Biochem. 219 (1-2), 415-423 (1994) MEDLINE 94139717 FEATURES Location/Qualifiers source 1..1771 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="testes" /clone_lib="CDNA in lambda gt11" /clone="pHPM11" gene 6..1277 /gene="PMI1" CDS 6..1277 /gene="PMI1" /EC_number="5.3.1.8" /codon_start=1 /product="phosphomannose isomerase" /db_xref="PID:g416017" /db_xref="SWISS-PROT:P34949" /translation="MAAPRVFPLSCAVQQYAWGKMGSNSEVARLLASSDPLAQIAEDK PYAELWMGTHPRGDAKILDNRISQKTLSQWIAENQDSLGSKVKDTFNGNLPFLFKVLS VETPLSIQAHPNKELAEKLHLQAPQHYPDANHKPEMAIALTPFQGLCGFRPVEEIVTF LKKVPEFQFLIGDEAATHLKQTMSHDSQAVASSLQSCFSHLMKSEKKVVVEQLNLLVK RISQQAAAGNNMEDIFGELLLQLHQQYPGDIGCFAIYFLNLLTLKPGEAMFLEANVPH AYLKGDCVECMACSDNTVRAGLTPKFIDVPTLCEMLSYTPSSSKDRLFLPTRSQEDPY LSIYDPPVPDFTIMKTEVPGSVTEYKVLALDSASILLMVQGTVIASTPTTQTPIPLQR GGVLFIGANESVSLKLTEPKDLLIFRACCLL" BASE COUNT 407 a 518 c 461 g 385 t ORIGIN 1 cgagcatggc cgctccgcga gtattcccac tttcctgtgc ggtgcagcag tatgcctggg 61 ggaagatggg ttccaacagc gaagtggcgc ggctgttggc cagcagtgat ccactggccc 121 agatcgcaga ggacaagcct tatgcagagt tgtggatggg gactcacccc cgaggggatg 181 ccaagatcct tgacaaccgc atctcacaga agaccctaag ccagtggatt gctgagaacc 241 aggacagctt gggctcaaag gtcaaggaca cctttaatgg caacctgccc ttcctcttca 301 aagtgctctc agttgaaaca cccctgtcca tccaggcaca ccctaacaag gagctggcag 361 agaagctgca cctccaggct ccgcagcact accccgatgc caaccacaag ccagagatgg 421 ccattgccct cacccccttc cagggcttgt gtggcttccg gccagttgag gagattgtaa 481 cctttctaaa gaaggtgcct gagtttcagt tcctgattgg agatgaggca gcaacacacc 541 tgaagcagac catgagccat gactcccagg ctgtggcctc ctctctgcag agctgtttct 601 cccacctgat gaagagtgag aagaaggtgg tggtggaaca gctcaacctg ttggtgaagc 661 ggatctccca gcaagcggct gccggaaaca acatggagga catctttggg gagcttttgc 721 tacagctgca ccagcagtac ccaggtgata tcggctgctt tgccatctac ttcctgaacc 781 tgcttaccct gaagcctggg gaggccatgt ttctggaggc caacgtaccc catgcctacc 841 tgaaaggaga ctgcgtggag tgcatggcgt gttcagacaa cacagttcgt gctggcctga 901 cacccaagtt cattgatgtg ccaaccctgt gtgaaatgct cagctatacc cctagctcca 961 gcaaggacag gctctttctc ccaacacgga gtcaggaaga cccctacctc tcaatctatg 1021 acccccctgt accagacttc accattatga agacggaggt ccctggctct gtcactgaat 1081 acaaggtctt ggcactggac tctgccagca tcctcctgat ggtacagggg acagtaatag 1141 ccagcacacc cacaacccag acaccaatcc ctctgcaacg tggtggcgtg ctcttcattg 1201 gggccaatga gagtgtctca ctgaagctta ctgagccgaa ggacctgctg atattccgtg 1261 cctgctgtct gctgtaaagg ctgcagcctc cccagctctc ctctgccagc caccctaaat 1321 tccagccaac ctcacctcct cgggcccagc tcaagccccc ttccttgctc tggacccctt 1381 aggtataccc tggaagagct ggggtggggg aggagggagc gtgaaggtag tgactcctga 1441 acacacccag gtggaaccat ctttggggag gagaggcccg tgtgaggggt ctgatactcc 1501 ctttgtcttc cctctctact cctcgctaca cctgagccag gctcttgcca actctgttcc 1561 agcctatggc tttaggctag ctgttaaata tgtgacccag cattagctca gcatctgtca 1621 gagcaagaga ccaggtaatt tctaagaaca gggttctagc gatgggactg cccatttcct 1681 cagctgcaga ggaggaaagg gaaagggtag gcctgtagac taacgctgtt tacacccttg 1741 ttctgtcaaa gcaattaaag atcacttgtg t // LOCUS HSRPS11 543 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for ribosomal protein S11. ACCESSION X06617 NID g36143 KEYWORDS ribosomal protein; ribosomal protein S11. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 543) AUTHORS Lott,J.B. and Mackie,G.A. TITLE Direct Submission JOURNAL Submitted (19-JAN-1988) Lott J.B., Mackie G.A., Dept. of Biochemistry, The University of Western Ontario, London, Ontario N6A 5C1 REFERENCE 2 (bases 1 to 543) AUTHORS Lott,J.B. and Mackie,G.A. TITLE Sequence of a cloned cDNA encoding human ribosomal protein S11 JOURNAL Nucleic Acids Res. 16 (3), 1205 (1988) MEDLINE 88143998 FEATURES Location/Qualifiers source 1..543 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="fibroblast" mRNA <1..543 /note="rp S11 mRNA" CDS 16..492 /note="ribosomal protein S11 (AA 1 - 158)" /codon_start=1 /db_xref="PID:g36144" /db_xref="SWISS-PROT:P04643" /translation="MADIQTERAYQKQPTIFQNKKRVLLGETGKEKLPRYYKNIGLGF KTPKEAIEGTYIDKKCPFTGNVSIRGRILSGVVTKMKMQRTIVIRRDYLHYIRKYNRF EKRHKNMSVHLSPCFRDVQIGDIVTVGECRPLSKTVRFNVLKVTKAAGTKKQFQKF" polyA_site 543 /note="polyA site" BASE COUNT 144 a 151 c 145 g 103 t ORIGIN 1 caggcggccg ggaagatggc ggacattcag actgagcgtg cctaccaaaa gcagccgacc 61 atctttcaaa acaagaagag ggtcctgctg ggagaaactg gcaaggagaa gctcccgcgg 121 tactacaaga acatcggtct gggcttcaag acacccaagg aggctattga gggcacctac 181 attgacaaga aatgcccctt cactggtaat gtgtccattc gagggcggat cctctctggc 241 gtggtgacca agatgaagat gcagaggacc attgtcatcc gccgagacta tctgcactac 301 atccgcaagt acaaccgctt cgagaagcgc cacaagaaca tgtctgtaca cctgtccccc 361 tgcttcaggg acgtccagat cggtgacatc gtcacagtgg gcgagtgccg gcctctgagc 421 aagacagtgc gcttcaacgt gctcaaggtc accaaggctg ccggcaccaa gaagcagttc 481 cagaagttct gaggctggac attcggcccg ctcccacaat gaaataaagt tattttctat 541 tcc // LOCUS HSRPS12 492 bp RNA PRI 26-JUL-1991 DEFINITION Human mRNA for ribosomal protein S12. ACCESSION X53505 NID g36145 KEYWORDS ribosomal protein; ribosomal protein S12. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 492) AUTHORS Herault,Y. TITLE Direct Submission JOURNAL Submitted (22-JUN-1990) Herault Y., Laboratoire de Biologie Moleculaire et Cellulaire, Ecole Normale Superierue de Lyon, 46 Allee de Italie, 69364 Lyon Cedex 07, France REFERENCE 2 (bases 1 to 492) AUTHORS Herault,Y., Michel,D., Chatelain,G. and Brun,G. TITLE cDNA and predicted amino acid sequences of the human ribosomal protein genes rpS12 and rpL17 JOURNAL Nucleic Acids Res. 19 (14), 4001 (1991) MEDLINE 91319568 FEATURES Location/Qualifiers source 1..492 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="B lymphocyte" /cell_line="raji" /clone_lib="okayama et berg" mRNA 1..492 /note="ribosomal protein S12" /evidence=experimental CDS 80..478 /codon_start=1 /product="ribosomal protein S12" /db_xref="PID:g36146" /db_xref="SWISS-PROT:P25398" /translation="MAEEGIAAGGVMDVNTALQEVLKTALIHDGLARGIREAAKALDK RQAHLCVQASNCDEPMYVKLVEALLAEHQINLIKVDDNKKLGEWVGLCKIDREGNPRK VVGCSCVVVKDYGKESQAKDVIEEYFKCKK" BASE COUNT 135 a 105 c 135 g 117 t ORIGIN 1 ctttccctgc cgccgccgag tcgcgcggag ggcaggcttg ggtgcgttca agattcagct 61 tcacccgtaa cccaccgcca tggccgagga aggcattgct gctggaggtg taatggacgt 121 taatactgct ttacaagagg ttctgaagac tgccctcatc cacgatggcc tagcacgtgg 181 aattcgcgaa gctgccaaag ctttagacaa gcgccaagcc catctttgtg tgcaagcatc 241 caactgtgat gagcctatgt atgtcaagtt ggtggaggcc cttttggctg aacaccaaat 301 caacctaatt aaggttgatg acaacaagaa actaggagaa tgggtaggcc tttgtaaaat 361 tgacagagag gggaatcccc gtaaagtggt tggttgcagt tgtgtagtag ttaaggacta 421 tggcaaggag tctcaggcca aggatgtcat tgaagagtat ttcaaatgca agaaatgaag 481 aaataaatct tt // LOCUS HSRPS15A 450 bp RNA PRI 09-FEB-1995 DEFINITION H.sapiens mRNA for ribosomal protein S15a. ACCESSION X84407 NID g666046 KEYWORDS ribosomal protein S15a; S15a gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 450) AUTHORS Mays,G. and Burchert-Graeve,M. JOURNAL Unpublished REFERENCE 2 (bases 1 to 450) AUTHORS Mays,G. TITLE Direct Submission JOURNAL Submitted (06-FEB-1995) G. Mays, Klinische Chemie und Pathobiochemie, RWTH Aachen, Klinikum, Pauwelsstrasse 30, D- 52057 Aachen, FRG COMMENT Sequence overlapping with that under the acc#X62691. FEATURES Location/Qualifiers source 1..450 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="embryo" /tissue_type="carcinoma" /cell_line="NTera-2" /clone="D1" gene 1..393 /gene="S15a" CDS 1..393 /gene="S15a" /codon_start=1 /product="ribosomal protein S15a" /db_xref="PID:g666047" /db_xref="SWISS-PROT:P39027" /translation="MVRMNVLADALKSINNAEKRGKRQVLIRPCSKVIVRFLTLMMKH GYIGEFEIIDDHRAGKIVVNLTGRLNKCGVISPRFDVQLKDLEKWQNNLLPSRQFGFI VLTTSAGIMDHEEARRKHTGGKILGFFF" BASE COUNT 149 a 91 c 105 g 105 t ORIGIN 1 atggtgcgca tgaatgtcct ggcagatgct ctcaagagta tcaacaatgc cgaaaagaga 61 ggcaaacgcc aggtgcttat taggccgtgc tccaaagtca tcgtccggtt tctcacgttg 121 atgatgaagc atggttacat tggcgaattt gaaatcattg atgaccacag agctgggaaa 181 attgttgtga acctcacagg caggctaaac aagtgtgggg tgatcagccc cagatttgac 241 gtgcaactca aagacctgga aaaatggcag aataatctgc ttccatcccg ccagtttggt 301 ttcattgtac tgacaacctc agctggcatc atggaccatg aagaagcaag acgaaaacac 361 acaggaggga aaatcctggg attctttttc tagggatgta atacatatat ttacaaataa 421 aatgcctcat ggactaaaaa aaaaaaaaaa // LOCUS HSRPS26 414 bp RNA PRI 01-MAY-1995 DEFINITION H.sapiens RPS26 mRNA. ACCESSION X77770 NID g456350 KEYWORDS ribosomal protein S26; rps26 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 414) AUTHORS Filipenko,M.L., Vladimirov,S.N., Muravlev,A.I., Karpova,G.G. and Mertvetsov,N.P. TITLE Cloning cDNA of human S26 ribosomal protein and determination of its primary structure JOURNAL Bioorg. Khim. 20 (6), 644-649 (1994) MEDLINE 95032205 REFERENCE 2 (bases 1 to 414) AUTHORS Filipenko,M.L. TITLE Direct Submission JOURNAL Submitted (21-FEB-1994) M.L. Filipenko, Novosibirsk Institute of Bioorganic, Chemistry, 630090, Novosibirsk, Lavrentjev 8, Russia, USSR FEATURES Location/Qualifiers source 1..414 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /chromosome="16" gene 1..348 /gene="RPS26" CDS 1..348 /gene="RPS26" /codon_start=1 /product="ribosomal protein S26" /db_xref="PID:g456351" /db_xref="SWISS-PROT:Q06722" /translation="MTKKRRNNGRAKKGRGHVQPIRCTNCARCVPKDKAIKKFVIRNI VEAAAVRDISEASVFDAYVLPKLYVKLHYCVSCVIHSKVVRNRSREARKDRTPPPRFR PAGAAPRPPPKPM" BASE COUNT 118 a 106 c 107 g 83 t ORIGIN 1 atgacaaaga agagaaggaa caatggtcgt gccaaaaagg gccgcggcca cgtgcagcct 61 attcgctgca ctaactgtgc ccgatgcgtg cccaaggaca aggccattaa gaaattcgtc 121 attcggaaca tagtggaggc cgcagcagtc agggacattt ctgaagcgag cgtcttcgat 181 gcctatgtgc ttcccaagct gtatgtgaag ctacattact gtgtgagttg tgtaattcac 241 agcaaagtag tcaggaatcg atctcgtgaa gcccgcaagg accgaacacc cccaccccga 301 tttagacctg cgggtgctgc cccacgtccc ccaccaaagc ccatgtaagg agctgagttc 361 ttgaagactg aagacaggct attccctgga gaaaaataaa acggaaattg tact // LOCUS HSRPTK 3096 bp RNA PRI 30-NOV-1993 DEFINITION H.sapiens mRNA for receptor protein tyrosine kinase. ACCESSION X74764 NID g433337 KEYWORDS receptor protein-tyrosine kinase; transmembrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3096) AUTHORS Karn,T., Holtrich,U., Brauninger,A., Bohme,B., Wolf,G., Rubsamen-Waigmann,H. and Strebhardt,K. TITLE Structure, expression and chromosomal mapping of TKT from man and mouse: a new subclass of receptor tyrosine kinases with a factor VIII-like domain JOURNAL Oncogene 8 (12), 3433-3440 (1993) MEDLINE 94067796 REFERENCE 2 (bases 1 to 3096) AUTHORS Karn,T. TITLE Direct Submission JOURNAL Submitted (20-AUG-1993) T. Karn, Chemotherapeutisches Forschungsinstitut, Georg-Speyer-Haus, Paul-Ehrlich-Strasse 42-44, D-60596 Frankfurt, FRG FEATURES Location/Qualifiers source 1..3096 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="K1#1-1" /clone="K1#2-1,K1#9-1,K1#78-1,EB-3,U8-P12,lambdaTK-3" /clone_lib="lambdaZAP cDNA heart, lambdaMax1 cDNA thymus" /chromosome="1" /map="1q12-qter" sig_peptide 354..416 /gene="TKT" CDS 354..2921 /gene="TKT" /EC_number="2.7.1.112" /codon_start=1 /product="protein-tyrosine kinase" /db_xref="PID:g433338" /translation="MILIPRMLLVLFLLLPILSSAKAQVNPAICRYPLGMSGGQIPDE DITASSQWSESTAAKYGRLDSEEGDGAWCPEIPVEPDDLKEFLQIDLHTLHFITLVGT QGRHAGGHGIEFAPMYKINYSRDGTRWISWRNRHGKQVLDGNSNPYDIFLKDLEPPIV ARFVRFIPVTDHSMNVCMRVELYGCVWLDGLVSYNAPAGQQFVLPGGSIIYLNDSVYD GAVGYSMTEGLGQLTDGVSGLDDFTQTHEYHVWPGYDYVGWRNESATNGYIEIMFEFD RIRNFTTMKVHCNNMFAKGVKIFKEVQCYFRSEASEWEPNAISFPLVLDDVNPSARFV TVPLHHRMASAIKCQYHFADTWMMFSEITFQSDAAMYNNSEALPTSPMAPTTYDPMLK VDDSNTRILIGCLVAIIFILLAIIVIILWRQFWQKMLEKASRRMLDDEMTVSLSLPSD SSMFNNNRSSSPSEQGSNSTYDRIFPLRPDYQEPSRLIRKLPEFAPGEEESGCSGVVK PVQPSGPEGVPHYAEADIVNLQGVTGGNTYSVPAVTMDLLSGKDVAVEEFPRKLLTFK EKLGEGQFGEVHLCEVEGMEKFKDKDFALDVSANQPVLVAVKMLRADANKNARNDFLK EIKIMSRLKDPNIIHLLSVCITDDPLCMITEYMENGDLNQFLSRHEPPNSSSSDVRTV SYTNLKFMATQIASGMKYLSSLNFVHRDLATRNCLVGKNYTIKIADFGMSRNLYSGDY YRIQGRAVLPIRWMSWESILLGKFTTASDVWAFGVTLWETFTFCQEQPYSQLSDEQVI ENTGEFFRDQGRQTYLPQPAICPDSVYKLMLSCWRRDTKNRPSFQEIHLLLLQQGDE" gene 354..2921 /gene="TKT" mat_peptide 417..2918 /gene="TKT" /EC_number="2.7.1.112" /product="protein-tyrosine kinase" BASE COUNT 762 a 791 c 752 g 791 t ORIGIN 1 catcttgcat cagcctgtgg atgtatgcct accaccgggc tccttcacca gcaaagtgga 61 aaaagaagcg tttcacaaca aattcttctt tttgggttgg ggaaacgcag tggattatag 121 ctctgttttc ttctttccaa aactgtgcac ccctggatga aacctccatc aagggagacc 181 tacaagttgc ctggggttca gtgctctaga aagttccaag gtttgtggct tgaattattc 241 taaagaagct gaaataattg aagagaagca gaggccagct gtttttgagg atcctgctcc 301 acagagaatg ctctgcaccc gttgatactc cagttccaac accatcttct gagatgatcc 361 tgattcccag aatgctcttg gtgctgttcc tgctgctgcc tatcttgagt tctgcaaaag 421 ctcaggttaa tccagctata tgccgctatc ctctgggcat gtcaggaggc cagattccag 481 atgaggacat cacagcttcc agtcagtggt cagagtccac agctgccaaa tatggaaggc 541 tggactcaga agaaggggat ggagcctggt gccctgagat tccagtggaa cctgatgacc 601 tgaaggagtt tctgcagatt gacttgcaca ccctccattt tatcactctg gtggggaccc 661 aggggcgcca tgcaggaggt catggcatcg agtttgcccc catgtacaag atcaattaca 721 gtcgggatgg cactcgctgg atctcttggc ggaaccgtca tgggaaacag gtgctggatg 781 gaaatagtaa cccctatgac attttcctaa aggacttgga gccgcccatt gtagccagat 841 ttgtccggtt cattccagtc accgaccact ccatgaatgt gtgtatgaga gtggagcttt 901 acggctgtgt ctggctagat ggcttggtgt cttacaatgc tccagctggg cagcagtttg 961 tactccctgg aggttccatc atttatctga atgattctgt ctatgatgga gctgttggat 1021 acagcatgac agaagggcta ggccaattga ccgatggtgt gtctggcctg gacgatttca 1081 cccagaccca tgaataccac gtgtggcccg gctatgacta tgtgggctgg cggaacgaga 1141 gtgccaccaa tggctacatt gagatcatgt ttgaatttga ccgcatcagg aatttcacta 1201 ccatgaaggt ccactgcaac aacatgtttg ctaaaggtgt gaagatcttt aaggaggtac 1261 agtgctactt ccgctctgaa gccagtgagt gggaacctaa tgccatttcc ttcccccttg 1321 tcctggatga cgtcaacccc agtgctcggt ttgtcacggt gcctctccac caccgaatgg 1381 ccagtgccat caagtgtcaa taccattttg cagatacctg gatgatgttc agtgagatca 1441 ccttccaatc agatgctgca atgtacaaca actctgaagc cctgcccacc tctcctatgg 1501 cacccacaac ctatgatcca atgcttaaag ttgatgacag caacactcgg atcctgattg 1561 gctgcttggt ggccatcatc tttatcctcc tggccatcat tgtcatcatc ctctggaggc 1621 agttctggca gaaaatgctg gagaaggctt ctcggaggat gctggatgat gaaatgacag 1681 tcagcctttc cctgccaagt gattctagca tgttcaacaa taaccgctcc tcatcaccta 1741 gtgaacaagg gtccaactcg acttacgatc gcatctttcc ccttcgccct gactaccagg 1801 agccatccag gctgatacga aaactcccag aatttgctcc aggggaggag gagtcaggct 1861 gcagcggtgt tgtgaagcca gtccagccca gtggccctga gggggtgccc cactatgcag 1921 aggctgacat agtgaacctc caaggagtga caggaggcaa cacatactca gtgcctgccg 1981 tcaccatgga cctgctctca ggaaaagatg tggctgtgga ggagttcccc aggaaactcc 2041 taactttcaa agagaagctg ggagaaggac agtttgggga ggttcatctc tgtgaagtgg 2101 agggaatgga aaaattcaaa gacaaagatt ttgccctaga tgtcagtgcc aaccagcctg 2161 tcctggtggc tgtgaaaatg ctccgagcag atgccaacaa gaatgccagg aatgattttc 2221 ttaaggagat aaagatcatg tctcggctca aggacccaaa catcatccat ctattatctg 2281 tgtgtatcac tgatgaccct ctctgtatga tcactgaata catggagaat ggagatctca 2341 atcagtttct ttcccgccac gagcccccta attcttcctc cagcgatgta cgcactgtca 2401 gttacaccaa tctgaagttt atggctaccc aaattgcctc tggcatgaag tacctttcct 2461 ctcttaattt tgttcaccga gatctggcca cacgaaactg tttagtgggt aagaactaca 2521 caatcaagat agctgacttt ggaatgagca ggaacctgta cagtggtgac tattaccgga 2581 tccagggccg ggcagtgctc cctatccgct ggatgtcttg ggagagtatc ttgctgggca 2641 agttcactac agcaagtgat gtgtgggcct ttggggttac tttgtgggag actttcacct 2701 tttgtcaaga acagccctat tcccagctgt cagatgaaca ggttattgag aatactggag 2761 agttcttccg agaccaaggg aggcagactt acctccctca accagccatt tgtcctgact 2821 ctgtgtataa gctgatgctc agctgctgga gaagagatac gaagaaccgt ccctcattcc 2881 aagaaatcca ccttctgctc cttcaacaag gcgacgagtg atgctgtcag tgcctggcca 2941 tgttcctacg gctcaggtcc tccctacaag acctaccact cacccatgcc tatgccactc 3001 catctggaca tttaatgaaa ctgagagaca gaggcttgtt tgctttgccc tcttttcctg 3061 gtcaccccca ctccctaccc ctgactcata tatact // LOCUS HSRR2 15731 bp RNA PRI 09-SEP-1996 DEFINITION H.sapiens mRNA for ryanodine receptor 2. ACCESSION X98330 NID g1526977 KEYWORDS ryanodine receptor 2. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 15731) AUTHORS Tunwell,R.E., Wickenden,C., Bertrand,B.M., Shevchenko,V.I., Walsh,M.B., Allen,P.D. and Lai,F.A. TITLE The human cardiac muscle ryanodine receptor-calcium release channel: identification, primary structure and topological analysis JOURNAL Biochem. J. 318 (Pt 2), 477-487 (1996) MEDLINE 96404895 REFERENCE 2 (bases 1 to 15731) AUTHORS Lai,F.A. TITLE Direct Submission JOURNAL Submitted (06-JUN-1996) T. Lai, National Institute for Medical Research, Neurophysiology, The Ridgeway, Mill Hill, London NW7 1AA, UK FEATURES Location/Qualifiers source 1..15731 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="22 years old" /sex="male" /tissue_type="heart" CDS 122..15025 /codon_start=1 /product="ryanodine receptor 2" /db_xref="PID:e248827" /db_xref="PID:g1526978" /translation="MADGGEGEDEIQFLRTDDEVVLQCTATIHKEQQKLCLAAEGFGN RLCFLESTSNSKNVPPDLSICTFVLEQSLSVRALQEMLANTVEKSEGQVDVEKWKFMM KTAQGGGHRTLLYGHAILLRHSYSGMYLCCLSTSRSSTDKLAFDVGLQEDTTGEACWW TIHPASKQRSEGEKVRVGDDLILVSVSSERYLHLSYGNGSLHVDAAFQQTLWSVAPIS SGSEAAQGYLIGGDVLRLLHGHMDECLTVPSGEHGEEQRRTVHYEGGAVSVHARSLWR LETLRVAWSGSHIRWGQPFRLRHVTTGKYLSLMEDKNLLLMDKEKADVKSTAFTFRSS KEKLDVGVRKEVDGMGTSEIKYGDSVCYIQHVDTGLWLTYQSVDVKSVRMGSIQRKAI MHHEGHMDDGISLSRSQHEESRTARVIRSTVFLFNRFIRGLDALSKKAKASTVDLPIE SVSLSLQDLIGYFHPPDEHLEHEDKQNRLRALKNRQNLFQEEGMINLVLECIDRLHVY SSAAHFADVAGREAGESWKSILNSLYELLAALIRGNRKNCAQFSGSLDWLISRLERLE ASSGILEVLHCVLVESPEALNIIKEGHIKSIISLLDKHGRNHKVLDVLCSLCVCHGVA VRSNQHLICDNLLPGRDLLLQTRLVNHVSSMRPNIFLGVSEGSAQYKKWYYELMVDHT EPFVTAEATHLRVGWASTEGYSPYPGGGEEWGGNGVGDDLFSYGFDGLHLWSGCIART VSSPNQHLLRTDDVISCCLDLSAPSISFRINGQPVQGMFENFNIDGLFFPVVSFSAGI KVRFLLGGRHGEFKFLPPPGYAPCYEAVLPKEKLKVEHSREYKQERTYTRDLLGPTVS LTQAAFTPIPVDTSQIVLPPHLERIREKLAENIHELWVMNKIELGWQYGPVRDDNKRQ HPCLVEFSKLPEQERNYNLQMSLETLKTLLALGCHVGISDEHAEDKVKKMKLPKNYQL TSGYKPAPMDLSFIKLTPSQEAMVDKLAENAHNVWARDRIRQGWTYGIQQDVKNRRNP RLVPYTPLDDRTKKSNKDSLREAVRTLLGYGYNLEAPDQDHAARAEVCSGTGERFRIF RAEKTYAVKAGRWYFEFETVTAGDMRVGWSRPGCQPDQELGSDERAFAFDGFKAQRWH QGNEHYGRSWQAGDVVGCMVDMNEHTMMFTLNGEILLDDSGSELAFKDFDVGDGFIPV CSLGVAQVGRMNFGKDVSTLKYFTICGLQEGYEPFAVNTNRDITMWLSKRLPQFLQVP SNHEHIEVTRIDGTIDSSPCLKVTQKSFGSQNSNTDIMFYRLSMPIECAEVFSKTVAG GLPGAGLFGPKNDLEDYDADSDFEVLMKTAHGHLVPDRVDKDKEATKPEFNNHKDYAQ EKPSRLKQRFLLRRTKPDYSTSHSARLTEDVLADDRDDYDFLMQTSTYYYSVRIFPGQ EPANVWVGWITSDFHQYDTGFDLDRVRTVTVTLGDEKGKVHESIKRSNCYMVCAGESM SPGQGRNNNGLEIGCVVDAASGLLTFIANGKELSTYYQVEPSTKLFPAVFAQATSPNV FQFELGRIKNVMPLSAGLFKSEHKNPVPQCPPRLHVQFLSHVLWSRMPNQFLKVDVSR ISERQGWLVQCLDPLQFMSLHIPEENRSVDILELTEQEELLKFHYHTLRLYSAVCALG NHRVAHALCSHVDEPQLLYAIENKYMPGLLRAGYYDLLIDIHLSSYATARLMMNNEYI VPMTEETKSITLFPDENKKHGLPGIGLSTSLRPRMQFSSPSFVSISNECYQYSPEFPL DILKSKTIQMLTEAVKEGSLHARDPVGGTTEFLFVPLIKLFYTLLIMGIFHNEDLKHI LQLIEPSVFKEAATPEEESDTLEKELSVDDAKLQGAGEEEAKGGKRPKEGLLQMKLPE PVKLQMCLLLQYLCDCQVRHRIEAIVAFSDDFVAKLQDNQRFRYNEVMQALNMSAALT ARKTKEFRSPPQEQINMLLNFKDDKSECPCPEEIRDQLLDFHEDLMTHCGIELDEDGS LDGNSDLTIRGRLLSLVEKVTYLKKKQAEKPVESDSKKSSTLQQLISETMVRWAQESV IEDPELVRAMFVLLHRQYDGIGGLVRALPKTYTINGVSVEDTINLLASLGQIRSLLSV RMGKEEEKLMIRGLGDIMNNKVFYQHPNLMRALGMHETVMEVMVNVLGGGESKEITFP KMVANCCRFLCYFCRISRQNQKAMFDHLSYLLENSSVGLASPAMRGSTPLDVAAASVM DNNELALALREPDLEKVVRYLAGCGLQSCQMLVSKGYPDIGWNPVEGERYLDFLRFAV FCNGESVEENANVVVRLLIRRPECFGPALRGEGGNGLLAAMEEAIKIAEDPSRDGPSP NSGSSKTLDTEEEEDDTIHMGNAIMTFYSALIDLLGRCAPEMHLIHAGKGEAIRIRSI LRSLIPLGDLVGVISIAFQMPTIAKDGNVVEPDMSAGFCPDHKAAMVLFLDRVYGIEV QDFLLHLLEVGFLPDLRAAASLDTAALSATDMALALNRYLCTAVLPLLTRCAPLFAGT EHHASLIDSLLHTVYRLSKGCSLTKAQRDSIEVCLLSICGQLRPSMMQHLLRRLVFDV PLLNEHAKMPLKLLTNHYERCWKYYCLPGGWGNFGAASEEELHLSRKLFWGIFDALSQ KKYEQELFKLALPCLSAVAGALPPDYMESNYVSMMEKQSSMDSEGNFNPQPVDTSNIT IPEKLEYFINKYAEHSHDKWSMDKLANGWIYGEIYSDSSKVQPLMKPYKLLSEKEKEI YRWPIKESLKTMLARTMRTERTREGDSMALYNRTRRISQTSQVSVDAAHGYSPRAIDM SNVTLSRDLHAMAEMMAENYHNIWAKKKKMELESKGGGNHPLLVPYDTLTAKEKAKDR EKAQDILKFLQINGYAVSRGFKDLELDTPSIEKRFAYSFLQQLIRYVDEAHQYILEFD GGSRGKGEHFPYEQEIKFFAKVVLPLIDQYFKNHRLYFLSAASRPLCSGGHASNKEKE MVTSLFCKLGVLVRHRISLFGNDATSIVNCLHILGQTLDARTVMKTGLESVKSALRAF LDNAAEDLEKTMENLKQGQFTHTRNQPKGVTQIINYTTVALLPMLSSLFEHIGQHQFG EDLILEDVQVSCYRILTSLYALGTSKSIYVERQRSALGECLAAFAGAFPVAFLETHLD KHNIYSIYNTKSSRERAALSLPTNVEDVCPNIPSLEKLMEEIVELAESGIRYTQMPHV MEVILPMLCSYMSRWWEHGPENNPERAEMCCTALNSEHMNTLLGNILKIIYNNLGIDE GAWMKRLAVFSQPIINKVKPQLLKTHFLPLMEKLKKKAATVVSEEDHLKAEARGDMSE AELLILDEFTTLARDLYAFYPLLIRFVDYNRAKWLKEPNPEAEELFRMVAEVFIYWSK SHNFKREEQNFVVQNEINNMSFLITDTKSKMSKAAVSDQERKKMKRKGDRYSMQTSLI VAALKRLLPIGLNICAPGDQELIALAKNRFSLKDTEDEVRDIIRSNIHLQGKLEDPAI RWQMALYKDLPNRTDDTSDPEKTVERVLDIANVLFHLEQKSKRVGRRHYCLVEHPQRS KKAVWHKLLSKQRKRAVVACFRMAPLYNLPRHRAVNLFLQGYEKSWIETEEHYFEDKL IEDLAKPGAEPPEEDEGTKRVDPLHQLILLFSRTALTEKCKLEEDFLYMAYADIMAKS CHDEEDDDGEEEVKSFEEKEMEKQKLLYQQARLHDRGAAEMVLQTISASKGETGPMVA ATLKLGIAILNGGNSTVQQKMLDYLKEKKDVGFFQSLAGLMQSCSVLDLNAFERQNKA EGLGMVTEEGSGEKVLQDDEFTCDLFRFLQLLCEGHNSDFQNYLRTQTGNNTTVNIII STVDYLLRVQESISDFYWYYSGKDVIDEQGQRNFSKAIQVAKQVFNTLTEYIQGPCTG NQQSLAHSRLWDAVVGFLHVFAHMQMKLSQDSSQIELLKELMDLQKDMVVMLLSMLEG NVVNGTIGKQMVDMLVESSNNVEMILKFFDMFLKLKDLTSSDTFKEYDPDGKGVISKR DFHKAMESHKHYTQSETEFLLSCAETDENETLDYEEFVKRFHEPAKDIGFNVAVLLTN LSEHMPNDTRLQTFLELAESVLNYFQPFLGRIEIMGSAKRIERVYFEISESSRTQWEK PQVKESKRQFIFDVVNEGGEKEKMELFVNFCEDTIFEMQLAAQISESDLNERSANKEE SEKERPEEQGPRMAFFSILTVRSALFALRYNILTLMRMLSLKSLKKQMKKVKKMTVKD MVTAFFSSYWSIFMTLLHFVASVFRGFFRIICSLLLGGSLVEGAKKIKVAELLANMPD PTQDEVRGDGEEGERKPLEAALPSEDLTDLKELTEESDLLSDIFGLDLKREGGQYKLI PHNPNAGLSDLMSNPVPMPEVQEKFQEQKAKEEEKEEKEETKSEPEKAEGEDGEKEEK AKEDKGKQKLRQLHTHRYGEPEVPESAFWKKIIAYQQKLLNYFARNFYNMRMLALFVA FAINFILLFYKVSTSSVVEGKELPTRSSSENAKVTSLDSSSHRIIAVHYVLEESSGYM EPTLRILAILHTVISFFCIIGYYCLKVPLVIFKREKEVARKLEFDGLYITEQPSEDDI KGQWDRLVINTQSFPNNYWDKFVKRKVMDKYGEFYGRDRISELLGMDKAALDFSDARE KKKPKKDSSLSAVLNSIDVKYQMWKLGVVFTDNSFLYLAWYMTMSVLGHYNNFFFAAH LLDIAMGFKTLRTILSSVTHNGKQLVLTVGLLAVVVYLYTVVAFNFFRKFYNKSEDGD TPDMKCDDMLTCYMFHMYVGVRAGGGIGDEIEDPAGDEYEIYRIIFDITFFFFVIVIL LAIIQGLIIDAFGELRDQQEQVKEDMETKCFICGIGNDYFDTVPHGFETHTLQEHNLA NYLFFLMYLINKDETEHTGQESYVWKMYQERCWEFFPAGDCFRKQYEDQLN" BASE COUNT 4498 a 3407 c 3874 g 3952 t ORIGIN 1 cgcgcggccc cctccagccc ccggctcccg gcagcagaag cagaaggcag cgccaggggc 61 cgccgccgcc gccgagctcc gcggggctcg ggagccggcc ccggcgagga ggcgcggaac 121 catggccgat gggggcgagg gcgaagacga gatccagttc ctgcgaactg atgatgaagt 181 ggttctgcag tgcaccgcaa ccatccacaa agaacaacag aagctatgct tggcagcaga 241 aggatttggc aacagacttt gtttcttgga gtccacttcc aattccaaga atgtgccccc 301 agacctctcc atctgcacct ttgtgctgga gcagtccctc tctgtccggg cgctgcagga 361 gatgctggct aacaccgtgg agaaatcaga agggcaagtt gatgtggaaa aatggaaatt 421 catgatgaag actgctcaag gtggtggtca tcgaacactc ctctacggac atgccatatt 481 gctgcgccat tcctatagtg gcatgtatct gtgctgcctg tccacctccc ggtcttcaac 541 tgataagctg gcttttgatg ttggcttgca agaggacacc acaggggagg cttgttggtg 601 gaccatacac cctgcctcta agcagcgatc agaaggagaa aaagtacgag ttggagatga 661 cctcatctta gttagcgtgt cctctgaaag gtacttgcac ttgtcttatg gcaacggcag 721 cttacacgtg gatgccgctt tccagcagac tctctggagc gtggccccaa tcagctcagg 781 aagtgaggca gcccaagggt atctcattgg tggtgatgtc ctcaggttgc tgcatggaca 841 catggacgag tgtctcactg tcccttcagg agaacatggt gaagagcagc ggagaactgt 901 tcattatgaa ggtggcgctg tgtctgttca tgcacgttcc ctttggagac tagagacgct 961 aagagttgcg tggagtggaa gccacataag atggggacag ccattccgac tacgccatgt 1021 cacaacagga aaatacttga gtctcatgga agacaaaaac cttctactca tggacaaaga 1081 gaaagctgat gtaaaatcaa cagcatttac cttccggtct tccaaggaaa aattggatgt 1141 aggggtgaga aaagaagtag atggcatggg aacatctgaa ataaaatacg gtgactcagt 1201 atgctatata caacatgtag acacaggcct atggcttact taccagtctg tggacgtgaa 1261 atccgtgaga atgggatcta tacaacgtaa ggctattatg catcatgaag gccacatgga 1321 tgatggcata agtttgtcga gatcccagca tgaagaatca cgcacagccc gagttatccg 1381 gagcacagtc ttccttttca atagatttat aaggggcctt gatgctctca gcaagaaagc 1441 gaaggcttcc acagtcgatt tgcctataga gtccgtaagc ctaagtctgc aggatctcat 1501 tggctacttc caccccccag atgagcattt agagcatgaa gacaaacaga acagactacg 1561 agccctgaag aatcggcaaa atctcttcca ggaagaggga atgatcaacc tcgtgcttga 1621 gtgcatagac cgtttgcacg tctacagcag tgcagcacac tttgctgatg ttgctgggcg 1681 agaagcagga gagtcttgga aatccattct gaattctctg tatgagttgc tggcggctct 1741 aattagagga aatcgtaaaa actgtgctca attttctggc tccctcgact ggttgatcag 1801 cagattggaa agactggaag cttcttcagg cattctggaa gttttacact gtgttttagt 1861 agaaagtcca gaagctctaa atattattaa agaaggacat attaaatcta ttatctcact 1921 tttagacaaa catggaagaa atcacaaggt tctggatgtc ttgtgctcac tctgtgtttg 1981 ccacggggtt gcagtccgtt ctaaccagca tctcatctgt gacaatctcc taccaggaag 2041 agacttgtta ttgcagacac gtcttgtgaa ccatgtcagc agcatgagac ccaatatttt 2101 tctgggcgtc agtgaaggtt ctgctcagta taagaaatgg tactatgaat tgatggtgga 2161 ccacacagag ccctttgtga cagctgaagc aactcacctg cgagtgggct gggcttccac 2221 tgaaggatat tctccctacc ctggaggggg cgaagagtgg ggtggaaatg gtgttggaga 2281 tgatctcttc tcctatggat ttgatggcct tcatctctgg tcaggttgta ttgctcgtac 2341 tgtaagctca ccaaaccaac atctgttaag aactgatgat gtcatcagtt gctgtttaga 2401 tctgagtgcc ccaagcatct cgttccgaat taatggacaa cctgttcaag gaatgtttga 2461 gaatttcaac atcgatggcc tcttctttcc agtcgttagt ttctctgcag gaataaaagt 2521 acgctttctg cttggagggc gacatggaga attcaaattt cttcctccac ctgggtatgc 2581 tccttgttat gaagctgttc tgccaaaaga aaagttgaaa gtggaacaca gccgagagta 2641 caagcaagaa agaacttaca cacgcgacct gctgggcccc acagtttccc tgacgcaagc 2701 tgccttcaca cccatccctg tggataccag ccagatcgtg ttgcctcctc atctagaaag 2761 aataagagaa aaactggcag agaatatcca tgaactctgg gttatgaata aaattgagct 2821 tggctggcag tatggtccgg ttagagatga caacaagaga caacacccat gcctggtgga 2881 gttctccaag ctgcctgaac aggagcgcaa ttacaactta caaatgtcgc ttgagaccct 2941 gaagactttg ttggcattag gatgtcatgt gggtatatca gatgaacatg ctgaagacaa 3001 ggtgaaaaaa atgaagctac ccaagaatta ccagctgaca agtggataca agcctgcccc 3061 tatggacctg agctttatca aactcacccc atcgcaagaa gcaatggtgg acaagttggc 3121 agaaaatgca cataatgtgt gggcgcggga tcgaatccgg cagggctgga cttatggcat 3181 ccaacaggac gtaaagaaca gaagaaatcc tcgccttgtt ccctacactc ctctggatga 3241 ccgaaccaag aaatccaaca aggacagcct ccgcgaggct gtgcgcacgc tgctggggta 3301 cggctacaac ttggaagcac cagatcaaga tcatgcagcc agagccgaag tgtgcagcgg 3361 caccggggaa aggttccgaa tcttccgtgc cgagaagacc tatgcagtga aggccggacg 3421 gtggtatttt gaatttgaga cggtcactgc tggagacatg agggttggtt ggagtcgtcc 3481 tggttgtcaa ccggatcagg agcttggctc agatgaacgt gcctttgcct ttgatggctt 3541 caaggcccag cggtggcatc agggcaatga acactatggg cgctcttggc aagcaggcga 3601 tgtcgtgggg tgtatggttg acatgaacga acacaccatg atgttcacac tgaatggtga 3661 aatccttctt gatgattcag gctcagaact ggctttcaag gactttgatg ttggcgatgg 3721 attcatacct gtgtgtagcc ttggagtggc tcaagtgggt aggatgaact ttggaaagga 3781 tgtcagcacc ttgaaatatt tcaccatctg tggcttacaa gagggctatg aaccatttgc 3841 cgttaataca aacagggata ttaccatgtg gctgagcaag aggcttcctc agtttcttca 3901 agttccatca aaccatgaac atatagaggt gaccagaata gacggcacca tagacagttc 3961 cccatgttta aaggtcactc agaagtcttt tggttctcag aacagcaaca ctgatatcat 4021 gttttatcgc ctgagcatgc cgatcgagtg cgcggaggtc ttctccaaga cggtggctgg 4081 agggctccct ggggctggcc tttttgggcc caagaatgac ttggaagatt atgatgctga 4141 ttctgacttt gaggttctga tgaagacagc tcatggccat ctagtgcccg atcgtgttga 4201 caaagacaaa gaagctacta aaccagagtt taacaaccac aaagattatg cccaggaaaa 4261 gccctctcgt ctgaaacaaa gatttttgct tagaagaaca aagccagatt acagcacaag 4321 ccattctgca agactcaccg aagatgtcct tgctgatgat cgggatgact atgatttctt 4381 gatgcaaacg tccacgtact attactcagt gagaatcttt cctggacaag aacctgctaa 4441 tgtctgggtg ggctggatta catcagattt ccatcagtat gacacaggct ttgacttgga 4501 cagagttcgc acagtaacag ttactctagg agatgaaaaa ggaaaagtgc atgaaagcat 4561 caaacgcagc aactgctata tggtatgtgc gggtgagagc atgagccccg ggcaaggacg 4621 caacaataat ggactggaga ttggctgtgt ggtggatgct gccagcgggc tgctcacatt 4681 cattgccaat ggcaaggaac tgagcacata ctatcaggtg gaaccgagta caaaattatt 4741 tcctgcggtt tttgcacaag ctacaagtcc caatgttttc cagtttgagt tgggaagaat 4801 aaagaatgtg atgcctctct cggcgggatt attcaagagt gagcacaaga accccgtgcc 4861 gcagtgcccc ccgcgcctcc acgtgcagtt cctgtcacac gtcctgtgga gcagaatgcc 4921 caaccagttt ttgaaggtag atgtgtctcg aataagtgaa cgccaaggct ggttggtgca 4981 gtgtttggat cctctgcagt tcatgtctct tcatatccct gaggaaaaca gatctgttga 5041 catcttagag ttgacagagc aggaggaatt gctgaaattt cactatcaca ctctccggct 5101 ctactcagcc gtctgtgctc ttgggaacca ccgggtggcc catgccctgt gcagccatgt 5161 ggatgaacct cagctcctct atgccattga gaacaagtac atgcctggtt tgctgcgtgc 5221 tggctactat gacctgctga ttgacatcca cctgagctcc tatgccactg ccaggctcat 5281 gatgaacaac gagtacattg tccccatgac ggaggagacg aagagcatca ccctgttccc 5341 tgatgagaac aaaaaacacg gccttccagg gatcggcctc agcacctccc tcaggccacg 5401 gatgcagttt tcctccccca gttttgtaag cattagtaat gaatgttacc agtacagtcc 5461 agagttccca ctggacatcc tcaagtccaa aaccatacag atgctgacag aagctgttaa 5521 agagggcagt cttcatgccc gggacccagt tggagggact actgaattcc tctttgtacc 5581 tctcatcaag cttttctata ccctgctgat catgggcatc tttcacaacg aggacttgaa 5641 gcacatcttg cagttgattg agcccagtgt gtttaaagaa gctgccactc cggaggagga 5701 gagtgacacg ctggagaaag agctcagtgt ggacgatgca aagctgcaag gagctggtga 5761 ggaagaagcc aaggggggca agcggcccaa ggaaggcctg ctccaaatga aactgccaga 5821 gccagttaaa ttgcagatgt gcctactgct tcagtacctc tgtgactgcc aggtccggca 5881 ccggatagaa gccattgtag ccttttcaga tgattttgtg gctaagctcc aagacaatca 5941 acgtttccga tacaacgaag tcatgcaagc cttaaacatg tcagctgcac tcacagccag 6001 gaagacaaag gaatttagat caccacctca agaacagatc aatatgcttc tcaattttaa 6061 ggatgacaaa agtgaatgtc catgtccaga agaaattcgt gaccaactat tggatttcca 6121 tgaagatttg atgacacatt gtggaattga gctggatgaa gatgggtctc tggatggaaa 6181 cagtgattta acaattagag ggcgtctgct atccctggta gaaaaggtga catatctgaa 6241 gaagaagcaa gcagaaaaac cagttgagag tgactccaaa aagtcctcca ctctgcagca 6301 gctgatttct gagaccatgg tccgatgggc tcaggagtct gtcattgaag accccgagct 6361 ggtgagggcc atgtttgtgt tgctccatcg gcagtatgac ggcattgggg gtcttgttcg 6421 ggccctgcca aagacctaca cgataaatgg tgtgtccgtg gaggacacca tcaacctgct 6481 ggcatccctt ggtcagattc ggtccctgct gagtgtgaga atgggcaaag aagaagagaa 6541 gctcatgatt cgtggattag gggatattat gaataacaaa gtgttttacc agcaccctaa 6601 tctcatgagg gcactgggga tgcacgagac tgtgatggag gtcatggtga acgtccttgg 6661 aggtggagag tccaaggaaa tcacctttcc caagatggtg gccaactgtt gccgttttct 6721 ctgttacttc tgtcgtataa gtaggcagaa tcaaaaagct atgtttgatc atctcagtta 6781 tttactggaa aacagcagtg ttggtcttgc ctccccagct atgagaggtt caacaccact 6841 ggatgtggct gcagcttcgg tgatggataa taatgaacta gcattagctc tgcgtgagcc 6901 ggatctagaa aaggtagttc gttatttggc tggttgtgga ctgcaaagtt gccagatgct 6961 ggtgtctaag ggctatccag acattgggtg gaacccagtt gaaggagaga gatatcttga 7021 ctttctcaga tttgctgtct tctgtaatgg ggagagtgtg gaggaaaatg caaatgtcgt 7081 ggtgagattg ctcattcgga ggcctgagtg ttttggtcct gctttgagag gagaaggtgg 7141 gaatgggctt cttgcagcaa tggaagaagc catcaaaatc gccgaggatc cttcccgaga 7201 tggtccctca ccaaatagcg gatccagtaa aacacttgac acagaggagg aggaagatga 7261 cactatccac atggggaacg cgatcatgac cttctattca gctttgattg acctcttggg 7321 acgctgtgct cctgagatgc atttgattca tgccgggaag ggagaagcca tcagaattag 7381 gtccattttg agatccctca ttcccctggg agatttggtg ggcgttatca gcatcgcttt 7441 tcagatgcca acaatagcca aagatgggaa tgtggtggaa cctgacatgt ctgcggggtt 7501 ttgcccagat cacaaggcag ccatggtttt attccttgac agggtctatg ggattgaggt 7561 tcaagacttc ctcctccatc ttcttgaggt tggctttctg ccagatctcc gggcggctgc 7621 ttctttagat acggcagctt tgagtgctac agacatggcc ttggccctca atcggtacct 7681 ttgcacagcc gtcttgccat tgttaacaag atgtgctcct ctctttgctg gcacagagca 7741 ccacgcttct ctcattgact cattacttca tactgtgtat agactttcta agggctgttc 7801 acttaccaaa gctcagcggg attccataga agtttgttta ctctctattt gtggacaact 7861 gagaccttct atgatgcagc acttactcag aagattagta tttgatgtcc cattattaaa 7921 tgaacacgca aagatgcctc ttaaactgct gacaaatcat tatgaaagat gctggaaata 7981 ttactgcctg cctggagggt ggggaaactt tggtgctgcc tcagaagaag aacttcattt 8041 atcaagaaag ttgttctggg gcatttttga tgccctgtct caaaagaaat atgaacaaga 8101 acttttcaaa ctggcactgc cttgcctgag tgcagttgcg ggagctttgc ctccagacta 8161 catggagtca aattatgtca gtatgatgga aaaacagtca tcaatggatt ctgaagggaa 8221 ctttaaccca caacctgttg atacctcaaa tattacaatt cctgagaaat tggaatactt 8281 cattaacaaa tatgcagaac actcccatga caaatggtca atggacaagt tggcaaatgg 8341 atggatttat ggagaaatat attcagactc ttctaaggtt cagccattaa tgaagccata 8401 taagctattg tctgaaaagg aaaaagaaat ttatcgctgg ccaatcaaag aatctttaaa 8461 aactatgctg gctaggacta tgagaactga aagaactcgg gagggagaca gcatggccct 8521 ttacaaccgg actcgtcgta tttctcagac aagccaggtt tctgtggacg ctgcccatgg 8581 ttacagtccc cgggccattg acatgagcaa tgttacacta tctagagacc tgcatgctat 8641 ggcagaaatg atggctgaaa actaccataa tatatgggca aagaaaaaga aaatggagtt 8701 ggagtccaaa ggaggaggaa accatcctct gctggtgccc tatgatacac tgacagccaa 8761 agagaaagcc aaggatagag aaaaagcaca ggacatcctc aagttcttgc agatcaatgg 8821 atatgctgta tccagaggat ttaaggacct ggaactggac acgccttcta ttgagaaacg 8881 atttgcctat agtttcctcc aacaactcat tcgctatgtg gatgaagccc atcagtatat 8941 cctggagttt gatggtggca gcagaggcaa aggagaacat ttcccttatg aacaagaaat 9001 caagttcttt gcaaaagtcg ttcttccttt aattgatcag tatttcaaaa accatcgttt 9061 atacttctta tctgcagcaa gcagacctct ctgctctgga ggacatgctt ccaacaaaga 9121 gaaagaaatg gtgactagcc tattctgcaa acttggagtt cttgtcaggc ataggatttc 9181 actatttggc aatgatgcaa catcaattgt caactgtctt catattttgg gtcagacttt 9241 ggatgcaagg acagtgatga agactggcct ggagagtgtt aaaagtgcac tcagagcttt 9301 tctggacaac gctgcagagg atctggagaa gaccatggaa aacctcaagc agggccagtt 9361 cactcacacc cgaaaccagc ccaaaggggt tactcagatt atcaattaca ccacagtggc 9421 cctgctgcca atgctgtcgt cattatttga acatattggc cagcatcagt tcggagaaga 9481 cctaatattg gaagatgtcc aggtgtcttg ttatagaatt ctgactagct tatatgcttt 9541 gggaaccagc aagagtattt acgtggagag gcaacgttct gcattaggag aatgtctagc 9601 tgcctttgct ggtgcttttc ctgtagcatt tttggaaact catctggaca aacataatat 9661 ttactccatc tacaatacca agtcttcacg agaaagagca gctctcagtt tgccaactaa 9721 tgtggaagat gtttgtccaa acattccgtc tttggagaaa ctcatggaag aaatcgtgga 9781 attagccgag tccggcattc gctacactca aatgccacat gtcatggaag tcatactgcc 9841 catgctttgc agctacatgt ctcgttggtg ggagcatgga cctgagaaca atccagaacg 9901 ggccgagatg tgctgcacag ccctgaactc agagcacatg aacacacttc tagggaacat 9961 attgaaaatc atatataata acttggggat tgatgaggga gcctggatga agaggctagc 10021 agtgttttcc cagcctataa taaataaagt gaaacctcag ctcttgaaaa ctcatttctt 10081 gccgttaatg gagaaactca agaaaaaggc agctacggtg gtgtctgagg aagaccacct 10141 gaaagctgag gccagggggg acatgtcgga ggcagaactc ctcatcctag atgagttcac 10201 cacactggcc agagatctct atgccttcta ccctctcttg attagatttg tggactataa 10261 cagggcaaag tggctaaagg agcctaaccc agaagcagag gagctcttcc gcatggtggc 10321 tgaagtgttt atctactggt cgaagtccca taatttcaaa agagaagagc agaacttcgt 10381 tgtacagaat gaaatcaaca atatgtcttt ccttattact gataccaagt caaagatgtc 10441 aaaggcagct gtttctgatc aggaaaggaa gaaaatgaag cgcaaaggag atcggtattc 10501 catgcagacc tctctgattg tagcagctct gaagcggtta ctgcccattg ggttgaacat 10561 ctgtgcccct ggggaccagg agctcattgc tctggccaaa aatcgattta gcctgaaaga 10621 tactgaggat gaagtacgag atataatccg cagcaatatt catttacaag gcaagttgga 10681 ggatcctgct attagatggc aaatggctct ttacaaagac ttaccaaaca ggactgatga 10741 tacctcagat ccagagaaga cggtagaaag agtattggat atagcaaatg tgctttttca 10801 tcttgaacag aagtctaaac gtgtgggtcg aagacattac tgtctggtgg aacatcctca 10861 gagatctaaa aaggctgtat ggcataaact actgtctaag cagaggaaaa gggctgttgt 10921 agcctgcttc cggatggccc ccttatataa tctgccaagg catcgggctg tcaatctctt 10981 tcttcaggga tatgaaaagt cttggattga aacagaagaa cattactttg aagataaact 11041 gatagaagat ttagcaaaac ctggggctga acctccagaa gaagatgaag gcactaagag 11101 agttgatcct ctacatcagc tgatccttct gtttagtcgg acagctttaa cagagaaatg 11161 caaactggag gaagattttt tatatatggc ctatgcagat attatggcaa agagttgtca 11221 tgatgaggaa gatgacgatg gtgaagagga agtgaagagt tttgaagaaa aagaaatgga 11281 aaagcaaaag cttctatacc agcaagcccg actccacgat cgtggcgcgg ctgagatggt 11341 gctacagaca atcagtgcca gcaaaggtga aactggacca atggtagcag ctactctgaa 11401 acttggaatt gctattttaa atggtgggaa ctccacagta cagcagaaaa tgcttgacta 11461 cctcaaggag aaaaaggatg tgggcttctt tcagagcctg gccggcctga tgcagtcatg 11521 tagtgtcctt gacctaaatg catttgagcg acaaaacaaa gctgaaggtc ttgggatggt 11581 gacagaggaa ggatcaggag aaaaggttct gcaggacgat gagttcacct gtgacctctt 11641 ccgattcctg caactactct gtgagggaca caactcagat tttcagaatt atctgagaac 11701 tcagactggc aataatacaa ctgtcaacat aattatctcc actgtagact acctactgag 11761 agttcaggaa tcaattagtg acttttattg gtattactct gggaaagatg ttattgatga 11821 acaaggacaa cggaatttct ccaaagctat ccaagtggca aaacaagtct ttaacactct 11881 tacagagtat attcagggtc cttgcactgg gaatcaacag agtttggcac acagcaggct 11941 gtgggatgct gtggtcggct ttcttcatgt gtttgcccat atgcagatga agctgtcgca 12001 ggattccagt caaattgagc tattaaaaga attaatggat ctgcagaagg atatggtggt 12061 catgttgctg tccatgttag aaggtaatgt tgttaatgga acgattggca aacagatggt 12121 ggatatgctt gtggaatctt ccaacaacgt ggagatgatt ctcaaatttt ttgacatgtt 12181 cttaaaacta aaggatttga cgtcgtctga tacttttaaa gaatatgacc ccgatggcaa 12241 gggagtcatt tccaagaggg acttccacaa agcgatggag agccataagc actacacgca 12301 gtcagaaacg gaatttcttt tgtcttgtgc ggagacggat gagaatgaaa ccctcgacta 12361 cgaagagttc gtcaaacgct tccacgaacc tgcgaaggac atcggcttca acgtcgccgt 12421 ccttctgaca aacctctctg agcacatgcc caacgatacc cgacttcaga cttttctgga 12481 attagcagag agcgtcctga attatttcca gccctttctg ggccgcatcg aaatcatggg 12541 aagcgccaaa cgcatcgaga gggtctattt tgaaatcagt gagtccagcc gaacccagtg 12601 ggagaagccc caggtcaagg agtccaaaag acagttcata tttgacgtgg tcaacgaagg 12661 cggagagaaa gagaagatgg aactctttgt gaacttctgc gaggacacca tctttgaaat 12721 gcagctggcg gctcagatct cggagtcgga cttgaacgag aggtcagcga ataaggaaga 12781 aagcgagaag gagaggccgg aagagcaggg gccgaggatg gctttcttct ccattctgac 12841 ggtcaggtcg gccctgtttg cgctcaggta caatatcttg acccttatgc gaatgctcag 12901 tctgaagagc ctgaagaagc agatgaaaaa agtaaaaaag atgaccgtga aggacatggt 12961 cacggccttc ttttcatcct actggagtat tttcatgacc ctcttgcact tcgtggccag 13021 cgttttcaga ggctttttcc gcatcatttg cagcctgctg cttgggggaa gcctcgtcga 13081 aggtgctaaa aagatcaaag ttgcagaact gttagccaac atgccagacc ccactcagga 13141 tgaggttaga ggagatgggg aggagggaga gaggaaaccc ctggaagccg ccctgccctc 13201 cgaggatctg accgacttaa aggagctgac agaggaaagt gaccttcttt cggacatctt 13261 tggcctggat ctgaagagag aaggaggaca gtacaaactg attcctcata atccaaatgc 13321 tgggctcagt gacctcatga gcaacccagt ccccatgcct gaggtgcagg aaaaatttca 13381 ggaacagaag gcaaaagaag aagaaaagga agaaaaagaa gaaaccaaat ctgaacctga 13441 aaaagccgag ggagaagatg gagaaaaaga agagaaagcc aaggaagaca agggcaaaca 13501 aaagttgagg cagcttcaca cacacagata cggagaacca gaagtgccag agtcagcatt 13561 ctggaagaaa atcatagcat atcaacagaa acttctaaac tattttgctc gcaactttta 13621 caacatgaga atgttagcct tatttgtcgc atttgctatc aatttcatct tgctctttta 13681 taaggtctcc acttcttctg tggttgaagg aaaggagctc cccacgagaa gttcaagtga 13741 aaatgccaaa gtgacaagcc tggacagcag ctcccataga atcatcgcag ttcactatgt 13801 actagaggag agcagcggct acatggagcc cacgttgcgt atcttagcta ttctgcacac 13861 ggtcatttct ttcttctgca tcattggata ctactgcttg aaagtcccat tggttatttt 13921 taagcgagaa aaggaagtgg cacggaaatt ggaatttgat gggctttata ttacagaaca 13981 gccttcagaa gatgatatta aaggccagtg ggatagactc gtaatcaaca cacagtcatt 14041 tcccaacaac tactgggaca aatttgttaa aagaaaggtt atggataaat atggagagtt 14101 ctacggccga gacagaatca gtgaattact tggcatggac aaggcagctc tggacttcag 14161 tgatgccaga gaaaagaaga agccaaagaa agacagctcc ttatcagctg tactgaactc 14221 cattgatgtg aagtatcaga tgtggaaact aggagtcgtt ttcactgaca actccttcct 14281 ctacctagcc tggtatatga ctatgtctgt tcttggacac tataacaact ttttttttgc 14341 cgctcacctt ctcgacattg ctatgggatt caagacatta agaaccatct tgtcctcagt 14401 aactcacaat ggcaaacagc tcgtattaac cgttggctta ttagctgttg ttgtatacct 14461 atacactgtg gtggcattca attttttccg aaaattctac aataaaagtg aagatggtga 14521 tacaccagat atgaaatgtg acgatatgct aacatgctat atgttccaca tgtatgttgg 14581 agttcgtgct ggaggaggga tcggggatga aatcgaagac ccagcaggag atgaatatga 14641 gatctatcga atcatctttg acatcacttt cttcttcttt gttattgtca ttctcttggc 14701 cataatacaa ggtctaatta ttgatgcttt tggagaacta agagaccaac aggaacaagt 14761 caaagaagac atggagacca aatgcttcat ctgtgggata ggcaatgatt acttcgacac 14821 agtgccacat ggctttgaaa cccacacttt acaggagcac aacttggcta attacttgtt 14881 ttttctgatg tatcttataa acaaagatga aacagaacac acaggacagg aatcttatgt 14941 ctggaagatg tatcaagaaa ggtgttggga atttttccca gcaggggatt gcttccggaa 15001 acagtatgaa gaccagctaa attaaactca gacccaatca cctctaaaaa ccaaaaccct 15061 acccctctct ctccctctct caatttctct gctctcttgg aaacattttg ctgattttgt 15121 gaattgccag cgatgtgtgt tttctgggag catcgaagct ctgtttcgga agagctgttt 15181 cctcccccca ccttttgtat ttactttgag actaaagact gaagaataat ctaaattcat 15241 actcagacaa aaaaaggaat tctggaaaga aaaccattct ggacactgtc ataacacaca 15301 tagatagatt ttcttctgag actcccggag tcttctcgag ctacgagacc ttcacagaga 15361 cacgtggcag ccacactcac ccagcctctt tatttcacca tcctggaagg aaactgtctg 15421 tctaatggtc acagagcact gtagcactta acagattgcc atggacacca gttgcgaagg 15481 gaaatagtgc cttactatat gtgggttgag ctatgcagaa gatacgtgca tgaaaaaaca 15541 tctttatttt ctttatgtcg acctttcttt tcttagattg attttgtgag gttttttttt 15601 tttcctttag tcttttcttt agtgggggag ggtaagaaaa gcagtttgca cttaaaaaga 15661 aaaaaaaaaa acgggtggtg tgtctcagga caaaaggagg ctcttctcat tcagctaaat 15721 tcacatttgc c // LOCUS HSRR2SS 2500 bp RNA PRI 18-NOV-1993 DEFINITION H.sapiens RR2 mRNA for small subunit ribonucleotide reductase. ACCESSION X59618 S40301 S40302 NID g36154 KEYWORDS DNA synthesis enzyme; holoenzyme; ribonucleotide reductase; small subunit ribonucleotide reductase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2500) AUTHORS Mes-Masson,A.M. TITLE Direct Submission JOURNAL Submitted (23-APR-1991) A.M. Mes-Masson, Biotechnology Research Institute, 6100 Royalmount, Montreal, Quebec, H4P 2R2, CANADA REFERENCE 2 (bases 1 to 2500) AUTHORS Pavloff,N., Rivard,D., Masson,S., Shen,S.H. and Mes-Masson,A.M. TITLE Sequence analysis of the large and small subunits of human ribonucleotide reductase JOURNAL DNA Seq. 2 (4), 227-234 (1992) MEDLINE 92329977 COMMENT See also X59617. FEATURES Location/Qualifiers source 1..2500 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="breast carcinoma" /clone_lib="Clonetech 5'stretch lambda" gene 195..1364 /gene="RR2" CDS 195..1364 /gene="RR2" /codon_start=1 /product="small subunit ribonucleotide reductase" /db_xref="PID:g36155" /db_xref="SWISS-PROT:P31350" /translation="MLSLRVPLAPITDPQQLQLSPLKGLSLVDKENTPPALSGTRVLA SKTARRIFQEPTEPKTKAAAPGVEDEPLLRENPRRFVIFPIEYHDIWQMYKKAEASFW TAEEVDLSKDIQHWESLKPEERYFISHVLAFFAASDGIVNENLVERFSQEVQITEARC FYGFQIAMENIHSEMYSLLIDTYIKDPKEREFLFNAIETMPCVKKKADWALRWIGDKE ATYGERVVAFAAVEGIFFSGSFASIFWLKKRGLMPGLTFSNELISRDEGLHCDFACLM FKHLVHKPSEERVREIIINAVRIEQEFLTEALPVKLIGMNCTLMKQYIEFVADRLMLE LGFSKVFRVENPFDFMENISLEGKTNFFEKRVGEYQRMGVMSSPTENSFTLDADF" polyA_signal 1755..1760 polyA_site 2475 BASE COUNT 679 a 522 c 595 g 704 t ORIGIN 1 cccaggcgca gccaatggga agggtcggag gcatggcaca gccaatggga agggccgggg 61 caccaaagcc aatgggaagg gccgggagcg cgcggcgcgg gagatttaaa ggctgctgga 121 gtgaggggtc gcccgtgcac cctgtcccag ccgtcctgtc ctggctgctc gctctgcttc 181 gctgcgcctc cactatgctc tccctccgtg tcccgctcgc gcccatcacg gacccgcagc 241 agctgcagct ctcgccgctg aaggggctca gcttggtcga caaggagaac acgccgccgg 301 ccctgagcgg gacccgcgtc ctggccagca agaccgcgag gaggatcttc caggagccca 361 cggagccgaa aactaaagca gctgcccccg gcgtggagga tgagccgctg ctgagagaaa 421 acccccgccg ctttgtcatc ttccccatcg agtaccatga tatctggcag atgtataaga 481 aggcagaggc ttccttttgg accgccgagg aggttgacct ctccaaggac attcagcact 541 gggaatccct gaaacccgag gagagatatt ttatatccca tgttctggct ttctttgcag 601 caagcgatgg catagtaaat gaaaacttgg tggagcgatt tagccaagaa gttcagatta 661 cagaagcccg ctgtttctat ggcttccaaa ttgccatgga aaacatacat tctgaaatgt 721 atagtcttct tattgacact tacataaaag atcccaaaga aagggaattt ctcttcaatg 781 ccattgaaac gatgccttgt gtcaagaaga aggcagactg ggccttgcgc tggattgggg 841 acaaagaggc tacctatggt gaacgtgttg tagcctttgc tgcagtggaa ggcattttct 901 tttccggttc ttttgcgtcg atattctggc tcaagaaacg aggactgatg cctggcctca 961 cattttctaa tgaacttatt agcagagatg agggtttaca ctgtgatttt gcttgcctga 1021 tgttcaaaca cctggtacac aaaccatcgg aggagagagt aagagaaata attatcaatg 1081 ctgttcggat agaacaggag ttcctcactg aggccttgcc tgtgaagctc attgggatga 1141 attgcactct aatgaagcaa tacattgagt ttgtggcaga cagacttatg ctggaactgg 1201 gttttagcaa ggttttcaga gtagagaacc catttgactt tatggagaat atttcactgg 1261 aaggaaagac taacttcttt gagaagagag taggcgagta tcagaggatg ggagtgatgt 1321 caagtccaac agagaattct tttaccttgg atgctgactt ctaaatgaac tgaagatgtg 1381 cccttacttg gctgattttt tttttccatc tcataagaaa aatcagctga agtgttacca 1441 actagccaca ccatgaattg tccgtaatgt tcattaacag catctttaaa actgtgtagc 1501 tacctcacaa ccagtcctgt ctgtttatag tgctggtagt atcacctttt gccagaaggc 1561 ctggctggct gtgacttacc atagcagtga caatggcagt cttggcttta aagtgagggg 1621 tgacccttta gtgagcttag cacagcggga ttaaacagtc ctttaaccag cacagccagt 1681 taaaagatgc agcctcactg cttcaacgca gattttaatg tttacttaaa tataaacctg 1741 gcactttaca aacaaataaa cattgttttg tactcacggc ggcgataata gcttgattta 1801 tttggtttct acaccaaata cattctcctg accactaatg ggagccaatt cacaattcac 1861 taagtgacta aagtaagtta aacttgtgta gactaagcat gtaattttta agttttattt 1921 taatgaatta aaatatttgt taaccaactt taaagtcagt cctgtgtata cctagatatt 1981 agtcagttgg tgccagatag aagacaggtt gtgtttttat cctgtggctt gtgtagtgtc 2041 ctgggattct ctgccccctc tgagtagagt gttgtgggat aaaggaatct ctcagggcaa 2101 ggagcttctt aagttaaatc actagaaatt taggggtgat ctgggccttc atatgtgtga 2161 gaagccgttt cattttattt ctcactgtat tttcctcaac gtctggttga tgagaaaaaa 2221 ttcttgaaga gttttcatat gtgggagcta aggtagtatt gtaaaatttc aagtcatcct 2281 taaacaaaat gatccaccta agatcttgcc cctgttaagt ggtgaaatca actagaggtg 2341 gttcctacaa gttgttcatt ctagttttgt ttggtgtaag taggttgtgt gagttaattc 2401 atttatattt actatgtctg ttaaatcaga aattttttat tatctatgtt cttctagatt 2461 ttacctgtag ttcataaaaa aaaaaaaaaa aaaaaaaaaa // LOCUS HSRRP22 1296 bp RNA PRI 22-JAN-1997 DEFINITION H.sapiens mRNA for RRP22 protein. ACCESSION Y07847 NID g1666072 KEYWORDS RRP22 protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1296) AUTHORS Zucman-Rossi,J., Legoix,P. and Thomas,G. TITLE Identification of new members of the Gas2 and Ras families in the 22q12 chromosome region JOURNAL Genomics 38 (3), 247-254 (1996) MEDLINE 97131501 REFERENCE 2 (bases 1 to 1296) AUTHORS Zucman-Rossi,J. TITLE Direct Submission JOURNAL Submitted (05-SEP-1996) J. Zucman-Rossi, Inserm U434, Institut Curie, 26 Rue d'Ulm, Paris 75231 Cedex 05, FRANCE FEATURES Location/Qualifiers source 1..1296 /organism="Homo sapiens" /db_xref="taxon:9606" /map="22q12" /dev_stage="fetal" /germline /tissue_type="brain" exon <1..450 /number=1 CDS 322..933 /codon_start=1 /product="RRP22 protein" /db_xref="PID:e280554" /db_xref="PID:g1666073" /translation="MGGSLRVAVLGAPGVGKTAIIRQFLFGDYPERHRPTDGPRLYRP AVLLDGAVYDLSIRDGDVAGPGSSPGGPEEWPDAKDWSLQDTDAFVLVYDICSPDSFD YVKALRQRIAETRPAGAPEAPILVVGNKRDRQRLRFGPRRALAALVRRGWRCGYLECS AKYNWHVLRLFRELLRCALVRARPAHPALRLQGALHPARCSLM" exon 451..665 /number=2 exon 666..>1296 /number=3 polyA_signal 1280..1285 /evidence=experimental polyA_signal 1291..1296 /evidence=experimental BASE COUNT 198 a 454 c 434 g 210 t ORIGIN 1 ccggcgcctg ggttggcgct gcggggcgga ggcggtgtct gagcgccgct ccggctctgc 61 tctctctcga gcttcggcac ccgcccgagc cgctcgcgcg cccgccacct gtctgcccac 121 tcggctgtct gtctgccctc ccgccgccag ctcctgcctc gggcctgccc tctccggtct 181 cggtgctccg aggggcgacg agaagcgcga cggggccgtg gcgcaccggg cagggcgcgc 241 ggggcgcacg gcctgggggc gcacggtgcg gcgccggccc atgaggcttt ccagcgcggg 301 gagcggcagc gccggccggc catggggggt agcctgcggg tggccgttct aggcgccccg 361 ggcgtgggca agacggccat catccgccag ttcctgttcg gtgactaccc cgagcgccac 421 cggcccacgg acgggccgcg cctctaccga cccgcggtgc tgctcgacgg cgccgtctac 481 gacttgagca tccgcgacgg cgacgtcgct ggccccggct cgagccccgg gggtccggag 541 gagtggccag acgctaagga ctggagcttg caggacacgg acgccttcgt gctcgtctac 601 gacatctgca gcccggacag tttcgactac gtgaaggccc tgcggcagcg catcgcggag 661 accaggccgg cgggcgcgcc cgaagcgccc atcctcgtgg taggcaacaa gcgggacagg 721 cagcggctgc gcttcggacc gcggcgcgcg ctggccgccc tagtgcgcag gggctggcgc 781 tgcggctacc tcgagtgctc cgccaagtac aactggcacg tgctgcgtct cttccgcgag 841 ctgctgcgct gcgctctggt gcgcgcgcgc cctgcacacc cggccctgcg cctgcagggg 901 gcgctgcatc ccgcgcgctg cagcctcatg tgacccgatc ggacagtgcc atccatgggc 961 cccaccttgt gactgggaca atcagggacc tggattggac gggatcgccc aacttcactg 1021 ggactggaca gggaagtctc cgccctgatt ggatgaggaa agctccaacc cagtctccta 1081 agcgactggc ccccttttga acctcattgg acccaaccag gtcccaagct ccattggaga 1141 tgaccagtcc tttctgggac ctcaatgggt cacaatccca ttggatggaa aggacttggc 1201 tatgaacttg actggaaaca cgcagcctgc tcctggagct tcactggaca tattctttat 1261 gccacaccta ccacgggata ataaaaggga aaataa // LOCUS HSRRRGBP 987 bp RNA PRI 10-JAN-1996 DEFINITION H.sapiens mRNA for ras-related GTP-binding protein. ACCESSION Z29677 NID g453469 KEYWORDS GTP-binding protein; ras-related protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 987) AUTHORS Gromov,P.S., Madsen,P., Tomerup,N. and Celis,J.E. TITLE A novel approach for expression cloning of small GTPases: identification, tissue distribution and chromosome mapping of the human homolog of rheb JOURNAL FEBS Lett. 377 (2), 221-226 (1995) MEDLINE 96128233 REFERENCE 2 (bases 1 to 987) AUTHORS Gromov,P.S. TITLE Direct Submission JOURNAL Submitted (04-FEB-1994) Pavel S. Gromov, Department of Medical Biochemistry, University of, Aarhus, Ole Worms Alle, Build.170, Aarhus, DK-8000, Denmark FEATURES Location/Qualifiers source 1..987 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="psoriatic keratinocytes" /clone_lib="Lambda ZAPII cDNA library" CDS 24..578 /codon_start=1 /product="Ras-related GTP-binding protein" /db_xref="PID:g453470" /translation="MPQSKSRKIAILGYRSVGKSSLTIQFVEGQFVDSYDPTIENTFT KLITVNGQEYHLQLVDTAGQDEYSIFPQTYSIDINGYILVYSVTSIKSFEVIKVIHGK LLDMVGKVQIPIMLVGNKKDLHMERVISYEEGKALAESWNAAFLESSAKENQTAVDVF RRIILEAEKMDGAASQGKSSCSVM" polyA_signal 967..972 polyA_site 987 BASE COUNT 297 a 173 c 200 g 317 t ORIGIN 1 ccggggctga ggaggccgcc aagatgccgc agtccaagtc ccggaagatc gcgatcctgg 61 gctaccggtc tgtggggaaa tcctcattga cgattcaatt tgttgaaggc caatttgtgg 121 actcctacga tccaaccata gaaaacactt ttacaaagtt gatcacagta aatggacaag 181 aatatcatct tcaacttgta gacacagccg ggcaagatga atattctatc tttcctcaga 241 catactccat agatattaat ggctatattc ttgtgtattc tgttacatca atcaaaagtt 301 ttgaagtgat taaagttatc catggcaaat tgttggatat ggtggggaaa gtacaaatac 361 ctattatgtt ggttgggaat aagaaagacc tgcatatgga aagggtgatc agttatgaag 421 aagggaaagc tttggcagaa tcttggaatg cagctttttt ggaatcttct gctaaagaaa 481 atcagactgc tgtggatgtt tttcgaagga taattttgga ggcagaaaaa atggacgggg 541 cagcttcaca aggcaagtct tcatgctcgg tgatgtgatt ctgctgcaaa gcctgaggac 601 actgggaata tattctacct gaagaagcaa actgcccgtt ctccttgaag ataaactatg 661 cttctttttt cttctgttaa cctgaaagat atcatttggg tcagagctcc cctcccttca 721 gattatgtta actctgagtc tgtccaaatg agttcacttc cattttcaaa ttttaagcaa 781 tcatattttc aatttatata ttgtatttct taatattatg accaagaatt ttatcggcat 841 taatttttca gtgtagtttg ttgtttaaaa taatgtaatc atcaaaatga tgcatattgt 901 tacactacta ttaactaggc ttcagtatat cagtgtttat ttcattgtgt taaatgtata 961 cttgtaaata aaatagctgc aaacctc // LOCUS HSS100A 607 bp RNA PRI 16-DEC-1994 DEFINITION Human mRNA for S100 alpha protein. ACCESSION X58079 S46852 NID g36175 KEYWORDS calcium binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 607) AUTHORS Engelkamp,D. TITLE Direct Submission JOURNAL Submitted (25-FEB-1991) D. Engelkamp, Division of Clinical Chemistry, Dept of Pediatrics, University of Zurich, Steinwiesstr 75, CH-8032 Zurich, SWITZERLAND REFERENCE 2 (bases 1 to 607) AUTHORS Engelkamp,D., Schafer,B.W., Erne,P. and Heizmann,C.W. TITLE S100 alpha, CAPL, and CACY: molecular cloning and expression analysis of three calcium-binding proteins from human heart JOURNAL Biochemistry 31 (42), 10258-10264 (1992) MEDLINE 93041710 FEATURES Location/Qualifiers source 1..607 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="heart" mRNA 1..607 /note="S100 alpha protetein" /evidence=experimental CDS 114..398 /codon_start=1 /product="S100 alpha protein" /db_xref="PID:g36176" /db_xref="SWISS-PROT:P23297" /translation="MGSELETAMETLINVFHAHSGKEGDKYKLSKKELKELLQTELSG FLDAQKDVDAVDKVMKELDENGDGEVDFQEYVVLVAALTVACNNFFWENS" BASE COUNT 148 a 195 c 150 g 114 t ORIGIN 1 ggactgttga agacaggtct ccacacacag ctccagcagc cacatttgca accttggcca 61 tctgtccaga acctgctccc acctcaggcc caggccaacc gtgcactgct gcaatgggct 121 ctgagctgga gacggcgatg gagaccctca tcaacgtgtt ccacgcccac tcgggcaaag 181 agggggacaa gtacaagctg agcaagaagg agctgaaaga gctgctgcag acggagctct 241 ctggcttcct ggatgcccag aaggatgtgg atgctgtgga caaggtgatg aaggagctag 301 acgagaatgg agacggggag gtggacttcc aggagtatgt ggtgcttgtg gctgctctca 361 cagtggcctg taacaatttc ttctgggaga acagttgagc agacagccac attgggcagc 421 gcccttcctc tccaccctcc cagacctgcc tcttccccct gcttccacct caccccactt 481 atccctctcc ataaccccac ccttgcccac cccaccccca cccccaccaa gggcgcaaga 541 gtagcggtcc aagcctgcaa ctcatctttc attaaaggct tctctctcac cagcaaaaaa 601 aaaaaaa // LOCUS HSS100A13 481 bp RNA PRI 25-NOV-1996 DEFINITION H.sapiens mRNA for S100 calcium-binding protein A13. ACCESSION X99920 NID g1694827 KEYWORDS calcium-binding; S100 protein; S100A13 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 481) AUTHORS Wicki,R., Schafer,B.W., Erne,P. and Heizmann,C.W. TITLE Characterization of the human and mouse cDNAs coding for S100A13, a new member of the S100 protein family JOURNAL Biochem. Biophys. Res. Commun. 227 (2), 594-599 (1996) MEDLINE 97032809 REFERENCE 2 (bases 1 to 481) AUTHORS Wicki,R. TITLE Direct Submission JOURNAL Submitted (09-AUG-1996) R. Wicki, Department Of Pediatrics, Division Of Clinical Chemistry, Steinwiesstrasse 75, Ch-8032 Zurich, SWITZERLAND FEATURES Location/Qualifiers source 1..481 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="q21; S100 gene cluster" gene 74..370 /gene="S100A13" CDS 74..370 /gene="S100A13" /note="expressed in heart and skeletal muscle" /codon_start=1 /product="S100 calcium-binding protein A13 (S100A13)" /db_xref="PID:e268253" /db_xref="PID:g1694828" /translation="MAAEPLTELEESIETVVTTFFTFARQEGRKDSLSVNEFKELVTQ QLPHLLKDVGSLDEKMKSLDVNQDSELKFNEYWRLIGELAKEIRKKKDLKIRKK" polyA_site 457 BASE COUNT 145 a 110 c 137 g 89 t ORIGIN 1 tctccttgcc gggtcagccc tgacaaaggt cagctagccc cttgaggaca tcagctttgg 61 cctcagggtc ctaatggcag cagaaccact gacagagcta gaggagtcca ttgagaccgt 121 ggtcaccacc ttcttcacct ttgcaaggca ggagggccgg aaggatagcc tcagcgtcaa 181 cgagttcaaa gagctggtta cccagcagtt gccccatctg ctcaaggatg tgggctctct 241 tgatgagaag atgaagagct tggatgtgaa tcaggactcg gagctcaagt tcaatgagta 301 ctggagattg attggggagc tggccaagga aatcaggaag aagaaagacc tgaagatcag 361 gaagaagtaa agccgcctgg ctgagatggg gtgggcaggg cagagctgat cagggccgag 421 cagaaccgca ctcttcccaa ataaagcttc ctccttgaaa aaaaaaaaaa aaaaaaaaaa 481 a // LOCUS HSS100D 710 bp RNA PRI 17-FEB-1997 DEFINITION H.sapiens mRNA for S100D calcium binding protein. ACCESSION Z18954 NID g396706 KEYWORDS calcium binding protein; calcium binding protein S100D. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 710) AUTHORS Engelkamp,D. TITLE Direct Submission JOURNAL Submitted (07-DEC-1992) Dieter Engelkamp, Pediatrics, Division of Clinical Chemistry, University of Zurich, Steinwiesstrasse 75, Zurich, CH-8032, Switzerland REFERENCE 2 (bases 1 to 710) AUTHORS Engelkamp,D., Schafer,B.W., Mattei,M.G., Erne,P. and Heizmann,C.W. TITLE Six S100 genes are clustered on human chromosome 1q21: identification of two genes coding for the two previously unreported calcium-binding proteins S100D and S100E JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90 (14), 6547-6551 (1993) MEDLINE 93342029 FEATURES Location/Qualifiers source 1..710 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="Kidney" /clone="S100D" /chromosome="1q21" exon 1..181 exon 182..268 CDS 229..561 /codon_start=1 /product="S100D calcium binding protein" /db_xref="PID:g396707" /db_xref="SWISS-PROT:P33763" /translation="MPAAWILWAHSHSELHTVMETPLEKALTTMVTTFHKYSGREGSK LTLSRKELKELIKKELCLGEMKESSIDDLMKSLDKNSDQEIDFKEYSVFLTMLCMAYN DFFLEDNK" exon 269..420 exon 421..709 allele 443 /replace="g" polyA_signal 683..688 polyA_site 710 BASE COUNT 167 a 180 c 192 g 171 t ORIGIN 1 tcccacactt ctgaggtttt ctttccagga cagcctggtt tcccttcttc ggcttattgt 61 tccatcagat ttcagatttt gagttctgat ttttggtcag aagagtaaag tttctgggat 121 tggggacgtg tgtgctatgg gaactcagtg tgtccccagc ccttgtttgt aaacaaggaa 181 gggacagaga tcagggaaat aaaggcagaa ggcagtgaga gggaggctat gcctgctgct 241 tggattctct gggctcactc ccacagtgag ctgcacactg tgatggagac tcctctggag 301 aaggccctga ccactatggt gaccacgttt cacaaatatt cggggagaga gggtagcaaa 361 ctgaccctga gtaggaagga actcaaggag ctgatcaaga aagagctgtg tcttggggag 421 atgaaggaga gcagcatcga tgacttgatg aagagcctgg acaagaacag cgaccaggag 481 atcgacttca aggagtactc ggtgttcctg accatgctgt gcatggccta caacgacttc 541 tttctagagg acaacaagtg accagggctg ccctccaccc tcaccctcca ccctttgctg 601 ctgacctcgg ctgctcctct cacagaccct ctttggcccc ctgccctcct ctccctccca 661 gatggaccct tccatgggag gaaataaagt ttccatcgca ggtgctggga // LOCUS HSS100E 738 bp RNA PRI 11-MAY-1994 DEFINITION H.sapiens mRNA for S100E calcium binding protein. ACCESSION Z18948 NID g396712 KEYWORDS calcium binding protein; calcium binding protein S100E. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 738) AUTHORS Engelkamp,D., Schafer,B.W., Mattei,M.G., Erne,P. and Heizmann,C.W. TITLE Six S100 genes are clustered on human chromosome 1q21: identification of two genes coding for the two previously unreported calcium-binding proteins S100D and S100E JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90 (14), 6547-6551 (1993) MEDLINE 93342029 REFERENCE 2 (bases 1 to 738) AUTHORS Engelkamp,D. TITLE Direct Submission JOURNAL Submitted (07-DEC-1992) Dieter Engelkamp, Pediatrics, Division of Clinical Chemistry, University of Zurich, Steinwiesstrasse 75, Zurich, CH-8032, Switzerland FEATURES Location/Qualifiers source 1..738 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="heart" /clone="S100E" /chromosome="1q21" exon 1..78 exon 79..224 CDS 84..389 /codon_start=1 /product="S100E calcium binding protein" /db_xref="PID:g396713" /db_xref="SWISS-PROT:P33764" /translation="MARPLEQAVAAIVCTFQEYAGRCGDKYKLCQAELKELLQKELAT WTPTEFRECDYNKFMSVLDTNKDCEVDFVEYVRSLACLCLYCHEYFKDCPSEPPCSQ" exon 225..738 polyA_signal 723..728 BASE COUNT 151 a 220 c 212 g 155 t ORIGIN 1 agtctcagat tggtaaacac ccgaactggt caactctcaa gagaccatct ggttcaggtt 61 cctgactggg ccagcgagtg aggatggcca ggcctctgga gcaggcggta gctgccatcg 121 tgtgcacctt ccaggaatac gcagggcgct gtggggacaa atacaagctc tgccaggcgg 181 agctcaagga gctgctgcag aaggagctgg ccacctggac cccgactgag tttcgggaat 241 gtgactacaa caaattcatg agtgttctgg acaccaacaa ggactgcgag gtggactttg 301 tggagtatgt gcgctcactt gcctgcctct gtctctactg ccacgagtac ttcaaggact 361 gcccctcaga gcccccctgc tcccagtagc ctctgctcca gggggtgcgc tggctgtcgg 421 gggctgggca tgtctcccac accccctcct accctctctc ctgtacccct ttcaatctgg 481 acttgcccag gtcttctgcg atcagttaac ccattttacc taggaggccc agagatgtga 541 gggctccttc ctcaggatgc ccagcgaatg aggggtagag ccactctggg gcccagcctg 601 cctgccgcac ccctgtggcc tcccttgtgg atgggaggag gcgggatctg ctctgaggcc 661 ctcgaggctc agcagagcgt gcaccaatga gaccacgatg ggaaagggcc tatttaactc 721 ctaataaaaa actggcat // LOCUS HSS100PCB 439 bp RNA PRI 20-JUL-1993 DEFINITION H.sapiens mRNA for calcium-binding protein S100P. ACCESSION X65614 S40553 NID g36177 KEYWORDS calcium binding protein; S100 protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 439) AUTHORS Gerke,V. TITLE Direct Submission JOURNAL Submitted (15-APR-1992) V. Gerke, Max-Planck-Institut f, Biophysical Chemistry, Dept of Biochemistry, Am Fassberg, D 3400 Goettingen, FRG REFERENCE 2 (bases 1 to 439) AUTHORS Becker,T., Gerke,V., Kube,E. and Weber,K. TITLE S100P, a novel Ca(2+)-binding protein from human placenta. cDNA cloning, recombinant protein expression and Ca2+ binding properties JOURNAL Eur. J. Biochem. 207 (2), 541-547 (1992) MEDLINE 92339442 FEATURES Location/Qualifiers source 1..439 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="placenta" /clone_lib="lambda ZAPII" CDS 22..309 /codon_start=1 /product="S100P calcium-binding protein" /db_xref="PID:g36178" /db_xref="SWISS-PROT:P25815" /translation="MTELETAMGMIIDVFSRYSGSEGSTQTLTKGELKVLMEKELPGF LQSGKDKDAVDKLLKDLDANGDAQVDFSEFIVFVAAITSACHKYFEKAGLK" BASE COUNT 118 a 101 c 123 g 97 t ORIGIN 1 ggtgggtctg aatctagcac catgacggaa ctagagacag ccatgggcat gatcatagac 61 gtcttttccc gatattcggg cagcgagggc agcacgcaga ccctgaccaa gggggagctc 121 aaggtgctga tggagaagga gctaccaggc ttcctgcaga gtggaaaaga caaggatgcc 181 gtggataaat tgctcaagga cctggacgcc aatggagatg cccaggtgga cttcagtgag 241 ttcatcgtgt tcgtggctgc aatcacgtct gcctgtcaca agtactttga gaaggcagga 301 ctcaaatgat gccctggaga tgtcacagat tcctgcagag ccatggtccc aggcttccca 361 aaagtgtttg ttggcaatta ttcccctagg ctgagcctgc tcatgtacct ctgattaata 421 aatgcttatg aaaaaaaaa // LOCUS HSSA1 4337 bp RNA PRI 09-OCT-1997 DEFINITION H.sapiens mRNA for nuclear protein SA-1. ACCESSION Z75330 NID g2204212 KEYWORDS SA-1. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4337) AUTHORS Carramolino,L., Lee,B., Zaballos,A., Peled,A., Barthelemy,I., Shav-Tal,Y., Prieto,I., Carmi,P., Gothelf,Y., Gonzalez de Buitrago,G., Aracil,M., Marquez,G., Barbero,J. and Zipori,D. TITLE SA-1, a nuclear protein encoded by one member of a novel gene family: molecular cloning and detection in hemopoietic organs JOURNAL Gene 195 (2), 151-159 (1997) MEDLINE 97449290 REFERENCE 2 (bases 1 to 4337) AUTHORS Zaballos,A. TITLE Direct Submission JOURNAL Submitted (24-JUN-1996) Angel Zaballos, Research, Pharmacia & Upjohn, Antonio Lopez 109, Madrid, Madrid, 28026, SPAIN FEATURES Location/Qualifiers source 1..4337 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="thymus" CDS 401..4177 /codon_start=1 /product="nuclear protein SA-1" /db_xref="PID:e250093" /db_xref="PID:g2204213" /translation="MITSELPVLQDSTNETTAHSDAGSELEETEVKGKRKRGRPGRPP STNKKPRKSPGEKSRIEAGIRGAGRGRANGHPQQNGEGEPVTLFEVVKLGKSAMQSVV DDWIESYKQDRDIALLDLINFFIQCSGCRGTVRIEMFRNMQNAEIIRKMTEEFDEDSG DYPLTMPGPQWKKFRSNFCEFIGVLIRQCQYSIIYDEYMMDTVISLLTGLSDSQVRAF RHTSTLAAMKLMTALVNVALNLSIHQDNTQRQYEAERNKMIGKRANERLELLLQKRKE LQENQDEIENMMNSIFKGIFVHRYRDAIAEIRAICIEEIGVWMKMYSDAFLNDSYLKY VGWTLHDRQGEVRLKCLKALQSLYTNRELFPKLELFTNRFKDRIVSMTLDKEYDVAVE AIRLVTLILHGSEEALSNEDCENVYHLVYSAHRPVAVAAGEFLHKKLFSRHDPQAEEA LAKRRGRNSPNGNLIRMLVLFFLESELHEHAAYLVDSLWESSQELLKDWECMTELLLE EPVQGEEAMSDRQESALIELMVCTIRQAAEAHPPVGRGTGKRVLTAKERKTQIDDRNK LTEHFIITLPMLLSKYSADAEKVANLLQIPQYFDLEIYSTGRMEKHLDALLKQIKFVV EKHVESDVLEACSKTYSILCNEEYTIQNRVDIARSQLIDEFVDRFNHSVEDLLQEGEE ADDDDIYNVLSTLKRLTSFQNAHDLTKWDLFGNCYRLLKTGIEHGAMPEQIVVQALQC SHYSILWQLVKITDGSPSKEDLLVLRKTVKSFLAVCQQCLSNVNTPVKEQAFMLLCDL LMIFSHQLMTGGREGLQPLVFNPDTGLQSELLSFVMDHVFIDQDEENQSMEGDEEDEA NKIEALHKRRNLLAAFSKLIIYDIVDMHAAADIFKHYMKYYNDYGDIIKETLSKTRQI DKIQCAKTLILSLQQLFNELVQEQGPNLDRTSAHVSGIKELARRFALTFGLDQIKTRE AVATLHKDGIEFAFKYQNQKGQEYPPPNLAFLEVLSEFSSKLLRQDKKTVHSYLEKFL TEQMMERREDVWLPLISYRNSLVTGGEDDRMSVNSGSSSSKTSSVRNKKGRPPLHKKR VEDESLDNTWLNRTDTMIQTPGPLPAPQLTSTVLRENSRPMGDQIQEPESEHGSEPDF LHNPQMQISWLGQPKLEDLNRKDRTGMNYMKVRTGVRHAVRGLMEEDAEPIFEDVMMS SRSQLEDMNEEFEDTMVIDLPPSRNRRERAELRPDFFDSAAIIEDDSGFGMPMF" BASE COUNT 1354 a 809 c 1025 g 1149 t ORIGIN 1 ggctgtgaca ctaatactta acatggtggt tgtgtctctt tatgcctgac tcaatcagtt 61 gaaatccaaa agtaagttct tccttgattt acctgccaag acctgagttc aggccctcag 121 ggtgctgagg ttttcctttg tgggagaaaa tgccaccaga tggcgggtta ggattgcagc 181 tccgttgaag gcgcggcccc cgctcccgaa cccccggcga ccaccccgta acaacccccc 241 cacatcggga ataacacacc ggagactttt ggggggaaac taggtcgatg gtcggcggcg 301 ccggatgggc agctgaggat tgcctttgag gttattttaa aagttttgag ttgtacagca 361 cttgattatt ttgctgcatt gtgaaaggac ctctccagca atgattactt cagaattacc 421 agtgttacag gattcaacta atgaaactac tgcccattcc gatgctggca gcgagcttga 481 agaaacagag gtcaaaggaa aaagaaaaag gggtcgtcct ggccggcctc catctacaaa 541 taagaaacct cgaaaatctc caggtgagaa gagcagaatt gaagctggaa ttagaggagc 601 aggccgtgga agagctaatg gacaccctca acagaatggg gaaggggagc ctgtcacatt 661 atttgaggtg gtgaaactgg ggaaaagtgc aatgcagtcc gtggtggatg actggattga 721 atcatataaa caagacaggg acatcgcact tctggattta atcaactttt ttatccagtg 781 ttcaggatgt cgaggtactg tgagaataga gatgtttcga aatatgcaga atgcagaaat 841 catcagaaaa atgactgaag aatttgatga ggacagtggt gattatcctc ttaccatgcc 901 tggacctcag tggaaaaaat ttcgttcaaa cttttgtgaa tttattggag tcctgattcg 961 acagtgtcag tatagcataa tttatgatga gtatatgatg gacacagtaa tctccctttt 1021 gacgggtttg tcagactccc aggtcagagc ttttaggcat acaagtaccc tggctgccat 1081 gaagctcatg actgctctgg tgaatgttgc cttaaacctc agtattcatc aggataatac 1141 ccagagacaa tatgaagccg agagaaataa aatgattggg aagagagcca atgaaaggtt 1201 ggagttacta cttcagaaac gcaaagagct gcaagaaaat caggatgaaa tcgaaaatat 1261 gatgaactct atttttaagg gtatatttgt tcatagatac cgtgatgcta ttgctgagat 1321 tagagccatt tgtattgaag aaattggagt atggatgaaa atgtatagtg atgccttcct 1381 aaatgacagt tacctaaaat atgttggctg gactcttcat gacaggcaag gggaagtcag 1441 gctgaagtgt ttgaaagctc tgcagagtct atataccaat agagaattat tccccaaatt 1501 ggaactattc actaaccgat tcaaggatcg cattgtatca atgacacttg ataaagaata 1561 tgatgttgct gtggaagcta ttcgattggt tactctgata cttcatggaa gtgaagaagc 1621 tctttccaat gaagactgtg aaaatgttta ccacttggtg tactcggcac atcgccctgt 1681 tgctgtggca gctggagagt tccttcacaa aaagctattt agcagacatg acccacaagc 1741 agaagaagca ttagcaaaga ggaggggaag aaacagcccg aatggaaacc tcattaggat 1801 gctggttctt ttctttcttg aaagtgagtt acatgaacat gcagcctact tggtggacag 1861 tttatgggag agctctcaag aactgttgaa agactgggaa tgtatgacag agttgctatt 1921 agaagaacct gttcaaggag aggaagcaat gtctgatcgt caagagagtg ctcttataga 1981 gctaatggtt tgtacaattc gtcaagctgc tgaggcacat cctccagtgg gaaggggtac 2041 cggcaagaga gtgctaactg ccaaagaaag gaaaactcaa attgatgata gaaacaaatt 2101 gactgaacat tttattatta cacttcctat gttactgtca aagtattctg cagatgcaga 2161 gaaggtagca aacttgctac aaatcccaca gtattttgat ttagaaatct acagcacagg 2221 tagaatggaa aagcatctgg atgctttatt aaaacagatt aagtttgttg tggagaaaca 2281 cgtagaatca gatgttctag aagcctgcag taaaacctat agtatcttat gcaatgaaga 2341 atataccatc cagaacagag ttgacatagc tcgaagccag ctgattgatg agtttgtaga 2401 tcgattcaat cattctgtgg aagacctatt gcaagaggga gaagaagctg atgatgatga 2461 catttacaat gttctttcta cattaaagcg gttaacttct tttcagaatg cacatgatct 2521 cacaaaatgg gatctctttg gtaattgcta cagattattg aagactggaa ttgaacatgg 2581 agccatgcca gaacagatag tcgtgcaagc actgcagtgt tcccattatt cgattctttg 2641 gcagttggtg aaaattactg atggctctcc ttccaaagag gatttgttgg tattgaggaa 2701 aacggtgaaa tcctttttgg ctgtttgcca gcagtgcctg tctaatgtta atactccagt 2761 gaaagaacag gctttcatgt tactctgtga tcttctgatg attttcagcc accaattaat 2821 gacaggtggc agagagggcc ttcagccttt ggtgttcaat ccagatactg gactccaatc 2881 tgaactcctc agttttgtga tggatcacgt ttttattgac caagacgagg agaaccagag 2941 catggagggt gatgaagaag atgaagctaa taaaattgag gccttacata aaagaaggaa 3001 tctacttgct gctttcagca aacttatcat ttatgacatt gttgacatgc atgcagctgc 3061 agacatcttc aaacactaca tgaagtatta caatgactat ggtgatatta ttaaggaaac 3121 actgagtaaa accaggcaga ttgataaaat tcagtgtgcc aagactctca ttctcagttt 3181 gcaacagtta tttaatgaac ttgttcaaga gcaaggtccc aacctagata ggacatctgc 3241 ccatgtcagt ggcattaaag aactggcacg tcgctttgcc cttacatttg gattggacca 3301 gattaagaca cgagaagcag ttgccacact tcacaaggat ggcatagagt ttgcatttaa 3361 ataccaaaat cagaaaggac aagagtatcc acctcctaat ctggcttttc ttgaagtact 3421 aagtgaattt tcttctaaac ttcttcgaca ggacaaaaag acagttcatt catacctaga 3481 gaaattcctt accgagcaga tgatggaaag gagggaggat gtatggcttc cactcatctc 3541 ctatagaaat tcattagtca ctgggggtga agatgataga atgtctgtga acagtggaag 3601 tagcagcagc aaaacctcat cagtaaggaa taagaaagga cgacctccac ttcataaaaa 3661 acgagtagaa gatgagagtc tggataacac atggctaaac aggactgaca ccatgattca 3721 gactcctggc cccctgccag caccacaact cacatccact gtactgcggg agaacagtcg 3781 gcccatggga gaccagattc aagaacctga gtctgaacat ggttctgaac cagacttttt 3841 acacaatcct cagatgcaga tctcttggtt aggccagccg aagttagaag acttaaatcg 3901 gaaggacaga acaggaatga actacatgaa agtgagaact ggagtgaggc atgctgttcg 3961 gggtctaatg gaggaagatg ctgagcccat ctttgaagat gtgatgatgt catcccgaag 4021 ccagttagaa gatatgaatg aagaatttga ggacaccatg gttattgatc tgcctccatc 4081 aagaaatcgg cgagagagag ctgagctaag gccagacttc tttgactctg cagctatcat 4141 agaagatgat tcaggatttg gaatgcctat gttctgaagt ctgaagaaaa tttacaaatc 4201 tggaactcta ttatttagag ctagaggcct atatactgtg atagcttgta tggggaaaaa 4261 caacttttga tgtgatctga tttgtttttt aatcaaatga ttaaggtcaa tccctttttg 4321 cagtgacaga agaggag // LOCUS HSSA2 4170 bp RNA PRI 09-OCT-1997 DEFINITION H.sapiens mRNA for nuclear protein SA-2. ACCESSION Z75331 NID g2204214 KEYWORDS SA-2. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4170) AUTHORS Carramolino,L., Lee,B., Zaballos,A., Peled,A., Barthelemy,I., Shav-Tal,Y., Prieto,I., Carmi,P., Gothelf,Y., Gonzalez de Buitrago,G., Aracil,M., Marquez,G., Barbero,J. and Zipori,D. TITLE SA-1, a nuclear protein encoded by one member of a novel gene family: detection in hemopoietic organs JOURNAL Unpublished REFERENCE 2 (bases 1 to 4170) AUTHORS Zaballos,A. TITLE Direct Submission JOURNAL Submitted (24-JUN-1996) Angel Zaballos, Research, Pharmacia & Upjohn, Antonio Lopez 109, Madrid, Madrid, 28026, SPAIN REFERENCE 3 (bases 1 to 4170) AUTHORS Carramolino,L., Lee,B., Zaballos,A., Peled,A., Barthelemy,I., Shav-Tal,Y., Prieto,I., Carmi,P., Gothelf,Y., Gonzalez de Buitrago,G., Aracil,M., Marquez,G., Barbero,J. and Zipori,D. TITLE SA-1, a nuclear protein encoded by one member of a novel gene family: molecular cloning and detection in hemopoietic organs JOURNAL Gene 195 (2), 151-159 (1997) MEDLINE 97449290 FEATURES Location/Qualifiers source 1..4170 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="thymus" CDS 650..4138 /codon_start=1 /product="nuclear protein SA-2" /db_xref="PID:e250094" /db_xref="PID:g2204215" /translation="MNGHHQQNGVENMMLFEVVKMGKSAMQSVVDDWIESYKHDRDIA LLDLINFFIQCSGCKGVVTAEMFRHMQNSEIIRKMTEEFDEDSGDYPLTMAGPQWKKF KSSFCEFIGVLVRQCQYSIIYDEYMMDTVISLLTGLSDSQVRAFRHTSTLAAMKLMTA LVNVALNLSINMDNTQRQYEAERNKMIGKRANERLELLLQKRKELQENQDEIENMMNA IFKGVFVHRYRDAIREIRAICIEEIGIWMKMYSDAFLNDSYLKYVGWTMHDKQGEVRL KCLTALQGLYYNKELNSKLELFTSRFKDRIVSMTLDKEYDVAVQAIKLLTLVLQSSEE VLTAEDCENVYHLVYSAHRPVAVAAGEFLYKKLFSRRDPEEDGMMKRRGRQGPNANLV KTLVFFFLESELHEHAAYLVDSMWDCATELLKDWECMNSLLLEEPLSGEEALTDRQES ALIEIMLCTIRQAAECHPPVGRGTGKRVLTAKEKKTQLDDRTKITELFAVALPQLLAK YSVDAEKVTNLLQLPQYFDLEIYTTGRLENDLDALLRQIRNIVEKHTDTDVLEACSKT YHALCNEEFTIFNRVDISRSQLIDELADKFNRLLEDFLQEGEEPDEDDAYQVLSTLKR ITAFHNAHDLSKRDLFACNYKLLKTGIENGDMPEQIVIHALQCTHYVILWQLAKITES SSTKEDLLRLKKQMRVFCQICQHYLTNVNTTVKEQAFTILCDILMIFSHQIMSGGRDM LEPLVYTPDSSLQSELLSFILDHVFIEQDDDNNSADGQQEDEASKIEALHKRRNLLAA FCKLIVYTVVEMNTAADIFKQYMKYYNDYGDIIKETMSKTRQIDKIQCAKTLILSLQQ LFNEMIQENGYNFDRSSSTFSGIKELARRFALTFGLDQLKTREAIAMLHKDGIEFAFK EPNPQGESHPPLNLAFLDILSEFSSKLLRQDKRTVYVYLEKFMTFQMSLRREDVWLPL MSYRNSLLAGGDDDTMSVISGISSRGSTVRSKKSKPSTGKRKVVEGMQLSLTEESSSS DSMWLTREQTLHTPVMMQTPQLTSTIMREPKRLRPEDSFMSVYPKQTEHHQTPLDYNR RGTSLMEDDEEPIVEDVMMSSEGRIEDLNEGMDFDTMDIDLPPSKNRRERTELKPDFF DPASIMDESVLGVSMF" polyA_site 4170 BASE COUNT 1425 a 695 c 909 g 1141 t ORIGIN 1 tcaacatatg aaaaatcaat gtaatatttc atattaacag ttggaagact cacaattcat 61 agtttcagaa tttacaacac aaacctacag gaataaagac aatgtagtat tagaataagg 121 tcagatatat agatcaatgg acagaattga cagtccagaa atacacccat acgattacgg 181 tcacctgatt tttgactaag gttccaaaac aattcactag gggaaagaat agtcttgtca 241 aaaaatggtg ctcagacaag tggatagcca taagcacaga atgagtttgg acccctatct 301 caaatctcaa atacaaacat taactccgaa ttaatcaaat gcctaaattt aagagctaaa 361 attttaaagc tcttagaaga aaacataggt acacatcttt gtgactttga attaggtcac 421 ggtttccttg acatgacact taaaagtaca agcaacaaaa gaaaaaaata gataaaatga 481 acttcagcaa aattagaatg tttatgcttc agaaaacact gtgaagaaag tgatcagaca 541 actcacagaa tgggaaaaat atttgtgaat catatctctt aataagggtc cagcagaaaa 601 gggcaaaggt ggaaatggag gaggaaaacc tccttctggt ccaaaccgaa tgaatggtca 661 tcaccaacag aatggagtgg aaaacatgat gttgtttgaa gttgttaaaa tgggcaagag 721 tgctatgcag tcggtggtag atgattggat agaatcatac aagcatgacc gagatatagc 781 acttcttgac cttatcaact tttttattca gtgttcaggc tgtaaaggag ttgtcacagc 841 agaaatgttt agacatatgc agaactctga gataattcga aaaatgactg aagaattcga 901 tgaggatagt ggagattatc cacttaccat ggctggtcct cagtggaaga agttcaaatc 961 cagtttttgt gaattcattg gcgtgttagt acggcaatgt caatatagta tcatatatga 1021 tgagtatatg atggatacag tcatttcact tcttacagga ttgtctgact cacaagtcag 1081 agcatttcga catacaagca ccctggcagc tatgaagttg atgacagctt tggtgaatgt 1141 ggcactaaat cttagcatta atatggataa tacacaaaga caatatgaag cagaacgaaa 1201 taaaatgatt gggaaacgag ccaatgagag gctagaactc ctgctacaaa agcggaaaga 1261 gcttcaggaa aatcaagatg aaatagaaaa tatgatgaat gcaatattta aaggagtgtt 1321 tgtacataga taccgtgatg cgatacgtga aattcgagct atttgcattg aagagattgg 1381 catttggatg aagatgtata gtgatgcctt tcttaatgac agttatttaa aatatgttgg 1441 ttggactatg catgataagc aaggtgaagt aagactcaaa tgtcttactg ctctacaagg 1501 gctttattat aacaaagagc ttaattccaa actggaactt tttaccagtc ggttcaagga 1561 tagaattgtg tctatgaccc ttgacaaaga atatgatgtt gcagtacaag caataaaatt 1621 actcactctt gttttacaga gtagtgaaga agttctcact gcagaagatt gtgaaaatgt 1681 ctatcatctg gtttattcag ctcaccggcc agtagcagta gcagctggag aatttctcta 1741 caaaaagctc ttcagtcgta gagatccaga ggaggatgga atgatgaaaa gaagaggaag 1801 acaaggtcca aatgccaacc ttgttaagac attggttttt ttctttctag aaagtgagtt 1861 acatgagcat gcagcatacc ttgtggatag catgtgggac tgtgctactg agctgctgaa 1921 agactgggaa tgtatgaata gcttgttact ggaagagcca cttagtggag aggaagcact 1981 aacagatagg caagagagtg ctctgattga aataatgctt tgtaccatta gacaagcggc 2041 tgaatgtcat cctcccgtgg gaagagggac aggaaaaagg gtgcttacag caaaggagaa 2101 gaagacacag ttggatgata ggacaaaaat cactgagctt tttgccgtgg cccttcctca 2161 gttattagca aaatactctg tagatgcaga aaaggtgact aacttgttgc agttgcctca 2221 gtactttgat ttggaaatat ataccactgg acgattagaa aacgatttgg atgccttatt 2281 gcgacagatc cggaatattg tagagaagca cacagataca gatgttttgg aagcatgttc 2341 taaaacttac catgcactct gtaatgaaga gttcacaatc ttcaacagag tagatatttc 2401 aagaagtcaa ctgatagatg aattggcaga taaatttaac cggcttcttg aagattttct 2461 gcaagagggt gaagaacctg atgaagatga tgcatatcag gtattgtcaa cattgaagag 2521 gatcactgct tttcataatg cccatgacct ttcaaagagg gatttatttg cttgtaatta 2581 caaactcttg aaaactggaa tcgaaaatgg agacatgcct gagcagattg ttattcacgc 2641 actgcagtgt actcactatg taatcctttg gcaacttgct aagataactg aaagcagctc 2701 tacaaaggag gacttgctgc gtttaaagaa acaaatgaga gtattttgtc agatatgtca 2761 acattacctg accaacgtga atactactgt taaggaacag gccttcacta ttctgtgtga 2821 tattttgatg atcttcagcc atcagattat gtcaggaggg cgtgacatgt tagagccatt 2881 agtgtatacc cctgattctt cattgcagtc tgagttgctc agctttattt tggatcatgt 2941 cttcattgaa caggatgatg ataataatag tgcagatggt cagcaagagg atgaagccag 3001 taaaattgaa gctctgcaca agagaagaaa tttacttgca gcattttgta agctaattgt 3061 atatactgtg gtggagatga atacagctgc agatatcttc aaacagtata tgaagtatta 3121 taatgactat ggagatatca tcaaagaaac aatgagtaaa acaaggcaga tagacaaaat 3181 tcagtgtgct aagaccctta ttctcagtct gcaacagctt tttaatgaaa tgatacaaga 3241 aaatggctat aattttgata gatcatcctc tacatttagt ggcataaaag aacttgctcg 3301 acgttttgct ttaacttttg gacttgatca gttgaaaaca agagaagcca ttgccatgct 3361 acacaaagat ggcatagaat ttgcttttaa agagcctaat ccgcaagggg agagccatcc 3421 acctttaaat ttggcatttc ttgatattct gagtgaattt tcttctaaac tacttcgaca 3481 agacaaaaga acagtgtatg tttacttgga aaagttcatg acctttcaga tgtcactccg 3541 aagagaggat gtgtggcttc cactgatgtc ttaccgaaat tctttgctag ctggtggtga 3601 tgatgacacc atgtcagtca ttagtggaat cagcagccgg gggtcaacag tacggagtaa 3661 aaaatcaaaa ccatctacag gaaaacggaa agtggttgag ggcatgcagc tttcactcac 3721 tgaagaaagt agtagtagtg acagtatgtg gttaacgaga gaacaaacac tgcacacccc 3781 tgttatgatg cagacaccac aactcacctc cactattatg agagagccca aaagattacg 3841 gcctgaggat agcttcatga gtgtttatcc taagcagact gaacatcatc aaacacctct 3901 tgattataat cggcgtggca caagcctaat ggaagatgat gaagagccaa ttgtggaaga 3961 tgttatgatg tcctcagaag ggaggattga ggatcttaat gagggaatgg attttgacac 4021 catggatata gatttgccac catcaaagaa cagacgagag agaacagaac tgaagcctga 4081 tttctttgat ccagcttcaa ttatggatga atcagttctt ggagtgtcaa tgttttaata 4141 ccagtacaca attaaatctg tggtgaagtc // LOCUS HSSADSYNA 1283 bp RNA PRI 30-NOV-1993 DEFINITION H.sapiens mRNA for S-adenosylmethionine synthetase. ACCESSION X68836 S47859 NID g36326 KEYWORDS isozyme; methylation-regulation; s-adenosylmethionine synthetase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1283) AUTHORS Horikawa,S. and Tsukada,K. TITLE Molecular cloning and developmental expression of a human kidney S-adenosylmethionine synthetase JOURNAL FEBS Lett. 312 (1), 37-41 (1992) MEDLINE 93050159 FEATURES Location/Qualifiers source 1..1283 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" CDS 66..1253 /EC_number="2.5.1.6" /codon_start=1 /product="S-adenosylmethionine synthetase" /db_xref="PID:g36327" /db_xref="SWISS-PROT:P31153" /translation="MNGQLNGFHEAFIEEGTFLFTSESVGEGHPDKICDQISDAVLDA HLQQDPDAKVACETVAKTGMILLAGEITSRAAVDYQKVVREAVKHIGYDDSSKGFDYK TCNVLVALEQQSPDIAQGVHLDRNEEDIGAGDQGLMFGYATDETEECMPLTIVLAHKL NAKLAELRRNGTLPWLRPDSKTQVTVQYMQDRGAVLPIRVHTIVISVQHDEEVCLDEM RDALKEKVIKAVVPAKYLDEDTIYHLQPSGRFVIGGPQGDAGLTGRKIIVDTYGGWGA HGGGAFSGKDYTKVDRSAAYAARWVAKSLVKGGLCRRVLVQVSYAIGVSHPLSISIFH YGTSQKSERELLEIVKKNFDLRPGVIVRDLDLKKPIYQRTAAYGHFGRDSFPWEVPKK LKY" BASE COUNT 324 a 281 c 327 g 351 t ORIGIN 1 tttcgcagcc gctgccgcct cgccgctgct ccttcgtaag gccacttccg cacaccgaca 61 ccaacatgaa cggacagctc aacggcttcc acgaggcgtt catcgaggag ggcacattcc 121 ttttcacctc agagtcggtc ggggaaggcc acccagataa gatttgtgac caaatcagtg 181 atgctgtcct tgatgcccac cttcagcagg atcctgatgc caaagtagct tgtgaaactg 241 ttgctaaaac tggaatgatc cttcttgctg gggaaattac atccagagct gctgttgact 301 accagaaagt ggttcgtgaa gctgttaaac acattggata tgatgattct tccaaaggtt 361 ttgactacaa gacttgtaac gtgctggtag ccttggagca acagtcacca gatattgctc 421 aaggtgttca tcttgacaga aatgaagaag acattggtgc tggagaccag ggcttaatgt 481 ttggctatgc cactgatgaa actgaggagt gtatgccttt aaccattgtc ttggcacaca 541 agctaaatgc caaactggca gaactacgcc gtaatggcac tttgccttgg ttacgccctg 601 attctaaaac tcaagttact gtgcagtata tgcaggatcg aggtgctgtg cttcccatca 661 gagtccacac aattgttata tctgttcagc atgatgaaga ggtttgtctt gatgaaatga 721 gggatgccct aaaggagaaa gtcatcaaag cagttgtgcc tgcgaaatac cttgatgagg 781 atacaatcta ccacctacag ccaagtggca gatttgttat tggtgggcct cagggtgatg 841 ctggtttgac tggacggaaa atcattgtgg acacttatgg cggttggggt gctcatggag 901 gaggtgcctt ttcaggaaag gattatacca aggtcgaccg ttcagctgct tatgctgctc 961 gttgggtggc aaaatccctt gttaaaggag gtctgtgccg gagggttctt gttcaggtct 1021 cttatgctat tggagtttct catccattat ctatctccat tttccattat ggtacctctc 1081 agaagagtga gagagagcta ttagagattg tgaagaagaa tttcgatctc cgccctgggg 1141 tcattgtcag ggatctggat ctgaagaagc caatttatca gaggactgca gcctatggcc 1201 actttggtag ggacagcttc ccatgggaag tgcccaaaaa gcttaaatat tgaaagtgtt 1261 agcctttttt ccccagactt gtt // LOCUS HSSAMRNA 1528 bp RNA PRI 02-FEB-1995 DEFINITION H.sapiens SA mRNA. ACCESSION X80062 NID g663208 KEYWORDS intron; SA gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1528) AUTHORS Nabika,T., Bonnardeaux,A., James,M., Julier,C., Jeunemaitre,X., Corvol,P., Lathrop,M. and Soubrier,F. TITLE Evaluation of the SA locus in human hypertension JOURNAL Hypertension 25 (1), 6-13 (1995) MEDLINE 95146110 REFERENCE 2 (bases 1 to 1528) AUTHORS Nabika,T. TITLE Direct Submission JOURNAL Submitted (06-JUL-1994) T. Nabika, INSERM U36, College de France, 3 rue d'Ulm, 75005 Paris, FRANCE REFERENCE 3 (bases 1 to 1528) AUTHORS Iwai,N., Ohmichi,N., Hanai,K., Nakamura,Y. and Kinoshita,M. TITLE Human SA gene locus as a candidate locus for essential hypertension JOURNAL Hypertension 23 (3), 375-380 (1994) MEDLINE 94171328 FEATURES Location/Qualifiers source 1..1528 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /clone_lib="human liver cDNA library" gene 161..1453 /gene="SA" CDS 161..1453 /gene="SA" /codon_start=1 /db_xref="PID:g663209" /translation="MLRHAKCFQRLAIFGSVRALHKDNRTATPQNFSNYESMKQDFKL GIPEYFNFAKDVLDQWTDKEKAGKKPSNPAFWWINRNGEEMRWSFEELGSLSRKFANI LSEACSLQRGDRVILILPRVPEWWLANVACLRTGTVLIPGTTQLTQKDILYRLQSSKA NCIITNDVLAPAVDAVASKCENLHSKLIVSENSREGWGNLKELMKHASDSHTCVKTKH NEIMAIFFTSGTSGYPKMTAHTHSSFGLGLSVNGRFWLDLTPSDVMWNTSDTGWAKSA WSSVFSPWIQGACVFTHHLPRFEPTSILQTLSKYPITVFCSAPTVYRMLVQNDITSYK FKSLKHCVSAGEPITPDVTEKWRNKTGLDIYEGYGQTETVLICGNFKGMKIKPGSMGK PSPAFDVKVCTSPSRRMFNNPICTLPTYRLPPYKLSLL" variation 529..532 /gene="SA" /citation=[3] misc_feature 566..567 /gene="SA" /note="splicing site" variation 846 /gene="SA" /citation=[3] variation 869 /gene="SA" /citation=[3] variation 901 /gene="SA" /citation=[3] misc_feature 1279..1280 /gene="SA" /note="splicing site" misc_feature 1350..1351 /gene="SA" /note="alternative splice site" BASE COUNT 471 a 328 c 332 g 397 t ORIGIN 1 gaagagggag aactcaaggt tcagggctgc tcttctaaga aacaagtctg ccataatctc 61 catctgtgtt ggaatctgtt aactaatgaa ctggtctctg tgcaaatcct gagtgctaaa 121 gcttccaaca agactgatgc tagctcgtgt caccaggaag atgctacgtc atgccaagtg 181 ttttcagcgc ctagcaattt ttggttctgt gagggcactg cataaagata atagaacagc 241 aacccctcag aatttctcca actatgaatc catgaaacag gacttcaaac tggggattcc 301 agagtatttc aactttgcta aagatgtcct ggaccaatgg actgataagg aaaaggctgg 361 aaagaaacct tcaaatccag ccttctggtg gatcaacaga aatggagaag agatgcgatg 421 gagttttgag gaactgggat ctctgtccag aaaatttgcc aatatacttt cagaagcctg 481 ttccctacaa agaggagatc gggtaattct gattctgccc agggtcccag agtggtggct 541 tgcaaatgtg gcctgtctgc gaacagggac agttttaatt ccaggaacca ctcagctgac 601 ccagaaagac attctctaca gactacaatc ttcaaaagca aactgcatta tcaccaatga 661 tgttttagcc ccagcagtag acgctgttgc atccaaatgt gaaaatctgc actccaagct 721 gattgtatca gagaactcca gagaggggtg ggggaacctc aaggagttga tgaaacatgc 781 cagtgacagc cacacctgtg tgaagacaaa acacaatgag atcatggcca tattctttac 841 cagtggaaca agtggatatc cgaaaatgac tgcacacacc cacagcagtt ttggtttagg 901 attatctgta aatggaaggt tctggctaga tttgacaccc tcagatgtga tgtggaatac 961 ctcagatacg ggctgggcaa agtctgcatg gagtagtgtt ttttctccgt ggatccaggg 1021 agcatgtgta ttcacacacc atttaccccg ttttgagccg acttctatct tgcaaacact 1081 ctccaagtac cccatcacag tcttctgttc agcaccaact gtataccgaa tgcttgtaca 1141 gaatgatata accagctata agtttaaaag cttaaagcac tgtgtgagtg ctggggaacc 1201 aattacccct gacgtgactg aaaaatggag aaacaagacg ggcctggata tctacgaagg 1261 atatggacag actgaaacgg tgctaatctg tggaaatttt aagggaatga aaattaaacc 1321 tggctcaatg ggaaaacctt ctcctgcttt cgatgttaag gtttgcacat ccccttccag 1381 gagaatgttt aacaacccaa tctgtacact acctacctac cgcttacccc catataaact 1441 ttctttgtta tgatggtgat tccattttac ttccatgata ctttaatttt tataaatatg 1501 tgaaaatgat tagaaaaaaa aaaaaaaa // LOCUS HSSAP1MNR 319 bp RNA PRI 01-JUL-1997 DEFINITION H.sapiens mRNA for skin-antimicrobial-peptide 1 (SAP1). ACCESSION Z71389 NID g2239127 KEYWORDS SAP1; skin-antimicrobial peptide 1. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 319) AUTHORS Harder,J., Bartels,J., Christophers,E. and Schroder,J.M. TITLE A peptide antibiotic from human skin JOURNAL Nature 387 (6636), 861 (1997) MEDLINE 97345625 REFERENCE 2 (bases 1 to 319) AUTHORS Harder,J. TITLE Direct Submission JOURNAL Submitted (21-JUN-1996) Harder J., Christian-Albrechts-Universitaet zu Kiel, Dermatology/Hautklinik, Mol. Biol. Lab. 609, Schittenhelmstr. 7, Kiel, Schleswig-Holstein, Germany, D-24105 FEATURES Location/Qualifiers source 1..319 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="23a(396), 4Fa(596), 46a(596)" /tissue_type="skin" /cell_type="keratinocyte" mRNA 1..319 CDS 24..218 /function="antimicrobial peptide (beta-defensin-family)" /codon_start=1 /product="skin-antimicrobial peptide 1 (SAP1)" /db_xref="PID:e249983" /db_xref="PID:g2239128" /translation="MRVLYLLFSFLFIFLMPLPGVFGGIGDPVTCLKSGAICHPVFCP RRYKQIGTCGLPGTKCCKKP" sig_peptide 24..92 mat_peptide 93..215 /product="skin-antimicrobial-peptide 1 (SAP1)" BASE COUNT 80 a 77 c 76 g 86 t ORIGIN 1 ggtgaagctc ccagccatca gccatgaggg tcttgtatct cctcttctcg ttcctcttca 61 tattcctgat gcctcttcca ggtgtttttg gtggtatagg cgatcctgtt acctgcctta 121 agagtggagc catatgtcat ccagtctttt gccctagaag gtataaacaa attggcacct 181 gtggtctccc tggaacaaaa tgctgcaaaa agccatgagg aggccaagaa gctgctgtgg 241 ctgatgcgga ttcagaaagg gctccctcat cagagacgtg cgacatgtaa accaaattaa 301 actatggtgt ccaaagata // LOCUS HSSBLA 1619 bp RNA PRI 13-DEC-1994 DEFINITION Human mRNA for ribonucleoprotein SS-B/La. ACCESSION X13697 NID g36414 KEYWORDS RNA binding protein; small nuclear ribonucleoprotein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1619) AUTHORS Chan,E.K.L. TITLE Direct Submission JOURNAL Submitted (05-DEC-1988) Chan E.K.L., Scripps Clinic and Research Foundation, 10666 N Torrey Pines road, La Jolla, CA 92037, USA REFERENCE 2 (bases 1 to 1619) AUTHORS Chan,E.K., Sullivan,K.F. and Tan,E.M. TITLE Ribonucleoprotein SS-B/La belongs to a protein family with consensus sequences for RNA-binding JOURNAL Nucleic Acids Res. 17 (6), 2233-2244 (1989) MEDLINE 89202037 REFERENCE 3 (bases 1 to 1619) AUTHORS Troster,H., Metzger,T.E., Semsei,I., Schwemmle,M., Winterpacht,A., Zabel,B. and Bachmann,M. TITLE One gene, two transcripts: isolation of an alternative transcript encoding for the autoantigen La/SS-B from a cDNA library of a patient with primary Sjogrens' syndrome JOURNAL J. Exp. Med. 180 (6), 2059-2067 (1994) MEDLINE 95053740 FEATURES Location/Qualifiers source 1..1619 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="lymphoblastoma" /cell_type="T cell origin" /cell_line="MOLT-4" /clone_lib="lambda gt11 MOLT-4" /clone="H9" CDS 73..1299 /note="ribonucleoprotein SS-B/La (AA 1-408)" /codon_start=1 /db_xref="PID:g36415" /db_xref="SWISS-PROT:P05455" /translation="MAENGDNEKMAALEAKICHQIEYYFGDFNLPRDKFLKEQIKLDE GWVPLEIMIKFNRLNRLTTDFNVIVEALSKSKAELMEISEDKTKIRRSPSKPLPEVTD EYKNDVKNRSVYIKGFPTDATLDDIKEWLEDKGQVLNIQMRRTLHKAFKGSIFVVFDS IESAKKFVETPGQKYKETDLLILFKDDYFAKKNEERKQNKVEAKLRAKQEQEAKQKLE EDAEMKSLEEKIGCLLKFSGDLDDQTCREDLHILFSNHGEIKWIDFVRGAKEGIILFK EKAKEALGKAKDANNGNLQLRNKEVTWEVLEGEVEKEALKKIIEDQQESLNKWKSKGR RFKGKGKGNKAAQPGSGKGKVQFQGKKTKFASDDEHDEHDENGATGPVKRAREETDKE EPASKQQKTENGAGDQ" misc_feature 1575..1580 /note="pot. polyA signal" BASE COUNT 636 a 222 c 353 g 408 t ORIGIN 1 ggagtcgttg ttgttgctgt ttgtgagcct gtgcggcggc ttctgtgggc cggaacctta 61 aagatagccg caatggctga aaatggtgat aatgaaaaga tggctgccct ggaggccaaa 121 atctgtcatc aaattgagta ttattttggc gacttcaatt tgccacggga caagtttcta 181 aaggaacaga taaaactgga tgaaggctgg gtacctttgg agataatgat aaaattcaac 241 aggttgaacc gtctaacaac agactttaat gtaattgtgg aagcattgag caaatccaag 301 gcagaactca tggaaatcag tgaagataaa actaaaatca gaaggtctcc aagcaaaccc 361 ctacctgaag tgactgatga gtataaaaat gatgtaaaaa acagatctgt ttatattaaa 421 ggcttcccaa ctgatgcaac tcttgatgac ataaaagaat ggttagaaga taaaggtcaa 481 gtactaaata ttcagatgag aagaacattg cataaagcat ttaagggatc aatttttgtt 541 gtgtttgata gcattgaatc tgctaagaaa tttgtagaga cccctggcca gaagtacaaa 601 gaaacagacc tgctaatact tttcaaggac gattactttg ccaaaaaaaa tgaagaaaga 661 aaacaaaata aagtggaagc taaattaaga gctaaacagg agcaagaagc aaaacaaaag 721 ttagaagaag atgctgaaat gaaatctcta gaagaaaaga ttggatgctt gctgaaattt 781 tcgggtgatt tagatgatca gacctgtaga gaagatttac acatactttt ctcaaatcat 841 ggtgaaataa aatggataga cttcgtcaga ggagcaaaag aggggataat tctatttaaa 901 gaaaaagcca aggaagcatt gggtaaagcc aaagatgcaa ataatggtaa cctacaatta 961 aggaacaaag aagtgacttg ggaagtacta gaaggagagg tggaaaaaga agcactgaag 1021 aaaataatag aagaccaaca agaatcccta aacaaatgga agtcaaaagg tcgtagattt 1081 aaaggaaaag gaaagggtaa taaagctgcc cagcctgggt ctggtaaagg aaaagtacag 1141 tttcagggca agaaaacgaa atttgctagt gatgatgaac atgatgaaca tgatgaaaat 1201 ggtgcaactg gacctgtgaa aagagcaaga gaagaaacag acaaagaaga acctgcatcc 1261 aaacaacaga aaacagaaaa tggtgctgga gaccagtagt ttagtaaacc aattttttat 1321 tcattttaaa taggttttaa acgacttttg tttgcggggc ttttaaaagg aaaaccgaat 1381 taggtccact tcaatgtcca cctgtgagaa aggaaaaatt tttttgttgt ttaacttgtc 1441 tttttgttat gcaaatgaga tttctttgaa tgtattgttc tgtttgtgtt atttcagatg 1501 attcaaatat caaaaggaag attcttccat taaattgcct ttgtaatatg agaatgtatt 1561 agtacaaact aactaataaa atatatacta tatgaaaaga gcaaaaaaaa aaaaaaaaa // LOCUS HSSCA7 3969 bp mRNA PRI 03-FEB-1998 DEFINITION Homo Sapiens mRNA for spinocerebellar ataxia 7. ACCESSION AJ000517 NID g2370154 KEYWORDS SCA7 gene; spinocerebellar ataxia 7. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3969) AUTHORS David,G., Abbas,N., Stevanin,G., Durr,A., Yvert,G., Cancel,G., Weber,C., Imbert,G., Saudou,F., Antoniou,E., Drabkin,H., Gemmill,R., Giunti,P., Benomar,A., Wood,N., Ruberg,M., Agid,Y., Mandel,J.L. and Brice,A. TITLE Cloning of the SCA7 gene reveals a highly unstable CAG repeat expansion JOURNAL Nature Genet. 17 (1), 65-70 (1997) MEDLINE 97434213 REFERENCE 2 (bases 1 to 3969) AUTHORS Del-Favero,J., Krols,L., Michalik,A., Theuns,J., Loefgren,A., Goossens,D., Wehnert,A., Van den Bossche,D., Zand,K.V., Backhovens,H., van Regenmorter,N., MARTIN,J.J. and Van Broeckhoven,C. TITLE Molecular genetic analysis of autosomal dominant cerebellar ataxia with retinal degeneration (ADCA type II) caused by CAG triplet repeat expansion JOURNAL Hum. Mol. Genet. 7, 177-186 (1998) REFERENCE 3 (bases 1 to 3969) AUTHORS Brice,A. TITLE Direct Submission JOURNAL Submitted (22-JUL-1997) Brice A., Hopital de la Salpetriere, INSERM U289, 47 bd. de l Hopital, 75651 Paris Cedex 13, FRANCE FEATURES Location/Qualifiers source 1..3969 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="3" /sex="Male" /clone_lib="lambda SCREEN-1" /lab_host="E.coli" /map="p12-p13" /tissue_lib="lymphoblastoid cell line" gene 562..3240 /gene="SCA7" CDS 562..3240 /gene="SCA7" /codon_start=1 /product="spinocerebellar ataxia 7" /db_xref="PID:e330267" /db_xref="PID:g2370155" /translation="MSERAADDVRGEPRRAAAAAGGAAAAAARQQQQQQQQQQPPPPQ PQRQQHPPPPPRRTRPEDGGPGAASTSAAAMATVGERRPLPSPEVMLGQSWNLWVEAS KLPGKDGTELDESFKEFGKNREVMGLCREDMPIFGFCPAHDDFYLVVCNDCNQVVKPQ AFQSHYERRHSSSSKPPLAVPPTSVFSFFPSLSKSKGGSASGSNRSSSGGVLSASSSS SKLLKSPKEKLQLRGNTRPMHPIQQSRVPHGRIMTPSVKVEKIHPKMDGTLLKSAVGP TCPATVSSLVKPGLNCPSIPKPTLPSPGQILNGKGLPAPPTLEKKPEDNSNNRKFLNK RLSEREFDPDIHCGVIDLDTKKPCTRSLTCKTHSLTQRRAVQGRRKRFDVLLAEHKNK TREKELIRHPDSQQPPQPLRDPHPAPPRTSQEPHQNPHGVIPSESKPFVASKPKPHTP SLPRPPGCPAQQGGSAPIDPPPVHESPHPPLPATEPASRLSSEEGEGDDKEESVEKLD CHYSGHHPQPASFCTFGSRQIGRGYYVFDSRWNRLRCALNLMVEKHLNAQLWKKIPPV PSTTSPISTRIPHRTNSVPTSQCGVSYLAAATVSTSPVLLSSTCISPNSKSVPAHGTT LNAQPAASGAMDPVCSMQSRQVSSSSSSPSTPSGLSSVPSSPMSRKPQKLKSSKSLRP KESSGNSTNCQNASSSTSGGSGKKRKNSSPLLVHSSSSSSSSSSSSHSMESFRKNCVA HSGPPYPSTVTSSHSIGLNCVTNKANAVNVRHDQSGRGPPTGSPAESIKRMSVMVNSS DSTLSLGPFIHQSNELPVNSHGSFSHSHTPLDKLIGKKRKCSPSSSSINNSSSKPTKV AKVPAVNNVHMKHTGTIPGAQGLMNSSLLHQPKARP" BASE COUNT 1028 a 1090 c 939 g 912 t ORIGIN 1 cgttgctgtc gaaagggtga aagagaaact tggcgacctc ccggaggagt tcgcgaagcg 61 accaggagcg tgttgccatc gtcctcaccc ggcacccaat tccaccacag agtcgggatt 121 tcgtcggtga tcgtgatggg gtgcttttat ttttctcttt gattttcaaa aaatgtctat 181 gtgactgtcc ctatcttaag gggaagttga aagtgggggc gggggtgctc aatgagaaac 241 gttgccttgt gtgtagttgt ttggagcaca ctgcaaatta tattggcatc tctttccaaa 301 agtcactttg attcaacttc gatagctttc tcgtaaatgg cacgtttagg tggtgagagg 361 tggatgagga aacaggcacc agtgcagctg atttgacctc cagtgggata gatacgatta 421 gcaccaggat cgtgtctcat tttgaaccca gatctgaaca gaattaagac gaacgagctt 481 tcacaattgc agcagatgaa gatccattgg taaattgatc aggatttttg gcctaccctc 541 caaagaaaag gagcggaaag aatgtcggag cgggccgcgg atgacgtcag gggggagccg 601 cgccgcgcgg cggcggcggc gggcggagca gcggccgcgg ccgcccggca gcagcagcag 661 cagcagcagc agcagcagcc gccgcctccg cagccccagc ggcagcagca cccgccaccg 721 ccgccacggc gcacacggcc ggaggacggc gggcccggcg ccgcctccac ctcggccgcc 781 gcaatggcga cggtcgggga gcgcaggcct ctgcccagtc ctgaagtgat gctgggacag 841 tcgtggaatc tgtgggttga ggcttccaaa cttcctggga aggacgggac agaattggac 901 gaaagtttca aggagtttgg gaaaaaccgc gaagtcatgg ggctctgtcg ggaagacatg 961 ccaatatttg gtttctgtcc agcccatgat gatttctact tggtggtgtg taacgactgt 1021 aatcaggttg tcaaaccgca ggcatttcaa tcacattatg aaagaagaca tagctcatcc 1081 agcaagccgc ctttggccgt tcctcccact tcagtatttt ccttcttccc ttctctgtcc 1141 aaaagcaaag gaggcagtgc aagtggaagc aaccgttctt ccagtggagg tgttcttagc 1201 gcatcctcat caagttccaa gttgttgaaa tcacccaaag agaaactgca gctcaggggg 1261 aacaccaggc caatgcatcc cattcagcaa agtagagttc cccatggtag aatcatgaca 1321 ccctctgtga aagtggaaaa gattcatccg aaaatggatg gcacactact gaaatctgcg 1381 gtggggccaa cctgtcctgc tactgtgagt tccttagtca agcctggcct taactgcccc 1441 tcaataccaa agccaacctt gccttcacct ggacagattc tgaatggcaa agggcttcct 1501 gcaccgccca ctctggaaaa gaaacctgaa gacaattcca ataataggaa atttttaaat 1561 aagagattat cagaaagaga gtttgatcct gacatccact gtggggttat tgatctcgac 1621 accaagaagc cctgcacccg gtctttgaca tgcaagacac attccttaac ccagcgcagg 1681 gctgtccagg gtagaagaaa acgatttgat gtgttattag ccgagcacaa aaacaaaacc 1741 agggaaaagg aattgattcg ccatccggac tctcagcaac caccgcagcc tctcagggac 1801 ccgcatcccg cccctcctag aacgtcacag gagccgcacc aaaaccctca cggagtgatt 1861 ccttccgaat caaagccttt tgtagctagt aaacctaaac ctcacacccc cagtcttcca 1921 aggcctccag gctgccctgc tcagcaaggt gggagtgccc ccattgaccc tcctccagtc 1981 catgaatctc cacaccctcc cctgcctgcc actgagccag cttctcggtt atccagtgag 2041 gagggcgaag gcgatgacaa agaagagtct gttgaaaaac tggactgtca ttattcaggt 2101 catcatcctc agccagcatc tttttgcaca tttgggagcc ggcagatagg aagaggctat 2161 tacgtgtttg actccaggtg gaatcgactt cgctgcgccc tcaacctcat ggtggagaag 2221 catctgaatg cacagctatg gaagaaaatc ccaccagtgc ccagtaccac ctcacccatc 2281 tccacacgta ttcctcaccg gacaaactct gtgccgacat cacaatgtgg agtcagctat 2341 ctggcagcag ccaccgtctc tacatcccca gtcctgctct catctacctg catctcccca 2401 aatagcaaat cggtaccagc tcatggaacc acactaaatg cacagcctgc tgcttcaggg 2461 gcgatggatc ctgtgtgcag tatgcaatcc agacaagtgt cctcttcatc ctcatcccct 2521 tccacgccct ctggcctttc ctcggttcct tcctccccca tgtccaggaa acctcagaaa 2581 ttgaaatcca gcaaatcttt gaggcccaag gagtcttctg gtaacagcac taactgtcaa 2641 aatgccagta gcagtaccag tggcggctca ggaaagaaac gcaaaaacag ttccccactg 2701 ttggttcact cttcctcctc ctcttcctcc tcctcctctt cttctcattc catggagtct 2761 tttaggaaaa actgtgtggc tcactctggg cctccctacc cctcaacggt aacatcttcc 2821 catagcatcg gcctcaactg tgtgacgaat aaagcaaatg cggtgaacgt ccggcatgac 2881 cagtcaggga ggggcccccc caccgggagc cctgctgaat ccatcaagag gatgagtgtg 2941 atggtgaaca gcagtgattc tactctttct cttgggccat tcattcacca gtccaatgaa 3001 ctgcctgtca actcccacgg cagtttttcc cactcacaca ctcctctaga caaactcata 3061 ggaaagaaaa gaaagtgctc acccagctcg agcagcatca acaacagcag cagcaaaccc 3121 acaaaggttg ccaaagtgcc agccgtgaac aatgtccaca tgaaacacac aggcaccatc 3181 ccaggggcac aaggactgat gaacagttcc ctccttcatc agccaaaggc acgtccctga 3241 cagctgaaaa tagcacgggg aggaataatg cggacacttt tgaggacaag ttacacctcc 3301 actcagcact ctggactcca cgatgccttt gagtctgttt tcccaacctc ctgtgggcct 3361 caagggtaga aacctgccgg gctgttgttt taacgaggat ttccctgaag ctatgtctct 3421 agcagtgagt actcataaag gacactggat caagttcagc caccgaattg cttttatcag 3481 tgttaaagtg gtctgaactg cttgctacca atctgtgaga agtttttgtt tttgttttgt 3541 tttttaactt gcagtatatc acagagccac tcttcaagtt agattggctg ggcaaaagaa 3601 tgttttggca agagcgttac tgtagacctt tctccctcct tccttttact accatttttt 3661 tttaacactg tcatctgtag gtcactctcc agcagttagg caccttaact ggagaccaga 3721 aaccttccag agaacacagg gctgcatccc gagcaaccct ctgaagaagg gaattaggct 3781 ttagattttg atagcaatgt tccaggaatg aaatatagat gttagcccaa gacaccatga 3841 caaaatagcc cagccttttg agagtaattt gggaaaagaa gctgtcagaa gtttctaact 3901 tacaaactgg tttgaaattt ttgatgccca gacagcaagt atcgacagca acggaattcg 3961 agctccgtc // LOCUS HSSCNN1B 2564 bp RNA PRI 03-OCT-1995 DEFINITION H.sapiens mRNA for beta subunit of epithelial amiloride-sensitive sodium channel. ACCESSION X87159 NID g1004270 KEYWORDS beta subunit; epithelial amiloride-sensitive sodium channel; epithelial sodium channel; SCNN1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2564) AUTHORS Voilley,N., Bassilana,F., Mignon,C., Merscher,S., Mattei,M.G., Carle,G.F., Lazdunski,M. and Barbry,P. TITLE Cloning, chromosomal localization, and physical linkage of the beta and gamma subunits (SCNN1B and SCNN1G) of the human epithelial amiloride-sensitive sodium channel JOURNAL Genomics 28 (3), 560-565 (1995) MEDLINE 96039270 REFERENCE 2 (bases 1 to 2564) AUTHORS Voilley,N. TITLE Direct Submission JOURNAL Submitted (11-MAY-1995) N. Voilley, CNRS-UPR411, Inst.de Pharmacologie molecularie et cellulaire, 660 route des Lucioles Sophia Antipolis, F- 06560 Valbonne, FRANCE COMMENT Related sequence: U16023. FEATURES Location/Qualifiers source 1..2564 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="lung" /chromosome="16" /map="16p12-13" gene 128..2050 /gene="SCNN1B" CDS 128..2050 /gene="SCNN1B" /codon_start=1 /product="epithelial amiloride-sensitive sodium channel, beta-subunit" /db_xref="PID:g1004271" /translation="MHVKKYLLKGLHRLQKGPGYTYKELLVWYCDNTNTHGPKRIICE GPKKKAMWFLLTLLFAALVCWQWGIFIRTYLSWEVSVSLSVGFKTMDFPAVTICNASP FKYSKIKHLLKDLDELMEAVLERILAPELSHANATRNLNFSIWNHTPLVLIDERNPHH PMVLDLFGDNHNGLTSSSASEKICNAHGCKMAMRLCSLNRTQCTFRNFTSATQALTEW YILQATNIFAQVPQQELVEMSYPGEQMILACLFGAEPCNYRNFTSIFYPHYGNCYIFN WGMTEKALPSANPGTEFGLKLILDIGQEDYVPFLASTGGVRLMLHEQRSYPFIRDEGI YAMSGTETSIGVLVDKLQRMGEPYSPCTVNGSEVPVQNFYSDYNTTYSIQACLRSCFQ DHMIRNCNCGHYLYPLPRGEKYCNNRDFPDWAHCYSDLQMSVAQRETCIGMCKESCND TQYKMTISMADWPSEASEDWIFHVLSQERDQSTNITLSRKGIVKLNIYFQEFNYRTIE ESAANNIVWLLSNLGGQFGFWMGGSVLCLIEFGEIIIDFVWITIIKLVALAKSLRQRR AQASYAGPPPTVAELVEAHTNFGFQPDTAPRSPNTGPYPSEQALPIPGTPPPNYDSLR LQPLDVIESDSEGDAI" misc_difference 1921..2384 /note="Region absent in other clone from same library; no loss of function." BASE COUNT 582 a 808 c 663 g 511 t ORIGIN 1 tcgccgggtg tcccagtgtc accaacactc ggccgccgcc gccagcttgg cgcgcaccgc 61 cgcctccgcc accgccgaca gcgcgcatcc tccgtgtccc cgctccgccg cccgagcagg 121 tgccactatg cacgtgaaga agtacctgct gaagggcctg catcggctgc agaagggccc 181 cggctacacg tacaaggagc tgctggtgtg gtactgcgac aacaccaaca cccacggccc 241 caagcgcatc atctgtgagg ggcccaagaa gaaagccatg tggttcctgc tcaccctgct 301 cttcgccgcc ctcgtctgct ggcagtgggg catcttcatc aggacctact tgagctggga 361 ggtcagcgtc tccctctccg taggcttcaa gaccatggac ttccccgccg tcaccatctg 421 caatgctagc cccttcaagt attccaaaat caagcatttg ctgaaggacc tggatgagct 481 gatggaagct gtcctggaga gaatcctggc tcctgagcta agccatgcca atgccaccag 541 gaacctgaac ttctccatct ggaaccacac acccctggtc cttattgatg aacggaaccc 601 ccaccacccc atggtccttg atctctttgg agacaaccac aatggcttaa caagcagctc 661 agcatcagaa aagatctgta atgcccacgg gtgcaaaatg gccatgagac tatgtagcct 721 caacaggacc cagtgtacct tccggaactt caccagtgct acccaggcat tgacagagtg 781 gtacatcctg caggccacca acatctttgc acaggtgcca cagcaggagc tagtagagat 841 gagctacccc ggcgagcaga tgatcctggc ctgcctattc ggagctgagc cctgcaacta 901 ccggaacttc acgtccatct tctaccctca ctatggcaac tgttacatct tcaactgggg 961 catgacagag aaggcacttc cttcggccaa ccctggaact gaattcggcc tgaagttgat 1021 cctggacata ggccaggaag actacgtccc cttccttgcg tccacgggcg gggtcaggct 1081 gatgcttcac gagcagaggt catacccctt catcagagat gagggcatct acgccatgtc 1141 ggggacagag acgtccatcg gggtactcgt ggataagctt cagcgcatgg gggagcccta 1201 cagcccgtgc accgtgaatg gttctgaggt ccccgtccaa aacttctaca gtgactacaa 1261 cacgacctac tccatccagg cctgtcttcg ctcctgcttc caagaccaca tgatccgtaa 1321 ctgcaactgt ggccactacc tgtacccact gccccgtggg gagaaatact gcaacaaccg 1381 ggacttccca gactgggccc attgctactc agatctacag atgagcgtgg cgcagagaga 1441 gacctgcatt ggcatgtgca aggagtcctg caatgacacc cagtacaaga tgaccatctc 1501 catggctgac tggccttctg aggcctccga ggactggatt ttccacgtct tgtctcagga 1561 gcgggaccaa agcaccaata tcaccctgag caggaaggga attgtcaagc tcaacatcta 1621 cttccaagaa tttaactatc gcaccattga agaatcagca gccaataaca tcgtctggct 1681 gctctcgaat ctgggtggcc agtttggctt ctggatgggg ggctctgtgc tgtgcctcat 1741 cgagtttggg gagatcatca tcgactttgt gtggatcacc atcatcaagc tggtggcctt 1801 ggccaagagc ctacggcagc ggcgagccca agccagctac gctggcccac cgcccaccgt 1861 ggccgagctg gtggaggccc acaccaactt tggcttccag cctgacacgg ccccccgcag 1921 ccccaacact gggccctacc ccagtgagca ggccctgccc atcccaggca ccccgccccc 1981 caactatgac tccctgcgtc tgcagccgct ggacgtcatc gagtctgaca gtgagggtga 2041 tgccatctaa ccctgcccct gtccaccccg ggtgggtgaa actcactgag cagccaagac 2101 tgttgcccga ggactcactg tatggtgccc tctccaaagg gtcgggaggg tagctctcca 2161 ggccagagct tgtgtccttc aacagagagg ccagcggcaa ctggtccgtt actggccaag 2221 ggctctgaag aatcaacggt gctggtacag gatacaggaa taaattgtat cttcacctgg 2281 ttcctaccct cgtccctacc tgtcctgatc ctggtcctga agacccctcg gaacaccctc 2341 tcctggtggc aggccacttc cctcccagtg ccagtctcca tccaccccag agaggaacag 2401 gcgggtgggc catgtggttt tctccttcct ggccttggct ggcctctggg gcaggggtgg 2461 tggagagatg gaagggcatc aggtgtaggg accctgccaa gtggcacctg atttactcta 2521 gaaaataaaa gtagaaaata ctgagaaaaa aaaaaaaaaa aaaa // LOCUS HSSCNN1G 3384 bp RNA PRI 03-OCT-1995 DEFINITION H.sapiens mRNA for gamma subunit of epithelial amiloride-sensitive sodium channel. ACCESSION X87160 NID g1004272 KEYWORDS epithelial amiloride-sensitive sodium channel; epithelial sodium channel; gamma subunit; SCNN1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3384) AUTHORS Voilley,N., Bassilana,F., Mignon,C., Merscher,S., Mattei,M.G., Carle,G.F., Lazdunski,M. and Barbry,P. TITLE Cloning, chromosomal localization, and physical linkage of the beta and gamma subunits (SCNN1B and SCNN1G) of the human epithelial amiloride-sensitive sodium channel JOURNAL Genomics 28 (3), 560-565 (1995) MEDLINE 96039270 REFERENCE 2 (bases 1 to 3384) AUTHORS Voilley,N. TITLE Direct Submission JOURNAL Submitted (11-MAY-1995) N. Voilley, CNRS-UPR411, Inst.de Pharmacologie molecularie et cellulaire, 660 route des Lucioles Sophia Antipolis, F- 06560 Valbonne, FRANCE FEATURES Location/Qualifiers source 1..3384 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="lung" /chromosome="16" /map="16p12-13" gene 1..1950 /gene="SCNN1G" CDS 1..1950 /gene="SCNN1G" /codon_start=1 /product="epithelial amiloride-sensitive sodium channel, gamma-subunit" /db_xref="PID:g1004273" /translation="MAPGEKIKAKIKKNLPVTGPQAPTIKELMRWYCLNTNTHGCRRI VVSRGRLRRLLWIGFTLTAVALILWQCALLVFSFYTVSVSIKVHFRKLDFPAVTICNI NPYKYSTVRHLLADLEQETREALKSLYGFPESRKRREAESWNSVSEGKQPRFSHRIPL LIFDQDEKGKARDFFTGRKRKVGGSIIHKASNVMHIESKQVVGFQLCSNDTSDCATYT FSSGINAIQEWYKLHYMNIMAQVPLEKKINMSYSAEELLVTCFFDGVSCDARNFTLFH HPMHGNCYTFNNRENETILSTSMGGSEYGLQVILYINEEEYNPFLVSSTGAKVIIHRQ DEYPSVEDVGTEIETTMVTSIGMHLTESFKLSEPSSQCTEGGSDVPIRNIYNAAYSLQ ICLHSCFQTKMVEKCGCAQYSQPLPPAANYCNYQQHPNWMYCYYQLHRAFVQEELGCQ SVCKEACRFKEWTLTTSLAQWPSVVSEKWLLPVLTWDQGRQVNKKLNKTDLAKLLIFY KDLNQRSIMESPANSIEMLLSNFGGQLGLWMSCSVVCVIEIIEVFFIDFFSIIARRQW QKAKEWWAWKQAPPCPEAPRSPQGQDNPALDIDDDLPTFNSALHLPPALGTQVPGTPP PKYNTLRLERAFSNQLTDTQMLDEL" BASE COUNT 863 a 920 c 847 g 754 t ORIGIN 1 atggcccccg gggagaagat caaagccaaa atcaagaaga atctgcccgt gacgggccct 61 caggcgccga ccattaaaga gctgatgcgg tggtactgcc tcaacaccaa cacccatggc 121 tgtcgccgca tcgtggtgtc ccgcggccgt ctgcgccgcc tcctctggat cgggttcaca 181 ctgactgccg tggccctcat cctctggcag tgcgccctcc tcgtcttctc cttctatact 241 gtctcagttt ccatcaaagt ccacttccgg aagctggatt ttcctgcagt caccatctgc 301 aacatcaacc cctacaagta cagcaccgtt cgccaccttc tagctgactt ggaacaggag 361 accagagagg ccctgaagtc cctgtatggc tttccagagt cccggaagcg ccgagaggcg 421 gagtcctgga actccgtctc agagggaaag cagcctagat tctcccaccg gattccgctg 481 ctgatctttg atcaggatga gaagggcaag gccagggact tcttcacagg gaggaagcgg 541 aaagtcggcg gtagcatcat tcacaaggct tcaaatgtca tgcacatcga gtccaagcaa 601 gtggtgggat tccaactgtg ctcaaatgac acctccgact gtgccaccta caccttcagc 661 tcgggaatca atgccattca ggagtggtat aagctacact acatgaacat catggcacag 721 gtgcctctgg agaagaaaat caacatgagc tattctgctg aggagctgct ggtgacctgc 781 ttctttgatg gagtgtcctg tgatgccagg aatttcacgc ttttccacca cccgatgcat 841 gggaattgct atactttcaa caacagagaa aatgagacca ttctcagcac ctccatgggg 901 ggcagcgaat atgggctgca agtcattttg tacataaacg aagaggaata caacccattc 961 ctcgtgtcct ccactggagc taaggtgatc atccatcggc aggatgagta tccctccgtc 1021 gaagatgtgg gaacagagat tgagacaaca atggtcacct ctataggaat gcacctgaca 1081 gagtccttca agctgagtga gccctccagt cagtgcacgg agggcgggag tgacgtgcca 1141 atcaggaaca tctacaacgc tgcctactcg ctccagatct gccttcattc atgcttccag 1201 acaaagatgg tggagaaatg tgggtgtgcc cagtacagcc agcctctacc tcctgcagcc 1261 aactactgca actaccagca gcaccccaac tggatgtatt gttactacca actgcatcga 1321 gcctttgtcc aggaagagct gggctgccag tctgtgtgca aggaagcctg ccgctttaaa 1381 gagtggacac taaccacaag cctggcacaa tggccatctg tggtttcgga gaagtggttg 1441 ctgcctgttc tcacttggga ccaaggccgg caagtaaaca aaaagctcaa caagacagac 1501 ttggccaaac tcttgatatt ctacaaagac ctgaaccaga gatccatcat ggagagccca 1561 gccaacagta ttgagatgct tctgtccaac ttcggcggtc agctgggcct gtggatgagc 1621 tgctctgttg tctgcgtcat cgagatcatc gaggtcttct tcattgactt cttctctatc 1681 attgcccgcc gccagtggca gaaagccaag gagtggtggg cctggaaaca ggctccccca 1741 tgtccagaag ctccccgtag cccacagggc caggacaatc cagccctgga tatagacgat 1801 gacctaccca ctttcaactc tgctttgcac ctgcctccag ccctaggaac ccaagtgccc 1861 ggcacaccgc cccccaaata caataccttg cgcttggaga gggccttttc caaccagctc 1921 acagataccc agatgctgga tgagctctga ggcagggttg agaagacaga tctagtcagg 1981 accaccagcc atggtctaag gacatggatc gggtgccccc agacgtgtgc acaggggacc 2041 ctctgcccca ctctgggctt ttcagatact ctgaccaaaa agcctgcttt aaaccgcaag 2101 atggggcctg ggcatgcgca ggaggagcca tcgggtacta cgcagcaaca ctcacaactg 2161 tccaggctga gataaatccc gggacctgaa ctattagcac gtcactagag actgggagcc 2221 gaggcagtgg tgctggccca agtgaaggcc agagtgagga ctgatgcagc tctttacggg 2281 tcttgagagg gaaggactct tccaaagccc caaagccgag ggtttcaccc acactgccag 2341 cctgggttgg ggcccaagga tgtgaccttg agtgtcaagg ctggacagct actgccagat 2401 gccaaagata ggagaaagtg ccagccctga agctggagcc gcttgtgaat aaactgttct 2461 tcatcattga cactggagaa aggtgtcctc catgccctca ggcagcagag aactggccca 2521 gagcccttgg agtgttggtg gagatcagag tgccgtggtg gaggtctggg actatgtcag 2581 agtgtcctca ctttggggca tgggtggctc caggagatgg atttagttat tcaattttgt 2641 ggatgaataa attgaggcac agaaagatta agttaccggc ccaaggtgac acagtgagga 2701 ggtggcagag ctaggatttg accccagaca atctgacttc atgattgtgg catccaattc 2761 gtgtctgtgc ctcaatttgt gcaaacgact gtgtcacttt gcttggggag ggggagcatc 2821 ccacccacat gccctgggca ggtgacccaa gacagagctg acccctccac caccaaaggc 2881 ctgtctctca ccaacaagcc aactgctcac agatgaccca cttcatacca cattcacatc 2941 tggccacctg actccagcta tcaggtatta agtcccgggc agcgataagc ccccccacaa 3001 cagtgtgtta gctctgtctt agcaacctga taggttagca taggggctgc aaatgttgtt 3061 ccgccacacc aaatccctgt ttttgtccat gaactaagaa ttttatttta ttttattttt 3121 tattttttgt agaaatagga tctcactatg ttgcccaggc tgttcttgaa ctcctggcct 3181 caaatgatct tcccacttca gcctcccaaa gtgcagggat tacaggcaca agccaccgtg 3241 cccagccaag aatttttatt tttgctttta aatacccaaa gaagaagatt ctgtgactca 3301 tggaaatgat atgagattcg aattccagtg tctataaata aagttttatt ggtacacagc 3361 cccaaaaaaa aaaaaaaaaa aaaa // LOCUS HSSDS22MR 1299 bp RNA PRI 27-NOV-1995 DEFINITION H.sapiens sds22-like mRNA. ACCESSION Z50749 NID g1085027 KEYWORDS sds22 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1299) AUTHORS Renouf,S., Beullens,M., Wera,S., Van Eynde,A., Sikela,J., Stalmans,W. and Bollen,M. TITLE Molecular cloning of a human polypeptide related to yeast sds22, a regulator of protein phosphatase-1 JOURNAL FEBS Lett. 375 (1-2), 75-78 (1995) MEDLINE 96087087 REFERENCE 2 (bases 1 to 1299) AUTHORS Renouf,S. TITLE Direct Submission JOURNAL Submitted (11-AUG-1995) Renouf S., K.U.Leuven, Afdeling Biochemie, Campus Gasthuisberg, Herestraat 49, B-3000 Leuven, Belgium FEATURES Location/Qualifiers source 1..1299 /organism="Homo sapiens" /db_xref="taxon:9606" 5'UTR 1..15 gene 16..1098 /gene="sds22" CDS 16..1098 /gene="sds22" /function="regulatory polypeptide of protein phosphatase-1" /note="data were obtained by sequencing cDNA clone IB3548 (accession number T16129), except the first 47 nucleotides at the 5'end, which were obtained by 5'RACE." /citation=[1] /codon_start=1 /product="yeast sds22 homolog" /db_xref="PID:e194945" /db_xref="PID:g1085028" /translation="MAAERGAGQQQSQEMMEVDRRVESEESGDEEGKKHSSGIVADLS EQSLKDGEERGEEDPEEEHELPVDMETINLDRDAEDVDLNHYRIGKIEGFEVLKKVKT LCLRQNLIKCIENLEELQSLRELDLYDNQIKKIENLEALTELEILDISFNLLRNIEGV DKLTRLKKLFLVNNKISKIENLSNLHQLQMLELGSNRIRAIENIDTLTNLESLFLGKN KITKLQNLDALTNLTVLSMQSNRLTKIEGLQNLVNLRELYLSHNGIEVIEGLENNNKL TMLDIASNRIKKIENISHLTELQEFWMNDNLLESWSDLDELKGARSLETVYLERNPLQ KDPQYRRKVMLALPSVRQIDATFVRF" 3'UTR 1099..1299 polyA_signal 1216..1221 polyA_signal 1279..1284 BASE COUNT 387 a 304 c 348 g 260 t ORIGIN 1 gaattggcag ccaacatggc ggcggaacgc ggcgcggggc agcaacagtc gcaggagatg 61 atggaggttg acaggcgggt cgagtctgaa gaatccggcg atgaagaagg gaagaaacac 121 agcagtggca tcgtggccga cctcagtgaa cagagcctga aggatgggga ggagcggggg 181 gaggaggacc cagaagaaga acatgagctg cctgtggaca tggaaaccat caacctggac 241 agagatgcag aggatgttga tttgaatcac tatcgcatag ggaagattga aggatttgag 301 gtactgaaga aagtgaagac tctctgcctc cgccaaaatt taattaaatg cattgagaat 361 ctggaggagc tacagagtct tcgagagctg gatctttacg acaaccagat caagaagatt 421 gagaatctgg aggcgctaac agagctggag attctagata tttcttttaa tctgctgaga 481 aacatcgaag gggttgacaa gttgacacga ctgaaaaaac tcttcttggt caacaataaa 541 atcagtaaaa ttgagaactt aagcaactta catcaactac agatgctaga gctgggatct 601 aaccgcatcc gggcaatcga aaatatcgac accttaacca acctggagag tttgtttttg 661 gggaaaaaca aaattactaa acttcagaac ctggatgcgc tcaccaacct gacagtcctc 721 agtatgcaga gcaaccggct gaccaagatc gagggtctgc agaacctggt gaacctgcgg 781 gagctgtacc ttagccacaa tggcatcgag gtcatcgagg gcctggagaa caataacaaa 841 ctcacgatgt tggacattgc atcaaataga atcaaaaaga ttgaaaatat cagccatcta 901 acagagctgc aagagttctg gatgaacgac aatctccttg agagctggag cgacctcgac 961 gagctgaagg gagccaggag cctggagaca gtgtacctgg agcggaaccc cttgcagaag 1021 gacccccagt accggcggaa ggtcatgctc gccctcccct ccgtgcggca gatcgatgcc 1081 acgttcgtca ggttctgagt ccttcttggc tcctcatgtg gtccctctcc tcggaagaac 1141 tgcccagcca cgggttttta acccacctgt tgctcctgag gtcgtcacta tatcaacagt 1201 cacaaaccca atggcaataa aggcactgac gatagctggc gcgcgcgacg ccacacacca 1261 ttttcagatg ccgttgcaat taaatcttgc cacactgtc // LOCUS HSSEC231 2748 bp RNA PRI 03-MAY-1996 DEFINITION H.sapiens mRNA for Sec23A isoform, 2748bp. ACCESSION X97064 NID g1296663 KEYWORDS COPII component; SEC23 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2748) AUTHORS Paccaud,J.P., Reith,W., Carpentier,J.L., Ravazzola,M., Amherdt,M., Schekman,R. and Orci,L. TITLE Cloning and functional characterization of mammalian homologues of the COPII component Sec23 JOURNAL Unpublished REFERENCE 2 (bases 1 to 2748) AUTHORS Paccaud,J. TITLE Direct Submission JOURNAL Submitted (03-APR-1996) J. Paccaud, University of Geneva School of Medicine, Department of Morphology, University Medical Center, 1, rue Michel-Servet, 1211 Geneva 4, Switzerland FEATURES Location/Qualifiers source 1..2748 /organism="Homo sapiens" /db_xref="taxon:9606" gene 160..2457 /gene="sec23" CDS 160..2457 /gene="sec23" /note="COPII component; isoform A" /codon_start=1 /product="Sec23 protein" /db_xref="PID:e236013" /db_xref="PID:g1296664" /translation="MTTYLEFIQQNEERDGVRFSWNVWPSSRLEATRMVVPVAALFTP LKERPDLPPIQYEPVLCSRTTCRAVLNPLCQVDYRAKLWACNFCYQRNQFPPSYAGIS ELNQPAELLPQFSSIEYVVLRGPQMPLIFLYVVDTCMEDEDLQALKESMQMSLSLLPP TALVGLITFGRMVQVHELGCEGISKSYVFRGTKDLSAKQLQEMLGLSKVPVTQATRGP QVQQPPPSNRFLQPVQKIDMNLTDLLGELQRDPWPVPQGKRPLRSSGVALSIAVGLLE CTFPNTGARIMMFIGGPATQGPGMVVGDELKTPIRSWHDIDKDNAKYVKKGTKHFEAL ANRAATTGHVIDIYACALDQTGLLEMKCCPNLTGGYMVMGDSFNTSLFKQTFQRVFTK DMHGQFKMGFGGTLEIKTSREIKISGAIGPCVSLNSKGPCVSENEIGTGGTCQWKICG LSPTTTLAIYFEVVNQHNAPIPQGGRGAIQFVTQYQHSSGQRRIRVTTIARNWADAQT QIQNIAASFDQEAAAILMARLAIYRAETEEGPDVLRWLDRQLIRLCQKFGEYHKDDPS SFRFSETFSLYPQFMFHLRRSSFLQVFNNSPDESSYYRHHFMRQDLTQSLIMIQPILY AYSFSGPPEPVLLDSSSILADRILLMDTFFQILIYHGETIAQWRKSGYQDMPEYENFR HLLQAPVDDAQEILHSRFPMPRYIDTEHGGSQARFLLSKVNPSQTHNNMYAWGQESGA PILTDDVSLQVFMDHLKKLAVSSAA" BASE COUNT 779 a 567 c 607 g 795 t ORIGIN 1 cgccacccct gattgcggtg ccacggactg ctcctgctgg gcggagagga cagattttgc 61 aaagcggagg ctgcgacggg tcctgcaggg ggacagtgag gaaagggccg cctcgtctcc 121 gctcctgggg gaccgcagaa ataagaatca aactccacaa tgacaaccta tttggaattc 181 attcaacaaa atgaagaacg agatggagtc cgatttagtt ggaatgtttg gccatcaagt 241 cgactggaag ctacaagaat ggttgttcct gtggcagccc tgtttacacc actgaaagag 301 agacctgact taccacctat tcaatatgaa cctgttctgt gtagtaggac cacttgccgt 361 gcagttttga atcctttatg tcaagtggat tatcgagcaa aactttgggc ttgcaacttt 421 tgttaccaaa ggaatcagtt tccacctagt tatgctggta tatctgaact gaatcagcct 481 gctgaacttt tacctcagtt ttctagcatt gaatatgtag ttctgcgtgg tcctcagatg 541 cctttgatat tcctctatgt ggttgatact tgcatggaag atgaagattt acaagccctg 601 aaagaatcca tgcagatgtc attaagtctt ttaccaccta cagctttggt tggacttatt 661 acttttggga gaatggttca ggttcatgaa cttggatgtg aaggcatttc aaaaagctat 721 gtcttcagag gaacaaaaga tttgtctgcc aaacaactgc aggaaatgct ggggctctct 781 aaagtaccag ttactcaagc aacacgtggt cctcaggtac agcagccacc tccttccaac 841 agattcttac aaccagtaca gaaaatagac atgaatctca cagatcttct gggagaactc 901 cagcgagacc cttggcctgt accacaggga aagagacctt tgcgttcctc tggggtggca 961 ctttccatag ctgtaggact gctggagtgt acttttccca acactggtgc tcgtatcatg 1021 atgttcattg gtggtcctgc tactcagggg cctggaatgg tggttggaga tgagttgaag 1081 acacctataa gatcgtggca tgacattgac aaagacaatg ccaaatatgt taaaaaggga 1141 actaagcatt ttgaagcatt ggctaatcga gctgctacaa ctggccatgt tattgatatc 1201 tatgcgtgtg cattagatca gacaggtctc ctggagatga aatgctgtcc caaccttact 1261 ggaggataca tggtaatggg tgattctttc aatacttcct tattcaaaca aacttttcaa 1321 agagtcttta ccaaagacat gcatggacag tttaaaatgg gctttggtgg tacgctagaa 1381 ataaagacct caagggaaat aaagatttca ggagctattg gaccctgtgt gtcactcaat 1441 tctaaaggac cctgtgtgtc tgaaaatgag ataggaacag gtggcacatg tcagtggaag 1501 atatgtggac ttagtcccac tacaacctta gccatatatt ttgaggttgt caatcagcat 1561 aatgctccaa ttcctcaagg agggcgtggt gcaatccagt ttgtgactca gtatcagcat 1621 tcaagtgggc agagacgcat ccgagtgacc accattgcta ggaactgggc agatgctcaa 1681 actcaaatcc aaaacattgc tgcatctttt gaccaggagg cagctgccat tcttatggcc 1741 cggctagcaa tatatagagc agaaacagaa gaaggtccag atgtgcttag gtggctggac 1801 agacagctca ttcgactgtg tcagaaattt ggagaatatc ataaagatga cccaagttcc 1861 ttcagatttt cagaaacttt ctccctttat ccacagttta tgtttcattt aagaagatct 1921 tctttcctgc aagtttttaa caatagtcct gatgagagtt catattatcg tcaccatttt 1981 atgcgtcaag atctgaccca gtctctaatt atgattcagc ctatcctgta tgcgtattct 2041 tttagtggac caccagagcc ggttcttctt gatagcagta gcattcttgc agatcgtatt 2101 cttctcatgg acacattctt ccagattttg atttatcatg gtgagaccat agcacagtgg 2161 cggaagtcag gataccagga tatgcctgag tatgaaaatt tccgccacct tctgcaagcc 2221 ccagtggatg atgcacagga aattcttcac tccagatttc caatgccaag atacattgac 2281 actgaacatg gaggcagcca ggcccgtttc ctcctttcaa aagtcaaccc ttcacagact 2341 cataataata tgtatgcctg ggggcaggag tctggagcac ctattcttac agatgatgtt 2401 agtttacaag tgtttatgga tcacttgaag aaacttgctg tgtccagtgc tgcttgaagt 2461 gctaataatg ttaaagacac cttaagaaga tgaaataata ttccaaattt cattttttcc 2521 tttttccatt tatctgtgga aaccaacaga tattgctcta tattttttgt attagtatgg 2581 tttgagacaa catatggaaa atgttcacat ttgtagatta agctggaatt ataatgagag 2641 caataagaac aaatttattt tgcttaccac agtgttatag ctggttctag aaatttgaag 2701 tctttataac ttaattatgt tttaataaaa aatagagtct gcctcgta // LOCUS HSSEC232 2450 bp RNA PRI 03-MAY-1996 DEFINITION H.sapiens mRNA for Sec23B isoform, 2450bp. ACCESSION X97065 NID g1296665 KEYWORDS COPII component; SEC23 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2450) AUTHORS Paccaud,J.P., Reith,W., Carpentier,J.L., Ravazzola,M., Amherdt,M., Schekman,R. and Orci,L. TITLE Cloning and functional characterization of mammalian homologues of the COPII component Sec23 JOURNAL Unpublished REFERENCE 2 (bases 1 to 2450) AUTHORS Paccaud,J. TITLE Direct Submission JOURNAL Submitted (03-APR-1996) J. Paccaud, University of Geneva School of Medicine, Department of Morphology, University Medical Center, 1, rue Michel-Servet, 1211 Geneva 4, Switzerland FEATURES Location/Qualifiers source 1..2450 /organism="Homo sapiens" /db_xref="taxon:9606" gene 106..2409 /gene="sec23" CDS 106..2409 /gene="sec23" /note="COPII component; isoform B" /codon_start=1 /product="Sec23 protein" /db_xref="PID:e236014" /db_xref="PID:g1296666" /translation="MATYLEFIQQNEERDGVRFSWNVWPSSRLEATRMVVPLACLLTP LKERPDLPPVQYEPVLCSRPTCKAVLNPLCQVDYRAKLWACNFCFQRNQFPPAYGGIS EVNQPAELMPQFSTIEYVIQRGAQSPLIFLYVVDTCLEEDDLQALKESLQMSLSLLPP DALVGLITFGRMVQVHELSCEGISKSYVFRGTKDLTAKQIQDMLGLTKPAMPMQQARP AQPQEHPFASSRFLQPVHKIDMNLTDLLGELQRDPWPVTQGKRPLRSTGVALSIAVGL LEGTFPNTGARIMLFTGGPPTQGPGMVVGDELKIPIRSWHDIEKDNARFMKKATKHYE MLANRTAANGHCIDIYACALDQTGLLEMKCCANLTGGYMVMGDSFNTSLFKQTFQRIF TKDFNGDFRMAFGATLDVKTSRELKIAGAIGPCVSLNVKGPCVSENELGVGGTSQWKI CGLDPTSTLGIYFEVVNQHNTPIPQGGRGAIQFVTHYQQSSTQRRIRVTTIARNWADV QSQLRHIEAAFDQEAAAVLMARLGVFRAESEEGPDVLRWLDRQLIRLCQKFGQYNKED PTSFRLSDSFSLYPQFMFHLRRSPFLQVFNNSPDESSYYRHHFARQDLTQSLIMIQPI LYSYSFHGPPEPVLLDSSSILADRILLMDTFFQIVIYLGETIAQWRKAGYQDMPEYEN FKHLLQAPLDDAQEILQARFPMPRYINTEHGGSQARFLLSKVNPSQTHNNLYAWGQET GAPILTDDVSLQVFMDHLKKLAVSSAC" BASE COUNT 616 a 584 c 607 g 643 t ORIGIN 1 cgctggccaa tcggttgaga gctgagctgg acttggcggt gggagccgga gcctgcttgt 61 tgcagctgtg ggtgaggacg gctctagcta gttccctttt agactatggc gacatacctg 121 gagttcatcc agcagaatga agaacgggat ggtgtgcgtt ttagttggaa cgtgtggcct 181 tccagccggc tggaggctac aagaatggtt gtacccctgg cttgtctcct tactcctttg 241 aaagaacgtc cagacctacc tcctgtacaa tatgaacctg tgctttgcag caggccaact 301 tgtaaagctg ttctcaaccc actttgtcag gttgattatc gagcaaaact ttgggcctgt 361 aatttctgtt ttcaaagaaa tcagtttcct ccagcttatg gaggcatatc tgaggtgaat 421 caacctgccg aattgatgcc ccagttttct acaattgagt acgtgataca gcgaggtgct 481 cagtcccctc tgatctttct ctatgtggtt gacacatgcc tggaggaaga tgaccttcaa 541 gcactcaaag agtccctgca gatgtccctg agtcttcttc ctccagatgc tctggtgggt 601 ctgatcacat ttggaaggat ggtgcaggtt catgagctaa gctgtgaagg aatctccaaa 661 agttatgtct tccgagggac caaggattta actgcaaagc aaatacagga tatgttgggc 721 ctgaccaagc cagccatgcc catgcagcaa gcacgacctg cacaaccaca ggagcaccct 781 tttgcttcaa gcagatttct gcagcctgtt cacaagattg atatgaacct cactgatctt 841 cttggggagc tacagaggga cccatggcca gtaactcagg ggaagagacc tttgcgatcc 901 actggtgtgg ctttgtccat tgctgttggc ttgctggagg gcacttttcc aaacacagga 961 gccaggatca tgctgtttac tggaggtccc cctacccaag ggcctggcat ggtggttgga 1021 gatgaattaa agattcctat tcgttcttgg catgatattg agaaagataa tgcacgattc 1081 atgaaaaagg caaccaagca ctatgagatg cttgctaatc gaacagctgc aaatggtcac 1141 tgcattgata tttatgcttg tgcccttgat caaactggac ttttggagat gaagtgttgt 1201 gcaaatctta ctggaggcta catggtaatg ggagattctt tcaacacttc tctcttcaag 1261 cagacattcc aaagaatctt tactaaagat tttaatggag atttccgaat ggcatttggt 1321 gctactttgg acgtaaagac ctctcgggaa ctgaagattg caggagccat tggtccatgc 1381 gtatctctga atgtgaaagg accgtgtgtg tcagaaaatg agcttggtgt tggtggcacg 1441 agtcagtgga aaatctgtgg cctagatcct acatctacac ttggcatcta ttttgaagtt 1501 gtcaatcagc acaacacccc gatcccccaa ggaggcagag gagccatcca gtttgtcacg 1561 cattatcagc agtccagcac ccagagacgc atccgcgtga ccaccatcgc ccgaaattgg 1621 gcagatgtac agagtcagct caggcacata gaagcagcat ttgaccagga ggctgcggca 1681 gtgttgatgg cacggcttgg ggtgttccga gcggagtcag aggaggggcc cgatgtgctc 1741 cggtggctgg accgacaact catccgactg tgtcaaaagt ttggacagta taacaaagaa 1801 gaccccactt cttttaggtt atcagattcc ttttctctat atcctcagtt tatgttccat 1861 ctgagaagat ctccatttct tcaagtgttt aacaacagtc ctgatgagtc gtcatattac 1921 agacatcatt ttgcccggca ggacctgacc cagtccctca tcatgatcca gcccattctc 1981 tactcttact cctttcatgg gccaccagag ccagtactct tggatagcag cagcattcta 2041 gctgacagaa ttttgctgat ggatactttc tttcaaattg tcatttatct tggtgagacc 2101 atagcccagt ggcgtaaagc tggctaccag gacatgcccg agtatgaaaa cttcaagcac 2161 cttctgcagg caccactgga tgatgctcaa gaaattctgc aagcacgctt cccgatgcca 2221 cgttacatca acacggagca tggaggcagt caggctcgat tccttttgtc caaagtgaac 2281 ccatctcaga cacacaataa cctgtatgct tggggacagg aaactggagc acccatccta 2341 actgatgatg ttagcctgca ggtgttcatg gaccatttga agaagctggc tgtctccagt 2401 gcctgttaag ctgaggatac aaccaggaaa tgcaacggtg tcagattgtg // LOCUS HSSELPLG2 3409 bp DNA PRI 23-AUG-1995 DEFINITION Human P-selectin glycoprotein ligand (SELPLG) gene, exon 2, and complete cds. ACCESSION U25956 NID g902795 KEYWORDS . SEGMENT 2 of 2 SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3409) AUTHORS Veldman,G.M., Bean,K.M., Cumming,D.A., Eddy,R.L., Sait,N.J.S. and Shows,T.B. TITLE Genomic organization and chromosomal localization of the gene encoding human P-selectin glycoprotein ligand JOURNAL J. Biol. Chem. 270 (27), 16470-16475 (1995) MEDLINE 95332364 REFERENCE 2 (bases 1 to 3409) AUTHORS Veldman,G.M., Bean,K.M., Cumming,D.A., Eddy,R.L., Sait,N.J.S. and Shows,T.B. TITLE Direct Submission JOURNAL Submitted (28-APR-1995) David Merberg, Research Computing, Genetics Institute, 87 CambridgePark Drive, Cambridge, MA 02140, USA FEATURES Location/Qualifiers source 1..3409 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /map="12q24" /chromosome="12" mRNA join(U25955:444..497,239..2255) /gene="SELPLG" gene join(U25955:444..703,1..2255) /gene="SELPLG" exon 239..2255 /gene="SELPLG" /number=2 CDS 244..1482 /gene="SELPLG" /codon_start=1 /product="P-selectin glycoprotein ligand" /db_xref="PID:g902797" /translation="MPLQLLLLLILLGPGNSLQLWDTWADEAEKALGPLLARDRRQAT EYEYLDYDFLPETEPPEMLRNSTDTTPLTGPGTPESTTVEPAARRSTGLDAGGAVTEL TTELANMGNLSTDSAAMEIQTTQPAATEAQTTQPVPTEAQTTPLAATEAQTTRLTATE AQTTPLAATEAQTTPPAATEAQTTQPTGLEAQTTAPAAMEAQTTAPAAMEAQTTPPAA MEAQTTQTTAMEAQTTAPEATEAQTTQPTATEAQTTPLAAMEALSTEPSATEALSMEP TTKRGLFIPFSVSSVTHKGIPMAASNLSVNYPVGAPDHISVKQCLLAILILALVATIF FVCTVVLAVRLSRKGHMYPVRNYSPTEMVCISSLLPDGGEGPSATANGGLSKAKSPGL TPEPREDREGDDLTLHSFLP" BASE COUNT 793 a 993 c 925 g 698 t ORIGIN 1 tagaagaagt taaagggccc tcctggatgg ctttattcat gttgatgagt aataataata 61 actgctactg gctgaggatc ttctccatcc caggcatgtc agggatgcct aagtccccag 121 tccctgctcc agaccagaca tcttccagct gtggcagtag agggtggtgg tctagggtgc 181 ttgctaagcc caagggtgaa actgtcttga catccctccg cccattgtct cctcctaggt 241 gccatgcctc tgcaactcct cctgttgctg atcctactgg gccctggcaa cagcttgcag 301 ctgtgggaca cctgggcaga tgaagccgag aaagccttgg gtcccctgct tgcccgggac 361 cggagacagg ccaccgaata tgagtaccta gattatgatt tcctgccaga aacggagcct 421 ccagaaatgc tgaggaacag cactgacacc actcctctga ctgggcctgg aacccctgag 481 tctaccactg tggagcctgc tgcaaggcgt tctactggcc tggatgcagg aggggcagtc 541 acagagctga ccacggagct ggccaacatg gggaacctgt ccacggattc agcagctatg 601 gagatacaga ccactcaacc agcagccacg gaggcacaga ccactcaacc agtgcccacg 661 gaggcacaga ccactccact ggcagccaca gaggcacaga caactcgact gacggccacg 721 gaggcacaga ccactccact ggcagccaca gaggcacaga ccactccacc agcagccacg 781 gaagcacaga ccactcaacc cacaggcctg gaggcacaga ccactgcacc agcagccatg 841 gaggcacaga ccactgcacc agcagccatg gaagcacaga ccactccacc agcagccatg 901 gaggcacaga ccactcaaac cacagccatg gaggcacaga ccactgcacc agaagccacg 961 gaggcacaga ccactcaacc cacagccacg gaggcacaga ccactccact ggcagccatg 1021 gaggccctgt ccacagaacc cagtgccaca gaggccctgt ccatggaacc tactaccaaa 1081 agaggtctgt tcataccctt ttctgtgtcc tctgttactc acaagggcat tcccatggca 1141 gccagcaatt tgtccgtcaa ctacccagtg ggggccccag accacatctc tgtgaagcag 1201 tgcctgctgg ccatcctaat cttggcgctg gtggccacta tcttcttcgt gtgcactgtg 1261 gtgctggcgg tccgcctctc ccgcaagggc cacatgtacc ccgtgcgtaa ttactccccc 1321 accgagatgg tctgcatctc atccctgttg cctgatgggg gtgaggggcc ctctgccaca 1381 gccaatgggg gcctgtccaa ggccaagagc ccgggcctga cgccagagcc cagggaggac 1441 cgtgaggggg atgacctcac cctgcacagc ttcctccctt agctcactct gccatctgtt 1501 ttggcaagac cccacctcca cgggctctcc tgggccaccc ctgagtgccc agaccccaat 1561 ccacagctct gggcttcctc ggagacccct ggggatgggg atcttcaggg aaggaactct 1621 ggccacccaa acaggacaag agcagcctgg ggccaagcag acgggcaagt ggagccacct 1681 ctttcctccc tccgcggatg aagcccagcc acatttcagc cgaggtccaa ggcaggaggc 1741 catttacttg agacagattc tctccttttt cctgtccccc atcttctctg ggtccctcta 1801 acatctccca tggctctccc cgcttctcct ggtcactgga gtctcctccc catgtaccca 1861 aggaagatgg agctccccca tcccacacgc actgcactgc cattgtcttt tggttgccat 1921 ggtcaccaaa caggaagtgg acattctaag ggaggagtac tgaagagtga cggacttctg 1981 aggctgtttc ctgctgctcc tctgacttgg ggcagcttgg gtcttcttgg gcacctctct 2041 gggaaaaccc agggtgaggt tcagcctgtg agggctggga tgggtttcgt gggcccaaag 2101 ggcagacctt tctttgggac tgtgtggacc aaggagcttc catctagtga caagtgaccc 2161 ccagctatcg cctcttgcct tcccctgtgg ccactttcca gggtggactc tgtcttgttc 2221 actgcagtat cccaactgca ggtccagtgc aggcaataaa tatgtgatgg acaaaacgat 2281 agcggaatcc ttcaaggttt caaggctgtc tccttcaggc agccttcccg gaattctcca 2341 tccctcagtg caggatgggg gctggtcctc agctgtctgc cctcagcccc tggcccccca 2401 ggaagcctct ttcatgggct gttaggttga cttcagtttt gcctcttgga caacaggggg 2461 tcttgtacat ccttgggtga ccaggaaaag ttcaggctat ggggggccaa agggagggct 2521 gccccttccc caccagtgac cactttattc cacttcctcc attacccagt tttggcccac 2581 agagtttggt cccccccaaa cctcggacca atatccctct aaacatcaat ctatcctcct 2641 gttaaagaaa aaaaaaaatg ggactgggag cagtggctca tgcctgtaat cccagcactt 2701 tgggaggccg aggcaggtac atcacctgag gtcaggagtt caagactagc ctggccaaca 2761 tagtgaaacc ctgtctctac taaaaataca aagattagtc aggtgtggtg gcacatgcct 2821 gtagtcccag ctactgggga ggctgaggca ggagaattgc ttgaacccgg gaagcggagg 2881 gaggttgcag tgagctgaga tcacgctact gcactccagc ctgggtgaca gagtaagact 2941 ccgtctcaaa aaaaaaaaaa aagattcaat gacccttgtt aaagcatggt aaggaagact 3001 ttgttcaagg ggagtgggac tctctcaatc actgcaggga ctgcagctat gggattttgc 3061 agtgggggca tttgggctca actatgagta cagcaggggc aagtgggagc tgatagccag 3121 ggaacagggt tggatatctg cagctggaaa attaccaaga ggaaacatca ggggaagggg 3181 aattctggct aaactgactg ctggggatgg gttctcggtc attttctaca ctgacctaac 3241 aggattcata ctggaggcag gccagggtgc tcagacatca ccggggggat ggtggcagat 3301 gaggaacgtg atcagatata ggaggtgatc agatatggga ggtgatcaga tatggagtgg 3361 tggggggagg gttgttgcta agctgactta gcagagttct tgttagaac // LOCUS HSSERCA3M 4553 bp RNA PRI 19-SEP-1996 DEFINITION H.sapiens mRNA for adenosine triphosphatase, calcium. ACCESSION Z69881 NID g1524091 KEYWORDS adenosine triphosphatase; Ca2+-ATPase 3; SERCA3 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4553) AUTHORS Dode,L. TITLE Direct Submission JOURNAL Submitted (28-FEB-1996) Dode L., Katholieke Universiteit Leuven, Fysiologie, Campus Gasthuisberg Herestraat 49, Leuven, Belgium, B-3000 REFERENCE 2 (bases 1 to 4553) AUTHORS Dode,L., Wuytack,F., Kools,P.F., Baba-Aissa,F., Raeymaekers,L., Brike,F., van de Ven,W.J. and Casteels,R. TITLE cDNA cloning, expression and chromosomal localization of the human sarco/endoplasmic reticulum Ca(2+)-ATPase 3 gene JOURNAL Biochem. J. 318 (Pt 2), 689-699 (1996) MEDLINE 96404924 REMARK Erratum:[[published erratum appears in Biochem J 1996 Nov 1;319(pt 3):1008]] FEATURES Location/Qualifiers source 1..4553 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pHS3" /dev_stage="adult" /tissue_type="leukemia" /cell_type="T-lymphocyte" /cell_line="jurkat" /clone_lib="lambda ZAPII (Stratagene)" 5'UTR <1..6 gene 7..3006 /gene="ATP2A3" CDS 7..3006 /gene="ATP2A3" /standard_name="sarco/endoplasmic reticulum Ca(2+)-ATPase 3" /function="calcium transporting ATPase" /codon_start=1 /evidence=experimental /product="adenosine triphosphatase, calcium" /db_xref="PID:e224094" /db_xref="PID:g1524092" /translation="MEAAHLLPAADVLRHFSVTAEGGLSPAQVTGARERYGPNELPSE EGKSLWELVLEQFEDLLVRILLLAALVSFVLAWFEEGEETTTAFVEPLVIMLILVANA IVGVWQERNAESAIEALKEYEPEMGKVIRSDRKGVQRIRARDIVPGDIVEVAVGDKVP ADLRLIEIKSTTLRVDQSILTGESVSVTKHTEAIPDPRAVNQDKKNMLFSGTNITSGK AVGVAVATGLHTELGKIRSQMAAVEPERTPLQRKLDEFGRQLSHAISVICVAVWVINI GHFADPAHGGSWLRGAVYYFKIAVALAVAAIPEGLPAVITTCLALGTRRMARKNAIVR SLPSVETLGCTSVICSDKTGTLTTNQMSVCRMFVVAEADAGSCLLHEFTISGTTYTPE GEVRQGDQPVRCGQFDGLVELATICALCNDSALDYNEAKGVYEKVGEATETALTCLVE KMNVFDTDLQALSRVERAGACNTVIKQLMRKEFTLEFSRDRKSMSVYCTPTRPHPTGQ GSKMFVKGAPESVIERCSSVRVGSRTAPLTPTSREQILAKIRDWGSGSDTLRCLALAT RDAPPRKEDMELDDCSKFVQYETDLTFVGCVGMLDPPRPEVAACITRCYQAGIRVVMI TGDNKGTAVAICRRLGIFGDTEDVAGKAYTGREFDDLSPEQQRQACRTARCFARVEPA HKSRIVENLQSFNEITAMTGDGVNDAPALKKAEIGIAMGSGTAVAKSAAEMVLSDDNF ASIVAAVEEGRAIYSNMKQFIRYLISSNVGEVVCIFLTAILGLPEALIPVQLLWVNLV TDGLPATALGFNPPDLDIIEKLPRSPREALISGWLFFRYLAIGVYVGLATVAAATWWF VYDAEGPHINFYQLRNFLKCSEDNPLFAGIDCEVFESRFPTTMALSVLVTIEMCNALN SVSENQSLLRMPPWMNPWLLVAVAMSMALHFLILLVPPLPLIFQVTPLSGRQWVVVLQ ISLPVILLDEALKYLSRNHMHEEMSQK" polyA_signal 4531..4536 /evidence=experimental polyA_site 4553 /evidence=experimental BASE COUNT 827 a 1449 c 1410 g 867 t ORIGIN 1 ggcggcatgg aggcggcgca tctgctcccg gccgccgacg tgctgcgcca cttctcggtg 61 acagccgagg gcggcctgag cccggcgcag gtgaccggcg cgcgggagcg ctacggcccc 121 aacgagctcc cgagtgagga agggaagtcc ctgtgggagc tggtgctgga acagtttgag 181 gacctcctgg tgcgcatcct gctgctggct gcccttgtct cctttgtcct ggcctggttc 241 gaggagggcg aggagaccac gaccgccttc gtggagcccc tggtcatcat gctgatcctc 301 gtggccaacg ccattgtggg cgtgtggcag gaacgcaacg ccgagagtgc catcgaggcc 361 ctgaaggagt atgagcctga gatgggcaag gtgatccgct cggaccgcaa gggcgtgcag 421 aggatccgtg cccgggacat cgtcccaggg gacattgtag aagtggcagt gggggacaaa 481 gtgcctgctg acctccgcct catcgagatc aagtccacca cgctgcgagt ggaccagtcc 541 atcctgacgg gtgaatctgt gtccgtgacc aagcacacag aggccatccc agaccccaga 601 gctgtgaacc aggacaagaa gaacatgctg ttttctggca ccaatatcac atcgggcaaa 661 gcggtgggtg tggccgtggc caccggcctg cacacggagc tgggcaagat ccggagccag 721 atggcggcag tcgagcccga gcggacgccg ctgcagcgca agctggacga gtttggacgg 781 cagctgtccc acgccatctc tgtgatctgc gtggccgtgt gggtcatcaa catcggccac 841 ttcgccgacc cggcccacgg tggctcctgg ctgcgtggcg ctgtctacta cttcaagatc 901 gccgtggccc tggcggtggc ggccatcccc gagggcctcc cggctgtcat cactacatgc 961 ctggcactgg gcacgcggcg catggcacgc aagaacgcca tcgtgcgaag cctgccgtcc 1021 gtggagaccc tgggctgcac ctcagtcatc tgctccgaca agacgggcac gctcaccacc 1081 aatcagatgt ctgtctgccg gatgttcgtg gtagccgagg ccgatgcggg ctcctgcctt 1141 ttgcacgagt tcaccatctc gggtaccacg tatacccccg agggcgaagt gcggcagggg 1201 gatcagcctg tgcgctgcgg ccagttcgac gggctggtgg agctggcgac catctgcgcc 1261 ctgtgcaacg actcggcgct ggactacaac gaggccaagg gtgtgtacga gaaggtggga 1321 gaggccacgg agacagctct gacttgcctg gtggagaaga tgaatgtgtt cgacaccgac 1381 ctgcaggctc tgtcccgggt ggagcgagct ggcgcctgta acacggtcat caagcagctg 1441 atgcggaagg agttcaccct ggagttctcc cgagaccgga aatccatgtc cgtgtactgc 1501 acgcccaccc gccctcaccc taccggccag ggcagcaaga tgtttgtgaa gggggctcct 1561 gagagtgtga tcgagcgctg tagctcagtc cgcgtgggga gccgcacagc acccctgacc 1621 cccacctcca gggagcagat cctggcaaag atccgggatt ggggctcagg ctcagacacg 1681 ctgcgctgcc tggcactggc cacccgggac gcgcccccaa ggaaggagga catggagctg 1741 gacgactgca gcaagtttgt gcagtacgag acggacctga ccttcgtggg ctgcgtaggc 1801 atgctggacc cgccgcgacc tgaggtggct gcctgcatca cacgctgcta ccaggcgggc 1861 atccgcgtgg tcatgatcac gggggataac aaaggcactg ccgtggccat ctgccgcagg 1921 cttggcatct ttggggacac ggaagacgtg gcgggcaagg cctacacggg ccgcgagttt 1981 gatgacctca gccccgagca gcagcgccag gcctgccgca ccgcccgctg cttcgcccgc 2041 gtggagcccg cacacaagtc ccgcatcgtg gagaacctgc agtcctttaa cgagatcact 2101 gctatgactg gcgatggagt gaacgacgca ccagccctga agaaagcaga gatcggcatc 2161 gccatgggct caggcacggc cgtggccaag tcggcggcag agatggtgct gtcagatgac 2221 aactttgcct ccatcgtggc tgcggtggag gagggccggg ccatctacag caacatgaag 2281 caattcatcc gctacctcat ctcctccaat gttggcgagg tcgtctgcat cttcctcacg 2341 gcaattctgg gcctgcccga agccctgatc cctgtgcagc tgctctgggt gaacctggtg 2401 acagatggcc tacctgccac ggctctgggc ttcaacccgc cagacctgga catcatagag 2461 aagctgcccc ggagcccccg agaagccctc atcagtggct ggctcttctt ccgatacctg 2521 gctatcggag tgtacgtagg cctggccaca gtggctgccg ccacctggtg gtttgtgtat 2581 gacgccgagg gacctcacat caacttctac cagctgagga acttcctgaa gtgctccgaa 2641 gacaacccgc tctttgccgg catcgactgt gaggtgttcg agtcacgctt ccccaccacc 2701 atggccttgt ccgtgctcgt gaccattgaa atgtgcaatg ccctcaacag cgtctcggag 2761 aaccagtcgc tgctgcggat gccgccctgg atgaacccct ggctgctggt ggctgtggcc 2821 atgtccatgg ccctgcactt cctcatcctg ctcgtgccgc ccctgcctct cattttccag 2881 gtgaccccac tgagcgggcg ccagtgggtg gtggtgctcc agatatctct gcctgtcatc 2941 ctgctggatg aggccctcaa gtacctgtcc cggaaccaca tgcacgaaga aatgagccag 3001 aagtgagcgc tgggaacagg gtggagtctc cggtgtgtac ctcagactga tggtgcccat 3061 gtgttcgcct ccgcccccca cccttgccac cacactcgcc cacttgccca ccgggtcccg 3121 ccggataaat gacaggcccg aggtcagaat ggccatcccc gggccccgtc ctggggtctc 3181 tgtccccact tccttctggc ctgggaggtc tgtaattcct gtctcctgga ctctcctggg 3241 aagttccctg ctctgcagct ctggcccagg agctgcaggc tgggaggggg cagccaagaa 3301 gccggagctg gcagcatacc cagagatccg gggccccccc acccccaaat cacgagtgca 3361 gctggagctt gctccccctt gttcggaagc tggacgttca cttggtgact ggtgcctctg 3421 cactgacgga ggactctggg ggtccttctt accggctctg acctctctct tcgtgcctgg 3481 tctgggactg ggtcagccct gggggatcag aaggggccat ctgggcccag ctgtgtacag 3541 cgagggtggg cagccccctc cactccactc tgcttccaca aagtcggctc ccgagagctc 3601 gaggctgctt ctgtttatat gtgcagggcc cgggccggtg aagggtcaga gagacggaca 3661 caaggagccg gcaggagggc ggagcgagga tgtcctttcc cgggagacaa gtcgggaaag 3721 cctggctgga ctgcctcagc cccgcgcgcc tcctggactc agggttcccc gtcctgagct 3781 cgggagatgt tcagagtcac actgccgccc ggtctgccac gcagaggtcc aacttgccac 3841 ccgcgtccct ggtacctgag accaccgaca tcctcaggtt cctgaccgtg gcgcccttct 3901 acccagccca gtgtgcggcc gccgcgctgt ctgcacagct gggggcctct gagcctggtg 3961 ggcttcctgg actcttggcc tcactccttg ccccctcccc acgacaccca tgagccgaaa 4021 ggatgtcact aaggatggct gattccccaa gggcacccgc tctccctccc tccctgctgg 4081 aggaacacgt catatcagat gagaggaaga tggcctctga tggacagaat ttttctctta 4141 actcagcttt tgctactttg gcaaaaacta gcgaggggta gcagaaacct gcaccaagga 4201 ttgtccctat gtcttggccc ctcctagagc gtgtgcagac tgatgatttt atatgtaaat 4261 caagactcac atccctttcc tagtccccca catccaaagc ccctcagcct gccttgcaga 4321 ccaatgggct ccatgttctg tagccccctc ccctacgcct cacccctcct ccctctcaca 4381 ggttctgggc ggccagtgag agaaacgcag tgggggaggc agggagtctg gtgcctgcag 4441 agattctctg cttctttcct ggggggaggt ggggaggtct tagcaggagc gggccctgta 4501 cccacctgct gacctgctgt ttggtagaga aataaaggtt gtgtgactgg ggg // LOCUS HSSERR52 3016 bp RNA PRI 23-JAN-1995 DEFINITION H.sapiens serotonin 5-HT2 receptor mRNA. ACCESSION X57830 NID g36430 KEYWORDS serotonin receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3016) AUTHORS Saltzman,A. TITLE Direct Submission JOURNAL Submitted (11-FEB-1991) A. Saltzman, Rhone-Poulenc Rorer, Central Research, 500 Arcola Rd, Mail Stop NW14, Collegeville, PA 19426 USA REFERENCE 2 (bases 1 to 3016) AUTHORS Saltzman,A.G., Morse,B., Whitman,M.M., Ivanshchenko,Y., Jaye,M. and Felder,S. TITLE Cloning of the human serotonin 5-HT2 and 5-HT1C receptor subtypes JOURNAL Biochem. Biophys. Res. Commun. 181 (3), 1469-1478 (1991) MEDLINE 92109767 FEATURES Location/Qualifiers source 1..3016 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain stem" gene 146..1561 /gene="serotonin 5-HT2 receptor" CDS 146..1561 /gene="serotonin 5-HT2 receptor" /codon_start=1 /product="serotonin 5-HT2 receptor" /db_xref="PID:g36431" /db_xref="SWISS-PROT:P28223" /translation="MDILCEENTSLSSTTNSLMQLNDDTRLYSNDFNSGEANTSDAFN WTVDSENRTNLSCEGCLSPSCLSLLHLQEKNWSALLTAVVIILTIAGNILVIMAVSLE KKLQNATNYFLMSLAIADMLLGFLVMPVSMLTILYGYRWPLPSKLCAVWIYLDVLFST ASIMHLCAISLDRYVAIQNPIHHSRFNSRTKAFLKIIAVWTISVGISMPIPVFGLQDD SKVFKEGSCLLADDNFVLIGSFVSFFIPLTIMVITYFLTIKSLQKEATLCVSDLGTRA KLASFSFLPQSSLSSEKLFQRSIHREPGSYTGRRTMQSISNEQKACKVLGIVFFLFVV MWCPFFITNIMAVICKESCNEDVIGALLNVFVWIGYLSSAVNPLVYTLFNKTYRSAFS RYIQCQYKENKKPLQLILVNTIPALAYKSSQLQMGQKKNSKQDAKTTDNDCSMVALGK QHSEEASKDNSDGVNEKVSCV" BASE COUNT 874 a 646 c 616 g 880 t ORIGIN 1 gaattcgggt gagccagctc cgggagaaca gcatgtacac cagcctcagt gttacagagt 61 gtgggtacat caaggtgaat ggtgagcaga aactataacc tgttagtcct tctacacctc 121 atctgctaca agttctggct tagacatgga tattctttgt gaagaaaata cttctttgag 181 ctcaactacg aactccctaa tgcaattaaa tgatgacacc aggctctaca gtaatgactt 241 taactctgga gaagctaaca cttctgatgc atttaactgg acagtcgact ctgaaaatcg 301 aaccaacctt tcctgtgaag ggtgcctctc accgtcgtgt ctctccttac ttcatctcca 361 ggaaaaaaac tggtctgctt tactgacagc cgtagtgatt attctaacta ttgctggaaa 421 catactcgtc atcatggcag tgtccctaga gaaaaagctg cagaatgcca ccaactattt 481 cctgatgtca cttgccatag ctgatatgct gctgggtttc cttgtcatgc ccgtgtccat 541 gttaaccatc ctgtatgggt accggtggcc tctgccgagc aagctttgtg cagtctggat 601 ttacctggac gtgctcttct ccacggcctc catcatgcac ctctgcgcca tctcgctgga 661 ccgctacgtc gccatccaga atcccatcca ccacagccgc ttcaactcca gaactaaggc 721 atttctgaaa atcattgctg tttggaccat atcagtaggt atatccatgc caataccagt 781 ctttgggcta caggacgatt cgaaggtctt taaggagggg agttgcttac tcgccgatga 841 taactttgtc ctgatcggct cttttgtgtc atttttcatt cccttaacca tcatggtgat 901 cacctacttt ctaactatca agtcactcca gaaagaagct actttgtgtg taagtgatct 961 tggcacacgg gccaaattag cttctttcag cttcctccct cagagttctt tgtcttcaga 1021 aaagctcttc cagcggtcga tccataggga gccagggtcc tacacaggca ggaggactat 1081 gcagtccatc agcaatgagc aaaaggcatg caaggtgctg ggcatcgtct tcttcctgtt 1141 tgtggtgatg tggtgccctt tcttcatcac aaacatcatg gccgtcatct gcaaagagtc 1201 ctgcaatgag gatgtcattg gggccctgct caatgtgttt gtttggatcg gttatctctc 1261 ttcagcagtc aacccactag tctacacact gttcaacaag acctataggt cagccttttc 1321 acggtatatt cagtgtcagt acaaggaaaa caaaaaacca ttgcagttaa ttttagtgaa 1381 cacaataccg gctttggcct acaagtctag ccaacttcaa atgggacaaa aaaagaattc 1441 aaagcaagat gccaagacaa cagataatga ctgctcaatg gttgctctag gaaagcagca 1501 ttctgaagag gcttctaaag acaatagcga cggagtgaat gaaaaggtga gctgtgtgtg 1561 ataggctagt tgccgtggca actgtggaag gcacactgag caagttttca cctatctgga 1621 aaaaaaaaat atgagattgg aaaaaattag acaagtctag tggaaccaac gatcatatct 1681 gtatgcctca ttttattctg tcaatgaaaa gcggggttca atgctacaaa atgtgtgctt 1741 ggaaaatgtt ctgacagcat ttcagctgtg agctttctga tacttattta taacattgta 1801 aatgatatgt ctttaaaatg attcactttt attgtataat tatgaagccc taagtaaatc 1861 taaattaact tctattttca agtggaaacc ttgctgctat gctgttcatt gatgacatgg 1921 gattgagttg gttacctatt gccgtaaata aaaatagcta taaatagtga aaattttatt 1981 gaatataatg gcctcttaaa aattatcttt aaaacttact atggtatata ttttgaaagg 2041 agaaaaaaaa aaagccacta aggtcagtgt tataaaatct gtattgctaa gataattaaa 2101 tgaaatactt gacaacattt ttcatagata ccattttgaa atattcacaa ggttgctggc 2161 atttgctgca tttcaagtta attctcagaa gtgaaaaaga cttcaaatgt tattcaataa 2221 ctattgctgc tttctcttct acttcttgtg ctttactctg aatttccagt gtggtcttgt 2281 ttaatatttg ttcctctagg taaactagca aaaggatgat ttaacattac caaatgcctt 2341 tctagcaatt gcttctctaa aacagcacta tcgaggtatt tggtaacttg ctgtgaaatg 2401 actgcatcat gcatgcactc ttttgagcag taaatgtata ttgatgtaac tgtgtcagga 2461 ttgaggatga actcaggttt ccggctactg acagtggtag agtcctagga catctctgta 2521 aaaagcaggt gactttccta tgacactcat caggtaaact gatgctttca gatccatcgg 2581 tttatactat ttattaaaac cattctgctt ggttccacaa tcatctattg agtgtacatt 2641 tatgtgtgaa gcaaatttct agatatgaga aatataaaaa taattaaaac aaaatccttg 2701 ccttcaaacg aaatggctcg gccaggcacg gaggctcgtg catgtaatcc tagcactttg 2761 ggaggctgag atgggaggat cacttgaggc caagagtttg agaccaacct gggtaacaaa 2821 gtgagacctc cctgtctcta caaaaaaaat caaaaaatta tctgatcctt gtggcacaca 2881 actgtggtcc cagctacagg ggaggctgag acgcaaggat cacttgagcc cagaagctca 2941 aggctgcagt gagccaagtt cacaccactg ccatttcctc ctgggcaaca gagtgagacc 3001 ctatcacccc gaattc // LOCUS HSSERT 2099 bp RNA PRI 03-FEB-1993 DEFINITION H.sapiens mRNA for serotonin transporter. ACCESSION X70697 NID g36432 KEYWORDS cDNA sequence; serotonin transporter. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2099) AUTHORS Lesch,K.P. TITLE Direct Submission JOURNAL Submitted (18-JAN-1993) K.P. Lesch, Dept of Psychiatry, University of Wuerzburg, Fuechsleinstr 15, 8700 Wuerzburg, FRG REFERENCE 2 (bases 1 to 2099) AUTHORS Lesch,K.P., Wolozin,B.L., Estler,H.C., Murphy,D.L. and Riederer,P. TITLE Isolation of a cDNA encoding the human brain serotonin transporter JOURNAL J. Neural Transm. 91, 67-73 (1993) FEATURES Location/Qualifiers source 1..2099 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" CDS 111..2003 /codon_start=1 /product="serotonin transporter" /db_xref="PID:g36433" /db_xref="SWISS-PROT:P31645" /translation="METTPLNSQKQLSACEDGEDCQENGVLQKVVPTPGDKVESGQIS NGYSAVPSPGAGDDTRHSIPATTTTLVAELHQGERETWGKKVDFLLSVIGYAVDLGNV WRFPYICYQNGGGAFLLPYTIMAIFGGIPLFYMELALGQYHRNGCISIWRKICPIFKG IGYAICIIAFYIASYYNTIMAWALYYLISSFTDQLPWTSCKNSWNTGNCTNYFSEDNI TWTLHSTSPAEEFYTRHVLQIHRSKGLQDLGGISWQLALCIMLIFTVIYFSIWKGVKT SGKVVWVTATFPYIILSVLLVRGATLPGAWRGVLFYLKPNWQKLLETGVWIDAAAQIF FSLGPGFGVLLAFASYNKFNNNCYQDALVTSVVNCMTSFVSGFVIFTVLGYMAEMRNE DVSEVAKDAGPSLLFITYAEAIANMPASTFFAIIFFLMLITLGLDSTFAGLEGVITAV LDEFPHVWAKRRERFVLAVVITCFFGSLVTLTFGGAYVVKLLEEYATGPAVLTVALIE AVAVSWFYGITQFCRDVKEMLGFSPGWFWRICWVAISPLFLLFIICSFLMSPPQLRLF QYNYPYWSIILGYCIGTSSFICIPTYIAYRLIITPGTFKERIIKSITPETPTEIPCGD IRLNAV" BASE COUNT 459 a 574 c 541 g 525 t ORIGIN 1 gcgtgcaacc cgacgataga gagctcggag gtgatccaca aatccaagca cccagagatc 61 cattgggatc cttggcagat ggacatcagt gtcatttact aaccagcagg atggagacga 121 cgcccttgaa ttctcagaag cagctatcag cgtgtgaaga tggagaagat tgtcaggaaa 181 acggagttct acagaaggtt gttcccaccc caggggacaa agtggagtcc gggcaaatat 241 ccaatgggta ctcagcagtt ccaagtcctg gtgcgggaga tgacacacgg cactctatcc 301 cagcgaccac caccacccta gtggctgagc ttcatcaagg ggaacgggag acctggggca 361 agaaggtgga tttccttctc tcagtgattg gctatgctgt ggacctgggc aatgtctggc 421 gcttccccta catatgttac cagaatggag ggggggcatt cctcctcccc tacaccatca 481 tggccatttt tgggggaatc ccgctctttt acatggagct cgcactggga cagtaccacc 541 gaaatggatg catttcaata tggaggaaaa tctgcccgat tttcaaaggg attggttatg 601 ccatctgcat cattgccttt tacattgctt cctactacaa caccatcatg gcctgggcgc 661 tatactacct catctcctcc ttcacggacc agctgccctg gaccagctgc aagaactcct 721 ggaacactgg caactgcacc aattacttct ccgaggacaa catcacctgg accctccatt 781 ccacgtcccc tgctgaagaa ttttacacgc gccacgtcct gcagatccac cggtctaagg 841 ggctccagga cctggggggc atcagctggc agctggccct ctgcatcatg ctgatcttca 901 ctgttatcta cttcagcatc tggaaaggcg tcaagacctc tggcaaggtg gtgtgggtga 961 cagccacctt cccttatatc atcctttctg tcctgctggt gaggggtgcc accctccctg 1021 gagcctggag gggtgttctc ttctacttga aacccaattg gcagaaactc ctggagacag 1081 gggtgtggat agatgcagcc gctcagatct tcttctctct tggtccgggc tttggggtcc 1141 tgctggcttt tgctagctac aacaagttca acaacaactg ctaccaagat gccctggtga 1201 ccagcgtggt gaactgcatg acgagcttcg tttcgggatt tgtcatcttc acagtgctcg 1261 gttacatggc tgagatgagg aatgaagatg tgtctgaggt ggccaaagac gcaggtccca 1321 gcctcctctt catcacgtat gcagaagcga tagccaacat gccagcgtcc actttctttg 1381 ccatcatctt ctttctgatg ttaatcacgc tgggcttgga cagcacgttt gcaggcttgg 1441 agggggtgat cacggctgtg ctggatgagt tcccacacgt ctgggccaag cgccgggagc 1501 ggttcgtgct cgccgtggtc atcacctgct tctttggatc cctggtcacc ctgacttttg 1561 gaggggccta cgtggtgaag ctgctggagg agtatgccac ggggcccgca gtgctcactg 1621 tcgcgctgat cgaagcagtc gctgtgtctt ggttctatgg catcactcag ttctgcaggg 1681 acgtgaagga aatgctcggc ttcagcccgg ggtggttctg gaggatctgc tgggtggcca 1741 tcagccctct gtttctcctg ttcatcattt gcagttttct gatgagcccg ccacaactac 1801 gacttttcca atataattat ccttactgga gtatcatctt gggttactgc ataggaacct 1861 catctttcat ttgcatcccc acatatatag cttatcggtt gatcatcact ccagggacat 1921 ttaaagagcg tattattaaa agtattaccc cggagacacc aacagaaatt ccttgtgggg 1981 acatccgctt gaatgctgtg taacacactc accgagagga aaaaggcttc tccacaacct 2041 cctcctccag ttctgaggag gcacgcctgc cttctcccct ccgagtgaat gagtttgcc // LOCUS HSSF1BO 2298 bp RNA PRI 23-OCT-1996 DEFINITION H.sapiens mRNA for splicing factor, SF1-Bo isoform. ACCESSION Y08766 NID g1620402 KEYWORDS splicing factor SF1. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2298) AUTHORS Arning,S., Gruter,P., Bilbe,G. and Kramer,A. TITLE Mammalian splicing factor SF1 is encoded by variant cDNAs and binds to RNA JOURNAL RNA 2 (8), 794-810 (1996) MEDLINE 96355840 REFERENCE 2 (bases 1 to 2298) AUTHORS Kramer,A.J. TITLE Direct Submission JOURNAL Submitted (10-OCT-1996) A.J. Kramer, Universite de Geneve, Dept de Biologie Cellulaire, 30 Quai Ernest-Ansermet, 1211 Geneve 4, CH-1211, SWITZERLAND FEATURES Location/Qualifiers source 1..2298 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="bone (femoral head)" /clone_lib="lambda gt11" CDS 295..2211 /codon_start=1 /evidence=experimental /product="SF1-Bo isoform" /db_xref="PID:e274688" /db_xref="PID:g1620403" /translation="MATGANATPLDFPSKKRKRSRWNQDTMEQKTVIPGMPTVIPPGL TREQERAYIVQLQIEDLTRKLRTGDLGIPPNPEDRSPSPEPIYNSEGKRLNTREFRTR KKLEEERHNLITEMVALNPDFKPPADYKPPATRVSDKVMIPQDEYPEINFVGLLIGPR GNTLKNIEKECNAKIMIRGKGSVKEGKVGRKDGQMLPGEDEPLHALVTANTMENVKKA VEQIRNILKQGIETPEDQNDLRKMQLRELARLNGTLREDDNRILRPWQSSETRSITNT TVCTKCGGAGHIASDCKFQRPGDPQSAQDKARMDKEYLSLMAELGEAPVPASVGSTSG PATTPLASAPRPAAPANNPPPPSLMSTTQSRPPWMNSGPSESRPYHGMHGGGPGGPGG GPHSFPHPLPSLTGGHGGHPMQHNPNGPPPPWMQPPPPPMNQGPHPPGHHGPPPMDQY LGSTPVGSGVYRLHQGKGMMPPPPMGMMPPPPPPPSGQPPPPPSGPLPPWQQQQQQPP PPPPPSSSMASSTPLPWQQNTTTTTTSAGTGSIPPWQQQQAAAAASPGAPQMQGNPTM VPLPPGVQPPLPPGAPPPPPPPPPGSAGMMIPPRGGDGPSHESEDFPRPLVTLPGRQP QQRPWWTGWFGKAA" BASE COUNT 520 a 766 c 614 g 398 t ORIGIN 1 gggaggtgtc gcagcgccat caagaaggac tgaggctccg caatcggagg ccgccgattt 61 cgacccttcg cctcggcccg gcccaatcca ggccccggcc cgccgccccc ggccgccccc 121 gcgtgccctc tctcctccct ctttgtgcgt ctcgcgccgc cgccgcccgc cgcgtgagag 181 gacgggctcc gcgcgctccg gcagccgatt cgggtcccct ccccccggga ggcttgcgaa 241 ggagaagccg ccgcagagga aaagcaggtg ccggtgcctg tccccggggg gcccatggcg 301 accggagcga acgccacgcc gttggacttc ccaagtaaga agcggaagag gagccgctgg 361 aaccaagaca caatggaaca gaagacagtg attccaggaa tgcctacagt tattccccct 421 ggacttactc gagaacaaga aagagcttat atagtgcaac tgcagataga agacctgact 481 cgtaaactgc gcacaggaga cctgggcatc ccccctaacc ctgaggacag gtccccttcc 541 cctgagccca tctacaatag cgaggggaag cggcttaaca cccgagagtt ccgcacccgc 601 aaaaagctgg aagaggagcg gcacaacctc atcacagaga tggttgcact caatccggat 661 ttcaagccac ctgcagatta caaacctcca gcaacacgtg tgagtgataa agtcatgatt 721 ccacaagatg agtacccaga aatcaacttt gtggggctgc tcatcgggcc cagagggaac 781 accctgaaga acatagagaa ggagtgcaat gccaagatta tgatccgggg gaaagggtct 841 gtgaaagaag ggaaggttgg gcgcaaagat ggccagatgt tgccaggaga agatgagcca 901 cttcatgccc tggttactgc caatacaatg gagaacgtca aaaaggcagt ggaacagata 961 agaaacatcc tgaagcaggg tatcgagact ccagaggacc agaatgatct acggaagatg 1021 cagcttcggg agttggctcg cttaaatggg acccttcggg aagacgataa caggatctta 1081 agaccctggc agagctcaga gacccgcagc attaccaaca ccacagtgtg taccaagtgt 1141 ggaggggctg gccacattgc ttcagactgt aaattccaaa ggcctggtga tcctcagtca 1201 gctcaggata aagcacggat ggataaagaa tatttgtccc tcatggctga actgggtgaa 1261 gcacctgtcc cagcatctgt gggctccacc tctgggcctg ccaccacacc cctggccagc 1321 gcacctcgtc ctgctgctcc cgccaacaac ccacctccac cgtctctcat gtctaccacc 1381 cagagccgcc caccctggat gaattctggc ccttcagaga gtcggcccta ccacggcatg 1441 catggaggtg gtcctggtgg gcccggaggt ggcccccaca gcttcccaca cccattaccc 1501 agcctgacag gtgggcatgg tggacatccc atgcagcaca accccaatgg acccccaccc 1561 ccttggatgc agccaccacc accaccgatg aaccagggcc cccaccctcc tgggcaccat 1621 ggccctcctc caatggatca gtacctggga agtacgcctg tgggctctgg ggtctatcgc 1681 ctgcatcaag gaaaaggtat gatgccgcca ccacctatgg gcatgatgcc gccgccgccg 1741 ccgcctccca gtgggcagcc cccaccccct ccctctggtc ctcttccccc atggcaacaa 1801 cagcagcagc agcctccgcc accccctccg cccagcagca gtatggcttc cagtaccccc 1861 ttgccatggc agcaaaatac gacgactacc accacgagcg ctggcacagg gtccatcccg 1921 ccatggcaac agcagcaggc ggctgccgca gcttctccag gagcccctca gatgcaaggc 1981 aaccccacta tggtgcccct gccccccggg gtccagccgc ctctgccgcc tggggcccct 2041 ccccctccgc cgcctccacc gcctggttcc gccggcatga tgatccctcc ccgcggcggc 2101 gatggcccga gccatgagag tgaggacttt ccgcgcccat tggtgaccct tccaggcaga 2161 cagcctcagc aacgcccctg gtggacagga tggttcggca aagcagcctg agttattttt 2221 gtggacggaa tcggaacacg ctggctccat atcgtgaaat ttttattaat ttttttcttt 2281 ttcctttgtt acttcttt // LOCUS HSSF3A120 2613 bp RNA PRI 22-APR-1996 DEFINITION H.sapiens mRNA for splicing factor SF3a120. ACCESSION X85237 NID g899297 KEYWORDS SF3a120 gene; splicing factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2613) AUTHORS Kramer,A., Mulhauser,F., Wersig,C., Groning,K. and Bilbe,G. TITLE Mammalian splicing factor SF3a120 represents a new member of the SURP family of proteins and is homologous to the essential splicing factor PRP21p of Saccharomyces cerevisiae JOURNAL RNA 1 (3), 260-272 (1995) MEDLINE 96079958 REFERENCE 2 (bases 1 to 2613) AUTHORS Kramer,A.J. TITLE Direct Submission JOURNAL Submitted (09-MAR-1995) A.J. Kramer, Universite de Geneve, Dept de Biologie Cellulaire, 30 quai Ernest-Ansermet, 1211 Geneve 4, SWITZERLAND COMMENT Sequence overlapping with the one under the acc# T25051. FEATURES Location/Qualifiers source 1..2613 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /clone_lib="lambda gt11" /chromosome="22" gene 98..2479 /gene="SF3a120" CDS 98..2479 /gene="SF3a120" /codon_start=1 /evidence=experimental /product="human splicing factor" /db_xref="PID:g899298" /translation="MPAGPVQAVPPPPPVPTEPKQPTEEEASSKEDSAPSKPVVGIIY PPPEVRNIVDKTASFVARNGPEFEARIRQNEINNPKFNFLNPNDPYHAYYRHKVSEFK EGKAQEPSAAIPKVMQQQQQTTQQQLPQKVQAQVIQETIVPKEPPPEFEFIADPPSIS AFDLDVVKLTAQFVARNGRQFLTQLMQKEQRNYQFDFLRPQHSLFNYFTKLVEQYTKI LIPPKGLFSKLKKEAENPREVLDQVCYRVEWAKFQERERKKEEEEKEKERVAYAQIDW HDFVVVETVDFQPNEQGNFPPPTTPEELGARILIQERYEKFGESEEVEMEVESDEEDD KQEKAEEPPSQLDQDTQVQDMDEGSDDEEEGQKVPPPPETPMPPPLPPTPDQVIVRKD YDPKASKPLPPAPAPDEYLVSPITGEKIPASKMQEHMRIGLLDPRWLEQRDRSIREKQ SDDEVYAPGLDIESSLKQLAERRTDIFGVEETAIGKKIGEEEIQKPEEKVTWDGHSGS MARTQQAAQANITLQEQIEAIHKAKGLVPEDDTKEKIGPSKPNEIPQQPPPPSSATNI PSSAPPITSVPRPPTMPPPVRTTVVSAVPVMPRPPMASVVRLPPGSVIAPMPPIIHAP RINVVPMPPSAPPIMAPRPPPMIVPTAFVPAPPVAPVPAPAPMPPVHPPPPMEDEPTS KKLKTEDSLMPEEEFLRRNKGPVSIKVQVPNMQDKTEWKLNGQVLVFTLPLTDQVSVI KVKIHEATGMPAGKQKLQYEGIFIKDSNSLAYYNMANGAVIHLALKERGGRKK" misc_feature 251..379 /gene="SF3a120" /note="SURP module 1" misc_feature 593..721 /gene="SF3a120" /note="SURP module 2" misc_feature 857..901 /gene="SF3a120" /note="charged domain" misc_feature 1708..1892 /gene="SF3a120" misc_feature 2237..2467 /gene="SF3a120" /note="ubiquitin-like domain" BASE COUNT 659 a 800 c 679 g 475 t ORIGIN 1 cttgcgagct cgtcgtactg accgagcggg gaggctgtct tgaggcggca ccgctcaccg 61 acaccgaggc ggactggcag ccctgagcgt cgcagtcatg ccggccggac ccgtgcaggc 121 ggtgcccccg ccgccgcccg tgcccacgga gcccaaacag cccacagaag aagaagcatc 181 ttcaaaggag gattctgcac cttctaagcc agttgtgggg attatttacc ctcctccaga 241 ggtcagaaat attgttgaca agactgccag ctttgtggcc agaaacgggc ctgaatttga 301 agctaggatc cgacagaacg agatcaacaa ccccaagttc aactttctga accccaatga 361 cccttaccat gcctactacc gccacaaggt cagcgagttc aaggaaggga aggctcagga 421 gccgtccgcc gccatcccca aggtcatgca gcagcagcag cagaccaccc agcagcagct 481 gccccagaag gtccaagccc aagtaatcca agagaccatc gtgcccaaag agcctcctcc 541 tgagtttgag ttcattgctg atcctccctc tatctcagcc ttcgacttgg atgtggtgaa 601 gctgacggct cagtttgtgg ccaggaatgg gcgccagttt ctgacccagc tgatgcagaa 661 agagcagcgc aactaccagt ttgactttct ccgcccacag cacagcctct tcaactactt 721 cacgaagcta gtggaacagt acaccaagat cttgattcca cccaaaggtt tattttcaaa 781 gctcaagaaa gaggctgaaa acccccgaga agttttggat caggtgtgtt accgagtgga 841 atgggccaaa ttccaggaac gtgagaggaa gaaggaagaa gaggagaagg agaaggagcg 901 ggtggcctat gctcagatcg actggcatga ttttgtggtg gtggaaacag tggacttcca 961 acccaatgag caagggaact tccctccccc caccacgcca gaggagctgg gggcccgaat 1021 cctcattcag gagcgctatg aaaagtttgg ggagagtgag gaagttgaga tggaggtcga 1081 gtctgatgag gaggatgaca aacaggagaa ggcggaggag cctccttccc agctggacca 1141 ggacacccaa gtacaagata tggatgaggg ttcagatgat gaagaagaag ggcagaaagt 1201 gcccccaccc ccagagacac ccatgcctcc acctctgccc ccaactccag accaagtcat 1261 tgtccgcaag gattatgatc ccaaagcctc caagcccttg cctccagccc ctgctccaga 1321 tgagtatctt gtgtccccca ttactgggga gaagatcccc gccagcaaaa tgcaggaaca 1381 catgcgcatt ggacttcttg accctcgctg gctggagcag cgggatcgct ccatccgtga 1441 gaagcagagc gatgatgagg tgtacgcacc aggtctggat attgagagca gcttgaagca 1501 gttggctgag cggcgtactg acatcttcgg tgtagaggaa acagccattg gtaagaagat 1561 cggtgaggag gagatccaga agccagagga aaaggtgacc tgggatggcc actcaggcag 1621 catggcccgg acccagcagg ctgcccaggc caacatcacc ctccaggagc agattgaggc 1681 cattcacaag gccaaaggcc tggtgccaga ggatgacact aaagagaaga ttggccccag 1741 caagcccaat gaaatccctc aacagccacc gccaccatct tcagccacca acatccccag 1801 ctcggctcca cccatcactt cagtgccccg accacccaca atgccacctc cagttcgtac 1861 tacagttgtc tccgcagtac ccgtcatgcc ccggccccca atggcatctg tggtccggct 1921 gcccccaggc tcagtgatcg cccccatgcc gcccatcatc cacgcgccca gaatcaacgt 1981 ggtgcccatg cctccctcgg cccctcctat tatggccccc cgcccacccc ccatgattgt 2041 gccaacagcc tttgtgcctg ctccacctgt ggcacctgtc ccagctccag ccccaatgcc 2101 ccctgtgcat cccccacctc ccatggaaga tgagcccacc tccaaaaaac tgaagacaga 2161 ggacagcctc atgccagagg aggagttcct gcgcagaaac aagggtccag tgtccatcaa 2221 agtccaggtg cccaacatgc aggataagac ggaatggaaa ctgaatgggc aggtgctggt 2281 cttcaccctc ccactcacgg accaggtctc tgtcattaag gtgaagattc atgaagccac 2341 aggcatgcct gcagggaaac agaagctaca gtatgagggt atcttcatca aagattccaa 2401 ctcactggct tactacaaca tggccaatgg cgcagtcatc cacctggccc tcaaggagag 2461 aggcgggagg aagaagtaga caagaggaac ctgctgtcaa gtccctgcca ttttgcctct 2521 cctgtctccc accccctgcc ccagacccag gagcccccct gaggctttgc cttgcctgca 2581 tatttgtttc gctcttactc agtttgggaa ttc // LOCUS HSSGI 2454 bp RNA PRI 23-MAR-1995 DEFINITION Human mRNA for secretogranin I (chromogranin B). ACCESSION Y00064 NID g36438 KEYWORDS chromogranin B; phosphoprotein; secretogranin I; secretory protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2454) AUTHORS Huttner,W.B. TITLE Direct Submission JOURNAL Submitted (10-APR-1987) Huttner W.B., EMBL, Meyerhofstr.1, 6900 Heidelberg REFERENCE 2 (bases 1 to 2454) AUTHORS Benedum,U.M., Lamouroux,A., Konecki,D.S., Rosa,P., Hille,A., Baeuerle,P.A., Frank,R., Lottspeich,F., Mallet,J. and Huttner,W.B. TITLE The primary structure of human secretogranin I (chromogranin B): comparison with chromogranin A reveals homologous terminal domains and a large intervening variable region JOURNAL EMBO J. 6 (5), 1203-1211 (1987) MEDLINE 87275810 COMMENT A disulfid-bond is reported between Cys-16 (bp 218-220) and Cys-37 (bp 281-283) is reported. Data kindly reviewed (04-JUN-1987) by W.B. Huttner. FEATURES Location/Qualifiers source 1..2454 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="primary tumor (pheochromocytoma)" /clone_lib="lambda gt10 cDNA" CDS 113..2146 /note="precursor polypeptide (AA -20 to 657)" /codon_start=1 /db_xref="PID:g36439" /db_xref="SWISS-PROT:P05060" /translation="MQPTLLLSLLGAVGLAAVNSMPVDNRNHNEGMVTRCIIEVLSNA LSKSSAPPITPECRQVLKTSRKDVKDKETTENENTKFEVRLLRDPADASEAHESSSRG EAGAPGEEDIQGPTKADTEKWAEGGGHSRERADEPQWSLYPSDSQVSEEVKTRHSEKS QREDEEEEEGENYQKGERGEDSSEEKHLEEPGETQNAFLNERKQASAIKKEELVARSE THAAGHSQEKTHSREKSSQESGEEAGSQENHPQESKGQPRSQEESEEGEEDATSEVDK RRTRPRHHHGRSRPDRSSQGGSLPSEEKGHPQEESEESNVSMASLGEKRDHHSTHYRA SEEEPEYGEEIKGYPGVQAPEDLEWERYRGRGSEEYRAPRPQSEESWDEEDKRNYPSL ELDKMAHGYGEESEEERGLEPGKGRHHRGRGGEPRAYFMSDTREEKRFLGEGHHRVQE NQMDKARRHPQGAWKELDRNYLNYGEEGAPGKWQQQGDLQDTKENREEARFQDKQYSS HHTAEKRKRLGELFNPYYDPLQWKSSHFERRDNMNDNFLEGEEENELTLNEKNFFPEY NYDWWEKKPFSEDVNWGYEKRNLARVPKLDLKRQYDRVAQLDQLLHYRKKSAEFPDFY DSEEPVSTHQEAENEKDRADQTVLTEDEKKELENLAAMDLELQKIAEKFSQRG" sig_peptide 113..172 /note="signal peptide (AA -20 to -1)" mat_peptide 173..2143 /note="mature secretogranin I (chromogranin B) (AA 1-657)" misc_feature 629..631 /note="tyrosine sulfation site" misc_feature 943..945 /note="pot. N-glycosylation site" misc_feature 1253..1255 /note="major tyrosine sulfation site" misc_feature 2433..2438 /note="pot. polyA signal" polyA_site 2454 /note="polyA site" BASE COUNT 779 a 545 c 715 g 415 t ORIGIN 1 ccaggaggca cgctggtttt ccggggccgc tccatcgcgc cttcctcctg cgcctcgctt 61 ctccggtcca gccgccatct tcctttccgc acaggggccg ccgagcgggg ccatgcagcc 121 aacgctgctt ctcagcctcc tgggagccgt ggggctggcg gctgtcaatt ccatgccagt 181 ggataacagg aaccacaatg aaggaatggt gactcgctgc atcattgagg tcctctcaaa 241 tgccttgtcg aagtccagcg ctccacccat cacccctgag tgccgccaag tcctgaagac 301 gagtagaaaa gacgtcaaag acaaagagac aactgaaaat gaaaacacaa agtttgaagt 361 aagattgtta agagacccag ctgatgcctc ggaagcccac gagtcctcca gcaggggaga 421 ggcaggagcc ccaggggagg aggacatcca aggcccaaca aaggcagaca cagagaaatg 481 ggcagaggga ggcgggcaca gccgagagcg agcggatgag ccccagtgga gcctctatcc 541 ctccgacagc caagtctctg aagaagtgaa gacacgccat tctgagaaga gccagagaga 601 ggatgaggag gaggaggagg gagagaacta tcaaaaaggg gagcgagggg aagatagcag 661 tgaagagaaa caccttgaag agccaggaga gacacaaaac gcttttctca atgaaagaaa 721 gcaggcttca gctataaaaa aagaggagtt agtggccaga tcggaaacac atgctgccgg 781 gcattctcag gagaagacac atagccgaga gaagagtagc caggagagtg gagaggaggc 841 agggagccag gagaatcacc cccaggagtc taaaggccaa ccccgaagcc aggaagaatc 901 tgaggaaggt gaggaagatg ccacctctga ggtggacaaa cgacgcacga ggcccagaca 961 ccaccacggg aggagcaggc ccgacaggtc ctctcaagga gggagtcttc cctctgagga 1021 aaagggacac ccccaggagg aatctgagga gtcaaacgtc agcatggcca gtttagggga 1081 aaagagggac caccattcaa cccactacag ggcttcagag gaagaacctg aatatggaga 1141 agaaataaag ggttatccag gcgtccaggc ccctgaggac ctggagtggg agcgctatag 1201 gggcagagga agtgaagaat acagggctcc aagacctcag agtgaggaga gttgggatga 1261 ggaggacaag agaaactacc ccagcttaga gcttgataag atggcacatg gatatggtga 1321 agaaagtgag gaagagaggg gccttgagcc gggaaaggga cgccatcaca gaggcagggg 1381 aggggagcca cgtgcctatt tcatgtctga caccagagaa gagaaaaggt tcttgggtga 1441 aggacaccac cgtgtccaag aaaaccagat ggacaaggca aggaggcatc cacaaggtgc 1501 gtggaaagag ctggacagaa attatctcaa ctacggtgag gaaggagccc cagggaagtg 1561 gcagcagcag ggagacctgc aggacactaa agaaaacagg gaggaagcta ggtttcaaga 1621 taaacaatat agctcccatc acacagctga aaagaggaag agattagggg aactgttcaa 1681 cccatactac gaccctctcc agtggaagag cagccatttt gaaagaagag acaacatgaa 1741 tgacaatttt ctcgagggtg aggaggaaaa tgagctgacc ttgaacgaga agaatttctt 1801 cccagaatac aactatgact ggtgggagaa aaagcccttc tctgaggatg tgaactgggg 1861 gtatgagaag agaaacctcg ccagggtccc caagctggac ctgaaaaggc aatatgacag 1921 ggtggcccaa ctggaccagc tccttcacta caggaagaag tcagctgagt ttccagactt 1981 ctatgattct gaggagccgg tgagcaccca ccaggaggca gaaaatgaaa aggacagggc 2041 tgaccagaca gtcctgacag aggacgagaa aaaagaactc gaaaacttgg ctgcaatgga 2101 tttggaacta cagaagatag ctgagaaatt cagccaaagg ggctgactgt cattggagcg 2161 gtgggcactg ttaagaagca gccatcacat gatctgtttt tcaccacttc actgaaagac 2221 accatttata tacccaaggg cagaaagtag aacttactat tcattaaatg tttgacacaa 2281 ttggaattgt ctttaatttc tgtcagaatg ctattgaaaa tgtgaattgc atgacttgta 2341 gcatattctt ttctgcaaaa tagacatatt aacatgctta tgacaatgac tgtgctactg 2401 tctttggaaa aatgtttgtc tcagttggaa ataataaaag attcacctga gacc // LOCUS HSSGP1N15 2617 bp RNA PRI 06-DEC-1995 DEFINITION H.sapiens mRNA for surface glycoprotein. ACCESSION Z50022 NID g1107702 KEYWORDS surface glycoprotein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2617) AUTHORS YASPO,M. TITLE Cloning of a new gene encoding for a putative plasma transmembrane glycoprotein mapping to human chromosome 21q22.3 JOURNAL Unpublished REFERENCE 2 (bases 484 to 1020; 1731 to 1980) AUTHORS YASPO,M. TITLE Model for a transcript map of human chromosome 21: isolation of new coding sequences from exon and cDNA libraries JOURNAL Unpublished REFERENCE 3 (bases 1 to 2617) AUTHORS YASPO,M. TITLE Direct Submission JOURNAL Submitted (07-JUL-1995) Marie-Laure Yaspo, Genome Analysis, Imperial Cancer Research, Fund, 44 Lincoln's Inn Fields, London, WC2A 3PX, UK FEATURES Location/Qualifiers source 1..2617 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HTEcDNA1N15" /dev_stage="fetus 21 weeks" /tissue_type="thymus" /clone_lib="directionally cloned poly(dT)-primed cDNA library" /chromosome="21q22.3" 5'UTR 1..93 mRNA 1..2617 CDS 94..636 /note="complete cDNA sequence for a putative plasma integral protein containing an internalisation signal in the cytoplasmic C-terminal domain" /citation=[1] /codon_start=1 /product="putative surface glycoprotein" /db_xref="PID:e188111" /db_xref="PID:g1107703" /translation="MAPGVARGPTPYWRLRLGGAALLLLLIPVAAAQEPPGAACSQNT NKTCEECLKNVSCLWCNTNKACLDYPVTSVLPPASLCKLSSARWGVCWVNFEALIITM SVVGGTLLLGIAICCCCCCRRKRSRKPDRSEEKAMREREERRIRQEERRAEMKTRHDE IRKKYGLFKEENPYARFENN" sig_peptide 94..190 3'UTR 637..2617 polyA_signal 2596 polyA_site 2617 BASE COUNT 625 a 672 c 673 g 647 t ORIGIN 1 tagaggatcc aagcttacgt acgcgtccgg agaccgcttg tgctggagtc ggagttgtaa 61 cgctccactg actgatagag cgaccggccg accatggcgc ccggagtggc ccgcgggccg 121 acgccgtact ggaggttgcg cctcggtggc gccgcgctgc tcctgctgct catcccggtg 181 gccgccgcgc aggagcctcc cggagctgct tgttctcaga acacaaacaa aacctgtgaa 241 gagtgcctga agaacgtctc ctgtctttgg tgcaacacta acaaggcttg tctggactac 301 ccagttacaa gcgtcttgcc accggcttcc ctttgtaaat tgagctctgc acgctgggga 361 gtttgttggg tgaactttga ggcgctgatc atcaccatgt cggtagtcgg gggaaccctc 421 ctcctgggca ttgccatctg ctgctgctgc tgctgcagga ggaagaggag ccggaagccg 481 gacaggagtg aggagaaggc catgcgtgag cgggaggaga ggcggatacg gcaggaggaa 541 cggagagcag agatgaagac aagacatgat gaaatcagaa aaaaatatgg cctgtttaaa 601 gaagaaaacc cgtatgctag atttgaaaac aactaaagcg ctccagcaca tcagtcccga 661 cgcttcctgt gaggtgcact ccgcagccca gcccagccgg gagaccacgt ggccattgcg 721 gtctcctgac cttggccagt gaacctgcca gccttccagg acaggcggcc ggagagctgc 781 ccctgaagga cagtcctctc gtcttgcaga ctggtgacct tctattccct gttcatctct 841 gtttctagat ttagtcactt gaaataagaa atctttgggg tttgggcttt tttatactct 901 tctcagtttg tgaaacgcta actgcacacg aagccgcctg acggcaccca gcgctgtggc 961 tgtcattctc ccagggcaga accctgcgtt tctctctgtc cactaacaag cttcacacgc 1021 aacacaggga agtcggtttg acttttgtca tgaggagaac tgaccagccc tcatcattcc 1081 ccataaaacc acggacagcg tctgtgtgcg catcttgagt cttcacacct gttgactcac 1141 acggcttttg ctgatgacac ggggctccag tacacagtct gataaggact taacgtccta 1201 acctcaattg tattaaatag cattggggaa tagctaaacc tttttaaaaa aatttattgg 1261 attttcctcc ctgcttaaaa gatttcacca gaaaaccttc atataaaaat tcaggccctt 1321 tttggacaat ttttaaaatt tgtatcttta ctagaacatg agaatctttt tcccttggaa 1381 gcttgaatta taaatgtggt gtttggcctg cctcagcagc accagttgac tgctcgtgtg 1441 ccagcggtgt ggggaggacg gggcaggacg ctgcagctct ctccagccct gttggcatcc 1501 tcagtgcctg caggcctctc gctgcctctt gggctgtctg gggggtggcc atttagggat 1561 cgtggggacg gggtccaccc caagaagaaa gaaaggcccg tccacaggcc cggctctggc 1621 cacgtgcccc ggaagcaggt gtgtccagag tcagctgagg gctctcccca caccacccag 1681 caggcgctgg tgctccttct gcctcatggg accagtccag cttccagccg ctctggctcg 1741 agggtggtct gaccacttcc ttctgagtgg gcttctctgg gagctctcca gtggcactgc 1801 tggacctgcc cacgtttctg taaaatcagg atacgtggct ttagtaagca gaccaagcgc 1861 ttcgtggcag ggaaagcagc gtgcggggaa gtcactgaaa agtgctgcct aaggaagttt 1921 ggaaatagtc cccgttccag attgccttga attttaaaac attttgcttt gggaaagtag 1981 gtcagcagca cctaagatca aggatgcgtt ccattttcac acttcacagt catgaaaact 2041 gagaagactg tcttcagcgt gaactaaagt tcacaggcag atcactgatc cagaacactt 2101 caagaactcg ccaaacagct cgataagcct ttttgactgt gtacatctgt accgggaata 2161 acattcctag gctgaaattt ccacaaagaa tagaacctgt acccagttct tcaggctgat 2221 ttccctgacc tcttgggcat ttgtatttgt agtaaagtat tgcagagatt cctaagtatt 2281 ttatagcagc catcaaaatt ggactttgta ttgtttattc ataaaagaca cttggtaata 2341 gacttcagtg aactctgtat gaatgcagta gtgtgtgtgc aaaatccgct tcctgagcgt 2401 agggtgctga gctggcgcta gggctcggtt gtgaaataca gcgtagtcag cccttgcgct 2461 cagtgtagaa acccacgtct gtaaggtcgg tcttcgtcca tctgcttttt tctgaaatac 2521 actaagagca gccacaaaac tgtaacctca aggaaaccat aaagcttgga gtgccttaat 2581 ttttaaccag tttccaataa aacggtttac tacctga // LOCUS HSSH3GL1 2349 bp RNA PRI 13-MAY-1997 DEFINITION H.sapiens mRNA for protein containing SH3 domain, SH3GL1. ACCESSION X99656 NID g1869809 KEYWORDS SH3 domain; SH3GL1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2349) AUTHORS Giachino,C., Lantelme,E., Lanzetti,L., Saccone,S., Della Valle,G. and Migone,N. TITLE A novel SH3-containing human gene family preferentially expressed in the central nervous system JOURNAL Genomics 41, 527-434 (1997) REFERENCE 2 (bases 1 to 2349) AUTHORS Migone,N. TITLE Direct Submission JOURNAL Submitted (30-JUL-1996) N. Migone, University Of Torino, Dept. Of Genetics,Biology And Med.Chem., 19 Via Santena, Torino, 10126, ITALY FEATURES Location/Qualifiers source 1..2349 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="foetus" /tissue_type="brain" /chromosome="19" /map="p13.3" gene 16..1122 /gene="SH3GL1" CDS 16..1122 /gene="SH3GL1" /note="SH3-containing Grb-2-like 1" /codon_start=1 /db_xref="PID:e299513" /db_xref="PID:g1869810" /translation="MSVAGLKKQFYKASQLVSEKVGGAEGTKLDDDFKEMEKKVDVTS KAVTEVLARTIEYLQPNPASRAKLTMLNTVSKIRGQVKNPGYPQSEGLLGECMIRHGK ELGGESNFGDALLDAGESMKRLAEVKDSLDIEVKQNFIDPLQNLCEKDLKEIQHHLKK LEGRRLDFDYKKKRQGKIPDEELRQALEKFEESKEVAETSMHNLLETDIEQVSQLSAL VDAQLDYHRQAVQILDELAEKLKRRMREASSRPKREYKPKPREPFDLGEPEQSNGGFP CTTAPKIAASSSFRSSDKPIRTPSRSMPPLDQPSCKALYDFEPENDGELGFHEGDVIT LTNQIDENWYEGMLDGQSGFFPLSYVEVLVPLPQ" misc_feature 942..2001 /note="SH3 domain" polyA_signal 2331..2336 BASE COUNT 478 a 788 c 692 g 391 t ORIGIN 1 gcggcgggcg gcagcatgtc ggtggcgggg ctgaagaagc agttctacaa ggcgagccag 61 ctggtcagtg agaaggtcgg aggggccgag gggaccaagc tggatgatga cttcaaagag 121 atggagaaga aggtggatgt caccagcaag gcggtgacag aagtgctggc caggaccatc 181 gagtacctgc agcccaaccc agcctcgcgg gctaagctga ccatgctcaa cacggtgtcc 241 aagatccggg gccaggtgaa gaaccccggc tacccgcagt cggaggggct tctgggcgag 301 tgcatgatcc gccacgggaa ggagctgggc ggcgagtcca actttggtga cgcattgctg 361 gatgccggcg agtccatgaa gcgcctggca gaggtgaagg actccctgga catcgaggtc 421 aagcagaact tcattgaccc cctccagaac ctgtgcgaga aagacctgaa ggagatccag 481 caccacctga agaaactgga gggccgccgc ctggactttg actacaagaa gaagcggcag 541 ggcaagatcc ccgatgagga gctacgccag gcgctggaga agttcgagga gtccaaggag 601 gtggcagaaa ccagcatgca caacctcctg gagactgaca tcgagcaggt gagtcagctc 661 tcggccctgg tggatgcaca gctggactac caccggcagg ccgtgcagat cctggacgag 721 ctggcggaga agctcaagcg caggatgcgg gaagcttcct cacgccctaa gcgggagtat 781 aagcccaagc cccgggagcc ctttgacctt ggagagcctg agcagtccaa cgggggcttc 841 ccctgcacca cagcccccaa gatcgcagct tcatcgtctt tccgatcttc cgacaagccc 901 atccggaccc ctagccggag catgccgccc ctggaccagc cgagctgcaa ggcgctgtac 961 gacttcgagc ccgagaacga cggggagctg ggcttccatg agggcgacgt catcacgctg 1021 accaaccaga tcgatgagaa ctggtacgag ggcatgctgg acggccagtc gggcttcttc 1081 ccgctcagct acgtggaggt gcttgtgccc ctgccgcagt gactcacccg tgtccccgcc 1141 ccgcccctcc gtccacactg gccggcaccc cctgctgggt ctcctgcatt ccacggagcc 1201 cctgctgcca gggcggtgtc tgagcctgcc ggcgccacct gggccccggc ccttgaggta 1261 ctccctgagc aggaccccac acttgggtgg gggggcttat ctgggtgggt ggggatgcct 1321 gtttacacta gcgctgactc ccaacggtga cggctccctt ccccactcca tggcgccagc 1381 ctcctccccc gctccccaac ttctcgccca gctggccgag gcggggcaac actaaggtgc 1441 tcttagaaac actaatgttc ctctggggca gcccccacct ccgtcctgac ccgacggggg 1501 cccggcccac tgcctaccct cgagtcccgc agccttaaca ggatgggatc gagggtcccc 1561 atggggtggc tcagagatag gaccctggtt ttaaatccct cccagcctgg tgctggtgat 1621 gggccctggc cctactccag ggccaatgca cccccgcctc acacacgcac tccttctcct 1681 caaggccagg gcagagggcc tcaccgcctc ccgggcctgc tgtcagcttg cagcccgggg 1741 acagaggcca gctgggatct gcctgaggac agagaacatg gtctcctgca gggccctgcc 1801 tcccaagccc cgccctcaga aagccaagta ccttttcagc tttttaactg cccccatccc 1861 aacccaggga ggcctgtgtc actctggcac aagctgccac caccagccac ccacacccac 1921 cccagcacac ctcacacggg accacagccg cgctgccgag ggccaagcac aaaggttcca 1981 gtgagcgcat gtcccagccc tggtggccag gctccccttg ctgagccgct gccacttcac 2041 cctgtgggaa gtggccccag ccatctcctc tagaccaagg caggcagccc cgacatctgc 2101 ttcctctatc gcccaatgca aaatcgatga aatggggagt tctctgggcc aggccacatt 2161 cacattcccc tccctctgtg gtccagtgaa gctccggacc ccaggctctg ctctgccctg 2221 ccctgcaccc ccctcgtcag aagtacatga ggggcgcaga gatgagcaca cagctttggg 2281 cacggtccag ggcaaactga aatgtacgcc tgaattttgt aaacagaagt attaaatgtc 2341 tctttctac // LOCUS HSSH3GL2 2445 bp RNA PRI 13-MAY-1997 DEFINITION H.sapiens mRNA for protein containing SH3 domain, SH3GL2. ACCESSION X99657 NID g1869811 KEYWORDS SH3 domain; SH3GL2 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2445) AUTHORS Giachino,C., Lantelme,E., Lanzetti,L., Saccone,S., Della Valle,G. and Migone,N. TITLE A novel SH3-containing human gene family preferentially expressed in the central nervous system JOURNAL Genomics 41, 527-434 (1997) REFERENCE 2 (bases 1 to 2445) AUTHORS Migone,N. TITLE Direct Submission JOURNAL Submitted (30-JUL-1996) N. Migone, University Of Torino, Dept. Of Genetics,Biology And Med.Chem., 19 Via Santena, Torino, 10126, ITALY FEATURES Location/Qualifiers source 1..2445 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="foetus" /tissue_type="brain" /chromosome="9" /map="p22" gene 10..1068 /gene="SH3GL2" CDS 10..1068 /gene="SH3GL2" /note="SH3-containing Grb-2-like 2" /codon_start=1 /db_xref="PID:e291078" /db_xref="PID:g1869812" /translation="MSVAGLKKQFHKATQKVSEKVGGAEGTKLDDDFKEMERKVDVTS RAVMEIMTKTIEYLQPNPASRAKLSMINTMSKIRGQEKGPGYPQAEALLAEAMLKFGR ELGDDCNFGPALGEVGEAMRELSEVKDSLDIEVKQNFIDPLQNLHDKDLREIQHHLKK LEGRRLDFDYKKKRQGKIPDEELRQALEKFDESKEIAESSMFNLLEMDIEQVSQLSAL VQAQLEYHKQAVQILQQVTVRLEERIRQASSQPRREYQPKPRMSLEFPTGDSTQPNGG LSHTGTPKPSGVQMDQPCCRALYDFEPENEGELGFKEGDIITLTNQIDENWYEGMLHG HSGFFPINYVEILVALPH" misc_feature 919..1078 /note="SH3 domain" polyA_signal 2425..2430 BASE COUNT 691 a 535 c 554 g 665 t ORIGIN 1 tcctgcacca tgtcggtggc cggcctcaag aagcagttcc ataaagccac tcagaaagtg 61 agtgagaagg ttggaggagc tgaaggaacc aagctagatg atgacttcaa agagatggaa 121 aggaaagtgg atgtcaccag cagggctgtg atggaaataa tgactaaaac aattgaatac 181 cttcaaccca atccagcttc cagagctaag ctcagcatga tcaacaccat gtcaaaaatc 241 cgtggccagg agaaggggcc aggctatcct caggcagagg cgctgctggc agaggccatg 301 ctcaaatttg gaagagagct tggagatgat tgcaactttg gcccagcact tggtgaggtc 361 ggggaggcca tgcgggaact gtcggaggtc aaagactctt tggacataga agtgaagcag 421 aacttcattg accctcttca gaatcttcat gacaaagatc ttagggaaat tcaacatcat 481 ctaaagaagt tggagggtcg acgcctggat tttgattata agaagaaacg acaaggcaag 541 attccggatg aagagcttcg tcaagctcta gagaaatttg atgagtctaa ggaaattgct 601 gagtcaagca tgttcaatct cttggagatg gatattgaac aagtgagcca gctctctgca 661 cttgtgcaag ctcagctgga gtaccacaag caggcagtcc agatcctgca gcaagtcacg 721 gtcagactgg aagaaagaat aagacaggct tcatctcagc ctagaaggga atatcaacct 781 aaaccacgaa tgagcctgga gtttccaact ggagacagta ctcagcccaa tgggggtctc 841 tcccacacag gcactcccaa accttcaggt gtccaaatgg atcagccctg ctgccgagct 901 ctgtacgact ttgaacctga aaatgaaggg gagttgggat ttaaagaggg cgatatcatc 961 acactcacta accaaattga tgagaactgg tatgagggga tgctgcatgg ccattcaggc 1021 ttcttcccca tcaattatgt ggaaattctg gttgccctgc cccattagga tgttatgctg 1081 gctggctcgc ctcctcttga cccagatagt tacggttaac cactgctttg gcaatgctgc 1141 ttataacaca tcccaagtgc aggccgcagt ggtccacgtc atccagcccc accaagtgac 1201 tttggttgac ttgtgggctc ccacaggagt catggtgatg gatgatatcc tcttagcctg 1261 gtgggcgtgg catgtgcttt ttaaaacatc atctgagacc agccagtagt cacagaactg 1321 ctgtttacac agttctcagg aggctgtggt ttattagaat atgaccatga gccatttcac 1381 agaaaaacca tcccaccgaa gatattgtct atcaccccag gggccatctg aaggtctctt 1441 tgcatttctc catgcaaaga ggagaaagct tttgctttca cactgtccct tcccaaatat 1501 gtgagtcatg gaattgtcaa agtaagcctt ccctcaccag caaattgtct cctgatctga 1561 atgaatttgt ctcttaatgc atccatagaa aagtgttaat tgtgggttca aagcattctc 1621 tgcaaatagg catctcagct cctcacactt atggctattt ctgacgtata gccagttttc 1681 ttccctcctt gctattaaag ccagagtcgt aattccaaat tatttttcag taagacagtt 1741 aatcagcatt attgtgagag ggactgaaaa gaaattctcc attatgagga attgggaaga 1801 aatctggtat ccaagcttaa atttcttgct atacagaact atgtatgtat ttaggctatt 1861 ctgaagggca cagggaaggg gaacaaatat cttcacttca gttttatttg tgaattacat 1921 gtttcatgaa tccatttggc acagagacac aaggaagaaa acactagtaa ccaactttcc 1981 actagttcat atactgagaa acagtaaata cctttccttt ccacttttac cctgtgttct 2041 ttgaacatca tttgtgcaga ttctgccctc aatgaggacc aaataaagat gatttttgtg 2101 cttagcagtt taaggtatat ggctggcata tgcaaaactc tttcccaatt cagtcgctac 2161 ttttacttct gccctttcta tccatcgtct tcattttgtg tgtacagtgc tgtgtgtaag 2221 cttatcagtg tgttttttta tttgtatcag tcatgaaagt cctgttaggt atccagagtt 2281 ctatttatct agctgtacag actctttcag aggtttaacg tgctgcttcc gatgtgccac 2341 ctgcagtagt ggatcatgtg gagtgaaagg caaatcttac tgcttaatgt ataaactctc 2401 accacaggaa gcatcgctgt ttccaataaa tattgctgaa gacag // LOCUS HSSH3GL3 1500 bp RNA PRI 13-MAY-1997 DEFINITION H.sapiens mRNA for protein containing SH3 domain, SH3GL3. ACCESSION X99664 NID g1869813 KEYWORDS SH3 domain; SH3GL3 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1500) AUTHORS Giachino,C., Lantelme,E., Lanzetti,L., Saccone,S., Della Valle,G. and Migone,N. TITLE A novel SH3-containing human gene family preferentially expressed in the central nervous system JOURNAL Genomics 41, 527-434 (1997) REFERENCE 2 (bases 1 to 1500) AUTHORS Migone,N. TITLE Direct Submission JOURNAL Submitted (30-JUL-1996) N. Migone, University Of Torino, Dept. Of Genetics,Biology And Med.Chem., 19 Via Santena, Torino, 10126, ITALY REMARK Revised by author 17-JAN-1997 FEATURES Location/Qualifiers source 1..1500 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="foetus" /tissue_type="brain" /chromosome="15" /map="q24" gene 1..1044 /gene="SH3GL3" CDS 1..1044 /gene="SH3GL3" /note="SH3-containing Grb-2-like 3" /codon_start=1 /db_xref="PID:e291118" /db_xref="PID:g1869814" /translation="MSVAGLKKQFHKASQLFSEKISGAEGTKLDDEFLDMERKIDVTN KVVAEILSKTTEYLQPNPAYRAKLGMLNTVSKIRGQVKTTGYPQTEGLLGDCMLKYGK ELGEDSTFGNALIEVGESMKLMAEVKDSLDINVKQTFIDPLQLLQDKDLKEIGHHLKK LEGRRLDYDYKKKRVGKIPDEEVRQAVEKFEESKELAERSMFNFLENDVEQVSQLAVF IEAALDYHRQSTEILQELQSKLQMRISAASSVPRREYKPRPVKRSSSELNGVSTTSVV KTTGSNIPMDQPCCRGLYDFEPENQGELGFKEGDIITLTNQIDENWYEGMIHGESGFF PINYVEVIVPLPQ" misc_feature 864..1023 /gene="SH3GL3" /note="SH3 domain" polyA_signal 1475..1480 BASE COUNT 475 a 284 c 333 g 408 t ORIGIN 1 atgtcggtgg ccgggctgaa gaagcagttc cacaaagcca gccagctatt tagtgaaaaa 61 ataagtggtg ctgaaggaac taaactagac gatgaatttc ttgacatgga aaggaaaata 121 gatgttacca ataaagttgt tgcagaaatt ctttcaaaaa ccactgaata tcttcagcca 181 aatccagcat acagagctaa gctaggaatg ctgaacactg tgtcgaagat ccgagggcag 241 gtgaagacca caggataccc gcagacggaa ggcttgctgg gggactgtat gctgaaatac 301 gggaaggagc tcggggaaga ctccaccttt ggcaatgcat tgatagaagt tggtgaatcc 361 atgaagctaa tggctgaggt gaaagactct cttgatatta atgtaaagca aacttttatt 421 gacccacttc agttactaca agataaagat ttaaaagaga tcgggcatca cctgaaaaag 481 ctggaaggcc gccgcctgga ttacgattat aaaaagaaac gagtaggtaa gataccagac 541 gaagaagtca gacaagcggt agaaaaattt gaagagtcaa aggagttggc tgaaagaagc 601 atgtttaact ttttagaaaa tgatgtagaa caagtcagcc agttggctgt gttcatagag 661 gcagcattag actatcacag acagtccaca gagattctgc aggagctgca gagcaagcta 721 cagatgcgaa tatcagctgc atccagtgtc cccagacgag aatacaagcc aaggcctgtg 781 aaaaggagtt ctagtgagct caatggagtt tccaccacct ctgtagtgaa gacgacaggt 841 tctaacattc ccatggacca gccctgctgt cgtggtctct atgactttga gccagaaaac 901 caaggagaat taggatttaa agaaggggac atcattacat taaccaatca aatagatgaa 961 aactggtatg aaggaatgat acacggagaa tcgggattct tccccattaa ttacgtggaa 1021 gtgatcgtgc ctttacctca gtaaatgtgt aacacaaact ctggacatac tttcgtaact 1081 gaaatgaatt cacaccagtg tgctctcagt gcggtgttct gtgacatcct ttgctctctg 1141 accaacttaa tgacttttgt atgtgtgctc tctttataat gtattttata tcactttaat 1201 ttgtataaat gattttcttg tccgtgctac atgaaaatat tgttttcttt tttgcttcct 1261 gtcctaaaag tcattggtta aatgtatttg cttcctgtgg ctaaaaataa gtctcaccca 1321 ttgcagttat gtcaacgaat ggcctatatt cctcagctgc aatgaaatgg taacatttga 1381 aactaagaaa tgctaaatat tttgtttctc gacattcctg atgacgtctg gtcttttctt 1441 ttcattgtat tttaagctta cctgtgaata gcccaataaa catgacacaa ctgtgttggc // LOCUS HSSHB 2306 bp RNA PRI 10-FEB-1994 DEFINITION H.sapiens SHB mRNA. ACCESSION X75342 NID g406737 KEYWORDS src homology 2. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2306) AUTHORS Welsh,M., Mares,J., Karlsson,T., Lavergne,C., Breant,B. and Claesson-Welsh,L. TITLE Shb is a ubiquitously expressed Src homology 2 protein JOURNAL Oncogene 9 (1), 19-27 (1994) MEDLINE 94134414 REFERENCE 2 (bases 1 to 2306) AUTHORS Welsh,M. TITLE Direct Submission JOURNAL Submitted (04-OCT-1993) M. Welsh, Dept. Med. Cell. Biol., Uppsala University, P.O. Box 571, Biomedicum, S-75123 Uppsala, SWEDEN FEATURES Location/Qualifiers source 1..2306 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="brain" /cell_type="fibroblasts" /clone_lib="lambda gt10" CDS 311..2101 /codon_start=1 /evidence=experimental /product="Shb" /db_xref="PID:g406738" /translation="MRRAHEGREIPSLGGARRREVLQAGRSQRAAGRRRRRQELELGV GSGRPGGPPPGPGRRGTCAAALPPEWPRRRTGLPRRGPRPPLAMAKWLNKYFSLGNSK TKSPPQPPRPDYREQRRRGERPSQPPQAVPQASSAASASCGPATASCFSASSGSLPDD SGSTSDLIRAYRAQKERHFQDPYNGPGSSLRKLRAMCRLDYCGGSGEPGGVQRAFSAS SASGAAGCCCASSGAGAAASSSSSSGSPHLYRSSSERRPATPAEVRYISPKHRLIKVE SAAGGGAGDPLGGACAGGRTWSPTACGGKKLLNKCAASAAEESGAGKKDKVTIADDYS DPFDAKNDLKSKAGKGESAGYMEPYEAQRIMTEFQRQESVRSQHKGIQLYDTPYEPEG QSVDSDSESTVSPRLRESKLPQDDDRPADEYDQPWEWNRVTSPALAAQFNGNEKRQSS PSPSRDRRRQLRAPGGGFKPIKHGSPEFCGILGERVDPAVPLEKQIWYHGAISRGDAE NLLRLCKECSYLVRNSQTSKHDYPLSLRSNQGFMHMKLAKTKEKYVLGQNSPPFDSVP EVIHYYTTRKLPIKGAEHLSLLYPVAVRTL" BASE COUNT 429 a 788 c 767 g 322 t ORIGIN 1 cgggccgccg ggacgggcac gggcgcgcgg gctccggcgg gcgccggctg ccttcctccg 61 tcgctcgctg tctctcccgg ccgcattctc ctccgctgcg gggccgagct ctccccagcg 121 ctcgcaggaa ggaagaaggg agccgaggac gccgagaagt tcccgcggca gccgcggatc 181 ccggccaagg cggaggctgc ggctccgacg gggcaggagc gcgatccacg gcgaggggcg 241 tacggccaaa gggtccgcgg cgtggagcgc tcggaccttc cgctctcccc cgggcgtggg 301 ccgggacccc atgagacgcg cccacgaggg gcgcgagatt cctagcttgg gcggcgctag 361 gcggagggag gtgttgcagg ccggccggag ccagagagct gccggcagga ggcggcggcg 421 gcaagaactt gaacttggcg tcgggagcgg gcgccccgga ggccccccgc cggggccggg 481 gcgccgaggg acctgcgccg cagcgctgcc ccccgaatgg ccgcggcggc ggaccgggct 541 cccgcgccgc ggccctaggc cgcctctcgc catggccaag tggctaaaca agtacttcag 601 cttgggcaac agcaagacca agagcccccc gcagccgccg cggccagact accgcgagca 661 gcggcgccga ggcgagcggc cttcgcagcc cccccaggcc gtgccgcagg cctcctccgc 721 cgcctcggcg tcctgcggtc cggccaccgc ctcctgcttc tcagcctctt cgggctcgct 781 gcccgacgac agcggcagca ccagcgacct catccgcgcc taccgcgcgc agaaggagcg 841 acacttccag gacccctaca acgggcctgg ctcgtcgctg cgcaaactgc gcgccatgtg 901 ccgcctggac tactgcggcg gcagcgggga gccaggcggg gtccagcgcg ccttctcggc 961 ctcgtccgcg tcgggcgccg cgggctgttg ctgcgcctcc tcgggcgcgg gcgccgccgc 1021 gtcctcgtcc tcgtcctccg gctctccgca tctctaccgc agcagcagcg agcggcggcc 1081 cgccacgccg gccgaggtgc gctacatctc ccccaagcac cgcctcatca aagtggagag 1141 cgccgcgggc ggtggggccg gggaccccct ggggggcgcc tgcgcgggcg gccgcacctg 1201 gagcccgacg gcctgcggag gcaagaaact gctcaacaag tgcgccgcct cagccgcgga 1261 ggagagcggg gccggcaaga aggacaaggt gaccatagcc gatgactact cagatccctt 1321 tgatgccaag aatgatctca agagcaaagc aggaaagggg gagagtgctg gctacatgga 1381 gccctatgag gcacagagga tcatgacaga atttcagagg caggaaagtg tccggtccca 1441 gcataaaggt atccagttat atgacacccc ttacgaacct gaaggccaaa gtgttgactc 1501 ggactcggag agcacagtca gcccccgact gcgggagagc aagctgcccc aggatgacga 1561 caggcccgcc gatgagtacg accagccttg ggagtggaac cgggtcacca gcccagccct 1621 ggcagcacag tttaatggca acgagaagcg gcagtcatcc ccctcacctt cgcgggaccg 1681 gcggcgccag cttcgtgccc ctggaggggg ctttaagcct atcaaacatg ggagccctga 1741 gttctgcggg atcctaggag aaagggtgga tcctgccgtc cccctggaga agcaaatatg 1801 gtatcacgga gccatcagca gaggagacgc cgagaacctg ctgcgactct gcaaggagtg 1861 tagctacctt gtccggaaca gccagaccag caagcatgac taccccctct ccctgaggag 1921 caaccagggt tttatgcaca tgaaactggc caaaaccaaa gagaaatacg ttctgggtca 1981 gaacagccct ccgttcgaca gtgtcccgga agtcatccac tactacacca ccagaaagct 2041 acccatcaaa ggggctgagc acttgtccct cctctatccc gtggctgtga ggaccctgtg 2101 agcggaccag acctgccctg ctctgtgaca gagcctggag acttggaggt gccagaggcc 2161 ccccaccaac cagcccccag ccactgttgc tggctgtgtc gtttgtgttg tgtgtatggt 2221 actagcacac cactgcatgt ctctagaatg ctgttgccac ttacgggggc tggaggcctg 2281 gataaagaca gaagggcggc aacacc // LOCUS HSSHC 3031 bp RNA PRI 17-NOV-1992 DEFINITION H.sapiens SHC mRNA. ACCESSION X68148 NID g36453 KEYWORDS SHC protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3031) AUTHORS Pelicci,P. TITLE Direct Submission JOURNAL Submitted (10-JUN-1992) P. Pelicci, Clinica Medica I, Policlinico Monteluce, Perugia 06100 08854, ITALY REFERENCE 2 (bases 1 to 3031) AUTHORS Pelicci,G., Lanfrancone,L., Grignani,F., McGlade,J., Cavallo,F., Forni,G., Nicoletti,I., Grignani,F., Pawson,T. and Pelicci,P.G. TITLE A novel transforming protein (SHC) with an SH2 domain is implicated in mitogenic signal transduction JOURNAL Cell 70 (1), 93-104 (1992) MEDLINE 92323554 FEATURES Location/Qualifiers source 1..3031 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 82..1503 /codon_start=1 /product="SHC transforming protein" /db_xref="PID:g36454" /db_xref="SWISS-PROT:P29353" /translation="MNKLSGGGGRRTRVEGGQLGGEEWTRHGSFVNKPTRGWLHPNDK VMGPGVSYLVRYMGCVEVLQSMRALDFNTRTQVTREAISLVCEAVPGAKGATRRRKPC SRPLSSILGRSNLKFAGMPITLTVSTSSLNLMAADCKQIIANHHMQSISFASGGDPDT AEYVAYVAKDPVNQRACHILECPEGLAQDVISTIGQAFELRFKQYLRNPPKLVTPHDR MAGFDGSAWDEEEEEPPDHQYYNDFPGKEPPLGGVVDMRLREGAAPGAARPTAPNAQT PSHLGATLPVGQPVGGDPEVRKQMPPPPPCPGRELFDDPSYVNVQNLDKARQAVGGAG PPNPAINGSAPRDLFDMKPFEDALRVPPPPQSVSMAEQLRGEPWFHGKLSRREAEALL QLNGDFLVRESTTTPGQYVLTGLQSGQPKHLLLVDPEGVVRTKDHRFESVSHLISYHM DNHLPIISAGSELCLQQPVERKL" BASE COUNT 664 a 855 c 809 g 703 t ORIGIN 1 gcggtaacct aagctggcag tggcgtgatc cggcaccaaa tcggcccgcg gtgcgtgcgg 61 agactccatg aggccctgga catgaacaag ctgagtggag gcggcgggcg caggactcgg 121 gtggaagggg gccagcttgg gggcgaggag tggacccgcc acgggagctt tgtcaataag 181 cccacgcggg gctggctgca tcccaacgac aaagtcatgg gacccggggt ttcctacttg 241 gttcggtaca tgggttgtgt ggaggtcctc cagtcaatgc gtgccctgga cttcaacacc 301 cggactcagg tcaccaggga ggccatcagt ctggtgtgtg aggctgtgcc gggtgctaag 361 ggggcgacaa ggaggagaaa gccctgtagc cgcccgctca gctctatcct ggggaggagt 421 aacctgaaat ttgctggaat gccaatcact ctcaccgtct ccaccagcag cctcaacctc 481 atggccgcag actgcaaaca gatcatcgcc aaccaccaca tgcaatctat ctcatttgca 541 tccggcgggg atccggacac agccgagtat gtcgcctatg ttgccaaaga ccctgtgaat 601 cagagagcct gccacattct ggagtgtccc gaagggcttg cccaggatgt catcagcacc 661 attggccagg ccttcgagtt gcgcttcaaa caatacctca ggaacccacc caaactggtc 721 acccctcatg acaggatggc tggctttgat ggctcagcat gggatgagga ggaggaagag 781 ccacctgacc atcagtacta taatgacttc ccggggaagg aacccccctt ggggggggtg 841 gtagacatga ggcttcggga aggagccgct ccaggggctg ctcgacccac tgcacccaat 901 gcccagaccc ccagccactt gggagctaca ttgcctgtag gacagcctgt tgggggagat 961 ccagaagtcc gcaaacagat gccacctcca ccaccctgtc caggcagaga gctttttgat 1021 gatccctcct atgtcaacgt ccagaaccta gacaaggccc ggcaagcagt gggtggtgct 1081 gggcccccca atcctgctat caatggcagt gcaccccggg acctgtttga catgaagccc 1141 ttcgaagatg ctcttcgggt gcctccacct ccccagtcgg tgtccatggc tgagcagctc 1201 cgaggggagc cctggttcca tgggaagctg agccggcggg aggctgaggc actgctgcag 1261 ctcaatgggg acttcttggt acgggagagc acgaccacac ctggccagta tgtgctcact 1321 ggcttgcaga gtgggcagcc taagcatttg ctactggtgg accctgaggg tgtggttcgg 1381 actaaggatc accgctttga aagtgtcagt caccttatca gctaccacat ggacaatcac 1441 ttgcccatca tctctgcggg cagcgaactg tgtctacagc aacctgtgga gcggaaactg 1501 tgatctgccc tagcgctctc ttccagaaga tgccctccaa tcctttccac cctattccct 1561 aactctcggg acctcgtttg ggagtgttct gtgggcttgg ccttgtgtca gagctgggag 1621 tagcatggac tctgggtttc atatccagct gagtgagagg gtttgagtca aaagcctggg 1681 tgagaatcct gcctctcccc aaacattaat caccaaagta ttaatgtaca gagtggcccc 1741 tcacctgggc ctttcctgtg ccaacctgat gccccttccc caagaaggtg agtgcttgtc 1801 atggaaaatg tcctgtggtg acaggcccag tggaacagtc acccttctgg gcaaggggga 1861 acaaatcaca cctctgggct tcagggtatc ccagacccct ctcaacaccc gcccccccca 1921 tgtttaaact ttgtgccttt gaccatctct taggtctaat gatattttat gcaaacagtt 1981 cttggacccc tgaattcttc aatgacaggg atgccaacac cttcttggct tctgggacct 2041 gtgttcttgc tgagcaccct ctccggtttg ggttgggata acagaggcag gagtggcagc 2101 tgtcccctct ccctggggat atgcaaccct tagagattgc cccagagccc cactcccggc 2161 caggcgggag atggacccct cccttgctca gtgcctcctg gccggggccc ctcaccccaa 2221 ggggtctgta tatacatttc ataaggcctg ccctcccatg ttgcatgcct atgtactctg 2281 cgccaaagtg cagcccttcc tcctgaagcc tctgccctgc ctccctttct gggagggcgg 2341 ggtgggggtg actgaatttg ggcctcttgt acagttaact ctcccaggtg gattttgtgg 2401 aggtgagaaa aggggcattg agactataaa gcagtagaca atccccacat accatctgta 2461 gagttggaac tgcattcttt taaagtttta tatgcatata ttttagggct gctagactta 2521 ctttcctatt ttcttttcca ttgcttattc ttgagcacaa aatgataatc aattattaca 2581 tttatacatc acctttttga cttttccaag cccttttaca gctcttggca ttttcctcgc 2641 ctaggcctgt gaggtaactg ggatcgcacc ttttatacca gagacctgag gcagatgaaa 2701 tttatttcca tctaggacta gaaaaacttg ggtctcttac cgcgagactg agaggcagaa 2761 gtcagcccga atgcctgtca gtttcatgga ggggaaacgc aaaacctgca gttcctgagt 2821 accttctaca ggcccggccc agcctaggcc cggggtggcc acaccacagc aagccggccc 2881 cccctctttt ggccttgtgg ataagggaga gttgaccgtt ttcatcctgg cctccttttg 2941 ctgtttggat gtttccacgg gtctcactta taccaaaggg aaaactcttc attaaagtcc 3001 cgtatttctt ctaaaaaaaa aaaaaaaaaa a // LOCUS HSSHOXA 1892 bp RNA PRI 12-DEC-1997 DEFINITION H.sapiens mRNA for SHOXa protein. ACCESSION Y11536 NID g2463202 KEYWORDS SHOXa gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1892) AUTHORS Rao,E., Weiss,B., Fukami,M., Rump,A., Niesler,B., Mertz,A., Muroya,K., Binder,G., Kirsch,S., Winkelmann,M., Nordsiek,G., Heinrich,U., Breuning,M.H., Ranke,M.B., Rosenthal,A., Ogata,T. and Rappold,G.A. TITLE Pseudoautosomal deletions encompassing a novel homeobox gene cause growth failure in idiopathic short stature and Turner syndrome JOURNAL Nature Genet. 16 (1), 54-63 (1997) MEDLINE 97285122 REFERENCE 2 (bases 1 to 1892) AUTHORS Rao,E. TITLE Direct Submission JOURNAL Submitted (27-FEB-1997) E. Rao, Institut fuer Humangenetik, INF 328, 69120 Heidelberg, FRG FEATURES Location/Qualifiers source 1..1892 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="Y" /germline /dev_stage="adult" /tissue_type="skeletal muscle" /map="Xp22.3" /map="Yp11.3" gene 92..970 /gene="SHOXa" CDS 92..970 /gene="SHOXa" /codon_start=1 /product="SHOXa protein" /db_xref="PID:e307043" /db_xref="PID:g2463203" /translation="MEELTAFVSKSFDQKSKDGNGGGGGGGGKKDSITYREVLESGLA RSRELGTSDSSLQDITEGGGHCPVHLFKDHVDNDKEKLKEFGTARVAEGIYECKEKRE DVKSEDEDGQTKLKQRRSRTNFTLEQLNELERLFDETHYPDAFMREELSQRLGLSEAR VQVWFQNRRAKCRKQENQMHKGVILGTANHLDACRVAPYVNMGALRMPFQQVQAQLQL EGVAHAHPHLHPHLAAHAPYLMFPPPPFGLPIASLAESASAAAVVAAAAKSNSKNSSI ADLRLKARKHAEALGL" misc_feature 440..619 /gene="SHOXa" /note="homeodomain" BASE COUNT 401 a 580 c 567 g 343 t 1 others ORIGIN 1 gtgatccacc cgcgcgcacg ggccgtcctc tccgcgcggg gagacgcgcg catccaccag 61 ccccggctgc tcgccagccc cggccccagc catggaagag ctcacggctt ttgtatccaa 121 gtcttttgac cagaaaagca aggacggtaa cggcggaggc ggaggcggcg gaggtaagaa 181 ggattccatt acgtaccggg aagttttgga gagcggactg gcgcgctccc gggagctggg 241 gacgtcggat tccagcctcc aggacatcac ggagggcggc ggccactgcc cggtgcattt 301 gttcaaggac cacgtagaca atgacaagga gaaactgaaa gaattcggca ccgcgagagt 361 ggcagaaggg atttatgaat gcaaagagaa gcgcgaggac gtgaagtcgg aggacgagga 421 cgggcagacc aagctgaaac agaggcgcag ccgcaccaac ttcacgctgg agcagctgaa 481 cgagctcgag cgactcttcg acgagaccca ttaccccgac gccttcatgc gcgaggagct 541 cagccagcgc ctggggctct ccgaggcgcg cgtgcaggtt tggttccaga accggagagc 601 caagtgccgc aaacaagaga atcagatgca taaaggcgtc atcttgggca cagccaacca 661 cctagacgcc tgccgagtgg caccctacgt caacatggga gccttacgga tgcctttcca 721 acaggtccag gctcagctgc agctggaagg cgtggcccac gcgcacccgc acctgcaccc 781 gcacctggcg gcgcacgcgc cctacctgat gttccccccg ccgcccttcg ggctgcccat 841 cgcgtcgctg gccgagtccg cctcggccgc cgccgtggtc gccgccgccg ccaaaagcaa 901 cagcaagaat tccagcatcg ccgacctgcg gctcaaggcg cggaagcacg cggaggccct 961 ggggctctga cccgccgcgc agccccccgc gcgcccggac tcccgggctc cgcgcacccc 1021 gcctgcaccg cgcgtcctgc actcaacccc gcctggagct ccttccgcgg ccaccgtgct 1081 ccgggcaccc cgggagctcc tgcaagaggc ctgaggaggg aggctcccgg gaccgtccac 1141 gcacgaccca gccagaccct cgcggagatg gtgcagaagg cggagcgggt gagcggccgt 1201 gcgtccagcc cgggcctctc caaggctgcc cgtgcgtcct gggaccctgg agaagggtaa 1261 acccccgcct ggctgcgtct tcctctgcta taccctatgc atgcggttaa ctacacacgt 1321 ttggaagatc cttagagtct attgaaactg caaagatccc ggagctggtc tccgatgaaa 1381 atgccatttc ttcgttgcca acgattttct ttactaccat gctccttcct tcatcccgag 1441 aggctgcgga acgggtgtgg atttgaatgt ggacttcgga atcccaggag gcaggggccg 1501 ggctctcctc caccgctccc ccggagcctc ccaggcagca ataaggaaat agttctctgg 1561 ctgaggctga ggacgtgaac cgcgggcttt ggaaagggag gggagggaga cccgaacctc 1621 ccacgttggg actcccacgt tccggggacc tgaatgagga ccgactttat aacttttcca 1681 gtgtttgatt cccaaattgg gtctggtttt gttttggatt ggtatttttt tttttttttt 1741 tttttgctgt gttacaggat tcagacgcaa aagacttgca taagagacgg acgcgtggtt 1801 gcaaggtgtc atactgatat gcagcattaa ctttactgac atggagtgaa gtgcaatatt 1861 ataaatatta tagattaaaa aaaaaatagc an // LOCUS HSSHPH20 1919 bp RNA PRI 01-AUG-1996 DEFINITION H.sapiens mRNA for sperm adhesion molecule hPH-20. ACCESSION X84347 NID g1480100 KEYWORDS SPAM1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1919) AUTHORS Jones,M.H., Davey,P.M., Aplin,H. and Affara,N.A. TITLE Expression analysis, genomic structure, and mapping to 7q31 of the human sperm adhesion molecule gene SPAM1 JOURNAL Genomics 29 (3), 796-800 (1995) MEDLINE 96121399 REFERENCE 2 (bases 1 to 1919) AUTHORS Jones,M.H. TITLE Direct Submission JOURNAL Submitted (01-FEB-1995) M.H. Jones, University of Cambridge, Dept of Pathology, Tennis Court Road, CB2 1QP, UK FEATURES Location/Qualifiers source 1..1919 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="clontech testis library cat no H1010b" /chromosome="7" /tissue_type="testis" gene 297..1826 /gene="SPAM1" CDS 297..1826 /gene="SPAM1" /codon_start=1 /product="sperm adhesion molecule gene SPAM1" /db_xref="PID:e256805" /db_xref="PID:g1480101" /translation="MGVLKFKHIFFRSFVKSSGVSQIVFTFLLIPCCLTLNFRAPPVI PNVPFLWAWNAPSEFCLGKFDEPLDMSLFSFIGSPRINATGQGVTIFYVDRLGYYPYI DSITGVTVNGGIPQKISLQDHLDKAKKDITFYMPVDNLGMAVIDWEEWRPTWARNWKP KDVYKNRSIELVQQQNVQLSLTEATEKAKQEFEKAGKDFLVETIKLGKLLRPNHLWGY YLFPDCYNHHYKKPGYNGSCFNVEIKRNDDLSWLWNESTALYPSIYLNTQQSPVAATL YVRNRVREAIRVSKIPDAKSPLPVFAYTRIVFTDQVLKFLSQDELVYTFGETVALGAS GIVIWGTLSIMRSMKSCLLLDNYMETILNPYIINVTLAAKMCSQVLCQEQGVCIRKNW NSSDYLHLNPDNFAIQLEKGGKFTVRGKPTLEDLEQFSEKFYCSCYSTLSCKEKADVK DTDAVDVCIADGVCIDAFLKPPMETEEPQIFYNASPSTLSATMFIVSILFLIISSVAS L" polyA_site 1900..1919 BASE COUNT 608 a 364 c 374 g 573 t ORIGIN 1 ctagccaatg ctctaggaag acattgagac cagccaactt cttgccttga taactactga 61 agagacattg ggtggctgga ttttgaaagc agacttctgg ttataggtga tacaacttga 121 aaaacaatcc tgaaacatga aacaagaata ataatattta aatctaactt aatcattata 181 cctctttatc catcaaagtg attccctttc atctgtgctc atactttgca tcagatattg 241 ggtaaaccaa agtgtgtagg aagaaataaa tgttttcata gtcattactc tttacaatgg 301 gagtgctaaa attcaagcac atctttttca gaagctttgt taaatcaagt ggagtatccc 361 agatagtttt caccttcctt ctgattccat gttgcttgac tctgaatttc agagcacctc 421 ctgttattcc aaatgtgcct ttcctctggg cctggaatgc cccaagtgaa ttttgtcttg 481 gaaaatttga tgagccacta gatatgagcc tcttctcttt cataggaagc ccccgaataa 541 acgccaccgg gcaaggagtt acaatatttt atgttgatag acttggctac tatccttaca 601 tagattcaat cacaggagta actgtgaatg gaggaatccc ccagaagatt tccttacaag 661 accatctgga caaagctaag aaagacatta cattttatat gccagtagac aatttgggaa 721 tggctgttat tgactgggaa gaatggagac ccacttgggc aagaaactgg aaacctaaag 781 atgtttacaa gaataggtct attgaattgg ttcagcaaca aaatgtacaa cttagtctca 841 cagaggccac tgagaaagca aaacaagaat ttgaaaaggc agggaaggat ttcctggtag 901 agactataaa attgggaaaa ttacttcggc caaatcactt gtggggttat tatctttttc 961 cggattgtta caaccatcac tataagaaac ccggttacaa tggaagttgc ttcaatgtag 1021 aaataaaaag aaatgatgat ctcagctggt tgtggaatga aagcactgct ctttacccat 1081 ccatttattt gaacactcag cagtctcctg tagctgctac actctatgtg cgcaatcgag 1141 ttcgggaagc catcagagtt tccaaaatac ctgatgcaaa aagtccactt ccggtttttg 1201 catatacccg catagttttt actgatcaag ttttgaaatt cctttctcaa gatgaacttg 1261 tgtatacatt tggcgaaact gttgctctgg gtgcttctgg aattgtaata tggggaaccc 1321 tcagtataat gcgaagtatg aaatcttgct tgctcctaga caattacatg gagactatac 1381 tgaatcctta cataatcaac gtcacactag cagccaaaat gtgtagccaa gtgctttgcc 1441 aggagcaagg agtgtgtata aggaaaaact ggaattcaag tgactatctt cacctcaacc 1501 cagataattt tgctattcaa cttgagaaag gtggaaagtt cacagtacgt ggaaaaccga 1561 cacttgaaga cctggagcaa ttttctgaaa aattttattg cagctgttat agcaccttga 1621 gttgtaagga gaaagctgat gtaaaagaca ctgatgctgt tgatgtgtgt attgctgatg 1681 gtgtctgtat agatgctttt ctaaaacctc ccatggagac agaagaacct caaattttct 1741 acaatgcttc accctccaca ctatctgcca caatgttcat tgttagtatt ttgtttctta 1801 tcatttcttc tgtagcgagt ttgtaattgc gcaggttagc tgaaatgaac aatatgtcca 1861 tcttaaagtg tgctttttcg actaattaaa tctttgaaaa gaaaaaaaaa aaaaaaaaa // LOCUS HSSIAH2 975 bp RNA PRI 04-DEC-1997 DEFINITION Homo sapiens mRNA for Siah2 protein. ACCESSION Y15268 NID g2664282 KEYWORDS siah-2 protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 975) AUTHORS Germani,A. and Romero,F. JOURNAL Unpublished REFERENCE 2 (bases 1 to 975) AUTHORS Germani,A. TITLE Direct Submission JOURNAL Submitted (27-OCT-1997) A. Germani, Institut Cochin de Genetique Moleculaire, U363 INSERM, Hopital Cochin, 27 rue du Faubourg Saint-Jacques, 75014 Paris, FRANCE FEATURES Location/Qualifiers source 1..975 /organism="Homo sapiens" /strain="Jurkat" /db_xref="taxon:9606" /cell_type="T cells" /clone_lib="pGAD-v240" gene 1..975 /gene="siah2" CDS 1..975 /gene="siah2" /codon_start=1 /product="Siah2 protein" /db_xref="PID:e1202710" /db_xref="PID:g2664283" /translation="MSRPSSTGPSANKPCSKQPPPQPQHTPSPAAPPAAATISAAGPG SSAVPAAAAVISGPGGGGGAGPVSPQHHELTSLFECPVCFDYVLPPILQCQAGHLVCN QCRQKLSCCPTCRGALTPSIRNLAMEKVASAVLFPCKYATTGCSLTLHHTEKPEHEDI CEYRPYSCPCPGASCKWQGSLEAVMSHLMHAHKSITTLQGEDIVFLATDINLPGAVDW VMMQSCFGHHFMLVLEKQEKYEGHQQFFAIVLLIGTRKQAENFAYRLELNGNRRRLTW EATPRSIHDGVAAAIMNSDCLVFDTAIAHLFADNGNLGINVTISTCCP" BASE COUNT 188 a 316 c 271 g 200 t ORIGIN 1 atgagccgcc cgtcctccac cggccccagc gctaataaac cctgcagcaa gcagccgccg 61 ccgcagcccc agcacactcc gtccccggct gcgcccccgg ccgccgccac catctcggct 121 gcgggccccg gctcgtccgc ggtgcccgcc gcggcggcgg tgatctcggg ccccggcggc 181 ggcggcgggg ccggcccggt gtccccgcag caccacgagc tgacctcgct cttcgagtgt 241 ccggtctgct ttgactatgt cctgcctcct attctgcagt gccaggccgg gcacctggtg 301 tgtaaccaat gccgccagaa gttgagctgc tgcccgacgt gcaggggcgc cctgacgccc 361 agcatcagga acctggctat ggagaaggtg gcctcggcag tcctgtttcc ctgtaagtat 421 gccaccacgg gctgttccct gaccctgcac catacggaga aaccagaaca tgaagacata 481 tgtgaatacc gtccctactc ctgcccatgt cctggtgctt cctgcaagtg gcaggggtcc 541 ctggaagctg tgatgtccca tctcatgcac gcccacaaga gcattaccac ccttcaggga 601 gaagacatcg tctttctagc tacagacatt aacttgccag gggctgtcga ctgggtgatg 661 atgcagtcat gttttggcca tcacttcatg ctggtgctgg agaaacaaga gaagtacgaa 721 ggccaccagc agttttttgc catcgtcctg ctcattggca cccgcaagca agccgagaac 781 tttgcctaca gactggagtt gaatgggaac cggcggagat tgacctggga ggccacgccc 841 cgttcgattc atgacggtgt ggctgcggcc atcatgaaca gcgactgcct tgttttcgac 901 acagccatag cacatctttt tgcagataat gggaaccttg gaatcaatgt tactatttct 961 acatgttgtc catga // LOCUS HSSIATR 2180 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for sialyltransferase (CMP-N-acetylneuraminate-beta-galactoside alpha-2,6-sialyltransferase) (EC 2.4.99.1). ACCESSION X17247 NID g36461 KEYWORDS beta-galactoside alpha 2,6-sialyltransferase; sialyltransferase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2180) AUTHORS Grundmann,U.G. TITLE Direct Submission JOURNAL Submitted (03-JAN-1990) Grundmann U.G., Department of Molecular Biology, Research Institutes, Behringwerke AG, Postfach 1140, 3550 Marburg REFERENCE 2 (bases 1 to 2180) AUTHORS Grundmann,U., Nerlich,C., Rein,T. and Zettlmeissl,G. TITLE Complete cDNA sequence encoding human beta-galactoside alpha-2,6-sialyltransferase JOURNAL Nucleic Acids Res. 18 (3), 667 (1990) MEDLINE 90175005 FEATURES Location/Qualifiers source 1..2180 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="lambdagt10" CDS 432..1652 /note="sialyltransferase (AA 1-406)" /codon_start=1 /db_xref="PID:g36462" /db_xref="SWISS-PROT:P15907" /translation="MIHTNLKKKFSCCVLVFLLFAVICVWKEKKKGSYYDSFKLQTKE FQVLKSLGKLAMGSDSQSVSSSSTQDPHRGRQTLGSLRGLAKAKPEASFQVWNKDSSS KNLIPRLQKIWKNYLSMNKYKVSYKGPGPGIKFSAEALRCHLRDHVNVSMVEVTDFPF NTSEWEGYLPKESIRTKAGPWGRCAVVSSAGSLKSSQLGREIDDHDAVLRFNGAPTAN FQQDVGTKTTIRLMNSQLVTTEKRFLKDSLYNEGILIVWDPSVYHSDIPKWYQNPDYN FFNNYKTYRKLHPNQPFYILKPQMPWELWDILQEISPEEIQPNPPSSGMLGIIIMMTL CDQVDIYEFLPSKRKTDVCYYYQKFFDSACTMGAYHPLLYEKNLVKHLNQGTDEDIYL LGKATLPGFRTIHC" BASE COUNT 560 a 596 c 530 g 494 t ORIGIN 1 gcccggcgtt aacaaaggga gccgataccg accggcgtgg gcgcggagcg ggcggccgcc 61 accgagcgtg ctgagcaacc gcagcctccg cggccgagag tgcagcgagc aaggggagag 121 ccagttgcgc agagccctgc aaccagcagt ccagggagaa gtggtgaatg tcatggagcc 181 cagctgaaat ggactggccc ccttgagcct gtcccaagcc ctggtgccag gtgtccatcc 241 ccgtgctgag atgagttttg atcatcctga gaaaaatggg ccttggcctg cagacccaat 301 aaaccttccc tcccatggat aatagtgcta attcctgagg acctgaaggc ctgccgcccc 361 tgggggatta gccagaagca ggcttgtttt cctgctcaga acaaagtgac ttccctgaac 421 acatcttcat tatgattcac accaacctga agaaaaagtt cagctgctgc gtcctggtct 481 ttcttctgtt tgcagtcatc tgtgtgtgga aggaaaagaa gaaagggagt tactatgatt 541 cctttaaatt gcaaaccaag gaattccagg tgttaaagag tctggggaaa ttggccatgg 601 ggtctgattc ccagtctgta tcctcaagca gcacccagga cccccacagg ggccgccaga 661 ccctcggcag tctcagaggc ctagccaagg ccaaaccaga ggcctccttc caggtgtgga 721 acaaggacag ctcttccaaa aaccttatcc ctaggctgca aaagatctgg aagaattacc 781 taagcatgaa caagtacaaa gtgtcctaca aggggccagg accaggcatc aagttcagtg 841 cagaggccct gcgctgccac ctccgggacc atgtgaatgt atccatggta gaggtcacag 901 attttccctt caatacctct gaatgggagg gttatctgcc caaggagagc attaggacca 961 aggctgggcc ttggggcagg tgtgctgttg tgtcgtcagc gggatctctg aagtcctccc 1021 aactaggcag agaaatcgat gatcatgacg cagtcctgag gtttaatggg gcacccacag 1081 ccaacttcca acaagatgtg ggcacaaaaa ctaccattcg cctgatgaac tctcagttgg 1141 ttaccacaga gaagcgcttc ctcaaagaca gtttgtacaa tgaaggaatc ctaattgtat 1201 gggacccatc tgtataccac tcagatatcc caaagtggta ccagaatccg gattataatt 1261 tctttaacaa ctacaagact tatcgtaagc tgcaccccaa tcagcccttt tacatcctca 1321 agccccagat gccttgggag ctatgggaca ttcttcaaga aatctcccca gaagagattc 1381 agccaaaccc cccatcctct gggatgcttg gtatcatcat catgatgacg ctgtgtgacc 1441 aggtggatat ttatgagttc ctcccatcca agcgcaagac tgacgtgtgc tactactacc 1501 agaagttctt cgatagtgcc tgcacgatgg gtgcctacca cccgctgctc tatgagaaga 1561 atttggtgaa gcatctcaac cagggcacag atgaggacat ctacctgctt ggaaaagcca 1621 cactgcctgg cttccggacc attcactgct aagcacaggc tcctcactct tctccatcag 1681 gcattaaatg aatggtctct tggccacccc agcctgggaa gaacattttc ctgaacaatt 1741 ccagcctgct ccttttactc taggggcctc tgtcagcaag accatgggac ttcaagagcc 1801 tgtggtcagg aaatcaggtc cagccttccc tgtagccaga cagtttatga gcccagagcc 1861 tcctgccaca cacatgcaca catatctagc attctttcca agacagcatc ctccccgcct 1921 tccaccttgt agatgcaagg tctatctctc ccatcagggc tgccaaagct gggctttgtt 1981 tttcccagca gaatgatgcc attctcacaa accaatgctc tatattgctt gaagtctgca 2041 tctaaatatt gatttcacgt tttaaagaaa ttctcttaaa ttacaattgt gcccaatgca 2101 gggtggctct ggggggcaag taggtggtac aggggattgg aaacaatcgt ccgcgcctcc 2161 agagaaaagt tgctcccgag // LOCUS HSSIGMA3B 1829 bp RNA PRI 08-JAN-1997 DEFINITION H.sapiens mRNA for sigma 3B protein. ACCESSION X99459 NID g1770514 KEYWORDS Sigma 3B protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1829) AUTHORS Dell'Angelica,E.C., Ohno,H., Ooi,C.E., Rabinovich,E., Roche,K.W. and Bonifacino,J.S. JOURNAL Unpublished REFERENCE 2 (bases 1 to 1829) AUTHORS Dell'Angelica,E.C. TITLE Direct Submission JOURNAL Submitted (18-JUL-1996) E.C. Dell'Angelica, Cell Biology and Metabolism Branch, National Institute of Child Health and Human Development, Nat. Institues of Health, Bldg. 18T, Rm. 101, NIH,Bethesda,MD20892, USA FEATURES Location/Qualifiers source 1..1829 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="brain" /clone="FRACE7" CDS 31..612 /function="component of an adaptor-like protein complex" /codon_start=1 /product="sigma 3 protein" /db_xref="PID:e256813" /db_xref="PID:g1770515" /translation="MIQAILVFNNHGKPRLVRFYQRFPEEIQQQIVRETFHLVLKRDD NICNFLEGGSLIGGSDYKLIYRHYATLYFVFCVDSSESELGILDLIQVFVETLDKCFE NVCELDLIFHMDKVHYILQEVVMGGMVLETNMNEIVAQIEAQNRLEKSEGGLSAAPAR AVSAVKNINLPEIPRNINIGDLNIKVPNLSQFV" repeat_region 1220..1510 /rpt_family="Alu-like sequence" BASE COUNT 440 a 453 c 460 g 476 t ORIGIN 1 ccggtgctga gagaaccgtg gctggcaaag atgattcagg cgattctggt tttcaacaac 61 catgggaagc cacggctagt ccgcttctac cagcgtttcc cagaagaaat tcaacagcag 121 attgttcgag agactttcca tctagtcctc aagcgggatg acaacatctg taacttcttg 181 gagggtggaa gtttgattgg tggctctgac tacaaactga tctaccggca ctatgctacc 241 ctctactttg tattttgtgt ggattcctca gagagtgaac ttggaatctt ggacctcatc 301 caggtttttg tggaaactct ggataagtgt ttcgaaaatg tgtgtgaatt ggatttgatc 361 ttccatatgg ataaggtgca ctacatcctc caggaggtgg tgatgggtgg gatggtgttg 421 gaaacaaaca tgaatgaaat cgtggctcag attgaggctc aaaacaggct ggagaaatcc 481 gagggtggcc tttcagcagc ccctgcgcgg gctgtgtctg ctgtgaaaaa catcaacctg 541 ccagagattc ctcggaacat caacattggc gatctcaaca tcaaagttcc caacctgtcc 601 cagtttgtct gaggatcaag tattggcctg aaatagagtc cttaagacaa gcaaagacaa 661 gcaaggcaag cacgtctgga aacagaaccc attttgagcc ttagaagagt caagcctcag 721 gacctggaaa ctttgtgtct ggggaagact gtttggcatg gaatagggaa gggattccta 781 ttgacactgc tcgggtgcac ccagttctca catgtgcagt catgccgttc tctgatgcat 841 acggccactg cagatgtgag gggccctgcc ttcctcagta gggagtcaac atgcccaagt 901 catttgcacc tttacctctc acatggatgc tcccaagggt tagggactgc attgagcagg 961 cccacctgct tcccagaacc tcctcactag ggctgagcac cttctctgag tagagtcttc 1021 atccttagca ccacagactt ctgaggtcct gtgcccttta cttgctggtg aggtgtcata 1081 ggtagaaaag ggctggccct tcagatctgg gggtgtggtg agtggcaagt aagggcagaa 1141 ttttaggaga accagagtca cccgctggct ctactgagat tgttacaccc agaatccttt 1201 tgtgtttttt tgtggttttt tttttttgag gtggagtctt gctctgtcac ccaggctgga 1261 gtgctgtggt gcaatctcgg ctcactgcaa cctctgcttc ccgggttcaa gcatttctcc 1321 tgtctcagcc tccccagtag ctgggattac aggcacccac caccatgccc agctaattgt 1381 tgtatgttta gtagagacag ggtttcacca tgttggccag gctgggctcg aactcctgga 1441 cctcaagtga tctacccgcc ttggcctccc aaagtgctgg cattacaggt gtgagccacc 1501 gtgcccggcc accagaatcc tttggtatag ccaagccttt tggttaccgc ctcatgaaga 1561 atatgcttcc cgcattgtcc tagtcccagt tgtattctca caggtgttat gtgcaggaca 1621 caatccaaat cataaacctg gctcatgccc aacacatttc tgctaatagg gagagggacc 1681 caccacacac ccacacatgc cagaggtccc tcctcacaga ggagagggcc tgtgtctgta 1741 gaaggttaaa gctgacaaca tgtgaaacat cccagaatta tgactcttcc caagtttaaa 1801 atacattctc ctcatgagag cagaaggtt // LOCUS HSSIX1 1378 bp RNA PRI 04-JUL-1996 DEFINITION H.sapiens mRNA for SIX1 protein. ACCESSION X91868 NID g1246760 KEYWORDS six1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1378) AUTHORS Boucher,C.A., Carey,N., Edwards,Y.H., Siciliano,M.J. and Johnson,K.J. TITLE Cloning of the human SIX1 gene and its assignment to chromosome 14 JOURNAL Genomics 33 (1), 140-142 (1996) MEDLINE 96207313 REFERENCE 2 (bases 1 to 1378) AUTHORS Boucher,C.A. TITLE Direct Submission JOURNAL Submitted (28-SEP-1995) C.A. Boucher, Division of Molecular Genetics, Glasgow University, Anderson College, 56 Dumbarton Road, Glasgow, G11 6NU, UK FEATURES Location/Qualifiers source 1..1378 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="muscle" /chromosome="14" gene 276..1130 /gene="six1" CDS 276..1130 /gene="six1" /codon_start=1 /db_xref="PID:e205177" /db_xref="PID:g1246761" /translation="MSMLPSFGFTQEQVACVCEVLQQGGNLERLGRFLWSLPACDHLH KNESVLKAKAVVAFHRGNFRELYKILESHQFSPHNHPKLQQLWLKAHYVEAEKLRGRP LGAVGKYRVRRKFPLPRTIWDGEETSYCFKEKSRGVLREWYAHNPYPSPREKRELAEA TGLTTTQVSNWFKNRRQRDRAAEAKERENTENNNSSSNKQNQLSPLEGGKPLMSSSEE EFSPPQSPDQNSVLLLQGNMGHARSSNYSLPGLTASQPSHGLQTHQHQLQDSLLGPLT SSLVDLGS" misc_feature 642..821 /gene="six1" /note="homeobox" BASE COUNT 296 a 446 c 381 g 255 t ORIGIN 1 ggtagcagca tccaccgggc gggaggtcgg aggcagcaag gccttaaagg ctactgagtg 61 cgccggccgt tccgtgtcca gaacctcccc tactcctccg ccttctcttc cttggccgcc 121 caccgccaag ttccgactcc ggttttcgcc tttgcaaagc ctaaggagga ggttaggaac 181 agccgcgccc ccctccctgc ggccgccgcc ccctgcctct cggctctgct ccctgccgcg 241 tgcgcctggg ccgtgcgccc cggcaggcgc cagccatgtc gatgctgccg tcgtttggct 301 ttacgcagga gcaagtggcg tgcgtgtgcg aggttctgca gcaaggcgga aacctggagc 361 gcctgggcag gttcctgtgg tcactgcccg cctgcgacca cctgcacaag aacgagagcg 421 tactcaaggc caaggcggtg gtcgccttcc accgcggcaa cttccgtgag ctctacaaga 481 tcctggagag ccaccagttc tcgcctcaca accaccccaa actgcagcaa ctgtggctga 541 aggcgcatta cgtggaggcc gagaagctgc gcggccgacc cctgggcgcc gtgggcaaat 601 atcgggtgcg ccgaaaattt ccactgccgc gcaccatctg ggacggcgag gagaccagct 661 actgcttcaa ggagaagtcg aggggtgtcc tgcgggagtg gtacgcgcac aatccctacc 721 catcgccgcg tgagaagcgg gagctggccg aggccaccgg cctcaccacc acccaggtca 781 gcaactggtt taagaaccgg aggcaaagag accgggccgc ggaggccaag gaaagggaga 841 acaccgaaaa caataactcc tcctccaaca agcagaacca actctctcct ctggaagggg 901 gcaagccgct catgtccagc tcagaagagg aattctcacc tccccaaagt ccagaccaga 961 actcggtcct tctgctgcag ggcaatatgg gccacgccag gagctcaaac tattctctcc 1021 cgggcttaac agcctcgcag cccagtcacg gcctgcagac ccaccagcat cagctccaag 1081 actctctgct cggccccctc acctccagtc tggtggactt ggggtcctaa gtggggaggg 1141 actggggcct cgaagggatt cctggagcag caaccactgc agcgactagg gacacttgta 1201 aatagaaatc aggaacattt ttgcagcttg tttctggagt tgtttgcgca taaaggaatg 1261 gtggactttc acaaatatct ttttaaaaat caaaaccaac agcgatctca agcttaatct 1321 cctcttctct ccaactcttt ccacttttgc attttccttc ccaatgcaga gatcaggg // LOCUS HSSKAP55 1524 bp RNA PRI 09-JUL-1997 DEFINITION Homo sapiens mRNA for SKAP55 protein. ACCESSION Y11215 NID g2252495 KEYWORDS SKAP55 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1524) AUTHORS Marie-Cardine,A., Bruyns,E., Eckerskorn,C., Kirchgessner,H., Meuer,S.C. and Schraven,B. TITLE Molecular cloning of SKAP55, a novel protein that associates with the protein tyrosine kinase p59(fyn) in human T-lymphocytes JOURNAL J. Biol. Chem. 272 (26), 16077-16080 (1997) MEDLINE 97341130 REFERENCE 2 (bases 1 to 1524) AUTHORS Marie-Cardine,A. TITLE Direct Submission JOURNAL Submitted (13-FEB-1997) A. Marie-Cardine, Ruprecht-Karls University Heidelberg, Institute of Immunology, Im Neuenheimer Feld 305, Heidelberg, 69120, FRG FEATURES Location/Qualifiers source 1..1524 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Jurkat" /cell_type="leukocytes" /clone_lib="lambda ZAP" /tissue_type="blood" gene 71..1150 /gene="SKAP55" CDS 71..1150 /gene="SKAP55" /codon_start=1 /db_xref="PID:e321911" /db_xref="PID:g2252496" /translation="MQAAALPEEIRWLLEDAEEFLAEGLRNENLSAVARDHRDHILRG FQQIKARYYWDFQPQGGDIGQDSSDDNHSGTLGLSLTSDAPFLSDYQDEGMEDIVKGA QELDNVIKQGYLEKKSKDHSFFGSEWQKRWCVVSRGLFYYYANEKSKQPKGTFLIKGY SVRMAPHLRRDSKKESCFELTSQDRRTYEFTATSPAEARDWVDQISFLLKDLSSLTIP YEEDEEEEEKEETYDDIDGFDSPSCGSQCRPTILPGSVGIKEPTEEKEEEDIYEVLPD EEHDLEEDESGTRRKGVDYASYYQGLWDCHGDQPDELSFQRGDLIRILSKEYNMYGWW VGELNSLVGIVPKEYLTTAFEVEER" BASE COUNT 432 a 358 c 388 g 346 t ORIGIN 1 gtcgccttcc agcccgtccg cctcccgacc agggcccgcg ccccgtcccg cctctctccc 61 gcccagccaa atgcaggccg ccgccctccc tgaggagatc cgttggctcc tggaagatgc 121 tgaagagttt ctggcagaag gtttgcggaa tgagaacctc agcgctgttg caagggatca 181 cagagaccat attctacggg gctttcagca aatcaaagcc aggtactatt gggattttca 241 gccccaaggg ggagacattg gacaggacag ctctgatgat aatcacagcg ggactcttgg 301 cctgtccctc acatccgatg cacccttttt gtcagattat caggatgagg gaatggaaga 361 catcgtaaaa ggagctcaag aacttgataa cgtaatcaag caaggatact tggagaagaa 421 aagcaaagat catagtttct ttggatcgga gtggcagaag cgatggtgtg ttgtcagcag 481 aggtctcttc tactactatg ctaatgagaa gagcaagcag cccaaaggga ccttcctcat 541 taagggctac agtgtacgga tggcccccca cctgcgaaga gattccaaga aagaatcctg 601 ctttgaactg acctcccagg ataggcgcac gtatgagttt acagctacta gtccagcaga 661 agccagagac tgggtggatc aaataagttt cttgttaaag gatctgagct ccttaaccat 721 tccatatgaa gaggatgagg aggaagaaga aaaagaagag acatatgatg atattgatgg 781 ttttgactcc ccaagttgtg gttcccagtg cagacccact atcttgcctg ggagtgtggg 841 gataaaagag cctacagagg agaaagaaga agaagatatt tatgaagtct tgccagatga 901 agagcatgat ctagaagagg atgagagtgg cactcgacga aaaggagtag actatgccag 961 ttactaccag ggcctatggg attgccatgg tgaccagcca gatgaactgt ccttccaacg 1021 gggtgacctc atccgtattc tgagcaagga gtataacatg tatggctggt gggtgggaga 1081 actgaacagc ctcgttggga ttgttccaaa ggagtatctc accactgcct ttgaagtgga 1141 agaaagatga aacccaggaa atatattctt ccctctctcc tcctttatga ggaaactgat 1201 catcaaaagt tcccactccc tacttctgca cccaccaacg cctgactcct ctctttgctg 1261 aagagaccca agtctcttga cacctcagag tgactgtaag ctaccagtaa gacaagtggg 1321 aagaggcacg ttcatcaaac ctgttactaa accagcctag tcatagctca tccccatgtg 1381 taaatgtgtc cacacaacca catctgcctt ttccacaagc ttttcacaaa gaaggtgaga 1441 gagaaggaaa ccttgggagg aggacattac tggttgttct ggctggtttg aaaagcacaa 1501 ataaacttgg gatgtggttc cttg // LOCUS HSSKIR 3511 bp RNA PRI 12-SEP-1993 DEFINITION Human ski oncogene mRNA. ACCESSION X15218 NID g36483 KEYWORDS oncogene; ski oncogene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3511) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (08-MAY-1989) Nomura N., Molecular Oncology Laboratory, Nippon Medical School, Sakuragi, 1-10-19 Uenosakuragi Taito-ku, Tokyo 110, JAPAN REFERENCE 2 (bases 1 to 3511) AUTHORS Nomura,N., Sasamoto,S., Ishii,S., Date,T., Matsui,M. and Ishizaki,R. TITLE Isolation of human cDNA clones of ski and the ski-related gene, sno JOURNAL Nucleic Acids Res. 17 (14), 5489-5500 (1989) MEDLINE 89345144 COMMENT see x15217 and x15219 for ski-related mRNAs, snoA and snoN. FEATURES Location/Qualifiers source 1..3511 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="umbilicord vein" /cell_type="endothelial cells." CDS 73..2259 /note="ski protein (AA 1 - 728)" /codon_start=1 /db_xref="PID:g36484" /db_xref="SWISS-PROT:P12755" /translation="MEAAAGGRGCFQPHPGLQKTLEQFHLSSMSSLGGPAAFSARWAQ EAYKKESAKEAGAAAVPAPVPAATEPPPVLHLPAIQPPPPVLPGPFFMPSDRSTERCE TVLEGETISCFVVGGEKRLCLPQILNSVLRDFSLQQINAVCDELHIYCSRCTADQLEI LKVMGILPFSAPSCGLITKTDAERLCNALLYGGAYPPPCKKELAASLALGLELSERSV RVYHECFGKCKGLLVPELYSSPSAACIQCLDCRLMYPPHKFVVHSHKALENRTCHWGF DSANWRAYILLSQDYTGKEEQARLGRCLDDVKEKFDYGNKYKRRVPRVSSEPPASIRP KTDDTSSQSPAPSEKDKPSSWLRTLAGSSNKSLGCVHPRQRLSAFRPWSPAVSASEKE LSPHLPALIRDSFYSYKSFETAVAPNVALAPPAQQKVVSSPPCAAAVSRAPEPLATCT QPRKRKLTVDTPGAPETLAPVAAPEEDKDSEAEVEVESREEFTSSLSSLSSPSFTSSS SAKDLGSPGARALPSAVPDAAAPADAPSGLEAELEHLRQALEGGLDTKEAKEKFLHEV VKMRVKQEEKLSAALQAKRSLHQELEFLRVAKKEKLREATEAKRNLRKEIERLRAENE KKMKEANESRLRLKRELEQARQARVCDKGCEAGRLRAKYSAQIEDLQVKLQHAEADRE QLRADLLREREAREHLEKVVKELQEQLWPRARPEAAGSEGAAELEP" variation 3369 /note="a is g in variant clone" misc_feature 3373 /note="3' end of variant clone with 7 additional A residues" misc_feature 3503..3508 /note="put. polyA signal" BASE COUNT 663 a 1159 c 1089 g 600 t ORIGIN 1 cggggcggcg gcgggggccg ggggggcccg ggcgcgcggg agcgggagcg gccgggggag 61 ccggagcgca ccatggaggc ggcggcaggc ggccgcggct gtttccagcc gcacccgggg 121 ctgcagaaga cgctggagca gttccacctg agctccatga gctcgctggg cggcccggcc 181 gctttctcgg cgcgctgggc gcaggaggcc tacaagaagg agagcgccaa ggaggcgggc 241 gcggccgcgg tgccggcgcc ggtgcccgca gccaccgagc cgccgcccgt gctgcacctg 301 cccgccatcc agccgccgcc gcccgtgctg cccgggccct tcttcatgcc gtccgaccgc 361 tccaccgagc gctgcgagac cgtactggaa ggcgagacca tctcgtgctt cgtggtggga 421 ggcgagaagc gcctgtgtct gccgcagatt ctcaactcgg tgctgcgcga cttctcgctg 481 cagcagatca acgcggtgtg cgacgagctc cacatctact gctcgcgctg cacggccgac 541 cagctggaga tcctcaaagt catgggcatc ctgcccttct cggcgccctc gtgcgggctc 601 atcaccaaga cggacgccga gcgcctgtgc aacgcgctgc tctacggcgg cgcctacccg 661 ccgccctgca agaaggagct ggccgccagc ctggcgctgg gcctggagct cagcgagcgc 721 agcgtccgcg tgtaccacga gtgcttcggc aagtgtaagg ggctgctggt gcccgagctc 781 tacagcagcc cgagcgccgc ctgcatccag tgcctggact gccgcctcat gtacccgccg 841 cacaagttcg tggtgcactc gcacaaggcc ctggagaacc ggacctgcca ctggggcttc 901 gactcggcca actggcgggc ctacatcctg ctgagccagg attacacggg caaggaggag 961 caggcgcgcc tcggccgctg cctggacgac gtgaaggaga aattcgacta tggcaacaag 1021 tacaagcggc gggtgccccg ggtctcctct gagcctccgg cctccataag acccaaaaca 1081 gatgacacct cttcccagtc ccccgcgcct tccgaaaagg acaagccgtc cagctggctg 1141 cggaccttgg ccggctcttc caataagagc ctgggctgtg ttcaccctcg ccagcgcctc 1201 tctgctttcc gaccctggtc ccccgcagtg tcagcgagtg agaaagagct ctccccacac 1261 ctcccggccc tcatccgaga cagcttctac tcctacaaga gctttgagac agccgtggcg 1321 cccaacgtgg ccctcgcacc gccggcccag cagaaggttg tgagcagccc tccgtgtgcc 1381 gccgccgtct cccgggcccc cgagcctctc gccacttgca cccagcctcg gaagcggaag 1441 ctgactgtgg acaccccagg agccccagag acgctggcgc ccgtggctgc cccagaggag 1501 gacaaggact cggaggcgga ggtggaagtt gaaagcaggg aggaattcac ctcctccttg 1561 tcctcgctct cttccccgtc ctttacctca tccagctccg ccaaggacct gggctccccg 1621 ggtgcgcgtg ccctgccctc ggccgtccct gatgctgcgg cccctgccga cgcccccagt 1681 gggctggagg cggagctgga gcacctgcgg caggcactgg agggcggcct ggacaccaag 1741 gaagccaaag agaagttcct gcatgaggtg gtcaagatgc gcgtgaagca ggaggagaag 1801 ctcagcgcag ccctgcaggc caagcgcagc ctccaccagg agctggagtt cctacgcgtg 1861 gccaagaagg agaagctgcg ggaggccacg gaggccaagc gtaacctgcg gaaggagatc 1921 gagcgtctcc gcgccgagaa cgagaagaag atgaaagagg ccaacgagtc acggctgcgc 1981 ctgaagcggg agctggagca ggcgcggcag gcccgggtgt gcgacaaggg ctgcgaggcg 2041 ggccgcctgc gcgccaagta ctcggcccag atcgaagacc tgcaggtgaa gctgcagcac 2101 gcggaggcgg accgggagca gctgcgggcc gacctgctgc gggagcgcga ggcccgggag 2161 cacctggaga aggtggtgaa ggagctgcag gaacagctgt ggccgcgggc ccgccccgag 2221 gctgcgggca gcgagggcgc tgcggagctg gagccgtaga ttccgtgcct gccgccgcag 2281 cgccgccgac aacgcgggtg caggggggcg cggctgggcg gtgcagctcc gcccggctcc 2341 gcccctgcag cccacacagc acaacgtctt accgtgccta ttaccaagcg agtgtttgta 2401 accatgtagt tttggaaccc actgcaaaat tttctactgg ccaagttcaa gtgagtaagc 2461 cgcgtccccc aactacagct ggagacgggg ccagctcggc ggcctgctgg tcctctgctt 2521 gctggaacat tctaacattt acacttttgt tataagctat ttaaaaccag taaggagact 2581 tgaaattcag aaaatcaaca catttttaaa tgactaactt ctaaaagccc caacacatga 2641 cgccatctga agacccgcaa cggagtgggg gtggcggccg ccccaccctc cccacccggg 2701 gaagccatca cagctcatct gcccgcggct gcgtgaggac agcaggggtt tttcttcaga 2761 gtctattttt tcagcgacaa ggacccaggt cttcctgctg ctgccaggga gagcagggac 2821 agtgccgcgt gcgagatgag ctcgaacact gcccgcctta ctgccgccta ccccgcccgc 2881 cacgccgccg tcgatgccag cgctgtcccc acgggtacca ggaagtgcag agccgcacag 2941 gagctgcccc ggagctgagg ggacggtctt cggctcctct gcaccccgtg attctgccca 3001 cgctcctcca ccacgaggca ctgacctgcg tcgggtggtg accgtggctg gcggtcacgc 3061 cctcagccct ccgggcacac gtgccgcctg accgggcgac ccttttcagt tcggcaaacg 3121 tcgctccctt cattttggga ctgaggctgc agcattggaa caaaagagca ttatttcaat 3181 ttttctttct ttttttttgt tcgttcattt aaacgtatat ttagaactgc actttgtcca 3241 caaccttccc ttctctttct attccccagt gaactgaggt ttttaccgac tttatagagc 3301 agtcaaatcc gaagtgctcg agtgcttaga aaccccctct ggtgcttggt tgaacaaggg 3361 aatcacaaaa aaacgaaaat gcaaaaactg aacttcgggg gtcgttctgt gccttccagc 3421 atcttgtaca gcaaatcctg actcgtgtct ttttaccccc aagatatctg tcttcagtag 3481 cgactgaatc tgccactctc agaataagtt c // LOCUS HSSLN2 1417 bp DNA PRI 25-NOV-1997 DEFINITION Human sarcolipin (SLN) gene, exon 2 and complete cds. ACCESSION U96093 NID g1943763 KEYWORDS . SEGMENT 2 of 2 SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1417) AUTHORS Odermatt,A., Taschner,P.E.M., Scherer,S.W., Beatty,B., Khanna,V.K., Cornblath,D.R., Chaudhry,V., Yee,W.C., Schrank,B., Karpati,G., Breuning,M.H., Knoers,N. and MacLennan,D.H. TITLE Characterization of the gene encoding human sarcolipin (SLN), a proteolipid associated with SERCA1: absence of structural mutations in five patients with brody disease JOURNAL Genomics 45 (3), 541-553 (1997) MEDLINE 98035878 REFERENCE 2 (bases 1 to 1417) AUTHORS Odermatt,A. and MacLennan,D.H. TITLE Direct Submission JOURNAL Submitted (01-APR-1997) Banting and Best Department of Medical Research, Best Institute, 112 College St., Toronto, Ontario M5G 1L6, Canada FEATURES Location/Qualifiers source 1..1417 /organism="Homo sapiens" /db_xref="taxon:9606" 5'UTR join(U96092:973..1066,240..314) /gene="SLN" gene join(U96092:973..1387,1..861) /gene="SLN" mRNA join(U96092:973..1066,240..861) /gene="SLN" exon 240..861 /gene="SLN" /number=2 CDS 315..410 /gene="SLN" /note="proteolipid" /codon_start=1 /product="sarcolipin" /db_xref="PID:g2642411" /translation="MGINTRELFLNFTIVLITVILMWLLVRSYQY" 3'UTR 411..861 /gene="SLN" BASE COUNT 402 a 308 c 272 g 435 t ORIGIN 1 ccctccttaa ttacctaatt ttaacttatt tacctcttaa aagaccctat cttcaaacac 61 agtcacattc tgagatactg tgagctaggg ctttgacatg tagatttttg ggggacacaa 121 tttagcccat cacactctcc ttttccacaa cacttctgtt tcctttgagg aaagaacggg 181 cattgttata caggaatgcc cattaatctc cttgtgtttt ctttgttatg ttttatcagg 241 aggtgaggac aagccagagg tccttggtgt gccctcagaa atctgcctgc agttctcacc 301 aagccgctgt gaaaatgggg ataaacaccc gggagctgtt tctcaacttc actattgtct 361 tgattacggt tattcttatg tggctccttg tgaggtccta tcagtactga gaggccatgc 421 catggtcctg ggattgactg agatgctccg gagctgcctg ctctatgccc tgagacccca 481 ctgctgtcat tgtcacagga tgccattctc catccgaggg cacctgtgac ctgcactcac 541 aatatctgct atgctgtagt gctaggattg attatgtgtt ctccaaagat gctgctccca 601 agggctgcca agtgtttgcc agggaacggt agatttattc cccaactctt aactgaaaat 661 gtgttagaca agccacaaag ttaaaattaa actggattca tgatgatgta ggattgttac 721 aagcccctga tctgtctcac cacacatccc ttcaacccac acggtctgca accaaactct 781 aattcaacct gccagaagga atgttagagg aagtctttgt cagcccttat agctatcatg 841 tgaataaagt taagtcaact tcaaaaacaa cttctagaac ttattttagc ttccatgtgt 901 gacagagcat ttgacccttg gctgggattg gagtgacaag tgctaccgta tttctagcat 961 ttgaggtaag ccaagatgct ccaactgctg aagatttgaa accaagtcaa cacactgtgt 1021 catatttcaa gtaattccat tggttcagcg ctcctcaaac ttttccccta aactagtctg 1081 aagggcagag ggagaataaa tccattccac tacggggtct gaagcacagg ctgaattgct 1141 ggctaaaagt gcaacatttc tttgaagtct tgtgttttat ctttagaatc cacaagaaat 1201 gtattttcta tcttataata tcttcatgtt tgttttcata taaatattta aaattattta 1261 cactaagtaa cacaagaaca tgagtcatgt ccctaagagt agcatagtct attttgatta 1321 ttgttattac acaaatggag ctagtcttta atcaactaca gttctaaaag gaggaaaata 1381 gaaaatgcaa acttatatgt ttataaaaga catataa // LOCUS HSSMA3 1327 bp RNA PRI 14-FEB-1995 DEFINITION H.sapiens SMA3 mRNA. ACCESSION X83299 NID g603027 KEYWORDS SMA3 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1327) AUTHORS Theodosiou,A.M., Morrison,K.E., Nesbit,A.M., Daniels,R.J., Campbell,L., Francis,M.J., Christodoulou,Z. and Davies,K.E. TITLE Complex repetitive arrangements of gene sequence in the candidate region of the spinal muscular atrophy gene in 5q13 JOURNAL Am. J. Hum. Genet. 55 (6), 1209-1217 (1994) MEDLINE 95067986 REFERENCE 2 (bases 1 to 1327) AUTHORS Theodosiou,A.M. TITLE Direct Submission JOURNAL Submitted (08-DEC-1994) A.M. Theodosiou, Molecular Genetics Group, Inst. Mol. Med., John Radcliffe Hospital, Headington, OX3 9DU, UK FEATURES Location/Qualifiers source 1..1327 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="foetus, 25 weeks" /tissue_type="brain" /clone_lib="lambda gt10" /chromosome="5" /map="5q13" gene 277..699 /gene="SMA3" CDS 277..699 /gene="SMA3" /codon_start=1 /db_xref="PID:g671530" /translation="MDRSNPVKPALDYFLNRLVNYQISVKCSNQFKLEVCLLNAENKV VDNQAGTQGQLKVLGANLWWPYLMHEHPAYLYSWEDGDCSHQSLGPLPACDLCDQLHL RSRQGGSVCGCDPCEQLLLLVSQLRAPGVDSAAAGRPV" BASE COUNT 383 a 310 c 349 g 285 t ORIGIN 1 ggtcgacgac gtggcagccg ggatgcacta ggcaaagcca gctgggctcc tgagtccggt 61 gggtacttgg agaacttact acgtctagct ggaggattgt aaatgtacca atcagcatgc 121 tgtgtctagc tcaagaactc aagctccatg aggagatgtt tcattgtcga gagcagtcat 181 gatggcctgc actccacaca atgcaacaga gtgaaagagc aggttctgct tctttggtgt 241 agtcctgaag cttcctaaga aacttcacat caggtgatgg ataggagcaa ccctgtaaaa 301 ccagccttag actatttttt aaacaggctg gtgaattacc agatctccgt caagtgcagt 361 aaccagttca agttggaagt gtgtcttttg aatgcagaaa acaaagtcgt ggacaaccag 421 gctgggaccc agggccagct gaaggtgctg ggtgccaacc tctggtggcc gtacctgatg 481 cacgaacacc ccgcctacct ctactcgtgg gaggatggtg attgctcaca ccaaagcctt 541 ggacccctcc cagcctgtga cctttgtgac caactccacc tacgcagcag acaagggggc 601 tctgtatgtg gatgtgatcc gtgtgaacag ctactactct tggtatcgca actacgggca 661 cctggagttg attcagctgc agctggccgc ccagtttgag aattggtgta agacatcaca 721 atcccattat tcagagcgcg tatggagtgg aaacgcttgt agggtttcac cagtctttcc 781 cagggaactc cgatgaagtg ttccaacaaa atgagcgagt gaaccaagaa gaggatgaca 841 ttagatccag gagatacaac agaggagata atctccagga tgcctgtgaa gaaagatccc 901 tggatcccag gatgattata ggacaagttg ttcataatcc agcaggccag aagacttcca 961 gggaaactca ttcaaggagg tgaaaatgat ggatgactcc tccaagatga aaatggacca 1021 gccgcagtgc tcacgcctgt aataccagca ctttgggagg ctgaggcagg cggatcactt 1081 gaggtcagga gtttgaaact agcctggcca acgtggcaaa actccatctc tattaaaaat 1141 acaaaaatta gccaggcata gtggtgcatg cctgtagtcc cagctacttg ggatgccgag 1201 gcaggaagaa ttgcttgaac ctgggaggca gagtctgcag tgagccgaga tcatgccact 1261 gcactccagc ctgggtgaca gagccactcc gtctcaaaaa aaaaaaaaaa aaaaaaaaaa 1321 aaaaaaa // LOCUS HSSMCPRO 3833 bp RNA PRI 30-JUN-1993 DEFINITION H.sapiens mRNA for skeletal muscle C-protein. ACCESSION X66276 S48156 NID g36500 KEYWORDS C protein; fibronectin repeats; myosin binding; sarcomere A-band; titin binding. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3833) AUTHORS Fuerst,D.O. TITLE Direct Submission JOURNAL Submitted (15-MAY-1992) D.O. Fuerst, MPI for Biophysical Chemistry, Am Fassberg, 3400 Goettingen, FRG REFERENCE 2 (bases 1 to 3833) AUTHORS Furst,D.O., Vinkemeier,U. and Weber,K. TITLE Mammalian skeletal muscle C-protein: purification from bovine muscle, binding to titin and the characterization of a full-length human cDNA JOURNAL J. Cell. Sci. 102 (Pt 4), 769-778 (1992) MEDLINE 93054997 FEATURES Location/Qualifiers source 1..3833 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="skeletal muscle" /clone_lib="lambda gt11" CDS 97..3513 /codon_start=1 /product="C protein" /db_xref="PID:g36501" /db_xref="SWISS-PROT:Q00872" /translation="MPEPTKKEENEVPAPAPPPEEPSKEKEAGTTPAKDWTLVETPPG EEQAKQNANSQLSILFIEKPQGGTVKVGEDITFIAKVKAEDLSEKPTINGSRKWMDLA SKAGKHLQLKETFERHSRVYTFEMQIIKAKDNFAGNYRCEVTYKDKFDSCSFDLEVHE STGTTPNIDIRSAFKRSGEGQEDAGELDFSGLLKRREVKQQEEEPQVDVWELLKNTKP SEYEKIAFQYESPTCSGMLKRLKRSIREEKKSAAFAKILDPVYQVDKGGRVRFVVELA DPKLEVKWNKNGQELRPSTKYIFEDTRCQSILNIDNCQMTDDSEYYVTAGDEKCSTEL LVREPPIMVTKQLEDTTDYCGERVELECEVSEDDAQVKWFKNGEEIILVQTRYRIRVE GKKHILIIEGATKADAADYSVMTTGGQSSAKLSVDLKPLKILTPLTDQTVNLGKEICL KCEISENIPGKWTKNGLPVQESDRLKVVHKGRIHKLVIDHALTEDEGDYVFAPDAYNV TLPAKVHVIDPPKIILDGLDADNTVTVIAGNKLRLEIPISGEPPPKAMWSRGDKAIME GSGRIRTESYPDSSTLVIDIAERDDSGVYHINLKNEAGEAHASIKVKVVDFPDPPVAP TVTEVGDDWCIMNWEPPAYDGGSPILGYFIERKKKQSSRWMRLNFDLCKETTFEPKKM IEGVAYEVRIFAVNAIGISKPSMPSRPFVPLAVTSPPTLLTVDSVTDTTVTMRWRPPD HIGAAGLDGYVLEYCFEGSTSAKQSDENGEAAYDLPAEDWIVANKDLIDKTKFTITGL PTDAKIFVRVKAVNAAGASEPKYYSQPILVKEIIEPPKIHSPKHLKQTYIRRVGDRVI LVIPFQGKPRPELTWKKDGAEIDKNQINIRNSETDTIIFIRKAERSHSGKYDLQVKVD KFVETASIDIRIIDRPGPPQIVKIEDVWGRNVALTWTPPKDDGNAAITGYTIQKADKK SMEWLRVIEHIIEPVPHTELVIGNEYYFRVFSENMCGLSEDATMTKESAVIARDGKIY KNPVYEDFDFSEAPMFTQPLVNRLCHSGYMATLNCSVRGNPKPKITWMKNKVAIVDDP RYRMFSNLGVCTLEIGKPSPYDGGTYCCKAVNDLGTVEIECKLEVKVIAQ" BASE COUNT 1192 a 802 c 941 g 898 t ORIGIN 1 gtgaattccg caccatctct ctcctgcctg tggggtttct gtcaactagt cgtggaggga 61 aggagactct ttaaagaata acatcttatt gtggccatgc cagaacccac taagaaagag 121 gaaaatgaag tgccagcccc agccccaccc ccggaagaac caagtaaaga gaaggaggcc 181 ggaactacac cagcaaaaga ctggaccctt gtcgaaactc ctcctgggga ggaacaagcc 241 aagcagaatg ccaactccca gctgtccatc ttgttcattg aaaaacctca aggaggaaca 301 gtgaaagttg gtgaagatat caccttcata gccaaagtca aggctgaaga tctttctgag 361 aaacccacta tcaatggttc aaggaaatgg atggacctgg ccagcaaagc cgggaagcac 421 cttcagctga aggaaacctt tgagaggcac agtcgggtgt acacatttga gatgcagatc 481 atcaaggcca aagataactt tgcaggaaat tacagatgcg aggtcaccta taaggataag 541 tttgacagct gttcatttga tcttgaagtg cacgaatcta ctgggactac tccaaacatt 601 gacatcagat ctgctttcaa gagaagtgga gaaggtcaag aggatgcagg agaacttgac 661 tttagtggtc tcctgaaacg tagggaggtg aagcagcagg aggaagaacc ccaggtggac 721 gtatgggagt tgctgaagaa caccaaaccc agtgagtacg agaagatcgc cttccagtat 781 gaatcaccga cctgcagcgg catgctgaag cgactcaagc gcagcatcag agaggagaag 841 aagagcgccg cttttgcaaa aattcttgat cctgtatatc aggttgacaa aggaggcaga 901 gtgaggtttg ttgtggagct ggcagatcca aagttggagg tgaaatggaa taaaaatggt 961 caagaacttc gacccagtac caaatacatc tttgaagaca caagatgcca gagcatcctg 1021 aatatcgata actgtcagat gacagatgat tcagagtatt atgtgacagc cggtgatgag 1081 aaatgttcta ctgagctctt agtaagagag cctccaatta tggtgaccaa acagctggaa 1141 gatacaactg attattgtgg ggagagagtg gaattagaat gtgaggtgtc tgaagatgat 1201 gcccaagtaa aatggtttaa gaatggtgaa gagattatcc tggtccaaac aagataccga 1261 attagagttg agggtaaaaa acacatcttg atcatagagg gagcaacaaa ggctgatgct 1321 gcagattatt cagtaatgac aacaggagga caatcatctg ctaaacttag tgttgacttg 1381 aaacctctga agattttgac acctctgact gatcagactg taaatcttgg aaaagaaatc 1441 tgcctgaagt gtgaaatctc tgaaaacata ccaggaaaat ggactaaaaa tggcctacct 1501 gttcaggaga gtgaccgtct aaaggtggtt cacaagggaa ggatccacaa gttagtgata 1561 gatcatgccc tcactgaaga tgaaggtgat tatgtatttg cacctgatgc ctacaatgtt 1621 actctgcctg ccaaagttca tgttattgat cctcctaaga tcatcctgga tggtcttgat 1681 gctgacaaca cagtgacagt gattgcagga aacaagcttc gtcttgagat ccccatcagc 1741 ggagaaccac ctcctaaagc catgtggagc cggggagata aggctattat ggaaggcagt 1801 ggccggataa gaacagaatc ttaccctgat agcagcactc tggtcattga tatagctgaa 1861 agagatgact ctggtgttta ccacatcaat ctgaaaaacg aagctggaga ggcacatgca 1921 agcatcaagg ttaaagttgt ggacttccct gatcctccag tggcaccgac tgtgacagag 1981 gtgggagatg actggtgtat catgaactgg gagcctcctg cctacgacgg aggctctcca 2041 atcctaggat attttattga gaggaagaag aaacaaagct ccaggtggat gaggctgaat 2101 tttgatctct gcaaagaaac aacttttgag cccaagaaga tgattgaagg tgtggcctat 2161 gaggtccgca tctttgcagt caatgccatt ggcatctcca agcccagtat gccctccagg 2221 ccttttgttc ctttggccgt aacaagccct cctactcttc tgactgtgga ctctgtcact 2281 gacacgactg tcacgatgag gtggcgcccc ccagaccaca ttggtgcagc aggtttagat 2341 ggctatgtgc tagagtattg ctttgaagga agtacatcag caaaacagtc tgatgaaaat 2401 ggggaggctg cctatgatct gccagctgag gactggatag ttgcaaacaa agatctgatt 2461 gacaagacga agttcaccat cacaggtctg ccaacagatg caaagatctt tgtgcgtgtg 2521 aaggctgtta atgcagctgg tgccagcgag cccaagtact attctcagcc cattctcgtg 2581 aaggaaatca tagaacctcc aaagatacac agtcccaagc acctgaagca aacatatatc 2641 cgccgagtag gagaccgtgt cattcttgtt atccctttcc agggaaaacc aagaccagaa 2701 ttaacttgga agaaggatgg tgcagaaatt gataagaatc aaataaacat tcgcaactct 2761 gagactgata caatcatatt tattagaaaa gcagagagga gccactctgg gaaatatgat 2821 ctgcaagtca aagtggacaa attcgtggag accgcatcaa ttgacatcag aatcattgac 2881 cgtccaggtc caccccaaat tgtgaagatt gaggatgtct gggggagaaa tgtcgctctc 2941 acatggactc caccaaagga tgatggaaat gctgctatca caggctatac cattcagaag 3001 gctgacaaga agagcatgga atggttacgt gtcattgagc atatcatcga accagtgcca 3061 catactgaat tggtcatagg gaatgaatat tacttccggg tcttttctga aaacatgtgt 3121 ggcctcagtg aggatgccac catgactaaa gagagtgcag tgatcgccag ggatggtaaa 3181 atctacaaaa atccagtgta tgaagacttt gatttctcag aggcacccat gtttactcag 3241 cctttggtta accgcctatg ccatagcggt tacatggcca ccctaaactg cagtgtgaga 3301 ggaaatccta agcctaaaat aacctggatg aaaaacaaag ttgctattgt ggatgatcca 3361 agatacagga tgttcagcaa cctgggagtc tgtaccctgg aaattggcaa gcccagccct 3421 tatgatggag gcacttactg ctgcaaagca gtcaatgacc ttgggacagt ggagattgaa 3481 tgcaaactgg aggtgaaagt cattgcacaa taaggatttt tggaatgtat aatatcatct 3541 aaggtgggct ctccttctgc agactcctct tgcaaggcgt acctccaaac ataattgatt 3601 gctatctgcg agacttacac tcaagcaatc ctgaggaata ctgagggagg gcctggctac 3661 tgtctctctg cactctgctg ctttgaaatc tggttgaaat gagaaaaagc attttctgtt 3721 ttcccaccag gcccccaagt gtggtctttt tctttcctcc taatgttgaa gagaaaaaaa 3781 aaaaaaaaaa agtttgccca gattgcttaa ttaaaaattg caaacaaaat ctc // LOCUS HSSMOOTHN 1619 bp RNA PRI 11-APR-1997 DEFINITION H.sapiens mRNA for smoothelin. ACCESSION Z49989 NID g1781010 KEYWORDS smoothelin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1619) AUTHORS van der Loop,F.T., Schaart,G., Timmer,E.D., Ramaekers,F.C. and van Eys,G.J. TITLE Smoothelin, a novel cytoskeletal protein specific for smooth muscle cells JOURNAL J. Cell Biol. 134 (2), 401-411 (1996) MEDLINE 96295554 REFERENCE 2 (bases 1 to 1619) AUTHORS van Eys,G.J. TITLE Direct Submission JOURNAL Submitted (30-JUN-1995) Van Eys G. J., University of Limburg, Maastricht, Molecular Cell Biology, Maastricht, The Netherlands, 6200 MD REMARK Revised by [3] REFERENCE 3 (bases 1 to 1619) AUTHORS van Eys,G.J. TITLE Direct Submission JOURNAL Submitted (14-JAN-1997) Van Eys G. J., University of Limburg, Maastricht, Molecular Cell Biology, Maastricht, The Netherlands, 6200 MD FEATURES Location/Qualifiers source 1..1619 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="smooth muscle" /cell_type="myocyte" CDS 312..1427 /codon_start=1 /product="smoothelin" /db_xref="PID:g1781011" /translation="MEPEPAEPLAAAVEAANGAERARVNKAPEGRRLSAEELMTIEDE GVLDKMLDQSTDFEERKLIRADFVSSDKGRETSGTRSGNGGCRRHGAGQGRGAATQPL RPPRGTAAGSDGSAVSTVTKTERLVHSNDGTRTARTTTVESSFVRRSENGSGSTMMQT KTFSSSSSSKKMGSIFDREDQRATGRHGRLESEKRQAEKKKELMKAQSLPKTSASQAR KAMIEKLEKEGAAGSPADPAQPCSDPPASGSPTPTASSRSSWTGVEPRLGAYEHVDIQ NFSSSWSDGMAFCALVHNFFPEAFDYGQLSPQNRRQNFEVAFSSAEMLVDYVPLVEVD DMMIMGKKPDPKCVFTYVQSLYNHLRRHELASRGKNV" BASE COUNT 385 a 501 c 492 g 241 t ORIGIN 1 ggcacgagcc cgctgcccgt ggccgtcgca ctgccgagcc agggggcagt atgaagacca 61 cattcaccat cgagatcaag gacggccgtg gcaggcctcc acaggccggg tgctgctgcc 121 cacaggcaac cagagggcag aactgacact ggggctgcgg gcgccccgac cctactcagc 181 accagtagtg gggcaagagc accatcaccc gtgtcaacag ccctgggacc ctggctcggc 241 tgggcagtgt cactcatgtc accagcttca gccatgcccc ccccagtagc cgaggaggct 301 gcagcatcaa gatggaacca gagccagcag agcctctcgc tgcagcagtg gaagcggcca 361 atggggctga gcgagcccga gtgaacaaag caccagaagg gcggcgtctg agcgctgagg 421 agctgatgac tattgaggat gaaggagtct tggacaagat gctggatcag agcacggact 481 ttgaagagcg gaagctcatc cgggctgact tcgtgagctc cgacaaagga agagagacca 541 gcgggacaag gagcgggaac ggcggctgca ggaggcacgg ggccggccag gggaggggcg 601 cggcaacaca gccactgaga ccaccacgag gcacagcagc gggcagcgat ggctctgctg 661 tcagcactgt taccaagact gagcggctcg tccactccaa tgatggcaca cggacggccc 721 gcaccaccac agtggagtcg agtttcgtga ggcgctcgga gaatggcagt ggcagcacca 781 tgatgcaaac caagaccttc tcctcttcct cctcatccaa gaagatgggc agcatcttcg 841 accgcgagga ccagcgagcc acgggccgcc atggccggct cgagagtgag aaacggcagg 901 ccgagaagaa gaaagagctg atgaaggcgc agagtctgcc caagacctca gcctcccagg 961 cgcgcaaggc catgattgag aagctggaga aggagggcgc ggccggcagc cctgcggacc 1021 ccgcgcagcc gtgcagcgat ccaccagctt cggggtcccc aacgccaaca gcatcaagca 1081 gatcgtcgtg gactggtgtc gagccaagac tcggggccta cgagcacgtc gacatccaga 1141 acttctcctc cagctggagt gatgggatgg ccttctgtgc cctggtgcac aacttcttcc 1201 ctgaggcctt cgactatggg cagcttagcc ctcagaaccg acgccagaac ttcgaggtgg 1261 ccttctcatc tgcggagatg ctggtggact atgtgcccct ggtggaggtg gacgacatga 1321 tgatcatggg caagaagcct gaccccaagt gtgtcttcac ctatgtgcag tcgctctaca 1381 accacctgcg acgccacgaa ctggcctcgc gcggcaagaa tgtctagcct gcccgcccgc 1441 atggccagcc agtggcaact gccgccccca ctctccgggc accgtctcct gcctgtgcgt 1501 ccgcccaccg ctgccctgtc tgttgcgaca ccctcccccc cacatacaca cgcagcgttt 1561 tgataaatta ttggttttca acgaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaa // LOCUS HSSMT3A 1733 bp RNA PRI 19-MAR-1997 DEFINITION H.sapiens mRNA for SMT3A protein. ACCESSION X99584 NID g1770516 KEYWORDS SMT3A gene; suppressor; ubiquitin-like protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1733) AUTHORS Lapenta,V., Chiurazzi,P., van der Spek,P., Pizzuti,A., Hanaoka,F. and Brahe,C. TITLE SMT3A, a human homologue of the S. cerevisiae SMT3 gene, maps to chromosome 21qter and defines a novel gene family JOURNAL Genomics 40 (2), 362-366 (1997) MEDLINE 97237059 REFERENCE 2 (bases 1 to 1733) AUTHORS Chiurazzi,P. TITLE Direct Submission JOURNAL Submitted (26-JUL-1996) P. Chiurazzi, Universita' Cattolica - Roma, largo F. Vito 1, I- 00168 Roma, ITALY FEATURES Location/Qualifiers source 1..1733 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="foetus" /tissue_type="brain" /chromosome="21" gene 95..406 /gene="SMT3A" CDS 95..406 /gene="SMT3A" /function="suppressor of MIF2 which encodes a centromere protein" /note="ubiquitin-like protein" /codon_start=1 /product="SMT3A protein" /db_xref="PID:e274634" /db_xref="PID:g1770517" /translation="MSEEKPKEGVKTENDHINLKVAGQDGSVVQFKIKRHTSLSKLMK AYCERQGLSMRQIRFRFDGQPINETDTPAQLRMEDEDTIDVFQQQTGGVPESSLAGHS F" BASE COUNT 407 a 404 c 426 g 495 t 1 others ORIGIN 1 ttcggcacag gcgggaganc ggcggggccg aagcgtgaac tcgcccgctc cggcttgctt 61 cccccgcgcc gcctccccgc gccgctcgga agccatgtcc gaggagaagc ccaaggaggg 121 tgtgaagaca gagaatgacc acatcaacct gaaggtggcc gggcaggacg gctccgtggt 181 gcagttcaag atcaagaggc acacgtcgct gagcaagctg atgaaggcct actgcgagag 241 gcagggcttg tcaatgaggc agatcagatt caggttcgac gggcagccaa tcaatgaaac 301 tgacactcca gcacagctga gaatggagga cgaggacacc atcgacgtgt tccagcagca 361 gacgggaggt gtgccggaga gcagcctggc agggcacagt ttctagaggg cccgtcccca 421 gcccgggccg tccatcctcg cattgctgtt gaatggtgag cacgtgacca tgccgaccac 481 aaaggtgtct gcggaaactc gaggacattc accacgatga ttttcctctc tttgatgtac 541 ttcaagtgca actcaaaact atatctgcag ggatgaatct gtaacttaaa ttgggccaat 601 cagaattgtt atctttgttc aggtaaaatg agttgcaaga tattgtgggt acttttgtgt 661 gctcatttgt gttttccccc cctcctacaa cattttttta accccaaaat tatagcctga 721 atgttcgctt ttagtctggc cagggatctg actcctgagt tggttgcctc tcccctgctc 781 actccagtca catagagaat tggtgtttcc cgcagtgggg attgcagctg ttggacaggt 841 attgggggca aggttggtag ggaggacaga ctgtcacttg ctgttacagg cacaggtgat 901 taaaatgcta aatattgcaa atttaagctt tgtcagtata tggaaaagtt gaagggaaaa 961 tactggaatg cttcttcaaa ggttaaaaaa taaccgagtc ttttggtaat ttgaccccac 1021 gtgctctctg gccctcaagc atgtaacctc ggggtctgag gcccaggacc cacccccctg 1081 ccacccctcc caccccactc cctgctcagt acctggcgtt ggtacacagg caaggattgg 1141 cacaaccaaa attggccttt ttctccctct taatattgaa gaaattccca catttctcat 1201 ttggtaatgg tgttgtggcc tcagatttct tctagtattt gcttctgatg aatgattatg 1261 gtctatacat aaaaaagtaa gactaagtat tgctgaattt gcagttatgt tgtcgtgtat 1321 aagagctact tccaagtgtg gttacaaatg aacccatgga atgatgactt catgttcttc 1381 tcgtgggttt gtgccgtgct gctttccaaa taggtattga atttatgcat tagtctggtg 1441 atttcagttc tgtgaaatat tttgggatct ataccaatta aacattttca tagttctgcc 1501 tattgtcctt ccctgaggct ccattgctgc ttggtggcca ttctctgcct ttttacagtc 1561 acctgaacaa tgacccatca tctcttgctt gcttgaaatc ttgctgaaat gttctcattt 1621 cctgtttgct gtatgggctc gggtgggatg tttgttggct ctgttgtgtt tattcaccaa 1681 tttgtacatt atttgttgtc ctttactact gtaaacagta aatatagttt ggt // LOCUS HSSMT3C 590 bp RNA PRI 19-MAR-1997 DEFINITION H.sapiens mRNA for SMT3C protein. ACCESSION X99586 NID g1770520 KEYWORDS SMT3C gene; suppressor; ubiquitin-like protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 590) AUTHORS Lapenta,V., Chiurazzi,P., van der Spek,P., Pizzuti,A., Hanaoka,F. and Brahe,C. TITLE SMT3A, a human homologue of the S. cerevisiae SMT3 gene, maps to chromosome 21qter and defines a novel gene family JOURNAL Genomics 40 (2), 362-366 (1997) MEDLINE 97237059 REFERENCE 2 (bases 1 to 590) AUTHORS Chiurazzi,P. TITLE Direct Submission JOURNAL Submitted (26-JUL-1996) P. Chiurazzi, Universita' Cattolica - Roma, largo F. Vito 1, I- 00168 Roma, ITALY FEATURES Location/Qualifiers source 1..590 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="foetus" /tissue_type="brain" gene 42..347 /gene="SMT3C" CDS 42..347 /gene="SMT3C" /function="suppressor of MIF2 which encodes a centromere protein" /note="ubiquitin-like protein" /codon_start=1 /product="SMT3C protein" /db_xref="PID:e274603" /db_xref="PID:g1770521" /translation="MSDQEAKPSTEDLGDKKEGEYIKLKVIGQDSSEIHFKVKMTTHL KKLKESYCQRQGVPMNSLRFLFEGQRIADNHTPKELGMEEEDVIEVYQEQTGGHSTV" BASE COUNT 168 a 113 c 126 g 170 t 13 others ORIGIN 1 ccgctgctgt gcggagaccc ccgggtgaag ccaccgtcat catgtctgac caggaggcaa 61 aaccttcaac tgaggacttg ggggataaga aggaaggtga atatattaaa ctcaaagtca 121 ttggacagga tagcagtgag attcacttca aagtgaaaat gacaacacat ctcaagaaac 181 tcaaagaatc atactgtcaa agacagggtg ttccaatgaa ttcactcagg tttctctttg 241 agggtcagag aattgctgat aatcatactc caaaagaact gggaatggag gaagaagatg 301 tgattgaagt ttatcaggaa caaacggggg gtcattcaac agtttagata ttctttttat 361 tttttttctt ttccctcaat ccttttttat tttttaaaaa taggtccttt tgtaatgtgg 421 gtgttcaaaa ccggnatttg aaactnggca ccccanctct tttggaaaca nctgggaatt 481 tggattccta gtgncccatt atncattaat tgggttgggt tncattgggc tgntttttgg 541 gngaccaanc ctcaggnccc cttcaaaatt ancccccccc ttttttnaaa // LOCUS HSSNAP23A 636 bp RNA PRI 16-DEC-1997 DEFINITION Homo sapiens mRNA for SNAP23A protein. ACCESSION Y09567 NID g1924941 KEYWORDS fusion protein; SNAP-23 protein; snap23A gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 636) AUTHORS Mollinedo,F. and Lazo,P.A. TITLE Identification of two isoforms of the vesicle-membrane fusion protein SNAP-23 in human neutrophils and HL-60 cells JOURNAL Biochem. Biophys. Res. Commun. 231 (3), 808-812 (1997) MEDLINE 97224437 REFERENCE 2 (bases 1 to 636) AUTHORS Mollinedo,F. TITLE Direct Submission JOURNAL Submitted (21-NOV-1996) F. Mollinedo, Instituto de Biologia y Genetica Molecular, Facultad de Medicina, CSIC-Universidad de Valladolid, C/Ramon y Cajal 7, E-47005 Valladolid, SPAIN COMMENT Related sequence U55936. FEATURES Location/Qualifiers source 1..636 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="neutrophils" /cell_line="HL-60" gene 1..636 /gene="SNAP23A" CDS 1..636 /gene="SNAP23A" /codon_start=1 /product="SNAP23A protein" /db_xref="PID:e290695" /db_xref="PID:g1924942" /translation="MDNLSSEEIQQRAHQITDESLESTRRILGLAIESQDAGIKTITM LDEQKEQLNRIEEGLDQINKDMRETEKTLTELNKCCGLCVCPCNRTKNFESGKAYKTT WGDGGENSPCNVVSKQPGPVTNGQLQQPTTGAASGGYIKRITNDAREDEMEENLTQVG SILGNLKDMALNIGNEIDAQNPQIKRITDKADTNRDRIDIANARAKKLIDS" BASE COUNT 239 a 124 c 150 g 123 t ORIGIN 1 atggataatc tgtcatcaga agaaattcaa cagagagctc accagattac tgatgagtct 61 ctggaaagta cgaggagaat cctgggttta gccattgagt ctcaggatgc aggaatcaag 121 accatcacta tgctggatga acaaaaggaa caactaaacc gcatagaaga aggcttggac 181 caaataaata aggacatgag agagacagag aagactttaa cagaactcaa caaatgctgt 241 ggcctttgtg tctgcccatg taatagaaca aagaactttg agtctggcaa ggcttataag 301 acaacatggg gagatggtgg agaaaactca ccttgcaatg tagtatctaa acagccaggc 361 ccggtgacaa atggtcagct tcagcaacca acaacgggag cagccagtgg tggatacatt 421 aaacgcataa ctaatgatgc cagagaagat gaaatggaag agaacctgac tcaagtgggc 481 agtatcctgg gaaatctaaa agacatggcc ctgaacatag gcaatgagat tgatgctcaa 541 aatccacaaa taaaacgaat cacagacaag gctgacacca acagagatcg tattgatatt 601 gccaatgcca gagcaaagaa actcattgac agctaa // LOCUS HSSNAP43 1131 bp RNA PRI 17-NOV-1995 DEFINITION H.sapiens mRNA for SNAP43. ACCESSION Z47542 NID g623243 KEYWORDS SNAP43. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1131) AUTHORS Henry,R.W., Sadowski,C.L., Kobayashi,R. and Hernandez,N. TITLE A TBP-TAF complex required for transcription of human snRNA genes by RNA polymerase II and III JOURNAL Nature 374 (6523), 653-656 (1995) MEDLINE 95231630 REFERENCE 2 (bases 1 to 1131) AUTHORS Henry,R.W. TITLE Direct Submission JOURNAL Submitted (10-JAN-1995) Ronald W. Henry, Cold Spring Harbor Laboratory, P.O. Box 100, Cold, Spring Harbor, New York, 11724, U.S.A FEATURES Location/Qualifiers source 1..1131 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="teratocarcinoma" /cell_line="NTera2D1" /clone_lib="lambda gt10" CDS 25..1131 /function="transcription of human snRNA genes" /codon_start=1 /product="SNAP43" /db_xref="PID:g623244" /translation="MGTPPGLQTDCEALLSRFQETDSVRFEDFTELWRNMKFGTIFCG RMRNLEKNMFTKEALALAWRYFLPPYTFQIRVGALYLLYGLYNTQLCQPKQKIRVALK DWDEVLKFQQDLVNAQHFDAAYIFRKLRLDRAFHFTAMPKLLSYRMKKKIHRAEVTEE FKDPSDRVMKLITSDVLEEMLNVHDHYQNMKHVISVDKSKPDKALSLIKDDFFDNIKN IVLEHQQWHKDRKNPSLKSKTNDGEEKMEGNSQETERCERAESLAKIKSKAFSVVIQA SKSRRHRQVKLDSSDSDSASGQGQVKATRKKEKKERLKPAGRKMSLRNKGNVQNIHKE DKPLSLSMPVITEEEENESLSGTEFTASKKRRKH" BASE COUNT 402 a 200 c 259 g 270 t ORIGIN 1 cggagcgtgc gggcttcggg tgccatgggg actcctcccg gcctgcagac cgactgcgag 61 gcgctgctca gccgcttcca ggagacggac agtgtacgct tcgaggactt cacggagctc 121 tggagaaaca tgaagttcgg gactatcttc tgtggcagaa tgagaaattt agaaaagaac 181 atgtttacaa aagaagcttt agctttggct tggcgatatt ttttacctcc atacaccttc 241 cagatcagag ttggtgcttt gtatctgcta tatggattat ataataccca actgtgtcaa 301 ccaaaacaaa agatcagagt tgccctgaag gattgggatg aagttttaaa atttcagcaa 361 gatttagtaa atgcacagca ttttgatgca gcttatattt ttaggaagct acgactagac 421 agagcatttc actttacagc aatgcccaaa ttgctgtcat ataggatgaa gaaaaaaatt 481 caccgagctg aagttacaga agaatttaag gacccaagtg atcgtgtgat gaaacttatc 541 acttctgatg tattagagga aatgctgaat gttcatgatc attatcagaa catgaaacat 601 gtaatttcag ttgataagtc caagccagat aaagccctca gcttgataaa ggatgatttt 661 tttgacaata ttaagaacat agttttggag catcagcagt ggcacaaaga cagaaagaat 721 ccatccttaa agtcaaaaac taatgatgga gaagaaaaaa tggaaggaaa ttcacaagaa 781 acggagagat gtgaaagggc agaatcatta gcgaaaataa aatcaaaggc cttttcagtt 841 gtcatacagg catccaaatc aagaaggcat cgtcaagtca aactcgactc ttctgactct 901 gattctgcat ctggtcaagg gcaagtcaaa gcaactagga aaaaagagaa gaaagaaaga 961 ttgaaaccag caggaaggaa gatgtctctc agaaacaaag gcaatgtgca gaatatacac 1021 aaggaagata aacctttaag tctgagtatg cctgtaatta cagaagaaga agagaatgaa 1081 agtttgagtg gaacagagtt cactgcatcc aagaagagga gaaaacactg a // LOCUS HSSNOAR 2875 bp RNA PRI 12-SEP-1993 DEFINITION Human sno oncogene mRNA for snoA protein, ski-related. ACCESSION X15217 NID g36508 KEYWORDS alternative splicing; Alu repetitive sequence; oncogene; repetitive sequence; sno oncogene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2875) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (08-MAY-1989) Nomura N., Molecular Oncology Laboratory, Nippon Medical School, Sakuragi, 1-10-19 Uenosakuragi Taito-ku, Tokyo 110, JAPAN REFERENCE 2 (bases 1 to 2875) AUTHORS Nomura,N., Sasamoto,S., Ishii,S., Date,T., Matsui,M. and Ishizaki,R. TITLE Isolation of human cDNA clones of ski and the ski-related gene, sno JOURNAL Nucleic Acids Res. 17 (14), 5489-5500 (1989) MEDLINE 89345144 COMMENT see x15219 for snoN cDNA. Data kindly reviewed (15-SEP-1989) by Nomura N. FEATURES Location/Qualifiers source 1..2875 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="umbilicord vein" /cell_type="endothelial cells." variation 302 /note="g is a in variant clones" CDS 710..1957 /note="snoA protein (AA 1 - 415)" /codon_start=1 /db_xref="PID:g36509" /db_xref="SWISS-PROT:P12756" /translation="MENLQTNFSLVQGSTKKLNGMGDDGSPPAKKMITDIHVNGKTIN KVPTVKKEHLDDYGEAPVETDGEHVKRTCTSVPETLHLNPSLKHTLAQFHLSSQSSLG GPAAFSARHSQESMSPTVFLPLPSPQVLPGPLLIPSDSSTELTQTVLEGESISCFQVG GEKRLCLPQVLNSVLREFTLQQINTVCDELYIYCSRCTSDQLHILKVLGILPFNAPSC GLITLTDAQRLCNALLRPRTFPQNGSVLPAKSSLAQLKETGSAFEVEHECLGKCQGLF APQFYVQPDAPCIQCLECCGMFAPQTFVMHSHRSPDKRTCHWGFESAKWHCYLHVNQK YLGTPEEKKLKIILEEMKEKFSMRSGKRNQSKASFLYQFLIMVMVYFEMKILCLVCNL TCMLNIAHATTTKYRLIYLYCSF" misc_feature 1806..1807 /note="alternative splicing point" misc_feature 2131..2294 /note="Alu repeat" polyA_site 2590 /note="3' end of variant clone with 10 additional A residues" BASE COUNT 847 a 563 c 598 g 867 t ORIGIN 1 ggtttcaaat tggccctttg gcctctggag caaattcaaa tgtaactctt ccccaatccc 61 ccttctcttc ttccagatta attaaaagaa gaatgaacta taatccttga agataactgg 121 gcaatttttt aagtcggagg ctgttcttac tggtgtgagg atttacacac gtcttcagtt 181 tttcagcaca gaccagcaga ccatcatttt tagaggaaat actccctctg ccctcctttt 241 tggtttcctt ggtggtaaag attaaatttg gttgcatcat tttgacttgt gtttgagtct 301 agattttatg gcacaaggaa tggcataaac ttttcatgtg ttttggttaa aacaaaccag 361 accattgcat tgaccctgga catctttaat tgagaaattg gtaactttat tttaatatgt 421 atatctgaag aattcaagaa aacaaaggca tcctcagagg tgtgcctctt ttctttatta 481 ttagaggcaa aacgaacaat tttataggat ttgtagtgaa attataccag attataagga 541 gaaccaaaac taagtcgcaa aatttattaa tttaaggggc tctcgctttg aaagtttgag 601 agtaagttac gataggcatt tgtatccatt cattactttc ctcttttcaa ataagcaact 661 aaatagaaat gctaatctca gacttaatta tttaacagaa gagtgtacca tggaaaacct 721 ccagacaaat ttctccttgg ttcagggctc aactaaaaaa ctgaatggga tgggagatga 781 tggcagcccc ccagcgaaaa aaatgataac ggacattcat gtaaatggaa aaacgataaa 841 caaggtgcca acagttaaga aggaacactt ggatgactat ggagaagcac cagtggaaac 901 tgatggagag catgttaagc gaacctgtac ttctgttcct gaaactttgc atttaaatcc 961 cagtttgaaa cacacattgg cacaattcca tttaagtagt cagagctcgc tgggtggacc 1021 agcagcattt tctgctcggc attcccaaga aagcatgtcg cctactgtat ttctgcctct 1081 tccatcacct caggttcttc ctggcccatt gctcatccct tcagatagct ccacagaact 1141 cactcagact gtgttggaag gggaatctat ttcttgtttt caagttggag gagaaaagag 1201 actctgtttg ccccaagtct taaattctgt tctccgagaa tttacactcc agcaaataaa 1261 tacagtgtgt gatgaactgt acatatattg ttcaaggtgt acttcagacc agcttcatat 1321 cttaaaggta ctgggcatac ttccattcaa tgccccatcc tgtgggctga ttacattaac 1381 tgatgcacaa agattatgta atgctttatt gcggccacga acttttcctc aaaatggtag 1441 cgtacttcct gctaaaagct cattggccca gttaaaggaa actggcagtg cctttgaagt 1501 ggagcatgaa tgcctaggca aatgtcaggg tttatttgca ccccagtttt atgttcagcc 1561 tgatgctccg tgtattcaat gtctggagtg ttgtggaatg tttgcacccc agacgtttgt 1621 gatgcattct cacagatcac ctgacaaaag aacttgccac tggggctttg aatcagctaa 1681 atggcattgc tatcttcatg tgaaccaaaa atacttagga acacctgaag aaaagaaact 1741 gaagataatt ttagaagaaa tgaaggagaa gtttagcatg agaagtggaa agagaaatca 1801 atccaaggca agttttttat atcaattttt aataatggta atggtttact ttgaaatgaa 1861 aattctatgt ttagtgtgta acttaacctg tatgttgaac attgctcatg caacaacaac 1921 aaaataccga ttgatatatt tgtattgcag tttttaggcc ataaagtgct ttgcagtatg 1981 tttcctcatt tgactttcca aacatcctgt gagagaagta agactattat tccgttttac 2041 agataaagtg aatgaagctc agagagataa aatgactttc ccaaaattat gtagccaggg 2101 agtggaggag ttagggcttc tttttttttt tttttgtgct tttagtagag gccaggtttc 2161 agcatgttgg ccaggctggt cttgaactcc tgaccgcgtg atccgcccac cttggcctcc 2221 caaagggctg ggattacatc cttgagcccc tgtgtccagc cagggcttct ttttcttatc 2281 ctctttggca cacatcttgc ttcttgacca ctacatctgt tgtttttcta ggactcgata 2341 atttgcgctt tggtgttatc tccatttgca aatggtacaa tggccacaat tcccgtgggc 2401 tcaaaacagc atttttcaga gatacaccta tgatttctga tgtttctatg tttggatatt 2461 caggcttgct caatatttga aacaaatgga aaagacatgt atctgaagaa tttgtgattt 2521 gaaaggaata acaaaaaaaa tgacagctag agtaaggaaa agttatttta aactaataaa 2581 atattaatat aaaaacctgc cgggctcagt ggctcacacc tgtaatccca acactttggg 2641 gggctgaagt aggtggatca cctgaggtca ggagtttgag accagcctgg ccaacatggt 2701 gaaatcccat ctctgctgaa aatacaaaaa ttagacggat gtggtgtcgc acacttgtaa 2761 tcccagctac tcaggagctg aggcaggaga atcgcttgaa ccccggaggc ggaggttgta 2821 gtgagccgag attgtgccat tgcgctccag cgtaggcgtc gagggaaact ccatc // LOCUS HSSOD15 800 bp DNA PRI 12-SEP-1993 DEFINITION Human superoxide dismutase (SOD-1) gene exon 5 and 3' flanking region. ACCESSION X01784 X01662 NID g36529 KEYWORDS superoxide dismutase; unidentified reading frame. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 800) AUTHORS Levanon,D., Lieman-Hurwitz,J., Dafni,N., Wigderson,M., Sherman,L., Bernstein,Y., Laver-Rudich,Z., Danciger,E., Stein,O. and Groner,Y. TITLE Architecture and anatomy of the chromosomal locus in human chromosome 21 encoding the Cu/Zn superoxide dismutase JOURNAL EMBO J. 4 (1), 77-84 (1985) MEDLINE 85257452 REFERENCE 2 (bases 1 to 800) AUTHORS Sherman,L., Levanon,D., Lieman-Hurwitz,J., Dafni,N. and Groner,Y. TITLE Human Cu/Zn superoxide dismutase gene: molecular characterization of its two mRNA species JOURNAL Nucleic Acids Res. 12 (24), 9349-9365 (1984) MEDLINE 85087942 FEATURES Location/Qualifiers source 1..800 /organism="Homo sapiens" /db_xref="taxon:9606" intron <1..158 /note="intron IV" mRNA 159..578 /note="exon 5 (0.9 kb SOD-1 mRNA)" mRNA 159..365 /note="exon 5 (0.7 kb SOD-1 mRNA)" CDS 159..263 /note="superoxide dismutase (aa 120-154); Author-given protein sequence is in conflict with the conceptual translation" /codon_start=1 /db_xref="PID:e4994" /db_xref="PID:g1335318" /db_xref="SWISS-PROT:P00441" /translation="VHEKADDLGKGGNEESTKTGNAGSRLACGVIGIAQ" misc_feature 262..266 /note="pot. polyadenylation signal" repeat_region 263..271 /note="9 bp direct repeat" CDS 328..348 /note="short open reading frame 1" /codon_start=1 /db_xref="PID:g580486" /translation="MYPDKH" repeat_region 338..346 /note="9 bp direct repeat" misc_feature 344..349 /note="pot. polyadenylation signal" polyA_site 365 /note="polyA-site of 0.7 kb SOD-1 mRNA" CDS 425..457 /note="short open reading frame 2" /codon_start=1 /db_xref="PID:g36531" /translation="MITWKICIVL" CDS 470..559 /note="short open reading frame 3" /codon_start=1 /db_xref="PID:g36532" /translation="MSVSMTCILPDLNHRWVLNLSEFLCHSSL" misc_feature 520..525 /note="pot. polyadenylation signal" misc_feature 559..564 /note="pot. polyadenylation signal" CDS 573..623 /note="short open reading frame 4" /codon_start=1 /db_xref="PID:g36533" /translation="MALIMRLLKESKFKLN" polyA_site 578 /note="polyA-site of 0.9 kb SOD-1 mRNA" misc_feature 593..598 /note="pot. polyadenylation signal" CDS 721..750 /note="short open reading frame 5" /codon_start=1 /db_xref="PID:g580487" /translation="MHTLKQQVF" BASE COUNT 257 a 123 c 146 g 274 t ORIGIN 1 gtttctgctt ttaaactact aaatattagt atatctctct actaggatta atgttatttt 61 tctaatatta tgaggttctt aaacatcttt tgggtattgt tgggaggagg tagtgattac 121 ttgacagccc aaagttatct tcttaaaatt ttttacaggt ccatgaaaaa gcagatgact 181 tgggcaaagg tggaaatgaa gaaagtacaa agacaggaaa cgctggaagt cgtttggctt 241 gtggtgtaat tgggatcgcc caataaacat tcccttggat gtagtctgag gccccttaac 301 tcatctgtta tcctgctagc tgtagaaatg tatcctgata aacattaaac actgtaatct 361 taaaagtgta attgtgtgac tttttcagag ttgctttaaa gtacctgtag tgagaaactg 421 atttatgatc acttggaaga tttgtatagt tttataaaac tcagttaaaa tgtctgtttc 481 aatgacctgt attttgccag acttaaatca cagatgggta ttaaacttgt cagaatttct 541 ttgtcattca agcctgtgaa taaaaaccct gtatggcact tattatgagg ctattaaaag 601 aatccaaatt caaactaaat tagctctgat acttatttat ataaacagct tcagtggaac 661 agatttagta atactaacag tgatagcatt ttattttgaa agtgttttga gaccatcaaa 721 atgcatactt taaaacagca ggtcttttag ctaaaactaa cacaactctg cttagacaaa 781 taggctgtcc tttgaagctt // LOCUS HSSODR1 874 bp RNA PRI 28-JAN-1995 DEFINITION Human mRNA for Cu/Zn superoxide dismutase (SOD). ACCESSION X02317 K00065 NID g36541 KEYWORDS dismutase; superoxide dismutase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 874) AUTHORS Hallewell,R.A., Masiarz,F.R., Najarian,R.C., Puma,J.P., Quiroga,M.R., Randolph,A., Sanchez-Pescador,R., Scandella,C.J., Smith,B., Steimer,K.S. and Mullenbach,G.T. TITLE Human Cu/Zn superoxide dismutase cDNA: isolation of clones synthesising high levels of active or inactive enzyme from an expression library JOURNAL Nucleic Acids Res. 13 (6), 2017-2034 (1985) MEDLINE 85215596 REFERENCE 2 (bases 1 to 560) AUTHORS Sherman,L., Dafni,N., Lieman-Hurwitz,J. and Groner,Y. TITLE Nucleotide sequence and expression of human chromosome 21-encoded superoxide dismutase mRNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80 (18), 5465-5469 (1983) MEDLINE 83299994 COMMENT Bases 1 to 95 are derived fromm a genomic library Data kindly reviewed (12-MAY-1986) by G. Mullenbach. FEATURES Location/Qualifiers source 1..874 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 65..529 /note="superoxide dismutase (aa 1-154)" /codon_start=1 /db_xref="PID:g36542" /db_xref="SWISS-PROT:P00441" /translation="MATKAVCVLKGDGPVQGIINFEQKESNGPVKVWGSIKGLTEGLH GFHVHEFGDNTAGCTSAGPHFNPLSRKHGGPKDEERHVGDLGNVTADKDGVADVSIED SVISLSGDHCIIGRTLVVHEKADDLGKGGNEESTKTGNAGSRLACGVIGIAQ" misc_feature 822..827 /note="pot. polyA signal" polyA_site 874 /note="polyA site" BASE COUNT 248 a 162 c 225 g 239 t ORIGIN 1 ctgcagcgtc tggggtttcc gttgcagtcc tcggaaccag gacctcggcg tggcctagcg 61 agttatggcg acgaaggccg tgtgcgtgct gaagggcgac ggcccagtgc agggcatcat 121 caatttcgag cagaaggaaa gtaatggacc agtgaaggtg tggggaagca ttaaaggact 181 gactgaaggc ctgcatggat tccatgttca tgagtttgga gataatacag caggctgtac 241 cagtgcaggt cctcacttta atcctctatc cagaaaacac ggtgggccaa aggatgaaga 301 gaggcatgtt ggagacttgg gcaatgtgac tgctgacaaa gatggtgtgg ccgatgtgtc 361 tattgaagat tctgtgatct cactctcagg agaccattgc atcattggcc gcacactggt 421 ggtccatgaa aaagcagatg acttgggcaa aggtggaaat gaagaaagta caaagacagg 481 aaacgctgga agtcgtttgg cttgtggtgt aattgggatc gcccaataaa cattcccttg 541 gatgtagtct gaggcccctt aactcatctg ttatcctgct agctgtagaa atgtatcctg 601 ataaacatta aacactgtaa tcttaaaagt gtaattgtgt gactttttca gagttgcttt 661 aaagtacctg tagtgagaaa ctgatttatg atcacttgga agatttgtat agttttataa 721 aactcagtta aaatgtctgt ttcaatgacc tgtattttgc cagacttaaa tcacagatgg 781 gtattaaact tgtcagaatt tctttgtcat tcaagcctgt gaataaaaac cctgtatggc 841 acttattatg aggctattaa aagaatccaa attc // LOCUS HSSOP2PLP 1157 bp RNA PRI 14-JAN-1997 DEFINITION H.sapiens mRNA for Sop2p-like protein. ACCESSION Y08999 NID g1654001 KEYWORDS sop2+ gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1157) AUTHORS Balasubramanian,M.K., Feoktistova,A., McCollum,D. and Gould,K.L. TITLE Fission yeast Sop2p: a novel and evolutionarily conserved protein that interacts with Arp3p and modulates profilin function JOURNAL EMBO J. 15 (23), 6426-6437 (1996) MEDLINE 97133273 REFERENCE 2 (bases 1 to 1157) AUTHORS Balasubramanian,M.K. TITLE Direct Submission JOURNAL Submitted (24-OCT-1996) M.K. Balasubramanian, Howard Hughes Medical Institute, Department of Cell Biology, 802 Light Hall, Vanderbilt University School of Medicine, Nashville, Tn-37232, USA FEATURES Location/Qualifiers source 1..1157 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" gene 34..1146 /gene="sop2+" CDS 34..1146 /gene="sop2+" /codon_start=1 /product="Sop2p-like protein" /db_xref="PID:e276923" /db_xref="PID:g1654002" /translation="MSLHQFLLEPITCHAWNRDRTQIALSPNNHEVHIYKKNGSQWVK AHELKEHNGHITGIDWAPKSDRIVTCGADRNAYVWSQKDGVWKPTLVILRINRAATFV KWSPLENKFAVGSGARLISVCYFEAENDWWVSKHIKKPIRSTVLSLDWHPNNVLLAAG SCDFKCRVFSAYIKEVDEKKASTPWGSKMPFGQLMSEFGGSGTGGWVHGVSFSASGSR LAWVSHDSTVSVADASKSVQVSTLKTEFLPLLSVSFVSENSVVAAGHDCCPMLFIYDD RGCLTFVSKLDIPKQSIQRNMSAMERFRNMDKRATTEDRNTALETLHQNSITQVSIYE VDKQDCRKFCTTGIDGAMTIWDFKTLESSIQGLRIM" BASE COUNT 297 a 286 c 301 g 273 t ORIGIN 1 ccagctttct ctcctttgaa aacactaaga ataatgtcac tgcatcagtt tttactagag 61 ccaatcacct gtcatgcctg gaacagggat cgtactcaga ttgccctcag tcccaataat 121 cacgaagtgc acatctataa gaagaacggg agccagtggg tgaaagctca tgaactcaag 181 gagcacaacg gacacatcac aggtattgac tgggctccca agagcgaccg tattgtcact 241 tgtggggcag accgcaatgc ctatgtctgg agtcagaaag atggtgtttg gaagccaacc 301 ctggtgatcc tgagaattaa tcgcgcagct acttttgtga agtggtcccc cctagagaac 361 aaatttgctg tgggaagtgg agcacgactc atttctgttt gttactttga ggctgaaaat 421 gactggtggg tgagcaagca cattaaaaag ccgattcgct ccacagtcct cagcttggat 481 tggcatccca acaacgtttt gctggcagca ggatcatgtg acttcaaatg cagagtgttt 541 tctgcctaca ttaaagaagt ggatgaaaag aaagccagca cgccctgggg cagcaagatg 601 ccttttgggc agctgatgtc agagtttggt ggcagtggca ctggtggctg ggtccacggg 661 gtaagcttct ctgccagtgg gagccgcctg gcctgggtca gccacgacag caccgtgtct 721 gttgctgatg cctcaaaaag tgtgcaggtc tcgactctga agacagagtt cctgccgctc 781 ctaagtgtgt catttgtctc agagaacagc gtcgtggctg ctggccatga ctgctgccca 841 atgctcttta tctacgatga ccgcggctgc ctgaccttcg tctccaagtt agatattcca 901 aaacagagca tccaacgcaa catgtctgcc atggaacgct tccgcaacat ggacaagaga 961 gccacaactg aggaccgcaa cacggccttg gagacgctgc accagaatag catcactcaa 1021 gtctctattt atgaggtgga caagcaagat tgtcgcaaat tttgcactac tggcatcgat 1081 ggagccatga caatttggga tttcaagacc ctcgagtctt ccatccaggg cctccggata 1141 atgtgaagct gagtgga // LOCUS HSSOX4M 2797 bp RNA PRI 03-FEB-1994 DEFINITION H.sapiens mRNA for SOX-4 protein. ACCESSION X70683 NID g36552 KEYWORDS sox-4 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2797) AUTHORS Farr,C.J. TITLE Direct Submission JOURNAL Submitted (12-JAN-1993) C.J. Farr, Dept of Genetics, University of Cambridge, Downing Street, Cambridge CB2 3EH, UK REFERENCE 2 (bases 1 to 2797) AUTHORS Farr,C.J., Easty,D.J., Ragoussis,J., Collignon,J., Lovell-Badge,R. and Goodfellow,P.N. TITLE Characterization and mapping of the human SOX4 gene JOURNAL Mamm. Genome 4 (10), 577-584 (1993) MEDLINE 94093204 FEATURES Location/Qualifiers source 1..2797 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="melanocytes" /cell_line="LT5-1 melanoma cell line / normal melanocytes" /clone_lib="LT5.1 (lambda gt10) cDNA library, normal melanocyte (lambda gt11) cDna library" /clone="severl overlapping clones" /chromosome="Human 6p22" misc_feature 57..135 /note="HMG-box" gene 351..1775 /gene="SOX-4" CDS 351..1775 /gene="SOX-4" /codon_start=1 /product="SOX-4 protein" /db_xref="PID:g36553" /db_xref="SWISS-PROT:Q06945" /translation="MVQQTNNAENTEALLAGESSDSGAGLELGIASSPTPGSTASTGG KADDPSWCKTPSGHIKRPMNAFMVWSQIERRKIMEQSPDMHNAEISKRLGKRWKLLKD SDKIPFIREAERLRLKHMADYPDYKYRPRKKVKSGNANSSSSAAASSKPGEKGDKVGG SGGGGHGGGGGGGSSNAGGGGGGASGGGANSKPAQKKSCGSKVAGGAGGGVSKPHAKL ILAGGGGGGKAAAAAAASFAAEQAGAAALLPLGAAADHHSLYKARTPSASASASSAAS ASAALAAPGKHLAEKKVKRVYLFGGLGTSSSPVGGVGAGADPSDPLGLYEEEGAGCSP DAPSLSGRSSAASSPAAGRSPADHRGYASLRAASPAPSSAPSHASSSASSHSSSSSSS GSSSSDDEFEDDLLDLNPSSNFESMSLGSFSSSSALDRDLDFNFEPGSGSHFEFPDYC TPEVSEMISGDWLESSISNLVFTY" BASE COUNT 547 a 880 c 931 g 439 t ORIGIN 1 ttccccagca ttcgagaaac tcctctctac tttagcacgg tctccagact cagccgagag 61 acagcaaact gcagcgcggt gagagagcga gagagaggga gagagagact ctccagcctg 121 ggaactataa ctcctctgcg agaggcggag aactccttcc ccaaatcttt tggggacttt 181 tctctcttta cccacctccg cccctgcgag gagttgaggg gccagttcgg ccgccgcgcg 241 cgtcttcccg ttcggcgtgt gcttggcccg gggaaccggg agggcccggc gatcgcgcgg 301 cggccgccgc gagggtgtga gcgcgcgtgg gcgcccgccg agccgaggcc atggtgcagc 361 aaaccaacaa tgccgagaac acggaagcgc tgctggccgg cgagagctcg gactcgggcg 421 ccggcctcga gctgggaatc gcctcctccc ccacgcccgg ctccaccgcc tccacgggcg 481 gcaaggccga cgacccgagc tggtgcaaga ccccgagtgg gcacatcaag cgacccatga 541 acgccttcat ggtgtggtcg cagatcgagc ggcgcaagat catggagcag tcgcccgaca 601 tgcacaacgc cgagatctcc aagcggctgg gcaaacgctg gaagctgctc aaagacagcg 661 acaagatccc tttcattcga gaggcggagc ggctgcgcct caagcacatg gctgactacc 721 ccgactacaa gtaccggccc aggaagaagg tgaagtccgg caacgccaac tccagctcct 781 cggccgccgc ctcctccaag ccgggggaga agggagacaa ggtcggtggc agtggcgggg 841 gcggccatgg gggcggcggc ggcggcggga gcagcaacgc ggggggagga ggcggcggtg 901 cgagtggcgg cggcgccaac tccaaaccgg cgcagaaaaa gagctgcggc tccaaagtgg 961 cgggcggcgc gggcggtggg gttagcaaac cgcacgccaa gctcatcctg gcaggcggcg 1021 gcggcggcgg gaaagcagcg gctgccgccg ccgcctcctt cgccgccgaa caggcggggg 1081 ccgccgccct gctgcccctg ggcgccgccg ccgaccacca ctcgctgtac aaggcgcgga 1141 ctcccagcgc ctcggcctcc gcctcctcgg cagcctcggc ctccgcagcg ctcgcggccc 1201 cgggcaagca cctggcggag aagaaggtga agcgcgtcta cctgttcggc ggcctgggca 1261 cgtcgtcgtc gcccgtgggc ggcgtgggcg cgggagccga ccccagcgac cccctgggcc 1321 tgtacgagga ggagggcgcg ggctgctcgc ccgacgcgcc cagcctgagc ggccgcagca 1381 gcgccgcctc gtcccccgcc gccggccgct cgcccgccga ccaccgcggc tacgccagcc 1441 tgcgcgccgc ctcgcccgcc ccgtccagcg cgccctcgca cgcgtcctcc tcggcctcgt 1501 cccactcctc ctcttcctcc tcctcgggct cctcgtcctc cgacgacgag ttcgaagacg 1561 acctgctcga cctgaacccc agctcaaact ttgagagcat gtccctgggc agcttcagtt 1621 cgtcgtcggc gctcgaccgg gacctggatt ttaacttcga gcccggctcc ggctcgcact 1681 tcgagttccc ggactactgc acgcccgagg tgagcgagat gatctcggga gactggctcg 1741 agtccagcat ctccaacctg gttttcacct actgaagggc gcgcaggcag ggagaagggc 1801 cggggggggt aggagaggag aaaaaaaaag tgaaaaaaag aaacgaaaag gacagacgaa 1861 gagtttaaag agaaaaggga aaaaagaaag aaaaagtaag cagggctcgt tcgcccgcgt 1921 tctcgtcgtc ggatcaagga gcgcggcggc gttttggacc cgcgctccca tcccccacct 1981 tcccgggccg gggacccact ctgcccagcc ggagggacgc ggaggaggaa gagggtagac 2041 aggggcgacc tgtgattgtt gttattgatg ttgttgttga tggcaaaaaa aaaaagcgac 2101 ttcgagtttg ctcccctttg cttgaagaga ccccctcccc cttccaacga gcttccggac 2161 ttgtctgcac ccccagcaag aaggcgagtt agttttctag agacttgaag gagtctcccc 2221 cttcctgcat caccaccttg gttttgtttt attttgcttc ttggtcaaga aaggagggga 2281 gaacccagcg cacccctccc cccctttttt taaacgcgtg atgaagacag aaggctccgg 2341 ggtgacgaat ttggccgatg gcagatgttt tgggggaacg ccgggactga gagactccac 2401 gcaggcgaat tcccgtttgg ggcctttttt tcctccctct tttccccttg ccccctctgc 2461 agccggagga ggagatgttg aggggaggag gccagccagt gtgaccggcg ctaggaaatg 2521 acccgagaac cccgttggaa gcgcagcagc gggagctagg ggcgggggcg gaggaggaca 2581 cgaactggaa gggggttcac ggtcaaactg aaatggattt gcacgttggg gagctggcgg 2641 cggcggctgc tgggcctccg ccttcttttc tacgtgaaat cagtgaggtg agacttccca 2701 gaccccggag gcgtggagga gaggagactg tttgatgtgg tacaggggca gtcagtggag 2761 ggcgagtggt ttcggaaaaa aaaaaagaaa aaaaggg // LOCUS HSSOX9MRN 3923 bp RNA PRI 10-AUG-1995 DEFINITION Homo sapiens SOX9 mRNA. ACCESSION Z46629 NID g758102 KEYWORDS SOX9 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3923) AUTHORS Foster,J.W., Dominguez-Steglich,M.A., Guioli,S., Kwok,C., Weller,P.A., Stevanovic,M., Weissenbach,J., Mansour,S., Young,I.D., Goodfellow,P.N., Brook,D.J. and Schafer,A.J. TITLE Mutations in an SRY-related gene cause campomelic dysplasia and autosomal sex reversal JOURNAL Unpublished REFERENCE 2 (bases 1 to 3923) AUTHORS Guioli,S. TITLE Direct Submission JOURNAL Submitted (26-OCT-1994) Silvana Guioli, Genetics, University of Cambridge, Downing Street, Cambridge, CB2 3EH, United Kingdom REFERENCE 3 (bases 1 to 3923) AUTHORS Foster,J.W., Dominguez-Steglich,M.A., Guioli,S., Kowk,G., Weller,P.A., Stevanovic,M., Weissenbach,J., Mansour,S., Young,I.D., Goodfellow,P.N. and Schafer,A.J. TITLE Campomelic dysplasia and autosomal sex reversal caused by mutations in an SRY-related gene JOURNAL Nature 372 (6506), 525-530 (1994) MEDLINE 95082903 FEATURES Location/Qualifiers source 1..3923 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="3A, 2A, 1A, 19-19, 9-4" /dev_stage="adult" /tissue_type="testis" /clone_lib="human testis cDNA library (CLONTECH)" mRNA join(<1..790,791..1044,1045..>3923) /gene="SOX9" exon 1..790 /gene="SOX9" /number=1 5'UTR <1..359 /gene="SOX9" gene 1..3923 /gene="SOX9" CDS 360..1889 /gene="SOX9" /codon_start=1 /db_xref="PID:g758103" /db_xref="SWISS-PROT:P48436" /translation="MNLLDPFMKMTDEQEKGLSGAPSPTMSEDSAGSPCPSGSGSDTE NTRPQENTFPKGEPDLKKESEEDKFPVCIREAVSQVLKGYDWTLVPMPVRVNGSSKNK PHVKRPMNAFMVWAQAARRKLADQYPHLHNAELSKTLGKLWRLLNESEKRPFVEEAER LRVQHKKDHPDYKYQPRRRKSVKNGQAEAEEATEQTHISPNAIFKALQADSPHSSSGM SEVHSPGEHSGQSQGPPTPPTTPKTDVQPGKADLKREGRPLPEGGRQPPIDFRDVDIG ELSSDVISNIETFDVNEFDQYLPPNGHPGVPATHGQVTYTGSYGISSTAATPASAGHV WMSKQQAPPPPPQQPPQAPPAPQAPPQPQAAPPQQPAAPPQQPQAHTLTTLSSEPGQS QRTHIKTEQLSPSHYSEQQQHSPQQIAYSPFNLPHYSPSYPPITRSQYDYTDHQNSSS YYSHAAGQGTGLYSTFTYMNPAQRPMYTPIADTSGVPSIPQTHSPQHWEQPVYTQLTR P" exon 791..1044 /gene="SOX9" /number=2 exon 1045..3923 /gene="SOX9" /number=3 3'UTR 1890..>3923 /gene="SOX9" polyA_site 3896..3901 /gene="SOX9" BASE COUNT 956 a 1097 c 903 g 967 t ORIGIN 1 cggagctcga aactgactgg aaacttcagt ggcgcggaga ctcgccagtt tcaaccccgg 61 aaacttttct ttgcaggagg agaagagaag gggtgcaagc gcccccactt ttgctctttt 121 tcctcccctc ctcctcctct ccaattcgcc tccccccact tggagcgggc agctgtgaac 181 tggccacccc gcgccttcct aagtgctcgc cgcggtagcc ggccgacgcg ccagcttccc 241 cgggagccgc ttgctccgca tccgggcagc cgaggggaga ggagcccgcg cctcgagtcc 301 ccgagccgcc gcggcttctc gcctttcccg gccaccagcc ccctgccccg ggcccgcgta 361 tgaatctcct ggaccccttc atgaagatga ccgacgagca ggagaagggc ctgtccggcg 421 cccccagccc caccatgtcc gaggactccg cgggctcgcc ctgcccgtcg ggctccggct 481 cggacaccga gaacacgcgg ccccaggaga acacgttccc caagggcgag cccgatctga 541 agaaggagag cgaggaggac aagttccccg tgtgcatccg cgaggcggtc agccaggtgc 601 tcaaaggcta cgactggacg ctggtgccca tgccggtgcg cgtcaacggc tccagcaaga 661 acaagccgca cgtcaagcgg cccatgaacg ccttcatggt gtgggcgcag gcggcgcgca 721 ggaagctcgc ggaccagtac ccgcacttgc acaacgccga gctcagcaag acgctgggca 781 agctctggag acttctgaac gagagcgaga agcggccctt cgtggaggag gcggagcggc 841 tgcgcgtgca gcacaagaag gaccacccgg attacaagta ccagccgcgg cggaggaagt 901 cggtgaagaa cgggcaggcg gaggcagagg aggccacgga gcagacgcac atctccccca 961 acgccatctt caaggcgctg caggccgact cgccacactc ctcctccggc atgagcgagg 1021 tgcactcccc cggcgagcac tcggggcaat cccagggccc accgacccca cccaccaccc 1081 ccaaaaccga cgtgcagccg ggcaaggctg acctgaagcg agaggggcgc cccttgccag 1141 aggggggcag acagccccct atcgacttcc gcgacgtgga catcggcgag ctgagcagcg 1201 acgtcatctc caacatcgag accttcgatg tcaacgagtt tgaccagtac ctgccgccca 1261 acggccaccc gggggtgccg gccacgcacg gccaggtcac ctacacgggc agctacggca 1321 tcagcagcac cgcggccacc ccggcgagcg cgggccacgt gtggatgtcc aagcagcagg 1381 cgccgccgcc acccccgcag cagcccccac aggccccgcc ggccccgcag gcgcccccgc 1441 agccgcaggc ggcgccccca cagcagccgg cggcaccccc gcagcagcca caggcgcaca 1501 cgctgaccac gctgagcagc gagccgggcc agtcccagcg aacgcacatc aagacggagc 1561 agctgagccc cagccactac agcgagcagc agcagcactc gccccaacag atcgcctaca 1621 gccccttcaa cctcccacac tacagcccct cctacccgcc catcacccgc tcacagtacg 1681 actacaccga ccaccagaac tccagctcct actacagcca cgcggcaggc cagggcaccg 1741 gcctctactc caccttcacc tacatgaacc ccgctcagcg ccccatgtac acccccatcg 1801 ccgacacctc tggggtccct tccatcccgc agacccacag cccccagcac tgggaacaac 1861 ccgtctacac acagctcact cgaccttgag gaggcctccc acgaagggcg acgatggccg 1921 agatgatcct aaaaataacc gaagaaagag aggaccaacc agaattccct ttggacattt 1981 gtgttttttt gtttttttat tttgttttgt tttttcttct tcttcttctt ccttaaagac 2041 atttaagcta aaggcaactc gtacccaaat ttccaagaca caaacatgac ctatccaagc 2101 gcattaccca cttgtggcca atcagtggcc aggccaacct tggctaaatg gagcagcgaa 2161 atcaacgaga aactggactt tttaaaccct cttcagagca agcgtggagg atgatggaga 2221 atcgtgtgat cagtgtgcta aatctctctg cctgtttgga ctttgtaatt atttttttag 2281 cagtaattaa agaaaaaagt cctctgtgag gaatattctc tattttaaat atttttagta 2341 tgtactgtgt atgattcatt accattttga ggggatttat acatattttt agataaaatt 2401 aaatgctctt atttttccaa cagctaaact actcttagtt gaacagtgtg ccctagcttt 2461 tcttgcaacc agagtatttt tgtacagatt tgctttctct tacaaaaaga aaaaaaaaat 2521 cctgttgtat taacatttaa aaacagaatt gtgttatgtg atcagttttg ggggttaact 2581 ttgcttaatt cctcaggctt tgcgatttaa ggaggagctg ccttaaaaaa aaataaaggc 2641 cttattttgc aattatggga gtaaacaata gtctagagaa gcatttggta agctttatga 2701 tatatatatt ttttaaagaa gagaaaaaca ccttgagcct taaaacggtg ctgctgggaa 2761 acatttgcac tcttttagtg catttcctcc tgcctttgct tgttcactgc agtcttaaga 2821 aagaggtaaa aggcaagcaa aggagatgaa atctgttctg ggaatgtttc agcagccaat 2881 aagtgcccga gcacactgcc cccggttgcc tgcctgggcc ccatgtggaa ggcagatgcc 2941 tgctcgctct gtcacctgtg cctctcagaa caccagcagt taaccttcaa gacattccac 3001 ttgctaaaat tatttatttt gtaaggagag gttttaatta aaacaaaaaa aaattctttt 3061 tttttttttt ttttccaatt ttaccttctt taaaataggt tgttggagct ttcctcaaag 3121 ggtatggtca tctgttgtta aattatgttc ttaactgtaa ccagtttttt tttatttatc 3181 tctttaatct tttttattat taaaagcaag tttctttgta ttcctcaccc tagatttgta 3241 taaatgcctt tttgtccatc ccttttttct ttgttgtttt tgttgaaaac aaactggaaa 3301 cttgtttctt tttttgtata aatgagagat tgcaaatgta gtgtatcact gagtcatttg 3361 cagtgttttc tgccacagac ctttgggctg ccttatattg tgtgtgtgtg tgggtgtgtg 3421 tgtgttttga cacaaaaaca atgcaagcat gtgtcatcca tatttctcta catcttctct 3481 tggagtgagg gaggctacct ggaggggatc agcccactga cagaccttaa tcttaattac 3541 tgctgtggct agagagtttg aggattgctt tttaaaaaag acagcaaact ttttttttta 3601 tttaaaaaaa gatatattaa cagttttaga agtcagtaga ataaaatctt aaagcactca 3661 taatatggca tccttcaatt tctgtataaa agcagatctt tttaaaaaag atacttctgt 3721 aacttaagaa acctggcatt taaatcatat tttgtcttta ggtaaaagct ttggtttgtg 3781 ttcgtgtttt gtttgtttca cttgtttccc tcccagcccc aaaccttttg ttctctccgt 3841 gaaacttacc tttccctttt tctttctctt tttttttttg tatattattg tttacaataa 3901 atatacattg cattaaaaag aaa // LOCUS HSSP17 2029 bp RNA PRI 21-AUG-1996 DEFINITION H.sapiens Sp17 gene. ACCESSION Z48570 NID g695580 KEYWORDS SP17 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2029) AUTHORS Lea,I.A., Richardson,R.T., Widgren,E.E. and O'Rand,M.G. TITLE Cloning and sequencing of cDNAs encoding the human sperm protein, Sp17 JOURNAL Biochim. Biophys. Acta 1307 (3), 263-266 (1996) MEDLINE 96305346 REFERENCE 2 (bases 1 to 1256) AUTHORS Richardson,R.T., Yamasaki,N. and O'Rand,M.G. TITLE Sequence of a rabbit sperm zona pellucida binding protein and localization during the acrosome reaction JOURNAL Dev. Biol. 165 (2), 688-701 (1994) MEDLINE 95046885 REFERENCE 3 (bases 1 to 2029) AUTHORS Lea,I.A. TITLE Direct Submission JOURNAL Submitted (02-MAR-1995) Lea I. A., University of North Carolina at Chapel Hill, Cell Biology & Anatomy, 210 Taylor Hall, Chapel Hill, NC, USA, 27599-7090 FEATURES Location/Qualifiers source 1..2029 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="3-5 & 7-1" /dev_stage="50-year old adult" /tissue_type="testis" /cell_type="spermatogenic" /clone_lib="testis cDNA, Clontech" /sex="Male" 5'UTR 1..748 /note="alternate 5' UTR for 1.6 kb mRNA" /function="alternate 5' UTR" /evidence=experimental 5'UTR 749..1210 /note="alternate 5' UTR for 1.3 kb mRNA" /function="alternate 5' UTR" /evidence=experimental CDS 1211..1666 /function="Zona binding protein" /codon_start=1 /evidence=experimental /product="Sp17" /db_xref="PID:g695581" /translation="MSIPFSNTHYRIPQGFGNLLEGLTREILREQPDNIPAFAAAYFE SLLEKREKTNFDPAEWGSKVEDRFYNNHAFEEQEPPEKSDPKQEESQISGKEEETSVT ILDSSEEDKEKEEVAAVKIQAAFRGHIAREEAKKMKTNSLQNEEKEENK" 3'UTR 1667..2029 /note="3' UTR is identical for both 1.6 and 1.3 kb mRNAs" /evidence=experimental polyA_site 1984..1989 /evidence=experimental BASE COUNT 669 a 405 c 434 g 521 t ORIGIN 1 cgggcgtgca gacaaaatac atggatgtgg tcaaggagcg aatccgttta gctcgacaga 61 ttgagaaatc tgagtatcgg aacttccagg cttgcctgca caactcttgg gattgagcag 121 gcagcagctg ccctggagat tgagctggaa gaagacatgt ataagggagg aaaagctgac 181 cagcaagaag aacgtcggag acaaagcaga tgaaggttct gaaggaggag ctgcgccacc 241 tgctgtccag ccactgttta cggagagcca gaaaaccaag tatccactca gtctggcaag 301 ccgccccttg cttgtgtctg ccccaagtaa gagcgagtct gctttgagct gtctctccaa 361 gcagaagaag aagaagacaa agaagccgaa gagccacagc cggaacagcc acagccaagt 421 acaagtgcaa attaactggt caagtgtgtc agtgactgca cattggtttc tgttctctgg 481 ctatttgcaa aacctctccc acccttgagt ttcactccac caccaacccc aggtaaaaaa 541 gtctccctct cttccactca cacccatagc gggagagacc tcatgcagat ttgcattgtt 601 ttggagtaag aattcaatgc agcagcttaa tttttctgta ttgcagtgtt tataggcttc 661 ttgtgtgtta aacttgattt cataaattaa aaacaatggt cagaaaaaaa aaaaaaaccg 721 gaaccggcgg caccagctcg gagagaaatc gatgttgtag tgaccttcag taaaagagcg 781 gtttttcata gaggtgccgt tttagactac ctatttaaga ggcacgaaaa acaaatacat 841 ctaataggtt aagtaaaaaa ccatctattt cggacaataa aagttatttt ctacacacgt 901 tggtcttcat tttactcgtt aacagtatca tacatccttc taagcttatc tttttgacgt 961 gaaagtgtag tagtatgtct ccacctggca gctatgtagt taatattttt gtctgttgta 1021 atgttatcaa gtaccgaaca ttttcctaat gaaatagtgg aaaagacaac ctttttctcc 1081 atttctattt ggatttttag atcacgtaca taacaaggaa tcgaataaat aatgaagtgt 1141 tttataaaga gtatccgtct tggagggaga ttccagttgg gaggttccat aggcagttct 1201 taccaagaag atgtcgattc cattctccaa cacccactac cgaattccac aaggatttgg 1261 gaatcttctt gaagggctga cacgcgagat tctgagagag caaccggaca atataccagc 1321 ttttgcagca gcctattttg agagccttct agagaaaaga gagaaaacca actttgatcc 1381 agcagaatgg gggagtaagg tagaagaccg cttctataac aatcatgcat tcgaggagca 1441 agaaccacct gagaaaagtg atcctaaaca agaagagtct cagatatctg ggaaggagga 1501 agagacatca gtcaccatct tagactcttc tgaggaagat aaggaaaaag aagaggttgc 1561 tgctgtcaaa atccaagctg ccttccgggg acacatagcc agagaggagg caaagaaaat 1621 gaaaacaaat agtcttcaaa atgaggaaaa agaggaaaac aagtgaggac actggtttta 1681 cctccaggaa acatgaaaaa taatccaaat ccatcaacct tcttattaat gtcatttctc 1741 cttgaggaag gaagatttga tgttgtgaaa taacattcgt tactgttgtg aaaatctgtc 1801 atgagcattt gtttaataag cataccattg aaacatgcca cttgaagatt tctctgagat 1861 catgagtttg tttacacttg tctcaagcct atctatagag acccttggat ttagaattat 1921 agaactaaag tatctgagat tacagagatc tcagaggtta tgtgttctaa ctattatcaa 1981 atgaataaat cctctctatc acatccccca aaaaaaaaaa aaaaaaaaa // LOCUS HSSPHAR 1530 bp DNA PRI 26-JUN-1997 DEFINITION H.sapiens SPHAR gene for cyclin-related protein. ACCESSION X82554 NID g575271 KEYWORDS cyclin-related gene; SPHAR gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1530) AUTHORS Digweed,M., Gunthert,U., Schneider,R., Seyschab,H., Friedl,R. and Sperling,K. TITLE Irreversible repression of DNA synthesis in Fanconi anemia cells is alleviated by the product of a novel cyclin-related gene JOURNAL Mol. Cell. Biol. 15 (1), 305-314 (1995) MEDLINE 95098005 REFERENCE 2 (bases 1 to 1530) AUTHORS Digweed,M. TITLE Direct Submission JOURNAL Submitted (31-OCT-1994) M. Digweed, Institut fuer Humangenetik, Freie Universitaet Berlin, Heubnerweg 6, 14059 Berlin, FRG FEATURES Location/Qualifiers source 1..1530 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /clone_lib="cDNA, partial genomic" /clone="pSPHAR, pSPHAR-G, pSPHAR-G7, pSPHAR-G11" /chromosome="8" /map="p22-q11" gene 886..1530 /gene="SPHAR" promoter 886..910 /gene="SPHAR" mRNA 929..1530 /gene="SPHAR" /evidence=experimental CDS 1164..1355 /gene="SPHAR" /codon_start=1 /evidence=experimental /db_xref="PID:g575272" /db_xref="SWISS-PROT:Q15513" /translation="MTRIKISVCICFRYFEFCFFYALNILFQKVSEANSQTELLLRPH CKNILFNVSFMIDLQAAHF" polyA_signal 1510..1516 /gene="SPHAR" BASE COUNT 456 a 261 c 251 g 562 t ORIGIN 1 aattaaatgg acaacaccgt tagatgtgta tgtaaaaatt ttctgtttca tatttttcct 61 ttcactttcg gtttagaaca tgctatatgt actgtatgtc ctgtggccca gtgcggctcc 121 acagcatgga atctgatgta tgatatgata gaatgtggca ctaaatgcag tttcagattt 181 tattttttta atcatatgaa ctaaaattgt caattgtgag ttgtgctttc tcatcatgtt 241 ggttatattg cacaattggt tatatttatg acctgatatt caaagactct ggcattgata 301 gccagtgtgt tttcttattt aactccgttt actacattct acatggtgtt tacgtgatcc 361 acacttgaaa tactagatca gtagacattc actaatatac caaaataaaa tgaaaaattg 421 agtttttccg tgaactttat actgtccagc tctgttgatt ttaaagcttc ttcatccagg 481 tcagttcagg aagtatatct ggagtacctg gctctgtttt tggctgtgag actagcacta 541 aggattctgg tacctttacc caaacctact gggctactaa tacttctctc agcagttgaa 601 tcaaatacaa tagaccatgt aagctggggc cgctcatcca cttccagttt gctggtctcc 661 ctgctagaag acacattgta ctgtgctttt tctggaattc acgataatgg catcactgcc 721 tgtttttcac atcttttgtt tcctgttcat tttaaggaaa cctactaaat ccagttaata 781 ttaaatggac accactcatt aagaaatttc tttatggctt ctgcctgaat acttaaaatg 841 ccttactaca gttatccagt tgacatgttt ttaattcata taaggtatat tgggtatatt 901 gaagtatata ttgtattaca aagacttgtt cttgtatttt aaaatgtcag tgcaaaaaat 961 atatggtgga acctttcttt aaagttgaaa tgcagtatta tttaaatctg aaaggttaaa 1021 aagctttctt caccttatat atgttcttcc actgtgactt tttagttgaa gactagtaaa 1081 ttaactttta gttagaagat gcctactgct tttgttgttt attttaatca gcagagcaca 1141 gagacacata aaaactctgg gaaatgacta ggataaaaat atcagtatgt atctgtttta 1201 gatattttga gttttgcttt ttttatgcct tgaatatttt atttcaaaaa gtatctgaag 1261 caaattctca gactgaacta cttcttagac ctcactgtaa gaatatttta ttcaatgtct 1321 catttatgat agatttgcaa gctgctcatt tttgaacagc tttttgcatg ggataggagc 1381 atgtctattc taacacatca gcttattcaa aagcaagaat tttaaaaata agataaatgt 1441 aaagttgttt tataaacgat cctgttaatt aaaccacaga caccatatat ccttctgcat 1501 cctttggcca ataaaagttg ctggagaacc // LOCUS HSSPI1 1364 bp RNA PRI 04-NOV-1995 DEFINITION Human mRNA for spi-1 proto-oncogene. ACCESSION X52056 NID g36560 KEYWORDS proto-oncogene; spi-1 proto-oncogene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1364) AUTHORS Ray,D. TITLE Direct Submission JOURNAL Submitted (07-MAR-1990) Ray D., INSERM U 248, Faculte de Medecine Lariboisiere Saint Louis, 10 Avenue de Verdun, 75010 Paris, France REFERENCE 2 (bases 1 to 1351) AUTHORS Ray,D., Culine,S., Tavitain,A. and Moreau-Gachelin,F. TITLE The human homologue of the putative proto-oncogene Spi-1: characterization and expression in tumors JOURNAL Oncogene 5 (5), 663-668 (1990) MEDLINE 90265606 REMARK Erratum:[Oncogene 1990 Oct;5(10):1611-2]] COMMENT Data kindly reviewed (20-AUG-1990) by Ray D. FEATURES Location/Qualifiers source 1..1364 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Raji" /chromosome="11p 11-22" CDS 212..1006 /note="spi-1 protein product (AA 1-264)" /codon_start=1 /db_xref="PID:g36561" /db_xref="SWISS-PROT:P17947" /translation="MEGFPLVPPPSEDLVPYDTDLYQRQTHEYYPYLSSDGESHSDHY WDFHPHHVHSEFESFAENNFTELQSVQPPQLQQLYRHMELEQMHVLDTPMVPPHPSLG HQVSYLPRMCLQYPSLSPAQPSSDEEEGERQSPPLEVSDGEADGLEPGPGLLPGETGS KKKIRLYQFLLDLLRSGDMKDSIWWVDKDKGTFQFSSKHKEALAHRWGIQKGNRKKMT YQKMARALRNYGKTGEVKKVKKKLTYQFSGEVLGRGGLAERRHPPH" BASE COUNT 279 a 497 c 390 g 198 t ORIGIN 1 aaaatcagga acttgtgctg gccctgcaat gtcaagggag ggggctcacc cagggctcct 61 gtagctcagg gggcaggcct gagccctgca cccgccccac gaccgtccag cccctgacgg 121 gcaccccatc ctgaggggct ctgcattggc ccccaccgag gcaggggatc tgaccgactc 181 ggagcccggc tggatgttac aggcgtgcaa aatggaaggg tttcccctcg tcccccctcc 241 atcagaagac ctggtgccct atgacacgga tctataccaa cgccaaacgc acgagtatta 301 cccctatctc agcagtgatg gggagagcca tagcgaccat tactgggact tccaccccca 361 ccacgtgcac agcgagttcg agagcttcgc cgagaacaac ttcacggagc tccagagcgt 421 gcagcccccg cagctgcagc agctctaccg ccacatggag ctggagcaga tgcacgtcct 481 cgataccccc atggtgccac cccatcccag tcttggccac caggtctcct acctgccccg 541 gatgtgcctc cagtacccat ccctgtcccc agcccagccc agctcagatg aggaggaggg 601 cgagcggcag agccccccac tggaggtgtc tgacggcgag gcggatggcc tggagcccgg 661 gcctgggctc ctgcctgggg agacaggcag caagaagaag atccgcctgt accagttcct 721 gttggacctg ctccgcagcg gcgacatgaa ggacagcatc tggtgggtgg acaaggacaa 781 gggcaccttc cagttctcgt ccaagcacaa ggaggcgctg gcgcaccgct ggggcatcca 841 gaagggcaac cgcaagaaga tgacctacca gaagatggcg cgcgcgctgc gcaactacgg 901 caagacgggc gaggtcaaga aggtgaagaa gaagctcacc taccagttca gcggcgaagt 961 gctgggccgc gggggcctgg ccgagcggcg ccacccgccc cactgagccc gcagcccccg 1021 ccggccccgc caggcctccc cgctggccat agcattaagc cctcgcccgg cccggacaca 1081 gggaggacgc tcccggggcc cagaggcagg actgtggcgg gccgggctcc gtcacccgcc 1141 cctcccccca ctccaggccc cctccacatc ccgcttcgcc tccctccagg actccacccc 1201 ggctcccgac gccagctggg cgtcagaccc accggcaacc ttgcagagga cgacccgggg 1261 tactgccttg ggagtctcaa gtccgtatgt aaatcagatc tcccctctca cccctcccac 1321 ccattaacct cctcccaaaa aacaagtaaa gttattctca atcc // LOCUS HSSPMSYN 1612 bp RNA PRI 08-MAR-1996 DEFINITION H.sapiens mRNA for spermine synthase. ACCESSION Z49099 NID g791050 KEYWORDS spermine synthase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1612) AUTHORS Korhonen,V.P., Halmekyto,M., Kauppinen,L., Myohanen,S., Wahlfors,J., Keinanen,T., Hyvonen,T., Alhonen,L., Eloranta,T. and Janne,J. TITLE Molecular cloning of a cDNA encoding human spermine synthase JOURNAL DNA Cell Biol. 14 (10), 841-847 (1995) MEDLINE 96027753 REFERENCE 2 (bases 1 to 1612) AUTHORS Korhonen,V. TITLE Direct Submission JOURNAL Submitted (24-APR-1995) Veli-Pekka Korhonen, A.I. Virtanen Institute, University of, Kuopio, Kuopio, FIN-70211, Finland FEATURES Location/Qualifiers source 1..1612 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HepG2, liver tumor stimulated with IL-1" /clone_lib="lambda ZAP" mRNA <1..1612 CDS 23..1129 /EC_number="2.5.1.22" /note="spermidine aminopropyltransferase" /codon_start=1 /product="spermine synthase" /db_xref="PID:g791051" /translation="MPGAAARHSTLDFMLGAKADGETILKGLQSIFQEQGMAESVHTW QDHGYLATYTNKNGSFANLRIYPHGLVLLDLQSYDGDAQGKEEIDSILNKVEERMKEL SQDSTGRVKRLPPIVRGGAIDRYWPTADGRLVEYDIDEVVYDEDSPYQNIKILHSKQF GNILILSGDVNLAESDLAYTRAIMGSGKEDYTGKDVLILGGGDGGILCEIVKLKPKMV TMVEIDQMVIDGCKKYMRKTCGDVLDNLKGDCYQVLIEDCIPVLKRYAKEGREFDYVI NDLTAVPISTSPEEDSTWEFLRLILDLSMKVLKQDGKYFTQGNCVNLTEALSLYEEQL GRLYCPVEFSKEIVCVPSYLELWVFYTVWKKAKP" BASE COUNT 480 a 283 c 399 g 450 t ORIGIN 1 tagtgaggcg aggccctgtg ccatgcctgg ggcagcagca cggcacagca cgctcgactt 61 catgctcggc gccaaagctg atggtgagac cattctaaaa ggcctccagt ccattttcca 121 ggagcagggg atggcggagt cggtgcacac ctggcaggac catggctatt tagcaaccta 181 cacaaacaag aacggcagct ttgccaattt gagaatttac ccacatggat tggtgttgct 241 ggaccttcag agttatgatg gtgatgcgca aggcaaagaa gagatcgaca gtattttgaa 301 caaagtagag gaaagaatga aagaattgag tcaggacagt actgggcggg tgaaacgatt 361 accacccata gtgcgaggag gagccatcga cagatactgg cccaccgccg acgggcgcct 421 ggttgaatat gacatagatg aagtggtata tgacgaagat tcaccttatc aaaatataaa 481 aattctacac tcgaagcagt ttggaaatat tctcatcctt agtggggatg ttaatttggc 541 agagagtgat ttggcatata cccgggccat catgggcagt ggcaaagaag attacactgg 601 caaagatgta ctcattctgg gaggtggaga cggaggcata ttgtgtgaaa tagtcaaact 661 aaaaccaaag atggtcacta tggtagagat tgaccaaatg gtgattgatg ggtgtaagaa 721 atacatgcga aaaacgtgtg gcgatgtctt agacaatctt aaaggagact gctatcaggt 781 tctaatagaa gactgtatcc cggtactgaa gaggtacgcc aaagaaggga gagaatttga 841 ttatgtgatt aatgatttga cagctgttcc aatctccacg tctccagaag aagattccac 901 atgggagttt ctcagactga ttcttgacct ctcaatgaaa gtgttgaaac aggatgggaa 961 atattttaca caggggaact gtgtcaatct gacagaagca ctgtcgctct atgaagaaca 1021 gctggggcgc ctgtattgtc ctgtggaatt ttcaaaggag atcgtctgtg tcccttcata 1081 cttggaattg tgggtatttt acactgtttg gaagaaagct aaaccctgaa gatcagtagc 1141 ccctaatcac atgtgctgca aatagccttc ctgacctcca tatgctgtac atgacatcaa 1201 aatgagtcag gcaattgatt gtgaattcct taaagttttc ctttttttaa taattatttt 1261 taatttaaaa aagcaaatgg aaaatgtata ttttgatgag cttagggtgt tttttttttg 1321 aaagtcagct gaaggatggt tagacagcac agcgaagact gctaaatgca ctgacccccc 1381 ccattagaat gtgatttttg ttccttttta tttctctgtg ggcttttgtt tttgtttttg 1441 ttttggtaga tcttcaattt ggatatttgg aggagtgaac atcgttgttt tgctggaggg 1501 aagatcttga tggtgtttct ttccccaaaa attgacttag atattaaaat ttggtgctta 1561 taagagagag ttaaaaaaaa ataggattgc ttcaattaaa attacaaaag ag // LOCUS HSSPOT14 461 bp DNA PRI 17-FEB-1997 DEFINITION H.sapiens spot14 gene. ACCESSION Y08409 NID g1568568 KEYWORDS spot14 gene; Spot14 protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 461) AUTHORS Grillasca,J.P., Gastaldi,M., Khiri,H., Dace,A., Peyrol,N., Reynier,P., Torresani,J. and Planells,R. TITLE Cloning and initial characterization of human and mouse Spot 14 genes JOURNAL FEBS Lett. 401 (1), 38-42 (1997) MEDLINE 97157517 REFERENCE 2 (bases 1 to 461) AUTHORS Planells,R. TITLE Direct Submission JOURNAL Submitted (23-SEP-1996) R. Planells, INSERM Unite 38, Faculte de Medecine, 27 boulevard Jean Moulin, F- 13385 Marseille Cedex, FRANCE FEATURES Location/Qualifiers source 1..461 /organism="Homo sapiens" /isolate="3 unrelated patients" /db_xref="taxon:9606" /dev_stage="adult" gene 11..451 /gene="Spot14" CDS 11..451 /gene="Spot14" /codon_start=1 /product="Spot14 protein" /db_xref="PID:e268226" /db_xref="PID:g1568569" /translation="MQVLTKRYPKNCLLTVMDRYAAEVHNMEQVVMIPSLLRDVQLSG PGGQAQAEAPDLYTYFTMLKAICVDVDHGLLPREEWQAKVAGSEENGTAETEEVEDES ASGELDLEAQFHLHFSSLHHILMHLTEKAQEVTRKYQEMTGQVW" BASE COUNT 114 a 128 c 146 g 73 t ORIGIN 1 ggaagcaacc atgcaggtgc taaccaagcg ttaccccaag aactgcctgc tgaccgtcat 61 ggaccggtat gcagccgagg tgcacaacat ggagcaggtg gtgatgatcc ccagccttct 121 gcgggacgtg cagctgagtg ggcctggggg ccaggcccag gctgaggccc ctgatctcta 181 cacctacttc accatgctca aggccatctg tgtggatgtg gaccatgggc tgctgccgcg 241 ggaggagtgg caggccaagg tggcaggcag cgaagagaat ggaaccgcag agacagagga 301 agtcgaggac gagagtgcct caggagagct ggacctggaa gcccagttcc acctgcactt 361 ctccagcctc catcacatcc tcatgcacct caccgagaaa gcccaggagg tgacaaggaa 421 ataccaggaa atgacgggac aagtttggta gaccttggac a // LOCUS HSSPR1 2986 bp RNA PRI 30-JUN-1993 DEFINITION H.sapiens SPR-1 mRNA for GT box binding protein. ACCESSION X68561 S50516 NID g38419 KEYWORDS binding protein; transcription factor SP1. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2986) AUTHORS Suske,G. TITLE Direct Submission JOURNAL Submitted (02-OCT-1992) G. Suske, Institut fuer Molekularbiologie und Tumorforschung, Philipps-Universitaet, Emil-Mannkopff str.2, W-3550 Marburg, FRG REFERENCE 2 (bases 1 to 2986) AUTHORS Hagen,G., Muller,S., Beato,M. and Suske,G. TITLE Cloning by recognition site screening of two novel GT box binding proteins: a family of Sp1 related genes JOURNAL Nucleic Acids Res. 20 (21), 5519-5525 (1992) MEDLINE 93087156 FEATURES Location/Qualifiers source 1..2986 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="uterus" /cell_line="Ishikawa" /clone_lib="Random and oligo (dT) primed libraries" gene 182..2536 /gene="SPR-1" CDS 182..2536 /gene="SPR-1" /codon_start=1 /db_xref="PID:g38420" /db_xref="SWISS-PROT:Q02446" /translation="MSDQKKEEEEEAAAAAAMATEGGKTSEPENNNKKPKTSGSQDSQ PSPLALLAATCSKIGTPGENQATGQQQIIIDPSQGLVQLQNQPQQLELVTTQLAGNAW QLVASTPPASKENNVSQPASSSSSSSSSNNGSASPTKTKSGNSSTPGQFQVIQVQNPS GSVQYQVIPQLQTVEGQQIQINPTSSSSLQDLQGQIKLISAGNNQAILTAANRTASGN ILAQNLANQTVPVQIRPGVSIPLQLQTLPGTQAQVVTTLPINIGGVTLALPVINNVAA GGGTGQVGQPAATADSGTSNGNQLVSTPTNTTTSASTMPESPSSSTTCTTTASTSLTS SDTLVSSADTGQYASTSASSSERTIEESQTPAATESEAQSSSQLHANGMQNQQDQSNS LQQVQIVGQPILQQIQIQQPQQQIIQAIPPQSFQLQSGQTIQTIQQQPLQNVQLQAVN PTQVLIRAPTLTPSGQISWQTVQVQNIQSLSNLQVQNAGLSQQLTITPVSSSGGTTLA QIAPVAVAGAPITLNTAQLASVPNLQTVSVANLGAAGVQVQGVPVTITSVAGQQQGQD GVKVQQATIAPVTVAVGGIANATIGAVSPDQLTQVHLQQGQQTSDQEVQPGKRLRRVA CSCPNCREGEGRGSNEPGKKKQHICHIEGCGKVYGKTSHLRAHLRWHTGERPFICNWM FCGKRFTRSDELQRHRRTHTGEKRFECPECSKRFMRSDHLSKHVKTHQNKKGGGTALA IVTSGELDSSVTEVLGSPRIVTVAAISQDSNPATPNVSTNMEEF" misc_binding 2125..2368 /gene="SPR-1" /bound_moiety="DNA" BASE COUNT 887 a 722 c 639 g 738 t ORIGIN 1 agctgctacg cccaaccagc ccagcggcgg ccattcgcgg aaaaagaggc agagcctgtg 61 ccagctacag cctcctccga gccaccgcgg cggcggaccg gcctctcctc ccgcctcgcc 121 cccaccccca cccacctcta tcccagtgtc tccgtctgag ggtttgtcct gttaatgcgg 181 gatgagcgat cagaagaagg aggaggagga ggaggcggca gcggcagcgg cgatggctac 241 agaaggaggg aaaacctctg agccagagaa taacaataaa aaacccaaaa cctcaggctc 301 ccaggactct cagccctctc ctctggcttt actggcagct acttgcagca aaatagggac 361 tcctggtgaa aatcaagcaa ctggacaaca acaaattatt atagatccaa gtcaaggatt 421 ggtgcaactt caaaatcaac cacaacagct agaactggta acaacgcaac ttgctggaaa 481 cgcttggcaa cttgttgcct ccactcctcc tgcttcaaaa gagaataacg tttctcaacc 541 agcctctagt tcgtctagtt cttccagcag taataacggg agtgcatctc ctacaaaaac 601 taaatcaggt aattcttcca cccctggtca atttcaagtc atacaagtac aaaatccaag 661 tggtagtgta cagtaccaag taattccaca acttcagaca gtggaaggtc aacaaattca 721 aatcaatcca actagtagtt catctctaca ggatttgcag ggtcaaatta agctcatttc 781 tgcaggtaat aatcaagcta tactcacagc tgctaacagg acagcttctg ggaatattct 841 tgctcaaaac ctggcaaatc agacagttcc ggtccaaatt agacctggtg tttcaatacc 901 actgcagtta cagactcttc ctggtactca ggctcaagtt gtaacaaccc taccaattaa 961 cattggagga gtgactctag ctttgccagt gataaacaac gtggctgccg gaggagggac 1021 tgggcaggtt ggccagcctg ctgctactgc tgatagtggg acttccaatg ggaatcaatt 1081 agtttccaca cccaccaaca ccactacttc tgccagtact atgccagaat ctccctcctc 1141 ctccactacc tgcacaacca ctgcttcaac gtctttgaca agcagtgaca cattagtgag 1201 ctcagcagat actggccagt atgcaagcac atcagccagt agttctgaac gcaccattga 1261 agaatctcaa acacctgctg ctactgagtc tgaagcccag agctccagtc agcttcacgc 1321 taatggaatg cagaatcagc aggatcaatc aaattctctt cagcaggtgc aaattgtagg 1381 ccaacctatc ttacagcaga tccagatcca acagcctcag caacagatca ttcaggctat 1441 tccaccacag tcgtttcaac tccagtcagg gcagacgatt cagaccatcc agcagcagcc 1501 tttacagaat gttcaacttc aagcagtaaa tccgactcag gtgcttatca gggctccaac 1561 tttaacacct tcagggcaaa tcagttggca aactgtacag gttcagaata ttcagagtct 1621 ttcaaatttg caagttcaga atgctgggtt atcccaacaa ttaaccatca ccccagtgtc 1681 ttcaagtggt ggcacaactc ttgctcagat tgctcctgtg gctgttgctg gtgccccaat 1741 aactttgaat actgcccagc ttgcatcagt gcctaacctt cagacagtga gcgttgccaa 1801 cctgggtgct gcaggtgttc aagtgcaggg agttcccgtt acaatcacta gtgttgcagg 1861 tcagcagcaa ggacaagatg gagtaaaagt ccagcaagct actatagctc ctgtaactgt 1921 agcagttgga ggaattgcta atgccacgat aggtgctgtt agtcctgacc aactcacaca 1981 agtgcatttg cagcaaggcc agcagacttc tgatcaagag gtacaacctg gcaagaggct 2041 tcgaagagtt gcctgttcct gtcctaattg tagggaagga gaaggaagag gcagtaatga 2101 accaggaaaa aagaagcagc atatctgtca tattgaagga tgtggtaaag tttatggcaa 2161 aacatctcat ttacgagcac atcttcgctg gcatactgga gaaagacctt ttatatgcaa 2221 ctggatgttt tgtggcaaaa gattcacacg gagtgatgag ctccagagac atagaagaac 2281 ccatacaggt gaaaagagat ttgaatgccc ggaatgttct aaaaggttta tgcggagtga 2341 tcatctctcc aaacatgtca aaacgcacca gaataaaaaa ggtggtggga cagctcttgc 2401 cattgttacc tcgggagaac tggactcatc tgttacagag gtgcttggct ccccaagaat 2461 tgtcacagtt gcagccattt ctcaagattc gaatccagca actcccaatg tttcaaccaa 2521 catggaagaa ttctgaaaag ttatttataa cagagacctc tagtgctgca cttgtttaca 2581 cacctttgaa aatctggaaa tgggctggtc aagtggatta cagagtagga aattatgttt 2641 tcattcttgg cttctttaag tattccaggg tttggggtca acacgtgaag tgttgaattt 2701 taaaaaatac aaaaagcaaa ctgatgtact ggaaacagaa aagtatttcc tccatactat 2761 aagttgtagt tgtttggaaa tatatcacat aacctttata cagaatcttc ccatctctta 2821 atatcatgtg ttaacatgtt taaaaagacc ttagtagttt gcaggctgga ccttaattgg 2881 acttattttc tttgaaagta ctttgttata aattcagtca gtaataattt acgtgtattc 2941 tttttctcta tagcacagaa aacagatagt taactgatga tagcgc // LOCUS HSSPROT 1582 bp RNA PRI 21-MAR-1995 DEFINITION Human mRNA for S-protein. ACCESSION X03168 NID g36574 KEYWORDS S-protein; somatomedin B; spreading factor; vitronectin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1582) AUTHORS Jenne,D. and Stanley,K.K. TITLE Molecular cloning of S-protein, a link between complement, coagulation and cell-substrate adhesion JOURNAL EMBO J. 4 (12), 3153-3157 (1985) MEDLINE 86135941 REFERENCE 2 (bases 1 to 1582) AUTHORS Suzuki,S., Oldberg,A., Hayman,E.G., Pierschbacher,M.D. and Ruoslahti,E. TITLE Complete amino acid sequence of human vitronectin deduced from cDNA. Similarity of cell attachment sites in vitronectin and fibronectin JOURNAL EMBO J. 4 (10), 2519-2524 (1985) MEDLINE 86030229 REFERENCE 3 (bases 1 to 1582) AUTHORS Suzuki,S., Pierschbacher,M.D., Hayman,E.G., Nguyen,K., Ohgren,Y. and Ruoslahti,E. TITLE Domain structure of vitronectin. Alignment of active sites JOURNAL J. Biol. Chem. 259 (24), 15307-15314 (1984) MEDLINE 85080020 FEATURES Location/Qualifiers source 1..1582 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 62..118 /note="signal peptide (aa -19 to -1)" CDS 62..1498 /note="S-protein precursor" /codon_start=1 /db_xref="PID:g36575" /db_xref="SWISS-PROT:P04004" /translation="MAPLRPLLILALLAWVALADQESCKGRCTEGFNVDKKCQCDELC SYYQSCCTDYTAECKPQVTRGDVFTMPEDEYTVYDDGEEKNNATVHEQVGGPSLTSDL QAQSKGNPEQTPVLKPEEEAPAPEVGASKPEGIDSRPETLHPGRPQPPAEEELCSGKP FDAFTDLKNGSLFAFRGQYCYELDEKAVRPGYPKLIRDVWGIEGPIDAAFTRINCQGK TYLFKGNQYWRFEDGVLDPDYPRNISDGFDGIPDNVDAALALPAHSYSGRERVYFFKG KQYWEYQFQHQPSQEECEGSSLSAVFEHFAMMQRDSWEDIFELLFWGRTSAGTRQPQF ISRDWHGVPGQVDAAMAGRIYISGMAPRPSLTKKQRFRHRNRKGYRSQRGHSRGRNQN SRRPSRAMWLSLFSSEESNLGANNYDDYRMDWLVPATCEPIQSVFFFSGDKYYRVNLR TRRVDTVDPPYPRSIAHYWLGCPAPGHL" mat_peptide 119..1495 /note="mature S-protein (aa 1-459)" misc_feature 119..254 /note="somatomedin B homology (aa 1-44)" misc_feature 251..259 /note="cell attachment determinant" conflict 541 /note="C is missing in [2]" /citation=[2] conflict 550 /note="C is missing in [2]" /citation=[2] conflict 560 /note="C is missing in [2]" /citation=[2] conflict 735 /note="A is G in [2]" /citation=[2] misc_feature 1145..1255 /note="pot. heparin binding site [3]" conflict 1157 /note="A is G in [2]" /citation=[2] misc_feature 1255..1256 /note="trypsin cleavage site (in vitro)" conflict 1260 /note="U ic C in [2]" /citation=[2] conflict 1387 /note="C is G in [2]" /citation=[2] BASE COUNT 341 a 496 c 441 g 304 t ORIGIN 1 cagagcggag acttcaggga gaccagagcc cagttgcagg cactcagcta gaagccctgc 61 catggcaccc ctgagacccc ttctcatact ggccctgctg gcatgggttg ctctggctga 121 ccaagagtca tgcaagggcc gctgcactga gggcttcaac gtggacaaga agtgccagtg 181 tgacgagctc tgctcttact accagagctg ctgcacagac tatacggctg agtgcaagcc 241 ccaagtgact cgcggggatg tgttcactat gccggaggat gagtacacgg tctatgacga 301 tggcgaggag aaaaacaatg ccactgtcca tgaacaggtg gggggcccct ccctgacctc 361 tgacctccag gcccagtcca aagggaatcc tgagcagaca cctgttctga aacctgagga 421 agaggcccct gcgcctgagg tgggcgcctc taagcctgag gggatagact caaggcctga 481 gacccttcat ccagggagac ctcagccccc agcagaggag gagctgtgca gtgggaagcc 541 cttcgacgcc ttcaccgacc tcaagaacgg ttccctcttt gccttccgag ggcagtactg 601 ctatgaactg gacgaaaagg cagtgaggcc tgggtacccc aagctcatcc gagatgtctg 661 gggcatcgag ggccccatcg atgccgcctt cacccgcatc aactgtcagg ggaagaccta 721 cctcttcaag ggtaatcagt actggcgctt tgaggatggt gtcctggacc ctgattaccc 781 ccgaaatatc tctgacggct tcgatggcat cccggacaac gtggatgcag ccttggccct 841 ccctgcccat agctacagtg gccgggagcg ggtctacttc ttcaagggga aacagtactg 901 ggagtaccag ttccagcacc agcccagtca ggaggagtgt gaaggcagct ccctgtcggc 961 tgtgtttgaa cactttgcca tgatgcagcg ggacagctgg gaggacatct tcgagcttct 1021 cttctggggc agaacctctg ctggtaccag acagccccag ttcattagcc gggactggca 1081 cggtgtgcca gggcaagtgg acgcagccat ggctggccgc atctacatct caggcatggc 1141 accccgcccc tccttgacca agaaacaaag gtttaggcat cgcaaccgca aaggctaccg 1201 ttcacaacga ggccacagcc gtggccgcaa ccagaactcc cgccggccat cccgcgccat 1261 gtggctgtcc ttgttctcca gtgaggagag caacttggga gccaacaact atgatgacta 1321 caggatggac tggcttgtgc ctgccacctg tgaacccatc cagagtgtct tcttcttctc 1381 tggagacaag tactaccgag tcaatcttcg cacacggcga gtggacactg tggaccctcc 1441 ctacccacgc tccatcgctc actactggct gggctgccca gctcctggcc atctgtagga 1501 gtcagagccc acatggccgg gccctctgta gctccctcct cccatctcct tcccccagcc 1561 caataaaggt cccttagccc cg // LOCUS HSSPROTR 3294 bp RNA PRI 21-OCT-1992 DEFINITION Human mRNA for protein S. ACCESSION Y00692 NID g36578 KEYWORDS glycoprotein; protein S. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3294) AUTHORS van Amstel,J.K.P. TITLE Direct Submission JOURNAL Submitted (31-AUG-1987) J.K. Ploos van Amstel, Haemostasis and Thrombosis Unit, Dept. of Haematology, University Hospital, Bldg.1, C2-R, P.O. Box 9600, 2300RC Leiden, The Netherlands REMARK revised by [4] REFERENCE 2 (bases 1 to 245) AUTHORS Ploos van Amstel,H.K., van der Zanden,A.L., Reitsma,P.H. and Bertina,R.M. TITLE Human protein S cDNA encodes Phe-16 and Tyr 222 in consensus sequences for the post-translational processing JOURNAL FEBS Lett. 222 (1), 186-190 (1987) MEDLINE 88005138 REFERENCE 3 (bases 1170 to 3283) AUTHORS van Amstel,J.K.P., van der Zanden,A.L., Bakker,E., Reitsma,P.H. and Bertina,R.M. TITLE Two genes homologous with human protein S cDNA are located on chromosome 3 JOURNAL Thromb. Haemost. (1987) In press REFERENCE 4 (bases 1 to 3294) AUTHORS Ploos van Amstel,H.K. TITLE Direct Submission JOURNAL Submitted (15-JUN-1990) to the EMBL/GenBank/DDBJ databases COMMENT Protein S circulates in the plasma as a cofactor of activated protein C helping to prevent coagulation and stimulating fibrinolysis. Its deficiency is assosiated with ab increased risk to develop thrombotic disease. FEATURES Location/Qualifiers source 1..3294 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="adult liver" CDS 124..2154 /note="(AA -24 to 652)" /codon_start=1 /product="preproprotein S" /db_xref="PID:g36579" /db_xref="SWISS-PROT:P07225" /translation="MRVLGGRCGALLACLLLVLPVSEANFLSKQQASQVLVRKRRANS LLEETKQGNLERECIEELCNKEEAREVFENDPETDYFYPKYLVCLRSFQTGLFTAARQ STNAYPDLRSCVNAIPDQCSPLPCNEDGYMSCKDGKASFTCTCKPGWQGEKCEFDINE CKDPSNINGGCSQICDNTPGSYHCSCKNGFVMLSNKKDCKDVDECSLKPSICGTAVCK NIPGDFECECPEGYRYNLKSKSCEDIDECSENMCAQLCVNYPGGYTCYCDGKKGFKLA QDQKSCEVVSVCLPLNLDTKYELLYLAEQFAGVVLYLKFRLPEISRFSAEFDFRTYDS EGVILYAESIDHSAWLLIALRGGKIEVQLKNEHTSKITTGGDVINNGLWNMVSVEELE HSISIKIAKEAVMDINKPGPLFKPENGLLETKVYFAGFPRKVESELIKPINPRLDGCI RSWNLMKQGASGIKEIIQEKQNKHCLVTVEKGSYYPGSGIAQFHIDYNNVSSAEGWHV NVTLNIRPSTGTGVMLALVSGNNTVPFAVSLVDSTSEKSQDILLSVENTVIYRIQALS LCSDQQSHLEFRVNRNNLELSTPLKIETISHEDLQRQLAVLDKAMKAKVATYLGGLPD VPFSATPVNAFYNGCMEVNINGVQLDLDEAISKHNDIRAHSCPSVWKKTKNS" sig_peptide 124..199 CDS 200..246 /note="Author-given protein sequence is in conflict with the conceptual translation; (AA 1 to 17)" /codon_start=1 /product="propiece of latent protein S" /db_xref="PID:e28395" /db_xref="PID:g1335319" /translation="FCQSNRLHKSWLGSVV" misc_feature 470..972 /note="EGF-homologous regions (four)" misc_feature 973..2151 /note="sex hormone binding globulin homology" BASE COUNT 1002 a 594 c 710 g 988 t ORIGIN 1 ctggcgccgc cgcgcagcac ggctcagacc gaggcgcaca ggctcgcagc tccgcggcgc 61 ctagcgctcc ggtccccgcc gcgacgcgcc accgtccctg ccggcgcctc cgcgcgcttc 121 gaaatgaggg tcctgggtgg gcgctgcggg gcgttgctgg cgtgtctcct cctagtgctt 181 cccgtctcag aggcaaactt tttgtcaaag caacaggctt cacaagtcct ggttaggaag 241 cgtcgtgcaa attctttact tgaagaaacc aaacagggta atcttgaaag agaatgcatc 301 gaagaactgt gcaataaaga agaagccagg gaggtctttg aaaatgaccc ggaaacggat 361 tatttttatc caaaatactt agtttgtctt cgctcttttc aaactgggtt attcactgct 421 gcacgtcagt caactaatgc ttatcctgac ctaagaagct gtgtcaatgc cattccagac 481 cagtgtagtc ctctgccatg caatgaagat ggatatatga gctgcaaaga tggaaaagct 541 tcttttactt gcacttgtaa accaggttgg caaggagaaa agtgtgaatt tgacataaat 601 gaatgcaaag atccctcaaa tataaatgga ggttgcagtc aaatttgtga taatacacct 661 ggaagttacc actgttcctg taaaaatggt tttgttatgc tttcaaataa gaaagattgt 721 aaagatgtgg atgaatgctc tttgaagcca agcatttgtg gcacagctgt gtgcaagaac 781 atcccaggag attttgaatg tgaatgcccc gaaggctaca gatataatct caaatcaaag 841 tcttgtgaag atatagatga atgctctgag aacatgtgtg ctcagctttg tgtcaattac 901 cctggaggtt acacttgcta ttgtgatggg aagaaaggat tcaaacttgc ccaagatcag 961 aagagttgtg aggttgtttc agtgtgcctt cccttgaacc ttgacacaaa gtatgaatta 1021 ctttacttgg cggagcagtt tgcaggggtt gttttatatt taaaatttcg tttgccagaa 1081 atcagcagat tttcagcaga atttgatttc cggacatatg attcagaagg cgtgatactg 1141 tacgcagaat ctatcgatca ctcagcgtgg ctcctgattg cacttcgtgg tggaaagatt 1201 gaagttcagc ttaagaatga acatacatcc aaaatcacaa ctggaggtga tgttattaat 1261 aatggtctat ggaatatggt gtctgtggaa gaattagaac atagtattag cattaaaata 1321 gctaaagaag ctgtgatgga tataaataaa cctggacccc tttttaagcc ggaaaatgga 1381 ttgctggaaa ccaaagtata ctttgcagga ttccctcgga aagtggaaag tgaactcatt 1441 aaaccgatta accctcgtct agatggatgt atacgaagct ggaatttgat gaagcaagga 1501 gcttctggaa taaaggaaat tattcaagaa aaacaaaata agcattgcct ggttactgtg 1561 gagaagggct cctactatcc tggttctgga attgctcaat ttcacataga ttataataat 1621 gtatccagtg ctgagggttg gcatgtaaat gtgaccttga atattcgtcc atccacgggc 1681 actggtgtta tgcttgcctt ggtttctggt aacaacacag tgccctttgc tgtgtccttg 1741 gtggactcca cctctgaaaa atcacaggat attctgttat ctgttgaaaa tactgtaata 1801 tatcggatac aggccctaag tctatgttcc gatcaacaat ctcatctgga atttagagtc 1861 aacagaaaca atctggagtt gtcgacacca cttaaaatag aaaccatctc ccatgaagac 1921 cttcaaagac aacttgccgt cttggacaaa gcaatgaaag caaaagtggc cacatacctg 1981 ggtggccttc cagatgttcc attcagtgcc acaccagtga atgcctttta taatggctgc 2041 atggaagtga atattaatgg tgtacagttg gatctggatg aagccatttc taaacataat 2101 gatattagag ctcactcatg tccatcagtt tggaaaaaga caaagaattc ttaaggcatc 2161 ttttctctgc ttataatacc ttttccttgt gtgtaattat acttatgttt caataacagc 2221 tgaagggttt tatttacaat gtgcagtctt tgattatttt gtggtccttt cctgggattt 2281 ttaaaaggtc ctttgtcaag gaaaaaaatt ctgttgtgat ataaatcaca gtaaagaaat 2341 tcttacttct cttgctatct aagaatagtg aaaaataaca attttaaatt tgaatttttt 2401 tcctacaaat gacagtttca atttttgttt gtaaaactaa atttttaatt ttatcatcat 2461 gaactagtgt ctaaatacct atgttttttt cagaaagcaa ggaagtaaac tcaaacaaaa 2521 gtgcgtgtaa ttaaatacta ttaatcatag gcagatacta ttttgtttat gtttttgttt 2581 ttttcctgat gaaggcagaa gagatggtgg tctattaaat atgaattgaa tggagggtcc 2641 taatgcctta tttcaaaaca attcctcagg gggaccagct ttggcttcat ctttctcttg 2701 tgtggcttca catttaaacc agtatcttta ttgaattaga aaacaagtgg gacatatttt 2761 cctgagagca gcacaggaat cttcttcttg gcagctgcag tctgtcagga tgagatatca 2821 gattaggttg gataggtggg gaaatctgaa gtgggtacat tttttaaatt ttgctgtgtg 2881 ggtcacacaa ggtctacatt acaaaagaca gaattcaggg atggaaagga gaatgaacaa 2941 atgtgggagt tcatagtttt ccttgaatcc aacttttaat taccagagta agttgccaaa 3001 atgtgattgt tgaagtacaa aaggaactat gaaaaccaga acaaatttta acaaaaggac 3061 aaccacagag ggatatagtg aatatcgtat cattgtaatc aaagaagtaa ggaggtaaga 3121 ttgccacgtg cctgctggta ctgtgatgca tttcaagtgg cagttttatc acgtttgaat 3181 ctaccattca tagccagatg tgtatcagat gtttcactga cagtttttaa caataaattc 3241 ttttcactgt attttatatc acttataata aatcggtgta taatctaaaa aaaa // LOCUS HSSPTI 1621 bp RNA PRI 27-OCT-1997 DEFINITION H.sapiens mRNA for serine palmitoyltransferase, subunit I. ACCESSION Y08685 NID g2564246 KEYWORDS serine palmitoyltransferase; subunit I. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1621) AUTHORS Stoffel,W. TITLE Direct Submission JOURNAL Submitted (04-OCT-1996) W. Stoffel, Institute For Biochemistry, Medical Faculty, University Of Cologne, Joseph-Stelzmann-Strasse 52, Cologne, 50931, FRG REMARK revised by [3] REFERENCE 2 (bases 1 to 1621) AUTHORS Weiss,B. TITLE Direct Submission JOURNAL Submitted (20-OCT-1997) B.Weiss, Institute For Biochemistry, Medical Faculty, University Of Cologne, Joseph-Stelzmann-Strasse 52, Cologne, 50931, FRG REFERENCE 3 (bases 1 to 1621) AUTHORS Weiss,B. and Stoffel,W. TITLE Human and murine serine-palmitoyl-CoA transferase--cloning, expression and characterization of the key enzyme in sphingolipid synthesis JOURNAL Eur. J. Biochem. 249 (1), 239-247 (1997) MEDLINE 98028405 FEATURES Location/Qualifiers source 1..1621 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HEK293" /dev_stage="embryo" /tissue_type="kidney" CDS 1..1422 /EC_number="2.3.1.50" /codon_start=1 /product="serine palmitoyltransferase, subunit I" /db_xref="PID:e1154174" /db_xref="PID:g2564247" /translation="MATATEQWVLVEMVQALYEAPAYHLILEGILILWIIRLLFSKTY KLQERSDLTVKEKEELIEEWQPEPLVPPVPKDHPALNYNIVSGPPSHKTVVNGKECIN FASFNFLGLLDNPRVKAAALASLKKYGVGTCGPRGFYGTFDVHLDLEDRLAKFMKTEE AIIYSYGFATIASAIPAYSKRGDIVFVDRAACFAIQKGLQASRSDIKLFKHNDMADLE RLLKEQEIEDQKNPRKARVTRRFIVVEGLYMNTGTICPLPELVKLKYKYKARIFLEES LSFGVLGEHGRGVTEHYGINIDDIDLISANMENALASIGGFCCGRSFVIDHQRLSGQG YCFSASLPPLLAAAAIEALNIMEENPGIFAVLKEKCGQIHKALQGISGLKVVGESLSP AFHLQLEESTGSREQDVRLLQEIVDQCMNRSIALTQARYLEKEEKCLPPPSIRVVVTV EQTEEELERAASTIKEVAQAVLL" BASE COUNT 449 a 335 c 393 g 439 t 5 others ORIGIN 1 atggcgaccg ccacggagca gtgggttctg gtggagatgg tacaggcgct ttacgaggct 61 cctgcttacc atcttatttt ggaagggatt ctgatcctct ggataatcag acttcttttc 121 tctaagactt acaaattaca agaacgatct gatcttacag tcaaggaaaa agaagaactg 181 attgaagagt ggcaaccaga acctcttgtt cctcctgtcc caaaagacca tcctgctctc 241 aactacaaca tcgtttcagg ccctccaagc cacaaaactg tggtgaatgg aaaagaatgt 301 ataaacttcg cctcatttaa ttttcttgga ttgttggata accctagggt taaggcagca 361 gctttagcat ctctaaagaa gtatggcgtg gggacttgtg gacccagagg attttatggc 421 acatttgatg ttcatttgga tttggaagac cgcctggcaa aatttatgaa gacagaagaa 481 gccattatat actcatatgg atttgccacc atagccagtg ctattcctgc ttactctaaa 541 agaggggaca ttgtttttgt agatagagct gcctgctttg ctattcagaa aggattacag 601 gcatcccgta gtgacattaa gttatttaag cataatgaca tggctgacct cgagcgacta 661 ctaaaagaac aagagatcga agatcaaaag aatcctcgca aggctcgtgt aactcggcgt 721 ttcattgtag tagaaggatt gtatatgaat actggaacta tttgtcctct tccagaattg 781 gttaagttaa aatacaaata caaagcaaga atcttcctgg aggaaagcct ttcatttgga 841 gtcctaggag agcatggccg aggagtcact gaacactatg gaatcaatat tgatgatatt 901 gatcttatca gtgccaacat ggagaatgca cttgcttcta ttggaggttt ctgctgtggc 961 aggtcttttg taattgacca tcagcgactt tccggccagg gatactgctt ttcagcttcg 1021 ttacctcccc tgttagctgc tgcagcaatt gaggccctca acatcatgga agagaatcca 1081 ggtatttttg cagtgttgaa ggaaaagtgc ggacaaattc ataaagcttt acaaggcatt 1141 tctggattaa aagtggtggg ggagtccctt tctccagcct ttcacctaca actggaagag 1201 agcactgggt ctcgcgagca agatgtcaga ctgcttcagg aaattgtaga tcaatgcatg 1261 aacagaagta ttgcattaac tcaggcgcgc tacttggaga aagaagagaa gtgtctccct 1321 cctcccagca ttcgggttgt ggtcacggtg gaacaaacag aggaagaact ggagagagct 1381 gcgtccacca tcaaggaggt agcccaggcc gtcctgctct aggcagagtc ccgggaccat 1441 ggcctcctgc cacacaacac gcagagagga ctcaagactc ccgctggcca tgggagttgc 1501 ttgnaaagga gagcaagaac atgtggggtc ttttgatagg gttgtttacc caantggtgt 1561 tcagttttng gacccatttg ttgttgacca tgagnagggt tgcttatttt ttttttaant 1621 t // LOCUS HSSPTII 2026 bp RNA PRI 27-OCT-1997 DEFINITION H.sapiens mRNA for serine palmitoyltransferase, subunit II. ACCESSION Y08686 NID g2564248 KEYWORDS serine palmitoyltransferase; subunit II. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2026) AUTHORS Stoffel,W. TITLE Direct Submission JOURNAL Submitted (04-OCT-1996) W. Stoffel, Institute For Biochemistry, Medical Faculty, University Of Cologne, Joseph-Stelzmann-Strasse 52, Cologne, 50931, FRG REMARK revised by [3] REFERENCE 2 (bases 1 to 2026) AUTHORS Weiss,B. TITLE Direct Submission JOURNAL Submitted (20-OCT-1997) B.Weiss, Institute For Biochemistry, Medical Faculty, University Of Cologne, Joseph-Stelzmann-Strasse 52, Cologne, 50931, FRG REFERENCE 3 (bases 1 to 2026) AUTHORS Weiss,B. and Stoffel,W. TITLE Human and murine serine-palmitoyl-CoA transferase--cloning, expression and characterization of the key enzyme in sphingolipid synthesis JOURNAL Eur. J. Biochem. 249 (1), 239-247 (1997) MEDLINE 98028405 FEATURES Location/Qualifiers source 1..2026 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="pancreas" /cell_line="adenocarcinoma cell line" /clone_lib="Stratagene #937208" CDS 49..1737 /EC_number="2.3.1.50" /codon_start=1 /product="serine palmitoyltransferase, subunit II" /db_xref="PID:e1154176" /db_xref="PID:g2564249" /translation="MRPEPGGCCCRRTVRANGCVANGEVRNGYVRSSAAAAAAAAAGQ IHHVTQNGGLYKRPFNEAFEETPMLVAVLTYVGYGVLTLFGYLRDFLRYWRIEKCHHA TEREEQKDFVSLYQDFENFYTRNLYMRIRDNWNRPICSVPGARVDIMERQSHDYNWSF KYTGNIIKGVINMGSYNYLGFARNTGSCQEAAAKVLEEYGAGVCSTRQEIGNLDKHEE LEELVARFLGVEAAMAYGMGFATNSMNIPALVGKGCLILSDELNHASLVLGARLSGAT IRIFKHNNMQSLEKLLKDAIVYGQPRTRRPWKKILILVEGIYSMEGSIVRLPEVIALK KKYKAYLYLDEAHSIGALGPTGRGVVEYFGLDPEDVDVMMGTFTKSFGASGGYIGGKK ELIDYLRTHSHSAVYATSLSPPVVEQIITSMKCIMGQDGTSLGKECVQQLAENTRYFR RRLKEMGFIIYGNEDSPVVPLMLYMPAKIGAFGREMLKRNIGVVVVGFPATPIIESRA RFCLSAAHTKEILDTALKEIDEVGDLLQLKYSRHRLVPLLDRPFDETTYEETED" BASE COUNT 567 a 435 c 529 g 495 t ORIGIN 1 gctgccaccg cctacagagc ctgccttgcg cctggtgctg ccaggaagat gcggccggag 61 cccggaggct gctgctgccg ccgcacggtg cgggcgaatg gctgcgtggc gaacggggaa 121 gtacggaacg ggtacgtgag gagcagcgct gcagccgcag ccgcagccgc cgccggccag 181 atccatcatg ttacacaaaa tggaggacta tataaaagac cgtttaatga agcttttgaa 241 gaaacaccaa tgctggttgc tgtgctcacg tatgtggggt atggcgtact caccctcttt 301 ggatatcttc gagatttctt gaggtattgg agaattgaaa agtgtcacca tgcaacagaa 361 agagaagaac aaaaggactt tgtgtcattg tatcaagatt ttgaaaactt ttatacaagg 421 aatctgtaca tgaggataag agacaactgg aatcggccaa tctgtagtgt gcctggagcc 481 agggtggaca tcatggagag acagtctcat gattataact ggtccttcaa gtatacaggg 541 aatataataa agggtgttat aaacatgggt tcctacaact atcttggatt tgcacggaat 601 actggatcat gtcaagaagc agccgccaaa gtccttgagg agtatggagc tggagtgtgc 661 agtactcggc aggaaattgg aaacctggac aagcatgaag aactagagga gcttgtagca 721 aggttcttag gagtagaagc tgctatggcg tatggcatgg gatttgcaac gaattcaatg 781 aacattcctg ctcttgttgg caaaggttgc ctgattctga gtgatgaact gaaccatgca 841 tcactggttc tgggagccag actgtcagga gcaaccatta gaatcttcaa acacaacaat 901 atgcaaagcc tagagaagct attgaaagat gccattgttt atggtcagcc tcggacacga 961 aggccctgga agaaaattct catccttgtg gaaggaatat atagcatgga gggatctatt 1021 gttcgtcttc ctgaagtgat tgccctcaag aagaaataca aggcatactt gtatctggat 1081 gaggctcaca gcattggcgc cctgggcccc acaggccggg gtgtggtgga gtactttggc 1141 ctggatcccg aggatgtgga tgttatgatg ggaacgttca caaagagttt tggtgcttct 1201 ggaggatata ttggaggcaa gaaggagctg atagactacc tgcgaacaca ttctcatagt 1261 gcagtgtatg ccacgtcatt gtcacctcct gtagtggagc agatcatcac ctccatgaag 1321 tgcatcatgg ggcaggatgg caccagcctt ggtaaagagt gtgtacaaca gttagctgaa 1381 aacaccaggt atttcaggag acgcctgaaa gagatgggct tcatcatcta tggaaatgaa 1441 gactctccag tagtgccttt gatgctctac atgcctgcca aaattggcgc ctttggacgg 1501 gagatgctga agcggaacat cggtgtcgtt gtggttggat ttcctgccac cccaattatt 1561 gagtccagag ccaggttttg cctgtcagca gctcatacca aagaaatact tgatactgct 1621 ttaaaggaga tagatgaagt tggggaccta ttgcagctga agtattcccg tcatcggttg 1681 gtacctctac tggacaggcc ctttgacgag acgacgtatg aagaaacaga agactgagcc 1741 ttgttggtgc tccctcagag gaactctccc tcacccagga cagcctgtgg cctttgtgag 1801 ccagttccag gaaccacact tctgtggcca tctcacgtga aagacattgc ctcagctact 1861 gaaggtggcc acctccactc taaatgacat tttgtaaata gtaaaaaact gcttctaatc 1921 cttcctttgc taaatctcac ctttaaaaac gaaggtgact cactttgctt tttcagtcca 1981 ttaaaaaaac attttatttt gcaaccaaaa aaaaaaaaaa aaaaaa // LOCUS HSSPYRAT 1487 bp RNA PRI 25-APR-1991 DEFINITION Human Ser-PyrAT mRNA for serine-pyruvate aminotransferase. ACCESSION X56092 NID g36581 KEYWORDS aminotransferase; Ser-PyrAT; serine pyruvate aminotransferase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1487) AUTHORS Nishiyama,K. TITLE Direct Submission JOURNAL Submitted (13-SEP-1990) Nishiyama K., Hamamatsu University School of Medicine, Department of Biochemistry, 3600 Handa-cho Hamamatsu, Shizuoka 431-31, Japan REFERENCE 2 (bases 1 to 1487) AUTHORS Nishiyama,K., Berstein,G., Oda,T. and Ichiyama,A. TITLE Cloning and nucleotide sequence of cDNA encoding human liver serine-pyruvate aminotransferase JOURNAL Eur. J. Biochem. 194 (1), 9-18 (1990) MEDLINE 91071216 FEATURES Location/Qualifiers source 1..1487 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult male (62 years old)" /tissue_type="liver" /clone="pHspT12,pHspT16" mRNA 1..1487 mat_peptide 22..1197 /gene="Ser-PyrAT" /EC_number="2.6.1.51" /product="serine--pyruvate aminotransferase" gene 22..1200 /gene="Ser-PyrAT" CDS 22..1200 /gene="Ser-PyrAT" /EC_number="2.6.1.51" /codon_start=1 /product="serine--pyruvate aminotransferase" /db_xref="PID:g36582" /db_xref="SWISS-PROT:P21549" /translation="MASHKLLVTPPKALLKPLSIPNQLLLGPGPSNLPPRIMAAGGLQ MIGSMSKDMYQIMDEIKEGIQYVFQTRNPLTLVISGSGHCALEAALVNVLEPGDSFLV GANGIWGQRAVDIGERIGARVHPMTKDPGGHYTLQEVEEGLAQHKPVLLFLTHGESST GVLQPLDGFGELCHRYKCLLLVDSVASLGGTPLYMDRQGIDILYSGSQKALNAPPGTS LISFSDKAKKKMYSRKTKPFSFYLDIKWLANFWGCDDQPRMYHHTIPVISLYSLRESL ALIAEQGLENSWRQHREAAAYLHGRLQALGLQLFVKDPALRLPTVTTVAVPAGYDWRD IVSYVIDHFDIEIMGGLGPSTGKVLRIGLLGCNATRENVDRVTEALRAALQHCPKKKL " BASE COUNT 298 a 493 c 443 g 253 t ORIGIN 1 gcggcaggtt gggtgcggac catggcctct cacaagctgc tggtgacccc ccccaaggcc 61 ctgctcaagc ccctctccat ccccaaccag ctcctgctgg ggcctggtcc ttccaacctg 121 cctcctcgca tcatggcagc cggggggctg cagatgatcg ggtccatgag caaggatatg 181 taccagatca tggacgagat caaggaaggc atccagtacg tgttccagac caggaaccca 241 ctcacactgg tcatctctgg ctcgggacac tgtgccctgg aggccgccct ggtcaatgtg 301 ctggagcctg gggactcctt cctggttggg gccaatggca tttgggggca gcgagccgtg 361 gacatcgggg agcgcatagg agcccgagtg cacccgatga ccaaggaccc tggaggccac 421 tacacactgc aggaggtgga ggagggcctg gcccagcaca agccagtgct gctgttctta 481 acccacgggg agtcgtccac cggcgtgctg cagccccttg atggcttcgg ggaactctgc 541 cacaggtaca agtgcctgct cctggtggat tcggtggcat ccctgggcgg gacccccctt 601 tacatggacc ggcaaggcat cgacatcctg tactcgggct cccagaaggc cctgaacgcc 661 cctccaggga cctcgctcat ctccttcagt gacaaggcca aaaagaagat gtactcccgc 721 aagacgaagc ccttctcctt ctacctggac atcaagtggc tggccaactt ctggggctgt 781 gacgaccagc ccaggatgta ccatcacaca atccccgtca tcagcctgta cagcctgaga 841 gagagcctgg ccctcattgc ggaacagggc ctggagaaca gctggcgcca gcaccgcgag 901 gccgcggcgt atctgcatgg gcgcctgcag gcactggggc tgcagctctt cgtgaaggac 961 ccggcgctcc ggcttcccac agtcaccact gtggctgtac ccgctggcta tgactggaga 1021 gacatcgtca gctacgtcat agaccacttc gacattgaga tcatgggtgg ccttgggccc 1081 tccacgggga aggtgctgcg gatcggcctg ctgggctgca atgccacccg cgagaatgtg 1141 gaccgcgtga cggaggccct gagggcggcc ctgcagcact gccccaagaa gaagctgtga 1201 cctgcccact ggcacacagc tggcactggc acacacctgt cccatgccca ccctgaggga 1261 tcaggagcaa acagaccctg caaggtcctc caggcctggg gacaggaaag ccactgaccc 1321 agcccgggag gcagaaccag gcagcctccc tggccccagg cagccctttt ccctccagtg 1381 gcacctcctg gaaacagtcc acttgggcgc aaaacccagt gccttccaaa tgagctgcag 1441 tccccaggcc atgagcctcc cgggaatgtt taataaaggg cctggcc // LOCUS HSSQUSYN 2051 bp RNA PRI 17-DEC-1993 DEFINITION H.sapiens mRNA for squalene synthase. ACCESSION X69141 NID g435676 KEYWORDS farnesyl-diphosphate farnesyltransferase; squalene synthetase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2051) AUTHORS Charles,A.D. TITLE Direct Submission JOURNAL Submitted (09-NOV-1992) A.D. Charles, ICI Parmaceuticals, Biotechnology Dept, Alderley Park, NR Macclesfield, Cheshire SK10 4TG, UK REFERENCE 2 (bases 1 to 2051) AUTHORS Summers,C., Karst,F. and Charles,A.D. TITLE Cloning, expression and characterisation of the cDNA encoding human hepatic squalene synthase, and its relationship to phytoene synthase JOURNAL Gene 136 (1-2Che), 185-192 (1993) MEDLINE 94123996 FEATURES Location/Qualifiers source 1..2051 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="liver" /clone_lib="lambda ZAPXR (stratagene)" CDS 92..1345 /EC_number="2.5.1.21" /note="squalene synthase" /codon_start=1 /product="farnesyl-diphosphate farnesyltransferase" /db_xref="PID:g435677" /db_xref="SWISS-PROT:P37268" /translation="MEFVKCLGHPEEFYNLVRFRIGGKRKVMPKMDQDSLSSSLKTCY KYLNQTSRSFAAVIQALDGEMRNAVCIFYLVLRALDTLEDDMTISVEKKVPLLHNFHS FLYQPDWRFMESKEKDRQVLEDFPTISLEFRNLAEKYQTVIADICRRMGIGMAEFLDK HVTSEQEWDKYCHYVAGLVGIGLSRLFSASEFEDPLVGEDTERANSMGLFLQKTNIIR DYLEDQQGGREFWPQEVWSRYVKKLGDFAKPENIDLAVQCLNELITNALHHIPDVITY LSRLRNQSVFNFCAIPQVMAIATLAACYNNQQVFKGAVKIRKGQAVTLMMDATNMPAV KAIIYQYMEEIYHRIPDSDPSSSKTRQIISTIRTQNLPNCQLISRSHYSPIYLSFVML LAALSWQYLATLSQVTEDYVQTGEH" polyA_signal 1883..1888 polyA_signal 2011..2016 BASE COUNT 551 a 449 c 499 g 552 t ORIGIN 1 cgagacctac tccacaggtc cagccggccg gtgagcgcct ggggaccgca gaggtgagag 61 tcgcgcccgg gagtccgccg cctgcgccag gatggagttc gtgaaatgcc ttggccaccc 121 cgaagagttc tacaacctgg tgcgcttccg gatcgggggc aagcggaagg tgatgcccaa 181 gatggaccag gactcgctca gcagcagcct gaaaacttgc tacaagtatc tcaatcagac 241 cagtcgcagt ttcgcagctg ttatccaggc gctggatggg gaaatgcgca acgcagtgtg 301 catattttat ctggttctcc gagctctgga cacactggaa gatgacatga ccatcagtgt 361 ggaaaagaag gtcccgctgt tacacaactt tcactctttc ctttaccaac cagactggcg 421 gttcatggag agcaaggaga aggatcgcca ggtgctggag gacttcccaa cgatctccct 481 tgagtttaga aatctggctg agaaatacca aacagtgatt gccgacattt gccggagaat 541 gggcattggg atggcagagt ttttggataa gcatgtgacc tctgaacagg agtgggacaa 601 gtactgccac tatgttgctg ggctggtcgg aattggcctt tcccgtcttt tctcagcctc 661 agagtttgaa gaccccttag ttggtgaaga tacagaacgt gccaactcta tgggcctgtt 721 tctgcagaaa acaaacatca tccgtgacta tctggaagac cagcaaggag gaagagagtt 781 ctggcctcaa gaggtttgga gcaggtatgt taagaagtta ggggattttg ctaagccgga 841 gaatattgac ttggccgtgc agtgcctgaa tgaacttata accaatgcac tgcaccacat 901 cccagatgtc atcacctacc tttcgagact cagaaaccag agtgtgttta acttctgtgc 961 tattccacag gtgatggcca ttgccacttt ggctgcctgt tataataacc agcaggtgtt 1021 caaaggggca gtgaagattc ggaaagggca agcagtgacc ctgatgatgg atgccaccaa 1081 tatgccagct gtcaaagcca tcatatatca gtatatggaa gagatttatc atagaatccc 1141 cgactcagac ccatcttcta gcaaaacaag gcagatcatc tccaccatcc ggacgcagaa 1201 tcttcccaac tgtcagctga tttcccgaag ccactactcc cccatctacc tgtcgtttgt 1261 catgcttttg gctgccctga gctggcagta cctggccact ctctcccagg taacagaaga 1321 ctatgttcag actggagaac actgatccca aatttgtcca tagctgaagt ccaccataaa 1381 gtggatttac tttttttctt taaggatgga tgttgtgttc tctttatttt tttcctacta 1441 ctttaatccc taaaagaacg ctgtgtggct gggaccttta ggaaagtgaa atgcaggtga 1501 gaagaaccta aacatgaaag gaaagggtgc ctcatcccag caacctgtcc ttgtgggtga 1561 tgatcactgt gctgcttgtg gctcatggca gagcattcag tgccacggtt taggtgaagt 1621 cgctgcatat gtgactgtca tgagatccta cttagtatga tcctggctag aatgataatt 1681 aaaagtattt aatttgaagc accatttgaa tgttcgtaat agtagaaaat gatgtgaatt 1741 ttctttctgt tcggctccta tttttctcat cattttgttt tctttaattg ggttgaatgg 1801 agtagataga aatatttatg gtttaggtaa cagttagatg tttcctaaga atgcaaactg 1861 ccttttccac acaaaggctg ggaataaaat tctgggtatt ctcgtattct catttaaagg 1921 agtttagctt tcagagagaa acagcaggat tgcttttgac cttttagaag attggtctcc 1981 agtaaaggtg gacatttttg agatttttat aataaagaat ttaattgctc tgcaaaaaaa 2041 aaaaaaaaaa a // LOCUS HSSRCYP 2695 bp RNA PRI 30-JUN-1997 DEFINITION H.sapiens mRNA for SRcyp protein. ACCESSION X99717 NID g1770525 KEYWORDS SRcyp protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2695) AUTHORS Bourquin,J.P., Stagljar,I., Meier,P., Moosmann,P., Silke,J., Baechi,T., Georgiev,O. and Schaffner,W. TITLE A serine/arginine-rich nuclear matrix cyclophilin interacts with the C-terminal domain of RNA polymerase II JOURNAL Nucleic Acids Res. 25 (11), 2055-2061 (1997) MEDLINE 97298154 REFERENCE 2 (bases 1 to 2695) AUTHORS Bourquin,J.P. TITLE Direct Submission JOURNAL Submitted (01-AUG-1996) J.P. Bourquin, University Of Zurich, Institute For Molecular Biology, Winterthurerstrasse 190, CH-8057 Zurich, SWITZERLAND FEATURES Location/Qualifiers source 1..2695 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="peripheral blood lymphocytes" CDS 158..2422 /codon_start=1 /product="SRcyp protein" /db_xref="PID:e268465" /db_xref="PID:g1770526" /translation="MGIKVQRPRCFFDIAINNQPAGRVVFELFSDVCPKTCENFRCLC TGEKGTGKSTQKPLHYKSCLFHRVVKDFMVQGGDFSEGNGRGGESIYGGFFEDESFAV KHNKEFLLSMANRGKDTNGSQFFITTKPTPHLDGHHVVFGQVISGQEVVREIENQKTD AASKPFAEVRILSCGELIPKSKVKKEEKKRHKSSSSSSSSSSDSDSSSDSQSSSDSSD SESATEEKSKKRKKKHRKNSRKHKKEKKKRKKSKKSASSESEAENLEAQPQSTVRPEE IPPIPENRFLMRKSPPKADEKERKNRERERERECNPPNSQPASYQRRLLVTRSGRKIK GRGPRRYRTPSRSRSRDRFRRSETPPHWRQEMQRAQRMRVSSGERWIKGDKSELNEIK ENQRSPVRVKERKITDHRNVSESPNRKNEKEKKVKDHKSNSKERDIRRNSEKDDKYKN KVKKRAKSKSRSKSKEKSKSKERDSKHNRNEEKRMRSRSKGRDHENVKEKEKQSDSKG KDQERSRSKEKSKQLESKSNEHDHSKSKEKDRRAQSRSRECDITKGKHSYNSRTRERS RSRDRSRRVRSRTHDRDRSRSKEYHRYREQEYRRRGRSRSRERRTPPGRSRSKDRRRR RRDSRSSEREESQSRNKDKYRNQESKSSHRKENSESEKRMYSKSRDHNSSNNSREKKA DRDQSPFSKIKQSSQDDELKSSMLKNKEDEKIRSSVEKENQKSKGQENDHVHEKNKKF DHESSPGTDEDKSG" polyA_signal 2620..2625 BASE COUNT 1096 a 406 c 617 g 576 t ORIGIN 1 gcgggcttta gcgccttttc tggcggcggt agatttgaag cgcttcaaag gaccggaccc 61 agagaagagg aaaactctac cggtgcagga gcacagggat cagttgtcct tgtttttttt 121 tggtcttttc ttcatttgaa gattaagtat tggagccatg ggaataaagg ttcaacgtcc 181 tcgatgtttt tttgacattg ccattaacaa tcaacctgct ggaagagttg tctttgaatt 241 attttctgat gtgtgcccca aaacatgcga gaactttcgt tgtctttgta caggtgaaaa 301 ggggaccggg aaatcaactc agaaaccatt acattataag agttgtctct ttcacagagt 361 tgtcaaggat tttatggttc aaggtggtga cttcagtgaa ggaaatggac gaggagggga 421 atctatctat ggaggatttt ttgaagacga gagtttcgct gttaaacaca acaaagaatt 481 tctcttgtca atggccaaca gagggaagga tacaaatggt tcacagttct tcataacaac 541 gaaaccaact cctcatttag atgggcatca tgttgttttt ggacaagtaa tctctggtca 601 agaagttgta agagagattg aaaaccagaa aacagatgca gctagcaaac cgtttgcgga 661 ggtacggata ctcagttgtg gagagctgat tcccaaatct aaagttaaga aagaagaaaa 721 gaaaaggcat aaatcatcat catcttcctc ctcctcatct agtgactcag atagctcaag 781 tgattctcag tcctcttctg attcctctga ttccgaaagt gctactgaag agaaatcaaa 841 gaaaagaaaa aagaaacatc ggaaaaattc ccgaaaacac aagaaagaaa agaaaaagcg 901 aaagaaaagc aagaagagtg catctagtga gagtgaagct gaaaatcttg aagcacaacc 961 ccagtctact gtccgtccag aagagatccc tcctatacct gaaaatagat tcctaatgag 1021 aaaaagtcct cctaaagctg atgagaagga aaggaaaaac agagagagag aaagggaaag 1081 agagtgtaat ccacctaact cccagcctgc ttcataccag agacgacttt tagttactag 1141 atctggcagg aaaattaaag gaagaggacc aaggcgttat cgaactcctt ccagatccag 1201 atcaagggat cgtttcagac gtagtgagac tcctccacat tggaggcaag agatgcagag 1261 agctcaaaga atgagggtat caagtggtga aagatggatc aagggggata agagtgagtt 1321 gaatgaaata aaagaaaatc agagaagtcc agttagagta aaagagagaa aaataacaga 1381 tcacaggaat gtatctgaga gtccaaacag aaaaaatgaa aaggagaaga aagttaaaga 1441 ccataaatct aacagcaaag agagagacat cagaagaaat tcagaaaaag atgacaagta 1501 taaaaacaaa gtgaagaaaa gggccaaatc taaaagtagg agtaagagca aagagaaatc 1561 aaagagtaaa gaaagagatt caaaacataa tagaaatgaa gaaaagagga tgaggtcaag 1621 gagtaaagga agggatcatg aaaatgttaa agaaaaagaa aagcagtctg attctaaagg 1681 aaaagatcag gaaaggagta gaagtaaaga gaagtctaaa cagttagaat caaagagtaa 1741 tgagcatgat cacagtaaaa gtaaggaaaa ggatagacgc gcacaatcca ggagtagaga 1801 atgtgatata actaaaggta aacacagtta taatagcaga acaagagaac gaagcagaag 1861 tagggacaga agcagaagag tgcgatcaag aacccatgac agagatcgca gcagaagcaa 1921 ggagtaccat agatacagag aacaggaata caggagaaga ggacggtcac gaagccgaga 1981 gagaagaaca ccaccaggaa gatcaagaag taaagatagg aggagaagga ggagagactc 2041 acggagctca gagagagaag aaagtcaaag cagaaacaaa gacaaataca gaaaccaaga 2101 gagtaagagc tcacacagaa aagaaaattc tgagagtgag aaaagaatgt actctaaaag 2161 tcgtgatcat aatagctcaa ataacagcag ggaaaaaaag gctgatagag atcaaagtcc 2221 cttctcaaaa ataaaacaaa gcagtcagga cgatgaatta aagtcctcca tgttgaaaaa 2281 taaggaggat gagaagatca gatcctcagt ggaaaaagaa aaccaaaaat caaaaggtca 2341 agaaaatgac catgtacatg aaaaaaataa aaaatttgat catgaatcaa gccctggaac 2401 agatgaagac aaaagcggat gagtgagtta tataaactta cttccattct gtttcggatt 2461 ttaagtttga gagacttgct aatgaatctc ctttatgttg ttttcctttt cattgttttt 2521 ggattgtttt atgtttgtcc ttttttttct taatgtggat ttcattgagt tgattttttg 2581 ataatctgca atctggataa tttgtactgc taaagtttta ataaactcga catgagaaaa 2641 acaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaa // LOCUS HSSRP14A 721 bp RNA PRI 07-JUL-1993 DEFINITION H.sapiens mRNA for signal recognition particle subunit 14. ACCESSION X73459 NID g313660 KEYWORDS signal recognition particle subunit 14. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 721) AUTHORS Leffers,H. TITLE The human signal recognition particle subunit (SRP14) mRNA includes a partially translated OPA repeat JOURNAL Unpublished REFERENCE 2 (bases 1 to 721) AUTHORS Leffers,H. TITLE Direct Submission JOURNAL Submitted (21-JUN-1993) H. Leffers, Institute of Medical Biochemistry and Danish Centre for Human Genome Research, Ole Worms Alle 170, Aarhus University, 8000 Aarhus C, DENMARK FEATURES Location/Qualifiers source 1..721 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="AMA" /clone_lib="lambda ZapII/AMA cDNA" /clone="R32" CDS 20..430 /codon_start=1 /product="signal recognition particle subunit 14" /db_xref="PID:g313661" /db_xref="SWISS-PROT:P37108" /translation="MVLLESEQFLTELTRLFQKCRTSGSVYITLKKYDGRTKPIPKKG TVEGFEPADNKCLLRATDGKKKISTVVSSKEVNKFQMAYSNLLRANMDGLKKRDKKNK TKKTKAAAAAAAAAPAAAATAATTAATTAATAAQ" BASE COUNT 216 a 150 c 178 g 177 t ORIGIN 1 tcgagccagc gtcgccgcga tggtgttgtt ggagagcgag cagttcctga cggagctgac 61 cagacttttc cagaagtgcc ggacgtcggg cagcgtctat atcaccttga agaagtatga 121 cggtcgaacc aaacccattc caaagaaagg tactgtggag ggctttgagc ccgcagacaa 181 caagtgtctg ttaagagcta ccgatgggaa gaagaagatc agcactgtgg tgagctccaa 241 ggaagtgaat aagtttcaga tggcttattc aaacctcctt agagctaaca tggatgggtt 301 gaagaagaga gacaaaaaga acaaaactaa gaagaccaaa gcagcagcag cagcagcagc 361 agcagcacct gccgcagcag caacagcagc aacaacagca gcaacaacag cagcaacagc 421 agcacagtaa agggcataca tttcctgctt tcaccaatta accactgaat tgctattttt 481 tccttttggc cagatagcta ggtttctggt tcccccacag taggtgtttt cacataagat 541 tagggtcctt ttggaaagaa tagttgcagt gtttatagga tagttgtggt aagaatctag 601 tttattttgc atttggctaa ttggtctgtg ctgcatggtt atatactcct ggattataga 661 ttaaaagtct ctgtagacat ctctgtgaag agcaagctat cattaaacat gtctgtttat 721 c // LOCUS HSSSR 1089 bp RNA PRI 16-MAR-1994 DEFINITION H.sapiens mRNA for TRAP beta subunit. ACCESSION X74104 NID g452756 KEYWORDS endoplasmic reticulum translocation; SSR beta subunit; translocon-associated protein; TRAP gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1089) AUTHORS Bodescot,M. TITLE Direct Submission JOURNAL Submitted (09-JUL-1993) M. Bodescot, Inst. Gustave-Roussy, Lab. d' Oncologie Moleculaire, PR1, rue Camille Desmoulins, 94805 Villejuif, FRANCE REFERENCE 2 (bases 1 to 1089) AUTHORS Bodescot,M. and Brison,O. TITLE Cloning and sequence analysis of the beta subunit of the human translocon-associated protein JOURNAL Biochim. Biophys. Acta 1217 (1), 101-102 (1994) MEDLINE 94114564 FEATURES Location/Qualifiers source 1..1089 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="SW613-S clone 3" /clone="7" /clone_lib="cDNA library 10/17 /92" /tissue_type="colon carcinoma" gene 51..602 /gene="TRAP" CDS 51..602 /gene="TRAP" /note="beta subunit" /codon_start=1 /product="translocon-associated protein" /db_xref="PID:g452757" /db_xref="SWISS-PROT:P43308" /translation="MRLLSFVVLALFAVTQAEEGARLLASKSLLNRYAVEGRDLTLQY NIYNVGSSAALDVELSDDSFPPEDFGIVSGMLNVKWDRIAPASNVSHTVVLRPLKAGY FNFTSATITYLAQEDGPVVIGSTSAPGQGGILAQREFDRRFSPHFLDWAAFGVMTLPS IGIPLLLWYSSKRKYDTPKTKKN" repeat_region 620..641 /note="first copy" repeat_region 642..663 /note="second copy" polyA_signal 1072..1077 BASE COUNT 240 a 296 c 273 g 280 t ORIGIN 1 ggctctcttc ctgtctttgt ggctccggaa aggcgtttgg gatgccaacg atgaggctgc 61 tgtcatttgt ggtgttggct ctatttgctg tcactcaagc agaggaagga gccaggcttt 121 tggcttccaa atcactgctg aacagatacg ccgtggaggg acgagacctg accttgcagt 181 acaacatcta caatgttggc tcaagtgctg cattagacgt ggaactatct gatgattcct 241 tccctccaga agactttggc attgtgtctg gaatgctcaa tgtcaaatgg gaccggattg 301 cccctgctag caatgtctcc cacactgtgg tcctgcgccc tctcaaggct ggttatttca 361 acttcacctc ggcaacaatt acttacctgg cccaggagga tgggcccgtt gtgattggct 421 ctaccagtgc acctggacag ggaggaatcc tggctcagcg ggagtttgac aggcgattct 481 cccctcattt tctggactgg gcagcctttg gggtcatgac ccttccctcc atcggcatcc 541 ccctgctatt gtggtactcc agcaagagga aatatgacac tcccaaaacg aagaagaact 601 gattggggct tccacagccc tcctctccca agaaatccag gctcctctcc caagaaatcc 661 aggtgctttc cagactccaa agggtatctt aaatgcaatc tcttctctct tagcccttgg 721 ccactttctc ctggatcctg ccctgctctc agccatagtg aaggaccagc cctaggagtc 781 tgcgagagcc tccttggttc catcgtgaag ccataaacag gaatgccttt ggcaatagcc 841 ttgagcctag agggccctct gatgccccac tgaggtgctg ttggtttatt gctggcaacg 901 tgaattctct caggggtcta ggaggggcat tttggagact gcctgacacc acccctatcc 961 cctgcctccc cctctcagaa gagggtggaa gatgaaatga aagctatggg actcttggag 1021 gatacccagt gtctattctg ggttagagaa gtgcttacta aggggttttc taataaaaac 1081 aaatgccac // LOCUS HSSSRALP 974 bp RNA PRI 29-SEP-1994 DEFINITION H.sapiens mRNA for SSR alpha subunit. ACCESSION Z12830 NID g551637 KEYWORDS SSR alpha subunit. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 974) AUTHORS Hartmann,E. and Prehn,S. TITLE The N-terminal region of the alpha-subunit of the TRAP complex has a conserved cluster of negative charges JOURNAL FEBS Lett. 349 (3), 324-326 (1994) MEDLINE 94326944 REFERENCE 2 (bases 1 to 974) AUTHORS Hartmann,E. TITLE Direct Submission JOURNAL Submitted (24-JUN-1992) Hartmann E., Max-Delbrueck-Centrum fuer Molekulare Medizin, Molekulare Zellfforschung, Robert-Roessle-Str.10, Berlin-Buch, Deutschland, O-1115 FEATURES Location/Qualifiers source 1..974 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 30..890 /codon_start=1 /product="SSR alpha subunit" /db_xref="PID:g551638" /db_xref="SWISS-PROT:P43307" /translation="MRLLPRLLLLLLLVFPATVLFRGGPRGSLAVAQDLTEDEETVED SIIEDEDDEAEVEEDEPTDLVEDKEEEDVSGEPEASPSADTTILFVKGEDFPANNIVK FLVGFTNKGTEDFIVESLDASFRYPQDHQFYIQNFTALPLNTVVPPQRQATFEYSFIP AEPMGGRPFGLVINLNYKDLNGNVFQDAVFNQTVTVIEREDGLDGETIFMYMFLAGLG LLVIVGLHQLLESRKRKRPIQKVEMGTSSQNDVDMSWIPQETLNQINKASPRRLPRKR AQKRSVGSDE" BASE COUNT 289 a 201 c 228 g 256 t ORIGIN 1 cggaaactgg acactggacc ggcagcgcca tgagactcct cccccgcttg ctgctgcttc 61 tcttactcgt gttccctgcc actgtcttgt tccgaggcgg ccccagaggc tcgttagcag 121 tggcacaaga tcttacagag gatgaagaaa cagtagaaga ttccataatt gaggatgaag 181 atgatgaagc cgaggtagaa gaagatgaac ccacagattt ggtagaagat aaagaggaag 241 aagatgtgtc tggtgaacct gaagcttcac cgagtgcaga tacaactata ctgtttgtaa 301 aaggagaaga ttttccagca aataacattg tgaagttcct ggtaggcttt accaacaagg 361 gtacagaaga ttttattgtt gaatccttag atgcctcatt ccgttatcct caggaccacc 421 agttttatat ccagaatttc acagctcttc ctctgaacac tgtagtgcca ccccagagac 481 aggcaacttt tgagtactct ttcattcctg cagagcccat gggcggacga ccatttggtt 541 tggtcatcaa tctgaactac aaagatttga acggcaatgt attccaagat gcagtcttca 601 atcaaacagt tacagttatt gaaagagagg atgggttaga tggagaaaca atctttatgt 661 atatgttcct tgctggtctt gggcttctgg ttattgttgg ccttcatcaa ctcctagaat 721 ctagaaagcg taagagaccc atacagaaag tagaaatggg tacatcaagt cagaatgatg 781 ttgacatgag ttggattcct caggaaacat tgaatcaaat caataaagct tcaccaagaa 841 ggttgcccag gaaacgggca cagaagagat cagtgggatc tgatgagtaa atgttccttt 901 gtgcaacaat tcggtcttta cttaacctgc cctaatattt ttcggcctga tgggaattag 961 tgcagagaag ccaa // LOCUS HSST30II 1623 bp RNA PRI 12-SEP-1997 DEFINITION H.sapiens mRNA for Gal beta-1,3 GalNAc alpha-2,3-sialyltransferase. ACCESSION X96667 NID g1235530 KEYWORDS beta-galactoside alpha-2,3-sialyltransferase; ST3(0)-II gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1623) AUTHORS Giordanengo,V., Bannwarth,S., Laffont,C., Van Miegem,V., Harduin-Lepers,A., Delannoy,P. and Lefebvre,J.C. TITLE Cloning and expression of cDNA for a human Gal(beta1-3)GalNAc alpha2,3-sialyltransferase from the CEM T-cell line JOURNAL Eur. J. Biochem. 247 (2), 558-566 (1997) MEDLINE 97409982 REFERENCE 2 (bases 1 to 1623) AUTHORS Giordanengo,V. TITLE Direct Submission JOURNAL Submitted (14-MAR-1996) V. Giordanengo, Laboratoire de Virologie, Faculte de Medecine, Av de Valombrose, 06107, Nice CEDEX 2, FRANCE FEATURES Location/Qualifiers source 1..1623 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="CCRF-CEM" gene 297..1349 /gene="ST3(0)-II" CDS 297..1349 /gene="ST3(0)-II" /EC_number="2.4.99.4" /codon_start=1 /product="beta-galactoside alpha-2,3-sialyltransferase" /db_xref="PID:e229619" /db_xref="PID:g1235531" /translation="MKCSLRVWFLSVAFLLVFIMSLLFTYSHHSMATLPYLDSGALDG THRVKLVPGYAGLQRLSKERLSGKSCACRRCMGDAGASDWFDSHFDGNISPVWTRENM DLPPDVQRWWMMLQPQFKSHNTNEVLEKLFQIVPGENPYRFRDPHQCRRCAVVGNSGN LRGSGYGQDVDGHNFIMRMNQAPTVGFEQDVGSRTTHHFMYPESAKNLPANVSFVLVP FKVLDLLWIASALSTGQIRFTYAPVKSFLRVDKEKVQIYNPAFFKYIHDRWTEHHGRY PSTGMLVLFFALHVCDEVNVYGFGADSRGNWHHYWENNRYAGEFRKTGVHDADFEAHI IDMLAKASKIEVYRGN" BASE COUNT 306 a 521 c 500 g 296 t ORIGIN 1 accagggtgg caggagaggc agagcctctg tggcctagct agtgacggag agacccgatg 61 aagccctaag caggggcccc gcctgactca gggacaggac agccactcct gccaacgtgt 121 gttctcccta catgagggag ggcgtggcaa gggacccctg ccactgtccc ctgctgcagc 181 acgtgcccct atgcccttta catgtggtgc cagaataggc aggctacgcc gtggctggcc 241 cctcagcggg ctgggaaaag agtggccacg gtgaccgtca cccgcctgcc ggcaccatga 301 agtgctccct gcgggtgtgg ttcctctccg tggccttcct gctggtgttc atcatgtccc 361 tgctcttcac ctactcgcac cacagcatgg ccacgctccc ctacctggac tcaggggccc 421 tggatgggac gcaccgggtg aagctggtgc ccggctatgc cggcctgcag cgcctcagca 481 aggagaggct ctcgggcaag agctgtgcct gtcgccgctg catgggcgat gccggtgcct 541 ccgactggtt tgacagccac tttgacggta acatttcccc cgtctggacc cgagagaaca 601 tggatcttcc accggacgtc cagaggtggt ggatgatgct gcagccccag ttcaagtcac 661 acaacaccaa tgaggtgctg gagaagctgt tccagatagt gcctggcgag aacccctacc 721 gcttccggga cccccaccag tgccggcgct gtgccgtggt ggggaactcg ggcaacctgc 781 ggggctctgg ctatgggcag gacgtggacg ggcacaactt catcatgagg atgaatcagg 841 cgccaaccgt gggctttgag caggatgttg gcagccgaac cacccaccat ttcatgtacc 901 ctgagagtgc caagaacctg cccgccaacg tcagcttcgt gctggtgccc ttcaaggtcc 961 tggaccttct gtggatcgcc agcgccttgt ccacggggca gatccgattc acctacgccc 1021 cagtgaagtc cttccttcga gtggataaag aaaaggtcca gatctacaac ccagccttct 1081 tcaagtatat ccacgacagg tggacagagc atcacgggcg gtacccttcc acggggatgc 1141 tggtgctttt ctttgccctg catgtgtgtg atgaggtgaa cgtgtacggg ttcggggccg 1201 acagccgggg caactggcac cactactggg agaacaaccg gtacgcgggc gagttccgga 1261 agactggcgt gcacgacgcg gacttcgagg cccacatcat cgacatgctg gccaaggcca 1321 gcaagatcga agtctaccgg ggcaactgag ccgggcctcg ccgcgaccct tccggcccat 1381 ctatcgggca ccggggctcc ggcccgggac ccaggaccag caacccgcga ccaatcatgc 1441 tgcagcccag gggcgtctgc tgtgccccgc caatcacgag actgggggac cggccgggcc 1501 tggcaccaat ctgcgctgcg gtcgggcgga gcttctgttt ctcccagcca atcatgtgac 1561 tcaaggaaaa cttccggcgc tgtgtccagt ctcctccaat caatggcctt cgggggcggg 1621 cca // LOCUS HSSTAF50 2811 bp RNA PRI 12-JUL-1995 DEFINITION H.sapiens Staf50 mRNA. ACCESSION X82200 NID g899299 KEYWORDS interferon inducible gene; staf50 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2811) AUTHORS Tissot,C. and Mechti,N. TITLE Molecular cloning of a new interferon-induced factor that represses human immunodeficiency virus type 1 long terminal repeat expression JOURNAL J. Biol. Chem. 270 (25), 14891-14898 (1995) MEDLINE 95318041 REFERENCE 2 (bases 1 to 2811) AUTHORS Mechti,N. TITLE Direct Submission JOURNAL Submitted (13-OCT-1994) N. Mechti, Institut de Genetique Moleculaire de Montpellier, 1919 route de Mende, 34033 Montpellier Cedex, FRANCE FEATURES Location/Qualifiers source 1..2811 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lymphoblastoid" /cell_line="DAUDI" gene 123..1451 /gene="Staf50" CDS 123..1451 /gene="Staf50" /note="interferon-induced" /codon_start=1 /product="gpStaf50" /db_xref="PID:g899300" /translation="MDFSVKVDIEKEVTCPICLELLTEPLSLDCGHSFCQACITAKIK ESVIISRGESSCPVCQTRFQPGNLRPNRHLANIVERVKEVKMSPQEGQKRDVCEHHGK KLQIFCKEDGKVICWVCELSQEHQGHQTFRINEVVKECQEKLQVALQRLIKEDQEAEK LEDDIRQERTAWKIERQKILKGFNEMRVILDNEEQRELQKLEEGEVNVLDNLAAATDQ LVQQRQDASTLISDLQRRLTGSSVEMLQDVIDVMKRSESWTLKKPKSVSKKLKSVFRV PDLSGMLQVLKELTDVQYYWVDVMLNPGSATSNVAISVDQRQVKTVRTCTFKNSNPCD FSAFGVFGCQYFSSGKYYWEVDVSGKIAWILGVHSKISSLNKRKSSGFAFDPSVNYSK VYSRYRPQYGYWVIGLQNTCEYNAFEDSSSSDPKVLTLFMAVLPVVLGFS" BASE COUNT 802 a 570 c 602 g 837 t ORIGIN 1 gaattcggca cgagctcttc tcccctgatt caagactcct ctgctttgga ctgaagcact 61 gcaggagttt gtgaccaaga acttcaagag tcaagacaga aggaagccaa gggagcagtg 121 caatggattt ctcagtaaag gtagacatag agaaggaggt gacctgcccc atctgcctgg 181 agctcctgac agaacctctg agcctagatt gtggccacag cttctgccaa gcctgcatca 241 ctgcaaagat caaggagtca gtgatcatct caagagggga aagcagctgt cctgtgtgtc 301 agaccagatt ccagcctggg aacctccgac ctaatcggca tctggccaac atagttgaga 361 gagtcaaaga ggtcaagatg agcccacagg aggggcagaa gagagatgtc tgtgagcacc 421 atggaaaaaa actccagatc ttctgtaagg aggatggaaa agtcatttgc tgggtttgtg 481 aactgtctca ggaacaccaa ggtcaccaaa cattccgcat aaacgaggtg gtcaaggaat 541 gtcaggaaaa gctgcaggta gccctgcaga ggctgataaa ggaggatcaa gaggctgaga 601 agctggaaga tgacatcaga caagagagaa ccgcctggaa gatcgagaga cagaagattc 661 tgaaagggtt caatgaaatg agagtcatct tggacaatga ggagcagaga gagctgcaaa 721 agctggagga aggtgaggtg aatgtgctgg acaacctggc agcagctaca gaccagctgg 781 tccagcagag gcaggatgcc agcacgctca tctcagatct ccagcggagg ttgacgggat 841 cgtcagtaga gatgctgcag gatgtgattg acgtcatgaa aaggagtgaa agctggacat 901 tgaagaagcc aaaatctgtt tccaagaaac taaagagtgt attccgagta ccagatctga 961 gtgggatgct gcaagttctt aaagagctga cagatgtcca gtactactgg gtggacgtga 1021 tgctgaatcc aggcagtgcc acttcgaatg ttgctatttc tgtggatcag agacaagtga 1081 aaactgtacg cacctgcaca tttaagaatt caaatccatg tgatttttct gcttttggtg 1141 tcttcggctg ccaatatttc tcttcgggga aatattactg ggaagtagat gtgtctggaa 1201 agattgcctg gatcctgggc gtacacagta aaataagtag tctgaataaa aggaagagct 1261 ctgggtttgc ttttgatcca agtgtaaatt attcaaaagt ttactccaga tatagacctc 1321 aatatggcta ctgggttata ggattacaga atacatgtga atataatgct tttgaggact 1381 cctcctcttc tgatcccaag gttttgactc tctttatggc tgtgctccct gtcgtattgg 1441 ggttttccta gactatgagg caggcattgt ctcatttttc aatgtcacaa accacggacg 1501 actcatctac aagttctctg gatgtcgctt ttctcgacct gcttatccgt atttcaatcc 1561 ttggaactgc ctagtcccca tgactgtgtg cccaccgagc tcctgagtgt tctcattcct 1621 ttacccactt ctgcatagta gcccttctgt gagactcaga ttctgcacct gagttcatct 1681 ctactgagac catctcttcc tttctttccc cttcttttac ttagaatgtc tttgtattca 1741 tttgctaggg cttccatagc aaagcatcat agattgctga tttaaactgt aattgtattg 1801 ccgtactgtg ggctgaaatc ccaaatctag attccagcag agttggttct ttctgaggtc 1861 tgcaaggaag ggctctgttc catgcctctc tccttggctt gtagaaggca tcttgtccct 1921 atgactcttc acattgtctt tatgtacatc tctgtgccca agttttccct ttttattaag 1981 acaccagtca tactggcctc agggcccacc gctaatgcct taatgaaatc attttaacat 2041 tatattgtgt acaaagacct tatttccaaa taagataata tttggaggta ttgggaataa 2101 aatttgagga aggcgatttc actcataaca atcttaccct ttcttgcaag agatgcttgt 2161 acattatttt cctaatacct tggtttcact agtagtaaac attattattt tttttatatt 2221 tgcaaaggaa acatatctaa tccttcctat agaaagaaca gtattgctgt aattcctttt 2281 cttttcttcc tcatttcctc tgccccttaa aagattgaag aaagagaaac ttgtcaactc 2341 atatccacgt tatctagcaa agtcataaga atctatcact aagtaatgta tccttcagaa 2401 tgtgttggtt taccagtgac accccatatt catcacaaaa ttaaagcaag aagtccatag 2461 taatttattt gctaatagtg gatttttaat gctcagagtt tctgaggtca aattttatct 2521 tttcacttac aagctctatg atcttaaata atttacttaa tgtattttgg tgtattttcc 2581 tcaaattaat attggtgttc aagactatat ctaattcctc tgatcacttt gagaaacaaa 2641 cttttattaa atgtaaggca cttttctatg aattttaaat ataaaaataa atattgttct 2701 gattattact gaaaagatgt cagccatttc aatgtcttgg gaaacaattt tttgtttttg 2761 ttctgttttc tttttgcttc aataaaacaa tagctggctc taaaaaaaaa a // LOCUS HSSTEACOA 1470 bp RNA PRI 09-JUN-1997 DEFINITION Homo sapiens mRNA for stearoyl-CoA desaturase. ACCESSION Y13647 NID g2190403 KEYWORDS stearoyl-CoA desaturase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1470) AUTHORS Al Jeryan,L., McCord,A., Pierotti,A.R. and Craft,J.A. TITLE Characterization and expression of a stearoyl CoA desaturase from human liver JOURNAL Unpublished REFERENCE 2 (bases 1 to 1470) AUTHORS Craft,J.A. TITLE Direct Submission JOURNAL Submitted (06-JUN-1997) J.A. Craft, Glasgow Caledonian University, Biological Sciences, Cowcaddens Road, Glasgow, G4 0BA, UK FEATURES Location/Qualifiers source 1..1470 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambdaZAP from Stratagene" /dev_stage="adult" /tissue_type="liver" CDS 209..1288 /codon_start=1 /product="stearoyl CoA desaturase" /db_xref="PID:e321537" /db_xref="PID:g2190404" /translation="MPAHLLQDDISSSYTTTTTITAPPPGVLQNGGDKLETMPLYLED DIRPDIKDDIYDPTYKDKEGPSPKVEYVWRNIILMSLLHLGALYGITLIPTCKFYTWL WGVFYYFVSALGITAGAHRLWSHRSYKARLPLRLFLIIANTMAFQNDVYEWARDHRAH HKFSETHADPHNSRRGFFFSHVGWLLVRKHPAVKEKGSTLDLSDLEAEKLVMFQRRYY KPGLLMMCFILPTLVPWYFWGETFQNSVFVATFLRYAVVLNATWLVNSAAHLFGYRPY DKNISPRENILVSLGAVGEGFHNYHHSFPYDYSASEYRWHINFNTFFIDWMAALGLTY DRKKVSKAAILARIKRTGDGNYKSG" BASE COUNT 367 a 402 c 345 g 356 t ORIGIN 1 gacggtcacc cgttgccagc tctagccttt aaattcccgg ctcggggacc tccacgcacc 61 gcggctagcg ccgacaacca gctagcgtgc aaggcgccgc ggctcagcgc gtaccggcgg 121 gtttcgaaac cgcagtcctc cggcgacccc gaactccgct ccggagcctc agccccctgg 181 aaagtgatcc cggcatcgga gagccaagat gccggcccac ttgctgcagg acgatatctc 241 tagctcctat accaccacca ccaccattac agcgcctcct ccaggggtcc tgcagaatgg 301 aggagataag ttggagacga tgcccctcta cttggaagac gacattcgcc ctgatataaa 361 agatgatata tatgacccca cctacaagga taaggaaggc ccaagcccca aggttgaata 421 tgtctggaga aacatcatcc ttatgtctct gctacacttg ggagccctgt atgggatcac 481 tttgattcct acctgcaagt tctacacctg gctttggggg gtattctact attttgtcag 541 tgccctgggc ataacagcag gagctcatcg tctgtggagc caccgctctt acaaagctcg 601 gctgccccta cggctctttc tgatcattgc caacacaatg gcattccaga atgatgtcta 661 tgaatgggct cgtgaccacc gtgcccacca caagttttca gaaacacatg ctgatcctca 721 taattcccga cgtggctttt tcttctctca cgtgggttgg ctgcttgtgc gcaaacaccc 781 agctgtcaaa gagaagggga gtacgctaga cttgtctgac ctagaagctg agaaactggt 841 gatgttccag aggaggtact acaaacctgg cttgctgatg atgtgcttca tcctgcccac 901 gcttgtgccc tggtatttct ggggtgaaac ttttcaaaac agtgtgttcg ttgccacttt 961 cttgcgatat gctgtggtgc ttaatgccac ctggctggtg aacagtgctg cccacctctt 1021 cggatatcgt ccttatgaca agaacattag cccccgggag aatatcctgg tttcacttgg 1081 agctgtgggt gagggcttcc acaactacca ccactccttt ccctatgact actctgccag 1141 tgagtaccgc tggcacatca acttcaacac attcttcatt gattggatgg ccgccctcgg 1201 tctgacctat gaccggaaga aagtctccaa ggccgccatc ttggccagga ttaaaagaac 1261 cggagatgga aactacaaga gtggctgagt ttggggtccc tcaggttcct ttttcaaaaa 1321 ccagccaggc agaggtttta atgtctgttt attaactact gaataatgct accaggatgc 1381 taaagatgat gatgttaacc cattccagta cagtattctt ttaaaattca aaagtattga 1441 aagccaaaaa aaaaaaaaaa aaaaaaaaaa // LOCUS HSSTHOR 2402 bp RNA PRI 03-APR-1997 DEFINITION Human mRNA for steroid hormone receptor hERR1. ACCESSION X51416 Y00290 NID g36608 KEYWORDS hormone receptor; receptor; steroid hormone receptor; transmembrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2402) AUTHORS Giguere,V., Yang,N., Segui,P. and Evans,R.M. TITLE Identification of a new class of steroid hormone receptors JOURNAL Nature 331 (6151), 91-94 (1988) MEDLINE 88122546 FEATURES Location/Qualifiers source 1..2402 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="kidney" /clone_lib="lambda gt10" /clone="lambda hKA1 and hKE4" CDS 64..1629 /note="hormone receptor hERR1 (AA 1-521)" /codon_start=1 /db_xref="PID:g36609" /db_xref="SWISS-PROT:P11474" /translation="MGLEMSSKDSPGSLDGRAWEDAQKPQSAWCGGRKTRVYATSSRR APPSEGTRRGGAARPEEAAEEGPPAAPGSLRHSGPLGPHACPTALPEPQVTSAMSSQV VGIEPLYIKAEPASPDSPKGSSETETEPPVALAPGPAPTRCLPGHKEEEDGEGAGPGE QGGGKLVLSSLPKRLCLVCGDVASGYHYGVASCEACKAFFKRTIQGSIEYSCPASNEC EITKRRRKACQACRFTKCLRVGMLKEGVRLDRVRGGRQKYKRRPEVDPLPFPGPFPAG PLAVAGGPRKTAAPVNALVSHLLVVEPEKLYAMPDPAGPDGHLPAVATLCDLFDREIV VTISWAKSIPGFSSLSLSDQMSVLQSVWMEVLVLGVAQRSLPLQDELAFAEDLVLDEE GARAAGLGELGAALLQLVRRLQALRLEREEYVLLKALALANSDSVHIEDEPRLWSSCE KLLHEALLEYEAGRAGPGGGAERRRAGRLLLTLPLLRQTAGKVLAHFYGVKLEGKVPM HKLFLEMLEAMMD" polyA_site 2402 /note="polyA site" BASE COUNT 453 a 701 c 822 g 426 t ORIGIN 1 agctcacagc aagtccaggc tagaggtaga aacgtgagag ccccacggct ggggaagatt 61 gccatgggat tggagatgag ctccaaggac agccctggca gtctggatgg aagagcttgg 121 gaagatgctc agaaaccaca aagtgcctgg tgcggtggga ggaaaaccag agtgtatgct 181 acaagcagcc ggcgggcgcc gccgagtgag gggacgcggc gcggtggggc ggcgcggccc 241 gaggaggcgg cggaggaggg gccgcccgcg gcccccggct cactccggca ctccgggccg 301 ctcggccccc atgcctgccc gaccgcgctg ccggagcccc aggtgaccag cgccatgtcc 361 agccaggtgg tgggcattga gcctctctac atcaaggcag agccggccag ccctgacagt 421 ccaaagggtt cctcggagac agagaccgag cctcctgtgg ccctggcccc tggtccagct 481 cccactcgct gcctcccagg ccacaaggaa gaggaggatg gggagggggc tgggcctggc 541 gagcagggcg gtgggaagct ggtgctcagc tccctgccca agcgcctctg cctggtctgt 601 ggggacgtgg cctccggcta ccactatggt gtggcatcct gtgaggcctg caaagccttc 661 ttcaagagga ccatccaggg gagcatcgag tacagctgtc cggcctccaa cgagtgtgag 721 atcaccaagc ggagacgcaa ggcctgccag gcctgccgct tcaccaagtg cctgcgggtg 781 ggcatgctca aggagggagt gcgcctggac cgcgtccggg gtgggcggca gaagtacaag 841 cggcggccgg aggtggaccc actgcccttc ccgggcccct tccctgctgg gcccctggca 901 gtcgctggag gcccccggaa gacagcagcc ccagtgaatg cactggtgtc tcatctgctg 961 gtggttgagc ctgagaagct ctatgccatg cctgaccccg caggccctga tgggcacctc 1021 ccagccgtgg ctaccctctg tgacctcttt gaccgagaga ttgtggtcac catcagctgg 1081 gccaagagca tcccaggctt ctcatcgctg tcgctgtctg accagatgtc agtactgcag 1141 agcgtgtgga tggaggtgct ggtgctgggt gtggcccagc gctcactgcc actgcaggat 1201 gagctggcct tcgctgagga cttagtcctg gatgaagagg gggcacgggc agctggcctg 1261 ggggaactgg gggctgccct gctgcaacta gtgcggcggc tgcaggccct gcggctggag 1321 cgagaggagt atgttctact aaaggccttg gcccttgcca attcagactc tgtgcacatc 1381 gaagatgagc cgaggctgtg gagcagctgc gagaagctcc tgcacgaggc cctgctggag 1441 tatgaagccg gccgggctgg ccccggaggg ggtgctgagc ggcggcgggc gggcaggctg 1501 ctgctcacgc taccgctcct ccgccagaca gcgggcaaag tgctggccca tttctatggg 1561 gtgaagctgg agggcaaggt gcccatgcac aagctgttct tggagatgct cgaggccatg 1621 atggactgag gcaaggggtg ggactggtgg gggttctggc aggacctgcc tagcatgggg 1681 tcagccccaa gggctggggc ggagctgggg tctgggcagt gcacagcctg ctggcagggc 1741 cagggctaat gccatcagcc cctgggaaca ggccccacgc cctctcctcc ccctcctagg 1801 gggtgtcaga agctgggaac gtgtgtccag gctctgggca cagtgctgcc ccttgcaagc 1861 cataacggtg cccccagagt gtagggggcc ttgcggaagc catagggggc tgcacgggat 1921 gcgtgggagg cagaaaccta tctcagggag ggaaggggat ggaggccaga gtctcccagt 1981 gggtgatgct tttgctgctg cttaatccta ccccctcttc aaagcagagt gggacttgga 2041 gagcaaaggc ccatgccccc ttcgctcctc ctctcatcat ttgcattggg cattagtgtc 2101 cccccttgaa gcaataactc caagcagact ccagcccctg gacccctggg gtggccaggg 2161 cttccccatc agctcccaac gagcctcctc agggggtagg agagcactgc ctctatgccc 2221 tgcagagcaa taacactata tttatttttg ggtttggcca gggaggcgca gggacatggg 2281 gcaagccagg gcccagagcc cttggctgta cagagactct attttaatgt atatttgctg 2341 caaagagaaa ccgcttttgg ttttaaacct ttaatgagaa aaaaatatat aataccgagc 2401 tc // LOCUS HSSTHOR2 2153 bp RNA PRI 03-APR-1997 DEFINITION Human mRNA for steroid hormone receptor hERR2. ACCESSION X51417 Y00290 NID g36610 KEYWORDS hormone receptor; receptor; steroid hormone receptor; transmembrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2153) AUTHORS Giguere,V., Yang,N., Segui,P. and Evans,R.M. TITLE Identification of a new class of steroid hormone receptors JOURNAL Nature 331 (6151), 91-94 (1988) MEDLINE 88122546 FEATURES Location/Qualifiers source 1..2153 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="heart" /clone_lib="lambda gt11" /clone="lambda hH3" CDS 100..1401 /note="hormone receptor hERR2 (AA 1-443)" /codon_start=1 /db_xref="PID:g36611" /db_xref="SWISS-PROT:P11475" /translation="MSSEDRHLGSSCGSFIKTEPSSPSSGIDALSHHSPSGSSDASGG FGMALGTHANGLDSPPMFAGAGLGGNPCRKSYEDCTSGIMEDSAIKCEYMLNAIPKRL CLVCGDIASGYHYGVASCEACKAFFKRTIQGNIEYSCPATNECEITKRRRKSCQACRF MKCLKVGMLKEGVRLDRVRGGRQKYKRRLDSENSPYLSLQISPPAKKPLTKIVSYLLV AEPDKLYAMPPDDVPEGDIKALTTLCDLADRELVFLISWAKHIPGFSNLTLGDQMSLL QSAWMEILILGIVYRSLPYDDKLAYAEDYIMDEEHSRLVGLLELYRAILQLVRRYKKL KVEKEEFVMLKALALANSDSMYIENLEAVQKLQDLLHEALQDYELSQRHEEPRRAGKL LLTLPLLRQTAAKAVQHFYSVKLQGKVPMHKLFLEMLEAKV" BASE COUNT 445 a 626 c 625 g 457 t ORIGIN 1 ctcctccaac tgggaatgct aaaacgggac tgatggacgt gtccgaactc tgcatcccgg 61 accccctcgg ctaccacaac cagtaggttg ctgaaccgaa tgtcgtccga agacaggcac 121 ctgggctcta gctgcggctc cttcatcaag acggagccat ctagcccatc ctcgggcatt 181 gatgccctca gccaccacag ccccagcggc tcgtcggacg ccagcggtgg ctttggcatg 241 gccctgggca cccacgccaa cggtctggac tctccgccta tgttcgcagg tgcggggctg 301 ggaggcaacc cgtgtcgcaa gagctacgag gactgtacta gcggtatcat ggaggactcg 361 gccatcaagt gcgagtacat gcttaacgcc atccccaagc gcctgtgcct cgtgtgcggg 421 gacattgctt ctggctacca ctatggagtg gcctcctgcg aggcttgcaa ggcgttcttc 481 aagagaacca ttcaaggaaa catcgaatac agctgccctg ccaccaacga gtgtgagatc 541 accaaacgga ggcgcaagtc ctgtcaggcc tgccggttca tgaaatgcct caaagtgggg 601 atgctgaagg aaggcgtgcg ccttgaccgg gtgcgaggag gccgccagaa gtacaagaga 661 cggctggatt cggagaacag cccctacctg agcttacaga tttccccgcc tgctaaaaag 721 ccattgacta agattgtctc gtatctactg gtggccgagc cggacaagct gtacgctatg 781 cctcccgacg atgtgcctga aggggatatc aaggccctga ccactctctg tgacttggca 841 gatcgggagc ttgtgttcct cattagctgg gccaagcaca tcccaggttt ctccaacctg 901 acactcgggg accagatgag cctgctgcag agtgcctgga tggagatcct catcctgggc 961 atcgtgtacc gctcgcttcc ctatgatgac aagctggcat acgcggagga ctatatcatg 1021 gatgaggaac actctcgcct ggtggggctg ctggagcttt accgagccat cttgcagctc 1081 gtacgcaggt acaagaagct caaggtggag aaggaagagt ttgtgatgct caaagccctg 1141 gcccttgcca actcagattc aatgtacatc gagaacctgg aggctgtgca gaagcttcag 1201 gacctgctgc atgaggcgct gcaggactat gagctgagcc agcgccatga ggagccacgg 1261 agggcgggca agctgctgtt gacactgccc ctgctgcggc agacggcagc caaagccgtc 1321 cagcacttct acagtgtgaa actgcagggc aaggtgccca tgcacaaact cttcctggag 1381 atgctggagg ccaaggtgtg atggccccgc atgcagacgg atggacacga tccacatgga 1441 gacttccacg gccaccagcc tcgactttct cacacctgca tcggggctct gagctgtccc 1501 agaagaaggg gtttcttgct tcctggccat gtgcagactc ctggggggca gcagatgggg 1561 agatggggat gggagggtgg gggcgggggg ctcatctgtc acccgaattt tctttggtat 1621 tttttttttt ccttctccat gggcagtgct aaggcttggg ccggggctga cttcccttag 1681 ggctggagac cacgggagga agcatccctt cctgcaaggg atccatttct ggaccactcc 1741 atatttagga cctggaggta cctggatggg cagggcttag tgcccagggc ccaagagact 1801 tagattgggt gctcctgaag gtgttggtat cacagagggc aggcccttgg aacaggaggt 1861 ctctgtggcc tctcctgggg ctctgtgcct cctcagtcta gctgtctccc tccccttccc 1921 cctttcttgt cctagtacat ccagctctca gtggatgctc ctgctagagt agccacatcc 1981 ccaccactaa gaggcccctc ccctgcttcc tgcccctacc tcagccagct gaggtaactc 2041 caggacatgc acctgggaac tcgctggctc agaaaagagt tgggtcctat acccaccctt 2101 gcctgttgtt tctcctaatc ctcttgggca tggcgagtct agaaacctat gga // LOCUS HSSTHPKA 1161 bp RNA PRI 05-JAN-1993 DEFINITION H.sapiens mRNA cdk3 for serine/threonine protein kinase. ACCESSION X66357 NID g36612 KEYWORDS cdk3 gene; serine/threonine protein kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1161) AUTHORS Meyerson,M.L. TITLE Direct Submission JOURNAL Submitted (12-MAY-1992) M.L. Meyerson, Massachusetts General Hospital, Cancer Center, Bldg 149, 13th Street, Charleston MA 02129, USA REFERENCE 2 (bases 1 to 1161) AUTHORS Meyerson,M., Enders,G.H., Wu,C.L., Su,L.K., Gorka,C., Nelson,C., Harlow,E. and Tsai,L.H. TITLE A family of human cdc2-related protein kinases JOURNAL EMBO J. 11 (8), 2909-2917 (1992) MEDLINE 92347325 FEATURES Location/Qualifiers source 1..1161 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="brain" /clone_lib="fetal brain lambda ZAP II" gene 89..1006 /gene="cdk3" CDS 89..1006 /gene="cdk3" /codon_start=1 /product="serine/threonine protein kinase" /db_xref="PID:g36613" /db_xref="SWISS-PROT:Q00526" /translation="MDMFQKVEKIGEGTYGVVYKAKNRETGQLVALKKIRLDLEMEGV PSTAIREISLLKELKHPNIVRLLDVVHNERKLYLVFEFLSQDLKKYMDSTPGSELPLH LIKSYLFQLLQGVSFCHSHRVIHRDLKPQNLLINELGAIKLADFGLARAFGVPLRTYT HEVVTLWYRAPEILLGSKFYTTAVDIWSIGCIFAEMVTRKALFPGDSEIDQLFRIFRM LGTPSEDTWPGVTQLPDYKGSFPKWTRKGLEEIVPNLEPEGRDLLMQLLQYDPSQRIT AKTALAHPYFSSPEPSPAARQYVLQRFRH" BASE COUNT 256 a 338 c 334 g 233 t ORIGIN 1 ccacatggaa gctggaggag caaccgggag cgctgggctg gggtgcaaat tgcccagtgc 61 cttctgtttc ccaggcagct ctgtggccat ggatatgttc cagaaggtag agaagatcgg 121 agagggcacc tatggggtgg tgtacaaggc caagaacagg gagacagggc agctggtggc 181 cctgaagaag atcagactgg atttggagat ggagggggtc ccaagcactg ccatcaggga 241 gatctcgctg ctcaaggaac tgaagcaccc caacatcgtc cgactgctgg acgtggtgca 301 caacgagagg aagctctatc tggtgtttga gttcctcagc caggacctga agaagtacat 361 ggactccacc ccaggctcag agctccccct gcacctcatc aagagctacc tcttccagct 421 gctgcagggg gtgagtttct gccactcaca tcgggtcatc caccgagacc tgaagcccca 481 gaacctgctc atcaatgagt tgggtgccat caagctggct gacttcggcc tggctcgcgc 541 cttcggggtg cccctgcgca cctacaccca tgaggtggtg acactgtggt atcgcgcccc 601 cgagattctc ttgggcagca agttctatac cacagctgtg gatatctgga gcattggttg 661 catctttgca gagatggtga ctcgaaaagc cctgtttcct ggtgactctg agattgacca 721 gctctttcgt atctttcgta tgctggggac acccagcgaa gacacatggc ccggggtcac 781 ccagctgcct gactataagg gcagcttccc taagtggacc aggaagggac tggaagagat 841 tgtgcccaat ctggagccag agggcaggga cctgctcatg caactcctgc agtatgaccc 901 cagccagcgg atcacagcca agactgccct ggcccacccg tacttctcat cccctgagcc 961 ctccccagct gcccgccagt atgtgctgca gcgattccgc cattgagaat gtcaaggcca 1021 cactcagatc ctttctcgag cagcagctgc tgccccagct gcctcctacc cattgccaag 1081 agaggatgca tctggggaga gcaaagcact aaggaattca gcatcagcct gcagagggct 1141 gagtctgggt tagtcctgcc c // LOCUS HSSTHPKB 1363 bp RNA PRI 06-FEB-1997 DEFINITION H.sapiens mRNA KKIALRE for serine/threonine protein kinase. ACCESSION X66358 NID g36614 KEYWORDS serine/threonine protein kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1363) AUTHORS Meyerson,M.L. TITLE Direct Submission JOURNAL Submitted (12-MAY-1992) M.L. Meyerson, Massachusetts General Hospital, Cancer Center, Bldg 149, 13th Street, Charleston MA 02129, USA REFERENCE 2 (bases 1 to 1363) AUTHORS Meyerson,M., Enders,G.H., Wu,C.L., Su,L.K., Gorka,C., Nelson,C., Harlow,E. and Tsai,L.H. TITLE A family of human cdc2-related protein kinases JOURNAL EMBO J. 11 (8), 2909-2917 (1992) MEDLINE 92347325 FEATURES Location/Qualifiers source 1..1363 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="HeLa" /cell_line="HeLa" /clone_lib="HeLa" CDS join(214..669,X66359:1..54,670..1290) /codon_start=1 /exception="KKIALRE coding sequence with 54 bp insertion" /label=sthpkin_CDS /pseudo /product="serine/threonine protein kinase" /db_xref="PID:e49306" CDS 214..1290 /codon_start=1 /product="serine/threonine protein kinase" /db_xref="PID:g36615" /db_xref="SWISS-PROT:Q00532" /translation="MMEKYEKIGKIGEGSYGVVFKCRNRDTGQIVAIKKFLESEDDPV IKKIALREIRMLKQLKHPNLVNLLEVFRRKRRLHLVFEYCDHTVLHELDRYQRGVPEH LVKSITWQTLQAVNFCHKHNCIHRDVKPENILITKHSVIKLCDFGFARLLTGPSDYYT DYVATRWYRSPELLVGDTQYGPPVDVWAIGCVFAELLSGVPLWPGKSDVDQLYLIRKT LGDLIPRHQQVFSTNQYFSGVKIPDPEDMEPLELKFPNISYPALGLLKGCLHMDPTER LTCEQLLHHPYFENIREIEDLAKEHDKPTRKTLRKSRKHHCFTETSKLQYLPQLTGSS ILPALDNKKYYCDTKKLNYRFPNI" BASE COUNT 412 a 308 c 309 g 334 t ORIGIN 1 tgttttactc catttcccat cataaggccc attacaggca gctttgcagc cttcttcaaa 61 ttcctcatgc cgcagaacct tcatccccag gacgtcccga tagaaacgcg ccgtctggaa 121 gcggtttccc actttgaata cgaagtgcag acgtctgcga gcagccatga ttcccaggct 181 taagtgatcc tttttaagaa gatttattcc tctatgatgg agaagtatga aaaaattggg 241 aaaattggag aaggatccta tggagttgtt ttcaaatgta gaaacaggga cacgggtcag 301 attgtggcca tcaagaagtt tctggaatca gaagatgacc ctgtcataaa gaaaattgcc 361 cttcgggaaa tccgaatgct caagcaactc aagcatccca accttgttaa cctcctggaa 421 gtcttcagga ggaaacggag gcttcacctg gtgtttgaat attgtgacca cacagttctc 481 catgagttgg acagatacca aagaggggta ccagaacatc tcgtgaagag cataacttgg 541 cagacactgc aagctgtaaa tttttgccat aaacacaatt gcatacatag agacgtgaag 601 ccagaaaata tcctcatcac gaaacattcc gtgattaagc tttgtgactt tggatttgct 661 cggcttttga ctggaccgag tgactactat acagactacg tggctaccag gtggtaccgc 721 tcccctgagc tgctggtggg ggacacgcag tacggccccc cggtggatgt ttgggcaatt 781 ggctgtgtct ttgctgagct gctgtcagga gtgcctctgt ggccaggaaa atcggatgtg 841 gatcagctgt atctgattag gaagaccttg ggggatctca ttcctaggca ccagcaagtg 901 tttagcacga atcagtactt cagtggagtg aaaattccag accctgaaga tatggaacca 961 cttgaattaa aattcccaaa catctcttat cctgccctgg ggctcctaaa gggctgtctc 1021 cacatggacc ctactgaaag gctgacatgt gaacagctgt tgcatcaccc atattttgaa 1081 aacatcagag aaatagagga tttggcaaaa gaacacgaca aaccaacaag gaagacccta 1141 agaaagagcc gaaagcacca ctgctttaca gaaacatcca agttgcagta cctaccccag 1201 ctaactggca gcagcatcct tccagctttg gataataaga agtactactg tgataccaag 1261 aaacttaact accgttttcc aaacatttaa aggagctaag gagagatgat tttaaaaaag 1321 gaatcaatag atgctttgaa gaaaataaaa cttatacagt tca // LOCUS HSSTHPKC 1738 bp RNA PRI 05-JAN-1993 DEFINITION H.sapiens mRNA PCTAIRE-2 for serine/threonine protein kinase. ACCESSION X66360 NID g36616 KEYWORDS serine/threonine protein kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1738) AUTHORS Meyerson,M.L. TITLE Direct Submission JOURNAL Submitted (12-MAY-1992) M.L. Meyerson, Massachusetts General Hospital, Cancer Center, Bldg 149, 13th Street, Charleston MA 02129, USA REFERENCE 2 (bases 1 to 1738) AUTHORS Meyerson,M., Enders,G.H., Wu,C.L., Su,L.K., Gorka,C., Nelson,C., Harlow,E. and Tsai,L.H. TITLE A family of human cdc2-related protein kinases JOURNAL EMBO J. 11 (8), 2909-2917 (1992) MEDLINE 92347325 FEATURES Location/Qualifiers source 1..1738 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="brain" /cell_type="HeLa and Nalm-6 (cDNA)" /clone_lib="lambda Zap II (human fetal brain-Stratagene)" CDS 70..1641 /codon_start=1 /product="serine/threonine protein kinase" /db_xref="PID:g36617" /db_xref="SWISS-PROT:Q00537" /translation="MKKFKRRLSLTLRGSQTIDESLSELAEQMTIEENSSKDNEPIVK NGRPPTSHSMHSFLHQYTGSFKKPPLRRPHSVIGGSLGSFMAMPRNGSRLDIVHENLK MGSDGESDQASGTSSDEVQSPTGVCLRNRIHRRISMEDLNKRLSLPADIRIPDGYLEK LQINSPPFDQPMSRRSRRASLSEIGFGKMETYIKLEKLGEGTYATVYKGRSKLTENLV ALKEIRLEHEEGAPCTAIREVSLLKDLKHANIVTLHDIVHTDKSLTLVFEYLDKDLKQ YMDDCGNIMSMHNVKLFLYQILRGLAYCHRRKVLHRDLKPQNLLINEKGELKLADFGL ARAKSVPTKTYSNEVVTLWYRPPDVLLGSSEYLTQIDMWGVGCIFFEMASGRPLFPGS TVEDELHLIFRLLGTPSQETWPGISSNEEFKNYNFPKYKPEPLINHAPRLDSEGIELI RKFLQYESKKRVSAEEAMKHVYFRSLGPRIHALPESVSIFSLKEIQLQKDPGFRNSSY PETGHGKNRRQSMLF" BASE COUNT 560 a 346 c 379 g 453 t ORIGIN 1 gaattcctcg cctcgggacc cgcggtcccc gctcttgctg gatttttcaa gccacattca 61 attgatagga tgaaaaaatt taagagaagg ctatccctca cactccgagg aagtcagact 121 attgatgaat cattgtctga attggctgaa caaatgacta ttgaagaaaa cagcagcaag 181 gataatgagc ctattgtgaa gaatggcagg cctccaacgt ctcacagtat gcattccttc 241 ctccaccagt acacaggatc tttcaagaag cccccattgc ggagaccaca cagtgttatt 301 ggagggagcc ttggctcctt catggcaatg cccagaaatg gaagcagatt agatattgtt 361 catgaaaatc taaaaatggg atcagatggt gagagtgacc aagcttctgg gacatcatct 421 gatgaagtcc agtcacctac aggtgtttgt ctcagaaatc gtatacatag acggatctca 481 atggaggatt taaataagcg gttatcactg cctgcagaca tcagaatacc tgatggatat 541 cttgaaaagt tgcagataaa cagtccacca tttgaccaac caatgagtcg aaggtctcgt 601 agagcttcct tatcagaaat tggctttgga aaaatggaaa cctacatcaa attggaaaag 661 cttggagagg gtacatatgc aacagtatat aaaggaagaa gtaaattgac agagaatttg 721 gtggcattaa aagagatccg attggaacat gaagaaggtg caccctgcac agctataaga 781 gaagtttcac tattaaagga tttaaaacat gcaaatatag taaccttaca tgacattgtt 841 cacacagata aatccttgac tttggtgttt gagtatctgg ataaagacct gaaacagtac 901 atggatgact gtggaaacat catgagtatg cacaacgtaa agctgtttct gtaccaaatt 961 ctacgtggtt tggcatattg ccatagaaga aaggtattgc atcgagactt gaaaccacag 1021 aacctcctca ttaatgagaa aggagaatta aagctagcag attttggact agcccgagcc 1081 aagtcagttc ccacaaagac ctactcaaat gaagttgtca cactatggta ccggccacct 1141 gatgtgcttc ttggttcctc ggagtactta acacagattg acatgtgggg tgttggttgc 1201 attttctttg aaatggcttc tggaagacct ttatttccag gatcaaccgt ggaagatgaa 1261 ctgcacttaa ttttccgact gctaggaact ccatctcagg aaacttggcc aggtatttct 1321 tcaaatgagg agttcaagaa ctacaacttt ccaaaatata aaccagagcc tctaattaac 1381 cacgcaccca ggttagactc tgaaggaatt gagttgataa gaaaatttct tcagtatgaa 1441 tctaagaaaa gggtttcagc tgaagaggcc atgaaacatg tgtactttcg aagtctggga 1501 ccaagaatac atgctttacc agaaagtgta tcaatattca gtttgaaaga gattcagttg 1561 caaaaggacc cgggttttcg aaattcttct tatccagaga caggacatgg gaagaacaga 1621 agacagagca tgctctttta agtctgataa catggtttca agcccagccc ccagcctttc 1681 ttaccaatca aggactcaga actgaaggca attatttctt ttggtggact tggaatct // LOCUS HSSTHPKD 1745 bp RNA PRI 17-FEB-1997 DEFINITION H.sapiens mRNA PCTAIRE-1 for serine/threonine protein kinase. ACCESSION X66363 NID g36618 KEYWORDS serine/threonine protein kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1745) AUTHORS Meyerson,M.L. TITLE Direct Submission JOURNAL Submitted (12-MAY-1992) M.L. Meyerson, Massachusetts General Hospital, Cancer Center, Bldg 149, 13th Street, Charleston MA 02129, USA REFERENCE 2 (bases 1 to 1745) AUTHORS Meyerson,M., Enders,G.H., Wu,C.L., Su,L.K., Gorka,C., Nelson,C., Harlow,E. and Tsai,L.H. TITLE A family of human cdc2-related protein kinases JOURNAL EMBO J. 11 (8), 2909-2917 (1992) MEDLINE 92347325 FEATURES Location/Qualifiers source 1..1745 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="brain" /cell_type="HeLa and NALM-6" /clone_lib="fetal brain lambda ZAP II library" CDS 124..1614 /codon_start=1 /product="serine/threonine protein kinase" /db_xref="PID:g36619" /db_xref="SWISS-PROT:Q00536" /translation="MDRMKKIKRQLSMTLRGGRGIDKTNGAPEQIGLDESGGGGGSDP GEAPTRAAPGELRSARGPLSSAPEIVHEDLKMGSDGESDQASATSSDEVQSPVRVRMR NHPPRKISTEDINKRLSLPADIRLPEGYLEKLTLNSPIFDKPLSRRLRRVSLSEIGFG KLETYIKLDKLGEGTYATVYKGKSKLTDNLVALKEIRLEHEEGAPCTAIREVSLLKDL KHANIVTLHDIIHTEKSLTLVFEYLDKDLKQYLDDCGNIINMHNVKLFLFQLLRGLAY CHRQKVLHRDLKPQNLLINERGELKLADFGLARAKSIPTKTYSNEVVTLWYRPPDILL GSTDYSTQIDMWGVGCIFYEMATGRPLFPGSTVEEQLHFIFRILGTPTEETWPGILSN EEFKTYNYPKYRAEALLSHAPRLDSDGADLLTKLLQFEGRNRISAEDAMKHPFFLSLG ERIHKLPDTTSIFALKEIQLQKEASLRSSSMPDSGRPAFRVVDTEF" unsure 1565..1713 /note="sequence deleted in one clone" /replace="" BASE COUNT 413 a 530 c 475 g 327 t ORIGIN 1 tggaagcagc gtaaaggatg gacaggaatg cagaggtagg caggaggacc agcagtgtga 61 ctgctgaaac ccaggggagg gccccgcggc tctgaggttg ctcgcgcgcc cccgccgatc 121 gccatggatc ggatgaagaa gatcaaacgg cagctgtcaa tgacactccg aggtggccga 181 ggcatagaca agaccaatgg tgcccctgag cagataggcc tggatgagag tggtggtggt 241 ggcggcagtg accctggaga ggcccccaca cgtgctgctc ctggggaact tcgttctgca 301 cggggcccac tcagctctgc accagagatt gtgcacgagg acttgaagat ggggtctgat 361 ggggagagtg accaggcttc agccacgtcc tcggatgagg tgcagtctcc agtgagagtg 421 cgtatgcgca accatccccc acgcaagatc tccactgagg acatcaacaa gcgcctatca 481 ctaccagctg acatccggct gcctgagggc tacctggaga agctgaccct caatagcccc 541 atctttgaca agcccctcag ccgccgcctc cgtcgtgtca gcctatctga gattggcttt 601 gggaaactgg agacctacat taagctggac aaactgggcg agggtaccta tgccaccgtc 661 tacaaaggca aaagcaagct cacagacaac cttgtggcac tcaaggagat cagactggaa 721 catgaagagg gggcaccctg caccgccatc cgggaagtgt ccctgctcaa ggacctcaaa 781 cacgccaaca tcgttacgct acatgacatt atccacacgg agaagtccct cacccttgtc 841 tttgagtacc tggacaagga cctgaagcag tacctggatg actgtgggaa catcatcaac 901 atgcacaacg tgaaactgtt cctgttccag ctgctccgtg gcctggccta ctgccaccgg 961 cagaaggtgc tacaccgaga cctcaagccc cagaacctgc tcatcaacga gaggggagag 1021 ctcaagctgg ctgactttgg cctggcccga gccaagtcaa tcccaacaaa gacatactcc 1081 aatgaggtgg tgacactgtg gtaccggccc cctgacatcc tgcttgggtc cacggactac 1141 tccactcaga ttgacatgtg gggtgtgggc tgcatcttct atgagatggc cacaggccgt 1201 cccctctttc cgggctccac ggtggaggaa cagctacact tcatcttccg tatcttagga 1261 accccaactg aggagacgtg gccaggcatc ctgtccaacg aggagttcaa gacatacaac 1321 taccccaagt accgagccga ggcccttttg agccacgcac cccgacttga tagcgacggg 1381 gccgacctcc tcaccaagct gttgcagttt gagggtcgaa atcggatctc cgcagaggat 1441 gccatgaaac atccattctt cctcagtctg ggggagcgga tccacaaact tcctgacact 1501 acttccatat ttgcactaaa ggagattcag ctacaaaagg aggccagcct tcggtcttcg 1561 tcgatgcctg actcaggcag gccagctttc cgcgtggtgg acaccgagtt ctaagccaca 1621 gaccgaggcc ccagcaggca gcggctggag ggatgccaca cccctcacag ggcagccccc 1681 aactacatct tccctgctta ctctctgcct acctgcctga gccatgttca cctgcccact 1741 tgtcc // LOCUS HSSTHPKE 983 bp RNA PRI 16-FEB-1993 DEFINITION H.sapiens mRNA PSSALRE for serine/threonine protein kinase. ACCESSION X66364 NID g36620 KEYWORDS serine/threonine protein kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 983) AUTHORS Meyerson,M.L. TITLE Direct Submission JOURNAL Submitted (12-MAY-1992) M.L. Meyerson, Massachusetts General Hospital, Cancer Center, Bldg 149, 13th Street, Charleston MA 02129, USA REMARK sequence revised by [3] REFERENCE 2 (bases 1 to 983) AUTHORS Meyerson,M., Enders,G.H., Wu,C.L., Su,L.K., Gorka,C., Nelson,C., Harlow,E. and Tsai,L.H. TITLE A family of human cdc2-related protein kinases JOURNAL EMBO J. 11 (8), 2909-2917 (1992) MEDLINE 92347325 REFERENCE 3 (bases 1 to 983) AUTHORS Meyerson,M.L. TITLE Direct Submission JOURNAL Submitted (12-FEB-1993) M.L. Meyerson, Massachusetts General Hospital, Cancer Center, Bldg 149, 13th Street, Charleston MA 02129, USA FEATURES Location/Qualifiers source 1..983 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="brain" /cell_type="HeLa and NALM-6" /clone_lib="fetal brain lambda ZAP II" CDS 25..903 /codon_start=1 /product="serine/threonine protein kinase" /db_xref="PID:g36621" /db_xref="SWISS-PROT:Q00535" /translation="MQKYEKLEKIGEGTYGTVFKAKNRETHEIVALKRVRLDDDDEGV PSSALREICLLKELKHKNIVRLHDVLHSDKKLTLVFEFCDQDLKKYFDSCNGDLDPEI VKSFLFQLLKGLGFCHSRNVLHRDLKPQNLLINRNGELKLADFGLARAFGIPVRCYSA EVVTLWYRPPDVLFGAKLYSTSIDMWSAGCIFAELANAGRPLFPGNDVDDQLKRIFRL LGTPTEEQWPSMTKLPDYKPYPMYPATTSLVNVVPKLNATGRDLLQNLLKCNPVQRIS AEEALQHPYFSDFCPP" BASE COUNT 219 a 275 c 279 g 210 t ORIGIN 1 cgcaggggtc ccccggccgc cgcgatgcag aaatacgaga aactggaaaa gattggggaa 61 ggcacctacg gaactgtgtt caaggccaaa aaccgggaga ctcatgagat cgtggctctg 121 aaacgggtga ggctggatga cgatgatgag ggtgtgccga gttccgccct ccgggagatc 181 tgcctactca aggagctgaa gcacaagaac atcgtcaggc ttcatgacgt cctgcacagc 241 gacaagaagc tgactttggt ttttgaattc tgtgaccagg acctgaagaa gtattttgac 301 agttgcaatg gtgacctcga tcctgagatt gtaaagtcat tcctcttcca gctactaaaa 361 gggctgggat tctgtcatag ccgcaatgtg ctacacaggg acctgaagcc ccagaacctg 421 ctaataaaca ggaatgggga gctgaaattg gctgattttg gcctggctcg agcctttggg 481 attcccgtcc gctgttactc agctgaggtg gtcacactgt ggtaccgccc accggatgtc 541 ctctttgggg ccaagctgta ctccacgtcc atcgacatgt ggtcagccgg ctgcatcttt 601 gcagagctgg ccaatgctgg gcggcctctt tttcccggca atgatgtcga tgaccagttg 661 aagaggatct tccgactgct ggggacgccc accgaggagc agtggccctc tatgaccaag 721 ctgccagact ataagcccta tccgatgtac ccggccacaa catccctggt gaacgtcgtg 781 cccaaactca atgccacagg gagggatctg ctgcagaacc ttctgaagtg taaccctgtc 841 cagcgtatct cagcagaaga ggccctgcag cacccctact tctccgactt ctgtccgccc 901 taggccccgg gacccccgcc tccaggctgg gcctggccta tttaagcccc ctcttgagag 961 ggtgagacag tgggggtgcc tgg // LOCUS HSSTHPKF 1249 bp RNA PRI 05-JAN-1993 DEFINITION H.sapiens mRNA PLSTIRE for serine/threonine protein kinase. ACCESSION X66365 NID g36622 KEYWORDS serine/threonine protein kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1249) AUTHORS Meyerson,M.L. TITLE Direct Submission JOURNAL Submitted (12-MAY-1992) M.L. Meyerson, Massachusetts General Hospital, Cancer Center, Bldg 149, 13th Street, Charleston MA 02129, USA REFERENCE 2 (bases 1 to 1249) AUTHORS Meyerson,M., Enders,G.H., Wu,C.L., Su,L.K., Gorka,C., Nelson,C., Harlow,E. and Tsai,L.H. TITLE A family of human cdc2-related protein kinases JOURNAL EMBO J. 11 (8), 2909-2917 (1992) MEDLINE 92347325 FEATURES Location/Qualifiers source 1..1249 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="HeLa" /cell_line="NALM-6" /clone_lib="NALM-6 lambda ZAP II" CDS 118..1098 /codon_start=1 /product="serine/threonine protein kinase" /db_xref="PID:g36623" /db_xref="SWISS-PROT:Q00534" /translation="MEKDGLCRADQQYECVAEIGEGAYGKVFKARDLKNGGRFVALKR VRVQTGEEGMPLSTIREVAVLRHLETFEHPNVVRLFDVCTVSRTDRETKLTLVFEHVD QDLTTYLDKVPEPGVPTETIKDMMFQLLRGLDFLHSHRVVHRDLKPQNILVTSSGQIK LADFGLARIYSFQMALTSVVVTLWYRAPEVLLQSSYATPVDLWSVGCIFAEMFRRKPL FRGSSDVDQLGKILDVIGLPGEEDWPRDVALPRQAFHSKSAQPIEKFVTDIDELGKDL LLKCLTFNPAKRISAYSALSHPYFQDLERCKENLDSHLPPSQNTSELNTA" BASE COUNT 297 a 341 c 345 g 266 t ORIGIN 1 gaattccgta aagctagacc gatctccggg gagccccgag taggcgagcg gcggccgagc 61 tagttgagcg caccccccgg gcgccccagc gcgccgcggc gggcgcgtcc aggcggcatg 121 gagaaggacg gcctgtgccg cgctgaccag cagtacgaat gcgtggcgga gatcggggag 181 ggcgcctatg ggaaggtgtt caaggcccgc gacttgaaga acggaggccg tttcgtggcg 241 ttgaagcgcg tgcgggtgca gaccggcgag gagggcatgc cgctctccac catccgcgag 301 gtggcggtgc tgaggcacct ggagaccttc gagcacccca acgtggtcag gttgtttgat 361 gtgtgcacag tgtcacgaac agacagagaa accaaactaa ctttagtgtt tgaacatgtc 421 gatcaagact tgaccactta cttggataaa gttccagagc ctggagtgcc cactgaaacc 481 ataaaggata tgatgtttca gcttctccga ggtctggact ttcttcattc acaccgagta 541 gtgcatcgcg atctaaaacc acagaacatt ctggtgacca gcagcggaca aataaaactc 601 gctgacttcg gccttgcccg catctatagt ttccagatgg ctctaacctc agtggtcgtc 661 acgctgtggt acagagcacc cgaagtcttg ctccagtcca gctacgccac ccccgtggat 721 ctctggagtg ttggctgcat atttgcagaa atgtttcgta gaaagcctct ttttcgtgga 781 agttcagatg ttgatcaact aggaaaaatc ttggacgtga ttggactccc aggagaagaa 841 gactggccta gagatgttgc ccttcccagg caggcttttc attcaaaatc tgcccaacca 901 attgagaagt ttgtaacaga tatcgatgaa ctaggcaaag acctacttct gaagtgtttg 961 acatttaacc cagccaaaag aatatctgcc tacagtgccc tgtctcaccc atacttccag 1021 gacctggaaa ggtgcaaaga aaacctggat tcccacctgc cgcccagcca gaacacctcg 1081 gagctgaata cagcctgagg cctcagcagc cgccttaagc tgatcctgcg gagaacaccc 1141 ttggtggctt atgggtcccc ctcagcaagc cctacagagc tgtggaggat tgctatctgg 1201 aggccttcca gctgctgtct tctggacagg ctctgcttct ccaaggaaa // LOCUS HSSTPK 1636 bp RNA PRI 14-FEB-1995 DEFINITION H.sapiens mRNA for serine/threonine protein kinase. ACCESSION X80229 NID g599826 KEYWORDS ser/thr protein kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1636) AUTHORS Chu,W. TITLE Direct Submission JOURNAL Submitted (02-SEP-1994) W. Chu, Hoffmann-La Roche, Dept of Inflammation/Autoimmune Diseases, 340 Kingsland Street, Nutley NJ 07110, USA REFERENCE 2 (bases 1 to 1636) AUTHORS Chu,W., Presky,D.H., Danho,W., Swerlick,R.A. and Burns,D.K. TITLE Identification and characterization of DBK, a novel putative serine/threonine protein kinase from human endothelial cells JOURNAL Eur. J. Biochem. 225 (2), 695-702 (1994) MEDLINE 95045520 FEATURES Location/Qualifiers source 1..1636 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="primary culture of dermal microvascular endothelial cells" mRNA 1..1636 CDS 96..1535 /codon_start=1 /product="serine/threonine protein kinase" /db_xref="PID:g599827" /translation="MEPQGCLVAEVTFRNPVIERIPRLRRQKKIFSKQQGKAFPRARQ MNIDVATWVRLLRRLIPNATGTGTFSPGGSPGSEARTTGDISVEKLNLGTDSDSSPQK SSRDPPSSPSSLSSPIQESTAPELPSETQETPGPALCSPLRKSPLTLEDFKFLAVLGR GHFGKVLLSEFRPSGELFAIKALKKGDIVARDEVESLMCEKRILAAVTSAGHPFLVNL FGCFQTPEHVCFVMEYSAGGDLILHIHSDVFSEPRAIFYSACVVLGLQFLHEHKIVYR DLKLDNLLLDTEGYVKIADFGLCKEGMGYGDRTSTFCGTPEFLAPEVLTDTSYTRAVD WWGLGVLLYEMLVGESPLPGDDEEEVFDSIVNDEVRYPRFLSAEAIGIMRRLLRRNPE RRLGSSERDAEDVKKQPFFRTLGWEALLARRLPPPFVPTLSGRTDISNFDEEFTGEAP TLSPPRDARPLTAAEQAAFLDFDFVAGGC" misc_feature 150..414 /note="encodes protein kinase domain" BASE COUNT 324 a 488 c 501 g 323 t ORIGIN 1 ttgatagctc tttctcgatt ccgtgggtgg tggtgcatgg ccgttcttag ttggtggatt 61 tcttggacaa tgagaggcat gaggtgcagc tggacatgga accccagggc tgcctggtgg 121 ctgaggtcac cttccgcaac cctgtcattg agaggattcc tcggctccga cggcagaaga 181 aaattttctc caagcagcaa gggaaggcct tcccacgtgc taggcagatg aacatcgatg 241 tcgccacgtg ggtgcggctg ctccggaggc tcatccccaa tgccacgggc acaggcacct 301 ttagccctgg gggttctcca ggatccgagg cccggaccac gggtgacata tcggtggaga 361 agctgaacct cggcactgac tcggacagct cacctcagaa gagctcgcgg gatcctcctt 421 ccagcccatc gagcctgagc tcccccatcc aggaatccac tgctcccgag ctgccttcgg 481 agacccagga gaccccaggc cccgccctgt gcagccctct gaggaagtca cctctgaccc 541 tcgaagattt caagttcctg gcggtgctgg gccggggtca ttttgggaag gtgctcctct 601 ccgaattccg gcccagtggg gagctgttcg ccatcaaggc tctgaagaaa ggggacattg 661 tggcccgaga cgaggtggag agcctgatgt gtgagaagcg gatattggcg gcagtgacca 721 gtgcgggaca ccccttcctg gtgaacctct tcggctgttt ccagacaccg gagcacgtgt 781 gcttcgtgat ggagtactcg gccggtgggg acctgatcct gcacatccac agcgacgtgt 841 tctctgagcc ccgtgccatc ttttattccg cctgcgtggt gctgggccta cagtttcttc 901 acgaacacaa gatcgtctac agggacctga agttggacaa tttgctcctg gacaccgagg 961 gctacgtcaa gatcgcagac tttggcctct gcaaggaggg gatgggctat ggggaccgga 1021 ccagcacatt ctgtgggacc ccggagttcc tggcccctga ggtgctgacg gacacgtcgt 1081 acacgcgagc tgtggactgg tggggactgg gtgtgctgct ctacgagatg ctggttggcg 1141 agtccccact cccaggggat gatgaggagg aggtcttcga cagcatcgtc aacgacgagg 1201 ttcgctaccc ccgcttcctg tcggccgaag ccatcggcat catgagaagg ctgcttcgga 1261 ggaacccaga gcggaggctg ggatctagcg agagagatgc agaagatgtg aagaaacagc 1321 ccttcttcag gactctgggc tgggaagccc tgttggcccg gcgcctgcca ccgccctttg 1381 tgcccacgct gtccggccgc accgacatca gcaacttcga cgaggagttc accggggagg 1441 cccccacact gagcccgccc cgcgacgcgc ggcccctcac agccgcggag caggcagcct 1501 tcctggactt cgacttcgtg gccgggggct gctagccccc tcccctgccc ctgcccctgc 1561 ccctgcccga gagctcttag tttttaaaaa ggcctttggg atttgccgga aaaaaaaaaa 1621 aaaaaaaaaa aaaaaa // LOCUS HSSTPKC2K 1758 bp RNA PRI 21-JUL-1995 DEFINITION H.sapiens mRNA (clone C-2k) mRNA for serine/threonine protein kinase. ACCESSION X80230 NID g599828 KEYWORDS ser/thr protein kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1758) AUTHORS Best,J.L., Presky,D.H., Swerlick,R.A., Burns,D.K. and Chu,W. TITLE Cloning of a full-length cDNA sequence encoding a cdc2-related protein kinase from human endothelial cells JOURNAL Biochem. Biophys. Res. Commun. 208 (2), 562-568 (1995) MEDLINE 95209665 REFERENCE 2 (bases 1 to 1758) AUTHORS Chu,W. TITLE Direct Submission JOURNAL Submitted (02-SEP-1994) W. Chu, Hoffmann-La Roche, Dept of Inflammation/Autoimmune Diseases, 340 Kingsland Street, Nutley NJ 07110, USA COMMENT Related sequence: L25676. FEATURES Location/Qualifiers source 1..1758 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="HDMEC cDNA library" /clone="C-2K" mRNA 1..1758 /evidence=experimental CDS 99..1217 /codon_start=1 /product="serine/threonine protein kinase" /db_xref="PID:g599829" /translation="MAKQYDSVECPFCDEVSKYEKLAKIGQGTFGEVFKARHRKTGQK VALKKVLMENEKEGFPITALREIKILQLLKHENVVNLIEICRTKASPYNRCKGSIYLV FDFCEHDLAGLLSNVLVKFTLSEIKRVMQMLLNGLYYIHRNKILHRDMKAANVLITRD GVLKLADFGLARAFSLAKNSQPNRYTNRVVTLWYRPPELLLGERDYGPPIDLWGAGCI MAEMWTRSPIMQANTEQHQLALISQLCGSITPEVWPNVDNYELYEKLELVKGQKRKVK DRLKAYVRDPYALDLIDKLLVLDPAQRIDSDDALNHDFFWSDPMPSDLKGMLSTHLTS MFEYLAPPRRKGSQITQQSTNQSRNPATTNQTEFERVF" polyA_signal 1732..1737 BASE COUNT 380 a 480 c 520 g 378 t ORIGIN 1 cgcccgccgg aggggcctgg agtgcggcgg cggcgggacc cggagcagga gcggcggcag 61 cagcgactgg gggcggcggc ggcgcgttgg aggcggccat ggcaaagcag tacgactcgg 121 tggagtgccc tttttgtgat gaagtttcca aatacgagaa gctcgccaag atcggccaag 181 gcaccttcgg ggaggtgttc aaggccaggc accgcaagac cggccagaag gtggctctga 241 agaaggtgct gatggaaaac gagaaggagg ggttccccat tacagccttg cgggagatca 301 agatccttca gcttctaaaa cacgagaatg tggtcaactt gattgagatt tgtcgaacca 361 aagcttcccc ctataaccgc tgcaagggta gtatatacct ggtgttcgac ttctgcgagc 421 atgaccttgc tgggctgttg agcaatgttt tggtcaagtt cacgctgtct gagatcaaga 481 gggtgatgca gatgctgctt aacggcctct actacatcca cagaaacaag atcctgcata 541 gggacatgaa ggctgctaat gtgcttatca ctcgtgatgg ggtcctgaag ctggcagact 601 ttgggctggc ccgggccttc agcctggcca agaacagcca gcccaaccgc tacaccaacc 661 gtgtggtgac actctggtac cggcccccgg agctgttgct cggggagcgg gactacggcc 721 cccccattga cctgtggggt gctgggtgca tcatggcaga gatgtggacc cgcagcccca 781 tcatgcaggc caacacggag cagcaccaac tcgccctcat cagtcagctc tgcggctcca 841 tcacccctga ggtgtggcca aacgtggaca actatgagct gtacgaaaag ctggagctgg 901 tcaagggcca gaagcggaag gtgaaggaca ggctgaaggc ctatgtgcgt gacccatacg 961 cactggacct catcgacaag ctgctggtgc tggaccctgc ccagcgcatc gacagcgatg 1021 acgccctcaa ccacgacttc ttctggtccg accccatgcc ctccgacctc aagggcatgc 1081 tctccaccca cctgacgtcc atgttcgagt acttggcacc accgcgccgg aagggcagcc 1141 agatcaccca gcagtccacc aaccagagtc gcaatcccgc caccaccaac cagacggagt 1201 ttgagcgcgt cttctgaggg ccggcgcttg ccactagggc tcttgtgttt tttttcttct 1261 gctatgtgac ttgcatcgtg gagacagggc atttgagttt atatctctca tgcatatttt 1321 atttaatccc caccctgggc tctgggagca gcccgctgag tggactggag tggagcattg 1381 gctgagagac caggagggca ctggagctgt cttgtccttg ctggttttct ggatggttcc 1441 cagagggttt ccatggggta ggaggatggg ctcgcccacc agtgactttt tctaagagct 1501 cccggcgtgg tggaagaggg gacaggtccc tcacccaccc acaatcctat tctcgggctg 1561 agaaccctgc gtgaggacag ggctcgcctc aggaatgggc tgtttttggc ctaaccctca 1621 gaaacactgg ggctggcaca aactcttggt ttcttcaaca ggagaatttt actgtgtttc 1681 ttttggttcc attgtttgga gacattcctg ggcacagttt ggtccgttag aattaaaagt 1741 tgaattttta aaaaaaaa // LOCUS HSSTPKEMK 2946 bp RNA PRI 20-DEC-1996 DEFINITION H.sapiens mRNA for serine/threonine protein kinase EMK. ACCESSION X97630 NID g1749793 KEYWORDS serine/threonine protein kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2946) AUTHORS Espinosa,L., Real,F.X. and Navarro,E. TITLE Cloning and characterization of the human putative ser/thr protein kinase EMK JOURNAL Unpublished REFERENCE 2 (bases 1 to 2946) AUTHORS Navarro,E. TITLE Direct Submission JOURNAL Submitted (30-APR-1996) E. Navarro, Institut Municipal dInvestigacio Medica, Dept. Biologia Cellular i Molecular, Dr Aiguader 80, Barcelona, 08003, SPAIN REMARK Revised by author 20-DEC-96 FEATURES Location/Qualifiers source 1..2946 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="colon" /cell_line="cancer cell line HT-29/M6" /dev_stage="adult" CDS 408..2645 /note="EMK homologue" /codon_start=1 /product="serine/threonine protein kinase" /db_xref="PID:e286647" /db_xref="PID:g1749794" /translation="MIRGRNSATSADEQPHIGNYRLLKTIGKGNFAKVKLARHILTGK EVAVKIIDKTQLNSSSLQKLFREVRIMKVLNHPNIVKLFEVIETEKTLYLVMEYASGG EVFDYLVAHGRMKEKEARAKFRQIVSAVQYCHQKFIVHRDLKAENLLLDADMNIKIAD FGFSNEFTFGNKLDTFCGSPPYAAPELFQGKKYDGPEVDVWSLGVILYTLVSGSLPFD GQNLKELRERVLRGKYRIPFYMSTDCENLLKKFLILNPSKRGTLEQIMKDRWMNVGHE DDELKPYVEPLPDYKDPRRTELMVSMGYTREEIQDSLVGQRYNEVMATYLLLGYKSSE LEGDTITLKPRPSADLTNSSAQFPSHKVQRSVSANPKQRRFSDQAGPAIPTSNSYSKK TQSNNAENKRPEEDRESGRKASSTAKVPASPLPGLERKKTTPTPSTNSVLSTSTNRSR NSPLLERASLGQASIQNGKDSLTMPGSRASTASASAAVSAARPRQHQKSMSASVHPNK ASGLPPTESNCEVPRPSTAPQRVPVASPSAHNISSSGGAPDRTNFPRGVSSRSTFHAG QLRQVRDQQNLPYGVTPASPSGHSQGRRGASGSIFSKFTSKFVRRNLNEPESKDRVET LRPHVVGSGGNDKEKEEFREAKPRSLRFTWSMKTTSSMEPNEMMREIRKVLDANSCQS ELHEKYMLLCMHGTPGHEDFVQWEMEVCKLPRLSLNGVRFKRISGTSMAFKNIASKIA NELKL" exon 1820..1981 /note="alternative exon" BASE COUNT 682 a 875 c 806 g 583 t ORIGIN 1 tcctggaatt gcacgcgctt cctgaccacc aggctctggc ccttgagaag ccagcggggc 61 tttgtccctg ttgctctcct tgccaaaccc agtctctctg ctagtggtgg tttcggttgc 121 gacaccgtcc aggttcccag gcaggaaccg ctcggcctgg ctgcttagct acttttcact 181 gaggaggtgg tggaaggtgt cgcctgctct ggctgagtaa gggtggctgg ctgagccggc 241 agcccccgcc ctaggcctgg ctcttcccgg cctctgtact ttgccctcgc tgcctgacag 301 gttctgctgt gggctctgct gaatggaagt cgctggtagt ccttttccct ttctccagtc 361 ggcccacctt gggacacctt gactccaagc ccagcagtaa gtccaacatg attcggggcc 421 gcaactcagc cacctctgct gatgagcagc cccacattgg aaactaccgg ctcctcaaga 481 ccattggcaa gggtaatttt gccaaggtga agttggcccg acacatcctg actgggaaag 541 aggtagctgt gaagatcatt gacaagactc aactgaactc ctccagcctc cagaaactat 601 tccgcgaagt aagaataatg aaggttttga atcatcccaa catagttaaa ttatttgaag 661 tgattgagac tgagaaaacg ctctaccttg tcatggagta cgctagtggc ggagaggtat 721 ttgattacct agtggctcat ggcaggatga aagaaaaaga ggctcgagcc aaattccgcc 781 agatagtgtc tgctgtgcag tactgtcacc agaagtttat tgtccataga gacttaaagg 841 cagaaaacct gctcttggat gctgatatga acatcaagat tgcagacttt ggcttcagca 901 atgaattcac ctttgggaac aagctggaca ccttctgtgg cagtccccct tatgctgccc 961 cagaactctt ccagggcaaa aaatatgatg gacccgaggt ggatgtgtgg agcctaggag 1021 ttatcctcta tacactggtc agcggatccc tgccttttga tggacagaac ctcaaggagc 1081 tgcgggaacg ggtactgagg gggaaatacc gtattccatt ctacatgtcc acggactgtg 1141 aaaacctgct taagaaattt ctcatcctta atcccagcaa gagaggcact ttagagcaaa 1201 tcatgaaaga tcgatggatg aatgtgggtc acgaagatga tgaactaaag ccttacgtgg 1261 agccactccc tgactacaag gacccccggc ggacagagct gatggtgtcc atgggttata 1321 cacgggaaga gatccaggac tcgctggtgg gccagagata caacgaggtg atggccacct 1381 atctgctcct gggctacaag agctccgagc tggaaggcga caccatcacc ctgaaacccc 1441 ggccttcagc tgatctaacc aatagcagcg cccaattccc atcccacaag gtacagcgaa 1501 gcgtgtcggc caatcccaag cagcggcgct tcagcgacca ggctggtcct gccattccca 1561 cctctaattc ttactctaag aagactcaga gtaacaacgc agaaaataag cggcctgagg 1621 aggaccggga gtcagggcgg aaagccagca gcacagccaa ggtgcctgcc agccccctgc 1681 ccggtctgga gaggaagaag accaccccaa ccccctccac gaacagcgtc ctctccacca 1741 gcacaaatcg aagcaggaat tccccacttt tggagcgggc cagcctcggc caggcctcca 1801 tccagaatgg caaagacagc ctaaccatgc cagggtcccg ggcctccacg gcttctgctt 1861 ctgccgcagt ctctgcggcc cggccccgcc agcaccagaa atccatgtcg gcctccgtgc 1921 accccaacaa ggcctctggg ctgcccccca cggagagtaa ctgtgaggtg ccgcggccca 1981 gcacagcccc ccagcgtgtc cctgttgcct ccccatccgc ccacaacatc agcagcagtg 2041 gtggagcccc agaccgaact aacttccccc ggggtgtgtc cagccgaagc accttccatg 2101 ctgggcagct ccgacaggtg cgggaccagc agaatttgcc ctacggtgtg accccagcct 2161 ctccctctgg ccacagccag ggccggcggg gggcctctgg gagcatcttc agcaagttca 2221 cctccaagtt tgtacgcagg aacctgaatg aacctgaaag caaagaccga gtggagacgc 2281 tcagacctca cgtggtgggc agtggcggca acgacaaaga aaaggaagaa tttcgggagg 2341 ccaagccccg ctccctccgc ttcacgtgga gtatgaagac cacgagctcc atggagccca 2401 acgagatgat gcgggagatc cgcaaggtgc tggacgcgaa cagctgccag agcgagctgc 2461 atgagaagta catgctgctg tgcatgcacg gcacgccggg ccacgaggac ttcgtgcagt 2521 gggagatgga ggtgtgcaaa ctgccgcggc tctctctcaa cggggttcga tttaagcgga 2581 tatcgggcac ctccatggcc ttcaaaaaca ttgcctccaa aatagccaac gagctgaagc 2641 tttaacaggc tgccaggagc gggggcggcg ggggcgggcc agctggacgg gctgccggcc 2701 gtgcgccgcc ccacctgggc gagactgcag cgatggattg gtgtgtctcc ctgctggcac 2761 ttctcccctc cctggccctt ctcagttttc tcccacattc acccctgccc agagattccc 2821 ccttctcctc tcccctactg gaggcaaagg aaggggaggg tggatggggg ggcagggctc 2881 cccctcggta ctgcggttgc acagagtatt tcgcctaaac caagaaattt tttattacca 2941 aaaaga // LOCUS HSSTPKSAK 3092 bp RNA PRI 27-MAY-1997 DEFINITION Homo sapiens mRNA for serine/threonine protein kinase SAK. ACCESSION Y13115 NID g2125813 KEYWORDS serine/threonine protein kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3092) AUTHORS Karn,T., Holtrich,U., Wolf,G., Hock,B., Strebhardt,K. and Ruebsamen-Waigmann,H. TITLE Human SAK related to the PLK/polo family of cell cycle kinases shows high mRNA expression in testis JOURNAL Oncol. Rep. 4, 505-510 (1997) REFERENCE 2 (bases 1 to 3092) AUTHORS Karn,T. TITLE Direct Submission JOURNAL Submitted (09-MAY-1997) T. Karn, Chemotherapeutisches Forschungsinstitut, Geor-Speyer-Haus, Paul-Ehrlich-Strasse 42-44, D-60596 Frankfurt, FRG FEATURES Location/Qualifiers source 1..3092 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="lung" /clone="K3" gene 141..3053 /gene="SAK" CDS 141..3053 /gene="SAK" /codon_start=1 /product="serine/threonine protein kinase" /db_xref="PID:e316514" /db_xref="PID:g2125814" /translation="MATCIGEKIEDFKVGNLLGKGSFAGVYRAESIHTGLEVAIKMID KKAMYKAGMVQRVKNEVKIHCQLKHPSILELYNYFEDSNYVYLVLEMCHNGEMNRYLK NRVKPFSENEARHFMHQIITGMLYLHSHGILHRDLTLSNLLLTRNMNIKIADFGLATQ LKMPHEKHYTLCGTPNYISPEIATRSAHGLESDVWSLGCMFYTLLIGRPPFDTDTVKN TLNKVVLADYEMPTFLSIEAKDLIHQLLRRNPADRLSLSSVLDHPFMSRNSSTKSKDL GTVEDSIDSGHATISTAITASSSTSISGSLFDKRRLLIGQPLPNKMTVFPKNKSSTDF SSSGDGNSFYTQWGNQETSNSGRGRVIQDAEERPHSRYLRRAYSSDRSGTSNSQSQAK TYTMERCHSAEMLSVSKRSGGGENEERYSPTDNNANIFNFFKEKTSSSSGSFERPDNN QALSNHLCPGKTPFPFADPTPQTETVQQWFGNLQINAHLRKTTEYDSISPNRDFQGHP DLQKDTSKNAWTDTKVKKNSDASDNAHSVKQQNTMKYMTALHSKPEIIQQECVFGSDP LSEQSKTRGMEPPWGYQNRTLRSITSPLVAHRLKPIRQKTKKAVVSILDSEEVCVELV KEYASQEYVKEVLQISSDGNTITIYYPNGGRGFPLADRPPSPTDNISRYSFDNLPEKY WRKYQYASRFVQLVRSKSPKITYFTRYAKCILMENSPGADFEVWFYDGVKIHKTEDFI QVIEKTGKSYTLKSESEVNSLKEEIKMFMDHANEGHRICLALESIISEEERKTRSAPF FPIIIGRKPGSTSSPKALSPPPSVDSNYPTRDRASFNRMVMHSAASPTQAPILNPSMV TNEGLGLTTTASGTDISSNSLKDCLPKSAQLLKSVFVKNVGWATQLTSGAVWVQFNDG SQLVVQAGVSSISYTSPNGQTTRYGENEKLPDYIKQKLQCLSSILLMFSNPTPNFH" BASE COUNT 1010 a 604 c 629 g 849 t ORIGIN 1 tttcagcgtc gtcgcctgga gcggcggttt agagaaccga gcctgatggg cgccaaggcc 61 ggctggctgc ttggagcgct gcctcgaagg gcctgcgtga aggaagctaa tccggagaac 121 ccaggccaga gcctggaaat atggcgacct gcatcgggga gaagatcgag gattttaaag 181 ttggaaatct gcttggtaaa ggatcatttg ctggtgtcta cagagctgag tccattcaca 241 ctggtttgga agttgcaatc aaaatgatag ataagaaagc catgtacaaa gcaggaatgg 301 tacagagagt caaaaatgag gtgaaaatac attgccaatt gaaacatcct tctatcttgg 361 agctttataa ctattttgaa gatagcaatt atgtgtatct ggtattagaa atgtgccata 421 atggagaaat gaacaggtat ctaaagaata gagtgaaacc cttctcagaa aatgaagctc 481 gacacttcat gcaccagatc atcacaggga tgttgtatct tcattctcat ggtatactac 541 accgggacct cacactttct aacctcctac tgactcgtaa tatgaacatc aagattgctg 601 attttgggct ggcaactcaa ctgaaaatgc cacatgaaaa gcactataca ttatgtggaa 661 ctcctaacta catttcacca gaaattgcca ctcgaagtgc acatggcctt gaatctgatg 721 tttggtccct gggctgtatg ttttatacat tacttatcgg gagaccaccc ttcgacactg 781 acacagtcaa gaacacatta aataaagtag tattggcaga ttatgaaatg ccaacttttt 841 tgtcaataga ggccaaggac cttattcacc agttacttcg tagaaatcca gcagatcgtt 901 taagtctgtc ttcagtattg gaccatcctt ttatgtcccg aaattcttca acaaaaagta 961 aagatttagg aactgtggaa gactcaattg atagtgggca tgccacaatt tctactgcaa 1021 ttacagcttc ttccagtacc agtataagtg gtagtttatt tgacaaaaga agacttttga 1081 ttggtcagcc actcccaaat aaaatgactg tatttccaaa gaataaaagt tcaactgatt 1141 tttcttcttc aggagatgga aacagttttt atactcagtg gggaaatcaa gaaaccagta 1201 atagtggaag gggaagagta attcaagatg cagaagaaag gccacattct cgataccttc 1261 gtagagctta ttcctctgat agatctggca cttctaatag tcagtctcaa gcaaaaacat 1321 atacaatgga acgatgtcac tcagcagaaa tgctttcagt gtccaaaaga tcaggaggag 1381 gtgaaaatga agagaggtac tcacccacag acaacaatgc caacattttt aacttcttta 1441 aagaaaagac atccagtagt tctggatctt ttgaaagacc tgataacaat caagcactct 1501 ccaatcatct ttgtccagga aaaactcctt ttccatttgc agacccgaca cctcagactg 1561 aaaccgtaca acagtggttt gggaatctgc aaataaatgc tcatttaaga aaaactactg 1621 aatatgacag catcagccca aaccgggact tccagggcca tccagatttg cagaaggaca 1681 catcaaaaaa tgcctggact gatacaaaag tcaaaaagaa ctctgatgct tctgataatg 1741 cacattctgt aaaacagcaa aataccatga aatatatgac tgcacttcac agtaaacctg 1801 agataatcca acaagaatgt gtttttggct cagatcctct ttctgaacag agcaagacta 1861 ggggtatgga gccaccatgg ggttatcaga atcgtacatt aagaagcatt acatctccgt 1921 tggttgctca caggttaaaa ccaatcagac agaaaaccaa aaaggctgtg gtgagcatac 1981 ttgattcaga ggaggtgtgt gtggagcttg taaaggagta tgcatctcaa gaatatgtga 2041 aagaagttct tcagatatct agtgatggaa atacgatcac tatttattat ccaaatggtg 2101 gtagaggttt tcctcttgct gatagaccac cctcacctac tgacaacatc agtaggtaca 2161 gctttgacaa tttaccagaa aaatactggc gaaaatatca atatgcttcc aggtttgtac 2221 agcttgtaag atctaaatct cccaaaatca cttattttac aagatatgct aaatgcattt 2281 tgatggagaa ttctcctggt gctgattttg aggtttggtt ttatgatggg gtaaaaatac 2341 acaaaacaga agatttcatt caggtgattg aaaagacagg gaagtcttac actttaaaaa 2401 gtgaaagtga agttaatagc ttgaaagagg agataaaaat gtttatggac catgctaatg 2461 agggtcatcg tatttgttta gcactggaat ccataatttc agaagaggaa aggaaaacta 2521 ggagtgctcc ctttttccca ataatcatag gaagaaaacc aggtagtact agttcaccta 2581 aggccttatc acctcctcct tctgtggatt caaattaccc aacgagagat agagcatctt 2641 tcaacagaat ggtcatgcat agtgctgctt ctccaacaca ggcaccaatc cttaatccct 2701 ctatggttac aaatgaagga cttggtctta caactacagc ttctggaaca gacatctctt 2761 ctaatagtct aaaagattgt cttcctaaat cagcacaact tttgaaatct gtttttgtga 2821 aaaatgttgg ttgggctaca cagttaacta gtggagctgt gtgggttcag tttaatgatg 2881 ggtcccagtt ggttgtgcag gcaggagtgt cttctatcag ttatacctca ccaaatggtc 2941 aaacaactag gtatggagaa aatgaaaaat taccagacta catcaaacag aaattacagt 3001 gtctgtcttc catccttttg atgttttcta atccgactcc taattttcat tgattaaaac 3061 tcctttcaga catataagtt taataaataa ct // LOCUS HSSTRIA 2790 bp mRNA PRI 30-JAN-1998 DEFINITION Homo sapiens mRNA for striatin. ACCESSION AJ223814 NID g2828317 KEYWORDS striatin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2790) AUTHORS Moqrich,A., Mattei,M.G., Bartoli,M., Rakitina,T., Baillat,G., Monneron,A. and Castets,F. TITLE Cloning of Human Striatin cDNA, Gene Mapping to 2p22-p21 and Preferential Expression in Brain JOURNAL Unpublished REFERENCE 2 (bases 1 to 2790) AUTHORS Castets,F. TITLE Direct Submission JOURNAL Submitted (28-JAN-1998) Castets F., LNCF, CNRS, 31 chemin Joseph Aiguier, 13009 Marseille, FRANCE FEATURES Location/Qualifiers source 1..2790 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="2" /clone="703908 and 3NHC2032" /lab_host="E. coli" /map="p22-p21" CDS 10..2352 /note="Calmodulin binding site at 149-165" /codon_start=1 /evidence=experimental /product="striatin" /db_xref="PID:e1248982" /db_xref="PID:g2828318" /translation="MDEQAGPGVFFSNNHPGAGGAKGLGPLAEAAAAGDGAAAAGAAR AQYSLPGILHFLQHEWARFEVERAQWEVERAELQAQIAFLQGERKGQENLKKDLVRRI KMLEYALKQERAKYHKLKYGTELNQGDMKPPSYDSDEGNETEVQPQQNSQLMWKQGRQ LLRQYLQEVGYTDTILDVKSKRVRALLGFSIDVTDREDDKNQDSVVNGTEAEVKETAM IAKSELTDSASVLDNFKFLESAAADFSDEDEDDDVDGREKSVIDTSTIVRKKALPDSG EDRDTKEALKEFDFLVTSEEGDNESRSAGDGTDWEKEDQCLMPEAWNVDQGVITKLKE QYKKERKGKKGVKRPNRSKLQDMLANLRDVDELPSLQPSVGSPSRPSSSRLPEHEINR ADEVEALTFPPSSGKSFIMGADEALESELGPGELAGLTVANEADSLTYDIANNKDALR KTWNPKFTLRSHFDGIRALAFHPIEPVLITASEDHTLKMWNLQKTAPAKKSTSLDVEP IYTFRAHKGPVLCVVMSSNGEQCYSGGTDGLIQGWNTTNPNIDPYDSYDPSVLRGPLL GHTDAVWGLAYSAAHQRLLSCSADGTLRLWNTTEVAPALSVFNDIKELGIPASVDLVS SDPSHMVASFSKGYTSIFNMETQQRILTLESNVDTTANSSCQINRVISHPTLSISITA HEDRHIKFYDNNTGKLIHSMVAHLEAVTSLAVDPNGLYLMSGSHDCSIRLWNLESKTC IQEFTAHRKKFEESIHDVAFHPSKCYIASAGADALAKVFV" repeat_unit 1264..1371 repeat_region 1264..2349 /rpt_family="WD" repeat_unit 1390..1482 repeat_unit 1549..1641 repeat_unit 1708..1800 repeat_unit 1846..1938 repeat_unit 1993..2085 repeat_unit 2119..2205 repeat_unit 2245..2349 BASE COUNT 843 a 578 c 695 g 673 t 1 others ORIGIN 1 gcggccgcca tggacgagca ggcgggtccc ggcgtcttct tcagcaacaa ccacccgggc 61 gccggcggtg ccaaggggct cgggcctctg gcggaggctg ccgcggccgg cgacggggcg 121 gctgcggcgg gggcggcccg agcccagtac agtctcccgg ggatcctgca cttcctgcag 181 cacgagtggg cccgcttcga ggtggagaga gcccagtggg aggtggagcg ggcggagctg 241 caggcccaga ttgccttcct gcagggagaa aggaagggcc aagaaaattt gaagaaggat 301 cttgtgagga ggatcaaaat gttggagtat gctcttaaac aggaaagagc caaataccac 361 aagttgaaat acgggacaga attgaatcag ggagatatga agcctccaag ctatgattct 421 gatgaaggta atgaaacaga agtgcagcca caacaaaaca gccagttaat gtggaaacaa 481 ggtcgacaac tactcagaca gtatctacag gaggtgggtt atacagatac tattctagat 541 gtgaaatcta aacgagtgcg agctttgttg ggcttttcaa ttgatgtcac ggacagggaa 601 gatgacaaaa atcaggactc agttgtaaat ggcacagagg ctgaagttaa agagacagca 661 atgattgcaa aatctgagtt aacagattct gcctccgtgc tggataattt caaattcctt 721 gaaagtgcag ctgcagattt cagtgatgaa gatgaagatg atgatgttga tggaagagag 781 aaaagcgtca ttgatacttc aacaattgtt aggaaaaaag cattgcctga cagcggtgaa 841 gatcgagata caaaagaagc tctaaaggag tttgacttct tggttacatc agaggaagga 901 gacaatgaat ctagaagtgc aggcgatgga acagactggg aaaaggaaga ccagtgtctc 961 atgcctgaag cctggaatgt ggaccaggga gtaattacca aactcaagga acaatacaaa 1021 aaggagagaa aggggaaaaa gggggtgaag aggcccaata ggtcaaaact acaagatatg 1081 cttgctaatt tgagagatgt tgatgaactt ccttcattgc agccatctgt gggttcacct 1141 tccagaccca gcagctccag gcttcctgaa catgaaatta atagggcaga tgaagtggaa 1201 gcattgacat ttcctccttc ttctggaaag tcattcatca tgggagcaga tgaagccctt 1261 gaaagtgaac tgggacctgg agaactagca ggccttacgg tggccaatga agcagactca 1321 ctaacttatg atatagcaaa caataaagat gcattgagga agacatggaa ccctaagttt 1381 acattgagaa gtcactttga tggcatccga gcccttgctt tccatcccat tgagcctgtt 1441 ttgataacag catcagagga tcacacatta aaaatgtgga atttacagaa aacagcccca 1501 gccaaaaaga gcacttctct tgatgtagaa cctatctata cattcagagc ccataaaggt 1561 ccagtgcttt gtgtggtaat gagcagcaat ggtgagcagt gttacagtgg tggtactgat 1621 ggactgatcc agggctggaa taccactaat cccaacatcg acccctatga ttcttatgat 1681 ccttctgttt tacgaggccc tctgctaggc cacacggatg cagtctgggg tttggcttat 1741 agtgcagcac atcagcgttt gttgtcctgt tcagcagatg gcactctgcg tttatggaat 1801 acaactgagg ttgctccagc actaagtgta tttaatgata ttaaagaact gggaatccct 1861 gcctctgtgg atctagtgag cagtgacccg agccatatgg tagcatcatt cagcaaggga 1921 tatacaagca tttttaacat ggaaacacaa caacgcattc tcactttaga atccaatgta 1981 gatacaacag ccaactcttc ctgccaaata aatagagtca tcagtcatcc tactctttcg 2041 atcagcatca ctgctcatga agacaggcac atcaaattct atgataacaa tacaggcaaa 2101 ctgatccact cgatggtagc ccacctagaa gctgttacaa gtttagcagt tgatcccaat 2161 ggcctttact tgatgtctgg cagtcatgac tgttcaatac gtttatggaa tctagaaagt 2221 aagacgtgta tccaagaatt cacagctcat cgaaaaaagt ttgaagaatc gattcatgat 2281 gtagctttcc acccatccaa atgctatata gccagtgctg gagctgacgc actggctaaa 2341 gtctttgtat gacgcaatgc atcatcttca ccttctagct gtttataagt aatcaactgc 2401 acacaagaga tacagaagac gagggcaaga atcatctcgt cctgcccttt tgttctgctg 2461 aaggagcaca gagaacattt gttgaagtat agttttgcaa ttcatatact gttttctaaa 2521 actaaggttt gttcaggttg ctgcaagctc agctgaatct gtgagcctga ggtctgtttc 2581 aaatttctcc ccaataggcg cctwtatttc tgaggtggtt ctaattcgct aggcaggcct 2641 gagcgaatac aagtttagct tgtccctgtt gagtaagtag ggcatgctac aatggataat 2701 ttaaaagctt gatagctggg actgaataga agaaaacggg aaacttagac aagttctccc 2761 tgagaaatct ggttaaaaca cattaattat // LOCUS HSSTRNAS 1846 bp mRNA PRI 17-JAN-1998 DEFINITION H.sapiens mRNA for seryl-tRNA synthetase. ACCESSION X91257 NID g1050526 KEYWORDS serS gene; seryl-tRNA synthetase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1846) AUTHORS Vincent,C., Tarbouriech,N. and Hartlein,M. TITLE Genomic organization, cDNA sequence, bacterial expression, and purification of human seryl-tRNA synthase JOURNAL Eur. J. Biochem. 250 (1), 77-84 (1997) MEDLINE 98092290 REFERENCE 2 (bases 1 to 1846) AUTHORS Hartlein,M. TITLE Direct Submission JOURNAL Submitted (08-SEP-1995) M. Hartlein, European Molecular Biology Laboratory, Outstation Grenoble, c/o I.L.L. B.P.156, 38042 Grenoble Cedex9, FRANCE FEATURES Location/Qualifiers source 1..1846 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" gene 76..1620 /gene="serS" CDS 76..1620 /gene="serS" /EC_number="6.1.1.11" /codon_start=1 /product="seryl-tRNA synthetase" /db_xref="PID:g1050527" /db_xref="SWISS-PROT:P49591" /translation="MVLDLDLFRVDKGGDPALIRETQEKRFKDPGLVDQLVKADSEWR RCRFRADNLSKLKNLCSKTIGEKMKKKEPVGDDESVPENVLSFDDLTADALANLKVSQ IKKVRLLIDEAILKCDAERIKLEAERFENLREIGNLLHPSVPISNDEDVDNKVERIWG DCTVRKKYSHVDLVVMVDGFEGEKGAVVAGSRGYFLKGVLVFLEQALIQYALRTLGSR GYIPIYTPFFMRKEVMQEVAQLSQFDEELYKVIGKGSEKSDDNSYDEKYLIATSEQPI AALHRDEWLRPEDLPIKYAGLSTCFRQEVGSHGRDTRGIFRVHQFEKIEQFVYSSPHD NKSWEMFEEMITTAEEFYQSLGIPYHIVNIVSGSLNHAASKKLDLEAWFPGSGAFREL VSCSNCTDYQARRLRIRYGQTKKMMDKVEFVHMLNATMCATTRTICAILENYQTEKGI TVPEKLKEFMPPGLQELIPFVKPAPIEQEPSKKQKKQHEGSKKKAAARDVTLENRLQN MEVTDA" BASE COUNT 476 a 444 c 526 g 400 t ORIGIN 1 gcagtgcggc ggtcacaggc tgagtgctgc ggcgcgatcc ttgcttccct gagcgttggc 61 ccgggaggaa agaagatggt gctggatctg gatttgtttc gggtggataa aggaggggac 121 ccagccctca tccgagagac gcaggagaag cgcttcaagg acccgggact agtggaccag 181 ctggtgaagg cagacagcga gtggcgacga tgtagatttc gggcagacaa cttgagcaag 241 ctgaagaacc tatgcagcaa gacaatcgga gagaaaatga agaaaaaaga gccagtggga 301 gatgatgagt ctgtcccaga gaatgtgctg agtttcgatg accttactgc agacgcttta 361 gctaacctga aagtctcaca aatcaaaaaa gtccgactcc tcattgatga agccatcctg 421 aagtgtgacg cggagcggat aaagttggaa gcagagcggt ttgagaacct ccgagagatt 481 gggaaccttc tgcacccttc tgtacccatc agtaacgatg aggatgtgga caacaaagta 541 gagaggattt ggggcgattg tacagtcagg aagaagtact ctcatgtgga cctggtggtg 601 atggtagatg gctttgaagg cgaaaagggg gccgtggtgg ctgggagtcg agggtacttc 661 ttgaaggggg tcctggtgtt cctggaacag gctctcatcc agtatgccct tcgcaccttg 721 ggaagtcggg gctacattcc catttatacc ccctttttca tgaggaagga ggtcatgcag 781 gaggtggcac agctcagcca gtttgatgaa gaactttata aggtgattgg caaaggcagt 841 gaaaagtctg atgacaactc ctatgatgag aagtacctga ttgccacctc agagcagccc 901 attgctgccc tgcaccggga tgagtggctc cggccggagg acctgcccat caagtatgct 961 ggcctgtcta cctgcttccg tcaggaggtg ggctcccatg gccgtgacac ccgtggcatc 1021 ttccgagtcc atcagtttga gaagattgaa cagtttgtgt actcatcacc ccatgacaac 1081 aagtcatggg agatgtttga agagatgatt accaccgcag aggagttcta ccagtccctg 1141 gggattcctt accacattgt gaatattgtc tcaggttctt tgaatcatgc tgccagtaag 1201 aagcttgacc tggaggcctg gtttccgggc tcaggagcct tccgtgagtt ggtctcctgt 1261 tctaattgca cggattacca ggctcgccgg cttcgaatcc gatatgggca aaccaagaag 1321 atgatggaca aggtggagtt tgtccatatg ctcaatgcta ccatgtgcgc cactacccgt 1381 accatctgcg ccatcctgga gaactaccag acagagaagg gcatcactgt gcctgagaaa 1441 ttgaaggagt tcatgccgcc aggactgcaa gaactgatcc cctttgtgaa gcctgcgccc 1501 attgagcagg agccatcaaa gaagcagaag aagcaacatg agggcagcaa aaagaaagca 1561 gcagcaagag acgtcaccct agaaaacagg ctgcagaaca tggaggtcac cgatgcttga 1621 acattcctgc ctccctattt gccaggcttt catttctgtc tgctgagatc tcagagcctg 1681 cccaacagca gggaagccaa gcacccattc atccccctgc ccccatctga ctgcgtagct 1741 gagaggggaa cagtgccatg taccacacag atgttcctgt ctcctcgcat gggcataggg 1801 acccatcatt gatgactgat gaaaccatgt aataaagcat ctctgg // LOCUS HSSTROM2 1743 bp RNA PRI 21-MAR-1995 DEFINITION Human mRNA for metalloproteinase stromelysin-2. ACCESSION X07820 Y00728 NID g36628 KEYWORDS metalloproteinase; stromelysin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1743) AUTHORS Breathnach,R. TITLE Direct Submission JOURNAL Submitted (06-JUN-1988) Breathnach R., Laboratoire de Genetique Moleculaire des Eucaryots de CNRS, Unite 184 de Biologie Moleculaire et de Genie Genetique de l'INSE RM, Faculte de Medecine, 11 rue Humann, 67085 STRASBOURG CEDEX, France REFERENCE 2 (bases 1 to 1461) AUTHORS Muller,D., Quantin,B., Gesnel,M.C., Millon-Collard,R., Abecassis,J. and Breathnach,R. TITLE The collagenase gene family in humans consists of at least four members JOURNAL Biochem. J. 253 (1), 187-192 (1988) MEDLINE 88339885 FEATURES Location/Qualifiers source 1..1743 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda gt10" sig_peptide 23..73 /note="signal peptide (AA -17 to -1)" CDS 23..1453 /note="pre-stromelysin-2 (AA -17 to 459)" /codon_start=1 /db_xref="PID:g36629" /db_xref="SWISS-PROT:P09238" /translation="MMHLAFLVLLCLPVCSAYPLSGAAKEEDSNKDLAQQYLEKYYNL EKDVKQFRRKDSNLIVKKIQGMQKFLGLEVTGKLDTDTLEVMRKPRCGVPDVGHFSSF PGMPKWRKTHLTYRIVNYTPDLPRDAVDSAIEKALKVWEEVTPLTFSRLYEGEADIMI SFAVKEHGDFYSFDGPGHSLAHAYPPGPGLYGDIHFDDDEKWTEDASGTNLFLVAAHE LGHSLGLFHSANTEALMYPLYNSFTELAQFRLSQDDVNGIQSLYGPPPASTEEPLVPT KSVPSGSEMPAKCDPALSFDAISTLRGEYLFFKDRYFWRRSHWNPEPEFHLISAFWPS LPSYLDAAYEVNSRDTVFIFKGNEFWAIRGNEVQAGYPRGIHTLGFPPTIRKIDAAVS DKEKKKTYFFAADKYWRFDENSQSMEQGFPRLIADDFPGVEPKVDAVLQAFGFFYFFS GSSQFEFDPNARMVTHILKSNSWLHC" mat_peptide 74..1450 /note="mature stromelysin-2 (AA 1 - 459)" misc_feature 1713..1718 /note="polyA signal" polyA_site 1743 /note="polyA site" BASE COUNT 485 a 363 c 399 g 496 t ORIGIN 1 aaagaaggta agggcagtga gaatgatgca tcttgcattc cttgtgctgt tgtgtctgcc 61 agtctgctct gcctatcctc tgagtggggc agcaaaagag gaggactcca acaaggatct 121 tgcccagcaa tacctagaaa agtactacaa cctcgaaaag gatgtgaaac agtttagaag 181 aaaggacagt aatctcattg ttaaaaaaat ccaaggaatg cagaagttcc ttgggttgga 241 ggtgacaggg aagctagaca ctgacactct ggaggtgatg cgcaagccca ggtgtggagt 301 tcctgacgtt ggtcacttca gctcctttcc tggcatgccg aagtggagga aaacccacct 361 tacatacagg attgtgaatt atacaccaga tttgccaaga gatgctgttg attctgccat 421 tgagaaagct ctgaaagtct gggaagaggt gactccactc acattctcca ggctgtatga 481 aggagaggct gatataatga tctctttcgc agttaaagaa catggagact tttactcttt 541 tgatggccca ggacacagtt tggctcatgc ctacccacct ggacctgggc tttatggaga 601 tattcacttt gatgatgatg aaaaatggac agaagatgca tcaggcacca atttattcct 661 cgttgctgct catgaacttg gccactccct ggggctcttt cactcagcca acactgaagc 721 tttgatgtac ccactctaca actcattcac agagctcgcc cagttccgcc tttcgcaaga 781 tgatgtgaat ggcattcagt ctctctacgg acctccccct gcctctactg aggaacccct 841 ggtgcccaca aaatctgttc cttcgggatc tgagatgcca gccaagtgtg atcctgcttt 901 gtccttcgat gccatcagca ctctgagggg agaatatctg ttctttaaag acagatattt 961 ttggcgaaga tcccactgga accctgaacc tgaatttcat ttgatttctg cattttggcc 1021 ctctcttcca tcatatttgg atgctgcata tgaagttaac agcagggaca ccgtttttat 1081 ttttaaagga aatgagttct gggccatcag aggaaatgag gtacaagcag gttatccaag 1141 aggcatccat accctgggtt ttcctccaac cataaggaaa attgatgcag ctgtttctga 1201 caaggaaaag aagaaaacat acttctttgc agcggacaaa tactggagat ttgatgaaaa 1261 tagccagtcc atggagcaag gcttccctag actaatagct gatgactttc caggagttga 1321 gcctaaggtt gatgctgtat tacaggcatt tggatttttc tacttcttca gtggatcatc 1381 acagtttgag tttgacccca atgccaggat ggtgacacac atattaaaga gtaacagctg 1441 gttacattgc taggcgagat agggggaaga cagatatggg tgtttttaat aaatctaata 1501 attattcatc taatgtatta tgagccaaaa tggttaattt ttcctgcatg ttctgtgact 1561 gaagaagatg agccttgcag atatctgcat gtgtcatgaa gaatgtttct ggaattcttc 1621 acttgctttt gaattgcact gaacagaatt aagaaatact catgtgcaat aggtgagaga 1681 atgtattttc atagatgtgt tattacttcc tcaataaaaa gttttatttt gggcctgttc 1741 ctt // LOCUS HSSUB15 3353 bp RNA PRI 12-JUN-1997 DEFINITION H.sapiens Sub1.5 mRNA. ACCESSION Y09160 NID g2196871 KEYWORDS Lsc homologue; oncogene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3353) AUTHORS Aasheim,H.C., Pedeutour,F. and Smeland,E.B. TITLE Characterization, expression and chromosomal localization of a human gene homologous to the mouse Lsc oncogene, with strongest expression in hematopoetic tissues JOURNAL Oncogene 14 (14), 1747-1752 (1997) MEDLINE 97280749 REFERENCE 2 (bases 1 to 3353) AUTHORS Aasheim,H.C. TITLE Direct Submission JOURNAL Submitted (15-NOV-1996) H.C. Aasheim, The Norwegian Cancer Institute, Immunology, The Norwegian Radiumhospital, Montebello, N-0310 Oslo, NORWAY FEATURES Location/Qualifiers source 1..3353 /organism="Homo sapiens" /isolate="TPA stimulated T cells" /db_xref="taxon:9606" /cell_type="T lymphocytes" /chromosome="19" /map="q13.13" /dev_stage="adult" /germline gene 436..3045 /gene="sub1.5" CDS 436..3045 /gene="sub1.5" /note="Lsc homologue" /codon_start=1 /db_xref="PID:e286621" /db_xref="PID:g2196872" /translation="MALLQHVALQFEPGPLLCCLHADMLGSLGPKEAKKAFLDFYHSF LEKTAVLRVPVPPNVAFELDRTRADLISEDVQRRFVQEVVQSQQVAVGRQLEDFRSKR LMGMTPWEQELAQLEAWVGRDRASYEARERHVAERLLMHLEEMQHTISTDEEKSAAVV NAIGLYMRHLGVRTKSGDKKSGRNFFRKKVMGNRRSDEPAKTKKGLSSILDAARWNRG EPQVPDFRHLKAEVDAEKPGATDRKGGGGDASRDRNIGAPGQDTPGVSLHPLSLDSPD REPADAPLEPGGLIPAGPMSLESLAPPESTDEGAETESPEPGDEGEPGRSGLELEPEE PPGWRELVPPDTLHSLPKSQVKRQEVISELLVTEAAHVRMLRVLHDLFFQPMAECLFF PLEELQNIFPSLDELIEVHSLFLDRLMKRRQESGYLIEEIGDVLLARFDGAEGSWFQK ISSRFCSRQSFALEQLKAKQRKDPRFCAFVQEAESRPRSRRLQLKDMIPTEMQRLTKY PLLLQSIGQNTEEPTEREKVELAAECCREILHHVNQAVRDMEDLLRLKDYQRRLDLSH LRQSSDPMLSEFKNLDITKKKLVHEGPLTWRVTKDKAVEVHVLLLDDLLLLLQRQDER LLLKSHSRTLTPTPDGKTMLRPVLRLTSAMTREVATDHKAFYVLFTWDQEAQIYELVA QTVSERKNWSALITETAGSLKVPAPASRPKPRPSPSSTREPLLSSSENGNGGRETSPA DARTERILSDLLPFCRPGPEGQLAATALRKVLSLKQLLFPAEEDNGAGPPRDGDGVPG GGPRSPARTQEIQENLLRLEETMKTLEELEEEFCRLRPLLSQLGGTLSPSLAALEFPP RRPFARRRMGERT" BASE COUNT 735 a 1007 c 1006 g 605 t ORIGIN 1 ggaaaggaaa ataatctgca gaaacgagct gagcttgcaa agacttcata gttcccaaga 61 attaaaaaaa aaaaaaaaaa gaattccact tgatcaactt aattcctttt ctttatcttc 121 cctccatcac ttcccttttc tcccaccctc ttttccaagc tgtttcgctt tgcaatatat 181 tactggtaat gagttgcagg ataatgcagt cataacttgt tttctcctaa gtatttgagt 241 tcaaaactcc tgtatctaaa gaaatacggt tggggtcatt aataaagaaa atctttctat 301 cttaaaaaaa aaaaaaccgt cagcatcatc ggggctgagg atgaggattt tgagaacgag 361 ctggagacaa actcagaaga gcaaaacagc cagttccaga gcctggagca ggtgaagcgg 421 cgcccagccc acctcatggc cctcctgcag cacgtggccc tgcagtttga gccaggaccc 481 ctgctttgct gtctgcatgc cgacatgctg ggctcactgg gccccaagga ggccaagaag 541 gccttcctgg acttctacca cagcttcctg gagaagacag cggttctccg ggtgccggtc 601 cctcccaacg tcgcctttga acttgaccgc actagggctg acctcatctc cgaggatgtc 661 cagcggcggt tcgtgcagga ggtggtgcaa agccagcagg tagccgtggg ccggcagctg 721 gaggacttcc gttccaagcg gctcatgggc atgacgccct gggagcagga gctggcccag 781 ctggaggctt gggttgggcg ggaccgagcc agctacgagg cccgggagcg gcacgtggcg 841 gagcggctgc tcatgcacct ggaggagatg caacatacca tctctaccga cgaagaaaag 901 agtgctgccg tggtcaacgc cattggcctg tacatgcgcc accttggggt gcggaccaag 961 agtggagaca agaagtcggg gaggaacttc ttccggaaaa aggtgatggg gaaccggcgg 1021 tcggacgagc ctgccaagac caagaagggg ctgagcagca tcctggatgc cgcccgctgg 1081 aaccggggag agccccaggt tccagatttt cgacacctca aagcagaggt tgatgccgag 1141 aagccaggtg ctacagaccg gaagggaggc ggtggggatg cctctcggga ccggaatatc 1201 ggggctcctg ggcaggacac ccctggagtc tctctgcacc ctctgtccct ggacagccca 1261 gaccgggaac cagctgacgc ccccctggag cctgggggac tcatccccgc aggcccaatg 1321 agcctggagt ccttggcgcc cccagagagt accgacgagg gggccgaaac cgagagcccc 1381 gagcctggag atgaggggga gccggggcgg tcgggactgg agcttgaacc agaagagcct 1441 cccggctggc gggaactcgt ccccccagac accctgcaca gcctgcccaa gagccaggtg 1501 aagcggcagg aggtcatcag cgagctgctg gtgacagagg cggcccacgt gcgcatgctg 1561 cgggtgctgc acgacctctt cttccagccc atggcagaat gcctgttctt ccccttggag 1621 gagctgcaga acatcttccc cagcctggac gagctcatcg aggtgcattc cctgttcctc 1681 gatcgcctga tgaagcggag gcaggagagt ggctacctca tcgaggagat cggagacgtg 1741 ctgctggccc ggtttgatgg tgctgagggc tcctggttcc agaaaatctc ctcccgcttc 1801 tgcagccgcc agtcatttgc cttagagcag ctcaaagcca agcaacgcaa ggaccctcgg 1861 ttctgtgcct tcgtgcagga agctgagagc cgcccgcgga gccgccgcct gcagctgaag 1921 gacatgatcc ccacggagat gcagcggctg accaagtacc ccctgctcct gcagagcatc 1981 gggcagaaca cagaagagcc cacagaacgg gagaaagtgg agctggcagc cgagtgctgc 2041 cgggaaattc tacaccacgt caaccaagcc gtgcgtgaca tggaggacct gctgaggctc 2101 aaggactatc agcggcgcct ggacttgtcc caccttcggc agagcagcga ccctatgctg 2161 agcgagttca agaacctgga catcaccaag aagaaattgg tccacgaggg cccactgacg 2221 tggcgggtga ctaaggacaa ggcagtggag gtgcatgtgc tgctgctgga cgacctgctg 2281 ctgctgctcc agcgccagga cgagcggctg ctgctcaagt cccatagccg gacactgacg 2341 cccacgcccg atggcaagac catgctgcgg cccgtgctgc ggctcacctc cgccatgacc 2401 cgcgaggtgg ccaccgatca caaagccttc tacgtccttt ttacctggga ccaggaggcc 2461 cagatatacg agctggtggc acagactgtg tcggagcgga aaaactggag tgctctcatc 2521 actgagactg ccggatccct gaaagtccct gcccctgcct ctcgccctaa gccccggccc 2581 agcccgagca gcacccgaga acccctcctc agcagctctg agaacggcaa tggtggccga 2641 gagacgtctc cagctgatgc ccggaccgag agaatcctca gtgacctcct gcccttctgc 2701 agaccaggcc ccgagggcca gctcgctgcc acggcccttc ggaaagtgct gtccctgaag 2761 cagcttctgt ttccggcgga ggaagacaat ggggcggggc ctcctcgaga tggggatggg 2821 gtcccagggg gcggcccccg tagcccagca cggacccagg aaatccagga gaacctgcta 2881 cgcttggagg agaccatgaa gacgctggag gagttggagg aggaattttg ccgcctgaga 2941 cccctcctgt ctcagcttgg gggaactctg tcccccagcc tggctgcact tgagttcccg 3001 cccagaaggc cttttgcaag aaggaggatg ggggagagga cgtgagggac cacccccacc 3061 cacacagctg ccgcagcatc tcacaccccg agggcctgag gagagggagc tgtggccacg 3121 cctgggaggg gcccagctgg ggttactggc cccgcatgag cctcggccat ctctccctcc 3181 tgccctctgc ttgggggact cagggctcca ttctggaggg caccacggtg acccgggcca 3241 tctcagtatt gcctgtgggg gccacccctc cacccccacc cccaagtgcc ttcgctctgt 3301 ttttataccc tgaattggag gtttattttt taatatatat tatctaagaa gaa // LOCUS HSSUISO 6021 bp RNA PRI 16-NOV-1993 DEFINITION H.sapiens si mRNA for sucrase-isomaltase. ACCESSION X63597 S41833 S41836 NID g36644 KEYWORDS isomaltase; pro-sucrase-isomaltase; si gene; sucrase; sucrase-isomaltase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6021) AUTHORS Lacasa,M. TITLE Direct Submission JOURNAL Submitted (27-DEC-1991) M. Lacasa, Centre National de la Recherche Scientifique, IRSC-7 Rue Guy Moquet, 94801 Villejuif Cedex, FRANCE REFERENCE 2 (bases 1 to 6021) AUTHORS Chantret,I., Lacasa,M., Chevalier,G., Ruf,J., Islam,I., Mantei,N., Edwards,Y., Swallow,D. and Rousset,M. TITLE Sequence of the complete cDNA and the 5' structure of the human sucrase-isomaltase gene. Possible homology with a yeast glucoamylase JOURNAL Biochem. J. 285 (Pt 3), 915-923 (1992) MEDLINE 92359963 COMMENT See also M22616. FEATURES Location/Qualifiers source 1..6021 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="intestine" /chromosome="3q25-26" gene 63..5546 /gene="si" misc_feature 63..2855 /gene="si" /note="isomaltase" CDS 63..5546 /gene="si" /EC_number="3.2.1.48" /EC_number="3.2.1.10" /codon_start=1 /product="prosucrose-isomaltase" /db_xref="PID:g36645" /db_xref="SWISS-PROT:P14410" /translation="MARKKFSGLEISLIVLFVIVTIIAIALIVVLATKTPAVDEISDS TSTPATTRVTTNPSDSGKCPNVLNDPVNVRINCIPEQFPTEGICAQRGCCWRPWNDSL IPWCFFVDNHGYNVQDMTTTSIGVEAKLNRIPSPTLFGNDINSVLFTTQNQTPNRFRF KITDPNNRRYEVPHQYVKEFTGPTVSDTLYDVKVAQNPFSIQVIRKSNGKTLFDTSIG PLVYSDQYLQISARLPSDYIYGIGEQVHKRFRHDLSWKTWPIFTRDQLPGDNNNNLYG HQTFFMCIEDTSGKSFGVFLMNSNAMEIFIQPTPIVTYRVTGGILDFYILLGDTPEQV VQQYQQLVGLPAMPAYWNLGFQLSRWNYKSLDVVKEVVRRNREAGIPFDTQVTDIDYM EDKKDFTYDQVAFNGLPQFVQDLHDHGQKYVIILDPAISIGRRANGTTYATYERGNTQ HVWINESDGSTPIIGEVWPGLTVYPDFTNPNCIDWWANECSIFHQEVQYDGLWIDMNE VSSFIQGSTKGCNVNKLNYPPFTPDILDKLMYSKTICMDAVQNWGKQYDVHSLYGYSM AIATEQAVQKVFPNKRSFILTRSTFAGSGRHAAHWLGDNTASWEQMEWSITGMLEFSL FGIPLVGADICGFVAETTEELCRRWMQLGAFYPFSRNHNSDGYEHQDPAFFGQNSLLV KSSRQYLTIRYTLLPFLYTLFYKAHVFGETVARPVLHEFYEDTNSWIEDTEFLWGPAL LITPVLKQGADTVSAYIPDAIWYDYESGAKRPWRKQRVDMYLPADKIGLHLRGGYIIP IQEPDVTTTASRKNPLGLIVALGENNTAKGDFFWDDGETKDTIQNGNYILYTFSVSNN TLDIVCTHSSYQEGTTLAFQTVKILGLTDSVTEVRVAENNQPMNAHSNFTYDASNQVL LIADLKLNLGRNFSVQWNQIFSENERFNCYPDADLATEQKCTQRGCVWRTGSSLSKAP ECYFPRQDNSYSVNSARYSSMGITADLQLNTANARIKLPSDPISTLRVEVKYHKNDML QFKIYDPQKKRYEVPVPLNIPTTPISTYEDRLYDVEIKENPFGIQIRRRSSGRVIWDS WLPGFAFNDQFIQISTRLPSEYIYGFGEVEHTAFKRDLNWNTWGMFTRDQPPGYKLNS YGFHPYYMALEEEGNAHGVFLLNSNAMDVTFQPTPALTYRTVGGILDFYMFLGPTPQV ATKQYHEVIGHPVMPAYWALGFQLCRYGYANTSEVRELYDAMVAANIPYDVQYTDIDY MERQLDFTIGEAFQDLPQFVDKIRGEGMRYIIILDPAISGNETKTYPAFERGQQNDVF VKWPNTNDICWAKVWPDLPNITIDKTLTEDEAVNASRAHVAFPDFFRTSTAEWWAREI VDFYNEKMKFDGLWIDMNEPSSFVNGTTTNQCRNDELNYPPYFPELTKRTDGLHFRTI CMEAEQILSDGTSVLHYDVHNLYGWSQMKPTHDALQKTTGKRGIVISRSTYPTSGRWG GHWLGDNYARWDNMDKSIIGMMEFSLFGISYTGADICGFFNNSEYHLCTRWMQLGAFY PYSRNHNIANTRRQDPASWNETFAEMSRNILNIRYTLLPYFYTQMHEIHANGGTVIRP LLHEFFDEKPTWDIFKQFLWGPAFMVTPVLEPYVQTVNAYVPNARWFDYHTGKDIGVR GQFQTFNASYDTINLHVRGGHILPCQEPAQNTFYSRQKHMKLIVAADDNQMAQGSLFW DDGESIDTYERDLYLSVQFNLNQTTLTSTILKRGYINKSETRLGSLHVWGKGTTPVNA VTLTYNGNKNSLPFNEDTTNMILRIDLTTHNVTLEEPIEINWS" misc_feature 2856..5546 /gene="si" /note="sucrase" polyA_signal 5992..5997 polyA_site 6009 BASE COUNT 1962 a 1127 c 1170 g 1762 t ORIGIN 1 tattttggca gccttatcca agtctggtac aacatagcaa agagaacagg ctatgaaata 61 agatggcaag aaagaaattt agtggattgg aaatctctct gattgtcctt tttgtcatag 121 ttactataat agctattgcc ttaattgttg ttttagcaac taagacacct gctgttgatg 181 aaattagtga ttctacttca actccagcta ctactcgtgt gactacaaat ccttctgatt 241 caggaaaatg tccaaatgtg ttaaatgatc ctgtcaatgt gagaataaac tgcattccag 301 aacaattccc aacagaggga atttgtgcac agagaggctg ctgctggagg ccgtggaatg 361 actctcttat tccttggtgc ttcttcgttg ataatcatgg ttataacgtt caagacatga 421 caacaacaag tattggagtt gaagccaaat taaacaggat accttcacct acactatttg 481 gaaatgacat caacagtgtt ctcttcacaa ctcaaaatca gacacccaat cgtttccggt 541 tcaagattac tgatccaaat aatagaagat atgaagttcc tcatcagtat gtaaaagagt 601 ttactggacc cacagtttct gatacgttgt atgatgtgaa ggttgcccaa aacccattta 661 gcatccaagt tattaggaaa agcaacggta aaactttgtt tgacaccagc attggtccct 721 tagtgtactc tgaccagtac ttacagatct cagcccgtct tccaagtgat tatatttatg 781 gtattggaga acaagttcat aagagatttc gtcatgattt atcctggaaa acatggccaa 841 tttttactcg agaccaactt cctggtgata ataataataa tttatacggc catcaaacat 901 tctttatgtg tattgaagat acatctggaa agtcattcgg tgttttttta atgaatagca 961 atgcaatgga gatttttatc cagcctactc caatagtaac atatagagtt accggtggca 1021 ttctggattt ttacatcctt ctaggagata caccagaaca agtagttcaa cagtatcaac 1081 agcttgttgg actaccagca atgccagcat attggaatct tggattccaa ctaagtcgct 1141 ggaattataa gtcactagat gtagtgaaag aagtggtaag gagaaaccgg gaagctggca 1201 taccatttga tacacaggtc actgatattg actacatgga agacaagaaa gactttactt 1261 atgatcaagt tgcgtttaac ggactccctc aatttgtgca agatttgcat gaccatggac 1321 agaaatatgt catcatcttg gaccctgcaa tttccatagg tcgacgtgcc aatggaacaa 1381 catatgcaac ctatgagagg ggaaacacac aacatgtgtg gataaatgag tcagatggaa 1441 gtacaccaat tattggagag gtatggccag gattaacagt ataccctgat ttcactaatc 1501 caaactgcat tgattggtgg gcaaatgaat gcagtatttt ccatcaagaa gtgcaatatg 1561 atggactttg gattgacatg aatgaagttt ccagctttat tcaaggttca acaaaaggat 1621 gtaatgtaaa caaattgaat tatccaccgt ttactcctga tattcttgac aaactcatgt 1681 attccaaaac aatttgcatg gatgctgtgc agaactgggg taaacagtat gatgttcata 1741 gcctctatgg atacagcatg gctatagcca cagagcaagc tgtacaaaaa gtttttccta 1801 ataagagaag cttcattctt acccgctcaa catttgctgg atctggaaga catgctgctc 1861 attggttagg agacaatact gcttcatggg aacaaatgga atggtctata actggaatgc 1921 tggagttcag tttgtttgga atacctttgg ttggagcaga catctgtgga tttgtggctg 1981 aaaccacaga agaactttgc agaagatgga tgcaacttgg ggcattttat ccattttcca 2041 gaaaccataa ttctgacgga tatgaacatc aggatcctgc attttttggg cagaattcac 2101 ttttggttaa atcatcaagg cagtatttaa ctattcgcta caccttatta cccttcctct 2161 acactctgtt ttataaagcc catgtgtttg gagaaacagt agcaagacca gttcttcatg 2221 agttttatga ggatacgaac agctggattg aggacactga gtttttgtgg ggccctgcat 2281 tacttattac tcctgttcta aaacagggag cagatactgt gagtgcctac atccctgatg 2341 ctatttggta tgattatgaa tctggtgcaa aaaggccatg gaggaaacaa cgggttgata 2401 tgtatcttcc agcagacaaa ataggattac atcttagagg aggttatatc atccccattc 2461 aagaaccaga tgtaacaaca acagcaagcc gtaagaatcc tctaggactt atagtcgcat 2521 taggtgaaaa caacacagcc aaaggagact ttttctggga tgatggagaa actaaagata 2581 caatacaaaa tggcaactac atattatata cattttcagt ttctaataac acattagata 2641 ttgtgtgcac acattcatca tatcaggaag gaactacctt agcatttcag actgtaaaaa 2701 tccttgggtt gacagacagt gttacagaag ttagagtggc ggaaaataat caaccaatga 2761 acgctcattc caatttcact tatgatgctt ctaaccaggt tctcctaatt gcagatctca 2821 aacttaatct tggaagaaac tttagtgttc aatggaatca aattttctca gaaaatgaaa 2881 gatttaattg ttatccagat gcagatttgg caactgaaca aaagtgcaca caacgtggct 2941 gtgtatggag aacgggttct tctctatcca aagcacctga gtgttacttt cccagacaag 3001 ataactctta ttcagtcaac tcagctcgct attcatccat gggtataaca gctgacctcc 3061 aactaaatac tgcaaatgcc agaataaagt taccttctga ccccatctca actcttcgtg 3121 tggaggtgaa atatcacaaa aatgatatgt tgcagtttaa gatttatgat ccccaaaaga 3181 agagatatga agtaccagta ccgttaaaca ttccaaccac cccaataagt acttatgaag 3241 acagacttta tgatgtggaa atcaaggaaa atccttttgg catccagatt cgacggagaa 3301 gcagtggaag agtcatttgg gattcttggc tgcctggatt tgcttttaat gaccagttca 3361 ttcaaatatc gactcgcctg ccatcagaat atatatatgg ttttggggaa gtggaacata 3421 cagcatttaa gcgagatctg aactggaata cttggggaat gttcacaaga gaccaacccc 3481 ctggttacaa acttaattcc tatggatttc atccctatta catggctctg gaagaggagg 3541 gcaatgctca tggtgttttc ttactcaaca gcaatgcaat ggatgttaca ttccagccaa 3601 ctcctgctct aacttaccgt acagttggag ggatcttgga tttttatatg tttttgggcc 3661 caactccaca agttgcaaca aagcaatacc atgaagtaat tggccatcca gtcatgccag 3721 cttattgggc tttgggattc caattatgtc gttatggata tgcaaatact tcagaggttc 3781 gggaattata tgacgctatg gtggctgcta acatccccta tgatgttcag tacacagaca 3841 ttgactacat ggaaaggcag ctagacttta caattggtga agcattccag gaccttcctc 3901 agtttgttga caaaataaga ggagaaggaa tgagatacat tattatcctg gatccagcaa 3961 tttcaggaaa tgaaacaaag acttaccctg catttgaaag aggacagcag aatgatgtct 4021 ttgtcaaatg gccaaacacc aatgacattt gttgggcaaa ggtttggcca gatttgccca 4081 acataacaat agataaaact ctaacggaag atgaagctgt taatgcttcc agagctcatg 4141 tagctttccc agatttcttc aggacttcca cagcagagtg gtgggccaga gaaattgtgg 4201 acttttacaa tgaaaagatg aagtttgatg gtttgtggat tgatatgaat gagccatcaa 4261 gttttgtaaa tggaacaact actaatcaat gcagaaatga cgaactaaat tatccacctt 4321 atttcccaga actcacaaaa agaactgatg gattacattt cagaacaatt tgcatggaag 4381 ctgagcagat tcttagtgat ggaacatcag ttttgcatta cgatgttcac aatctctatg 4441 gatggtcaca gatgaaacct actcatgatg cattgcaaaa gacaactgga aaaagaggga 4501 ttgtaatttc tcgttccacg tatcctacta gtggacgatg gggaggacac tggcttggag 4561 acaactatgc acgatgggac aacatggaca aatcaatcat tggtatgatg gaatttagtc 4621 tgtttggaat atcatatact ggagcagaca tctgtggttt tttcaacaac tcagaatatc 4681 atctctgtac ccgctggatg caacttggag cattttatcc atactcaagg aatcacaaca 4741 ttgcaaatac tagaagacaa gatcccgctt cctggaatga aacttttgct gaaatgtcaa 4801 ggaatattct aaatattaga tacaccttat tgccctattt ttacacacaa atgcatgaaa 4861 ttcatgctaa tggtggcact gttatccgac cccttttgca tgagttcttt gatgaaaaac 4921 caacctggga tatattcaag cagttcttat ggggtccagc atttatggtt accccagtac 4981 tggaacctta tgttcaaact gtaaatgcct acgtccccaa tgctcggtgg tttgactacc 5041 atacaggcaa agatattggc gtcagaggac aatttcaaac atttaatgct tcttatgaca 5101 caataaacct acatgtccgt ggtggtcaca tcctaccatg tcaagagcca gctcaaaaca 5161 cattttacag tcgacaaaaa cacatgaagc tcattgttgc tgcagatgat aatcagatgg 5221 cacagggttc tctgttttgg gatgatggag agagtataga cacctatgaa agagacctat 5281 atttatctgt acaatttaat ttaaaccaga ccaccttaac aagcactata ttgaagagag 5341 gttacataaa taaaagtgaa acgaggcttg gatcccttca tgtatggggg aaaggaacta 5401 ctcctgtcaa tgcagttact ctaacgtata acggaaataa aaattcgctt ccttttaatg 5461 aagacactac caacatgata ttacgtattg atctgaccac acacaatgtt actctagaag 5521 aaccaataga aatcaactgg tcatgaagat caccatcaat tttagttgtc aatgggaaaa 5581 aacaccagga tttaagtttc acagcactta caattttccc tcttcacttg gttcttgtac 5641 tctacaaaat atagctttca taacatcgaa aagttatttt gtagcgtaca tcaatgataa 5701 tgctaatttt attatagtaa tgtgacttgg attcaatttt aaggcatatt taacaaaatt 5761 tgaatagccc tatttatcct tgttaagtat cagctacaat tgtaaactag ttactaaaca 5821 tgtatgtaaa tagctaagat ataatttaaa cgtgattttt aaattaaata aaatttttat 5881 gtaattatat atactatatt tttctcaatg tttagcagat ttaagatatg taacaacaat 5941 tatttgaaga tttaattact tcttagtatg tgcatttaat tagaaaaaga gaataaaaaa 6001 tgtaagtgta aaaaaaaaaa a // LOCUS HSSURF1 1019 bp RNA PRI 21-JUL-1995 DEFINITION H.sapiens mRNA for SURF-1. ACCESSION Z35093 NID g895848 KEYWORDS SURF-1. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1019) AUTHORS Lennard,A., Gaston,K. and Fried,M. TITLE The Surf-1 and Surf-2 genes and their essential bidirectional promoter elements are conserved between mouse and human JOURNAL DNA Cell Biol. 13 (11), 1117-1126 (1994) MEDLINE 95217332 REFERENCE 2 (bases 1 to 1019) AUTHORS Fried,M. TITLE Direct Submission JOURNAL Submitted (11-JUL-1994) Mike Fried, Imperial Cancer Research Fund, P. O. Box 123, London, WC2A 3PX, United Kingdom FEATURES Location/Qualifiers source 1..1019 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 15..917 /codon_start=1 /product="SURF-1" /db_xref="PID:g895849" /translation="MAAVAALQLGLRAAGLGRAPASAAWRSVLRVSPRPGVAWRPSRC GSSAAEASATKAEDDSFLQWVLLLIPVTAFGLGTWQVQRRKWKLNLIAELESRVLAEP VPLPADPMELKNLEYRPVKVRGCFDHSKELYMMPRTMVDPVREAREGGLISSSTQSGA YVVTPFHCTDLGVTILVNRGFVPRKKVNPETRQKGQIEGEVDLIGMVRLTETRQPFVP ENNPERNHWHYRDLEAMARITGAEPIFIDANFQSTVPGGPIGGQTRVTLRNEHLQYIV TWYGLSAATSYLWFKKFLRGTPGV" BASE COUNT 220 a 271 c 314 g 214 t ORIGIN 1 cggggccggg tgcgatggcg gcggtggctg cgttgcagct ggggctgcgg gcggcggggc 61 tgggacgggc cccggccagc gccgcctgga ggagcgtcct cagggtctcc ccgcgcccag 121 gggtggcctg gaggccaagc agatgtggca gttctgcagc agaagcatct gccacaaaag 181 cggaagatga ctcctttctt cagtgggtcc tgctcctcat ccctgtgact gcctttggct 241 tggggacatg gcaggtccag cgtcggaagt ggaagctgaa cctgattgca gagttggagt 301 ccagagttct ggctgagcct gtccctctgc cagccgaccc aatggaactg aaaaatctgg 361 agtataggcc agtgaaggtc agggggtgct ttgaccattc caaggagctg tatatgatgc 421 cccggaccat ggtggaccct gtccgggagg cccgggaggg cggcctcatc tcctcctcaa 481 ctcagagtgg ggcctatgtg gtcactccct tccactgcac cgacctggga gtcaccatcc 541 tggtaaatag agggttcgtt cccaggaaga aagtgaatcc tgaaacccgg cagaaaggcc 601 agattgaggg agaagtggac ctcattggga tggtgaggct gacagaaacc aggcagcctt 661 ttgtccctga gaacaatcca gaaaggaacc actggcatta tcgagacctg gaagctatgg 721 ccagaatcac aggcgcagag cccatcttca ttgatgccaa cttccagagc acagtccctg 781 gaggacccat tggagggcaa accagagtta ctctgaggaa cgagcatctg cagtacatcg 841 tgacctggta tggactctct gcagctacat cctacctgtg gtttaagaaa ttcctacgtg 901 ggacacctgg tgtgtgacag atcagctgct gaagccctgt ccctggataa tgcagtattt 961 caagactgcc tttatgctgg atcatgtgct actggtataa agttctggcc ttctacctt // LOCUS HSSURF2 813 bp RNA PRI 21-JUL-1995 DEFINITION H.sapiens mRNA for SURF-2. ACCESSION Z35094 NID g895850 KEYWORDS SURF-2. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 813) AUTHORS Fried,M. TITLE Direct Submission JOURNAL Submitted (11-JUL-1994) Mike Fried, Imperial Cancer Research Fund, P. O. Box 123, London, WC2A 3PX, United Kingdom REFERENCE 2 (bases 1 to 813) AUTHORS Lennard,A., Gaston,K. and Fried,M. TITLE The Surf-1 and Surf-2 genes and their essential bidirectional promoter elements are conserved between mouse and human JOURNAL DNA Cell Biol. 13 (11), 1117-1126 (1994) MEDLINE 95217332 FEATURES Location/Qualifiers source 1..813 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 18..788 /codon_start=1 /product="SURF-2" /db_xref="PID:g895851" /translation="MSELPGDVRAFLREHPSLRLQTDTRKVRCILTGHELTCRLPELQ VYTRGKKYQRLVRASPAFDYAEFEPHIVPSTKNPHQLFCKLTLRHINKCPEHVLRHTQ GRRYQRALCKYEECQKQGVEYVPACLVHRRRRREDQMDGEAPRPREAFWEPTSSDEGA AASDDSMTDLYPPELFTRKDLGSTEDGDGTDDFLTDKEDEKAKPPREKATDEGRRETT VYQGLVQKRGKKQLGSLKKKFKSHHRKPKSFSSCKQPG" BASE COUNT 204 a 227 c 261 g 121 t ORIGIN 1 cgcgggcgtc gtcggccatg agcgagttgc cgggcgacgt gcgggcgttt ctgcgggagc 61 acccgagcct gcggctccag acggacaccc gcaaggtgag gtgcatcctg acaggtcacg 121 agctgacctg ccgcctgccg gagctccagg tctacacccg cggcaaaaag taccagcggc 181 tggtccgcgc ctccccggcc ttcgactatg cagagttcga gccgcacatc gtgcccagca 241 ccaagaaccc gcaccagttg ttctgcaaac tcaccctgcg gcacatcaac aagtgcccag 301 aacacgtgct gaggcacacc cagggccggc ggtaccagcg agctctgtgt aaatatgaag 361 aatgtcagaa gcaaggggtg gagtacgtgc ctgcctgcct ggtgcaccgg aggaggagga 421 gggaggacca gatggacggt gaggcgcctc gcccgcggga agccttctgg gagcccacat 481 ccagtgatga gggggcagct gcaagtgatg acagcatgac agacctgtac ccacctgagc 541 tattcaccag aaaggacctt ggaagcacgg aggatgggga tggcactgat gactttttga 601 cagacaaaga ggatgagaag gcaaagcccc caagagagaa ggccactgat gagggcagga 661 gagagacgac cgtgtaccaa gggctggtcc agaagcgcgg gaagaagcag ttgggctcgt 721 tgaaaaagaa gttcaagagt catcaccgca aacccaagag cttcagctcc tgtaaacagc 781 caggttaata aaagcacatg ccgtgaagtt tcg // LOCUS HSSURF5MR 442 bp RNA PRI 07-MAY-1996 DEFINITION H.sapiens SURF-5 mRNA. ACCESSION X85178 NID g1150423 KEYWORDS SURF-5. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 442) AUTHORS Fried,M. TITLE Direct Submission JOURNAL Submitted (07-MAR-1995) M. Fried, Imperial Cancer Research Fund, PO Box 123, Lincoln's Inn Fields, London WC2A 3PX, UK REFERENCE 2 (bases 1 to 442) AUTHORS Garson,K., Duhig,T., Armes,N., Colombo,P. and Fried,M. TITLE Surf5: a gene in the tightly clustered mouse surfeit locus is highly conserved and transcribed divergently from the rpL7A (Surf3) gene JOURNAL Genomics 30 (2), 163-170 (1995) MEDLINE 96163868 FEATURES Location/Qualifiers source 1..442 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HL60 cells" /chromosome="9" /map="9q34.2" gene 1..423 /gene="SURF-5" CDS 1..423 /gene="SURF-5" /codon_start=1 /product="SURF-5 protein" /db_xref="PID:e140918" /db_xref="PID:g1150424" /translation="MAQQRALPQSKETLLQSYNKRLKDDIKSIMDNFTEIIKTAKIED ETQVSRATQGEQDNYEMHVRAANIVRAGESLMKLVSDLKQFLILNDFPSVNEAIDQRN QQLRTLQEECDRKLITLRDEISIDLYELEEEYYSSRYK" exon <1..123 /gene="SURF-5" /number=1 exon 124..204 /gene="SURF-5" /number=2 exon 205..>442 /number=3 BASE COUNT 116 a 131 c 123 g 72 t ORIGIN 1 atggcccagc agagagccct gccccagagc aaggagacgc tgctgcagtc ctacaacaag 61 cggctgaagg acgacattaa gtccatcatg gacaacttca ccgagatcat caagaccgcc 121 aagattgagg acgagacgca ggtgtcacgg gccactcagg gtgaacagga caattacgag 181 atgcatgtgc gagccgccaa catcgtccga gccggcgagt ccctgatgaa gctggtgtcc 241 gacctcaagc agttcctgat cctcaatgac ttcccctccg tgaacgaggc cattgaccag 301 cgcaaccagc agctgcgcac actgcaggag gagtgcgacc ggaagctcat cacgctgcga 361 gacgagatct ccattgacct ctacgagctg gaggaggagt attactcgtc caggtataaa 421 tagcgctgga ctccccatgc ag // LOCUS HSSYMPLEK 3984 bp RNA PRI 30-MAY-1997 DEFINITION H.sapiens mRNA for symplekin. ACCESSION Y10931 NID g2143261 KEYWORDS symplekin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3984) AUTHORS Alwazzan,M., Hamshere,M.G., Lennon,G. and Brook,J.D. TITLE Six transcripts map within 200 kilobases of the myotonic dystrophy expanded repeat JOURNAL Unpublished REFERENCE 2 (bases 1 to 3984) AUTHORS Alwazzan,M. TITLE Direct Submission JOURNAL Submitted (24-JAN-1997) M. Alwazzan, Queens Medical Centre, Genetics, University of Nottingham, Nottingham NG7 2 UH, UK COMMENT Related sequence: U49240. FEATURES Location/Qualifiers source 1..3984 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="19" /map="q13.3" /dev_stage="adult" /tissue_type="muscle" /clone="5C-K7" /clone="5C-FC6" /clone="K2C-17" CDS 460..3888 /codon_start=1 /product="symplekin" /db_xref="PID:e305053" /db_xref="PID:g2143262" /translation="MTQLYKVALQWMVKSRVISELQEACWDMVSAMAGDIILLLDSDN DGIRTHAIKFVEGLIVTLSPRMADSEIPRRQEHDISLDRIPRDHPYIQYNVLWEEGKA ALEQLLKFMVHPAISSINLTTALGSLANIARQRPMFMSEVIQAYETLHANLPPTLAKS QVSSVRKNLKLHLLSVLKHPASLEFQAQITTLLVDLGTPQAEIARNMPSSKDTRKRPR DDSDSTLKKMKLEPNLGEDDEDKDLEPGPSGTSKASAQISGQSDTDITAEFLQPLLTP DNVANLVLISMVYLPEAMPASFQAIYTPVESAGTEAQIKHLARLMATQMTAAGLGPGV EQTKQCKEEPKEEKVVKTESVLIKRRLSAQGQAISVVGSLSSMSPLEEEAPQAKRRPE PIIPVTQPRLAGAGGRKKIFRLSDVLKPLTDAQVEAMKLGAVKRILRAEKAVACSGAA QVRIKILASLVTQFNSGLKAEVLSFILEDVRARLDLAFAWLYQEYNAYLAAGASGSLD KYEDCLIRLLSGLQEKPDQKDGIFTKVVLEAPLITESALEVVRKYCEDESRTYLGMST LRDLIFKRPSRQFQYLHVLLDLSSHEKDKVRSQALLFIKRMYEKEQLREYVEKFALNY LQLLVHPNPPSVLFGADKDTEVAAPWTEETVKQCLYLYLALLPQNHKLIHELAAVYTE AIADIKRTVLRVIEQPIRGMGMNSPELLLLVENCPKGAETLVTRCLHSLTDKVPPSPE LVKRVRDLYHKRLPDVRFLIPVLNGLEKKEVIQALPKLIKLNPIVVKEVFNRLLGTQH GEGNSALSPLNPGELLIALHNIDSVKCDMKSIIKATNLCFAERNVYTSEVLAVVMQQL MEQSPLPMLLMRTVIQSLTMYPRLGGFVMNILSRLIMKQVWKYPKVWEGFIKCCQRTK PQSFQVILQLPPQQLGAVFDKCPELREPLLAHVRSFTPHQQAHIPNSIMTILEASGKQ EPEAKEAPAGPLEEDDLEPLTLAPAPAPRPPQDLIGLRLAQEKALKRQLEEEQKLKPG GVGAPSSSSPSPSPSARPGPPPSEEAMDFREEGPECETPGIFISMDDDSGLTEAALLD SSLEGPLPKETAAGGLTLKEERSPQTLAPVGEDAMKTPSPAAEDAREPEAKGNS" BASE COUNT 888 a 1222 c 1194 g 680 t ORIGIN 1 taggaagagg cactgctgag ggggcgcgag gggaacggag gccagagctg cgctgacagc 61 agccatggcg agcggcagtg gagacagcgt cacccgtcgg agcgtggcat cacagttttt 121 cactcaagag gaggggccgg gcatcgatgg catgaccacc tcagagaggg tggtggatct 181 tctgaaccag gcggcgctga tcaccaatga ctcaaagatc acagtgctca aacaggtcca 241 ggagctgatc atcaacaaag acccacacta ctggacaact tcctggatga gatcatcgca 301 ttccaagcag acaagtcaat cgaagtgcga aaatttgtca tcggcttcat cgaggaggca 361 tgcaagcgag acattgagtt gctgctgaaa ctcattgcaa acctcaacat gctcttgagg 421 gacgagaatg tgaacgtggt gaagaaggct atcctcacca tgacccagct ctacaaggtg 481 gccctgcagt ggatggtaaa gtcacgggta attagcgagc tacaggaggc ctgctgggac 541 atggtatctg ccatggcggg ggacatcatc ctgctattgg actctgacaa tgacggcatc 601 cgcacccacg ccatcaagtt tgtggagggc ctcattgtca ccctgtcacc ccgcatggct 661 gactcagaga taccccgacg ccaggagcat gatatcagcc tggaccgcat ccctcgtgac 721 cacccctaca tccagtacaa cgtgctatgg gaagagggca aggcagcctt ggagcagctg 781 cttaagttca tggtgcaccc tgccatctcc tccatcaacc tgaccacagc gctgggctcc 841 cttgccaata tcgcccgcca gagacccatg ttcatgtctg aggtgatcca ggcctatgaa 901 actctgcatg ccaacctgcc cccgacgctg gccaaatcgc aggtgagcag tgtgcgtaag 961 aatctgaagc tgcacctgtt gagtgtgctg aagcacccgg cttccttgga gttccaggcc 1021 cagatcacca ccctgctggt ggacctgggc acacctcagg ccgagatcgc ccgcaacatg 1081 ccgagcagca aggacacccg caagcggccc cgcgatgact cggactccac actcaagaag 1141 atgaagctgg agcccaacct gggggaggac gatgaggaca aagacttgga gccaggcccg 1201 tcggggacct cgaaggcctc agcgcagatc tccggccagt cagacacgga catcacagct 1261 gagttcctgc agcctctgct gacgcctgat aatgtggcta atctggtcct catcagcatg 1321 gtgtacctac ccgaggccat gccagcctcc ttccaggcca tctacacccc cgtggagtca 1381 gcaggcacgg aagcccagat caagcacctg gctcggctca tggccacaca gatgacagct 1441 gccggactgg gaccaggtgt agagcagacc aaacagtgca aggaggagcc caaggaggag 1501 aaggtggtga agacagagag cgtcctgatc aagcggcgcc tgtcagccca gggccaagcc 1561 atctcggtgg tgggttccct gagctccatg tcccccctgg aggaagaggc accgcaggcc 1621 aagaggaggc cagagcccat tatccctgtc actcagcccc ggctggcagg cgctggtggg 1681 cgcaagaaaa ttttccgtct cagcgacgtg ctgaagcccc ttaccgatgc ccaggtggaa 1741 gccatgaagc tgggcgctgt gaagcggatc ctgcgggctg agaaggctgt ggcctgcagc 1801 ggggcagccc aggtccgcat aaagatcctg gccagcctgg tgacacagtt caactcgggc 1861 ctgaaggcgg aggtcctgtc cttcatcctg gaggatgtgc gggcccgcct ggacctggcc 1921 ttcgcctggc tctaccagga gtacaacgcc tacctggccg caggtgcctc gggctccctg 1981 gacaagtatg aggactgcct catccgcctg ttgtctggcc tgcaggagaa accagaccag 2041 aaggatggga tcttcaccaa ggttgtgctg gaggcgccac tcatcacaga gagtgccctg 2101 gaggtggtcc gcaagtactg cgaggatgag agtcgcacct atctgggcat gtccacactt 2161 cgagacctga tcttcaagcg cccgtcccgc cagttccagt acctgcatgt cctcctcgac 2221 ctcagctccc atgagaagga caaggtgcgc tcccaggccc tgctgttcat caaacgcatg 2281 tatgagaagg agcagctgcg ggagtatgtg gagaaatttg ccctcaacta cctgcagctc 2341 ctggtgcacc ccaacccacc gtctgtgctg tttggagctg acaaggacac agaggtggca 2401 gcaccctgga cggaggagac agtgaagcag tgtctgtacc tctacctggc cctcctgcct 2461 cagaaccaca agctgatcca cgaactggcg gccgtgtaca ctgaagccat cgccgacatc 2521 aagcggacgg tgctgagggt cattgagcag ccgatccgag gaatgggcat gaactccccg 2581 gagctgctcc tgctggtgga aaattgtccc aagggagcag agacactggt cacgagatgt 2641 ctgcacagcc tcacagacaa agtcccaccc tccccagagc tggtgaagcg ggtccgggat 2701 ctctaccaca agcgactgcc agacgtccgc ttcctcatcc cggtgctcaa tgggctggag 2761 aagaaagagg tgatccaggc cctgcctaaa ctcatcaaac tcaaccccat cgtggtgaag 2821 gaagtcttca accgcctgct gggcacccag catggtgagg gaaactcagc cttgtccccg 2881 ctgaaccctg gagagctcct gatcgcatta cacaacattg actccgtgaa gtgcgacatg 2941 aaatccatca tcaaagccac caacctgtgc tttgcggagc ggaacgtgta cacgtcagag 3001 gtgctggccg tggtgatgca gcagctgatg gagcagagcc ccctgcccat gctgctcatg 3061 aggaccgtca tccagtccct gaccatgtac ccccgcctgg ggggcttcgt catgaacatc 3121 ctgtcccgcc tcatcatgaa gcaggtgtgg aagtacccca aggtgtggga gggcttcatc 3181 aagtgctgcc agcgcacaaa gccccagagc ttccaggtca tcctgcagct gccgccccag 3241 cagctgggag ccgtctttga caagtgccca gagctccggg agcccctgct ggcccatgtc 3301 cgctccttca ccccccacca gcaagctcac atccctaact ccatcatgac catcttggag 3361 gccagcggca agcaggagcc agaggccaag gaggcgcctg cggggccctt ggaggaggat 3421 gatctggagc ccctgacctt ggccccggcc ccagcacccc ggccccctca ggacctcatc 3481 ggcctgcgac tggcccagga gaaggcctta aagcggcagc tggaggagga acagaagctg 3541 aagccgggag gagtgggagc cccctcctct tcctccccct ctccctctcc gtcggcccgg 3601 ccaggcccgc ccccgtctga ggaagccatg gatttccggg aggaggggcc tgagtgcgag 3661 accccgggca tcttcatcag catggatgac gactcggggc tgaccgaggc cgcgctgttg 3721 gactctagtc tcgagggccc cctacccaag gagacggcag cgggcgggct gaccttgaag 3781 gaggagcgga gcccccagac cctcgcacct gttggagaag atgctatgaa gactcccagc 3841 ccggctgccg aggacgccag ggaacccgag gccaagggga acagctgacg gggctcgagg 3901 gggaaagggg gtgggacagg gactcggggc tgggggacgg ggcggggctt gacctgcggg 3961 tgctttgcct taaaaagaaa taaa // LOCUS HSSYN5 2156 bp RNA PRI 29-NOV-1994 DEFINITION Human mRNA for 5-aminolevulinate synthase. ACCESSION Y00451 NID g36648 KEYWORDS 5-aminolevulinate synthetase; synthase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2156) AUTHORS May,B.K. TITLE Direct Submission JOURNAL Submitted (05-OCT-1987) May B.K., University of Adelaide, Biochemistry Dept., niversity, University of Adelaide, P.O. Box 498, Adelaide 500, South Australia REFERENCE 2 (bases 1 to 2156) AUTHORS Bawden,M.J., Borthwick,I.A., Healy,H.M., Morris,C.P., May,B.K. and Elliott,W.H. TITLE Sequence of human 5-aminolevulinate synthase cDNA JOURNAL Nucleic Acids Res. 15 (20), 8563 (1987) MEDLINE 88040476 FEATURES Location/Qualifiers source 1..2156 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" sig_peptide 84..251 CDS 84..2012 /note="5-aminolevulinate synthase precursor" /codon_start=1 /db_xref="PID:g599830" /db_xref="SWISS-PROT:P13196" /translation="MDSVVGRCPFLSRVPQAFLQKAGKSLLFYAQNCPKMMEVGAKQP SRIVHCSSTLPQDQETPPASEKDQTAKAKVQQTLMDPSRVQMAHSFRLDSVWTPLAAT SQGTASKCPFLAAQMIREAAVSSAKPVLSFRRDVQEMNAVKKEGAETSAGPSVVSVKT DGGDPSGLLKNVQDIMQKQRPERVSHLLHDNLPKSVSTFHYDRFFEKKSDEKNDDHTY RVFKTVNRRAHIFPMADDYSDSLITKKQVSVWCSNDYLGMSRHPRVCGAVMDTLKQHG AGAGGTRNISGTSKFHVDLERELADLHGKDAALLFSSCFVAHDSTLFTLVKMMPGCEI YSDSGDHASMIQGIRNSRVPKYIFRHNDVSHLRELLQRSDPSVPKIVAFETVHSVDGA VLPLEELCDVAHEFGAITFVDEVHAWGLYGARGGGIGDRDGVMPKMDIISGTLGKAFG CVGGYIASTRSLMDTVRSYAAGFIFTTSLPPMLLAGALESVRILKSAEGRVVRRQHQR NVKLMRQMLMDAGLPVVHCPSHIIPVRVADAAKNTEVCDELMSRHNIYVQAINYPTVP RGEELLRIAPTPHHTPQMMNYFLENLLVTWKQVGLELKPHSSAECNFCRRPLHFEVMS EREKSYFSGLSKLVSAKA" mat_peptide 252..2015 BASE COUNT 550 a 556 c 548 g 502 t ORIGIN 1 gttgcccttg tcgacttgag tgcccgcctc cttcgccgcc gcctctgcag tcctcagcgc 61 aggagccagc atacttcctg aacatggaca gtgttgttgg ccgctgccca ttcttatcgc 121 gagtccccca ggcctttctg cagaaagcag gcaaatctct gttgttctat gcccaaaact 181 gccccaagat gatggaagtt ggggccaagc agccctcgcg gattgtccac tgcagcagta 241 cactaccaca agatcaagaa acccctccgg ccagtgagaa agatcaaact gctaaggcca 301 aggtccaaca gactctgatg gatcccagca gagtccagat ggcacacagc ttccgtctgg 361 attccgtctg gacacccctt gctgccacaa gccagggcac tgcaagcaaa tgccctttcc 421 tggcagcaca gatgatcaga gaggcagcag tgtcttctgc aaagccagtc ttgagcttca 481 ggagggatgt gcaggaaatg aatgccgtta agaaagaggg tgctgaaacc tcagcaggcc 541 ccagtgtggt tagtgtgaaa accgatggag gggatcccag tggactgctg aagaacgtcc 601 aggacatcat gcaaaagcag agaccagaaa gagtgtctca tcttcttcat gataacttgc 661 caaaatctgt ttccactttt cactatgatc gtttctttga gaaaaaaagt gatgagaaaa 721 acgatgacca cacctatcga gtttttaaaa ctgtgaaccg gcgagcacac atcttcccca 781 tggcagatga ctattcagac tccctcatca ccaaaaagca agtgtcagtc tggtgcagta 841 atgactacct aggaatgagt cgccacccac gggtgtgtgg ggcagttatg gacactttga 901 aacaacatgg tgctggggca ggtggtacta gaaatatttc tggaactagt aaattccatg 961 tggacttaga gcgggagctg gcagacctcc atgggaaaga tgccgcactc ttgttttcct 1021 cctgctttgt ggcccatgac tcaaccctct tcaccctggt caagatgatg ccaggctgtg 1081 agatttactc tgattctggg gaccatgcct ccatgatcca agggattcga aacagccgag 1141 tgccaaagta catcttccgc cacaatgatg tcagccacct cagagaactg ctgcaaagat 1201 ctgacccctc agtccccaag attgtggcat ttgaaactgt ccattcagtg gatggggcgg 1261 tgctgccact ggaagagctg tgtgatgtgg cccatgagtt tggagcaatc accttcgtgg 1321 atgaggtcca cgcatggggg ctttatgggg ctcgaggcgg agggattggg gatcgggatg 1381 gagtcatgcc aaaaatggac atcatttctg gaacattggg caaagccttt ggttgtgttg 1441 gagggtacat cgccagcacg aggtctctga tggacaccgt acggtcctat gctgctggct 1501 tcatcttcac cacctctctg ccacccatgc tgctggctgg agccctggag tctgtgcgga 1561 tcctgaagag cgctgaggga cgggtcgttc gccgccagca ccagcgcaac gtcaaactca 1621 tgagacagat gctaatggat gccggcctcc ctgtggtcca ctgccccagc cacatcatcc 1681 ctgtgcgggt tgcagatgct gctaaaaaca cagaagtctg tgatgaacta atgagcagac 1741 ataacatcta cgtgcaagca atcaattacc ctacggtgcc ccggggagaa gagctcctac 1801 ggattgcccc cacccctcac cacacacccc agatgatgaa ctacttcctt gagaatctgc 1861 tagtcacatg gaagcaagtg gggctggaac tgaagcctca ttcctcagct gagtgcaact 1921 tctgcaggag gccactgcat tttgaagtga tgagtgaaag agagaagtcc tatttctcag 1981 gcttgagcaa gttggtatct gctaaggcct gagcatgacc tcaattattt cacttaaccc 2041 caggccatta tcatatccag atggtcttca agttgtctta tatgtgaatt aagttatatt 2101 aaattttaat ctatgataaa aacatagtcc tggaaataaa tctgcttaat ggtgaa // LOCUS HSSYNAPTO 2058 bp RNA PRI 30-NOV-1997 DEFINITION Homo sapiens mRNA for synaptopodin. ACCESSION Y11072 NID g2654322 KEYWORDS actin associated; synaptopodin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2058) AUTHORS Mundel,P., Heid,H.W., Mundel,T.M., Kruger,M., Reiser,J. and Kriz,W. TITLE Synaptopodin: An actin-associated protein in telencephalic dendrites and renal podocytes JOURNAL J. Cell Biol. 139 (1), 193-204 (1997) MEDLINE 97461576 REFERENCE 2 (bases 1 to 2058) AUTHORS Mundel,P. TITLE Direct Submission JOURNAL Submitted (05-FEB-1997) P. Mundel, University of Heidelberg, Department of Anatomy and Cell Biology, Im Neuenheimer Feld 307, Heidelberg, D-69120, FRG FEATURES Location/Qualifiers source 1..2058 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="I.M.A.G.E. clone 178792" /clone="I.M.A.G.E. clone 166347" /clone="I.M.A.G.E. clone 167192" /dev_stage="adult" CDS 1..2058 /note="actin-associated protein" /codon_start=1 /product="synaptopodin" /db_xref="PID:e1192263" /db_xref="PID:g2654323" /translation="MEGYSEEASLLRHLEKVASEEEEVPLVVYLKENAALLTANGLHL SQNREAQQSSPAPPPAEVHSPAADVNQNLASPSATLTTPTSNSSHNPPATDVNQNPPA TVVPQSLPLSSIQQNSSEAQLPSNGTGPASKPSTLCADGQPQAPAEEVRCSTLLIDKV STPATTTSTFSREATLIPSSRPPVSDFMSSSLLIDIQPNTLVVSADQEMSGRAAATTP TKVYSEVHFTLAKPPSVVNRTARPFGIQAPGGTSQMERSPMLERRHFGEKAPAPQPPS LPDRSPRPQRHIMSRSPMVERRMMGQRSPASERRPLGNFTAPPTYTETLSTAPLASWV RSPPSYSVLYPSSDPKSSHLKGQAVPASKTGILEESMARRGSRKSMFTFVEKPKVTPN PDLLDLVQTADEKRRQRDQGEVGVEEEPFALGAEASNFQQEPAPRDRASPAAAEEVVP EWASCLKSPRIQAKPKPKPNQNLSEASGKGAELYARRQSRMEKYVIESSGHTPELARC PSPTMSLPSSWKYPTNAPGAFRVASRSPARTPPASLYHGYLPENGVLRPEPTKQPPYQ LRPSLFVLSPIKEPAKVSPRAASPAKPSSLDLVPNLPKGALPPSPALPRPSRSSPGLY TSPGQDSLQPTAVSPPYGGDISPVSPSRAWSPRAKQAPRPSFSTRNAGIEAQVWKPSF CFK" BASE COUNT 433 a 748 c 541 g 336 t ORIGIN 1 atggaggggt actcagagga ggctagcttg ctgcggcacc tggagaaggt ggccagtgag 61 gaggaagagg taccactggt ggtttatcta aaggagaatg cagcactgct gacagccaat 121 gggctgcacc tgtcccaaaa ccgagaggcc cagcagtcct caccggcccc acctccagct 181 gaggtccaca gcccagctgc agatgtcaac caaaaccttg cctcgcccag tgccacgctc 241 accacaccaa cttctaacag cagccacaat ccgccagcca ccgatgtcaa tcagaaccca 301 ccggcaactg ttgtcccaca gagcctgcca ctttctagca tccaacagaa ttcctcagag 361 gcccaactcc catctaatgg cacagggcct gcttccaaac ccagcaccct gtgtgctgat 421 gggcaacccc aggcaccggc tgaggaggtg agatgcagca cactcctaat tgacaaggta 481 tcaactccag ctaccaccac cagcaccttc tccagagaag ctacgctcat ccccagctcc 541 aggcccccag tctcagattt catgtccagc tccctgctca ttgacatcca gcccaacacc 601 ctagtggtgt cagcagatca agagatgtct gggcgagcag ctgccaccac gcccaccaag 661 gtctacagtg aggtccactt cacactggcc aagcccccat cagtggtcaa caggacggcc 721 aggccttttg ggatccaggc gccagggggc accagccaga tggagaggag ccccatgcta 781 gagagacgac attttgggga gaaggccccg gctccccagc cccccagttt gccagacagg 841 agcccccggc cacagagaca cataatgtcc cgcagcccca tggtggaaag gaggatgatg 901 gggcagcgaa gcccggcctc agagagacgc cccttgggga acttcactgc accccccacc 961 tacactgaga ccttgtccac agcccctctg gcttcctggg tgaggtctcc tccctcatat 1021 tctgtcctgt atcccagctc cgaccccaag tcttctcatc tgaagggcca ggcggttcct 1081 gccagcaaga cgggcattct ggaggagtcg atggcccgcc ggggcagccg caaatccatg 1141 tttactttcg tggagaagcc caaggtgacc ccgaatccag acttgctgga tctggtacag 1201 acagcggatg agaagcggcg gcagagggac cagggggagg taggcgtgga ggaggagccc 1261 ttcgcactgg gggccgaggc ctccaacttc cagcaggagc cagcacctcg tgacagggcc 1321 agccccgcgg cggcggagga ggtggtacca gagtgggcct cctgcctcaa gtcaccccgc 1381 atccaggcca agccgaagcc caaacccaac cagaacctct ccgaggcctc tgggaaggga 1441 gctgagctct acgcccgccg ccagtcacgg atggagaaat atgtcatcga gtcttcaggc 1501 cacacgccag agctggcccg ctgcccatca cctaccatgt ccctgccttc ctcctggaaa 1561 taccccacta acgcccccgg ggccttccga gtggcatccc gaagcccagc ccggaccccg 1621 cctgcctccc tctaccatgg ctacctgcct gagaacgggg tcctgcgccc agagcccacc 1681 aagcagccgc cataccagct gcgcccctcg ctctttgtcc tctcacctat caaggagcct 1741 gccaaggtct caccaagagc tgcctcgccc gccaagccca gctccttgga cctggtgccc 1801 aacctgccca agggggctct ccctccatct cctgccctgc ctcggccctc gcgctcctca 1861 ccgggcctct acacctcccc cggccaggac agcctgcagc ccactgccgt gagccctcct 1921 tacggcggtg acatctcccc cgtgtctccc tccagggcgt ggtctccccg agccaagcag 1981 gcccccaggc cctccttctc tacccggaac gccgggatcg aggctcaggt gtggaagcct 2041 tccttctgct tcaagtaa // LOCUS HSSYT 2919 bp RNA PRI 09-FEB-1995 DEFINITION H.sapiens mRNA for SYT. ACCESSION X79201 NID g531105 KEYWORDS synovial sarcoma; SYT gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2919) AUTHORS Clark,J., Rocques,P.J., Crew,A.J., Gill,S., Shipley,J., Chan,A.M., Gusterson,B.A. and Cooper,C.S. TITLE Identification of novel genes, SYT and SSX, involved in the t(X;18)(p11.2;q11.2) translocation found in human synovial sarcoma JOURNAL Nature Genet. 7 (4), 502-508 (1994) MEDLINE 95038836 REFERENCE 2 (bases 1 to 2919) AUTHORS Cooper,C.S. TITLE Direct Submission JOURNAL Submitted (09-MAY-1994) C.S. Cooper, Institute of Cancer Research, 15 Cotswold Road, Belmount Sutton, Surrey SM2 5NG, UK FEATURES Location/Qualifiers source 1..2919 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="A2243" /chromosome="18" /map="18q11.2" gene 4..1179 /gene="SYT" CDS 4..1179 /gene="SYT" /note="translocated in synovial sarcoma" /codon_start=1 /db_xref="PID:g531106" /translation="MGGNMSVAFAAPRQRGKGEITPAAIQKMLDDNNHLIQCIMDSQN KGKTSECSQYQQMLHTNLVYLATIADSNQNMQSLLPAPPTQNMPMGPGGMNQSGPPPP PRSHNMPSDGMVGGGPPAPHMQNQMNGQMPGPNHMPMQGPGPNQLNMTNSSMNMPSSS HGSMGGYNHSVPSSQSMPVQNQMTMSQGQPMGNYGPRPNMSMQPNQGPMMHQQPPSQQ YNMPQGGGQHYQGQQPPMGMMGQVNQGNHMMGQRQIPPYRPPQQGPPQQYSGQEDYYG DQYSHGGQGPPEGMNQQYYPDGNSQYGQQQDAYQGPPPQQGYPPQQQQYPGQQGYPGQ QQGYGPSQGGPGPQYPNYPQGQGQQYGGYRPTQPGPPQPPQQRPYGYDQGQYGNYQQ" BASE COUNT 874 a 589 c 636 g 820 t ORIGIN 1 tggatgggcg gcaacatgtc tgtggctttc gcggccccga ggcagcgagg caagggggag 61 atcactcccg ctgcgattca gaagatgttg gatgacaata accatcttat tcagtgtata 121 atggactctc agaataaagg aaagacctca gagtgttctc agtatcagca gatgttgcac 181 acaaacttgg tataccttgc tacaatagca gattctaatc aaaatatgca gtctctttta 241 ccagcaccac ccacacagaa tatgcctatg ggtcctggag ggatgaatca gagcggccct 301 cccccacctc cacgctctca caacatgcct tcagatggaa tggtaggtgg gggtcctcct 361 gcaccgcaca tgcagaacca gatgaacggc cagatgcctg ggcctaacca tatgcctatg 421 cagggacctg gacccaatca actcaatatg acaaacagtt ccatgaatat gccttcaagt 481 agccatggat ccatgggagg ttacaaccat tctgtgccat catcacagag catgccagta 541 cagaatcaga tgacaatgag tcagggacaa ccaatgggaa actatggtcc cagaccaaat 601 atgagtatgc agccaaacca aggtccaatg atgcatcagc agcctccttc tcagcaatac 661 aatatgccac agggaggcgg acagcattac caaggacagc agccacctat gggaatgatg 721 ggtcaagtta accaaggcaa tcatatgatg ggtcagagac agattcctcc ctatagacct 781 cctcaacagg gcccaccaca gcagtactca ggccaggaag actattacgg ggaccaatac 841 agtcatggtg gacaaggtcc tccagaaggc atgaaccagc aatattaccc tgatggaaat 901 tcacagtatg gccaacagca agatgcatac cagggaccac ctccacaaca gggatatcca 961 ccccagcagc agcagtaccc agggcagcaa ggttacccag gacagcagca gggctacggt 1021 ccttcacagg gtggtccagg tcctcagtat cctaactacc cacagggaca aggtcagcag 1081 tatggaggat atagaccaac acagcctgga ccaccacagc caccccagca gaggccttat 1141 ggatatgacc agggacagta tggaaattac cagcagtgaa aaagtactta cattccagta 1201 gccagtatct attagcagcc atattgtcac ctcagcactg tggacacctc cctgtgaaga 1261 gatccttcca ttccatctag tttttggaaa aaccttgtgg ataagtggct gtttcatcag 1321 taagcagcct ttgtggttta gttataaaag gctttagtag ctcaaaaata ctcttgattt 1381 cacatttcta ctctagatgg caacattgga cagaaaatgc aatgacataa ccaatttgta 1441 atgattttgg aactgtgttt caaatggact gttacagact gaaaggtgtg aacagctttg 1501 tatgtttatg aagggtaagg gaatttaata cttttccaca gatttttttg taaggggaag 1561 agggaaatgt acacttttta cagcagcaat attttgtata ttatgtttat ttcatgtggt 1621 gaatatgcaa ggcggtacac tacgcactgg acagcatcag aaatcctctg ttaatgtgga 1681 ctggagcatg gtagatgctt gattgttttg gtctcaaaat ggtgtgctat aaagataaag 1741 gtgaggggaa gacaaagcac accatatgtc cactgttctg ttctcataga ggaaattcaa 1801 atccctttta tctattagat aatcaagggc actgtgatac agttttgagt aaaaagacat 1861 tttttaaaag ccttccagtt ttgtggatta aaccttttta taaagatcat ttataatact 1921 gttttaaaat gtgaggcaat aagaattact ttgtgttgga tctgaggagg ctttggtaaa 1981 acagtttcat ctaaatgaaa gtggtaatcc tcttctaaaa tagcaataac tgaaaatgaa 2041 agtgttaatt ttaccttgtt tgagttatca gggaacttag taagtaatat caaagcattt 2101 tataaatgat atcaaagaag agtcaacatt gatccagtca ttttattttg taatattgag 2161 ggataattgg ttattaaact gaatagttca ggagacttta caaacctttg tttcaacttt 2221 cttatctgga aataatatca tttataaagg gacactttta tgtttttccc ttttttatgt 2281 tggttgatat aacacaaaga gatatttagg aaaatgctta ttgatgaggt ttattctatc 2341 tgtttttaaa gcaccgaggt tgcattctag ataaccttgt ttattagcat ggcatatttt 2401 aatcattatt tgagactgtc ctgtgcctga ttattttagc taaattcagg gagattgcgt 2461 ggggcaggaa agcatgcatt gaaaaatttc taaccacggt tatttaagca taatctgaaa 2521 acatctagcc caaaggtaag ttgctatttt catcacagtt gcctatgccc agggaataag 2581 atgtattctt tataattgaa ttggtttttc ccacgtctaa ctggaaacaa aacagaaggg 2641 gcgtcataaa tttgaataag cagaacatac tgttctcaac atactgtaat caaaaggagg 2701 aatttcagtg ggtctctgtg tgtgtatgag agagagagtg tgtgtttgtg tgtttcaagg 2761 tcagaacagg tttttttgtt tttgtttttt gttctttgtt tttttttttg agatggagtc 2821 ttgctcttgt cgcccaggct ggagtgcagt ggcgcaatct cagctcactg caacctccgc 2881 ctcccaggtt caagcagttc tcctgcctca gcctcctga // LOCUS HST519 853 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA (AH2-519) for gene 519 from functional T-cell line. ACCESSION X05044 NID g36666 KEYWORDS gene 519; T-cell activation; unidentified reading frame. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 853) AUTHORS Jongstra,J., Schall,T.J., Dyer,B.J., Clayberger,C., Jorgensen,J., Davis,M.M. and Krensky,A.M. TITLE The isolation and sequence of a novel gene from a human functional T cell line JOURNAL J. Exp. Med. 165 (3), 601-614 (1987) MEDLINE 87139813 COMMENT Data kindly reviewed (06-AUG-1987) by Jongstra J. FEATURES Location/Qualifiers source 1..853 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T-cell" /cell_line="AH2" /clone_lib="lambda gt10 cDNA library" CDS 27..71 /note="pot. ORF (AA 1-14)" /codon_start=1 /db_xref="PID:g36667" /translation="MLLGNPAPASASAW" CDS 281..670 /note="519 gene product (AA 1-129)" /codon_start=1 /db_xref="PID:g36668" /db_xref="SWISS-PROT:P09325" /translation="MEGLVFSRLSPEYYDPARAHLRDGEKSCPCGQEGPQGDLLTKTQ ELGRDYRTCLTIVQKLKKMVDKPTQRSVSNAATRVCRTGRSRWRDVCRNFMRRYQSRV IQGLVAGETAQQICEDLRLCIPSTGPL" misc_feature 825..830 /note="pot. polyA signal" polyA_site 853 /note="polyA site" BASE COUNT 167 a 261 c 241 g 184 t ORIGIN 1 cctgggccct cctgctcctt gcagccatgc tcctgggcaa cccagcccct gcctccgcat 61 ctgcgtggtg aaggccattg gcctcatcgg tggatctgcg tttcctcggg cccacactgt 121 ctaggattgt gcggggctgg tgagagaaca agatctcttc cgtgttcaag gcagacttcc 181 tgccccctgc accctgctct ctcccgggcc ttgaggtcag tgtgagcccc aagggcaaga 241 acacttctgg aagggagagt ggatttggct gggcctctgg atggaaggtc tggtcttctc 301 tcgtctgagc cctgagtact acgacccggc aagagcccac ctgcgtgatg gggagaaatc 361 ctgcccgtgc gggcaggagg gcccccaggg tgacctgttg accaaaacac aggagctggg 421 ccgtgactac aggacctgtc tgacgatagt ccaaaaactg aagaagatgg tggataagcc 481 cacccagaga agtgtttcca atgctgcgac ccgggtgtgt aggacgggga ggtcacgatg 541 gcgcgacgtc tgcagaaatt tcatgaggag gtatcagtct agagttatcc aaggcctcgt 601 ggccggagaa actgcccagc agatctgtga ggacctcagg ttgtgtatac cttctacagg 661 tcccctctga gccctctcac cttgtcctgt ggaagaagca caggctcctg tcctcagatc 721 ccgggaacgt cagcaacctc tgccggctcc tcgcttcctc gatccagaat ccactctcca 781 gtctccctcc cctgactccc tctgctgtcc tcccctctca ggggaataaa gtgtcaagca 841 agattttagc cgc // LOCUS HSTAFII10 3089 bp RNA PRI 14-AUG-1996 DEFINITION H.sapiens mRNA for TAFII100 protein. ACCESSION X95525 NID g1491717 KEYWORDS hTAFII100; TAF2D gene; TATA binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3089) AUTHORS Dubrowskaya,V., Lavigne,A.C., Davidson,I., Acker,J., Staub,A. and Tora,L. TITLE Distinct domains of hTAFII100 are required for functional interaction with transcription factor TFIIFb (RAP30) and incorporation into the TFIID complex JOURNAL EMBO J. 15, 3702-3712 (1996) REFERENCE 2 (bases 1 to 3089) AUTHORS Tora,L. TITLE Direct Submission JOURNAL Submitted (05-FEB-1996) L. Tora, Institut de Genetique et de Biologie Moleculaire et Cellulaire, BP 163, 67404 Illkirch Cedex, C.U. de Strasbourg, FRANCE FEATURES Location/Qualifiers source 1..3089 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /clone_lib="random primed cDNA" gene 24..2423 /gene="TAF2D" CDS 24..2423 /gene="TAF2D" /note="100 kDa subunit of Pol II transcription factor" /codon_start=1 /product="hTAFII100" /db_xref="PID:e221805" /db_xref="PID:g1491718" /translation="MAALAEEQTEVAVKLEPEGPPTLLPPQAGDGAGEGSGGTTNNGP NGGGGNVAASSSTGGDGGTPKPTVAVSAAAPAGAAPVPAAAPDAGAPHDRQTLLAVLQ FLRQSKLREAEEALRREAGLLEETVAGAGAPGEVDSAGAEVTSALLSRVTASAPGPAA PDPPGTGASGATVVSGSASGPAAPGKVGSVAVEDQPDVSAVLSAYNQQGDPTMYEEYY SGLKHFIECSLDCHRAELSQLFYPLFVHMYLELVYNQHENEAKSFFEKFHGDQECYYQ DDLRVLSSLTKKEHMKGNETMLDLRTSKFVLRISRDSYQLLKRHLQEKQNNQIWNIVQ EHLYIDIFDGMPRSKQQIDAMVGSLAGEAKREANKSKVFFGLLKEPEIEVPLDDEDEE GENEEGKPKKKKPKKDSIGSKSKKQDPNAPPQNRIPLPELKDSDKLDKIMNMKETTKR VRLGSDCLPSICFYTFLNVYQGLTAVDVTDDSSLIAGGFADSTVRVWSVTPKKLRSVK QASDLSLIDKESDDVLERIMDEKTASELKILYGHSGPVYGASFSPDRNYLLSSSEDGT VRLWSLQTFTCLVGYKGHNYPVWDTQFSPYGYYFVSGGHDRVARLWATDHYQPLRIFA GHLADVNCTRFHPNSNYVATGSADRTVRLWDVLNGNCVRIFTGHKGPIHSLTFSPNGR FLATGATDGRVLLWDIGHGLMVGELKGHTDTVCSLRFSRDGEILASGSMDNTVRLWDA IKAFEDLETDDFTTATGHINLPENSQELLLGTYMTKSPVVHLHFTRRNLVLAAGAYSP Q" BASE COUNT 859 a 660 c 781 g 789 t ORIGIN 1 gcgcgaggtg gctcagccgc aagatggcgg cgctggcgga ggagcagacg gaggtggcgg 61 tcaagctaga gcctgaggga ccgccaacgc tgctacctcc gcaggcgggg gacggcgcag 121 gcgagggtag cggcggcact accaacaacg gccccaacgg cggcggcggg aacgttgcgg 181 cgtcgtcgtc cactggcggg gatggcggga cccccaagcc cacggtggct gtctccgccg 241 ctgccccggc gggggcggcc ccggtgcccg ccgctgctcc ggacgccggc gctccgcatg 301 accgacagac tctactggcc gtgctgcagt tcctacggca gagcaaactc cgcgaggccg 361 aagaggcgct gcgccgtgag gccgggctgc tggaggagac agtggcgggc gccggagccc 421 cgggagaggt ggacagcgcc ggcgctgagg tgaccagcgc gcttctcagc cgggtgaccg 481 cctcggcccc tggccctgcg gcccccgacc ctccgggcac tggcgcttcg ggggccacgg 541 tcgtctcagg ttcagcctca ggtcctgcgg ctccgggtaa agttggaagt gttgctgtgg 601 aagaccagcc agatgtcagt gccgtgttgt cagcctacaa ccaacaagga gatcccacaa 661 tgtatgaaga atactatagt ggactgaaac acttcattga atgttccctg gactgccatc 721 gggcagagtt gtcccaactt ttttatcctc tgtttgtgca catgtacttg gagctagtct 781 acaatcaaca tgagaatgaa gcaaagtcat tctttgagaa gttccatgga gatcaggaat 841 gttattacca ggatgaccta cgagtattat ctagtcttac caaaaaggaa cacatgaaag 901 ggaatgagac catgttggat ttgcgaacaa gtaaatttgt tctgcgtatt tcccgtgact 961 cgtaccaact cttgaagagg catcttcagg agaaacagaa caatcagata tggaacatag 1021 ttcaggagca cctctacatt gacatctttg atgggatgcc gcgtagtaag caacagatag 1081 atgcgatggt gggaagtttg gcaggagagg ctaaacgaga ggcaaacaaa tcaaaggtat 1141 tttttggttt attaaaagaa ccagaaattg aggtaccttt ggatgacgag gatgaagagg 1201 gagaaaatga agaaggaaaa cctaaaaaga agaagcctaa aaaagatagt attggatcca 1261 aaagcaaaaa acaagatccc aatgctccac ctcagaacag aatccctctt cctgagttga 1321 aagattcaga taagttggat aagataatga atatgaaaga aaccaccaaa cgagtgcgcc 1381 ttgggtcgga ctgcttaccc tccatttgtt tctatacatt tctcaatgtt taccagggtc 1441 tcactgcagt ggatgtcact gatgattcta gtctgattgc tggaggtttt gcagattcaa 1501 ctgtcagagt gtggtcggta acacccaaaa agcttcgtag tgtcaaacaa gcatcagatc 1561 ttagtcttat agacaaagaa tcagatgatg tcttagaaag aatcatggat gagaaaacag 1621 caagtgagtt gaagattttg tatggtcaca gtgggcctgt ctacggagcc agcttcagtc 1681 cggataggaa ctatctgctt tcctcttcag aggacggaac tgttagattg tggagccttc 1741 aaacatttac ttgtttggtg ggatataaag gacacaacta tccagtatgg gacacacaat 1801 tttctccata tggatattat tttgtgtcag ggggccatga ccgagtagct cggctctggg 1861 ctacagacca ctatcagcct ttaagaatat ttgccggcca tcttgctgat gtgaattgta 1921 ccagattcca tccaaattct aattatgttg ctacgggctc tgcagacaga actgtgcggc 1981 tctgggacgt cctgaatggt aactgtgtaa ggatcttcac tggacacaag ggaccaattc 2041 attccttgac attttctccc aatgggagat tcctggctac aggagcaaca gatggcagag 2101 tgcttctttg ggatattgga catggtttga tggttggaga attaaaaggc cacactgata 2161 cagtctgttc acttaggttt agtagagatg gtgaaatttt ggcatcaggt tcaatggata 2221 atacagttcg attatgggat gctatcaaag cctttgaaga tttagagacc gatgacttta 2281 ctacagccac tgggcatata aatttacctg agaattcaca ggagttattg ttgggaacat 2341 atatgaccaa atcaccagtt gtacaccttc attttactcg aagaaacctg gttctagctg 2401 caggagctta tagtccacaa taaaccatcg gtattaaaga ccttttggaa gctactgttt 2461 ttaaaaaggg agactaaaag caaatacctc agtgattaat atttaagcta cagagaatgt 2521 ttttgtctat atggatctgg aagtatgctg cttggaaaaa tctgaacagg acagttccac 2581 gtttctatag caaccacatt tgactaattt ccgttagttg aataagaggt attatgatca 2641 tggaggggac atttatggtg ctttggattg tgtggaaact atgcatttct gttcaaatgc 2701 tattttaatt tattacattt agaaaaaaag ttgatttcaa taattcatcc tgcttcaaga 2761 ttcaaattca gaaatatact atcatcttga attttagctg aagaatccta tagcatgtat 2821 gtttctgctg taaaaacgta gttactgtat ggcactcaaa aactatgtta aatgatccac 2881 taactttttt tttcttggcc catgattaat ggaatgtatg taactaggta gggttccttt 2941 cttagatcta gaggaagtac agccacccac tgacatctga atttatatac ctgttgagtt 3001 ttgagtgcac ccaaacactc gataaaccag gtgaagaaat ttagcttcca tgttctactt 3061 cagctaaaac agctacatcg acagcaacg // LOCUS HSTAFII13 3252 bp RNA PRI 25-JUN-1997 DEFINITION H.sapiens mRNA for TAFII135. ACCESSION Y11354 NID g2058325 KEYWORDS RNA polymerase II; TAFII135 gene; transcription factor TFIID. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3252) AUTHORS Mengus,G., May,M., Carre,L., Chambon,P. and Davidson,I. TITLE Human TAF(II)135 potentiates transcriptional activation by the AF-2s of the retinoic acid, vitamin D3, and thyroid hormone receptors in mammalian cells JOURNAL Genes Dev. 11 (11), 1381-1395 (1997) MEDLINE 97336072 REFERENCE 2 (bases 1 to 3252) AUTHORS Davidson,I. TITLE Direct Submission JOURNAL Submitted (18-FEB-1997) I. Davidson, IGBMC, 1 Rue Laurent Fries., BP163, F- 67404 Illkirch, FRANCE COMMENT Related sequences Y09321, U75308. FEATURES Location/Qualifiers source 1..3252 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Hela" gene 1..3252 /gene="TAFII135" CDS 1..3252 /gene="TAFII135" /function="potentiates ligand dependent transcriptional activation" /codon_start=1 /product="subunit of RNA polymerase II transcription factor TFIID" /db_xref="PID:e305208" /db_xref="PID:g2058326" /translation="MAAGSDLLDEVFFNSEVDEKVVSDLVGSLESQLAASAAHHHHLA PRTPEVRAAAAGALGNHVVSGSPAGAAGAGPAAPAEGAPGAAPEPPPAGRARPGGGGP QRPGPPSPRRPLVPAGPAPPAAKLRPPPEGSAGACAPVPAAAAVAAGPEPAPAGPAKP AGPAALAARAGPGPGPGPGPGPGPGKPAGPGAAQTLNGSAALLNSHHAAAPAVSLVNN GPAALLPLPKPAAPGTVIQTPPFVGAAAPPAPAAPSPPAAPAPAAPAAAPPPPPPAPA TLARPPGHPAGPPTAAPAVPPPAAAQNGGSAGAAPAPAPAAGGPAGVSGQPGPGAAAA APAPGVKAESPKRVVQAAPPAAQTLAASGPASTAASMVIGPTMQGALPSPAAVPPPAP GTPTGLPKGAAGAVTQSLSRTPTATTSGIRATLTPTVLAPRLPQPPQNPTNIQNFQLP PGMVLVRSENGQLLMIPQQALAQMQAQAHAQPQTTMAPRPATPTSAPPVQISTVQAPG TPIIARQVTPTTIIKQVSQAQTTVQPSATLQRSPGVQPQLVLGGAAQTASLGTATAVQ TGTPQRTVPGATTTSSAATETMENVKKCKNFLSTLIKLASSGKQSTETAANVKELVQN LLDGKIEAEDFTSRLYRELNSSPQPYLVPFLKRSLPALRQLTPDSAAFIQQSQQQPPP PTSQATTALTAVVLSSSVQRTAGKTAATVTSALQPPVLSLTQPTQVGVGKQGQPTPLV IQQPPKPGALIRPPQVTLTQTPMVALRQPHNRIMLTTPQQIQLNPLQPVPVVKPAVLP GTKALSAVSAQAAAAQKNKLKEPGGGSFRDDDDINDVASMAGVNLSEESARILATNSE LVGTLTRSCKDETFLLQAPLQRRILEIGKKHGITELHPDVVSYVSHATQQRLQNLVEK ISETAQQKNFSYKDDDRYEQASDVRAQLKFFEQLDQIEKQRKDEQEREILMRAAKSRS RQEDPEQLRLKQKAKEMQQQELAQMRQRDANLTALAAIGPRKKRKVDCPGPGSGAEGS GPGSVVPGSSGVGTPRQFTRQRITRVNLRDLIFCLENERETSHSLLLYKAFLK" intron 682..781 /gene="TAFII135" /note="putative" BASE COUNT 650 a 1199 c 977 g 426 t ORIGIN 1 atggcggcgg gctcggatct gctggacgag gtcttcttca acagcgaggt ggacgagaaa 61 gtggtgagcg acctggtggg ctcgctggag tcgcagctgg cggccagcgc ggcccaccac 121 caccacctcg cgccgcgcac gcccgaggtg cgggccgcgg ccgccggcgc gctcgggaac 181 catgttgtga gcggcagccc ggccggagcc gcgggcgcag ggccggccgc ccccgccgag 241 ggcgcgcccg gagcggcgcc ggagccgccc cccgcaggta gagcgcggcc ggggggcggg 301 gggccgcagc gcccgggccc cccctcaccg cgccgccccc ttgtccccgc agggcccgcg 361 ccgcccgccg cgaagctgag gccgccgccc gagggcagcg cgggggcctg cgccccggtg 421 cccgccgccg ccgccgtcgc cgcggggccc gagcccgccc ccgccggccc cgccaagccc 481 gccggccccg ccgcgctggc cgcccgcgcc ggccccggcc ccgggcccgg ccccggcccc 541 ggccccggcc ctggcaagcc cgccggcccc ggcgccgcgc aaactttgaa tgggagcgcc 601 gcgctgctga actcgcacca cgccgccgca cctgctgtca gcctggtcaa caacgggccc 661 gccgcgctgc tgccgctgcc caagcccgcc gcccccggca ctgtcatcca gacgcccccc 721 ttcgtgggcg ccgccgcgcc ccccgcgccc gccgcgccct cgccccccgc cgcccccgcg 781 cccgccgccc ccgccgccgc cccgcccccg ccaccccccg cgcccgccac cctggcccgg 841 ccgcccggcc accccgccgg acccccgacc gccgcgcccg ccgtgccgcc ccccgccgcc 901 gcccagaacg ggggcagcgc cggggcagcc cccgcccccg ccccggccgc cgggggcccc 961 gctggggtca gcggccagcc cgggcccggc gcggcggctg cggcgccggc gccgggggtc 1021 aaggccgagt cgcccaagag ggtggtgcag gcggcgcccc cggcggcgca gaccctggcg 1081 gccagcggcc cggccagcac ggcggccagc atggtcatcg ggccaactat gcaaggggcg 1141 ctgcccagcc cggccgccgt cccgccgccc gcccccggga cccccaccgg gctgcccaaa 1201 ggcgcggccg gcgcagtgac ccagagcctg tcccggacgc ccacggccac caccagcggg 1261 attcgggcca ccctgacgcc caccgtgctg gccccccgct tgccgcagcc gcctcagaac 1321 ccgaccaaca tccagaactt ccagctgccc ccaggaatgg tcctcgtccg aagtgagaat 1381 gggcagttgt taatgattcc tcagcaggcc ttggcccaga tgcaggcgca ggcccatgcc 1441 cagcctcaga ccaccatggc gcctcgccct gccaccccca caagtgcccc tcccgtccag 1501 atctccaccg tacaggcacc tggaacacct atcattgcac ggcaggtgac cccaactacc 1561 ataattaagc aagtgtctca ggcccagaca acggtgcagc ccagtgcaac cctgcagcgc 1621 tcgcccggcg tccagcctca gctcgttctg ggtggcgctg cccagacggc ttcacttggg 1681 acggcgacgg ctgttcagac ggggactcct cagcgcacgg taccaggggc gaccaccact 1741 tcctcagctg ccacggaaac tatggaaaac gtgaagaaat gtaaaaattt cctatctacg 1801 ttaataaaac tggcttcatc tggcaagcag tctacagaga cagcagctaa tgtgaaagag 1861 ctcgtgcaga atttactgga tggaaaaata gaagcagaag atttcacaag caggttatac 1921 cgagaactta attcttcacc tcaaccttac cttgtgcctt tcctgaagag gagcttaccc 1981 gccttgagac agctgacccc cgactccgcg gccttcatcc agcagagcca gcagcagccg 2041 ccaccgccca cctcgcaggc caccactgcg ctcacggccg tggtgctgag tagctcggtc 2101 cagcgcacgg ccgggaagac ggcggccacc gtgaccagtg ccctccagcc ccctgtgctc 2161 agcctcacgc agcccacgca ggtcggcgtc ggcaagcagg ggcaacccac accgctggtc 2221 atccagcagc ctccgaagcc aggagccctg atccggcccc cgcaggtgac gttgacgcag 2281 acacccatgg tcgccctgcg gcagcctcac aaccggatca tgctcaccac gcctcagcag 2341 atccagctga acccactgca gccagtccct gtggtgaaac ccgccgtgtt acctggaacc 2401 aaagcccttt ctgctgtctc ggcacaagca gctgctgcac agaaaaataa actcaaggag 2461 cctgggggag gttcgtttcg ggacgatgat gacattaatg atgttgcatc gatggctgga 2521 gtaaacttgt cagaagaaag tgcaagaata ttagccacga actctgaatt ggtgggcacg 2581 ctaacgcggt cctgtaaaga tgaaaccttc ctcctccaag cgcctttgca gagaagaata 2641 ttagaaatag gtaaaaaaca tggtataacg gaattacatc cagatgtagt aagttatgta 2701 tcacatgcca cgcaacaaag gctacagaat cttgtagaga aaatatcaga aacagctcag 2761 cagaagaact tttcttacaa ggatgacgac agatatgagc aggcgagtga cgtccgggca 2821 cagctcaagt tttttgaaca gcttgatcaa atcgaaaagc agaggaagga tgagcaggag 2881 cgggagatcc tgatgagggc agcaaagtct cggtcaagac aagaagatcc agaacagtta 2941 aggctgaaac agaaggcaaa ggagatgcag caacaggaac tggcacaaat gagacagcgg 3001 gacgccaacc tcacagcact agcagcgatc gggcccagga aaaagaggaa agtggactgt 3061 ccggggccgg gctcaggagc agaggggtcg ggccccggct cagtggtccc aggcagctcg 3121 ggtgtcggaa cccccagaca gttcacgcga caaagaatca cgcgggtcaa cctcagggac 3181 ctcatatttt gtttagaaaa tgaacgtgag acaagccatt cactgctgct ctacaaagca 3241 ttccttaagt ga // LOCUS HSTAFII18 413 bp RNA PRI 27-APR-1995 DEFINITION H.sapiens TAFII18 mRNA for transcription factor TFIID. ACCESSION X84003 NID g791052 KEYWORDS TAFII18 gene; transcription factor; transcription factor TFIID. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 413) AUTHORS Mengus,G., May,M., Jacq,X., Staub,A., Tora,L., Chambon,P. and Davidson,I. TITLE Cloning and characterization of hTAFII18, hTAFII20 and hTAFII28: three subunits of the human transcription factor TFIID JOURNAL EMBO J. 14 (7), 1520-1531 (1995) MEDLINE 95246745 REFERENCE 2 (bases 1 to 413) AUTHORS Davidson,I.B. TITLE Direct Submission JOURNAL Submitted (17-JAN-1995) I.B. Davidson, IGBMC, CNRS/Inserm/ulp, BP, 163-67404 Illkirch cedex, FRANCE FEATURES Location/Qualifiers source 1..413 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Hela" gene 12..386 /gene="TAFII18" CDS 12..386 /gene="TAFII18" /codon_start=1 /product="PolII transcription factor TFIID" /db_xref="PID:g791053" /translation="MADEEEDPTFEEENEEIGGGAEGGQGKRKRLFSKELRCMMYGFG DDQNPYTESVDILEDLVIEFITEMTHKAMSIGRQGRVQVEDIVFLIRKDPRKFARVKD LLTMNEELKRARKAFDEANYGS" BASE COUNT 139 a 55 c 109 g 110 t ORIGIN 1 gtgctagtgg gatggcagat gaggaagaag accccacgtt tgaggaagaa aatgaagaaa 61 ttggaggagg tgcagaaggt ggacagggta aaagaaagag acttttttct aaagaattgc 121 gatgtatgat gtatggcttt ggggatgacc agaatcctta tactgagtca gtggatattc 181 ttgaagatct tgtcatagag tttatcactg aaatgactca caaggcaatg tcaattggaa 241 gacaaggtcg agtacaagtt gaagatatcg tcttcttgat tcgaaaggac ccaaggaagt 301 ttgccagggt taaagacttg cttactatga atgaagaatt gaaacgagct agaaaagcat 361 ttgatgaagc aaattatgga tcttgacact ttttgtagtt tccgaaaatt acc // LOCUS HSTAFII20 887 bp RNA PRI 27-APR-1995 DEFINITION H.sapiens TAFII20 mRNA for transcription factor TFIID. ACCESSION X84002 NID g791054 KEYWORDS TAFII20 gene; transcription factor; transcription factor TFIID. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 887) AUTHORS Mengus,G., May,M., Jacq,X., Staub,A., Tora,L., Chambon,P. and Davidson,I. TITLE Cloning and characterization of hTAFII18, hTAFII20 and hTAFII28: three subunits of the human transcription factor TFIID JOURNAL EMBO J. 14 (7), 1520-1531 (1995) MEDLINE 95246745 REFERENCE 2 (bases 1 to 887) AUTHORS Davidson,I.B. TITLE Direct Submission JOURNAL Submitted (17-JAN-1995) I.B. Davidson, IGBMC, CNRS/Inserm/ulp, BP, 163-67404 Illkirch cedex, FRANCE FEATURES Location/Qualifiers source 1..887 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Hela" gene 167..652 /gene="TAFII20" CDS 167..652 /gene="TAFII20" /codon_start=1 /product="PolII transcription factor TFTIID" /db_xref="PID:g791055" /translation="MNQFGPSALINLSNFSSIKPEPASTPPQGSMANSTAVVKIPGTP GAGGRLSPENNQVLTKKKLQDLVREVDPNEQLDEDVEEMLLQIADDFIESVVTAACQL ARHRKSSTLEVKDVQLHLERQWNMWIPGFGSEEIRPYKKACTTEAHKQRMALIRKTTK K" BASE COUNT 260 a 204 c 220 g 203 t ORIGIN 1 ccgcgcagtc ggaccggctg ctgagacgaa cgcttcactg gggcagtctc tgcatatcat 61 ggggagatag acgctgctgc ctttaattgg ccttggtcct cacagctcca aaaagaaaca 121 ggatctcgat aagctctatg agctgaagtc caaagctcgg cagattatga accagtttgg 181 cccctcagcc ctaatcaacc tctccaattt ctcatccata aaaccggaac cagccagcac 241 ccctccacaa ggctccatgg ccaatagtac tgcagtggta aagataccag gcactcctgg 301 ggcaggaggt cgtcttagcc ctgaaaacaa tcaggtattg accaagaaga aattacagga 361 cttagtaaga gaagtggatc ctaatgagca gttggatgaa gatgtggagg agatgctgct 421 gcagattgct gatgatttta tcgagagtgt ggtgacagca gcctgtcagc ttgcgcggca 481 tcgcaagtct agcaccctgg aggtgaaaga tgtccagctg catttagagc gccagtggaa 541 catgtggatc ccaggatttg gttctgaaga aatccgaccc tacaaaaaag cttgcaccac 601 agaagctcac aaacagagaa tggcattgat ccggaaaaca accaagaaat aacacacgga 661 aaggtcaggg aatggacagc aatgtatttg gagatacttg agctgagaac tcagccatct 721 catccttgga tttttttttt taatgcttta cagagaagca tatatttttt attaacagtg 781 cagggaattc cggaattccg gaattccggc atatgaagtg ctgtcggatg ctaagaaaca 841 gggaattccg gaattcgata tcaagcttat cgataccgtc gacctcg // LOCUS HSTAFII28 925 bp RNA PRI 23-AUG-1995 DEFINITION H.sapiens mRNA for transcription factor TFIID subunit TAFII28. ACCESSION X83928 NID g791056 KEYWORDS transcription factor IID; transcription factor TFIID. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 925) AUTHORS Davidson,I.B. TITLE Direct Submission JOURNAL Submitted (16-JAN-1995) I.B. Davidson, IGBMC, IGBMC. CNRS/INSERM/ULP., BP163, F- 67404 Illkirch, FRANCE REFERENCE 2 (bases 1 to 925) AUTHORS Mengus,G., May,M., Jacq,X., Staub,A., Tora,L., Chambon,P. and Davidson,I. TITLE Cloning and characterization of hTAFII18, hTAFII20 and hTAFII28: three subunits of the human transcription factor TFIID JOURNAL EMBO J. 14 (7), 1520-1531 (1995) MEDLINE 95246745 FEATURES Location/Qualifiers source 1..925 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 93..728 /codon_start=1 /product="transcription factor TFIID subunit TAFII28" /db_xref="PID:g791057" /translation="MDDAHESPSDKGGETGESDETAAVPGDPGATDTDGIPEETDGDA DVDLKEAAAEEGELESQDVSDLTTVEREDSSLLNPAAKKLKIDTKEKKEKKQKVDEDE IQKMQILVSSFSEEQLNRYEMYRRSAFPKAAIKRLIQSITGTSVSQNVVIAMSGISKV FVGEVVEEALDVCEKWGEMPPLQPKHMREAVRRLKSKGQIPNSKHKKIIFF" BASE COUNT 270 a 205 c 231 g 219 t ORIGIN 1 gaattccaag atcctggcct gtgcagctcg ggtttccgag cttctgcctc aggcatctcc 61 gcgatctcct ctcccctcca atcctatccg tgatggacga tgcccacgag tcgccctccg 121 acaaaggtgg agagacaggg gagtcggatg agacggccgc tgtgcccggg gacccggggg 181 ctaccgacac cgatggaatc ccagaggaaa ctgacggaga cgcagatgtg gacttgaaag 241 aagctgcagc ggaggaaggc gagctcgaga gtcaggatgt ctcagattta acaacagttg 301 aaagggaaga ctcatcatta cttaatcctg cagccaaaaa actgaaaata gataccaaag 361 aaaagaaaga gaaaaagcag aaagtagatg aagatgagat tcagaagatg caaatcctgg 421 tttcttcttt ttctgaggag cagctgaacc gttatgaaat gtatcgccgc tcagctttcc 481 ctaaggcagc catcaaaagg ctgatccagt ccatcactgg cacctctgtg tctcagaatg 541 ttgttattgc tatgtctggt atttccaagg ttttcgtcgg ggaggtggta gaagaagcac 601 tggatgtgtg tgagaagtgg ggagaaatgc caccactaca acccaaacat atgagggaag 661 ccgttagaag gttaaagtca aaaggacaga tccctaactc gaagcacaaa aaaatcatct 721 tcttctagac caaagtctag aaaggcctat gttactgacg gaagaagtat tggttccaga 781 cttcctataa gactgtctgc attggtgctt tagtatctca ggcctccaag gattccatga 841 tgattttaat gtctttctca aaactctgat atttgtcaca cctagaaagt atgtagcctg 901 attgatactt ggcttgacta aattt // LOCUS HSTAFII55 2130 bp RNA PRI 17-FEB-1997 DEFINITION H.sapiens mRNA for transcription factor IID, subunit TAFII55. ACCESSION X97999 NID g1332521 KEYWORDS hTAFII55; transcription factor IID. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2130) AUTHORS Lavigne,A.C., Mengus,G., May,M., Dubrovskaya,V., Tora,L., Chambon,P. and Davidson,I. TITLE Multiple interactions between hTAFII55 and other TFIID subunits. Requirements for the formation of stable ternary complexes between hTAFII55 and the TATA-binding protein JOURNAL J. Biol. Chem. 271 (33), 19774-19780 (1996) MEDLINE 96355274 REFERENCE 2 (bases 1 to 2130) AUTHORS Davidson,I.B. TITLE Direct Submission JOURNAL Submitted (21-MAY-1996) I.B. Davidson, IGBMC, CNRS/INSERM/ULP., 1 Rue Laurent, B.P 163, F- 67404 Illkirch, FRANCE REFERENCE 3 (bases 1 to 2130) AUTHORS Chiang,C.M. and Roeder,R.G. TITLE Cloning of an intrinsic human TFIID subunit that interacts with multiple transcriptional activators JOURNAL Science 267 (5197), 531-536 (1995) MEDLINE 95125466 COMMENT Related sequence: U18062. FEATURES Location/Qualifiers source 1..2130 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" 5'UTR 9..723 conflict 108 /citation=[3] /replace="a" variation 178 /note="polymorphism" /replace="t" conflict 190 /citation=[3] /replace="t" gene 724..1773 /gene="hTAFII55" CDS 724..1773 /gene="hTAFII55" /note="subunit TAFII55" /codon_start=1 /product="transcription factor IID" /db_xref="PID:e244886" /db_xref="PID:g1332522" /translation="MSKSKDDAPHELESQFILRLPPEYASTVRRAVQSGHVNLKDRLT IELHPDGRHGIVRVDRVPLASKLVDLPCVMESLKTIDKKTFYKTADICQMLVSTVDGD LYPPVEEPVASTDPKASKKKDKDKEKKFIWNHGITLPLKNVRKRRFRKTAKKKYIESP DVEKEVKRLLSTDAEAVSTRWEIIAEDETKEAENQGLDISSPGMSGHRQGHDSLEHDE LREIFNDLSSSSEDEDETQHQDEEDINIIDTEEDLERQLQDKLNESDEQHQENEGTNQ LVMGIQKQIDNMKGKLQETQDRAKRQEDLIMKVENLALKNRFQAVLDELKQKEDREKE QLSSLQEELESLLEK" conflict 1257 /gene="hTAFII55" /citation=[3] /replace="g" conflict 1278 /gene="hTAFII55" /citation=[3] /replace="c" conflict 1572 /gene="hTAFII55" /citation=[3] /replace="t" 3'UTR 1774..2130 conflict 1975 /citation=[3] /replace="c" BASE COUNT 637 a 416 c 527 g 550 t ORIGIN 1 ggaattcccc cggacggagg gccggcgagg tgcggggtct ggtgatgcga gctgcgcctc 61 tcggcaagat ttcgcgctgc ccatcccggg ccctttcatc agtaatcggt agtggatcac 121 tctgccaagc ggcaggaaga attaaggaaa cgacaaggag acgctcggct ctctcccgct 181 tggctccttg cggcctcctc ttcccttcgc tccggcccgg tgaaactgaa cttataatcg 241 tcactggatt gtaagtaccc gaggcgaaga gagctcgctg agccctgatt ttttgagtgt 301 ctttgttccg ggagagtttg tgagttgaaa gtatctctgc tgggctttct gggccgaaaa 361 ccgttccggg ggagccgcca tttgctttcc tgttccctag ctagctagct agctctctcc 421 gcgttgtccg gcagcggcac ctagaggttg ggacttggca ttgcatctga tttaatgaac 481 ttaagtctgt gaataagcct ttgtgttaac gactggtatt cggtcacagc atatttagag 541 aaaagacttg gagcttaaat aaaaactaag gcaaaataga cgcttagctg ctgatctaca 601 gagaacttct tgtaattaaa agatttcaat tcatagcaaa ctggtgtttt aaactattgc 661 agtagctgga actttttagt gtaaccagca tttattggag aagtgaatca caaggaaata 721 aagatgagta aaagcaaaga tgatgctcct cacgaactgg agagccagtt tatcttacgt 781 ctgcctccag aatatgcctc tactgtgaga agggcagtac agtctggtca tgtcaacctc 841 aaggacagac tgacaattga gttacatcct gatgggcgtc atggaatcgt cagagtggac 901 cgtgttccat tggcctcaaa attagtagac ctgccctgtg ttatggaaag cttgaaaacc 961 attgataaaa aaacttttta caagacagct gatatctgtc agatgcttgt atccacagtt 1021 gatggtgatc tctatcctcc tgtggaggag ccagttgcta gcactgatcc taaagcaagc 1081 aagaaaaagg ataaggacaa agagaaaaag tttatctgga accacggaat tactctgcct 1141 ctaaagaatg tcaggaagag aaggttccgg aagacagcaa agaagaaata tattgaatct 1201 ccagatgttg aaaaagaagt gaaacgattg ctgagtacag atgctgaagc tgttagtact 1261 cggtgggaaa taattgcgga agatgaaaca aaggaggcag aaaatcaagg cctggatatc 1321 tcttctccag gaatgtctgg tcacaggcag ggccatgact cattagaaca tgatgagctt 1381 cgggagatat tcaatgacct cagcagcagc agtgaggatg aagatgagac ccagcatcaa 1441 gatgaagaag atataaacat cattgacacg gaggaagatc tggagagaca gctacaggac 1501 aagctaaatg aatcagatga acagcaccag gaaaatgaag gaaccaatca gctggttatg 1561 ggaattcaga agcagattga caacatgaaa ggcaagctcc aagagaccca ggacagggca 1621 aaacgacaag aggatctcat catgaaagtg gaaaatctgg ctctcaagaa cagatttcag 1681 gctgtactgg atgagctcaa acaaaaggaa gaccgagaaa aggagcaact cagctctttg 1741 caagaggagc tagaatcact cctagagaag taaaaagaac tgatatttaa tttcagtctt 1801 cagactggtc agcattagaa aattcttggc tttattgtac tgggtattaa gaccttgctc 1861 ttcctagtcc ttttaatgct gtgtgttctg ttaagttctt tcatttgttt gtaattttgt 1921 ttttcagcaa atttatattg ttttgctagg tgttcatcct ataagaagca ggattgtata 1981 ggcagaaaaa tgattgtagg aaagttgcag gattagcgga atgtatggtt caaccttaat 2041 tatagcttca ttgcaggact ttactgtttc tccattttct agaagctgct gttgctgctt 2101 tgtgatgacg tgagatcaat aagaagaacc // LOCUS HSTAG1 4479 bp RNA PRI 30-JUN-1993 DEFINITION H.sapiens mRNA for TAG-1/axonin-1. ACCESSION X68274 S54192 NID g36674 KEYWORDS cell adhesion molecule; glycosyl-phosphatidylinositol-anchor; immunoglobulin superfamily; neurite outgrowth-promoting protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4479) AUTHORS Hasler,T. TITLE Direct Submission JOURNAL Submitted (22-SEP-1992) T. Hasler, University of Zuerich, Inst. of Biochemistry, Winterthurerstrasse 190, 8057 Zuerich, SWITZERLAND REFERENCE 2 (bases 1 to 4479) AUTHORS Hasler,T.H., Rader,C., Stoeckli,E.T., Zuellig,R.A. and Sonderegger,P. TITLE cDNA cloning, structural features, and eucaryotic expression of human TAG-1/axonin-1 JOURNAL Eur. J. Biochem. 211 (1-2), 329-339 (1993) MEDLINE 93145965 COMMENT Related sequences: Adams, M. D., Nature, 355: 632-634 (92). FEATURES Location/Qualifiers source 1..4479 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="embryonic" /tissue_type="brain" /clone_lib="Stratagene #936206" gene 176..3298 /gene="TAG-1/axonin-1" CDS 176..3298 /gene="TAG-1/axonin-1" /codon_start=1 /product="TAG-1/axonin-1" /db_xref="PID:g36675" /db_xref="SWISS-PROT:Q02246" /translation="MGTATRRKPHLLLVAAVALVSSSAWSSALGSQTTFGPVFEDQPL SVLFPEESTEEQVLLACRARASPPATYRWKMNGTEMKLEPGSRHQLVGGNLVIMNPTK AQDAGVYQCLASNPVGTVVSREAILRFGFLQEFSKEERDPVKAHEGWGVMLPCNPPAH YPGLSYRWLLNEFPNFIPTDGRHFVSQTTGNLYIARTNASDLGNYSCLATSHMDFSTK SVFSKFAQLNLAAEDTRLFAPSIKARFPAETYALVGQQVTLECFAFGNPVPRIKWRKV DGSLSPQWTTAEPTLQIPSVSFEDEGTYECEAENSKGRDTVQGRIIVQAQPEWLKVIS DTEADIGSNLRWGCAAAGKPRPTVRWLRNGEPLASQNRVEVLAGDLRFSKLSLEDSGM YQCVAENKHGTIYASAELAVQALAPDFRLNPVRRLIPAARGGEILIPCQPRAAPKAVV LWSKGTEILVNSSRVTVTPDGTLIIRNISRSDEGKYTCFAENFMGKANSTGILSVRDA TKITLAPSSADINLGDNLTLQCHASHDPTMDLTFTWTLDDFPIDFDKPGGHYRRTNVK ETIGDLTILNAQLRHGGKYTCMAQTVVDSASKEATVLVRGPPGPPGGVVVRDIGDTTI QLSWSRGFDNHSPIAKYTLQARTPPAGKWKQVRTNPANIEGNAETAQVLGLTPWMDYE FRVIASNILGTGEPSGPSSKIRTREAAPSVAPSGLSGGGGAPGELIVNWTPMSREYQN GDGFGYLLSFRRQGSTHWQTARVPGADAQYFVYSNESVRPYTPFEVKIRSYNRRGDGP ESLTALVYSAEEEPRVAPTKVWAKGVSSSEMNVTWEPVQQDMNGILLGYEIRYWKAGD KEAAADRVRTAGLDTSARVSGLHPNTKYHVTVRAYNRAGTGPASPSANATTMKPPPRR PPGNISWTFSSSSLSIKWDPVVPFRNESAVTGYKMLYQNDLHLTPTLHLTGKNWIEIP VPEDIGHALVQIRTTGPGGDGIPAEVHIVRNGGTSMMVENMAVRPAPHPGTVISHSVA MLILIGSLEL" sig_peptide 176..259 /gene="TAG-1/axonin-1" mat_peptide 260..3296 /gene="TAG-1/axonin-1" /product="TAG-1/axonin-1" polyA_signal 4467..4472 BASE COUNT 964 a 1428 c 1273 g 814 t ORIGIN 1 ccagacaggg gctggcggcc cggccggccc cggctcaccg actcgggcag catccacctg 61 ccccagccaa cacccttctc tcgccccaga tcctttctca gcctccagct gggctgtccc 121 caagctgagc tgaggctctt ctcctccgat ccccacctct gcccggacat ccaccatggg 181 gacagccacc aggaggaagc cacacctgct gctggtagct gctgtggccc ttgtctcctc 241 ttcagcttgg agttcagccc tgggatccca aaccaccttc gggcctgtct ttgaagacca 301 gcccctcagt gtgctattcc cagaggagtc cacggaggag caggtgttgc tggcatgccg 361 cgcccgggcc agccctccag ccacctatcg gtggaagatg aatggtaccg agatgaagct 421 ggagccaggt tcccgtcacc agctggtggg gggcaacctg gtcatcatga accccaccaa 481 ggcacaggat gccggggtct accagtgcct ggcctccaac ccagtgggca ccgttgtcag 541 cagggaggcc atcctccgct tcggctttct gcaggaattc tccaaggagg agcgagaccc 601 agtgaaagct catgaaggct ggggggtgat gttgccctgt aacccacctg cccactaccc 661 aggcttgtcc taccgctggc tcctcaacga gttccccaac ttcatcccga cggacgggcg 721 tcacttcgtg tcccagacca cagggaacct gtacattgcc cgaaccaatg cctcagacct 781 gggcaactac tcctgtttgg ccaccagcca catggacttc tccaccaaga gcgtcttcag 841 caagtttgct cagctcaacc tggctgctga agatacccgg ctctttgcac ccagcatcaa 901 ggcccggttc ccagcagaga cctatgcact ggtggggcag caggtcaccc tggagtgctt 961 cgcctttggg aaccctgtcc cccggatcaa gtggcgcaaa gtggacggct ccctgtcccc 1021 gcagtggacc acagctgagc ccaccctgca gatccccagc gtcagctttg aggatgaggg 1081 cacctacgag tgtgaggcgg agaactccaa gggccgagac accgtgcagg gccgcatcat 1141 cgtgcaggct cagcctgagt ggctaaaagt gatctcggac acagaggctg acattggctc 1201 caacctgcgt tggggctgtg cagccgccgg caagccccgg cctacagtgc gctggctgcg 1261 gaacggggag cctctggcct cccagaaccg ggtggaggtg ttggctgggg acctgcggtt 1321 ctccaagctg agcctggaag actcgggcat gtaccagtgt gtggcagaga ataagcacgg 1381 taccatctac gccagcgccg agctagccgt gcaagcactc gcccctgact tcaggctgaa 1441 tcccgtgagg cgtctgatcc ccgcggcccg cgggggagag atccttatcc cctgccagcc 1501 ccgggcagct ccaaaggccg tggtgctctg gagcaaaggc acggagattt tggtcaacag 1561 cagcagagtg actgtaactc cagatggcac cttgatcata agaaacatca gccggtcaga 1621 tgaaggcaaa tacacctgct ttgctgagaa cttcatgggc aaagccaaca gcactggaat 1681 cctatctgtg cgagatgcaa ccaaaatcac tctagccccc tcaagtgccg acatcaactt 1741 gggtgacaac ctgaccctac agtgccatgc ctcccacgac cccaccatgg acctcacctt 1801 cacctggacc ctggacgact tccccatcga ctttgataag cctggagggc actaccggag 1861 aactaatgtg aaggagacca ttggggatct gaccatcctg aacgcccagc tgcgccatgg 1921 ggggaagtac acgtgcatgg cccagacggt ggtggacagc gcgtccaagg aggccacagt 1981 cctggtccga ggtccgccag gtcccccagg aggtgtggtg gtgagggaca ttggcgacac 2041 caccatccag ctcagctgga gccgtggctt cgacaaccac agccccatcg ctaagtacac 2101 cctgcaagct cgcactccac ctgcagggaa gtggaagcag gttcggacca atcctgcaaa 2161 catcgagggc aatgccgaga ctgcacaggt gctgggcctc accccctgga tggactatga 2221 gttccgggtc atagccagca acattctggg cactggggag cctagtgggc cctccagcaa 2281 aatccggacc agggaagcag ccccctcggt ggcaccctca ggactcagcg gaggaggtgg 2341 agcccccgga gagctcatcg tcaactggac gcccatgtca cgggagtacc agaacggaga 2401 cggcttcggc tacctgctgt ccttccgcag gcagggcagc actcactggc agaccgcccg 2461 ggtgcctggc gccgatgccc agtactttgt ctacagcaac gagagcgtcc ggccctacac 2521 gccctttgag gtcaagatcc gcagctacaa ccgccgcggg gatgggcccg agagcctcac 2581 tgcactcgtg tactcagctg aggaagagcc cagggtggcc cctaccaagg tgtgggccaa 2641 aggggtctca tcctcagaga tgaacgtgac ctgggaaccc gtgcagcagg acatgaatgg 2701 tatcctcctg gggtatgaga tccgctactg gaaagctggg gacaaagaag cagctgcgga 2761 ccgagtgagg acagcagggc tggacaccag tgcccgagtc agtggcctgc atcccaacac 2821 caagtaccat gtgaccgtga gggcctacaa ccgggctggc actgggcctg ccagcccttc 2881 tgccaacgcc acgaccatga agccccctcc gcggcgacct cctggcaaca tctcctggac 2941 tttctcaagc tctagtctta gcattaagtg ggaccctgtg gtccctttcc gaaatgagtc 3001 tgcagtcacc ggctataaga tgctgtacca gaatgactta cacctgactc ccacgctcca 3061 cctcaccggc aagaactgga tagaaatccc agtgcctgaa gacattggcc atgccctggt 3121 acaaattcgg accacagggc ccggagggga tgggatccct gcagaagtcc acatcgtgag 3181 gaatggaggc acaagcatga tggtggagaa catggcagtc cgcccagcac cacaccctgg 3241 caccgtcatt tcccactccg tggcgatgct gatcctcata ggctccctgg agctctgatc 3301 ctggaacccc tccctctgcg ccgcagctgg acgccacctc cgacggacac agccagcccc 3361 ttcctgctgc caaggtggcc tgacactgtg ccagagagtg gctggtttta aatacctact 3421 ttaaacagtg ccctttttgt aggaggtagg atattttata ttctgccgca ggatagaacc 3481 cacgcaagga ttttctttaa attgagaggc accaggcagt aacttccatg atgacactga 3541 cgcctatacc tgagctctag gctgcctgga gggaaggaac aggcccatgg gaagaagggg 3601 gttttaaaaa catgtcttca actcagcaga gatggccctc tgggacccta tacggactcc 3661 gccacttgag agcagtccta ggcccggcag gaacaccaga catgaacagg ttgaagaact 3721 ggagcgaagt gcacacctca ccatccttca gtctaaggaa gaagggcaag ccctgggacc 3781 aagagctctc ccgccttctc cctcgagcag cagcaaggac cctgacgctg tccccgataa 3841 ctccctaggg gctcctgcct gcccaagcgg ctgagaacca gcgccccgat gcctgaggct 3901 gggagcctga gccccttcag ctttgagggg ggtgatactc caggctgttt ggggtgggag 3961 ccaaaaagag ttgagaggcc agggcccttg gtggaaaggg gcaccagcct tggtctgaga 4021 tagtcacaac ccaggtgacg atgccctctc agccaacact gccaacctga ccctgtcatc 4081 ccgattgaca gcgccacttc aggtggctgg gtgactaaag ggcttgtctt ggtggggtct 4141 cccacccctc caagacccat tctgcacagt ccctccaggg tttgggcagg agatggccaa 4201 tcatgcgccc acctctccag tgctgcctgc agtcagctcg gcctccccga cctgcagccc 4261 cagactctgc tctcccagca ctgactcact cctgcctggg aggggaatgc agcattcatg 4321 ctgtgtgtcc tggtattggg aggtttctgg gaagggcaga ggataaatgt ggccctgcct 4381 gctcccaggt atacctagga ccacctggcc agatccgctc ccagacggcc ttggactgct 4441 tgcatttccc cggagaaaaa ggggttaata aatgggcca // LOCUS HSTAILLE 1593 bp RNA PRI 01-AUG-1997 DEFINITION Homo sapiens mRNA for tailless gene homologue. ACCESSION Y13276 NID g2292901 KEYWORDS tailless gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1593) AUTHORS Jackson,A., Panayiotidis,P. and Foroni,L. TITLE The human homologue of the Drosophila tailless gene: characterisation and mapping to a region of common deletion in human lymphoid leukaemia on chromosome 6q21 JOURNAL Unpublished REFERENCE 2 (bases 1 to 1593) AUTHORS Jackson,A. TITLE Direct Submission JOURNAL Submitted (19-MAY-1997) A. Jackson, Royal Free Hospital School of Medicine, Academic Haematology, Royal Free Hospital, Pond Street, London, NW3 2QG, UK FEATURES Location/Qualifiers source 1..1593 /organism="Homo sapiens" /db_xref="taxon:9606" /map="q21" /chromosome="6" /dev_stage="embryo" /tissue_type="brain" /lab_host="E.coli" gene 169..1326 /gene="tailless" CDS 169..1326 /gene="tailless" /codon_start=1 /product="Tailless protein" /db_xref="PID:e332319" /db_xref="PID:g2292902" /translation="MSKPAGSTSRILDIPCKVCGDRSSGKHYGVYACDGCSGFFKRSI RRNRTYVCKSGNQGGCPVDKTHRNQCRACRLKKCLEVNMNKDAVQHERGPRTSTIRKQ VALYFRGHKEENGAAAHFPSAALPAPAFFTAVTQLEPHGLELAAVSTTPERQTLVSLA QPTPKYPHEVNGTPMYLYEVATESVCESAARLLFMSIKWAKSVPAFSTLSLQDQLMLL EDAWRELFVLGIAQWAIPVDANTLLAVSGMNGDNTDSQKLNKIISEIQALQEVVARFR QLRLDATEFACLKCIVTFKAVPTHSGSELRSFRNAAAIAALQDEAQLTLNSYIHTRYP TQPCRFGKLLLLLPALRSISPSTIEEVFFKKTIGNVPITRLLSDMYKSSDI" BASE COUNT 396 a 450 c 435 g 312 t ORIGIN 1 tgagcgccca gggagcagcg cagcgcgcga ctgacaccca cctgtcccgc ccaggagcct 61 tgcaggctgg agggcggctg gagagcggcg gcgcccggcg gcgaggcggg cgctgccggc 121 cgggactcgg gcagcgccca ccaaccgctc cgccccggga cagccagcat gagcaagcca 181 gccggatcaa caagccgcat tttagatatc ccctgcaaag tgtgtggcga ccgcagctcg 241 gggaagcact acggggtcta cgcctgcgac ggctgctcag gttttttcaa acggagcatc 301 cgaaggaata ggacctatgt ctgcaaatct ggaaaccagg gaggctgtcc ggtggacaag 361 acgcacagaa accagtgcag ggcgtgtcgg ctgaagaagt gtttggaagt caacatgaac 421 aaagacgccg tgcagcacga gcgggggcct cggacgtcca ccatccgcaa gcaagtggcc 481 ctctacttcc gtggacacaa ggaggagaac ggggccgccg cgcactttcc ctcggcggcg 541 ctccctgcgc cggccttctt caccgcggtc acgcagctgg agccgcacgg cctggagctg 601 gccgcggtgt ccaccactcc agagcggcag accctcgtga gcctggctca gcccacgccc 661 aagtaccccc atgaagtgaa tgggacccca atgtatctct atgaagtggc cacggagtcg 721 gtgtgtgaat cagctgccag acttctcttc atgagcatca agtgggctaa gagtgtgcca 781 gccttctcca cgctgtcttt gcaagaccag ctgatgcttt tggaagatgc ttggagagaa 841 ctgtttgttc taggaatagc acaatgggcc attccggttg atgctaacac tctactggct 901 gtatctggca tgaacggtga caacacagat tcccagaagc tgaacaagat catatctgaa 961 atacaggctt tacaagaggt ggtggctcga tttagacaac tccggttaga tgctactgaa 1021 tttgcctgtc taaaatgcat cgtcactttc aaagccgttc ctacacatag tggttctgaa 1081 ctgagaagtt tccggaatgc tgccgccatt gcagcccttc aagatgaggc tcagctaacg 1141 ctcaacagct acatccatac cagatatccc actcaaccct gtcgctttgg aaaactcctg 1201 ttgcttttgc cagctttacg ttctattagc ccatcaacta tagaagaagt gtttttcaaa 1261 aaaaccatcg gcaatgtgcc aattacaaga ctgctttcag atatgtacaa atccagtgat 1321 atctaagctc acaagatacc cacttttcag gatgggacag tatcagatca acttcaaccc 1381 atggagaaca agcctcaact aacaaaccct tcaggaagca tataccgggg aatgtgtagc 1441 cttcaggaaa aaaatgccaa ttgacacaaa gcattccagt agctatgacc tgccgccctg 1501 accaggatag ggcgggtggg aaggagaggg gtgcaacagg accgcctgca ctgaaaactc 1561 actgctgcca tgccctggga gggggcaaac tgg // LOCUS HSTAMMD 2533 bp RNA PRI 20-MAR-1996 DEFINITION H.sapiens mRNA for transcript associated with monocyte to macrophage differentiation. ACCESSION X85750 NID g1006664 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2533) AUTHORS Rehli,M., Krause,S.W., Schwarzfischer,L., Kreutz,M. and Andreesen,R. TITLE Molecular cloning of a novel macrophage maturation-associated transcript encoding a protein with several potential transmembrane domains JOURNAL Biochem. Biophys. Res. Commun. 217 (2), 661-667 (1995) MEDLINE 96106867 REFERENCE 2 (bases 1 to 2533) AUTHORS Rehli,M. TITLE Direct Submission JOURNAL Submitted (21-MAR-1995) M. Rehli, Abt. f. Haematologie und Internistische, Onkologie, Klinikum der Universitaet Regensburg, D- 93042 Regensburg, FRG COMMENT Sequence overlapping with that under the acc#T24981. FEATURES Location/Qualifiers source 1..2533 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="blood" /cell_type="monocyte derived macrophages" /clone_lib="lambda ZAP II" /clone="IIIf-J2a-A2aI" mat_peptide 82..795 CDS 82..798 /note="expression associated with monocyte to macrophage differentiation" /codon_start=1 /db_xref="PID:g1006665" /translation="MRFKNRFQRFMNHRAPANGRYKPTCYEHAANCYTHAFLIVPAIV GSALLHRLSDDCWEKITAWIYGMGLCALFIASTVFHIVSWKKSHLRTAEHCFHMCDRM VIYFFIAASYAPWLNLRELGPLASHMRWFIWLMAAGGTIYVFLYHEKYKVVELFFYLT MGFSPALVVTSMNNTDGLQELACGGLIYCLGVVFFKSDGIIPFAHAIWHLFVATAAAV HYYAIWKYLYRSPTDFMRHL" BASE COUNT 697 a 473 c 479 g 884 t ORIGIN 1 ccaagcccat gagggccgcg cgcccggccg ccggtgctga cgagacggag ctcctggccc 61 ccgaggagga gcagaggatc aatgcggttc aagaatcgat tccagcggtt catgaaccat 121 cgagctccag ccaatggccg ctacaagcca acttgctatg aacatgctgc taactgttac 181 acacacgcat tcctcattgt tccggccatc gtgggcagtg ccctcctcca tcggctgtct 241 gatgactgct gggaaaagat aacagcatgg atttatggaa tgggactctg tgccctcttc 301 atcgcttcta cagtatttca cattgtatca tggaaaaaga gccacttaag gacagcggag 361 cattgttttc acatgtgtga tagaatggtt atctatttct tcattgctgc ttcttatgct 421 ccatggttaa atcttcgtga acttggaccc ctggcatctc atatgcgttg gtttatctgg 481 ctcatggcag ctggaggaac catttatgta tttctctacc atgaaaaata taaggtggtt 541 gaactctttt tctatctcac aatgggattc tctccagcct tggtggtgac atcaatgaac 601 aacaccgatg gacttcagga acttgcctgt gggggcttaa tttattgctt gggagttgtg 661 ttcttcaaga gtgatggcat cattccattt gcccacgcca tctggcacct gtttgtggcc 721 acggcagctg cagtgcatta ctacgccatt tggaaatacc tttaccgaag tcctacggac 781 tttatgcggc atttatgacc aatctgtact aattctccaa accagtatta tttcaattat 841 ggcacttggg agtggggtga gagctaaaca ttgcacaggg caaagaaaaa aaataactgc 901 actgacttta tatcttttga atataattac tgtgaaagta taaaggctgt gttctggaat 961 tttctgcctc acagcaaata aataaggtag tgaattaatt attcattcca ttccactatc 1021 atgaaggact ctgaatagac ttggccaact gatgtttaca aaccagactt ttatatttta 1081 attttacaga ttttactaca tgatttttct aaattactat gtcaggttgt aaaagtcagt 1141 gcaataacaa accttccttt ttaagaagaa aattgtttct attactttcc cattcactag 1201 gtaaagaatc atggacagaa cttacactac tttttaccat gtttcatctt ggcataacat 1261 ggttcttttt taaatagaaa ctttagtttt ttgtaaattt ttaaaaaaat atttcattga 1321 tatgcatctc tgcaggtcct cattcatgtt gtaaattttt ggagcaagca gtcaacattc 1381 cacaaacgaa caaacattat acctcttctg atagttttat taagcatgga gaaattgcca 1441 atttttaaaa actgcagttt tccaaacttt tctgccaacc tcttactctg aattcagtgc 1501 tgctttggga catatacttg acctagcttg gtttaccagt gatggaaaag tattttgata 1561 tcattaactt tttcaaaaga tccaactttt tctctatgcc tttgccacat tctcttcagg 1621 gtctctttcc acagcggata aatgtttttt ctgtattatg acagtattgt tgtgatggcc 1681 atctgctgga aactcctgaa gagcattatg tattacagtg agcagttgta ttgcctgttt 1741 ggtgcccaat ggttaagtca ttgtcactta gctttatatt gtcagtttga tatttatttt 1801 aaattgtgga actagatgca taaattcaca tttctgcctt tcctttgcat cttctcatat 1861 attgtgtttt tttttttttt cctagaaaaa atatttaaag cattgtttga caggtagaaa 1921 ctcatgtatc tgtagtccat gagttatatc ctggctcagt ggagtgatat ttatgtatta 1981 tttttacttt tctctcagtg tcttatatta agattaacat gttgttaata gttgctttgt 2041 tgattaatct ctcttgttgg tgttttaata aatgaaatag gcttgccttt agatcgggtg 2101 ctgatattgc ctgtttccta gtaatgggct gatcaaatga tcagtggaat tcttggtttg 2161 atgataacct tattaattga aattttttac tgatgtggct ttaaaagagg tttattttgt 2221 atatgtttag aactctctga ttttgatgaa ttatatggga gtgagaaaca gaagaagtgg 2281 tatttgctgg cgagttaaat aggcaaggta cccagtgata acaccaacca aaccactcct 2341 atctgcatga ttctgaacat ctggatgcct gttgttttac tgtgtatatt ttatttttaa 2401 tatattaact ttgtggattc atttaaggtc tactcaaaag taacactgtc caaaccacta 2461 atatgtatgt aaaaattgtg ctgtatacta caataaagtt gttacttgga tttgttccaa 2521 aaaaaaaaaa aaa // LOCUS HSTAP2BA 2140 bp RNA PRI 17-SEP-1993 DEFINITION H.sapiens TAP2B mRNA, complete CDS. ACCESSION Z22935 NID g312065 KEYWORDS ABC transporter gene; TAP2B. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2140) AUTHORS Powis,S.H., Mockridge,I., Kelly,A., Kerr,L.A., Glynne,R., Gileadi,U., Beck,S. and Trowsdale,J. TITLE Polymorphism in a second ABC transporter gene located within the class II region of the human major histocompatibility complex JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (4), 1463-1467 (1992) MEDLINE 92159069 REFERENCE 2 (bases 1 to 2140) AUTHORS Kelly,A., Powis,S.H., Kerr,L.A., Mockridge,I., Elliott,T., Bastin,J., Uchanska-Ziegler,B., Ziegler,A., Trowsdale,J. and Townsend,A. TITLE Assembly and function of the two ABC transporter proteins encoded in the human major histocompatibility complex JOURNAL Nature 355 (6361), 641-644 (1992) MEDLINE 92168116 REFERENCE 3 (bases 1 to 2140) AUTHORS Powis,S.H., Tonks,S., Mockridge,I., Kelly,A.P., Bodmer,J.G. and Trowsdale,J. TITLE Alleles and haplotypes of the MHC-encoded ABC transporters TAP1 and TAP2 JOURNAL Immunogenetics 37 (5), 373-380 (1993) MEDLINE 93154779 REMARK Erratum:[Immunogenetics 1993;37(6):480]] REFERENCE 4 (bases 1 to 2140) AUTHORS Powis,S.H. TITLE Direct Submission JOURNAL Submitted (07-JUN-1993) STEPHEN H POWIS, HUMAN IMMUNOGENETICS LABORATORY, IMPERIAL CANCER, RESEARCH FUND, 44 LINCOLN'S INN FIELDS, LONDON, WC2A 3PX, UNITED, KINGDOM FEATURES Location/Qualifiers source 1..2140 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="TAP2B" /cell_type="melanoma derived cell line" /cell_line="DX3" /clone_lib="DX3" /chromosome="6" CDS 29..2140 /function="ABC transporter" /codon_start=1 /product="TAP2B" /db_xref="PID:g312066" /db_xref="SWISS-PROT:Q03519" /translation="MRLPDLRPWTSLLLVDAALLWLLQGPLGTLLPQGLPGLWLEGTL RLGGLWGLLKLRGLLGFVGTLLLPLCLATPLTVSLRALVAGASRAPPARVASAPWSWL LVGYGAAGLSWSLWAVLSPPGAQEKEQDQVNNKVLMWRLLKLSRPDLPLLVAAFFFLV LAVLGETLIPHYSGRVIDILGGDFDPHAFASAIFFMCLFSFGSSLSAGCRGGCFTYTM SRINLRIREQLFSSLLRQDLGFFQETKTGELNSRLSSDTTLMSNWLPLNANVLLRSLV KVVGLYGFMLSISPRLTLLSLLHMPFTIAAEKVYNTRHQEVLREIQDAVARAGQVVRE AVGGLQTVRSFGAEEHEVCRYKEALEQCRQLYWRRDLERALYLLVRRVLHLGVQMLML SCGLQQMQDGELTQGSLLSFMIYQESVGSYVQTLVYIYGDMLSNVGAAEKVFSYMDRQ PNLPSPGTLAPTTLQGVVKFQDVSFAYPNRPDRPVLKGLTFTLRPGEVTALVGPNGSG KSTVAALLQNLYQPTGGQVLLDEKPISQYEHCYLHSQVVSVGQEPVLFSGSVRNNIAY GLQSCEDDKVMAAAQAAHADDFIQEMEHGIYTDVGEKGSQLAAGQKQRLAIARALVRD PRVLILDEATSALDVQCEQALQDWNSRGDRTVLVIAHRLQAVQRAHQILVLQEGKLQK LAQLQEGQDLYSRLVQQRLMD" BASE COUNT 394 a 596 c 675 g 475 t ORIGIN 1 gctgcggtct ccccgccgcg gctgagccat gcggctccct gacctgagac cctggacctc 61 cctgctgctg gtggacgcgg ctttactgtg gctgcttcag ggccctctgg ggactttgct 121 tcctcaaggg ctgccaggac tatggctgga ggggaccctg cggctgggag ggctgtgggg 181 gctgctaaag ctaagagggc tgctgggatt tgtggggaca ctgctgctcc cgctctgtct 241 ggccaccccc ctgactgtct ccctgagagc cctggtcgcg ggggcctcac gtgctccccc 301 agccagagtc gcttcagccc cttggagctg gctgctggtg gggtacgggg ctgcggggct 361 cagctggtca ctgtgggctg ttctgagccc tcctggagcc caggagaagg agcaggacca 421 ggtgaacaac aaagtcttga tgtggaggct gctgaagctc tccaggccgg acctgcctct 481 cctcgttgcc gccttcttct tccttgtcct tgctgttttg ggtgagacat taatccctca 541 ctattctggt cgtgtgattg acatcctggg aggtgatttt gacccccatg cctttgccag 601 tgccatcttc ttcatgtgcc tcttctcctt tggcagctca ctgtctgcag gctgccgagg 661 aggctgcttc acctacacca tgtctcgaat caacttgcgg atccgggagc agcttttctc 721 ctccctgctg cgccaggacc tcggtttctt ccaggagact aagacagggg agctgaactc 781 acggctgagc tcggatacca ccctgatgag taactggctt cctttaaatg ccaatgtgct 841 cttgcgaagc ctggtgaaag tggtggggct gtatggcttc atgctcagca tatcgcctcg 901 actcaccctc ctttctctgc tgcacatgcc cttcacaata gcagcggaga aggtgtacaa 961 cacccgccat caggaagtgc ttcgggagat ccaggatgca gtggccaggg cggggcaggt 1021 ggtgcgggaa gccgttggag ggctgcagac cgttcgcagt tttggggccg aggagcatga 1081 agtctgtcgc tataaagagg cccttgaaca atgtcggcag ctgtattggc ggagagacct 1141 ggaacgcgcc ttgtacctgc tcgtaaggag ggtgctgcac ttgggggtgc agatgctgat 1201 gctgagctgt gggctgcagc agatgcagga tggggagctc acccagggca gcctgctttc 1261 ctttatgatc taccaggaga gcgtggggag ctatgtgcag accctggtat acatatatgg 1321 ggatatgctc agcaatgtgg gagctgcaga gaaggttttc tcctacatgg accgacagcc 1381 aaatctgcct tcacctggca cgcttgcccc caccactctg cagggggttg tgaaattcca 1441 agacgtctcc tttgcatatc ccaatcgccc tgacaggcct gtgctcaagg ggctgacgtt 1501 taccctacgt cctggtgagg tgacggcgct ggtgggaccc aatgggtctg ggaagagcac 1561 agtggctgcc ctgctgcaga atctgtacca gcccacaggg ggacaggtgc tgctggatga 1621 aaagcccatc tcacagtatg aacactgcta cctgcacagc caggtggttt cagttgggca 1681 ggagcctgtg ctgttctccg gttctgtgag gaacaacatt gcttatgggc tgcagagctg 1741 cgaagatgat aaggtgatgg cggctgccca ggctgcccac gcagatgact tcatccagga 1801 aatggagcat ggaatataca cagatgtagg ggagaagggg agccagctgg ctgcgggaca 1861 gaaacaacgt ctggccattg cccgggccct tgtacgagac ccgcgggtcc tcatcctgga 1921 tgaggctact agtgccctag atgtgcagtg cgagcaggcc ctgcaggact ggaattcccg 1981 tggggatcgc acagtgctgg tgattgctca caggctgcag gcagttcagc gcgcccacca 2041 gatcctggtg ctccaggagg gcaagctgca gaagcttgcc cagctccagg agggacagga 2101 cctctattcc cgcctggttc agcagcggct gatggactga // LOCUS HSTATR 2754 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for tyrosine aminotransferase (TAT) (EC 2.6.1.5). ACCESSION X52520 NID g36712 KEYWORDS aminotransferase; transferase; tyrosine aminotransferase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2754) AUTHORS Scherer,G. TITLE Direct Submission JOURNAL Submitted (08-MAR-1990) Scherer G., Institute of Human Genetics, Albertstr 11, D 7800 Freiburg REMARK 9bases 1-2754) REFERENCE 2 (bases 1 to 2754) AUTHORS Rettenmeier,R., Natt,E., Zentgraf,H. and Scherer,G. TITLE Isolation and characterization of the human tyrosine aminotransferase gene JOURNAL Nucleic Acids Res. 18 (13), 3853-3861 (1990) MEDLINE 90326506 COMMENT See -. Data kindly reviewed (26-JUL-1990) by G. Scherer. FEATURES Location/Qualifiers source 1..2754 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta (genomic) and liver (cDNA)" /clone_lib="genomic (phage) + cDNA" /clone="lambda-hTAT1, phcTAT2-16 + phcTAT3a-6" /chromosome="16q22.1" CDS 97..1461 /note="tyrosine aminotransferase (AA 1-454)" /codon_start=1 /db_xref="PID:g36713" /db_xref="SWISS-PROT:P17735" /translation="MDPYMIQMSSKGNLPSILDVHVNVGGRSSVPGKMKGRKARWSVR PSDMAKKTFNPIRAIVDNMKVKPNPNKTMISLSIGDPTVFGNLPTDPEVTQAMKDALD SGKYNGYAPSIGFLSSREEIASYYHCPEAPLEAKDVILTSGCSQAIDLCLAVLANPGQ NILVPRPGFSLYKTLAESMGIEVKLYNLLPEKSWEIDLKQLEYLIDEKTACLIVNNPS NPCGSVFSKRHLQKILAVAARQCVPILADEIYGDMVFSDCKYEPLATLSTDVPILSCG GLAKRWLVPGWRLGWILIHDRRDIFGNEIRDGLVKLSQRILGPCTIVQGALKSILCRT PGEFYHNTLSFLKSNADLCYGALAAIPGLRPVRPSGAMYLMVGIEMEHFPEFENDVEF TERLVAEQSVHCLPATCFEYPNFIRVVITVPEVMMLEACSRIQEFCEQHYHCAEGSQE ECDK" misc_feature 2070..2075 /note="minor mRNA polyadenylation signal" polyA_site 2090 /note="minor mRNA polyadenylation site" repeat_region 2116..2130 /note="direct repeat flanking Alu element" misc_feature complement(2131..2479) /note="Alu element" repeat_region 2480..2494 /note="direct repeat flanking Alu element" misc_feature 2738..2743 /note="major mRNA polyadenylation signal" polyA_site 2754 /note="major mRNA polyadenylation site" BASE COUNT 689 a 646 c 635 g 784 t ORIGIN 1 attgcccctg taacctgtca aagaagagct aagggagctt tcggggttgg cttcttggag 61 gctgctttct cctttacttg gaaggcttcg ctagtgatgg acccatacat gattcagatg 121 agcagcaaag gcaacctccc ctcaattctg gacgtgcatg tcaacgttgg tgggagaagc 181 tctgtgccgg gaaaaatgaa aggcagaaag gccaggtggt ctgtgaggcc ctcagacatg 241 gccaagaaaa ctttcaaccc catccgagcc attgtggaca acatgaaggt gaaaccaaat 301 ccaaacaaaa ccatgatttc cctgtccatt ggggacccta ctgtgtttgg aaacctgcct 361 acagaccctg aagttaccca ggcaatgaaa gatgccctgg actcgggcaa atataatggc 421 tatgccccat ccatcggctt cctatccagt cgggaggaga ttgcttctta ttaccactgt 481 cctgaggcac ccctagaagc taaggacgtc attctgacaa gtggctgcag ccaagctatt 541 gacctttgtt tagctgtgtt ggccaaccca gggcagaaca tcctggttcc aagacctggt 601 ttctctctct acaagactct ggctgagtct atgggaattg aggtcaaact ctacaatttg 661 ttgccagaga aatcttggga aattgacctg aaacaactgg aatatctaat tgatgaaaag 721 acagcttgtc tcattgtcaa taatccatca aacccctgtg ggtcagtgtt cagcaaacgt 781 catcttcaga agattctggc agtggctgca cggcagtgtg tccccatctt agctgatgag 841 atctatggag acatggtgtt ttcggattgc aaatatgaac cactggccac cctcagcacc 901 gatgtcccca tcctgtcctg tggagggctg gccaagcgct ggctggttcc tggctggagg 961 ttgggctgga tcctcattca tgaccgaaga gacatttttg gcaatgagat ccgagatggg 1021 ctggtgaagc tgagtcagcg cattttggga ccctgtacca ttgtccaggg agctctgaaa 1081 agcatcctat gtcgcacccc gggagagttt taccacaaca ctctgagctt cctcaagtcc 1141 aatgctgatc tctgttatgg ggcgttggct gccatccctg gactccggcc agtccgccct 1201 tctggggcta tgtacctcat ggttggaatt gagatggaac atttcccaga atttgagaac 1261 gatgtggagt tcacggagcg gttagttgct gagcagtctg tccactgcct cccagcaacg 1321 tgctttgagt acccgaattt catccgagtg gtcatcacag tccccgaggt gatgatgctg 1381 gaggcgtgca gccggatcca ggagttctgt gagcagcact accattgtgc tgaaggcagc 1441 caggaggagt gtgataaata ggcctgcatc cattctcctg aggatgtgtc ccatctaggg 1501 aaggctggac taggccttgc ggctcctcag ggactcaggt ggccctactg ggagaggggc 1561 ctcaaatgca ccatgtcaag ggttcaagat tgttcctgct tttccccaag tacaaccaca 1621 cccacactca gatcctcctc attcacatcg cagattactc ccttgctctg cgctgctaga 1681 gtgactcact aattcattaa tctgcctccc tctcgtaaga tttccttctt ttttttcttg 1741 aaagtaccag gtgaacaaag tttaccagaa agcagttgag acaagaaaat aagagctcag 1801 gatgagggaa aagaaaaaga ttgagagaat ttgtgccccc aaccatttcc tcagactcta 1861 agaaagaaca cgctctctcc aggcaggtct gaagctcaac tctcttattg cctcacttca 1921 ggtatacctc actttacaca atagaattat aactggaaag aagttgggga cacatgtatt 1981 tggtgattac attttaaaca cattaggaaa agttgctatt tgaacttttt attgattttt 2041 ggggggagta aagaattatt ttggatgcaa ataaatatcc tttaattgat cgacttgcca 2101 aatttagatt tgtgtgcatc aggctttctt ttttttcttt ttttagagaa gttcaatata 2161 agcttttctt ttctttgttt ctttctttct ttattttgag atggagtctt gctctgtcgc 2221 ccatgctgga gtgcagtggc gcgatctcgg ctcactgcaa cctccacctc ctgggttcaa 2281 gcgattctct tgcctcaacc tcccaagcag ttgggactac aggcgtgagc caccatgccc 2341 ggctaatttt tgtattttta gtagagacag ggtttcacca tgttagccag gctggtctca 2401 aactcctgac ctcaggcaat ctgcccgcct gggtctccta aagtactggg attacaggcg 2461 tgagccacct cgcccagcgg catcaggctt tcttaaagtg agagcacgcc tgtactagag 2521 caagcaggaa tcagagacct tccagaaata ctactgtgta agggccagaa atatcttcac 2581 ttgtcattgt tatataatca ttattacttt tgctgtaatg ttaatattga tttattaata 2641 tatattatct tttcatacat tttctaagaa acatttatat tgataagatc ttttattttg 2701 caagggcata aattattgtt tttctttttt tttttttaat aaatttcacc aagt // LOCUS HSTAUI 1200 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for microtubule-associated tau protein. ACCESSION X14474 NID g36724 KEYWORDS phosphoprotein; tau protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1200) AUTHORS Crowther,R.A. TITLE Direct Submission JOURNAL Submitted (20-APR-1989) to the EMBL/GenBank/DDBJ databases REFERENCE 2 (bases 1 to 1200) AUTHORS Goedert,M., Spillantini,M.G., Potier,M.C., Ulrich,J. and Crowther,R.A. TITLE Cloning and sequencing of the cDNA encoding an isoform of microtubule-associated protein tau containing four tandem repeats: differential expression of tau protein mRNAs in human brain JOURNAL EMBO J. 8 (2), 393-399 (1989) MEDLINE 89251564 COMMENT Data kindly reviewed (22nd May 1989) by Crowther R.A. FEATURES Location/Qualifiers source 1..1200 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /clone="lambda PHF24" CDS 31..1182 /note="tau protein (AA 1 - 383)" /codon_start=1 /db_xref="PID:g36725" /db_xref="SWISS-PROT:P10636" /translation="MAEPRQEFEVMEDHAGTYGLGDRKDQGGYTMHQDQEGDTDAGLK AEEAGIGDTPSLEDEAAGHVTQARMVSKSKDGTGSDDKKAKGADGKTKIATPRGAAPP GQKGQANATRIPAKTPPAPKTPPSSGEPPKSGDRSGYSSPGSPGTPGSRSRTPSLPTP PTREPKKVAVVRTPPKSPSSAKSRLQTAPVPMPDLKNVKSKIGSTENLKHQPGGGKVQ IINKKLDLSNVQSKCGSKDNIKHVPGGGSVQIVYKPVDLSKVTSKCGSLGNIHHKPGG GQVEVKSEKLDFKDRVQSKIGSLDNITHVPGGGNKKIETHKLTFRENAKAKTDHGAEI VYKSPVVSGDTSPRHLSNVSSTGSIDMVDSPQLATLADEVSASLAKQGL" BASE COUNT 323 a 359 c 342 g 176 t ORIGIN 1 tgtcgactat caggtgaact ttgaaccagg atggctgagc cccgccagga gttcgaagtg 61 atggaagatc acgctgggac gtacgggttg ggggacagga aagatcaggg gggctacacc 121 atgcaccaag accaagaggg tgacacggac gctggcctga aagctgaaga agcaggcatt 181 ggagacaccc ccagcctgga agacgaagct gctggtcacg tgacccaagc tcgcatggtc 241 agtaaaagca aagacgggac tggaagcgat gacaaaaaag ccaagggggc tgatggtaaa 301 acgaagatcg ccacaccgcg gggagcagcc cctccaggcc agaagggcca ggccaacgcc 361 accaggattc cagcaaaaac cccgcccgct ccaaagacac cacccagctc tggtgaacct 421 ccaaaatcag gggatcgcag cggctacagc agccccggct ccccaggcac tcccggcagc 481 cgctcccgca ccccgtccct tccaacccca cccacccggg agcccaagaa ggtggcagtg 541 gtccgtactc cacccaagtc gccgtcttcc gccaagagcc gcctgcagac agcccccgtg 601 cccatgccag acctgaagaa tgtcaagtcc aagatcggct ccactgagaa cctgaagcac 661 cagccgggag gcgggaaggt gcagataatt aataagaagc tggatcttag caacgtccag 721 tccaagtgtg gctcaaagga taatatcaaa cacgtcccgg gaggcggcag tgtgcaaata 781 gtctacaaac cagttgacct gagcaaggtg acctccaagt gtggctcatt aggcaacatc 841 catcataaac caggaggtgg ccaggtggaa gtaaaatctg agaagcttga cttcaaggac 901 agagtccagt cgaagattgg gtccctggac aatatcaccc acgtccctgg cggaggaaat 961 aaaaagattg aaacccacaa gctgaccttc cgcgagaacg ccaaagccaa gacagaccac 1021 ggggcggaga tcgtgtacaa gtcgccagtg gtgtctgggg acacgtctcc acggcatctc 1081 agcaatgtct cctccaccgg cagcatcgac atggtagact cgccccagct cgccacgcta 1141 gctgacgagg tgtctgcctc cctggccaag cagggtttgt gatcaggccc ctggggcggt // LOCUS HSTAUTRAN 3969 bp RNA PRI 01-FEB-1993 DEFINITION H.sapiens mRNA for taurine transporter. ACCESSION Z18956 NID g36726 KEYWORDS taurine transporter. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3969) AUTHORS Jhiang,S.M., Fithian,L., Smanik,P., Mcgill,J., Tong,Q. and Mazzaferri,E.L. TITLE Cloning of the human taurine transporter and characterization of the taurine uptake in thyroid cells JOURNAL Unpublished REFERENCE 2 (bases 1 to 3969) AUTHORS Jhiang,S.M. TITLE Direct Submission JOURNAL Submitted (02-DEC-1992) Jhiang S. M., The Ohio State University, Internal Medicine, 446 Mccampbell Hall, 1581 Dodd Drive, Columbus, OH, USA, 43210 FEATURES Location/Qualifiers source 1..3969 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="THYROID" CDS 20..1879 /codon_start=1 /product="taurine transporter" /db_xref="PID:g36727" /db_xref="SWISS-PROT:P31641" /translation="MATKEKLQCLKDFHKDMVKPSPGKSPGTRPEDEAEGKPPQREKW SSKIDFVLSVAGGFVGLGNVWRFPYLCYKNGGGAFLIPYFIFLFGSGLPVFFLEIIIG QYTSEGGITCWEKICPLFSGIGYASVVIVSLLNVYYIVILAWATYYLFQSFQKELPWA HCNHSWNTPHCMEDTMRKNKSVWITISSTNFTSPVIEFWERNVLSLSPGIDHPGSLKW DLALCLLLVWLVCFFCICKGVRSTGKVVYFTATFPFAMLLVLLVRGLTLPGAGRGIKF YLYPDITRLEDPQVWIDAGTQIFFSYAICLGAMTSLGSYNKYKYNSYRDCMLLGCLNS GTSFVSGFAIFSILGFMAQEQGVDIADVAESGPGLAFIAYPKAVTMMPLPTFWSILFF IMLLLLGLDSQFVEVEGQITSLVDLYPSFLRKGYRREIFIAFVCSISYLLGLTMVTEG GMYVFQLFDYYAASGVCLLWVAFFECFVIAWIYGGDNLYDGIEDMIGYRPGPWMKYSW VITPVLCVGCFIFSLVKYVPLTYNKTYVSPTWAIGLGWSLALSSMLCVPLVIVIRLCQ TEGPFLVRVKYLLTPREPNRWAVEREGATPYNSRTVMNGALVKPTHIIVETMM" BASE COUNT 906 a 961 c 1041 g 1061 t ORIGIN 1 gaattccgaa agcaaggaga tggccaccaa ggagaagctg cagtgtctga aagatttcca 61 caaggacatg gtgaagccct caccagggaa gagcccaggc acgcggcctg aggacgaggc 121 tgagggaaaa cctccgcaga gggagaagtg gtctagcaag atcgactttg tgctctctgt 181 ggctggcggc ttcgtgggct tgggcaacgt ctggcgcttc ccgtacctct gctacaagaa 241 tggtggaggt gcgtttctca taccgtattt tattttcctg tttgggagcg gcctgcctgt 301 gtttttcttg gagatcatca taggccagta cacctctgaa gggggcatca cctgctggga 361 aaagatctgc cccttgttct ctggtatcgg ctatgcctcc gttgtaattg tgtccctcct 421 gaatgtctac tacatcgtca tcctggcctg ggccacatac tacctgttcc agtccttcca 481 gaaggagctg ccctgggcac actgcaacca cagctggaac acacctcact gcatggagga 541 caccatgcgc aagaacaaga gtgtctggat caccatcagc tccaccaact tcacctcccc 601 tgtcatcgag ttctgggagc gcaacgtgct gagcttgtcc cctggaatcg accacccagg 661 ctctctgaaa tgggacctcg ctctctgcct tcttttagtc tggctagtgt gtttcttctg 721 catctgcaag ggcgtcaggt ccactgggaa ggtcgtctac ttcacagcca cttttccatt 781 cgccatgctc ctggtgctgc tggtccgagg gctgacgctg ccgggcgcgg gccgaggcat 841 caagttctat ctgtatcctg acatcacccg ccttgaggac ccacaggtgt ggattgacgc 901 tgggactcag atattcttct cttatgccat ctgcctgggg gctatgacct cgctggggag 961 ctacaacaag tacaagtata actcgtacag ggactgtatg ctgctgggat gcctgaacag 1021 tggtaccagt tttgtgtctg gcttcgcaat tttttccatc ctgggcttca tggcacaaga 1081 gcaaggggtg gacattgctg atgtggctga gtcaggtcct ggcctggcct tcattgccta 1141 cccaaaagct gtgacaatga tgccgctgcc cacattttgg tccattcttt tttttattat 1201 gcttctcttg cttggactgg atagccagtt tgttgaagtt gaaggacaga tcacatcctt 1261 ggttgatctt tacccatcct tcctaaggaa gggttatcgt cgggaaatct tcatcgcctt 1321 cgtgtgtagc atcagctacc tgctggggct gacgatggtg acggagggtg gcatgtatgt 1381 gtttcagctc tttgactact atgcagctag cggtgtatgc cttttgtggg ttgcattctt 1441 tgaatgtttt gttattgcct ggatatatgg aggtgataac ctttatgatg gtattgagga 1501 catgattggc tatcggcccg ggccctggat gaagtacagc tgggtgatca ctccagttct 1561 ctgtgttgga tgtttcatct tctcgctcgt caagtacgta cccctgacct acaacaaaac 1621 atacgtgtcc ccaacttggg ccattgggct gggctggagc ctggcccttt cctccatgct 1681 ctgcgttccc ttggtcatcg tcatccgcct ctgccagact gaggggccgt tccttgtgag 1741 agtcaagtac ctgctgaccc caagggaacc caaccgctgg gctgtggagc gcgagggagc 1801 cacaccttac aactctcgca ccgtcatgaa cggcgctctc gtgaaaccga cccacatcat 1861 tgtggagacc atgatgtgag ctctctcggg tcgacggggc cggcggcttt cctgctgttt 1921 actaacatta gattcacata ggaccaggtt tacagagctt tatatttgca ctaggatttt 1981 tttttttttg taattgtcac agaaaatgta attgtgggta tgtgtgcgtg cgtgtgtgtg 2041 tgtgtgtgtg tgtatcgtgt gtgtgtgttt tgttttgatt tgggggatat tttgtacaaa 2101 aagaaaaccc acgggaagat gtccgtggag aggcagagct ttcatactga attagatgta 2161 ttttatggga atttggtaaa tttttctttg tatttttttt tttacatata agtatatata 2221 cacttagaga ttgtcatata cttttaccac ttgaattgat cttcttgcca gcaatagatc 2281 tcattttcaa aagcaattct tcggtgctgt gtagctggca gaaagttctg tccagtaaac 2341 gcaggatgga attttcctgg gactctacac ccatcttaag gtggtatacc ttccaaatcc 2401 tggttcagat ggaagaaata gcaggagaga ggacccatta gctggcagac ccaggggaag 2461 aaaggagggc tgtgaggaga tacctcatta aacttggctt agtgaagaag agagatgcca 2521 aaggaatgaa ccaacccttc acataaagga gactggctga agctgaatga ggaggcccta 2581 tagcagaagt ctgattctaa gagcagtaga aacttgtacc agaagcaaaa tcccactttt 2641 aattttgaga tggtgagtgg atagtcagta gaccgtcaga accactggcc agagagggag 2701 ctgctagaga tccaagaagg ctggcaggaa tgaggctcac aactcagcct cgcaagaggt 2761 ggcagaggca caggaggcca cagtccttcc tggggcattc caggcagaga aggagcagag 2821 gctctcccgg caggagctgg ggtctcaggg ctcagatgag tctgttgcat ttgaatgggg 2881 tcatagcagg ttctggtcat tccccaagca acatctcagc atctcttaaa gttgcctgca 2941 ggaatgaagc atgacatacc tgttgaggga ctaggggagt ggtggggagg tgagtggacc 3001 aaaggatata ggccccaggc atgcagatgg gcccggtgtc ggggaggggt gctttctttc 3061 ctcatctccc cactccccac tctcagcctg ggagactcct gccaagccct cattaaagat 3121 gccaccctgg gctgccctgg cacctagcaa ggcacaccaa gaacagcttt tgagtcgtat 3181 cctccactgg ggaagtgctc ccagttcaga acaagggcag cccgtggtgc tgacctagga 3241 tataacaaag ctcttcactt caaaacccct gcaatagctg ggtttacaga catttaccac 3301 ctggggaccc aaaagagaag gcctaggaga gttttctaga aggttgggat tgtcagggtc 3361 ctggcccccc agaactggct tgatcaaggg ccttatgtgg agcagaggtt gtctctgaac 3421 caggagagaa ggtactatac ctttcaaatc cccagggcag acacaccccc acccagcccc 3481 tatttggacc taaactgtgc catttgaaca gtcacttcca agctcagtct aaatgaaacc 3541 gaaacgtgac cacgcacaaa ggcagtcact gctcgagggg tgcagaccgc agaattttca 3601 cagcaggggc tcttggaact ctggaaaccc ccttcttaaa tttgggagga ggagtatgcc 3661 tttggtgtcc ccctcccaag ggcaattctg aaccccatct ttggcaggca tacatatttc 3721 actgtttcca aagctatcta ctctgccaaa caacacccag tcctattcca aactctcaac 3781 gattctatct tgttcctgtt tttctatgta tttatggttg ccgtttgtgt ctgatttgat 3841 tttactgttt tttccctgat tttatggagt agcattgtga cctgttttcc tttgtcttat 3901 ataactttag taaactaacc actgtcaatg attgagggca ggtggcacgt ggggaagagg 3961 gcggaattc // LOCUS HSTAZFP1 2024 bp RNA PRI 05-JUN-1997 DEFINITION H.sapiens mRNA for translin associated zinc finger protein-1. ACCESSION X95072 NID g1770527 KEYWORDS POZ domain; TAZ-1 gene; translin associated zinc finger protein-1. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2024) AUTHORS Aoki,K., Ishida,R. and Kasai,M. TITLE Isolation and characterization of a cDNA encoding a Translin-like protein, TRAX JOURNAL FEBS Lett. 401 (2-3), 109-112 (1997) MEDLINE 97165975 REFERENCE 2 (bases 1 to 2024) AUTHORS Kasai,M. TITLE Direct Submission JOURNAL Submitted (15-JAN-1996) M. Kasai, N.I.H., Immunology, 1-23-1, Toyama, Shinjuku-ku, Tokyo, 162, JAPAN FEATURES Location/Qualifiers source 1..2024 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="D15" /tissue_type="spleen" gene 317..1885 /gene="TAZ-1" CDS 317..1885 /gene="TAZ-1" /note="POZ domain" /codon_start=1 /product="Translin Associated Zinc Finger protein-1" /db_xref="PID:e225462" /db_xref="PID:g1770528" /translation="MEFPDHSRHLLQCLSEQRHQGFLCDCTVLVGDAQFRAHRAVLAS CSMYFHLFYKDQLDKRDIVHLNSDIVTAPAFALLLEFMYEGKLQFKDLPIEDVLAAAS YLHMYDIVKVCKKKLKEKATTEADSTKKEEDASSCSDKVESLSDGSSHIAGDLPSDED EGEDEKLNILPSKRDLAAEPGNMWMRLPSDSAGIPQAGGEAEPHATAAGKTVASPCSS TESLSQRSVTSVRDSADVDCVLDLSVKSSLSGVENLNSSYFSSQDVLRSNLVQVKVEK EASCDESDVGTNDYDMEHSTVKESVSTNNRVQYEPAHLAPLREDSVLRELDREDKASD DEMMTPESERVQVEGGMESSLLPYVSNILSPAGQIFMCPLCNKVFPSPHILQIHLSTH FREQDGIRSKPAADVNVPTCSLCGKTFSCMYTLKRHERTHSGEKPYTCTQCGKSFQYS HNLSRHAVVHTREKPHACKWCERRFTQSGDLYRHIRKFHCELVNSLSVKSEALSLPTV RDWTLEDSSQELWK" BASE COUNT 535 a 481 c 542 g 466 t ORIGIN 1 aaattcttat tcttagtgag agactgtagt taaaaggaag gcttttagaa cttgggttca 61 aggaagatgg agatgcgtcg gaagctcttt ggcgggggtg aggaagttca gaaagtgtgc 121 attttccttc tggcatttag gtcttgtccg tgtgatttgg tggtgcttgg gtcataagcc 181 tgattaaaat tcagggacat gtaccacggc ggccaaagcg gaattaattt ttttatatgg 241 ggactggagc gctgaaaagt tgttcctgac caggctctaa tgagaaattc ctctctcccc 301 aggttatgaa gacagtatgg agtttccaga ccatagtaga catttgctac agtgtctgag 361 cgagcagaga caccagggtt ttctttgtga ctgcactgtt ctggtgggag atgcccagtt 421 ccgagcgcac cgagctgtac tggcttcatg cagcatgtat ttccacctct tttacaagga 481 ccagctggac aaaagagaca ttgttcatct gaacagcgac attgttacag cccccgcttt 541 cgctctcctg cttgaattca tgtatgaagg gaaactccag ttcaaagact tgcccattga 601 agacgtgcta gcagctgcca gttatctcca catgtatgac attgtcaaag tctgcaaaaa 661 gaagctgaaa gagaaagcca ccacggaggc agacagcacc aaaaaggaag aagatgcttc 721 aagttgttcg gacaaagtcg agagtctctc cgatggcagc agccacatag caggcgattt 781 gcccagtgat gaagatgaag gagaagatga aaaattgaac atcctgccca gcaaaaggga 841 cttggcggcc gagcctggga acatgtggat gcgattgccc tcagactcag caggcatccc 901 ccaggctggc ggagaggcag agccacacgc cacagcagct ggaaaaacag tagccagccc 961 ctgcagctca acagagtctt tgtcccagag gtctgtcacc tccgtgaggg attcggcaga 1021 tgttgactgt gtgctggacc tgtctgtcaa gtccagcctt tcaggagttg aaaatctgaa 1081 cagctcttat ttctcttcac aggacgtgct gagaagcaac ctggtgcagg tgaaggtgga 1141 gaaagaggct tcctgtgatg agagtgatgt tggcactaat gactatgaca tggaacatag 1201 cactgtgaaa gaaagtgtga gcactaataa cagggtacag tatgagccgg cccatctggc 1261 tcccctgagg gaggactcgg tcttgaggga gctggaccgg gaggacaaag ccagtgatga 1321 tgagatgatg accccagaga gcgagcgtgt ccaggtggag ggaggcatgg agagcagtct 1381 gctcccctac gtctccaaca tcctgagccc cgcgggccag atcttcatgt gccccctgtg 1441 caacaaggtc ttccccagcc cccacatcct gcagatccac ctgagcacgc acttccgcga 1501 gcaggacggc atccgcagca agcccgccgc cgatgtcaac gtgcccacgt gctcgctgtg 1561 tgggaagact ttctcttgca tgtacaccct caagcgccac gagaggactc actcggggga 1621 gaagccctac acatgcaccc agtgcggcaa gagcttccag tactcgcaca acctgagccg 1681 ccatgccgtg gtgcacaccc gcgagaagcc gcacgcctgc aagtggtgcg agcgcaggtt 1741 cacgcagtcc ggggacctgt acagacacat tcgcaagttc cactgtgagt tggtgaactc 1801 cttgtcggtc aaaagcgaag cactgagctt gcctactgtc agagactgga ccttagaaga 1861 tagctctcaa gaactttgga aataatttta tatatatata aataatatat atatatatac 1921 atatatataa atagatctct atatagttgt ggtacggtct aaaagcagtc ttgtttcctg 1981 gaaataaaaa gttgggatat taacttgttt ttgcacttta gaat // LOCUS HSTBX5 2441 bp RNA PRI 27-JAN-1997 DEFINITION H.sapiens mRNA for transcription factor TBX5. ACCESSION Y09445 NID g1772560 KEYWORDS TBX5 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2441) AUTHORS Li,Q.Y., Newbury-Ecob,R., Terrett,J.A., Wilson,D.I., Curtis,A., Yi,C.H., Bullen,P.J., Strachan,T., Robson,S., Bonnet,D., Young,I.E., Raeburn,J.A., Buckler,A.J., Gebuhr,T., Law,D.J. and Brook,J.D. TITLE Holt-Oram syndrome is caused by mutations in TBX5, a member of the Brachyury (T) gene family JOURNAL Nature Genet. 15 (1), 21-29 (1997) MEDLINE 97141914 REFERENCE 2 (bases 1 to 2441) AUTHORS Li,Q.Y. TITLE Direct Submission JOURNAL Submitted (15-NOV-1996) Q.Y. Li, The University of Nottingham, Queen's Medical Centre, Department of genetics, Nottingham, NG7 2UH, UK FEATURES Location/Qualifiers source 1..2441 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="12" /map="q2" /map="YAC 887_b_9 positive" gene 666..2207 /gene="TBX5" CDS 666..2207 /gene="TBX5" /note="transcription factor" /codon_start=1 /db_xref="PID:e290264" /db_xref="PID:g1772561" /translation="MADADEALAGAHLWSLTQKTCLRFEPRARSGPPASPPGRPRSRL HPAGMEGIKVFLHERELWLKFHEVTEMIITKAGRRMFPSYKVKVTGINPKTKYILLMD IVPADDHRYKFADNKWCVTGKAEPAMAGRLYVHPDSPATGAHWMRQLVSFQKLKLTNN HLDPFGHIILNSMHKYQPRLHIVKADENNGFGSKNTAFCTHVFPETAFIAVTSYQNHK ITQLKIENNPFAKGFRGSDDMELHRMSRMQSKEYPVVPRSTVRQKVASNHSPFSSESR ALSTSSNLGSQYQCENGVSGPSQDLLPPPNPYPLPQEHSQIYHCTKRKEEECSTTDHP YKKPYMETSPSEEDSFYRSSYPQQQGLGASYRTESAQRQACMYASSAPPSEPVPSLED ISCNTWPSMPSYSSCTVTTVQPWTGYPTSTSPLTSPRGPWSLGWLAWQPWLPTAGRGN VPSTRPPVAHQPVVSSVGPQTGLQSPGTLQPPEFLYSHGVQGLYPLISTTLCTELAWC RVERQ" BASE COUNT 618 a 712 c 594 g 517 t ORIGIN 1 catgccttat gcaagagacc tcagtccccc ggaacaactc gatttccttc caatagaggt 61 ctgaggtgga ctcccacctc ccttcgtgaa gagttccctc ctctccccct tcctaagaaa 121 gtcgatcttg gctctatttg tgtcttatgt tcatcaccct cattcctccg gagaaagccg 181 ggttggttta tgtctttatt tattcccggg gccaagacgt ccggaacctg tggctgcgca 241 gacccggcac tgataggcga agacggagag aaatttacct cccgccgctg ccccccagcc 301 aaacgtgaca gcgcgcgggc cggttgcgtg actcgtgacg tctccaagtc ctataggtgc 361 agcggctggt gagatagtcg ctatcgcctg gttgcctctt tattttactg gggtatgcct 421 ggtaataaac agtaatattt aatttgtcgg agaccacaaa ccaaccttga gctgggaggt 481 acgtgctctt cttgacagac gttggaagaa gacctggcct aaagaggtct cttttggtgg 541 tccttttcaa agtcttcacc tgagccctgc tctccagcga ggcgcactcc tggcttttgc 601 gctccaaaga agaggtggga tagttggaga gcagaacctt gcgcgggcac aggcctgggc 661 gcaccatggc cgacgcagac gaggctttgg ctggcgcaca cctctggagc ctgacgcaaa 721 agacctgcct gcgattcgaa ccgagagcgc gctcggggcc cccagcaagt ccccccggtc 781 gtccccgcag ccgccttcac ccagcaggca tggagggaat caaagtgttt ctccatgaaa 841 gagaactgtg gctaaaattc cacgaagtca cggaaatgat cataaccaag gctggaaggc 901 ggatgtttcc cagttacaaa gtgaaggtga cgggcattaa tcccaaaacg aagtacattc 961 ttctcatgga cattgtacct gcggacgatc acagatacaa attcgcagat aataaatggt 1021 gtgtgacggg caaagctgag cccgccatgg ctggccgcct gtacgtgcac ccagactccc 1081 ccgccaccgg ggcgcattgg atgaggcagc tcgtctcctt ccagaaactc aagctcacca 1141 acaaccacct ggacccattt gggcatatta ttctaaattc catgcacaaa taccagccta 1201 gattacacat cgtgaaagcg gatgaaaata atggatttgg ctcaaaaaat acagcgttct 1261 gcactcacgt ctttcctgag actgcgttta tagcagtgac ttcctaccag aaccacaaga 1321 tcacgcaatt aaagattgag aataatccct ttgccaaagg atttcggggc agtgatgaca 1381 tggagctgca cagaatgtca agaatgcaaa gtaaagaata tcccgtggtc cccaggagca 1441 ccgtgaggca aaaagtggcc tccaaccaca gtcctttcag cagcgagtct cgagctctct 1501 ccacctcatc caatttgggg tcccaatacc agtgtgagaa tggtgtttcc ggcccctccc 1561 aggacctcct gcctccaccc aacccatacc cactgcccca ggagcatagc caaatttacc 1621 attgtaccaa gaggaaagag gaagaatgtt ccaccacaga ccatccctat aagaagccct 1681 acatggagac atcacccagt gaagaagatt ccttctaccg ctctagctat ccacagcagc 1741 agggcctggg tgcctcctac aggacagagt cggcacagcg gcaagcttgc atgtatgcca 1801 gctctgcgcc ccccagcgag cctgtgccca gcctagagga catcagctgc aacacgtggc 1861 caagcatgcc ttcctacagc agctgcaccg tcaccaccgt gcagccatgg acaggctacc 1921 ctaccagcac ttctccgctc acttcacctc ggggcccctg gtccctcggc tggctggcat 1981 ggcaaccatg gctccccaca gctgggagag ggaatgttcc cagcaccaga cctcccgtgg 2041 cccaccagcc tgtggtcagc agtgtggggc cccaaactgg cctgcagtcc cctggcaccc 2101 ttcagccccc tgagttcctc tactctcatg gcgtgcaagg actctatccc ctcatcagta 2161 ccactctgtg cacggagttg gcatggtgca gagtggagcg acaatagcta aagtgaggcc 2221 tgcttcacaa cagacatttc ctagagaaag agagagagag aggagaaaga gagagaagga 2281 gagagacagt agccaagaga accccacaga caagattttt catttcaccc aatgttcaca 2341 tctgcactca aggtcgctgg atgctgatct aatcagtagc ttgaaaccac aattttaaaa 2401 atgtgacttt cttgttttgt ctcaaaactt aaaaaaaaaa a // LOCUS HSTC2 677 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for fast skeletal troponin C. ACCESSION X07898 NID g36728 KEYWORDS troponin; troponin C. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 677) AUTHORS Gahlmann,R., Wade,R., Gunning,P. and Kedes,L. TITLE Differential expression of slow and fast skeletal muscle troponin C. Slow skeletal muscle troponin C is expressed in human fibroblasts JOURNAL J. Mol. Biol. 201 (2), 379-391 (1988) MEDLINE 88332973 COMMENT Data kindly reviewed (02-SEP-1988) by GAHLMANN R. FEATURES Location/Qualifiers source 1..677 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="muscle" /clone="TC2" CDS 65..547 /note="troponin C (AA 1-160)" /codon_start=1 /db_xref="PID:g36729" /db_xref="SWISS-PROT:P02585" /translation="MTDQQAEARSYLSEEMIAEFKAAFDMFDADGGGDISVKELGTVM RMLGQTPTKEELDAIIEEVDEDGSGTIDFEEFLVMMVRQMKEDAKGKSEEELAECFRI FDRNADGYIDPEELAEIFRASGEHVTDEEIESLMKDGDKNNDGRIDFDEFLKMMEGVQ " misc_feature 660..665 /note="polyA signal" polyA_site 677 /note="polyA site" BASE COUNT 161 a 160 c 237 g 119 t ORIGIN 1 atctttgggt ggtggagtgc aaaggaggcg acctgcaaca gaggagtccc ggtcaccagc 61 aaccatgacg gaccagcagg ctgaggccag gtcctacctc agcgaagaga tgatcgctga 121 gttcaaggct gcctttgaca tgtttgatgc tgatggtggt ggggacatca gcgtcaagga 181 gttgggcacg gtgatgagga tgctgggcca gacacccacc aaggaggagc tggacgccat 241 catcgaggag gtggatgagg acggcagcgg caccatcgac ttcgaggagt tcttggtcat 301 gatggtgcgc cagatgaaag aggacgcgaa agggaagagc gaggaggagc tggccgagtg 361 cttccgcatc ttcgacagga atgcagacgg ctacatcgac ccggaggagc tggctgagat 421 tttcagggcc tccggggagc acgtgactga cgaggagatc gaatctctga tgaaagacgg 481 cgacaagaac aacgacggcc gcattgactt cgacgagttc ctgaagatga tggagggcgt 541 gcagtaagga gtggacagtc gcctctacca agatcgcgtg tccctagggt gtgggagact 601 ccgccctgcc gggtccccac cagggaggcg cggccccttg tgggtctttg tctggaagga 661 ataaaagcaa atgttcc // LOCUS HSTCACT 2606 bp RNA PRI 26-AUG-1997 DEFINITION H.sapiens mRNA for novel T-cell activation protein. ACCESSION X94232 NID g1292867 KEYWORDS T-cell activation protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2606) AUTHORS Renner,C., Pfitzenmeier,J.P., Gerlach,K., Held,G., Ohnesorge,S., Sahin,U., Bauer,S. and Pfreundschuh,M. TITLE RP1, a new member of the adenomatous polyposis coli-binding EB1-like gene family, is differentially expressed in activated T cells JOURNAL J. Immunol. 159 (3), 1276-1283 (1997) MEDLINE 97376852 REFERENCE 2 (bases 1 to 2606) AUTHORS Renner,C. TITLE Direct Submission JOURNAL Submitted (13-DEC-1995) C. Renner, University of the Saarland, Medical Dep.I, Oscar-Orth-Str., Homburg, D-66424 Homburg, FRG FEATURES Location/Qualifiers source 1..2606 /organism="Homo sapiens" /strain="Caucasian" /db_xref="taxon:9606" /cell_type="T-cell" gene 141..1124 /gene="RP1" CDS 141..1124 /gene="RP1" /codon_start=1 /product="t-Cell activation protein" /db_xref="PID:e218521" /db_xref="PID:g1292868" /translation="MPGPTQTLSPNGENNNDIIQDNNGTIIPFRKHTVRGERSYSWGM AVNVYSTSITQETMSRHDIIAWVNDIVSLNYTKVEQLCSGAAYCQFMDMLFPGCISLK KVKFQAKLEHEYIHNFKLLQASFKRMNVDKVIPVEKLVKGRFQDNLDFIQWFKKFYDA NYDGKEYDPVEARQGQDAIPPPDPGEQIFNLPKKSHHANSPTAGAAKSSPAAKPGSTP SRPSSAKRASSSGSASKSDKDLETQVIQLNEQVHSLKLALEGVEKERDFYFGKLREIE LLCQEHGQENDDLVQRLMDILYASEEHEGHTEEPEAEEQAHEQQPPQQEEY" BASE COUNT 726 a 641 c 544 g 695 t ORIGIN 1 ctagaattca gcggccgctg aattctagcg agcaggcggc aggcacggtc cgtgcggaga 61 ggcgagcgag cgggaagacg cagccacctt cctcaccagc cagcccacag cggtttgttc 121 cccttctcgg gagtgcgcca atgcctgggc cgacccaaac cctgtcccca aatggcgaga 181 acaacaacga catcatccag gataataacg ggaccatcat tcctttccgg aagcacacag 241 tgcgcgggga gcgttcctac agttggggaa tggcggtcaa tgtgtattct acctcgataa 301 cccaagagac tatgagcaga catgacatca ttgcatgggt taatgacata gtatctttaa 361 actacacaaa agtggaacag ctttgttcag gagcggccta ttgccaattc atggacatgc 421 tcttccctgg ctgcattagt ttgaagaaag taaaatttca agcaaagctg gaacatgaat 481 atattcacaa ttttaaactt ctgcaagcat catttaagcg aatgaacgtt gataaggtaa 541 ttccagtgga gaagctagtg aaaggacgtt tccaggacaa cctggatttt attcaatggt 601 ttaagaaatt ctatgatgct aactacgatg ggaaggagta tgatcctgta gaggcacgac 661 aagggcaaga tgcaattcct cctcctgacc ctggtgaaca gatcttcaac ctgccaaaaa 721 agtctcacca tgcaaactcc cccacagcag gtgcagctaa atcaagtcca gcagctaaac 781 caggatccac accttctcga ccctcatcag ccaaaagggc ttcttccagt ggctcagcat 841 ccaaatccga taaagattta gaaacgcagg tcatacagct taatgaacag gtacattcat 901 taaaacttgc ccttgaaggc gtggaaaagg aaagggattt ctactttggg aagttgagag 961 agatcgagct actctgccaa gaacacgggc aggaaaatga tgacctcgtg cagagactaa 1021 tggacatcct gtatgcttca gaagaacacg agggccacac agaagagccg gaagcagagg 1081 agcaagccca cgaacagcag cccccgcagc aggaagagta ctgacccacc ccggctgctc 1141 ttgacacttc cattgtgtgt gggaacgttt cttctggaga attggaacat gtgtggcccc 1201 aagctcaaca gaaaccagtt gttcccaatc tgccgttacc atcaacgcac tgttgcatat 1261 gccagccact gcgcttggtt cccattttct ttgctaaggt gtattagcgg acggccctct 1321 ggccacctac ccgagagatc gtagggtcac attcatccaa cttcaccact tggctgcttg 1381 agattggttc tgctcttttc ttcattcctt tccagaacaa ctctttccca ccccaacacc 1441 actgccacca cccctctttt tatcctggtg tgaaacaatg gtaatttgat atatggtatt 1501 tatattggca tttttcaacc cagtgtcact agatgtcaca cacatttgtg gtgctttgat 1561 gtttgcaagt ctaacctctg aacataaatt tggtcaaata attggaacaa agggaaacag 1621 atacttgata tgaaagccat aatgacggtg acttgtgtcg tgggggaaaa cataaggtca 1681 ttttctccct ctactcacaa tactaaaggg aaaaaatgga ttcaaagcta ggatttcagg 1741 gcccagcagt gttcctccat cagcatgtta gacaactaca cagtatgttg ttagttttga 1801 aagacattca ctcaaggaaa acaccatctc aactttgccc gctcaccatg tcccttgccc 1861 ccatgtagcc catttcccag gttatgctct tttctttctc agggtcctct ttggtgggca 1921 gccactcccc gagatgttgc catcagtttt ctgcagtcca aagagggtat ggttaggtac 1981 gggtcttcct gcctcattcc tcttcctctt tgtgtaggtt tcagccacaa aactgtcatt 2041 cactctaggg gacccctact aaagggtaac ttcaggtgtg cagccctgag ctccaaggct 2101 ctgcaccatg ccacacactt gctgtaaggc tagaagtgaa gaccttatta ataggagcat 2161 aattgcgagg gagaatcatg gttctgcagt ctggtgtaga cactggaata acagcacaga 2221 aaaatctatg actcccaata tcttctagaa taaagaattt tccctcttta acacaagggc 2281 cctccttgtc attgacctta gctaaaccat ggcaattcat aaatagagga aacattaatg 2341 aattaaaagc attccttatt ttttaactaa tatttgtaca ttttcttagt ctctttccaa 2401 gtctttgcct cttttttttc tttattttta ttttttcctt tgacagatgg tatcccttcc 2461 tggatcattc atttcacctt ggtttctaac tttaggttta ctttcacttg ttatttgact 2521 tagcaggtgc aacaaaaaca agaaacaaat gtgcccaccc cactttccgc ttaactgaaa 2581 agcttaaaat aaatttccga attatg // LOCUS HSTCARA 1064 bp RNA PRI 06-DEC-1992 DEFINITION H.sapiens mRNA for T-cell antigen receptor alpha-chain. ACCESSION X63455 NID g36730 KEYWORDS influenza haemagglutinin peptide specific; T-cell antigen receptor (alpha chain); V alpha 1.2. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1064) AUTHORS Hewitt,C.R.A. TITLE Direct Submission JOURNAL Submitted (09-DEC-1991) C.R.A. Hewitt, St. Mary's Hospital Medical School, Dept.Of Immunology, Norfolk Place, Paddington, London W21PG, UK REFERENCE 2 (bases 1 to 1064) AUTHORS Hewitt,C.R., Lamb,J.R., Hayball,J., Hill,M., Owen,M.J. and O'Hehir,R.E. TITLE Major histocompatibility complex independent clonal T cell anergy by direct interaction of Staphylococcus aureus enterotoxin B with the T cell antigen receptor JOURNAL J. Exp. Med. 175 (6), 1493-1499 (1992) MEDLINE 92268797 FEATURES Location/Qualifiers source 1..1064 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T lymphocyte" /cell_line="HA1.7" /clone="UB alpha 14/4" 5'UTR 1..89 sig_peptide 90..149 CDS 90..917 /codon_start=1 /product="T cell antigen receptor alpha chain" /db_xref="PID:g36731" /translation="MLLLLVPVLEVIFTLGGTRAQSVTQLGSHVSVSEGALVLLRCNY SSSVPPYLFWYVQYPNQGLQLLLKYTSAATLVKGINGFEAEFKKSETSFHLTKPSAHM SDAAEYFCAVSESPFGNEKLTFGTGTRLTIIPNIQNPDPAVYQLRDSKSSDKSVCLFT DFDSQTNVSQSKDSDVYITDKTVLDMRSMDFKSNSAVAWSNKSDFACANAFNNSIIPE DTFFPSPESSCDVKLVEKSFETDTNLNFQNLSVIGFRILLLKVAGFNLLMTLRLWSS" mat_peptide 150..914 /product="T cell antigen receptor alpha chain" misc_feature 150..427 /note="V region" misc_feature 428..488 /note="J /N region" misc_feature 489..917 /note="C region" 3'UTR 918..1064 BASE COUNT 258 a 307 c 240 g 259 t ORIGIN 1 tgatgggctg caggaattcg attgaggctc aggcgccttg gcttctgtcc gctctgctca 61 gggccctcca gcgtggccac tgctcagcca tgctcctgct gctcgtccca gtgctcgagg 121 tgatttttac cctgggagga accagagccc agtcggtgac ccagcttggc agccacgtct 181 ctgtctctga aggagccctg gttctgctga ggtgcaacta ctcatcgtct gttccaccat 241 atctcttctg gtatgtgcaa taccccaacc aaggactcca gcttctcctg aagtacacat 301 cagcggccac cctggttaaa ggaatcaacg gttttgaggc tgaatttaag aagagtgaaa 361 cctccttcca cctgacgaaa ccctcagccc atatgagcga cgcggctgag tacttctgtg 421 ctgtgagtga gtctccattt ggaaatgaga aattaacctt tgggactgga acaagactca 481 ccatcatacc caatatccag aaccctgacc ctgccgtgta ccagctgaga gactctaaat 541 ccagtgacaa gtctgtctgc ctattcaccg attttgattc tcaaacaaat gtgtcacaaa 601 gtaaggattc tgatgtgtat atcacagaca aaactgtgct agacatgagg tctatggact 661 tcaagagcaa cagtgctgtg gcctggagca acaaatctga ctttgcatgt gcaaacgcct 721 tcaacaacag cattattcca gaagacacct tcttccccag cccagaaagt tcctgtgatg 781 tcaagctggt cgagaaaagc tttgaaacag atacgaacct aaactttcaa aacctgtcag 841 tgattgggtt ccgaatcctc ctcctgaaag tggccgggtt taatctgctc atgacgctgc 901 ggctgtggtc cagctgagat ctgcaagatt gtaagacagc ctgtgctccc tcgctccttc 961 ctctgcattg cccctcttct ccctctccaa acagagggaa ctctcctacc cccaaggagg 1021 tgaaagctgc taccacctct gtgccccccc ggcaatgcca ccaa // LOCUS HSTCARB 1053 bp RNA PRI 01-DEC-1993 DEFINITION H.sapiens mRNA for T-cell antigen receptor beta-chain. ACCESSION X63456 S35963 NID g36732 KEYWORDS influenza haemagglutinin peptide specific; T-cell antigen receptor (beta chain); V beta 3.1. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1053) AUTHORS Hewitt,C.R.A. TITLE Direct Submission JOURNAL Submitted (09-DEC-1991) C.R.A. Hewitt, St. Mary's Hospital Medical School, Dept.Of Immunology, Norfolk Place, Paddington, London W21PG, UK REFERENCE 2 (bases 1 to 1053) AUTHORS Hewitt,C.R., Lamb,J.R., Hayball,J., Hill,M., Owen,M.J. and O'Hehir,R.E. TITLE Major histocompatibility complex independent clonal T cell anergy by direct interaction of Staphylococcus aureus enterotoxin B with the T cell antigen receptor JOURNAL J. Exp. Med. 175 (6), 1493-1499 (1992) MEDLINE 92268797 FEATURES Location/Qualifiers source 1..1053 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T lymphocyte" /cell_line="HA1.7" /clone="CUB beta 1.5" 5'UTR 1..10 sig_peptide 11..67 CDS 11..940 /codon_start=1 /product="T cell antigen receptor beta chain" /db_xref="PID:g36733" /translation="MGIRLLCRVAFCFLAVGLVDVKVTQSSRYLVKRTGEKVFLECVQ DMDHENMFWYRQDPGLGLRLIYFSYDVKMKEKGDIPEGYSVSREKKERFSLILESAST NQTSMYLCASSSTGLPYGYTFGSGTRLTVVEDLNKVFPPEVAVFEPSEAEISHTQKAT LVCLATGFFPDHVELSWWVNGKEVHSGVSTDPQPLKEQPALNDSRYCLSSRLRVSATF WQNPRNHFRCQVQFYGLSENDEWTQDRAKPVTQIVSAEAWGRADCGFTSVSYQQGVLS ATILYEILLGKATLYAVLVSALVLMAMVKRKDF" mat_peptide 68..937 /product="T cell antigen receptor beta chain" misc_feature 68..346 /note="V region" misc_feature 347..409 /note="J /D /N region" misc_feature 410..940 /note="C region" 3'UTR 941..1053 BASE COUNT 243 a 284 c 291 g 235 t ORIGIN 1 caaagcagcc atgggaatca ggctcctctg tcgtgtggcc ttttgtttcc tggctgtagg 61 cctcgtagat gtgaaagtaa cccagagctc gagatatcta gtcaaaagga cgggagagaa 121 agtttttctg gaatgtgtcc aggatatgga ccatgaaaat atgttctggt atcgacaaga 181 cccaggtctg gggctacggc tgatctattt ctcatatgat gttaaaatga aagaaaaagg 241 agatattcct gaggggtaca gtgtctctag agagaagaag gagcgcttct ccctgattct 301 ggagtccgcc agcaccaacc agacatctat gtacctctgt gccagcagtt cgacagggtt 361 gccctatggc tacaccttcg gttcggggac caggttaacc gttgtagagg acctgaacaa 421 ggtgttccca cccgaggtcg ctgtgtttga gccatcagaa gcagagatct cccacaccca 481 aaaggccaca ctggtgtgcc tggccacagg cttcttcccc gaccacgtgg agctgagctg 541 gtgggtgaat gggaaggagg tgcacagtgg ggtcagcaca gacccgcagc ccctcaagga 601 gcagcccgcc ctcaatgact ccagatactg cctgagcagc cgcctgaggg tctcggccac 661 cttctggcag aacccccgca accacttccg ctgtcaagtc cagttctacg ggctctcgga 721 gaatgacgag tggacccagg atagggccaa acccgtcacc cagatcgtca gcgccgaggc 781 ctggggtaga gcagactgtg gctttacctc ggtgtcctac cagcaagggg tcctgtctgc 841 caccatcctc tatgagatcc tgctagggaa ggccaccctg tatgctgtgc tggtcagcgc 901 ccttgtgttg atggccatgg tcaagagaaa ggatttctga aggcagccct ggaagtggag 961 ttaggagctt ctaacccgtc atggttcaat acacattctt cttttgccag cgcttctgaa 1021 gagctgctct cacctctctg catcccaata gat // LOCUS HSTCF1A 1254 bp RNA PRI 14-JUN-1991 DEFINITION Human TCF-1 mRNA for T cell factor 1 (splice form A). ACCESSION X59869 X55327 NID g36785 KEYWORDS DNA-binding protein; HMG box; T cell factor 1; TCF-1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1254) AUTHORS Van de Wetering,M. TITLE Direct Submission JOURNAL Submitted (28-MAY-1991) M. Van De Wetering, Dept of Clinical Immunology, University Hospital, P.O. Box 85500, 3508 GA Utrecht, The Netherlands REFERENCE 2 (bases 1 to 1254) AUTHORS van de Wetering,M., Oosterwegel,M., Dooijes,D. and Clevers,H. TITLE Identification and cloning of TCF-1, a T lymphocyte-specific transcription factor containing a sequence-specific HMG box JOURNAL EMBO J. 10 (1), 123-132 (1991) MEDLINE 91114695 COMMENT See also X59869-X59871. FEATURES Location/Qualifiers source 1..1254 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T lymphocyte" /cell_line="Jurkat" /clone_lib="cDNA" /clone="fTCF-1a" mRNA 1..1254 /gene="TCF-1" /evidence=experimental gene 1..1254 /gene="TCF-1" CDS 80..889 /gene="TCF-1" /codon_start=1 /product="T cell factor 1, splice form A" /db_xref="PID:g36786" /db_xref="SWISS-PROT:P36402" /translation="MYKETVYSAFNLLMHYPPPSGAGQHPQPQPPLHKANQPPHGVPQ LSLYEHFNSPHPTPAPADISQKQVHRPLQTPDLSGFYSLTSGSMGQLPHTVSWFTHPS LMLGSGVPGHPAAIPHPAIVPPSGKQELQPFDRNLKTQAESKAEKEAKKPTIKKPLNA FMLYMKEMRAKVIAECTLKESAAINQILGRRWHALSREEQAKYYELARKERQLHMQLY PGWSARDNYGKKKRRSREKHQESTTETNWPRELKDGNGQESLSMSSSSSPA" misc_feature 539..769 /gene="TCF-1" /note="HMG box" misc_feature 810 /gene="TCF-1" /note="alternative splice site" BASE COUNT 320 a 391 c 335 g 208 t ORIGIN 1 gcccaggtga ctgactaatc cgccgccttc aggagacaga attggccaag gcctgaaggc 61 cccggagtgc accagcggca tgtacaaaga gaccgtctac tccgccttca atctgctcat 121 gcattaccca cccccctcgg gagcagggca gcacccccag ccgcagcccc cgctgcacaa 181 ggccaatcag cccccccacg gtgtccccca actctctctc tacgaacatt tcaacagccc 241 acatcccacc cctgcacctg cggacatcag ccagaagcaa gttcacaggc ctctgcagac 301 ccctgacctc tctggcttct actccctgac ctcaggcagc atggggcagc tcccccacac 361 tgtgagctgg ttcacccacc catccttgat gctaggttct ggtgtacctg gtcacccagc 421 agccatcccc cacccggcca ttgtgccccc ctcagggaag caggagctgc agcccttcga 481 ccgcaacctg aagacacaag cagagtccaa ggcagagaag gaggccaaga agccaaccat 541 caagaagccc ctcaatgcct tcatgctgta catgaaggag atgagagcca aggtcattgc 601 agagtgcaca cttaaggaga gcgctgccat caaccagatc ctgggccgca ggtggcacgc 661 gctgtcgcga gaagagcagg ccaagtacta tgagctggcc cgcaaggaga ggcagctgca 721 catgcagcta tacccaggct ggtcagcgcg ggacaactac gggaagaaga agaggcggtc 781 gagggaaaag caccaagaat ccaccacaga gacaaactgg cccagagaac tcaaggatgg 841 taatggacaa gagtcactgt ccatgtcttc ttcctctagc ccagcttgag gactgggatg 901 gctgggcaag gaagccatag gcattgcggc cccttgcctt ggtgcagatg tgagtcccac 961 aaacacatct ggagaagctc aaaggccggg actgggagat gactcccttg gaagacagga 1021 gagatgactc ccttggaaga cagatgacag cccataggcc tagtgacaaa aggccccttt 1081 ccgaccttgt ggctgttctg ggaactgcac ctgtcctagg tctgggccag accaagcaga 1141 atggcagtct gaggacactg acttaccacc caagtcccag gaagagagga caaggaatca 1201 gccaggcctg tgcaaaggca gcattttttg gttgtggagt atgactatga attc // LOCUS HSTCL1 1324 bp RNA PRI 13-MAY-1995 DEFINITION H.sapiens mRNA for Tcell leukemia/lymphoma 1. ACCESSION X82240 NID g624960 KEYWORDS TCL1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1324) AUTHORS Virgilio,L., Narducci,M.G., Isobe,M., Billips,L.G., Cooper,M.D., Croce,C.M. and Russo,G. TITLE Identification of the TCL1 gene involved in T-cell malignancies JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91 (26), 12530-12534 (1994) MEDLINE 95107991 REFERENCE 2 (bases 1 to 1324) AUTHORS Russo,G. TITLE Direct Submission JOURNAL Submitted (19-OCT-1994) G. Russo, Raggio-Italgene SpA, Via delle Antille 29, 00040 Pomezia, Rome, ITALY FEATURES Location/Qualifiers source 1..1324 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="ALL1" /clone="pAL1.5" /chromosome="14" /map="q32" mRNA join(1..165,166..342,343..396,397..1312) gene 46..390 /gene="TCL1" CDS 46..390 /gene="TCL1" /codon_start=1 /product="T cell leukemia/lymphoma 1" /db_xref="PID:g624961" /translation="MAECPTLGEAVTDHPDRLWAWEKFVYLDEKQHAWLPLTIEIKDR LQLRVLLRREDVVLGRPMTPTQIGPSLLPIMWQLYPDGRYRSSDSSFWRLVYHIKIDG VEDMLLELLPDD" polyA_signal 1237..1242 BASE COUNT 293 a 376 c 348 g 307 t ORIGIN 1 cttgagaggc tctggctctt gcttcttagg cggcccgagg acgccatggc cgagtgcccg 61 acactcgggg aggcagtcac cgaccacccg gaccgcctgt gggcctggga gaagttcgtg 121 tatttggacg agaagcagca cgcctggctg cccttaacca tcgagataaa ggataggtta 181 cagttacggg tgctcttgcg tcgggaagac gtcgtcctgg ggaggcctat gacccccacc 241 cagataggcc caagcctgct gcctatcatg tggcagctct accctgatgg acgataccga 301 tcctcagact ccagtttctg gcgcttagtg taccacatca agattgacgg cgtggaggac 361 atgcttctcg agctgctgcc agatgactga tgtatggtct tggcagcacc tgtctccttt 421 caccccaggg cctgagcctg gccagcctac aatggggatg ttgtgtttct gttcaccttc 481 gtttactatg cctgtgtctt ctccaccacg ctggggtctg ggaggaatgg acagacagag 541 gatgagctct acccagggcc tgcaggacct gcctgtagcc cactctgctc gccttagcac 601 taccactcct gccaaggagg attccatttg gcagagcttc ttccaggtgc ccagctatac 661 ctgtgcctcg gcttttctca gctggatgat ggtcttcagc ctctttctgt cccttctgtc 721 cctcacagca ctagtatttc atgttgcaca cccactcagc tccgtgaact tgtgagaaca 781 cagccgattc acctgagcag gacctctgaa accctggacc agtggtctca catggtgcta 841 cgcctgcatg taaacacgcc tgcaaacgct gcctgccggt aaacacgcct gcaaacgctg 901 cctgcccgta aacacgcctg caaacgctgc ctgcccacac aggttcacgt gcagctcaag 961 gaaaggcctg aaaggagccc ttatctgtgc tcaggactca gaagcctctg ggtcagtggt 1021 ccacatcccg ggacgcagca ggaggccagg ccggcgagcc ctgtggatga gccctcagaa 1081 cccttggctt gcccacgtgg aaaagggata gaggttgggt ttcccccctt tatagatggt 1141 cacgcacctg ggtgttacaa agttgtatgt ggcatgaata ctttttgtaa tgattgatta 1201 aatgcaagat agtttatcta acttcgtgcg caatcagctt ctatccttga cttagattct 1261 ggtggagaga agtgagaata ggcagccccc aaataaaaaa tattcatgga aaaaaaaaaa 1321 aaaa // LOCUS HSTCRBV 77743 bp DNA PRI 20-APR-1994 DEFINITION Human V beta T-cell receptor (TCRBV) gene locus. ACCESSION U03115 NID g467918 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 77743) AUTHORS Slightom,J.L., Siemieniak,D.R., Sieu,L.C., Koop,B.F. and Hood,L. TITLE Nucleotide sequence analysis of 77.7 kb of the human V beta T-cell receptor gene locus: direct primer-walking using cosmid template DNAs JOURNAL Genomics 20, 149-168 (1994) MEDLINE 94292194 REFERENCE 2 (bases 1 to 77743) AUTHORS Slightom,J.L. TITLE Direct Submission JOURNAL Submitted (04-NOV-1993) Jerry L. Slightom, Molecular Biology Unit, The Upjohn Company, 301 Henrietta Street, Kalamazoo, MI 49007, USA FEATURES Location/Qualifiers source 1..77743 /organism="Homo sapiens" /strain="adult cell cultures or sperm fibroblast" /db_xref="taxon:9606" /clone="cosmid clones H7.1, H12.18, and H130.1" exon <1..>253 /gene="TCRVB6S1" /number=2 CDS <1..>253 /gene="TCRVB6S1" /codon_start=2 /product="T-cell receptor beta chain V region precursor" /db_xref="PID:g467919" /translation="ITKRGQNVTFRCDPISEHNRLYWYRQTLGQGPEFLTYFQNEAQL EKSRLLSDRFSAERPKGSFSTLEIQRTEQGDSAMYLCASS" gene 1..253 /gene="TCRVB6S1" repeat_unit complement(3484..3772) /rpt_family="Alu Class Sx" gene 6305..6783 /gene="TCRBV23S1" gene 6305..6383 /gene="TCRVB23S1" exon 6305..6383 /gene="TCRVB23S1" /number=1 CDS join(6305..6383,6494..>6783) /gene="TCRBV23S1" /codon_start=1 /product="T-cell receptor beta chain V region precursor" /db_xref="PID:g467920" /translation="MLSPDLPDSAWNTRLLCHVMLCLLGAVSVAAGVIQSPRHLIKEK RETATLKCYPIPRHDTVYWYQQGPGQDPQFLISFYEKMQSDKGSIPDRFSAQQFSDYH SELNMSSLELGDSALYFCASS" intron 6384..6393 /gene="TCRBV23S1" exon 6494..6783 /gene="TCRBV23S1" /number=2 repeat_unit complement(7501..7650) /rpt_family="Alu Class j" repeat_unit complement(8654..8942) /rpt_family="Alu Class sb" repeat_unit 11357..11686 /rpt_family="Alu Class Sx" exon 14717..14765 /gene="TCRBV12S2" /number=1 gene 14717..15161 /gene="TCRBV12S2" CDS join(14717..14765,14872..>15161) /gene="TCRBV12S2" /codon_start=1 /product="T-cell receptor beta chain V region precursor" /db_xref="PID:g467921" /translation="MGTRLFFYVALCLLWTGHMDAGITQSPRHKVTETGTPVTLRCHQ TENHRYMYWYRQDPGHGLRLIHYSYGVKDTDKGEVSDGYSVSRSKTEDFLLTLESATS SQTSVYFCAIS" intron 14766..14871 /gene="TCRBV12S2" /number=1 exon 14872..15161 /gene="TCRBV12S2" /number=2 repeat_unit 16004..16293 /rpt_family="Alu Class Sq" repeat_unit 16582..16867 /rpt_family="Alu Class Sc" repeat_unit 17843..18135 /rpt_family="Alu Class Sx" repeat_unit complement(18318..18677) /rpt_family="OFR" repeat_unit complement(20097..20939) /rpt_family="Kpn LINE" gene 25365..25797 /gene="TCRBV21S2" exon 25365..25413 /gene="TCRBV21S2" /number=1 CDS join(25365..25413,25505..>25797) /gene="TCRBV21S2" /codon_start=1 /product="T-cell receptor beta chain V region precursor" /db_xref="PID:g467922" /translation="MGTRLLCWVAFCLLVEELIEAGVVQSPRYKIIEKKQPVAFWCNP ISGHNTLYWYLQNLGQGPELLIRYENEEAVDDSQLPKDRFSAERLKGVDSTLKIQPAE LGDSAVYLCASS" intron 25414..25504 /gene="TCRBV21S2" /number=1 exon 25505..25797 /gene="TCRBV21S2" /number=2 repeat_region 29229..32539 /note="duplication unit for TCRBV8S1" repeat_unit complement(29358..29506) /rpt_family="MER26" gene 30967..31408 /gene="TCRBV8S1" CDS join(30967..31015,31116..>31408) /gene="TCRBV8S1" /codon_start=1 /product="T-cell receptor beta chain V region precursor" /db_xref="PID:g467923" /translation="MDSWTFCCVSLCILVAKHTDAGVIQSPRHEVTEMGQEVTLRCKP ISGHNSLFWYRQTMMRGLELLIYFNNNVPIDDSGMPEDRFSAKMPNASFSTLKIQPSE PRDSAVYFCASS" exon 30967..31015 /gene="TCRBV8S1" /number=1 intron 31016..31115 /gene="TCRBV8S1" /number=1 exon 31116..31408 /gene="TCRBV8S1" /number=2 repeat_region 32540..35851 /note="duplication unit for TCRBV8S2" repeat_unit complement(32687..32813) /rpt_family="MER26" exon 34290..34338 /gene="TCRBV8S2" /number=1 gene 34290..34731 /gene="TCRBV8S2" CDS join(34290..34338,34439..>34731) /gene="TCRBV8S2" /codon_start=1 /product="T-cell receptor beta chain V region precursor" /db_xref="PID:g467924" /translation="MDSWTLCCVSLCILVAKHTDAGVIQSPRHEVTEMGQEVTLRCKP ISGHDYLFWYRQTMMRGLELLIYFNNNVPIDDSGMPEDRFSAKMPNASFSTLKIQPSE PRDSAVYFCASS" intron 34339..34438 /gene="TCRBV8S2" /number=1 exon 34439..34731 /gene="TCRBV8S2" /number=2 repeat_unit 37904..38479 /rpt_family="LTR1" repeat_unit complement(39556..39850) /rpt_family="Alu Class Sp" repeat_unit 41297..43315 /rpt_family="Kpn LINE" repeat_unit 43316..44688 /rpt_family="Kpn LINE" repeat_unit 44689..44977 /rpt_family="Alu Class Sb" repeat_unit complement(45442..45704) /rpt_family="Alu Class Sx" repeat_unit 46193..46544 /rpt_family="LTR5" repeat_unit 47446..47743 /rpt_family="Alu Class J" repeat_unit 48507..48797 /rpt_family="Alu Class Sq" exon 51481..51529 /gene="TCRBV8S3" /number=1 gene 51481..51922 /gene="TCRBV8S3" CDS join(51481..51529,51630..>51922) /gene="TCRBV8S3" /codon_start=1 /product="T-cell receptor beta chain V region precursor" /db_xref="PID:g467925" /translation="MATRLLCCVVLCLLGEELIDARVTQTPRHKVTEMGQEVTMRCQP ILGHNTVFWYRQTMMQGLELLAYFRNRAPLDDSGMPKDRFSAEMPDATLATLKIQPSE PRDSAVYFCASG" intron 51530..51629 /gene="TCRBV8S3" /number=1 exon 51630..51922 /gene="TCRBV8S3" /number=2 exon 58424..58472 /gene="TCRBV16S1" /number=1 gene 58424..58851 /gene="TCRBV16S1" CDS join(58424..58472,58559..>58851) /gene="TCRBV16S1" /codon_start=1 /product="T-cell receptor beta chain V region precursor" /db_xref="PID:g467926" /translation="MVSRLLSLVSLCLLGAKHIEAGVTQFPSHSVIEKGQTVTLRCDP ISGHDNLYWYRRVMGKEIKFLLHFVKESKQDESGMPNNRFLAERTGGTYSTLKVQPAE LEDSGVYFCASS" intron 58473..58558 /gene="TCRBV16S1" /number=1 exon 58559..58851 /gene="TCRBV16S1" /number=2 repeat_unit complement(59599..59887) /rpt_family="Alu Class Sx" repeat_unit complement(59895..60184) /rpt_family="Alu Class J" repeat_unit 61004..61160 /rpt_family="MER32" exon 63499..63547 /gene="TCRBV24S1" /number=1 gene 63499..63962 /gene="TCRBV24S1" CDS join(63499..63547,63673..>63962) /gene="TCRBV24S1" /codon_start=1 /product="T-cell receptor beta chain V region precursor" /db_xref="PID:g467927" /translation="MGPGLLHWMALCLLGTGHGDAMVIQNPRYQVTQFGKPVTLSCSQ TLNHNVMYWYQQKSSQAPKLLFHYYDKDFNNEADTPDNFQSRRPNTSFCFLDIRSPGL GDTAMYLCATS" intron 63548..63672 /gene="TCRBV24S1" /number=1 exon 63673..63962 /gene="TCRBV24S1" /number=2 exon 68508..68556 /gene="TCRBV25S1" /number=1 gene 68508..68956 /gene="TCRBV25S1" CDS join(68508..68556,68664..68764) /gene="TCRBV25S1" /codon_start=1 /product="T-cell receptor beta chain V region precursor" /db_xref="PID:g467928" /translation="MSPIFTCITILCLLAAGSPGEEVAQTPKHLVRGEGQKAKLYCAP IKGHS" intron 68557..68663 /gene="TCRBV25S1" /number=1 exon 68664..68956 /gene="TCRBV25S1" /number=2 repeat_unit 69176..69464 /rpt_family="Alu Class Sb" repeat_unit 69996..70285 /rpt_family="Alu Class Sp" repeat_unit 71380..71540 /rpt_family="Alu Class Sx" exon 72124..72172 /gene="TCRBV26S1" /number=1 gene 72124..72853 /gene="TCRBV26S1" CDS join(72124..72172,72564..>72853) /gene="TCRBV26S1" /codon_start=1 /product="T-cell receptor beta chain V region precursor" /db_xref="PID:g467929" /translation="MDIWLLCWVTLCLLAAGHSEPGVSQTPRHKVTNMGQEVILRCDP SSGHMFVHWYRQNLRQEMKLLISFQYQNIAVDSGMPKERFTAERPNGTSSTLKIHPAE PRDSAVYLYSS" intron 72173..72563 /gene="TCRBV26S1" /note="intron 1 + Alu element" repeat_unit 72215..72505 /gene="TCRBV26S1" /rpt_family="Alu Class J" exon 72564..72853 /gene="TCRBV26S1" /number=2 repeat_unit 76008..76298 /rpt_family="Alu Class Sp" repeat_unit 77278..77447 /rpt_family="Alu Class J" repeat_unit 77507..77743 /rpt_family="Alu Class Sb" BASE COUNT 23599 a 16173 c 15566 g 22405 t ORIGIN 1 gatcacaaag aggggacaga atgtaacttt caggtgtgat ccaatttctg aacacaaccg 61 cctttattgg taccgacaga ccctggggca gggcccagag tttctgactt acttccagaa 121 tgaagctcaa ctagaaaaat caaggctgct cagtgatcgg ttctctgcag agaggcctaa 181 gggatctttc tccaccttgg agatccagcg cacagagcag ggggactcgg ccatgtatct 241 ctgtgccagc agcttagcca cagcatggca cagtcgcctc cttcctgctc acaaaccctc 301 aggcacttac ttctccttcc agctctcaga agccctgaac aaaggagctg ccctgctctt 361 tcctcagcaa ggagaatgaa tgcatttgga actgcaggtg ttcttctgat actaggaggt 421 cagaaaataa cctctgaaat acaggaacag ggaatactgg gtagtaataa ttttgactta 481 tggatttctg ggattcctta tatatagttc aaatttccat aattaggata taacagagct 541 tagtctcatg gatagtgact aagtaaatat tctcttatag aactatgaag tttcagcaca 601 tttatattaa actactgtta ccacatgtca ccaactcaga cctataatct accaaggagt 661 ggggaagcca aacgcaaaca ctgctaaggc catttacagc taccaccctt ggaagaaaat 721 tggaatcttt agtaagaatt tccatcaaaa tcaatttttt agaaataact attacccaca 781 ataatgcaca ttaactcatg ttcatacatt gctttttttc cttctgaaaa tcttgtagtc 841 attctatgtt taatttcctt cttctccagt ctaacttaaa aaaatagaaa tacaaattga 901 gattgtttca taagataaac cccacctggg aaggtgttaa aaggacacaa ttcaaagaaa 961 acctgaataa aattatatca accacgtagg attctttgag gtttccaaag aagatgtacc 1021 agtttacatt cccacgagag gtatatgagt tttggatgga attttgtttt ttctctctct 1081 ctctctgtag attagttttt ccctctacta atagtttctc ttgacaaaca gaagttctta 1141 agtgacaatc attccatgtt tctcccattt cagttagtgt tttgtatttt ggtgtccact 1201 ttaggaaatc cttgcttact tcaaaatgat catgttttct tacatttcct tctaaaatct 1261 ttatattttc cacttttcat acttagactc acaatcgaac tggaattaat tctttgtagg 1321 tgatatgagg aagaggtcaa gttctatttt tccaaatgat tattcaattt acccaatact 1381 atttaagtgg taatagtttc accatcactc gataatatca cttttgtcat aaatccgtca 1441 ttatacatgt gtggcctttt ttctcggatc tctattctgt tacatcggtc tgttctgtca 1501 gttctcgcac tgataatatg ctgtcttact acagcttcat ccttattctt aatatctggt 1561 agaataattt ctcaagcttt gctccttttt atcaagtgtt cttgagcact cttgacactt 1621 tttttccata tacattttag aatcagcttg ctaaattatt caggaaaaag ataacaactg 1681 ctgggatttt tttttgtttt tgtttgtttc caactgattt aagtgcatca cagaacaaag 1741 ctaaaaatat taatagaaat ataaaaatat ctaccaccca aaaatataaa atacaaaata 1801 tctggcatca atcaaaaatt acagacctgc aaagaagcat gaaaatagga tctataatga 1861 agagaaggat taattaatag gaaccaactc agaaatgaca aattggcaga caaagacagt 1921 caaacagtta ctataactct ggtctgtaag ttcaaagttt tgttgagatg gggaaaggca 1981 taatgagatc cagacaaagc ttctagagat aaaaattaca atgtctgtga tgaaaaatac 2041 gctggatgaa gttagcatca gattagacat tacaggataa aatattaata aacttgaaga 2101 aataacacaa acaattctta atgaaataca gagaataagt ggggggtcat cagtaaactg 2161 ggggacaact ttaagtagct taatatgtaa ttagagtccc aaaagcaaat ttttaaactt 2221 gaatttcttt aaaggataat tggctattta cactgaaata attccaacac agtgcaggga 2281 ttctaatata tgcaaacata aatctataac aagactagca taaagacagg tgaggaataa 2341 atgaacacat atatcattgg aaggtttcca taatctgagt gaagtcactt gaaggcagac 2401 tgcgttaaag tagagatgca tactgaaaat catagagcaa ccactaagat gacaaaacac 2461 agcattatag ttaataagcc aacacaatac agtgttataa aaatgctcaa ataatccata 2521 aaaacaacat aaaaagaaaa aaggaatata gaagagatgg gattgataga aaacaaatag 2581 aagatgatag acccaaaagt aaccacatca acaatcacat caaatctaaa tagtctaaaa 2641 gcacagattg tactattgaa tattatcgaa tcatatcaaa aaaacaacta tatgctgctt 2701 atcagaatac tatgaagact cagaatgtaa aaaaatataa aagtatggaa aaagatgtac 2761 tatgccaaca ccaattaaag gaaagccaga ctagtaacgt taatatcaaa caaaataaat 2821 ttcaagcaga gaatatcatc tgagataatg aaagtcatat cctaatgaca aaagtattgg 2881 tctatcaaga ggttgtaaca atcaaatatt tacgtacctg aaaatggatc ttcaaaatcc 2941 caaggcaaaa acagaactgc aaggagaaac agacaaatcc acaattataa tcagagattt 3001 caatagccct cttttagtaa ctggtttaac aagttgacaa aaacaaaaga caactaagga 3061 tacagaagat atgaactaca ccaacaacca acttgactta attgacgttt acagaacact 3121 tcatccaaca gcagcagaat acatgttctc ttggagtgca cactgatttt gtgccagcat 3181 ggactacgta cactgggcca taaaacaagc atccatgcat taaaagtatt caagtcaaac 3241 caagtgcact cgctggtcac agtgaaactt cattaaaaat caataagaga aagatttctg 3301 gaaaagtccc gaatatttca aaattagata atagttgtct aaataactca taaagcaaaa 3361 aataaatcaa aagataaaag ataatttaca attattttaa tagtatattt tggtcatttt 3421 ttttttcttt ttgaccaaat actggtgaga atgaagataa gtaagacctt tcttttttat 3481 tttttttttt aacacggagt ctcaatctgt cactgaggct ggggtgcagt ggcatggtgt 3541 tggttcactg caacctccgt ctcctgggtt caagcaattc tcctgcctca gcctcccaaa 3601 cagctaggat tacaggtgcc caccaccatg ctcagctagt ttttatattt ttagcagtga 3661 cggggtttca tcatgttggg caggctggtc tcgaactcct gacctcaggt gatccacctg 3721 cctcggcctc ccaaagtgct ggggttacag gcataagcca ccacgcctgg cccaaatctt 3781 tcttttttgg ggtgagagta aattgtaaaa ccgtttttga tgagtaattt agcagcatac 3841 atcaaaagag ttaatactgc taacactcct tgacccagag tgttttttat ttaatgaatt 3901 tatttcaagg aaattgctat atatgcataa aatgtttgag aatgttcaac agaaaagaaa 3961 ttattttaaa ctgaacaaat ataaaaatat atcaaagttg gtatgatgct gccaaagcag 4021 gactgagggg aaactgaaga acaggggata attttctatt cattctgagg ccagaactac 4081 tctgatacca aaacgagact aagagattcc aagaaagctg ctgactaata tctctcagga 4141 acacagattc aatgatcctt ttaccattac agcctagaga atctgacaat atgtaagagg 4201 gatagtacaa catggtcaag tagggtttat ctcagtaatg cattgttggt ctgacattca 4261 aaaaacagac tatgtcattt accttatctg tatgaaaaat atgaaaagta tatgatcata 4321 tctatagata acttaaacat ttgataaaat tccatatcct ttcttgaaga aaactgttat 4381 caaactcaga gtaacaagaa attttcttca cctaaaaaag gacatttatg acatcctatg 4441 gttaaataga atatttttgc aagcaaactg ataagtcaaa gatgcatatg taattcccag 4501 atcaaatact agaaacagca aaaagaggat aaaatagcct tttcagcaaa tggttctaga 4561 aaatgggata atatggaaaa ataatgaatc tctagcctta tctcactcca taaccagatg 4621 tgggcaaact atggcctccg ggcaaatttg gcctttgtct gttttcatac agcccagaaa 4681 ctaacaaaaa ttttacactt ttaaatgatt gaaaaaaagt aaaaagtatt ttgttacata 4741 taaaaatcta agatattcaa atttcagtaa acataaatga agttctattg gatcatggcc 4801 acagtcattc gtgtatatat tgtcaatggt tgcttttgca ctacaatgac agaattgagt 4861 agtcgagaca aatactgcat gatgcacaaa actaaaaaat ttattatttc accctttaca 4921 ggaaaactgc tgacccttcc cataaacaaa atttagtaag agacacaaaa aaaatcccag 4981 ataatcgaaa aactagaaac tgaaagcttc ctggaagaaa acatagcaag ctctcttcat 5041 gatcatgggg taggcaaaga tttcttaagg acaaaagaat gttctaacca taaaaaaagc 5101 aaattgttct ttatgcaaat cccaaaaagt tcttcaaaat acattacgaa aaaataaatg 5161 tatatctgag aaagggctgg attcatatta taaagaattc ctacacatca atcataaaga 5221 aggcaattca ctaaaaacaa aaagttgact agagaattaa aaaaaaaaga tatataagta 5281 ggcagtttct tttcatgaga ctaatgtgtt tatttctcca gacttagatg attgtttcac 5341 tgatagcatg tcactgtcag ctgcctccag gaattatctt tggttaaagg gaatgctttg 5401 cccaggtctc accccattcc tatggcagct ggcatttaat gactggctga tatctccctt 5461 cctacaccat tatttgatga aatttaaaat ttaaattaaa atgaaaaatt tttctttctt 5521 gtttagaaag acatttgctt tatgtctccc tcaccaacct gtgcaaaatt tgagtttgag 5581 gcaggtaaac ctgaaatcag caagagctgt gctgtgtcct gagagatccc gatggtctaa 5641 tttaggaagc acccagtctc cctagtcagg gtccttagag cttcctcata gtgactacac 5701 actggccact tgggggcact gtggatccac tgagtgggtc actgatagcg ctgtctgaga 5761 gagagagggt taagggagac agtcttgatt tcgtctctgt atcagggata ttaccagatg 5821 aaccagaggt actgctggca caggaagagt taatcatagt cttatcagat caactgatat 5881 gcaacgtgta tttgtcatga caggaagcca tgaactgaag ccttgtgtgg cctctctagc 5941 aacctaggtt tgatggggag tgtgatgagg gtttcagcag ggcagcctgg ataacccaag 6001 ccaggtggag ataaaggagc atctgcctca gatgaaaatc tcagaagctg tgcggatggc 6061 tgtctgcccg gaagcctgcc ccctctgctc agcacagcta gcttcccctg ctctgcagga 6121 agctggacag gatgggggaa agcctgagtt agctgagcta gtcctggagt acagggtgcc 6181 tatgggagcc tacaggacga tgacatcaga acagtgacat cacagtaaaa acctccaaaa 6241 aacggtgagg aggagcaaaa gccctgcttt ctcaccccag gagaccagca acctgagcag 6301 ggagatgctt agtcctgacc tgcctgactc tgcctggaac accaggctcc tctgccatgt 6361 catgctttgt ctcctgggag caggtgagtc ctgagtacag gtgggacatc cctgtatcca 6421 cagtgttcaa ttgttgctga agtgtcaaac tctcccgagc tgagtcttca gcttctgtct 6481 ccttcctcca cagtttcagt ggctgctgga gtcatccagt ccccaagaca tctgatcaaa 6541 gaaaagaggg aaacagccac tctgaaatgc tatcctatcc ctagacacga cactgtctac 6601 tggtaccagc agggtccagg tcaggacccc cagttcctca tttcgtttta tgaaaagatg 6661 cagagcgata aaggaagcat ccctgatcga ttctcagctc aacagttcag tgactatcat 6721 tctgaactga acatgagctc cttggagctg ggggactcag ccctgtactt ctgtgccagc 6781 agcttaggca cagaccctgg agaattactg gctttctgta cccaaaccct cctatctcac 6841 ttgaggatgt aatagggaga aggaggtggg ggctgccaca caactttagc caagccccag 6901 agatgcttct attcttttct aacattttcc cctgccctgc tgagctcagt gagagctcct 6961 gcacttgtgg gctccagacc cactggaagt tctcacatct tagccagtac tttttaattc 7021 ctagcaagtg gcgggagctt ctactctgtg ccaacatagg ggtatatgtt ttagtgtgtt 7081 tcctgttgcc atacagaata cctgagactg ggtattttat aaagaaatga aatttgtttc 7141 ttacagctct ggaggctggg aagtccaagg tcaaggaccc acatctagtg aggaccttct 7201 tgctggtgag gactctgcag agtcccaggt ggcatagggc attacatatg agggggctca 7261 tgaaagatgg tcaaactggc ttttataaca gacccaacct cataactaac taattccttc 7321 tataacccat taatccatga atggattaac ctttatgagg gcagagtcct taagatccag 7381 tcaccttcca aaggtcctac ctctcaacgc tgctgcattg ggaaccaagt tttcaatata 7441 taaattattt aggaacacag tcaaatcata gcagtacaca ttgcgggttt tttttttttt 7501 tcttttttga gacagggtct cactctgtca cccaggctga agtgcagtgg tgctgtgatc 7561 atggctcatt gcaaccttga gcgcgtgggt tcaagcaatc ctcctgcctc agtggcctga 7621 gtagctatga ctacagggac acatcactac aaggtgtgtc cccagttcct cagttcctac 7681 aaggtgatgt gtccccttgc ctggctaatt ttttttaatt tttatttttg tagagatgag 7741 gccttgtcat gttgtccaga ttagtctcaa actcctcaaa tgatcctcct gagtttatgt 7801 gttttaatgc aaagatagac attagcatat ccactttata catgtagaag atttcagtgt 7861 accattttat agataggaag ctgtggctca tgaacttctc cagtatttca tatttccaag 7921 gatcaaaggc aggatccaaa cctacgactc catgcctcca aaataaaatt tctaaaattc 7981 tatagtgttt cctgggcctt ggtcaatgag ctacagcaga gcactattct tccatctcaa 8041 gactagccaa gcagtagaag aaggtcacat attatggaat gtggttacta tggcttatgg 8101 cctgccaacc tccaggacat ataacttcca gccagagaat atttgtcctg ttttagttcc 8161 atatagaatg gggccacaat atgtcctaat atagacatat agtgacaagg ggcatttcta 8221 atacatcaca gggttcagtc aaggaaacag aatcctctag gaattccaag gagaatagga 8281 gttaatacag ggaattagtg gttataaacc actggaaggc tgaaaagtgg gagggtctca 8341 gaaggttgaa acttgcgctc aggtccacca ctagtgatca caaagactga agttgctact 8401 tttgcccagg tcagggactg ctggaaatag ctgagagtca tagtggtctt gcagtgacca 8461 agagggtgat tcacaggaaa gtatccagag gtcattgcaa atccacctgt ctgttgctgt 8521 cagataaaag tctttattct gcttctaact tttccacaag tatagcttac tgggggaaac 8581 aaaactgttt tcagaaactt gctctcagga gaatctagga aatgtctttt tctttttttt 8641 cttttttttt tttttttttt tgagatggag tctcgctcag tcgcccaggc tggaatgcag 8701 tggcgcgatc tcggctcact ggaagctcca cctcctgggt tcatgccatt ctcctgcctc 8761 agcctcccga gtagctggga ctacaggcac ccgccactaa gcccagctaa ttttttttgt 8821 atttttagta gagacggggt ttcaccatgt tagccaggat ggtctcgatc tcctgacctc 8881 atgatccacc catctcagcc tcccaaagtg ctgggattac aggcgtgaga ccatgcctgg 8941 actgggaaat gtactttcaa gtcttctagc tcttgcaata agagagagag agagattgtt 9001 ggaggacatt taagggtact cagtgccaaa agacaatcat cagcacaccc agctatgcag 9061 cagagaaatg gaaagcagga gaataaattt tatattttct gtcaaattgt tacatgaaaa 9121 tttattttat atatatagct ttgctgttgt catttagttg ttgtttactt gttaatttat 9181 tcagtagata ttcctggaat acctattatg ttaaaaattt ccttggaata cagaaaatga 9241 gatgtaattt ttcaaattac acattctgtc tttaagaagt ctaaagtctg gcaagataat 9301 atatccctaa ttattcacac atgcatttac aatgtatgaa agtcatttca atcagtcagt 9361 tttctggcaa ggaacagatg acacagtcaa aaaggctgaa gtgaaaacaa tttaatcgaa 9421 ggactatgtt tagaggggta aacagactta gggacaccaa gaagtgtttg agggaccagg 9481 ggtttccagc acattgggaa gccagtacaa ctaactgtgg ggcctatgga aaaaggggtg 9541 ggagggaaat tctgtgttac tggaggtgca tgcaagtaat gagaaaagtt tcccaaagga 9601 aggaattcag gaccagagct aggaagggga agcccagata tgctgaactt tctcttcact 9661 acccttcaat gtccagctgg tgtcttccat tggaaagccc agctgaaagc ctggatagca 9721 ccgtttgctg acatcagtct tcaggggcac agtccagggc agtaaggggc aggcaggcga 9781 gagtcggttg tggggaactt tgggtgtgca tgtagaacag ccagcacagc agctaccaag 9841 tgtatgatgt gccatttgca gatgcatgtg caagtacacg gtggacagag accaacttaa 9901 ttctacatct tttatgtgaa cttagagaaa attgccttgt tgattgctaa taacaagcaa 9961 catcgttaat cccattttct ctgggccata taccaagtac attatttcaa ttaatttctt 10021 aagaacttca aaagggagaa caatttactc ccatattaca ggtaaaaaac tgttagagaa 10081 tttagatttt ctaaaggagg tacagctaac aaatatagaa cttagactca cacctaggcc 10141 atactgactt tcaacctatg cttgaaggtt actccaccct atacattttg gtagaggaag 10201 ataccttcaa cattggaaat ctaaatgtct cttttagaaa caatgcatct gacacaaacc 10261 atttgaataa catctttcat gtgggaggct gcacgctaaa gaggaagatc ctcagagatg 10321 aggagacctc agtagttcat cttctgatcc cttgggtcag tgactttgaa aacatttggc 10381 taaactgagg ccaatcatat atgtcttaca gagattttat acatgtaaat aaaacagagt 10441 ctaatacata gtataggctc aataagtctt aggtccctat gaattttctg acaaatctct 10501 agtcaagcag ttatttactt tggcaatctg gtaacaaagg catacacgaa ttacttttgt 10561 ctgattcacc tatgattcag acacaaagtc atctaaaatg caaatatagt agtctgataa 10621 ttcattggca aaagtaacac aaacagttgt gtcttaagag aaagtgaaaa tgttgtacct 10681 tacagcattt cctgcttgat actgagtctg atatgcaagg tgtgatgctc ccagaaagtt 10741 gtgaattcag agcaatcctg atggattcca tctcctgttg ctccagcaca aactcaatat 10801 tgtaccttta tcctggtttc ctctctccct attttattct cccatctctc ttattcctaa 10861 ttcctgagat cctttccaaa ataaactact ttcaccaaag tccatgtctc aaactttgcc 10921 ttcaggggca gcttaaaact ttgacagtga tagaatgaga tgaccctcta agacacttct 10981 ctttgaaaaa ggccgtggtt ttcttttagt tacactgcca tgcattcatt cagaagtcct 11041 ataatgatat atgaactatc tcttccaagt tcattttgac tcagtctgag tttccaacag 11101 actcacatgg gtctctgctc agttaaggct ctgagcttat ttgtaactga atagactaag 11161 tttatgtttc acagcttcag ggtctgcatg gactcagaga accattcatt tctttcttca 11221 gtcattcaac aaaaatttaa caagagtatg tgcggagtag atccaagttt tgcggaacat 11281 gaggcttata aaattttagg gatcgctttg aaaaagagaa tacaaaatga gaatgtcttg 11341 aaaaaattct gtgcaaggcc aggcacagtg gctcatgcct ataatcccag cactttggga 11401 ggccgaggtg gttggatcat ttgaagtcag gagttcgaaa tcagcctagc caacatggtg 11461 aaaccctgtc tctactaaaa atacaaaaat tagccaggca tggtggtggg cacctgtaat 11521 cccagctact tgggagactg aggcaggaga accacttaag cccaggaagc ggaggttgca 11581 gtgagcagag atcacgccat tgtactccag cctgggtgac agagtgagac tctatctcaa 11641 attaaaataa ataaataaat tctgtgcaag tgagaaggcc tgaagttatt cttcattgac 11701 ttcatggcaa atccacttct agctagttac cgtgtgtagg caccatgtaa gtgcttgaga 11761 aacatcagca cagaaaacag gcaaaagctg caaaataagg tgtatgtttt caacaaaaga 11821 aaattggtaa atcaataatg atcaagtaat acaaatgtaa atacaaggaa aatattcctg 11881 gtagatgttt atctccccag agaagtcaga gtattgttga gaattctatt ggtggatcac 11941 accttattta ttgatggatc acaccttatt ttgataacaa ttctcagagc tcttggattc 12001 catgtgccct actctaaaga atctgagccc tgagatggaa agctctcctt tcctgtcctg 12061 actcaaccat gggcaatgat ctctttttcc gtgtgccttt tggtctccca ggaccagggg 12121 agttctgggc atagatggga aagctcttcc tgggctttcc aggccccagc cacagcctct 12181 ccatcggggt gttcatgggt cccctccctg tctccacttc agattctgta tccattctca 12241 ctggagcatg agtgtaggaa ttacccaccc aggttacatc aagatgtgca gtcacaaaaa 12301 gggatcctca cactgagatg tgtcctgtat ctgaaataac agtagtacca ctattgacta 12361 gatgtaaatt ttgcaccaca gcacatttat ggttcaatca atatcctccc atttgtattt 12421 ataagacctt ctgaatggac acagggcatt tccctatgac cttaaaggca ccaggctagt 12481 gtacttctgg gcaaatatct catatacatc cctaatgagc ttcctcttat ttttacacaa 12541 aagaaaaaaa ggagggtggg ttttccccac atagtccatc cactcacaag ccaatctcct 12601 gcagaaatac cctcacagat agaccaagaa atgctgttgt accagctacc taggtatgcc 12661 ctagcctggt caggttggca cctaaaatta accatcacaa aactcatgag aacacagagg 12721 tctgagttct gatgatgaga ctcctattag gtgatggaga ctcaggtggc ctcccctaaa 12781 acaaaaggag aagtcatatc tgtattagtc atacatatgt gtatatgtac atacatgtat 12841 gtattctgtt ggttctgttt ctctctgagg acgaggtggc ctagatcctt ccctttctgg 12901 gtaaagggtg gaagcagtgt ggatcattac ttagtgtgtt ctgatagcca tgtgtcacta 12961 tgtacttgtt ctttgtgtct tcacatctct tcctgttgca ttttaaagca ttttcatagt 13021 atgtggtgtt gcctagcaga gcattctcat agcaaaaaaa tgtatatata tatacatatg 13081 catatatata ttctattggt tctgtttctc tggagaaacg gtatatatat atatatatat 13141 atatatatat atatactctc cctgtatata tagtatatat atatatatat acactctctc 13201 tctatatata gtatatatat atatactctc tatatatata ctctcctata tatagtatat 13261 atacactctc ctatatatat agtatatata tatatacact ctcctatata tatagtatat 13321 atatatacac tctcctatat atatagtata tatatataca ctctcctgta tatatagtat 13381 atatatatac tctcctatat atatatatat atatatatat atatacacac actatatata 13441 tatatgggaa gagagagaga gaaaaagaaa gaagagaaga gaagagaaga gaagagaaga 13501 gaagagaaga gaagagaaga gaagggaaga gaagagaagg aaagagatta gggaaattgg 13561 ctcacgtgat tatgaaggct gaaaagtccc ccagtatgcc agccatctgc aagctggaga 13621 tgaagcaaag ccagtagcat gactcagtcc aagtttgaag ccctcagacc acagaagcca 13681 atgttgtgtc tctcagcctg agaccaaaag cctgacaacc caggggattg ctagtgcaag 13741 tcccggaatc caaaggtagg agaatctaga attctgatat ccaagggcag tagaagacac 13801 atgtctcatc tctgagagag aatgcaaatt caccctttct gtgccttttt gtttcatcca 13861 ggcccccagc tgactggata ttgccataca cattgagagt agatctttcc cacacagttc 13921 atcaatttac aagccagtct cctccagaaa taccctcaca gacataccaa gaagtgttgc 13981 tgtaccagct atccaggtat gccttaatct agtcagatta acacctaaaa ttaaccatca 14041 aactaccagc gggaacacac agaggtctga gctctgatga ggagtctcct gtgaggtgaa 14101 ctaatgtgac agggcataaa acaatataca aattgtattg atcctgacct taagactttt 14161 tgaaaccttc ccagatgtca ggtggctctt ttaacagagg ccagcactgt aggctgacat 14221 ccagttatcc agagtttgag aaaagataaa atctagtaaa aggtacttag aggcaaattt 14281 gaggagatga gttataaagg acaccagact gccacgagaa tgtcccttag actacataaa 14341 tattagaaac aattaaaaca gcatataaat gaggaattgt atcctaatgg gaataaaata 14401 tgcagagagg aatgcatttt ctttctaata acctgcaaag agcatatgca cttatgtctt 14461 taaaaatagt ctacaaatta ttcccattct gagctgttaa attgtatata tgtcctataa 14521 aatattgttg tatataccat aaattacata tatgtatata tttgatttat ttttttggag 14581 aatgtaaggg gtgtggcagg ttaactttcc agctcaacta tttgtcaggg gcactgatgt 14641 catcgagtca ctgagaacct aagttctatt tccccaggca gggctgggag agatgagatc 14701 ctggcctgga cctgaaatgg gcacaaggtt gttcttctat gtggcccttt gtctcctgtg 14761 gacaggtgag ggctggtcac aggtgggctt ccttccctag aattcccaag gcctcaatac 14821 aagtcttttt cttgggatta caacatcagg gtctgttgtt ttctattaca ggacacatgg 14881 atgctggaat cacccagagc ccaagacaca aggtcacaga gacaggaaca ccagtgactc 14941 tgagatgtca ccagactgag aaccaccgct atatgtactg gtatcgacaa gacccggggc 15001 atgggctgag gctgatccat tactcatatg gtgttaaaga tactgacaaa ggagaagtct 15061 cagatggcta tagtgtctct agatcaaaga cagaggattt cctcctcact ctggagtccg 15121 ctaccagctc ccagacatct gtgtacttct gtgccatcag tgagtccaca gtgctgcatg 15181 gctgcctcct ctctgcacgt aaacagcagt tagaaagact gaggttgctc tgtgtctatc 15241 cccacccttg gaagtccagg cctccataga agtcagaggg ccctggccag cctggaagcc 15301 atagagcagg ggccttatga ccctcagtgc tgacgtccat tcctacccca gtctcagacc 15361 aactggaggt caccccaaca cacttagctt gccaagtctc tcttctgcag ctctcttttg 15421 ctgcttgcag aaaggaaaag gcatcattag ttgaggttag cgatgattct cttaacccca 15481 aagcctggac tcccttctct cccctttggg cttcagtgac ttcttcatct gcccctttcc 15541 ccagctccac tatccttcca ctcatagtca tgaaccttca ctcctgacct ctgttcctgg 15601 ctgcctttct tctatgggcc aatagcctca tggggccctc tatcatttgc tcttcaccta 15661 catctccatc atcattttgc acaagtctcc cctggctctc tctgctccag ccatgctcct 15721 ttatgttcag ttatttcctc atatgtgtta tgctgtttcc caccctgggg atgtgtgttt 15781 ttacagatta tttcttctac actgaacata tgttcctccc tcttattcct aagtctctac 15841 ttcatttctt acatgaaagt atatttcagg tacatttact cttaactata aacagaaaaa 15901 cttaaatttc actaaaagaa attaaggaaa atatttttat cctttgagtg aagacttttt 15961 tagtggaaac acaaaagaac acatgccata aaaaaataat actggatggg catggtggct 16021 cacaccttta attccagcac tttgggaggc ctaggcgggt ggatcacttg agcccaggag 16081 ttcaagacca gcctgggcaa catggcaaaa cctcgtctct actaaaaaca cataaattag 16141 ctgggggtag tggcgtttgc ctgtaatctc agctactctg gaagctgagg cacgagaatc 16201 acttgaactt gggaggcgga gggtgcagtg agccgagatg gcaccactgc acttcagcct 16261 gggtgacaga gcaaaactcc atctcaaaaa ataaaacaac aacaacaaac tgacattagt 16321 tgtattaatc atgtatcgaa acacatgata aaaaatgaaa agtcaagcca cagcctggaa 16381 aaggcataag gaatgcatgc aaatgaggaa gaatcagcat gacaccattt atgttcacaa 16441 aagtcccttg gctcccctac agttccaggc ttcacatgca gttaaagtgc tgggaacata 16501 tgaagaggtt ctgaataact ggatgtgtgc agaactgatg tgggcaatcc caagtctggc 16561 ccttagaaat aacttgtggc agccacgcgc ggtggctcat gcctgtaatc ccagcacttt 16621 gggaggctga gatgggcgga tcacaaggtc aggagattga gactatcctg gccaacatgg 16681 tgaaatcccg tctctgctaa aaatacaaaa attagctgga tgtggtggtg tgtgcctgta 16741 gtcccagcta cttgggaggc tgaggcagga gaatcacttg aacccaggag gcggagactg 16801 cagtgagcca agattgcgct actgcactcc agcctggcaa cagtgtgaga ttctgtctca 16861 aaaaaaaaaa aaaaaaaaaa gaaataactt gtggcactgt gaagggagtc ctcaagtaga 16921 cacagaaatt agctgcaaag actcatggga tatagcatac agttgcactt atggttaaga 16981 tttattacag tgtcatagta aatattcagc agcggatcat atgaagaaaa cacacaggag 17041 gaatctagag gagttcacac attggcttcc ttatgctctc ttcctcccgg gaggggtcac 17101 acagagctca cttcttccag caacaaaaat gcattaacac gtgtgcaatg tttctgcctg 17161 tggagctcat cagagactca acaccccaga ggctttagtg gaggccagtc atggaggcac 17221 tcactgccta gtatgaaccc aaaatccaca gcccagaagg aaatctcggt attcagcaca 17281 atactgtttg caatatcagt ctaggccaca atgagctact cctctcggtt aggagaaact 17341 atgtcagtgc agggcactgc tcaccagcca gcttcccaga tgtcagccaa atgccaacct 17401 cacaagcagg actttctaag caccacagtc tcaagccttg ctgttgactg ttttctgtac 17461 agatcccttc aattcttttg ccatgcccta ataaacctgg gagctacgca atacagattg 17521 actaatgata atatggaagc agcctggctc actgggtcat aatttggtga aaaaccatga 17581 aagaaagcta cttatggcct gctgggcttt gtggacatgg gaagtaagac ttcatagaat 17641 taagccaatg agattttaag atttattttt attttttttt ctgtagcata gctgttctta 17701 tctaatagag ttagtatcta caatgggtaa agcattctca taacttacta agacaatcca 17761 ataaaatggc aatacttagt ctaaataggc atttcacaga aaagtaaaca taaattaaca 17821 ctaacataaa aataggttct tgggctaggt gcagtggttc attcctgtaa tcccagcact 17881 ttgggaggct gaggtgggca gatcacttga ggtcaggggt tcgagacaag cctggccaac 17941 atggtgagac cccatagcta taaaaaatac aaaaaattag ccagtcatgg tgggatgtgc 18001 ctgtggtccc agcagctact cgggagactg aggcacaaga atcacttgaa cctgggaggc 18061 agaggttgcg gtgagccaag atcaccccct gcactccaga ctgggtgaca gagcaggact 18121 ccgtcttaaa aaaaaaaaaa agaataggtt cttaacctca cttagaatca gagtattcca 18181 atctgaaaca gtaaattatg aatgcaaact tatcagattt tgaagactgg aaagtcttca 18241 aagaaatgat agtcccaaat atgagaaaag atgtggacta gtggaactgt attatactac 18301 agaagaaaat tataactgat atggttttgt tctgtgtcgc caccctcatc tcaaattgta 18361 atccccatgt gtcaagggag ggaactggtg ggaggtgatt ggatcatggg ggtggttttc 18421 cctgtgccgt tctcctaata gcgagtgaat tctcacaggg cctgatggtt tatgagtggc 18481 agttttccct tctctttctc tgtcctgctg ccatgtgaga cgtgccttgc ttcccttctc 18541 cttctgccat gattgtaagt ttcctgaggc cttctcagcc atgtggaact gtaagtcaaa 18601 taaacctctt tcctttataa attaaattac ccagtccctg gtatttcttc atagcagtgt 18661 gaaaacagac taatacaata attatatcag aaaacccttt ctgagatata ttctgacaag 18721 caacattttg aaacaaatta gcaatatagt tgaccctcta tatcagctga tttggaaccc 18781 aaggatacca aaggctgacg gtaaggaagg tgggcaccct tggattagga tatcagcagg 18841 gtttcctgga accaatcccc tcaggatact aagagatgac tgtgtgttta aattacttga 18901 gttcctttca ttgtctttag ggtccatcat ggagaattga gttgtcaata tttattttag 18961 ccattttgaa ggttttaatt gatatttaaa aattaacact taaaaactgt ggttatacat 19021 atatgtataa ccatggtttt acatatatat atataatata tagtatatac attaccacag 19081 ttttatatat aaatataaaa tacatattac cacagttttt aaatgttaat ttttaaatgt 19141 tttatatgta cataaaacat acaatttttc accttaacaa ttttaaaata tacaattaag 19201 tggaatcacg tacattcaca aggttgtaga accatcactg ctgtttccaa atgtttttca 19261 tgatctcaaa gagaaactac acccattaag caataactcc atctttccca tgcttctcag 19321 cttctggtag cctctaatct accgtttgtt tatatgaatt taccaactct agatagttaa 19381 ttgatgggaa attataaaat atatctccct ttgtgtctgt cttctttcac ttagcctaat 19441 gctttcaaga ttcatccata ttgtagcatg tgtcagaatt ttactccttt atatagctga 19501 ataatacttc atcgtgtgta tagaccacat tttgtttatc tcatcatcag ctgatggact 19561 tgtgggttgt ttccaccttt tgactattat aaataatctt gcaataaaca ctggcctaca 19621 agtatctgtc tgctttcctg ctttcaatta ttctgggtgt atacctaggg gtggaattgc 19681 tgagtcacat gggtattcta catttgacat tttgagcaac tgcccaacag tgttttacag 19741 cagcctcagc attttatatt cccataacag tgtaaaagcc ttccagtttc ttcacagcct 19801 tgccaacact taatttctgt tttgttcttc tttttaaaaa ttattacagc catgctggta 19861 aatgcgaagt gatacctcat tgcggttttg atttacatgt tcctaataat taatgatgtt 19921 aagcatcttt ttttgaactt ttactttaaa tttgaaaatt gtattatgta tacttcaggt 19981 acataacatg atatgatgag atacacacct ataggaaaat ggttactata gtgaaacaaa 20041 ttaatatagc catcatctca catagttacc catttactcc ctgtggcaat aacagatata 20101 atttactcat ttagaaaaaa ttctgaatat atgaactata gttttcatgt ggcacattag 20161 atctttagac ttcttcatcc tgcctatctg ctatcttgta tcatttggcc taaatgttcc 20221 catttcctat tccctccccc tgccatcctc ccattaacca ctatttcatt ctctctgtat 20281 atttgagttt ttacaaattt cacatataaa taatatacac tatttctctg tgtctggctt 20341 atttcactta gcctaatgtc ctccaggttc atctatgttt gttgtggtaa atggcaagat 20401 ctcatttttt ttttaggatc aaataatact ccattgtata tgtatgccac agtttcttca 20461 tccatttgtc catcagtaga cacttaggtt gtttccatat cttggctatt gtaaataatg 20521 ctgcaatgaa catgggagtg cagatgtctc caaaaggtgg gatttcattt tctttgtgta 20581 tataccaaga aaagagattg ctgggtcata taattctatt tttaatttct ttaggaacat 20641 ctacactaat tttcttaatg agtgcatcaa tctacttttc caccaacagt gtataaaagt 20701 tcctttttct ctacactctt gcaacttatc tttttaataa tagctatctt aatgggtgtg 20761 aggtgctatc tcatagcagt tttgattttc acttttctaa catttagtga tattgaacat 20821 cttggccatt tttatgcctt cttggtagaa atgtctgttt aggtcctttg ctcagtttaa 20881 atagggttat atgttctctt gccaaagagt tgtatggggt ctttatatat ttggtatatt 20941 aatctcttat ctgacatatg gtttgcaaat attttttcca aatccatagg ttgccttttc 21001 atgttgttga ttgtttaatt tgctgtgcag aagcttttca gtttgattta gtcccattta 21061 tttttgcttt tgtagcctga actttttgta tgatatcgaa gaaatcattg ccaaggccaa 21121 cgtcaaagag cttttccaac atgttgcatg taataaactg aacttagata gaagacctaa 21181 acattagata ttaagcatct tttcatatat tattgatcat ttgtatatct tcttaggagg 21241 caatggcttt tcttttccag ttgtttttta gtttttcttt gtctttagat tttagtagtt 21301 ttatgattgt aaggctaaat gaggttgggg ttgaacttca cttgggagat ttgtcgctgt 21361 tagtgttttt agtgcatatt ctgaaacatt tttaatagac tgtacaacag agaataagta 21421 aattcattga cattactggg aaacagagtt cttacaaagg gaagagagat acaataagaa 21481 atagggaaag acaaggaaga aatctgtggt gtttatttga attggaggta tcaaatatga 21541 actattattt aaaaattatg tatttcctaa ctctaaccat acaatatgta agaagcaata 21601 tgaagtcgat ggacaattgc tttatttctt tgtacttgtg ttcccagtag taatgggcag 21661 aattattgca tgttatttaa tttctaaatg tttctaaaag aaacaaagca gaaatggctt 21721 cttctagatc cagagcaggg aaagtcaaaa acatcttttg gcagaaatca aggaagtttc 21781 atgacccaga aaacatagaa aaaagccaca gaaccagttt catggggctc ccactagcca 21841 aacacgagta tctaaacatt agagcataat aatgaggaca ctaaattata acaaatttta 21901 aaaatctata aggctataat gacaataaaa agatgtaggt aagaatgtct tcattacaga 21961 ataatgctag ttactaaatt tcattagaaa atcaatgtag ccattaatgt agttacagac 22021 agagaccatc agttgatgtc aaaaccaaca tctgaaaggt tattggagac caagatatct 22081 atattgtgtc aaagtatcac tccactaatt gcttattaac tataaaatat aaatgatatc 22141 tttgcaatgg agagatctga tgtctgtgat gttaaattta gtatcgccaa aaatgtgtaa 22201 agacagcatt atatgttctt gtaattgatc ggatgcaatg ggaaacacac atcaactatg 22261 gcatggtatt tgccaaaata tttaacctct gataatgtgg aaattctgat tgggggtcat 22321 tttaccagtc agctgacatg gactcttaca aaaatcaaga ccatgaaagt tggaaaatat 22381 aagtgggcca aagagatgtc tagtttaaaa gtagctaaaa agacctgaca accaaaagaa 22441 atgcataatc tttgataatg tcctgtagaa aaaaaattaa aaccaaaaga cacattggta 22501 caactggaga aaggggaatg tgaagcatat atgagaaaat aatgttatat gaatattaag 22561 tttttgagtg taataaaggc atttgtggtt ttctaagata atggcctctt tttaggagag 22621 acatactgaa tcgttaagaa taaagtgttt tatttcagaa attgagggta aaatacacaa 22681 aaagagataa agcaaatgtc aaaagatgtt aataattgaa atacatttta atattctctt 22741 tgttgtatga ttcagctctt ccaaaagtgt cagtttttta aataaaatta gaagaaaaaa 22801 atcttaagat cttgaaatgt tggcatgatt tcagttttgt ggtggctatt tattatattc 22861 attttcaata tctttgaaat attctataat taccataaaa caaacacaca agataaggtt 22921 tcttttagcc ctatagtgta ctttttctat agggacaggc tatgattttt aaaagtcccc 22981 ccaaaagata agagaattta tagtctgagg acaaaatagc tgatatctct gagatgagca 23041 caaaacttag ctaacaatgc atgtcaagaa gctgtttagt aaatacacag gactgtttta 23101 actgaaacag gagattctga gttccctaca ggttggtcaa gaactcacag atgttgggga 23161 attgcctgaa tgtaagcaga ggagtggtag tgcttgtcct aaattgtggt tggataatat 23221 gtgtttgtgt aacagggagg gggctctagc aggactgaaa cactgtctgt ggggaggagt 23281 ctctagcagg acagaaggat cacatggagc tggaatgaat tcccttcagt cccccacaaa 23341 cactgtctga tagtagccct gatctaattt ccattgttta gaagtcccac cactctccct 23401 gagccatggc tggagggcct aagcccagtg gctctgaacc attggccatc aggaggcact 23461 gaggtttgat ggcgtccaaa tgtttttgaa acagggctaa gtattgcctg agaagttgag 23521 tttaaaggaa aagtgtggct tcatcctagc attcagtatc tagtttgcag gtgttttgag 23581 tggtcaatgg aaactcatct ttctccactg atgccagtgg gttctgttcg gcagtgaaat 23641 ctgaactgta gcaaaccatt gagtggagaa tcttctgagc ccacttgagg ggtagatgga 23701 ggagagccag gagccaacct gaatgctcac agacccatgc agaggaacaa gtgcctcctt 23761 gggagaacat cagagaaacc ttagcaccta acctattcta gtgtgtcctg gacgatggac 23821 aggagaatgt aaggggaatg tagaagatgg gagtggacaa ctgtagatgt ttccagatat 23881 ctcaaattgc agtaacacca caaatattct taacaagaac cagattttcc agaactctgg 23941 catttccttc cctgagcaat gagacggccc tttgggccct gacacaatca tgggtgtttt 24001 ttgttcattt gttttggtgc tcaggaagca ggggaggcat gagaatacct ggggaatgtc 24061 tttcctagaa ataggatgcc ccaatttcaa gtatttcaca tgcagcttga taggtcctgt 24121 ctccagccac ccagcctcca ctctctcaat gctgttgtca tccagggccc taagctctgg 24181 tcacacggac ggaagaggta gtgacactgt gatgtgaaga cactttccac catgaaatta 24241 ttaattggtg tcaatataag gcaggctccg actagcagct tttctaccac aatgatagac 24301 aatttatatg gaaaagagat ggttctgata attctacact cacaggtctt cacctgataa 24361 ccttgacatg tgtcccctag gttttaggaa cttggatttc tttaggctga gttgtactat 24421 tccctttctc tgaacacaca tgtcattggt tagttcattc atattgattc tcagggcttc 24481 tccaatccct caagccttcc cactctacca ctcagataag gagaagtgtg tcttaactaa 24541 tgttgcataa ttttttaaaa attgctggat actaacactg caattcaatt gtaaaatatg 24601 atagaggtgt aagtgtttta gatcccagaa gtcctgctca attttgcatt tattgttgga 24661 tattcaccct gatattttca ggaattagag tgttcttcac ccatttgggt taagatatca 24721 gaacttactt tcaattcatt ggtgaccaga gactacaggc agaatcatag tgaaggcttg 24781 atacataata cataatgaag attgataaca gcaaaaaggg aactagtcat tggccttcag 24841 ataaaatgaa acctcccaac tttgtgctca gggaattatt ttaccccttt aggttagagt 24901 atgtcagcta gttcaatgca gttctaggat aaagctgtgt caattaccct agaggactgg 24961 gagcatggag gacctggtag aaaagaaata cccaaagaga aggaaaatta cacacaaaat 25021 caagaggtca ctaattgata tatctatttt gagttgcctt tgagacttcc acgtagaaac 25081 atctagcaca ggataggcct agagatgtag tgacagatct aatcccccaa gccctgaata 25141 tgtgagtgag gacgcttgga tgaacttcca gtctatgaat acattaaaca aacttttagc 25201 tctgttttag tttacaacaa ctgttttcta agaacttaag catttgtgga gacaatgatg 25261 tcactgtagg aacttctctg taaggacagc aacatcccac ttcctctgct cctgctcaca 25321 gtgaccctga tctggcaaag cttccatcct gccctgaccc tgccatgggt accaggctcc 25381 tctgctgggt ggccttctgt ctcctggtgg aaggtgagtc ctaggaacac catgatcctc 25441 acataatctc ccagtgatta ttctagtcat ttccttctat tttaaaattc tttttccccc 25501 acagaactca tagaagctgg agtggttcag tctcccagat ataagattat agagaaaaaa 25561 cagcctgtgg ctttttggtg caatcctatt tctggccaca atacccttta ctggtacctg 25621 cagaacttgg gacagggccc ggagcttctg attcgatatg agaatgagga agcagtagac 25681 gattcacagt tgcctaagga tcgattttct gcagagaggc tcaaaggagt agactccact 25741 ctcaagatcc agcctgcaga gcttggggac tcggccgtgt atctctgtgc cagcagctta 25801 gacacagtgt agcagagaca cttccctcct gtgcagaaaa ccgcaggact ctctcctctc 25861 tactcagctc acagcagcct ttccttattc ctcatcctcc caggaaagaa gtgagttttc 25921 agatatagct aggattcata tagtgggagg aaatgaacta tttcttacaa catgagcgct 25981 ataattgttg cttgaaaatg tgtctcaggg atttgaaaca cttcttgggg acaactcaag 26041 aaacctaaag tgacttatca gagattgagt cctcaggggt agtgatggtt tatctcccat 26101 aaaagtatgt gatcatgaaa catgtacaat aaacttaata atacagatag tgattatacc 26161 aagcagcaaa cagcaggctc cccttatgta gttgtcagca tgtgaatgac ttgttaccct 26221 tagtgacacc ccggctgtct agcaaagtct caacacacag tatgtgtttg gaaatctttt 26281 tccttctgtt ttatttttgg taaagtatgc taataatgat gttgttaatg atagtgatta 26341 taatttcagg tatgttattc agcccttttt ccagttcaaa gaaatgcaca caaatggata 26401 ccccggcaac caggtgcaaa tcttagctcc agatgaattc tacttctcct gaactctcta 26461 actcttctat gtaatcatcc cagcccaaga atatatctag acctggacca gtcagttcag 26521 aggacatttc tacgtgtctg ccttgtatat taaaagttcc ttagtggatt gaaatattcc 26581 ttgtaagaac ttctgaagca atcaattgcc ctctctgata atatacactc tgtgtgtcta 26641 gctccatatt cattactcta cttctgccta aactctctaa tactagaaat ctttctgaag 26701 tattggttag aattacttaa aaagcagcat actggaaatt gtctgaaagc actgtcactg 26761 tccatttgcc cctcctgact tccctcccct ccttatccaa tctaatacct gatggcaagt 26821 caggctttat ttccatttgt tcattgcaca tctccctgtc ctttattcct gtaataaaac 26881 ctcactgatg tgatggatta aaatgaccac tcactcacca tctccttgtc aaaatttcta 26941 tttttttctt ttcccaatct gctatgttag ttatagttaa taactccctc ataacatttt 27001 ctgagatcac tttttattat ttgaatatag gacacacagt tgtttcagag tctgtctgat 27061 cattttcacc tctaagatct tcataggata tttttgtgat tttctttttg tgctgcttct 27121 cagttatggt gcttgagttt tcatggcatt tgtatttctc cttaccaatc tgatgatgtt 27181 tgacactttg ccaaacacta tagttgtaga aataattttg ggggtaggat ggatattttt 27241 ctgcagagat catttttctg tgcttctgaa agtgcatgta gatataaaca gtcaggctcc 27301 ctgaatcaca gtttaaggct tgaggtgtcc tggagcatct ggagaacagg tttgtctctg 27361 gcttattgtt actagtggaa tttggcaatg acacgttcgg tatgttctgg gctttgcttt 27421 acccccgtgg cctgtgagtc agcttcctga gtgctgagtc aggatctgca aatgcccgct 27481 agggcaaatg gactgctgtg ctcacctccc ggagctcttg ccctcaccaa cattccagcc 27541 caattattat tattttttta ggactttaac taaaattttt aatgttgttt ttgatatttt 27601 ttttccaact atgtagataa caacagaaag agtgagtgcc attatcctta attattggga 27661 ttaaggtgaa aataacttta cctggaattc taattttcct tgtattttaa tgttagattc 27721 tttgtgattt gtctaatatt caacttaaat ggtatagaga tgttaaactc tgatatcaaa 27781 tactttaaaa aaacttatca agaccattca taatccaatt gagacatatc tttgtttggg 27841 gcatgttgtt tttagagtat tcttctactg ttatttattt ctatcacata acctctgtgt 27901 tccattggac catatggcaa gatgggaaag acaatcaaga gaatgctgag atctggtcaa 27961 atagtacaga ttgggcttca aattaattta attactgagt tctgggggtg gatttgtgct 28021 gttttatgag tatttcctgg ggcaatggtc aaaattagag ggagtaactg atatgtggct 28081 attgttttac cccgtttaaa gccaatctcc tgggacaaca caccccattt tgtggggcac 28141 cccgaaagta ataatcctgt tttaatcttc ctattccttg ccaaatcttc tctacagaag 28201 ggaaatcagt tatactattt cctctcttca ttttagttta gtctctcctt attttcccgt 28261 tttcctcctg gttccctaga gaaaagtaga ttattttcaa catgtgctaa gatcagtgag 28321 gaactctctc ttccatgcag ggtggcctgc agtttggagg ggatagggaa tgggttaaat 28381 tctgtttcag catcatgtta gccagcccat cgagtttcac aatgagacct gctgcttatc 28441 agtattaaaa ctgacattat gtttccacct ttctgactgc attgtcaatg tctggggtgt 28501 ttctgtccta ccccatagca ctaatattat ttgttggcct tttaacacca actaatcata 28561 atatcctata tcccttgtac tcagtgaaat aaagctcaag tccctgtgtc ctggcaaaag 28621 agagcacatt tgtgctgttc tgcctactgg ccactaggag gcactgtggc ttcgatgtca 28681 tccagagtct atggggagga gctgagattg ctccctggaa gatggttaag aataaccccg 28741 aggttggacc taaattcagg gagttggttt gcagatgtct tgaatgatgg ctgggaataa 28801 aggaattagt aaccgacgct tcctccattg tagaggtggg ttttgcttta aattgtgctt 28861 tgagtggggc tgagtgacag gtgtactttg cagtgcatat acctctgagc tcgcttaaga 28921 tgtaggggca caaggagctg aggtggtgca gggcccatgt tctgagcttg gaaaaccaga 28981 gaagcagttc ttgtgtcaaa ggagccttgg agagcctggc cccaaaccag cccaggccaa 29041 ccctacctgc tctgtgagag tagaacagca agaaagggag gtgctaccca ggaccacagc 29101 agagggaccc tgtcgccagg gagatgaaaa tcacagtgat gccacgacca tatccacgta 29161 tgaggcatag tccagtgtgt ctcctggaag aagatgaagc ctgggcagac ataagaatat 29221 gtttttgaat aggcttccct tggcctgatt ttaaattcct tccatgtgat catgcatctg 29281 tctgtacctg tattttgtac ccggcttctg cctctcaccc atcagtccca tggctactag 29341 aaggtattaa atttctattg acaaaaatta ccactaattt ggtggcttac aacaataaaa 29401 atttattatc ttacagttct ggaggttaga agtccaaaac caggctaaaa tccaggtatt 29461 ggcagggcta cattccttcc cgaggctctc agggagaact atttttctct cctattcagc 29521 ttctagaggc caccagtgtt ccttggctca tggccccatt caatcttcaa agccaacaag 29581 gactgggtga gcctttgctc actctgcatc actctgacac tcactcttca tcctccctct 29641 tccacatttt aggacccttg tgattacatt gggcccatct ggaaaatcca ggacacgctc 29701 cccatattaa ggccagctga ttagcaacct tcattccatt tggaacattc gtttctcccc 29761 ttgccacata atttaacaca ttcccagttt ctgggaatta gacatgtaca tctttgagag 29821 gtcgtcacgc tgcctgccac acttgctacc tggttacacg aagtagtaag caattaactt 29881 tttgaaggca gaattatggc catagtttta tatcctggat ccaacagacc ctgcaaaagg 29941 atagagccca catggaaata gcccctccag taacacaagc tagtctgcaa actaatgccc 30001 ttttactttg ggaagtgctt ttgactgcag gggactcaga agcatgcctc tgtgccaaca 30061 gcaaaaatgt gcccttctct tttgttggca agtaacttaa ccaaccaacc caaaaaaaag 30121 atctttctct cagctttcca taatctctga gacgaagtag gtttggagaa gtggggttac 30181 aggggaaaaa gccaggtgtt aatgatgaaa aaacattgaa cttttctagg ggtagtaata 30241 agatttaatt caagactaga acattttagc tgcaaatctt caagaataag acaatattat 30301 cccctttctg ttttattgtg ggactagaga atgtgagaga ggttacattc catgggcttt 30361 gggaatttaa tatggttcaa ggataaacac acccaggttt ttcactgcag agaagagctt 30421 caaatataat cagttttcag gtcatcagct cagctcttgt atccctaaca atgcagttga 30481 catgcgtctt ctcagatgtc taactcctaa ctcactgagg gatactttaa gtacatataa 30541 aggactagaa gcaccaagct accagtgaga ggaagaggag agtttgcaga gaagctggct 30601 tgaaataaga caatgagttc atctttaaat acttgccatt tgaggtgcag atggatatag 30661 ttggcaggct cctatgtaag gcatgttatg gagaagctac cgtgaattga taatatcaaa 30721 acaaatatcc agggagcctc tgcaagtgtg catctctatt tcacaccaat tatagttgag 30781 ttaattcctg cctgattcat ctcccagaga tgcagcctcc tcttaaagaa gttgggggtg 30841 gtggcccatt cagtgatgtc actgacagat gcattctgtg gggataaaat gtcacaaaat 30901 tcatttcttt gctcatgctc acagagggcc tggtctagaa tattccacat ctgctctcac 30961 tctgccatgg actcctggac cttctgctgt gtgtcccttt gcatcctggt agcgagtgag 31021 tcttcagaat atttgccatc atcaggctgg gcttctgcat ggatgatctc atatattttc 31081 cttattctga cgcccaattc tgtcttcttt catagagcat acagatgctg gagttatcca 31141 gtcaccccgc catgaggtga cagagatggg acaagaagtg actctgagat gtaaaccaat 31201 ttcaggccac aactcccttt tctggtacag acagaccatg atgcggggac tggagttgct 31261 catttacttt aacaacaacg ttccgataga tgattcaggg atgcccgagg atcgattctc 31321 agctaagatg cctaatgcat cattctccac tctgaagatc cagccctcag aacccaggga 31381 ctcagctgtg tacttctgtg ccagcagttt agccacagcg ctgcagaatc acccctttcc 31441 tgtgcagaaa acccggtgtt tccccttctc cttctacctc ccagcagtcc tgggcaaagt 31501 ctctgctgtt cctccctccc tatgagaaaa aagtggtttg ggggtatgaa aaagacagaa 31561 aatgagaagg gatcaacata ggaaacctta tgttggtttg aggattacaa aatgggtttt 31621 gaggattcct taaaaattgt ctctgctcaa aacacatagg agtaagataa accttggcta 31681 ctgacactgg agatttccct gccctcctgc atttgccatc ccatgagaat ggtgggggct 31741 cttgagaagg gctgcatttt ctgaactgtg aggccctctt cattctctcc taactctaag 31801 ctgcaaacag aaatttccct cacacgtttt ctagattgta aaagaaagtt cttctttact 31861 atgattgtgg acgttccttt ataatgccaa tttcaacttt acattacttc aggatttttc 31921 actactccta aagagtgtct caaatgtggc tagagcaagc aggttagtac actagatgta 31981 agctacctgg cctggaatct aaggatccat ttgtctctgt tctgcgtaag atgagccggg 32041 tgctggccaa aggctgtgca cactcacaga gcactgatga cgcctcctgg taaggaccca 32101 cactggggta tctaaaagca gacaggcatg tccagtcttc tgttgccctg tttcctttct 32161 gattatatgt ccttaacaca caaatttaca ttttccttct tatttatatg agaagtttct 32221 atacaatacc tgcaatccat tctgagtggt tataatttct gtgtgatatt catatttaca 32281 tgctgattcc ttctaaatac ctatcatggt atcattgaca actgaggcaa aagaccccta 32341 tattttgagt gcccaaggcc attgaggttt tttggagctc tgccataagc ccaattccac 32401 tgtgtcattt tcctattttt ctttcctttt ttgttttttg attagtgggt cctgactttc 32461 aagatgaaaa tagtgcagaa ttcctccctg ctgcttccag atcattttcc ttcctacttc 32521 tctaaagccc agctgcatta taggcttcct ttagcctgat tttaaattcc ttccatgtga 32581 tcatgcatct gtctctacct gtatcttgta tctggcttct gcctcttacc catcaatccc 32641 atggctacta gaaggtattc aatctctgtt gacacaaatt actactaact tggtggttta 32701 caacaaaaac taattatctt acagttctgc aggtcagaag tccaaaaccg ggataaaatc 32761 caggtgttgg cagggctgca ttccatcctg aggctctcgg ggagaactgt ttttctctcc 32821 ttttcaactt ctagaggcca ccagtgttcc ttggctcatg gccccattcg atcttcaaag 32881 tcaacaagga ctggctgagc tttgctcact ctgcatcact ctgacactca ctcttcgtcc 32941 cccctcttcc acattttagg acacttgtga ttacattggg cccacctgga aaatccagga 33001 tacactccct atattaaggc cagctgacta gcaaccttca ttccatttgg aacattcctt 33061 tctccccttg ccacataatt taacacattc cagtttctgg aattagacat ggacatcttt 33121 gggaggttgt catgctgcct gccacacttg gtacctggtt acaggagtgg taagtaatta 33181 aattttttaa aacagaatta tgaccacagt tttgtatcct gggtccaaca gacctgcaaa 33241 aggatagagc ccaaaggaaa taggccctcc agaaacacaa gctagtctgc aaactaatgc 33301 cctttaacat tggcaagtgc tttcaactgc atgggactca gaagcatgcc tctgtgccaa 33361 gagcaaaaat atgccctttt atggcaagta agttaattga tcaactcaaa aaacaaacat 33421 ttctctcagc tttccagaac ctctgagagg aagtaggttt ggagaagtgg gattaagggg 33481 ttggcagggg gcggagaaag gtcaacaggg gaaaaagcca ggtgttaatg atgaaaaaac 33541 attgaacttt tcttttctag gggtagtaat aagatataat tcaagactag aacattgtag 33601 ctgcaaatct tcaaaataag acaatcttat cctctgtttt attgagggaa tagagaatgg 33661 agagaagttc cattccacgg gctttgggaa ttcaatatgg ttcaaggata aataaaccca 33721 ggttttttta ctgcagagaa gagcttcaaa tataatcagt tctcagggca tcagctcagt 33781 tcctctatcc ctaacaatgc agttgacatg catcttctca gatgcctaac tcctaactcg 33841 ctgatggata atttaggtac acatagagga ctagaaacac caagccacca gtgagaggaa 33901 gaggagagtt tgcagagaag ctggcttgca ataagacaat gagttcatct ttaattacct 33961 ggaatttgag gagatgtata tagttggcaa gctcctaggt aaggcatgtt atggagaagc 34021 taccgtgaat tgataatatc aaaataacta ttcagggagc ctttgcatgt gtgcatctct 34081 ctttcacacc aattatagtt gagttaattc ctgtctgatt catctcccag agatgcagcc 34141 tcctcttaaa gaagttgggg gtggtggccc attcagtgat gtcactgaca gatgcattct 34201 gtggggataa aatgtcacaa aattcatttc tttgctcatg ttcacagagg gcctggtctg 34261 gaatattcca catctgctct cactctgcca tggactcctg gaccctctgc tgtgtgtccc 34321 tttgcatcct ggtagcaagt gagtcttcag aacatttacc atcatcaggc tgggcttctg 34381 catggatgat ctcatatatt ttccttattc tgacgcccaa ttctgtcttc cttcatagag 34441 cacacagatg ctggagttat ccagtcaccc cggcacgagg tgacagagat gggacaagaa 34501 gtgactctga gatgtaaacc aatttcagga cacgactacc ttttctggta cagacagacc 34561 atgatgcggg gactggagtt gctcatttac tttaacaaca acgttccgat agatgattca 34621 gggatgcccg aggatcgatt ctcagctaag atgcctaatg catcattctc cactctgaag 34681 atccagccct cagaacccag ggactcagct gtgtacttct gtgccagcag tttagccaca 34741 gcgctgcaga atcacccctt tcctgtgcag aaaccctggt gtttctcctt ctccttctac 34801 ctcccagcag tcctgggcaa agtctttcct gttcctccct ccccatgaga aaaagtggtt 34861 ttgggttgtg acaaagacag aaaatgaggt ttcaacatag gaaaccttat gttgatttga 34921 ggattatgaa atgggttttg aggattcctt aaaaaattgt ctctgctcaa aacacatagg 34981 agtaagataa accttggcta ctgacactgg agatttccct gccctcctgc atttgccatc 35041 ccatgagaat ggtgggggct cttgagaagg gctgcatttt ctgaactgtg aggccctctt 35101 cattctctcc taactctaag ctgcaaacag aaatttccct cacacgtttt ctagattgta 35161 aaagaaagtt cttctttact atgattgtgg acattccttt ataatgccaa tttcaacttt 35221 acattacttc aggatttttc actactccta aagagtgtct caaatgtggc tagagcaagc 35281 aggttagtac actagatgta agctacctgg cctggaatct aaggatccat ttgtctctgt 35341 tctgcgtaag atgagccggg tgctggccaa aggctgtagc acactcacag agcactgatg 35401 acgcctcctg gtaaggaccc acactggggt atctaaaagc agacaggcat gtccagtctt 35461 ctgttgccct gtttcctttc tgattacatg tccttaacac acaaatttac atttttcttc 35521 ttatttatat gagaagtttc tatacaatac ctgcaatcca ttctgagtgg ttataatttc 35581 tgtgtgatat tcatatttac atactgattc cttctaaata cctatcatgg tatcattgac 35641 aagtgaggcg aatgatccct gtattttgag tgcccaaggc acttgaggtt ttttggagtt 35701 ctgccatatg ccaaattcca ctttgtcatt ttcccatttt tttttttttt ttacgtgtag 35761 ggatcctgac ttttaagatg aaaatagtgc agaattcctc cctgtgcttc cagatcattt 35821 tctccctcct actttaaagc ccagctgctt tgtacatatt actttataat attacccctt 35881 ctcattgtac ctaccttatt accttcagaa attttctttt ttttaatcct tctgcctgct 35941 actaatccag tcatcattct taatactatt catcacatag acaatttatt cactcgcgag 36001 gcttctaagt ttctcgactt cctatatcct gagtcttttt cccctctaat cattttcaga 36061 cttgtttcca tgattataga ccattaagga acaacaaaga tctttcaata tttctattta 36121 aattgtcctg tcgttctacc actacctccc ttctgtctat atcagtaact cctacaattc 36181 ttctatttct ctgacctctc cagttcatag agcccataac ttttattttt ttaatttttt 36241 tttggtcttt ctaggatgtt atttattaaa aatagattct ctgtccctac cctcacccct 36301 gctggcaaaa tggcctttcc caacatcttt tcctgcatgg ataacaggtc ttgaggacgc 36361 agatgtgcat cctacctaaa ctacaggctc aggcttttct gggacagtga ggcagcagct 36421 ctgccagagc caagggtgga acagcacaag acgacatcag caaaccctgg tgcacctaac 36481 tgtgggctgc tatggagtct gggatggact ggagttcctc ctgctccagg cctcgggaga 36541 caaaatccac aaagagagac ccagtggccc agcagcagcc ccctgagagc ccacaacttt 36601 ttaactctta cgctttaatt cttacattct tttaattata ttttaagatt taagactata 36661 atatcaaccc cgattaaagc caactccccg taggttccta tgaatggcag agttgggagg 36721 ggcactaaaa aatccccgtg ggaactcatt cgatttatct tctgtagttg tcatagactc 36781 ctcttaggtt cctctacttc taatcttggc ctccctcatt ctagccacca acccagggtc 36841 tactctcaat acagaagtca gagtgatcct ttaggttaga tgttgtcact cgtccactaa 36901 accatacagt agattcccgt ttcactcaga gaaaagccta cagcctgcat aggcctacga 36961 ggtcctcggt gatccccctc cctgttgtat ttcttgcctc ctgtctgttc ctctttccct 37021 tgcttatttt ttcacaacac agtggctttc tggccattcc tcgaatacac aaggcacact 37081 tctgccttag aacctttgct caagctattc cctttgcttg gaacatcctt ctcccaagta 37141 tcccagtcag gcctgcaaga agcctgaatt attctaccag tgacacaatc attttcctaa 37201 aaccactttt caatttttat acattcaaca attctccatt gttcaccaga taaaaatgtc 37261 acctcgggag ggcatttgaa ttttaaaaca tttggaccat atcttatctt atttttgact 37321 actgagtaag ataaatgtta ctttatcaag tgcacttcat acccttacaa aacagtctta 37381 cacaatttaa tgcctgttca tacatttgtt cttgcaattt ctgctacctt cttcatattg 37441 ttcaatactt ctagtcactc aagacactac tgaatgcttc tttctctatg aagcaggaaa 37501 tggttctata tactctattt tcacagtgga gtaaaataga tacacatttt aggaacaaaa 37561 caaccatgca gattatagaa attaaactat tttttacata ctttcgaagg tgaggaagtt 37621 cagaagctca aatatggtta agatatgttt tgcttctatt taagtccaca ttccccacca 37681 ctttaaatct tccaagaaag atttccctca ttcaaattaa aaacattttg atacaagact 37741 gggggcaggg aagtgctggg tagagaaggg cggggtccct ggtgagggct ccaccctcgg 37801 gcctgtgccc acagacctaa atgaggacag gcgtttctgt ttttgcactc aaaaagttgc 37861 cttttggccc gccacgcctc ccatcctgtg cccatataaa cccgagacct taagcggaca 37921 cggacacaag aggctggaca tctagaagag cagaagaata cagcagcaga caccggcaga 37981 tcagcgatgg cggaaggaca cgaatgcgag gggagttcag ccaggggaag tctgcgagag 38041 tcctgccact gggcagccaa ctgcagggga aaaccaccgt tccactccat ccctcgactt 38101 ctggctcccc atctatctca ctgagagcca cttacaccac ttaataaaag cttgcactca 38161 acctttcagc ccacatatga tctgattctt ccagtacact ggtcaagagc tcaggctgtc 38221 acattggccc cctgtccttg caataagcag agggtctatt gagttgacta atgcaacgag 38281 acggcagacg gcatagctga aagagtgcac tgtaagtcct agacgctgcc atcccttgga 38341 gcccaaaagc actccccatg gcctctgcac ctgcccgtct gcatgctccc ctaggggttt 38401 gagcactggg gcaccagtga agtgagccat ccccctgtca catgtcctgc gaggagaata 38461 agggaattct ccggttttaa cttaatctgt aaaagacctt gttaagagga tgaaaaagaa 38521 gccacagcaa tggataaaac atttgtaaac tacatatcca acaaaggaat attatgactg 38581 tttcgagatt tgcagaactg tctatttaac ccatagactc atgagcaata ataaattatt 38641 attgctttaa ccagtgagat ttgagtgagg cttgttatac agcattactg tgcaggtgtc 38701 aattgatata ctctatattg caatatcgat tcttaaattg tctaaccata atcctataga 38761 ccattactaa ttccaagctt gttcttttta tgaagagtgt cgctcaactc tgatgtcttt 38821 gttctgttta atattttgat ttacacatga acacagttcc cagcagcttt caaacctcca 38881 gaagccctta acatttctga ttctgaaaaa tatcttttct tgttttcagc attttttatt 38941 gttatttttt attcttttgt atgggaaggt actcagttta ccatcctgaa ttagtaattc 39001 tttaaacatt tttgcaattc cagtaattag agatattact gcattaggca ttgaagatac 39061 catggtcttt gcatttcctg cttgcatatc gagtcctatg aactatttct ggtgtctctg 39121 agtcacttcc cagtgtggtc attggctttt actgagtctg attcaccgac ctttactgag 39181 cctcactcac caacctttgg tcattcttgg acatagaagc acaaattcaa cagtgttgtt 39241 tcccaaaata tgtaatatgc agggaaaatt acttttcatg taaaatagac gtgtcctttt 39301 agtattacca gtagattatt ctaattatag actagtaaca tccaacagac ttatttgggt 39361 ttagtgacag acctattact gtgttcgctc ctcagttcaa atacagagtg tgtattttgc 39421 atcgtgtggc tgcaagtttt ggggttctcc atcatatact tagctataca atgccagcta 39481 ctcttgtctg gatattttcc caaatcctgg atgattacaa acttctcctt cactggtgaa 39541 attttttttt tttttttttt tttgagatgg agttttgctc ttgttgtcca ggctggagtg 39601 caatggcaca atcttggctc actgcaacct ctgcctccca agttcaagtg attctcctgc 39661 ctcagccttc tgagtagctg ggattacagg aatgtgccac catgcccggc taattttttt 39721 tttttttttt agtagagaca gggtttctcc atgtttgtca ggctggtctc caactcccga 39781 cctcaggtga tccacctgct ttggcatccc aaagtgctgg gattacaggc atgagccaca 39841 gtgcctggcc aatttttttt tttttaatga agggtcaaac ctgatattaa gtctttcaga 39901 cagtaagaag ataactattc agggtctaca agatggccaa ttagaagcag ctgtggtcca 39961 tggcactcag ggagaggaat gaaacggggt gagtgaattt agcactgtca attgagatat 40021 tcaggttctc acactgggac tgtctaggca aacaactcaa cccacagaga acaaagaaaa 40081 gcagggggtg tggggcatga tggcccacca ggagcagcac agagccaaag gaacccccac 40141 ccccagccaa aggaagcagt gagtgattgt gtgaccctgc ccaggaaacg actcttctcc 40201 cacagatctt tgcaacacat ggatcaggag atcccctcac aagcccacaa caacagggcc 40261 ttggatccag tacacagagc tgcatggagt cttagcagag cagctgctca ggcacacaca 40321 gagacccagg agttttacac attccagcct caggatcccc agcaaggcag gaaatctgtc 40381 cacatataac cctaggaagg gagctgaatc cagggagcca agcagcatca ttctgcaggc 40441 cccacttcca tagcacctca caagttaaga cgcactggct tggaattcca gatagccaat 40501 agcaacagcc tagaatctgc ctgagataag tatgaattcc tggggaaagg gggtggccac 40561 catctctgca gtccagtaga cttagctgtt taagcctgcc agctttggag aatacaaatg 40621 gtctggatga ggaaaggttc cccataatca gcacagctgc cttgccagat catggccaga 40681 ctgcttcttt aagcaggacc ttgatccatc ccttctcagt gggcaggacc tccctgtgga 40741 ggcttcagcc attctagcca gtgttctaca ttgagagctc tgatctctcc ctaggatgga 40801 gctcctggca gggaggggca gcctccatct ctgcagttca gtcaactcag tccttctagc 40861 ctgttggctt tgcagaatcc caaaatggtc tggacgagaa agcctcctca taatgcagca 40921 catttgctct accaaaaagc agacagactg cttctttaat caggcccctg atcctgttca 40981 tcccaactgg atgagacctc ccaacagggt ctccagccac ctcctacagg tgtgttctgg 41041 ccagcaacag gtcagtaccc cccagggatg gagcttccag aggaagaagc tggctgccat 41101 ctttgctgtt tcacagcctt cactgatgat acctccagta tgggaaaaac cgaggcaact 41161 aaggtctgga gcggacttcc agcaaaccac agcagcccta caataaagtg gcctgactag 41221 taaaggaaaa acaaaaaaca gaaaacaaca agaacagtat caatgaaaag gaccccacaa 41281 aatccaaggt caaaggtcag caagctcaaa gattgaaggt agataagccc acaaaaatga 41341 gaaagaatca acacaaaaat gttgaaaact taaaaagcaa gagtgccttt tctcctccaa 41401 atgactgtaa cacctctcca gcaatggcac agaattgagc tgaggctgaa atggctgaat 41461 tgacagaagt agggttcaga aggtggtaat aatgaacttt gctgagctaa agaagcatgt 41521 tgtaatccaa tgcaaagaag ctaagaatca tgataaaaca atacaggagc tgacagccag 41581 aataaccagt ttacagagga acatgatcaa cctaatggag ctgaaaaata caacacaaga 41641 ccttcacaat gcaattacaa gtatcagtgg gagaacagac caagtggaag aaggaatctc 41701 agagtctgaa cactatctct cttaaataag aggggcagac aagaatagag ggaaaagaat 41761 gaacaacacc ccccagaaat ttgggattat gtaaagagac tgaacccatg actgactggg 41821 gtacctaaaa gagataagga taatggagtc aaattggaaa acatacttca ggatatgatc 41881 cagaagatct ttcccaacct agcaagacac accaacattc aaattcagga aatgcagaga 41941 accccagtaa gatactccat gagaagataa accccaagac acataatcat cggattctcc 42001 aaagttgaaa tgaaaaaaaa atgttaaggg cagccagaga gaatggccag gccacctaca 42061 aacggaagcc catcagacta acagcagacc tgtccgtgga aaccttataa agtacagtgg 42121 ttggaagcta ttctttaaca ttcttaaaga aaagaatctc caacccagaa tttcatatct 42181 gaccaaacta gcttcgtaag tgaagaagaa atacgatcct tttcagacag caaacactta 42241 ggaatttgta ccacaggact gccttgcaag agctcctcta ggaagcacta aatatggaaa 42301 gaagaaaacc attaccagcc ctacaaagca cactgaaata cacagaccag caacactatg 42361 aagcaaccac ataaacaagt ctgtgaaata accaggtagc atagtaatga caggatccaa 42421 cttacccata acaatactaa cttttaagtg taaacagggt aaatgccaca attaaaagat 42481 atagaattgc aagctgagta aagaaccaag actcactgat atgctgtctt taggagacac 42541 atctcacatg caaagacaaa cataggctca aaataaacgg atggaagaaa ttttaccaac 42601 caaatggaaa acagaaaaaa gcaggggttg caatcttagt ttctgacaaa acagacttta 42661 aaccaacaaa gaacaaaaaa gacaaagaag ggcattatat aacggtaaag ggttccattc 42721 aacaggaaga gctagctatc ctaaatatat atgcacccaa tacaggagca cccagattca 42781 taaagcaagt tcttagagac ctacaaagag agacctggac tcccacacaa aaatagtggg 42841 agactttaac accccactga cagtattaga aacatcactg agaccaaaaa ctaacaaata 42901 tattcaggac ccaaattcag ctctggatta agtggaactg atagatatcc atagaattct 42961 tcacctgaaa acatcagaat atacactctt ctcatcatca tatggctgtt gctcaaatcg 43021 atcacatatt tggaagtaaa acattcctca gcaaatgcaa aacaactgaa atcatgacag 43081 tctctcagat cacagttcaa tcaaattaga actcaagatc aagaaattca cttaaaccac 43141 acaactacat ggaaattgaa caacctgctc ctgaatgact cttggataaa taacaaaatt 43201 aaggcagaaa tcaagaagtt atttgaaact aatgagaaca aagagacaat gtaccagaat 43261 ctctgggaca gataaatcag tgttaagagg gaaatgtatt gcactaaagg cccactacaa 43321 acctctgctc aaagaaatga gagaagatac aaacaaatga agaaacattc catgctcatg 43381 ggtaggaaga atcaatattg tgaaaatagc catactgtac aaagtaactt atagattcaa 43441 tgctattccc attaaactac cattgacgtt cttcacagaa ttaggaaaaa aaactatttt 43501 aaaattcata tggaaccaaa aaagagcctg aatagaaata gacaagacaa tcataagcaa 43561 aaggaacaaa gctagaggaa tcatgctacc agacttcaaa ctatacttta aggctacaat 43621 aaccaaaaca gaatggtgct ggtacaagag cagacacata gaccaatgga acagaataga 43681 gaactcagaa ataatacgac acacctacaa ccatctgatc ttcaacaaac ctgacaaaac 43741 aagcaatggg gaaaggactt cctatttact aaatggtgtt gggagagttg gctagccaaa 43801 tgcagaaaat tgaaactgga ctccttcctt acaccatata caaaaattaa ctcaagatgg 43861 attaaagact tacatgtaaa acccgaaact atgaacaccc tagaagaaaa cgtaggcaat 43921 ctcatttagg acataggcac aggcaaagat ttcatgacga acatgccaaa agcaattgca 43981 acaaaagcaa aaattgacaa atgcaatcta attaaactaa agagattctg cacagcaaaa 44041 taaactatca tcagagtaaa cagacaccct gtagaatagg agaaattttt tttgcaatct 44101 atctatctga aaaaggtcta atatccagag tctgcaagga acttaaactt acaacaaaaa 44161 aacaaccaga caaccccatt aaaaagtagg caaaggacgt gaacagacac ttatcaaaag 44221 aagacataca tgtggccaag aaacatttaa aaaatagctc aacatcactg atcattagag 44281 aaatgtaaat caaaccacaa tgagattcca tctcatgaca gccagaatgg ctattataag 44341 agtcaaaaaa caatggatgc tggtaaggtt gcagagaaaa aggaacactt ttacactgtt 44401 gtgggagtgt aaatgagtta aaccactgtg gaagacagtg tggtgattcc tcaaaggcct 44461 agaggcagaa ataccacttg acctagcaat tctattatgg ggtacatacc caaaggaata 44521 taaatcattc tattataaag atacatgcat gtgtatgttc attgcagcac tgttcaccat 44581 agcaaagaca tggaatcaac ctaaatgccc atcagtgata gactggataa agaaaatgtg 44641 gtgcatacaa catggaatta tatgttggca taaaaagaaa tgagatcagg ctgggcacgg 44701 tggctcatgc ctgtaatccc agcactttgg gaggccgagg taggcagatc acgaggtcag 44761 gagattgaga ccatcctagc taacacagtg aaaccccatc tctactaaaa agacaaaaaa 44821 ttagctgagc gtggtggcgg gcacctgtag tcccagctac tcgggaggct gaggcaggag 44881 aatggcatga acccgggagg cgaagcttgc agtgagctga gatcgtgcca ctgcactcca 44941 gcctggatga cagagcaaga ctccatctca aaaaataaat aaataaataa ataaataaat 45001 aaataaatag aaatgagatc atgtcctttg cagggaaatg gatagagttg gaagccatta 45061 cactcagcaa actaatggga acagaaaccc aaacactgca tgttctcact tataagaggg 45121 agctgaatgg taagaacaca tggacacatc acagggaaca agacacacta gggcctgtca 45181 gaagacggtt agtgggaggg agagcatgag gcaagaaaag gtaatgaatg ctggcttgat 45241 acctgggtga tgggatgatc tgtgcagcaa accaccatag cacatgttta cctatgtaac 45301 aaacctgcac ttcctgcaca ggtgccctgg aacttaaaat aaaagtcgaa gagctccctc 45361 tccctctccc tctccctctc cctccccctc cccctccccc tccccctgcc tctgcctctg 45421 cctctgcctc tctgtctccc ctttccacgg tctccctctg atgcggagct gaggctggac 45481 tgtactgccg ccatctcggc tcactgcaac ctccctgcct gattctcctg cctcagcctg 45541 ccgagtgcct gggattgcag gtgcgcgcca ccacgcctga ctggtttttg tattttttgg 45601 tggagacggg gtttcgctgt gttggccggg ctggtctcca gctcctgacc gcgagtgatc 45661 tgcctgcctc ggcctcccga ggtgccggga ttgcagacgg agtcttgctc actcagtgct 45721 caatcttgcc caggctggag tgcagtggtg tgatctcggc tagctacaac ctccaacctc 45781 ccagccgcct accttggcct cccaaagtgc cgagattgca gcctctgccc ggccgccacc 45841 ccgtctggga agtgaggagc gtctctgcct ggccgcccat cctctgggat gtgagaagcc 45901 cctctgcccg gccgcccagt ctgggaagtg aggagtgcct cttcccggcc gtcatcccgt 45961 ctaggaagtg aggagcgtct ctgcccggcc gcccatcgtc tgagatgtgg ggagcgcctc 46021 tgccccgccg ccccgtctgg gatgtgggga gcgcctctgc ccggccacga ccccgtctgg 46081 gaagtgagga gccctctgcc cagccgccac cccgtctgga ggtgtaccca acagctgatt 46141 gagaacgggc catgatgacg atggaggttt tgtcgaatag aaaaggggga aatgtaggga 46201 aaagaaagag agatcagatt gttactgtgt ctgtgtggaa agaagtagac atgggagact 46261 ccattttgtt ctgcactaag aaaaattctt ctgccttggg atgctgttaa tctataacct 46321 tacccccaac cccctgctct ctgaaacatg tcttgtgtcc actaagggtt aaatggatta 46381 agggcggtgc aaaatgtgct ttgttaaact gatgcttgaa ggcagcatgc tccttaagag 46441 tcatcaccaa ctccctaatc tcaagtaccc agggacacaa aaaccgcgga aggccgcagg 46501 gtcctctgcc taggaaacca gagacccttg ttcacatgtt tatctgctga ccttccctcc 46561 actattgtcc tatgaccctg ccaaatcccc ctctccgaga aacacccaag aatgatcaat 46621 aaatactata aaaaaaataa aaaataaaaa aaataaagtt gaatcaaaga aaaaaaaagt 46681 cgaagaaaaa aaaagaagat aaccatttga attgcctcat caggtggctt gttatgggag 46741 aatgtatatt tccaagactc ttgaaattaa agagagtgag ctatcataaa cagtgagctc 46801 ctagacagaa ataatttaga gtatgtgata gtaagcacac ctgtcaaggc tctctccttt 46861 gaagaaaatg taatgaggag gactctgaaa gaaattagag catgtacaca caaagattca 46921 aaatcaatca acattattat tacaaaagct gagagcagaa ttaaaagaat gcctgtgctt 46981 ggaatacttg ttatggagaa ttttaatttg ctaagaaaat ggatgagaca ctacagtatt 47041 ttgaagtgat acttccttct gatcatcaga acaggagcaa aggaagaata tttaaaattg 47101 aaagctttta tataattctt actgtgtttt gggcatttca agtgggttca ttttccaagt 47161 agaaaatatt cagcatctac ttagaatgcc ttattttatg gaagagtaaa ctgaagtaga 47221 gagaagatac ataattttct tcatgtcaca tggctagtaa atggaggaaa gagggattca 47281 aacacaagaa tttggggact ggattccttt taaccattta taaactatta taaccaaaat 47341 taaaaatata ggttcattaa aatggaagag gaacaagact tccttaggaa tagttctaac 47401 ttcttagaag acatatttct catatgaaag aaagaggagg ttctcggccg ggtgtggtgg 47461 ctcacacctg taatctcagc actctgggaa tttgaggcag gcggatcact tgagctcagg 47521 agtttgagac aagcctcggc aacatagtga gatcccatca ctactaaaaa taaaaaaaaa 47581 ttatctgggg attggtggtg catgcctgtg gtcccagcta ctcgggaggc tgaggtggga 47641 aaatcacttg agcctagggg gtaggggttg cagtgagcca agagcatgct actgtattcc 47701 agcctaggtg acagagtgag acctcacccc caattaaaaa aaaaaagaaa gaaaagaaaa 47761 gaaagaaaga agaggctctc atatccagga aagatcaatt ttagagggaa gaggaaatgc 47821 actcagggtc aactgctctc aaggagttat tagaagagat ttattttggt tgttactaag 47881 gcaggcaaga taactggatg taaagaatct tcatggagga aaaacaaaga tacaggatct 47941 cagaggggag actagataaa ctttacagag gaagacagga tttggggatt tgttaaggaa 48001 tgacttaaaa tcaacactat ataaagggtt atatgcaatt tggacatttt tgagaagaga 48061 tgccctgaga catcaattga agactctcaa attatttgat gttcagctct agcctgtgct 48121 tgaaaggagc ctgcttaatt atttttaaaa ttatgagtct taagaaagtg taagaaagat 48181 ggaacatatt aaaggaaaag caggcaaaca aaaaatgaga atacctttta gatacaattg 48241 ctcaaccaag gaaattaaag atattcaata tttatcttaa tagtgcttgc tttagaggat 48301 ccagacattg ttaattgcac atctctcaat agagaagacc acagagatca aatctagaat 48361 attttgaaaa gactagtgac aagtacaaac ataaaattta agaaggaatt taattctagt 48421 gaatattgct aatactacat gtgcttgaga aagaacagga caaatttaca aagcctttca 48481 aacttctaga aattcagctt aagctagacc aggcccagtg gctcacgctt gtaatcccag 48541 cactttggga ggccgaggcg ggtggattac ctgaggtcag gagtgagaga ccagcctggc 48601 taacatagtg aaacctcgtc tctatgaaaa atacagaaat tagcttggca tggtggcaca 48661 agcctgtaat cccagctaca tgggaggctg aggcagggga atcacttgaa cccaggaggt 48721 ataagttgca ttgagccgag accacgccac tgcactccag cctaggcaac aaaagcaaaa 48781 ttccatctca aaaagaaaaa aaaaaaaagt agtgagccaa ctctttattc agtaggggat 48841 cctgtccgct aaggaaaagg taatggaagg gcatattcga ttcaggagaa tgctagctag 48901 cccaataagc tgcaaagctc agccccttct ttaccagaat caaagctcac atttccactt 48961 cagttgatct gacatgagta tttttctctc aggtgttttg aatcttaggt cagaaagagt 49021 ggttacccct gcgcatgacc actgtgccaa tgtaacacag catatattac tttgggtctt 49081 gacaaagggc tcatttttag tgactgagtc cttctggcca ccaggagaca cttcagtgcc 49141 acaaccactc actgacactt tagagttgct aagaggactg tcaggccatg ggaaactaag 49201 aagaagctca ggcttggtac ctgaatcatg agtcctacct gttgagcact gaaaaagcac 49261 tggaaagaga agggaggacc caaaaacttc taaacctgtg cgaatgggct aagaccaata 49321 gtgaggttgg gattttaggc aagtttggga tggttgtcag acaaagaagc cactgatata 49381 ccactagtac aatgcagttc gaattacaat aaaaagttat agggggtttt tttgggtcag 49441 aaagagaaga agtaaaagag cacacatgat tcctatctcc ttttatcctc caaactaact 49501 ctcactaacc cacagaaatg ttttcatagg tggcacaaag gtaggctgca acaattgtgg 49561 ccaatattat ggtagagaag gaccctgccc tgtaatgctc aacatattaa tgttacgtga 49621 gagtgggaca gggtcatctc tgtgacagta tagtcacaga agcctcttcc tgctcaaata 49681 agtggccttc attaaaagaa acacagaaga atcttgtttc ataagctaat gccaatcact 49741 gagacagcaa cggtttcagg aacatgaggg aagctgttgg aatgggcttt cctttaaacc 49801 tagcctggtt ccatcatgct gagtggtggg ggctttaagt cagggcctga gaaggacagc 49861 ctaactgtgg tataagtcag agggctcctc attcctcata agccgacaaa tacattaata 49921 tcatcagaaa aattaccagg gcctgtggtg tcccaaattc tgatttcatt ttccattgaa 49981 gatcagagca cagaggaagg agatacctca ctcatgtgct cttcttgtta tatgctcaca 50041 gtttctttgt ttcaatgact ctgtaccctg gaagaggtta gacctggata tgctaaaaaa 50101 aaattttttt aatccctatc ctagaatatt gtggccttga atcaaagtgt tttctatgtt 50161 tgtgttttca tcttccttcc taagccctat atctaggttt tgccttctcc atccacaaat 50221 ctcaggatgc tatggagtct gaagacacct gattgcagaa agcacaagac cccccaccag 50281 cgcgcgcaca tacacaaaca aatacaggac cagaaaggag tggttactgc tgattctcta 50341 acattacaaa aatatcacgg ttgtaacttc tcaccctgcc agtcatatgt ttctctgtca 50401 cttgacatac tcccctcttt ggggtgttgg aggaaaaaat ctctgtgcca acagcagaga 50461 catggaacta gtgtgtcaca tccactctat gaacagaatt cttactacta tttataccct 50521 aaaaagcagg ccattttcta gagtttcacc ccctcaccag aaagaattta tttgtgagta 50581 ttggttgact tactataaca agagactcaa gtgttaatca ttaaaggaac ctgttagtgc 50641 tttcttagaa gtaagaggaa gatcacattt aactggagag tgccttattg tggttttcaa 50701 ggaaggtcaa agctcttact tatcactgct caatcttcac ccgatgccca aagaattaac 50761 ataattttga accttaccct ggagaagcga aatgggacct tgtcctccag aaatgcagaa 50821 gacactgtgc tgtgggctgg gggagtcacg gagcacgaga gtatccatct ccagccctca 50881 ctggaaataa aatctgcaca gttctatctc tttccagggc agccaccttg ttctctgcat 50941 gaggagcatg gaggtccaga tgcttccagg agggatctgg ggcaaagctc taactaagca 51001 acacattatt ttgattttgt gtgtctggga tataaggatg gccggtaagg ggaacaataa 51061 agttcttgag atttcacaag taatttattg atgtccagaa tttgaggtgc tgatggtaca 51121 cctaagtggc aatatgcacc aggcagatca atatgtgact cgtcagagaa agcagaatgg 51181 gtgatgtgat gtgcaatgcc acagaagcac tgcagccagg agaggtgaca gctaatgggg 51241 atgtttggag tctttgagtg aaccaaacac atcccagagt aattgtaatt tatttcagtc 51301 aatcttctgt acagacttag cattcacctt tggaggaagg tcctttgagc agggacagag 51361 atggtgatgt cactgacagt ccccctttta ctctgggtga gaggtctaga atcctcagct 51421 cctgtattcg tgcccacaag ggcctcatct aggtgaaggc tccacctgcc ccaccctgcc 51481 atggccacca ggctcctctg ctgtgtggtt ctttgtctcc tgggagaagg tgagtcccca 51541 caaataaagc acctgcattt ttggatattg ccagttatga ttccaattat gtttcttatt 51601 ctgtccccaa attctatctc ttttcacaga gcttatagat gctagagtca cccagacacc 51661 aaggcacaag gtgacagaga tgggacaaga agtaacaatg agatgtcagc caattttagg 51721 ccacaatact gttttctggt acagacagac catgatgcaa ggactggagt tgctggctta 51781 cttccgcaac cgggctcctc tagatgattc ggggatgccg aaggatcgat tctcagcaga 51841 gatgcctgat gcaactttag ccactctgaa gatccagccc tcagaaccca gggactcagc 51901 tgtgtatttt tgtgctagtg gtttggtcac agcgctgcag aatcacctgc tccctgtgca 51961 gaaaccctgg tgcttcctct tctcctccag tacccagcag ctctcagcag cctttcttgc 52021 tcctccccta gcacaggaag tacataggtt tcgtgttcca catgtcctag gcaaggcaag 52081 aacaggtcat aaggacacat cacgttagga aacttttggt aggaagtcag tgggtgtgat 52141 ggttcctggg attccacaca tacttctcac aagggtgcct gagtccaagt ttgagggcga 52201 gtgtcacaga ctggagcttt caagtgattg agtgagctta aaactgtgca gccattgcac 52261 atggatgctt atctcctttg ctgctttccg ttatggcttc tttgccattt cttttctctt 52321 ctaccttaaa actttcaaca tctttgatga cttccaacat caatattggc ttcccagttt 52381 tttcacttgc tcacttcaaa tgatcatttc cttcaccttg tgtcagatat tcatttccac 52441 tttctttttc tagattccaa tgacaaatta tttgaattac ccccaaatag tttcctcaag 52501 catcccattg tctcatgcgg gcttctcttt tcatctctgg tactacttct gcagcaattc 52561 ttcaacccat caggaccact aattgacttc tcctgcaaat attgcaccat atatcatctc 52621 ctcatgtact atattcctta tctgacttag actatgtgtt ccaaggtgat tgtgaaattt 52681 tgcctatgtc ttcagctctt ttctttttct gtttcttacc tggaaaataa aatggggcct 52741 actctgtgcc cctttcaatg acacagagaa cataaaatgt cccttaccct tgtcaagcag 52801 tattataacg atacagtgtg tcttgaggtc tgtaagtccc tctatgcata taaaattata 52861 tacttttctt ctattaaaat aatttgttac cttttaatga gttcatcagt ttcatctttc 52921 aattatagtt atacttttga caattataga agtactaaaa aaagttatgc acccagaatg 52981 tatgattcta tcacttttat gtgtgtgcgc acttcacttc atacgtacaa aaaagttaaa 53041 agaatgccga ataggcatat ttatccattc tcttgtagga atgataaaga tgaacttcca 53101 aggggaaaat attttcaaaa taatccatgg cacataaaat tatttaagaa aaaataaatc 53161 aataaatatc cctgttaagt ttatatatag ttttatatgc ttgggtaagg aagaaaaaaa 53221 gttgtgatat attttgctgg ttatttggag agataaaata tagaaagagc agaaagaaca 53281 tgattagctt tgtctttgtt taaaattttt gctttttaaa tttgatacac tatttaccat 53341 caaaatttaa aacaacacca ctgctataaa acaatataca gacaaactaa aaagaaaaag 53401 gaatttacta tgtaaaaaaa aaaaaaaaag cgagagctta taggcagagt attaagggga 53461 atttaaaatt ataaagacct gtaagttttc agagaggtat aggcctcaca gctggagctg 53521 ggattttaat gtccatatga aagagatgac agaaaacagc cagaattaca ttgcttgcct 53581 taatcctgga gcttcaaaat gctcccaggc tttataaagt aaaagtatta caccccaaag 53641 acccagaaac ataattggaa actggatgtt tggtcaggcc tttgggtgat agataaaaag 53701 gaaaagttag gagagaaaaa ttgaatcttc agactgtgtc tcacatgtgt ttggaatgaa 53761 atatactctg catggaatgt acaccatata cccaaaaact gacaaaaagg tccataacca 53821 ttaaaaccct tggagaccct ggcattaaga gattcaatac ggttctggag taaacttctg 53881 cttccaacca agatagatta acgggaatta gatttaactt cttgcttgaa acagctaaca 53941 ccccaacccc caccctgtcc ccctgcacac acacattcac acaacatgtg gaataacggt 54001 tgtcaagaca ttggaagtag gcaacagaag acagtgatct ctgagagatg gaaacaacca 54061 agctgagaac tatgactgtc ttaatacctg gagggagttt ccaggccctg ggccagaaag 54121 aggaatccaa gcagagctgg gtgatctcac tgagctgagt cagagttgga aatacaagat 54181 caaagcagct agatttcaca tgacaggcca ttgaaaaggc gattgctgca cagagagaat 54241 cctggagacc ttcctagggt ctcccttaac ccttcagctg tgctgtaagc agtacatgtg 54301 tgaggaaact ccctaaggtt ggagaaagag ccactcagca agattagagg aaacaatgcc 54361 tggagctcac aaagggctga gagtaatata tacatatatt actctcatat atatatgtat 54421 atgtatattt ttgcattgag aatgggaaac ttaataattc atgggtcaat cagtagaggt 54481 ttcagatgac ttatgcctca gtataaaagc aaaataagcc cttcactaaa taagccacat 54541 ttggctcctg gcctatgtgg gtgagacttc aaaatatcaa actgtttccg agtaacttaa 54601 gtctcagagc aaaactaaag aatatttcta atgtacaaga atacctgttc ggagatatca 54661 gtatgaattt attgttttta aacgtataga tacaatggta aagaaatgaa tacatacctg 54721 tgtgaataat gggttggtat atatacatgt atttcatact tctgtccact tagaagacca 54781 ctagtgcaca gatgtttgtt gccaaatatt attctccaat taaaaaaaaa tgaaaggtgg 54841 ttctaggacc ctgaggtaaa aaatacaaga atgttgtaat gaggttttcc aaaaaaaaaa 54901 aaaaaaaaaa aggatgggac atatctcaaa aggacagagg aatcaatctg aaagagcttc 54961 cccaatgact aaggctggaa caatttgtaa aagaaaattc acaagttcac actgagaatg 55021 aaacgattaa ataaataaat aaataataaa aaatgtgaga aaggacaatc tcttactgaa 55081 gaattccaaa ttgcatatgt agaagaaaca gagggaaata gaaaagtcac tgtgacaaca 55141 ccactgtgag taactgccac acataaaacc caatagtgag tgctaaaatt agtgggtgta 55201 gctaatttta attactaatt aataactaat aactttaata actaattaag aagaaatgga 55261 tcatttgtac agtcacaaag tgtcatcccc aaaatattaa ttaatatggt ggttttaaca 55321 tatgcccaca cattctttta tactcatccc tctagattcc cctctcctgg attcagtggc 55381 ctgcctctaa tgaacagaca tgggaaatag aaaaatggta accgagtgat ccaaagtaaa 55441 taaactgacc cataataagt cttgttaacc tcatgtgcat tctcttatga tgcaatgttg 55501 tgatgttatc atgtgactac atatctgtaa tattcctccc ccaaatctat aattcctgtc 55561 taataatgag aaagcatcaa acaaatccca aactgaaaga gattctacaa aacacctgac 55621 cagtaatctt caaatgtcaa tatcgttaaa aacaaggaaa gagctgtcac aaattgaagg 55681 acactaagaa gatatgacac ttaaatgcag tatgggatcc tggattaggt gctagggcag 55741 gaaaaggtca ttattagaaa aactggggaa atctgagtac agtctagctt agttgtgcca 55801 atgtcaaatt cttagtttgt gtgatgttag cactagtggg tgttggatga aggatataaa 55861 gtcacttggt agtattgtta tagctacagg acatcaattc tccagctttg ggtcacctga 55921 gtccctatca agccaatttg tgaagctgct gttagttgtc cttactcccc agtcttggag 55981 ctctcagtta gacctagaat caacctcttc ttcctcttga cctctgcttc actgcatcct 56041 aaacgtggtg aggtggaggg gtggtgaggg aggctgagca gaagcccatc attcagaagc 56101 ccatcactag agcttctctg atgattctct gagacagggg ctccctctgt agtctccttt 56161 gtgcacaatc ccaagcacgc tgggccacac agactccacc aaagctagtt ttgtgaggag 56221 gcaaatattg aggctccatt gtgaagagaa tgtccatgtc agatttaagt acaaaagatt 56281 aaagttgatg cattactatt gcaggaaaat gatacctgtg agaatttcca aaagccagat 56341 tttgtattgt gtggctggct gagtaatacc cctccccaca aagacattca catcctaatc 56401 cctgggatct gcctctgtta ccttagatgg tgagagtaga ctttacagat gtgatgaagt 56461 taaggatctg gagataagga gattattttg gattagtctg gtggccctca atgcaattac 56521 atgtgtcctt ataagagata gccaggggga gatcagacac aaatggagaa gagaaggcaa 56581 tgtgaccaca ggagaagaga ttagagtgag gtgaccacaa gccaaggagt gaggcagcca 56641 ctagaaacag aggcaaggaa cgaatcctct cctatagcct ccagggggaa tacggtcctg 56701 atgacacctg gattttggtc cattgatact gattttggac ttttccagaa ctgtgagaat 56761 aaatctctgt cattgtaaac cagcaacttt gtggtaattt ttatagcagc cactcaaaat 56821 taacacactc caatatgtcc agttaacctg agactaggcc aagaaactca tgttcttata 56881 atattttctg caaaccttgc tgtcatgttt ttatctgaca cccatctcaa actataatga 56941 aagaaaataa ttttgagact tcaatgtccc cctgtctggg gtgtgccttc catagttcat 57001 tggaaaagtc tagtctcctt tcaagatttt attccagcag tacctcctct tgaaagaatt 57061 ttaggcattg ttttctgttc tcagtgcaca tactaaaaag aaaaagacaa agactcatgt 57121 ctacttgatg tttacatttt tgcggggagg gagagacaga gataaacaaa taagtaaggc 57181 atatagtgta ttagcaggtg atacctgcaa tgcagaaaaa gaaaaggcag actaaggtga 57241 gggagaccag gactgtcaga gtaaggagcg ttgaaatgac aatacattgt tcagagtagg 57301 gcccaatgag gaagtgacct ttgagcaatg atttcaagga gatgaaggaa agagacatgt 57361 aggtatctgg gggaaaagaa ttccaggcag agggaggagc cagtacaaat accctaaaaa 57421 aatattttag gaccaatgag gaggcccata tgactgaaaa attaaggaag ggagtggtag 57481 actttatttc caagcggtaa tggggcagga tggatcattt aagaccttga tggccttaag 57541 tgctttgttg ttttgttttc ttattccaaa taaaactggg agctagtaga tgatttggag 57601 atgagtgaca tgatctagca tgttctagag ggctgagttt gagagctagt tcagaagtag 57661 gtgaaagcag ggagatcagt taggaatcta ttgcagtaat ccaggaaaaa gatgatggtg 57721 gccttgtcct gagggttagc aaagaaggtg atgaaaaaat ggatggattc tgagtgtatt 57781 ttcatgatag agccagttga atttcctaat ggattggcta tagggtggga tagaaaaaga 57841 ggatttggga accactctaa ggttttcagc tggaaggctg ttggtggagt aaagttgagg 57901 ataatctttc atgtttagag catgctgttt gagagtggtg gggggaaaat gttgattgga 57961 gtgcattaga gagataatgg gaaaagagaa tttggaggta agtattgatc attagcaata 58021 gcaacaatag ctaccgtaga ttaaaatgcc ccacagtcta tggggatgtg atacatgagg 58081 gaagacaggg ccagatggga tgaggtagga tccagacatc agactcagga gctacgagtg 58141 gtatatataa ggttaacacc tagtcaaatg cataaacagg ttgattttaa ttgatgtcag 58201 tttttctcca ttgccccctc tagaggcaat cttcttcaga gaaccctggc taggtcttct 58261 atgtttcatg tccgtagagg gagctcctga gactgtggac attggctaat atgctgatgt 58321 cactggaggc cacatcttac agggccaaga gacagatttg ctttcctttt tctcatactt 58381 gtaagctcct tcatctggaa atgtgattta cctgggtcct gccatggttt ccaggcttct 58441 cagtttagtg tccctttgtc tcctgggagc aagtgagtct tcaggtactt aaatatctgt 58501 gctgtaccct atcccagtct attcatgtca tgtattctgt ttttgtctct tcccacagag 58561 cacatagaag ctggagttac tcagttcccc agccacagcg taatagagaa gggccagact 58621 gtgactctga gatgtgaccc aatttctgga catgataatc tttattggta tcgacgtgtt 58681 atgggaaaag aaataaaatt tctgttacat tttgtgaaag agtctaaaca ggatgagtcc 58741 ggtatgccca acaatcgatt cttagctgaa aggactggag ggacgtattc tactctgaag 58801 gtgcagcctg cagaactgga ggattctgga gtttatttct gtgccagcag ccaagacaca 58861 gtgcttcaca gtcgtgccct tgctgtgcaa aaccatagcc ttctcctctc aactcacagc 58921 tgcccaaaag gaaggctttc cctgtgcctt ctcccccaag gaggggagat aaagaaccag 58981 aattaactca tgaaatacaa gagtattcca agaagatttg ggtgaaaata cttgtggttt 59041 gggggatctc tgaagttttt taaacagaac tcaaactatg tgtttctcat atttattttt 59101 attttttaaa gtgtatgttg cctggttata aagcagactt cttgcctatc ttgctgctct 59161 gaaaggtttg tgactctgac tgtgaggcca catctttata tcccttctta tagtaagaga 59221 aacaagtctt tctgatgaaa ataaaaatag tgaataaaaa tgcatttact actggctttt 59281 aggatgctca atgatgatgc tagcaatggt ggtagtggtg acgttggtaa tggtggtgat 59341 gagtttaggt tgcttttgtc tagcatttat aaatttccaa ccactttaaa acaatctcat 59401 tttagagtgg aatagctaag tccattatca gaggttaatg ctaattttct gttcacatag 59461 ggtctgagtt tttaatgtaa aaaatatagt tagcactata aatagatatc tgtcagccga 59521 tacagaacat tttttaagat cacttccagt tgctgcaata ggaatatgta ttcctatttt 59581 ttttcttttt tctttttctt tttttgagac gaagtcttgc tctgtcgccc aggctggagt 59641 gcagtggcaa aatctcagct cactgcaact tctgcctcct gggtttaagc gattctcctg 59701 cctcagccta ccaagtagct gggattacag gtgcctgcca ccatgcctgg ctatttgttg 59761 tatttttagt agagacggga tttcaccatg ttggccaggc tggtcttgaa ctcctgacct 59821 caagtgatct gcctgcctca gcctcccaaa gtgctgggtt tacaggtgtg agctactgtg 59881 cccggccttt ttttcttttt gagacagggt ttcattctgt caccccggct ggagtgccat 59941 ggcagtgtga tcacggctca ctacagcctc gacttcctgg gctcaagcag tcctcctgcc 60001 tcaggctccc tagtagatgg gactataggt gtgcatgacc acacccagct aatttttgca 60061 ttttttgtag ggacaggtct cactatgttg cccaggctgg tcctaaactt cttggctcaa 60121 gtgatcttcc tgccttggcc tcccaaaatg ctgggattat aggcataaga caccatgcct 60181 ggcctggaat atgtaatttt taatgagacc taatggtata atacattcta cctggttcaa 60241 ctgccttctt ttaatatctg gattccaata ttctgctcat gtttcagcca tgtttttgca 60301 aacagtcctg aaattcactc taataataag tttcattttc ccagtctgat tgttatttag 60361 cacctcaaat tattattgaa ctcatcctaa tcctgattag gcaaatggat agtgagaaaa 60421 tattacaagt gggcttggat tttaattttt tcaaacaatt taaaacccat tttccttttc 60481 taatgatcaa gatccaattt ttagataatc agtactctct ctttctatgt ggacatgtaa 60541 tatgaatagg tatattgttt ctgtcacagg catcgcaaac tcatatcttg cttgaaactt 60601 gtcgatgaac tcctttagta tgacaagaac aaagagaaat agatttttaa agcatagcca 60661 catattggac atgagttttt cttcaaaatt tcagctgcag aaccctaact ggtgcctttc 60721 cttttattgt gttatgtaat ctaaggtcat aataagtaac taaattattg atttattttt 60781 ccactctgta atctctccat tatatttaaa aatagcagct atgaagcaaa acaagttcct 60841 cgttgggtac attgcctcaa ctccaaaatc taacaagcag aagtttaatt cagtgctttg 60901 ccatttcata ctatagacac cgaatgtcct atcctataag gggcacacag tggttctctt 60961 tatagttctg ttagctcaag tggtatcttt aggtacaatt tttagaattt ttttatttgt 61021 taatctttct taaggtatga ttgacagaaa aaaattgtag gtatttaagg tatacaaggc 61081 gatgttttga tatacatata cattgcaaaa tgattaccac actcaagtta attaacatat 61141 ccatcacctc acaaagttac ctttgtatgt atatgtatgg tgagaatgat ctactctcag 61201 aaaatttcga gtgtacaata atcattaact atagtcacca tgctgtccat taggtctcca 61261 gaactttcga gcacactcca aaacatcaac ttttattctg attaagcctc aaatatgtct 61321 tcttgttgca aatattagct atttttagat tagctataaa taatgaaaaa tgctagcata 61381 aaagatggaa agaataaatg cagataaggg ctctaaggtc tttacacttt ccaggaagtg 61441 ataaaagctg caatttaaat tatacatgaa gaagtcaagg atggcaactc ctataaacag 61501 ctgatacttc aaacctggaa tgtctagtgc taagcagccc ggacccctgc acccagcaca 61561 cctctctttg gcatgatagg aggatgaggg atccaggtct tctcctccca tgctgccctt 61621 ggcctttatg ttttgccttt cattagtaag gcttctccct taattatcac cctatgacag 61681 gaaattcatc ctaagcctgg acacgtgaga gatggtaact ctgaagcagc cccctgcgga 61741 tgacaccaaa tgaggccaca cccagaacaa attgcaagaa ataacaaggt atttcttaca 61801 gaatccctaa gagaacatgt ctgtctgcat ttaagtgtct acagtcataa atgcacagca 61861 tatgattgga tcagatggaa ctgaaagact ttcagtgctt catggtgccc ataactctca 61921 ttgtcctgaa gtggtgttgg cagtggactc agcagggtgc cgtgagatgg tgaggctaca 61981 cacatacgat cgtgaccaag ctggctacac agatgggcca aactcaccct gtgatggtag 62041 acagagctta ccaatgggtt ccatgaatgg acaatgggca tgctgctgta ataatccctt 62101 cgaccatgat ggaagtcagg caagacccag gccatgggaa cattgaaaac agtcacctct 62161 agcctcaagt atttcctact gtttgccaca ctgtgcctac aaatatcatt tatgatacag 62221 cttaaaattt tcatgtttgt gaaaattatt atcagcctta tccaatctga tattttgaat 62281 gaaatcacat taaaaccata tattaattta agaactggca tcttcataat agtgagtctt 62341 ctacagtttc ctagaccact aaaaaatcat ggattgctat ctcagacttg aaatgggtga 62401 tccttattct tctctcagtc agaacagtga aaggtacaga ttcatggcag ctagtccagc 62461 ttctaaaagt caacttctaa ttgtctactt taggacaaaa aagaaataat atccttagga 62521 aggccaccat ggaattcatt ctatctgaac aatccttaca taatatcatt tcttataata 62581 tgtcctcaac atcgtgctct gcatatattt ctccaccatc tctaaaaatc ctcttagccc 62641 aggcctcctc aaactttaat gtgcatgtaa attatgacct tgttaaaatg cagattctaa 62701 tccaatcagt cagggttgga gctaaggttc cctgtatttt gaacaagagt ccgtccaagt 62761 gatgccaaag ctgccagtcc atggaccaca atgagcagcc aagctctaga tgacacctac 62821 aataattaca ggtgtaaggc ccctcttctc aagtatcatt ttgcacagtt cctggactag 62881 gtcagataca gaccctactt accacatgta tctctgtatt ccaagtatta cgtaatcatc 62941 agttttccta aatctaggca ccaatatttt aacatatatt ttcatggtat aaatagaaat 63001 taattttcta tagtaaatac actaatacct aatcacattc atgcatcttt aatgtgagta 63061 gcaaaaagcc aaggcactat cttaatggtt attgagtgag actttagcaa atactatctt 63121 tatggagaaa cctctgggct gctttttagg atgtgagaaa agtgaggtaa agtgaggaga 63181 actggatgaa cacgggacag agacagggac aggggcaaat atggggacac ctgtctcaag 63241 gaagcagcaa atgatataga aaataaatat ctgttccatc cctgttccag acaagcccat 63301 gtacctgcca agtaggaagc tgtgtatcac attgcaacaa ggaatgaccc cggccctggt 63361 aaagtcaaca gcaacagtca tcacaggcca atctgcctat cagggactgg agactctcta 63421 aactcccacc tctcaaccca ggaatcagag cctgagacag acagatgctt cattcctgta 63481 tggggtggta ttcctgccat gggtcctggg cttctccact ggatggccct ttgtctcctt 63541 ggaacaggtg agtactgggc agaaaggaaa tctttgagca aagctatctt gtcctcagtc 63601 tgcacctttc attcacagca gtaacactgt tctccttaac tctgactcca aatttgtctt 63661 ctttctctac aggtcatggg gatgccatgg tcatccagaa cccaagatac caggttaccc 63721 agtttggaaa gccagtgacc ctgagttgtt ctcagacttt gaaccataac gtcatgtact 63781 ggtaccagca gaagtcaagt caggccccaa agctgctgtt ccactactat gacaaagatt 63841 ttaacaatga agcagacacc cctgataact tccaatccag gaggccgaac acttctttct 63901 gctttcttga catccgctca ccaggcctgg gggacacagc catgtacctg tgtgccacca 63961 gcagagacac agagctgcag tgcttcctgc tctctgttca taaacctcat tgtttcccag 64021 atccaggtgc tttctctagg acttctccct caccacctct tacaacaata ggaagtgggt 64081 tggtggctgt caatatctgt agacagaagt tgagcacaaa ccaataaaaa ccattgacag 64141 ttatgccaag agtggaaaag tggttacaca tgcctggcat tcagttggtg gtattttccc 64201 aggatcacat ttagaaccca tgctgtcctt ttcagaagac aaataagaac tttttctttc 64261 tttttttcat tgttttaatc taagtcagag gcttagaaat ataagaggtg atatgtgaag 64321 aattatggac acccaatgtc tatgaagtcc agtctctggc attcgatgaa ctcttatagg 64381 atttaatcct cctggagaat ccacatgttt ttagcctaga cctggagcaa ctacctcatt 64441 aatgggagaa tgaaagcaca agtatcccaa gataaaaatt tctcagaggt aaaaccacat 64501 tggagagaaa aaaagagaag accatacatt aatctgctca tatctggagt tggaggtact 64561 attgagatga gcaaatggaa accataaata cataaagaaa tgatattgaa aatagagtac 64621 atttggagat gaagatcatg gagccatctg tatagaagag agtaataatg tcattaaggt 64681 gcataagagt gccagcaaat gtcaataaga aaaactacag gagggaagaa gatgggagat 64741 cctgctcccc ttaacaaaga taaataaaat gctgagtgag aaatattgca acctcataaa 64801 ttgtcacatg gaagtatgta tttataaatt accattataa ttaacatccc tcaatgagaa 64861 cgatattctg agaaaaaaaa gcagaaaata tagttatatg gctatctaaa aatgtaaaat 64921 tttgcacatt aaattaagat aatctaggaa aaatatgaca agacctatta caaaagagct 64981 tttacaaatt gtttgcttta atgatattaa taacataatg atagctaaca tttactaagc 65041 acatattaca tgctagacct taagtcttct tcacatatat ttatagttca tgttagtttt 65101 ataacaacct cagaaagtta atactatagt aacctcactc atttatccat gaggaaactg 65161 aggcacagac aagtaactag ccctttatca cgtatctcgt ttgtggtgga gccagtatca 65221 aaatgtagac aatttactgc tggagctcat tctgtcttcg gagtttacat ggattctaag 65281 tcagaccttc agataggaag gtaaatggtt ataagccaga agatttccac actctcacca 65341 acattgcact ggcttttctt tgggtcttgg ccagagggtg agcccttgtg actatgcttc 65401 actggttgac taggtttgag ttgcaccgag gaccaacata gagaaagaag gtctttggga 65461 taaacctggg aggatgaagg gtaaaaaatg cagaacttaa atcttgatcc caagggatca 65521 tttaaagatg aatgagtctc ccttagagga tgagaaatta aacataaact ctttaacttg 65581 cagagatgct gaagtgtttt cctcaagtag taaactttga atcatggtta tatgcatggg 65641 agggagccta tggggatgga acatggatga aaaagtgaca ggatgttcag ggggcaggtg 65701 cgggcagaaa atcagctgcc tccaagagtg gtgagagaag ctgtagagca tcctttatcc 65761 agactggcac aacatgcagt ccagaaggct gtgagatacg aggaaagggt caccataaac 65821 tgcaacctga ctatgtcaaa aatgacactt ggaatccgag agtgggtacg tgtggtggta 65881 tcacagatat acacccaccc catccccgcc caactgggag ataagagact tacaggttta 65941 gcactttcac cactgggact agaaccctgg atttagctag aggctggctt agaccttttc 66001 atggttcttc tgtatgaacc acacaattta gaaccatact atttagagat gatcaattta 66061 tacgacctgc ataaagcctg tctttccaga agaagctcag gttcaaaaaa aaaattgatt 66121 cctctcttgt ccttctcaat cttccctcct tccttctttc ctttcttcct gcctgcctgc 66181 cttcctcatt aaggatagac atgaactagt cacctgtata ctctctgagg tatgctgtta 66241 gtgctgacaa cacattagtt caacctagaa aaactaaccc tgttttattt tatcttatct 66301 gagagtctga gacccctctt tgatccttaa aaacttgcca atttcttgcc aagcaaatat 66361 gtgcagggtt ttagagagcc gtaggtaaca gagacaaaat agaaacaaca cacagaattt 66421 ggagcaaagc ctttagatct ctaaagcttg aggccccact gctgctctaa gcctgcctat 66481 ggcaccaggt tcctctgcta tgagtccctc tgcctcctga gggcagtgag tcctgggcgt 66541 agatagtctg tctgccttgg gcactcacac tgtagtctgt ttccacacct tcctctggta 66601 gtccaatatt cacccccatt ttcctctcta gcccctgctc ctacactcta gtccacagac 66661 tctgtggata tcctaatcac agagacagaa acagaggtga cactcagatg tgagtgatga 66721 aacttacaat gctgtaaaca agactcagga cttagactgc ctactgatgt tcattcttgt 66781 gccatttgtt cagcaaatat gtatttagga actcctgtat aacagacctt gtttccggtt 66841 ctggaaataa ggtaataaac aaaccagaga gacttgtttt aaactcctgg aatttataaa 66901 ccaaagagat gtatgtgatg agaacagtac gactctggat aagaaggaac ctgttacctt 66961 aacactggcc agtgcaaacc cagtctggca gacatctgtg atcctctaga cttgtaagga 67021 atcctatagc tgtatagctc tttgcattta agggcaaaac tagacaagac agaacctcac 67081 tctcatgaat ccatgcccta agaaagagaa gaaacttccc tcataggaac attggtgcag 67141 tctcgcagtt ttcttatcaa gtaggaaaat tagaactgct aaaactttgt gccagaattt 67201 aagacaaata aattgcctgg atcatgcatt atttacaatt atatttacaa ttatatacag 67261 atattttccc taatataaat ctgttgatga accacttatg tggcatataa ataggactta 67321 ttaaaagtta gatgcggtgc tttagaagat atattaaaat tattagagtc tgacttaaat 67381 ttttaatagt aatacaaaat ataaagtacc acagccttgt tcagaagaaa caggatttct 67441 gaaatccacc agaatagctt tctaaaagcc agtatcttat ggcctttagt gggctgaaat 67501 acttataatg atggattgga aaaatggatg tttgagtgca taaggttcac gagtataaga 67561 aacaattggt acctcctaac tatagattgt cagtttcaga ggatgagaat gaatgtgccc 67621 tcatcctcat tttaattttg aaagacaatg cagatatccc tactatagaa tcatttttta 67681 catcaccacc tatagagatt aacaacatag atgtttccag ctccctaaga ggccaagaag 67741 tattaaccca aggtcagatt caggtaattg atcgtgactc acgaattttc agagatccac 67801 aatggttgtt tttaaatttt cagcagttat gtctttttct ctggggagag ggtattcaaa 67861 gctcctcatg aggccattct ggactacttt tcccctcctg ccccctgccc ataaaacagt 67921 ttacagaaat aggtgtccag accgtgggct gtagttcacc aactcctgct ctagcctaat 67981 tttgtctcat tactagagca tggccattct gtggatgcca gtgaagcctt tctggggttt 68041 ctctccaatg ccttttaaag ctttttccta cacagacaca gtttactgtt tagccaaaaa 68101 attaagataa ttcctccaga catctggagt tcttgtgcag cttcctcctc tctggtggtc 68161 tttccttaaa aattctagct gcctcagccc ccgctgattt tgtaacctgt ttccgcaact 68221 caaaatttgc catgctctgt tgctctcttc cccattgtgt atgtgcagaa attgccccca 68281 tgttttggtg ttttggcaga atcccagtcc tgtgctgtct gctttccagt ggctgaatac 68341 aattgtttca catattctct taatttgtgt tacttgtata gagcaggatg ctaaaggcaa 68401 ttggattcta caaagtgatc acgtcacaga gaagccgccg acagaggtgg agagagccac 68461 acagatagcc agctgcctgt gctgcctgct cttcccctaa ttctgccatg agcccaatat 68521 tcacctgcat cacaatcctt tgtctgctgg ctgcaggtaa gtccctgttc tgcagttgtc 68581 agctccctgc tctaagcctt tcatccatgt catcgaactc cctcatgggc tcagtctcca 68641 actcctgtct gctttcttta caggttctcc tggtgaagaa gtcgcccaga ctccaaaaca 68701 tcttgtcaga ggggaaggac agaaagcaaa attatattgt gccccaataa aaggacacag 68761 ttaggttttt tggtaccaac aggtcctgaa aaacgagttc aagttcttga tttccttcca 68821 gaatgaaaat gtctttgatg aaacaggtat gcccaaggaa agattttcag ctaagtgcct 68881 cccaaattca ccctgtagcc ttgagatcca ggctacgaag cttgaggatt cagcagtgta 68941 tttttgtgcc agcagccaat ccacaatgtt aaatattagc taatcttagg acacagactc 69001 atcacggact cagctcagga agcaggtggt atactaggtt ggaaggaaat aacagaaact 69061 agagctagct taagccaaag gggaatgtat tataaggcta aatgtatgtc ccatagaacc 69121 acaaagcaag aacacaggaa cttcccagaa ataaactgca atgaaacctt agaaccggtg 69181 gtcgcgcctg taatcccaga gctttgggag gccgaggcgg gaggatcaca aagtcaggag 69241 atcaagacca tcctgactaa cacggtgaaa ccctgtctct actaaaaaaa aaaaaaaaaa 69301 aatagccggg cctagtggtg ggcgcctgta gtcccagcta ctcgctactc gggaggctga 69361 ggcaggagaa tggcgtgaac cgggaggcgg agcttgcagt gagccgagat cgtgccactg 69421 cactccagcc tggacgacag aacgagactc catctcaaaa aaacaaaaaa caaaaaaaaa 69481 ccttagaacc atggcatggc tgattgacaa ctcatttttt atataccttg tcggcctcat 69541 tattctcttt tattataagt tgtcttttct gtttcttagg tcttacagtg attagaaaat 69601 tgcagattta ggtgttagcc ctccagtcaa ttgagaccaa cgtgactttc tctcatgcta 69661 gattacaaat ccagggtgca gggggaaaaa atgtgatcct gctttactca agaatgggct 69721 cgtggaaatc caagggaatt gaattatttt gaacaagata gtatctggga gtctgtccct 69781 ctgaccttgt ggaccaagga tttcaagggg tggaatccag tcagaacatc caataggcac 69841 ccatcacaag tgagaagttg gggataatac attttctgga atatagagac cttgggaaaa 69901 gagaagagtt tggagagggg atcaaaacct tattatttag aacaatatca gaccacgtaa 69961 agagtctggt taatctctct ttaagaaatt aaaccggcca ggtgtggtgg ctcacggcta 70021 taatcccagc actttggaag gccaaaagca agcagatcac ctgaggttgg gagttcaaga 70081 tcagcctgac caacatgaag aaaccccatc cctactaaaa atacaaaatt agccaggtgt 70141 ggtggcacat gcctgtaatc ccagctactt aggaggctga ggctgaagaa ttgcttgaac 70201 ctgggagggg gaggttgtgg tgagccgaga gcgcaccact gcactctagc ctgggcaaca 70261 agagcgaaac tctgtctcaa aacaataaat aaataaataa aatagaagga ataaaaagaa 70321 attaaaccta gataacacaa aaagcatgtt cattttaagt gtgacaactt cagatcctac 70381 aaattcacta attggcctct atggcactga aggagaacag attaatgtac tcttggtgta 70441 aaacatatta atgcacagag aattaaaaga tcttcaggta tatgattcaa attgtaatga 70501 ggggcaatgt actcaacatc aatattccta atgaggcatt atggtatcaa ctgtacctag 70561 gcaggagact acacttatgt tgagcagtta tagaattcaa aataactctg ttaaaatttg 70621 ataatccatt ctgttcacag atctggtcac cacctagcat tcgctttgag agaagttcct 70681 ttattctctc cagttacatc tgctcttatt tcagtataag ttgaacccag ttgtcttact 70741 tccagaagtc caaagttact tcttttcttg ggaaaaaaat ttactgtgac tcctggaaac 70801 agaagttctc actttatcat ttttctgatc tatttaatat tataccttct cattttaagt 70861 tccttttccc gctctccaga agtgagtgtt ccttagcgtt ctgttctcgg tccccctcaa 70921 gcccagtctt gctgttggtg gtctacatct ttgcttttgt cacagggaag tcactcctct 70981 agatgctatc ttcagctttt tcttgaaata tttccattct accaaaactg ttctcctaaa 71041 gattaaaatt acgtatttcc cagaccaaaa gacgcatgtg tgtttctttg cctgcttgat 71101 ctctttgaag ctcttgacaa tactgaccat gccctaagtc ctgagcatta atttttgctc 71161 agtttcattt acatcatttt caactagttt ctcattttct ctctattctt tttgttttca 71221 atgcagctcc cagaaactaa atctatccat ttaatattct ttttcccagg aaaattcttc 71281 tcattctatt taaagtgtaa tattctagtt taatgggaaa gttttttgtt gttgttgttt 71341 agccttttaa aaatctctac tgaaaaaatt ttggatatga aaattagcca ggcatggtgg 71401 cgggagcctg taatcccagc tactcaggag gctgaggcag gagaatcgct tgaaacttgg 71461 aggcggaggt tgcagtgagc tgagactgca ccactgtgct ccagcctggg caatggagag 71521 agactccatc tcaaaaaaaa aaaaaaaaag aaaagaaaag aaaaaattgt gaaatgtggt 71581 ctgagacaac ttttctcagt gtcacctgag gcatgggtta aacaggtata ttctaggaca 71641 ggttcagaat ttcagagtcg aaatttcttg ttggtggact ttaggaatac gtatcattgc 71701 tcctgagtat ttctgttagt gctgaagttc tgccccaagg acttcataag attaataaca 71761 tcgcaaatct tctgtagtgc tttaaaattt tgttcttcat ttttttattc tcactttctt 71821 ctctttcctt ttccacttcc tcttatgttt tccttaatct tcaactttcc taagcacctg 71881 caagtgggat tggagccttg tttaacatcg tccatgtagc aaaataagga tgaggccaaa 71941 tatttgaacc aaggatcccc atctcctatg gaaggtgccc tgaggttgtg ggtgttgctg 72001 gggacatgat gtcatggcca gatcctacat catgcggcca agggaaccca gaactttcac 72061 tgctctttgc tactgcacat cagaacccat cgctgggagt gtcttgcact gcctgacctc 72121 accatggata tctggctcct ctgctgggtg accctgtgtc tcttggcggc aggtgggtcc 72181 aggtatactt aaacatttgc ataaagatgt ttttggctgg gcgtggtggc tcacagccgt 72241 aatcccacct ttttgggagg ttgaggtgag tagatcacca gaggtcaaga gttcgagacc 72301 agcctggtca acgtggtgaa accccttctc taccaaaaaa tacaaaaatt agccaggcgt 72361 ggtagtgtgc tcctgtagtc ccagctactt gggaggctga ggtgggagga tcacttgaat 72421 ctgggaggta gaggctgcag tgagcagaga tcacgacatt tcactccagc ctgggcaaca 72481 cagagagacc ctatctcaaa aaaaaaaaag atgttttctt tgggcttccc ttcaccttct 72541 atggcttccg tcttcttcca caggacactc ggagcctgga gtcagccaga cccccagaca 72601 caaggtcacc aacatgggac aggaggtgat tctgaggtgc gatccatctt ctggtcacat 72661 gtttgttcac tggtaccgac agaatctgag gcaagaaatg aagttgctga tttccttcca 72721 gtaccaaaac attgcagttg attcagggat gcccaaggaa cgattcacag ctgaaagacc 72781 taacggaacg tcttccacgc tgaagatcca tcccgcagag ccgagggact cagccgtgta 72841 tctctacagt agcggtggca cagcatggct gagtcagttc cctccagggt gcaaaccctc 72901 tggctgctct tctcccagtt gaactccaag aaaacatttg aaaaagcctc ttccttatct 72961 tcctacccca gaagaaagaa gcgagttgat tgttgtggct gcagctgcta ccgggagagt 73021 acaagaccat gaattaaggt cttaaatggt catgatgggc acactggaca atgggctcct 73081 gaaactaccc aaatacaaaa tgagacattc tgtggatcag gaggaatcca catgtttaga 73141 aggaagggcc ccagaccaat ttccaagttc agagaccagc tatctgaggt tgatattact 73201 taccaaaaca caaaaacatt cctactgatt ttatctcaaa cacagttctc ctgcaaactc 73261 ttcccccacc aacgtacccc agcagaggca atgacatgta catttatgga acagcctgtt 73321 gactactgca gatccatctc ttcaataaga ccaactttcc ctggaggtca gtgactaaac 73381 caaaccgcaa ggcagacggc aacacgaccc tcttagggga tggatgccag agaaacactg 73441 actaaaatcc ccactcgggg actgtcctag cacattcagt tcctcacaca gctgcacagg 73501 ggagcgcgac gttttattct gttacttcaa acactcatct gttcctctct ctctcttgca 73561 ggccctagac tcacagacac tggaattaga gtcagaaaaa ttgctttcta atcccaaaac 73621 tggttttgtt tactcccctt cagctgtgaa atcttggcaa atgctcttac ttctctggga 73681 ctcagtgtcc tcctcttcaa aattagagac ttgctccact tagtctctaa gatctgatac 73741 aaatatgatt atttgacttg ctatttattt atttatacaa atattagaat atttatgtat 73801 acatttaaca taaatatatt tatatttaat aaaaacataa aatttataac attttatata 73861 aatttatatc aataatatat atttattgat atacttatat cacataagag atatataaat 73921 atatgcaaat atattttatg taaattttaa tacaaatata aaaaatacat tttctatata 73981 agtatatata aatacttata tataaatgta ttttatacat ttatatataa atatatatat 74041 aaatatatat ttatatataa atacaaatac ttatatatat ttatatataa gtattaatat 74101 ataaatatat gtgtttatat attattatat ataaataata tataaatata tgtgtttata 74161 tattattata tataaataat atataaatat atgtatttat aatatttatc tatttatcta 74221 tctacagcta ttttacacat gtgacaatca gggttgcgat ggaagacatc actacaagct 74281 ggaaaatatg aacataatta catatgcaga taacacagat ccagagacta tctgattgca 74341 aggaacatta tagatcaccc ctatatacca taactttgaa gctaatttga aatgtggttt 74401 gtgaaagtat tatacatttc ctgcaatccc ctccactgct gctagatggc agtaaaggaa 74461 cgtgttagaa agtctgggga ggggacctca tgatttattt accgcgcagt catgatttac 74521 cacgcaatca cggtttagtc ttgttgtcct gtcataatta atagtatcct ctttcacata 74581 caaaagtgtc tcaatttgaa cattaaacta tacggtcatt gtagataatg acacaaatga 74641 tagcatctct gttgaaagta aattgtttat taatgttgca gtaccttcat ttctttgttt 74701 cggtttaatg aaatagattt tttaaattat agaataaata aactcgaagc tatagcattg 74761 gtctgctttt tgtcatttta ctatttaaat atatctcagc cctacttaca gtttgtatta 74821 gcgatatgtg gtgtatgaag cctcaactta aaaactcagc ttggtcttcg tagtgcttct 74881 gtcactgtct gcccagcacc cactgcctct caatgtgttc caacgtttcc caattcctac 74941 ctgctaataa acctaccagg atatttccag acctattttc tctgtgaaga aaaccaagcg 75001 acattgactt tccctcattc atctactaat acataaatgt caccacatgc cagtgacaac 75061 ggggtaaaat aagtaataaa agtgaaatag gataaaagaa ttgttataca tttattggct 75121 ccggagaaat ccaccctctt ctctctagtc tcctgcgcac cctgccaaca atgtcattgg 75181 ctgtaaaaat acctcatgtt catttaacac tctacagttt actgagtaat ttgttgtttc 75241 attttcatgc tttaattata tgtttccagg ttctctgagc ttttgagctc ctgctcttcc 75301 cagatgggct ttcataacaa ggccagcccc tttctctctc cccttccgcc agtgcccatc 75361 atttgaacca aattttcttc ctgtctctct tacttgattc tttcccaacc tgtgccttct 75421 gtttggtctc ttaaaatatt tccctattca gtgataactc ccttaatgaa ggcatcttta 75481 ttgactgcct tggaaatcct cagaattata gaaacatctt tcatgtccac cttggcatgt 75541 tttatatgtt tcacatatat tttcataaat aataaaaaag ttatatattt tccatacgta 75601 ataaaaatgt cccttcaaca tgcgtaatat ataacaagat tagtgatctt ccacctaatg 75661 tcttggacaa tgtgacttct gtctgatagg tatagtcttt gacccctact tactcatgat 75721 taaagaaata cctttctctc aaaggaacta tctgtgttca cagcgttaag tctcatcctg 75781 gatgctgcaa gaagacgttc aaatgtccac tgcccaccca ggtatgtgat gctgcctgcc 75841 atgggtcact agaaggtgat agttaaaagt gtaggctctg gaatcaaaca atgcagacca 75901 caattctggc tcctctgctc aaatgctatg tgaacgtgag caagactgtg aacctctcta 75961 tgcctcactt cctcacctct aaaatgaaga taatagtacc tacctgaggc tgagcacagt 76021 ggctcatgcc tgtaaatccc agcattttgg gaggccgagg tgggtggatc attccaggtt 76081 gggagttcga gaccagcctg accaacatgg agaaaccctg tctcgactaa aaatacaaaa 76141 ttagctgggc gtggtggcac atacctgtaa tcccagctac tcaggaggct aagacaggag 76201 aatctcttga actcgggaag cggaggttgt ggtgagccaa gattgcacca ttacactcca 76261 tcctggatga caagagtgaa actctgtctc aataataata ataataataa taataataat 76321 aataatacct acctgatagt attgggttaa ataagataat tcatgaaaat ccctagtgtg 76381 gtgcctaaaa ctcaaacgct aatatacata ggtttggcct aatattatga ggtgttatta 76441 tttaggattg ctgtgcttca gctttttgtt gagtttcacg gtttggctct tactggacca 76501 ctgacagaag caactccatg actggttatt ctgtcattac ccccagatac tgtgcgatac 76561 cttccaaatc tactacaaac aacgagtgtg catttctctt gcttgttcca tccaagagga 76621 aagagatctg tccagagatg tcattctgct tcagtcaacc cagccgtgcc agacaatatc 76681 agcataaatg tcaaaattac catatgtggt caacatatgc tgatactgtg accatatgct 76741 acatgggact tgttccctta attgtgaata agagaactat tcgtgtaggt gttattatcc 76801 ccaattttat aggctcagaa agagtaataa ttggtccaat gtctcacaac tgtatgtttc 76861 atggcaaaga ttggaaatga agcaataact tttttctagt aaaattacaa gaatgacctt 76921 aatttctttc taagcaaacg tagcgtatca tcacaacagt tatgttaaat cagctcaata 76981 atctcctaga agaacatatt ataatattca ttagaggtgg tatgggcttt aagagggaga 77041 aaaaactagg caggatattg ctatatgtga agtaccatgc tagatatttc acatatggta 77101 tttcatttca tccataccag caacccaaaa ttatgttatt tatccctgtt ttactaacaa 77161 agaaattgaa gtttaaggag ggtaagtgca ggatgaacac taagccaatt gtagagctgg 77221 cttttgtaat cagggatttt atttaattcg atttgctcta ctatcatcat gttgaattaa 77281 gaatggcaaa gggctgggtg cactggctga cacctatagt tctagctact cgggaaactg 77341 aggcaggagg gttgcttgat cccaggagtt cgaggctgca gtgagatatg gtcatgccac 77401 cgaactccag cctgggtaac agaatgagac cctgtctcta aaaatttaaa aaaaaaataa 77461 gaaattttca aaaaagaata aaaattaaaa tatatatata ttttaaggct gggtgcggtg 77521 gctcacgcct gtaatcccag cactttggga ggctgaggcg ggcggatcac gaggcgggaa 77581 atcaagacca tcctggctaa catggtgaaa ccccatctct actgaaaaat acaaaaaatt 77641 agtcgggcgt agcggcgggc gccggtagtc ccagctactc gggaggctga ggcaggagaa 77701 tggcattaac ccaggaggcg gagcttgcag tgagctgagg atc // LOCUS HSTCRDR 1161 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for TCR-delta chain. ACCESSION X06557 Y00289 NID g37003 KEYWORDS constant region; glycoprotein; Ig D-segment; joining region; T-cell receptor; T-cell receptor delta; variable region. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1161) AUTHORS Loh,E.Y., Lanier,L.L., Turck,C.W., Littman,D.R., Davis,M.M., Chien,Y.H. and Weiss,A. TITLE Identification and sequence of a fourth human T cell antigen receptor chain JOURNAL Nature 330 (6148), 569-572 (1987) MEDLINE 88065901 COMMENT Data kindly reviewed (22-JUN-1988) by Loh E. FEATURES Location/Qualifiers source 1..1161 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="PEER" /clone_lib="lambda gt10" /clone="Pr81" CDS 19..897 /note="T-cell antigen receptor (AA 1 - 292)" /codon_start=1 /db_xref="PID:g37004" /translation="MLFSSLLCVFVAFSYSGSSVAQKVTQAQSSVSMPVRKAVTLNCL YETSWWSYYIFWYKQLPSKEMIFLIRQGSDEQNAKSGRYSVNFKKAAKSVALTISALQ LEDSAKYFCALGTGVRGLQDTDKLIFGKGTRVTVEPRSQPHTKPSVFVMKNGTNVACL VKEFYPKDIRINLVSSKKITEFDPAIVISPSGKYNAVKLGKYEDSNSVTCSVQHDNKT VHSTDFEVKTDSTDHVKPKETENTKQPSKSCHKPKAIVHTEKVNMMSLTVLGLRMLFA KTVAVNFLLTAKLFFL" misc_feature 82..360 /note="variable region" misc_feature 361..382 /note="junctional region" misc_feature 383..433 /note="joining region" misc_feature 434..894 /note="constant region" misc_feature 475..477 /note="pot. N-linked glycosylation site" misc_feature 664..666 /note="pot. N-linked glycosylation site" misc_feature 739..741 /note="pot. N-linked glycosylation site" BASE COUNT 352 a 255 c 248 g 306 t ORIGIN 1 caaagagcta catgccacat gctgttctcc agcctgctgt gtgtatttgt ggccttcagc 61 tactctggat caagtgtggc ccagaaggtt actcaagccc agtcatcagt atccatgcca 121 gtgaggaaag cagtcaccct gaactgcctg tatgaaacaa gttggtggtc atattatatt 181 ttttggtaca agcaacttcc cagcaaagag atgattttcc ttattcgcca gggttctgat 241 gaacagaatg caaaaagtgg tcgctattct gtcaacttca agaaagcagc gaaatccgtc 301 gccttaacca tttcagcctt acagctagaa gattcagcaa agtacttttg tgctcttggg 361 acgggggtga ggggactcca ggacaccgat aaactcatct ttggaaaagg aacccgtgtg 421 actgtggaac caagaagtca gcctcatacc aaaccatccg tttttgtcat gaaaaatgga 481 acaaatgtcg cttgtctggt gaaggaattc taccccaagg atataagaat aaatctcgtg 541 tcatccaaga agataacaga gtttgatcct gctattgtca tctctcccag tgggaagtac 601 aatgctgtca agcttggtaa atatgaagat tcaaattcag tgacatgttc agttcaacac 661 gacaataaaa ctgtgcactc cactgacttt gaagtgaaga cagattctac agatcacgta 721 aaaccaaagg aaactgaaaa cacaaagcaa ccttcaaaga gctgccataa acccaaagcc 781 atagttcata ccgagaaggt gaacatgatg tccctcacag tgcttgggct acgaatgctg 841 tttgcaaaga ctgttgccgt caattttctc ttgactgcca agttattttt cttgtaaggc 901 tgactggcat gaggaagcta cactcctgaa gaaaccaaag gcttacaaaa atgcatctcc 961 ttggcttctg acttctttgt gattcaagtt gacctgtcat agccttgtta aaatggctgc 1021 tagccaaacc actttttctt caaagacaac aaacccagct catcctccag cttgatggga 1081 agacaaaagt cctggggaag gggggtttat gtcctaactg ctttgtatgc tgttttataa 1141 agggatagaa ggatataaaa a // LOCUS HSTCRGR 1080 bp RNA PRI 23-MAR-1995 DEFINITION Human mRNA for T-cell receptor gamma-chain. ACCESSION Y00790 NID g37017 KEYWORDS constant region; joining region; T-cell receptor; T-cell receptor gamma; variable region. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1080) AUTHORS Parker,C.M. TITLE Direct Submission JOURNAL Submitted (19-JUL-1988) Parker C.M., Dana-Farber Cancer Institut, Mayer 640, 44 Binney st., Boston, MA 02115 USA REFERENCE 2 (bases 1 to 1080) AUTHORS Hochstenbach,F., Parker,C., McLean,J., Gieselmann,V., Band,H., Bank,I., Chess,L., Spits,H., Strominger,J.L., Seidman,J.G. and Brenner,M.B. TITLE Characterization of a third form of the human T cell receptor Gama/Delta JOURNAL J. Exp. Med. 168 (1988) In press FEATURES Location/Qualifiers source 1..1080 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="MOLT13 leukemia" /clone="K" sig_peptide 37..78 /note="signal peptide (AA -14 to -1)" CDS 37..1008 /note="pre-gamma-chain (AA -14 to 309)" /codon_start=1 /db_xref="PID:g37018" /translation="MRWALVVLLAFLSPASQKSSNLEGRTKSVTRQTGSSAEITCDLT VTNTFYIHWYLHQEGKAPQRLLYYDVSTARDVLESGLSPGKYYTHTPRRWSWILRLQN LIENDSGVYYCATWDRPRLKKLFGSGTTLVVTDKQLDADVSPKPTIFLPSIAETKLQK AGTYLCLLEKFFPDIIKIHWQEKKSNTILGSQEGNTMKTNDTYMKFSWLTVPEESLDK EHRCIVRHENNKNGIDQEIIFPPIKTDVTTVDPKYNYSKDANDVITMDPKDNWSKDAN DTLLLQLTNTSAYYTYLLLLLKSVVYFAIITCCLLRRTAFCCNGEKS" mat_peptide 79..1005 /note="mature gamma-chain (AA 1 - 309)" misc_feature 79..390 /note="variable region" misc_feature 391..398 /note="N-region" misc_feature 399..439 /note="joining region" misc_feature 440..1005 /note="constant region" BASE COUNT 324 a 260 c 231 g 265 t ORIGIN 1 tggtcccttt ccttccaagg cccccgagag gaaggcatgc ggtgggccct agtggtgctt 61 ctagctttcc tgtctcctgc cagtcagaaa tcttccaact tggaagggag aacgaagtca 121 gtcaccaggc agactgggtc atctgctgaa atcacttgcg atcttactgt aacaaatacc 181 ttctacatcc actggtacct acaccaggag gggaaggccc cacagcgtct tctgtactat 241 gacgtctcca ccgcaaggga tgtgttggaa tcaggactca gtccaggaaa gtattatact 301 catacaccca ggaggtggag ctggatattg agactgcaaa atctaattga aaatgattct 361 ggggtctatt actgtgccac ctgggacagg ccccgcctta agaaactctt tggcagtgga 421 acaacacttg ttgtcacaga taaacaactt gatgcagatg tttcccccaa gcccactatt 481 tttcttcctt cgattgctga aacaaaactc cagaaggctg gaacatacct ttgtcttctt 541 gagaaatttt tcccagatat tattaagata cattggcaag aaaagaagag caacacgatt 601 ctgggatccc aggaggggaa caccatgaag actaacgaca catacatgaa atttagctgg 661 ttaacggtgc cagaagagtc actggacaaa gaacacagat gtatcgtcag acatgagaat 721 aataaaaacg gaattgatca agaaattatc tttcctccaa taaagacaga tgtcaccaca 781 gtggatccca aatacaatta ttcaaaggat gcaaatgatg tcatcacaat ggatcccaaa 841 gacaattggt caaaagatgc aaatgataca ctactgctgc agctcacaaa cacctctgca 901 tattacacgt acctcctcct gctcctcaag agtgtggtct attttgccat catcacctgc 961 tgtctgctta gaagaacggc tttctgctgc aatggagaga aatcataaca gacggtggca 1021 caaggaggcc atcttttcct catcggttat tgtccctaga agcgtccccg aattcaaggt // LOCUS HSTCRGT3 822 bp RNA PRI 21-MAR-1995 DEFINITION Human mRNA for T-cell receptor T3 gamma polypeptide. ACCESSION X04145 NID g37021 KEYWORDS T-cell receptor; T-cell receptor gamma. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 822) AUTHORS Krissansen,G.W., Owen,M.J., Verbi,W. and Crumpton,M.J. TITLE Primary structure of the T3 gamma subunit of the T3/T cell antigen receptor complex deduced from cDNA sequences: evolution of the T3 gamma and delta subunits JOURNAL EMBO J. 5 (8), 1799-1808 (1986) MEDLINE 87004546 COMMENT Data kindly reviewed (30-JAN-1987) by G. Krissansen. FEATURES Location/Qualifiers source 1..822 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 38..103 /note="put. signal peptide (aa -22 to -1)" CDS 38..586 /note="T3 gamma precursor (aa -22 to 160)" /codon_start=1 /db_xref="PID:g37022" /db_xref="SWISS-PROT:P09693" /translation="MEQGKGLAVLILAIILLQGTLAQSIKGNHLVKVYDYQEDGSVLL TCDAEAKNITWFKDGKMIGFLTEDKKKWNLGSNAKDPRGMYQCKGSQNKSKPLQVYYR MCQNCIELNAATISGFLFAEIVSIFVLAVGVYFIAGQDGVRQSRASDKQTLLPNDQLY QPLKDREDDQYSHLQGNQLRRN" mat_peptide 104..583 /note="mature T3 gamma-chain (aa 1-160)" misc_feature 191..193 /note="pot. N-glycosylation site" misc_feature 311..313 /note="pot. N-glycosylation site" misc_feature 761..777 /note="pot. polyA signal region" polyA_site 822 /note="polyA site" BASE COUNT 255 a 177 c 193 g 197 t ORIGIN 1 gggctgctcc acgcttttgc cggagacaga gactgacatg gaacagggga agggcctggc 61 tgtcctcatc ctggctatca ttcttcttca aggtactttg gcccagtcaa tcaaaggaaa 121 ccacttggtt aaggtgtatg actatcaaga agatggttcg gtacttctga cttgtgatgc 181 agaagccaaa aatatcacat ggtttaaaga tgggaagatg atcggcttcc taactgaaga 241 taaaaaaaaa tggaatctgg gaagtaatgc caaggaccct cgagggatgt atcagtgtaa 301 aggatcacag aacaagtcaa aaccactcca agtgtattac agaatgtgtc agaactgcat 361 tgaactaaat gcagccacca tatctggctt tctctttgct gaaatcgtca gcattttcgt 421 ccttgctgtt ggggtctact tcattgctgg acaggatgga gttcgccagt cgagagcttc 481 agacaagcag actctgttgc ccaatgacca gctctaccag cccctcaagg atcgagaaga 541 tgaccagtac agccaccttc aaggaaacca gttgaggagg aattgaactc aggactcaga 601 gtagtccagg tgttctcctc ctattcagtt cccagaatca aagcaatgca ttttggaaag 661 ctcctagcag agagactttc agccctaaat ctagactcaa ggttcccaga gatgacaaat 721 ggagaagaaa ggccatcaga gcaaatttgg gggtttctca aataaaataa aaataaaaac 781 aaatactgtg tttcagaagc gccacctatt ggggaaaatt gt // LOCUS HSTCRT3E 1311 bp RNA PRI 07-APR-1994 DEFINITION Human mRNA for T3 epsilon chain (20K) of T-cell receptor (from peripheral blood lymphocytes). ACCESSION X03884 NID g37039 KEYWORDS membrane protein; signal peptide; T-cell receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1311) AUTHORS Gold,D.P., Puck,J.M., Pettey,C.L., Cho,M., Coligan,J., Woody,J.N. and Terhorst,C. TITLE Isolation of cDNA clones encoding the 20K non-glycosylated polypeptide chain of the human T-cell receptor/T3 complex JOURNAL Nature 321 (6068), 431-434 (1986) MEDLINE 86230866 REMARK Erratum:[Nature 1986 Dec 18-31;324(6098):702]] REFERENCE 2 (bases 1 to 1311) AUTHORS Terhorst,C. TITLE Direct Submission JOURNAL Submitted (05-JAN-1987) to the EMBL/GenBank/DDBJ databases COMMENT Data kindly reviewed (05-JAN-1987) by C. Terhorst. FEATURES Location/Qualifiers source 1..1311 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..1311 /note="mRNA fragment" sig_peptide 55..120 CDS 55..678 /codon_start=1 /product="20K polypeptide" /db_xref="PID:g469945" /db_xref="SWISS-PROT:P07766" /translation="MQSGTHWRVLGLCLLSVGVWGQDGNEEMGGITQTPYKVSISGTT VILTCPQYPGSEILWQHNDKNIGGDEDDKNIGSDEDHLSLKEFSELEQSGYYVCYPRG SKPEDANFYLYLRARVCENCMEMDVMSVATIVIVDICITGGLLLLVYYWSKNRKAKAK PVTRGAGAGGRQRGQNKERPPPVPNPDYEPIRKGQRDLYSGLNQRRI" mat_peptide 121..675 /product="20K polyeptide" old_sequence 525..526 /note="ca was cca in [1]" /citation=[1] old_sequence 558..559 /note="cu was ccu in [1]" /citation=[1] old_sequence 634..636 /note="aag was ag in [1]" /citation=[1] old_sequence 895..896 /note="gc was ggc in [1]" /citation=[1] polyA_site 1311 /note="polyA site" BASE COUNT 305 a 383 c 301 g 322 t ORIGIN 1 gtaagtctgc tggcctccgc catcttagta aagtaacagt cccatgaaac aaagatgcag 61 tcgggcactc actggagagt tctgggcctc tgcctcttat cagttggcgt ttgggggcaa 121 gatggtaatg aagaaatggg tggtattaca cagacaccat ataaagtctc catctctgga 181 accacagtaa tattgacatg ccctcagtat cctggatctg aaatactatg gcaacacaat 241 gataaaaaca taggcggtga tgaggatgat aaaaacatag gcagtgatga ggatcacctg 301 tcactgaagg aattttcaga attggagcaa agtggttatt atgtctgcta ccccagagga 361 agcaaaccag aagatgcgaa cttttatctc tacctgaggg caagagtgtg tgagaactgc 421 atggagatgg atgtgatgtc ggtggccaca attgtcatag tggacatctg catcactggg 481 ggcttgctgc tgctggttta ctactggagc aagaatagaa aggccaaggc caagcctgtg 541 acacgaggag cgggtgctgg cggcaggcaa aggggacaaa acaaggagag gccaccacct 601 gttcccaacc cagactatga gcccatccgg aaaggccagc gggacctgta ttctggcctg 661 aatcagagac gcatctgacc ctctggagaa cactgcctcc cgctggccca ggtctcctct 721 ccagtccccc tgcgactccc tgtttcctgg gctagtcttg gaccccacga gagagaatcg 781 ttcctcagcc tcatggtgaa ctcgcgccct ccagcctgat cccccgctcc ctcctccctg 841 ccttctctgc tggtacccag tcctaaaata ttgctgcttc ctcttccttt gaagcatcat 901 cagtagtcac accctcacag ctggcctgcc ctcttgccag gatatttatt tgtgctattc 961 actcccttcc ctttggatgt aacttctccg ttcagttccc tccttttctt gcatgtaagt 1021 tgtcccccat cccaaagtat tccatctact tttctatcgc cgtccccttt tgcagccctc 1081 tctggggatg gactgggtaa atgttgacag aggccctgcc ccgttcacag atcctggccc 1141 tgagccagcc ctgtgctcct ccctccccca acactcccta ccaaccccct aatcccctac 1201 tccctccaac cccccctccc actgtaggcc actggatggt catttggcat ctccgtatat 1261 gtgctctggc tcctcagctg agagagaaaa aaataaactg tatttggctg c // LOCUS HSTE2 861 bp RNA PRI 31-JUL-1994 DEFINITION H.sapiens TE2 mRNA for ARD-1 N-acetyltransferase homologue. ACCESSION X77588 NID g517484 KEYWORDS N-acetyltransferase homologue; TE2 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 861) AUTHORS Tribioli,C., Mancini,M., Plassart,E., Bione,S., Rivella,S., Sala,C., Torri,G. and Toniolo,D. TITLE Isolation of new genes in distal Xq28: transcriptional map and identification of a human homologue of the ARD1 N-acetyl transferase of Saccharomyces cerevisiae JOURNAL Hum. Mol. Genet. 3 (7), 1061-1067 (1994) MEDLINE 95072568 REFERENCE 2 (bases 1 to 861) AUTHORS Tribioli,C. TITLE Direct Submission JOURNAL Submitted (09-FEB-1994) C. Tribioli, Istituto di Genetica Biochimica ed Evoluzionistica, C.N.R., Via Abbiategrasso 207, 27100 Pavia, ITALY FEATURES Location/Qualifiers source 1..861 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="TE2" /chromosome="X" /map="Xq28" gene 96..803 /gene="TE2" CDS 96..803 /gene="TE2" /codon_start=1 /product="ARD1 N-acetyl transferase homologue" /db_xref="PID:g517485" /db_xref="SWISS-PROT:P41227" /translation="MNIRNARPEDLMNMQHCNLLCLPENYQMKYYFYHGLSWPQLSYI AEDENGKIVGYVLAKMEEDPDDVPHGHITSLAVKRSHRRLGLAQKLMDQASRAMIENF NAKYVSLHVRKSNRAALHLYSNTLNFQISEVEPKYYADGEDAYAMKRDLTQMADELRR HLELKEKGRHVVLGAIENKVESKGNSPPSSGEACREEKGLAAEDSGGDSKDLSEVSET TESTDVKDSSEASDSAS" polyA_signal 842..847 BASE COUNT 204 a 262 c 247 g 148 t ORIGIN 1 cggccagccc cggccgtccc ggcgtcgctt cggagcgcgg cggcagctga ctgcgccttc 61 acgatccgct gggacccgcg agccccgccg ccgttatgaa catccgcaat gcgaggccag 121 aggacctaat gaacatgcag cactgcaacc tcctctgcct gcccgagaac taccagatga 181 aatactactt ctaccatggc ctttcctggc cccagctctc ttacattgct gaggacgaga 241 atgggaagat tgtggggtat gtcctggcca aaatggaaga ggacccagat gatgtgcccc 301 atggacatat cacctcattg gctgtgaagc gttcccaccg gcgcctcggt ctggctcaga 361 aactgatgga ccaggcctct cgagccatga tagagaactt caatgccaaa tatgtctccc 421 tgcatgtcag gaagagtaac cgggccgccc tgcacctcta ttccaacacc ctcaactttc 481 agatcagtga agtggagccc aaatactatg cagatgggga ggacgcctat gccatgaagc 541 gggacctcac tcagatggcc gacgagctga ggcggcacct ggagctgaaa gagaagggca 601 ggcacgtggt gctgggtgcc atcgagaaca aggtggagag caaaggcaat tcacctccga 661 gctcaggaga ggcctgtcgc gaggagaagg gcctggctgc cgaggatagt ggtggggaca 721 gcaaggacct cagcgaggtc agcgagacca cagagagcac agatgtcaag gacagctcag 781 aggcctccga ctcagcctcc tagagcctgc cccatcccct cctcacccca cgagctttca 841 caataaattc gctcgtggcc g // LOCUS HSTEGT 2600 bp RNA PRI 28-SEP-1995 DEFINITION H.sapiens TEGT gene. ACCESSION X75861 NID g456258 KEYWORDS TEGT gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2600) AUTHORS Walter,L. TITLE Direct Submission JOURNAL Submitted (14-JAN-1994) L. Walter, Abteilung Immungenetik der Universitaet Goettingen, Gosslerstr 12d, 37073 Goettingen, FRG REFERENCE 2 (bases 1 to 2600) AUTHORS Walter,L., Marynen,P., Szpirer,J., Levan,G. and Gunther,E. TITLE Identification of a novel conserved human gene, TEGT JOURNAL Genomics 28 (2), 301-304 (1995) MEDLINE 96015061 FEATURES Location/Qualifiers source 1..2600 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda gt11 cDNA" /tissue_type="testis" /chromosome="12q12-q13" gene 41..754 /gene="TEGT" CDS 41..754 /gene="TEGT" /codon_start=1 /db_xref="PID:g458545" /translation="MNIFDRKINFDALLKFSHITPSTQQHLKKVYASFALCMFVAAAG AYVHMVTHFIQAGLLSALGSLILMIWLMATPHSHETEQKRLGLLAGFAFLTGVGLGPA LEFCIAVNPSILPTAFMGTAMIFTCFTLSALYARRRSYLFLGGILMSALSLLLLSSLG NVFFGSIWPFQANLYVGLVVMCGFVLVDTQLIIEKAEHGDQDYIWHCIDLFLDFITVF RKLMMILAMNEKDKKKEKK" BASE COUNT 619 a 621 c 590 g 770 t ORIGIN 1 ggttaggaag agtggagact gctgcacgga ctctggaacc atgaacatat ttgatcgaaa 61 gatcaacttt gatgcgcttt taaaattttc tcatataacc ccgtcaacgc agcagcacct 121 gaagaaggtc tatgcaagtt ttgccctttg tatgtttgtg gcggctgcag gggcctatgt 181 ccatatggtc actcatttca ttcaggctgg cctgctgtct gccttgggct ccctgatatt 241 gatgatttgg ctgatggcaa cacctcatag ccatgaaact gaacagaaaa gactgggact 301 tcttgctgga tttgcattcc ttacaggagt tggcctgggc cctgccctgg agttttgtat 361 tgctgtcaac cccagcatcc ttcccactgc tttcatgggc acagcaatga tctttacctg 421 cttcaccctc agtgcactct atgccaggcg ccgtagctac ctctttctgg gaggtatctt 481 gatgtcagcc ctgagcttgt tgcttttgtc ttccctgggg aatgttttct ttggatccat 541 ttggcctttc caggcaaacc tgtatgtggg actggtggtc atgtgtggct tcgtccttgt 601 tgatactcaa ctcattattg aaaaggccga acatggagat caagattata tctggcactg 661 cattgatctc ttcttagatt tcattactgt cttcagaaaa ctcatgatga tcctggccat 721 gaatgaaaag gataagaaga aagagaagaa atgaagtgac catccagcct ttcccaatta 781 gacttcctct ccttccaccc ctcatttcct ttttgcacac attacaggtg gtgtgttctg 841 tgataatgaa aagcatcaga aaagcttttg tactttgtgg tttcctctat tttgaatttt 901 ttgatcaaaa aactgattag cagaatatag tttggagttt ggcttcatct tcctggggtt 961 cccctcactc ccttttttgt caaccccatc tgtagcctct tcctctactc aggcagtcga 1021 cccgccacga tgagaagtgg gaccagccag agggcgccaa cttcaggagt ccgctttccc 1081 accaggcttc attcacccag tggacctgaa ctgtttggta gagccacccg gcccttcctt 1141 cctcattgtt gtttggtatg cgcacagttc ctgtgggact gggccgtgag ttttccattg 1201 gaaagaagtt cagtggtccc attgttaact cagcctcaaa tctcaactgt caggccctac 1261 aaagaaaatg gagagcctct tctggtggat gctttgctcc ctctgagctg cccatgctgg 1321 tctggcaaac acacctttct gctttgcctt cacaaaagta atgtgttccc tttcccgccc 1381 cttgcctgac cctcagggag tcagcctgct tccatccatg ggtgggaaga cttcagcaca 1441 aaggaaagac taattcttgt caggcatttt tgaaaaggct gattatgtgt atcaaggtac 1501 agctacgtag gttcccctaa acttgccctg tttttgtttt tttagtttgt tatcccctta 1561 ctgagcggcc tctactaggt ggctgtgatt aaatgtccca agcaaggata gggaagggga 1621 atggttgagc ctctggagat cattgtaacc aatcctgcca gacctgtttg gggcagtggg 1681 gagcaaacct agataaggac ctgtttgggg cagcagggag caaaatctcc tttaacaacc 1741 aagcagttcc tcattcacat caacagagcg aggctgtgat aacttaggag gcagcaatcc 1801 taatagtcct tcagtgcatt ttagtctgtc tccaactgga caccagtagg tagtgtcaag 1861 ccagagattc ggggcagtag ataaatgttc attttactga tgcactttag tttttggtct 1921 gttacctgtt ttccagaaat ttgtggcctt ttaggcggga gttaggcgac caaaccagtg 1981 agagccccaa tccctgcagt tttgtggctt caagtgtggg tggacagtcc taatggggat 2041 ctccagctcc ttcctgtggg ctgccacaga cagctacccc cagaagggtc aatgttggga 2101 gtggttgtgg ctctgagctg ctctacagag cttcagtgtg agaggatcga gccattgaaa 2161 gctcattacc agtaggacat aatttttggc tctccctatt cacaaccagt gcacagtttg 2221 acagtggcct caggttcaca gtgcaccatg tcactgtgct atcctacgaa atcatttgtt 2281 tctaagttgt gtttattcct ggagtgacat gccaccccga atggctcact ttcactgagg 2341 atgctgtcct ctgatttagc tgctgcctcc agcctctggc ttgagaactt actaaaggca 2401 cttccttcct gttaaacccc tgttaactct ccataaattt ggtgattctc tgctaggcct 2461 aagattttga gttaacatct cttgaagcca aactccacct tctgtgcttt ttgcttggga 2521 taatggagtt tttctttaga aacagtgcca agaatgacaa gatattaaaa aaaaaaaaaa 2581 aaaaaaaaaa aaaaaaaaaa // LOCUS HSTELETHO 959 bp RNA PRI 14-OCT-1997 DEFINITION Homo sapiens mRNA for telethonin. ACCESSION AJ000491 NID g2330600 KEYWORDS 19 kDa protein; sarcomeric protein; telethonin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 959) AUTHORS Valle,G. TITLE Direct Submission JOURNAL Submitted (17-JUL-1997) Valle G., CRIBI Biotechnology Centre, Universita di Padova, via U. Bassi 58b, Padova, 35121, ITALY REFERENCE 2 (bases 1 to 959) AUTHORS Valle,G., Faulkner,G., De Antoni,A., Pacchioni,B., Pallavicini,A., Pandolfo,D., Tiso,N., Toppo,S., Trevisan,S. and Lanfranchi,G. TITLE Telethonin, a novel sarcomeric protein of heart and skeletal muscle JOURNAL FEBS Lett. 415 (2), 163-168 (1997) MEDLINE 98010471 FEATURES Location/Qualifiers source 1..959 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17" /map="q12" /tissue_type="skeletal muscle" CDS 11..514 /note="19kD sarcomeric protein" /codon_start=1 /product="telethonin" /db_xref="PID:e329768" /db_xref="PID:g2330601" /translation="MATSELSCEVSEENCERREAFWAEWKDLTLSTRPEEGCSLHEED TQRHETYHQQGQCQVLVQRSPWLMMRMGILGRGLQEYQLPYQRVLPLPIFTPAKMGAT KEEREDTPIQLQELLALETALGGQCVDRQEVAEITKQLPPVVPVSKPGALRRSLSRSM SQEAQRG" BASE COUNT 196 a 257 c 355 g 151 t ORIGIN 1 cggcacgagc atggctacct cagagctgag ctgcgaggtg tcggaggaga actgtgagcg 61 ccgggaggcc ttctgggcag aatggaagga tctgacactg tccacacggc ccgaggaggg 121 ctgctccctg catgaggagg acacccagag acatgagacc taccaccagc aggggcagtg 181 ccaggtgctg gtgcagcgct cgccctggct gatgatgcgg atgggcatcc tcggccgtgg 241 gctgcaggag taccagctgc cctaccagcg ggtactgccg ctgcccatct tcacccctgc 301 caagatgggc gccaccaagg aggagcgtga ggacaccccc atccagcttc aggagctgct 361 ggcgctggag acagccctgg gtggccagtg tgtggaccgc caggaggtgg ctgagatcac 421 aaagcagctg ccccctgtgg tgcctgtcag caagcccggt gcacttcgtc gctccctgtc 481 ccgctccatg tcccaggaag cacagagagg ctgagaggga ctgtgacttg ggctccgctg 541 tgcccgccct gggctgggcc cttcctggct aggactgtgg aggggagctg ctggccatgg 601 ctgctttgta gtttgcccag agttgggggc taggggaggg gggagccaga ggccaggatg 661 cctgagcccc ctgagttccc aaagggaggg tggcagagac agtgggcact aagggtggag 721 agttgggggc cagcacagct gaggaccctc agccccagga gaagggacaa aaggtactgg 781 tgagggcaag aggtgcctgg gaggagtggc cctgatccag gaaaatgtga ggggaatctg 841 gaacgctcta ggcagaagaa gctgggaggg agggggaggt gaaaagggca gaggcaagga 901 tggtggggcc cccagcaccc tctgttagtg ccgcaataaa tgctcaatca tgtgccaga // LOCUS HSTENAS3 7560 bp RNA PRI 02-MAY-1995 DEFINITION H.sapiens mRNA for tenascin-C, 7560bp. ACCESSION X78565 NID g556844 KEYWORDS tenascin-C; wnascin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7560) AUTHORS Gherzi,R., Carnemolla,B., Siri,A., Ponassi,M., Balza,E. and Zardi,L. TITLE Human tenascin gene. Structure of the 5'-region, identification, and characterization of the transcription regulatory sequences JOURNAL J. Biol. Chem. 270 (7), 3429-3434 (1995) MEDLINE 95155442 REFERENCE 2 (bases 1 to 7560) AUTHORS Zardi,L. TITLE Direct Submission JOURNAL Submitted (26-JUL-1994) Luciano Zardi, Cell Biology Laboratory, Istituto Nazionale per la, Ricerca sul Cancro, Viale Benedetto XV, 10, Genova, 16132, Italy FEATURES Location/Qualifiers source 1..7560 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="Adult, Fetal" /tissue_type="melanoma, fetal brain, breast, genomic from placenta" /sex="Male" CDS 314..6919 /codon_start=1 /product="human tenascin-C" /db_xref="PID:g556845" /translation="MGAMTQLLAGVFLAFLALATEGGVLKKVIRHKRQSGVNATLPEE NQPVVFNHVYNIKLPVGSQCSVDLESASGEKDLAPPSEPSESFQEHTVDGENQIVFTH RINIPRRACGCAAAPDVKELLSRLEELENLVSSLREQCTAGAGCCLQPATGRLDTRPF CSGRGNFSTEGCGCVCEPGWKGPNCSEPECPGNCHLRGRCIDGQCICDDGFTGEDCSQ LACPSDCNDQGKCVNGVCICFEGYAGADCSREICPVPCSEEHGTCVDGLCVCHDGFAG DDCNKPLCLNNCYNRGRCVENECVCDEGFTGEDCSELICPNDCFDRGRCINGTCYCEE GFTGEDCGKPTCPHACHTQGRCEEGQCVCDEGFAGLDCSEKRCPADCHNRGRCVDGRC ECDDGFTGADCGELKCPNGCSGHGRCVNGQCVCDEGYTGEDCSQLRCPNDCHSRGRCV EGKCVCEQGFKGYDCSDMSCPNDCHQHGRCVNGMCVCDDGYTGEDCRDRQCPRDCSNR GLCVDGQCVCEDGFTGPDCAELSCPNDCHGQGRCVNGQCVCHEGFMGKDCKEQRCPSD CHGQGRCVDGQCICHEGFTGLDCGQHSCPSDCNNLGQCVSGRCICNEGYSGEDCSEVS PPKDLVVTEVTEETVNLAWDNEMRVTEYLVVYTPTHEGGLEMQFRVPGDQTSTIIQEL EPGVEYFIRVFAILENKKSIPVSARVATYLPAPEGLKFKSIKETSVEVEWDPLDIAFE TWEIIFRNMNKEDEGEITKSLRRPETSYRQTGLAPGQEYEISLHIVKNNTRGPGLKRV TTTRLDAPSQIEVKDVTDTTALITWFKPLAEIDGIELTYGIKDVPGDRTTIDLTEDEN QYSIGNLKPDTEYEVSLISRRGDMSSNPAKETFTTGLDAPRNLRRVSQTDNSITLEWR NGKAAIDSYRIKYAPISGGDHAEVDVPKSQQATTKTTLTGLRPGTEYGIGVSAVKEDK ESNPATINAATELDTPKDLQVSETAETSLTLLWKTPLAKFDRYRLNYSLPTGQWVGVQ LPRNTTSYVLRGLEPGQEYNVLLTAEKGRHKSKPARVKASTEQAPELENLTVTEVGWD GLRLNWTAADQAYEHFIIQVQEANKVEAARNLTVPGSLRAVDIPGLKAATPYTVSIYG VIQGYRTPVLSAEASTGETPNLGEVVVAEVGWDALKLNWTAPEGAYEYFFIQVQEADT VEAAQNLTVPGGLRSTDLPGLKAATHYTITIRGVTQDFSTTPLSVEVLTEEVPDMGNL TVTEVSWDALRLNWTTPDGTYDQFTIQVQEADQVEEAHNLTVPGSLRSMEIPGLRAGT PYTVTLHGEVRGHSTRPLAVEVVTEDLPQLGDLAVSEVGWDGLRLNWTAADNAYEHFV IQVQEVNKVEAAQNLTLPGSLRAVDIPGLEAATPYRVSIYGVIRGYRTPVLSAEASTA KEPEIGNLNVSDITPESFNLSWMATDGIFETFTIEIIDSNRLLETVEYNISGAERTAH ISGLPPSTDFIVYLSGLAPSIRTKTISATATTEALPLLENLTISDINPYGFTVSWMAS ENAFDSFLVTVVDSGKLLDPQEFTLSGTQRKLELRGLITGIGYEVMVSGFTQGHQTKP LRAEIVTEAEPEVDNLLVSDATPDGFRLSWTADEGVFDNFVLKIRDTKKQSEPLEITL LAPERTRDLTGLREATEYEIELYGISKGRRSQTVSAIATTAMGSPKEVIFSDITENSA TVSWRAPTAQVESFRITYVPITGGTPSMVTVDGTKTQTRLVKLIPGVEYLVSIIAMKG FEESEPVSGSFTTALDGPSGLVTANITDSEALARWQPAIATVDSYVISYTGEKVPEIT RTVSGNTVEYALTDLEPATEYTLRIFAEKGPQKSSTITAKFTTDLDSPRDLTATEVQS ETALLTWRPPRASVTGYLLVYESVDGTVKEVIVGPDTTSYSLADLSPSTHYTAKIQAL NGPLRSNMIQTIFTTIGLLYPFPKDCSQAMLNGDTTSGLYTIYLNGDKAQALEVFCDM TSDGGGWIVFLRRKNGRENFYQNWKAYAAGFGDRREEFWLGLDNLNKITAQGQYELRV DLRDHGETAFAVYDKFSVGDAKTRYKLKVEGYSGTAGDSMAYHNGRSFSTFDKDTDSA ITNCALSYKGAFWYRNCHRVNLMGRYGDNNHSQGVNWFHWKGHEHSIQFAEMKLRPSN FRNLEGRRKRA" polyA_signal 7522..7527 BASE COUNT 1894 a 2019 c 2062 g 1585 t ORIGIN 1 accggccaca gcctgcctac tgtcacccgc ctctcccgcg cgcagataca cgcccccgcc 61 tccgtgggca caaaggcagc gctgctgggg aactcggggg aacgcgcacg tgggaaccgc 121 cgcagctcca cactccaggt acttcttcca aggacctagg tctctcgccc atcggaaaga 181 aaataattct ttcaagaaga tcagggacaa ctgatttgaa gtctactctg tgcttctaaa 241 tccccaattc tgctgaaagt gaatccctag agccctagag ccccagcagc acccagccaa 301 acccacctcc accatggggg ccatgactca gctgttggca ggtgtctttc ttgctttcct 361 tgccctcgct accgaaggtg gggtcctcaa gaaagtcatc cggcacaagc gacagagtgg 421 ggtgaacgcc accctgccag aagagaacca gccagtggtg tttaaccacg tttacaacat 481 caagctgcca gtgggatccc agtgttcggt ggatctggag tcagccagtg gggagaaaga 541 cctggcaccg ccttcagagc ccagcgaaag ctttcaggag cacacagtag atggggaaaa 601 ccagattgtc ttcacacatc gcatcaacat cccccgccgg gcctgtggct gtgccgcagc 661 ccctgatgtt aaggagctgc tgagcagact ggaggagctg gagaacctgg tgtcttccct 721 gagggagcaa tgtactgcag gagcaggctg ctgtctccag cctgccacag gccgcttgga 781 caccaggccc ttctgtagcg gtcggggcaa cttcagcact gaaggatgtg gctgtgtctg 841 cgaacctggc tggaaaggcc ccaactgctc tgagcccgaa tgtccaggca actgtcacct 901 tcgaggccgg tgcattgatg ggcagtgcat ctgtgacgac ggcttcacgg gcgaggactg 961 cagccagctg gcttgcccca gcgactgcaa tgaccagggc aagtgcgtga atggagtctg 1021 catctgtttc gaaggctacg ccggggctga ctgcagccgt gaaatctgcc cagtgccctg 1081 cagtgaggag cacggcacat gtgtagatgg cttgtgtgtg tgccacgatg gctttgcagg 1141 cgatgactgc aacaagcctc tgtgtctcaa caattgctac aaccgtggac gatgcgtgga 1201 gaatgagtgc gtgtgtgatg agggtttcac gggcgaagac tgcagtgagc tcatctgccc 1261 caatgactgc ttcgaccggg gccgctgcat caatggcacc tgctactgcg aagaaggctt 1321 cacaggtgaa gactgcggga aacccacctg cccacatgcc tgccacaccc agggccggtg 1381 tgaggagggg cagtgtgtat gtgatgaggg ctttgccggt ttggactgca gcgagaagag 1441 gtgtcctgct gactgtcaca atcgtggccg ctgtgtagac gggcggtgtg agtgtgatga 1501 tggtttcact ggagctgact gtggggagct caagtgtccc aatggctgca gtggccatgg 1561 ccgctgtgtc aatgggcagt gtgtgtgtga tgagggctat actggggagg actgcagcca 1621 gctacggtgc cccaatgact gtcacagtcg gggccgctgt gtcgagggca aatgtgtatg 1681 tgagcaaggc ttcaagggct atgactgcag tgacatgagc tgccctaatg actgtcacca 1741 gcacggccgc tgtgtgaatg gcatgtgtgt ttgtgatgac ggctacacag gggaagactg 1801 ccgggatcgc caatgcccca gggactgcag caacaggggc ctctgtgtgg acggacagtg 1861 cgtctgtgag gacggcttca ccggccctga ctgtgcagaa ctctcctgtc caaatgactg 1921 ccatggccag ggtcgctgtg tgaatgggca gtgcgtgtgc catgaaggat ttatgggcaa 1981 agactgcaag gagcaaagat gtcccagtga ctgtcatggc cagggccgct gcgtggacgg 2041 ccagtgcatc tgccacgagg gcttcacagg cctggactgt ggccagcact cctgccccag 2101 tgactgcaac aacttaggac aatgcgtctc gggccgctgc atctgcaacg agggctacag 2161 cggagaagac tgctcagagg tgtctcctcc caaagacctc gttgtgacag aagtgacgga 2221 agagacggtc aacctggcct gggacaatga gatgcgggtc acagagtacc ttgtcgtgta 2281 cacgcccacc cacgagggtg gtctggaaat gcagttccgt gtgcctgggg accagacgtc 2341 caccatcatc caggagctgg agcctggtgt ggagtacttt atccgtgtat ttgccatcct 2401 ggagaacaag aagagcattc ctgtcagcgc cagggtggcc acgtacttac ctgcacctga 2461 aggcctgaaa ttcaagtcca tcaaggagac atctgtggaa gtggagtggg atcctctaga 2521 cattgctttt gaaacctggg agatcatctt ccggaatatg aataaagaag atgagggaga 2581 gatcaccaaa agcctgagga ggccagagac ctcttaccgg caaactggtc tagctcctgg 2641 gcaagagtat gagatatctc tgcacatagt gaaaaacaat acccggggcc ctggcctgaa 2701 gagggtgacc accacacgct tggatgcccc cagccagatc gaggtgaaag atgtcacaga 2761 caccactgcc ttgatcacct ggttcaagcc cctggctgag atcgatggca ttgagctgac 2821 ctacggcatc aaagacgtgc caggagaccg taccaccatc gatctcacag aggacgagaa 2881 ccagtactcc atcgggaacc tgaagcctga cactgagtac gaggtgtccc tcatctcccg 2941 cagaggtgac atgtcaagca acccagccaa agagaccttc acaacaggcc tcgatgctcc 3001 caggaatctt cgacgtgttt cccagacaga taacagcatc accctggaat ggaggaatgg 3061 caaggcagct attgacagtt acagaattaa gtatgccccc atctctggag gggaccacgc 3121 tgaggttgat gttccaaaga gccaacaagc cacaaccaaa accacactca caggtctgag 3181 gccgggaact gaatatggga ttggagtttc tgctgtgaag gaagacaagg agagcaatcc 3241 agcgaccatc aacgcagcca cagagttgga cacgcccaag gaccttcagg tttctgaaac 3301 tgcagagacc agcctgaccc tgctctggaa gacaccgttg gccaaatttg accgctaccg 3361 cctcaattac agtctcccca caggccagtg ggtgggagtg cagcttccaa gaaacaccac 3421 ttcctatgtc ctgagaggcc tggaaccagg acaggagtac aatgtcctcc tgacagccga 3481 gaaaggcaga cacaagagca agcccgcacg tgtgaaggca tccactgaac aagcccctga 3541 gctggaaaac ctcaccgtga ctgaggttgg ctgggatggc ctcagactca actggaccgc 3601 ggctgaccag gcctatgagc actttatcat tcaggtgcag gaggccaaca aggtggaggc 3661 agctcggaac ctcaccgtgc ctggcagcct tcgggctgtg gacataccgg gcctcaaggc 3721 tgctacgcct tatacagtct ccatctatgg ggtgatccag ggctatagaa caccagtgct 3781 ctctgctgag gcctccacag gggaaactcc caatttggga gaggtcgtgg tggccgaggt 3841 gggctgggat gccctcaaac tcaactggac tgctccagaa ggggcctatg agtacttttt 3901 cattcaggtg caggaggctg acacagtaga ggcagcccag aacctcaccg tcccaggagg 3961 actgaggtcc acagacctgc ctgggctcaa agcagccact cattatacca tcaccatccg 4021 cggggtcact caggacttca gcacaacccc tctctctgtt gaagtcttga cagaggaggt 4081 tccagatatg ggaaacctca cagtgaccga ggttagctgg gatgctctca gactgaactg 4141 gaccacgcca gatggaacct atgaccagtt tactattcag gtccaggagg ctgaccaggt 4201 ggaagaggct cacaatctca cggttcctgg cagcctgcgt tccatggaaa tcccaggcct 4261 cagggctggc actccttaca cagtcaccct gcacggcgag gtcaggggcc acagcactcg 4321 accccttgct gtagaggtcg tcacagagga tctcccacag ctgggagatt tagccgtgtc 4381 tgaggttggc tgggatggcc tcagactcaa ctggaccgca gctgacaatg cctatgagca 4441 ctttgtcatt caggtgcagg aggtcaacaa agtggaggca gcccagaacc tcacgttgcc 4501 tggcagcctc agggctgtgg acatcccggg cctcgaggct gccacgcctt atagagtctc 4561 catctatggg gtgatccggg gctatagaac accagtactc tctgctgagg cctccacagc 4621 caaagaacct gaaattggaa acttaaatgt ttctgacata actcccgaga gcttcaatct 4681 ctcctggatg gctaccgatg ggatcttcga gacctttacc attgaaatta ttgattccaa 4741 taggttgctg gagactgtgg aatataatat ctctggtgct gaacgaactg cccatatctc 4801 agggctaccc cctagtactg attttattgt ctacctctct ggacttgctc ccagcatccg 4861 gaccaaaacc atcagtgcca cagccacgac agaggccctg ccccttctgg aaaacctaac 4921 catttccgac attaatccct acgggttcac agtttcctgg atggcatcgg agaatgcctt 4981 tgacagcttt ctagtaacgg tggtggattc tgggaagctg ctggaccccc aggaattcac 5041 actttcagga acccagagga agctggagct tagaggcctc ataactggca ttggctatga 5101 ggttatggtc tctggcttca cccaagggca tcaaaccaag cccttgaggg ctgagattgt 5161 tacagaagcc gaaccggaag ttgacaacct tctggtttca gatgccaccc cagacggttt 5221 ccgtctgtcc tggacagctg atgaaggggt cttcgacaat tttgttctca aaatcagaga 5281 taccaaaaag cagtctgagc cactggaaat aaccctactt gcccccgaac gtaccaggga 5341 cttaacaggt ctcagagagg ctactgaata cgaaattgaa ctctatggaa taagcaaagg 5401 aaggcgatcc cagacagtca gtgctatagc aacaacagcc atgggctccc caaaggaagt 5461 cattttctca gacatcactg aaaattcggc tactgtcagc tggagggcac ccacggccca 5521 agtggagagc ttccggatta cctatgtgcc cattacagga ggtacaccct ccatggtaac 5581 tgtggacgga accaagactc agaccaggct ggtgaaactc atacctggcg tggagtacct 5641 tgtcagcatc atcgccatga agggctttga ggaaagtgaa cctgtctcag ggtcattcac 5701 cacagctctg gatggcccat ctggcctggt gacagccaac atcactgact cagaagcctt 5761 ggccaggtgg cagccagcca ttgccactgt ggacagttat gtcatctcct acacaggcga 5821 gaaagtgcca gaaattacac gcacggtgtc cgggaacaca gtggagtatg ctctgaccga 5881 cctcgagcct gccacggaat acacactgag aatctttgca gagaaagggc cccagaagag 5941 ctcaaccatc actgccaagt tcacaacaga cctcgattct ccaagagact tgactgctac 6001 tgaggttcag tcggaaactg ccctccttac ctggcgaccc ccccgggcat cagtcaccgg 6061 ttacctgctg gtctatgaat cagtggatgg cacagtcaag gaagtcattg tgggtccaga 6121 taccacctcc tacagcctgg cagacctgag cccatccacc cactacacag ccaagatcca 6181 ggcactcaat gggcccctga ggagcaatat gatccagacc atcttcacca caattggact 6241 cctgtacccc ttccccaagg actgctccca agcaatgctg aatggagaca cgacctctgg 6301 cctctacacc atttatctga atggtgataa ggctcaggcg ctggaagtct tctgtgacat 6361 gacctctgat gggggtggat ggattgtgtt cctgagacgc aaaaacggac gcgagaactt 6421 ctaccaaaac tggaaggcat atgctgctgg atttggggac cgcagagaag aattctggct 6481 tgggctggac aacctgaaca aaatcacagc ccaggggcag tacgagctcc gggtggacct 6541 gcgggaccat ggggagacag cctttgctgt ctatgacaag ttcagcgtgg gagatgccaa 6601 gactcgctac aagctgaagg tggaggggta cagtgggaca gcaggtgact ccatggccta 6661 ccacaatggc agatccttct ccacctttga caaggacaca gattcagcca tcaccaactg 6721 tgctctgtcc tacaaagggg ctttctggta caggaactgt caccgtgtca acctgatggg 6781 gagatatggg gacaataacc acagtcaggg cgttaactgg ttccactgga agggccacga 6841 acactcaatc cagtttgctg agatgaagct gagaccaagc aacttcagaa atcttgaagg 6901 caggcgcaaa cgggcataaa ttggagggac cactgggtga gagaggaata aggcggccca 6961 gagcgaggaa aggattttac caaagcatca atacaaccag cccaaccatc ggtccacacc 7021 tgggcatttg gtgagaatca aagctgacca tggatccctg gggccaacgg caacagcatg 7081 ggcctcacct cctctgtgat ttctttcttt gcaccaaaga catcagtctc caacatgttt 7141 ctgttttgtt gtttgattca gcaaaaatct cccagtgaca acatcgcaat agttttttac 7201 ttctcttagg tggctctggg atgggagagg ggtaggatgt acaggggtag tttgttttag 7261 aaccagccgt attttacatg aagctgtata attaattgtc attatttttg ttagcaaaga 7321 ttaaatgtgt cattggaagc catccctttt tttacatttc atacaacaga aaccagaaaa 7381 gcaatactgt ttccatttta aggatatgat taatattatt aatataataa tgatgatgat 7441 gatgatgaaa actaaggatt tttcaagaga tctttctttc caaaacattt ctggacagta 7501 cctgattgta tttttttttt aaataaaagc acaagtactt ttgaaaaaaa accggaattc // LOCUS HSTEST 3484 bp RNA PRI 01-MAY-1995 DEFINITION H.sapiens mRNA for testican. ACCESSION X73608 NID g793844 KEYWORDS testican. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3484) AUTHORS Alliel,P.M., Perin,J.P., Jolles,P. and Bonnet,F.J. TITLE Testican, a multidomain testicular proteoglycan resembling modulators of cell social behaviour JOURNAL Eur. J. Biochem. 214 (1), 347-350 (1993) MEDLINE 93285162 FEATURES Location/Qualifiers source 1..3484 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda gt11" CDS 435..1754 /codon_start=1 /product="testican" /db_xref="PID:g793845" /translation="MPAIAVLAAAAAAWCFLQVESRHLDALAGGAGPNHGNFLDNDQW LSTVSQYDRDKYWNRFRDDDYFRNWNPNKPFDQALDPSKDPCLKVKCSPHKVCVTQDY QTALCVSRKHLLPRQKKGNVAQKHWVGPSNLVKCKPCPVAQSAMVCGSDGHSYTSKCK LEFHACSTGKSLATLCDGPCPCLPEPEPPKHKAERSACTDKELRNLASRLKDWFGALH EDANRVIKPTSSNTAQGRFDTSILPICKDSLGWMFNKLDMNYDLLLDPSEINAIYLDK YEPCIKPLFNSCDSFKDGKLSNNEWCYCFQKPGGLPCQNEMNRIQKLSKGKSLLGAFI PRCNEEGYYKATQCHGSTGQCWCVDKYGNELAGSRKQGAVSCEEEQETSGDFGSGGSV VLLDDLEYERELGPKDKEGKLRVHTRAVTEDDEDEDDDKEDEVGYIW" BASE COUNT 894 a 910 c 808 g 872 t ORIGIN 1 cactctctgt tgtccaatgg acacacctgt cgtgttttga gccagcgaga gatgcagtgg 61 aagtgaaaag catggttaca gactccccat gcgacagtac actcttctga agtagcggac 121 gcctggttag cttgacattc tatgcaaaga tccataatgt ggttcctgca gatggcacag 181 ttatcaacca caatatccca ggcccagagg gctactgcat tccacttttt cacttcaaag 241 cgcttcttgc ccgcgccgct gttggtgccg ctcggggtat ccacatccat cgctgcgggc 301 tcacaaagcg gccagacgct cggcggcggc gtgtggcagg agcgcagggg cgcgagccgg 361 cgatcagcct tcccggcgac cgtgccgcgg gagctcgagc aactcggact aggggacccg 421 ggccggcccc caagatgccg gcgatcgcgg tgttggcggc ggccgccgcg gcgtggtgct 481 tcctccaagt cgagagccgg cacctggacg cgctcgccgg aggcgcgggc cccaaccacg 541 gcaatttcct agacaatgac cagtggctga gcaccgtctc ccagtacgac cgggacaagt 601 actggaaccg ctttcgagac gatgattatt tcagaaactg gaatcccaac aagccctttg 661 accaagccct ggacccatcc aaggacccct gcctgaaggt aaaatgcagc cctcacaaag 721 tgtgtgtgac ccaggactac cagaccgccc tgtgtgtcag ccgcaagcac ctgctcccca 781 ggcaaaagaa ggggaacgtg gcccagaaac actgggttgg accttcgaat ttggtcaagt 841 gcaagccctg tcccgtggca cagtcagcca tggtctgcgg ctcagatggc cactcctaca 901 catccaagtg caaattggag ttccatgctt gttctactgg caaaagcctc gccaccctct 961 gtgatgggcc ctgtccctgt ctcccagagc ctgagccacc aaagcacaag gcagaaagga 1021 gtgcctgcac agacaaggag ttgcggaacc ttgcctcccg gctgaaggat tggtttggag 1081 ctctccacga ggatgcgaac agagtcatca agcccaccag ctccaacaca gcccaaggca 1141 ggtttgacac tagcatcctg cccatctgca aggactccct gggctggatg ttcaacaagt 1201 tggacatgaa ctatgacctc ctgcttgacc cttcagagat caatgccatc tacctggata 1261 agtacgagcc ctgtatcaag cctcttttca actcgtgtga ctccttcaag gatggcaagc 1321 tttctaacaa tgagtggtgc tactgcttcc agaagcctgg aggtctccct tgccagaatg 1381 aaatgaacag aattcagaag ctgagtaagg ggaaaagcct gttgggggcc ttcatacctc 1441 ggtgtaatga ggagggctat tacaaagcca cacagtgcca cggcagcacg gggcagtgct 1501 ggtgtgtgga caaatatggg aatgagttgg ctggctccag gaaacagggt gctgtgagct 1561 gtgaagagga gcaggaaacc tcaggggatt ttggcagtgg tgggtccgtg gtcctgctgg 1621 atgacctaga atatgaacgg gagctgggac caaaggacaa agaggggaag ctgagggtgc 1681 acacccgagc cgtgacagag gatgatgagg atgaggatga tgacaaagag gatgaggtcg 1741 ggtacatatg gtagtgccca caagaaagag gacacaagtt ttgcacaaaa ttgcaagtca 1801 cttcctattc ctgcatttgt atctaagact ccaaggcacc aaggtctctt ctccattgtt 1861 gctctctata cccgacctaa ggtttggaag acaactgctt gttcccagag gattctgatt 1921 ttgcatatgt ttgtatggga gaaagggtgt tgtgtttttt tttttgttgt tgtttatttt 1981 ttggataggg aagtcattgg cttaattaga gcctccttcc tttctgtgag atttttccaa 2041 caagcatgtg atttacgtgg aattctgaca gtgcagggag cccccaccct cttaaatgtc 2101 aaagaccctt tttgattacc cacactggtg gttattacag catggttccc agccttacag 2161 tgtctaagtg cttctcttgt gtcctgtaga tgttgtgaaa aagaaaaaaa caaaaaatac 2221 accacactgt actttttccc cctgcccccg ttactgccgg tgattattat taaaaattag 2281 tttttttcac atcattctat ctggcttcct ataaacaaca gccttaattc agtcaagact 2341 ccctttggga attcatttta ttaaaaattg gtgtctggat acttccctgt acatgcataa 2401 atatgcatgc atgtacagaa agactgtatg tgtgtgcctt gcacacacac ccatacctct 2461 cagaaaaagt gtttgggtat cttaaaaact cgaaaaacaa tgataaattt ctcagcttgt 2521 ccagacctgg aacaaaattt ctggaataag aaatttgtat taaagtcctt ttttgcacta 2581 acagttggct cttgtagcct gcaggctgag gaagtctctt ctctgtgcat cagcagagtt 2641 actgaaagcc tctgattgag aaaaaacctc cgtctgccta aatcactttt ctcgcagaag 2701 ccatgcgact cccacacgac acgggcagct tcacaagcca tctctttcat ttctgcttga 2761 agccccttgg ctgcagcaat cctgtctgcc ataggtttct tccttcctta cctactcaag 2821 ggctttttct aaggcatgca cacatatctc ctgttctctg agagtaccat ggtgttcctt 2881 aaaagaagaa aatttctaat tctgaactca atgttttgct tttactccct ttctactgac 2941 aaatcatgat aagggcacaa aagctgtaca gatttttttt tttaaccact caatcccaaa 3001 tggaggccta caaagaacat cgtaataaca catggaagca aaccccgggt ttttaagagc 3061 aaattctgtc cccccctcac tcccccaagt gacaagatac taatgaagaa agttcttcac 3121 catagtgttt gttttaacta aactcattgg agtctagttc caaatttggt agggtcatca 3181 tctctacatt ccttaggatt tctctcccta tcaagctggc ccagatacaa gtaccaaaca 3241 gtagtctctg aagttcccat ttccttcagt accagtctat aagctactgt ccgccactga 3301 ttttcatcta tcagggtgtc ctaatcagaa tcagccaccc aagcaagcct ctctggccca 3361 catatctatc tcttgccttc ccccatgaac ttcagcctgt ccacacaaaa gccacataaa 3421 ctcaagcaag aaatatgttc agccaaaaca tgattatagt ggcagctgac caatacccca 3481 cccc // LOCUS HSTF2B 1594 bp RNA PRI 15-JUN-1992 DEFINITION Human mRNA for general transcription factor IIB. ACCESSION X59268 NID g37057 KEYWORDS TF2B gene; transcription initiation factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1594) AUTHORS Ha,I. TITLE Direct Submission JOURNAL Submitted (27-APR-1991) I. Ha, Rutgers University, Dept of Biochemistry, Piscataway, NJ 08854, USA REFERENCE 2 (bases 1 to 1594) AUTHORS Ha,I., Lane,W.S. and Reinberg,D. TITLE Cloning of a human gene encoding the general transcription initiation factor IIB JOURNAL Nature 352 (6337), 689-695 (1991) MEDLINE 91342994 FEATURES Location/Qualifiers source 1..1594 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /clone_lib="cDNA" mRNA 1..1594 /gene="TF2B" /note="cDNA" /evidence=experimental gene 1..1594 /gene="TF2B" repeat_unit complement(124..200) /gene="TF2B" /rpt_type=DIRECT misc_feature complement(190..210) /gene="TF2B" /note="sigma homology" repeat_unit complement(218..294) /gene="TF2B" /rpt_type=DIRECT CDS 361..1311 /gene="TF2B" /codon_start=1 /product="IIB protein" /db_xref="PID:g37058" /db_xref="SWISS-PROT:Q00403" /translation="MASTSRLDALPRVTCPNHPDAILVEDYRAGDMICPECGLVVGDR VIDVGSEWRTFSNDKATKDPSRVGDSQNPLLSDGDLSTMIGKGTGAASFDEFGNSKYQ NRRTMSSSDRAMMNAFKEITTMADRINLPRNIVDRTNNLFKQVYEQKSLKGRANDAIA SACLYIACRQEGVPRTFKEICAVSRISKKEIGRCFKLILKALETSVDLITTGDFMSRF CSNLCLPKQVQMAATHIARKAVELDLVPGRSPISVAAAAIYMASQASAEKRTQKEIGD IAGVADVTIRQSYRLIYPRAPDLFPTDFKFDTPVDKLPQL" BASE COUNT 476 a 359 c 314 g 445 t ORIGIN 1 ggaattcctc tctttattgt cagggtcctc tccctaggag gcctgccccc gctaaccggc 61 tttttgccca aatgggccat tatcgaagaa ttcacaaaaa acaatagcct catcatcccc 121 accatcatag ccaccatcac cctccttaac ctctacttct acctacgcct aatctactcc 181 acctcaatca cactactccc catatctaac aacgtaaaaa taaaatgaca gtttaacata 241 caaaacccac cccattcctc cccacactca tcgcccttac cacgctactc ctacctatct 301 ccccttttat actaataatg tctgttgtgt cttgttgcgg gcaccgcagt cgccgtgaag 361 atggcgtcta ccagccgttt ggatgctctt ccaagagtca catgtccaaa ccatccagat 421 gcgattttag tggaggacta cagagccggt gatatgatct gtcctgaatg tggcttggtt 481 gtaggtgacc gggttattga tgtgggatct gaatggcgaa ctttcagcaa tgacaaagca 541 acaaaagatc catctcgagt tggagattct cagaatcctc ttctgagtga tggagatttg 601 tctaccatga ttggcaaggg cacaggagct gcaagttttg acgaatttgg caattctaag 661 taccagaatc ggagaacaat gagcagttct gatcgggcaa tgatgaatgc attcaaagaa 721 atcactacca tggcagacag aatcaatcta cctcgaaata tagttgatcg aacaaataat 781 ttattcaagc aagtatatga acagaagagc ctgaagggaa gagctaatga tgctatagct 841 tctgcttgtc tctatattgc ctgtagacaa gaaggggttc ctaggacatt taaagaaata 901 tgtgccgtat cacgaatttc taagaaagaa attggtcggt gttttaaact tattttgaaa 961 gcgctagaaa ccagtgtgga tttgattaca actggggact tcatgtccag gttctgttcc 1021 aacctttgtc ttcctaaaca agtacagatg gcagctacac atatagcccg taaagctgtg 1081 gaattggact tggttcctgg gaggagcccc atctctgtgg cagcggcagc tatttacatg 1141 gcctcacagg catcagctga aaagaggacc caaaaagaaa ttggagatat tgctggtgtt 1201 gctgatgtta caatcagaca gtcctataga ctgatctatc ctcgagcccc agatctgttt 1261 cctacagact tcaaatttga caccccagtg gacaaactac cacagctata aattgaggca 1321 gctaacgtca aattcttgaa tacaaaactt tgcctgttgt acatagccta tacaaaatgc 1381 tgggttgagc ctttcatgag gaaaaacaaa agacatggta cgcattccag ggctgaatac 1441 tattgcttgg cattctgtat gtatatacta gtgaaacata tttaatgatt taaatttctt 1501 atcaaatttc ttttgtagca atctaggaaa ctgtattttg gaagatattt gaaattatgt 1561 aattcttgaa taaaacattt ttcaaaacgg aatt // LOCUS HSTFE35 3266 bp RNA PRI 15-NOV-1997 DEFINITION H.sapiens mRNA for transcription factor TFE3. ACCESSION X96717 NID g2612789 KEYWORDS transcription factor TFE3. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3266) AUTHORS Sidhar,S.K. TITLE Direct Submission JOURNAL Submitted (19-MAR-1996) S.K. Sidhar, Institute of Cancer Research, Molecular Carcinogenesis, 15 Cotswold Road, Belmont, Sutton, Surrey SM2 5NG, UK REMARK revised by [7] REFERENCE 2 (bases 1 to 3266) AUTHORS Beckmann,H., Su,L.K. and Kadesch,T. TITLE TFE3: a helix-loop-helix protein that activates transcription through the immunoglobulin enhancer muE3 motif JOURNAL Genes Dev. 4 (2), 167-179 (1990) MEDLINE 90249724 REFERENCE 3 (bases 1 to 3266) AUTHORS Sidhar,S.K., Clark,J., Gill,S., Hamoudi,R., Crew,A.J., Gwilliam,R., Ross,M., Linehan,W.M., Birdsall,S., Shipley,J. and Cooper,C.S. TITLE The t(X;1)(p11.2;q21.2) translocation in papillary renal cell carcinoma fuses a novel gene PRCC to the TFE3 transcription factor gene JOURNAL Hum. Mol. Genet. 5 (9), 1333-1338 (1996) MEDLINE 97026295 REFERENCE 4 (bases 1 to 3266) AUTHORS Clark,J., Lu,Y.J., Sidhar,S.K., Parker,C., Gill,S., Smedley,D., Hamoudi,R., Linehan,W.M., Shipley,J. and Cooper,C.S. TITLE Fusion of splicing factor genes PSF and NonO (p54nrb) to the TFE3 gene in papillary renal cell carcinoma JOURNAL Oncogene 15 (18), 2233-2239 (1997) MEDLINE 98054131 REFERENCE 5 (bases 1 to 3266) AUTHORS Clark,J. TITLE Direct Submission JOURNAL Submitted (07-NOV-1997) J. Clark, Institute of Cancer Research, Molecular Carcinogenesis, 15 Cotswold Road, Belmont, Sutton, Surrey SM2 5NG, UK FEATURES Location/Qualifiers source 1..3266 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="A2243 synovial sarcoma" /dev_stage="adult" CDS 235..1962 /codon_start=1 /product="transcription factor TFE3" /db_xref="PID:e1172965" /db_xref="PID:g2612790" /translation="MSHAAEPARDGVEASAEGPRAVFVLLEERRPADSAQLLSLNSLL PESGIVADIELENVLDPDSFYELKSQPLPLRSSLPISLQATPATPATLSASSSAGGSR TPAMSSSSSSRVLLRQQLMRAQAQEQERRERREQAAAAPFPSPAPASPAISVVGVSAG GHTLSRPPPAQVPREVLKVQTHLENPTRYHLQQARRQQVKQYLSTTLGPKLASQALTP PPGPASAQPLPAPEAAHTTGPTGSAPNSPMALLTIGSSSEKEIDDVIDEIISLESSYN DEMLSYLPGGTTGLQLPSTLPVSGNLLDVYSSQGVATPAITVSNSCPAELPNIKREIS ETEAKALLKERQKKDNHNLIERRRRFNINDRIKELGTLIPKSSDPEMRWNKGTILKAS VDYIRKLQKEQQRSKDLESRQRSLEQANRSLQLRIQELELQAQIHGLPVPPTPGLLSL ATTSASDSLKPEQLDIEEEGRPGAATFHVGGGPAQNAPHQQPPAPPSDALLDLHFPSD HLGDLGDPFHLGLEDILMEEEEGVVGGLSGGALSPLRAASDPLLSSVSPAVSKASSRR SSFSMEEES" conflict 866 /citation=[2] /replace="g" conflict 898..900 /citation=[2] /replace="aaa" conflict 1561..1563 /citation=[2] /replace="ggg" conflict 1596..1597 /citation=[2] /replace="aa" conflict 1657..1658 /citation=[2] /replace="cg" conflict 1900..1902 /citation=[2] /replace="tc" conflict 1982..1984 /citation=[2] /replace="tg" conflict 2054..2060 /citation=[2] /replace="ac" conflict 2072..2077 /citation=[2] /replace="cg" conflict 3145..3146 /citation=[2] /replace="cg" BASE COUNT 689 a 1035 c 928 g 614 t ORIGIN 1 ctaaagggcg gtcgtccggg gttaggttga gggggggcgt cggtccgttc tgggcggggg 61 atgactcaca gcccatccca tctccccgac gccgcccgcc cgcgcagtgc tagctccatg 121 gcttagcgga ggaggcggca gtggcgagct ggggggaggg gggactctta ttttgttagg 181 gggaccgggc cgaggcccga ccggcctggc agggctcgcc cggggccggg cgtcatgtct 241 catgcggccg aaccagctcg ggatggcgta gaggccagcg cggagggccc tcgagccgtg 301 ttcgtgctgt tggaggagcg caggccggcc gactcggctc agctgctcag cctgaactct 361 ttgcttccgg aatccgggat tgttgctgac atagaattag aaaacgtcct tgatcctgac 421 agcttctacg agctcaaaag ccaaccctta ccccttcgct caagcctccc aatatcactg 481 caggccacac cagccacccc agctacactc tctgcatcgt cttctgcagg gggctccagg 541 acccctgcca tgtcgtcatc ttcttcatcg agggtcttgc tgcggcagca gctaatgcgg 601 gcccaggcgc aggagcagga gaggcgtgag cgtcgggaac aggccgccgc ggctcccttc 661 cccagtcctg cacctgcctc tcctgccatc tctgtggttg gcgtctctgc tgggggccac 721 acattgagcc gtccaccccc tgctcaggtg cccagggagg tgctcaaggt gcagacccat 781 ctggagaacc caacgcgcta ccacctgcag caggcgcgcc ggcagcaggt gaaacagtac 841 ctgtccacca cactcgggcc caagctggct tcccaggccc tcaccccacc gccggggccc 901 gcaagtgccc agccactgcc tgcccctgag gctgcccaca ctaccggccc cacaggcagt 961 gcgcccaaca gccccatggc gctgctcacc atcgggtcca gctcagagaa ggagattgat 1021 gatgtcattg atgagatcat cagcctggag tccagttaca atgatgaaat gctcagctat 1081 ctgcccggag gcaccacagg actgcagctc cccagcacgc tgcctgtgtc agggaatctg 1141 cttgatgtgt acagtagtca aggcgtggcc acaccagcca tcactgtcag caactcctgc 1201 ccagctgagc tgcccaacat caaacgggag atctctgaga ccgaggcaaa ggcccttttg 1261 aaggaacggc agaagaaaga caatcacaac ctaattgagc gtcgcaggcg attcaacatt 1321 aacgacagga tcaaggaact gggcactctc atccctaagt ccagtgaccc ggagatgcgc 1381 tggaacaagg gcaccatcct gaaggcctct gtggattata tccgcaagct gcagaaggag 1441 cagcagcgct ccaaagacct ggagagccgg cagcgatccc tggagcaggc caaccgcagc 1501 ctgcagctcc gaattcagga actagaactg caggcccaga tccatggcct gccagtacct 1561 cccactccag ggctgctttc cttggccacg acttcggctt ctgacagcct caagccagag 1621 cagctggaca ttgaggagga gggcaggcca ggcgcagcaa cgttccatgt agggggggga 1681 cctgcccaga atgctcccca tcagcagccc cctgcaccgc cctcagatgc ccttctggac 1741 ctgcactttc ccagcgacca cctgggggac ctgggagacc ccttccacct ggggctggag 1801 gacattctga tggaggagga ggagggggtg gtgggaggac tgtcgggggg tgccctgtcc 1861 ccactgcggg ctgcctccga tcccctgctc tcttcagtgt cccctgctgt ctccaaggcc 1921 agcagccgcc gcagcagctt cagcatggaa gaggagtcct gatcaggcct cacccctccc 1981 ctgggacttt cccacccagg aaaggaggac cagtcaggat gaggccccgc cttttccccc 2041 accctcccat gagactgccc tgcccaggta tcctggggga agaggagatg tgatcaggcc 2101 ccacccctgt aatcaggcaa ggaggaggag tcagatgagg ccctgcacct tccccaaagg 2161 aaccgcccag tgcaggtatt tcagaaggag aaggctggag aaggacatga gatcagggcc 2221 tgccccctgg ggatcacagc ctcacccctg cccctgtggg actcatcctt gcccaggtga 2281 gggaaggaga caggatgagg tctcgaccct gtcccctagg gactgtccta gccaggtctc 2341 ctgggaaagg gagatgtcag gatgttgctc catcctttgt cttggaacca ccagtctagt 2401 ccgtcctggc acagaagagg agtcaagtaa tggaggtccc agccctgggg gtttaagctc 2461 tgccccttcc ccatgaaccc tgccctgctc tgcccaggca aggaacagaa gtgaggatga 2521 gacccagccc cttcccctgg gaactctcct ggcttctagg aatggaggag cccaggcccc 2581 acccccttcc ctataggaac agcccagcac aggtatttca ggtgtgaaag aatcagtagg 2641 accaggccac cgctaagtgc ttgtggagat cacagcccca cccttgtccc tcagcaacat 2701 cccatctaag cattccacac tgcagggagg agtggtactt aagctcccct gccttaacct 2761 gggaccaacc tgacctaacc taggagggct ctgagccaac cttgctcttg gggaagggga 2821 cagattatga aatttcatgg atgaattttc catacctata tctggagtga gaggcccccc 2881 accccttggg cagagtcctg ccttcttcct tgaggggcag tttgggaagg tgatgggtat 2941 tagtggggga ctgagttcag gttaccagaa ccagtacctc agtattcttt ttcaacatgt 3001 agggcaagag gatgaaggaa ggggctatcc tggacctccc cagcccagga aaaactggaa 3061 gccttccccc agcaaggcag aagcttggag gagggttgta aaagcatatt gtaccccctc 3121 atttgtttat ctgatttttt tattgctccg catactgaga atctaggcca ccccaacctc 3181 tgttccccac ccagttcttc atttggagga atcaccccat ttcagagtta tcaagagaca 3241 ctcccccctc cattcccacc cctcat // LOCUS HSTFIIEA 2969 bp RNA PRI 11-JUN-1992 DEFINITION H.sapiens mRNA for transcription factor TFIIE alpha. ACCESSION X63468 NID g37067 KEYWORDS TFIIA gene; transcription factor TFIIA. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2969) AUTHORS Ohkuma,Y. TITLE Direct Submission JOURNAL Submitted (15-NOV-1991) Y. Ohkuma, The Rockefeller University, Dept of Biochemistry and, Molecular Biology, 1230 York Avenue, New York NY 10021, USA REFERENCE 2 (bases 1 to 2969) AUTHORS Ohkuma,Y., Sumimoto,H., Hoffmann,A., Shimasaki,S., Horikoshi,M. and Roeder,R.G. TITLE Structural motifs and potential sigma homologies in the large subunit of human general transcription factor TFIIE JOURNAL Nature 354 (6352), 398-401 (1991) MEDLINE 92065982 FEATURES Location/Qualifiers source 1..2969 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 55..1374 /codon_start=1 /product="TFIIE-alpha" /db_xref="PID:g37068" /db_xref="SWISS-PROT:P29083" /translation="MADPDVLTEVPAALKRLAKYVIRGFYGIEHALALDILIRNSCVK EEDMLELLKFDRKQLRSVLNNLKGDKFIKCRMRVETAADGKTTRHNYYFINYRTLVNV VKYKLDHMRRRIETDERDSTNRASFKCPVCSSTFTDLEANQLFDPMTGTFRCTFCHTE VEEDESAMPKKDARTLLARFNEQIEPIYALLRETEDVNLAYEILEPEPTEIPALKQSK DHAATTAGAASLAGGHHREAWATKGPSYEDLYTQNVVINMDDQEDLHRASLEGKSAKE RPIWLRESTVQGAYGSEDMKEGGIDMDAFQEREEGHAGPDDNEEVMRALLIHEKKTSS AMAGSVGAAAPVTAANGDDSESETSESDDDSPPRPAAVAVHKREEDEEEDDEFEEVAD DPIVMVAGRPFSYSEVSQRPELVAQMTPEEKEAYIAMGQRMFEDLFE" polyA_signal 2778..2783 BASE COUNT 833 a 596 c 674 g 866 t ORIGIN 1 ctaaattacc cactacgttg cttgtatatt taaagttgga gttcgttgct aaagatggca 61 gacccagatg tcctcactga agttccagca gcattgaagc ggttagccaa gtatgtgatc 121 cggggatttt atggcattga gcatgccttg gccttggaca tcttgatcag gaactcctgt 181 gtgaaagagg aggatatgct ggagctgctc aagtttgatc ggaagcaact tcgatcagtt 241 ttgaataatt taaagggaga caagtttatc aaatgcagaa tgagggtaga gactgctgca 301 gacgggaaaa ccactcgcca taactactac ttcatcaatt atcgtactct tgttaatgtg 361 gtaaaatata aactggacca catgagaaga agaattgaga ccgatgagag agattcgacc 421 aaccgggctt ccttcaaatg tcctgtctgt agtagtactt tcacagactt agaagctaat 481 cagctctttg atcctatgac aggaactttc cgctgtactt tttgccatac agaggtagaa 541 gaggatgaat cagcaatgcc caaaaaagat gcacgcacac ttttggcaag gtttaatgaa 601 caaattgagc ccatttatgc attgcttcgg gagacagagg atgtgaactt ggcctatgaa 661 atacttgagc cagaacccac agaaatccca gccctgaaac agagcaagga ccatgcagca 721 actactgctg gagctgctag cctagcaggt gggcaccacc gggaagcatg ggccaccaaa 781 ggtccttcct atgaagactt atacactcag aatgttgtca ttaacatgga tgaccaagaa 841 gatcttcatc gagcctcact ggaagggaaa tctgccaaag agaggcctat ttggttgaga 901 gaaagcactg tccaaggggc atatggttct gaagatatga aagaaggggg catagatatg 961 gacgcatttc aggagcgtga ggaaggccat gctgggcctg atgacaacga agaggtcatg 1021 cgagcactgc tcattcacga gaaaaagact tcctctgcca tggctggttc agtgggggca 1081 gctgctccag tgaccgctgc caatggcgat gactcagaaa gcgagaccag tgagtcagat 1141 gatgattctc caccccgtcc ggcagctgtg gctgtgcata aacgagaaga ggatgaagag 1201 gaagatgacg agtttgaaga agtagcagat gaccccattg tcatggtggc tggccgtccg 1261 ttctcctaca gtgaagtgag ccaacggcca gagctagtgg cccagatgac accagaagaa 1321 aaggaagcat atatagcaat gggacaacgc atgtttgagg acctctttga gtgagctttc 1381 cctaattctt tctcctttct ctaatgctca gttcaaaaag gaatgtctca tctttgaaga 1441 aaagtattta agtggctttc tgcccctctt gatgtaagca actgtccatc cttgtgcaaa 1501 gattgatggt agagagcttg acttttatgc cagaaacttt cccagcaagg tagggtgctg 1561 agaatcctac ccttccttgc tgtcactaca gtattaatat tttactgtat tttcttttct 1621 tttttttttt tttttggaga tgaagtctca ctcttgtacc ccaggctgga gtgcaatggc 1681 gtgatctcgg ctcactgcaa cctctgcctc ctgggttcaa gcgattctcc tgcctcagcc 1741 tcccgagtag ctgggattac aggtgcctgc caccatgcct ggctaatttt tgtattttta 1801 gtagaggcag ggtttcacca tgttagccag gatgatctcg atctcctgac ctcatgatcc 1861 acccgcctcg gcctcccaaa gtgctgtatt ttcttatctg atttttttct tgccttatta 1921 agacataatt ttctcccttc tgaaatgagt gagggaagtt cataaggtaa atccttccca 1981 tccatctgtt tactacaata ggttacaata attcactgat cacatccatt ttatctgttc 2041 tagccaggca ttccaaacaa tttcttatac tgctgcccac caaagcagct tgccaacagt 2101 caaatcactg attgggggaa aaaatcctga aattttgctt agaatttgag catttcctca 2161 aaattgagat ggatcaatat gtaaggggag gtgggagcgt gtgtggaagg gggagagata 2221 tacttgagtc ttatgattaa tgtctaaacc agaatttgtg tctttagaac tgaccagact 2281 ggtagatttt attgtattgc ttaatgtctt ttggtttgga tttaggatga tagaaaacag 2341 aagtataatt ggtaaaccct taggaagaaa ttagaaaaac atggacgtaa gacaaaaagt 2401 ctctgtgaag ggttgaagag tgacaagcat tggtaacagt gccttagaac tgtgtcagtt 2461 agtctgattt ggaaatcctt tatgtaaagc tgagactggt cctggttttg ttccctttgg 2521 tacagacctc ttgtcagtgc tataaattgt ttaatgaggc cattccagca gaaatcaaca 2581 gaataattga ttactcttct ctctctctgt cactctccct ctttctaaac atcattgaag 2641 gctgtctctc tttaattttg tcagacacag tattttaggg tgcatccagt ataccattga 2701 gcattgtaac ctcaggaaac agtttatttt gggttctgat atgtagcatg gtattttccc 2761 taaggcagaa ctttaaaaat aaagaacttt cacacaaggg tctgtaacaa ttgtatatct 2821 tacaatattt ttccttgcat tgtaattttt aagtatttat cattttatag tacacatgta 2881 aagaatatat gagccttgta tggagtgatg tttcatttac ctgggttgtg ttaatgactg 2941 aatgttgaca ataaatctgt tttatactg // LOCUS HSTFIIEB 1515 bp RNA PRI 11-JUN-1992 DEFINITION H.sapiens mRNA for transcription factor TFIIE beta. ACCESSION X63469 NID g37069 KEYWORDS TFIIB gene; transcription factor TFIIB. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1515) AUTHORS Ohkuma,Y. TITLE Direct Submission JOURNAL Submitted (15-NOV-1991) Y. Ohkuma, The Rockefeller University, Dept of Biochemistry and, Molecular Biology, 1230 York Avenue, New York NY 10021, USA REFERENCE 2 (bases 1 to 1515) AUTHORS Sumimoto,H., Ohkuma,Y., Sinn,E., Kato,H., Shimasaki,S., Horikoshi,M. and Roeder,R.G. TITLE Conserved sequence motifs in the small subunit of human general transcription factor TFIIE JOURNAL Nature 354 (6352), 401-404 (1991) MEDLINE 92065983 FEATURES Location/Qualifiers source 1..1515 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 243..1118 /codon_start=1 /product="TFIIE-beta" /db_xref="PID:g37070" /db_xref="SWISS-PROT:P29084" /translation="MDPSLLRERELFKKRALSTPVVEKRSASSESSSSSSKKKKTKVE HGGSSGSKQNSDHSNGSFNLKALSGSSGYKFGVLAKIVNYMKTRHQRGDTHPLTLDEI LDETQHLDIGLKQKQWLMTEALVNNPKIEVIDGKYAFKPKYNVRDKKALLRLLDQHDQ RGLGGILLEDIEEALPNSQKAVKALGDQILFVNRPDKKKILFFNDKSCQFSVDEEFQK LWRSVTVDSMDEEKIEEYLKRQGISSMQESGPKKVAPIQRRKKPASQKKRRFKTHNEH LAGVLKDYSDITSSK" polyA_signal 1496..1501 BASE COUNT 436 a 316 c 386 g 377 t ORIGIN 1 cttaaattac ccactacgtt gtccagtcgc cgcctcagct accgccgctg ccgccgccgc 61 cgccgccacc gccagtggtg agaccccgac ctggcgggtc agcgctgggc gtgcgtgcgg 121 gcaggcgggg gcgctgacga gaagcaggaa gagggtgcag tgccggcgtg ggcggccggc 181 cgaggcggag gcgcaggaag ggggcggcga gtcgtgcgag gctgcccttc tcactcagca 241 ttatggatcc aagcctgttg agagaaaggg agctgttcaa aaaacgagct ctttctactc 301 ctgtagtaga aaaacgttca gcatcttctg agtcatcatc atcatcgtca aagaagaaga 361 aaacaaaggt agaacatgga ggatcgtcag gctctaaaca aaattctgat catagcaatg 421 gatcatttaa cttgaaagct ttgtcaggaa gctctggata taagtttggt gttcttgcta 481 agattgtgaa ttacatgaag acacggcatc agcgaggaga tacgcatcct ctaaccttag 541 atgaaatttt ggatgaaaca caacatttag atattggact caagcagaaa caatggctaa 601 tgactgaggc tttagtcaac aatcccaaaa ttgaagtaat agatgggaag tatgctttca 661 agcccaagta caacgtgaga gataagaagg ccctacttag gctcttagat cagcatgacc 721 agcgaggatt aggaggaatt cttttagaag acatagaaga agcactgccc aattcccaga 781 aagctgtcaa ggctttgggg gaccagatac tatttgtaaa tcgtcccgat aagaagaaaa 841 tacttttctt caatgataag agctgtcagt tttctgtgga tgaagaattt cagaaactgt 901 ggaggagtgt cactgtagat tccatggacg aggagaaaat tgaagaatat ctgaagcgac 961 agggtatttc ttccatgcag gaatctggac caaagaaagt ggcccctatt cagagaagga 1021 aaaagcctgc ttcacagaaa aagcgacgct ttaagactca taacgaacac ttggctggag 1081 tgctgaagga ttactctgac attacttcca gcaaataggg aacagttttg ccctggaaca 1141 gagttacaga tacacaatca agagtgttct tgctgatgct cggggtctga agactgtctt 1201 cctatctgct tcttgcggct gaggagagga gcagttcagt ttacaaaaca agtgcaaatt 1261 accaaactca aagcttattt gagtagaatg ggctcatggg caatgtgatg ttccctgtta 1321 accttctgtt actccctggg agaaaggcgc tgagcgtggc atgcaggtgt ctttgctgtg 1381 tttttctcca cttctaaatg gttcctggtt cctttcttcc tcgtttgtta ctttagagca 1441 agtttgccca tagtcttgaa tgcaatattt gtttattcca aaagaacata tttataataa 1501 aatcactgta gaagg // LOCUS HSTFIIH 1634 bp RNA PRI 07-MAR-1997 DEFINITION H.sapiens mRNA for 52 kD subunit of transcription factor TFIIH. ACCESSION Y07595 NID g1514596 KEYWORDS p52 gene; transcription factor TFIIH. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1634) AUTHORS Marinoni,J.C., Roy,R., Vermeulen,W., Miniou,P., Lutz,Y., Weeda,G., Seroz,T., Gomez,D.M., Hoeijmakers,J.H. and Egly,J.M. TITLE Cloning and characterization of p52, the fifth subunit of the core of the transcription/DNA repair factor TFIIH JOURNAL EMBO J. 16 (5), 1093-1102 (1997) MEDLINE 97224135 REFERENCE 2 (bases 1 to 1634) AUTHORS Marinoni,J. TITLE Direct Submission JOURNAL Submitted (20-AUG-1996) J. Marinoni, IGBMC, Dr Eglys Group, 1, Rue Laurent Fries, Parc Dinnovation, 67404 Illkirch Cedex, FRANCE REMARK revised by [3] REFERENCE 3 (bases 1 to 1634) AUTHORS Marinoni,J. TITLE Direct Submission JOURNAL Submitted (28-AUG-1996) J. Marinoni, IGBMC, Dr Eglys Group, 1, Rue Laurent Fries, Parc Dinnovation, 67404 Illkirch Cedex, FRANCE FEATURES Location/Qualifiers source 1..1634 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="preB cell" /map="6p21.3" gene 128..1516 /gene="p52" CDS 128..1516 /gene="p52" /note="52 kDa subunit" /codon_start=1 /product="transcription factor TFIIH" /db_xref="PID:e264343" /db_xref="PID:g1514597" /translation="MESTPSRGLNRVHLQCRNLQEFLGGLSPGVLDRLYGHPATCLAV FRELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLP GGLQGLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVL HFMVGSPSAAVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQ YLQTAQSRGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKR KSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYTESELQIALIALFSEM LYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIR LWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSD VKRFWKRQKHSS" BASE COUNT 324 a 459 c 477 g 374 t ORIGIN 1 tcccatcttt tcctcgcatt ttttcaccat ctttccctca atctccagga gccaatgcga 61 gacttttggc tccgattaag cgacggcccg agacttgggg tgcgcgagga ggatcgacag 121 agtggtgatg gagagcaccc cttcaagggg actgaaccga gtacacctac aatgcaggaa 181 tctgcaggaa ttcttagggg gcctgagccc tggggtattg gaccgattgt atgggcaccc 241 tgccacatgt ctggctgtct tcagggagct cccatccttg gctaagaact gggtgatgcg 301 gatgctcttt ctggagcagc ctttgccaca ggctgctgta gctctgtggg taaagaagga 361 attcagcaag gctcaggagg aaagtacagg gctgctgagc ggcctccgga tctggcacac 421 ccagctgctc ccaggcgggc tccagggcct catcctcaac cccattttcc gccagaacct 481 ccgcattgcc cttctgggtg gggggaaggc ctggtctgat gacacaagtc agctgggacc 541 agacaagcat gcccgggacg ttccctccct tgacaagtac gccgaggagc gatgggaggt 601 ggtcttgcac ttcatggtgg gctcccccag tgcagctgtc agccaggact tggctcagct 661 cctcagccag gctgggctca tgaagagtac tgaacctgga gagccgccct gcattacttc 721 cgctggcttc cagttcctgt tgctggacac cccggctcag ctctggtact ttatgttgca 781 gtatttgcag acagcccaga gccggggcat ggacctggta gagattctct ccttcctctt 841 ccagctcagc ttctctactc tgggcaagga ttactctgtg gaaggtatga gtgattctct 901 gttgaacttc ctgcaacatc tgcgtgagtt tgggcttgtt ttccagagga agaggaaatc 961 tcggcgttac taccccacac gcctggccat caatctctca tcaggtgtct ctggagctgg 1021 gggcactgtg catcagccag gtttcattgt cgtggaaacc aattaccgac tgtatgccta 1081 cacggagtcg gagctgcaga ttgccctcat tgccctcttc tctgagatgc tctatcggtt 1141 ccccaacatg gtggtggcgc aggtgacccg ggagagtgtg cagcaggcaa tcgccagtgg 1201 catcacagcc cagcagataa tccatttcct aaggacaaga gcccacccag tgatgctcaa 1261 acagacacct gtgctgcccc ccaccatcac cgaccagatc cggctctggg agctggaaag 1321 ggacagactc cggttcactg agggtgtcct gtataaccag ttcctgtcgc aagtggactt 1381 tgagctgctg ctggcccacg cgcgggagct gggcgtgctc gtgttcgaga actcggccaa 1441 gcggctcatg gtggtgaccc cggccgggca cagcgacgtc aagcgctttt ggaagcggca 1501 gaaacatagc tcctgagagc gcgggacttg gacacggacc tcggcgggcg ggactgggcg 1561 gggcggggca tcagaactca ggtgtttttt atttacgcgt cagggctttt cttgtttaat 1621 aagttatgat agct // LOCUS HSTFIIS 2517 bp RNA PRI 25-JUL-1994 DEFINITION Human TFIIS mRNA for transcription elongation factor. ACCESSION X57198 NID g37071 KEYWORDS TFIIS gene; transcription elongation factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2517) AUTHORS Agarwal,K. TITLE Direct Submission JOURNAL Submitted (12-DEC-1990) K. Agarwal, THE UNIVERSITY OF CHICAGO, 920 EAST 58TH STREET, DEPT OF BIOCHEMISTRY & MOL BIOLOGY, CHICAGO IL 60637, USA REFERENCE 2 (bases 1 to 2517) AUTHORS Yoo,O.J., Yoon,H.S., Baek,K.H., Jeon,C.J., Miyamoto,K., Ueno,A. and Agarwal,K. TITLE Cloning, expression and characterization of the human transcription elongation factor, TFIIS JOURNAL Nucleic Acids Res. 19 (5), 1073-1079 (1991) MEDLINE 91212187 REFERENCE 3 (bases 1 to 2517) AUTHORS Yeh,C.H. and Shatkin,A.J. TITLE A HeLa-cell-encoded p21 is homologous to transcription elongation factor SII JOURNAL Gene 143 (2), 285-287 (1994) MEDLINE 94266168 COMMENT Data kindly reviewed (05-APR-1991) by Agarwal K. FEATURES Location/Qualifiers source 1..2517 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" /clone="pHIIS 44" gene 126..968 /gene="TFIIS" CDS 126..968 /gene="TFIIS" /codon_start=1 /product="transcription elongation factor" /db_xref="PID:g37072" /db_xref="SWISS-PROT:P23193" /translation="MEDEVVRFAKKMDKMVQKKNASTRIGMSVNAIRKQSTDEEVTSL AKSLIKSWKKLLDGPSTEKDLDEKKKEPAITSQNSPEAREESTSSGNVSNRKDETNAR DTYVSSFPRAPSTSDSVRLKCREMLAAALRTGDDYIAIGADEEELGSQIEEAIYQEIR NTDMKYKNRVRSRISNLKDAKNPNLRKNVLCGNIPPDLFARMTAEEMASDELKEMRKN LTKEAIREHQMAKTGGTQTDLFTCGKCKKKNCTYTQVQTRSADEPMTTFVVCNECGNR WKFC" polyA_signal 2486..2491 BASE COUNT 789 a 423 c 499 g 806 t ORIGIN 1 gcctagcccg gggtgcggtg gtgggggttc gtgcgcgccg ggggtcgctc ctgctgtgtc 61 ttccgctcca cgttcgccca cttccccttg ccagcggggt gggcgcggag aagactgccg 121 gagccatgga ggacgaagtg gtccgctttg ccaagaagat ggacaagatg gtgcagaaga 181 agaacgcgtc cacaagaatc ggaatgtcag ttaatgctat tcgcaagcag agtacagatg 241 aggaagttac atctttggca aagtctctca tcaaatcctg gaaaaaatta ttagatgggc 301 catcaactga gaaagacctt gacgaaaaga agaaagaacc tgcaattaca tcgcagaaca 361 gccctgaggc aagagaagaa agtacttcca gcggcaatgt aagcaacaga aaggatgaga 421 caaatgctcg agatacttat gtttcatcct ttcctcgggc accaagcact tctgattctg 481 tgcggttgaa gtgtagggag atgcttgctg cagctcttcg aacaggggat gactacattg 541 caattggagc tgatgaggaa gaattaggat ctcaaattga agaagctata tatcaagaaa 601 taaggaatac agacatgaaa tacaaaaata gagtacgaag taggatatca aatcttaaag 661 atgcaaaaaa tccaaattta aggaaaaatg tcctctgtgg gaatattcct cctgacttat 721 ttgctagaat gacagcagag gaaatggcta gtgatgagct gaaagagatg cggaaaaact 781 tgaccaaaga agccatcaga gagcatcaga tggccaagac tggtgggacc cagactgact 841 tgttcacatg tggcaaatgt aaaaagaaga attgcactta cacacaggta caaacccgta 901 gtgctgatga accaatgaca acatttgttg tctgtaatga atgtggaaat cgatggaagt 961 tctgttgagt tggaagaatt ggcaaaatat ctggaccatt aagaaaacgg attttgtaac 1021 tagctttaaa ctaggccaag caactagttt tcctgcaaat caaattttta aagcaacttg 1081 ggttagactt tgtttttgac ctaacatccc ttccttaaat gccttctgta gtttcagatc 1141 agtagggaga ccatataaat attttatggt acctgtttca aaacatattt tttctgtttt 1201 tataagtaag ttgatattaa ttaaactctt ggcaatattt cttctttctt aaaggaaaat 1261 ataccttaac tttttttctt ttacactgtg aaacatacac agtagaaatt ctgttactct 1321 ctgttattaa tacataaatg aaaatacatt tttttccata ttggcatgta gctacaaata 1381 ttaaaggagg agaaaaggta atataatttt aggtttacca aatatggtgt gtattcaaat 1441 aatacttgac cagcttatct aaaatgtaca taattttgag gtagcttatg aatttgattt 1501 taattattat gttcacaagc ttggaatatt agatattatt ttgcatctgt aactaaccgt 1561 gatcatcatt tcttgtaatt tcttgtacat gtatattact tgttcttaat agatttttgg 1621 aaacaagact ttattgagat cagtttggtt ttcctgttaa tttacctgtt tgactttata 1681 atgtgtttta gttttgcaga agaacactgt tgtagtttag aaggcttttc ataaatcccc 1741 tcataggcaa agatgaaaac ttcccactat ttttttcccc tcttaggaag acatactgga 1801 aagaaaatgt ttagcatctt agtgtagtat agctattgta aacagttcat gactagattt 1861 tgattcggaa atctatactg accaaggatt aatcttaagg attgtataat tcattaaagc 1921 tgtggtcttt ccatgtggag actgatagaa aataattttg tcccaagtct tatttgctga 1981 ctttttctgt catgagtgag attgttgaac aaactgaata tatgggctat agcaagtagc 2041 tttacagtac agatcttaca attaagtttt gcttttgtta aagtgtgtac cattttttct 2101 gtttggagta agacaaaaat tgttttgaca taggttccct agggtacact tgctctagca 2161 tactttaaag gccactgttg caaagtctac attttatgct gaatctgcat tctgtcaggc 2221 acccgtagaa agacctcagt acatgctttg cactctcctt tgctcccttt ttccaatttc 2281 ttattgcata tcatttgttg taatacagaa agcagcattt ttaaatgtcc gtgttaagaa 2341 ttggcccact ggtaccaact cacctctatt ttgtcagttc atagttgaag atttgtttta 2401 tttcaaaaac aaagtacatt tttgaaataa tgtttcagaa taaaataatc tcacttttaa 2461 gtgatccttt taaaatttgt aattcaataa agtttttttt gttgttaaca taaaacg // LOCUS HSTGFAA 4119 bp RNA PRI 07-DEC-1993 DEFINITION H.sapiens mRNA for transforming growth factor alpha. ACCESSION X70340 NID g37089 KEYWORDS transforming growth factor alpha. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4119) AUTHORS Qian,J.F., Feingold,J., Stoll,C. and May,E. TITLE Transforming growth factor-alpha: characterization of the BamHI, RsaI, and TaqI polymorphic regions JOURNAL Am. J. Hum. Genet. 53 (1), 168-175 (1993) MEDLINE 93304410 REFERENCE 2 (bases 1 to 4119) AUTHORS May,E. TITLE Direct Submission JOURNAL Submitted (04-FEB-1993) E. May, Lab d'Oncologie Moleculaire - IRSC-CNRS, 7 rue Guy Moquet, 94801 Villejuif Cedex, FRANCE REFERENCE 3 (bases 1 to 4119) AUTHORS Qian,J.F., Lazar-Wesley,E., Breugnot,C. and May,E. TITLE Human transforming growth factor alpha: sequence analysis of the 4.5-kb and 1.6-kb mRNA species JOURNAL Gene 132 (2), 291-296 (1993) MEDLINE 94040776 FEATURES Location/Qualifiers source 1..4119 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="colon" /cell_type="carcinoma" /cell_line="SW613S c13" /chromosome="2" /map="2p13" gene 32..514 /gene="TGFA" CDS 32..514 /gene="TGFA" /codon_start=1 /product="transforming growth factor alpha" /db_xref="PID:g37090" /db_xref="SWISS-PROT:P01135" /translation="MVPSAGQLALFALGIVLAACQALENSTSPLSADPPVAAAVVSHF NDCPDSHTQFCFHGTCRFLVQEDKPACVCHSGYVGARCEHADLLAVVAASQKKQAITA LVVVSIVALAVLIITCVLIHCCQVRKHCEWCRALICRHEKPSALLKGRTACCHSETVV " mat_peptide 149..298 /gene="TGFA" /product="transforming growth factor alpha" polyA_site 1333..1338 /note="1.6 kb species" misc_feature 2796..2801 /note="BamHI polymorphic site" polyA_site 4069..4074 /note="4.5 kb species" BASE COUNT 1122 a 897 c 916 g 1184 t ORIGIN 1 ctggagagcc tgctgcccgc ccgcccgtaa aatggtcccc tcggctggac agctcgccct 61 gttcgctctg ggtattgtgt tggctgcgtg ccaggccttg gagaacagca cgtccccgct 121 gagtgcagac ccgcccgtgg ctgcagcagt ggtgtcccat tttaatgact gcccagattc 181 ccacactcag ttctgcttcc atggaacctg caggtttttg gtgcaggagg acaagccagc 241 atgtgtctgc cattctgggt acgttggtgc acgctgtgag catgcggacc tcctggccgt 301 ggtggctgcc agccagaaga agcaggccat caccgccttg gtggtggtct ccatcgtggc 361 cctggctgtc cttatcatca catgtgtgct gatacactgc tgccaggtcc gaaaacactg 421 tgagtggtgc cgggccctca tctgccggca cgagaagccc agcgccctcc tgaagggaag 481 aaccgcttgc tgccactcag aaacagtggt ctgaagagcc cagaggagga gtttggccag 541 gtggactgtg gcagatcaat aaagaaaggc ttcttcagga cagcactgcc agagatgcct 601 gggtgtgcca cagaccttcc tacttggcct gtaatcacct gtgcagcctt ttgtgggcct 661 tcaaaactct gtcaagaact ccgtctgctt ggggttattc agtgtgacct agagaagaaa 721 tcagcggacc acgatttcaa gacttgttaa aaaagaactg caaagagacg gactcctgtt 781 cacctaggtg aggtgtgtgc agcagttggt gtctgagtcc acatgtgtgc agttgtcttc 841 tgccagccat ggattccagg ctatatattt ctttttaatg ggccacctcc ccacaacaga 901 attctgccca acacaggaga tttctatagt tattgttttc tgtcatttgc ctactgggga 961 agaaagtgaa ggaggggaaa ctgtttaata tcacatgaag accctagctt taagagaagc 1021 tgtatcctct aaccacgaga ctctcaacca gcccaacatc ttccatggac acatgacatt 1081 gaagaccatc ccaagctatc gccacccttg gagatgatgt cttatttatt agatggataa 1141 tggttttatt tttaatctct taagtcaatg taaaaagtat aaaacccctt cagacttcta 1201 cattaatgat gtatgtgttg ctgactgaaa agctatactg attagaaatg tctggcctct 1261 tcaagacagc taaggcttgg gaaaagtctt ccagggtgcg gagatggaac cagaggctgg 1321 gttactggta ggaataaagg taggggttca gaaatggtgc cattgaagcc acaaagccgg 1381 taaatgcctc aatacgttct gggagaaaac ttagcaaatc catcagcagg gatctgtccc 1441 ctctgttggg gagagaggaa gagtgtgtgt gtctacacag gataaaccca atacatattg 1501 tactgctcag tgattaaatg ggttcacttc ctcgtgagcc ctcggtaagt atgtttagaa 1561 atagaacatt agccacgagc cataggcatt tcaggccaaa tccatgaaag ggggaccagt 1621 catttatttt ccattttgtt gcttggttgg tttgttgctt tatttttaaa aggagaagtt 1681 taactttgct atttattttc gagcactagg aaaactattc cagtaatttt tttttcctca 1741 tttccattca ggatgccggc tttattaaca aaaactctaa caagtcacct ccactatgtg 1801 ggtcttcctt tcccctcaag agaaggagca attgttcccc tgacatctgg gtccatctga 1861 cccatggggc ctgcctgtga gaaacagtgg gtcccttcaa atacatagtg gatagctcat 1921 ccctaggaat tttcattaaa atttggaaac agagtaatga agaaataata tataaactcc 1981 ttatgtgagg aaatgctact aatatctgaa aagtgaaaga tttctatgta ttaactctta 2041 agtgcaccta gcttattaca tcgtgaaagg tacatttaaa atatgttaaa ttggcttgaa 2101 attttcagag aattttgtct tcccctaatt cttcttcctt ggtctggaag aacaatttct 2161 atgaattttc tctttatttt ttttttataa ttcagacaat tctatgaccc gtgtcttcat 2221 ttttggcact cttatttaac aatgccacac ctgaagcact tggatctgtt cagagctgac 2281 cccctagcaa cgtagttgac acagctccag gtttttaaat tactaaaata agttcaagtt 2341 tacatccctt gggccagata tgtgggttga ggcttgactg tagcatcctg cttagagacc 2401 aatcaatgga cactggtttt tagacctcta tcaatcagta gttagcatcc aagagacttt 2461 gcagaggcgt aggaatgagg ctggacagat ggcggaacga gaggttccct gcgaagactt 2521 gagatttagt gtctgtgaat gttctagttc ctaggtccag caagtcacac ctgccagtgc 2581 cctcatcctt atgcctgtaa cacacatgca gtgagaggcc tcacatatac gcctccctag 2641 aagtgccttc caagtcagtc ctttggaaac cagcaggtct gaaaaagagg ctgcatcaat 2701 gcaagcctgg ttggaccatt gtccatgcct caggatagaa cagcctggct tatttgggga 2761 tttttcttct agaaatcaaa tgactgataa gcattggctc cctctgccat ttaatggcaa 2821 tggtagtctt tggttagctg caaaaatact ccatttcaag ttaaaaatgc atcttctaat 2881 ccatctctgc aagctccctg tgtttccttg ccctttagaa aatgaattgt tcactacaat 2941 tagagaatca tttaacatcc tgacctggta agctgccaca cacctggcag tggggagcat 3001 cgctgtttcc aatggctcag gagacaatga aaagccccca tttaaaaaaa taacaaacat 3061 tttttaaaag gcctccaata ctcttatgga gcctggattt ttcccactgc tctacaggct 3121 gtgacttttt ttaagcatcc tgacaggaaa tgttttcttc tacatggaaa gatagacagc 3181 agccaaccct gatctggaag acagggcccc ggctggacac acgtggaacc aagccaggga 3241 tgggctggcc attgtgtccc cgcaggagag atgggcagaa tggccctaga gttcttttcc 3301 ctgagaaagg agaaaaagat gggattgcca ctcacccacc cacactggta agggaggaga 3361 atttgtgctt ctggagcttc tcaagggatt gtgttttgca ggtacagaaa actgcctgtt 3421 atcttcaagc caggttttcg agggcacatg ggtcaccagt tgctttttca gtcaatttgg 3481 ccgggatgga ctaatgaggc tctaacactg ctcaggagac ccctgccctc tagttggttc 3541 tgggctttga tctcttccaa cctgcccagt cacagaagga ggaatgactc aaatgcccaa 3601 aaccaagaac acattgcaga agtaagacaa acatgtatat ttttaaatgt tctaacataa 3661 gacctgttct ctctagccat tgatttacca ggctttctga aagatctagt ggttcacaca 3721 gagagagaga gagtactgaa aaagcaactc ctcttcttag tcttaataat ttactaaaat 3781 ggtcaacttt tcattatctt tattataata aacctgatgc ttttttttag aactccttac 3841 tctgatgtct gtatatgttg cactgaaaag gttaatattt aatgttttaa tttattttgt 3901 gtggtaagtt aattttgatt tctgtaatgt gttaatgtga ttagcagtta ttttccttaa 3961 tatctgaatt atacttaaag agtagtgagc aatataagac gcaattgtgt ttttcagtaa 4021 tgtgcattgt tattgagttg tactgtacct tatttggaag gatgaaggaa tgaacctttt 4081 tttcctaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaa // LOCUS HSTGFB3M 2574 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for transforming growth factor-beta 3 (TGF-beta 3). ACCESSION X14149 NID g37095 KEYWORDS growth factor; transforming growth factor; transforming growth factor-beta 3. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2574) AUTHORS Chen,E.Y. TITLE Direct Submission JOURNAL Submitted (23-MAR-1989) Chen E.Y., Genentech Inc., 460 Pt. San Bruno Blvd., San Francisco, CA 94080, USA REFERENCE 2 (bases 1 to 2574) AUTHORS Derynck,R., Lindquist,P.B., Lee,A., Wen,D., Tamm,J., Graycar,J.L., Rhee,L., Mason,A.J., Miller,D.A., Coffey,R.J., Moses,H.L. and Chen,E.Y. TITLE A new type of transforming growth factor-beta, TGF-beta 3 JOURNAL EMBO J. 7 (12), 3737-3743 (1988) MEDLINE 89091120 COMMENT See for alternative sequence of TGF-beta 3. FEATURES Location/Qualifiers source 1..2574 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta, ovary glioblastoma" /cell_line="A172 glioblastoma" /chromosome="14q24" CDS 254..1492 /note="TGF-beta 3 (AA 1-412)" /codon_start=1 /db_xref="PID:g37096" /db_xref="SWISS-PROT:P10600" /translation="MKMHLQRALVVLALLNFATVSLSLSTCTTLDFGHIKKKRVEAIR GQILSKLRLTSPPEPTVMTHVPYQVLALYNSTRELLEEMHGEREEGCTQENTESEYYA KEIHKFDMIQGLAEHNELAVCPKGITSKVFRFNVSSVEKNRTNLFRAEFRVLRVPNPS SKRNEQRIELFQILRPDEHIAKQRYIGGKNLPTRGTAEWLSFDVTDTVREWLLRRESN LGLEISIHCPCHTFQPNGDILENIHEVMEIKFKGVDNEDDHGRGDLGRLKKQKDHHNP HLILMMIPPHRLDNPGQGGQRKKRALDTNYCFRNLEENCCVRPLYIDFRQDLGWKWVH EPKGYYANFCSGPCPYLRSADTTHSTVLGLYNTLNPEASASPCCVPQDLEPLTILYYV GRTPKVEQLSNMVVKSCKCS" BASE COUNT 629 a 680 c 666 g 599 t ORIGIN 1 cctgtttaga cacatggaca acaatcccag cgctacaagg cacacagtcc gcttcttcgt 61 cctcagggtt gccagcgctt cctggaagtc ctgaagctct cgcagtgcag tgagttcatg 121 caccttcttg ccaagcctca gtctttggga tctggggagg ccgcctggtt ttcctccctc 181 cttctgcacg tctgctgggg tctcttcctc tccaggcctt gccgtccccc tggcctctct 241 tcccagctca cacatgaaga tgcacttgca aagggctctg gtggtcctgg ccctgctgaa 301 ctttgccacg gtcagcctct ctctgtccac ttgcaccacc ttggacttcg gccacatcaa 361 gaagaagagg gtggaagcca ttaggggaca gatcttgagc aagctcaggc tcaccagccc 421 ccctgagcca acggtgatga cccacgtccc ctatcaggtc ctggcccttt acaacagcac 481 ccgggagctg ctggaggaga tgcatgggga gagggaggaa ggctgcaccc aggaaaacac 541 cgagtcggaa tactatgcca aagaaatcca taaattcgac atgatccagg ggctggcgga 601 gcacaacgaa ctggctgtct gccctaaagg aattacctcc aaggttttcc gcttcaatgt 661 gtcctcagtg gagaaaaata gaaccaacct attccgagca gaattccggg tcttgcgggt 721 gcccaacccc agctctaagc ggaatgagca gaggatcgag ctcttccaga tccttcggcc 781 agatgagcac attgccaaac agcgctatat cggtggcaag aatctgccca cacggggcac 841 tgccgagtgg ctgtcctttg atgtcactga cactgtgcgt gagtggctgt tgagaagaga 901 gtccaactta ggtctagaaa tcagcattca ctgtccatgt cacacctttc agcccaatgg 961 agatatcctg gaaaacattc acgaggtgat ggaaatcaaa ttcaaaggcg tggacaatga 1021 ggatgaccat ggccgtggag atctggggcg cctcaagaag cagaaggatc accacaaccc 1081 tcatctaatc ctcatgatga ttcccccaca ccggctcgac aacccgggcc aggggggtca 1141 gaggaagaag cgggctttgg acaccaatta ctgcttccgc aacttggagg agaactgctg 1201 tgtgcgcccc ctctacattg acttccgaca ggatctgggc tggaagtggg tccatgaacc 1261 taagggctac tatgccaact tctgctcagg cccttgccca tacctccgca gtgcagacac 1321 aacccacagc acggtgctgg gactgtacaa cactctgaac cctgaagcat ctgcctcgcc 1381 ttgctgcgtg ccccaggacc tggagcccct gaccatcctg tactatgttg ggaggacccc 1441 caaagtggag cagctctcca acatggtggt gaagtcttgt aaatgtagct gagaccccac 1501 gtgcgacaga gagaggggag agagaaccac cactgcctga ctgcccgctc ctcgggaaac 1561 acacaagcaa caaacctcac tgagaggcct ggagcccaca accttcggct ccgggcaaat 1621 ggctgagatg gaggtttcct tttggaacat ttctttcttg ctggctctga gaatcacggt 1681 ggtaaagaaa gtgtgggttt ggttagagga aggctgaact cttcagaaca cacagacttt 1741 ctgtgacgca gacagagggg atggggatag aggaaaggga tggtaagttg agatgttgtg 1801 tggcaatggg atttgggcta ccctaaaggg agaaggaagg gcagagaatg gctgggtcag 1861 ggccagactg gaagacactt cagatctgag gttggatttg ctcattgctg taccacatct 1921 gctctaggga atctggatta tgttatacaa ggcaagcatt ttttttttta aagacaggtt 1981 acgaagacaa agtcccagaa ttgtatctca tactgtctgg gattaagggc aaatctatta 2041 cttttgcaaa ctgtcctcta catcaattaa catcgtgggt cactacaggg agaaaatcca 2101 ggtcatgcag ttcctggccc atcaactgta ttgggccttt tggatatgct gaacgcagaa 2161 gaaagggtgg aaatcaaccc tctcctgtct gccctctggg tccctcctct cacctctccc 2221 tcgatcatat ttccccttgg acacttggtt agacgccttc caggtcagga tgcacatttc 2281 tggattgtgg ttccatgcag ccttggggca ttatgggtct tcccccactt cccctccaag 2341 accctgtgtt catttggtgt tcctggaagc aggtgctaca acatgtgagg cattcgggga 2401 agctgcacat gtgccacaca gtgacttggc cccagacgca tagactgagg tataaagaca 2461 agtatgaata ttactctcaa aatctttgta taaataaata tttttggggc atcctggatg 2521 atttcatctt ctggaatatt gtttctagaa cagtaaaagc cttattctaa ggtg // LOCUS HSTGIFPRO 1562 bp RNA PRI 06-JAN-1996 DEFINITION H.sapiens mRNA for TGIF protein. ACCESSION X89750 NID g1150425 KEYWORDS tgif gene; TGIF protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1562) AUTHORS Clerc,R.G. TITLE Direct Submission JOURNAL Submitted (17-JUL-1995) R.G. Clerc, Roche Ltd., Room 69-209, Grenzacherstr.124, CH- 4002 Basel, SWITZERLAND REFERENCE 2 (bases 1 to 1562) AUTHORS Bertolino,E., Reimund,B., Wildt-Perinic,D. and Clerc,R.G. TITLE A novel homeobox protein which recognizes a TGT core and functionally interferes with a retinoid-responsive motif JOURNAL J. Biol. Chem. 270 (52), 31178-31188 (1995) MEDLINE 96125101 FEATURES Location/Qualifiers source 1..1562 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /clone_lib="Clontech" gene 312..1130 /gene="tgif" CDS 312..1130 /gene="tgif" /codon_start=1 /product="TGIF protein" /db_xref="PID:e190025" /db_xref="PID:g1150426" /translation="MKGKKGIVAASGSETEDEDSMDIPLDLSSSAGSGKRRRRGNLPK ESVQILRDWLYEHRYNAYPSEQEKALLSQQTHLSTLQVCNWFINARRRLLPDMLRKDG KDPNQFTISRRGAKISETSSVESVMGIKNFMPALEETPFHSCTAGPNPTLGRPLSPKP SSPGSVLARPSVICHTTVTALKDVPFSLCQSVGVGQNTDIQQIAAKNFTDTSLMYPED TCKSGPSTNTQSGLFNTPPPTPPDLNQDFSGFQLLVDVALKRAAEMELQAKLTA" BASE COUNT 421 a 382 c 371 g 388 t ORIGIN 1 ctggaattcg gggcgccgag caggagcagg gaacaaagga gcggagaggg gaggggagag 61 agttgggcga gggagagccc ccggccggct gccagaagat cctggcggga ggaagcccaa 121 gtgtcacttg aattccaccc aaggagcggg cgcctgggat cagagcgtcc tgtttagcaa 181 taacggctgg agcacgtcct acaagttacg ggagagtcgg ctgtgaagga gacgttcgct 241 tatcccctgt gtccccgctc ctggcccctc cagacccccg ccttgcctcg cgctgggagg 301 ggagatccag aatgaaaggc aagaaaggta ttgttgcagc atctggcagt gagactgagg 361 atgaggacag catggacatt cccttggacc tttcttcatc cgctggctca ggcaagagaa 421 ggagaagggg caacctaccc aaggagtctg tgcagattct tcgggattgg ctgtatgagc 481 accgttacaa tgcctatcct tcagagcaag aaaaagcgtt gctgtcccag caaacacacc 541 tgtctacgct acaggtctgt aactggttca tcaacgcccg ccgcaggctc ctccctgaca 601 tgctgagaaa ggatggcaaa gatccaaatc agttcacaat ttcccgccgt ggggccaaga 661 tttctgaaac gagctctgtg gagtccgtga tgggcatcaa aaacttcatg ccagctctag 721 aggagacccc atttcattcc tgtacagctg ggccaaaccc aaccctaggg aggccactgt 781 ctcctaagcc gtcatccccg ggatcagttt tggctcgtcc atcagtgatc tgccatacca 841 ctgtgactgc attgaaagat gtccctttct ctctctgcca gtcggtcggt gtgggacaaa 901 acacagatat acagcagata gcggccaaaa acttcacaga cacctctctc atgtacccag 961 aggacacttg taaatctgga ccaagtacga atacacagag tggtcttttc aacactcctc 1021 cccctactcc accggacctc aaccaggact tcagtggatt tcagcttcta gtggatgttg 1081 cactcaaacg ggctgcagag atggagcttc aggcaaaact tacagcttaa cccattttca 1141 agcaaaacag ttctcagaaa tgtcatgatt gccggggtga aggcaagaga tgaattgcat 1201 tattttatat attttttatt aatatttgca catgggattg ctaaaacagc ttcctgttac 1261 tgagatgtct tcaatggaat acagtcattc caagaactat aaacttaaag ctactgtaga 1321 aacaaagggt tttctttttt aaatgtttct tggtagatta ttcataatgt gagatggttc 1381 ccaatatcat gtgatttttt tttttcctcc ccttcccttt ttttgttatt ttttcagact 1441 gtgcaatact tagagaacct atagcatctt ctcattccca tgtggaacag gatgcccaca 1501 tactgtctaa ttaataaatt ttccattttt tttcaaacaa gtaaaaaaaa aaaaaaaaaa 1561 aa // LOCUS HSTGR1 2109 bp RNA PRI 19-FEB-1996 DEFINITION H.sapiens hTGR 1 mRNA. ACCESSION X72018 NID g1200088 KEYWORDS monomer; N-acetylglucosamine; receptor; transmembrane protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2109) AUTHORS Blanck,O., Perrin,C., Mziaut,H., Darbon,H., Mattei,M.G. and Miquelis,R. TITLE Molecular cloning, cDNA analysis, and localization of a monomer of the N-acetylglucosamine-specific receptor of the thyroid, NAGR1, to chromosome 19p13.3-13.2 JOURNAL Genomics 21 (1), 18-26 (1994) MEDLINE 94375011 REFERENCE 2 (bases 1 to 2106) AUTHORS Miquelis,R. TITLE Direct Submission JOURNAL Submitted (13-MAY-1993) R. Miquelis, CNRS, Laboratoire de Biochimie. URA CNRS 1455, Faculte de Medicine Nord, Bd. Pierre Dramard, 13915 Marseille Cedex 15, FRANCE REFERENCE 3 (bases 1 to 2106) AUTHORS Blanck,O., Perrin,C., Mziaut,H., Darbon,H., Mattei,M.G. and Miquelis,R. TITLE Molecular cloning, cDNA analysis, and localization of a monomer of the N-acetylglucosamine-specific receptor of the thyroid, NAGR1, to chromosome 19p13.3-13.2 JOURNAL Genomics 27 (3), 561 (1995) MEDLINE 96047351 FEATURES Location/Qualifiers source 1..2109 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /chromosome="19p 13.2-13.3" /tissue_type="thyroid" gene 127..1839 /gene="ORF" CDS 127..1839 /gene="ORF" /note="almost identical to nRNP M protein, acc.L03532" /codon_start=1 /db_xref="PID:g1200089" /translation="MATTGGMGMGPGGPGMITIPPSILNNPNIPNEIIHALQAGRLGS TVFVANLDYKVGWKKLKEVFSMAGVVVRADILEDKDGKSRGIGTVTFEQSIEAVQAIS MFNGQLLFDRPMHVKMDERALPKGDFFPPERPQQLPHGLGGIGMGLGPGGQPIDANHL NKGIGMGNIGPAGMGMEGIGFGINKMGGMEGPFGGGMENMGRFGSGMNMGRINEILSN ALKRGEIIAKQGGGGGGGSVPGIERMGPGIDRLGGAGMERMGAGLGHGMDRVGSEIER MGLVMDRMGSVERMGSGIERMGPLGLDHMASSIERMGQTMERIGSGVERMGAGMGFGL ERMAAPIDRVGQTIERMGSGVERMGPAIERMGLSMERMVPAGMGAGLERMGPVMDRMA TGLERMGANNLERMGLERMGANSLERMGLERMGANSLERMGPAMGPALGAGIERMGLA MGGGGGASFDRAIEMERGNFGGSFAGSFGGAGGHAPGVARKACQIFVRNLPFDFTWKM LKDKFNECGHVLYADIKMENGKSKGCGVVKFESPEVAERACRMMNGMKLSGREIDVRI DRNA" polyA_signal 2078..2083 BASE COUNT 517 a 443 c 678 g 471 t ORIGIN 1 attccggaag atgggcatgc taaataagct gcggaagtcc taaacaagca tagtctgagc 61 ggaagaccac tgaaagtcaa agaagatcct gatggtgaac atgccaggag agcaatgcaa 121 aaggtgatgg ctacgactgg tgggatgggt atgggaccag gtggcccagg aatgattact 181 atcccaccca gtatcctaaa taatcccaac atcccaaatg agattatcca tgcattacag 241 gctggaagac ttggaagcac agtatttgta gcaaatctgg attataaagt tggctggaag 301 aaactgaagg aagtatttag tatggctggt gtggtggtcc gagcagacat tcttgaagat 361 aaagatggaa aaagtcgtgg aataggcact gttacttttg aacagtccat tgaagctgtg 421 caagctatat ctatgttcaa tggccagctg ctatttgata gaccaatgca cgtcaagatg 481 gatgagaggg ccttaccaaa aggagatttc ttccctcctg agcgtccaca acaacttccc 541 catggccttg gtggtattgg catggggtta ggaccaggag ggcaacccat tgatgccaat 601 cacctgaata aaggcatcgg aatgggaaac ataggtcccg caggaatggg aatggaaggc 661 ataggatttg gaataaataa aatgggagga atggaggggc cctttggtgg tggtatggaa 721 aacatgggtc gatttggatc tgggatgaac atgggcagga taaatgaaat cctaagtaat 781 gcactgaaga gaggagagat cattgcaaag cagggaggag gtggaggtgg aggaagcgtc 841 cctgggatcg agaggatggg tcctggcatt gaccgcctcg ggggtgccgg catggagcgc 901 atgggcgcgg gcctgggcca cggcatggat cgcgtgggct ccgagatcga gcgcatgggc 961 ctggtcatgg accgcatggg ctccgtggag cgcatgggct ccggcattga gcgcatgggc 1021 ccgctgggcc tcgaccacat ggcctccagc attgagcgca tgggccagac catggagcgc 1081 attggctctg gcgtggagcg catgggtgcc ggcatgggct tcggccttga gcgcatggcc 1141 gctcccatcg accgtgtggg ccagaccatt gagcgcatgg gctctggcgt ggagcgcatg 1201 ggccctgcca tcgagcgcat gggcctgagc atggagcgca tggtgcccgc aggtatggga 1261 gctggcctgg agcgcatggg ccccgtgatg gatcgcatgg ccaccggcct ggagcgcatg 1321 ggcgccaaca atctggagcg gatgggcctg gagcgcatgg gcgccaacag cctcgagcgc 1381 atgggcctgg agcgcatggg tgccaacagc ctcgagcgca tgggccccgc catgggcccg 1441 gccctgggcg ctggcattga gcgcatgggc ctggccatgg gtggcggtgg cggtgccagc 1501 tttgaccgtg ccatcgagat ggagcgtggc aacttcggag gaagcttcgc aggttccttt 1561 ggtggagctg gaggccatgc tcctggggtg gccaggaagg cctgccagat atttgtgaga 1621 aatctgccat tcgatttcac atggaagatg ctaaaggaca aattcaacga gtgcggccac 1681 gtgctgtacg ccgacatcaa gatggagaat gggaagtcca aggggtgtgg tgtggttaag 1741 ttcgagtcgc cagaggtggc cgagagagcc tgccggatga tgaatggcat gaagctgagt 1801 ggccgagaga ttgacgttcg aattgataga aacgcttaag cagttgcctt ttttaaacat 1861 cgatacgaga cctctgaatt tgtatttttt cttgttaacc attttaattt gttggctgga 1921 tgtataaaga tgtttaaaaa attcagttgc tttttggggt aatttgaatt acttttttaa 1981 tgactggggt tccatttgac tgtttgcatt gagattgcaa tgtgcgcaat tttttttgta 2041 gttgtggcat cttgttgaca tcgaatatga ctttgataat aaataccggt tcctgaaaaa 2101 aaaaaaaaa // LOCUS HSTHIMOPS 2515 bp RNA PRI 18-OCT-1995 DEFINITION H.sapiens mRNA for thimet oligopeptidase (metalloproteinase). ACCESSION Z50115 NID g1030054 KEYWORDS endopeptidase; metalloproteinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2515) AUTHORS Thompson,A., Huber,G. and Malherbe,P. TITLE Cloning and functional expression of a metalloendopeptidase from human brain with the ability to cleave a beta-APP substrate peptide JOURNAL Biochem. Biophys. Res. Commun. 213 (1), 66-73 (1995) MEDLINE 95367027 REFERENCE 2 (bases 1 to 2515) AUTHORS Malherbe,P. TITLE Direct Submission JOURNAL Submitted (04-JUL-1995) Malherbe P., F.Hoffmann-La Roche Ltd., Pharma Division,Preclinical Research, 69/333, Basel, Switzerland, CH-4002 FEATURES Location/Qualifiers source 1..2515 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="15-1" /dev_stage="adult" /tissue_type="temporal cortex" /clone_lib="lambda ZAP cDNA library" CDS 128..2197 /standard_name="thimet oligopeptidase" /EC_number="3.4.24.15" /function="endopeptidase" /citation=[1] /codon_start=1 /evidence=experimental /product="metalloproteinase" /db_xref="PID:g1030055" /translation="MKPPAACAGDMADAASPCSVVNDLRWDLSAQQIEERTRELIEQT KRVYDQVGTQEFEDVSYESTLKALADVEVTYTVQRNILDFPQHVSPSKDIRTASTEAD KKLSEFDVEMSMREDVYQRIVWLQEKVQKDSLRPEAARYLERLIKLGRRNGLHLPRET QENIKRIKKKLSLLCIDFNKNLNEDTTFLPFTLQELGGLPEDFLNSLEKMEDGKLKVT LKYPHYFPLLKKCHVPETRRKVEEAFNCRCKEENCAILKELVTLRAQKSRLLGFHTHA DYVLEMNMAKTSQTVATFLDELAQKLKPLGEQERAVILELKRAECERRGLPFDGRIRA WDMRYYMNQVEETRYCVDQNLLKEYFPVQVVTHGLLGIYQELLGLAFHHEEGASAWHE DVRLYTARDAASGEVVGKFYLDLYPREGKYGHAACFGLQPGCLRQDGSRQIAIAAMVA NFTKPTADAPSLLQHDEVETYFHEFGHVMHQLCSQAEFAMFSGTHVERDFVEAPSQML ENWVWEQEPLLRMSRHYRTGSAVPRELLEKLIESRQANTGLFNLRQIVLAKVDQALHT QTDADPAEEYARLCQEILGVPATPGTNMPATFGHLAGGYDAQYYGYLWSEVYSMDMFH TRFKQEGVLNSKVGMDYRSCILRPGGSEDASAMLRRFLGRDPKQDAFLLSKGLQVGGC EPEPQVC" BASE COUNT 504 a 761 c 831 g 419 t ORIGIN 1 gaattccggg gcatgctgtg gcggcggttg ggccgaggca ggcggcctca gtggccgagg 61 tggcttggac gcgtacgagg tggaaggagg gagggagccg caggcgcaga cccacagacc 121 acccgccatg aagccccccg cagcctgtgc aggagacatg gcggacgcag catctccgtg 181 ctctgtggta aacgacctgc ggtgggacct gagtgcccag cagatagagg agcgcaccag 241 ggagctcatc gagcagacca agcgcgtgta tgaccaggtt ggcacccagg agtttgagga 301 cgtgtcctac gagagcacgc tcaaggcgct ggccgatgtg gaggtcacct acacagttca 361 gaggaatatc cttgacttcc cccagcatgt ttccccctcc aaggacatcc ggacagccag 421 cacagaggcc gacaagaagc tctctgagtt cgacgtggag atgagcatga gggaggacgt 481 gtaccagagg atcgtgtggc tccaggagaa agttcagaag gactcactga ggcccgaggc 541 tgcgcggtac ctggagcggc taatcaagct gggccggaga aatgggcttc acctccccag 601 agagactcag gaaaacatca aacgcatcaa gaagaagctg agccttctgt gcatcgactt 661 caacaagaac ctgaacgagg acacgacctt cctgcccttc acgctccagg agctaggagg 721 gctccccgag gactttctga actccctgga gaagatggag gacggcaagt tgaaggtcac 781 cctcaagtac ccccattact tccccctcct gaagaaatgc cacgtgcctg agaccaggag 841 gaaagtggag gaggccttca actgccggtg caaggaggag aactgcgcta tcctcaagga 901 gctggtgacg ctgcgggccc agaagtcccg cctgctgggg ttccacacgc acgccgacta 961 tgtcctggag atgaacatgg ccaagaccag ccagaccgtg gccaccttcc tagatgagct 1021 ggcgcagaag ctgaagcccc tgggggagca ggagcgtgcg gtgattctgg agctgaagcg 1081 tgcggagtgc gagcgccggg gcctgccctt cgacggccgc atccgtgcct gggacatgcg 1141 ctactacatg aaccaggtgg aggagacgcg ctactgcgtg gaccagaacc tgctcaagga 1201 gtacttcccc gtgcaggtgg tcacgcacgg gctgctgggc atctaccagg agctcctggg 1261 gctggccttc caccacgagg agggcgccag tgcctggcat gaggacgtgc ggctctacac 1321 cgcgagggac gcggcctcgg gggaggtggt cggcaagttc tacctggacc tgtacccgcg 1381 ggaaggaaag tacgggcacg cggcctgctt tggcctgcag cccggctgcc tgcggcagga 1441 tgggagccgc cagatcgcca tcgcggccat ggtggccaac ttcaccaagc ccacagccga 1501 cgcgccctcg ctgctgcagc atgacgaggt ggagacctac ttccatgagt ttggccacgt 1561 gatgcaccag ctctgctccc aggcggagtt cgccatgttc agcgggaccc acgtggagcg 1621 ggactttgtg gaggcgccgt cgcagatgct ggagaactgg gtgtgggagc aggagccgct 1681 gctgcggatg tcgcggcact accgcacagg cagcgccgtg ccccgggagc tcctggagaa 1741 gctcattgag tcccggcagg ccaacacagg cctcttcaac ctgcgccaga tcgtcctcgc 1801 caaggtggac caggccctgc acacgcagac ggacgcagac cccgccgagg agtatgcgcg 1861 gctctgccag gagatcctcg gggtcccggc cacgccagga accaacatgc ctgcaacctt 1921 cggccatctg gcaggtggct acgacgccca gtactacggg tacctgtgga gcgaggtgta 1981 ttccatggac atgttccaca cgcgcttcaa gcaggagggt gtcctgaaca gcaaggttgg 2041 catggattac agaagctgca tcctgagacc cggcggttcc gaggatgcca gcgccatgct 2101 gaggcgcttc ctgggccgtg accccaagca ggacgccttc ctcctgagca aggggctgca 2161 ggtcgggggc tgcgagcccg agccgcaggt ctgctgaggc ctggcactgc gactgcccag 2221 tctggcctgc gctcccgccg ccctggtgcc ttagcccccg gcacaggatg gggcaagctc 2281 tggcacagtg ccttgggact ggactggcag ggtggctgag cggctgtctt gcctcttgtc 2341 attgtctgtc cccacccggt cgtggcccac ccggctagac ggcgtcctca aggcatctgg 2401 agggctttcg tggctgccag ggcctggtct ttgttgcact aacacgtctc ctctctggga 2461 aacgtccctt gtcaggagac ggctcttctt tgaaatgagg tcattaaaag gaaac // LOCUS HSTHIORED 3826 bp RNA PRI 27-MAR-1996 DEFINITION H.sapiens mRNA for thioredoxin reductase. ACCESSION X91247 NID g1237037 KEYWORDS thioredoxin reductase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3826) AUTHORS Gasdaska,P.Y., Gasdaska,J.R., Cochran,S. and Powis,G. TITLE Cloning and sequencing of human thioredoxin reductase JOURNAL Unpublished REFERENCE 2 (bases 1 to 3826) AUTHORS Gasdaska,J.R. TITLE Direct Submission JOURNAL Submitted (09-SEP-1995) J.R. Gasdaska, Arizona Cancer Center, Arizona Health Sciences Center, Univ. of Arizona, Tucson AZ 85724, USA FEATURES Location/Qualifiers source 1..3826 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="5'-stretch cDNA" CDS 440..1933 /EC_number="1.6.4.5" /codon_start=1 /product="thioredoxin reductase (NADPH)" /db_xref="PID:e198281" /db_xref="PID:g1237038" /translation="MNGPEDLPKSYDYDLIIIGGGSGGLAAAKEAAQYGKKVMVLDFV TPTPLGTRWGLGGTCVNVGCIPKKLMHQAALLGQALQDSRNYGWKVEETVKHDWDRMI EAVQNHIGSLNWGYRVALREKKVVYENAYGQFIGPHRIKATNNKGKEKIYSAESFLIA TGERPRYLGIPGDKEYCISSDDLFSLPYCPGKTLVVGASYVALECAGFLAGIGLGVTV MVRSILLRGFDQDMANKIGEHMEEHGIKFIRQFVPIKVEQIEAGTPGRLRVVAQSTNS EEIIEGEYNTVMLAIGRDACTRKIGLETVGVKINEKTGKIPVTDEEQTNVPYIYAIGD ILEDKVELTPVAIQAGRLLAQRLYAGSTVKCDYENVPTTVFTPLEYGACGLSEEKAVE KFGEENIEVYHSYFWPLEWTIPSRDNNKCYAKIICNTKDNERVVGFHVLGPNAGEVTQ GFAAALKCGLTKKQLDSTIGIHPVCAEVFTTLSVTKRSGASILQAGC" BASE COUNT 1053 a 740 c 937 g 1096 t ORIGIN 1 gaattcgggt ggagtcctga aggagggcct gatgtcttca tcattctcaa attcttgtaa 61 gctctgcgtc gggtgaaacc agacaaagcc gcgagcccag ggatgggagc acgcggggga 121 cggcctgccg gcggggacga cagcattgcg cctgggtgca gcagtgtgcg tctcggggaa 181 gggaagatat tttaaggcgt gtctgagcag acggggaggc ttttccaaac ccaggcagct 241 tcgtggcgtg tgcggtttcg acccggtcac acaaagcttc agcatgtcat gtgaggacgg 301 tcgggccctg aaaggaacgc tctcggaatt ggccgcggaa accgatctgc ccgttgtgtt 361 tgtgaaacag agaaagatag gcggccatgg tccaaccttg aaggcttatc aggagggcag 421 acttcaaaag ctactaaaaa tgaacggccc tgaagatctt cccaagtcct atgactatga 481 ccttatcatc attggaggtg gctcaggagg tctggcagct gctaaggagg cagcccaata 541 tggcaagaag gtgatggtcc tggactttgt cactcccacc cctcttggaa ctagatgggg 601 tcttggagga acatgtgtga atgtgggttg catacctaaa aaactgatgc atcaagcagc 661 tttgttagga caagccctgc aagactctcg aaattatgga tggaaagtcg aggagacagt 721 taagcatgat tgggacagaa tgatagaagc tgtacagaat cacattggct ctttgaattg 781 gggctaccga gtagctctgc gggagaaaaa agtcgtctat gagaatgctt atgggcaatt 841 tattggtcct cacaggatta aggcaacaaa taataaaggc aaagaaaaaa tttattcagc 901 agagagtttt ctcattgcca ctggtgaaag accacgttac ttgggcatcc ctggtgacaa 961 agaatactgc atcagcagtg atgatctttt ctccttgcct tactgcccgg gtaagaccct 1021 ggttgttgga gcatcctatg tcgctttgga gtgcgctgga tttcttgctg gtattggttt 1081 aggcgtcact gttatggtta ggtccattct tcttagagga tttgaccagg acatggccaa 1141 caaaattggt gaacacatgg aagaacatgg catcaagttt ataagacagt tcgtaccaat 1201 taaagttgaa caaattgaag cagggacacc aggccgactc agagtagtag ctcagtccac 1261 caatagtgag gaaatcattg aaggagaata taatacggtg atgctggcaa taggaagaga 1321 tgcttgcaca agaaaaattg gcttagaaac cgtaggggtg aagataaatg aaaagactgg 1381 aaaaatacct gtcacagatg aagaacagac caatgtgcct tacatctatg ccattggcga 1441 tatattggag gataaggtgg agctcacccc agttgcaatc caggcaggaa gattgctggc 1501 tcagaggctc tatgcaggtt ccactgtcaa gtgtgactat gaaaatgttc caaccactgt 1561 atttactcct ttggaatatg gtgcttgtgg cctttctgag gagaaagctg tggagaagtt 1621 tggggaagaa aatattgagg tttaccatag ttacttttgg ccattggaat ggacgattcc 1681 gtcaagagat aacaacaaat gttatgcaaa aataatctgt aatactaaag acaatgaacg 1741 tgttgtgggc tttcacgtac tgggtccaaa tgctggagaa gttacacaag gctttgcagc 1801 tgcgctcaaa tgtggactga ccaaaaagca gctggacagc acaattggaa tccaccctgt 1861 ctgtgcagag gtattcacaa cattgtctgt gaccaagcgc tctggggcaa gcatcctcca 1921 ggctggctgc tgaggttaag ccccagtgtg gatgctgttg ccaagactgc aaaccactgg 1981 ctcgtttccg tgcccaaatc caaggcgaag ttttctagag ggttcttggg ctcttggcac 2041 ctgcgtgtcc tgtgcttacc accgcccaag gcccccttgg atctcttgga taggagttgg 2101 tgaatagaag gcaggcagca tcacactggg gtcactgaca gacttgaagc tgacatttgg 2161 cagggcatcg aagggatgca tccatgaagt caccagtctc aagcccatgt ggtaggcggt 2221 gatggaacaa ctgtcaaatc agttttagca tgacctttcc ttgtggattt tcttattctc 2281 gttgtcaagt tttctagggt tgaatttttt tcttttttct ccatggtgtt aatgatatta 2341 gagatgaaaa acgttagcag ttgatttttg tccaaaagca agtcatggct agagtatcca 2401 tgcaaggtgt cttgttgcat ggaagggata gtttggctcc cttggaggct atgtaggctt 2461 gtcccgggaa agagaactgt cctgcagctg aaatggactg ttctttactg acctgctcag 2521 cagtttcttc tctcatatat tcccaaaaca agtacatctg cgatcaactc tagccaaatt 2581 tgcccctgtg tgctacatga tggatgatta ttattttaag gtctgtttag gaagggaaat 2641 ggctacttgg ccagccattg cctggcattt ggtagtatag tatgattctc accattattt 2701 gtcatggagg cagacataca ccagaaatgg gggagaaaca gtacatatct ttctgtcttt 2761 agtttattgt gtgctggtct aagcaagctg agatcatttg caatggaaaa cacgtaactt 2821 gtttaaaagt ttttctggta gctttagctt tatgctaaaa aaaataatga cattgggtat 2881 ctatttcttt ctaagacata cattagtagg aaaataagtc ttttcatgct tatgatttag 2941 ctgttttgtg gtaattgctt tttaaaggaa gttattaata tcataagtta ttattaatat 3001 tttgaacaca ggtggatgtg aaggattttc atttaaaaac caagtggttt tgactttttc 3061 tgttgaatga acaactgtgc cttgtggaat ttttgcagaa gtgtttatgc tttgttagca 3121 tttcaacttg cattattata aagaggtatt aatgcctcag ttatgtgttt gtcaatgtac 3181 tggctgagga ttctatctca gctgtctttt ctaactgtgt aggttgagtt ttgaacacgt 3241 gcttgtggac atcagcctcc tgccagcagt tcttgaagct tctttttcat tcctgctact 3301 ctacctgtat ttctcagttg cagcactgag tggtcaaaat acatttctgg gccacctcag 3361 ggaacccatg catctgcctg gcatttaggc agcagagccc ctgaccgtcc cccacaggct 3421 ctgcctcacg tcctcatctc atttggctgt gtaaagaaat gggaaaaggg aaaaggagag 3481 agcaattgag gcagttgacc atattcagtt ttatttattt atttttaatt tgtttttttc 3541 tccaagtcca ccagtctctg aaattagaac agtaggcggt atgagataat caggcctaat 3601 catgttgtga ttctcttttc ttagtggagt ggaatgttct atccccacaa gaaggattat 3661 atcttataga cttgtcttgt tcagattctg tatttaccca ttttattgaa acatatacta 3721 agttccatgt atttttgtta caaatcttct gaaaaaaaac aaaacaatgt gaaacattaa 3781 aattaaaagg cattaataat aaaaaaaaaa aaaaaaaccc gaattc // LOCUS HSTHMOD 3466 bp RNA PRI 21-MAR-1995 DEFINITION Human mRNA for thrombomodulin precursor. ACCESSION X05495 NID g37123 KEYWORDS glycoprotein; thrombomodulin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3466) AUTHORS Suzuki,K., Kusumoto,H., Deyashiki,Y., Nishioka,J., Maruyama,I., Zushi,M., Kawahara,S., Honda,G., Yamamoto,S. and Horiguchi,S. TITLE Structure and expression of human thrombomodulin, a thrombin receptor on endothelium acting as a cofactor for protein C activation JOURNAL EMBO J. 6 (7), 1891-1897 (1987) MEDLINE 88004395 REFERENCE 2 (bases 1 to 3466) AUTHORS Yamamoto,S. TITLE Direct Submission JOURNAL Submitted (10-NOV-1987) to the EMBL/GenBank/DDBJ databases COMMENT Data kindly reviewed (10-NOV-1987) by Yamamoto S. FEATURES Location/Qualifiers source 1..3466 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 151..204 /note="put. signal peptide (AA -18 to -1)" CDS 151..1878 /codon_start=1 /product="thrombomodulin" /db_xref="PID:g736251" /db_xref="SWISS-PROT:P07204" /translation="MLGVLVLGALALAGLGFPAPAEPQPGGSQCVEHDCFALYPGPAT FLNASQICDGLRGHLMTVRSSVAADVISLLLNGDGGVGRRRLWIGLQLPPGCGDPKRL GPLRGFQWVTGDNNTSYSRWARLDLNGAPLCGPLCVAVSAAEATVPSEPIWEEQQCEV KADGFLCEFHFPATCRPLAVEPGAAAAAVSITYGTPFAARGADFQALPVGSSAAVAPL GLQLMCTAPPGAVQGHWAREAPGAWDCSVENGGCEHACNAIPGAPRCQCPAGAALQAD GRSCTASATQSCNDLCEHFCVPNPDQPGSYSCMCETGYRLAADQHRCEDVDDCILEPS PCPQRCVNTQGGFECHCYPNYDLVDGECVEPVDPCFRANCEYQCQPLNQTSYLCVCAE GFAPIPHEPHRCQMFCNQTACPADCDPNTQASCECPEGYILDDGFICTDIDECENGGF CSGVCHNLPGTFECICGPDSALVRHIGTDCDSGKVDGGDSGSGEPPPSPTPGSTLTPP AVGLVHSGLLIGISIASLCLVVALLALLCHLRKKQGAARAKMEYKCAAPSKEVVLQHV RTERTPQRL" mat_peptide 205..1875 /note="mature thrombomodulin (AA 1 - 557)" misc_feature 2567..2572 /note="pot. polyA signal" old_sequence 2879 /note="c was a in [1]" /citation=[1] misc_feature 2987..2992 /note="pot. polyA signal" old_sequence 2996..2998 /note="agc was ac in [1]" /citation=[1] BASE COUNT 684 a 1045 c 951 g 786 t ORIGIN 1 caggggctgc gcgcagcggc aagaagtgtc tgggctggga cggacaggag aggctgtcgc 61 catcggcgtc ctgtgcccct ctgctccggc acggccctgt cgcagtgccc gcgctttccc 121 cggcgcctgc acgcggcgcg cctgggtaac atgcttgggg tcctggtcct tggcgcgctg 181 gccctggccg gcctggggtt ccccgcaccc gcagagccgc agccgggtgg cagccagtgc 241 gtcgagcacg actgcttcgc gctctacccg ggccccgcga ccttcctcaa tgccagtcag 301 atctgcgacg gactgcgggg ccacctaatg acagtgcgct cctcggtggc tgccgatgtc 361 atttccttgc tactgaacgg cgacggcggc gttggccgcc ggcgcctctg gatcggcctg 421 cagctgccac ccggctgcgg cgaccccaag cgcctcgggc ccctgcgcgg cttccagtgg 481 gttacgggag acaacaacac cagctatagc aggtgggcac ggctcgacct caatggggct 541 cccctctgcg gcccgttgtg cgtcgctgtc tccgctgctg aggccactgt gcccagcgag 601 ccgatctggg aggagcagca gtgcgaagtg aaggccgatg gcttcctctg cgagttccac 661 ttcccagcca cctgcaggcc actggctgtg gagcccggcg ccgcggctgc cgccgtctcg 721 atcacctacg gcaccccgtt cgcggcccgc ggagcggact tccaggcgct gccggtgggc 781 agctccgccg cggtggctcc cctcggctta cagctaatgt gcaccgcgcc gcccggagcg 841 gtccaggggc actgggccag ggaggcgccg ggcgcttggg actgcagcgt ggagaacggc 901 ggctgcgagc acgcgtgcaa tgcgatccct ggggctcccc gctgccagtg cccagccggc 961 gccgccctgc aggcagacgg gcgctcctgc accgcatccg cgacgcagtc ctgcaacgac 1021 ctctgcgagc acttctgcgt tcccaacccc gaccagccgg gctcctactc gtgcatgtgc 1081 gagaccggct accggctggc ggccgaccaa caccggtgcg aggacgtgga tgactgcata 1141 ctggagccca gtccgtgtcc gcagcgctgt gtcaacacac agggtggctt cgagtgccac 1201 tgctacccta actacgacct ggtggacggc gagtgtgtgg agcccgtgga cccgtgcttc 1261 agagccaact gcgagtacca gtgccagccc ctgaaccaaa ctagctacct ctgcgtctgc 1321 gccgagggct tcgcgcccat tccccacgag ccgcacaggt gccagatgtt ttgcaaccag 1381 actgcctgtc cagccgactg cgaccccaac acccaggcta gctgtgagtg ccctgaaggc 1441 tacatcctgg acgacggttt catctgcacg gacatcgacg agtgcgaaaa cggcggcttc 1501 tgctccgggg tgtgccacaa cctccccggt accttcgagt gcatctgcgg gcccgactcg 1561 gcccttgtcc gccacattgg caccgactgt gactccggca aggtggacgg tggcgacagc 1621 ggctctggcg agcccccgcc cagcccgacg cccggctcca ccttgactcc tccggccgtg 1681 gggctcgtgc attcgggctt gctcataggc atctccatcg cgagcctgtg cctggtggtg 1741 gcgcttttgg cgctcctctg ccacctgcgc aagaagcagg gcgccgccag ggccaagatg 1801 gagtacaagt gcgcggcccc ttccaaggag gtagtgctgc agcacgtgcg gaccgagcgg 1861 acgccgcaga gactctgagc ggcctccgtc caggagcctg gctccgtcca ggagcctgtg 1921 cctcctcacc cccagctttg ctaccaaagc accttagctg gcattacagc tggagaagac 1981 cctccccgca ccccccaagc tgttttcttc tattccatgg ctaactggcg agggggtgat 2041 tagagggagg agaatgagcc tcggcctctt ccgtgacgtc actggaccac tgggcaatga 2101 tggcaatttt gtaacgaaga cacagactgc gatttgtccc aggtcctcac taccgggcgc 2161 aggagggtga gcgttattgg tcggcagcct tctgggcaga ccttgacctc gtgggctagg 2221 gatgactaaa atatttattt tttttaagta tttaggtttt tgtttgtttc ctttgttctt 2281 acctgtatgt ctccagtatc cactttgcac agctctccgg tctctctctc tctacaaact 2341 cccacttgtc atgtgacagg taaactatct tggtgaattt ttttttccta gccctctcac 2401 atttatgaag caagccccac ttattcccca ttcttcctag ttttctcctc ccaggaactg 2461 ggccaactca cctgagtcac cctacctgtg cctgacccta cttcttttgc tcttagctgt 2521 ctgctcagac agaaccccta catgaaacag aaacaaaaac actaaaaata aaaatggcca 2581 tttgcttttt caccagattt gctaatttat cctgaaattt cagattccca gagcaaaata 2641 attttaaaca aaggttgaga tgtaaaaggt attaaattga tgttgctgga ctgtcataga 2701 aattacaccc aaagaggtat ttatctttac ttttaaacag tgagcctgaa ttttgttgct 2761 gttttgattt gtactgaaaa atggtaattg ttgctaatct tcttatgcaa tttccttttt 2821 tgttattatt acttattttt gacagtgttg aaaatgttca gaaggttgct ctagattgcg 2881 agaagagaca aacacctccc aggagacagt tcaagaaagc ttcaaactgc atgattcatg 2941 ccaattagca attgactgtc actgttcctt gtcactggta gaccaaaata aaaccgactc 3001 tactggtctt gtggaattgg gagcttggga atggatcctg gaggatgccc aattagggcc 3061 tagccttaat caggtcctca gagaatttct accatttcag agaggccttt tggaatgtgg 3121 cccctgaaca agaattggaa gctgccctgc ccatgggagc tggttagaaa tgcagaatcc 3181 taggctccac cccatccagt tcatgagaat ctatatttaa caagatctgc agggggtgtg 3241 tctgctcagt aatttgagga caaccattcc agactgcttc caattttctg gaatacatga 3301 aatatagatc agttataagt agcaggccaa gtcaggccct tattttcaag aaactgagga 3361 attttctttg tgtagctttg ctctttggta gaaaaggcta ggtacacagc tctagacact 3421 gccacacagg gtctgcaagg tctttggttc agctaagccg gaattc // LOCUS HSTHRINH 1361 bp RNA PRI 13-JUL-1994 DEFINITION H.sapiens thrombin inhibitor mRNA. ACCESSION Z22658 NID g297411 KEYWORDS thrombin inhibitor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1361) AUTHORS Coughlin,P., Sun,J., Cerruti,L., Salem,H.H. and Bird,P. TITLE Cloning and molecular characterization of a human intracellular serine proteinase inhibitor JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90 (20), 9417-9421 (1993) MEDLINE 94022386 REFERENCE 2 (bases 1 to 1361) AUTHORS Steinle,A. TITLE Direct Submission JOURNAL Submitted (27-APR-1993) Steinle A., University of Munich, Institute of Immunology, Goethestrasse 31, W-8000 MUENCHEN 2, GERMANY FEATURES Location/Qualifiers source 1..1361 /organism="Homo sapiens" /macronuclear /db_xref="taxon:9606" /clone="PTI/P" /tissue_type="placenta" /clone_lib="cDNA in lamda gt11 (clontech)" CDS 75..1205 /function="serine proteinase inhibitor" /citation=[1] /codon_start=1 /label=PTI /evidence=experimental /product="thrombin inhibitor" /db_xref="PID:g297412" /db_xref="SWISS-PROT:P35237" /translation="MDVLAEANGTFALNLLKTLGKDNSKNVFFSPMSMSCALAMVYMG AKGNTAAQMAQILSFNKSGGGGDIHQGFQSLLTEVNKTGTQYLLRVANRLFGEKSCDF LSSFRDSCQKFYQAEMEELDFISAVEKSRKHINTWVAEKTEGKIAELLSPGSVDPLTR LVLVNAVYFRGNWDGQFDKENTEERLFKVSKNEEKPVQMMFKQSTFKKTYIGEIFTQI LVLPYVGKELNMIIMLPDETTDLRTVEKELTYEKFVEWTRLDMMDEEEVEVSLPRFKL EESYDMESVLRNLGMTDAFELGKADFSGMSQTDLSLSKVVHKSFVEVNEEGTEAAAAT AAIMMMRCARFVPRFCADHPFLFFIQHRKTNGILFCGRFSSP" polyA_signal 1311..1317 BASE COUNT 353 a 328 c 361 g 319 t ORIGIN 1 ctcgctcgct ccccgctctg gagtacgtgt ctggcttggg agccgctcgg acacgctggc 61 ttgggtctgc catcatggat gttctcgcag aggcaaatgg cacctttgcc ttaaaccttt 121 tgaaaacgct gggtaaagac aactcgaaga atgtgttttt ctcacccatg agcatgtcct 181 gtgccctggc catggtctac atgggggcaa agggaaacac cgctgcacag atggcccaga 241 tactttcttt caataaaagt ggcggtggtg gagacatcca ccagggcttc cagtctcttc 301 tcaccgaagt gaacaagact ggcacgcagt acttgcttag ggtggccaac aggctctttg 361 gggaaaagtc ttgtgatttc ctctcatctt ttagagattc ctgccaaaaa ttctaccaag 421 cagagatgga ggagcttgac tttatcagcg ccgtagagaa gtccagaaaa cacataaaca 481 cctgggtagc tgaaaagaca gaaggtaaaa ttgcggagtt gctctctccg ggctcagtgg 541 atccattgac aaggctggtt ctggtgaatg ctgtctattt cagaggaaac tgggatggac 601 agtttgacaa ggagaacacc gaggagagac tgtttaaagt cagcaagaat gaggagaaac 661 ctgtgcaaat gatgtttaag caatctactt ttaagaagac ctatatagga gaaatattta 721 cccaaatctt ggtgcttcca tatgttggca aggaactgaa tatgatcatc atgcttccgg 781 acgagaccac tgacttgaga acggtggaga aagaactcac ttacgagaag ttcgtagaat 841 ggacgaggct ggacatgatg gatgaagagg aggtggaagt gtccctcccg cggtttaaac 901 tagaggaaag ctacgacatg gagagtgtcc tgcgcaacct gggcatgact gatgccttcg 961 agctgggcaa ggcagacttc tctggaatgt cccagacaga cctgtctctg tccaaggtcg 1021 tgcacaagtc ttttgtggag gtcaatgagg aaggcacgga ggctgcagcc gccacagctg 1081 ccatcatgat gatgcggtgt gccagattcg tcccccgctt ctgcgccgac caccccttcc 1141 ttttcttcat ccagcacaga aagaccaacg ggattctctt ctgcggccgc ttttcctctc 1201 cgtgaggaca gggcagtctt ggtgtgcagc ccctctcctc tctgtcccct gacactccac 1261 agtgtgcctg caacccaagt ggccttatcc gtgcagtggt ggcagttcag aaataaaggg 1321 cccatttgtg ggatgccgca ttcaaaaaaa aaaaaaaaaa a // LOCUS HSTHROMB4 3074 bp RNA PRI 05-MAY-1995 DEFINITION H.sapiens mRNA for thrombospondin-4. ACCESSION Z19585 NID g311625 KEYWORDS thrombospondin-4. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1588 to 3074) AUTHORS Lawler,J., McHenry,K., Duquette,M. and Derick,L. TITLE Characterization of human thrombospondin-4 JOURNAL J. Biol. Chem. 270 (6), 2809-2814 (1995) MEDLINE 95155352 REFERENCE 2 (bases 1 to 3074) AUTHORS Lawler,J.W. TITLE Direct Submission JOURNAL Submitted (15-JAN-1993) Lawler J. W., Brigham and Women's Hospital, Pathology, 221 Longwood Ave, Boston, Massachusetts, U.S.A., 02115 FEATURES Location/Qualifiers source 1..3074 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="heart" /clone_lib="human heart" CDS 28..2913 /standard_name="thrombospondin-4" /function="unknown" /codon_start=1 /product="thrombospondin-4" /db_xref="PID:g311626" /db_xref="SWISS-PROT:P35443" /translation="MLAPRGAAVLLLHLVLQRWLAAGAQATPQVFDLLPSSSQRLNPG ALLPVLTDPALNDLYVISTFKLQTKSSATIFGLYSSTDNSKYFEFTVMGRLSKAILRY LKNDGKVHLVVFNNLQLADGRRHRILLRLSNLQRGAGSLELYLDCIQVDSVHNLPRAF AGPSQKPETIELRTFQRKPQDFLEELKLVVRGSLFQVASLQDCFLQQSEPLAATGTGD FNRQFLGQMTQLNQLLGEVKDLLRQQVKETSFLRNTIAECQACGPLKFQSPTPSTVVA PAPPAPPTRPPRRCDSNPCFRGVQCTDSRDGFQCGPCPEGYTGNGITCIDVDECKYHP CYPGVHCINLSPGFRCDACPVGFTGPMVQGVGISFAKSNKQVCTDIDECRNGACVPNS ICVNTLGSYRCGPCKPGYTGDQIRGCKVERNCRNPELNPCSVNAQCIEERQGDVTCVC GVGWAGDGYICGKDVDIDSYPDEELPCSARNCKKDNCKYVPNSGQEDADRDGIGDACD EDADGDGILNEQDNCVLIHNVDQRNSDKDIFGDACDNCLSVLNNDQKDTDGDGRGDAC DDDMDGDGIKNILDNCPKFPNRDQRDKDGDGVGDACDSCPDVSNPNQSDVDNDLVGDS CDTNQDSDGDGHQDSTDNCPTVINSAQLDTDKDGIGDECDDDDDNDGIPDLVPPGPDN CRLVPNPAQEDSNSDGVGDICESDFDQDQVIDRIDVCPENAEVTLTDFRAYQTVGLDP EGDAQIDPNWVVLNQGMEIVQTMNSDPGLAVGYTAFNGVDFEGTFHVNTQTDDDYAGF IFGYQDSSSFYVVMWKQTEQTYWQATPFRAVAEPGIQLKAVKSKTGPGEHLRNSLWHT GDTSDQVRLLWKDSRNVGWKDKVSYRWFLQHRPQVGYIRVRFYEGSELVADSGVTIDT TMRGGRLGVFCFSQENIIWSNLKYRCNDTIPEDFQEFQTQNFDRFDN" sig_peptide 28..90 mat_peptide 91..2910 /standard_name="thrombospondin-4" /citation=[1] /function="unknown" /product="thrombospondin-4" repeat_unit 889..1008 /rpt_type=TANDEM /rpt_family="thrombospondin type 2" repeat_unit 1009..1167 /rpt_type=TANDEM /rpt_family="thrombospondin type 2" repeat_unit 1168..1290 /rpt_type=TANDEM /rpt_family="thrombospondin type 2" repeat_unit 1291..1416 /rpt_type=TANDEM /rpt_family="thrombospondin type 2" repeat_unit 1480..2175 /function="calcium binding" /rpt_type=TANDEM /rpt_family="thrombospondin type 3" misc_binding 1711..1722 /bound_moiety="integrins" /function="cell binding site" polyA_site 3031..3036 BASE COUNT 796 a 768 c 842 g 668 t ORIGIN 1 gaattccggg gagcaggaag agccaacatg ctggccccgc gcggagccgc cgtcctcctg 61 ctgcacctgg tcctgcagcg gtggctagcg gcaggcgccc aggccacccc ccaggtcttt 121 gaccttctcc catcttccag tcagaggcta aacccaggcg ctctgctgcc agtcctgaca 181 gaccccgccc tgaatgatct ctatgtgatt tccaccttca agctgcagac taaaagttca 241 gccaccatct tcggtcttta ctcttcaact gacaacagta aatattttga atttactgtg 301 atgggacgct taagcaaagc catcctccgt tacctgaaga acgatgggaa ggtgcatttg 361 gtggttttca acaacctgca gctggcagac ggaaggcggc acaggatcct cctgaggctg 421 agcaatttgc agcgaggggc cggctcccta gagctctacc tggactgcat ccaggtggat 481 tccgttcaca atctccccag ggcctttgct ggcccctccc agaaacctga gaccattgaa 541 ttgaggactt tccagaggaa gccacaggac ttcttggaag agctgaagct ggtggtgaga 601 ggctcactgt tccaggtggc cagcctgcaa gactgcttcc tgcagcagag tgagccactg 661 gctgccacag gcacagggga ctttaaccgg cagttcttgg gtcaaatgac acaattaaac 721 caactcctgg gagaggtgaa ggaccttctg agacagcagg ttaaggaaac atcatttttg 781 cgaaacacca tagctgaatg ccaggcttgc ggtcctctca agtttcagtc tccgacccca 841 agcacggtgg tcgccccggc tccccctgca ccgccaacac gcccacctcg tcggtgtgac 901 tccaacccat gtttccgagg tgtccaatgt accgacagta gagatggctt ccagtgtggg 961 ccctgccccg agggctacac aggaaacggg atcacctgta ttgatgttga tgagtgcaaa 1021 taccatccct gctacccggg cgtgcactgc ataaatttgt ctcctggctt cagatgtgac 1081 gcctgcccag tgggcttcac agggcccatg gtgcagggtg ttgggatcag ttttgccaag 1141 tcaaacaagc aggtctgcac tgacattgat gagtgtcgaa atggagcgtg cgttcccaac 1201 tcgatctgcg ttaatacttt gggatcttac cgctgtgggc cttgtaagcc ggggtatact 1261 ggtgatcaga taaggggatg caaagtggaa agaaactgca gaaacccaga gctgaaccct 1321 tgcagtgtga atgcccagtg cattgaagag aggcaggggg atgtgacatg tgtgtgtgga 1381 gtcggttggg ctggagatgg ctatatctgt ggaaaggatg tggacatcga cagttacccc 1441 gacgaagaac tgccatgctc tgccaggaac tgtaaaaagg acaactgcaa atatgtgcca 1501 aattctggcc aagaagatgc agacagagat ggcattggcg acgcttgtga cgaggatgct 1561 gacggagatg ggatcctgaa tgagcaggat aactgtgtcc tgattcataa tgtggaccaa 1621 aggaacagcg ataaagatat ctttggggat gcctgtgata actgcctgag tgtcttaaat 1681 aacgaccaga aagacaccga tggggatgga agaggagatg cctgtgatga tgacatggat 1741 ggagatggaa taaaaaacat tctggacaac tgcccaaaat ttcccaatcg tgaccaacgg 1801 gacaaggatg gtgatggtgt gggggatgcc tgtgacagtt gtcctgatgt cagcaaccct 1861 aaccagtctg atgtggataa tgatctggtt ggggactcct gtgacaccaa tcaggacagt 1921 gatggagatg ggcaccagga cagcacagac aactgcccca ccgtcattaa cagtgcccag 1981 ctggacaccg ataaggatgg aattggtgac gagtgtgatg atgatgatga caatgatggt 2041 atcccagacc tggtgccccc tggaccagac aactgccggc tggtccccaa cccagcccag 2101 gaggatagca acagcgacgg agtgggagac atctgtgagt ctgactttga ccaggaccag 2161 gtcatcgatc ggatcgacgt ctgcccagag aacgcagagg tcaccctgac cgacttcagg 2221 gcttaccaga ccgtgggcct ggatcctgaa ggggatgccc agatcgatcc caactgggtg 2281 gtcctgaacc agggcatgga gattgtacag accatgaaca gtgatcctgg cctggcagtg 2341 gggtacacag cttttaatgg agttgacttc gaagggacct tccatgtgaa tacccagaca 2401 gatgatgact atgcaggctt tatctttggc taccaagata gctccagctt ctacgtggtc 2461 atgtggaagc agacggagca gacatattgg caagccaccc cattccgagc agttgcagaa 2521 cctggcattc agctcaaggc tgtgaagtct aagacaggtc caggggagca tctccggaac 2581 tccctgtggc acacggggga caccagtgac caggtcaggc tgctgtggaa ggactccagg 2641 aatgtgggct ggaaggacaa ggtgtcctac cgctggttcc tacagcacag gccccaggtg 2701 ggctacatca gggtacgatt ttatgaaggc tctgagttgg tggctgactc tggcgtcacc 2761 atagacacca caatgcgtgg aggccgactt ggcgttttct gcttctctca agaaaacatc 2821 atctggtcca acctcaagta tcgctgcaat gacaccatcc ctgaggactt ccaagagttt 2881 caaacccaga atttcgaccg cttcgataat taaaccaagg aagcaatctg taactgcttt 2941 tcggaacact aaaaccatat atattttaac ttcaattttc tttagctttt accaacccaa 3001 atatatcaaa acgttttatg tgaatgtggc aataaaggag aagagatcat ttttaaaaaa 3061 aaaaaaaaaa aaaa // LOCUS HSTHROMR 4434 bp RNA PRI 21-MAR-1995 DEFINITION Human mRNA for thrombospondin. ACCESSION X04665 NID g37137 KEYWORDS glycoprotein; signal peptide; thrombospondin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4434) AUTHORS Lawler,J. and Hynes,R.O. TITLE The structure of human thrombospondin, an adhesive glycoprotein with multiple calcium-binding sites and homologies with several different proteins JOURNAL J. Cell Biol. 103 (5), 1635-1648 (1986) MEDLINE 87057617 COMMENT Three types of repeating amino acid sequence are present in thrombospondin. The first is 57 amino acids long and shows homology with circumsporozoite protein from Plasmodium falciparum. The second is 50-60 amino acids long and shows homology with epidermal growth factor precursor. The third occurs as a continuous eightfold repeat of a 38-residue sequence; structural homology with parvalbumin and calmodulin indicates that these repeats constitute the multiple calcium-binding sites of thrombospondin. Data kindly reviewed (15-SEP-1987) by Lawler J. FEATURES Location/Qualifiers source 1..4434 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="umbilical vein endothelial cells" CDS 76..3588 /note="precursor polypeptide (AA -18 to 1152)" /codon_start=1 /db_xref="PID:g37138" /db_xref="SWISS-PROT:P07996" /translation="MGLAWGLGVLFLMHVCGTNRIPESGGDNSVFDIFELTGAARKGS GRRLVKGPDPSSPAFRIEDANLIPPVPDDKFQDLVDAVRTEKGFLLLASLRQMKKTRG TLLALERKDHSGQVFSVVSNGKAGTLDLSLTVQGKQHVVSVEEALLATGQWKSITLFV QEDRAQLYIDCEKMENAELDVPIQSVFTRDLASIARLRIAKGGVNDNFQGVLQNVRFV FGTTPEDILRNKGCSSSTSVLLTLDNNVVNGSSPAIRTNYIGHKTKDLQAICGISCDE LSSMVLELRGLRTIVTTLQDSIRKVTEENKELANELRRPPLCYHNGVQYRNNEEWTVD SCTECHCQNSVTICKKVSCPIMPCSNATVPDGECCPRCWPSDSADDGWSPWSEWTSCS TSCGNGIQQRGRSCDSLNNRCEGSSVQTRTCHIQECDKRFKQDGGWSHWSPWSSCSVT CGDGVITRIRLCNSPSPQMNGKPCEGEARETKACKKDACPINGGWGPWSPWDICSVTC GGGVQKRSRLCNNPTPQFGGKDCVGDVTENQICNKQDCPIDGCLSNPCFAGVKCTSYP DGSWKCGACPPGYSGNGIQCTDVDECKEVPDACFNHNGEHRCENTDPGYNCLPCPPRF TGSQPFGQGVEHATANKQVCKPRNPCTDGTHDCNKNAKCNYLGHYSDPMYRCECKPGY AGNGIICGEDTDLDGWPNENLVCVANATYHCKKDNCPNLPNSGQEDYDKDGIGDACDD DDDNDKIPDDRDNCPFHYNPAQYDYDRDDVGDRCDNCPYNHNPDQADTDNNGEGDACA ADIDGDGILNERDNCQYVYNVDQRDTDMDGVGDQCDNCPLEHNPDQLDSDSDRIGDTC DNNQDIDEDGHQNNLDNCPYVPNANQADHDKDGKGDACDHDDDNDGIPDDKDNCRLVP NPDQKDSDGDGRGDACKDDFDHDSVPDIDDICPENVDISETDFRRFQMIPLDPKGTSQ NDPNWVVRHQGKELVQTVNCDPGLAVGYDEFNAVDFSGTFFINTERDDDYAGFVFGYQ SSSRFYVVMWKQVTQSYWDTNPTRAQGYSGLSVKVVNSTTGPGEHLRNALWHTGNTPG QVRTLWHDPRHIGWKDFTAYRWRLSHRPKTGFIRVVMYEGKKIMADSGPIYDKTYAGG RLGLFVFSQEMVFFSDLKYECRDP" sig_peptide 76..129 /note="put. signal peptide (AA -18 to -1)" mat_peptide 130..3585 /note="mature peptide (AA 1-1152)" misc_feature 817..825 /note="pot. N-glycosylation site" misc_feature 1153..1161 /note="pot. N-glycosylation site" misc_feature 1636..1644 /note="pot. N-glycosylation site" misc_feature 2197..2205 /note="pot. N-glycosylation site" misc_feature 3226..3234 /note="pot. N-glycosylation site" misc_feature 3274..3282 /note="pot. N-glycosylation site" BASE COUNT 1139 a 1185 c 1177 g 933 t ORIGIN 1 gccgccctcg ccaccgctcc cggccgccgc gctccggtac acacaggatc cctgctgggc 61 accaacagct ccaccatggg gctggcctgg ggactaggcg tcctgttcct gatgcatgtg 121 tgtggcacca accgcattcc agagtctggc ggagacaaca gcgtgtttga catctttgaa 181 ctcaccgggg ccgcccgcaa ggggtctggg cgccgactgg tgaagggccc cgacccttcc 241 agcccagctt tccgcatcga ggatgccaac ctgatccccc ctgtgcctga tgacaagttc 301 caagacctgg tggatgctgt gcggacagaa aagggtttcc tccttctggc atccctgagg 361 cagatgaaga agacccgggg cacgctgctg gccctggagc ggaaagacca ctctggccag 421 gtcttcagcg tggtgtccaa tggcaaggcg ggcaccctgg acctcagcct gaccgtccaa 481 ggaaagcagc acgtggtgtc tgtggaagaa gctctcctgg caaccggcca gtggaagagc 541 atcaccctgt ttgtgcagga agacagggcc cagctgtaca tcgactgtga aaagatggag 601 aatgctgagt tggacgtccc catccaaagc gtcttcacca gagacctggc cagcatcgcc 661 agactccgca tcgcaaaggg gggcgtcaat gacaatttcc agggggtgct gcagaatgtg 721 aggtttgtct ttggaaccac accagaagac atcctcagga acaaaggctg ctccagctct 781 accagtgtcc tcctcaccct tgacaacaac gtggtgaatg gttccagccc tgccatccgc 841 actaactaca ttggccacaa gacaaaggac ttgcaagcca tctgcggcat ctcctgtgat 901 gagctgtcca gcatggtcct ggaactcagg ggcctgcgca ccattgtgac cacgctgcag 961 gacagcatcc gcaaagtgac tgaagagaac aaagagttgg ccaatgagct gaggcggcct 1021 cccctatgct atcacaacgg agttcagtac agaaataacg aggaatggac tgttgatagc 1081 tgcactgagt gtcactgtca gaactcagtt accatctgca aaaaggtgtc ctgccccatc 1141 atgccctgct ccaatgccac agttcctgat ggagaatgct gtcctcgctg ttggcccagc 1201 gactctgcgg acgatggctg gtctccatgg tccgagtgga cctcctgttc tacgagctgt 1261 ggcaatggaa ttcagcagcg cggccgctcc tgcgatagcc tcaacaaccg atgtgagggc 1321 tcctcggtcc agacacggac ctgccacatt caggagtgtg acaagagatt taaacaggat 1381 ggtggctgga gccactggtc cccgtggtca tcttgttctg tgacatgtgg tgatggtgtg 1441 atcacaagga tccggctctg caactctccc agcccccaga tgaacgggaa accctgtgaa 1501 ggcgaagcgc gggagaccaa agcctgcaag aaagacgcct gccccatcaa tggaggctgg 1561 ggtccttggt caccatggga catctgttct gtcacctgtg gaggaggggt acagaaacgt 1621 agtcgtctct gcaacaaccc cacaccccag tttggaggca aggactgcgt tggtgatgta 1681 acagaaaacc agatctgcaa caagcaggac tgtccaattg atggatgcct gtccaatccc 1741 tgctttgccg gcgtgaagtg tactagctac cctgatggca gctggaaatg tggtgcttgt 1801 ccccctggtt acagtggaaa tggcatccag tgcacagatg ttgatgagtg caaagaagtg 1861 cctgatgcct gcttcaacca caatggagag caccggtgtg agaacacgga ccccggctac 1921 aactgcctgc cctgcccccc acgcttcacc ggctcacagc ccttcggcca gggtgtcgaa 1981 catgccacgg ccaacaaaca ggtgtgcaag ccccgtaacc cctgcacgga tgggacccac 2041 gactgcaaca agaacgccaa gtgcaactac ctgggccact atagcgaccc catgtaccgc 2101 tgcgagtgca agcctggcta cgctggcaat ggcatcatct gcggggagga cacagacctg 2161 gatggctggc ccaatgagaa cctggtgtgc gtggccaatg cgacttacca ctgcaaaaag 2221 gataattgcc ccaaccttcc caactcaggg caggaagact atgacaagga tggaattggt 2281 gatgcctgtg atgatgacga tgacaatgat aaaattccag atgacaggga caactgtcca 2341 ttccattaca acccagctca gtatgactat gacagagatg atgtgggaga ccgctgtgac 2401 aactgtccct acaaccacaa cccagatcag gcagacacag acaacaatgg ggaaggagac 2461 gcctgtgctg cagacattga tggagacggt atcctcaatg aacgggacaa ctgccagtac 2521 gtctacaatg tggaccagag agacactgat atggatgggg ttggagatca gtgtgacaat 2581 tgccccttgg aacacaatcc ggatcagctg gactctgact cagaccgcat tggagatacc 2641 tgtgacaaca atcaggatat tgatgaagat ggccaccaga acaatctgga caactgtccc 2701 tatgtgccca atgccaacca ggctgaccat gacaaagatg gcaagggaga tgcctgtgac 2761 cacgatgatg acaacgatgg cattcctgat gacaaggaca actgcagact cgtgcccaat 2821 cccgaccaga aggactctga cggcgatggt cgaggtgatg cctgcaaaga tgattttgac 2881 catgacagtg tgccagacat cgatgacatc tgtcctgaga atgttgacat cagtgagacc 2941 gatttccgcc gattccagat gattcctctg gaccccaaag ggacatccca aaatgaccct 3001 aactgggttg tacgccatca gggtaaagaa ctcgtccaga ctgtcaactg tgatcctgga 3061 ctcgctgtag gttatgatga gtttaatgct gtggacttca gtggcacctt cttcatcaac 3121 accgaaaggg acgatgacta tgctggattt gtctttggct accagtccag cagccgcttt 3181 tatgttgtga tgtggaagca agtcacccag tcctactggg acaccaaccc cacgagggct 3241 cagggatact cgggcctttc tgtgaaagtt gtaaactcca ccacagggcc tggcgagcac 3301 ctgcggaacg ccctgtggca cacaggaaac acccctggcc aggtgcgcac cctgtggcat 3361 gaccctcgtc acataggctg gaaagatttc accgcctaca gatggcgtct cagccacagg 3421 ccaaagacgg gtttcattag agtggtgatg tatgaaggga agaaaatcat ggctgactca 3481 ggacccatct atgataaaac ctatgctggt ggtagactag ggttgtttgt cttctctcaa 3541 gaaatggtgt tcttctctga cctgaaatac gaatgtagag atccctaatc atcaaattgt 3601 tgattgaaag actgatcata aaccaatgct ggtattgcac cttctggaac tatgggcttg 3661 agaaaacccc caggatcact tctccttggc ttccttcttt tctgtgcttg catcagtgtg 3721 gactcctaga acgtgcgacc tgcctcaaga aaatgcagtt ttcaaaaaca gactcagcat 3781 tcagcctcca atgaataaga catcttccaa gcatataaac aattgctttg gtttcctttt 3841 gaaaaagcat ctacttgctt cagttgggaa ggtgcccatt ccactctgcc tttgtcacag 3901 agcagggtgc tattgtgagg ccatctctga gcagtggact caaaagcatt ttcaggcatg 3961 tcagagaagg gaggactcac tagaattagc aaacaaaacc accctgacat cctccttcag 4021 gaacacgggg agcagaggcc aaagcactaa ggggagggcg catacccgag acgattgtat 4081 gaagaaaata tggaggaact gttacatgtt cggtactaag tcattttcag gggattgaaa 4141 gactattgct ggatttcatg atgctgactg gcgttagctg attaacccat gtaaataggc 4201 acttaaatag aagcaggaaa gggagacaaa gactggcttc tggacttcct ccctgatccc 4261 cacccttact catcacctgc agtggccaga attagggaat cagaatcgaa accagtgtaa 4321 ggcagtgctg gctgccattg cctggtcaca ttgaaattgg tggcttcatt ctagatgtag 4381 cttgtgcaga tgtagcagga aaataggaaa acctaccatc tcagtgagca ccag // LOCUS HSTHSPAIA 937 bp RNA PRI 14-MAR-1996 DEFINITION H.sapiens thiol-specific antioxidant protein mRNA. ACCESSION Z22548 L14286 NID g438068 KEYWORDS thiol-specific antioxidant protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 937) AUTHORS Lim,Y.S., Cha,M.K., Kim,H.K. and Kim,I.H. TITLE The thiol-specific antioxidant protein from human brain: gene cloning and analysis of conserved cysteine regions JOURNAL Gene 140 (2), 279-284 (1994) MEDLINE 94193012 REFERENCE 2 (bases 1 to 937) AUTHORS Kim,I. TITLE Direct Submission JOURNAL Submitted (14-APR-1993) Kim I., Pai-Chai University, 439-6 Doma-Dong Seo-Gu, Taejon, Republic of Korea REMARK sequence revised by author 28-SEP-93 FEATURES Location/Qualifiers source 1..937 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 90..686 /codon_start=1 /product="thiol-specific antioxidant protein" /db_xref="PID:g438069" /db_xref="SWISS-PROT:P35701" /translation="MASGNARIGKPAPDFKATAVVDGAFKEVKLSDYKGKYVVLFFYP LDFTFVCPTEIIAFTTVKRTSAKLGCEVLGVSVDSQFTHLAWINTPRKEGGLGPLNIP LLADVTRRLSEDYGVLKNDEGIAYRGLFIIDGKGVLRQITVNDLPVGRSVDEALRLVQ AFQYTDEHGEVCPAAWKPGRDTIKPNVDDSKEYFSKHN" BASE COUNT 195 a 276 c 267 g 199 t ORIGIN 1 cgcggcccca gggctcactt ggcgctgaga acgcgggtgc agcgtgtgat cgtccgtgcg 61 tctagccttt gcccacgcag ctttcagtca tggcctccgg taacgcgcgc atcggaaagc 121 cagcccctga cttcaaggcc acagcggtgg ttgatggcgc cttcaaagag gtgaagctgt 181 cggactacaa agggaagtac gtggtcctct ttttctaccc tctggacttc acttttgtgt 241 gccccaccga gatcatcgcg ttcacaaccg tgaagaggac ttccgcaaag ctgggctgtg 301 aagtgctggg cgtctcggtg gactctcagt tcacccacct ggcttggatc aacacccccc 361 ggaaagaggg aggcttgggc cccttgaaca tccccctgct tgctgacgtg accagacgct 421 tgtctgagga ttacggcgtg ctgaaaaacg atgagggcat tgcttacagg ggcctcttta 481 tcatcgatgg caagggtgtc cttcgccaga tcactgttaa tgatttgcct gtgggacgct 541 ccgtggatga ggctctgcgg ctggtccagg ccttccagta cacagacgag catggggaag 601 tttgtccggc tgcttggaag cctggacgtg acacgattaa gccgaacgtg gatgacagca 661 aggaatattt ctccaaacac aattaggctg gctaacggat agtgagcttg tgcccctgcc 721 taggtgcctg tgctgggtgt ccacctgtgc ccccacctgg gtgccctatg ctgacccagg 781 aaaggccaga cctgcccctc caaaatccac agtatgggac cctggagggc tagcaaggcc 841 ttctcatgcc tccacctaga agctgaatag tgacgccctc ccccaagccc acccagccgc 901 acacaggcct agaggtaacc aataaagtat tagggcc // LOCUS HSTHYRR 8448 bp RNA PRI 17-FEB-1997 DEFINITION Human mRNA for thyroglobulin. ACCESSION X05615 NID g37173 KEYWORDS thyroglobulin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8446) AUTHORS Malthiery,Y. and Lissitzky,S. TITLE Primary structure of human thyroglobulin deduced from the sequence of its 8448-base complementary DNA JOURNAL Eur. J. Biochem. 165 (3), 491-498 (1987) MEDLINE 87246630 REFERENCE 2 (bases 1 to 8448) AUTHORS Malthiery,Y. TITLE Direct Submission JOURNAL Submitted (07-APR-1988) to the EMBL/GenBank/DDBJ databases REFERENCE 3 (bases 1 to 8448) AUTHORS Henry,M., Zanelli,E., Piechaczyk,M., Pau,B. and Malthiery,Y. TITLE A major human thyroglobulin epitope defined with monoclonal antibodies is mainly recognized by human autoantibodies JOURNAL Eur. J. Immunol. 22 (2), 315-319 (1992) MEDLINE 92164705 COMMENT patient); Data kindly reviewed (07-APR-1988) by Malthiery Y. FEATURES Location/Qualifiers source 1..8448 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="thyroid (obtained surgically from a Graves'disease" /clone="M1-M5 and B1-B4" CDS 42..8345 /codon_start=1 /product="thyroglobulin" /db_xref="PID:g37174" /db_xref="SWISS-PROT:P01266" /translation="MALVLEIFTLLASICWVSANIFEYQVDAQPLRPCELQRETAFLK QADYVPQCAEDGSFQTVQCQNDGRSCWCVGANGSEVLGSRQPGRPVACLSFCQLQKQQ ILLSGYINSTDTSYLPQCQDSGDYAPVQCDVQHVQCWCVDAEGMEVYGTRQLGRPKRC PRSCEIRNRRLLHGVGDKSPPQCSAEGEFMPVQCKFVNTTDMMIFDLVHSYNRFPDAF VTFSSFQRRFPEVSGYCHCADSQGRELAETGLELLLDEIYDTIFAGLDLPSTFTETTL YRILQRRFLAVQSVISGRFRCPTKCEVERFTATSFGHPYVPSCRRNGDYQAVQCQTEG PCWCVDAQGKEMHGTRQQGEPPSCAEGQSCASERQQALSRLYFGTSGYFSQHDLFSSP EKRWASPRVARFATSCPPTIKELFVDSGLLRPMVEGQSQQFSVSENLLKEAIRAIFPS RGLARLALQFTTNPKRLQQNLFGGKFLVNVGQFNLSGALGTRGTFNFSQFFQQLGLAS FLNGGRQEDLAKPLSVGLDSNSSTGTPEAAKKDGTMNKPTVGSFGFEINLQENQNALK FLASLLELPEFLLFLQHAISVPEDVARDLGDVMETVLDSQTCEQTPERLFVPSCTTEG SYEDVQCFSGECWCVNSWGKELPGSRVRDGQPRCPTDCEKQRARMQSLMGSQPAGSTL FVPACTSEGHFLPVQCFNSECYCVDAEGQAIPGTRSAIGKPKKCPTPCQLQSEQAFLR TVQALLSNSSMLPTLSDTYIPQCSTDGQWRQVQCNGPPEQVFELYQRWEAQNKGQDLT PAKLLVKIMSYREAASGNFSLFIQSLYEAGQQDVFPVLSQYPSLQDVPLAALEGKRPQ PRENILLEPYLFWQILNGQLSQYPGSYSDFSTPLAHFDLRNCWCVDEAGQELEGMRSE PSKLPTCPGSCEEAKLRVLQFIRETEEIVSASNSSRFPLGESFLVAKGIRLRNEDLGL PPLFPPREAFAEFLRGSDYAIRLAAQSTLSFYQRRRFSPDDSAGASALLRSGPYMPQC DAFGSWEPVQCHAGTGHCWCVDEKGGFIPGSLTARSLQIPQCPTTCEKSRTSGLLSSW KQARSQENPSPKDLFVPACLETGEYARLQASGAGTWCVDPASGEELRPGSSSSAQCPS LCNVLKSGVLSRRVSPGYVPACRAEDGGFSPVQCDQAQGSCWCVMDSGEEVPGTRVTG GQPACESPRCPLPFNASEVVGGTILCETISGPTGSAMQQCQLLCRQGSWSVFPPGPLI CSLESGRWESQLPQPRACQRPQLWQTIQTQGHFQLQLPPGKMCSADYAGLLQTFQVFI LDELTARGFCQIQVKTFGTLVSIPVCNNSSVQVGCLTRERLGVNVTWKSRLEDIPVAS LPDLHDIERALVGKDLLGRFTDLIQSGSFQLHLDSKTFPAETIRFLQGDHFGTSPRTR FGCSEGFYQVLTSEASQDGLGCVKCHEGSYSQDEECIPCPVGFYQEQAGSLACVPCPV GRTTISAGAFSQTHCVTDCQRNEAGLQCDQNGQYRASQKDRGSGKAFCVDGEGRRLPW WETEAPLEDSQCLMMQKFEKVPESKVIFDANAPVAVRSKVPDSEFPVMQCLTDCTEDE ACSFFTVSTTEPEISCDFYAWTSDNVACMTSDQKRDALGNSKATSFGSLRCQVKVRSH GQDSPAVYLKKGQGSTTTLQKRFEPTGFQNMLSGLYNPIVFSASGANLTDAHLFCLLA CDRDLCCDGFVLTQVQGGAIICGLLSSPSVLLCNVKDWMDPSEAWANATCPGVTYDQE SHQVILRLGDQEFIKSLTPLEGTQDTFTNFQQVYLWKDSDMGSRPESMGCRKNTVPRP ASPTEAGLTTELFSPVDLNQVIVNGNQSLSSQKHWLFKHLFSAQQANLWCLSRCVQEH SFCQLAEITESASLYFTCTLYPEAQVCDDIMESNTQGCRLILPQMPKALFRKKVILED KVKNFYTRLPFQKLMGISIRNKVPMSEKSISNGFFECERRCDADPCCTGFGFLNVSQL KGGEVTCLTLNSLGIQMCSEENGGAWRILDCGSPDIEVHTYPFGWYQKPIAQNNAPSF CPLVVLPSLTEKVSLESWQSLALSSVVVDPSIRHFDVAHVSTAATSNFSAVRDLCLSE CSQHEACLITTLQTQLGAVRCMFYADTQSCTHSLQGRNCRLLLREEATHIYRKPGISL LSYEASVPSVPISTHGRLLGRSQAIQVGTSWKQVDQFLGVPYAAPPLAERHFQAPEPL NWTGSWDASKPRASCWQPGTRTSTSPGVSEDCLYLNVFIPQNVAPNASVLVFFHNTMD REESEGWPAIDGSFLAAVGNLIVVTASYRVGVFGFLSSGSGEVSGNWGLLDQVAALTW VQTHIRGFGGDPRRVSLAADRGGADVASIHLLTARATNSQLFRRAVLMGGSALSPAAV ISHERAQQQAIALAKEVSCPMSSSQEVVSCLRQKPANVLNDAQTKLLAVSGPFHYWGP VIDGHFLREPPARALKRSLWVEVDLLIGSSQDDGLINRAKAVKQFEESRGRTSSKTAF YQALQNSLGGEDSDARVEAAATWYYSLEHSTDDYASFSRALENATRDYFIICPIIDMA SAWAKRARGNVFMYHAPENYGHGSLELLADVQFALGLPFYPAYEGQFSLEEKSLSLKI MQYFSHFIRSGNPNYPYEFSRKVPTFATPWPDFVPRAGGENYKEFSELLPNRQGLKKA DCSFWSKYISSLKTSADGAKGGQSAESEEEELTAGSGLREDLLSLQEPGSKTYSK" sig_peptide 42..98 mat_peptide 99..8342 /product="thyroglobulin" old_sequence 3165 /citation=[1] /replace="t" old_sequence 3179 /citation=[1] /replace="t" old_sequence 3214..3215 /citation=[1] /replace="ct" polyA_signal 8427..8432 old_sequence 8433..8434 /citation=[1] /replace="t" old_sequence 8447..8448 /citation=[1] /replace="t" BASE COUNT 1898 a 2297 c 2301 g 1952 t ORIGIN 1 gcagtggttt ctcctccttc ctcccaggaa gggccaggaa aatggccctg gtcctggaga 61 tcttcaccct gctggcctcc atctgctggg tgtcggccaa tatcttcgag taccaggttg 121 atgcccagcc ccttcgtccc tgtgagctgc agagggaaac ggcctttctg aagcaagcag 181 actacgtgcc ccagtgtgca gaggatggca gcttccagac tgtccagtgc cagaacgacg 241 gccgctcctg ctggtgtgtg ggtgccaacg gcagtgaagt gctgggcagc aggcagccag 301 gacggcctgt ggcttgtctg tcattttgtc agctacagaa acagcagatc ttactgagtg 361 gctacattaa cagcacagac acctcctacc tccctcagtg tcaggattca ggggactacg 421 cgcctgttca gtgtgatgtg cagcatgtcc agtgctggtg tgtggacgca gaggggatgg 481 aggtgtatgg gacccgccag ctggggaggc caaagcgatg tccaaggagc tgtgaaataa 541 gaaatcgtcg tcttctccac ggggtgggag ataagtcacc accccagtgt tctgcggagg 601 gagagtttat gcctgtccag tgcaaatttg tcaacaccac agacatgatg atttttgatc 661 tggtccacag ctacaacagg tttccagatg catttgtgac cttcagttcc ttccagagga 721 ggttccctga ggtatctggg tattgccact gtgctgacag ccaagggcgg gaactggctg 781 agacaggttt ggagttgtta ctggatgaaa tttatgacac catttttgct ggcctggacc 841 ttccttccac cttcactgaa accaccctgt accggatact gcagagacgg ttcctcgcag 901 ttcaatcagt catctctggc agattccgat gccccacaaa atgtgaagtg gagcggttta 961 cagcaaccag ctttggtcac ccctatgttc caagctgccg ccgaaatggc gactatcagg 1021 cggtgcagtg ccagacggaa gggccctgct ggtgtgtgga cgcccagggg aaggaaatgc 1081 atggaacccg gcagcaaggg gagccgccat cttgtgctga aggccaatct tgtgcctccg 1141 aaaggcagca ggccttgtcc agactctact ttgggacctc aggctacttc agccagcacg 1201 acctgttctc ttccccagag aaaagatggg cctctccaag agtagccaga tttgccacat 1261 cctgcccacc cacgatcaag gagctctttg tggactctgg gcttctccgc ccaatggtgg 1321 agggacagag ccaacagttt tctgtctcag aaaatcttct caaagaagcc atccgagcaa 1381 tttttccctc ccgagggctg gctcgtcttg cccttcagtt taccaccaac ccaaagagac 1441 tccagcaaaa cctttttgga gggaaatttt tggtgaatgt tggccagttt aacttgtctg 1501 gagcccttgg cacaagaggc acatttaact tcagtcaatt tttccagcaa cttggtcttg 1561 caagcttctt gaatggaggg agacaagaag atttggccaa gccactctct gtgggattag 1621 attcaaattc ttccacagga acccctgaag ctgctaagaa ggatggtact atgaataagc 1681 caactgtggg cagctttggc tttgaaatta acctacaaga gaaccaaaat gccctcaaat 1741 tccttgcttc tctcctggag cttccagaat tccttctctt cttgcaacat gctatctctg 1801 tgccagaaga tgtggcaaga gatttaggtg atgtgatgga aacggtactc gactcccaga 1861 cctgtgagca gacacctgaa aggctatttg tcccatcatg cacgacagaa ggaagctatg 1921 aggatgtcca atgcttttcc ggagagtgct ggtgtgtgaa ttcctggggc aaagagcttc 1981 caggctcaag agtcagagat ggacagccaa ggtgccccac agactgtgaa aagcaaaggg 2041 ctcgcatgca aagcctcatg ggcagccagc ctgctggctc caccttgttt gtccctgctt 2101 gtactagtga gggacatttc ctgcctgtcc agtgcttcaa ctcagagtgc tactgtgttg 2161 atgctgaggg tcaggccatt cctggaactc gaagtgcaat agggaagccc aagaaatgcc 2221 ccacgccctg tcaattacag tctgagcaag ctttcctcag gacggtgcag gccctgctct 2281 ctaactccag catgctaccc accctttccg acacctacat cccacagtgc agcaccgatg 2341 ggcagtggag acaagtgcaa tgcaatgggc ctcctgagca ggtcttcgag ttgtaccaac 2401 gatgggaggc tcagaacaag ggccaggatc tgacgcctgc caagctgcta gtgaagatca 2461 tgagctacag agaagcagct tccggaaact tcagtctctt tattcaaagt ctgtatgagg 2521 ctggccagca agatgtcttc ccggtgctgt cacaataccc ttctctgcaa gatgtcccac 2581 tagcagcact ggaagggaaa cggccccagc ccagggagaa tatcctcctg gagccctacc 2641 tcttctggca gatcttaaat ggccaactca gccaataccc ggggtcctac tcagacttca 2701 gcactccttt ggcacatttt gatcttcgga actgctggtg tgtggatgag gctggccaag 2761 aactggaagg aatgcggtct gagccaagca agctcccaac gtgtcctggc tcctgtgagg 2821 aagcaaagct ccgtgtactg cagttcatta gggaaacgga agagattgtt tcagcttcca 2881 acagttctcg gttccctctg ggggagagtt tcctggtggc caagggaatc cggctgagga 2941 atgaggacct cggccttcct ccgctcttcc cgccccggga ggctttcgcg gagtttctgc 3001 gtgggagtga ttacgccatt cgcctggcgg ctcagtctac cttaagcttc tatcagagac 3061 gccgcttttc cccggacgac tcggctggag catccgccct tctgcggtcg ggcccctaca 3121 tgccacagtg tgatgcgttt ggaagttggg agcctgtgca gtgccacgct gggactgggc 3181 actgctggtg tgtagatgag aaaggagggt tcatccctgg ctcactgact gcccgctctc 3241 tgcagattcc acagtgcccg acaacctgcg agaaatctcg aaccagtggg ctgctttcca 3301 gttggaaaca ggctagatcc caagaaaacc catctccaaa agacctgttc gtcccagcct 3361 gcctagaaac aggagaatat gccaggctgc aggcatcggg ggctggcacc tggtgtgtgg 3421 accctgcatc aggagaagag ttgcggcctg gctcgagcag cagtgcccag tgcccaagcc 3481 tctgcaatgt gctcaagagt ggagtcctct ctaggagagt cagcccaggc tatgtcccag 3541 cctgcagggc agaggatggg ggcttttccc cagtgcaatg tgaccaggcc cagggcagct 3601 gctggtgtgt catggacagc ggagaagagg tgcctgggac gcgcgtgacc gggggccagc 3661 ccgcctgtga gagcccgcgg tgtccgctgc cattcaacgc gtcggaggtg gttggtggaa 3721 caatcctgtg tgagacaatc tcgggcccca caggctctgc catgcagcag tgccaattgc 3781 tgtgccgcca aggctcctgg agcgtgtttc caccagggcc attgatatgt agcctggaga 3841 gcggacgctg ggagtcacag ctgcctcagc cccgggcctg ccaacggccc cagctgtggc 3901 agaccatcca gacccaaggg cactttcagc tccagctccc gccgggcaag atgtgcagtg 3961 ctgactacgc gggtttgctg cagactttcc aggttttcat attggatgag ctgacagccc 4021 gcggcttctg ccagatccag gtgaagactt ttggcaccct ggtttccatt cctgtctgca 4081 acaactcctc tgtgcaggtg ggttgtctga ccagggagcg tttaggagtg aatgttacat 4141 ggaaatcacg gcttgaggac atcccagtgg cttctcttcc tgacttacat gacattgaga 4201 gagccttggt gggcaaggat ctccttgggc gcttcacaga tctgatccag agtggctcat 4261 tccagcttca tctggactcc aagacgttcc cagcggaaac catccgcttc ctccaagggg 4321 accactttgg cacctctcct aggacacggt ttgggtgctc ggaaggattc taccaagtct 4381 tgacaagtga ggccagtcag gacggactgg gatgcgttaa gtgccatgaa ggaagctatt 4441 cccaagatga ggaatgcatt ccttgtcctg ttggattcta ccaagaacag gcagggagct 4501 tggcctgtgt cccatgtcct gtgggcagaa cgaccatttc tgccggagct ttcagccaga 4561 ctcactgtgt cactgactgt cagaggaacg aagcaggcct gcaatgtgac cagaatggcc 4621 agtatcgagc cagccagaag gacaggggca gtgggaaggc cttctgtgtg gacggcgagg 4681 ggcggaggct gccatggtgg gaaacagagg cccctcttga ggactcacag tgtttgatga 4741 tgcagaagtt tgagaaggtt ccagaatcaa aggtgatctt cgacgccaat gctcctgtgg 4801 ctgtcagatc caaagttcct gattctgagt tccccgtgat gcagtgcttg acagattgca 4861 cagaggacga ggcctgcagc ttcttcaccg tgtccacgac ggagccagag atttcctgtg 4921 atttctatgc ttggacaagt gacaatgttg cctgcatgac ttctgaccag aaacgagatg 4981 cactggggaa ctcaaaggcc accagctttg gaagtcttcg ctgccaggtg aaagtgagga 5041 gccatggtca agattctcca gctgtgtatt tgaaaaaggg ccaaggatcc accacaacac 5101 ttcagaaacg ctttgaaccc actggtttcc aaaacatgct ttctggattg tacaacccca 5161 ttgtgttctc agcctcagga gccaatctaa ccgatgctca cctcttctgt cttcttgcat 5221 gcgaccgtga tctgtgttgc gatggcttcg tcctcacaca ggttcaagga ggtgccatca 5281 tctgtgggtt gctgagctca cccagtgtcc tgctttgtaa tgtcaaagac tggatggatc 5341 cctctgaagc ctgggctaat gctacatgtc ctggtgtgac atatgaccag gagagccacc 5401 aggtgatatt gcgtcttgga gaccaggagt tcatcaagag tctgacaccc ttagaaggaa 5461 ctcaagacac ctttaccaat tttcagcagg tttatctctg gaaagattct gacatggggt 5521 ctcggcctga gtctatggga tgtagaaaaa acacagtgcc aaggccagca tctccaacag 5581 aagcaggttt gacaacagaa cttttctccc ctgtggacct caaccaggtc attgtcaatg 5641 gaaatcaatc actatccagc cagaagcact ggcttttcaa gcacctgttt tcagcccagc 5701 aggcaaacct atggtgcctt tctcgttgtg tgcaggagca ctctttctgt cagctcgcag 5761 agataacaga gagtgcatcc ttgtacttca cctgcaccct ctacccagag gcacaggtgt 5821 gtgatgacat catggagtcc aatacccagg gctgcagact gatcctgcct cagatgccaa 5881 aggccctgtt ccggaagaaa gttatactgg aagataaagt gaagaacttt tacactcgcc 5941 tgccgttcca aaaactgatg gggatatcca ttagaaataa agtgcccatg tctgaaaaat 6001 ctatttctaa tgggttcttt gaatgtgaac gacggtgcga tgcggaccca tgctgcactg 6061 gctttggatt tctaaatgtt tcccagttaa aaggaggaga ggtgacatgt ctcactctga 6121 acagcttggg aattcagatg tgcagtgagg agaatggagg agcctggcgc attttggact 6181 gtggctctcc tgacattgaa gtccacacct atcccttcgg atggtaccag aagcccattg 6241 ctcaaaataa tgctcccagt ttttgccctt tggttgttct gccttccctc acagagaaag 6301 tgtctctgga atcgtggcag tccctggccc tctcttcagt ggttgttgat ccatccatta 6361 ggcactttga tgttgcccat gtcagcactg ctgccaccag caatttctct gctgtccgag 6421 acctctgttt gtcggaatgt tcccaacatg aggcctgtct catcaccact ctgcaaaccc 6481 aactcggggc tgtgagatgt atgttctatg ctgatactca aagctgcaca catagtctgc 6541 agggtcggaa ctgccgactt ctgcttcgtg aagaggccac ccacatctac cggaagccag 6601 gaatctctct gctcagctat gaggcatctg taccttctgt gcccatttcc acccatggcc 6661 ggctgctggg caggtcccag gccatccagg tgggtacctc atggaagcaa gtggaccagt 6721 tccttggagt tccatatgct gccccgcccc tggcagagag gcacttccag gcaccagagc 6781 ccttgaactg gacaggctcc tgggatgcca gcaagccaag ggccagctgc tggcagccag 6841 gcaccagaac atccacgtct cctggagtca gtgaagattg tttgtatctc aatgtgttca 6901 tccctcagaa tgtggcccct aacgcgtctg tgctggtgtt cttccacaac accatggaca 6961 gggaggagag tgaaggatgg ccggctatcg acggctcctt cttggctgct gttggcaacc 7021 tcatcgtggt cactgccagc taccgagtgg gtgtcttcgg cttcctgagt tctggatccg 7081 gagaggtgag tggcaactgg gggctgctgg accaggtggc ggctctgacc tgggtgcaga 7141 cccacatccg aggatttggc ggggaccctc ggcgcgtgtc cctggcagca gaccgtggcg 7201 gggctgatgt ggccagcatc caccttctca cggccagggc caccaactcc caacttttcc 7261 ggagagctgt gctgatggga ggctccgcac tctccccggc cgccgtcatc agccatgaga 7321 gggctcagca gcaggcaatt gctttggcaa aggaggtcag ttgccccatg tcatccagcc 7381 aagaagtggt gtcctgcctc cgccagaagc ctgccaatgt cctcaatgat gcccagacca 7441 agctcctggc cgtgagtggc cctttccact actggggtcc tgtgatcgat ggccacttcc 7501 tccgtgagcc tccagccaga gcactgaaga ggtctttatg ggtagaggtc gatctgctca 7561 ttgggagttc tcaggacgac gggctcatca acagagcaaa ggctgtgaag caatttgagg 7621 aaagtcgagg ccggaccagt agcaaaacag ccttttacca ggcactgcag aattctctgg 7681 gtggcgagga ctcagatgcc cgcgtcgagg ctgctgctac atggtattac tctctggagc 7741 actccacgga tgactatgcc tccttctccc gggctctgga gaatgccacc cgggactact 7801 ttatcatctg ccctataatc gacatggcca gtgcctgggc aaagagggcc cgaggaaacg 7861 tcttcatgta ccatgctcct gaaaactacg gccatggcag cctggagctg ctggcggatg 7921 ttcagtttgc cttggggctt cccttctacc cagcctacga ggggcagttt tctctggagg 7981 agaagagcct gtcgctgaaa atcatgcagt acttttccca cttcatcaga tcaggaaatc 8041 ccaactaccc ttatgagttc tcacggaaag tacccacatt tgcaaccccc tggcctgact 8101 ttgtaccccg tgctggtgga gagaactaca aggagttcag tgagctgctc cccaatcgac 8161 agggcctgaa gaaagccgac tgctccttct ggtccaagta catctcgtct ctgaagacat 8221 ctgcagatgg agccaagggc gggcagtcag cagagagtga agaggaggag ttgacggctg 8281 gatctgggct aagagaagat ctcctaagcc tccaggaacc aggctctaag acctacagca 8341 agtgaccagc ccttgagctc cccaaaaacc tcacccgagg ctgcccacta tggtcatctt 8401 tttctctaaa atagttactt accttcaata aagtatctac atgcggtg // LOCUS HSTHYSYN 1817 bp RNA PRI 28-APR-1994 DEFINITION H.sapiens rTS alpha mRNA containing four open reading frames. ACCESSION X67098 NID g475908 KEYWORDS antisense RNA. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1817) AUTHORS Dolnick,B.J. TITLE Direct Submission JOURNAL Submitted (26-JUN-1992) B.J. Dolnick, Roswell Park Cancer Institute, Grace Cancer Drug Center, Elm & Carlton Sts, Buffalo NY 142 63, USA REMARK revised by [3] MAT REFERENCE 2 (bases 1 to 1812) AUTHORS Dolnick,B.J. TITLE Cloning and characterization of a naturally occurring antisense RNA to human thymidylate synthase mRNA JOURNAL Nucleic Acids Res. 21 (8), 1747-1752 (1993) MEDLINE 93261804 REFERENCE 3 (bases 1 to 1817) AUTHORS Dolnick,B.J. TITLE Direct Submission JOURNAL Submitted (27-APR-1994) B.J. Dolnick, Roswell Park Cancer Institute, Grace Cancer Drug Center, Elm & Carlton Sts, Buffalo NY 142 63, USA COMMENT Sequence data conflicts at 1353, 1678, 1647, 1646, 1694 and 1643 (all complement) with X02308 (Nucleic Acids Research, 13, 2035-2043, 1985). FEATURES Location/Qualifiers source 1..1817 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="carcinoma" /cell_line="KB cell line" exon 1..128 /number=1 /evidence=experimental CDS 18..290 /note="ORF1" /codon_start=1 /db_xref="PID:g312070" /translation="MHTDPDYSAAYVVIETDAEDGIKGCGITFTLGKGTEVDWSRKGR GAPGDSGRPKRGVGLVGQAGGKACLEVTCGHGSQDAGILHRFQVHH" CDS 67..1152 /note="ORF1A" /codon_start=1 /db_xref="PID:g475909" /translation="MQKMESRGVELPSLWEKALKLIGPEKGVVHLATAAVLNAVWDLW AKQEGKPVWKLLVDMDPRMLVSCIDFRYITDVLTEEDALEILQKGQIGKKEREKQMLA QGYPAYTTSCAWLGYSDDTLKQLCAQALKDGWTRFKVKVGADLQDDMRRCQIIRDMIG PEKTLMMDANQRWDVPEAVEWMSKLAKFKPLWIEEPTSPDDILGHATISKALVPLGIG IATGEQCHNRVIFKQLLQAKALQFLQIDSCRLGSVNENLSVLLMAKKFEIPVCPHAGG VGLCELVQHLIIFDYISVSASLENRVCEYVDHLHEHFKYPVMIQRASYMPPKDPGYST EMKEESVKKHQYPDGEVWKKLLPAQEN" exon 129..242 /number=2 /evidence=experimental exon 243..314 /number=3 /evidence=experimental exon 315..355 /number=4 exon 356..968 /number=5 exon 969..1050 /number=6 exon 1051..1296 /number=7 exon 1297..1817 /number=8 polyA_signal 1796..1801 /evidence=experimental BASE COUNT 517 a 406 c 447 g 447 t ORIGIN 1 gccacggcgc ggacgccatg cacacggacc ctgactactc ggctgcctat gtcgtcatag 61 aaactgatgc agaagatgga atcaaggggt gtggaattac cttcactctg ggaaaaggca 121 ctgaagttga ttggtccaga aaagggcgtg gtgcacctgg cgacagcggc cgtcctaaac 181 gcggtgtggg acttgtgggc caagcaggag ggaaagcctg tctggaagtt acttgtggac 241 atggatccca ggatgctggt atcctgcata gatttcaggt acatcactga tgtcctgact 301 gaggaggatg ccctagaaat actgcagaaa ggtcaaattg gtaaaaaaga aagagagaag 361 caaatgctgg cacaaggata ccctgcttac acgacatcgt gcgcctggct ggggtactca 421 gatgacacgt tgaagcagct ctgtgcccag gcgctgaagg atggctggac caggtttaaa 481 gtaaaggtgg gtgctgatct ccaggatgac atgcgaagat gccaaatcat ccgagacatg 541 attggaccgg aaaagacttt gatgatggat gccaaccagc gctgggatgt gcctgaggcg 601 gtggagtgga tgtccaagct ggccaagttc aagccattgt ggattgagga gccaacctcc 661 cctgatgaca ttctggggca cgccaccatt tccaaggcac tggtcccatt aggaattggc 721 attgccacag gagaacagtg ccacaataga gtgatattta agcaactcct acaggcgaag 781 gccctgcagt tcctccagat tgacagttgc agactgggca gtgtcaatga gaacctctca 841 gtattgctga tggccaaaaa gtttgaaatt cctgtttgcc cccatgctgg tggagttggc 901 ctctgtgaac tggtgcagca cctgattata tttgactaca tatcagtttc tgcaagcctt 961 gaaaataggg tgtgtgagta tgttgaccac ctgcatgagc atttcaagta tcccgtgatg 1021 atccagcggg cttcctacat gcctcccaag gatcccggct actcaacaga aatgaaggag 1081 gaatctgtaa agaaacacca gtatccagat ggtgaagttt ggaagaaact ccttcctgct 1141 caagaaaatt aagtgctcag ccccaacaac ttttttcttt ctgaagtgaa agggcttaaa 1201 atttcttgga aatagtttta caaaaatgga tttaaaaaat cctaccgatc aagatgagtt 1261 cagctagaag tcataccacc ctcaggaatc agctaaagca aaaagaactt ttacctcggc 1321 atccagccca acccctaaag actgacaata tccttcaagc tcctttgaaa gcaccctaaa 1381 cagccatttc cattttaata gttggatgcg gattgtaccc ttcaatctga aagtcttcag 1441 ctttgaagtc atcaattttc tcaacttttc gaagaatcct gagctttggg aaaggtctgg 1501 gttctcgctg aagctaaaaa caaaataagg ccattatttt gccataattg tacgacctgt 1561 tgtaattgct cctcatgtcc atgaaacaag tacacaggat gtgatcaaca aagttctatt 1621 ttacaggagt atgatcctgt cgataccttg ccgtagttat gtaacatgat tggagcgcaa 1681 ccagctgttc tcttgaccac agatcgagag tgaggggtat tttgtgacat tacacagcat 1741 caggagcctg gtgcctcatc aggtgtaagt tcttataacc actcttggca aatttattaa 1801 agacaggaac acagtca // LOCUS HSTIEMR 3845 bp RNA PRI 17-FEB-1997 DEFINITION Human tie mRNA for putative receptor tyrosine kinase. ACCESSION X60957 S89716 NID g396814 KEYWORDS glycosylated transmembrane protein; tyrosine kinase receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3845) AUTHORS Partanen,J.M. TITLE Direct Submission JOURNAL Submitted (18-JUL-1991) J.M. Partanen, University of Helsinki, Cancer Biology Laboratory, Dept of Pathology and Virology, Haartmaninkatu 3, 00290 Helsinki, FINLAND REMARK revised by [3] REFERENCE 2 (bases 1 to 3845) AUTHORS Partanen,J., Armstrong,E., Makela,T.P., Korhonen,J., Sandberg,M., Renkonen,R., Knuutila,S., Huebner,K. and Alitalo,K. TITLE A novel endothelial cell surface receptor tyrosine kinase with extracellular epidermal growth factor homology domains JOURNAL Mol. Cell. Biol. 12 (4), 1698-1707 (1992) MEDLINE 92195316 REFERENCE 3 (bases 1 to 3845) AUTHORS Partanen,J.M. TITLE Direct Submission JOURNAL Submitted (28-JUL-1993) J.M. Partanen, University of Helsinki, Cancer Biology Laboratory, Dept of Pathology and Virology, Haartmaninkatu 3, 00290 Helsinki, FINLAND FEATURES Location/Qualifiers source 1..3845 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="leukemia" /cell_line="HEL" /clone_lib="cDNA in lambda gt11" /clone="HEl1-1, 12a" /chromosome="1" gene 37..3453 /gene="tie" CDS 37..3453 /gene="tie" /codon_start=1 /product="receptor tyrosine kinase" /db_xref="PID:g396815" /db_xref="SWISS-PROT:P35590" /translation="MVWRVPPFLLPILFLASHVGAAVDLTLLANLRLTDPQRFFLTCV SGEAGAGRGSDAWGPPLLLEKDDRIVRTPPGPPLRLARNGSHQVTLRGFSKPSDLVGV FSCVGGAGARRTRVIYVHNSPGAHLLPDKVTHTVNKGDTAVLSARVHKEKQTDVIWKS NGSYFYTLDWHEAQDGRFLLQLPNVQPPSSGIYSATYLEASPLGSAFFRLIVRGCGAG RWGPGCTKECPGCLHGGVCHDHDGECVCPPGFTGTRCEQACREGRFGQSCQEQCPGIS GCRGLTFCLPDPYGCSCGSGWRGSQCQEACAPGHFGADCRLQCQCQNGGTCDRFSGCV CPSGWHGVHCEKSDRIPQILNMASELEFNLETMPRINCAAAGNPFPVRGSIELRKPDG TVLLSTKAIVEPEKTTAEFEVPRLVLADSGFWECRVSTSGGQDSRRFKVNVKVPPVPL AAPRLLTKQSRQLVVSPLVSFSGDGPISTVRLHYRPQDSTMDWSTIVVDPSENVTLMN LRPKTGYSVRVQLSRPGEGGEGAWGPPTLMTTDCPEPLLQPWLEGWHVEGTDRLRVSW SLPLVPGPLVGDGFLLRLWDGTRGQERRENVSSPQARTALLTGLTPGTHYQLDVQLYH CTLLGPASPPAHVLLPPSGPPAPRHLHAQALSDSEIQLTWKHPEALPGPISKYVVEVQ VAGGAGDPLWIDVDRPEETSTIIRGLNASTRYLFRMRASIQGLGDWSNTVEESTLGNG LQAEGPVQESRAAEEGLDQQLILAVVGSVSATCLTILAALLTLVCIRRSCLHRRRTFT YQSGSGEETILQFSSGTLTLTRRPKLQPEPLSYPVLEWEDITFEDLIGEGNFGQVIRA MIKKDGLKMNAAIKMLKEYASENDHRDFAGELEVLCKLGHHPNIINLLGACKNRGYLY IAIEYAPYGNLLDFLRKSRVLETDPAFAREHGTASTLSSRQLLRFASDAANGMQYLSE KQFIHRDLAARNVLVGENLASKIADFGLSRGEEVYVKKTMGRLPVRWMAIESLNYSVY TTKSDVWSFGVLLWEIVSLGGTPYCGMTCAELYEKLPQGYRMEQPRNCDDEVYELMRQ CWRDRPYERPPFAQIALQLGRMLEARKAYVNMSLFENFTYAGIDATAEEA" misc_feature 214..257 /gene="tie" /note="EGF homology domain I" misc_feature 258..304 /gene="tie" /note="EGF homology domain II" misc_feature 305..346 /gene="tie" /note="EGF homology domain III" misc_feature 761..786 /gene="tie" /note="transmembrane domain" misc_feature 836..1107 /gene="tie" /note="tyrosine kinase domain" old_sequence 3242..3244 /gene="tie" /citation=[1] /replace="gct" BASE COUNT 743 a 1179 c 1155 g 768 t ORIGIN 1 cgctcgtcct ggctggcctg ggtcggcctc tggagtatgg tctggcgggt gccccctttc 61 ttgctcccca tcctcttctt ggcttctcat gtgggcgcgg cggtggacct gacgctgctg 121 gccaacctgc ggctcacgga cccccagcgc ttcttcctga cttgcgtgtc tggggaggcc 181 ggggcgggga ggggctcgga cgcctggggc ccgcccctgc tgctggagaa ggacgaccgt 241 atcgtgcgca ccccgcccgg gccacccctg cgcctggcgc gcaacggttc gcaccaggtc 301 acgcttcgcg gcttctccaa gccctcggac ctcgtgggcg tcttctcctg cgtgggcggt 361 gctggggcgc ggcgcacgcg cgtcatctac gtgcacaaca gccctggagc ccacctgctt 421 ccagacaagg tcacacacac tgtgaacaaa ggtgacaccg ctgtactttc tgcacgtgtg 481 cacaaggaga agcagacaga cgtgatctgg aagagcaacg gatcctactt ctacaccctg 541 gactggcatg aagcccagga tgggcggttc ctgctgcagc tcccaaatgt gcagccacca 601 tcgagcggca tctacagtgc cacttacctg gaagccagcc ccctgggcag cgccttcttt 661 cggctcatcg tgcggggttg tggggctggg cgctgggggc caggctgtac caaggagtgc 721 ccaggttgcc tacatggagg tgtctgccac gaccatgacg gcgaatgtgt atgcccccct 781 ggcttcactg gcacccgctg tgaacaggcc tgcagagagg gccgttttgg gcagagctgc 841 caggagcagt gcccaggcat atcaggctgc cggggcctca ccttctgcct cccagacccc 901 tatggctgct cttgtggatc tggctggaga ggaagccagt gccaagaagc ttgtgcccct 961 ggtcattttg gggctgattg ccgactccag tgccagtgtc agaatggtgg cacttgtgac 1021 cggttcagtg gttgtgtctg cccctctggg tggcatggag tgcactgtga gaagtcagac 1081 cggatccccc agatcctcaa catggcctca gaactggagt tcaacttaga gacgatgccc 1141 cggatcaact gtgcagctgc agggaacccc ttccccgtgc ggggcagcat agagctacgc 1201 aagccagacg gcactgtgct cctgtccacc aaggccattg tggagccaga gaagaccaca 1261 gctgagttcg aggtgccccg cttggttctt gcggacagtg ggttctggga gtgccgtgtg 1321 tccacatctg gcggccaaga cagccggcgc ttcaaggtca atgtgaaagt gccccccgtg 1381 cccctggctg cacctcggct cctgaccaag cagagccgcc agcttgtggt ctccccgctg 1441 gtctcgttct ctggggatgg acccatctcc actgtccgcc tgcactaccg gccccaggac 1501 agtaccatgg actggtcgac cattgtggtg gaccccagtg agaacgtgac gttaatgaac 1561 ctgaggccaa agacaggata cagtgttcgt gtgcagctga gccggccagg ggaaggagga 1621 gagggggcct gggggcctcc caccctcatg accacagact gtcctgagcc tttgttgcag 1681 ccgtggttgg agggctggca tgtggaaggc actgaccggc tgcgagtgag ctggtccttg 1741 cccttggtgc ccgggccact ggtgggcgac ggtttcctgc tgcgcctgtg ggacgggaca 1801 cgggggcagg agcggcggga gaacgtctca tccccccagg cccgcactgc cctcctgacg 1861 ggactcacgc ctggcaccca ctaccagctg gatgtgcagc tctaccactg caccctcctg 1921 ggcccggcct cgccccctgc acacgtgctt ctgcccccca gtgggcctcc agccccccga 1981 cacctccacg cccaggccct ctcagactcc gagatccagc tgacatggaa gcacccggag 2041 gctctgcctg ggccaatatc caagtacgtt gtggaggtgc aggtggctgg gggtgcagga 2101 gacccactgt ggatagacgt ggacaggcct gaggagacaa gcaccatcat ccgtggcctc 2161 aacgccagca cgcgctacct cttccgcatg cgggccagca ttcaggggct cggggactgg 2221 agcaacacag tagaagagtc caccctgggc aacgggctgc aggctgaggg cccagtccaa 2281 gagagccggg cagctgaaga gggcctggat cagcagctga tcctggcggt ggtgggctcc 2341 gtgtctgcca cctgcctcac catcctggcc gcccttttaa ccctggtgtg catccgcaga 2401 agctgcctgc atcggagacg caccttcacc taccagtcag gctcgggcga ggagaccatc 2461 ctgcagttca gctcagggac cttgacactt acccggcggc caaaactgca gcccgagccc 2521 ctgagctacc cagtgctaga gtgggaggac atcacctttg aggacctcat cggggagggg 2581 aacttcggcc aggtcatccg ggccatgatc aagaaggacg ggctgaagat gaacgcagcc 2641 atcaaaatgc tgaaagagta tgcctctgaa aatgaccatc gtgactttgc gggagaactg 2701 gaagttctgt gcaaattggg gcatcacccc aacatcatca acctcctggg ggcctgtaag 2761 aaccgaggtt acttgtatat cgctattgaa tatgccccct acgggaacct gctagatttt 2821 ctgcggaaaa gccgggtcct agagactgac ccagcttttg ctcgagagca tgggacagcc 2881 tctaccctta gctcccggca gctgctgcgt ttcgccagtg atgcggccaa tggcatgcag 2941 tacctgagtg agaagcagtt catccacagg gacctggctg cccggaatgt gctggtcgga 3001 gagaacctag cctccaagat tgcagacttc ggcctttctc ggggagagga ggtttatgtg 3061 aagaagacga tggggcgtct ccctgtgcgc tggatggcca ttgagtccct gaactacagt 3121 gtctatacca ccaagagtga tgtctggtcc tttggagtcc ttctttggga gatagtgagc 3181 cttggaggta caccctactg tggcatgacc tgtgccgagc tctatgaaaa gctgccccag 3241 ggctaccgca tggagcagcc tcgaaactgt gacgatgaag tgtacgagct gatgcgtcag 3301 tgctggcggg accgtcccta tgagcgaccc ccctttgccc agattgcgct acagctaggc 3361 cgcatgctgg aagccaggaa ggcctatgtg aacatgtcgc tgtttgagaa cttcacttac 3421 gcgggcattg atgccacagc tgaggaggcc tgagctgcca tccagccaga acgtggctct 3481 gctggccgga gcaaactctg ctgtctaacc tgtgaccagt ctgaccctta cagcctctga 3541 cttaagctgc ctcaaggaat ttttttaact taagggagaa aaaaagggat ctggggatgg 3601 ggtgggctta ggggaactgg gttcccatgc tttgtaggtg tctcatagct atcctgggca 3661 tccttctttc tagttcagct gccccacagg tgtgtttccc atcccactgc tcccccaaca 3721 caaaccccca ctccagctcc ttcgcttaag ccagcactca caccactaac atgccctgtt 3781 cagctactcc cactcccggc ctgtcattca gaaaaaaata aatgttctaa taagctccaa 3841 aaaaa // LOCUS HSTIF 1536 bp RNA PRI 03-JUN-1994 DEFINITION H.sapiens nuk_34 mRNA for translation initiation factor. ACCESSION X79538 NID g496901 KEYWORDS translation initiation factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1536) AUTHORS Leffers,H., Wiemann,S. and Ansorge,W. TITLE Cloning and sequencing of a putative human translation initiation factor with similarity to initiation factor 4AII JOURNAL Unpublished REFERENCE 2 (bases 1 to 1536) AUTHORS Leffers,H. TITLE Direct Submission JOURNAL Submitted (01-JUN-1994) H. Leffers, Inst. of Medical Research Biochemistry & Danish Centre for Human Genome Research, Ole Worms Alle 170, Aarhus Univ., 8000 Aarhus C, DENMARK FEATURES Location/Qualifiers source 1..1536 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="skin" /cell_type="keratinocyte" /cell_line="non fractionated non cultured normal keratinocytes" /clone_lib="lambda ZapII" /clone="nuk_34" CDS 74..1309 /codon_start=1 /product="translation initiation factor" /db_xref="PID:g496902" /db_xref="SWISS-PROT:P38919" /translation="MATTATMATSGSARKRLLKEEDMTKVEFETSEEVDVTPTFDTMG LREDLLRGIYAYGFEKPSAIQQRAIKQIIKGRDVIAQSQSGTGKTATFSISVLQCLDI QVRETQALILAPTRELAVQIQKGLLALGDYMNVQCHACIGGTNVGEDIRKLDYGQHVV AGTPGRVFDMIRRRSLRTRAIKMLVLDEADEMLNKGFKEQIYDVYRYLPSATQVVLIS ATLPHEILEMTNKFMTDPIRILVKRDELTLEGIKQFFVAVEREEWKFDTLCDLYDTLT ITQAVIFCNTKRKVDWLTEKMREANFTVSSMHGDMPQKERESIMKEFRSGASRVLIST DVWARGLDVPQVSLIINYDLPNNRELYIHRIGRSGQYGRKGVAINFVKNDDIRILRDI EQYYSTQIDEMPMNVADLI" BASE COUNT 402 a 353 c 409 g 372 t ORIGIN 1 cggcagcgag gtcggcagcg gcacagcgag gtcggcagcg gcgcgcgctg tgctcttccg 61 cggactctga atcatggcga ccacggccac gatggcgacc tcgggctcgg cgcgaaagcg 121 gctgctcaaa gaggaagaca tgactaaagt ggaattcgag accagcgagg aggtggatgt 181 gacccccacg ttcgacacca tgggcctgcg ggaggacctg ctgcggggca tctacgctta 241 cggttttgaa aaaccatcag caatccagca acgagcaatc aagcagatca tcaaagggag 301 agatgtcatc gcacagtctc agtccggcac aggaaaaaca gccaccttca gtatctcagt 361 cctccagtgt ttggatattc aggttcgtga aactcaagct ttgatcttgg ctcccacaag 421 agagttggct gtgcagatcc agaaggggct gcttgctctc ggtgactaca tgaatgtcca 481 gtgccatgcc tgcattggag gcaccaatgt tggcgaggac atcaggaagc tggattacgg 541 acagcatgtt gtcgcgggca ctccagggcg tgtttttgat atgattcgtc gcagaagcct 601 aaggacacgt gctatcaaaa tgttggtttt ggatgaagct gatgaaatgt tgaataaagg 661 tttcaaagag cagatttacg atgtatacag gtacctgcct tcagccacac aggtggttct 721 catcagtgcc acgctgccac acgagattct ggagatgacc aacaagttca tgaccgaccc 781 aatccgcatc ttggtgaaac gtgatgaatt gactctggaa ggcatcaagc aatttttcgt 841 ggcagtggag agggaagagt ggaaatttga cactctgtgt gacctctacg acacactgac 901 catcactcag gcggtcatct tctgcaacac caaaagaaag gtggactggc tgacggagaa 961 aatgagggaa gccaacttca ctgtatcctc aatgcatgga gacatgcccc agaaagagcg 1021 ggagtccatc atgaaggagt tccggtcggg cgccagccga gtgcttattt ctacagatgt 1081 ctgggccagg gggttggatg tccctcaggt gtccctcatc attaactatg atctccctaa 1141 taacagagaa ttgtacatac acagaattgg gagatcaggt caatacggcc ggaagggtgt 1201 ggccattaac tttgtaaaga atgacgacat ccgcatcctc agagatatcg agcagtacta 1261 ttccactcag attgatgaga tgccgatgaa cgttgctgat cttatctgaa gcagcagatc 1321 agtgggatga gggagactgt tcacctgctg tgtactcctg tttggaagta tttagatcca 1381 gattctactt aatggggttt atatggactt tcttctcata aatggcctgc cgtctccctt 1441 cctttgaaga ggatatgggg attctgctct cttttcttat ttacatgtaa ataatacatt 1501 gttctaagtc tttttcatta aaaatttaaa acttta // LOCUS HSTIF2GEN 6156 bp RNA PRI 10-MAR-1997 DEFINITION H.sapiens mRNA for transcriptional intermediary factor 2. ACCESSION X97674 NID g1877214 KEYWORDS alternatively spliced; nuclear receptor coactivator; TIF2 gene; transcriptional mediator. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6156) AUTHORS Voegel,J.J., Heine,M.J., Zechel,C., Chambon,P. and Gronemeyer,H. TITLE TIF2, a 160 kDa transcriptional mediator for the ligand-dependent activation function AF-2 of nuclear receptors JOURNAL EMBO J. 15 (14), 3667-3675 (1996) MEDLINE 96312964 REFERENCE 2 (bases 1 to 6156) AUTHORS Voegel,J.J. TITLE Direct Submission JOURNAL Submitted (22-APR-1996) J.J. Voegel, IGBMC Inst.de Genet.et Biol.Mol.et Cell., CNRS-INSERM-Univ.Louis Pasteur, B.P.163, C.U. de Strasbourg, F-67404 ILLKIRCH CEDEX, FRANCE REMARK Revised by author 25-JUL-96 and 10-MAR-97 COMMENT Related sequences U39060, U40396. FEATURES Location/Qualifiers source 1..6156 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambdaEXlox Ref.No.56" /tissue_type="placenta" gene 163..4557 /gene="TIF2" CDS 163..4557 /gene="TIF2" /function="transcriptional mediator for ligand-dependent activation function AF-2 of nuclear receptors" /codon_start=1 /product="transcriptional intermediary factor 2" /db_xref="PID:e307031" /db_xref="PID:g1877215" /translation="MSGMGENTSDPSRAETRKRKECPDQLGPSPKRNTEKRNREQENK YIEELAELIFANFNDIDNFNFKPDKCAILKETVKQIRQIKEQEKAAAANIDEVQKSDV SSTGQGVIDKDALGPMMLEALDGFFFVVNLEGNVVFVSENVTQYLRYNQEELMNKSVY SILHVGDHTEFVKNLLPKSIVNGGSWSGEPPRRNSHTFNCRMLVKPLPDSEEEGHDNQ EAHQKYETMQCFAVSQPKSIKEEGEDLQSCLICVARRVPMKERPVLPSSESFTTRQDL QGKITSLDTSTMRAAMKPGWEDLVRRCIQKFHAQHEGESVSYAKRHHHEVLRQGLAFS QIYRFSLSDGTLVAAQTKSKLIRSQTTNEPQLVISLHMLHREQNVCVMNPDLTGQTMG KPLNPISSNSPAHQALCSGNPGQDMTLSSNINFPINGPKEQMGMPMGRFGGSGGMNHV SGMQATTPQGSNYALKMNSPSQSSPGMNPGQPTSMLSPRHRMSPGVAGSPRIPPSQFS PAGSLHSPVGVCSSTGNSHSYTNSSLNALQALSEGHGVSLGSSLASPDLKMGNLQNSP VNMNPPPLSKMGSLDSKDCFGLYGEPSEGTTGQAESSCHPGEQKETNDPNLPPAVSSE RADGQSRLHDSKGQTKLLQLLTTKSDQMEPSPLASSLSDTNKDSTGSLPGSGSTHGTS LKEKHKILHRLLQDSSSPVDLAKLTAEATGKDLSQESSSTAPGSEVTIKQEPVSPKKK ENALLRYLLDKDDTKDIGLPEITPKLERLDSKTDPASNTKLIAMKTEKEEMSFEPGDQ PGSELDNLEEILDDLQNSQLPQLFPDTRPGAPAGSVDKQAIINDLMQLTAENSPVTPV GAQKTALRISQSTFNNPRPGQLGRLLPNQNLPLDITLQSPTGAGPFPPIRNSSPYSVI PQPGMMGNQGMIGNQGNLGNSSTGMIGNSASRPTMPSGEWAPQSSAVRVTCAATTSAM NRPVQGGMIRNPAASIPMRPSSQPGQRQTLQSQVMNIGPSELEMNMGGPQYSQQQAPP NQTAPWPESILPIDQASFASQNRQPFGSSPDDLLCPHPAAESPSDEGALLDQLYLALR NFDGLEEIDRALGIPELVSQSQAVDPEQFSSQDSNIMLEQKAPVFPQQYASQAQMAQG SYSPMQDPNFHTMGQRPSYATLRMQPRPGLRPTGLVQNQPNQLRLQLQHRLQAQQNRQ PLMNQISNVSNVNLTLRPGVPTQAPINAQMLAQRQREILNQHLRQRQMHQQQQVQQRT LMMRGQGLNMTPSMVAPSGMPATMSNPRIPQANAQQFPFPPNYGISQQPDPGFTGATT PQSPLMSPRMAHTQSPMMQQSQANPAYQAPSDINGWAQGNMGGNSMFSQQSPPHFGQQ ANTSMYSNNMNINVSMATNTGGMSSMNQMTGQISMTSVTSVPTSGLSSMGPEQVNDPA LRGGNLFPNQLPGMDMIKQEGDTTRKYC" misc_feature 2768..2974 /gene="TIF2" /note="putative alternatively spliced region" BASE COUNT 1799 a 1493 c 1406 g 1458 t ORIGIN 1 ggcggccgca gcctcggcta cagcttcggc ggcgaaggtc agcgccgacg gcagccggca 61 cctgacggcg tgaccgaccc gagccgattt ctcttggatt tggctacaca cttatagatc 121 ttctgcactg tttacaggca cagttgctga tatgtgttca agatgagtgg gatgggagaa 181 aatacctctg acccctccag ggcagagaca agaaagcgca aggaatgtcc tgaccaactt 241 ggacccagcc ccaaaaggaa cactgaaaaa cgtaatcgtg aacaggaaaa taaatatata 301 gaagaacttg cagagttgat ttttgcaaat tttaatgata tagacaactt taacttcaaa 361 cctgacaaat gtgcaatctt aaaagaaact gtgaagcaaa ttcgtcagat caaagaacaa 421 gagaaagcag cagctgccaa catagatgaa gtgcagaagt cagatgtatc ctctacaggg 481 cagggtgtca tcgacaagga tgcgctgggg cctatgatgc ttgaggccct tgatgggttc 541 ttctttgtag tgaacctgga aggcaacgtt gtgtttgtgt cagagaatgt gacacagtat 601 ctaaggtata accaagaaga gctgatgaac aaaagtgtat atagcatctt gcatgttggg 661 gaccacacgg aatttgtcaa aaacctgctg ccaaagtcta tagtaaatgg gggatcttgg 721 tctggcgaac ctccgaggcg gaacagccat accttcaatt gtcggatgct ggtaaaacct 781 ttacctgatt cagaagagga gggtcatgat aaccaggaag ctcatcagaa atatgaaact 841 atgcagtgct tcgctgtctc tcaaccaaag tccatcaaag aagaaggaga agatttgcag 901 tcctgcttga tttgcgtggc aagaagagtt cccatgaagg aaagaccagt tcttccctca 961 tcagaaagtt ttactactcg ccaggatctc caaggcaaga tcacgtctct ggataccagc 1021 accatgagag cagccatgaa accaggctgg gaggacctgg taagaaggtg tattcagaag 1081 ttccatgcgc agcatgaagg agaatctgtg tcctatgcta agaggcatca tcatgaagta 1141 ctgagacaag gattggcatt cagtcaaatc tatcgttttt ccttgtctga tggcactctt 1201 gttgctgcac aaacgaagag caaactcatc cgttctcaga ctactaatga acctcaactt 1261 gtaatatctt tacatatgct tcacagagag cagaatgtgt gtgtgatgaa tccggatctg 1321 actggacaaa cgatggggaa gccactgaat ccaattagct ctaacagccc tgcccatcag 1381 gccctgtgca gtgggaaccc aggtcaggac atgaccctca gtagcaatat aaattttccc 1441 ataaatggcc caaaggaaca aatgggcatg cccatgggca ggtttggtgg ttctggggga 1501 atgaaccatg tgtcaggcat gcaagcaacc actcctcagg gtagtaacta tgcactcaaa 1561 atgaacagcc cctcacaaag cagccctggc atgaatccag gacagcccac ctccatgctt 1621 tcaccaaggc atcgcatgag ccctggagtg gctggcagcc ctcgaatccc acccagtcag 1681 ttttcccctg caggaagctt gcattcccct gtgggagttt gcagcagcac aggaaatagc 1741 catagttata ccaacagctc cctcaatgca cttcaggccc tcagcgaggg gcacggggtc 1801 tcattagggt catcgttggc ttcaccagac ctaaaaatgg gcaatttgca aaactcccca 1861 gttaatatga atcctccccc actcagcaag atgggaagct tggactcaaa agactgtttt 1921 ggactatatg gggagccctc tgaaggtaca actggacaag cagagagcag ctgccatcct 1981 ggagagcaaa aggaaacaaa tgaccccaac ctgcccccgg ccgtgagcag tgagagagct 2041 gacgggcaga gcagactgca tgacagcaaa gggcagacca aactcctgca gctgctgacc 2101 accaaatctg atcagatgga gccctcgccc ttagccagct ctttgtcgga tacaaacaaa 2161 gactccacag gtagcttgcc tggttctggg tctacacatg gaacctcgct caaggagaag 2221 cataaaattt tgcacagact cttgcaggac agcagttccc ctgtggactt ggccaagtta 2281 acagcagaag ccacaggcaa agacctgagc caggagtcca gcagcacagc tcctggatca 2341 gaagtgacta ttaaacaaga gccggtgagc cccaagaaga aagagaatgc actacttcgc 2401 tatttgctag ataaagatga tactaaagat attggtttac cagaaataac ccccaaactt 2461 gagagactgg acagtaagac agatcctgcc agtaacacaa aattaatagc aatgaaaact 2521 gagaaggagg agatgagctt tgagcctggt gaccagcctg gcagtgagct ggacaacttg 2581 gaggagattt tggatgattt gcagaatagt caattaccac agcttttccc agacacgagg 2641 ccaggcgccc ctgctggatc agttgacaag caagccatca tcaatgacct catgcaactc 2701 acagctgaaa acagccctgt cacacctgtt ggagcccaga aaacagcact gcgaatttca 2761 cagagcactt ttaataaccc acgaccaggg caactgggca ggttattgcc aaaccagaat 2821 ttaccacttg acatcacatt gcaaagccca actggtgctg gacctttccc accaatcaga 2881 aacagtagtc cctactcagt gatacctcag ccaggaatga tgggtaatca agggatgata 2941 ggaaaccaag gaaatttagg gaacagtagc acaggaatga ttggtaacag tgcttctcgg 3001 cctactatgc catctggaga atgggcaccg cagagttcgg ctgtgagagt cacctgtgct 3061 gctaccacca gtgccatgaa ccggccagtc caaggaggta tgattcggaa cccagcagcc 3121 agcatcccca tgaggcccag cagccagcct ggccaaagac agacgcttca gtctcaggtc 3181 atgaatatag ggccatctga attagagatg aacatggggg gacctcagta tagccaacaa 3241 caagctcctc caaatcagac tgccccatgg cctgaaagca tcctgcctat agaccaggcg 3301 tcttttgcca gccaaaacag gcagccattt ggcagttctc cagatgactt gctatgtcca 3361 catcctgcag ctgagtctcc gagtgatgag ggagctctcc tggaccagct gtatctggcc 3421 ttgcggaatt ttgatggcct ggaggagatt gatagagcct taggaatacc cgaactggtc 3481 agccagagcc aagcagtaga tccagaacag ttctcaagtc aggattccaa catcatgctg 3541 gagcagaagg cgcccgtttt cccacagcag tatgcatctc aggcacaaat ggcccagggt 3601 agctattctc ccatgcaaga tccaaacttt cacaccatgg gacagcggcc tagttatgcc 3661 acactccgta tgcagcccag accgggcctc aggcccacgg gcctagtgca gaaccagcca 3721 aatcaactaa gacttcaact tcagcatcgc ctccaagcac agcagaatcg ccagccactt 3781 atgaatcaaa tcagcaatgt ttccaatgtg aacttgactc tgaggcctgg agtaccaaca 3841 caggcaccta ttaatgcaca gatgctggcc cagagacaga gggaaatcct gaaccagcat 3901 cttcgacaga gacaaatgca tcagcaacag caagttcagc aacgaacttt gatgatgaga 3961 ggacaagggt tgaatatgac accaagcatg gtggctccta gtggtatgcc agcaactatg 4021 agcaaccctc ggattcccca ggcaaatgca cagcagtttc catttcctcc aaactacgga 4081 ataagtcagc aacctgatcc aggctttact ggggctacga ctccccagag cccacttatg 4141 tcaccccgaa tggcacatac acagagtccc atgatgcaac agtctcaggc caacccagcc 4201 tatcaggccc cctccgacat aaatggatgg gcgcagggga acatgggcgg aaacagcatg 4261 ttttcccagc agtccccacc acactttggg cagcaagcaa acaccagcat gtacagtaac 4321 aacatgaaca tcaatgtgtc catggcgacc aacacaggtg gcatgagcag catgaaccag 4381 atgacaggac agatcagcat gacctcagtg acctccgtgc ctacgtcagg gctgtcctcc 4441 atgggtcccg agcaggttaa tgatcctgct ctgaggggag gcaacctgtt cccaaaccag 4501 ctgcctggaa tggatatgat taagcaggag ggagacacaa cacggaaata ttgctgacac 4561 tgctgaagcc agttgcttct tcagctgacc gggctcactt gctcaaaaca cttccagtct 4621 ggagagctgt gtctatttgt ttcaacccaa ctgacctgcc agccggttct gctagagcag 4681 acaggcctgg ccctggttcc cagggtggcg tccactcggc tgtggcagga ggagctgcct 4741 cttctcttga cagtctgaag ctcgcatcca gacagtcgct cagtctgttc cctgcattca 4801 ccttagtgca acttagatct ctcctcccca agtaaatgtt gacaggccaa tttcataccc 4861 atgtcagatt gaatgtattt aaatgtatgt atttaaggag aaccatgctc ttgttctgtt 4921 cctgttcggt tccagacact ggtttcttgc tttgttttcc ctggctaaca gtctagtgcc 4981 aaagattaag attttatctg ggggaaagaa aagaattttt taaaaaatta aactaaagat 5041 gttttaagct aaagcctgaa tttgggatgg aagcaggaca gacaccgtgg acagcgctgt 5101 atttacagac acacccagtg cgtgaagacc aacaaagtca cagtcgtatc tctagaaagc 5161 tctaaagacc atgttggaaa gagtctccag ttactgaaca gatgaaaagg agcctgtgag 5221 agggctgtta acattagcaa atattttttc cttgtttttt ctttgttaaa accaaactgg 5281 ttcacctgaa tcatgaattg agaagaaata attttcattt ctaaattaag tcccttttag 5341 tttgatcaga cagcttgaat cagcatctct tcttccctgt cagcctgact cttcccttcc 5401 cctctctcat tccccatact ccctattttc attccttttt taaaaaataa tataagctac 5461 agaaaccagg taagcccttt atttccttaa atgttttgcc agccacttac caattgctaa 5521 gtattgaatt tcagaaaaaa aaaatgcatt tactggcaag gagaagagca aagttaaggc 5581 ttgataccaa tcgagctaag gatacctgct ttggaagcat gtttattctg ttccccagca 5641 actctggcct ccaaaatggg agaaacgcca gtgtgtttaa attgatagca gatatcacga 5701 cagatttaac ctctgccatg tgttttttat tttgtttttt agcagtgctg actaagccga 5761 agttttgtaa ggtacataaa atccaattta tatgtaaaca agcaataatt taagttgaga 5821 acttatgtgt tttaattgta taatttttgt gaggtataca tattgtggaa ttgactcaaa 5881 aatgaggtac ttcagtatta aattagatat cttcatagca atgtctccta aaggtgtttt 5941 gtaaaggata tcaatgcctt gattagacct aatttgtaga cttaagactt tttattttct 6001 aaaccttgtg attctgctta taagtcattt atctaatcta tatgatatgc agccgctgta 6061 ggaaccaatt cttgattttt atatgtttat attctttctt aatgaacctt agaaagacta 6121 catgttacta agcaggccac ttttatggtt gttttt // LOCUS HSTIM17 920 bp RNA PRI 10-JUN-1997 DEFINITION H.sapiens mRNA for TIM17 preprotein translocase. ACCESSION X97544 NID g1770563 KEYWORDS preprotein translocase; tim17 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 920) AUTHORS Boemer,U., Rassow,J., Pfanner,N., Meijer,M. and Maarse,A.C. TITLE The preprotein translocase of the inner mitochondrial membrane evolutionary conservation of targeting and assembly of Tim17 JOURNAL Unpublished REFERENCE 2 (bases 1 to 920) AUTHORS Maarse,A.C. TITLE Direct Submission JOURNAL Submitted (25-APR-1996) A.C. Maarse, Institute for Molecular Cell Biology, Section for Molecular Biology, Kruislaan 318, 1098SM Amsterdam, NETHERLANDS REFERENCE 3 (bases 1 to 920) AUTHORS Bomer,U., Rassow,J., Zufall,N., Pfanner,N., Meijer,M. and Maarse,A.C. TITLE The preprotein translocase of the inner mitochondrial membrane: evolutionary conservation of targeting and assembly of Tim17 JOURNAL J. Mol. Biol. 262 (4), 389-395 (1996) MEDLINE 97049120 FEATURES Location/Qualifiers source 1..920 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HUVEC" gene 14..529 /gene="tim17" CDS 14..529 /gene="tim17" /codon_start=1 /product="preprotein translocase" /db_xref="PID:e243529" /db_xref="PID:g1770564" /translation="MEEYAREPCPWRIVDDCGGAFTMGTIGGGIFQAIKGFRNSPVGV NHRLRGSLTAIKTRAPQLGGSFAVWGGLFSMIDCSMVQVRGKEDPWNSITSGALTGAI LAARNGPVAMVGSAAMGGILLALIEGAGILLTRFASAQFPNGPQFAEDPSQLPSTQLP SSPFGDYRQYQ" BASE COUNT 267 a 170 c 227 g 256 t ORIGIN 1 cattggagtc aagatggagg agtacgcgcg agagccttgc ccatggcgaa ttgtggatga 61 ctgtggtggg gcctttacga tgggtaccat tggtggtggt atctttcaag caatcaaagg 121 ttttcgcaat tctccagtgg gagtaaacca cagactacga gggagtttga cagctattaa 181 aaccagggct ccacagttag gaggtagctt tgcagtttgg ggagggctgt tttccatgat 241 tgactgtagt atggttcaag tcagaggaaa ggaagatccc tggaactcca tcacaagtgg 301 tgccttaacg ggagccatac tggcagcaag aaatggacca gtggccatgg ttgggtcagc 361 cgcaatgggt ggcattctcc tagctttaat tgaaggagct ggtatcttgt tgacaagatt 421 tgcctctgca cagtttccca atggtcctca gtttgcagaa gacccctccc agttgccttc 481 aactcagtta ccttcctcac cttttggaga ctatcgacaa tatcagtagg acttctttcc 541 taggatttct ttaacagaac gagttgtggt tcgagaagga tttcagaaga tcaagttaca 601 gtctgttttt aaaaccatag gtgggacagc tatggccaat aggctataaa gagacattta 661 gcactttttt ctatttaaag gaacaagcgg ggaagggtgc taaaagataa tacgtttatt 721 tattcacact tgaattgcat ttgtgatcaa aataaatgtt taaatcgcta aaggaaaata 781 cagtaagtgc ttgaaagatg aaggaccaaa aggccaaaaa acagtgaaat atgatcatca 841 tttccttgcg gacttctctg cctggttttg tgtgttctgt tattcaaaca ataaaaagct 901 ggtggaactt aaaaaaaaaa // LOCUS HSTIMP3 1021 bp RNA PRI 06-JAN-1995 DEFINITION H.sapiens TIMP3 mRNA for tissue inhibitor of metalloproteinases-3. ACCESSION X76227 NID g495251 KEYWORDS TIMP-3 gene; tissue inhibitor of metalloproteinases. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1021) AUTHORS Lopez-Otin,C. TITLE Direct Submission JOURNAL Submitted (19-NOV-1993) C. Lopez-Otin, Universidad de Oviedo, Dept. de Biologia Funcional, Area de Bioquimica, Fac. de Medicina, C/ Julian Claveria S/N, 33006 Oviedo, SPAIN REFERENCE 2 (bases 1 to 1021) AUTHORS Uria,J.A., Ferrando,A.A., Velasco,G., Freije,J.M. and Lopez-Otin,C. TITLE Structure and expression in breast tumors of human TIMP-3, a new member of the metalloproteinase inhibitor family JOURNAL Cancer Res. 54 (8), 2091-2094 (1994) MEDLINE 94228524 FEATURES Location/Qualifiers source 1..1021 /organism="Homo sapiens" /isolate="patient I-9" /db_xref="taxon:9606" /tissue_type="breast carcinoma" /clone_lib="lambda gt11" /clone="T7-1" gene 71..706 /gene="TIMP3" CDS 71..706 /gene="TIMP3" /codon_start=1 /product="tissue inhibitor of metalloproteinases-3" /db_xref="PID:g495252" /db_xref="SWISS-PROT:P35625" /translation="MTPWLGLIVLLGSWSLGDWGAEACTCSPSHPQDAFCNSDIVIRA KVVGKKLVKEGPFGTLVYTIKQMKMYRGFTKMPHVQYIHTEASESLCGLKLEVNKYQY LLTGRVYDGKMYTGLCNFVERWDQLTLSQRKGLNYRYHLGCNCKIKSCYYLPCFVTSK NECLWTDMLSNFGYPGYQSKHYACIRQKGGYCSWYRGWAPPDKSIINATDP" BASE COUNT 225 a 301 c 266 g 229 t ORIGIN 1 cccgccggcg gcgcgcacgg caactttgga gaggcgagca gcagccccgg cagcggcggc 61 agcagcggca atgacccctt ggctcgggct catcgtgctc ctgggcagct ggagcctggg 121 ggactggggc gccgaggcgt gcacatgctc gcccagccac ccccaggacg ccttctgcaa 181 ctccgacatc gtgatccggg ccaaggtggt ggggaagaag ctggtaaagg aggggccctt 241 cggcacgctg gtctacacca tcaagcagat gaagatgtac cgaggcttca ccaagatgcc 301 ccatgtgcag tacatccaca cggaagcttc cgagagtctc tgtggcctta agctggaggt 361 caacaagtac cagtacctgc tgacaggtcg cgtctatgat ggcaagatgt acacggggct 421 gtgcaacttc gtggagaggt gggaccagct caccctctcc cagcgcaagg ggctgaacta 481 tcggtatcac ctgggttgta actgcaagat caagtcctgc tactacctgc cttgctttgt 541 gacttccaag aacgagtgtc tctggaccga catgctctcc aatttcggtt accctggcta 601 ccagtccaaa cactacgcct gcatccggca gaagggcggc tactgcagct ggtaccgagg 661 atgggccccc ccggataaaa gcatcatcaa tgccacagac ccctgagcgc cagaccctgc 721 cccacctcac ttccctccct tcccgctgag cttcccttgg acactaactc ttcccagatg 781 atgacaatga aattagtgcc tgttttcttg caaatttagc acttggaaca tttaaagaaa 841 ggtctatgct gtcatatggg gtttattggg aactatcctc ctggccccac cctgcccctt 901 ctttttggtt ttgacatcat tcatttccac ctgggaattt ctggtgccat gccagaaaga 961 atgaggaacc tgtattcctc ttcttcgtga taatataatc tctatttttt taggaaaaaa 1021 a // LOCUS HSTIR 2320 bp RNA PRI 21-MAR-1995 DEFINITION Human mRNA for lymphocyte glycoprotein T1/Leu-1. ACCESSION X04391 NID g37186 KEYWORDS cell surface glycoprotein; glycoprotein; T-cell glycoprotein T1/leu-1. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2320) AUTHORS Jones,N.H., Clabby,M.L., Dialynas,D.P., Huang,H.J., Herzenberg,L.A. and Strominger,J.L. TITLE Isolation of complementary DNA clones encoding the human lymphocyte glycoprotein T1/Leu-1 JOURNAL Nature 323 (6086), 346-349 (1986) MEDLINE 87014786 FEATURES Location/Qualifiers source 1..2320 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 73..144 /note="put. signal peptide (AA -24 to -1)" CDS 73..1560 /note="put. precursor polypeptide" /codon_start=1 /db_xref="PID:g37187" /db_xref="SWISS-PROT:P06127" /translation="MPMGSLQPLATLYLLGMLVASCLGRLSWYDPDFQARLTRSNSKC QGQLEVYLKDGWHMVCSQSWGRSSKQWEDPSQASKVCQRLNCGVPLSLGPFLVTYTPQ SSIICYGQLGSFSNCSHSRNDMCHSLGLTCLEPQKTTPPTTRPPPTTTPEPTAPPRLQ LVAQSGGQHCAGVVEFYSGSLGGTISYEAQDKTQDLENFLCNNLQCGSFLKHLPETEA GRAQDPGEPREHQPLPIQWKIQNSSCTSLEHCFRKIKPQKSGRVLALLCSGFQPKVQS RLVGGSSICEGTVEVRQGAQWAALCDSSSARSSLRWEEVCREQQCGSVNSYRVLDAGD PTSRGLFCPHQKLSQCHELWERNSYCKKVFVTCQDPNPAGLAAGTVASIILALVLLVV LLVVCGPLAYKKLVKKFRQKKQRQWIGPTGMNQNMSFHRNHTATVRSHAENPTASHVD NEYSQPPRNSRLSAYPALEGVLHRSSMQPDNSSDSDYDLHGAQRL" misc_feature 79 /note="pot. alternate start codon" misc_feature 121 /note="pot. alternate start codon" mat_peptide 145..1557 /note="put. mature peptide (AA 1-471)" misc_feature 418..420 /note="pot. N-glycosylation site" misc_feature 793..795 /note="pot. N-glycosylation site" misc_feature 1348..1350 /note="pot. N-glycosylation site" misc_feature 1366..1368 /note="pot. N-glycosylation site" misc_feature 1513..1515 /note="pot. N-glycosylation site" BASE COUNT 504 a 746 c 637 g 433 t ORIGIN 1 gagatacccg gccagacacc ctcacctgcg gtgcccagct gcccaggctg aggcaagaga 61 aggccagaaa ccatgcccat ggggtctctg caaccgctgg ccaccttgta cctgctgggg 121 atgctggtcg cttcctgcct cggacggctc agctggtatg acccagattt ccaggcaagg 181 ctcacccgtt ccaactcgaa gtgccagggc cagctggagg tctacctcaa ggacggatgg 241 cacatggttt gcagccagag ctggggccgg agctccaagc agtgggagga ccccagtcaa 301 gcgtcaaaag tctgccagcg gctgaactgt ggggtgccct taagccttgg ccccttcctt 361 gtcacctaca cacctcagag ctcaatcatc tgctacggac aactgggctc cttctccaac 421 tgcagccaca gcagaaatga catgtgtcac tctctgggcc tgacctgctt agaaccccag 481 aagacaacac ctccaacgac aaggcccccg cccaccacaa ctccagagcc cacagctcct 541 cccaggctgc agctggtggc acagtctggc ggccagcact gtgccggcgt ggtggagttc 601 tacagcggca gcctgggggg taccatcagc tatgaggccc aggacaagac ccaggacctg 661 gagaacttcc tctgcaacaa cctccagtgt ggctccttct tgaagcatct gccagagact 721 gaggcaggca gagcccaaga cccaggggag ccacgggaac accagccctt gccaatccaa 781 tggaagatcc agaactcaag ctgtacctcc ctggagcatt gcttcaggaa aatcaagccc 841 cagaaaagtg gccgagttct tgccctcctt tgctcaggtt tccagcccaa ggtgcagagc 901 cgtctggtgg ggggcagcag catctgtgaa ggcaccgtgg aggtgcgcca gggggctcag 961 tgggcagccc tgtgtgacag ctcttcagcc aggagctcgc tgcggtggga ggaggtgtgc 1021 cgggagcagc agtgtggcag cgtcaactcc tatcgagtgc tggacgctgg tgacccaaca 1081 tcccgggggc tcttctgtcc ccatcagaag ctgtcccagt gccacgaact ttgggagaga 1141 aattcctact gcaagaaggt gtttgtcaca tgccaggatc caaaccccgc aggcctggcc 1201 gcaggcacgg tggcaagcat catcctggcc ctggtgctcc tggtggtgct gctggtcgtg 1261 tgcggccccc ttgcctacaa gaagctagtg aagaaattcc gccagaagaa gcagcgccag 1321 tggattggcc caacgggaat gaaccaaaac atgtctttcc atcgcaacca cacggcaacc 1381 gtccgatccc atgctgagaa ccccacagcc tcccacgtgg ataacgaata cagccaacct 1441 cccaggaact cccgcctgtc agcttatcca gctctggaag gggttctgca tcgctcctcc 1501 atgcagcctg acaactcctc cgacagtgac tatgatctgc atggggctca gaggctgtaa 1561 agaactggga tccatgagca aaaagccgag agccagacct gtttgtcctg agaaaactgt 1621 ccgctcttca cttgaaatca tgtccctatt tctaccccgg ccagaacatg gacagaggcc 1681 agaagccttc cggacaggcg ctgctgcccc gagtggcagg ccagctcaca ctctgctgca 1741 caacagctcg gccgcccctc cacttgtgga agctgtggtg ggcagagccc caaaacaagc 1801 agccttccaa ctagagactc gggggtgtct gaagggggcc ccctttccct gcccgctggg 1861 gagcggcgtc tcagtgaaat cggctttctc ctcagactct gtccctggta aggagtgaca 1921 aggaagctca cagctgggcg agtgcatttt gaatagtttt ttgtaagtag tgcttttcct 1981 ccttcctgac aaatcgagcg ctttggcctc ttctgtgcag catccacccc tgcggatccc 2041 tctggggagg acaggaaggg gactcccgga gacctctgca gccgtggtgg tcagaggctg 2101 ctcacctgag cacaaagaca gctctgcaca ttcaccgcag ctgccagcca ggggtctggg 2161 tgggcaccac cctgacccac agcgtcacct cactccctct gtcttatgac tcccctcccc 2221 aaccccctca tctaaagaca ccttcctttc cactggctgt caagccacag ggcaccagtg 2281 ccacccaggg ccctgcacaa aggggcgcct agtaaacctt // LOCUS HSTM30R 2049 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for fibroblast tropomyosin TM30 (pl). ACCESSION X05276 NID g37201 KEYWORDS Alu repetitive sequence; tropomyosin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2049) AUTHORS MacLeod,A.R., Talbot,K., Smillie,L.B. and Houlker,C. TITLE Characterization of a cDNA defining a gene family encoding TM30p1, a human fibroblast tropomyosin JOURNAL J. Mol. Biol. 194 (1), 1-10 (1987) MEDLINE 87283902 COMMENT Data kindly reviewed (23-NOV-1987) by MAC LEOD A.R. FEATURES Location/Qualifiers source 1..2049 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="fibroblast" /cell_line="MRC-5" /clone="1401/32" CDS 51..797 /note="tropomyosin (AA 1-248)" /codon_start=1 /db_xref="PID:g37202" /db_xref="SWISS-PROT:P07226" /translation="MAGLNSLEAVKRKIQALQQQADEAEDRAQGLQRELDGERERREK AEGDVAALNRRIQLVEEELDRAQERLATALQKLEEAEKAADESERGMKVIENRAMKDE EKMEIQEMQLKEAKHIAEEADRKYEEVARKLVILEGELERAEERAEVSELKCGDLEEE LKNVTNNLKSLEAASEKYSEKEDKYEEEIKLLSDKLKEAETRAEFAERTVAKLEKTID DLEEKLAQAKEENVGLHQTLDQTLNELNCI" repeat_region 1507..1519 /note="direct repeat 1" repeat_region 1520..1805 /note="Alu repetitive sequence" repeat_region 1806..1818 /note="direct repeat 1" BASE COUNT 602 a 454 c 532 g 461 t ORIGIN 1 gagcccagcc gagcgtccgc cgctgcccgt gcgcctctgc gctccgcgcc atggccggcc 61 tcaactccct ggaggcggtg aaacgcaaga tccaggccct gcagcagcag gcggacgagg 121 cggaagaccg cgcgcagggc ctgcagcggg agctggacgg cgagcgcgag cggcgcgaga 181 aagctgaagg tgatgtggcc gccctcaacc gacgcatcca gctcgttgag gaggagttgg 241 acagggctca ggaacgactg gccacggccc tgcagaagct ggaggaggca gaaaaagctg 301 cagatgagag tgagagagga atgaaggtga tagaaaaccg ggccatgaag gatgaggaga 361 agatggagat tcaggagatg cagctcaaag aggccaagca cattgcggaa gaggctgacc 421 gcaaatacga ggaggtagct cgtaagctgg tcatcctgga gggtgagctg gagagggcag 481 aggagcgtgc ggaggtgtct gaactaaaat gtggtgacct ggaagaagaa ctcaagaatg 541 ttactaacaa tctgaaatct ctggaggctg catctgaaaa gtattctgaa aaggaggaca 601 aatatgaaga agaaattaaa cttctgtctg acaaactgaa agaggctgag acccgtgctg 661 aatttgcaga gagaacggtt gcaaaactgg aaaagacaat tgatgacctg gaagagaaac 721 ttgcccaggc caaagaagag aacgtgggct tacatcagac actggatcag acactaaacg 781 aacttaactg tatataagca aaacagaaga gtcttgttcc aacagaaact ctggagctcc 841 gtgggtcttt ctcttctctt gtaagaagtt ccttttgtta ttgccatctt cgctttgctg 901 gaaatgtcaa gcaaattatg aatacatgac caaatatttt gtatcggaga agctttgagc 961 accagttaaa tctcattcct tccctttttt tttcaaatgg caccagcttt ttcagctctc 1021 ttattttttc cttaagtagc atttattcct aaggtaggca gggtatttcc tagtaagcat 1081 actttcttaa gacggaggcc atttggttcc tgggagaata ggcagcccca cactttgaag 1141 aatacagacc ccagtatcta gtcgtggata taattaaaac gctgaagacc ataacctttt 1201 gggtcaactg ttggtcaaac tataggagag accagggacc atcacatggg tagggatttt 1261 ccatccagag ccaataaaag gactggtggg ggccgggggt ggctattgtg ggaagtcata 1321 acccacagat agatcaacct aagaatcctg gcccttctcc actctccacc atgcaggaca 1381 aacatcttct caagcagtca acgtagaatg cttgggaaat agtcataatt acccacatat 1441 agtaattaat agatggtaat taattgatcc ttgatgtgat gttcttttgc atatttcctt 1501 cattctaaag ttgttccctg gccgggagcg tttgctttcg cctgtaatcc caacactttg 1561 ggaggccagg acagatcact tgaggtcagg agttcgagac cagcccagcc aacatggcga 1621 aaccatgtct ctactaaaaa tacaaaaatt atggtgacgc ctgcctgtag tcccagctac 1681 tcgggaggct gaggcaggag gatcgcttga acccaggaag tggagactgc agtgagccga 1741 tatcgcacca cagcgctcca gcctggtcga cagagtgaga ctccatctca agaaaaaata 1801 aaaataaagt tgttctctga agagcaaatg tctcattcca gtaatgaccc actcagcagg 1861 aatatggtgg agttcagtcc aattcaggtc agccatatcc aaaagaccac aagtcattac 1921 taagttgagc aaaagagttt ttatctatta gcagaaaggg cctctctggc agcagagatt 1981 aaaaactggc ccaacttcat ttccatactt cagggaacag caaattgagg atttacttat 2041 ctaggactt // LOCUS HSTMP21I 690 bp RNA PRI 16-AUG-1996 DEFINITION H.sapiens mRNA for transmembrane protein Tmp21-I. ACCESSION X97442 NID g1359885 KEYWORDS TMP21-I gene; transmembrane protein type I. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 690) AUTHORS Blum,R., Feick,P., Puype,M., Vandekerckhove,J., Klengel,R., Nastainczyk,W. and Schulz,I. TITLE Tmp21 and p24A, two type I proteins enriched in pancreatic microsomal membranes, are members of a protein family involved in vesicular trafficking JOURNAL J. Biol. Chem. 271 (29), 17183-17189 (1996) MEDLINE 96291865 REFERENCE 2 (bases 1 to 690) AUTHORS Blum,R. TITLE Direct Submission JOURNAL Submitted (22-APR-1996) R. Blum, Universitt des Saarlandes, 2. Physiologisches Institut, Geb.58, D-66421 Homburg, FRG FEATURES Location/Qualifiers source 1..690 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="T1.1" /clone_lib="lambda ZAP II" /dev_stage="fetal" /tissue_type="brain" gene 12..671 /gene="tmp21-I" CDS 12..671 /gene="tmp21-I" /note="type I topology; enriched in microsomal membranes" /codon_start=1 /product="transmembrane protein" /db_xref="PID:e239969" /db_xref="PID:g1359886" /translation="MSGLSGPPARRGPFPLALLLLFLLGPRLVLAISFHLPINSRKCL REEIHKDLLVTGAYEISDQSGGAGGLRSHLKITDSAGHILYSKEDATKGKFAFTTEDY DMFEVCFESKGTGRIPDQLVILDMKHGVEAKNYEEIAKVEKLKPLEVELRRLEDLSES IVNDFAYMKKREEEMRDTNESTNTRVLYFSIFSMFCLIGLATWQVFYLRRFFKAKKLI E" BASE COUNT 171 a 174 c 173 g 172 t ORIGIN 1 tctccagcac catgtctggt ttgtctggcc caccagcccg gcgcggccct tttccgttag 61 cgttgctgct tttgttcctg ctcggcccca gattggtcct tgccatctcc ttccatctgc 121 ccattaactc tcgcaagtgc ctccgtgagg agattcacaa ggacctgcta gtgactggcg 181 cgtacgagat ctccgaccag tctgggggcg ctggcggcct gcgcagccac ctcaagatca 241 cagattctgc tggccatatt ctctactcca aagaggatgc aaccaagggg aaatttgcct 301 ttaccactga agattatgac atgtttgaag tgtgttttga gagcaaggga acagggcgga 361 tacctgacca actcgtgatc ctagacatga agcatggagt ggaggcgaaa aattacgaag 421 agattgcaaa agttgagaag ctcaaaccat tagaggtaga gctgcgacgc ctagaagacc 481 tttcagaatc tattgttaat gactttgcct acatgaagaa gagagaagag gagatgcgtg 541 ataccaacga gtcaacaaac actcgggtcc tatacttcag catcttttca atgttctgtc 601 tcattggact agctacctgg caggtcttct acctgcgacg cttcttcaag gccaagaaat 661 tgattgagta atgaatgagg tgcctggcct // LOCUS HSTMPKMR 1000 bp RNA PRI 11-SEP-1995 DEFINITION Human mRNA for thymidylate kinase EC 2.7.4.9. ACCESSION X54729 NID g37205 KEYWORDS thymidylate kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1000) AUTHORS Su,J.Y. TITLE Direct Submission JOURNAL Submitted (02-OCT-1990) Su J.-Y., University of Colorado Health Sciences Center, Dept. of Biochemistry, Biophysics & Genetics, Box B-121, 4200 E. 9th Ave, Denver, CO 80262, USA REFERENCE 2 (bases 1 to 1000) AUTHORS Su,J.Y. and Sclafani,R.A. TITLE Molecular cloning and expression of the human deoxythymidylate kinase gene in yeast JOURNAL Nucleic Acids Res. 19 (4), 823-827 (1991) MEDLINE 91204436 COMMENT Data kindly reviewed (25-FEB-1991) by Su J.-Y. FEATURES Location/Qualifiers source 1..1000 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HepG2" /clone_lib="cDNA" mRNA <1..>1000 /gene="TMPK" gene 1..1000 /gene="TMPK" mat_peptide 28..660 /gene="TMPK" /product="thymidylate kinase" CDS 28..663 /gene="TMPK" /codon_start=1 /product="thymidylate kinase" /db_xref="PID:g37206" /db_xref="SWISS-PROT:P23919" /translation="MAARRGALIVLEGVDRAGKSTQSRKLVEALSRGPPPELLRFPER STEIGKLLSSYLQKKSDVEDHSVHLLFSANRWEQVPLIKEKLSQGVTLVVDRYAFSGV AFTGAKENFSLDWCKQPDVGLPKPDLVLFLQLQLADAAKRGAFGHERYENGAFQERAL RCFHQLMKDTTLNWKMVDASKRLEAVHEELRVLSEDAIRTATEKPLGELWK" BASE COUNT 208 a 279 c 306 g 207 t ORIGIN 1 aattcgccag cggcgcggtg gacagtcatg gcggcccggc gcggggctct catagtgctg 61 gagggcgtgg accgcgccgg gaagagcacg cagagccgca agctggtgga agcgctgtcg 121 cgcgggccac cgcccgaact gctccggttc ccggaaagat caactgaaat cggcaaactt 181 ctgagttcct acttgcaaaa gaaaagtgac gtggaggatc actcggtgca cctgcttttt 241 tctgcaaatc gctgggaaca agtgccgtta attaaggaaa agttgagcca gggcgtgacc 301 ctcgtcgtgg acagatacgc attttctggt gtggccttca ccggtgccaa ggagaatttt 361 tccctagact ggtgtaaaca gccagacgtg ggccttccca aacccgacct ggtcctgttc 421 ctccagttac agctggcgga tgctgccaag cggggagcgt ttggccatga gcgctatgag 481 aacggggctt tccaggagcg ggcgctccgg tgtttccacc agctcatgaa agacacgact 541 ttgaactgga agatggtgga tgcttccaaa agactcgaag ctgtccatga ggaactccgc 601 gtgctctctg aggacgccat ccgcactgcc acagagaagc cgctggggga gctatggaag 661 tgacccaagg ctgcccactg gagacgcctc tccctgcagt cccccgagag gtgggagact 721 cgcggaaggc cccgtcccca gcggagtcca gaccccacaa cttcaggagc tctttcccgg 781 cagcagagat ctgcaggctg cctcttctgc cccggagctg gggtgcactg gggacccccg 841 tggtggggac cttggcagtg tggacatgag cagagcgatg gagcagtctc ctgccctctc 901 ccctgtcctg atggcactct gttgtatttt cttactgaag ttcagtgata actctgagca 961 gtttcattgt gatcactgta aatggtaatc agttggaatt // LOCUS HSTNCS 685 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for slow skeletal troponin C (TnC). ACCESSION X07897 NID g37207 KEYWORDS troponin; troponin C. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 685) AUTHORS Gahlmann,R., Wade,R., Gunning,P. and Kedes,L. TITLE Differential expression of slow and fast skeletal muscle troponin C. Slow skeletal muscle troponin C is expressed in human fibroblasts JOURNAL J. Mol. Biol. 201 (2), 379-391 (1988) MEDLINE 88332973 COMMENT Data kindly reviewed (02-SEP-1988) by GAHLMANN R. FEATURES Location/Qualifiers source 1..685 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="muscle" /clone="TC1" CDS 27..512 /note="troponin C (AA 1-161)" /codon_start=1 /db_xref="PID:g37208" /db_xref="SWISS-PROT:P02590" /translation="MDDIYKAAVEQLTEEQKNEFKAAFDIFVLGAEDGCISTKELGKV MRMLGQNPTPEELQEMIDEVDEDGSGTVDFDEFLVMMVRCMKDDSKGKSEEELSDLFR MFDKNADGYIDLDELKIMLQATGETITEDDIEELMKDGDKNNDGRIDYDEFLEFMKGV E" misc_feature 668..673 /note="polyA signal" polyA_site 685 /note="polyA site" BASE COUNT 169 a 174 c 217 g 125 t ORIGIN 1 agcaagctgt cctgtgagcc gccagcatgg atgacatcta caaggctgcg gtagagcagc 61 tgacagaaga gcagaaaaat gagttcaagg cagccttcga catcttcgtg ctgggcgctg 121 aggatggctg catcagcacc aaggagctgg gcaaggtgat gaggatgctg ggccagaacc 181 ccacccctga ggagctgcag gagatgatcg atgaggtgga cgaggacggc agcggcacgg 241 tggactttga tgagttcctg gtcatgatgg ttcggtgcat gaaggacgac agcaaaggga 301 aatctgagga ggagctgtct gacctcttcc gcatgtttga caaaaatgct gatggctaca 361 tcgacctgga tgagctgaag ataatgctgc aggctacagg cgagaccatc acggaggacg 421 acatcgagga gctcatgaag gacggagaca agaacaacga cggccgcatc gactatgatg 481 agttcctgga gttcatgaag ggtgtggagt agatgctgac cttcacccag agctgcctat 541 gcccagcctc caactccagc tgagtcctgg ggttggggag ggggtcgggg tcccaggacc 601 tgagcctggc catgtcctca accccaaatc ccccgactcc ctccccagat ctgtcctggg 661 ggatgcaaat aaagcctgct ctccc // LOCUS HSTNFR1A 2161 bp RNA PRI 18-JAN-1993 DEFINITION H.sapiens TNF-R mRNA for tumor necrosis factor receptor type 1. ACCESSION X55313 NID g37223 KEYWORDS TNF-R gene; tumor necrosis factor receptor 1. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2161) AUTHORS Nophar,Y., Kemper,O., Brakebusch,C., Englemann,H., Zwang,R., Aderka,D., Holtmann,H. and Wallach,D. TITLE Soluble forms of tumor necrosis factor receptors (TNF-Rs). The cDNA for the type I TNF-R, cloned using amino acid sequence data of its soluble form, encodes both the cell surface and a soluble form of the receptor JOURNAL EMBO J. 9 (10), 3269-3278 (1990) MEDLINE 91006021 FEATURES Location/Qualifiers source 1..2161 /organism="Homo sapiens" /db_xref="taxon:9606" gene 256..2161 /gene="TNF-R" CDS 256..1623 /gene="TNF-R" /codon_start=1 /product="tumor necrosis factor receptor type 1" /db_xref="PID:g37224" /db_xref="SWISS-PROT:P19438" /translation="MGLSTVPDLLLPLVLLELLVGIYPSGVIGLVPHLGDREKRDSVC PQGKYIHPQNNSICCTKCHKGTYLYNDCPGPGQDTDCRECESGSFTASENHLRHCLSC SKCRKEMGQVEISSCTVDRDTVCGCRKNQYRHYWSENLFQCFNCSLCLNGTVHLSCQE KQNTVCTCHAGFFLRENECVSCSNCKKSLECTKLCLPQIENVKGTEDSGTTVLLPLVI FFGLCLLSLLFIGLMYRYQRWKSKLYSIVCGKSTPEKEGELEGTTTKPLAPNPSFSPT PGFTPTLGFSPVPSSTFTSSSTYTPGDCPNFAAPRREVAPPYQGADPILATALASDPI PNPLQKWEDSAHKPQSLDTDDPATLYAVVENVPPLRWKEFVRRLGLSDHEIDRLELQN GRCLREAQYSMLATWRRRTPRREATLELLGRVLRDMDLLGCLEDIEEALCGPAALPPA PSLLR" repeat_region 385..504 /gene="TNF-R" repeat_region 505..633 /gene="TNF-R" repeat_region 634..756 /gene="TNF-R" repeat_region 757..857 /gene="TNF-R" polyA_signal 2145..2150 /gene="TNF-R" /note="putative" polyA_site 2161 /gene="TNF-R" BASE COUNT 459 a 642 c 604 g 456 t ORIGIN 1 cggcccagtg atcttgaacc ccaaaggcca gaactggagc ctcagtccag agaattctga 61 gaaaattaaa gcagagagga ggggagagat cactgggacc aggccgtgat ctctatgccc 121 gagtctcaac cctcaactgt caccccaagg cacttgggac gtcctggaca gaccgagtcc 181 cgggaagccc cagcactgcc gctgccacac tgccctgagc ccaaatgggg gagtgagagg 241 ccatagctgt ctggcatggg cctctccacc gtgcctgacc tgctgctgcc gctggtgctc 301 ctggagctgt tggtgggaat atacccctca ggggttattg gactggtccc tcacctaggg 361 gacagggaga agagagatag tgtgtgtccc caaggaaaat atatccaccc tcaaaataat 421 tcgatttgct gtaccaagtg ccacaaagga acctacttgt acaatgactg tccaggcccg 481 gggcaggata cggactgcag ggagtgtgag agcggctcct tcaccgcttc agaaaaccac 541 ctcagacact gcctcagctg ctccaaatgc cgaaaggaaa tgggtcaggt ggagatctct 601 tcttgcacag tggaccggga caccgtgtgt ggctgcagga agaaccagta ccggcattat 661 tggagtgaaa accttttcca gtgcttcaat tgcagcctct gcctcaatgg gaccgtgcac 721 ctctcctgcc aggagaaaca gaacaccgtg tgcacctgcc atgcaggttt ctttctaaga 781 gaaaacgagt gtgtctcctg tagtaactgt aagaaaagcc tggagtgcac gaagttgtgc 841 ctaccccaga ttgagaatgt taagggcact gaggactcag gcaccacagt gctgttgccc 901 ctggtcattt tctttggtct ttgcctttta tccctcctct tcattggttt aatgtatcgc 961 taccaacggt ggaagtccaa gctctactcc attgtttgtg ggaaatcgac acctgaaaaa 1021 gagggggagc ttgaaggaac tactactaag cccctggccc caaacccaag cttcagtccc 1081 actccaggct tcacccccac cctgggcttc agtcccgtgc ccagttccac cttcacctcc 1141 agctccacct atacccccgg tgactgtccc aactttgcgg ctccccgcag agaggtggca 1201 ccaccctatc agggggctga ccccatcctt gcgacagccc tcgcctccga ccccatcccc 1261 aacccccttc agaagtggga ggacagcgcc cacaagccac agagcctaga cactgatgac 1321 cccgcgacgc tgtacgccgt ggtggagaac gtgcccccgt tgcgctggaa ggaattcgtg 1381 cggcgcctag ggctgagcga ccacgagatc gatcggctgg agctgcagaa cgggcgctgc 1441 ctgcgcgagg cgcaatacag catgctggcg acctggaggc ggcgcacgcc gcggcgcgag 1501 gccacgctgg agctgctggg acgcgtgctc cgcgacatgg acctgctggg ctgcctggag 1561 gacatcgagg aggcgctttg cggccccgcc gccctcccgc ccgcgcccag tcttctcaga 1621 tgaggctgcg cccctgcggg cagctctaag gaccgtcctg cgagatcgcc ttccaacccc 1681 acttttttct ggaaaggagg ggtcctgcag gggcaagcag gagctagcag ccgcctactt 1741 ggtgctaacc cctcgatgta catagctttt ctcagctgcc tgcgcgccgc cgacagtcag 1801 cgctgtgcgc gcggagagag gtgcgccgtg ggctcaagag cctgagtggg tggtttgcga 1861 ggatgaggga cgctatgcct catgcccgtt ttgggtgtcc tcaccagcaa ggctgctcgg 1921 gggcccctgg ttcgtccctg agcctttttc acagtgcata agcagttttt tttgtttttg 1981 ttttgttttg ttttgttttt aaatcaatca tgttacacta atagaaactt ggcactcctg 2041 tgccctctgc ctggacaagc acatagcaag ctgaactgtc ctaaggcagg ggcgagcacg 2101 gaacaatggg gccttcagct ggagctgtgg acttttgtac atacactaaa attctgaagt 2161 t // LOCUS HSTNT4 1043 bp RNA PRI 29-JAN-1996 DEFINITION H.sapiens HTNT4 mRNA for cardiac troponin T. ACCESSION X79857 NID g587431 KEYWORDS troponin T. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1043) AUTHORS Townsend,P.J., Barton,P.J., Yacoub,M.H. and Farza,H. TITLE Molecular cloning of human cardiac troponin T isoforms: expression in developing and failing heart JOURNAL J. Mol. Cell. Cardiol. 27 (10), 2223-2236 (1995) MEDLINE 96129582 REFERENCE 2 (bases 1 to 1042) AUTHORS Farza,H. TITLE Direct Submission JOURNAL Submitted (23-JUN-1994) H. Farza, National Heart & Lung Inst, Dovehouse Street, London, SW3 6LY, UK FEATURES Location/Qualifiers source 1..1043 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="15 weeks gestation" /tissue_type="cardiac muscle" /clone="TNT5-1/HTNT4" /chromosome="1q" CDS 112..879 /codon_start=1 /product="troponin T" /db_xref="PID:g587432" /db_xref="SWISS-PROT:P45379" /translation="MSDIEEVVEEYEEEEQEEAAVEEEEDWREDEDEQEEAAEEDAEA EAETEETRAEDEEEEEAKEAEDGPMEESKPKPRSFMPNLVPPKIPDGERVDFDERRRA ERAEQQRIRNEREKERQNRLAEERARREEEENRRKAEDEARKKKALSNMMHFGGYIQK TERKSGKRQTEREKKKKILAERRKVLAIDHLNEDQLREKAKELWQSIYNLEAEKFDLQ EKFKQQKYEINVLRNRINDNQKVSKTRGKAKVTGRWK" polyA_signal 1030..1039 BASE COUNT 301 a 247 c 351 g 144 t ORIGIN 1 ccccatgaca gccgcagcct gctccccacc tgcaaacttc agccccttct gggcctcgtc 61 cgcgccccca ggatctgtcg gcagctgctg ttctgaggga gagcagagac catgtctgac 121 atagaagagg tggtggaaga gtacgaggag gaggagcagg aagaagcagc tgttgaagaa 181 gaggaggact ggagagagga cgaagacgag caggaggagg cagcggaaga ggatgctgaa 241 gcagaggctg agaccgagga gaccagggca gaagatgaag aagaagagga agcaaaggag 301 gctgaagatg gcccaatgga ggagtccaaa ccaaagccca ggtcgttcat gcccaacttg 361 gtgcctccca agatccccga tggagagaga gtggactttg atgagagacg tcgggcagag 421 cgggccgagc agcagcgcat ccggaatgag cgggagaagg agcggcagaa ccgcctggct 481 gaagagaggg ctcgacgaga ggaggaggag aacaggagga aggctgagga tgaggcccgg 541 aagaagaagg ctttgtccaa catgatgcat tttgggggtt acatccagaa gacagagcgg 601 aaaagtggga agaggcagac tgagcgggaa aagaagaaga agattctggc tgagaggagg 661 aaggtgctgg ccattgacca cctgaatgaa gatcagctga gggagaaggc caaggagctg 721 tggcagagca tctataactt ggaggcagag aagttcgacc tgcaggagaa gttcaagcag 781 cagaaatatg agatcaatgt tctccgaaac aggatcaacg ataaccagaa agtctccaag 841 acccgcggga aggctaaagt caccgggcgc tggaaataga gcctggcctc cttcaccaaa 901 gatctgctcc tcgctcgcac ctgcctccgg cctgcactcc cccagttccc gggccctcct 961 gggcacccca ggcagctcct gtttggaaat ggggagctgg cctaggtggg agccaccact 1021 ccacaccagt aataaaaagc cac // LOCUS HSTOPIIB 4866 bp RNA PRI 30-NOV-1992 DEFINITION H.sapiens topIIb mRNA for topoisomerase IIb. ACCESSION X68060 NID g37230 KEYWORDS DNA topoisomerase II. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4866) AUTHORS Jenkins,J.R. TITLE Direct Submission JOURNAL Submitted (14-AUG-1992) J.R. Jenkins, ICRF Inst of Mol Medicine, John Radcliffe Hospital, Headington, Oxford OX3 9DU, UK REFERENCE 2 (bases 1 to 4866) AUTHORS Jenkins,J.R., Ayton,P., Jones,T., Davies,S.L., Simmons,D.L., Harris,A.L., Sheer,D. and Hickson,I.D. TITLE Isolation of cDNA clones encoding the beta isozyme of human DNA topoisomerase II and localisation of the gene to chromosome 3p24 JOURNAL Nucleic Acids Res. 20 (21), 5587-5592 (1992) MEDLINE 93087165 FEATURES Location/Qualifiers source 1..4866 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="EBV-transformed" /clone_lib="NBC" /clone="pT2b-1" /chromosome="3p24" gene 1..4866 /gene="topIIb" CDS 1..4866 /gene="topIIb" /note="antitumor drug target" /codon_start=1 /product="DNA topoisomerase II" /db_xref="PID:g37231" /db_xref="SWISS-PROT:Q02880" /translation="MAKSGGCGAGAGVGGGNGALTWVNNAAKKEESETANKNDSSKKL SVERVYQKKTQLEHILLRPDTYIGSVEPLTQFMWVYDEDVGMNCREVTFVPGLYKIFD EILVNAADNKQRDKNMTCIKVSIDPESNIISIWNNGKGIPVVEHKVEKVYVPALIFGQ LLTSSNYDDDEKKVTGGRNGYGAKLCNIFSTKFTVETACKEYKHSFKQTWMNNMMKTS EAKIKHFDGEDYTCITFQPDLSKFKMEKLDKDIVALMTRRAYDLAGSCRGVKVMFNGK KLPVNGFRSYVDLYVKDKLDETGVALKVIHELANERWDVCLTLSEKGFQQISFVNSIA TTKGGRHVDYVVDQVVGKLIEVVKKKNKAGVSVKPFQVKNHIWVFINCLIENPTFDSQ TKENMTLQPKSFGSKCQLSEKFFKAASNCGIVESILNWVKFKAQTQLNKKCSSVKYSK IKGIPKLDDANDAGGKHSLECTLILTEGDSAKSLAVSGLGVIGRDRYGVFPLRGKILN VREASHKQIMENAEINNIIKIVGLQYKKSYDDAESLKTLRYGKIMIMTDQDQDGSHIK GLLINFIHHNWPSLLKHGFLEEFITPIVKASKNKQELSFYSIPEFDEWKKHIENQKAW KIKYYKGLGTSTAKEAKEYFADMERHRILFRYAGPEDDAAITLAFSKKKIDDRKEWLT NFMEDRRQRRLHGLPEQFLYGTATKHLTYNDFINKELILFSNSDNERSIPSLVDGFKP GQRKVLFTCFKRNDKREVKVAQLAGSVAEMSAYHHGEQALMMTIVNLAQNFVGSNNIN LLQPIGQFGTRLHGGKDAASPRYIFTMLSTLARLLFPAVDDNLLKFLYDDNQRVEPEW YIPIIPMVLINGAEGIGTGWACKLPNYDAREIVNNVRRMLDGLDPHPMLPNYKNFKGT IQELGQNQYAVSGEIFVVDRNTVEITELPVRTWTQVYKEQVLEPMLNGTDKTPALISD YKEYHTDTTVKFVVKMTEEKLAQAEAAGLHKVFKLQTTLTCNSMVLFDHMGCLKKYET VQDILKEFFDLRLSYYGLRKEWLVGMLGAESTKLNNQARFILEKIQGKITIENRSKKD LIQMLVQRGYESDPVKAWKEAQEKAAEEDETQNQHDDSSSDSGTPSGPDFNYILNMSL WSLTKEKVEELIKQRDAKGREVNDLKRKSPSDLWKEDLAAFVEELDKVESQEREDVLA GMSGKAIKGKVGKPKVKKLQLEETMPSPYGRRIIPEITAMKADASKKLLKKKKGDLDT AAVKVEFDEEFSGAPVEGAGEEALTPSVPINKGPKPKREKKEPGTRVRKTPTSSGKPS AKKVKKRNPWSDDESKSESDLEETEPVVIPRDSLLRRAAAERPKYTFDFSEEEDDDAD DDDDDNNDLEELKVKASPITNDGEDEFVPSDGLDKDEYTFSPGKSKATPEKSLHDKKS QDFGNLFSFPSYSQKSEDDSAKFDSNEEDSASVFSPSFGLKQTDKVPSKTVAAKKGKP SSDTVPKPKRAPKQKKVVEAVNSDSDSEFGIPKKTTTPKGKGRGAKKRKASGSENEGD YNPGRKTSKTTSKKPKKTSFDQDSDVDIFPSDFPTEPPSLPRTGRARKEVKYFTESDE EEDDVDFAMFN" BASE COUNT 1690 a 785 c 1092 g 1299 t ORIGIN 1 atggccaagt cgggtggctg cggcgcggga gccggcgtgg gcggcggcaa cggggcactg 61 acctgggtga acaatgctgc aaaaaaagaa gagtcagaaa ctgccaacaa aaatgattct 121 tcaaagaagt tgtctgttga gagagtgtat cagaagaaga cacaacttga acacattctt 181 cttcgtcctg atacatatat tgggtcagtg gagccattga cgcagttcat gtgggtgtat 241 gatgaagatg taggaatgaa ttgcagggag gttacctttg tgccaggttt atacaagatc 301 tttgatgaaa ttttggttaa tgctgctgac aataaacaga gggataagaa catgacttgt 361 attaaagttt ctattgatcc tgaatctaac attataagca tttggaataa tgggaaaggc 421 attccagtag tagaacacaa ggtagagaaa gtttatgttc ctgctttaat ttttggacag 481 cttttaacat ccagtaacta tgatgatgat gagaaaaaag ttacaggtgg tcgtaatggt 541 tatggtgcaa aactttgtaa tattttcagt acaaagttta cagtagaaac agcttgcaaa 601 gaatacaaac acagttttaa gcagacatgg atgaataata tgatgaagac ttctgaagcc 661 aaaattaaac attttgatgg tgaagattac acatgcataa cattccaacc agatctgtcc 721 aaatttaaga tggaaaaact tgacaaggat attgtggccc tcatgactag aagggcatat 781 gatttggctg gttcgtgtag aggggtcaag gtcatgttta atggaaagaa attgcctgta 841 aatggatttc gcagttatgt agatctttat gtgaaagaca aattggatga aactggggtg 901 gccctgaaag ttattcatga gcttgcaaat gaaagatggg atgtttgtct cacattgagt 961 gaaaaaggat tccagcaaat cagctttgta aatagtattg caactacaaa aggtggacgg 1021 cacgtggatt atgtggtaga tcaagttgtt ggtaaactga ttgaagtagt taagaaaaag 1081 aacaaagctg gtgtatcagt gaaaccattt caagtaaaaa accatatatg ggtttttatt 1141 aattgcctta ttgaaaatcc aacttttgat tctcagacta aggaaaacat gactctgcag 1201 cccaaaagtt ttgggtctaa atgccagctg tcagaaaaat tttttaaagc agcctctaat 1261 tgtggcattg tagaaagtat cctgaactgg gtgaaattta aggctcagac tcagctgaat 1321 aagaagtgtt catcagtaaa atacagtaaa atcaaaggta ttcccaaact ggatgatgct 1381 aatgatgctg gtggtaaaca ttccctggag tgtacactga tattaacaga gggagactct 1441 gccaaatcac tggctgtgtc tggattaggt gtgattggac gagacagata cggagttttt 1501 ccactcaggg gcaaaattct taatgtacgg gaagcttctc ataaacagat catggaaaat 1561 gctgaaataa ataatattat taaaatagtt ggtctacaat ataagaaaag ttacgatgat 1621 gcagaatctc tgaaaacctt acgctatgga aagattatga ttatgaccga tcaggatcaa 1681 gatggttctc acataaaagg cctgcttatt aatttcatcc atcacaattg gccatcactt 1741 ttgaagcatg gttttcttga agagttcatt actcctattg taaaggcaag caaaaataag 1801 caggaacttt ccttctacag tattcctgaa tttgacgaat ggaaaaaaca tatagaaaac 1861 cagaaagcct ggaaaataaa gtactataaa ggattgggta ctagtacagc taaagaagca 1921 aaggaatatt ttgctgatat ggaaaggcat cgcatcttgt ttagatatgc tggtcctgaa 1981 gatgatgctg ccattacctt ggcatttagt aagaagaaga ttgatgacag aaaagaatgg 2041 ttaacaaatt ttatggaaga ccggagacag cgtaggctac atggcttacc agagcaattt 2101 ttatatggta ctgcaacaaa gcatttgact tataatgatt tcatcaacaa ggaattgatt 2161 ctcttctcaa actcagacaa tgaaagatct ataccatctc ttgttgatgg ctttaaacct 2221 ggccagcgga aagttttatt tacctgtttc aagaggaatg ataaacgtga agtaaaagtt 2281 gcccagttgg ctggctctgt tgctgagatg tcggcttatc atcatggaga acaagcattg 2341 atgatgacta ttgtgaattt ggctcagaac tttgtgggaa gtaacaacat taacttgctt 2401 cagcctattg gtcagtttgg aactcggctt catggtggca aagatgctgc aagccctcgt 2461 tatattttca caatgttaag cactttagca aggctacttt ttcctgctgt ggatgacaac 2521 ctccttaagt tcctttatga tgataatcaa cgtgtagagc ctgagtggta tattcctata 2581 attcccatgg ttttaataaa tggtgctgag ggcattggta ctggatgggc ttgtaaacta 2641 cccaactatg atgctaggga aattgtgaac aatgtcagac gaatgctaga tggcctggat 2701 cctcatccca tgcttccaaa ctacaaaaac tttaaaggca cgattcaaga acttggtcaa 2761 aaccagtatg cagtcagtgg tgaaatattt gtagtggaca gaaacacagt agaaattaca 2821 gagcttccag ttagaacttg gacacaggta tataaagaac aggttttaga acctatgcta 2881 aatggaacag ataaaacacc agcattaatt tctgattata aagaatatca tactgacaca 2941 actgtgaaat ttgtggtgaa aatgactgaa gagaaactag cacaagcaga agctgctgga 3001 ctgcataaag tttttaaact tcaaactact cttacttgta attccatggt actttttgat 3061 catatgggat gtctgaagaa atatgaaact gtgcaagaca ttctgaaaga attctttgat 3121 ttacgattaa gttattacgg tttacgtaag gagtggcttg tgggaatgtt gggagcagaa 3181 tctacaaagc ttaacaatca agcccgtttc attttagaga agatacaagg gaaaattact 3241 atagagaata ggtcaaagaa agatttgatt caaatgttag tccagagagg ttatgaatct 3301 gacccagtga aagcctggaa agaagcacaa gaaaaggcag cagaagagga tgaaacacaa 3361 aaccagcatg atgatagttc ctccgattca ggaactcctt caggcccaga ttttaattat 3421 attttaaata tgtctctgtg gtctcttact aaagaaaaag ttgaagaact gattaaacag 3481 agagatgcaa aagggcgaga ggtcaatgat cttaaaagaa aatctccttc agatctttgg 3541 aaagaggatt tagcggcatt tgttgaagaa ctggataaag tggaatctca agaacgagaa 3601 gatgttctgg ctggaatgtc tggaaaagca attaaaggta aagttggcaa acctaaggtg 3661 aagaaactcc agttggaaga gacaatgccc tcaccttatg gcagaagaat aattcctgaa 3721 attacagcta tgaaggcaga tgccagcaaa aagttgctga agaagaagaa gggtgatctt 3781 gatactgcag cagtaaaagt ggaatttgat gaagaattca gtggagcacc agtagaaggt 3841 gcaggagaag aggcattgac tccatcagtt cctataaata aaggtcccaa acctaagagg 3901 gagaagaagg agcctggtac cagagtgaga aaaacaccta catcatctgg taaacctagt 3961 gcaaagaaag tgaagaaacg gaatccttgg tcagatgatg aatccaagtc agaaagtgat 4021 ttggaagaaa cagaacctgt ggttattcca agagattctt tgcttaggag agcagcagcc 4081 gaaagaccta aatacacatt tgatttctca gaagaagagg atgatgatgc tgatgatgat 4141 gatgatgaca ataatgattt agaggaattg aaagttaaag catctcccat aacaaatgat 4201 ggggaagatg aatttgttcc ttcagatggg ttagataaag atgaatatac attttcacca 4261 ggcaaatcaa aagccactcc agaaaaatct ttgcatgaca aaaaaagtca ggattttgga 4321 aatctcttct catttccttc atattctcag aagtcagaag atgattcagc taaatttgac 4381 agtaatgaag aagattctgc ttctgttttt tcaccatcat ttggtctgaa acagacagat 4441 aaagttccaa gtaaaacggt agctgctaaa aagggaaaac cgtcttcaga tacagtccct 4501 aagcccaaga gagccccaaa acagaagaaa gtagtagagg ctgtaaactc tgactcggat 4561 tcagaatttg gcattccaaa gaagactaca acaccaaaag gtaaaggccg aggggcaaag 4621 aaaaggaaag catctggctc tgaaaatgaa ggcgattata accctggcag gaaaacatcc 4681 aaaacaacaa gcaagaaacc gaagaagaca tcttttgatc aggattcaga tgtggacatc 4741 ttcccctcag acttccctac tgagccacct tctctgccac gaaccggtcg ggctaggaaa 4801 gaagtaaaat attttacaga gtctgatgaa gaagaagatg atgttgattt tgcaatgttt 4861 aattaa // LOCUS HSTPMYOB 1200 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for skeletal beta-tropomyosin. ACCESSION X06825 M36268 NID g37248 KEYWORDS actin-binding protein; beta-tropomyosin; tropomyosin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1200) AUTHORS Liautard,J.P. TITLE Direct Submission JOURNAL Submitted (23-FEB-1988) Liautard J. P., CRBM du CNRS, U-249 Inserm, BP 5051, 34033 Montpelier Cedex, France REFERENCE 2 (bases 1 to 1200) AUTHORS Widada,J.S., Ferraz,C., Capony,J.P. and Liautard,J.P. TITLE Complete nucleotide sequence of the adult skeletal isoform of human skeletal muscle beta-tropomyosin JOURNAL Nucleic Acids Res. 16 (7), 3109 (1988) MEDLINE 88217530 FEATURES Location/Qualifiers source 1..1200 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 107..961 /note="beta-tropomyosin (AA 1-284)" /codon_start=1 /db_xref="PID:g37249" /db_xref="SWISS-PROT:P07951" /translation="MDAIKKKMQMLKLDKENAIDRAEQAEADKKQAEDRCKQLEEEQQ ALQKKLKGTEDEVEKYSESVKEAQEKLEQAEKKATDAEADVASLNRRIQLVEEELDRA QERLATALQKLEEAEKAADESERGMKVIENRAMKDEEKMELQEMQLKEAKHIAEDSDR KYEEVARKLVILEGELERSEERAEVAESKCGDLEEELKIVTNNLKSLEAQADKYSTKE DKYEEEIKLLEEKLKEAETRAEFAERSVAKLEKTIDDLEDEVYAQKMKYKAISEELDN ALNDITSL" BASE COUNT 330 a 320 c 368 g 182 t ORIGIN 1 cccgctccgt cctcctcgcc tgccaccggt gcacccagtc cgctcaccca gcccagtccg 61 tccggtcctc accgcctgcc ggccggccca ccccccaccg caggccatgg acgccatcaa 121 gaagaagatg cagatgctga agctggacaa ggagaacgcc atcgaccgcg ccgagcaggc 181 cgaagccgac aagaagcaag ctgaggaccg ctgcaagcag ctggaggagg agcagcaggc 241 cctccagaag aagctgaagg ggacagagga tgaggtggaa aagtattctg aatccgtgaa 301 ggaggcccag gagaaactgg agcaggccga gaagaaggcc actgatgctg aggcagatgt 361 ggcctccctg aaccgccgca ttcagctggt tgaggaggag ctggaccggg cccaggagcg 421 cctggctaca gccctgcaga agctggagga ggccgagaag gcggctgatg agagcgagag 481 aggaatgaag gtcatcgaaa accgggccat gaaggatgag gagaagatgg aactgcagga 541 gatgcagctg aaggaggcca agcacatcgc tgaggattca gaccgcaaat atgaagaggt 601 ggccaggaag ctggtgatcc tggaaggaga gctggagcgc tcggaggaga gggctgaggt 661 ggccgagagt aaatgtgggg acctagagga ggagctgaaa attgttacca acaacttgaa 721 atccctggag gcccaggcgg acaagtattc caccaaagaa gataaatatg aagaggagat 781 caaactgttg gaggagaagc tgaaggaggc tgagacccga gcagagtttg ccgagaggtc 841 tgtggcaaag ttggagaaaa ccatcgatga cctagaagat gaagtctatg cccagaagat 901 gaagtacaag gccattagcg aggaactgga caacgcactc aatgacatca cctccctctg 961 agccccacgc ccagcgtgcc acctcagctc tcttctctcc tctcctttcc attctctcta 1021 tggggagggg agagcaggca ggaggagcag aaattgccaa cattgcacag ccaggctggg 1081 agcagcctag ggagagcccc catcatgccc accacccact ctggcactgg cttcatcctt 1141 tacctatccc cttccaccct cctttgcttg cttaataaat tctgaacttg gaaaaaaaaa // LOCUS HSTPO 3027 bp RNA PRI 23-MAR-1995 DEFINITION Human mRNA for thyroperoxidase. ACCESSION Y00406 NID g37250 KEYWORDS membrane glycoprotein; thyroperoxidase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3027) AUTHORS Libert,F., Ruel,J., Ludgate,M., Swillens,S., Alexander,N., Vassart,G. and Dinsart,C. TITLE Complete nucleotide sequence of the human thyroperoxidase-microsomal antigen cDNA JOURNAL Nucleic Acids Res. 15 (16), 6735 (1987) MEDLINE 87316933 REFERENCE 2 (bases 1 to 3027) AUTHORS Libert,F. TITLE Direct Submission JOURNAL Submitted (28-OCT-1987) to the EMBL/GenBank/DDBJ databases FEATURES Location/Qualifiers source 1..3027 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="thyroid" /clone_lib="lambda gt11" /map="chromosome 2, short arm" CDS 41..2842 /note="precursor polypeptide" /codon_start=1 /db_xref="PID:g37251" /db_xref="SWISS-PROT:P07202" /translation="MRALAVLSVTLVMACTEAFFPFISRGKELLWGKPEESRVSSVLE ESKRLVDTAMYATMQRNLKKRGILSPAQLLSFSKLPEPTSGVIARAAEIMETSIQAMK RKVNLKTQQSQHPTDALSEDLLSIIANMSGCLPYMLPPKCPNTCLANKYRPITGACNN RDHPRWGASNTALARWLPPVYEDGFSQPRGWNPGFLYNGFPLPPVREVTRHVIQVSNE VVTDDDRYSDLLMAWGQYIDHDIAFTPQSTSKAAFGGGADCQMTCENQNPCFPIQLPE EARPAAGTACLPFYRSSAACGTGDQGALFGNLSTANPRQQMNGLTSFLDASTVYGSSP ALERQLRNWTSAEGLLRVHARLRDSGRAYLPFVPPRAPAACAPEPGIPGETRGPCFLA GDGRASEVPSLTALHTLWLREHNRLAAALKALNAHWSADAVYQEARKVVGALHQIITL RDYIPRILGPEAFQQYVGPYEGYDSTANPTVSNVFSTAAFRFGHATIHPLVRRLDASF QEHPDLPGLWLHQAFFSPWTLLRGGGLDPLIRGLLARPAKLQVQDQLMNEELTERLFV LSNSSTLNLASINLQRGRDHGLPGYNEWREFCGLPRLETPADLSTAIASRSVADKILD LYKHPDNIDVWLGGLAENFLPRARTGPLFACLIGKQMKALRDGDWFWWENSHVFTDAQ RRELEKHSLSRVICDNTGLTRVPMDAFQVGKFPEDFESCDSIPGMNLEAWRETFPQDD KCGFPESMENGDFVHCEESGRRVLVYSCRHGYELQGREQLTCTQEGWDFQPPLCKDVN ECADGAHPPCHASARCRNTKGGFQCLCADPYELGDDGRTCVDSGRLPRATWISMSLAA LLIGGFAGLTSTVICKWTRTGTKSTLPISETGGGTPELRCGKHQAVGTSPQRAAAQDS EQESAGMEGRDTHRLPRAL" sig_peptide 41..82 /note="put. signal peptide (AA -14 to -1)" mat_peptide 83..2839 /note="put. mature thyroperoxidase (AA 1-919)" misc_feature 425..433 /note="pot. N-glycosylation site" misc_feature 959..967 /note="pot. N-glycosylation site" misc_feature 1064..1072 /note="pot. N-glycosylation site" misc_feature 1472..1480 /note="pot. N-glycosylation site" misc_feature 1745..1753 /note="pot. N-glycosylation site" misc_feature 2579..2652 /note="pot. transmembrane region" BASE COUNT 664 a 928 c 871 g 564 t ORIGIN 1 attactcagc agtgcagttg gctgagaaga ggaaaaaaga atgagagcgc tcgctgtgct 61 gtctgtcacg ctggttatgg cctgcacaga agccttcttc cccttcatct cgagagggaa 121 agaactcctt tggggaaagc ctgaggagtc tcgtgtctct agcgtcttgg aggaaagcaa 181 gcgcctggtg gacaccgcca tgtacgccac gatgcagaga aacctcaaga aaagaggaat 241 cctttctcca gctcagcttc tgtctttttc caaacttcct gagccaacaa gcggagtgat 301 tgcccgagca gcagagataa tggaaacatc aatacaagcg atgaaaagaa aagtcaacct 361 gaaaactcaa caatcacagc atccaacgga tgctttatca gaagatctgc tgagcatcat 421 tgcaaacatg tctggatgtc tcccttacat gctgccccca aaatgcccaa acacttgcct 481 ggcgaacaaa tacaggccca tcacaggagc ttgcaacaac agagaccacc ccagatgggg 541 cgcctccaac acggccctgg cacgatggct ccctccagtc tatgaggacg gcttcagtca 601 gccccgaggc tggaaccccg gcttcttgta caacgggttc ccactgcccc cggtccggga 661 ggtgacaaga catgtcattc aagtttcaaa tgaggttgtc acagatgatg accgctattc 721 tgacctcctg atggcatggg gacaatacat cgaccacgac atcgcgttca caccacagag 781 caccagcaaa gctgccttcg ggggaggggc tgactgccag atgacttgtg agaaccaaaa 841 cccatgtttt cccatacaac tcccggagga ggcccggccg gccgcgggca ccgcctgtct 901 gcccttctac cgctcttcgg ccgcctgcgg caccggggac caaggcgcgc tctttgggaa 961 cctgtccacg gccaacccgc ggcagcagat gaacgggttg acctcgttcc tggacgcgtc 1021 caccgtgtat ggcagctccc cggccctaga gaggcagctg cggaactgga ccagtgccga 1081 agggctgctc cgcgtccacg cgcgcctccg ggactccggc cgcgcctacc tgcccttcgt 1141 gccgccacgc gcgcctgcgg cctgtgcgcc cgagcccggc atccccggag agacccgcgg 1201 gccctgcttc ctggccggag acggccgcgc cagcgaggtc ccctccctga cggcactgca 1261 cacgctgtgg ctgcgcgagc acaaccgcct ggccgcggcg ctcaaggccc tcaatgcgca 1321 ctggagcgcg gacgccgtgt accaggaggc gcgcaaggtc gtgggcgctc tgcaccagat 1381 catcaccctg agggattaca tccccaggat cctgggaccc gaggccttcc agcagtacgt 1441 gggtccctat gaaggctatg actccaccgc caaccccact gtgtccaacg tgttctccac 1501 agccgccttc cgcttcggcc atgccacgat ccacccgctg gtgaggaggc tggacgccag 1561 cttccaggag caccccgacc tgcccgggct gtggctgcac caggctttct tcagcccatg 1621 gacattactc cgtggaggtg gtttggaccc actaatacga ggccttcttg caagaccagc 1681 caaactgcag gtgcaggatc agctgatgaa cgaggagctg acggaaaggc tctttgtgct 1741 gtccaattcc agcaccttga atctggcgtc catcaacctg cagaggggcc gggaccacgg 1801 gctgccaggt tacaatgagt ggagggagtt ctgcggcctg cctcgcctgg agacccccgc 1861 tgacctgagc acagccatcg ccagcaggag cgtggccgac aagatcctgg acttgtacaa 1921 gcatcctgac aacatcgatg tctggctggg aggcttagct gaaaacttcc tccccagggc 1981 tcggacaggg cccctgtttg cctgtctcat tgggaagcag atgaaggctc tgcgggatgg 2041 tgactggttt tggtgggaga acagccacgt cttcacggat gcacagaggc gtgagctgga 2101 gaagcactcc ctgtctcggg tcatctgtga caacactggc ctcaccaggg tgcccatgga 2161 tgccttccaa gtcggcaaat tccctgaaga ctttgagtct tgtgacagca tccctggcat 2221 gaacctggag gcctggaggg aaacctttcc tcaagacgac aagtgtggct tcccagagag 2281 catggagaat ggggactttg tgcactgtga ggagtctggg aggcgcgtgc tggtgtattc 2341 ctgccggcac gggtatgagc tccaaggccg ggagcagctc acttgcaccc aggaaggatg 2401 ggatttccag cctcccctct gcaaagatgt gaacgagtgt gcagacggtg cccacccccc 2461 ctgccacgcc tctgcgaggt gcagaaacac caaaggcggc ttccagtgtc tctgcgcgga 2521 cccctacgag ttaggagacg atgggagaac ctgcgtagac tccgggaggc tccctcgggc 2581 gacttggatc tccatgtcgc tggctgctct gctgatcgga ggcttcgcag gtctcacctc 2641 gacggtgatt tgcaagtgga cacgcactgg cactaaatcc acactgccca tctcggagac 2701 aggcggagga actcccgagc tgagatgcgg aaagcaccag gccgtaggga cctcaccgca 2761 gcgggccgca gctcaggact cggagcagga gagtgctggg atggaaggcc gggatactca 2821 caggctgccg agagccctct gagggcaaag tggcaggaca ctgcagaaca gcttcatgtt 2881 cccaaaatca ccgtacgact cttttccaaa cacaggcaaa tccgaaatca gcaggacgac 2941 tgttttccca acacgggtaa atctagtacc atgtcgtagt tactctcagg catggatgaa 3001 taaatgttat agctgcattt gtctggc // LOCUS HSTPRM 7497 bp RNA PRI 18-JAN-1995 DEFINITION H.sapiens tpr mRNA. ACCESSION X66397 NID g633225 KEYWORDS Tpr gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7496) AUTHORS Mitchell,P.J. TITLE Direct Submission JOURNAL Submitted (28-OCT-1991) P.J. Mitchell, Institute of Cancer Research, Haddow Laboratories, 15 Cotswold Road, Sutton, Surrey, SM2 5NG, UK REFERENCE 2 (bases 1 to 7496) AUTHORS Mitchell,P.J. and Cooper,C.S. TITLE The human tpr gene encodes a protein of 2094 amino acids that has extensive coiled-coil regions and an acidic C-terminal domain JOURNAL Oncogene 7 (11), 2329-2333 (1992) MEDLINE 93064711 REFERENCE 3 (bases 1 to 7497) AUTHORS Byrd,D.A., Sweet,D.J., Pante,N., Konstantinov,K.N., Guan,T., Saphire,A.C., Mitchell,P.J., Cooper,C.S., Aebi,U. and Gerace,L. TITLE Tpr, a large coiled coil protein whose amino terminus is involved in activation of oncogenic kinases, is localized to the cytoplasmic surface of the nuclear pore complex JOURNAL J. Cell Biol. 127 (6 Pt 1), 1515-1526 (1994) MEDLINE 95096166 COMMENT Alternatively spliced transcript. Encodes larger protein than X63105. FEATURES Location/Qualifiers source 1..7497 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HT1080 and M426" /clone_lib="lambda Zap-cDNA and pCEV15" /clone="lambda HT 1-15" /chromosome="1" gene 298..7347 /gene="Tpr" CDS 298..7347 /gene="Tpr" /codon_start=1 /db_xref="PID:g633226" /translation="MAAVLQQVLERTELNKLPKSVQNKLEKFLADQQSEIDGLKGRHE KFKVESEQQYFEIEKRLSHSQERLVNETRECQSLRLELEKLNNQLKALTEKNKELEIA QDRNIAIQSQFTRTKEELEAEKRDLIRTNERLSQELEYLTEDVKRLNEKLKESNTTKG ELQLKLDELQASDVSVKYREKRLEQEKELLHSQNTWLNTELKTKTDELLALGREKGNE ILELKCNLENKKEEVSRLEEQMNGLKTSNEHLQKHVEDLLTKLKEAKEQQASMEEKFH NELNAHIKLSNLYKSAADDSEAKSNELTRAVEELHKLLKEAGEANKAIQDHLLEVEQS KDQMEKEMLEKIGRLEKELENANDLLSATKRKGAILSEEELAAMSPTAAAVAKIVKPG MKLTELYNAYVETQDQLLLEKLENKRINKYLDEIVKEVEAKAPILKRQREEYERAQKA VASLSVKLEQAMKEIQRLQEDTDKANKQSSVLERDNRRMEIQVKDLSQQIRVLLMELE EARGNHVIRDEEVSSADISSSSEVISQHLVSYRNIEELQQQNQRLLVALRELGETRER EEQETTSSKITELQLKLESALTELEQLRKSRQHQMQLVDSIVRQRDMYRILLSQTTGV AIPLHASSLDDVSLASTPKRPSTSQTVSTPAPVPVIESTEAIEAKAALKQLQEIFENY KKEKAENEKIQNEQLEKLQEQVTDLRSQNTKISTQLDFASKRYEMLQDNVEGYRREIT SLHERNQKLTATTQKQEQIINTMTQDLRGANEKLAVAEVRAENLKKEKEMLKLSEVRL SQQRESLLAEQRGQNLLLTNLQTIQGILERSETETKQRLSSQIEKLEHEISHLKKKLE NEVEQRHTLTRNLDVQLLDTKRQLDTETNLHLNTKELLKNAQKEIATLKQHLSNMEVQ VASQSSQRTGKGQPSNKEDVDDLVSQLRQTEEQVNDLKERLKTSTSNVEQYQAMVTSL EESLNKEKQVTEEVRKNIEVRLKESAEFQTQLEKKLMEVEKEKQELQDDKRRAIESME QQLSELKKTLSSVQNEVQEALQRASTALSNEQQARRDCQEQAKIAVEAQNKYERELML HAADVEALQAAKEQVSKMASVRQHLEETTQKAESQLLECKASWEERERMLKDEVSKCV CRCEDLEKQNRLLHDQIEKLSDKVVASVKEGVQGPLNVSLSEEGKSQEQILEILRFIR REKEIAETRFEVAQVESLRYRQRVELLERELQELEDSLNAEREKVQVTAKTMAQHEEL MKKTETMNVVMETNKMLREEKERLEQDLQQMQAKVRKLELDILPLQEANAELSEKSGM LQAEKKLLEEDVKRWKARNQHLVSQQKDPDTEEYRKLLSEKEVHTKRIQQLTEEIGRL KAEIARSNASLTNNQNLIQSLKEDLNKVRTEKETIQKDLDAKIIDIQEKVKTITQVKK IGRRYKTQYEELKAQQDKVMETSAQSSGDHQEQHVSVQEMQELKETLNQAETKSKSLE SQVENLQKTLSEKETEARNLQEQTVQLQSELSRLRQDLQDRTTQEEQLRQQITEKEEK TRKAIVAAKSKIAHLAGVKDQLTKENEELKQRNGALDQQKDELDVRITALKSQYEGRI SRLERELREHQERHLEQRDEPQEPSNKVPEQQRQITLKTTPASGERGIASTSDPPTAN IKPTPVVSTPSKVTAAAMAGNKSTPRASIRPMVTPATVTNPTTTPTATVMPTTQVESQ EAMQSEGPVEHVPVFGSTSGSVRSTSPNVQPSISQPILTVQQQTQATAFVQPTQQSHP QIEPANQELSSNIVEVVQSSPVERPSTSTAVFGTVSATPSSSLPKRTREEEEDSTIEA SDQVSDDTVEMPLPKKLKSVTPVGTEEEVMAEESTDGEVETQVYNQDSQDSIGEGVTQ GDYTPMEDSEETSQSLQIDLGPLQSDQQTTTSSQDGQGKGDDVIVIDSDDEEEDEEDD DDDEDDTGMGDEGEDSNEGTGSADGNDGYEADDAEGGDGTDPGTETEESMGGGEGNHR AADSQNSGEGNTGAAESSFSQEVSREQQPSSASERQAPRAPQSPRRPPHPLPPRLTIH APPQELGPPVQRIQMTRRQSVGRGLQLTPGIGGMQQHFFDDEDRTVPSTPTLVVPHRT DGFAEAIHSPQVAGVPRFRFGPPEDMPQTSSSHSDLGQLASQGGLGMYETPLFLAHEE ESGGRSVPTTPLQVAAPVTVFTESTTSDASEHASQSVPMVTTSTGTLSTTNETATGDD GDEVFVEAESEGISSEAGLEIDSQQEEEPVQASDESDLPSTSQDPPSSSSVDTSSSQP KPFRRVRLQTTLRQGVRGRQFNRQRGVSHAMGGRGGINRGNIN" misc_feature 6477..7343 /gene="Tpr" /note="Potential overlap with Tpr ORF. Possible frame shift." polyA_signal 7322..7327 /gene="Tpr" polyA_signal 7467..7472 polyA_site 7487 BASE COUNT 2666 a 1424 c 1753 g 1654 t ORIGIN 1 gcgcaagagg atcagggata gcctctgagc tcgggttccc agggttcgta gcttccaacg 61 gctgcgcgcg cacttcggtc gcgggcggtg aggtgctgtt gctgaaacgc tgccgctgag 121 ggtggactcg atttcccagg gtcccgccgc gggagtctcc ggcgggcggg cgcgcgcgag 181 ccaccgagcg aggtgataga ggcggcggcc caggcgtctg ggtcctgctg gtcttcgcct 241 ttcttctccg cttctacccc gtcggccgct gccactgggg tccctggccc caccgacatg 301 gcggcggtgt tgcagcaagt cctggagcgc acggagctga acaagctgcc caagtctgtc 361 cagaacaaac ttgaaaagtt ccttgctgat cagcaatccg agatcgatgg cctgaagggg 421 cggcatgaga aatttaaggt ggagagcgaa caacagtatt ttgaaataga aaagaggttg 481 tcccacagtc aggagagact tgtgaatgaa acccgagagt gtcaaagctt gcggcttgag 541 ctagagaaac tcaacaatca actgaaggca ctaactgaga aaaacaaaga acttgaaatt 601 gctcaggatc gcaatattgc cattcagagc caatttacaa gaacaaagga agaattagaa 661 gctgagaaaa gagacttaat tagaaccaat gagagactat ctcaagaact tgaatactta 721 acagaggatg ttaaacgtct gaatgaaaaa cttaaagaaa gcaatacaac aaagggtgaa 781 cttcagttaa aattggatga acttcaagct tctgatgttt ctgttaagta tcgagaaaaa 841 cgcttggagc aagaaaagga attgctacat agtcagaata catggctgaa tacagagttg 901 aaaaccaaaa ctgatgaact tctggctctt ggaagagaaa aagggaatga gattctagag 961 cttaaatgta atcttgaaaa taaaaaagaa gaggtttcta gactggaaga acaaatgaat 1021 ggcttaaaaa catcaaatga acatcttcaa aagcatgtgg aggatctgtt gaccaaatta 1081 aaagaggcca aggaacaaca ggccagtatg gaagagaaat tccacaatga attaaatgcc 1141 cacataaaac tttctaattt gtacaagagt gccgctgatg actcagaagc aaagagcaat 1201 gaactaaccc gggcagtaga ggaactacac aaacttttga aagaagctgg tgaagccaac 1261 aaagcaatac aagatcatct tctagaggtg gagcaatcca aagatcaaat ggaaaaagaa 1321 atgcttgaga aaatagggag attggagaag gaattagaga atgcaaatga ccttctttct 1381 gccacaaaac gtaaaggagc catattgtct gaagaagagc ttgccgccat gtctcctact 1441 gcagcagctg tagctaagat agtgaaacct gggatgaaac taactgagct ctataatgct 1501 tatgtggaaa ctcaggatca gttgcttttg gagaaactag agaacaaaag aattaataag 1561 tacctagatg aaatagtgaa agaagtggaa gccaaagcac caattttgaa acgccagcgt 1621 gaggaatatg aacgtgcaca gaaagctgta gcaagtttat ctgttaagct tgaacaagct 1681 atgaaggaga ttcagcgatt gcaggaggac actgataaag ccaacaagca atcatctgta 1741 cttgagagag ataatcgaag aatggaaata caagtaaaag atctttcaca acagattaga 1801 gtgcttttga tggaacttga agaagcaagg ggtaaccacg taattcgtga tgaggaagta 1861 agctctgctg atataagtag ttcatctgag gtaatatcac agcatctagt atcttacaga 1921 aatattgaag agcttcaaca acaaaatcaa cgtctcttag tggcccttag agagcttggg 1981 gaaaccagag aaagagaaga acaagaaaca acttcatcca aaatcactga gcttcagctc 2041 aaacttgaga gtgcccttac tgaactagaa caactccgca aatcacgaca gcatcaaatg 2101 cagcttgttg attccatagt tcgtcagcgt gatatgtacc gtattttatt gtcacaaaca 2161 acaggagttg ccattccatt acatgcttca agcttagatg atgtttctct tgcatcaact 2221 ccaaaacgtc caagtacatc acagactgtt tccactcctg ctccagtacc tgttattgaa 2281 tcaacagagg ctatagaggc taaggctgcc cttaaacagt tgcaggaaat ttttgagaac 2341 tacaaaaaag aaaaagcaga aaatgaaaaa atacaaaatg agcagcttga gaaacttcaa 2401 gaacaagtta cagatttgcg atcacaaaat accaaaattt ctacccagct agattttgct 2461 tctaaacgtt atgaaatgct gcaagataat gttgaaggat atcgtcgaga aataacatca 2521 cttcatgaga gaaatcagaa actcactgcc acaactcaaa agcaagaaca gattatcaat 2581 acgatgactc aagatttgag aggagcaaat gagaagctag ctgtcgcaga agtaagagca 2641 gaaaatttga agaaggaaaa ggaaatgctt aaattgtctg aagttcgtct ttctcagcaa 2701 agagagtctt tgttagctga acaaaggggg caaaacttac tgctaactaa tctgcaaaca 2761 attcagggaa tactggagcg atctgaaaca gaaaccaaac aaaggcttag tagccagata 2821 gaaaaactgg aacatgagat ctctcatcta aagaagaagt tggaaaatga ggtggaacaa 2881 aggcatacac ttactagaaa tctagatgtt caacttttag atacaaagag acaactggat 2941 acagagacaa atcttcatct taacacaaaa gaactattaa aaaatgctca aaaagaaatt 3001 gccacattga aacagcacct cagtaatatg gaagtccaag ttgcttctca gtcttcacag 3061 agaactggta aaggtcagcc tagcaacaaa gaagatgtgg atgatcttgt gagtcagcta 3121 agacagacag aagagcaggt gaatgactta aaggagagac tcaaaacaag tacgagcaat 3181 gtggaacaat atcaagcaat ggttactagt ttagaagaat ccctgaacaa ggaaaaacag 3241 gtgacagaag aagtgcgtaa gaatattgaa gttcgtttaa aagagtcagc tgaatttcag 3301 acacagttgg aaaagaagtt gatggaagta gagaaggaaa aacaagaact tcaggatgat 3361 aaaagaagag ccatagagag catggaacaa cagttatctg aattgaagaa aacactttct 3421 agtgttcaga atgaagtaca agaagctctt cagagagcaa gcacagcttt aagtaatgag 3481 cagcaagcca gacgtgactg tcaggaacaa gctaaaatag ctgtggaagc tcagaataag 3541 tatgagagag aattgatgct gcatgctgct gatgttgaag ctctacaagc tgcgaaggag 3601 caggtttcaa aaatggcatc agtccgtcag catttggaag aaacaacaca gaaagcagaa 3661 tcacagttgt tggagtgtaa agcatcttgg gaggaaagag agagaatgtt aaaggatgaa 3721 gtttccaaat gtgtatgtcg ctgtgaagat ctggagaaac aaaacagatt acttcatgat 3781 cagatcgaaa aattaagtga caaggtcgtt gcctctgtga aggaaggtgt acaaggtcca 3841 ctgaatgtat ctctcagtga agaaggaaaa tctcaagaac aaattttgga aattctcaga 3901 tttatacgac gagaaaaaga aattgctgaa actaggtttg aggtggctca ggttgagagt 3961 ctgcgttatc gacaaagggt tgaactttta gaaagagagc tgcaggaact cgaagatagt 4021 ctaaatgctg aaagggagaa agtccaggta actgcaaaaa caatggctca gcatgaagaa 4081 ctgatgaaga aaactgaaac aatgaatgta gttatggaga ccaataaaat gctaagagaa 4141 gagaaggaga gactagaaca ggatctacag caaatgcaag caaaggtgag gaaactggag 4201 ttagatattt tacccttaca agaagcaaat gctgagctga gtgagaaaag cggtatgttg 4261 caggcagaga agaagctctt agaagaggat gtcaaacgtt ggaaagcacg taaccagcat 4321 ctagtaagtc aacagaaaga tccagataca gaagaatatc ggaagctcct ttctgaaaag 4381 gaagttcata ctaagcgtat tcaacaattg acagaagaaa ttggtagact taaagctgaa 4441 attgcaagat caaatgcatc tttgactaac aaccagaact taattcagag tctgaaggaa 4501 gatctaaata aagtaagaac tgaaaaggaa accatccaga aggacttaga tgccaaaata 4561 attgatatcc aagaaaaagt caaaactatt actcaagtta agaaaattgg acgtaggtac 4621 aagactcaat atgaagaact taaagcacaa caggataagg ttatggagac atcggctcag 4681 tcctctggag accatcagga gcagcatgtt tcagtccagg aaatgcagga actcaaagaa 4741 acgctcaacc aagctgaaac aaaatcaaaa tcacttgaaa gtcaagtaga gaatctgcag 4801 aagacattat ctgaaaaaga gacagaagca agaaatctcc aggaacagac tgtgcaactt 4861 cagtctgaac tttcacgact tcgtcaggat cttcaagata gaaccacaca ggaggagcag 4921 ctccgacaac agataactga aaaggaagaa aaaaccagaa aggctattgt agcagcaaag 4981 tcaaaaattg cacacttagc tggtgtaaaa gatcagctaa ctaaagaaaa tgaggagctt 5041 aaacaaagga atggagcctt agatcagcag aaagatgaat tggatgttcg cattactgcg 5101 ctaaagtccc aatatgaagg tcgaattagt cgcttggaaa gagaactcag ggagcatcaa 5161 gagagacacc ttgagcagag agatgagcct caagaacctt ctaataaggt ccctgaacag 5221 cagagacaga tcacattgaa aacaactcca gcttctggtg aaagaggaat tgccagcaca 5281 tcagacccac caacagccaa tatcaagcca actcctgttg tgtctactcc aagtaaagtg 5341 acagctgcag ctatggctgg aaataagtca acacccaggg ctagtatccg cccaatggtt 5401 acacctgcaa ctgttacaaa tcccactact accccaacag ctacagtgat gcccactaca 5461 caagtggaat cacaggaagc tatgcagtca gaagggcctg tggaacatgt tccagttttt 5521 ggaagcacaa gtggatccgt tcgttctact agtcctaatg tccagccttc tatctctcaa 5581 cctattttaa ctgttcagca acaaacacag gctacagctt ttgtgcaacc cactcaacag 5641 agtcatcctc agattgagcc tgccaatcaa gagttatctt caaacatagt agaggttgtt 5701 cagagttcac cagttgagcg gccttctact tccacagcag tatttggcac agtttcggct 5761 acccccagtt cttctttgcc aaagcgtaca cgtgaagagg aagaggatag caccatagaa 5821 gcatcagacc aagtctctga tgatacagtg gaaatgcctc ttccaaagaa gttgaaaagt 5881 gtcacacctg taggaactga ggaagaagtt atggcagaag aaagtactga tggagaggta 5941 gagactcagg tatacaacca ggattctcaa gattccattg gagaaggagt tacccaggga 6001 gattatacac ctatggaaga cagtgaagaa acctctcagt ctctacaaat agatcttggg 6061 ccacttcaat cagatcagca gacgacaact tcatcccagg atggtcaagg caaaggagat 6121 gatgtcattg taattgacag tgatgatgaa gaagaggatg aggaagatga tgatgatgat 6181 gaagatgaca cagggatggg agatgagggt gaagatagta atgaaggaac tggtagtgcc 6241 gatggcaatg atggttatga agctgatgat gctgagggtg gtgatgggac tgatccaggt 6301 acagaaacag aagaaagtat gggtggaggt gaaggtaatc acagagctgc tgattctcaa 6361 aacagtggtg aaggaaatac aggtgctgca gaatcttctt tttctcagga ggtttctaga 6421 gaacaacagc catcatcagc atctgaaaga caggcccctc gagcacctca gtcaccgaga 6481 cgcccaccac atccacttcc cccaagactg accattcatg ccccacctca ggagttggga 6541 ccaccagttc agagaattca gatgacccga aggcagtctg taggacgtgg ccttcagttg 6601 actccaggaa taggtggcat gcaacagcat ttttttgatg atgaagacag aacagttcca 6661 agtactccaa ctcttgtggt gccacatcgt actgatggat ttgctgaagc aattcattcg 6721 ccgcaggttg ctggtgtccc tagattccgg tttgggccac ctgaagatat gccacaaaca 6781 agttctagtc actctgatct tggccagctt gcttctcaag gaggtttagg aatgtatgaa 6841 acacccctgt tcctagctca tgaagaagag tcaggtggcc gaagtgttcc cactactcca 6901 ctacaagtag cagccccagt gactgtattt actgagagca ccacctctga tgcttcggaa 6961 catgcctctc aatctgttcc aatggtgact acatccactg gcactttatc tacaacaaat 7021 gaaacagcaa caggtgatga tggagatgaa gtatttgtgg aggcagaatc tgaaggtatt 7081 agttcagaag caggcctaga aattgatagc cagcaggaag aagagccggt tcaagcatct 7141 gatgagtcag atctcccctc caccagccag gatcctcctt ctagctcatc tgtagatact 7201 agtagtagtc aaccaaagcc tttcagacga gtaagacttc agacaacatt gagacaaggt 7261 gtccgtggtc gtcagtttaa cagacagaga ggtgtgagcc atgcaatggg agggagagga 7321 ggaataaaca gaggaaatat taattaaatg gtctgtaaac aataacaact gtgaataaga 7381 ttatcaaatc tgttttagtg taatgattgt caagtttaaa aacattttta tatataaact 7441 ggtatactca tgtcaatatt ctttattaat aaaatgtttt tcagtgtcaa aaaaaaa // LOCUS HSTRA1 2780 bp RNA PRI 31-MAR-1995 DEFINITION Human tra1 mRNA for human homologue of murine tumor rejection antigen gp96. ACCESSION X15187 NID g37260 KEYWORDS calcium binding protein; cell surface glycoprotein; glycoprotein; heat shock protein; stress protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2780) AUTHORS Maki,R.G. TITLE Direct Submission JOURNAL Submitted (05-MAY-1989) Maki R.G., Mount Sinai School of Medicine, Dept of Pharmacology, C/O Dr Pramod K Srivastava, Box 1215 1 Gustave L Levy Pl, New York NY 10029, USA REFERENCE 2 (bases 1 to 2780) AUTHORS Maki,R.G., Old,L.J. and Srivastava,P.K. JOURNAL Unpublished COMMENT Bases 1-12 genomic sequence, 13-2780 cDNA sequence. See for human GRP94 sequence. FEATURES Location/Qualifiers source 1..2780 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="LASTD teratocarcinoma" /clone_lib="LASTD2" /clone="pH48" precursor_RNA 1..2780 /note="primary transcript" CDS 106..2517 /codon_start=1 /product="precursor polypeptide (AA -21 to 782)" /db_xref="PID:g37261" /db_xref="SWISS-PROT:P14625" /translation="MRALWVLGLCCVLLTFGSVRADDEVDVDGTVEEDLGKSREGSRT DDEVVQREEEAIQLDGLNASQIRELREKSEKFAFQAEVNRMMKLIINSLYKNKEIFLR ELISNASDALDKIRLISLTDENALSGNEELTVKIKCDKEKNLLHVTDTGVGMTREELV KNLGTIAKSGTSEFLNKMTEAQEDGQSTSELIGQFGVGFYSAFLVADKVIVTSKHNND TQHIWESDSNEFSVIADPRGNTLGRGTTITLVLKEEASDYLELDTIKNLVKKYSQFIN FPIYVWSSKTETVEEPMEEEEAAKEEKEESDDEAAVEEEEEEKKPKTKKVEKTVWDWE LMNDIKPIWQRPSKEVEEDEYKAFYKSFSKESDDPMAYIHFTAEGEVTFKSILFVPTS APRGLFDEYGSKKSDYIKLYVRRVFITDDFHDMMPKYLNFVKGVVDSDDLPLNVSRET LQQHKLLKVIRKKLVRKTLDMIKKIADDKYNDTFWKEFGTNIKLGVIEDHSNRTRLAK LLRFQSSHHPTDITSLDQYVERMKEKQDKIYFMAGSSRKEAESSPFVERLLKKGYEVI YLTEPVDEYCIQALPEFDGKRFQNVAKEGVKFDESEKTKESREAVEKEFEPLLNWMKD KALKDKIEKAVVSQRLTESPCALVASQYGWSGNMERIMKAQAYQTGKDISTNYYASQK KTFEINPRHPLIRDMLRRIKEDEDDKTVLDLAVVLFETATLRSGYLLPDTKAYGDRIE RMLRLSLNIDPDAKVEEEPEEEPEETAEDTTEDTEQDEDEEMDVGTDEEEETAKESTA EKDEL" sig_peptide 106..168 /note="signal peptide (AA -21 to -1)" mat_peptide 169..2514 /note="gp96 homologue (AA 1 to 782)" misc_feature 2762..2767 /note="pot.polyA signal" polyA_site 2780 /note="polyA site" BASE COUNT 931 a 492 c 670 g 687 t ORIGIN 1 gtgggcggac cgcgcggctg gaggtgtgag gatccgaacc caggggtggg gggtggaggc 61 ggctcctgcg atcgaagggg acttgagact caccggccgc acgccatgag ggccctgtgg 121 gtgctgggcc tctgctgcgt cctgctgacc ttcgggtcgg tcagagctga cgatgaagtt 181 gatgtggatg gtacagtaga agaggatctg ggtaaaagta gagaaggatc aaggacggat 241 gatgaagtag tacagagaga ggaagaagct attcagttgg atggattaaa tgcatcacaa 301 ataagagaac ttagagagaa gtcggaaaag tttgccttcc aagccgaagt taacagaatg 361 atgaaactta tcatcaattc attgtataaa aataaagaga ttttcctgag agaactgatt 421 tcaaatgctt ctgatgcttt agataagata aggctaatat cactgactga tgaaaatgct 481 ctttctggaa atgaggaact aacagtcaaa attaagtgtg ataaggagaa gaacctgctg 541 catgtcacag acaccggtgt aggaatgacc agagaagagt tggttaaaaa ccttggtacc 601 atagccaaat ctgggacaag cgagttttta aacaaaatga ctgaagcaca ggaagatggc 661 cagtcaactt ctgaattgat tggccagttt ggtgtcggtt tctattccgc cttccttgta 721 gcagataagg ttattgtcac ttcaaaacac aacaacgata cccagcacat ctgggagtct 781 gactccaatg aattttctgt aattgctgac ccaagaggaa acactctagg acggggaacg 841 acaattaccc ttgtcttaaa agaagaagca tctgattacc ttgaattgga tacaattaaa 901 aatctcgtca aaaaatattc acagttcata aactttccta tttatgtatg gagcagcaag 961 actgaaactg ttgaggagcc catggaggaa gaagaagcag ccaaagaaga gaaagaagaa 1021 tctgatgatg aagctgcagt agaggaagaa gaagaagaaa agaaaccaaa gactaaaaaa 1081 gttgaaaaaa ctgtctggga ctgggaactt atgaatgata tcaaaccaat atggcagaga 1141 ccatcaaaag aagtagaaga agatgaatac aaagctttct acaaatcatt ttcaaaggaa 1201 agtgatgacc ccatggctta tattcacttt actgctgaag gggaagttac cttcaaatca 1261 attttatttg tacccacatc tgctccacgt ggtctgtttg acgaatatgg atctaaaaag 1321 agcgattaca ttaagctcta tgtgcgccgt gtattcatca cagacgactt ccatgatatg 1381 atgcctaaat acctcaattt tgtcaagggt gtggtggact cagatgatct ccccttgaat 1441 gtttcccgcg agactcttca gcaacataaa ctgcttaagg tgattaggaa gaagcttgtt 1501 cgtaaaacgc tggacatgat caagaagatt gctgatgata aatacaatga tactttttgg 1561 aaagaatttg gtaccaacat caagcttggt gtgattgaag accactcgaa tcgaacacgt 1621 cttgctaaac ttcttaggtt ccagtcttct catcatccaa ctgacattac tagcctagac 1681 cagtatgtgg aaagaatgaa ggaaaaacaa gacaaaatct acttcatggc tgggtccagc 1741 agaaaagagg ctgaatcttc tccatttgtt gagcgacttc tgaaaaaggg ctatgaagtt 1801 atttacctca cagaacctgt ggatgaatac tgtattcagg cccttcccga atttgatggg 1861 aagaggttcc agaatgttgc caaggaagga gtgaagttcg atgaaagtga gaaaactaag 1921 gagagtcgtg aagcagttga gaaagaattt gagcctctgc tgaattggat gaaagataaa 1981 gcccttaagg acaagattga aaaggctgtg gtgtctcagc gcctgacaga atctccgtgt 2041 gctttggtgg ccagccagta cggatggtct ggcaacatgg agagaatcat gaaagcacaa 2101 gcgtaccaaa cgggcaagga catctctaca aattactatg cgagtcagaa gaaaacattt 2161 gaaattaatc ccagacaccc gctgatcaga gacatgcttc gacgaattaa ggaagatgaa 2221 gatgataaaa cagttttgga tcttgctgtg gttttgtttg aaacagcaac gcttcggtca 2281 gggtatcttt taccagacac taaagcatat ggagatagaa tagaaagaat gcttcgcctc 2341 agtttgaaca ttgaccctga tgcaaaggtg gaagaagagc ccgaagaaga acctgaagag 2401 acagcagaag acacaacaga agacacagag caagacgaag atgaagaaat ggatgtggga 2461 acagatgaag aagaagaaac agcaaaggaa tctacagctg aaaaagatga attgtaaatt 2521 atactctcac catttggatc ctgtgtggag agggaatgtg aaatttacat catttctttt 2581 tgggagagac ttgttttgga tgccccctaa tccccttctc ccctgcactg taaaatgtgg 2641 gattatgggt cacaggaaaa agtgggtttt ttagttgaat tttttttaac attcctcatg 2701 aatgtaaatt tgtactattt aactgactat tcttgatgta aaatcttgtc atgtgtataa 2761 aaataaaaaa gatcccaaat // LOCUS HSTRAMP 1267 bp RNA PRI 01-JUN-1992 DEFINITION H.sapiens mRNA for TRAMP protein. ACCESSION X63679 NID g37264 KEYWORDS traM gene; TraM protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1267) AUTHORS Gorlich,D., Hartmann,E., Prehn,S. and Rapoport,T.A. TITLE A protein of the endoplasmic reticulum involved early in polypeptide translocation JOURNAL Nature 357 (6373), 47-52 (1992) MEDLINE 92244357 REFERENCE 2 (bases 1 to 1267) AUTHORS Hartmann,E. TITLE Direct Submission JOURNAL Submitted (28-JAN-1992) E. Hartmann, Max-Delbrueck-Centr. f. Molekulare Med., Robert-Roessle-Strasse 10, O-1115 Berlin Buch, FRG FEATURES Location/Qualifiers source 1..1267 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Hela" gene 122..1246 /gene="TRAM" CDS 122..1246 /gene="TRAM" /codon_start=1 /product="TRAM protein" /db_xref="PID:g37265" /translation="MAIRKKSTKSPPVLSHEFVLQNHADIVSCVAMVFLLGLMFEITA KASIIFVTLQYNVTLPATEEQATESVSLYYYGIKDLATVFFYMLVAIIIHAVIQEYML DKINRRMHFSKTKHSKFNESGQLSAFYLFACVWGTFILISENYISDPTILWRAYPHNL MTFQMKFFYISQLAYWLHAFPELYFQKTKKEDIPRQLVYIGLYLFHIAGAYLLNLNHL GLVLLVLHYFVEFLFHISRLFYFSNEKYQKGFSLWAVLFVLGRLLTLILSVLTVGFGL ARAENQKLDFSTGNFNVLAVRIAVLASICVTQAFMMWKFINFQLRRWREHSAFQAPAV KKKPTVTKGRSSKKGTENGVNGTLTSNVADSPRNKKEKSS" BASE COUNT 343 a 278 c 275 g 371 t ORIGIN 1 cagcgagcgg ctgcagcggg gccgtgacca gcagccagcg ggaggcggcg gcgagtcggt 61 gagcagctgg gaagagcaga accggggcgg agcacctgca ggcgcgggcg gcggccccac 121 catggcgatt cgcaagaaaa gcaccaagag ccccccagtg ctgagccacg aattcgtcct 181 gcagaatcac gcggacatcg tctcctgtgt ggcgatggtc ttcctgctgg ggctcatgtt 241 tgagataacg gcaaaagctt ctatcatttt tgttactctt cagtacaatg tcaccctccc 301 agcaacagaa gaacaagcta ctgaatcagt gtccctttat tactatggca tcaaagattt 361 ggctactgtt ttcttctaca tgctagtggc gataattatt catgccgtaa ttcaagagta 421 tatgttggat aaaattaaca ggcgaatgca cttctccaaa acaaaacaca gcaagtttaa 481 tgaatctggt cagcttagtg cgttctacct ttttgcctgt gtttggggca cattcattct 541 catctctgaa aactacatct cagacccaac tatcttatgg agggcttatc cccataacct 601 gatgacattt caaatgaagt ttttctacat atcacagctg gcttactggc ttcatgcttt 661 tcctgaactc tacttccaga aaaccaaaaa agaagatatt cctcgtcagc ttgtctacat 721 tggtctttac ctcttccaca ttgctggagc ttaccttttg aacttgaatc atctaggact 781 tgttcttctg gtgctacatt attttgttga atttcttttc cacatttccc gcctgtttta 841 ttttagcaat gaaaagtatc agaaaggatt ttctctgtgg gcagttcttt ttgttttggg 901 aagacttctg actttaattc tttcagtact gactgttggt tttggccttg caagagcaga 961 aaatcagaaa ctggatttca gtactggaaa cttcaatgtg ttagctgtta gaatcgctgt 1021 tctggcatcc atttgcgtta ctcaggcatt tatgatgtgg aagttcatta attttcagct 1081 tcgaaggtgg agggaacatt ctgcttttca ggcaccagct gtgaagaaga aaccaacagt 1141 aactaaaggc agatcttcta aaaaaggaac agaaaatggt gtgaatggaa cattaacttc 1201 aaatgtagca gactctcccc ggaataaaaa agagaaatct tcataatgaa ttataaacta 1261 attgatt // LOCUS HSTRANSK 2106 bp RNA PRI 09-MAY-1996 DEFINITION H.sapiens mRNA for transketolase. ACCESSION X67688 S52775 NID g37266 KEYWORDS transketolase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2104) AUTHORS Singleton,C.K. TITLE Direct Submission JOURNAL Submitted (11-AUG-1992) C.K. Singleton, Vanderbilt University, Box 1820, Station B, Nashville, TN 37235, USA REFERENCE 2 (bases 1 to 2104) AUTHORS Abedinia,M., Layfield,R., Jones,S.M., Nixon,P.F. and Mattick,J.S. TITLE Nucleotide and predicted amino acid sequence of a cDNA clone encoding part of human transketolase JOURNAL Biochem. Biophys. Res. Commun. 183 (3), 1159-1166 (1992) MEDLINE 92231878 REFERENCE 3 (bases 1 to 2106) AUTHORS McCool,B.A., Plonk,S.G., Martin,P.R. and Singleton,C.K. TITLE Cloning of human transketolase cDNAs and comparison of the nucleotide sequence of the coding region in Wernicke-Korsakoff and non-Wernicke-Korsakoff individuals JOURNAL J. Biol. Chem. 268 (2), 1397-1404 (1993) MEDLINE 93123263 FEATURES Location/Qualifiers source 1..2106 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="frontal cortex, liver" /cell_line="Hela (nucleotides residues 1-200 only)" /clone_lib="frontal cortex and liver cDNA" /clone="pTKa-1, pTKd-1" CDS 71..1942 /EC_number="2.2.1.1" /note="homodimer" /codon_start=1 /evidence=experimental /product="transketolase" /db_xref="PID:g37267" /db_xref="SWISS-PROT:P29401" /translation="MESYHKPDQQKLQALKDTANRLRISSIQASSAAGSGHPTSCCSA AVIMAVLFFHTMRYKSQDPRNPHNDRFVLSKGHAAPILYAVWAEAGFLAEAELLNLRK ISSDLDGHPVPKQAFTDVATGSLGQGLGAACGMAYTGKYFDKASYRVYCLLGDGELSE GSVWEAMAFASIYKLDNLVAILDINRLGQSDPAPLQHQMDIYQKRCEAFGWHAIIVDG HSVEELCKAFGQAKHQPTAIIAKTFKGRGITGVEDKESWHGKPLPKNMAEQIIQEIYS QIQSKKKILATPPQEDAPSVDIANIRMPSLPSYKVGDKIATRKAYGQALAKLGHASDR IIALDGDTKNSTFSEIFKKEHPDRFIECYIAEQNMVSIAVGCATRNRTVPFCSTFAAF FTRAFDQIRMAAISESNINLCGSHCGVSIGEDGASQMALEDLAMFRSVPTSTVFYPSD GVATEKAVELAANTKGICFIRTSRPENAIIYNNNEDFQVGQAKVVLKSKDDQVTVIGA GVTLHEALAAAELLKKEKINIRVLDPFTIKPLDRKLILDSARATKGRILTVEDHYYEG GIGEAVSSAVVGEPGITVTHLAVNRVPRSGKPAELLKMFGIDRDAIAQAVRGLITKA" BASE COUNT 529 a 608 c 589 g 380 t ORIGIN 1 gcgctgtcag ctcgcagcag ccactatctc tgtgtgtccg cgtgtgcgcc cggtccccgc 61 ctgccgcacc atggagagct accacaagcc tgaccagcag aagctgcagg ccttgaagga 121 cacggccaac cgcctacgta tcagctccat ccaggcctcc tctgcggcgg gctctggcca 181 ccccacgtca tgctgcagcg ccgcagtgat catggctgtc ctctttttcc acaccatgcg 241 ctacaagtcc caggaccccc ggaatccgca caatgaccgc tttgtgctct ccaagggcca 301 tgcagctccc atcctctacg cggtctgggc tgaagctggt ttcctggccg aggcggagct 361 gctgaacctg aggaagatca gctccgactt ggacgggcac ccggtcccga aacaagcttt 421 caccgacgtg gccactggct ccctgggcca gggcctcggg gccgcttgtg ggatggccta 481 caccggcaaa tacttcgaca aggccagcta ccgagtctat tgcttgctgg gagatgggga 541 gctgtcagag ggctctgtat gggaggccat ggccttcgcc agcatctata agctggacaa 601 ccttgtggcc attctagaca tcaatcgcct gggccagagt gacccggccc cgctgcagca 661 ccagatggac atctaccaga agcggtgcga ggccttcggt tggcatgcca tcatcgtgga 721 tggacacagc gtggaggagc tgtgcaaggc ctttggccag gccaagcacc agccaacagc 781 catcattgcc aagaccttca agggccgagg gatcacgggg gtagaagata aggagtcttg 841 gcatgggaag cccctcccca aaaacatggc tgagcagatc atccaggaga tctacagcca 901 gatccagagc aaaaagaaga tcctggcaac ccctccacag gaggacgcac cctcagtgga 961 cattgccaac atccgcatgc ccagcctgcc cagctacaaa gttggggaca agatagccac 1021 ccgcaaggcc tacgggcagg cactggccaa gctgggccat gccagtgacc gcatcatcgc 1081 cctggatggg gacaccaaaa attccacctt ctcggagatc ttcaaaaagg agcacccgga 1141 ccgcttcatc gagtgctaca ttgccgagca gaacatggtg agcatcgcgg tgggctgtgc 1201 cacccgcaac aggacggtgc ccttctgcag cacttttgca gccttcttca cgcgggcctt 1261 tgaccagatt cgcatggcgg ccatctccga gagcaacatc aacctctgcg gctcccactg 1321 cggcgtttcc atcggggaag acggggcctc ccagatggcc ctagaagatc tggctatgtt 1381 tcggtcagtc cccacatcaa ctgtctttta cccaagtgat ggcgttgcta cagagaaggc 1441 agtggaacta gccgccaata caaagggtat ctgcttcatc cggaccagcc gcccagaaaa 1501 tgccatcatc tataacaaca atgaggactt ccaggtcgga caagccaagg tggtcctgaa 1561 gagcaaggat gaccaggtga ccgttatcgg ggctggggtg accctgcacg aggccttggc 1621 cgctgccgaa ctgctgaaga aagaaaagat caacatccgc gtgctggacc ccttcaccat 1681 caagcccctg gacagaaaac tcattctcga cagcgctcgt gccaccaagg gcaggatcct 1741 caccgtggag gaccattatt atgaaggtgg cattggtgag gctgtgtcca gtgcagtagt 1801 gggcgagcct ggcatcactg tcacccacct ggcagttaac cgggtaccaa gaagtgggaa 1861 gccagctgag ctgctgaaga tgtttggtat cgacagggat gccattgcac aagctgtgag 1921 gggcctcatc accaaggcct agggcgggta tgaagtgtgg ggcgggggtc tatacattcc 1981 tgagattctg ggaaaggtgc tcaaagatgt actgagagga ggggtaaata tatgttttga 2041 gaaaaatgaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2101 aaaaaa // LOCUS HSTRANSLI 2739 bp RNA PRI 14-DEC-1995 DEFINITION H.sapiens mRNA for translin. ACCESSION X78627 NID g607129 KEYWORDS DNA-binding; translin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2739) AUTHORS Aoki,K., Suzuki,K., Sugano,T., Tasaka,T., Nakahara,K., Kuge,O., Omori,A. and Kasai,M. TITLE A novel gene, Translin, encodes a recombination hotspot binding protein associated with chromosomal translocations JOURNAL Nature Genet. 10 (2), 167-174 (1995) MEDLINE 95392568 REFERENCE 2 (bases 1 to 2739) AUTHORS Kasai,M. TITLE Direct Submission JOURNAL Submitted (06-APR-1994) M. Kasai, National Institute of Health of Japan, Dept of Immunology, 1-32-1, Tokama, Shinjuku, Tokyo 162, JAPAN FEATURES Location/Qualifiers source 1..2739 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lymphoid leukemia" /cell_line="human leukemia cell line" /clone_lib="DND-41" /clone="N14" CDS 82..768 /function="DNA binding protein" /codon_start=1 /product="translin" /db_xref="PID:g607130" /translation="MSVSEIFVELQGFLAAEQDIREEIRKVVQSLEQTAREILTLLQG VHQGAGFQDIPKRCLKAREHFGTVKTHLTSLKTKFPAEQYYRFHEHWRFVLQRLVFLA AFVVYLETETLVTREAVTEILGIEPDREKGFHLDVEDYLSGVLILASELSRLSVNSVT AGDYSRPLHISTFINELDSGFRLLNLKNDSLRKRYDGLKYDVKKVEEVVYDLSIRGFN KETAAACVEK" polyA_signal 2668..2673 BASE COUNT 793 a 490 c 608 g 848 t ORIGIN 1 gattgattgc gctggttgcc tgcggcgtcc acttccttgg ccgcccttgc tacactggct 61 gattgttgtg cagccggcgc catgtctgtg agcgagatct tcgtggagct gcagggcttt 121 ttggctgccg agcaggacat ccgagaggaa atcagaaaag ttgtacagag tttagaacaa 181 acagctcgag agattttaac tctactgcaa ggggtccatc agggtgctgg gtttcaggac 241 attccaaaga ggtgtttgaa agctcgagaa cattttggta cagtaaaaac acatctaaca 301 tctttgaaga ccaaatttcc tgctgaacag tattacagat ttcatgagca ctggaggttt 361 gtgttgcagc gcttggtctt cttggcagca tttgttgtgt atttggaaac agaaacacta 421 gtgactcgag aagcagttac agaaattctt ggcattgagc cagatcggga gaaaggattt 481 catctggatg tagaagatta tctctcagga gttctaattc ttgccagtga actgtcgagg 541 ctgtctgtca acagcgtgac tgctggagac tactcccgac ccctccacat ctccaccttc 601 atcaatgagc tggattccgg ttttcgcctt ctcaacctga aaaatgactc cctgaggaag 661 cgctacgacg gattgaaata tgacgtgaag aaagtagagg aagtggtcta tgatctctcc 721 atccggggct ttaataagga gacggcagca gcttgtgttg aaaaatagga ggctctcctt 781 gctcctggcc ttgctgacct cagcggttgc caggaagggg tgagcacaga gtgcctctta 841 cggtagttag gatgctcagt tgctaaacac tgcgctttat tttcttaacc agttgtggtg 901 tgagtatcag aattgaaaca cttttttggg ggtaaaaaat atagccttta catggacaga 961 attttttttg ttgtttcagt gaatatgcct gtaattcagt gtatttcagt tccgtcagaa 1021 agtgtaaatg ttagtttctt ggtaaagtcc ttttcttgct taccttgact gttgatgtac 1081 tgattgagaa gttcattgtc tcgtttgtga ttcttccaga tgtgatgctt gatattttct 1141 atatgcgagt tagccatcca cacccaggca tagcctggat acagtataaa aatagataat 1201 taaaaagatg gttgccaagc aaggaaaact tattttatat tttcccttcc ttattttaag 1261 cattgtgagt aaatcagatg ttgaattctt ttgccaaggg aattatagct gcaggttctc 1321 tctcactgcc atcaaactgt aaaagattaa actgcgaagt caagctcaac agattatttt 1381 ggaaagtttt tgtattaagg gatttagtaa catcattttg ttttccacca ggcagggagt 1441 agggcttagt gttttaaaac acctctgctt tctgatgttg ccttaatatt ctgctattgc 1501 agcaattaaa aattgtcttc atgtacattt ggaactaaca cgtgatgtga tatattccta 1561 aactatgaaa cctttttcct agtagtcagc tagatcattt gttctgggag tataaagcca 1621 cccacgtaag ttaataagca aaatcctgac tattatgttg ttagagaaaa atgctttgct 1681 ttgtctggaa gaaagataaa atagtgaatt ataaataagt caggccgggc gtggtggctc 1741 acacctgtaa tcccagcaca ctgggaggcc gaggcagggg gactgcttga gctcaggagt 1801 tcgagaccag cctgggcaac aaagtgagac tccatctcta tataaaaaca aaaaccacga 1861 aagcacacac aaaataaatc agtgggattt ggtaatgtgt tttagagtaa gaaatttcag 1921 gttgttggtg actatcccaa cagtcatgtt ttaaatgtac agtttggggc aagtcatgta 1981 aatactgttg gtggtcttcc ccacacgccc caattttcag gtagtactaa gagtatgtgc 2041 caggaaactc ttgctattga attgagatga ttaaaatggt gacttaatcc gtagttattt 2101 tgcacccact gaaaggaaag tgctttccag aataatatga agtatctaaa agtgtcacct 2161 tttcttgcct gatcaacaat ttgggcttcc tgtttgtaca aggggccatt tggcatacct 2221 ttcacagctt ttatcaggcc aagttaaagg ctgactacat tttttcatca tgaggaaagc 2281 agttgaaatg aggcatgagt tactgtgcat tgggatttta gaacaatttt cttgtgacag 2341 ctctttttgt gaagttaggt tcttaaaaga gcccatgatg gtcacttaaa atgtgcagta 2401 atagcactgc caggatcaag catgaaaggc ttttaaatta gatcatccca cagacaatac 2461 gtttgataat agttttttct tttaacctct ttaagtattg attctgcttg agaatattga 2521 agtacttgcc agaagttgtg gatttcagtt ttaacaaatg ctattaaagt ggagaagcac 2581 actctggtct tggaattcca tttgaggatt tagaagtgtc atgtttataa ctattcagtt 2641 gtgtttgttg ctggcttgtt gtaaagcaat aaaatttttt tggtcttttt gtaaaaaaaa 2701 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaa // LOCUS HSTRANSPO 1287 bp DNA PRI 20-JAN-1998 DEFINITION H.sapiens gene encoding transposase protein. ACCESSION X94948 NID g2808444 KEYWORDS insertion site; mariner-like transposon; transposase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1287) AUTHORS Bigot,Y.B. and Auge-Gouillou,C. TITLE Human and other eukaryotic mariner-like transposons are located in structurally similar sites JOURNAL Unpublished REFERENCE 2 (bases 1 to 1287) AUTHORS Bigot,Y.B. TITLE Direct Submission JOURNAL Submitted (10-JAN-1996) Y.B. Bigot, IBEAS, Faculte des Sciences, parc de Gremont, 37200 Tours, FRANCE FEATURES Location/Qualifiers source 1..1287 /organism="Homo sapiens" /db_xref="taxon:9606" misc_feature 1..54 /note="5'insertion site" misc_feature 55..89 /note="inverted terminal repeat" CDS 186..1190 /note="mariner-like transposon; cecropia sub-family" /codon_start=1 /product="transposase" /db_xref="PID:e218705" /db_xref="PID:g2808445" /translation="MMLNKKKIRVIFLFEFKMGHKAAEITRNMNNTFGPGTANETYSA VVASRSFAKEESLEDEERDWRPLEVDNDQFESNHQLNYMRNCQKTSTLTILRSLGSFN KLESKWVPHELKSKIFKKYPFLKCLLLFYYNNNEPFLDQIVTCDEKWILYDNWRLPTQ WLDREEAKKHFPKPNLHQKKVMVNVWWSAAGVIHYSFLNPGETITSEKYAQQINEMHR KLQRLQPALVNRKGPILLHNNPQQHVAQPTLQKLNELGYKVLPHPPYSPDLLPTNYHF LEGLNNFLQGKRFHNQQDAENAFQEFVESQSTDFYATGINKLISRWQKCVDCNGSYFD " misc_feature complement(1187..1259) /note="3'insertion site" misc_feature 1235..1258 /note="inverted terminal repeat" BASE COUNT 417 a 247 c 245 g 376 t 2 others ORIGIN 1 ggttggtgca aaagtaattg tggtttttgc actgttggaa tttgccattt gatattagga 61 tactttctta aataaatgtg gttatgttat acatcatttt aatgggcatt tctgctttat 121 nacttatttt tttgctaatg acttattact tgctgtttat tttgtattta ttttagactg 181 tgnaaatgat gttaaacaaa aagaaaattc gagtgatttt cttattcgag ttcaaaatgg 241 gtcataaagc agcagagata actcgaaaca tgaacaacac atttggccca ggaactgcta 301 atgaaacata cagtgcagtg gtggcttcaa gaagttttgc aaaggaagag agccttgaag 361 atgaggaacg agattggcgg ccattggaag ttgacaacga ccaatttgag agcaatcatc 421 aattgaacta catgagaaat tgtcagaaga cctcaacgtt gactattcta cggtcattag 481 ggtcgttcaa caaattggaa agtaagtggg tccctcatga gctgaagtca aaaattttta 541 aaaaatatcc atttttaaag tgtcttctct tattctacta taacaacaac gaaccatttc 601 ttgatcagat tgtgacatgc gatgaaaagt ggattttata tgacaactgg cgactaccaa 661 ctcagtggtt ggaccgagaa gaagctaaaa agcacttccc aaagccaaac ttgcaccaaa 721 aaaaggtcat ggtcaatgtt tggtggtctg ctgccggtgt gatccactac agctttctga 781 atcctggtga aaccattaca tctgagaagt atgctcagca aatcaatgag atgcaccgaa 841 aactgcaacg cctgcagccg gcattggtca acagaaaggg cccaattctt ctccacaaca 901 acccccaaca gcatgtcgca caaccaacgc ttcaaaagtt gaatgaattg ggctacaaag 961 ttttgcctca tccaccgtat tcacctgacc tcttgccaac caactaccac ttcttagaag 1021 gtctcaacaa ctttttacag ggaaaacgct tccacaacca gcaggatgca gaaaatgctt 1081 tccaagagtt cgtcgaatcc caaagcacgg atttttacgc tacaggaata aacaaactta 1141 tttctcgttg gcaaaaatgt gttgattgta atggttccta ttttgattaa taaagatgtg 1201 tttgagccta gttataatga tttaaaatta atggtccaaa acactttatt taaacaatta 1261 cttttgcacc aacctataac acagtga // LOCUS HSTRAPA 1822 bp RNA PRI 30-JUN-1993 DEFINITION H.sapiens TRAP mRNA for ligand of CD40. ACCESSION X68550 S49008 NID g37269 KEYWORDS T-cell activation; TNF related; transmembrane protein; TRAP gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1822) AUTHORS Graf,D. TITLE Direct Submission JOURNAL Submitted (30-SEP-1992) D. Graf, Max-Planck-Society, Res Units for Rheumatology/Immunology at the Inst for Clinical Immunology of the University Erlangen/Nurnberg, Schwabachanlage 10, 8520 Erlangen, FRG REFERENCE 2 (bases 1 to 1822) AUTHORS Graf,D., Korthauer,U., Mages,H.W., Senger,G. and Kroczek,R.A. TITLE Cloning of TRAP, a ligand for CD40 on human T cells JOURNAL Eur. J. Immunol. 22 (12), 3191-3194 (1992) MEDLINE 93076854 FEATURES Location/Qualifiers source 1..1822 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="peripheral blood" /cell_type="T-lymphocyte" /chromosome="X" /map="Xq26.3 - Xq27.1" gene 57..842 /gene="TRAP" CDS 57..842 /gene="TRAP" /note="CD40 ligand" /codon_start=1 /db_xref="PID:g37270" /db_xref="SWISS-PROT:P29965" /translation="MIETYNQTSPRSAATGLPISMKIFMYLLTVFLITQMIGSALFAV YLHRRLDKIEDERNLHEDFVFMKTIQRCNTGERSLSLLNCEEIKSQFEGFVKDIMLNK EETKKENSFEMQKGDQNPQIAAHVISEASSKTTSVLQWAEKGYYTMSNNLVTLENGKQ LTVKRQGLYYIYAQVTFCSNREASSQAPFIASLCLKSPGRFERILLRAANTHSSAKPC GQQSIHLGGVFELQPGASVFVNVTDPSQVSHGTGFTSFGLLKL" mat_peptide 186..839 /gene="TRAP" polyA_signal 1000..1005 polyA_signal 1803..1808 BASE COUNT 516 a 465 c 346 g 495 t ORIGIN 1 catgctgcct ctgccacctt ctctgccaga agataccatt tcaactttaa cacagcatga 61 tcgaaacata caaccaaact tctccccgat ctgcggccac tggactgccc atcagcatga 121 aaatttttat gtatttactt actgtttttc ttatcaccca gatgattggg tcagcacttt 181 ttgctgtgta tcttcataga aggctggaca agatagaaga tgaaaggaat cttcatgaag 241 attttgtatt catgaaaacg atacagagat gcaacacagg agaaagatcc ttatccttac 301 tgaactgtga ggagattaaa agccagtttg aaggctttgt gaaggatata atgttaaaca 361 aagaggagac gaagaaagaa aacagctttg aaatgcaaaa aggtgatcag aatcctcaaa 421 ttgcggcaca tgtcataagt gaggccagca gtaaaacaac atctgtgtta cagtgggctg 481 aaaaaggata ctacaccatg agcaacaact tggtaaccct ggaaaatggg aaacagctga 541 ccgttaaaag acaaggactc tattatatct atgcccaagt caccttctgt tccaatcggg 601 aagcttcgag tcaagctcca tttatagcca gcctctgcct aaagtccccc ggtagattcg 661 agagaatctt actcagagct gcaaataccc acagttccgc caaaccttgc gggcaacaat 721 ccattcactt gggaggagta tttgaattgc aaccaggtgc ttcggtgttt gtcaatgtga 781 ctgatccaag ccaagtgagc catggcactg gcttcacgtc ctttggctta ctcaaactct 841 gaacagtgtc accttgcagg ctgtggtgga cgtgacgctg ggagtcttca taatacagca 901 cagcggttaa gcccaccccc tgttaactgc ctatttataa ccctaggatc ctccttatgg 961 agaactattt attatacact ccaaggcatg tagaactgta ataagtgaat tacaggtcac 1021 atgaaaccaa aacgggccct gctccataag agcttatata tctgaagcag caaccccact 1081 gatgcagaca tccagagagt cctatgaaaa gacaaggcca ttatgcacag gttgaattct 1141 gagtaaacag cagataactt gccaagttca gttttgtttc tttgcgtgca gtgtctttcc 1201 atggataatg catttgattt atcagtgaag atgcagaagg gaaatgggga gcctcagctc 1261 acattcagtt atggttgact ctgggttcct atggccttgt tggagggggc caggctctag 1321 aacgtctaac acagtggaga accgaaaccc cccccccccc cgccaccctc tcggacagtt 1381 attcattctc tttcaatctc tctctctcca tctctctctt tcagtctctc tctctcaacc 1441 tctttcttcc aatctctctt tctcaatctc tctgtttccc tttgtcagtc tcttccctcc 1501 cccagtctct cttctcaatc cccctttcta acacacacac acacacacac acacacacac 1561 acacacacac acacacacac acacacacac agagtcaggc cgttgctagt cagttctctt 1621 ctttccaccc tgtccctatc tctaccacta tagatgaggg tgaggagtag ggagtgcagc 1681 cctgagcctg cccactcctc attacgaaat gactgtattt aaaggaaatc tattgtatct 1741 acctgcagtc tccattgttt ccagagtgaa cttgtaatta tcttgttatt tattttttga 1801 ataataaaga cctcttaaca tt // LOCUS HSTRAXGEN 1198 bp RNA PRI 05-JUN-1997 DEFINITION H.sapiens mRNA for translin associated protein X. ACCESSION X95073 NID g1770575 KEYWORDS translin associated protein X; traX gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1198) AUTHORS Aoki,K., Ishida,R. and Kasai,M. TITLE Isolation and characterization of a cDNA encoding a Translin-like protein, TRAX JOURNAL FEBS Lett. 401 (2-3), 109-112 (1997) MEDLINE 97165975 REFERENCE 2 (bases 1 to 1198) AUTHORS Kasai,M. TITLE Direct Submission JOURNAL Submitted (15-JAN-1996) M. Kasai, N.I.H., Immunology, 1-23-1, Toyama, Shinjuku-ku, Tokyo, 162, JAPAN FEATURES Location/Qualifiers source 1..1198 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="F2" /tissue_type="spleen" /dev_stage="adult" gene 161..1033 /gene="TRAX" CDS 161..1033 /gene="TRAX" /codon_start=1 /product="Translin associated protein X" /db_xref="PID:e225463" /db_xref="PID:g1770576" /translation="MSNKEGSGGFRKRKHDNFPHNQRREGKDVNSSSPVMLAFKSFQQ ELDARHDKYERLVKLSRDITVESKRTIFLLHRITSAPDMEDILTESEIKLDGVRQKIF QVAQELSGEDMHQFHRAITTGLQEYVEAVSFQHFIKTRSLISMDEINKQLIFTTEDNG KENKTPSSDAQDKQFGTWRLRVTPVDYLLGVADLTGELMRMCINSVGNGDIDTPFEVS QFLRQVYDGFSFIGNTGPYEVSKKLYTLKQSLAKVENACYALKVRGSEIPKHMLADVF SVKTEMIDQEEGIS" BASE COUNT 369 a 219 c 268 g 342 t ORIGIN 1 tgacgtgaga ggagacttcc ggccactgcg ttgtagtcgg cccggctgca aagcgttttt 61 ctgcaggctg ttttcccagg ttccctcggc ctgtacctcg cgcactcctc ttgctccagg 121 tccttcagtc tccgctcgtc tcaccgtagg ctgtgacgac atgagcaaca aagaaggatc 181 aggagggttc aggaaaagga agcatgacaa tttcccacat aaccaaagaa gagaagggaa 241 ggatgttaat tcatcttcac ccgtgatgtt ggcctttaaa tcatttcagc aggaacttga 301 tgcaaggcat gacaaatatg agagacttgt gaaacttagt cgggatataa ctgttgaaag 361 taaaaggaca atttttctcc tccataggat tacaagtgct cctgatatgg aagatatatt 421 gactgaatca gaaattaaat tggatggtgt cagacaaaag atattccagg tagcccaaga 481 gctatcaggg gaagatatgc atcagttcca tcgagccatt actacaggac tacaggaata 541 tgtggaagct gtctcttttc aacacttcat caaaacacga tcattaatta gtatggatga 601 aattaataaa caattgatat ttacgactga agacaatggg aaagaaaata aaactccctc 661 ctctgatgca caggataagc agtttggtac ttggagactg agagtcacac ctgtcgatta 721 ccttctggga gtggctgact taactggaga attgatgcgg atgtgtatta acagtgtggg 781 gaatggggac attgataccc cctttgaagt gagccagttt ttacgtcagg tttatgatgg 841 gttttcattc attggcaaca ctggacctta cgaggtttct aagaagctgt ataccttgaa 901 acaaagtttg gccaaagtgg agaatgcttg ttatgccttg aaagtcagag ggtcagaaat 961 tccaaaacat atgttggcag atgtgttttc agttaaaaca gaaatgatag atcaagaaga 1021 gggcatttct tagaatctaa cgttactcag ttactaattc ttttgagaac tcctaagaga 1081 ccaatttgta agacttattt agtatttcat ttaactttat tgtggctttt acatagaaac 1141 atattcagtt gtacttgttt taaattgtat acaagctgta cataaaattt aaaacaag // LOCUS HSTRE210 7878 bp RNA PRI 22-DEC-1993 DEFINITION H.sapiens mRNA for tre oncogene (clone 210). ACCESSION X63546 NID g37329 KEYWORDS oncogene; transforming capacity; tre gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7878) AUTHORS Hillova,J. TITLE Direct Submission JOURNAL Submitted (16-DEC-1991) J. Hillova, C.N.R.S. - SDI I6204, I.C.I.G., Dept of Molecular and Cellular Biology, 14, avenue Paul-Vaillant Couturier, 94804 Villejuif Cedex, FRANCE REFERENCE 2 (bases 1 to 7878) AUTHORS Nakamura,T., Hillova,J., Mariage-Samson,R., Onno,M., Huebner,K., Cannizzaro,L.A., Boghosian-Sell,L., Croce,C.M. and Hill,M. TITLE A novel transcriptional unit of the tre oncogene widely expressed in human cancer cells JOURNAL Oncogene 7 (4), 733-741 (1992) MEDLINE 92228503 COMMENT See also X63547, X63596 & X71366-79 The tre-2 genetic element identified in tre-transfectants was renamed oncRTE17 because of its origin from the repetoire of hypervariable TRE17 genes (see X63586). FEATURES Location/Qualifiers source 1..7878 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="Ewings' sarcoma" /cell_line="transfected NIH 3T3 cell" /clone="210" /chromosome="17q" 5'UTR 1..1696 gene 1697..5805 /gene="tre" CDS 1697..4057 /gene="tre" /codon_start=1 /product="oncogene" /db_xref="PID:g37330" /translation="MDMVENADSLQAQERKDILMKYDKGHRAGLPEDKGPEPVGINSS IDRFGILHETELPPVTAREAKKIRREMTRTSKWMEMLGEWETYKHSSKLIDRVYKGIP MNIRGPVWSVLLNIQEIKLKNPGRYQIMKERGKRSSEHIHHIDLDVRTTLRNHVFFRD RYGAKQRELFYILLAYSEYNPEVGYCRDLSHITALFLLYLPEEDAFWALVQLLASERH SLPGFHSPNGGTVQGLQDQQEHVVPKSQPKTMWHQDKEGLCGQCASLGCLLRNLIDGI SLGLTLRLWDVYLVEGEQVLMPITSIALKVQQKRLMKTSRCGLWARLRNQFFDTWAMN DDTVLKHLRASTKKLTRKQGDLPPPAKREQGSLAPRPVPASRGGKTLCKGYRQAPPGP PAQFQRPICSASPPWASRFSTPCPGGAVREDTYPVGTQGVPSLALAQGGPQGSWRFLE WKSMPRLPTDLDIGGPWFPHYDFERSCWVRAISQEDQLATCWQAEHCGEVHNKDMSWP EEMSFTANSSKIDRQKVPTEKGATGLSNLGNTCFMNSSIQCVSNTQPLTQYFISGRHL YELNRTNPIGMKGHMAKCYGDLVQELWSGTQKSVAPLKLRRTIAKYAPKFDGFQQQDS QELLAFLLDGLHEDLNRVHEKPYVELKDSDGRPDWEVAAEAWDNHLRRNRSIIVDLFH GQLRSQVKCKTCGHISVRFDPFNFLSLPLPMDSYMDLEITVIKLDGTTPVRYGLRLNM DEKYTGLKKQLRDLCGLNSEQILLAEVHDSNIKISPLHHLQMECSP" CDS 4045..5805 /gene="tre" /codon_start=1 /product="oncogene" /db_xref="PID:g37331" /db_xref="SWISS-PROT:P35125" /translation="MFTLTTNGDLPKPIFIPNGMPNTVVPCGTEKNFTNGMVNGHMPS LPDSPFTGYIIAVHRKMMRTELYFLSPQENRPSLFGMPLIVPCTVHTQKKDLYDAVWI QVSWLARPLPPQEASIHAQDRDNCMGYQYPFTLRVVQKDGISCAWCPQYRFCRGCKID CGEDRAFIGNAYIAVDWHPTALHLRYQTSQERVVDKHESVEQSRRAQAEPINLDSCLR AFTSEEELGESEMYYCSKCKTHCLATKKLDLWRLPPFLIIHLKRFQFVNDQWIKSQKI VRFLRESFDPSAFLVPRDPALCQHKPLTPQGDELSKPRILAREVKKVDAQSSAGKEDM LLSKSPSSLSANISSSPKGSPSSSRKSGTSCPSSKNSSPNSSPRTLGRSKGRLRLPQI GSKNKPSSSKKNLDASKENGAGQICELADALSRGHMRGGSQPELVTPQDHEVALANGF LYEHEACGNGCGDGYSNGQLGNHSEEDSTDDQREDTHIKPIYNLYAISCHSGILSGGH YITYAKNPNCKWYCYNDSSCEELHPDEIDTDSAYILFYEQQGIDYAQFLPKIDGKKMA DTSSTDEDSESDYEKYSMLQ" 3'UTR 5806..7878 polyA_site 7856 BASE COUNT 2175 a 1781 c 1928 g 1994 t ORIGIN 1 ctaaaaatac cattaagtaa tagtattagc ttttgtattc tgagattcaa cagcagcagt 61 cacttccctc cactcctatg tgtatcccag gaccaccctg ggcggggagg gctgaggtca 121 gggaggtctg aagctggtcc tgggctccgg gggtgacagt gatgaggaac tgggtgcaca 181 catgagtggg gcagccgggc ctggccagag aagcaacaca cacgtgcaca gacatgttta 241 tccacataca catgtgcacg catgtgcaca aacacattgc aggcaggcat gttgacgcct 301 caggcagcgg aggaccctga ctctgggccc tgctgacccg ggcaaggccc attgtgatgc 361 gtgccatgac ctcagaatgt cactggtgct tagcacctat ccgctctcca gactgcgtct 421 gtgttctacg gcagttacac acacgcagtg gtattcacaa gcggttttgt ggactcaaag 481 gttttctccc tgagaggcat aacccaggcc agctgattca tcagaatcag gtgagtgtga 541 cctgctctct tccctccagg ctgacttggg gacagtggct atggtatggg cggtgttggc 601 ctctgggcag ctacagagga gggtcatccc tgagcactca ccgggcgccc gttctacact 661 gcccatgtag acgattttct ctttcgtctt catggtggct tcgtagagtg ggtgctgttc 721 ccaaatgtac ccattcgaca ggtgagccgt ctggggtcag agaggcagta actggcctgg 781 gaatccagac aagaccctgg gttttgctct cagccctgct gtgtgccatg ctagacttca 841 ggcctcaacc ctgagacctc cctgctctag atcccaaatc tgcccagatt tccgatccaa 901 tgggcagagc ctggccctgg cagagacact gggatggatc cactgtgggt ggggaggagg 961 gaagggtcct cagaacacac ctggggccta agctgggtct tgatggtcac tgtgggaccc 1021 actggacaca cacagtccct tgtctgggag tggcatgggg agccttctgc ccttgggcag 1081 ttgtggaaag tgaaggagcc ctggagagct ggctgagggg agactatctt cccttgtgtt 1141 caaaggggtc caggcactgg ggctctcccc aagtatttct tattctgtct ggcctcgctt 1201 tccttttgcc ctgagtattc tcaggaggga cggtccatct agatgtcctc caggagcaag 1261 gacccactgt tcttcatcag tgacccagga aaatgaagcc ccctcctgtg gggacagctc 1321 agaatggtgg agtccacagt ccctccctga gagacatggt ttccatgagc acagtggctg 1381 ctttggagac agtaatcatt ttcatcccca aaaccaaaca cactcctgct caaatggtgt 1441 tattgctaaa gcagcttcac tggttagact gaagggccat ggtagcccaa gtgatgagcg 1501 gggtagaatg gagcagtcag gagagatctt gttccccgta ggaaactggg catctctgtg 1561 gccctgaaca tcccaggagg ccgatcgtac agagacctct ggtgcctgac cgcagttcac 1621 atccacatcc ctggaataga ccatcacagg ctcttcaccc ttggcaggtg gacaccattc 1681 aacctgccgg ggcaggatgg acatggtaga gaatgcagat agtttgcagg cacaggagcg 1741 gaaggacata cttatgaagt atgacaaggg acaccgagct gggctgccag aggacaaggg 1801 gcctgagccc gttggaatca acagcagcat tgatcgtttt ggcattttgc atgagacgga 1861 gctgcctcct gtgactgcac gggaggcgaa gaaaattcgg cgggagatga cacgaacgag 1921 caagtggatg gaaatgctgg gagaatggga gacatataag cacagtagca aactcataga 1981 tcgagtgtac aagggaattc ccatgaacat ccggggcccg gtgtggtcag tcctcctgaa 2041 cattcaggaa atcaagttga aaaaccccgg aagataccag atcatgaagg agaggggcaa 2101 gaggtcatct gaacacatcc accacatcga cctggacgtg aggacgactc tccggaacca 2161 tgtcttcttt agggatcgat atggagccaa gcagagggaa ctattctaca tcctcctggc 2221 ctattcggag tataacccgg aggtgggcta ctgcagggac ctgagccaca tcaccgcctt 2281 gttcctcctt tatctgcctg aggaggacgc attctgggca ctggtgcagc tgctggccag 2341 tgagaggcac tccctgccag gattccacag cccaaatggt gggacagtcc aggggctcca 2401 agaccaacag gagcatgtgg tacccaagtc acaacccaag accatgtggc atcaggacaa 2461 ggaaggtcta tgcgggcagt gtgcctcgtt aggctgcctt ctccggaacc tgattgacgg 2521 gatctctctc gggctcaccc tgcgcctgtg ggacgtgtat ttggtggaag gagaacaggt 2581 gttgatgcca ataaccagca ttgctcttaa ggttcagcag aagcgcctca tgaagacatc 2641 caggtgtggc ctgtgggcac gtctgcggaa ccaattcttc gatacctggg ccatgaacga 2701 tgacaccgtg ctcaagcatc ttagggcctc tacgaagaaa ctaacaagga agcaagggga 2761 cctgccaccc ccagccaaac gcgagcaagg gtccttggca cccaggcctg tgccggcttc 2821 acgtggtggg aagaccctct gcaaggggta taggcaggcc cctccaggcc caccagccca 2881 gttccagcgg cccatttgct cagcttcccc gccatgggca tctcgttttt ccacgccctg 2941 tcctggtggg gctgtccggg aagacacgta ccctgtgggc actcagggtg tgcccagcct 3001 ggccctggct cagggaggac ctcagggttc ctggagattc ctggagtgga agtcaatgcc 3061 ccggctccca acggacctgg atataggggg cccttggttc ccccattatg attttgaacg 3121 gagctgctgg gtccgtgcca tatcccagga ggaccagctg gccacctgct ggcaggctga 3181 acactgcgga gaggttcaca acaaagatat gagttggcct gaggagatgt cttttacagc 3241 aaatagtagt aaaatagata gacaaaaggt tcccacagaa aagggagcca caggtctaag 3301 caacctggga aacacatgct tcatgaactc aagcatccag tgcgttagta acacacagcc 3361 actgacacag tattttatct cagggagaca tctttatgaa ctcaacagga caaatcccat 3421 tggtatgaag gggcatatgg ctaaatgcta tggtgattta gtgcaggaac tctggagtgg 3481 aactcagaag agtgttgccc cattaaagct tcggcggacc atagcaaaat atgctcccaa 3541 gtttgatggg tttcagcaac aagactccca agaacttctg gcttttctct tggatggtct 3601 tcatgaagat ctcaaccgag tccatgaaaa gccatatgtg gaactgaagg acagtgatgg 3661 ccgaccagac tgggaagtag ctgcagaggc ctgggacaac catctaagaa gaaatagatc 3721 aattattgtg gatttgttcc atgggcagct aagatctcaa gtcaaatgca agacatgtgg 3781 gcatataagt gtccgatttg accctttcaa ttttttgtct ttgccactac caatggacag 3841 ttacatggac ttagaaataa cagtgattaa gttagatggt actacccctg tacggtatgg 3901 actaagactg aatatggatg aaaagtacac aggtttaaaa aaacagctga gggatctctg 3961 tggacttaat tcagaacaaa tcctactagc agaagtacat gattccaaca taaagatttc 4021 tcctcttcac catctacaaa tggaatgttc accctaacta ccaatgggga cctacccaaa 4081 ccaatattca tccccaatgg aatgccaaac actgttgtgc catgtggaac tgagaagaac 4141 ttcacaaatg gaatggttaa tggtcacatg ccatctcttc ctgacagccc ctttacaggt 4201 tacatcattg cagtccaccg aaaaatgatg aggacagaac tgtatttcct gtcacctcag 4261 gagaatcgcc ccagcctctt tggaatgcca ttgattgttc catgcactgt gcatacccag 4321 aagaaagacc tatatgatgc ggtttggatt caagtatcct ggttagcaag accactccca 4381 cctcaggaag ctagtattca tgcccaggat cgtgataact gtatgggcta tcaatatcca 4441 ttcactctac gagttgtgca gaaagatggg atctcctgtg cttggtgccc acagtataga 4501 ttttgcagag gctgtaaaat tgattgtggg gaagacagag ctttcattgg aaatgcctat 4561 attgctgtgg attggcaccc cacagccctt caccttcgct atcaaacatc ccaggaaagg 4621 gttgtagata agcatgagag tgtggagcag agtcggcgag cgcaagccga gcccatcaac 4681 ctggacagct gtctccgtgc tttcaccagt gaggaagagc taggggaaag tgagatgtac 4741 tactgttcca agtgtaagac ccactgctta gcaacaaaga agctggatct ctggaggctt 4801 ccacccttcc tgattattca ccttaagcga tttcaatttg taaatgatca gtggataaaa 4861 tcacagaaaa ttgtcagatt tcttcgggaa agttttgatc cgagtgcttt tttggtacca 4921 cgagacccgg ccctctgcca gcataaacca ctcacacccc agggggatga gctctccaag 4981 cccaggattc tggcaagaga ggtgaagaaa gtggatgcgc agagttcggc tggaaaagag 5041 gacatgctcc taagcaaaag cccatcttca ctcagcgcta acatcagcag cagcccaaaa 5101 ggttctcctt cttcatcaag aaaaagtgga accagctgtc cctccagcaa aaacagcagc 5161 cctaatagca gcccacggac tttggggagg agcaaaggga ggctccggct gccccagatt 5221 ggcagcaaaa ataagccgtc aagtagtaag aagaacttgg atgccagcaa agagaatggg 5281 gctgggcaga tctgtgagct ggctgacgcc ttgagccgag ggcatatgcg ggggggcagc 5341 caaccagagc tggtcactcc tcaggaccat gaggtagctt tggccaatgg attcctttat 5401 gagcatgaag catgtggcaa tggctgtggc gatggctaca gcaatggtca gcttggaaac 5461 cacagtgaag aagacagcac tgatgaccaa agagaagaca ctcatattaa gcctatttat 5521 aatctatatg caatttcatg ccattcagga attctgagtg ggggccatta catcacttat 5581 gccaaaaacc caaactgcaa gtggtactgt tataatgaca gcagctgtga ggaacttcac 5641 cctgatgaaa ttgacaccga ctctgcctac attcttttct atgagcagca ggggatagac 5701 tacgcacaat ttctgccaaa gattgatggc aaaaagatgg cagacacaag cagtacggat 5761 gaagactctg agtctgatta cgaaaagtac tctatgttac agtaaagcta ccactctggc 5821 tgctagacag cttggtggcg agggagatga ctccttgtag ctgatacttg gcaaaagtgt 5881 cactgaaaga caagctaaat gtagttattt tatcctgtta gaacaaaaat tctaattaaa 5941 atagttaact tgaagagtag aaacaattgt attttgaagt ctcatacaag ctgtctgata 6001 gagaactttc aggcagatcc caccattagc ctgtaaacaa aaggtgtggc accagccacc 6061 tgggaccaaa taagaattga attgtgcttg tccagatatg aacaaatatg tagtgagtat 6121 agagtttacc aataatcata acaaatatta aagatttcct tggagtcaga ggaaaaaaca 6181 aacaattata atgttgtcta gggacgacat gatacgctac ctcctttttc ctgaagtttt 6241 attccattat attgacaaga tggagaaagc aagatcatga aggtgtgcaa atgattctta 6301 cggcatggac aaggattttt caatttattt tttaaactgt ttccataccc tttctttttc 6361 ttgctttttg tttttgccat tgtgtttacg tttgagacac aaccagtcat tggtggcagg 6421 ggcatagagt ggtcagtctg aaagggaggc tctcttaaga gctatgtgcc ttccaaccag 6481 agggagaccc agtagaaaga aaaacatcct gggaaatcca gctaccaggg ccctcccagt 6541 ggaggcatct tacatttagg ctacttcaag tatcctcaga aatgtattct gcacccccgg 6601 ccccgcccat gctgagggaa ggggagcagt tgccaatatt tgcaccatct tcacatgcac 6661 atgttgcaac aagagcttct gggaaggtaa gcggcatcgg agctagatca cgtttcacaa 6721 ttagtggtta ttcttttctg tgtttgtttt gcactttaaa aaagagagaa cacatgcaaa 6781 tgaacttgct tgtgtgtatt tgatggctct aagggctata aattacaaac aaaacacatc 6841 ccagacatta ggagttcata agtatattta atgaaattgg tggttttagg aagtcaactt 6901 tagttttgct ttgtttgcat gtccactggt ttttttattt tgatatttgt ctttttttaa 6961 attttacagt agtcattgaa agttatgttt ctttgcttac ttcatttttt ccctctaatt 7021 atttaagatt ggaacaaaag tataaatatt atttatttga ggtagaattt ttttcatgta 7081 gtttcttaat atatacttga aggaaatgtt tcaccttatt tttggtcttt gtttattcat 7141 ttagaccctg caagttgatt ctcattgcca gattccatta ccctttcttc ctcataggta 7201 gtaattacca atgtaactaa gcatttgtgt tctgatatct gaggccagta actattaata 7261 tctagttctc agagcatttg gaaaggttat cttaaatggc tacctaaatt gaaatccttt 7321 tcagaaaaaa tataattgca agtaggtagg agtggcctaa attgtctaat gtaataaagt 7381 cagacaaaat gcacacttta tagtttcaag attttcagta aataaaatct gtccattcct 7441 acctggacat gtcccattaa aaagtggaag attttaaata atttctttac agatgtttta 7501 tttaaacagg tagcacaatc tactaatgtt gtgtgatttg tgttatactg gttgtaatta 7561 atttttttaa ttcatgaact agcggaaaat ttattaaatt aactattaac tacattcacc 7621 ttgtaaatta ctgtataaaa cttgttgaca atgcactgac tttagaaaga tgttaatgta 7681 cataaataga gtgtaaataa aatagtgttg atgtactgaa atatgaactg tatcaaaagt 7741 attggtaatt gtatatgggg tgtacctgtt tatctgttaa ctattatcca aacaaattaa 7801 atactgtggt tgcctctatg tgctgttttt cctcatacaa gtaaacacag aaagtcaaaa 7861 aaaaaaaaaa aaaaaaaa // LOCUS HSTRECP 1875 bp RNA PRI 28-MAY-1996 DEFINITION H.sapiens mRNA for transmenbrane receptor protein. ACCESSION Z17227 NID g393378 KEYWORDS CRF2-4; CRFB4; cytokine receptor; transmembrane receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1875) AUTHORS Lutfalla,G., Gardiner,K. and Uze,G. TITLE A new member of the cytokine receptor gene family maps on chromosome 21 at less than 35 kb from IFNAR JOURNAL Genomics 16 (2), 366-373 (1993) MEDLINE 93300510 REFERENCE 2 (bases 1 to 1875) AUTHORS Lutfalla,G. TITLE Direct Submission JOURNAL Submitted (12-OCT-1992) Georges LUTFALLA, CNRS, 7 rue Guy Moquet, Villejuif, 94801, France FEATURES Location/Qualifiers source 1..1875 /organism="Homo sapiens" /strain="Daudi cell line" /db_xref="taxon:9606" /germline /dev_stage="Adult" /tissue_type="Lymphoid tumor" /cell_type="B-lymphocyte" /cell_line="Daudi cell line" /chromosome="chromosome 21" sig_peptide 44..100 /product="leader peptide" CDS 44..1021 /codon_start=1 /product="transmenbrane receptor precusor" /db_xref="PID:g393379" /db_xref="SWISS-PROT:Q08334" /translation="MAWSLGSWLGGCLLVSALGMVPPPENVRMNSVNFKNILQWESPA FAKGNLTFTAQYLSYRIFQDKCMNTTLTECDFSSLSKYGDHTLRVRAEFADEHSDWVN ITFCPVDDTIIGPPGMQVEVLADSLHMRFLAPKIENEYETWTMKNVYNSWTYNVQYWK NGTDEKFQITPQYDFEVLRNLEPWTTYCVQVRGFLPDRNKAGEWSEPVCEQTTHDETV PSWMVAVILMASVFMVCLALLGCFSLLWCVYKKTKYAFSPRNSLPQHLKEFLGHPHHN TLLFFSFPLSDENDVFDKLSVIAEDSESGKQNPGDSCSLGTPPGQGPQS" misc_feature 101..703 /product="D200 extracellular domain (cytokine receptors)" mat_peptide 101..1018 /product="transmembrane receptor" misc_feature 704..790 /product="transmembrane region" misc_feature 791..1018 /product="intracellular domain" repeat_region 1329..1653 /rpt_family="Alu" polyA_signal 1853..1858 polyA_site 1875 BASE COUNT 517 a 425 c 451 g 482 t ORIGIN 1 gtcgtgtgct tggaggaagc cgcggaaccc ccagcgtccg tccatggcgt ggagccttgg 61 gagctggctg ggtggctgcc tgctggtgtc agcattggga atggtaccac ctcccgaaaa 121 tgtcagaatg aattctgtta atttcaagaa cattctacag tgggagtcac ctgcttttgc 181 caaagggaac ctgactttca cagctcagta cctaagttat aggatattcc aagataaatg 241 catgaatact accttgacgg aatgtgattt ctcaagtctt tccaagtatg gtgaccacac 301 cttgagagtc agggctgaat ttgcagatga gcattcagac tgggtaaaca tcaccttctg 361 tcctgtggat gacaccatta ttggaccccc tggaatgcaa gtagaagtac ttgctgattc 421 tttacatatg cgtttcttag cccctaaaat tgagaatgaa tacgaaactt ggactatgaa 481 gaatgtgtat aactcatgga cttataatgt gcaatactgg aaaaacggta ctgatgaaaa 541 gtttcaaatt actccccagt atgactttga ggtcctcaga aacctggagc catggacaac 601 ttattgtgtt caagttcgag ggtttcttcc tgatcggaac aaagctgggg aatggagtga 661 gcctgtctgt gagcaaacaa cccatgacga aacggtcccc tcctggatgg tggccgtcat 721 cctcatggcc tcggtcttca tggtctgcct ggcactcctc ggctgcttct ccttgctgtg 781 gtgcgtttac aagaagacaa agtacgcctt ctcccctagg aattctcttc cacagcacct 841 gaaagagttt ttgggccatc ctcatcataa cacacttctg tttttctcct ttccattgtc 901 ggatgagaat gatgtttttg acaagctaag tgtcattgca gaagactctg agagcggcaa 961 gcagaatcct ggtgacagct gcagcctcgg gaccccgcct gggcaggggc cccaaagcta 1021 ggctctgaga aggaaacaca ctcggctggg cacagtgacg tactccatct cacatctgcc 1081 tcagtgaggg atcagggcag caaacaaggg ccaagaccat ctgagccagc cccacatcta 1141 gaactccaga cctggactta gccaccagag agctacattt taaaggctgt cttggcaaaa 1201 atactccatt tgggaactca ctgccttata aaggctttca tgatgttttc agaagttggc 1261 cactgagagt gtaattttca gccttttata tcactaaaat aagatcatgt tttaattgtg 1321 agaaacaggg ccgagcacag tggctcacgc ctgtaatacc agcaccttag aggtcgaggc 1381 aggcggatca cttgaggtca ggagttcaag accagcctgg ccaatatggt gaaacccagt 1441 ctctactaaa aatacaaaaa ttagctaggc atgatggcgc atgcctataa tcccagctac 1501 tcgagtgcct gaggcaggag aattgcatga acccgggagg aggaggagga ggttgcagtg 1561 agccgagata gcggcactgc actccagcct gggtgacaaa gtgagactcc atctcaaaaa 1621 aaaaaaaaaa aaattgtgag aaacagaaat acttaaaatg aggaataaga atggagatgt 1681 tacatctggt agatgtaaca ttctaccaga ttatggatgg actgatctga aaatcgacct 1741 caactcaagg gtggtcagct caatgctaca cagagcacgg acttttggat tctttgcagt 1801 actttgaatt tatttttcta cctatatatg ttttatatgc tgctggtgct ccattaaagt 1861 tttactctgt gttgc // LOCUS HSTRKBMR 2224 bp RNA PRI 23-NOV-1994 DEFINITION H.sapiens trkB mRNA for protein-tyrosine kinase. ACCESSION X75958 NID g473007 KEYWORDS protein-tyrosine kinase; trkB gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2224) AUTHORS Allen,S.J., Dawbarn,D., Eckford,S.D., Wilcock,G.K., Ashcroft,M., Colebrook,S.M., Feeney,R. and MacGowan,S.H. TITLE Cloning of a non-catalytic form of human trkB and distribution of messenger RNA for trkB in human brain JOURNAL Neuroscience 60 (3), 825-834 (1994) MEDLINE 95022162 REFERENCE 2 (bases 1 to 2224) AUTHORS Dawbarn,D. TITLE Direct Submission JOURNAL Submitted (16-DEC-1993) D. Dawbarn, University of Bristol, Department of Medicine, Bristol Royal Infirmary, Bristol, Avon BS2 8HW, UK FEATURES Location/Qualifiers source 1..2224 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="Infant" /tissue_type="Hippocampus" /clone_lib="L ZAPII Hippocampal Library, Stratagene (#936205)" /sex="Female" gene 98..1531 /gene="trkB" sig_peptide 98..190 /gene="trkB" CDS 98..1531 /gene="trkB" /citation=[1] /codon_start=1 /product="protein-tyrosine kinase precursor" /db_xref="PID:g473008" /translation="MSSWIRWHGPAMARLWGFCWLVVGFWRAAFACPTSCKCSASRIW CSDPSPGIVAFPRLEPNSVDPENITEIFIANQKRLEIINEDDVEAYVGLRNLTIVDSG LKFVAHKAFLKNSNLQHINFTRNKLTSLSRKHFRHLDLSELILVGNPFTCSCDIMWIK TLQEAKSSPDTQDLYCLNESSKNIPLANLQIPNCGLPSANLAAPNLTVEEGKSITLSC SVAGDPVPNMYWDVGNLVSKHMNETSHTQGSLRITNISSDDSGKQISCVAENLVGEDQ DSVNLTVHFAPTITFLESPTSDHHWCIPFTVKGNPKPALQWFYNGAILNESKYICTKI HVTNHTEYHGCLQLDNPTHMNNGDYTLIAKNEYGKDEKQISAHFMGWPGIDDGANPNY PDVIYEDYGTAANDIGDTTNRSNEIPSTDVTDKTGREHLSVYAVVVIASVVGFCLLVM LFLLKLARHSKFGMKGFVLFHKIPLDG" mat_peptide 191..1528 /gene="trkB" /citation=[1] /product="protein-tyrosine kinase non-catalytic form (TrkB)" polyA_signal 1536..1541 polyA_signal 1783..1788 polyA_signal 1942..1948 BASE COUNT 620 a 472 c 526 g 606 t ORIGIN 1 cagagccgca agcgcaggga aggcctcccc gcacggtggg ggaaagcggc cggtgcagcg 61 cggggacagg cactcgggct ggcactggct gctagggatg tcgtcctgga taaggtggca 121 tggacccgcc atggcgcggc tctggggctt ctgctggctg gttgtgggct tctggagggc 181 cgctttcgcc tgtcccacgt cctgcaaatg cagtgcctct cggatctggt gcagcgaccc 241 ttctcctggc atcgtggcat ttccgagatt ggagcctaac agtgtagatc ctgagaacat 301 caccgaaatt ttcatcgcaa accagaaaag gttagaaatc atcaacgaag atgatgttga 361 agcttatgtg ggactgagaa atctgacaat tgtggattct ggattaaaat ttgtggctca 421 taaagcattt ctgaaaaaca gcaacctgca gcacatcaat tttacccgaa acaaactgac 481 gagtttgtct aggaaacatt tccgtcacct tgacttgtct gaactgatcc tggtgggcaa 541 tccatttaca tgctcctgtg acattatgtg gatcaagact ctccaagagg ctaaatccag 601 tccagacact caggatttgt actgcctgaa tgaaagcagc aagaatattc ccctggcaaa 661 cctgcagata cccaattgtg gtttgccatc tgcaaatctg gccgcaccta acctcactgt 721 ggaggaagga aagtctatca cattatcctg tagtgtggca ggtgatccgg ttcctaatat 781 gtattgggat gttggtaacc tggtttccaa acacatgaat gaaacaagcc acacacaggg 841 ctccttaagg ataactaaca tttcatccga tgacagtggg aagcagatct cttgtgtggc 901 ggaaaatctt gtaggagaag atcaagattc tgtcaacctc actgtgcatt ttgcaccaac 961 tatcacattt ctcgaatctc caacctcaga ccaccactgg tgcattccat tcactgtgaa 1021 aggcaacccc aaaccagcgc ttcagtggtt ctataacggg gcaatattga atgagtccaa 1081 atacatctgt actaaaatac atgttaccaa tcacacggag taccacggct gcctccagct 1141 ggataatccc actcacatga acaatgggga ctacactcta atagccaaga atgagtatgg 1201 gaaggatgag aaacagattt ctgctcactt catgggctgg cctggaattg acgatggtgc 1261 aaacccaaat tatcctgatg taatttatga agattatgga actgcagcga atgacatcgg 1321 ggacaccacg aacagaagta atgaaatccc ttccacagac gtcactgata aaaccggtcg 1381 ggaacatctc tcggtctatg ctgtggtggt gattgcgtct gtggtgggat tttgcctttt 1441 ggtaatgctg tttctgctta agttggcaag acactccaag tttggcatga aaggttttgt 1501 tttgtttcat aagatcccac tggatgggta gctgaaataa aagaaaagac agagaaaggg 1561 gctgtggtgc ttgttggttg atgctgccat gtaagctgga ctcctgggac tgctgttggc 1621 ttatcccggg aagtgctgct tatctggggt tttctggtag atgtgggcgg tgtttggagg 1681 ctgtactata tgaagcctgc atatactgtg agctgtgatt ggggaacacc aatgcagagg 1741 taactctcag gcagctaagc agcacctcaa gaaaacatgt taaattaatg cttctcttct 1801 tacagtagtt caaatacaaa actgaaatga aatcccattg gattgtactt ctcttctgaa 1861 aagtgtgctt tttgacccta ctggacattt attgacttaa ttgcttctgt ttattaaaat 1921 tgacctgcaa agttaaaaaa aaattaaagt tgagaacagg tataagtgca cactgaatag 1981 tctaatctac atgtaacaca tattttagta tgattttcta tactctaatc agcactgaat 2041 tcagagggtt tgactttttc atctataaca cagtgactaa aagagttaag ggtatatata 2101 ccatcacttt gggacttggt agtattatta aaaggttatt tccttcactg tcaataaaag 2161 tccaaatgtt tagcttaggt ctgagagtca aacaatgtta aggattgtct taaagttcct 2221 tagc // LOCUS HSTRNCTNR 874 bp RNA PRI 07-OCT-1992 DEFINITION H.sapiens mRNA for tetranectin. ACCESSION X64559 NID g37408 KEYWORDS plasminogen kringle 4 binding protein; tetranectin protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 874) AUTHORS Wewer,U.M. TITLE Direct Submission JOURNAL Submitted (17-FEB-1992) U.M. Wewer, Laboratory of Molecular Pathology, University Inst of Pathology Anatomy, University of Copenhagen, Frederik V's Vej II, 2100 Copenhagen, DENMARK REFERENCE 2 (bases 1 to 874) AUTHORS Wewer,U.M. and Albrechtsen,R. TITLE Tetranectin, a plasminogen kringle 4-binding protein: cloning and gene expression in human colon cancer JOURNAL Lab. Invest. 67, 258-262 (1992) FEATURES Location/Qualifiers source 1..874 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /cell_line="Human placenta" /clone_lib="clontech HL 1008" sig_peptide 94..156 CDS 94..702 /note="Plasminogen-kringle 4 binding protein" /codon_start=1 /product="Tetranectin" /db_xref="PID:g37409" /db_xref="SWISS-PROT:P05452" /translation="MELWGAYLLLCLFSLLTQVTTEPPTQKPKKIVNAKKDVVNTKMF EELKSRLDTLAQEVALLKEQQALQTVCLKGTKVHMKCFLAFTQTKTFHEASEDCISRG GTLSTPQTGSENDALYEYLRQSVGNEAEIWLGLNDMAAEGTWVDMTGARIAYKNWETE ITAQPDGGKTENCAVLSGAANGKWFDKRCRDQLPYICQFGIV" BASE COUNT 196 a 265 c 287 g 126 t ORIGIN 1 gggcgggaag acgtgcagcc tgggccgtgg ctgctcactg cgttcggacc cagacccgct 61 gcaggcagca gcagcccccg cccgcgcacg agcatggagc tctggggggc ctacctcctc 121 ctctgcctct tctccctcct gacccaggtc accaccgagc caccaaccca gaagcccaag 181 aagattgtaa atgccaagaa agatgttgtg aacacaaaga tgtttgagga gctcaagagc 241 cgtctggaca ccctggccca ggaggtggcc ctgctgaagg agcagcaggc cctgcagacg 301 gtctgcctga aggggaccaa ggtgcacatg aaatgctttc tggccttcac ccagacgaag 361 accttccacg aggccagcga ggactgcatc tcgcgcgggg gcaccctgag cacccctcag 421 actggctcgg agaacgacgc cctgtatgag tacctgcgcc agagcgtggg caacgaggcc 481 gagatctggc tgggcctcaa cgacatggcg gccgagggca cctgggtgga catgaccggc 541 gcccgcatcg cctacaagaa ctgggagact gagatcaccg cgcaacccga tggcggcaag 601 accgagaact gcgcggtcct gtcaggcgcg gccaacggca agtggttcga caagcgctgc 661 cgcgatcagc tgccctacat ctgccagttc gggatcgtgt agccggcggg gcgggggccg 721 tggggggcct ggaggagggc aggagccgcg ggaggccggg aggagggtgg ggaccttgca 781 gcccccatcc tctccgtgcg cttggagcct ctttttgcaa ataaagttgg tgcacgttcg 841 cggagaggaa aaaaaaaaaa aaaaaaaaaa aaaa // LOCUS HSTROISOA 1569 bp RNA PRI 01-JUN-1995 DEFINITION H.sapiens tropomyosin isoform mRNA, complete CDS. ACCESSION Z24727 NID g854188 KEYWORDS tropomyosin isoform. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1569) AUTHORS Wick,M., Buerger,C., Bruesselbach,S., Lucibello,F.C. and Mueller,R. TITLE Identification of serum-inducible genes: different patterns of gene regulation during G0->S and G1->S progression JOURNAL Unpublished REFERENCE 2 (bases 1 to 1569) AUTHORS Mueller,R. TITLE Direct Submission JOURNAL Submitted (15-JUL-1993) Rolf Mueller, Institut fuer Molekularbiologie und Tumorforschung (IMT), Emil-Mannkopff-Strasse 2, Marburg, 35037, Germany FEATURES Location/Qualifiers source 1..1569 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="tropomyosin isoform" /cell_type="fibroblast" /cell_line="WI-38" CDS 327..1058 /codon_start=1 /product="tropomyosin isoform" /db_xref="PID:g854189" /translation="MLKSWRSGRQITQKGTEDELDKYSEALKDAQEKLELAEKKATDA EADVASLNRRIQLVEEELDRAQERLATALQKLEEAEKAADESERGMKVIESRAQKDEE KMEIQEIQLKEAKHIAEDADRKYEEVARKLVIIESDLERAEERAELSEGQVRQLEEQL RIMDQTLKALMAAEDKYSQKEDRYEEEIKVLSDKLKEAETRAEFAERSVTKLEKSIDD LEEKVLMPKKKTLVCIRCWIRLYWS" polyA_site 1567 BASE COUNT 511 a 279 c 405 g 374 t ORIGIN 1 caaaatctca accatgatct tgagatggca aaggttttaa atacgttttg gaaatatact 61 cattggtata tttcttttga gaaggctgaa atgtagctgg ggacagcagg ttgatcacaa 121 gggacgatga tatgaggtaa gcacacaaga gctatggaca agacaaggtc taaaggattt 181 tgaatacaaa gcagaaatat ttcgaccttc tcatttctgg ggtgggagtg gggagtgttc 241 attaagtaca tatgacaaga gggagtgtgg ggagaaggtg aaacagtaga ctacatttat 301 ggattaagta gggaatgtga acaaagatgt taaagtcatg gcgatccggt agacagatta 361 cacagaaggg gaccgaagat gaactggaca aatactctga ggctctcaaa gatgcccagg 421 agaagctgga gctggcagag aaaaaggcca ccgatgctga agccgacgta gcttctctga 481 acagacgcat ccagctggtt gaggaagagt tggatcgtgc ccaggagcgt ctggcaacag 541 ctttgcagaa gctggaggaa gctgagaagg cagcagatga gagtgagaga ggcatgaaag 601 tcattgagag tcgagcccaa aaagatgaag aaaaaatgga aattcaggag atccaactga 661 aagaggccaa gcacattgct gaagatgccg accgcaaata tgaagaggtg gcccgtaagc 721 tggtcatcat tgagagcgac ctggaacgtg cagaggagcg ggctgagctc tcagaaggcc 781 aagtccgaca gctggaagaa caattaagaa taatggatca gaccttgaaa gcattaatgg 841 ctgcagagga taagtactcg cagaaggaag acagatatga ggaagagatc aaggtccttt 901 ccgacaagct gaaggaggct gagactcggg ctgagtttgc ggagaggtca gtaactaaat 961 tggagaaaag cattgatgac ttagaagaga aagtgctcat gccaaagaag aaaaccttag 1021 tatgcatcag atgctggatc agactttact ggagttaaac aacatgtgaa aacctcctta 1081 gctgcgacca cattctttca ttttgttttg ttttgttttg tttttaaaca cctgcttacc 1141 ccttaaatgc aatttattta cttttaccac tgtcacagaa acatccacaa gataccagct 1201 aggtcagggg gtggggaaaa cacatacaaa aagcaagccc atgtcagggc gatcctggtt 1261 caaatgtgcc atttcccggg ttgatgctgc cacactttgt agagagttta gcaacacagt 1321 gtgcttagtc agtgtaggaa tcctcactaa agcagaagaa gttccattcc tttctgattg 1381 gcacacgtgc agctcatgac aatctgtagg ataacaatca gtgtggattt ccactctttt 1441 cagtccttca tgttaaagat ttagacacca catacaactg gtaaaggacg ttttcttgag 1501 agttttaact atatgtaaac attgtataat gatatggaat aaaatgcaca ttttaggaca 1561 ttttctaaa // LOCUS HSTROPONC 787 bp RNA PRI 04-FEB-1994 DEFINITION Human mRNA for cardiac troponin I. ACCESSION X54163 NID g37427 KEYWORDS troponin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 787) AUTHORS Barton,P.J.R. TITLE Direct Submission JOURNAL Submitted (30-JUL-1990) Barton P.J.R., National Heart and Lung Institute, Dovehouse Street, London SW3 6LY, UK REFERENCE 2 (bases 1 to 787) AUTHORS Vallins,W.J., Brand,N.J., Dabhade,N., Butler-Browne,G., Yacoub,M.H. and Barton,P.J. TITLE Molecular cloning of human cardiac troponin I using polymerase chain reaction JOURNAL FEBS Lett. 270 (1-2), 57-61 (1990) MEDLINE 91032031 COMMENT Data kindly reviewed (09-JAN-1991) by Barton P. Data kindly reviewed (09-JAN-1991) by Barton P. FEATURES Location/Qualifiers source 1..787 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="cardiac" CDS 90..722 /note="troponin I" /codon_start=1 /db_xref="PID:g37428" /db_xref="SWISS-PROT:P19429" /translation="MADGSSDAAREPRPAPAPIRRRSSNYRAYATEPHAKKKSKISAS RKLQLKTLLLQIAKQELEREAEERRGEKGRALSTRCQPLELTGLGFAELQDLCRQLHA RVDKVDEERYDIEAKVTKNITEIADLTQKIFDLRGKFKRPTLRRVRISADAMMQALLG ARAKESLDLRAHLKQVKKEDTEKENREVGDWRKNIDALSGMEGRKKKFES" BASE COUNT 187 a 229 c 249 g 122 t ORIGIN 1 ctgaaggtca cccgggcggc cccctcactg accctccaaa cgcccctgtc ctcgccctgc 61 ctcctgccat tcccggcctg agtctcagca tggcggatgg gagcagcgat gcggctaggg 121 aacctcgccc tgcaccagcc ccaatcagac gccgctcctc caactaccgc gcttatgcca 181 cggagccgca cgccaagaaa aaatctaaga tctccgcctc gagaaaattg cagctgaaga 241 ctctgctgct gcagattgca aagcaagagc tggagcgaga ggcggaggag cggcgcggag 301 agaaggggcg cgctctgagc acccgctgcc agccgctgga gttgaccggg ctgggcttcg 361 cggagctgca ggacttgtgc cgacagctcc acgcccgtgt ggacaaggtg gatgaagaga 421 gatacgacat agaggcaaaa gtcaccaaga acatcacgga gattgcagat ctgactcaga 481 agatctttga ccttcgaggc aagtttaagc ggcccaccct gcggagagtg aggatctctg 541 cagatgccat gatgcaggcg ctgctggggg cccgggctaa ggagtccctg gacctgcggg 601 cccacctcaa gcaggtgaag aaggaggaca ccgagaagga aaaccgggag gtgggagact 661 ggcggaagaa catcgatgca ctgagtggaa tggagggccg caagaaaaag tttgagagct 721 gagccttcct gcctactgcc cctgccctga ggagggccac tgaggaataa agcttctctc 781 tgagctg // LOCUS HSTRR 5010 bp RNA PRI 11-APR-1995 DEFINITION Human mRNA for transferrin receptor. ACCESSION X01060 NID g37432 KEYWORDS transferrin receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5010) AUTHORS Schneider,C., Owen,M.J., Banville,D. and Williams,J.G. TITLE Primary structure of human transferrin receptor deduced from the mRNA sequence JOURNAL Nature 311 (5987), 675-678 (1984) MEDLINE 85012743 COMMENT Data kindly reviewed (19-FEB-1986) by C. Schneider. FEATURES Location/Qualifiers source 1..5010 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 264..2546 /note="put. transferrin receptor (aa 1-760)" /codon_start=1 /db_xref="PID:g37433" /db_xref="SWISS-PROT:P02786" /translation="MMDQARSAFSNLFGGEPLSYTRFSLARQVDGDNSHVEMKLAVDE EENADNNTKANVTKPKRCSGSICYGTIAVIVFFLIGFMIGYLGYCKGVEPKTECERLA GTESPVREEPGEDFPAARRLYWDDLKRKLSEKLDSTDFTSTIKLLNENSYVPREAGSQ KDENLALYVENQFREFKLSKVWRDQHFVKIQVKDSAQNSVIIVDKNGRLVYLVENPGG YVAYSKAATVTGKLVHANFGTKKDFEDLYTPVNGSIVIVRAGKITFAEKVANAESLNA IGVLIYMDQTKFPIVNAELSFFGHAHLGTGDPYTPGFPSFNHTQFPPSRSSGLPNIPV QTISRAAAEKLFGNMEGDCPSDWKTDSTCRMVTSESKNVKLTVSNVLKEIKILNIFGV IKGFVEPDHYVVVGAQRDAWGPGAAKSGVGTALLLKLAQMFSDMVLKDGFQPSRSIIF ASWSAGDFGSVGATEWLEGYLSSLHLKAFTYINLDKAVLGTSNFKVSASPLLYTLIEK TMQNVKHPVTGQFLYQDSNWASKVEKLTLDNAAFPFLAYSGIPAVSFCFCEDTDYPYL GTTMDTYKELIERIPELNKVARAAAEVAGQFVIKLTHDVELNLDYERYNSQLLSFVRD LNQYRADIKEMGLSLQWLYSARGDFFRATSRLTTDFGNAEKTDRFVMKKLNDRVMRVE YHFLSPYVSPKESPFRHVFWGSGSHTLPALLENLKLRKQNNGAFNETLFRNQLALATW TIQGAANALSGDVWDIDNEF" misc_feature 456..527 /note="put. transmembrane segment (aa 63-88)" misc_feature 1014..1022 /note="pot. extracellular glycosylation site (aa 251)" misc_feature 1212..1220 /note="pot. extracellular glycosylation site (aa 317)" misc_feature 2442..2450 /note="pot. extracellular glycosylation site (aa 727)" BASE COUNT 1399 a 977 c 1155 g 1479 t ORIGIN 1 ggcggctcgg gacggaggac gcgctagtgt gagtgcgggc ttctagaact acaccgaccc 61 tcgtgtcctc ccttcatcct gcggggctgg ctggagcggc cgctccggtg ctgtccagca 121 gccataggga gccgcacggg gagcgggaaa gcggtcgcgg ccccaggcgg ggcggccggg 181 atggagcggg gccgcgagcc tgtggggaag gggctgtggc ggcgcctcga gcggctgcag 241 gttcttctgt gtggcagttc agaatgatgg atcaagctag atcagcattc tctaacttgt 301 ttggtggaga accattgtca tatacccggt tcagcctggc tcggcaagta gatggcgata 361 acagtcatgt ggagatgaaa cttgctgtag atgaagaaga aaatgctgac aataacacaa 421 aggccaatgt cacaaaacca aaaaggtgta gtggaagtat ctgctatggg actattgctg 481 tgatcgtctt tttcttgatt ggatttatga ttggctactt gggctattgt aaaggggtag 541 aaccaaaaac tgagtgtgag agactggcag gaaccgagtc tccagtgagg gaggagccag 601 gagaggactt ccctgcagca cgtcgcttat attgggatga cctgaagaga aagttgtcgg 661 agaaactgga cagcacagac ttcaccagca ccatcaagct gctgaatgaa aattcatatg 721 tccctcgtga ggctggatct caaaaagatg aaaatcttgc gttgtatgtt gaaaatcaat 781 ttcgtgaatt taaactcagc aaagtctggc gtgatcaaca ttttgttaag attcaggtca 841 aagacagcgc tcaaaactcg gtgatcatag ttgataagaa cggtagactt gtttacctgg 901 tggagaatcc tgggggttat gtggcgtata gtaaggctgc aacagttact ggtaaactgg 961 tccatgctaa ttttggtact aaaaaagatt ttgaggattt atacactcct gtgaatggat 1021 ctatagtgat tgtcagagca gggaaaatca cctttgcaga aaaggttgca aatgctgaaa 1081 gcttaaatgc aattggtgtg ttgatataca tggaccagac taaatttccc attgttaacg 1141 cagaactttc attctttgga catgctcatc tggggacagg tgacccttac acacctggat 1201 tcccttcctt caatcacact cagtttccac catctcggtc atcaggattg cctaatatac 1261 ctgtccagac aatctccaga gctgctgcag aaaagctgtt tgggaatatg gaaggagact 1321 gtccctctga ctggaaaaca gactctacat gtaggatggt aacctcagaa agcaagaatg 1381 tgaagctcac tgtgagcaat gtgctgaaag agataaaaat tcttaacatc tttggagtta 1441 ttaaaggctt tgtagaacca gatcactatg ttgtagttgg ggcccagaga gatgcatggg 1501 gccctggagc tgcaaaatcc ggtgtaggca cagctctcct attgaaactt gcccagatgt 1561 tctcagatat ggtcttaaaa gatgggtttc agcccagcag aagcattatc tttgccagtt 1621 ggagtgctgg agactttgga tcggttggtg ccactgaatg gctagaggga tacctttcgt 1681 ccctgcattt aaaggctttc acttatatta atctggataa agcggttctt ggtaccagca 1741 acttcaaggt ttctgccagc ccactgttgt atacgcttat tgagaaaaca atgcaaaatg 1801 tgaagcatcc ggttactggg caatttctat atcaggacag caactgggcc agcaaagttg 1861 agaaactcac tttagacaat gctgctttcc ctttccttgc atattctgga atcccagcag 1921 tttctttctg tttttgcgag gacacagatt atccttattt gggtaccacc atggacacct 1981 ataaggaact gattgagagg attcctgagt tgaacaaagt ggcacgagca gctgcagagg 2041 tcgctggtca gttcgtgatt aaactaaccc atgatgttga attgaacctg gactatgaga 2101 ggtacaacag ccaactgctt tcatttgtga gggatctgaa ccaatacaga gcagacataa 2161 aggaaatggg cctgagttta cagtggctgt attctgctcg tggagacttc ttccgtgcta 2221 cttccagact aacaacagat ttcgggaatg ctgagaaaac agacagattt gtcatgaaga 2281 aactcaatga tcgtgtcatg agagtggagt atcacttcct ctctccctac gtatctccaa 2341 aagagtctcc tttccgacat gtcttctggg gctccggctc tcacacgctg ccagctttac 2401 tggagaactt gaaactgcgt aaacaaaata acggtgcttt taatgaaacg ctgttcagaa 2461 accagttggc tctagctact tggactattc agggagctgc aaatgccctc tctggtgacg 2521 tttgggacat tgacaatgag ttttaaatgt gatacccata gcttccatga gaacagcagg 2581 gtagtctggt ttctagactt gtgctgatcg tgctaaattt tcagtagggc tacaaaacct 2641 gatgttaaaa ttccatccca tcatcttggt actactagat gtctttaggc agcagctttt 2701 aatacagggt agataacctg tacttcaagt taaagtgaat aaccacttaa aaaatgtcca 2761 tgatggaata ttcccctatc tctagaattt taagtgcttt gtaatgggaa ctgcctcttt 2821 cctgttgttg ttaatgaaaa tgtcagaaac cagttatgtg aatgatctct ctgaatccta 2881 agggctggtc tctgctgaag gttgtaagtg gttcgcttac tttgagtgat cctccaactt 2941 catttgatgc taaataggag ataccaggtt gaaagacctc tccaaatgag atctaagcct 3001 ttccataagg aatgtagcag gtttcctcat tcctgaaaga aacagttaac tttcagaaga 3061 gatgggcttg ttttcttgcc aatgaggtct gaaatggagg tccttctgct ggataaaatg 3121 aggttcaact gttgattgca ggaataaggc cttaatatgt taacctcagt gtcatttatg 3181 aaaagagggg accagaagcc aaagacttag tatattttct tttcctctgt cccttccccc 3241 ataagcctcc atttagttct ttgttatttt tgtttcttcc aaagcacatt gaaagagaac 3301 cagtttcagg tgtttagttg cagactcagt ttgtcagact ttaaagaata atatgctgcc 3361 aaattttggc caaagtgtta atcttagggg agagctttct gtccttttgg cactgagata 3421 tttattgttt atttatcagt gacagagttc actataaatg gtgttttttt aatagaatat 3481 aattatcgga agcagtgcct tccataatta tgacagttat actgtcggtt ttttttaaat 3541 aaaagcagca tctgctaata aaacccaaca gatactggaa gttttgcatt tatggtcaac 3601 acttaagggt tttagaaaac agccgtcagc caaatgtaat tgaataaagt tgaagctaag 3661 atttagagat gaattaaatt taattagggg ttgctaagaa gcgagcactg accagataag 3721 aatgctggtt ttcctaaatg cagtgaattg tgaccaagtt ataaatcaat gtcacttaaa 3781 ggctgtggta gtactcctgc aaaattttat agctcagttt atccaaggtg taactctaat 3841 tcccatttgc aaaatttcca gtacctttgt cacaatccta acacattatc gggagcagtg 3901 tcttccataa tgtataaaga acaaggtagt ttttacctac cacagtgtct gtatcggaga 3961 cagtgatctc catatgttac actaagggtg taagtaatta tcgggaacag tgtttcccat 4021 aattttcttc atgcaatgac atcttcaaag cttgaagatc gttagtatct aacatgtatc 4081 ccaactccta taattcccta tcttttagtt ttagttgcag aaacattttg tggtcattaa 4141 gcattgggtg ggtaaattca accactgtaa aatgaaatta ctacaaaatt tgaaatttag 4201 cttgggtttt tgttaccttt atggtttctc caggtcctct acttaatgag atagcagcat 4261 acatttataa tgtttgctat tgacaagtca ttttaattta tcacattatt tgcatgttac 4321 ctcctataaa cttagtgcgg acaagtttta atccagaatt gaccttttga cttaaagcag 4381 agggactttg tatagaaggt ttgggggctg tggggaagga gagtcccctg aaggtctgac 4441 acgtctgcct acccattcgt ggtgatcaat taaatgtagg tatgaataag ttcgaagctc 4501 cgtgagtgaa ccatcatata aacgtgtagt acagctgttt gtcatagggc agttggaaac 4561 ggcctcctag ggaaaagttc atagggtctc ttcaggttct tagtgtcact tacctagatt 4621 tacagcctca cttgaatgtg tcactactca cagtctcttt aatcttcagt tttatcttta 4681 atctcctctt ttatcttgga ctgacattta gcgtagctaa gtgaaaaggt catagctgag 4741 attcctggtt cgggtgttac gcacacgtac ttaaatgaaa gcatgtggca tgttcatcgt 4801 ataacacaat atgaatacag ggcatgcatt ttgcagcagt gagtctcttc agaaaaccct 4861 tttctacagt tagggttgag ttacttccta tcaagccagt acgtgctaac aggctcaata 4921 ttcctgaatg aaatatcaga ctagtgacaa gctcctggtc ttgagatgtc ttctcgttaa 4981 ggagtagggc cttttggagg taaaggtata // LOCUS HSTRYIII 807 bp RNA PRI 22-MAR-1995 DEFINITION Human mRNA for pancreatic trypsinogen III. ACCESSION X15505 NID g37459 KEYWORDS protease; trypsinogen. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 807) AUTHORS Tani,T. TITLE Direct Submission JOURNAL Submitted (08-JUN-1989) Tani T., Dept. of Biology, Faculty of Science, Kyushu Univ., Hakozaki, Higashi-ku, Fukuoka 812, Japan REFERENCE 2 (bases 1 to 807) AUTHORS Tani,T., Kawashima,I., Mita,K. and Takiguchi,Y. TITLE Nucleotide sequence of the human pancreatic trypsinogen III cDNA JOURNAL Nucleic Acids Res. 18 (6), 1631 (1990) MEDLINE 90221895 FEATURES Location/Qualifiers source 1..807 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="pancreas" CDS 14..757 /note="prepro-polypeptide (AA -13 to 234)" /codon_start=1 /db_xref="PID:g37460" /db_xref="SWISS-PROT:P15951" /translation="MNPFLILAFVGAAVAVPFDDDDKIVGGYTCEENSLPYQVSLNSG SHFCGGSLISEQWVVSAAHCYKTRIQVRLGEHNIKVLEGNEQFINAAKIIRHPKYNRD TLDNDIMLIKLSSPAVINARVSTISLPTAPPAAGTECLISGWGNTLSFGADYPDELKC LDAPVLREAECKASCPGKITNSMFCVGFLEGGKDSWKRDSGGPVVCNGQLQGVVSWGH GCAWKNRPGVYTKVYNYVDWIKDTIAANS" sig_peptide 14..52 /note="signal peptide (AA -13 to -1)" misc_feature 53..82 /note="activation peptide (AA 1 to 10)" mat_peptide 83..754 /note="mature trypsinogen (AA 11 to 234)" BASE COUNT 175 a 241 c 209 g 182 t ORIGIN 1 acactctacc accatgaatc cattcctgat ccttgccttt gtgggagctg ctgttgctgt 61 cccctttgac gatgatgaca agattgttgg gggctacacc tgtgaggaga attctctccc 121 ctaccaggtg tccctgaatt ctggctccca cttctgcggt ggctccctca tcagcgaaca 181 gtgggtggta tcagcagctc actgctacaa gacccgcatc caggtgagac tgggagagca 241 caacatcaaa gtcctggagg ggaatgagca gttcatcaat gcggccaaga tcatccgcca 301 ccctaaatac aacagggaca ctctggacaa tgacatcatg ctgatcaaac tctcctcacc 361 tgccgtcatc aatgcccgcg tgtccaccat ctctctgccc accgcccctc cagctgctgg 421 cactgagtgc ctcatctccg gctggggcaa cactctgagc tttggtgctg actacccaga 481 cgagctgaag tgcctggatg ctccggtgct gagggaggct gagtgtaaag cctcctgccc 541 tggaaagatt accaacagca tgttctgtgt gggcttcctt gagggaggca aggattcctg 601 gaagcgtgac tctggtggcc ctgtggtctg caacggacag ctccaaggag ttgtctcctg 661 gggccatggc tgtgcctgga agaacaggcc tggagtctac accaaggtct acaactatgt 721 ggactggatt aaggacacca tcgctgccaa cagctaaagc ccccggtccc tctgcagtct 781 ctataccaat aaagtggccc tgctctc // LOCUS HSTSC2 5474 bp RNA PRI 24-JAN-1994 DEFINITION H.sapiens TSC2 mRNA for tuberin. ACCESSION X75621 NID g450351 KEYWORDS TSC2 gene; tuberin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5474) AUTHORS Nellist,M., Janssen,B., Brook-Carter,P.T., Hesseling-Janssen,A.L.W., Maheshwar,M.M., Verhoef,S., Van den Ouweland,A.M.W., Lindhout,D., Eussen,B., Cordeiro,I., Santos,H., Halley,D.J.J., Sampson,J.R., Ward,C.J., Peral,B., Thomas,S., Hughes,J., Harris,P.C., Roelfsema,J.H., Saris,J.J., Spruit,L., Peters,D.J.M., Dauwerse,J.G. and Breuning,M.H. TITLE Identification and characterization of the tuberous sclerosis gene on chromosome 16 JOURNAL Cell 75, 1305-1315 (1993) MEDLINE 94094325 REFERENCE 2 (bases 1 to 5474) AUTHORS Janssen. TITLE Direct Submission JOURNAL Submitted (03-NOV-1993) Janssen, Clinical Genetics Dept. Rotterdam, Dr. Molewaterplein 50, 3015 Ge Rotterdam, NETHERLANDS FEATURES Location/Qualifiers source 1..5474 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="brain" /chromosome="16" /map="p13.3" gene 19..5373 /gene="TSC2" CDS 19..5373 /gene="TSC2" /codon_start=1 /product="tuberin" /db_xref="PID:g450352" /translation="MAKPTSKDSGLKEKFKILLGLGTPRPNPRSAEGKQTEFIITAEI LRELSMECGLNNRIRMIGQICEVAKTKKFEEHAVEALWKAVADLLQPERTLEARHAVL ALLKAIVQGQGERLGVLRALFFKVIKDYPSNEDLHERLEVFKALTDNGRHITYLEEEL ADFVLQWMDVGLSSEFLLVLVNLVKFNSCYLDEYIARMVQMICLLCVRTASSVDIEVS LQVLDAVVCYNCLPAESLPLFIVTLCRTINVKELCEPCWKLMRNLLGTHLGHSAIYNM CHLMEDRAYMEDAPLLRGAVFFVGMALWGAHRLYSLRNSPTSVFPSFYQAMACPNEVV SYEIVLSITRLIKKYRKELQVVAWDILLNIIERLLQQLQTLDSPELRTIVHDLLTTVE ELCDQNEFHGSQERYFELVERCADQRPESSLLNLISYRAQSIHPAKDGWIQNLQALME RFFRSESRGAVRIKVLDVLSFVLLINRQFYEEELINSVVISQLSHIPEDKDHQVRKLA TQLLVDLAEGCHTHHFNSLLDIIEKVMARSLSPPPELEERDVAAYSASLEDVKTAVLG LLVILQTKLYTLPASHATRVYEMLVSHIQLHYKHSYTLPIASSIRLQAFDFLLLLRAD SLHRLGLPNKDGVVRFSPYCVCDYMEPERGSEKKTSGPLSPPTGPPGPAPAGPAVRLG SVPYSLLFRVLLQCLKQESDWKVLKLVLGRLPESLRYKVLIFTSPCSVDQLCSALCSM LSGPKTLERLRGAPEGFSRTDLHLAVVPVLTALISYHNYLDKTKQREMVYCLEQGLIH RCARQCVVALSICSVEMPDIIIKALPVLVVKLTHISATASMAVPLLEFLSTLARLPHL YRNFAAEQYASVFAISLPYTNPSKFNQYIVCLAHHVIAMWFIRCRLPFRKEFVPFITK GLRSNVLLSFDDTPEKDSFRARSTSLNERPKSLRIARPPKQGLNNSPPVKEFKESSAA EAFRCRSISVSEHVVRSRIQTSLTSASLGSADENSVAQADDSLKNLHLELTETCLDMM ARYVFSNFTAVPKRSPVGEFLLAGGRTKTWLVGNKLVTVTTSVGTGTRSLLGLDSGEL QSGPESSSSPGVHVRQTKEAPAKLESQAGQQVSRGARDRVRSMSGGHGLRVGALDVPA SQFLGSATSPGPRTAPAAKPEKASAGTRVPVQEKTNLAAYVPLLTQGWAEILVRRPTG NTSWLMSLENPLSPFSSDINNMPLQELSNALMAAERFKEHRDTALYKSLSVPAASTAK PPPLPRSNTDSAVVMEEGSPGEVPVLVEPPGLEDVEAALGMDRRTDAYSRSSSVSSQE EKSLHAEELVGRGIPIERVVSSEGGRPSVDLSFQPSQPLSKSSSSPELQTLQDILGDP GDKADVGRLSPEVKARSQSGTLDGESAAWSASGEDSRGQPEGPLPSSSPRSPSGLRPR GYTISDSAPSRRGKRVERDALKSRATASNAEKVPGINPSFVFLQLYHSPFFGDESNKP ILLPNESQSFERSVQLLDQIPSYDTHKIAVLYVGEGQSNSELAILSNEHGSYRYTEFL TGLGRLIELKDCQPDKVYLGGLDVCGEDGQFTYCWHDDIMQAVFHIATLMPTKDVDKH RCDKKRHLGNDFVSIVYNDSGEDFKLGTIKGQFNFVHVIVTPLDYECNLVSLQCRKDM EGLVDTSVAKIVSDRNLPFVARQMALHANMASQVHHSRSNPTDIYPSKWIARLRHIKR LRQRICEEAAYSNPSLPLVHPPSHSKAPAQTPAEPTPGYEVGQRKRLISSVEDFTEFV " BASE COUNT 1122 a 1708 c 1580 g 1064 t ORIGIN 1 ggtgcgtcct ggtccaccat ggccaaacca acaagcaaag attcaggctt gaaggagaag 61 tttaagattc tgttgggact gggaacaccg aggccaaatc ccaggtctgc agagggtaaa 121 cagacggagt ttatcatcac cgcggaaata ctgagagaac tgagcatgga atgtggcctc 181 aacaatcgca tccggatgat agggcagatt tgtgaagtcg caaaaaccaa gaaatttgaa 241 gagcacgcag tggaagcact ctggaaggcg gtcgcggatc tgttgcagcc ggagcggacg 301 ctggaggccc ggcacgcggt gctggctctg ctgaaggcca tcgtgcaggg gcagggcgag 361 cgtttggggg tcctcagagc cctcttcttt aaggtcatca aggattaccc ttccaacgaa 421 gaccttcacg aaaggctgga ggttttcaag gccctcacag acaatgggag acacatcacc 481 tacttggagg aagagctggc tgactttgtc ctgcagtgga tggatgttgg cttgtcctcg 541 gaattccttc tggtgctggt gaacttggtc aaattcaata gctgttacct cgacgagtac 601 atcgcaagga tggttcagat gatctgtctg ctgtgcgtcc ggaccgcgtc ctctgtggac 661 atagaggtct ccctgcaggt gctggacgcc gtggtctgct acaactgcct gccggctgag 721 agcctcccgc tgttcatcgt taccctctgt cgcaccatca acgtcaagga gctctgcgag 781 ccttgctgga agctgatgcg gaacctcctt ggcacccacc tgggccacag cgccatctac 841 aacatgtgcc acctcatgga ggacagagcc tacatggagg acgcgcccct gctgagagga 901 gccgtgtttt ttgtgggcat ggctctctgg ggagcccacc ggctctattc tctcaggaac 961 tcgccgacat ctgtgtttcc atcattttac caggccatgg catgtccgaa cgaggtggtg 1021 tcctatgaga tcgtcctgtc catcaccagg ctcatcaaga agtataggaa ggagctccag 1081 gtggtggcgt gggacattct gctgaacatc atcgaacggc tccttcaaca gctccagacc 1141 ttggacagcc cggagctcag gaccatcgtc catgacctgt tgaccacggt ggaggagctg 1201 tgtgaccaga acgagttcca cgggtctcag gagagatact ttgaactggt ggagagatgt 1261 gcggaccaga ggcctgagtc ctccctcctg aacctgatct cctatagagc gcagtccatc 1321 cacccggcca aggacggctg gattcagaac ctgcaggcgc tgatggagag attcttcagg 1381 agcgagtccc gaggcgccgt gcgcatcaag gtgctggacg tgctgtcctt tgtgctgctc 1441 atcaacaggc agttctatga ggaggagctg attaactcag tggtcatctc gcagctctcc 1501 cacatccccg aggataaaga ccaccaggtc cgaaagctgg ccacccagtt gctggtggac 1561 ctggcagagg gctgccacac acaccacttc aacagcctgc tggacatcat cgagaaggtg 1621 atggcccgtt ccctctcccc acccccggag ctggaagaaa gggatgtggc cgcatactcg 1681 gcctccttgg aggatgtgaa gacagccgtc ctggggcttc tggtcatcct tcagaccaag 1741 ctgtacaccc tgcctgcaag ccacgccacg cgtgtgtatg agatgctggt cagccacatt 1801 cagctccact acaagcacag ctacaccctg ccaatcgcga gcagcatccg gctgcaggcc 1861 tttgacttcc tgttgctgct gcgggccgac tcactgcacc gcctgggcct gcccaacaag 1921 gatggagtcg tgcggttcag cccctactgc gtctgcgact acatggagcc agagagaggc 1981 tctgagaaga agaccagcgg ccccctttct cctcccacag ggcctcctgg cccggcgcct 2041 gcaggccccg ccgtgcggct ggggtccgtg ccctactccc tgctcttccg cgtcctgctg 2101 cagtgcttga agcaggagtc tgactggaag gtgctgaagc tggttctggg caggctgcct 2161 gagtccctgc gctataaagt gctcatcttt acttcccctt gcagtgtgga ccagctgtgc 2221 tctgctctct gctccatgct ttcaggccca aagacactgg agcggctccg aggcgcccca 2281 gaaggcttct ccagaactga cttgcacctg gccgtggttc cagtgctgac agcattaatc 2341 tcttaccata actacctgga caaaaccaaa cagcgcgaga tggtctactg cctggagcag 2401 ggcctcatcc accgctgtgc cagacagtgc gtcgtggcct tgtccatctg cagcgtggag 2461 atgcctgaca tcatcatcaa ggcgctgcct gttctggtgg tgaagctcac gcacatctca 2521 gccacagcca gcatggccgt cccactgctg gagttcctgt ccactctggc caggctgccg 2581 cacctctaca ggaactttgc cgcggagcag tatgccagtg tgttcgccat ctccctgccg 2641 tacaccaacc cctccaagtt taatcagtac atcgtgtgtc tggcccatca cgtcatagcc 2701 atgtggttca tcaggtgccg cctgcccttc cggaaggaat ttgtcccttt catcactaag 2761 ggcctgcggt ccaatgtcct cttgtctttt gatgacaccc ccgagaagga cagcttcagg 2821 gcccggagta ctagtctcaa cgagagaccc aagagtctga ggatagccag accccccaaa 2881 caaggcttga ataactctcc acccgtgaaa gaattcaagg agagctctgc agccgaggcc 2941 ttccggtgcc gcagcatcag tgtgtctgaa catgtggtcc gcagcaggat acagacgtcc 3001 ctcaccagtg ccagcttggg gtctgcagat gagaactccg tggcccaggc tgacgatagc 3061 ctgaaaaacc tccacctgga gctcacggaa acctgtctgg acatgatggc tcgatacgtc 3121 ttctccaact tcacggctgt cccgaagagg tctcctgtgg gcgagttcct cctagcgggt 3181 ggcaggacca aaacctggct ggttgggaac aagcttgtca ctgtgacgac aagcgtggga 3241 accgggaccc ggtcgttact aggcctggac tcgggggagc tgcagtccgg cccggagtcg 3301 agctccagcc ccggggtgca tgtgagacag accaaggagg cgccggccaa gctggagtcc 3361 caggctgggc agcaggtgtc ccgtggggcc cgggatcggg tccgttccat gtcggggggc 3421 catggtcttc gagttggcgc cctggacgtg ccggcctccc agttcctggg cagtgccact 3481 tctccaggac cacggactgc accagccgcg aaacctgaga aggcctcagc tggcacccgg 3541 gttcctgtgc aggagaagac gaacctggcg gcctatgtgc ccctgctgac ccagggctgg 3601 gcggagatcc tggtccggag gcccacaggg aacaccagct ggctgatgag cctggagaac 3661 ccgctcagcc ctttctcctc ggacatcaac aacatgcccc tgcaggagct gtctaacgcc 3721 ctcatggcgg ctgagcgctt caaggagcac cgggacacag ccctgtacaa gtcactgtcg 3781 gtgccggcag ccagcacggc caaaccccct cctctgcctc gctccaacac agactccgcc 3841 gtggtcatgg aggagggaag tccgggcgag gttcctgtgc tggtggagcc cccagggttg 3901 gaggacgttg aggcagcgct aggcatggac aggcgcacgg atgcctacag caggtcgtcc 3961 tcagtctcca gccaggagga gaagtcgctc cacgcggagg agctggttgg caggggcatc 4021 cccatcgagc gagtcgtctc ctcggagggt ggccggccct ctgtggacct ctccttccag 4081 ccctcgcagc ccctgagcaa gtccagctcc tctcccgagc tgcagactct gcaggacatc 4141 ctcggggacc ctggggacaa ggccgacgtg ggccggctga gccctgaggt taaggcccgg 4201 tcacagtcag ggaccctgga cggggaaagt gctgcctggt cggcctcggg cgaagacagt 4261 cggggccagc ccgagggtcc cttgccttcc agctcccccc gctcgcccag tggcctccgg 4321 ccccgaggtt acaccatctc cgactcggcc ccatcacgca ggggcaagag agtagagagg 4381 gacgccttaa agagcagagc cacagcctcc aatgcagaga aagtgccagg catcaacccc 4441 agtttcgtgt tcctgcagct ctaccattcc cccttctttg gcgacgagtc aaacaagcca 4501 atcctgctgc ccaatgagtc acagtccttt gagcggtcgg tgcagctcct cgaccagatc 4561 ccatcatacg acacccacaa gatcgccgtc ctgtatgttg gagaaggcca gagcaacagc 4621 gagctcgcca tcctgtccaa tgagcatggc tcctacaggt acacggagtt cctgacgggc 4681 ctgggccggc tcatcgagct gaaggactgc cagccggaca aggtgtacct gggaggcctg 4741 gacgtgtgtg gtgaggacgg ccagttcacc tactgctggc acgatgacat catgcaagcc 4801 gtcttccaca tcgccaccct gatgcccacc aaggacgtgg acaagcaccg ctgcgacaag 4861 aagcgccacc tgggcaacga ctttgtgtcc attgtctaca atgactccgg tgaggacttc 4921 aagcttggca ccatcaaggg ccagttcaac tttgtccacg tgatcgtcac cccgctggac 4981 tacgagtgca acctggtgtc cctgcagtgc aggaaagaca tggagggcct tgtggacacc 5041 agcgtggcca agatcgtgtc tgaccgcaac ctgcccttcg tggcccgcca gatggccctg 5101 cacgcaaata tggcctcaca ggtgcatcat agccgctcca accccaccga tatctacccc 5161 tccaagtgga ttgcccggct ccgccacatc aagcggctcc gccagcggat ctgcgaggaa 5221 gccgcctact ccaaccccag cctacctctg gtgcaccctc cgtcccatag caaagcccct 5281 gcacagactc cagccgagcc cacacctggc tatgaggtgg gccagcggaa gcgcctcatc 5341 tcctcggtgg aggacttcac cgagtttgtg tgaggccggg gccctccctc ctgcactggc 5401 cttggacggt attgcctgtc agtgaaataa ataaagtcct gaccccagtg cacagacata 5461 gaggcacaga ttgc // LOCUS HSTSPM 808 bp RNA PRI 06-JUL-1994 DEFINITION H.sapiens tissue specific mRNA. ACCESSION X67698 NID g37476 KEYWORDS tissue specific sequence. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 808) AUTHORS Kirchhoff,C. TITLE Direct Submission JOURNAL Submitted (21-AUG-1992) C. Kirchhoff, Institute for Hormone & Fertility Res, Grandweg 64, 2000 Hamburg 54, FRG REFERENCE 2 (bases 1 to 808) AUTHORS Krull,N., Ivell,R., Osterhoff,C. and Kirchhoff,C. TITLE Region-specific variation of gene expression in the human epididymis as revealed by in situ hybridization with tissue-specific cDNAs JOURNAL Mol. Reprod. Dev. 34 (1), 16-24 (1993) MEDLINE 93119659 FEATURES Location/Qualifiers source 1..808 /organism="Homo sapiens" /isolate="patient 3" /db_xref="taxon:9606" /tissue_type="epididymis" /cell_type="principal cells" /clone_lib="lambda gt11" /clone="HE1, 19.5" CDS 11..466 /note="orf" /codon_start=1 /db_xref="PID:g37477" /translation="MRFLAATFLLLALSTAAQAEPVQFKDCGSVDGVIKEVNVSPCPT QPCQLSKGQSYSVNVTFTSNIQSKSSKAVVHGILMGVPVPFPIPEPDGCKSGINCPIQ KDKTYSYLNKLPVKSEYPSIKLVVEWQLQDDKNQSLFCWEIPVQIVSHL" BASE COUNT 209 a 195 c 181 g 223 t ORIGIN 1 cggattccgg atgcgtttcc tggcagctac attcctgctc ctggcgctca gcaccgctgc 61 ccaggccgaa ccggtgcagt tcaaggactg cggttctgtg gatggagtta taaaggaagt 121 gaatgtgagc ccatgcccca cccaaccctg ccagctgagc aaaggacagt cttacagcgt 181 caatgtcacc ttcaccagca atattcagtc taaaagcagc aaggccgtgg tgcatggcat 241 cctgatgggc gtcccagttc cctttcccat tcctgagcct gatggttgta agagtggaat 301 taactgccct atccaaaaag acaagaccta tagctacctg aataaactac cagtgaaaag 361 cgaatatccc tctataaaac tggtggtgga gtggcaactt caggatgaca aaaaccaaag 421 tctcttctgc tgggaaatcc cagtacagat cgtttctcat ctctaagtgc ctcattgagt 481 tcggtgcatc tggccaatga gtctgctgag actcttgaca gcacctccag ctctgctgct 541 tcaacaacag tgacttgctc tccaatggta tccagtgatt cgttgaagag gaggtgctct 601 gtagcagaaa ctgagctccg ggtggctggt tctcagtggt tgtctcatgt ctctttttct 661 gtcttaggtg gtttcattaa atgcagcact tggttagcag atgtttaatt ttttttttta 721 acaacattaa cttgtggcct ctttctacac ctggaaattt actcttgaat aaataaaaac 781 tcgtttgtct tgtaaaaaaa aaaaaaaa // LOCUS HSTTFGP 1445 bp RNA PRI 19-SEP-1995 DEFINITION H.sapiens TTF mRNA for small G protein. ACCESSION Z35227 NID g609016 KEYWORDS small G protein; TTF gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1445) AUTHORS Dallery,E., Galiegue-Zouitina,S., Collyn-d'Hooghe,M., Quief,S., Denis,C., Hildebrand,M.P., Lantoine,D., Deweindt,C., Tilly,H., Bastard,C. et,al. TITLE TTF, a gene encoding a novel small G protein, fuses to the lymphoma-associated LAZ3 gene by t(3;4) chromosomal translocation JOURNAL Oncogene 10 (11), 2171-2178 (1995) MEDLINE 95303479 REMARK (sites) REFERENCE 2 (bases 1 to 1445) AUTHORS Kerckaert,J. TITLE Direct Submission JOURNAL Submitted (13-JUL-1994) Kerckaert J., INSERM U. 124, Molecular Onco-Hematology, Place de Verdun, LILLE CEDEX, FRANCE, 59045 COMMENT . FEATURES Location/Qualifiers source 1..1445 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="PL2" /cell_type="B-cell N.H.L." /cell_line="K422" /clone_lib="cDNA library from K422 cell line." /germline gene 580..1155 /gene="TTF" CDS 580..1155 /gene="TTF" /note="- Prenyl group binding site (CAAX box) A.A. residues: 188 to 191 - ATP/GTP-binding site motif A (P-loop) A.A. 11 to 18 with a Ser at position 13 instead of the Gly at position 12 in RAS" /codon_start=1 /product="small G protein" /db_xref="PID:g609017" /translation="MLSSIKCVLVGDSAVGKTSLLVRFTSETFPEAYKPTVYENTGVD VFMDGIQISLGLWDTAGNDAFRSIRPLSYQQADVVLMCYSVANHNSFLNLKNKWIGEI RSNLPCTPVLVVATQTDQREMGPHRASCVNAMEGKKLAQDVRAKGYLECSALSNRGVQ QVFECAVRTAVNQARRRNRRRLFSINECKIF" BASE COUNT 362 a 365 c 382 g 336 t ORIGIN 1 ctgccccaca cacactaacc caaccatctt ggggtggact ccctgccagc ccaactgttg 61 tattttcagt tcttccagtg tgaatcagtt aatattctcg ggaacgaggg agaggttgat 121 cctatgagga aatcaaccac agtgaaaagg cttgggccgc ttttgttttc gcctcctttt 181 gttgaacaaa tttgatttcc ggagtcagtc attttactgt caagacattt cttcggcatt 241 ctgcaacagt ttccaacatg gctagatcca tcagaaactg aagccgtgga gaacgctctc 301 ggggcctttg ccacttcttg gagtagaagc cgacagagag ctgtttggaa acttctcctt 361 cacacaccag ttgaagacta ggctttggag gttttcaaag cagacggtgc ttggatgggc 421 agggagaagt aacattctgc aaatcgccgt cagaggtcct gaggacacag acctacctgg 481 cttgcattcc ccttgctgaa tggcgtgtgc tgcagctgcc cactgagggc tcttttccct 541 gggattctgg acttcagagt aggacagcag gctgggaaga tgctgagttc catcaagtgc 601 gtgttggtgg gcgactctgc tgtggggaaa acctctctgt tggtgcgctt cacctccgag 661 accttcccgg aggcctacaa gcccacagtg tacgagaaca caggggtgga cgtcttcatg 721 gatggcatcc agatcagcct gggcctctgg gacacagccg gcaatgacgc cttcagaagc 781 atccggcccc tgtcctacca gcaggcagac gtggtgctga tgtgctactc tgtggccaac 841 cataactcat tcctgaactt gaagaacaag tggattggtg aaattaggag caacttgccc 901 tgtacccctg tgctggtggt ggccacccag actgaccagc gggagatggg gccccacagg 961 gcctcctgcg tcaatgccat ggaagggaag aaactggccc aggatgtcag agccaagggc 1021 tacctggagt gctcagccct tagcaatcgg ggagtacagc aggtgtttga gtgcgccgtc 1081 cgaactgccg tcaaccaggc caggagacga aacagaagga ggctcttctc catcaatgag 1141 tgcaagatct tctaaacccc aagagacttc acacaacact tatgtatgca ccccaaagac 1201 taatggggag agggagggcc gggaagccag gaaagcttgg tgttttctct gggtacaccc 1261 caagcagcgt ctccgttttg gatacagtta ttgatgaggc ttggccactg gatgttttca 1321 ctaactacac tctacaagtg aactccttgc ccaggccagt tagaaaatcc cttggggaac 1381 tgtgatgaat attccatctt tgattaaaaa agtgaaatag tctccataaa aaaaaaaaaa 1441 aaaaa // LOCUS HSTUMP 830 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for translationally controlled tumor protein. ACCESSION X16064 NID g37495 KEYWORDS translational control; tumor protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 830) AUTHORS Rohde,K. TITLE Direct Submission JOURNAL Submitted (11-AUG-1989) Rohde K., Akademie der Wissenschaften der DDR, Zentralinstitut f Molekularbiologie, Robert Roessle str 10, Berlin Buch DDR 1115 REFERENCE 2 (bases 1 to 830) AUTHORS Gross,B., Gaestel,M., Bohm,H. and Bielka,H. TITLE cDNA sequence coding for a translationally controlled human tumor protein JOURNAL Nucleic Acids Res. 17 (20), 8367 (1989) MEDLINE 90045959 COMMENT Data kindly reviewed (27-NOV-1989) by Gross B. FEATURES Location/Qualifiers source 1..830 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 95..613 /note="tumor protein (AA 1 - 172)" /codon_start=1 /db_xref="PID:g37496" /db_xref="SWISS-PROT:P13693" /translation="MIIYRDLISHDEMFSDIYKIREIADGLCLEVEGKMVSRTEGNID DSLIGGNASAEGPEGEGTESTVITGVDIVMNHHLQETSFTKEAYKKYIKDYMKSIKGK LEEQRPERVKPFMTGAAEQIKHILANFKNYQFFIGENMNPDGMVALLDYREDGVTPYM IFFKDGLEMEKC" misc_feature 625..632 /note="translation inhibition element" misc_feature 728..736 /note="translation inhibition element" misc_feature 744..751 /note="translation inhibition element" polyA_site 830 /note="polyA site" BASE COUNT 245 a 178 c 194 g 213 t ORIGIN 1 cccccccgag cgccgctccg gctgcaccgc gctcgctccg agtttcaggc tcgtgctaag 61 ctagcgccgt cgtcgtctcc cttcagtcgc catcatgatt atctaccggg acctcatcag 121 ccacgatgag atgttctccg acatctacaa gatccgggag atcgcggacg ggttgtgcct 181 ggaggtggag gggaagatgg tcagtaggac agaaggtaac attgatgact cgctcattgg 241 tggaaatgcc tccgctgaag gccccgaggg cgaaggtacc gaaagcacag taatcactgg 301 tgtcgatatt gtcatgaacc atcacctgca ggaaacaagt ttcacaaaag aagcctacaa 361 gaagtacatc aaagattaca tgaaatcaat caaagggaaa cttgaagaac agagaccaga 421 aagagtaaaa ccttttatga caggggctgc agaacaaatc aagcacatcc ttgctaattt 481 caaaaactac cagttcttta ttggtgaaaa catgaatcca gatggcatgg ttgctctatt 541 ggactaccgt gaggatggtg tgaccccata tatgattttc tttaaggatg gtttagaaat 601 ggaaaaatgt taacaaatgt ggcaattatt ttggatctat cacctgtcat cataactggc 661 ttctgcttgt catccacaca acaccaggac ttaagacaaa tgggactgat gtcatcttga 721 gctcttcatt tattttgact gtgatttatt tggagtggag gcattgtttt taagaaaaac 781 atgtcatgta ggttgtctaa aaataaaatg catttaaact catttgagag // LOCUS HSTUNPB 2830 bp RNA PRI 03-DEC-1996 DEFINITION H.sapiens tunp mRNA for transformation upregulated nuclear protein. ACCESSION X72727 NID g460788 KEYWORDS transformation upregulated nuclear protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2830) AUTHORS Dejgaard,K., Leffers,H., Rasmussen,H.H., Madsen,P., Kruse,T.A., Gesser,B., Nielsen,H. and Celis,J.E. TITLE Identification, molecular cloning, expression and chromosome mapping of a family of transformation upregulated hnRNP-K proteins derived by alternative splicing JOURNAL J. Mol. Biol. 236 (1), 33-48 (1994) MEDLINE 94149726 REFERENCE 2 (bases 1 to 2830) AUTHORS Leffers,H. TITLE Direct Submission JOURNAL Submitted (18-MAR-1993) H. Leffers, Institute of Medical Biochemistry and, Danish Centre for Human Genome Research, Ole Worms Alle 170, Aarhus University, 8000 Aarhus, DENMARK COMMENT Another alternative splice event replaces exon C with another exon (see exon D, X72726). FEATURES Location/Qualifiers source 1..2830 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="AMA cells" /clone_lib="lambda ZAP II AMA cDNA" /clone="tunps A,B,C,D" /chromosome="9" gene 210..2814 /gene="tunp" CDS 210..1604 /gene="tunp" /codon_start=1 /evidence=experimental /product="transformation upregulated nuclear protein" /db_xref="PID:g460789" /translation="METEQPEETFPNTETNGEFGKRPAEDMEEEQDFKRSRNTDEMVE LRILLQSKNAGAVIGKGGKNIKALRTDYNASVSVPDSSGPERILSISADIETIGEILK KIIPTLEEGLQLPSPTATSQLPLESDAVECLNYQHYKGSDFDCELRLLIHQSLAGGII GVKGAKIKELRENTQTTIKLFQECCPHSTDRVVLIGGKPDRVVECIKIILDLISESPI KGRAQPYDPNFYDETYDYGGFTMMFDDRRGRPVGFPMRGRGGFDRMPPGRGGRPMPPS RRDYDDMSPRRGPPPPPPGRGGRGGSRARNLPLPPPPPPRGGDLMAYDRRGRPGDRYD GMVGFSADETWDSAIDTWSPSEWQMAYEPQGGSGYDYSYAGGRGSYGDLGGPIITTQV TIPKDLAGSIIGKGGQRIKQIRHESGASIKIDEPLEGSEDRIITITGTQDQIQNAQYL LQNSVKQYADVEGF" misc_feature 1317..1331 /gene="tunp" /note="alternatively spliced exon A" misc_feature 1570..1630 /gene="tunp" /note="alternatively spliced exon B" misc_feature 1902..2830 /note="alternatively spliced exon C" polyA_signal 2082..2086 /gene="tunp" polyA_signal 2805..2814 /gene="tunp" BASE COUNT 805 a 530 c 654 g 841 t ORIGIN 1 agggcgctcc aggcgacacg attgcagacg ccattatcct ctgtttctct gctgcaccga 61 cctcgacgtc ttgcctgtgt cccacttgtt cgcggcctat aggctactgc agcactgggg 121 tgtcagttgt tggtccgacc cagaacgctt cagttgtgct ctgcaaggat atataataac 181 tgattggtgt gcccgtttaa taaaagaata tggaaactga acagccagaa gaaaccttcc 241 ctaacactga aaccaatggt gaatttggta aacgccctgc agaagatatg gaagaggaac 301 aagactttaa aagatctaga aacactgatg agatggttga attacgcatt ctgcttcaga 361 gcaagaatgc tggggcagtg attggaaaag gaggcaagaa tattaaggct ctccgtacag 421 actacaatgc cagtgtttca gtcccagaca gcagtggccc cgagcgcata ttgagtatca 481 gtgctgatat tgaaacaatt ggagaaattc tgaagaaaat catccctacc ttggaagagg 541 gcctgcagtt gccatcaccc actgcaacca gccagctccc gctcgaatct gatgctgtgg 601 aatgcttaaa ttaccaacac tataaaggaa gtgactttga ctgcgagttg aggctgttga 661 ttcatcagag tctagcagga ggaattattg gggtcaaagg tgctaaaatt aaagaacttc 721 gagagaacac tcaaaccacc atcaagcttt tccaggaatg ctgtcctcat tccactgaca 781 gagttgttct tattggagga aaacccgata gggttgtaga gtgcataaag atcatccttg 841 atcttatatc tgagtctccc atcaaaggac gtgcacagcc ttatgatccc aatttttacg 901 atgaaaccta tgattatggt ggttttacaa tgatgtttga tgaccgtcgc ggacgcccag 961 tgggatttcc catgcgggga agaggtggtt ttgacagaat gcctcctggt cggggtgggc 1021 gtcccatgcc tccatctaga agagattatg atgatatgag ccctcgtcga ggaccacctc 1081 cccctcctcc cggacgaggc ggccggggtg gtagcagagc tcggaatctt cctcttcctc 1141 caccaccacc acctagaggg ggagacctca tggcctatga cagaagaggg agacctggag 1201 accgttacga cggcatggtt ggtttcagtg ctgatgaaac ttgggactct gcaatagata 1261 catggagccc atcagaatgg cagatggctt atgaaccaca gggtggctcc ggatatgatt 1321 attcctatgc agggggtcgt ggctcatatg gtgatcttgg tggacctatt attactacac 1381 aagtaactat tcccaaagat ttggctggat ctattattgg caaaggtggt cagcggatta 1441 aacaaatccg tcatgagtcg ggagcttcga tcaaaattga tgagccttta gaaggatccg 1501 aagatcggat cattaccatt acaggaacac aggaccagat acagaatgca cagtatttgc 1561 tgcagaacag tgtgaagcag tatgcagatg ttgaaggatt ctaatgcaag atattttttc 1621 ttttttatag tgtgaagcag tattctggaa agtttttcta agactagtga agaactgaag 1681 gagtcctgca tctttttttt tttatctgct tctgtttaaa aagccaacat tcctctgctt 1741 cataggtgtt ctgcatttga ggtgtagtga aatctttgct gttcaccaga tgtaatgttt 1801 tagttcttac aaacagggtt ggggggggga agggcgtgca aaaactaaca ttgaaatttt 1861 gaaacagcag cagagtgagt ggattttatt tttcgttatt gtggtggttt aaaaaattcc 1921 ccccatgtaa ttattgtgaa caccttgctt tgtggtcact gtaacatttg gggggtgggc 1981 cagggaggaa aagtaacaat agtccacatg tccctggcat ctgttcagag cagtgtgcag 2041 aatgtaatgc tcttttgtaa gaaacgtttt atgattttta aaataaattt agtgaaccta 2101 tttttggtgg tcattttttt tttaagacag tcattttaaa atggtggctg aatttcccaa 2161 cccaccccca aactaaacac taagtttaat tttcagctcc tctgttggac atataagtgc 2221 atctcttgtt ggacataggc aaaataactt ggcaaactta gttctggtga tttcttgatg 2281 gtttggaagt ctattgctgg gaagaaattc catcatacat attcatgctt ataataagct 2341 ggggattttt tgtttgtttt tgcaaatgct tgcccctact tttcaacaat tttctatgtt 2401 agttgtgaag aactaaggtg gggagcagta ctacaagttg agtaatggta tgagtatata 2461 ccagaattct gattggcagc aagtttatta atcagaataa cacttggtta tggaagtgac 2521 taatgctgaa aaaattgatt atttttatta gataatttct cacctataga cttaaactgt 2581 caatttgctc tagtgtctta ttagttaaac tttgtaaaat atatatatac ttgtttttcc 2641 attgtatgca aattgaaaga aaaagatgta ccatttctct gttgtatgtt ggattatgta 2701 ggaatgtttg tgtacaattc aaaaaaaaaa aagatgaaaa aagttcctgt ggatgttttg 2761 tgtagtatct tggcatttgt attgatagtt aaaattcact tccaaataaa taaaacaccc 2821 atgatgctag // LOCUS HSTYK2 4176 bp RNA PRI 12-SEP-1993 DEFINITION Human tyk2 mRNA for non-receptor protein tyrosine kinase. ACCESSION X54637 NID g37503 KEYWORDS protein tyrosine kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4176) AUTHORS Krolewski,J.J. TITLE Direct Submission JOURNAL Submitted (29-AUG-1990) Krolewski J.J., Columbia University, College of Physicians & Surgeons, Dept. of Pathology, 630 West 168th St., New York, NY 10032, USA REFERENCE 2 (bases 1 to 4176) AUTHORS Firmbach-Kraft,I., Byers,M., Shows,T., Dalla-Favera,R. and Krolewski,J.J. TITLE tyk2, prototype of a novel class of non-receptor tyrosine kinase genes JOURNAL Oncogene 5 (9), 1329-1336 (1990) MEDLINE 91016433 COMMENT Data kindly reviewed (14-DEC-1990) by Krolewski J. FEATURES Location/Qualifiers source 1..4176 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="T-cell lymphoma PEER" CDS 307..3870 /note="protein tyrosine kinase" /codon_start=1 /db_xref="PID:g37504" /db_xref="SWISS-PROT:P29597" /translation="MPLRHWGMARGSKPVGDGAQPMAAMGGLKVLLHWAGPGGGEPWV TFSESSLTAEEVCIHIAHKVGITPPCFNLFALFDAQAQVWLPPNHILEIPRDASLMLY FRIRFYFRNWHGMNPREPAVYRCGPPGTEASSDQTAQGMQLLDPASFEYLFEQGKHEF VNDVASLWELSTEEEIHHFKNESLGMAFLHLCHLALRHGIPLEEVAKKTSFKDCIPRS FRRHIRQHSALTRLRLRNVFRRFLRDFQPGRLSQQMVMVKYLATLERLAPRFGTERVP VCHLRLLAQAEGEPCYIRDSGVAPTDPGPESAAGPPTHEVLVTGTGGIQWWPVEEEVN KEEGSSGSSGRNPQASLFGKKAKAHKAFGQPADRPREPLWAYFCDFRDITHVVLKEHC VSIHRQDNKCLELSLPSRAAALSFVSLVDGYFRLTADSSHYLCHEVAPPRLVMSIRDG IHGPLLEPFVQAKLRPEDGLYLIHWSTSHPYRLILTVAQRSQAPDGMQSLRLRKFPIE QQDGAFVLEGWGRSFPSVRELGAALQGCLLRAGDDCFSLRRCCLPQPGETSNLIIMRG ARASPRTLNLSQLSFHRVDQKEITQLSHLGQGTRTNVYEGRLRVEGSGDPEEGKMDDE DPLVPGRDRGQELRVVLKVLDPSHHDIALAFYETASLMSQVSHTHLAFVHGVCVRGPE NSMVTEYVEHGPLDVWLRRERGHVPMAWKMVVAQQLASALSYLENKNLVHGNVCGRNI LLARLGLAEGTSPFIKLSDPGVGLGALSREERVERIPWLAPECLPGGANSLSTAMDKW GFGATLLEICFDGEAPLQSRSPSEKEHFYQRQHRLPEPSCPQLATLTSQCLTYEPTQR PSFRTILRDLTRVQPHNLADVLTVNRDSPAVGPTTFHKRYLKKIRDLGEGHFGKVSLY CYDPTNDGTGEMVAVKALKADCGPQHRSGWKQEIDILRTLYHEHIIKYKGCCEDQGEK SLQLVMEYVPLGSLRDYLPRHSIGLAQLLLFAQQICEGMAYLHAHDYIHRDLAARNVL LDNDRLVKIGDFGLAKAVPEGHEYYRVREDGDSPVFWYAPECLKEYKFYYASDVWSFG VTLYELLTHCDSSQSPPTKFLELIGIAQGQMTVLRLTELLERGERLPRPDKCPCEVYH LMKNCWETEASFRPTFENLIPILKTVHEKYQGQAPSVFSVC" BASE COUNT 831 a 1291 c 1269 g 785 t ORIGIN 1 gacgcgggcg cggaaggagc gcggccggag gtcctcagga agaagccgcg gggactggct 61 gcgcttgaca ggctgcactt ggatgggagc acctggtgcc tcgggactgc tccgatgccc 121 gggtctgtgc tgaatgtgta atatgcggaa ctatattgaa acattacaac catcttttga 181 tggcaacacc ctgaggacct cccttttcca gatggggaaa ctgaggccca gaattgctaa 241 gtggcttgct tgagttgaca cagggagctc caggactcac cctcagctga gccacctgcc 301 gggagcatgc ctctgcgcca ctgggggatg gccaggggca gtaagcccgt tggggatgga 361 gcccagccca tggctgccat gggaggcctg aaggtgcttc tgcactgggc tggtccaggc 421 ggcggggagc cctgggtcac tttcagtgag tcatcgctga cagctgagga agtctgcatc 481 cacattgcac ataaagttgg tatcactcct ccttgcttca atctctttgc cctcttcgat 541 gctcaggccc aagtctggtt gcccccaaac cacatcctag agatccccag agatgcaagc 601 ctgatgctat atttccgcat aaggttttat ttccggaact ggcatggcat gaatcctcgg 661 gaaccggctg tgtaccgttg tgggccccca ggaaccgagg catcctcaga tcagacagca 721 caggggatgc aactcctgga cccagcctca tttgagtacc tctttgagca gggcaagcat 781 gagtttgtga atgacgtggc atcactgtgg gagctgtcga ccgaggagga gatccaccac 841 tttaagaatg agagcctggg catggccttt ctgcacctct gtcacctcgc tctccgccat 901 ggcatccccc tggaggaggt ggccaagaag accagcttca aggactgcat cccgcgctcc 961 ttccgccggc atatccggca gcacagcgcc ctgacccggc tgcgccttcg gaacgtcttc 1021 cgcaggttcc tgcgggactt ccagccgggc cgactctccc agcagatggt catggtcaaa 1081 tacctagcca cactcgagcg gctggcaccc cgcttcggca cagagcgtgt gcccgtgtgc 1141 cacctgaggc tgctggccca ggccgagggg gagccctgct acatccggga cagtggggtg 1201 gcccctacag accctggccc tgagtctgct gctgggcccc caacccacga ggtgctggtg 1261 acaggcactg gtggcatcca gtggtggcca gtagaggagg aggtgaacaa ggaggagggt 1321 tctagtggca gcagtggcag gaacccccaa gccagcctgt ttgggaagaa ggccaaggct 1381 cacaaggcat tcggccagcc ggcagacagg ccgcgggagc cactgtgggc ctacttctgt 1441 gacttccggg acatcaccca cgtggtgctg aaagagcact gtgtcagcat ccaccggcag 1501 gacaacaagt gcctggagct gagcttgcct tcccgggctg cggcgctgtc cttcgtgtcg 1561 ctggtggacg gctatttccg cctgacggcc gactccagcc actacctgtg ccacgaggtg 1621 gctcccccac ggctggtgat gagcatccgg gatgggatcc acggacccct gctggagcca 1681 tttgtgcagg ccaagctgcg gcccgaggac ggcctgtacc tcattcactg gagcaccagc 1741 cacccctacc gcctgatcct cacagtggcc cagcgtagcc aggcaccaga cggcatgcag 1801 agcttgcggc tccgaaagtt ccccattgag cagcaggacg gggccttcgt gctggagggc 1861 tggggccggt ccttccccag cgttcgggaa cttggggctg ccttgcaggg ctgcttgctg 1921 agggccgggg atgactgctt ctctctgcgt cgctgttgcc tgccccaacc aggagaaacc 1981 tccaatctca tcatcatgcg gggggctcgg gccagcccca ggacactcaa cctcagccag 2041 ctcagcttcc accgggttga ccagaaggag atcacccagc tgtcccactt gggccagggc 2101 acaaggacca acgtgtatga gggccgcctg cgagtggagg gcagcgggga ccctgaggag 2161 ggcaagatgg atgacgagga ccccctcgtg cctggcaggg accgtgggca ggagctacga 2221 gtggtgctca aagtgctgga ccctagtcac catgacatcg ccctggcctt ctacgagaca 2281 gccagcctca tgagccaggt ctcccacacg cacctggcct tcgtgcatgg cgtctgtgtg 2341 cgcggccctg aaaatagcat ggtgacagag tacgtggagc acggacccct ggatgtgtgg 2401 ctgcggaggg agcggggcca tgtgcccatg gcttggaaga tggtggtggc ccagcagctg 2461 gccagcgccc tcagctacct ggagaacaag aacctggttc atggtaatgt gtgtggccgg 2521 aacatcctgc tggcccggct ggggttggca gagggcacca gccccttcat caagctgagt 2581 gatcctggcg tgggcctggg cgccctctcc agggaggagc gggtggagag gatcccctgg 2641 ctggcccccg aatgcctacc aggtggggcc aacagcctaa gcaccgccat ggacaagtgg 2701 gggtttggcg ccaccctcct ggagatctgc tttgacggag aggcccctct gcagagccgc 2761 agtccctccg agaaggagca tttctaccag aggcagcacc ggctgcccga gccctcctgc 2821 ccacagctgg ccacactcac cagccagtgt ctgacctatg agccaaccca gaggccatca 2881 ttccgcacca tcctgcgtga cctcacccgc gtgcagcccc acaatcttgc tgacgtcttg 2941 actgtgaacc gggactcacc ggccgtcgga cctactactt tccacaagcg ctatttgaaa 3001 aagatccgag atctgggcga gggtcacttc ggcaaggtca gcttgtactg ctacgatccg 3061 accaacgacg gcactggcga gatggtggcg gtgaaagccc tcaaggcaga ctgcggcccc 3121 cagcaccgct cgggctggaa gcaggagatt gacattctgc gcacgctcta ccacgagcac 3181 atcatcaagt acaagggctg ctgcgaggac caaggcgaga agtcgctgca gctggtcatg 3241 gagtacgtgc ccctgggcag cctccgagac tacctgcccc ggcacagcat cgggctggcc 3301 cagctgctgc tcttcgccca gcagatctgc gagggcatgg cctatctgca cgcgcacgac 3361 tacatccacc gagacctagc cgcgcgcaac gtgctgctgg acaacgacag gctggtcaag 3421 atcggggact ttggcctagc caaggccgtg cccgaaggcc acgagtacta ccgcgtgcgc 3481 gaggatgggg acagccccgt gttctggtat gccccagagt gcctgaagga gtataagttc 3541 tactatgcgt cagatgtctg gtccttcggg gtgaccctgt atgagctgct gacgcactgt 3601 gactccagcc agagcccccc cacgaaattc cttgagctca taggcattgc tcagggtcag 3661 atgacagttc tgagactcac tgagttgctg gaacgagggg agaggctgcc acggcccgac 3721 aaatgtccct gtgaggtcta tcatctcatg aagaactgct gggagacaga ggcgtccttt 3781 cgcccaacct tcgagaacct catacccatt ctgaagacag tccatgagaa gtaccaaggc 3841 caggcccctt cagtgttcag cgtgtgctga ggcacaatgg cagccctgcc tgggaggact 3901 ggaccaggca gtggctgcag agggagcctc ctgctccctg ctccaggatg aaaccaagag 3961 ggggatgtca gcctcaccca caccgtgtgc cttactcctg tctagagacc ccacctctgt 4021 gaacttattt ttctttcttg gccgtgagcc taaccatgat cttgagggac ccaacatttg 4081 taggggcact aatccagccc ttaaatcccc cagcttccaa acttgaggcc caccatctcc 4141 accatctggt aataaactca tgttttctct gctggg // LOCUS HSTYL 4307 bp RNA PRI 02-AUG-1996 DEFINITION H.sapiens mRNA from TYL gene. ACCESSION X99688 NID g1480102 KEYWORDS TYL gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4307) AUTHORS Perletti,L., Ronchetti,D., Maiolo,A.T. and Neri,A. JOURNAL Unpublished REFERENCE 2 (bases 1 to 4307) AUTHORS Perletti,L. TITLE Direct Submission JOURNAL Submitted (31-JUL-1996) L. Perletti, ATTN: A. Neri, Ospedale Maggiore IRCCS, Servizio Di Ematologia, Via Francesco Sforza 35, 20122 Milano, ITALY FEATURES Location/Qualifiers source 1..4307 /organism="Homo sapiens" /db_xref="taxon:9606" /map="10q24" /dev_stage="adult" /tissue_type="brain" gene 1807..3744 /gene="TYL" CDS 1807..3744 /gene="TYL" /codon_start=1 /db_xref="PID:e256950" /db_xref="PID:g1480103" /translation="MPLKSPVPFLPGTSPSADGPDSFSCVFEAILESHRAKGTSYTSL ASLEALASPGPTQSPFFTFELPPQPPAPRPDPPAPAPLTPLEPDSGTSSAADGPWTQR GEEEEAEARAKLAPGREPPSPCHSEDSLGLGAAPLGSEPPLSQLVSDSDSELDSTDRL ALGSTDTLSNGQKADLEAAQRLAKRLYRLDGFRKADVARHLGKNNDFSKLVAGEYLKF FVFTGMTLDQALRVFLKELALMGETQERERVLAHFSQRYFQCNPEALSSEDGAHTLTC ALMLLNTDLHGHNIGKRMTCGDFIGNLEGLNDGGDFPRELLKALYSSIKNEKLQWAID EEELRRSLSELADPNPKVIKRISGGSGSGSSPFLDLTPEPGAAVYKHGALVRKVHADP DCRKTPRGKRGWKSFHGILKGMILYLQKEEYKPGKALSETELKNAISIHHALATRASD YSKRPHVFYLRTADWRVFLFQAPSLEQMQSWITRINVVAAMFSAPPFPAAVSSQKKFS RPLLPSAATRLSQEEQVRTHEAKLKAMASELREHRAAQLGKKGRGKEAEEQRQKEAYL EFEKSRYSTYAALLRVKLKAGSEELDAVEAALAQAGSTEDGLPPSHSSPSLQPKPSSQ PRAQRHSSEPRPGAGSGRRKP" BASE COUNT 887 a 1334 c 1217 g 869 t ORIGIN 1 ggcgtagggc ttctctagca accctctttc ttcagatttt gcatgtctcc aagaaaaaac 61 aacttgaaat ttcttgaaca cacagtttac ttgttcatat ctttaactat gctgttctct 121 tttatctaca atgccatcta tctcttccac tgaaaacttc catttattcc tcaaaatcct 181 gctcaggtgc tagcatatct gtgaagcgtt ctctgacctg ttcctgaagt caaacattcc 241 ctcctcggta atatttccat actttaaact tactattatt gatggactcc aaaaaaattc 301 tctttattgt tgtgtctagc cctatgatcc tgatttcaag atcctgaaag acaagtttct 361 atcccttaca aataccacag gacctggcac atagaagaca catttattga ctcagtgttt 421 tcttcatgtt tacattcttc aagtacacag ctgtctatgc agtgaaaaat gtgctctaca 481 agtagaaaaa gaaatatgtg aagtgcacaa tagacaatcc tcaattccca taatgcagat 541 gtgcatgagc aattattagc agttcaaagt tgtttcatta cggtaagtac tattctaaca 601 aactaactgc cccaaccaat tttgtctagg aagcatatat ggttttaaag tgttgattta 661 cgccagctgg gcatggtgct tacgcctgca atcccagcac tttgggaggc caggcgggcg 721 gatcacaagg tcaagagatc gagaccatcc tggccaacgt agtgaaaccc catctctatt 781 aaaaatacaa aaactagctg cgagtggtgg cgcacgcctg tagtcccagc tactcgaggc 841 cgggaactgg gacgtgtgac agcaccctgt acacctctgc gtgggccccc ctcaccccgt 901 gttgctccct caccctgggc accctcttca cccactgggc agcccccacc cggggcccag 961 agctctgtgg tcatcttccg ctttgtggag aaggccagtg tgaggccact gaatgggcta 1021 cctgctccag ggggcttgag tcggagctgg gatctgggtg gggtttctcc tcccaggccc 1081 accccagccc ttgggcctgg ctccaaccgg aagttacggc tggaagcatc cacatcagac 1141 ccactccccg ccagaggagg ctcggcccta cctggcagcc ggaaccttgt acatgggccg 1201 ccagccccac cccaggttgg agcagatggc ctttactcct ctctccccaa tgggctgggg 1261 gacccccctg agcgcctggc cacactcttc ggaggacctg ctgacactgg attcctgaac 1321 cagggggata cctggtcctc cccccgggaa gtctcctctc atgcccagag aatcgctaga 1381 gccaaatggg aattcttcta tggctccttg gaccccccca gctcaggtgc taagccccca 1441 gagcaggccc ccccatctcc acctggggtg ggctcaaggc agggctctgg ggtggctgtg 1501 gggcgagcag ccaagtactc cgagacagac ctggacacgg tgcccctgag gtgctaccgc 1561 gagactgaca tcgatgaggt gctggctgag cgggaggagg ccgactcggc catcgaaagt 1621 cagcccagct ctgagggccc accaggcact gcctacccac ctgccccacg gcccggccca 1681 ctccctggcc ctcatcccag cctcggcagt ggcaatgagg atgaggacga cgatgaggca 1741 ggtggggaag aagatgtgga cgacgaggtg tttgaggcct ctgaaggggc ccggccaggg 1801 agccggatgc ctctcaagtc acctgtgccc tttctacctg ggacgagccc ctcggctgat 1861 gggcctgact ctttcagttg tgtgttcgaa gccatcctgg agtcacaccg ggccaaaggc 1921 acctcctata ccagcctcgc ctcgctggag gccttggcct cacctggccc aacccagagc 1981 cccttcttca cctttgagct gcctccccaa ccccctgcac cccggcccga cccaccagct 2041 cccgccccac ttacccctct tgaaccggat tctggtacca gctctgctgc tgatggtcct 2101 tggacacaga gaggggagga ggaggaggca gaggccagag ccaagctggc cccagggagg 2161 gagcccccta gtccctgcca ctcagaggac agccttgggc tgggggcagc accccttggc 2221 agcgaaccac ccctgagcca gctggtgtcc gactcagact cagagctgga cagcacagac 2281 cggctggccc tgggaagcac agacaccttg tccaatgggc agaaagcgga cctggaggct 2341 gcgcagcgcc tggccaagag gctgtaccga ctagatggct tcaggaaggc cgatgtggcc 2401 cggcacctgg gcaagaacaa tgacttcagc aaactggtgg ctggggagta cctcaagttc 2461 tttgtcttca cgggcatgac tctggaccaa gctctcaggg tgtttctgaa ggagctggcc 2521 ttaatgggtg agacccagga acgagagcgc gtgctggccc acttctccca gcgatacttc 2581 cagtgcaatc ctgaagccct gtcctcagag gacggcgccc acacgctgac ctgtgcgctc 2641 atgctgctca acacggatct ccacggccat aacatcggga agcgcatgac ctgcggggac 2701 ttcatcggga acctggaggg cctcaatgat ggcggcgact tccctaggga gctgctcaag 2761 gccttgtaca gctccatcaa gaatgagaag ctgcagtggg ccatagacga ggaggagctg 2821 agacgctctc tgtctgagtt ggccgacccc aaccccaagg tcatcaagcg gatcagcggg 2881 ggcagtggca gtggctccag ccctttcctg gacctgactc ccgagcctgg ggctgccgtc 2941 tacaagcacg gggccctggt gcgaaaggtg cacgcagacc ctgactgcag gaagacacct 3001 cggggcaagc ggggctggaa gagcttccac gggatcctca agggcatgat cctctacctg 3061 cagaaggagg agtacaagcc tgggaaggcc ctttcagaga cggagctcaa gaatgccatc 3121 agcatccacc atgccctggc cactcgtgcc agtgactaca gcaagaggcc ccacgtcttc 3181 tacctgcgca cagctgactg gcgggtcttc ctcttccagg ccccgagcct ggagcagatg 3241 cagtcctgga tcactcgcat caatgtagta gccgctatgt tctctgcgcc ccccttccca 3301 gctgctgtta gctcccaaaa gaagttcagc cgccctctcc tgcccagcgc tgccacccgc 3361 ctctcccagg aggagcaggt gcggacccac gaggccaagc tgaaggccat ggcaagtgag 3421 ctgcgggagc accgggccgc ccagctgggc aagaagggcc ggggcaagga ggctgaagag 3481 cagcggcaga aggaggccta cctggagttt gagaaatccc gctacagcac ctatgcagcg 3541 ctgcttcggg tcaagctgaa ggcaggcagt gaggagctgg atgcagtgga ggcagcactg 3601 gcccaggccg ggagcacaga ggatggactc cctccttctc actccagtcc ctccctgcag 3661 cccaaaccct ccagccagcc ccgggctcag cgtcacagct cagagcctcg gccaggggca 3721 ggcagtgggc ggcggaagcc ctgagatgag gtttagggtg gggagtgcct gctgggcacc 3781 tgaaggatga catggccctg cctgagcccg gggccggcct cgggccaccc acgagggcct 3841 cccggcctag ggcccgaccg cgccggacgc ggtgtccggg gcagggcagg gctggggcct 3901 gggcctcagg agcctaggct aggggcacct ctgagagccc ctttttgtga taatgttttg 3961 cactttttcg tacagggtgg gcgggggcgg gaggggctag cgcccctgaa cttttgatgc 4021 tttttttttg tgtgggagag gcctgatgcc gtttctcgtt tctgctgaga ccacccccac 4081 cctgacacca tcagtccccc tctgtcctgg ggcctggctt ggacagagac caggatttgg 4141 ggctgagctg gtttcccctc cttcctcccc aagggcttac tctttctctt gctggaggtg 4201 gaggaagggc gtccatgcca aggccccacg gcctggcttt cccccgcttc agtgggcctt 4261 gccctttgta cagcctccag ggggattaaa actgctctgg acttgct // LOCUS HSTYRRP 2837 bp RNA PRI 23-MAR-1995 DEFINITION Human mRNA for tyrosinase-related protein. ACCESSION X51420 NID g37512 KEYWORDS melanosomal protein; tyrosinase-related protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2837) AUTHORS Shibahara,S. TITLE Direct Submission JOURNAL Submitted (17-JAN-1990) Shibahara S., Tohoku University School of Medicine, Dept. of Applied Physiology, Sendai, Miyagi 980, Japan REFERENCE 2 (bases 1 to 2837) AUTHORS Cohen,T., Muller,R.M., Tomita,Y. and Shibahara,S. TITLE Nucleotide sequence of the cDNA encoding human tyrosinase-related protein JOURNAL Nucleic Acids Res. 18 (9), 2807-2808 (1990) MEDLINE 90251459 FEATURES Location/Qualifiers source 1..2837 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="S7 human melanoma cells" precursor_RNA 1..2512 /note="primary transcript" sig_peptide 130..201 /note="signal peptide (AA -24 to -1)" CDS 130..1713 /note="pre propeptide (AA -24 to 503)" /codon_start=1 /db_xref="PID:g37513" /db_xref="SWISS-PROT:P17643" /translation="MSAPKLLSLGCIFFPLLLFQQARAQFPRQCATVEALRSGMCCPD LSPVSGPGTDRCGSSSGRGRCEAVTADSRPHSPQYPHDGRDDREVWPLRFFNRTCHCN GNFSGHNCGTCRPGWRGAACDQRVLIVRRNLLDLSKEEKNHFVRALDMAKRTTHPLFV IATRRSEEILGPDGNTPQFENISIYNYFVWTHYYSVKKTFLGVGQESFGEVDFSHEGP AFLTWHRYHLLRLEKDMQEMLQEPSFSLPYWNFATGKNVCDICTDDLMGSRSNFDSTL ISPNSVFSQWRVVCDSLEDYDTLGTLCNSTEDGPIRRNPAGNVARPMVQRLPEPQDVA QCLEVGLFDTPPFYSNSTNSFRNTVEGYSDPTGKYDPAVRSLHNLAHLFLNGTGGQTH LSPNDPIFVLLHTFTDAVFDEWLRRYNADISTFPLENAPIGHNRQYNMVPFWPPVTNT EMFVTAPDNLGYTYEIQWPSREFSVPEIIAIAVVGALLLVALIFGTASYLIRARRSMD EANQPLLTDQYQCYAEERI" mat_peptide 202..1710 /note="mature tyrosinase-related protein (AA 1-503)" BASE COUNT 823 a 591 c 552 g 871 t ORIGIN 1 aattctaaga gaagttcatc agagacatcc ttcaggattg tgagctggat tttcctctac 61 gtgcttcagt cttctctaca caaagagctg caaaccaggt ctttgttttg cactcttatt 121 tcaagcagaa tgagtgctcc taaactcctc tctctgggct gtatcttctt ccccttgcta 181 ctttttcagc aggcccgggc tcaattccca agacagtgtg ccactgttga ggctttgaga 241 agtggtatgt gttgcccaga cctgtcccct gtgtctgggc ctgggacaga ccgctgtggc 301 tcatcatcag ggaggggcag atgtgaggca gtgactgcag actcccggcc ccacagccct 361 cagtatcccc atgatggcag agatgatcgg gaggtctggc ccttgcgctt cttcaatagg 421 acatgtcact gcaacggcaa tttctcagga cacaactgtg ggacgtgccg tcctggctgg 481 agaggagctg cctgtgacca gagggttctc atagtcagga gaaatcttct ggacttaagt 541 aaagaagaaa agaaccactt tgtccgggcc ctggatatgg caaagcgcac aactcaccct 601 ttatttgtca ttgccaccag gagatcagaa gaaatactgg ggccagatgg caacacgcca 661 caatttgaga acatttccat ttataactac tttgtttgga cacactatta ctcagtcaaa 721 aagactttcc ttggggtagg acaggaaagc tttggtgaag tggatttctc tcatgaggga 781 ccagcttttc tcacatggca caggtaccac ctcctgcgtc tggagaaaga catgcaggaa 841 atgttgcaag agccttcttt ctcccttcct tactggaatt ttgcaacggg gaaaaatgtc 901 tgtgatatct gcacggatga cttgatggga tccagaagca actttgattc cactctaata 961 agcccaaact ctgtcttttc tcaatggcga gtggtctgtg actccttgga agattatgat 1021 accctgggaa cactttgtaa cagcaccgag gatgggccaa ttaggagaaa tccagctgga 1081 aatgtggcca gaccaatggt gcaacgtctt cctgaaccac aggatgtcgc tcagtgcttg 1141 gaagttggtt tatttgacac gcctcctttt tattccaact ctacaaacag tttccgaaac 1201 acagtggaag gttacagtga ccccacggga aagtatgacc ctgctgttcg aagtcttcac 1261 aatttggctc atctattcct gaatggaaca gggggacaaa cccatttgtc tccaaatgat 1321 cctatttttg tcctcctgca caccttcaca gatgcagtct ttgatgaatg gctgaggaga 1381 tacaatgctg atatatccac atttccattg gaaaatgccc ctattggaca taatagacaa 1441 tacaacatgg tgccattctg gcccccagtc accaacacag aaatgtttgt tactgctcca 1501 gacaacctgg gatacactta tgaaattcaa tggccaagtc gggagtttag tgtacctgag 1561 ataattgcca tagcagtagt tggcgctttg ttactggttg cactcatttt tgggactgct 1621 tcttatctga ttcgtgccag acgcagtatg gatgaagcta accagcctct cctcactgat 1681 cagtatcaat gctatgctga agaaagaata tgaaaaactc cagaatccta atcagtctgt 1741 ggtctaacaa atgccctact ctcttatgca ttagtatcac aaaaccacct ggttgaatat 1801 aatagattga gttattaact gtattttctt tcactttatt accttctttc taatacaagc 1861 atatgttagc attaaagttc taggcatact tttcaaagct gggaagaccc tttcagaatc 1921 ttttcaatgg gttttaattt tcagttctat ttaaaatggt gaatgacact aaactccatg 1981 atatttaagg atagtgtgaa gatctttggc atgatttaaa ggttgagtat gtgaagatat 2041 aagtgaacta ccatgctttg tttacgtgta aaggaaaata atgtttgata gtaaatgtcc 2101 acttaaaata catgaatggg catttctaaa atgttaaaac ataaacacat ttccattcat 2161 ggatatttgt caacagattt aaagaaaacc acagttatta attaaagaaa attaattatg 2221 tgtagttata aaccaatgaa attttgatta accttttcaa attaatgttc cagtttgaag 2281 accaatcaaa tatattattt agtcaacata tactatttag tctcaggttc aaggctacaa 2341 caaaaatcac catctttgtc aaactttgga gagggaaaat cttcactttc ttaagcaaca 2401 atggatattg cctgtgtttg ccactgtgtt tccctgcctc tcaattcgct gaaaaaggaa 2461 ctacctatcc ttacatttca cctactaatg tctcttctaa catcttagag gtccatggag 2521 aaggcatatg gagaacatgt tttatactgc tctataaata gtattccaat cactgtgctt 2581 aatttaaata gcattatctt atcatttatc agccttttat gtattttcca agtaaaatat 2641 taacatatta tttcattggt cttctttttt atctggttct atatgaatgc tattttttcc 2701 cttctcttct aacatgaaat atattttctc tttttgatct tgtgctatga aacaatctta 2761 ccaaagaact gtataaggtg gtcataagtg aatattttaa ttaaaattgg taaaaataaa 2821 aaaaaaaaaa taaaaat // LOCUS HSU00803 2863 bp mRNA PRI 25-MAY-1994 DEFINITION Human SRC-like tyrosine kinase (FRK) mRNA, complete cds. ACCESSION U00803 NID g392887 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2863) AUTHORS Lee,J., Wang,Z., Luoh,S.M., Wood,W.I. and Scadden,D.T. TITLE Cloning of FRK, a novel intracellular SRC-like tyrosine kinase-encoding gene JOURNAL Gene 138, 247-251 (1994) MEDLINE 94171047 REFERENCE 2 (bases 1 to 2863) AUTHORS Scadden,D.T. TITLE Direct Submission JOURNAL Submitted (17-AUG-1993) D.T. Scadden, New England Deaconess Hospital, Hematology/Oncology, 185 Pilgrim Road, Boston, MA, USA, 02215 FEATURES Location/Qualifiers source 1..2863 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="BL979" /cell_type="lymphocytehepatoma" /tissue_type="lymphoid" gene 448..1965 /gene="FRK" CDS 448..1965 /gene="FRK" /codon_start=1 /product="SRC-like tyrosine kinase" /db_xref="PID:g392888" /translation="MSNICQRLWEYLEPYLPCLSTEADKSTVIENPGALCSPQSQRHG HYFVALFDYQARTAEDLSFRAGDKLQVLDTLHEGWWFARHLEKRRDGSSQQLQGYIPS NYVAEDRSLQAEPWFFGAIGRSDAEKQLLYSENKTGSFLIRESESQKGEFSLSVLDGA VVKHYRIKRLDEGGFFLTRRRIFSTLNEFVSHYTKTSDGLCVKLGKPCLKIQVPAPFD LSYKTVDQWEIDRNSIQLLKRLGSGQFGEVWEGLWNNTTPVAVKTLKPGSMDPNDFLR EAQIMKNLRHPKLIQLYAVCTLEDPIYIITELMRHGSLQEYLQNDTGSKIHLTQQVDM AAQVASGMAYLESRNYIHRDLAARNVLVGEHNIYKVADFGLARVFKVDNEDIYESRHE IKLPVKWTAPEAIRSNKFSIKSDVWSFGILLYEIITYGKMPYSGMTGAQVIQMLAQNY RLPQPSNCPQQFYNIMLECWNAEPKERPTFETLRWKLEDYFETDSSYSDANNFIR" BASE COUNT 895 a 550 c 624 g 794 t ORIGIN 1 cctggcgaaa gcaagacgtg gaggttttac caggggttag tagcttcctc ttgctaactt 61 tttattggga caaaaggcaa gatggcacca ttctgttctc agatatttgt ctaaataaag 121 cccttttaat tttattttat ttttgttgtg ggattcttaa gcagataaga agaaaagaca 181 ccttcctagt gagcagctgc ccagctcctg ctcagttttg cctcggggta gcacctccag 241 ccacagaaag caagccggta agtctctcca ggtaggactt gctgcaaccc agctgctgga 301 ctgatctgaa acgggacttt gcatactctc cgaagtatgg tgagttggtg ctgacttcaa 361 agttgcctgg tgaaggaaga taaggtggat cgcagagact aaggggagag ggagaagccc 421 tgctcctctt ctccccacca aggcacaatg agcaacatct gtcagaggct ctgggagtac 481 ctagaaccct atctcccctg tttgtccacg gaggcagaca agtcaaccgt gattgaaaat 541 ccaggggccc tttgctctcc ccagtcacag aggcatggcc actactttgt ggctttgttt 601 gattaccagg ctcggactgc tgaggacttg agcttccgag caggtgacaa acttcaagtt 661 ctggacactt tgcatgaggg ctggtggttt gccagacact tggagaaaag acgagatggc 721 tccagtcagc aactacaagg ctatattcct tctaactacg tggctgagga cagaagccta 781 caggcagagc cgtggttctt tggagcaatc ggaagatcag atgcagagaa acaactatta 841 tattcagaaa acaagaccgg ttcctttcta atcagagaaa gtgaaagcca aaaaggagaa 901 ttctctcttt cagttttaga tggagcagtt gtaaaacact acagaattaa aagactggat 961 gaagggggat tttttctcac gcgaagaaga atcttttcaa cactgaacga atttgtgagc 1021 cactacacca agacaagtga cggcctgtgt gtcaagctgg ggaaaccatg cttaaagatc 1081 caggtcccag ctccatttga tttgtcgtat aaaaccgtgg accaatggga gatagaccgc 1141 aactccatac agcttctgaa gcgattggga tctggtcagt ttggcgaagt atgggaaggt 1201 ctgtggaaca ataccactcc agtagcagtg aaaacattaa aaccaggttc aatggatcca 1261 aatgacttcc tgagggaggc acagataatg aagaacctaa gacatccaaa gcttatccag 1321 ctttatgctg tttgcacttt agaagatcca atttatatta ttacagagtt gatgagacat 1381 ggaagtctgc aagaatatct ccaaaatgac actggatcaa aaatccatct gactcaacag 1441 gtagacatgg cggcacaggt tgcctctgga atggcctatc tggagtctcg gaactacatt 1501 cacagagatc tggctgccag aaatgtcctc gttggtgaac ataatatcta caaagtagca 1561 gattttggac ttgccagagt ttttaaggta gataatgaag acatctatga atctagacac 1621 gaaataaagc tgccggtgaa gtggactgcg cccgaagcca ttcgtagtaa taaattcagc 1681 attaagtccg atgtatggtc atttggaatc cttctttatg aaatcattac ttatggcaaa 1741 atgccttaca gtggtatgac aggtgcccag gtaatccaga tgttggctca aaactataga 1801 cttccgcaac catccaactg tccacagcaa ttttacaaca tcatgttgga gtgctggaat 1861 gcagagccta aggaacgacc tacatttgag acactgcgtt ggaaacttga agactatttt 1921 gaaacagact cttcatattc agatgcaaat aacttcataa gatgaacact ggagaagaat 1981 atcaaataat aaagtagcaa aacaaattca aataatccat tccaaaatac aatgttatca 2041 accaactgca caatcagttt atcctgacat attcaagtga taggataaag ttggccatgt 2101 attatgaaaa agattatttg tgcattttat tgactgggca acactgcagg acagtcaagg 2161 tgatatataa tttcctcact gcctggtaaa attaagcaca ctaaaccaag ttatttttct 2221 ttttaagaga tacttacatt tccatttatt gtttgaaatg tcgatcaaga gaatcaacag 2281 atgatagtcc aatttttact cagtgactgt tgtagcattt tcctgtttac tgattagagt 2341 ggttattcat tattcctcag attgctgaat cccatcaggc tgttattatg aaggaatttg 2401 attgctttgc tgcacagcag gacctgtgct ttgagatttt tttttctctt ttaaaatatc 2461 ctgtaactac aatgatggta aagccatgtt aaatgacttg attgtacttg gagtaattgc 2521 acattttttt ctatgcataa aaaaatgatg cagctgttga gaaaacgaag tctttttcat 2581 tttgcagaag gaaatgatgg aatttttctg tacttcagta tgtgtcaact gagagtcata 2641 tacattagtt ttaatctctt aatattgaga atcaggttgc aaaacggatg agttattatc 2701 tatggaaatg tgagaaatgt ctaatagccc ataaagtctg agaaataggt atcaaaatag 2761 tttaggaaaa tgagaggaga acagtagatt gctgtggcct agacttctga gtaattaata 2821 aagaaaaaga agtacccttt ggcctacaaa aaaaaaaaaa aaa // LOCUS HSU01157 3071 bp mRNA PRI 28-FEB-1995 DEFINITION Human glucagon-like peptide-1 receptor mRNA with CA dinucleotide repeat, complete cds. ACCESSION U01157 NID g684918 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3071) AUTHORS Dillon,J.S., Tanizawa,Y., Wheeler,M.B., Leng,X.H., Ligon,B.B., Rabin,D.U., Yoo-Warren,H., Permutt,M.A. and Boyd,A.E. III. TITLE Cloning and functional expression of the human glucagon-like peptide-1 (GLP-1) receptor JOURNAL Endocrinology 133 (4), 1907-1910 (1993) MEDLINE 94008746 REFERENCE 2 (bases 1 to 3071) AUTHORS Permutt,M. TITLE Direct Submission JOURNAL Submitted (31-AUG-1993) M. Alan Permutt, Internal Medicine, Washington University School of Medicine, 660 S. Euclid, St. Louis, MO 63110, USA FEATURES Location/Qualifiers source 1..3071 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="islet beta" /tissue_type="pancreas" CDS 1..1392 /note="GLP-1 receptor" /codon_start=1 /product="glucagon-like peptide-1 receptor" /db_xref="PID:g393108" /translation="MAGAPGPLRLALLLLGMVGRAGPRPQGATVSLWETVQKWREYRR QCQRSLTEDPPPATDLFCNRTFDEYACWPDGEPGSFVNVSCPWYLPWASSVPQGHVYR FCTAEGLWLQKDNSSLPWRDLSECEESKRGERSSPEEQLLFLYIIYTVGYALSFSALV IASAILLGFRHLHCTRNYIHLNLFASFILRALSVFIKDAALKWMYSTAAQQHQWDGLL SYQDSLSCRLVFLLMQYCVAANYYWLLVEGVYLYTLLAFSVFSEQWIFRLYVSIGWGV PLLFVVPWGIVKYLYEDEGCWTRNSNMNYWLIIRLPILFGIGVNFLIFVRVICIVVSK LKANLMCKTDIKCRLAKSTLTLIPLLGTHEVIFAFVMDEHARGTLRFIKLFTELSFTS FQGLMVAILYCFVNNEVQLEFRKSWERWRLEHLHIQRDSSMKPLKCPTSSLSSGATAG SSMYTATCQASCS" repeat_region 1486 /note="CA dinucleotide repeat" BASE COUNT 677 a 874 c 813 g 707 t ORIGIN 1 atggccggcg cccccggccc gctgcgcctt gcgctgctgc tgctcgggat ggtgggcagg 61 gccggccccc gcccccaggg tgccactgtg tccctctggg agacggtgca gaaatggcga 121 gaataccgac gccagtgcca gcgctccctg actgaggatc cacctcctgc cacagacttg 181 ttctgcaacc ggaccttcga tgaatacgcc tgctggccag atggggagcc aggctcgttc 241 gtgaatgtca gctgcccctg gtacctgccc tgggccagca gtgtgccgca gggccacgtg 301 taccggttct gcacagctga aggcctctgg ctgcagaagg acaactccag cctgccctgg 361 agggacttgt cggagtgcga ggagtccaag cgaggggaga gaagctcccc ggaggagcag 421 ctcctgttcc tctacatcat ctacacggtg ggctacgcac tctccttctc tgctctggtt 481 atcgcctctg cgatcctcct cggcttcaga cacctgcact gcaccaggaa ctacatccac 541 ctgaacctgt ttgcatcctt catcctgcga gcattgtccg tcttcatcaa ggacgcagcc 601 ctgaagtgga tgtatagcac agccgcccag cagcaccagt gggatgggct cctctcctac 661 caggactctc tgagctgccg cctggtgttt ctgctcatgc agtactgtgt ggcggccaat 721 tactactggc tcttggtgga gggcgtgtac ctgtacacac tgctggcctt ctcggtcttc 781 tctgagcaat ggatcttcag gctctacgtg agcataggct ggggtgttcc cctgctgttt 841 gttgtcccct ggggcattgt caagtacctc tatgaggacg agggctgctg gaccaggaac 901 tccaacatga actactggct cattatccgg ctgcccattc tctttggcat tggggtgaac 961 ttcctcatct ttgttcgggt catctgcatc gtggtatcca aactgaaggc caatctcatg 1021 tgcaagacag acatcaaatg cagacttgcc aagtccacgc tgacactcat ccccctgctg 1081 gggactcatg aggtcatctt tgcctttgtg atggacgagc acgcccgggg gaccctgcgc 1141 ttcatcaagc tgtttacaga gctctccttc acctccttcc aggggctgat ggtggccatc 1201 ttatactgct ttgtcaacaa tgaggtccag ctggaatttc ggaagagctg ggagcgctgg 1261 cggcttgagc acttgcacat ccagagggac agcagcatga agcccctcaa gtgtcccacc 1321 agcagcctga gcagtggagc cacggcgggc agcagcatgt acacagccac ttgccaggcc 1381 tcctgcagct gagactccag cgcctgccct ccctggggtc cttgctgcag gccgggtggc 1441 caatccaggt gggagagaca ctcccaggga caagggaagg aagggacaca cacacacaca 1501 cacacacaca cacacacaca tacatcctgc tttccctccc caaacccatc agacaggtaa 1561 atgggcagtg cctcctggga ccatggacac attttctcct aggagaagca gcctcctaat 1621 ttgatcacag tggcgagagg agaggaaaaa cgatcgctgt gaaaatgagg aggattgctt 1681 cttgtgaaac cacaggccct tggggttccc ccagacagag ccgcaaatca accccagact 1741 caaactcaag gtcaacggct tattagtgaa actggggctt gcaagaggag gtggttctga 1801 aagtggctct tctaacctca gccaaacaca gagcgggagt gacgggagcc tcctctgctt 1861 gcatcacttg gggtcaccac cctcccctgt cttctctcaa agggaagctg tttgtgtgtc 1921 tgggttgctt atttccctca tcttgccccc tcatctcact gcccagtttc tttttgaggg 1981 gctttgtttg ggccactgcc agcagctgtt tctggaaatg gctgtaggtg gtgttgagaa 2041 agaatgagca ttgagacggt gctcgcttct cctccaggta tttgagttgt tttggtgcct 2101 gcctctgcca tgcccagaga atcagggcag gcttgccacc ggggaaccca gccctggggt 2161 atgagctgcc aagtctattt taaagacgct caagaatcct ctggggttca tctagggaca 2221 cgttaggaat gtccagactg tgggtgtaga ttacctgcca cttccaggag cccagagggc 2281 caagagagac attgcctcca cctctccttg gaaatacttt atctgtgacc acacgctgtc 2341 tcttgagaat ttggatacac tctctagctt taggggacca tgaagagact ctcttaggga 2401 aaccaatagt ccccatcagc accatggagg caggctcccc ctgcctttga aattccccca 2461 cttgggagct tgtatatact tcactcactt ttctttattg ctgtgaatag tctgtgtgca 2521 caatgggcaa ttctgacttc tcccatctag tggaaatgag cgaaatcatg gttgtagtga 2581 tgttgtttgg gagagtgcag tagtaattga tttgacccac tcacacttgg agctaattaa 2641 ggtttgccct gcctgcagcc tcccccacaa ataatgaaca gcagaaagac tggacgggga 2701 aacctatcaa tcctgccccc agccatggtg aggaagcccc aagccatggt gacacacagc 2761 agcactgcag atagccagac acatggctat cctagagagg ctggcaagga gttcgtggct 2821 gcaaaagaag tttctggagc aagagagagc tcgctcttgg gagtcaggac ctccggggag 2881 agcagagggt tccgacggat tcctttatga gtcagtctct ctctcccttt taaatggtgg 2941 gaaccctccc caaaaccttt ccccagacac attctcctgt gcccctcaga gaggcatgtg 3001 atgtgcaagg aaaataatag gatgtaaaac acatcaagta gaaaatttct tatacttcca 3061 aaaaaaaaag c // LOCUS HSU01212 3718 bp DNA PRI 03-AUG-1994 DEFINITION Human olfactory marker protein (OMP) gene, complete cds. ACCESSION U01212 NID g520739 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3718) AUTHORS Buiakova,O.I., Rama Krishna,N.S., Getchell,T.V. and Margolis,F.L. TITLE Human and rodent OMP genes: Conservation of structural and regulatory motifs and cellular localization JOURNAL Genomics 20, 452-462 (1994) MEDLINE 94307732 REFERENCE 2 (bases 1 to 3718) AUTHORS Margolis,F.L. TITLE Direct Submission JOURNAL Submitted (02-SEP-1993) Frank L. Margolis, Roche Institute of Molecular Biology, 340 Kingsland Street, Nutley, NJ 07110-1199, USA FEATURES Location/Qualifiers source 1..3718 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HOMP2, HOMP3, HOMP5" /clone_lib="genomic library in EMBL3 from one caucasian female, Clontech Labs." /chromosome="11" /map="11q13.5" enhancer 449..459 /note="distal Olf-1 binding site" enhancer 510..534 /note="UBE binding site, putative NF-1 element" enhancer 976..986 /note="proximal Olf-1 binding site" CDS 1245..1736 /note="intronless open reading frame" /codon_start=1 /product="olfactory marker protein" /db_xref="PID:g520740" /translation="MAEDRPQQPQLDMPLVLDQGLTRQMRLRVESLKQRGEKRQDGEK LLQPAESVYRLNFTQQQRLQFERWNVVLDKPGKVTITGTSQNWTPDLTNLMTRQLLDP TAIFWRKEDSDAIDWNEADALEFGERLSDLAKIRKVMYFLVTFGEGVEPANLKASVVF NQL" polyA_signal 3593..3598 BASE COUNT 716 a 1063 c 1117 g 822 t ORIGIN 1 ggatcccact gattgattag ccaccatctc acaattgacc gtgcttgtgg tccactagac 61 ccttcagatt gttttccgtt gtgatacctt ggccttgact ctgtcctctt ttctgtgtgt 121 ggcgtgttgt ggcgaggggc gctccccaac tcccatcccc actctctccc caactccggc 181 tccactcaca ctccagttct ttcatttccc cagtataaag gctgagcttc tggttccgcc 241 ccgggccctg gggatataaa catttgccag attcttcctc ggcccctggg ggaaactgag 301 gattaattca ggtggagtaa gtggtgggat ttgggtagaa gtgaagcctt gtcctgttgt 361 ggccatggtg cagggctgcg gcacagccag ccatcagtgt catccgggtc agtaatgctc 421 aaggcacagt ccctggccca gcagcatgtc acctgggagg tggttaggaa tgcagattct 481 caggcccaca gagccctgat aaaccaggag ttctgggagg gggtccagca atctgtgtgt 541 taagtcctga gagtgagtct tgatgctcac tcaagtcttg agaaccacgg gtctgggtga 601 gagatacggt agctgggctg agatcctgtc aatgggactg gaggggaagg gtcccggggt 661 gtttgggaag cagaatcgac aggctttggt gattgggtgt ggaggagtga gagggaggcg 721 ggcgtcaggg gtagctccaa ggtttaactt aggtgacttc agatctccaa tcaccaagcc 781 ctctctggtc ctgccttctc cacctgctcc tgcgggtctt gcatcttctc ctgtgtacct 841 ccagtgagga gtggtcccca ccaccctccc catcagtgca cttacgaagt gctctcatct 901 tcacaaacaa gccagcaccc agcccagccc tggtagtcag ggcggttgcc acagcaattg 961 acatcagcga cctggtcccc aaggaacctg ccaccttccg cctgcctgca gggcctgcat 1021 tatcgcttct gcggggactg gagtggaggc agatggggac tcccacccct gacacacacc 1081 ccattttgag aactgagtgg ggctgggaag agccagtgcc aaagggaggg gaagagggaa 1141 gggcagaaag taggtggggc ccccctttgg tggcctcttc tctccacggc cccaggctcc 1201 agcccacttg ggtccttggc gttggtggca gcagcacttg ggccatggcg gaggacaggc 1261 cgcagcagcc gcagctggac atgccgctgg tcctggacca gggcctgacc aggcagatgc 1321 ggctacgcgt ggagagcctg aagcagcgcg gggagaagcg ccaggatggg gagaagctgc 1381 tgcagccagc ggagtctgtg taccgcctca acttcaccca gcagcagcgg ctacagttcg 1441 agcgctggaa tgtcgtgctg gacaagccgg gcaaggtcac catcacaggc acctcgcaga 1501 actggacgcc tgacctcacc aacctcatga cacgccagct gctggacccc actgccatct 1561 tctggcgcaa ggaggactcg gatgccatag attggaatga ggccgacgcc ctggagtttg 1621 gggagcgcct gtcggacctg gccaagatcc gcaaggtcat gtacttcctc gtcacctttg 1681 gcgagggtgt ggagcccgcc aacctcaagg cctccgtggt ttttaaccag ctctgacagc 1741 agctgccagc tgctgctctc ctctagccca cctgtgctct cccctgcccc tgccactttc 1801 ccccctgtat tttgggggcc attattctcg ctgctcagcc tgtcctctgc ttgcccagag 1861 gccccctgag tcccacacct ttcctcctct gcttctccct ggggccagca ctccagctca 1921 caggaagaag attctgaggc tccatagcct agaagctgga ctggctgctg cattgctata 1981 gacgatagag gcctactagg ggccagtgtg catggacagt gaggccaggg ccatctgcct 2041 tctctctgct tcattgtggg agagagagac tgagaaagac caagagagac acagagacag 2101 agattgaaaa acccagcatc cacttcctcc agagtcaggg agacagagat gatggggcgt 2161 ctccacgggg agtccagcaa gccggcattc actgctccct ggccttggtg ccctttgccg 2221 gagcctgtgt ctgggctgct ggtcccataa cacgtcgaca accctcagga tatggggcag 2281 ggttgctgca ggggtggatt tgggcagtgg agagtggctg gcaccctgga ggctgtgtag 2341 gcccagctgt ggctcttctg ggcctgactt cagggtggag aagtgaaggg ggaggttaca 2401 cagagatctg tctctacgca cacatatcca tgagacagag tgtgctgtat tcatatggat 2461 gtattctaga ggtctattcc taccctagga acaagtgcag ttttagatta tctgttcatc 2521 attgctgctg gttcaaggat ggctcttaac aggggcctgg tccggatgac cttggcctgg 2581 gggcttgctg agctaggaga ctgcagttca gatagtgaaa cagggagtgg attagtaaag 2641 ggggttccct ttgccttgag ggaagttgga gctggagaga gtggattctc cagggcctca 2701 ggtatcccct gctggggagt caggctcttt agagcttgca ggtcagggaa ggcaagtgct 2761 tcgtcctgac atagcatctg ttggcatttc ttgggcttct tcaatgcagc tgaggggggc 2821 agggcgaagg cgtggtgggc agttacgacg gctgatagtc ccaagtgggc tgcaggcggc 2881 agtggtgtga cggcagaatg gtaacctctg gggtcattgg atgcaactca ctcaccaaac 2941 agatggggaa actgaggcac aattttcatc agattcagtt ctgactctta gcctcattcc 3001 ccttcgcatt gcgcagtccc agagagcccc cccttttggg ggagtgcctg acctgcacct 3061 aacatcagcc aagtacagct aagccactgt ccccagcacc ctgacttaag gccagccctg 3121 tgttttgtcc tcagccagtc agggatgtgt ccaagacatt tcccctcatg aagcaaagct 3181 gtcaaggaac ttgccggctc tggaacagat gcactgaggg ccagagggtc agggccatcc 3241 cctgtggctg gggctgccgg gagggtgagc cccacctcgg aggtgtgcag gctggagcag 3301 catgctggag ctgagattct gtgggtgaga gagtgggaga gtgtctgtgg gctgagcact 3361 ggtcctttct gactcacagc tctggggccc attccgggac aggcttgaag aagtctcggc 3421 cattgcctgc cctgctgagc acgaggggag gccagaaccg tgtgcagtgg ccctgccctt 3481 ctgcttgagc tcttcctgca gctctgggga ccctcttagt cccgactgcc tgtctcccca 3541 gcctgtctgt cccggggcct gagtccctct gctgtgcccg ctgcaggtcc ccaataaagc 3601 ctgtgccctg gcctcggtgg tgtgcagtgt ctcgccatca gcccccatcc ctttcacaat 3661 ccctcacggc cccgagcact tgctccctgg ccacttccca cactccccca gcccttgc // LOCUS HSU01828 5968 bp mRNA PRI 15-OCT-1993 DEFINITION Human microtubule-associated protein 2 (MAP2) mRNA, complete cds. ACCESSION U01828 NID g409874 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5968) AUTHORS Price,R. TITLE Complete cDNA of human MAP2 gene and a profile of two RFLPs for BglII/BclI JOURNAL Unpublished REFERENCE 2 (bases 1 to 5968) AUTHORS Price,R. TITLE Direct Submission JOURNAL Submitted (15-SEP-1993) R. Arlen Price Dr., Psychiatry, University of Pennsylvania, 422 Curie Blvd, Philadelphia, PA 19104, USA FEATURES Location/Qualifiers source 1..5968 /organism="Homo sapiens" /db_xref="taxon:9606" gene 249..5729 /gene="MAP2" CDS 249..5729 /gene="MAP2" /codon_start=1 /product="microtubule-associated protein 2" /db_xref="PID:g409875" /translation="MADERKDEAKAPHWTSAPLTEASAHSHPPEIKDQGGRGEGLVRS ANGFPYREDEEGAFGEHGSQGTYSNTKENGINGELTSADRETAEEVSARIVQVVTAEA VAVLKAEQEKEAQHKDQTAALPLAAEETANLPPSPPPSPASEQTVTVEEDLLTASKME FHDQQELTPSTAEPSDQKEKESEKQSSPGEDLKHAALVSQPETTKTYPDKKDMQGTEE EKAPLALFGHTLVASLEDMKQKTEPSLVVPGIDLPKEPPTPKEQKDWFIEMPTEAKKD EWGLVAPISPGPLTPMREKDVFDDIPKWEGKQFDSPMPSPFQGGSFTLPLDVMKNEIV TETSPFAPAFLQPDDKKSLQQTSGPATAKDSFKIEEPHEAKPDKMAEAPPSEAMTLPK DAHIPVVEEHVMGKVLEEEKEAINQETVQQRDTFTPSGQEPILTEKETELKLEEKTTI SDKEAVPKESKPPKPADEEIGIIQTSTEHTFSEQKDQEPTTDMLKQDSFPVSLEQAVT DSAMTSKTLEKAMTEPSALIEKSSIQELFEMRVDDKDKIEGVGAATSAELDMPFYEDK SGMSKYFETSALKEEATKSIEPGSDYYELSDTRESVHESIDTMSPMHKNGDKEFQTGK ESQPSPPAQEAGYSTLAQSYPSDLPEEPSSPQERMFTIDPKVYGEKRDLHSKNKDDLT LSRSLGLGGRSAIEQRSMSINLPMSCLDSIALGFNFGRGHDLSPLASDILTNTSGSMD EGDDYLPATTPALEKAPCFPVESKEEEQIEKVKATGEESTQAEISCESPFLAKDFYKN GTVMAPDLPEMLDLAGTRSRLASVSADAEVARRKSVPSETVVEDSRTGLPPVTDENHV IVKTDSQLEDLGYCVFNKYTVPLPSPVQDSENLSGESGTFYEGTDDKVRRDLATDLSL IEVKLAAAGRVKDEFSVDKEASAHISGDKSGLSKEFDQEKKANDRLDTVLEKSEEHAD SKEHAKKTEEAGDEIETFGLGVTYEQALAKDLSIPTDASSEKAEKGLSSVPEIAEVEP SKKVEQGLDFAVQGQLDVKISDFGQMASGLNIDDRRATELKLEATQDMTPSSKAPQEA DAFMGVESGHMKEGTKVSETEVKQKVAKPDLVHQEAVDKEESYESSGEHESLTMESLK ADEGKKETSPESSLIQDEIAVKLSVEIPCPPAVSEADLATDERADVQMEFIQGPKEES KETPDISITPSDVAEPLHETIVSEPAEIQSEEEEIEAQGEYDKLLFRSDTLQITDLGV SGAREEFVETCPSEHKGVIESVVTIEDDFITVVQTTTDEGESGSHSVRFAALEQPEVE RRPSPHDEEEFEVEEAAEAQAEPKDGSPEAPASPEREEVALSEYKTETYDDYKDETTI DDSIMDADSLWVDTQDDDRSIMTEQLETIPKEEKAEKEARRSSLEKHRKEKPFKTGRG RISTPERKVAKKEPSTVSRDEVRRKKAVYKKAELAKKTEVQAHSPSRKFILKPAIKYT RPTHLSCVKRKTTAAGGESALAPSVFKQAKDKVSDGVTKSPEKRSSLPRPSSILPPRR GVSGDRDENSFSLNSSISSSARRTTRSEPIRRAGKSGTSTPTTPGSTAITPGTPPSYS SRTPGTPGTPSYPRTPHTPGTPKSAILVPSEKKVAIIRTPPKSPATPKQLRLINQPLP DLKNVKSKIGSTDNIKYQPKGGQVQIVTKKIDLSHVTSKCGSLKNIDRPGGGRVKIES VKLDFKEKVQAKVGSLDNAHHVPGGGNVKIDSQKLNFREHAKARVDHGAEIITQSPGR SSVASPRRLSNVSSSGSINLLESPQLATLAEDVTAALAKQGL" BASE COUNT 1948 a 1324 c 1399 g 1297 t ORIGIN 1 gggataatgc tcccgagaag gattctggca gcagttctca aaggctagac ttgagtggta 61 ttgctgcata tgcgctgatt cttcagcttg tctctaaccg aggaagcatt gattgggagc 121 tactcattca gaaaattaaa agaaagaagc cagaaaatat tatcaaccct ttgagaacac 181 gacacaacga actttatatt ttaccacttc cttgaatagt tgcaggagaa ataacaaggc 241 attgaagaat ggcagatgaa cggaaagacg aagcaaaggc acctcactgg acctcagcac 301 cgctaacaga ggcatctgca cactcacatc cacctgagat taaggatcaa ggcggacgag 361 gggaaggact tgtccgaagc gccaatggat tcccatacag ggaggatgaa gagggtgcct 421 ttggagagca tgggtcacag ggcacctatt caaataccaa agagaatggg atcaacggag 481 agctgacctc agctgacaga gaaacagcag aggaggtgtc tgcaaggata gttcaagtag 541 tcactgctga ggctgtagca gtcctgaaag ctgaacaaga gaaagaagct caacataaag 601 accagactgc agctctgcct ttagcagctg aagaaacagc taatctgcct ccttctccac 661 ccccatcacc tgcctcagaa cagactgtca cagtggagga agatttactt acagcctcga 721 agatggagtt ccacgatcaa caggaattga ctccctctac agctgagcct tcagaccaga 781 aggaaaagga gtcagagaag caaagtagtc ctggtgaaga ccttaaacat gctgccttag 841 tttctcagcc agagacaact aaaacttacc ctgataaaaa ggacatgcaa ggcacagaag 901 aagaaaaagc acccctagct ttgtttgggc acactcttgt tgccagcctg gaagacatga 961 aacagaagac agaaccaagc cttgtagtac ctggcattga cctccctaaa gagcctccaa 1021 ctccaaaaga acaaaaggac tggttcatcg aaatgccaac ggaagcaaaa aaggatgagt 1081 ggggtttagt tgcccccata tctcctggcc ctctgactcc catgagggaa aaagatgtat 1141 ttgatgatat cccaaaatgg gaagggaaac agtttgattc tcccatgcca agtccctttc 1201 aagggggaag cttcactctt cctttagatg tcatgaagaa tgaaatagtt acagaaacat 1261 cgccctttgc ccctgccttt ttacagccag atgacaaaaa atctctgcaa caaaccagtg 1321 gcccagctac tgccaaagat agttttaaaa ttgaagagcc ccatgaggct aaacctgaca 1381 aaatggcaga agcaccaccc tcagaggcaa tgaccttacc caaagatgct cacattccag 1441 ttgtagaaga acatgttatg gggaaagttt tagaggaaga aaaggaggcc ataaatcaag 1501 agactgtgca gcaaagggat actttcaccc ccagtggaca ggaacctata cttactgaaa 1561 aggaaactga gctgaagctt gaagaaaaaa ccaccatttc tgacaaagaa gctgtgccaa 1621 aagagagtaa acccccaaaa cctgcagatg aagaaatagg cataattcag acctccacag 1681 agcacacttt ctcagaacag aaagaccaag agcctaccac agatatgttg aaacaggact 1741 cgttccctgt aagtttggag caagcagtta cagattcagc catgacctct aaaacactgg 1801 agaaagccat gaccgaacca tctgcattaa ttgaaaagag ctcaattcag gaactttttg 1861 aaatgagagt tgatgacaaa gataagattg aaggagttgg agctgcaaca tcagctgagc 1921 ttgatatgcc attttatgaa gataaatcag gaatgtccaa gtactttgaa acatctgcct 1981 tgaaagaaga agcaacaaaa agcattgagc caggcagtga ttactatgaa ctgagtgaca 2041 ctagagaaag tgtccatgag tctattgata ccatgtctcc catgcataaa aatggtgaca 2101 aggagtttca aacaggaaaa gaatcccagc ccagtcctcc agcacaagaa gcagggtaca 2161 gcactctcgc acagagttat ccatcagatt tacctgaaga acccagttct cctcaagaaa 2221 gaatgttcac tattgatcca aaagtgtatg gagagaaaag ggacctccac agtaagaata 2281 aggatgattt gacccttagc aggagtttag gacttggtgg taggtctgca atagaacaaa 2341 gaagcatgtc aatcaatttg ccgatgtctt gcctagattc catagccctt ggatttaact 2401 ttggtcgggg acatgatctt tctcctctgg cttccgatat tctaaccaac actagtggaa 2461 gtatggatga aggggatgat taccttccag ccaccacacc tgcactggag aaagcccctt 2521 gcttccctgt agaaagcaaa gaggaagaac agatagagaa agtaaaagct actggagaag 2581 aaagtactca agcggagata tcatgtgagt ctcctttcct agccaaagat ttttacaaaa 2641 atggtactgt catggcacct gaccttcctg aaatgctaga tctggcaggc acaaggtcaa 2701 gattggcttc tgtgagtgca gatgctgagg ttgccaggag gaaatcagtc ccatcagaga 2761 ctgtggttga ggatagtcgt actggcttgc ccccggtaac tgatgaaaac catgtcattg 2821 taaaaacgga cagtcagctc gaagacctgg gctactgtgt gttcaataag tacacagtcc 2881 cattgccatc acctgttcaa gacagtgaga atttatcagg ggagagtggt accttttacg 2941 aaggcactga tgataaagtt cgaagagatt tggccacaga cctttcactg attgaagtga 3001 aactggcagc agccggaaga gtcaaagatg agttcagtgt tgacaaagaa gcatccgcgc 3061 atatctctgg tgacaaatca ggactgagta aggagtttga ccaagagaag aaagctaatg 3121 ataggttgga tactgtacta gaaaagagtg aagaacatgc tgattcaaaa gaacatgcca 3181 agaaaactga agaggctggt gatgaaatag aaacattcgg attaggagta acctatgagc 3241 aagctttggc caaagatttg tcaataccaa cagatgcatc ctctgagaaa gcagagaagg 3301 gtcttagttc agttccagag atagctgagg tagaaccatc caaaaaggtg gaacaaggtc 3361 tggattttgc tgtccagggt caactagatg ttaaaattag tgactttgga cagatggctt 3421 cagggctaaa catagatgat agaagggcaa cagagctaaa acttgaggct acacaggaca 3481 tgaccccctc atccaaagca ccgcaggagg cagatgcatt tatgggtgtt gagtctggcc 3541 acatgaaaga aggcactaaa gttagtgaga cagaagtcaa acagaaggtg gccaagcctg 3601 acttggtgca ccaggaggct gtagacaagg aggagtccta tgaatctagt ggtgagcatg 3661 aaagtctcac catggagtcc ttgaaagctg atgagggcaa gaaggaaaca tctccagaat 3721 catctctaat tcaagatgag attgccgtca aattgtcagt ggaaatacct tgcccacctg 3781 ctgtttcaga ggctgattta gccacagatg agagagctga tgtccagatg gaatttattc 3841 aggggccaaa agaagaaagc aaagagaccc cagatatatc catcacgcct tctgatgttg 3901 cagagccatt gcatgaaacg atcgtatctg aaccagcaga gattcagagt gaggaagaag 3961 agatagaagc ccagggagaa tatgataaac tgctcttccg ctcagacacc cttcagataa 4021 ctgacctggg tgtctcaggt gccagggagg aatttgtgga gacctgccca agtgaacaca 4081 aaggagtgat tgagtctgtt gtgaccatcg aggatgattt catcactgta gtgcaaacca 4141 caactgatga aggggagtca gggtcccaca gcgtgcgttt tgcagcccta gagcagcctg 4201 aggtggaaag gagaccatct cctcatgatg aagaagagtt tgaagtagaa gaggcagctg 4261 aagcccaggc agaacccaaa gatggttccc cagaggctcc agcttcccct gagagagaag 4321 aggttgcact ttctgaatat aagacagaaa cctatgacga ttacaaagat gagaccacca 4381 ttgacgactc catcatggac gctgacagcc tctgggtgga cactcaagat gatgatagga 4441 gcatcatgac agaacagtta gaaactattc ctaaagagga gaaagctgaa aaggaagctc 4501 ggagatcatc tcttgagaaa catagaaaag aaaagccttt taaaaccggg agaggcagaa 4561 tttccactcc tgaaagaaaa gtagctaaaa aggaacctag cacagtctcc agagatgaag 4621 tgagaaggaa aaaagcagtt tataagaagg ctgaacttgc taaaaaaaca gaagttcagg 4681 cccactctcc ctccaggaaa ttcattttaa aacctgctat caaatatact agaccaactc 4741 atctctcctg tgttaagcgg aaaaccacag cagcaggtgg ggaatcagct ctggctccca 4801 gtgtatttaa acaggcaaag gacaaagtct ctgacggagt aaccaagagc ccagaaaagc 4861 gctcttctct cccaagacct tcctccattc tccctcctcg gcgaggtgtg tcaggagaca 4921 gagatgagaa ttccttctct ctcaacagtt ctatctcttc ttcagcacgg cggaccacca 4981 ggtcagagcc aattcgcaga gcaggaaaga gtggtacctc aacacccact acccctgggt 5041 ctactgccat cactcctggc accccaccaa gttattcttc acgcacacca ggcactcctg 5101 gaacccctag ctatcccagg acccctcaca caccaggaac ccccaagtct gccatcttgg 5161 tgccgagtga gaagaaggtc gccatcatac gtactcctcc aaaatctcct gcgactccca 5221 agcagcttcg gcttattaac caaccactgc cagacctgaa gaatgtcaaa tccaaaatcg 5281 gatcaacaga caacatcaaa taccagccta aaggggggca ggtacaaatt gttaccaaga 5341 aaatagacct aagccatgtg acatccaaat gtggctctct gaagaacatc gacaggccag 5401 gtggcggacg tgtgaaaatt gagagtgtaa aactagattt caaggaaaag gtccaagcta 5461 aagttggttc tcttgataat gctcatcatg tacctggagg tggtaatgtc aagattgaca 5521 gccaaaagtt gaacttcaga gagcatgcta aagcccgtgt ggaccatggg gctgagatca 5581 ttacacagtc cccaggcaga tccagcgtgg catcaccccg acgactcagc aatgtctcct 5641 cgtctggaag catcaacctg ctcgaatctc ctcagcttgc cactttggct gaggatgtca 5701 ctgctgcact cgctaagcag ggcttgtgaa tatttctcat ttagcattga aataataata 5761 tttaggcatg agctcttggc aggagtgggc tctgagcagt tgttatattc attctttata 5821 aaccataaat aaataatctc atccccaaac tgtagtaatt gttacaattt tctatttaaa 5881 aaatgaatag tacatgcaga aattgacctg atttccattt gcaacaggaa gacactggct 5941 ttactgggtt caattggaca attatttt // LOCUS HSU01874 2026 bp mRNA PRI 27-MAY-1994 DEFINITION Human me20m mRNA, complete cds. ACCESSION U01874 NID g494939 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2023) AUTHORS Maresh,G.A., Marken,J.S., Neubauer,M., Aruffo,A., Hellstrom,I., Hellstrom,K. and Marquardt,H. TITLE Cloning and expression of the gene for the Melanoma-Associated ME20 Antigen JOURNAL DNA Cell Biol. 13, 87-95 (1994) MEDLINE 94235165 REFERENCE 2 (bases 1 to 2026) AUTHORS Neubauer,M.G. TITLE Direct Submission JOURNAL Submitted (16-SEP-1993) Michael G. Neubauer, Bristol-Myers Squibb Pharmaceutical Research Institute, 3005 1st Ave, Seattle, WA 98121, USA FEATURES Location/Qualifiers source 1..2026 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hf12-2" /clone_lib="cdm8" /cell_line="melanoma H3606" sig_peptide 7..78 CDS 7..1995 /standard_name="melanoma-associated ME20 antigen" /codon_start=1 /product="me20m" /db_xref="PID:g494940" /translation="MDLVLKRCLLHLAVIGALLAVGATKVPRNQDWLGVSRQLRTKAW NRQLYPEWTEAQRLDCWRGGQVSLKVSNDGPTLIGANASFSIALNFPGSQKVLPDGQV IWVNNTIINGSQVWGGQPVYPQETDDACIFPDGGPCPSGSWSQKRSFVYVWKTWGQYW QVLGGPVSGLSIGTGRAMLGTHTMEVTVYHRRGSRSYVPLAHSSSAFTITDQVPFSVS VSQLRALDGGNKHFLRNQPLTFALQLHDPSGYLAEADLSYTWDFGDSSGTLISRALVV THTYLEPGPVTAQVVLQAAIPLTSCGSSPVPGTTDGHRPTAEAPNTTAGQVPTTEVVG TTPGQAPTAEPSGTTSVQVPTTEVISTAPVQMPTAESTGMTPEKVPVSEVMGTTLAEM STPEATGMTPAEVSIVVLSGTTAAQVTTTEWVETTARELPIPEPEGPDASSIMSTESI TGSLGPLLDGTATLRLVKRQVPLDCVLYRYGSFSVTLDIVQGIESAEILQAVPSGEGD AFELTVSCQGGLPKEACMEISSPGCQPPAQRLCQPVLPSPACQLVLHQILKGGSGTYC LNVSLADTNSLAVVSTQLIMPGQEAGGLGQVPLIVGILLVLMAVVLASLIYRRRLMKQ DFSVPQLPHSSSHWLRLPRIFCSCPIGENSPLLSGQQV" mat_peptide 79..1992 /product="me20m" BASE COUNT 437 a 564 c 564 g 461 t ORIGIN 1 ctcgagatgg atctggtgct aaaaagatgc cttcttcatt tggctgtgat aggtgctttg 61 ctggctgtgg gggctacaaa agtacccaga aaccaggact ggcttggtgt ctcaaggcaa 121 ctcagaacca aagcctggaa caggcagctg tatccagagt ggacagaagc ccagagactt 181 gactgctgga gaggtggtca agtgtccctc aaggtcagta atgatgggcc tacactgatt 241 ggtgcaaatg cctccttctc tattgccttg aacttccctg gaagccaaaa ggtattgcca 301 gatgggcagg ttatctgggt caacaatacc atcatcaatg ggagccaggt gtggggagga 361 cagccagtgt atccccagga aactgacgat gcctgcatct tccctgatgg tggaccttgc 421 ccatctggct cttggtctca gaagagaagc tttgtttatg tctggaagac ctggggccaa 481 tactggcaag ttctaggggg cccagtgtct gggctgagca ttgggacagg cagggcaatg 541 ctgggcacac acaccatgga agtgactgtc taccatcgcc ggggatcccg gagctatgtg 601 cctcttgctc attccagctc agccttcacc attactgacc aggtgccttt ctccgtgagc 661 gtgtcccagt tgcgggcctt ggatggaggg aacaagcact tcctgagaaa tcagcctctg 721 acctttgccc tccagctcca tgaccccagt ggctatctgg ctgaagctga cctctcctac 781 acctgggact ttggagacag tagtggaacc ctgatctctc gggcacttgt ggtcactcat 841 acttacctgg agcctggccc agtcactgcc caggtggtcc tgcaggctgc cattcctctc 901 acctcctgtg gctcctcccc agttccaggc accacagatg ggcacaggcc aactgcagag 961 gcccctaaca ccacagctgg ccaagtgcct actacagaag ttgtgggtac tacacctggt 1021 caggcgccaa ctgcagagcc ctctggaacc acatctgtgc aggtgccaac cactgaagtc 1081 ataagcactg cacctgtgca gatgccaact gcagagagca caggtatgac acctgagaag 1141 gtgccagttt cagaggtcat gggtaccaca ctggcagaga tgtcaactcc agaggctaca 1201 ggtatgacac ctgcagaggt atcaattgtg gtgctttctg gaaccacagc tgcacaggta 1261 acaactacag agtgggtgga gaccacagct agagagctac ctatccctga gcctgaaggt 1321 ccagatgcca gctcaatcat gtctacggaa agtattacag gttccctggg ccccctgctg 1381 gatggtacag ccaccttaag gctggtgaag agacaagtcc ccctggattg tgttctgtat 1441 cgatatggtt ccttttccgt caccctggac attgtccagg gtattgaaag tgccgagatc 1501 ctgcaggctg tgccgtccgg tgagggggat gcatttgagc tgactgtgtc ctgccaaggc 1561 gggctgccca aggaagcctg catggagatc tcatcgccag ggtgccagcc ccctgcccag 1621 cggctgtgcc agcctgtgct acccagccca gcctgccagc tggttctgca ccagatactg 1681 aagggtggct cggggacata ctgcctcaat gtgtctctgg ctgataccaa cagcctggca 1741 gtggtcagca cccagcttat catgcctggt caagaagcag ggggccttgg gcaggttccg 1801 ctgatcgtgg gcatcttgct ggtgttgatg gctgtggtcc ttgcatctct gatatatagg 1861 cgcagactta tgaagcaaga cttctccgta ccccagttgc cacatagcag cagtcactgg 1921 ctgcgtctac cccgcatctt ctgctcttgt cccattggtg agaatagccc cctcctcagt 1981 gggcagcagg tctgagtact ctcatatgat gctgtgattg cggccg // LOCUS HSU01877 9046 bp mRNA PRI 06-JUN-1994 DEFINITION Human p300 protein mRNA, complete cds. ACCESSION U01877 NID g495300 KEYWORDS p300; transcriptional adaptor protein; E1A-binding protein; cell cycle regulatory protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 9046) AUTHORS Eckner,R., Ewen,M.E., Newsome,D., Gerdes,M., Decaprio,J.A., Lawrence,J.B. and Livingston,D.M. TITLE Molecular cloning and functional analysis of the adenovirus E1A associated 300kD protein (p300) reveals a protein with properties of a transcriptional adaptor JOURNAL Genes Dev. 8, 869-884 (1994) MEDLINE 95011587 REFERENCE 2 (bases 1 to 9046) AUTHORS Eckner,R. TITLE Direct Submission JOURNAL Submitted (16-SEP-1993) Richard Eckner, Neoplastic Disease Mechanisms, Dana-Farber Cancer Institute, 44 Binney Street, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..9046 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="p300" /clone_lib="from several different cDNA libraries" /chromosome="22" /map="22q13.2." /cell_line="293, Akata, Nalm-6, HeLa" 5'UTR 1..1199 CDS 1200..8444 /codon_start=1 /product="p300 protein" /db_xref="PID:g495301" /translation="MAENVVEPGPPSAKRPKLSSPALSASASDGTDFGSLFDLEHDLP DELINSTELGLTNGGDINQLQTSLGMVQDAASKHKQLSELLRSGSSPNLNMGVGGPGQ VMASQAQQSSPGLGLINSMVKSPMTQAGLTSPNMGMGTSGPNQGPTQSTGMMNSPVNQ PAMGMNTGTNAGMNPGMLAAGNGQGIMPNQVMNGSIGAGRGRQDMQYPNPGMGSAGNL LTEPLQQGSPQMGGQTGLRGPQPLKMGMMNNPNPYGSPYTQNPGQQIGASGLGLQIQT KTVLSNNLSPFAMDKKAVPGGGMPNMGQQPAPQVQQPGLVTPVAQGMGSGAHTADPEK RKLIQQQLVLLLHAHKCQRREQANGEVRQCNLPHCRTMKNVLNHMTHCQSGKSCQVAH CASSRQIISHWKNCTRHDCPVCLPLKNAGDKRNQQPILTGAPVGLGNPSSLGVGQQSA PNLSTVSQIDPSSIERAYAALGLPYQVNQMPTQPQVQAKNQQNQQPGQSPQGMRPMSN MSASPMGVNGGVGVQTPSLLSDSMLHSAINSQNPMMSENASVPSLGPMPTAAQPSTTG IRKQWHEDITQDLRNHLVHKLVQAIFPTPDPAALKDRRMENLVAYARKVEGDMYESAN NRAEYYHLLAEKIYKIQKELEEKRRTRLQKQNMLPNAAGMVPVSMNPGPNMGQPQPGM TSNGPLPDPSMIRGSVPNQMMPRITPQSGLNQFGQMSMAQPPIVPRQTPPLQHHGQLA QPGALNPPMGYGPRMQQPSNQGQFLPQTQFPSQGMNVTNIPLAPSSGQAPVSQAQMSS SSCPVNSPIMPPGSQGSHIHCPQLPQPALHQNSPSPVPSRTPTPHHTPPSIGAQQPPA TTIPAPVPTPPAMPPGPQSQALHPPPRQTPTPPTTQLPQQVQPSLPAAPSADQPQQQP RSQQSTAASVPTPNAPLLPPQPATPLSQPAVSIEGQVSNPPSTSSTEVNSQAIAEKQP SQEVKMEAKMEVDQPEPADTQPEDISESKVEDCKMESTETEERSTELKTEIKEEEDQP STSATQSSPAPGQSKKKIFKPEELRQALMPTLEALYRQDPESLPFRQPVDPQLLGIPD YFDIVKSPMDLSTIKRKLDTGQYQEPWQYVDDIWLMFNNAWLYNRKTSRVYKYCSKLS EVFEQEIDPVMQSLGYCCGRKLEFSPQTLCCYGKQLCTIPRDATYYSYQNRYHFCEKC FNEIQGESVSLGDDPSQPQTTINKEQFSKRKNDTLDPELFVECTECGRKMHQICVLHH EIIWPAGFVCDGCLKKSARTRKENKFSAKRLPSTRLGTFLENRVNDFLRRQNHPESGE VTVRVVHASDKTVEVKPGMKARFVDSGEMAESFPYRTKALFAFEEIDGVDLCFFGMHV QEYGSDCPPPNQRRVYISYLDSVHFFRPKCLRTAVYHEILIGYLEYVKKLGYTTGHIW ACPPSEGDDYIFHCHPPDQKIPKPKRLQEWYKKMLDKAVSERIVHDYKDIFKQATEDR LTSAKELPYFEGDFWPNVLEESIKELEQEEEERKREENTSNESTDVTKGDSKNAKKKN NKKTSKNKSSLSRGNKKKPGMPNVSNDLSQKLYATMEKHKEVFFVIRLIAGPAANSLP PIVDPDPLIPCDLMDGRDAFLTLARDKHLEFSSLRRAQWSTMCMLVELHTQSQDRFVY TCNECKHHVETRWHCTVCEDYDLCITCYNTKNHDHKMEKLGLGLDDESNNQQAAATQS PGDSRRLSIQRCIQSLVHACQCRNANCSLPSCQKMKRVVQHTKGCKRKTNGGCPICKQ LIALCCYHAKHCQENKCPVPFCLNIKQKLRQQQLQHRLQQAQMLRRRMASMQRTGVVG QQQGLPSPTPATPTTPTGQQPTTPQTPQPTSQPQPTPPNSMPPYLPRTQAAGPVSQGK AAGQVTPPTPPQTAQPPLPGPPPTAVEMAMQIQRAAETQRQMAHVQIFQRPIQHQMPP MTPMAPMGMNPPPMTRGPSGHLEPGMGPTGMQQQPPWSQGGLPQPQQLQSGMPRPAMM SVAQHGQPLNMAPQPGLGQVGISPLKPGTVSQQALQNLLRTLRSPSSPLQQQQVLSIL HANPQLLAAFIKQRAAKYANSNPQPIPGQPGMPQGQPGLQPPTMPGQQGVHSNPAMQN MNPMQAGVQRAGLPQQQPQQQLQPPMGGMSPQAQQMNMNHNTMPSQFRDILRRQQMMQ QQQQQGAGPGIGPGMANHNQFQQPQGVGYPPQPQQRMQHHMQQMQQGNMGQIGQLPQA LGAEAGASLQAYQQRLLQQQMGSPVQPNPMSPQQHMLPNQAQSPHLQGQQIPNSLSNQ VRSPQPVPSPRPQSQPPHSSPSPRMQPQPSPHHVSPQTSSPHPGLVAAQANPMEQGHF ASPDQNSMLSQLASNPGMANLHGASATDLGLSTDNSDLNSNLSQSTLDIH" misc_feature 1230..1250 /note="nuclear location signal" misc_feature 2238..2431 /note="zinc-finger like structure (cys/his rich region 1)" misc_feature 4430..4650 /note="bromodomain" misc_feature 4686..5552 /note="cys/his rich region 2" misc_signal 6156..6650 /note="cys/his rich region 3, binding site of adenovirus E1A oncoprotein" 3'UTR 8445..9046 BASE COUNT 2392 a 2552 c 2243 g 1859 t ORIGIN 1 ccttgtttgt gtgctaggct gggggggaga gagggcgaga gagagcgggc gagagtgggc 61 aagcaggacg ccgggctgag tgctaactgc gggacgcaga gagtgcggag gggagtcggg 121 tcggagagag gcggcagggg ccagaacagt ggcagggggc ccggggcgca cgggctgagg 181 cgacccccag ccccctcccg tccgcacaca cccccaccgc ggtccagcag ccgggccggc 241 gtcgacgcta ggggggacca ttacataacc cgcgccccgg ccgtcttctc ccgccgccgc 301 ggcgcccgaa ctgagcccgg ggcgggcgct ccagcactgg ccgccggcgt ggggcgtagc 361 agcggccgta ttattatttc gcggaaagga aggcgaagga ggggagcgcc ggcgcgagga 421 ggggccgcct gcgcccgccg ccggagcggg gcctcctcgg tgggctccgc gtcggcgcgg 481 gcgtgcgggc ggcgctgctc ggcccggccc cctcggccct ctggtccggc cagctccgct 541 cccggcgtcc ttgccgcgcc tccgccggcc gccgcgcgat gtgaggcggc ggcgccagcc 601 tggctctcgg ctcgggcgag ttctctgcgg ccattagggg ccggtgcggc ggcggcgcgg 661 agcgcggcgg caggaggagg gttcggaggg tgggggcgca ggcccgggag ggggcaccgg 721 gaggaggtga gtgtctcttg tcgcctcctc ctctcccccc ttttcgcccc cgcctccttg 781 tggcgatgag aaggaggagg acagcgccga ggaggaagag gttgatggcg gcggcggagc 841 tccgagagac ctcggctggg caggggccgg ccgtggcggg ccggggactg cgcctctaga 901 gccgcgagtt ctcgggaatt cgccgcagcg gaccggcctc ggcgaatttg tgctcttgtg 961 ccctcctccg ggcttgggcc aggccggccc ctcgcacttg cccttacctt ttctatcgag 1021 tccgcatccc tctccagcca ctgcgacccg gcgaagagaa aaaggaactt cccccacccc 1081 ctcgggtgcc gtcggagccc cccagcccac ccctgggtgc ggcgcgggga ccccgggccg 1141 aagaagagat ttcctgagga ttctggtttt cctcgcttgt atctccgaaa gaattaaaaa 1201 tggccgagaa tgtggtggaa ccggggccgc cttcagccaa gcggcctaaa ctctcatctc 1261 cggccctctc ggcgtccgcc agcgatggca cagattttgg ctctctattt gacttggagc 1321 acgacttacc agatgaatta atcaactcta cagaattggg actaaccaat ggtggtgata 1381 ttaatcagct tcagacaagt cttggcatgg tacaagatgc agcttctaaa cataaacagc 1441 tgtcagaatt gctgcgatct ggtagttccc ctaacctcaa tatgggagtt ggtggcccag 1501 gtcaagtcat ggccagccag gcccaacaga gcagtcctgg attaggtttg ataaatagca 1561 tggtcaaaag cccaatgaca caggcaggct tgacttctcc caacatgggg atgggcacta 1621 gtggaccaaa tcagggtcct acgcagtcaa caggtatgat gaacagtcca gtaaatcagc 1681 ctgccatggg aatgaacaca gggacgaatg cgggcatgaa tcctggaatg ttggctgcag 1741 gcaatggaca agggataatg cctaatcaag tcatgaacgg ttcaattgga gcaggccgag 1801 ggcgacagga tatgcagtac ccaaacccag gcatgggaag tgctggcaac ttactgactg 1861 agcctcttca gcagggctct ccccagatgg gaggacaaac aggattgaga ggcccccagc 1921 ctcttaagat gggaatgatg aacaacccca atccttatgg ttcaccatat actcagaatc 1981 ctggacagca gattggagcc agtggccttg gtctccagat tcagacaaaa actgtactat 2041 caaataactt atctccattt gctatggaca aaaaggcagt tcctggtgga ggaatgccca 2101 acatgggtca acagccagcc ccgcaggtcc agcagccagg tctggtgact ccagttgccc 2161 aagggatggg ttctggagca catacagctg atccagagaa gcgcaagctc atccagcagc 2221 agcttgttct ccttttgcat gctcacaagt gccagcgccg ggaacaggcc aatggggaag 2281 tgaggcagtg caaccttccc cactgtcgca caatgaagaa tgtcctaaac cacatgacac 2341 actgccagtc aggcaagtct tgccaagtgg cacactgtgc atcttctcga caaatcattt 2401 cacactggaa gaattgtaca agacatgatt gtcctgtgtg tctccccctc aaaaatgctg 2461 gtgataagag aaatcaacag ccaattttga ctggagcacc cgttggactt ggaaatccta 2521 gctctctagg ggtgggtcaa cagtctgccc ccaacctaag cactgttagt cagattgatc 2581 ccagctccat agaaagagcc tatgcagctc ttggactacc ctatcaagta aatcagatgc 2641 cgacacaacc ccaggtgcaa gcaaagaacc agcagaatca gcagcctggg cagtctcccc 2701 aaggcatgcg gcccatgagc aacatgagtg ctagtcctat gggagtaaat ggaggtgtag 2761 gagttcaaac gccgagtctt ctttctgact caatgttgca ttcagccata aattctcaaa 2821 acccaatgat gagtgaaaat gccagtgtgc cctccctggg tcctatgcca acagcagctc 2881 aaccatccac tactggaatt cggaaacagt ggcacgaaga tattactcag gatcttcgaa 2941 atcatcttgt tcacaaactc gtccaagcca tatttcctac gccggatcct gctgctttaa 3001 aagacagacg gatggaaaac ctagttgcat atgctcggaa agttgaaggg gacatgtatg 3061 aatctgcaaa caatcgagcg gaatactacc accttctagc tgagaaaatc tataagatcc 3121 agaaagaact agaagaaaaa cgaaggacca gactacagaa gcagaacatg ctaccaaatg 3181 ctgcaggcat ggttccagtt tccatgaatc cagggcctaa catgggacag ccgcaaccag 3241 gaatgacttc taatggccct ctacctgacc caagtatgat ccgtggcagt gtgccaaacc 3301 agatgatgcc tcgaataact ccacaatctg gtttgaatca atttggccag atgagcatgg 3361 cccagccccc tattgtaccc cggcaaaccc ctcctcttca gcaccatgga cagttggctc 3421 aacctggagc tctcaacccg cctatgggct atgggcctcg tatgcaacag ccttccaacc 3481 agggccagtt ccttcctcag actcagttcc catcacaggg aatgaatgta acaaatatcc 3541 ctttggctcc gtccagcggt caagctccag tgtctcaagc acaaatgtct agttcttcct 3601 gcccggtgaa ctctcctata atgcctccag ggtctcaggg gagccacatt cactgtcccc 3661 agcttcctca accagctctt catcagaatt caccctcgcc tgtacctagt cgtaccccca 3721 cccctcacca tactccccca agcatagggg ctcagcagcc accagcaaca acaattccag 3781 cccctgttcc tacaccacca gccatgccac ctgggccaca gtcccaggct ctacatcccc 3841 ctccaaggca gacacctaca ccaccaacaa cacaacttcc ccaacaagtg cagccttcac 3901 ttcctgctgc accttctgct gaccagcccc agcagcagcc tcgctcacag cagagcacag 3961 cagcgtctgt tcctacccca aacgcaccgc tgcttcctcc gcagcctgca actccacttt 4021 cccagccagc tgtaagcatt gaaggacagg tatcaaatcc tccatctact agtagcacag 4081 aagtgaattc tcaggccatt gctgagaagc agccttccca ggaagtgaag atggaggcca 4141 aaatggaagt ggatcaacca gaaccagcag atacgcagcc ggaggatatt tcagagtcta 4201 aagtggaaga ctgtaaaatg gaatctaccg aaacagaaga gagaagcact gagttaaaaa 4261 ctgaaataaa agaggaggaa gaccagccaa gtacttcagc tacccagtca tctccggctc 4321 caggacagtc aaagaaaaag attttcaaac cagaagaact acgacaggca ctgatgccaa 4381 cattggaggc actttaccgt caggatccag aatcccttcc ctttcgtcaa cctgtggacc 4441 ctcagctttt aggaatccct gattactttg atattgtgaa gagccccatg gatctttcta 4501 ccattaagag gaagttagac actggacagt atcaggagcc ctggcagtat gtcgatgata 4561 tttggcttat gttcaataat gcctggttat ataaccggaa aacatcacgg gtatacaaat 4621 actgctccaa gctctctgag gtctttgaac aagaaattga cccagtgatg caaagccttg 4681 gatactgttg tggcagaaag ttggagttct ctccacagac actgtgttgc tacggcaaac 4741 agttgtgcac aatacctcgt gatgccactt attacagtta ccagaacagg tatcatttct 4801 gtgagaagtg tttcaatgag atccaagggg agagcgtttc tttgggggat gacccttccc 4861 agcctcaaac tacaataaat aaagaacaat tttccaagag aaaaaatgac acactggatc 4921 ctgaactgtt tgttgaatgt acagagtgcg gaagaaagat gcatcagatc tgtgtccttc 4981 accatgagat catctggcct gctggattcg tctgtgatgg ctgtttaaag aaaagtgcac 5041 gaactaggaa agaaaataag ttttctgcta aaaggttgcc atctaccaga cttggcacct 5101 ttctagagaa tcgtgtgaat gactttctga ggcgacagaa tcaccctgag tcaggagagg 5161 tcactgttag agtagttcat gcttctgaca aaaccgtgga agtaaaacca ggcatgaaag 5221 caaggtttgt ggacagtgga gagatggcag aatcctttcc ataccgaacc aaagccctct 5281 ttgcctttga agaaattgat ggtgttgacc tgtgcttctt tggcatgcat gttcaagagt 5341 atggctctga ctgccctcca cccaaccaga ggagagtata catatcttac ctcgatagtg 5401 ttcatttctt ccgtcctaaa tgcttgagga ctgcagtcta tcatgaaatc ctaattggat 5461 atttagaata tgtcaagaaa ttaggttaca caacagggca tatttgggca tgtccaccaa 5521 gtgagggaga tgattatatc ttccattgcc atcctcctga ccagaagata cccaagccca 5581 agcgactgca ggaatggtac aaaaaaatgc ttgacaaggc tgtatcagag cgtattgtcc 5641 atgactacaa ggatattttt aaacaagcta ctgaagatag attaacaagt gcaaaggaat 5701 tgccttattt cgagggtgat ttctggccca atgttctgga agaaagcatt aaggaactgg 5761 aacaggagga agaagagaga aaacgagagg aaaacaccag caatgaaagc acagatgtga 5821 ccaagggaga cagcaaaaat gctaaaaaga agaataataa gaaaaccagc aaaaataaga 5881 gcagcctgag taggggcaac aagaagaaac ccgggatgcc caatgtatct aacgacctct 5941 cacagaaact atatgccacc atggagaagc ataaagaggt cttctttgtg atccgcctca 6001 ttgctggccc tgctgccaac tccctgcctc ccattgttga tcctgatcct ctcatcccct 6061 gcgatctgat ggatggtcgg gatgcgtttc tcacgctggc aagggacaag cacctggagt 6121 tctcttcact ccgaagagcc cagtggtcca ccatgtgcat gctggtggag ctgcacacgc 6181 agagccagga ccgctttgtc tacacctgca atgaatgcaa gcaccatgtg gagacacgct 6241 ggcactgtac tgtctgtgag gattatgact tgtgtatcac ctgctataac actaaaaacc 6301 atgaccacaa aatggagaaa ctaggccttg gcttagatga tgagagcaac aaccagcagg 6361 ctgcagccac ccagagccca ggcgattctc gccgcctgag tatccagcgc tgcatccagt 6421 ctctggtcca tgcttgccag tgtcggaatg ccaattgctc actgccatcc tgccagaaga 6481 tgaagcgggt tgtgcagcat accaagggtt gcaaacggaa aaccaatggc gggtgcccca 6541 tctgcaagca gctcattgcc ctctgctgct accatgccaa gcactgccag gagaacaaat 6601 gcccggtgcc gttctgccta aacatcaagc agaagctccg gcagcaacag ctgcagcacc 6661 gactacagca ggcccaaatg cttcgcagga ggatggccag catgcagcgg actggtgtgg 6721 ttgggcagca acagggcctc ccttccccca ctcctgccac tccaacgaca ccaactggcc 6781 aacagccaac caccccgcag acgccccagc ccacttctca gcctcagcct acccctccca 6841 atagcatgcc accctacttg cccaggactc aagctgctgg ccctgtgtcc cagggtaagg 6901 cagcaggcca ggtgacccct ccaacccctc ctcagactgc tcagccaccc cttccagggc 6961 ccccacctac agcagtggaa atggcaatgc agattcagag agcagcggag acgcagcgcc 7021 agatggccca cgtgcaaatt tttcaaaggc caatccaaca ccagatgccc ccgatgactc 7081 ccatggcccc catgggtatg aacccacctc ccatgaccag aggtcccagt gggcatttgg 7141 agccagggat gggaccgaca gggatgcagc aacagccacc ctggagccaa ggaggattgc 7201 ctcagcccca gcaactacag tctgggatgc caaggccagc catgatgtca gtggcccagc 7261 atggtcaacc tttgaacatg gctccacaac caggattggg ccaggtaggt atcagcccac 7321 tcaaaccagg cactgtgtct caacaagcct tacaaaacct tttgcggact ctcaggtctc 7381 ccagctctcc cctgcagcag caacaggtgc ttagtatcct tcacgccaac ccccagctgt 7441 tggctgcatt catcaagcag cgggctgcca agtatgccaa ctctaatcca caacccatcc 7501 ctgggcagcc tggcatgccc caggggcagc cagggctaca gccacctacc atgccaggtc 7561 agcagggggt ccactccaat ccagccatgc agaacatgaa tccaatgcag gcgggcgttc 7621 agagggctgg cctgccccag cagcaaccac agcagcaact ccagccaccc atgggaggga 7681 tgagccccca ggctcagcag atgaacatga accacaacac catgccttca caattccgag 7741 acatcttgag acgacagcaa atgatgcaac agcagcagca acagggagca gggccaggaa 7801 taggccctgg aatggccaac cataaccagt tccagcaacc ccaaggagtt ggctacccac 7861 cacagccgca gcagcggatg cagcatcaca tgcaacagat gcaacaagga aatatgggac 7921 agataggcca gcttccccag gccttgggag cagaggcagg tgccagtcta caggcctatc 7981 agcagcgact ccttcagcaa cagatggggt cccctgttca gcccaacccc atgagccccc 8041 agcagcatat gctcccaaat caggcccagt ccccacacct acaaggccag cagatcccta 8101 attctctctc caatcaagtg cgctctcccc agcctgtccc ttctccacgg ccacagtccc 8161 agccccccca ctccagtcct tccccaagga tgcagcctca gccttctcca caccacgttt 8221 ccccacagac aagttcccca catcctggac tggtagctgc ccaggccaac cccatggaac 8281 aagggcattt tgccagcccg gaccagaatt caatgctttc tcagcttgct agcaatccag 8341 gcatggcaaa cctccatggt gcaagcgcca cggacctggg actcagcacc gataactcag 8401 acttgaattc aaacctctca cagagtacac tagacataca ctagagacac cttgtatttt 8461 gggagcaaaa aaattatttt ctcttaacaa gactttttgt actgaaaaca atttttttga 8521 atctttcgta gcctaaaaga caattttcct tggaacacat aagaactgtg cagtagccgt 8581 ttgtggttta aagcaaacat gcaagatgaa cctgagggat gatagaatac aaagaatata 8641 tttttgttat gggctggtta ccaccagcct ttcttcccct ttgtgtgtgt ggttcaagtg 8701 tgcactggga ggaggctgag gcctgtgaag ccaaacaata tgctcctgcc ttgcacctcc 8761 aataggtttt attatttttt ttaaattaat gaacatatgt aatattaatg aacatatgta 8821 atattaatag ttattattta ctggtgcaga tggttgacat ttttccctat tttcctcact 8881 ttatggaaga gttaaaacat ttctaaacca gaggacaaaa ggggttaatg ttactttgaa 8941 attacattct atatatatat aaatatatat aaatatatat taaaatacca gttttttttc 9001 tctgggtgca aagatgttca ttcttttaaa aaatgtttaa aaaaaa // LOCUS HSU02019 2559 bp mRNA PRI 11-DEC-1993 DEFINITION Human AU-rich element RNA-binding protein AUF1 mRNA, complete cds. ACCESSION U02019 NID g433343 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2559) AUTHORS Zhang,W., Wagner,B.J., Ehrenman,K., Schaefer,A.W., DeMaria,C.T., Crater,D., DeHaven,K., Long,L. and Brewer,G. TITLE Purification, characterization, and cDNA cloning of an AU-rich element RNA-binding protein, AUF1 JOURNAL Mol. Cell. Biol. 13 (12), 7652-7665 (1993) MEDLINE 94067126 REFERENCE 2 (bases 1 to 2559) AUTHORS Wagner,B.J. TITLE Direct Submission JOURNAL Submitted (21-SEP-1993) Belinda J Wagner, Microbiology and Immunology, Bowman Gray School of Medicine of Wake Forest University, Medical Center Boulevard, Winston-Salem, NC 27157 USA FEATURES Location/Qualifiers source 1..2559 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pBS8" /clone_lib="HeLa cDNA library in lambdaZAP of B. Yoza" /cell_line="HeLa" 5'UTR 1..245 CDS 246..1106 /codon_start=1 /function="AU-rich element RNA-binding protein" /product="p37 AUF1" /db_xref="PID:g433344" /translation="MSEEQFGGTGRRHANGGGRRSAGDEEGAMVAATQGAAAAREADA GPGAEPRLEAPKGSAESEGAKIDASKNEEDEGKMFIGGLSWDTTKKDLKDYFSKFGEV VDCTLKLDPITGRSRGFGFVLFKESESVDKVMDQKEHKLNGKVIDPKRAKAMKTKEPV KKIFVGGLSPDTPEEKIREYFGGFGEVESIELPMDNKTNKRRGFCFITFKEEEPVKKI MEKKYHNVGLSKCEIKVAMSKEQYQQQQQWGSRGGFAGRARGEFRNSSEAGEGLELPP NSIHCWQLSV" 3'UTR 1107..2559 polyA_signal 1799..1804 BASE COUNT 663 a 608 c 785 g 503 t ORIGIN 1 ggaattccgg aattccgaat gcgtcggaaa gagcgggagt gtgcgccgcg cgagagtggg 61 aggcgaaggg ggcaggccag ggagaggcgc aggagccttt gcagccacgc gcgcgcgctt 121 ccctgtcttg tgtgcttcgc gaggtagagc gggcgccggc agcggcgggg attactttgc 181 tgctagtttc ggttgccggc agcggcgggt gtagtctcgg cggcagcggc ggagacacta 241 gcactatgtc ggaggagcag ttcggcggga cggggcggcg gcacgcgaac ggcggcggta 301 ggcgctcggc gggcgacgag gagggagcca tggtggcggc gacacagggg gcagcggcgg 361 cgcgggaagc ggacgcggga ccgggggcgg aaccgcgtct ggaggcaccg aagggcagcg 421 ccgagtcgga gggggcgaag attgacgcca gtaagaacga ggaggatgaa gggaaaatgt 481 ttataggagg ccttagctgg gacactacaa agaaagatct gaaggactac ttttccaaat 541 ttggtgaagt tgtagactgc actctgaagt tagatcctat cacagggcga tcaaggggtt 601 ttggctttgt gctatttaaa gaatcggaga gtgtagataa ggtcatggat caaaaagaac 661 ataaattgaa tgggaaggtg attgatccta aaagggccaa agccatgaaa acaaaagagc 721 cggttaaaaa aatttttgtt ggtggccttt ctccagatac acctgaagag aaaataaggg 781 agtactttgg tggttttggt gaggtggaat ccatagagct ccccatggac aacaagacca 841 ataagaggcg tgggttctgc tttattacct ttaaggaaga agaaccagtg aagaagataa 901 tggaaaagaa ataccacaat gttggtctta gtaaatgtga aataaaagta gccatgtcga 961 aggaacaata tcagcaacag caacagtggg gatctagagg aggatttgca ggaagagctc 1021 gtggggaatt ccggaattcc tcagaggcag gagaaggctt ggagctaccc ccaaactcaa 1081 tccactgttg gcagctgagc gtgtagtagg gtggtcctag ccatacagaa ccacttctct 1141 gtctccctcc tcttccctgg ttcgtccagc cccagtccat cagggaccac ctgggcagcc 1201 tcccagagat gggatcgggt tggggctaag ggcatcgggt ctgtcgcagc caggggtgca 1261 ggaggatcgc tgtgctgtga gccgttcagc tggctcccga cgaaggaggc acggaaccag 1321 acagcgcggc gagggcgaga gcgctgcagg caaggcgtag gccccgcggc ggatcttgcc 1381 gaagagcagg acaggctccg agtcctggaa ggggtagtgg ccggccagca tggtgaagag 1441 cgccacgccc aggctccaga catcggctgc cttgcccgag tatgaggccc gtgagctgag 1501 tatctcaggt cccacgtagg ctggcgacgc gtgcttgtcc acagggaatc atctggccca 1561 gtcagcacgc aggagtcctc caggttctcc agcaccagct tcttcctggg acatggggag 1621 aaacagaagg gtcaggtcct acccagaacc cccatgctat cacccttgtg gcacccactt 1681 tccaagtcgc tgctggcctt tgacagacac aagccagtcc tgtgatgtct gatcctgttt 1741 tacagatacc caagcccagg ctcagagagg ttaagtcatt taaggccaca gagcaattaa 1801 atttaaacta aaattctgaa aggaatacat ttttcaacag agtccttggg gagggggctg 1861 atggggctga gagggttaag cctctcttaa accagctaca aacttagggt ccaggcaggt 1921 aataagatga gagaaacagg aagtgtgcct gacatctcag cacaagcgct acctaaaaag 1981 ggtacacaac gcattctagg gtttaccaag tgcctgctgt gttcctggcc cttgacccag 2041 ctcattacct ggctcacctc attctatcta gctacagcct gcaaggaaga caccatttta 2101 cagctgtaga gcatgggcct gggatgggaa cgctggctgg cagatactca gagccagtgc 2161 tgtgacccac cctctcagtt cccaagatgg ccccacattc ccattgtttt ccccaagaga 2221 agccaggaat tgtattttaa tgaaaaggtc cccatttaaa aaatattggc aaaccagttt 2281 atataaaaaa cacaaacagg taagcagggc aaaaaaaaaa gtgtgtaagg ctgggcgcgg 2341 tgctcatgcc cggtaatcct agcactttgg gagcgcgagg cagggggatc acttgagttc 2401 aggagttcaa gaccagcctg ggcaacacgg taaaaaccta tctctacaaa aaatacgaaa 2461 attagcaggc atggtgattc gcacctgtag tcccagctac ttgggaggct gatcttgaac 2521 tcctgaactc aagtgatccc cctgcctcgg ccggaattc // LOCUS HSU02031 4249 bp mRNA PRI 22-OCT-1994 DEFINITION Human sterol regulatory element binding protein-2 mRNA, complete cds. ACCESSION U02031 NID g451329 KEYWORDS SREBP-2. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4249) AUTHORS Hua,X., Yokoyama,C., Wu,J., Briggs,M.R., Brown,M.S., Goldstein,J.L. and Wang,X. TITLE SREBP-2, a second basic-helix-loop-helix-leucine zipper protein that stimulates transcription by binding to a sterol regulatory element JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90 (24), 11603-11607 (1993) MEDLINE 94089681 REFERENCE 2 (bases 1 to 4249) AUTHORS Hua,X. TITLE Direct Submission JOURNAL Submitted (22-SEP-1993) Xianxin Hua, Department of Molecular Genetics, University of Texas, Southwestern Medical Center, 5323 Harry Hines Blvd., Dallas, TX 75235 USA FEATURES Location/Qualifiers source 1..4249 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 118..3543 /standard_name="SREBP-2" /codon_start=1 /product="sterol regulatory element binding protein-2" /db_xref="PID:g451330" /translation="MDDSGELGGLETMETLTELGDELTLGDIDEMLQFVSNQVGEFPD LFSEQLCSSFPGSGGSGSSSGSSGSSSSSSNGRGSSSGAVDPSVQRSFTQVTLPSFSP SAASPQAPTLQVKVSPTSVPTTPRATPILQPRPQPQPQPQTQLQQQTVMITPTFSTTP QTRIIQQPLIYQNAATSFQVLQPQVQSLVTSSQVQPVTIQQQVQTVQAQRVLTQTANG TLQTLAPATVQTVAAPQVQQVPVLVQPQIIKTDSLVLTTLKTDGSPVMAAVQNPALTA LTTPIQTAALQVPTLVGSSGTILTTMPVMMGQEKVPIKQVPGGVKQLEPPKEGERRTT HNIIEKRYRSSINDKIIELKDLVMGTDAKMHKSGVLRKAIDYIKYLQQVNHKLRQENM VLKLANQKNKLLKGIDLGSLVDNEVDLKIEDFNQNVLLMSPPASDSGSQAGFSPYSID SEPGSPLLDDAKVKDEPDSPPVALGMVDRSRILLCVLTFLCLSFNPLTSLLQWGGAHD SDQHPHSGSGRSVLSFESGSGGWFDWMMPTLLLWLVNGVIVLSVFVKLLVHGEPVIRP HSRSSVTFWRHRKQADLDLARGDFAAAAANLQTCLAVLGRALPTSRLDLACSLSWNVI RYSLQKLRLVRWLLKKVFQCRRATPATEAGFEDEAKTSARDAALAYHRLHQLHITGKL PAGSACSDVHMALCAVNLAECAEEKIPPSTLVEIHLTAAMGLKTRCGGKLGFLASYFL SRAQSLCGPEHSAVPDSLRWLCHPLGQKFFMERSWSVKSAAKESLYCAQRNPADPIAQ VHQAFCKNLLERAIESLVKPQAKKKAGDQEEESCEFSSALEYLKLLHSFVDSVGVMSP PLSRSSVLKSALGPDIICRWWTSAITVAISWLQGDDAAVRSHFTKVERIPKALEVTES PLVKAIFHACRAMHASLPGKADGQQSSFCHCERASGHLWSSLNVSGGTSDPALNHVVQ LLTCDLLLSLRTALWQKQASASQAVGETYHASGAELAGFQRDLGSLRRLAHSFRPAYR KVFLHEATVRLMAGGSPTRTHQLLEHSLRRRTTQSTKHGEVDAWPGQRERATAILLAC RHLPLSFLSSPGQRAVLLAEAARTLEKVGDRRSCNDCQQMIVKLGGGTAIAAS" BASE COUNT 860 a 1315 c 1202 g 872 t ORIGIN 1 ccgtcggtga ggcggtgccg ggcgggggtt gtcgggtgtc atgggcggtg gcgacggcac 61 cgcccccgcg tctccctgag cgggacggca gggggggctt ctgcgctgag ccgggcgatg 121 gacgacagcg gcgagctggg tggtctggag accatggaga ccctcacgga gctgggcgac 181 gagctgaccc tgggagacat cgacgagatg ctgcaatttg tcagtaatca agtgggagag 241 ttccctgact tgttttcaga acagctgtgt agctcctttc ctggcagtgg tggtagtggt 301 agcagcagcg gcagcagtgg cagcagcagc agcagcagca atggcagggg cagcagcagc 361 ggagctgtgg acccttcagt gcaacggtca ttcacccagg tcacattacc ttccttctct 421 ccctcggcgg cctccccaca ggctccaact ctgcaagtca aggtttctcc cacctcagtt 481 cccaccacac ccagggcaac tcctattctt cagccccgcc cccagcccca gcctcaacct 541 caaactcagc tgcaacaaca gacggtaatg atcacgccaa cattcagcac cactccgcag 601 acgaggatca tccagcagcc tttgatatac cagaatgcag ctactagctt tcaagtcctt 661 cagcctcaag tccaaagcct ggtgacatcc tcccaggtac agccggtcac cattcagcag 721 caggtgcaga cagtacaggc ccagcgggtg ctgacacaaa cggccaatgg cacgctgcag 781 acccttgccc cggctacggt gcagacagtt gctgcgccac aggtgcagca ggtcccggtc 841 ctggtccagc ctcagatcat caagacagat tcccttgttt tgaccacact gaagacagat 901 ggcagccctg ttatggctgc ggtccagaac ccggccctca ccgccctcac cacccctatc 961 cagacggctg cccttcaagt accaaccctg gtgggcagca gtgggaccat tctgaccaca 1021 atgcctgtaa tgatggggca agagaaagtg cccattaagc aggtacctgg gggagtcaag 1081 cagcttgagc cccccaaaga aggagaaagg cggacaaccc ataatatcat tgagaaacga 1141 tatcgctcct ccatcaatga caaaatcatc gaattgaaag acctggtcat ggggacagac 1201 gccaagatgc acaagtctgg cgttctgagg aaggccattg attacatcaa atacttgcag 1261 caggtcaatc ataaactgcg ccaggagaac atggtgctga agctggcaaa tcaaaagaac 1321 aagcttctaa agggcatcga cctaggcagt ctggtggaca atgaggtgga cctgaagatc 1381 gaggacttta atcagaatgt ccttctgatg tcccccccag cctctgactc agggtcccag 1441 gctggcttct ctccctactc cattgactct gagccaggaa gccctctatt ggatgatgca 1501 aaggtcaaag atgagccaga ctctcctcct gtggcgctgg gcatggtaga ccgctcacgg 1561 attcttctgt gtgtcctcac cttcctgtgc ctctccttta accccctgac ttccctgctg 1621 cagtggggag gggcccacga ctctgaccag cacccacact caggctctgg ccgcagtgtc 1681 ctgtcattcg agtcaggttc tgggggctgg tttgactgga tgatgcctac tcttctctta 1741 tggctggtaa atggtgtgat tgtcctgagc gtctttgtga agctgctggt tcatggggag 1801 ccagtgatcc ggccacactc gcgctcctcg gtcaccttct ggaggcaccg gaaacaggca 1861 gatctggatc tcgccagagg agattttgca gctgctgccg ccaacctaca aacctgcctg 1921 gcagttttgg gccgggcact gcccacctcc cgcctggacc tggcctgcag cctctcctgg 1981 aacgtgatcc gctacagcct gcagaagcta cgcctggtgc gctggctgct caagaaagtc 2041 ttccagtgcc ggcgggccac gccagccact gaggcaggct ttgaagacga agctaagacc 2101 agcgcccggg atgcggctct ggcctatcac cggctgcacc agctgcacat cacagggaag 2161 cttcctgcag gatccgcctg ttccgatgta cacatggcgt tgtgtgccgt gaacctggct 2221 gaatgtgcag aggagaagat cccaccgagc acactggttg agatccatct gactgctgcc 2281 atggggctca agacccggtg tggaggcaag ctgggcttcc tggccagcta cttcctcagc 2341 cgagcccaga gcctgtgtgg ccccgagcac agtgctgttc ctgactccct gcgctggctc 2401 tgccaccccc tgggccagaa gtttttcatg gagcggagct ggtctgtgaa gtcagctgcc 2461 aaggagagtc tatactgtgc ccagaggaac ccagctgacc ccattgcgca ggtccaccag 2521 gccttctgca agaacctgct ggagcgagct atagagtcct tggtgaaacc tcaggccaag 2581 aagaaggctg gagaccagga agaagagagc tgtgaattct ccagtgctct ggagtacttg 2641 aaattacttc attcttttgt ggactctgtg ggggttatga gccccccact ctccaggagc 2701 tccgtgctca agtccgccct gggtccagac atcatctgtc ggtggtggac gtctgcaatc 2761 actgtggcca tcagctggct ccagggagac gatgcagctg tgcgctctca ttttaccaaa 2821 gtggaacgca tccccaaggc cctggaagtg acagagagcc ccctggtgaa ggccatcttc 2881 catgcctgca gagccatgca tgcctcactc cctgggaaag cagatgggca gcagagttcc 2941 ttctgccatt gcgagagggc cagtggccac ctatggagca gcctcaacgt cagtgggggc 3001 acctctgacc ctgccctcaa ccacgtggtc cagctgctca cctgtgacct gctactgtcg 3061 ctacggacag cgctctggca aaaacaggcc agtgccagcc aggctgtggg ggagacctac 3121 cacgcgtcag gcgctgaact ggcgggcttc caacgggacc tgggcagcct gcgcaggctg 3181 gcacacagct tccgcccagc ataccgcaag gtgttcctgc atgaagccac cgtgcgcctg 3241 atggcaggag gcagccccac ccgcacccac cagctgctgg aacacagcct gcggcggcgc 3301 accacgcaga gcaccaagca cggagaggtg gatgcctggc ccggccagcg agagcgggcc 3361 accgccatcc tgctggcctg ccgccacctg cccctctcct tcctctcctc cccgggccag 3421 cgggcagtgc tgctggccga agctgcccgc accctggaga aggtgggcga ccggcgctcc 3481 tgcaacgact gccagcagat gattgttaag ctgggtggtg gcactgccat tgccgcctcc 3541 tgaccaccag gctcagccca cccctccacc tctctctcga tttctctctc tccccctcag 3601 catcttcccg ctgagagtgg tggggaagag ccttgtcttc ttagctgtca cctgccgagg 3661 cttctgggcc actcaggcca gtgcacccct gggcagagcc ccttaaagct gctgtcacta 3721 gatgcccatg gtccagggcc tggtgggcgt gagaggatag gtggcagggc agaaactggg 3781 cagccctgac ttgatagcag cagggggagc tcccaagctg ccaagcccct gcctccagcc 3841 ttcctgagtt tctctctcct gaaccctact ctctcctttt tgcttcctca gtttttatca 3901 ggctttctct gggggacagc agtctctgag caccagggag cagttgccct caggcctgtg 3961 cccagcatgc cctccccttt ttatacgaat gttttctacc agtgtgcttg ggtttgccat 4021 gatgcgaggc tgagttgctg tagcgtcttg attctctccc tgggtctgcg ttccctcccc 4081 tgggcctgac tgagcctgct cattgttttt ccctttatta cacaggacag ccaggggagg 4141 aggggggccc agccctggga ggctggtggg aggcaggggg caggcctgcg gatgcatgaa 4201 ataatgttgg cattattttt taatttttta aaaaataaat ggtatctta // LOCUS HSU02076 3545 bp mRNA PRI 04-DEC-1996 DEFINITION Human ATP-driven ion pump (ATP1AL1) mRNA, complete cds. ACCESSION U02076 NID g493015 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3545) AUTHORS Grishin,A.V., Sverdlov,V.E., Kostina,M.B. and Modyanov,N.N. TITLE Cloning and characterization of the entire cDNA encoded by ATP1AL1--a member of the human Na,K/H,K-ATPase gene family JOURNAL FEBS Lett. 349 (1), 144-150 (1994) MEDLINE 94320635 REFERENCE 2 (bases 1 to 3545) AUTHORS Sverdlov,V.E. and Grishin,A.V. TITLE Direct Submission JOURNAL Submitted (25-SEP-1993) Alexander V. Grishin, Cellular and Molecular Physiology, Yale University School of Medicine, 333 Cedar Street, New Haven, CT 06510-8026, USA FEATURES Location/Qualifiers source 1..3545 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="clones pHAS34.1, pHAS58.1, pHK14, pHK12, alphaD20-7, alphaD23-1" /clone_lib="pHAS, pSP64 armpit skin cDNA library; pHK, pBR322 kidney library" /tissue_type="skin, kidney" 5'UTR <1..167 gene 168..3287 /gene="ATP1AL1" CDS 168..3287 /gene="ATP1AL1" /standard_name="ion-motive ATP-hydrolase" /codon_start=1 /function="ion transport" /product="ATP-driven ion pump" /db_xref="PID:g404017" /translation="MHQKTPEIYSVELSGTKDIVKTDKGDGKEKYRGLKNNCLELKKK NHKEEFQKELHLDDHKLSNRELEEKYGTDIIMGLSSTRAAELLARDGPNSLTPPKQTP EIVKFLKQMVGGFSILLWVGAFLCWIAYGIQYSSDKSASLNNVYLGCVLGLVVILTGI FAYYQEAKSTNIMSSFNKMIPQQALVIRDSEKKTIPSEQLVVGDIVEVKGGDQIPADI RVLSSQGCRVDNSSLTGESEPQPRSSEFTHENPLETKNICFYSTTCLEGTVTGMVINT GDRTIIGHIASLASGVGNEKTPIAIEIEHFVHIVAGVAVSIGILFFIIAVSLKYQVLD SIIFLIGIIVANVPEGLLATVTVTLSLTAKRMAKKNCLVKNLEAVETLGSTSIICSDK TGTLTQNRMTVAHLWFDNQIFVADTSEDHSNQVFDQSSRTWASLSKIITLCNRAEFKP GQENVPIMKKAVIGDASETALLKFSEVILGDVMEIRKRNRKVAEIPFNSTNKFQLSIH EMDDPHGKRFLMVMKGAPERILEKCSTIMINGEEHPLDKSTAKTFHTAYMELGGLGER VLGFCHLYLPADEFPETYSFDIDAMNFPTSNLCFVGLLSMIDPPRSTVPDAVTKCRSA GIKVIMVTGDHPITAKAIAKSVGIISANSETVEDIAHRLNIAVEQVNKRDAKAAVVTG MELKDMSSEQLDEILANYQEIVFARTSPQQKLIIVEGCQRQDAVVAVTGDGVNDSPAL KKADIGIAMGIAGSDAAKNAADMVLLDDNFASIVTGVEEGRLIFDNLKKTIAYSLTKN IAELCPFLIYIIVGLPLPIGTITILFIDLGTDIIPSIALAYEKAESDIMNRKPRHKNK DRLVNQPLAVYSYLHIGLMQALGAFLVYFTVYAQEGFLPRTLINLRVEWEKDYVNDLK DSYGQEWTRYQREYLEWTGYTAFFVGILVQQIADLIIRKTRRNSIFQQGLFRNKVIWV GITSQIIIGLILSYGLGSVTALSFTMLRAQYWFVAVPHAILIWVYDEVRKLFIRLYPG SWWDKNMYY" 3'UTR 3288..3545 polyA_signal 3514..3519 /evidence=experimental polyA_site 3545 /note="29 A residues" BASE COUNT 892 a 941 c 895 g 817 t ORIGIN 1 cggccgcgga ggtgcgtgca gggcccgcgc cgccgccggt atctccaccg ccaacacctc 61 agccactgcc actgccacag ccacacgagg ccccccaccg tgcgctccgc cgctgcggtc 121 ccggatccgc gctccacgcc cgcagcccgc ggcgccacca gcccagcatg caccagaaaa 181 ccccagaaat ttactccgtg gagctcagcg gaactaagga catcgtgaaa acagacaagg 241 gggatggcaa ggagaagtat aggggtctga agaacaactg cctggaactc aaaaagaaaa 301 atcacaaaga ggagtttcag aaagaactcc atctggatga ccacaaactc agcaataggg 361 aattggaaga gaaatatggc acagacatca ttatgggtct ctccagcacc agagctgccg 421 agctcctggc ccgggatggg cccaactccc tcacccctcc caagcagacg cctgagatcg 481 tcaagttcct caagcagatg gtgggggggt tctctatcct cctgtgggtg ggcgcctttc 541 tctgttggat tgcatatggg attcagtact ccagcgacaa gtctgcatcc ctgaacaacg 601 tgtacttggg ctgtgtgctt ggtctggtgg tcattttaac ggggatcttt gcttattacc 661 aagaggcaaa aagcaccaac atcatgtcca gcttcaataa gatgatccct cagcaagctc 721 tcgtcatccg agattccgag aagaagacca tcccttcaga gcagctggtg gtgggggaca 781 ttgtggaggt caaaggagga gaccagatcc ctgcagacat cagggtgctg tcttctcagg 841 ggtgtcgggt ggataactca tctctcacgg gggagtctga gccccagccc cgctcctctg 901 agtttaccca tgaaaacccc ctggaaacaa agaacatctg cttctattcc acaacgtgtc 961 tggaaggcac tgtcaccggc atggttatca acacgggtga ccgcaccatc attggccata 1021 ttgcctcatt ggcctcagga gttggaaatg agaagacgcc cattgccatt gagatcgagc 1081 actttgttca cattgtggca ggagtggctg tctccatcgg catccttttc ttcatcatcg 1141 ctgtgtccct gaagtatcaa gtcctggact ccatcatctt cctcattggc atcattgtgg 1201 ccaatgtgcc cgagggcctc ctggccactg tcactgtgac cctgtcgctg acagcaaaac 1261 ggatggccaa gaagaactgc ctggtgaaga acctggaggc tgtggagacc ctcggctcca 1321 cctccatcat ctgctcggac aagactggga cactgaccca gaacaggatg acagtggccc 1381 atctgtggtt cgacaatcag atctttgtgg ctgacaccag tgaggaccat tcaaaccaag 1441 tctttgacca aagctctagg acttgggcct ccttatccaa gataataaca ttgtgtaacc 1501 gagcagagtt caagccagga caggaaaatg tccccatcat gaagaaagct gtgattggag 1561 atgcctcaga aactgctctt ttaaaattct cagaggtcat tttgggtgat gtgatggaaa 1621 ttagaaaaag aaaccgcaaa gtagctgaaa tcccttttaa ctctactaat aaatttcagc 1681 tctccatcca cgagatggat gacccccacg gcaagcgctt cctcatggtg atgaaggggg 1741 cccctgagcg cattctagag aaatgcagca ccatcatgat caacggcgag gagcacccac 1801 tggacaagag cactgccaag accttccaca cagcctacat ggagctgggc gggttgggcg 1861 agcgtgtgct gggtttctgt catctctacc tgccagcaga cgagtttcca gaaacctact 1921 catttgacat agacgctatg aactttccga cctccaacct ctgttttgtg ggactcttgt 1981 caatgatcga tccccctcgg tccaccgtgc cagatgcagt caccaaatgc cggagtgcag 2041 ggatcaaggt tattatggtt actggtgatc atcccatcac agccaaagct attgccaaga 2101 gtgtggggat catttcagcc aacagtgaaa cagtggaaga cattgcacat cgcctcaaca 2161 ttgctgtgga gcaagttaac aaacgggatg ccaaggccgc tgtggtgact ggcatggagc 2221 tgaaggacat gagctcagaa cagctggatg agatcttagc caactaccag gagattgtct 2281 ttgcccggac atccccccag cagaagctga tcattgtgga gggctgtcag aggcaggatg 2341 ctgttgttgc tgtgaccggg gatggagtta atgactctcc ggctctaaag aaggcagaca 2401 ttgggattgc catggggata gcaggttctg atgcagccaa aaatgcagcc gacatggtct 2461 tgctggacga caacttcgca tccatcgtca caggggtgga ggaaggtcgc ctgatctttg 2521 acaacctcaa gaagactatt gcttattccc tgaccaagaa cattgccgag ctgtgcccct 2581 ttctgatcta catcattgtc gggctccccc tgcccattgg caccatcacc attctgttca 2641 ttgacttggg gacagacatt atcccctcca ttgccttggc gtacgagaaa gctgaaagtg 2701 acatcatgaa caggaagcct cgccacaaga ataaggacag gctggtgaac cagccgctcg 2761 ctgtgtactc atacctgcac attggcctca tgcaagccct gggagctttc cttgtgtatt 2821 tcaccgtcta tgcacaagag ggctttctgc cccgcactct cattaacctg cgggtagaat 2881 gggagaagga ctacgtgaat gacttgaaag acagctatgg gcaggaatgg acaaggtacc 2941 agagggaata cctagaatgg acgggctaca cggctttctt tgttggcatc ctagtccagc 3001 aaatagcaga tctgatcatc aggaaaaccc ggaggaattc catcttccag cagggtctct 3061 tcagaaataa agtcatctgg gtggggatca cctcacagat catcattggt ctgatcctct 3121 cctatggcct cggaagtgtc acagccttga gtttcaccat gcttagggct cagtactggt 3181 ttgtggctgt gccgcacgcc atcctgatct gggtgtatga tgaggtgcgg aagctcttca 3241 tcaggctcta ccctggaagc tggtgggata agaacatgta ttattaagac cacctccctt 3301 cctatgtctc tcagcagcac gttggggcac acttgttcat cttctgaccg tttgctgggc 3361 tattcccctg cagtgcagac atcgtcaaaa ttcatacaag aggaaatttt catgcagaaa 3421 gctgtatgca ggatgctcac tgatgttttg cactttaaaa ctgaaattca actctttata 3481 taggattttc ttttctatct ccatctcctc attaaaaaat acgtacattt cgaggtaatg 3541 gtata // LOCUS HSU02081 2081 bp mRNA PRI 25-SEP-1996 DEFINITION Human guanine nucleotide regulatory protein (NET1) mRNA, complete cds. ACCESSION U02081 NID g548081 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2081) AUTHORS Chan,A.M., Takai,S., Yamada,K. and MikiT. TITLE Isolation of a novel oncogene, NET1, from neuroepithelioma cells by expression cDNA cloning JOURNAL Oncogene 12 (6), 1259-1266 (1996) MEDLINE 96226357 REFERENCE 2 (bases 1 to 2081) AUTHORS Chan,A.M. TITLE Direct Submission JOURNAL Submitted (24-SEP-1993) Andrew M.-L. Chan, Lab. of Cellular and Molecular Biology, National Cancer Institute, Building 37, Room 1E24, Bethesda, Maryland 20892, USA FEATURES Location/Qualifiers source 1..2081 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="39N-1" /chromosome="10" /cell_line="SK-N-MC" /cell_type="neuroblastoma" /tissue_type="neuroepithelial" gene 1..1383 /gene="NET1" CDS 1..1383 /gene="NET1" /note="neuro epithelioma transforming gene 1; formerly designated nep1" /codon_start=1 /product="guanine nucleotide regulatory protein" /db_xref="PID:g548082" /translation="MDGWPAKRRSSALWSEMLDITMKESLTTREIRRQEAIYEMSRGE QDLIEDLKLARKAYHDPMLKLSIMSEEELTHIFGDLDSYIPLHEDLLTRIGEATKPDG TVEQIGHILVSWLPRLNAYRGYCSNQLAAKALLDQKKQDPRVQDFLQRCLESPFSRKL DLWSFLDIPRSRLVKYPLLLKEILKHTPKEHPDVQLLEDAILIIQGVLSDINLKKGES ECQYYIDKLEYLDEKQRDPRIEASKVLLCHGELRSKSGHKLYIFLFQDILVLTRPVTR NERHSYQVYRQPIPVQELVLEDLQDGDVRMGGSFRGAFSNSEKAKNIFRIRFHDPSPA QSHTLQANDVFHKQQWFNCIRAAIAPFQSAGSPPELQGLPELHEECEGNHPSARKLTA QRRASTVSSVTQVEVDENAYRCGSGMQMAEDSKSLKTHQTQPGIRRARDKALLVANGK RLWCREGSVC" BASE COUNT 594 a 431 c 461 g 595 t ORIGIN 1 atggatggat ggcccgccaa gagaaggagc agtgcactgt ggtcagagat gctggacatc 61 accatgaagg agtctctcac caccagggag atcagacggc aggaggcaat atatgaaatg 121 tcccgaggtg aacaggattt aattgaggat ctcaaacttg caagaaaggc ctaccatgac 181 cccatgttaa agttgtccat catgtcagaa gaggaactca cacatatatt tggtgatctg 241 gactcttaca tacctctgca tgaagatttg ttgacaagaa taggagaagc aaccaagcct 301 gatggaacag tggagcagat tggtcacatt ctcgtgagct ggttaccgcg cttgaatgcc 361 tacagaggtt actgtagtaa ccagctggca gccaaagctc ttcttgatca aaagaaacag 421 gatccaagag tccaagactt cctccagcga tgtctcgagt ctcccttcag tcgaaaacta 481 gatctttgga gtttcctaga tatccctcga agtcgcctag tcaaataccc tttactgtta 541 aaagaaattc ttaaacacac tccaaaagag caccctgatg ttcagcttct ggaggatgct 601 atattgataa tacagggagt cctctctgat atcaacttga agaaaggtga atccgagtgc 661 cagtattaca tcgacaagct ggagtacctg gatgaaaagc agagggaccc cagaatcgaa 721 gcgagcaaag tgctgctgtg ccatggggag ctgcggagca agagtggaca taaactttac 781 attttcctgt ttcaagacat cttggttctg actcggcccg tcacacggaa cgaacggcac 841 tcttaccagg tttaccggca gccaatccca gtccaagagc tagtcctaga agacctgcag 901 gatggagatg tgagaatggg aggctccttt cgaggagctt tcagtaactc agagaaagct 961 aaaaatatct ttagaattcg cttccatgac ccctctccag cccagtctca cactctgcaa 1021 gccaatgacg tgttccacaa gcagcagtgg ttcaactgta ttcgagcggc cattgccccc 1081 ttccagtcgg caggcagtcc acctgagctg cagggcctgc cggagctgca cgaagagtgt 1141 gaggggaacc acccctctgc gaggaaactc acagcccaga ggagggcatc cacagtttcc 1201 agtgttactc aggtagaagt tgatgaaaac gcttacagat gtggctctgg catgcagatg 1261 gcagaggaca gcaagagctt aaagacacac cagacacagc ccggcatccg aagagcgagg 1321 gacaaagccc ttctggtggc aaacggaaag agactttggt gtagagaagg ctctgtgtgt 1381 taactgatgg gagagactgt ttgtttataa atgtgtacag ttttgttttc tcgtaagggg 1441 agcatcatag ggttacttta taccagttgt aacattttca ttgtttttgg ttgttctttt 1501 ttcttttttt aatggcagct aaagatatac agattactgt taaattgcag tccttttttt 1561 tttaaagata ttttcttgag ttatttagaa catggtaagc ctggtatttt ttaatcaaac 1621 aaaatattta tgaaatgggt tttctcttaa ttctggattc atcatggctt tctaatacca 1681 attgtaatat ttacaatatt caccaaaact tagaattttg caaatgcagg aattctgcca 1741 gtgtttcttt gctaagcctt gcatgcaaaa tttgaaattt taacattggc acccaaaacc 1801 tacatggaat gtatgtctgg agtatttcaa actttacatt gaaacataat ttccttggaa 1861 aacaaaccat aagcctgagg aggtttttat caactggaat gctttatatt agtttgtttt 1921 tcactgtaca ttcctcattt tacattcatt taacctgccg attatttaat ttttttattg 1981 taaagtagtt tttagcattt gcttttattt ttttactttg atgccttaac aaattggcac 2041 gtctttaaag tatttttctt cctgattaaa aatgtgtgtg t // LOCUS HSU02082 2226 bp mRNA PRI 11-MAY-1994 DEFINITION Human guanine nucleotide regulatory protein (tim1) mRNA, complete cds. ACCESSION U02082 NID g484101 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2226) AUTHORS Chan,A.M.L., McGovern,E.S., Catalano,G., Fleming,T.P. and Miki,T. TITLE Expression cDNA cloning of a novel oncogene with sequence similarity to regulators of small GTP-binding proteins JOURNAL Oncogene 9, 1057-1063 (1994) MEDLINE 94181257 REFERENCE 2 (bases 1 to 2226) AUTHORS Chan,A.M. TITLE Direct Submission JOURNAL Submitted (24-SEP-1993) Andrew M.-L. Chan, Lab of Cellular and Molecular Biology, National, Cancer Institute, Building 37, Room 1E24, Bethesda, Maryland 20892 USA FEATURES Location/Qualifiers source 1..2226 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="42B-8" /clone_lib="B5/589" /chromosome="7" /sex="female" /cell_line="B5/589" /cell_type="epithelial" /tissue_type="mammary gland" gene 94..1653 /gene="tim1" CDS 94..1653 /gene="tim1" /codon_start=1 /product="guanine nucleotide regulatory protein" /db_xref="PID:g484102" /translation="MGGFSRRCSKLINSSQLLYQEYSDVVLNKEIQSQQRLESLSETP GPSSPRQPRKALVSSESYLQRLSMASSGSLWQEIPVVRNSTVLLSMTHEDQKLQEVKF ELIVSEASYLRSLNIAVDHFQLSTSLRATLSNQEHQWLFSRLQDVRDVSATFLSDLEE NFENNIFSFQVCDVVLNHAPDFRRVYLPYVTNQTYQERTFQSLMNSNSNFREVLEKLE SDPVCQRLSLKSFLILPFQRITRLKLLLQNILKRTQPGSSEEAEATKAHHALEQLIRD CNNNVQSMRRTEELIYLSQKIEFECKIFPLISQSRWLVKSGELTALEFSASPGLRRKL NTRPVHLHLFNDCLLLSRPREGSRFLVFDHAPFSSIRGEKCEMKLHGPHKNLFRLFLR QNTQGAQAEFLFRTETQSEKLRWISALAMPREELDLLECYNSPQVQCLRAYKPRENDE LALEKADVVMVTQQSSDGWLEGVRLSDGERGWFPVQQVEFISNPEVRAQNLKEAHRVK TAKLQLVEQQA" BASE COUNT 532 a 614 c 589 g 491 t ORIGIN 1 gagggctctt cagattcaag aggtccagcc gtggagaaac atccgggacc ctcagacact 61 gttgtttttc gggagaaaaa accaaaggag gtgatgggag gcttttcaag acgctgctcc 121 aaactcatca actcctccca gctgctttac caggagtata gtgatgttgt cctgaataag 181 gagatccaga gccagcagcg gctggagagc ctgtccgaga cacccgggcc tagctctccg 241 cggcagcctc ggaaggccct ggtctcctcc gagtcgtacc tgcagcggct ctccatggcc 301 tccagcggct ccctctggca ggaaatcccc gtggtgcgca acagcaccgt gctgctctcc 361 atgacccatg aagaccaaaa gctgcaagag gtcaaatttg agctgattgt gtcagaggcc 421 tcctacctgc gcagtctaaa catagctgtg gatcatttcc aactttcaac ttcactccgg 481 gccacacttt ccaaccagga gcaccaatgg ctcttctctc gtttacagga tgtgcgagac 541 gtcagcgcca cgttcctttc agacctggaa gagaactttg agaacaatat cttctccttc 601 caagtatgtg acgtagtcct gaaccacgcc ccagacttcc gccgggtcta cctgccttat 661 gtcaccaacc agacctatca ggaacgcacc ttccagagcc tgatgaatag caacagcaat 721 ttccgggagg tcttggagaa gctggagagc gaccccgtct gccagcgcct ttccctcaag 781 tcctttctga ttctgccctt ccaacgcatc acccgcctca aactgctgct ccagaacatt 841 ctgaagagaa cacagcctgg ctcctcggag gaggcagagg ccacgaaggc acaccacgcc 901 ctggagcagc tgatccggga ctgcaataac aatgtccaga gtatgcgacg gacagaggag 961 ctaatctacc tgagccagaa gattgagttt gagtgcaaaa tattcccgct catttctcag 1021 tcacgctggc tggtgaaaag tggggagctg acagccttgg agttcagtgc ttccccaggg 1081 ctacgaagga agctgaacac gcgtccagtc cacctgcacc tcttcaatga ctgtctgctg 1141 ctgtctcggc cccgagaggg tagccgattc ctggtatttg accatgctcc cttctcctcc 1201 attcgggggg aaaagtgtga aatgaagcta catggacctc acaaaaacct gttccgactc 1261 tttctgcggc agaacactca gggcgcccag gccgagttcc tcttccgcac ggagactcaa 1321 agtgaaaagc ttcggtggat ctcagccttg gccatgccaa gagaggagtt ggaccttctg 1381 gagtgttaca actcccccca ggtacagtgc cttcgagcct acaagccccg agagaatgat 1441 gaattggcac tggagaaagc cgacgtggtg atggtgactc agcagagcag tgacggctgg 1501 ctggagggcg tgaggctctc agacggggag cgaggctggt ttcctgtgca gcaggtggag 1561 ttcatttcca acccagaggt ccgtgcacag aacctgaagg aagctcatcg agtcaagact 1621 gccaaactac agctggtgga acagcaagcc taagtcttct ctgagaggag tttcgtgagc 1681 tgaagaacaa gctgctcatg gcaagggctg gccccagaac cctgcaagag aggccttctg 1741 tggatggaga actaggcctt ctcaaagcta aggacaaaat ccagctaacc cagtccctcg 1801 gcccaggcct cctttcgtgc tttgtgcttg gtggggggga tttcgaggga ctttgcactg 1861 gactctggga acctttcatc attaaaaaaa gggggaccat tggggcctga gccaaggaac 1921 tttccttcta ctgccttata gtgcttaaac attctccgcc tccagggtgc agattcagag 1981 ctggccagag tttcagtgat agccgtatgt taaacagaat ctcacctcag tctcctggag 2041 ggagatgttt aagaggggtt aacacatcag atgggagggt cagcccggtg acctctaagg 2101 tatcttctaa cctagaaact caccataatt atggtgcaag gtcagtgtgt ctctgagatc 2161 tatgtctgtt ggtggcaatg tgagggtgat actctctcac tctaataaac ttggcacttc 2221 tccgag // LOCUS HSU02310 3421 bp mRNA PRI 17-NOV-1993 DEFINITION Human fork head domain protein (FKHR) mRNA, complete cds. ACCESSION U02310 NID g435422 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3421) AUTHORS Galili,N. TITLE Fusion of a fork head domain to PAX-3 in the solid tumor alveolar rhabdomyosarcoma JOURNAL Nature Genetics 5(3), 230-235 (1993) REFERENCE 2 (bases 1 to 3421) AUTHORS Galili,N., Davis,R.J., Fredericks,W.J., Mukhopadhyay,S., Rauscher,F.J. III, Emanuel,B.S., Rovera,G. and Barr,F.G. TITLE Direct Submission JOURNAL Submitted (01-OCT-1993) Naomi Galili, Wistar Institute, 3606 Spruce Street, Philadelphia, PA 19104 USA FEATURES Location/Qualifiers source 1..3421 /organism="Homo sapiens" /db_xref="taxon:9606" gene 7..1974 /gene="FKHR" CDS 7..1974 /gene="FKHR" /codon_start=1 /product="fork head domain protein" /db_xref="PID:g435423" /translation="MAEAPQVVEIDPDFEPLPRPRSCTWPLPRPEFSQSNSATSSPAP SGSAAANPDAAAGLPSASAAAVSADFMSNLSLLEESEDFPQAPGSVAAAVAAAAAAAA TGGLCGDFQGPEAGCLHPAPPQPPPPGPVSQHPPVPPAAAGPLAGQPRKSSSSRRNAW GNLSYADLITKAIESSAEKRLTLSQIYEWMVKSVPYFKDKGDSNSSAGWKNSIRHNLS LHSKFIRVQNEGTGKSSWWMLNPEGGKSGKSPRRRAASMDNNSKFAKSRSRAAKKKAS LQSGQEGAGDSPGSQFSKWPASPGSHSNDDFDNWSTFRPRTSSNASTISGRLSPIMTE QDDLGEGDVHSMVYPPSAAKMASTLPSLSEISNPENMENLLDNLNLLSSPTSLTVSTQ SSPGTMMQQTPCYSFAPPNTSLNSPSPNYQKYTYGQSSMSPLPQMPIQTLQDNKSSYG GMSQYNCAPGLLKELLTSDSPPHNDIMTPVDPGVAQPNSRVLGQNVMMGPNSVMSTYG SQASHNKMMNPSSHTHPGHAQQTSAVNGRPLPHTVSTMPHTSGMNRLTQVKTPVQVPL PHPMQMSALGGYSSVSSCNGYGRMGLLHQEKLPSDLDGMFIERLDCDMESIIRNDLMD GDTLDFNFDNVLPNQSFPHSVKTTTHSWVSG" BASE COUNT 852 a 859 c 805 g 905 t ORIGIN 1 gtcaccatgg ccgaggcgcc tcaggtggtg gagatcgacc cggacttcga gccgctgccc 61 cggccgcgct cgtgcacctg gccgctgccc aggccggagt ttagccagtc caactcggcc 121 acctccagcc cggcgccgtc gggcagcgcg gctgccaacc ccgacgccgc ggcgggcctg 181 ccctcggcct cggctgccgc tgtcagcgcc gacttcatga gcaacctgag cttgctggag 241 gagagcgagg acttcccgca ggcgcccggc tccgtggcgg cggcggtggc ggcggcggcc 301 gccgcggccg ccaccggggg gctgtgcggg gacttccagg gcccggaggc gggctgcctg 361 cacccagcgc caccgcagcc cccgccgccc gggcccgtgt cgcagcaccc gccggtgccc 421 cccgccgccg ctgggccgct cgcggggcag ccgcgcaaga gcagctcgtc ccgccgcaac 481 gcgtggggca acctgtccta cgccgacctc atcaccaagg ccatcgagag ctcggcggag 541 aagcggctca cgctgtcgca gatctacgag tggatggtca agagcgtgcc ctacttcaag 601 gataagggtg acagcaacag ctcggcgggc tggaagaatt caattcgtca taatctgtcc 661 ctacacagca agttcattcg tgtgcagaat gaaggaactg gaaaaagttc ttggtggatg 721 ctcaatccag agggtggcaa gagcgggaaa tctcctagga gaagagctgc atccatggac 781 aacaacagta aatttgctaa gagccgaagc cgagctgcca agaagaaagc atctctccag 841 tctggccagg agggtgctgg ggacagccct ggatcacagt tttccaaatg gcctgcaagc 901 cctggctctc acagcaatga tgactttgat aactggagta catttcgccc tcgaactagc 961 tcaaatgcta gtactattag tgggagactc tcacccatta tgaccgaaca ggatgatctt 1021 ggagaagggg atgtgcattc tatggtgtac ccgccatctg ccgcaaagat ggcctctact 1081 ttacccagtc tgtctgagat aagcaatccc gaaaacatgg aaaatctttt ggataatctc 1141 aaccttctct catcaccaac atcattaact gtttcgaccc agtcctcacc tggcaccatg 1201 atgcagcaga cgccgtgcta ctcgtttgcg ccaccaaaca ccagtttgaa ttcacccagc 1261 ccaaactacc aaaaatatac atatggccaa tccagcatga gccctttgcc ccagatgcct 1321 atacaaacac ttcaggacaa taagtcgagt tatggaggta tgagtcagta taactgtgcg 1381 cctggactct tgaaggagtt gctgacttct gactctcctc cccataatga cattatgaca 1441 ccagttgatc ctggggtagc ccagcccaac agccgggttc tgggccagaa cgtcatgatg 1501 ggccctaatt cggtcatgtc aacctatggc agccaggcat ctcataacaa aatgatgaat 1561 cccagctccc atacccaccc tggacatgct cagcagacat ctgcagtcaa cgggcgtccc 1621 ctgccccaca cggtaagcac catgccccac acctcgggta tgaaccgcct gacccaagtg 1681 aagacacctg tacaagtgcc tctgccccac cccatgcaga tgagtgccct ggggggctac 1741 tcctccgtga gcagctgcaa tggctatggc agaatgggcc ttctccacca ggagaagctc 1801 ccaagtgact tggatggcat gttcattgag cgcttagact gtgacatgga atccatcatt 1861 cggaatgacc tcatggatgg agatacattg gattttaact ttgacaatgt gttgcccaac 1921 caaagcttcc cacacagtgt caagacaacg acacatagct gggtgtcagg ctgagggtta 1981 gtgagcaggt tacacttaaa agtacttcag attgtctgac agcaggaact gagagaagca 2041 gtccaaagat gtctttcacc aactcccttt tagttttctt ggttaaaaaa aaaaacaaaa 2101 aaaaaaaccc tccttttttc ctttcgtcag acttggcagc aaagacattt ttcctgtaca 2161 ggatgtttgc ccaatgtgtg caggttatgt gctgctgtag ataaggactg tgccattgga 2221 aatttcatta caatgaagtg ccaaactcac tacaccatat aattgcagaa aagattttca 2281 gatcctggtg tgctttcaag ttttgtatat aagcagtaga tacagattgt atttgtgtgt 2341 gtttttggtt tttctaaata tccaattggt ccaaggaaag tttatactct ttttgtaata 2401 ctgtgatggg cctcatgtct tgataagtta aacttttgtt tgtactacct gttttctgcg 2461 gaactgacgg atcacaaaga actgaatctc cattctgcat ctccattgaa cagccttgga 2521 cctgttcacg ttgccacaga attcacatga gaaccaagta gcctgttatc aatctgctaa 2581 attaatggac ttgttaaact tttggaaaaa aaaagattaa atgccagctt tgtacaggtc 2641 ttttctattt ttttttgttt attttgttat ttgcaaattt gtacaaacat ttaaatggtt 2701 ctaatttcca gataaatgat ttttgatgtt attgttggga cttaagaaca tttttggaat 2761 agatattgaa ctgtaataat gttttcttaa aactagagtc tactttgtta catagtcagc 2821 ttgtaaattt tgtggaacca caggtatttg ggggcagcat tcataatttt cattttgtat 2881 tctaactgga ttagtactaa ttttatacat gcttaactgg tttgtacact ttgggatgct 2941 acttagtgat gtttctgact aatcttaaat cattgtaatt agtacttgca tattcaacgt 3001 ttcaggccct ggttgggcag gaaagtgatg tatagttatg gacactttgc gtttcttatt 3061 taggataact taatatgttt ttatgtatgt attttaaaga aatttcactg cttctctgaa 3121 ctatgcgtac tgcatagcat caagtcttct ctagagacct ctgtagtcct gggaggcctc 3181 ataatgtttg tagatacaga aagggagact gcatctaaag caatggtcct ttgtcaaacg 3241 agggattttg atccacttca ccattttgag ttgagcttta gcaaaagttt ccctcataat 3301 tctttgctct tgtttcagtc caggtggagg ttggttttgt agttctgcct tgaggaatta 3361 tgtcaacact catacttcat ctcattctcc cttctgccct gcagattaga ttacttagca 3421 c // LOCUS HSU02326 1793 bp mRNA PRI 22-JUL-1994 DEFINITION Human clone ndf43 neu differentiation factor mRNA, complete cds. ACCESSION U02326 NID g408402 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1793) AUTHORS Wen,D., Suggs,S.V., Karunagaran,D., Liu,N., Cupples,R.L., Luo,Y., Janssen,A.M., Ben-Baruch,N., Trollinger,D.B., Jacobsen,V.L., Meng,S.-Y., Lu,H.S., Hu,S., Chang,D., Yang,W., Yanigahara,D., Koski,R.A. and Yarden,Y. TITLE Structural and functional aspects of the multiplicity of Neu differentiation factors JOURNAL Mol. Cell. Biol. 14, 1909-1919 (1994) MEDLINE 94158863 REFERENCE 2 (bases 1 to 1793) AUTHORS Janssen,A.M. TITLE Direct Submission JOURNAL Submitted (02-OCT-1993) Ann M. Janssen, Developmental Biology, Amgen, Inc., Amgen Center, Thousand Oaks, CA 91320, USA FEATURES Location/Qualifiers source 1..1793 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="human proNDF-alpha2b, #43" /cell_line="A-704 cells (ATCC HTB 45)" CDS 109..1497 /codon_start=1 /product="neu differentiation factor" /db_xref="PID:g408403" /translation="MSERKEGRGKGKGKKKERGSGKKPESAAGSQSPALPPRLKEMKS QESAAGSKLVLRCETSSEYSSLRFKWFKNGNELNRKNKPQNIKIQKKPGKSELRINKA SLADSGEYMCKVISKLGNDSASANITIVESNEIITGMPASTEGAYVSSESPIRISVST EGANTSSSTSTSTTGTSHLVKCAEKEKTFCVNGGECFMVKDLSNPSRYLCKCQPGFTG ARCTENVPMKVQNQEKAEELYQKRVLTITGICIALLVVGIMCVVAYCKTKKQRKKLHD RLRQSLRSERNNMMNIANGPHHPNPPPENVQLVNQYVSKNVISSEHIVEREAETSFST SHYTSTAHHSTTVTQTPSHSWSNGHTESILSESHSVIVMSSVENSRHSSPTGGPRGRL NGTGGPRECNSFLRHARETPDSYRDSPHSERHNLIAELRRNKAHRSKCMQIQLSATHL RSSSIPHLGFIL" BASE COUNT 517 a 504 c 413 g 359 t ORIGIN 1 tcgctctccc catcgaggga caaacttttc ccaaacccga tccgagccct tggaccaaac 61 tcgcctgcgc cgagagccgt ccgcgtagag cgctccgtct ccggcgagat gtccgagcgc 121 aaagaaggca gaggcaaagg gaagggcaag aagaaggagc gaggctccgg caagaagccg 181 gagtccgcgg cgggcagcca gagcccagcc ttgcctcccc gattgaaaga gatgaaaagc 241 caggaatcgg ctgcaggttc caaactagtc cttcggtgtg aaaccagttc tgaatactcc 301 tctctcagat tcaagtggtt caagaatggg aatgaattga atcgaaaaaa caaaccacaa 361 aatatcaaga tacaaaaaaa gccagggaag tcagaacttc gcattaacaa agcatcactg 421 gctgattctg gagagtatat gtgcaaagtg atcagcaaat taggaaatga cagtgcctct 481 gccaatatca ccatcgtgga atcaaacgag atcatcactg gtatgccagc ctcaactgaa 541 ggagcatatg tgtcttcaga gtctcccatt agaatatcag tatccacaga aggagcaaat 601 acttcttcat ctacatctac atccaccact gggacaagcc atcttgtaaa atgtgcggag 661 aaggagaaaa ctttctgtgt gaatggaggg gagtgcttca tggtgaaaga cctttcaaac 721 ccctcgagat acttgtgcaa gtgccaacct ggattcactg gagcaagatg tactgagaat 781 gtgcccatga aagtccaaaa ccaagaaaag gcggaggagc tgtaccagaa gagagtgctg 841 accataaccg gcatctgcat cgccctcctt gtggtcggca tcatgtgtgt ggtggcctac 901 tgcaaaacca agaaacagcg gaaaaagctg catgaccgtc ttcggcagag ccttcggtct 961 gaacgaaaca atatgatgaa cattgccaat gggcctcacc atcctaaccc accccccgag 1021 aatgtccagc tggtgaatca atacgtatct aaaaacgtca tctccagtga gcatattgtt 1081 gagagagaag cagagacatc cttttccacc agtcactata cttccacagc ccatcactcc 1141 actactgtca cccagactcc tagccacagc tggagcaacg gacacactga aagcatcctt 1201 tccgaaagcc actctgtaat cgtgatgtca tccgtagaaa acagtaggca cagcagccca 1261 actgggggcc caagaggacg tcttaatggc acaggaggcc ctcgtgaatg taacagcttc 1321 ctcaggcatg ccagagaaac ccctgattcc taccgagact ctcctcatag tgaaagacat 1381 aaccttatag ctgagctaag gagaaacaag gcacacagat ccaaatgcat gcagatccag 1441 ctatcagcaa ctcatcttag atcttcttcc attccccatt tgggcttcat tctctaagac 1501 cccttggcct ttaggaaggt atgtgtcagc catgaccacc ccggctcgta tgtcacctgt 1561 agatttccac acgccaagct cccccaaatc gcccccttcg gaaatgtctc cacccgtgtc 1621 cagcatgacg gtgtccatgc cttccatggc ggtcagcccc ttcatggaag aagagagacc 1681 tctacttctc gtgacaccac caaggctgcg ggagaagaag tttgaccatc accctcagca 1741 gttcagctcc ttccaccaca accccacgcg cccacgcgtc cgcggacgcg tgg // LOCUS HSU02388 2368 bp mRNA PRI 15-OCT-1993 DEFINITION Human cytochrome P450 4F2 (CYP4F2) mRNA, complete cds. ACCESSION U02388 NID g408450 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2368) AUTHORS Chen,L. and Hardwick,J.P. TITLE The human liver CYP4F2 cDNA sequence and expression in Baculovirus-infected insect cells JOURNAL Unpublished REFERENCE 2 (bases 1 to 2368) AUTHORS Chen,L. and Hardwick,J.P. TITLE Identification of a new P450 subfamily, CYP4F1, expressed in rat hepatic tumors JOURNAL Arch. Biochem. Biophys. 300 (1), 18-23 (1993) MEDLINE 93143312 REFERENCE 3 (bases 1 to 2368) AUTHORS Kikuta,Y., Kusunose,E., Endo,K., Yamamoto,S., Sogawa,K., Fujii-Kuriyama,Y. and Kusunose,M. TITLE A novel form of cytochrome P-450 family 4 in human polymorphonuclear leukocytes: cDNA cloning and expression of leukotriene B4 omega-hydroxylase, D12620 and D12621 JOURNAL Journal of Biochemistry 268, 9376-9380 (1993) REFERENCE 4 (bases 1 to 2368) AUTHORS Hardwick,J.P. TITLE Direct Submission JOURNAL Submitted (05-OCT-1993) James P. Hardwick, Northeastern Ohio Universities College of Medicine, 4209 State Rt. 44, Rootstown, OH 44272, USA FEATURES Location/Qualifiers source 1..2368 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="C13" /clone_lib="Uni-zap CFHL" /sex="male" /cell_type="hepatocytes" 5'UTR 1..59 gene 60..1616 /gene="CYP4F2" CDS 60..1616 /gene="CYP4F2" /note="expressed in hepatic tumors" /codon_start=1 /function="fatty acid hydroxylase; eicosanoid metabolism" /product="cytochrome P450 4F2" /db_xref="PID:g408451" /translation="MSQLSLSWLGLCDVAASPWLLLLLVGASWLLAHVLAWTYAFYDN CRRLRCFPQPPRRNWFWGHQGMVNPTEEGMRVLTQLVATYPQGFKVWMGPISPLLSLC HPDIIRSVINASAAIAPKDKFFYSFLEPWLGDGLLLSAGDKWSRHRRMLTPAFHFNIL KPYMKIFNESVNIMHAKWQLLASEGSACLDMFEHISLMTLDSLQKCVFSFDSHCQEKP SEYIAAILELSALVSKRHHEILLHIDFLYYLTPDGQRFRRACRLVHDFTDAVIQERRR TLPSQGVDDFLQAKAKSKTLDFIDVLLLSKDEDGKKLSDEDIRAEADTFMFEGHDTTA SVSPGSCTTLQSTQNTRSVCRQEVQELLKDREPKEIEWDDLAHLPFLTMCMKESLRCI PPVPVISRHVTQDIVLPDGRVIPKGIICLISVFGTHHNPAVWPDPEVYDPFRFDPENI KERSPLAFIPFSAGPRNCIGQTFAMAEMKVVLALTLLAFRVLPDHTEPRRSRSWSCAQ RADFGCGWSP" misc_feature 1008..1056 /gene="CYP4F2" /note="family 4 concensus sequence in the translated peptide" misc_feature 1461..1463 /gene="CYP4F2" /note="heme-binding cysteine in the translated peptide" 3'UTR 1617..2368 BASE COUNT 568 a 648 c 596 g 556 t ORIGIN 1 ctcggatcag tctggcagag agaggaggtt gtctgggaca gactgctcct gacagaagga 61 tgtcccagct gagcctgtcc tggctgggcc tctgcgacgt ggcagcatcc ccttggctgc 121 tcctcctgct ggtcggggcc tcctggctcc tggcccatgt cctggcctgg acctacgcct 181 tctatgacaa ctgccgccgc cttcggtgtt tcccacaacc cccaagacgg aactggtttt 241 ggggacacca gggcatggtc aaccccacag aggagggcat gagagttctg actcagctgg 301 tggccaccta cccccagggc tttaaggtct ggatgggacc catctccccc ctcctcagtt 361 tgtgccaccc cgacatcatc cggtctgtca tcaacgcctc agctgccatt gcaccaaagg 421 acaagttctt ctacagcttc ctggagccct ggctggggga tgggctcctg ctgagtgctg 481 gtgacaagtg gagccgccac cgtcggatgc tgacgcctgc cttccatttc aacatcctga 541 agccctatat gaagattttc aatgagagtg tgaacatcat gcacgccaag tggcagctcc 601 tggcctcaga gggtagtgcc tgtttggata tgtttgagca catcagcctc atgaccttgg 661 acagtctaca gaaatgtgtc ttcagctttg acagccattg tcaggagaaa cccagtgaat 721 atattgccgc catcttggag ctcagtgccc ttgtatcaaa aagacaccat gagatcctcc 781 tgcatattga cttcctgtat tatctcaccc ctgatgggca gcgtttccgc agggcctgcc 841 gcctggtgca cgacttcaca gatgccgtca tccaggagcg gcgccgcact ctccctagcc 901 agggtgttga tgacttcctc caagccaagg ccaaatccaa gactttggac ttcattgatg 961 tactcctgct gagcaaggat gaagacggga agaagttatc tgatgaggac ataagagcag 1021 aagctgacac ctttatgttt gagggccatg acaccacggc cagtgtctct cctgggtcct 1081 gtaccacctt gcaaagcacc cagaatacca ggagcgtctg ccggcaggag gtgcaagaac 1141 ttctgaagga ccgtgagcct aaagagattg aatgggacga cctggcccat ttgcccttcc 1201 tgaccatgtg catgaaggag agcctgcgct gcatcccccc agtcccggtc atctcccgcc 1261 atgtcaccca ggacattgtg ctcccagacg gccgggtcat ccccaaaggc attatctgcc 1321 tcatcagtgt tttcggaacc catcacaacc cagctgtgtg gccggaccct gaggtctacg 1381 acccctttcg ctttgaccca gagaacatca aggagaggtc acctctggct tttattccct 1441 tctcggcagg gcccaggaac tgcatcgggc agacgttcgc gatggcggag atgaaggtgg 1501 tcctggcgct cacgctgctg gccttccgcg tcctgcctga ccacaccgag ccccgcagaa 1561 gccggagctg gtcctgcgcg cagagggcgg actttggctg cgggtggagc ccctgagctg 1621 agttctgcag agacccactc tgaccccact aaaatgaccc ctgattcatc aaaagtgaag 1681 cctagaatta ccctaagacc ctgttccaca gtcctgtatt ccatcctaga tatctactca 1741 aaataattga gacaagtgtt caaacagaaa gacgcttgtg cgtgaatgtt catggcggcc 1801 ctattcacag tagccaaacg atgaaaacaa ccccaagcta tatattacca gatgaaagga 1861 taaacaaaat gtggtccatc catacaatgg agtattacac agccataaaa aggaatgaag 1921 cagtgatccc tactacactg tggatgaacc ttgaatgcat gatactgaat gaaagacgtc 1981 agatgcaaaa ggtcacatag tgtactgtcc ttttatacga aatttccaga acaggccaat 2041 ctgaagagat gcatagcgga ttggtggctt tcagcagctg tggggaggtg ggactgagga 2101 gcgactgcta atcagtatgg ggtttcctcc cgggatggtg aaaatgttcc ggacctagat 2161 actgacgaag gtagcacgac actgtgagtg cactaaatgc tattgaattg gacactttga 2221 aatggtgaat ttcgtggtat gtgaattcta cctcaatcaa aaaaatttgc tattttatct 2281 cacatacatt ttttttctgt ccaggttgtt catataataa tatgctgtga gcatctttcc 2341 atgacattaa atcatcttag gaaacatt // LOCUS HSU02390 1517 bp mRNA PRI 10-AUG-1994 DEFINITION Human adenylyl cyclase-associated protein homolog CAP2 (CAP2) mRNA, complete cds. ACCESSION U02390 NID g409928 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1517) AUTHORS Yu,G., Swiston,J. and Young,D. TITLE Comparison of human CAP and CAP2, homologs of the yeast adenylyl cyclase- associated proteins JOURNAL J. Cell Sci. 107, 1671-1678 (1994) MEDLINE 95051124 REFERENCE 2 (bases 1 to 1517) AUTHORS Young,D. TITLE Direct Submission JOURNAL Submitted (05-OCT-1993) Dallan Young, Medical Biochemistry, University of Calgary, Calgary, Alberta T2N 4N1, Canada FEATURES Location/Qualifiers source 1..1517 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="U188-MG" /cell_type="glioblastoma" gene 84..1517 /gene="CAP2" CDS 84..1517 /gene="CAP2" /codon_start=1 /product="CAP2" /db_xref="PID:g409929" /translation="MANMQGLVERLERAVSRLESLSAESHRPPGNCGEVNGVIAGVAP SVEAFDKLMDSMVAEFLKNSRILAGDVETHAEMVHSAFQAQRAFLLMASQYQQPHEND VAALLKPISEKIQEIQTFRERNRGSNMFNHLSAVSESIPALGWIAVSPKPGPYVKEMN DAATFYTNRVLKDYKHSDLRHVDWVKSYLNIWSELQAYIKEHHTTGLTWSKTGPVAST VSAFSVLSSGPGLPPPPPPLPPPGPPPLFENEGKKEESSPSRSALFAQLNQGEAITKG LRHVTDDQKTYKNPSLRAQGGQTQSPTKSHTPSPTSPKSYPSQKHAPVLELEGKKWRV EYQEDRNDLVISETELKQVAYIFKCEKSTIQIKGKVNSIIIDNCKKLGLVFDNVVGIV EVINSQDIQIQVMGRVPTISINKTEGCHIYLSEDALDCEIVSAKSSEMNILIPQDGDY REFPIPEQFKTAWDGSKLITEPAEIMA" BASE COUNT 438 a 370 c 370 g 339 t ORIGIN 1 attctttggg gaggcaacta ggatggtgtg gccgaccacg gatttgcatt gccgaggacg 61 ggaccccagg gcagcgaagc agaatggcca acatgcaggg actggtggaa agactggaac 121 gagctgtcag ccgcctggag tcgctgtctg cagagtccca caggccccct gggaactgcg 181 gggaagtcaa tggtgtcatt gcaggtgtgg caccctccgt ggaagccttt gacaagctga 241 tggacagtat ggtggccgag tttttaaaga acagtaggat ccttgctggg gacgtggaga 301 cccatgcaga aatggtgcac agtgctttcc aggcccagcg ggctttcctt ctgatggcct 361 ctcagtacca acaaccccac gagaatgacg tggccgcact tctgaaaccc atatcggaaa 421 agattcagga aatccaaact ttcagagaga gaaaccgggg gagtaacatg tttaatcatc 481 tttcggccgt cagcgaaagc atccctgccc ttggatggat agctgtgtct cccaaacctg 541 gtccttatgt caaggagatg aatgacgctg ccacctttta cactaacagg gtcttaaagg 601 actacaaaca cagtgatttg cgtcatgtgg attgggtgaa gtcatatttg aacatttgga 661 gtgaacttca agcatacatc aaggaacacc acaccacggg cctcacatgg agcaaaacag 721 gtcctgtagc atccacagta tcagcgtttt ctgtcctctc ctctgggcct ggccttcctc 781 caccccctcc tcctctgcct cctccagggc cacctccact tttcgagaat gaaggcaaaa 841 aagaggaatc ttctccttca cgctcagctt tatttgccca acttaaccag ggagaagcaa 901 ttacaaaagg gctccgccat gtcacagatg accagaagac atacaaaaat cccagcctgc 961 gggctcaagg agggcaaact caatctccca ccaaaagtca cactccaagt cccacatctc 1021 ctaaatctta tccttctcaa aaacatgccc cagtgttgga gttggaagga aagaaatgga 1081 gagtggagta ccaagaggac aggaatgacc ttgtgatttc agagactgag ctgaaacaag 1141 tggcttacat tttcaaatgc gaaaaatcaa ctattcagat aaaagggaaa gtaaactcca 1201 ttataattga caactgtaag aaactcggcc tggtgtttga caatgtggtg ggcattgtgg 1261 aagtgatcaa ctcccaggac attcaaatcc aggtaatggg gagagtgcca acaatttcca 1321 ttaataagac agaaggttgc cacatatacc tcagtgaaga tgcattagac tgtgagatcg 1381 tgagcgccaa gtcatctgaa atgaacatac ttatccctca ggatggtgat tatagagaat 1441 ttcccattcc tgaacagttc aagacagcat gggatggatc caagttaatc actgaacctg 1501 cagaaattat ggcctaa // LOCUS HSU02478 4839 bp mRNA PRI 11-JAN-1994 DEFINITION Human AF-6 mRNA, complete cds. ACCESSION U02478 NID g430993 KEYWORDS AF-6, ALL-1, chromosome translocation, acute leukemia, gene fusion. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4839) AUTHORS Prasad,R., Gu,Y., Alder,H., Nakamura,T., Canaani,O., Saito,H., Huebner,K., Gale,R.P., Nowell,P.C., Kuriyama,K., Miyazaki,Y., Croce,C.M. and Canaani,E. TITLE Cloning of the ALL-1 fusion partner, the AF-6 gene, involved in acute myeloid leukemias with the t(6;11) chromosome translocation JOURNAL Cancer Res. 53 (23), 5624-5628 (1993) MEDLINE 94061833 REFERENCE 2 (bases 1 to 4839) AUTHORS Gu,Y. TITLE Direct Submission JOURNAL Submitted (08-OCT-1993) Yansong Gu, Jefferson Cancer Institute, Thomas Jefferson University, BLSB/Rm 608, 233 South 10th Street, Philadelphia, PA 19107, U.S.A. FEATURES Location/Qualifiers source 1..4839 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="K10, K12, K26, K28" /clone_lib="cDNA library" /chromosome="6" /map="6q27" /cell_line="Kcl22" /cell_type="chronic myeloid leukemia" gene 1..4839 /gene="AF-6" CDS 1..4839 /gene="AF-6" /note="ALL-1 fusion partner from chromosome 6" /codon_start=1 /db_xref="PID:g430994" /translation="MSAGGRDEERRKLADIIHHWNANRLDLFEISQPTEDLEFHGVMR FYFQDKAAGNFATKCIRVSSTATTQDVIETLAEKFRPDMRMLSSPKYSLYEVHVSGER RLDIDEKPLVVQLNWNKDDREGRFVLKNENDAIPPKAQSNGPEKQEKEGVIQNFKRTL SKKEKKEKKKREKEALRQASDKDDRPFQGEDVENSRLAAEVYKDMPETSFTRTISNPE VVMKRRRQQKLEKRMQEFRSSDGRPDSGGTLRIYADSLKPNIPYKTILLSTTDPADFA VAEALEKYGLEKENPKDYCIARVMLPPGAQHSDEKGAKEIILDDDECPLQIFREWPSD KGILVFQLKRRPPDHIPKKTKKHLEGKTPKGKERADGSVYGSTLPPEKLPYLVELSPD GSDSRDKPKLYRLQLSVTEVGTEKLDDNSIQLFGPGIQPHHCDLTNMDGVVTVTPRSM DAETYVEGQRISETTMLQSGMKVQFGASHVFKFVDPSQDHALAKRSVDGGLMVKGPRH KPGIVQETTFDLGGDIHSGTALPTSKSTTRLDSDRVSSASSTAERGMVKPMIRVEQQP DYRRQESRTQDASGPELILPASIEFRESSEDSFLSAIINYTNSSTVHFKLSPTYVLYM ACRYVLSNQYRPDISPTERTHKVIAVVNKMVSMMEGVIQKQKNIAGALAFWMANASEL LNFIKQDRDLSRITLDAQDVLAHLVQMAFKYLVHCLQSELNNYMPAFLDDPEENSLQR PKIDDVLHTLTGAMSLLRRCRVNAALTIQLFSQLFHFINMWLFNRLVTDPDSGLCSHY WGAIIRQQLGHIEAWAEKQGLELAADCHLSRIVQATTLLTMDKYAPDDIPNINSTCFK LNSLQLQALLQNYHCAPDEPFIPTDLIENVVTVAENTADELARSDGREVQLEEDPDLQ LPFLLPEDGYSCDVVRNIPNGLQEFLDPLCQRGFCRLIPHTRSPGTWTIYFEGADYES HLLRENTELAQPLRKEPEIITVTLKKQNGMGLSIVAAKGAGQDKLGIYVKSVVKGGAA DVDGRLAAGDQLLSVDGRSLVGLSQERAAELMTRTSSVVTLEVAKQGAIYHGLATLLN QPSPMMQRISDRRGSGKPRPKSEGFELYNNSTQNGSPESPQLPWAEYSEPKKLPGDDR LMKNRADHRSSPNVANQPPSPGGKSAYASGTTAKITSVSTGNLCTEEQTPPPRPEAYP IPTQTYTREYFTFPASKSQDRMAPPQNQWPNYEEKPHMHTDSNHSSIAIQRVTRSQEE LREDKAYQLERHRIEAAMDRKSDSDMWINQSSSLDSSTSSQEHLNHSSKSVTPASTLT KSGPGRWKTPAAIPATPVAVSQPIRTDLPPPPPPPPVHYAGDFDGMSMDLPLPPPPSA NQIGLPSAQVAAAERRKREEHQRWYEKEKAPLEEERERKRREQERKLGQMRTQSLNPA PFSPLTAQQMKPEKPSTLQRPQETVIRELQPQQQPRTIERRDLQYITVSKEELSSGDS LSPDPWKRDAKEKLEKQQQMHIVDMLSKEIQELQSKPDRSAEESDRLRKLMLEWQFQK RLQESKQKDEDDEEEEDDDVDTMLIMQRLEAERRARVKGGVLWLCPSVVPILASACFP WG" BASE COUNT 1388 a 1139 c 1246 g 1066 t ORIGIN 1 atgtcggcgg gcggccgtga cgaggagcgg cggaagctgg ccgacatcat ccaccactgg 61 aacgccaacc ggctggacct gttcgagatc agccagccga ccgaggattt ggagttccat 121 ggagtgatga gattttattt tcaagataaa gctgctggaa actttgcaac aaaatgtatt 181 cgggtctcta gtactgccac cactcaagat gtaatcgaaa cgctcgcgga gaaatttcga 241 cctgatatgc gaatgctgtc ctctcccaag tattcactct atgaagtgca tgtcagcgga 301 gaaagaagat tggatataga tgagaaacct ctagttgtac aactgaattg gaacaaagat 361 gatcgggaag gcagatttgt tcttaagaat gagaatgacg ccattcctcc taaggctcaa 421 agtaatggac ctgaaaagca ggaaaaagaa ggggttatcc agaacttcaa gagaactctc 481 tcaaagaaag aaaagaagga aaaaaagaag agagaaaaag aggcattgcg acaggcatct 541 gataaagatg atagaccttt ccaaggggag gatgttgaaa attctcgact ggctgctgag 601 gtttacaaag acatgccgga aaccagcttt actcgaacca tttctaatcc tgaggtggtt 661 atgaaacgac ggaggcagca aaaattggaa aagagaatgc aggaatttcg gagctcagat 721 gggcggcctg attcaggtgg aacattgaga atttatgcag atagtttaaa accaaatatt 781 ccctacaaga caatcctgct gtctactaca gatcctgcag actttgctgt ggctgaagct 841 ttagagaagt atggtctgga aaaagaaaac cctaaggatt actgcatcgc ccgggttatg 901 cttcctcctg gagcccagca ttctgatgaa aagggtgcta aagaaattat tcttgatgat 961 gatgagtgtc ctttacaaat cttcagggaa tggccaagtg acaaagggat tttagtcttt 1021 cagttgaaga ggaggccacc agaccacatc ccaaagaaaa ccaagaaaca cttggaaggc 1081 aagacaccca agggaaagga gagagctgac gggtctgtct atggctccac ccttcctccg 1141 gagaagctgc cctatttagt agagttaagc ccagatggtt ctgactctag agataagcca 1201 aagctttacc gccttcagtt aagtgttact gaagttggga cagaaaagtt ggatgacaac 1261 tctatccagt tgtttggccc aggaattcag ccccatcact gtgaccttac caacatggat 1321 ggagtggtca ctgtgacgcc cagaagtatg gacgcagaaa cctacgtgga aggccagcgc 1381 atctcagaaa ccaccatgct gcagagtggc atgaaagtgc agtttggggc gtcccatgta 1441 tttaagtttg tggaccccag tcaggatcat gctcttgcaa aaagatctgt ggatggaggc 1501 ctgatggtta agggcccaag acataaacct ggaattgttc aggagacaac ttttgatttg 1561 ggaggagata ttcatagtgg gacagcatta ccgacaagca agagcaccac taggctggac 1621 agcgacagag tgtcgtctgc ctctagcaca gccgagcggg gaatggtgaa gccgatgatc 1681 agagtagaac agcagccaga ttatcgcagg caagaaagca gaacacagga tgcttctggg 1741 cctgagctga tactacctgc aagcattgaa ttcagggaaa gttctgaaga ttcatttttg 1801 tctgccatta taaattatac taatagctct acagtccact ttaagttgtc ccctacatat 1861 gtattatata tggcatgccg gtatgtattg tccaaccagt acagacctga catcagccct 1921 acagagcgca cacataaagt cattgcagtc gtcaacaaga tggtgagcat gatggagggt 1981 gtcatccaga aacagaagaa tattgcaggg gcacttgcct tctggatggc aaatgcatct 2041 gaacttctca acttcattaa gcaagaccga gaccttagtc ggatcacact ggatgctcaa 2101 gatgttttag cacatttggt tcaaatggca tttaaatact tggttcactg tcttcaatca 2161 gaacttaata attacatgcc agcctttcta gatgaccctg aagagaacag tctgcaacga 2221 ccaaaaatag atgatgtgct gcacacgctc acaggagcca tgtccttgct acgacgctgc 2281 agagtcaatg ccgccctgac catccagctc ttctctcagc tcttccactt catcaatatg 2341 tggctgttca atagattggt gaccgaccca gattcggggc tgtgctccca ttactggggt 2401 gcgattatcc gtcagcagtt gggccatatt gaagcctggg ctgagaagca ggggctggaa 2461 ctggctgcgg actgtcatct gagcaggatc gtgcaggcaa cgactttgct taccatggat 2521 aagtatgcac ctgatgacat tccaaatata aacagcacct gctttaagtt aaattcatta 2581 caacttcaag ccttattaca gaactatcac tgtgcacctg atgagccttt tatcccaacg 2641 gatcttatag aaaatgtagt gactgtggct gaaaacactg ccgatgagct ggcccgcagt 2701 gatggaaggg aagtgcagtt ggaggaggat cctgatctgc agctgccgtt tcttttgcca 2761 gaagatggtt attcttgtga tgttgtcaga aacattccaa atggtttaca agaattttta 2821 gaccctctgt gccagagagg attttgcagg ttaattcctc acacacgttc accaggtact 2881 tggacaatat attttgaagg tgcagattat gaaagtcacc ttctgcgtga gaacacagag 2941 ctggctcagc ctctgaggaa agaacctgaa ataatcactg tgaccctaaa aaagcagaat 3001 ggaatgggcc ttagcattgt tgcagcaaag ggtgctggtc aagataaact aggaatctac 3061 gtgaagtcgg ttgtgaaagg aggtgctgca gatgtggatg gacgtctagc tgcaggtgat 3121 cagctcctca gtgtggatgg acgaagtctg gttggactct ctcaggaaag ggcggcagaa 3181 ctcatgacaa gaacaagctc tgtggtgaca ctggaagtag caaagcaggg tgccatctac 3241 cacggtctgg ccacccttct caatcagcca tcccccatga tgcagagaat ttcagatcgt 3301 cgtggctcag gtaaaccccg accaaagagt gaaggctttg agctctataa taattcaact 3361 caaaatgggt ctcctgagag tcctcagctg ccttgggcag aatatagtga accaaagaaa 3421 ttgcctggtg atgacagact gatgaaaaat agagctgatc accgttccag ccccaacgta 3481 gcaaatcagc ctcctagtcc tggagggaaa agtgcatatg cctctggaac aacagcgaag 3541 ataacatctg tctctactgg aaacctctgc actgaggagc agacgcctcc gcctagacct 3601 gaagcctacc ccattcccac tcagacgtac accagagagt attttacctt cccagcttcc 3661 aaatcccagg atcggatggc tcctcctcag aaccagtggc caaattatga ggaaaagcca 3721 catatgcaca cagatagtaa tcattccagt attgcaattc agcgtgttac acgttcccaa 3781 gaagaacttc gagaagataa agcttaccaa cttgagcggc atcgaataga ggcagctatg 3841 gaccgaaagt ctgatagtga tatgtggata aatcagagct cctcactgga ctccagtacc 3901 tctagccagg agcatctgaa ccattcctct aagtcggtca cccctgcttc cacactgacc 3961 aaaagtggcc ctggccgttg gaaaacacca gcagccatac cggccacccc tgtggccgtc 4021 tcccagccaa tccgaacaga cctgcctccg ccacccccgc cacctccagt ccactatgcc 4081 ggtgatttcg atggaatgtc catggatttg cctctcccac cacccccttc cgccaaccag 4141 atagggctgc cgtctgcgca ggtggctgct gctgaacgga gaaagagaga agaacatcag 4201 cgttggtatg agaaggagaa ggcccccctg gaggaggagc gggagaggaa gcggagagag 4261 caggagagga agttgggcca gatgcgcact cagtccttaa accctgctcc gttttctccc 4321 ctgactgcac agcagatgaa gcccgaaaag ccttccacac tccagcggcc acaggaaaca 4381 gtcattcggg agctgcagcc tcagcagcag ccccgcacga tcgagcgcag agacttgcag 4441 tacattacag tcagcaaaga ggagctttcc tcgggggaca gtctgtcccc cgacccgtgg 4501 aagcgggacg ccaaggagaa gctggagaag cagcagcaga tgcacatcgt ggacatgctg 4561 agcaaggaga tccaggagct ccagagcaaa ccggaccgca gcgccgagga gagcgaccgg 4621 ctgcgcaagc tcatgctgga gtggcagttc cagaagagac tccaggagtc gaagcagaag 4681 gacgaagatg acgaggagga ggaggacgat gatgtggaca ccatgctgat catgcagcgc 4741 ctggaggctg aacgaagagc gagggtaaag gggggagtgc tttggctgtg cccatctgtg 4801 gtccctattt tagcttctgc gtgtttccca tggggatag // LOCUS HSU02556 2156 bp mRNA PRI 23-DEC-1994 DEFINITION Human RP3 mRNA, complete cds. ACCESSION U02556 NID g413824 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2156) AUTHORS Roux,A.F., Rommens,J., McDowell,C., Anson-Cartwright,L., Bell,S., Schappert,K., Fishman,G.A. and Musarella,M. TITLE Identification of a gene from Xp21 with similarity to the tctex-1 gene of the murine t complex JOURNAL Hum. Mol. Genet. 3 (2), 257-263 (1994) MEDLINE 94272463 REFERENCE 2 (bases 1 to 2156) AUTHORS Musarella,M.A. TITLE Direct Submission JOURNAL Submitted (15-OCT-1993) Maria A. Musarella, Department of Genetics, Hospital For Sick Children, 555 University Avenue, Toronto, Ontario M5G1X8, Canada FEATURES Location/Qualifiers source 1..2156 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="CDNA91/23" /chromosome="X" /map="Xp21" CDS 69..419 /note="RP3 candidate gene" /codon_start=1 /db_xref="PID:g413825" /translation="MEEYHRHCDEVGFNAEEAHNIVKECVDGVLGGEDYNHNNINQWT ASIVEQSLTHLVKLGKAYKYIVTCAVVQKSAYGFHTASSCFWDTTSDGTCTVRWENRT MNCIVNVFAIAIVL" BASE COUNT 733 a 331 c 398 g 694 t ORIGIN 1 ggaggccagt gcgcggccgc ggtgctctac cggcgtgtcg ctccgcccca gggagagccg 61 gcgctaccat ggaggagtac catcgccact gcgacgaggt tggcttcaat gctgaggaag 121 cccacaatat tgtcaaagag tgtgtagatg gggttttagg tggtgaagat tataatcaca 181 acaacatcaa ccagtggact gcaagcatag tggaacaatc cttaacacac ctggttaagt 241 tgggaaaagc ctataaatat attgtgacct gtgcagtggt ccagaagagc gcatatggct 301 ttcacacagc cagctcctgt ttttgggata ccacatctga tggaacctgt accgtaagat 361 gggagaaccg gaccatgaac tgtattgtca acgtttttgc cattgctatt gttctttaac 421 tgactaaaaa tgttgggcta aagccattaa cttaagaatt tgtcagtgta tcctttccaa 481 aaagagtaat agttgtttac tagtgtgcta gatgaaaagc gtgcaatatg ctttaaagct 541 atcaacaaaa actgaatatt ataagcaagc aatatcatag taattggcag attagctcat 601 attctataca gcatcgttta aataggaaaa atttaatgct agcaaaaaat aaatttagaa 661 tatggcatga catgaaaata caatcttata tttacaccag cttttcacta atattttgta 721 cctaaggtga tggggaactc cattcagata ataaaattct ctttcagcta gagaagttaa 781 caggaataaa tatatgaaca aaaaagctgc aaggataaat gtggagaaaa tgatgagaat 841 tagctaacat ttttaagttt ttttaaactt tcttcccctc agttgtactt aatatttagt 901 ggaaagtaat aattttttta ttttctatca actaatagta tagtaacaac tatgattaac 961 ttgtttactt tttctgagga ttagtaaatc aatttttttt aatttcaaat tttggattta 1021 cacttgaggg taaattaaat ctggtaaact gaatttccta gttaaataaa attagttgca 1081 gtatatgatg aacagtgtat gactcaaaca gctgccttac aattcactca ttccatgtgg 1141 aacaaacatt tatcagatgc ctattatggg catatgtctc tgctaagcac catagttgtc 1201 aatgtgctgt gcaaatgcta agttcctttt agcaattgtt cagttggaag acgtattaat 1261 atttggggaa ggaaaagaaa gtagttgttt tacaagggag gaaaaaagtg aatctggtta 1321 cacatatgga agtaagcaaa atgaaaagca cttattgctt tctgacagaa ttatagatgt 1381 aattttaaga gttgctccta gcaagttaaa agtgcatata aaatatgcaa ctcttagtta 1441 aaggccttat tatcagtctt acctatacaa gtagtaaatt ttgtcattgc tttagttaca 1501 accatctgta aataacttaa aagacttatt atgtggggtt caaattgagt ggaataaagt 1561 atagattaaa agtatacaat ccttagcacg ttatctcagg gcttatgaaa tgtaattaaa 1621 tttattaaga aaatagatga aaaattaggg tacacagctg gccaccaaat gcgaagtcaa 1681 tctgctactt aaccctgaaa acaaaatcag ttttgcatat taccactaac actaatacat 1741 atagagagcg gaaccataac tcattgaatt ttggagagga ataagcttag cgttaatatt 1801 gacaatatta aggcaatatt cttgtaggaa tactatgtgc atgtttgata ttttgccaaa 1861 taacaataat taataattgt tcaatgttta agaataatat taacaaaata aaggagttta 1921 atgcagtgat ctttgttttt ggcacatcaa aaattctcag tcattattca tgtttctttt 1981 atgctgctgg cttttgtgcc ctggaagatc ataatagtga ccaaaatata catgcagact 2041 tgttttttat tattgttgtt taagcataat ttaagaaaaa aaatttttac ctggtgaact 2101 tgctatctgc tctgtttcta gttaaaatat aataaatatt atcttcctgt gctgta // LOCUS HSU02569 1902 bp mRNA PRI 30-MAR-1996 DEFINITION Human alpha1C adrenergic receptor mRNA, complete cds. ACCESSION U02569 NID g409028 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1902) AUTHORS Tseng-Crank,J., Kost,T., Goetz,A., Hazum,S., Roberson,K.M., Haizlip,J., Godinot,N., Robertson,C.N. and Saussy,D. TITLE The alpha 1C-adrenoceptor in human prostate: cloning, functional expression, and localization to specific prostatic cell types JOURNAL Br. J. Pharmacol. 115 (8), 1475-1485 (1995) MEDLINE 96031069 REFERENCE 2 (bases 1 to 1902) AUTHORS Tseng-Crank,J.C. TITLE Direct Submission JOURNAL Submitted (18-OCT-1993) Julie C.L. Tseng-Crank, Molecular Biology, Glaxo Research Institute, 5 Moore Drive, Research Triangle Park, NC 27709, USA FEATURES Location/Qualifiers source 1..1902 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" CDS 425..1825 /codon_start=1 /product="alpha1C adrenergic receptor" /db_xref="PID:g409029" /translation="MVFLSGNASDSSNCTQPPAPVNISKAILLGVILGGLILFGVLCN ILVILSVACHRHLHSVTHYYIVNLAVADLLLTSTVLPFSAIFEVLGYWAFGRVFCNIW AAVDVLCCTASIMGLCIISIDRYIGVTYPLRYPTIVTQRRGLMALLCVWALSLVISIG PLFGWRQPAPEDETICQINEEPGYVLFSALGSFYLPLAIILVMYCRVYVVAKRESRGL KSGLKTDKSDSEQVTLRIHRKNAPAGGSGMASAKTKTHFSVRLLKFSREKKAAKTLGI VVGCFVLCWLPFFLVMPIGSFFPDFKPSETVFKIVFWLGYLNSCINPIIYPCSSQEFK KAFQNVLRIQCLCRKQSSKHALGYPLHPPSQAVEGQHKDMVRIPVGSRETFYRISKTD GVCEWKFFSSMPRGSARITVSKDQSSCTTARVRSKSFLQVCCCVGPSTPCLDKNHQVP TIKVHTISLSENGEEV" BASE COUNT 376 a 617 c 503 g 406 t ORIGIN 1 cgaatcatgt gcagaatctg aatcttcccc cagccaggac gaataagaca gcgcggaaaa 61 gcagattctc gtaattctgg aattgcatgt tgcaaggagt ctcctggatc ttcgcaccca 121 gcttcgggta gggagggagt cgggtcccgg ctaggccagc ccggcaggtg gagagggtcc 181 ccggcagccc cgcgcgcccc tggccatgtc tttaatgccc tgccccttca tgtggccttc 241 tgagggttcc cagggctggc cagggttgtt tcccacccgc gcgcgctctc acccccagcc 301 aaacccacct ggcagggctc cctccagccg agaccttttg attcccggct cccgcgctcc 361 cgcctccgcg ccacgccggg aggtggccct ggacagccgg acctcgcccg gccccggctg 421 gaccatggtg tttctctcgg gaaatgcttc cgacagctcc aactgcaccc aaccgccggc 481 accggtgaac atttccaagg ccattctgct cggggtgatc ttggggggcc tcattctttt 541 cggggtgctg tgtaacatcc tagtgatcct ctccgtagcc tgtcaccgac acctgcactc 601 agtcacgcac tactacatcg tcaacctggc ggtggccgac ctcctgctca cctccacggt 661 gctgcccttc tccgccatct tcgaggtcct aggctactgg gccttcggca gggtcttctg 721 caacatctgg gcggcagtgg atgtgctgtg ctgcaccgcg tccatcatgg gcctctgcat 781 catctccatc gaccgctaca tcggcgtgac gtacccgctg cgctacccaa ccatcgtcac 841 ccagaggagg ggtctcatgg ctctgctctg cgtctgggca ctctccctgg tcatatccat 901 tggacccctg ttcggctgga ggcagccggc ccccgaggac gagaccatct gccagatcaa 961 cgaggagccg ggctacgtgc tcttctcagc gctgggctcc ttctacctgc ctctggccat 1021 catcctggtc atgtactgcc gcgtctacgt ggtggccaag agggagagcc ggggcctcaa 1081 gtctggcctc aagaccgaca agtcggactc ggagcaagtg acgctccgca tccatcggaa 1141 aaacgccccg gcaggaggca gcgggatggc cagcgccaag accaagacgc acttctcagt 1201 gaggctcctc aagttctccc gggagaagaa agcggccaaa acgctgggca tcgtggtcgg 1261 ctgcttcgtc ctctgctggc tgcctttttt cttagtcatg cccattgggt ctttcttccc 1321 tgatttcaag ccctctgaaa cagtttttaa aatagtattt tggctcggat atctaaacag 1381 ctgcatcaac cccatcatat acccatgctc cagccaagag ttcaaaaagg cctttcagaa 1441 tgtcttgaga atccagtgtc tctgcagaaa gcagtcttcc aaacatgccc tgggctaccc 1501 cctgcacccg cccagccagg ccgtggaagg gcaacacaag gacatggtgc gcatccccgt 1561 gggatcaaga gagaccttct acaggatctc caagacggat ggcgtttgtg aatggaaatt 1621 tttctcttcc atgccccgtg gatctgccag gattacagtg tccaaagacc aatcctcctg 1681 taccacagcc cgggtgagaa gtaaaagctt tttgcaggtc tgctgctgtg tagggccctc 1741 aaccccctgc cttgacaaga accatcaagt tccaaccatt aaggtccaca ccatctccct 1801 cagtgagaac ggggaggaag tctaggacag gaaagatgca gaggaaaggg gaataatctt 1861 aggtacccac cccacttcct tctcggaagg ccagctcttc tt // LOCUS HSU02609 2475 bp mRNA PRI 11-JUN-1994 DEFINITION Human transducin-like protein mRNA, complete cds. ACCESSION U02609 NID g414535 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2475) AUTHORS Weinstat-Saslow,D.L., Germino,G.G., Somlo,S. and Reeders,S.T. TITLE A transducin-like gene maps to the autosomal dominant polycystic kidney disease gene region JOURNAL Genomics 18, 709-711 (1993) MEDLINE 94140377 REFERENCE 2 (bases 1 to 2475) AUTHORS Weinstat-Saslow,D.L. TITLE Direct Submission JOURNAL Submitted (19-OCT-1993) Debra L. Weinstat-Saslow, Laboratory of Pathology, National Institutes of Health, Building 10, Room 2A33, 9000 Rockville Pike, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..2475 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="sazD-c" /map="16p13.3" CDS 393..1952 /codon_start=1 /product="transducin-like protein" /db_xref="PID:g414536" /translation="MAFDPTSTLLATGGCDGAVRVWDIVRHYGTHHFRGSPGVVHLVA FHPDPTRLLLFSSATDAAIRVWSLQDRSCLAVLTAHYSAVTSLAFSADGHTMLSSGRD KICIIWDLQSCQATRTVPVFESVEAAVLLPEEPVSQLGVKSPGLYFLTAGDQGTLRVW EAASGQCVYTQAQPPGPGQELTHCTLAHTAGVVLTATADHNLLLYEARSLRLQKQFAG YSEEVLDVRFLGPEDSHVVVASNSPCLKVFELQTSACQILHGHTDIVLALDVFRKGWL FASCAKDQSVRIWRMNKAGQVMCVAQGSGHTHSVGTVCCSRLKESFLVTGSQDCTVKL WPLPKALLSKNTAPDNGPILLQAHTTQRCHDKDINSVAIAPNDKLLATGSQDRTAKLW ALPQCQLLGVFSGHRVASGASSSLPWTRCWPRPQLMAPSSSGHSRTSAVSRHLRGTML LCLKVAFVSRGTQLLSSGSDGLVKLWTIKNNECVRTLDAHEDKVWGLQAGWTTTPSLG PVTPESSSGRM" BASE COUNT 451 a 801 c 775 g 448 t ORIGIN 1 cggggtggtg ccggacccta gcaggtttca gctggagcgc ggcgcggcaa catggcagag 61 accgcggccg gagtcgggtc gcttcaagac caactatgct gtggagcgca aaattgagcc 121 tttctacaag ggcggaaaag cacagctgga ccagactggc cagcacctct tctgcgtctg 181 tggcaccaga gtcaacattc tggaagtggc ctcgggggcc gtgctgcgga gtctggagca 241 ggaggaccag gaggacatca ctgcctttga cctcagccct gacaacgagg tgctggtgac 301 gccagtcggg cattgctgct ggctcagtgg gcctggcaag agggcagcgt tacccgcctg 361 tggaaggcga tacacacgcc cccgtggcca ccatggcctt cgaccccacc tccactctgc 421 tagccacagg tggctgtgat ggggccgtgc gcgtctggga catcgtgcgg cactacggga 481 cacaccactt ccgaggctcg cccggtgtcg tgcacctagt ggccttccac ccggacccta 541 cacgcctgct gctcttctcc tcggccacgg atgccgccat ccgcgtgtgg tcactgcagg 601 accggtcatg cctggctgtg ctgactgccc actacagcgc cgtcacctca ctggccttca 661 gcgccgacgg ccacaccatg ctcagctccg gccgtgacaa gatatgtatc atctgggacc 721 ttcagagctg ccaggccacg aggaccgtgc ctgtgtttga gagcgtggag gctgctgtgc 781 tgttgccaga ggagccagtg tcccagctgg gtgtgaagtc cccagggctg tactttctga 841 cagctggcga ccaaggcact ctgcgcgtgt gggaggcagc ttctgggcag tgtgtgtaca 901 cgcaggccca gccgccgggc cctgggcagg agctgaccca ctgcaccctg gcacacaccg 961 ccggcgtggt cctcaccgcc accgccgacc acaacctgtt gctctacgag gctcgctccc 1021 tgcggctgca gaaacagttc gctggctaca gtgaggaggt tttggatgtc cggtttcttg 1081 ggcccgagga ctcccacgtt gtcgtggcct ccaatagccc ctgcctaaaa gtgtttgagc 1141 tgcagacgtc agcctgccag atcctccacg gccacacgga tatcgtcctg gccctggatg 1201 tgttccggaa ggggtggctc tttgccagct gtgccaagga tcagagcgtc cgtatctgga 1261 gaatgaacaa ggctggccag gtgatgtgcg tggctcaggg ttccggtcac acacacagtg 1321 tgggcaccgt ctgctgctct aggctgaagg agtccttcct ggtgacaggc agccaggact 1381 gcactgtgaa gctgtggcct cttcccaaag ccttgctgtc caagaacaca gccccagaca 1441 acggccctat cctcctgcag gcccacacca ctcagcgctg ccatgataag gacatcaaca 1501 gcgtggctat tgcccccaac gacaagctgc tggccacagg ctcacaggac cgcacggcca 1561 agctctgggc cctgccacag tgccagctgc tgggtgtctt ctcaggccac cgcgtggcct 1621 ctggtgcgtc cagttctctc ccatggacca ggtgctggcc acggcctcag ctgatggcac 1681 catcaagctc tgggcactcc aggacttcag ctgtctcaag acatttgagg ggcacgatgc 1741 ttctttgtct gaaggtggcc tttgtgagcc gtggcacgca gctgctgtcc agcggttcgg 1801 atggcctcgt gaagctctgg accatcaaga acaacgagtg tgtgcggacg ctggatgccc 1861 acgaggacaa ggtctggggg ctgcaggccg gctggacgac cacgccctca ctggggccag 1921 tgactcccga gtcatcctct ggaaggatgt gaccgaggcg gacgaggcag aggagcaggc 1981 caggcaagag gagcaggtgg tcaggcagca agagctggac aacctgctgc atgagaagcg 2041 gtacctgcgg gcgctgggcc tggccatctc cctggatcgg ccccacaccg tgctgactgt 2101 catccaggcc atccggaggg accctgaggc ctgcgagaag ctggaagcca ccatgctccg 2161 actgcggcgc gaccagaaag gccctgctgc gcttctgcgt cacgtggaac accaactcgc 2221 ggcactgcca cgaggcccag gccgtgctgg gtgtgctctt gaggcgagag gcccccgagg 2281 agctgctggc ctacgaaggc gtgcgggcag cgccttgagg ccctgctgcc ctacactgag 2341 cggcactttc agcggctcag caggtaccct ccaggccgcc gctttcttgg acttcctgtg 2401 gcacaacatg aagctccctg tgccgccgcc gcccccaccc cctgggaaac ccataaaggc 2461 gcactgccct aaaaa // LOCUS HSU02619 6996 bp mRNA PRI 10-MAY-1994 DEFINITION Human TFIIIC Box B-binding subunit mRNA, complete cds. ACCESSION U02619 NID g414932 KEYWORDS transcription factor; RNA polymerase III. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6996) AUTHORS L'Etoile,N.D., Fahnestock,M.L., Shen,Y., Aebersold,R. and Berk,A.J. TITLE Human TFIIIC Box B-Binding Subunit JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91, 1652-1656 (1994) MEDLINE 94173888 REFERENCE 2 (bases 1 to 6996) AUTHORS L'Etoile,N.D. TITLE Direct Submission JOURNAL Submitted (20-OCT-1993) Noelle D. L'Etoile, Microbiology and Molecular Genetics, University of California at Los Angeles, 405 Hilgard Ave., Los Angeles, CA 90024-0150, USA FEATURES Location/Qualifiers source 1..6996 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="TFIIICalpha" /clone_lib="cDNA library A.M. Mes-Masson et al. (1986) PNAS 83, 9768-72" /sex="female" /cell_line="K-562 human erythroleukemia" /cell_type="undifferentiated myloid and lymphoid" CDS 61..6390 /codon_start=1 /product="TFIIIC Box B-binding subunit" /db_xref="PID:g442362" /translation="MDALESLLDEVALEGLDGLCLPALWSRLETRVPPFPLPLEPCTQ EFLWRALATHPGISFYEEPRERPDLQLQDRYEEIDLETGILESRRDPVALEDVYPIHM ILENKDGIQGSCRYFKERKNITNDIRTKSLQPRCTMVEPFDRWGKKLIIGSLPAHAVQ ALDSPGGGSRPEAARLLLLHPGTARPVQVQGELQRDLHTTAFKVDAGKLHYHRKILNK NGLITMQSHVIRLPTGAQQHSILLLLNRFHVDRRSKYDILMEKLSVMLSTRTNHIETL GKLREELGLCERTFKRLYQYMLNAGLAKVVSLRLQEIHPECGPCKTKKGTDVMVRCLK LLKEFKRNDHDDDEDEEVISKTVPPVDIVFERDMLTQTYDLIERRGTKGISQAEIRVA MNVGKLEARMLCRLLQRFKVVKGFMEDEGRQRTTKYISCVFAEESDLSRQYQREKARS ELLTTVSLASMQEESLLPEGEDTFLSESDSEEERSSSKRRGRGSQKDTRASANLRPKT QPHHSTPTKGGWKVVNLHPLKKQPPSFPGAAEERACQSLASRDSLLDTSSVSEPNVSF VSHCADSNSGDIAVIEEVRMENPKESSSSLKTGRHSSGQDKPHETYRLLKRRNLIIEA VTNLRLIESLFTIQKMIMDQEKQEGVSTKCCKKSIVRLVRNLSEEGLLRLYRTTVIQD GIKKKVDLVVHPSMDQNDPLVRSAIEQVRFRISNSSTANRVKTSQPPVPQGEAEEDSQ GKEGPSGSGDSQLSASSRSESGRMKKSDNKMGITPLRNYHPIVVPGLGRSLGFLPKMP RLRVVHMFLWYLIYGHPASNTVEKPSFISERRTIKQESGRAGVRPSSSGSAWEACSEA PSKGSQDGVTWEAEVELATETVYVDDASWMRYIPPIPVHRDFGFGWALVSDILLCLPL SIFIQIVQVSYKVDNLEEFLNDPLKKHTLIRFLPRPIRQQLLYKRRYIFSVVENLQRL CYMGVLQFGPTEKFQDKDQVFIFLKKNAVIVDTTICDPHYNLGRRRRPFERRLYVLNS MQDVENYWFDLQCVCLNTPLGVVRCPRVRKNSSTDQGSDEEGSLQKEQESAMDKHNLE RKCAMLEYTTGSREVVDEGLIPGDGLGAAGLDSSFYGHLKRNWIWTSYIINQAKKENT AAENGLTVRLQTFLSKRPMPLSARGNSRLNIWGEARVGSELCAGWEEQFEVDREPSLD RNRRVRGGKSQKRKRLKKDPGKKIKRKKKGEFPGEKSKRLRYHDEADQSALHRMTRLR VTWSMQEDGLLVLCRIASNVLNTKVKGPFVTWQVVRDILHATFEESLDKTSHSLGRRA RYIVKNPQAYLNYKVCLAEVYQDKALVGDFMNRRGDYDDPKVCANEFKEFVEKLKEKF SSALRNSNLEIPDTLQELFARYRVLAIGDEKDQTRKEDELNSVDDIHFLVLQNLIQST LALSDSQMKSYQSFQTFRLYREYKDHVLVKAFMECQKRSLVNRRRVNHTLGPKKNRAL PFVPMSYQLSQTYYRIFTWRFPSTICTESFQFLDRMRAAGKLDQPDRFSFKDQDNNEP TNDMVAFSLDGPGGNCVAVLTLFSLGLISVDVRIPEQIIVVDSSMVENEVIKSLGKDG SLEDDEDEEDDLDEGVGGKRRSMEVKPAQASHTNYLLMRGYYSPGIVSTRNLNPNDSI VVNSCQMKFQLRCTPVPARLRPAAAPLEELTMGTSCLPDTFTKLINPQENTCSLEEFV LQLELSGYSPEDLTAALEILEAIIATGCFGIDKEELRRRFSALEKAGGGRTRTFADCI QALLEQHQVLEVGGNTARLVAMGSAWPWLLHSVRLKDREDADIQREDPQARPLEGSSS EDSPPEGQAPPSHSPRGTKRRASWASENGETDAEGTQMTPAKRPALQDSNLAPSLGPG AEDGAEAQAPSPPPALEDTAAAGAAQEDQEGVGEFSSPGQEQLSGQAQPPEGSEDPRG FTESFGAANISQAARERDCESVCFIGRPWRVVDGHLNLPVCKGMMEAMLYHIMTRPGI PESSLLRHYQGVLQPVAVLELLQGLESLGCIRKRWLRKPRPVSLFSTPVVEEVEVPSS LDESPMAFYEPTLDCTLRLGRVFPHEVNWNKWIHL" BASE COUNT 1663 a 1959 c 2030 g 1344 t ORIGIN 1 atggaccaag gcccttggcg gtgcgttgcg cacccccggg gcgccgcgac tgaagtagca 61 atggacgcgc tggagtcgtt gttggacgaa gtcgctctgg aggggctcga tggcctgtgt 121 ctgccagcgc tgtggagccg gctggagacg cgagtgccgc ccttcccgct gcctttggaa 181 ccctgcacgc aggagtttct ctggcgggcc ctcgccacgc acccgggcat cagcttctat 241 gaggagcctc gggagcgacc cgacctacag ctccaggacc ggtatgaaga aattgatttg 301 gaaactggaa ttttggagtc taggagggac ccggtggctt tggaggatgt ctaccccatt 361 catatgatct tagagaataa ggatggcatc cagggctcat gccgctactt taaggagagg 421 aaaaacatta ccaatgacat cagaaccaag tccttgcagc ctcgctgtac aatggtggaa 481 ccctttgaca ggtgggggaa gaaactgatc atcggttccc tcccagccca tgcggtacag 541 gcccttgata gcccaggagg gggatcccga cctgaagctg cccgacttct cctactgcat 601 cctggaacgg ctaggccggt ccaggtgcaa ggggagctcc agcgagacct tcacaccact 661 gctttcaagg ttgatgctgg gaagctgcac tatcacagaa aaattttgaa caaaaacggg 721 ctgattacaa tgcagtccca tgtgatccga ttacccactg gagcccagca acactcaatc 781 ctcctcctac tgaaccggtt tcatgtggac aggaggagca aatacgacat cctcatggag 841 aagctttcgg tcatgctgag cacacggact aaccacatag agacgctggg aaagctgagg 901 gaagagctgg ggctgtgcga aaggacgttt aagcgtctgt accagtatat gctgaacgcc 961 gggctagcca aggtggtgtc tcttcgcttg caagagatcc accctgaatg tggaccttgt 1021 aagacaaaga aagggaccga cgtcatggtt cggtgcctca agctgctgaa ggaatttaaa 1081 cggaatgacc atgatgatga cgaggacgag gaggtcatct ccaagacagt gcctccagtg 1141 gacattgtgt tcgagcggga tatgctcaca cagacctacg acctcattga gcgcagaggc 1201 acgaaaggaa tttcccaagc tgaaatccga gtggctatga atgtgggaaa actagaagca 1261 agaatgctgt gccgacttct tcaaagattc aaagttgtca agggattcat ggaagacgaa 1321 ggtcggcagc gaaccaccaa gtacatttcc tgcgtgtttg cagaggagag cgacctaagc 1381 cggcagtacc aaagagagaa ggcccgcagc gagctcttga ccaccgtgag cctggcgtct 1441 atgcaggagg agtcgcttct gcctgaaggc gaggacacct tcctctctga gtcggacagt 1501 gaggaggaga ggagcagcag caagcggaga ggcagagggt cccagaaaga cacaagagcc 1561 tctgcaaacc tccggcccaa gacccagcct catcactcca ccccaaccaa gggtgggtgg 1621 aaagttgtaa acctacaccc attgaaaaag cagccgccct ccttcccagg agctgctgaa 1681 gagagagcct gccagagcct tgccagcagg gacagcctct tagataccag cagcgtctca 1741 gaacccaacg tgtcctttgt ctcccactgt gcggacagca acagtggtga catagctgtg 1801 atcgaggagg tccggatgga aaacccaaag gagagtagca gttccctgaa gactgggagg 1861 cacagctcag gccaagacaa accacacgaa acttaccgac tgctgaaacg caggaatctg 1921 atcatagaag ctgtcaccaa tcttcgctta atcgagagtt tattcacgat tcagaagatg 1981 atcatggatc aggagaagca ggaaggcgtg tccaccaagt gctgcaagaa gtccattgtc 2041 cgcttggtgc ggaacctgtc tgaggaaggt ctcttgcgat tgtatcggac cactgtcatt 2101 caagatggca tcaagaagaa ggtggatctg gtggtgcacc cgtccatgga ccagaacgac 2161 cctctagtga gaagtgccat cgagcaggtc cgcttccgga tctccaattc aagcacagcc 2221 aacagggtta aaacttccca gcctccagtg ccccaagggg aggcagaaga agacagtcaa 2281 ggaaaagagg gcccaagtgg atcaggggac tctcagctga gtgcttcctc tagatcagaa 2341 agtggacgga tgaaaaaaag tgataataaa atgggcataa ccccgcttag aaattatcac 2401 cccattgtag ttcccggact ggggcgttct ctaggatttc tgcccaaaat gcctcgcctg 2461 cgggtggtcc acatgtttct gtggtacctc atctacgggc accctgccag caacaccgtg 2521 gagaagccaa gcttcatcag tgaacggaga acgataaagc aggagtcagg cagggcaggc 2581 gtccggccgt cctcctctgg aagtgcctgg gaggcctgct ctgaagcccc atctaaaggc 2641 agccaagatg gtgtcacctg ggaggctgaa gtggagcttg ccacggagac agtgtatgtc 2701 gacgatgcct cgtggatgcg ctacatcccc ccaatcccag tccacaggga cttcggcttt 2761 ggctgggctc tcgtcagcga catcctcctc tgccttcccc tctccatctt catccagatt 2821 gtgcaagtca gctacaaggt ggacaacctg gaggaatttc tgaacgaccc gctgaagaag 2881 cacacgctga tccgctttct ccccaggccc attcggcagc agcttctgta caagaggcgt 2941 tacatttttt cggtggtgga gaaccttcag aggctgtgct acatgggggt gctacagttt 3001 ggtcccacgg aaaagtttca ggataaagat caggtcttta tcttcttgaa gaagaatgca 3061 gtcattgttg acactaccat ctgcgaccca cattacaacc tgggccgcag gaggcggccc 3121 ttcgagaggc gcctctatgt cctgaactca atgcaggatg tggaaaacta ctggtttgac 3181 ctgcagtgcg tctgcctcaa caccccacta ggcgtggtgc gctgcccgcg cgtcaggaag 3241 aacagcagca cagaccaggg cagcgacgag gagggcagcc tgcagaagga gcaggagagc 3301 gccatggaca agcacaacct ggagcgcaag tgcgccatgc tggagtacac cactggaagc 3361 cgtgaggtgg tggatgaagg cttgatccct ggagatgggc tgggtgccgc agggctcgat 3421 tccagcttct acggacacct caagcgcaac tggatctgga ccagctacat catcaaccag 3481 gccaaaaagg agaacactgc cgcagagaat ggactcacag tgaggctcca gacatttctg 3541 tccaagcgcc caatgcccct cagtgccaga ggcaacagca ggttgaatat ttggggggaa 3601 gcaagagtag gctccgagct ctgtgctggc tgggaagagc agtttgaggt ggaccgagag 3661 ccctcgctgg accgaaaccg gagagtgagg ggtgggaaaa gccagaagcg gaagcggctg 3721 aagaaggacc ctgggaagaa gatcaagaga aagaagaaag gagagttccc aggagaaaaa 3781 agcaaaaggc tgcgctacca tgatgaagcc gaccagagtg ccctgcatcg gatgacgcgg 3841 cttcgtgtca cctggtctat gcaggaggat gggctgcttg tgctgtgccg cattgccagc 3901 aatgtcctca acaccaaggt gaagggtcca tttgtcacct ggcaggtggt acgggacatt 3961 ttgcatgcca cgtttgaaga gtctttggat aaaacatctc attcccttgg acgaagagct 4021 cgctacatag tcaaaaaccc acaggcctat ctcaactata aagtgtgcct ggccgaggtg 4081 taccaggata aagcacttgt tggagatttc atgaatcgaa gaggtgacta tgatgaccca 4141 aaggtttgtg ccaacgagtt taaagaattt gtggagaagc ttaaagaaaa gttcagttca 4201 gccctaagga attctaacct tgaaatccca gacacactcc aggagctgtt cgccaggtac 4261 cgagttttgg caattgggga tgaaaaagat caaaccagga aagaggatga acttaacagc 4321 gtggatgaca tccactttct ggtgcttcag aacctgatcc agagcacgct ggccctctca 4381 gacagtcaga tgaagtccta ccagtcattc cagactttcc gcctctatcg ggagtacaag 4441 gaccacgttc ttgtgaaggc cttcatggag tgccagaaga ggagcttggt caaccggcgc 4501 cgggtcaacc acacgctggg ccccaagaag aaccgggccc tccccttcgt gccaatgtcc 4561 taccagctat cccagaccta ctacaggatt tttacgtggc gatttccaag caccatctgc 4621 acggagtcat tccagttttt ggacagaatg cgggctgccg gcaagttgga ccagcctgat 4681 cgtttctctt tcaaagacca ggataataac gagcccacaa acgacatggt ggccttttca 4741 ctggacggcc ctggaggaaa ttgtgtggcc gtcctgaccc tcttctctct gggcctcatt 4801 tctgtggatg tcaggatccc ggagcagatc atcgtggtag acagctcaat ggtggagaat 4861 gaggtcatca aaagcttggg gaaggacggc agcctggagg atgacgagga tgaagaggat 4921 gacttggacg aaggtgtagg gggcaagcgc cggagcatgg aggtgaaacc tgcgcaagcc 4981 tcccacacca actacctgct gatgaggggc tactactccc ccggcatcgt cagcacccgc 5041 aacctcaacc ccaacgacag cattgtggtc aactcctgcc agatgaagtt ccagctccgc 5101 tgcacccctg tgcccgcccg gctcaggccc gctgccgctc ctctggaaga gctaacaatg 5161 ggaacctcct gcctccctga tacgttcacc aagctgataa acccccagga aaacacctgc 5221 agcttggagg agtttgtcct ccagctggag ctgtctgggt atagtcccga agacctgact 5281 gctgccttgg agatcttgga agccattata gccacgggtt gttttgggat tgacaaggag 5341 gagctgcgca gacggttctc ggccttggag aaggcaggtg gtgggcgcac caggacattc 5401 gcagattgca tccaggccct cctggagcag catcaggtgc tggaggtcgg tggcaacact 5461 gcgcgcctgg tagccatggg ctctgcctgg ccttggctcc tgcactccgt gcggctgaaa 5521 gacagagaag acgccgacat ccagagagaa gacccccagg ccagacccct ggaggggtct 5581 tccagtgagg acagcccccc cgaggggcag gcacctcctt ctcacagccc ccggggcacc 5641 aagaggcgcg ccagctgggc cagtgagaat ggggagaccg acgccgaggg cacccagatg 5701 acccctgcca agaggccagc gctccaggac tcaaatttgg cccccagcct tgggcccgga 5761 gctgaagatg gggcagaagc ccaggcccca tctccacccc cagctcttga agacaccgct 5821 gcagcgggag cagcacagga agaccaagag ggtgtcggtg agttcagttc cccaggccaa 5881 gagcagctga gcggccaggc gcagcctcca gagggctctg aagaccccag agggttcaca 5941 gagagtttcg gagctgccaa catctcccag gcagcacggg aaagggactg tgagagtgtc 6001 tgcttcatcg gccggccgtg gcgtgtcgtg gatggccacc tgaaccttcc tgtatgcaag 6061 ggtatgatgg aggccatgct gtaccacatc atgaccaggc ctggcatccc cgagagctcc 6121 ctgctgcgcc actaccaggg ggtcctgcag cccgtcgccg tgctggagtt gctccagggc 6181 ctggagtccc tcggctgcat ccggaagcgc tggctgagaa agccaaggcc tgtctcgctc 6241 ttctctacac ccgtggtgga agaggtggaa gtgccctcca gcctggacga gagccccatg 6301 gctttctatg agcccacctt ggactgtacc ctccggctgg gccgtgtgtt cccccacgag 6361 gtcaactgga acaagtggat ccacctctag gacccctgtg ggcgtcccct ccctcccagc 6421 caccgcctgc cacaccactc ctgcctggtg ctcggcagac cccactgtgc cctggccttg 6481 ggtctgccga gcctcctgca gcaggggacg ggtgctttgg ccagagtcac agactgacac 6541 gtttcccact gtactggaac tctggaaaga ggggctcccc gacctgccca tcccccaggc 6601 tcttctgggc cttccccttg ggaactggcc tcatcacact gggagttggt gcttcttgtc 6661 tctgggtctc cagagtttgc cccgcctgtg cacacctcac attccagact ctagccatct 6721 cggcaggatc tcctggctcc ttgagtgccc aggtgccacc aagaggaagg gccttgtggg 6781 atacaccttg cagaataggg atcggtgtgc cccgctgcga ggggcccccc atgggggctg 6841 tggcccctcc gcaggcagga catcccaacc cctggctggg actgaaccac ccagagcgga 6901 gcggctccct tttcagcctt gtgagtcacc tggcaggccc cagctgggct ggctgtccgt 6961 gtccctcagc ctggctggtg attccttgca ggaggg // LOCUS HSU02680 3000 bp mRNA PRI 03-FEB-1994 DEFINITION Human protein tyrosine kinase mRNA, complete cds. ACCESSION U02680 NID g451481 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3000) AUTHORS Beeler,J.F., LaRochelle,W.J., Chedid,M., Tronick,S.R. and Aaronson,S.A. TITLE Prokaryotic expression cloning of a novel human tyrosine kinase JOURNAL Mol. Cell. Biol. 14 (2), 982-988 (1994) MEDLINE 94119116 REFERENCE 2 (bases 1 to 3000) AUTHORS Beeler,J.F. TITLE Direct Submission JOURNAL Submitted (22-OCT-1993) John F. Beeler, Laboratory of Cellular & Molecular Biology, National Cancer Institute, Building 37 Room 1E24, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..3000 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="Beta-gal-A6" /clone_lib="M426" /cell_line="M426" /cell_type="fibroblast" /tissue_type="lung" /dev_stage="embryo" CDS 61..1113 /codon_start=1 /product="protein tyrosine kinase" /db_xref="PID:g451482" /translation="MSHQTGIQASEDVKEIFARARNGKYRLLKISIENEQLVIGSYSQ PSDSWDKDYDSFVLPLLEDKQPCYILFRLDSQNAQGYEWIFIAWSPDHSHVRQKMLYA ATRATLKKEFGGGHIKDEVFGTVKEDVSLHGYKKYLLSQSSPAPLTAAEEELRQIKIN EVQTDVGVDTKHQTLQGVAFPISREAFQALEKLNNRQLNYVQLEIDIKNEIIILANTT NTELKDLPKRIPKDSARYHFFLYKHSHEGDYLESIVFIYSMPGYTCSIRERMLYSSCK SRLLEIVERQLQMDVIRKIEIDNGDELTADFLYEEVHPKQHAHKQSFAKPKGPAGKRG IRRLIRGPAETEATTD" BASE COUNT 1019 a 500 c 550 g 931 t ORIGIN 1 ccgccggccg gggcgcctgg ctgcactcag cgccggagcc gggagctagc ggccgccgcc 61 atgtcccacc agaccggcat ccaagcaagt gaagatgtta aagagatctt tgccagagcc 121 agaaatggaa agtacagact tctgaaaata tctattgaaa atgagcaact tgtgattgga 181 tcatatagtc agccttcaga ttcctgggat aaggattatg attcctttgt tttacccctg 241 ttggaggaca aacaaccatg ctatatatta ttcaggttag attctcagaa tgcccaggga 301 tatgaatgga tattcattgc atggtctcca gatcattctc atgttcgtca aaaaatgttg 361 tatgcagcaa caagagcaac tctgaagaag gaatttggag gtggccacat taaagatgaa 421 gtatttggaa cagtaaagga agatgtatca ttacatggat ataaaaaata cttgctgtca 481 caatcttccc ctgccccact gactgcagct gaggaagaac tacgacagat taaaatcaat 541 gaggtacaga ctgacgtggg tgtggacact aagcatcaaa cactacaagg agtagcattt 601 cccatttctc gagaagcctt tcaggctttg gaaaaattga ataatagaca gctcaactat 661 gtgcagttgg aaatagatat aaaaaatgaa attataattt tggccaacac aacaaataca 721 gaactgaaag atttgccaaa gaggattccc aaggattcag ctcgttacca tttctttctg 781 tataaacatt cccatgaagg agactattta gagtccatag tttttattta ttcaatgcct 841 ggatacacat gcagtataag agagcggatg ctgtattcta gctgcaagag ccgtctgcta 901 gaaattgtag aaagacaact acaaatggat gtaattagaa agatcgagat agacaatggg 961 gatgagttga ctgcagactt cctttatgaa gaagtacatc ccaagcagca tgcacacaag 1021 caaagttttg caaaaccaaa aggtcctgca ggaaaaagag gaattcgaag actaattagg 1081 ggcccagcgg aaactgaagc tactactgat taaagtcatc acattaaaca ttgtaatact 1141 agttttttaa aagtccagct tttagtacag gagaactgaa atcattccat gttgatataa 1201 agtagggaaa aaaattgtac tttttggaaa atagcacttt tcacttctgt gtgtttttaa 1261 aattaatgtt atagaagact catgatttct atttttgagt taaagctaga aaagggttca 1321 acataatgtt taattttgtc acactgtttt catagcgttg attccacact tcaaatactt 1381 cttaaaattt tatacagttg ggccagttct agaaagtctg atgtctcaaa gggtaaactt 1441 actactttct tgtgggacag aaagacctta aaatattcat attacttaat gaatatgtta 1501 aggaccaggc tagagtattt tctaagctgg aaacttagtg tgccttggaa aagccgcaag 1561 ttgcttactc cgagtagctg tgctagctct gtcagactgt aggatcatgt ctgcaacttt 1621 tagaaatagt gctttatatt gcagcagtct tttatatttg actttttttt aatagcatta 1681 aaattgcaga tcagctcact ctgaaacttt aagggtacca gatattttct atactgcagg 1741 atttctgatg acattgaaag actttaaaca gccttagtaa attatctttc taatgctctg 1801 tgaggccaaa catttatgtt cagattgaaa tttaaattaa tatcattcaa aaggaaacaa 1861 aaaatgttga gttttaaaaa tcaggattga cttttttctc caaaaccata catttatggg 1921 caaattgtgt tctttatcac ttccgagcaa atactcagat ttaaaattac tttaaagtcc 1981 tggtacttaa caggctaacg tagataaaca ccttaataat ctcagttaat actgtatttc 2041 aaaacacatt taactgtttt ctaatgcttt gcattatcag ttacaaccta gagagatttt 2101 gagcctcata tttctttgat acttgaaata gagggagcta gaacacttaa tgtttaatct 2161 gttaaacctg ctgcaagagc cataactttg aggcattttc taaatgaact gtggggatcc 2221 aggatttgta atttcttgat ctaaacttta tgctgcataa atcacttatc ggaaatgcac 2281 atttcatagt gtgaagcact catttctaaa ccttattatc taaggtaata tatgcacctt 2341 tcagaaattt gtgttcgagt aagtaaagca tattagaata attgtgggtt gacagatttt 2401 taaaatagaa tttagagtat ttggggtttt gtttgtttac aaataatcag actataatat 2461 ttaaacatgc aaaataactg acaataatgt tgcacttgtt tactaaagat ataagttgtt 2521 ccatgggtgt acacgtagac agacacacat acacccaaat tattgcatta agaatcctgg 2581 agcagaccat agctgaagct gttattttca gtcaggaaga ctacctgtca tgaaggtata 2641 aaataattta gaagtgaatg tttttctgta ccatctatgt gcaattatac tctaaattcc 2701 actacactac attaaagtaa atggacattc cagaatatag atgtgattat agtcttaaac 2761 taattattat taaaccaatg attgctgaaa atcagtgatg catttgttat agagtataac 2821 tcatcgttta cagtatgttt tagttggcag tatcatacct agatggtgaa taacatattc 2881 ccagtaaatt tatatagcag tgaagaatta catgccttct ggtggacatt ttataagtgc 2941 attttatatc acaataaaaa ttttttctct ttaaaaaaaa aaaacaagaa aaaaaaaaaa // LOCUS HSU02683 3387 bp mRNA PRI 27-SEP-1994 DEFINITION Human alpha palindromic binding protein mRNA, complete cds. ACCESSION U02683 NID g414537 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3387) AUTHORS Efiok,B.J., Chiorini,J.A. and Safer,B. TITLE A key transcription factor for eukaryotic initiation factor-2 alpha is strongly homologous to developmental transcription factors and may link metabolic genes to cellular growth and development JOURNAL J. Biol. Chem. 269 (29), 18921-18930 (1994) MEDLINE 94308151 REFERENCE 2 (sites) AUTHORS Jacob,W.F., Silverman,T.A., Cohen,R.B. and Safer,B. TITLE Identification and characterization of a novel transcription factor participating in the expression of eukaryotic initiation factor 2 alpha JOURNAL J. Biol. Chem. 264 (34), 20372-20384 (1989) MEDLINE 90062168 REFERENCE 3 (bases 1 to 3387) AUTHORS Efiok,B.J.S. TITLE Direct Submission JOURNAL Submitted (22-OCT-1993) Bassey J.S. Efiok, National Institutes of Health, Molecular Hematology Branch - NHLBI, 9000 Rockville Pike, Bldg. 10/Rm 7D18, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..3387 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="AP62-29" /clone_lib="K562 Primary Library" /cell_line="K562" /cell_type="malignant hematopoietic multipotent leukemia" CDS 1030..2541 /note="alpha-Pal" /citation=[1] /codon_start=1 /function="transcriptional regulation of eIF-2alpha gene expression" /evidence=experimental /product="alpha palindromic binding protein" /db_xref="PID:g414538" /translation="MEEHGVTQTEHMATIEAHAVAQQVQQVHVATYTEHSMLSADEDS PSSPEDTSYDDSDILNSTAADEVTAHLAAAGPVGMAAAAAVATGKKRKRPHVFESNPS IRKRQQTRLLRKLRATLDEYTTRVGQQAIVLCISPSKPNPVFKVFGAAPLENVVRKYK SMILEDLESALAEHAPAPQEVNSELPPLTIDGIPVSVDKMTQAQLRAFIPEMLKYSTG RGKPGWGKESCKPIWWPEDIPWANVRSDVRTEEQKQRVSWTQALRTIVKNCYKQHGRE DLLYAFEDQQTQTQATATHSIAHLVPSQTVVQTFSNPDGTVSLIQVGTGATVATLADA SELPTTVTVAQVNYSAVADGEVEQNWATLQGGEMTIQTTQASEATQAVASLAEAAVAA SQEMQQGATVTMALNSEAAAHAVATLAEATLQGGGQIVLSGETAAAVGALTGVQDANG LVQIPVSMYQTVVTSLAQGNGPVQVAMAPVTTRISDSAVTMDGQAVEVVTLEQ" misc_feature 1261..1314 /standard_name="Bipartate nuclear targeting signal" /note="basic DNA-binding domain" /citation=[1] /function="localization of polypeptide to nucleus; DNA binding" repeat_region 2809..2870 /note="contains some imperfection" /citation=[1] /function="unknown" /rpt_type=direct /rpt_unit=2809..2810 /label=rpt1 repeat_region 2887..2929 /note="contains some imperfections" /citation=[1] /function="unknown" /rpt_type=direct /rpt_unit=2887..2888 /label=rpt2 polyA_signal 3347..3353 /citation=[1] BASE COUNT 896 a 806 c 754 g 931 t ORIGIN 1 taaggtattt ataccaattg acgtttaaaa cgtctctaac gtgccttaaa ttgcgtttaa 61 acgatactat gagctaatac actctgacaa acttcttgtc agcacgttca aactcttcaa 121 cataatcaat tgccacaaca taattatacg tgtcaatctt agcaactgac atattgccgt 181 caaattcttg caactctgca atcaattctg ctactgtcat cattctacct ccaaatattc 241 aaattttacg cccttgtgag acttcagtct tcccttaagt acgttgacaa ttgtgcttgg 301 cagaatacct cgctccttag cgtaaatggt agcactgtca aatacttcat acgtaccgtc 361 ttcatggatt gctttaatag gtttttgatt agttggcttt aattgtttgt ttgttttaac 421 aaaattcttc tctgcgtaat tgttaatctc atctagtctg ttgctgtctt taaacgcgtt 481 ataaatcaga gttttaactg attttgtttt tgatttacct tttttagaca gtgccactac 541 tagccaacca gacccactac gacccggctt taaaattctt ccgtaacgaa cactctttac 601 acgtccatgt gtacttactt gataatatcc ttcgtagcct tcgatatctt tccatacttc 661 aatcatttat tttctcattt ctatttatta actaacttat gttctgcttt gttgagtaca 721 acgtgccagt tatctaagtt catgttagtt ttgtatccgt ctattggctc taaaaactcc 781 cgatactcca acttgcggac aaacgtttca aacatcaact tcttaactaa cacattcaat 841 cgtcgatcgc ttttcgacgt ttttccaatt gtgacgtaaa cgtttccttt aattcttacg 901 catgttttta attctatatc gacaccgtgt gatactttct ttacgtggtc ctttatttga 961 tattaaaata tattccgtcg cagtctccac ggcgcaggcc cacggtagcg cagccgctct 1021 gagaacttca tggaggaaca cggagtgacc caaaccgaac atatggctac catagaagca 1081 catgcagtgg cccagcaagt gcagcaggtc catgtggcta cttacaccga gcatagtatg 1141 ctgagtgctg atgaagactc gccttcttcg cccgaggaca cctcttacga tgactcagat 1201 atactcaact ccacagcagc tgatgaggtg acagctcatc tggcagctgc aggtcctgtg 1261 ggaatggccg ctgctgctgc tgtggcaaca ggaaagaaac ggaaacggcc tcatgtattt 1321 gagtctaatc catctatccg gaagaggcaa caaacacgtt tgcttcggaa acttcgagcc 1381 acgttagatg aatatactac tcgtgtggga cagcaagcta ttgtcctctg tatctcaccc 1441 tccaaaccta accctgtctt taaagtgttt ggtgcagcac ctttggagaa tgtggtgcgt 1501 aagtacaaga gcatgatcct ggaagacctg gagtctgctc tggcagaaca cgcccctgcg 1561 ccacaggagg ttaactcaga actgccgcct ctcaccatcg acggaattcc agtctctgtg 1621 gacaaaatga cccaggccca gcttcgggca tttatcccag agatgctcaa gtactctaca 1681 ggtcggggaa aaccaggctg ggggaaagaa agctgcaagc ccatctggtg gcctgaagat 1741 atcccctggg caaatgtccg gagtgatgtc cgcacagaag agcaaaagca gagggtttca 1801 tggacccagg cactacggac catagttaaa aactgttata aacagcatgg gcgggaagac 1861 cttttgtatg cctttgaaga tcagcaaacg caaacacagg ccacagccac acatagtata 1921 gctcatcttg taccatcaca gactgtagtc cagactttta gtaaccctga tggcactgtc 1981 tcacttatcc aggttggtac gggggcaaca gtagccacat tggctgatgc ttcagaattg 2041 ccaaccacgg tcaccgttgc ccaagtgaat tattctgccg tggctgatgg agaggtggaa 2101 caaaattggg ccacgttaca gggaggtgag atgaccatcc agacgacgca agcatcagag 2161 gccacccagg cggtggcatc gttggcagag gccgcagtgg cagcttctca ggagatgcag 2221 cagggagcta cagtcactat ggcgcttaac agcgaagctg ccgcccatgc tgtcgccacc 2281 ctggctgagg ccaccttaca aggtggggga cagatcgtct tgtctgggga aaccgcagca 2341 gccgtcggag cacttactgg agtccaagat gctaatggcc tggtccagat ccctgtgagc 2401 atgtaccaga ctgtggtgac cagcctcgcc cagggcaacg gaccagtgca ggtggccatg 2461 gcccctgtga ccaccaggat atcagacagc gcagtcacca tggacggcca agctgtggag 2521 gtggtgacat tggaacagtg acatacagcc atattatggc atcgttttct agtctacttc 2581 aaaatttttt acacgtttgc agaggtgcaa tcaaatggaa ttaagtctct cgactttgga 2641 aggaaagttt tgttaacctt ttttttttta aaaggaagaa agccgggatt ttggaattgc 2701 attttttaaa gcaccactct tgattttctg ggattggtga agaaactgca ttgtcaattt 2761 cactgtccca aaaaagccaa attgtggcag gacttctttc tgcggaaatg tgtgtgtata 2821 cttatgtgtg tgtatgtgtg agtgtgaata tatgtatatg tgtacatatg gacatacaca 2881 tttacatata tataaagtat atatatacat atatatatat atatgtatga aacccgcatg 2941 gaattatctg tatgaaatca aggtgcgctg tggaaacaat aattcaccca gtttagtggg 3001 tggtagggta cgtggccaga cacagtcacc cagtttttgt tcataccagg gtcatgcgtt 3061 gagctactga caaactcagg cggaggtgac catgcccttc accaaagctg cctcccagtg 3121 gccacacaga actctccctg ctggactcac ctgaggaaag aggctccagc atggggtggg 3181 tcagagatgt gcttgcaagg tccagggact gcgtggtctg ccagctgaga tgctcctcgg 3241 gctggcccag gtgctgacct tgccacaggc agatgaatgt cttgaaagct cccgggcctc 3301 agcctcccat ctcctctcct tcccaggaat ccttgatctc atgactatta aaatgttgct 3361 ctgggtttta aaaaaaaaaa aaaaaaa // LOCUS HSU02687 3475 bp mRNA PRI 11-JUN-1994 DEFINITION Human growth factor receptor tyrosine kinase (STK-1) mRNA, complete cds. ACCESSION U02687 NID g409572 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3475) AUTHORS Small,D., Levenstein,M., Kim,E., Carow,C., Amin S,. Rockwell,P., Witte,L., Burrow,C., Ratajczak,M.Z., Gewirtz,A.M. and Civin,C.I. TITLE STK-1, the human homolog of FLK-2/FLT-3, is selectively expressed in CD34+ human bone marrow cells and is involved in the proliferation of early progenitor/stem cells JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91, 459-463 (1994) MEDLINE 94119906 REFERENCE 2 (bases 1 to 3475) AUTHORS Small,D. TITLE Direct Submission JOURNAL Submitted (25-OCT-1993) Donald Small, Oncology, Johns Hopkins University School of Medicine, 600 N. Wolfe St., Baltimore, MD 21287 USA FEATURES Location/Qualifiers source 1..3475 /organism="Homo sapiens" /db_xref="taxon:9606" /map="13q12" /cell_type="stem cell" /tissue_type="bone marrow" CDS 58..3039 /codon_start=1 /product="serine/threonine protein kinase" /db_xref="PID:g409573" /translation="MPALARDAGTVPLLVVFSAMIFGTITNQDLPVIKCVLINHKNND SSVGKSSSYPMVSESPEDLGCALRPQSSGTVYEAAAVEVDVSASITLQVLVDAPGNIS CLWVFKHSSLNCQPHFDLQNRGVVSMVILKMTETQAGEYLLFIQSEATNYTILFTVSI RNTLLYTLRRPYFRKMENQDALVCISESVPEPIVEWVLCDSQGESCKEESPAVVKKEE KVLHELFGTDIRCCARNELGRECTRLFTIDLNQTPQTTLPQLFLKVGEPLWIRCKAVH VNHGFGLTWELENKALEEGNYFEMSTYSTNRTMIRILFAFVSSVARNDTGYYTCSSSK HPSQSALVTIVGKGFINATNSSEDYEIDQYEEFCFSVRFKAYPQIRCTWTFSRKSFPC EQKGLDNGYSISKFCNHKHQPGEYIFHAENDDAQFTKMFTLNIRRKPQVLAEASASQA SCFSDGYPLPSWTWKKCSDKSPNCTEEITEGVWNRKANRKVFGQWVSSSTLNMSEAIK GFLVKCCAYNSLGTSCETILLNSPGPFPFIQDNISFYATIGVCLLFIVVLTLLICHKY KKQFRYESQLQMVQVTGSSDNEYFYVDFREYEYDLKWEFPRENLEFGKVLGSGAFGKV MNATAYGISKTGVSIQVAVKMLKEKADSSEREALMSELKMMTQLGSHENIVNLLGACT LSGPIYLIFEYCCYGDLLNYLRSKREKFHRTWTEIFKEHNFSFYPTFQSHPNSSMPGS REVQIHPDSDQISGLHGNSFHSEDEIEYENQKRLEEEEDLNVLTFEDLLCFAYQVAKG MEFLEFKSCVHRDLAARNVLVTHGKVVKICDFGLARDIMSDSNYVVRGNARLPVKWMA PESLFEGIYTIKSDVWSYGILLWEIFSLGVNPYPGIPVDANFYKLIQNGFKMDQPFYA TEEIYIIMQSCWAFDSRKRPSFPNLTSFLGCQLADAEEAMYQNVDGRVSECPHTYQNR RPFSREMDLGLLSPQAQVEDS" BASE COUNT 1042 a 709 c 784 g 940 t ORIGIN 1 cgaggcggca tccgagggct gggccggcgc cctgggggac cccgggctcc ggaggccatg 61 ccggcgttgg cgcgcgacgc gggcaccgtg ccgctgctcg ttgttttttc tgcaatgata 121 tttgggacta ttacaaatca agatctgcct gtgatcaagt gtgttttaat caatcataag 181 aacaatgatt catcagtggg gaagtcatca tcatatccca tggtatcaga atccccggaa 241 gacctcgggt gtgcgttgag accccagagc tcagggacag tgtacgaagc tgccgctgtg 301 gaagtggatg tatctgcttc catcacactg caagtgctgg tcgatgcccc agggaacatt 361 tcctgtctct gggtctttaa gcacagctcc ctgaattgcc agccacattt tgatttacaa 421 aacagaggag ttgtttccat ggtcattttg aaaatgacag aaacccaagc tggagaatac 481 ctacttttta ttcagagtga agctaccaat tacacaatat tgtttacagt gagtataaga 541 aataccctgc tttacacatt aagaagacct tactttagaa aaatggaaaa ccaggacgcc 601 ctggtctgca tatctgagag cgttccagag ccgatcgtgg aatgggtgct ttgcgattca 661 cagggggaaa gctgtaaaga agaaagtcca gctgttgtta aaaaggagga aaaagtgctt 721 catgaattat ttgggacgga cataaggtgc tgtgccagaa atgaactggg cagggaatgc 781 accaggctgt tcacaataga tctaaatcaa actcctcaga ccacattgcc acaattattt 841 cttaaagtag gggaaccctt atggataagg tgcaaagctg ttcatgtgaa ccatggattc 901 gggctcacct gggaattaga aaacaaagca ctcgaggagg gcaactactt tgagatgagt 961 acctattcaa caaacagaac tatgatacgg attctgtttg cttttgtatc atcagtggca 1021 agaaacgaca ccggatacta cacttgttcc tcttcaaagc atcccagtca atcagctttg 1081 gttaccatcg taggaaaggg atttataaat gctaccaatt caagtgaaga ttatgaaatt 1141 gaccaatatg aagagttttg tttttctgtc aggtttaaag cctacccaca aatcagatgt 1201 acgtggacct tctctcgaaa atcatttcct tgtgagcaaa agggtcttga taacggatac 1261 agcatatcca agttttgcaa tcataagcac cagccaggag aatatatatt ccatgcagaa 1321 aatgatgatg cccaatttac caaaatgttc acgctgaata taagaaggaa acctcaagtg 1381 ctcgcagaag catcggcaag tcaggcgtcc tgtttctcgg atggataccc attaccatct 1441 tggacctgga agaagtgttc agacaagtct cccaactgca cagaagagat cacagaagga 1501 gtctggaata gaaaggctaa cagaaaagtg tttggacagt gggtgtcgag cagtactcta 1561 aacatgagtg aagccataaa agggttcctg gtcaagtgct gtgcatacaa ttcccttggc 1621 acatcttgtg agacgatcct tttaaactct ccaggcccct tccctttcat ccaagacaac 1681 atctcattct atgcaacaat tggtgtttgt ctcctcttca ttgtcgtttt aaccctgcta 1741 atttgtcaca agtacaaaaa gcaatttagg tatgaaagcc agctacagat ggtacaggtg 1801 accggctcct cagataatga gtacttctac gttgatttca gagaatatga atatgatctc 1861 aaatgggagt ttccaagaga aaatttagag tttgggaagg tactaggatc aggtgctttt 1921 ggaaaagtga tgaacgcaac agcttatgga attagcaaaa caggagtctc aatccaggtt 1981 gccgtcaaaa tgctgaaaga aaaagcagac agctctgaaa gagaggcact catgtcagaa 2041 ctcaagatga tgacccagct gggaagccac gagaatattg tgaacctgct gggggcgtgc 2101 acactgtcag gaccaattta cttgattttt gaatactgtt gctatggtga tcttctcaac 2161 tatctaagaa gtaaaagaga aaaatttcac aggacttgga cagagatttt caaggaacac 2221 aatttcagtt tttaccccac tttccaatca catccaaatt ccagcatgcc tggttcaaga 2281 gaagttcaga tacacccgga ctcggatcaa atctcagggc ttcatgggaa ttcatttcac 2341 tctgaagatg aaattgaata tgaaaaccaa aaaaggctgg aagaagagga ggacttgaat 2401 gtgcttacat ttgaagatct tctttgcttt gcatatcaag ttgccaaagg aatggaattt 2461 ctggaattta agtcgtgtgt tcacagagac ctggccgcca ggaacgtgct tgtcacccac 2521 gggaaagtgg tgaagatatg tgactttgga ttggctcgag atatcatgag tgattccaac 2581 tatgttgtca ggggcaatgc ccgtctgcct gtaaaatgga tggcccccga aagcctgttt 2641 gaaggcatct acaccattaa gagtgatgtc tggtcatatg gaatattact gtgggaaatc 2701 ttctcacttg gtgtgaatcc ttaccctggc attccggttg atgctaactt ctacaaactg 2761 attcaaaatg gatttaaaat ggatcagcca ttttatgcta cagaagaaat atacattata 2821 atgcaatcct gctgggcttt tgactcaagg aaacggccat ccttccctaa tttgacttcg 2881 tttttaggat gtcagctggc agatgcagaa gaagcgatgt atcagaatgt ggatggccgt 2941 gtttcggaat gtcctcacac ctaccaaaac aggcgacctt tcagcagaga gatggatttg 3001 gggctactct ctccgcaggc tcaggtcgaa gattcgtaga ggaacaattt agttttaagg 3061 acttcatccc tccacctatc cctaacaggc tgtagattac caaaacaaga ttaatttcat 3121 cactaaaaga aaatctatta tcaactgctg cttcaccaga cttttctcta gaagccgtct 3181 gcgtttactc ttgttttcaa agggactttt gtaaaatcaa atcatcctgt cacaaggcag 3241 gaggagctga taatgaactt tattggagca ttgatctgca tccaaggcct tctcaggccg 3301 gcttgagtga attgtgtacc tgaagtacag tatattcttg taaatacata aaacaaaagc 3361 attttgctaa ggagaagcta atatgatttt ttaagtctat gttttaaaat aatatgtaaa 3421 tttttcagct atttagtgat atattttatg ggtgggaata aaatttctac tacag // LOCUS HSU03056 2517 bp mRNA PRI 01-SEP-1994 DEFINITION Human tumor suppressor (LUCA-1) mRNA, complete cds. ACCESSION U03056 NID g532973 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2517) AUTHORS Bader,S.D., Latif,F., Duh,F., Lerman,M.I. and Minna,J.D. TITLE Candidate tumor suppressor gene LUCA-1 involved in lung carcinogenesis JOURNAL Unpublished REFERENCE 2 (bases 1 to 2517) AUTHORS Duh,F. TITLE Direct Submission JOURNAL Submitted (01-NOV-1993) Fuh-Mei Duh, BCDP, PRI/Dyncorp, NCI-FCRDC, Building 560, Room 12-71, Frederick, MD 21702, USA FEATURES Location/Qualifiers source 1..2517 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="LUCA-1" /clone_lib="Lambda ZapII" /chromosome="3" /map="3p21.2" /tissue_type="heart" gene 617..1924 /gene="LUCA-1" CDS 617..1924 /gene="LUCA-1" /codon_start=1 /function="tumor suppressor" /db_xref="PID:g532974" /translation="MAGHLLPICALFLTLLDMAQGFRGPLVPNRPFTTVWNANTQWCL ERHGVDVDVSVFDVVANPGQTFRGPDMTIFYSSQLGTYPYYTPTGEPVFGGLPQNASL IAHLARTFQDILAAIPAPDFSGLAVIDWEAWRPRWAFNWDTKDIYRQRSRALVQAQHP DWPAPQVEAVAQDQFQGAARAWMAGTLQLGGALRPRGLWGFYGFPDCYNYDFLSPNYT GQCPSGIRAQNDQLGWLWGQSRALYPSIYMPAVLEGTGKSQMYVQHRVAEAFRVAVAA GDPNLPVLPYVQIFYDTTNHFLPLDELEHSLGESAAQGAAGVVLWVSWENTRTKESCQ AIKEYMDTTLGPFILNVTSGALLCSQALCSGHGRCVRRTSHPKALLLLNPASFSIQLT PGGGPLSLRGALSLEDQAQMAVEFKCRCYPGWQAPWCERKSMW" BASE COUNT 524 a 756 c 710 g 527 t ORIGIN 1 ttcctccagg agtctctggt gcagctgggg tggaatctgg ccaggccctg cttaggcccc 61 catcctgggg tcaggaaatt tggaggataa ggcccttcag ccccaaggtc agcagggacg 121 agcgggcaga ctggcgggtg tacaggaggg ctgggttgac ctgtccttgg tcactgaggc 181 cattggatct tcctccagtg gctgccagga tttctggtgg aagagacagg aaggcctccc 241 ccccttggtc gggtcagcct gggggctgag ggcctggctg tcagccactc ttcccagaac 301 atatgtcatg gcctcagtgg ctcatgggga agcaggggtg ggcgagctta ggctagagca 361 agtcctgtgg gagatggcag aggcctggtc tgagaggcaa ctcggatgtg ccctccagtg 421 gccatgctcc cctccatgcg tctcccctgc cctcctggag ccctgcaggt caatgtttaa 481 cagaaaccag agcagcggtg gattaatgcg caagggctca gccccccagc cctgagcagt 541 gggggaatcg gagactttgc aacctgttct cagctctgcc tcccctgggc aggttgtcct 601 cgaccagtcc cgtgccatgg caggccacct gcttcccatc tgcgccctct tcctgacctt 661 actcgatatg gcccaaggct ttaggggccc cttggtaccc aaccggccct tcaccaccgt 721 ctggaatgca aacacccagt ggtgcctgga gaggcacggt gtggacgtgg atgtcagtgt 781 cttcgatgtg gtagccaacc cagggcagac cttccgcggc cctgacatga caattttcta 841 tagctcccag ctgggcacct acccctacta cacgcccact ggggagcctg tgtttggtgg 901 tctgccccag aatgccagcc tgattgccca cctggcccgc acattccagg acatcctggc 961 tgccatacct gctcctgact tctcagggct ggcagtcatc gactgggagg catggcgccc 1021 acgctgggcc ttcaactggg acaccaagga catttaccgg cagcgctcac gggcactggt 1081 acaggcacag caccctgatt ggccagctcc tcaggtggag gcagtagccc aggaccagtt 1141 ccagggagct gcacgggcct ggatggcagg caccctccag ctgggggggg cactgcgtcc 1201 tcgcggcctc tggggcttct atggcttccc tgactgctac aactatgact ttctaagccc 1261 caactacacc ggccagtgcc catcaggcat ccgtgcccaa aatgaccagc tagggtggct 1321 gtggggccag agccgtgccc tctatcccag catctacatg cccgcagtgc tggagggcac 1381 agggaagtca cagatgtatg tgcaacaccg tgtggccgag gcattccgtg tggctgtggc 1441 tgctggtgac cccaatctgc cggtgctgcc ctatgtccag atcttctatg acacgacaaa 1501 ccactttctg cccctggatg agctggagca cagcctgggg gagagtgcgg cccagggggc 1561 agctggagtg gtgctctggg tgagctggga aaatacaaga accaaggaat catgtcaggc 1621 catcaaggag tatatggaca ctacactggg gcccttcatc ctgaacgtga ccagtggggc 1681 ccttctctgc agtcaagccc tgtgctccgg ccatggccgc tgtgtccgcc gcaccagcca 1741 ccccaaagcc ctcctcctcc ttaaccctgc cagtttctcc atccagctca cgcctggtgg 1801 tggccccctg agcctgcggg gtgccctctc acttgaagat caggcacaga tggctgtgga 1861 gttcaaatgt cgatgctacc ctggctggca ggcaccgtgg tgtgagcgga agagcatgtg 1921 gtgattggcc acacactgag ttgcacatat tgagaaccta atgcactctg ggtctggcca 1981 gggcttcctc aaatacatgc acagtcatac aagtcatggt cacagtaaag agtacactca 2041 gccactgtca caggcatatt ccctgcacac acatgcatac ttacagactg gaatagtggc 2101 ataaggagtt agaaccacag cagacaccat tcattcctgc tccatatgca tctacttggc 2161 aaggtcatag acaattcctc cagagacact gagccagtct ttgaactgca gcaatcacaa 2221 aggctgacat tcactgagtg cctactcttt gccaatcccc gtgctaagcg ttttatgtgg 2281 acttattcat tcctcacaat gaggctatga ggaaactgag tcactcacat tgagagtaag 2341 cacgttgccc aaggttgcac agcaagaaaa gggagaagtt gagattcaaa cccaggctgt 2401 ctagctccgg gggtacagcc cttgcactcc tactgagttt gtggtaacca gccctgcacg 2461 acccctgaat ctgctgagag gcaccagtcc agcaaataaa gcagtcatga tttactt // LOCUS HSU03057 2767 bp mRNA PRI 02-FEB-1996 DEFINITION Human actin bundling protein (HSN) mRNA, complete cds. ACCESSION U03057 NID g458027 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2767) AUTHORS Duh,F.-M., Latif,F., Weng,Y., Geil,L., Modi,W., Stackhouse,T., Matsumura,F., Duan,D.R., Linehan,W.M., Lerman,M.I. and Gnarra,J.R. TITLE cDNA cloning and expression of the human homolog of the sea urchin fascin and Drosophila singed genes which encodes an actin-bundling protein JOURNAL DNA Cell Biol. 13 (8), 821-827 (1994) MEDLINE 94347211 REFERENCE 2 (bases 1 to 2767) AUTHORS Duh,F. TITLE Direct Submission JOURNAL Submitted (01-NOV-1993) Fuh-Mei Duh, BCDP, PRI/ Dyncorp, NCI-FCRDC, Building 560, Room 12-71, Frederick, MD 21702, USA FEATURES Location/Qualifiers source 1..2767 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="Hsn" /chromosome="7" /map="7p22" /tissue_type="teratocarcinoma" 5'UTR 1..111 gene 112..1593 /gene="HSN" CDS 112..1593 /gene="HSN" /codon_start=1 /evidence=experimental /product="actin bundling protein" /db_xref="PID:g458028" /translation="MTANGTAEAVQIQFGLINCGNKYLTAEAFGFKVNASASSLKKKQ IWTLEQPPDEAGSAAVCLRSHLGRYLAADKDGNVTCEREVPGPDCRFLIVAHDDGRWS LQSEAHRRYFGGTEDRLSCFAQTVSPAEKWSVHIAMHPQVNIYSVTRKRYAHLSARPA DEIAVDRDVPWGVDSLITLAFQDQRYSVQTADHRFLRHDGRLVARPEPATGYTLEFRS GKVAFRDCEGRYLAPSGPSGTLKAGKATKVGKDELFALEQSCAQVVLQAANERNVSTR QGMDLSANQDEETDQETFQLEIDRDTKKCAFRTHTGKYWTLTATGGVQSTASSKNASC YFDIEWRDRRITLRASNGKFVTSKKNGQLAASVETAGDSELFLMKLINRPIIVFRGEH GFIGCRKVTGTLDANRSSYDVFQLEFNDGAYNIKDSTGKYWTVGSDSAVTSSGDTPVD FFFEFCDYNKVAIKVGGRYLKGDHAGVLKASAETVDPASLWEY" 3'UTR 1594..2767 polyA_signal 2748..2753 BASE COUNT 478 a 941 c 856 g 492 t ORIGIN 1 gcggagggtg cgtgcgggcc gcggcagccg aacaaaggag caggggcgcc gccgcaggga 61 cccgccaccc acctcccggg gccgcgcagc ggcctctcgt ctactgccac catgaccgcc 121 aacggcacag ccgaggcggt gcagatccag ttcggcctca tcaactgcgg caacaagtac 181 ctgacggccg aggcgttcgg gttcaaggtg aacgcgtccg ccagcagcct gaagaagaag 241 cagatctgga cgctggagca gccccctgac gaggcgggca gcgcggccgt gtgcctgcgc 301 agccacctgg gccgctacct ggcggcggac aaggacggca acgtgacctg cgagcgcgag 361 gtgcccggtc ccgactgccg tttcctcatc gtggcgcacg acgacggtcg ctggtcgctg 421 cagtccgagg cgcaccggcg ctacttcggc ggcaccgagg accgcctgtc ctgcttcgcg 481 cagacggtgt cccccgccga gaagtggagc gtgcacatcg ccatgcaccc tcaggtcaac 541 atctacagtg tcacccgtaa gcgctacgcg cacctgagcg cgcggccggc cgacgagatc 601 gccgtggacc gcgacgtgcc ctggggcgtc gactcgctca tcaccctcgc cttccaggac 661 cagcgctaca gcgtgcagac cgccgaccac cgcttcctgc gccacgacgg gcgcctggtg 721 gcgcgccccg agccggccac tggctacacg ctggagttcc gctccggcaa ggtggccttc 781 cgcgactgcg agggccgtta cctggcgccg tcggggccca gcggcacgct caaggcgggc 841 aaggccacca aggtgggcaa ggacgagctc tttgctctgg agcagagctg cgcccaggtc 901 gtgctgcagg cggccaacga gaggaacgtg tccacgcgcc agggtatgga cctgtctgcc 961 aatcaggacg aggagaccga ccaggagacc ttccagctgg agatcgaccg cgacaccaaa 1021 aagtgtgcct tccgtaccca cacgggcaag tactggacgc tgacggccac cgggggcgtg 1081 cagtccaccg cctccagcaa gaatgccagc tgctactttg acatcgagtg gcgtgaccgg 1141 cgcatcacac tgagggcgtc caatggcaag tttgtgacct ccaagaagaa tgggcagctg 1201 gccgcctcgg tggagacagc aggggactca gagctcttcc tcatgaagct catcaaccgc 1261 cccatcatcg tgttccgcgg ggagcatggc ttcatcggct gccgcaaggt cacgggcacc 1321 ctggacgcca accgctccag ctatgacgtc ttccagctgg agttcaacga tggcgcctac 1381 aacatcaaag actccacagg caaatactgg acggtgggca gtgactccgc ggtcaccagc 1441 agcggcgaca ctcctgtgga cttcttcttc gagttctgcg actataacaa ggtggccatc 1501 aaggtgggcg ggcgctacct gaagggcgac cacgcaggcg tcctgaaggc ctcggcggaa 1561 accgtggacc ccgcctcgct ctgggagtac tagggccggc ccgtccttcc ccgcccctgc 1621 ccacatggcg gctcctgcca accctccctg ctaacccctt ctccgccagg tgggctccag 1681 ggcgggaggc aagccccctt gcctttcaaa ctggaaaccc cagagaaaac ggtgccccca 1741 cctgtcgccc ctatggactc cccactctcc cctccgcccg ggttccctac tcccctcggg 1801 tcagcggctg cggcctggcc ctgggaggga tttcagatgc ccctgccctc ttgtctgcca 1861 cggggcgagt ctggcacctc tttcttctga cctcagacgg ctctgagcct tatttctctg 1921 gaagcggcta agggacggtt gggggctggg agccctgggc gtgtagtgta actggaatct 1981 tttgcctctc ccagccacct cctcccagcc ccccaggaga gctgggcaca tgtcccaagc 2041 ctgtcagtgg ccctccctgg tgcactgtcc ccgaaacccc tgcttgggaa gggaagctgt 2101 cgggagggct aggactgacc cttgtggtgt ttttttgggt ggtggctgga aacagcccct 2161 ctcccacgtg ggagaggctc agcctggctc ccttccctgg agcggcaggg cgtgacggcc 2221 acagggtctg cccgctgcac gttctgccaa ggtggtggtg gcgggcgggt aggggtgtgg 2281 gggccgtctt cctcctgtct ctttcctttc accctagcct gactggaagc agaaaatgac 2341 caaatcagta ttttttttaa tgaaatatta ttgctggagg cgtcccaggc aagcctggct 2401 gtagtagcga gtgatctggc ggggggcgtc tcagcaccct ccccaggggg tgcatctcag 2461 ccccctcttt ccgtccttcc cgtccagccc cagccctggg cctgggctgc cgacacctgg 2521 gccagagccc ctgctgtgat tggtgctccc tgggcctccc gggtggatga agccaggcgt 2581 cgccccctcc gggagccctg gggtgagccg ccggggcccc cctgctgcca gcctcccccg 2641 tccccaacat gcatctcact ctgggtgtct tggtctttta ttttttgtaa gtgtcatttg 2701 tataactcta aacgcccatg atagtagctt caaactggaa atagcgaaat aaaataactc 2761 agtctgc // LOCUS HSU03090 1016 bp mRNA PRI 18-MAR-1994 DEFINITION Human Ca2+-dependent phospholipase A2 mRNA, complete cds. ACCESSION U03090 NID g460914 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1014) AUTHORS Chen,J., Engle,S.J., Seilhamer,J.J. and Tischfield,J.A. TITLE Cloning and recombinant expression of a novel human low molecular weight Ca2+-dependent phospholipase A2 JOURNAL J. Biol. Chem. 269, 2365-2368 (1994) MEDLINE 94131989 REFERENCE 2 (bases 1 to 1016) AUTHORS Tischfield,J.A. TITLE Direct Submission JOURNAL Submitted (02-NOV-1993) Tischfield J.A., School of Medicine, Indiana University, Medical and Molecular Genetics, 975 West Walnut, Indianapolis, IN 46202-5215, USA FEATURES Location/Qualifiers source 1..1016 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HPLA2-10" /map="1p35" mRNA 1..1014 5'UTR 1..130 CDS 133..549 /codon_start=1 /product="Ca2+-dependent phospholipase A2" /db_xref="PID:g460915" /translation="MKGLLPLAWFLACSVPAVQGGLLDLKSMIEKVTGKNALTNYGFY GCYCGWGGRGTPKDGTDWCCWAHDHCYGRLEEKGCNIRTQSYKYRFAWGVVTCEPGPF CHVNLCACDRKLVYCLKRNLRSYNPQYQYFPNILCS" 3'UTR 548..1014 polyA_signal 992..997 BASE COUNT 228 a 296 c 265 g 227 t ORIGIN 1 atggatacca atgttccgac tggagacggg gagcccgcga gacccgggtc tccagggtct 61 gcccaaggaa gttgctcatg ggagcagacc cctagagcag gatttgaggc caggccaaag 121 agaaccccag agatgaaagg cctcctccca ctggcttggt tcctggcttg tagtgtgcct 181 gctgtgcaag gaggcttgct ggacctaaaa tcaatgatcg agaaggtgac agggaagaac 241 gccctgacaa actacggctt ctacggctgt tactgcggct ggggcggccg aggaaccccc 301 aaggatggca ccgattggtg ctgttgggcg catgaccact gctatgggcg gctggaggag 361 aagggctgca acattcgcac acagtcctac aaatacagat tcgcgtgggg cgtggtcacc 421 tgcgagcccg ggcccttctg ccatgtgaac ctctgtgcct gtgaccggaa gctcgtctac 481 tgcctcaaga gaaacctacg gagctacaac ccacagtacc aatactttcc caacatcctc 541 tgctcctagg cctccccagc gagctcctcc cagaccaaga cttttgttct gtttttctac 601 aacacagagt actgactctg cctggttcct gagagaggct cctaagtcac agacctcagt 661 ctttctcgaa gcttggcgga cccccagggc cacactgtac cctccagcga gtcccaggag 721 agtgactctg gtcataggac ttggtagggt cccagggtcc ctaggcctcc acttctgagg 781 gcagcccctc tggtgccaag agctctcctc caactcaggg ttggctgtgt ctcttttctt 841 ctctgaagac agcgtcctgg ctccagttgg aacactttcc tgagatgcac ttacttctca 901 gcttctgcga tcagattatc atcaccacca ccctccagag aattttacgc aagaagagcc 961 aaattgactc tctaaatctg gtgtatgggt attaaataaa attcattctc aaggct // LOCUS HSU03100 3526 bp mRNA PRI 11-JUN-1994 DEFINITION Human alpha2(E)-catenin mRNA, complete cds. ACCESSION U03100 NID g414981 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3526) AUTHORS Rimm,D.L., Kebriaei,P. and Morrow,J.S. TITLE Molecular Cloning Reveals Alternative Splice Forms of Human alpha(E)-catenin JOURNAL Unpublished REFERENCE 2 (bases 1 to 3526) AUTHORS Rimm,D.L. TITLE Direct Submission JOURNAL Submitted (01-NOV-1993) David L. Rimm, Pathology, Yale University School of Medicine, 310 Cedar Street, New Haven, CT 06510, USA FEATURES Location/Qualifiers source 1..3526 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="Co1.1, Co18.1, Co5.6" /clone_lib="Clonetech Human Colon from E.R. Fearon" /tissue_type="colon" 5'UTR 1..4 CDS 5..2800 /note="alternative splice form" /codon_start=1 /product="alpha2(E)-catenin" /db_xref="PID:g414982" /translation="MTAVHAGNINFKWDPKSLEIRTLAVERLLEPLVTQVTTLVNTNS KGPSNKKRGRSKKAHVLAASVEQATENFLEKGDKIAKESQFLKEELVVAVEDVRKQGD LMKAAAGEFADDPCSSVKRGNMVRAAPALLSAVTRLLILADMADVYKLLVQLKVVEDG ILKLRNAGNEQDLGNQYKALKPEVDKLNIMAAKRQQELKDVGHRDQMAAARGILQSNV PILYTASQACLQHPDVAAYKANRDLIYKQLQQAVTGISNAAQATASDDASQHQGGGGG ELAYALNNFDKQIIVDPLSFSEERFRPSLEERLESIISGAALMADSSCTRDDRRERIV AECNAVRQACRTCVSEYMGNAGRKERSDALNSAIDKMTKKTRDLRRQLRKAVMDHVSD SFLETNVPLLVLIEAAKNGNEKEVKEYAQVFREHANKLIEVANLACSISNNEEGVKLV RMSASQLEAGCPQVINAATWALAPKPQSKLAQENMDLFKEQWEKQVRVLTDAVDDITS IDDFLAVSENHILEDVNKCVIALQEKDVDGLDRTAGAIRGRAARVIHVVTSEMDNYEP GVYTEKVLEATKLLSNTVMPRFTEQVEAAVEALSSDPAQPMDENEFIDASRLVYDGIR DIRKAVLMIRTPEELDDSDFETEDFDVRSETSVQTEDDQLIAGQSARAIMAQLPQEQK AKIREQVASFQEEKSKLDAEVSKWDDSGNDIIVLAKQMCMIMMEMTDFTRGKGPLKNT SDVISAAKKIAEAGSRMDKLGRTIRDHCPDSACKQDLLAYLQRIALYCHQLNICSKVK AEVQNLGGELVVSGNCDTCGALQGLKGWPPPLCLATHWVDSAMSLIQAAKNLMNAVVQ TVKASYVASTKYQKSQGMASLNLPAVSMKMKAPEKKPLVKREKQDETQTKIKRASQKK HVNPVQALSEFKAMDSI" misc_feature 2441..2512 /label=alpha2E_insert 3'UTR 2801..3526 BASE COUNT 1005 a 795 c 906 g 820 t ORIGIN 1 ggaaatgact gctgtccatg caggcaacat aaacttcaag tgggatccta aaagtctaga 61 gatcaggact ctggcagttg agagactgtt ggagcctctt gttacacagg ttacaaccct 121 tgtaaacacc aatagtaaag ggccctctaa taagaagaga ggtcgttcta agaaggccca 181 tgttttggct gcatctgttg aacaagcaac tgagaatttc ttggagaagg gggataaaat 241 tgcaaaagag agccagtttc tcaaggagga gcttgtggtt gctgtagaag atgttcgaaa 301 acaaggtgat ttgatgaagg ctgctgctgg agagttcgca gatgatccct gctcttctgt 361 gaagcgaggc aacatggttc gggcagctcc agctttgctc tctgctgtta cccggttgct 421 cattttggct gacatggcag atgtctacaa attacttgtt cagctgaaag ttgtggaaga 481 tggtatattg aaactgagga atgctggcaa tgaacaagac ttagggaatc agtataaagc 541 cctaaaacct gaagtggata agctgaacat tatggcagca aaaagacaac aggaattgaa 601 agatgttggg catcgtgatc agatggctgc ggctagagga atcctgcaga gcaacgttcc 661 gatcctctat actgcatccc aggcatgcct acagcaccct gatgtcgcag cctataaggc 721 caacagggac ctgatataca agcagctgca gcaggcggtc acagggattt ccaatgcagc 781 ccaggccact gcctcagacg atgcctcaca gcaccagggt ggaggaggag gagaactggc 841 atatgcactc aataactttg acaaacaaat cattgtggac cccttgagct tcagcgagga 901 gcgctttagg ccttccctgg aggagcgtct ggaaagcatc attagtgggg ctgccttgat 961 ggccgactcg tcctgcacgc gtgatgaccg tcgtgagcga attgtggcag agtgtaatgc 1021 tgtccgccag gcctgcagga cctgcgtttc ggagtacatg ggcaatgctg gacgtaaaga 1081 aagaagtgat gcactcaatt ctgcaataga taaaatgacc aagaagacca gggacttgcg 1141 tagacagctt cgcaaagctg tcatggacca cgtttcagat tctttcctgg aaaccaatgt 1201 tccacttttg gtattgattg aagctgcaaa gaatggaaat gagaaagaag ttaaggaata 1261 tgcccaagtt ttccgtgaac atgccaacaa attgattgag gttgccaact tggcctgttc 1321 catctcaaat aatgaagaag gtgtaaagct tgttcgaatg tctgcaagcc agttagaagc 1381 cggttgtcct caggttatta atgctgcaac ctgggcttta gcaccaaaac cacagagtaa 1441 actggcccaa gagaacatgg atctttttaa agaacaatgg gaaaaacaag tccgtgttct 1501 cacagatgct gtcgatgaca ttacttccat tgatgacttc ttggctgtct cagagaatca 1561 cattttggaa gatgtgaaca aatgtgtcat tgctctccaa gagaaggatg tggatggcct 1621 ggaccgcaca gctggtgcaa ttcgaggccg ggcagcccgg gtcattcacg tagtcacctc 1681 agagatggac aactatgagc caggagtcta cacagagaag gttctggaag ccactaagct 1741 gctctccaac acagtcatgc cacgttttac tgagcaagta gaagcagccg tggaagccct 1801 cagctcggac cctgcccagc ccatggatga gaatgagttt atcgatgctt cccgcctggt 1861 atatgatggc atccgggaca tcaggaaagc agtgctgatg ataaggaccc ctgaggagtt 1921 ggatgactct gactttgaga cagaggattt tgatgtcaga agcgagacga gcgtccagac 1981 agaagacgat cagctgatag ctggccagag tgcccgggcg atcatggctc agcttcccca 2041 ggagcaaaaa gcgaagattc gggaacaggt ggccagcttc caggaagaaa agagcaagct 2101 ggatgctgaa gtgtccaaat gggacgacag tggcaatgac atcattgtgc tggccaagca 2161 gatgtgcatg attatgatgg agatgacaga ctttacccga ggtaaaggac cactcaaaaa 2221 tacatcggat gtcatcagtg ctgccaagaa aattgctgag gcaggatcca ggatggacaa 2281 gcttggccgg accattcgag accattgccc cgactcggct tgcaagcagg acctgctggc 2341 ctacctgcaa cgcatcgccc tctactgcca ccagctgaac atctgcagca aggtcaaggc 2401 cgaggtgcag aatctcggcg gggagcttgt tgtctctggg aactgtgaca cctgcggggc 2461 actgcaaggg ctgaaaggct ggcctcctcc cctttgcctg gccactcact gggtggacag 2521 cgccatgtcc ctgatccagg cagccaagaa cttgatgaat gctgtggtgc agacagtgaa 2581 ggcatcctac gtcgcctcta ccaaatacca aaagtcacag ggtatggctt ccctcaacct 2641 tcctgctgtg tcaatgaaga tgaaggcacc agagaaaaag ccattggtga agagagagaa 2701 acaggatgag acacagacca agattaaacg ggcatctcag aagaagcacg tgaacccagt 2761 gcaggccctc agcgagttca aagctatgga cagcatctaa gtctgcccag gccggccgcc 2821 cccacccctc tggctcctga atatcagtca ctgttcgtca ctcaaatgaa tttgctaaat 2881 acaacactga tactagattc cacagggaaa tgggcagact gaaccagtcc aggtggtgaa 2941 ttttccaaga acatagttta agttgattaa aaatgctttt agaatgcagg agcctacttc 3001 tagctgtatt ttttgtatgc ttaaataaaa taaaattcat aaccaagaga tccacattag 3061 cttgttagta atgctctgac caagccgaga tgccattctc ttagtgatgg cggcgttagg 3121 tttgagagaa ggaattggct caacttcagt tgagagggtg cagtccagac agcttgactg 3181 cttttaaatg accaaagatg acctgtggta agcaacctgg catcttagga agcagtcctt 3241 gagaaggcat gttccagaaa ggtctctgag gacaaactca ctcagtaaaa cataatgtat 3301 catgaagaaa actgattctc tatgacatga aatgaaaatt ttaatgcatt gttataatta 3361 ctaatgtacg ctgctgcagg acattaataa agttgctttt ttaggctaca gtgtctcgat 3421 gccataatca gaacacactt tttttcctct ttctcccagc ttcaaatgca caattcatca 3481 ttgggctcac ttctaataac tgcagtgttt ccgccttgcg ttgcag // LOCUS HSU03105 2061 bp mRNA PRI 25-JAN-1996 DEFINITION Human B4-2 protein mRNA, complete cds. ACCESSION U03105 NID g476094 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2061) AUTHORS Chen,J., Liu,L. and Pohajdak,B. TITLE Cloning a cDNA from human NK/T cells which codes for a protein with high proline content JOURNAL Biochim. Biophys. Acta 1264 (1), 19-22 (1995) MEDLINE 96038813 REFERENCE 2 (bases 1 to 2061) AUTHORS Pohajdak,B. TITLE Direct Submission JOURNAL Submitted (03-NOV-1993) Bill Pohajdak, Biology, Dalhousie University, Halifax, Nova Scotia B3H 4J1, Canada FEATURES Location/Qualifiers source 1..2061 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="B4-2" /cell_type="NK/T cells" CDS 114..1097 /codon_start=1 /product="B4-2 protein" /db_xref="PID:g476095" /translation="MTVVSVPQREPLVLGGRLAPLGFSSRGYFGALPMVTTAPPPLPR IPDPRALPPTLFLPHFLGGDGPCLTPQPRAPAALPNRSLAVAGGTPRAAPKKRRKKKV RASPAGQLPSRFHQYQQHRPSLEGGRSPATGPSGAQEVPGPAAALAPSPAAAAGTEGA SPDLAPLRPAAPGQTPLRKEVLKSKMGKSEKIALPHGQLVHGIHLYEQPKINRQKSKY NLPLTKITSAKRNENNFWQDSVSSDRIQKQEKKPFKNTENIKNSHLKKSAFLTEVSQK ENYAGAKFSDPPSPSVLPKPPSHWMGSTVENSNQNRELMAVHLKTLLKVQT" misc_feature complement(1422..1742) /note="identical to Z24920 338bp partial cDNA clone" repeat_region 1600..1980 /rpt_family="Alu" BASE COUNT 543 a 512 c 461 g 545 t ORIGIN 1 tgttccgcga tcttctcagg ctctcctagc agcatccatc gccgccaccc tatcttcact 61 ggcttcacct tctccttctc tcttcgttgc tgagcgacaa gcttcctagc gctatgactg 121 tcgtctccgt cccgcagcgg gagccgctcg tcctgggtgg ccgccttgcg ccgcttggct 181 tttcctcccg aggttacttt ggggccctcc cgatggtgac cacggctccg cctcctttac 241 cccggatccc ggacccccgg gcactgcccc cgaccctctt cctccctcat ttcctagggg 301 gagatggccc gtgtctgacc ccccagcctc gcgctccagc agctctgccc aaccgcagcc 361 tcgccgtggc gggaggcact cctcgggcag cgccgaagaa gcggcgaaag aagaaggtgc 421 gggccagccc cgcagggcag ctgcccagcc gcttccacca gtaccagcag caccggccga 481 gtctggaggg cggccggagc cccgcgaccg gcccgagcgg agcgcaggag gtcccgggcc 541 cggccgccgc cttggccccg agtcctgcag ccgcagccgg cacggaggga gccagccccg 601 accttgcccc gctgcggccc gcggctcccg gccaaacccc cctcaggaaa gaggttttaa 661 aatcaaagat gggaaaatcg gagaaaattg cccttcccca tggccagctt gttcatggta 721 tacacttgta tgagcaacca aagataaaca gacagaaaag caaatataac ttgccactaa 781 ccaagatcac ctctgcaaaa agaaatgaaa acaacttttg gcaggattct gtttcatctg 841 acagaattca gaagcaggaa aaaaagcctt ttaaaaatac cgagaacatt aaaaattcgc 901 atttgaagaa atcagcattt ctaactgaag tgagccaaaa ggaaaattat gctggggcaa 961 agtttagtga tccaccttct cctagtgttc ttccaaagcc tcctagtcac tggatgggaa 1021 gcactgttga aaattccaac caaaacaggg agctgatggc agtacactta aaaaccctcc 1081 tcaaagttca aacttagatt tcagatttca gtatgtgtgt aaaacataat ttttcccata 1141 tccctggact cttgagaaaa ttggtacaga aatggaaatt tgccttgttg caacatacaa 1201 ttgcaaaaga tgagtttaaa aaattacata caaacagctt gtattatatt ttatattttg 1261 taaatactgt ataccatgta ttatgtgtat attgttcata cttgagaggt atattatagt 1321 tttgttatga aagtatgtat tttgccctgc ccacattgca ggtgttttgt atatatacaa 1381 tggataaatt ttaagtgtgt gctaaggcac atggaagacc gattttattt gcacaaggta 1441 ctgagatttt tttcaagaaa cagctgtcaa atctcaaggt gaagatctaa atgtgaacag 1501 tttactaatg cactactgaa gtttaaatct gtggcacaat caatgtaagc atggggtttg 1561 tttctctaaa ttgatttgta atctgaaatt actgaacaac tcctattccc atttttgcta 1621 aactcaattt ctggttttgg tatatatcca ttccagctta atgcctctaa ttttaatgcc 1681 aacaaaattg gttgtaatca aattttaaaa taataataat ttggcccccc cttttaaaat 1741 agtcttgact ctttgtgtgt gactgtttct catgtttgaa tgtgtgacta ggagatgatt 1801 ttgtgtggtt ggattttttt gacttctact ttactggctg agtgtgagcc gccatgcctg 1861 gccataatct acattttctt accaggagca gcattgaggt ttttgagcat agtacttgac 1921 tactctagag gctgagacgg gagcatctct tgagcctgag aagtggagat tgcaattgag 1981 ctaggatcag gccactgcac tccagcctgg gtaacagacg ctgtctcaaa aaaaaggcca 2041 agagaaagta agggagacag a // LOCUS HSU03106 2121 bp mRNA PRI 27-JAN-1994 DEFINITION Human wild-type p53 activated fragment-1 (WAF1) mRNA, complete cds. ACCESSION U03106 NID g414564 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2121) AUTHORS El-Deiry,W.S., Tokino,T., Velculesco,V.E., Levy,D.B., Parsons,R., Trent,J.M., Lin,D., Mercer,E.W., Kinzler,K.W. and Vogelstein,B. TITLE WAF1, a Potential Mediator of p53 Tumor Suppression JOURNAL Cell 75, 817-825 (1993) MEDLINE 94061997 REFERENCE 2 (bases 1 to 2121) AUTHORS El-Deiry,W. TITLE Direct Submission JOURNAL Submitted (03-NOV-1993) El-Deiry W., Johns Hopkins Oncology Center, 424 North Bond Street, Baltimore, MD 21231, USA FEATURES Location/Qualifiers source 1..2121 /organism="Homo sapiens" /db_xref="taxon:9606" /germline gene 76..570 /gene="WAF1" CDS 76..570 /gene="WAF1" /standard_name="wild type p53 activated fragment-1" /codon_start=1 /db_xref="PID:g414565" /translation="MSEPAGDVRQNPCGSKACRRLFGPVDSEQLSRDCDALMAGCIQE ARERWNFDFVTETPLEGDFAWERVRGLGLPKLYLPTGPRRGRDELGGGRRPGTSPALL QGTAEEDHVDLSLSCTLVPRSGEQAEGSPGGPGDSQGRKRRQTSMTDFYHSKRRLIFS KRKP" BASE COUNT 418 a 628 c 575 g 500 t ORIGIN 1 gccgaagtca gttccttgtg gagccggagc tgggcgcgga ttcgccgagg caccgaggca 61 ctcagaggag gcgccatgtc agaaccggct ggggatgtcc gtcagaaccc atgcggcagc 121 aaggcctgcc gccgcctctt cggcccagtg gacagcgagc agctgagccg cgactgtgat 181 gcgctaatgg cgggctgcat ccaggaggcc cgtgagcgat ggaacttcga ctttgtcacc 241 gagacaccac tggagggtga cttcgcctgg gagcgtgtgc ggggccttgg cctgcccaag 301 ctctaccttc ccacggggcc ccggcgaggc cgggatgagt tgggaggagg caggcggcct 361 ggcacctcac ctgctctgct gcaggggaca gcagaggaag accatgtgga cctgtcactg 421 tcttgtaccc ttgtgcctcg ctcaggggag caggctgaag ggtccccagg tggacctgga 481 gactctcagg gtcgaaaacg gcggcagacc agcatgacag atttctacca ctccaaacgc 541 cggctgatct tctccaagag gaagccctaa tccgcccaca ggaagcctgc agtcctggaa 601 gcgcgagggc ctcaaaggcc cgctctacat cttctgcctt agtctcagtt tgtgtgtctt 661 aattattatt tgtgttttaa tttaaacacc tcctcatgta cataccctgg ccgccccctg 721 ccccccagcc tctggcatta gaattattta aacaaaaact aggcggttga atgagaggtt 781 cctaagagtg ctgggcattt ttattttatg aaatactatt taaagcctcc tcatcccgtg 841 ttctcctttt cctctctccc ggaggttggg tgggccggct tcatgccagc tacttcctcc 901 tccccacttg tccgctgggt ggtaccctct ggaggggtgt ggctccttcc catcgctgtc 961 acaggcggtt atgaaattca ccccctttcc tggacactca gacctgaatt ctttttcatt 1021 tgagaagtaa acagatggca ctttgaaggg gcctcaccga gtgggggcat catcaaaaac 1081 tttggagtcc cctcacctcc tctaaggttg ggcagggtga ccctgaagtg agcacagcct 1141 agggctgagc tggggacctg gtaccctcct ggctcttgat acccccctct gtcttgtgaa 1201 ggcaggggga aggtggggta ctggagcaga ccaccccgcc tgccctcatg gcccctctga 1261 cctgcactgg ggagcccgtc tcagtgttga gccttttccc tctttggctc ccctgtacct 1321 tttgaggagc cccagcttac ccttcttctc cagctgggct ctgcaattcc cctctgctgc 1381 tgtccctccc ccttgtcttt cccttcagta ccctctcatg ctccaggtgg ctctgaggtg 1441 cctgtcccac ccccaccccc agctcaatgg actggaaggg gaagggacac acaagaagaa 1501 gggcacccta gttctacctc aggcagctca agcagcgacc gccccctcct ctagctgtgg 1561 gggtgagggt cccatgtggt ggcacaggcc cccttgagtg gggttatctc tgtgttaggg 1621 gtatatgatg ggggagtaga tctttctagg agggagacac tggcccctca aatcgtccag 1681 cgaccttcct catccacccc atccctcccc agttcattgc actttgatta gcagcggaac 1741 aaggagtcag acattttaag atggtggcag tagaggctat ggacagggca tgccacgtgg 1801 gctcatatgg ggctgggagt agttgtcttt cctggcacta acgttgagcc cctggaggca 1861 ctgaagtgct tagtgtactt ggagtattgg ggtctgaccc caaacacctt ccagctcctg 1921 taacatactg gcctggactg ttttctctcg gctccccatg tgtcctggtt cccgtttctc 1981 cacctagact gtaaacctct cgagggcagg gaccacaccc tgtactgttc tgtgtctttc 2041 acagctcctc ccacaatgct gaatatacag caggtgctca ataaatgatt cttagtgact 2101 ttaaaaaaaa aaaaaaaaaa a // LOCUS HSU03109 2449 bp mRNA PRI 30-NOV-1995 DEFINITION Human aspartyl beta-hydroxylase mRNA, complete cds. ACCESSION U03109 NID g458031 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2249) AUTHORS Korioth,F., Gieffers,C. and Frey,J. TITLE Cloning and characterization of the human gene encoding aspartyl beta-hydroxylase JOURNAL Gene 150 (2), 395-399 (1994) MEDLINE 95121937 REFERENCE 2 (bases 1 to 2449) AUTHORS Korioth,F. TITLE Direct Submission JOURNAL Submitted (03-NOV-1993) Korioth F., Fakultaet fuer Chemie-Biochemie II, Universitaet Bielefeld, Universitaetsstrasse 25, Bielefeld, 33615, Germany FEATURES Location/Qualifiers source 1..2449 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="As-5" /clone_lib="MG63-ZAP" /cell_line="mg63" /cell_type="osteosarcoma" 5'UTR 1..77 CDS 78..2351 /codon_start=1 /function="hydroxylation of aspartyl and asparaginyl residues" /product="aspartyl beta-hydroxylase" /db_xref="PID:g458032" /translation="MAQRKNAKSSGNSSSSGSGSGSTSAGSSSPGARRETKHGGHKNG RKGGLSGTSFFTWFMVIALLGVWTSVAVVWFDLVDYEEVLGKLGIYDADGDGDFDVDD AKVLLGLKERSTSEPAVPPEEAEPHTEPEEQVPVEAEPQNIEDEAKEQIQSLLHEMVH AEHVEGEDLQQEDGPTGEPQQEDDEFLMATDVDDRFETLEPEVSHEETEHSYHVEETV SQDCNQDMEEMMSEQENPDSSEPVVEDERLHHDTDDVTYQVYEEQAVYEPLENEGIEI TEVTAPPEDNPVEDSQVIVEEVSIFPVEEQQEVPPETNRKTDDPEQKAKVKKKKPKLL NKFDKTIKAELDAAEKLRKRGKIEEAVNAFKELVRKYPQSPRARYGKAQCEDDLAEKR RSNEVLRGAIETYQEVASLPDVPADLLKLSLKRRSDRQQFLGHMRGSLLTLQRLVQLF PNDTSLKNDLGVGYLLIGDNDNAKKVYEEVLSVTPNDGFAKVHYGFILKAQNKIAESI PYLKEGIESGDPGTDDGRFYFHLGDAMQRVGNKEAYKWYELGHKRGHFASVWQRSLIN VNGLKAQPCGPKETGYTQLVKSLERNWKLIRDEGLAVMDKAKGLFLPEDENLREKGDW SQFTLWQQGRRNENACKGAPKTCTLLEKFPETTGCRRGQIKYSIMHPGTHVWPHTGPT NCRLRMHLGLVIPKEGCKIRCANETKTWEEGKVLIFDDSFEHEVWQDASSFRLIFIVD VWHPELTPQQRRSLPAI" 3'UTR 2352..2449 BASE COUNT 778 a 507 c 655 g 509 t ORIGIN 1 agctgcccgc gtcgcgtgtg tacccccgcg cactgaagga ggtccaccag ccctcaccag 61 cccccgcgga ccgtgcaatg gcccagcgta agaatgccaa gagcagcggc aacagcagca 121 gcagcggctc cggcagcggt agcacgagtg cgggcagcag cagccccggg gcccggagag 181 agacaaagca tggaggacac aagaatggga ggaaaggcgg actctcagga acttcattct 241 tcacgtggtt tatggtgatt gcattgctgg gcgtctggac atctgtagct gtcgtttggt 301 ttgatcttgt tgactatgag gaagttctag gaaaactagg aatctatgat gctgatggtg 361 atggagattt tgatgtggat gatgccaaag ttttattagg acttaaagag agatctactt 421 cagagccagc agtcccgcca gaagaggctg agccacacac tgagcccgag gagcaggttc 481 ctgtggaggc agaaccccag aatatcgaag atgaagcaaa agaacaaatt cagtcccttc 541 tccatgaaat ggtacacgca gaacatgttg agggagaaga cttgcaacaa gaagatggac 601 ccacaggaga accacaacaa gaggatgatg agtttcttat ggcgactgat gtagatgata 661 gatttgagac cctggaacct gaagtatctc atgaagaaac cgagcatagt taccacgtgg 721 aagagacagt ttcacaagac tgtaatcagg atatggaaga gatgatgtct gagcaggaaa 781 atccagattc cagtgaacca gtagtagaag atgaaagatt gcaccatgat acagatgatg 841 taacatacca agtctatgag gaacaagcag tatatgaacc tctagaaaat gaagggatag 901 aaatcacaga agtaactgct ccccctgagg ataatcctgt agaagattca caggtaattg 961 tagaagaagt aagcattttt cctgtggaag aacagcagga agtaccacca gaaacaaata 1021 gaaaaacaga tgatccagaa caaaaagcaa aagttaagaa aaagaagcct aaacttttaa 1081 ataaatttga taagactatt aaagctgaac ttgatgctgc agaaaaactc cgtaaaaggg 1141 gaaaaattga ggaagcagtg aatgcattta aagaactagt acgcaaatac cctcagagtc 1201 cacgagcaag atatgggaag gcgcagtgtg aggatgattt ggctgagaag aggagaagta 1261 atgaggtgct acgtggagcc atcgagacct accaagaggt ggccagccta cctgatgtcc 1321 ctgcagacct gctgaagctg agtttgaagc gtcgctcaga caggcaacaa tttctaggtc 1381 atatgagagg ttccctgctt accctgcaga gattagttca actatttccc aatgatactt 1441 ccttaaaaaa tgaccttggc gtgggatacc tcttgatagg agataatgac aatgcaaaga 1501 aagtttatga agaggtgctg agtgtgacac ctaatgatgg ctttgctaaa gtccattatg 1561 gcttcatcct gaaggcacag aacaaaattg ctgagagcat cccatattta aaggaaggaa 1621 tagaatccgg agatcctggc actgatgatg ggagatttta tttccacctg ggggatgcca 1681 tgcagagggt tgggaacaaa gaggcatata agtggtatga gcttgggcac aagagaggac 1741 actttgcatc tgtctggcaa cgctcactca tcaatgtgaa tggactgaaa gcacagcctt 1801 gtggcccaaa agaaacgggc tacacacagt tagtaaagtc tttagaaaga aactggaagt 1861 taatccgaga tgaaggcctt gcagtgatgg ataaagccaa aggtctcttc ctgcctgagg 1921 atgaaaacct gagggaaaaa ggggactgga gccagttcac gctgtggcag caaggaagaa 1981 gaaatgaaaa tgcctgcaaa ggagctccta aaacctgtac cttactagaa aagttccccg 2041 agacaacagg atgcagaaga ggacagatca aatattccat catgcacccc gggactcacg 2101 tgtggccgca cacagggccc acaaactgca ggctccgaat gcacctgggc ttggtgattc 2161 ccaaggaagg ctgcaagatt cgatgtgcca acgagaccaa gacctgggag gaaggcaagg 2221 tgctcatctt tgatgactcc tttgagcacg aggtatggca ggatgcctca tctttccggc 2281 tgatattcat cgtggatgtg tggcatccgg aactgacacc acagcagaga cgcagccttc 2341 cagcaattta gcatgaattc atgcaagctt gggaaactct ggagagaggc tgcctttctg 2401 gttccatctc cttgggtgtg aggatagaat ttcgaacacc aagagtcaa // LOCUS HSU03269 918 bp mRNA PRI 22-FEB-1996 DEFINITION Human actin capping protein alpha subunit (CapZ) mRNA, complete cds. ACCESSION U03269 NID g595254 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 918) AUTHORS Barron-Casella,E.A., Torres,M.A., Scherer,S.W., Heng,H.H., Tsui,L.C. and Casella,J.F. TITLE Sequence analysis and chromosomal localization of human Cap Z. Conserved residues within the actin-binding domain may link Cap Z to gelsolin/severin and profilin protein families JOURNAL J. Biol. Chem. 270 (37), 21472-21479 (1995) MEDLINE 95394897 REFERENCE 2 (bases 1 to 918) AUTHORS Barron-Casella,E.A. TITLE Direct Submission JOURNAL Submitted (05-NOV-1993) Emily A. Barron-Casella, Pediatrics, Division of Hematology, The Johns Hopkins University School of Medicine, 720 Rutland Ave, Baltimore, MD 21205, USA FEATURES Location/Qualifiers source 1..918 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pha4/2" /clone_lib="lambda gt10" /tissue_type="retina" gene 19..879 /gene="CapZ" CDS 19..879 /gene="CapZ" /codon_start=1 /product="actin capping protein alpha subunit" /db_xref="PID:g595255" /translation="MADLEEQLSDEEKVRIAAKFIIHAPPGEFNEVFNDVRLLLNNDN LLREGAAHAFAQYNLDQFTPVKIEGYEDQVLITEHGDLGNGKFLDPKNRICFKFDHLR KEATDPRPCEVENAVESWRTSVETALRAYVKEHYPNGVCTVYGKKIDGQQTIIACIES HQFQAKNFWNGRWRSEWKFTITPSTTQVVGILKIQVHYYEDGNVQLVSHKDIQDSLTV SNEVQTAKEFIKIVEAAENEYQTAISENYQTMSDTTFKALRRQLPVTRTKIDWNKILS YKIGKEMQNA" BASE COUNT 314 a 156 c 208 g 240 t ORIGIN 1 tttgtcgcca gaaggaagat ggcggatctg gaggagcagt tgtctgatga agagaaggtg 61 cgtatagcag caaaattcat cattcatgcc cctcctggag aatttaatga ggttttcaat 121 gatgttcggt tactgcttaa taatgacaat cttctcaggg aaggagcagc ccatgcattt 181 gcacagtata acttggacca gtttactcca gtaaaaattg aaggttatga agatcaggta 241 ttgataacag aacatggcga cttgggaaat ggaaagtttt tggatccaaa gaacagaatc 301 tgttttaaat ttgatcactt aaggaaggag gcaactgatc caagaccctg tgaagtagaa 361 aatgcagttg aatcatggag aacttcagta gaaactgctc tgagagctta cgtaaaagaa 421 cattacccga atggagtctg cactgtgtat ggcaaaaaaa tagatggaca gcaaaccatt 481 attgcatgca tagaaagcca tcagttccaa gcaaaaaatt tttggaatgg tcgttggagg 541 tcagaatgga agtttacaat cactccttca accactcaag tggttggcat cttgaaaatt 601 caggttcatt attatgaaga tggtaatgtt cagctagtga gtcataaaga tatacaagat 661 tccctaacag tgtctaatga agtgcaaaca gcaaaagaat ttataaagat tgtagaagct 721 gcagaaaatg aataccagac tgccatcagt gagaattatc agacaatgtc ggacactact 781 ttcaaagcct tacgtcgaca gttgccagtt acacgcacta agattgattg gaacaagatc 841 cttagctaca agattggcaa agagatgcag aatgcataag atgaacattg catgaccgga 901 tcattttagt gtctttgc // LOCUS HSU03270 1173 bp mRNA PRI 06-JUL-1994 DEFINITION Human centrin mRNA, complete cds. ACCESSION U03270 NID g414992 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1173) AUTHORS Errabolu,R., Sanders,M.A. and Salisbury,J.L. TITLE Cloning of a cDNA encoding human centrin, an EF-hand protein of centrosomes and mitotic spindle poles JOURNAL J. Cell Sci. 107, 9-16 (1994) MEDLINE 94230620 REFERENCE 2 (bases 1 to 1173) AUTHORS Salisbury,J.L. TITLE Direct Submission JOURNAL Submitted (05-NOV-1993) Jeffrey L. Salisbury, Biochemistry and Molecular Biology, Mayo Clinic, 200 First Street SW, Rochester, MN 55905, USA FEATURES Location/Qualifiers source 1..1173 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="Hcen-1" /clone_lib="lambda gt11 (HL 1010b, Clonetech Laboratories, Palo Alto, CA)" /sex="male" /tissue_type="testis" /dev_stage="mature" 5'UTR 7..48 mRNA 49..567 CDS 49..567 /note="calcium binding protein" /codon_start=1 /evidence=experimental /product="centrin" /db_xref="PID:g414993" /translation="MASGFKKPSAASTGQKRKVAPKPELTEDQKQEVREAFDLFDVDG SGTIDAKELKVAMRALGFEPRKEEMKKMISEVDREGTGKISFNDFLAVMTQKMSEKDT KEEILKAFRLFDDDETGKISFKNLKRVANELGENLTDEELQEMIDEADRDGDGEVNEE EFLRIMKKTSLY" 3'UTR 568..1167 BASE COUNT 309 a 261 c 323 g 280 t ORIGIN 1 gaattcgggg ggcggtgccg ttgggaccac ggcggccaga gcggcaggat ggcttccggc 61 ttcaagaagc ccagcgctgc ctccaccggc caaaagagaa aggtggcacc taagcccgag 121 ctcactgagg atcagaagca agaagttcgg gaagcatttg acctcttcga cgtggacgga 181 agtgggacca tcgacgcgaa ggagctgaag gtggccatga gagcgctggg cttcgaaccc 241 aggaaggaag agatgaagaa aatgatctcc gaggtggaca gggaaggcac ggggaagatc 301 agcttcaatg acttcctggc cgtgatgacg cagaagatgt ccgagaagga caccaaagaa 361 gaaatcctga aggccttcag gctctttgat gacgatgaga ccgggaagat ctcgttcaaa 421 aacctgaagc gtgtggccaa cgagctgggg gagaacctca cggatgagga gctgcaggag 481 atgatcgacg aagctgatcg ggatggggac ggcgaagtga acgaggagga gttccttcgg 541 atcatgaaga agaccagcct ttactgaagt cggttcagaa gctaaagtga ctctctgggt 601 tgcctgcttc cattttgtga aaccttagag gacagcggct gcctgtccct tcttcacccc 661 ctcaccccca taatttgtct agatctattt ccatatctct agttcaataa tagaatttga 721 aagatgcttg taatgtgagt tttgggtttt aattctcaag agccaacctg gagcacatga 781 ggttaaacaa agggccctga agtttgagtg cgccctccat ttgccctgtg ctgaacttgc 841 tgttcatctg ttgatctgga ggcaggacag cttctgggac acacaaaaat gtggttccct 901 ttgtcacttc tttggtggtc ttaaattatc ttgcttcata tatcattcct taaattccag 961 tcattgttcc agcataatga gatggaatct gccagtagat ttgcctagcc tgtccactta 1021 gctgaatacc agtttgaagg aaaacagggt ggccacttac aaacttacgg agctcaggac 1081 agatattctt ataaagaata gacttgcttg ggtggtagta cgttgtgcaa ttttgactat 1141 tcactggctt tatacctgca aatgcccgaa ttc // LOCUS HSU03271 1077 bp mRNA PRI 14-FEB-1996 DEFINITION Human F-actin capping protein beta subunit mRNA, complete cds. ACCESSION U03271 NID g595256 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1077) AUTHORS Barron-Casella,E.A., Torres,M.A., Scherer,S.W., Heng,H.H., Tsui,L.C. and Casella,J.F. TITLE Sequence analysis and chromosomal localization of human Cap Z. Conserved residues within the actin-binding domain may link Cap Z to gelsolin/severin and profilin protein families JOURNAL J. Biol. Chem. 270 (37), 21472-21479 (1995) MEDLINE 95394897 REFERENCE 2 (bases 1 to 1077) AUTHORS Barron-Casella,E.A. TITLE Direct Submission JOURNAL Submitted (05-NOV-1993) Emily A. Barron-Casella, Pediatrics, Division of Hematology, Johns Hopkins University School of Medicine, 720 Rutland Avenue, Baltimore, MD 21205, USA FEATURES Location/Qualifiers source 1..1077 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="phb4/2" /clone_lib="lambda gt10" /tissue_type="retina" CDS 1..819 /codon_start=1 /product="F-actin capping protein beta subunit" /db_xref="PID:g595257" /translation="MSDQQLDCALDLMRRLPPQQIEKNLSDLIDLVPSLCEDLLSSVD QPLKIARDKVVGKDYLLCDYNRDGDSYRSPWSNKYDPPLEDGAMPSARLRKLEVEANN AFDQYRDLYFEGGVSSVYLWDLDHGFAGVILIKKAGDGSKKIKGCWDSIHVVEVQEKS SGRTAHYKLTSTVMLWLQTNKSGSGTMNLGGSLTRQMEKDETVSDCSPHIANIGRLVE DMENKIRSTLNEIYFGKTKDIVNGLRSVQTFADKSKQEALKNDLVEALKRKQQC" BASE COUNT 280 a 284 c 270 g 243 t ORIGIN 1 atgagtgatc agcagctgga ctgtgccttg gacctaatga ggcgcctgcc tccccagcaa 61 atcgagaaaa acctcagcga cctgatcgac ctggtcccca gtctatgtga ggatctcctg 121 tcttctgttg accagccact gaaaattgcc agagacaagg tggtgggaaa ggattacctt 181 ttgtgtgact acaacagaga tggggactcc tataggtcac catggagtaa caagtatgac 241 cctcccttgg aggatggggc catgccgtca gctcggctga gaaagctgga ggtggaagcc 301 aacaatgcct ttgaccagta tcgagacctg tattttgaag gtggcgtctc atctgtctac 361 ctctgggatc tggatcatgg ctttgctgga gtgatcctca taaagaaggc tggagatgga 421 tcaaagaaga tcaaaggctg ctgggattcc atccacgtgg tagaagtgca ggagaaatcc 481 agcggtcgca ccgcccatta caagttgacc tccacggtga tgctgtggct gcagaccaac 541 aaatctggct ctggcaccat gaacctcgga ggcagcctta ccagacagat ggagaaggat 601 gaaactgtga gtgactgctc cccacacata gccaacatcg ggcgcctggt agaggacatg 661 gaaaataaaa tcagaagtac gctgaacgag atctactttg gaaaaacaaa ggatatcgtc 721 aatgggctga ggtctgtgca gacttttgca gacaaatcaa aacaagaagc tctgaagaat 781 gacctggtgg aggctttgaa gagaaagcag caatgctaaa cctctgtttc atgctaacca 841 gacacgccgt gcactcgtta gattcctttc ttagaaaact cgttttctgc tcccttccct 901 cgtcccttcc ctccccgaca ggtcacataa cagctgcatc attgaccgca cagcgccatc 961 tctccctgag aataaagccg atagccacct cctccggctc cgagcctgct tctgccacac 1021 ctcgctctca gtctctccac atttccatag agaccgtgtg gtttttgttc acccggg // LOCUS HSU03272 10172 bp mRNA PRI 11-JUN-1994 DEFINITION Human fibrillin-2 mRNA, complete cds. ACCESSION U03272 NID g437971 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 10172) AUTHORS Zhang,H., Apfelroth,S.D., Hu,W., Davis,E.C., Sanguineti,C., Bonadio,J., Mecham,R.P. and Ramirez,F. TITLE Structure and expression of fibrillin-2, a novel microfibrillar component preferentially located in elastic matrices JOURNAL J. Cell Biol. 124, 855-863 (1994) MEDLINE 94165150 REFERENCE 2 (bases 1 to 10172) AUTHORS Ramirez,F. TITLE Direct Submission JOURNAL Submitted (05-NOV-1993) Francesco Ramirez, Brookdale Center for Molecular Biology, Mt.Sinai Medical Center, 1 Gustave L. Levy Place, New York, NY 10029, USA FEATURES Location/Qualifiers source 1..10172 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="MG-63, osteosarcoma cell line" CDS 1..8736 /codon_start=1 /product="fibrillin-2" /db_xref="PID:g437972" /translation="MGRRRRLCLQLYFLWLGCVVLWAQGTAGQPQPPPPKPPRPQPPP QQVRSATAGSEGGFLAPEYREEGAAVASRVRRRGQQDVLRGPNVCGSRFHSYCCPGWK TLPGGNQCIVPICRNSCGDGFCSRPNMCTCSSGQISSTCGSKSIQQCSVRCMNGGTCA DDHCQCQKGYIGTYCGQPVCENGCQNGGRCIAQPCACVYGFTGPQCERDYRTGPCFTQ VNNQMCQGQLTGIVCTKTLCCATTGRAWGHPCEMCPAQPQPCRRGFIPNIRTGACQDV DECQAIPGICQGGNCINTVGSFECRCPAGHKQSETTQKCEDIDECSIIPGICETGECS NTVGSYFCVCPRGYVTSTDGSRCIDQRTGMCFSGLVNGRCAQELPGRMTKMQCCCEPG RCWGIGTIPEACPVRGSEEYRRLCMDGLPMGGIPGSAGSRPGGTGGNGFAPSGNGNGY GPGGTGFIPIPGGNGFSPGVGGAGVGAGGQGPIITGLTILNQTIDICKHHANLCLNGR CIPTVSSYRCECNMGYKQDANGDCIDVDECTSNPCTNGDCVNTPGSYYCKCHAGFQRT PTKQACIDIDECIQNGVLCKNGRCVNSDGSFQCICNAGFELTTDGKNCVDHDECTTTN MCLNGMCINEDGSFKCICKPGFVLAPNGRYCTDVDECQTPGICMNGHCINSEGSFRCD CPPGLAVGMDGRVCVDTHMRSTCYGGIKKGVCVRPFPGAVTKSECCCANPDYGFGEPC QPCPAKNSAEFHGLCSSGVGITVDGRDINECALDPDICANGICENLRGSYRCNCNSGY EPDASGRNCIDIDECLVNRLLCDNGLCRNTPGSYSCTCPPGYVFRTETETCEDINECE SNPCVNGACRNNLGSFNCECSPGSKLSSTGLICIDSLKGTCWLNIQDSRCEVNINGAT LKSECCATLGAAWGSPCERCELDTACPRGLARIKGVTCEDVNECEVFPGVCPNGRCVN SKGSFHCECPEGLTLDGTGRVCLDIRMEQCYLKWDEDECIHPVPGKFRMDACCCAVGA AWGTECEECPKPGTKEYETLCPRGAGFANRGDVLTGRPFYKDINECKAFPGMCTYGKC RNTIGSFKCRCNSGFALDMEERNCTDIDECRISPDLCGSGICVNTPGSFECECFEGYE SGFMMMKNCMDIDGCERNPLLCRGGTCVNTEGSFQCDCPLGHELSPSREDCVDINECS LSDNLCRNGKCVNMIGTYQCSCNPGYQATPDRQGCTDIDECMIMNGGCDTQCTNSEGS YECSCSEGYALMPDGRSCADIDECENNPDICDGGQCTNIPGEYRCLCYDGFMASMDMK TCIDVNECDLNSNICMFGECENTKGSFICHCQLGYSVKKGTTGCTDVDECEIGAHNCD MHASCLNIPGSFKCSCREGWIGNGIKCIDLDECSNGTHQCSINAQCVNTPGSYRCACS EGFTGDGFTCSDVDECAENINLCENGQCLNVPGAYRCECEMGFTPASDSRSCQDIDEC SFQNICVSGTCNNLPGMFHCICDDGYELDRTGGNCTDIDECADPINCVNGLCVNTPGR YECNCPPDFQLNPTGVGCVDNRVGNCYLKFGPRGDGSLSCNTEIGVGVSRSSCCCSLG KAWGNPCETCPPVNSTEYYTLCPGGEGFRPNPITIILEDIDECQELPGLCQGGNCINT FGSFQCECPQGYYLSEDTRICEDIDECFAHPGVCGPGTCYNTLGNYTCICPPEYMQVN GGHNCMDMRKSFCYRSYNGTTCENELPFNVTKRMCCCTYNVGKAGNKPCEPCPTPGTA DFKTICGNIPGFTFDIHTGKAVDIDECKEIPGICANGVCINQIGSFRCECPTGFSYND LLLVCEDIDECSNGDNLCQRNADCINSPGSYRCECAAGFKLSPNGACVDRNECLEIPN VCSHGLCVDLQGSYQCICHNGFKASQDQTMCMDVDECERHPCGNGTCKNTVGSYNCLC YPGFELTHNNDCLDIDECSSFFGQVCRNGRCFNEIGSFKCLCNEGYELTPDGKNCIDT NECVALPGSCSPGTCQNLEGSFRCICPPGYEVKSENCIDINECDEDPNICLFGSCTNT PGGFQCLCPPGFVLSDNGRRCFDTRQSFCFTNFENGKCSVPKAFNTTKAKCCCSKMPG EGWGDPCELCPKDDEVAFQDLCPYGHGTVPSLHDTREDVNECLESPGICSNGQCINTD GSFRCECPMGYNLDYTGVRCVDTDECSIGNPCGNGTCTNVIGSFECNCNEGFEPGPMM NCEDINECAQNPLLCALRCMNTFGSYECTCPIGYALREDQKMCKDLDECAEGLHDCES RGMMCKNLIGTFMCICPPGMARRPDGEGCVDENECRTKPGICENGRCVNIIGSYRCEC NEGFQSSSSGTECLDNRQGLCFAEVLQTICQMASSSRNLVTKSECCCDGGRGWGHQCE LCPLPGTAQYKKICPHGPGYTTDGRDIDECKVMPNLCTNGQCINTMGSFRCFCKVGYT TDISGTSCIDLDECSQSPKPCNYICKNTEGSYQCSCPRGYVLQEDGKTCKDLDECQTK QHNCQFLCVNTLGGFTCKCPPGFTQHHTACIDNNECGSQPLLCGGKGICQNTPGSFSC ECQRGFSLDATGLNCEDVDECDGNHRCQHGCQNILGGYRCGCPQGYIQHYQWNQCVDE NECSNPNACGSASCYNTLGSYKCACPSGFSFDQFSSACHDVNECSSSKNPCNYGCSNT EGGYLCGCPPGYYRVGQGHCVSGMGFNKGQYLSLDTEVDEENALSPEACYECKINGYP KKDSRQKRSIHEPDPTAVEQISLESVDMDSPVNMKFNLSHLGSKEHILELRPAIQPLN NHIRYVISQGNDDSVFRIHQRNGLSYLHTAKKKLMPGTYTLEITSIPLYKKKELKKLE ESNEDDYLLGELGEALRMRLQIQLY" 3'UTR 8737..10172 BASE COUNT 2694 a 2248 c 2609 g 2621 t ORIGIN 1 atggggagaa gacggaggct gtgtctccag ctctacttcc tgtggctggg ctgtgtggtg 61 ctctgggcgc agggcacggc cggccagcct cagcctcctc cgcccaagcc gccccggccc 121 cagccgccgc cgcaacaggt tcggtccgct acagcaggct ctgaaggcgg gtttctagcg 181 cccgagtatc gcgaggaggg tgccgcagtg gccagccgcg tccgccggcg aggacagcag 241 gacgtgctcc gagggcccaa cgtgtgcggc tccagattcc actcctactg ctgccctgga 301 tggaagacgc tccctggagg aaaccagtgc attgtcccga tttgtagaaa tagttgtgga 361 gatggatttt gttcccgtcc taacatgtgt acttgttcca gtgggcaaat atcatcaacc 421 tgtggatcaa aatcaattca gcagtgcagt gtgagatgca tgaatggtgg gacctgtgca 481 gatgaccact gccagtgcca gaaaggatat attggaactt attgtggaca acctgtctgt 541 gaaaatggat gtcagaatgg tggacgttgc atcgcccaac cgtgtgcttg tgtttatggg 601 ttcactggtc cacagtgtga aagagattac aggacaggcc cgtgtttcac tcaggtcaac 661 aaccagatgt gccaagggca gctgacaggc attgtctgca cgaagactct gtgctgtgcc 721 accactggac gggcgtgggg ccatccctgt gagatgtgtc cagcccagcc tcagccctgc 781 cgacggggtt tcatccccaa catccgcact ggagcttgcc aagatgttga tgaatgccag 841 gctatcccag ggatatgcca aggaggaaac tgtatcaata cagtgggctc ttttgaatgc 901 agatgccctg ctggtcacaa acagagtgaa actactcaga aatgtgaaga cattgatgag 961 tgcagcatca ttcctgggat atgtgaaact ggtgaatgtt ccaacaccgt gggaagctat 1021 ttttgtgttt gtccacgtgg atatgtaacc tcaacagatg gctctcgatg catcgatcag 1081 agaacaggca tgtgtttctc gggcctggtg aatggccgct gtgcacaaga gctcccgggg 1141 agaatgacga aaatgcagtg ctgctgtgag cctggccgct gctggggcat cggaaccatt 1201 cctgaagcct gtcctgtcag aggttctgag gaatatcgca gactttgcat ggatggactt 1261 ccaatgggag gaattccagg gagtgctggt tccagacctg gaggcactgg gggaaatggc 1321 tttgccccaa gtggcaatgg caatggctat ggcccaggag ggacaggctt catccccatc 1381 cctggaggca atggcttttc tcctggcgtt gggggagccg gtgtgggggc cgggggacag 1441 ggacctatca tcactggact aacaattctg aaccagacaa tagatatctg taagcatcat 1501 gctaaccttt gtttaaatgg acgctgtata ccaactgtct caagctaccg atgtgaatgc 1561 aacatgggtt ataagcagga tgcaaatgga gattgtatag atgttgatga atgcacatca 1621 aatccctgca ctaatggaga ttgtgttaac acacctggtt cctattattg taaatgtcat 1681 gctggattcc agaggactcc taccaagcaa gcatgcattg atattgatga gtgcatccag 1741 aatggggttc tttgtaaaaa cggtcgatgc gtgaactcag atggaagttt ccagtgcatt 1801 tgcaatgccg gctttgaatt aactacagat ggaaaaaact gtgttgatca tgatgaatgt 1861 acaactacca acatgtgttt gaatggaatg tgcatcaatg aagatggcag cttcaagtgc 1921 atctgcaaac caggatttgt cttggctcca aatgggcgtt actgtactga tgttgatgaa 1981 tgccagaccc caggaatctg catgaatggg cactgcatca acagtgaagg gtccttccgc 2041 tgtgactgtc ccccaggcct ggctgtgggc atggatggac gtgtgtgtgt tgatactcac 2101 atgcgcagta cctgctatgg aggaatcaag aaaggagtgt gtgtgcgtcc tttccccggt 2161 gcagtgacca agtccgaatg ctgctgtgcc aatccagact atggttttgg agaaccctgc 2221 cagccatgcc ctgcaaaaaa ttcagctgaa ttccacggcc tttgtagtag tggagtaggt 2281 atcactgtgg atggaagaga tatcaatgaa tgtgctttgg atcctgatat atgtgccaat 2341 gggatttgtg aaaacttacg tggtagttac cgttgtaatt gcaacagtgg ctatgaacca 2401 gatgcctctg gaagaaactg tattgacatt gatgaatgtt tagtaaacag actgctttgt 2461 gataacggat tgtgccgaaa cacgccagga agttacagct gtacgtgccc accagggtat 2521 gtgttcagga ctgagacaga gacctgtgaa gatataaatg aatgtgaaag caacccatgt 2581 gtcaatgggg cctgcagaaa caaccttgga tctttcaatt gtgaatgttc gcccggcagc 2641 aaactcagct ccacaggatt gatctgtatt gacagcctga aggggacctg ttggctcaac 2701 atccaggaca gccgctgtga ggtgaatatt aatggagcca ctctgaaatc tgaatgctgt 2761 gccaccctcg gagccgcctg ggggagcccc tgtgagcggt gtgaactaga tacagcttgc 2821 ccaagagggc ttgccaggat taaaggtgtt acgtgtgaag atgttaatga gtgtgaggtg 2881 ttccctggcg tttgtccaaa tggacgctgt gtcaacagta agggatcttt tcattgcgag 2941 tgccctgaag gccttacgtt ggatgggact ggccgtgtat gtttggatat tcgcatggag 3001 cagtgttact tgaagtggga tgaagatgaa tgcatccacc ccgttcctgg aaagttccgc 3061 atggatgcct gctgctgtgc tgtcggggcg gcttggggca ccgagtgtga ggagtgcccc 3121 aaacctggca ccaaggaata cgagacactg tgcccccgcg gggctggctt tgctaaccga 3181 ggggatgttc ttactgggcg gccattttac aaagacatca atgaatgcaa agcatttcct 3241 gggatgtgca cttatgggaa gtgcagaaat acaatcggaa gcttcaaatg ccgttgcaat 3301 agtggctttg ctctagacat ggaggaaaga aactgcacgg acatcgacga gtgcaggatt 3361 tctcctgacc tctgtggcag tggaatctgc gtcaatacac cgggcagctt tgagtgcgag 3421 tgcttcgaag gctatgaaag tggcttcatg atgatgaaga actgcatgga cattgacgga 3481 tgtgaacgta accctctcct ttgtaggggt ggcacctgtg tgaacactga gggcagcttt 3541 cagtgtgact gcccactggg acacgagctg tcaccatccc gtgaggactg tgtggatatt 3601 aatgaatgct ccctgagtga caatctctgc agaaatggaa aatgtgtgaa catgattgga 3661 acctatcagt gctcttgcaa tcctggatat caggctacgc cagaccgcca gggctgtaca 3721 gatattgatg aatgtatgat aatgaacgga ggctgtgaca cccagtgcac aaattcagag 3781 ggaagctacg aatgcagctg cagtgagggt tatgccctga tgccagatgg gagatcgtgt 3841 gcagacattg atgaatgtga aaacaatcct gatatctgtg atggcggcca gtgtaccaac 3901 attcctggag agtatcgctg cctctgctat gatggcttca tggcttccat ggacatgaaa 3961 acatgcattg atgtcaatga atgtgaccta aattcaaata tctgcatgtt tggggaatgt 4021 gagaacacaa agggatcctt catttgccac tgtcagctgg gttactcagt gaagaagggg 4081 accacaggat gtacagatgt ggatgagtgt gaaattggtg ctcataactg cgacatgcat 4141 gcctcatgtc tgaatatccc aggaagcttc aagtgtagct gcagagaagg ctggattgga 4201 aacggcatca agtgtattga tctggacgaa tgttctaatg gaacccacca gtgtagcatc 4261 aatgctcagt gtgtaaatac cccgggctca taccgctgtg cctgctccga aggtttcact 4321 ggtgatggct ttacctgctc agatgttgat gagtgtgcag aaaacataaa cctctgtgag 4381 aacggacagt gccttaatgt cccgggtgca tatcgctgcg agtgtgagat gggcttcact 4441 ccagcctcag acagcagatc ctgccaagat attgatgaat gctccttcca aaacatttgt 4501 gtctctggaa catgtaataa cctgcctgga atgtttcatt gcatctgcga tgatggttat 4561 gaattggaca gaacaggagg gaactgtaca gatattgatg agtgtgcaga tcctataaac 4621 tgtgtcaatg gcctatgtgt caacacgcct ggtcgctatg agtgtaactg cccacccgat 4681 tttcagttga acccaactgg tgtgggttgt gttgacaacc gtgtgggcaa ctgctacctg 4741 aagtttggac ctcgaggaga tgggagtctg tcttgcaaca ccgagatcgg ggtgggcgtc 4801 agtcgctctt catgctgctg ctctctggga aaggcctggg gaaacccctg tgagacatgc 4861 ccccctgtca atagcactga atattacacc ctgtgtcccg gaggtgaagg cttcagacct 4921 aaccccatca caatcatttt agaagacatt gacgaatgcc aggagttacc aggtctctgc 4981 cagggtggaa actgcatcaa cacttttggg agcttccagt gtgagtgccc acaaggctac 5041 tacctcagcg aggatacccg catctgtgag gatattgatg agtgttttgc acatcctggt 5101 gtgtgtgggc ctgggacctg ctataacacc ctgggaaatt acacctgcat ttgcccacct 5161 gagtacatgc aggtcaatgg aggccacaac tgcatggaca tgagaaaaag cttttgctac 5221 cgaagctata atggaaccac ttgtgagaat gagttgcctt tcaatgtgac aaaaaggatg 5281 tgctgctgca catataatgt gggcaaagct gggaacaaac cttgtgaacc atgcccaact 5341 ccaggaacag ctgactttaa aaccatatgt ggaaatattc ctggattcac ctttgacatt 5401 cacacaggaa aagctgttga cattgatgaa tgtaaagaga ttccaggcat ttgtgcaaat 5461 ggtgtgtgca ttaaccagat tggcagtttc cgctgtgaat gccctacagg attcagttac 5521 aatgacctgc tgttggtttg tgaagatata gatgagtgca gcaatggtga taatctctgc 5581 cagcggaatg cagactgcat caatagtcct ggtagttacc gctgtgaatg tgccgcgggt 5641 ttcaaacttt cacccaatgg ggcctgtgta gatcgcaatg aatgtttaga aattcctaac 5701 gtttgcagtc atggcttgtg tgttgatctg caaggaagtt accagtgcat ctgccacaat 5761 ggctttaagg cttctcagga ccagaccatg tgcatggatg ttgatgagtg cgagcggcac 5821 ccatgtggaa atggaacttg taaaaacacc gttggatcct ataactgtct gtgctaccca 5881 gggtttgaac tcactcataa taatgattgc ctggacatag atgagtgcag ttcctttttt 5941 ggtcaggtgt gcagaaatgg acgttgtttt aatgaaattg gttctttcaa gtgtctatgt 6001 aacgaaggtt atgaacttac cccagatggc aaaaactgta tagacactaa tgagtgtgtc 6061 gcccttcccg gctcttgctc tcctggtacc tgtcagaatt tggagggatc cttcagatgc 6121 atctgtcccc cagggtatga agtaaaaagc gagaactgca ttgatataaa tgaatgtgat 6181 gaagatccca acatttgtct ttttggttcc tgtactaata ctccaggggg cttccagtgc 6241 ctctgccccc ctggctttgt actatctgat aatggacgga gatgctttga tactcgccag 6301 agcttctgct tcacaaattt tgaaaatgga aagtgttctg tacccaaagc tttcaacacc 6361 acaaaagcaa aatgctgctg tagtaagatg ccaggagagg gctgggggga cccctgtgag 6421 ctgtgcccca aagacgatga agttgcattt caggatttgt gtccatatgg ccatggaact 6481 gtccctagtc ttcatgatac acgtgaagat gtcaatgagt gtcttgagag cccaggcatt 6541 tgttcaaatg gtcaatgtat caacaccgac ggatcttttc gctgtgaatg tccaatgggc 6601 tacaaccttg actacactgg agtacgctgt gtggatactg atgagtgttc aatcggcaat 6661 ccgtgtggaa atggtacatg caccaatgtt attgggagtt ttgaatgcaa ttgcaatgaa 6721 ggctttgagc cagggcccat gatgaattgt gaagatatca acgaatgtgc ccagaaccca 6781 ctgctgtgtg ctttacgctg catgaacact tttgggtcct atgaatgcac gtgcccgatt 6841 ggctatgccc tcagggaaga tcaaaagatg tgcaaagatc tggatgaatg tgctgaaggg 6901 ttacacgact gtgaatctag gggcatgatg tgtaagaatc taatcggcac cttcatgtgc 6961 atctgccctc ctggaatggc ccgaaggccc gatggagaag gctgtgtaga tgaaaatgaa 7021 tgcaggacca agccaggaat ctgtgaaaat ggacgttgtg ttaacattat tggaagctat 7081 agatgtgagt gtaatgaagg attccagtca agttcttcag gcactgaatg ccttgacaat 7141 cgacagggtc tctgctttgc agaggtactg cagacaatat gtcaaatggc atccagtagt 7201 cgcaatctcg tcactaagtc agaatgctgc tgtgatggtg ggcgaggctg gggccaccag 7261 tgcgagcttt gcccacttcc tggaactgcc cagtacaaaa agatatgtcc tcatggccca 7321 ggatatacaa ctgatggaag agatattgat gaatgtaagg taatgccaaa cctctgcacc 7381 aatggtcagt gcatcaatac catgggctca ttccgatgct tctgcaaggt tggctacacc 7441 acagacatca gtggaacctc ttgtatagac cttgatgaat gctcccagtc cccgaaacca 7501 tgcaactaca tctgcaagaa cactgagggg agttatcagt gttcatgtcc gagggggtat 7561 gtcctgcaag aggatggaaa gacatgcaaa gaccttgatg aatgtcaaac aaagcagcat 7621 aactgccagt tcctctgtgt caacaccctg ggggggttta cctgtaaatg tccacctggt 7681 ttcacacagc atcacactgc ttgtatcgac aacaacgaat gtgggtctca acctttgctt 7741 tgtggaggaa agggaatctg tcaaaacact ccaggcagtt tcagctgtga atgccaaaga 7801 gggttctctc ttgatgccac cggactgaac tgtgaagatg ttgatgaatg tgatgggaac 7861 cacaggtgcc aacacggctg ccagaacatc ctgggtggct acagatgtgg ctgcccccaa 7921 ggctacatcc agcactacca gtggaatcag tgtgtcgatg agaatgaatg ctccaatccc 7981 aatgcctgtg gctctgcttc ctgctacaac accctgggga gttacaagtg cgcctgcccc 8041 tcggggttct ccttcgacca gttctccagt gcctgccacg acgtgaatga gtgctcgtcc 8101 tccaagaacc cctgcaatta cggctgctct aacacggagg ggggctacct ctgtggctgc 8161 ccccctgggt attacagagt gggacaaggc cactgtgtct caggaatggg atttaacaag 8221 gggcagtacc tgtcactgga tacagaggtc gatgaggaaa atgctctgtc cccagaagca 8281 tgctacgagt gcaaaatcaa cggctatcct aagaaagaca gcaggcagaa gagaagtatt 8341 catgaacctg atcccactgc tgttgaacag atcagcctag agagtgtcga catggacagc 8401 cccgtcaaca tgaagttcaa cctctcccac ctcggctcta aggagcacat cctggaacta 8461 aggcccgcca tccagcccct caacaaccac atccgttatg tcatctctca agggaacgat 8521 gacagcgtct tccgcatcca ccaaaggaat gggctcagct acttgcacac ggccaagaag 8581 aagctcatgc ccggcacata cacactggaa atcactagca tccctctcta caagaagaag 8641 gagcttaaga aactggaaga gagcaatgag gatgactacc tcctagggga gcttggggag 8701 gctctcagaa tgaggctgca gattcagctc tattaaccgt tcacagactt gggcccaggc 8761 tcaaatccta gcacagccag tctgcagaag catttgaaaa gtcaaggact aattttaaag 8821 aggaaaaata ataataactc ttgtttcttt cctccctgtc ttagactttg aatgttgacc 8881 ctcacaggga gggataattt agactctggt atggccaaag atttgagctc aaaggcaacc 8941 gtggttactg tattttttat ataacttcat tttaaaatat attaaaagaa acctaaatgt 9001 tcaagatatc agcatatggc actaaatgca caaaaataat gtgagctttt tttttttttt 9061 cctgttagca gtctgtaaca ctttgggtat tttgctatag ttgctaatta aaaaaatata 9121 gatgtttatt tatttttaat gcagtaatat atggagaaat gaacaaacta tgtaaacaaa 9181 aagggaaact cacttgtttt tctttagatt tataaatttg agctattttt tttagaggtg 9241 ctttttaaaa atccaataga tacaagagat gtttcctttg gttttctgcc agtcatccag 9301 ctgatacaca cctgatcgat tttaaagaaa gccacacaga gctgaatcgg gcagtgctaa 9361 tcaataattt aaaagacatg aatgtcatta gatcctttat aacgtagatc gaagccaaag 9421 cagctcattt gtgacaacat ttcatatcac cagacacacc aggcaacaga agttgaagca 9481 caaccactgt agcaaaatac cttgactgct tgtgagacca ttagcattgc aggccaaacc 9541 gtactgtatt tccttctcat aacctcaagg aaccatatgt gctacccaca acacctcatt 9601 cttacccagg gtgcgctgcg tcctcatggt actgtaggca gctgaagaac cgccgttccc 9661 ttgaaaggga acacctggca ttctgtggtg tttcgtgctg tcttaaataa tggtgcattt 9721 attatgttca agttatttca ggattgccat atgtgcaaac aaatcatgca atgcagccaa 9781 ggaatatatg ttgttgttgt tgttttaaac ccattttttt tttagaattt tcattaatac 9841 tgtagttata caccatatgc ctcattttat catagcctat tgtgtatgaa agatgtttgt 9901 acaatgaatt gatgtttagt ttgctttagt catttaaaaa gatattgtac caggatgtgc 9961 tattaagagc acgtatccat tattcttctc aacccaagaa cctgtttcct ggaccagtga 10021 ccaaacctca tatgtgaaat ggccaaagca catgcaggct cctggttgtt cctctcaaac 10081 ctgtgctgac caaagattag taaccagtta tacccagtat tttgaggttt tattgttttt 10141 ttaataacta aaaaaaaact cgtgccgaat tc // LOCUS HSU03274 2016 bp mRNA PRI 07-APR-1994 DEFINITION Human biotinidase mRNA, complete cds. ACCESSION U03274 NID g468823 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2016) AUTHORS Cole,H., Reynolds,T.R., Lockyer,J.M., Buck,G.A., Denson,T., Spence,J.E., Hymes,J. and Wolf,B. TITLE Human serum biotinidase. cDNA cloning, sequence and characterization JOURNAL J. Biol. Chem. 269, 6566-6570 (1994) MEDLINE 94165042 REFERENCE 2 (bases 1 to 2016) AUTHORS Cole,H. TITLE Direct Submission JOURNAL Submitted (05-NOV-1993) Cole H., Virginia Commonwealth University, Human Genetics, 1101 East Marshall St., Rm 11-059 Sanger Hall, Richmond, VA 22398-0033, USA FEATURES Location/Qualifiers source 1..2016 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="BTD2000" /clone_lib="human liver cDNA" /germline /sex="male" /cell_type="hepatocyte" /tissue_type="liver" CDS 36..1667 /EC_number="3.5.1.12" /codon_start=1 /function="catalytic release of biotin from biocytin" /product="biotinidase" /db_xref="PID:g468824" /translation="MAHAHIQGGRRAKSRFVVCIMSGARSKLALFLCGCYVVALGAHT GEESVADHHEAEYYVAAVYEHPSILSLNPLALISRQEALELMNQNLDIYEQQVMTAAQ KDVQIIVFPEDGIHGFNFTRTSIYPFLDFMPSPQVVRWNPCLEPHRFNDTEVLQRLSC MAIRGDMFLVANLGTKEPCHSSDPRCPKDGRYQFNTNVVFSNNGTLVDRYRKHNLYFE AAFDVPLKVDLITFDTPFAGRFGIFTCFDILFFDPAIRVLRDYKVKHVVYPTAWMNQL PLLAAIEIQKAFAVAFGINVLAANVHHPVLGMTGSGIHTPLESFWYHDMENPKSHLII AQVAKNPVGLIGAENATGETDPSHSKFLKILSGDPYCEKDAQEVHCDEATKWNVNAPP TFHSEMMYDNFTLVPVWGKEGYLHVCSNGLCCYLLYERPTLSKELYALGVFDGLHTVH GTYYIQVCALVRCGGLGFDTCGQEITEATGIFEFHLWGNFSTSYIFPLFLTSGMTLEV PDQLGWENDHYFLRKSRLSSGLVTAALYGRLYERD" mat_peptide 159..1664 /EC_number="3.5.1.12" /function="catalytic release of biotin from biocytin" /evidence=experimental /product="biotinidase" polyA_signal 1967..1972 BASE COUNT 500 a 493 c 498 g 525 t ORIGIN 1 gccagctgga gcgttttcgg ggctgtaaag ggagaatggc gcatgcgcat attcagggcg 61 gaaggcgcgc taagagcaga tttgtggtct gcattatgtc tggagccaga agtaagcttg 121 ctcttttcct ctgcggctgt tacgtggttg ccctgggagc ccacaccggg gaggagagcg 181 tggctgacca tcacgaggct gaatattatg tggctgccgt gtatgagcat ccatccatcc 241 tgagtctgaa ccctctggct ctcatcagcc gccaagaggc cttggagctc atgaaccaga 301 accttgacat ctatgaacag caagtgatga ctgcagccca aaaggatgta cagattatag 361 tgtttccaga agatggcatt catggattca actttacaag aacatccatt tatccatttt 421 tggacttcat gccgtctccc caggtggtca ggtggaaccc atgcctggag cctcaccgct 481 tcaatgacac agaggtgctc cagcgcctga gttgtatggc catcagggga gatatgttct 541 tggtggccaa tcttgggaca aaggagcctt gtcatagcag tgacccaagg tgcccaaaag 601 atgggagata ccagttcaac acaaatgtcg tgttcagcaa taatggaacc cttgttgacc 661 gctaccgtaa acacaacctc tactttgagg cagcattcga tgttcctctt aaagtggatc 721 tcatcacctt tgataccccc tttgctggca ggtttggcat cttcacatgc tttgatatat 781 tgttctttga ccctgccatc agagtcctca gagactacaa ggtgaagcat gttgtgtacc 841 caactgcctg gatgaaccag ctcccactct tggcagcaat tgagattcag aaagcttttg 901 ctgttgcctt tggcatcaac gttctggcag ctaatgtcca ccacccagtt ctggggatga 961 caggaagtgg catacacacc cctctggagt ccttttggta ccatgacatg gaaaatccca 1021 aaagtcacct tataattgcc caggtggcca aaaatccagt gggtctcatt ggtgcagaga 1081 atgcaacagg tgaaacggac ccatcccata gtaagttttt aaaaattttg tcaggcgatc 1141 cgtactgtga gaaggatgct caggaagtcc actgtgatga ggccaccaag tggaacgtga 1201 atgctcctcc cacatttcac tctgagatga tgtatgacaa tttcaccctg gtccctgtct 1261 ggggaaagga aggctatctc cacgtctgtt ccaatggcct ctgctgttat ttactttacg 1321 agaggcccac cttatccaaa gagctgtatg ccctgggggt ctttgatggg cttcacacag 1381 tacatggcac ttactacatc caagtgtgtg ccctggtcag gtgtgggggt cttggcttcg 1441 acacctgcgg acaggaaatc acagaggcca cggggatatt tgagtttcac ctgtggggca 1501 acttcagtac ttcctatatc tttcctttgt ttctgacctc agggatgacc ctagaagtcc 1561 ctgaccagct tggctgggag aatgaccact atttcctgag gaaaagtagg ctgtcctctg 1621 ggctggtgac ggcggctctc tatgggcgct tgtatgagag ggactaggaa aagtgtgtgg 1681 tctgtggggc ggactctggc catcatgttg acagccttgc acttccacag gctacaagcc 1741 ctgggaccat ctttctgcct taagggcagg agcccacttc tgtggcacca gattccaccc 1801 tgggaactgt ggaaaaagta ggagaggcag attccctcag tgtcttcctc ttaaacctca 1861 atcatcgaga cattaggggg tattttctgt tcacatttat ctttttcaag ccacatcttc 1921 ctctaacaaa tctctcagta tgcgattggt ctcaagctaa aacaaaaata aatgtcagtt 1981 tatattttac acatccaaaa aaaaaaaaaa aaaaaa // LOCUS HSU03397 1415 bp mRNA PRI 15-NOV-1994 DEFINITION Human receptor protein 4-1BB mRNA, complete cds. ACCESSION U03397 NID g571320 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1415) AUTHORS Alderson,M.R., Smith,C.A., Tough,T.W., Davis-Smith,T., Armitage,R.J., Falk,B., Roux,E., Baker,E., Sutherland,G.R., Din,W.S. and Goodwin,R.G. TITLE Molecular and biological characterization of human 4-1BB and its ligand JOURNAL Eur. J. Immunol. 24 (9), 2219-2227 (1994) MEDLINE 94374434 REFERENCE 2 (bases 1 to 1415) AUTHORS Alderson,M. TITLE Direct Submission JOURNAL Submitted (10-NOV-1993) Mark Alderson, Immunex Research and Development Corp., 51 University St., Seattle, WA 98101, USA FEATURES Location/Qualifiers source 1..1415 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="peripheral blood" /cell_type="T cell" CDS 120..887 /note="homology to the receptors for TNF and NGF; human homolog of murine T-cell receptor 4-1BB protein; transmembrane protein" /codon_start=1 /product="4-1BB" /db_xref="PID:g571321" /translation="MGNSCYNIVATLLLVLNFERTRSLQDPCSNCPAGTFCDNNRNQI CSPCPPNSFSSAGGQRTCDICRQCKGVFRTRKECSSTSNAECDCTPGFHCLGAGCSMC EQDCKQGQELTKKGCKDCCFGTFNDQKRGICRPWTNCSLDGKSVLVNGTKERDVVCGP SPADLSPGASSVTPPAPAREPGHSPQIISFFLALTSTALLFLLFFLTLRFSVVKRGRK KLLYIFKQPFMRPVQTTQEEDGCSCRFPEEEEGGCEL" misc_feature 531..533 /note="encodes potential glycosylation site" misc_feature 564..566 /note="encodes potential glycosylation site" misc_feature 678..758 /note="encodes transmembrane domain; amino acids 187-208" BASE COUNT 385 a 332 c 333 g 365 t ORIGIN 1 agtggaaagt tctccggcag ccctgagatc tcaagagtga catttgtgag accagctaat 61 ttgattaaaa ttctcttgga atcagctttg ctagtatcat acctgtgcca gatttcatca 121 tgggaaacag ctgttacaac atagtagcca ctctgttgct ggtcctcaac tttgagagga 181 caagatcatt gcaggatcct tgtagtaact gcccagctgg tacattctgt gataataaca 241 ggaatcagat ttgcagtccc tgtcctccaa atagtttctc cagcgcaggt ggacaaagga 301 cctgtgacat atgcaggcag tgtaaaggtg ttttcaggac caggaaggag tgttcctcca 361 ccagcaatgc agagtgtgac tgcactccag ggtttcactg cctgggggca ggatgcagca 421 tgtgtgaaca ggattgtaaa caaggtcaag aactgacaaa aaaaggttgt aaagactgtt 481 gctttgggac atttaacgat cagaaacgtg gcatctgtcg accctggaca aactgttctt 541 tggatggaaa gtctgtgctt gtgaatggga cgaaggagag ggacgtggtc tgtggaccat 601 ctccagccga cctctctccg ggagcatcct ctgtgacccc gcctgcccct gcgagagagc 661 caggacactc tccgcagatc atctccttct ttcttgcgct gacgtcgact gcgttgctct 721 tcctgctgtt cttcctcacg ctccgtttct ctgttgttaa acggggcaga aagaaactcc 781 tgtatatatt caaacaacca tttatgagac cagtacaaac tactcaagag gaagatggct 841 gtagctgccg atttccagaa gaagaagaag gaggatgtga actgtgaaat ggaagtcaat 901 agggctgttg ggactttctt gaaaagaagc aaggaaatat gagtcatccg ctatcacagc 961 tttcaaaagc aagaacacca tcctacataa tacccaggat tcccccaaca cacgttcttt 1021 tctaaatgcc aatgagttgg cctttaaaaa tgcaccactt tttttttttt ttttgacagg 1081 gtctcactct gtcacccagg ctggagtgca gtggcaccac catggctctc tgcagccttg 1141 acctctggga gctcaagtga tcctcctgcc tcagtctcct agtagctgga actacaagga 1201 agggccacca cacctgacta acttttttgt tttttgtttg gtaaagatgg catttcgcca 1261 tgttgtacag gctggtctca aactcctagg ttcactttgg cctcccaaag tgctgggatt 1321 acagacatga actgccaggc ccggccaaaa taatgcacca cttttaacag aacagacaga 1381 tgaggacaga gctggtgata aaaaaaaaaa aaaaa // LOCUS HSU03398 1619 bp mRNA PRI 15-NOV-1994 DEFINITION Human receptor 4-1BB ligand mRNA, complete cds. ACCESSION U03398 NID g571322 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1619) AUTHORS Alderson,M.R., Smith,C.A., Tough,T.W., Davis-Smith,T., Armitage,R.J., Falk,B., Roux,E., Baker,E., Sutherland,G.R., Din,W.S. and Goodwin,R.G. TITLE Molecular and biological characterization of human 4-1BB and its ligand JOURNAL Eur. J. Immunol. 24 (9), 2219-2227 (1994) MEDLINE 94374434 REFERENCE 2 (bases 1 to 1619) AUTHORS Alderson,M. TITLE Direct Submission JOURNAL Submitted (10-NOV-1993) Mark Alderson, Immunex Research and Development Corp., 51 University St., Seattle, WA 98101, USA FEATURES Location/Qualifiers source 1..1619 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="PL-1" /cell_type="T cell" CDS 4..768 /codon_start=1 /product="4-1BB ligand" /db_xref="PID:g571323" /translation="MEYASDASLDPEAPWPPAPRARACRVLPWALVAGLLLLLLLAAA CAVFLACPWAVSGARASPGSAASPRLREGPELSPDDPAGLLDLRQGMFAQLVAQNVLL IDGPLSWYSDPGLAGVSLTGGLSYKEDTKELVVAKAGVYYVFFQLELRRVVAGEGSGS VSLALHLQPLRSAAGAAALALTVDLPPASSEARNSAFGFQGRLLHLSAGQRLGVHLHT EARARHAWQLTQGATVLGLFRVTPEIPAGLPSPRSE" misc_feature 79..147 /note="encodes transmembrane domain; amino acids 26-48" polyA_signal 1601..1606 BASE COUNT 280 a 491 c 433 g 415 t ORIGIN 1 gtcatggaat acgcctctga cgcttcactg gaccccgaag ccccgtggcc tcccgcgccc 61 cgcgctcgcg cctgccgcgt actgccttgg gccctggtcg cggggctgct gctgctgctg 121 ctgctcgctg ccgcctgcgc cgtcttcctc gcctgcccct gggccgtgtc cggggctcgc 181 gcctcgcccg gctccgcggc cagcccgaga ctccgcgagg gtcccgagct ttcgcccgac 241 gatcccgccg gcctcttgga cctgcggcag ggcatgtttg cgcagctggt ggcccaaaat 301 gttctgctga tcgatgggcc cctgagctgg tacagtgacc caggcctggc aggcgtgtcc 361 ctgacggggg gcctgagcta caaagaggac acgaaggagc tggtggtggc caaggctgga 421 gtctactatg tcttctttca actagagctg cggcgcgtgg tggccggcga gggctcaggc 481 tccgtttcac ttgcgctgca cctgcagcca ctgcgctctg ctgctggggc cgccgccctg 541 gctttgaccg tggacctgcc acccgcctcc tccgaggctc ggaactcggc cttcggtttc 601 cagggccgct tgctgcacct gagtgccggc cagcgcctgg gcgtccatct tcacactgag 661 gccagggcac gccatgcctg gcagcttacc cagggcgcca cagtcttggg actcttccgg 721 gtgacccccg aaatcccagc cggactccct tcaccgaggt cggaataacg cccagcctgg 781 gtgcagccca cctggacaga gtccgaatcc tactccatcc ttcatggaga cccctggtgc 841 tgggtccctg ctgctttctc tacctcaagg ggcttggcag gggtccctgc tgctgacctc 901 cccttgagga ccctcctcac ccactccttc cccaagttgg accttgatat ttattctgag 961 cctgagctca gataatatat tatatatatt atatatatat atatatttct atttaaagag 1021 gatcctgagt ttgtgaatgg acttttttag aggagttgtt ttgggggggg ggtcttcgac 1081 attgccgagg ctggtcttga actcctggac ttagacgatc ctcctgcctc agcctcccaa 1141 gcaactggga ttcatccttt ctattaattc attgtactta tttgcctatt tgtgtgtatt 1201 gagcatctgt aatgtgccag cattgtgccc aggctagggg gctatagaaa catctagaaa 1261 tagactgaaa gaaaatctga gttatggtaa tacgtgagga atttaaagac tcatccccag 1321 cctccacctc ctgtgtgata cttgggggct agcttttttc tttctttctt ttttttgaga 1381 tggtcttgtt ctgtcaacca ggctagaatg cagcggtgca atcatgagtc aatgcagcct 1441 ccagcctcga cctcccgagg ctcaggtgat cctcccatct cagcctctcg agtagctggg 1501 accacagttg tgtgccacca cacttggcta actttttaat ttttttgcgg agacggtatt 1561 gctatgttgc caaggttgtt tacatgccag tacaatttat aataaacact catttttcc // LOCUS HSU03399 2190 bp mRNA PRI 30-NOV-1995 DEFINITION Human T-complex protein 10A (TCP10A) mRNA, complete cds. ACCESSION U03399 NID g424101 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2190) AUTHORS Islam,S.D., Pilder,S.H., Decker,C.L., Cebra-Thomas,J.A. and Silver,L.M. TITLE The human homolog of a candidate mouse t complex responder gene: conserved motifs and evolution with punctuated equilibria JOURNAL Hum. Mol. Genet. 2 (12), 2075-2079 (1993) MEDLINE 94154681 REFERENCE 2 (bases 1 to 2190) AUTHORS Silver,L.M. TITLE Direct Submission JOURNAL Submitted (10-NOV-1993) Lee M. Silver, Department of Molecular Biology, Princeton University, Princeton, NJ 08544-1014, USA FEATURES Location/Qualifiers source 1..2190 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="Hu1-Hu6" /clone_lib="testes cDNA library" /map="6q27" /chromosome="6" /tissue_type="testes" gene 78..1328 /gene="TCP10A" CDS 78..1328 /gene="TCP10A" /codon_start=1 /product="T-complex protein 10A" /db_xref="PID:g424102" /translation="MLEGQLEAGEPKEGTHPEDPCPGAGAAMEKTPAAAEVPREDSNA GEMPSLQQQITSLHQELGRQQSLWADIHRKLQSHMDALRKQNRELREELRGLQRQQWE AGKKPAASPHAGRESHTLALEPAFGKISPLSADEETTPKYAGRKSQSATLLGQRWSSN HLAPPKPMSLKTERINSGKTPPQEDREKSPPGRRQDRSPAPTGRPTPGAERREVSEDG KIMHPSSRSPQNSGGRKSPVQASQATTLQEQTAAARGADRSSSVLGSSEGGFLSRVQA DEFASSSPDSAERQEKFYPNGSKEIVFPDGTVEHLKDGQEETLFPDGTIVRVERNGDK TIVLSNGQKEIHTARFKRREYPDGTVKTVYCSGCQETKYASGRVKIKDEAGNVVLDEK QMSPQHAASHGKCQLQIFAKTDKN" misc_feature 138..297 /gene="TCP10A" /note="dimerization motifs" misc_feature 138..189 /gene="TCP10A" /note="leucine zipper" misc_feature 216..297 /gene="TCP10A" /note="leucine zipper" repeat_region 873..1161 /rpt_unit=nonapeptide_repeat BASE COUNT 634 a 552 c 569 g 435 t ORIGIN 1 ggggaggact tggggacagc tgtggggaca cggcccagat gctctggtcc cagcagaagt 61 gactggcggg gaccaggatg ctggagggtc agctcgaggc cggggagccc aaggagggca 121 cccacccaga agacccgtgc ccgggagctg gggctgccat ggagaagaca cctgcagcag 181 ccgaggtccc cagggaggac agcaatgccg gggagatgcc gtcattacag cagcagatca 241 ccagcctcca ccaagagctc gggagacagc agtcgctgtg ggctgatatt cacagaaaac 301 tccagagtca tatggatgcc ttgaggaagc agaaccggga gctccgagaa gagctgagag 361 gcctgcagcg gcagcagtgg gaagccggga agaaacccgc agcgtcccca cacgcggggc 421 gagaatcaca cactctggca ttggaacctg cttttggaaa aatttcacct ctgtcagctg 481 atgaagagac aacacccaaa tacgctggcc gcaagagtca gagtgccact ctcctgggac 541 aaagatggtc atctaaccac ttagctcctc caaagccaat gagtttaaag acagaaagaa 601 ttaactcggg gaaaacacca ccacaggaag atagagagaa aagtcctccc gggagacgtc 661 aagacagaag tccagcaccc actggaaggc cgactcccgg tgcagaaaga cgggaggtgt 721 ctgaagacgg aaagattatg catccatctt cccgaagtcc gcaaaactcc ggtggcagaa 781 aatcaccagt gcaggcttcc caggccacca cgctgcagga gcagacggca gcagccagag 841 gagctgacag aagctcatca gtcttaggga gttctgaagg cggatttctc agccgcgttc 901 aagctgacga gttcgccagt tcttccccag acagtgcaga gcggcaggaa aaattctacc 961 ccaatggctc caaagaaatc gtgtttcccg acgggacggt ggaacatctc aaggatggac 1021 aggaagagac cttatttcct gatgggacaa tcgtaagggt ggaaaggaac ggcgacaaaa 1081 ccattgtgct cagcaacggg cagaaagaaa tccacacagc ccggttcaag aggagggagt 1141 acccggatgg caccgtcaag actgtgtatt gcagcggctg tcaggaaacc aagtatgcct 1201 ccgggagggt taagatcaaa gatgaagctg gaaatgtcgt cctggacgag aagcagatga 1261 gccctcaaca cgcagcatca catgggaaat gccaattgca gatttttgct aaaacagaca 1321 aaaactaata acatattagc tgccctaaat gatcttggag agccaccaaa actttaggac 1381 tacccaagtc atctagaaat tgcaaaggaa aagacattct ctcccctatt taggaaactt 1441 ggttagagca gcacacgtcg gagaacagga ggctcacaaa agaacgattt tcagtgcccc 1501 cagcaaggct gcaccctcct tgatcatcac aggggccatt tctgccttgt gatggaccca 1561 ggaggtctct gcagagcaag tgccgttgga ccagctgatt cacaaagtgc tgagcccctc 1621 ttccccgtgt ggcccacccc aggcacggct cattcctcca gaattataga agctcagcct 1681 ggcctgctcc cctctggtca ccgacacccc tgtcctcgct gggacaagag ggagacaatt 1741 ctagtcacta caaggacacc caaggggtct gagtcagggc tccaatagtg cagttatcgt 1801 ggcctctcgc agcccgggga gcacgtcgcc tcttgaaact cgagatttct taaatccaat 1861 ataaaatgat aaccagttac ttacttcttt gttgtttgca ggccacctag aaatacaaat 1921 gccaattcac ctaattttta tactgaaatg tagatgcccc caacatgatt ttggaaagat 1981 ataatttctt gaaaatctta tcagaagatg catagttaat ttttttccat tatgtactca 2041 tgtttttatg aaatacaata ccatatttta gggccaggga ttattttaac cttaaaattt 2101 tgcatgcgtc attgtagccc tcaatatttg gtaatgctat ttaaagtgac tcaagaagca 2161 tgtgctatct ggctttgaag gaaaaaaaaa // LOCUS HSU03486 1260 bp DNA PRI 13-JAN-1995 DEFINITION Human connexin40 gene, complete cds. ACCESSION U03486 NID g416327 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1260) AUTHORS Kanter,H.L., Saffitz,J.E. and Beyer,E.C. TITLE Molecular cloning of two human cardiac gap junction proteins, connexin40 and connexin45 JOURNAL J. Mol. Cell. Cardiol. 26, 861-868 (1994) MEDLINE 95055780 REFERENCE 2 (bases 1 to 1260) AUTHORS Beyer,E.C. TITLE Direct Submission JOURNAL Submitted (16-NOV-1993) Eric C. Beyer, Pediatrics, Washington University School of Medicine, One Children's Place, St. Louis, MO 63110, USA FEATURES Location/Qualifiers source 1..1260 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 65..1141 /codon_start=1 /function="gap junction channel protein" /product="connexin40" /db_xref="PID:g416328" /translation="MGDWSFLGNFLEEVHKHSTVVGKVWLTVLFIFRMLVLGTAAEST WGDEQADFRCDTIQPGCHNVCYDQAFPISHIRYWVLQIIFVSTPSLVYMGHAMHTVRM QEKRKLREAERAKEVRGSGSYEYPVAEKAELSCWEEGNGRIALQGTLLNTYVCSILIR TTMEVGFIVGQYFIYGIFLTTLHVCRRSPCPHPVNCYVSRPTEKNVFIVFMLAVAALS LLLSLAELYHLGWKKIRQRFVKPRQYMAKCQLSGPLWAIVQSCTPPPDFNQCLENGPG GKFFNPFSNNMASQQNTDNLVTEQVRGQEQTPGEGFIQVRYGQKPEVPNGVSPGHRLP HGYHSDKRRLSKASSKARSDDLSV" BASE COUNT 267 a 360 c 337 g 296 t ORIGIN 1 cttttctctc tttctctctc tcccatttgc agaagttttg gcatctgttc cctgctgtgc 61 caacatgggc gattggagct tcctgggaaa tttcctggag gaagtacaca agcactcgac 121 cgtggtaggc aaggtctggc tcactgtcct cttcatattc cgtatgctcg tgctgggcac 181 agctgctgag tctacctggg gggatgagca ggctgatttc cggtgtgata cgattcagcc 241 tggctgccac aatgtctgct acgaccaggc tttccccatc tcccacattc gctactgggt 301 gctgcagatc atcttcgtct ctacgccctc tctggtgtac atgggccacg ccatgcacac 361 tgtgcgcatg caggagaagc gcaagctacg ggaggccgag agggccaaag aggtccgggg 421 ctctggctct tacgagtacc cggtggcaga gaaggcagaa ctgtcctgct gggaggaagg 481 gaatggaagg attgccctcc agggcactct gctcaacacc tatgtgtgca gcatcctgat 541 ccgcaccacc atggaggtgg gcttcattgt gggccagtac ttcatctacg gaatcttcct 601 gaccaccctg catgtctgcc gcaggagtcc ctgtccccac ccggtcaact gttacgtatc 661 ccggcccaca gagaagaatg tcttcattgt ctttatgctg gctgtggctg cactgtccct 721 cctccttagc ctggctgaac tctaccacct gggctggaag aagatcagac agcgatttgt 781 caaaccgcgg cagtacatgg ctaagtgcca gctttctggc cctctgtggg ctatagtcca 841 gagctgcaca ccaccccccg actttaatca gtgcctggag aatggtcctg ggggaaaatt 901 cttcaatccc ttcagcaata atatggcctc ccaacaaaac acagacaacc tggtcaccga 961 gcaagtacga ggtcaggagc agactcctgg ggaaggtttc atccaggttc gttatggcca 1021 gaagcctgag gtgcccaatg gagtctcacc aggtcaccgc cttccccatg gctatcatag 1081 tgacaagcga cgtcttagta aggccagcag caaggcaagg tcagatgacc tatcagtgtg 1141 accctccttt atgggaggat caggaccagg tgggaacaaa ggaggctcag agaggaaaga 1201 cgtgtccctt ctgaactgat gctttctcac tgtcatcact gcttggctcc tttggcccgg // LOCUS HSU03493 1191 bp DNA PRI 13-JAN-1995 DEFINITION Human connexin45 gene, complete cds. ACCESSION U03493 NID g424133 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1191) AUTHORS Kanter,H.L., Saffitz,J.E. and Beyer,E.C. TITLE Molecular cloning of two human cardiac gap junction proteins, connexin40 and connexin45 JOURNAL J. Mol. Cell. Cardiol. 26, 861-868 (1994) MEDLINE 95055780 REFERENCE 2 (bases 1 to 1191) AUTHORS Beyer,E.C. TITLE Direct Submission JOURNAL Submitted (16-NOV-1993) Eric C. Beyer, Pediatrics, Washington University School of Medicine, One Children's Place, St. Louis, MO 63110, USA FEATURES Location/Qualifiers source 1..1191 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1..1191 /codon_start=1 /function="gap junction channel protein" /product="connexin45" /db_xref="PID:g424134" /translation="MSWSFLTRLLEEIHNHSTFVGKIWLTVLIVFRIVLTAVGGESIY YDEQSKFVCNTEQPGCENVCYDAFAPLSHVRFWVFQIILVATPSVMYLGYAIHKIAKM EHGEADKKAARSKPYAMRWKQHRALEETEEDNEEDPMMYPEMELESDKENKEQSQPKP KHDGRRRIREDGLMKIYVLQLLARTVFEVGFLIGQYFLYGFQVHPFYVCSRLPCPHKI DCFISRPTEKTIFLLIMYGVTGLCLLLNIWEMLHLGFGTIRDSLNSKRRELEDPGAYN YPFTWNTPSAPPGYNIAVKPDQIQYTELSNAKIAYKQNKANTAQEQQYGSHEENLPAD LEALQREIRMAQERLDLAVQAYSHQNNPHGPREKKAKVGSKAGSNKSTASSKSGDGKN SVWI" BASE COUNT 329 a 276 c 306 g 280 t ORIGIN 1 atgagttgga gctttctgac tcgcctgcta gaggagattc acaaccattc cacatttgtg 61 gggaagatct ggctcactgt tctgattgtc ttccggatcg tccttacagc tgtaggagga 121 gaatccatct attacgatga gcaaagcaaa tttgtgtgca acacagaaca gccgggctgt 181 gagaatgtct gttatgatgc gtttgcacct ctctcccatg tacgcttctg ggtgttccag 241 atcatcctgg tggcaactcc ctctgtgatg tacctgggct atgctatcca caagattgcc 301 aaaatggagc acggtgaagc agacaagaag gcagctcgga gcaagcccta tgcaatgcgc 361 tggaaacaac accgggctct ggaagaaacg gaggaggaca acgaagagga tcctatgatg 421 tatccagaga tggagttaga aagtgataag gaaaataaag agcagagcca acccaaacct 481 aagcatgatg gccgacgacg gattcgggaa gatgggctca tgaaaatcta tgtgctgcag 541 ttgctggcaa ggaccgtgtt tgaggtgggt tttctgatag ggcagtattt tctgtatggc 601 ttccaagtcc acccgtttta tgtgtgcagc agacttcctt gtcctcataa gatagactgc 661 tttatttcta gacccactga aaagaccatc ttccttctga taatgtatgg tgttacaggc 721 ctttgcctct tgcttaacat ttgggagatg cttcatttag ggtttgggac cattcgagac 781 tcactaaaca gtaaaaggag ggaacttgag gatccgggtg cttataatta tcctttcact 841 tggaatacac catctgctcc ccctggctat aacattgctg tcaaaccaga tcaaatccag 901 tacaccgaac tgtccaatgc taagatcgcc tacaagcaaa acaaggccaa cacagcccag 961 gaacagcagt atggcagcca tgaggagaac ctcccagctg acctggaggc tctgcagcgg 1021 gagatcagga tggctcagga acgcttggat ctggcagttc aggcctacag tcaccaaaac 1081 aaccctcatg gtccccggga gaagaaggcc aaagtggggt ccaaagctgg gtccaacaaa 1141 agcactgcca gtagcaaatc aggggatggg aagaactctg tctggattta a // LOCUS HSU03494 2418 bp mRNA PRI 03-SEP-1994 DEFINITION Human transcription factor LSF mRNA, complete cds. ACCESSION U03494 NID g476098 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2418) AUTHORS Shirra,M.K., Zhu,Q., Huang,H.C., Pallas,D. and Hansen,U. TITLE One exon of the human LSF gene includes conserved regions involved in novel DNA-binding and dimerization motifs JOURNAL Mol. Cell. Biol. 14 (8), 5076-5087 (1994) MEDLINE 94309627 REFERENCE 2 (bases 1 to 2418) AUTHORS Hansen,U. TITLE Direct Submission JOURNAL Submitted (16-NOV-1993) Ulla Hansen, Mol Gen/Micro and Mol Gen, Dana Farber Cancer Institute, Harvard Medical School, 44 Binney Street, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..2418 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="piH3M-LSF17" /cell_line="HPB-ALL T cells" CDS 692..2200 /codon_start=1 /product="transcription factor LSF" /db_xref="PID:g476099" /translation="MAWALKLPLADEVIESGLVQDFDASLSGIGQELGAGAYSMSDVL ALPIFKQEESSLPPDNENKILPFQYVLCAATSPAVKLHDETLTYLNQGQSYEIRMLDN RKLGELPEINGKLVKSIFRVVFHDRRLQYTEHQQLEGWRWNRPGDRILDIDIPMSVGI IDPRANPTQLNTVEFLWDPAKRTSVFIQVHCISTEFTMRKHGGEKGVPFRVQIDTFKE NENGEYTEHLHSASCQIKVFKPKGADRKQKTDREKMEKRTPHEKEKYQPSYETTILTE CSPWPEITYVNNSPSPGFNSSHSSFSLGEGNGSPNHQPEPPPPVTDNLLPTTTPQEAQ QWLHRNRFSTFTRLFTNFSGADLLKLTRDDVIQICGPADGIRLFNALKGRMVRPRLTI YVCQESLQLREQQQQQQQQQQKHEDGDSNGTFFVYHAIYLEELTAVELTEKIAQLFSI SPCQISQIYKQGPTGIHVLISDEMIQNFQEEACFILDTMKQETNDSYHIILK" BASE COUNT 641 a 532 c 692 g 553 t ORIGIN 1 gaggacgcca tgattggttg gcgctggggc ggcggacggt ggaagggcct ggcgagtcta 61 ggttttacgc ctgtgctgga ctttctcctt ccatgtttcc aggccgtggg gggctacaga 121 gggcgagaag tcggctcagc ggaaacctgg atttggttct aagccgtggg gttgagaagg 181 ggtgaccgga agtgatcgtg ggactgaccg gaagcgaggc ctggagggga aagagagagc 241 gagacctggg agggaggggg cctccagcag aaaggggcgg gggaaaaggt gcaaaagcag 301 cgtgggagcg ccgggctggc ttcctgcggc tgctgctggt ctgactggga agcagcaagc 361 caccactacg aactctcaag aggagtggga gtgcgggagt ccagagctgc ctctgggaag 421 tctgcagtag ttgagcaaag gggtcctcac gttcctgaga gctgggcagg ggggattttg 481 gaacctgggg cagccaagaa cgagcagcca agggtacggg agattagttg tgcacagagc 541 agtgctggtc gggcttgggg gtggctggtg ggcactgcgt gggaaacctt ggtttgtagt 601 tttcttggtt tgcgttactc ctgttgggta gaattaccct ccgcgccttt gtacaagaca 661 cggtgtctcc tggggcaagg aaggagccag gatggcctgg gctctgaagc tgcctctggc 721 cgacgaagtg attgaatccg ggttggtgca ggactttgat gctagcctgt ccgggatcgg 781 ccaggaactg ggtgctggtg cctatagcat gagtgatgtc cttgcattgc ccatttttaa 841 gcaagaagag tcgagtttgc ctcctgataa tgagaataaa atcctgcctt ttcaatatgt 901 gctttgtgct gctacctctc cagcagtgaa actccatgat gaaaccctaa cgtatctcaa 961 tcaaggacag tcttatgaaa ttcgaatgct agacaatagg aaacttggag aacttccaga 1021 aattaatggc aaattggtga agagtatatt ccgtgtggtg ttccatgaca gaaggcttca 1081 gtacactgag catcagcagc tagagggctg gaggtggaac cgacctggag acagaattct 1141 tgacatagat atcccgatgt ctgtgggtat aatcgatcct agggctaatc caactcaact 1201 aaatacagtg gagttcctgt gggaccctgc aaagaggaca tctgtgttta ttcaggtgca 1261 ctgtattagc acagagttca ctatgaggaa acatggcgga gaaaaggggg tgccattccg 1321 agtacaaata gataccttca aggagaatga aaacggggaa tatactgagc acttacactc 1381 ggccagctgc cagatcaaag ttttcaagcc caaaggtgca gacagaaagc aaaaaacgga 1441 tagggaaaaa atggagaaac gaacacctca tgaaaaggag aaatatcagc cttcctatga 1501 gacaaccata ctcacagagt gttctccatg gcccgagatc acgtatgtca ataactcccc 1561 atcacctggc ttcaacagtt cccatagcag tttttctctt ggggaaggaa atggttcacc 1621 aaaccaccag ccagagccac cccctccagt cacagataac ctcttaccaa caaccacacc 1681 tcaggaagct cagcagtggt tgcatcgaaa tcgtttttct acattcacaa ggcttttcac 1741 aaacttctca ggggcagatt tattgaaatt aactagagat gatgtgatcc aaatctgtgg 1801 ccctgcagat ggaatcagac tttttaatgc attaaaaggc cggatggtgc gtccaaggtt 1861 aaccatttat gtttgtcagg aatcactgca gttgagggag cagcaacaac agcagcagca 1921 acagcagcag aagcatgagg atggagactc aaatggtact ttcttcgttt accatgctat 1981 ctatctagaa gaactaacag ctgttgaatt gacagaaaaa attgctcagc ttttcagcat 2041 ttccccttgc cagatcagcc agatttacaa gcaggggcca acaggaattc atgtgctcat 2101 cagtgatgag atgatacaga actttcagga agaagcatgt tttattctgg acacaatgaa 2161 acaggaaacc aatgatagct atcatatcat actgaagtag gagtgcggcg tttcgtgccc 2221 agtggctgct ccttccttca cctctgaaaa cggccctctt gaagggggat atgaatggag 2281 atttgaaggt ctgcaagaac ctgactcgtc tgactgtgtg tggaggagtc caggccatgg 2341 aggcagaatc ctggccctct gtgttggccc aagctcttgt ggtacacaca gagggccagg 2401 attctgcctc catggcct // LOCUS HSU03504 1680 bp mRNA PRI 13-OCT-1994 DEFINITION Human excitatory amino acid transporter1 mRNA, complete cds. ACCESSION U03504 NID g487338 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1680) AUTHORS Arriza,J.L., Fairman,W.A., Wendy,A., Wadiche,J.I., Murdoch,G.H., Kavanaugh,M.P. and Amara,S.G. TITLE Functional comparisons of three glutamate transporter subtypes cloned from human motor cortex JOURNAL J. Neurosci. 14, 5559-5569 (1994) MEDLINE 94365697 REFERENCE 2 (bases 1 to 1680) AUTHORS Arriza,J.L. TITLE Direct Submission JOURNAL Submitted (16-NOV-1993) Jeffrey L. Arriza, The Vollum Institute, Oregon Health Sciences University, 3181 SW Sam Jackson Park Road, Portland, OR 97201, USA FEATURES Location/Qualifiers source 1..1680 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain: motor cortex" 5'UTR 1..30 CDS 31..1659 /codon_start=1 /product="excitatory amino acid transporter1" /db_xref="PID:g487339" /translation="MTKSNGEEPKMGGRMERFQQGVRKRTLLAKKKVQNITKEDVKSY LFRNAFVLLTVTAVIVGTILGFTLRPYRMSYREVKYFSFPGELLMRMLQMLVLPLIIS SLVTGMAALDSKASGKMGMRAVVYYMTTTIIAVVIGIIIVIIIHPGKGTKENMHREGK IVRVTAADAFLDLIRNMFPPNLVEACFKQFKTNYEKRSFKVPIQANETLVGAVINNVS EAMETLTRITEELVPVPGSVNGVNALGLVVFSMCFGFVIGNMKEQGQALREFFDSLNE AIMRLVAVIMWYAPVGILFLIAGKIVEMEDMGVIGGQLAMYTVTVIVGLLIHAVIVLP LLYFLVTRKNPWVFIGGLLQALITALGTSSSSATLPITFKCLEENNGVDKRVTRFVLP VGATINMDGTALYEALAAIFIAQVNNFELNFGQIITISITATAASIGAAGIPQAGLVT MVIVLTSVGLPTDDITLIIAVDWFLDRLRTTTNVLGDSLGAGIVEHLSRHELKNRDVE MGNSVIEENEMKKPYQLIAQDNETEKPIDSETKM" 3'UTR 1660..1680 BASE COUNT 461 a 382 c 426 g 411 t ORIGIN 1 aaagaagaga ccctcctaga aaagtaaaat atgactaaaa gcaatggaga agagcccaag 61 atggggggca ggatggagag attccagcag ggagtccgta aacgcacact tttggccaag 121 aagaaagtgc agaacattac aaaggaggat gttaaaagtt acctgtttcg gaatgctttt 181 gtgctgctca cagtcaccgc tgtcattgtg ggtacaatcc ttggatttac cctccgacca 241 tacagaatga gctaccggga agtcaagtac ttctcctttc ctggggaact tctgatgagg 301 atgttacaga tgctggtctt accacttatc atctccagtc ttgtcacagg aatggcggcg 361 ctagatagta aggcatcagg gaagatggga atgcgagctg tagtctatta tatgactacc 421 accatcattg ctgtggtgat tggcataatc attgtcatca tcatccatcc tgggaagggc 481 acaaaggaaa acatgcacag agaaggcaaa attgtacgag tgacagctgc agatgccttc 541 ctggacttga tcaggaacat gttccctcca aatctggtag aagcctgctt taaacagttt 601 aaaaccaact atgagaagag aagctttaaa gtgcccatcc aggccaacga aacgcttgtg 661 ggtgctgtga taaacaatgt gtctgaggcc atggagactc ttacccgaat cacagaggag 721 ctggtcccag ttccaggatc tgtgaatgga gtcaatgccc tgggtctagt tgtcttctcc 781 atgtgcttcg gttttgtgat tggaaacatg aaggaacagg ggcaggccct gagagagttc 841 tttgattctc ttaacgaagc catcatgaga ctggtagcag taataatgtg gtatgccccc 901 gtgggtattc tcttcctgat tgctgggaag attgtggaga tggaagacat gggtgtgatt 961 ggggggcagc ttgccatgta caccgtgact gtcattgttg gcttactcat tcacgcagtc 1021 atcgtcttgc cactcctcta cttcttggta acacggaaaa acccttgggt ttttattgga 1081 gggttgctgc aagcactcat caccgctctg gggacctctt caagttctgc caccctaccc 1141 atcaccttca agtgcctgga agagaacaat ggcgtggaca agcgcgtcac cagattcgtg 1201 ctccccgtag gagccaccat taacatggat gggactgccc tctatgaggc tttggctgcc 1261 attttcattg ctcaagttaa caactttgaa ctgaacttcg gacaaattat tacaatcagc 1321 atcacagcca cagctgccag tattggggca gctggaattc ctcaggcggg cctggtcact 1381 atggtcattg tgctgacatc tgtcggcctg cccactgacg acatcacgct catcatcgcg 1441 gtggactggt tcctggatcg cctccggacc accaccaacg tactgggaga ctccctggga 1501 gctgggattg tggagcactt gtcacgacat gaactgaaga acagagatgt tgaaatgggt 1561 aactcagtga ttgaagagaa tgaaatgaag aaaccatatc aactgattgc acaggacaat 1621 gaaactgaga aacccatcga cagtgaaacc aagatgtaga ctaacataaa gaaacacttt // LOCUS HSU03506 1674 bp mRNA PRI 13-OCT-1994 DEFINITION Human excitatory amino acid transporter3 mRNA, complete cds. ACCESSION U03506 NID g487342 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1674) AUTHORS Arriza,J.L., Fairman,W.A., Wendy,A., Wadiche,J.I., Murdoch,G.H., Kavanaugh,M.P. and Amara,S.G. TITLE Functional comparisons of three glutamate transporter subtypes cloned from human motor cortex JOURNAL J. Neurosci. 14, 5559-5569 (1994) MEDLINE 94365697 REFERENCE 2 (bases 1 to 1674) AUTHORS Arriza,J.L. TITLE Direct Submission JOURNAL Submitted (16-NOV-1993) Jeffrey L. Arriza, The Vollum Institute, Oregon Health Sciences University, 3181 SW Sam Jackson Park Road, Portland, OR 97201, USA FEATURES Location/Qualifiers source 1..1674 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain: motor cortex" 5'UTR 1..15 CDS 16..1593 /codon_start=1 /product="excitatory amino acid transporter3" /db_xref="PID:g487343" /translation="MGKPARKGCPSWKRFLKNNWVLLSTVAAVVLGITTGVLVREHSN LSTLEKFYFAFPGEILMRMLKLIILPLIISSMITGVAALDSNVSGKIGLRAVVYYFCT TLIAVILGIVLVVSIKPGVTQKVGEIARTGSTPEVSTVDAMLDLIRNMFPENLVQACF QQYKTKREEVKPPSDPEMNMTEESFTAVMTTAISKNKTKEYKIVGMYSDGINVLGLIV FCLVFGLVIGKMGEKGQILVDFFNALSDATMKIVQIIMCYMPLGILFLIAGKIIEVED WEIFRKLGLYMATVLTGLAIHSIVILPLIYFIVVRKNPFRFAMGMAQALLTALMISSS SATLPVTFRCAEENNQVDKRITRFVLPVGATINMDGTALYEAVAAVFIAQLNDLDLGI GQIITISITATSASIGAAGVPQAGLVTMVIVLSAVGLPAEDVTLIIAVDWLLDRFRTM VNVLGDAFGTGIVEKLSKKELEQMDVSSEVNIVNPFALESTILDNEDSDTKKSYVNGG FAVDKSDTISFTQTSQF" 3'UTR 1594..1674 BASE COUNT 431 a 387 c 425 g 431 t ORIGIN 1 atagcggcga cagccatggg gaaaccggcg aggaaaggat gcccgagttg gaagcgcttc 61 ctgaagaata actgggtgtt gctgtccacc gtggccgcgg tggtgctagg cattaccaca 121 ggagtcttgg ttcgagaaca cagcaacctc tcaactctag agaaattcta ctttgctttt 181 cctggagaaa ttctaatgcg gatgctgaaa ctcatcattt tgccattaat tatatccagc 241 atgattacag gtgttgctgc actggattcc aacgtatccg gaaaaattgg tctgcgcgct 301 gtcgtgtatt atttctgtac cactctcatt gctgttattc taggtattgt gctggtggtg 361 agcatcaagc ctggtgtcac ccagaaagtg ggtgaaattg cgaggacagg cagcacccct 421 gaagtcagta cggtggatgc catgttagat ctcatcagga atatgttccc tgagaatctt 481 gtccaggcct gttttcagca gtacaaaact aagcgtgaag aagtgaagcc tcccagcgat 541 ccagagatga acatgacaga agagtccttc acagctgtca tgacaactgc aatttccaag 601 aacaaaacaa aggaatacaa aattgttggc atgtattcag atggcataaa cgtcctgggc 661 ttgattgtct tttgccttgt ctttggactt gtcattggaa aaatgggaga aaagggacaa 721 attctggtgg atttcttcaa tgctttgagt gatgcaacca tgaaaatcgt tcagatcatc 781 atgtgttata tgccactagg tattttgttc ctgattgctg ggaagatcat agaagttgaa 841 gactgggaaa tattccgcaa gctgggcctt tacatggcca cagtcctgac tgggcttgca 901 atccactcca ttgtaattct cccgctgata tatttcatag tcgtacgaaa gaaccctttc 961 cgatttgcca tgggaatggc ccaggctctc ctgacagctc tcatgatctc ttccagttca 1021 gcaacactgc ctgtcacctt ccgctgtgct gaagaaaata accaggtgga caagaggatc 1081 actcgattcg tgttacccgt tggtgcaaca atcaacatgg atgggaccgc gctctatgaa 1141 gcagtggcag cggtgtttat tgcacagttg aatgacctgg acttgggcat tgggcagatc 1201 atcaccatca gtatcacggc cacatctgcc agcatcggag ctgctggcgt gccccaggct 1261 ggcctggtga ccatggtgat tgtgctgagt gccgtgggcc tgcccgccga ggatgtcacc 1321 ctgatcattg ctgtcgactg gctcctggac cggttcagga ccatggtcaa cgtccttggt 1381 gatgcttttg ggacgggcat tgtggaaaag ctctccaaga aggagctgga gcagatggat 1441 gtttcatctg aagtcaacat tgtgaatccc tttgccttgg aatccacaat ccttgacaac 1501 gaagactcag acaccaagaa gtcttatgtc aatggaggct ttgcagtaga caagtctgac 1561 accatctcat tcacccagac ctcacagttc tagggcccct ggctgcagat gactggaaac 1621 aaggaaggac atttcgtgag agtcatctca aacacggctt aaggaaaaga gaaa // LOCUS HSU03634 1737 bp mRNA PRI 08-MAR-1994 DEFINITION Human P47 LBC oncogene mRNA, complete cds. ACCESSION U03634 NID g458209 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1737) AUTHORS Toksoz,D. and Williams,D.A. TITLE Novel human oncogene lbc detected by transfection with distinct homology regions to signal transduction products JOURNAL Oncogene 9 (2), 621-628 (1994) MEDLINE 94119604 REFERENCE 2 (bases 1 to 1737) AUTHORS Toksoz,D. TITLE Direct Submission JOURNAL Submitted (18-NOV-1993) Deniz Toksoz, Hematology/Oncology, Children's Hospital, Harvard Medical School, 300 Longwood Avenue, Boston, MA 02115 USA FEATURES Location/Qualifiers source 1..1737 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="9a2" /dev_stage="adult" CDS 251..1525 /codon_start=1 /product="P47 LBC oncogene" /db_xref="PID:g458210" /translation="MSNTWKFLSHSTDSLNKISKVNESTESLTDEGTDMNEGQLLGDF EIESKQLEAESWSRIIDSKFLKQQKKDVVKRQEVIYELMQTEFHHVRTLKIMSGVYSQ GMMADLLFEQQMVEKLFPCLDELISIHSQFFQRILERKKESLVDKSEKNFLIKRIGDV LVNQFSGENAERLKKTYGKFCGQHNQSVNYFKDLYAKDKRFQAFVKKKMSSSVVRRLG IPECILLVTQRITKYPVLFQRILQCTKDNEVEQEDLAQSLSLVKDVIGAVDSKVASYE KKVRLNEIYTKTDSKSIMRMKSGQMFAKEDLKRKKLVRDGSVFLKNAAGRLKEVQAVL LTDILVFLQEKDQKYIFASLDQKSTVISLKKLIVREVAHEEKGLFLISMGMTDPEMVE VHASSKEERNSWIQIIQDTINTLSGNGWRCFN" BASE COUNT 564 a 317 c 401 g 455 t ORIGIN 1 gtggtcacct tccccagcta gaaagccttc tttttatatc aacttccttt taattatgaa 61 gtattacaat caagcataaa gacataataa atgttaacat ctttcctttg tgtctgttta 121 aaaaatgttt cttattacag aaaatttcaa atagatttaa agtagagaaa tgcacgtgat 181 gaaaccccat gcacttgcca ttccatttca atagttatta cttcacggaa gagttggcaa 241 tgatgagaac atgtcaaaca cctggaaatt cctgtctcat tcaacagact cactaaataa 301 aatcagcaag gtcaatgagt caacagaatc acttactgat gagggtacag acatgaatga 361 aggacaacta ctgggagact ttgagattga gtccaaacag ctggaagcag agtcttggag 421 tcggataata gacagcaagt ttctaaaaca gcaaaagaaa gatgtggtca aacggcaaga 481 agtaatatat gagttgatgc agacagagtt tcatcatgtc cgcactctca agatcatgag 541 tggtgtgtac agccagggga tgatggcaga tctgcttttt gagcagcaga tggtagaaaa 601 gctgttcccc tgtttggatg agctgatcag tatccatagc caattcttcc agaggattct 661 ggagcggaag aaggagtctc tggtggataa aagtgaaaag aactttctca tcaagaggat 721 aggggatgtg cttgtaaatc agttttcagg tgagaatgca gaacgtttaa agaagacata 781 tggcaagttt tgtgggcaac ataaccagtc tgtaaactac ttcaaagacc tttatgccaa 841 ggataagcgt tttcaagcct ttgtaaagaa gaagatgagc agttcagttg ttagaaggct 901 tggaattcca gagtgcatat tgcttgtaac tcagcggatt accaagtacc cagttttatt 961 ccaaagaata ttgcagtgta ccaaagacaa tgaagtggag caggaagatc tagcacagtc 1021 cttgagcctg gtgaaggatg tgattggagc tgtagacagc aaagtggcaa gttatgaaaa 1081 gaaagtgcgt ctcaatgaga tttatacaaa gacagatagc aagtcaatca tgaggatgaa 1141 gagtggtcag atgtttgcca aggaagattt gaaacggaag aagcttgtac gtgatgggag 1201 tgtgtttctg aagaatgcag caggaaggtt gaaagaggtt caagcagttc ttctcactga 1261 cattttagtt ttccttcaag aaaaagacca gaagtacatc tttgcatcat tggaccagaa 1321 gtcaacagtg atctctttaa agaagctgat tgtgagagaa gtggcacatg aggagaaagg 1381 tttattcctg atcagcatgg ggatgacaga tccagagatg gtagaagtcc atgccagctc 1441 caaagaggaa cgaaacagct ggattcagat cattcaggac acaatcaaca ccctttccgg 1501 gaatggatgg aggtgcttta actgatgcga gtctcacaga gtcctatttc agcaccagct 1561 ttattggagt caatggattt ggaagccctg tagaaacaaa gtatcccttg atgcagtaca 1621 ctctggttgt cttccaacgt cttcgggtct gcacctctgg attcagcaat accaaaaaca 1681 ccggccccca aacttcccac ctacttgaac caataaactt cctcttgtgt ttaaact // LOCUS HSU03643 1380 bp mRNA PRI 23-NOV-1993 DEFINITION Human leukophysin (LKP) mRNA, complete cds. ACCESSION U03643 NID g425353 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1380) AUTHORS Abdelhaleem,M.M., Moreira,P. and Greenberg,A.H. TITLE Molecular cloning of leukophysin JOURNAL Unpublished REFERENCE 2 (bases 1 to 1380) AUTHORS Abdelhaleem,M.M. TITLE Direct Submission JOURNAL Submitted (21-NOV-1993) Abdelhaleem M.M., Manitoba Institute of Cell Biology, Immunology, 100 Olivia Street, Winnipeg, Manitoba R3E 0V9, Canada FEATURES Location/Qualifiers source 1..1380 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="U.3.20." /clone_lib="Uni-ZAP XR expression library" /cell_line="U937" gene 360..1067 /gene="LKP" CDS 360..1067 /gene="LKP" /codon_start=1 /product="leukophysin" /db_xref="PID:g425354" /translation="MKYPSPFFVFGEKIRTRAISAKGMTLVTPLQLLLFASKKVQSDG QIVLVDDWIKLQISHEAAACITGLRAAMEALVVEVTKQPAIISQLDPVNERMLNMIRQ ISRPSAAGINLMIGSTRYGDGPRPPKMARYDNGSGYRRGGSSYSGGGYGGGYSSGGYG SGGYGGSANSFRAGYGAGVGGGYRGVSRGGFRGNSGGDYRGPSGGYRGSGGFQRGGGR GAYGTGYFGQGRGGGGY" polyA_signal 1364..1369 polyA_site 1380 BASE COUNT 370 a 261 c 352 g 397 t ORIGIN 1 agcattgctg ctgctacctg ctttccagag cctttcatca atgaaggaaa gcggctgggc 61 tatatccatc gaaattttgc tggaaacaga ttttctgatc acgtagccct tttatcagta 121 ttccaagcct gggatgatgc tagaatgggt ggagaagaag cagagatacg tttttgtgag 181 cacaaaagac ttaatatggc tacactaaga atgacctggg aagccaaagt tcagctcaaa 241 gagattttga ttaattctgg gtttccagaa gattgtttgt tgacacaagt gtttactaac 301 actggaccag ataataattt ggatgttgtt atctccctcc tggcctttgt agccaagaca 361 tgaagtaccc atctcccttc tttgtatttg gtgaaaagat tcgaactcga gccatctctg 421 ctaaaggcat gactttagtc acccccctgc agttgcttct ctttgcctcc aagaaagtcc 481 aatctgatgg gcagattgtg cttgtagatg actggattaa actgcaaata tctcatgaag 541 ctgctgcctg tatcactggt ctccgggcag ccatggaggc tttggttgtt gaagtaacca 601 aacaacctgc tatcatcagc cagttggacc ccgtaaatga acgtatgctg aacatgatcc 661 gtcagatctc tagaccctca gctgctggta tcaaccttat gattggcagt acacggtatg 721 gagatggtcc acgtcctccc aagatggccc gatacgacaa tggaagcgga tatagaaggg 781 gaggttctag ttacagtggt ggaggctatg gcggtggcta tagcagtgga ggctatggta 841 gcggaggcta tggtggcagc gccaactcct ttcgggcagg atatggtgca ggtgttggtg 901 gaggctatag aggagtttcc cgaggtggct ttagaggcaa ctctggagga gactacagag 961 ggcctagtgg aggctacaga ggatctgggg gattccagcg aggaggtggt aggggggcct 1021 atggaactgg ctactttgga cagggaagag gaggtggcgg ctattaaaac ttggttatgt 1081 cagttcctgt gtgtagacag taaggaaaaa aaggcatgct atgtgttacg tgttttttcc 1141 agtatgttta tttgccacca aaaagtaaat gcattttcac ccattctgtg gttcattgta 1201 gtttaaggaa accaagcata tagatgcatt agtgattttg tttatattat gtaaaatata 1261 acgatctctt aaaaatacca cagtttgtat tttttcttta aggagtaaag atttgccttt 1321 aaataacttg gtattttcct ggctttcgtt taatacaata gaaaataaag tattacaccg // LOCUS HSU03644 1519 bp mRNA PRI 01-MAY-1994 DEFINITION Human recepin mRNA, complete cds. ACCESSION U03644 NID g476104 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1519) AUTHORS Chai,K.X., Li,L., Chao,J. and Chao,L. TITLE Recepin: a novel human liver cDNA encoding a serpin-like molecule JOURNAL Unpublished REFERENCE 2 (bases 1 to 1519) AUTHORS Chao,L. TITLE Direct Submission JOURNAL Submitted (20-NOV-1993) Lee Chao, Biochemistry and Molecular Biology, Medical University of South Carolina, 171 Ashley Avenue, Charleston, SC 29425-2211, USA FEATURES Location/Qualifiers source 1..1519 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="KS1010" /clone_lib="human liver cDNA in lambda-gt 11 vector, Savio L.C. Woo" /tissue_type="liver" gene 33..1388 /gene="recepin" CDS 33..1388 /gene="recepin" /note="a serpin-like molecule" /codon_start=1 /product="recepin" /db_xref="PID:g476105" /translation="MGKSFANFMCKKDFHPASKSNIKKVWMAEQKISYDKKKQEELMQ QYLKEQESYDNRLLMGDERVKNGLNFMYEAPPGAKKENKEKEETEGETEYKFEWQKGA PREKYAKDDMNIRDQPFGIQVRNVRCIKCHKWVMSTQIENVLCLVFLEVNASSVPTDG SGPSMHPSELIGEMRNQWVCTETKCTGEKLDRKLIHHRSMLQVQGEEDPEVEFLKSLT TKQKQKLLRKLDRLEKKKKKKDRKKKKFQKSRSKHKKHKSSSSYLPPPPPLPLLRLQK AVVRVRVTIKKKKLQRKKRKKNKCSGHNNSDSEEKDKSKKRKLHEELSSTHHNREKAK EKPRFLKHESSREDSKWSHSDSDKKSRTHKHSPEKRGSERKEGSSRSHGREERSRRSQ PEVLVVTSKGRQGNGHSEHPGEEQSRRNDSRSHGTDLYRGEKMYREHPGGTHTKVTQR E" BASE COUNT 601 a 277 c 346 g 295 t ORIGIN 1 cagcccctgc tttccctagt tccagttcca agatggggaa atccttcgcc aacttcatgt 61 gcaagaaaga ctttcatcct gcctccaaat ccaatatcaa aaaagtatgg atggcagaac 121 agaaaatatc atatgataag aagaaacaag aagaattgat gcagcaatat cttaaagaac 181 aagaatcata tgataataga ttgcttatgg gagatgaacg tgtaaagaat ggccttaatt 241 tcatgtatga agccccacca ggagctaaaa aagaaaacaa agagaaagaa gaaacagaag 301 gagagaccga atacaaattt gaatggcaga aaggagcccc acgagaaaaa tatgccaaag 361 atgacatgaa catcagagat cagccctttg gtattcaggt tcgaaatgtg aggtgcatta 421 aatgtcacaa atgggtcatg tcaacacaga tcgagaatgt cctttgtttg gtctttctgg 481 aagtcaatgc aagttcggtt cccactgatg gctcagggcc atcgatgcac ccctcggagc 541 taataggcga gatgagaaac cagtgggttt gcactgaaac gaaatgtact ggggagaaac 601 ttgaccgcaa actgatccat cacaggagta tgttgcaagt gcagggtgaa gaagatccag 661 aagttgaatt tttaaagtca ctaacaacca aacaaaaaca gaaacttctc aggaaattag 721 atcgactgga gaagaaaaaa aagaaaaaag atagaaaaaa gaaaaagttt cagaagagca 781 gaagtaaaca caaaaaacat aagtcctctt cttcctatct tcctcctcct cctcctcttc 841 ctctactgag acttcagaaa gcagtagtga gagtgagagt aacaataaag aaaaaaaaac 901 tacaaaggaa gaaaagaaag aaaaacaagt gttcagggca taacaacagt gattctgaag 961 agaaggacaa gtctaagaag agaaagcttc atgaagaact ttctagcact caccataacc 1021 gggaaaaagc caaggaaaag cccaggttct taaaacacga gagttctagg gaggacagca 1081 aatggagcca ttctgattct gacaaaaagt ccagaaccca taaacatagc ccagagaaga 1141 gaggctctga aagaaaggag gggagcagca gaagccacgg cagggaggaa aggagccgga 1201 gaagccagcc agaagtcctg gtagttacaa gcaaagggag acaaggaaac gggcacagcg 1261 aacatcctgg tgaagagcaa agcagaagaa atgacagcag aagccatggc acagacttgt 1321 atagaggaga aaaaatgtac agagagcacc caggaggtac acatactaaa gtgacacaaa 1381 gagaatgaag cagaagtaga gaagaaagac tgtatgtgac aattacctgg gaataaaaat 1441 atctccactt ttttattgaa tacctttagc aaggggtaaa ttatatactg ttgtctttct 1501 aataaaaaag ctcaatttt // LOCUS HSU03858 1080 bp mRNA PRI 19-JUL-1994 DEFINITION Human flt3 ligand mRNA, complete cds. ACCESSION U03858 NID g494978 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1080) AUTHORS Lyman,S.D., James,L., Johnson,L., Brasel,K., de Vries,P., Escobar,S.S., Downey,H., Splett,R.R., Beckmann,M.P. and McKenna,H.J. TITLE Cloning of the human homologue of the murine flt3 ligand: a growth factor for early hematopoietic progenitor cells JOURNAL Blood 83, 2795-2801 (1994) MEDLINE 94235842 REFERENCE 2 (bases 1 to 1080) AUTHORS Lyman,S.D. TITLE Direct Submission JOURNAL Submitted (30-NOV-1993) Stewart D. Lyman, Immunex Research and Development Corporation, 51, University St., Seattle, WA 98101, USA FEATURES Location/Qualifiers source 1..1080 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="expression, cDNA" /cell_line="clone 22 (T cell)" 5'UTR 1..83 CDS 84..791 /standard_name="FMS-like tyrosine kinase-3 ligand" /note="ligand for the flt3/flk-2 tyrosine kinase receptor" /codon_start=1 /function="stimulates proliferation of early hematopoietic cells" /product="flt3 ligand" /db_xref="PID:g494979" /translation="MTVLAPAWSPTTYLLLLLLLSSGLSGTQDCSFQHSPISSDFAVK IRELSDYLLQDYPVTVASNLQDEELCGGLWRLVLAQRWMERLKTVAGSKMQGLLERVN TEIHFVTKCAFQPPPSCLRFVQTNISRLLQETSEQLVALKPWITRQNFSRCLELQCQP DSSTLPPPWSPRPLEATAPTAPQPPLLLLLLLPVGLLLLAAAWCLHWQRTRRRTPRPG EQVPPVPSPQDLLLVEH" sig_peptide 84..161 misc_feature 162..629 /note="extracellular domain" misc_feature 630..698 /note="transmembrane domain" misc_feature 699..788 /note="cytoplasmic domain" 3'UTR 792..1080 misc_feature 1015..1080 /note="ATTTA mRNA instability motif" polyA_signal 1059..1064 polyA_site 1080 /note="32 A residues" BASE COUNT 204 a 384 c 290 g 202 t ORIGIN 1 ccggggggca tgagggtccg agacttgttc ttctgtccct tccaagaccc ggcgacagga 61 ggcatgaggg gcccccggcc gaaatgacag tgctggcgcc agcctggagc ccaacaacct 121 atctcctcct gctgctgctg ctgagctcgg gactcagtgg gacccaggac tgctccttcc 181 aacacagccc catctcctcc gacttcgctg tcaaaatccg tgagctgtct gactacctgc 241 ttcaagatta cccagtcacc gtggcctcca acctgcagga cgaggagctc tgcgggggcc 301 tctggcggct ggtcctggca cagcgctgga tggagcggct caagactgtc gctgggtcca 361 agatgcaagg cttgctggag cgcgtgaaca cggagataca ctttgtcacc aaatgtgcct 421 ttcagccccc ccccagctgt cttcgcttcg tccagaccaa catctcccgc ctcctgcagg 481 agacctccga gcagctggtg gcgctgaagc cctggatcac tcgccagaac ttctcccggt 541 gcctggagct gcagtgtcag cccgactcct caaccctgcc acccccatgg agtccccggc 601 ccctggaggc cacagccccg acagccccgc agccccctct gctcctccta ctgctgctgc 661 ccgtgggcct cctgctgctg gccgctgcct ggtgcctgca ctggcagagg acgcggcgga 721 ggacaccccg ccctggggag caggtgcccc ccgtccccag tccccaggac ctgctgcttg 781 tggagcactg acctggccaa ggcctcatcc tgcggagcct taaacaacgc agtgagacag 841 acatctatca tcccatttta caggggagga tactgaggca cacagagggg agtcaccagc 901 cagaggatgt atagcctgga cacagaggaa gttggctaga ggccggtccc ttccttgggc 961 ccctctcatt ccctccccag aatggaggca acgccagaat ccagcaccgg ccccatttac 1021 ccaactctga acaaagccct tgcccccatg aaattgttta taaatcatcc ttttctccca // LOCUS HSU03865 1738 bp mRNA PRI 29-MAR-1995 DEFINITION Human adrenergic alpha-1b receptor protein mRNA, complete cds. ACCESSION U03865 NID g494982 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1738) AUTHORS Forray,C., Bard,J.A., Wetzel,J.M., Chiu,G., Shapiro,E., Tang,R., Lepor,H., Hartig,P.R., Weinshank,R.L., Branchek,T.A. and Gluchowski,C. TITLE The alpha 1-adrenergic receptor that mediates smooth muscle contraction in human prostate has the pharmacological properties of the cloned human alpha 1c subtype JOURNAL Mol. Pharmacol. 45 (4), 703-708 (1994) MEDLINE 94239386 REFERENCE 2 (bases 1 to 1738) AUTHORS Ramarao,C.S., Denker,J.M., Perez,D.M., Gaivin,R.J., Riek,R.P. and Graham,R.M. TITLE Genomic organization and expression of the human alpha 1B-adrenergic receptor JOURNAL J. Biol. Chem. 267 (30), 21936-21945 (1992) MEDLINE 93016158 REFERENCE 3 (bases 1 to 1738) AUTHORS Nawoschik,S.P. and Bard,J.A. TITLE Direct Submission JOURNAL Submitted (30-NOV-1993) S.P. Nawoschik and Jonathan A. Bard, Synaptic Pharmaceutical Corporation, Molecular Biology, 215 College Rd., Paramus, NJ 07652 USA FEATURES Location/Qualifiers source 1..1738 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hBS2" /clone_lib="human brainstem cDNA (Stratagene)" /tissue_type="brainstem" 5'UTR 1..123 CDS 124..1686 /citation=[1] /codon_start=1 /product="adrenergic alpha-1b receptor protein" /db_xref="PID:g494983" /translation="MNPDLDTGHNTSAPAHWGELKNANFTGPNQTSSNSTLPQLDITR AISVGLVLGAFILFAIVGNILVILSVACNRHLRTPTNYFIVNLAMADLLLSFTVLPFS AALEVLGYWVLGRIFCDIWAAVDVLCCTASILSLCAISIDRYIGVRYSLQYPTLVTRR KAILALLSVWVLSTVISIGPLLGWKEPAPNDDKECGVTEEPFYALFSSLGSFYIPLAV ILVMYCRVYIVAKRTTKNLEAGVMKEMSNSKELTLRIHSKNFHEDTLSSTKAKGHNPR SSIAVKLFKFSREKKAAKTLGIVVGMFILCWLPFFIALPLGSLFSTLKPPDAVFKVVF WLGYFNSCLNPIIYPCSSKEFKRAFVRILGCQCRGRGRRRRRRRRRLGGCAYTYRPWT RGGSLERSQSRKDSLDDSGSCLSGSQRTLPSASPSPGYLGRGAPPPVELCAFPEWKAP GALLSLPAPEPPGRRGRHDSGPLFTFKLLTEPESPGTDGGASNGGCEAAADVANGQPG FKSNMPLAPGQF" conflict 618 /citation=[2] /replace="c" conflict 1223..1228 /citation=[2] /replace="" conflict 1615..1624 /citation=[2] /replace="ccacgcc" 3'UTR 1687..1738 BASE COUNT 308 a 603 c 502 g 325 t ORIGIN 1 gccaggaggg cgcctctggg aagaagacca cgggggaagc aaagtttcag ggcagctgag 61 gagccttcgc cgcagccctt ccgagcccaa tcatccccca ggctatggag ggcggactct 121 aagatgaatc ccgacctgga caccggccac aacacatcag cacctgccca ctggggagag 181 ttgaaaaatg ccaacttcac tggccccaac cagacctcga gcaactccac actgccccag 241 ctggacatca ccagggccat ctctgtgggc ctggtgctgg gcgccttcat cctctttgcc 301 atcgtgggca acatcctagt catcttgtct gtggcctgca accggcacct gcggacgccc 361 accaactact tcattgtcaa cctggccatg gccgacctgc tgttgagctt caccgtcctg 421 cccttctcag cggccctaga ggtgctcggc tactgggtgc tggggcggat cttctgtgac 481 atctgggcag ccgtggatgt cctgtgctgc acagcgtcca ttctgagcct gtgcgccatc 541 tccatcgatc gctacatcgg ggtgcgctac tctctgcagt atcccacgct ggtcacccgg 601 aggaaggcca tcttggcgct gctcagtgtc tgggtcttgt ccaccgtcat ctccatcggg 661 cctctccttg ggtggaagga gccggcaccc aacgatgaca aggagtgcgg ggtcaccgaa 721 gaacccttct atgccctctt ctcctctctg ggctccttct acatccctct ggcggtcatt 781 ctagtcatgt actgccgtgt ctatatagtg gccaagagaa ccaccaagaa cctagaggca 841 ggagtcatga aggagatgtc caactccaag gagctgaccc tgaggatcca ttccaagaac 901 tttcacgagg acacccttag cagtaccaag gccaagggcc acaaccccag gagttccata 961 gctgtcaaac tttttaagtt ctccagggaa aagaaagcag ctaagacgtt gggcattgtg 1021 gtcggtatgt tcatcttgtg ctggctaccc ttcttcatcg ctctaccgct tggctccttg 1081 ttctccaccc tgaagccccc cgacgccgtg ttcaaggtgg tgttctggct gggctacttc 1141 aacagctgcc tcaaccccat catctaccca tgctccagca aggagttcaa gcgcgctttc 1201 gtgcgcatcc tcgggtgcca gtgccgcggc cgcggccgcc gccgacgccg ccgccgccgt 1261 cgcctgggcg gctgcgccta cacctaccgg ccgtggacgc gcggcggctc gctggagcgc 1321 tcgcagtcgc gcaaggactc gctggacgac agcggcagct gcctgagcgg cagccagcgg 1381 accctgccct cggcctcgcc gagcccgggc tacctgggcc gcggcgcgcc accgccagtc 1441 gagctgtgcg ccttccccga gtggaaggcg cccggcgccc tcctgagcct gcccgcgcct 1501 gagccccccg gccgccgcgg ccgccacgac tcgggcccgc tcttcacctt caagctcctg 1561 accgagcccg agagccccgg gaccgacggc ggcgccagca acggaggctg cgaggccgcg 1621 gccgacgtgg ccaacgggca gccgggcttc aaaagcaaca tgcccctggc gcccgggcag 1681 ttttagggcc cccgtgcgca gctttctttc cctggggagg aaaacatcgt ggggggga // LOCUS HSU03877 2512 bp mRNA PRI 30-APR-1995 DEFINITION Human extracellular protein (S1-5) mRNA, complete cds. ACCESSION U03877 NID g458227 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2512) AUTHORS Lecka-Czernik,B., Lumpkin,C.K. Jr. and Goldstein,S. TITLE An overexpressed gene transcript in senescent and quiescent human fibroblasts encoding a novel protein in the epidermal growth factor-like repeat family stimulates DNA synthesis JOURNAL Mol. Cell. Biol. 15 (1), 120-128 (1995) MEDLINE 95097983 REFERENCE 2 (bases 1 to 2512) AUTHORS Lecka-Czernik,B. TITLE Direct Submission JOURNAL Submitted (01-DEC-1993) Beata Lecka-Czernik, Medicine, University of Arkansas for Medical Sciences, 4301 W. Markham, Little Rock, AR 72205-7199, USA FEATURES Location/Qualifiers source 1..2512 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="S1-5" /clone_lib="sW8" /sex="male" /cell_line="WS8" /cell_type="fibroblast" /tissue_type="skin" /dev_stage="adult" mRNA 1..2512 /gene="S1-5" gene 1..2512 /gene="S1-5" 5'UTR 1..237 /gene="S1-5" CDS 238..1401 /gene="S1-5" /note="contains 5 EGF-like domains" /codon_start=1 /function="stimulation of DNA synthesis in vitro" /product="extracellular protein" /db_xref="PID:g458228" /translation="MATSGVLPGGGFVASAAAVAGPEMQTGRNNFVIRRNPADPQRIP SNPSHRIQCAAGYEQSEHNVCQDIDECTAGTHNCRADQVCINLRGSFACQCPPGYQKR GEQCVDIDECTIPPYCHQRCVNTPGSFYCQCSPGFQLAANNYTCVDINECDASNQCAQ QCYNILGSFICQCNQGYELSSDRLNCEDIDECRTSSYLCQYQCVNEPGKFSCMCPQGY QVVRSRTCQDINECETTNECREDEMCWNYHGGFRCYPRNPCQDPYILTPENRCVCPVS NAMCRELPQSIVYKYMSIRSDRSVPSDIFQIQATTIYANTINTFRIKSGNENGEFYLR QTSPVSAMLVLVKSLSGPREHIVDLEMLTVSSIGTFRTSSVLRLTIIVGPFSF" sig_peptide 238..291 /gene="S1-5" 3'UTR 1399..2512 /gene="S1-5" polyA_signal 1840..1845 /gene="S1-5" polyA_site 1860..1876 /gene="S1-5" polyA_signal 1936..1941 /gene="S1-5" polyA_site 1955..1956 /gene="S1-5" polyA_signal 2115..2120 /gene="S1-5" polyA_signal 2475..2486 /gene="S1-5" polyA_site 2496..2512 /gene="S1-5" BASE COUNT 803 a 507 c 512 g 690 t ORIGIN 1 caatgcactg acggatatga gtgggatcct gtgagacagc aatgcaaaga tattgatgaa 61 tgtgacattg tcccagacgc ttgtaaaggt ggaatgaagt gtgtcaacca ctatggagga 121 tacctctgcc ttccgaaaac agcccagatt attgtcaata atgaacagcc tcagcaggaa 181 acacaaccag cagaaggaac ctcaggggca accaccgggg ttgtagctgc cagcagcatg 241 gcaaccagtg gagtgttgcc cgggggtggt tttgtggcca gtgctgctgc agtcgcaggc 301 cctgaaatgc agactggccg aaataacttt gtcatccggc ggaacccagc tgaccctcag 361 cgcattccct ccaacccttc ccaccgtatc cagtgtgcag caggctacga gcaaagtgaa 421 cacaacgtgt gccaagacat agacgagtgc actgcaggga cgcacaactg tagagcagac 481 caagtgtgca tcaatttacg gggatccttt gcatgtcagt gccctcctgg atatcagaag 541 cgaggggagc agtgcgtaga catagatgaa tgtaccatcc ctccatattg ccaccaaaga 601 tgcgtgaata caccaggctc attttattgc cagtgcagtc ctgggtttca attggcagca 661 aacaactata cctgcgtaga tataaatgaa tgtgatgcca gcaatcaatg tgctcagcag 721 tgctacaaca ttcttggttc attcatctgt cagtgcaatc aaggatatga gctaagcagt 781 gacaggctca actgtgaaga cattgatgaa tgcagaacct caagctacct gtgtcaatat 841 caatgtgtca atgaacctgg gaaattctca tgtatgtgcc cccagggata ccaagtggtg 901 agaagtagaa catgtcaaga tataaatgag tgtgagacca caaatgaatg ccgggaggat 961 gaaatgtgtt ggaattatca tggcggcttc cgttgttatc cacgaaatcc ttgtcaagat 1021 ccctacattc taacaccaga gaaccgatgt gtttgcccag tctcaaatgc catgtgccga 1081 gaactgcccc agtcaatagt ctacaaatac atgagcatcc gatctgatag gtctgtgcca 1141 tcagacatct tccagataca ggccacaact atttatgcca acaccatcaa tacttttcgg 1201 attaaatctg gaaatgaaaa tggagagttc tacctacgac aaacaagtcc tgtaagtgca 1261 atgcttgtgc tcgtgaagtc attatcagga ccaagagaac atatcgtgga cctggagatg 1321 ctgacagtca gcagtatagg gaccttccgc acaagctctg tgttaagatt gacaataata 1381 gtggggccat tttcatttta gtcttttcta agagtcaacc acaggcattt aagtcagcca 1441 aagaatattg ttaccttaaa gcactatttt atttatagat atatctagtg catctacatc 1501 tctatactgt acactcaccc ataacaaaca attacaccat ggtataaagt gggcatttaa 1561 tatgtaaaga ttcaaagttt gtctttatta ctatatgtaa attagacatt aatccactaa 1621 actggtcttc ttcaagagag ctaagtatac actatctggt gaaacttgga ttctttccta 1681 taaaagtggg accaagcaat gatgatcttc tgtggtgctt aaggaaactt actagagctc 1741 cactaacagt ctcataagga ggcagccatc ataaccattg aatagcatgc aagggtaaga 1801 atgagttttt aactgctttg taagaaaatg gaaaaggtca ataaagatat atttctttag 1861 aaaatgggga tctgccatat ttgtgttggt ttttattttc atatccagcc taaaggtggt 1921 tgtttattat atagtaataa atcattgctg tacaacatgc tggtttctgt agggtatttt 1981 taattttgtc agaaatttta gattgtgaat attttgtaaa aaacagtaag caaaattttc 2041 cagaattccc aaaatgaacc agataccccc tagaaaatta tactattgag aaatctatgg 2101 ggaggatatg agaaaataaa ttccttctaa accacattgg aactgacctg aagaagcaaa 2161 ctcggaaaat ataataacat ccctgaattc aggcattcac aagatgcaga acaaaatgga 2221 taaaaggtat ttcactggag aagttttaat ttctaagtaa aatttaaatc ctaacacttc 2281 actaatttat aactaaaatt tctcatcttc gtacttgatg ctcacagagg aagaaaatga 2341 tgatggtttt tattcctggc atccagagtg acagtgaact taagcaaatt accctcctac 2401 ccaattctat ggaatatttt atacgtctcc ttgtttaaaa tctgactgct ttactttgat 2461 gtatcatatt tttaaataaa aataaatatt cctttagaag atcactctaa aa // LOCUS HSU03882 2232 bp mRNA PRI 22-JUN-1994 DEFINITION Human monocyte chemoattractant protein 1 receptor (MCP-1RA) alternatively spliced mRNA, complete cds. ACCESSION U03882 NID g472555 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2232) AUTHORS Charo,I.F., Myers,S.J., Herman,A., Franci,C., Connolly,A.J. and Coughlin,S.R. TITLE Molecular cloning and functional expression of two monocyte chemoattractant protein 1 receptors reveals alternate splicing of the carboxyl-terminal tails JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91, 2752-2756 (1994) MEDLINE 94195821 REFERENCE 2 (bases 1 to 2232) AUTHORS Myers,S.J. TITLE Direct Submission JOURNAL Submitted (01-DEC-1993) Scott J. Myers, Cardiovascular, The Gladstone Institutes, 2550 23rd Street, San Francisco, CA 94110, USA FEATURES Location/Qualifiers source 1..2232 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="ccr2-9a" /clone_lib="MonoMac6-#3" /cell_line="MonoMac 6" CDS 40..1164 /standard_name="monocyte chemoattractant protein 1 receptor" /note="alternatively spliced; MCP-1RA" /codon_start=1 /product="MCP-1 receptor" /db_xref="PID:g472556" /translation="MLSTSRSRFIRNTNESGEEVTTFFDYDYGAPCHKFDVKQIGAQL LPPLYSLVFIFGFVGNMLVVLILINCKKLKCLTDIYLLNLAISDLLFLITLPLWAHSA ANEWVFGNAMCKLFTGLYHIGYFGGIFFIILLTIDRYLAIVHAVFALKARTVTFGVVT SVITWLVAVFASVPGIIFTKCQKEDSVYVCGPYFPRGWNNFHTIMRNILGLVLPLLIM VICYSGILKTLLRCRNEKKRHRAVRVIFTIMIVYFLFWTPYNIVILLNTFQEFFGLSN CESTSQLDQATQVTETLGMTHCCINPIIYAFVGEKFRSLFHIALGCRIAPLQKPVCGG PGVRPGKNVKVTTQGLLDGRGKGKSIGRAPEASLQDKEGA" BASE COUNT 602 a 464 c 508 g 658 t ORIGIN 1 ggattgaaca aggacgcatt tccccagtac atccacaaca tgctgtccac atctcgttct 61 cggtttatca gaaataccaa cgagagcggt gaagaagtca ccaccttttt tgattatgat 121 tacggtgctc cctgtcataa atttgacgtg aagcaaattg gggcccaact cctgcctccg 181 ctctactcgc tggtgttcat ctttggtttt gtgggcaaca tgctggtcgt cctcatctta 241 ataaactgca aaaagctgaa gtgcttgact gacatttacc tgctcaacct ggccatctct 301 gatctgcttt ttcttattac tctcccattg tgggctcact ctgctgcaaa tgagtgggtc 361 tttgggaatg caatgtgcaa attattcaca gggctgtatc acatcggtta ttttggcgga 421 atcttcttca tcatcctcct gacaatcgat agatacctgg ctattgtcca tgctgtgttt 481 gctttaaaag ccaggacggt cacctttggg gtggtgacaa gtgtgatcac ctggttggtg 541 gctgtgtttg cttctgtccc aggaatcatc tttactaaat gccagaaaga agattctgtt 601 tatgtctgtg gcccttattt tccacgagga tggaataatt tccacacaat aatgaggaac 661 attttggggc tggtcctgcc gctgctcatc atggtcatct gctactcggg aatcctgaaa 721 accctgcttc ggtgtcgaaa cgagaagaag aggcataggg cagtgagagt catcttcacc 781 atcatgattg tttactttct cttctggact ccctataaca ttgtcattct cctgaacacc 841 ttccaggaat tcttcggcct gagtaactgt gaaagcacca gtcaactgga ccaagccacg 901 caggtgacag agactcttgg gatgactcac tgctgcatca atcccatcat ctatgccttc 961 gttggggaga agttcagaag cctttttcac atagctcttg gctgtaggat tgccccactc 1021 caaaaaccag tgtgtggagg tccaggagtg agaccaggaa agaatgtgaa agtgactaca 1081 caaggactcc tcgatggtcg tggaaaagga aagtcaattg gcagagcccc tgaagccagt 1141 cttcaggaca aagaaggagc ctagagacag aaatgacaga tctctgcttt ggaaatcaca 1201 cgtctggctt cacagatgtg tgattcacag tgtgaatctt ggtgtctacg ttaccaggca 1261 ggaaggctga gaggagagag actccagctg ggttggaaaa cagtattttc caaactacct 1321 tccagttcct catttttgaa tacaggcata gagttcagac tttttttaaa tagtaaaaat 1381 aaaattaaag ctgaaaactg caacttgtaa atgtggtaaa gagttagttt gagttgctat 1441 catgtcaaac gtgaaaatgc tgtattagtc acagagataa ttctagcttt gagcttaaga 1501 attttgagca ggtggtatgt ttgggagact gctgagtcaa cccaatagtt gttgattggc 1561 aggagttgga agtgtgtgat ctgtgggcac attagcctat gtgcatgcag catctaagta 1621 atgatgtcgt ttgaatcaca gtatacgctc catcgctgtc atctcagctg gatctccatt 1681 ctctcaggct tgctgccaaa agccttttgt gttttgtttt gtatcattat gaagtcatgc 1741 gtttaatcac attcgagtgt ttcagtgctt cgcagatgtc cttgatgctc atattgttcc 1801 ctaatttgcc agtgggaact cctaaatcaa attggcttct aatcaaagct tttaaaccct 1861 attggtaaag aatggaaggt ggagaagctc cctgaagtaa gcaaagactt tcctcttagt 1921 cgagccaagt taagaatgtt cttatgttgc ccagtgtgtt tctgatctga tgcaagcaag 1981 aaacactggg cttctagaac caggcaactt gggaactaga ctcccaagct ggactatggc 2041 tctactttca ggccacatgg ctaaagaagg tttcagaaag aagtggggac agagcagaac 2101 tttcaccttc atatatttgt atgatcctaa tgaatgcata aaatgttaag ttgatggtga 2161 tgaaatgtaa atactgtttt taacaactat gatttggaaa ataaatcaat gctataacta 2221 tgttgataaa ag // LOCUS HSU03884 2489 bp mRNA PRI 10-AUG-1994 DEFINITION Human inwardly rectifying K+ channel (ROMK1) mRNA, complete cds. ACCESSION U03884 NID g433142 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2489) AUTHORS Yano,H., Philipson,L.H., Kugler,J.L., Tokuyama,Y., Davis,E.M., Le Beau,M.M., Nelson,D.J., Bell,G.I. and Takeda,J. TITLE Alternative splicing of human inwardly rectifying K+ channel ROMK1 mRNA JOURNAL Mol. Pharmacol. 45 (5), 854-860 (1994) MEDLINE 94247391 REFERENCE 2 (bases 1 to 2489) AUTHORS Takeda,J. TITLE Direct Submission JOURNAL Submitted (01-DEC-1993) Jun Takeda, Howard Hughes Medical Institute, The University of Chicago, 5841 S. Maryland Ave., Chicago, IL 60637, USA FEATURES Location/Qualifiers source 1..2489 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="lambda hROMK-1 and lambda hROMK-5" /clone_lib="adult human kidney cortex cDNA library in lambda gt10" /map="11q24" /tissue_type="kidney" /dev_stage="adult" gene 252..1421 /gene="ROMK1" CDS 252..1421 /gene="ROMK1" /codon_start=1 /product="inwardly rectifying K+ channel" /db_xref="PID:g433143" /translation="MPTVYLCSEQIRVLTESMFKHLRKWVVTRFFGHSRQRARLVSKD GRCNIEFGNVEAQSRFIFFVDIWTTVLDLKWRYKMTIFITAFLGSWFFFGLLWYAVAY IHKDLPEFHPSANHTPCVENINGLTSAFLFSLETQVTIGYGFRCVTEQCATAIFLLIF QSILGVIINSFMCGAILAKISRPKKRAKTITFSKNAVISKRGGKLCLLIRVANLRKSL LIGSHIYGKLLKTTVTPEGETIILDQININFVVDAGNENLFFISPLTIYHVIDHNSPF FHMAAETLLQQDFELVVFLDGTVESTSATCQVRTSYVPEEVLWGYRFAPIVSKTKEGK YRVDFHNFSKTVEVETPHCAMCLYNEKDVRARMKRGYDNPNFILSEVNETDDTKM" BASE COUNT 735 a 511 c 541 g 702 t ORIGIN 1 atcaacaggg cctcgggtac cctcacctag catatccaaa ctcttgcatc aaaggtgcag 61 ggacttgctc acatcgagaa tctggttgct ttcttggaga ccaagaaaat gagtttttgt 121 ttctacattt actccagcaa tccatgagga ctttatagga attttgcacc attctgaatg 181 gatacatttg gatttctcaa catttgttca gcttcctaat gactgttgtg acaattgctc 241 tataccagtg aatgccaact gtttatctct gctctgaaca gatcagggtg ttgacagaaa 301 gtatgttcaa acatcttcgg aaatgggtcg tcactcgctt ttttgggcat tctcggcaaa 361 gagcaaggct agtctccaaa gatggaaggt gcaacataga atttggcaat gtggaggcac 421 agtcaaggtt tatattcttt gtggacatct ggacaacggt acttgacctc aagtggagat 481 acaaaatgac cattttcatc acagccttct tggggagttg gtttttcttt ggtctcctgt 541 ggtatgcagt agcgtacatt cacaaagacc tcccggaatt ccatccttct gccaatcaca 601 ctccctgtgt ggagaatatt aatggcttga cctcagcttt tctgttttct ctggagactc 661 aagtgaccat tggatatgga ttcaggtgtg tgacagaaca gtgtgccact gccatttttc 721 tgcttatctt tcagtctata cttggagtta taatcaattc tttcatgtgt ggggccatct 781 tagccaagat ctccaggccc aaaaaacgtg ccaagaccat tacgttcagc aagaacgcag 841 tgatcagcaa acggggaggg aagctttgcc tcctaatccg agtggctaat ctcaggaaga 901 gccttcttat tggcagtcac atttatggaa agcttctgaa gaccacagtc actcctgaag 961 gagagaccat tattttggac cagatcaata tcaactttgt agttgacgct gggaatgaaa 1021 atttattctt catctcccca ttgacaattt accatgtcat tgatcacaac agccctttct 1081 tccacatggc agcggagacc cttctccagc aggactttga attagtggtg tttttagatg 1141 gcacagtgga gtccaccagt gctacctgcc aagtccggac atcctatgtc ccagaggagg 1201 tgctttgggg ctaccgtttt gctcccatag tatccaagac aaaggaaggg aaataccgag 1261 tggatttcca taactttagc aagacagtgg aagtggagac ccctcactgt gccatgtgcc 1321 tttataatga gaaagatgtt agagccagga tgaagagagg ctatgacaac cccaacttca 1381 tcttgtcaga agtcaatgaa acagatgaca ccaaaatgta acagtggctt ttcaacggga 1441 gtaaagcaaa gtctctaaag ctcctagtac ctagaagcat tatgaagcag tcaacaattt 1501 aggggtacga aagtaggatg agagccttca aagtctacca gcacaaagac ccctgagccc 1561 cgcaattgtg atcccacaag acatgcatct ccacaaggct actgtattag aacgtgcaat 1621 gcatttatat gaaactggtg tatggaagac ataggtgctc tcttgaaatc ttaaatatga 1681 ttatttgagc tcatataagg tggattggag cagataaaat tatcaaaagt ttcatgaaca 1741 ggccaaacaa aatatttttt aaagtttcct taaagaagtt atgaacttta gaaaggatca 1801 ggggacaata ataatctcat tttgattcta ctgataagaa tgactccact tttaatgtgg 1861 acttttactc atggaaaaat tgtctcctaa tttggggaga tgaaccaacc aatcaatgac 1921 aagaaaacgc ttacacaaag aacaatttga ggctctaagc ttctcatgtg gtacgtttag 1981 acagaggcta aatctgcaca ctagaatctt gatgatacct tcctgcaaga cagaatgctt 2041 tagttaaaag tggtgatgat atttctttca atctgtattg gatggcttaa agggctataa 2101 atctgtttat aaagagcatt tcctgctctt cgaagacagc aatgaggagt tggaaggtgc 2161 aaagtcagta gagaagggaa tgtatcatta atgcacctga gaagaaacag tttcatgtgt 2221 tcctccacct agagtttgta ctggaatgct atttctaaag aagaagtggg aaagagagag 2281 gaatgggatg gagccccaca gtcagaatgt tactatgtct ttctttccct gacagcccat 2341 cttcctaaaa gggaccagct tatggaaggc tcgaccttga ggggaaagtt ttactgtgaa 2401 agtcttcttc agatccccac ctgcatcatt ccgaatgtgt cctggaaaaa aactggtact 2461 caaagctgct taggaatcaa aatgttttc // LOCUS HSU03886 2809 bp mRNA PRI 08-MAR-1994 DEFINITION Human GS2 mRNA, complete cds. ACCESSION U03886 NID g458225 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2809) AUTHORS Lee,W., Salido,E. and Yen,P.H. TITLE Isolation of a gene GS2 (DXS1283E) from a CpG island between STS and KAL on the distal short arm of the human X chromosome JOURNAL Unpublished REFERENCE 2 (bases 1 to 2809) AUTHORS Yen,P.H. TITLE Direct Submission JOURNAL Submitted (01-DEC-1993) Pauline H. Yen, Division of Medical Genetics, E-4, Harbor-UCLA Medical Center, 1124 W. Carson St., Torrance, CA 90502, USA FEATURES Location/Qualifiers source 1..2809 /organism="Homo sapiens" /note="GDB DSEG Number: DXS1283E" /db_xref="taxon:9606" /clone="Lambda GS2-1" /clone_lib="Clontech human brain, cerebellum cDNA (HL1128a)" /map="Xp22.3" /chromosome="X" /tissue_type="brain" exon 1..117 /number=1 /evidence=experimental exon 118..310 /number=2 /evidence=experimental gene 131..892 /gene="GS2" CDS 131..892 /gene="GS2" /note="a gene isolated from a CpG island between STS and KAL" /codon_start=1 /db_xref="PID:g458226" /translation="MKHINLSFAACGFLGIYHLGAASALCRHGKKLVKDVKAFAGASA GSLVASVLLTAPEKIEECNQFTYKFAEEIRRQSFGAVTPGYDFMARLRSGMESILPPS AHELAQNRLHVSITNAKTRENHLVSTFSSREDLIKVLLASSFVPIYAGLKLVEYKGQK WVDGGLTNALPILPVGRTVTISPFSGRLDISPQDKGQLDLYVNIAKQDIMLSLANLVR LNQALFPPSKRKMESLYQCGFDDTVKFLLKENWFE" exon 311..405 /gene="GS2" /number=3 /evidence=experimental exon 406..541 /gene="GS2" /number=4 /evidence=experimental exon 542..607 /gene="GS2" /number=5 /evidence=experimental exon 608..760 /gene="GS2" /number=6 /evidence=experimental exon 761..2809 /number=7 /evidence=experimental polyA_signal 888..893 repeat_region 1899..1944 /note="region of CT dinucleotide repeat polymorphism" polyA_signal 2654..2659 BASE COUNT 779 a 592 c 594 g 844 t ORIGIN 1 ccggctcgcg gagagcgtag cgcggccttg gtggcggaat ggcgttgagt gacggcccgg 61 ccccgccatc tggttaaagg gactcgttca acacggaagt gtcccggggc tgcattgtgc 121 tacagctaga atgaagcaca tcaacctatc atttgcagcg tgtggatttc tgggcattta 181 ccacttgggg gcagcatctg cactttgcag acatggcaaa aaacttgtga aggatgtcaa 241 agccttcgct ggggcgtctg cgggatcgtt ggttgcttct gttctgctaa cagcaccaga 301 aaaaatagag gaatgtaacc aatttaccta caagtttgcc gaagaaatca gaaggcagtc 361 tttcggggca gtaacgcccg gttatgactt catggcccga ctaagaagtg ggatggagtc 421 gattcttcct cccagcgctc acgagctggc ccagaaccga ctgcacgtat ccatcaccaa 481 cgccaaaacc agagaaaatc acttagtctc cactttttcc tccagggagg acctcattaa 541 ggtcctccta gccagcagtt ttgtgcccat ttatgcagga ctgaagctag tggaatacaa 601 agggcagaag tgggtggacg gaggcctcac caacgctctt cccatcctgc ccgtcggccg 661 gacagtaacc atctccccct tcagtggacg actggacatc tccccgcagg acaaagggca 721 gctagatctg tatgttaata tcgccaagca ggatatcatg ttgtccctgg caaacctggt 781 gagactcaac caagcccttt ttcccccaag caagaggaaa atggaatctt tgtatcagtg 841 tggttttgat gacactgtta agtttttact taaagaaaat tggtttgaat aaaatgcata 901 aaagtttata atgcaaaaca cgttttagat agtttttgat ggaagtttct aatcaaatcc 961 tttttagaaa atctatcatg actcaattgt attactcctt gtaatatttg tgtttatttt 1021 taatatttga attattactt agcacatgga tataagaggc actttattag aatttgacaa 1081 cgtcattaag aaggatacac ttaggtgata aggaatcatt tttggtgaac tgttgtgtcc 1141 tttaaaaaat ggaggaggat aggttctttg ggtaaattga ggaacatgga aaatggaggg 1201 ccaccactta ggttccatgg agaatatgtg gcatgatctt gacgtgaagc atgggtgttt 1261 ccgtgactta gctgtgtccc tgtagctctg gaactgaagg aactaggtag gagaggaagg 1321 atttgagaaa taggcaactc atggttaaag acctttgatc aggactggag gaacaaaggt 1381 ttgtgtatta gtccattttc acactgctga gaaagacata cccaagactg ggcaatttac 1441 aaaagaaagg tttaatgggc tcacagttcc acatgactga ggaggcctca caattgtggt 1501 ggaaggtgaa aggcatgtct cacatggcag cagacaagag aagaaaactt gtgcagggaa 1561 actccccttt ataaaaccat cggatctctt gagacgcgtt cactatcatg agaatagcat 1621 gggaaagacc tgcctccatg attcagttac ctcccactgg gtccctccca caacgtgtgg 1681 caattgtggg agctacatac acttcaagat gagatttggg tggggacaca gccaaaccat 1741 aacagtttgc tttccacagg tgtgtgcagt tgctcccagt gcagttaggg gcagatatcc 1801 agaagcctag gagattttga agaggcttat cccctctcat cttcctcccc taaaccccac 1861 cccagtttag gagattatca agctggttgt cattcattct ctctctctct ctctctctct 1921 ctctctctct ctctctctct ctctcgccat acctctctga gtatttagct attgttctat 1981 atactggata catctattat gggaacttaa atagataaaa tactactaaa cacaaagaac 2041 caagtagcct agtaaataag aattaaatcc agaataatca ttcacttctc atttttaaat 2101 cacgttgtac tgatcccagg ttttttactt tctgagtagt tgtcgatcac tttgaggact 2161 tgtgtatggg cttgtaccaa tttactgaca ctgtcttaaa tcctaaatct gtttttatga 2221 ggttttgcca agcaagaact cttgattact aagaggcagt tagagacaaa ccaatttaat 2281 tccatgtctt caattgtcac gcatttcttc ttttactctt tgaaactatt ctttgagtca 2341 attttatggt tctcagaagc caaaatacac aacttttagc acataaacac caacgatggc 2401 ctctttttga ggagttatgc atagacccac tctagagtaa tgatggtccc tgtggtatat 2461 actttctcct actctagcaa acatgtagtt taatcttaat gtgttgtttc cataagtgac 2521 atgaagtgga tagattctca attgttatgt ccacttattc actaggtaaa ttttcagttt 2581 taatactttt ctccttaccc cttcctttga tcatttcatg tgaatattct atttgtatga 2641 tacactgtat ttcaataaat tccttgttga tgtaccctta aattgaagaa atttaagctg 2701 caaaaccaaa tctcattgta taagactttt ttgaagtatc ttgtatcgac tacatatgta 2761 tttgaccctg tgggaggatg gtacttttct ttttaacttt tattttaag // LOCUS HSU03911 3080 bp mRNA PRI 06-JUN-1994 DEFINITION Human mutator gene (hMSH2) mRNA, complete cds. ACCESSION U03911 NID g454360 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3080) AUTHORS Fishel,R., Lescoe,M.K., Rao,M.R., Copeland,N.G., Jenkins,N.A., Garber,J., Kane,M. and Kolodner,R. TITLE The human mutator gene homolog MSH2 and its association with hereditary nonpolyposis colon cancer JOURNAL Cell 75, 1027-1038 (1993) MEDLINE 94073959 REFERENCE 2 (bases 1 to 3080) AUTHORS Fishel,R., Lescoe,M.K., Rao,M.R., Copeland,N.G., Jenkins,N.A., Garber,J., Kane,M. and Kolodner,R. TITLE The human mutator gene homolog MSH2 and its association with hereditary nonpolyposis colon cancer JOURNAL Cell 77, 167 (1994) MEDLINE 94208055 REFERENCE 3 (bases 1 to 3080) AUTHORS Kolodner,R.D. TITLE Direct Submission JOURNAL Submitted (02-DEC-1993) Richard D. Kolodner, Cell and Molecular Biology, Dana-Farber Cancer Institute, 44 Binney Street, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..3080 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="2" /map="2p22-21" gene 4..2808 /gene="hMSH2" CDS 4..2808 /gene="hMSH2" /note="homolog of S. cerevisiae Msh2p [Swiss-Prot accession number P25847] and bacterial MutS proteins [Swiss-Prot accession numbers P23909, P10339, and P27345]" /codon_start=1 /db_xref="PID:g433147" /translation="MAVQPKETLQLESAAEVGFVRFFQGMPEKPTTTVRLFDRGDFYT AHGEDALLAAREVFKTQGVIKYMGPAGAKNLQSVVLSKMNFESFVKDLLLVRQYRVEV YKNRAGNKASKENDWYLAYKASPGNLSQFEDILFGNNDMSASIGVVGVKMSAVDGQRQ VGVGYVDSIQRKLGLCEFPDNDQFSNLEALLIQIGPKECVLPGGETAGDMGKLRQIIQ RGGILITERKKADFSTKDIYQDLNRLLKGKKGEQMNSAVLPEMENQVAVSSLSAVIKF LELLSDDSNFGQFELTTFDFSQYMKLDIAAVRALNLFQGSVEDTTGSQSLAALLNKCK TPQGQRLVNQWIKQPLMDKNRIEERLNLVEAFVEDAELRQTLQEDLLRRFPDLNRLAK KFQRQAANLQDCYRLYQGINQLPNVIQALEKHEGKHQKLLLAVFVTPLTDLRSDFSKF QEMIETTLDMDQVENHEFLVKPSFDPNLSELREIMNDLEKKMQSTLISAARDLGLDPG KQIKLDSSAQFGYYFRVTCKEEKVLRNNKNFSTVDIQKNGVKFTNSKLTSLNEEYTKN KTEYEEAQDAIVKEIVNISSGYVEPMQTLNDVLAQLDAVVSFAHVSNGAPVPYVRPAI LEKGQGRIILKASRHACVEVQDEIAFIPNDVYFEKDKQMFHIITGPNMGGKSTYIRQT GVIVLMAQIGCFVPCESAEVSIVDCILARVGAGDSQLKGVSTFMAEMLETASILRSAT KDSLIIIDELGRGTSTYDGFGLAWAISEYIATKIGAFCMFATHFHELTALANQIPTVN NLHVTALTTEETLTMLYQVKKGVCDQSFGIHVAELANFPKHVIECAKQKALELEEFQY IGESQGYDIMEPAAKKCYLEREQGEKIIQEFLSKVKQMPFTEMSEENITIKLKQLKAE VIAKNNSFVNEIISRIKVTT" polyA_site 3059 BASE COUNT 981 a 538 c 699 g 862 t ORIGIN 1 gacatggcgg tgcagccgaa ggagacgctg cagttggaga gcgcggccga ggtcggcttc 61 gtgcgcttct ttcagggcat gccggagaag ccgaccacca cagtgcgcct tttcgaccgg 121 ggcgacttct atacggcgca cggcgaggac gcgctgctgg ccgcccggga ggtgttcaag 181 acccaggggg tgatcaagta catggggccg gcaggagcaa agaatctgca gagtgttgtg 241 cttagtaaaa tgaattttga atcttttgta aaagatcttc ttctggttcg tcagtataga 301 gttgaagttt ataagaatag agctggaaat aaggcatcca aggagaatga ttggtatttg 361 gcatataagg cttctcctgg caatctctct cagtttgaag atattctctt tggtaacaat 421 gatatgtcag cttccattgg tgttgtgggt gttaaaatgt ccgcagttga tggccagaga 481 caggttggag ttgggtatgt ggattccata cagaggaaac taggactgtg tgaattccct 541 gataatgatc agttctccaa tcttgaggct ctcctcatcc agattggacc aaaggaatgt 601 gttttacccg gaggagagac tgctggagac atggggaaac tgagacagat aattcaaaga 661 ggaggaattc tgatcacaga aagaaaaaaa gctgactttt ccacaaaaga catttatcag 721 gacctcaacc ggttgttgaa aggcaaaaag ggagagcaga tgaatagtgc tgtattgcca 781 gaaatggaga atcaggttgc agtttcatca ctgtctgcgg taatcaagtt tttagaactc 841 ttatcagatg attccaactt tggacagttt gaactgacta cttttgactt cagccagtat 901 atgaaattgg atattgcagc agtcagagcc cttaaccttt ttcagggttc tgttgaagat 961 accactggct ctcagtctct ggctgccttg ctgaataagt gtaaaacccc tcaaggacaa 1021 agacttgtta accagtggat taagcagcct ctcatggata agaacagaat agaggagaga 1081 ttgaatttag tggaagcttt tgtagaagat gcagaattga ggcagacttt acaagaagat 1141 ttacttcgtc gattcccaga tcttaaccga cttgccaaga agtttcaaag acaagcagca 1201 aacttacaag attgttaccg actctatcag ggtataaatc aactacctaa tgttatacag 1261 gctctggaaa aacatgaagg aaaacaccag aaattattgt tggcagtttt tgtgactcct 1321 cttactgatc ttcgttctga cttctccaag tttcaggaaa tgatagaaac aactttagat 1381 atggatcagg tggaaaacca tgaattcctt gtaaaacctt catttgatcc taatctcagt 1441 gaattaagag aaataatgaa tgacttggaa aagaagatgc agtcaacatt aataagtgca 1501 gccagagatc ttggcttgga ccctggcaaa cagattaaac tggattccag tgcacagttt 1561 ggatattact ttcgtgtaac ctgtaaggaa gaaaaagtcc ttcgtaacaa taaaaacttt 1621 agtactgtag atatccagaa gaatggtgtt aaatttacca acagcaaatt gacttcttta 1681 aatgaagagt ataccaaaaa taaaacagaa tatgaagaag cccaggatgc cattgttaaa 1741 gaaattgtca atatttcttc aggctatgta gaaccaatgc agacactcaa tgatgtgtta 1801 gctcagctag atgctgttgt cagctttgct cacgtgtcaa atggagcacc tgttccatat 1861 gtacgaccag ccattttgga gaaaggacaa ggaagaatta tattaaaagc atccaggcat 1921 gcttgtgttg aagttcaaga tgaaattgca tttattccta atgacgtata ctttgaaaaa 1981 gataaacaga tgttccacat cattactggc cccaatatgg gaggtaaatc aacatatatt 2041 cgacaaactg gggtgatagt actcatggcc caaattgggt gttttgtgcc atgtgagtca 2101 gcagaagtgt ccattgtgga ctgcatctta gcccgagtag gggctggtga cagtcaattg 2161 aaaggagtct ccacgttcat ggctgaaatg ttggaaactg cttctatcct caggtctgca 2221 accaaagatt cattaataat catagatgaa ttgggaagag gaacttctac ctacgatgga 2281 tttgggttag catgggctat atcagaatac attgcaacaa agattggtgc tttttgcatg 2341 tttgcaaccc attttcatga acttactgcc ttggccaatc agataccaac tgttaataat 2401 ctacatgtca cagcactcac cactgaagag accttaacta tgctttatca ggtgaagaaa 2461 ggtgtctgtg atcaaagttt tgggattcat gttgcagagc ttgctaattt ccctaagcat 2521 gtaatagagt gtgctaaaca gaaagccctg gaacttgagg agtttcagta tattggagaa 2581 tcgcaaggat atgatatcat ggaaccagca gcaaagaagt gctatctgga aagagagcaa 2641 ggtgaaaaaa ttattcagga gttcctgtcc aaggtgaaac aaatgccctt tactgaaatg 2701 tcagaagaaa acatcacaat aaagttaaaa cagctaaaag ctgaagtaat agcaaagaat 2761 aatagctttg taaatgaaat catttcacga ataaaagtta ctacgtgaaa aatcccagta 2821 atggaatgaa ggtaatattg ataagctatt gtctgtaata gttttatatt gttttatatt 2881 aacccttttt ccatagtgtt aactgtcagt gcccatgggc tatcaactta ataagatatt 2941 tagtaatatt ttactttgag gacattttca aagattttta ttttgaaaaa tgagagctgt 3001 aactgaggac tgtttgcaat tgacataggc aataataagt gatgtgctga attttataaa 3061 taaaatcatg tagtttgtgg // LOCUS HSU04209 1955 bp mRNA PRI 26-MAR-1996 DEFINITION Human associated microfibrillar protein mRNA, complete cds. ACCESSION U04209 NID g434655 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1955) AUTHORS Yeh,H., Chow,M., Abrams,W.R., Fan,J., Foster,J., Mitchell,H., Muenke,M. and Rosenbloom,J. TITLE Structure of the human gene encoding the associated microfibrillar protein (MFAP1) and localization to chromosome 15q15-q21 JOURNAL Genomics 23 (2), 443-449 (1994) MEDLINE 95137591 REFERENCE 2 (bases 1 to 1955) AUTHORS Abrams,W.R. TITLE Direct Submission JOURNAL Submitted (08-DEC-1993) William R. Abrams, Anatomy and Histology, University of Pennsylvania, School of Dental Medicine, 4001 Spruce Street, Philadelphia, PA 19104, USA FEATURES Location/Qualifiers source 1..1955 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="15" /map="15q15-q21" /sex="female" /cell_line="cc102" /cell_type="fibroblast" /tissue_type="skin" /dev_stage="adult" 5'UTR 1..118 CDS 119..1438 /codon_start=1 /product="associated microfibrillar protein" /db_xref="PID:g434656" /translation="MSVPSALMKQPPIQSTAGAVPVRNEKGEISMEKVKVKRYVSGKR PDYAPMESSDEEDEEFQFIKKAKEQEAEPEEQEEDSSSDPRLRRLQNRISEDVEERLA RHRKIVEPEVVGESDSEVEGDAWRMEREDSSEEEEEEIDDEEIERRRGMMRQRAQERK NEEMEVMEVEDEGRSGEESESESEYEEYTDSEDEMEPRLKPVFIRKKDRVTVQEREAE ALKQKELEQEAKRMAEERRQYTLQIVGEETPKELEENKRSLAALDALNTDDENDEEEY EAWKVRELKRIKRDREDREALEKEKAEIERMRNLTEEERRAELRANGKVITNKAVKGK YKFLQKYYHRGAFFMDEDEEVYKRDFSAPTLEDHFNKTILPKVMQVKNFGRSGRTKYT HLVDQDTTSFDSAWGQESAQNTKFFKQKAAGVRDVFERPSAKKRKTT" 3'UTR 1439..1955 polyA_site 1930..1935 BASE COUNT 603 a 374 c 533 g 443 t 2 others ORIGIN 1 ggtcgcgcag ctgtgttcgc ggactcaggt ggaaggaatt tcttctcttc gttgacgttg 61 ctggtgttca ctgtttggaa ttagtcaagt ttcgggaatc accgtcgctg ccatcaacat 121 gtcggtccca agcgctctca tgaagcaacc gcccattcag tctacggctg gggccgtccc 181 agttcgcaat gagaaaggtg agatttcaat ggaaaaagtg aaggtaaagc gttatgtgtc 241 cggaaaaagg ccagactatg cccctatgga gtcctcagat gaggaggatg aagaatttca 301 gttcattaag aaagccaaag aacaagaagc agagcctgag gaacaggagg aggattcatc 361 cagtgacccc cggctacggc gtttacagaa ccgtattagt gaagatgtgg aagagagatt 421 ggctcgacat cgaaaaatag tggaacctga agtggtagga gagagtgact cagaagtaga 481 aggagatgct tggcgcatgg aacgagaaga cagcagtgaa gaagaggagg aggaaattga 541 tgatgaggaa atagagcggc ggcgtggcat gatgcgtcag cgagcacagg agagaaaaaa 601 tgaagagatg gaagtcatgg aagtggaaga tgagggtcgt tctggagagg agtcagaatc 661 agagtctgag tatgaagagt acacagacag tgaagatgag atggagcctc gccttaagcc 721 agtcttcatt cgaaagaagg accgagtgac agttcaagaa cgtgaagccg aagcattgaa 781 acagaaggag ctggagcagg aagccaaacg catggctgag gaaaggcgcc agtacacact 841 ccagattgtc ggagaggaaa ccccaaaaga gctggaagag aacaagcgat ccctggctgc 901 attggatgca ctcaatactg atgatgaaaa tgatgaggag gaatatgagg catggaaagt 961 tcgagagcta aaaagaatca agagggacag agaagatcga gaagcgcttg agaaggagaa 1021 agcagaaatt gaacgcatgc gaaacctgac tgaggaagag aggagagctg aacttcgggc 1081 aaacggcaaa gtcattacca acaaagctgt taagggcaaa tacaagttct tacagaagta 1141 ttatcaccgg ggtgccttct tcatggatga ggatgaagaa gtatacaaga gagatttcag 1201 cgctcctacc ctggaggatc atttcaataa aaccattctt cctaaagtca tgcaggtcaa 1261 gaactttgga cgctcaggtc gcaccaaata cactcacctt gtggatcaag ataccacctc 1321 ctttgactca gcttggggcc aagagagtgc ccagaacaca aagttcttca aacaaaaggc 1381 agctggggta cgagatgtat ttgagcggcc atctgccaag aagcggaaaa ctacctaggg 1441 tccaactgct tattcttcca actgtggaac acaaggggag tctcagcatc tggtccttga 1501 ttgggttttt tcattgtttc cttggcccct gtatccagat attggactta ctgctatact 1561 tgtgatactg ggtagcccag actttgaagg tgctttgtga ggtttggact catgctgaga 1621 aacccacagg aaagcactgt ccaggtagga ttagaggctt cccacttaaa actatttctg 1681 agaaatctta ggttttatca ctgctatggt ttcccatatt tacttgggac tgttctgact 1741 ttctttttcc agcccttagc ttgggttaga aaagtggaca tgtaagtgaa caatgcatta 1801 cttctacctt aggtttagga gtaatatacc cggaaatcta agctcatgga aacatgtttt 1861 ccatttgggg ttggagtccg tttttctagt tgtacatact tgnggatcca tatatgtgtg 1921 catgtcawga aataaaagaa tcacacaaca aaaaa // LOCUS HSU04241 1288 bp mRNA PRI 03-FEB-1994 DEFINITION Human homolog of Drosophila enhancer of split m9/m10 mRNA, complete cds. ACCESSION U04241 NID g452447 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1288) AUTHORS Scala,L.A., Piparo,K.E., Tirumalai,P.S. and Howells,R.D. TITLE Molecular cloning, sequence analysis and characterization of a human homolog of Drosophila enhancer of split m9/m10 JOURNAL Unpublished REFERENCE 2 (bases 1 to 1288) AUTHORS Howells,R.D. TITLE Direct Submission JOURNAL Submitted (09-DEC-1993) Richard D. Howells, Biochemistry and Molecular Biology, New Jersey Medical School, 185 South Orange Avenue, Newark, NJ 07103, USA FEATURES Location/Qualifiers source 1..1288 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="SHSY5Y" /cell_type="neuroblastoma" 5'UTR 1..40 CDS 41..634 /note="homologous to Swiss-Prot accession number P16371; enhancer of split m9/m10 (groucho protein)" /codon_start=1 /db_xref="PID:g435425" /translation="MMFPQSRHSGSSHLPQQLKFTTSDSCDRIKDEFQLLQAQYHSLK LECDKLASEKSEMQRHYVMYYEMSYGLNIEMHKQAEIVKRLNGICAQVLPYLSQEHQQ QVLGAIERAKQVTAPELNSIIRQQLQAHQLSQLQALALPLTPLPVGLQPPSLPAVSAG TGLLSLSALGSQAHLSKEDKNGHDGDTHQEDDGEKSD" 3'UTR 635..1288 polyA_signal 1271..1276 BASE COUNT 274 a 434 c 358 g 222 t ORIGIN 1 ggggaatttc ccgcagcccg cgccccccgc cgcgattgac atgatgtttc cacaaagcag 61 gcattcgggc tcctcgcacc taccccagca actcaaattc accacctcgg actcctgcga 121 ccgcatcaaa gacgaatttc agctactgca agctcagtac cacagcctca agctcgaatg 181 tgacaagttg gccagtgaga agtcagagat gcagcgtcac tatgtgatgt actacgagat 241 gtcctacggc ttgaacatcg agatgcacaa acaggctgag atcgtcaaaa ggctgaacgg 301 gatttgtgcc caggtcctgc cctacctctc ccaagagcac cagcagcagg tcttgggagc 361 cattgagagg gccaagcagg tcaccgctcc cgagctgaac tctatcatcc gacagcagct 421 ccaagcccac cagctgtccc agctgcaggc cctggccctg cccttgaccc cactacccgt 481 ggggctgcag ccgccttcgc tgccggcggt cagcgcaggc accggcctcc tctcgctgtc 541 cgcgctgggt tcccaggccc acctctccaa ggaagacaag aacgggcacg atggtgacac 601 ccaccaggag gatgatggcg agaagtcgga ttagcagggg gccgggacag ggaggttggg 661 aggggggaca gaggggagac agaggcacgg agagaaagga atgtttagca caagacacag 721 cggagctcgg gattggctaa tctcccatag tatttatggt ggcgccggcg gggccccagc 781 ccagcttgca ggccacctct agctttcttc ctaccccatt ccggcttccc tcctcctccc 841 ctgcagcctg gttaggtgga tacctgccct gacatgtgag gcaagctaag gcctggaggg 901 tcagatggga gaccaggtcc caagggagca agacctgcga agcgcagcag ccccggccct 961 tcccccgttt tgaacatgtg taaccgacag tctgccctgg gccacagccc tctcaccctg 1021 gtactgcatg cacgcaatgc tagctgcccc tttcccgtcc tgggcacccc gagtctcccc 1081 cgaccccggg tcccaggtat gctcccacct ccacctgccc cactcaccac ctctgctagt 1141 tccagacacc tccacgccca cctggtcctc tcccatcgcc cacaaaaggg ggggcacgag 1201 ggacgagctt agctgagctg ggaggagcag ggtgagggtg ggcgacccag gattccccct 1261 ccccttccca aataaagatg agggtact // LOCUS HSU04270 4070 bp mRNA PRI 28-FEB-1995 DEFINITION Human putative potassium channel subunit (h-erg) mRNA, complete cds. ACCESSION U04270 NID g487737 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 184 to 3663) AUTHORS Warmke,J.W. and Ganetzky,B. TITLE A family of potassium channel genes related to eag in Drosophila and mammals JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91 (8), 3438-3442 (1994) MEDLINE 94211879 REFERENCE 2 (bases 1 to 4070) AUTHORS Warmke,J.W. TITLE Direct Submission JOURNAL Submitted (09-DEC-1993) Jeffrey W. Warmke, Genetics and Molecular Biology, Merck Research Laboratories, 126 East Lincoln Avenue, P.O. Box 2000, Rahway, NJ 07065, USA FEATURES Location/Qualifiers source 1..4070 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pBII+HH1, pBII+HH10, pBHH10-4.5" /clone_lib="Stratagene Number 936205 Human hippocampus cDNA library" /chromosome="7" /tissue_type="hippocampus" /dev_stage="2 year old" /sex="female" gene 184..3663 /gene="h-erg" CDS 184..3663 /gene="h-erg" /standard_name="human eag related gene" /codon_start=1 /product="putative potassium channel subunit" /db_xref="PID:g487738" /translation="MPVRRGHVAPQNTFLDTIIRKFEGQSRKFIIANARVENCAVIYC NDGFCELCGYSRAEVMQRPCTCDFLHGPRTQRRAAAQIAQALLGAEERKVEIAFYRKD GSCFLCLVDVVPVKNEDGAVIMFILNFEVVMEKDMVGSPAHDTNHRGPPTSWLAPGRA KTFRLKLPALLALTARESSVRSGGAGGAGAPGAVVVDVDLTPAAPSSESLALDEVTAM DNHVAGLGPAEERRALVGPGSPPRSAPGQLPSPRAHSLNPDASGSSCSLARTRSRESC ASVRRASSADDIEAMRAGVLPPPPRHASTGAMHPLRSGLLNSTSDSDLVRYRTISKIP QITLNFVDLKGDPFLASPTSDREIIAPKIKERTHNVTEKVTQVLSLGADVLPEYKLQA PRIHRWTILHYSPFKAVWDWLILLLVIYTAVFTPYSAAFLLKETEEGPPATECGYACQ PLAVVDLIVDIMFIVDILINFRTTYVNANEEVVSHPGRIAVHYFKGWFLIDMVAAIPF DLLIFGSGSEELIGLLKTARLLRLVRVARKLDRYSEYGAAVLFLLMCTFALIAHWLAC IWYAIGNMEQPHMDSRIGWLHNLGDQIGKPYNSSGLGGPSIKDKYVTALYFTFSSLTS VGFGNVSPNTNSEKIFSICVMLIGSLMYASIFGNVSAIIQRLYSGTARYHTQMLRVRE FIRFHQIPNPLRQRLEEYFQHAWSYTNGIDMNAVLKGFPECLQADICLHLNRSLLQHC KPFRGATKGCLRALAMKFKTTHAPPGDTLVHAGDLLTALYFISRGSIEILRGDVVVAI LGKNDIFGEPLNLYARPGKSNGDVRALTYCDLHKIHRDDLLEVLDMYPEFSDHFWSSL EITFNLRDTNMIPGSPGSTELEGGFSRQRKRKLSFRRRTDKDTEQPGEVSALGPGRAG AGPSSRGRPGGPWGESPSSGPSSPESSEDEGPGRSSSPLRLVPFSSPRPPGEPPGGEP LMEDCEKSSDTCNPLSGAFSGVSNIFSFWGDSRGRQYQELPRCPAPTPSLLNIPLSSP GRRPRGDVESRLDALQRQLNRLETRLSADMATVLQLLQRQMTLVPPAYSAVTTPGPGP TSTSPLLPVSPLPTLTLDSLSQVSQFMACEELPPGAPELPQEGPTRRLSLPGQLGALT SQPLHRHGSDPGS" BASE COUNT 713 a 1413 c 1255 g 689 t ORIGIN 1 acgcggcctg ctcaggcctc cagcggccgg tcggagggga ggcgggaggc gagcgaggac 61 ccgcgcccgc agtccagtct gtgcgcgccc gtgctcgctt ggcgcggtgc gggaccagcg 121 ccggccaccc gaagcctagt gcgtcgccgg gtgggtgggc ccgcccggcg ccatgggctc 181 aggatgccgg tgcggagggg ccacgtcgcg ccgcagaaca ccttcctgga caccatcatc 241 cgcaagtttg agggccagag ccgtaagttc atcatcgcca acgctcgggt ggagaactgc 301 gccgtcatct actgcaacga cggcttctgc gagctgtgcg gctactcgcg ggccgaggtg 361 atgcagcgac cctgcacctg cgacttcctg cacgggccgc gcacgcagcg ccgcgctgcc 421 gcgcagatcg cgcaggcact gctgggcgcc gaggagcgca aagtggaaat cgccttctac 481 cggaaagatg ggagctgctt cctatgtctg gtggatgtgg tgcccgtgaa gaacgaggat 541 ggggctgtca tcatgttcat cctcaatttc gaggtggtga tggagaagga catggtgggg 601 tccccggctc atgacaccaa ccaccggggc ccccccacca gctggctggc cccaggccgc 661 gccaagacct tccgcctgaa gctgcccgcg ctgctggcgc tgacggcccg ggagtcgtcg 721 gtgcggtcgg gcggcgcggg cggcgcgggc gccccggggg ccgtggtggt ggacgtggac 781 ctgacgcccg cggcacccag cagcgagtcg ctggccctgg acgaagtgac agccatggac 841 aaccacgtgg cagggctcgg gcccgcggag gagcggcgtg cgctggtggg tcccggctct 901 ccgccccgca gcgcgcccgg ccagctccca tcgccccggg cgcacagcct caaccccgac 961 gcctcgggct ccagctgcag cctggcccgg acgcgctccc gagaaagctg cgccagcgtg 1021 cgccgcgcct cgtcggccga cgacatcgag gccatgcgcg ccggggtgct gcccccgcca 1081 ccgcgccacg ccagcaccgg ggccatgcac ccactgcgca gcggcttgct caactccacc 1141 tcggactccg acctcgtgcg ctaccgcacc attagcaaga ttccccaaat caccctcaac 1201 tttgtggacc tcaagggcga ccccttcttg gcttcgccca ccagtgaccg tgagatcata 1261 gcacctaaga taaaggagcg aacccacaat gtcactgaga aggtcaccca ggtcctgtcc 1321 ctgggcgccg acgtgctgcc tgagtacaag ctgcaggcac cgcgcatcca ccgctggacc 1381 atcctgcatt acagcccctt caaggccgtg tgggactggc tcatcctgct gctggtcatc 1441 tacacggctg tcttcacacc ctactcggct gccttcctgc tgaaggagac ggaagaaggc 1501 ccgcctgcta ccgagtgtgg ctacgcctgc cagccgctgg ctgtggtgga cctcatcgtg 1561 gacatcatgt tcattgtgga catcctcatc aacttccgca ccacctacgt caatgccaac 1621 gaggaggtgg tcagccaccc cggccgcatc gccgtccact acttcaaggg ctggttcctc 1681 atcgacatgg tggccgccat ccccttcgac ctgctcatct tcggctctgg ctctgaggag 1741 ctgatcgggc tgctgaagac tgcgcggctg ctgcggctgg tgcgcgtggc gcggaagctg 1801 gatcgctact cagagtacgg cgcggccgtg ctgttcttgc tcatgtgcac ctttgcgctc 1861 atcgcgcact ggctagcctg catctggtac gccatcggca acatggagca gccacacatg 1921 gactcacgca tcggctggct gcacaacctg ggcgaccaga taggcaaacc ctacaacagc 1981 agcggcctgg gcggcccctc catcaaggac aagtatgtga cggcgctcta cttcaccttc 2041 agcagcctca ccagtgtggg cttcggcaac gtctctccca acaccaactc agagaagatc 2101 ttctccatct gcgtcatgct cattggctcc ctcatgtatg ctagcatctt cggcaacgtg 2161 tcggccatca tccagcggct gtactcgggc acagcccgct accacacaca gatgctgcgg 2221 gtgcgggagt tcatccgctt ccaccagatc cccaatcccc tgcgccagcg cctcgaggag 2281 tacttccagc acgcctggtc ctacaccaac ggcatcgaca tgaacgcggt gctgaagggc 2341 ttccctgagt gcctgcaggc tgacatctgc ctgcacctga accgctcact gctgcagcac 2401 tgcaaaccct tccgaggggc caccaagggc tgccttcggg ccctggccat gaagttcaag 2461 accacacatg caccgccagg ggacacactg gtgcatgctg gggacctgct caccgccctg 2521 tacttcatct cccggggctc catcgagatc ctgcggggcg acgtcgtcgt ggccatcctg 2581 gggaagaatg acatctttgg ggagcctctg aacctgtatg caaggcctgg caagtcgaac 2641 ggggatgtgc gggccctcac ctactgtgac ctacacaaga tccatcggga cgacctgctg 2701 gaggtgctgg acatgtaccc tgagttctcc gaccacttct ggtccagcct ggagatcacc 2761 ttcaacctgc gagataccaa catgatcccg ggctcccccg gcagtacgga gttagagggt 2821 ggcttcagtc ggcaacgcaa gcgcaagttg tccttccgca ggcgcacgga caaggacacg 2881 gagcagccag gggaggtgtc ggccttgggg ccgggccggg cgggggcagg gccgagtagc 2941 cggggccggc cgggggggcc gtggggggag agcccgtcca gtggcccctc cagccctgag 3001 agcagtgagg atgagggccc aggccgcagc tccagccccc tccgcctggt gcccttctcc 3061 agccccaggc cccccggaga gccgccgggt ggggagcccc tgatggagga ctgcgagaag 3121 agcagcgaca cttgcaaccc cctgtcaggc gccttctcag gagtgtccaa cattttcagc 3181 ttctgggggg acagtcgggg ccgccagtac caggagctcc ctcgatgccc cgcccccacc 3241 cccagcctcc tcaacatccc cctctccagc ccgggtcggc ggccccgggg cgacgtggag 3301 agcaggctgg atgccctcca gcgccagctc aacaggctgg agacccggct gagtgcagac 3361 atggccactg tcctgcagct gctacagagg cagatgacgc tggtcccgcc cgcctacagt 3421 gctgtgacca ccccggggcc tggccccact tccacatccc cgctgttgcc cgtcagcccc 3481 ctccccaccc tcaccttgga ctcgctttct caggtttccc agttcatggc gtgtgaggag 3541 ctgcccccgg gggccccaga gcttccccaa gaaggcccca cacgacgcct ctccctaccg 3601 ggccagctgg gggccctcac ctcccagccc ctgcacagac acggctcgga cccgggcagt 3661 tagtggggct gcccagtgtg gacacgtggc tcacccaggg atcaaggcgc tgctgggccg 3721 ctccccttgg aggccctgct caggaggccc tgaccgtgga aggggagagg aactcgaaag 3781 cacagctcct cccccagccc ttgggaccat cttctcctgc agtcccctgg gccccagtga 3841 gaggggcagg ggcagggccg gcagtaggtg gggcctgtgg tccccccact gccctgaggg 3901 cattagctgg tctaactgcc cggaggcacc cggccctggg ccttaggcac ctcaaggact 3961 tttctgctat ttactgctct tattgttaag gataataatt aaggatcata tgaataatta 4021 atgaagatgc tgatgactat gaataataaa taattatcct gaggagaaaa // LOCUS HSU04313 2566 bp mRNA PRI 11-JUN-1994 DEFINITION Human maspin mRNA, complete cds. ACCESSION U04313 NID g453368 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2566) AUTHORS Zou,Z., Anisowicz,A., Neveu,M., Rafidi,K., Sheng,S., Sager,R., Hendrix,M.J., Seftor,E. and Thor,A. TITLE Maspin, a serpin with tumor suppressing activity in human mammary epithelial cells JOURNAL Science 263, 526-529 (1994) MEDLINE 94120413 REFERENCE 2 (bases 1 to 2566) AUTHORS Anisowicz,A. TITLE Direct Submission JOURNAL Submitted (10-DEC-1993) Anthony Anisowicz, Cancer Genetics, Dana Farber Cancer Institute, 44 Binney Street, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..2566 /organism="Homo sapiens" /isolate="76N" /db_xref="taxon:9606" /clone="Z32-1" /clone_lib="76N cDNA library in lambda Zap II" /sex="female" /cell_line="76N" /cell_type="epithelial" /tissue_type="mammary" 5'UTR 1..75 CDS 76..1203 /codon_start=1 /product="maspin" /db_xref="PID:g453369" /translation="MDALQLANSAFAVDLFKQLCEKEPLGNVLFSPICLSTSLSLAQV GAKGDTANEIGQVLHFENVKDIPFGFQTVTSDVNKLSSFYSLKLIKRLYVDKSLNLST EFISSTKRPYAKELETVDFKDKLEETKGQINNSIKDLTDGHFENILADNSVNDQTKIL VVNAAYFVGKWMKKFPESETKECPFRLNKTDTKPVQMMNMEATFCMGNIDSINCKIIE LPFQNKHLSMFILLPKDVEDESTGLEKIEKQLNSESLSQWTNPSTMANAKVKLSIPKF KVEKMIDPKACLENLGLKHIFSEDTSDFSGMSETKGVALSNVIHKVCLEITEDGGDSI EVPGARILQHKDELNADHPFIYIIRHNKTRNIIFFGKFCSP" misc_feature 1093..1098 /note="putative serpin reactive center" 3'UTR 1201..2566 polyA_signal 2545..2550 BASE COUNT 786 a 526 c 504 g 750 t ORIGIN 1 ggcacgagtt gtgctcctcg cttgcctgtt ccttttccac gcattttcca ggataactgt 61 gactccaggc ccgcaatgga tgccctgcaa ctagcaaatt cggcttttgc cgttgatctg 121 ttcaaacaac tatgtgaaaa ggagccactg ggcaatgtcc tcttctctcc aatctgtctc 181 tccacctctc tgtcacttgc tcaagtgggt gctaaaggtg acactgcaaa tgaaattgga 241 caggttcttc attttgaaaa tgtcaaagat ataccctttg gatttcaaac agtaacatcg 301 gatgtaaaca aacttagttc cttttactca ctgaaactaa tcaagcggct ctacgtagac 361 aaatctctga atctttctac agagttcatc agctctacga agagacccta tgcaaaggaa 421 ttggaaactg ttgacttcaa agataaattg gaagaaacga aaggtcagat caacaactca 481 attaaggatc tcacagatgg ccactttgag aacattttag ctgacaacag tgtgaacgac 541 cagaccaaaa tccttgtggt taatgctgcc tactttgttg gcaagtggat gaagaaattt 601 cctgaatcag aaacaaaaga atgtcctttc agactcaaca agacagacac caaaccagtg 661 cagatgatga acatggaggc cacgttctgt atgggaaaca ttgacagtat caattgtaag 721 atcatagagc ttccttttca aaataagcat ctcagcatgt tcatcctact acccaaggat 781 gtggaggatg agtccacagg cttggagaag attgaaaaac aactcaactc agagtcactg 841 tcacagtgga ctaatcccag caccatggcc aatgccaagg tcaaactctc cattccaaaa 901 tttaaggtgg aaaagatgat tgatcccaag gcttgtctgg aaaatctagg gctgaaacat 961 atcttcagtg aagacacatc tgatttctct ggaatgtcag agaccaaggg agtggcccta 1021 tcaaatgtta tccacaaagt gtgcttagaa ataactgaag atggtgggga ttccatagag 1081 gtgccaggag cacggatcct gcagcacaag gatgaattga atgctgacca tccctttatt 1141 tacatcatca ggcacaacaa aactcgaaac atcattttct ttggcaaatt ctgttctcct 1201 taagtggcat agcccatgtt aagtcctccc tgacttttct gtggatgccg atttctgtaa 1261 actctgcatc cagagattca ttttctagat acaataaatt gctaatgttg ctggatcagg 1321 aagccgccag tacttgtcat atgtagcctt cacacagata gacctttttt tttttccaat 1381 tctatctttt gtttcctttt ttcccataag acaatgacat acgcttttaa tgaaaaggaa 1441 tcacgttaga ggaaaaatat ttattcatta tttgtcaaat tgtccggggt agttggcaga 1501 aatacagtct tccacaaaga aaattcctat aaggaagatt tggaagctct tcttcccagc 1561 actatgcttt ccttctttgg gatagagaat gttccagaca ttctcgcttc cctgaaagac 1621 tgaagaaagt gtagtgcatg ggacccacga aactgccctg gctccagtga aacttgggca 1681 catgctcagg ctactatagg tccagaagtc cttatgttaa gccctggcag gcaggtgttt 1741 attaaaattc tgaattttgg ggattttcaa aagataatat tttacataca ctgtatgtta 1801 tagaacttca tggatcagat ctggggcagc aacctataaa tcaacacctt aatatgctgc 1861 aacaaaatgt agaatattca gacaaaatgg atacataaag actaagtagc ccataagggg 1921 tcaaaatttg ctgccaaatg cgtatgccac caacttacaa aaacacttcg ttcgcagagc 1981 ttttcagatt gtggaatgtt ggataaggaa ttatagacct ctagtagctg aaatgcaaga 2041 ccccaagagg aagttcagat cttaatataa attcactttc atttttgata gctgtcccat 2101 ctggtcatgt ggttggcact agactggtgg caggggcttc tagctgactc gcacagggat 2161 tctcacaata gccgatatca gaatttgtgt tgaaggaact tgtctcttca tctaatatga 2221 tagcgggaaa aggagaggaa actactgcct ttagaaaata taagtaaagt gattaaagtg 2281 ctcacgttac cttgacacat agtttttcag tctatgggtt tagttacttt agatggcaag 2341 catgtaactt atattaatag taatttgtaa agttgggtgg ataagctatc cctgttgccg 2401 gttcatggat tacttctcta taaaaaatat atatttacca aaaaattttg tgacattcct 2461 tctcccatct cttccttgac atgcattgta aataggttct tcttgttctg agattcaata 2521 ttgaatttct cctatgctat tgacaataaa atattattga actacc // LOCUS HSU04627 2472 bp mRNA PRI 07-DEC-1994 DEFINITION Human 78 kDa gastrin-binding protein mRNA, complete cds. ACCESSION U04627 NID g595266 KEYWORDS enoyl CoA hydratase; 3-hydroxyacyl CoA dehydrogenase; gastrin-binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2472) AUTHORS Zhang,Q.-X. and Baldwin,G.S. TITLE Structures of the human cDNA and gene encoding the 78 kDa gastrin-binding protein and of a related pseudogene JOURNAL Biochim. Biophys. Acta. 1219, 567-575 (1994) MEDLINE 95002180 REFERENCE 2 (bases 1 to 2472) AUTHORS Baldwin,G.S. TITLE Comparison of sequences of the 78 kDa gastrin-binding protein and some enzymes involved in fatty acid oxidation JOURNAL Comp. Biochem. Physiol. 104 B, 55-61 (1993) REFERENCE 3 (bases 1 to 2472) AUTHORS Baldwin,G.S. TITLE Direct Submission JOURNAL Submitted (21-DEC-1993) Graham S. Baldwin, Ludwig Institute For Cancer Research, Post Office Royal Melbourne Hospital, Melbourne, Victoria, Australia, 3050 FEATURES Location/Qualifiers source 1..2472 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="colon carcinoma LIM 1215" CDS 27..2318 /note="similar to GenBank Accession Number L12581; related to a family of fatty acid oxidation enzymes possessing enoyl CoA hydratase and/or 3-hydroxyacyl CoA dehydrogenase activity [3]" /codon_start=1 /product="78 kDa gastrin-binding protein" /db_xref="PID:g595267" /translation="MVACRAIGILSRFSAFRILRSRGYICRNFTGSSALLTRTHINYG VKGDVAVVRINSPNSKVNTLSKELHSEFSEVMNEIWASDQIRSAVLISSKPGCFIAGA DINMLAACKTLQEVTQLSQEAQRIVEKLEKSTKPIVAAINGSCLGGGLELAISCQYRI ATKDRKTVLGAPEVLLGILPGAGGTQRLPKMVGVPAVFDMMLTGRNIRADSAKKMGLV DQLVEPLGPGLKPPEERTIEYLEEVAITFAKGLADKKISPKRDKGLVEKLTAYAMTIP FVRQQVYKKVEEKVRKQTKGLYPAPLKIIDVVKTGIEQGSDAGYLCESQKFGELVMTK ESKALMGLYHGQVLCKKNKFGAPQKDVKHLAILGAGLMGAGIAQVSVDKGLKTILKDA TLTALDRGQQQVFKGLNDKVKKKALTSFERDSIFSNLTGQLDYQGFEKADMVIEAVFE DLSLKHRVLKEVEAVIPDHCIFASNTSALPISEIAAVSKRPEKVIGMHYFSPVDKMQL LEIITTEKTSKDTSASAVAVGLKQGKVIIVVKDGPGFYTTRCLAPMMSEVIRILQEGV DPKKLDSLTPSFGFPVGAATLVDEVGVDVAKHVAEDLGKVFGERFGGGNPELLTQMVS KGFLGRKSGKGFYIYQEGVKRKDLNSDMDSILASLKLPPKSEVSSDEDIQFRLVTRFV NEAVMCLQEGILATPAEGDIGAVFGLGFPPCLGGPFRFVDLYGAQKIVDRLKKYEAAY GKQFTPCQLLADHANSPNKKFYQ" exon 94..135 /note="97 bp intron" exon 136..206 exon 207..340 exon 341..479 exon 703..825 /note="488 bp intron" exon 826..944 /note="800 bp intron" exon 945..1001 /note="1.5 kb intron" exon 1002..1111 /note=">600 bp intron" exon 1112..1246 /note=">400 bp intron" exon 1247..1418 /note="3.8 kb intron" exon 1419..1505 /note="2.6 kb intron" exon 1506..1646 /note="410 bp intron" exon 1647..1715 /note="830 bp intron" exon 1716..1911 /note="1.1 kb intron" exon 1912..2026 /note="700 bp intron" exon 2027..2172 /note="87 bp intron" BASE COUNT 696 a 540 c 648 g 588 t ORIGIN 1 tccactgctg tcctcttcag ctcaagatgg tggcctgccg ggcgattggc atcctcagcc 61 gcttttctgc cttcaggatc ctccgctccc gaggttatat atgccgcaat tttacagggt 121 cttctgcttt gctgaccaga acccatatta actatggagt caaaggggat gtggcagttg 181 ttcgaattaa ctctcccaat tcaaaggtaa atacactgag taaagagcta cattcagagt 241 tctcagaagt tatgaatgaa atctgggcta gtgatcaaat cagaagtgcc gtccttatct 301 catcaaagcc aggctgcttt attgcaggtg ctgatatcaa catgttagcc gcttgcaaga 361 cccttcaaga agtaacacag ctatcacaag aagcacagag aatagttgag aaacttgaaa 421 agtccacaaa gcctattgtg gctgccatca atggatcctg cctgggagga ggacttgagc 481 ttgccatttc atgccaatac agaatagcaa caaaagatag aaaaacagta ttaggtgccc 541 ctgaagtctt gctggggatc ttaccaggag caggaggcac acaaaggctg cccaaaatgg 601 tgggtgtgcc tgctgttttt gacatgatgc tgactggtag aaacattcgt gcagacagcg 661 caaagaaaat gggactggtt gaccaattgg tggaacccct gggaccagga ctaaaacctc 721 cagaggaacg gacaatagaa tacctagaag aagttgcaat tacttttgcc aaaggactag 781 ctgataagaa gatctctcca aagagagaca agggattggt ggaaaaattg acagcgtatg 841 ccatgactat tccatttgtc aggcaacagg tttacaaaaa agtggaagaa aaagtgcgaa 901 agcagactaa aggcctttat cctgcacctc tgaaaataat tgatgtggta aagactggaa 961 ttgagcaagg gagtgatgcc ggttatctct gtgaatctca gaaatttgga gagcttgtaa 1021 tgaccaaaga atcaaaggcc ttgatgggac tctaccatgg tcaggtcctg tgcaagaaga 1081 ataaatttgg agctccacag aaggatgtta agcatctggc tattcttggt gcagggctga 1141 tgggagcagg catcgcccaa gtctccgtgg ataaggggct aaagactata cttaaagatg 1201 ccaccctcac tgcgctagac cgaggacagc aacaagtgtt caaaggattg aatgacaaag 1261 tgaagaagaa agctctaaca tcatttgaaa gggattccat cttcagcaac ttgactgggc 1321 agcttgatta ccaaggtttt gaaaaggccg acatggtgat tgaagctgtg tttgaggacc 1381 ttagtcttaa gcacagagtg ctaaaggaag tagaagcggt gattccagat cactgtatct 1441 ttgccagtaa cacatctgct ctcccaatca gtgaaatcgc tgctgtcagc aaaagacctg 1501 agaaggtgat tggcatgcac tacttctctc ccgtggacaa gatgcagctg ctggagatta 1561 tcacgaccga gaaaacttcc aaagacacca gtgcttcagc tgtagcagtt ggtctcaagc 1621 aggggaaggt catcattgtg gttaaggatg gacctggctt ctatactacc aggtgtcttg 1681 cgcccatgat gtctgaagtc atccgaatcc tccaggaagg agttgacccg aagaagctgg 1741 attccctgac accaagcttt ggctttcctg tgggtgccgc cacactggtg gatgaagttg 1801 gtgtggatgt agcgaaacat gtggcggaag atctgggcaa agtctttggg gagcggtttg 1861 gaggtggaaa cccagaactg ctgacacaga tggtgtccaa gggcttccta ggtcgtaaat 1921 ctgggaaggg cttttacatc tatcaggagg gtgtgaagag gaaggatttg aattctgaca 1981 tggatagtat tttagcgagt ctgaagctgc ctcctaagtc tgaagtctca tcagacgaag 2041 acatccagtt ccgcctggtg acaagatttg tgaatgaggc agtcatgtgc ctgcaagagg 2101 ggatcttggc cacacctgca gagggagaca tcggagccgt ctttgggctt ggcttcccgc 2161 cttgtctggg agggcctttc cgctttgtgg atctgtatgg cgcccagaag atagtggacc 2221 ggctcaagaa atatgaagct gcctatggaa aacagttcac cccatgccag ctgctagctg 2281 accatgctaa cagccctaac aagaagttct accagtgagc aggcctcatg cctcgctcag 2341 tcagtgcact aaccccagct gccggcagtg ctggttctcc aacagagtgg tgtctagatt 2401 tatcagagta acgagaagac aaactccggc actgggtttg ctccctgatt aaagtgcctt 2461 cagccaagac ca // LOCUS HSU04641 3953 bp mRNA PRI 16-MAY-1996 DEFINITION Human pyruvate carboxylase (PC) mRNA, complete cds. ACCESSION U04641 NID g458235 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3753) AUTHORS Wexler,I.D., Du,Y., Lisgaris,M.V., Mandal,S.K., Freytag,S.O., Yang,B.S., Liu,T.C., Kwon,M., Patel,M.S. and Kerr,D.S. TITLE Primary amino acid sequence and structure of human pyruvate carboxylase JOURNAL Biochim. Biophys. Acta 1227 (1-2), 46-52 (1994) MEDLINE 95002202 REFERENCE 2 (bases 1 to 3953) AUTHORS Wexler,I.D. TITLE Direct Submission JOURNAL Submitted (21-DEC-1993) Isaiah D. Wexler, Pediatrics, Case Western Reserve University, 2047 Abington Road, Cleveland, OH 44106, USA FEATURES Location/Qualifiers source 1..3953 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda GT11 libraries" /tissue_type="liver and kidney" /dev_stage="adult" mRNA <1..>3953 5'UTR 1..38 sig_peptide 39..98 /gene="PC" gene 39..3575 /gene="PC" CDS 39..3575 /gene="PC" /EC_number="6.4.1.1." /codon_start=1 /evidence=experimental /product="pyruvate: carbon-dioxide ligase (ADP-forming)" /db_xref="PID:g458236" /translation="MLKFRTVHGGLRLLGIRRTSTAPAASPNVRRLEYKPIKKVMVAN RGEIAIRVFRACTELGIRTVAIYSEQDTGQMHRQKADEAYLIGRGLAPVQAYLHIPDI IKVAKENNVDAVHPGYGFLSERADFAQACQDAGVRFIGPSPEVVRKMGDKVEARAIAI AAGVPVVPGTDAPITSLHEAHEFSNTYGFPIIFKAAYGGGGRGMRVVHSYEELEENYT RAYSEALAAFGNGALFVEKFIEKPRHIEVQILGDQYGNILHLYERDCSIQRRHQKVVE IAPAAHLDPQLRTRLTSDSVKLAKQVGYENAGTVEFLVDRHGKHYFIEVNSRLQVEHT VTEEITDVDLVHAQIHVAEGRSLPDLGLRQENIRINGCAIQCRVTTEDPARSFQPDTG RIEVFRSGEGMGIRLDNASAFQGAVISPHYDSLLVKVIAHGKDHPTAATKMSRALAEF RVRGVKTNIAFLQNVLNNQQFLAGTVDTQFIDENPELFQLRPAQNRAQKLLHYLGHVM VNGPTTPIPVKASPSPTDPVVPAVPIGPPPAGFRDILLREGPEGFARAVRNHPGLLLM DTTFRDAHQSLLATRVRTHDLKKIAPYVAHNFSKLFSMENWGGATFDVAMRFLYECPW RRLQELRELIPNIPFQMLLRGANAVGYTNYPDNVVFKFCEVAKENGMDVFRVFDSLNY LPNMLLGMEAAGSAGGVVEAAISYTGDVADPSRTKYSLQYYMGLAEELVRAGTHILCI KDMAGLLKPTACTMLVSSLRDRFPDLPLHIHTHDTSGAGVAAMLACAQAGADVVDVAA DSMSGMTSQPSMGALVACTRGTPLDTEVPMERVFDYSEYWEGARGLYAAFDCTATMKS GNSDVYENEIPGGQYTNLHFQAHSMGLGSKFKEVKKAYVEANQMLGDLIKVTPSSKIV GDLAQFMVQNGLSRAEAEAQAEELSFPRSVVEFLQGYIGVPHGGFPEPFRSKVLKDLP RVEGRPGASLPPLDLQALEKELVDRHGEEVTPEDVLSAAMYPDVFAHFKDFTATFGPL DSLNTRLFLQGPKIAEEFEVELERGKTLHIKALAVSDLNRAGQRQVFFELNGQLRSIL VKDTQAMKEMHFHPKALKDVKGQIGAPMPGKVIDIKVVAGAKVAKGQPLCVLSAMKME TVVTSPMEGTVRKVHVTKDMTLEGDDLILEIE" mat_peptide 99..3572 /gene="PC" /EC_number="6.4.1.1." /note="pyruvate carboxylase" /function="carboxylation of pyruvate" /product="pyruvate: carbon-dioxide ligase (ADP-forming)" misc_feature 3462..3473 /gene="PC" /note="encodes biotin binding site" /function="lysine residue binds biotin" /evidence=experimental 3'UTR 3576..3953 polyA_signal 3928..3933 polyA_site 3953 /note="19 A nucleotides" BASE COUNT 808 a 1219 c 1213 g 713 t ORIGIN 1 gggttcttac ccatttaaag tttgaccaaa cactaaggat gctgaagttc cgaacagtcc 61 atgggggcct gaggctcctg ggaatccgcc gaacctccac cgcccccgct gcctccccaa 121 atgtccggcg cctggagtat aagcccatca agaaagtcat ggtggccaac agaggtgaga 181 ttgccatccg tgtgttccgg gcctgcacgg agctgggcat ccgcaccgta gccatctact 241 ctgagcagga cacgggccag atgcaccggc agaaagcaga tgaagcctat ctcatcggcc 301 gcggcctggc ccccgtgcag gcctacctgc acatcccaga catcatcaag gtggccaagg 361 agaacaacgt agatgcagtg caccctggct acgggttcct ctctgagcga gcggacttcg 421 cccaggcctg ccaggatgca ggggtccggt ttattgggcc aagcccagaa gtggtccgca 481 agatgggaga caaggtggag gcccgggcca tcgccattgc tgcgggtgtt cccgttgtcc 541 ctggcacaga tgcccccatc acgtccctgc atgaggccca cgagttctcc aacacctacg 601 gcttccccat catcttcaag gcggcctatg ggggtggagg gcgtggcatg agggtggtgc 661 acagctacga ggagctggag gagaattaca cccgggccta ctcagaggct ctggccgcct 721 ttgggaatgg ggcgctgttt gtggagaagt tcatcgagaa gccacggcac atcgaggtgc 781 agatcttggg ggaccagtat gggaacatcc tgcacctgta cgagcgagac tgctccatcc 841 agcggcggca ccagaaggtg gtcgagattg cccccgccgc ccacctggac ccgcagcttc 901 ggactcggct caccagcgac tctgtgaaac tcgctaaaca ggtgggctac gagaacgcag 961 gcaccgtgga gttcctggtg gacaggcacg gcaagcacta cttcatcgag gtcaactccc 1021 gcctgcaggt ggagcacacg gtcacagagg agatcaccga cgtagacctg gtccatgctc 1081 agatccacgt ggctgagggc aggagcctac ccgacctggg cctgcggcag gagaacatcc 1141 gcatcaacgg gtgtgccatc cagtgccggg tcaccaccga ggaccccgcg cgcagcttcc 1201 agccggacac cggccgcatt gaggtgttcc ggagcggaga gggcatgggc atccgcctgg 1261 ataatgcttc cgccttccaa ggagccgtca tctcgcccca ctacgactcc ctgctggtca 1321 aagtcattgc ccacggcaaa gaccacccca cggccgccac caagatgagc agggcccttg 1381 cggagttccg cgtccgaggt gtgaagacca acatcgcctt cctgcagaat gtgctcaaca 1441 accagcagtt cctggcaggc actgtggaca cccagttcat cgacgagaac ccagagctgt 1501 tccagctgcg gcctgcacag aaccgggccc aaaagctgtt gcactacctc ggccatgtca 1561 tggtaaacgg tccaaccacc ccgattcccg tcaaggccag ccccagcccc acggaccccg 1621 ttgtccctgc agtgcccata ggcccgcccc cggctggttt cagagacatc ctgctgcgag 1681 aggggcctga gggctttgct cgagctgtgc ggaaccaccc ggggctgctg ctgatggaca 1741 cgaccttcag ggacgcccac cagtcactgc tggccactcg tgtgcgcacc cacgatctca 1801 aaaagatcgc cccctatgtt gcccacaact tcagcaagct cttcagcatg gagaactggg 1861 gaggagccac gtttgacgtc gccatgcgct tcctgtatga gtgcccctgg cggcggctgc 1921 aggagctccg ggagctcatc cccaacatcc ctttccagat gctgctgcgg ggggccaatg 1981 ctgtgggcta caccaactac ccagacaacg tggtcttcaa gttctgtgaa gtggccaaag 2041 agaatggcat ggatgtcttc cgtgtgtttg actccctcaa ctacttgccc aacatgctgc 2101 tgggcatgga ggcggcagga agtgccggag gcgtggtgga ggctgccatc tcatacacgg 2161 gcgacgtggc cgaccccagc cgcaccaagt actcactgca gtactacatg ggcttggccg 2221 aagagctggt gcgagctggc acccacatcc tgtgcatcaa ggacatggcc gggctgctga 2281 agcccacggc ctgcaccatg ctggtcagct ccctccggga ccgcttcccc gacctcccac 2341 tgcacatcca cacccacgac acgtcagggg caggcgtggc agccatgctg gcctgtgccc 2401 aggctggagc tgatgtggtg gatgtggcag ctgattccat gtctgggatg acttcacagc 2461 ccagcatggg ggccctggtg gcctgtacca gagggactcc cctggacaca gaggtgccca 2521 tggagcgcgt gtttgactac agtgagtact gggagggggc tcggggactg tacgcggcct 2581 tcgactgcac ggccaccatg aagtctggca actcggacgt gtatgaaaat gagatcccag 2641 ggggccagta caccaacctg cacttccagg cccacagcat ggggcttggc tccaagttca 2701 aggaggtcaa gaaggcctat gtggaggcca accagatgct gggcgatctc atcaaggtga 2761 cgccctcctc caagatcgtg ggggacctgg cccagtttat ggtgcagaat ggattgagcc 2821 gggcagaggc cgaagctcag gcggaagagc tgtcctttcc ccgctccgtg gtggagttcc 2881 tgcagggcta catcggtgtc ccccatgggg ggttccccga accctttcgc tctaaggtac 2941 tgaaggacct gccaagggtg gaggggcggc ctggagcctc cctccctccc ctggatctgc 3001 aggcactgga gaaggagctg gtagaccggc atggggagga ggtgacgccg gaagatgtgc 3061 tctcagcagc tatgtacccc gatgtgtttg cccacttcaa ggacttcact gccacctttg 3121 gccccctgga tagcctgaat actcgcctct tcctgcaggg acccaagatc gcagaggagt 3181 ttgaggtgga gctggagcgg ggcaagacgc tgcacatcaa agccctggcc gtgagcgacc 3241 tgaaccgggc cggccagagg caggtcttct ttgagctcaa tgggcagctg cggtccatct 3301 tggtcaagga cacccaggcc atgaaggaga tgcacttcca ccccaaggcc ctaaaggacg 3361 tgaagggcca gatcggggcg cccatgcctg ggaaggtgat agacatcaaa gtggtggcag 3421 gggccaaggt ggccaagggc cagcccctgt gtgtgctcag tgccatgaag atggagactg 3481 tggtgacctc acccatggag ggtactgtcc gcaaggttca tgtgaccaag gacatgacac 3541 tggaaggtga cgacctcatc ctggagatcg agtgatcttg ccccagaccg gcagcctggc 3601 catccccaag ccttcaacag aagctgtgct gccacggcag gcccaggcca gccagtgccc 3661 gaggccagga aggccgggcc gtggaggtcc tgtgtccaca gctggacagg agagacaccg 3721 cctgcggtgg ttcattcctt tcagccatcg tcctttcctc cggcggacag ctgcttacat 3781 gttcatctct tgccaaataa gggtcccctc ctcactggag actacaagtg gtgggtcagg 3841 tggtcctagg acccagggga ggtttagggg tcctatctcc tgggggaagg ggagatcaag 3901 atgtcccagg tcctgggaag tttactcaat aaagctggct ttcccctgcc ctc // LOCUS HSU04735 2227 bp mRNA PRI 25-MAR-1994 DEFINITION Human microsomal stress 70 protein ATPase core (stch) mRNA, complete cds. ACCESSION U04735 NID g460147 KEYWORDS cDNA cloning; microsome; protein chaperone. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2227) AUTHORS Otterson,G.A., Flynn,G.C., Kratzke,R.A., Coxon,A., Johnston,P.G. and Kaye,F.J. TITLE Stch encodes the 'ATPase core' of a microsomal stress70 protein JOURNAL EMBO J. 13, 1216-1225 (1994) MEDLINE 94178264 REFERENCE 2 (bases 1 to 2227) AUTHORS Kaye,F.J. TITLE Direct Submission JOURNAL Submitted (27-DEC-1993) Frederic J. Kaye, NCI-Navy Oncology Branch, National Cancer Institute, Bldg 8, Rm 5101, Naval Hospital, Bethesda, MD, 20889, USA FEATURES Location/Qualifiers source 1..2227 /organism="Homo sapiens" /db_xref="taxon:9606" gene 37..1452 /gene="stch" CDS 37..1452 /gene="stch" /codon_start=1 /product="microsomal stress 70 protein ATPase core" /db_xref="PID:g460148" /translation="MAREMTILGSAVLTLLLAGYLAQQYLPLPTPKVIGIDLGTTYCS VGVFFPGTGKVKVIPDENGHISIPSMVSFTDNDVYVGYESVELADSNPQNTIYDAKRF IGKIFTAEELEAEIGRYPFKVLNKNGMVEFSVTSNETITVSPEYVGSRLLLKLKEMAE AYLGMPVANAVISVPAEFDLKQRNSTIEAANLAGLKILRVINEPTAAAMAYGLHKADV FHVLVIDLGGGTLDVSLLNKQGGMFLTRAMSGNNKLGGQDFNQRLLQYLYKQIYQTYG FVPSRKEEIHRLRQAVEMVKLNLTLHQSAQLSVLLTVEEQDRKEPHSSDTELPKDKLS SADDHRVNSGFGRGLSDKKSGESQVLFETEISRKLFDTLNEDLFQKILVPIQQVLKEG HLEKTEIDEVVLVGGSTRIPRIRQVIQEFFGKDPNTSVDPDLAVVTGVAIQAGIDGGS WPLQVSALEIPNKHLQKTNFN" BASE COUNT 691 a 398 c 468 g 670 t ORIGIN 1 ggtacagtca tcacaagcct gttcggcggg actgtgatgg ccagagagat gacgatctta 61 ggatcggctg ttttgactct cctgttggcc ggctatttgg cacaacagta tttaccattg 121 cctactccta aagtgattgg tattgatctt ggcaccacct attgttctgt tggggtgttt 181 tttcctggca caggaaaagt aaaggtgatt ccagatgaaa atgggcatat cagcataccc 241 agcatggtgt cttttactga caatgatgta tatgtgggat atgaaagcgt agagctggca 301 gattcaaatc ctcaaaacac aatatatgat gccaaaagat tcataggcaa gatttttacc 361 gcagaagagt tggaggctga aattggcaga tacccattta aggttttaaa caaaaatgga 421 atggttgagt tttctgtgac aagtaatgag accatcacag tgtccccaga atatgttggc 481 tctcgactat tgttgaagtt aaaggaaatg gcagaggcat atcttggaat gccagttgcc 541 aatgctgtca tttctgtacc agcagaattt gatctaaaac agagaaattc aacaattgaa 601 gctgctaacc ttgcaggact gaagattttg agggtaataa atgaacccac agcagcagct 661 atggcctatg gtctccacaa ggctgacgtc ttccacgtct tggtgataga cttgggcgga 721 ggaactctag atgtgtcttt actgaataaa caaggaggga tgtttctaac ccgagcaatg 781 tctggaaaca ataaacttgg aggacaggac ttcaatcaga gattgcttca gtacttatat 841 aaacagatct atcaaacata tggcttcgtg ccctctagga aagaggaaat ccacagattg 901 agacaagctg tggaaatggt caaattaaat ctgactcttc atcaatctgc tcagttgtca 961 gtattactaa cggtggagga gcaggacagg aaggaacctc acagtagtga cactgaactg 1021 ccaaaagaca aactttcctc agcagatgac catcgcgtga acagtgggtt tggacgtggc 1081 ctttctgata agaaaagtgg agaaagtcag gttttatttg aaacagaaat atcacggaaa 1141 ctctttgata cccttaatga agacctcttt cagaaaatac tggtacccat tcagcaagta 1201 ttgaaagaag gccacctgga aaagactgag attgatgagg tggttttagt tgggggctcc 1261 actcgtattc ctcggatccg tcaagtcatt caagagttct ttggaaaaga tcccaacaca 1321 tctgtagacc ctgacctagc agtagtaacg ggagtggcta tccaagcagg gattgatgga 1381 ggctcttggc ctctccaagt cagtgcttta gaaattccca ataagcattt acaaaaaacc 1441 aacttcaact gaattctgca gaaataatgg ttatttgtga acttgtctga tgatctcttc 1501 ccatttatca gattaccttt tccacaaaag aaagtctcta aaatatcaca gatttaccta 1561 gagggcaaca tttagataca ggaaaatttt acatagtgtt ttgtcttagg attagacgtg 1621 accagattga tcctgtttga ttttggagag atcctattct aacaaatact ctaaaatgat 1681 aaaattgagg tacaactctc ttaaaagagt atggataact atattttctg gattctggag 1741 gttgataacc atatgcactt aacatatatt ctataaacat taagtagtgc cagttatgag 1801 acttcccagt tcttactaaa ttgtattagc aggagctggt aattacttgt attatcacat 1861 gtaactaata atttgaagta tacttgaagg accgtgttga tgtcaggtat ttacagtggt 1921 tggaagatag cagtattatt agataagctg catacgtaat attcagtaac tgccatatta 1981 tataacaaat ttacattcac aaattcagta tcctgttaag tgtcatattc ttgtaatctg 2041 cattctccag gagttttatg tgtttaatag atgaatttat tttatttcta aaggtattca 2101 aatgtttcag caccatataa tagaaatacc caattatatt ctagttcctt tatgtcctgt 2161 acatcattct ctgcttggat ttccattatt ctgtttggtt agagaataaa attggtaatt 2221 gcatttg // LOCUS HSU04810 2561 bp mRNA PRI 17-OCT-1995 DEFINITION Human tastin mRNA, complete cds. ACCESSION U04810 NID g905355 KEYWORDS trophoblast; human embryo; blastocyst; cell adhesion molecule; endometrium; teratocarcinoma; implantation; cytoplasmic protein; cytoskeleton. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2561) AUTHORS Fukuda,M.N., Sato,T., Nakayama,J., Klier,G., Mikami,M., Aoki,D. and Nozawa,S. TITLE Trophinin and tastin, a novel cell adhesion molecule complex with potential involvement in embryo implantation JOURNAL Genes Dev. 9 (10), 1199-1210 (1995) MEDLINE 95278733 REFERENCE 2 (bases 1 to 2561) AUTHORS Sato,T. TITLE Direct Submission JOURNAL Submitted (30-DEC-1993) Takaaki Sato, Oncology and Tumor Suppressor Gene, La Jolla Cancer Research Foundation, 10901 N. Torrey Pines Rd., La Jolla, CA 92037, USA FEATURES Location/Qualifiers source 1..2561 /organism="Homo sapiens" /note="human" /db_xref="taxon:9606" /clone="HTH27" /clone_lib="trophoblastic teratocarcinoma cell line HT-H" CDS 111..2447 /note="cell adhesion molecule" /codon_start=1 /product="tastin" /db_xref="PID:g905356" /translation="MTTRQATKDPLLRGVSPTPSKIPVRSQKRTPFPTVTSCAVDQEN QDPRRWVQKPPLNIQRPLVDSAGPRPKARHQAETSQRLVGISQPRNPLEELRPSPRGQ NVGPGPPAQTEAPGTIEFVADPAALATILSGEGVKSCHLGRQPSLAKRVLVRGSQGGT TQRVQGVRASAYLAPRTPTHRLDPARASCFSRLEGPGPRGRTLCPQRLQALISPSGPS FHPSTHPSFQELRRETAGSSRTSVSQASGLLLETPVQPAFSLPKGEREVVTHSDEGGV ASLGLAQRVPLRENREMSHTRDSHDSHLMPSPAPVAQPLPGHVVPCPSPFGRAQRVPS PGPPTLTSYSVLRRLTVQPKTRFTPMPSTPRVQQAQWLRGVSPQSCSEDPALPWEQVA VRLFDQESCIRSLEGSGKPPVATPSGPHSNRTPSLQEVKIQRIGILQQLLRQEVEGLV GGQCVPLNGGSSLDMVELQPLLTEISRTLNATEHNSGTSHLPGLLKHSGLPKPCLPEE CGEPQPCPPAEPGPPEAFCRSEPEIPEPSLQEQLEVPEPYPPAEPRPLESCCRSEPEI PESSRQEQLEVPEPCPPAEPRPLESYCRIEPEIPESSRQEQLEVPEPCPPAEPGPLQP STQGQSGPPGPCPRVELGASEPCTLEHRSLESSLPPCCSQWAPATTSLIFSSQHPLCA SPPICSLQSLRPPAGQAGLSNLAPRTLALRESLKSCLTAIHCFHEARLDDECAFYTSR ASPSGPTRVCTNPVATLLEWQDALCFIPVGSAAPQGSP" polyA_signal 2545..2550 polyA_site 2561 /note="9 A nucleotides" BASE COUNT 535 a 848 c 672 g 506 t ORIGIN 1 cgccaggaac agcttgaggt acctgagccc tgccctccag cagcacccga gagggtcagg 61 agaaaagcgg aggaagctgg gtaggccctg aggggcctcg gtaagccatc atgaccaccc 121 ggcaagccac gaaggatccc ctcctccggg gtgtatctcc tacccctagc aagattccgg 181 tacgctctca gaaacgcacg cctttcccca ctgttacatc gtgcgccgtg gaccaggaga 241 accaagatcc aaggagatgg gtgcagaaac caccgctcaa tattcaacgc cccctcgttg 301 attcagcagg ccccaggccg aaagccaggc accaggcaga gacatcacaa agattggtgg 361 ggatcagtca gcctcggaac cccttggaag agctcaggcc tagccctagg ggtcaaaatg 421 tggggcctgg gccccctgcc cagacagagg ctccagggac catagagttt gtggctgacc 481 ctgcagccct ggccaccatc ctgtcaggtg agggtgtgaa gagctgtcac ctggggcgcc 541 agcctagtct ggctaaaaga gtactggttc gaggaagtca gggaggcacc acccagaggg 601 tccagggtgt tcgggcctct gcatatttgg cccccagaac ccccacccac cgactggacc 661 ctgccagggc ttcctgcttc tctaggctgg agggaccagg acctcgaggc cggacattgt 721 gcccccagag gctacaggct ctgatttcac cttcaggacc ttcctttcac ccttccactc 781 accccagttt ccaggagcta agaagggaga cagctggcag cagccggact tcagtgagcc 841 aggcctcagg attgctcctg gagaccccag tccagcctgc tttctctctt cctaaaggag 901 aacgcgaggt tgtcactcac tcagatgaag gaggtgtggc ctctcttggt ctggcccagc 961 gagtaccatt aagagaaaac cgagaaatgt cacataccag ggacagccat gactcccacc 1021 tgatgccctc ccctgcccct gtggcccagc ccttgcctgg ccatgtggtg ccatgtccat 1081 caccctttgg acgggctcag cgtgtaccct ccccaggccc tccaactctg acctcatatt 1141 cagtgttgcg gcgtctcacc gttcaaccta aaacccggtt cacacccatg ccatcaaccc 1201 ccagagttca gcaggcccag tggctgcgtg gtgtctcccc tcagtcctgc tctgaagatc 1261 ctgccctgcc ctgggagcag gttgccgtcc ggttgtttga ccaggagagt tgtataaggt 1321 cactggaggg ttctgggaaa ccaccggtgg ccactccttc tggaccccac tctaacagaa 1381 cccccagcct ccaggaggtg aagattcaac gcatcggtat cctgcaacag ctgttgagac 1441 aggaagtaga ggggctggta gggggccagt gtgtccctct taatggaggc tcttctctgg 1501 atatggttga acttcagccc ctgctgactg agatttctag aactctgaat gccacagagc 1561 ataactctgg gacttcccac cttcctggac tgttaaaaca ctcagggctg ccaaagccct 1621 gtcttccaga ggagtgcggg gaaccacagc cctgccctcc ggcagagcct gggcccccag 1681 aggccttctg taggagtgag cctgagatac cagagccctc cctccaggaa cagcttgaag 1741 taccagagcc ctaccctcca gcagaaccca ggcccctaga gtcctgctgt aggagtgagc 1801 ctgagatacc ggagtcctct cgccaggaac agcttgaggt acctgagccc tgccctccag 1861 cagaacccag gcccctagag tcctactgta ggattgagcc tgagataccg gagtcctctc 1921 gccaggaaca gcttgaggta cctgagccct gccctccagc agaacccggg ccccttcagc 1981 ccagcaccca ggggcagtct ggacccccag ggccctgccc tagggtagag ctgggggcat 2041 cagagccctg caccctggaa catagaagtc tagagtccag tctaccaccc tgctgcagtc 2101 agtgggctcc agcaaccacc agcctgatct tctcttccca acacccgctt tgtgccagcc 2161 cccctatctg ctcactccag tctttgagac ccccagcagg ccaggcaggc ctcagcaatc 2221 tggcccctcg aaccctagcc ctgagggaga gcctcaaatc gtgtttaacc gccatccact 2281 gcttccacga ggctcgtctg gacgatgagt gtgcctttta caccagccga gcctctccct 2341 caggccccac ccgggtctgc accaaccctg tggctacatt actcgaatgg caggatgccc 2401 tgtgtttcat tccagttggt tctgctgccc cccagggctc tccatgatga gacaaccact 2461 cctgccctgc cgtacttctt ccttttagcc cttatttatt gtcggtctgc ccatgggact 2521 gggagccgcc cacttttgtc ctcaataaag tttctaaagt a // LOCUS HSU04811 2490 bp mRNA PRI 17-OCT-1995 DEFINITION Human trophinin mRNA, complete cds. ACCESSION U04811 NID g905357 KEYWORDS trophoblast; human embryo; blastocyst; cell adhesion molecule; endometrium; teratocarcinoma; implantation. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2490) AUTHORS Fukuda,M.N., Sato,T., Nakayama,J., Klier,G., Mikami,M., Aoki,D. and Nozawa,S. TITLE Trophinin and tastin, a novel cell adhesion molecule complex with potential involvement in embryo implantation JOURNAL Genes Dev. 9 (10), 1199-1210 (1995) MEDLINE 95278733 REFERENCE 2 (bases 1 to 2490) AUTHORS Sato,T. TITLE Direct Submission JOURNAL Submitted (30-DEC-1993) Takaaki Sato, Oncology and Tumor Suppressor Gene, La Jolla Cancer Research Foundation, 10901 N. Torrey Pines Rd., La Jolla, CA 92037, USA FEATURES Location/Qualifiers source 1..2490 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HTH55" /clone_lib="trophoblastic teratocarcinoma cell line HT-H" CDS 28..2277 /note="cell adhesion molecule" /codon_start=1 /product="trophinin" /db_xref="PID:g836820" /translation="MDIDCLTREELGDDSQAWSRFSFEIEARAQENADASTNVNFSRG ASTRAGFSDRASISFNGAPSSSGGFSGGPGITFGVAPSTSASFSNTASISFGGTLSTS SSFSSAASISFGCAHSTSTSFSSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFS GGASSGFGGTLSTTAGFSGVLSTSTSFGSAPTTSTVFSSALSTSTGFGGILSTSVCFG GSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTLSTSIC FDGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTS VSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVS TSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAA PSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFCDG PSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAG FGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTS AGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSI VGFSGGPSTGVGFCSGPSTSGFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFGSGAAS LGACGFSYG" polyA_signal 2297..2302 polyA_site 2490 /note="11 A nucleotides" BASE COUNT 475 a 645 c 716 g 654 t ORIGIN 1 gtggctgggc cctggaattg ggatgacatg gatatcgact gcctaacaag ggaagagtta 61 ggcgatgatt ctcaggcctg gagcagattt tcatttgaaa ttgaggccag agcccaagaa 121 aatgcagatg ccagcaccaa cgtcaacttc agcagaggag ctagtaccag ggctggcttc 181 agcgatcgtg ctagtattag cttcaatggt gcacccagct ccagtggtgg cttcagtggt 241 ggacctggca ttacctttgg tgttgcaccc agcaccagtg ccagcttcag caatacagcc 301 agcattagct ttggtggtac actgagcact agctccagct tcagcagcgc agccagcatt 361 agctttggtt gtgcacacag caccagcact agtttcagca gtgaagccag cattagcttt 421 ggtggcatgc cttgtaccag tgccagcttt agtggtggag tcagctctag ttttagtggc 481 ccactcagca ccagtgccac tttcagtggt ggagccagct ctggctttgg aggcacactc 541 agcaccacgg ctggctttag tggtgtactc agcactagca ccagctttgg cagtgcaccc 601 acaacgagca cagtcttcag tagtgcgctt agcaccagca ctggctttgg aggcatactc 661 agcaccagtg tctgttttgg tggctctccc agctccagtg gtagctttgg tggtacactc 721 agtaccagta tctgcttcgg tggctctccc tgcaccagca ctggctttgg aggcacactt 781 agcaccagtg tctcctttgg tggctcttcc agcaccagtg ccaattttgg tggtacacta 841 agtaccagca tctgctttga tggctctccc agcactggtg ctggctttgg tggtgctctc 901 aacaccagtg ccagctttgg cagtgtgctc aacaccagta ctggttttgg tggtgctatg 961 agcaccagtg ctgactttgg cggtacacta agcaccagtg tctgctttgg tggctctcct 1021 ggcaccagtg tcagctttgg cagtgcactc aacaccaatg ctggttatgg tggtgctgtc 1081 agcaccaaca ctgactttgg tggtacacta agcaccagcg tctgttttgg tggctctccc 1141 agcaccagtg ctggctttgg tggtgcactc aacaccaatg ccagctttgg ctgtgccgtc 1201 agcaccagtg ccagcttcag tggtgctgtc agcaccagtg cttgcttcag tggtgcacca 1261 atcaccaacc ctggctttgg cggtgcattt agcaccagtg ctggcttcgg tggtgcactt 1321 agtaccgctg ctgacttcgg tggtactccc agcaacagca ttggctttgg tgctgctccc 1381 agcaccagtg tcagctttgg tggtgctcat ggcaccagcc tctgttttgg tggagctccc 1441 agcaccagcc tctgctttgg cagtgcatct aatactaacc tatgctttgg tggccctcct 1501 agcaccagtg cctgctttag tggtgctacc agccctagtt tttgtgatgg acccagcacc 1561 agtaccggtt tcagctttgg caatgggtta agcaccaatg ctggatttgg tggtggactg 1621 aacaccagtg ctggctttgg tggtggccta ggcaccagtg ctggcttcag tggtggccta 1681 agcacaagtt ctggctttga tggtgggcta ggtaccagcg ctggcttcgg tggaggacca 1741 ggcaccagca ctggttttgg tggtggactg ggcaccagtg ctggcttcag tggcggactg 1801 ggcaccagtg ctggctttgg tggtggactg gtcactagtg atggctttgg tggtggactg 1861 ggcaccaatg ctagtttcgg cagcacactt ggcaccagtg ctggctttag tggtggcctc 1921 agcaccagcg atggctttgg cagtaggcct aatgccagct tcgacagagg actgagtacc 1981 atcattggct ttggcagtgg ttccaacacc agcactggct ttactggcga acccagcacc 2041 agcacgggct tcagtagtgg acccagttct attgttggct tcagcggtgg accaagcact 2101 ggtgttggct tctgcagtgg accaagcacc agtggcttca gcggtggacc gagcacagga 2161 gctggcttcg gcggtggacc aaacactggt gctggctttg gtggtggacc gagcaccagt 2221 gctggctttg gcagtggagc cgccagtctt ggtgcctgtg gcttctcgta tggctagtga 2281 ggtttcagat accgctaata aattgcagta gtccttccca tggagccaaa gtaccttgga 2341 tctttgtcca cacagcagtc aaggcagtta tggcccatca gctgagggtg tcatgtgatg 2401 gaaaaatctg tttgctgttc ctgctttatt gtttgctttc tgtgtgctgt catattttgg 2461 tatcagagtt acattaaatt tgcaaaatga // LOCUS HSU04815 1805 bp mRNA PRI 08-JUL-1994 DEFINITION Human protein kinase PITSLRE alpha 1 mRNA, complete cds. ACCESSION U04815 NID g507157 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1805) AUTHORS Xiang,J., Lahti,J.M., Grenet,J.A., Easton,J.B. and Kidd,V.J. TITLE Molecular cloning and expression of alternatively spliced PITSLRE protein kinase isoforms JOURNAL J. Biol. Chem. 269, 15786-15794 (1994) MEDLINE 94253170 REFERENCE 2 (bases 1 to 1805) AUTHORS Kidd,V.J. TITLE Direct Submission JOURNAL Submitted (03-JAN-1994) Vincent J. Kidd, St. Jude Children's Research Hospital, Tumor Cell Biology, 332 N. Lauderdale St., Memphis, TN 38101, USA FEATURES Location/Qualifiers source 1..1805 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="HeLa cDNA library" /map="1p36-2" /cell_line="HeLa S3" /cell_type="epitheloid" /tissue_type="cervix" /dev_stage="adult" mRNA 1..1805 CDS 394..1779 /codon_start=1 /product="PITSLRE alpha 1" /db_xref="PID:g507158" /translation="METGSNSEEASEQSAEEVSEEEMSEDEERENENHLLVVPESRFD RDSGESEEAEEEVGEGTPQSSALTEGDYVPDSPALSPIELKQELPKYLPALQGCRSVE EFQCLNRIEEGTYGVVYRAKDKKTDEIVALKRLKMEKEKEGFPITSLREINTILKAQH PNIVTVREIVVGSNMDKIYIVMNYVEHDLKSLMETMKQPFLPGEVKTLMIQLLRGVKH LHDNWILHRDLKTSNLLLSHAGILKVGDFGLAREYGSPLKAYTPVVVTLWYRAPELLL GAKEYSTAVDMWSVGCIFGELLTQKPLFPGKSEIDQINKVFKDLGTPSEKIWPGYSEL PAVKKMTFSRHPYNNLRKRFGALLSDQGFDLMNKFLTYFPGRRISAEDGLKHEYFRET PLPIDPSMFPTWPAKSEQQRVKRGTSPRPPEGGLGYSQLGDDDLKETGFHLTTTNQGA SAAGPGFSLKF" BASE COUNT 426 a 526 c 508 g 345 t ORIGIN 1 tattgtacaa ttacccacca ctggatttga ctcagagagg acccccagag ggtgtctcca 61 tcttccctat ttattttcag cccttgaggg cttcattgta gatcaaagcc aaggccccca 121 ggaaggtgac atactcctgg aagttcacct cctggtcctt gttccggtcc aagtcttcca 181 tcagccttgc aatttcagca tcctgcagct tctaatgtgt tagaatgtga aatccatact 241 cagtggtgat gacaaccctg gattcttccc cttccccctc ccaggcaatc ctctctgcaa 301 gtggctctgt gctccctcat caccaaggac ccatgtcact ttggcattgc ttctcctcag 361 ctacttctca gttactggtc ctcatttgga gagatggaga ccggcagcaa ctctgaggag 421 gcatcagagc agtctgccga agaagtaagt gaggaagaaa tgagtgaaga tgaagaacga 481 gaaaatgaaa accacctctt ggttgttcca gagtcacggt tcgaccgaga ttccggggag 541 agtgaagaag cagaggaaga agtgggtgag ggaacgccgc agagcagcgc cctgacagag 601 ggcgactatg tgcccgactc ccctgccctg tcgcccatcg agctcaagca ggagctgccc 661 aagtacctgc cggccctgca gggctgccgg agcgtcgagg agttccagtg cctgaacagg 721 atcgaggagg gcacctatgg agtggtctac agagcaaaag acaagaaaac agatgaaatt 781 gtggctctaa agcggctgaa gatggagaag gagaaggagg gcttcccgat cacgtcgctg 841 agggagatca acaccatcct caaggcccag catcccaaca tcgtcaccgt tagagagatt 901 gtggtgggca gcaacatgga caagatctac atcgtgatga actatgtgga gcacgacctc 961 aagagcctga tggagaccat gaaacagccc ttcctgccag gggaggtgaa gaccctgatg 1021 atccagctgc tgcgtggggt gaaacacctg cacgacaact ggatcctgca ccgtgacctc 1081 aagacgtcca acctgctgct gagccacgcc ggcatcctca aggtgggtga cttcgggctg 1141 gcgcgggagt acggatcccc tctgaaggcc tacaccccgg tcgtggtgac cctgtggtac 1201 cgcgccccag agctgctgct tggtgccaag gaatactcca cggccgtgga catgtggtca 1261 gtgggttgca tcttcgggga gctgctgact cagaagcctc tgttccccgg gaagtcagaa 1321 atcgatcaga tcaacaaggt gttcaaggat ctggggaccc ctagtgagaa aatctggccc 1381 ggctacagcg agctcccagc agtcaagaag atgaccttca gcagacaccc ctacaacaac 1441 ctccgcaagc gcttcggggc tctgctctca gaccagggct tcgacctcat gaacaagttc 1501 ctgacctact tccccgggag gaggatcagc gctgaggacg gcctcaagca tgagtatttc 1561 cgcgagaccc ccctccccat cgacccctcc atgttcccca cgtggcccgc caagagcgag 1621 cagcagcgtg tgaagcgggg caccagcccg aggccccctg agggaggcct gggctacagc 1681 cagctgggtg acgacgacct gaaggagacg ggcttccacc ttaccaccac gaaccagggg 1741 gcctctgccg cgggccccgg cttcagcctc aagttctgaa ggtcagagtg gaccccgtca 1801 tgggg // LOCUS HSU04840 3671 bp mRNA PRI 18-JAN-1994 DEFINITION Human onconeural ventral antigen-1 (Nova-1) mRNA, complete cds. ACCESSION U04840 S66174 NID g440877 KEYWORDS Brain; Motor Neuron; Paraneoplastic Antigen; RNA-Binding Protein; Breast Cancer. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3671) AUTHORS Buckanovich,R.J., Posner,J.B. and Darnell,R.B. TITLE Nova, the Paraneoplastic Ri Antigen, is Homologous to an RNA-Binding Protein and is Specifically Expressed in the Developing Motor System JOURNAL Neuron 11, 657-672 (1993) MEDLINE 94000830 REFERENCE 2 (bases 1 to 3671) AUTHORS Darnell,R.B. TITLE Direct Submission JOURNAL Submitted (05-JAN-1994) Robert B. Darnell, Molecular Neuro-Oncology, The Rockefeller University, 1230 York Avenue, New York, NY 10021-6399, USA COMMENT Human brain protein specifically expressed in the developing ventral brainstem and spinal cord recognized by sera of patients with paraneoplastic opsoclonus and breast cancer. FEATURES Location/Qualifiers source 1..3671 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="nova-1" /clone_lib="lambda ZAP; Stratagene" /tissue_type="cerebellum and hippocampus" /dev_stage="adult" gene 61..1593 /gene="Nova-1" CDS 61..1593 /gene="Nova-1" /note="paraneoplastic Ri antigen" /codon_start=1 /product="onconeural ventral antigen-1" /db_xref="PID:g440878" /translation="MMAAAPIQQNGTHTGVPIDLDPPDSRKRPLEAPPEAGSTKRTNT GEDGQYFLKVLIPSYAAGSIIGKGGQTIVQLQKETGATIKLSKLSKSKDFYPGTTERV CLIQGTVEALNAVHGFIAEKIREMPQNVAKTEPVSILQPQTTVNPDRIKQTLPSSPTT TKSSPSDPMTTSRANQVKIIVPNSTAGLIIGKGGATVKAVMEQSGAWVQLSQKPDGIN LQERVVTVSGEPEQNRKAVELIIQKIQEDPQSGSCLNISYANVTGPVANSNPTGSPYA NTAEVLPTAAAAAGLLGHANLAGVAAFPAVLSGFTGNDLVAITSALNTLASYGYNLNT LGLGLSQAAATGALAAAAASANPAAAAANLLATYASEASASGSTAGGTAGTFALGSLA AATAATNGYFGAASPLAASAILGTEKSTDGSKDVVEIAVPENLVGAILGKGGKTLVEY QELTGARIQISKKGEFVPGTRNRKVTITGTPAATQAAQYLITQRITYEQGVRAANPQK VG" repeat_unit 214..321 /gene="Nova-1" /note="RiTE1; KH domain; putative RNA binding domain" misc_feature 322..330 /gene="Nova-1" /note="alternative splice" exon 517..588 /gene="Nova-1" /note="alternative exon" repeat_unit 589..696 /gene="Nova-1" /note="RiTE2; KH domain; putative RNA binding domain" repeat_unit 1339..1446 /gene="Nova-1" /note="RiTE3; KH domain; putative RNA binding domain" BASE COUNT 1068 a 778 c 763 g 1062 t ORIGIN 1 gaattccgac aaaacaaaag ggagaacctt ctcccggtag cagcggcagg aactgcaaac 61 atgatggcgg cagctcccat ccagcagaac gggacccaca ctggggttcc catagacctg 121 gacccgccgg actcgcggaa aaggccgctg gaagcccccc ctgaagccgg cagcaccaag 181 aggaccaata cgggcgaaga cggccagtat tttctaaagg ttctcatacc tagttatgct 241 gctggatcta taattgggaa gggaggacag acaattgttc agttgcaaaa agaaactgga 301 gccaccatca agctgtctaa gctgtctaag tccaaagatt tttacccagg tactactgag 361 cgagtgtgct tgatccaggg aacggttgaa gcactgaatg cagttcatgg attcattgca 421 gaaaaaattc gagaaatgcc ccaaaatgtg gccaagacag aaccagtcag cattctacaa 481 ccccagacca ccgttaatcc agatcgcatc aaacaaacat tgccatcttc cccaactacc 541 accaagtcct ctccatctga tcccatgacc acctccagag ctaatcaggt aaagattata 601 gttcccaaca gcacagcagg tctgataata gggaagggag gtgctactgt gaaggctgta 661 atggagcagt caggggcttg ggtgcagctt tcccagaaac ctgatgggat caacttgcaa 721 gagagggttg tcactgtgag tggagaacct gaacaaaacc gaaaagctgt tgaacttatc 781 atccagaaga tacaagagga tccacaaagt ggcagctgtc tcaatatcag ttatgccaat 841 gtgacaggtc cagtggcaaa ttccaatcca accggatctc cttatgcaaa cactgctgaa 901 gtgttaccaa ctgctgcagc agctgcaggg ctattaggac atgctaacct tgctggcgtt 961 gcagcctttc cagcagtttt atctggcttc acaggcaatg acctggtggc catcacctct 1021 gcacttaata cattagccag ctatggatat aatctcaaca ctttaggttt aggtctcagt 1081 caagcagcag caacaggggc tttggctgca gcagctgcca gtgccaaccc agcagcagca 1141 gcagccaatt tattggccac ctatgccagt gaagcctcag ccagtggcag cacagctggt 1201 ggtacggcgg ggacatttgc attaggtagc ctggctgctg ctactgctgc aaccaatgga 1261 tattttggag ctgcttctcc cctagctgcc agtgccattc taggaacaga aaagtccaca 1321 gatggatcca aggatgtagt tgaaatagca gtgccagaaa acttagttgg tgcaatactt 1381 ggcaaaggag ggaaaacatt agtggaatac caggagttga ctggtgcaag gatacagatc 1441 tccaaaaaag gagaattcgt acctggcaca aggaatcgga aggtaaccat tactggaaca 1501 ccagctgcaa cacaggctgc tcaatattta attacacaaa ggatcacata tgagcaagga 1561 gttcgggctg ccaatcctca gaaagtgggt tgagtgcccc agttacacat cagattgttt 1621 taacccctcc tttaccccat tttcaagaag gatgtactgt actttgcaga agtgaagttt 1681 ttctgttatt aatatataat tatgcaaatg aatgcgacta tgttgacaat gtgtatatgt 1741 aaataatatg tgttttacca gatgtttcat agaaagaatt ttttcttgat ctgttttgtt 1801 ctctatactt tgcttgtgta tatttgtcag aggtgtttct agtgtaagat ttaagcctgc 1861 cattttacca gcattattgt agtttaatga ttgaatgtag acagggatat gcgtatagtt 1921 ttcagtatta gttctagata acactaaatt aactactgtt aggttgagta tggtggggtc 1981 agtgacctaa aatggagtga ggccaaagca ctgtcctgta agtcttactt cctgcttagg 2041 gcacagtgaa gtaggaaaca atattttgaa aataagtttt aaatttaaaa tgatcaaaaa 2101 gcaatatagt tgcataaaag cactgtaaaa tatttaaaag gttaaaactg tggaaaatta 2161 tattggtaag tttacagatc aataaaagca cctgttctcc atctgaacta gacaatggaa 2221 ataatgctgc atgctgccat ggcccattct tcatcatttg taagttcaac aaaagttctc 2281 acatggagtc ccacctcttc agaggttgca catttgtttt taagactgaa ttcactactg 2341 atcccatcgc ctggccgaga cagtcattac tccattaaca tcctcactgt ttagacacat 2401 aactgtggta caggattgga aattataaac aaaagtgaaa gtgccaacaa attattgata 2461 gctgataatg tttcatatct gcaactgctt gataagtatg ttgcatttta agagcttata 2521 attgtgtata atttgttaac actagaaacc tattagtatt gtgaatgtag attttactgt 2581 gaagctatct gtgatttagc tgtttgctcc catgatggag tctttgcagc atggcgctag 2641 cagccaatgc agtttctaat actcggtaat ttgcatgttt tgtggagcat ttttatgtca 2701 ccaaccagac agtatttcct gcatgcttat ttagaagagg cagcttatct tgagaggtag 2761 tgttatctac ctttgtcagg ctttttgaca ggtcatttca gagtaagcct ttgttcccaa 2821 gacccaacaa ctgtcaccct cttctgtacc tctcctgagt gccaactgtc caggccattt 2881 gacacaccat ctgttaacct ctgagtttgc ccactcaagg ccactcatag gggcatccta 2941 gccctgtgca ctcagcactc ataggatcat ccagactctc atgcggcatg cagtctaatc 3001 atgacaaata atgctgctac tctgatatct ggctgagcaa ctgaattaca aaagagaatt 3061 acttccatct caacttcaac ccattgatta cgtccatcct agcaagctaa atggcatccc 3121 agctgctcct ttctgtgcaa ccaattaaag aacaatgagt gtgatgctcc atgtctgaat 3181 ttcgtccagc ctctctctga actgtgatct ttgtcctcat gaactttccc ttttgttcat 3241 tgaactatat ggactcttca tttcatattg atttactgtg caatttactt ttggacattg 3301 agaacttgaa attatttcct gatcccttcc ccttccacta ttaataattc atttctgtca 3361 aactgtaaga gtagactcat tttttttttt tttagttttt aacattggac tgttatttca 3421 tttagagttc tctatctcta aatatttatt tagagaatga ttttaaaagg gaatgatatg 3481 cttgtttaaa tgaaagagaa aagctgtagt aaactgtgtt aattggtaat gactatttat 3541 cgtcgatact ctgtagctgt gtaagttttg acaaatagtg tatctcgtgg aatcagtggt 3601 tagcattgcc gctattatat ttactcattt tatcattata aatgtgttta gttcatcatg 3661 tagcatcaaa a // LOCUS HSU04847 1857 bp mRNA PRI 29-NOV-1995 DEFINITION Human Ini1 mRNA, complete cds. ACCESSION U04847 NID g440239 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1857) AUTHORS Kalpana,G.V., Marmon,S., Wang,W., Crabtree,G.R. and Goff,S.P. TITLE Binding and stimulation of HIV-1 integrase by a human homolog of yeast transcription factor SNF5 JOURNAL Science 266 (5193), 2002-2006 (1994) MEDLINE 95099327 REFERENCE 2 (bases 1 to 1857) AUTHORS Goff,S.P. TITLE Direct Submission JOURNAL Submitted (05-JAN-1994) Stephen P. Goff, Biochemistry and Molecular Biophysics, Columbia University, HHSC 1128, 701 West 168th St., New York, NY 10032, USA FEATURES Location/Qualifiers source 1..1857 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HL60" CDS 70..1227 /codon_start=1 /product="Ini1" /db_xref="PID:g440240" /translation="MMMMALSKTFGQKPVKFQLEDDGEFYMIGSEVGNYLRMFRGSLY KRYPSLWRRLATVEERKKIVASSHGKKTKPNTKDHGYTTLATSVTLLKASEVEEILDG NDEKYKAVSISTEPPTYLREQKAKRNSQWVPTLSNSSHHLDAVPCSTTINRNRMGRDK KRTFPLCFDDHDPAVIHENASQPEVLVPIRLDMEIDGQKLRDAFTWNMNEKLMTPEMF SEILCDDLDLNPLTFVPAIASAIRQQIESYPTDSILEDQSDQRVIIKLNIHVGNISLV DQFEWDMSEKENSPEKFALKLCSELGLGGEFVTTIAYSIRGQLSWHQKTYAFSENPLP TVEIAIRNTGDADQWCPLLETLTDAEMEKKIRDQDRNTRRMRRLANTGPAW" BASE COUNT 457 a 524 c 476 g 400 t ORIGIN 1 gccccggccc cgccccagcc ctcctgatcc ctcgcagccc ggctccggcc gcccgcctct 61 gccgccgcaa tgatgatgat ggcgctgagc aagaccttcg ggcagaagcc cgtgaagttc 121 cagctggagg acgacggcga gttctacatg atcggctccg aggtgggaaa ctacctccgt 181 atgttccgag gttctctgta caagagatac ccctcactct ggaggcgact agccactgtg 241 gaagagagga agaaaatagt tgcatcgtca catggtaaaa aaacaaaacc taacactaag 301 gatcacggat acacgactct agccaccagt gtgaccctgt taaaagcctc ggaagtggaa 361 gagattctgg atggcaacga tgagaagtac aaggctgtgt ccatcagcac agagcccccc 421 acctacctca gggaacagaa ggccaagagg aacagccagt gggtacccac cctgtccaac 481 agctcccacc acttagatgc cgtgccatgc tccacaacca tcaacaggaa ccgcatgggc 541 cgagacaaga agagaacctt ccccctttgc tttgatgacc atgacccagc tgtgatccat 601 gagaacgcat ctcagcccga ggtgctggtc cccatccggc tggacatgga gatcgatggg 661 cagaagctgc gagacgcctt cacctggaac atgaatgaga agttgatgac gcctgagatg 721 ttttcagaaa tcctctgtga cgatctggat ttgaacccgc tgacgtttgt gccagccatc 781 gcctctgcca tcagacagca gatcgagtcc taccccacgg acagcatcct ggaggaccag 841 tcagaccagc gcgtcatcat caagctgaac atccatgtgg gaaacatttc cctggtggac 901 cagtttgagt gggacatgtc agagaaggag aactcaccag agaagtttgc cctgaagctg 961 tgctcggagc tggggttggg cggggagttt gtcaccacca tcgcatacag catccgggga 1021 cagctgagct ggcatcagaa gacctacgcc ttcagcgaga accctctgcc cacagtggag 1081 attgccatcc ggaacacggg cgatgcggac cagtggtgcc cactgctgga gactctgaca 1141 gacgctgaga tggagaagaa gatccgcgac caggacagga acacgaggcg gatgaggcgt 1201 cttgccaaca cgggcccggc ctggtaacca gcccatcagc acacggctcc cacggagcat 1261 ctcagaagat tgggccgcct ctcctccatc ttctggcaag gacagaggcg aggggacagc 1321 ccagcgccat cctgaggatc gggtgggggt ggagtggggg cttccaggtg gcccttcccg 1381 gtacacattc catttgttga gccccagtcc tgccccccac cccaccctcc ctacccctcc 1441 ccagtctctg gggtcaggaa gaaaccttat tttaggttgt gttttgtttt tgtataggag 1501 ccccaggcag ggctagtaac agtttttaaa taaaaggcaa caggtcatgt tcaatttctt 1561 aaatctagtg tctttatttc ttctgttaca atagtgttgc ttgtgtaagc aggttagagt 1621 gcacagtgtc cccaattgtt cctggcactg caaaaccaaa ttaaacaatc ccacaaagaa 1681 ttctgacatc aatgtgtttt cctcagtcag gtctatttca agattctaga agttcctttt 1741 gtaaaacttg cctttaaaac tcttcctcct aatgccatca gatctcttaa cattggctca 1801 ctgtgggatc tttcctctta ggttgaattt ctacgtgaat atcaaagtgc ctttttc // LOCUS HSU04897 1863 bp mRNA PRI 07-MAR-1995 DEFINITION Human orphan hormone nuclear receptor RORalpha1 mRNA, complete cds. ACCESSION U04897 NID g451563 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1773) AUTHORS Giguere,V., Tini,M., Flock,G., Ong,E., Evans,R.M. and Otulakowski,G. TITLE Isoform-specific amino-terminal domains dictate DNA-binding properties of ROR alpha, a novel family of orphan hormone nuclear receptors JOURNAL Genes Dev. 8 (5), 538-553 (1994) MEDLINE 95011560 REFERENCE 2 (bases 1 to 1863) AUTHORS Giguere,V. TITLE Direct Submission JOURNAL Submitted (06-JAN-1994) Vincent Giguere, Endocrine Research, Research Institute, Hospital for Sick Children, 555 University Avenue, Toronto, Ontario M5G 1X8, Canada FEATURES Location/Qualifiers source 1..1863 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="lambda hR5" /clone_lib="lamda gt11 from J. Nathans" /tissue_type="retina" /dev_stage="adult" 5'UTR 1..101 CDS 102..1673 /codon_start=1 /product="RORalpha1" /db_xref="PID:g451564" /translation="MESAPAAPDPAASEPGSSGADAAAGSRETPLNQESARKSEPPAP VRRQSYSSTSRGISVTKKTHTSQIEIIPCKICGDKSSGIHYGVITCEGCKGFFRRSQQ SNATYSCPRQKNCLIDRTSRNRCQHCRLQKCLAVGMSRDAVKFGRMSKKQRDSLYAEV QKHRMQQQQRDHQQQPGEAEPLTPTYNISANGLTELHDDLSNYIDGHTPEGSKADSAV SSFYLDIQPSPDQSGLDINGIKPEPICDYTPASGFFPYCSFTNGETSPTVSMAELEHL AQNISKSHLETCQYLREELQQITWQTFLQEEIENYQNKQREVMWQLCAIKITEAIQYV VEFAKRIDGFMELCQNDQIVLLKAGSLEVVFIRMCRAFDSQNNTVYFDGKYASPDVFK SLGCEDFISFVFEFGKSLCSMHLTEDEIALFSAFVLMSADRSWLQEKVKIEKLQQKIQ LALQHVLQKNHREDGILTKLICKVSTLRALCGRHTEKLMAFKAIYPDIVRLHFPPLYK ELFTSEFEPAMQIDG" 3'UTR 1674..1863 polyA_signal 1839..1844 BASE COUNT 571 a 436 c 428 g 428 t ORIGIN 1 gttttttttt tttttttggt accatagagt tgctctgaaa acagaagata gagggagtct 61 cggagctcgc atctccagcg atctctacat tgggaaaaaa catggagtca gctccggcag 121 cccccgaccc cgccgccagc gagccaggca gcagcggcgc ggacgcggcc gccggctcca 181 gggagacccc gctgaaccag gaatccgccc gcaagagcga gccgcctgcc ccggtgcgca 241 gacagagcta ttccagcacc agcagaggta tctcagtaac gaagaagaca catacatctc 301 aaattgaaat tattccatgc aagatctgtg gagacaaatc atcaggaatc cattatggtg 361 tcattacatg tgaaggctgc aagggctttt tcaggagaag tcagcaaagc aatgccacct 421 actcctgtcc tcgtcagaag aactgtttga ttgatcgaac cagtagaaac cgctgccaac 481 actgtcgatt acagaaatgc cttgccgtag ggatgtctcg agatgctgta aaatttggcc 541 gaatgtcaaa aaagcagaga gacagcttgt atgcagaagt acagaaacac cggatgcagc 601 agcagcagcg cgaccaccag cagcagcctg gagaggctga gccgctgacg cccacctaca 661 acatctcggc caacgggctg acggaacttc acgacgacct cagtaactac attgacgggc 721 acacccctga ggggagtaag gcagactccg ccgtcagcag cttctacctg gacatacagc 781 cttccccaga ccagtcaggt cttgatatca atggaatcaa accagaacca atatgtgact 841 acacaccagc atcaggcttc tttccctact gttcgttcac caacggcgag acttccccaa 901 ctgtgtccat ggcagaatta gaacaccttg cacagaatat atctaaatcg catctggaaa 961 cctgccaata cttgagagaa gagctccagc agataacgtg gcagaccttt ttacaggaag 1021 aaattgagaa ctatcaaaac aagcagcggg aggtgatgtg gcaattgtgt gccatcaaaa 1081 ttacagaagc tatacagtat gtggtggagt ttgccaaacg cattgatgga tttatggaac 1141 tgtgtcaaaa tgatcaaatt gtgcttctaa aagcaggttc tctagaggtg gtgtttatca 1201 gaatgtgccg tgcctttgac tctcagaaca acaccgtgta ctttgatggg aagtatgcca 1261 gccccgacgt cttcaaatcc ttaggttgtg aagactttat tagctttgtg tttgaatttg 1321 gaaagagttt atgttctatg cacctgactg aagatgaaat tgcattattt tctgcatttg 1381 tactgatgtc agcagatcgc tcatggctgc aagaaaaggt aaaaattgaa aaactgcaac 1441 agaaaattca gctagctctt caacacgtcc tacagaagaa tcaccgagaa gatggaatac 1501 taacaaagtt aatatgcaag gtgtctacat taagagcctt atgtggacga catacagaaa 1561 agctaatggc atttaaagca atatacccag acattgtgcg acttcatttt cctccattat 1621 acaaggagtt gttcacttca gaatttgagc cagcaatgca aattgatggg taaatgttat 1681 cacctaagca cttctagaat gtctgaagta caaacatgaa aaacaaacaa aaaaattaac 1741 cgagacactt tatatggccc tgcacagacc tggagcgcca cacactgcac atcttttggt 1801 gatcggggtc aggcaaagga ggggaaacaa tgaaaacaaa taaagttgaa cttgtttttc 1861 tca // LOCUS HSU04946 2043 bp mRNA PRI 04-JAN-1995 DEFINITION Human nucleophosmin-anaplastic lymphoma kinase fusion protein (NPM/ALK) mRNA, complete cds. ACCESSION U04946 NID g609341 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2043) AUTHORS Morris,S.W., Kirstein,M.N., Valentine,M.B., Dittmer,K.G., Shapiro,D.N., Saltman,D.L. and Look,A.T. TITLE Fusion of a kinase gene, ALK, to a nucleolar protein gene, NPM, in non-Hodgkin's lymphoma JOURNAL Science 263 (5151), 1281-1284 (1994) MEDLINE 94167588 REFERENCE 2 (bases 1 to 2043) AUTHORS Dittmer,K. TITLE Direct Submission JOURNAL Submitted (07-JAN-1994) K. Dittmer, St. Jude Children's Research Hospital, 332 N. Lauderdale, Memphis, TN 38105, USA FEATURES Location/Qualifiers source 1..2043 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="SU-DHL-1" /cell_type="T-cell" /tissue_type="lymphoma" gene 1..2043 /gene="NPM/ALK" CDS 1..2043 /gene="NPM/ALK" /codon_start=1 /product="nucleophosmin-anaplastic lymphoma kinase fusion protein" /db_xref="PID:g609342" /translation="MEDSMDMDMSPLRPQNYLFGCELKADKDYHFKVDNDENEHQLSL RTVSLGAGAKDELHIVEAEAMNYEGSPIKVTLATLKMSVQPTVSLGGFEITPPVVLRL KCGSGPVHISGQHLVVYRRKHQELQAMQMELQSPEYKLSKLRTSTIMTDYNPNYCFAG KTSSISDLKEVPRKNITLIRGLGHGAFGEVYEGQVSGMPNDPSPLQVAVKTLPEVCSE QDELDFLMEALIISKFNHQNIVRCIGVSLQSLPRFILLELMAGGDLKSFLRETRPRPS QPSSLAMLDLLHVARDIACGCQYLEENHFIHRDIAARNCLLTCPGPGRVAKIGDFGMA RDIYRASYYRKGGCAMLPVKWMPPEAFMEGIFTSKTDTWSFGVLLWEIFSLGYMPYPS KSNQEVLEFVTSGGRMDPPKNCPGPVYRIMTQCWQHQPEDRPNFAIILERIEYCTQDP DVINTALPIEYGPLVEEEEKVPVRPKDPEGVPPLLVSQQAKREEERSPAAPPPLPTTS SGKAAKKPTAAEVSVRVPRGPAVEGGHVNMAFSQSNPPSELHKVHGSRNKPTSLWNPT YGSWFTEKPTKKNNPIAKKEPHDRGNLGLEGSCTVPPNVATGRLPGASLLLEPSSLTA NMKEVPLFRLRHFPCGNVNYGYQQQGLPLEAATAPGAGHYEDTILKSKNSMNQPGP" misc_recomb 353 /gene="NPM/ALK" /note="fusion junction of the chimeric cDNA (nucleophosmin-anaplastic lymphoma kinase) formed by the t(2;5) chromosomal rearrangement involving the kinase gene ALK and the nucleolar protein gene NPM" /evidence=experimental /organism="Homo sapiens" /label=breakpoint BASE COUNT 519 a 555 c 547 g 422 t ORIGIN 1 atggaagatt cgatggacat ggacatgagc cccctgaggc cccagaacta tcttttcggt 61 tgtgaactaa aggccgacaa agattatcac tttaaggtgg ataatgatga aaatgagcac 121 cagttatctt taagaacggt cagtttaggg gctggtgcaa aggatgagtt gcacattgtt 181 gaagcagagg caatgaatta cgaaggcagt ccaattaaag taacactggc aactttgaaa 241 atgtctgtac agccaacggt ttcccttggg ggctttgaaa taacaccacc agtggtctta 301 aggttgaagt gtggttcagg gccagtgcat attagtggac agcacttagt agtgtaccgc 361 cggaagcacc aggagctgca agccatgcag atggagctgc agagccctga gtacaagctg 421 agcaagctcc gcacctcgac catcatgacc gactacaacc ccaactactg ctttgctggc 481 aagacctcct ccatcagtga cctgaaggag gtgccgcgga aaaacatcac cctcattcgg 541 ggtctgggcc atggcgcctt tggggaggtg tatgaaggcc aggtgtccgg aatgcccaac 601 gacccaagcc ccctgcaagt ggctgtgaag acgctgcctg aagtgtgctc tgaacaggac 661 gaactggatt tcctcatgga agccctgatc atcagcaaat tcaaccacca gaacattgtt 721 cgctgcattg gggtgagcct gcaatccctg ccccggttca tcctgctgga gctcatggcg 781 gggggagacc tcaagtcctt cctccgagag acccgccctc gcccgagcca gccctcctcc 841 ctggccatgc tggaccttct gcacgtggct cgggacattg cctgtggctg tcagtatttg 901 gaggaaaacc acttcatcca ccgagacatt gctgccagaa actgcctctt gacctgtcca 961 ggccctggaa gagtggccaa gattggagac ttcgggatgg cccgagacat ctacagggcg 1021 agctactata gaaagggagg ctgtgccatg ctgccagtta agtggatgcc cccagaggcc 1081 ttcatggaag gaatattcac ttctaaaaca gacacatggt cctttggagt gctgctatgg 1141 gaaatctttt ctcttggata tatgccatac cccagcaaaa gcaaccagga agttctggag 1201 tttgtcacca gtggaggccg gatggaccca cccaagaact gccctgggcc tgtataccgg 1261 ataatgactc agtgctggca acatcagcct gaagacaggc ccaactttgc catcattttg 1321 gagaggattg aatactgcac ccaggacccg gatgtaatca acaccgcttt gccgatagaa 1381 tatggtccac ttgtggaaga ggaagagaaa gtgcctgtga ggcccaagga ccctgagggg 1441 gttcctcctc tcctggtctc tcaacaggca aaacgggagg aggagcgcag cccagctgcc 1501 ccaccacctc tgcctaccac ctcctctggc aaggctgcaa agaaacccac agctgcagag 1561 gtctctgttc gagtccctag agggccggcc gtggaagggg gacacgtgaa tatggcattc 1621 tctcagtcca accctccttc ggagttgcac aaggtccacg gatccagaaa caagcccacc 1681 agcttgtgga acccaacgta cggctcctgg tttacagaga aacccaccaa aaagaataat 1741 cctatagcaa agaaggagcc acacgacagg ggtaacctgg ggctggaggg aagctgtact 1801 gtcccaccta acgttgcaac tgggagactt ccgggggcct cactgctcct agagccctct 1861 tcgctgactg ccaatatgaa ggaggtacct ctgttcaggc tacgtcactt cccttgtggg 1921 aatgtcaatt acggctacca gcaacagggc ttgcccttag aagccgctac tgcccctgga 1981 gctggtcatt acgaggatac cattctgaaa agcaagaata gcatgaacca gcctgggccc 2041 tga // LOCUS HSU04953 4508 bp mRNA PRI 31-OCT-1995 DEFINITION Human isoleucyl-tRNA synthetase mRNA, complete cds. ACCESSION U04953 NID g450850 KEYWORDS IRS. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4508) AUTHORS Nichols,R.C., Raben,N., Boerkoel,C.F. and Plotz,P.H. TITLE Human isoleucyl-tRNA synthetase: sequence of the cDNA, alternative mRNA splicing, and the characteristics of an unusually long C-terminal extension JOURNAL Gene 155 (2), 299-304 (1995) MEDLINE 95237628 REFERENCE 2 (bases 1 to 4508) AUTHORS Nichols,R.C. TITLE Direct Submission JOURNAL Submitted (07-JAN-1994) Ralph C. Nichols, National Institutes of Health, Arthritis and Rheumatism Branch, 9000 Rockville Pike, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..4508 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Hep G2 cDNA library, Stratagene" /cell_line="Hep G2" /cell_type="hepatocyte" /tissue_type="liver" intron 79..248 /note="This region is excised in some transcripts and is therefore probably an intron. This suggests that the ATG at 256 rather than the one at 245 is the start codon." CDS 256..4044 /EC_number="6.1.1.5" /note="The following aa are consensus sequences: HYGH(aa 55-58), WTISR(aa 455-459), KMSKR(aa 600-604); 180 aa at the C-terminal region are not present in the homologous enzymes from yeast (Swiss-Prot Accession Number P09436), tetrahymena (PIR Accession Number A42399) and methanobacter (Swiss-Prot Accession Number P26499)" /codon_start=1 /function="ligates isoleucine to isoleucyl-tRNA" /product="isoleucyl-tRNA synthetase" /db_xref="PID:g440799" /translation="MLQQVPENINFPAEEEKILEFWTEFNCFQECLKQSKHKPKFTFY DGPPFATGLPHYGHILAGTIKDIVTRYAHQSGFHVDRRFGWDCHGLPVEYEIDKTLGI RGPEDVAKMGITEYNNQCRAIVMRYSAEWKSTVSRLGRWIDFDNDYKTLYPQFMESVW WVFKQLYDKGLVYRGVKVMPFSTACNTPLSNFESHQNYKDVQDPSVFVTFPLEEDETV SLVAWTTTPWTLPSNLAVCVNPEMQYVKIKDVARGRLLILMEARLSALYKLESDYEIL ERFPGAYLKGKKYRPLFDYFLKCKENGAFTVLVDNYVKEEEGTGVVHQAPYFGAEDYR VCMDFNIIRKDSLPVCPVDASGCFTTEVTDFAGQYVKDADKSIIRTLKEQGRLLVATT FTHSYPFCWRSDTPLIYKAVPSWFVRVENMVDQLLRNNDLCYWVPELVREKRFGNWLK DARDWTISRNRYWGTPIPLWVSDDFEEVVCIGSVAELEELSGAKISDLHRESVDHLTI PSRCGKGSLHRISEVFDCWFESGSMPYAQVHYPFENKREFEDAFPADFIAEGIDQTRG WFYTLLVLATALFGQPPFKNVIVNGLVLASDGQKMSKRKKNYPDPVSIIQKYGADALR LYLINSPVVRAENLRFKEEGVRDVLKDVLLPWYNAYRFLIQNVLRLQKEEEIEFLYNE NTVRESPNITDRWILSFMQSLIGFFETEMAAYRLYTVVPRLVKFVDILTNWYVRMNRR RLKGENGMEDCVMALETLFSVLLSLCRLIAPYTPFLTELMYQNLKVLIDPVSVQDKDT LSIHYLMLPRVREELIDKKTESAVSQMQSVIELGRVIRDRKTIPIKYPLKEIVVIHQD PEALKDIKSLEKYIIEELNVRKVTLSTDKNKYGIRLRAEPDHMVLGKRLKGAFKAVMT SIKQLSSEELEQFQKTGTIVVEGHELHDEDIRLMYTFDQATGGTAQFEAHSDAQALVL LDVTPDQSMVDEGMAREVINRIQKLRKKCNLVPTDEITVYYKAKSEGTYLNSVIESHT EFIFTTIKAPLKPYPVSPSDKVLIQEKTQLKGSELEITLTRGSSLPGPACAYVNLNIC ANGSEQGGVLLLENPKGDNRLDLLKLKSVVTSIFGVKNTELAVFHDETEIQNQTDLLS LSGKTLCVTAGSAPSLINSSSTLLCQYINLQLLNAKPQECLMGTVGTLLLENPLGQNG LTHQGLLYEAAKVFGLRSRKLKLFLNETQTQEITEDIPVKTLNMKTVYVSVLPTTADF " polyA_site 4508 BASE COUNT 1299 a 947 c 1036 g 1226 t ORIGIN 1 cgggcagcgt ggaccccgga tgagttgctt ttaggcttgc tggcccgcgg ggctgtccag 61 gcacgcgagg cccctcaggt acgccctctc ttccctgcag gatccggccc tcaaagacga 121 gggtcacgca cgcgttacaa ccccgaaaca gtagcacaag atttaatttt taaaagagcg 181 tgtttcttcg gggcttgccg ttcgttcgtt tccagcctca ggaatttatg gtcgcctttt 241 tgaatgagca acaaaatgct tcaacaagtt ccagaaaaca taaattttcc tgctgaagaa 301 gagaaaatct tggagttttg gactgaattt aattgttttc aggaatgctt aaagcaatca 361 aaacataaac caaaatttac cttctatgat ggtcctcctt ttgcaactgg actgcctcac 421 tatggacata tacttgcggg tacaattaaa gatatagtta caagatatgc tcaccagagt 481 gggtttcatg ttgacagaag atttggatgg gattgccatg gcttacctgt ggaatatgaa 541 attgataaga cactgggaat cagaggacca gaggatgtgg ccaaaatggg gattacagag 601 tataacaatc agtgccgagc aattgtgatg agatattctg ctgagtggaa gtctactgtt 661 agcagacttg gccgatggat tgactttgac aatgactata aaactctgta tccacaattc 721 atggaatcag tctggtgggt cttcaaacaa ctctatgata aaggccttgt ttatagaggt 781 gtgaaagtca tgcccttctc tacggcatgt aacactccac tttccaactt cgagtcacac 841 cagaattata aggatgttca agatccttca gtatttgtaa ctttcccttt ggaagaagat 901 gaaactgtat ctttagttgc ttggacaacc actccctgga ctctacctag taaccttgct 961 gtgtgtgtta atccagaaat gcaatatgtg aaaattaaag atgttgccag aggacgatta 1021 ctcattttaa tggaagccag attgtcagcc ctctataaat tggagagtga ctatgagatc 1081 cttgaaagat ttcctggtgc ctatcttaaa ggcaagaagt acaggcccct gtttgactat 1141 ttcctgaagt gtaaagagaa tggcgctttc actgtgcttg ttgacaacta tgtgaaggaa 1201 gaagaaggca caggggttgt ccaccaagct ccttacttcg gtgctgagga ctatcgggtc 1261 tgtatggact ttaacattat tcggaaagac tcactccctg tttgccctgt ggatgcttca 1321 ggctgcttca caacggaggt gacagatttc gcaggacagt atgtgaagga tgctgacaaa 1381 agtatcatca ggactttgaa ggaacaaggc cgacttctgg ttgccaccac cttcactcac 1441 agctaccctt tttgctggag atcagacact cctctaattt acaaagcagt gcccagctgg 1501 tttgtgcgag tggagaacat ggtggaccag ctcctaagga acaatgacct gtgctactgg 1561 gtcccagagt tggtacgaga aaaacgattt ggaaattggc tgaaagatgc acgtgactgg 1621 acaatttcca gaaacagata ctggggcacc cccatcccac tgtgggtcag cgatgacttt 1681 gaggaggtgg tatgcattgg gtcagtggcg gaacttgaag aactgtcagg agcaaagatc 1741 tcagatctcc acagagagag tgttgaccac ctgaccattc cttcacgctg tgggaaggga 1801 tccttgcacc gcatctctga agtgtttgac tgttggtttg agagtggcag catgccctat 1861 gctcaggttc attacccgtt tgaaaacaag agggagtttg aggatgcttt tcctgcagat 1921 ttcattgccg agggcatcga ccaaaccaga ggatggtttt ataccctgct ggtgctggcc 1981 acggccctct ttggacaacc gcctttcaag aacgtaattg tgaatgggct tgtcctggca 2041 agtgatggcc aaaaaatgag caaacggaaa aagaattatc cagatccagt ttccatcatc 2101 cagaagtatg gtgctgatgc cctcagatta tatctgatta actcccctgt ggtgagagca 2161 gaaaacctcc gctttaaaga agagggtgtg cgggacgtcc ttaaggatgt actgctccca 2221 tggtacaatg cctatcgctt cttaatccag aacgttctga ggctccagaa ggaggaagaa 2281 atagaatttc tctacaatga gaacacggtt agagaaagcc ccaacattac agaccggtgg 2341 atcctgtcct tcatgcagtc tctcattggc ttctttgaga ctgaaatggc agcttatagg 2401 ctttatactg tggtgcctcg cctggtcaag tttgtagata ttctgaccaa ttggtatgtt 2461 agaatgaacc gcagaagatt aaagggtgaa aatgggatgg aggattgtgt catggcccta 2521 gaaaccttgt ttagtgttct gctttctctt tgcagactta tagctcccta cacacctttt 2581 ctcactgaat tgatgtacca gaatctaaag gtgctgattg accctgtttc tgttcaggac 2641 aaggacacac tcagcattca ctacctcatg ctgccccgtg ttcgagaaga attgattgac 2701 aagaaaacag agagtgcagt atctcagatg cagtctgtga ttgaacttgg aagagtgatc 2761 agagaccgaa aaactattcc cataaagtat cctttgaaag aaattgtggt tatccatcaa 2821 gatccagaag ctcttaaaga tatcaagtct ttggagaagt atatcattga ggaactcaat 2881 gttcgaaaag ttacactgtc tacagataaa aacaagtatg gcattcggct aagggcagaa 2941 ccagatcaca tggtcctggg gaagcgtctg aagggagcct ttaaggcagt gatgacgtcc 3001 atcaagcagt tgagcagtga ggagctggag cagttccaga agactgggac cattgttgtg 3061 gaaggccatg aattgcacga tgaagacatc cgcctcatgt acacctttga tcaggccaca 3121 ggtgggactg cgcaatttga agcacactca gatgctcagg ctttggtcct cttagatgtc 3181 actcctgacc agtcaatggt agatgaagga atggctcggg aagtcatcaa tcgcatacag 3241 aaacttcgca aaaagtgcaa tctggttcca actgatgaaa tcacagtgta ctataaagca 3301 aagtctgaag gaacatatct gaatagtgtt attgaaagcc acacagagtt catatttacc 3361 accataaagg ctcccttgaa accatatcca gtttctccat cggataaagt ccttattcaa 3421 gaaaaaacac agttgaaggg atctgaactg gaaattacac tcaccagagg atcttccctt 3481 cctggtcctg cttgtgcata tgtcaatctt aacatttgtg caaatggcag tgaacaaggt 3541 ggagtattgc tcctggaaaa tccaaaaggt gacaataggt tggacctttt aaagctgaag 3601 agtgttgtca ctagcatttt tggtgtgaaa aatacagagc tggctgtctt ccatgatgaa 3661 acagaaatac aaaaccaaac tgacttactg agtcttagtg gaaaaacact ttgtgtgact 3721 gcaggatcgg ctccctctct gatcaacagt tctagtactc ttctttgtca gtatatcaac 3781 ctacagctcc tgaatgcaaa gccacaagag tgtttaatgg ggacagtggg cactctcctg 3841 cttgaaaacc cacttgggca gaatggactc acccaccaag gtcttctgta tgaagcagcc 3901 aaggtgtttg gccttcggag caggaagcta aagctgtttc tgaatgagac ccaaacgcag 3961 gaaattacag aagacatccc cgtgaagact ttgaatatga agactgtgta tgtttctgtg 4021 ttaccaacaa cagcagactt ctagcatgta cttatcaatg ttgttcggtc agcccttccc 4081 taattacacc tatcccctac acatacatgc acatagacac acacatgaac acactgaaga 4141 tatttccttc aggtgtgtgt aaaatatgct gcttggattg aaattcaaat gggattgatt 4201 agtcaagtaa cttgagacct cacagtaatc ttcacactta accttagaca cctatgcagt 4261 catgttggga gcaggttaca atgttacttc agcccacagt ttatttctat tcttgagttc 4321 ttaagtacag aagatagaag tgatttaaat ggcatagtat atatatcatt ttctggcctt 4381 ttaaaattta tttgagacct cttgatgaaa tggacatatt atatatttct gccacctgga 4441 ttttcctgga taatttgatg gaatatttta agtttcagta aatcagaaca ataaacaaac 4501 tcagatat // LOCUS HSU05012 2750 bp mRNA PRI 15-SEP-1995 DEFINITION Human receptor tyrosine kinase TrkC (NTRK3) mRNA, complete cds. ACCESSION U05012 NID g442389 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2750) AUTHORS McGregor,L.M., Baylin,S.B., Griffin,C.A., Hawkins,A.L. and Nelkin,B.D. TITLE Molecular cloning of the cDNA for human TrkC (NTRK3), chromosomal assignment, and evidence for a splice variant JOURNAL Genomics 22 (2), 267-272 (1994) MEDLINE 95104834 REFERENCE 2 (bases 1 to 2750) AUTHORS McGregor,L.M. TITLE Direct Submission JOURNAL Submitted (12-JAN-1994) Lisa M. McGregor, Johns Hopkins Medical Institutions, Oncology Center, 424 N. Bond Street, Baltimore, MD 21231, USA FEATURES Location/Qualifiers source 1..2750 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="trkC" /clone_lib="fetal brain cDNA" /map="15q24-q25" /chromosome="15" repeat_region 58..81 /rpt_type=TANDEM /rpt_family="trinucleotide" repeat_unit 58..60 gene 156..2633 /gene="NTRK3" CDS 156..2633 /gene="NTRK3" /codon_start=1 /function="receptor tyrosine kinase" /product="TrkC" /db_xref="PID:g442390" /translation="MDVSLCPAKCSFWRIFLLGSVWLDYVGSVLACPANCVCSKTEIN CRRPDDGNLFPLLEGQDSGNSNGNASINITDISRNITSIHIENWRSLHTLNAVDMELY TGLQKLTIKNSGLRSIQPRAFAKNPHLRYINLSSNRLTTLSWQLFQTLSLRELQLEQN FFNCSCDIRWMQLWQEQGEAKLNSQNLYCINADGSQLPLFRMNISQCDLPEISVSHVN LTVREGDNAVITCNGSGSPLPDVDWIVTGLQSINTHQTNLNWTNVHAINLTLVNVTSE DNGFTLTCIAENVVGMSNASVALTVYYPPRVVSLEEPELRLEHCIEFVVRGNPPPTLH WLHNGQPLRESKIIHVEYYQEGEISEGCLLFNKPTHYNNGNYTLIAKNPLGTANQTIN GHFLKEPFPESTDNFILFDEVSPTPPITVTHKPEEDTFGVSIAVGLAAFACVLLVVLF VMINKYGRRSKFGMKGPVAVISGEEDSASPLHHINHGITTPSSLDAGPDTVVIGMTRI PVIENPQYFRQGHNCHKPDTYVQHIKRRDIVLKRELGEGAFGKVFLAECYNLSPTKDK MLVAVKALKDPTLAARKDFQREAELLTNLQHEHIVKFYGVCGDGDPLIMVFEYMKHGD LNKFLRAHGPNAMILVDGQPRQAKGELGLSQMLHIASQIASGMVYLASQHFVHRDLAT RNCLVGANLLVKIGDFGMSRDVYSTDYYRVGGHTMLPIRWMPPESIMYRKFTTESDVW SFGVILWEIFTYGKQPWFQLSNTEVIECITQGRVLERPRVCPKEVYDVMLGCWQREPQ QRLNIKEIYKILHALGKATPIYLDILG" BASE COUNT 638 a 783 c 732 g 597 t ORIGIN 1 gatgcgagcc ggccaccagt cccggcagag ccactagggc ctctcgcggc tcccacccgg 61 cggcggcggc ggcggcggcg gcgtccgcga tggtttcaga cgctgaagga ttttgcatct 121 gatcgctcgg cgtttcaaag aagcagcgat cggagatgga tgtctctctt tgcccagcca 181 agtgtagttt ctggcggatt ttcttgctgg gaagcgtctg gctggactat gtgggctccg 241 tgctggcttg ccctgcaaat tgtgtctgca gcaagactga gatcaattgc cggcggccgg 301 acgatgggaa cctcttcccc ctcctggaag ggcaggattc agggaacagc aatgggaacg 361 ccagtatcaa catcacggac atctcaagga atatcacttc catacacata gagaactggc 421 gcagtcttca cacgctcaac gccgtggaca tggagctcta caccggactt caaaagctga 481 ccatcaagaa ctcaggactt cggagcattc agcccagagc ctttgccaag aacccccatt 541 tgcgttatat aaacctgtca agtaaccggc tcaccacact ctcgtggcag ctcttccaga 601 cgctgagtct tcgggaattg cagttggagc agaacttttt caactgcagc tgtgacatcc 661 gctggatgca gctctggcag gagcaggggg aggccaagct caacagccag aacctctact 721 gcatcaacgc tgatggctcc cagcttcctc tcttccgcat gaacatcagt cagtgtgacc 781 ttcctgagat cagcgtgagc cacgtcaacc tgaccgtacg agagggtgac aatgctgtta 841 tcacttgcaa tggctctgga tcaccccttc ctgatgtgga ctggatagtc actgggctgc 901 agtccatcaa cactcaccag accaatctga actggaccaa tgttcatgcc atcaacttga 961 cgctggtgaa tgtgacgagt gaggacaatg gcttcaccct gacgtgcatt gcagagaacg 1021 tggtgggcat gagcaatgcc agtgttgccc tcactgtcta ctatccccca cgtgtggtga 1081 gcctggagga gcctgagctg cgcctggagc actgcatcga gtttgtggtg cgtggcaacc 1141 ccccaccaac gctgcactgg ctgcacaatg ggcagcctct gcgggagtcc aagatcatcc 1201 atgtggaata ctaccaagag ggagagattt ccgagggctg cctgctcttc aacaagccca 1261 cccactacaa caatggcaac tataccctca ttgccaaaaa cccactgggc acagccaacc 1321 agaccatcaa tggccacttc ctcaaggagc cctttccaga gagcacggat aactttatct 1381 tgtttgacga agtgagtccc acacctccta tcactgtgac ccacaaacca gaagaagaca 1441 cttttggggt atccatagca gttggacttg ctgcttttgc ctgtgtcctg ttggtggttc 1501 tcttcgtcat gatcaacaaa tatggtcgac ggtccaaatt tggaatgaag ggtcccgtgg 1561 ctgtcatcag tggtgaggag gactcagcca gcccactgca ccacatcaac cacggcatca 1621 ccacgccctc gtcactggat gcggggcccg acactgtggt cattggcatg actcgcatcc 1681 ctgtcattga gaacccccag tacttccgtc agggacacaa ctgccacaag ccggacacgt 1741 atgtgcagca cattaagagg agagacatcg tgctgaagcg agaactgggt gagggagcct 1801 ttggaaaggt cttcctggcc gagtgctaca acctcagccc gaccaaggac aagatgcttg 1861 tggctgtgaa ggccctgaag gatcccaccc tggctgcccg gaaggatttc cagagggagg 1921 ccgagctgct caccaacctg cagcatgagc acattgtcaa gttctatgga gtgtgcggcg 1981 atggggaccc cctcatcatg gtctttgaat acatgaagca tggagacctg aataagttcc 2041 tcagggccca tgggccaaat gcaatgatcc ttgtggatgg acagccacgc caggccaagg 2101 gtgagctggg gctctcccaa atgctccaca ttgccagtca gatcgcctcg ggtatggtgt 2161 acctggcctc ccagcacttt gtgcaccgag acctggccac caggaactgc ctggttggag 2221 cgaatctgct agtgaagatt ggggacttcg gcatgtccag agatgtctac agcacggatt 2281 attacagggt gggaggacac accatgctcc ccattcgctg gatgcctcct gaaagcatca 2341 tgtaccggaa gttcactaca gagagtgatg tatggagctt cggggtgatc ctctgggaga 2401 tcttcaccta tggaaagcag ccatggttcc aactctcaaa cacggaggtc attgagtgca 2461 ttacccaagg tcgtgttttg gagcggcccc gagtctgccc caaagaggtg tacgatgtca 2521 tgctggggtg ctggcagagg gaaccacagc agcggttgaa catcaaggag atctacaaaa 2581 tcctccatgc tttggggaag gccaccccaa tctacctgga cattcttggc tagtggtggc 2641 tggtggtcat gaattcatac tctgttgcct cctctctccc tgcctcacat ctcccttcca 2701 cctcacaact ccttccagcc ttgactgaag cgaacatctt catataaact // LOCUS HSU05040 2325 bp mRNA PRI 08-MAY-1994 DEFINITION Human FUSE binding protein mRNA, complete cds. ACCESSION U05040 NID g460151 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2325) AUTHORS Duncan,R., Bazar,L., Michelotti,G., Tomonaga,T., Krutzsch,H., Avigan,M. and Levens,D. TITLE A sequence-specific, single-strand binding protein activates the far upstream element of c-myc and defines a new DNA-binding motif JOURNAL Genes Dev. 8, 465-480 (1994) MEDLINE 94170991 REFERENCE 2 (bases 1 to 2325) AUTHORS Duncan,R.C. TITLE Direct Submission JOURNAL Submitted (13-JAN-1994) Robert C Duncan, Lab of Pathology, NCI, Bldg 10 Rm 2N105, NIH, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..2325 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="HL60 cell line, Stratagene BJAB cell line, T. Behrens PBL, activated, Siebenlist" 5'UTR 1..26 CDS 27..1961 /codon_start=1 /product="FUSE binding protein" /db_xref="PID:g460152" /translation="MADYSTVPPPSSGSAGGGGGGGGGGGVNDAFKDALQRARQIAAK IGGDAGTSLNSNDYGYGGQKRPLEDGDQPDAKKVAPQNDSFGTQLPPMHQQQSRSVMT EEYKVPDGMVGFIIGRGGEQISRIQQESGCKIQIAPDSGGLPERSCMLTGTPESVQSA KRLLDQIVEKGRPAPGFHHGDGPGNAVQEIMIPASKAGLVIGKGGETIKQLQERAGVK MVMIQDGPQNTGADKPLRITGDPYKVQQAKEMVLELIRDQGGFREVRNEYGSRIGGNE GIDVPIPRFAVGIVIGRNGEMIKKIQNDAGVRIQFKPDDGTTPERIAQITGPPDRCQH AAEIITDLLRSVQAGNPGGPGPGGRGRGRGQGNWNMGPPGGLQEFNFIVPTGKTGLII GKGGETIKSISQQSGARIELQRNPPPNADPNMKLFTIRGTPQQIDYARQLIEEKIGGP VNPLGPPVPHGPHGVPGPHGPPGPPGPGTPMGPYNPAPYNPGPPGPAPHGPPAPYAPQ GWGNAYPHWQQQAPPDPAKAGTDPNSAAWAAYYAHYYQQQAQPPPAAPAGAPTTTQTN GQGDQQNPAPAGQVDYTKAWEEYYKKMGQAVPAPTGAPPGGQPDYSAAWAEHYRQQAA YYAQTSPQGMPQHPPAPQGQ" 3'UTR 1962..2325 polyA_signal 2305..2310 BASE COUNT 712 a 488 c 565 g 560 t ORIGIN 1 gcggcagcgg ctcttatagt gcaaccatgg cagactattc aacagtgcct cccccctctt 61 ctggctcagc tggtggcggt ggtggcggcg gtggtggtgg aggagttaac gacgctttca 121 aagatgcact gcagagagcc cggcagattg cagcaaaaat tggaggtgat gcagggacat 181 cactgaattc aaatgactat ggttatgggg gacaaaaaag acctttagaa gatggagatc 241 aaccagatgc taagaaagtt gctcctcaaa atgactcttt tggaacacag ttaccaccga 301 tgcatcagca gcaaagcaga tctgtaatga cagaagaata caaagttcca gatggaatgg 361 ttggattcat aattggcaga ggaggtgaac agatctcacg catacaacag gaatctggat 421 gcaaaataca gatagctcct gacagtggtg gccttccaga aaggtcctgt atgttaactg 481 gaacacctga atctgtccag tcagcaaaac ggttactgga ccagattgtt gaaaaaggaa 541 gaccagctcc tggcttccat catggcgatg gaccgggaaa tgcagttcaa gaaatcatga 601 ttccagctag caaggcagga ttagtcattg gaaaaggggg agaaactatt aaacagcttc 661 aggaacgggc tggagttaaa atggttatga ttcaagacgg gccgcagaac actggtgctg 721 acaaacctct taggattaca ggagacccat ataaagttca acaagccaag gaaatggtgt 781 tagagttaat tcgtgatcaa ggcggtttca gagaagttcg gaatgagtat gggtcaagaa 841 taggaggaaa tgaagggata gatgtcccca ttccaagatt tgctgttggc attgtaatag 901 gaagaaatgg agagatgatc aaaaaaatac aaaatgatgc tggtgttcgc attcagttta 961 agccagatga tgggacaaca cccgaaagga tagcacaaat aacaggacct ccagaccgat 1021 gtcaacatgc tgcagaaatt attacagacc ttcttcgaag tgttcaggct ggtaatcctg 1081 gtggacctgg acctggtggt cgaggaagag gtagaggtca aggcaactgg aacatgggac 1141 cacctggtgg attacaggaa tttaatttta ttgtgccaac tgggaaaact ggattaataa 1201 taggaaaagg aggtgaaacc ataaaaagca taagccagca gtctggtgca agaatagaac 1261 ttcagagaaa tcctccacca aatgcagatc ctaatatgaa gttatttaca attcgtggca 1321 ctccacaaca gatagactat gctcggcaac tcatagaaga aaagattggt ggcccagtaa 1381 atcctttagg gccacctgta ccccatgggc cccatggtgt cccaggcccc catggacctc 1441 ctgggcctcc agggcctgga actccaatgg gaccatacaa ccctgcacct tataatcctg 1501 gaccaccagg cccggctcct catggtcctc cagccccata tgctccccag ggatggggaa 1561 atgcatatcc acactggcag cagcaggctc ctcctgatcc agctaaggca ggaacggatc 1621 caaattcagc agcttgggct gcttattacg ctcactatta tcaacagcaa gcacagccac 1681 caccagcagc ccctgcaggt gcaccaacta caactcaaac taatggacaa ggagatcagc 1741 agaatccagc cccagctgga caggttgatt ataccaaggc ttgggaagag tactacaaga 1801 aaatgggtca ggcagttcct gctccgactg gggctcctcc aggtggtcag ccagattata 1861 gtgcagcctg ggctgagcat tatagacaac aagcagccta ttatgcccag acaagtcccc 1921 agggaatgcc acagcatcct ccagcacctc agggccaata ataagaagtg gacaatacag 1981 tatttgcttc attgtgtggg ggaaaaaaac ctttgttaaa tatatggatg cagacgactt 2041 gatgaagatc ttaattttgt ttttggttta aaatagtgtt tccttttttt tttttttttt 2101 tttgaaaatg tacaaaatat ctatcactac tgataggagg ttaatatttc tgtgtagaaa 2161 tgaaaattgg tttgttttta gtatttagtg tagatgtaca cattccagca aatgtatttg 2221 caattatgtg gttgatgctt tgtgatataa atgtactttt tcaatgtata ctttcacttt 2281 ccaaatgcct gttttgtgct ttacaataaa tgatatgaaa cctca // LOCUS HSU05227 1673 bp mRNA PRI 29-MAR-1994 DEFINITION Human Rar protein mRNA, complete cds. ACCESSION U05227 NID g466270 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1673) AUTHORS Peng,H., Lee,J. and Chang,H. TITLE Characterization of a human hippocampus cDNA clone encoding a novel SEC-4 like protein JOURNAL Unpublished REFERENCE 2 (bases 1 to 1673) AUTHORS Chang,H. TITLE Direct Submission JOURNAL Submitted (14-JAN-1994) Hwan-You Chang, Molecular and Cellular Biology, Chang-Gung Medical and Technology College, 259 Wen Hwan First Road, Kwei-San, Taiwan FEATURES Location/Qualifiers source 1..1673 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pHC059" /tissue_type="neural tissue (Hippocampus)" 5'UTR 1..45 CDS 46..882 /note="small GTP binding protein, homologous to Saccharomyces cerevisiae SEC4, Swiss-Prot Accession Number P07560" /codon_start=1 /product="Rar protein" /db_xref="PID:g466271" /translation="MSALGSPVRAYDFLLKFLLVGDSDVGKGEILASLQDGAAESPYG HPAGIDYKTTTILLDGRRVKLQLWDTSGQGRFCTIFRSYSRGAQGVILVYDIANRWSF DGIDRWIKEIDEHAPGVPKILVGNRLHLAFKRQVPTEQAQAYAERLGVTFFEVSPLCN FNITESFTELARIVLLRHGMDRLWRPSKVLSLQDLCCRAVVSCTPVHLVDKLPLPIAL RSHLKSFSMANGLNARMMHGGSYSLTTSSTHKRSSLRKVKLVRPPQSPPKNCTRNSCK IS" 3'UTR 883..1673 BASE COUNT 398 a 421 c 446 g 408 t ORIGIN 1 attcccccgc aggccgggca tgggtggggg cgccgggccg tcacgatgag cgccctgggc 61 agcccggtcc gggcctacga ctttctgctc aagttcctgc tggtgggcga cagcgacgtg 121 ggcaagggcg agatcctggc gagcctgcag gatggcgcgg ccgagtcccc gtacggccac 181 ccggcgggca tcgactacaa gacgaccacc atcctgctgg acgggcggcg ggtgaagctg 241 cagctctggg atacttcagg ccagggaaga ttttgtacca tattccgctc ctactcccgg 301 ggcgcacagg gtgtgatcct ggtctatgac attgcgaacc gctggtcttt tgacggcatt 361 gatcgatgga ttaaggagat cgatgagcat gcccccggag tccccaagat cctggtgggg 421 aaccgcctgc acctggcgtt caagcggcag gtgcccacgg agcaggccca ggcctacgcc 481 gagcgcctgg gcgtgacctt ctttgaggtc agccctctgt gcaatttcaa catcacagag 541 tcgttcacgg agctggccag gatcgtgctg ctgcggcatg ggatggaccg gctctggcgg 601 ccgagcaagg tgctgagctt gcaagacctc tgctgccggg cggtcgtgtc ctgcacgccg 661 gtgcacctgg tggacaagct cccgctcccc attgccttaa gaagccacct caagtccttc 721 tcgatggcca acggcctgaa tgccaggatg atgcacggcg gttcctactc cctcaccacc 781 agctccaccc acaaaaggag cagcctccgc aaagtgaagc tcgtccgccc cccccagagc 841 ccccccaaaa actgcaccag aaacagctgc aaaatttctt aaggaaggca ctgaaagaaa 901 cacggcggaa tctctccagg agaagctcgg cgttaccccc ggcagctggt ggatgcatct 961 cagatcccgg ttcctctcgg cgaatgctgc ttgcgaatgt gtgcgacgcc ttccgtgtga 1021 tggaaacaca ctaccccgtc ggacttcgaa tttctacgtg gatgtgcatg aagctcttgt 1081 tttcgatgtg tgtttgtaaa gggaaaatta gtactctgct cgactcttgg taacatgaaa 1141 ttctgaatgt tactttatca tgattgcact gcaacttttt tccttaaaat aactgctttt 1201 gtaagaacgg tgatattgga gtgattagta taaattcaat ggaatttgag aagcaatggc 1261 agcgggataa tttagagtca ctgatattac gagaggggtc tttttgtaaa cctccttttc 1321 aatgtcaaag caccaattta taaaacgctg cagatgtaga ggttatgtgc aactgatctg 1381 tccagtttgt gtatgaaatg gatttgataa agtttttgct agttatttac tacattttgg 1441 gattaataag tgatttatat gcatattttt ctgtaaatct acagtttttt gtacaagata 1501 ttctacaagt tatgaagcta agggaagaaa atgccaaaga tacctctagt tatgttgaac 1561 acagccagca cagtttcgac aggtcaagga agagctgttt cagtaaagaa tgaagtgaaa 1621 acacttattt aggaaaatgt ttctcaacaa taaaatgtat agttgtttct ctc // LOCUS HSU05237 2673 bp mRNA PRI 25-APR-1996 DEFINITION Human fetal Alz-50-reactive clone 1 (FAC1) mRNA, complete cds. ACCESSION U05237 NID g1276427 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2673) AUTHORS Bowser,R., Giambrone,A. and Davies,P. TITLE FAC1, a novel gene identified with the monoclonal antibody Alz50, is developmentally regulated in human brain JOURNAL Dev. Neurosci. 17 (1), 20-37 (1995) MEDLINE 95347245 REFERENCE 2 (bases 1 to 2673) AUTHORS Bowser,R.P. TITLE Direct Submission JOURNAL Submitted (18-JAN-1994) Robert P. Bowser, Dept. Pathology, Albert Einstein College of Medicine, 1300 Morris Park Ave., Bronx, NY 10641, USA FEATURES Location/Qualifiers source 1..2673 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="fetal Alz-50-reactive clone 1" /clone_lib="Clontech HL1065b, fetal brain cDNA library in gt11" /chromosome="17" /tissue_type="brain" /dev_stage="gestational week 22" gene 74..2506 /gene="FAC1" CDS 74..2506 /gene="FAC1" /codon_start=1 /db_xref="PID:g1276428" /translation="MVSEEEEEEDGDAEETQDSEDDEEDEMEEDDDDSDYPEEMEDDD DDASYCTESSFRSHSTYSSTPGRRKPRVHRPRSPILEEKDIPPLEFPKSSEDLMVPNE HIMNVIAIYEVLRNFGTVLRLSPFRFEDFCAALVSQEQCTLMAEMHVVLLKAVLREED TSNTTFGPADLKDSVNSTLYFIDGMTWPEVLRVYCESDKEYHHVLPYQEAEDYPYGPV ENKIKVLQFLVDQFLTTNIAREELMSEGVIQYDDHCRVCHKLGDLLCCETCSAVYHLE CVKPPLEEVPEDEWQCEVCVAHKVPGVTDCVAEIQKNKPYIRHEPIGYDRSRRKYWFL NRRLIIEEDTENENEKKIWYYSTKVQLAELIDCLDKDYWEAELCKILEEMREEIHRHM DITEDLTNKARGSNKSFLAAANEEILESIRAKKGDIDNVKSPEETEKDKNETENDSKD AEKNREEFEDQSLEKDSDDKTPDDDPEQGKSEEPTEVGDKGNSVSANLGDNTTNATSE ETSPSEGRSPVGCLSETPDSSNMAEKKVASELPQDVPEEPNKTCESSNTSATTTSIQP NLENSNSSSELNSSQSESAKAADDPENGERESHTPVSIQEEIVGDFTSEKSTGELSES PGAGKGASGSTRIITRLRNPDSKLSQLKSQQVAAAAHEANKLFKEGKEVLVVNSQGEI SRLSTKKEVIMKGNINNYFKLGQEGKYRVYHNQYSTNSFALNKHQHREDHDKRRHLAH KFCLTPAGEFKWNGSVHGSKVLTISTLRLTITQLETTSLHPSFIPTGHHIGQIGSRQF RCVANPENLHWL" 3'UTR 2507..2653 BASE COUNT 887 a 530 c 678 g 578 t ORIGIN 1 ttctactact actaggccac gcgtcgacta gtacgggggg gggggggggg gggggaggag 61 gaaagaggag gacatggtct ccgaggagga ggaggaggag gacggcgacg ccgaggagac 121 ccaggattct gaggacgacg aggaggatga gatggaagag gacgacgatg actccgatta 181 tccggaggag atggaagacg acgacgacga cgccagttac tgcacggaaa gcagcttcag 241 gagccatagt acctacagca gcactccagg taggcgaaaa ccaagagtac atcggcctcg 301 ttctcctata ttggaagaaa aagacatccc gccccttgaa tttcccaagt cctctgagga 361 tttaatggtg cctaatgagc atataatgaa tgtcattgcc atttacgagg tactgcggaa 421 ctttggcact gttttgagat tatctccttt tcgctttgag gacttttgtg cagctctggt 481 gagccaagag cagtgcacac tcatggcaga gatgcatgtt gtgcttttga aagcagttct 541 gcgtgaagaa gacacttcca atactacctt tggacctgct gatctgaaag atagcgttaa 601 ttccacactg tatttcatag atgggatgac gtggccagag gtgctgcggg tgtactgtga 661 gagtgataag gagtaccatc acgttcttcc ttaccaagag gcagaggact acccatatgg 721 accagtagag aacaagatca aagttctaca gtttctagtc gatcagtttc ttacaacaaa 781 tattgctcga gaggaattga tgtctgaagg ggtgatacag tatgatgacc attgtagggt 841 ttgtcacaaa cttggggatt tgctttgctg tgagacatgt tcagcagtat accatttgga 901 atgtgtgaag ccacctcttg aggaggtgcc agaggacgag tggcagtgtg aagtctgtgt 961 agcacacaag gtgcctggtg tgactgactg tgttgctgaa atccaaaaaa ataaaccata 1021 tattcgacat gaacctattg gatatgatag aagtcggagg aaatactggt tcttgaaccg 1081 aagactcata atagaagaag atacagaaaa tgaaaatgaa aagaaaattt ggtattacag 1141 cacaaaggtc caacttgcag aattaattga ctgtctagac aaagattatt gggaagcaga 1201 actctgcaaa attctagaag aaatgcgtga agaaatccac cgacacatgg acataactga 1261 agacctgacc aataaggctc ggggcagtaa caaatccttt ctggcggcag ctaatgaaga 1321 aattttggaa tccataagag ccaaaaaggg agacattgat aatgttaaaa gcccagaaga 1381 aacagaaaaa gacaagaatg agactgagaa tgactctaaa gatgctgaga aaaacagaga 1441 agaatttgaa gaccagtccc ttgaaaaaga cagtgacgac aaaacaccag atgatgaccc 1501 tgagcaagga aaatctgagg agccaacaga agttggggat aaaggtaact ctgtgtcagc 1561 aaatcttggc gacaacacaa caaatgcaac ttcagaagag actagtccct ctgaagggag 1621 gagccctgtg gggtgtctct cagaaacccc cgatagcagc aacatggcag agaagaaggt 1681 ggcatctgag ctcccccagg atgtgccaga agaacctaac aagacatgtg agagcagtaa 1741 cactagtgct accactacct ccatccagcc taatctggaa aacagtaaca gcagcagtga 1801 actaaattct tcccagagtg aatctgctaa ggcagctgat gatcctgaaa atggagaaag 1861 agaatctcat acacctgtct ctattcagga agagatagta ggtgatttca catcggagaa 1921 gtccaccggg gagctaagtg aatctcctgg agctggaaaa ggagcatctg gctcaactcg 1981 aatcatcacc agattgcgga atccagatag caaacttagt cagctgaaga gccagcaggt 2041 ggcagccgct gcacatgaag caaataaatt atttaaggag ggcaaagagg tactggtagt 2101 taactctcaa ggagaaattt cacggttgag caccaaaaag gaagtgatca tgaaaggaaa 2161 tatcaacaat tattttaaat tgggtcaaga agggaagtat cgcgtctacc acaatcaata 2221 ctccaccaat tcatttgctt tgaataagca ccagcacaga gaagaccatg ataagagaag 2281 gcatcttgca cataagttct gtctgactcc agcaggagag ttcaaatgga acggttctgt 2341 ccatgggtcc aaagttctta ccatatctac tctgagactg actatcaccc aattagaaac 2401 aacatccctt catccttcct tcatcccaac tgggcatcac atagggcaaa ttggatcaag 2461 gcagttcaga tgtgtagcaa acccagagaa tttgcattgg ctttagccat tttggagtgt 2521 gcagttaaac cagttgtgat gctaccaata tggcgagaat ctttaggaca taccaggtta 2581 caccggatga catcaattga aagagaagaa aaggagaaag tcaaaaaaaa aaaaaaaaaa 2641 gagaagaaac aggccggaat tccagctgag cgc // LOCUS HSU05340 1686 bp mRNA PRI 15-JUN-1994 DEFINITION Human p55CDC mRNA, complete cds. ACCESSION U05340 NID g468031 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1686) AUTHORS Weinstein,J., Jacobsen,F.W., Hsu-Chen,J., Wu,T. and Baum,L.G. TITLE A novel mammalian protein, p55CDC, present in dividing cells is associated with protein kinase activity and has homology to the Saccharomyces cerevisiae cell division cycle proteins Cdc20 and Cdc4 JOURNAL Mol. Cell. Biol. 14, 3350-3363 (1994) MEDLINE 94217731 REFERENCE 2 (bases 1 to 1686) AUTHORS Weinstein,J. TITLE Direct Submission JOURNAL Submitted (21-JAN-1994) Jasminder Weinstein, Glycobiology, Amgen Inc, Amgen Center, Thousand Oaks, CA 91320, USA FEATURES Location/Qualifiers source 1..1686 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="p55CDC" /cell_line="HT1080" CDS 111..1610 /codon_start=1 /product="p55CDC" /db_xref="PID:g468032" /translation="MAQFAFESDLHSLLQLDAPIPNAPPARWQRKAKEAAGPAPSPMR AANRSHSAGRTPGRTPGKSSSKVQTTPSKPGGDRYIPHRSAAQMEVASFLLSKENQSE NSQTPTKKEHQKAWALNLNGFDVEEAKILRLSGKPQNAPEGYQNRLKVLYSQKATPGS SRKTCRYIPSLPDRILDAPEIRNDYYLNLVDWSSGNVLAVALDNSVYLWSASSGDILQ LLQMEQPGEYISSVAWIKEGNYLAVGTSSAEVQLWDVQQQKRLRNMTSHSARVGSLSW NSYILSSGSRSGHIHHHDVRVAEHHVATLSGHSQEVCGLRWAPDGRHLASGGNDNLVN VWPSAPGEGGWVPLQTFTQHQGAVKAVAWCPWQSNVLATGGGTSDRHIRIWNVCSGAC LSAVDAHSQVCSILWSPHYKELISGHGFAQNQLVIWKYPTMAKVAELKGHTSRVLSLT MSPDGATVASAAADETLRLWRCFELDPARRREREKASAAKSSLIHQGIR" BASE COUNT 382 a 491 c 470 g 343 t ORIGIN 1 ccacgcgtcc gggcgtaagc caggcgtgtt aaagccggtc ggaactgctc cggagggcac 61 gggctccgta ggcaccaact gcaaggaccc ctccccctgc gggcgctccc atggcacagt 121 tcgcgttcga gagtgacctg cactcgctgc ttcagctgga tgcacccatc cccaatgcac 181 cccctgcgcg ctggcagcgc aaagccaagg aagccgcagg cccggccccc tcacccatgc 241 gggccgccaa ccgatcccac agcgccggca ggactccggg ccgaactcct ggcaaatcca 301 gttccaaggt tcagaccact cctagcaaac ctggcggtga ccgctatatc ccccatcgca 361 gtgctgccca gatggaggtg gccagcttcc tcctgagcaa ggagaaccag tctgaaaaca 421 gccagacgcc caccaagaag gaacatcaga aagcctgggc tttgaacctg aacggttttg 481 atgtagagga agccaagatc cttcggctca gtggaaaacc acaaaatgcg ccagagggtt 541 atcagaacag actgaaagta ctctacagcc aaaaggccac tcctggctcc agccggaaga 601 cctgccgtta cattccttcc ctgccagacc gtatcctgga tgcgcctgaa atccgaaatg 661 actattacct gaaccttgtg gattggagtt ctgggaatgt actggccgtg gcactggaca 721 acagtgtgta cctgtggagt gcaagctctg gtgacatcct gcagcttttg caaatggagc 781 agcctgggga atatatatcc tctgtggcct ggatcaaaga gggcaactac ttggctgtgg 841 gcaccagcag tgctgaggtg cagctatggg atgtgcagca gcagaaacgg cttcgaaata 901 tgaccagtca ctctgcccga gtgggctccc taagctggaa cagctatatc ctgtccagtg 961 gttcacgttc tggccacatc caccaccatg atgttcgggt agcagaacac catgtggcca 1021 cactgagtgg ccacagccag gaagtgtgtg ggctgcgctg ggccccagat ggacgacatt 1081 tggccagtgg tggtaatgat aacttggtca atgtgtggcc tagtgctcct ggagagggtg 1141 gctgggttcc tctgcagaca ttcacccagc atcaaggggc tgtcaaggcc gtagcatggt 1201 gtccctggca gtccaatgtc ctggcaacag gagggggcac cagtgatcga cacattcgca 1261 tctggaatgt gtgctctggg gcctgtctga gtgccgtgga tgcccattcc caggtgtgct 1321 ccatcctctg gtctccccat tacaaggagc tcatctcagg ccatggcttt gcacagaacc 1381 agctagttat ttggaagtac ccaaccatgg ccaaggtggc tgaactcaaa ggtcacacat 1441 cccgggtcct gagtctgacc atgagcccag atggggccac agtggcatcc gcagcagcag 1501 atgagaccct gaggctatgg cgctgttttg agttggaccc tgcgcggcgg cgggagcggg 1561 agaaggccag tgcagccaaa agcagcctca tccaccaagg catccgctga agaccaaccc 1621 atcacctcag ttgtttttta tttttctaat aaagtcatgt ctcccttcat gttttttttt 1681 ttaaaa // LOCUS HSU05569 1112 bp mRNA PRI 25-APR-1996 DEFINITION Human alphaA-crystallin (CRYA1) mRNA, complete cds. ACCESSION U05569 NID g452477 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1112) AUTHORS Jaworski,C.J. TITLE The human alphaA-crystallin gene JOURNAL Thesis (1992) LMDB, NEI, Molecular Structure and Function REFERENCE 2 (bases 1 to 1112) AUTHORS Jaworski,C.J. TITLE A reassessment of mammalian alpha A-crystallin sequences using DNA sequencing: implications for anthropoid affinities of tarsier JOURNAL J. Mol. Evol. 41 (6), 901-908 (1995) MEDLINE 96139023 REFERENCE 3 (bases 1 to 1112) AUTHORS Wistow,G.J. TITLE Direct Submission JOURNAL Submitted (25-JAN-1994) Graeme J. Wistow, Molecular Structure and Function, LMDB, NEI, NIH, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..1112 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="lens" /chromosome="21" gene 68..589 /gene="CRYA1" CDS 68..589 /gene="CRYA1" /function="lens structural protein" /note="This sequence is derived from genomic and cDNA sequence" /citation=[1] /citation=[2] /codon_start=1 /evidence=experimental /product="alphaA-crystallin" /db_xref="PID:g452478" /translation="MDVTIQHPWFKRTLGPFYPSRLFDQFFGEGLFEYDLLPFLSSTI SPYYRQSLFRTVLDSGISEVRSDRDKFVIFLDVKHFSPEDLTVKVQDDFVEIHGKHNE RQDDHGYISREFHRRYRLPSNVDQSALSCSLSADGMLTFCGPKIQTGLDATHAERAIP VSREEKPTSAPSS" BASE COUNT 184 a 396 c 309 g 223 t ORIGIN 1 acactggctg ccagaggccc cgctgactcc tgccagcctc caggtccccg tggtaccaaa 61 gctgaacatg gacgtgacca tccagcaccc ctggttcaag cgcaccctgg ggcccttcta 121 ccccagccgg ctgttcgacc agtttttcgg cgagggcctt tttgagtatg acctgctgcc 181 cttcctgtcg tccaccatca gcccctacta ccgccagtcc ctcttccgca ccgtgctgga 241 ctccggcatc tctgaggttc gatccgaccg ggacaagttc gtcatcttcc tcgatgtgaa 301 gcacttctcc ccggaggacc tcaccgtgaa ggtgcaggac gactttgtgg agatccacgg 361 aaagcacaac gagcgccagg acgaccacgg ctacatttcc cgtgagttcc accgccgcta 421 ccgcctgccg tccaacgtgg accagtcggc cctctcttgc tccctgtctg ccgatggcat 481 gctgaccttc tgtggcccca agatccagac tggcctggat gccacccacg ccgagcgagc 541 catccccgtg tcgcgggagg agaagcccac ctcggctccc tcgtcctaag cagcattgcc 601 tcggctggct cccctggcag ccctggccca tcatgggggg agcaccctga gggcggggta 661 gtctgtcttc gctttgcttc ccttttttcc tttccacctt ctcacatgga atgagggttt 721 gagagagcag ccaggagagc ttagggtctc agggtgtccc agaccccgac accggccagt 781 ggcggaagtg accgcacctc acactccttt agatagcagc ctggctcccc tggggtgcag 841 gcgcctcaac tctgctgagg gtccagaagg agggggtgac ctcctggcca ggtgcctcct 901 gacacacctg cagcctccct ccgcggcggg ccctgcacac ctcctggggc gcgtgaccgc 961 gtggggccgg ggcttctgtg cacctgggct ctcgcggcct cttctctcag accgtcttcc 1021 tccaacccct ctatgtagtg ccgctcttgg ggacatgggt cgcccatgag agcgcagccc 1081 gcggcaatca ataaacagca ggtgatacaa gc // LOCUS HSU05572 3199 bp mRNA PRI 15-JUL-1996 DEFINITION Human lysosomal alpha-mannosidase (MANB) mRNA, complete cds. ACCESSION U05572 NID g1419373 KEYWORDS MANB; alpha-mannosidosis; lysosomal storage disease. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3199) AUTHORS Nebes,V.L. and Schmidt,M.C. TITLE Human lysosomal alpha-mannosidase: isolation and nucleotide sequence of the full-length cDNA JOURNAL Biochem. Biophys. Res. Commun. 200 (1), 239-245 (1994) MEDLINE 94220092 REFERENCE 2 (bases 1 to 3199) AUTHORS Emiliani,C., Martino,S., Stirling,J.L., Maras,B. and Orlacchio,A. TITLE Partial sequence of the purified protein confirms the identity of cDNA coding for human lysosomal alpha-mannosidase B JOURNAL Biochem. J. 305 (Pt 2), 363-366 (1995) MEDLINE 95134211 REFERENCE 3 (bases 1 to 3199) AUTHORS Nebes,V.L. TITLE Direct Submission JOURNAL Submitted (25-JAN-1994) Vicki L. Nebes, Thyroid Eye Disease Laboratories, Allegheny-Singer Research Institute, 320 East North Ave., Level 02 South Tower, Pittsburgh, PA 15212, USA FEATURES Location/Qualifiers source 1..3199 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Stratagene lambda Zap II human muscle and retina cDNA libraries" /clone="pHAM31, pHAM18, pHAM20" gene 103..3066 /gene="MANB" CDS 103..3066 /gene="MANB" /EC_number="3.2.1.24" /codon_start=1 /product="alpha-mannosidase" /db_xref="PID:g1419374" /translation="MSRALRPPLPPLCFFLLLLAAAGARAGGYETCPTVQPNMLNVHL LPHTHDDVGWLKTVDQYFYGIKNDIQHAGVQYILDSVISALLADPTRRFIYVEIAFFS RWWHQQTNATQEVVRDLVRQGRLEFANGGWVMNDEAATHYGAIVDQMTLGLRFLEDTF GNDGRPRVAWHIDPFGHSREQASLFAQMGFDGFFFGRLDYQDKWVRMQKLEMEQVWRA STSLKPPTADLFTGVLPNGYNPPRNLCWDVLCVDQPLVEDPRSPEYNAKELVDYFLNV ATAQGRYYRTNHTVMTMGSDFQYENANMWFKNLDKLIRLVNAQAKGSSVHVLYSTPAC YLWELNKANLTWSVKHDDFFPYADGPHQFWTGYFSSRPALKRYERLSYNFLQVCNQLE ALVGLAANVGPYGSGDSAPLNEAMAVLQHHDAVSGTSRQHVANDYARQLAAGWGPCEV LLSNALARLRGFKDHFTFCQQLNISICPLSQTAARFQVIVYNPLGRKVNWMVRLPVSE GVFVVKDPNGRTVPSDVVIFPSSDSQAHPPELLFSASLPALGFSTYSVAQVPRWKPQA RAPQPIPRRSWSPALTIENEHIRATFDPDTGLLMEIMNMNQQLLLPVRQTFFWYNASI GDNESDQASGAYIFRPNQQKPLPVSRWAQIHLVKTPLVQEVHQNFSAWCSQVVRLYPG QRHLELEWSVGPIPVGDTWGKEVISRFDTPLETKGRFYTDSNGREILERRRDYRPTWK LNQTEPVAGNYYPVNTRIYITDGNMQLTVLTDRSQGGSSLRDGSLELMVHRRLLKDDG RGVSEPLMENGSGAWVRGRHLVLLDTAQAAAAGHRLLAEQEVLAPQVVLAPGGGAAYN LGAPPRTQFSGLRRDLPPSVHLLTLASWGPEMVLLRLEHQFAVGEDSGRNLSAPVTLN LRDLFSTFTITRLQETTLVANQLREAASRLKWTTNTGPTPHQTPYQLDPANITLEPME IRTFLASVQWKEVDG" BASE COUNT 660 a 993 c 941 g 605 t ORIGIN 1 ggccgcggaa ccccaggagg aagctgctga gccatgggcg cctacgcgcg ggcttcgggg 61 gtctgcgctc gcggctgcct ggactcagca ggcccctgga ccatgtcccg cgccctgcgg 121 ccaccgctcc cgcctctctg ctttttcctt ttgttgctgg cggctgccgg tgctcgggcc 181 gggggatacg agacatgccc cacagtgcag ccgaacatgc tgaacgtgca cctgctgcct 241 cacacacatg atgacgtggg ctggctcaaa accgtggacc agtactttta tggaatcaag 301 aatgacatcc agcacgccgg tgtgcagtac atcctggact cggtcatctc tgccttgctg 361 gcagatccca cccgtcgctt catttacgtg gagattgcct tcttctcccg ttggtggcac 421 cagcagacaa atgccacaca ggaagtcgtg cgagaccttg tgcgccaggg gcgcctggag 481 ttcgccaatg gtggctgggt gatgaacgat gaggcagcca cccactacgg tgccatcgtg 541 gaccagatga cacttgggct gcgctttctg gaggacacat ttggcaatga tgggcgaccc 601 cgtgtggcct ggcacattga ccccttcggc cactctcggg agcaggcctc gctgtttgcg 661 cagatgggct tcgacggctt cttctttggg cgccttgatt atcaagataa gtgggtacgg 721 atgcagaagc tggagatgga gcaggtgtgg cgggccagca ccagcctgaa gcccccgacc 781 gcggacctct tcactggtgt gcttcccaat ggttacaacc cgccaaggaa tctgtgctgg 841 gatgtgctgt gtgtcgatca gccgctggtg gaggaccctc gcagccccga gtacaacgcc 901 aaggagctgg tcgattactt cctaaatgtg gccactgccc agggccggta ttaccgcacc 961 aaccacactg tgatgaccat gggctcggac ttccaatatg agaatgccaa catgtggttc 1021 aagaaccttg acaagctcat ccggctggta aatgcgcagg caaaaggaag cagtgtccat 1081 gttctctact ccacccccgc ttgttacctc tgggagctga acaaggccaa cctcacctgg 1141 tcagtgaaac atgacgactt cttcccttac gcggatggcc cccaccagtt ctggaccggt 1201 tacttttcca gtcggccggc cctcaaacgc tacgagcgcc tcagctacaa cttcctgcag 1261 gtgtgcaacc agctggaggc gctggtgggc ctggcggcca acgtgggacc ctatggctcc 1321 ggagacagtg cacccctcaa tgaggcgatg gctgtgctcc agcatcacga cgccgtcagc 1381 ggcacctccc gccagcacgt ggccaacgac tacgcgcgcc agcttgcggc aggctggggg 1441 ccttgcgagg ttcttctgag caacgcgctg gcgcggctca gaggcttcaa agatcacttc 1501 accttttgcc aacagctaaa catcagcatc tgcccgctca gccagacggc ggcgcgcttc 1561 caggtcatcg tttataatcc cctggggcgg aaggtgaatt ggatggtacg gctgccggtc 1621 agcgaaggcg ttttcgttgt gaaggacccc aatggcagga cagtgcccag cgatgtggta 1681 atatttccca gctcagacag ccaggcgcac cctccggagc tgctgttctc agcctcactg 1741 cccgccctgg gcttcagcac ctattcagta gcccaggtgc ctcgctggaa gccccaggcc 1801 cgcgcaccac agcccatccc cagaagatcc tggtcccctg ctttaaccat cgaaaatgag 1861 cacatccggg caacgtttga tcctgacaca gggctgttga tggagattat gaacatgaat 1921 cagcaactcc tgctgcctgt tcgccagacc ttcttctggt acaacgccag tataggtgac 1981 aacgaaagtg accaggcctc aggtgcctac atcttcagac ccaaccaaca gaaaccgctg 2041 cctgtgagcc gctgggctca gatccacctg gtgaagacac ccttggtgca ggaggtgcac 2101 cagaacttct cagcttggtg ttcccaggtg gttcgcctgt acccaggaca gcggcacctg 2161 gagctagagt ggtcggtggg gccgatacct gtgggcgaca cctgggggaa ggaggtcatc 2221 agccgttttg acacaccgct ggagacaaag ggacgcttct acacagacag caatggccgg 2281 gagatccttg agaggaggcg ggattatcga cccacctgga aactgaacca gacggagccc 2341 gtggcaggaa actactatcc agtcaacacc cggatttaca tcacggatgg aaacatgcag 2401 ctgactgtgc tgactgaccg ctcccagggg ggcagcagcc tgagagatgg ctcgctggag 2461 ctcatggtgc accgaaggct gctgaaggac gatggacgcg gagtatcgga gccactaatg 2521 gagaacgggt cgggggcgtg ggtgcgaggg cgccacctgg tgctgctgga cacagcccag 2581 gctgcagccg ccggacaccg gctcctggcg gagcaggagg tcctggcccc tcaggtggtg 2641 ctggccccgg gtggcggcgc cgcctacaat ctcggggctc ctccgcgcac gcagttctca 2701 gggctgcgca gggacctgcc gccctcggtg cacctgctca cgctggccag ctggggcccc 2761 gaaatggtgc tgctgcgctt ggagcaccag tttgccgtag gagaggattc cggacgtaac 2821 ctgagcgccc ccgttacctt gaacttgagg gacctgttct ccaccttcac catcacccgc 2881 ctgcaggaga ccacgctggt ggccaaccag ctccgcgagg cagcctccag gctcaagtgg 2941 acaacaaaca caggccccac accccaccaa actccgtacc agctggaccc ggccaacatc 3001 acgctggaac ccatggaaat ccgcactttc ctggcctcag ttcaatggaa ggaggtggat 3061 ggttaggtct gctgggatgg gccctccaag cccaagcctc ctgctccggg ggcagaccag 3121 actctgactc tcctcttggg gctgctgcca ttaaaacgct actactaaga aaaaaaaaaa 3181 aaaaaaaaaa aaggaattc // LOCUS HSU05596 3944 bp mRNA PRI 22-OCT-1994 DEFINITION Human anion exchanger 3 brain isoform (bAE3) mRNA, complete cds. ACCESSION U05596 NID g476221 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3944) AUTHORS Yannoukakos,D., Stuart-Tilley,A., Fernandez,H., Fey,P., Duyk,G. and Alper,S. TITLE Molecular cloning, expression, and chromosomal localization of two isoforms of the AE3 anion exchanger from human heart JOURNAL Circ. Res. 75, 603-614 (1994) MEDLINE 95008042 REFERENCE 2 (bases 1 to 3944) AUTHORS Yannoukakos,D. TITLE Direct Submission JOURNAL Submitted (26-JAN-1994) Drakoulis Yannoukakos, Molecular Medicine, Beth Israel Hospital, 330 Brookline Ave, Boston, MA 02215, USA FEATURES Location/Qualifiers source 1..3944 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="lZhbAE3" /tissue_type="heart" /dev_stage="adult" CDS 12..3710 /codon_start=1 /product="anion exchanger 3 brain isoform" /db_xref="PID:g476222" /translation="MANGVIPPPGGASPLPQVRVPLEEPPLSPDVEEEDDDLGKTLAV SRFGDLISKPPAWDPEKPSRSYSERDFEFHRHTSHHTHHPLSARLPPPHKLRRLPPTS ARHTRRKRKKEKTSAPPSEGTPPIQEEGGAGVDEEEEEEEEEEGESEAEPVEPPPSGT PQKAKFSIGSDEDDSPGLPGRAAVTKPLPSVGPHTDKSPQHSSSSPSPRARASRLAGE KSRPWSPSASYDLRERLCPGSALGNPGGPEQQVPTDEAEAQMLGSADLDDMKSHRLED NPGVRRHLVKKPSRTQGGRGSPSGLAPILRRKKKKKKLDRRPHEVFVELNELMLDRSQ EPHWRETARWIKFEEDVEEETERWGKPHVASLSFRSLLELRRTIAHGAALLDLEQTTL PGIAHLVVETMIVSDQIRPEDRASVLRTLLLKHSHPNDDKDSGFFPRNPSSSSMNSVL GNHHPTPSHGPDGAVPTMADDLGEPAPLWPHDPDAKEKPLHMPGGDGHRGKSLKLLEK IPEDAEATVVLVGCVPFLEQPAAAFVRLNEAVLLESVLEVPVPVRFLFVMLGPSHTST DYHELGRSIATLMSDKLFHEAAYQADDRQDLLSAISEFLDGSIVIPPSEVEGRDLLRS VAAFQRELLRKRREREQTKVEMTTRGGYTAPGKELSLELGGSEATPEDDPLLRTGSVF GGLVRDVRRRYPHYPSDLRDALHSQCVAAVLFIYFAALSPAITFGGLLGEKTEGLMGV SELIVSTAVLGVLFSLLGAQPLLVVGFSGPLLVFEEAFFKFCRAQDLEYLTGRVWVGL WLVVFVLALVAAEGSFLVRYISPFTQEIFAFLISLIFIYETFYKLYKVFTEHPLLPFY PPEGALEGSLAAGLEPNGSALPPTEGPPSPRNQPNTALLSLILMLGTFFIAFFLRKFR NSRFLGGKARRIIGDFGIPISILVMVLVDYSITDTYTQKLTVPTGLSVTSPDKRSWFI PPLGSARPFPPWMMVAAAVPALLVLILIFMETQITALIVSQKARRLLKGSGFHLDLLL IGSLGGLCGLFGLPWLTAATVRSVTHVNALTVMRTAIAPGDKPQIQEVREQRVTGVLI ASLVGLSIVMGAVLRRIPLAVLFGIFLYMGVTSLSGIQLSQRLLLILMPAKHHPEQPY VTKVKTWRMHLFTCIQLGCIALLWVVKSTAASLAFPFLLLLTVPLRHCLLPRLFQDRE LQALDSEDAEPNFDEDGQDEYNELHMPV" BASE COUNT 717 a 1270 c 1194 g 763 t ORIGIN 1 cctacctggc catggccaac ggagtgatcc cgccgcccgg gggcgcctcc cccctacccc 61 aggtccgggt gcccttggag gagccccctc taagtccaga cgtggaggag gaggacgatg 121 acttgggcaa gaccttggct gtgagcaggt ttggggacct catcagcaag cccccggcct 181 gggaccccga gaagcccagc cgcagctaca gcgagcggga ctttgagttt caccggcaca 241 catcccacca cacccaccac ccgctctcag cgcgcctgcc tccaccccac aagctgcggc 301 ggctgccccc cacctctgcc cggcacacca ggagaaagag gaagaaggag aaaacctctg 361 ctcctccctc cgaggggacc cctcccatcc aggaggaggg gggagctgga gtggatgagg 421 aagaggagga agaggaggaa gaggaaggag aatctgaggc agaacctgtg gagccccccc 481 cctcagggac cccacagaag gcaaagttct ccattggaag tgacgaggat gacagtccag 541 gcctccctgg gagggctgct gtcaccaagc ccctgccctc ggtgggccca cacactgaca 601 agagccccca gcactccagc agctccccca gcccccgggc ccgggcctcc cgactcgctg 661 gggagaaaag ccggccctgg agcccatcgg ccagttatga cctgcgggag cgactgtgcc 721 caggcagtgc cctgggcaac ccaggtggtc cagagcagca ggtgcccaca gatgaggcgg 781 aggcccagat gctgggttct gcagacctgg acgacatgaa gagtcaccga ctggaggaca 841 accctggtgt gcggcgacac ttagtgaaaa agccctcccg gacgcagggc gggaggggca 901 gtcccagcgg cctggccccc atccttcgca ggaagaagaa gaagaaaaag ctggaccgga 961 ggcctcatga ggtgttcgtg gagctgaacg agctgatgct ggaccgcagc caggagcccc 1021 actggcggga gacggcccgc tggatcaagt ttgaggagga cgtggaggag gagacggagc 1081 gctgggggaa gccccatgtt gcctcgctct ccttccgtag ccttctggag ctcaggagga 1141 ccatcgccca tggagctgcc ctcctggacc tggagcaaac caccctgcca ggcattgcac 1201 acctcgtggt ggagaccatg attgtgtctg accagatccg gccggaggac agggccagcg 1261 tcctacgtac cctgctactg aagcacagcc atcccaacga tgacaaggac agtggcttct 1321 ttccccgaaa cccatcgagc tccagcatga actcggttct ggggaatcat cacccaactc 1381 ccagccatgg ccctgatggg gcggtgccta ccatggctga tgacctgggg gagccagccc 1441 cactctggcc acatgaccct gacgccaagg agaagcccct ccacatgcct gggggagatg 1501 gtcaccgggg gaaaagcctg aagctgctgg agaagatccc tgaagatgct gaggccacgg 1561 ttgtgcttgt gggttgtgtg cctttcttgg agcagcctgc agcagccttc gtgcgtctga 1621 atgaggctgt actcctggag tctgtgcttg aggtccctgt cccggtccgc ttcctcttcg 1681 tgatgctggg gcccagccac accagcactg actatcacga gcttgggcgc tccattgcca 1741 cccttatgtc tgacaagctg tttcatgagg ctgcctacca ggcagatgac cggcaagacc 1801 tcctaagtgc catcagcgag ttcctggatg gcagcattgt gatccccccg tccgaggtgg 1861 agggccgtga cctgctgcgc tccgtggctg ctttccagcg agagctgctt aggaagcggc 1921 gagagcgtga acagaccaaa gtcgagatga ccacacgggg tggctacacg gcccctggga 1981 aagaactgtc tttggagttg gggggctctg aggcgacccc tgaagatgac cccttgctgc 2041 ggacgggctc ggtatttggg gggcttgtgc gggatgtgag gcgccggtac ccgcactacc 2101 ccagtgacct gcgagatgcg ctgcactccc agtgtgtggc cgctgtgctc ttcatctact 2161 tcgcagccct cagccctgcc atcaccttcg gggggctgct gggagagaag accgaggggc 2221 tgatgggcgt gtccgagctg atcgtgtcca ccgctgtgct cggcgtcctc ttctctctgc 2281 tgggagctca gccgctgctt gtggttggct tctctgggcc gctgcttgtg tttgaggaag 2341 ccttcttcaa gttctgccga gcccaggacc tggaatacct cactggccgg gtgtgggttg 2401 gtctctggct ggtggtcttc gtccttgccc tggtggccgc cgaaggcagc ttcctggtcc 2461 gctacatctc gcctttcacc caggagatct ttgcctttct catctcactc attttcatct 2521 acgagacctt ctacaagctc tacaaggtgt tcacagagca cccactgctg ccgttctacc 2581 cccctgaggg ggccctggag gggtccctgg ctgctggtct ggagccaaat ggcagtgccc 2641 tgccccccac cgagggcccc cccagcccga ggaaccagcc caatacggca ctgctctcac 2701 tcatcctcat gctcgggacc ttcttcatag ccttcttcct gcgcaagttc aggaacagcc 2761 gcttcctggg gggcaaggct cgtcgcatca tcggggactt tggcatcccc atctccatcc 2821 tggtgatggt cctggtggat tactccatca cagacaccta cacgcagaag ctgacagtgc 2881 ctacagggct ctcagtgacc tctcccgata agcgctcgtg gttcatccca cccctgggga 2941 gtgcccgtcc tttcccgccg tggatgatgg tggcagccgc tgttcccgcc ctcctcgtcc 3001 tcatcctgat cttcatggag acacagatca cggcgcttat cgtcagccag aaggcgcgga 3061 ggctgctcaa gggctccggt ttccacctgg acctgctcct cattggctcc ctgggggggc 3121 tctgtgggct gtttgggttg ccctggctca cggctgccac ggtccgctcc gtcacccatg 3181 tcaatgcgtt gacagtgatg cgtactgcca tcgcgcctgg tgacaagccc cagatccagg 3241 aggtgcggga gcagcgggtc actggtgtgc tcatcgccag cctcgtgggc ctgtccatcg 3301 tcatgggggc tgtgctgcgt cggatcccat tggctgtgct ctttgggatc ttcctgtaca 3361 tgggggtcac gtccctgtct ggtatccagc tgtcccagcg tctgttgctc atcctcatgc 3421 cggcaaaaca ccatcctgag cagccctatg tgaccaaggt gaagacgtgg cggatgcatc 3481 tgttcacctg catccagctg ggctgcatcg cactgctctg ggtggtcaag tccacggcgg 3541 cctcactcgc ctttcccttc ctgctgctgc tcacggtgcc tctgaggcat tgccttctgc 3601 cccggctctt ccaggacagg gagctgcagg cgctggactc ggaagatgct gaaccaaact 3661 tcgatgagga tggccaggat gagtacaatg agctgcacat gccagtgtga ccctgaagac 3721 agtgcccctc agagacccca agaccttagg gattgacacc tgggcctcag gcagagccca 3781 gccctgggct ggggggctcc tcaggaccta gagatgtgcc tggaaccact cctgatgcca 3841 tggctagagt ggcccccctg acttctgccc ggggtgttga cctcgcctca cctttcacag 3901 accagaccgg cacaggcttt gagctcatta taaccacact cctg // LOCUS HSU05598 1219 bp mRNA PRI 20-AUG-1994 DEFINITION Human dihydrodiol dehydrogenase mRNA, complete cds. ACCESSION U05598 NID g531159 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1219) AUTHORS Ciaccio,P.J. and Tew,K.D. TITLE cDNA and deduced amino acid sequences of a human colon dihydrodiol dehydrogenase JOURNAL Biochim. Biophys. Acta 1186 (1-2), 129-132 (1994) MEDLINE 94281244 REFERENCE 2 (bases 1 to 1219) AUTHORS Ciaccio,P.J. TITLE Direct Submission JOURNAL Submitted (26-JAN-1994) Paul J. Ciaccio, The Fox Chase Cancer Center, 7701 Burholme Ave., Philadelphia, PA 19111, USA FEATURES Location/Qualifiers source 1..1219 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="c81" /clone_lib="Uni-Zap" /cell_line="HT29" /tissue_type="colon" CDS 24..995 /note="oxidoreductase, cytosolic protein" /codon_start=1 /function="bile acid binder" /product="dihydrodiol dehydrogenase" /db_xref="PID:g531160" /translation="MDSKYQCVKLNDGHFMPVLGFGTYAPAEVPKSKALEAVKLAIEA GFHHIDSAHVYNNEEQVGLAIRSKIADGSVKREDIFYTSKLWSNSHRPELVRPALERS LKNLQLDYVDLYLIHFPVSVKPGEEVIPKDENGKILFDTVDLCATWEAMEKCKDAGLA KSIGVSNFNHRLLEMILNKPGLKYKPVCNQVECHPYFNQRKLLDFCKSKDIVLVAYSA LGSHREEPWVDPNSPVLLEDPVLCALAKKHKRTPALIALRYQLQRGVVVLAKSYNEQR IRQNVQVFEFQLTSEEMKAIDGLNRNVRYLTLDIFAGPPNYPFSDEY" BASE COUNT 337 a 276 c 303 g 303 t ORIGIN 1 tgctaaccag gccagtgaca gaaatggatt cgaaatacca gtgtgtgaag ctgaatgatg 61 gtcacttcat gcctgtcctg ggatttggca cctatgcgcc tgcagaggtt cctaaaagta 121 aagctctaga ggccgtcaaa ttggcaatag aagccgggtt ccaccatatt gattctgcac 181 atgtttacaa taatgaggag caggttggac tggccatccg aagcaagatt gcagatggca 241 gtgtgaagag agaagacata ttctacactt caaagctttg gagcaattcc catcgaccag 301 agttggtccg accagccttg gaaaggtcac tgaaaaatct tcaattggac tatgttgacc 361 tctatcttat tcattttcca gtgtctgtaa agccaggtga ggaagtgatc ccaaaagatg 421 aaaatggaaa aatactattt gacacagtgg atctctgtgc cacgtgggag gccatggaga 481 agtgtaaaga tgcaggattg gccaagtcca tcggggtgtc caacttcaac cacaggctgc 541 tggagatgat cctcaacaag ccagggctca agtacaagcc tgtctgcaac caggtggaat 601 gtcatcctta cttcaaccag agaaaactgc tggatttctg caagtcaaaa gacattgttc 661 tggttgccta tagtgctctg ggatcccatc gagaagaacc atgggtggac ccgaactccc 721 cggtgctctt ggaggaccca gtcctttgtg ccttggcaaa aaagcacaag cgaaccccag 781 ccctgattgc cctgcgctac cagctgcagc gtggggttgt ggtcctggcc aagagctaca 841 atgagcagcg catcagacag aacgtgcagg tgtttgaatt ccagttgact tcagaggaga 901 tgaaagccat agatggccta aacagaaatg tgcgatattt gacccttgat atttttgctg 961 gcccccctaa ttatccattt tctgatgaat attaacatgg agggcattgc atgaggtctg 1021 ccagaaggcc ctgcgtgtgg atggtgacac agaggatggc tctatgctgg tgactggaca 1081 catcgcctct ggttaaatct ctcctgcttg gcgacttcag taagctacag ctaagcccat 1141 cggccggaaa agaaagacaa taattttgtt ttttcatttt gaaaaaatta aatgctctct 1201 cctaaagatt cttcaccta // LOCUS HSU05659 1134 bp mRNA PRI 20-AUG-1994 DEFINITION Human 17beta-hydroxysteroid dehydrogenase type 3 mRNA, complete cds. ACCESSION U05659 NID g531161 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1134) AUTHORS Geissler,W.M., Davis,D.L., Wu,L., Bradshaw,K.D., Patel,S., Mendonca,B.B., Elliston,K.O., Wilson,J.D., Russell,D.W. and Andersson,S. TITLE Male pseudohermaphroditism caused by mutations of testicular 17 beta-hydroxysteroid dehydrogenase 3 JOURNAL Nature Genet. 7 (1), 34-39 (1994) MEDLINE 94355972 REFERENCE 2 (bases 1 to 1134) AUTHORS Elliston,K.O. TITLE Direct Submission JOURNAL Submitted (26-JAN-1994) Keith O. Elliston, Bioinformatics, Merck Research Laboratories, Box 2000, Rahway, NJ 07065, USA FEATURES Location/Qualifiers source 1..1134 /organism="Homo sapiens" /db_xref="taxon:9606" /map="9q22" /chromosome="9" /sex="male" /tissue_type="testes" exon 1..202 CDS 49..981 /codon_start=1 /product="17beta-hydroxysteroid dehydrogenase type 3" /db_xref="PID:g531162" /translation="MGDVLEQFFILTGLLVCLACLAKCVRFSRCVLLNYWKVLPKSFL RSMGQWAVITGAGDGIGKAYSFELAKRGLNVVLISRTLEKLEAIATEIERTTGRSVKI IQADFTKDDIYEHIKEKLAGLEIGILVNNVGMLPNLLPSHFLNAPDEIQSLIHCNITS VVKMTQLILKHMESRQKGLILNISSGIALFPWPLYSMYSASKAFVCAFSKALQEEYKA KEVIIQVLTPYAVSTAMTKYLNTNVITKTADEFVKESLNYVTIGGETCGCLAHEILAG FLSLIPAWAFYSGAFQRLLLTHYVAYLKLNTKVR" exon 203..249 exon 250..325 exon 326..433 exon 434..501 exon 502..537 exon 538..572 exon 573..654 exon 655..720 exon 721..870 exon 871..1134 polyA_site 1134 BASE COUNT 308 a 271 c 284 g 271 t ORIGIN 1 tacacagaga gccacggcca gggctgaaac agtctgttga gtgcagccat gggggacgtc 61 ctggaacagt tcttcatcct cacagggctg ctggtgtgcc tggcctgcct ggcgaagtgc 121 gtgagattct ccagatgtgt tttactgaac tactggaaag ttttgccaaa gtctttcttg 181 cggtcaatgg gacagtgggc agtgatcact ggagcaggcg atggaattgg gaaagcgtac 241 tcgttcgagc tagcaaaacg tggactcaat gttgtcctta ttagccggac gctggaaaaa 301 ctagaggcca ttgccacaga gatcgagcgg actacaggga ggagtgtgaa gattatacaa 361 gcagatttta caaaagatga catctacgag catattaaag aaaaacttgc aggcttagaa 421 attggaattt tagtcaacaa tgtcggaatg cttccaaacc ttctcccaag ccatttcctg 481 aacgcaccgg atgaaatcca gagcctcatc cattgtaaca tcacctccgt agtcaagatg 541 acacagctaa ttctgaaaca tatggaatca aggcagaaag gtctcatcct gaacatttct 601 tctgggatag ccctgtttcc ttggcctctc tactccatgt actcagcttc caaggcgttt 661 gtgtgcgcat tttccaaggc cctgcaagag gaatataaag caaaagaagt catcatccag 721 gtgctgaccc catatgctgt ctcgactgca atgacaaagt atctaaatac aaatgtgata 781 accaagactg ctgatgagtt tgtcaaagag tcattgaatt atgtcacaat tggaggtgaa 841 acctgtggct gccttgccca tgaaatcttg gcgggctttc tgagcctgat cccggcctgg 901 gccttctaca gcggtgcctt ccaaaggctg ctcctgacac actatgtggc atacctgaag 961 ctcaacacca aggtcaggta gccaggcggt gaggagtcca gcacaacctt ttcctcacca 1021 gtcccatgct ggctgaagag gaccagagga gcagaccagc acttcaacct agtccgctga 1081 agatggaggg ggctggggtc acagaggcat agaatacaca ttttttgcca cttt // LOCUS HSU05875 2214 bp mRNA PRI 25-MAR-1994 DEFINITION Human clone pSK1 interferon gamma receptor accessory factor-1 (AF-1) mRNA, complete cds. ACCESSION U05875 NID g463549 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2214) AUTHORS Soh,J., Donnelly,R.J., Kotenko,S., Mariano,T.M., Cook,J.R., Wang,N., Emanuel,S.L., Schwartz,B., Miki,T. and Pestka,S. TITLE Identification and sequence of an accessory factor required for activation of the human interferon gamma receptor JOURNAL Cell 76, 793-802 (1994) MEDLINE 94170380 REFERENCE 2 (bases 1 to 2214) AUTHORS Pestka,S. TITLE Direct Submission JOURNAL Submitted (01-FEB-1994) Sidney Pestka, UMDNJ-Robert Wood Johnson Medical School, 675 Hoes Lane, Piscataway, NJ 08854-5635, USA FEATURES Location/Qualifiers source 1..2214 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pSK1" /cell_line="M426 cells" /cell_type="lung cells" /tissue_type="lung fibroblasts" /dev_stage="embryo" 5'UTR 1..648 sig_peptide 649..729 CDS 649..1662 /standard_name="interferon gamma receptor accessory factor-1" /note="second chain of the receptor" /codon_start=1 /function="signal transduction" /product="AF-1" /db_xref="PID:g463550" /translation="MRPTLLWSLLLLLGVFAAAAAAPPDPLSQLPAPQHPKIRLYNAE QVLSWEPVALSNSTRPVVYRVQFKYTDSKWFTADIMSIGVNCTQITATECDFTAASPS AGFPMDFNVTLRLRAELGALHSAWVTMPWFQHYRNVTVGPPENIEVTPGEGSLIIRFS SPFDIADTSTAFFCYYVHYWEKGGIQQVKGPFRSNSISLDNLKPSRVYCLQVQAQLLW NKSNIFRVGHLSNISCYETMADASTELQQVILISVGTFSLLSVLAGACFFLVLKYRGL IKYWFHTPPSIPLQIEEYLKDPTQPILEALDKDSSPKDDVWDSVSIISFPEKEQEDVL QTL" mat_peptide 730..1659 3'UTR 1663..2214 polyA_site 2214 /note="39 A residues" BASE COUNT 494 a 595 c 628 g 497 t ORIGIN 1 gttgactgga ggcggaggtt gcagtgagcc gagatcgccc cactgcactc cagcctggtg 61 actccgtctc aaaaaaaagg ggaggggggc gggggagagt tgaaagctta atatgtactt 121 tgggggctat taaagcaaac atttcgacta aaggggcgaa tcctcgaatt gtgcgatcaa 181 gcacccgaga ggagagttgg ggggggtcag gaggggtggg ggctccaggg aacgcccggg 241 ggcctgggcc ggggtctcgc ggggcccttc cggaaggatc gcggcccccg aaggtgggcg 301 tcccgcgggg ctccagtctc caggacgttc cgggaggctc cgcgctctgg gaggccggct 361 gcgtggggtc cccgcgctgc agccgcagag gccccccagg gccgcggttc ccggagcggg 421 aaagtcccgc gcgggggcgg tggcctcggg ggcgggacgg ggcgggggcg ggggcgcggg 481 cggccgagcc gaatcccctc caccgggacg ccccgctgcc gctcgggaag aggcgggccc 541 tgcgcgccct gcgctcgcca tggcggtttg ggcggcgacg tgagcggctc cgcggacccc 601 gagcggggcc ccggccgcga cctgagccgc cgccgagcgc ccggggccat gcgaccgacg 661 ctgctgtggt cgctgctgct gctgctcgga gtcttcgccg ccgccgccgc ggccccgcca 721 gaccctcttt cccagctgcc cgctcctcag cacccgaaga ttcgcctgta caacgcagag 781 caggtcctga gttgggagcc agtggccctg agcaatagca cgaggcctgt tgtctaccga 841 gtgcagttta aatacaccga cagtaaatgg ttcacggccg acatcatgtc cataggggtg 901 aattgtacac agatcacagc aacagagtgt gacttcactg ccgccagtcc ctcagcaggc 961 ttcccaatgg atttcaatgt cactctacgc cttcgagctg agctgggagc actccattct 1021 gcctgggtga caatgccttg gtttcaacac tatcggaatg tgactgtcgg gcctccagaa 1081 aacattgagg tgaccccagg agaaggctcc ctcatcatca ggttctcctc tccctttgac 1141 atcgctgata cctccacggc ctttttttgt tattatgtcc attactggga aaaaggagga 1201 atccaacagg tcaaaggccc tttcagaagc aactccattt cattggataa cttaaaaccc 1261 tccagagtgt actgtttaca agtccaggca caactgcttt ggaacaaaag taacatcttt 1321 agagtcgggc atttaagcaa catatcttgc tacgaaacaa tggcagatgc ctccactgag 1381 cttcagcaag tcatcctgat ctccgtggga acattttcgt tgctgtcggt gctggcagga 1441 gcctgtttct tcctggtcct gaaatataga ggcctgatta aatactggtt tcacactcca 1501 ccaagcatcc cattacagat agaagagtat ttaaaagacc caactcagcc catcttagag 1561 gccttggaca aggacagctc accaaaggat gacgtctggg actctgtgtc cattatctcg 1621 tttccggaaa aggagcaaga agatgttctc caaacgcttt gaaccaaagc atgggcctag 1681 cccactggct ccctggaaga gatcaagcca tcggagctgc tagagttctg tctggacttt 1741 ccagagacca gtattccctt ttgctgcctc taaaaggcct gtccctgcag acatgagaga 1801 cagcaggtct catgggggtg acaagctttt tttttttttt cttaaagaat tttcaaaatc 1861 aaattccaga atgattttac ggagatatcc caggaaaatt aaggcttctc ttaaacacta 1921 aaaaggcatg taattgcttg ttagcaaaat ggatatgaca catctctgat acttttttca 1981 ttattggttg ggctgagcag tcagaagacc tggtcgtcgt cttgactttg gcaaatgagc 2041 cggagcccct tgggcaggtc acacaacctg tcccagcgag ggacactgag tggcccttca 2101 tgtacatcca tggtgtgctg gcttaaaatg taattaatct tgtaaatata ctcctagtaa 2161 tttaagattt tgtttttaaa ctggaaataa aagattgtat agtgcatgtt tttt // LOCUS HSU06117 4179 bp mRNA PRI 09-SEP-1995 DEFINITION Human xanthine dehydrogenase (XDH) mRNA, complete cds. ACCESSION U06117 NID g984266 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4179) AUTHORS Xu,P., Huecksteadt,T.P., Harrison,R. and Hoidal,J.R. TITLE Molecular cloning, tissue expression of human xanthine dehydrogenase JOURNAL Biochem. Biophys. Res. Commun. 199 (2), 998-1004 (1994) MEDLINE 94183289 REFERENCE 2 (bases 1 to 4179) AUTHORS Hoidal,J.R. TITLE Direct Submission JOURNAL Submitted (03-FEB-1994) John R. Hoidal, Department of Medicine, University of Utah Medical Center, Wintrobe Building, Room 743A, Salt Lake City, Utah 84132, USA FEATURES Location/Qualifiers source 1..4179 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" gene 64..4065 /gene="XDH" CDS 64..4065 /gene="XDH" /EC_number="1.1.1.204" /codon_start=1 /product="xanthine dehydrogenase" /db_xref="PID:g984267" /translation="MTADKLVFFVNGRKVVEKNADPETTLLAYLRRKLGLSGTKLGCG EGGCGACTVMLSKYDRLQNKIVHFSANACLAPICSLHHVAVTTVEGIGSTKTRLHPVQ ERIAKSHGSQCGFCTPGIVMSMYTLLRNQPEPTMEEIENAFQGNLCRCTGYRPILQGF RTFARDGGCCGRDGNNPNCCMNQKKDHSVSLSPSLFKPEEFTPLDPTQEPIFPPELLR LKDTPRKQLRFEGERVTWIQASTLKELLDLKAQHPDAKLVEGNTEIGIEMKFKNMLFP MIVCPAWIPELNSVEHGPDGISFGAACPLSIVEKTLVDAVAKLPAQKTEVFRGVLEHV RWFAGKQVKSVASVGGNIITASPISDLNPVFMASGAKLTLVSRGTRRTVQMDHTFFPG YRKTLLSPEEILLSIEIPYSREGEYFSAFKQASRREDDIAKVTSGMRVLFKPGTTEVQ ELALCYGGMANRTISALKTTQRQLSKLWKEELLQDVCAGLAEELQLPPDAPGGMVDFR CTLTLSLLLKFYLTVLQKLGQENLEDKCGKLDPTFASATLLFQKDPPADVQLFQEVPK GQSEEDMVGRPLPHLAADMQASGEAVYCDDIPRYENELSLRLVTSTRAHAKIKSIDTS EAKKVPGFVCFISADDVPGSNITGICNDETVFAKDKVTCVGHIIGAVVADTPEHTQRA AQGVKITYEELPAIITIEDAIKNNSFYGPELKIEKGDLKKGFSEADNVVSGEIYIGGQ EHFYLETHCTIAVPKGEAGEMELFVSTQNTMKTQSFVAKMLGVPANRIVVRVKRMGGG FGGKETRSTVVSTAVALAAYKTGRPVRCMLDRDEDMLITGGRHPFLARYKVGFMKTGT VVALEVDHFSNVGNTQDLSQSIMERALFHMDNCYKIPNIRGTGRLCKTNLPSNTAFRG FGGPQGMLIAECWMSEVAVTCGMPAEEVRRKNLYKEGDLTHFNQKLEGFTLPRCWEEC LASSQYHARKSEVDKFNKENCWKKRGLCIIPTKFGISFTVPFLNQAGALLHVYTDGSV LLTHGGTEMGQGLHTKMVQVASRALKIPTSKIYISETSTNTVPNTSPTAASVSADLNG QAVYAACQTILKRLEPYKKKNPSGSWEDWVTAAYMDTVSLSATGFYRTPNLGYSFETN SGNPFHYFSYGVACSEVEIDCLTGDHKNLRTDIVMDVGSSLNPAIDIGQVEGAFVQGL GLFTLEELHYSPEGSLHTRGPSTYKIPAFGSIPIEFRVSLLRDCPNKKAIYASKAVGE PPLFLAASIFFAIKDAIRAARAQHTGNNVKELFRLDSPATPEKIRNACVDKFTTLCVT GVPENCKPWSVRV" BASE COUNT 1037 a 1086 c 1154 g 902 t ORIGIN 1 ccggacctgc cagtgtctct taggagtgag gtacctggag ttcggggacc ccaacctgtg 61 acaatgacag cagacaaatt ggttttcttt gtgaatggca gaaaggtggt ggagaaaaat 121 gcagatccag agacaaccct tttggcctac ctgagaagaa agttggggct gagtggaacc 181 aagctcggct gtggagaagg gggctgcggg gcttgcacag tgatgctctc caagtatgat 241 cgtctgcaga acaagatcgt ccacttttct gccaatgcct gcctggcccc catctgctcc 301 ttgcatcatg ttgctgtgac aactgtggaa ggaataggaa gcaccaagac gaggctgcat 361 cctgtgcagg agagaattgc caaaagccac ggctcccagt gcgggttctg cacccctggc 421 atcgtcatga gtatgtacac actgctccgg aatcagcccg agcccaccat ggaggagatt 481 gagaatgcct tccaaggaaa tctgtgccgc tgcacaggct acagacccat cctccagggc 541 ttccggacct ttgccaggga tggtggatgc tgtggaagag atgggaataa tccaaattgc 601 tgcatgaacc agaagaaaga ccactcagtc agcctctcgc catctttatt caaaccagag 661 gagttcacgc ccctggatcc aacgcaggaa cccatttttc ccccagagtt gctgaggctg 721 aaagacactc ctcggaagca gctgcgattt gaaggggagc gtgtgacgtg gatacaggcc 781 tcaaccctca aggagctgct ggacctcaag gctcagcacc ctgacgccaa gctggtcgag 841 gggaacacgg agattggcat tgagatgaag ttcaagaata tgctgtttcc tatgattgtt 901 tgcccagcct ggatccctga gctgaattcg gtagaacatg gacccgacgg tatctccttt 961 ggagctgctt gccccctgag cattgtggaa aaaaccctgg tggatgctgt tgctaagctt 1021 cctgcccaaa agacagaggt gttcagaggg gtcctggagc acgtgcgctg gtttgctggg 1081 aagcaagtca agtctgtggc gtccgttgga gggaacatca tcactgccag ccccatctcc 1141 gacctcaacc ccgtgttcat ggccagtggg gccaagctga cacttgtgtc cagaggcacc 1201 aggagaactg tccagatgga ccacaccttc ttccctggct acagaaagac cctgctgagc 1261 ccggaggaga tactgctctc catagagatc ccctacagca gggaggggga gtatttctca 1321 gcattcaagc aggcctcccg gagagaagat gacattgcca aggtaaccag tggcatgaga 1381 gttttattca agccaggaac cacagaggta caggagctgg ccctttgcta tggtggaatg 1441 gccaacagaa ccatctcagc cctcaagacc actcagaggc agctttccaa gctctggaag 1501 gaggagctgc tgcaggacgt gtgtgcagga ctggcagagg agctgcagct gcctcccgat 1561 gcccctggtg gcatggtgga cttccggtgc accctcaccc tcagcttgtt gttgaagttc 1621 tacctgacag tccttcagaa gctgggccaa gagaacctgg aagacaagtg tggtaaactg 1681 gaccccactt tcgccagtgc aactttactg tttcagaaag accccccagc cgatgtccag 1741 ctcttccaag aggtgcccaa gggtcagtct gaggaggaca tggtgggccg gcccctgccc 1801 cacctggcag cggacatgca ggcctctggt gaggccgtgt actgtgacga cattcctcgc 1861 tacgagaatg agctgtctct ccggctggtc accagcaccc gggcccacgc caagatcaag 1921 tccatagata catcagaagc taagaaggtt ccagggtttg tttgtttcat ttccgctgat 1981 gatgttcctg ggagtaacat aactggaatt tgtaatgatg agacagtctt tgcgaaggat 2041 aaggttactt gtgttgggca tatcattggt gctgtggttg ctgacacccc ggaacacaca 2101 cagagagctg cccaaggggt gaaaatcacc tatgaagaac taccagccat tatcacaatt 2161 gaggatgcta taaagaacaa ctccttttat ggacctgagc tgaagatcga gaaaggggac 2221 ctaaagaagg ggttttccga agcagataat gttgtgtcag gggagatata catcggtggc 2281 caagagcact tctacctgga gactcactgc accattgctg ttccaaaagg cgaggcaggg 2341 gagatggagc tctttgtgtc tacacagaac accatgaaga cccagagctt tgttgcaaaa 2401 atgttggggg ttccagcaaa ccggattgtg gttcgagtga agagaatggg aggaggcttt 2461 ggaggcaagg agacccggag cactgtggtg tccacggcag tggccctggc tgcatataag 2521 accggccgcc ctgtgcgatg catgctggac cgtgatgagg acatgctgat aactggtggc 2581 agacatccct tcctggccag atacaaggtt ggcttcatga agactgggac agttgtggct 2641 cttgaggtgg accacttcag caatgtgggg aacacccagg atctctctca gagtattatg 2701 gaacgagctt tattccacat ggacaactgc tataaaatcc ccaacatccg gggcactggg 2761 cggctgtgca aaaccaacct tccctccaac acggccttcc ggggctttgg ggggccccag 2821 gggatgctca ttgccgagtg ctggatgagt gaagttgcag tgacctgtgg gatgcctgca 2881 gaggaggtgc ggagaaaaaa cctgtacaaa gaaggggacc tgacacactt caaccagaag 2941 cttgagggtt tcaccttgcc cagatgctgg gaagaatgcc tagcaagctc tcagtatcat 3001 gctcggaaga gtgaggttga caagttcaac aaggagaatt gttggaaaaa gagaggattg 3061 tgcataattc ccaccaagtt tggaataagc ttcacagttc cttttctgaa tcaggcagga 3121 gccctacttc atgtgtacac agatggctct gtgctgctga cccacggggg gactgagatg 3181 ggccaaggcc ttcataccaa aatggtccag gtggccagta gagctctgaa aatccccacc 3241 tctaagattt atatcagcga gacaagcact aacactgtgc ccaacacctc tcccacggct 3301 gcctctgtca gcgctgacct caatggacag gccgtctatg cggcttgtca gaccatcttg 3361 aaaaggctgg aaccctacaa gaagaagaat cccagtggct cctgggaaga ctgggtcaca 3421 gctgcctaca tggacacagt gagcttgtct gccactgggt tttatagaac acccaatctg 3481 ggctacagct ttgagactaa ctcagggaac cccttccact acttcagcta tggggtggct 3541 tgctctgaag tagaaatcga ctgcctaaca ggagatcata agaacctccg cacagatatt 3601 gtcatggatg ttggctccag tctaaaccct gccattgata ttggacaggt ggaaggggca 3661 tttgtccagg gccttggcct cttcacccta gaggagctac actattcccc cgaggggagc 3721 ctgcacaccc gtggccctag cacctacaag atcccggcat ttggcagcat ccccattgag 3781 ttcagggtgt ccctgctccg cgactgcccc aacaagaagg ccatctatgc atcgaaggct 3841 gttggagagc cgcccctctt cctggctgct tctatcttct ttgccatcaa agatgccatc 3901 cgtgcagctc gagctcagca cacaggtaat aacgtgaagg aactcttccg cctagacagc 3961 cctgccaccc cggagaagat ccgcaatgcc tgcgtggaca agttcaccac cctgtgtgtc 4021 actggtgtcc cagaaaactg caaaccctgg tctgtgaggg tctaaagaga gagtcctcag 4081 cagagtcttc ttgtgctgcc tttgggcttc catggagcag gaggaacata ccacagaaca 4141 tggatctatt aaagtcacag aatgacagac ctgtgattt // LOCUS HSU06452 1524 bp mRNA PRI 25-JUN-1994 DEFINITION Human melanoma antigen recognized by T-cells (MART-1) mRNA. ACCESSION U06452 NID g476131 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1524) AUTHORS Kawakami,Y., Eliyahu,S., Delgado,C.H., Robbins,P.F., Rivoltini,L., Topalian,S.L., Miki,T. and Rosenberg,S.A. TITLE Cloning of the gene coding for a shared human melanoma antigen recognized by autologous T cells infiltrating into tumor JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91, 3515-3519 (1994) MEDLINE 94224770 REFERENCE 2 (bases 1 to 1524) AUTHORS Kawakami,Y. TITLE Direct Submission JOURNAL Submitted (08-FEB-1994) Yutaka Kawakami, Surgery Branch, National Cancer Institute, National Institutes of Health, 9000 Rockville Pike, Bldg.10, Rm. 2B42, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..1524 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="melanoma cell line, 501mel" /tissue_type="melanoma" gene 54..410 /gene="MART-1" CDS 54..410 /gene="MART-1" /standard_name="melanoma antigen recognized by T-cells" /codon_start=1 /db_xref="PID:g476132" /translation="MPREDAHFIYGYPKKGHGHSYTTAEEAAGIGILTVILGVLLLIG CWYCRRRNGYRALMDKSLHVGTQCALTRRCPQEGFDHRDSKVSLQEKNCEPVVPNAPP AYEKLSAEQSPPPYSP" BASE COUNT 435 a 330 c 324 g 435 t ORIGIN 1 agcagacaga ggactctcat taaggaaggt gtcctgtgcc ctgaccctac aagatgccaa 61 gagaagatgc tcacttcatc tatggttacc ccaagaaggg gcacggccac tcttacacca 121 cggctgaaga ggccgctggg atcggcatcc tgacagtgat cctgggagtc ttactgctca 181 tcggctgttg gtattgtaga agacgaaatg gatacagagc cttgatggat aaaagtcttc 241 atgttggcac tcaatgtgcc ttaacaagaa gatgcccaca agaagggttt gatcatcggg 301 acagcaaagt gtctcttcaa gagaaaaact gtgaacctgt ggttcccaat gctccacctg 361 cttatgagaa actctctgca gaacagtcac caccacctta ttcaccttaa gagccagcga 421 gacacctgag acatgctgaa attatttctc tcacactttt gcttgaattt aatacagaca 481 tctaatgttc tcctttggaa tggtgtagga aaaatgcaag ccatctctaa taataagtca 541 gtgttaaaat tttagtaggt ccgctagcag tactaatcat gtgaggaaat gatgagaaat 601 attaaattgg gaaaactcca tcaataaatg ttgcaatgca tgatactatc tgtgccagag 661 gtaatgttag taaatccatg gtgttatttt ctgagagaca gaattcaagt gggtattctg 721 gggccatcca atttctcttt acttgaaatt tggctaataa caaactagtc aggttttcga 781 accttgaccg acatgaactg tacacagaat tgttccagta ctatggagtg ctcacaaagg 841 atacttttac aggttaagac aaagggttga ctggcctatt tatctgatca agaacatgtc 901 agcaatgtct ctttgtgctc taaaattcta ttatactaca ataatatatt gtaaagatcc 961 tatagctctt tttttttgag atggagtttc gcttttgttg cccaggctgg agtgcaatgg 1021 cgcgatcttg gctcaccata acctccgcct cccaggttca agcaattctc ctgccttagc 1081 ctcctgagta gctgggatta caggcgtgcg ccactatgcc tgactaattt tgtagtttta 1141 gtagagacgg ggtttctcca tgttggtcag gctggtctca aactcctgac ctcaggtgat 1201 ctgcccgcct cagcctccca aagtgctgga attacaggcg tgagccacca cgcctggctg 1261 gatcctatat cttaggtaag acatataacg cagtctaatt acatttcact tcaaggctca 1321 atgctattct aactaatgac aagtattttc tactaaacca gaaattggta gaaggattta 1381 aataagtaaa agctactatg tactgcctta gtgctgatgc ctgtgtactg ccttaaatgt 1441 acctatggca atttagctct cttgggttcc caaatccctc tcacaagaat gtgcagaaga 1501 aatcataaag gatcagagat tctg // LOCUS HSU06454 2361 bp mRNA PRI 05-APR-1995 DEFINITION Human AMP-activated protein kinase (hAMPK) mRNA, complete cds. ACCESSION U06454 NID g758366 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2361) AUTHORS Aguan,K., Scott,J., See,C.G. and Sarkar,N.H. TITLE Characterization and chromosomal localization of the human homologue of a rat AMP-activated protein kinase-encoding gene: a major regulator of lipid metabolism in mammals JOURNAL Gene 149 (2), 345-350 (1994) MEDLINE 95047501 REFERENCE 2 (bases 1 to 2361) AUTHORS Aguan,K. TITLE Direct Submission JOURNAL Submitted (08-FEB-1994) Kripamoy Aguan, Dept. of Immunology and Microbiology, Medical College of Georgia, Augusta, GA 30912, USA FEATURES Location/Qualifiers source 1..2361 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1p31" /tissue_type="heart" gene 65..1723 /gene="hAMPK" CDS 65..1723 /gene="hAMPK" /codon_start=1 /product="AMP-activated protein kinase" /db_xref="PID:g758367" /translation="MAEKQKHDGRVKIGHYVLGDTLGVGTFGKVKIGEHQLTGHKVAV KILNRQKIRSLDVVGKIKREIQNLKLFRHPHIIKLYQVISTPTDFFMVMEYVSGGELF DYICKHGRVEEMEARRLFQQILSAVDYCHRHMVVHRDLKPENVLLDAHMNAKIADFGL SNMMSDGEFLRTSCGSPNYTAPEVISGRLYAGPEVDIWSCGVILYALLCGTLPFDDEH VPTLFKKIRGGVFYIPEYLNRSVATLLMHMLQVDPLKRATIKDIREHEWFKQGLPSYL FPEDPSYDANVIDDEAVKEVCEKFECTESEVMNSLYSGDPQDQLAVAYHLIIDNRRIM NQASEFYLASSPPSGSFMDDSAMHIPPGLKPHPERMPPLIADSPKARCPLDALNTTKP KSLAVKKAKWRQGIRSQSKPYDIMAEVYRAMKQLDFEWKVVNAYHLRVRRKNPVTGNY VKMSLQLYLVDNRSYLLDFKSIDDEVVEQRSGSSTPQRSCSAAGLHRPRSSFDSTTAE SHSLSGSLTGSLTGSTLSSVSPRLGSHTMDFFEMCASLITTLAR" misc_feature 128..145 /gene="hAMPK" /note="ATP-binding domain" misc_feature 581..592 /gene="hAMPK" /note="phosphorylation site" BASE COUNT 700 a 467 c 510 g 684 t ORIGIN 1 ggtagcggcg gcggcggcgg ctagcggagc ggcaggcggt ggagcgaggc cgcgcgcgcc 61 gaagatggct gagaagcaga agcacgacgg gcgggtgaag atcggacact acgtgctggg 121 cgacacgctg ggcgtcggca ccttcggcaa agtgaagatt ggagaacatc aattaacagg 181 ccataaagtg gcagttaaaa tcttaaatag acagaagatt cgcagtttag atgttgttgg 241 aaaaataaaa cgagaaattc aaaatctaaa actctttcgt catcctcata ttatcaaact 301 ataccaggtg atcagcactc caacagattt ttttatggta atggaatatg tgtctggagg 361 tgaattattt gactacatct gtaagcatgg acgggttgaa gagatggaag ccaggcggct 421 ctttcagcag attctgtctg ctgtggatta ctgtcatagg catatggttg ttcatcgaga 481 cctgaaacca gagaatgtcc tgttggatgc acacatgaat gccaagatag ccgatttcgg 541 attatctaat atgatgtcag atggtgaatt tctgagaact agttgcggat ctccaaatta 601 tacagcacct gaagtcatct caggcagatt gtatgcaggt cctgaagttg atatctggag 661 ctgtggtgtt atcttgtatg ctcttctttg tggcaccctc ccatttgatg atgagcatgt 721 acctacgtta tttaagaaga tccgaggggg tgtcttttat atcccagaat atctcaatcg 781 ttctgtcgcc actctcctga tgcatatgct gcaggttgac ccactgaaac gagcaactat 841 caaagacata agagagcatg aatggtttaa acaaggtttg cccagttact tatttcctga 901 agacccttcc tatgatgcta acgtcattga tgatgaggct gtgaaagaag tgtgtgaaaa 961 atttgaatgt acagaatcag aagtaatgaa cagtttatat agtggtgacc ctcaagacca 1021 gcttgcagtg gcttatcatc ttatcattga caatcggaga ataatgaacc aagccagtga 1081 gttctacctc gcctctagtc ctccatctgg ttcttttatg gatgatagtg ccatgcatat 1141 tcccccaggc ctgaaacctc atccagaaag gatgccacct cttatagcag acagccccaa 1201 agcaagatgt ccattggatg cactgaatac gactaagccc aaatctttag ctgtgaaaaa 1261 agccaagtgg cgtcaaggaa tccgaagtca gagcaaaccg tatgacatta tggctgaagt 1321 ttaccgagct atgaagcagc tggattttga atggaaggta gtgaatgcat accatcttcg 1381 tgtaagaaga aaaaatccag tgactggcaa ttacgtgaaa atgagcttac aactttacct 1441 ggttgataac aggagctatc ttttggactt taaaagcatt gatgatgaag tagtggagca 1501 gagatctggt tcctcaacac ctcagcgttc ctgttctgct gctggcttac acagaccaag 1561 atcaagtttt gattccacaa ctgcagagag ccattcactt tctggctctc tcactggctc 1621 tttgaccgga agcacattgt cttcagtttc acctcgcctg ggcagtcaca ccatggattt 1681 ttttgaaatg tgtgccagtc tgattactac tttagcccgt tgatctgtct ctagtttctt 1741 tctgttattg cactatgaaa atcagttata ttctttaaat ttttatctta cttttggata 1801 atatccactg caatactaat tgagaaacat gaattatttc caggggcaca caatgctatt 1861 gaaattactg aaaacaaaat atctgacatc ttatttactt gtagaaatct gtaattctat 1921 tgtgcctatg ataaattcac ataggcaata tctttaatag gttaatatca atgaagattt 1981 ttaattacaa taatgagttc actacagacg attaacacac cacactggcg aaccatctca 2041 atgtaagggt ggtttggcaa cacctccttg ctttgctgtt tggtgtagta aatctagttt 2101 acttcctaaa tttcagtagg ctttatgctg tgtttatcgc ccaatttatt ttaacaaaag 2161 aagattaaaa agtaaagaac cacgagtaag atattattta aatgttgaaa tcttaaaacc 2221 tgcctccaag atttcagaag ccaagttttt ctaacagtat ttgtacaaat actgcctagt 2281 gtattcaaca gaagactgtg gtcatgtaac aggtaaccac aattttcagg tttcttaaaa 2341 acagctgtaa ctaactcagg a // LOCUS HSU06631 3828 bp mRNA PRI 10-MAR-1994 DEFINITION Human (H326) mRNA, complete cds. ACCESSION U06631 NID g458691 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3828) AUTHORS Bergsagel,P.L. and Kuehl,W. TITLE H326 is a human gene homologous to murine PC326 that is ubiquitously expressed, and has a murine homologue that is also ubiquitously expressed JOURNAL Unpublished REFERENCE 2 (bases 1 to 3828) AUTHORS Bergsagel,P.L. TITLE Direct Submission JOURNAL Submitted (10-FEB-1994) P. Leif Bergsagel, Navy Medical Oncology Branch, National Cancer Institute, NMC 8, Room 5101, Bethesda, MD 20889-5105, USA FEATURES Location/Qualifiers source 1..3828 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="h326" /clone_lib="H929 lambda ZAPII" /sex="male" /cell_line="H929" /cell_type="plasma cell" /tissue_type="myeloma" gene 177..1970 /gene="H326" CDS 177..1970 /gene="H326" /note="homologous to mouse gene PC326:GenBank Accession Number M95564" /codon_start=1 /db_xref="PID:g458692" /translation="MSSKGSSTDGRTDLANGSLSSSPEEMSGAEEGRETSSGIEVEAS DLSLSLTGDDGGPNRTSTESRGTDTESSGEDKDSDSMEDTGHYSINDENRVHDRSEEE EEEEEEEEEEQPRGACTAQAANRDQDSSDDERALEDWVSSETSALPRPRWQALPALRE RELGSSARFVYEACGARVFVHGFRLQHGLEGHTGCVNTLHFNQRGTWLASGSDDLKVV VWDWVRRQPVLDFESGHKSNVFQAKFLPNSGDSTLAMCARDGQVRVAELSATQCCKNT KRVAQHKGASHKLALEPDSPCTFLSAGEDAVVFTIDLRQDRPASKLVVTKEKEKKVGL YTIYVNPANTHQFAVGGRDQFVRIYDQRKIDENENNGVLKKFCPHHLVNSESKANITC LVYSHDGTELLASYNDEDIYLFNSSHSDGAQYVKRYKGHRNNATVKGVNFYGPKSEFV VSGSDCGHIFLWEKSSCQIIQFMEGDKGGVVNCLEPHPHLPVLATSGLDHDVKIWAPT AEASTELTGLKDVIKKNKRERDEDSLHQTDLFDSHMLWFLMHHLRQRRHHRRWREPGV GATDADSDESPSSSDTSDEEEGPDRVQCMPS" BASE COUNT 964 a 892 c 999 g 967 t 6 others ORIGIN 1 ctcttagcgc tcaggtcttt tccttccgcc gacccgaagt catcgctggg agtactggtt 61 gccctttcct cagtccttca gtgaatctac agagcctatt tcctcaggag cctcagcctg 121 gtccttactt cagtgataaa aggaggaaag gctggctaca gcaaacatca ttcaagatgt 181 ccagcaaagg gagcagcaca gatggcagaa cagacttagc taatggaagc ctgtctagca 241 gtccagagga gatgtctgga gctgaagagg ggagggagac atcctcaggc attgaagtgg 301 aggcctcaga cctgagtttg agcttgactg gggatgatgg tggccccaac cgcaccagca 361 cagaaagtcg aggcacagac acagagagct caggtgaaga taaggactct gacagcatgg 421 aggacactgg tcattactcc attaatgatg aaaatcgagt ccatgaccgc tcagaggaag 481 aggaagagga ggaagaagag gaggaagaag agcagcctcg gggcgcgtgt acagcgcaag 541 cggctaaccg tgaccaggac tcatcagatg atgagcgggc cctagaggac tgggtgtcct 601 cagaaacatc agctctaccc cgacctcgct ggcaagccct ccctgccctt cgggagcggg 661 agctgggttc aagtgcccgc tttgtctatg aggcctgtgg ggcaagagtc tttgtgcacg 721 gtttccgcct gcagcatggg cttgagggcc atactggttg tgtcaatacc ctgcacttta 781 accagcgcgg cacctggctg gccagtggca gcgatgacct gaaggtggtg gtgtgggatt 841 gggtacggcg gcagccagta ctggactttg agagtggcca caaaagtaat gtgttccagg 901 ccaagtttct tcctaacagt ggtgattcta ctctggccat gtgtgcccgt gacgggcagg 961 ttcgagtagc agaactgtct gccacacagt gttgcaagaa tacaaaacgt gtggcccagc 1021 acaagggagc gtcccacaag ttggcactgg aaccagactc tccctgtacg ttcttatctg 1081 caggtgaaga tgcagttgtt ttcaccattg acctgagaca agaccgccca gcgtcgaaac 1141 tggtggtgac aaaagagaaa gagaagaaag tggggctgta tacgatctat gtgaatcctg 1201 ccaataccca ccagtttgca gtgggtggac gagatcagtt tgtaaggatt tatgaccaga 1261 ggaaaattga tgagaatgag aacaatggag tactcaagaa gttctgtcct catcacctgg 1321 tgaacagtga gtccaaagca aacatcacct gtcttgtgta cagccacgac ggcacagagc 1381 tcctggccag ttacaatgat gaagacattt acctcttcaa ctcctctcac agtgatgggg 1441 cccagtatgt taagagatac aagggccaca gaaataatgc cacagtaaaa ggcgtcaatt 1501 tctatggccc caagagtgag tttgtggtga gcggtagtga ctgtgggcac atcttcctct 1561 gggagaaatc atcctgccag attattcagt tcatggaggg ggacaaggga ggcgtggtaa 1621 actgtcttga gccccaccct cacctgcctg tgctggcaac cagtggccta gaccatgatg 1681 tgaagatctg ggcacccaca gctgaagctt ccactgagct gacagggtta aaagatgtga 1741 ttaagaagaa caagcgggag cgggatgaag atagcttgca ccaaactgac ctgtttgata 1801 gtcacatgct gtggttcctt atgcatcacc tgagacagag acgccatcac cggcgctggc 1861 gagaacctgg ggttggggcc acagacgcgg actctgatga gtctcccagc tcctcagaca 1921 catcggacga ggaggagggc cctgaccggg tgcagtgcat gccatcttga ggcctcatac 1981 ctaggtgggg caggctgggg ctgccaacct gatcctgcct gggcaaccct ttcctgtccc 2041 aggccctaca ttcagcagaa acgcactttg gactttttgc tttagataaa agaaagacat 2101 cccaggagaa ggacaaacca gaggagtgaa ccaacaaaga gtacctagga atgggagttg 2161 agcctggaat gggctccatg gagaggtgca taggactcgg cagaaatggc ctctccccaa 2221 agcctctttt tgagaggaga gggaagccta tttgttaact ggtttgggat agggaatggg 2281 gtttcttttt ctttaatctc ccttgtttct tgggctgggg gaggggtggg gggaacaact 2341 ggctattcag taccaagggg ccagagtgga gggtaggagt gccactctct ctttggttta 2401 ggtttttgac cttttcttcc tttgtttttt aaaagtttat gacagttnct cccnnnaccc 2461 cacaacccca tcccagaatc ctattttcct gggaagtcct taaagcccct aaccatccca 2521 cactcttcac tttcctttcc accttattca ttctctgtac ttaccacagt attttgcact 2581 tgattacata tccttcatct cttctcttca tcccatcacc ccctaaatag gtcaggtgag 2641 ggaggctggg aagaggtggg aggaggggag aagtgaagga agataggaag gatattacct 2701 cttctgttat ttttttaaga aacattgttt ggtggcagca atctccctgt ccctatcact 2761 gttagaggcc taattttata tctataaata tattaaaaag caagtcaaac ttggatgtat 2821 caaggtaaaa ttattgtcaa agtttaaata cctatatatt ctctgaatgc aataaaggga 2881 cttaagagtg aacaagagta atggtgtgga agtgacacct ggggtcagtt tacctctgtg 2941 tatggtcact agagattggg acttaccctt taggttttag gaggcttgag aatggaagga 3001 tcctcatttc tgcccttcct ggttccctgc tttggtgtag gggttgggaa aaacaggaaa 3061 ttcctctcag ctctgcctca gatctcctac ctctccttaa gtcttgtagg gggttccaag 3121 gatggctctt ctaaccagag gctggcctgt ctttaaaact taactacttt agggtggtgc 3181 caccactgca gactattgtg gtactttgtg acagaagaca tgtacacaca caccacacac 3241 atacatacac actctctcac tctgtctctc ttacctttag ctgcttgatc attaagccat 3301 ccaacttcat gccagttccc ttctttatag aagagtgaag ggaaagactt cctgggtttg 3361 acttaaacct tgtccacctt cttgatattt taggattgag gaataaagtc attaatctaa 3421 ggaactgatt acagtggctg gagcttgggc acttgtctta tcactggtca ctgagtctga 3481 aagtcccagn tgaattcttg cccttaagtg cttttgctgc tatttttttg cccccagttc 3541 cacaagatcc aaccaagaat tctgtatcct ggcaacagtc agattcttct aaatcagcca 3601 gcaagagggn aaagagtgag agatggtatt cccagatcat tcttcctcct gcccctttcc 3661 cagcagctct agaccagatg ttggctgctg tacttactcc ctgaggtagg gaatgtgtgg 3721 tgatcgagtg gtctgtgttc ctattgctgg tggggtgata gggtgggcta aaaaccatgc 3781 actctggaat ttgttgtatt ttctcccagt aaagcttttc ttctcccg // LOCUS HSU06632 2622 bp mRNA PRI 23-OCT-1997 DEFINITION Homo sapiens p80-coilin mRNA, complete cds. ACCESSION U06632 NID g458435 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2622) AUTHORS Andrade,L.E., Chan,E.K., Raska,I., Peebles,C.L., Roos,G. and Tan,E.M. TITLE Human autoantibody to a novel protein of the nuclear coiled body: immunological characterization and cDNA cloning of p80-coilin JOURNAL J. Exp. Med. 173 (6), 1407-1419 (1991) MEDLINE 91237287 REFERENCE 2 (bases 1 to 2622) AUTHORS Chan,E.K., Takano,S., Andrade,L.E., Hamel,J.C. and Matera,A.G. TITLE Structure, expression and chromosomal localization of human p80-coilin gene JOURNAL Nucleic Acids Res. 22 (21), 4462-4469 (1994) MEDLINE 95061408 REFERENCE 3 (bases 1 to 2622) AUTHORS Chan,E.K.L. TITLE Direct Submission JOURNAL Submitted (10-FEB-1994) Edward K.L. Chan, Molecular and Experimental Medicine, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, CA 92037, USA FEATURES Location/Qualifiers source 1..2622 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pC15 (5'RACE)" /cell_line="Hep G2" /cell_type="epithelial cell" /tissue_type="liver" /dev_stage="adult" /map="17q23" /chromosome="17" CDS 23..1753 /codon_start=1 /product="p80-coilin" /db_xref="PID:g458436" /translation="MAASETVRLRLQFDYPPPATPHCTAFWLLVDLNRCRVVTDLISL IRQRFGFSSGAFLGLYLEGGLLPPAESARLVRDNDCLRVKLEERGVAENSVVISNGDI NLSLRKAKKRAFQLEEGEETEPDCKYSKKHWKSRENNNNNEKVLDLEPKAVTDQTVSK KNKRKNKATCGTVGDDNEEAKRKSPKKKEKCEYKKKAKNPKSPKVQAVKDWANQRCSS PKGSARNSLVKAKRKGSVSVCSKESPSSSSESESCDESISDGPSKVTLEARNSSEKLP TELSKEEPSTKNTTADKLAIKLGFSLTPSKGKTSGTTSSSSDSSAESDDQCLMSSSTP ECAAGFLKTVGLFAGRGRPGPGLSSQTAGAAGWRRSGSNGGGQAPGASPSVSLPASLG RGWGREENLFSWKGAKGRGMRGRGRGRGHPVSCVVNRSTDNQRQQQLNDVVKNSSTII QNPVETPKKDYSLLPLLAAAPQVGEKIAFKLLELTSSYSPDVSDYKEGRILSHNPETQ QVDIEILSSLPALREPGKFDLVYHNENGAEVVEYAVTQESKITVFWKELIDPRLIIES PSNTSSTEPA" BASE COUNT 788 a 510 c 604 g 720 t ORIGIN 1 cttccgttga gcaccaagca agatggcagc ttccgagacg gttaggctac ggcttcaatt 61 tgattacccg ccgccagcta ccccgcactg tacggccttc tggcttctgg tcgacttgaa 121 cagatgccga gtcgtcacag atctcattag tctcatccgc cagcgcttcg gcttcagttc 181 tggggccttc ctaggcctct acctggaggg ggggctcttg ccccccgccg agagcgcgcg 241 ccttgtgaga gacaacgact gcctcagagt taaattagaa gagagaggag ttgctgagaa 301 ttctgtagtc atcagtaatg gtgacattaa tttatctctt agaaaagcaa agaagcgggc 361 atttcagtta gaggagggtg aagaaactga accagattgc aaatattcaa agaagcattg 421 gaagagtcga gagaacaata acaataatga gaaggtcttg gatctggaac caaaagctgt 481 cacagatcag actgtcagca aaaaaaacaa gagaaaaaat aaagcaacct gtggcacagt 541 gggtgatgat aacgaagagg ccaaaagaaa atcaccaaag aaaaaggaga aatgtgaata 601 taaaaaaaag gctaagaatc ccaagtctcc gaaagtacag gcagtgaaag actgggccaa 661 tcagagatgt agttctccaa aaggttctgc tagaaacagc cttgttaaag ccaaaaggaa 721 aggtagtgta agcgtttgct caaaagagag tcccagttcc tcctcggagt ctgagtcttg 781 tgatgaatct atcagtgatg gtcccagcaa agtcactttg gaggccagaa attcctcaga 841 gaaattacca actgagttat caaaggaaga accctctacc aaaaatacaa ctgcagacaa 901 actggctata aaacttggct ttagccttac ccccagcaag ggcaagacct ctggaacaac 961 atcttccagt tcagactcta gtgcagagtc agacgaccaa tgcttgatgt catcgagcac 1021 cccggagtgt gctgcgggtt tcttaaagac agtaggcctt tttgcaggaa gaggtcgtcc 1081 aggcccaggg ctgtcatcac agactgcagg tgctgctgga tggaggcgtt ctggctcaaa 1141 tggtggtgga caggctcctg gtgcttctcc cagtgtgtct ctccctgcta gtttaggaag 1201 aggatggggt agagaagaga accttttttc ttggaaggga gctaagggac ggggcatgcg 1261 ggggagaggt cgaggacgag ggcatcctgt ttcctgtgtt gtaaatagaa gcactgacaa 1321 ccagaggcaa cagcaattaa atgacgtggt aaaaaattca tctactatta tccagaatcc 1381 agtagagaca cccaagaagg actatagtct gttaccactg ttagcagctg cccctcaagt 1441 tggagaaaag attgcattta agcttttgga gctaacatcc agttactctc ctgatgtctc 1501 tgactacaag gaaggaagaa tattaagcca caatccagag acccagcaag tagatataga 1561 aattctttca tccttacctg ccttgagaga acctgggaaa tttgatttag tttatcacaa 1621 tgaaaatgga gccgaggtag tggagtacgc tgtgacacag gagagcaaga tcactgtatt 1681 ttggaaagag ttgattgacc caagactgat tattgaatct ccaagtaaca catcaagtac 1741 agaacctgcc tgagtatgac ctctccacct tatagtttat gaatgtcttg tttgtgaaag 1801 tgactataac ccaaactttt ttttttttta aagaggattt ggaagttgta tggatttttt 1861 tgttatcttc actttactgc ataggaaaca atctacctca tcatttaaaa tgacatgggt 1921 gtcggttttg tagatctttg gtttttttgt caggtttaat taccattaac aaatgtaaaa 1981 catgacattc cctgcagata ttgttgtata ccagtatggt ttattatctt tctttaaatc 2041 ttattggcca tcaagtatgc agtcgtcagt atgtgatgtt tataatacca atgaatgtgc 2101 tgcgtatctt gtctcaataa gttttaagta acatttaaaa atattaaagc atgttatttg 2161 acctaatttt ttagcatttg agttgttcca ttaaatggag catcttgtaa atttcaagta 2221 ttttatactt gcaattgtta agagttaaca ggtagttgga tttgtcgcag acaatgagtt 2281 aaggaatcct ttcacgtttt tcccaacttt aaaattaagg attctcaggt ccctgtgtag 2341 agcagtgaaa ataagatgtg cgtatgtgtg tgtatgcctg gagattggtg tttcacttca 2401 gtgagaggat tggctgtgag cttcagacca gaaatgtgtc atcttgccag cccctggctg 2461 agtgtgctgg agtgaggatc ttgaacagaa acttcctttt ctgttattat tcactacgaa 2521 gctaaaatgg ccaaatatat accgtgaaaa ttggtttcat ttaacaaaag atcagatccc 2581 tccttcagct gtacacattt ttaaataaaa tcatattgaa ct // LOCUS HSU06643 498 bp mRNA PRI 06-FEB-1996 DEFINITION Human keratinocyte lectin 14 (HKL-14) mRNA, complete cds. ACCESSION U06643 NID g563700 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 498) AUTHORS Magnaldo,T., Bernerd,F. and Darmon,M. TITLE Galectin-7, a human 14-kDa S-lectin, specifically expressed in keratinocytes and sensitive to retinoic acid JOURNAL Dev. Biol. 168 (2), 259-271 (1995) MEDLINE 95246905 REFERENCE 2 (bases 1 to 498) AUTHORS Magnaldo,T. TITLE Direct Submission JOURNAL Submitted (14-FEB-1994) Thierry Magnaldo, Departement de Biologie, Laboratoire de Differenciation Epitheliale, Ecole Normale Superieure - 46 Rue D'Ulm, 75230 Paris Cedex 05, France FEATURES Location/Qualifiers source 1..498 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="TMC1A12" /clone_lib="human epidermal cDNA library" /cell_type="keratinocyte" /tissue_type="epidermis" /dev_stage="adult" gene 30..440 /gene="HKL-14" CDS 30..440 /gene="HKL-14" /note="not detected in squamous carcinoma cells, repressed by retinoic acid, delayed in psoriasis; also known as galectin-7" /codon_start=1 /evidence=experimental /product="keratinocyte lectin 14" /db_xref="PID:g458703" /translation="MSNVPHKSSLPEGIRPGTVLRIRGLVPPNASRFHVNLLCGEEQG SDAALHFNPRLDTSEVVFNSKEQGSWGREERGPGVPFQRGQPFEVLIIASDDGFKAVV GDAQYHHFRHRLPLARVRLVEVGGDVQLDSVRIF" polyA_signal 476 polyA_site 498 /note="23 A residues" BASE COUNT 84 a 165 c 165 g 84 t ORIGIN 1 ttaaagcaaa gaattccccg gtcccagcca tgtccaacgt cccccacaag tcctcgctgc 61 ccgagggcat ccgccctggc acggtgctga gaattcgcgg cttggttcct cccaatgcca 121 gcaggttcca tgtaaacctg ctgtgcgggg aggagcaggg ctccgatgcc gccctgcatt 181 tcaacccccg gctggacacg tcggaggtgg tcttcaacag caaggagcaa ggctcctggg 241 gccgcgagga gcgcgggccg ggcgttcctt tccagcgcgg gcagcccttc gaggtgctca 301 tcatcgcgtc agacgacggc ttcaaggccg tggttgggga cgcccagtac caccacttcc 361 gccaccgcct gccgctggcg cgcgtgcgcc tggtggaggt gggcggggac gtgcagctgg 421 actccgtgag gatcttctga gcagaagccc aggcggcccg gggccttggc tggcaaataa 481 agcgttagcc cgcagcgc // LOCUS HSU06698 3840 bp mRNA PRI 28-JUL-1994 DEFINITION Human neuronal kinesin heavy chain mRNA, complete cds. ACCESSION U06698 NID g497123 KEYWORDS kinesin; motor; organelle transport; microtubules. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3840) AUTHORS Niclas,J., Navone,F., Hom-Booher,N. and Vale,R.D. TITLE Cloning and localization of a conventional kinesin motor expressed exclusively in neurons JOURNAL Neuron 12 (5), 1059-1072 (1994) MEDLINE 94242426 REFERENCE 2 (bases 1 to 3840) AUTHORS Niclas,J. TITLE Direct Submission JOURNAL Submitted (15-FEB-1994) Joshua Niclas, Biochemistry, University of California at San Francisco, 3rd Ave. and Parnassus Ave., San Francisco, CA 94143, USA FEATURES Location/Qualifiers source 1..3840 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pWBC7" /sex="female" /tissue_type="hippocampus" /dev_stage="2 yrs old" CDS 149..3247 /note="also named nKHC" /codon_start=1 /product="neuronal kinesin heavy chain" /db_xref="PID:g516516" /translation="MAETNNECSIKVLCRFRPLNQAEILRGDKFIPIFQGDDSVVIGG KPYVFDRVFPPNTTQEQVYHACAMQIVKDVLAGYNGTIFAYGQTSSGKTHTMEGKLHD PQLMGIIPRIARDIFNHIYSMDENLEFHIKVSYFEIYLDKIRDLLDVTKTNLSVHEDK NRVPFVKGCTERFVSSPEEILDVIDEGKSNRHVAVTNMNEHSSRSHSIFLINIKQENM ETEQKLSGKLYLVDLAGSEKVSKTGAEGAVLDEAKNINKSLSALGNVISALAEGTKSY VPYRDSKMTRILQDSLGGNCRTTMFICCSPSSYNDAETKSTLMFGQRAKTIKNTASVN LELTAEQWKKKYEKEKEKTKAQKETIAKLEAELSRWRNGENVPETERLAGEEAALGAE LCEETPVNDNSSIVVRIAPEERQKYEEEIRRLYKQLDDKDDEINQQSQLIEKLKQQML DQEELLVSTRGDNEKVQRELSHLQSENDAAKDEVKEVLQALEELAVNYDQKSQEVEEK SQQNQLLVDELSQKVATMLSLESELQRLQEVSGHQRKRIAEVLNGLMKDLSEFSVIVG NGEIKLPVEISGAIEEEFTVARLYISKIKSEVKSVVKRCRQLENLQVECHRKMEVTGR ELSSCQLLISQHEAKIRSLTEYMQSVELKKRHLEESYDSLSDELAKLQAQETVHEVAL KDKEPDTQDADEVKKALELQMESHREAHHRQLARLRDEINEKQKTIDELKDLNQKLQL ELEKLQADYEKLKSEEHEKSTKLQELTFLYERHEQSKQDLKGLEETVARELQTLHNLR KLFVQDVTTRVKKSAEMEPEDSGGIHSQKQKISFLENNLEQLTKVHKQLVRDNADLRC ELPKLEKRLRATAERVKALEGALKEAKEGAMKDKRRYQQEVDRIKEAVRYKSSGKRAH SAQIAKPVRPGHYPASSPTNPYGTRSPECISYTNSLFQNYQNLYLQATPSSTSDMYFA NSCTSSGATSSGGPLASYQKANMDNGNATDINDNRSDLPCGYEAEDQAKLFPLHQETA AS" BASE COUNT 1051 a 990 c 1037 g 762 t ORIGIN 1 cccccaggct tcgccgggcg ccctcaactc tgtccccaga gactgagcac ctgtcctccg 61 cctcggcctc tgctgagagc cctctcctct ggagcacaca ccacccctgc agcccaagaa 121 gagtcccagc cccacgccgg ctaccaccat ggcggagacc aacaacgaat gtagcatcaa 181 ggtgctctgc cgattccggc ccctgaacca ggctgagatt ctgcggggag acaagttcat 241 ccccattttc caaggggacg acagcgtcgt tattgggggg aagccatatg tttttgaccg 301 tgtattcccc ccaaacacga ctcaagagca agtttatcat gcatgtgcca tgcagattgt 361 caaagatgtc cttgctggct acaatggcac catttttgct tatggacaga catcctcagg 421 gaaaacacat accatggagg gaaagctgca cgaccctcag ctgatgggaa tcattcctcg 481 aattgcccga gacatcttca accacatcta ctccatggat gagaaccttg agttccacat 541 caaggtttct tactttgaaa tttacctgga caaaattcgt gaccttctgg atgtgaccaa 601 gacaaatctg tccgtgcacg aggacaagaa ccgggtgcca tttgtcaagg gttgtactga 661 acgctttgtg tccagcccgg aggagattct ggatgtgatt gatgaaggga aatcaaatcg 721 tcatgtggct gtcaccaaca tgaatgaaca cagctctcgg agccacagca tcttcctcat 781 caacatcaag caggagaaca tggaaacgga gcagaagctc agtgggaagc tgtatctggt 841 ggacctggca gggagtgaga aggtcagcaa gactggagca gagggagccg tgctggacga 901 ggcaaagaat atcaacaagt cactgtcagc tctgggcaat gtgatctccg cactggctga 961 gggcactaaa agctatgttc catatcgtga cagcaaaatg acaaggattc tccaggactc 1021 tctcggggga aactgccgga cgactatgtt catctgttgc tcaccatcca gttataatga 1081 tgcagagacc aagtccaccc tgatgtttgg gcagcgggca aagaccatta agaacactgc 1141 ctcagtaaat ttggagttga ctgctgagca gtggaagaag aaatatgaga aggagaagga 1201 gaagacaaag gcccagaagg agacgattgc gaagctggag gctgagctga gccggtggcg 1261 caatggagag aatgtgcctg agacagagcg cctggctggg gaggaggcag ccctgggagc 1321 cgagctctgt gaggagaccc ctgtgaatga caactcatcc atcgtggtgc gcatcgcgcc 1381 cgaggagcgg cagaaatacg aggaggagat ccgccgtctc tataagcagc ttgacgacaa 1441 ggatgatgaa atcaaccaac aaagccaact catagagaag ctcaagcagc aaatgctgga 1501 ccaggaagag ctgctggtgt ccacccgagg agacaacgag aaggtccagc gggagctgag 1561 ccacctgcaa tcagagaacg atgccgctaa ggatgaggtg aaggaagtgc tgcaggccct 1621 ggaggagctg gctgtgaact atgaccagaa gtcccaggag gtggaggaga agagccagca 1681 gaaccagctt ctggtggatg agctgtctca gaaggtggcc accatgctgt ccctggagtc 1741 tgagttgcag cggctacagg aggtcagtgg acaccagcga aaacgaattg ctgaggtgct 1801 gaacgggctg atgaaggatc tgagcgagtt cagtgtcatt gtgggcaacg gggagattaa 1861 gctgccagtg gagatcagtg gggccatcga ggaggagttc actgtggccc gactctacat 1921 cagcaaaatc aaatcagaag tcaagtctgt ggtcaagcgg tgccggcagc tggagaacct 1981 ccaggtggag tgtcaccgca agatggaagt gaccgggcgg gagctctcat cctgccagct 2041 cctcatctct cagcatgagg ccaagatccg ctcgcttacg gaatacatgc agagcgtgga 2101 gctaaagaag cggcacctgg aagagtccta tgactccttg agcgatgagc tggccaagct 2161 ccaggcccag gaaactgtgc atgaagtggc cctgaaggac aaggagcctg acactcagga 2221 tgcagatgaa gtgaagaagg ctctggagct gcagatggag agtcaccggg aggcccatca 2281 ccggcagctg gcccggctcc gggacgagat caacgagaag cagaagacca ttgatgagct 2341 caaagaccta aatcagaagc tccagttaga gctagagaag cttcaggctg actacgagaa 2401 gctgaagagc gaagaacacg agaagagcac caagctgcag gagctgacat ttctgtacga 2461 gcgacatgag cagtccaagc aggacctcaa gggtctggag gagacagttg cccgggaact 2521 ccagaccctc cacaaccttc gcaagctgtt cgttcaagac gtcacgactc gagtcaagaa 2581 aagtgcagaa atggagcccg aagacagtgg ggggattcac tcccaaaagc agaagatttc 2641 ctttcttgag aacaacctgg aacagcttac aaaggttcac aaacagctgg tacgtgacaa 2701 tgcagatctg cgttgtgagc ttcctaaatt ggaaaaacga cttagggcta cggctgagag 2761 agttaaggcc ctggagggtg cactgaagga ggccaaggag ggcgccatga aggacaagcg 2821 ccggtaccag caggaggtgg accgcatcaa ggaggccgtt cgctacaaga gctcgggcaa 2881 acgggcgcat tctgcccaga ttgccaaacc cgtccggcct ggccactacc cagcatcctc 2941 acccaccaac ccctatggca cccggagccc tgagtgcatc agttacacca acagcctctt 3001 ccagaactac cagaatctct acctgcaggc cacacccagc tccacctcag atatgtactt 3061 tgcaaactcc tgtaccagca gtggagccac atcttctggc ggccccttgg cttcctacca 3121 gaaggccaac atggacaatg gaaatgccac agatatcaat gacaatagga gtgacctgcc 3181 gtgtggctat gaggctgagg accaggccaa gcttttccct ctccaccaag agacagcagc 3241 cagctaatct cccacaccca cggctgcata cctgcacttt cagtttctaa gagggactga 3301 ggcctcttct cagcatgctg caaacctgtg gtctctgata ctaactccct ccccaacccc 3361 tgttgttgga ctgtactatg tttgatgtct tctcttactt actctgtatc tctttgtact 3421 ctgtatctat atatcaaaag ctgctgctat gtctctcttc tgtcttattc tcaagtatct 3481 actgatgtat ttagcaattt caaagcatag tctaccttcc ttatttgggg caatagggag 3541 gagggtgaat gtttcttctt tctcatctac tcgtctcaca ctgagtggtg ttagtcactg 3601 agtagaggtc acagagatga caaaaggaaa aatgggagct agagggttgt gacccttcat 3661 acacacacgc acacacgcac acaaacatgc acacacgcat gcacacacac aaagccttaa 3721 gcagaagaat gtcttagcat catgagacga gaaatagact cttcctccct cctctttcac 3781 atatagcaca gaaggtaaaa tggaagggct gctaattgag acatataatt ttcggaattc // LOCUS HSU06863 1987 bp mRNA PRI 11-MAY-1995 DEFINITION Human follistatin-related protein precursor mRNA, complete cds. ACCESSION U06863 NID g536897 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1987) AUTHORS Zwijsen,A., Blockx,H., Van Arnhem,W., Willems,J., Fransen,L., Devos,K., Raymackers,J., Van de Voorde,A. and Slegers,H. TITLE Characterization of a rat C6 glioma-secreted follistatin-related protein (FRP). Cloning and sequence of the human homologue JOURNAL Eur. J. Biochem. 225 (3), 937-946 (1994) MEDLINE 95045570 REFERENCE 2 (bases 1 to 1987) AUTHORS Zwijsen,A. TITLE Direct Submission JOURNAL Submitted (17-FEB-1994) An Zwijsen, Biochemistry, University of Antwerp UIA, Universitieitsplein, Wilrijk, Antwerp, B-2610, Belgium FEATURES Location/Qualifiers source 1..1987 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Hs683" /cell_type="glioma" /tissue_type="brain" 5'UTR 1..91 sig_peptide 92..151 CDS 92..1018 /codon_start=1 /product="follistatin-related protein precursor" /db_xref="PID:g536898" /translation="MWKRWLALALALVAVAWVRAEEELRSKSKICANVFCGAGRECAV TEKGEPTCLCIEQCKPHKRPVCGSNGKTYLNHCELHRDACLTGSKIQVDYDGHCKEKK SVSPSASPVVCYQSNRDELRRRIIQWLEAEIIPDGWFSKGSNYSEILDKYFKNFDNGD SRLDSSEFLKFVEQNETAINITTYPDQENNKLLRGLCVDALIELSDENADWKLSFQEF LKCLNPSFNPPEKKCALEDETYADGAETEVDCNRCVCACGNWVCTAMTCDGKNQKGAQ TQTEEEMTRYVQELQKHQETAEKTKRVSTKEI" mat_peptide 152..1015 /product="follistatin-related protein" 3'UTR 1019..1987 BASE COUNT 522 a 482 c 498 g 485 t ORIGIN 1 gggcggggcg atcggcggag ctcccacctc cgcttacagc tcgctgccgc cgtcctgccc 61 cgcgccccca ggagacctgg accagaccac gatgtggaaa cgctggctcg cgctcgcgct 121 cgcgctggtg gcggtcgcct gggtccgcgc cgaggaagag ctaaggagca aatccaagat 181 ctgtgccaat gtgttttgtg gagccggccg ggaatgtgca gtcacagaga aaggggaacc 241 cacctgtctc tgcattgagc aatgcaaacc tcacaagagg cctgtgtgtg gcagtaatgg 301 caagacctac ctcaaccact gtgaactgca tcgagatgcc tgcctcactg gatccaaaat 361 ccaggttgat tacgatggac actgcaaaga gaagaaatcc gtaagtccat ctgccagccc 421 agttgtttgc tatcagtcca accgtgatga gctccgacgt cgcatcatcc agtggctgga 481 agctgagatc attccagatg gctggttctc taaaggcagc aactacagtg aaatcctaga 541 caagtatttt aagaactttg ataatggtga ttctcgcctg gactccagtg aattcctgaa 601 gtttgtggaa cagaatgaaa ctgccatcaa tattacaacg tatccagacc aggagaacaa 661 caagttgctt aggggactct gtgttgatgc tctcattgaa ctgtctgatg aaaatgctga 721 ttggaaactc agcttccaag agtttctcaa gtgcctcaac ccatctttca accctcctga 781 gaagaagtgt gccctggagg atgaaacgta tgcagatgga gctgagaccg aggtggactg 841 taaccgctgt gtctgtgcct gtggaaattg ggtctgtaca gccatgacct gtgacggaaa 901 gaatcagaag ggggcccaga cccagacaga ggaggagatg accagatatg tccaggagct 961 ccaaaagcat caggaaacag ctgaaaagac caagagagtg agcaccaaag agatctaatg 1021 aggaggcaca gaccagtgtc tggatcccag catcttctcc acttcagcgc tgagttcagt 1081 atacacaagt gtctgctaca gtcgccaaat caccagtatt tgcttatata gcaatgagtt 1141 ttattttgtt tatttgtttt gcaataaagg atatgaaggt ggctggctag gaagggaagg 1201 gccacagcct tcatttctag gagtgcttta agagaaactg taaatggtgc tctggggctg 1261 gaggctagta aggaaactgc atcacgattg aaagaggaac agacccaaat ctgaacctct 1321 tttgagttta ctgcatctgt cagcaggctg cagggagtgc acacgatgcc agagagaact 1381 tagcagggtg tccccggagg agaggtttgg gaagctccac ggagaggaac gctctctgct 1441 tccagcctct ttccattgcc gtcagcatga cagacctcca gcatccacgc atctcttggt 1501 cccaataact gcctctagat acatagccat actgctagtt aacccagtgt ccctcagact 1561 tggatggagt ttctgggagg gtacacccaa atgatgcaga tacttgtata ctttgagccc 1621 cttagcgacc taaccaaatt ttaaaaatac tttttaccaa aggtgctatt tctctgtaaa 1681 acactttttt tggcaagttg actttattct tcaattatta tcattatatt attgtttttt 1741 aatattttat tttcttgact aggtattaag cttttgtaat tatttttcag tagtcccacc 1801 acttcatagg tggaaggagt ttggggttct tcctggtgca ggggctgaaa taacccagat 1861 gcccccaccc tgccacatac tagatgcagc ccatagttgg cccccctagc ttccagcagt 1921 ccactatctg ccagaggagc aagggtgcct tagaccgaag ccaggggaag aagcatcctt 1981 gctcccc // LOCUS HSU06935 855 bp mRNA PRI 16-NOV-1995 DEFINITION Human thyrotroph embryonic factor (TEF) mRNA, complete cds. ACCESSION U06935 NID g606796 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 855) AUTHORS Khatib,Z.A., Inaba,T., Valentine,M. and Look,A.T. TITLE Chromosomal localization and cDNA cloning of the human DBP and TEF genes JOURNAL Genomics 23 (2), 344-351 (1994) MEDLINE 95137580 REFERENCE 2 (bases 1 to 855) AUTHORS Inaba,T. TITLE Direct Submission JOURNAL Submitted (18-FEB-1994) Toshiya Inaba, St. Jude Children's Research Hospital, 332 North Laderdale, Memphis, TN 38105-0318, USA FEATURES Location/Qualifiers source 1..855 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="UOC-B1" /cell_type="B-cell precursor" 5'UTR <1..50 gene 51..836 /gene="TEF" CDS 51..836 /gene="TEF" /codon_start=1 /function="transcription factor" /evidence=experimental /product="thyrotroph embryonic factor" /db_xref="PID:g606797" /translation="MENPPREARLDEEKGKEKLEEDEAAAASTMAVSASLMPPIWDKT IPYDGESFHLEYMDLDEFLLENGIPASPTHLAHNLLLPVAELEGKESASSSTASPPSS STAIFQPSETVSSTESSLEKERETPSPIDPNCVEVDVNFNPDPADLVLSSVPGGELFN PRKHKFAEEDLKPQPMIKKAKKVFVPDEQKDEKYWTRRKKNNVAAKRSRDARRLKENQ ITIRAAFLEKENTALRTEVAELRKEVGKCKTIVSKYETKYGPL" 3'UTR 837..>855 BASE COUNT 209 a 263 c 246 g 137 t ORIGIN 1 ctggggaaag gggcctgtcg gggtccttcc ccctggtcct gaagaagctg atggagaacc 61 ccccgcgcga ggcgcgcctc gatgaggaaa aggggaagga aaagctggag gaggacgagg 121 ccgcagccgc cagcaccatg gctgtctcag cctccctcat gccacccatc tgggacaaga 181 ccatcccata tgatggcgaa tctttccacc tggagtacat ggacctggat gagttcctgc 241 tggagaatgg catccccgcc agccccaccc acctggccca caacctgctg ctgcctgtag 301 cagagctaga agggaaggag tctgccagct cttccacagc atccccacca tcctcctcca 361 ctgccatctt tcagccctct gaaactgtgt ccagcacaga atcttccctg gagaaggaga 421 gggagactcc cagtcccatc gaccccaatt gtgtggaagt ggatgtgaac ttcaatccgg 481 accccgccga cctggtgctc tccagtgtgc caggcgggga gctcttcaac cctcggaagc 541 acaagtttgc tgaggaggac ctgaagcccc agcctatgat caaaaaggcc aagaaggtct 601 ttgtccccga cgagcagaag gatgaaaagt actggacaag acgcaagaag aacaacgtgg 661 cagctaaacg gtcacgggat gcccggcgcc tgaaagagaa tcagatcacc atccgggcag 721 ccttcctgga gaaggagaac acagccctgc ggacggaggt ggccgagcta cgcaaggagg 781 tgggcaagtg caagaccatc gtgtccaagt atgagaccaa atacgggccc ttgtaacccg 841 tgccccccgc ccggg // LOCUS HSU06936 1403 bp mRNA PRI 16-NOV-1995 DEFINITION Human albumin D-box binding protein (DBP) mRNA, complete cds. ACCESSION U06936 NID g606798 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1403) AUTHORS Khatib,Z.A., Inaba,T., Valentine,M. and Look,A.T. TITLE Chromosomal localization and cDNA cloning of the human DBP and TEF genes JOURNAL Genomics 23 (2), 344-351 (1994) MEDLINE 95137580 REFERENCE 2 (bases 1 to 1403) AUTHORS Inaba,T. TITLE Direct Submission JOURNAL Submitted (18-FEB-1994) Toshiya Inaba, St. Jude Children's Research Hospital, 332 North Laderdale, Memphis, TN 38105-0318, USA FEATURES Location/Qualifiers source 1..1403 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="UOC-B1" /cell_type="B-cell precursor" 5'UTR <1..157 gene 158..1135 /gene="DBP" CDS 158..1135 /gene="DBP" /codon_start=1 /function="transcriptional activator of albumin" /evidence=experimental /product="albumin D-box binding protein" /db_xref="PID:g606799" /translation="MARPVSDRTPAPLLLGGPAGTPPGGGALLGLRSLLQGTSKPKEP ASCLLKEKERKAALPAATTPGPGLETAGPADAPAGAVVGGGSPRGRPGPVPAPGLLAP LLWERTLPFGDVEYVDLDAFLLEHGLPPSPPPPGGPSPEPSPARTPAPSPGPGSCGSA SPRSSPGHAPARAALGTATGHRAGLTSRDTPSPVDPDTVEVLMTFEPDPADLALSSIP GHETFDPRRHRFSEEELKPQPIMKKARKIQVPEEQKDEKYWSRRYKNNEAAKRSRDAR RLKENQISVRAAFLEKENALLRQEVVAVRQELSHYRAVLSRYQAQHGAL" 3'UTR 1133..1403 polyA_signal 1385..1390 polyA_site 1403 /note="19 A residues" BASE COUNT 240 a 492 c 447 g 224 t ORIGIN 1 caaatccgtg ctcctagatt tgcaggttct gatactgtgg ttcgagctac actcgccgcc 61 tgggcagaca ctcgtccaaa ccactggagt gtgctggtga tcgcagccag cccttcgcct 121 ctccatgaac ccgtgagcct ggggcaggtg ccaggcgatg gcgcggcctg tgagcgacag 181 gaccccggcc cctctgctgc tgggcggccc ggccgggaca ccccctggcg ggggagcgct 241 gcttgggttg cggagccttc tgcaggggac cagcaagccc aaagagccgg ccagctgtct 301 cctgaaggaa aaggagcgca aggcggccct gcctgcagcc acaacccctg ggccaggcct 361 ggagactgcg ggcccggcgg atgccccggc tggggcagtg gtgggcggag ggtccccgcg 421 ggggcgcccg gggccggtgc ccgccccggg tctgttggcg ccactgctgt gggagcgcac 481 gctgccgttc ggcgatgtgg agtacgtaga cctggacgcc ttcctgctgg agcacgggct 541 cccgcccagc ccgccgcccc ccggtggccc gtcgccggag ccgtcgcccg cgcggacgcc 601 cgcaccctcc ccagggccgg gttcgtgcgg ctcggcttcc ccccgctcct ctcctgggca 661 cgcccccgcc cgggctgccc tcgggaccgc cacgggccac cgcgcaggcc tgacctctcg 721 ggacacaccc agccctgtgg acccagacac cgtggaggtg ttgatgacct ttgaacccga 781 cccagctgat cttgccctat caagcattcc tggccacgag acctttgacc ctcgaagaca 841 tcgcttctca gaagaggaac ttaagcccca gccaatcatg aagaaggcaa gaaaaatcca 901 ggtgccggag gagcagaagg atgagaaata ctggagccgg cggtacaaga acaacgaggc 961 agccaagcgg tcccgtgacg cccggcggct caaggagaac cagatatcgg tgcgggcggc 1021 cttcctggag aaggagaacg ccctgctgcg gcaggaagtt gtggccgtgc gccaggagct 1081 gtcccactac cgcgccgtgc tgtcccgata ccaggcccag cacggggccc tgtgaggctg 1141 ccccacatcc ccacctggca ggcgtctcct ccgcttgctg agacttacgc cctgttccct 1201 tcctgccctg tgcccacggg ccggccagct gggtgcccca gggacgtgat aatgcagata 1261 aatacattta tatttttaag aaaaagcgag cctcccccct cttgcggggg cggggagggt 1321 tctctgtgtg tccccggcac gtcagggacc ctatcctccc accgcctccg ttaacacgat 1381 cctgaataaa tcttgagaac ccc // LOCUS HSU07132 2010 bp mRNA PRI 01-FEB-1995 DEFINITION Human steroid hormone receptor Ner-I mRNA, complete cds. ACCESSION U07132 NID g641961 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2010) AUTHORS Shinar,D.M., Endo,N., Rutledge,S.J., Vogel,R., Rodan,G.A. and Schmidt,A. TITLE NER, a new member of the gene family encoding the human steroid hormone nuclear receptor JOURNAL Gene 147 (2), 273-276 (1994) MEDLINE 95011628 REFERENCE 2 (bases 1 to 2010) AUTHORS Golub,E.E. TITLE Direct Submission JOURNAL Submitted (23-FEB-1994) Ellis E. Golub, Biochemistry, Univ. of Pennsylvania School of Dental Medicine, 4001 Spruce Street, Philadelphia, PA 19104, USA FEATURES Location/Qualifiers source 1..2010 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="Ner" /cell_line="SAOS-2/B10" /tissue_type="osteosarcoma" CDS 245..1630 /codon_start=1 /function="steroid hormone receptor" /product="Ner-I" /db_xref="PID:g641962" /translation="MSSPTTSSLDTPLPGNGPPQPGAPSSSPTVKEEGPEPWPGGPDP DVPGTDEASSACSTDWVIPDPEEEPERKRKKGPAPKMLGHELCRVCGDKASGFHYNVL SCEGCKGFFRRSVVRGGARRYACRGGGTCQMDAFMRRKCQQCRLRKCKEAGMREQCVL SEEQIRKKKIRKQQQQESQSQSQSPVGPQGSSSSASGPGASPGGSEAGSQGSGEGEGV QLTAAQELMIQQLVAAQLQCNKRSFSDQPKVTPWPLGADPQSRDARQQRFAHFTELAI ISVQEIVDFAKQVPGFLQLGREDQIALLKASTIEIMLLETARRYNHETECITFLKDFT YSKDDFHRAGLQVEFINPIFEFSRAMRRLGLDDAEYALLIAINIFSADRPNVQEPGRV EALQQPYVEALLSYTRIKRPQDQLRFPRMLMKLVSLRTLSSVHSEQVFALRLQDKKLP PLLSEIWDVHE" misc_feature 502..706 /note="DNA binding region" /function="regulation of transcription" misc_feature 937..1627 /note="ligand binding region" BASE COUNT 408 a 655 c 614 g 333 t ORIGIN 1 caagaagtgg cgaagttacc tttgagggta tttgagtagc ggcggtgtgt caggggctaa 61 agaggaggac gaagaaaagc agagcaaggg aacccagggc aacaggagta gttcactccg 121 cgagaggccg tccacgagac ccccgcgcgc aggcatgagc cccgcccccc acgcatgagc 181 cccgcccccc gctgttgctt ggagaggggc gggacctgga gagaggctgc tccgtgaccc 241 caccatgtcc tctcctacca cgagttccct ggataccccc ctgcctggaa atggcccccc 301 tcagcctggc gccccttctt cttcacccac tgtaaaggag gagggtccgg agccgtggcc 361 cgggggtccg gaccctgatg tcccaggcac tgatgaggcc agctcagcct gcagcacaga 421 ctgggtcatc ccagatcccg aagaggaacc agagcgcaag cgaaagaagg gcccagcccc 481 gaagatgctg ggccacgagc tttgccgtgt ctgtggggac aaggcctccg gcttccacta 541 caacgtgctc agctgcgaag gctgcaaggg cttcttccgg cgcagtgtgg tccgtggtgg 601 ggccaggcgc tatgcctgcc ggggtggcgg aacctgccag atggacgctt tcatgcggcg 661 caagtgccag cagtgccggc tgcgcaagtg caaggaggca gggatgaggg agcagtgcgt 721 cctttctgaa gaacagatcc ggaagaagaa gattcggaaa cagcagcagc aggagtcaca 781 gtcacagtcg cagtcacctg tggggccgca gggcagcagc agctcagcct ctgggcctgg 841 ggcttcccct ggtggatctg aggcaggcag ccagggctcc ggggaaggcg agggtgtcca 901 gctaacagcg gctcaagaac taatgatcca gcagttggtg gcggcccaac tgcagtgcaa 961 caaacgctcc ttctccgacc agcccaaagt cacgccctgg cccctgggcg cagaccccca 1021 gtcccgagat gcccgccagc aacgctttgc ccacttcacg gagctggcca tcatctcagt 1081 ccaggagatc gtggacttcg ctaagcaagt gcctggtttc ctgcagctgg gccgggagga 1141 ccagatcgcc ctcctgaagg catccactat cgagatcatg ctgctagaga cagccaggcg 1201 ctacaaccac gagacagagt gtatcacctt cttgaaggac ttcacctaca gcaaggacga 1261 cttccaccgt gcaggcctgc aggtggagtt catcaacccc atcttcgagt tctcgcgggc 1321 catgcggcgg ctgggcctgg acgacgctga gtacgccctg ctcatcgcca tcaacatctt 1381 ctcggccgac cggcccaacg tgcaggagcc gggccgcgtg gaggcgttgc agcagcccta 1441 cgtggaggcg ctgctgtcct acacgcgcat caagaggccg caggaccagc tgcgcttccc 1501 gcgcatgctc atgaagctgg tgagcctgcg cacgctgagc tctgtgcact cggagcaggt 1561 cttcgccttg cggctccagg acaagaagct gccgcctctg ctgtcggaga tctgggacgt 1621 ccacgagtga ggggctggcc acccagcccc acagccttgc ctgaccaccc tccagcagat 1681 agacgccggc accccttcct cttcctaggg tggaaggggc cctgggcgag cctgtagacc 1741 tatcggctct catcccttgg gataagcccc agtccaggtc caggaggctc cctccctgcc 1801 cagcgagtct tccagaaggg gtgaaagggt tgcaggtccc gaccactgac ccttcccggc 1861 tgccctccct ccccagctta cacctcaagc ccagcacgca gcgtaccttg aacagaggga 1921 ggggaggacc catggctctc cccccctagc ccgggagacc aggggccttc ctcttcctct 1981 gcttttattt aataaaaata aaaacagaaa // LOCUS HSU07151 900 bp mRNA PRI 27-SEP-1994 DEFINITION Human GTP binding protein (ARL3) mRNA, complete cds. ACCESSION U07151 NID g460624 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 900) AUTHORS Cavenagh,M.M., Breiner,M., Schurmann,A., Rosenwald,A.G., Terui,T., Zhang,C., Randazzo,P.A., Adams,M., Joost,H.G. and Kahn,R.A. TITLE ADP-ribosylation factor (ARF)-like 3, a new member of the ARF family of GTP-binding proteins cloned from human and rat tissues JOURNAL J. Biol. Chem. 269 (29), 18937-18942 (1994) MEDLINE 94308153 REFERENCE 2 (bases 1 to 900) AUTHORS Kahn,R.A. TITLE Direct Submission JOURNAL Submitted (24-FEB-1994) Richard A. Kahn, National Cancer Institute, Bldg. 37; Room 5D-02, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..900 /organism="Homo sapiens" /db_xref="taxon:9606" gene 16..564 /gene="ARL3" CDS 16..564 /gene="ARL3" /standard_name="ARF-like 3" /codon_start=1 /function="GTP binding protein" /evidence=experimental /product="ARL3" /db_xref="PID:g460625" /translation="MGLLSILRKLKSAPDQEVRILLLGLDNAGKTTLLKQLASEDISH ITPTQGFNIKSVQSQGFKLNVWDIGGQRKIRPYWKNYFENTDILIYVIDSADRKRFEE TGQELAELLEEEKLSCVPVLIFANKQDLLTAAPASEIAEGLNLHTIRDRVWQIQSCSA LTGEGVQDGMNWVCKNVNAKKK" BASE COUNT 285 a 172 c 209 g 234 t ORIGIN 1 gggactcggc ggaggatggg cttgctctca attttgcgca agttgaaaag tgcaccagac 61 caggaggtga gaatacttct cctgggcttg gataatgctg gcaagaccac tcttctgaag 121 cagcttgcat ctgaagacat cagccacatc acacctacac agggtttcaa catcaaaagt 181 gtacaatcac aaggttttaa actgaatgta tgggacattg gtggacagag gaaaatcaga 241 ccatactgga agaattattt tgaaaatacc gatattctta tatatgtaat cgacagtgca 301 gacagaaaaa gatttgaaga gacgggtcag gaactagcgg aattactgga ggaagaaaaa 361 ctaagttgtg tgccagtgct catctttgct aataagcagg atttgctcac agcagcccct 421 gcctctgaaa ttgcagaagg actgaacctg cataccatcc gcgaccgagt ctggcagatc 481 cagtcttgct cagctctcac aggagagggc gttcaggatg gcatgaactg ggtctgcaaa 541 aatgtcaatg caaagaagaa ataaaatcta gacgaatgga gatgcaggag ctgcgggagc 601 cgaattcggg ccttaaaaac actaatttgc tgctttctga ccaaatgttt ttcatctgtg 661 tacactccag ctgtttgaag agagggaaca acacggttta gaaagaatcc ccattccagc 721 agtagattta actgatctct gaggttcagt atcatttttc aaataaagga attatattat 781 ttcctctgca taattgaaat agtattaaat gtctaaagca catgattaga aaatgagatc 841 ttttaaatga gcaagagatt gcattgcagt ttagacaatt ccagtgggct ttttttttcg // LOCUS HSU07158 1248 bp mRNA PRI 23-AUG-1994 DEFINITION Human syntaxin mRNA, complete cds. ACCESSION U07158 NID g463906 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1248) AUTHORS Li,H., Hodge,D.R., Pei,G.K. and Seth,A. TITLE Isolation and sequence analysis of the human syntaxin-encoding gene JOURNAL Gene 143 (2), 303-304 (1994) MEDLINE 94266173 REFERENCE 2 (bases 1 to 1248) AUTHORS Seth,A.K. TITLE Direct Submission JOURNAL Submitted (24-FEB-1994) Arun K. Seth, Laboratory of Molecular Oncology, FCRDC, National Cancer Institute, National Institutes of Health, Fort Detrick, Frederick, MD 21702-1201, USA FEATURES Location/Qualifiers source 1..1248 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pBS 1.3" /clone_lib="human placenta lambda gt11 (Clontech)" /tissue_type="placenta" CDS 67..960 /codon_start=1 /product="syntaxin" /db_xref="PID:g463907" /translation="MRDRTHELRQGDDSSDEEDKERVALVVHPGTARLGSPDEEFFHK VRTIRQTIVKLGNKVQELEKQQVTILATPLPEESMKQELQNLRDEIKQLGREIRLQLK AIEPQKEEADENYNSVNTRMRKTQHGVLSQQFVELINKCNSMQSEYREKNVERIRRQL KITNAGMVSDEELDQMLDSGQSEVFVSNILKDTQVTRQALNEISARHSEIQQLERSIR ELHDIFTFLATEVEMQGEMINRIEKNILSSADYVERGQEHVKTALENQKKVRKKKVLI AICVSITVVLLAVIIGVTVVG" polyA_signal 1205..1212 BASE COUNT 335 a 310 c 380 g 223 t ORIGIN 1 caagatatcg aattccaaat ttgagggcct cccggctctg gcgccggagg gagagctcag 61 gccgccatgc gcgacaggac ccacgagctg agacaggggg atgacagctc ggacgaagag 121 gacaaggagc gggtcgcgct ggtggtgcac ccgggcacgg cacggctggg gagcccggac 181 gaggagttct tccacaaggt ccggacaatt cgtcagacta ttgtcaaact ggggaataaa 241 gtccaggagt tggagaaaca gcaggtcacc atcctggcca cgccccttcc cgaggagagc 301 atgaagcagg agctgcagaa cctgcgcgat gagatcaaac agctggggag ggagatccgc 361 ctgcagctga aggccataga gccccagaag gaggaagctg atgagaacta taactccgtc 421 aacacaagaa tgagaaaaac ccagcatggg gtcctgtccc agcaattcgt ggagctcatc 481 aacaagtgca attcaatgca gtccgaatac cgggagaaga acgtggagcg gattcggagg 541 cagctgaaga tcaccaatgc tggcatggtg tctgatgagg agttggatca gatgctggac 601 agtgggcaaa gcgaggtgtt tgtgtccaat atccttaagg acacgcaggt gactcgacag 661 gccttaaatg agatctcggc ccggcacagt gagatccagc agcttgaacg cagtattcgt 721 gagctgcacg acatattcac ttttctggct accgaagtgg agatgcaggg ggagatgatc 781 aatcggattg agaagaacat cctgagctca gcggactacg tggaacgtgg gcaggagcac 841 gtcaagacgg ccctggagaa ccagaagaag gtgaggaaga agaaagtctt gattgccatc 901 tgtgtgtcca tcaccgtcgt cctcctagca gtcatcattg gcgtcacagt ggttggataa 961 tgtcgcacat tgttggcact aggagcacca ggaacccagg gcctggcctt ctctcccagc 1021 agcctggggg gcaggcagag cctccagtcg gaccccttcc tcacacactg gcccctatgc 1081 agaagggcag acagttcttc tggggttggc agctgctcat tcatgatggc ctcctccttc 1141 aggcctcaat gcctggggga ggcctgcact gtcctgattg gccgggacac acggttttgt 1201 aaaaaattaa aaaacaaaaa aagagcatag aaaaaaaaaa aaccgagt // LOCUS HSU07223 2453 bp mRNA PRI 17-MAR-1994 DEFINITION Human beta2-chimaerin mRNA, complete cds. ACCESSION U07223 NID g460634 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Leung,T., How,B., Manser,E. and Lim,L. TITLE Cerebellar beta2-chimaerin, a GTPase-activating protein for p21 ras-related rac is specifically expressed in granule cells and has a unique N-terminal SH2 domain JOURNAL J. Biol. Chem. (1994) In press REFERENCE 2 (bases 1 to 2453) AUTHORS Leung,T. TITLE Direct Submission JOURNAL Submitted (01-MAR-1994) Thomas Leung, Institute of Molecular and Cell Biology, National University of Singapore, Kent Ridge, Singapore, 0511 FEATURES Location/Qualifiers source 1..2453 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="human brain cDNA" /sex="female" /tissue_type="brain cerebellum" /dev_stage="2 year old" mRNA 1..2453 CDS 445..1845 /codon_start=1 /function="GTPase-activating protein (GAP)" /product="beta2-chimaerin" /db_xref="PID:g460635" /translation="MRLLSSLSGSSVSSDAEEYQPPIWKSYLYQLQQEAPRPKRIICP REVENRPKYYGREFHGIISREQADELLGGVEGAYILRESQRQPGCYTLALRFGNQTLN YRLFHDGKHFVGEKRFESIHDLVTDGLITLYIETKAAEYISKMTTNPIYEHIGYATLL REKVSRRLSRSKNEPRKTNVTHEEHTAVEKISSLVRRAALTHNDNHFNYEKTHNFKVH TFRGPHWCEYCANFMWGLIAQGVRCSDCGLNVHKQCSKHVPNDCQPDLKRIKKVYCCD LTTLVKAHNTQRPMVVDICIREIEARGLKSEGLYRVSGFTEHIEDVKMAFDRDGEKAD ISANVYPDINIITGALKLYFRDLPIPVITYDTYSKFIDAAKISNADERLEAVHEVLML LPPAHYETLRYLMIHLKKVTMNEKDNFMNAENLGIVFGPTLMRPPEDSTLTTLHDMRY QKLIVQILIENEDVLF" polyA_signal 2216 BASE COUNT 689 a 596 c 582 g 586 t ORIGIN 1 gaattccgcc tcacagtggc caggtcctgt gccagattgt ccctctcaac ctccagccgg 61 gcgctgttgg cgtgagttga tcgagccgca gccgcagctc tcgcagctca gcctggtaga 121 cgtctgccag cttggtgggc tccttggccc gcagctggtt cagcctcagc agccagcgcc 181 ttgttttgct gttccaggaa gcgaaccttc tcgatgtagc tggcaaagcg gtattgagct 241 ccatcatctc tgcccgctca ctggcccggg tctccttgaa gccagcattg agtgccccag 301 ccagggagaa atccacccgg gtcggagtgg agggggcatt cgagccaggg agaggcgggt 361 gcaggaccca gacggcggcc aggagccagg ccccccacca tcatctcccc tgaggagacg 421 taggagcggc gagcagcgga ggtgatgcgt ctcctctcca gcctgtccgg ctcgtcggtg 481 tcctccgatg ctgaagaata ccagcctcct atatggaaat catacttata tcagttacag 541 caagaggcac ctcgtcccaa gagaatcatt tgtcctcggg aggtggaaaa cagaccaaaa 601 tattatggaa gagagtttca tgggatcatc tctcgggagc aggcggatga gcttcttgga 661 ggcgtggagg gtgcctacat ccttagagaa agccagcggc aaccaggatg ctacacgctg 721 gctctcaggt ttggaaacca gaccttaaac tacaggctct tccacgacgg gaaacacttt 781 gtgggtgaga agaggtttga gtcgattcat gatctggtga cagatggctt gataacactg 841 tacatagaaa caaaagctgc cgagtacatt tcaaaaatga caactaaccc catctatgaa 901 cacattggat atgccaccct actcagagaa aaagtatcca gaaggctgag caggtctaaa 961 aatgaaccaa gaaaaacaaa cgtcacacat gaagaacaca cagcggtgga aaagatctcc 1021 tccctggttc gaagggctgc cctcacacac aacgacaacc acttcaatta tgagaagaca 1081 cacaacttta aggtccacac gttccgaggc ccacactggt gtgaatattg tgccaatttc 1141 atgtgggggc tcatcgccca aggggtccgg tgctcagact gtggattgaa cgtacacaaa 1201 cagtgttcca agcacgttcc caatgactgc caacctgatc tcaagaggat caagaaagtg 1261 tactgttgtg acctcacaac acttgtgaag gctcacaaca ctcagagacc catggtggta 1321 gacatatgca ttcgggaaat tgaagcaaga ggattaaaat cggaaggcct ttacagagtc 1381 tctgggttca ctgaacacat tgaagatgtc aaaatggcat ttgacagaga tggtgaaaag 1441 gccgatatat ctgccaatgt ctatccagac ataaacatca tcactggagc ccttaaactg 1501 tatttcagag acttacccat ccctgtcatc acatatgata cctattccaa atttatagat 1561 gcagcaaaaa tctccaatgc agatgagagg ctggaagccg tccatgaagt gctgatgctg 1621 ctgcctcctg cccactatga aaccctccgg tacctaatga tccacctcaa aaaggttact 1681 atgaatgaaa aagacaattt catgaatgca gaaaatctgg ggatcgtgtt tgggcccact 1741 ctgatgaggc cccctgagga cagcaccctg accaccctgc atgatatgcg gtaccaaaag 1801 ctgattgtgc agattttaat agaaaacgaa gacgttttat tctaatccat cagggaaatg 1861 agctgaatgg cccagcacca tcaagttgac acagctaagg ataaaacatt tcttaccact 1921 tgatttgttt tccaagcaag tgctagaatt tgctggactg cagaggatcg ctgagtgggg 1981 tactgtgtct catagacatg cgccacctcc acgtgagaac aagggtgaag gtgagggaag 2041 cccctcaggt tgggtctttt gctgtgcctc ctatgtatgt ctggtttgct ggaagagtga 2101 ttaatacatc tttaatttat taaaaaacaa tgtagacctt taaacttcag tcttattggg 2161 aataaaaggg aacttaattc atacaggtac ttgatacagt tatacatttt ccacttacaa 2221 aaagaagaca attctgttaa atgaaacgtg tatcgtaaaa tgtaatttta tttacccacg 2281 agaatgttgt tattttagca atagaactca atgcagatgc attggttatt accctgtgta 2341 ccttgtccct cattttgctg tgacaccctg aaaaagctga ccacaaatgc agtattatca 2401 ttgacatacc tctgtcctcc tcagtgcttt ttaatgtaat ttcaccggaa ttc // LOCUS HSU07225 2025 bp mRNA PRI 28-OCT-1997 DEFINITION Human P2U nucleotide receptor mRNA, complete cds. ACCESSION U07225 S74902 NID g984506 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2025) AUTHORS Parr,C.E., Sullivan,D.M., Paradiso,A.M., Lazarowski,E.R., Burch,L.H., Olsen,J.C., Erb,L., Weisman,G.A., Boucher,R.C. and Turner,J.T. TITLE Cloning and expression of a human P2U nucleotide receptor, a target for cystic fibrosis pharmacotherapy [published erratum appears in Proc Natl Acad Sci U S A 1994 Dec 20;91(26):13067] JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91 (8), 3275-3279 (1994) MEDLINE 94211846 REFERENCE 2 (bases 1 to 2025) AUTHORS Parr,C.E. TITLE Direct Submission JOURNAL Submitted (01-MAR-1994) Claude E. Parr, University of North Carolina, Dept. of Medicine, Pulmonary Diseases, 724 Burnett-Womack Bldg., Chapel Hill, NC 27599, USA FEATURES Location/Qualifiers source 1..2025 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="5A2116" /cell_line="CF/T43" /cell_type="epithelial cell" /tissue_type="airway epithelium" 5'UTR 1..245 CDS 246..1379 /note="P2U receptor; homologous to mouse ATP receptor, GenBank Accession Number L14751" /codon_start=1 /function="G protein-coupled surface membrane receptor" /product="P2U nucleotide receptor" /db_xref="PID:g984507" /translation="MAADLGPWNDTINGTWDGDELGYRCRFNEDFKYVLLPVSYGVVC VLGLCLNAVALYIFLCRLKTWNASTTYMFHLAVSDALYAASLPLLVYYYARGDHWPFS TVLCKLVRFLFYTNLYCSILFLTCISVHRCLGVLRPLRSLRWGRARYARRVAGAVWVL VLACQAPVLYFVTTSARGGRVTCHDTSAPELFSRFVAYSSVMLGLLFAVPFAVILVCY VLMARRLLKPAYGTSGGLPRAKRKSVRTIAVVLAVFALCFLPFHVTRTLYYSFRSLDL SCHTLNAINMAYKVTRPLASANSCLDPVLYFLAGQRLVRFARDAKPPTGPSPATPARR RLGLRRSDRTDMQRIGDVLGSSEDFRRTESTPAGSENTKDIRL" 3'UTR 1380..2017 BASE COUNT 388 a 616 c 596 g 425 t ORIGIN 1 cggcacgagg caccccgaga ggagaagcgc agcgcagtgg cgagaggagc cccttgtggc 61 agcagcacta cctgcccaga aaaatgctgg aggctgggcg tggccccagg cctggggacc 121 tgtttttcct gtttcccgca gagttccctg cagcccggtc caggtccagg cgtgtgcatt 181 catgagtgag gaacccgtgc aggcgctgag catcctgacc tggagagcag gggctggtca 241 gggcgatggc agcagacctg ggcccctgga atgacaccat caatggcacc tgggatgggg 301 atgagctggg ctacaggtgc cgcttcaacg aggacttcaa gtacgtgctg ctgcctgtgt 361 cctacggcgt ggtgtgcgtg cttgggctgt gtctgaacgc cgtggcgctc tacatcttct 421 tgtgccgcct caagacctgg aatgcgtcca ccacatatat gttccacctg gctgtgtctg 481 atgcactgta tgcggcctcc ctgccgctgc tggtctatta ctacgcccgc ggcgaccact 541 ggcccttcag cacggtgctc tgcaagctgg tgcgcttcct cttctacacc aacctttact 601 gcagcatcct cttcctcacc tgcatcagcg tgcaccggtg tctgggcgtc ttacgacctc 661 tgcgctccct gcgctggggc cgggcccgct acgctcgccg ggtggccggg gccgtgtggg 721 tgttggtgct ggcctgccag gcccccgtgc tctactttgt caccaccagc gcgcgcgggg 781 gccgcgtaac ctgccacgac acctcggcac ccgagctctt cagccgcttc gtggcctaca 841 gctcagtcat gctgggcctg ctcttcgcgg tgccctttgc cgtcatcctt gtctgttacg 901 tgctcatggc tcggcgactg ctaaagccag cctacgggac ctcgggcggc ctccctaggg 961 ccaagcgcaa gtccgtgcgc accatcgccg tggtgctggc tgtcttcgcc ctctgcttcc 1021 tgccattcca cgtcacccgc accctctact actccttccg ctcgctggac ctcagctgcc 1081 acaccctcaa cgccatcaac atggcctaca aggttacccg gccgctggcc agtgctaaca 1141 gttgccttga ccccgtgctc tacttcctgg ctgggcagag gctcgtacgc tttgcccgag 1201 atgccaagcc acccactggc cccagccctg ccaccccggc tcgccgcagg ctgggcctgc 1261 gcagatccga cagaactgac atgcagagga taggagatgt gttgggcagc agtgaggact 1321 tcaggcggac agagtccacg ccggctggta gcgagaacac taaggacatt cggctgtagg 1381 agcagaacac ttcagcctgt gcaggtttat attgggaagc tgtagaggac caggacttgt 1441 gcagacgcca cagtctcccc agatatggac catcagtgac tcatgctgga tgaccccatg 1501 ctccgtcatt tgacaggggc tcaggatatt cactctgtgg tccagagtca actgttccca 1561 taacccctag tcatcgtttg tgtgtataag ttgggggaat taagtttcaa gaaaggcaag 1621 agctcaaggt caatgacacc cctggcctga ctcccatgca agtagctggc tgtactgcca 1681 aggtacctag gttggagtcc agcctaatca agtcaaatgg agaaacaggc ccagagagga 1741 aggtggctta ccaagatcac ataccagagt ctggagctga gctacctggg gtgggggcca 1801 agtcacaggt tggccagaaa accctggtaa gtaatgaggg ctgagtttgc acagtggtct 1861 ggaatggact gggtgccacg gtggacttag ctctgaggag tacccccagc ccaagagatg 1921 aacatctggg gactaatatc atagacccat ctggaggctc ccatgggcta ggagcagtgt 1981 gaggctgtaa cttatactaa aggttgtgtt gcctgctaaa aaaaa // LOCUS HSU07231 2690 bp RNA PRI 09-JAN-1997 DEFINITION Human G-rich sequence factor-1 (GRSF-1) mRNA, complete cds. ACCESSION U07231 NID g517195 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2690) AUTHORS Qian and Wilusz,J. TITLE GRSF-1: a poly(A)+ mRNA binding protein which interacts with a conserved G-rich element JOURNAL Nucleic Acids Res. 22 (12), 2334-2343 (1994) MEDLINE 94310062 REFERENCE 2 (bases 1 to 2690) AUTHORS Wilusz,J. TITLE Direct Submission JOURNAL Submitted (01-MAR-1994) Jeff Wilusz, Microbiology and Molecular Genetics, University of Medicine and Dentistry of New Jersey, 185 S. Orange Ave., Newark, NJ 07103, USA FEATURES Location/Qualifiers source 1..2690 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Hela cell S3 cDNA expression library cloned in the Uni-ZAP XR from Strategene" /cell_line="Hela S3" 5'UTR 1..61 gene 62..1336 /gene="GRSF-1" CDS 62..1336 /gene="GRSF-1" /codon_start=1 /function="RNA binding protein" /product="G-rich sequence factor-1" /db_xref="PID:g517196" /translation="MAGTRWVLGALLRGCGCNCSSCRRTGAACLPFYSAASYPALRAS LLPQSLAAAAAVPTRSYSQESKTTYLEDLPPPPEYELAPSKLEEEVDDVFLIRAQGLP WSCTMEDVLNFFSDCRIRNGENGIHFLLNRDGKRRGDALIEMESEQDVQKALEKHRMY MGQRYVEVYEINNEDVDALMKSLQVKSSPVVNDGVVRLRGLPYSCNEKDIVDFFAGLN IVDITFVMDYRGRRKTGEAYVQFEEPEMANQALLKHREEIGNRYIEIFPSRRNEVRTH VGSYKGKKIASFPTAKYITEPEMVFEEHEVNEVFQPMTAFESEKEIELPKEVPEKLPE AADFGTTSSLHFVHMRGLPFQANAQDIINFFAPLKPVRITMEYSSSGKATGEADVHFE THEDAVAAMLKDRSHVHHRYIELFLNSCPKGK" 3'UTR 1337..2690 polyA_signal 1540..1545 polyA_signal 2665..2670 polyA_site 2690 BASE COUNT 774 a 513 c 615 g 788 t ORIGIN 1 ggcacgagca ccatcgctgc tggagcagct gccttcaggc cctgcgccgc ctccggagtc 61 catggccggc acgcgctggg tactcggggc gctgctccgg ggctgcggct gtaactgcag 121 cagctgccgg cgcaccggcg ccgcctgcct gcccttctac tccgccgcgt cctaccctgc 181 cctccgtgcc tctctgctgc cgcagtcgct ggcggcggcg gccgccgtcc cgacgcgcag 241 ctacagccag gagtccaaaa ctacttacct ggaagacctt ccaccacccc ctgagtatga 301 attggccccg tccaagttag aagaggaagt ggatgatgtc tttctcattc gagctcaagg 361 actgccctgg tcatgcacta tggaagatgt gcttaacttt ttttcagact gcagaatccg 421 caacggtgag aatggaatac attttctcct aaacagagat gggaaacgaa ggggtgatgc 481 cttaattgaa atggagtcag agcaggatgt gcagaaagcc ttagagaagc accgcatgta 541 catgggccag cggtatgtgg aagtatatga gataaacaat gaagatgtgg atgccttaat 601 gaagagcttg caggtcaaat cttcgcctgt ggtaaatgat ggtgtggttc gtttgagagg 661 acttccttat agttgcaatg agaaagacat tgtagacttc tttgcaggac tgaatatagt 721 tgacattact tttgtgatgg actatagagg gaggcgaaaa acaggggaag cctatgtgca 781 atttgaagaa ccagaaatgg ccaaccaagc cctgttgaaa cacagggaag aaattggtaa 841 tcgatacatc gagatatttc caagcagaag gaatgaagtt cgaacacatg tcggttctta 901 taagggaaag aaaatcgcat cttttcctac tgctaagtat ataactgagc cagaaatggt 961 ctttgaagaa catgaagtaa atgaggtatt tcaacccatg acagcttttg aaagtgagaa 1021 ggaaatagaa ttgcctaagg aggtgccaga aaagcttcca gaggctgctg attttggaac 1081 tacgtcttct ctgcattttg tccacatgag aggattacct ttccaagcca atgcccaaga 1141 cattataaac ttttttgctc cactcaagcc tgttagaatc accatggaat acagctccag 1201 tgggaaggcc actggagaag ctgatgtgca ctttgagacc catgaggatg ctgttgcagc 1261 gatgctcaag gatcggtccc acgttcatca taggtatatt gaactgttcc tgaattcatg 1321 tccaaaagga aaataagact ctaggggctc cagataataa gggtgaagca agaagcattt 1381 catttgcaca tctttcttgg acttgggata tacagttcca gtttattagc agcaactgct 1441 agggaaatga ttttggtgtt ttgggttaat tgcttctaag aaaagtttca tagtggactg 1501 tttagaagaa gaaatgaaag atccagtttg ggattatgaa ataaaccaca aattaaaatt 1561 tttgtttaaa ctgtccagga tctgatttaa aaatatggtc tttgttttat atgattaaat 1621 ggtttgtttt catagatgat atgttactca ttgtaaagac cacatatttt tattcagcag 1681 tgttctttaa acggtttcat ttaaaaagta actttttttt tttgcctgtg aattgagtgc 1741 tctgatgtaa aacttctcat ggagtgaaac agtgatttat tttaaccaaa cattcaccaa 1801 agcaaagaac ggtttcagac ctttgaactg gtatggtttg gcagaatagt tttaaatttt 1861 gctgtatttg attacttaga gataggaatt tttaaaaatc aaaacaaaaa ataccacagc 1921 ttagtgtaaa tgacaatttg gcggttttat gtctttagaa atgttttgcc tttctaagcc 1981 ttgtgctaaa ggcgtataac ggtggtgcct atctacttaa gggggcattc tagtcttaac 2041 ttaaaagttg tctaaactgt ccctccctgg ctttttttgg tttggggtag acctaagggt 2101 gtttgttagt ctcaaaactg tgaagtgaca tgtcagaaca gtccagactg gtaagaaaat 2161 taatggcttc acttgaattt aaaccagctc tagataggaa aaaaatcagt ctcctcattt 2221 gctttttaaa tggagtagta catcccatat tttagaacaa gtaggggtgc cttgcttaaa 2281 taaaaatagc atttaatgta taattgtgtg aagggtttat ggataaagct gtacttctgt 2341 cacaatgtgg cagtactttc tgctttaata ttaaacagct tgttatttaa atattggaca 2401 aaatggctgg cttcaaaata tagtcattaa taaactaact ttatgtgcac ctgtgtagga 2461 gaatcaaaat cctgtatgct ttctttgcct tgttcctgtt ctcagggtga cgactgccac 2521 caggagatgc agttctagtt cttaaaatta aatttgccca ggtttctgac aggtgatacc 2581 tggaagagag actatgtctt ctcttactta atacataacc atctttgatt accagctaag 2641 atgcgaaatc actgtactgt agtcaataaa tgaagacttg tttcaggctg // LOCUS HSU07349 2906 bp mRNA PRI 23-AUG-1994 DEFINITION Human B lymphocyte serine/threonine protein kinase mRNA, complete cds. ACCESSION U07349 NID g531819 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2906) AUTHORS Katz,P., Whalen,G. and Kehrl,J.H. TITLE Differential expression of a novel protein kinase in human B lymphocytes. Preferential localization in the germinal center JOURNAL J. Biol. Chem. 269 (24), 16802-16809 (1994) MEDLINE 94266900 REFERENCE 2 (bases 1 to 2906) AUTHORS Kehrl,J. TITLE Direct Submission JOURNAL Submitted (03-MAR-1994) John H. Kehrl, NIAID, National Institutes of Health, 9000 Rockville Pike, Bethesda, MD 20894, USA FEATURES Location/Qualifiers source 1..2906 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="B lymphocyte" /tissue_type="tonsil" CDS 40..2499 /codon_start=1 /function="serine/threonine protein kinase" /product="GC kinase" /db_xref="PID:g531820" /translation="MELRDVSLQDPRDRFELLQRVGAGTYGDVYKARDTVTSELAAVK IVKLDPGDDISSLQQEITILRECRHPNVVAYIGSYLRNDRLWICMEFCGGGSLQEIYH ATGPLEERQIAYVCRERLKGLHHLHSQGKIHRDIKGANLLLTLQGDVKLADFGVSGEL TASVAKRRSFIGTPYWMAPEVAAVERKGGYNELCDVWALGITAIELGELQPPLFHLHP MRALMLMSKSSFQPPKLRDKTRWTQNFHHFLKLALTKNPKKRPTAEKLLQHPFTTQQL PRALLTQLLDKASDPHLGTPSPEDCELETYDMFPDTIHSRGQHGPAERTPSEIQFHQV KFGAPRRKETDPLNEPWEEEWTLLGKEELSGSLLQSVQEALEERSLTIRSASEFQELD SPDDTMGTIKRAPFLGPLPTDPPAEEPLSSPPGTLPPPPSGPNSSPLLPTAWATMKQR EDPERSSCHGLPPTPKVHMGACFSKVFNGCPLRIHAAVTWIHPVTRDQFLVVGAEEGI YTLNLHELHEDTLEKLISHRCSWLYCVNNVLLSLSGKSTHIWAHDLPGLFEQRRLQQQ VPLSIPTNRLTQRIIPRRFALSTKIPDTKGCLQCRVVRNPYTGATFLLAALPTSLLLL QWYEPLQKFLLLKNFSSPLPSPAGMLEPLVLDGKELPQVCVGAEGPEGPGCRVLFHVL PLEAGLTPDILIPPEGIPGSAQQVIQVDRDTILVSFERCVRIVNMQGEPTATLAPELT FDFPIETVVCLQDSVLAFWSHGMQGRSLDTNEVTQEITDETRIFRVLGAHRDIILESI PTDNPEAHSNLYILTGHQSTY" BASE COUNT 597 a 941 c 830 g 538 t ORIGIN 1 gctccggccc gccccgctgc ccggcccgcg cgccgggcca tggagctgcg ggatgtgtcg 61 ctgcaggacc cgcgggaccg cttcgagctg ctgcagcgcg tgggggccgg gacctatggc 121 gacgtctaca aggcccgcga cacggtcacg tccgaactgg ccgccgtgaa gatagtcaag 181 ctagacccag gggacgacat cagctccctc cagcaggaaa tcaccatcct gcgtgagtgc 241 cgccacccca atgtggtggc ctacattggc agctacctca ggaatgaccg cttgtggatc 301 tgcatggagt tctgcggagg gggctccctg caggagattt accatgccac tgggcccctg 361 gaggagcggc agattgccta cgtctgccga gagcgactga aggggctcca ccacctgcat 421 tctcagggga agatccacag agacatcaag ggagccaacc ttctcctcac tctccaggga 481 gatgtcaaac tggctgactt tggggtgtca ggcgagctga cagcgtctgt ggccaagagg 541 aggtctttca ttgggactcc ctactggatg gctcccgagg tggctgctgt ggagcgcaaa 601 ggtggctaca atgagctatg tgacgtctgg gccctgggca tcactgccat tgagctgggc 661 gagctgcagc cccctctgtt ccacctgcac cccatgaggg ccctgatgct catgtcgaag 721 agcagcttcc agccgcccaa actgagagat aagactcgct ggacccagaa tttccaccac 781 tttctcaaac tggccctgac caagaatcct aagaagaggc cgacagcaga gaagctcctg 841 cagcacccgt tcacgactca gcagctccct cgggccctcc tcacacagct gctggacaaa 901 gccagtgacc ctcatctggg gaccccctcc cctgaggact gtgagctgga gacctatgac 961 atgtttccag acaccattca ctcccggggg cagcacggcc cagccgagag gaccccctcg 1021 gagatccagt ttcaccaggt gaaatttggc gccccacgca ggaaggaaac tgacccactg 1081 aatgagccgt gggaggaaga gtggacacta ctgggaaagg aagagttgag tgggagcctg 1141 ctgcagtcgg tccaggaggc cctggaggaa aggagtctga ctattcggtc agcctcagaa 1201 ttccaggagc tggactcccc agacgatacc atgggaacca tcaagcgggc cccgttccta 1261 gggccactcc ccactgaccc tccagcagag gagcctctgt ccagtccccc aggaaccctg 1321 cccccacctc cttcaggccc caacagctcc ccactgctgc ccacggcctg ggccaccatg 1381 aagcagcggg aggatcctga gaggtcatcc tgccacgggc tccccccaac tcccaaggtg 1441 catatgggcg cctgcttctc caaggtcttc aatggctgcc ccctgcggat ccacgctgct 1501 gtcacctgga ttcaccctgt tactcgggac cagttcctgg tggtaggggc cgaggaaggc 1561 atctacacac tcaacctgca tgaactgcat gaggatacgc tggagaagct gatttcacat 1621 cgctgctcct ggctctactg cgtgaacaac gtgctgctgt cactctcagg gaaatccacg 1681 cacatctggg cccatgacct cccaggcctg tttgagcagc ggaggctaca gcaacaggtt 1741 cccctctcca tccccaccaa ccgcctcacc cagcgcatca tccccaggcg ctttgctctg 1801 tccaccaaga ttcctgacac caaaggctgc ttgcagtgtc gtgtggtgcg gaacccctac 1861 acgggtgcca ccttcctgct ggccgccctg cccaccagcc tgctcctgct gcagtggtat 1921 gagccgctgc agaagtttct gctgctgaag aacttctcca gccctctgcc cagcccagct 1981 gggatgctgg agccgctggt gctggatggg aaggagctgc cgcaggtgtg tgttggggcc 2041 gaggggcctg aggggcccgg ctgccgcgtc ctgttccatg tcctgcccct ggaggctggc 2101 ctgacgcccg acatcctcat cccacctgag gggatcccag gctcggccca gcaggtgatc 2161 caggtggaca gggacacaat cctagtcagc tttgaacgct gtgtgaggat tgtcaacatg 2221 cagggcgagc ccacggccac actggcacct gagctgacct ttgatttccc catcgagact 2281 gtggtgtgcc tgcaggacag tgtgctggcc ttctggagcc atgggatgca aggccgaagc 2341 ctggatacca atgaggtgac ccaggagatc acagatgaaa caaggatctt ccgagtgctt 2401 ggggcccaca gagacatcat cctggagagc attcccactg acaacccaga ggcgcacagc 2461 aacctctaca tcctcacggg ccaccagagc acctactaag agcagcgggc ctgtccaggc 2521 tccccgcccc accccacgcc ttagctgcag gcccttttgg gcaaaggggc ccatcctaga 2581 ccagaggagc ccaggccctg gccctgctgg ggctgaaggt cagaagtaat cctgagaaat 2641 gtttcaggcc tggggaggga ggggagcccc cgacgcctct gcaataactg gaccaggggg 2701 agctgctgtc actcccccat ccccgaggca gcccagtccc tagtgcccaa ggcagggacc 2761 ctgggcctgg gccatccatt ccattttgtt ccacatttcc tttctactct ttctgccaag 2821 agcctgcccc tgcatttgtc ctgggaaaca cggtatttaa gagagaacta tattggtatt 2881 aaagctggtt tgttttaaaa aaaaaa // LOCUS HSU07358 3365 bp mRNA PRI 25-MAY-1995 DEFINITION Human protein kinase (zpk) mRNA, complete cds. ACCESSION U07358 NID g561542 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3365) AUTHORS Reddy,U.R. and Pleasure,D. TITLE Cloning of a novel putative protein kinase having a leucine zipper domain from human brain [published erratum appears in Biochem Biophys Res Commun 1994 Dec 15;205(2):1494-5] JOURNAL Biochem. Biophys. Res. Commun. 202 (1), 613-620 (1994) MEDLINE 94311945 REFERENCE 2 (bases 1 to 3365) AUTHORS Reddy,U.R. TITLE Direct Submission JOURNAL Submitted (03-MAR-1994) Usha R. Reddy, Neurology Research, Children's Hospital of Philadelphia, 34th Civic Centre Blvd., Philadelphia, PA 19104, USA FEATURES Location/Qualifiers source 1..3365 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="NTera2/D1" /cell_type="teratocarcinoma cells" 5'UTR 1..98 gene 99..2678 /gene="zpk" CDS 99..2678 /gene="zpk" /note="putative" /codon_start=1 /product="serine/threonine protein kinase" /db_xref="PID:g561543" /translation="MACLHETRTPSPSFGGFVSTLSEASMRKLDPDTSDCTPEKDLTP THVLQLHEQDAGGPGGAAGSPESRASRVRADEVRLQCQSGSGFLEGLFGCLRPVWTMI GKAYSTEHKQQQEDLWEVPFEEILDLQWVGSGAQGAVFLGRFHGEEVAVKKVRDLKET DIKHLRKLKHPNIITFKGVCTQAPCYCILMEFCAQGQLYEVLRAGRPVTPSLLVDWSM GIAGGMNYLHLHKIIHRDLKSPNMLITYDDVVKISDFGTSKELSDKSTKMSFAGTVAW MAPEVIRNEPVSEKVDIWSFGVVLWELLTGEIPYKDVDSSAIIWGVGSNSLHLPVPSS CPDGFKILLRQCWNSKPRNRPSFRQILLHLDIASADVLSTPQETYFKSQAEWREEVKL HFEKIKSEGTCLHRLEEELVMRRREELRHALDIREHYERKLERANNLYMELNALMLQL ELKERELLRREQALERRCPGLLKPHPSRGLLHGNTMEKLIKKRNVPQNLSPHSQRPDI LKAESLLPKLDAALSGVGLPGCPKAPPSPGRSRRGKTRHRKASAKGSCGDLPGLRTAV PPHEPGGPGSPGGLGGGPSAWEACPPALRGLHHDLLLRKMSSSSPDLLSAALGSRGRG ATGGAGDPGSPPPARGDTPPSEGSAPGSTSPDSPGGAKGEPPPPVGPGEGVGLLGTGR EGTSGRGGSRAGSQHLTPAALLYRAAVTRSQKRGISSEEEEGEVDSEVELTSSQRWPQ SLNMRQSLSTFSSENPSDGEEGTASEPSPSGTPEVGSTNTDERPDERSDDMCSQGSEI PLDPPPSEVIPGPEPSSLPIPHQELLRERGPPNSEDSDCDSTELDNSNSVDALRPPAS LPP" misc_feature 443..469 /gene="zpk" /note="leucine-zipper" 3'UTR 2679..3365 polyA_signal 3347..3352 polyA_site 3365 /note="23 A residues" BASE COUNT 777 a 951 c 963 g 674 t ORIGIN 1 agcatccgga gcggagctgc agcagcgccg ccttttgtgc tgcggccgcg gagcccccga 61 gggcccagtg ttcaccatca taccaggggc cagaggcgat ggcttgcctc catgagaccc 121 gaacaccctc tccttccttt gggggctttg tgtctaccct aagtgaggca tccatgcgca 181 agctggaccc agacacttct gactgcactc ccgagaagga cctgacgcct acccatgtcc 241 tgcagctaca tgagcaggat gcagggggcc cagggggagc agctgggtca cctgagagtc 301 gggcatccag agttcgagct gacgaggtgc gactgcagtg ccagagtggc agtggcttcc 361 ttgagggcct ctttggctgc ctgcgccctg tctggaccat gattggcaaa gcctactcca 421 ctgagcacaa gcagcagcag gaagaccttt gggaggtccc ctttgaggaa atcctggacc 481 tgcagtgggt gggctcaggg gcccagggtg ctgtcttcct ggggcgcttc cacggggagg 541 aggtggctgt gaagaaggtg cgagacctca aagaaaccga catcaagcac ttgcgaaagc 601 tgaagcaccc caacatcatc actttcaagg gtgtgtgcac ccaggctccc tgctactgca 661 tcctcatgga gttctgcgcc cagggccagc tgtatgaggt actgcgggct ggccgccctg 721 tcaccccctc cttactggtt gactggtcca tgggcatcgc tggtggcatg aactacctgc 781 acctgcacaa gattatccac agggatctca agtcacccaa catgctaatc acctacgacg 841 atgtggtgaa gatctcagat tttggcactt ccaaggagct gagtgacaag agcaccaaga 901 tgtcctttgc agggacagta gcctggatgg cccctgaggt gatccgcaat gaacctgtgt 961 ctgagaaggt cgacatctgg tcctttggcg tggtgctatg ggaactgctg actggtgaga 1021 tcccctacaa agacgtagat tcctcagcca ttatctgggg tgtgggaagc aacagtctcc 1081 atctgcccgt gccctccagt tgcccagatg gtttcaagat cctgcttcgc cagtgctgga 1141 atagcaaacc acgaaatcgc ccatcattcc gacagatcct gctgcatctg gacattgcct 1201 cagctgatgt actctccaca ccccaggaga cttactttaa gtcccaggca gagtggcggg 1261 aagaagtaaa actgcacttt gaaaagatta agtcagaagg gacctgtctg caccgcctag 1321 aagaggaact ggtgatgagg aggagggagg agctcagaca cgccctggac atcagggagc 1381 actatgaaag gaagctggag agagccaaca acctgtatat ggaacttaat gccctcatgt 1441 tgcagctgga actcaaggag agggagctgc tcaggcgaga gcaagcttta gagcggaggt 1501 gcccaggcct gctgaagcca cacccttccc ggggcctcct gcatggaaac acaatggaga 1561 agcttatcaa gaagaggaat gtgccacaga atctgtcacc ccatagccaa aggccagata 1621 tcctcaaggc ggagtctttg ctccctaaac tagatgcagc cctgagtggg gtggggcttc 1681 ctgggtgtcc taaggccccc ccctcaccag gacggagtcg ccgtggcaag acccgtcacc 1741 gcaaggccag cgccaagggg agctgtgggg acctgcctgg gcttcgtaca gctgtgccac 1801 cccatgaacc tggaggacca ggaagcccag ggggcctagg agggggaccc tcagcctggg 1861 aggcctgccc tcccgccctc cgtgggcttc atcatgacct cctgctccgc aaaatgtctt 1921 catcgtcccc agacctgctg tcagcagcac tagggtcccg gggccggggg gccacaggcg 1981 gagctgggga tcctggctca ccacctccgg cccggggtga caccccacca agtgagggct 2041 cagcccctgg ctccaccagc ccagattcac ctgggggagc caaaggggaa ccacctcctc 2101 cagtagggcc tggtgaaggt gtggggcttc tgggaactgg aagggaaggg acctcaggcc 2161 ggggaggaag ccgggctggg tcccagcact tgaccccagc tgcactgctg tacagggctg 2221 ccgtcacccg aagtcagaaa cgtggcatct catcggaaga ggaggaagga gaggtagaca 2281 gtgaagtaga gctgacatca agccagaggt ggcctcagag cctgaacatg cgccagtcac 2341 tatctacctt cagctcagag aatccatcag atggggagga aggcacagct agtgaacctt 2401 cccccagtgg cacacctgaa gttggcagca ccaacactga tgagcggcca gatgagcggt 2461 ctgatgacat gtgctcccag ggctcagaaa tcccactgga cccacctcct tcagaggtca 2521 tccctggccc tgaacccagc tccctgccca ttccacacca ggaacttctc agagagcggg 2581 gccctcccaa ttctgaggac tcagactgtg acagcactga attggacaac tccaacagcg 2641 ttgatgcctt gcgcccccca gcttccctcc ctccatgaaa gccactcgta ttccttgtac 2701 atagagaaat atttatatgg attatatata tatacatata tatatatata tgcgccacat 2761 aatcaacaga aagatggggc tgtcccagcc gtaagtcagg ctcgagggag actgatcccc 2821 tgaccaattc acctgataaa ctctagggac actggcagct gtggaaatga atgaggcaca 2881 gccgtagagc tgtggctaag ggcaagcccc ttcctgcccc accccattcc ttatattcag 2941 caagcaacaa ggcaatagaa aagccagggt tgtctttata ttctttatcc ccaaataata 3001 gggggtgggg ggaggggcgg tgggaggggc aggagagaaa accacttaga ctgcactttt 3061 ctgttccgtt tactctgttt acacattttg cacttgggag gagggaggct aaggctgggt 3121 cctcccctct gaggtttctc aggtggcaat gtaactcatt tttttgtccc accatttatc 3181 ttctctgccc aagccctgtc ttaaggccca gggggaggtt aggagactga tagcatgtga 3241 tggctcaggc tgaagaaccg gggttctgtt taagtccctg cttttatcct ggtgcctgat 3301 tggggtgggg actgtcctac tgtaacccct gtgaaaaacc ttgaaaaata acactccatg 3361 cagga // LOCUS HSU07361 1808 bp mRNA PRI 11-MAY-1995 DEFINITION Homo sapiens sorbitol dehydrogenase gene, complete cds. ACCESSION U07361 NID g805074 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Lee,F.K., Cheung,M.C. and Chung,S. TITLE The human sorbitol dehydrogenase gene: cDNA cloning, sequence determination, and mapping by fluorescence in situ hybridization JOURNAL Genomics 21 (2), 354-358 (1994) MEDLINE 94375058 REFERENCE 2 (bases 1 to 1808) AUTHORS Lee,F. TITLE Direct Submission JOURNAL Submitted (04-MAR-1994) Fuk K. Lee, Institute of Molecular Biology, The University of Hong Kong, 3/F Li Shu Fan Building, 5 Sassoon Road, Hong Kong FEATURES Location/Qualifiers source 1..1808 /organism="Homo sapiens" /strain="Caucasian" /db_xref="taxon:9606" /sex="male" /tissue_type="liver" /dev_stage="adult" CDS 142..1215 /standard_name="sord" /EC_number="1.1.1.14" /codon_start=1 /evidence=experimental /product="sorbitol dehydrogenase" /db_xref="PID:g520450" /translation="MAAAAKPNNLSLVVHGPGDLRLENYPIPEPGPNEVLLRMHSVGI CGSDVHYWEYGRIGNFIVKKPMVLGHEASGTVEKVGSSVKHLKPGDRVAIEPGAPREN DEFCKMGRYNLSPSIFFCATPPDDGNLCRFYKHNAAFCYKLPDNVTFEEGALIEPLSV GIHACRRGGVTLGHKVLVCGAGPIGMVTLLVAKAMGAAQVVVTDLSATRLSKAKEIGA DLVLQISKESPQEIARKVEGQLGCKPEVTIECTGAEASIQAGIYATRSGGTLVLVGLG SEMTTVPLLHAAIREVDIKGVFRYCNTWPVAISMLASKSVNVKPLVTHRFPLEKALEA FETFKKGLGLKIMLKCDPSDQNP" polyA_site 1808 /note="14 A nucleotides" BASE COUNT 442 a 449 c 495 g 422 t ORIGIN 1 gcctagtccc gcccctgcgt gcggcgcttc tcccaggccc caccttccat ccagtgccct 61 ggaccctcgg ctgggtagcg ccaccagagc gaccaaacgt cccgcgcctt ccaggccgca 121 ctccagagcc aaaagagctc catggcggcg gcggccaagc ccaacaacct ttccctggtg 181 gtgcacggac cgggggactt gcgcctggag aactatccta tccctgaacc aggcccaaat 241 gaggtcttgc tgaggatgca ttctgttgga atctgtggct cagatgtcca ctactgggag 301 tatggtcgaa ttgggaattt tattgtgaaa aagcccatgg tgctgggaca tgaagcttcg 361 ggaacagtcg aaaaagtggg atcatcggta aagcacctaa aaccaggtga tcgtgttgcc 421 atcgagcctg gtgctccccg agaaaatgat gaattctgca agatgggccg atacaatctg 481 tcaccttcca tcttcttctg tgccacgccc cccgatgacg ggaacctctg ccggttctat 541 aagcacaatg cagccttttg ttacaagctt cctgacaatg tcacctttga ggaaggcgcc 601 ctgatcgagc cactttctgt ggggatccat gcctgcagga gaggcggagt taccctggga 661 cacaaggtcc ttgtgtgtgg agctgggcca atcgggatgg tcactttgct cgtggccaaa 721 gcaatgggag cagctcaagt agtggtgact gatctgtctg ctacccgatt gtccaaagcc 781 aaggagattg gggctgattt agtcctccag atctccaagg agagccctca ggaaatcgcc 841 aggaaagtag aaggtcagct ggggtgcaag ccggaagtca ccatcgagtg cacgggggca 901 gaggcctcca tccaggcggg catctacgcc actcgctctg gtgggaccct cgtgcttgtg 961 gggctgggct ctgagatgac caccgtaccc ctactgcatg cagccatccg ggaggtggat 1021 atcaagggcg tgtttcgata ctgcaacacg tggccagtgg cgatttcgat gcttgcgtcc 1081 aagtctgtga atgtaaaacc cctcgtcacc cataggtttc ctctggagaa agctctggag 1141 gcctttgaaa catttaaaaa gggattgggg ttgaaaatca tgctcaagtg tgaccccagt 1201 gaccagaatc cctgatgtta atgggctctg ctcatcccca cagtcttggg atctcagggc 1261 acaatggctg gacatgggtg ggctctgatg cagaactttc tcttttgaat gttaagaata 1321 actaatacaa ttcattgtga acagaagtcc ttaagcagag gaattggtgt gccttaaaga 1381 tacaatctgg gatagtttgg gggaacttgt agccagaatg ccctgttcat gctgagcaaa 1441 gttcagcaag tagagcagag tttggcaggc aggtgccagg aactcccctt cttcctggag 1501 tgccttcatt gaggaaggaa atctggccct tgggtttcct ggttccactg ctactgaccc 1561 agaggggaat gagggctgag ttatgaaaag ataacttcat gaagacttaa ctggcccaga 1621 agctgatttt catgaaaatc tgccactcag ggtctgggat gaaggcttgt cagcacttcc 1681 agtttagaac gcaatgtttc tagagacata ttggctgttt gttttgatga taaaaggaga 1741 ataagaaaag gcatcacttt cctggatcca ggataatttt taaaccaatc aaatgaaaaa 1801 aacaaaca // LOCUS HSU07559 2397 bp mRNA PRI 29-NOV-1995 DEFINITION Human ISL-1 (Islet-1) mRNA, complete cds. ACCESSION U07559 NID g533418 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2397) AUTHORS Riggs,A.C., Tanizawa,Y., Aoki,M., Wasson,J., Ferrer,J., Rabin,D.U., Vaxillaire,M., Froguel,P. and Permutt,M.A. TITLE Characterization of the LIM/homeodomain gene islet-1 and single nucleotide screening in NIDDM JOURNAL Diabetes 44 (6), 689-694 (1995) MEDLINE 95309532 REFERENCE 2 (bases 1 to 2397) AUTHORS Permutt,M.A. TITLE Direct Submission JOURNAL Submitted (07-MAR-1994) M. Alan Permutt, Internal Medicine, Washington University School of Medicine, 660 S. Euclid, St. Louis, MO 63110, USA FEATURES Location/Qualifiers source 1..2397 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="R9" /tissue_type="pancreatic islets" gene 249..1289 /gene="Islet-1" CDS 249..1289 /gene="Islet-1" /codon_start=1 /product="ISL-1" /db_xref="PID:g533419" /translation="MGDPPKKKRLISLCVGCGNQIHDQYILRVSPDLEWHAACLKCAE CNQYLDESCTCFVRDGKTYCKRDYIRLYGIKCAKCSIGFSKNDFVMRARSKVYHIECF RCVACSRQLIPGDEFALREDGLFCRADHDVVERASLGAGDPLSPLHPARPLQMAAEPI SARQPALRPHVHKQPEKTTRVRTVLNEKQLHTLRTCYAANPRPDALMKEQLVEMTGLS PRVIRVWFQNKRCKDKKRSIMMKQLQQQQPNDKTNIQGMTGTPMVAASPERHDGGLQA NPVEVQSYQPPWKVLSDFALQSDIDQPAFQQLVNFSEGGPGSNSTGSEVASMSSQLPD TPNSMVASPIEA" misc_feature 288..300 /gene="Islet-1" /function="limdomain #1 Cys region" misc_feature 354..384 /gene="Islet-1" /function="limdomain #1 Hys region" misc_feature 372..384 /gene="Islet-1" /function="limdomain #2 Cys Region" misc_feature 540..570 /gene="Islet-1" /function="limdomain #2 Hys Region" misc_feature 777..960 /gene="Islet-1" /function="homeodomain" misc_feature 978..990 /gene="Islet-1" /function="glutamine rich region" BASE COUNT 671 a 547 c 553 g 626 t ORIGIN 1 cccccgagcc gcgccgagtc tgccgccgcc gcagcgcctc cgctccgcca actccgccgg 61 cttaaattgg actcctagat ccgcgagggc gcggcgcagc cgagcagcgg ctctttcagc 121 attggcaacc ccaggggcca atatttccca cttagccaca gctccagcat cctctctgtg 181 ggctgttcac caactgtaca accaccattt cactgtggac attactccct cttacagata 241 tgggagacat gggagatcca ccaaaaaaaa aacgtctgat ttccctatgt gttggttgcg 301 gcaatcagat tcacgatcag tatattctga gggtttctcc ggatttggaa tggcatgcgg 361 catgtttgaa atgtgcggag tgtaatcagt atttggacga gagctgtaca tgctttgtta 421 gggatgggaa aacctactgt aaaagagatt atatcaggtt gtacgggatc aaatgcgcca 481 agtgcagcat cggcttcagc aagaacgact tcgtgatgcg tgcccgctcc aaggtgtatc 541 acatcgagtg tttccgctgt gtggcctgca gccgccagct catccctggg gacgaatttg 601 cgcttcggga ggacggtctc ttctgccgag cagaccacga tgtggtggag agggccagtc 661 taggcgctgg cgacccgctc agtcccctgc atccagcgcg gccactgcaa atggcagcgg 721 agcccatctc cgccaggcag ccagccctgc ggccccacgt ccacaagcag ccggagaaga 781 ccacccgcgt gcggactgtg ctgaacgaga agcagctgca caccttgcgg acctgctacg 841 ccgcaaaccc gcggccagat gcgctcatga aggagcaact ggtagagatg acgggcctca 901 gtccccgtgt gatccgggtc tggtttcaaa acaagcggtg caaggacaag aagcgaagca 961 tcatgatgaa gcaactccag cagcagcagc ccaatgacaa aactaatatc caggggatga 1021 caggaactcc catggtggct gccagtccag agagacacga cggtggctta caggctaacc 1081 cagtggaagt acaaagttac cagccacctt ggaaagtact gagcgacttc gccttgcaga 1141 gtgacataga tcagcctgct tttcagcaac tggtcaattt ttcagaagga ggaccgggct 1201 ctaattccac tggcagtgaa gtagcatcaa tgtcctctca acttccagat acacctaaca 1261 gcatggtagc cagtcctatt gaggcatgag gaacattcat tctgtatttt ttttccctgt 1321 tggagaaagt gggaaattat aatgtcgaac tctgaaacaa aagtatttaa cgacccagtc 1381 aatgaaaact gaatcaagaa atgaatgctc catgaaatgc acgaagtctg ttttaatgac 1441 aaggtgatat ggtagcaaca ctgtgaagac aatcatggga ttttactaga attaaacaac 1501 aaacaaaacg caaaacccag tatatgctat tcaatgatct tagaagtact gaaaaaaaaa 1561 gacgttttta aaacgtagag gatttatatt caaggatctc aaagaaagca ttttcatttc 1621 actgcacatc tagagaaaaa caaaaataga aaattttcta gtccatccta atctgaatgg 1681 tgctgtttct atattggtca ttgccttgcc aaacaggagc tccagcaaaa gcgcaggaag 1741 agagactggc ctccttggct gaaagagtcc tttcaggaag gtggagctgc attggtttga 1801 tatgtttaaa gttgacttta acaaggggtt aattgaaatc ctgggtctct tggcctgtcc 1861 tgtagctggt ttatttttta ctttgccccc tccccacttt ttttgagatc catcctttat 1921 caagaagtct gaagcgacta taaaggtttt tgaattcaga tttaaaaacc aacttataaa 1981 gcattgcaac aaggttacct ctattttgcc acaagcgtct cgggattgtg tttgacttgt 2041 gtctgtccaa gaacttttcc cccaaagatg tgtatagtta ttggttaaaa tgactgtttt 2101 ctctctctat ggaaataaaa aggaaaaaaa aaaggaaact ttttttgttt gctcttgcat 2161 tgcaaaaatt ataaagtaat ttattattta ttgtcggaag acttgccact tttcatgtca 2221 tttgacattt tttgtttgct gaagtgaaaa aaaaagataa aggttgtacg gtggtctttg 2281 aattatatgt ctaattctat gtgttttgtc tttttcttaa atattatgtg aaatcaaagc 2341 gccatatgta gaattatatc ttcaggacta tttcactaat aaacatttgg catagat // LOCUS HSU07616 2377 bp mRNA PRI 01-OCT-1994 DEFINITION Human amphiphysin mRNA, complete cds. ACCESSION U07616 NID g550449 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2377) AUTHORS David,C., Solimena,M. and De Camilli,P. TITLE Autoimmunity in Stiff-Man Syndrome with breast cancer is targeted to the C-terminal region of human amphiphysin, a protein homologous to the yeast proteins, Rvs167 and Rvs161 JOURNAL FEBS Lett. 351, 73-79 (1994) MEDLINE 94357284 REFERENCE 2 (bases 1 to 2377) AUTHORS David,C. TITLE Direct Submission JOURNAL Submitted (09-MAR-1994) Carol David, Cell Biology, Yale University, 333 Cedar Ave., New Haven, CT 06510, USA FEATURES Location/Qualifiers source 1..2377 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda gt11" /sex="female" /tissue_type="cerebellum" /clone="22-2" CDS 111..2198 /codon_start=1 /product="amphiphysin" /db_xref="PID:g550450" /translation="MADIKTGIFAKNVQKRLNRAQEKVLQKLGKADETKDEQFEEYVQ NFKRQEAEGTRLQRELRGYLAAIKGMQEASMKLTESLHEVYEPDWYGREDVKMVGEKC DVLWEDFHQKLVDGSLLTLDTYLGQFPDIKNRIAKRSRKLVDYDSARHHLEALQSSKR KDESRISKAEEEFQKAQKVFEEFNVDLQEELPSLWSRRVGFYVNTFKNVSSLEAKFHK EIAVLCHKLYEVMTKLGDQHADKAFTIQGAPSDSGPLRIAKTPSPPEEPSPLPSPTAS PNHTLAPASPAPARPRSPSQTRKGPPVPPLPKVTPTKELQQENIISFFEDNFVPEISV TTPSQNEVPEVKKEETLLDLDFDPFKPEVTPAGSAGVTHSPMSQTLPWDLWTTSTDLV QPASGGSFNGFTQPQDTSLFTMQTDQSMICNLAESEQAPPTEPKAEEPLAAVTPAVGL DLGMDTRAEEPVEEAVIIPGADADAAVGTLVSAAEGAPGEEAEAEKATVPAGEGVSLE EAKIGTETTEGAESAQPEAEELEATVPQEKVIPSVVIEPASNHEEEGENEITIGAEPK ETTEDAAPPGPTSETPELATEQKPIQDPQPTPSAPAMGAADQLASAREASQELPPGFL YKVETLHDFEAANSDELTLQRGDVVLVVPSDSEADQDAGWLVGVKESDWLQYRDLATY KGLFPENFTRRLD" BASE COUNT 664 a 603 c 632 g 478 t ORIGIN 1 cggctctcag ctgcactcct gtacatccac ctgtcttcag gagagcactg tttgtgtgtg 61 cccagccccg ctgcgcgctc tgctcttcgc agctccccgg acccgcagcc atggccgaca 121 tcaagacggg catcttcgcc aagaacgtcc agaagcgact caaccgcgcg caggaaaagg 181 tcctccaaaa gctggggaaa gctgatgaga caaaagacga acagttcgaa gaatatgtcc 241 agaacttcaa acggcaagaa gcagagggta ccagacttca gcgagaactc cgaggatatt 301 tagcagcaat caaaggcatg caggaggcct ccatgaagct cacagagtcg ctgcatgaag 361 tctatgagcc tgactggtat gggcgggaag atgtgaaaat ggttggtgag aaatgtgatg 421 tgctgtggga agacttccat caaaaactcg tggatgggtc cttgctaaca ctggatacct 481 acctggggca atttcctgac ataaagaatc gcatcgccaa gcgcagcagg aagctagtgg 541 actatgacag tgcccgccac catctggaag ctctgcagag ctccaagagg aaggatgaga 601 gtcgaatctc taaggcagaa gaagaatttc agaaagcaca gaaagtgttt gaagagttta 661 acgttgactt acaagaagag ttaccatcat tatggtcaag acgagttgga ttttatgtta 721 atactttcaa aaacgtctcc agccttgaag ccaagtttca taaggaaatt gcggtgcttt 781 gccacaaact gtatgaagtg atgacaaaac tgggtgacca gcacgccgac aaggccttca 841 ccatccaagg agcgcccagt gattcgggtc ctctccgcat tgcaaagaca ccatcaccgc 901 ctgaggagcc ttcacccctc ccgagcccga cagcaagtcc aaatcataca ttagcacctg 961 cgtctcccgc accagcacgg cctcggtcac cttcacagac aaggaaaggg cctcctgtcc 1021 cacctctacc taaagtcacc ccgacaaagg aactgcagca ggagaacatc atcagtttct 1081 ttgaggacaa ctttgttcca gaaatcagtg tgacaacacc ttcccagaat gaagtccctg 1141 aggtgaagaa agaggagact ttgctggatc tggactttga tcctttcaag cccgaggtga 1201 cacctgcagg ttctgctgga gtgacccact cacccatgtc tcagacattg ccctgggacc 1261 tatggacgac aagcactgat ttggtacagc cggcttctgg tggttcattt aatggattca 1321 cacagcccca ggatacttca ttattcacaa tgcagacaga ccagagtatg atctgcaact 1381 tggctgaatc tgaacaggct ccacccacag agccaaaagc agaggagcct ctggctgctg 1441 tcacacctgc cgttggtctg gaccttggaa tggacactcg ggctgaggag ccagtggagg 1501 aggcagtgat catacctgga gctgatgctg atgcagctgt tggaaccttg gtgtcagcag 1561 ctgagggggc cccaggagag gaagcagagg cggagaaggc cactgtccct gccggggaag 1621 gagtaagttt agaggaggcc aaaattggaa ctgaaaccac tgagggtgca gagagtgccc 1681 aacctgaagc agaggagctc gaagcaacag tgcctcagga gaaggtcatt ccttcggtgg 1741 tcatagagcc tgcctccaac catgaagagg aaggagaaaa cgaaataact ataggtgcag 1801 agcccaagga gaccaccgag gacgcggctc ctccgggccc caccagcgag acaccggagc 1861 tggctacgga gcagaagcct atccaggacc ctcagcccac gccttctgca ccagccatgg 1921 gggctgctga ccagctagca tctgcaaggg aggcctctca ggaattgcct cctggctttc 1981 tctacaaggt ggaaacactg catgattttg aggcagcaaa ttctgatgaa cttaccttac 2041 aaaggggtga tgtggtgctg gtggtcccct cagattcaga agctgatcag gatgcaggct 2101 ggctggtggg agtgaaggaa tcagactggc ttcagtacag agaccttgcc acctacaaag 2161 gcctctttcc agagaacttc acccgacgct tagattaggg caacaagtac tgcaagaagg 2221 agctcagtta cggggttttt aaaccttcat gaaaacctga agagttcact tttgttatta 2281 tgctcttaat gatttacaga ctgatgccag acaaaccttg ggaagatgta tcaatggagc 2341 atgtgtgcaa aaaaatgtaa gaggaaaaaa aaaaccg // LOCUS HSU07620 2372 bp mRNA PRI 17-FEB-1995 DEFINITION Human MAP kinase mRNA, complete cds. ACCESSION U07620 NID g468150 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2372) AUTHORS Mohit,A.A., Martin,J.H. and Miller,C.A. TITLE p493F12 kinase: a novel MAP kinase expressed in a subset of neurons in the human nervous system JOURNAL Neuron 14 (1), 67-78 (1995) MEDLINE 95127233 REFERENCE 2 (bases 1 to 2372) AUTHORS Mohit,A.A. TITLE Direct Submission JOURNAL Submitted (09-MAR-1994) Abdi A. Mohit, Pathology, University of Southern California, School of Medicine, 2011 Zonal Ave, Los Angeles, CA 90033, USA FEATURES Location/Qualifiers source 1..2372 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="3F12" /chromosome="21" /map="21q" /cell_type="neuron" /tissue_type="brain" /dev_stage="adult" 5'UTR <1..222 CDS 224..1492 /codon_start=1 /product="MAP kinase" /db_xref="PID:g468151" /translation="MSLHFLYYCSEPTLDVKIAFCQGFDKQVDVSYIAKHYNMSKSKV DNQFYSVEVGDSTFTVLKRYQNLKPIGSGAQGIVCAAYDAVLDRNVAIKKLSRPFQNQ THAKRAYRELVLMKCVNHKNIISLLNVFTPQKTLEEFQDVYLVMELMDANLCQVIQME LDHERMSYLLYQMLCGIKHLHSAGIIHRDLKPSNIVVKSDCTLKILDFGLARTAGTSF MMTPYVVTRYYRAPEVILGMGYKENVDIWSVGCIMGEMVRHKILFPGRDYIDQWNKVI EQLGTPCPEFMKKLQPTVRNYVENRPKYAGLTFPKLFPDSLFPADSEHNKLKASQARD LLSKMLVIDPAKRISVDDALQHPYINVWYDPAEVEAPPPQIYDKQLDEREHTIEEWKE LIYKEVMNSEEKTKNGVVKGQPSPSAQVQQ" 3'UTR 1493..2372 polyA_signal 2351..2356 polyA_site 2372 /note="33 A nucleotides" BASE COUNT 714 a 548 c 519 g 591 t ORIGIN 1 gagaaatggc gtggcagggg acccagcgag cccagaggga ttttgccgct gcttcctcta 61 cccctgtatt tcacgcagct ctctaaattg actcagctcc aggctagtgt gagaaacacc 121 aacagcaggc ccatctcaga tcttcactat ggcaacttat gcaagaaact gttgaattag 181 acccgtttcc tatagatgag aaaccataca agctgtggta tttatgagcc tccatttctt 241 atactactgc agtgaaccaa cattggatgt gaaaattgcc ttttgtcagg gattcgataa 301 acaagtggat gtgtcatata ttgccaaaca ttacaacatg agcaaaagca aagttgacaa 361 ccagttctac agtgtggaag tgggagactc aaccttcaca gttctcaagc gctaccagaa 421 tctaaagcct attggctctg gggctcaggg catagtttgt gccgcgtatg atgctgtcct 481 tgacagaaat gtggccatta agaagctcag cagacccttt cagaaccaaa cacatgccaa 541 gagagcgtac cgggagctgg tcctcatgaa gtgtgtgaac cataaaaaca ttattagttt 601 attaaatgtc ttcacacccc agaaaacgct ggaggagttc caagatgttt acttagtaat 661 ggaactgatg gatgccaact tatgtcaagt gattcagatg gaattagacc atgagcgaat 721 gtcttacctg ctgtaccaaa tgttgtgtgg cattaagcac ctccattctg ctggaattat 781 tcacagggat ttaaaaccaa gtaacattgt agtcaagtct gattgcacat tgaaaatcct 841 ggactttgga ctggccagga cagcaggcac aagcttcatg atgactccat atgtggtgac 901 acgttattac agagcccctg aggtcatcct ggggatgggc tacaaggaga acgtggatat 961 atggtctgtg ggatgcatta tgggagaaat ggttcgccac aaaatcctct ttccaggaag 1021 ggactatatt gaccagtgga ataaggtaat tgaacaacta ggaacaccat gtccagaatt 1081 catgaagaaa ttgcaaccca cagtaagaaa ctatgtggag aatcggccca agtatgcggg 1141 actcaccttc cccaaactct tcccagattc cctcttccca gcggactccg agcacaataa 1201 actcaaagcc agccaagcca gggacttgtt gtcaaagatg ctagtgattg acccagcaaa 1261 aagaatatca gtggacgacg ccttacagca tccctacatc aacgtctggt atgacccagc 1321 cgaagtggag gcgcctccac ctcagatata tgacaagcag ttggatgaaa gagaacacac 1381 aattgaagaa tggaaagaac ttatctacaa ggaagtaatg aattcagaag aaaagactaa 1441 aaatggtgta gtaaaaggac agccttctcc ttcagcacag gtgcagcagt gaacagcagt 1501 gagagtctcc ctccatcctc gtctgtcaat gacatctcct ccatgtccac cgaccagacc 1561 ctggcatctg acactgacag cagcctggaa gcctcggcag gacccctggg ttgttgcagg 1621 tgactagccg cctgcctgcg aaacccagcg ttcttcagga gatgatgtga tggaacacac 1681 acacacgcag acacacacac acacacaaat gcagacacac aacatcaaga aaacagcaag 1741 ggagagaatc caagcctaaa attaaataaa tctttcagcc tgcttcttcc ccagggttct 1801 gtattgcagc taagctcaaa tgtatattta acttctagtt gctcttgctt tggtcttctt 1861 ccaatgatgc ttactacaga aagcaaatca gacacaatta gagaagcctt ttccataaag 1921 tgtaatttta atggctgcaa aaccggcaac ctgtaactgc ccttttaaat ggcatgacaa 1981 ggtgtgcagt ggccccatcc agcatgtgtg tgtctctatc ttgcatctac ctgctccttg 2041 gcctagtcag atggatgtag atacagatcc gcatgtgtct gtattcatac agcactactt 2101 acttagagat gctactctca gtgtcctcag ggctctacca agacataatg cactggggta 2161 ccacatggtc catttcatgt gatctattac tctgacataa acccatctgt aatatattgc 2221 cagtatataa gctgtttagt ttgttaattg attaaactgt atgtcttata agaaaacatg 2281 taaaggggga atatattggg ggagtgagct ctctcagacc cttgaagatg tagcttccaa 2341 atttgaatgg attaaatggc acctgtatac ca // LOCUS HSU07681 2628 bp mRNA PRI 26-JAN-1996 DEFINITION Human NAD(H)-specific isocitrate dehydrogenase alpha subunit precursor mRNA, complete cds. ACCESSION U07681 NID g706838 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2628) AUTHORS Kim,Y.O., Oh,I.U., Park,H.S., Jeng,J., Song,B.J. and Huh,T.L. TITLE Characterization of a cDNA clone for human NAD(+)-specific isocitrate dehydrogenase alpha-subunit and structural comparison with its isoenzymes from different species JOURNAL Biochem. J. 308 (Pt 1), 63-68 (1995) MEDLINE 95275260 REFERENCE 2 (bases 1 to 2628) AUTHORS Huh,T. TITLE Direct Submission JOURNAL Submitted (12-MAR-1994) Tae-Lin Huh, Kyungpook National University, College of Natural Sciences, Genetic Engineering, 1370 Sankyuk-Dong, Pook-Ku, TaeGu 702-701, Korea FEATURES Location/Qualifiers source 1..2628 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hidha" /clone_lib="lambda gt 11" /sex="male" /tissue_type="heart" /dev_stage="adult" sig_peptide 6..86 /note="mitochondrial leader sequence" CDS 6..1106 /codon_start=1 /product="NAD(H)-specific isocitrate dehydrogenase alpha subunit precursor" /db_xref="PID:g706839" /translation="MAGPAWISKVSRLLGAFHNPKQVTRGFTGGVQTVTLIPGDGIGP EISAAVMKIFDAAKAPIQWEERNVTAIQGPGGKWMIPSEAKESMDKNKMGLKGPLKTP IAAGHPSMNLLLRKTFDLYANVRPCVSIEGYKTPYTDVNIVTIRENTEGEYSGIEHVI VDGVVQSIKLITEGASKRIAEFAFEYARNNHRSNVTAVHKANIMRMSDGLFLQKCREV AESCKDIKFNEMYLDTVCLNMVQDPSQFDVLVMPNLYGDILSDLCAGLIGGLGVTPSG NIGANGVAIFESVHGTAPDIAGKDMANPTALLLSAVMMLRHMGLFDHAARIEAACFAT IKDGKSLTKDLGGNAKCSDFTEEICRRVKDLD" mat_peptide 87..1103 BASE COUNT 745 a 533 c 570 g 780 t ORIGIN 1 aagcgatggc tgggcccgcg tggatctcca aggtctctcg gctgctgggg gcattccaca 61 acccaaaaca ggtgaccaga ggttttactg gtggtgttca gacagtaact ttaattccag 121 gagatggtat tggcccagaa atttcagctg cagttatgaa gatttttgat gctgccaaag 181 cacctattca gtgggaggag cggaacgtca ctgccattca aggacctgga ggaaagtgga 241 tgatcccttc agaggctaaa gagtccatgg ataagaacaa gatgggcttg aaaggccctt 301 tgaagacccc aatagcagcc ggtcacccat ctatgaattt actgctgcgc aaaacatttg 361 acctttacgc gaatgtccga ccatgtgtct ctatcgaagg ctataaaacc ccttacaccg 421 atgtaaatat tgtgaccatt cgagagaaca cagaaggaga atacagtgga attgagcatg 481 tgattgttga tggagtcgtg cagagtatca agctcatcac cgagggggcg agcaagcgca 541 ttgctgagtt tgcctttgag tatgcccgga acaaccaccg gagcaacgtc acggcggtgc 601 acaaagccaa catcatgcgg atgtcagatg ggctttttct acaaaaatgc agggaagttg 661 cagaaagctg taaagatatt aaatttaatg agatgtacct tgatacagta tgtttgaata 721 tggtacaaga tccttcccaa tttgatgttc ttgttatgcc aaatttgtat ggagacatcc 781 ttagtgactt gtgtgcagga ttgatcggag gtctcggtgt gacaccaagt ggcaacattg 841 gagccaatgg ggttgcaatt tttgagtcgg ttcatgggac ggctccagac attgcaggca 901 aggacatggc gaatcccaca gccctcctgc tcagtgccgt gatgatgctg cgccacatgg 961 gactttttga ccatgctgca agaattgagg ctgcgtgttt tgctacaatt aaggacggaa 1021 agagcttgac aaaagatttg ggaggcaatg caaaatgctc agacttcaca gaggaaatct 1081 gtcgccgagt aaaagattta gattaacact tctacaactg gcatttacat cagtcactct 1141 aaatggacac cacatgaacc tctgtttaga atacctacgt atgtatgcat tggtttgctt 1201 gtttcttgac agtacatttt tagatctggc cttttcttaa caaaatctgt gcaaaagatg 1261 caggtggatg tccctaggtc tgttttcaaa gaactttttc caagtgcttg ttttatttat 1321 taagtgtcta cctggtaaat gttttttttg taaactctga gtggactgta tcatttgcta 1381 ttctaaacca ttttacactt aagttaaaat agtttctctt cagctgtaaa taacaggata 1441 cagaattaac aagagaaaat gtctaacttt ttaagaaaaa ccttattttc ttcggttttt 1501 gaaaaacata atggaaataa aacaggatat tgacataata gcacaaaatg acactcttct 1561 aaaactaaat gggcacaaga gaattttcct gggaaagttc acatcaaaaa gagtgaatgt 1621 ggtatatttc taaatgatat ggaaaataga gacagatttg tcctttacag aaattactga 1681 gtgtgaataa aaacttcaga tccaagaaat atataatgag agatataatt tttgttaata 1741 agacaaaggt aatatattgg atacaaagac acaaatgtat tgtgtgttca attattttgt 1801 tgtcttgaga tttaatattc tttccaagag cttttaatga agcagagagc tagtacttca 1861 ttttcactgg atacattttc agcatcatga gttgtcacag cctctgagcc cctgatctga 1921 agccagaagg gctgagtgta ttgtaaactt attcttgcat gttgctgtct gggaatggac 1981 cacactacag caggtagttc tgggggcgat actgccgaaa ggcccgaaca catgtatttt 2041 ggctgcaatt gaggaacttg ggatgctatt aattttgtat ttcagcaact gccccttctc 2101 ctatcccaaa gcaccaatta ctgccctctg cctcagcagt accagtataa gatgacattc 2161 caaagactgg aggcaactca gcctgagtta attcacaaaa ttatgccatg ctggggcttg 2221 agcttgagct tgggcttagg cttgggctca gcttttgacc ctcaggcatc tccttttcct 2281 tcctgtcttc ctctcccttc tcctctgctg cagcatgatt ttcttaatct tcagacactc 2341 actattttca tgaacagtta ccctctgtcc ccacaaccaa agacaactca tggcctcctt 2401 tggcccttgt gtaacattgc aaacctgtgg ctttgcaaaa tgtacccagg tcacaagggg 2461 attttttttt ttttagcaat gatatccctg tctgggtcac tttttaagct tgtaaccgcc 2521 cccccagact tataatctta aatgtatttt cctttgttta agctgctgct tcctctgttt 2581 cattggattg tgccagttat cagtggctct tgggttcaaa gtaataaa // LOCUS HSU07695 3945 bp mRNA PRI 10-AUG-1994 DEFINITION Human tyrosine kinase (HTK) mRNA, complete cds. ACCESSION U07695 NID g495472 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 987) AUTHORS Bennett,B.D., Wang,Z., Kuang,W.J., Wang,A., Groopman,J.E., Goeddel,D.V. and Scadden,D.T. TITLE Cloning and characterization of HTK, a novel transmembrane tyrosine kinase of the EPH subfamily JOURNAL J. Biol. Chem. 269 (19), 14211-14218 (1994) MEDLINE 94245746 REFERENCE 2 (bases 1 to 3945) AUTHORS Scadden,D.T. TITLE Direct Submission JOURNAL Submitted (14-MAR-1994) David T. Scadden, Deaconess Hospital, Harvard Medical School, Hematology/Oncology, 185 Pilgrim Road, Boston, MA 02215, USA FEATURES Location/Qualifiers source 1..3945 /organism="Homo sapiens" /db_xref="taxon:9606" gene 86..3049 /gene="HTK" CDS 86..3049 /gene="HTK" /codon_start=1 /product="tyrosine kinase" /db_xref="PID:g495473" /translation="MELRVLLCWASLAAALEETLLNTKLETADLKWVTFPQVDGQWEE LSGLDEEQHSVRTYEVCEVQRAPGQAHWLRTGWVPRRGAVHVYATLRFTMLECLSLPR AGRSCKETFTVFYYESDADTATALTPAWMENPYIKVDTVAAEHLTRKRPGAEATGKVN VKTLRLGPLSKAGFYLAFQDQGACMALLSLHLFYKKCAQLTVNLTRFPETVPRELVVP VAGSCVVDAVPAPGPSPSLYCREDGQWAEQPVTGCSCAPGFEAAEGNTKCRACAQGTF KPLSGEGSCQPCPANSHSNTIGSAVCQCRVGDFRARTDPRGAPCTTPPSAPRSVVSRL NGSSLHLEWSAPLESGGREDLTYALRCRECRPGGSCAPCGGDLTFDPGPRDLVEPWVV VRGLRPDFTYTFEVTALNGVSSLATGPVPFEPVNVTTDREVPPAVSDIRVTRSSPSSL SLAWAVPRAPSGAWLDYEVKYHEKGAEGPSSVRFLKTSENRAELRGLKRGASYLVQVR ARSEAGYGPFGQEHHSQTQLDESEGWREQLALIAGTAVVGVVLVLVVIVVAVLCLRKQ SNGREAEYSDKHGQYLIGHGTKVYIDPFTYEDPNEAVREFAKEIDVSYVKIEEVIGAG EFGEVCRGRLKAPGKKESCVAIKTLKGGYTERQRREFLSEASIMGQFEHPNIIRLEGV VTNSMPVMILTEFMENGALDSFLRLNDGQFTVIQLVGMLRGIASGMRYLAEMSYVHRD LAARNILVNSNLVCKVSDFGLSRFLEENSSDPTYTSSLGGKIPIRWTAPEAIAFRKFT SASDAWSYGIVMWEVMSFGERPYWDMSNQDVINAIEQDYRLPPPPDCPTSLHQLMLDC WQKDRNARPRFPQVVSALDKMIRNPASLKIVARENGGASHPLLDQRQPHYSAFGSVGE WLRAIKMGRYEARFAAAGFGSFELVSQISAEDLLRIGVTLAGHQKKILASVQHMKSQA KPGTPGGTGGPAPQY" polyA_site 3945 /note="20 A residues" BASE COUNT 763 a 1188 c 1215 g 779 t ORIGIN 1 cgtccacccg cccagggaga gtcagacctg ggggggcgag ggccccccaa actcagttcg 61 gatcctaccc gagtgaggcg gcgccatgga gctccgggtg ctgctctgct gggcttcgtt 121 ggccgcagct ttggaagaga ccctgctgaa cacaaaattg gaaactgctg atctgaagtg 181 ggtgacattc cctcaggtgg acgggcagtg ggaggaactg agcggcctgg atgaggaaca 241 gcacagcgtg cgcacctacg aagtgtgtga agtgcagcgt gccccgggcc aggcccactg 301 gcttcgcaca ggttgggtcc cacggcgggg cgccgtccac gtgtacgcca cgctgcgctt 361 caccatgctc gagtgcctgt ccctgcctcg ggctgggcgc tcctgcaagg agaccttcac 421 cgtcttctac tatgagagcg atgcggacac ggccacggcc ctcacgccag cctggatgga 481 gaacccctac atcaaggtgg acacggtggc cgcggagcat ctcacccgga agcgccctgg 541 ggccgaggcc accgggaagg tgaatgtcaa gacgctgcgt ctgggaccgc tcagcaaggc 601 tggcttctac ctggccttcc aggaccaggg tgcctgcatg gccctgctat ccctgcacct 661 cttctacaaa aagtgcgccc agctgactgt gaacctgact cgattcccgg agactgtgcc 721 tcgggagctg gttgtgcccg tggccggtag ctgcgtggtg gatgccgtcc ccgcccctgg 781 ccccagcccc agcctctact gccgtgagga tggccagtgg gccgaacagc cggtcacggg 841 ctgcagctgt gctccggggt tcgaggcagc tgaggggaac accaagtgcc gagcctgtgc 901 ccagggcacc ttcaagcccc tgtcaggaga agggtcctgc cagccatgcc cagccaatag 961 ccactctaac accattggat ctgccgtctg ccagtgccgc gtcggggact tccgggcacg 1021 cacagacccc cggggtgcac cctgcaccac ccctccttcg gctccgcgga gcgtggtttc 1081 ccgcctgaac ggctcctccc tgcacctgga atggagtgcc cccctggagt ctggtggccg 1141 agaggacctc acctacgccc tccgctgccg ggagtgccga cccggaggct cctgtgcgcc 1201 ctgcggggga gacctgactt ttgaccccgg cccccgggac ctggtggagc cctgggtggt 1261 ggttcgaggg ctacgtccgg acttcaccta tacctttgag gtcactgcat tgaacggggt 1321 atcctcctta gccacggggc ccgtcccatt tgagcctgtc aatgtcacca ctgaccgaga 1381 ggtacctcct gcagtgtctg acatccgggt gacgcggtcc tcacccagca gcttgagcct 1441 ggcctgggct gttccccggg cacccagtgg ggcgtggctg gactacgagg tcaaatacca 1501 tgagaagggc gccgagggtc ccagcagcgt gcggttcctg aagacgtcag aaaaccgggc 1561 agagctgcgg gggctgaagc ggggagccag ctacctggtg caggtacggg cgcgctctga 1621 ggccggctac gggcccttcg gccaggaaca tcacagccag acccaactgg atgagagcga 1681 gggctggcgg gagcagctgg ccctgattgc gggcacggca gtcgtgggtg tggtcctggt 1741 cctggtggtc attgtggtcg cagttctctg cctcaggaag cagagcaatg ggagagaagc 1801 agaatattcg gacaaacacg gacagtatct catcggacat ggtactaagg tctacatcga 1861 ccccttcact tatgaagacc ctaatgaggc tgtgagggaa tttgcaaaag agatcgatgt 1921 ctcctacgtc aagattgaag aggtgattgg tgcaggtgag tttggcgagg tgtgccgggg 1981 gcggctcaag gccccaggga agaaggagag ctgtgtggca atcaagaccc tgaagggtgg 2041 ctacacggag cggcagcggc gtgagtttct gagcgaggcc tccatcatgg gccagttcga 2101 gcaccccaat atcatccgcc tggagggcgt ggtcaccaac agcatgcccg tcatgattct 2161 cacagagttc atggagaacg gcgccctgga ctccttcctg cggctaaacg acggacagtt 2221 cacagtcatc cagctcgtgg gcatgctgcg gggcatcgcc tcgggcatgc ggtaccttgc 2281 cgagatgagc tacgtccacc gagacctggc tgctcgcaac atcctagtca acagcaacct 2341 cgtctgcaaa gtgtctgact ttggcctttc ccgattcctg gaggagaact cttccgatcc 2401 cacctacacg agctccctgg gaggaaagat tcccatccga tggactgccc cggaggccat 2461 tgccttccgg aagttcactt ccgccagtga tgcctggagt tacgggattg tgatgtggga 2521 ggtgatgtca tttggggaga ggccgtactg ggacatgagc aatcaggacg tgatcaatgc 2581 cattgaacag gactaccggc tgcccccgcc cccagactgt cccacctccc tccaccagct 2641 catgctggac tgttggcaga aagaccggaa tgcccggccc cgcttccccc aggtggtcag 2701 cgccctggac aagatgatcc ggaaccccgc cagcctcaaa atcgtggccc gggagaatgg 2761 cggggcctca caccctctcc tggaccagcg gcagcctcac tactcagctt ttggctctgt 2821 gggcgagtgg cttcgggcca tcaaaatggg aagatacgaa gcccgtttcg cagccgctgg 2881 ctttggctcc ttcgagctgg tcagccagat ctctgctgag gacctgctcc gaatcggagt 2941 cactctggcg ggacaccaga agaaaatctt ggccagtgtc cagcacatga agtcccaggc 3001 caagccggga accccgggtg ggacaggagg accggccccg cagtactgac ctgcaggaac 3061 tccccacccc agggacaccg cctccccatt ttccggggca gcgtggggac tcacagaggc 3121 ccccagccct gtgccccgct ggattgcact ttgagcccgt ggggtgagga gttggcaatt 3181 tggagagaca ggatttgggg gttctgccat aataggaggg gaaaatcacc ccccagccac 3241 ctcggggaac tccagaccaa gggtgagggc gcctttccct caggactggg tgtgaccaga 3301 ggaaaaggaa gtgcccaaca tctcccagcc tccccaggtg cccccctcac cttgatgggt 3361 gcgttcccgc agaccaaaga gagtgtgact cccttgccag ctccagagtg ggggggctgt 3421 cccagggggc aagaaggggt gtcagggccc agtgacaaaa tcattggggt ttgtagtccc 3481 aacttgctgc tgtcaccacc aaactcaatc atttttttcc cttgtaaatg cccctccccc 3541 agctgctgcc ttcatattga aggtttttga gttttgtttt tggtcttaat ttttctcccc 3601 gttccctttt tgtttcttcg ttttgttttt ctaccgtcct tgtcataact ttgtgttgga 3661 gggaacctgt ttcactatgg cctcctttgc ccaagttgaa acaggggccc atcatcatgt 3721 ctgtttccag aacagtgcct tggtcatccc acatccccgg accccgcctg ggacccccaa 3781 gatgtgtcct atgaaggggt gtggggtgag gtagtgaaaa gggcggtagt tggtggtgga 3841 acccagaaac ggacgccggt gcttggaggg gttcttaaat tatatttaaa aaagtaactt 3901 tttgtataaa taaaagaaaa tgggacgtgt cccagctcca ggggt // LOCUS HSU07747 3531 bp mRNA PRI 12-JUL-1994 DEFINITION Human SH3 domain-containing proline-rich kinase (sprk) mRNA, complete cds. ACCESSION U07747 NID g464027 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3531) AUTHORS Gallo,K.A., Mark,M.R., Scadden,D.T., Wang,Z., Gu,Q. and Godowski,P.J. TITLE Identification and characterization of SPRK, a novel src-homology 3 domain-containing proline-rich kinase with serine/threonine kinase activity JOURNAL J. Biol. Chem. 269, 15092-15100 (1994) MEDLINE 94253068 REFERENCE 2 (bases 1 to 3531) AUTHORS Godowski,P.J. TITLE Direct Submission JOURNAL Submitted (15-MAR-1994) Paul J. Godowski, Genentech, Inc., 460 Point San Bruno Blvd., South San Francisco, CA 94080, USA FEATURES Location/Qualifiers source 1..3531 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="gt 10" /cell_line="CMK11-5" gene 451..2994 /gene="sprk" CDS 451..2994 /gene="sprk" /standard_name="SPRK" /standard_name="src-homology 3 (SH3) domain-containing proline-rich kinase" /note="submitter comments: serine/threonine protein kinase, proline-rich, src-homology 3 (SH3) domain, leucine/isoleucine zipper motifs" /codon_start=1 /product="serine/threonine protein kinase" /db_xref="PID:g464028" /translation="MEPLKSLFLKSPLGSWNGSGSGGGGGGGGGRPEGSPKAAGYANP VWTALFDYEPSGQDELALRKGDRVEVLSRDAAISGDEGWWAGQVGGQVGIFPSNYVSR GGGPPPCEVASFQELRLEEVIGIGGFGKVYRGSWRGELVAVKAARQDPDEDISVTAES VRQEARLFAMLAHPNIIALKAVCLEEPNLCLVMEYAAGGPLSRALAGRRVPPHVLVNW AVQIARGMHYLHCEALVPVIHRDLKSNNILLLQPIESDDMEHKTLKITDFGLAREWHK TTQMSAAGTYAWMAPEVIKASTFSKGSDVWSFGVLLWELLTGEVPYRGIDCLAVAYGV AVNKLTLPIPSTCPEPFAQLMADCWAQDPHRRPDFASILQQLEALEAQVLREMPRDSF HSMQEGWKREIQGLFDELRAKEKELLSREEELTRAAREQRSQAEQLRRREHLLAQWEL EVFERELTLLLQQVDRERPHVRRRRGTFKRSKLRARDGGERISMPLDFKHRITVQASP GLDRRRNVFEVGPGDSPTFPRFRAIQLEPAEPGQAWGRQSPRRLEDSSNGERRACWAW GPSSPKPGEAQNGRRRSRMDEATWYLDSDDSSPLGSPSTPPALNGNPPRPSLEPEEPK RPVPAERGSSSGTPKLIQRALLRGTALLASLGLGRDLQPPGGPGRERGESPTTPPTPT PAPCPTEPPPSPLICFSLKTPDSPPTPAPLLLDLGIPVGQRSAKSPRREEEPRGGTVS PPPGTSRSAPGTPGTPRSPPLGLISRPRPSPLRSRIDPWSFVSAGPRPSPLPSPQPAP RRAPWTLFPDSDPFWDSPPANPFQGGPQDCRAQTKDMGAQAPWVPEAGP" misc_feature 577..756 /gene="sprk" /note="SH3 domain" misc_feature 793..1659 /gene="sprk" /note="kinase domain" misc_feature 1710..1896 /gene="sprk" /note="Leucine/Isoleucine zipper region" polyA_site 3531 /note="32 A residues" BASE COUNT 632 a 1207 c 1143 g 549 t ORIGIN 1 gccaaaggag acggggccag gaacaggcag tctcggccca actgcggacg ctccctccac 61 cccctgcgca aaaagaccca accggagttg aggcgctgcc cctgaaggcc ccaccttaca 121 cttggcgggg gccggagcca ggctcccagg actgctccag aaccgaggga agctcgggtc 181 cctccaagct agccatggtg aggcgccgga ggccccgggg ccccaccccc ccggcctgac 241 cacactgccc tgggtgccct cctccagaag cccgagatgc ggggggccgg gagacaacac 301 tcctggctcc ccagagaggc gtgggtctgg ggctgagggc cagggcccgg atgcccaggt 361 tccgggacta gggccttggc agccagcggg ggtggggacc acgggcaccc agagaaggtc 421 ctccacacat cccagcgccg gctcccggcc atggagccct tgaagagcct cttcctcaag 481 agccctctag ggtcatggaa tggcagtggc agcgggggtg gtgggggcgg tggaggaggc 541 cggcctgagg ggtctccaaa ggcagcgggt tatgccaacc cggtgtggac agccctgttc 601 gactacgagc ccagtgggca ggatgagctg gccctgagga agggtgaccg tgtggaggtg 661 ctgtcccggg acgcagccat ctcaggagac gagggctggt gggcgggcca ggtgggtggc 721 caggtgggca tcttcccgtc caactatgtg tctcggggtg gcggcccgcc cccctgcgag 781 gtggccagct tccaggagct gcggctggag gaggtgatcg gcattggagg ctttggcaag 841 gtgtacaggg gcagctggcg aggtgagctg gtggctgtga aggcagctcg ccaggacccc 901 gatgaggaca tcagtgtgac agccgagagc gttcgccagg aggcccggct cttcgccatg 961 ctggcacacc ccaacatcat tgccctcaag gctgtgtgcc tggaggagcc caacctgtgc 1021 ctggtgatgg agtatgcagc cggtgggccc ctcagccgag ctctggccgg gcggcgcgtg 1081 cctccccatg tgctggtcaa ctgggctgtg cagattgccc gtgggatgca ctacctgcac 1141 tgcgaggccc tggtgcccgt catccaccgt gatctcaagt ccaacaacat tttgctgctg 1201 cagcccattg agagtgacga catggagcac aagaccctga agatcaccga ctttggcctg 1261 gcccgagagt ggcacaaaac cacacaaatg agtgccgcgg gcacctacgc ctggatggct 1321 cctgaggtta tcaaggcctc caccttctct aagggcagtg acgtctggag ttttggggtg 1381 ctgctgtggg aactgctgac cggggaggtg ccataccgtg gcattgactg ccttgctgtg 1441 gcctatggcg tagctgttaa caagctcaca ctgcccatcc catccacctg ccccgagccc 1501 ttcgcacagc ttatggccga ctgctgggcg caggaccccc accgcaggcc cgacttcgcc 1561 tccatcctgc agcagttgga ggcgctggag gcacaggtcc tacgggaaat gccgcgggac 1621 tccttccatt ccatgcagga aggctggaag cgcgagatcc agggtctctt cgacgagctg 1681 cgagccaagg aaaaggaact actgagccgc gaggaggagc tgacgcgagc ggcgcgcgag 1741 cagcggtcac aggcggagca gctgcggcgg cgcgagcacc tgctggccca gtgggagcta 1801 gaggtgttcg agcgcgagct gacgctgctg ctgcagcagg tggaccgcga gcgaccgcac 1861 gtgcgccgcc gccgcgggac attcaagcgc agcaagctcc gggcgcgcga cggcggcgag 1921 cgtatcagca tgccactcga cttcaagcac cgcatcaccg tgcaggcctc acccggcctt 1981 gaccggagga gaaacgtctt cgaggtcggg cctggggatt cgcccacctt tccccggttc 2041 cgagccatcc agttggagcc tgcagagcca ggccaggcat ggggccgcca gtccccccga 2101 cgtctggagg actcaagcaa tggagagcgg cgagcatgct gggcttgggg tcccagttcc 2161 cccaagcctg gggaagccca gaatgggagg agaaggtccc gcatggacga agccacatgg 2221 tacctggatt cagatgactc atccccctta ggatctcctt ccacaccccc agcactcaat 2281 ggtaaccccc cgcggcctag cctggagccc gaggagccca agaggcctgt ccccgcagag 2341 cgcggtagca gctctgggac gcccaagctg atccagcggg cgctgctgcg cggcaccgcc 2401 ctgctcgcct cgctgggcct tggccgcgac ctgcagccgc cgggaggccc aggacgcgag 2461 cgcggggagt ccccgacaac accccccacg ccaacgcccg cgccctgccc gaccgagccg 2521 cccccttccc cgctcatctg cttctcgctc aagacgcccg actccccgcc cactcctgca 2581 cccctgttgc tggacctggg tatccctgtg ggccagcggt cagccaagag cccccgacgt 2641 gaggaggagc cccgcggagg cactgtctca cccccaccgg ggacatcacg ctctgctcct 2701 ggcaccccag gcaccccacg ttcaccaccc ctgggcctca tcagccgacc tcggccctcg 2761 ccccttcgca gccgcattga tccctggagc tttgtgtcag ctgggccacg gccttctccc 2821 ctgccatcac cacagcctgc accccgccga gcaccctgga ccttgttccc ggactcagac 2881 cccttctggg actccccacc tgccaacccc ttccaggggg gcccccagga ctgcagggca 2941 cagaccaaag acatgggtgc ccaggccccg tgggtgccgg aagcggggcc ttgagtgggc 3001 caggccactc ccccgagctc cagctgcctt aggaggagtc acagcataca ctggaacagg 3061 agctgggtca gcctctgcag ctgcctcagt ttccccaggg accccacccc cctttggggg 3121 tcaggaacac tacactgcac aggaagcctt cacactggaa gggggacctg cgcccccaca 3181 tctgaaacct gtaggtcccc ccagctcacc tgccctactg gggcccaaca ctgtacccag 3241 ctggttggga ggaccagagc ctgtctcagg gaattgcctg ctggggtgat gcagggagga 3301 ggggaggtgc agggaagagg ggccggcctc agctgtcacc agcacttttg accaagtcct 3361 gctactgcgg cccctgccct agggcttaga gcatggacct cctgccctgg gggtcatctg 3421 gggccagggc tctctggatg ccttcctgct gccccagcca gggttggagt cttagcctcg 3481 ggatccagtg aagccagaag ccaaataaac tcaaaagctg tctccccaca a // LOCUS HSU07821 1813 bp mRNA PRI 14-JUN-1994 DEFINITION Human alcohol dehydrogenase (ADH7) mRNA, complete cds. ACCESSION U07821 NID g499097 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1813) AUTHORS Satre,M.A., Zgombic-Knight,M. and Duester,G. TITLE The complete structure of human class IV alcohol dehydrogenase (retinol dehydrogenase) determined from the ADH7 gene JOURNAL J. Biol. Chem. 269, 15606-15612 (1994) MEDLINE 94253145 REFERENCE 2 (bases 1 to 1813) AUTHORS Duester,G. TITLE Direct Submission JOURNAL Submitted (17-MAR-1994) Gregg Duester, Cancer Research Center, La Jolla Cancer Research Foundation, 10901 North Torrey Pines Road, La Jolla, CA 92037, USA FEATURES Location/Qualifiers source 1..1813 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="MS7" /clone_lib="lambda gt11 cDNA library" /tissue_type="stomach" /dev_stage="adult" 5'UTR 1..84 gene 85..1209 /gene="ADH7" CDS 85..1209 /gene="ADH7" /codon_start=1 /product="alcohol dehydrogenase" /db_xref="PID:g499098" /translation="MGTAGKVIKCKAAVLWEQKQPFSIEEIEVAPPKTKEVRIKILAT GICRTDDHVIKGTMVSKFPVIVGHEATGIVESIGEGVTTVKPGDKVIPLFLPQCRECN ACRNPDGNLCIRSDITGRGVLADGTTRFTCKGKPVHHFMNTSTFTEYTVVDESSVAKI DDAAPPEKVCLIGCGFSTGYGAAVKTGKVKPGSTCVVFGLGGVGLSVIMGCKSAGASR IIGIDLNKDKFEKAMAVGATECISPKDSTKPISEVLSEMTGNNVGYTFEVIGHLETMI DALASCHMNYGTSVVVGVPPSAKMLTYDPMLLFTGRTWKGCVFGGLKSRDDVPKLVTE FLAKKFDLDQLITHVLPFKKISEGFELLNSGQSIRTVLTF" 3'UTR 1210..1813 BASE COUNT 562 a 332 c 408 g 511 t ORIGIN 1 tgctgttata tacaacagag tgaactgagc atcagtcaca aaaagtctat gtttgcagaa 61 atacagatcc aagagaaaga caggatgggc actgctggaa aagttattaa atgcaaagca 121 gctgtgcttt gggagcagaa gcaacccttc tccattgagg aaatagaagt tgccccacca 181 aagactaaag aagttcgcat taagattttg gccacaggaa tctgtcgcac agatgaccat 241 gtgataaaag gaacaatggt gtccaagttt ccagtgattg tgggacatga ggcaactggg 301 attgtagaga gcattggaga aggagtgact acagtgaaac caggtgacaa agtcatccct 361 ctctttctgc cacaatgtag agaatgcaat gcttgtcgca acccagatgg caacctttgc 421 attaggagcg atattactgg tcgtggagta ctggctgatg gcaccaccag atttacatgc 481 aagggcaaac cagtccacca cttcatgaac accagtacat ttaccgagta cacagtggtg 541 gatgaatctt ctgttgctaa gattgatgat gcagctcctc ctgagaaagt ctgtttaatt 601 ggctgtgggt tttccactgg atatggcgct gctgttaaaa ctggcaaggt caaacctggt 661 tccacttgcg tcgtctttgg cctgggagga gttggcctgt cagtcatcat gggctgtaag 721 tcagctggtg catctaggat cattgggatt gacctcaaca aagacaaatt tgagaaggcc 781 atggctgtag gtgccactga gtgtatcagt cccaaggact ctaccaaacc catcagtgag 841 gtgctgtcag aaatgacagg caacaacgtg ggatacacct ttgaagttat tgggcatctt 901 gaaaccatga ttgatgccct ggcatcctgc cacatgaact atgggaccag cgtggttgta 961 ggagttcctc catcagccaa gatgctcacc tatgacccga tgttgctctt cactggacgc 1021 acatggaagg gatgtgtctt tggaggtttg aaaagcagag atgatgtccc aaaactagtg 1081 actgagttcc tggcaaagaa atttgacctg gaccagttga taactcatgt tttaccattt 1141 aaaaaaatca gtgaaggatt tgagctgctc aattcaggac aaagcattcg aacggtcctg 1201 acgttttgag atccaaagtg gcaggaggtc tgtgttgtca tggtgaactg gagtttctct 1261 tgtgagagtt ccctcatctg aaatcatgta tctgtctcac aaatacaagc ataagtagaa 1321 gatttgttga agacatagaa cccttataaa gaattattaa cctttataaa catttaaagt 1381 cttgtgagca cctgggaatt agtataataa caatgttaat atttttgatt tacattttgt 1441 aaggctataa ttgtatcttt taagaaaaca tacacttgga tttctatgtt gaaatggaga 1501 tttttaagag ttttaaccag ctgctgcaga tatatatctc aaaacagata tagcgtataa 1561 agatatagta aatgcatctc ccagagtaat attcacttaa cacattgaaa ctattatttt 1621 ttagatttga atataaatgt attttttaaa cacttgttat gagttaactt ggattacatt 1681 ttgaaatcag ttcattccat gatgcatatt actggattag attaagaaag acagaaaaga 1741 ttaagggacg ggcacatttt tcaacgatta agaatcatca ttacataact tggtgaaact 1801 gaaaaagtat atc // LOCUS HSU07882 1773 bp mRNA PRI 07-JUN-1994 DEFINITION Human delta opioid receptor mRNA, complete cds. ACCESSION U07882 NID g497313 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1773) AUTHORS Knapp,R.J., Malatynska,E., Fang,L., Xiaoping,L., Nguyen,M., Santoro,G., Varga,E.V., Hruby,V.J., Roeske,W.R. and Yamamura,H.I. TITLE Identification of a human delta opioid receptor: Cloning and expression JOURNAL Life Sci. 54, PL463-PL469 (1994) MEDLINE 94260835 REFERENCE 2 (bases 1 to 1773) AUTHORS Knapp,R.J. TITLE Direct Submission JOURNAL Submitted (21-MAR-1994) Richard J. Knapp, Pharmacology, The University of Arizona College of Medicine, Tucson, AZ 85724, USA FEATURES Location/Qualifiers source 1..1773 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="78x44" /clone_lib="Lambda ZAP II human striatum and Lamda ZAP human temporal cortex from Stratagene" /tissue_type="cerebral cortex and striatum" 5'UTR 1..233 CDS 234..1352 /codon_start=1 /product="delta opioid receptor" /db_xref="PID:g497314" /translation="MEPAPSAGAELQPPLFANASDAYPSAFPSAGANASGPPGPGSAS SLALAIAITALYSAVCAVGLLGNVLVMFGIVRYTKMKTATNIYIFNLALADALATSTL PFQSAKYLMETWPFGELLCKAVLSIDYYNMFTSIFTLTMMSVDRYIAVCHPVKALDFR TPAKAKLINICIWVLASGVGVPIMVMAVTRPRDGAVVCMLQFPSPSWYWDTVTKICVF LFAFVVPILIITVCYGLMLLRLRSVRLLSGSKEKDRSLRRITRMVLVVVGAFVVCWAP IHIFVIVWTLVDIDRRDPLVVAALHLCIALGYANSSLNPVLYAFLDENFKRCFRQLCR KPCGRPDPSSFSRPREATARERVTACTPSDGPGGGRAA" 3'UTR 1353..1773 BASE COUNT 263 a 602 c 591 g 317 t ORIGIN 1 ccgaggagcc tgcgctgctc ctggctcaca gcgctccggg cgaggagagc gggcggaccg 61 gggggctggg ccggtgcggg cggcgaggca ggcggacgag gcgcagagac agcggggcgg 121 ccggggcgcg gcacgcggcg ggtcggggcc ggcctctgcc ttgccgctcc cctcgcgtcg 181 gatccccgcg cccaggcagc cggtggagag ggacgcggcg gacgccggca gccatggaac 241 cggccccctc cgccggcgcc gagctgcagc ccccgctctt cgccaacgcc tcggacgcct 301 accctagcgc cttccccagc gctggcgcca atgcgtcggg gccgccagga ccggggagcg 361 cctcgtccct cgccctggca atcgccatca ccgcgctcta ctcggccgtg tgcgccgtgg 421 ggctgctggg caacgtgctt gtcatgttcg gcatcgtccg gtacactaag atgaagacgg 481 ccaccaacat ctacatcttc aacctggcct tagccgatgc gctggccacc agcacgctgc 541 ctttccagag tgccaagtac ctgatggaga cgtggccctt cggcgagctg ctctgcaagg 601 ctgtgctctc catcgactac tacaatatgt tcaccagcat cttcacgctc accatgatga 661 gtgttgaccg ctacatcgct gtctgccacc ctgtcaaggc cctggacttc cgcacgcctg 721 ccaaggccaa gctgatcaac atctgtatct gggtcctggc ctcaggcgtt ggcgtgccca 781 tcatggtcat ggctgtgacc cgtccccggg acggtgcagt ggtgtgcatg ctccagttcc 841 ccagccccag ctggtactgg gacacggtga ccaagatctg cgtgttcctc ttcgccttcg 901 tggtgcccat cctcatcatc accgtgtgct atggcctcat gctgctgcgc ctgcgcagtg 961 tgcgcctgct gtcgggctcc aaggagaagg accgcagcct gcggcgcatc acgcgcatgg 1021 tgctggtggt tgtgggcgcc ttcgtggtgt gttgggcgcc catccacatc ttcgtcatcg 1081 tctggacgct ggtggacatc gaccggcgcg acccgctggt ggtggctgcg ctgcacctgt 1141 gcatcgcgct gggctacgcc aatagcagcc tcaaccccgt gctctacgct ttcctcgacg 1201 agaacttcaa gcgctgcttc cgccagctct gccgcaagcc ctgcggccgc ccagacccca 1261 gcagcttcag ccggccccgc gaagccacgg cccgcgagcg tgtcaccgcc tgcaccccgt 1321 ccgatggtcc cggcggtggc cgtgccgcct gaccaggcca tccggccccc agacgcccct 1381 ccctagttgt acccggaggc cacatgagtc ccagtgggag gcgcgagcca tgatgtggag 1441 tggggccagt agataggtcg gagggctttg ggaccgccag atggggcctc tgtttcggag 1501 acgggaccgg gccgctagat gggcatgggg tgggcctctg gtttggggcg aggcagagga 1561 cagatcaatg gcgcagtgcc tctggtctgg gtgcccccgt ccacggctct aggtggggcg 1621 ggaaagccag tgactccagg agaggagcgg gacctgtggc tctacaactg agtccttaaa 1681 cagggcatct ccaggaaggc ggggcttcaa ccttgagaca gcttcggttt ctaacttgga 1741 gccggacttt cggagttggg gggtccgggg ccc // LOCUS HSU07919 3442 bp mRNA PRI 07-OCT-1995 DEFINITION Human aldehyde dehydrogenase 6 mRNA, complete cds. ACCESSION U07919 NID g995897 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3442) AUTHORS Hsu,L.C., Chang,W.C., Hiraoka,L. and Hsieh,C.L. TITLE Molecular cloning, genomic organization, and chromosomal localization of an additional human aldehyde dehydrogenase gene, ALDH6 JOURNAL Genomics 24 (2), 333-341 (1994) MEDLINE 95213025 REFERENCE 2 (bases 1 to 3442) AUTHORS Hsu,L.C. TITLE Direct Submission JOURNAL Submitted (22-MAR-1994) Lily C. Hsu, Beckman Res. Inst. of the City of Hope, Biochemical Genetics, 1450 E. Duarte Rd., Duarte, CA 91010-0269, USA FEATURES Location/Qualifiers source 1..3442 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="15" /map="15q26" /tissue_type="salivary gland" CDS 53..1591 /codon_start=1 /product="aldehyde dehydrogenase 6" /db_xref="PID:g544482" /translation="MATANGAVENGQPDGKPPALPRPIRNLEVKFTKIFINNEWHESK SGKKFATCNPSTREQICEVEEGDKPDVDKAVEAAQVAFQRGSPWRRLDALSRGRLLHQ LADLVERDRATLAALETMDTGKPFLHAFFIDLEGCIRTLRYFAGWADKIQGKTIPTDD NVVCFTRHEPIGVCGAITPWNFPLLMLVWKLAPALCCGNTMVLKPAEQTPLTALYLGS LIKEAGFPPGVVNIVPGFGPTVGAAISSHPQINKIAFTGSTEVGKLVKEAASRSNLKR VTLELGGKNPCIVCADADLDLAVECAHQGVFFNQGQCCTAASRVFVEEQVYSEFVRRS VEYAKKRPVGDPFDVKTEQGPQIDQKQFDKILELIESGKKEGAKLECGGSAMEDKGLF IKPTVFSEVTDNMRIAKEEIFGPVQPILKFKSIEEVIKRANSTDYGLTAAVFTKNLDK ALKLASALESGTVWINCYNALYAQAPFGGFKMSGNGRELGEYALAEYTEVKTVTIKLG DKNP" polyA_site 3442 /note="15 A nucleotides" BASE COUNT 920 a 760 c 864 g 898 t ORIGIN 1 agccggtgcg ccgcagacta gggcgcctcg ggccagggag cgcggaggag ccatggccac 61 cgctaacggg gccgtggaaa acgggcagcc ggacgggaag ccgccggccc tgccgcgccc 121 catccgcaac ctggaggtca agttcaccaa gatatttatc aacaatgaat ggcacgaatc 181 caagagtggg aaaaagtttg ctacatgtaa cccttcaact cgggagcaaa tatgtgaagt 241 ggaagaagga gataagcccg acgtggacaa ggctgtggag gctgcacagg ttgccttcca 301 gaggggctcg ccatggcgcc ggctggatgc cctgagtcgt gggcggctgc tgcaccagct 361 ggctgacctg gtggagaggg accgcgccac cttggccgcc ctggagacga tggatacagg 421 gaagccattt cttcatgctt ttttcatcga cctggagggc tgtattagaa ccctcagata 481 ctttgcaggg tgggcagaca aaatccaggg caagaccatc cccacagatg acaacgtcgt 541 atgcttcacc aggcatgagc ccattggtgt ctgtggggcc atcactccat ggaacttccc 601 cctgctgatg ctggtgtgga agctggcacc cgccctctgc tgtgggaaca ccatggtcct 661 gaagcctgcg gagcagacac ctctcaccgc cctttatctc ggctctctga tcaaagaggc 721 cgggttccct ccaggagtgg tgaacattgt gccaggattc gggcccacag tgggagcagc 781 aatttcttct caccctcaga tcaacaagat cgccttcacc ggctccacag aggttggaaa 841 actggttaaa gaagctgcgt cccggagcaa tctgaagcgg gtgacgctgg agctgggggg 901 gaagaacccc tgcatcgtgt gtgcggacgc tgacttggac ttggcagtgg agtgtgccca 961 tcagggagtg ttcttcaacc aaggccagtg ttgcacggca gcctccaggg tgttcgtgga 1021 ggagcaggtc tactctgagt ttgtcaggcg gagcgtggag tatgccaaga aacggcccgt 1081 gggagacccc ttcgatgtca aaacagaaca ggggcctcag attgatcaaa agcagttcga 1141 caaaatctta gagctgatcg agagtgggaa gaaggaaggg gccaagctgg aatgcggggg 1201 ctcagccatg gaagacaagg ggctcttcat caaacccact gtcttctcag aagtcacaga 1261 caacatgcgg attgccaaag aggagatttt cgggccagtg caaccaatac tgaagttcaa 1321 aagtatcgaa gaagtgataa aaagagcgaa tagcaccgac tatggactca cagcagccgt 1381 gttcacaaaa aatctcgaca aagccctgaa gttggcttct gccttagagt ctggaacggt 1441 ctggatcaac tgctacaacg ccctctatgc acaggctcca tttggtggct ttaaaatgtc 1501 aggaaatggc agagaactag gtgaatacgc tttggccgaa tacacagaag tgaaaactgt 1561 caccatcaaa cttggcgaca agaacccctg aaggaaaggc ggggctcctt cctcaaacat 1621 cggacggcgg aatgtggcag atgaaatgtg ctggaggaaa aaaatgacat ttctgacctt 1681 cccgggacac attcttctgg aggctttaca tctactggag ttgaatgatt gctgttttcc 1741 tctcactctc ctgtttattc accagactgg ggatgcctat aggttgtctg tgaaatcgca 1801 gtcctgcctg gggagggagc tgttggccat ttctgtgttt ccctttaaac cagatcctgg 1861 agacagtgag atactcaggg cgttgttaac agggagtggt atttgaagtg tccagcagtt 1921 gcttgaaatg ctttgccgaa tctgactcca gtaagaatgt gggaaaaccc cctgtgtgtt 1981 ctgcaagcag ggctcttgca ccagcggtct cctcagggtg gacctgctta cagagcaagc 2041 cacgcctctt tccgaggtga aggtgggacc attccttggg aaaggattca cagtaaggtt 2101 ttttggtttt tgttttttgt tttcttgttt ttaaaaaaag gatttcacag tgagaaagtt 2161 ttggttagtg cataccgtgg aagggcgcca gggtctttgt ggattgcatg ttgacattga 2221 ccgtgagatt cggcttcaaa ccaatactgc ctttggaata tgacagaatc aatagcccag 2281 agagcttagt caaagacgat atcacggtct accttaacca aggcactttc ttaagcagaa 2341 aatattgttg aggttacctt tgctgctaaa gatccaatct tctaacgcca caacagcata 2401 gcaaatccta ggataattca cctcctcatt tgacaaatca gagctgtaat tcactttaac 2461 aaattacgca tttctatcac gttcactaac agcttatgat aagtctgtgt agtcttcctt 2521 ttctccagtt ctgttaccca atttagatta gtaaagcgta cacaactgga aagactgctg 2581 taataacaca gccttgttat ttttaagtcc tattttgata ttaatttctg attagttagt 2641 aaataacacc tggattctat ggaggacctc ggtcttcatc caagtggcct gagtatttca 2701 ctggcaggtt gtgaattttt cttttcctct ttgggaatcc aaatgatgat gtgcaatttc 2761 atgttttaac ttgggaaact gaaagtgttc ccatatagct tcaaaaacaa aaacaaatgt 2821 gttatccgac ggatactttt atggttacta actagtactt tcctaattgg gaaagtagtg 2881 cttaagtttg caaattaagt tggggagggc aataataaaa tgagggcccg taacagaacc 2941 agtgtgtgta taacgaaaac catgtataaa atgggcctat cacccttgtc agagatataa 3001 attaccacat ttggcttccc ttcatcagct aacacttatc acttatacta ccaataactt 3061 gttaaatcag gatttggctt catacactga attttcagta ttttatctca agtagatata 3121 gacactaacc ttgatagtga tacgttagag ggttcctatt cttccattgt acgataatgt 3181 ctttaatatg aaatgctaca ttatttataa ttggtagagt tattgtatct ttttatagtt 3241 gtaagtacac agaggtggta tatttaaact tctgtaatat actgtattta gaaatggaaa 3301 tatatatagt gttaggtttc acttctttta aggtttaccc ctgtggtgtg gtttaaaaat 3361 ctataggcct gggaattccg atcctagctg cagatcgcat cccacaatgc gagaatgata 3421 aaataaaatt ggatatttga ga // LOCUS HSU07932 3282 bp mRNA PRI 30-AUG-1994 DEFINITION Human AF-17 mRNA, complete cds. ACCESSION U07932 NID g532761 KEYWORDS t[11-17] chromosome translocation; oncogenes; protein-protein interactions. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3282) AUTHORS Prasad,R., Leshkowitz,D., Gu,Y., Alder,H., Nakamura,T., Saito,H., Huebner,K., Berger,R., Croce,C. and Canaani,E. TITLE Leucine zipper dimerization motif encoded by the AF-17 gene fused to ALL-1 (MLL) in acute leukemia JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91, 8107-8111 (1994) MEDLINE 94336695 REFERENCE 2 (bases 1 to 3282) AUTHORS Gu,Y. TITLE Direct Submission JOURNAL Submitted (22-MAR-1994) Yansong Gu, Jefferson Cancer Institute, Thomas Jefferson University, 233 South 10th. Street, Philadelphia, PA 19107, USA FEATURES Location/Qualifiers source 1..3282 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="a4, cl1, cl3, cl13," /chromosome="17" /map="17q21" /cell_line="ALL-1, KCL-22" /cell_type="Ph-positive acute lymphocytic leukemia myeloid acute phase of chronic myeloid leukemia" gene 1..3282 /gene="AF-17" CDS 1..3282 /gene="AF-17" /codon_start=1 /function="unknown" /db_xref="PID:g532762" /translation="MKEMVGGCCVCSDERGWAENPLVYCDGHACSVAVHQACYGIVQV PTGPWFCRKCESQERAARVRCELCPHKDGALKRTDNGGWAHVVCALYIPEVQFANVLT MEPIVLQYVPHDRFNKTCYICEETGRESKAASGACMTCNRHGCRQAFHVTCAQMAGLL CEEEVLEVDNVKYCGYCKYHFSKMKTSRHSSGGGGGGAGGGGGSMGGGGSGFISGRRS RSASPSTQQEKHPTHHERGQKKSRKDKERLKQKHKKRPESPPSILTPPVVPTADKVSS SASSSSHHEASTQETSESSRESKGKKSSSHSLSHKGKKLSSGKGVSSFTSASSSSSSS SSSSGGPFQPAVSSLQSSPDFSAFPKLEQPEEDKYSKPTAPAPSAPPSPSAPEPPKAD LFEQKVVFSGFGPIMRFSTTTSSSGRARAPSPGDYKSPHVTGSGASAGTHKRMPALSA TPVPADETPETGLKEKKHKASKRSRHGPGRPKGSRNKEGTGGPAAPSLPSAQLAGFTA TAASPFSGGSLVSSGLGGLSSRTFGPSGSLPSLSLESPLLGAGIYTSNKDPISHSGGM LRAVCSTPLSSSLLGPPGTSALPRLSRSPFTSTLPSSSASISTTQVFSLAGSTFSLPS THIFGTPMGAVNPLLSQAESSHTEPDLEDCSFRCRGTSPQESLSSMSPISSLPALFDQ TASAPCGGGQLDPAAPGTTNMEQLLEKQGDGEAGVNIVEMLKALHALQKENQRLQEQI LSLTAKKERLQILNVQLSVPFPALPAALPAANGPVPGPYGLPPQAGSSDSLSTSKSPP GKSSLGLDNSLSTSSEDPHSGCPSRSSSSLSFHSTPPPLPLLQQSPATLPLALPGAPA PLPPQPQNGLGRAPGAAGLGAMPMAEGLLGGLAGSGGLPLNGLLGGLNGAAAPNPASL SQAGGAPTLQLPGCLNSLTEQQRHLLQQQEQQLQQLQQLLASPQLTPEHQTVVYQMIQ QIQQKRELQRLQMAGGSQLPMASLLAGSSTPLLSAGTPGLLPTASAPPLLPAGALVAP SLGNNTSLMAAAAAAAAVAAAGGPPVLTAQTNPFLSLSGAEGSGGGPKGGTADKGASA NQEKG" misc_feature 2185..2292 /gene="AF-17" /note="encodes a leucine zipper motif, amino acids 729-764" BASE COUNT 628 a 1141 c 959 g 554 t ORIGIN 1 atgaaggaga tggtaggagg ctgctgcgta tgttcggacg agaggggctg ggccgagaac 61 ccgctggtct actgcgatgg gcacgcgtgc agcgtggccg tccaccaagc ttgctatggc 121 atcgttcagg tgccaacggg accctggttc tgccggaaat gtgaatctca ggagcgagca 181 gccagggtga ggtgtgagct gtgcccacac aaagacgggg cattgaagag gactgataat 241 ggaggctggg cacacgtggt gtgtgccctc tacatccccg aggtgcaatt tgccaacgtg 301 ctcaccatgg agcccatcgt gctgcagtac gtgcctcatg atcgcttcaa caagacctgt 361 tacatctgcg aggagacggg ccgggagagc aaggcggcct cgggagcctg catgacctgt 421 aaccgccatg gatgtcgaca agctttccac gtcacctgtg cccaaatggc aggcttgctg 481 tgtgaggaag aagtgctgga ggtggacaac gtcaagtact gcggctactg caaataccac 541 ttcagcaaga tgaagacatc ccggcacagc agcgggggag gcggaggagg cgctggagga 601 ggaggtggca gcatgggggg aggtggcagt ggtttcatct ctgggaggag aagccggtca 661 gcctcaccat ccacgcagca ggagaagcac cccacccacc acgagagggg ccagaagaag 721 agtcgaaagg acaaagaacg ccttaagcag aagcacaaga agcggcctga gtcgcccccc 781 agcatcctca ccccgcccgt ggtccccact gctgacaagg tctcctcctc ggcttcctct 841 tcctcccacc acgaggccag cacgcaggag acctctgaga gcagcaggga gtcaaagggg 901 aaaaagtctt ccagccatag cctgagtcat aaagggaaga aactgagcag tgggaaaggt 961 gtgagcagtt ttacctccgc ctcctcttct tcctcctcct cttcctcctc ctctgggggg 1021 cccttccagc ctgcagtctc gtccctgcag agctcccctg acttctctgc attccccaag 1081 ctggagcagc cagaggagga caagtactcc aagcccacag cccccgcccc ttcagcccct 1141 ccttctccct cagctcccga gccccccaag gctgaccttt ttgagcagaa ggtggtcttc 1201 tctggctttg ggcccatcat gcgcttctcc accaccacct ccagctcagg ccgggcccgg 1261 gcgccctccc ctggggacta taagtctccc cacgtcacgg ggtctggggc ctcggcaggc 1321 acccacaaac ggatgcccgc actgagtgcc acccctgtgc ctgctgatga gacccctgag 1381 acaggcctga aggagaagaa gcacaaagcc agcaagagga gccgccatgg gccaggccgt 1441 cccaagggca gccggaacaa ggagggcact gggggcccag ctgccccatc cttgcccagt 1501 gcccagctgg ctggctttac cgccactgct gcctcaccct tctctggagg ttccctggtc 1561 agctccggcc tgggaggtct gtcctcccga acctttgggc cttctgggag cttgcccagc 1621 ttgagcctgg agtccccctt actaggggca ggcatctaca ccagtaataa ggaccccatc 1681 tcccacagtg gcgggatgct gcgggctgtc tgcagcaccc ctctctcctc cagcctcctg 1741 gggcccccag ggacctcggc cctgccccgc ctcagccgct ccccgttcac cagcaccctc 1801 ccctcctctt ctgcttctat ctccaccact caggtgtttt ctctggctgg ctctaccttt 1861 agcctccctt ctacccacat ctttggaacc cccatgggtg ccgttaatcc cctcctctcc 1921 caagctgaga gcagccacac agagccagac ctggaggact gcagcttccg gtgtcggggg 1981 acctcccctc aggagagtct gtcttccatg tcccccatca gcagcctccc cgcactcttc 2041 gaccagacag cctctgcacc ctgtgggggc ggccagttag acccggcggc cccagggacg 2101 actaacatgg agcagcttct ggagaagcag ggcgacgggg aggccggcgt caacatcgtg 2161 gagatgctga aggcgctgca cgcgctgcag aaggagaacc agcggctgca agagcagatc 2221 ctgagcctga cggccaaaaa ggagcggctg cagattctca acgtgcagct ctctgtgccc 2281 ttccctgccc tgcctgctgc cctgcctgcc gccaacggcc ctgtccctgg gccctatggc 2341 ctgcctcccc aagccgggag cagcgactcc ttgagcacca gcaagagccc tccgggaaag 2401 agcagcctcg gcctggacaa ctcgctgtcc acttcttctg aggacccaca ctcaggctgc 2461 ccgagccgca gcagctcgtc gctgtccttc cacagcacgc ccccaccgct gcccctcctc 2521 cagcagagcc ctgccactct gcccctggcc ctgcctgggg cccctgcccc actcccgccc 2581 cagccgcaga acgggttggg ccgggcaccc ggggcagcgg ggctgggggc catgcccatg 2641 gctgaggggc tgttgggggg gctggcaggc agtgggggcc tgcccctcaa tgggctcctt 2701 ggggggttga atggggccgc tgcccccaac cccgcaagct tgagccaggc tggcggggcc 2761 cccacgctgc agctgccagg ctgtctcaac agccttacag agcagcagag acatctcctt 2821 cagcagcaag agcagcagct ccagcaactc cagcagctcc tggcctcccc gcagctgacc 2881 ccggaacacc agactgttgt ctaccagatg atccagcaga tccagcagaa acgggagctg 2941 cagcgtctgc agatggctgg gggctcccag ctgcccatgg ccagcctgct ggcaggaagc 3001 tccaccccgc tgctgtctgc gggtacccct ggcctgctgc ccacagcgtc tgctccaccc 3061 ctgctgcccg ctggagccct agtggctccc tcgcttggca acaacacaag tctcatggcc 3121 gcagcagctg cagctgcagc agtagcagca gcaggcggac ctccagtcct cactgcccag 3181 accaacccct tcctcagcct gtcgggagca gagggcagtg gcggtggccc caaaggaggg 3241 accgctgaca aaggagcctc agccaaccag gaaaaaggct aa // LOCUS HSU08015 2743 bp mRNA PRI 08-JUL-1994 DEFINITION Human NF-ATc mRNA, complete cds. ACCESSION U08015 NID g500631 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2743) AUTHORS Northrop,J.P., Ho,S.N., Chen,L., Thomas,D.J., Timmerman,L.A., Nolan,G.P., Admon,A. and Crabtree,G.R. TITLE NF-AT components define a family of transcription factors targeted in T-cell activation JOURNAL Nature 369, 497-502 (1994) MEDLINE 94261186 REFERENCE 2 (bases 1 to 2743) AUTHORS Ho,S.N. TITLE Direct Submission JOURNAL Submitted (24-MAR-1994) Steffan N. Ho, Department of Pathology, Stanford University, Howard Hughes Medical Institute, Beckman Center for Molecular and Genetic Medicine, Stanford, CA 94305, USA FEATURES Location/Qualifiers source 1..2743 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="clones 2.1, 1.9, 6/11 and 27" /clone_lib="Jurkat cDNA lib of D. Thomas and human peripheral blood cDNA lib of A. Krensky" /cell_line="Jurkat T cell leukemia line" /cell_type="T-lymophocyte" mRNA 1..2743 CDS 240..2390 /standard_name="T cell transcription factor NF-ATc" /note="cytosolic component of the nuclear factor of activated T cells" /codon_start=1 /product="NF-ATc" /db_xref="PID:g500632" /translation="MPSTSFPVPSKFPLGPAAAVFGRGETLGPAPRAGGTMKSAEEEH YGYASSNVSPALPLPTAHSTLPAPCHNLQTSTPGIIPPADHPSGYGAALDGGPAGYFL SSGHTRPDGAPALESPRIEITSCLGLYHNNNQFFHDVEVEDVLPSSKRSPSTATLSLP SLEAYRDPSCLSPASSLSSRSCNSEASSYESNYSYPYASPQTSPWQSPCVSPKTTDPE EGFPRGLGACTLLGSPQHSPSTSPRASVTEESWLGARSSRPASPCNKRKYSLNGRQPP YSPHHSPTPSPHGSPRVSVTDDSWLGNTTQYTSSAIVAAINALTTDSSLDLGDGVPVK SRKTTLEQPPSVALKVEPVGEDLGSPPPPADFAPEDYSSFQHIRKGGFCDQYLAVPQH PYQWAKPKPLSPTSYMSPTLPALDWQLPSHSGPYELRIEVQPKSHHRAHYETEGSRGA VKASAGGHPIVQLHGYLENEPLMLQLFIGTADDRLLRPHAFYQVHRITGKTVSTTSHE AILSNTKVLEIPLLPENSMRAVIDCAGILKLRNSDIELRKGETDIGRKNTRVRLVFRV HVPQPSGRTLSLQVASNPIECSQRSAQELPLVEKQSTDSYPVVGGKKMVLSGHNFLQD SKVIFVEKAPDGHHVWEMEAKTDRDLCKPNSLVVEIPPFRNQRITSPVHVSFYVCNGK RKRSQYQRFTYLPANGNAIFLTVSREHERVGCFF" polyA_signal 2727..2732 polyA_site 2743 /note="46 A residues" BASE COUNT 529 a 959 c 788 g 467 t ORIGIN 1 gaattccgca gggcgcgggc accggggcgc gggcagggct cggagccacc gcgcaggtcc 61 tagggccgcg gccgggcccc gccacgcgcg cacacgcccc tcgatgactt tcctccgggg 121 cgcgcggcgc tgagcccggg gcgagggctg tcttcccgga gacccgaccc cggcagcgcg 181 gggcggccac ttctcctgtg cctccgcccg ctgctccact ccccgccgcc gccgcgcgga 241 tgccaagcac cagctttcca gtcccttcca agtttccact tggccctgcg gctgcggtct 301 tcgggagagg agaaactttg gggcccgcgc cgcgcgccgg cggcaccatg aagtcagcgg 361 aggaagaaca ctatggctat gcatcctcca acgtcagccc cgccctgccg ctccccacgg 421 cgcactccac cctgccggcc ccgtgccaca accttcagac ctccacaccg ggcatcatcc 481 cgccggcgga tcacccctcg gggtacggag cagctttgga cggtgggccc gcgggctact 541 tcctctcctc cggccacacc aggcctgatg gggcccctgc cctggagagt cctcgcatcg 601 agataacctc gtgcttgggc ctgtaccaca acaataacca gtttttccac gatgtggagg 661 tggaagacgt cctccctagc tccaaacggt ccccctccac ggccacgctg agtctgccca 721 gcctggaggc ctacagagac ccctcgtgcc tgagcccggc cagcagcctg tcctcccgga 781 gctgcaactc agaggcctcc tcctacgagt ccaactactc gtacccgtac gcgtcccccc 841 agacgtcgcc atggcagtct ccctgcgtgt ctcccaagac cacggacccc gaggagggct 901 ttccccgcgg gctgggggcc tgcacactgc tgggttcccc gcagcactcc ccctccacct 961 cgccccgcgc cagcgtcact gaggagagct ggctgggtgc ccgctcctcc agacccgcgt 1021 ccccttgcaa caagaggaag tacagcctca acggccggca gccgccctac tcaccccacc 1081 actcgcccac gccgtccccg cacggctccc cgcgggtcag cgtgaccgac gactcgtggt 1141 tgggcaacac cacccagtac accagctcgg ccatcgtggc cgccatcaac gcgctgacca 1201 ccgacagcag cctggacctg ggagatggcg tccctgtcaa gtcccgcaag accaccctgg 1261 agcagccgcc ctcagtggcg ctcaaggtgg agcccgtcgg ggaggacctg ggcagccccc 1321 cgcccccggc cgacttcgcg cccgaagact actcctcttt ccagcacatc aggaagggcg 1381 gcttctgcga ccagtacctg gcggtgccgc agcaccccta ccagtgggcg aagcccaagc 1441 ccctgtcccc tacgtcctac atgagcccga ccctgcccgc cctggactgg cagctgccgt 1501 cccactcagg cccgtatgag cttcggattg aggtgcagcc caagtcccac caccgagccc 1561 actacgagac ggagggcagc cggggggccg tgaaggcgtc ggccggagga caccccatcg 1621 tgcagctgca tggctacttg gagaatgagc cgctgatgct gcagcttttc attgggacgg 1681 cggacgaccg cctgctgcgc ccgcacgcct tctaccaggt gcaccgcatc acagggaaga 1741 ccgtgtccac caccagccac gaggctatcc tctccaacac caaagtcctg gagatcccac 1801 tcctgccgga gaacagcatg cgagccgtca ttgactgtgc cggaatcctg aaactcagaa 1861 actccgacat tgaacttcgg aaaggagaga cggacatcgg gaggaagaac acacgggtac 1921 ggctggtgtt ccgcgttcac gtcccgcaac ccagcggccg cacgctgtcc ctgcaggtgg 1981 cctccaaccc catcgaatgc tcccagcgct cagctcagga gctgcctctg gtggagaagc 2041 agagcacgga cagctatccg gtcgtgggcg ggaagaagat ggtcctgtct ggccacaact 2101 tcctgcagga ctccaaggtc attttcgtgg agaaagcccc agatggccac catgtctggg 2161 agatggaagc gaaaactgac cgggacctgt gcaagccgaa ttctctggtg gttgagatcc 2221 cgccatttcg gaatcagagg ataaccagcc ccgttcacgt cagtttctac gtctgcaacg 2281 ggaagagaaa gcgaagccag taccagcgtt tcacctacct tcccgccaac ggtaacgcca 2341 tctttctaac cgtaagccgt gaacatgagc gcgtggggtg ctttttctaa agacgcagaa 2401 acgacgtcgc cgtaaagcag cgtggcgtgt tgcacattta actgtgtgat gtcccgttag 2461 tgagaccgag ccatcgatgc cctgaaaagg aaaggaaaag ggaagcttcg gatgcatttt 2521 ccttgatccc tgttgggggt ggggggcggg ggttgcatac tcagatagtc acggttattt 2581 tgcttcttgc gaatgtataa cagccaaggg gaaaacatgg ctcttctgct ccaaaaaact 2641 gagggggtcc tggtgtgcat ttgcacccta aagctgctta cggtgaaaag gcaaataggt 2701 atagctattt tgcaggcacc tttaggaata aactttgctt tta // LOCUS HSU08021 952 bp mRNA PRI 20-JUL-1994 DEFINITION Human nicotinamide N-methyltransferase (NNMT) mRNA, complete cds. ACCESSION U08021 NID g494988 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 952) AUTHORS Aksoy,S., Szumlanski,C.L. and Weinshilboum,R.M. TITLE Human liver nicotinamide N-methyltransferase: cDNA cloning, expression, and biochemical characterization JOURNAL J. Biol. Chem. 269, 14835-14840 (1994) MEDLINE 94237908 REFERENCE 2 (bases 1 to 952) AUTHORS Aksoy,S. TITLE Direct Submission JOURNAL Submitted (25-MAR-1994) Saime Aksoy, Pharmacology, Mayo Medical School, Mayo Clinic, Mayo Foundation, Rochester, MN 55905, USA FEATURES Location/Qualifiers source 1..952 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" gene 118..912 /gene="NNMT" CDS 118..912 /gene="NNMT" /EC_number="2.1.1.1" /note="cytosolic protein, 29 KDa monomer" /codon_start=1 /function="N-methylation, nicotinamide methylation" /product="nicotinamide N-methyltransferase" /db_xref="PID:g494989" /translation="MESGFTSKDTYLSHFNPRDYLEKYYKFGSRHSAESQILKHLLKN LFKIFCLDGVKGDLLIDIGSGPTIYQLLSACESFKEIVVTDYSDQNLQELEKWLKKEP EAFDWSPVVTYVCDLEGNRVKGPEKEEKLRQAVKQVLKCDVTQSQPLGAVPLPPADCV LSTLCLDAACPDLPTYCRALRNLGSLLKPGGFLVIMDALKSSYYMIGEQKFSSLPLGR EAVEAAVKEAGYTIEWFEVISQSYSSTMANNEGLFSLVARKLSRPL" polyA_signal 927..932 BASE COUNT 235 a 242 c 262 g 213 t ORIGIN 1 tgaactctgg atgctgttag cctgagactc aggaagacaa cttctgcagg gtcactccct 61 ggcttctgga ggaaagagaa ggagggcagt gctccagtgg tacagaagtg agacataatg 121 gaatcaggct tcacctccaa ggacacctat ctaagccatt ttaaccctcg ggattaccta 181 gaaaaatatt acaagtttgg ttctaggcac tctgcagaaa gccagattct taagcacctt 241 ctgaaaaatc ttttcaagat attctgccta gacggtgtga agggagacct gctgattgac 301 atcggctctg gccccactat ctatcagctc ctctctgctt gtgaatcctt taaggagatc 361 gtcgtcactg actactcaga ccagaacctg caggagctgg agaagtggct gaagaaagag 421 ccagaggcct ttgactggtc cccagtggtg acctatgtgt gtgatcttga agggaacaga 481 gtcaagggtc cagagaagga ggagaagttg agacaggcgg tcaagcaggt gctgaagtgt 541 gatgtgactc agagccagcc actgggggcc gtccccttac ccccggctga ctgcgtgctc 601 agcacactgt gtctggatgc cgcctgccca gacctcccca cctactgcag ggcgctcagg 661 aacctcggca gcctactgaa gccagggggc ttcctggtga tcatggatgc gctcaagagc 721 agctactaca tgattggtga gcagaagttc tccagcctcc ccctgggccg ggaggcagta 781 gaggctgctg tgaaagaggc tggctacaca atcgaatggt ttgaggtgat ctcgcaaagt 841 tattcttcca ccatggccaa caacgaagga cttttctccc tggtggcgag gaagctgagc 901 agacccctgt gatgcctgtg acctcaatta aagcaattcc tttgacctgt ca // LOCUS HSU08092 1444 bp mRNA PRI 06-APR-1994 DEFINITION Human histamine N-methyltransferase (HNMT) mRNA, complete cds. ACCESSION U08092 NID g468258 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1444) AUTHORS Girard,B., Otterness,D.M., Wood,T.C., Honchel,R., Wieben,E.D. and Weinshilboum,R.M. TITLE Human histamine N-methyltransferase pharmacogenetics: Cloning and expression of kidney cDNA JOURNAL Mol. Pharmacol. (1994) In press REFERENCE 2 (bases 1 to 1444) AUTHORS Otterness,D.M. TITLE Direct Submission JOURNAL Submitted (28-MAR-1994) Diane M. Otterness, Dept. of Pharmacology, Mayo Foundation, 723 Gugg, Rochester, MN 55905, USA FEATURES Location/Qualifiers source 1..1444 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Clontech Lambda gt11 library" /tissue_type="kidney" gene 40..918 /gene="HNMT" CDS 40..918 /gene="HNMT" /note="33 kda cytosolic protein" /codon_start=1 /function="N-methylation of histamine" /product="histamine N-methyltransferase" /db_xref="PID:g468259" /translation="MASSMRSLFSDHGKYVESFRRFLNHSTEHQCMQEFMDKKLPGII GRIGDTKSEIKILSIGGGAGEIDLQILSKVQAQYPGVCINNEVVEPSAEQIAKYKELV AKISNLENVKFAWHKETSSEYQSRMLEKKELQKWDFIHMIQMLYYVKDIPATLKFFHS LLGTNAKMLIIVVSGSSGWDKLWKKYGSRFPQDDLCQYITSDDLTQMLDNLGLKYECY DLLSTMDISDCFIDGNENGDLLWDFLTETCNFNATAPPDLRAELGKDLQEPEFSAKKE GKVLFNNTLSFIVIEA" polyA_signal 1423..1430 BASE COUNT 465 a 267 c 275 g 437 t ORIGIN 1 aaccttgctt cctgctctgt ctttctcaga aaaccaaata tggcatcttc catgaggagc 61 ttgttttctg accacgggaa atatgttgaa tctttccgga ggtttctcaa ccattccacg 121 gaacaccagt gcatgcagga attcatggac aagaagctgc caggcataat aggaaggatt 181 ggagacacaa aatcagaaat taagattcta agcataggcg gaggtgcagg tgaaattgat 241 cttcaaattc tctccaaagt tcaggctcaa tacccaggag tttgtatcaa caatgaagtt 301 gttgagccaa gtgctgaaca aattgccaaa tacaaagagc ttgtagccaa gatatcgaac 361 ctcgagaacg taaagtttgc ttggcataag gagacatcat ctgaatacca aagtagaatg 421 ttggagaaaa aggagcttca aaagtgggac tttattcata tgattcaaat gctgtattat 481 gtaaaagaca tcccagctac cctgaaattc ttccatagtc tcttaggtac caatgctaag 541 atgctcatta ttgttgtgtc aggaagcagt ggctgggaca agctgtggaa aaagtacgga 601 tcacgctttc cccaggatga cctctgccag tatatcacat cagatgacct cactcagatg 661 ctggacaacc tagggcttaa gtatgagtgc tatgaccttt tgtccaccat ggatatatct 721 gactgcttta ttgatggtaa tgaaaatgga gacctgcttt gggatttttt gactgaaacc 781 tgcaacttta atgccacagc accacctgat ctcagagcag agcttgggaa agatctacaa 841 gagcctgaat ttagtgctaa gaaagagggg aaggttcttt ttaataatac tctgagtttc 901 atagtgattg aggcataact atcaatcaca aaagtatatt caaaaattat attttgaaca 961 actcgaatca ctcatttgtt tccatattaa aatcacaaac tcatccatta atgtagataa 1021 agcactgttt ggatatgaga tgtagcaaat tccaatacat tattggactt ccatttggaa 1081 tcatatggga tactgctggt cttatcctgt ccctcctcca ggtagagaga ccacatgcag 1141 gctcaacata acataagcta gaaaaattag atgactgaat ttctatggca tattgataat 1201 aaaattcatt ccatttgctg attgtctgaa attttctaga atactaataa aatacatact 1261 atagattctt tattagtgaa gtatgcacta atcaatactt tgaacacaaa gcctgtgtta 1321 ctgatttggc cgttttgtga agaaacattt atctttgtac gttcttctat tgtgctttct 1381 atctaatttt tattaatttg taagagtaag cacctttaga atattaaaaa ttaattcttt 1441 atca // LOCUS HSU08098 1046 bp mRNA PRI 30-NOV-1995 DEFINITION Human estrogen sulfotransferase (STE) mRNA, complete cds. ACCESSION U08098 NID g488282 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1046) AUTHORS Aksoy,I.A., Wood,T.C. and Weinshilboum,R. TITLE Human liver estrogen sulfotransferase: identification by cDNA cloning and expression JOURNAL Biochem. Biophys. Res. Commun. 200 (3), 1621-1629 (1994) MEDLINE 94242031 REFERENCE 2 (bases 1 to 1046) AUTHORS Aksoy,I.A. TITLE Direct Submission JOURNAL Submitted (29-MAR-1994) Ibrahim A. Aksoy, Pharmacology, Mayo Medical School, 200 First St SW, Rochester, MN 55905, USA FEATURES Location/Qualifiers source 1..1046 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" gene 107..991 /gene="STE" CDS 107..991 /gene="STE" /note="cytosolic protein" /codon_start=1 /function="sulfation of estrogens" /product="estrogen sulfotransferase" /db_xref="PID:g488283" /translation="MNSELDYYEKFEEVHGILMYKDFVKYWDNVEAFQARPDDLVIAT YPKSGTTWVSEIVYMIYKEGDVEKCKEDVIFNRIPFLECRKENLMNGVKQLDEMNSPR IVKTHLPPELLPASFWEKDCKIIYLCRNAKDVAVSFYYFFLMVAGHPNPGSFPEFVEK FMQGQVPYGSWYKHVKSWWEKGKSPRVLFLFYEDLKEDIRKEVIKLIHFLERKPSEEL VDRIIHHTSFQEMKNNPSTNYTTLPDEIMNQKLSPFMRKGITGDWKNHFTVALNEKFD KHYEQQMKESTLKFRTEI" polyA_site 1022 BASE COUNT 350 a 171 c 213 g 312 t ORIGIN 1 agaagtggtt ctcatctttt tttgcagctt aagatctgcc ttggtatttg aagagatata 61 aactagatca atttctttca caggatcaac taaacagtgt accacaatga attctgaact 121 tgactattat gaaaagtttg aagaagtcca tgggattcta atgtataaag attttgtcaa 181 atattgggat aatgtggaag cgttccaggc aagaccagat gatcttgtca ttgccaccta 241 ccctaaatct ggtacaacct gggttagtga aattgtgtat atgatctata aagagggtga 301 tgtggaaaag tgcaaagaag atgtaatttt taatcgaata cctttcctgg aatgcagaaa 361 agaaaacctc atgaatggag taaaacaatt agatgagatg aattctccta gaattgtgaa 421 gactcatttg ccacctgaac ttcttcctgc ctcattttgg gaaaaggatt gtaagataat 481 ctatctttgc cggaatgcaa aggatgtggc tgtttccttt tattatttct ttctaatggt 541 ggctggtcat ccaaatcctg gatcctttcc agagtttgtg gagaaattca tgcaaggaca 601 ggttccttat ggttcctggt ataaacatgt aaaatcttgg tgggaaaagg gaaagagtcc 661 acgtgtacta tttcttttct acgaagacct gaaagaggat atcagaaaag aggtgataaa 721 attgatacat ttcctggaaa ggaagccatc agaggagctt gtggacagga ttatacatca 781 tacttcgttc caagagatga agaacaatcc atccacaaat tacacaacac tgccagacga 841 aattatgaac cagaaattgt cgcccttcat gagaaaggga attacaggag actggaaaaa 901 tcactttaca gtagccctga atgaaaaatt tgataaacat tatgagcagc aaatgaagga 961 atctacactg aagtttcgaa ctgagatcta agaaggtctt tctttactta acatatctga 1021 tattaaagat ttcttttcat tattca // LOCUS HSU08112 1323 bp mRNA PRI 08-DEC-1994 DEFINITION Human alpha(1,3)fucosyltransferase mRNA, complete cds. ACCESSION U08112 NID g520463 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1323) AUTHORS Natsuka,S., Gersten,K.M., Zenita,K., Kannagi,R. and Lowe,J.B. TITLE Molecular cloning of a cDNA encoding a novel human leukocyte alpha-1,3-fucosyltransferase capable of synthesizing the sialyl Lewis x determinant [published erratum appears in J Biol Chem 1994 Aug 12;269(32):20806] JOURNAL J. Biol. Chem. 269 (24), 16789-16794 (1994) MEDLINE 94266898 REFERENCE 2 (bases 1 to 1323) AUTHORS Lowe,J.B. TITLE Direct Submission JOURNAL Submitted (29-MAR-1994) John B. Lowe, Pathology, Howard Hughes Medical Institute, Univ. of Michigan Medical School, 1150 West Medical Center Drive, Ann Arbor, MI 48109-0650, USA FEATURES Location/Qualifiers source 1..1323 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pCDM8-Fuc-TVII" /clone_lib="YT cell library described by Hatakeyama et al, J Immunol 134:1623-1630, 1985" /chromosome="9" /cell_line="YT (natural killer-like)" CDS 138..1163 /codon_start=1 /product="alpha(1,3)fucosyltransferase" /db_xref="PID:g520464" /translation="MNNAGHGPTRRLRGLGVLAGVALLAALWLLWLLGSAPRGTPAPQ PTITILVWHWPFTDQPPELPSDTCTRYGIARCHLSANRSLLASADAVVFHHRELQTRR SHLPLAQRPRGQPWVWASMESPSHTHGLSHLRGIFNWVLSYRRDSDIFVPYGRLEPHW ASPPLPAKSRVAAWVVSNFQERQLRARLYRQLAPHLRVDVFGRANGRPLCASCLVPTV AQYRFYLSFENSQHRDYITEKFWRNALVAGTVPVVLGPPRATYEAFVPADAFVHVDDF GSARELAAFLTGMNESRYQRFFAWRDSVRVRLFTDWRERFCAICDRYPHLPRSQVYED LEGWFQA" BASE COUNT 209 a 432 c 437 g 245 t ORIGIN 1 cttgggcaga gataaaagga gcacagttcc aggcggggct gagctagggc gtacgtgtga 61 tttcaggggc acctctggcg gctgccgtga tttgagaatc tcgggtctct tggctgactg 121 atcctgggag actgtggatg aataatgctg ggcacggccc cacccggagg ctgcgaggct 181 tgggggtcct ggccggggtg gctctgctcg ctgccctctg gctcctgtgg ctgctggggt 241 cagcccctcg gggtaccccg gcaccccagc ccacgatcac catccttgtc tggcactggc 301 ccttcactga ccagccccca gagctgccca gcgacacctg cacccgctac ggcatcgccc 361 gctgccacct gagtgccaac cgaagcctgc tggccagcgc cgacgccgtg gtcttccacc 421 accgcgagct gcagacccgg cggtcccacc tgcccctggc ccagcggccg cgagggcagc 481 cctgggtgtg ggcctccatg gagtctccta gccacaccca cggcctcagc cacctccgag 541 gcatcttcaa ctgggtgctg agctaccggc gcgactcgga catctttgtg ccctatggcc 601 gcctggagcc ccactgggcc tcgccaccgc tgccagccaa gagcagggtg gccgcctggg 661 tggtcagcaa cttccaggag cggcagctgc gtgccaggct gtaccggcag ctggcgcctc 721 atctgcgggt ggatgtcttt ggccgtgcca atggacggcc actgtgcgcc agctgcctgg 781 tgcccaccgt ggcccagtac cgcttctacc tgtcctttga gaactctcag caccgcgact 841 acattacgga gaaattctgg cgcaacgcac tggtggctgg cactgtgcca gtggtgctgg 901 ggcccccacg ggccacctat gaggccttcg tgccggctga cgccttcgtg catgtggatg 961 actttggctc agcccgagag ctggcggctt tcctcactgg catgaatgag agccgatacc 1021 aacgcttctt tgcctggcgt gacagcgtcc gcgtgcgact gttcaccgac tggcgggaac 1081 gtttctgtgc catctgtgac cgctacccac acctaccccg cagccaagtc tatgaggacc 1141 ttgagggttg gtttcaggcc tgagatccgc tggccggggg aggtgggtgt gggtggaagg 1201 gctgggtgtc gaaatcaaac caccaggcat ccggccctta ccggcaagca gcgggctaac 1261 gggaggctgg gcacagaggt caggaagcag gggtgggggg tgcaggtggg cactggagca 1321 tgc // LOCUS HSU08191 5281 bp mRNA PRI 02-MAY-1994 DEFINITION Human R kappa B mRNA, complete cds. ACCESSION U08191 S79520 NID g476273 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5281) AUTHORS Adams,B.S., Leung,K.Y., Hanley,E.W. and Nabel,G.J. TITLE Cloning of R kappa B, a novel DNA-binding protein that recognizes the interleukin-2 receptor alpha chain kappa B site JOURNAL New Biol. 3, 1063-1073 (1991) MEDLINE 92135142 REFERENCE 2 (bases 1 to 5281) AUTHORS K. Cheek. TITLE Direct Submission JOURNAL Submitted (25-JAN-1994) Kevin Cheek, University of Michigan Medical Center, 1150 W. Medical Center Drive, Ann Arbor, MI 48109-0652, USA FEATURES Location/Qualifiers source 1..5281 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 2221..5217 /note="interleukin-2 receptor alpha chain kappa B binding protein" /codon_start=1 /product="R kappa B" /db_xref="PID:g476274" /translation="MTRVNAGRKGSLAALYDLAVLKKKVKEKEEKKKKKIKTIKSEAE DLAEPLSSTEGVAPLSQAPSPLAIPAIKEEPLEDLKPCLGINEISSSFFSLLLEILLL ESQASLPMLEERVLDWQSSPASSLNSWFSAAPNWAELVLPALQYLAGESRAVPSSFSP FVEFKEKTQQWKLLGQSQDNEKELAALFQLWLETKDQAFCKQENEDSSDATTPVPRVR TDYVVRPSTGEEKRVFQEQERYRYSQPHKAFTFRMHGFESVVGPVKGVFDKETSLNKA REHSLLRSDRPAYVTILSLVRDAAARLPNGEGTRAEICELLKDSQFLAPDVTSTQVNT VVSGALDRLHYEKDPCVKYDIGRKLWIYLHRDRSEEEFERIHQAQAAAAKARKALQQK PKPPSKVKSSSKESSIKVLSSGPSEQSQMSLSDSSMPPTPVTPVTPTTPALPAIPISP PPVSAVNKSGPSTVSEPAKSSSGVLLVSSPTMPHLGTMLSPASSQTAPSSQAAARVVS HSGSAGLSQVRVVAQPSLPAVPQQSGGPAQTLPQMPAGPQIRVPATATQTKVVPQTVM ATVPVKAQTTAATVQRPGPGQTGLTVTSLPATASPVSKPATSSPGTSAPSASTAAVIQ NVTGQNIIKQVAITGQLGVKPQTGNSIPLTATNFRIQGKDVLRLPPSSITTDAKGQTV LRITPDMMATLAKSQVTTVKLTQDLFGTGGNTTGKGISATLHVTSNPVHAADSPAKAS SASAPSSTPTGTTVVKVTPDLKPTEASSSAFRLMPALGVSVADQKGKSTVASSEAKPA ATIRIVQGLGVMPPKAGQTITVATHAKQGASVASGSGTVHTSAVSLPSMNAAVSKTVA VASGAASTPISISTGAPTVRQVPVSTTVVSTSQAGKLPTRITVPLSVISQPMKGKSVV TAPIIKGNLGANLSGLGRNIILTTMPAGTKLIAGNKPVSFLTAQQLQQLQQQGQATQV RIQTVPASISNREQLLAPPKQSPLLL" BASE COUNT 1272 a 1508 c 1336 g 1165 t ORIGIN 1 ccctagccct gttttgtgag aagttccata cataatgctc ctccacccaa tacataccct 61 cctgagagaa caggtaacac cactcgaaca aatggaggat caaatggaaa gttatcctta 121 aaagagaagt taagcaaaat atattctatg ccttcttttt cttttaagat ctgaagatca 181 ctgtgcaaag gactatcagg gtcaaccttc tgcagttaac atgccagtca tataaactgt 241 catttatgag ttccactgta taaatccctg ttttataact ctgtgatctg tatatgtccc 301 tgagctcttt cataagtcta tctgaagctt gcactgaccc agacactgca ccatttaaat 361 ggtcttgcct ttgagtcttc ctaattttct ctaatattgc caaattttct ttttcaattc 421 cttcatcctc tgactttttc ccactaatag gctcttcttc cttcatctca tagtgatcta 481 agtcttctat atcttcagcc atctcttctt cttcttcctc ttcttctgaa gtcacttctt 541 ctgttgtccc attctgaccc gtgggtagtg gttgatctag catctcaaca tccaggtgct 601 taggaaggtt atataaactg cagagttcac atatcaacca cttcaattgc tgacgaagca 661 aattgttgtt cttagtatct tctagacgtt ccagaactga tgtcagattt gggtcttcag 721 aatccacaaa ccatatcggt gaagaagatg gataggattc cgtgatgttg cagtggagcg 781 tgagtggcgg cggcacgagt gcgggctgcc ctgctgcggc accaggaact ggcagtgcag 841 ctcgtccagc ttccaactga cgatgcggaa tcgctcgtgg ttcttgtcga agatggacgc 901 caggaacttc agctcggcct tgagccctga cacggacatc ttcccctcat ctccggcggg 961 aggggcgcgg aaggggagcc gggcgcggaa ggggagccgg gcccggagcc gccgtcacgg 1021 ccgcgaccgc cccgcgggcc ggcctgggcc gcgctctccg cctcgtcgag cgctgctgga 1081 aaatggcgag ggggcgcgga agcctcggcg tctgggagcc cgcggccgga gaagggctgc 1141 gggttagggg gccggcgccc gcggttcagg attccagaat tggaaataac gggagggagg 1201 acctggtcca gcttcccttc ctcaaataag gaaattgaca cctggcgtga gaaggggttt 1261 tgccatgttc gctaggctgg tctcaaactc atggattcaa ggggactgcc cgcctggacc 1321 tcccaaagta ctgagattag tacctgtgga gaagaaacaa tggattcctt agaccatatg 1381 ctgacagatc ctctggaact tggtccgtgt ggagatggcc atggcacgcg catcatggag 1441 gattgcctcc tgggaggcac cagagttagt ctgcccgagg accttctgga ggatcctgag 1501 atcttctttg atgttgtcag cctctcaaca tggcaggaag tgttaagtga ttctcaacgt 1561 gaacacctcc agcagtttct gccccagttt cctgaagaca gtgctgagca gcagaatgaa 1621 ctcatcttag ccttgttcag tggggagaac ttccgctttg gaaaccctct gcacattgcc 1681 cagaagcttt tccgagacgg acactttaac cccgaggtgg tcaagtaccg gcagttatgc 1741 ttcaagtcac agtacaagcg ctacctcaac tcccagcagc agtatttcca tcggctgctg 1801 aagcaaattc ttgcttcccg gagtgatctg ctggagatgg cccggcggag tggccccgcc 1861 cttcccttcc ggcagaaacg cccttcacca tcccgcacac ctgaggagcg ggagtggcgg 1921 acccagcagc gctacttgaa ggtcttaagg gaagtgaaag aggagtgtgg tgacacagcc 1981 ctgtcatctg atgaagagga tctcagctca tggcttccga gctctccagc acgttctcct 2041 agtcctgcgg tgcccctgcg ggtggtgccc acactttcaa ccacggatat gaaaactgca 2101 gataaagtag aactggggga cagtgacctg aagataatgt taaagaagca ccacgagaag 2161 cggaaacatc agccagatca cccggacctt ttgacagggg acctgactct caatgacatc 2221 atgactcgag taaatgctgg caggaagggc tctctggcag ccttatatga cttggctgtc 2281 cttaaaaaaa aggttaagga aaaagaggaa aagaagaaga agaaaataaa aacgatcaaa 2341 tcagaggcag aggacctggc cgagccgcta agcagtactg aaggggtcgc acctctctca 2401 caggccccct ctccgctggc aattcctgct atcaaggaag agccccttga agacctcaag 2461 ccttgccttg gaatcaatga aatatcttcc agcttcttct ctcttctatt agagatcttg 2521 ctgctggaga gtcaggctag ccttcctatg ctagaggagc gagttttgga ttggcagtca 2581 tcgccagcca gctccctcaa cagctggttc tctgcggccc ccaactgggc tgagttggta 2641 ctaccagccc tgcagtatct tgctggagaa agtcgagctg ttccttccag tttctctcca 2701 tttgttgaat tcaaagagaa aacccagcag tggaagttgc ttggccaatc ccaagataat 2761 gaaaaggaat tagctgccct cttccagcta tggctagaga ccaaagatca ggccttctgt 2821 aagcaagaaa atgaagacag ctcagatgcc acaacacctg tccctcgggt aagaactgac 2881 tatgtggtgc gtcccagcac gggggaggag aaacgggttt ttcaggagca ggagcgttac 2941 aggtatagcc aaccccataa ggcgttcacc tttcgcatgc acggctttga gtctgtggtg 3001 gggccagtga agggcgtgtt tgacaaggag acctcgctca acaaggctcg ggagcactcc 3061 ctgctgcgct ccgaccggcc tgcctacgtc accattctgt ctcttgttcg ggacgctgcg 3121 gctcgactgc ctaatggaga aggcacacgg gcagagatct gtgaactgct taaggactcc 3181 cagtttcttg caccagatgt caccagcact caggtaaata cagtagtgag tggtgcactg 3241 gatcggctac attacgaaaa agatccctgt gtgaaatacg acattggacg aaagctgtgg 3301 atctacctgc atcgtgaccg gagtgaagaa gagtttgagc ggattcacca agcacaagca 3361 gctgcagcta aagccagaaa agctcttcag caaaaaccca agcccccatc caaggtgaag 3421 tccagtagca aggagagctc cataaaggtc cttagcagtg gcccttctga gcagagccag 3481 atgagcctca gtgactccag tatgccaccc accccagtca cacctgtaac ccccaccaca 3541 ccagcattgc ccgccattcc catctcccct ccacctgtat cggcagtgaa caaaagcggc 3601 ccttccacag tctcagaacc agctaagtct agctcgggtg ttcttctggt gtcttcacca 3661 acaatgccac atctgggaac aatgctttcc ccagcttcca gccagactgc acccagttct 3721 caggctgccg cccgggtcgt gagccactct ggctctgctg gactgtctca ggtgcgagtg 3781 gtggcccagc ctagccttcc tgctgttccc cagcagtcgg gagggccggc acagacattg 3841 ccacagatgc cagcaggacc gcagatccgg gttccagcca ctgccacaca gaccaaagta 3901 gtgccccaga cagtaatggc cactgtgccc gtcaaagcgc agactacggc agccactgtg 3961 cagcggcctg gacccgggca gacagggctc acggtgacaa gtctccctgc cacagccagc 4021 cctgtgagta agccagccac gagttctcct gggacctctg ctcccagtgc ctccacggct 4081 gccgtcattc aaaatgtcac aggacagaac atcatcaagc aggtggcaat cactgggcag 4141 cttggtgtga agccccaaac aggcaacagc attccactca cagccactaa cttccgcatc 4201 cagggtaagg atgtattgcg tctgccgccc tcttccatca ccacagatgc caagggccag 4261 acggttctgc gaatcactcc ggacatgatg gccacattgg ccaagtccca ggttaccaca 4321 gtcaaattga cccaggacct cttcgggaca ggaggcaaca ctacaggcaa aggcatctct 4381 gccaccttac acgtcacttc caatccagta catgcagctg atagccctgc caaggccagt 4441 tcagccagtg ccccttcatc cactccaaca ggtaccactg tggtcaaagt gactcctgac 4501 ctcaagccaa cagaagcctc aagttcggct tttcgcttga tgccagctct tggcgtgagt 4561 gtggctgacc agaagggaaa aagcacagtg gcctcttcag aagcaaaacc agctgccacg 4621 atccgcatcg tgcagggact gggagtgatg cctcccaaag caggccagac catcaccgtt 4681 gcaacccacg ccaagcaagg ggcctcggtg gccagtgggt ctggaactgt ccatacttca 4741 gcggtgtcct tacccagtat gaatgctgct gtgtccaaga ctgtagctgt ggcttctggg 4801 gctgcaagca cccccatcag catcagcaca ggagccccca ccgtgcggca ggtccctgtc 4861 agcaccacgg ttgtgtccac gtcccaggct gggaagttgc ctacacggat cacagttccc 4921 ctctctgtga tcagccagcc aatgaagggc aagagcgtgg tcacagcccc catcatcaaa 4981 ggcaaccttg gagccaacct cagtgggttg ggccgcaaca tcatcctcac aactatgcca 5041 gcaggcacta agctcattgc tggcaataag cctgttagtt tcctcactgc tcagcagttg 5101 cagcagcttc agcagcaagg tcaggccaca caggtgcgca tccagactgt ccctgcatcc 5161 atctccaaca gggaacagct tctggctcct ccaaagcagt ctccactgtt gttgtgacta 5221 cagctccgtc tcctaaacag gcacctgagc aacaatgatt atgagagagg atgggcttcc 5281 t // LOCUS HSU08316 2260 bp mRNA PRI 29-NOV-1995 DEFINITION Human insulin-stimulated protein kinase 1 (ISPK-1) mRNA, complete cds. ACCESSION U08316 NID g475587 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2260) AUTHORS Bjorbaek,C., Vik,T.A., Echwald,S.M., Webb,G.C., Wang,J.P., Yang,P.Y., Vestergaard,H., Richmond,K., Hansen,T., Erikson,R.L., Gabor-Miklos,G.L., Cohen,P.T. and Pedersen,O. TITLE Cloning of a human insulin-stimulated protein kinase (ISPK-1) gene and analysis of coding regions and mRNA levels of the ISPK-1 and the protein phosphatase-1 genes in muscle from NIDDM patients JOURNAL Diabetes 44 (1), 90-97 (1995) MEDLINE 95113220 REFERENCE 2 (bases 1 to 2260) AUTHORS Vik,T.A. TITLE Direct Submission JOURNAL Submitted (04-APR-1994) Terry A. Vik, Pediatrics, Indiana University School of Medicine, Riley Hospital for Children, 702 Barnhill Drive, Indianapolis, IN 46202, USA FEATURES Location/Qualifiers source 1..2260 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="MOLT 4 T-cell leukemia and human placenta cDNA libraries" /chromosome="X" /map="Xp21.3-22.1" gene 1..2223 /gene="ISPK-1" CDS 1..2223 /gene="ISPK-1" /note="ribosomal protein S6 kinase" /codon_start=1 /product="insulin-stimulated protein kinase 1" /db_xref="PID:g475588" /translation="MPLAQLADPWQKMAVESPSDSAENGQQIMDEPMGEEEINPQTEE VSIKEIAITHHVKEGHEKADPSQFELLKVLGQGSFGKVFLVKKISGSDARQLYAMKVL KKATLKVRDRVRTKMERDILVEVNHPFIVKLHYAFQTEGKLYLILDFLRGGDLFTRLS KEVMFTEEDVKFYLAELALALDHLHSLGIIYRDLKPENILLDEEGHIKLTDFGLSKES IDHEKKAYSFCGTVEYMAPEVVNRRGHTQSADWWSFGVLMFEMLTGTLPFQGKDRKET MTMILKAKLGMPQFLSPEAQSLLRMLFKRNPANRLGAGPDGVEEIKRHSFFSTIDWNK LYRREIHPPFKPATGRPEDTFYFDPEFTAKTPKDSPGIPPSANAHQLFRGFSFVAITS DDESQAMQTVGVHSIVQQLHRNSIQFTDGYEVKEDIGVGSYSVCKRCIHKATNMEFAV KIIDKSKRDPTEEIEILLRYGQHPNIITLKDVYDDGKYVYVVTELMKGGELLDKILRQ KFFSEREASAVLFTITKTVEYLHAQGVVHRDLKPSNILYVDESGNPESIRICDFGFAK QLRAENGLLMTPCYTANFVAPEVLKRQGYDAACDIWSLGVLLYTMLTGYTPFANGPDD TPEEILARIGSGKFSLSGGYWNSVSDTAKDLVSKMLHVDPHQRLTAALVLRHPWIVHW DQLPQYQLNRQDAPHLVKGAMAATYSALNRNQSPVLEPVGRSTLAQRRGIKKITSTAL " BASE COUNT 704 a 417 c 510 g 629 t ORIGIN 1 atgccgctgg cgcagctggc ggacccgtgg cagaagatgg ctgtggagag cccgtccgac 61 agcgctgaga atggacagca aattatggat gaacctatgg gagaggagga gattaaccca 121 caaactgaag aagtcagtat caaagaaatt gcaatcacac atcatgtaaa ggaaggacat 181 gaaaaggcag atccttccca gtttgaactt ttaaaagtat tagggcaggg atcatttgga 241 aaggttttct tagttaaaaa aatctcaggc tctgatgcta ggcagcttta tgccatgaag 301 gtattgaaga aggccacact gaaagttcga gaccgagttc ggacaaaaat ggaacgtgat 361 atcttggtag aggttaatca tccttttatt gtcaagttgc attatgcttt tcaaactgaa 421 gggaagttgt atcttatttt ggattttctc aggggaggag atttgtttac acgcttatcc 481 aaagaggtga tgttcacaga agaagatgtc aaattctact tggctgaact tgcacttgct 541 ttagaccatc tacatagcct gggaataatt tatagagact taaaaccaga aaatatactt 601 cttgatgaag aaggtcacat caagttaaca gatttcggcc taagtaaaga gtctattgac 661 catgaaaaga aggcatattc tttttgtgga actgtggagt atatggctcc agaagtagtt 721 aatcgtcgag gtcatactca gagtgctgac tggtggtctt ttggtgtgtt aatgtttgaa 781 atgcttactg gtacactccc tttccaagga aaagatcgaa aagaaacaat gactatgatt 841 cttaaagcca aacttggaat gccacagttt ttgagtcctg aagcgcagag tcttttacga 901 atgcttttca agcgaaatcc tgcaaacaga ttaggtgcag gaccagatgg agttgaagaa 961 attaaaagac attcattttt ctcaacgata gactggaata aactgtatag aagagaaatt 1021 catccgccat ttaaacctgc aacgggcagg cctgaagata cattctattt tgatcctgag 1081 tttactgcaa aaactcccaa agattcacct ggcattccac ctagtgctaa tgcacatcag 1141 ctttttcggg ggtttagttt tgttgctatt acctcagatg atgaaagcca agctatgcag 1201 acagttggtg tacattcaat tgttcagcag ttacacagga acagtattca gtttactgat 1261 ggatatgaag taaaagaaga tattggagtt ggctcctact ctgtttgcaa gagatgtata 1321 cataaagcta caaacatgga gtttgcagtg aagattattg ataaaagcaa gagagaccca 1381 acagaagaaa ttgaaattct tcttcgttat ggacagcatc caaacattat cactctaaag 1441 gatgtatatg atgatggaaa gtatgtgtat gtagtaacag aacttatgaa aggaggtgaa 1501 ttgctggata aaattcttag acaaaaattt ttctctgaac gagaggccag tgctgtcctg 1561 ttcactataa ctaaaaccgt tgaatatctt cacgcacaag gggtggttca tcgagacttg 1621 aaacctagca acattcttta tgtggatgaa tctggtaatc cggaatctat tcgaatttgt 1681 gattttggct ttgcaaaaca gctgagagcg gaaaatggtc ttctcatgac tccttgttac 1741 actgcaaatt ttgttgcacc agaggtttta aaaagacaag gctatgatgc tgcttgtgat 1801 atatggagtc ttggtgtcct actctataca atgcttaccg gttacactcc atttgcaaat 1861 ggtcctgatg atacaccaga ggaaatattg gcacgaatag gtagcggaaa attctcactc 1921 agtggtggtt actggaattc tgtttcagac acagcaaagg acctggtgtc aaagatgctt 1981 catgtagacc ctcatcagag actgactgct gctcttgtgc tcagacatcc ttggatcgtc 2041 cactgggacc aactgccaca ataccaacta aacagacagg atgcaccaca tctagtaaag 2101 ggtgccatgg cagctacata ttctgctttg aaccgtaatc agtcaccagt tttggaacca 2161 gtaggccgct ctactcttgc tcagcggaga ggtattaaaa aaatcacctc aacagccctg 2221 tgaagtgacc tcagtgagat atttggatcc atggtgtaaa // LOCUS HSU08336 963 bp mRNA PRI 22-JUL-1994 DEFINITION Human basic helix-loop-helix transcription factor mRNA, complete cds. ACCESSION U08336 NID g488286 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 963) AUTHORS Quertermous,E., Hidai,H., Blanar,M.A. and Quertermous,T. TITLE Cloning and characterization of a basic helix-loop-helix protein expressed in early mesoderm and the developing somites JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91, 7066-7070 (1994) MEDLINE 94316638 REFERENCE 2 (bases 1 to 963) AUTHORS Quertermous,T. TITLE Direct Submission JOURNAL Submitted (05-APR-1994) Thomas Quertermous, Division of Cardiology, Vanderbilt University Medical School, 21st and Garland Avenues, Nashville, TN 37232-2170, USA FEATURES Location/Qualifiers source 1..963 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="bHLH-EC2-3" /cell_type="endothelial cell" /tissue_type="umbilical vein" CDS 35..631 /codon_start=1 /product="basic helix-loop-helix transcription factor" /db_xref="PID:g514278" /translation="MAFALLRPVGAHVLYPDVRLLSEDEENRSESDASDQSFGCCEGP EAARRGPGPGGGRRAGGGGGAGPVVVVRQRQAANARERDRTQSVNTAFTALRTLIPTE PVDRKLSKIETLRLASSYIAHLANVLLLGDSADDGQPCFRAAGSAKGAVPAAPTRRQP RSICTFCLSNQRKGGGRRDLGGSCLKVRGVAPLRGPRR" misc_feature 254..418 /function="DNA binding to the portion of the protein encoded by these nucleotides; protein-protein interaction" BASE COUNT 164 a 295 c 363 g 141 t ORIGIN 1 aattccggcg cacggaggga cgcggccggc gcccatggcg ttcgcgctgc tgcggcccgt 61 cggcgcgcac gtgctgtacc cggacgtgcg gctgctgagc gaggacgagg agaaccgcag 121 cgagagcgac gcgtcggacc agtcgttcgg ctgctgcgag ggcccggagg cggcgcggcg 181 cggcccgggc cccgggggcg ggcggcgggc gggcggcggc ggcggcgcgg gccccgtggt 241 ggtggtgcga cagcggcagg cggccaacgc gcgggagcgg gaccgcactc agagcgtgaa 301 cacggccttc acggcgctgc gcacgctcat ccccaccgag ccggtggacc gcaagctgtc 361 caagatcgag acgctgcgcc tggcgtccag ctacatcgcg cacctggcca acgtgctgct 421 gctgggcgac tcggccgacg acgggcagcc gtgcttccgt gccgcgggca gtgccaaggg 481 cgccgtcccc gccgcgccga cgcgccgcca gccgcgctcc atctgcacct tctgcctcag 541 caaccagcgc aaggggggtg gccgtcgtga cctggggggc agctgcttga aggtgagggg 601 ggtggccccc cttcgagggc cacggagatg agcctggacc ctggagaagg aggccaggag 661 ccagccactg gctggacagg gaagaagacc ccaggagcca agcccacccc ttctttgtgt 721 agggaccggg ggaccatggc ctgttccggg acactctggg cagggccctc gggacatctc 781 cacccgatcc tggagagctg tgaggatcca ttcagcctgc cagctctggc tggtcagaga 841 caaggcagaa cttttggaaa aacaaagact gttggtgaca gggtgtgtgt gtatctgtgc 901 gtgagtgtga gtgtgtgtga gagagaattg gtgagtttta aaataaaagc tatttttaaa 961 taa // LOCUS HSU08377 3225 bp mRNA PRI 08-JUL-1994 DEFINITION Human homolog of Drosophila splicing regulator suppressor-of-white-apricot mRNA, complete cds. ACCESSION U08377 NID g508230 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3225) AUTHORS Denhez,F. and Lafyatis,R. TITLE Conservation of regulated alternative splicing and identification of functional domains in vertebrate homologs to the Drosophila splicing regulator, suppressor-of-white-apricot JOURNAL J. Biol. Chem. 269, 16170-16179 (1994) MEDLINE 94266805 REFERENCE 2 (bases 1 to 3225) AUTHORS Lafyatis,R. TITLE Direct Submission JOURNAL Submitted (05-APR-1994) Robert Lafyatis, Rheumatology and Immunology, University of North Carolina School of Medicine, 932 Faculty Laboratory Office Building, Chapel Hill, NC 27599-7280, USA FEATURES Location/Qualifiers source 1..3225 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" CDS 121..2976 /note="similar to the Drosophila splicing regulator, suppressor-of-white-apricot: Swiss-Prot Accession Number P12297" /codon_start=1 /db_xref="PID:g508231" /translation="MYGASGGRAKPERKSGAKEEAGPGGAGGGGSRVELLVFGYACKL FRDDERAQAQEQGQHLIPWMGDHKILIDRYDGRGHLHDLSEYDAEYSTWNRDYQLSEE EARIEALCDEERYLALHTDLLEEEARQEEEYKRFSEALAEDGSYNAVGFTYGSDYYDP SEPTEEEEPSKQREKNEAENLEENEEPFVAPLGLSVPSDVELPPTAKMHAIIERTASF VCRQGAQFEIMLKAKQAPNSQFDFLRFDHYLNPYYKFIQKAMKEGRYTVLAENKSDEK KKSGVSSDNEDDDDEEDGNYLHPSLFASKKCNRLEELMKPLKVVDPDHPLAALVRKAQ ADSSTPTPHNADGAPVQPSQVEYTADSTVAAMYYSYYMLPDGTYCLAPPPPGIDVTTY YSTLPAGVTVSNSPGVTTTAPPPPGTTPPPPPTTAETSSGATSTTTTTSALAPVAAII PPPPDVQPVIDKLAEYVARNGLKFETSVRAKNDQRFEFLQPWYQYNAYYEFKKQFFLQ KEGGDSMQAVSAPEEAPTDSAPEKPSDAGEDGAPEDAAEVGARAGSGGKKEASSSKTV PDGKLVKASFAPISFAIKAKENDLLPLEKNRVKLDDDSDDDEESKEGQESSSSAANTN PAVAPPCVVVEEKKPQLTQEELEAKQAKQKLEDRLAAAAREKLAQASKESKEKQLQAE RKRKAALFLQTLKNPLPEAEAGKIEESPFSVEESSTTPCPLLTGGRPLPTLEVKPPDR PSSKSKDPPREEEKEKKKKKHKKRSRTRSRSPKYHSSSKSRSRSHSKAKHSLPSAYRT VRRSRSRSRSPRRRAHSPERRREERSVPTAYRVSRSPGASRKRTRSRSPHEKKKKRRS RSRTKSKARSQSVSPSKQAAPRPAAPAAHSAHSASVSPVESRGSSQERSRGVSQEKEA QISSAIVSSVQSKITQDLMAKVRAMLAASKNLQTSAS" BASE COUNT 856 a 882 c 910 g 577 t ORIGIN 1 ggcggtgttg aggttgggtc cgggatgcgg ggtctttgac tgaaggggta ggccaagtgg 61 aggtatcagg gacgtcgcgc ggcacagaag aggaccagcc tggaggccgg ggacgctgtc 121 atgtacggcg cgagcggggg ccgcgccaaa cccgagagga aaagcggcgc gaaggaggag 181 gccgggccag gcggtgccgg cggtgggggc agccgagtgg agctcttggt tttcggctat 241 gcctgcaagc tgttccggga cgacgagcgg gcccaggctc aggaacaggg acagcacctc 301 atcccctgga tgggggacca caagatcctc atcgacagat atgatggacg tggtcacctg 361 catgaccttt ctgagtacga tgctgagtat tccacgtgga acagagatta tcagctgtct 421 gaagaggagg cgcgaataga ggccctgtgt gatgaagaga ggtatttagc cttgcatacg 481 gacttgcttg aggaggaggc aaggcaagag gaagaataca agcgattcag tgaagcacta 541 gcagaggatg ggagctacaa tgccgtgggg ttcacttacg gtagcgacta ttacgacccg 601 tcagagccga cggaggagga ggagccttcc aaacagagag aaaaaaatga ggccgaaaat 661 ttagaggaaa atgaagagcc cttcgttgcc cccttaggat tgagcgtccc gtctgacgtg 721 gagttgccac caaccgctaa aatgcacgcc atcatcgagc gcacggccag cttcgtgtgc 781 aggcagggag cacagtttga gatcatgctg aaggccaagc aggccccgaa ctcccagttt 841 gactttctgc gcttcgacca ctacctcaac ccctactata agttcatcca gaaagccatg 901 aaagagggac gctacactgt cctggcagaa aacaaaagtg acgagaaaaa aaaatcagga 961 gtcagctctg acaatgaaga tgatgatgat gaagaagatg ggaattacct tcatccctct 1021 ctctttgcct ccaagaagtg taaccgcctt gaagagctga tgaagccctt gaaggtagtg 1081 gacccagatc atcccctcgc agcacttgtt cgtaaggcac aggctgacag ttccactccc 1141 accccacaca acgcagacgg tgcgcctgtg cagccctccc aggtggagta cacggcagac 1201 tcgaccgtgg cagccatgta ttacagctac tacatgctac cggacggcac ttactgcctg 1261 gcgccgcccc ctcccggaat cgatgtgact acttactaca gcacccttcc tgctggcgtg 1321 accgtgtcta actcccctgg agtgacgacc accgccccac cacctcctgg gaccacacca 1381 ccaccgcccc caaccacagc agagactagc agcggggcca cctccacaac caccaccaca 1441 agtgcacttg cccccgtggc cgccatcatc cccccgcccc ccgacgtcca gcccgtgatt 1501 gacaagctgg ccgagtatgt cgccaggaac ggcctgaagt tcgagaccag tgttcgtgcc 1561 aagaatgatc aaagatttga gttcctgcag ccgtggtacc agtataatgc ttattatgag 1621 tttaagaagc agttcttcct ccagaaagaa gggggcgata gcatgcaggc tgtgtctgca 1681 ccagaagagg ctcccacaga ctctgctccc gagaagccaa gtgatgctgg ggaggatggc 1741 gcgcctgaag acgcagccga ggtgggagca cgggcaggct caggcgggaa gaaggaggca 1801 tcgtccagta agaccgtccc ggacgggaag ctggtgaaag cttcctttgc tccaataagc 1861 tttgcaatca aggccaaaga aaatgatctg cttcccctgg aaaaaaatcg tgttaagcta 1921 gatgatgaca gtgatgatga tgaagaaagc aaagaaggcc aagaaagttc tagtagtgct 1981 gcaaacacta acccagcagt tgccccaccc tgtgtagttg ttgaggagaa gaagcctcaa 2041 cttacccagg aggagctaga agcaaagcaa gcaaagcaaa agctggaaga tcgcctcgca 2101 gctgctgccc gggaaaagct cgcccaggcg tctaaggagt caaaagagaa acagcttcaa 2161 gcagaacgta aaaggaaagc ggcgttattt ttacagaccc tcaaaaatcc tctgccggaa 2221 gcagaagctg ggaaaattga ggagagtcct ttcagtgtcg aggaatccag cactacgccc 2281 tgccctctac tgactggagg taggcctctg cctactttag aagttaaacc acccgatagg 2341 ccttcgagca aaagcaaaga tccaccgaga gaagaagaga aagaaaagaa aaagaaaaag 2401 cacaaaaaaa gatctcgaac aagatcacgt tctcccaagt accattcgtc atccaagtcc 2461 aggtctagat cacactcaaa agcaaagcat tctcttccca gtgcctatcg gacagtgcgg 2521 cggtcgaggt cccgctcccg gtcccctcgg aggagagccc actcccctga gagacggagg 2581 gaagagagga gtgtgcccac tgcctaccgc gtgagccgca gccctggggc cagcaggaag 2641 cggacccgct ccagaagtcc ccacgagaag aagaagaaga ggcggtcccg gtcgcggacc 2701 aagtccaagg ccaggtctca gtcggtgtca cccagcaagc aggcagcgcc ccggcccgcg 2761 gcccccgcgg cccactcggc gcactcagcc agcgtctccc ctgtggagag tcggggctcc 2821 agccaggagc gctccagggg agtctctcag gaaaaagaag cccagatctc ttcagcaatc 2881 gtttcttccg tgcagagcaa aatcactcag gatctcatgg ccaaagtcag agcgatgctt 2941 gcagcttcca aaaacctgca aaccagcgct tcctgagacg gggccagcgg aggcagagcc 3001 gggaggctgc gtgggcttct gggcaggctc acgcagacgc cggccacacc atccacctgg 3061 ccgcctccat ggacccttgg tggcttttgt aaattaattt ttgatgacat tttgagtttt 3121 aagatttctg accagcagtc tcttacctgt atatttgtaa atatatcatg tttctgtgaa 3181 aatgtattat gaaataaaat gggaggaaac accttttcta gctag // LOCUS HSU08815 2733 bp mRNA PRI 05-JUL-1994 DEFINITION Human splicesomal protein (SAP 61) mRNA, complete cds. ACCESSION U08815 NID g508722 KEYWORDS RNA binding; nuclear speckles; U2 snRNP. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2733) AUTHORS Chiara,M.D., Champion-Arnaud,P., Buvoli,M., Nadal-Ginard,B. and Reed,R.E. TITLE Specific protein-protein interactions between the essential mammalian splicesome-associated proteins SAP 61 and SAP 114 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91, 6403-6407 (1994) MEDLINE 94294390 REFERENCE 2 (bases 1 to 2733) AUTHORS Reed,R.E. TITLE Direct Submission JOURNAL Submitted (12-APR-1994) Robin E. Reed, Department of Cell Biology, Harvard Medical School, 45 Shattuck St., LHRRB, 501, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..2733 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="HeLa Lambda gt11" /cell_line="HeLa" gene 9..1514 /gene="SAP 61" CDS 9..1514 /gene="SAP 61" /note="similar to yeast PRP9, Swiss-Prot Accession Number P19736" /codon_start=1 /function="spliceosomal protein" /product="SAP 61" /db_xref="PID:g508723" /translation="METILEQQRRYHEEKERLMDVMAKEMLTKKSTLRDQINSDHRTR AMQDRYMEVSGNLRDLYDDKDGLRKEELNAISGPNEFAEFYNRLKQIKEFHRKHPNEI CVPMSVEFEELLKARENPSEEAQNLVEFTDEEGYGRYLDLHDCYLKYINLKASEKLDY ITYLSIFDQLFDIPKERKNAEYKRYLEMLLEYLQDYTDRVKPLQDQNELFGKIQAEFE KKWENGTFPGWPKETSSALTHAGAHLDLSAFSSWEELASLGLDRLKSALLALGLKCGG TLEERAQRLFSTKGKSLESLDTSLFAKNPKSKGTKRDTERNKDIAFLEAQIYEYVEIL GEQRHLTHENVQRKQARTGEEREEEEEEQISESESEDEENEIIYNPKNLPLGWDGKPI PYWLYKLHGLNINYNCEICGNYTYRGPKAFQRHFAEWRHAHGMRCLGIPNTAHFANVT QIEDAVSLWAKLKLQKASERWQPDTEEEYEDSSGNVVNKKTYEDLKRQGLL" misc_feature 1232..1301 /gene="SAP 61" /note="zinc finger like motif, amino acids 408-431" polyA_site 2733 /note="44 A residues" BASE COUNT 812 a 580 c 684 g 657 t ORIGIN 1 aagggaagat ggagacaata ctggagcagc agcggcgcta tcatgaggag aaggaacggc 61 tcatggacgt catggctaaa gagatgctca ccaagaagtc cacgctccgg gaccagatca 121 attctgatca ccgcactcgg gccatgcaag ataggtatat ggaggtcagt gggaacctga 181 gggatttgta tgatgataag gatggattac gaaaggagga gctcaatgcc atttcaggac 241 ccaatgagtt tgctgaattc tataatagac tcaagcaaat aaaggaattc caccggaagc 301 acccaaatga gatctgtgtg ccaatgtcag tggaatttga ggaactcctg aaggctcgag 361 agaatccaag tgaagaggca caaaacttgg tggagttcac agatgaggag ggatatggtc 421 gttatctcga tctccatgac tgttacctca agtacattaa cctgaaggca tctgagaagc 481 tggattatat cacatacctg tccatctttg accaattatt tgacattcct aaagaaagga 541 agaatgcaga gtataagaga tacctagaga tgctgcttga gtaccttcag gattacacag 601 atagagtgaa gcctctccaa gatcagaatg aactttttgg gaagattcag gctgagtttg 661 agaagaaatg ggagaatggg acctttcctg gatggccgaa agagacaagc agtgccctga 721 cccatgctgg agcccatctt gacctctctg cattctcctc ctgggaggag ttggcttctc 781 tgggtttgga cagattgaaa tctgctctct tagctttagg cttgaaatgt ggcgggaccc 841 tagaagagcg agcccagaga ctattcagta ccaaaggaaa gtccctggag tcacttgata 901 cctctttgtt tgccaaaaat cccaagtcaa agggcaccaa gcgagacact gaaaggaaca 961 aagacattgc ttttctagaa gcccagatct atgaatatgt agagattctc ggggaacagc 1021 gacatctcac tcatgaaaat gtacagcgca agcaagccag gacaggagaa gagcgagaag 1081 aagaggaaga agagcagatc agtgagagtg agagtgaaga tgaagagaac gagatcattt 1141 acaaccccaa aaacctgcca cttggctggg atggcaaacc tattccctac tggctgtata 1201 agcttcatgg cctaaatatc aactacaact gtgagatttg tggaaactac acctaccgag 1261 ggcccaaagc cttccagcga cactttgctg aatggcgtca tgctcatggc atgaggtgtt 1321 tgggcatccc aaatactgct cactttgcta atgtgacaca gattgaagat gctgtctcct 1381 tgtgggccaa actgaaattg cagaaggctt cagaacgatg gcagcctgac actgaggaag 1441 aatatgaaga ctcaagtggg aatgttgtga ataagaagac atacgaggat ctgaaaagac 1501 aaggactgct ctagtgttga gggatgtagc tcagcttttg ggctagccca ggcttcccta 1561 agatctgctt tttctatttc tcccaaccaa atcctcttaa agaccctttg ctatgtagtc 1621 tcatggtcta gcatgcatct tgtagaaaca aggcatgctg gcagattgca gggttgagat 1681 gtgttttatc tgttttatat tttaaaagat tctgccagaa aataaaacca gaccttgttc 1741 taaagcccag ggttatggac caactcagtg cttcaggtct taatgcctcc atacctcttc 1801 ctcaccaact ttactagtag ctgagattta atgggcacct attatgctac atatcatgtt 1861 aggtaaatct gacctgacct ctttccccac cctcctttgt tgctgcttcc ctgaatgagt 1921 attaccccag gatgaggtct gccatcagct tagttagcca ttgatgcaaa tactagggaa 1981 agactaggag gatgagccag ggttgctact aaggactaag tgtcgcacca aggtttgcct 2041 tttgtatttg cataaagaaa ggagttggag ctgggtgcag tggcttgtgc ctgtagtccc 2101 agctacttgg gaggctgagg caggagggtt gcttgagact agcctaggta acatagtgag 2161 accctgtctc attaaaaaaa aaaaaaaaag gcatggtggc acgcactgta gtcccagcta 2221 ctcaggagac tgaggctaga agatcctttg aacctaggag tttgagacca gcctgggcga 2281 tatagtgagg ccccatctca aaaaaaaaaa aaagcggggg gggggagttg ggctgtgttg 2341 gaatgggcct gcagcccaac aaacaaggga actaggaccg acagtgactt caccagcttg 2401 ctaggtcaga atgagagact ggtgggtctg tctacctgtt tcttctacaa gatccctatt 2461 tgactgtaaa agtagctaat actcacatgt tctccaatcc caggtagcca tggtagagtt 2521 gggtagagtt gagcagccgc cccaggatcc aaatgtggtg tctgaaatgg aaagaactaa 2581 ggcaaccagg aaggcactga tctgccttat aagcacagtc atctgaaagt caggcctgct 2641 gcaggacagg atcccccaga gaccccattt gcctctcaac actcagacct tcaactgttt 2701 tttaataaat ctacttttta aaaaaaaaaa ata // LOCUS HSU08854 2090 bp mRNA PRI 01-FEB-1995 DEFINITION Human UDP glucuronosyltransferase precursor (UGT2B15) mRNA, complete cds. ACCESSION U08854 NID g475758 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2090) AUTHORS Green,M.D., Oturu,E.M. and Tephly,T.R. TITLE Stable expression of a human liver UDP-glucuronosyltransferase (UGT2B15) with activity toward steroid and xenobiotic substrates JOURNAL Drug Metab. Dispos. 22 (5), 799-805 (1994) MEDLINE 95136867 REFERENCE 2 (bases 1 to 2090) AUTHORS Green,M.D. TITLE Direct Submission JOURNAL Submitted (13-APR-1994) Mitchell D. Green, Department of Pharmacology, The University of Iowa, Iowa City, IA 52242, USA FEATURES Location/Qualifiers source 1..2090 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HE8a" /clone_lib="Human liver UniZap cDNA library from Stratagene" /tissue_type="liver" sig_peptide 22..90 /gene="UGT2B15" CDS 22..1614 /gene="UGT2B15" /EC_number="2.4.1.17" /codon_start=1 /product="UDP glucuronosyltransferase precursor" /db_xref="PID:g475759" /translation="MSLKWTSVFLLIQLSCYFSSGSCGKVLVWPTEYSHWINMKTILE ELVQRGHEVTVLTSSASTLVNASKSSAIKLEVYPTSLTKNDLEDSLLKILDRWIYGVS KNTFWSYFSQLQELCWEYYDYSNKLCKDAVLNKKLMMKLQESKFDVILADALNPCGEL LAELFNIPFLYSLRFSVGYTFEKNGGGFLFPPSYVPVVMSELSDQMIFMERIKNMIHM LYFDFWFQIYDLKKWDQFYSEVLGRPTTLFETMGKAEMWLIRTYWDFEFPRPFLPNVD FVGGLHCKPAKPLPKEMEEFVQSSGENGIVVFSLGSMISNMSEESANMIASALAQIPQ KVLWRFDGKKPNTLGSNTRLYKWLPQNDLLGHPKTKAFITHGGTNGIYEAIYHGIPMV GIPLFADQHDNIAHMKAKGAALSVDIRTMSSRDLLNALKSVINDPVYKENVMKLSRIH HDQPMKPLDRAVFWIEFVMRHKGAKHLRVAAHNLTWIQYHSLDVIAFLLACVATVIFI ITKFCLFCFRKLAKTGKKKKRD" gene 22..1614 /gene="UGT2B15" mat_peptide 91..1611 /gene="UGT2B15" /EC_number="2.4.1.17" /product="UDP glucuronosyltransferase" BASE COUNT 649 a 384 c 435 g 622 t ORIGIN 1 ttcggcacga gtaagaccag gatgtctctg aaatggacgt cagtctttct gctgatacag 61 ctcagttgtt actttagctc tggaagctgt ggaaaggtgc tagtgtggcc cacagaatac 121 agccattgga taaatatgaa gacaatcctg gaagagcttg ttcagagggg tcatgaggtg 181 actgtgttga catcttcggc ttctactctt gtcaatgcca gtaaatcatc tgctattaaa 241 ttagaagttt atcctacatc tttaactaaa aatgatttgg aagattctct tctgaaaatt 301 ctcgatagat ggatatatgg tgtttcaaaa aatacatttt ggtcatattt ttcacaatta 361 caagaattgt gttgggaata ttatgactac agtaacaagc tctgtaaaga tgcagttttg 421 aataagaaac ttatgatgaa actacaagag tcaaagtttg atgtcattct ggcagatgcc 481 cttaatccct gtggtgagct actggctgaa ctatttaaca taccctttct gtacagtctt 541 cgattctctg ttggctacac atttgagaag aatggtggag gatttctgtt ccctccttcc 601 tatgtacctg ttgttatgtc agaattaagt gatcaaatga ttttcatgga gaggataaaa 661 aatatgatac atatgcttta ttttgacttt tggtttcaaa tttatgatct gaagaagtgg 721 gaccagtttt atagtgaagt tctaggaaga cccactacat tatttgagac aatggggaaa 781 gctgaaatgt ggctcattcg aacctattgg gattttgaat ttcctcgccc attcttacca 841 aatgttgatt ttgttggagg acttcactgt aaaccagcca aacccctgcc taaggaaatg 901 gaagagtttg tgcagagctc tggagaaaat ggtattgtgg tgttttctct ggggtcgatg 961 atcagtaaca tgtcagaaga aagtgccaac atgattgcat cagcccttgc ccagatccca 1021 caaaaggttc tatggagatt tgatggcaag aagccaaata cattaggttc caatactcga 1081 ctgtacaagt ggttacccca gaatgacctt cttggtcatc ccaaaaccaa agcttttata 1141 actcatggtg gaaccaatgg catctatgag gcgatctacc atgggatccc tatggtgggc 1201 attcccttgt ttgcggatca acatgataac attgctcaca tgaaagccaa gggagcagcc 1261 ctcagtgtgg acatcaggac catgtcaagt agagatttgc tcaatgcatt gaagtcagtc 1321 attaatgacc ctgtctataa agagaatgtc atgaaattat caagaattca tcatgaccaa 1381 ccaatgaagc ccctggatcg agcagtcttc tggattgagt ttgtcatgcg ccacaaagga 1441 gccaagcacc ttcgagtcgc agctcacaac ctcacctgga tccagtacca ctctttggat 1501 gtgatagcat tcctgctggc ctgcgtggca actgtgatat ttatcatcac aaaattttgc 1561 ctgttttgtt tccgaaagct tgccaaaaca ggaaagaaga agaaaagaga ttagttatat 1621 caaaagcctg aagtggaatg actgaaagat gggactcctc ctttatttca gcatggaggg 1681 ttttaaatgg aggatttcct ttttcctgtg acaaaacatc ttttcacaac ttaccttgtt 1741 aagacaaaat ttattttcca gggatttaat acgtacttta gttggaatta ttctatgtca 1801 atgattttta agctatgaaa aatacaatgg ggggaaggat agcatttgga gatataccta 1861 atgttaaatg acgagttact ggatgcagca cgcaacatgg cacatgtgta tacatatgta 1921 gctaaccctt cgttgtgcac atgtacccta aaacttaaag tataatttaa aaaaagcaaa 1981 aaaaaaaaat accaactctt ttttttaaac caggaaggaa aatgtgaaca tggaaacaac 2041 ttctagtatt ggatctgaaa ataaagtgtc atccaagcca taaaaaaaaa // LOCUS HSU08895 1404 bp mRNA PRI 25-NOV-1995 DEFINITION Human adhalin (DAG2) mRNA, complete cds. ACCESSION U08895 NID g511586 KEYWORDS alpha sarcoglycan; adhalin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1404) AUTHORS Roberds,S.L., Leturcq,F., Allamand,V., Piccolo,F., Jeanpierre,M., Anderson,R.D., Lim,L.E., Lee,J.C., Tome,F.M., Romero,N.B., Fardeau,M., Beckmann,J.S., Kaplan,J.-C. and Campbell,K.P. TITLE Missense mutations in the adhalin gene linked to autosomal recessive muscular dystrophy JOURNAL Cell 78 (4), 625-633 (1994) MEDLINE 94349366 REFERENCE 2 (bases 1 to 1404) AUTHORS Campbell,K.P. TITLE Direct Submission JOURNAL Submitted (14-APR-1994) Kevin P. Campbell, The University of Iowa, Howard Hughes Medical Institute, Iowa City, IA 52242, USA FEATURES Location/Qualifiers source 1..1404 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="H50-7" /tissue_type="skeletal muscle" /dev_stage="adult" sig_peptide 12..80 /gene="DAG2" CDS 12..1175 /gene="DAG2" /standard_name="50-kDa dystrophin-associated glycoprotein" /codon_start=1 /product="adhalin" /db_xref="PID:g511587" /translation="MAETLFWTPLLVVLLAGLGDTEAQQTTLHPLVGRVFVHTLDHET FLSLPEHVAVPPAVHITYHAHLQGHPDLPRWLRYTQRSPHHPGFLYGSATPEDRGLQV IEVTAYNRDSFDTTRQRLVLEIGDPEGPLLPYQAEFLVRSHDAEEVLPSTPASRFLSA LGGLWEPGELQLLNVTSALDRGGRVPLPIEGRKEGVYIKVGSASPFSTCLKMVASPDS HARCAQGQPPLLSCYDTLAPHFRVDWCNVTLVDKSVPEPADEVPTPGDGILEHDPFFC PPTEAPDRDFLVDALVTLLVPLLVALLLTLLLAYVMCCRREGRLKRDLATSDIQMVHH CTIHGNTEELRQMAASREVPRPLSTLPMFNVHTGERLPPRVDSAQVPLILDQH" gene 12..1175 /gene="DAG2" polyA_site 1404 BASE COUNT 245 a 486 c 391 g 282 t ORIGIN 1 gccgggcagc catggctgag acactcttct ggactcctct cctcgtggtt ctcctggcag 61 ggctggggga caccgaggcc cagcagacca cgctacaccc acttgtgggc cgtgtctttg 121 tgcacacctt ggaccatgag acgtttctga gccttcctga gcatgtcgct gtcccacccg 181 ctgtccacat cacctaccac gcccacctcc agggacaccc agacctgccc cggtggctcc 241 gctacaccca gcgcagcccc caccaccctg gcttcctcta cggctctgcc accccagaag 301 atcgtgggct ccaggtcatt gaggtcacag cctacaatcg ggacagcttt gataccactc 361 ggcagaggct ggtgctggag attggggacc cagaaggccc cctgctgcca taccaagccg 421 agttcctggt gcgcagccac gatgcggagg aggtgctgcc ctcaacacct gccagccgct 481 tcctctcagc cttgggggga ctctgggagc ccggagagct tcagctgctc aacgtcacct 541 ctgccttgga ccgtgggggc cgtgtccccc ttcccattga gggccgaaaa gaaggggtat 601 acattaaggt gggttctgcc tcaccttttt ctacttgcct gaagatggtg gcatcccccg 661 atagccacgc ccgctgtgcc cagggccagc ctccacttct gtcttgctac gacaccttgg 721 caccccactt ccgcgttgac tggtgcaatg tgaccctggt ggataagtca gtgccggagc 781 ctgcagatga ggtgcccacc ccaggtgatg ggatcctgga gcatgacccg ttcttctgcc 841 cacccactga ggccccagac cgtgacttct tggtggatgc tctggtcacc ctcctggtgc 901 ccctgctggt ggccctgctt ctcaccttgc tgctggccta tgtcatgtgc tgccggcggg 961 agggaaggct gaagagagac ctggctacct ccgacatcca gatggtccac cactgcacca 1021 tccacgggaa cacagaggag ctgcggcaga tggcggccag ccgcgaggtg ccccggccac 1081 tctccaccct gcccatgttc aatgtgcaca caggtgagcg gctgcctccc cgcgtggaca 1141 gcgcccaggt gcccctcatt ctggaccagc actgacagcc cagccagtgg ttccaggtcc 1201 agccctgact tcatcctccc ttctctgtcc acaccacgag tggcacatcc cacctgctga 1261 ttccagctcc tggccctcct ggaacccagg ctctaaacaa gcagggagag ggggtggggt 1321 ggggtgagag tgtgtggagt aaggacattc agaataaata tctgctgctc tgctcaccaa 1381 ttgctgctgg cagcctctcc cgtc // LOCUS HSU09002 6137 bp mRNA PRI 24-JAN-1995 DEFINITION Human N-methyl-D-aspartate receptor modulatory subunit 2A (hNR2A) mRNA, complete cds. ACCESSION U09002 NID g558748 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6137) AUTHORS Foldes,R.L., Adams,S.L., Fantaske,R.P. and Kamboj,R.K. TITLE Human N-methyl-D-aspartate receptor modulatory subunit hNR2A: cloning and sequencing of the cDNA and primary structure of the protein JOURNAL Biochim. Biophys. Acta 1223 (1), 155-159 (1994) MEDLINE 94339179 REFERENCE 2 (bases 1 to 6137) AUTHORS Foldes,R.L. TITLE Direct Submission JOURNAL Submitted (18-APR-1994) Foldes R.L., Allelix Biopharmaceuticals Inc., 6850 Goreway Drive, Mississauga, Ontario, Canada, L4V 1V7 FEATURES Location/Qualifiers source 1..6137 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="Clones HH23A, HH35, HH36, FB4A, FB13, FB15B" /clone_lib="Stratagene Libraries 936205 and 936206" /sex="female" 5'UTR 1..155 /evidence=experimental gene 156..4550 /gene="hNR2A" CDS 156..4550 /gene="hNR2A" /standard_name="NMDA receptor modulatory subunit 2A" /codon_start=1 /evidence=experimental /product="N-methyl-D-aspartate receptor modulatory subunit 2A" /db_xref="PID:g558749" /translation="MGRVGYWTLLVLPALLVWRGPAPSAAAEKGPPALNIAVMLGHSH DVTERELRTLWGPEQAAGLPLDVNVVALLMNRTDPKSLITHVCDLMSGARIHGLVFGD DTDQEAVAQMLDFISSHTFVPILGIHGGASMIMADKDPTSTFFQFGASIQQQATVMLK IMQDYDWHVFSLVTTIFPGYREFISFVKTTVDNSFVGWDMQNVITLDTSFEDAKTQVQ LKKIHSSVILLYCSKDEAVLILSEARSLGLTGYDFFWIVPSLVSGNTELIPKEFPSGL ISVSYDDWDYSLEARVRDGIGILTTAASSMLEKFSYIPEAKASCYGQMERPEVPMHTL HPFMVNVTWDGKDLSFTEEGYQVHPRLVVIVLNKDREWEKVGKWENHTLSLRHAVWPR YKSFSDCEPDDNHLSIVTLEEAPFVIVEDIDPLTETCVRNTVPCRKFVKINNSTNEGM NVKKCCKGFCIDILKKLSRTVKFTYDLYLVTNGKHGKKVNNVWNGMIGEVVYQRAVMA VGSLTINEERSEVVDFSVPFVETGISVMVSRSNGTVSPSAFLEPFSASVWVMMFVMLL IVSAIAVFVFEYFSPVGYNRNLAKGKAPHGPSFTIGKAIWLLWGLVFNNSVPVQNPKG TTSKIMVSVWAFFAVIFLASYTANLAAFMIQEEFVDQVTGLSDKKFQRPHDYSPPFRF GTVPNGSTERNIRNNYPYMHQYMTKFNQKGVEDALVSLKTGKLDAFIYDAAVLNYKAG RDEGCKLVTIGSGYIFATTGYGIALQKGSPWKRQIDLALLQFVGDGEMEELETLWLTG ICHNEKNEVMSSQLDIDNMAGVFYMLAAAMALSLITFIWEHLFYWKLRFCFTGVCSDR PGLLFSISRGIYSCIHGVHIEEKKKSPDFNLTGSQSNMLKLLRSAKNISSMSNMNSSR MDSPKRAADFIQRGSLIMDMVSDKGNLMYSDNRSFQGKESIFGDNMNELQTFVANRQK DNLNNYVFQGQHPLTLNESNPNTVEVAVSTESKANSRPRQLWKKSVDSIRQDSLSQNP VSQRDEATAENRTHSLKSPRYLPEEMAHSDISETSNRATCHREPDNSKNHKTKDNFKR SVASKYPKDCSEVERTYLKTKSSSPRDKIYTIDGEKEPGFHLDPPQFVENVTLPENVD FPDPYQDPSENFRKGDSTLPMNRNPLHNEEGLSNNDQYKLYSKHFTLKDKGSPHSETS ERYRQNSTHCRSCLSNMPTYSGHFTMRSPFKCDACLRMGNLYDIDEDQMLQETGNPAT GEQVYQQDWAQNNALQLQKNKLRISRQHSYDNIVDKPRELDLSRPSRSISLKDRERLL EGNFYGSLFSVPSSKLSGKKSSLFPQGLEDSKRSKSLLPDHTSDNPFLHSHRDDQRLV IGRCPSDPYKHSLPSQAVNDSYLRSSLRSTASYCSRDSRGHNDVYISEHVMPYAANKN NMYSTPRVLNSCSNRRVYKKMPSIESDV" variation 963 /gene="hNR2A" /frequency="0.33" /label=HH36 /replace="g" variation 1430 /gene="hNR2A" /frequency="0.5" /label=HH36-HH35-HH23A /replace="g" variation 2240 /gene="hNR2A" /frequency="0.5" /label=FB13 /replace="c" variation 3209 /gene="hNR2A" /frequency="0.5" /label=FB13 /replace="g" 3'UTR 4551..>6137 /evidence=experimental BASE COUNT 1588 a 1571 c 1509 g 1469 t ORIGIN 1 gacagcgcgg gacagccagg ggagcgcgct ggggccgcag catgcgggaa cccgctaaac 61 ccggtggctg ctgaggcggc cgagatgctc gtgcgcgcag cgcgccccac tgcatcctcg 121 accttctcgg gctacaggga ccgtcagtgg cgactatggg cagagtgggc tattggaccc 181 tgctggtgct gccggccctt ctggtctggc gcggtccggc gccgagcgcg gcggcggaga 241 agggtccccc cgcgctaaat attgcggtga tgctgggtca cagccacgac gtgacagagc 301 gcgaacttcg aacactgtgg ggccccgagc aggcggcggg gctgcccctg gacgtgaacg 361 tggtagctct gctgatgaac cgcaccgacc ccaagagcct catcacgcac gtgtgcgacc 421 tcatgtccgg ggcacgcatc cacggcctcg tgtttgggga cgacacggac caggaggccg 481 tagcccagat gctggatttt atctcctccc acaccttcgt ccccatcttg ggcattcatg 541 ggggcgcatc tatgatcatg gctgacaagg atccgacgtc taccttcttc cagtttggag 601 cgtccatcca gcagcaagcc acggtcatgc tgaagatcat gcaggattat gactggcatg 661 tcttctccct ggtgaccact atcttccctg gctacaggga attcatcagc ttcgtcaaga 721 ccacagtgga caacagcttt gtgggctggg acatgcagaa tgtgatcaca ctggacactt 781 cctttgagga tgcaaagaca caagtccagc tgaagaagat ccactcttct gtcatcttgc 841 tctactgttc caaagacgag gctgttctca ttctgagtga ggcccgctcc cttggcctca 901 ccgggtatga tttcttctgg attgtcccca gcttggtctc tgggaacacg gagctcatcc 961 caaaagagtt tccatcggga ctcatttctg tctcctacga tgactgggac tacagcctgg 1021 aggcgagagt gagggacggc attggcatcc taaccaccgc tgcatcttct atgctggaga 1081 agttctccta catccccgag gccaaggcca gctgctacgg gcagatggag aggccagagg 1141 tcccgatgca caccttgcac ccatttatgg tcaatgttac atgggatggc aaagacttat 1201 ccttcactga ggaaggctac caggtgcacc ccaggctggt ggtgattgtg ctgaacaaag 1261 accgggaatg ggaaaaggtg ggcaagtggg agaaccatac gctgagcctg aggcacgccg 1321 tgtggcccag gtacaagtcc ttctccgact gtgagccgga tgacaaccat ctcagcatcg 1381 tcaccctgga ggaggcccca ttcgtcatcg tggaagacat agacccccta accgagacgt 1441 gtgtgaggaa caccgtgcca tgtcggaagt tcgtcaaaat caacaattca accaatgagg 1501 ggatgaatgt gaagaaatgc tgcaaggggt tctgcattga tattctgaag aagctttcca 1561 gaactgtgaa gtttacttac gacctctatc tggtgaccaa tgggaagcat ggcaagaaag 1621 ttaacaatgt gtggaatgga atgatcggtg aagtggtcta tcaacgggca gtcatggcag 1681 ttggctcgct caccatcaat gaggaacgtt ctgaagtggt ggacttctct gtgccctttg 1741 tggaaacggg aatcagtgtc atggtttcaa gaagtaatgg caccgtctca ccttctgctt 1801 ttctagaacc attcagcgcc tctgtctggg tgatgatgtt tgtgatgctg ctcattgttt 1861 ctgccatagc tgtttttgtc tttgaatact tcagccctgt tggatacaac agaaacttag 1921 ccaaagggaa agcaccccat gggccttctt ttacaattgg aaaagctata tggcttcttt 1981 ggggcctggt gttcaataac tccgtgcctg tccagaatcc taaagggacc accagcaaga 2041 tcatggtatc tgtatgggcc ttcttcgctg tcatattcct ggctagctac acagccaatc 2101 tggctgcctt catgatccaa gaggaatttg tggaccaagt gaccggcctc agtgacaaaa 2161 agtttcagag acctcatgac tattccccac cttttcgatt tgggacagtg cctaatggaa 2221 gcacggagag aaacattcgg aataactatc cctacatgca tcagtacatg accaaattta 2281 atcagaaagg agtagaggac gccttggtca gcctgaaaac ggggaagctg gacgctttca 2341 tctacgatgc cgcagtcttg aattacaagg ctgggaggga tgaaggctgc aagctggtga 2401 ccatcgggag tgggtacatc tttgccacca ccggttatgg aattgccctt cagaaaggct 2461 ctccttggaa gaggcagatc gacctggcct tgcttcagtt tgtgggtgat ggtgagatgg 2521 aggagctgga gaccctgtgg ctcactggga tctgccacaa cgagaagaac gaggtgatga 2581 gcagccagct ggacattgac aacatggcgg gcgtattcta catgctggct gccgccatgg 2641 cccttagcct catcaccttc atctgggagc acctcttcta ctggaagctg cgcttctgtt 2701 tcacgggcgt gtgctccgac cggcctgggt tgctcttctc catcagcagg ggcatctaca 2761 gctgcattca tggagtgcac attgaagaaa agaagaagtc tccagacttc aatctgacgg 2821 gatcccagag caacatgtta aaactcctcc ggtcagccaa aaacatttcc agcatgtcca 2881 acatgaactc ctcaagaatg gactcaccca aaagagctgc tgacttcatc caaagaggtt 2941 ccctcatcat ggacatggtt tcagataagg ggaatttgat gtactcagac aacaggtcct 3001 ttcaggggaa agagagcatt tttggagaca acatgaacga actccaaaca tttgtggcca 3061 accggcagaa ggataacctc aataactatg tattccaggg acaacatcct cttactctca 3121 atgagtccaa ccctaacacg gtggaggtgg ccgtgagcac agaatccaaa gcgaactcta 3181 gaccccggca gctgtggaag aaatccgtag attccatacg ccaggattca ctatcccaga 3241 atccagtctc ccagagggat gaggcaacag cagagaatag gacccactcc ctaaagagcc 3301 ctaggtatct tccagaagag atggcccact ctgacatttc agaaacgtca aatcgggcca 3361 cgtgccacag ggaacctgac aacagtaaga accacaaaac caaggacaac tttaaaaggt 3421 cagtggcctc caaatacccc aaggactgta gtgaggtcga gcgcacctac ctgaaaacca 3481 aatcaagctc ccctagagac aagatctaca ctatagatgg tgagaaggag cctggtttcc 3541 acttagatcc accccagttt gttgaaaatg tgaccctgcc cgagaacgtg gacttcccgg 3601 acccctacca ggatcccagt gaaaacttcc gcaaggggga ctccacgctg ccaatgaacc 3661 ggaacccctt gcataatgaa gaggggcttt ccaacaacga ccagtataaa ctctactcca 3721 agcacttcac cttgaaagac aagggttccc cgcacagtga gaccagcgag cgataccggc 3781 agaactccac gcactgcaga agctgccttt ccaacatgcc cacctattca ggccacttca 3841 ccatgaggtc ccccttcaag tgcgatgcct gcctgcggat ggggaatctc tatgacatcg 3901 atgaagacca gatgcttcag gagacaggta acccagccac cggggagcag gtctaccagc 3961 aggactgggc acagaacaat gcccttcaat tacaaaagaa caagctaagg attagccgtc 4021 agcattccta cgataacatt gtcgacaaac ctagggagct agaccttagc aggccctccc 4081 ggagcataag cctcaaggac agggaacggc ttctggaggg aaatttttac ggcagcctgt 4141 ttagtgtccc ctcaagcaaa ctctcgggga aaaaaagctc ccttttcccc caaggtctgg 4201 aggacagcaa gaggagcaag tctctcttgc cagaccacac ctccgataac cctttcctcc 4261 actcccacag ggatgaccaa cgcttggtta ttgggagatg cccctcggac ccttacaaac 4321 actcgttgcc atcccaggcg gtgaatgaca gctatcttcg gtcgtccttg aggtcaacgg 4381 catcgtactg ttccagggac agtcggggcc acaatgatgt gtatatttcg gagcatgtta 4441 tgccttatgc tgcaaataag aataatatgt actctacccc cagggtttta aattcctgca 4501 gcaatagacg cgtgtacaag aaaatgccta gtatcgaatc tgatgtttaa aaatcttcca 4561 ttaatgtttt atctataggg aaatatacgt aatggccaat gttctggagg gtaaatgttg 4621 gatgtccaat agtgccctgc taagaggaag aagatgtagg gaggtatttt gttgttgttg 4681 ttgttggctc ttttgcacac ggcttcatgc cataatcttc cactcaagga atcttgtgag 4741 gtgtgtgctg agcatggcag acaccagata ggtgagtcct taaccaaaaa taactaacta 4801 cataagggca agtctccggg acatgcctac tgggtatgtt ggcaataatg atgcattgga 4861 tgccaatggt gatgttatga tttcctatat tccaaattcc attaaggtca gcccaccatg 4921 taattttctc atcagaaatg cctaatggtt tctctaatac agaataagca atatggtgtg 4981 catgtaaacc tgacacagac aaaataaaaa cagttaagaa tgcatctgca ctgtagtcgg 5041 atttgaacat gtgcaagaga ttaggaagtt tggctcgtaa cagtttcagc tttcttgtta 5101 tgccttccat cacagcccag gctcacccca agaactccag gctcccctaa agaatagcaa 5161 atcagtgtgt tcgtgatgac tgtgctacct tcattatagt tcatttccaa gacacatctg 5221 gagccaaagg cccgagggac cctcaggtgg ggagagctac aggaatctct ttggatgttg 5281 atgtgtgttt ctctctaccc tcggcttcga tggtcttgtt cagagctgca taaactaaca 5341 catttatgtc tccgagatct aagtgtggat cttctgtctg tgacacagtg gccattgtag 5401 tttatcccga agacgcctat gtacgtaagt ttgcatttcc tcccttctgg tgatgactca 5461 gggttgtata gtatctgtta ccccttccct cccagagtaa ccataactcg ttccgtttcc 5521 aaacagccat ggtggtgtcc aattagctgt gtatcgctct tcccagagtt gttaatgtgg 5581 tgacatgcac caacagccgt atgtgtactg tgatctgtaa gaagtacaat gccatctgtc 5641 tgccgaaggc tagcatggtt ttaggtttat cttccttcac atccagaaat tctgttggac 5701 actcacttcc accccaaact cctcaaatca aaagccttca aaacacgagg cactcttgga 5761 tctaccctga gtatcctcca aactgtggat acagtttagt gagacaagca atttctccct 5821 tctgagttat tctctctgtt ggtggcaaac cacttcatag caccaacaga gatgtaggaa 5881 aaattcctca aagtatttgt catttctgag tcgcctgcat tatcccattc ttattctcct 5941 caaacctgtg catatatgac atgaaatgat atccattttt tttttaagtt agaaacagag 6001 aggggaatac ttatgcatgg ggagcctgtt agcacagtgc ctgccacaaa aacaagtgcc 6061 cccgacaaga tagttgctat gttatgacac tttctcagat caggattttc tagtttaaaa 6121 attaaatatc ataaaac // LOCUS HSU09086 2490 bp mRNA PRI 05-JUL-1994 DEFINITION Human thymopoietin alpha mRNA, complete cds. ACCESSION U09086 NID g508724 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2490) AUTHORS Harris,C.A., Andryuk,P.J., Cline,S., Chan,H.K., Natarajan,A., Siekierka,J.J. and Goldstein,G. TITLE Three distinct human thymopoietins are derived from alternatively spliced mRNAs JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91, 6283-6287 (1994) MEDLINE 94294366 REFERENCE 2 (bases 1 to 2490) AUTHORS Harris,C.A. TITLE Direct Submission JOURNAL Submitted (20-APR-1994) Crafford A. Harris, Immunobiology Research Institute, Route 22 East, Annandale, NJ 08801-0999, USA FEATURES Location/Qualifiers source 1..2490 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="thymus" CDS 205..2289 /codon_start=1 /product="thymopoietin alpha" /db_xref="PID:g508725" /translation="MPEFLEDPSVLTKDKLKSELVANNVTLPAGEQRKDVYVQLYLQH LTARNRPPLPAGTNSKGPPDFSSDEEREPTPVLGSGAAAAGRSRAAVGRKATKKTDKP RQEDKDDLDVTELTNEDLLDQLVKYGVNPGPIVGTTRKLYEKKLLKLREQGTESRSST PLPTISSSAENTRQNGSNDSDRYSDNEEGKKKEHKKVKSTRDIVPFSELGTTPSGGGF FQGISFPEISTRPPLGSTELQAAKKVHTSKGDLPREPLVATNLPGRGQLQKLASERNL FISCKSSHDRCLEKSSSSSSQPEHSAMLVSTAASPSLIKETTTGYYKDIVENICGREK SGIQPLCPERSHISDQSPLSSKRKALEESESSQLISPPLAQAIRDYVNSLLVQGGVGS LPGTSNSMPPLDVENIQKRIDQSKFQETEFLSPPRKVPRLSEKSVEERDSGSFVAFQN IPGSELMSSFAKTVVSHSLTTLGLEVAKQSQHDKIDASELSFPFHESILKVIEEEWQQ VDRQLPSLACKYPVSSREATQILSVPKVDDEILGFISEATPLGGIQAASTESCNQQLD LALCRAYEAAASALQIATHTAFVAKAMQADISQAAQILSSDPSRTHQALGILSKTYDA ASYICEAAFDEVKMAAHTMGNATVGRRYLWLKDCKINLASKNKLASTPFKGGTLFGGE VCKVIKKRGNKH" variation 1999 /replace="G" BASE COUNT 703 a 567 c 594 g 626 t ORIGIN 1 gttcgtagtt cggctctggg gtcttttgtg tccgggtctg gcttggcttt gtgtccgcga 61 gtttttgttc cgctccgcag cgctcttccc gggcaggagc cgtgaggctc ggaggcggca 121 gcgcggtccc cggccaggag caagcgcgcc ggcgtgagcg gcggcggcaa aggctgtggg 181 gagggggctt cgcagatccc cgagatgccg gagttcctgg aagacccctc ggtcctgaca 241 aaagacaagt tgaagagtga gttggtcgcc aacaatgtga cgctgccggc cggggagcag 301 cgcaaagacg tgtacgtcca gctctacctg cagcacctca cggctcgcaa ccggccgccg 361 ctccccgccg gcaccaacag caaggggccc ccggacttct ccagtgacga agagcgcgag 421 cccaccccgg tcctcggctc tggggccgcc gccgcgggcc ggagccgagc agccgtcggc 481 aggaaagcca caaaaaaaac tgataaaccc agacaagaag ataaagatga tctagatgta 541 acagagctca ctaatgaaga tcttttggat cagcttgtga aatacggagt gaatcctggt 601 cctattgtgg gaacaaccag gaagctatat gagaaaaagc ttttgaaact gagggaacaa 661 ggaacagaat caagatcttc tactcctctg ccaacaattt cttcttcagc agaaaataca 721 aggcagaatg gaagtaatga ttctgacaga tacagtgaca atgaagaagg aaagaagaaa 781 gaacacaaga aagtgaagtc cactagggat attgttcctt tttctgaact tggaactact 841 ccctctggtg gtggattttt tcagggtatt tcttttcctg aaatctccac ccgtcctcct 901 ttgggcagta ccgaactaca ggcagctaag aaagtacata cttctaaggg agacctacct 961 agggagcctc ttgttgccac aaacttgcct ggcaggggac agttgcagaa gttagcctct 1021 gaaaggaatt tgtttatttc atgcaagtct agccatgata ggtgtttaga gaaaagttct 1081 tcgtcatctt ctcagcctga acacagtgcc atgttggtct ctactgcagc ttctccttca 1141 ctgattaaag aaaccaccac tggttactat aaagacatag tagaaaatat ttgcggtaga 1201 gagaaaagtg gaattcaacc attatgtcct gagaggtccc atatttcaga tcaatcgcct 1261 ctctccagta aaaggaaagc actagaagag tctgagagct cacaactaat ttctccgcca 1321 cttgcccagg caatcagaga ttatgtcaat tctctgttgg tccagggtgg ggtaggtagt 1381 ttgcctggaa cttctaactc tatgccccca ctggatgtag aaaacataca gaagagaatt 1441 gatcagtcta agtttcaaga aactgaattc ctgtctcctc caagaaaagt ccctagactg 1501 agtgagaagt cagtggagga aagggattca ggttcctttg tggcatttca gaacatacct 1561 ggatccgaac tgatgtcttc ttttgccaaa actgttgtct ctcattcact cactacctta 1621 ggtctagaag tggctaagca atcacagcat gataaaatag atgcctcaga actatctttt 1681 cccttccatg aatctatttt aaaagtaatt gaagaagaat ggcagcaagt tgacaggcag 1741 ctgccttcac tggcatgcaa atatccagtt tcttccaggg aggcaacaca gatattatca 1801 gttccaaaag tagatgatga aatcctaggg tttatttctg aagccactcc actaggaggt 1861 attcaagcag cctccactga gtcttgcaat cagcagttgg acttagcact ctgtagagca 1921 tatgaagctg cagcatcagc attgcagatt gcaactcaca ctgcctttgt agctaaggct 1981 atgcaggcag acattagtca agctgcacag attcttagct cagatcctag tcgtacccac 2041 caagcgcttg ggattctgag caaaacatat gatgcagcct catatatttg tgaagctgca 2101 tttgatgaag tgaagatggc tgcccatacc atgggaaatg ccactgtagg tcgtcgatac 2161 ctctggctga aggattgcaa aattaattta gcttctaaga ataagctggc ttccactccc 2221 tttaaaggtg gaacattatt tggaggagaa gtatgcaaag taattaaaaa gcgtggaaat 2281 aaacactagt aaaattaagg acaaaaagac atctatctta tctttcaggt actttatgcc 2341 aacattttct tttctgttaa ggttgtttta gtttccagat agggctaatt acaaaatgtt 2401 aagcttctac ccatcaaatt acagtataaa agtaattgcc tgtgtagaac tacttgtctt 2461 ttctaaagat ttgcgtagat aggaagcctg // LOCUS HSU09117 2627 bp mRNA PRI 01-AUG-1995 DEFINITION Human phospholipase c delta 1 mRNA, complete cds. ACCESSION U09117 NID g483919 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2627) AUTHORS Cheng,H.F., Jiang,M.J., Chen,C.L., Liu,S.M., Wong,L.P., Lomasney,J.W. and King,K. TITLE Cloning and identification of amino acid residues of human phospholipase C delta 1 essential for catalysis JOURNAL J. Biol. Chem. 270 (10), 5495-5505 (1995) MEDLINE 95197554 REFERENCE 2 (bases 1 to 2627) AUTHORS King,K. TITLE Direct Submission JOURNAL Submitted (21-APR-1994) Klim King, Institute of Biomedical Sciences, Academia Sinica, Taipei, Taiwan, 11529, Republic of China FEATURES Location/Qualifiers source 1..2627 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda gt11 cDNA" /tissue_type="aorta" CDS 95..2365 /codon_start=1 /evidence=experimental /product="phospholipase c delta 1" /db_xref="PID:g483920" /translation="MDSGRDFLTLHGLQDDEDLQALLKGSQLLKVKSSSWRRERFYKL QEDCKTIWQESRKVMRTPESQLFSIEDIQEVRMGHRTEGLEKFARDVPEDRCFSIVFK DQRNTLDLIAPSPADAQHWVLGLHKIIHHSGSMDQRQKLQHWIHSCLRKADKNKDNKM SFKELQNFLKELNIQVDDSYARKIFRECDHSQTDSLEDEEIEAFYKMLTQRVEIDRTF AEAAGPGETLSVDQLVTFLQHQQREEAAGPALALSLIERYEPSETTKAQRQMTKDGFL MYLLSADGSAFSLAHRRVYQDMGQPLSHYLVSSSHNTYLLEDQLAGPSSTEAYIRALC KGCRCLELDCWDGPNQEPIIYHGYTFTSKILFCDVLRAIRDYAFKASPYPVILSLENH CTLEQQRVMARHLHAILGPMLLNRPLDGVTNSLPSPEQLKGKILLKGKKLGGLLPPGG EGGPEATVVSDEDEAAEMEDEAVRSRVQHKPKEDKLRLAQELSDMVIYCKSVHFGGFS SPGTPGQAFYEMASFSENRALRLLQESGNGFVRHNVGHLSRIYPAGWRTDSSNYSPVE MWNGGCQIVALNFQTPGPEMDVYQDRFQDNGACGYVLKPAFLRDPNGTFNPRALAQGP WWARKRLNIRVISGQQLPKVNKNKNSIVDPKVTVEIHGVSRDVASRQTAVITNNGFNP WWDTEFAFEVVVPDLALIRFLVEDYDASSKNDFIGQSTIPLNSLKQGYRHVHLMSKNG DQHPSATLFVKISLQD" BASE COUNT 566 a 813 c 763 g 485 t ORIGIN 1 ttctggtcgc actccgctcg gaccccaggc cgccggtgct gtcgctactc aagtgagtcc 61 cgcggtgccc ctcccgccgc gccgcccgac gggcatggac tcgggccggg acttcctgac 121 cctgcacggc ctacaggatg atgaggatct acaggcgctg ctgaagggca gccagctcct 181 gaaggtgaag tccagctcat ggaggagaga gcggttctac aagttgcagg aggactgcaa 241 gaccatctgg caggagtccc gcaaggtcat gcggaccccg gagtcccagc tgttctccat 301 cgaggacatt caggaggtgc gaatggggca ccgcacggag ggtctggaga agttcgcccg 361 tgatgtgccc gaggaccgct gcttctccat tgtcttcaag gaccagcgca atacactaga 421 cctcatcgcc ccatcgccag ctgatgccca gcactgggtg ctggggctgc acaagatcat 481 ccaccactca ggctccatgg accagcgtca gaagctacag cactggattc actcctgctt 541 gcgaaaagct gacaaaaaca aggacaacaa gatgagcttc aaggagctgc agaacttcct 601 gaaggagctc aacatccagg tggacgacag ctatgcccgg aagatcttca gggagtgtga 661 ccactcccag acagactccc tggaggacga ggagattgag gccttctaca agatgctgac 721 ccagcgggtg gagatcgacc gcaccttcgc cgaggccgcg ggcccagggg agactctgtc 781 ggtggatcag ttagtgacgt tcctgcagca ccagcagcgg gaggaggcgg cagggcctgc 841 gctggccctc tccctcattg agcgctacga gcccagcgag actaccaagg cgcagcggca 901 gatgaccaag gacggcttcc tcatgtactt actgtcggct gacggcagcg ccttcagcct 961 ggcacaccgc cgtgtctacc aggacatggg ccagccactt agccactacc tggtgtcctc 1021 ttcacacaac acctacctgc tggaggacca gctagccggg cccagcagca ctgaagccta 1081 catccgggca ctgtgcaaag gctgccgatg cctggagctt gactgctggg acgggcccaa 1141 ccaggaacca atcatctacc acggctatac tttcacttcc aagatcctct tctgcgatgt 1201 gctcagggcc atccgggact atgccttcaa ggcgtccccc taccctgtca tcctatccct 1261 ggagaaccac tgcacactgg agcagcagcg cgtgatggcg cggcacctgc atgccatcct 1321 gggccccatg ctgttgaacc gaccactgga tggggtcacc aacagcctgc cctcccctga 1381 gcaactgaag gggaagatcc tgctgaaggg gaagaagctc ggggggctcc tcccccctgg 1441 aggggagggt ggccctgagg ccactgtggt gtcagacgaa gacgaggctg ctgagatgga 1501 ggatgaggca gtgaggagcc gtgtgcagca caagcccaag gaggacaagc tcaggctagc 1561 acaggagctc tctgacatgg tcatttactg caagagtgtc cactttgggg gcttctccag 1621 tcctggcacc cctggacagg ccttctacga gatggcgtcc ttctctgaga accgtgccct 1681 tcgactgctc caagaatcag gaaacggctt tgtccgccac aacgtggggc acctgagcag 1741 aatatacccg gctggatgga gaacagactc ctccaactac agccccgtgg agatgtggaa 1801 tgggggctgc cagatcgtgg ccctgaattt ccagacacct gggccagaga tggacgtgta 1861 ccaggaccgc ttccaggaca acggggcctg tgggtacgtg ctgaagcccg ccttcctgcg 1921 agaccccaac ggcaccttta acccccgcgc cctggctcag gggccctggt gggcacggaa 1981 gcggctcaac atcagggtca tttcggggca gcagctgcca aaagtcaaca agaataagaa 2041 ttcaattgtg gaccccaaag tgacagtgga gatccatggc gtgagccggg acgtggccag 2101 ccgccagact gctgtcatca ccaacaatgg tttcaaccca tggtgggaca cggagtttgc 2161 gtttgaggta gttgtgcctg accttgccct catccgcttc ttggtggaag attatgatgc 2221 ctcctccaag aatgacttca ttggccagag taccatcccc ttgaacagcc tcaagcaagg 2281 ataccgccat gtccacctca tgtctaagaa cggggaccag catccatcag ccaccctctt 2341 tgtgaagatc tccctccagg actaggctgg aggaacccag tggggtcccc cctgagtggg 2401 ctgggccctc tgtccacatg tggggacagg gctggtgtgc ctgctcccag cctcttgctc 2461 agagctaggc ccccaaattg ccttcagccc taacatagtg tctgctgctg cctccctggg 2521 gaccaggagc tagcccagtc cctggagctg tccttcattc cgttaggaat aacaatgcag 2581 ccctctccac cctccggcca gcgagtggtc aaggattttt ataaaaa // LOCUS HSU09178 3951 bp mRNA PRI 28-DEC-1994 DEFINITION Human dihydropyrimidine dehydrogenase mRNA, complete cds. ACCESSION U09178 NID g558304 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3951) AUTHORS Yokota,H., Fernandez-Salguero,P., Furuya,H., Lin,K., McBride,O.W., Podschun,B., Schnackerz,K.D. and Gonzalez,F.J. TITLE cDNA cloning and chromosome mapping of human dihydropyrimidine dehydrogenase, an enzyme associated with 5-fluorouracil toxicity and congenital thymine uraciluria JOURNAL J. Biol. Chem. 269 (37), 23192-23196 (1994) MEDLINE 94365020 REFERENCE 2 (sites) AUTHORS Eggink,G., Engel,H., Vriend,G., Terpstra,P. and Witholt,B. TITLE Rubredoxin reductase of Pseudomonas oleovorans. Structural relationship to other flavoprotein oxidoreductases based on one NAD and two FAD fingerprints JOURNAL J. Mol. Biol. 212 (1), 135-142 (1990) MEDLINE 90204534 REFERENCE 3 (sites) AUTHORS Porter,D.J., Chestnut,W.G., Merrill,B.M. and Spector,T. TITLE Mechanism-based inactivation of dihydropyrimidine dehydrogenase by 5-ethynyluracil JOURNAL J. Biol. Chem. 267 (8), 5236-5242 (1992) MEDLINE 92184771 REFERENCE 4 (sites) AUTHORS Dupuis,A., Skehel,J.M. and Walker,J.E. TITLE A homologue of a nuclear-coded iron-sulfur protein subunit of bovine mitochondrial complex I is encoded in chloroplast genomes JOURNAL Biochemistry 30 (11), 2954-2960 (1991) MEDLINE 91175743 REFERENCE 5 (sites) AUTHORS Wierenga,R.K., De Maeyer,M.C.H. and Hol,W.G.J. TITLE Interaction of pyrophosphatase moieties with alfa-helixes in dinucleotide binding proteins JOURNAL Biochemistry 24, 1346-1357 (1985) REFERENCE 6 (bases 1 to 3951) AUTHORS Gonzalez,F.J. TITLE Direct Submission JOURNAL Submitted (22-APR-1994) Frank J. Gonzalez, National Cancer Institute, National Institutes of Health, 9000 Rockville Pike, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..3951 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" CDS 82..3159 /codon_start=1 /product="dihydropyrimidine dehydrogenase" /db_xref="PID:g558305" /translation="MAPVLSKDSADIESILALNPRTQTHATLCSTSAKKLDKKHWKRN PDKNCFNCEKLENNFDDIKHTTLGERGALREAMRCLKCADAPCQKSCPTNLDIKSFIT SIANKNYYGAAKMIFSDNPLGLTCGMVCPTSDLCVGGCNLYATEEGPINIGGLQQFAT EVFKAMSIPQIRNPSLPPPEKMSEAYSAKIALFGAGPASISCASFLARLGYSDITIFE KQEYVGGLSTSEIPQFRLPYDVVNFEIELMKDLGVKIICGKSLSVNEMTLSTLKEKGY KAAFIGIGLPEPNKDAIFQGLTQDQGFYTSKDFLPLVAKGSKAGMCACHSPLPSIRGV VIVLGAGDTAFDCATSALRCGARRVFIVFRKGFVNIRAVPEEMELAKEEKCEFLPFLS PRKVIVKGGRIVAMQFVRTEQDETGKWNEDEDQMVHLKADVVISAFGSVLSDPKVKEA LSPIKFNRWGLPEVDPETMQTSEAWVFAGGDVVGLANTTVESVNDGKQASWYIHKYVQ SQYGASVSAKPELPLFYTPIDLVDISVEMAGLKFINPFGLASATPATSTSMIRRAFEA GWGFALTKTFSLDKDIVTNVSPRIIRGTTSGPMYGPGQSSFLNIELISEKTAAYWCQS VTELKADFPDNIVIASIMCSYNKNDWTELAKKSEDSGADALELNLSCPHGMGERGMGL ACGQDPELVRNICRWVRQAVQIPFFAKLTPNVTDIVSIARAAKEGGANGVTATNTVSG LMGLKSDGTPWPAVGIAKRTTYGGVSGTAIRPIALRAVTSIARALPGFPILATGGIDS AESGLQFLHSGASVLQVCSAIQNQDFTVIEDYCTGLKALLYLKSIEELQDWDGQSPAT VSHQKGKPVPRIAELMDKKLPSFGPYLEQRKKIIAENKIRLKEQNVAFSPLKRSCFIP KRPIPTIKDVIGKALQYLGTFGELSNVEQVVAMIDEEMCINCGKCYMTCNDSGYQAIQ FDPETHLPTITDTCTGCTLCLSVCPIVDCIKMVSRTTPYEPKRGVPLSVNPVC" misc_feature 1084..1134 /citation=[5] /function="catalytic cofactor NADPH/NADP binding site" misc_feature 1492..1524 /citation=[2] /function="electron transfer center, FAD binding site" misc_feature 2062..2175 /citation=[3] /function="uracil (substrate) binding site" misc_feature 2938..2973 /standard_name="iron-sulfur center" /citation=[4] /function="catalytic cofactor [4Fe-4S] binding site" BASE COUNT 1153 a 785 c 896 g 1117 t ORIGIN 1 gctgtcactt ggctctctgg ctggagcttg aggacgcaag gagggtttgt cactggcaga 61 ctcgagactg taggcactgc catggcccct gtgctcagta aggactcggc ggacatcgag 121 agtatcctgg ctttaaatcc tcgaacacaa actcatgcaa ctctgtgttc cacttcggcc 181 aagaaattag acaagaaaca ttggaaaaga aatcctgata agaactgctt taattgtgag 241 aagctggaga ataattttga tgacatcaag cacacgactc ttggtgagcg aggagctctc 301 cgagaagcaa tgagatgcct gaaatgtgca gatgccccgt gtcagaagag ctgtccaact 361 aatcttgata ttaaatcatt catcacaagt attgcaaaca agaactatta tggagctgct 421 aagatgatat tttctgacaa cccacttggt ctgacttgtg gaatggtatg tccaacctct 481 gatctatgtg taggtggatg caatttatat gccactgaag agggacccat taatattggt 541 ggattgcagc aatttgctac tgaggtattc aaagcaatga gtatcccaca gatcagaaat 601 ccttcgctgc ctcccccaga aaaaatgtct gaagcctatt ctgcaaagat tgctcttttt 661 ggtgctgggc ctgcaagtat aagttgtgct tcctttttgg ctcgattggg gtactctgac 721 atcactatat ttgaaaaaca agaatatgtt ggtggtttaa gtacttctga aattcctcag 781 ttccggctgc cgtatgatgt agtgaatttt gagattgagc taatgaagga ccttggtgta 841 aagataattt gcggtaaaag cctttcagtg aatgaaatga ctcttagcac tttgaaagaa 901 aaaggctaca aagctgcttt cattggaata ggtttgccag aacccaataa agatgccatc 961 ttccaaggcc tgacgcagga ccaggggttt tatacatcca aagacttttt gccacttgta 1021 gccaaaggca gtaaagcagg aatgtgcgcc tgtcactctc cattgccatc gatacgggga 1081 gtcgtgattg tacttggagc tggagacact gccttcgact gtgcaacatc tgctctacgt 1141 tgtggagctc gccgagtgtt catcgtcttc agaaaaggct ttgttaatat aagagctgtc 1201 cctgaggaga tggagcttgc taaggaagaa aagtgtgaat ttctgccatt cctgtcccca 1261 cggaaggtta tagtaaaagg tgggagaatt gttgctatgc agtttgttcg gacagagcaa 1321 gatgaaactg gaaaatggaa tgaagatgaa gatcagatgg tccatctgaa agccgatgtg 1381 gtcatcagtg cctttggttc agttctgagt gatcctaaag taaaagaagc cttgagccct 1441 ataaaattta acagatgggg tctcccagaa gtagatccag aaactatgca aactagtgaa 1501 gcatgggtat ttgcaggtgg tgatgtcgtt ggtttggcta acactacagt ggaatcggtg 1561 aatgatggaa agcaagcttc ttggtacatt cacaaatacg tacagtcaca atatggagct 1621 tccgtttctg ccaagcctga actacccctc ttttacactc ctattgatct ggtggacatt 1681 agtgtagaaa tggccggatt gaagtttata aatccttttg gtcttgctag cgcaactcca 1741 gccaccagca catcaatgat tcgaagagct tttgaagctg gatggggttt tgccctcacc 1801 aaaactttct ctcttgataa ggacattgtg acaaatgttt cccccagaat catccgggga 1861 accacctctg gccccatgta tggccctgga caaagctcct ttctgaatat tgagctcatc 1921 agtgagaaaa cggctgcata ttggtgtcaa agtgtcactg aactaaaggc tgacttccca 1981 gacaacattg tgattgctag cattatgtgc agttacaata aaaatgactg gacggaactt 2041 gccaagaagt ctgaggattc tggagcagat gccctggagt taaatttatc atgtccacat 2101 ggcatgggag aaagaggaat gggcctggcc tgtgggcagg atccagagct ggtgcggaac 2161 atctgccgct gggttaggca agctgttcag attccttttt ttgccaagct gaccccaaat 2221 gtcactgata ttgtgagcat cgcaagagct gcaaaggaag gtggtgccaa tggcgttaca 2281 gccaccaaca ctgtctcagg tctgatggga ttaaaatctg atggcacacc ttggccagca 2341 gtggggattg caaagcgaac tacatatgga ggagtgtctg ggacagcaat cagacctatt 2401 gctttgagag ctgtgacctc cattgctcgt gctctgcctg gatttcccat tttggctact 2461 ggtggaattg actctgctga aagtggtctt cagtttctcc atagtggtgc ttccgtcctc 2521 caggtatgca gtgccattca gaatcaggat ttcactgtga tcgaagacta ctgcactggc 2581 ctcaaagccc tgctttatct gaaaagcatt gaagaactac aagactggga tggacagagt 2641 ccagctactg tgagtcacca gaaagggaaa ccagttccac gtatagctga actcatggac 2701 aagaaactgc caagttttgg accttatctg gaacagcgca agaaaatcat agcagaaaac 2761 aagattagac tgaaagaaca aaatgtagct ttttcaccac ttaagagaag ctgttttatc 2821 cccaaaaggc ctattcctac catcaaggat gtaataggaa aagcactgca gtaccttgga 2881 acatttggtg aattgagcaa cgtagagcaa gttgtggcta tgattgatga agaaatgtgt 2941 atcaactgtg gtaaatgcta catgacctgt aatgattctg gctaccaggc tatacagttt 3001 gatccagaaa cccacctgcc caccataacc gacacttgta caggctgtac tctgtgtctc 3061 agtgtttgcc ctattgtcga ctgcatcaaa atggtttcca ggacaacacc ttatgaacca 3121 aagagaggcg tacccttatc tgtgaatccg gtgtgttaag gtgatttgtg aaacagttgc 3181 tgtgaacttt catgtcacct acatatgctg atctcttaaa atcatgatcc ttgtgttcag 3241 ctctttccaa attaaaacaa atatacattt tctaaataaa aatatgtaat ttcaaaatac 3301 atttgtaagt gtaaaaaatg tctcatgtca atgaccattc aattagtggc ataaaataga 3361 ataattcttt tctgaggata gtagttaaat aactgtgtgg cagttaattg gatgttcact 3421 gccagttgtc ttatgtgaaa aattaacttt ttgtgtggca attagtgtga cagtttccaa 3481 attgccctat gctgtgctcc atatttgatt tctaattgta agtgaaatta agcattttga 3541 aacaaagtac tctttaacat acaagaaaat gtatccaagg aaacatttta tcaataaaaa 3601 ttacctttaa ttttaatgct gtttctaaga aaatgtagtt agctccataa agtacaaatg 3661 aagaaagtca aaaattattt gctatggcag gataagaaag cctaaaattg agtttgtgga 3721 ctttattaag taaaatcccc ttcgctgaaa ttgcttattt ttggtgttgg atagaggata 3781 gggagaatat ttactaacta aataccattc actactcatg cgtgagatgg gtgtacaaac 3841 tcatcctctt ttaatggcat ttctctttaa actatgttcc taaccaaatg agatgatagg 3901 atagatcctg gttaccactc ttttactgtg cacatatggg ccccggaatt c // LOCUS HSU09202 956 bp mRNA PRI 30-NOV-1995 DEFINITION Human ornithine decarboxylase antizyme (Oaz) mRNA, complete cds. ACCESSION U09202 NID g852427 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 956) AUTHORS Tewari,D.S., Qian,Y., Thornton,R.D., Pieringer,J., Taub,R., Mochan,E. and Tewari,M. TITLE Molecular cloning and sequencing of a human cDNA encoding ornithine decarboxylase antizyme JOURNAL Biochim. Biophys. Acta 1209 (2), 293-295 (1994) MEDLINE 95110821 REFERENCE 2 (bases 1 to 956) AUTHORS Tewari,M. TITLE Direct Submission JOURNAL Submitted (25-APR-1994) Manorama Tewari, Dept. of Biochemistry/Molecular Biol., Philadelphia College of Osteopathic Medicine, 406 Evans Hall, 4170 City Avenue, Philadelphia, PA 19131, USA FEATURES Location/Qualifiers source 1..956 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Lambda ZAP cDNA Library" /cell_line="gingival fibroblast" CDS 46..252 /codon_start=1 /product="unknown" /db_xref="PID:g852429" /translation="MVKSSLQRILNSHCFAREKEGDKPSATIHASRTMPLLSLHSRGG SSSESSRVSLHCCSNPGPGPRWCS" gene 53..733 /gene="Oaz" CDS <53..733 /gene="Oaz" /note="product can bind to the protein ornithine decarboxylase" /codon_start=1 /function="inhibitor" /product="ornithine decarboxylase antizyme" /db_xref="PID:g852428" /translation="NPPCSGSSIATASPERRKGINPAPPSTPAAPCRSLACTAAAAAA VRVPGSPSTAVVTRVRGLGGAPDAPHPPLKIPGGRGNSQRDHNLSANLFYSDDRLNVT EELTSNDKTRILNVQSRLTDAKRINWRTVLSGGSLYIEIPGGALPEGSKDSFAVLLEF AEEQLRADHVFICFHKNREDRAALLRTFSFLGFEIVRPGHPLVPKRPDACFMAYTFER ESSGEEEE" polyA_signal 848..853 polyA_signal 932..937 BASE COUNT 208 a 270 c 267 g 211 t ORIGIN 1 agagacgcag cggaggtttt cctggtttcg gaccccagcg gccggatggt gaaatcctcc 61 ctgcagcgga tcctcaatag ccactgcttc gccagagaga aggaagggga taaacccagc 121 gccaccatcc acgccagccg caccatgccg ctccttagcc tgcacagccg cggcggcagc 181 agcagtgaga gttccagggt ctccctccac tgctgtagta acccgggtcc ggggcctcgg 241 tggtgctcct gatgcccctc acccacccct gaagatccca ggtgggcgag ggaatagtca 301 gagggatcac aatctttcag ctaacttatt ctactccgat gatcggctga atgtaacaga 361 ggaactaacg tccaacgaca agacgaggat tctcaacgtc cagtccaggc tcacagacgc 421 caaacgcatt aactggcgaa cagtgctgag tggcggcagc ctctacatcg agatcccggg 481 cggcgcgctg cccgagggga gcaaggacag ctttgcagtt ctcctggagt tcgctgagga 541 gcagctgcga gccgaccatg tcttcatttg cttccacaag aaccgcgagg acagagccgc 601 cttgctccga accttcagct ttttgggctt tgagattgtg agaccggggc atccccttgt 661 ccccaagaga cccgacgctt gcttcatggc ctacacgttc gagagagagt cttcgggaga 721 ggaggaggag tagggccgcc tcggggctgg gcatccggcc cctggggcca ccccttgtca 781 gccgggtggg taggaaccgt agactcgctc atctcgcctg ggtttgtccg catgttgtaa 841 tcgtgcaaat aaacgctcac tccgaattag cggtgtattt cttgaagttt aatattgtgt 901 ttgtgatact gaagtatttg ctttaattct aaataaaaat ttatatttta cttttt // LOCUS HSU09210 2421 bp mRNA PRI 04-AUG-1994 DEFINITION Human vesicular acetylcholine transporter mRNA, complete cds. ACCESSION U09210 NID g507743 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2421) AUTHORS Erickson,J.D., Varoqui,H., Schafer,M.K., Modi,W., Diebler,M., Weihe,E., Rand,J., Eiden,L.E., Bonner,T.I. and Usdin,T.B. TITLE Functional identification of a vesicular acetylcholine transporter and its expression from a 'cholinergic' gene locus JOURNAL J. Biol. Chem. 269, 21929-21932 (1994) MEDLINE 94350930 REFERENCE 2 (bases 1 to 2421) AUTHORS Chireux,M.A., Le Van Thai,A. and Weber,M. TITLE Human choline acetyltransferase gene: sequence of alternative first exons JOURNAL Unpublished (1993) REFERENCE 3 (bases 1 to 2421) AUTHORS Bonner,T.I. TITLE Direct Submission JOURNAL Submitted (25-APR-1994) Tom I. Bonner, Laboratory of Cell Biology, National Institute of Mental Health, Bldg 36, Room 3A-07, Bethesda, MD 20892-0036, USA FEATURES Location/Qualifiers source 1..2421 /organism="Homo sapiens" /db_xref="taxon:9606" /map="10q11.2" /cell_line="SK-N-SH" CDS 443..2041 /codon_start=1 /product="vesicular acetylcholine transporter" /db_xref="PID:g507744" /translation="MESAEPAGQARAAATKLSEAVGAALQEPRRQRRLVLVIVCVALL LDNMLYMVIVPIVPDYIAHMRGGGEGPTRTPEVWEPTLPLPTPANASAYTANTSASPT AAWPAGSALRPRYPTESEDVKIGVLFASKAILQLLVNPLSGPFIDRMSYDVPLLIGLG VMFASTVLFAFAEDYATLFAARSLQGLGSAFADTSGIAMIADKYPEEPERSRALGVAL AFISFGSLVAPPFGGILYEFAGKRVPFLVLAAVSLFDALLLLAVAKPFSAAARARANL PVGTPIHRLMLDPYIAVVAGALTTCNIPLAFLEPTIATWMKHTMAASEWEMGMAWLPA FVPHVLGVYLTVRLAARYPHLQWLYGALGLAVIGASSCIVPACRSFAPLVVSLCGLCF GIALVDTALLPTLAFLVDVRHVSVYGSVYAIADISYSVAYALGPIVAGHIVHSLGFEQ LSLGMGLANLLYAPVLLLLRNVGLLTRSRSERDVLLDEPPQGLYDAVRLRERPVSGQD GEPRSPPGPFDECEDDYNYYYTRS" misc_feature 1631..2421 /note="96% identity with bases 1-785 of GenBank Accession Number M96015" /citation=[2] polyA_signal 2400..2405 polyA_site 2421 BASE COUNT 344 a 871 c 755 g 451 t ORIGIN 1 ggaggaggca agagccgacg cgaggggagg ggagcgcagc ggcggggcta acgggcgggc 61 aagcgggcgg gcggcaacag catgtccctc ggccagcgcg ggcggcctct tagcgcggcg 121 ggggctgctc tgggcgcgcc ccgggcgaag tgcgcccagt ctccggcccc ggcccctcgg 181 cgcgcccgac ttcccggccg cccctgagcc cagcagccgc gggtcccggg atcggctaag 241 agtagctgca acgcctcgcc ggacggagtc ctttcctttc ccgggacgct gggccatgag 301 ctccgcggcc cacctgaggc acaggggagt ctgctcggcc aggacagcct ccccgaagtc 361 ccgctgccct cgcctctgca ctgcgggacg ccagcgctcg gccctggcgg aggcgtcttc 421 ggaagagcat cggggtgggg gcatggaatc cgcggaacct gcgggccagg cccgggcggc 481 ggccaccaag ctgtcggagg ctgtgggcgc ggcgctgcag gagccccggc ggcagaggcg 541 cctggtgctt gttatcgtgt gcgtggcgct gttactggac aacatgctgt acatggtcat 601 cgtgcccata gtgcccgact acatcgccca catgcgcggg ggcggcgagg gccccacccg 661 gactcccgag gtgtgggagc ccaccctgcc gctgcccact ccggccaatg ccagcgccta 721 cacggccaac acctcggcgt ccccgacagc tgcgtggcca gcgggctcag cccttcggcc 781 ccgctaccct acggagagcg aagacgtgaa gatcggggtg ctgtttgctt ccaaggctat 841 cctgcagctg ctagtgaacc ccttgagcgg gcccttcatc gaccgcatga gctacgacgt 901 gccgctgctg atcggcctgg gcgtcatgtt cgcctctaca gtcctgttcg ccttcgccga 961 ggactacgcc acgctgttcg cggcgcgcag cctgcagggc ctgggctcag ccttcgccga 1021 cacgtctggc atagccatga tcgccgataa gtacccggag gagccggagc gcagtcgtgc 1081 actgggcgtg gcgctggcct tcattagctt cggaagccta gtggccccgc ccttcggggg 1141 catcctctat gagttcgccg gcaagcgcgt gcccttcttg gtgctagctg ccgtgtcgct 1201 ctttgacgcg ctgttgctgc tggcagtggc caaacccttc tcggcggctg cacgggctcg 1261 ggccaacctg ccagtgggca ctcccatcca ccgcctcatg ctagacccct acattgccgt 1321 ggtggccggc gcgctcacca cctgtaacat tcccctcgcc ttcctcgaac ccaccattgc 1381 cacgtggatg aagcatacga tggcggcttc cgagtgggag atgggcatgg cctggctgcc 1441 ggccttcgtg cctcatgtgc tgggcgtcta cctcaccgtg cgcctggcgg cgcgctaccc 1501 acacctgcag tggctgtacg gcgcgcttgg gctggctgtg atcggcgcca gctcgtgcat 1561 cgtgcccgcc tgccgctcct tcgcgccgct agtggtctca ctatgcggcc tctgttttgg 1621 catagcccta gtcgacacag cactgctgcc cacgctcgcc ttcctggtgg acgtgcgcca 1681 tgtctcagtc tatggcagcg tctacgccat cgccgacatc tcctattcgg tggcctacgc 1741 gctcgggccc atagtggcag gccacattgt gcactcgctg ggctttgagc agctcagcct 1801 tggcatggga ctggccaacc tgctctatgc tcccgtcttg ctgctgctcc gcaacgtggg 1861 cctcctgacg cgctcccgtt ccgagcgcga tgtgctgctt gatgagccac cgcaaggtct 1921 gtacgatgcg gtgcgcctgc gtgagcgtcc tgtgtctggc caggacggcg agcctcgcag 1981 cccgcctggc ccttttgatg agtgcgagga cgactacaac tactactaca cccgcagcta 2041 gcatccccac tcctcctcca gcccacccaa ccgccttggg tcaagggggc tgctctgcaa 2101 gcccactggc cagctctggc tcagggccca cctcctccag cgagtacccc agccactcct 2161 caaccttgac ttctgcccaa atcccctccc tgtgacccgt tccatatccc tttctctctt 2221 gtccaatggg gcttggagca ccgaggccag cgaagccatc gcgctccttg cggaggtgaa 2281 gaggaccctg agtccccacc tgcggctccc ctgtgtagag cctgcatctg tctgtccttc 2341 cttccattgc tcccagtgcc aaacttgggc cgctgcaccg cggcgcctcc gcccaaatca 2401 ataaactgtg tctgtcccag g // LOCUS HSU09215 1119 bp mRNA PRI 08-JUN-1994 DEFINITION Human PM-Scl-75 autoantigen (PM-sc1) mRNA, complete cds. ACCESSION U09215 NID g497642 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1119) AUTHORS Stahnke,G. and Haubruck,H. TITLE Nucleotide sequence of an alternatively spliced cDNA coding for PM-Scl-75, an autoantigen of the Polymyositis/Scleroderma Overlap Syndrome JOURNAL Unpublished REFERENCE 2 (bases 1 to 1119) AUTHORS Stahnke,G. TITLE Direct Submission JOURNAL Submitted (26-APR-1994) Gisela Stahnke, Department of Molecular Biology, ELIAS, Obere, Hardtstr. 18, Freiburg 79114, Germany FEATURES Location/Qualifiers source 1..1119 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..1119 /gene="PM-sc1" CDS 1..1119 /gene="PM-sc1" /note="75 kDa; alternatively spliced form of GenBank Accession Number M58460" /codon_start=1 /product="PM-Scl-75 autoantigen" /db_xref="PID:g497643" /translation="MAAPAFEPGRQSDLLVKLNRLMERCLRNSKCIDTESLCVVAGEK VWQIRVDLHLLNHDGNIIDAASIAAIVALCHFRRPDVSVQGDEVTLYTPEERDPVPLS IHHMPICVSFAFFQQGTYLLVDPNEREERVMDGLLVIAMNKHREICTIQSSGGIMLLK DQVLRCSKIAGVKVAEITELILKALENDQKVRKEGGKFGFAESIANQRITAFKMEKAP IDTSDVEEKAEEIIAEAEPPSEVVSTPVLWTPGTAQIGEGVENSWGDLEDSEKEDDEG GGDQAIILDGIKMDTGVEVSDIGSQELGFHHVGQTGLEFLTSDAPIILSDSEEEEMII LEPDKNPKKIRTQTTSAKQEKAPSKKPVKRRKKKRAAN" BASE COUNT 374 a 199 c 271 g 275 t ORIGIN 1 atggccgctc cagctttcga acctggcagg cagtcagatc tcttggtgaa gttgaatcga 61 ctcatggaaa gatgtctaag aaattcgaag tgtatagaca ctgagtctct ctgtgttgtt 121 gctggtgaaa aggtttggca aatacgtgta gacctacatt tattaaatca tgatggaaat 181 attattgatg ctgccagcat tgctgcaatc gtggccttat gtcatttccg aagacctgat 241 gtctctgtcc aaggagatga agtaacactg tatacacctg aagagcgtga tcctgtacca 301 ttaagtatcc accacatgcc catttgtgtc agttttgcct ttttccagca aggaacatat 361 ttattggtgg atcccaatga acgagaagaa cgtgtgatgg atggcttgct ggtgattgcc 421 atgaacaaac atcgagagat ttgtactatc cagtccagtg gtgggataat gctactaaaa 481 gatcaagttc tgagatgcag taaaatcgct ggtgtgaaag tagcagaaat tacagagcta 541 atattgaaag ctttggagaa tgaccaaaaa gtaaggaaag aaggtggaaa gtttggtttt 601 gcagagtcta tagcaaatca aaggatcaca gcatttaaaa tggaaaaggc ccctattgat 661 acctcggatg tagaagaaaa agcagaagaa atcattgctg aagcagaacc tccttcagaa 721 gttgtttcta cacctgtgct atggactcct ggaactgccc aaattggaga gggagtagaa 781 aactcctggg gtgatcttga agactctgag aaggaagatg atgaaggcgg tggtgatcaa 841 gctatcattc ttgatggtat aaaaatggac actggagtag aagtctctga tattggaagc 901 caagagctgg ggtttcacca tgttggccag actggactcg agttcctgac ctcagatgct 961 cccataatac tctcagatag tgaagaagaa gaaatgatca ttttggaacc agacaagaat 1021 ccaaagaaaa taagaacaca gaccaccagt gcaaaacaag aaaaagcacc aagtaaaaag 1081 ccagtgaaaa gaagaaaaaa gaagagagct gccaattaa // LOCUS HSU09284 1234 bp mRNA PRI 27-JUL-1994 DEFINITION Human PINCH protein mRNA, complete cds. ACCESSION U09284 NID g516011 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 120 to 1061) AUTHORS Rearden,A. TITLE A new LIM protein containing an autoepitope homologous to senescent cell antigen JOURNAL Biochem. Biophys. Res. Comm. 201, 1124-1131 (1994) REFERENCE 2 (bases 1 to 1234) AUTHORS Rearden,A. TITLE Direct Submission JOURNAL Submitted (29-APR-1994) Ann Rearden, Pathology, University of California San Diego, 9500 Gilman Drive, La Jolla, CA 92093-0612, USA FEATURES Location/Qualifiers source 1..1234 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="clone cPINCH 1" /clone_lib="human fetal liver library in the lgt11 expression vector (Clontech Laboratories)" CDS 120..1064 /codon_start=1 /product="PINCH protein" /db_xref="PID:g516012" /translation="MANALASATCERCKGGFAPAEKIVNSNGELYHEQCFVCAQCFQQ FPEGLFYEFEGRKYCEHDFQMLFAPCCHQCGEFIIGRVIKAMNNSWHPECFRCDLCQE VLADIGFVKNAGRHLCRPCHNREKARGLGKYICQKCHAIIDEQPLIFKNDPYHPDHFN CANCGKELTADARELKGELYCLPCHDKMGVPICGACRRPIEGRVVNAMGKQWHVEHFV CAKCEKPFLGHRHYERKGLAYCETHYNQLFGDVCFHCNRVIEGDVVSALNKAWCVNCF ACSTCNTKLTLKNKFVEFDMKPVCKKCYEISIGAEEKT" polyA_site 1231 BASE COUNT 333 a 266 c 311 g 324 t ORIGIN 1 tagttcaaga caacagagac aaagctaaga tgaggaagtt ctgtacagtt taggaaatag 61 aggctttcaa agataattcg cagtgatgtg aaactggcct cccaagccct gataacaaca 121 tggccaacgc cctggccagc gccacttgcg agcgctgcaa gggcggcttt gcgcccgctg 181 agaagatcgt gaacagtaat ggggagctgt accatgagca gtgtttcgtg tgcgctcagt 241 gcttccagca gttcccagaa ggactcttct atgagtttga aggaagaaag tactgtgaac 301 atgactttca gatgctcttt gccccttgct gtcatcagtg tggtgaattc atcattggcc 361 gagttatcaa agccatgaat aacagctggc atccggagtg cttccgctgt gacctctgcc 421 aggaagttct ggcagatatc gggtttgtca agaatgctgg gagacacctg tgtcgcccct 481 gtcataatcg tgagaaagcc agaggccttg ggaaatacat ctgccagaaa tgccatgcta 541 tcatcgatga gcagcctctg atattcaaga acgaccccta ccatccagac catttcaact 601 gcgccaactg cgggaaggag ctgactgccg atgcacggga gctgaaaggg gagctatact 661 gcctcccatg ccatgataaa atgggggtcc ccatctgtgg tgcttgccga cggcccatcg 721 aagggcgcgt ggtgaacgct atgggcaagc agtggcatgt ggagcatttt gtttgtgcca 781 agtgtgagaa accctttctt ggacatcgcc attatgagag gaaaggcctg gcatattgtg 841 aaactcacta taaccagcta tttggtgatg tttgcttcca ctgcaatcgt gttatagaag 901 gtgatgtggt ctctgctctt aataaggcct ggtgcgtgaa ctgctttgcc tgttctacct 961 gcaacactaa attaacactc aagaataagt ttgtggagtt tgacatgaag ccagtctgta 1021 agaagtgcta tgagatttcc attggagctg aagaaaagac ttaagaaact agctgagacc 1081 ttaggaagga aataagttcc tttatttttt cttttctatg caagataaga gattaccaac 1141 attacttgtc ttgatctacc catatttaaa gctatatctc aaagcagttg agagaagagg 1201 acctatatga atggttttat gtcatttttt taaa // LOCUS HSU09304 2077 bp mRNA PRI 11-NOV-1994 DEFINITION Human placenta LERK-2 (EPLG2) mRNA, complete cds. ACCESSION U09304 NID g538366 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2077) AUTHORS Beckmann,M.P., Cerretti,D.P., Baum,P., VandenBos,T., James,L., Farrah,T., Kozlosky,C., Hollingsworth,T., Shilling,H., Maraskovsky,E., Fletcher,F.A., Lhotak,V., Pawson,T. and Lyman,S.D. TITLE Molecular characterization of a family of ligands for eph-related tyrosine kinase receptors JOURNAL EMBO J. 13 (16), 3757-3762 (1994) MEDLINE 94349923 REFERENCE 2 (bases 1 to 2077) AUTHORS Fletcher,F.A. TITLE Direct Submission JOURNAL Submitted (29-APR-1994) Frederick A. Fletcher, Dept. Molecular Immunology, Immunex Research and Development Corp., 51 University St., Seattle, WA 98101, USA FEATURES Location/Qualifiers source 1..2077 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="pDC302 mammalian expression vector" /chromosome="X" /map="Xq12" /tissue_type="placenta" /dev_stage="fetus" sig_peptide 308..379 /gene="EPLG2" gene 308..1348 /gene="EPLG2" CDS 308..1348 /gene="EPLG2" /standard_name="ligand for EPH related kinase" /note="EPH ligand" /codon_start=1 /function="tyrosine kinase receptor" /product="LERK-2" /db_xref="PID:g538367" /translation="MARPGQRWLGKWLVAMVVWALCRLATPLAKNLEPVSWSSLNPKF LSGKGLVIYPKIGDKLDIICPRAEAGRPYEYYKLYLVRPEQAAACSTVLDPNVLVTCN RPEQEIRFTIKFQEFSPNYMGLEFKKHHDYYITSTSNGSLEGLENREGGVCRTRTMKI IMKVGQDPNAVTPEQLTTSRPSKEADNTVKMATQAPGSRGSLGDSDGKHETVNQEEKS GPGASGGSSGDPDGFFNSKVALFAAVGAGCVIFLLIIIFLTVLLLKLRKRHRKHTQQR AAALSLSTLASPKGGSGTAGTEPSDIIIPLRTTENNYCPHYEKVSGDYGHPVYIVQEM PPQSPANIYYKV" misc_feature 380..1018 /gene="EPLG2" /note="encodes extracellular domain" misc_feature 722..730 /gene="EPLG2" /note="encodes an N-linked site" misc_feature 1019..1102 /gene="EPLG2" /note="encodes transmembrane domain" misc_feature 1103..1345 /gene="EPLG2" /note="encodes cytoplasmic domain" repeat_region 2050..2077 /rpt_unit=CA polyA_site 2077 /note="10 A nucleotides" BASE COUNT 419 a 638 c 624 g 396 t ORIGIN 1 tcgggcggga tcacccgggg gcgcagagcc cccgtcgcgc ctcgtgcggc agcggagagc 61 ccaggagaac gagccctcgg gggccgaagc ccatgcccgg gttgggggcg gctgcccagt 121 gagtcctcct ggccggccgg gcggagaaga gcgacaccga agccggcggg aggggagcac 181 ttcaaggccg gcggctgcgg aggatgggcg cctgagcggc tccgagcgca gcgcggcaga 241 ggaaggcgag gcgagctttg gtgaggaggc gccaagggat cccgaagtgc agtctgcccc 301 cgggaagatg gctcggcctg ggcagcgttg gctcggcaag tggcttgtgg cgatggtcgt 361 gtgggcgctg tgccggctcg ccacaccgct ggccaagaac ctggagcccg tatcctggag 421 ctccctcaac cccaagttcc tgagtgggaa gggcttggtg atctatccga aaattggaga 481 caagctggac atcatctgcc cccgagcaga agcagggcgg ccctatgagt actacaagct 541 gtacctggtg cggcctgagc aggcagctgc ctgtagcaca gttctcgacc ccaacgtgtt 601 ggtcacctgc aataggccag agcaggaaat acgctttacc atcaagttcc aggagttcag 661 ccccaactac atgggcctgg agttcaagaa gcaccatgat tactacatta cctcaacatc 721 caatggaagc ctggaggggc tggaaaaccg ggagggcggt gtgtgccgca cacgcaccat 781 gaagatcatc atgaaggttg ggcaagatcc caatgctgtg acgcctgagc agctgactac 841 cagcaggccc agcaaggagg cagacaacac tgtcaagatg gccacacagg cccctggtag 901 tcggggctcc ctgggtgact ctgatggcaa gcatgagact gtgaaccagg aagagaagag 961 tggcccaggt gcaagtgggg gcagcagcgg ggaccctgat ggcttcttca actccaaggt 1021 ggcattgttc gcggctgtcg gtgccggttg cgtcatcttc ctgctcatca tcatcttcct 1081 gacggtccta ctactgaagc tacgcaagcg gcaccgcaag cacacacagc agcgggcggc 1141 tgccctctcg ctcagtaccc tggccagtcc caaggggggc agtggcacag cgggcaccga 1201 gcccagcgac atcatcattc ccttacggac tacagagaac aactactgcc cccactatga 1261 gaaggtgagt ggggactacg ggcaccctgt ctacatcgtc caagagatgc cgccccagag 1321 cccggcgaac atctactaca aggtctgagt gcccggcacg gcctcaggcc ccagggacag 1381 tcggcctgga ccggacctct cctttcgccc ccacaccccc tccccttgcc agctgtgccc 1441 acctttgtat ttagttttgt agtttcttgg cttttataat cccccttttt ccctgccccc 1501 tgggcttcgg aggggggtgc ttgtgcccct aacccccatg ctcttgtgcc ttccccctct 1561 ggccaggcct ctgggctccg tgggggcgcc ccttcttgga aggcagggct ggacactgat 1621 ggacagcagg cagggagaca gtcccctggt cctgcccctc cctcgccccc cttgccacct 1681 tcccaggact gcttgtccgc tatcatcact gtttttaatg cttttgtgtt cattttttag 1741 ctgtcaactc attttcatct gttttttgaa gaaaaatgga aaaatgtaaa aggcagcccc 1801 tccccaggct ttgtgagcct ggcccaagcc agtacaagag ggcctggggc acgatgtggt 1861 cagccaggaa gcataggatg ccatttcttt tatagattcc ttggtatttc tggtggggta 1921 aggggcaggc cagggctgtt cacgcccatg agggaagagg aaagtgccac tgggcaaggt 1981 gtcccaccct cccctcctga ccctcctacg aggcttatcc tggcaatggg gtagtcactg 2041 ccacccttcc acacacacac acacacacac acacaca // LOCUS HSU09366 2643 bp mRNA PRI 09-NOV-1995 DEFINITION Human zinc finger protein ZNF133. ACCESSION U09366 NID g487782 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2643) AUTHORS Vissing,H., Meyer,W.K., Aagaard,L., Tommerup,N. and Thiesen,H.J. TITLE Repression of transcriptional activity by heterologous KRAB domains present in zinc finger proteins JOURNAL FEBS Lett. 369 (2-3), 153-157 (1995) MEDLINE 95377390 REFERENCE 2 (bases 1 to 2643) AUTHORS Tommerup,N. and Vissing,H. TITLE Isolation and fine mapping of 16 novel human zinc finger-encoding cDNAs identify putative candidate genes for developmental and malignant disorders JOURNAL Genomics 27 (2), 259-264 (1995) MEDLINE 96044430 REFERENCE 3 (bases 1 to 2643) AUTHORS Vissing,H. TITLE Direct Submission JOURNAL Submitted (04-MAY-1994) Henrik Vissing, Bioscience, Molecular Biology 6B2.107, Novo Nordisk, Novo Alle, Bagsvaerd, DK-2880, Denmark FEATURES Location/Qualifiers source 1..2643 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="ZNF133, pHZ-20" /clone_lib="cDNA" /chromosome="20" /map="20p11.2" /tissue_type="insulinoma" misc_feature 446..671 /note="KRAB domain" /function="transcriptional repressor" CDS 446..2410 /codon_start=1 /product="zinc finger protein ZNF133" /db_xref="PID:g487783" /translation="MAFRDVAVDFTQDEWRLLSPAQRTLYREVMLENYSNLVSLGISF SKPELITQLEQGKETWREEKKCSPATCPADPEPELYLDPFCPPGFSSQKFPMQHVLCN HPPWIFTCLCAEGNIQPGDPGPGDQEKQQQASEGRPWSDQAEGPEGEGAMPLFGRTKK RTLGAFSRPPQRQPVSSRNGLRGVELEASPAQTGNPEETDKLLKRIEVLGFGTVNCGE CGLSFSKMTNLLSHQRIHSGEKPYVCGVCEKGFSLKKSLARHQKAHSGEKPIVCRECG RGFNRKSTLIIHERTHSGEKPYMCSECGRGFSQKSNLIIHQRTHSGEKPYVCRECGKG FSQKSAVVRHQRTHLEEKTIVCSDCGLGFSDRSNLISHQRTHSGEKPYACKECGRCFR QRTTLVNHQRTHSKEKPYVCGVCGHSFSQNSTLISHRRTHTGEKPYVCGVCGRGFSLK SHLNRHQNIHSGEKPIVCKDCGRGFSQQSNLIRHQRTHSGEKPMVCGECGRGFSQKSN LVAHQRTHSGERPYVCRECGRGFSHQAGLIRHKRKHSREKPYMCRQCGLGFGNKSALI THKRAHSEEKPCVCRECGQGFLQKSHLTLHQMTHTGEKPYVCKTCGRGFSLKSHLSRH RKTTSVHHRLPVQPDPEPCAGQPSDSLYSL" BASE COUNT 713 a 661 c 735 g 534 t ORIGIN 1 ttttttcctg agacagagtc ttgctctgtt gcccaggctg gagtgcaatg gcacaatctc 61 aactcactgc aacctccacc ccctgagttc gagcaattct cctgccttgg ctcccgagta 121 gctgggatta caggcgcctc accatgcctg gctaattttt gtatttttag tagagacgga 181 gtttcagcat ttgcaggtga ttttgaactg ctgacctcag gtgatccacc tgcctcagcc 241 tcccaaaatg ctgggattac aggcatgagc cactgcgccc agccgagagt tttaaagaag 301 gtgaaagctc ttcccttttc ctaacctagc ctggccttga ttccgctgat ctaccttctg 361 agatatcatc cttcttcagg gagataagga aaaaaagcca cagggtcccg gagagccagg 421 ggaatgtttt gtgttacagg cacacatggc attcagggat gtggctgtgg atttcaccca 481 ggatgagtgg aggctgctga gccctgctca aaggactctg tacagagagg tgatgctgga 541 gaactacagc aacctggtct cactgggaat ttcattttct aaaccagaac tcatcaccca 601 gctggagcaa gggaaagaga cctggagaga ggaaaaaaaa tgttcaccgg caacctgtcc 661 agcagatcca gagccagagc tctacctcga tcctttctgc cctccgggtt tctccagtca 721 gaaattcccc atgcagcatg tgctgtgtaa tcatcccccc tggatcttca catgcttgtg 781 tgcagaaggt aacatccagc ctggggatcc aggcccaggg gaccaggaga agcagcaaca 841 agcctctgag gggagaccct ggagtgatca agcagaaggt cctgagggag aaggtgccat 901 gcctttgttt ggaagaacca agaaaaggac tctgggagcg ttctccaggc caccccagag 961 gcagccagtc agctctcgga acggcctcag aggggtggag ttagaagcca gcccagctca 1021 gacagggaac cctgaggaaa cagacaaatt gttgaagagg atagaagtct taggatttgg 1081 aacagtcaac tgtggagagt gtggactgag cttcagcaag atgacaaacc tgctcagtca 1141 ccagcggata cactcagggg agaagcccta cgtgtgtggg gtatgtgaga agggcttcag 1201 cctaaagaag agcctcgcca gacaccagaa ggcacactcg ggggagaagc caattgtgtg 1261 cagggagtgt ggacgaggct ttaaccggaa gtcaacgcta atcatacacg aacggacaca 1321 ctccggtgag aaaccttaca tgtgcagtga gtgtgggcga ggcttcagcc agaagtcaaa 1381 cctcatcata caccagagga cacactcagg ggaaaagcct tatgtgtgcc gggaatgtgg 1441 caaaggcttc agccagaagt cagctgtcgt gagacaccag aggacacact tggaggagaa 1501 gaccatcgtg tgcagtgact gtggcctggg cttcagcgac aggtcaaacc tcatctccca 1561 ccagaggacg cactctgggg agaagcccta cgcctgcaag gagtgtgggc gatgcttcag 1621 gcagaggacc acccttgtca accaccagag gacacactca aaggagaagc cctatgtgtg 1681 cggggtgtgt gggcacagct tcagccagaa ttcaaccctc atctctcaca ggcggacaca 1741 cactggggag aagccgtatg tttgtggggt gtgtgggcga ggctttagtc tcaagtcaca 1801 cctcaacaga caccagaaca tacactcagg agagaagccc attgtgtgca aggactgtgg 1861 ccggggcttc agccagcaat ccaacctcat cagacaccag aggacgcact caggcgagaa 1921 gcccatggtg tgtggggagt gcgggcgagg cttcagccag aagtcaaacc ttgttgcaca 1981 ccagaggacg cactcagggg agaggccgta tgtgtgccga gagtgcgggc gaggctttag 2041 ccaccaggcc ggtctcatca ggcacaagcg gaagcactcg agggagaagc cctacatgtg 2101 caggcagtgt ggactgggct ttggcaataa gtcagctcta atcacacaca agcgggctca 2161 ctcggaagag aagccttgtg tgtgcagaga gtgtggccaa ggctttctcc aaaagtcaca 2221 cctcacctta catcaaatga cacatacggg ggagaagcca tatgtgtgca agacgtgtgg 2281 gcggggcttc agcctcaagt ctcacctcag cagacacagg aagaccacgt ctgtccacca 2341 cagactgcca gtgcagcccg accctgagcc gtgtgcaggg caaccttcgg attccttata 2401 ctctctctga aggcaaagat ggggacaagg actaagagtc agaatgttga cactttgatg 2461 aaatggagta gagaaatgca ttctgtaagt ggtcaaagga catttgactg tttactttct 2521 ccacactaag tcttctccat gttttgtggc ttcggttgta ataaacttgg cttctttata 2581 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaacttta gagcacactg gggtaccgga 2641 tcc // LOCUS HSU09368 2407 bp mRNA PRI 09-NOV-1995 DEFINITION Human zinc finger protein ZNF140. ACCESSION U09368 NID g487786 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2407) AUTHORS Vissing,H., Meyer,W.K., Aagaard,L., Tommerup,N. and Thiesen,H.J. TITLE Repression of transcriptional activity by heterologous KRAB domains present in zinc finger proteins JOURNAL FEBS Lett. 369 (2-3), 153-157 (1995) MEDLINE 95377390 REFERENCE 2 (bases 1 to 2407) AUTHORS Tommerup,N. and Vissing,H. TITLE Isolation and fine mapping of 16 novel human zinc finger-encoding cDNAs identify putative candidate genes for developmental and malignant disorders JOURNAL Genomics 27 (2), 259-264 (1995) MEDLINE 96044430 REFERENCE 3 (bases 1 to 2407) AUTHORS Vissing,H. TITLE Direct Submission JOURNAL Submitted (04-MAY-1994) Henrik Vissing, Bioscience, Molecular Biology 6B2.107, Novo Nordisk, Novo Alle, Bagsvaerd, DK-2880, Denmark FEATURES Location/Qualifiers source 1..2407 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="ZNF140, pHZ-39" /clone_lib="cDNA library" /chromosome="12" /map="12q34.12" /tissue_type="insulinoma" misc_feature 273..498 /note="KRAB domain" /function="transcriptional repressor" CDS 273..1646 /codon_start=1 /product="zinc finger protein ZNF140" /db_xref="PID:g487787" /translation="MSQGSVTFRDVAIDFSQEEWKWLQPAQRDLYRCVMLENYGHLVS LGLSISKPDVVSLLEQGKEPWLGKREVKRDLFSVSESSGEIKDFSPKNVIYDDSSQYL IMERILSQGPVYSSFKGGWKCKDHTEMLQENQGCIRKVTVSHQEALAQHMNISTVERP YGCHECGKTFGRRFSLVLHQRTHTGEKPYACKECGKTFSQISNLVKHQMIHTGKKPHE CKDCNKTFSYLSFLIEHQRTHTGEKPYECTECGKAFSRASNLTRHQRIHIGKKQYICR KCGKAFSSGSELIRHQITHTGEKPYECIECGKAFRRFSHLTRHQSIHTTKTPYECNEC RKALRCHSFLIKHQRIHAGEKLYECDECGKVFTWHASLIQHTKSHTGEKPYACAECDK AFSRSFSLILHQRTHTGEKPYVCKVCNKSFSWSSNLAKHQRTHTLDNPYEYENSFNYH SFLTEHQ" BASE COUNT 753 a 484 c 533 g 637 t ORIGIN 1 ctaaaggtcg gcgaggcttc tgaagacgca attcctgcga cgcccgcgga ggggccctgg 61 ggggcggcgc gagcgtctgg cctgtgttgg ctgtaggcaa cgaaaggagc cctcccggtc 121 tgcgccggat ggccccgggc ggtgactcgg tccggagccc tggaacgcta cgcccacctg 181 gcggaaagca ccacggaaac gcatccttct gtggccactg ttaggtctgc cattttacac 241 ttttctgatc tcctccttcc cttctgtgag ctatgtctca ggggtcagtg acattcagag 301 atgtggccat agacttctcc caggaggagt ggaaatggct tcagcctgct caaagagatt 361 tgtacagatg tgtaatgttg gagaactatg gccatctggt ctcactgggt ctttccattt 421 ctaagccaga tgtggtttcc ttattggagc aagggaaaga accctggctg gggaaaaggg 481 aagtgaaaag agatctgttt tcagtttcag agtcaagtgg tgagatcaaa gacttttcac 541 caaaaaatgt catttatgat gactcatccc agtatttgat catggaaaga attctaagtc 601 aaggccctgt gtattccagt tttaaaggag gctggaaatg caaggatcat actgagatgc 661 tgcaagaaaa tcagggatgt attaggaaag taacagtctc tcatcaagaa gccctggctc 721 aacatatgaa tatcagtact gtggagaggc cctatggatg ccatgaatgt ggaaaaactt 781 ttggtcgacg cttttccctg gtgttacacc agaggactca tactggagag aaaccatatg 841 catgtaagga atgtggcaaa acctttagcc agatttcaaa ccttgtgaaa caccaaatga 901 tacatactgg aaagaaaccc catgagtgta aggactgtaa taaaacattc agttaccttt 961 catttcttat tgaacaccag agaacgcaca ctggggagaa accttatgaa tgtactgagt 1021 gtggaaaggc ctttagccgt gcctccaacc tcactcgaca tcaaagaatt cacataggaa 1081 agaaacaata tatatgtagg aaatgtggta aagcatttag cagtggctca gaactcattc 1141 gccaccagat tacacatact ggagagaaac cttatgaatg cattgaatgt gggaaggcat 1201 ttcgccgttt ctcacacctt actcgacatc agagcatcca tacaaccaaa accccgtatg 1261 aatgtaatga atgtaggaaa gctttgcgtt gtcactcatt ccttattaaa catcagagaa 1321 ttcatgctgg agaaaagctc tatgaatgtg atgaatgtgg taaagttttc acttggcatg 1381 catcccttat tcaacatacg aagagtcaca ctggagagaa accctatgcg tgtgctgaat 1441 gtgataaagc cttcagccgg agcttttccc tcattctaca tcagagaact catactggag 1501 agaaacccta tgtatgtaag gtatgcaaca aatccttcag ctggagctca aaccttgcta 1561 aacatcagag gacacacact cttgacaacc cctatgaata tgaaaattca tttaattacc 1621 actcattcct tactgaacac cagtgaattt acactgcaaa gaaaaactat gaatgtatgg 1681 aattttttaa aaagaagtat aatgccttac ttcagagaac tcttggaaag aagccttatg 1741 tgaaagtgat gactgtgaag taatatggcc cacactttat tcaccaccct ggagaaaaaa 1801 aaacccagga atatgtggaa aagccattaa taaccactct tttatttttt tgcaataaca 1861 aggtgaaatc aatattgttg agaagattct tccatctggt aatgttgaga agacttcatt 1921 tggtaggagt cccttacttt acgtgtgtaa attcctacca ggaaagaata catatccaat 1981 agattggaga aagccagaga ttagccccgc attccgcatc tgtcaaccag gacagaaagc 2041 atggacaagg gatgagcttt acaaagatga tgcactttgg agatcagaaa attcatattt 2101 aagcaaagtg atacaaacac agtgatttgg gaatgccttc atttacaatg caatacttac 2161 attttaatac tcttgtagga gaaaaagcaa ctgtataaat gaatgtagag tgactttctg 2221 caatatttgc aacctatatc agagaattac actgtgggaa aactaccatt gtaataagtg 2281 tagcaaaatc tccttagata tctgaaaagt catactggat ggaatctgta ggaaacggtt 2341 ctattttgag ggaaggggga ttcctttttg ttttttaagt gaattcagaa aatgttataa 2401 actttag // LOCUS HSU09411 2368 bp mRNA PRI 08-NOV-1995 DEFINITION Human zinc finger protein ZNF132 mRNA, complete cds. ACCESSION U09411 NID g488550 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2368) AUTHORS Tommerup,N. and Vissing,H. TITLE Isolation and fine mapping of 16 novel human zinc finger-encoding cDNAs identify putative candidate genes for developmental and malignant disorders JOURNAL Genomics 27 (2), 259-264 (1995) MEDLINE 96044430 REFERENCE 2 (bases 1 to 2368) AUTHORS Vissing,H. TITLE Direct Submission JOURNAL Submitted (05-MAY-1994) Henrik Vissing, Bioscience, Molecular Biology 6B2.107, Novo Nordisk, Novo Alle, Bagsvaerd DK-2880, Denmark FEATURES Location/Qualifiers source 1..2368 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="ZNF132, pHZ-12" /clone_lib="Human insulinoma cDNA library" /chromosome="19" /map="19q34.4" /tissue_type="insulinoma" CDS 499..2268 /codon_start=1 /product="zinc finger protein ZNF132" /db_xref="PID:g488551" /translation="MCGPFLKDILHLAEHQGTQSEEKPYTCGACGRDFWLNANLHQHQ KEHSGGKPFRWYKDRDALMKSSKVHLSENPFTCREGGKVILGSCDLLQLQAVDSGQKP YSNLGQLPEVCTTQKLFECSNCGKAFLKSSTLPNHLRTHSEEIPFTCPTGGNFLEEKS ILGNKKFHTGEIPHVCKECGKAFSHSSKLRKHQKFHTEVKYYECIACGKTFNHKLTFV HHQRIHSGERPYECDECGKAFSNRSHLIRHEKVHTGERPFECLKCGRAFSQSSNFLRH QKVHTQVRPYECSQCGKSFSRSSALIQHWRVHTGERPYECSECGRAFNNNSNLAQHQK VHTGERPFECSECGRDFSQSSHLLRHQKVHTGERPFECCDCGKAFSNSSTLIQHQKVH TGQRPYECSECRKSFSRSSSLIQHWRIHTGEKPYECSECGKAFAHSSTLIEHWRVHTK ERPYECNECGKFFSQNSILIKHQKVHTGEKPYKCSECGKFFSRKSSLICHWRVHTGER PYECSECGRAFSSNSHLVRHQRVHTQERPYECIQCGKAFSERSTLVRHQKVHTRERTY ECSQCGKLFSHLCNLAQHKKIHT" BASE COUNT 678 a 538 c 544 g 608 t ORIGIN 1 ctaaagctag tggatgtgaa gtggtatctc attatggttt tggttttcat actcctcatg 61 tttaaggatg ctgaacttct tttcatatgc ttattggcca tttgtgtata tatcttcttt 121 tagagaaatg tctatttaag tcctttgacc catttctgtg tccttacccc tggtgaggtc 181 tcccttattc tgttgcttgg ctggtcccta tcctgccaat agtaatgggc ccttcttcac 241 cctgatgatg gccctgttgg cctgtcagca atccctggga cctcttcttg ggtgtgaatt 301 cctgggtaac atttctaatg aagtcaacca ttcccaccaa gtggaattct tagttaactg 361 gcatttctct actttcaggt tcttggcaat ggagtagagg gtgagggggc ccatcccaag 421 cagaatgttt ctgtagaagt gttacaggtc aggatcccta atgcagatcc ttccaccaag 481 aaagctaact cctgtgacat gtgtgggcca ttcttgaaag acattttgca cctggctgag 541 catcagggaa cacagtctga ggagaaaccc tacacatgtg gagcatgtgg gagagacttt 601 tggttgaatg caaaccttca ccagcaccag aaggagcaca gtggagggaa gccctttaga 661 tggtacaagg acagggacgc acttatgaag agctctaaag tccacctgtc agagaacccc 721 ttcacttgca gggaaggtgg gaaggtcatc ctgggcagct gtgacctcct ccagcttcaa 781 gctgttgaca gtgggcagaa gccatattcc aatcttgggc agcttccaga agtctgtacc 841 acacagaaac tcttcgagtg cagcaactgt ggaaaagcct tcctgaagag ctccactctc 901 cccaaccatc tgagaactca ctctgaagag ataccattta catgcccaac aggtggaaat 961 ttcttagagg agaaatcaat ccttggtaat aaaaagtttc acactgggga aataccccat 1021 gtgtgtaagg agtgtgggaa ggcctttagt cactcatcta agctgaggaa gcaccagaaa 1081 tttcacactg aagtaaaata ttatgagtgc attgcatgtg ggaaaacctt caaccacaaa 1141 ctcacatttg ttcatcatca gagaattcac tcaggtgaaa gaccttatga gtgtgatgaa 1201 tgtgggaaag ccttcagtaa cagatcacac ctcattcggc atgagaaagt tcacactgga 1261 gaaaggcctt ttgagtgcct gaaatgtgga agagccttca gccaaagctc caatttcctt 1321 cggcatcaga aagttcacac acaggtaaga ccttatgagt gcagtcaatg tggtaaatcc 1381 ttcagccgaa gctctgctct cattcagcac tggagagttc acactggaga aagaccgtat 1441 gaatgcagtg aatgtggaag agcttttaac aataactcca accttgctca gcaccagaaa 1501 gttcacaccg gagaacggcc ttttgagtgc agtgaatgtg gaagagactt cagccaaagc 1561 tcccatctcc ttcgacatca gaaagttcac actggagaac ggccttttga atgctgtgat 1621 tgtggtaaag ccttcagtaa tagctccacc ctcatccagc accagaaagt acatactggg 1681 caaaggcctt atgagtgcag cgaatgtagg aaatccttca gccgcagctc cagcctgatt 1741 cagcactgga gaattcacac tggagaaaag ccttacgagt gtagtgagtg tgggaaagcc 1801 tttgctcaca gctccactct cattgaacac tggagagttc acacaaaaga aaggccttat 1861 gagtgcaatg aatgtgggaa attctttagc caaaactcca ttctcattaa gcatcagaaa 1921 gttcatactg gagaaaagcc ttataaatgc agtgaatgtg ggaaattctt tagccgaaaa 1981 tccagcctta tttgtcactg gagagttcac actggagaaa ggccttacga atgcagtgaa 2041 tgtgggagag cctttagcag taactcccac ctggttcgtc atcagagagt tcacacacaa 2101 gaaaggccct atgagtgcat ccagtgtgga aaagccttta gtgaaagatc tacacttgtt 2161 cggcaccaga aagttcacac cagagaaagg acttatgagt gtagccagtg tgggaaactc 2221 ttcagccatc tttgtaacct tgcacagcat aaaaagattc atacctgagt ggagccttat 2281 ggaagtggtc tttgtgagaa aatcttcagc caagtcaaac ttcatgcagc agaatcccca 2341 taccagaaaa attacctcca tgctttag // LOCUS HSU09413 2432 bp mRNA PRI 08-NOV-1995 DEFINITION Human zinc finger protein ZNF135 mRNA, complete cds. ACCESSION U09413 NID g488554 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2432) AUTHORS Tommerup,N. and Vissing,H. TITLE Isolation and fine mapping of 16 novel human zinc finger-encoding cDNAs identify putative candidate genes for developmental and malignant disorders JOURNAL Genomics 27 (2), 259-264 (1995) MEDLINE 96044430 REFERENCE 2 (bases 1 to 2432) AUTHORS Vissing,H. TITLE Direct Submission JOURNAL Submitted (05-MAY-1994) Henrik Vissing, Bioscience, Molecular Biology 6B2.107, Novo Nordisk, Novo Alle, Bagsvaerd DK-2880, Denmark FEATURES Location/Qualifiers source 1..2432 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="ZNF135, pHZ-17" /clone_lib="Human insulinoma cDNA library" /chromosome="17" /map="19q13.4" /tissue_type="insulinoma" CDS 76..1485 /codon_start=1 /product="zinc finger protein ZNF135" /db_xref="PID:g488555" /translation="MGNTWKKGEARPKCFTENCVKEKPYKCQECGKAFSHSSALIEHH RTHTGERPYECHECLKGFRNSSALTKHQRIHTGEKPYKCTQCGRTFNQIAPLIQHQRT HTGEKPYECSECGKSFSFRSSFSQHERTHTGEKPYECSECGKAFRQSIHLTQHLRIHT GEKPYQCGECGKAFSHSSSLTKHQRIHTGEKPYECHECGKAFTQITPLIQHQRTHTGE KPYECGECGKAFSQSTLLTEHRRIHTGEKPYGCNECGKTFSHSSSLSQHERTHTGEKP YECSQCGKAFRQSTHLTQHQRIHTGEKPYECNDCGKAFSHSSSLTKHQRIHTGEKPYE CNQCGRAFSQLAPLIQHQRIHTGEKPYECNQCGRASARATLLIEHQRIHTKEKPYGCN ECGKSFSHSSSLSQHERTHTGEKPYECHDCGKSFRQSTHLTQHRRIHTGEKPYACRDC GKAFTHSSSLTKHQRTHTG" BASE COUNT 754 a 623 c 543 g 512 t ORIGIN 1 ctaaagggaa aacataagtc tgaaccctga tctcccacat caaccaatga ctcctgaaag 61 acaaagcccc cacacatggg gaacacgtgg aaaaagggag aagccagacc taaatgtttt 121 acagaaaact gtgtaaaaga gaaaccctac aaatgtcagg aatgcggaaa ggcctttagt 181 cacagctcag cacttatcga acaccaccgg acgcacacag gagagagacc ttacgaatgt 241 cacgaatgct taaaaggctt tcggaacagc tcggcactta ccaaacacca gagaatccat 301 actggggaga aaccctataa atgcactcag tgtgggagga ccttcaacca aattgctcca 361 ctgatccagc accagagaac tcacacaggt gagaagccct atgaatgcag cgaatgtggg 421 aaatccttca gttttaggtc ctccttcagc cagcacgagc gaactcacac aggcgagaag 481 ccctacgagt gcagtgagtg tgggaaagcc ttccggcaaa gcatccacct cacccagcat 541 ctgcgaatcc acactgggga gaaaccctat cagtgtggtg agtgtggcaa ggccttcagc 601 cacagctcat ccttgaccaa acaccagcga atccacacag gggagaagcc ctacgagtgc 661 catgagtgtg gaaaagcctt cacccagatc acaccactga ttcagcacca gaggacccac 721 acaggagaaa agccctatga gtgtggtgag tgtgggaaag ccttcagtca gagcacactc 781 ctgaccgagc atcggaggat tcacacagga gagaagccct atggatgcaa cgagtgtggg 841 aaaaccttca gccacagctc ctcactcagc cagcatgagc ggacacacac aggagagaag 901 ccctatgagt gcagtcagtg tgggaaggcc ttccggcaga gcacacacct cacccaacac 961 cagcggatcc acacagggga gaagccctat gaatgcaatg actgcggcaa ggcattcagt 1021 cacagctcgt ccctcaccaa acatcagcga atccacactg gggagaagcc ctacgaatgc 1081 aaccagtgtg gcagagcctt cagccagctt gctcccctca ttcagcatca gaggatccac 1141 acaggagaga aaccctatga atgtaaccag tgtggcagag cttcagccag agctaccctt 1201 ctcatcgaac accagaggat tcacaccaag gaaaagccgt atgggtgcaa tgagtgtggg 1261 aaatccttca gccacagctc ctcgctcagc cagcacgaaa ggacgcacac tggggaaaag 1321 ccctatgagt gtcacgattg cggaaagtcc tttaggcaga gcacccacct cactcagcac 1381 cggaggatcc acacaggaga gaagccatat gcatgcaggg actgtggaaa ggcctttacc 1441 cacagctcct cccttaccaa gcaccagaga actcacactg gataaaccca ctccacatgt 1501 gctgggacat aggaagacct taagccatag ctcatccctt tctagatttg acccaatcat 1561 acacatgaga aacgtacatt catacacaag ccttttcaca cagcactccc ctcagacacc 1621 ctcagagagt tcacactgat gggaaatgac catgggacca ccaagctcta ggtcatccat 1681 ctctgcatcc aaatagtagg gaaacgtgga gataatcaac actcaggacc ttcagccttg 1741 aacgcccatt agtgctatgt tatagaacct acaaaaaaga aatggaacaa atgtagtgga 1801 tccagggaac gcttttgtcc aaggattcac cgtattccaa accagagatg ttcaaattgg 1861 tgagaaaccc aacaaatgcc tttcatatat acgagaccaa atgaagtcag atttgccatt 1921 atgcacatca catttttggg gggaaagtct tatgaatggt cagttgactc tgatattcat 1981 tcccaaatga cagtatggca gagtgttcca gaaatgagag tggcatcttt atggaatcac 2041 tgtggatact gactgtctca gtaaacagct gtcttgtctg tgtgtatatt gtttgatcag 2101 ggtacatggc agccagtcac agattggaat tacatatgac aaagtatcag tgtactataa 2161 acaggttttt agttatccct gcattatttt tgcaattaat ctttatatgc aatgagattg 2221 aaaagctttg tatgggaaga ctcaaaatgt aaagctgctt ccatagagtc tcactgattc 2281 tgagatggct tttgctgctc tgttctccct ctacatttct ctgcagaact cgcgttagaa 2341 acacagatat ttgttttaca aaaagggaga tttttccttt gttaaaccat cgtcttataa 2401 gcaatagcaa attcatgtta gaaaaacttt ag // LOCUS HSU09414 2553 bp mRNA PRI 08-NOV-1995 DEFINITION Human zinc finger protein ZNF137 mRNA, complete cds. ACCESSION U09414 NID g488556 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2553) AUTHORS Tommerup,N. and Vissing,H. TITLE Isolation and fine mapping of 16 novel human zinc finger-encoding cDNAs identify putative candidate genes for developmental and malignant disorders JOURNAL Genomics 27 (2), 259-264 (1995) MEDLINE 96044430 REFERENCE 2 (bases 1 to 2553) AUTHORS Vissing,H. TITLE Direct Submission JOURNAL Submitted (05-MAY-1994) Henrik Vissing, Bioscience, Molecular Biology 6B2.107, Novo Nordisk, Novo Alle, Bagsvaerd DK-2880, Denmark FEATURES Location/Qualifiers source 1..2553 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="ZNF137, pHZ-30" /clone_lib="Human insulinoma cDNA library" /chromosome="19" /map="19q13.4" /tissue_type="insulinoma" CDS 141..764 /codon_start=1 /product="zinc finger protein ZNF137" /db_xref="PID:g488557" /translation="MNVARFLVEKHTLHVIIDFILSKVSNQQSNLAQHQRVYTGEKPY KCNEWGKALSGKSSLFYHQAIHGVGKLCKCNDCHKVFSNATTIANHWRIHNEDRSYKC NKCGKIFRHRSYLAVYQRTHTGEKPYKYHDCGKVFSQASSYAKHRRIHTGEKPHKCDD CGKVLTSRSHLIRHQRIHTGQKSYKCLKCGKVFSLWALHAEHQKIHF" BASE COUNT 765 a 511 c 550 g 727 t ORIGIN 1 ctaaaggcct tgcacaacat cagagagttc atactggaga gaaccttaca catttcacga 61 gtatggaaag acctttgctc aaaattcagc ccttgtaatg cataaggcaa ttcatactgg 121 aaagaaacct tacacatgta atgaatgtgg caaggttttt agtagaaaag cacaccttgc 181 atgtcatcat agacttcata ctgtctaagg tttctaatca acaatcaaac cttgcacaac 241 atcagagagt ttatactgga gagaaacctt acaagtgtaa tgagtggggc aaagccttaa 301 gtgggaagtc gtcacttttt tatcatcaag caatccatgg tgtagggaaa ctttgcaaat 361 gtaatgattg tcacaaagtc ttcagtaatg ctacaaccat tgcaaatcac tggagaatcc 421 ataatgaaga cagatcttac aagtgtaata aatgtggtaa aattttcaga catcgatcat 481 atcttgcagt ttatcagcga actcatactg gagagaaacc ttacaaatat catgactgtg 541 gcaaggtctt cagtcaagct tcatcctatg caaaacatag gagaattcat acaggagaga 601 aacctcacaa gtgtgatgat tgtggcaaag tcttgacttc acgttcacac ctcattagac 661 atcagagaat ccatactgga cagaaatctt acaaatgtct taagtgtggc aaggtcttca 721 gtctgtgggc actccatgca gaacatcaga aaattcattt ttgagataac tgttccaaat 781 acagtgacta tagaagatca taaagcttta attgacatta gagccaaata ggcattgact 841 tgagattgag ttgacttaac cttgagttta agaattaatt tacattaaag tgtttatgtt 901 aagaagattg ggccaggtgg gattacaggc gcgagcaccg cgcccggccc ctaagttaat 961 atttcaaaca atcgaaggta aaacaacata ttgtgttggg ccacctgtac tgaacgctga 1021 atcgtttttc ctcttaagtt gaaaatggtt ttaatgcaaa gcgccttttt ttgagcaggt 1081 agagtcacgc atccggcagg cggggcgagc tcccctctgt ctggggcagg gtgggggaga 1141 ggggcaggga cctcggtaaa ggggtggagt ggcgcgctgg ttgccgcggg cactggcaat 1201 tagaagggat tattaaacta agcaaggtcc tgggttgttt gagtggataa tggaaactga 1261 aaggtgacgt gcaaaactgc ctattactcc caggagtgga ggataatttc atatttcatg 1321 gaaataaact cagggcccgg agcggtggct cacacctgta atcccagcac tttgagaggc 1381 caaggaggga ggatcgctta agcccaggaa ttcgaaatca gcctaggcaa catagtaaga 1441 cctcatctct actaaaaata aaaaaaaaca gccaggtgtg ttagtccaca cctgtggtcc 1501 cagctgcctc agcttcccga gtagctggga ttacaggtat gaaccactat gcccggctaa 1561 ctttgttttt tttttttaga aattaaacct tttttcagct taatgaccca ggggtgtatt 1621 tttgaaggac ttgggagctc tctttgaaag gcaaacaaca agggaaacag tacctttatc 1681 tcagtaggaa attaaataat tcaaacatca aataacttca atttaaggct atggactttg 1741 agataattct gagccttgag aggaatgtgg tcaggcaacc tgagtccagt ggaatgcagg 1801 tgcaacttct aagagttttc ctgtaagtaa ttaagaagac taagtagccc cagagataag 1861 acctcctcgg atcattgtcc cttcttatgt agtgataaag taaccttcct tgaagtgtat 1921 ctatccgtaa tcaatcaagt tgctgcagcc tatgcactgg cccagaataa aaaacgtggt 1981 gattctgcta aagcttctct gtctttccct gtgtgtgaaa tcttaacgtc tctacttggg 2041 aacgctgatc ccattcattt agagttgatg tttccacgtg gctatttcca agctttgcct 2101 tcaaataaat tctgtactta atcatatatt ctaaatttta ttatttactg ctgacatcag 2161 tttctgtcgg attgtaggag cctcaccaga gagggcccct gtcgccatgt tgtaaaactc 2221 acacttgcca aaagttgtgg gttagggttt ctccccctcc ctcaggatga cgctagttag 2281 ctgacacaga tggtcacctc cattaccaag tagagtcagg atgaactatg tgtgactgtt 2341 caactatgtg tcctcttccc tgaggactga ttagtgttta tcttgaaaac atgtccttaa 2401 tgggttgtat agaacactga agcatctgat ttcaaactct tagctctttt cctctatttc 2461 ccatcacatt ctggtctaag gcttatttat taataaaatg atttttattt ctttaaacaa 2521 aaaaaacttt agagcacact ggggtaccgg atc // LOCUS HSU09466 2797 bp mRNA PRI 30-AUG-1994 DEFINITION Human heme A:farnesyltransferase (COX10) mRNA, complete cds. ACCESSION U09466 NID g495492 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2797) AUTHORS Glerum,M.D. and Tzagoloff,A. TITLE Isolation of a human cDNA for heme A:farnesyltransferase by functional complementation of a yeast cox10 mutant JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91, 8452-8456 (1994) MEDLINE 94359949 REFERENCE 2 (bases 1 to 2797) AUTHORS Glerum,M.D. TITLE Direct Submission JOURNAL Submitted (06-MAY-1994) Moira D. Glerum, Biological Sciences, Columbia University, New York, NY 10027, USA FEATURES Location/Qualifiers source 1..2797 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="HeLa cDNA in pDB20" gene 41..1372 /gene="COX10" CDS 41..1372 /gene="COX10" /codon_start=1 /function="a mitochondrial protein involved in heme A biosynthesis" /product="heme A:farnesyltransferase" /db_xref="PID:g495493" /translation="MAASPHTLSSRLLTGCVGGSVWYLERRTIQDSPHKFLHLLRNVN KQWITFQHFSFLKRMYVTQLNRSHNQQVRPKPEPVASPFLEKTSSGQAKAEIYEMRPL SPPSLSLSRKPNEKELIELEPDSVIEDSIDVGKETKEEKRWKEMKLQVYDLPGILAQL SKIKLTALVVSTTAAGFALAPGPFDWPCFLLTSVGTGLASCAANSINQFFEVPFDSNM NRTKNRPLVRGQISPLLAVSFATCCAVPGVAILTLGVNPLTGALGLFNIFLYTCCYTP LKRISIANTWVGAVVGAIPPVMGWTATTGSLDAGAFLLGGILYSWQFPHFNALSWGLR EDYSRDGYCMMSVTHPGLCRRVALRHCLALLVLSAAAPVLDITTWTFPIMALPINAYI SHLGFRFYVDADRRSSRRLFFCSLWHLPLLLLLMLTCKRPSGGGDAGPPPS" BASE COUNT 664 a 781 c 643 g 709 t ORIGIN 1 gacacaggga tcccggggag cggccccaga ctcgtaaatt atggccgcat ctccgcacac 61 tctctcctca cgcctcctga caggttgcgt aggaggctct gtctggtatc ttgaaagaag 121 aactatacag gactcccctc acaagttctt acatcttctc aggaatgtca ataagcagtg 181 gattacattt cagcacttta gcttcctcaa acgcatgtat gtcacacagc tgaacagaag 241 ccacaaccag caagtaagac ccaagccaga accagtagca tctcctttcc ttgaaaaaac 301 atcttcaggt caagccaaag cagaaatata tgagatgaga cctctctcac cgcccagcct 361 atctttgtcc agaaagccaa atgaaaagga attgatagaa ctagagccag actcagtaat 421 tgaagactca atagatgtag ggaaagagac aaaagaggaa aagcggtgga aagagatgaa 481 gctgcaagtg tatgatttgc caggaatttt ggctcaacta tccaaaatca aactcacagc 541 tctagttgta agtaccactg cagctggatt tgcattggct ccgggccctt ttgactggcc 601 ctgtttcctg cttacttctg ttgggacagg ccttgcatcc tgtgctgcca actccatcaa 661 tcagtttttt gaggtgccat ttgactcaaa catgaatagg acaaagaaca gaccgctggt 721 tcgtggacag atcagcccat tgctagctgt gtcctttgcc acttgttgtg ctgttccggg 781 agttgccatt ctgaccttgg gggtgaatcc actcacagga gccctggggc tcttcaacat 841 tttcctgtat acctgctgct acacaccact gaaaaggatc agcattgcca acacatgggt 901 cggagctgtg gttggggcca tcccgcctgt catgggctgg acagcgacca cgggcagcct 961 cgatgctggc gcatttctcc tgggaggaat cctctactcc tggcagtttc ctcatttcaa 1021 cgccctgagc tggggcctcc gtgaagacta ctcccgggac ggctactgca tgatgtcggt 1081 cacccacccg ggcctgtgcc ggcgcgtggc gctgcgccac tgcctggccc tgctcgtgct 1141 gtccgcagca gcccctgtgc tggacatcac cacatggacc ttccccatca tggcccttcc 1201 catcaatgcg tacatctccc acctcggctt ccgcttctac gtggacgcag accgcaggag 1261 ctcgcggaga ctgttcttct gcagcctgtg gcacctgccg ctgctgctgc tgctcatgct 1321 cacctgcaag cggccgagcg gaggcgggga cgcagggccc cctcccagct gagagcactg 1381 ggacgcccac cgcccctttc cctccgctgc caggcgagca tgttgtggta attctggaac 1441 acaagaagag aaattgctgg gtttagaaca agattataaa cgaattcggt gctcagtgat 1501 cacttgacag tttttttttt ttttaaatat tacccaaaat gctccccaaa taagaaatgc 1561 atcagctcag tcagtgaata caaaaaagga attatttttc cctttgaggg tcttttatac 1621 atctctcctc caaccccacc ctctattctg tttcttcctc ctcacatggg ggtacacata 1681 cacagcttcc tcttttggtt ccatccttac caccacacca cacgcacact ccacatgccc 1741 agcagagtgg cacttggtgg ccagaaagtg tgagcctcat gatctgctgt ctgtagttct 1801 gtgagctcag gtccctcaaa ggcctcggag cacccccttc cttgtgactg agccagggcc 1861 tgcatttttg gttttcccca ccccacacat tctcaaccat agtccttcta acaataccaa 1921 tagctaggac ccggctgctg tgcactggga ctggggattc cacatgtttg ccttgggagt 1981 ctcaagctgg actgccagcc cctgtcctcc cttcaccccc attgcgtatg agcatttcag 2041 aactccaagg agtcacaggc atctttatag ttcacgttaa catatagaca ctgttggaag 2101 cagttccttc taaaagggta gccctggact taataccagc cggatacctc tggcccccac 2161 cccattactg tacctctgga gtcactactg tgggtcgcca ctcctctgct acacagcacg 2221 gctttttcaa ggctgtattg agaagggaag ttaggaagaa gggtgtgctg ggctaaccag 2281 cccacagagc tcacattcct gtcccttggg tgaaaaatac atgtccatcc tgatatctcc 2341 tgaattcaga aattagcctc cacatgtgca atggctttaa gagccagaag cagggttctg 2401 ggaattttgc aagttacctg tggccaggtg tggtctcggt taccaaatac ggttacctgc 2461 agctttttag tcctttgtgc tcccacgggt ctacagagtc ccatctgccc aaaggtcttg 2521 aagcttgaca ggatgttttc gattactcag tctcccaggg cactactggt ccgtaggatt 2581 cgattggtcg gggtaggaga gttaaacaac atttaaacag agttctctca aaaatgtcta 2641 aagggattgt aggtagataa catccaatca ctgtttgcac ttatctgaaa tcttccctct 2701 tggctgcccc caggtattta ctgtggagaa cattgcatag gaatgtctgg aaaaagcttc 2761 tacaacttgt tacagccttc acatttgtag aagcttt // LOCUS HSU09510 2462 bp mRNA PRI 02-FEB-1996 DEFINITION Human glycyl-tRNA synthetase mRNA, complete cds. ACCESSION U09510 NID g595304 KEYWORDS GlyRS. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2462) AUTHORS Williams,J., Osvath,S., Khong,T.F., Pearse,M. and Power,D. TITLE Cloning, sequencing and bacterial expression of human glycine tRNA synthetase JOURNAL Nucleic Acids Res. 23 (8), 1307-1310 (1995) MEDLINE 95273165 REFERENCE 2 (bases 1 to 2462) AUTHORS Williams,J.H. TITLE Direct Submission JOURNAL Submitted (09-MAY-1994) James H. Williams, Clinical Immunology, St. Vincents Hospital, 41 Victoria Parade, Fitzroy, Victoria 3065, Australia FEATURES Location/Qualifiers source 1..2462 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 365..2422 /EC_number="6.1.1.14" /codon_start=1 /product="glycyl-tRNA synthetase" /db_xref="PID:g493066" /translation="MDGAGAEEVLAPLRLAVRQQGDLVRKLKEDKAPQVDVDKAVAEL KARKRVLEAKELALQPKDDIVDRAKMEDTLKRRFFYDQAFAIYGGVSGLYDFGPVGCA LKNNIIQTWRQHFIQEEQILEIDCTMLTPEPVLKTSGHVDKFADFMVKDVKNGECFRA DHLLKAHLQKLMSDKKCSVEKKSEMESVLAQLDNYGQQELADLFVNYNVKSPITGNDL SPPVSFNLMFKTFIGPGGNMPGYLRPETAQGIFLNFKRLLEFNQGKLPFAAAQIGNSF RNEISPRSGLIRVREFTMAEIEHFVDPSEKDHPKFQNVADLHLYLYSAKAQVSGQSAR KMRLGDAVEQGVINNTVLGYFIGRIYLYLTKVGISPDKLRFRQHMENEMAHYACDCWD AESKTSYGWIEIVGCADRSCYDLSCHARATKVPLVAEKPLKEPKTVNVVQFEPSKGAI GKAYKKDAKLVMEYLAICDECYITEIEMLLNEKGEFTIETEGKTFQLTKDMINVKRFQ KTLYVEEVVPNVIEPSFGLGRIMYTVFEHTFHVREGDEQRTFFSFPAVVAPFKCSVLP LSQNQEFMPFVKELSEALTRHGVSHKVDDSSGSIGRRYARTDEIGVAFGVTIDFDTVN KTPHTATLRDRDSMRQIRAEISELPSIVQDLANGNITWADVEARYPLFEGQETGKKET IEE" BASE COUNT 669 a 576 c 610 g 607 t ORIGIN 1 gctccccatt gtccttgtca atactgtgag actcatccaa tatcctatcc acctctacgt 61 agtctggatt aaagggctct tcatcctcat ggaagaagtg tctcatctga gccagggggg 121 gcggcgattt catcatgctc cgagccggcg gcgcgcgccg cttccgtcgc caccctctct 181 ggacagccca gggccgcacg tcatgccctc tccgcgtcca gtgctgctta gaggtgctcg 241 cgccgctctg ctgctgctgc tgccgccccg gctcttagcc cgaccctcgc tcctgctccg 301 ccggtccctc agcgcggcct cctgcgcccc gatctccttg cccgccgccg cctcccggag 361 cagcatggac ggcgcggggg ctgaggaggt gctggctcct ctgaggctag cagtgcgcca 421 gcagggagat cttgtgcgaa aactcaaaga agataaagca ccccaagtag acgtagacaa 481 agcagtggct gagctcaaag cccgcaagag ggttctggaa gcaaaggagc tggcgttaca 541 gcccaaagat gatattgtag accgagcaaa aatggaagat accctgaaga ggaggttttt 601 ctatgatcaa gcttttgcta tttatggagg tgttagtggt ctgtatgact ttgggccagt 661 tggctgtgct ttgaagaaca atattattca gacctggagg cagcacttta tccaagagga 721 acagatcctg gagatcgatt gcaccatgct cacccctgag ccagttttaa agacctctgg 781 ccatgtagac aaatttgctg acttcatggt gaaagacgta aaaaatggag aatgttttcg 841 tgctgaccat ctattaaaag ctcatttaca gaaattgatg tctgataaga agtgttctgt 901 cgaaaagaaa tcagaaatgg aaagtgtttt ggcccagctt gataactatg gacagcaaga 961 acttgcggat ctttttgtga actataatgt aaaatctccc attactggaa atgatctatc 1021 ccctccagtg tcttttaact taatgttcaa gactttcatt gggcctggag gaaacatgcc 1081 tgggtacttg agaccagaaa ctgcacaggg gattttcttg aatttcaaac gacttttgga 1141 gttcaaccaa ggaaagttgc cttttgctgc tgcccagatt ggaaattctt ttagaaatga 1201 gatctcccct cgatctggac tgatcagagt cagagaattc acaatggcag aaattgagca 1261 ctttgtagat cccagtgaga aagaccaccc caagttccag aatgtggcag accttcacct 1321 ttatttgtat tcagcaaaag cccaggtcag cggacagtcc gctcggaaaa tgcgcctggg 1381 agatgctgtt gaacagggtg tgattaataa cacagtatta ggctatttca ttggccgcat 1441 ctacctctac ctcacgaagg ttggaatatc tccagataaa ctccgcttcc ggcagcacat 1501 ggagaatgag atggcccatt atgcctgtga ctgttgggat gcagaatcca aaacatccta 1561 cggttggatt gagattgttg gatgtgctga tcgttcctgt tatgacctct cctgtcatgc 1621 acgagccacc aaagtcccac ttgtagctga gaaacctctg aaagaaccca aaacagtcaa 1681 tgttgttcag tttgaaccca gtaagggagc aattggtaag gcatataaga aggatgcaaa 1741 actggtgatg gagtatcttg ccatttgtga tgagtgctac attacagaaa ttgagatgct 1801 gctgaatgag aaaggggaat tcacaattga aactgaaggg aaaacatttc agttaacaaa 1861 agacatgatc aatgtgaaga gattccagaa aacactatat gtggaagaag ttgttccgaa 1921 tgtaattgaa ccttccttcg gcctgggtag gatcatgtat acggtatttg aacatacatt 1981 ccatgtacga gaaggagatg aacagagaac attcttcagt ttccctgctg tagttgctcc 2041 attcaaatgt tccgtcctcc cactgagcca aaaccaggag ttcatgccat ttgtcaagga 2101 attatcggaa gccctgacca ggcatggagt atctcacaaa gtagacgatt cctctgggtc 2161 aatcggaagg cgctatgcca ggactgatga gattggcgtg gcttttggtg tcaccattga 2221 ctttgacaca gtgaacaaga ccccccacac tgcaactctg agggaccgtg actcaatgcg 2281 gcagataaga gcagagatct ctgagctgcc cagcatagtc caagacctag ccaatggcaa 2341 catcacatgg gctgatgtgg aggccaggta tcctctgttt gaagggcaag agactggtaa 2401 aaaagagaca atcgaggaat gaggacaatt ttgacaactt ttgaccactt gcgctaataa 2461 aa // LOCUS HSU09550 2198 bp mRNA PRI 09-FEB-1996 DEFINITION Human oviductal glycoprotein mRNA, complete cds. ACCESSION U09550 NID g1184036 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2198) AUTHORS Arias,E.B., Verhage,H.G. and Jaffe,R.C. TITLE Complementary deoxyribonucleic acid cloning and molecular characterization of an estrogen-dependent human oviductal glycoprotein JOURNAL Biol. Reprod. 51 (4), 685-694 (1994) MEDLINE 95119256 REFERENCE 2 (bases 1 to 2198) AUTHORS Jaffe,R.C. TITLE Direct Submission JOURNAL Submitted (10-MAY-1994) Randal C. Jaffe, Department of Physiology and Biophysics, University of Illinois at Chicago, 901 South Wolcott Ave., Chicago, IL 60612-7342, USA FEATURES Location/Qualifiers source 1..2198 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="oviduct" CDS 13..2049 /codon_start=1 /product="oviductal glycoprotein" /db_xref="PID:g1184037" /translation="MWKLLLWVGLVLVLKHHDGAAHKLVCYFTNWAHSRPGPASILPH DLDPFLCTHLIFAFASMNNNQIVAKDLQDEKILYPEFNKLKERNRELKTLLSIGGWNF GTSRFTTMLSTFANREKFIASVISLLRTHDFDGLDLFFLYPGLRGSPMHDRWTFLFLI EELLFAFRKEALLTMRPRLLLSAAVSGVPHIVQTSYDVRFLGRLLDFINVLSYDLHGS WERFTGHNSPLFSLPEDPKSSAYAMNYWRKLGAPSEKLIMGIPTYGRTFRLLKASKNG LQARAIGPASPGKYTKQEGFLAYFEICSFVWGAKKHWIDYQYVPYANKGKEWVGYDNA ISFSYKAWFIRREHFGGAMVWTLDMDDVRGTFCGTGPFPLVYVLNDILVRAEFSSTSL PQFWLSSAVNSSSTDPERLAVTTAWTTDSKILPPGGEAGVTEIHGKCENMTITPRGTT VTPTKETVSLGKHTVALGEKTEITGAMTMTSVGHQSMTPGEKALTPVGHQSVTTGQKT LTSVGYQSVTPGEKTLTPVGHQSVTPVSHQSVSPGGTTMTPVHFQTETLRQNTVAPRR KAVAREKVTVPSRNISVTPEGQTMPLRGENLTSEVGTHPRMGNLGLQMEAENRMMLSS SPVIQLPEQTPLAFDNRFVPIYGNHSSVNSVTPQTSPLSLKKEIPENSAVDEEA" polyA_signal 2174..2179 polyA_site 2174 /note="19 A residues" BASE COUNT 552 a 564 c 529 g 553 t ORIGIN 1 cagaccattg agatgtggaa gctgttgctg tgggttgggc tggttcttgt gctgaaacac 61 cacgatggtg ctgcccataa actcgtgtgt tatttcacca actgggcaca cagtcggcca 121 ggccctgcct cgatcttgcc ccatgacctg gacccctttc tctgcaccca cctgatattt 181 gcctttgcct caatgaacaa caatcagatt gttgctaagg atctccagga tgagaaaatt 241 ctctacccag agttcaacaa actaaaggag aggaacagag agctgaaaac actactgtcc 301 atcggcgggt ggaactttgg cacctcaaga ttcaccacta tgttgtccac atttgccaac 361 cgtgaaaagt ttattgcttc agttatatcc cttctgagga cacatgactt tgatggtctt 421 gaccttttct tcttatatcc tggactaaga ggcagcccca tgcatgaccg gtggactttt 481 ctcttcttaa ttgaagagct cctgtttgcc ttccggaagg aggcactgct caccatgcgc 541 ccgaggctgc tgctgtctgc tgctgtttct ggggtcccac acatcgtcca aacatcctat 601 gatgtgcgct ttctaggaag actcctggat ttcatcaatg tcttgtctta tgacttacat 661 ggaagttggg aaaggttcac aggacataat agccccctct tctctctgcc tgaagacccc 721 aaatcttcgg catatgctat gaattattgg agaaagcttg gggcaccctc agagaagctc 781 atcatgggga tccccaccta tggacgtacc tttcgcctcc tcaaagcctc taagaatggg 841 ttgcaggcca gagcgatcgg accagcatct ccagggaagt acaccaagca agaaggcttc 901 ttggcttatt ttgagatttg ttcctttgtc tggggagcga agaagcactg gattgattac 961 cagtatgtcc cgtatgccaa caaggggaaa gagtgggttg gctatgacaa tgccatcagc 1021 ttcagttaca aggcatggtt tataaggcga gagcattttg ggggggccat ggtgtggaca 1081 ttggacatgg atgacgtcag gggcacgttc tgtggcactg gccctttccc ccttgtctac 1141 gtattgaatg atatcctggt gcgggctgag ttcagttcaa cttctttacc acaattttgg 1201 ctgtcatctg ctgtgaattc ttcaagcact gaccctgaaa ggctggctgt gaccacggca 1261 tggaccactg atagtaagat tttgccccca ggaggagagg ctggggtcac tgagatccac 1321 ggaaagtgtg aaaatatgac tataacccct agaggtacaa ctgtgacccc tacaaaggaa 1381 actgtatccc ttggaaagca cactgtagct ctaggagaga agactgagat cactggggca 1441 atgaccatga cttctgtggg tcatcagtcc atgacccctg gagagaaggc cctgacccct 1501 gtgggtcatc aatctgtgac cactggacag aagaccctga cctctgtggg ttatcagtct 1561 gtgacccctg gggaaaagac cctgacccct gtgggtcatc agtctgtgac ccctgtgagt 1621 catcagtctg tgagccctgg aggaacgact atgacccctg tccattttca gactgagacc 1681 cttagacaga atacagtggc ccctagaagg aaggctgtgg cccgtgaaaa ggtgactgtc 1741 ccctccagaa acatatcagt cacccctgaa gggcagacta tgcctttaag aggggagaat 1801 ttgacttctg aggtgggcac tcaccccagg atgggtaact tgggtcttca gatggaagct 1861 gaaaacagga tgatgctgtc ctccagcccc gtcatccagc tcccggaaca aactcctcta 1921 gcttttgaca accgctttgt tcccatctat ggaaaccatt cctctgtcaa ctcagtaacc 1981 cctcaaacaa gtcctctttc tctaaaaaaa gaaatcccag aaaactctgc tgtggatgaa 2041 gaagcctaag cccctctggt gtcagaaacc agggaaaacc cttgtctttt cttctaagtg 2101 acatgttgga agccttctca tcccggggca aagcaggcat caaaaccaga ataggccaat 2161 ctcttttcca ttaaataaac tgtaaacaca agaaccca // LOCUS HSU09559 1969 bp mRNA PRI 03-MAY-1995 DEFINITION Human Rch1 (RCH1) mRNA, complete cds. ACCESSION U09559 NID g791184 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 706 to 1968) AUTHORS Cuomo,C.A., Kirch,S.A., Gyuris,J., Brent,R. and Oettinger,M.A. TITLE Rch1, a protein that specifically interacts with the RAG-1 recombination-activating protein JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91 (13), 6156-6160 (1994) MEDLINE 94286596 REFERENCE 2 (bases 1 to 1969) AUTHORS Oettinger,M.A. TITLE Direct Submission JOURNAL Submitted (10-MAY-1994) Marjorie A. Oettinger, Molecular Biology, Massachusetts General Hospital, 50 Blossom St., Boston, MA 02114, USA FEATURES Location/Qualifiers source 1..1969 /organism="Homo sapiens" /note="human" /db_xref="taxon:9606" gene 133..1722 /gene="RCH1" CDS 133..1722 /gene="RCH1" /note="Rch1 interacts with and is the cohort for RAG-1, a recombination activating protein." /codon_start=1 /product="Rch1" /db_xref="PID:g791185" /translation="MSTNENANTPAARLHRFKNKGKDSTEMRRRRIEVNVELRKAKKD DQMLKRRNVSSFPDDATSPLQENRNNQGTVNWSVDDIVKGINSSNVENQLQATQAARK LLSREKQPPIDNIIRAGLIPKFVSFLGRTDCSPIQFESAWALTNIASGTSEQTKAVVD GGAIPAFISLLASPHAHISEQAVWALGNIAGDGSVFRDLVIKYGAVDPLLALLAVPDM SSLACGYLRNLTWTLSNLCRNKNPAPPIDAVEQILPTLVRLLHHDDPEVLADTCWAIS YLTDGPNERIGMVVKTGVVPQLVKLLGASELPIVTPALRAIGNIVTGTDEQTQVVIDA GALAVFPSLLTNPKTNIQKEATWTMSNITAGRQDQIQQVVNHGLVPFLVSVLSKADFK TQKEAVWAVTNYTSGGTVEQIVYLVHCGIIEPLMNLLTAKDTKIILVILDAISNIFQA AEKLGETEKLSIMIEECGGLDKIEALQNHENESVYKASLSLIEKYFSVEEEEDQNVVP ETTSEGYTFQVQDGAPGTFNF" source 229..1968 /organism="Homo sapiens" /clone="Rch1.1 (33-529)" /cell_line="HeLa" /tissue_type="cervical carcinoma" 3'UTR 1723..1968 polyA_site 1969 /note="7 A nucleotides" BASE COUNT 544 a 451 c 434 g 540 t ORIGIN 1 gccacacggt ctttgagctg agtcgaggtg gaccctttga acgcagtcgc cctacagccg 61 ctgattcccc ccgcatcgcc tcccgtggaa gcccaggccc gcttcgcagc tttctccctt 121 tgtctcataa ccatgtccac caacgagaat gctaatacac cagctgcccg tcttcacaga 181 ttcaagaaca agggaaaaga cagtacagaa atgaggcgtc gcagaataga ggtcaatgtg 241 gagctgagga aagctaagaa ggatgaccag atgctgaaga ggagaaatgt aagctcattt 301 cctgatgatg ctacttctcc gctgcaggaa aaccgcaaca accagggcac tgtaaattgg 361 tctgttgatg acattgtcaa aggcataaat agcagcaatg tggaaaatca gctccaagct 421 actcaagctg ccaggaaact actttccaga gaaaaacagc cccccataga caacataatc 481 cgggctggtt tgattccgaa atttgtgtcc ttcttgggca gaactgattg tagtcccatt 541 cagtttgaat ctgcttgggc actcactaac attgcttctg ggacatcaga acaaaccaag 601 gctgtggtag atggaggtgc catcccagca ttcatttctc tgttggcatc tccccatgct 661 cacatcagtg aacaagctgt ctgggctcta ggaaacattg caggtgatgg ctcagtgttc 721 cgagacttgg ttattaagta cggtgcagtt gacccactgt tggctctcct tgcagttcct 781 gatatgtcat ctttagcatg tggctactta cgtaatctta cctggacact ttctaatctt 841 tgccgcaaca agaatcctgc acccccgata gatgctgttg agcagattct tcctacctta 901 gttcggctcc tgcatcatga tgatccagaa gtgttagcag atacctgctg ggctatttcc 961 taccttactg atggtccaaa tgaacgaatt ggcatggtgg tgaaaacagg agttgtgccc 1021 caacttgtga agcttctagg agcttctgaa ttgccaattg tgactcctgc cctaagagcc 1081 atagggaata ttgtcactgg tacagatgaa cagactcagg ttgtgattga tgcaggagca 1141 ctcgccgtct ttcccagcct gctcaccaac cccaaaacta acattcagaa ggaagctacg 1201 tggacaatgt caaacatcac agccggccgc caggaccaga tacagcaagt tgtgaatcat 1261 ggattagtcc cattccttgt cagtgttctc tctaaggcag attttaagac acaaaaggaa 1321 gctgtgtggg ccgtgaccaa ctataccagt ggtggaacag ttgaacagat tgtgtacctt 1381 gttcactgtg gcataataga accgttgatg aacctcttaa ctgcaaaaga taccaagatt 1441 attctggtta tcctggatgc catttcaaat atctttcagg ctgctgagaa actaggtgaa 1501 actgagaaac ttagtataat gattgaagaa tgtggaggct tagacaaaat tgaagctcta 1561 caaaaccatg aaaatgagtc tgtgtataag gcttcgttaa gcttaattga gaagtatttc 1621 tctgtagagg aagaggaaga tcaaaacgtt gtaccagaaa ctacctctga aggctacact 1681 ttccaagttc aggatggggc tcctgggacc tttaactttt agatcatgta gctgagacat 1741 aaatttgttg tgtactacgt ttggtatttt gtcttattgt ttctctacta agaactcttt 1801 cttaaatgtg gtttgttact gtagcacttt ttacactgaa actatacttg aacagttcca 1861 actgtacata catactgtat gaagcttgtc ctctgactag gtttctaatt tctatgtgga 1921 atttcctatc ttgcagcatc ctgtaaataa acattcaagt ccaccctta // LOCUS HSU09564 4326 bp mRNA PRI 05-AUG-1994 DEFINITION Human serine kinase mRNA, complete cds. ACCESSION U09564 NID g507212 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4326) AUTHORS Gui,J.F., Lane,W.S. and Fu,X.D. TITLE A serine kinase regulates intracellular localization of splicing factors in the cell cycle [see comments] JOURNAL Nature 369 (6482), 678-682 (1994) MEDLINE 94268559 REFERENCE 2 (bases 1 to 4326) AUTHORS Fu,X. TITLE Direct Submission JOURNAL Submitted (10-MAY-1994) Xiang-Dong Fu, Cellular and Molecular Medicine, University of California San Diego, 9500 Gilman Drive, La Jolla, CA 92093, USA FEATURES Location/Qualifiers source 1..4326 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pSK-SRPK1" /clone_lib="Uni-Zap XR HeLa S3" /cell_line="HeLa S3" CDS 109..2076 /codon_start=1 /function="serine kinase for SR splicing factors" /product="serine kinase" /db_xref="PID:g507213" /translation="MERKVLALQARKKRTKAKKDKAQRKSETQHRGSAPHSESDLPEQ EEEILGSDDDEQEDPNDYCKGGYHLVKIGDLFNGRYHVIRKLGWGHFSTVWLSWDIQG KKFVAMKVVKSAEHYTETALDEIRLLKSVRNSDPNDPNREMVVQLLDDFKISGVNGTH ICMVFEVLGHHLLKWIIKSNYQGLPLPCVKKIIQQVLQGLDYLHTKCRIIHTDIKPEN ILLSVNEQYIRRLAAEATEWQRSGAPPPSGSAVSTAPQPKPADKMSKNKKKKLKKKQK RQAELLEKRMQEIEEMEKESGPGQKRPNKQEESESPVERPLKENPPNKMTQEKLEESS TIGQDQTLMERDTEGGAAEINCNGVIEVINYTQNSNNETLRHKEDLHNANDCDVQNLN QESSFLSLPNGDSSTSQETDSCTPITSEVSDTMVCQSSSTVGQSFSEQHISQLQESIR AEIPCEDEQEQEHNGPLDNKGKSTAGNFLVNPLEPKNAEKLKVKIADLGNACWVHKHF TEDIQTRQYRSLEVLIGSGYNTPADIWSTACMAFELATGDYLFEPHSGEEYTRDEDHI ALIIELLGKVPRKLIVAGKYSKEFFTKKGDLKHITKLKPWGLFEVLVEKYEWSQEEAA GFTDFLLPMLELIPEKRATAAECLRHPWLNS" polyA_signal 4202..4207 /evidence=not_experimental polyA_site 4326 /note="20 A residues" BASE COUNT 1296 a 867 c 894 g 1269 t ORIGIN 1 atataaaata gtattccaaa taagtacatt ttatagcaaa attatgcatt tttcctaaga 61 ctttcatcac caatatcgcc ttataccctg cttttgttgg gtctcaccat ggagcggaaa 121 gtgcttgcgc tccaggcccg aaagaaaagg accaaggcca agaaggacaa agcccaaagg 181 aaatctgaaa ctcagcaccg aggctctgct ccccactctg agagtgatct accagagcag 241 gaagaggaga ttctgggatc tgatgatgat gagcaagaag atcctaatga ttattgtaaa 301 ggaggttatc atcttgtgaa aattggagat ctattcaatg ggagatacca tgtgatccga 361 aagttaggct ggggacactt ttcaacagta tggttatcat gggatattca ggggaagaaa 421 tttgtggcaa tgaaagtagt taaaagtgct gaacattaca ctgaaacagc actagatgaa 481 atccggttgc tgaagtcagt tcgcaattca gaccctaatg atccaaatag agaaatggtt 541 gttcaactac tagatgactt taaaatatca ggagttaatg gaacacatat ctgcatggta 601 tttgaagttt tggggcatca tctgctcaag tggatcatca aatccaatta tcaggggctt 661 ccactgcctt gtgtcaaaaa aattattcag caagtgttac agggtcttga ttatttacat 721 accaagtgcc gtatcatcca cactgacatt aaaccagaga acatcttatt gtcagtgaat 781 gagcagtaca ttcggaggct ggctgcagaa gcaacagaat ggcagcgatc tggagctcct 841 ccgccttccg gatctgcagt cagtactgct ccccagccta aaccagctga caaaatgtca 901 aagaataaga agaagaaatt gaagaagaag cagaagcgcc aggcagaatt actagagaag 961 cgaatgcagg aaattgagga aatggagaaa gagtcgggcc ctgggcaaaa aagaccaaac 1021 aagcaagaag aatcagagag tcctgttgaa agacccttga aagagaaccc acctaataaa 1081 atgacccaag aaaaacttga agagtcaagt accattggcc aggatcaaac gcttatggaa 1141 cgtgatacag agggtggtgc agcagaaatt aattgcaatg gagtgattga agtcattaat 1201 tatactcaga acagtaataa tgaaacattg agacataaag aggatctaca taatgctaat 1261 gactgtgatg tccaaaattt gaatcaggaa tctagtttcc taagtctccc aaatggagac 1321 agcagcacat ctcaagaaac agactcttgt acacctataa catctgaggt gtcagacacc 1381 atggtgtgcc agtcttcctc aactgtaggt cagtcattca gtgaacaaca cattagccaa 1441 cttcaagaaa gcattcgggc agagataccc tgtgaagatg aacaagagca agaacataac 1501 ggaccactgg acaacaaagg aaaatccacg gctggaaatt ttcttgttaa tccccttgag 1561 ccaaaaaatg cagaaaagct caaggtgaag attgctgacc ttggaaatgc ttgttgggtg 1621 cacaaacatt tcactgaaga tattcaaaca aggcaatatc gttccttgga agttctaatc 1681 ggatctggct ataatacccc tgctgacatt tggagcacgg catgcatggc ctttgaactg 1741 gccacaggtg actatttgtt tgaacctcat tcaggggaag agtacactcg agatgaagat 1801 cacattgcat tgatcataga acttctgggg aaggtgcctc gcaagctcat tgtggcagga 1861 aaatattcca aggaattttt caccaaaaaa ggtgacctga aacatatcac gaagctgaaa 1921 ccttggggcc tttttgaggt tctagtggag aagtatgagt ggtctcagga agaggcagct 1981 ggcttcacag atttcttact gcccatgttg gagctgatcc ctgagaagag agccactgcc 2041 gccgagtgtc tccggcaccc ttggcttaac tcctaagccc ctgcccagca ccacagcaga 2101 gatcacacac tgaccctccg cccttcccct tcaagcattt tcctcttccc ttttcagggt 2161 gaagctcttc cttcaagagt ttctagatct tgtttttttt ttaatccaac atgttcattt 2221 gggtttgctt acttgaccct gtggagatcc ccacagccat tgggcatcct aggtgaattt 2281 ggccttggtt ggctctgcca aagactaatg gactaaaatg tgaaacagcc tcttgccctg 2341 tacctttcct tcccattagg acatccttta aattataagc atcctttttg aaaagagcta 2401 tgaaggtgta tgagcccatc cttttattca ttgactctaa gagtcaaatt ttctagtgca 2461 tatcctattg ccagcataag gatgaggagg gggaaagggt cttaattcta tgtacagcag 2521 agacattaaa cttgctgtgt ccgggctgca tcatcttcct ggactgtttc tgttgttctc 2581 tgtgttcaca ttttttcctg caacttttaa gctactgtct tttttaaata gctatatgaa 2641 caccaaattt gggtaccatt ttatcactgt tcaaagcact gtcaaattcc tttcatcctt 2701 taatagttaa gatctttgaa tcttcagtct gatttttaat gtaagcaaaa acagaaccat 2761 tgaatagtaa tttcttgaga acctcaggtg ttctataaac agtcctttcc tgtatgtctt 2821 ctattaccct aagaccagag ttattttggt tggttgtttt gttttatttt ttgtttttgt 2881 atccatggct ggcactttac tcattgcact tgagtttatt gccccataac taaaggatca 2941 ggatgatggt agaacggaga tctgggtttc agagctttcc catttaagaa aaatagatct 3001 tgagattctg attcttttcc aaacagtccc ctgctttcat gtacagcttt ttctttacct 3061 tacccaaaat tctggccttg aagcagtttt cctctatggc tttgctttct gattttctca 3121 gaggctcgag tctttaatat aaccccaaat gaaagaacca aggggagggg tgggatggca 3181 cttttttttg ttggtcttgt tttgttttgt tttttggttg gttggttatt ttttaagatt 3241 agccattctc tgctgctatt tccctacata atgtcaattt ttaaccataa ttttgacatg 3301 attgagatgt acttgaggct tttttgtttt aattgagaaa agactttgca attttttttt 3361 aggatgagcc tctcctagac ttgacctaga atattacata ttcctccagt aatactgaag 3421 agcaaaagag aggcaggatt ggggtcacag ccgcttcttc agcatggacc aagtgggcct 3481 tggggattgc agcgttctcg aagtggctgt aggactcgaa tttacagaaa gccacagagg 3541 tgcaacttga ggctctgcta gcaagccacc agtgaggcta ttgggtaacc acctttctat 3601 acaggagatt ggaatctact ttgtcattta tccaccacag tgacaaagga aaagtggtgc 3661 cgttatgcaa tccatttaac tcataaacat attactctga gtaactggcc agccattcat 3721 cggatccttc attgggtact cctgaaatca gacatgttcc tgtagaaaga attttaagtt 3781 aggctttcta tgcacctatc aagaatcaag agaatagatt gtatcaaaca acggcaggga 3841 aatccttcag caattctaat ccactttggg ttttcagctg tttttacatc taaagcaata 3901 gactagaact gaattatctt ctacatagta aaatcacaat tgtggaattc tggtgatatt 3961 aaggtgaaat aacaaaacac aaaaggccct attttaacag ttgatgtgac agtaagtttt 4021 aatagaacct gtaacttcat tttggaaatg cttctccacc aaataagggc tttttcccct 4081 atttaaggag ccagatggat tgaaagatgt ggaaataggc agctgtagat cttgatcttc 4141 caggtacccc atgtaccttt attgagctta attataatac tgtcaaattg ccacgatctc 4201 actaaaggat ttctatttgc tgtcagttaa aaataaagcc ctaaatacat ttttattctt 4261 tctactgagg gcattgtctg ttttctttgt aaatgccgta caataaacaa attatttaat 4321 aaccta // LOCUS HSU09578 2481 bp mRNA PRI 15-MAR-1996 DEFINITION Human MAPKAP kinase (3pK) mRNA, complete cds. ACCESSION U09578 NID g1209017 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2481) AUTHORS Sithanandam,G., Latif,F., Duh,F.-M., Bernal,R., Smola,U., Li,H., Kuzmin,I., Wixler,V., Geil,L., Shrestha,S., LLoyd,P.A., Bader,S., Sekido,Y., Tartof,K.D., Kashuba,V.I., Zabarovsky,E.R., Dean,M., Klein,G., Lerman,M.I., Minna,J.D., Rapp,U.R. and Allikmets,R. TITLE 3pK, a new mitogen-activated protein kinase-activated protein kinase located in the small cell lung cancer tumor suppressor gene region JOURNAL Mol. Cell. Biol. 16 (3), 868-876 (1996) MEDLINE 96182089 REMARK Erratum:[[published erratum appears in Mol Cell Biol 1996 Apr;16(4):1880]] REFERENCE 2 (bases 1 to 2481) AUTHORS Duh,F.-M. TITLE Direct Submission JOURNAL Submitted (11-MAY-1994) Fuh-Mei Duh, BCDP Program, PRI/Dyncorp, NCI-FCRDC, Building 560, Rm 12-71, Frederick, MD 21702, USA FEATURES Location/Qualifiers source 1..2481 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="3pK" /chromosome="3" /map="3p21.3" /tissue_type="heart" /dev_stage="adult" 5'UTR 1..91 gene 92..1240 /gene="3pK" CDS 92..1240 /gene="3pK" /codon_start=1 /product="MAPKAP kinase" /db_xref="PID:g1209018" /translation="MDGETAEEQGGPVPPPVAPGGPGLGGAPGGRREPKKYAVTDDYQ LSKQVLGLGVNGKVLECFHRRTGQKCALKLLYDSPKARQEVDHHWQASGGPHIVCILD VYENMHHGKRCLLIIMECMEGGELFSRIQERGDQAFTEREAAEIMRDIGTAIQFLHSH NIAHRDVKPENLLYTSKEKDAVLKLTDFGFAKETTQNALQTPCYTPYYVAPEVLGPEK YDKSCDMWSLGVIMYILLCGFPPFYSNTGQAISPGMKRRIRLGQYGFPNPEWSEVSED AKQLIRLLLKTDPTERLTITQFMNHPWINQSMVVPQTPLHTARVLQEDKDHWDEVKEE MTSALATMRVDYDQVKIKDLKTSNNRLLNKRRKKQAGSSSASQGCNNQ" 3'UTR 1238..2481 polyA_signal 2461..2466 BASE COUNT 532 a 707 c 702 g 540 t ORIGIN 1 gcccctcgcc ggtacctcag caaggtgcgt tgccgccagg tgccactaga agcgccaggc 61 tggggccgcc tctgagcgcc ccgcgggggc catggatggt gaaacagcag aggagcaggg 121 gggccctgtg cccccgccag ttgcacccgg cggacccggc ttgggcggtg ctccgggggg 181 gcggcgggag cccaagaagt acgcagtgac cgacgactac cagttgtcca agcaggtgct 241 gggcctgggt gtgaacggca aagtgctgga gtgcttccat cggcgcactg gacagaagtg 301 tgccctgaag ctcctgtatg acagccccaa ggcccggcag gaggtagacc atcactggca 361 ggcttctggc ggcccccata ttgtctgcat cctggatgtg tatgagaaca tgcaccatgg 421 caagcgctgt ctcctcatca tcatggaatg catggaaggt ggtgagttgt tcagcaggat 481 tcaggagcgt ggcgaccagg ctttcactga gagagaagct gcagagataa tgcgggatat 541 tggcactgcc atccagtttc tgcacagcca taacattgcc caccgagatg tcaagcctga 601 aaacctactc tacacatcta aggagaaaga cgcagtgctt aagctcaccg attttggctt 661 tgctaaggag accacccaaa atgccctgca gacaccctgc tatactccct attatgtggc 721 ccctgaggtc ctgggtccag agaagtatga caagtcatgt gacatgtggt ccctgggtgt 781 catcatgtac atcctccttt gtggcttccc acccttctac tccaacacgg gccaggccat 841 ctccccgggg atgaagagga ggattcgcct gggccagtac ggcttcccca atcctgagtg 901 gtcagaagtc tctgaggatg ccaagcagct gatccgcctc ctgttgaaga cagaccccac 961 agagaggctg accatcactc agttcatgaa ccacccctgg atcaaccaat cgatggtagt 1021 gccacagacc ccactccaca cggcccgagt gctgcaggag gacaaagacc actgggacga 1081 agtcaaggag gagatgacca gtgccttggc cactatgcgg gtagactacg accaggtgaa 1141 gatcaaggac ctgaagacct ctaacaaccg gctcctcaac aagaggagaa aaaagcaggc 1201 aggcagctcc tctgcctcac agggctgcaa caaccagtag ctcatggggc cttggaggag 1261 cctggcctct cagcctgcat aacagactga aatgtgctca ggccctggcc aggagggccc 1321 agggtcattc ttttaacaaa ggattatttt gttgtgtttt aatttgtcac tcggaacttc 1381 aggatggagg accctgaccc taagcctcct tcagatctct ggcccaggct caagccctag 1441 agatgggcag ggcctagggg ctgggagctg cctgctgcca tagcagcacc tttagctagg 1501 ttggcccgag tgaggcctct gtgctgtcct gccctggtgc atggccttag ctttctaggc 1561 cactgggagt tgtggctggg cttcccatct tccacagaga catctccctg tgggatgggc 1621 agatgggcct ggccttgaga aaggcattgg ccattggttg ccatggtgac cagggaccac 1681 gttgctgcct gtgaatgctg agtgagcgag taagggagga ggggcgattg agggttcacc 1741 tctgccttgg ggaggctgat ttctcacaca ctggctggcc ctctcattct cactcctcct 1801 tgggccctga ggctgctgga tccagtctgc ctgcctccct gtgcagtcca gccctgcctt 1861 gctgcagccc cagcccagat ggcactcagc gctctcccct gagggagtcc ctgggcctag 1921 ccatcccctc actattcccg acccaaaggg tgacttttca tctgaactta aagtgggaga 1981 tatttttaac ttttttccac tttggaaaat gtcactgtga caaaagccag catactttcc 2041 ctgcacccat ctgctcacca gatctcaggc aggaaagccc ctctctgttg aagtcagggg 2101 ctatcttttg gtatacttgt gtgaaagtgg ctggttggga gcagagctaa gtggcttccc 2161 attaacctga ggtctctttc tttactctgg gtcagacctg aggttgggga aggcgactga 2221 gccatgctca gaatgtctgg tcctggcttg ggcctgagta gggcagagag ggcctttcat 2281 ggctgatcag agcttaccag ccccacccca ccatggtagc cttagggtgc tgagtgcctg 2341 atactgcctg acaagtgcct gacacgcagc ctagttcctt cctggcccct ctctcactgg 2401 ctgggaaacc ctagaccatg tcagatagga caacactgct gggttttaca tccagatagt 2461 aataaacacc atttcatcat t // LOCUS HSU09584 1860 bp mRNA PRI 15-MAR-1996 DEFINITION Human PL6 protein (PL6) mRNA, complete cds. ACCESSION U09584 NID g1209019 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1860) AUTHORS Latif,F., Duh,F.-M., Bader,S., Lerman,M.I. and Minna,J.D. TITLE A novel human cDNA that is homozygously deleted in small cell lung cancer and located to 3p21.3 JOURNAL Unpublished REFERENCE 2 (bases 1 to 1860) AUTHORS Duh,F.-M. TITLE Direct Submission JOURNAL Submitted (11-MAY-1994) Fuh-Mei Duh, BCDP Program, PRI/Dyncorp, NCI-FCRDC, Building 560, Rm 12-71, Frederick, MD 21702, USA FEATURES Location/Qualifiers source 1..1860 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="PL6" /chromosome="3" /map="3p21.3" /tissue_type="placenta" 5'UTR 1..211 gene 212..1267 /gene="PL6" CDS 212..1267 /gene="PL6" /codon_start=1 /product="PL6 protein" /db_xref="PID:g1209020" /translation="MQRALPGARQHLGAILASASVVVKALCAAVLFLYLLSFAVDTGC LAVTPGYLFPPNFWIWTLATHGLMEQHVWDVAISLTTVVVAGRLLEPLWGALELLIFF SVVNVSVGLLGAFAYLLTYMASFNLVYLFTVRIHGALGFLGGVLVALKQTMGDCVVLR VPQVRVSVMPMLLLALLLLLRLATLLQSPALASYGFGLLSSWVYLRFYQRHSRGRGDM ADHFAFATFFPEILQPVVGLLANLVHSLLVKVKICQKTVKRYDVGAPSSITISLPGTD PQDAERRRQLALKALNERLKRVEDQSIWPSMDDDEEESGAKVDSPLPSDKAPTPPGKG AAPESSLITFEAAPPTL" 3'UTR 1264..1860 polyA_signal 1837..1842 BASE COUNT 340 a 605 c 544 g 371 t ORIGIN 1 ggcgaggggc ctacgctgcg gcccggcaac aaggcccgac tcggcccctc gggaccagag 61 ccccacccga tcggaagcgg atcctttacc agggccatag gccagtgact aggccgggcc 121 tggacctccc atcggggccg gactaggacg aggccccggg gaggcccctg gcctaccaga 181 cccttttctc aggccgacag ccgccaggaa gatgcaacgt gccctgccag gcgcccgcca 241 gcacttgggg gccattctgg ccagcgccag cgtggtggtg aaggctctgt gtgcggcggt 301 actattcctc tacctgctct ccttcgccgt ggacacaggc tgcctggcgg tcaccccggg 361 ctacctcttt cctcccaact tctggatctg gaccctggcc acccatgggc tgatggagca 421 gcatgtgtgg gacgtggcca tcagcctgac aacggtggtg gtggccgggc gtttgctgga 481 gcccctctgg ggggccttgg agctgctcat cttcttctca gtggtgaatg tgtctgtagg 541 gctgctgggg gccttcgcct acctcctcac ctacatggct tccttcaacc tggtctacct 601 gttcactgtc cgtatccacg gcgccttggg cttcctaggt ggcgtcctgg tggcactcaa 661 gcaaaccatg ggggactgtg tggtcctgcg agtgccccag gtgcgcgtca gtgtgatgcc 721 catgctgctg ctggcgctgc tgctcctgct gcggctcgcc acactgctcc agagcccggc 781 gctggcttcc tatggcttcg ggctgctctc cagttgggta tatcttcgct tctaccagcg 841 ccatagccga ggccgagggg acatggctga ccactttgct ttcgccactt tcttccctga 901 gatcctgcag cctgtggtgg gtttgctggc gaacttggtg cacagcctcc tggtgaaggt 961 aaagatatgc cagaagacgg tgaagcgcta cgatgtgggt gccccatcct ccatcaccat 1021 cagcctgcca ggcacagacc ctcaagacgc cgagcggaga aggcaactgg ccctgaaggc 1081 actcaatgag cggctgaaga gagtggaaga ccagtccatc tggcccagca tggatgatga 1141 tgaagaggag tctggggcca aggtggacag ccccctgccc tcagacaaag ctcccacacc 1201 cccagggaag ggggctgccc cagaatccag tctaatcacc ttcgaggcag ctcccccgac 1261 gctgtaactc cagaccacct tgagtgtggc acctcccctc ccaagccccc cgttgacatc 1321 ctctcagcta ctccagggca cctgactgct ctgaggagag ggaagaaggc ctgctggggc 1381 tttccatggc cttctgctgt ttctcgccaa cactacccag gactcttgct acctggttcc 1441 aactccagac aaccactatg ccaggcccgg agcctctgag gcatcggcca gtccaggccc 1501 tcatctgagg taagaatgta catcagctgg cagccccaag caagtggctg cagggacact 1561 gatgccacag ctcctgggcc ggccctcaca tctgaaactg gttgccgaga gccctgagcc 1621 aaggcaagga tttgccaaaa atgttctggg ggcccagcaa atgcaggagc cgacctgggg 1681 ctgcacatcc ctgcccatcc ccagaaagac tgttcctgtc aggatttgtt tccctctgct 1741 gtggcggtga ctgcttctgg accagaacag ctccagctcc caggtatttt ctacaggacc 1801 acttgagtgg gcagccaagc ccaggctcgc agtatcaata aagcagttct ctgaggaatg // LOCUS HSU09607 4064 bp mRNA PRI 05-JUL-1994 DEFINITION Human JAK family protein tyrosine kinase (JAK3) mRNA, complete cds. ACCESSION U09607 NID g508730 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1124) AUTHORS Kawamura,M., McVicar,D.W., Johnston,J.A., Blake,T.B., Chen,Y., Lal,B.K., Lloyd,A.R., Kelvin,D.J., Staples,J.E., Ortaldo,J.R. and O'Shea,J. TITLE Molecular cloning of L-JAK, a Janus family protein-tyrosine kinase expressed in natural killer cells and activated leukocytes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91, 6374-6378 (1994) MEDLINE 94294384 REFERENCE 2 (bases 1 to 4064) AUTHORS O'Shea,J. TITLE Direct Submission JOURNAL Submitted (11-MAY-1994) John O'Shea, Leukocyte Cell Biology Section LEI BRMP, National Cancer Institute FCRDC, Bldg 560 Rm 3146 FCRDC, Frederick, MD 21702, USA FEATURES Location/Qualifiers source 1..4064 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="YT, HUT-78" /cell_type="natural killer cell, PHA activated T cells" gene 96..3470 /gene="JAK3" CDS 96..3470 /gene="JAK3" /codon_start=1 /product="JAK family protein tyrosine kinase" /db_xref="PID:g508731" /translation="MAPPSEETPLIPQRSCSLLSTEAGALHVLLPARAPGPPQRLSFS FGDHLAEDLCVQAAKASGILPVYHSLFALATEDLSCWFPPSHIFSVEDASTQVLLYRI RFYFPNWFGLEKCHRFGLRKDLASAILDLPVLEHLFAQHRSDLVSGRLPVGLSLKEQG ECLSLAVLDLARMAREQAQRPGELLKTVSYKACLPPSLRDLIQGLSFVTRRAIRRTVR RALPRVAACQADRHSLMAKYIMDLERLDPAGAAETFHVGLPGALGGHDGLGLLRVAGD GGIAWTQGEQEVLQPFCDFPEIVDISIKQAPRVGPAGEHRLVTVTRTDNQILEAEFPG LPEALSFVALVDGYFRLTTDSQHFFCKEVAPPRLLEEVAEQCHGPITLDFAINKLKTG GSRPGSYVLRRSPQDFDSFLLTVCVQNPLGPDYKGCLIRRSPTGTFLLVGLSRPHSSL RELLATCWDGGLHVDGVAVTLTSCCIPRPKEKSNLIVVQRGHSPPTSSLVQPQSQYQL SQMTFHKIPADSLEWHENLGHGSFTKIYRGCRHEVVDGEARKTEVLLKVMDAKHKNCM ESFLEAASLMSQVSYRHLVLLHGVCMAGDSTMVQEFVHLGAIDMYLRKRGHLVPASWK LQVVKQLAYALNYLEDKGLPHGNVSARKVLLAREGADGSPPFIKLSDPGVSPAVLSLE MLTDRIPWVAPECLREAQTLSLEADKWGFGATVWEVFSGVTMPISALDPAKKLQFYED RQQLPAPKWTELALLIQQCMAYEPVQRPSFRAVIRDLNSLISSDYELLSDPTPGALAP RDGLWNGAQLYACQDPTIFEERHLKYISQLGKGNFGSVELCRYDPLAHNTGALVAVKQ LQHSGPDQQRDFQREIQILKALHSDFIVKYRGVSYGPGRPELRLVMEYLPSGCLRDFL QRHRARLDASRLLLYSSQICKGMEYLGSRRCVHRDLAARNILVESEAHVKIADFGLAK LLPLDKDYYVVREPGQSPIFWYAPESLSDNIFSRQSDVWSFGVVLYELFTYCDKSCSP SAEFLRMMGCERDVPALCRLLELLEEGQRLPAPPACPAEVHELMKLCWAPSPQDRPSF SALGPQLDMLWSGSRGCETHAFTAHPEGKHHSLSFS" BASE COUNT 746 a 1292 c 1168 g 858 t ORIGIN 1 ccctctgacc aggactgagg ggctttttct ctctgtgccc caggcaagtt gcactcatta 61 tggaattccg gcggcccgct aggcaagttg cactcatggc acctccaagt gaagagacgc 121 ccctgatccc tcagcgttca tgcagcctct tgtccacgga ggctggtgcc ctgcatgtgc 181 tgctgcccgc tcgggccccg gggccccccc agcgcctatc tttctccttt ggggaccact 241 tggctgagga cctgtgcgtg caggctgcca aggccagcgg catcctgcct gtgtaccact 301 ccctctttgc tctggccacg gaggacctgt cctgctggtt ccccccgagc cacatcttct 361 ccgtggagga tgccagcacc caagtcctgc tgtacaggat tcgcttttac ttccccaatt 421 ggtttgggct ggagaagtgc caccgcttcg ggctacgcaa ggatttggcc agtgctatcc 481 ttgacctgcc agtcctggag cacctctttg cccagcaccg cagtgacctg gtgagtgggc 541 gcctccccgt gggcctcagt ctcaaggagc agggtgagtg tctcagcctg gccgtgttgg 601 acctggcccg gatggcgcga gagcaggccc agcggccggg agagctgctg aagactgtca 661 gctacaaggc ctgcctaccc ccaagcctgc gcgacctgat ccagggcctg agcttcgtga 721 cgcggagggc tattcggagg acggtgcgca gagccctgcc gcgcgtggcc gcctgccagg 781 cagaccggca ctcgctcatg gccaagtaca tcatggacct ggagcggctg gatccagccg 841 gggccgccga gaccttccac gtgggcctcc ctggggccct tggtggccac gacgggctgg 901 ggctgctccg cgtggctggt gacggcggca tcgcctggac ccagggagaa caggaggtcc 961 tccagccctt ctgcgacttt ccagaaatcg tagacattag catcaagcag gccccgcgcg 1021 ttggcccggc cggagagcac cgcctggtca ctgttaccag gacagacaac cagattttag 1081 aggccgagtt cccagggctg cccgaggctc tgtcgttcgt ggcgctcgtg gacggctact 1141 tccggctgac cacggactcc cagcacttct tctgcaagga ggtggcaccg ccgaggctgc 1201 tggaggaagt ggccgagcag tgccacggcc ccatcactct ggactttgcc atcaacaagc 1261 tcaagactgg gggctcacgt cctggctcct atgttctccg ccgcagcccc caggactttg 1321 acagcttcct cctcactgtc tgtgtccaga acccccttgg tcctgattat aagggctgcc 1381 tcatccggcg cagccccaca ggaaccttcc ttctggttgg cctcagccga ccccacagca 1441 gtcttcgaga gctcctggca acctgctggg atggggggct gcacgtagat ggggtggcag 1501 tgaccctcac ttcctgctgt atccccagac ccaaagaaaa gtccaacctg atcgtggtcc 1561 agagaggtca cagcccaccc acatcatcct tggttcagcc ccaatcccaa taccagctga 1621 gtcagatgac atttcacaag atccctgctg acagcctgga gtggcatgag aacctgggcc 1681 atgggtcctt caccaagatt taccggggct gtcgccatga ggtggtggat ggggaggccc 1741 gaaagacaga ggtgctgctg aaggtcatgg atgccaagca caagaactgc atggagtcat 1801 tcctggaagc agcgagcttg atgagccaag tgtcgtaccg gcatctcgtg ctgctccacg 1861 gcgtgtgcat ggctggagac agcaccatgg tgcaggaatt tgtacacctg ggggccatag 1921 acatgtatct gcgaaaacgt ggccacctgg tgccagccag ctggaagctg caggtggtca 1981 aacagctggc ctacgccctc aactatctgg aggacaaagg cctgccccat ggcaatgtct 2041 ctgcccggaa ggtgctcctg gctcgggagg gggctgatgg gagcccgccc ttcatcaagc 2101 tgagtgaccc tggggtcagc cccgctgtgt taagcctgga gatgctcacc gacaggatcc 2161 cctgggtggc ccccgagtgt ctccgggagg cgcagacact tagcttggaa gctgacaagt 2221 ggggcttcgg cgccacggtc tgggaagtgt ttagtggcgt caccatgccc atcagtgccc 2281 tggatcctgc taagaaactc caattttatg aggaccggca gcagctgccg gcccccaagt 2341 ggacagagct ggccctgctg attcaacagt gcatggccta tgagccggtc cagaggccct 2401 ccttccgagc cgtcattcgt gacctcaata gcctcatctc ttcagactat gagctcctct 2461 cagaccccac acctggtgcc ctggcacctc gtgatgggct gtggaatggt gcccagctct 2521 atgcctgcca agaccccacg atcttcgagg agagacacct caagtacatc tcacagctgg 2581 gcaagggcaa ctttggcagc gtggagctgt gccgctatga cccgctagcc cacaatacag 2641 gtgccctggt ggccgtgaaa cagctgcagc acagcgggcc agaccagcag agggactttc 2701 agcgggagat tcagatcctc aaagcactgc acagtgattt cattgtcaag tatcgtggtg 2761 tcagctatgg cccgggccgg ccagagctgc ggctggtcat ggagtacctg cccagcggct 2821 gcttgcgcga cttcctgcag cggcaccgcg cgcgcctcga tgccagccgc ctccttctct 2881 attcctcgca gatctgcaag ggcatggagt acctgggctc ccgccgctgc gtgcaccgcg 2941 acctggccgc ccgaaacatc ctcgtggaga gcgaggcaca cgtcaagatc gctgacttcg 3001 gcctagctaa gctgctgccg cttgacaaag actactacgt ggtccgcgag ccaggccaga 3061 gccccatttt ctggtatgcc cccgaatccc tctcggacaa catcttctct cgccagtcag 3121 acgtctggag cttcggggtc gtcctgtacg agctcttcac ctactgcgac aaaagctgca 3181 gcccctcggc cgagttcctg cggatgatgg gatgtgagcg ggatgtcccc gccctctgcc 3241 gcctcttgga actgctggag gagggccaga ggctgccggc gcctcctgcc tgccctgctg 3301 aggttcacga gctcatgaag ctgtgctggg cccctagccc acaggaccgg ccatcattca 3361 gcgccctggg cccccagctg gacatgctgt ggagcggaag ccgggggtgt gagactcatg 3421 ccttcactgc tcacccagag ggcaaacacc actccctgtc cttttcatag ctcctgcccg 3481 cagacctctg gattaggtct ctgttgactg gctgtgtgac cttaggcccg gagctgcccc 3541 tctctgggcc tcagaggcct tatgagggtc ctctacttca ggaacacccc catgacattg 3601 catttggggg ggctcccgtg gcctgtagaa tagcctgtgg cctttgcaat ttgttaaggt 3661 tcaagacaga tgggcatatg tgtcagtggg gctctctgag tcctggccca aagaagcaag 3721 gaaccaaatt taagactctc gcatcttccc aaccccttaa gccctggccc cctgagtttc 3781 cttttctcgt ctctctcttt ttattttttt tatttttatt tttatttttg agacagagcc 3841 tcgctcgtta cccagggtgg agtgcagtgg tagcgatctc ggctcacagt gcaacctctg 3901 cttcccaggt tcaagcgatt ctcctgcctc agcctcccga gtagctggga ttacaggtgt 3961 gcaccaccac acccggctaa ttttttttat ttttaataga gatgaggttt caccatgatg 4021 gccaggctga tctcgaactc ctaacctcaa gtgatcctcc cacc // LOCUS HSU09648 3090 bp mRNA PRI 30-NOV-1995 DEFINITION Human carnitine palmitoyltransferase II precursor (CPT1) mRNA, complete cds. ACCESSION U09648 NID g1086454 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3090) AUTHORS Verderio,E., Cavadini,P., Montermini,L., Wang,H., Lamantea,E., Finocchiaro,G., DiDonato,S., Gellera,C. and Taroni,F. TITLE Carnitine palmitoyltransferase II deficiency: structure of the gene and characterization of two novel disease-causing mutations JOURNAL Hum. Mol. Genet. 4 (1), 19-29 (1995) MEDLINE 95227173 REFERENCE 2 (bases 416 to 668) AUTHORS Finocchiaro,G., Taroni,F., Rocchi,M., Martin,A.L., Colombo,I., Tarelli,G.T. and DiDonato,S. TITLE cDNA cloning, sequence analysis, and chromosomal localization of the gene for human carnitine palmitoyltransferase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88 (2), 661-665 (1991) MEDLINE 91110588 REFERENCE 3 (bases 416 to 668) AUTHORS Finocchiaro,G., Taroni,F., Rocchi,M., Liras Martin,A., Colombo,I., Tarelli,G.T. and DiDonato,S. TITLE cDNA cloning, sequence analysis, and chromosomal localization of human carnitine palmitoyltransferase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88 (23), 10981 (1991) MEDLINE 92073411 REFERENCE 4 (bases 1 to 3090) AUTHORS Gellera,C., Verderio,E., Florida,G., Finocchiaro,G., Montermini,L., Zuffardi,O. and Taroni,F. TITLE The human gene for carnitine palmitoyltransferase II (CPT1) maps to 1p32 and not to 1p11-p13 as previously reported JOURNAL Unpublished REFERENCE 5 (bases 1618 to 1618; 2407 to 2407; 2455 to 2455) AUTHORS Taroni,F., Verderio,E., Fiorucci,S., Cavadini,P., Finocchiaro,G., Uziel,G., Lamantea,E., Gellera,C. and DiDonato,S. TITLE Molecular characterization of inherited carnitine palmitoyltransferase II deficiency JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (18), 8429-8433 (1992) MEDLINE 92409529 REFERENCE 6 (bases 1618 to 1618; 2455 to 2455) AUTHORS Verderio,E., Cavadini,P., Pandolfo,M., DiDonato,S. and Taroni,F. TITLE Two novel sequence polymorphisms of the human carnitine palmitoyltransferase II (CPT1) gene JOURNAL Hum. Mol. Genet. 2 (3), 334 (1993) MEDLINE 93272002 REFERENCE 7 (bases 1 to 3090) AUTHORS Taroni,F. TITLE Direct Submission JOURNAL Submitted (12-MAY-1994) Franco Taroni, Divisione di Biochimica e Genetica, Istituto Nazionale Neurologico Carlo Besta, via Celoria 11, Milano I-20133, Italy FEATURES Location/Qualifiers source 1..3090 /organism="Homo sapiens" /db_xref="taxon:9606" /map="1p32" /chromosome="1" gene 517..2493 /gene="CPT1" transit_peptide 517..591 /gene="CPT1" /note="mitochondrial leader peptide" /citation=[1] /citation=[2] /citation=[3] CDS 517..2493 /gene="CPT1" /citation=[1] /citation=[2] /citation=[3] /codon_start=1 /evidence=experimental /product="carnitine palmitoyltransferase II precursor" /db_xref="PID:g1041195" /translation="MVPRLLLRAWPRGPAVGPGAPSRPLSAGSGPGQYLQRSIVPTMH YQDSLPRLPIPKLEDTIRRYLSAQKPLLNDGQFRKTEQFCKSFENGIGKELHEQLVAL DKQNKHTSYISGPWFDMYLSARDSVVLNFNPFMAFNPDPKSEYNDQLTRATNMTVSAI RFLKTLRAGLLEPEVFHLNPAKSDTITFKRLIRFVPSSLSWYGAYLVNAYPLDMSQYF RLFNSTRLPKPSRDELFTDDKARHLLVLRKGNFYIFDVLDQDGNIVSPSEIQAHLKYI LSDSSPAPEFPLAYLTSENRDIWAELRQKLMSSGNEESLRKVDSAVFCLCLDDFPIKD LVHLSHNMLHGDGTNRWFDKSFNLIIAKDGSTAVHFEHSWGDGVAVLRFFNEVFKDST QTPAVTPQSQPATTDSTVTVQKLNFELTDALKTGITAAKEKFDATMKTLTIDCVQFQR GGKEFLKKQKLSPDAVAQLAFQMAFLRQYGQTVATYESCSTAAFKHGRTETIRPASVY TKRCSEAFVREPSRHSAGELQQMMVECSKYHGQLTKEAAMGQGFDRHLFALRHLAAAK GIILPELYLDPAYGQINHNVLSTSTLSSPAVNLGGFAPVVSDGFGVGYAVHDNWIGCN VSSYPGRNAREFLQCVEKALEDMFDALEGKSIKS" mat_peptide 592..2490 /gene="CPT1" /EC_number="2.3.1.21" /citation=[1] /citation=[2] /citation=[3] /product="carnitine palmitoyltransferase II" mutation 665 /gene="CPT1" /note="disease-causing mutation (proline-to-histidine substitution at codon 50)" /citation=[1] /phenotype="recessively inherited exercise-induced myoglobinuria with carnitine palmitoyltransferase II deficiency" /replace="a" mutation 854 /gene="CPT1" /note="disease-causing mutation (serine-to-leucine substitution at codon 113)" /phenotype="recessively inherited exercise-induced myoglobinuria with carnitine palmitoyltransferase II deficiency" /replace="t" allele 1618 /gene="CPT1" /note="polymorphism (valine-to-isoleucine substitution at codon 368)" /citation=[5] /citation=[6] /frequency="0.51" /replace="a" mutation 2173 /gene="CPT1" /note="disease-causing mutation (aspartic acid-to-asparagine substitution at codon 553)" /citation=[1] /phenotype="recessively inherited exercise-induced myoglobinuria with carnitine palmitoyltransferase II deficiency" /replace="a" mutation 2407 /gene="CPT1" /note="disease-causing mutation (arginine-to-cysteine substitution at codon 631)" /citation=[5] /phenotype="recessively inherited hypoketotic hypoglycemia and cardiomyopathy with carnitine palmitoyltransferase II deficiency" /replace="t" allele 2455 /gene="CPT1" /note="polymorphism (methionine-to-valine substitution at codon 647)" /citation=[5] /citation=[6] /frequency="0.25" /replace="g" old_sequence 2557..2560 /citation=[2] /citation=[3] /replace="tgc" old_sequence 2671 /citation=[2] /citation=[3] /replace="g" polyA_signal 3069..3074 /citation=[1] polyA_site 3090 /citation=[1] /evidence=experimental BASE COUNT 752 a 801 c 809 g 728 t ORIGIN 1 ctcatatcta gaattttggg taggtacttt gaatcattac gatctatcta cttcttaggt 61 gaggaaatag aggtttaaaa tttagtccac agtctcgcaa ggatggagcc tggatcaaat 121 tttgggttat cagattccaa tcacgtttct tagcttttct tttttttttc caactccagt 181 ttctgtcttg ctccaaaaaa ggggaaggag cggctgcggc gctcggtttc ccgcctccta 241 gggaagggaa gggagacgag caacgcggag gctggggccc ccttccgggc ggggcctact 301 agtgggcggg gcctgtcagt gagcggcccc tgcccggaag gagccagtcc ggggcggagc 361 cgatggcctt acaggggccg gaagtggcct gcgggcggag aagtgcctca ggagtcctga 421 cgcagtgtct tgggcgctaa cggcggcggc ggccttgtgt ttagactcca gaactcccca 481 cttgccgcgt tctcgccgcc gcaggctccc gggacgatgg tgccccgcct gctgctgcgc 541 gcctggcccc ggggccccgc ggttggtccg ggagccccca gtcggcccct cagcgccggc 601 tccgggcccg gccagtacct gcagcgcagc atcgtgccca ccatgcacta ccaggacagc 661 ctgcccaggc tgcctattcc caaacttgaa gacaccatta ggagatacct cagtgcacag 721 aagcctctct tgaatgatgg ccagttcagg aaaacagaac aattttgcaa gagttttgaa 781 aatgggattg gaaaagaact gcatgagcag ctggttgctc tggacaaaca gaataaacat 841 acaagctaca tttcgggacc ctggtttgat atgtacctat ctgctcgaga ctccgttgtt 901 ctgaacttta atccatttat ggctttcaat cctgacccaa aatctgagta taatgaccag 961 ctcacccggg caaccaacat gactgtttct gccatccggt ttctgaagac actccgggct 1021 ggccttctgg agccagaagt gttccacttg aaccctgcaa aaagtgacac tatcaccttc 1081 aagagactca tacgctttgt gccttcctct ctgtcctggt atggggccta cctggtcaat 1141 gcgtatcccc tggatatgtc ccagtatttt cggcttttca actcaactcg tttacccaaa 1201 cccagtcggg atgaactctt cactgatgac aaggccagac acctcctggt cctaaggaaa 1261 ggaaattttt atatctttga tgtcctggat caagatggga acattgtgag cccctcggaa 1321 atccaggcac atctgaagta cattctctca gacagcagcc ccgcccccga gtttcccctg 1381 gcatacctga ccagtgagaa ccgagacatc tgggcagagc tcaggcagaa gctgatgagt 1441 agtggcaatg aggagagcct gaggaaagtg gactcggcag tgttctgtct ctgcctagat 1501 gacttcccca ttaaggacct tgtccacttg tcccacaata tgctgcatgg ggatggcaca 1561 aaccgctggt ttgataaatc ctttaacctc attatcgcca aggatggctc tactgccgtc 1621 cactttgagc actcttgggg tgatggtgtg gcagtgctca gattttttaa tgaagtattt 1681 aaagacagca ctcagacccc tgccgtcact ccacagagcc agccagctac cactgactct 1741 actgtcacgg tgcagaaact caacttcgag ctgactgatg ccttaaagac tggcatcaca 1801 gctgctaagg aaaagtttga tgccaccatg aaaaccctca ctattgactg cgtccagttt 1861 cagagaggag gcaaagaatt cctgaagaag caaaagctga gccctgacgc agttgcccag 1921 ctggcattcc agatggcctt cctgcggcag tacgggcaga cagtggccac ctacgagtcc 1981 tgtagcactg ccgcattcaa gcacggccgc actgagacca tccgcccggc ctccgtctat 2041 acaaagaggt gctctgaggc ctttgtcagg gagccctcca ggcacagtgc tggtgagctt 2101 cagcagatga tggttgagtg ctccaagtac catggccagc tgaccaaaga agcagcaatg 2161 ggccagggct ttgaccgaca cttgtttgct ctgcggcatc tggcagcagc caaagggatc 2221 atcttgcctg agctctacct ggaccctgca tacgggcaga taaaccacaa tgtcctgtcc 2281 acgagcacac tgagcagccc agcagtgaac cttgggggct ttgcccctgt ggtctctgat 2341 ggctttggtg ttgggtatgc tgttcatgac aactggatag gctgcaatgt ctcttcctac 2401 ccaggccgca atgcccggga gtttctccaa tgtgtggaga aggccttaga agacatgttt 2461 gatgccttag aaggcaaatc catcaaaagt taacttctgg gcagatgaaa agctaccatc 2521 acttcctcat catgaaaact gggaggccgg gcatggtggc tcatgcctgt aatcccagca 2581 ttttgagagg ctgaggcggg tggatcactt gaggtcagga gtttgagacc aacctggcca 2641 acatggtgaa accttgtctc tactaaaaat acaagaatta gctgggtgtg gtggcatgtg 2701 cctatatccc agctactggg aggttgaagc agaattgctt gaacccagga ggtggaggtt 2761 gcagtgagct gagatcacac cactgcactc cggcctgggc gacagagcga gactgtctca 2821 aaaagacaaa aaagaaaaaa aactggggcc tgtgtagcca gtgggtgcta ttctgtgaaa 2881 ctaatcataa gctgcctagg cagccagcta caggcttgag ctttaaattc atggttttaa 2941 agctaaacgt aatttccact tgggactaga tcacaactga agataacaag agatttaagt 3001 tttaagggca tttaatcagg aggaaaggtt tggaaaacta actcaggtgt atttattgtt 3061 taagcagaaa taaagtttaa tttttgcttg // LOCUS HSU09759 1873 bp mRNA PRI 25-AUG-1995 DEFINITION Human protein kinase (JNK2) mRNA, complete cds. ACCESSION U09759 NID g607785 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1873) AUTHORS Kallunki,T., Su,B., Tsigelny,I., Sluss,H.K., Derijard,B., Moore,G., Davis,R. and Karin,M. TITLE JNK2 contains a specificity-determining region responsible for efficient c-Jun binding and phosphorylation JOURNAL Genes Dev. 8 (24), 2996-3007 (1994) MEDLINE 95095084 REFERENCE 2 (bases 1 to 1873) AUTHORS Kallunki,T. TITLE Direct Submission JOURNAL Submitted (13-MAY-1994) Tuula Kallunki, Pharmacology, University of California, San Diego School of Medicine, 9500 Gilman Drive, La Jolla, CA 92093-0636, USA FEATURES Location/Qualifiers source 1..1873 /organism="Homo sapiens" /db_xref="taxon:9606" gene 152..1426 /gene="JNK2" CDS 152..1426 /gene="JNK2" /codon_start=1 /product="protein kinase" /db_xref="PID:g607786" /translation="MSDSKCDSQFYSVQVADSTFTVLKRYQQLKPIGSGAQGIVCAAF DTVLGINVAVKKLSRPFQNQTHAKRAYRELVLLKCVNHKNIISLLNVFTPQKTLEEFQ DVYLVMELMDANLCQVIHMELDHERMSYLLYQMLCGIKHLHSAGIIHRDLKPSNIVVK SDCTLKILDFGLARTACTNFMMTPYVVTRYYRAPEVILGMGYKENVDIWSVGCIMGEL VKGCVIFQGTDHIDQWNKVIEQLGTPSAEFMKKLQPTVRNYVENRPKYPGIKFEELFP DWIFPSESERDKIKTSQARDLLSKMLVIDPDKRISVDEALRHPYITVWYDPAEAEAPP PQIYDAQLEEREHAIEEWKELIYKEVMDWEERSKNGVVKDQPPDAAVSSNATPSQSSS INDISSMSTEQTLASDTDSSLDASTGPLEGCR" BASE COUNT 562 a 388 c 427 g 496 t ORIGIN 1 caaactacgt gctgtacagc tgcatcagct gctcgtagac atgtccagca gctggtcgag 61 gtccacgccg cggtaggtga agttgcggaa ggtccggcga gggatctgaa acttgcccct 121 tacccttcgg gatattgcag gacgctgcat catgagcgac agtaaatgtg acagtcagtt 181 ttatagtgtc caagtggcag actcaacctt cactgtccta aaacgttacc agcagctgaa 241 accaattggc tctggggccc aagggattgt ttgtgctgca tttgatacag ttcttgggat 301 aaatgttgca gtcaagaaac taagccgtcc ttttcagaac caaactcatg caaagagagc 361 ttatcgtgaa cttgtcctct taaaatgtgt caatcataaa aatataatta gtttgttaaa 421 tgtgtttaca ccacaaaaaa ctctagaaga atttcaagat gtgtatttgg ttatggaatt 481 aatggatgct aacttatgtc aggttattca catggagctg gatcatgaaa gaatgtccta 541 ccttctttac cagatgcttt gtggtattaa acatctgcat tcagctggta taattcatag 601 agatttgaag cctagcaaca ttgttgtgaa atcagactgc accctgaaga tccttgactt 661 tggcctggcc cggacagcgt gcactaactt catgatgacc ccttacgtgg tgacacggta 721 ctaccgggcg cccgaagtca tcctgggtat gggctacaaa gagaacgttg atatctggtc 781 agtgggttgc atcatgggag agctggtgaa aggttgtgtg atattccaag gcactgacca 841 tattgatcag tggaataaag ttattgagca gctgggaaca ccatcagcag agttcatgaa 901 gaaacttcag ccaactgtga ggaattatgt cgaaaacaga ccaaagtatc ctggaatcaa 961 atttgaagaa ctctttccag attggatatt cccatcagaa tctgagcgag acaaaataaa 1021 aacaagtcaa gccagagatc tgttatcaaa aatgttagtg attgatcctg acaagcggat 1081 ctctgtagac gaagctctgc gtcacccata catcactgtt tggtatgacc ccgccgaagc 1141 agaagcccca ccacctcaaa tttatgatgc ccagttggaa gaaagagaac atgcaattga 1201 agaatggaaa gagctaattt acaaagaagt catggattgg gaagaaagaa gcaagaatgg 1261 tgttgtaaaa gatcagcctc cagatgcagc agtaagtagc aacgccactc cttctcagtc 1321 ttcatcgatc aatgacattt catccatgtc cactgagcag acgctggcct cagacacaga 1381 cagcagtctt gatgcctcga cgggacccct tgaaggctgt cgatgatagg ttagaaatag 1441 caaacctgtc agcattgaag gaactctcac ctccgtgggc ctgaaatgct tgggagttga 1501 tggaaccaaa tagaaaaact ccatgttctg catgtaagaa acacaatgcc ttgccctact 1561 cagacctgat aggattgcct gcttagatga taaaatgagg cagaatatgt ctgaagaaaa 1621 aaattgcaag ccacacttct agagattttg ttcaagatca tttcagttga gcagttagag 1681 taggtgaatt tgtcaaattg tactagtgac agtttctcat catctgtaac tgttgagatg 1741 attgtgcatg tgaccacaaa tgcttgcttg gacttgccca tctagcactt tggaaatcag 1801 tatttaaatg ccaaataatc ttccaggtag tgctgcttct gaagttatct cttaatcctc 1861 ttaagtaatt tgg // LOCUS HSU09770 416 bp mRNA PRI 30-MAR-1995 DEFINITION Human cysteine-rich heart protein (hCRHP) mRNA, complete cds. ACCESSION U09770 NID g719268 KEYWORDS cysteine-rich portein; LIM motif; zinc-finger; heart development. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 416) AUTHORS Tsui,S.K., Yam,N.Y., Lee,C.Y. and Waye,M.M. TITLE Isolation and characterization of a cDNA that codes for a LIM-containing protein which is developmentally regulated in heart JOURNAL Biochem. Biophys. Res. Commun. 205 (1), 497-505 (1994) MEDLINE 95091772 REFERENCE 2 (bases 1 to 416) AUTHORS Waye,M.M.Y. TITLE Direct Submission JOURNAL Submitted (14-MAY-1994) Mary M.Y. Waye, Department of Biochemistry, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong, Hong Kong FEATURES Location/Qualifiers source 1..416 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="A550" gene 65..298 /gene="hCRHP" CDS 65..298 /gene="hCRHP" /codon_start=1 /product="cysteine-rich heart protein" /db_xref="PID:g719269" /translation="MPKCPKCNKEVYFAERVTSLGKDWHRPCLKCEKCGKTLTSGGHA EHEGKPYCNHPCYVAMFGPKGFGRGGAESHTFK" polyA_site 416 BASE COUNT 87 a 130 c 123 g 76 t ORIGIN 1 agagtctcgc actgtagccc gtgccgcccc agccgctgcc gcctgcaccg gacccggagc 61 cgccatgccc aagtgtccca agtgcaacaa ggaggtgtac ttcgccgaga gggtgacctc 121 tctgggcaag gactggcatc ggccctgcct gaagtgcgag aaatgtggga agacgctgac 181 ctctgggggc cacgctgagc acgaaggcaa accctactgc aaccacccct gctacgtagc 241 catgtttggg cctaaaggct ttgggcgggg cggagccgag agccacactt tcaagtaaac 301 caggtggtgg agacccatcc ttggctgctt gcaggccact gtccaggcaa attccaggcc 361 ttgtcccaga tgccaggatc ccttgttgcc taatgctcta gtaacctgac attgga // LOCUS HSU09813 826 bp mRNA PRI 05-OCT-1995 DEFINITION Human mitochondrial ATP synthase subunit 9, P3 gene copy, mRNA, nuclear gene encoding mitochondrial protein, complete cds. ACCESSION U09813 NID g1008454 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 826) AUTHORS Yan,W.L., Lerner,T.J., Haines,J.L. and Gusella,J.F. TITLE Sequence analysis and mapping of a novel human mitochondrial ATP synthase subunit 9 cDNA (ATP5G3) JOURNAL Genomics 24 (2), 375-377 (1994) MEDLINE 95213032 REFERENCE 2 (bases 1 to 826) AUTHORS Lerner,T.J. TITLE Direct Submission JOURNAL Submitted (17-MAY-1994) Terry J. Lerner, Molecular Neurogenetics Unit, Building 149 13th St., Massachusetts General Hospital, Charlestown, MA 02129, USA FEATURES Location/Qualifiers source 1..826 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="subc-21" /clone_lib="liver cDNA in lambda gt11" /chromosome="2" mRNA 1..826 /gene="P3" /product="mitochondrial ATP synthase subunit 9 precursor" gene 1..826 /gene="P3" sig_peptide 255..455 /gene="P3" CDS 255..683 /gene="P3" /codon_start=1 /function="proton transport" /product="mitochondrial ATP synthase subunit 9 precursor" /db_xref="PID:g511450" /translation="MFACAKLACTPSLIRAGSRVAYRPISASVLSRPEASRTGEGSTV FNGAQNGVSQLIQREFQTSAISRDIDTAAKFIGAGAATVGVAGSGAGIGTVFGSLIIG YARNPSLKQQLFSYAILGFALSEAMGLFCLMVAFLILFAM" mat_peptide 456..680 /gene="P3" /product="mitochondrial ATP synthase subunit 9" polyA_site 826 /gene="P3" /note="19 A nucleotides" BASE COUNT 182 a 198 c 225 g 221 t ORIGIN 1 aattccggaa ttccccggcc tggccagggc ggagcggcgc ccagtaggga cccattcatt 61 gtgccggcgc ctcactgggc acggggccca agctgacggc gtgcacggga gcctgcggag 121 cctgggtggg aagaacaggc ccctggaggg cacttgacct taagcctctt ttcctccgca 181 gagaggaagc gggagaggag cccacgtcgc ctgtcaccca atatctccag ccgcgcagtc 241 ccgaagagtg taagatgttc gcctgcgcca agctcgcctg caccccctct ctgatccgag 301 ctggatccag agttgcatac agaccaattt ctgcatcagt gttatctcga ccagaggcta 361 gtaggactgg agagggctct acggtattta atggggccca gaatggtgtg tctcagctaa 421 tccaaaggga gtttcagacc agtgcaatca gcagagacat tgatactgct gccaaattta 481 ttggtgcagg tgctgcaaca gtaggagtgg ctggttctgg tgctggtatt ggaacagtct 541 ttggcagcct tatcattggt tatgccagaa acccttcgct gaagcagcag ctgttctcat 601 atgctatcct gggatttgcc ttgtctgaag ctatgggtct cttttgtttg atggttgctt 661 tcttgatttt gtttgccatg taacaaatta ctgcttgaca tgttggcatt catattaatt 721 acggatgtaa ttctgtgtat cttactgtga ctccgaaaac tgtagtattg gtgtcatggg 781 aatgtacgtt atttccaaag tcatttcatt aaagatgaaa acttta // LOCUS HSU09820 6115 bp mRNA PRI 11-JAN-1995 DEFINITION Human helicase II (RAD54L) mRNA, complete cds. ACCESSION U09820 NID g606832 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6115) AUTHORS Stayton,C.L., Dabovic,B., Gulisano,M., Gecz,J., Broccoli,V., Giovanazzi,S., Bossolasco,M., Monaco,L., Rastan,S., Boncinelli,E., Bianchi,M.E. and Consalez,G.G. TITLE Cloning and characterization of a new human Xq13 gene, encoding a putative helicase JOURNAL Hum. Mol. Genet. 3 (11), 1957-1964 (1994) MEDLINE 95179111 REFERENCE 2 (bases 1 to 6115) AUTHORS Consalez,G.G. TITLE Direct Submission JOURNAL Submitted (18-MAY-1994) Gian Giacomo Consalez, Dept. Biol. and Technol. Research, San Raffaele Scientific Institute, Via Olgettina 58, Milano, I-20132, Italy FEATURES Location/Qualifiers source 1..6115 /organism="Homo sapiens" /db_xref="taxon:9606" gene 54..4979 /gene="RAD54L" CDS 54..4979 /gene="RAD54L" /note="human RAD54-like protein; previous name XH2; start site at nucleotide 54 is putative" /codon_start=1 /product="helicase II" /db_xref="PID:g606833" /translation="MDNQGHKNLKTSQEGSSDDRERKQERETFSSAEGTVDKDTTIME LRDRLPKKQQASASTDGVDKLSGKEQSFTSLEVRKVAETKEKSKHLKTKTCKKVQDGL SDIAEKFLKKDQSDETSEDDKKQSKKGTEEKKKPSDFKKKVIKMEQQYESSSDGTEKL PEREEICHFPKGIKQIKNGTTDGEKKSKKIRDKTSKKKDELSDYAEKSTGKGDSCDSS EDKKSKNGAYGREKKRCKLLGKSSRKRQDCSSSDTEKYSMKEDGCNSSDKRLKRIELR ERRNLSSKRNTKEIQSGSSSSDAEESSEDNKKKKQRTSSKKKAVIVKEKKRNSLRTST KRKQADITSSSSSDIEDDDQNSIGEGSSDEQKIKPVTENLVLSSHTGFCQSSGDEALS KSVPVTVDDDDDDNDPENRIAKKMLLEEIKANLSSDEDGSSDDEPEEGKKRTGKQNEE NPGDEEAKNQVNSESDSDSEESKKPRYRHRLLRHKLTVSDGESGEEKKTKPKEHKEVK GRNRRKVSSEDSEDSDFQESGVSEEVSESEDEQRPRTRSAKKAELEENQRSYKQKKKR RRIKVQEDSSSENKSNSEEEEEEKEEEEEEEEEEEEEEEDENDDSKSPGKGRKKIRKI LKDDKLRTETQNALKEEEERRKRIAEREREREKLREVIEIEDASPTKCPITTKLVLDE DEETKEPLVQVHRNMVIKLKPHQVDGVQFMWDCCCESVKKTKKSPGSGCILAHCMGLG KTLQVVSFLHTVLLCDKLDFSTALVGLSSSILAFNWMNEFEKWQEGLKDDEKLEVSEL ATVKRPQERSYMLQRWQEDGGVMIIGYEMYRNLAQGRNVKSRKLKEIFNKALVDPGPD FVVCDEGHILKNEASAVSKAMNSIRSRRRIILTGTPLQNNLIEYHCMVNFIKENLLGS IKEFRNRFINPIQNGQCADSTMVDVRVMKKRAHILYEMLAGCVQRKDYTALTKFLPPK HEYVLAVRMTSIQCKLYQYYLDHLTGVGNNSEGGRGKAGAKLFQDFQMLSRIWTHPWC LQLDYISKENKGYFDEDSMDEFIASDSDETSMSLSSDDYTKKKKKGKKGKKDSSSSGS GSDNDVEVIKVWNSRSRGGGEGNVDETGNNPSVSLKLEESKATSSSNPSSPAPDWYKD FVTDADAEVLEHSGKMVLLFEILRMAEEIGDKVLVFSQSLISLDLIEDFLELASREKT EDKDKPLIYKGEGKWLRNIDYYRLDGSTTAQSRKKWAEEFNDETNVRGRLFIISTKAG SLGINLVAANRVIIFDASWNPSYDIQSIFRVYRFGQTKPVYVYRFLAQGTMEDKIYDR QVTKQSLSFRVVDQQQVERHFTMNELTELYTFEPDLLDDPNSEKKKKRDTPMLPKDTI LAELLQIHKEHIVGYHEHDSLLTTKKKKRLTEEERKAAWAEYEGEKRVLTMRFNIPTG TNLPPVSFNSQTPYIPFNLGALSAMSNQQLEDLINQGREKVVEATNSVTAVRIQPLED IISAVWKENMNLSEAQVQALALSRQASQELDVKRREAIYNDVLTKQQMLIQLCSANTY EQKAPAAVQSAATATNDLSTTTLGHHMMPKPRNLIMNPSNYQQIDMRGMYQPVAGGMQ PPPLQRCTTPNEKQKIQDLPKGNQCDFALKA" misc_feature 901..982 /gene="RAD54L" /note="putative nuclear targeting sequence" misc_feature 2244..2291 /gene="RAD54L" /note="helicase II superfamily domain I" misc_feature 2625..2645 /gene="RAD54L" /note="helicase II superfamily domain II" misc_feature 2709..2750 /gene="RAD54L" /note="helicase II superfamily domain III" misc_feature 2769..2798 /gene="RAD54L" /note="helicase II superfamily domain IV" misc_feature 3816..3878 /gene="RAD54L" /note="helicase II superfamily domain V" misc_feature 3897..3947 /gene="RAD54L" /note="helicase II superfamily domain VI" BASE COUNT 2180 a 969 c 1346 g 1620 t ORIGIN 1 caaatacaaa agattttgac tcttctgaag atgagaaaca cagcaaaaaa ggaatggata 61 atcaagggca caaaaatttg aagacctcac aagaaggatc atctgatgat cgtgaaagaa 121 aacaagagag agagactttc tcttcagcag aaggcacagt tgataaagac acgaccatca 181 tggaattaag agatcgactt cctaagaagc agcaagcaag tgcttccact gatggtgtcg 241 ataagctttc tgggaaagag cagagtttta cttctttgga agttagaaaa gttgctgaaa 301 ctaaagaaaa gagcaagcat ctcaaaacca aaacatgtaa aaaagtacag gatggcttat 361 ctgatattgc agagaaattc ctaaagaaag accagagcga tgaaacttct gaagatgata 421 aaaagcagag caaaaaggga actgaagaaa aaaagaaacc ttcagacttt aagaaaaaag 481 taattaaaat ggaacaacag tatgaatctt catctgatgg cactgaaaag ttacctgagc 541 gagaagaaat ttgtcatttt cctaagggca taaaacaaat taagaatgga acaactgatg 601 gagaaaagaa aagtaaaaaa ataagagata aaacttctaa aaagaaggat gaattatctg 661 attatgctga gaagtcaaca gggaaaggag atagttgtga ctcttcagag gataaaaaga 721 gtaagaatgg agcatatggt agagagaaga aaaggtgcaa gttgcttgga aagagttcaa 781 ggaagagaca agattgttca tcatctgata ctgagaaata ttccatgaaa gaagatggtt 841 gtaactcttc tgataagaga ctgaaaagaa tagaattgag ggaaagaaga aatttaagtt 901 caaagagaaa tactaaggaa atacaaagtg gctcatcatc atctgatgct gaggaaagtt 961 ctgaagataa taaaaagaag aagcaaagaa cttcatctaa aaagaaggca gtcattgtca 1021 aggagaaaaa gagaaactcc ctaagaacaa gcactaaaag gaagcaagct gacattacat 1081 cctcatcttc ttctgatata gaagatgatg atcagaattc tataggtgag ggaagcagcg 1141 atgaacagaa aattaagcct gtcactgaaa atttagtgct gtcttcacat actggatttt 1201 gccaatcttc aggagatgaa gccttatcta aatcagtgcc tgtcacagtg gatgatgatg 1261 atgacgacaa tgatcctgag aatagaattg ccaagaagat gcttttagaa gaaattaaag 1321 ccaatctttc ctctgatgag gatggatctt cagatgatga gccagaagaa gggaaaaaaa 1381 gaactggaaa acaaaatgaa gaaaacccag gagatgagga agcaaaaaat caagtcaatt 1441 ctgaatcaga ttcagattct gaagaatcta agaagccaag atacagacat aggcttttgc 1501 ggcacaaatt gactgtgagt gacggagaat ctggagaaga aaaaaagaca aagcctaaag 1561 agcataaaga agtcaaaggc agaaacagaa gaaaggtgag cagtgaagat tcagaagatt 1621 ctgattttca ggaatcagga gttagtgaag aagttagtga atccgaagat gaacagcggc 1681 ccagaacaag gtctgcaaag aaagcagagt tggaagaaaa tcagcggagc tataaacaga 1741 aaaagaaaag gcgacgtatt aaggttcaag aagattcatc cagtgaaaac aagagtaatt 1801 ctgaggaaga agaggaggaa aaagaagagg aggaggaaga ggaggaggag gaggaagagg 1861 aggaggaaga tgaaaatgat gattccaagt ctcctggaaa aggcagaaag aaaattcgga 1921 agattcttaa agatgataaa ctgagaacag aaacacaaaa tgctcttaag gaagaggaag 1981 agagacgaaa acgtattgct gagagggagc gtgagcgaga aaaattgaga gaggtgatag 2041 aaattgaaga tgcttcaccc accaagtgtc caataacaac caagttggtt ttagatgaag 2101 atgaagaaac caaagaacct ttagtgcagg ttcatagaaa tatggttatc aaattgaaac 2161 cccatcaagt agatggtgtt cagtttatgt gggattgctg ctgtgagtct gtgaaaaaaa 2221 caaagaaatc tccaggttca ggatgcattc ttgcccactg tatgggcctt ggtaagactt 2281 tacaggtggt aagttttctt catacagttc ttttgtgtga caaactggat ttcagcacgg 2341 cgttagtggg tttgtcctcc tcaatacttg cttttaattg gatgaatgaa tttgagaagt 2401 ggcaagaggg attaaaagat gatgagaagc ttgaggtttc tgaattagca actgtgaaac 2461 gtcctcagga gagaagctac atgctgcaga ggtggcaaga agatggtggt gttatgatca 2521 taggctatga gatgtataga aatcttgctc aaggaaggaa tgtgaagagt cggaaactta 2581 aagaaatatt taacaaagct ttggttgatc caggccctga ttttgttgtt tgtgatgaag 2641 gccatattct aaaaaatgaa gcatctgctg tttctaaagc tatgaattct atacgatcaa 2701 ggaggaggat tattttaaca ggaacaccac ttcaaaataa cctaattgag tatcattgta 2761 tggttaattt tatcaaggaa aatttacttg gatccattaa ggagttcagg aatagattta 2821 taaatccaat tcaaaatggt cagtgtgcag attctaccat ggtagatgtc agagtgatga 2881 aaaaacgtgc tcacattctc tatgagatgt tagctggatg tgttcagagg aaagattata 2941 cagcattaac aaaattcttg cctccaaaac acgaatatgt gttagctgtg agaatgactt 3001 ctattcagtg caagctctat cagtactact tagatcactt aacaggtgtg ggcaataata 3061 gtgaaggtgg aagaggaaag gcaggtgcaa agcttttcca agattttcag atgttaagta 3121 gaatatggac tcatccttgg tgtttgcagc tagactacat tagcaaagaa aataagggtt 3181 attttgatga agacagtatg gatgaattta tagcctcaga ttctgatgaa acctccatga 3241 gtttaagctc cgatgattat acaaaaaaga agaaaaaagg gaaaaagggg aaaaaagata 3301 gtagctcaag tggaagtggc agtgacaatg atgttgaagt gattaaggtc tggaattcaa 3361 gatctcgggg aggtggtgaa ggaaatgtgg atgaaacagg aaacaatcct tctgtttctt 3421 taaaactgga agaaagtaaa gctacttctt cttctaatcc aagcagccca gctccagact 3481 ggtacaaaga ttttgttaca gatgctgatg ctgaggtttt agagcattct gggaaaatgg 3541 tacttctctt tgaaattctt cgaatggcag aggaaattgg ggataaagtc cttgttttca 3601 gccagtccct catatctctg gacttgattg aagattttct tgaattagct agtagggaga 3661 agacagaaga taaagataaa ccccttattt ataaaggtga ggggaagtgg cttcgaaaca 3721 ttgactatta ccgtttagat ggttccacta ctgcacagtc aaggaagaag tgggctgaag 3781 aatttaatga tgaaactaat gtgagaggac gattatttat catttctact aaagcaggat 3841 ctctaggaat taatctggta gctgctaatc gagtaattat attcgacgct tcttggaatc 3901 catcttatga catccagagt atattcagag tttatcgctt tggacaaact aagcctgttt 3961 atgtatatag gttcttagct cagggaacca tggaagataa gatttatgat cggcaagtaa 4021 ctaagcagtc actgtctttt cgagttgttg atcagcagca ggtggagcgt cattttacta 4081 tgaatgagct tactgaactt tatacttttg agccagactt attagatgac cctaattcag 4141 aaaagaagaa gaagagggat actcccatgc tgccaaagga taccatactt gcagagctcc 4201 ttcagataca taaagaacac attgtaggat accatgaaca tgattctctt ttgaccacaa 4261 agaagaagaa gaggttgact gaagaagaaa gaaaagcagc ttgggctgag tatgaaggag 4321 agaagagggt actgaccatg cgtttcaaca taccaactgg gaccaattta ccccctgtca 4381 gtttcaactc tcaaactcct tatattcctt tcaatttggg agccctgtca gcaatgagta 4441 atcaacagct ggaggacctc attaatcaag gaagagaaaa agttgtagaa gcaacaaaca 4501 gtgtgacagc agtgaggatt caacctcttg aggatataat ttcagctgta tggaaggaga 4561 acatgaatct ctcagaggcc caagtacagg cgttagcatt aagtagacaa gccagccagg 4621 agcttgatgt taaacgaaga gaagcaatct acaatgatgt attgacaaaa caacagatgt 4681 taatccagct gtgttcagcg aatacttatg aacagaaggc tccagcagca gtacaatcag 4741 cagcaacagc aacaaatgac ttatcaacaa caacactggg tcaccacatg atgccaaagc 4801 cccgaaattt gatcatgaat ccttctaact accagcagat tgatatgaga ggaatgtatc 4861 agccagtggc tggtggtatg cagccaccac cattacagcg gtgcaccacc cccaatgaga 4921 agcaaaaaat ccaggacctt cccaagggaa atcaatgtga ttttgcacta aaagcttaat 4981 ggattgttaa aatcatagaa agatctttta tttttttagg aatcaatgac ttaacagaac 5041 tcaactgtat aaatagtttg gtccccttaa atgccaatct tccatattag ttttactttt 5101 ttttttttaa atagggcata ccatttcttc ctgacatttg tcagtgatgt tgcctagaat 5161 cttcttacac acgctgagta cagaagatat ttcaaattgt tttcagtgaa aacaagtcct 5221 tccataatag taacaactcc acagatttcc tctctaaatt tttatgcctg cttttagcaa 5281 ccataaaatt gtcataaaat taataaattt aggaaagaat aaagatttat atattcattc 5341 tttacatata aaaacacaca gctgagttct tagagttgat tcctcaagtt atgaaatact 5401 tttgtactta atccatttct tgattaaagt gattgaaatg gttttaatgt tcttttgagc 5461 tgaagtcctg aaactgggct cctgctttat tgtctctgtg acctgaaagt tagaaactga 5521 ggggttatct ttgacacaga atttgtgtgc aaatattctt aaatcctact gccctaaaag 5581 ttggagaagt cttgcagtta tcttagcatt gtataaacag ccttaagtag agcctaagaa 5641 gagaattcct ttccctcctt tagtccttct ccatttttta ttttcagtta tatgtgctga 5701 aataattact ggtaaaattc agggttgtgg attatcttcc acacatgaat tttctctctc 5761 ctggcacgaa tataaagcac atctcttaac tgcatggtgc cagtgctaat gcttcatcct 5821 gttgctggca gtgggatgtg gacttagaaa atcaagttct agcattttag taggttaaca 5881 ctgaagttgt ggttgttagg ttcacaccct gttttataaa caacatcaaa atggcagaac 5941 cattgctgac tttaggttca catgaggaat gtacttttaa caattcccag tactatcagt 6001 attgtggaaa taattcctct gaaagataag gatcactggc ttctatgcgc ttcttttctc 6061 tcatcatcat gttcttttac cccagtttcc ttacattttt taaattgttt cagag // LOCUS HSU09825 3595 bp mRNA PRI 30-MAR-1996 DEFINITION Human acid finger protein mRNA, complete cds. ACCESSION U09825 NID g563126 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3595) AUTHORS Chu,T.W., Capossela,A., Coleman,R., Goei,V.L., Nallur,G. and Gruen,J.R. TITLE Cloning of a new 'finger' protein gene (ZNF173) within the class I region of the human MHC JOURNAL Genomics 29 (1), 229-239 (1995) MEDLINE 96079113 REFERENCE 2 (bases 1 to 3595) AUTHORS Chu,T.W. TITLE Direct Submission JOURNAL Submitted (18-MAY-1994) Thomas W. Chu, Genetics, Yale University School of Medicine, 295 Congress Avenue, New Haven, CT 06536-0812, USA FEATURES Location/Qualifiers source 1..3595 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="D51.10, D5A1, J6K6" /clone_lib="JY library in Charon BS of Anand Swaroop, human kidney in GT10/11 from Clonetech" /map="6p21.3" /chromosome="6" CDS 556..2175 /codon_start=1 /product="acid finger protein" /db_xref="PID:g563127" /translation="MATSAPLRSLEEEVTCSICLDYLRDPVTIDCGHVFCRSCTTDVR PISGSRPVCPLCKKPFKKENIRPVWQLASLVENIERLKVDKGRQPGEVTREQQDAKLC ERHREKLHYYCEDDGKLLCVMCRESREHRPHTAVLMEKAAQPHREKILNHLSTLRRDR DKIQGFQAKGEADILAALKKLQDQRQYIVAEFEQGHQFLREREEHLLEQLAKLEQELT EGREKFKSRGVGELARLALVISELEGKAQQPAAELMQDTRDFLNRYPRKKFWVGKPIA RVVKKKTGEFSDKLLSLQRGLREFQGKLLRDLEYKTVSVTLDPQSASGYLQLSEDWKC VTYTSLYKSAYLHPQQFDCEPGVLGSKGFTWGKVYWEVEVEREGWSEDEEEGDEEEEG EEEEEEEEAGYGDGYDDWETDEDEESLGDEEEEEEEEEEEVLESCMVGVARDSVKRKG DLSLRPEDGVWALRLSSSGIWANTSPEAELFPALRPRRVGIALDYEGGTVTFTNAESQ ELIYTFTATFTRRLVPFLWLKWPGTRLLLRP" misc_feature 601..723 /note="C3HC4 or ring domain" misc_feature 859..954 /note="CHC3H2 domain" misc_feature 958..1356 /note="coiled-coil domain" /evidence=not_experimental polyA_signal 3576..3581 polyA_site 3595 /note="10 A residues" BASE COUNT 894 a 945 c 988 g 768 t ORIGIN 1 aacgcttttt gaattgaatt ttttaaaatg catgtattcc ttttcaaact aaaatgattt 61 ttaagcacac aaaagagaat caaaggctga gaaacaatga tttcacacac tatgtttgca 121 agaacaagtt attcccaaag caattcctac aggttctgca cctgtgttgc ttttataaag 181 caagcacagc gtaatgtgta agcactattg tcctgaatgc tttatccatg aaggtacgtg 241 ttaaggatcc ataactggta gaagggcgtt taaaacgctt ttttttcttt taaagagaca 301 gggtctcact atgttgccca ggctgctctc ttaactacga ggctcggagt caggaatgga 361 gagaagggta atggttttac ctcttattgt ggaaacctgt tgagatcaca gagaatatac 421 tgacggcata aaagggcaga accatagcag ggtgcgcgct gctgtagttg ccttccatgg 481 atctatagga gagcaagtcc tccaggagaa gaagtcctca ccagtgaacg gagacctctc 541 tgaactaagg ataccatggc cacgtcagcc ccactacgga gcctggaaga ggaggtgacc 601 tgctccatct gtcttgatta cctgcgggac cctgtgacca ttgactgtgg ccacgtcttc 661 tgccgcagct gcaccacaga cgtccgcccc atctcaggga gccgccccgt ctgcccactc 721 tgcaagaagc cttttaagaa ggagaacatc cgacccgtgt ggcaactggc cagcctggtg 781 gagaacattg agcggctgaa ggtggacaag ggcaggcagc cgggagaggt gacccgggag 841 cagcaggatg caaagttgtg cgagcgacac cgagagaagc tgcactacta ctgtgaggac 901 gacgggaagc tgctgtgcgt gatgtgccgg gagtcccggg agcacaggcc ccacacggcc 961 gtcctcatgg agaaggccgc ccagccccac agggaaaaaa tcctgaacca cctgagtacc 1021 ctaaggaggg acagagacaa aattcagggc ttccaggcaa agggagaagc tgatatcctg 1081 gccgcgctga agaagctcca ggaccagagg cagtacattg tggctgagtt tgagcagggt 1141 catcagttcc tgagggagcg ggaggaacac ctgctggaac agctggcgaa gctggagcag 1201 gagctcacgg agggcaggga gaagttcaag agccggggcg tcggggagct tgcccggctg 1261 gccctggtca tctccgaact ggagggcaag gcgcagcagc cagctgcaga gctcatgcag 1321 gacacgagag acttcctaaa caggtatcca cggaagaagt tctgggttgg gaaacccatt 1381 gctcgagtgg ttaaaaaaaa gaccggagaa ttctcagata aactcctctc tctgcaacga 1441 ggcctgaggg aattccaggg gaagctgctg agagacttgg aatataagac agtgagcgtc 1501 accctggacc cacagtcggc cagtgggtac ctgcagctgt cagaggactg gaagtgcgtg 1561 acctacacca gcctgtacaa gagtgcctac ctgcaccccc agcagtttga ctgtgagcct 1621 ggggtgctgg gcagcaaggg cttcacctgg ggcaaggtct actgggaagt ggaagtggag 1681 agggagggct ggtctgagga tgaagaagag ggggatgagg aggaagaggg agaagaggag 1741 gaggaggaag aggaggccgg ctatggggat ggatatgacg actgggaaac ggacgaagat 1801 gaggaatcgt tgggcgatga agaggaagaa gaggaggagg aagaggagga agttctggaa 1861 agctgcatgg tgggggtggc tagagactct gtgaagagga agggagacct ctccctgcgg 1921 ccagaggatg gcgtgtgggc gctgcgcctc tcctcctccg gcatctgggc caacaccagc 1981 cccgaggctg agcttttccc agcactgcgg ccccggagag tgggcatcgc cctggattat 2041 gaagggggca ccgtgacttt caccaacgca gagtcacagg aactcatcta caccttcact 2101 gccaccttca cccggcgcct ggtccccttc ctgtggctca agtggccagg aacacgcctc 2161 ctgctaagac cctgagccct gacatctgcc cccagcccca accctcagat gcttcacttc 2221 tttggaattc caggactctc aatgggggga cgggatgcct ggcctaagca cctggagcag 2281 gggaccccat atccactggt agccacctcc ccattgctgt ggccccctga aatctcactc 2341 agtgctgttg ctccatctac tgccctaatg gggctctttt cccacctcct gctggtttcc 2401 cgagggaact tctgaccctg gagtccatga gggctccttt cctttttgac cacgaccttg 2461 gccccagctc tgcactctct ggaataaggg cccgatgcag cattttcttg cccagtgtgg 2521 caagacctca gaaaaaccag tcaattacgc ctaagatcat tttgctgtcc ttaaaccccc 2581 caggttcctt cttgcacaaa tccactctgc tgcccacctt ccctggcatt ttaaacagac 2641 ctaccccacc ccaactcaga gttaagcata catggcaatg ctgaaaataa acaaaatcca 2701 ctgaggcttc ccaggccatt taaagcctgg agtaccagcg catatcatct catggggtcc 2761 agagtggagt ccaggcttct ctgaaaccag ggctgagcat atttccatag ccaaggaggt 2821 gggactccct ggagctatct gtggtcttag ggaaggatcc agacatacac ggctttgggg 2881 tacaagctgt gatcactgat caataaatta tctctagatt ggtccttgtg aggggagttt 2941 taagaatcca gaacatcttg ctcttgatca gcacatccaa aaacaccaag acaaacatcc 3001 agtggagcag aacctctcct gcctcgggag ttctgaccag gttccccagc agggtgttaa 3061 gcctctgtct ctggccctac cagcatccag gttccacttt cctaggagag agtggagatg 3121 ggaagacagg gaaaggaagg cagcaggagg ccacaagccc accaggtctt catgtcaaga 3181 gagggagtaa catgtcttct cattgcccac ggacccaacc tctgtccagg tgcccctcat 3241 catcacagtt taacacaagc tcccctgctc cagccaaatt gatctcccag tcttgtcctt 3301 acccattcca agtgctctgc cagcccctgt tcatccaagt tttaagtcct ccaccctgtt 3361 ctaagagact gttccaacgg ctctgggcct cagtggtcac ttgaccacca ttgtctcaga 3421 gctgcccact ttgtgtgtgt cacatgctgc ctgagtcaca cgtacatctt atttctgcct 3481 ctagttttgt aagctcctgg agggcggata acatcttata ctactcttgt attctcatgg 3541 cacctagtgc agtgctgggc acagagtagg tgctcaataa agacttgttg aatga // LOCUS HSU09850 3908 bp mRNA PRI 09-NOV-1995 DEFINITION Human zinc finger protein (ZNF143) mRNA, complete cds. ACCESSION U09850 NID g495571 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3908) AUTHORS Tommerup,N. and Vissing,H. TITLE Isolation and fine mapping of 16 novel human zinc finger-encoding cDNAs identify putative candidate genes for developmental and malignant disorders JOURNAL Genomics 27 (2), 259-264 (1995) MEDLINE 96044430 REFERENCE 2 (bases 1 to 3908) AUTHORS Vissing,H. TITLE Direct Submission JOURNAL Submitted (19-MAY-1994) Henrik Vissing, Bioscience, Molecular Biology 6B2.107, Novo Nordisk, Novo Alle, Bagsvaerd, DK-2880, Denmark FEATURES Location/Qualifiers source 1..3908 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="ZNF143, pHZ-1" /clone_lib="human insulinoma cDNA library" /chromosome="11" /map="11p15.4" /tissue_type="insulinoma" gene 38..1918 /gene="ZNF143" CDS 38..1918 /gene="ZNF143" /note="GLI type" /codon_start=1 /product="zinc finger protein" /db_xref="PID:g495572" /translation="MTEFPGGGMEAQHVTLCLTEAVTVADGDNLENMEGVSLQAVTLA DGSTAYIQHNSKDAKLIDGQVIQLEDGSAAYVQHVPIPKSTGDSLRLEDGQAVQLEDG TTAFIHHTSKDSYDQSALQAVQLEDGTTAYIHHAVQVPQSDTILAIQADGTVAGLHTG DATIDPDTISALEQYAAKVSIDGSESVAGTGMIGENEQEKKMQIVLQGHATRVTAKSQ QSGEKAFRCEYDGCGKLYTTAHHLKVHERSHTGDRPYQCEHAGCGKAFATGYGLKSHV RTHTGEKPYRCSEDNCTKSFKTSGDLQKHIRTHTGERPFKCPFEGCGRSFTTSNIRKV HVRTHTGERPYYCTEPGCGRAFASATNYKNHVRIHTGEKPYVCTVPGCDKRFTEYSSL YKHHVVHTHSKPYNCNHCGKTYKQISTLAMHKRTAHNDTEPIEEEQEAFFEPPPGQGE DVLKGSQITYVTGVEGDDVVSTQVATVTQSGLSQQVTLISQDGTQHVNISQADMQAIG NTITMVTQDGTPITVPAHDAVISSAGTHSVAMVTAEGTEGEQVAIVAQDLAAFHTASS EMGHQQHSHHLVTTETRPLTLVATSNGTQIAVQLGEQPSLEEAIRIASRIQQGETPGL DD" BASE COUNT 1048 a 971 c 975 g 914 t ORIGIN 1 ctaaaggtta gcccaaataa atcgagattc tcagggaatg acagagtttc ctggaggagg 61 gatggaggcg caacatgtta cgctgtgctt gacagaggca gtcaccgtgg cagatggtga 121 caacttagaa aatatggaag gcgtaagctt gcaagcagta acacttgcag atggttctac 181 tgcttacata caacacaatt ctaaagatgc aaaactcata gatggccagg tcattcagtt 241 ggaagatggt tctgcggcct atgttcaaca tgtacccata cctaaaagta caggggacag 301 tttgcgtcta gaggatggtc aagcagtaca gttagaagat ggtaccacag catttattca 361 ccacacctcc aaagatagtt atgaccagag tgcattacag gcggttcagc tggaagatgg 421 taccacagct tatatccacc atgcagtgca agtcccgcag tctgacacca tcttggcaat 481 tcaggctgat gggacagtgg caggtctgca cactggggat gctacaattg accctgacac 541 catcagtgct ttggaacagt atgcagcaaa ggtgtccatt gatggaagtg aaagtgtagc 601 aggtactgga atgattggag aaaatgagca agagaaaaaa atgcagattg ttttacaagg 661 acatgctaca agagtaactg ctaaatctca acagagtgga gagaaggcat ttcgatgtga 721 atatgatgga tgtggaaaat tatatacaac agctcatcat ctcaaggtcc atgagaggtc 781 acacacagga gatcggcctt atcagtgtga gcatgcaggc tgtgggaagg catttgcaac 841 aggttatgga ttaaaaagtc acgtcagaac tcatacagga gaaaagccat atcggtgttc 901 ggaagataat tgtactaaat ctttcaaaac ttcaggagat ctacagaaac acatcagaac 961 tcatacagga gaaaggccct ttaagtgtcc cttcgaaggc tgcggtcggt cctttacaac 1021 atcaaatatc agaaaagtgc acgttaggac acacacagga gaaagacctt attactgcac 1081 agagccagga tgtgggaggg catttgccag tgcaacaaat tataaaaacc atgtgaggat 1141 acacacagga gaaaagccat atgtttgtac agttcctggg tgtgacaaaa ggtttacaga 1201 atattccagt ttgtacaaac atcatgttgt ccacactcat tccaaacctt acaactgtaa 1261 ccactgtggg aagacataca agcagatctc cacgctggcc atgcacaaac ggacagccca 1321 caacgacact gagcccatcg aggaggagca ggaagccttc tttgagccgc ccccaggtca 1381 aggtgaagat gttcttaaag ggtcccagat tacgtatgtt acaggtgtag aaggggacga 1441 cgttgtttct acacaagtag ccacagtaac ccaatctgga ctgagtcaac aagttacact 1501 catatcccag gatgggactc agcatgtcaa catatctcaa gctgacatgc aggccattgg 1561 caacaccatc acaatggtaa cgcaggatgg cacgcccatc acagtccccg cccatgatgc 1621 agtcatctcc tcagcaggaa cgcactctgt tgctatggtt actgctgagg gtacagaagg 1681 ggaacaggtt gcaattgtag ctcaagactt ggcagcattc catactgcct catcagaaat 1741 ggggcaccag cagcatagcc atcacttagt aaccacagaa accagacctc tgaccttagt 1801 agcaacatcc aatggcaccc agattgcagt tcagcttgga gaacagccat ctctggaaga 1861 agccatcaga atagcgtcta gaatccaaca aggagaaacg ccagggttgg atgattaatc 1921 ctcagaacaa tggagcaata aagcagaagg agtctttcat cttctggcag cagaaatcca 1981 tgaagcccgg gcccaggaaa attagaagtt ttccattcct gatacactgt acacattttt 2041 atgcgagagt ggagaacatt ttattcttga cacttttgtg tatataaccc ttggaataga 2101 ttctcagagt gattcattgt gtacaaggaa gtatgaaatt agggcaatac agtaaatttt 2161 catgttactc ttttatcaga tcacaaactc ctagagtcta catgcaagac tagtaaagtc 2221 ttatggagtc ttatgatgga tttttaactt cccgtggaaa aaaaaataaa ggctgtatct 2281 aaaataaaaa aaaaaaaaaa aattttttaa gtaagagcaa tgtttactga gcgatacagg 2341 actgagtact atgaagggtt tacaaggatg gaaatgcacc ccgcccccgc cctggccacc 2401 cccgacaccc agaagactac agtacttagg agttacacac aacggccgta actggtggct 2461 atctgttcat aacaaacaaa ccatagcata tttatactgt atcacatcga gtgattatag 2521 aaatccatat atatattgct tgtataaaat cttttttttt gtaaaaaata ttaaaaaaaa 2581 aaaaaaaaaa aaaggaaagt ataaaaaaca tgtgcagttg aaagccctgc caggacagcc 2641 agtctgtaaa cattcggtga gtatgtgctt tggaaggcgc ccgcgcctca gtgcccacag 2701 caaactccag cagggctggc gagggtgcgc ccggctgccg cctcctgggc agggccgcgc 2761 tggaccgagg tgggtggggg catcagggcc ccgcagcacg cccctctcca gttcgcatca 2821 gggcgagggg cagggctggg ggcaggtggg ttgcagtttc attctgagtt ccatcctcag 2881 ccgcgttttg gttgcaactc atggcttttc ctggctgttc ggaggttcct tggatgggtg 2941 ccctggtgaa gaggccagag aagagcccag ggctgcctcg ttgggcgtct ccctggcctc 3001 ggtgcctctg tcctgcctcc ttcaagtggg gccccaggca ggacaaccct acccgagggt 3061 caactttggt ctcagcgctt ggtccctctg ggtttcctca gcccgctctt cctcctgggt 3121 ggctgatctg agttcacaaa aaagctcggc tgatgctggc tgtgctgaac ctcattgtac 3181 agtttttcat gcaccaaagg aagggccatc tggggccccc ctcctcctct ctcttcgtca 3241 cttttccttc tcctcttcct cctcctccac ctcctcccag cccatggtca cttcttctcg 3301 agctgctcga gcagtgggtt gccttctgac tcgggggtct tgacactgtc ggtccggacg 3361 atgcaggtgg ggcggtgtcg gttcagcatc aggatggagg ctgctgccgc tcctgcttca 3421 gctcctcaat ctgggtcttc agctctgcgt tcatgagttc cagccgctcg gattcccgct 3481 gcagaaactc cgtgcgctcc ttcttcttgt tccggcatcg ggctgctgcg actttgttct 3541 tctcccggcg ccttttcctt cgctcctctt cctcatctag ctcacttttc acgggctggg 3601 gcctcttgcc cagtttcacc tccaggaagt gcaagggtgc aatcatggcc ccgaggttgc 3661 ggatgtcagc gtatttcagc tcctccacag tcagggccga gctggggagc ccggtcaggg 3721 ggccaagccc tggcagggag cctgtggtca ccgaagggtc cgggatctgc ccaggcatca 3781 tagcaggagg agtggcaggc cagccttaac ctcccccgcc gccttgggcg ccctcctcac 3841 cgcgagagcg ctcgcgctcc tctccccgcg gggcgtcccg cgctttagag cacactgggg 3901 taccggat // LOCUS HSU09860 3696 bp mRNA PRI 03-JUN-1995 DEFINITION Human enterokinase mRNA, complete cds. ACCESSION U09860 NID g746412 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3696) AUTHORS Kitamoto,Y., Yuan,X., Wu,Q., McCourt,D.W. and Sadler,J.E. TITLE Enterokinase, the initiator of intestinal digestion, is a mosaic protease composed of a distinctive assortment of domains JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91 (16), 7588-7592 (1994) MEDLINE 94329561 REFERENCE 2 (bases 1 to 3696) AUTHORS Kitamoto,Y., Veile,R.A., Donis-Keller,H. and Sadler,J.E. TITLE cDNA sequence and chromosomal localization of human enterokinase, the proteolytic activator of trypsinogen JOURNAL Biochemistry 34 (14), 4562-4568 (1995) MEDLINE 95234679 REFERENCE 3 (bases 1 to 3696) AUTHORS Sadler,J.E. TITLE Direct Submission JOURNAL Submitted (19-MAY-1994) J. Evan Sadler, Medicine, Howard Hughes Medical Institute, Washington University, 660 South Euclid Avenue, St. Louis, MO 63110, USA FEATURES Location/Qualifiers source 1..3696 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hek1, hek3, hek5" /clone_lib="Clontech Number HL1133b" /sex="female" /tissue_type="duodenum" /dev_stage="juvenile, 15 year old" CDS 41..3100 /note="enteropeptidase" /codon_start=1 /product="enterokinase" /db_xref="PID:g746413" /translation="MGSKRGISSRHHSLSSYEIMFAALFAILVVLCAGLIAVSCLTIK ESQRGAALGQSHEARATFKITSGVTYNPNLQDKLSVDFKVLAFDLQQMIDEIFLSSNL KNEYKNSRVLQFENGSIIVVFDLFFAQWVSDQNVKEELIQGLEANKSSQLVTFHIDLN SVDILDKLTTTSHLATPGNVSIECLPGSSPCTDALTCIKADLFCDGEVNCPDGSDEDN KMCATVCDGRFLLTGSSGSFQATHYPKPSETSVVCQWIIRVNQGLSIKLSFDDFNTYY TDILDIYEGVGSSKILRASIWETNPGTIRIFSNQVTATFLIESDESDYVGFNATYTAF NSSELNNYEKINCNFEDGFCFWVQDLNDDNEWERIQGSTFSPFTGPNFDHTFGNASGF YISTPTGPGGRQERVGLLSLPLDPTLEPACLSFWYHMYGENVHKLSINISNDQNMEKT VFQKEGNYGDNWNYGQVTLNETVKFKVAFNAFKNKILSDIALDDISLTYGICNGSLYP EPTLVPTPPPELPTDCGGPFELWEPNTTFSSTNFPNSYPNLAFCVWILNAQKGKNIQL HFQEFDLENINDVVEIRDGEEADSLLLAVYTGPGPVKDVFSTTNRMTVLLITNDVLAR GGFKANFTTGYHLGIPEPCKADHFQCKNGECVPLVNLCDGHLHCEDGSDEADCVRFFN GTTNNNGLVRFRIQSIWHTACAENWTTQISNDVCQLLGLGSGNSSKPIFSTDGGPFVK LNTAPDGHLILTPSQQCLQDSLIRLQCNHKSCGKKLAAQDITPKIVGGSNAKEGAWPW VVGLYYGGRLLCGASLVSSDWLVSAAHCVYGRNLEPSKWTAILGLHMKSNLTSPQTVP RLIDEIVINPHYNRRRKDNDIAMMHLEFKVNYTDYIQPICLPEENQVFPPGRNCSIAG WGTVVYQGTTANILQEADVPLLSNERCQQQMPEYNITENMICAGYEEGGIDSCQGDSG GPLMCQENNRWFLAGVTSFGYKCALPNRPGVYARVSRFTEWIQSFLH" mat_peptide 41..2392 /product="enterokinase heavy chain" mat_peptide 2393..3097 /note="catalytic serine protease domain" /product="enterokinase light chain" 3'UTR 3101..3696 BASE COUNT 1162 a 697 c 744 g 1093 t ORIGIN 1 accagacagt tcttaaatta gcaagccttc aaaaccaaaa atggggtcga aaagaggcat 61 atcttctagg catcattctc tcagctccta tgaaatcatg tttgcagctc tctttgccat 121 attggtagtg ctctgtgctg gattaattgc agtatcctgc ctgacaatca aggaatccca 181 acgaggtgca gcacttggac agagtcatga agccagagcg acatttaaaa taacatccgg 241 agttacatat aatcctaatt tgcaagacaa actctcagtg gatttcaaag ttcttgcttt 301 tgaccttcag caaatgatag atgagatctt tctatcaagc aatctgaaga atgaatataa 361 gaactcaaga gttttacaat ttgaaaatgg cagcattata gtcgtatttg accttttctt 421 tgcccagtgg gtgtcagatc aaaatgtaaa agaagaactg attcaaggcc ttgaagcaaa 481 taaatccagc caactggtca ctttccatat tgatttgaac agcgttgata tcctagacaa 541 gctaacaacc accagtcatc tggcaactcc aggaaatgtc tcaatagagt gcctgcctgg 601 ttcaagtcct tgtactgatg ctctaacgtg tataaaagct gatttatttt gtgatggaga 661 agtaaactgt ccagatggtt ctgacgaaga caataaaatg tgtgccacag tttgtgatgg 721 aagatttttg ttaactggat catctgggtc tttccaggct actcattatc caaaaccttc 781 tgaaacaagt gttgtctgcc agtggatcat acgtgtaaac caaggacttt ccattaaact 841 gagcttcgat gattttaata catattatac agatatatta gatatttatg aaggtgtagg 901 atcaagcaag attttaagag cttctatttg ggaaactaat cctggcacaa taagaatttt 961 ttccaaccaa gttactgcca cctttcttat agaatctgat gaaagtgatt atgttggctt 1021 taatgcaaca tatactgcat ttaacagcag tgagcttaat aattatgaga aaattaattg 1081 taactttgag gatggctttt gtttctgggt ccaggatcta aatgatgata atgaatggga 1141 aaggattcag ggaagcacct tttctccttt tactggaccc aattttgacc acacttttgg 1201 caatgcttca ggattttaca tttctacccc aactggacca ggagggagac aagaacgagt 1261 ggggctttta agcctccctt tggaccccac tttggagcca gcttgcctta gtttctggta 1321 tcatatgtat ggtgaaaatg tccataaatt aagcattaat atcagcaatg accaaaatat 1381 ggagaagaca gttttccaaa aggaaggaaa ttatggagac aattggaatt atggacaagt 1441 aaccctaaat gaaacagtta aatttaaggt tgcttttaat gcttttaaaa acaagatcct 1501 gagtgatatt gcgttggatg acattagcct aacatatggg atttgcaatg ggagtcttta 1561 tccagaacca actttggtgc caactcctcc accagaactt cctacggact gtggaggacc 1621 ttttgagctg tgggagccaa atacaacatt cagttctacg aactttccaa acagctaccc 1681 taatctggct ttctgtgttt ggattttaaa tgcacaaaaa ggaaagaata tacaacttca 1741 ttttcaagaa tttgacttag aaaatattaa cgatgtagtt gaaataagag atggtgaaga 1801 agctgattcc ttgctcttag ctgtgtacac agggcctggc ccagtaaagg atgtgttctc 1861 taccaccaac agaatgactg tgcttctcat cactaacgat gtgttggcaa gaggagggtt 1921 taaagcaaac tttactactg gctatcactt ggggattcca gagccatgca aggcagacca 1981 ttttcaatgt aaaaatggag agtgtgttcc actggtgaat ctctgtgacg gtcatctgca 2041 ctgtgaggat ggctcagatg aagcagattg tgtgcgtttt ttcaatggca caacgaacaa 2101 caatggttta gtgcggttca gaatccagag catatggcat acagcttgtg ctgagaactg 2161 gaccacccag atttcaaatg atgtttgtca actgctggga ctagggagtg gaaactcatc 2221 aaagccaatc ttctctaccg atggtggacc atttgtcaaa ttaaacacag cacctgatgg 2281 ccacttaata ctaacaccca gtcaacagtg tttacaggat tccttgattc ggttacagtg 2341 taaccataaa tcttgtggaa aaaaactggc agctcaagac atcaccccaa agattgttgg 2401 aggaagtaat gccaaagaag gggcctggcc ctgggttgtg ggtctgtatt atggcggccg 2461 actgctctgc ggcgcatctc tcgtcagcag tgactggctg gtgtccgccg cacactgcgt 2521 gtatgggaga aacttagagc catccaagtg gacagcaatc ctaggcctgc atatgaaatc 2581 aaatctgacc tctcctcaaa cagtccctcg attaatagat gaaattgtca taaaccctca 2641 ttacaatagg cgaagaaagg acaacgacat tgccatgatg catctggaat ttaaagtgaa 2701 ttacacagat tacatacaac ctatttgttt accggaagaa aatcaagttt ttcctccagg 2761 aagaaattgt tctattgctg gttgggggac ggttgtatat caaggtacta ctgcaaacat 2821 attgcaagaa gctgatgttc ctcttctatc aaatgagaga tgccaacagc agatgccaga 2881 atataacatt actgaaaata tgatatgtgc aggctatgaa gaaggaggaa tagattcttg 2941 tcagggggat tcaggaggac cattaatgtg ccaagaaaac aacaggtggt tccttgctgg 3001 tgtgacctca tttggataca agtgtgccct gcctaatcgc cccggagtgt atgccagggt 3061 ctcaaggttt accgaatgga tacaaagttt tctacattag cgcatttctt aaactaaaca 3121 ggaaagtcgc attattttcc cattctactc tagaaagcat ggaaattaag tgtttcgtac 3181 aaaaatttta aaaagttacc aaaggttttt attcttacct atgtcaatga aatgctaggg 3241 ggccagggaa acaaaatttt aaaaataata aaattcacca tagcaataca gaataacttt 3301 aaaataccat taaatacatt tgtatttcat tgtgaacagg tatttcttca cagatctcat 3361 ttttaaaatt cttaatgatt atttttatta cttactgttg tttaaaggga tgttatttta 3421 aagcatatac catacactta agaaatttga gcagaattta aaaaagaaag aaaataaatt 3481 gtttttccca aagtatgtca ctgttggaaa taaactgcca taaattttct agttccagtt 3541 tagtttgctg ctattagcag aaactcaatt gtttctctgt cttttctatc aaaattttca 3601 acatatgcat aaccttagta ttttcccaac caatagaaac tatttattgt aagcttatgt 3661 cacaggcctg gactaaattg attttacgtt cctctt // LOCUS HSU10099 1496 bp mRNA PRI 27-JAN-1996 DEFINITION Human POM-ZP3 mRNA, complete cds. ACCESSION U10099 NID g607803 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1496) AUTHORS Kipersztok,S., Osawa,G.A., Liang,L.F., Modi,W.S. and Dean,J. TITLE POM-ZP3, a bipartite transcript derived from human ZP3 and a POM121 homologue JOURNAL Genomics 25 (2), 354-359 (1995) MEDLINE 95309900 REFERENCE 2 (bases 1 to 1496) AUTHORS Dean,J. TITLE Direct Submission JOURNAL Submitted (26-MAY-1994) Jurrien Dean, National Institutes of Health, Lab. Cell. and Develop. Biology, NIDDK, Building 6, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..1496 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="female" /tissue_type="ovary" /map="7q11.23" /chromosome="7" CDS 694..1326 /codon_start=1 /product="POM-ZP3" /db_xref="PID:g607804" /translation="MVCSPVTVRIAPPDRRFSRSAIPEQIISSTLSSPSSNAPDPCAK ETVLSALKEKKKKRTVEEEDQIFLDGQENKRSCLVDGLTDASSAFKVPRPGPDTLQFT VDVFHFANDSRNMIYITCHLKVTLAEQDPDELNKACSFSKPSNSWFPVEGPADICQCC NKGDCGTPSHSRRQPRVVSQWSTSASRNRRHVTEEADVTVGATDLPGQEW" BASE COUNT 299 a 480 c 419 g 298 t ORIGIN 1 gggccggcgg ctgcggcggc tggagcaggc gagcggcggc ggccgatagc gagtgtcagg 61 gccggccggg gcggcgcttc tcggcctgtc gctggtcggc ctcctactgt acctcgtgcc 121 tgctgcggct gcgctggcct ggctggccgt ggggactacc gcggcctggt ggggactgag 181 ccgcgagccc cgaggttcgc gccccttgtc ctccttcgtt cagaaggcgc gacatcggcg 241 aacactgttc gcttcgcctc cggccaagtc gacagccaac ggaaacctcc tagagccgcg 301 gaccctgctc gaaggacctg accctgccga actgctcctc atgggcagtt acctgggcaa 361 gcccgggccg ccgcagcccg cccccgctcc ggagggccag gacctgcgga ataggcctgg 421 ccgccgcccg cccgcccggc gccgcgctcc acaccgccct ccccgccgac ccatcgcgtt 481 caccactttt acccctctct ccccactcct cttctccgac cctccgggag gccttcccca 541 cgggatcgtg ggactttacc agatcggttt gtaataacac ctcgaagacg ctatccgatc 601 catcaggccc agtattcctg tccgggggta cttcccacag tgtgctggaa tggttatcac 661 aagaaggctg tgctgtcccc tcgcaactcc aggatggtgt gtagcccagt gactgtgagg 721 atcgcccctc ctgacagaag attttcgcgt tctgcgatac cagagcagat aatcagctca 781 acactgtcct caccatcaag taatgcccca gacccatgtg caaaggagac tgtactgagt 841 gccctcaaag agaagaagaa gaaaaggaca gtggaggaag aagaccaaat attccttgat 901 ggccaggaaa ataaaagaag ctgtcttgtc gacggtctca ctgatgcctc ttctgcattc 961 aaagttcctc gacccgggcc agatacactc cagttcacag tggatgtctt ccactttgct 1021 aatgactcca gaaacatgat atacatcacc tgccacctga aggtcaccct agctgagcag 1081 gacccagatg aactcaacaa ggcctgttcc ttcagcaagc cttccaacag ctggttccca 1141 gtggaaggcc cggctgacat ctgtcaatgc tgtaacaaag gtgactgtgg cactccaagc 1201 cattccagga ggcagcctcg tgtcgtgagc cagtggtcca cgtctgcttc ccgtaaccgc 1261 aggcatgtga cagaagaagc agatgtcacc gtgggggcca ctgatcttcc tggacaggag 1321 tggtgaccat gaagtagagc agtgggcttt gccttctgac acctcagtgg tgctgctggg 1381 cgtaggcctg gctgtggtgg tgtccctgac tctgactgct gttatcctgg ttctcaccag 1441 gaggtgtcgc actgcctccc accctgtgtc tgcttccgaa taaaagaaga aagcaa // LOCUS HSU10116 10079 bp DNA PRI 18-FEB-1995 DEFINITION Human superoxide dismutase (SOD3) gene, complete cds. ACCESSION U10116 NID g529149 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 10079) AUTHORS Folz,R.J. and Crapo,J.D. TITLE Extracellular superoxide dismutase (SOD3): tissue-specific expression, genomic characterization, and computer-assisted sequence analysis of the human EC SOD gene JOURNAL Genomics 22 (1), 162-171 (1994) MEDLINE 95048365 REFERENCE 2 (bases 1 to 10079) AUTHORS Folz,R.J. TITLE Direct Submission JOURNAL Submitted (27-MAY-1994) Rodney J. Folz, Medicine, Duke University Medical Center, Bell Building, Room 250, Durham, NC 27710, USA FEATURES Location/Qualifiers source 1..10079 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="7" /clone_lib="Adult female leukocyte library" /chromosome="4" /sex="female" /cell_type="leukocyte" /tissue_type="blood" 5'UTR 1..558 mRNA join(1..563,1136..1219,5069..6405) exon 559..563 intron 564..1135 exon 1136..1219 intron 1220..5068 exon 5069..6404 gene 5085..5807 /gene="SOD3" CDS 5085..5807 /gene="SOD3" /codon_start=1 /product="superoxide dismutase" /db_xref="PID:g529150" /translation="MLALLCSCLLLAAGASDAWTGEDSAEPNSDSAEWIRDMYAKVTE IWQEVMQRRDDDGTLHAACQVQPSATLDAAQPRVTGVVLFRQLAPRAKLDAFFALEGF PTEPNSSSRAIHVHQFGDLSQGCESTGPHYNPLAVPHPQHPGDFGNFAVRDGSLWRYR AGLAASLAGPHSIVGRAVVVHAGEDDLGRGGNQASVENGNAGRRLACCVVGVCGPGLW ERQAREHSERKKRRRESECKAA" 3'UTR 5808..6405 polyA_signal 6386..6391 polyA_site 6405 BASE COUNT 2482 a 2612 c 2408 g 2577 t ORIGIN 1 ggatccagag atttagattt tttataagct ttcctgccac cgaaacgggt gtttgggacc 61 tcacgaggcc ctgttcattc ttcgtcgctg cgctccccac tctgtactgg atgcatttac 121 tgacgttgtt gtctccgtcc ccagagtatg aacccccaag gtgactcatg cagctgtggg 181 tgcccggcat acagcatggt gactggaatg gatgagcacc caataaacat ttgttgcagg 241 aatgcaggag gacgggcagg ccagcaagca ggctgcctgg tttttcccac atgggctttt 301 ctgggaaaga agagcttcta tttttggaaa gggctgctat gattgagaaa agttcatggc 361 agcaaaaaaa ggacagacgt cgggagggaa acactcctag ttctcccaga caacacattt 421 tttaaaaaga ctccttcatc tctttaataa taacggtaac gacaatgaca atgatgatta 481 cttatgagtg cggctagtgc cagccactgt gttgtcactg ggcgagtaat gatctcattg 541 gatcttcacg gtgggcgtgc ggggtggaca gcctcacacc cccattttac agatgatgaa 601 aaggaggtgc agggagtggt gcagctgctt caggcgtaca cagataggaa gtgacaaggc 661 tgggactctg cagcctgagt gtgtcatcac gacccacccg ctgctctgct ctcataggta 721 tgacagcaca gctctggagc aaatgccatg cacatttgca aggtgcccat ttccatgcag 781 caaaaataag tcaataagtt attgacttag agaaaagcaa agggcctctc aataaagagg 841 tcattgtaca cctctccaaa caggcgattt tctttctcat ttttattccc ctgctgtgtg 901 ctgaaggtca ctggctacaa gccggtgaag tcgcggaatg gaatccttgg cccgaaaacc 961 caaaaatggg aggggcagag gaggtgggga cagagcggga ggaggtggag gcgaagcaat 1021 tctacaaccc ggggaggtct ggcctgcttt tcctccctga actggcccaa tgactggctc 1081 cctcacgctg accactcctc tgggctggcc tcctgcactc gcgctaacag cccaggctcc 1141 agggacagcc tgcgttcctg ggctggctgg gtgcagctct cttttcagga gagaaagctc 1201 tcttggagga gctggaaagg tgggtgctaa gttgaggttc attttgttct tctcggagtg 1261 tgcttattga gtctgaagct gggttggggc aacgggcctc ttcttgggaa caaattggat 1321 catcttcttg ggaaggaaat gtactttccc tggctgctct gaggggttag tggggaggtg 1381 gagtgagcgg ggaggaaggc aaggagggga ggaagaaacc gttcctcctg tggatctgca 1441 aagaccagtc caagaggatt ttagtgttag gaaaaggaat ctggagtgac gagaaagggg 1501 gcctttctag atgttgcatg gctttggtgt cgggagccac ttatgggaca gcaggtactc 1561 taaaaagcca cctccttagg aaagcagaga ggccctggcc agctcaggct cccagcaaga 1621 gctccttcta ggagacagct gagggatgaa acacacccaa ggctcaagag gggcaggttc 1681 ttcccagata cagacccagg aaggagataa aggcttggtg cctctatttg gttcaggata 1741 agggcccctg tcctctttct ctgataacac tgtcctcttt ctctgataac accgtcctcc 1801 cttccagatc cacgtacaaa ggaggccctt aaaaaggcac ttggtcattc acagctcaaa 1861 ctgagcaaga ggctgtggga gaagaatcaa gttggtcccg aggggaagag gtgtcaaagg 1921 cttaagaaac aagaagtcag agtttacctg ggtttgaggg agaattttct ttcccccttt 1981 tcctcctcct cctccttctt ctcttttttt tttttttttt tttttttttt ttgagacatg 2041 gtctcattct gtcacccagc acccaggctg gaatgtagtg gcacgatcac tatcacggct 2101 cactacagcc tctacctccc gggctcaagt gatcctccta cctcagcctc ctgagtaact 2161 gggactacag gcacatgcca ccacacccag ctattttttt ttttgctaga gatgggggtc 2221 tctaccaggt tggtctcata ctcttgtact caaatgattc tcctgtctca tcctcccaaa 2281 gggtgggatt acaggcataa gccaccatgc ctggctcttc ttttggtttc agagaaaaac 2341 atctccttaa aatgtttatt tcccaaggat tcttgaaaaa gaaagctcac tgacacaccc 2401 aaaacaatct ggttttgctc tgtgctttta gggagaactt tctaagcagc agagcccttc 2461 tgagtggcag ggctgtctta ggaggaaggt gtcttttgat gatggggaac ttcatgtcca 2521 ggtctggcag gagagttacc ccactttcct gcctactccc tggggctttg gggtagtagt 2581 accacattgg gccatgtcat ttaggtgagt ccttcaacat cactttctct gcttctccct 2641 ctttctggat cctccttctt ggagcctttc aaggggacct cctctcacag tgtccatagc 2701 atctcttagc taatggtcct taaaatctct accagcagct tctctctgat agctaagagc 2761 tgccatttac tgggaacttt ctatgtactg ggctctgtgc taagtgccct agatgagaga 2821 tgtgcagtgt ggtgcctaaa ccttgggctt ggagcagaca cacactttca aatcctgcct 2881 tcagctcctt agtgaacatg tcaccttggg cgggacacac gcctctctgt gcctcagttt 2941 cctacacttt agaatgggga taacactgaa taatgttctt gtgaggatgc agggaattaa 3001 cccacgcaca gtacttataa tagtgtctgg cgcctgtgtt cgataagttt tagcaattct 3061 aatcatctct tttaagcctc gcagcaagcc tctaaggtaa gtctgtatta gtatccctat 3121 ttacagatga gaaaactgag gttcacaggg gatgagacag tgtacagtct gcagtccagc 3181 aattactctg ctactcagca ataaaaatag taacagctaa cccttagact aagtggcaga 3241 gtcaggcttt agattcatga ggtgagttct ggaatccatc cctttaataa ccacactaaa 3301 ttgcctttct gaaatggtta tataaagcat atctacccaa tcttggagtt ttttaaatgg 3361 cacctagttt ggtgctggaa atgcagttga ccttcaaagc aattctttgg aggcagcatc 3421 aatccctctg gaaatacctc ggtggcatgg ctggccttat tctacaggta aggaacttga 3481 agctaagcat cagtaacccc gtgaagtcac agttagtata ggttggaatt gggattcaaa 3541 tctgtacctg actttataat tcctagctgg gccccagaat ctttgataga ggtgtcttct 3601 ttcttttctt ttctttcttt cctctttctt tcccttcctt cctctctctc tgtctttctt 3661 ctctcctttc tttctcacag aatcaaaatc tcttggggtg gggcctgggc atctgatttt 3721 taaaaaccag acatctgatg tgcagtcaac actgagaacc cctgccagct tcatctcctc 3781 ttctaagtgc cagacccaag tttccaactg tctgcccacc tgtctcccca cctgggcacc 3841 cgccagcgtc tcaccctcag gagactccag ctgaactaat cctctctccc tgcttttcca 3901 gaacaggtcc caccctccct ccactcagtc tctcctgctg ggaaccctgg tcatctgcac 3961 tgtgccttca tcttccatcc tgccagtgct gcccggtgtg tctcttaaac ccatgcctcc 4021 tctgtgtgca ccacctgcac tttggtaaaa gccttcattt cctgcttggg ttactacaac 4081 gccccctaac tcatctcact gtctctattt ctgcttctct gtctctccct aggctactcc 4141 cattcttcct cccctttcct cttcatccca aagtccaacc catatccttt taccagtagg 4201 acttaaggaa ctaaagacta tctcatcacc cacttttctt cttaaaaact tccactgcac 4261 tgcctgctga gatggccttc ctacccaact tggctggaaa actcctaccc atcttgtgga 4321 acccagttca aaagtcacca cctctgagaa gccttccctg aggctcctag ggagatgggt 4381 actgcctcct ctgtccttct ccagcacagg ccccatcttc aatcacagga ttgtgctgga 4441 atgattggat gccaagtctg tccctcactg aactccttat gcaaaatcca tattatatgt 4501 ttccttttgc caggtgtggg cccaggtgct ggggataccg atgaataaaa ctgagtttct 4561 gtcttcaaga agctccaagt ctactgagtg tagcagagaa cagggagaag gcacttcagg 4621 gagaaggggt agcacatgca aagccccaga aggcagggac agaagcctta gggatgtctg 4681 tgggggagga tggaggaaga gggtaacagg agaccaggtg gggagatgag ggaggtggtc 4741 tggaagggcc atgagacacc cctcacgctc cctgagaccc cctccacgct atagagatgg 4801 gactggagag gacgatgatc atttgtgact cagatccctg tgggtttctt cagattgggt 4861 ctcacccatc tttacagcca cagcacctaa cacagtgccc ggcacacagc aggccctaga 4921 caaacgtttg ccacatgaag tcatgccact ggccaggaag cccactgggg actggggggt 4981 tggttctgcg ataatggggt ccctgagatt ctatgtttca cgtgactaag cctcactctg 5041 cccccacctc cgcgggggcg tcccgcaggt gcccgactcc agccatgctg gcgctactgt 5101 gttcctgcct gctcctggca gccggtgcct cggacgcctg gacgggcgag gactcggcgg 5161 agcccaactc tgactcggcg gagtggatcc gagacatgta cgccaaggtc acggagatct 5221 ggcaggaggt catgcagcgg cgggacgacg acggcacgct ccacgccgcc tgccaggtgc 5281 agccgtcggc cacgctggac gccgcgcagc cccgggtgac cggcgtcgtc ctcttccggc 5341 agcttgcgcc ccgcgccaag ctcgacgcct tcttcgccct ggagggcttc ccgaccgagc 5401 cgaacagctc cagccgcgcc atccacgtgc accagttcgg ggacctgagc cagggctgcg 5461 agtccaccgg gccccactac aacccgctgg ccgtgccgca cccgcagcac ccgggcgact 5521 tcggcaactt cgcggtccgc gacggcagcc tctggaggta ccgcgccggc ctggccgcct 5581 cgctcgcggg cccgcactcc atcgtgggcc gggccgtggt cgtccacgct ggcgaggacg 5641 acctgggccg cggcggcaac caggccagcg tggagaacgg gaacgcgggc cggcggctgg 5701 cctgctgcgt ggtgggcgtg tgcgggcccg ggctctggga gcgccaggcg cgggagcact 5761 cagagcgcaa gaagcggcgg cgcgagagcg agtgcaaggc cgcctgagcg cggcccccac 5821 ccggcggcgg ccagggaccc ccgaggcccc cctctgcctt tgagcttctc ctctgctcca 5881 acagacacct tccactctga ggtctcacct tcgcctctgc tgaagtctcc ccgcagccct 5941 ctccacccag aggtctccct ataccgagac ccaccatcct tccatcctga ggaccgcccc 6001 aaccctcgga gccccccact cagtaggtct gaaggcctcc atttgtaccg aaacaccccg 6061 ctcacgctga cagcctccta ggctccctga ggtacctttc cacccagacc ctccttcccc 6121 accccataag ccctgagact cccgcctttg acctgacgat cttccccctt cccgccttca 6181 ggttcctcct aggcgctcag aggccgctct ggggggttgc ctcgagtccc cccacccctc 6241 cccacccacc accgctcccg cggcaagcca gcccgtgcaa cggaagccag gccaactgcc 6301 ccgcgtcttc agctgtttcg catccaccgc caccccactg agagctgctc ctttggggga 6361 atgtttggca acctttgtgt tacagattaa aaattcagca attcagtact gcgtcgaggt 6421 cttggttact tttttgtttg tttgttttag gcttctctcc caagctgagc ttttttttgt 6481 tttgttttcg ttttcctttt ttttcttttt tttgggagtg gcaaacatgc ttcccaaatc 6541 cctacaggac ttctccttat cctctgcccc cacctcccta accctgctgg caacaacgtt 6601 cagccactgc ttgtcttgcc cttcagtgtg gctccaagag gaagatcacc agaatcactc 6661 agggaagtta aaaaaaaaaa tacagcttcc tgggctacat cccagagctg tggaatccaa 6721 agggagaaga gaaagtgaat ttgcgacaag cgtcgggatg attctggcac tggaccctct 6781 ggcctgagag gggaagaggc cttccatctc acctgggctg gtagcttgtc acatctgcct 6841 ccgagtacag ccttaggtcc atttcccaga tatcagagac agtgccaggg aagccaggtg 6901 actgcatctt gcctaggcac agaagagtag ggttggaatg tgacgttgtt agcatttggc 6961 aggaccaaaa ccagaggcaa acggaggcag tgggatggaa aggcagttga ttttgatgaa 7021 ggcttgttgg gagttcagct ttcttttgaa acttataatc tatacccagg ctagaacagt 7081 cttgtgtata caccttcatt catggaataa acgtacttgc aataactttt tagcctccca 7141 gggtagcctc acttcctagc tgtgactttt ccaccctggt tactgggagg cagcttccat 7201 ttctcccaga ctagctaggc agtgcgtcca actgaaccgc agccagaaac ctgtctccag 7261 gggttatttt tacctctaac taggactaac ttattttaaa atctttcctt gagcccaagt 7321 gacaactgaa gagaaaggct attgcctggt gattttgctc caccagttgg ttctcactgg 7381 tttgaatact aacttgaact gtactcatcg acactgaaag gggatgagca aacagtgtct 7441 ctaaatctcc tgatcctgat ctcaaatatc cccctaatta caagttgcaa caaggcagct 7501 attacacggg gacacaggat ggagaggatg ggtgccaaac acccatcgtc tactctgctg 7561 cctcggttat ggtgaattca ggaccatcaa gggaggtgtg gacctttttt ttcagaagga 7621 ggctgacact tcttgtcaat tgcattgtgt tcttagtttt gctcttcaca acccttgacc 7681 ccgtagatgg gggctgaaga ggcaccctgg ccgactcact ctatttctgt tttgggaatg 7741 ggatggataa actatcccat ggcctccaga gccaaaaaac caaaacgaaa caaaacaaaa 7801 aaccccaaaa caaaaaagca aaaagcaaac aagaaaaaaa aaaaaagagg aaataatagg 7861 cagacaattt acagttcatt gtaagggcaa agatatgcat atagcatgat ggttaacagg 7921 tcaggctcag gtagaaaggc ccatttgaac cccagctctg ccacactcag aaactgtgtg 7981 acccgaacaa gtcacttaac ctctctgagc ataggtaaaa taagatcatc ataccagatt 8041 gttttgaaga ttaaatcaag tgttattcac gagaggtgca cagcatagca tgcacaacaa 8101 ataaggacct ggtaagtatc taattaataa caatggctaa gatccaaaaa acagctacct 8161 actaataaat agatggggct gccttgtaag gcagtgagca tcatgcaacc aggattcaaa 8221 tgaaggacag ttgctacctc tgaggttccc gagaaggatt tctcgatcca ttgagagact 8281 gaatgacatg aactctgcga tcccatctct tgtggggagg gaacctagaa tgaagggaag 8341 attgtgggcc ataaaggcag acatctggtt cctgggcaca gaaccatatg tgtgccacca 8401 aagccaccca ccggacccca cttggcccct ggagtctatt tttactcctc tcatcttaca 8461 agatctattt tgttaatctc cttatatttg ctgttttgac ttcccagcca gcttgctaat 8521 cagtttgcct atttgactca cagggtttgc atttgtcacg gggactgaaa cacacgcttg 8581 ttttgatttc tttttgtaaa ttagaagcgt tgatgtaatg actctaccta gacacagctg 8641 gtaaagtgag aataatgctc aagtttgcac agtttaaaca caatgtagac aataattaga 8701 aatgctatct ttagatgttt aggataagct tttctcagaa ttgcactgat tttttttttc 8761 tgagtggggc tttttagtgc atatatacag aaatactaaa aacgtaagaa aatagagcaa 8821 atcagtgagt gctttggtca acttgaaaga ctgcaggaaa taaaccaact gattttagat 8881 ctgccttttt ttgactgaat gcataaaatc tttacattct ccatattttt catgactacc 8941 atatgatcaa atagttttag gtgacagatt gcaactgata agttgctgca atatggcaga 9001 agtcatgctc agcctccgct tgcccggtgg tgagggtgga atatgaagca aacaataaag 9061 ataattcatc atctctatca ggaaaattgc cacatgttta tttcaggtaa caaaaaagat 9121 atagttatga tatacaatga ccatagaatc caataaagca acttctgcaa atgaatagaa 9181 ggtacttttt ctttaaatga aactacaaaa tagcagctgg ttttaaaaac aaagccaatt 9241 gttttagatt taataggcta ccactggcct ctgctaagat ccccaaatat attcctgagc 9301 tcacatagat tccagaaagt caaacttttc aatattatgc aaactttccc tatgcatcca 9361 aaaaattctc atttagtaaa gaggtgatat gaaatgtaag gcagcatgtc catatctatc 9421 attttaaatt gccttcatgc tgtatcaact ggttttgttt tgggaagcaa ccataatatt 9481 gagagacggg tctttcctat tttttctgct actcatttct aactagattc actacggagc 9541 tcccaattgc atctctctga tctacaaatt tttctctctt caggaagaca cctggaaaga 9601 agggactaca ttaaaggagt gtgttggggg caatgctttg gccttttgac atcctatcta 9661 gtctgaaggg accctcacta ttgctaagga ggaggagtgt tttaaatgga ggcttcagaa 9721 tgaaagcaga ggaagaaggt actctctttt tcaaaaagaa ggagggtaca ggccgggcgc 9781 agctgtcacg cctgcaatcc cagcactttg ggaggccgag gaaggcagat cacgaggttg 9841 ggagtttgag ccagcctggt caacatagtg aaaccccgtc tctactaaaa atacaaaaat 9901 tagccagcat ggtggtgcat gcctgtagtc ccagttactc gggaggctga ggcaggagaa 9961 tcgcttgaac tcgggaagtg gaggttgcag tgagccgaga tcatgccact gcactccacc 10021 ctgggtgaca gagtgagact ctcaaaaaaa aaaaaaaaaa aaaaaagaag tagggtacc // LOCUS HSU10117 1057 bp mRNA PRI 14-FEB-1995 DEFINITION Human endothelial-monocyte activating polypeptide II mRNA, complete cds. ACCESSION U10117 NID g498909 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1057) AUTHORS Kao,J., Houck,K., Fan,Y., Haehnel,I., Libutti,S.K., Kayton,M.L., Grikscheit,T., Chabot,J., Nowygrod,R., Greenberg,S., Kuang,W.-J., Leung,D.W., Hayward,J.R., Kisiel,W., Heath,M., Brett,J. and Stern,D.M. TITLE Characterization of a novel tumor-derived cytokine. Endothelial-monocyte activating polypeptide II JOURNAL J. Biol. Chem. 269 (40), 25106-25119 (1994) MEDLINE 95014290 REFERENCE 2 (bases 1 to 1057) AUTHORS Kao,J., Fan,Y., Haehnel,I., Brett,J., Greenberg,S., Clauss,M., Kayton,M., Houck,K., Kisiel,W., Seljelid,R., Burnier,J. and Stern,D. TITLE A peptide derived from the amino terminus of endothelial-monocyte-activating polypeptide II modulates mononuclear and polymorphonuclear leukocyte functions, defines an apparently novel cellular interaction site, and induces an acute inflammatory response JOURNAL J. Biol. Chem. 269 (13), 9774-9782 (1994) MEDLINE 94193665 REFERENCE 3 (bases 1 to 1057) AUTHORS Kao,J., Ryan,J., Brett,G., Chen,J., Shen,H., Fan,Y., Godman,G., Familletti,P.C., Wang,F., Pan,Y.E., Stern,D. and Clauss,M. TITLE Endothelial monocyte-activating polypeptide II. A novel tumor-derived polypeptide that activates host-response mechanisms JOURNAL J. Biol. Chem. 267 (28), 20239-20247 (1992) MEDLINE 93015897 REFERENCE 4 (bases 1 to 1057) AUTHORS Houck,K.A. TITLE Direct Submission JOURNAL Submitted (27-MAY-1994) Keith A. Houck, Molecular Biology, Sphinx Pharmaceuticals Corp., P.O. Box 52330, Durham, NC 27717, USA FEATURES Location/Qualifiers source 1..1057 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="U937" /cell_type="histiocytic lymphoma" mRNA 1..1057 CDS 50..988 /standard_name="EMAP II" /codon_start=1 /product="endothelial-monocyte activating polypeptide II" /db_xref="PID:g498910" /translation="MANNDAVLKRLEQKGAEADQIIEYLKQQVSLLKEKAILQATLRE EKKLRVENAKLKKEIEELKQELIQAEIQNGVKQIAFPSGTPLHANSMVSENVIQSTAV TTVSSGTKEQIKGGTGDEKKAKEKIEKKGEKKEKKQQSIAGSADSKPIDVSRLDLRIG CIITARKHPDADSLYVEEVDVGEIAPRTVVSGLVNHVPLEQMQNRMVILLCNLKPAKM RGVLSQAMVMCASSPEKIEILAPPNGSVPGDRITFDAFPGEPDKELNPKKKIWEQIQP DLHTNDECVATYKGVPFEVKGKGVCRAQTMSNSGIK" mat_peptide 488..985 /standard_name="EMAP II" /evidence=not_experimental /product="endothelial-monocyte activating polypeptide II" polyA_signal 1032..1037 BASE COUNT 371 a 183 c 252 g 251 t ORIGIN 1 ggaacccgtg gtcctccgct tcatgatttt ctgccgtctc ttggcaaaaa tggcaaataa 61 tgatgctgtt ctgaagagac tggagcagaa gggtgcagag gcagatcaaa tcattgaata 121 tcttaagcag caagtttctc tacttaagga gaaagcaatt ttgcaggcaa ctttgaggga 181 agagaagaaa cttcgagttg aaaatgctaa actgaagaaa gaaattgaag aactgaaaca 241 agagctaatt caggcagaaa ttcaaaatgg agtgaagcaa atagcatttc catctggtac 301 tccactgcac gctaattcta tggtttctga aaatgtgata cagtctacag cagtaacaac 361 cgtatcttct ggtaccaaag aacagataaa aggaggaaca ggagacgaaa agaaagcgaa 421 agagaaaatt gaaaagaaag gagagaagaa ggagaaaaaa cagcaatcaa tagctggaag 481 tgccgactct aagccaatag atgtttcccg tctggatctt cgaattggtt gcatcataac 541 tgctagaaaa caccctgatg cagattcttt gtatgtggaa gaagtagatg tcggagaaat 601 agccccaagg acagttgtca gtggcctggt gaatcatgtt cctcttgaac agatgcaaaa 661 tcggatggtg attttacttt gtaacctgaa acctgcaaag atgaggggag tattatctca 721 agcaatggtc atgtgtgcta gttcaccaga gaaaattgaa atcttggctc ctccaaatgg 781 gtctgttcct ggagacagaa ttacttttga tgctttccca ggagagcctg acaaggagct 841 gaatcctaag aagaagattt gggagcagat ccagcctgat cttcacacta atgatgagtg 901 tgtggctaca tacaaaggag ttccctttga ggtgaaaggg aagggagtat gtagggctca 961 aaccatgagc aacagtggaa tcaaataaaa tgcttccact accaaaagac attagagaaa 1021 accttaaaag taataaagag aaatatattt gtcactt // LOCUS HSU10273 2476 bp DNA PRI 01-FEB-1995 DEFINITION Human angiotensin II receptor type 2 subtype gene, complete cds. ACCESSION U10273 NID g607811 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2476) AUTHORS Koike,G., Horiuchi,M., Yamada,T., Szpirer,C., Jacob,H.J. and Dzau,V.J. TITLE Human type 2 angiotensin II receptor gene: cloned, mapped to the X chromosome, and its mRNA is expressed in the human lung JOURNAL Biochem. Biophys. Res. Commun. 203 (3), 1842-1850 (1994) MEDLINE 95032069 REFERENCE 2 (bases 1 to 2476) AUTHORS Koike,G. TITLE Direct Submission JOURNAL Submitted (01-JUN-1994) George Koike, Falk Cardiovascular Research Center, Stanford University School of Medicine, 300 Pasteur Drive, Stanford, CA 94305-5246, USA FEATURES Location/Qualifiers source 1..2476 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pGK1111" /chromosome="X" /sex="female" /cell_type="leukocyte" /tissue_type="blood" /dev_stage="adult" CDS 190..1281 /codon_start=1 /product="angiotensin II receptor type 2 subtype" /db_xref="PID:g607812" /translation="MKGNSTLATTSKNITSGLHFGLVNISGNNESTLNCSQKPSDKHL DAIPILYYIIFVIGFLVNIVVVTLFCCQKGPKKVSSIYIFNLAVADLLLLATLPLWAT YYSYRYDWLFGPVMCKVFGSFLTLNMFASIFFITCMSVDRYQSVIYPFLSQRRNPWQA SYIVPLVWCMACLSSLPTFYFRDVRTIEYLGVNACIMAFPPEKYAQWSAGIALMKNIL GFIIPLIFIATCYFGIRKHLLKTNSYGKNRITRDQVLKMAAAVVLAFIICWLPFHVLT FLDALAWMGVINSCEVIAVIDLALPFAILLGFTNSCVNPFLYCFVGNRFQQKLRSVFR VPITWLQGKRESMSCRKSSSLREMETFVS" BASE COUNT 684 a 448 c 458 g 886 t ORIGIN 1 gaattcttgt tttacaagcc atggctctgt ttcttaatgt tttctataat cactcacttt 61 ttttttgctt ttgacaaaca ttcaaaatgc taatgattca aggatgtcct cagctctgta 121 tgtgttctaa gagttctatg ttttttctcc acagaaggca taagaactag gagctgctga 181 catttcaata tgaagggcaa ctccaccctt gccactacta gcaaaaacat taccagcggt 241 cttcacttcg ggcttgtgaa catctctggc aacaatgagt ctaccttgaa ctgttcacag 301 aaaccatcag ataagcattt agatgcaatt cctattcttt actacattat atttgtaatt 361 ggatttctgg tcaatattgt cgtggttaca ctgttttgtt gtcaaaaggg tcctaaaaag 421 gtttctagca tatacatctt caacctcgct gtggctgatt tactcctttt ggctactctt 481 cctctatggg caacctatta ttcttataga tatgactggc tctttggacc tgtgatgtgc 541 aaagtttttg gttcttttct taccctgaac atgtttgcaa gcattttttt tatcacctgc 601 atgagtgttg ataggtacca atctgtcatc tacccctttc tgtctcaaag aagaaatccc 661 tggcaagcat cttatatagt tccccttgtt tggtgtatgg cctgtttgtc ctcattgcca 721 acattttatt ttcgagacgt cagaaccatt gaatacttag gagtgaatgc ttgcattatg 781 gctttcccac ctgagaaata tgcccaatgg tcagctggga ttgccttaat gaaaaatatc 841 cttggtttta ttatcccttt aatattcata gcaacatgct attttggaat tagaaaacac 901 ttactgaaga cgaatagcta tgggaagaac aggataaccc gtgaccaagt cctgaagatg 961 gcagctgctg ttgttctggc cttcatcatt tgctggcttc ccttccatgt tctgaccttc 1021 ctggatgctc tggcctggat gggtgtcatt aatagctgcg aagttatagc agtcattgac 1081 ctggcacttc cttttgccat cctcttggga ttcaccaaca gctgcgttaa tccgtttctg 1141 tattgttttg ttggaaaccg gttccaacag aagctccgca gtgtgtttag ggttccaatt 1201 acttggctcc aagggaaaag agagagtatg tcttgccgga aaagcagttc tcttagagaa 1261 atggagacct ttgtgtctta aacgtgagag caaaatgcat gtaatcaaca tggctacttg 1321 ctttgaggct caccagaatt atttttaagt ggttttaata aaataataaa atttccccta 1381 atcttttctg aatcttctga aaccaaatgt aactatgttt tatcgtccag tgactttcag 1441 gaattgccca ttgtttttct gatatgtttg tacaagattt tcattggtga gacatattta 1501 caacctagaa gtaactggtg atatatctca aattgtaatt aataatagat tgtgaataat 1561 gatttgggga ttcagatttc tctttgaaac atgcttgtgt ttcttagtgg ggttttatat 1621 ccatttttat caggatttcc tcttgaacca gaaccagtct ttcaactcat tgcatcattt 1681 acaagacaac attgtaagag agatgagcac ttctaagttg agtatattat aatagattag 1741 tactggatta ttcaggcttt aggcatatgc ttctttaaaa acgctataaa ttatattcct 1801 cttgcatttc acttgagtgg aggtttatag ttaatctata actacatatt gaatagggct 1861 aggaatatag attaaatcat actcctatgc tttagcttat ttttacagtt atagaaagca 1921 agatgtacta taacatagaa ttgcaatcta taatatttgt gtgttcacta aactctgaat 1981 aagcactttt taaaaaactt tctactcatt ttaatgattg tttaaaggtt tctattttct 2041 ctgatacttt tttgaaatca gtaaacactg tgtattgttg taaaatgtaa aggtcacttt 2101 tcacatcctt gactttttag atgtgctgct ttgatatata ggacattgat ttgattttta 2161 ttattaatgc tttggttctg ggttgtttcc taaaatatct gggtggctta aaaaaaactc 2221 tttaacttgt aataaaccct taactggcat aggaaatggt atccagaatg gaattttgct 2281 acatggggtc tgggtggggg caaagagacc cagtcaatta catgtttggt accaagaaag 2341 gaacctgtca gggcagtaca atgtgacttt gaaaatatat accgtggggg tagttttacc 2401 ctatatctat aaacactgtt tgttccagaa tctgtatgat tctatggagc tattttaaac 2461 caattgcagg tctaga // LOCUS HSU10301 3056 bp mRNA PRI 06-JUN-1995 DEFINITION Human glutamate receptor flip isoform (GluR3-flip) mRNA, complete cds. ACCESSION U10301 NID g507826 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3056) AUTHORS Rampersad,V., Elliott,C.E., Nutt,S.L., Foldes,R.L. and Kamboj,R.K. TITLE Human glutamate receptor hGluR3 flip and flop isoforms: cloning and sequencing of the cDNAs and primary structure of the proteins JOURNAL Biochim. Biophys. Acta 1219 (2), 563-566 (1994) MEDLINE 95002179 REFERENCE 2 (bases 1 to 3056) AUTHORS Rajender K. Kamboj.,TITLE Direct Submission. TITLE Direct Submission JOURNAL Submitted (02-JUN-1994) Rajender K. Kamboj, 6850 Goreway Drive, Allelix Biopharmaceuticals Inc., Mississauga, Ontario L4W 1V7, Canada FEATURES Location/Qualifiers source 1..3056 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="RKCH221 and RKCSFG34" /clone_lib="Stratagene libraries 936205 and 936206" /sex="female" 5'UTR <1..53 /evidence=experimental gene 54..2738 /gene="GluR3-flip" CDS 54..2738 /gene="GluR3-flip" /standard_name="hGluR3-flip" /codon_start=1 /evidence=experimental /product="glutamate receptor flip isoform" /db_xref="PID:g507827" /translation="MARQKKMGQSVLRAVFFLVLGLLGHSHGGFPNTISIGGLFMRNT VQEHSAFRFAVQLYNTNQNTTEKPFHLNYHVDHLDSSNSFSVTNAFCSQFSRGVYAIF GFYDQMSMNTLTSFCGALHTSFVTPSFPTDADVQFVIQMRPALKGAILSLLGHYKWEK FVYLYDTERGFSILQAIMEAAVQNNWQVTARSVGNIKDVQEFRRIIEEMDRRQEKRYL IDCEVERINTILEQVVILGKHSRGYHYMLANLGFTDILLERVMHGGANITGFQIVNNE NPMVQQFIQRWVRLDEREFPEAKNAPLKYTSALTHDAILVIAEAFRYLRRQRVDVSRR GSAGDCLANPAVPWSQGIDIERALKMVQVQGMTGNIQFDTYGRRTNYTIDVYEMKVSG SRKAGYWNEYERFVPFSDQQISNDSASSENRTIVVTTILESPYVMYKKNHEQLEGNER YEGYCVDLAYEIAKHVRIKYKLSIVGDGKYGARDPETKIWNGMVGELVYGRADIAVAP LTITLVREEVIDFSKPLMSLGISIMIKKPQKSKPGVFSFLDPLAYEIWMCIVFAYIGV SVVLFLVSRFSPYEWHLEDNNEEPRDPQSPPDPPNEFGIFNSLWFSLGAFMQQGCDIS PRSLSGRIVGGVWWFFTLIIISSYTANLAAFLTVERMVSPIESAEDLAKQTEIAYGTL DSGSTKEFFRRSKIAVYEKMWSYMKSAEPSVFTKTTADGVARVRKSKGKFAFLLESTM NEYIEQRKPCDTMKVGGNLDSKGYGVATPKGSALGTPVNLAVLKLSEQGILDKLKNKW WYDKGECGAKDSGSKDKTSALSLSNVAGVFYILVGGLGLAMMVALIEFCYKSRAESKR MKLTKNTQNFKPAPATNTQNYATYREGYNVYGTESVKI" 3'UTR 2739..>3056 /evidence=experimental BASE COUNT 895 a 649 c 733 g 779 t ORIGIN 1 tgacgactcc tgagttgcgc ccatgctctt gtcagcttcg ttttaggcgt agcatggcca 61 ggcagaagaa aatggggcaa agcgtgctcc gggcggtctt ctttttagtc ctggggcttt 121 tgggtcattc tcacggagga ttccccaaca ccatcagcat aggtggactt ttcatgagaa 181 acacagtgca ggagcacagc gctttccgct ttgccgtgca gttatacaac accaaccaga 241 acaccaccga gaagcccttc catttgaatt accacgtaga tcacttggat tcctccaata 301 gtttttccgt gacaaatgct ttctgctccc agttctcgag aggggtgtat gccatctttg 361 gattctatga ccagatgtca atgaacaccc tgacctcctt ctgtggggcc ctgcacacat 421 cctttgttac gcctagcttc cccactgacg cagatgtgca gtttgtcatc cagatgcgcc 481 cagccttgaa gggcgctatt ctgagtcttc tgggtcatta caagtgggag aagtttgtgt 541 acctctatga cacagaacga ggattttcca tcctccaagc gattatggaa gcagcagtgc 601 aaaacaactg gcaagtaaca gcaaggtctg tgggaaacat aaaggacgtc caagaattca 661 ggcgcatcat tgaagaaatg gacaggaggc aggaaaagcg atacttgatt gactgcgaag 721 tcgaaaggat taacacaatt ttggaacagg ttgtgatcct agggaaacac tcaagaggtt 781 atcactacat gctcgctaac ctgggtttta ctgatatttt actggaaaga gtcatgcatg 841 ggggagccaa cattacaggt ttccagattg tcaacaatga aaaccctatg gttcagcagt 901 tcatacagcg ctgggtgagg ctggatgaaa gggaattccc tgaagccaag aatgcaccac 961 taaagtatac atctgcattg acacacgacg caatactggt catagcagaa gctttccgct 1021 acctgaggag gcagcgagta gatgtgtccc ggagaggaag tgctggagac tgcttagcaa 1081 atcctgctgt gccctggagt caaggaattg atattgagag agctctgaaa atggtgcaag 1141 tacaaggaat gactggaaat attcaatttg acacttatgg acgtaggaca aattatacca 1201 tcgatgtgta tgaaatgaaa gtcagtggct ctcgaaaagc tggctactgg aacgagtatg 1261 aaaggtttgt gcctttctca gatcagcaaa tcagcaatga cagtgcatcc tcagagaatc 1321 ggaccatagt agtgactacc attctggaat caccatatgt aatgtacaag aagaaccatg 1381 agcaactgga aggaaatgaa cgatatgaag gctattgtgt agacctagcc tatgaaatag 1441 ccaaacatgt aaggatcaaa tacaaattgt ccatcgttgg tgacgggaaa tatggtgcaa 1501 gggatccaga gactaaaata tggaacggca tggttgggga acttgtctat gggagagctg 1561 atatagctgt tgctccactc actataacat tggtccgtga agaagtcata gatttttcaa 1621 agccattaat gagcctgggc atctccatca tgataaagaa gcctcagaaa tcaaaaccag 1681 gcgtattctc atttctggat cccctggctt atgaaatctg gatgtgcatt gtctttgctt 1741 acattggagt cagcgtagtt cttttcctag tcagcaggtt cagtccttat gaatggcact 1801 tggaagacaa caatgaagaa cctcgtgacc cacaaagtcc tcctgatcct ccaaatgaat 1861 ttggaatatt taacagtctt tggttttcct tgggtgcctt tatgcagcaa ggatgtgata 1921 tttctccaag atcactctcc gggcgcattg ttggaggggt ttggtggttc ttcaccctga 1981 tcataatttc ttcctatact gccaatctcg ctgctttcct gactgtggag aggatggttt 2041 ctcccataga gagtgctgaa gacttagcta aacagactga aattgcatat gggaccctgg 2101 actccggttc aacaaaagaa tttttcagaa gatccaaaat tgctgtgtac gagaaaatgt 2161 ggtcttacat gaaatcagcg gagccatctg tgtttaccaa aacaacagca gacggagtgg 2221 cccgagtgcg aaagtccaag ggaaagttcg ccttcctgct ggagtcaacc atgaatgagt 2281 acattgagca gagaaaacca tgtgatacga tgaaagttgg tggaaatctg gattccaaag 2341 gctatggtgt ggcaacccct aaaggctcag cattaggaac gcctgtaaac cttgcagtat 2401 tgaaactcag tgaacaaggc atcttagaca agctgaaaaa caaatggtgg tacgataagg 2461 gggaatgtgg agccaaggac tccgggagta aggacaagac cagcgctctg agcctgagca 2521 atgtggcagg cgttttctat atacttgtcg gaggtctggg gctggccatg atggtggctt 2581 tgatagaatt ctgttacaaa tcacgggcag agtccaaacg catgaaactc acaaagaaca 2641 cccaaaactt taagcctgct cctgccacca acactcagaa ttatgctaca tacagagaag 2701 gctacaacgt gtatggaaca gagagtgtta agatctaggg atcccttccc actggaggca 2761 tgtgatgaga ggaaatcacc gaaaacgtgg ctgcttcaag gatcctgagc cagatttcac 2821 tctccttggt gtcgggcatg acacgaatat tgctgatggt gcaatgacct ttcaatagga 2881 aaaactgatt ttttttttcc ttcagtgcct tatggaacac tctgagactc gcgacaatgc 2941 aaaccatcat tgaaatcttt ttgctttgct tgaaaaaaaa taattaaaat aaaaaccaac 3001 aaaaatggac atgcatcaaa cccttgatgt attaatattt attatagttt tcatta // LOCUS HSU10323 1552 bp mRNA PRI 25-AUG-1994 DEFINITION Human nuclear factor NF45 mRNA, complete cds. ACCESSION U10323 NID g532312 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1552) AUTHORS Kao,P.N., Chen,L., Brock,G., Ng,J., Kenny,J., Smith,A.J. and Corthesy,B. TITLE Cloning and expression of cyclosporin A- and FK506-sensitive nuclear factor of activated T-cells: NF45 and NF90 JOURNAL J. Biol. Chem. 269, 20691-20699 (1994) MEDLINE 94327652 REFERENCE 2 (bases 1 to 1552) AUTHORS Kao,P.N. TITLE Direct Submission JOURNAL Submitted (02-JUN-1994) Peter N. Kao, Pulmonary and Critical Care Medicine, Stanford University, MSLS P312, Stanford, CA 94305-5487, USA FEATURES Location/Qualifiers source 1..1552 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pBS II KS+ NF45.3" /clone_lib="2H Stim Jurkat T-cells, lambda gt11 cDNA library from J.N. Northrop" /cell_line="Jurkat T-cell" CDS 40..1260 /codon_start=1 /function="involved in DNA-binding with NF90 protein to NF-AT target DNA sequence in interleukin-2 enhancer" /product="NF45 protein" /db_xref="PID:g532313" /translation="MRGDRGRGRGGRFGSRGGPGGGFRPFVPHIPFDFYLCEMAFPRV KPAPDETSFSEALLKRNQDLAPNSAEQASILSLVTKINNVIDNLIVAPGTFEVQIEEV RQVGSYKKGTMTTGHNVADLVVILKILPTLEAVAALGNKVVESLRAQDPSEVLTMLTN ETGFEISSSDATVKILITTVPPNLRKLDPELHLDIKVLQSALAAIRHARWFEENASQS TVKVLIRLLKDLRIRFPGFEPLTPWILDLLGHYAVMNNPTRQPLALNVAYRRCLQILA AGLFLPGSVGITDPCESGNFRVHTVMTLEQQDMVCYTAQTLVRILSHGGFRKILGQEG DASYLASEISTWDGVIVTPSEKAYEKPPEKKEGEEEEENTERTTSRRGRRKHGNSGVT FPSLLFLPKGKTGA" BASE COUNT 422 a 357 c 374 g 399 t ORIGIN 1 cggttggtgc ggcctccatt gttcgtgttt taaggcgcca tgaggggtga cagaggccgt 61 ggtcgtggtg ggcgctttgg ttccagagga ggcccaggag gagggttcag gccctttgta 121 ccacatatcc catttgactt ctatttgtgt gaaatggcct ttccccgggt caagccagca 181 cctgatgaaa cttccttcag tgaggccttg ctgaagagga atcaggacct ggctcccaat 241 tctgctgaac aggcatctat cctttctctg gtgacaaaaa taaacaatgt gattgataat 301 ctgattgtgg ctccagggac atttgaagtg caaattgaag aagttcgaca ggtgggatcc 361 tataaaaagg ggacaatgac tacaggacac aatgtggctg acctggtggt gatactcaag 421 attctgccaa cgttggaagc tgttgctgcc ctggggaaca aagtcgtgga aagcctaaga 481 gcacaggatc cttctgaagt tttaaccatg ctgaccaacg aaactggctt tgaaatcagt 541 tcttctgatg ctacagtgaa gattctcatt acaacagtgc cacccaatct tcgaaaactg 601 gatccagaac tccatttgga tatcaaagta ttgcagagtg ccttagcagc catccgacat 661 gcccgctggt tcgaggaaaa tgcttctcag tccacagtta aagttctcat cagactactg 721 aaggacttga ggattcgttt tcctggcttt gagcccctca caccctggat ccttgaccta 781 ctaggccatt atgctgtgat gaacaacccc accagacagc ctttggccct aaacgttgca 841 tacaggcgct gcttgcagat tctggctgca ggactgttcc tgccaggttc agtgggtatc 901 actgacccct gtgagagtgg caactttaga gtacacacag tcatgaccct agaacagcag 961 gacatggtct gctatacagc tcagactctc gtccgaatcc tctcacatgg tggctttagg 1021 aagatccttg gccaggaggg tgatgccagc tatcttgctt ctgaaatatc tacctgggat 1081 ggagtgatag taacaccttc agaaaaggct tatgagaagc caccagagaa gaaggaagga 1141 gaggaagaag aggagaatac agaaagaacc acctcaagga gaggaagaag aaagcatgga 1201 aactcaggag tgacattccc ttcactcctt ttcctaccca agggaaagac tggagcctaa 1261 gctgcctgct actggcttta catggtgaca gacattccgt ggataggaag atagcaggag 1321 aaagtaactc catagagtgt cattccactg gttgatattg gcttagctgc cagtctccca 1381 tttgtgacct atgccatcca tctataatgg aggataccaa catttcttcc taatattcta 1441 taatctccaa ctcctgaaaa cccctctctc aactaatact ttgctgttga aatgttgtga 1501 aatgttaagt gtctggaaat ttttttttct aagaaaaact attaaagtac tt // LOCUS HSU10324 3505 bp mRNA PRI 25-AUG-1994 DEFINITION Human nuclear factor NF90 mRNA, complete cds. ACCESSION U10324 NID g532314 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3505) AUTHORS Kao,P.N., Chen,L., Brock,G., Ng,J., Kenny,J., Smith,A.J. and Corthesy,B. TITLE Cloning and expression of cyclosporin A- and FK506-sensitive nuclear factor of activated T-cells: NF45 and NF90 JOURNAL J. Biol. Chem. 269, 20691-20699 (1994) MEDLINE 94327652 REFERENCE 2 (bases 1 to 3505) AUTHORS St Johnston,D., Brown,N.H., Gall,J.G. and Jantsch,M. TITLE A conserved double-stranded RNA-binding domain JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (22), 10979-10983 (1992) MEDLINE 93066367 REFERENCE 3 (bases 1 to 3505) AUTHORS Kao,P.N. TITLE Direct Submission JOURNAL Submitted (02-JUN-1994) Peter N. Kao, Pulmonary and Critical Care Medicine, Stanford University, MSLS P312, Stanford, CA 94305-5487, USA FEATURES Location/Qualifiers source 1..3505 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pBS II KS+ NF90.3" /clone_lib="2H Stim Jurkat T-cells, lambda gt11 cDNA library from J.N. Northrop" /cell_line="Jurkat T-cell" CDS 265..2280 /codon_start=1 /function="involved in DNA-binding with NF45 protein to NF-AT target DNA sequence in interleukin-2 enhancer" /product="NF90 protein" /db_xref="PID:g532315" /translation="MRPMRIFVNDDRHVMAKHSSVYPTQEELEAVQNMVSHTERALKA VSDWIHEQEKGSSEQAESDNMDVPPEDDSKEGAGEQKTEHMTRTCRGVMRAGPGGQSA SYSRGTWIWSWCCCVRRSPQPALLDKVADNLAIQLAAVTEDKYEILQSVDDAAIVIKN TKEPPLSLTIHLTSPVVREEMEKVLAGETLSVNDPPDVLDRQKCFAALASLRHAKWFQ ARANGLKSCVIVIRVLRDLCTRVPTWGPLRGWPLELLCEKSIGTANRPMGAGEALRRV LECLASGIVMPDGSGIYDPCEKEATDAIGHLDRQQREDITQSAQHALRLAAFGQLHKV LGMDPLPSKMPKKPKNENPVDYTVQIPPSTTYAITPMKRPMEEDGEEKSPSKKKKKIQ KKEEKAEPPQAMNALMRLNQLKPGLQYKLVSQTGPVHAPIFTMSVEVDGNSFEASGPS KKTAKLHVAVKVLQDMGLPTGAEGRDSSKGEDSAEETEAKPAVVAPAPVVEAVSTPSA AFPSDATAENVKQQGPILTKHGKNPVMELNEKRRGLKYELISETGGSHDKRFVMEVEV DGQKFQGAGSNKKVAKAYAALAALEKLFPDTPLSPLMPTKRREPQYPSEGDRNLLLSH ITLASAWEAPCTTKCPHPPTFEGGEEAGRSGDEGAGEDLVAPTMEAT" misc_structure 1519..1656 /note="encodes double-stranded RNA-binding domain" /citation=[2] misc_structure 1867..2076 /note="encodes double-stranded RNA-binding domain" /citation=[2] BASE COUNT 866 a 958 c 1011 g 670 t ORIGIN 1 cgccgcctgc ccgcccgccc gctcgccccc ggtccggact cctcctcctc ctcttctcgc 61 attgcagttg aacccagcag cccgccccac cggtggcttt tgggggcaga ccccggcggc 121 tgtggcagga gggcggcggc ggcggctgcg gtcgaagaag gggacgccga caagagttga 181 agtattgata acaccaagga actctatcac aatttgaaaa gataagcaaa agtttgattt 241 ccagacacta cagaagaagt aaaaatgcgt ccaatgcgaa tttttgtgaa tgatgaccgc 301 catgtgatgg caaagcattc ttccgtttat ccaacacaag aggagctgga ggcagtccag 361 aacatggtgt cccacacgga gcgggcgctc aaagctgtgt ccgactggat acacgagcag 421 gaaaagggta gcagcgagca ggcagagtcc gataacatgg atgtgccccc agaggacgac 481 agtaaagaag gggctgggga acagaagacg gagcacatga ccagaacctg tcggggagtg 541 atgcgggctg ggcctggtgg ccaaagtgcc tcctactcaa gggggacttg gatctggagc 601 tggtgctgct gtgtaaggag aagcccacaa ccggccctcc tggacaaggt ggccgacaac 661 ctggccatcc agcttgctgc tgtaacagaa gacaagtacg aaatactgca atctgtcgac 721 gatgctgcga ttgtgataaa aaacacaaaa gagcctccat tgtccctgac catccacctg 781 acatcccctg ttgtcagaga agaaatggag aaagtattag ctggagaaac gctatcagtc 841 aacgaccccc cggacgttct ggacaggcag aaatgctttg ctgccttggc gtccctccga 901 cacgccaagt ggttccaggc cagagccaac gggctgaagt cttgtgtcat tgtgatccgg 961 gtcttgaggg acctgtgcac tcgcgtgccc acctggggtc ccctccgagg ctggcctctc 1021 gagctcctgt gtgagaaatc cattggcacg gccaacagac cgatgggtgc tggcgaggcc 1081 ctgcggagag tgctggagtg cctggcgtcg ggcatcgtga tgccagatgg ttctggcatt 1141 tatgaccctt gtgaaaaaga agccactgat gctattgggc atctagacag acagcaacgg 1201 gaagatatca cacagagtgc gcagcacgca ctgcggctcg ccgcgttcgg ccagctccat 1261 aaagtcctag gcatggaccc tctgccttcc aagatgccca agaaaccaaa gaatgaaaac 1321 ccagtggact acaccgttca gatcccacca agcaccacct atgccattac gcccatgaaa 1381 cgcccaatgg aggaggacgg ggaggagaag tcgcccagca aaaagaagaa gaagattcag 1441 aagaaagagg agaaggcaga gcccccccag gctatgaatg ccctgatgcg gttgaaccag 1501 ctgaagccag ggctgcagta caagctggtg tcccagactg ggcccgtcca tgcccccatc 1561 tttaccatgt ctgtggaggt tgatggcaat tcattcgagg cctctgggcc ctccaaaaag 1621 acggccaagc tgcacgtggc cgttaaggtg ttacaggaca tgggcttgcc gacgggtgct 1681 gaaggcaggg actcgagcaa gggggaggac tcggctgagg agaccgaggc gaagccagca 1741 gtggtggccc ctgccccagt ggtagaagct gtctccaccc ctagtgcggc ctttccctca 1801 gatgccactg ccgagaacgt aaaacagcag gggccgatcc tgacaaagca cggcaagaac 1861 ccagtcatgg agctgaacga gaagaggcgt gggctcaagt acgagctcat ctccgagacc 1921 gggggcagcc acgacaagcg cttcgtcatg gaggtcgaag tggatggaca gaagttccaa 1981 ggtgctggtt ccaacaaaaa ggtggcgaag gcctacgctg ctcttgctgc cctagaaaag 2041 cttttccctg acacccctct ctcgcccttg atgccaacaa aaagaagaga gccccagtac 2101 ccgtcagagg gggaccgaaa tttgctgcta agccacataa ccctggcttc ggcatgggag 2161 gccccatgca caacgaagtg cccccacccc ccaaccttcg agggcgggga agaggcggga 2221 cgatccgggg acgagggcgc gggcgaggat ttggtggcgc caaccatgga ggctacatga 2281 atgccggtgc tgggtatgga agctatgggt acggaggcaa ctctgcgaca gcaggctaca 2341 gtcagttcta cagcaacgga gggcattctg ggaatgccag tggcggtggc ggcgggggcg 2401 gtggtggctc ctccggctat ggctcctact accaaggtga caactacaac tcaccggtgc 2461 ccccaaaaca cgctgggaag aagcagccgc acgggggcca gcagaagccc tcctacggct 2521 cgggctacca gtcccaccag ggccagcagc agtcctacaa ccagagcccc tacagcaact 2581 atggccctcc acagggcaag cagaaaggct ataaccatgg acaaggcagc tactcctact 2641 cgaactccta caactctccc gggggcgggc gcggatccga ctacaactac gagagcaaat 2701 tcaactacag tggtagtgga ggccgaagcg gcgggaacag ctacggctca ggcggggcat 2761 cctacaaccc agggtcacac gggggctacg gcggaggttc tgggggcggc tcctcatacc 2821 aaggcaaaca aggaggctgc tcacagtcga actacagctc ccggggtccg gccagaacta 2881 cagtggccct cccagctcct accagtcctc acaaggcggc tatggcagaa acgcagacca 2941 cagcatgaac taccagtaca gataagcccc gcgcggagat ttctaccttc tgcacttact 3001 ccccatcaga agatcgagtt ttatgcatca cagttaacat gtcagctgcc tgcgctccag 3061 gcccccgccc ccatcccgtc cacgttgctg tgtcgtgagg tgcagcgggt caccctgtgg 3121 cccgtcctgt gacccatatt tagccgtgtt tgggactccg tgtcttcaat ggtttgttag 3181 ttgccattac aactttgtct gggtagagtt tttgagtttt tgcagttcag tatccctctg 3241 tctattcaca cttcgtgtta gtggtaactc agtttgtctt taaatagtta cagaagggat 3301 acgtcatttg ttaatgcttt ttgttgaagt gagttaaacg agcttttctg tattttaatg 3361 ctttagtgtt tcagttttat aagtgaagat tttattttaa aaaccagtgg gaaagagtgg 3421 ggggtttctt tttatgtctg ggtcattcag gcagtacatc tgaattaagc tgaatgtaga 3481 caaataaaga aaaacaaaac tgaaa // LOCUS HSU10339 945 bp mRNA PRI 14-JUN-1994 DEFINITION Human MAGE-3b mRNA, complete cds. ACCESSION U10339 NID g499121 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 945) AUTHORS Fenton,R.G. TITLE Cloning and Analysis of MAGE-1 Related Genes JOURNAL Unpublished REFERENCE 2 (bases 1 to 945) AUTHORS Fenton,R.G. TITLE Direct Submission JOURNAL Submitted (03-JUN-1994) Robert G. Fenton, BRMP, NCI-FCRDC, Frederick, MD 21702, USA FEATURES Location/Qualifiers source 1..945 /organism="Homo sapiens" /note="cancer patient" /db_xref="taxon:9606" /clone="MAGE-3b" /clone_lib="DM150 library" /haplotype="HLA-A1/A2" /cell_line="DM150" /cell_type="melanoma" /tissue_type="skin" /dev_stage="adult" CDS 1..945 /codon_start=1 /product="MAGE-3b" /db_xref="PID:g499122" /translation="MPLEQRSQHCKPEEGLEARGEALGLVGAQAPATEEQEAASSSST LVEVTLGEVPAAESPDPPQSPQGASSLPTTMNYPLWSQSYEDSSNQEEEGPSTFPDLE SEFQAALSRKVAKLVHFLLLKYRAREPVTKAEMLGSVVGNWQYFFPVIFSKASDSLQL VFGIELMEVDPIGHVYIFATCLGLSYDGLLGDNQIMPKTGFLIIILAIIAKEGDCAPE EKIWEELSVLEVFEGREDSIFGDPKKLLTQYFVQENYLEYRQVPGSDPACYEFLWGPR ALIETSYVKVLHHMVKISGGPRISYPLLHEWALREGEE" BASE COUNT 213 a 254 c 275 g 203 t ORIGIN 1 atgcctcttg agcagaggag tcagcactgc aagcctgaag aaggccttga ggcccgagga 61 gaggccctgg gcctggtggg tgcgcaggct cctgctactg aggagcagga ggctgcctcc 121 tcctcttcta ctctagttga agtcaccctg ggggaggtgc ctgctgccga gtcaccagat 181 cctccccaga gtcctcaggg agcctccagc ctccccacta ccatgaacta ccctctctgg 241 agccaatcct atgaggactc cagcaaccaa gaagaggagg ggccaagcac cttccctgac 301 ctggagtctg agttccaagc agcactcagt aggaaggtgg ccaagttggt tcattttctg 361 ctcctcaagt atcgagccag ggagccggtc acaaaggcag aaatgctggg gagtgtcgtc 421 ggaaattggc agtacttctt tcctgtgatc ttcagcaaag cttccgattc cttgcagctg 481 gtctttggca tcgagctgat ggaagtggac cccatcggcc acgtgtacat ctttgccacc 541 tgcctgggcc tctcctacga tggcctgctg ggtgacaatc agatcatgcc caagacaggc 601 ttcctgataa tcatcctggc cataatcgca aaagagggcg actgtgcccc tgaggagaaa 661 atctgggagg agctgagtgt gttagaggtg tttgagggga gggaagacag tatcttcggg 721 gatcccaaga agctgctcac ccaatatttc gtgcaggaaa actacctgga gtaccggcag 781 gtccccggca gtgatcctgc atgctatgag ttcctgtggg gtccaagggc cctcattgaa 841 accagctatg tgaaagtcct gcaccatatg gtaaagatca gtggaggacc tcgcatttcc 901 tacccactcc tgcatgagtg ggctttgaga gagggggaag agtga // LOCUS HSU10360 750 bp DNA PRI 18-NOV-1994 DEFINITION Human interferon-gamma gene, complete cds. ACCESSION U10360 NID g551490 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 750) AUTHORS Realini,C., Dubiel,W., Pratt,G., Ferrell,K. and Rechsteiner,M. TITLE Molecular cloning and expression of a gamma-interferon-inducible activator of the multicatalytic protease JOURNAL J. Biol. Chem. 269 (32), 20727-20732 (1994) MEDLINE 94327656 REFERENCE 2 (bases 1 to 750) AUTHORS Realini,C.A. TITLE Direct Submission JOURNAL Submitted (06-JUN-1994) Claudio A. Realini, Biochemistry, University of Utah, School of Medicine, 317 Wintrobe, Salt Lake City, UT 84132, USA FEATURES Location/Qualifiers source 1..750 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="erythrocyte" /tissue_type="blood" CDS 1..750 /note="identical to interferon-gamma coding sequence from GenBank Accession Number L07633" /codon_start=1 /product="interferon-gamma" /db_xref="PID:g551491" /translation="MAMLRVQPEAQAKVDVFREDLCTKTENLLGSYFPKKISELDAFL KEPALNEANLSNLKAPLDIPVPDPVKEKEKEERKKQQEKEDKDEKKKGEDEDKGPPCG PVNCNEKIVVLLQRLKPEIKDVIEQLNLVTTWLQLQIPRIEDGNNFGVAVQEKVFELM TSLHTKLEGFHTQISKYFSERGDAVTKAAKQPHVGDYRQLVHELDEAEYRDIRLMVME IRNAYAVLYDIILKNFEKLKKPRGETKGMIY" BASE COUNT 217 a 166 c 216 g 151 t ORIGIN 1 atggccatgc tcagggtcca gcccgaggcc caagccaagg tggatgtgtt tcgtgaagac 61 ctctgtacca agacagagaa cctgctcggg agctatttcc ccaagaagat ttctgagctg 121 gatgcatttt taaaggagcc agctctcaat gaagccaact tgagcaatct gaaggcccca 181 ttggacatcc cagtgcctga tccagtcaag gagaaagaga aagaggagcg gaagaaacag 241 caggagaagg aagacaagga tgaaaagaag aagggggagg atgaagacaa aggtcctccc 301 tgtggcccag tgaactgcaa tgaaaagatc gtggtccttc tgcagcgctt gaagcctgag 361 atcaaggatg tcattgagca gctcaacctg gtcaccacct ggttgcagct gcagatacct 421 cggattgagg atggtaacaa ttttggagtg gctgtccagg agaaggtgtt tgagctgatg 481 accagcctcc acaccaagct agaaggcttc cacactcaaa tctctaagta tttctctgag 541 cgtggtgatg cagtgactaa agcagccaag cagccccatg tgggtgatta tcggcagctg 601 gtgcacgagc tggatgaggc agagtaccgg gacatccggc tgatggtcat ggagatccgc 661 aatgcttatg ctgtgttata tgacatcatc ctgaagaact tcgagaagct caagaagccc 721 aggggagaaa caaagggaat gatctattga // LOCUS HSU10362 1407 bp mRNA PRI 06-JUL-1994 DEFINITION Human GP36b glycoprotein mRNA, complete cds. ACCESSION U10362 NID g505651 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1407) AUTHORS Hartmann,E., Reimann,B., Goerlich,D., Rapoport,T.A. and Prehn,S. TITLE Human GP36b glycoprotein of the endoplasmic reticulum JOURNAL Unpublished REFERENCE 2 (bases 1 to 1407) AUTHORS Hartmann,E. TITLE Direct Submission JOURNAL Submitted (05-JUN-1994) Enno Hartmann, Max-Delbrueck-Centrum fuer Molekulare Medizin, Robert-Roessle-Str. 10, Berlin 12135, Germany FEATURES Location/Qualifiers source 1..1407 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="HeLa" /tissue_type="endoplasmic reticulum" sig_peptide 1..134 /evidence=experimental CDS 1..1071 /note="similar to canine VIP36, coding sequence in GenBank Accession Number X76392, and to human ERGIC53 gene product, coding sequence in GenBank Accession Number X71661" /codon_start=1 /product="GP36b glycoprotein" /db_xref="PID:g505652" /translation="MAAEGWIWRWGWGRRCLGRPGLLGPGPGPTTPLFLLLLLGSVTA DITDGNSEHLKREHSLIKPYQGVGSSSMPLWDFQGSTMLTSQYVRLTPDERSKEGSIW NHQPCFLKDWEMHVHFKVHGTGKKNLHGDGIALWYTRDRLVPGPVFGSKDNFHGLAIF LDTYPNDETTERVFPYISVMVNNGSLSYDHSKDGRWTELAGCTADFRNRDHDTFLAVR YSRGRLTVMTDLEDKNEWKNCIDITGVRLPTGYYFGASAGTGDLSDNHDIISMKLFQL MVEHTPDEESIDWTKIEPSVNFLKSPKDNVDDPTGNFRSGPLTGWRVFLLLLCALLGI VVCAVVGAVVFQKRQERNKRFY" mat_peptide 135..1068 misc_feature 549 /product="N-linked glycosylation" misc_feature 969..1037 /product="membrane anchor" BASE COUNT 281 a 413 c 426 g 287 t ORIGIN 1 atggcggcgg aaggctggat ttggcgttgg ggctggggcc ggcggtgcct gggaaggcct 61 gggcttctcg gccccggccc tggccccact acacctctct ttcttctttt gttgttgggg 121 tctgtgactg cggatataac tgacggcaac agtgaacatc tcaagcggga gcattcgctc 181 attaagccct accaaggggt cggttccagc tctatgcccc tctgggactt ccagggcagc 241 actatgctca cgagccagta cgtacgtctg acccctgacg agcgcagcaa agagggctct 301 atctggaacc accagccgtg cttcctcaaa gactgggaaa tgcacgtcca cttcaaagtc 361 cacggcacag ggaagaagaa cctccatgga gacggcatcg ccttgtggta cacccgggac 421 cgcctcgtgc cagggcctgt gtttggaagc aaagataact tccacggctt agccatcttc 481 ctggacacct accccaatga tgagaccact gagcgcgtgt tcccgtacat ctcggtgatg 541 gtgaacaatg gctccctgtc ctacgaccac agcaaggatg ggcgctggac cgagctggcg 601 ggctgcacgg ctgacttccg caaccgcgat cacgacacct tcctggctgt gcgctactcc 661 cggggccgtc tgacggtgat gaccgacctg gaggacaaga acgagtggaa gaactgcatt 721 gacatcacgg gagtgcgcct gcccaccggc tactacttcg gggcctccgc cggcaccggc 781 gacctgtctg acaatcatga catcatctcc atgaagctgt tccagctgat ggtggagcac 841 acgcccgacg aggagagcat cgactggacc aagatcgagc ccagcgtcaa cttcctcaag 901 tcgcccaaag acaacgtgga cgaccccacg gggaacttcc gcagcgggcc cctgacgggg 961 tggcgggtgt tcctgctgct gctgtgcgct ctcctgggca tcgttgtctg cgccgtggtg 1021 ggggccgtgg tgttccagaa gcggcaggag cggaacaagc gcttctactg agtggcgcct 1081 ccggcggggc ctgtccctgg gcccaggagc caatgtgaac tttttttttt accgggatta 1141 taaaagaaca acaagatgac cttatttctt aactgtttca aataaatgat taaagtattt 1201 tcatacattt tgcttcttgc ccagcaggga caggtggcag agccgaggct tagggtctgg 1261 caccccccac agctggagac ggaggctctc ctggggctgg tgtctcagga gcaggggtct 1321 gtgtctacag atgggctgtg gcccctgcag gcagctgttg aacactggag ggtcccccgg 1381 accacactgg ggtgggctcc tgaggac // LOCUS HSU10417 3779 bp mRNA PRI 26-NOV-1997 DEFINITION Homo sapiens ileal sodium-dependent bile acid transporter (SLC10-A2) mRNA, complete cds. ACCESSION U10417 NID g2623285 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3779) AUTHORS Wong,M.H., Oelkers,P. and Dawson,P.A. TITLE Identification of a mutation in the ileal sodium-dependent bile acid transporter gene that abolishes transport activity JOURNAL J. Biol. Chem. 270 (45), 27228-27234 (1995) MEDLINE 96070831 REFERENCE 2 (bases 1 to 3779) AUTHORS Craddock,A.L., Love,M.W., Daniel,R.W., Kirby,L.C., Walters,H.C., Wong,M.H. and Dawson,P.A. TITLE Expression and transport properties of the human ileal and renal sodium-dependent bile acid transporter JOURNAL Am. J. Physiol. 274 (37) (1998) In press REFERENCE 3 (bases 1 to 3779) AUTHORS Dawson,P.A. TITLE Direct Submission JOURNAL Submitted (07-JUN-1994) Paul A. Dawson, Internal Medicine/Gastroenterology, Bowman Gray School of Medicine, Wake Forest University, Medical Center Boulevard, Winston-Salem, NC 27157, USA REFERENCE 4 (bases 1 to 3779) AUTHORS Dawson,P.A. TITLE Direct Submission JOURNAL Submitted (17-NOV-1997) Paul A. Dawson, Internal Medicine/Gastroenterology, Bowman Gray School of Medicine, Wake Forest University, Medical Center Boulevard, Winston-Salem, NC 27157, USA REMARK Sequence update by submitter FEATURES Location/Qualifiers source 1..3779 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pHISBT" /clone_lib="human ileal cDNA lambda gt10" /map="13q33" /chromosome="13" /tissue_type="ileum" /dev_stage="adult" 5'UTR 1..598 /gene="SLC10-A2" gene 1..3779 /gene="SLC10-A2" CDS 599..1645 /gene="SLC10-A2" /codon_start=1 /product="ileal sodium-dependent bile acid transporter" /db_xref="PID:g595399" /translation="MNDPNSCVDNATVCSGASCVVPESNFNNILSVVLSTVLTILLAL VMFSMGCNVEIKKFLGHIKRPWGICVGFLCQFGIMPLTGFILSVAFDILPLQAVVVLI IGCCPGGTASNILAYWVDGDMDLSVSMTTCSTLLALGMMPLCLLIYTKMWVDSGSIVI PYDNIGTSLVALVVPVSIGMFVNHKWPQKAKIILKIGSIAGAILIVLIAVVGGILYQS AWIIAPKLWIIGTIFPVAGYSLGFLLARIAGLPWYRCRTVAFETGMQNTQLCSTIVQL SFTPEELNVVFTFPLIYSIFQLAFAAIFLGFYVAYKKCHGKNKAEIPESKENGTEPES SFYKANGGFQPDEK" BASE COUNT 1117 a 737 c 799 g 1126 t ORIGIN 1 ttctattgaa agggaaatgg gagaacaata tgtgttccta tggctcagtc cctataagat 61 tctgtactat tcagagttga ttttaagtgt cacttaactg aaattatcca acaaaccttc 121 atggcatgaa acattaacac agctcttttt atatggcatg gttcctatgg ctcaatccct 181 ataagattct gtactagttc agagttgatt ttaaaagtca cttaactgaa attatccaac 241 aaaccctcga ggacattaaa cattaacgtg gctcttttta tatggcatgg ttcattatca 301 tgccaataaa tgattaatcg taactctctg tcttgaccaa taattttgct ggacttttgt 361 gattcacaac gtgctctgtg ttgtaatgct acctcttgaa actgacatcc tagctttatt 421 gttttttatt acttccctaa ggtggctttc aaaagagaca ccaagtgaca tatttttagg 481 aggggtttaa aagtttgatg gggtagaagt aaacgttgct taactcaacc agcagcagag 541 ccagggccca gggaccagcg cttctgtgga cttggccttt ccagcagcag acccagcaat 601 gaatgatccg aacagctgtg tggacaatgc aacagtttgc tctggtgcat cctgtgtggt 661 acctgagagc aatttcaata acatcctaag tgtggtccta agtacggtgc tgaccatcct 721 gttggccttg gtgatgttct ccatgggatg caacgtggaa atcaagaaat ttctagggca 781 cataaagcgg ccgtggggca tttgtgttgg cttcctctgt cagtttggaa tcatgcccct 841 cacaggattc atcctgtcgg tggcctttga catcctcccg ctccaggccg tagtggtgct 901 cattatagga tgctgccctg gaggaactgc ctccaatatc ttggcctatt gggtcgatgg 961 cgacatggac ctgagcgtca gcatgaccac atgctccaca ctgcttgccc tcggaatgat 1021 gccgctgtgc ctccttatct ataccaaaat gtgggtcgac tctgggagca tcgtaattcc 1081 ctatgataac ataggtacat ctctggttgc tctcgttgtt cctgtttcca ttggaatgtt 1141 tgttaatcac aaatggcccc aaaaagcaaa gatcatactt aaaattgggt ccatcgcggg 1201 cgccatcctc attgtgctca tagctgtggt tggaggaata ttgtaccaaa gcgcctggat 1261 cattgctccc aaactgtgga ttataggaac aatatttcct gtggcgggtt actccctggg 1321 gtttcttctg gctagaattg ctggtctacc ctggtacagg tgccgaacgg ttgcttttga 1381 aacggggatg cagaacacgc agctatgttc caccatcgtt cagctctcct tcactcctga 1441 ggagctcaat gtcgtattca ccttcccgct catctacagc attttccagc tcgcctttgc 1501 cgcaatattc ttaggatttt atgtggcata caagaaatgt catggaaaaa acaaggcaga 1561 aattccagag agcaaagaaa atggaacgga gccagagtca tcgttttata aggcaaatgg 1621 aggatttcaa cctgacgaaa agtagacatc aagtggacaa aacagacgag ttccaaatta 1681 cgttcttaaa ccgtaactat atttaattat ttgttttggt aggacagttg gcagaaaaga 1741 gttaaagtga aaattggaat ttcattggaa ttcatgtatt ggtttcagta ccaagtgact 1801 ggtggcccaa ttctttaatg ggacaaatat tgtttcctat atatatgtat atgttttata 1861 tatgtatgta tactcatata gatatattgt cattgaaata ttcccccaaa atattctcag 1921 actaaacctg acatagggaa caccgagaat gaaaacatcg ttaacaccaa aactgaattc 1981 ttatgcagaa tttcctagcc catagatgac aacctgagtt tctgtatgtt aaagtagatg 2041 taatgaatta ttattattac agtggtcacg attttcttca gtgtttatga ttataaaaat 2101 tgacatgaac atctttcact gacattttaa tcattatttt aaaagctttg caacctatat 2161 atttatataa ctttgtaata taacatgggc aaatatctga cttcagtatt tttaaaaagt 2221 tgccttctcc agtggcagtc caaaagcaga aatgagagga aattattaca aaatagaatt 2281 caataaccat attggatgca ggctcttaac tcagcaggga tatcgtacat ctattgctct 2341 acctcagggg tccagtgata cccactagat cttccaagga aaaacataat tctttcaaac 2401 ggtgtgtatt tggcaaagag ctcttcaaat ctgggagagg gacttcctca aggttttcct 2461 gtgtgcagtg gatccacata gctaatatga cagctagtca gttgacaggg accacccaca 2521 gtaagcacca tggtcaggga ggtggcagga ggtgcaaaga cagaagtatt gagagaaaca 2581 ccaagactct agtggaggaa ttaattcaat gggagatagt ataaaataca tagaaaacac 2641 aagtaacaga aacctggttg aaatgcttaa ctagagtcaa ttagatgtgc aggagtaagt 2701 agtataagaa gaatcaagtc cgagagtgat caggaaatga gtattaaaca gtatttgaaa 2761 cagagaacgt gtcccagggc ccaaaagtca gaagggcccc accagccagg aaagttgttt 2821 caatgctgta agtaggtgta gccaagggaa gccaggacta tctgatatac ggtagcaggg 2881 gtttacggct gccaggggaa aataactcat caagtgttgg actttcaatt ataagatcga 2941 atttaatttc ctttccctca ttctgcagca atcagaatac acaatcttaa ccactcggtc 3001 cttagtggtt ttgttccatt ttgcattggg tattttcact gcctcataga gtctatttca 3061 agtgtttggc tgaaagggct ttttgcattt gcatgttctg agttcagatt ctgctggtgc 3121 acccaagcat tatgggaaca ggaactcaac ttagctcttc cagtagaggg gtgagggatt 3181 ctgcttttca aattcataac attgatcttt ttatgcaaga tttccattta cagttgaata 3241 agtacttcat atttttccat cattagacaa atacaaaatg gactaaataa ttttaagaga 3301 tagtggaggc agcagggggt acagacttcc ttcttagaga gtgtcagaga atatgctccc 3361 aatggtggaa aggaagattt acagtctagc ggctaagtac ctcctacaca tttcccatca 3421 atcagaaaat agacaggtac actaaaggga cctgagaact cctcttgtaa tttcaacaca 3481 cccaaaatca agggcctgga tgccagcagc tgcagcaagc aggtttttcc tccctgttga 3541 gcaagacagg tgaggcaaga taggacttgg ctttcttaca tgatgcggta acttgtgact 3601 tgagtctttt tccctaattt gctagtggga agaaaaatag ctgagctttc taaaatgata 3661 gctctctatt tttaaatgaa tttgaaaagt cgattaaatt atgtatttta ttgcctctga 3721 gtatcatatt aaatgaatat tttattttaa aggcttaaat aaatgaaaat gatttttgt // LOCUS HSU10550 2156 bp mRNA PRI 08-APR-1995 DEFINITION Human Gem GTPase (gem) mRNA, complete cds. ACCESSION U10550 NID g762886 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2156) AUTHORS Maguire,J., Santoro,T., Jensen,P., Siebenlist,U., Yewdell,J. and Kelly,K. TITLE Gem: an induced, immediate early protein belonging to the Ras family JOURNAL Science 265 (5169), 241-244 (1994) MEDLINE 94294787 REFERENCE 2 (bases 1 to 2156) AUTHORS Kelly,K. TITLE Direct Submission JOURNAL Submitted (10-JUN-1994) Kathleen Kelly, Laboratory of Pathology, National Cancer Institute, Building 10, Room 2A-33, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..2156 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="clone 270-4" /cell_type="T-lymphocyte" /tissue_type="peripheral blood" gene 214..1104 /gene="gem" CDS 214..1104 /gene="gem" /codon_start=1 /function="GTPase" /product="Gem" /db_xref="PID:g544493" /translation="MTLNNVTMRQGTVGMQPQQQRWSIPADGRHLMVQKEPHQYSHRN RHSATPEDHCRRSWSSDSTDSVISSESGNTYYRVVLIGEQGVGKSTLANIFAGVHDSM DSDCEVLGEDTYERTLMVDGESATIILLDMWENKGENEWLHDHCMQVGDAYLIVYSIT DRASFEKASELRIQLRRARQTEDIPIILVGNKSDLVRCREVSVSEGRACAVVFDCKFI ETSAAVQHNVKELFEGIVRQVRLRRDSKEKNERRLAYQKRKESMPRKARRFWGKIVAK NNKNMAFKLKSKSCHDLSVL" polyA_site 2156 /note="9 A residues" BASE COUNT 631 a 462 c 509 g 554 t ORIGIN 1 agcgcagcac tccccgctcg ttggcccggg tatcccagcg cggacccacg cgatacgctg 61 acgccccgac gccgatccgg ccgagccaag taagggggac ggcccgagac ggagaaggga 121 gagagtggga gtttcccagc ccgcagaact ttcgaagttg agaagagaac ccctggaacg 181 tgcgctcagc actgggattt tctggactca acgatgactc tgaataatgt caccatgcgc 241 cagggcactg tgggcatgca gccacagcag cagcgctgga gcatcccagc tgatggcagg 301 catctgatgg tccagaaaga gccccaccag tacagccacc gcaaccgcca ttctgctacc 361 cctgaggacc actgccgccg aagctggtcc tctgactcca cagactcagt catctcctct 421 gagtcaggga acacctacta ccgagtggtg ctcatagggg agcagggggt gggcaagtcc 481 actctggcca acatctttgc aggtgtgcat gacagcatgg acagcgactg cgaggtgctg 541 ggagaagata catatgaacg aaccctgatg gttgatgggg aaagtgcaac gattatactc 601 ctggatatgt gggaaaataa gggggaaaat gaatggctcc atgaccactg catgcaggtc 661 ggggacgcat acctgattgt ctactcaatc acagaccgag cgagcttcga gaaggcatct 721 gagctgcgaa tccagctccg cagggcccgg cagacagagg acattcccat aattttggtt 781 ggcaacaaaa gtgacttagt gcggtgccga gaagtgtctg tatcagaagg gagagcctgt 841 gcagtggtgt ttgactgcaa gttcatcgag acctctgcag ctgtccagca caacgtgaag 901 gagctgtttg agggcattgt gcgacaggtg cgccttcggc gggacagcaa ggagaagaat 961 gaacggcggc tggcctacca gaaaaggaag gagagcatgc ccaggaaagc caggcgcttc 1021 tggggcaaga tcgtggccaa aaacaacaag aatatggcct tcaagctcaa gtccaaatcc 1081 tgccatgacc tctctgtact ctaggaaccc agggtcaccc agatgtccct ttgatggccc 1141 ttgttgaagg ccattgggac caataatcta tattagattg aatacttaag ttagatgtgg 1201 tttcccccat tgtagcaggg agctagcgta ttagccttgt gggcaacatg atgcatggga 1261 aatgaaagat ttttgtaaaa agtcagtatt tatttccagg aaaagcctga ccttgctatt 1321 tgaacaccca agactcttta gaggatgtgt ttggtgttca catgtgtttc ttctattttg 1381 gatagtaggg aagtaaagct tacaaagaat gcctagaaca agaacttttc atcattaaaa 1441 atttttccca gtgttctgat atgtgacttt gaggccaatg agtcataaac aaatataaga 1501 aagctgtcaa tgagtttctt caaaggaggg aaaactttct acgaatctaa gatccatgga 1561 gctagaattg tagaactagg ctcatcagaa tcgtgactat tattgctcca tcaaactgtg 1621 aaaagaaatg atgtggacct tgctggaaac aaaggcttag caaacaattt ttgttcaatg 1681 cccaccgaga catatagaat tgggaactga tacatgtgtc ccttataggc tcaaaaatta 1741 tatcttacaa tttcttattt agggggaaat tatttgaatc agattctatt tagtcaaacc 1801 accttttatg ttttattatt tttgaattca tggagccatc ataaaaatat ttttaaaatc 1861 agaattattg ataccctgta gtgcaaaatg tcaattttta atgtataatc agaagtctga 1921 attttcataa aacatatagc ataaaaacct ccagtacttt ggttgaccct tgtatgtcac 1981 agctctgctc tatttattat tattttgcaa aataaccatt ttaacatttg ataaagcata 2041 tttatgaaca tatttcttaa taagaaaaat atccatttta ttaccatttt ctatcttttt 2101 caaaatatgc aagtttttac ctatatgtct tataataaaa gaaataaaat atttga // LOCUS HSU10564 2194 bp mRNA PRI 06-FEB-1996 DEFINITION Human CDK tyrosine 15-kinase WEE1Hu (Wee1Hu) mRNA, complete cds. ACCESSION U10564 NID g699107 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2194) AUTHORS Watanabe,N., Broome,M. and Hunter,T. TITLE Regulation of the human WEE1Hu CDK tyrosine 15-kinase during the cell cycle JOURNAL EMBO J. 14 (9), 1878-1891 (1995) MEDLINE 95262628 REFERENCE 2 (bases 572 to 2194) AUTHORS Igarashi,M., Nagata,A., Jinno,S., Suto,K. and Okayama,H. TITLE Wee1(+)-like gene in human cells JOURNAL Nature 353 (6339), 80-83 (1991) MEDLINE 91351318 REFERENCE 3 (bases 1 to 2194) AUTHORS Watanabe,N. TITLE Direct Submission JOURNAL Submitted (10-JUN-1994) Nobumoto Watanabe, MBVL, Salk Institute for Biological Studies, 10010 N. Torrey Pines Rd., La Jolla, CA 92037, USA FEATURES Location/Qualifiers source 1..2194 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="clone 1E-12" /clone_lib="pCD plasmid expression library from S. Hanks" /cell_line="HeLa cells" misc_difference 1..571 /note="this sequence extends the Wee1 Hu mRNA sequence, GenBank Accession Number X62048, on the 5' end. The CDS is also extended by 214 codons on N-terminal end" /citation=[2] /replace="" conflict 119 /citation=[2] /replace="" CDS 254..2194 /codon_start=1 /product="WEE1Hu CDK tyrosine 15-kinase" /db_xref="PID:g699108" /translation="MSFLSRQQPPPPRRAGAACTLRQKLIFSPCSDCEEEEEEEEEEG SGHSTGEDSAFQEPDSPLPPARSPTEPGPERRRSPGPAPGSPGELEEDLLLPGACPGA DEAGGGAEGDSWEEEGFGSSSPVKSPAAPYFLGSSFSPVRCGGPGDASPRGCGARRAG EGRRSPRPDHPGTPPHKTFRKLRLFDTPHTPKSLLSKARGIDSSSVKLRGSSLFMDTE KSGKREFDVRQTPQVNINPFTPDSLLLHSSGQCRRRKRTYWNDSCGEDMEASDYELED ETRPAKRITITESNMKSRYTTEFHELEKIGSGEFGSVFKCVKRLDGCIYAIKRSKKPL AGSVDEQNALREVYAHAVLGQHSHVVRYFSAWAEDDHMLIQNEYCNGGSLADAISENY RIMSYFKEAELKDLLLQVGRGLRYIHSMSLVHMDIKPSNIFISRTSIPNAASEEGDED DWASNKVMFKIGDLGHVTRISSPQVEEGDSRFLANEVLQENYTHLPKADIFALALTVV CAAGAEPLPRNGDQWHEIRQGRLPRIPQVLSQEFTELLKVMIHPDPERRPSAMALVKH SVLLSASRKSAEQLRIELNAEKFKNSLLQKELKKAQMAKAAAEERALFTDRMATRSTT QSNRTSRLIGKKMNRSVSLTIY" conflict 447 /citation=[2] /replace="a" conflict 1681 /citation=[2] /replace="g" BASE COUNT 539 a 584 c 615 g 456 t ORIGIN 1 ctgagactgg acctgaggag acctcagcct cggtgctcgg gccgccccgc ctctgccgga 61 aagtccgcgc cgccgctgcc gccaccgtcc gcagcccgag cgccccggag ccgcaggccg 121 ccgccgcgca gagacgccgc ggctgcgact aggcgcgccc agccgcacgt ggcggacccg 181 cccccaggcc cgcagtgtcc tggaccccgc aggcctccgc tctcctgtcc tcggccccgt 241 ccccagggcc gcgatgagct tcctgagccg acagcagccg ccgccacccc gccgcgccgg 301 ggcggcctgc accttgcggc agaagctgat cttctcgccc tgcagcgact gtgaggagga 361 ggaagaagag gaggaggagg agggcagcgg ccacagcacc ggggaggact cggcctttca 421 agagcccgac tcgccgctgc cgcccgcgcg gagccccacg gagcccgggc ccgagcgccg 481 ccgctcgccc gggccggccc ccggcagccc cggcgagctg gaggaggacc tgttgctgcc 541 cggcgcctgc ccgggcgcgg acgaggcggg cggtggggcg gagggcgact cgtgggagga 601 ggagggcttc ggctcctcgt cgccggtcaa gtcgccggcg gccccctact tcctgggtag 661 ctctttctcg ccggtgcgct gcggcggccc aggagatgcg tcgccgcggg gttgcggggc 721 gcgccgggcg ggcgaaggcc gccgctcgcc gcggccggac cacccgggca ccccgccaca 781 caagaccttc cgcaagctgc gactcttcga caccccgcac acgcccaaga gtttgctctc 841 caaagctcgg ggaattgatt ccagctctgt taaactccgg ggtagttctc tcttcatgga 901 tacagaaaaa tcaggaaaaa gggaatttga tgtgcgacag actcctcaag tgaatattaa 961 tccttttact ccggattctt tgttgcttca ttcctcagga cagtgtcgtc gtagaaagag 1021 aacgtattgg aatgattcct gtggtgaaga catggaagcc agtgattatg agcttgaaga 1081 tgaaacaaga cctgctaaga gaattacaat tactgaaagc aatatgaagt cccggtatac 1141 aacagaattt catgagctag agaaaatcgg ctctggagaa tttggttctg tatttaagtg 1201 tgtgaagagg ctggatggat gcatttatgc cattaagcga tcaaaaaagc cattggcggg 1261 ctctgttgat gagcagaacg ctttgagaga agtatatgct catgcagtgc ttggacagca 1321 ttctcatgta gttcgatatt tctctgcgtg ggcagaagat gatcatatgc ttatacagaa 1381 tgaatattgt aatggtggaa gtttagctga tgctataagt gaaaactaca gaatcatgag 1441 ttactttaaa gaagcagagt tgaaggatct ccttttgcaa gttggccgag gcttgaggta 1501 tattcattca atgtctttgg ttcacatgga tataaaacct agtaatattt tcatatctcg 1561 aacctcaatc ccaaatgctg cctctgaaga aggagacgaa gatgattggg catccaacaa 1621 agttatgttt aaaataggtg atcttgggca tgtaacaagg atctccagtc cacaagttga 1681 agagggcgat agtcgttttc ttgcaaatga agttttacag gagaattata cccatctacc 1741 aaaagcagat atttttgcgc ttgccctcac agtggtatgt gctgctggtg ctgaacctct 1801 tccgagaaat ggagatcaat ggcatgaaat cagacagggt agattacctc ggataccaca 1861 agtgctttcc caagaattta cagagttgct aaaagttatg attcatccag atccagagag 1921 aagaccttca gcaatggcac tggtaaagca ttcagtattg ctgtccgctt ctagaaagag 1981 tgcagaacaa ttacgaatag aattgaatgc cgaaaagttc aaaaattcac ttttacaaaa 2041 agaactcaag aaagcacaga tggcaaaagc tgcagctgag gaaagagcac tcttcactga 2101 ccggatggcc actaggtcca ccacccagag taatagaaca tctcgactta ttggaaagaa 2161 aatgaaccgc tctgtcagcc ttactatata ctga // LOCUS HSU10685 3510 bp DNA PRI 23-JUN-1995 DEFINITION Human MAGE-10 antigen (MAGE10) gene, complete cds. ACCESSION U10685 NID g533510 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3510) AUTHORS De Plaen,E., Arden,K., Traversari,C., Gaforio,J.J., Szikora,J.P., De Smet,C., Brasseur,F., van der Bruggen,P., Lethe,B., Lurquin,C. Brasseur,R., Chomez,P., De Backer,O., Cavenee,W. and Boon,T. TITLE Structure, chromosomal localization, and expression of 12 genes of the MAGE family JOURNAL Immunogenetics 40 (5), 360-369 (1994) MEDLINE 95012457 REFERENCE 2 (bases 1 to 3510) AUTHORS De Plaen,E. TITLE Direct Submission JOURNAL Submitted (14-JUN-1994) Etienne De Plaen, Ludwig Institute for Cancer Research, 74 Avenue Hippocrate, Brussels, 1200, Belgium FEATURES Location/Qualifiers source 1..3510 /organism="Homo sapiens" /isolate="patient MZ2" /db_xref="taxon:9606" /chromosome="X" /sex="female" /cell_type="lymphocyte" /tissue_type="blood" /dev_stage="adult" exon 1741..1814 /number=2 exon 1890..>3065 /number=3 gene 1955..3064 /gene="MAGE10" CDS 1955..3064 /gene="MAGE10" /codon_start=1 /product="MAGE-10 antigen" /db_xref="PID:g533511" /translation="MPRAPKRQRCMPEEDLQSQSETQGLEGAQAPLAVEEDASSSTST SSSFPSSFPSSSSSSSSSCYPLIPSTPEEVSADDETPNPPQSAQIACSSPSVVASLPL DQSDEGSSSQKEESPSTLQVLPDSESLPRSEIDEKVTDLVQFLLFKYQMKEPITKAEI LESVIKNYEDHFPLLFSEASECMLLVFGIDVKEVDPTGHSFVLVTSLGLTYDGMLSDV QSMPKTGILILILSIIFIEGYCTPEEVIWEALNMMGLYDGMEHLIYGEPRKLLTQDWV QENYLEYRQVPGSDPARYEFLWGPRAHAEIRKMSLLKFLAKVNGSDPRSFPLWYEEAL KDEEERAQDRIATTDDTTAMASASSSATGSFSYPE" BASE COUNT 886 a 892 c 909 g 823 t ORIGIN 1 cagggagatg gtggctttgg cgtgcaagac ccatacacga ttcagcagga gggaaaggct 61 gggctgtcgg gagtaaatct gaatacctgg aggacaccca aataaaggaa gtccccgtct 121 tgtccccctc ccctgcccac cacccccccc ccccccgcca aatgtctgct ccttctgtca 181 gctttgggaa tcccatgcag gtgtgatcgt gtggtgcccc tccccacttc tgcctgccgg 241 gtctcaggga ggtgaggacc ttggtctgag ggttgctaag aagttattac agggttccac 301 acttggtcaa cagagggagg agtcccagaa tctgcaggac ccaaggggtg cccccttagt 361 gaggactgga ggtacctgca gcccagaaag aagggatgtc acagagtctg gctgtcccct 421 gttcttagct ctgaggggac ctgatcagga ttggcactaa gtggcaagct caattttacc 481 acaggcagga agatgaggaa ccctcaggga aatggagttt tggtgtaaag gggagatatc 541 agccctggac accccacagg gatgacagga tgtggctcct tcttactttt gttttggaat 601 ctcagggagg tgagaacctt gctctcagag ggtgactcaa gtcaacacag ggaacccctc 661 ttttctacag acacagtggg tcgcaggatc tgacaagagt ccaggtaagg aacctgaggg 721 aaatctgagg gtacccccag cccataacac agatggggtc cccacagaaa tctgccatga 781 ccctactgtc actctggaga acccagtcag ggctgtccgc tgagtctccc tgtcttatac 841 aaggatcact ggtctctggg agggagaggt gttggtctaa gggagctgca ctcgggtcag 901 cagagggagg gtcccagacc ctgccaggag tcaaggtgag gactgagggg acaccattct 961 ccaaacgcac aggactcagc cccaccctac cccttctgtc agccacggga attcatgggg 1021 aactgggggt agatggactc ccctcacttc ctctttccat gtctcctgga ggtaggacct 1081 tggtttaagg aagtggcctc agatcaacaa agggagggtc ccaggtcgta tcaggcatca 1141 agaagaggac caagcaggct cctcacccca gtacacatgg acccagctga atatggccac 1201 ctcttgctgt cttttctggg aggacctctg cagttgtggc cagatgtggg tcccctcatg 1261 tcttctattt cgtatcaggg atgtaagctt ttgatctgag agtttcttag accagcaaag 1321 gagcagggtc taggcttttc caggagaaag gtgagagccc cacgtgagca cagaggctcc 1381 ccaccccagg gtagtgggga actcacagag tccagcccac cctcctgaca acactgggag 1441 gctggggctg tgcttgcagc ctgaaccctg agggcccctc aattcctctt tcaggagctc 1501 cagggactgt gaggtgaggc cttggtctaa ggcagtgttt tcaggtcaca gagcagaaag 1561 ggcccagaca gtgccaggag tcaaggtgag gtgcatgccc tgaatgtgta ccaagggccc 1621 cacctgctcc aggacaaagt ggaccccact gcatcagctc cacctaccct actgtcagtc 1681 ctggagcctt ggcctctgcc ggctgcatcc tgaggagcca tctctcactt ccttcttcag 1741 gttctcaggg gacagggaga gcaagaggtc aagagctgtg ggacaccaca gagcagcact 1801 gaaggagaag acctgtaagt tggcctttgt tagaacctcc agggtgtggt tctcagctgt 1861 ggccacttac accctccctc tctccccagg cctgtgggtc cccatcgccc aagtcctgcc 1921 cacactccca cctgctaccc tgatcagagt catcatgcct cgagctccaa agcgtcagcg 1981 ctgcatgcct gaagaagatc ttcaatccca aagtgagaca cagggcctcg agggtgcaca 2041 ggctcccctg gctgtggagg aggatgcttc atcatccact tccaccagct cctcttttcc 2101 atcctctttt ccctcctcct cctcttcctc ctcctcctcc tgctatcctc taataccaag 2161 caccccagag gaggtttctg ctgatgatga gacaccaaat cctccccaga gtgctcagat 2221 agcctgctcc tccccctcgg tcgttgcttc ccttccatta gatcaatctg atgagggctc 2281 cagcagccaa aaggaggaga gtccaagcac cctacaggtc ctgccagaca gtgagtcttt 2341 acccagaagt gagatagatg aaaaggtgac tgatttggtg cagtttctgc tcttcaagta 2401 tcaaatgaag gagccgatca caaaggcaga aatactggag agtgtcataa aaaattatga 2461 agaccacttc cctttgttgt ttagtgaagc ctccgagtgc atgctgctgg tctttggcat 2521 tgatgtaaag gaagtggatc ccactggcca ctcctttgtc cttgtcacct ccctgggcct 2581 cacctatgat gggatgctga gtgatgtcca gagcatgccc aagactggca ttctcatact 2641 tatcctaagc ataatcttca tagagggcta ctgcacccct gaggaggtca tctgggaagc 2701 actgaatatg atggggctgt atgatgggat ggagcacctc atttatgggg agcccaggaa 2761 gctgctcacc caagattggg tgcaggaaaa ctacctggag taccggcagg tgcctggcag 2821 tgatcctgca cggtatgagt ttctgtgggg tccaagggct catgctgaaa ttaggaagat 2881 gagtctcctg aaatttttgg ccaaggtaaa tgggagtgat ccaagatcct tcccactgtg 2941 gtatgaggag gctttgaaag atgaggaaga gagagcccag gacagaattg ccaccacaga 3001 tgatactact gccatggcca gtgcaagttc tagcgctaca ggtagcttct cctaccctga 3061 ataaagtaag acagattctt cactgtgttt taaaaggcaa gtcaaatacc acatgatttt 3121 actcatatgt ggaatctaaa aaaaaaaaaa aaaaaagttg gtatcatgga agtagagagt 3181 agagcagtag ttacattaca attaaatagg aggaataagt tctagtgttc tattgcacag 3241 taggatgact atagttaaca ttaagatatt gtatattaca aaacagctag aaggaaggct 3301 tttcaatatt gtcaccaaaa agaaatgata aatgcatgag gtgatggata cactacctga 3361 tttgatcatt atactacata tacatgaatc agaacatcaa attgtacctc ataaatatct 3421 acaattacat gtcagttttt gtttatgttt ttgttttttt ttaatttatg aaaacaaatg 3481 agaatggaaa tcaatgatgt atgtggtgga // LOCUS HSU10686 3672 bp DNA PRI 23-JUN-1995 DEFINITION Human MAGE-11 antigen (MAGE11) gene, complete cds. ACCESSION U10686 NID g533512 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3672) AUTHORS De Plaen,E., Arden,K., Traversari,C., Gaforio,J.J., Szikora,J.P., De Smet,C., Brasseur,F., van der Bruggen,P., Lethe,B., Lurquin,C. Brasseur,R., Chomez,P., De Backer,O., Cavenee,W. and Boon,T. TITLE Structure, chromosomal localization, and expression of 12 genes of the MAGE family JOURNAL Immunogenetics 40 (5), 360-369 (1994) MEDLINE 95012457 REFERENCE 2 (bases 1 to 3672) AUTHORS De Plaen,E. TITLE Direct Submission JOURNAL Submitted (14-JUN-1994) Etienne De Plaen, Ludwig Institute for Cancer Research, 74 Avenue Hippocrate, Brussels, 1200, Belgium FEATURES Location/Qualifiers source 1..3672 /organism="Homo sapiens" /isolate="patient MZ2" /db_xref="taxon:9606" /chromosome="X" /sex="female" /cell_type="lymphocyte" /tissue_type="blood" /dev_stage="adult" exon 1806..1879 /number=2 exon 1955..>3246 /number=3 gene 2019..2978 /gene="MAGE11" CDS 2019..2978 /gene="MAGE11" /codon_start=1 /product="MAGE-11 antigen" /db_xref="PID:g533513" /translation="MPLEQRSQHCKPEEGLQAQEEDLGLVGAQALQAEEQEAAFFSST LNVGTLEELPAAESPSPPQSPQEESFSPTAMDAIFGSLSDEGSGSQEKEGPSTSPDLI DPESFSQDILHDKIIDLVHLLLRKYRVKGLITKAEMLGSVIKNYEDYFPEIFREASVC MQLLFGIDVKEVDPTSHSYVLVTSLNLSYDGIQCNEQSMPKSGLLIIVLGVIFMEGNC IPEEVMWEVLSIMGVYAGREHFLFGEPKRLLTQNWVQEKYLVYRQVPGTDPACYEFLW GPRAHAETSKMKVLEYIANANGRDPTSYPSLYEDALREEGEGV" BASE COUNT 907 a 897 c 973 g 895 t ORIGIN 1 agtccaggat ctgccagtag tcaaggagag gaaaattgat gaagactgaa ggtaagaatg 61 taccctccca catgccaaag aaaaagggac ctcaccaatc cttgcttcct ctgttttcat 121 ccctcggagg cccaagttgg ggaggcatgt gccatgctca catttctgcc acgaggttgg 181 gggtggcacc ttgctcaggg aggtgagcac cgttgtttca agggggtgat gacaggtcag 241 caggtggagc cacacctgat cagcagaggg aggagtccca ggatctttag gactcaaggt 301 gtatgtgtcc ccttggtgag gactggagag cccacatccc ataatgaagg gatcccacag 361 agtctctctg tccccatgtc cttggctgtg tggggacctc atcacgggtg gccccaagtg 421 gcaaggtcac ttgtaccaca ggcagaaagt tgggaaacct tcagggagat gaggtcttgg 481 tgtaaaggga tatgtctgct catctcaggg gttgggagtc aaggaaggac aggccctggc 541 agaagtaaag atgaaaaacc cacaggagga ctttggaatc cccagaaccg aagggtccag 601 cctctgctgt cagccctgga caaccacatg atggggtgat gggacgtggg gccccttact 661 tctgttttgg aatcttgggc aggtgagcac tatgttctca gaggacgact tccagtcaac 721 agaaagagcc ccatatggtc cacaactaca gtggtcccag gatctgccaa gagtccaggt 781 gagaaacctg agggaggatt gagggttcct cctggccaga acacagaggg ctgcttagaa 841 atctgctctg cccctgctgt ctccccagag agcatgtgca ggactatgtg ctgagacccc 901 tctcttatac tgggatcatt ggtctcaggg agcgggagac attggtctga gagggctgca 961 cttaggtcag cagtgggagg gtcccaggcc atgaccagaa tcaaggtggg ggctgacggg 1021 acagcactta ccaaaaacat gggactcagc ccttccctgc cccttctgtc agctatggga 1081 agtccctggg accatgggtg tttctatttc cctgatttcc tcttctgata tctcctggag 1141 gtagagcttt ggtttaagga gatggcgtca ggtcaacaga gggagggtcc caggccaaga 1201 taggcatcaa gatgggaacc aaacaggctc cttacccgag gacacatgga ccctgctgac 1261 tgtcaccatc tcttgctgtc cttcctgggt agccctgtgt acatgtggcc agatgtgtat 1321 ccccacatgt cctctttcat atcaggaaag agctattgat ctgagagttt ctcaggtcag 1381 gagagctgtg tcttccaggc cctggcagga gaaaggtgag ggccctgagc acagagggga 1441 ccatccactc caaaaaagtg agaaactcac agagtttggc acacctttct gacagtgctg 1501 gggtgccagg atgggtgctt gcagtctgca gcctgatggc cccatgattc ctcttctaga 1561 agctccaaaa actgagcagt gaggccttgg tctcaagcaa tgtcttcaga tctcagaaca 1621 caggaagcct aggcagtgcc agtagtcaag atgagatgtt cacccttaat ctacaaatgg 1681 ccccacctgc cccagtacag aaagggaccc ccagcttgca acctcacctg ccctacctca 1741 gtcctggagc ctcctgctct gatgtccagc tgcatcttga gcagccttct cacttccttt 1801 ttcaggtttt tagagaacag gccaacctgg aggacaggag tcccaggaga acccagagga 1861 tcactggagg agaacaagtg taagtaggcc tttgttagat tctccatggt tcatatctca 1921 tctgagtctg ttctcacgct ccctctctcc ccaggctgtg gggccccatc acccagatat 1981 ttcccacagt tcggcctgct gacctaacca gagtcatcat gcctcttgag caaagaagtc 2041 agcactgcaa gcctgaggaa ggccttcagg cccaagaaga agacctgggc ctggtgggtg 2101 cacaggctct ccaagctgag gagcaggagg ctgccttctt ctcctctact ctgaatgtgg 2161 gcactctaga ggagttgcct gctgctgagt caccaagtcc tccccagagt cctcaggaag 2221 agtccttctc tcccactgcc atggatgcca tctttgggag cctatctgat gagggctctg 2281 gcagccaaga aaaggagggg ccaagtacct cgcctgacct gatagaccct gagtcctttt 2341 cccaagatat actacatgac aagataattg atttggttca tttattgctc cgcaagtatc 2401 gagtcaaggg gctgatcaca aaggcagaaa tgctggggag tgtcatcaaa aattatgagg 2461 actactttcc tgagatattt agggaagcct ctgtatgcat gcaactgctc tttggcattg 2521 atgtgaagga agtggacccc actagccact cctatgtcct tgtcacctcc ctcaacctct 2581 cttatgatgg catacagtgt aatgagcaga gcatgcccaa gtctggcctc ctgataatag 2641 tcctgggtgt aatcttcatg gaggggaact gcatccctga agaggttatg tgggaagtcc 2701 tgagcattat gggggtgtat gctggaaggg agcacttcct ctttggggag cccaagaggc 2761 tccttaccca aaattgggtg caggaaaagt acctggtgta ccggcaggtg cccggcactg 2821 atcctgcatg ctatgagttc ctgtggggtc caagggccca cgctgagacc agcaagatga 2881 aagttcttga gtacatagcc aatgccaatg ggagggatcc cacttcttac ccatccctgt 2941 atgaagatgc tttgagagag gagggagagg gagtctgagc atgagatgca accagggcca 3001 gcgggcaggg aaatgggcca atgcatgctt cagggccaca cccagcagtt tccctgtcct 3061 gtgtgaaatc aggcccattc ttccctctgt gtttgatgag agaagtcagt gttctcagta 3121 gtagaaggca cagtgaatgg aagggaacac attgtatact gcctttaggt ttctcttcca 3181 tcgggtgact tggagatttg tttttgtttc cctttggtaa ttttcaaata ttgttcctgt 3241 aataaaagtt ttagttagct tcaacatcta agtgtatgga tgatactgac cacacatgtt 3301 gttttgctta tccatttcaa gtgcaagtgt ttgccatttt gtaaaacatt ttgggaaatc 3361 ttccatcttg ctgtgatttg caataggtat tttcttggag aatgtaagaa cttaacaata 3421 aagctgaact ggtgttgtga aacagagaaa taaaaggaga aggtcattaa ttcttgtctt 3481 cttatccata ttaatctgtt gttctatgaa agtacacacc catacacaca tgtacacccc 3541 cctcccccca catacatatt caccaaggaa atgcagtttc ctactgagtt gcagattctc 3601 tgagatgtcc tggacaataa aaaatattcc aaagtagaga gtggtagcac cgtggggtca 3661 cagtaatact ag // LOCUS HSU10693 3839 bp DNA PRI 23-JUN-1995 DEFINITION Human MAGE-8 antigen (MAGE8) gene, complete cds. ACCESSION U10693 NID g533525 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3839) AUTHORS De Plaen,E., Arden,K., Traversari,C., Gaforio,J.J., Szikora,J.P., De Smet,C., Brasseur,F., van der Bruggen,P., Lethe,B., Lurquin,C. Brasseur,R., Chomez,P., De Backer,O., Cavenee,W. and Boon,T. TITLE Structure, chromosomal localization, and expression of 12 genes of the MAGE family JOURNAL Immunogenetics 40 (5), 360-369 (1994) MEDLINE 95012457 REFERENCE 2 (bases 1 to 3839) AUTHORS De Plaen,E. TITLE Direct Submission JOURNAL Submitted (14-JUN-1994) Etienne De Plaen, Ludwig Institute for Cancer Research, 74 Avenue Hippocrate, Brussels, 1200, Belgium FEATURES Location/Qualifiers source 1..3839 /organism="Homo sapiens" /isolate="patient MZ2" /db_xref="taxon:9606" /chromosome="X" /sex="female" /cell_type="lymphocyte" /tissue_type="blood" /dev_stage="adult" exon 1996..2057 /number=2 exon 2133..>3746 /number=3 gene 2196..2900 /gene="MAGE8" CDS 2196..2900 /gene="MAGE8" /codon_start=1 /product="MAGE-8 antigen" /db_xref="PID:g533526" /translation="MLLGQKSQRYKAEEGLQAQGEAPGLMDVQIPTAEEQKAASSSST LIMGTLEEVTDSGSPSPPQSPEGASSSLTVTDSTLWSQSDEGSSSNEEEGPSTSPDPA HLESLFREALDEKVAELVRFLLRKYQIKEPVTKAEMLESVIKNYKNHFPDIFSKASEC MQVIFGIDVKEVDPAGHSYILVTCLGLSYDGLLGDDQSTPKTGLLIIVLGMILMEGSR APEEAIWEALSVMGAV" BASE COUNT 922 a 959 c 1064 g 894 t ORIGIN 1 agtctcagat cactggagag aggtgcccca gagcccttaa ggaggactca gcagacctcc 61 catcatggcc taggaaacct gctcccactc tcaggtctgg gcacccaagg caggacagtg 121 gggaagggat gtggcccccc cactttctgg taggggggcc tcaaggagat ggtggccttg 181 gcatgcaaga cacatccacg gttcagcagg aaggaaaggg ccatgccttg tcgtggagta 241 aatatgaata cctggatgac acccagacag agaaagaccc catgaaacct actacttctg 301 tcagccgtgg gaatcccatg cagggttgtc catgtagtgc ctccttactt ctgcctcctg 361 ggtctcaggg aggtagcaac ctgggtctga agggcgtcct cagctcagca gagggagcca 421 cacctgttca acagagggac ggggtcacag gatctgcagg acccaagatg tgctcacttt 481 gtgatgaatg ggggtactcc tggcctggaa agaagggacc ccacaaagtc tggctaactt 541 tggttattat ctctggggga acccgatcaa gggtggccct aagtggagat ctcatctgta 601 ctgtgggcag gaagttgggg aaacgcagga agataaggtc ttggtggtaa ggggagatgt 661 ctgctcatat cagggtgttg tgggttgagg aagggcgggc tccatcaggg gaaagatgaa 721 taaccccctg aagaccttag aacccaccac tcaagaacaa gtagggacag atcctagtgt 781 cacccctgga caccccaccc agtggtcatc agatgtggtg gctcctcatt tctctcttga 841 gtctcaggga agtgaggacc ttgttctcag agggcaactc aggacaaaac agggaccccc 901 atgtgggcaa cagactcagt ggtccaagaa tctaccaaga gtctaggtga caacactgag 961 ggaagattga gggtaccctc gatggttctc ctagcaggca aaaaacagat gggggcccaa 1021 cagaaatctg cccggcctct tttgtcaccc ctgagagcat gagcaggact atcagctgag 1081 gcccctgtgt tataccagac tcattggtct cagggagaag aaggccttgg tctgagggca 1141 ctgcattcag gtcagcagag cgggggtcca aggccctgcc aggagtcagg gactcagagg 1201 acaccactca ccaaacacac aggaccgaac cccaccctgc accttctgtc agccatggga 1261 agtgcaggga aaggtgggtg gatggaatcc cctcatttgc tcttccagtg tctcctggag 1321 ataggtcctt ggattaagga agtggcctca ggtcagccca ggacacatgg gccccaatgt 1381 attttgtgta gctattgctt ttttctcacc ctaggacaga cacgtgggcc ccattgcatt 1441 ttgtgtagct attgcttttt tcccaggagg ccttgggcat gtggggccag atgtgggtcc 1501 cttcatatcc ttgtcttcca tatcagggat ataaactctt gatctgaaag tttctcaggc 1561 cagcaaaagg gccagatcca ggccctgcca ggagaaagat gagggccctg aatgagcaca 1621 gaaaggacca tccacacaaa atagtgggga gctcacagag tcaggctcac cctcctgaca 1681 gcactggggt gctggggctg tgcttgcagt ctgcagcctg agttcccctc gatttatctt 1741 ctaggagctc caggaaccag gctgtgaggt cttggtctga ggcagtatct tcaatcacag 1801 agcataagag gcccaggcag tagtagcagt caagctgagg tggtgtttcc cctgtatgta 1861 taccagaggc ccctctggca tcagaacagc aggaacccca cagttcctgg ccctaccagc 1921 ccttttgtca gtcctggagc cttggccttt gccaggaggc tgcaccctga gatgccctct 1981 caatttctcc ttcaggttcg cagagaacag gccagccagg aggtcaggag gccccagaga 2041 agcactgaag aagacctgta agtagacctt tgttagggca tccagggtgt agtacccagc 2101 tgaggcctct cacacgcttc ctctctcccc aggcctgtgg gtctcaattg cccagctccg 2161 gcccacactc tcctgctgcc ctgacctgag tcatcatgct tcttgggcag aagagtcagc 2221 gctacaaggc tgaggaaggc cttcaggccc aaggagaggc accagggctt atggatgtgc 2281 agattcccac agctgaggag cagaaggctg catcctcctc ctctactctg atcatgggaa 2341 cccttgagga ggtgactgat tctgggtcac caagtcctcc ccagagtcct gagggtgcct 2401 cctcttccct gactgtcacc gacagcactc tgtggagcca atccgatgag ggttccagca 2461 gcaatgaaga ggaggggcca agcacctccc cggacccagc tcacctggag tccctgttcc 2521 gggaagcact tgatgagaaa gtggctgagt tagttcgttt cctgctccgc aaatatcaaa 2581 ttaaggagcc ggtcacaaag gcagaaatgc ttgagagtgt catcaaaaat tacaagaacc 2641 actttcctga tatcttcagc aaagcctctg agtgcatgca ggtgatcttt ggcattgatg 2701 tgaaggaagt ggaccctgcc ggccactcct acatccttgt cacctgcctg ggcctctcct 2761 atgatggcct gctgggtgat gatcagagta cgcccaagac cggcctcctg ataatcgtcc 2821 tgggcatgat cttaatggag ggcagccgcg ccccggagga ggcaatctgg gaagcattga 2881 gtgtgatggg ggctgtatga tgggagggag cacagtgtct attggaagct caggaagctg 2941 ctcacccaag agtgggtgca ggagaactac ctggagtacc gccaggcgcc cggcagtgat 3001 cctgtgcgct acgagttcct gtggggtcca agggcccttg ctgaaaccag ctatgtgaaa 3061 gtcctggagc atgtggtcag ggtcaatgca agagttcgca tttcctaccc atccctgcat 3121 gaagaggctt tgggagagga gaaaggagtt tgagcaggag ttgcagctag ggccagtggg 3181 gcaggttgtg ggagggcctg ggccagtgca cgttccaggg ccacatccac cactttccct 3241 gctctgttac atgaggccca ttcttcactc tgtgtttgaa gagagcagtc acagttctca 3301 gtagtgggga gcatgttggg tgtgagggaa cacagtgtgg accatctctc agttcctgtt 3361 ctattgggcg atttggaggt ttatctttgt ttccttttgg aattgttcca atgttccttc 3421 taatggatgg tgtaatgaac ttcaacattc attttatgta tgacagtaga cagacttact 3481 gctttttata tagtttagga gtaagagtct tgcttttcat ttatactggg aaacccatgt 3541 tatttcttga attcagacac tacaagagca gaggattaag gtttttttag aaatgtgaaa 3601 caacatagca gtaaaataca tgagataaag acataaagaa attaaacaat agttaattct 3661 tgccttacct gtacctctta gtgtacccta tgtacctgaa tttgcttggc ttctttgaga 3721 atgaaattga attaaatatg aataaataag tccccctgct cactggctca ttttttccca 3781 aaatattcat tgagcttccg ctatttggaa ggccctgggt tagtattgga gatgctaca // LOCUS HSU10860 2212 bp mRNA PRI 11-JAN-1995 DEFINITION Human guanosine 5'-monophosphate synthase mRNA, complete cds. ACCESSION U10860 NID g595409 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2212) AUTHORS Hirst,M., Haliday,E., Nakamura,J. and Lou,L. TITLE Human GMP synthetase. Protein purification, cloning, and functional expression of cDNA JOURNAL J. Biol. Chem. 269 (38), 23830-23837 (1994) MEDLINE 94375496 REFERENCE 2 (bases 1 to 2212) AUTHORS Lou,L. TITLE Direct Submission JOURNAL Submitted (14-JUN-1994) Lillian Lou, Syntex Discovery Research, 3401 Hillview Avenue, Mail Stop S3-1, Palo Alto, CA 94303, USA FEATURES Location/Qualifiers source 1..2212 /organism="Homo sapiens" /note="custom cDNA library made by Stratagene (La Jolla, CA) using poly(A)+ RNA from A3.01 cells, catalog number 836201" /db_xref="taxon:9606" /clone="GMPS.6" /clone_lib="Strategene Lambda-ZAP-II" /cell_line="A3.01" /cell_type="T-lymphoblastoma" mRNA <1..>2212 /product="guanosine 5'-monophosphate synthetase" CDS 123..2204 /standard_name="GMP synthase" /EC_number="6.3.5.2" /note="located in the cytosol; a glutamine amidotransferase (G-type)" /codon_start=1 /function="de novo synthesis of GMP" /product="guanosine 5'-monophosphate synthetase" /db_xref="PID:g595410" /translation="MALCNGDSKLENAGGDLKDGHHHYEGAVVILDAGAQYGKVIDRR VRELFVQSEIFPLETPAFAIKEQGFRAIIISGGPNSVYAEDAPWFDPAIFTIGKPVLG ICYGMQMMNKVFGGTVHKKSVREDGVFNISVDNTCSLFRGLQKEEVVLLTHGDSVDKV ADGFKVVARSGNIVAGIANESKKLYGAQFHPEVGLTENGKVILKNFLYDIAGCSGTFT VQNRELECIREIKERVGTSKVLVLLSGGVDSTVCTALLNRALNQEQVIAVHIDNGFMR KRESQSVEEALKKLGIQVKVINAAHSFYNGTTTLPISDEDRTPRKRISKTLNMTTSPE EKRKIIGDTFVKIANEVIGEMNLKPEEVFLAQGTLRPDLIESASLVASGKAELIKTHH NDTELIRKLREEGKVIEPLKDFHKDEVRILGRELGLPEELVSRHPFPGPGLAIRVICA EEPYICKDFPETNNILKIVADFSASVKKPHTLLQRVKACTTEEDQEKLMQITSLHSLN AFLLPIKTVGVQGDCRSYSYVCGISSKDEPDWESLIFLARLIPRMCHNVNRVVYIFGP PVKEPPTDVTPTFLTTGVLSTLRQADFEAHNILRESGYAGKISQMPVILTPLHFDRDP LQKQPSCQRSVVIRTFITSDFMTGIPATPGNEIPVEVVLKMVTEIKKIPGISRIMYDL TSKPPGTTEWE" BASE COUNT 661 a 455 c 515 g 581 t ORIGIN 1 tgccggctgc tcctcgacca ggcctccttc tcaacctcag cccgcggcgc cgacccttcc 61 ggcaccctcc cgccccgtct cgtactgtcg ccgtcaccgc cgcggctccg gccctggccc 121 cgatggctct gtgcaacgga gactccaagc tggagaatgc tggaggagac cttaaggatg 181 gccaccacca ctatgaagga gctgttgtca ttctggatgc tggtgctcag tacgggaaag 241 tcatagaccg aagagtgagg gaactgttcg tgcagtctga aattttcccc ttggaaacac 301 cagcatttgc tataaaggaa caaggattcc gtgctattat catctctgga ggacctaatt 361 ctgtgtatgc tgaagatgct ccctggtttg atccagcaat attcactatt ggcaagcctg 421 ttcttggaat ttgctatggt atgcagatga tgaataaggt atttggaggt actgtgcaca 481 aaaaaagtgt cagagaagat ggagttttca acattagtgt ggataataca tgttcattat 541 tcaggggcct tcagaaggaa gaagttgttt tgcttacaca tggagatagt gtagacaaag 601 tagctgatgg attcaaggtt gtggcacgtt ctggaaacat agtagcaggc atagcaaatg 661 aatctaaaaa gttatatgga gcacagttcc accctgaagt tggccttaca gaaaatggaa 721 aagtaatact gaagaatttc ctttatgata tagctggatg cagtggaacc ttcaccgtgc 781 agaacagaga acttgagtgt attcgagaga tcaaagagag agtaggcacg tcaaaagttt 841 tggttttact cagtggtgga gtagactcaa cagtttgtac agctttgcta aatcgtgctt 901 tgaaccaaga acaagtcatt gctgtgcaca ttgataatgg ctttatgaga aaacgagaaa 961 gccagtctgt tgaagaggcc ctcaaaaagc ttggaattca ggtcaaagtg ataaatgctg 1021 ctcattcttt ctacaatgga acaacaaccc taccaatatc agatgaagat agaaccccac 1081 ggaaaagaat tagcaaaacg ttaaatatga ccacaagtcc tgaagagaaa agaaaaatca 1141 ttggggatac ttttgttaag attgccaatg aagtaattgg agaaatgaac ttgaaaccag 1201 aggaggtttt ccttgcccaa ggtactttac ggcctgatct aattgaaagt gcatcccttg 1261 ttgcaagtgg caaagctgaa ctcatcaaaa cccatcacaa tgacacagag ctcatcagaa 1321 agttgagaga ggagggaaaa gtaatagaac ctctgaaaga ttttcataaa gatgaagtga 1381 gaattttggg cagagaactt ggacttccag aagagttagt ttccaggcat ccatttccag 1441 gtcctggcct ggcaatcaga gtaatatgtg ctgaagaacc ttatatttgt aaggactttc 1501 ctgaaaccaa caatattttg aaaatagtag ctgatttttc tgcaagtgtt aaaaagccac 1561 ataccctatt acagagagtc aaagcctgca caacagaaga ggatcaggag aagctgatgc 1621 aaattaccag tctgcattca ctgaatgcct tcttgctgcc aattaaaact gtaggtgtgc 1681 agggtgactg tcgttcctac agttacgtgt gtggaatctc cagtaaagat gaacctgact 1741 gggaatcact tatttttctg gctaggctta tacctcgcat gtgtcacaac gttaacagag 1801 ttgtttatat atttggccca ccagttaaag aacctcctac agatgttact cccactttct 1861 tgacaacagg ggtgctcagt actttacgcc aagctgattt tgaggcccat aacattctca 1921 gggagtctgg gtatgctggg aaaatcagcc agatgccggt gattttgaca ccattacatt 1981 ttgatcggga cccacttcaa aagcagcctt catgccagag atctgtggtt attcgaacct 2041 ttattactag tgacttcatg actggtatac ctgcaacacc tggcaatgag atccctgtag 2101 aggtggtatt aaagatggtc actgagatta agaagattcc tggtatttct cgaattatgt 2161 atgacttaac atcaaagccc ccaggaacta ctgagtggga gtaataaact tc // LOCUS HSU10868 2790 bp mRNA PRI 16-DEC-1995 DEFINITION Human aldehyde dehydrogenase ALDH7 mRNA, complete cds. ACCESSION U10868 NID g601779 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2790) AUTHORS Hsu,L.C., Chang,W.C. and Yoshida,A. TITLE Cloning of a cDNA encoding human ALDH7, a new member of the aldehyde dehydrogenase family JOURNAL Gene 151 (1-2), 285-289 (1994) MEDLINE 95129876 REFERENCE 2 (bases 1 to 2790) AUTHORS Hsu,L.C. TITLE Direct Submission JOURNAL Submitted (14-JUN-1994) Hsu L. C., Beckman Res. Inst. of the City of Hope, Biochemical Genetics, 1450 E. Duarte Rd., Duarte, CA 91010-0269, USA FEATURES Location/Qualifiers source 1..2790 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="ALDH7" /map="11q13" /tissue_type="kidney" CDS 48..1454 /codon_start=1 /function="aldehyde dehydrogenase" /product="ALDH7" /db_xref="PID:g601780" /translation="MDPLGDTLRRLREAFHAGRTRPAEFRAAQLQGLGRFLQENKQLL HDALAQDLHKSAFESEVSEVAISQGEVTLALRNLRAWMKDERVPKNLATQLDSAFIRK EPFGLVLIIAPWNYPLNLTLVPLVGALAAGNCVVLKPSEISKNVEKILAEVLPQYVDQ SCFAVVLGGPQETGQLLEHRFDYIFFTGSPRVGKIVMTAAAKHLTPVTLELGGKNPCY VDDNCDPQTVANRVAWFRYFNAGQTCVAPDYVLCSPEMQERLLPALQSTITRFYGDDP QSSPNLGRIINQKQFQRLRALLGCGRVAIGGQSDESDRYIAPTVLVDVQEMEPVMQEE IFGPILPIVNVQSLDEAIEFINRREKPLALYAFSNSSQVVKRVLTQTSSGGFCGNDGF MHMTLASLPFGGVGASGMGRYHGKFSFDTFSHHRACLLRSPGMEKLNALRYPPQSPRR LRMLLVAMEAQGCSCTLL" BASE COUNT 558 a 873 c 821 g 538 t ORIGIN 1 gcagagcggg acagccagga ggaagggcag cttggcagag cctcaggatg gacccccttg 61 gggacacgct gcggcgactg cgggaggcct tccacgcggg gcgcacgcgg ccagctgagt 121 tccgggctgc gcagctccaa ggcctgggcc gcttcctgca agaaaacaag cagcttctgc 181 acgacgcact ggcccaggac ctgcacaagt cagccttcga gtcggaggtg tctgaggttg 241 ccatcagcca gggcgaggtc accctggccc tcaggaacct ccgggcctgg atgaaggacg 301 agcgtgtgcc caagaacctg gccacgcagc tggactccgc cttcatccgg aaggagccct 361 ttggcctggt cctcatcatt gcgccctgga actatccgct gaacctgacg ctggtgcccc 421 tcgtgggagc cctcgctgca gggaactgtg tggtgctgaa gccatcggag attagcaaga 481 acgtcgagaa gatcctggcc gaggtgctgc cccaatacgt ggaccagagc tgctttgctg 541 tggtgctggg cgggccccag gagacggggc agctgctaga gcacaggttc gactacatct 601 tcttcacagg gagccctcgt gtgggcaaga ttgttatgac tgctgccgcc aagcacctga 661 cacctgtcac cctggagctg gggggcaaga acccttgcta cgtggacgac aactgcgacc 721 cccagaccgt ggccaaccgc gtggcctggt tccgctactt caacgccggc cagacctgcg 781 tggcccccga ctacgtccta tgcagccctg agatgcagga gaggctgctg cctgccctgc 841 agagcaccat cacccgtttc tatggcgacg acccccagag ctccccaaac ctgggccgca 901 tcatcaacca gaaacagttc cagcggctgc gggcattgct gggctgcggc cgtgtggcca 961 ttgggggcca gagcgatgag agcgatcgct acatcgcccc cacggtgctg gtggatgtgc 1021 aggagatgga gcctgtgatg caggaggaga tcttcgggcc catcctgccc atcgtgaacg 1081 tgcagagctt ggacgaggcc atcgagttca tcaaccggcg ggagaagccc ctggccctgt 1141 acgccttctc caacagcagc caggtggtca agcgggtgct gacccagacc agcagcgggg 1201 gcttctgtgg gaacgacggc ttcatgcaca tgaccctggc cagcctgcct tttggaggag 1261 tgggtgccag tgggatgggc cggtaccatg gcaagttctc cttcgacacc ttctcccacc 1321 atcgcgcctg cctcctgcgc agcccgggga tggagaagct caacgccctc cgctacccgc 1381 cgcaatcgcc gcgccgcctg aggatgctgc tggtggccat ggaggcccaa ggctgcagct 1441 gcacactgct ctgagccctt ccccaggccc aggctgtaga ccaccatgac agctgtcgcc 1501 tgcggctggt ggagacgggg cctgggctcc cgggcccgag gaggaaaagg attgccaagg 1561 ctccagggca cccctcaaag cagcgcctgc ctcctccctc ctgggtcttc cctctccctg 1621 cctcagcctc ctccctcagc cgctcccaac catgagagcc gaggtgggag gcatgggaaa 1681 cagtgcagtg actcaccccc tgcccccgca ccaaccaccc atattcagga gaagaggaca 1741 gacacggcac ctctgagtca cccctctcct gtggagcggg cgtccgaggg gcctggcgat 1801 ctgactcagg ccacaccatg gaatcactgc atccaaggcc attcctgccc tctctgagtc 1861 tcagtttttc catttgttca gtggagagaa ttaaccattg atacctcctg gctgggtgag 1921 gcggctcaca cctgtaatcc cagcactttg ggaggccgag gcaggcggat cacctgaaat 1981 caggagttca agatcagcct ggctaacatg gcgaaacccc gtctctacta aaaatacaaa 2041 aattagcctg gcgtggtggc gcatgcctgt aatcccagct actcaggagg ctaaggcagg 2101 agaatcgctt gaacccggga ggtggaggtt gccgtgagct gagattgcgt cactgaactc 2161 cggcctgggt gacagaagga ggctctgcct taaaaaaaaa aaaaaaaaaa aaaacctcct 2221 gggactgttg caaggatgaa atgaaggatt gagggattga gggattgctg agctggagct 2281 ccaggtgtcc tatctttctc agtggggtgg cacggagcgg ggccgcctcc ctcttctctc 2341 caggcaggtg gggctgtggt tatgcgatag ggtctccctt ccctccagcc catgccagga 2401 gcttgtaact ctttatcctc atggtgccca ctacgagtca tactcttccc catgctgctc 2461 atcctcctgg gccccatcca ctcagccaaa gcagaatgca gggtttcctg cctgacaacc 2521 cttctcacct cccaagtccc acttttgaac aagctgatga ttctgaaact ggcccaattt 2581 cctaaaagcg ggggtgcttg agaaacctac atttggacaa tgagaggctg ctcctgcggc 2641 ctgcgggcca cctcctcttc cttggctcct gctttctttt tagactatat caacctacaa 2701 ctttagtcgg gaagagggac aggggtggac ctgagtttcg tctcctgtct ctctggctga 2761 tgtcacctga ataaagcctt cttccctggc // LOCUS HSU10906 597 bp mRNA PRI 28-JUL-1994 DEFINITION Human cyclin-dependent kinase inhibitor p27kip1 mRNA, complete cds. ACCESSION U10906 NID g516558 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 597) AUTHORS Polyak,K., Lee,Mong-Hong., Erdjument-Breomage,H., Koff,A., Roberts,J.M., Tempst,P. and Massague,J. TITLE Cloning of p27kip1, a cyclin-dependent kinase inhibitor, and a potential mediator of extracellular antimitogenic signals JOURNAL Cell 78, 56-66 (1994) REFERENCE 2 (bases 1 to 597) AUTHORS Massague,J. TITLE Direct Submission JOURNAL Submitted (15-JUN-1994) M.-H. Lee and J. Massague, Cell Biology and Genetics, Memorial Sloan-Kettering Cancer Center, 1275 York Avenue, New York, NY 10021, USA FEATURES Location/Qualifiers source 1..597 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="p27 kip1" /sex="male" /tissue_type="kidney" /dev_stage="adult" CDS 1..597 /codon_start=1 /function="cyclin-dependent kinase inhibitor" /product="p27kip1" /db_xref="PID:g516559" /translation="MSNVRVSNGSPSLERMDARQAEHPKPSACRNLFGPVDHEELTRD LEKHCRDMEEASQRKWNFDFQNHKPLEGKYEWQEVEKGSLPEFYYRPPRPPKGACKVP AQESQDVSGSRPAAPLIGAPANSEDTHLVDPKTDPSDSQTGLAEQCAGIRKRPATDDS STQNKRANRTEENVSDGSPNAGSVEQTPKKPGLRRRQT" BASE COUNT 161 a 164 c 185 g 87 t ORIGIN 1 atgtcaaacg tgcgagtgtc taacgggagc cctagcctgg agcggatgga cgccaggcag 61 gcggagcacc ccaagccctc ggcctgcagg aacctcttcg gcccggtgga ccacgaagag 121 ttaacccggg acttggagaa gcactgcaga gacatggaag aggcgagcca gcgcaagtgg 181 aatttcgatt ttcagaatca caaaccccta gagggcaagt acgagtggca agaggtggag 241 aagggcagct tgcccgagtt ctactacaga cccccgcggc cccccaaagg tgcctgcaag 301 gtgccggcgc aggagagcca ggatgtcagc gggagccgcc cggcggcgcc tttaattggg 361 gctccggcta actctgagga cacgcatttg gtggacccaa agactgatcc gtcggacagc 421 cagacggggt tagcggagca atgcgcagga ataaggaagc gacctgcaac cgacgattct 481 tctactcaaa acaaaagagc caacagaaca gaagaaaatg tttcagacgg ttccccaaat 541 gccggttctg tggagcagac gcccaagaag cctggcctca gaagacgtca aacgtaa // LOCUS HSU10990 2216 bp mRNA PRI 05-APR-1995 DEFINITION Human nuclear receptor hTAK1 (hTAK1) mRNA, complete cds. ACCESSION U10990 NID g758381 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2216) AUTHORS Hirose,T., Fujimoto,W., Tamaai,T., Kim,K.H., Matsuura,H. and Jetten,A.M. TITLE TAK1: molecular cloning and characterization of a new member of the nuclear receptor superfamily JOURNAL Mol. Endocrinol. 8 (12), 1667-1680 (1994) MEDLINE 95223313 REFERENCE 2 (bases 1 to 2216) AUTHORS Jetten,A.M. TITLE Direct Submission JOURNAL Submitted (17-JUN-1994) Anton M. Jetten, Cell Biology Section, National Institute of Environmental Health Sciences, NIH, 111 T.W. Alexander Drive, Research Triangle Park, NC 27709, USA FEATURES Location/Qualifiers source 1..2216 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hTAK1" /clone_lib="testis cDNA library in lambda gt10" /sex="male" /tissue_type="testis" 5'UTR 1..240 misc_feature 117..177 /note="zinc finger domains" gene 241..2031 /gene="hTAK1" CDS 241..2031 /gene="hTAK1" /codon_start=1 /product="hTAK1" /db_xref="PID:g758382" /translation="MTSPSPRIQIISTDSAVASPQRIQIVTDQQTGQKIQIVTAVDAS GSPKQQFILTSPDGAGTGKVILASPETSSAKQLIFTTSDNLVPGRIQIVTDSASVERL LGKTDVQRPQVVEYCVVCGDKASGRHYGAVSCEGCKGFFKRSVRKNLTYSCRSNQDCI INKHHRNRCQFCRLKKCLEMGMKMESVQSERKPFDVQREKPSNCAASTEKIYIRKDLR SPLIATPTFVADKDGARQTGLLDPGMLVNIQQPLIREDGTVLLATDSKAETSQGALGT LANVVTSLANLSESLNNGDTSEIQPEDQSASEITRAFDTLAKALNTTDSSSSPSLADG IDTSGGGSIHVISRDQSTPIIEVEGPLLSDTHVTFKLTMPSPMPEYLNVHYICESASR LLFLSMHWARSIPAFQALGQDCNTSLVRACWNELFTLGLAQCAQVMSLSTILAAIVNH LQNSIQEDKLSGDRIKQVMEHIWKLQEFCNSMAKLDIDGYEYAYLKAIVLFSPDHPGL TSTSQIEKFQEKAQMELQDYVQKTYSEDTYRLARILVRLPALRLMSSNITEELFFTGL IGNVSIDSIIPYILKMETAEYNGQITGASL" 3'UTR 2032..2216 polyA_signal 2185 BASE COUNT 598 a 591 c 515 g 512 t ORIGIN 1 ggcaaactgt catcaggagc ttaaatagga cagattttca cggcagtgga tataccagca 61 tggctattat ctatggagtg ttctctgctt caaatttgat tacaccgtca gtggttgcca 121 ttgtaggacc tcaactctct atgtttgcca gtggtttatt ttacagcatg tacattgccg 181 ttttcatcca gcctttcccg tggtccttct acacagacct ctcggccgga atctccaggg 241 atgaccagcc cctccccacg catccagata atctccaccg actctgctgt agcctcacct 301 cagcgcattc agattgtcac agaccagcag acaggacaga aaatccagat agtcaccgca 361 gtggacgcct ccggatcccc caaacagcag ttcatcctga ccagcccaga tggagctgga 421 actgggaagg tgatcctggc ttccccagag acatccagcg ccaagcaact catattcacc 481 acctcagaca acctcgtccc tggcaggatc cagattgtca cggattctgc ctctgtggag 541 cgtttactgg ggaagacgga cgtccagcgg ccccaggtgg tagagtactg tgtggtctgt 601 ggcgacaaag cctccggccg tcactatggg gctgtcagtt gtgaaggttg caaaggtttc 661 ttcaaaagga gtgtgaggaa aaatttgacc tacagctgcc ggagcaacca agactgcatc 721 atcaataaac atcaccggaa ccgctgtcag ttttgccggc tgaaaaaatg cttagagatg 781 ggcatgaaaa tggaatctgt gcagagtgaa cggaagccct tcgatgtgca acgggagaaa 841 ccaagcaatt gtgctgcttc aactgagaaa atctatatcc ggaaagacct gagaagtccc 901 ctgatagcta ctcccacgtt tgtggcagac aaagatggag caagacaaac aggtcttctt 961 gatccaggga tgcttgtgaa catccagcag cctttgatac gtgaggatgg tacagttctc 1021 ctggccacgg attctaaggc tgaaacaagc cagggagctc tgggcacact ggcaaatgta 1081 gtgacctccc ttgccaacct aagtgaatct ttgaacaacg gtgacacttc agaaatccag 1141 ccagaggacc agtctgcaag tgagataact cgggcatttg ataccttagc taaagcactt 1201 aataccacag acagctcctc ttctccaagc ttggcagatg ggatagacac cagtggagga 1261 gggagcatcc acgtcatcag cagagaccag tcgacaccca tcattgaggt tgaaggcccc 1321 ctcctttcag acacacacgt cacatttaag ctaacaatgc ccagtccaat gccagagtac 1381 ctcaacgtgc actacatctg tgagtctgca tcccgtctgc ttttcctctc aatgcactgg 1441 gctcggtcaa tcccagcctt tcaggcactt gggcaggact gcaacaccag ccttgtgcgg 1501 gcctgctgga atgagctctt caccctcggc ctggcccagt gtgcccaggt catgagtctc 1561 tccaccatcc tggctgccat tgtcaaccac ctgcagaaca gcatccagga agataaactt 1621 tctggtgacc ggataaagca agtcatggag cacatctgga agctgcagga gttctgtaac 1681 agcatggcga agctggatat agatggctat gagtatgcat accttaaagc tatagttctc 1741 tttagccccg atcatccagg tttgaccagc acaagccaga ttgaaaaatt ccaagaaaag 1801 gcacagatgg agttgcagga ctatgttcag aaaacctact cagaagacac ctaccgattg 1861 gcccggatcc tcgttcgcct gccggcactc aggctgatga gctccaacat aacagaagaa 1921 ctttttttta ctggtctcat tggcaatgtt tcgatagaca gcataatccc ctacatcctc 1981 aagatggaga cagcagagta taatggccag atcaccggag ccagtctata gcgcaaacca 2041 cacacctgcc aaggagcaac agaatccttc caggaccgtt cacatacaaa gaaagtagtg 2101 gtattttggt atgtgcaaat atttccatat gttagccatt tcctgtctgg tttctcctta 2161 tctgttaatc ccagacaata gcaattaaaa agactagtag gatcctttcc tgacat // LOCUS HSU11037 571 bp mRNA PRI 18-APR-1997 DEFINITION Human Sel-1 like mRNA, complete cds. ACCESSION U11037 NID g836884 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 569) AUTHORS Appierto,V., Pergolizzi,R., Spurr,N. and Biunno,I. TITLE Identification and chromosomal localization of two new genes JOURNAL Unpublished REFERENCE 2 (bases 1 to 571) AUTHORS Biunno,I. TITLE Direct Submission JOURNAL Submitted (20-JUN-1994) Ida Biunno, CNR, ITBA, Via Ampere 56, Milano 20131, Italy FEATURES Location/Qualifiers source 1..571 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Clontech skeletal muscle cDNA library" /sex="male" /tissue_type="muscle, leg" /chromosome="14" gene 12..299 /gene="Sel-1 like" CDS 12..299 /gene="Sel-1 like" /codon_start=1 /function="unknown" /db_xref="PID:g836885" /translation="MGLDTDVDYETAFIHYRLASEQQHSAQAMFNLGYMHEKGLGIKQ DIHLAKRFYDMAAVSQPRCTSSSLPSPLQIGHRLFLAVHTGNKHSRYVLPT" BASE COUNT 149 a 143 c 139 g 140 t ORIGIN 1 gttccatttc tatgggtttg gacaccgatg tagattatga aactgcattt attcattacc 61 gtctggcttc tgagcagcaa cacagtgcac aagctatgtt taatctggga tatatgcatg 121 agaaaggact gggcattaaa caggatattc accttgcgaa acgtttttat gacatggcag 181 ctgtaagcca gcccagatgc acaagttcca gtcttcctag ccctctgcaa attgggcatc 241 gtctatttct tgcagtacat acgggaaaca aacattcgag atatgttctc ccaacttgat 301 atggaccagc ttttgggacc tgagtgggac ctttacctca tgaccatcat tgcgctctgt 361 tgggaagtca tagcttacag gcaaaggcag caccaagaca tgcctgcacc caggcctcca 421 gggccacggc cagctccacc ccagcaggag gggccaccag agcagcagcc accacagtaa 481 taggcactgg gtccagcctt gatcagtgac agcgaaggaa gttatctgct gggaacactt 541 gcatttgatt taggaccttg gggatccgat g // LOCUS HSU11053 1182 bp mRNA PRI 24-AUG-1994 DEFINITION Human kappa opioid receptor (hKOR) mRNA, complete cds. ACCESSION U11053 NID g532059 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1182) AUTHORS Mansson,E., Bare,L.A. and Yang,D. TITLE Isolation of a human kappa opioid receptor cDNA from placenta JOURNAL Biochem. Biophys. Res. Commun. 202, 1431-1437 (1994) MEDLINE 94338360 REFERENCE 2 (bases 1 to 1182) AUTHORS Mansson,E. TITLE Direct Submission JOURNAL Submitted (20-JUN-1994) Erik Mansson, Molecular Biology, Ohmeda, PPD, 100 Mounatain Avenue, Murray Hill, NJ 07974, USA FEATURES Location/Qualifiers source 1..1182 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="phK1.3" /tissue_type="placenta" gene 14..1156 /gene="hKOR" CDS 14..1156 /gene="hKOR" /codon_start=1 /product="kappa opioid receptor" /db_xref="PID:g532060" /translation="MESPIQIFRGEPGPTCAPSACLPPNSSAWFPGWAEPDSNGSAGS EDAQLEPAHISPAIPVIITAVYSVVFVVGLVGNSLVMFVIIRYTKMKTATNIYIFNLA LADALVTTTMPFQSTVYLMNSWPFGDVLCKIVISIDYYNMFTSIFTLTMMSVDRYIAV CHPVKALDFRTPLKAKIINICIWLLSSSVGISAIVLGGTKVREDVDVIECSLQFPDDD YSWWDLFMKICVFIFAFVIPVLIIIVCYTLMILRLKSVRLLSGSREKDRNLRRITRLV LVVVAVFVVCWTPIHIFILVEALGSTSHSTAALSSYYFCIALGYTNSSLNPILYAFLD ENFKRCFRDFCFPLKMRMERQSTSRVRNTVQDPAYLRDIDGMNKPV" BASE COUNT 245 a 345 c 293 g 299 t ORIGIN 1 tgcagcactc accatggaat ccccgattca gatcttccgc ggggagcctg gccctacctg 61 cgccccgagc gcctgcctgc cccccaacag cagcgcctgg tttcccggct gggccgagcc 121 cgacagcaac ggcagcgccg gctcggagga cgcgcagctg gagcccgcgc acatctcccc 181 ggccatcccg gtcatcatca cggcggtcta ctccgtagtg ttcgtcgtgg gcttggtggg 241 caactcgctg gtcatgttcg tgatcatccg atacacaaag atgaagacag caaccaacat 301 ttacatattt aacctggctt tggcagatgc tttagttact acaaccatgc cctttcagag 361 tacggtctac ttgatgaatt cctggccttt tggggatgtg ctgtgcaaga tagtaatttc 421 cattgattac tacaacatgt tcaccagcat cttcaccttg accatgatga gcgtggaccg 481 ctacattgcc gtgtgccacc ccgtgaaggc tttggacttc cgcacaccct tgaaggcaaa 541 gatcatcaat atctgcatct ggctgctgtc gtcatctgtt ggcatctctg caatagtcct 601 tggaggcacc aaagtcaggg aagacgtcga tgtcattgag tgctccttgc agttcccaga 661 tgatgactac tcctggtggg acctcttcat gaagatctgc gtcttcatct ttgccttcgt 721 gatccctgtc ctcatcatca tcgtctgcta caccctgatg atcctgcgtc tcaagagcgt 781 ccggctcctt tctggctccc gagagaaaga tcgcaacctg cgtaggatca ccagactggt 841 cctggtggtg gtggcggttt tcgtcgtctg ctggactccc attcacatat tcatcctggt 901 ggaggctctg gggagcacct cccacagcac agctgctctc tccagctatt acttctgcat 961 cgccttaggc tataccaaca gtagcctgaa tcccattctc tacgcctttc ttgatgaaaa 1021 cttcaagcgg tgtttccggg acttctgctt tccactgaag atgaggatgg agcggcagag 1081 cactagcaga gtccgaaata cagttcagga tcctgcttac ctgagggaca tcgatgggat 1141 gaataaacca gtatgactag tcgtggagat gtcttcgtac ag // LOCUS HSU11090 1334 bp mRNA PRI 02-JAN-1995 DEFINITION Human hydroxyindole-O-methyltransferase promoter A-derived (HIOMT) mRNA, complete cds. ACCESSION U11090 NID g607839 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1334) AUTHORS Rodriguez,I.R., Mazuruk,K., Schoen,T.J. and Chader,G.J. TITLE Structural analysis of the human hydroxyindole-O-methyltransferase gene: Presence of two distinct promoters JOURNAL Unpublished REFERENCE 2 (bases 1 to 1334) AUTHORS Rodriguez,I.R. TITLE Direct Submission JOURNAL Submitted (22-JUN-1994) Ignacio R. Rodriguez, National Eye Institute, NIH, LRCMB, Bldg. 6 Rm. 304, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..1334 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Amplified from human retina cDNA immobilized on Dynabeads" /chromosome="X" /map="Xp22.3" 5'UTR 1..145 /note="promoter A derived 5' untranslated region" gene 146..1267 /gene="HIOMT" CDS 146..1267 /gene="HIOMT" /note="alternatively spliced including exons 6 and 7" /codon_start=1 /product="hydroxyindole-O-methyltransferase" /db_xref="PID:g607840" /translation="MGSSEDQAYRLLNDYANGFMVSQVLFAACELGVFDLLAEAPGPL DVAAVAAGVRASAHGTELLLDICVSLKLLKVETRGGKAFYRNTELSSDYLTTVSPTSQ CSMLKYMGRTSYRCWGHLADAVREGRNQYLETFGVPAEELFTAIYRSEGERLQFMQAL QEVWSVNGRSVLTAFDLSVFPLMCDLGGTRIKLETIILSKLSQGQKTKHRVFSLIGGA GALAKECMSLYPGCKITVFDIPEVVWTAKQHFSFQEEEQIDFQEGDFFKDPLPEADLY ILARVLHDWADGKCSHLLERIYHTCKPGGGILVIESLLDEDRRGPLLTQLYSLNMLVQ TEGQERTPTHYHMLLSSAGFRDFQFKKTGAIYDAILARK" misc_feature 707..791 /gene="HIOMT" /note="alternatively spliced exon 6, line-1 element" misc_feature 792..932 /gene="HIOMT" /note="alternatively spliced exon 7; Three alternatively spliced transcripts are observed: A+6+7, A-6+7 and A-6-7" polyA_signal 1315..1320 BASE COUNT 336 a 348 c 386 g 264 t ORIGIN 1 tcaggaactg gagcagcaaa gagaagaaca aaagcgcaga gagaaggaag cggaggagag 61 gcagcgagcg gaggaaagca ggctctgtgc tccttgaagc aagcgctcca gaggctccgg 121 aagccacggc tggattggag acaagatggg atcctcagag gaccaggcct atcgcctcct 181 taatgactac gccaacggct tcatggtgtc ccaggttctc ttcgccgcct gcgagctggg 241 cgtgtttgac cttctcgccg aggccccagg gcccctggac gtggcggcag tggctgcagg 301 tgtgagggcc agcgcccatg ggacagagct cctgctggac atctgtgtgt ccctgaagct 361 gctgaaagtg gagacgaggg gaggaaaagc tttctatcga aacacagagc tgtccagcga 421 ctacctgacc acggtcagcc cgacgtcaca atgcagcatg ctgaagtaca tgggcaggac 481 cagctaccgg tgctggggcc acctggcaga cgccgtgaga gaaggaagga accagtacct 541 ggagacgttt ggcgttcccg ctgaagagct ttttacggcc atctacaggt ccgagggcga 601 gcggctacag ttcatgcaag ctctgcagga ggtctggagc gtcaacggga gaagcgtgct 661 gaccgccttt gacctgtcag tgttcccact tatgtgtgac cttggtggga cacggataaa 721 gctggaaacc atcattctca gcaaactatc gcaaggacag aaaaccaaac accgcgtgtt 781 ctcactcata ggtggggctg gagctctggc taaggaatgc atgtctctgt accctggatg 841 taagatcacc gtttttgaca tcccagaagt ggtgtggacg gcaaagcagc acttctcatt 901 ccaggaggaa gaacagattg acttccagga aggggatttc ttcaaagacc ctcttccgga 961 agctgatctg tacatcctgg ccagggtcct ccatgactgg gcagacggaa agtgctcaca 1021 cctgctggag aggatctacc acacttgcaa gccaggtggt ggcattctgg taattgaaag 1081 cctcctggat gaagacaggc gaggtcctct gctcacgcag ctctactctc tgaacatgct 1141 tgtgcagacg gaagggcagg agaggacccc cacccactac cacatgctcc tctcttctgc 1201 tggcttcaga gacttccagt ttaagaaaac aggagccatt tatgatgcca ttttagccag 1261 gaaataactg tttcttgtga cctggaacta acgtcaaagc acacaagaca taataataaa 1321 gacatgtacc tcca // LOCUS HSU11276 738 bp mRNA PRI 23-SEP-1994 DEFINITION Human hNKR-P1a protein (NKR-P1A) mRNA, complete cds. ACCESSION U11276 NID g538270 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 738) AUTHORS Lanier,L.L., Chang,C. and Phillips,J.H. TITLE Human NKR-P1A: A disulfide-linked homodimer of the C-type lectin superfamily expressed by a subset of NK and T lymphocytes JOURNAL J. Immunol. 153, 2417-2428 (1994) MEDLINE 94358407 REFERENCE 2 (bases 1 to 738) AUTHORS Lanier,L.L. TITLE Direct Submission JOURNAL Submitted (23-JUN-1994) Lewis L. Lanier, Department of Human Immunology, DNAX Research Institute of Molecular and Cellular Biology, 901 California Avenue, Palo Alto, CA 94304, USA FEATURES Location/Qualifiers source 1..738 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="LL269" /clone_lib="human NK cell cDNA pJFE14" /chromosome="12" /cell_type="natural killer cell" /dev_stage="adult" gene 61..738 /gene="NKR-P1A" CDS 61..738 /gene="NKR-P1A" /codon_start=1 /product="hNKR-P1a protein" /db_xref="PID:g544496" /translation="MDQQAIYAELNLPTDSGPESSSPSSLPRDVCQGSPWHQFALKLS CAGIILLVLVVTGLSVSVTSLIQKSSIEKCSVDIQQSRNKTTERPGLLNCPIYWQQLR EKCLLFSHTVNPWNNSLADCSTKESSLLLIRDKDELIHTQNLIRDKAILFWIGLNFSL SEKNWKWINGSFLNSNDLEIRGDAKENSCISISQTSVYSEYCSTEIRWICQKELTPVR NKVYPDS" BASE COUNT 231 a 151 c 143 g 213 t ORIGIN 1 aaagcagaat tgagagtttg ttcttacaca caagtttaat gccaccttcc tctgtctgcc 61 atggaccaac aagcaatata tgctgagtta aacttaccca cagactcagg cccagaaagt 121 tcttcacctt catctcttcc tcgggatgtc tgtcagggtt caccttggca tcaatttgcc 181 ctgaaactta gctgtgctgg gattattctc cttgtcttgg ttgttactgg gttgagtgtt 241 tcagtgacat ccttaataca gaaatcatca atagaaaaat gcagtgtgga cattcaacag 301 agcaggaata aaacaacaga gagaccgggt ctcttaaact gcccaatata ttggcagcaa 361 ctccgagaga aatgcttgtt attttctcac actgtcaacc cttggaataa cagtctagct 421 gattgttcca ccaaagaatc cagcctgctg cttattcgag ataaggatga attgatacac 481 acacagaacc tgatacgtga caaagcaatt ctgttttgga ttggattaaa tttttcatta 541 tcagaaaaga actggaagtg gataaacggc tcttttttaa attctaatga cttagaaatt 601 agaggtgatg ctaaagaaaa cagctgtatt tccatctcac agacatctgt gtattctgag 661 tactgtagta cagaaatcag atggatctgc caaaaagaac taacacctgt gagaaataaa 721 gtgtatcctg actcttga // LOCUS HSU11287 5969 bp mRNA PRI 16-AUG-1995 DEFINITION Human N-methyl-D-aspartate receptor subunit NR3 (hNR3) mRNA, complete cds. ACCESSION U11287 NID g560546 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5969) AUTHORS Adams,S.L., Foldes,R.L. and Kamboj,R.K. TITLE Human N-methyl-D-aspartate receptor modulatory subunit hNR3: cloning and sequencing of the cDNA and primary structure of the protein JOURNAL Biochim. Biophys. Acta 1260 (1), 105-108 (1995) MEDLINE 95092783 REFERENCE 2 (bases 1 to 5969) AUTHORS Foldes,R.L. TITLE Direct Submission JOURNAL Submitted (24-JUN-1994) Robert L. Foldes, Allelix Biopharmaceuticals Inc., 6850 Goreway Drive, Mississauga, Ontario L4V 1V7, Canada FEATURES Location/Qualifiers source 1..5969 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="FB2C, FB6B, FB2B, FB17, FB19A, FB5, FB18, FB2A, FB10" /clone_lib="Stratagene Library No.936206" /tissue_type="brain" /dev_stage="fetus" 5'UTR <1..210 /evidence=experimental gene 211..4665 /gene="hNR3" CDS 211..4665 /gene="hNR3" /codon_start=1 /evidence=experimental /product="N-methyl-D-aspartate receptor subunit NR3" /db_xref="PID:g560547" /translation="MKPRAECCSPKFWLVLAVLAVSGSRARSQKSPPSIGIAVILVGT SDEVAIKDAHEKDDFHHLSVVPRVELVAMNETDPKSIITRICDLMSDRKIQGVVFADD TDQEAIAQILDFISAQTLTPILGIHGGSSMIMADKDESSMFFQFGPSIEQQASVMLNI MEEYDWYIFSIVTTYFPGYQDFVNKIRSTIENSFVGWELEEVLLLDMSLDDGDSKIQN QLKKLQSPIILLYCTKEEATYIFEVANSVGLTGYGYTWIVPSLVAGDTDTVPAEFPTG LISVSYDEWDYGLPARVRDGIAIITTAASDMLSEHSFIPEPKSSCYNTHEKRIYQSNM LNRYLINVTFEGRNLSFSEDGYQMHPKLVIILLNKERKWERVGKWKDKSLQMKYYVWP RMCPETEEQEDDHLSIVTLEEAPFVIVESVDPLSGTCMRNTVPCQKRIVTENKTDEEP GYIKKCCKGFCIDILKKISKSVKFTYDLYLVTNGKHGKKINGTWNGMIGEVVMKRAYM AVGSLTINEERSEVVDFSVPFIETGISVMVSRSNGTVSPSAFLEPFSADVWVMMFVML LIVSAVAVFVFEYFSPVGYNRCLADGREPGGPSFTIGKAIWLLWGLVFNNSVPVQNPK GTTSKIMVSVWAFFAVIFLASYTANLAAFMIQEEYVDQVSGLSDKKFQRPNDFSPPFR FGTVPNGSTERNIRNNYAEMHAYMGKFNQRGVDDALLSLKTGKLDAFIYDAAVLNYMA GRDEGCKLVTIGSGKVFASTGYGIAIQKDSGWKRQVDLAILQLFGDGEMEELEALWLT GICHNEKNEVMSSQLDIDNMAGVFYMLGAAMALSLITFICEHLFYWQFRHCFMGVCSG KPGMVFSISRGIYSCIHGVAIEERQSVMNSPTATMNNTHSNILRLLRTAKNMANLSGV NGSPQRPLDFIRRESSVYDISEHRRSFTHSDCKSYNNPPCEENLFSDYISEVERTFGN LQLKDSNVYQDHYHHHHRPHSIGSASSIDGLYDCDNPPFTTQSRSISKKPLDIGLPSS KHSQLSDLYGKFSFKSDRYSGHDDLIRSDVSDISTHTVTYGNIEGNAAKRRKQQYKDS LKKRPASAKSRREFDEIELAYRRRPPRSPDHKRYFRDKEGLRDFYLDQFRTKENSPHW EHVDLTDIYKERSDDFKRDSVSGGGPCTNRSHIKHGTGDKHGVVSGVPAPWEKNLTNV EWEDRSGGNFCRSCPSKLHNYSTTVTGQNSGRQACIRCEACKKAGNLYDISEDNSLQE LDQPAAPVAVTSNASTTKYPQSPTNSKAQKKNRNKLRRQHSYDTFVDLQKEEAALAPR SVSLKDKGRFMDGSPYAHMFEMSAGESTFANNKSSVPTAGHHHHNNPGGGYMLSKSLY PDRVTQNPFIPTFGDDQCLLHGSKSYFFRQPTVAGASKARPDFRALVTNKPVVSALHG AVPARFQKDICIGNQSNPCVPNNKNPRAFNGSSNGHVYEKLSSIESDV" misc_difference 1430 /gene="hNR3" /note="hNR3-2" /replace="a" misc_difference 2874 /gene="hNR3" /note="hNR3-3" /replace="t" 3'UTR 4666..>5969 /evidence=experimental BASE COUNT 1504 a 1548 c 1559 g 1358 t ORIGIN 1 tttgaatttg catctcttca agacacaaga ttaaaacaaa atttacgcta aattggattt 61 taaattatct tccgttcatt tatccttcgt ctttcttatg tggatatgca agcgagaaga 121 agggactgga cattcccaac atgctcactc ccttaatctg tccgtctaga ggtttggctt 181 ctacaaacca agggagtcga cgagttgaag atgaagccca gagcggagtg ctgttctccc 241 aagttctggt tggtgttggc cgtcctggcc gtgtcaggca gcagagctcg ttctcagaag 301 agccccccca gcattggcat tgctgtcatc ctcgtgggca cttccgacga ggtggccatc 361 aaggatgccc acgagaaaga tgatttccac catctctccg tggtaccccg ggtggaactg 421 gtagccatga atgagaccga cccaaagagc atcatcaccc gcatctgtga tctcatgtct 481 gaccggaaga tccagggggt ggtgtttgct gatgacacag accaggaagc catcgcccag 541 atcctcgatt tcatttcagc acagactctc accccgatcc tgggcatcca cgggggctcc 601 tctatgataa tggcagataa ggatgaatcc tccatgttct tccagtttgg cccatcaatt 661 gaacagcaag cttccgtaat gctcaacatc atggaagaat atgactggta catcttttct 721 atcgtcacca cctatttccc tggctaccag gactttgtaa acaagatccg cagcaccatt 781 gagaatagct ttgtgggctg ggagctagag gaggtcctcc tactggacat gtccctggac 841 gatggagatt ctaagatcca gaatcagctc aagaaacttc aaagccccat cattcttctt 901 tactgtacca aggaagaagc cacctacatc tttgaagtgg ccaactcagt agggctgact 961 ggctatggct acacgtggat cgtgcccagt ctggtggcag gggatacaga cacagtgcct 1021 gcggagttcc ccactgggct catctctgta tcatatgatg aatgggacta tggcctcccc 1081 gccagagtga gagatggaat tgccataatc accactgctg cttctgacat gctgtctgag 1141 cacagcttca tccctgagcc caaaagcagt tgttacaaca cccacgagaa gagaatctac 1201 cagtccaata tgctaaatag gtatctgatc aatgtcactt ttgaggggag gaatttgtcc 1261 ttcagtgaag atggctacca gatgcacccg aaactggtga taattcttct gaacaaggag 1321 aggaagtggg aaagggtggg gaagtggaaa gacaagtccc tgcagatgaa gtactatgtg 1381 tggccccgaa tgtgtccaga gactgaagag caggaggatg accatctgag cattgtgacc 1441 ctggaggagg caccatttgt cattgtggaa agtgtggacc ctctgagtgg aacctgcatg 1501 aggaacacag tcccctgcca aaaacgcata gtcactgaga ataaaacaga cgaggagccg 1561 ggttacatca aaaaatgctg caaggggttc tgtattgaca tccttaagaa aatttctaaa 1621 tctgtgaagt tcacctatga cctttacctg gttaccaatg gcaagcatgg gaagaaaatc 1681 aatggaacct ggaatggtat gattggagag gtggtcatga agagggccta catggcagtg 1741 ggctcactca ccatcaatga ggaacgatcg gaggtggtcg acttctctgt gcccttcata 1801 gagacaggca tcagtgtcat ggtgtcacgc agcaatggga ctgtctcacc ttctgccttc 1861 ttagagccat tcagcgctga cgtatgggtg atgatgtttg tgatgctgct catcgtctca 1921 gccgtggctg tctttgtctt tgagtacttc agccctgtgg gttataacag gtgcctcgct 1981 gatggcagag agcctggtgg accctctttc accatcggca aagctatttg gttgctctgg 2041 ggtctggtgt ttaacaactc cgtacctgtg cagaacccaa aggggaccac ctccaagatc 2101 atggtgtcag tgtgggcctt ctttgctgtc atcttcctgg ccagctacac tgccaactta 2161 gctgccttca tgatccaaga ggaatatgtg gaccaggttt ctggcctgag cgacaaaaag 2221 ttccagagac ctaatgactt ctcaccccct ttccgctttg ggaccgtgcc caacggcagc 2281 acagagagaa atattcgcaa taactatgca gaaatgcatg cctacatggg aaagttcaac 2341 cagaggggtg tagatgatgc attgctctcc ctgaaaacag ggaaactgga tgccttcatc 2401 tatgatgcag cagtgctgaa ctatatggca ggcagagatg aaggctgcaa gctggtgacc 2461 attggcagtg ggaaggtctt tgcttccact ggctatggca ttgccatcca aaaagattct 2521 gggtggaagc gccaggtgga ccttgctatc ctgcagctct ttggagatgg ggagatggaa 2581 gaactggaag ctctctggct cactggcatt tgtcacaatg agaagaatga ggtcatgagc 2641 agccagctgg acattgacaa catggcaggg gtcttctaca tgttgggggc ggccatggct 2701 ctcagcctca tcaccttcat ctgcgaacac cttttctatt ggcagttccg acattgcttt 2761 atgggtgtct gttctggcaa gcctggcatg gtcttctcca tcagcagagg tatctacagc 2821 tgcatccatg gggtggcgat cgaggagcgc cagtctgtaa tgaactcccc caccgcaacc 2881 atgaacaaca cacactccaa catcctgcgc ctgctgcgca cggccaagaa catggctaac 2941 ctgtctggtg tgaatggctc accgcagagg cccctggact tcatccgacg ggagtcatcc 3001 gtctatgaca tctcagagca ccgccgcagc ttcacgcatt ctgactgcaa atcctacaac 3061 aacccgccct gtgaggagaa cctcttcagt gactacatca gtgaggtaga gagaacgttc 3121 gggaacctgc agctgaagga cagcaacgtg taccaagatc actaccacca tcaccaccgg 3181 ccccatagta ttggcagtgc cagctccatc gatgggctct acgactgtga caacccaccc 3241 ttcaccaccc agtccaggtc catcagcaag aagcccctgg acatcggcct cccctcctcc 3301 aagcacagcc agctcagtga cctgtacggc aaattctcct tcaagagcga ccgctacagt 3361 ggccacgacg acttgatccg ctccgatgtc tctgacatct caacccacac cgtcacctat 3421 gggaacatcg agggcaatgc cgccaagagg cgtaagcagc aatataagga cagcctgaag 3481 aagcggcctg cctcggccaa gtcccgcagg gagtttgacg agatcgagct ggcctaccgt 3541 cgccgaccgc cccgctcccc tgaccacaag cgctacttca gggacaagga agggctacgg 3601 gacttctacc tggaccagtt ccgaacaaag gagaactcac cccactggga gcacgtagac 3661 ctgaccgaca tctacaagga gcggagtgat gactttaagc gcgactccgt cagcggagga 3721 gggccctgta ccaacaggtc tcacatcaag cacgggacgg gcgacaaaca cggcgtggtc 3781 agcggggtac ctgcaccttg ggagaagaac ctgaccaacg tggagtggga ggaccggtcc 3841 gggggcaact tctgccgcag ctgtccctcc aagctgcaca actactccac gacggtgacg 3901 ggtcagaact cgggcaggca ggcgtgcatc cggtgtgagg cttgcaagaa agcaggcaac 3961 ctgtatgaca tcagtgagga caactccctg caggaactgg accagccggc tgccccagtg 4021 gcggtgacgt caaacgcctc caccactaag taccctcaga gcccgactaa ttccaaggcc 4081 cagaagaaga accggaacaa actgcgccgg cagcactcct acgacacctt cgtggacctg 4141 cagaaggaag aagccgccct ggccccgcgc agcgtaagcc tgaaagacaa gggccgattc 4201 atggatggga gcccctacgc ccacatgttt gagatgtcag ctggcgagag cacctttgcc 4261 aacaacaagt cctcagtgcc cactgccgga catcaccacc acaacaaccc cggcggcggg 4321 tacatgctca gcaagtcgct ctaccctgac cgggtcacgc aaaacccttt catccccact 4381 tttggggacg accagtgctt gctccatggc agcaaatcct acttcttcag gcagcccacg 4441 gtggcggggg cgtcgaaagc caggccggac ttccgggccc ttgtcaccaa caagccggtg 4501 gtctcggccc ttcatggggc cgtgccagcc cgtttccaga aggacatctg tatagggaac 4561 cagtccaacc cctgtgtgcc taacaacaaa aaccccaggg ctttcaatgg ctccagcaat 4621 gggcatgttt atgagaaact ttctagtatt gagtctgatg tctgagtgag ggaacagaga 4681 ggttaaggtg ggtacgggag ggtaaggctg tgggtcgcgt gatgcgcatg tcacggaggg 4741 tgacgggggt gaacttggtt cccatttgct cctttcttgt tttaatttat ttatggggat 4801 cctggagttc tggttcctac tgggggcaac cctggtgacc agcaccatct ctcctccttt 4861 tcacagttct ctccttcttc cccccgctgt cagccattcc tgttcccatg agatgatgcc 4921 atgggtctca gcaggggagg gtagagcgga gaaaggaagg gcagcatgcg ggcttcctcc 4981 tggtgtggaa gagctccttg atatcctctt tgagtgaagc tgggagaacc aaaaagaggc 5041 tatgtgagca caaaggtagc ttttcccaaa ctgatctttt catttaggtg aggaagcaaa 5101 agcatctatg tgagaccatt tagcacactg cttgtgaaag gaaagaggct ctggctaaat 5161 tcatgctgct tagatgacat ctgtctagga atcatgtgcc aagcagaggt tgggaggcca 5221 tttgtgttta tatataagcc aaaaaatgct tgcttcaacc ccatgagact cgatagtggt 5281 ggtgaacaga acaaaaggtc attggtggca gagtggattc ttgaacaaac tggaaagtac 5341 gttatgatag tgtcccacgg tgccttgggg acaagagcag gtggattgtg cgtgcatgtg 5401 tgttcatgca cacttgcacc catgtgtagt caggtgcctc aagagaaggc aaccttgact 5461 ctttctattg tttctttcaa tatccccaag cagtgtgatt gtttggctta tatacagaca 5521 gagatggcca tgtattacct gaattttggc tgtgtctccc ttcatccttc tggaataagg 5581 agaatgaaaa ttcttgataa agaagattct gtggtctaaa caaaaaaagg cggtgagcaa 5641 tcctgcaaga gcaaggtaca taaacaagtc ctcagtggtt ggcaactgtt tcaacctgtt 5701 tgaaccaaga accttccagg aaggctaaag ggaaaccgaa tttcacagcc atgattcttt 5761 tgcccacact tgggagcaaa agattctaca aagctctttt gagcatttag actctcgact 5821 ggccaaggtt tggggaagaa cgaagccacc tttgaagaag taaggagtcg tgtatggtag 5881 ggtaagtgag agagggggat gtttccaatg ctttgatccc ttcttactta acctgaagct 5941 agacgagcag gcttcttccc cccaaaact // LOCUS HSU11292 2894 bp mRNA PRI 20-AUG-1994 DEFINITION Human Ki nuclear autoantigen mRNA, complete cds. ACCESSION U11292 NID g510689 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2894) AUTHORS Albertsen,H.M., Smith,S.A., Mazoyer,S.S., Fujimoto,E., Stevens,J., Williams,B., Rodriguez,P., Cropp,C.S., Slijepcevic,P., Carlson,M. and Robertson,M. et al. TITLE A physical map and candidate genes in the BRCA1 region on chromosome 17q12-21 JOURNAL Nature Genet. 7 (4), 472-479 (1994) MEDLINE 95038831 REFERENCE 2 (bases 1 to 2894) AUTHORS Albertsen,H.M. TITLE Direct Submission JOURNAL Submitted (24-JUN-1994) Hans M. Albertsen, Eccles Institute of Human Genetics, University of Utah, Salt Lake City, UT 84112, USA FEATURES Location/Qualifiers source 1..2894 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Stratagene fetal brain cDNA library" /tissue_type="fetal brain, B-lymphocytes" /map="17q12-21; BCRA1 region" /chromosome="17" CDS 163..966 /codon_start=1 /product="Ki nuclear autoantigen" /db_xref="PID:g531167" /translation="MASLLKVDQEVKLKVDSFRERITSKAEDLVANFFPKKLLELDSF LKEPILNIHDLTQIHSDMNLPVPDPILLTNSHDGLDGPTYKKRRLDECEEAFQGTKVF VMPNGMLKSNQQLVDIIEKVKPEIRLLIEKCNTPSGKGPHICFDLQVKMWVQLLIPRI EDGNNFGVSIQEETVAELRTVESEAASYLDQISRYYITRAKLVSKIAKYPHVEDYRRT VTEIDEKEYISLRLIISELRNQYVTLHDMILKNIEKIKRPRSSNAETLY" misc_feature 568..606 /note="splice variant; rare form only observed in fetal transcripts" BASE COUNT 694 a 709 c 667 g 821 t 3 others ORIGIN 1 gggcggacag gcacagaggg agggagcgag cgagcagtga gtaagccagc aagggcggtc 61 gggtcccgag gtcagccgag atttctcagg tccctccggc cccctccctg gagtccacag 121 cgcctccggt gtccagagga tcggacacgg cccggcccgg ccatggcctc gttgctgaag 181 gtggatcagg aagtgaagct caaggttgat tctttcaggg agcggatcac aagtaaggca 241 gaagacttgg tggcaaattt tttcccaaag aagttattag aacttgatag ttttctgaag 301 gaaccaatct taaacatcca tgacctaact cagatccact ctgacatgaa tctcccagtc 361 cctgacccca ttcttctcac caatagccat gatggactgg atggtcccac ttataagaag 421 cgaaggttgg atgagtgtga agaagccttc caaggaacca aggtgtttgt gatgcccaat 481 gggatgctga aaagcaacca gcagctggtg gacattattg agaaagtgaa acctgagatc 541 cggctgttga ttgagaaatg taacacgcct tcaggcaaag gtcctcatat atgttttgac 601 ctccaggtca aaatgtgggt acagctcctg attcccagga tagaagatgg aaacaacttt 661 ggggtgtcca ttcaggagga aacagttgca gagctaagaa ctgttgagag tgaagctgca 721 tcttatctgg accagatttc tagatattat attacaagag ccaaattggt ttctaaaata 781 gctaaatatc cccatgtgga ggactatcgc cgcaccgtga cagagattga tgagaaagaa 841 tatatcagcc ttcggctcat catatcagag ctgaggaatc aatatgtcac tctacatgac 901 atgatcctga aaaatatcga gaagatcaaa cggccccgga gcagcaatgc agagactctg 961 tactgaggcc agggccaggg ccaggggact ctgtgagtct ggctcaagac cgacattgcc 1021 ttggtttgtt acatgactat cgtgatgggg aaactggctg gaaatagtaa tcacacctct 1081 ctgtttttag ttagagtcta atgaaactct catctagttc tgtgatgtgt ttacctcttt 1141 tttcaggcct caggaactct tctatttcct tccctaatac cccacaccca acctgtcgta 1201 atttctggag aactccaggt ttgtgtgtgc aggatgttgg cacaaaaata cctgtgtttt 1261 cattctcccc ctctctccct cctgtgtctg gcgctttatg ttttcttccg tttgataatt 1321 agttggttaa aagctgaggg aaccggaagg aaagtgctag gtgtttttta ggaactaggg 1381 tggagggggg acgaacttct cttcctcaca tgaggttact gtttctttcc tctgtggggc 1441 attggatcct cccacagttg ccctggtgat gacttaggac ttcccatctg tgacatccca 1501 ctttgaatct tgatcgtgac aagaaatacc ttaggccttc agtcaattcc gaagctcctt 1561 cagttgtttt tataatgggc gtttcacatg cacatatgtg tatgcatgta tacgcccata 1621 cagacatgca cacacagact cctactccat tagctaacat accctccctc tccacaaccc 1681 gtgtcacata cctttcagga ggtgacagtt gtcttagttg tcatctaccc agacaaacgt 1741 cctgggcccg tcctccctcc tgatactgta gcctcttggt acccagggtg agttggtgga 1801 gaacagagag atgagaagca gagggcttgg ggaaagcctg ttcctctctg actcagccct 1861 ttttggcatt attgcaagag cttgactcct ggttgccttt tcccagccag ttttcagttg 1921 gggtgaaggt ttctgcaagt gtgaggtcca gatgctgctg ctcatgttgg gctttccttt 1981 tgggaactat ttctctttat ttatagtgtc gggcttccgg ggaaagcaat cattggtgtg 2041 tatgtgtatg tgccatgcac acacgtgcat atatacacat ttgtgtatgt ggaaatgtgc 2101 tgggcaagtc aaaactatag aagagttgcc tcctgtctct cgaatcttcc agagatatca 2161 cttaattgtt aacagctttt gtgttaatcc ccttcatccc ctagcacttt tattctacca 2221 cggctggaga gttgananct acagtcagcc tgccagtgac tcttagtgtc tgtttctgac 2281 ttatttttcc tgtctctgtc ttccaacccc caataatatt tcccaccggg gatgcatcat 2341 ttttactccc aatattctgt agagagggag tcaggatgct gtcttcccac gaatagtact 2401 cagtaacaaa ccaattgcat tttagttggg cagtgctccc acccaccctg cagatccctc 2461 cagctaaaac ccttccccct tccctccatg tgtttctcag tttcccgttc gtttgttgga 2521 ctgttccact gcccctcctc ctcaccctat cacccatgga tcgtaatgta aaattctttt 2581 accatgtcaa gaaattatta aaaatacagg tactttgacc tctttctaaa gccgcagacc 2641 ctggtgcaat gctctggtgg ctagggatgt actcatgctc atatgtgtgc acgcttggac 2701 acccacctcc atggacacct agccaccctg ttgtgtgncc ttatgccagt tgagctgaat 2761 cttttcccca gtatagtgga aagactgagg cttctgccta ctgagcaagg ttgggtgctt 2821 catttgtgtt cagtctgaat tatgggaaag ttagctcttc ccagacctaa gctgccttct 2881 ctccctactt tcag // LOCUS HSU11293 1529 bp mRNA PRI 09-AUG-1995 DEFINITION Human Rab5c-like protein mRNA, complete cds. ACCESSION U11293 NID g508284 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1529) AUTHORS Albertsen,H.M., Smith,S.A., Mazoyer,S.S., Fujimoto,E., Stevens,J., Williams,B., Rodriguez,P., Cropp,C.S., Slijepcevic,P. and Carlson,M. TITLE A physical map and candidate genes in the BRCA1 region on chromosome 17q12-21 JOURNAL Nature Genet. 7 (4), 472-479 (1994) MEDLINE 95038831 REFERENCE 2 (bases 1 to 1529) AUTHORS Albertsen,H.M. TITLE Direct Submission JOURNAL Submitted (24-JUN-1994) Hans M. Albertsen, University of Utah, Eccles Institute of Human Genetics, Building 533, Suite 7410, Salt Lake City, UT 84112, USA FEATURES Location/Qualifiers source 1..1529 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Stratagene fetal brain cDNA library" /map="17q12-21; BCRA1 region" /chromosome="17" CDS 76..726 /note="Rab5c-like protein, similar to Canis familiaris Rab5c protein, PIR Accession Number S38625" /codon_start=1 /db_xref="PID:g508285" /translation="MAGRGGARRPNGPAAGNKICQFKLVLLGESAVGKSSLVLRFVKG QFHEYQESTIGAAFLTQTVCLDDTTVKFEIWDTAGQERYHSLAPMYYRGAQAAIVVYD ITNTDTFARAKNWVKELQRQASPNIVIALAGNKADLASKRAVEFQEAQAYADDNSLLF METSAKTAMNVNEIFMAIAKKLPKNEPQNATGAPGRNRGVDLQENNPASRSQCCSN" polyA_signal 1502..1507 polyA_site 1529 /note="43 A residues" BASE COUNT 355 a 449 c 386 g 339 t ORIGIN 1 cttacactaa gtgcctcttt gcatagcacc agtccccacc cgcacgctct ctggaccact 61 acagctggac gggcaatggc gggtcgggga ggcgcacgac gacccaatgg accagctgct 121 gggaacaaga tctgtcaatt taagctggtt ctgctggggg agtctgcggt aggcaaatcc 181 agcctcgtcc tccgctttgt caagggacag tttcacgagt accaggagag cacaattgga 241 gcggccttcc tcacacagac tgtctgcctg gatgacacaa cagtcaagtt tgagatctgg 301 gacacagctg gacaggagcg gtatcacagc ctggccccca tgtactatcg gggggcccag 361 gctgccatcg tggtctatga catcaccaac acagatacat ttgcacgggc caagaactgg 421 gtgaaggagc tacagaggca ggccagcccc aacatcgtca ttgcactcgc gggtaacaag 481 gcagacctgg ccagcaagag agccgtggaa ttccaggaag cacaagccta tgcagacgac 541 aacagtttgc tgttcatgga gacatcagca aagactgcaa tgaacgtgaa cgaaatcttc 601 atggcaatag ctaagaagct tcccaagaac gagccccaga atgcaactgg tgctccaggc 661 cgaaaccgag gtgtggacct ccaggagaac aacccagcca gccggagcca gtgctgcagc 721 aactgagccc cccttgcctg cccgctgccc ccgcctcctc cgcctgaatg acccgactgg 781 aatccactct aaccaatcgc acttaacgac tcgggccacc actggggggg cagggggagg 841 ggtccaccat gatttctcca tataattttg atcataggcc ggagtgagtc attccacctg 901 cacctttctg tacaaatact aattcaattt taagtcttaa gtcacttttt taatatatat 961 gatcttctgc tcttcccact tcctcccctt tctactgctc tcccattttc ccttgctggg 1021 agtagccaca tgctcttgcc ccccaaccct tgtatatggg gacagtgggg tcagtgcagc 1081 taccctttct ttccctctgc ggaacagcgg acccagcaag agcatccaca tcctcacttt 1141 gttcggagtg gtctttggtt tgggcggtgg ggcagacctt gggaaggggc ttaggaaggg 1201 agaggcagct cttccttcag ctggctctca tcaggctgca gccccctccc cgctcccacc 1261 tccctgctgg gaaaccacag cattatcaca gcattattgt gacagccacg aacccattgc 1321 ccacaacccc tccaccctcg gtcaccccaa cctctggctc tgagccctgt tctgaccaaa 1381 tcatgatgat gagtatttgg gggtgggtgg gtaagggggg gagtgggagg ggacggaacc 1441 aactttttct gtattttgta ttgtatgttt tcttcaacat gtaaccaatc agtatcttgt 1501 caatatagtc agccgatcga tcgacctca // LOCUS HSU11424 1944 bp DNA PRI 11-MAY-1995 DEFINITION Human thiopurine methyltransferase processed pseudogene (pseudoTPMT) gene, complete cds. ACCESSION U11424 NID g805081 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1944) AUTHORS Lee,D., Szumlanski,C., Houtman,J., Honchel,R., Rojas,K., Overhauser,J., Wieben,E.D. and Weinshilboum,R.M. TITLE Thiopurine methyltransferase pharmacogenetics. Cloning of human liver cDNA and a processed pseudogene on human chromosome 18q21.1 JOURNAL Drug Metab. Dispos. 23 (3), 398-405 (1995) MEDLINE 95354518 REFERENCE 2 (bases 1 to 1944) AUTHORS Szumlanski,C.L. TITLE Direct Submission JOURNAL Submitted (27-JUN-1994) Carol L. Szumlanski, Pharmacology, Mayo Medical School/Mayo Clinic/Mayo Foundation, 200 First St. SW, Rochester, MN 55905, USA FEATURES Location/Qualifiers source 1..1944 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lymphocyte genomic library in the lambda dash vector, Stratagene" /chromosome="18" /map="18q21.1" /cell_type="lymphocyte" repeat_unit 64..73 /note="5' direct repeat" gene 168..908 /gene="pseudoTPMT" CDS 168..908 /gene="pseudoTPMT" /EC_number="2.1.1.67" /codon_start=1 /function="S-methylation" /product="thiopurine methyltransferase" /db_xref="PID:g805082" /translation="MDGTRTSLDIEEYSNTEVQKNQVLTLEEWQDKWVNGNTAFHQEQ GPQLLKKHLDTFLKGESGLRVFFPLCRKEVEMKWFADRGHSVVGVEISELGIREFFTE QNLSYSEEPITEIPGTKAFKSSSGNISSYCCSIFDLPRTNIGTFDMIWDRGALVAINP GDRKCYADIMLSLLGKKFQYLLCVFLTIQLNIQVHHFMFHMLKLKGCLVKYAIYIVLR RLMLLKNDIKIGGLTIFLKSYIYLQKSK" polyA_signal 1828..1833 polyA_site 1849 repeat_unit 1880..1889 /note="3' direct repeat" BASE COUNT 644 a 339 c 370 g 591 t ORIGIN 1 taaatcagat ccctgaggac ctccacttga tatcatcttt agtcccatta atgcccttaa 61 aaacaatttg cttgggatga cactagtggc ggaggcaatg gccagcaacc ctctgtaagc 121 gaggcgtgga agacatatgc ttgtgagaaa aaggtgtcta tgaaactatg gatggtacaa 181 gaacttcact tgacattgaa gagtactcca atactgaggt acagaaaaac caagtactaa 241 ctctggaaga atggcaagac aagtgggtga acggcaacac tgcttttcat caggaacaag 301 gacctcagct attaaagaaa catttagata cttttcttaa aggagagagt ggactgaggg 361 tattttttcc tctttgcaga aaagaggttg agatgaaatg gtttgcagac cggggacaca 421 gcgtagttgg tgtggaaatc agtgaacttg ggatacgaga attttttaca gagcagaatc 481 tatcttactc agaagaacca atcaccgaaa ttcctggaac caaagcattt aagagttctt 541 cggggaacat ttcatcatac tgttgcagta tttttgatct tcccaggaca aatattggca 601 catttgacat gatttgggat agaggagcat tagttgccat taatccaggt gatcgcaaat 661 gctatgcgga tataatgtta tccctcctgg gaaagaagtt tcaatatctc ctgtgtgtct 721 ttcttacgat ccaactaaac atccaggtcc accattttat gttccacatg ctgaaattga 781 aaggttgttt ggtaaaatat gcaatataca ttgtcttgag aaggttgatg cttttgaaga 841 atgacataaa aattggggga ttgaccatct ttctgaaaag ttatatctac ttacagaaaa 901 gtaaatgaga catagataaa atcacattga catgtttttg aggaattgaa aattatgcta 961 aagcctgaaa atgtaatgga tgaatttttt aaattgttta taaatcacat gatagatcta 1021 tactaaaaat ggctttttag taaagctgtt tactttttct aaaaaagttt taggagaaaa 1081 agatgtaact aaacttttca agtagctcct ttggagagga gattatgatg tgaaagatta 1141 tgcctgtgtg tcttacagat tgcaagatat tttatcaatc agtgtgtgtt acctgtacaa 1201 ttaaaaaaat attttaaaat gcaatgcata ttaaacataa tacacacaga aaaactggca 1261 tttattttat ttttttgaga tggagtttcg ttctcgttgc ccaacctgga gtgcaatggc 1321 acaatctcag ctcactgcaa cctctgcctc ccaggttcaa gtgattctcc tgcctcagcc 1381 tcccaagtag ctgagattac aggtgtgcgc caccatgccc agctaatttt ttgtattttt 1441 agtagagaca gggtttcacc atgttggtca ggctggtctc gaactccaga cctcaggtga 1501 tctacccacc tcagcctccc aaagtgctgg gattacaggc gtgagccact gtgcctggcc 1561 tgacattctt tatgaaattt agaattgttg aaaaaaatat aacacttcag tagggttcaa 1621 ggtggtccca aaagttatat aaaagattag tttttactat aaacccttgt cttttactca 1681 gatcctagca tcccttttca catggtttct ccatatatgt aacagaatca agaaacaaat 1741 tttaattaaa caatctgtaa cagaatcaag aaacaaataa attttaatta aacaatctat 1801 atggaacaaa cattcccaaa ttctaagaat aaatttttct ttaagtttaa aacaaacaaa 1861 caaaaaaaca aaaaaaaaac aatttgcttt tctgattttg tttagattac tgtttccaag 1921 ttattgcaag tggatgaagt atac // LOCUS HSU11690 4266 bp mRNA PRI 10-DEC-1994 DEFINITION Human faciogenital dysplasia (FGD1) mRNA, complete cds. ACCESSION U11690 NID g595424 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4266) AUTHORS Pasteris,G.N., Cadle,A., Logie,L.J., Porteous,M.E.M., Schwartz,C.E., Stevenson,R.E., Glover,T.W., Wilroy,R.S. and Gorski,J.L. TITLE Isolation and characterization of the faciogenital dysplasia (Aarskog-Scott syndrome) gene: a putative rho/rac guanine nucleotide exchange factor JOURNAL Cell 79, 669-678 (1994) MEDLINE 95042764 REFERENCE 2 (bases 1 to 4266) AUTHORS Pasteris,N.G. TITLE Direct Submission JOURNAL Submitted (29-JUN-1994) Noe G. Pasteris, Pediatric Genetics, University of Michigan, 1150 W. Medical Center Dr., Ann Arbor, MI 48109, USA FEATURES Location/Qualifiers source 1..4266 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pFCF1.1,pFCF3.85" /clone_lib="lambda ZAP II 16 week human fetal craniofacial cDNA library" /chromosome="X" /map="Xp11.21" /tissue_type="craniofacial" /dev_stage="fetal" gene 731..3616 /gene="FGD1" CDS 731..3616 /gene="FGD1" /standard_name="faciogenital dysplasia" /codon_start=1 /function="putative guanine nucleotide exchange factor" /db_xref="PID:g595425" /translation="MHGHRAPGGRRAFGARTPGHEPAGAAPPACADSDPGASEPGLLA RRGSGSALGGPLDPQFVGPSDTSLGAAPGHRVLPCGPSPQHHRALRFSYHLEGSQPRP GLHQGNRILVKSLSLDPGQSLEPHPEGPQRLRSDPGPPTETPSQRPSPLKRAPGPKPQ VPPKPSYLQMPRMPPPLEPIPPPPSRPLPADPRVGKGLAPRAEASPSSAAVSSLIEKF EREPVIVASDRPVPGPSPGPPEPVMLPQPTSQPPVPQLPEGEASRCLFLLAPGPRDGE KVPNRDSGIDSISSPSNSEETCFVSDDGPPSHSLCPGPPALASVPVALADPHRPGSQE VDSDLEEEDDEEEEEEKDREIPVPLMERQESVELTVQQKVFHIANELLQTEKAYVSRL HLLDQVFCARLLEEARNRSSFPADVVHGIFSNICSIYCFHQQFLLPELEKRMEEWDRY PRIGDILQKLAPFLKMYGEYVKNFDRAVELVNTWTERSTQFKVIIHEVQKEEACGNLT LQHHMLEPVQRIPRYELLLKDYLLKLPHGSPDSKDAQKSLELIATAAEHSNAAIRKME RMHKLLKVYELLGGEEDIVSPTKELIKEGHILKLSAKNGTTQDRYLILFNDRLLYCVP RLRLLGQKFSVRARIDVDGMELKESSNLNLPRTFLVSGKQRSLELQARTEEEKKDWVQ AINSTLLKHEQTLETFKLLNSTNREDEDTPPNSPNVDLGKRAPTPIREKEVTMCMRCQ EPFNSITKRRHHCKACGHVVCGKCSEFRARLVYDNNRSNRVCTDCYVALHGVPGSSPA CSQHTPQRRRSILEKQASVAAENSVICSFLHYMEKGGKGWHKAWFVVPENEPLVLYIY GAPQDVKAQRSLPLIGFEVGPPEAGERPDRRHVFKITQSHLSWYFSPETEELQRRWMA VLGRAGRGDTFCPGPTLSEDREMEEAPVAALGATAEPPESPQTRDKT" BASE COUNT 858 a 1442 c 1203 g 763 t ORIGIN 1 cagggctcct tcccgcccgc cgccgccgag acccaggctg aagctgggga ggacggtgga 61 gcccggccca ggccgttctc ctgcccccgc ggctggagca tcgtctggga ctactggctg 121 aaggcgagcc ccgcagccca agcgcaggac acgcgcgccc cgcgccaccg ccgttctctc 181 ttccctcccc tcctctcccc tcccctctcc tccgcccccg gcctcggctg cggcgggagg 241 aagaacgctt cccaaccagc caggagagcc cagatctatt cccattcggg gagaaagaag 301 agaccccgca cttggatccg accgacttct cgccccgggg ccacggaagt ggctcgccgt 361 ccaccggaga gtccccgccg agcgtagccc tccaaccctc gcccaaccgc agccgcatag 421 acagcgctcc ccagcggccc ccggccccgc gagcccactc cccgcgcggg gccggggccg 481 gggccggcga tgcctggcac ggcagcggcg gtggccgcgg ctgctgctgc tgctactgcc 541 accgctgcca cggctgctgg cgcgggcccg ggctgggggc ttgagtctct gcagtggggc 601 tgagcccgca gtcccgcccc ccgatcctga ggccgctgct ccgccgggcg ccgggcctcc 661 tccgcccgcg cctccgcccc aggagctgga gccaagcggg gagctcgggc cccgcgccca 721 ggcccggacc atgcatggcc accgagcccc gggggggcgc cgggccttcg gagcccgaac 781 acccggccac gaacccgccg gcgccgctcc gccggcctgt gccgactcgg accctggagc 841 ctcggaaccc ggactgctgg cgcgcagggg ctcaggttcg gctcttggcg gcccactgga 901 tccccagttt gtcggaccct cggacaccag cctgggcgct gctccaggcc accgggtctt 961 gccctgcggt cccagtccac agcaccaccg ggccctgcgc ttctcttacc acctggaggg 1021 ctcgcagcct cggcctgggc tgcaccaggg aaaccggatc ctggttaaaa gtttgtccct 1081 tgaccctggc caaagcctag agcctcatcc agaaggtccc cagcggcttc gctcagaccc 1141 aggtcccccg actgaaaccc ctagccagcg tccttcacca ctgaagcggg caccgggccc 1201 gaagccacag gtgcccccaa agcccagcta cctgcagatg ccccggatgc cccccccact 1261 ggagcccatc ccccctccac catcacgccc actgcctgcc gacccccgag tcggcaaggg 1321 cctggctccc agggcagagg ccagccccag ttctgcagca gtatcctcac tgattgagaa 1381 gtttgaaaga gagcctgtga ttgtcgcctc ggatagacca gtccctggcc ccagcccagg 1441 tcccccagag ccagtcatgt tgccacagcc aacctcgcag ccaccagtgc cccagctccc 1501 cgagggtgag gcctcccgct gcctgtttct gctggctcct gggccccggg acggtgagaa 1561 ggtgcccaac cgggacagcg gcattgatag catcagctcg ccatccaaca gcgaggagac 1621 ctgcttcgtc agtgatgacg ggccccccag ccacagcctc tgccctgggc cccctgccct 1681 ggctagtgtg cctgttgcct tggccgaccc ccaccggcct ggctcccaag aggttgacag 1741 tgacctggag gaggaggacg acgaggagga ggaggaagag aaggacagag aaatcccagt 1801 gcccctgatg gagagacagg agtctgtgga gttgactgtg cagcaaaagg tgtttcacat 1861 tgccaatgag ctcctgcaaa ctgagaaggc ctacgtttcc aggctccatc tcctggatca 1921 ggtgttctgt gcccggctgc tggaagaagc tcggaaccgc agttccttcc cggccgacgt 1981 tgtccacggc atcttctcta acatctgctc catctattgc ttccaccagc agttcctgct 2041 gcctgagcta gagaagcgca tggaggaatg ggaccgctat ccacgcattg gagacatcct 2101 gcagaaactg gcccccttcc tcaagatgta tggtgagtat gtgaagaact ttgaccgggc 2161 cgtggagctg gtcaacacct ggacagagcg ctccacccag tttaaagtca tcatccatga 2221 ggtgcagaag gaggaagcct gtggcaacct gacattgcag caccacatgc tggagcctgt 2281 gcagcgcatc ccccgctatg agcttcttct caaggactat ctgttaaagc tgccccatgg 2341 ctccccggac agcaaggatg cccaaaagtc tctggagctg atcgccacag cagcagagca 2401 ctcgaatgct gccatccgca aaatggagcg aatgcataag ctgctgaagg tatatgagct 2461 gttagggggc gaggaggaca ttgtcagccc caccaaagag ctcataaaag aaggccacat 2521 ccttaagctg tcagcaaaga atgggaccac tcaagaccga tacctcatac tattcaacga 2581 ccgcctcctt tactgcgtgc ccaggctgcg gctccttggc cagaagttta gcgtgcgggc 2641 acgcattgat gtagatggca tggagctaaa ggagagctcc aacctcaatc tgcctcgaac 2701 cttcctggtg tcaggaaagc agcgctccct cgagctccag gccaggactg aggaggagaa 2761 gaaagactgg gtccaggcca tcaactccac cctcctgaag catgaacaga cgctggagac 2821 tttcaaactg ttgaactcaa caaacaggga agatgaagac accccgccca actctccaaa 2881 cgtggatctt gggaagcggg cacctacgcc catccgggaa aaggaagtca ccatgtgcat 2941 gcgctgccag gagcccttca attctatcac caaacgcagg caccactgca aggcctgcgg 3001 gcatgtggtt tgtgggaagt gctccgagtt ccgggcccgc ctcgtctatg acaacaaccg 3061 ctccaaccgt gtgtgcactg attgctatgt ggccttgcac ggggtgcctg ggagcagtcc 3121 agcctgcagc cagcatacac cccagcgccg gaggtccatc ctggagaaac aggcctcagt 3181 ggctgcagag aacagcgtca tctgcagctt cctgcactac atggagaagg gtggcaaagg 3241 atggcacaag gcatggttcg tggtccctga aaatgaaccc ttggtgctgt atatctacgg 3301 agcccctcag gatgtgaaag cccagcgcag cctgcccctc attggcttcg aggtgggacc 3361 gcccgaggca ggggagcggc ctgacagaag gcatgtcttc aagatcaccc agagccacct 3421 cagctggtac ttcagccctg agacagagga actacagcga cgctggatgg ctgtgcttgg 3481 ccgggcgggc cgaggggaca cgttctgccc ggggcccaca ctgtctgagg acagggagat 3541 ggaggaggca ccggtggctg ctttaggagc cactgctgaa ccccccgaat ccccccagac 3601 ccgagacaag acctagaggg tttgggacaa actgggagcc cccaccccac actctagttg 3661 cccatgtctg attgggggct ctagcccctt ccctcccagc tcagtcaata cttgaactcc 3721 catcacgggc actttcaatc ccgaatgctg ggtcttgggt tttttaattc atcttttcac 3781 aaaaacgtgg gctttttaaa aaatatattc ctacagtgat gtcaattttt attaatccct 3841 gtccccaggg agggtgggag ccgttgccag tcctacctga attaggtcgt ttttctccct 3901 catccctcaa taccctacca cagatcctgc ctccacccag ttccccacaa agcaccagag 3961 gtaggagacc tggattcaag tgccagctct gccactgacc ctggggcccc gacccagccc 4021 tgcctcctca gccaggtctt cacacgtaca actccacgtg ggtgcaacta gacctctcct 4081 gcctctccca tggctacagc tctacccggc cccaaggcta cccagtattt tatcgtccag 4141 acccatggca gggccagcgg gcaggacagg gaaacagggg ggaggacaat ggatactcag 4201 ttttttgtgt ttttttgtgt gttttttttt ttttaagaaa aatacagttt atttcaggct 4261 ttacag // LOCUS HSU11700 6642 bp mRNA PRI 23-MAR-1996 DEFINITION Human copper transporting ATPase mRNA, complete cds. ACCESSION U11700 NID g551501 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 258) AUTHORS Petrukhin,K., Lutsenko,S., Chernov,I., Ross,B.M., Kaplan,J.H. and Gilliam,T.C. TITLE Characterization of the Wilson disease gene encoding a P-type copper transporting ATPase: genomic organization, alternative splicing, and structure/function predictions JOURNAL Hum. Mol. Genet. 3 (9), 1647-1656 (1994) MEDLINE 95135423 REFERENCE 2 (bases 259 to 6641) AUTHORS Tanzi,R.E., Petrukhin,K., Chernov,I., Pellequer,J.L., Wasco,W., Ross,B., Romano,D.M., Parano,E., Pavone,L., Brzustowicz,L.M., Devoto,M., Peppercorn,J., Bush,A.I., Sternlieb,I., Pirastu,M., Gusella,J.F., Evgrafov,O., Penchaszadeh,G.K., Honig,B., Edelman,I.S., Soare,M.B., Scheinberg,I.H. and Gilliam,T.C. TITLE The Wilson disease gene is a copper transporting ATPase with homology to the Menkes disease gene JOURNAL Nature Genet. 5 (4), 344-350 (1993) MEDLINE 94129611 REFERENCE 3 (sites) AUTHORS Petrukhin,K., Fischer,S.G., Pirastu,M., Tanzi,R.E., Chernov,I., Devoto,M., Brzustowicz,L.M., Canyanis,E., Vitale,E., Russo,J.J., Matseoane,D., Boukhgalter,B., Wasco,w., Figus,A.L., Loudianos,J., Cao,A., Sternlieb,I., Evgrafov,O., Parano,E., Pavone,L., Warburton,D., Ott,J., Penchaszadeh,G.K., Scheinberg,I.H. and Gilliam,T.C. TITLE Mapping, cloning and genetic characterization of the region containing the Wilson disease gene JOURNAL Nature Genet. 5 (4), 338-343 (1993) MEDLINE 94129610 REFERENCE 4 (bases 1 to 6642) AUTHORS Petrukhin,K. TITLE Direct Submission JOURNAL Submitted (30-JUN-1994) Konstantin Petrukhin, Psychiatry, Columbia University, 722 West 168th Street, New York, NY 10032, USA FEATURES Location/Qualifiers source 1..6642 /organism="Homo sapiens" /db_xref="taxon:9606" /map="13q14.3" /chromosome="13" 5'UTR 1..162 CDS 163..4560 /codon_start=1 /product="copper transporting ATPase" /db_xref="PID:g551502" /translation="MPEQERQITAREGASRKILSKLSLPTRAWEPAMKKSFAFDNVGY EGGLDGLGPSSQVATSTVRILGMTCQSCVKSIEDRISNLKGIISMKVSLEQDSATVKY VPSVVCLQQVCHQIGDMGFEASIAEGKAASWPSRSLPAQEAVVKLRVEGMTCQSCVSS IEGKVRKLQGVVRVKVSLSNQEAVITYQPYLIQPEDLRDHVNDMGFEAAIKSKVAPLS LGPIDIERLQSTNPKRPLSSANQNFNNSETLGHQGSHVVTLQLRIDGMHCKSCVLNIE ENIGQLLGVQSIQVSLENKTAQVKYDPSCTSPVALQRAIEALPPGNFKVSLPDGAEGS GTDHRSSSSHSPGSPPRNQVQGTCSTTLIAIAGMTCASCVHSIEGMISQLEGVQQISV SLAEGTATVLYNPSVISPEELRAAIEDMGFEASVVSESCSTNPLGNHSAGNSMVQTTD GTPTSVQEVAPHTGRLPANHAPDILAKSPQSTRAVAPQKCFLQIKGMTCASCVSNIER NLQKEAGVLSVLVALMAGKAEIKYDPEVIQPLEIAQFIQDLGFEAAVMEDYAGSDGNI ELTITGMTCASCVHNIESKLTRTNGITYASVALATSKALVKFDPEIIGPRDIIKIIEE IGFHASLAQRNPNAHHLDHKMEIKQWKKSFLCSLVFGIPVMALMIYMLIPSNEPHQSM VLDHNIIPGLSILNLIFFILCTFVQLLGGWYFYVQAYKSLRHRSANMDVLIVLATSIA YVYSLVILVVAVAEKAERSPVTFFDTPPMLFVFIALGRWLEHLAKSKTSEALAKLMSL QATEATVVTLGEDNLIIREEQVPMELVQRGDIVKVVPGGKFPVDGKVLEGNTMADESL ITGEAMPVTKKPGSTVIARSINAHGSVLIKATHVGNDTTLAQIVKLVEEAQMSKAPIQ QLADRFSGYFVPFIIIMSTLTLVVWIVIGFIDFGVVQKYFPNPNKHISQTEVIIRFAF QTSITVLCIACPCSLGLATPTAVMVGTGVAAQNGILIKGGKPLEMAHKIKTVMFDKTG TITHGVPRVMRVLLLGDVATLPLRKVLAVVGTAEASSEHPLGVAVTKYCKEELGTETL GYCTDFQAVPGCGIGCKVSNVEGILAHSERPLSAPASHLNEAGSLPAEKDAVPQTFSV LIGNREWLRRNGLTISSDVSDAMTDHEMKGQTAILVAIDGVLCGMIAIADAVKQEAAL AVHTLQSMGVDVVLITGDNRKTARAIATQVGINKVFAEVLPSHKVAKVQELQNKGKKV AMVGDGVNDSPALAQADMGVAIGTGTDVAIEAADVVLIRNDLLDVVASIHLSKRTVRR IRINLVLALIYNLVGIPIAAGVFMPIGIVLQPWMGSAAMAASSVSVVLSSLQLKCYKK PDLERYEAQAHGHMKPLTASQVSVHIGMDDRWRDSPRATPWDQVSYVSQVSLSSLTSD KPSRHSAAADDDGDKWSLLLNGRDEEQYI" 3'UTR 4561..6642 polyA_site 6642 /note="12 A nucleotides" BASE COUNT 1539 a 1710 c 1745 g 1648 t ORIGIN 1 ttcccggacc cctgtttgct ttagagccga gccgcgccgc gccgatgccc tcacactctg 61 cgcctcctct cccgggactt taacaccacg ctctcctcca ccgaccaggt gaccttttgc 121 tctgagccag atcagagaag aattcggtgt ccgtgcggga cgatgcctga gcaggagaga 181 cagatcacag ccagagaagg ggccagtcgg aaaatcttat ctaagctttc tttgcctacc 241 cgtgcctggg aaccagcaat gaagaagagt tttgcttttg acaatgttgg ctatgaaggt 301 ggtctggatg gcctgggccc ttcttctcag gtggccacca gcacagtcag gatcttgggc 361 atgacttgcc agtcatgtgt gaagtccatt gaggacagga tttccaattt gaaaggcatc 421 atcagcatga aggtttccct ggaacaagac agtgccactg tgaaatatgt gccatcggtt 481 gtgtgcctgc aacaggtttg ccatcaaatt ggggacatgg gcttcgaggc cagcattgca 541 gaaggaaagg cagcctcctg gccctcaagg tccttgcctg cccaggaggc tgtggtcaag 601 ctccgggtgg agggcatgac ctgccagtcc tgtgtcagct ccattgaagg caaggtccgg 661 aaactgcaag gagtagtgag agtcaaagtc tcactcagca accaagaggc cgtcatcact 721 tatcagcctt atctcattca gcccgaagac ctcagggacc atgtaaatga catgggattt 781 gaagctgcca tcaagagcaa agtggctccc ttaagcctgg gaccaattga tattgagcgg 841 ttacaaagca ctaacccaaa gagaccttta tcttctgcta accagaattt taataattct 901 gagaccttgg ggcaccaagg aagccatgtg gtcaccctcc aactgagaat agatggaatg 961 cattgtaagt cttgcgtctt gaatattgaa gaaaatattg gccagctcct aggggttcaa 1021 agtattcaag tgtccttgga gaacaaaact gcccaagtaa agtatgaccc ttcttgtacc 1081 agcccagtgg ctctgcagag ggctatcgag gcacttccac ctgggaattt taaagtttct 1141 cttcctgatg gagccgaagg gagtgggaca gatcacaggt cttccagttc tcattcccct 1201 ggctccccac cgagaaacca ggtccagggc acatgcagta ccactctgat tgccattgcc 1261 ggcatgacct gtgcatcctg tgtccattcc attgaaggca tgatctccca actggaaggg 1321 gtgcagcaaa tatcggtgtc tttggccgaa gggactgcaa cagttcttta taatccctct 1381 gtaattagcc cagaagaact cagagctgct atagaagaca tgggatttga ggcttcagtc 1441 gtttctgaaa gctgttctac taaccctctt ggaaaccaca gtgctgggaa ttccatggtg 1501 caaactacag atggtacacc tacatctgtg caggaagtgg ctccccacac tgggaggctc 1561 cctgcaaacc atgccccgga catcttggca aagtccccac aatcaaccag agcagtggca 1621 ccgcagaagt gcttcttaca gatcaaaggc atgacctgtg catcctgtgt gtctaacata 1681 gaaaggaatc tgcagaaaga agctggtgtt ctctccgtgt tggttgcctt gatggcagga 1741 aaggcagaga tcaagtatga cccagaggtc atccagcccc tcgagatagc tcagttcatc 1801 caggacctgg gttttgaggc agcagtcatg gaggactacg caggctccga tggcaacatt 1861 gagctgacaa tcacagggat gacctgcgcg tcctgtgtcc acaacataga gtccaaactc 1921 acgaggacaa atggcatcac ttatgcctcc gttgcccttg ccaccagcaa agcccttgtt 1981 aagtttgacc cggaaattat cggtccacgg gatattatca aaattattga ggaaattggc 2041 tttcatgctt ccctggccca gagaaacccc aacgctcatc acttggacca caagatggaa 2101 ataaagcagt ggaagaagtc tttcctgtgc agcctggtgt ttggcatccc tgtcatggcc 2161 ttaatgatct atatgctgat acccagcaac gagccccacc agtccatggt cctggaccac 2221 aacatcattc caggactgtc cattctaaat ctcatcttct ttatcttgtg tacctttgtc 2281 cagctcctcg gtgggtggta cttctacgtt caggcctaca aatctctgag acacaggtca 2341 gccaacatgg acgtgctcat cgtcctggcc acaagcattg cttatgttta ttctctggtc 2401 atcctggtgg ttgctgtggc tgagaaggcg gagaggagcc ctgtgacatt cttcgacacg 2461 ccccccatgc tctttgtgtt cattgccctg ggccggtggc tggaacactt ggcaaagagc 2521 aaaacctcag aagccctggc taaactcatg tctctccaag ccacagaagc caccgttgtg 2581 acccttggtg aggacaattt aatcatcagg gaggagcaag tccccatgga gctggtgcag 2641 cggggcgata tcgtcaaggt ggtccctggg ggaaagtttc cagtggatgg gaaagtcctg 2701 gaaggcaata ccatggctga tgagtccctc atcacaggag aagccatgcc agtcactaag 2761 aaacccggaa gcactgtaat tgcgaggtct ataaatgcac atggctctgt gctcattaaa 2821 gctacccacg tgggcaatga caccactttg gctcagattg tgaaactggt ggaagaggct 2881 cagatgtcaa aggcacccat tcagcagctg gctgaccggt ttagtggata ttttgtccca 2941 tttatcatca tcatgtcaac tttgacgttg gtggtatgga ttgtaatcgg ttttatcgat 3001 tttggtgttg ttcagaaata ctttcctaac cccaacaagc acatctccca gacagaggtg 3061 atcatccggt ttgctttcca gacgtccatc acggtgctgt gcattgcctg cccctgctcc 3121 ctggggctgg ccacgcccac ggctgtcatg gtgggcaccg gggtggccgc gcagaacggc 3181 atcctcatca agggaggcaa gcccctggag atggcgcaca agataaagac tgtgatgttt 3241 gacaagactg gcaccattac ccatggcgtc cccagggtca tgcgggtgct cctgctgggg 3301 gatgtggcca cactgcccct caggaaggtt ctggctgtgg tggggactgc ggaggccagc 3361 agtgaacacc ccttgggcgt ggcagtcacc aaatactgta aagaggaact tggaacagag 3421 accttgggat actgcacgga cttccaggca gtgccaggct gtggaattgg gtgcaaagtc 3481 agcaacgtgg aaggcatcct ggcccacagt gagcgccctt tgagtgcacc ggccagtcac 3541 ctgaatgagg ctggcagcct tcccgcagaa aaagatgcag tcccccagac cttctctgtg 3601 ctgattggaa accgtgagtg gctgaggcgc aacggtttaa ccatttctag cgatgtcagc 3661 gacgctatga cagaccacga gatgaaagga cagacagcca tcctggtggc tattgacggt 3721 gtgctctgtg ggatgatcgc aatcgcagac gctgtcaagc aggaggctgc cctggctgtg 3781 cacacgctgc agagcatggg tgtggacgtg gttctgatca cgggggacaa ccggaagaca 3841 gccagagcta ttgccaccca ggttggcatc aacaaagtct ttgcagaggt gctgccttcg 3901 cacaaggtgg ccaaggtcca ggagctccag aataaaggga agaaagtcgc catggtgggg 3961 gatggggtca atgactcccc ggccttggcc caggcagaca tgggtgtggc cattggcacc 4021 ggcacggatg tggccatcga ggcagccgac gtcgtcctta tcagaaatga tttgctggat 4081 gtggtggcta gcattcacct ttccaagagg actgtccgaa ggatacgcat caacctggtc 4141 ctggcactga tttataacct ggttgggata cccattgcag caggtgtctt catgcccatc 4201 ggcattgtgc tgcagccctg gatgggctca gcggccatgg cagcctcctc tgtgtctgtg 4261 gtgctctcat ccctgcagct caagtgctat aagaagcctg acctggagag gtatgaggca 4321 caggcgcatg gccacatgaa gcccctgacg gcatcccagg tcagtgtgca cataggcatg 4381 gatgacaggt ggcgggactc ccccagggcc acaccatggg accaggtcag ctatgtcagc 4441 caggtgtcgc tgtcctccct gacgtccgac aagccatctc ggcacagcgc tgcagcagac 4501 gatgatgggg acaagtggtc tctgctcctg aatggcaggg atgaggagca gtacatctga 4561 tgacttcagg caggcggccc ggggcaggga cttgcctcca ctcaccacaa gctgagcagg 4621 acagccagca gcaggatggg ctgagctagc ctccagcttt ggggacttcc gctccctgga 4681 tatgtccagt catcctgccc tgcagcacgc ggccttgtct gggtgcagct gggcttggcc 4741 tggagaggac ggccctgcct gcctcttggc ctcacgggac cgtcagcatg ggctttgtct 4801 tggactctag tccttggctg gactgtagaa ggtgagaggc gagtcaccct cctcacagac 4861 ctctgcttgg agtatttagg atgactgctg tgaaatggag aacagtttca tcaggaccaa 4921 aaaacctcac tgggcctttc cagagaactg cagacctcac tgtcagggtc tttctgatga 4981 cgcctgtctg tgtgcatcat gtttctgaga ccacagttta cctcaggtgt gcctgttgct 5041 ttcttcctgc atagtctgtt cctttcttcg tacatagtct gttccttttc tctcctgtgt 5101 gcttgtcagt ggggacccct cgcaaccctg cctgtcacct gggagggtgg gaccaatgtc 5161 cttgtggtct ttgctgctgc tctcaggcgc ttctccaatg ctctggagtg tgcatttcag 5221 cttgaacctg cttcctggct cacacatccc cagccaggga gcttgccaca ctcttcttca 5281 agttgaggag agttcttttt tgcttaaagc ccccttctcc atggagtgtt ggcttctcaa 5341 tagagtgttg ttgctgacca gctggagtga gggcctcaga gcctgacctg agagtccgta 5401 ctcggcttcc tgtggggtgt aggttctcgc gattcaggac gtccttccat atccctgccc 5461 agcctgtggt gcttgaaacg tttgccccat gggaaacgta tgtgtgcagg agcctccctg 5521 cacggcccaa ggggcttcgt tttcagtctt ctgactgtca cctcgtgggg ttcagtagag 5581 aattcaatta ctagcgcctg gccttgtgtg gcttggagga aatggtactg cccaaatagg 5641 aggaaaacac agcctccctg agcctgcatt ctgcacgctg cccaggggct tcagaaaagg 5701 agtggccaca gcaccccgaa gggagcatct atttacctgg cagtggctct cagagcagca 5761 gaacgggttc agttttagac tctgaagttg gttgtgattg acagaaccct ttgggagcaa 5821 actagtagag ttggattaaa ttctgggtga aacccttttc tcccacacaa aatagtttta 5881 gtgatttttt tcattgtcca ttacttgcca ggggcagttt tagcagcact tttgatagat 5941 tacgtctaat cctcccaacc aaccagcagg gtagctatta ctgtccacat tttacaggca 6001 aggaaacagg ctccaagagg ctgaggactt tgcccaggat gacatagcca atggacaagc 6061 agtgtctgtc agctgtgaag gcttcactct tattgtcctt ctaccttgaa tagaagtttt 6121 cctgataaga ataaacgagg aaaaggtcct tgcctcctgg aagaacaaat ctaccaggtg 6181 atctattcat tgtttcaact cagaatgcac ttgattcagg aggtcatctg accttcacct 6241 tggatggtta gtttcacttt ttacatatag tttttgcagg gttttatttt ataaaatcca 6301 agcgcgctgt tgattgtgtt ttccttgttt tcagcccccc gactccagcc cgcagcacat 6361 ttccgctgtc cgtcagtaat tgtgtcctct ctttatgctt gcttggggaa tgttgttttc 6421 tgactaggct gatcattatc taaagaatct aattctgttg atttttaaaa cttttaggac 6481 cataaacgtt gtgttcatat atggacatgg aaatatttat ataattttat agaaaataac 6541 cttttagatg gtcaaagtgt aaggagtttt tttgtcagat aatcatttct acttcaaaaa 6601 catttcatgc aatattagaa taaagttcct gtcattcctc ta // LOCUS HSU11701 1887 bp mRNA PRI 24-SEP-1996 DEFINITION Human LIM-homeobox domain protein (hLH-2) mRNA, complete cds. ACCESSION U11701 NID g600494 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1887) AUTHORS Wu,H.K., Heng,H.H., Siderovski,D.P., Dong,W.F., Okuno,Y., Shi,X.M., Tsui,L.C. and Minden,M.D. TITLE Identification of a human LIM-Hox gene, hLH-2, aberrantly expressed in chronic myelogenous leukaemia and located on 9q33-34.1 JOURNAL Oncogene 12 (6), 1205-1212 (1996) MEDLINE 96226351 REFERENCE 2 (bases 1 to 1887) AUTHORS Wu,H.-K. TITLE Direct Submission JOURNAL Submitted (30-JUN-1994) H.-K. Wu, Department of Medicine, Ontario Cancer Institute/Princess Margaret Hospital, 500 Sherbourne Street, Toronto, Ontario M4X 1K9, Canada FEATURES Location/Qualifiers source 1..1887 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hlh2/6-3" /clone_lib="Clontech lambda gt11 human placental cDNA library" /map="9q33-34.1" /tissue_type="placenta" /chromosome="9" gene 489..1760 /gene="hLH-2" CDS 489..1760 /gene="hLH-2" /standard_name="human LH-2" /codon_start=1 /function="transcription factor" /product="LIM-homeobox domain protein" /db_xref="PID:g508712" /translation="MLFHSLSGPEVHGVIDEMDRRQERGSRISSAIDRGDTETTMPSI SSDRAALCGGCGGKISDRYYLLAVDKQWHMRCLKCCECKLNLESELTCFSKDGSIYCK EDYYRRFSVQRCARCHLGISASEMVMRARDLVYHLNCFTCTTCNKMLTTGDHFGMKDS LVYCRLHFEALLQGEYPAHFNHADVQAARARAAAKSAGLGAAGANPLGLPYYNGVGTV QKGRPRKRKSPGPGADLAAYTRALSCNENDAEHLDRDQPYPSSQKTKRMRTSFKHHQL RTMKSYFAINHNPDAKDLKQLAQKTGLTKRVLQVWFQNARAKFRRNLLRQENTGVDKS TDAALQTGTPSGPASELSNASLSPSSTPTTLTDLTSPTLPTVTSVLTSVPGNLEAMSL TAPHKRLLPTFSNDSQPPHPTISLKKKLSLV" BASE COUNT 370 a 636 c 573 g 308 t ORIGIN 1 gcttgaaatc gaattcggga ttcggggggg acgcaccagg gagggagggg tccaggcagc 61 tgggccgccg cggacaccta gcggcttcag ggtgaacccc gaccgcagcc gtcgccgcct 121 cgggcagagt ttgcgccctt gctttgcgcc ccgctgcgaa gccgggcggg cgatcggcgc 181 gtgaaagcgc cgcgcgggcg acctctgtcc tagtctcctg ctccccccgc cccgcttgtc 241 ccgtgccctt gtgacccagg ctttggcgcc gtcgccaggc cccgcaatgt agctgcccct 301 gcgcctcggc ggaggctcct gccccgcgag cgcccggggc ccggagccgg cctgggggct 361 cagccgagct cgggcggggc cggggcgcgg tggcgatgca ccgggccgtt agcgccagga 421 gccaggcagc tgaggcgggg ggcaagcctc cctcggagag ccgcgccccc ggcccgcgtc 481 ccgccgcgat gctgttccac agtctgtcgg gccccgaggt gcacggggtc atcgacgaga 541 tggaccgcag gcaagagcga ggctcccgca tcagctccgc catcgaccgc ggcgacaccg 601 agacgaccat gccgtccatc agcagtgacc gcgccgccct ttgtggcggc tgtggcggca 661 agatctcgga ccgctactac ctgctggcgg tggacaagca gtggcacatg cgctgcctca 721 agtgctgcga gtgcaagctc aacctggagt cggagctcac ctgtttcagc aaggacggta 781 gcatctactg caaggaagac tactaccggc gcttctctgt gcagcgctgc gcccgctgcc 841 acctgggcat ctcggcctcg gagatggtga tgcgcgctcg ggacttggtt tatcacctca 901 actgcttcac gtgcaccacg tgtaacaaga tgctgaccac gggcgaccac ttcggcatga 961 aggacagcct ggtctactgc cgcttgcact tcgaggcgct gctgcagggc gagtaccccg 1021 cacacttcaa ccatgccgac gtgcaggcgg cgcgtgcacg cgcggcggcc aagagcgcgg 1081 ggctgggcgc agcaggggcc aaccctctgg gtcttcccta ctacaatggc gtgggcactg 1141 tgcagaaggg gcggccgagg aaacgtaaga gtccgggccc cggtgcggat ctggcggcct 1201 acacacgtgc gctaagctgc aacgaaaacg acgcagagca cctggaccgt gaccagccat 1261 accccagcag ccagaagacc aagcgcatgc gcacgtcctt caagcaccac cagcttcgga 1321 ccatgaagtc ttactttgcc attaaccaca atcccgatgc caaggacttg aagcagctcg 1381 cgcaaaagac gggcctcacc aagcgggtcc tccaggtctg gttccagaac gcccgagcca 1441 agttcaggcg caacctctta cggcaggaaa acacgggcgt ggacaagtcg acagatgcgg 1501 cgctgcagac agggacgcca tcgggcccgg cctcggagct ctccaacgcc tcgctcagcc 1561 cctccagcac gcccaccacc ctgacagact tgactagccc caccctgcca actgtgacgt 1621 ccgtcttaac ttctgtgcct ggcaacctgg aggccatgag cctcacagcc cctcacaaac 1681 gactcttacc aaccttttct aatgactcgc aaccccctca ccccacaatt tctttaaaaa 1741 agaaattatc tttagtttga attccaagtg tattttaaaa tagaggcttt gagcaactaa 1801 ctaaccacat tttaggatct cgcctggaaa cagaggtaaa aaaaagaagt gtgcgcccgg 1861 ctaatgcagc ggtgtggacc ggaattc // LOCUS HSU11717 3769 bp mRNA PRI 28-JAN-1996 DEFINITION Human calcium activated potassium channel (hslo) mRNA, complete cds. ACCESSION U11717 NID g606875 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3769) AUTHORS Tseng-Crank,J., Foster,C.D., Krause,J.D., Mertz,R., Godinot,N., DiChiara,T.J. and Reinhart,P.H. TITLE Cloning, expression, and distribution of functionally distinct Ca(2+)-activated K+ channel isoforms from human brain JOURNAL Neuron 13 (6), 1315-1330 (1994) MEDLINE 95085775 REFERENCE 2 (bases 1 to 3769) AUTHORS Tseng-Crank,J. TITLE Direct Submission JOURNAL Submitted (30-JUN-1994) Julie Tseng-Crank, Molecular Genetics, Glaxo Research Institute, 5 Moore Drive, Research Triangle Park, NC 27709, USA FEATURES Location/Qualifiers source 1..3769 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hbr1" /map="10q22" /chromosome="10" gene 25..3489 /gene="hslo" CDS 25..3489 /gene="hslo" /codon_start=1 /product="calcium activated potassium channel" /db_xref="PID:g606876" /translation="MSSNIHANHLSLDASSSSSSSSSSSSSSSSSSSSSSVHEPKMDA LIIPVTMEVPCDSRGQRMWWAFLASSMVTFFGGLFIILLWRTLKYLWTVCCHCGGKTK EAQKINNGSSQADGTLKPVDEKEEAVAAEVGWMTSVKDWAGVMISAQTLTGRVLVVLV FALSIGALVIYFIDSSNPIESCQNFYKDFTLQIDMAFNVFFLLYFGLRFIAANDKLWF WLEVNSVVDFFTVPPVFVSVYLNRSWLGLRFLRALRLIQFSEILQFLNILKTSNSIKL VNLLSIFISTWLTAAGFIHLVENSGDPWENFQNNQALTYWECVYLLMVTMSTVGYGDV YAKTTLGRLFMVFFILGGLAMFASYVPEIIELIGNRKKYGGSYSAVSGRKHIVVCGHI TLESVSNFLKDFLHKDRDDVNVEIVFLHNISPNLELEALFKRHFTQVEFYQGSVLNPH DLARVKIESADACLILANKYCADPDAEDASNIMRVISIKNYHPKIRIITQMLQYHNKA HLLNIPSWNWKEGDDAICLAELKLGFIAQSCLAQGLSTMLANLFSMRSFIKIEEDTWQ KYYLEGVSNEMYTEYLSSAFVGLSFPTVCELCFVKLKLLMIAIEYKSANRESRILINP GNHLKIQEGTLGFFIASDAKEVKRAFFYCKACHDDITDPKRIKKCGCKRLEDEQPSTL SPKKKQRNGGMRNSPNTSPKLMRHDPLLIPGNDQIDNMDSHVKKYDSTGMFHWCAPKE IEKVILTRSEAAMTVLSGHVVVCIFGDVSSALIGLRNLVMPLRASNFHYHELKHIVFV GSIEYLKREWETLHNFPKVSILPGTPLSRADLRAVNINLCDMCVILSANQNNIDDTSL QDKECILASLNIKSMQFDDSIGVLQANSQGFTPPGMDRSSPDNSPVHGMLRQPSITTG VNIPIITELVNDTNVQFLDQDDDDDPDTELYLTQPFACGTAFAVSVLDSLMSATYFND NILTLIRTLVTGGATPELEALIAEENALRGGYSTPQTLANRDRCRVAQLALLDGPFAD LGDGGCYGDLFCKALKTYNMLCFGIYRLRDAHLSTPSQCTKRYVITNPPYEFELVPTD LIFCLMQFDHNAGQSRASLSHSSHSSQSSSKKSSSVHSIPSTANRQNRPKSRESRDKQ KYVQEERL" BASE COUNT 950 a 976 c 888 g 955 t ORIGIN 1 ggcggcggag gcagcagtct tagaatgagt agcaatatcc acgcgaacca tctcagccta 61 gacgcgtcct cctcctcctc ctcctcctct tcctcttctt cttcttcctc ctcctcttcc 121 tcctcgtcct cggtccacga gcccaagatg gatgcgctca tcatcccggt gaccatggag 181 gtgccgtgcg acagccgggg ccaacgcatg tggtgggctt tcctggcctc ctccatggtg 241 actttcttcg ggggcctctt catcatcttg ctctggcgga cgctcaagta cctgtggacc 301 gtgtgctgcc actgcggggg caagacgaag gaggcccaga agattaacaa tggctcaagc 361 caggcggatg gcactctcaa accagtggat gaaaaagagg aggcagtggc cgccgaggtc 421 ggctggatga cctccgtgaa ggactgggcg ggggtgatga tatccgccca gacactgact 481 ggcagagtcc tggttgtctt agtctttgct ctcagcatcg gtgcacttgt aatatacttc 541 atagattcat caaacccaat agaatcctgc cagaatttct acaaagattt cacattacag 601 atcgacatgg ctttcaacgt gttcttcctt ctctacttcg gcttgcggtt tattgcagcc 661 aacgataaat tgtggttctg gctggaagtg aactctgtag tggatttctt cacggtgccc 721 cccgtgtttg tgtctgtgta cttaaacaga agttggcttg gtttgagatt tttaagagct 781 ctgagactga tacagttttc agaaattttg cagtttctga atattcttaa aacaagtaat 841 tccatcaagc tggtgaatct gctctccata tttatcagca cgtggctgac tgcagccggg 901 ttcatccatt tggtggagaa ttcaggggac ccatgggaaa atttccaaaa caaccaggct 961 ctcacctact gggaatgtgt ctatttactc atggtcacaa tgtccaccgt tggttatggg 1021 gatgtttatg caaaaaccac acttgggcgc ctcttcatgg tcttcttcat cctcggggga 1081 ctggccatgt ttgccagcta cgtccctgaa atcatagagt taataggaaa ccgcaagaaa 1141 tacgggggct cctatagtgc ggttagtgga agaaagcaca ttgtggtctg cggacacatc 1201 actctggaga gtgtttccaa cttcctgaag gactttctgc acaaggaccg ggatgacgtc 1261 aatgtggaga tcgtttttct tcacaacatc tcccccaacc tggagcttga agctctgttc 1321 aaacgacatt ttactcaggt ggaattttat cagggttccg tcctcaatcc acatgatctt 1381 gcaagagtca agatagagtc agcagatgca tgcctgatcc ttgccaacaa gtactgcgct 1441 gacccggatg cggaggatgc ctcgaatatc atgagagtaa tctccataaa gaactaccat 1501 ccgaagataa gaatcatcac tcaaatgctg cagtatcaca acaaggccca tctgctaaac 1561 atcccgagct ggaattggaa agaaggtgat gacgcaatct gcctcgcaga gttgaagttg 1621 ggcttcatag cccagagctg cctggctcaa ggcctctcca ccatgcttgc caacctcttc 1681 tccatgaggt cattcataaa gattgaggaa gacacatggc agaaatacta cttggaagga 1741 gtctcaaatg aaatgtacac agaatatctc tccagtgcct tcgtgggtct gtccttccct 1801 actgtttgtg agctgtgttt tgtgaagctc aagctcctaa tgatagccat tgagtacaag 1861 tctgccaacc gagagagccg tatattaatt aatcctggaa accatcttaa gatccaagaa 1921 ggtactttag gatttttcat cgcaagtgat gccaaagaag ttaaaagggc atttttttac 1981 tgcaaggcct gtcatgatga catcacagat cccaaaagaa taaaaaaatg tggctgcaaa 2041 cggcttgaag atgagcagcc gtcaacacta tcaccaaaaa aaaagcaacg gaatggaggc 2101 atgcggaact cacccaacac ctcgcctaag ctgatgaggc atgacccctt gttaattcct 2161 ggcaatgatc agattgacaa catggactcc catgtgaaga agtacgactc tactgggatg 2221 tttcactggt gtgcacccaa ggagatagag aaagtcatcc tgactcgaag tgaagctgcc 2281 atgaccgtcc tgagtggcca tgtcgtggtc tgcatctttg gcgacgtcag ctcagccctg 2341 atcggcctcc ggaacctggt gatgccgctc cgtgccagca actttcatta ccatgagctc 2401 aagcacattg tgtttgtggg ctctattgag tacctcaagc gggaatggga gacgcttcat 2461 aacttcccca aagtgtccat attgcctggt acgccattaa gtcgggctga tttaagggct 2521 gtcaacatca acctctgtga catgtgcgtt atcctgtcag ccaatcagaa taatattgat 2581 gatacttcgc tgcaggacaa ggaatgcatc ttggcgtcac tcaacatcaa atctatgcag 2641 tttgatgaca gcatcggagt cttgcaggct aattcccaag ggttcacacc tccaggaatg 2701 gatagatcct ctccagataa cagcccagtg cacgggatgt tacgtcaacc atccatcaca 2761 actggggtca acatccccat catcactgaa ctagtgaacg atactaatgt tcagtttttg 2821 gaccaagacg atgatgatga ccctgataca gaactgtacc tcacgcagcc ctttgcctgt 2881 gggacagcat ttgccgtcag tgtcctggac tcactcatga gcgcgacgta cttcaatgac 2941 aatatcctca ccctgatacg gaccctggtg accggaggag ccacgccgga gctggaggct 3001 ctgattgctg aggaaaacgc ccttagaggt ggctacagca ccccgcagac actggccaat 3061 agggaccgct gccgcgtggc ccagttagct ctgctcgatg ggccatttgc ggacttaggg 3121 gatggtggtt gttatggtga tctgttctgc aaagctctga aaacatataa tatgctttgt 3181 tttggaattt accggctgag agatgctcac ctcagcaccc ccagtcagtg cacaaagagg 3241 tatgtcatca ccaacccgcc ctatgagttt gagctcgtgc cgacggacct gatcttctgc 3301 ttaatgcagt ttgaccacaa tgccggccag tcccgggcca gcctgtccca ttcctcccac 3361 tcgtcgcagt cctccagcaa gaagagctcc tctgttcact ccatcccatc cacagcaaac 3421 cgacagaacc ggcccaagtc cagggagtcc cgggacaaac agaagtacgt gcaggaagag 3481 cggctttgat atgtgtatcc accgccactg tgtgaaactg tatctgccac tcatttcccc 3541 agttggtgtt tccaacaaag taactttccc tgttttcccc tgtagtcccc cccttttttt 3601 ttacacatat ttgcatatgt atgatagtgt gcatgtggtt gtcattttta tttcaccacc 3661 ataaaaccct tgagcacaac agcaaataag caggacgggc ccaaagttat ttatgattct 3721 ggggggaaaa taacccaaag gcatgctcca gacataaata gctcactgc // LOCUS HSU11732 1580 bp mRNA PRI 18-JUL-1994 DEFINITION Human ets-like gene (tel) mRNA, complete cds. ACCESSION U11732 NID g511282 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1580) AUTHORS Golub,T.R., Barker,G.F., Lovett,M. and Gilliland,D.G. TITLE Fusion of PDGF receptor beta to a novel ets-like gene, tel, in chronic myelomonocytic leukemia with t(5;12) chromosomal translocation JOURNAL Cell 77 (2), 307-316 (1994) MEDLINE 94221647 REFERENCE 2 (bases 1 to 1580) AUTHORS Gilliland,D. TITLE Direct Submission JOURNAL Submitted (01-JUL-1994) D. Gary Gilliland, Hematology/Oncology, Brigham and Women's Hospital, 75 Francis Street, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..1580 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="tel-452" /clone_lib="K562 lambda gt11" /map="12p13" /chromosome="12" /cell_line="K562" /tissue_type="leukemia" gene 25..1383 /gene="tel" CDS 25..1383 /gene="tel" /note="t(5;12) translocation breakpoint occurs after nucleotide 487" /codon_start=1 /db_xref="PID:g511283" /translation="MSETPAQCSIKQERISYTPPESPVPSYASSTPLHVPVPRALRME EDSIRLPAHLRLQPIYWSRDDVAQWLKWAENEFSLRPIDSNTFEMNGKALLLLTKEDF RYRSPHSGDVLYELLQHILKQRKPRILFSPFFHPGNSIHTQPEVILHQNHEEDNCVQR TPRPSVDNVHHNPPTIELLHRSRSPITTNHRPSPDPEQRPLRSPLDNMIRRLSPAERA QGPRPHQENNHQESYPLSVSPMENNHCPASSESHPKPSSPRQESTRVIQLMPSPIMHP LILNPRHSVDFKQSRLSEDGLHREGKPINLSHREDLAYMNHIMVSVSPPEEHAMPIGR IADCRLLWDYVYQLLSDSRYENFIRWEDKESKIFRIVDPNGLARLWGNHKNRTNMTYE KMSRALRHYYKLNIIRKEPGQRLLFRFMKTPDEIMSGRTDRLEHLESQELDEQIYQED EC" BASE COUNT 419 a 471 c 386 g 304 t ORIGIN 1 tcctgatctc tctcgctgtg agacatgtct gagactcctg ctcagtgtag cattaagcag 61 gaacgaattt catatacacc tccagagagc ccagtgccga gttacgcttc ctcgacgcca 121 cttcatgttc cagtgcctcg agcgctcagg atggaggaag actcgatccg cctgcctgcg 181 cacctgcgct tgcagccaat ttactggagc agggatgacg tagcccagtg gctcaagtgg 241 gctgaaaatg agttttcttt aaggccaatt gacagcaaca cgtttgaaat gaatggcaaa 301 gctctcctgc tgctgaccaa agaggacttt cgctatcgat ctcctcattc aggtgatgtg 361 ctctatgaac tccttcagca tattctgaag cagaggaaac ctcggattct tttttcacca 421 ttcttccacc ctggaaactc tatacacaca cagccggagg tcatactgca tcagaaccat 481 gaagaagata actgtgtcca gaggaccccc aggccatccg tggataatgt gcaccataac 541 cctcccacca ttgaactgtt gcaccgctcc aggtcaccta tcacgacaaa tcaccggcct 601 tctcctgacc ccgagcagcg gcccctccgg tcccccctgg acaacatgat ccgccgcctc 661 tccccggctg agagagctca gggacccagg ccgcaccagg agaacaacca ccaggagtcc 721 taccctctgt cagtgtctcc catggagaat aatcactgcc cagcgtcctc cgagtcccac 781 ccgaagccat ccagcccccg gcaggagagc acacgcgtga tccagctgat gcccagcccc 841 atcatgcacc ctctgatcct gaacccccgg cactccgtgg atttcaaaca gtccaggctc 901 tccgaggacg ggctgcatag ggaagggaag cccatcaacc tctctcatcg ggaagacctg 961 gcttacatga accacatcat ggtctctgtc tccccgcctg aagagcacgc catgcccatt 1021 gggagaatag cagactgtag actgctttgg gattacgtct atcagttgct ttctgacagc 1081 cggtacgaaa acttcatccg atgggaggac aaagaatcca aaatattccg gatagtggat 1141 cccaacggac tggctcgact gtggggaaac cataagaaca gaacaaacat gacctatgag 1201 aaaatgtcca gagccctgcg ccactactac aaactaaaca ttatcaggaa ggagccagga 1261 caaaggcttt tgttcaggtt tatgaaaacc ccagatgaaa tcatgagtgg ccgaacagac 1321 cgtctggagc acctagagtc ccaggagctg gatgaacaaa tataccaaga agatgaatgc 1381 tgaaggaacc aacagtccac ctcagcgggc cagcagccca gggaacccct gcccaccagg 1441 attgctggaa gtgtgacgga gcaggcgggc tgaggagagt ggaaaaggaa gcgacccaga 1501 aatggcaggg acacttctct tgcagaccaa gagggaccct ggagcacctt agacaaacta 1561 cccagcacag gcggggctgg // LOCUS HSU11791 1203 bp mRNA PRI 08-SEP-1994 DEFINITION Human cyclin H mRNA, complete cds. ACCESSION U11791 NID g536919 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1203) AUTHORS Makela,T.P., Tassan,J-.P., Nigg,E., Frutiger,S., Hughes,G. and Weinberg,R.A. TITLE A cyclin associated with the CDK-activating kinase MO15 JOURNAL Nature 371, 254-257 (1994) MEDLINE 94359612 REFERENCE 2 (bases 1 to 1203) AUTHORS Makela,T.P. TITLE Direct Submission JOURNAL Submitted (05-JUL-1994) Tomi P. Makela, Whitehead Institute for Biomedical Research, 9 Cambridge Center, Cambridge, MA 02142, USA FEATURES Location/Qualifiers source 1..1203 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="F11-1" /clone_lib=" Clontech HL1151x 5' stretch cDNA library" /dev_stage="fetus" /tissue_type="liver" CDS 61..1032 /codon_start=1 /product="cyclin H" /db_xref="PID:g536920" /translation="MYHNSSQKRHWTFSSEEQLARLRADANRKFRCKAVANGKVLPND PVFLEPHEEMTLCKYYEKRLLEFCSVFKPAMPRSVVGTACMYFKRFYLNNSVMEYHPR IIMLTCAFLACKVDEFNVSSPQFVGNLRESPLGQEKALEQILEYELLLIQQLNFHLIV HNPYRPFEGFLIDLKTRYPILENPEILRKTADDFLNRIALTDAYLLYTPSQIALTAIL SSASRAGITMESYLSESLMLKENRTCLSQLLDIMKSMRNLVKKYEPPRSEEVAVLKQK LERCHSAELALNVITKKRKGYEDDDYVSKKSKHEEEEWTDDDLVESL" BASE COUNT 376 a 241 c 248 g 338 t ORIGIN 1 ggacgctgat gcgtttgggt tctcgtctgc agaccctctg gacctggtca cgattccata 61 atgtaccaca acagtagtca gaagcggcac tggaccttct ccagcgagga gcagctggca 121 agactgcggg ctgacgccaa ccgcaaattc agatgcaaag ccgtggccaa cgggaaggtt 181 cttccgaatg atccagtctt tcttgagcct catgaagaaa tgacactctg caaatactat 241 gagaaaaggt tattggaatt ctgttcggtg tttaagccag caatgccaag atctgttgtg 301 ggtacggctt gtatgtattt caaacgtttt tatcttaata actcagtaat ggaatatcac 361 cccaggataa taatgctcac ttgtgcattt ttggcctgca aagtagatga attcaatgta 421 tctagtcctc agtttgttgg aaacctccgg gagagtcctc ttggacagga gaaggcactt 481 gaacagatac tggaatatga actacttctt atacagcaac ttaatttcca ccttattgtc 541 cacaatcctt acagaccatt tgagggcttc ctcatcgact taaagacccg ctatcccata 601 ttggagaatc cagagatttt gaggaaaaca gctgatgact ttcttaatag aattgcattg 661 acggatgctt accttttata cacaccttcc caaattgccc tgactgccat tttatctagt 721 gcctccaggg ctggaattac tatggaaagt tatttatcag agagtctgat gctgaaagag 781 aacagaactt gcctgtcaca gttactagat ataatgaaaa gcatgagaaa cttagtaaag 841 aagtatgaac cacccagatc tgaagaagtt gctgttctga aacagaagtt ggagcgatgt 901 cattctgctg agcttgcact taacgtaatc acgaagaaga ggaaaggcta tgaagatgat 961 gattacgtct caaagaaatc caaacatgag gaggaagaat ggactgatga cgacctggta 1021 gaatctctct aaccatttga agttgatttc tcaatgctaa ctaatcaaga gaagtaggaa 1081 gcatatcaaa cgtttaactt tatttaaaaa gtataatgtg aaaacataaa atatattaaa 1141 acttttctat tgttttcttt ccctttcaca gtaactttat gtaaaataaa ccatcttcaa 1201 aag // LOCUS HSU11861 992 bp mRNA PRI 25-JUL-1994 DEFINITION Human G10 homolog (edg-2) mRNA, complete cds. ACCESSION U11861 NID g515482 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 992) AUTHORS Hla,T. TITLE Characterization of edg-2, a human homolog of the Xenopus maternal transcript G10 from endothelial cells JOURNAL Biochim. Biophys. Acta (1994) In press REFERENCE 2 (bases 1 to 992) AUTHORS Hla,T. TITLE Direct Submission JOURNAL Submitted (06-JUL-1994) Timothy Hla, Molecular Biology, Holland Laboratory, 15601 Crabbs Branch Way, Rockville, MD 20855, USA FEATURES Location/Qualifiers source 1..992 /organism="Homo sapiens" /db_xref="taxon:9606" gene 380..814 /gene="edg-2" CDS 380..814 /gene="edg-2" /note="G10 homolog; similar to Xenopus laevis maternal G10 protein, Swiss-Prot Accession Number P12805" /codon_start=1 /db_xref="PID:g515483" /translation="MPKVKRSRKAPQDGWELIEPTLDQLDQKMREAETEPHEGKRKVE SLWPIFRIHHQKTRYIFDLFYKRKAYSRELLDICYKEGLADKNLLAKWKKQGIGNLCC LRCIQTRDTNFGTNCICRVPKSKLEVGRIIECTHCGCRGCSG" polyA_site 992 /note="11 A residues" BASE COUNT 261 a 238 c 277 g 216 t ORIGIN 1 gaattccctg aggcgggaga ccggtggtct gcaccgtcct ggagggagat atgagtggct 61 ggactctcag ccagccactg ggatgtgttc gggctttgga ccttgaggcc ggagagagct 121 cccgagagga ggcggcgcac gttcgttctt ctgaggggac ggtagatttg ggggtttttc 181 ctctaggatt ctcgcgccgt ttcctctgaa gaaacagaac cagagaggga aggtgacctg 241 aaagtcactg aataattttt ttagagctga acaagaatcc aagcctgcaa ctgcaactgc 301 agagacgaga gatctttctg ctgtctatac tcttggaaag cacatcctaa gatctttgca 361 gattatctgt ggaaggagaa tgcctaaagt caaacgaagc cggaaagcac cccaggatgg 421 ctgggagttg attgagccaa cactggatca attagatcaa aagatgagag aagctgaaac 481 agaaccgcat gagggaaaga ggaaagtgga atctctgtgg cccatcttca ggatccacca 541 ccagaaaacc cgctacatct tcgacctctt ttacaagcgg aaagcctaca gcagagaact 601 cttagatata tgttataaag aaggcttagc agacaaaaac ctgttggcaa aatggaaaaa 661 gcaaggtata ggaaacttgt gctgcctgcg gtgcattcag acacgggaca ccaacttcgg 721 gacgaactgc atctgccgcg tgcccaaaag caagctggaa gtgggccgca tcatcgagtg 781 cacacactgt ggctgccgtg gctgctctgg ctgacggtgg ctgctgctcc accctggact 841 ctggacttcg caggttcctt gcctgtcacg ccaccccctt cctgggagca gcgagcagtg 901 ccccaggccc gagttggagc acggtctcta tggggaagcg ttcgctgtct atcagctgtg 961 atttgtaaaa ataaaatctt taaatctctc ga // LOCUS HSU11870 4452 bp DNA PRI 28-MAR-1995 DEFINITION Human interleukin-8 receptor type A (IL8RBA) gene, promoter and complete cds. ACCESSION U11870 NID g511804 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4452) AUTHORS Ahuja,S.K., Shetty,A., Tiffany,H.L. and Murphy,P.M. TITLE Comparison of the genomic organization and promoter function for human interleukin-8 receptors A and B JOURNAL J. Biol. Chem. 269 (42), 26381-26389 (1994) MEDLINE 95014476 REFERENCE 2 (bases 1 to 4452) AUTHORS Ahuja,S.K. TITLE Direct Submission JOURNAL Submitted (06-JUL-1994) Sunil K. Ahuja, Laboratory of Host Defenses, National Institutes of Health, National Institute of Allergy and Infectious Diseases, NIH, Bldg 10, Rm 11N109, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..4452 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="6e/IL8RA-gene" /chromosome="2" /map="2q34-35" /tissue_type="placenta" promoter 1..300 /evidence=experimental protein_bind 65..70 /bound_moiety="NF-ATp" protein_bind 96..101 /bound_moiety="NF-ATp" protein_bind 205..212 /bound_moiety="GRE" exon 301..367 /gene="IL8RA" /number=1 mRNA join(301..367,2024..4452) /gene="IL8RA" /evidence=experimental gene join(301..367,2024..4452) /gene="IL8RA" intron 368..2023 /number=1 exon 2024..4452 /gene="IL8RA" /number=2 CDS 2057..3109 /gene="IL8RA" /note="neutrophil chemoattractant receptor; G-protein coupled seven-transmembrane spanning receptor" /codon_start=1 /evidence=experimental /product="interleukin-8 receptor type A" /db_xref="PID:g511805" /translation="MSNITDPQMWDFDDLNFTGMPPADEDYSPCMLETETLNKYVVII AYALVFLLSLLGNSLVMLVILYSRVGRSVTDVYLLNLALADLLFALTLPIWAASKVNG WIFGTFLCKVVSLLKEVNFYSGILLLACISVDRYLAIVHATRTLTQKRHLVKFVCLGC WGLSMNLSLPFFLFRQAYHPNNSSPVCYEVLGNDTAKWRMVLRILPHTFGFIVPLFVM LFCYGFTLRTLFKAHMGQKHRAMRVIFAVVLIFLLCWLPYNLVLLADTLMRTQVIQES CERRNNIGRALDATEILGFLHSCLNPIIYAFIGQNFRHGFLKILAMHGLVSKEFLARH RVTSYTSSSVNVSSNL" BASE COUNT 1067 a 1112 c 1078 g 1195 t ORIGIN 1 aagcttccac aggtgatata ctagggaatt taggaataaa caaatggaaa taaattcaag 61 aaaaggaaaa taataaaaat gatcatccat agagtggaga attcagataa tggaccctca 121 accccagctt cacacctggg acccccactt ggtcatatgg accctggcag tctctaatca 181 caagtctgtg atcccttgac ttaaactgtt cttccccaaa tgtagacatg ggtggggctc 241 agaagggagg tgtcatctga tgtggtttcc ttatttccgt ttattcatca agtgccctct 301 agctgttaag tcactctgat ctctgactgc agctcctact gttggacaca cctggccggt 361 gcttcaggta ggaagccacc tctgtttgct aggactttct gtggggtagg gctgcttggc 421 ttgactttat tttggaaaat gtattcattt ctgtgggagc tgaggatttc tgctgcccgc 481 ttcctctctt gtgaccacca ctcatacatg gagttagcag gtgaacacca cagcctcctc 541 atccctcttt ttataccttg ccctttttgt gggaagaggc tgaccattcc caatttgagt 601 ttattgattc accttaagac tttgcttcag tgaaaatgca aatactggta gggagggaac 661 acactaaatg tgtggatctg agtggactca tcatacaccc tcaggcctgg cagcaaatga 721 gaggtggatg attatgtctt cctcgatttg cagatgagaa gactgaaatc tataggggta 781 cgagacccca ctcaaggtca tgagacttat ttatttattt atttatttat tttgagatgg 841 agtctcactc tgtcgcctag gctgtagtac tgtggcacga tctcggctca ctgcaacctc 901 catttccggg ttcaagtgat tctcctgcct cagcctcctg agtagttggg attacaggtg 961 cacgccacca cacccagcta atttttttgt atttttagta gagacggggt ttcatcattt 1021 tggtcaggct ggtctcgaac tcctgacttc atgatctgcc tgccttggcc tcccaaaggc 1081 tgggattaca cacgaaactt attaatggta gaatcaggat ggaatgaaga ctggatgcta 1141 ggtgtcttta caaccaacct cggtgacttt ccaaaggctt tcagacttct ctggagagtc 1201 ctggagcttt gaggggctct ttaggggcca tctgggttcg gagattagca cactctgccc 1261 cacagtggct cagcttttat ctgcttcaca tactgggctt ctgggaagat cttatttcag 1321 taaactattt cacaatagga aatatatttg aaactcttga atgattctgt ggattttttc 1381 aggggtggag ggtttcaggg accaagatgg agactttatc ttctcaaata cttaaaactc 1441 ctttagacag aggaacaaaa tatgtgctcc tcatttaaag aaggcttaaa atatacagtt 1501 taatgcaatg tgttgttatc acttccttcc cccagtagat tttaaggagg gtaaacaaga 1561 atcagggaga acataggaat agaaatagta gaaggggaca cctgggaaca ggtttgcctt 1621 cttgcatttt gcttaatgct ggcccttccc tgaatgtcta agaccaacct ggtccccaca 1681 tccaaatgca cagacacagc tgaggatgga gaaggctaaa gagggacaga ggtagagaca 1741 taggctgaga ggaggcagtt gtaggttgag ctagggctaa ggtgttttcc ccatattcca 1801 tcttacccca cactcaggcc aggccttaga gttgtggaag gtggagaaca ctgggaagcc 1861 aacctccgaa gaagaccagg ttggagtcaa aggaggaagg agagctctca ttgccaaacc 1921 aacagggaag ccaaggatat cccagtaact gctctcacat cattgatgag aatgccttga 1981 atccgagcta ctaaatcaca tttccttcct tctaaccttc cagttagatc aaaccattgc 2041 tgaaactgaa gaggacatgt caaatattac agatccacag atgtgggatt ttgatgatct 2101 aaatttcact ggcatgccac ctgcagatga agattacagc ccctgtatgc tagaaactga 2161 gacactcaac aagtatgttg tgatcatcgc ctatgcccta gtgttcctgc tgagcctgct 2221 gggaaactcc ctggtgatgc tggtcatctt atacagcagg gtcggccgct ccgtcactga 2281 tgtctacctg ctgaacctgg ccttggccga cctactcttt gccctgacct tgcccatctg 2341 ggccgcctcc aaggtgaatg gctggatttt tggcacattc ctgtgcaagg tggtctcact 2401 cctgaaggaa gtcaacttct acagtggcat cctgctgttg gcctgcatca gtgtggaccg 2461 ttacctggcc attgtccatg ccacacgcac actgacccag aagcgtcact tggtcaagtt 2521 tgtttgtctt ggctgctggg gactgtctat gaatctgtcc ctgcccttct tccttttccg 2581 ccaggcttac catccaaaca attccagtcc agtttgctat gaggtcctgg gaaatgacac 2641 agcaaaatgg cggatggtgt tgcggatcct gcctcacacc tttggcttca tcgtgccgct 2701 gtttgtcatg ctgttctgct atggattcac cctgcgtaca ctgtttaagg cccacatggg 2761 gcagaagcac cgagccatga gggtcatctt tgctgtcgtc ctcatcttcc tgctttgctg 2821 gctgccctac aacctggtcc tgctggcaga caccctcatg aggacccagg tgatccagga 2881 gagctgtgag cgccgcaaca acatcggccg ggccctggat gccactgaga ttctgggatt 2941 tctccatagc tgcctcaacc ccatcatcta cgccttcatc ggccaaaatt ttcgccatgg 3001 attcctcaag atcctggcta tgcatggcct ggtcagcaag gagttcttgg cacgtcatcg 3061 tgttacctcc tacacttctt cgtctgtcaa tgtctcttcc aacctctgaa aaccatcgat 3121 gaaggaatat ctcttctcag aaggaaagaa taaccaacac cctgaggttg tgtgtggaag 3181 gtgatctggc tctggacagg cactatctgg gttttggggg gacgctatag gatgtgggga 3241 agttaggaac tggtgtcttc aggggccaca ccaaccttct gaggagctgt tgaggtacct 3301 ccaaggaccg gcctttgcac ctccatggaa acgaagcacc atcattcccg ttgaacgtca 3361 catctttaac ccactaactg gctaattagc atggccacat ctgagccccg aatctgacat 3421 tagatgagag aacagggctg aagctgtgtc ctcatgaggg ctggatgctc tcgttgaccc 3481 tcacaggagc atctcctcaa ctctgagtgt taagcgttga gccaccaagc tggtggctct 3541 gtgtgctctg atccgagctc aggggggtgg ttttcccatc tcaggtgtgt tgcagtgtct 3601 gctggagaca ttgaggcagg cactgccaaa acatcaacct gccagctggc cttgtgagga 3661 gctggaaaca catgttcccc ttgggggtgg tggatgaaca aagagaaaga gggtttggaa 3721 gccagatcta tgccacaaga acccccttta cccccatgac caacatcgca gacacatgtg 3781 ctggccacct gctgagcccc aagtggaacg agacaagcag cccttagccc ttcccctctg 3841 cagcttccag gctggcgtgc agcatcagca tccctagaaa gccatgtgca gccaccagtc 3901 cattgggcag gcagatgttc ctaataaagc ttctgttccg tgcttgtccc tgtggaagta 3961 tcttggttgt gacagagtca agggtgtgtg cagcattgtt ggctgttcct gcagtagaat 4021 gggggcagca cctcctaaga aggcacctct ctgggttgaa gggcagtgtt ccctggggct 4081 ttaactcctg ctagaacagt ctcttgaggc acagaaactc ctgttcatgc ccatacccct 4141 ggccaaggaa gatccctttg tccacaagta aaaggaaatc ctcctccagg gagtctcagc 4201 ttcaccctga ggtgagcatc atcttctggg ttaggccttg cctaggcata gcctgcctca 4261 agctatgtga gctcaccagt ccctccccaa atgctttcca tgagttgcag ttttttccta 4321 gtctgttttc cctccttgga gaacagggcc ctgtcggttt gttcactgta tgtccttggt 4381 gcctggagcc tactaaatgc tcaataaata atgatcacag gaatgaatgc atgctgaaaa 4441 gaccactctt tt // LOCUS HSU12134 2598 bp mRNA PRI 27-JAN-1996 DEFINITION Human DNA damage repair and recombination protein RAD52 mRNA, complete cds. ACCESSION U12134 NID g603156 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2598) AUTHORS Shen,Z., Denison,K., Lobb,R., Gatewood,J.M. and Chen,D.J. TITLE The human and mouse homologs of the yeast RAD52 gene: cDNA cloning, sequence analysis, assignment to human chromosome 12p12.2-p13, and mRNA expression in mouse tissues JOURNAL Genomics 25 (1), 199-206 (1995) MEDLINE 95293373 REFERENCE 2 (bases 1 to 2598) AUTHORS Shen,Z. TITLE Direct Submission JOURNAL Submitted (10-JUL-1994) Zhiyuan Shen, Los Alamos National Laboratory, Life Sciences Division, Los Alamos, NM 87545, USA FEATURES Location/Qualifiers source 1..2598 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="ZAP-expression" /cell_line="Jurkat" /cell_type="T-cell" /chromosome="12" /map="12p12.2-p13" CDS 33..1289 /note="similar to yeast Rad52p: Swiss-Prot Accession Number P06778" /codon_start=1 /function="DNA damage repair and recombination" /product="RAD52" /db_xref="PID:g603157" /translation="MSGTEEAILGGRDSHPAAGGGSVLCFGQCQYTAEEYQAIQKALR QRLGPEYISSRMAGGGQKVCYIEGHRVINLANEMFGYNGWAHSITQQNVDFVDLNNGK FYVGVCAFVRVQLKDGSYHEDVGYGVSEGLKSKALSLEKARKEAVTDGLKRALRSFGN ALGNCILDKDYLRSLNKLPRQLPLEVDLTKAKRQDLEPSVEEARYNSCRPNMALGHPQ LQQVTSPSRPSHAVIPADQDCSSRSLSSSAVESEATHQRKLRQKQLQQQFRERMEKQQ VRVSTPSAEKSEAAPPAPPVTHSTPVTVSEPLLEKDFLAGVTQELIKTLEDNSEKWAV TPDAGDGVVKPSSRADPAQTSDTLALNNQMVTQNRTPHSVCHQKPQAKSGSWDLQTYS ADQRTTGNWESHRKSQDMKKRKYDPS" BASE COUNT 694 a 592 c 636 g 676 t ORIGIN 1 gctgcccgag gcgcaggtca accagaatca agatgtctgg gactgaggaa gcaattcttg 61 gaggacgtga cagccatcct gctgctggcg gcggctcagt gttatgcttt ggacagtgcc 121 agtacacagc agaagagtac caggccatcc agaaggccct gaggcagagg ctgggcccag 181 aatacataag tagccgcatg gctggcggag gccagaaggt gtgctacatt gagggtcatc 241 gggtaattaa tctggccaat gagatgtttg gttacaatgg ctgggcacac tccatcacgc 301 agcagaatgt ggattttgtt gacctcaaca atggcaagtt ctacgtggga gtctgtgcat 361 ttgtgagggt ccagctgaag gatggttcat atcatgaaga tgttggttat ggtgttagtg 421 agggcctcaa gtccaaggct ttatctttgg agaaggcaag gaaggaggcg gtgacagacg 481 ggctgaagcg agccctcagg agttttggga atgcacttgg aaactgtatt ctggacaaag 541 actacctgag atcactaaat aagcttccac gccagttgcc tcttgaagtg gatttaacta 601 aagcgaagag acaagatctt gaaccgtctg tggaggaggc aagatacaac agctgccgac 661 cgaacatggc cctgggacac ccacagctgc agcaggtgac ctccccttcc agacccagcc 721 atgctgtgat accggcggac caggactgca gctcccgaag cctgagctca tccgccgtgg 781 agagcgaggc cacgcaccag cggaagctcc ggcagaagca gctgcagcag cagttccggg 841 agcggatgga gaagcagcag gttcgagtct ccacgccgtc agctgagaag agtgaggcag 901 cgcctccggc ccctcctgtg acgcacagca ctcctgtaac tgtctcagaa ccactcctgg 961 agaaagactt ccttgcagga gtgactcaag aattaatcaa gactcttgaa gacaactctg 1021 aaaagtgggc tgtgactccc gatgcagggg atggtgtggt caagccctcg tctagagcag 1081 acccagccca gacctctgac acattagcct tgaacaacca gatggtgacc cagaacagga 1141 ctccacacag cgtttgccac cagaaaccac aagcaaaatc tggatcttgg gacctccaaa 1201 cttatagcgc tgaccaacgc acaacaggaa actgggaatc tcataggaag agccaggaca 1261 tgaagaaaag gaaatatgat ccatcttaac tgaggctcag gccacataat tggactctgt 1321 cacaaaggga ctttggaaaa ctactttttg gtcatgaaat tgttcatcgc tgctggagaa 1381 tgaacgtcat tgcaatttat cttgcttcat tctgaacctt atcaagagga tctgactgag 1441 agcccactgc agttagagct gagcactttt gaaaagcttg tccatcactc tagtagggag 1501 aggctctgga cagatgaata ccttttcttc ggcttgtgag gcttcccact atttattact 1561 gaactattat gttaatgaag atggacattt taggaatcac caatggctcc ttgccctcaa 1621 gcaatatagg ccagacttgg tcctaagcac ctgcctcagc aattgtctac attcagttgt 1681 tttgcataac gtctgccttc tttcctttac ggtccatgcc tttaatgttg cccacattaa 1741 gcactgtgga tcacgacagg aaaaaggttg gagcagtgct tttcactact ttgtatcaat 1801 ccaggctaca atcttcattt aatataaata atttatggat ttatgacatt acaatcctgc 1861 attgtttcaa gactgacatt ttttcctaag gaaggaaata atcatctaag accacgaaaa 1921 aaggctgttt tttgtttttt tttttttttt ttgagacggg gtctggctgt gttgccctga 1981 ctggagttca gtggtgcaaa cacagctctc tccacaacct cttgggccca agtgatactc 2041 ccacctctgc cttacaaaat acagggatta ctggtgtgag ccactgtgtc tggccagaaa 2101 aggcattttt gagaaagcaa atcgtatacc ttattaacaa aatagaatat atatatattg 2161 cttatctgaa atgcttgaaa ccagaattgt tttgcatttt ttgaatattt gtatacacat 2221 aatgagacct tggggatggg acccaagtct gaacgtggaa ttcacctgtg tttcgtgtat 2281 atgtctcatg cacataattt tgtgcatgaa acagagtttt tgtataagaa gatacactgc 2341 agctgaagag ggctgggttt ttttttctct tagggtcgct gcataaactg ttgtatgcct 2401 ggtgctttgc gacttgtcac acgaggtcac gtgtggaatt ttccacttct ggcatcacgt 2461 cagtgctcag aaattttctg atctcagagc atttcaatta gggatgctca aacgcaactg 2521 tttctacttc cccatttcag gtgtgagatg taacccacct tgaccataaa ttggcttttc 2581 atagtgctca gatgtttc // LOCUS HSU12170 3952 bp mRNA PRI 10-AUG-1994 DEFINITION Human zinc finger homeodomain protein mRNA, complete cds. ACCESSION U12170 NID g529172 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Bachman,N.J. and Scarpulla,R.C. TITLE A human zinc finger homeodomain protein homologous to the chicken delta-crystallin enhancer binding protein, delta EF1 JOURNAL Unpublished REFERENCE 2 (bases 22 to 3515) AUTHORS Watanabe,Y., Kawakami,K., Hirayama,Y. and Nagano,K. TITLE Transcription factors positively and negatively regulating the Na,K-ATPase alpha 1 subunit gene JOURNAL J. Biochem. 114 (6), 849-855 (1993) MEDLINE 94186507 REFERENCE 3 (bases 1090 to 3605) AUTHORS Williams,T.M., Moolten,D., Burlein,J., Romano,J., Bhaerman,R., Godillot,A., Mellon,M., Rauscher,F.J. III. and Kant,J.A. TITLE Identification of a zinc finger protein that inhibits IL-2 gene expression JOURNAL Science 254 (5039), 1791-1794 (1991) MEDLINE 92108424 REFERENCE 4 (bases 1 to 3952) AUTHORS Bachman,N.J. TITLE Direct Submission JOURNAL Submitted (11-JUL-1994) Nancy J. Bachman, CMS Biology, Northwestern University Medical School, 303 E. Chicago Ave., Chicago, IL 60611, USA FEATURES Location/Qualifiers source 1..3952 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda gt11 Clontech #HL1022" /cell_line="HeLa" 5'UTR 1..24 CDS 25..3399 /note="encodes 7 zinc fingers; an amino terminal cluster (fingers 1-4; aa 172..292) and a carboxy terminal cluster (fingers 5-7; aa 906..981); homeodomain is also present (aa 58..640); similar to chicken delta-crystallin enhancer binding protein deltaEF1: SwissProt Accession Number P36197" /citation=[2] /codon_start=1 /product="zinc finger homeodomain protein" /db_xref="PID:g529173" /translation="MADGPRCKRRKQANPRRNNVTNYNTVVETNSDSDDEDKLHIVEE ESVTDAADCEGVPEDDLPTDQTVLPGRSSEREGNAKNCWEDDRKEGQEILGPEAQADE AGCTVKDDECESDAENEQNHDPNVEEFLQQQDTAVIFPEAPEEDQRQGTPEASGHDEN GTPDAFSQLLTCPYCDRGYKRFTSLKEHIKYRHEKNEDNFSCSLCSYTFAYRTQLERH MTSHKSGRDQRHVTQSGCNRKFKCTECGKAFKYKHHLKEHLRIHSGEKPYECPNCKKR FSHSGSYSSHISSKKCISLIPVNGRPRTGLKTSQCSSPSLSASPGSPTRPQIRQKIEN KPLQEQLSVNQIKTEPVDYEFKPIVVASGINCSTPLQNGVFTGGGPLQATSSPQGMVQ AVVLPTVGLVSPISINLSDIQNVLKVAIDGNVIRQVLENNQANLASKEQETINASPIQ QGGHSVISAISLPLVDQDGTTKIIINYSLEQPSQLQVVPQNLKKENPVATNSCKSEKL PEDLTVKSEKDKSFEGGVNDSTCLLCDDCPGDINALPELKHYDLKQPTQPPPLPAAEA EKPESSVSSATGDGNLSPSQPPLKNLLSLLKAYYALNAQPSAEELSKIADSVNLPLDV VKKWFEKMQAGQISVQSSEPSSPEPGKVNTPAKNNDQPQSANANEPQDSTVNLQSPLK MTNSPVLPVGSTTNGSRSSTPSPSPLNLSSSRNTQGYLYTAEGAQEEPQVEPLDLSLP KQQGELLERSTITSVYQNSVYSVQEEPLNLSCAKKEPQKDSCVTDSEPVVNVIPPSAN PINIAIPTVTAQLPTIVAIADQNSVPCLRALAANKQTILIPQVAYTYSTTVSPAVQEP PLKVIQPNGNQDERQDTSSEGVSNVEDQNDSDSTPPKKKMRKTENGMYACDLCDKIFQ KSSSLLRHKYEHTGKRPHECGICKKAFKHKHHLIEHMRLHSGEKPYQCDKCGKRFSHS GSYSQHMNHRYSYCKREAEERDSTEQEEAGPEILSNEHVGARASPSQGDSDERESLTR EEDEDSEKEEEEEDKEMEELQEEKECEKPQGDEEEEEEEEEVEEEEVEEAENEGEEAK TEGLMKDDRAESQASSLGQKVGESSEQVSEEKTNEA" 3'UTR 3400..>3952 /citation=[2] /citation=[3] repeat_region 3549..3592 /citation=[3] /rpt_type=tandem /rpt_family="AC" BASE COUNT 1381 a 806 c 848 g 917 t ORIGIN 1 attttagaca caagcgagag gatcatggcg gatggcccca ggtgtaagcg cagaaagcag 61 gcgaacccgc ggcgcaataa cgttacaaat tataatactg tggtagaaac aaattcagat 121 tcagatgatg aagacaaact gcatattgtg gaagaagaaa gtgttacaga tgcagctgac 181 tgtgaaggtg taccagagga tgacctgcca acagaccaga cagtgttacc agggaggagc 241 agtgaaagag aagggaatgc taagaactgc tgggaggatg acagaaagga agggcaagaa 301 atcctggggc ctgaagctca ggcagatgaa gcaggatgta cagtaaaaga tgatgaatgc 361 gagtcagatg cagaaaatga gcaaaaccat gatcctaatg ttgaagagtt tctacaacaa 421 caagacactg ctgtcatttt tcctgaggca cctgaagagg accagaggca gggcacacca 481 gaagccagtg gtcatgatga aaatggaaca ccagatgcat tttcacaatt actcacctgt 541 ccatattgtg atagaggcta taaacgcttt acctctctga aagaacacat taaatatcgt 601 catgaaaaga atgaagataa ctttagttgc tccctgtgca gttacacctt tgcatacaga 661 acccaacttg aacgtcacat gacatcacat aaatcaggaa gagatcaaag acatgtgacg 721 cagtctgggt gtaatcgtaa attcaaatgc actgagtgtg gaaaagcttt caaatacaaa 781 catcacctaa aagagcactt aagaattcac agtggagaga agccatatga atgcccaaac 841 tgcaagaaac gcttttccca ttctggctcc tatagctcac acataagcag taagaaatgt 901 atcagcttga tacctgtgaa tgggcgacca agaacaggac tcaagacatc tcagtgttct 961 tcaccgtctc tttcagcatc accaggcagt cccacacgac cacagatacg gcaaaagata 1021 gagaataaac cccttcaaga acaactttct gttaaccaaa ttaaaactga acctgtggat 1081 tatgaattca aacccatagt ggttgcttca ggaatcaact gttcaacccc tttacaaaat 1141 ggggttttca ctggtggtgg cccattacag gcaaccagtt ctcctcaggg catggtgcaa 1201 gctgttgttc tgccaacagt tggtttggtg tctcccataa gtatcaattt aagtgatatt 1261 cagaatgtac ttaaagtggc gatagatggt aatgtaataa ggcaagtgtt ggagaataat 1321 caagccaatc ttgcatccaa agaacaagaa acaatcaatg cttcacccat acaacaaggt 1381 ggccattctg ttatttcagc catcagtctt cctttggttg atcaagatgg aacaaccaaa 1441 attatcatca actacagtct tgagcagcct agccaacttc aagttgttcc tcaaaattta 1501 aaaaaagaaa atccagtcgc tacaaacagt tgtaaaagtg aaaagttacc agaagatctt 1561 actgttaagt ctgagaagga caaaagcttt gaaggggggg tgaatgatag cacttgtctt 1621 ctgtgtgatg attgtccagg agatattaat gcacttccag aattaaagca ctatgaccta 1681 aagcagccta ctcagcctcc tccactccct gcagcagaag ctgagaagcc tgagtcctct 1741 gtttcatcag ctactggaga tggcaatttg tctcctagtc agccaccttt aaagaacctc 1801 ttgtctctcc taaaagcata ttatgctttg aatgcacaac caagtgcaga agagctctca 1861 aaaattgctg attcagtaaa cctaccactg gatgtagtaa aaaagtggtt tgaaaagatg 1921 caagctggac agatttcagt gcagtcttct gaaccatctt ctcctgaacc aggcaaagta 1981 aatacccctg ccaagaacaa tgatcagcct caatctgcaa atgcaaatga accccaggac 2041 agcacagtaa atctacaaag tcctttgaag atgactaact ccccagtttt accagtggga 2101 tcaaccacca atggttccag aagtagtaca ccatccccat cacctctaaa cctttcctca 2161 tccagaaata cacagggtta cttgtacaca gctgagggtg cacaagaaga gccacaagta 2221 gaacctcttg atctttcact accaaagcaa cagggagaat tattagaaag gtcaactatc 2281 actagtgttt accagaacag tgtttattct gtccaggaag aacccttgaa cttgtcttgc 2341 gcaaaaaagg agccacaaaa ggacagttgt gttacagact cagaaccagt tgtaaatgta 2401 atcccaccaa gtgccaaccc cataaatatc gctataccta cagtcactgc ccagttaccc 2461 acaatcgtgg ccattgctga ccagaacagt gttccatgct taagagcgct agctgccaat 2521 aagcaaacga ttctgattcc ccaggtggca tacacctact caactacggt cagccctgca 2581 gtccaagaac cacccttgaa agtgatccag ccaaatggaa atcaggatga aagacaagat 2641 actagctcag aaggagtatc aaatgtagag gatcagaatg actctgattc tacaccgccc 2701 aaaaagaaaa tgcggaagac agaaaatgga atgtatgctt gtgatttgtg tgacaagata 2761 ttccaaaaga gtagttcatt attgagacat aaatatgaac acacaggtaa aagacctcat 2821 gagtgtggaa tctgtaaaaa ggcatttaaa cacaaacatc atttgattga acacatgcga 2881 ttacattctg gagaaaagcc ctatcaatgt gacaaatgtg gaaagcgctt ctcacactct 2941 gggtcttatt ctcaacacat gaatcatcgc tactcctact gtaagagaga agcggaagaa 3001 cgtgacagca cagagcagga agaggcaggg cctgaaatcc tctcgaatga gcacgtgggt 3061 gccagggcgt ctccctcaca gggcgactcg gacgagagag agagtttgac aagggaagag 3121 gatgaagaca gtgaaaaaga ggaagaggag gaggataaag agatggaaga attgcaggaa 3181 gaaaaagaat gtgaaaaacc acaaggggat gaggaagagg aggaggagga ggaagaagtg 3241 gaagaagaag aggtagaaga ggcagagaat gagggagaag aagcaaaaac tgaaggtctg 3301 atgaaggatg acagggctga aagtcaagca agcagcttag gacaaaaagt aggcgagagt 3361 agtgagcaag tgtctgaaga aaagacaaat gaagcctaat cgtttttcta gaaggaaaat 3421 aaattctaat tgataatgaa tttcgttcaa tattatcctt gcttttcatg gaaacacagt 3481 aacctgtatg ctgtgattcc tgttcactac tgtgtaaagt aaaaactaaa aaaatacaaa 3541 atacaaaaca cacacacaca cacacacaca cacacacaca cacacacaca caaaataaat 3601 ccgggtgtgc ctgaacctca gacctagtaa tttttcatgc agttttcaaa gttaggaaca 3661 agtttgtaac atgcagcaga ttagaaaacc ttaatgactc agagagcaac aatacaagag 3721 gttaaaggaa gctgattaat tagatatgca tctggcattg ttttatctta tcagtattat 3781 cactcttatg ttggtttatt cttaagctgt acaattggga gaaattttat aattttttat 3841 tggtaaacat atgctaaatc cgcttcagta ttttattatg ttttttaaaa tgtgagaact 3901 tctgcactac aaaattccct tcacagagaa gtataatgta gttccggaat tc // LOCUS HSU12255 1440 bp mRNA PRI 06-JAN-1995 DEFINITION Human IgG Fc receptor hFcRn mRNA, complete cds. ACCESSION U12255 NID g595474 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1440) AUTHORS Story,C.M., Mikulska,J. and Simister,N.E. TITLE A major histocompatibility complex class I-like Fc receptor cloned from human placenta: Possible role in transfer of immunoglobulin G from mother to fetus JOURNAL J. Exp. Med. 180, 2377-2381 (1994) MEDLINE 95053775 REFERENCE 2 (bases 1 to 1440) AUTHORS Simister,N.E. TITLE Direct Submission JOURNAL Submitted (12-JUL-1994) Neil E. Simister, Biology Department, Brandeis University, 415 South Street, Waltham, MA 02254-9110, USA FEATURES Location/Qualifiers source 1..1440 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="11 and 3" /clone_lib="Clontech library" /tissue_type="placenta" 5'UTR 1..125 misc_signal 122..129 /standard_name="Kozak consensus sequence" sig_peptide 126..194 CDS 126..1223 /codon_start=1 /function="Fc receptor for IgG" /product="hFcRn" /db_xref="PID:g595475" /translation="MGVPRPQPWALGLLLFLLPGSLGAESHLSLLYHLTAVSSPAPGT PAFWVSGWLGPQQYLSYNSLRGEAEPCGAWVWENQVSWYWEKETTDLRIKEKLFLEAF KALGGKGPYTLQGLLGCELGPDNTSVPTAKFALNGEEFMNFDLKQGTWGGDWPEALAI SQRWQQQDKAANKELTFLLFSCPHRLREHLERGRGNLEWKEPPSMRLKARPSSPGFSV LTCSAFSFYPPELQLRFLRNGLAAGTGQGDFGPNSDGSFHASSSLTVKSGDEHHYCCI VQHAGLAQPLRVELESPAKSSVLVVGIVIGVLLLTAAAVGGALLWRRMRSGLPAPWIS LRGDDTGVLLPTPGEAQDADLKDVNVIPATA" 3'UTR 1224..1440 polyA_signal 1385..1390 /note="non-canonical" polyA_signal 1413..1418 /note="non-canonical" polyA_site 1436 /note="4 A nucleotides" BASE COUNT 247 a 462 c 447 g 284 t ORIGIN 1 cgggcgcaga agcccctcct cggcgtcctg gtcccggccg tgcccgcggt gtcccgggag 61 gaaggggcgg gccgggggtc gggaggagtc acgtgccccc tcccgcccca ggtcgtcctc 121 tcagcatggg ggtcccgcgg cctcagccct gggcgctggg gctcctgctc tttctccttc 181 ctgggagcct gggcgcagaa agccacctct ccctcctgta ccaccttacc gcggtgtcct 241 cgcctgcccc ggggactcct gccttctggg tgtccggctg gctgggcccg cagcagtacc 301 tgagctacaa tagcctgcgg ggcgaggcgg agccctgtgg agcttgggtc tgggaaaacc 361 aggtgtcctg gtattgggag aaagagacca cagatctgag gatcaaggag aagctctttc 421 tggaagcttt caaagctttg gggggaaaag gtccctacac tctgcagggc ctgctgggct 481 gtgaactggg ccctgacaac acctcggtgc ccaccgccaa gttcgccctg aacggcgagg 541 agttcatgaa tttcgacctc aagcagggca cctggggtgg ggactggccc gaggccctgg 601 ctatcagtca gcggtggcag cagcaggaca aggcggccaa caaggagctc accttcctgc 661 tattctcctg cccgcaccgc ctgcgggagc acctggagag gggccgcgga aacctggagt 721 ggaaggagcc cccctccatg cgcctgaagg cccgacccag cagccctggc ttttccgtgc 781 ttacctgcag cgccttctcc ttctaccctc cggagctgca acttcggttc ctgcggaatg 841 ggctggccgc tggcaccggc cagggtgact tcggccccaa cagtgacgga tccttccacg 901 cctcgtcgtc actaacagtc aaaagtggcg atgagcacca ctactgctgc attgtgcagc 961 acgcggggct ggcgcagccc ctcagggtgg agctggaatc tccagccaag tcctccgtgc 1021 tcgtggtggg aatcgtcatc ggtgtcttgc tactcacggc agcggctgta ggaggagctc 1081 tgttgtggag aaggatgagg agtgggctgc cagccccttg gatctccctt cgtggagacg 1141 acaccggggt cctcctgccc accccagggg aggcccagga tgctgatttg aaggatgtaa 1201 atgtgattcc agccaccgcc tgaccatccg ccattccgac tgctaaaagc gaatgtagtc 1261 aggccccttt catgctgtga gacctcctgg aacactggca tctctgagcc tccagaaggg 1321 gttctgggcc tagttgtcct ccctctggag ccccgtcctg tggtctgcct cagtttcccc 1381 tcctaataca tatggctgtt ttccacctcg ataatataac acgagtttgg gcccgaaaaa // LOCUS HSU12404 700 bp mRNA PRI 02-FEB-1996 DEFINITION Human Csa-19 mRNA, complete cds. ACCESSION U12404 NID g531170 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 700) AUTHORS Fisicaro,N., Katerelos,M., Williams,J., Power,D., D'Apice,A. and Pearse,M. TITLE Identification of genes downregulated in the thymus by cyclosporin-A: preliminary characterization of clone CSA-19 JOURNAL Mol. Immunol. 32 (8), 565-572 (1995) MEDLINE 95334024 REFERENCE 2 (bases 1 to 700) AUTHORS Pearse,M.J. TITLE Direct Submission JOURNAL Submitted (18-JUL-1994) Martin J. Pearse, Clinical Immunology, St. Vincents Hospital, 41 Victoria Parade, Fitzroy, Victoria 3065, Australia FEATURES Location/Qualifiers source 1..700 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 16..669 /codon_start=1 /product="Csa-19" /db_xref="PID:g531171" /translation="MSSKVSRDTLYEAVREVLHGNQRKRRKFLETVELQISLKNYDPQ KDKRFSGTVRLKSTPRPKFSVCVLGDQQHCDEAKAVDIPHMDIEALKKLNKNKKLVKK LVKKYDAFLASESLIKQIPRILGPGLNKAGKFPSLLTHNENMVAKVDEVKSTIKFQMK KVLCLAVAVGHVKMTDDELVYNIHLAVNFLVSLLKKNWQNVRALYIKSTMGKPQRLY" BASE COUNT 194 a 172 c 184 g 150 t ORIGIN 1 gcggcgtgag aagccatgag cagcaaagtc tctcgcgaca ccctgtacga ggcggtgcgg 61 gaagtcctgc acgggaacca gcgcaagcgc cgcaagttcc tggagacggt ggagttgcag 121 atcagcttga agaactatga tccccagaag gacaagcgct tctcgggcac cgtcaggctt 181 aagtccactc cccgccctaa gttctctgtg tgtgtcctgg gggaccagca gcactgtgac 241 gaggctaagg ccgtggatat cccccacatg gacatcgagg cgctgaaaaa actcaacaag 301 aataaaaaac tggtcaagaa gctggtcaag aagtatgatg cgtttttggc ctcagagtct 361 ctgatcaagc agattccacg aatcctcggc ccaggtttaa ataaggcagg aaagttccct 421 tccctgctca cacacaacga aaacatggtg gccaaagtgg atgaggtgaa gtccacaatc 481 aagttccaaa tgaagaaggt gttatgtctg gctgtagctg ttggtcacgt gaagatgaca 541 gacgatgagc ttgtgtataa cattcacctg gctgtcaact tcttggtgtc attgctcaag 601 aaaaactggc agaatgtccg ggccttatat atcaagagca ccatgggcaa gccccagcgc 661 ctatattaag gcacatttga ataaattcta ttaccagttc // LOCUS HSU12424 2618 bp mRNA PRI 03-MAY-1995 DEFINITION Human mitochondrial glycerol-3-phosphate dehydrogenase mRNA, complete cds. ACCESSION U12424 NID g525319 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2618) AUTHORS Lehn,D.A., Brown,L.J., Simonson,G.D., Moran,S.M. and MacDonald,M.J. TITLE The sequence of a human mitochondrial glycerol-3-phosphate dehydrogenase-encoding cDNA JOURNAL Gene 150 (2), 417-418 (1994) MEDLINE 95121946 REFERENCE 2 (bases 1 to 2618) AUTHORS Lehn,D.A. TITLE Direct Submission JOURNAL Submitted (19-JUL-1994) Donald A. Lehn, Childrens Diabetes Center, University of Wisconsin Medical School, Medical Sciences Center\3415, 1300 University Avenue, Madison, WI 53706-1532, USA FEATURES Location/Qualifiers source 1..2618 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hGPD-11" /clone_lib="Stratagene Lambda ZAP II" /sex="female" /cell_line="HeLa, subclone D98/AH-2" /tissue_type="invasive cervical carcinoma" /dev_stage="adult" 5'UTR 1..123 transit_peptide 124..249 /note="determined by similarity to rat glycerol-3-phosphate dehydrogenase: Swiss-Prot Accession Number P35571" CDS 124..2307 /EC_number="1.1.99.5" /codon_start=1 /product="mitochondrial glycerol-3-phosphate dehydrogenase" /db_xref="PID:g533693" /translation="MAFQKAVKGTILVGGGALATVLGLSQFAHYRRKQMNLAYVKAAD CISEPVNREPPSREAQLLTLQNTSEFDILVIGGGATGSGCALDAVTRGLKTALVERDD FSSGTSSRSTKLIHGGVRYLQKAIMKLDIEQYRMVKEALHERANLLEIAPHLSAPLPI MLPVYKWWQLPYYWVGIKLYDLVAGSNCLKSSYVLSKSRALEHFPMLQKDKLVGAIVY YDGQHNDARMNLAIALTAARYGAATANYMEVVSLLKKTDPQTGKVHVSGARCKDVLTG QEFDVRAKCVINATGPFTDSVRKMDDKDAAAICQPSAGVHIVMPGYYSPESMGLLDPA TSDGRVIFFLPWQKMTIAGTTDTPTDVTHHPIPSEEDINFILNEVRNYLSCDVEVRRG DVLAAWSGIRPLVTDPKSADTQSISRNHVVDISESGLITIAGGKWTTYRSMAEDTINA AVKTHNLKAGPSRTVGLFLQGGKDWSPTLYIRLVQDYGLESEVAQHLAATYGDKAFEV AKMASVTGKRWPIVGVHLVSEFPYIEAEVKYGIKEYACTAVDMISRRTRLAFLNVQAA EEALPRIVELMGRELNWDDYKKQEQLETARKFLYYEMGYKSRSEQLTDRSEISLLPSD IDRYKKRFHKFDADQKGFITIVDVQRVLESINVQMDENTLHEILNEVDLNKNGQVELN EFLQLMSAIQKGRVSGSRLAILMKTAEENLDRRVPIPVDRSCGGL" mat_peptide 250..2304 3'UTR 2308..2618 BASE COUNT 751 a 529 c 642 g 696 t ORIGIN 1 gcggggctgg cacccgggcc gaggctctga ttctgggggg aggccgactc caccctggct 61 ggaggaactg ggtgctcctg cccgctggcc cctcgcgcgt gaggatctat ctcaggctaa 121 gaaatggcat ttcaaaaggc agtgaaaggg acgattcttg ttggaggagg tgctcttgca 181 actgttttag gactttctca gtttgctcat tacagaagga aacaaatgaa cctggcctat 241 gttaaagcag cagactgcat ttcagaacca gttaacaggg agcctccttc cagagaagct 301 cagctactga ctttgcaaaa cacatctgaa tttgatatcc ttgttattgg aggaggagca 361 acaggaagtg gctgtgcgct agatgctgtc accagaggac taaaaacagc ccttgtagaa 421 agagatgatt tctcatcagg gaccagcagc agaagcacta aattgatcca tggtggtgtg 481 agatatctgc agaaggccat catgaagttg gatattgagc agtataggat ggtaaaagaa 541 gcccttcatg agcgtgccaa cctgctagaa attgctcccc atttatcagc tccattgcct 601 ataatgcttc cagtttacaa gtggtggcag ttaccttact actgggtagg aatcaagctg 661 tatgatttgg ttgcaggaag caattgccta aaaagcagtt atgtcctcag caaatcaaga 721 gcccttgaac atttcccaat gctccagaag gacaaactgg taggagcaat tgtctactat 781 gacggacaac ataacgatgc acggatgaac cttgccattg ctctgactgc tgccaggtat 841 ggggctgcca cagccaatta catggaggta gtgagcttgc tcaagaagac agacccccag 901 acagggaaag tgcatgtgag cggcgcacgg tgcaaggatg tcctcacagg gcaggaattt 961 gacgtgagag ccaaatgtgt tatcaatgcc acgggacctt tcacggactc tgtgcgcaaa 1021 atggatgata aagacgcagc agctatctgc cagccaagtg ctggtgtcca tattgtgatg 1081 cctggttatt acagcccaga gagcatggga cttcttgacc cagcgaccag tgatgggcga 1141 gttattttct tcttaccctg gcaaaagatg acgatcgctg gcactactga tactccaact 1201 gatgttacac accatccaat tccttcagaa gaagatatca acttcatttt gaatgaagtg 1261 cgtaattacc tgagttgtga tgttgaagtg agaagagggg atgtcctggc agcatggagt 1321 ggaatccgtc ctcttgttac agaccccaaa tctgcagata ctcagtctat ctcccgaaat 1381 catgttgttg atatcagtga gagtggcctt attactatag caggtggaaa gtggacaact 1441 tatcggtcta tggcagaaga taccataaat gctgctgtca aaactcataa tttaaaagca 1501 ggaccaagta gaacagttgg gcttttcctt caagggggta aagattggag ccccacactc 1561 tacattaggc ttgtgcagga ttatggactt gaaagcgagg tggcacagca tcttgccgcc 1621 acctatggtg ataaggcctt tgaggtggcc aaaatggcaa gtgtgactgg caaaaggtgg 1681 cctattgttg gagtacatct tgtgtcagaa tttccatata ttgaagcaga ggtgaaatat 1741 gggattaagg agtatgcctg cactgctgtg gatatgattt cacgtcgtac tcgcctggcc 1801 tttctaaatg tccaggcagc agaggaagcc ctacccagga ttgttgaact gatgggcagg 1861 gaactgaatt gggatgatta taagaagcag gaacaacttg aaacagccag gaagtttcta 1921 tattatgaaa tgggctataa atctcgatca gaacagttaa cagatcgctc tgaaattagc 1981 ctactgcctt cagacattga caggtataag aagagatttc ataagtttga tgcagaccag 2041 aaaggcttta ttaccattgt tgatgttcag cgtgtattag agagtatcaa tgtccaaatg 2101 gatgaaaata cactccatga aattctaaat gaagttgatt tgaataaaaa tggacaggtt 2161 gaactcaatg aatttttgca gctgatgagt gctattcaaa aaggaagggt atctggaagc 2221 cggcttgcta tactaatgaa aactgcagaa gagaacctcg acagaagagt tccaattcca 2281 gtggaccgta gttgtggagg attgtgagtc tgggcagtaa atccacagcc aacaaacata 2341 gaaacgacaa atcaccatgt aacaaccaga gatgactgaa accactctga aataatgaat 2401 gtggatagct gcctttttta acactagaaa acattccaaa actttaaggt gttggtgtat 2461 ttgccagctt tatttgctgt actttatttg tatttgccat tcagtctagc ttttaagtat 2521 atttttttct ttttctcatt ttcaatgcac attagttttg catctgtttt gtgacctgtt 2581 agatgtgaca cattctcttt ttgtttattc ccttattc // LOCUS HSU12431 2233 bp mRNA PRI 12-JUL-1995 DEFINITION Human ELAV-like neuronal protein 1 (hel-N1) mRNA, complete cds. ACCESSION U12431 NID g521143 KEYWORDS RNA-binding protein; Drosophila ELAV-like. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 585 to 1661) AUTHORS King,P.H., Levine,T.D., Fremeau,R.T. Jr. and Keene,J.D. TITLE Mammalian homologs of Drosophila ELAV localized to a neuronal subset can bind in vitro to the 3' UTR of mRNA encoding the Id transcriptional repressor JOURNAL J. Neurosci. 14 (4), 1943-1952 (1994) MEDLINE 94210033 REFERENCE 2 (bases 1 to 2233) AUTHORS Keene,J.D. TITLE Direct Submission JOURNAL Submitted (19-JUL-1994) Jack D. Keene, Microbiology, Duke University Medical Center, Room 405, Jones Building, Research Drive, Durham, NC 27710, USA FEATURES Location/Qualifiers source 1..2233 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Stratagene lambda ZAP II cDNA library" /sex="female" /tissue_type="brain" /dev_stage="fetus" 5'UTR 1..584 gene 585..1664 /gene="hel-N1" CDS 585..1664 /gene="hel-N1" /note="similar to Drosophila ELAV protein, Swiss-Prot Accession Number P16914; contains three RNA Recognition Motifs (RRMs) at amino acid positions 40-119, 126-207 and 277-356" /codon_start=1 /function="RNA binding protein" /product="ELAV-like neuronal protein 1" /db_xref="PID:g521144" /translation="METQLSNGPTCNNTANGPTTINNNCSSPVDSGNTEDSKTNLIVN YLPQNMTQEELKSLFGSIGEIESCKLVRDKITGQSLGYGFVNYIDPKDAEKAINTLNG LRLQTKTIKVSYARPSSASIRDANLYVSGLPKTMTQKELEQLFSQYGRIITSRILVDQ VTGISRGVGFIRFDKRIEAEEAIKGLNGQKPPGATEPITVKFANNPSQKTNQAILSQL YQSPNRRYPGPLAQQAQRFRLDNLLNMAYGVKRFSPMTIDGMTSLAGINIPGHPGTGW CIFVYNLAPDADESILWQMFGPFGAVTNVKVIRDFNTNKCKGFGFVTMTNYDEAAMAI RSLNGYRLGDRVLQVSFKTNKTHKA" 3'UTR 1665..2233 BASE COUNT 640 a 435 c 472 g 686 t ORIGIN 1 ataaaaacat tgtatatagt agaccaaatg gtcatagtta ctgtgagcct ggcagagcag 61 aaaaggcagt tgaaggaggc agagaagggg ttggtgatac ggtgctataa atctgtggtc 121 cagtccacct cccttttaag acgtcctccc cctatttctg ttcttagtta gagatgcagc 181 agcttactcc tgtagcgacc tactaaaaag caacaaggag aaagacatcg tctttttgaa 241 aaaacgttct ttcgtctctt ttctttgttc cttggtttgt ttttcttgcc cctttttgtt 301 tagttgaacg gcaataggag ggtagtctct ccgtcttttt aaactctttt ttaagtttcc 361 cctccccttt catatttttt tgggccattt cttttagctt tggactttgg gggtcgaaag 421 cgtttctttt tatttgcttc ttttaagccg agcacagttt aggtttcgtg ctgtcttaag 481 agaactatcc agcagcttct tgctcatcct tattgggaga actgcaccgt tactttaaaa 541 acacacatac acaaaaacct taagggagaa aggtaattgc tgccatggaa acacaactgt 601 ctaatgggcc aacttgcaat aacacagcca atggtccaac caccataaac aacaactgtt 661 cgtcaccagt tgactctggg aacacagaag acagcaagac caacttaata gtcaactacc 721 ttcctcagaa catgacacag gaggaactaa agagtctctt tgggagcatt ggtgaaatag 781 agtcctgtaa gcttgtaaga gacaaaataa cagggcagag cttgggatat ggctttgtga 841 actacattga ccccaaggat gcagagaaag ctatcaacac cctgaatgga ttgagacttc 901 aaaccaaaac aataaaagtt tcctatgctc gcccaagttc agcttctatc agagatgcaa 961 atttatatgt cagcggactt ccaaaaacaa tgacccagaa ggagttggaa cagctttttt 1021 cacaatatgg acgcattatt acttctcgta ttcttgtcga ccaggtcact ggcatatcaa 1081 ggggtgtagg gtttattcga tttgacaagc gaattgaggc agaagaagct atcaaaggcc 1141 taaatggcca gaaacctccc ggtgccacgg agccaatcac tgtaaagttt gctaataacc 1201 caagccaaaa aaccaatcag gccatccttt cccagctgta ccagtctcca aacagaaggt 1261 atccaggacc gctagctcag caggcacagc gttttaggtt ggacaatctg ctcaatatgg 1321 cttatggagt aaagaggttt tctccaatga ccattgacgg aatgaccagt ttggctggaa 1381 ttaatatccc tgggcaccct ggaacagggt ggtgtatatt tgtgtacaac ctggctcctg 1441 acgcagatga gagtatcctg tggcaaatgt ttgggccttt tggagctgtc accaatgtga 1501 aggtcatccg tgactttaac accaataaat gcaaaggttt tggatttgtg actatgacaa 1561 actatgatga ggctgccatg gcgatacgta gcctcaatgg ataccgtctg ggagacagag 1621 tactgcaggt ctcctttaag acaaacaaaa cgcacaaagc ctaatgagct cttgtcctca 1681 gtccatttat atatgaaaac tatacaacaa aggcaagtta agagaaactt tatacattag 1741 taaatgtctt tgtaagtcag tgttgagatg gggataaaat gactacttag catcctaaga 1801 aatatgtgag attttttatt gctagtattt gaattaaaac ttcttaaata tcttttatgt 1861 tgaatatgga caagaggtac agggttttac ctgtcacatt gcattctatt gccttctttg 1921 aagaaggtgg accttttaaa gtgtttcagc taagggaaga catttctttt ctttttacat 1981 aactgccttg aacctgtgag taaatattga ggctttgtgt tgtaattctt cagttggttg 2041 tgtctttttt ttcccccctt tttccttttt ctgattagct ttgtgtttgg tttacattta 2101 aagcattgct gttatgtctg tttaagaaaa gtattttgaa gtttacattt ttatttatga 2161 agtttaaaac agtatttatt ttgtaattat gatttgggtt ggggaagggg gggctacatt 2221 ataaacgctt agg // LOCUS HSU12465 433 bp mRNA PRI 01-NOV-1994 DEFINITION Human ribosomal protein L35 mRNA, complete cds. ACCESSION U12465 NID g562073 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 433) AUTHORS Patel,S.K., Chandraratna,R. and Nagpal,S. TITLE Human cDNA sequence of ribosomal protein L35 JOURNAL Unpublished REFERENCE 2 (bases 1 to 433) AUTHORS Patel,S.K. TITLE Direct Submission JOURNAL Submitted (20-JUL-1994) Sheetal K. Patel, Retinoid Research, Allergan Pharmaceuticals, 2525 Dupont Drive, Irvine, CA 92713, USA FEATURES Location/Qualifiers source 1..433 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="skin raft culture cDNA library" CDS 28..399 /codon_start=1 /product="ribosomal protein L35" /db_xref="PID:g562074" /translation="MAKIKARDLRGKKKEELLKQLDDLKVELSQLRVAKVTGGAASKL SKIRVVRKSIARVLTVINQTQKENLRKFYKGKKYKPLDLRPKKTRAMRRRLNKHEENL KTKKQQRKERLYPLRKYAVKA" polyA_signal 413..418 BASE COUNT 118 a 118 c 133 g 64 t ORIGIN 1 ccacgcgtcc gggcggcttg tgcagcaatg gccaagatca aggctcgaga tcttcgcggg 61 aagaagaagg aggagctgct gaaacagctg gacgacctga aggtggagct gtcccagctg 121 cgcgtcgcca aagtgacagg cggtgcggcc tccaagctct ctaagatccg agtcgtccgg 181 aaatccattg cccgtgttct cacagttatt aaccagactc agaaagaaaa cctcaggaaa 241 ttctacaagg gcaagaagta caagcccctg gacctgcggc ctaagaagac acgtgccatg 301 cgccgccggc tcaacaagca cgaggagaac ctgaagacca agaagcagca gcggaaggag 361 cggctgtacc cgctgcggaa gtacgcggtc aaggcctgag gggcgcattg tcaataaagc 421 acagctggct gag // LOCUS HSU12507 1653 bp mRNA PRI 20-JAN-1995 DEFINITION Human cardiac inward rectifier potassium channel (HH-IRK1) mRNA, complete cds. ACCESSION U12507 NID g625091 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1653) AUTHORS Raab-Graham,K.F., Radeke,C.M. and Vandenberg,C.A. TITLE Molecular cloning and expression of a human heart inward rectifier potassium channel JOURNAL Neuroreport 5 (18), 2501-2505 (1994) MEDLINE 95210614 REFERENCE 2 (bases 1 to 1653) AUTHORS Vandenberg,C.A. TITLE Direct Submission JOURNAL Submitted (20-JUL-1994) Carol A. Vandenberg, Biological Sciences, University of California, Santa Barbara, CA 93106, USA FEATURES Location/Qualifiers source 1..1653 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /tissue_type="heart" /dev_stage="adult" gene 326..1609 /gene="HH-IRK1" CDS 326..1609 /gene="HH-IRK1" /codon_start=1 /product="cardiac inward rectifier potassium channel" /db_xref="PID:g625092" /translation="MGSVRTNRYSIVSSEEDGMKLATMAVANGFGNGKSKVHTRQQCR SRFVKKDGHCNVQFINVGEKGQRYLADIFTTCVDIRWRWMLVIFCLAFVLSWLFFGCV FWLIALLHGDLDASKEGKACVSEVNSFTAAFLFSIETQTTIGYGFRCVTDECPIAVFM VVFQSIVGCIIDAFIIGAVMAKMAKPKKRNETLVFSHNAVIAMRDGKLCLMWRVGNLR KSHLVEAHVRAQLLKSRITSEGEYIPLDQIDINVGFDSGIDRIFLVSPITIVHEIDED SPLYDLSKQDIDNADFEIVVILEGMVEATAMTTQCRSSYLANEILWGHRYEPVLFEEK HYYKVDYSRFHKTYEVPNTPLCSARDLAEKKYILSNANSFCYENEVALTSKEEDDSEN GVPESTSTDTPPDIDLHNQASVPLEPRPLRRESEI" BASE COUNT 435 a 400 c 396 g 422 t ORIGIN 1 gaattctggt ttgctttggc tcactcgctt tttacaaacc actggatctt acatgcctct 61 gtacccccca cttccactcc atgtccccat gctcctgcgc cagcaacagg acatgttctc 121 tggatgtcag ctgagtcatt aaagtaactc tgcatgtcag tagacagacc ttggtagaac 181 cacaaggctc ccagagacac ccatctctcc tcattttttt ggtgtgtgtg tcttcaccga 241 acattcaaaa ctgtttctcc aaagcgtttt gcaaaaactc agactgtttt ccaaagcaga 301 agcactggag tccccagcag aagcgatggg cagtgtgcga accaaccgct acagcatcgt 361 ctcttcagaa gaagacggta tgaagttggc caccatggca gttgcaaatg gctttgggaa 421 cgggaagagt aaagtccaca cccgacaaca gtgcaggagc cgctttgtga agaaagatgg 481 ccactgtaat gttcagttca tcaatgtggg tgagaagggg caacggtacc tcgcagacat 541 cttcaccacg tgtgtggaca ttcgctggcg gtggatgctg gttatcttct gcctggcttt 601 cgtcctgtca tggctgtttt ttggctgtgt gttttggttg atagctctgc tccatgggga 661 cctggatgca tccaaagagg gcaaagcttg tgtgtccgag gtcaacagct tcacggctgc 721 cttcctcttc tccattgaga cccagacaac cataggctat ggtttcagat gtgtcacgga 781 tgaatgccca attgctgttt tcatggtggt gttccagtca atcgtgggct gcatcatcga 841 tgctttcatc attggcgcag tcatggccaa gatggcaaag ccaaagaaga gaaacgagac 901 tcttgtcttc agtcacaatg ccgtgattgc catgagagac ggcaagctgt gtttgatgtg 961 gcgagtgggc aatcttcgga aaagccactt ggtggaagct catgttcgag cacagctcct 1021 caaatccaga attacttctg aaggggagta tatccctctg gatcaaatag acatcaatgt 1081 tgggtttgac agtggaatcg atcgtatatt tctggtgtcc ccaatcacta tagtccatga 1141 aatagatgaa gacagtcctt tatatgattt gagtaaacag gacattgaca acgcagactt 1201 tgaaatcgtg gtcatactgg aaggcatggt ggaagccact gccatgacga cacagtgccg 1261 tagctcttat ctagcaaatg aaatcctgtg gggccaccgc tatgagcctg tgctctttga 1321 agagaagcac tactacaaag tggactattc caggttccac aaaacttacg aagtccccaa 1381 cactcccctt tgtagtgcca gagacttagc agaaaagaaa tatatcctct caaatgcaaa 1441 ttcattttgc tatgaaaatg aagttgccct cacaagcaaa gaggaagacg acagtgaaaa 1501 tggagttcca gaaagcacta gtacggacac gccccctgac atagaccttc acaaccaggc 1561 aagtgtacct ctagagccca ggcccttacg gcgagagtcg gagatatgac tgactgattc 1621 cttctctgga atagttactt tacaacacgg tct // LOCUS HSU12512 1082 bp mRNA PRI 07-SEP-1994 DEFINITION Human bradykinin receptor B1 subtype mRNA, complete cds. ACCESSION U12512 NID g535478 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1082) AUTHORS Menke,J.G., Borkowski,J.A., Bierilo,K.K., MacNeil,T., Derrick,A.W., Schneck,K.A., Ransom,R.W., Strader,C.D., Linemeyer,D.L. and Hess,J.F. TITLE Expression cloning of a human B1 bradykinin receptor JOURNAL J. Biol. Chem. 269, 21583-21586 (1994) MEDLINE 94342346 REFERENCE 2 (bases 1 to 1082) AUTHORS Elliston,K.O. TITLE Direct Submission JOURNAL Submitted (21-JUL-1994) Keith O. Elliston, Bioinformatics, Merck Research Laboratories, Box 2000, Rahway, NJ 07065, USA FEATURES Location/Qualifiers source 1..1082 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="IMR-90" /cell_type="embryonic fibroblasts" CDS 7..1068 /codon_start=1 /product="bradykinin receptor B1 subtype" /db_xref="PID:g535479" /translation="MASSWPPLELQSSNQSQLFPQNATACDNAPEAWDLLHRVLPTFI ISICFFGLLGNLFVLLVFLLPRRQLNVAEIYLANLAASDLVFVLGLPFWAENIWNQFN WPFGALLCRVINGVIKANLFISIFLVVAISQDRYRVLVHPMASGRQQRRRQARVTCVL IWVVGGLLSIPTFLLRSIQAVPDLNITACILLLPHEAWHFARIVELNILGFLLPLAAI VFFNYHILASLRTREEVSRTRVRGPKDSKTTALILTLVVAFLVCWAPYHFFAFLEFLF QVQAVRGCFWEDFIDLGLQLANFFAFTNSSLNPVIYVFVGRLFRTKVWELYKQCTPKS LAPISSSHRKEIFQLFWRN" BASE COUNT 210 a 333 c 264 g 275 t ORIGIN 1 ctgtgcatgg catcatcctg gccccctcta gagctccaat cctccaacca gagccagctc 61 ttccctcaaa atgctacggc ctgtgacaat gctccagaag cctgggacct gctgcacaga 121 gtgctgccga catttatcat ctccatctgt ttcttcggcc tcctagggaa cctttttgtc 181 ctgttggtct tcctcctgcc ccggcggcaa ctgaacgtgg cagaaatcta cctggccaac 241 ctggcagcct ctgatctggt gtttgtcttg ggcttgccct tctgggcaga gaatatctgg 301 aaccagttta actggccttt cggagccctc ctctgccgtg tcatcaacgg ggtcatcaag 361 gccaatttgt tcatcagcat cttcctggtg gtggccatca gccaggaccg ctaccgcgtg 421 ctggtgcacc ctatggccag cggaaggcag cagcggcgga ggcaggcccg ggtcacctgc 481 gtgctcatct gggttgtggg gggcctcttg agcatcccca cattcctgct gcgatccatc 541 caagccgtcc cagatctgaa catcaccgcc tgcatcctgc tcctccccca tgaggcctgg 601 cactttgcaa ggattgtgga gttaaatatt ctgggtttcc tcctaccact ggctgcgatc 661 gtcttcttca actaccacat cctggcctcc ctgcgaacgc gggaggaggt cagcaggaca 721 agagtgcggg ggccgaagga tagcaagacc acagcgctga tcctcacgct cgtggttgcc 781 ttcctggtct gctgggcccc ttaccacttc tttgccttcc tggaattctt attccaggtg 841 caagcagtcc gaggctgctt ttgggaggac ttcattgacc tgggcctgca attggccaac 901 ttctttgcct tcactaacag ctccctgaat ccagtaattt atgtctttgt gggccggctc 961 ttcaggacca aggtctggga actttataaa caatgcaccc ctaaaagtct tgctccaata 1021 tcttcatccc ataggaaaga aatcttccaa cttttctggc ggaattaaaa cagcattgaa 1081 cc // LOCUS HSU12535 3832 bp mRNA PRI 18-FEB-1995 DEFINITION Human epidermal growth factor receptor kinase substrate (Eps8) mRNA, complete cds. ACCESSION U12535 NID g530822 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3832) AUTHORS Wong,W.T., Carlomagno,F., Druck,T., Barletta,C., Croce,C.M., Huebner,K., Kraus,M.H. and Di Fiore,P.P. TITLE Evolutionary conservation of the EPS8 gene and its mapping to human chromosome 12q23-q24 JOURNAL Oncogene 9 (10), 3057-3061 (1994) MEDLINE 94366758 REFERENCE 2 (bases 1 to 3832) AUTHORS Di Fiore,P. TITLE Direct Submission JOURNAL Submitted (21-JUL-1994) Pier Paolo Di Fiore, Lab. Cellular & Molecular Biology, National Cancer Institute, Building 37, Room 1D23, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..3832 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="12" /map="12q23-q24" gene 210..2678 /gene="Eps8" CDS 210..2678 /gene="Eps8" /codon_start=1 /product="epidermal growth factor receptor kinase substrate" /db_xref="PID:g530823" /translation="MNGHISNHPSSFGMYPSQMNGYGSSPTFSQTDREHGSKTSAKAL YEQRKNYARDSVSSVSDISQYRVEHLTTFVLDRKDAMITVDDGIRKLKLLDAKGKVWT QDMILQVDDRAVSLIDLESKNELENFPLNTIQHCQAVMHSCSYDSVLALVCKEPTQNK PDLHLFQCDEVKANLISEDIESAISDSKGGKQKRRPDALRMISNADPSIPPPPRAPAP APPGTVTQVDVRSRVAAWSAWAADQGDFEKPRQYHEQEETPEMMAARIDRDVQILNHI LDDIEFFITKLQKAAEAFSELSKRKKNKKGKRKGPGEGVLTLRAKPPPPDEFLDCFQK FKHGFNLLAKLKSHIQNPSAADLVHFLFTPLNMVVQATGGPELASSVLSPLLNKDTID FLNYTVNGDERQLWMSLGGTWMKARAEWPKEQFIPPYVPRFRNGWEPPMLNFMGATME QDLYQLAESVANVAEHQRKQEIKRLSTEHSSVSEYHPADGYAFSSNIYTRGSHLDQGE AAVAFKPTSNRHIDRNYEPLKTQPKKYAKSKYDFVARNNSELSVLKDDILEILDDRKQ WWKVRNASGDSGFVPNNILDIVRPPESGLGRADPPYTHTIQKQRMEYGPRPADTPPAP SPPPTPAPVPVPLPPSTPAPVPVSKVPANITRQNSSSSDSGGSIVRDSQRHKQLPVDR RKSQMEEVQDELIHRLTIGRSAAQKKFHVPRQNVPVINITYDSTPEDVKTWLQSKGFN PVTVNSLGVLNGAQLFSLNKDELRTVCPEGARVYSQITVQKAALEDSSGSSELQEIMR RRQEKISAAASDSGVESFDEGSSH" BASE COUNT 1171 a 781 c 840 g 1040 t ORIGIN 1 cgcgaggccg gcgctgtgct cgcctcggag atcgctgctc tttagctggg tgcagaaggc 61 ggctccgcgg ctcgcggacg actggctggg cgcgaatcag attggggggc tttctcccgg 121 tcccctccca cctcgtctgg gctcgcggcg tctccgggga aagccgtggc cccgagggcg 181 gatccgagaa cacacaagtg aaagacacaa tgaatggtca tatttctaat catcccagta 241 gttttggaat gtacccatct cagatgaatg gctacggatc atcacctacc ttttcccaga 301 cggacagaga acatggttca aaaacaagtg caaaggccct ttatgaacaa aggaagaatt 361 atgcacggga cagtgtcagc agtgtgtcag atatatctca ataccgtgtt gaacacttga 421 ctacctttgt cctggatcgg aaagatgcta tgatcactgt tgatgatgga ataaggaaat 481 tgaaattgct tgatgccaag ggcaaagtgt ggactcaaga tatgattctt caagtggatg 541 acagagctgt gagcctgatt gatttagaat caaagaatga actggagaat tttcctttaa 601 acacaatcca gcactgccaa gctgtgatgc attcatgcag ctatgattca gttcttgcac 661 tggtgtgcaa agagccaacc cagaacaagc cagatcttca tctcttccag tgtgatgagg 721 ttaaggcaaa cctaattagt gaagatattg aaagtgcaat cagtgacagt aaaggaggga 781 aacagaagag gcggcccgac gccctgagga tgatttccaa tgcagaccct agtataccgc 841 ctccacccag agctcctgcc cctgcgcccc ctgggaccgt cacccaggtg gatgttagaa 901 gtcgagtggc agcctggtct gcatgggcag ccgaccaagg ggactttgag aaaccaaggc 961 agtatcatga gcaggaagaa acacctgaga tgatggcagc ccgcattgac agagatgtgc 1021 aaatcttaaa ccacattttg gatgacattg aattttttat cacaaaactc caaaaagcag 1081 cagaagcatt ttctgagctt tctaaaagga agaaaaacaa gaaaggtaaa aggaaaggac 1141 caggagaggg tgttttaacg ctgcgggcaa aacctccacc tcctgatgaa tttcttgact 1201 gtttccaaaa gtttaaacac ggatttaacc ttctggccaa actgaagtct catattcaga 1261 atcctagtgc tgcagatttg gttcactttt tgtttactcc attaaatatg gtggtgcagg 1321 caacaggagg tcctgaacta gccagttcag tacttagtcc cctattgaat aaggacacaa 1381 ttgatttctt aaattatact gtcaatggtg atgaacggca gctgtggatg tcattgggag 1441 gaacttggat gaaagccaga gcagagtggc caaaagaaca gtttattcca ccatatgttc 1501 cacgattccg caatggctgg gagcccccaa tgctgaactt tatgggagcc acaatggaac 1561 aagatcttta tcaactggca gaatctgtgg caaatgtagc agaacatcag cgcaaacagg 1621 aaataaaaag attatccaca gagcattcca gtgtatcaga gtatcatcca gccgatggct 1681 atgcgttcag tagcaacatt tacacaagag gatcccacct ggaccaaggg gaagctgctg 1741 ttgcttttaa gccaacttct aatcgccata tagatagaaa ttatgaacca ctcaaaacac 1801 aacccaagaa atatgccaaa tccaagtatg actttgtagc aaggaacaac agtgagctct 1861 cggttctaaa ggatgatatt ttagagatac ttgatgatcg gaagcaatgg tggaaagttc 1921 gaaatgcaag tggagactct ggatttgtgc caaataacat tttggatatt gtgagacctc 1981 cagaatctgg attggggcgt gctgatccac cttatactca tactatacag aaacaaagga 2041 tggagtatgg cccaagacca gctgatactc cccctgctcc atcacctcct ccaacaccag 2101 ctcctgttcc tgttcccctt cccccttcca ctccagcacc tgttcctgtg tcaaaggtcc 2161 cagcaaatat aacacgtcaa aacagcagct ccagtgacag tggtggcagt atcgtgcgag 2221 acagccagag acacaaacaa cttccggtgg accgaaggaa atctcagatg gaggaagtgc 2281 aagatgaact catccacaga ctgaccattg gtcggagtgc cgctcagaag aaattccatg 2341 tgccacggca gaacgtgcca gttatcaata tcacttacga ctccacacca gaggatgtga 2401 agacgtggtt acagtcaaag ggattcaacc ctgtgactgt caatagtctt ggagtattaa 2461 atggtgcaca acttttctct ctcaataagg atgaactgag gacagtctgc cctgaagggg 2521 cgagagtcta tagccaaatc actgtacaaa aagctgcatt ggaggatagc agtggcagct 2581 ccgagttaca agaaattatg agaagacgac aggaaaaaat cagtgctgcc gctagtgatt 2641 caggagtgga atcttttgat gaaggaagca gtcactaatt tgtttgtttg tatttaaact 2701 ccattgtttt tggcattatt ccaacatgct ttgttttaag aagccttgaa gggaatgtca 2761 gattcatttt tcttgatgta atttatcacc ataaaaaaaa aacccatgca aacctgagtg 2821 agcacaggat ttgcttctag gcccattatt tttattaaaa ctgaaaaaat ttaaactgaa 2881 ttttttgacc ttggaaaata tttttcttac tttaccaagg tgaagtttcc ttaattagac 2941 taattatttt atccccatcc cagggtataa acaggaattg ttttgatagt ggtggagtta 3001 ttcactgcaa caaagcaaca atgttgtcca tgattcaaaa tctaagcagt ttcgattttg 3061 cctgtgaata tggtgtctgt cattcagggc atagctcact gtaggctagc ctctgcttac 3121 ttaagtctct tctctgacat actcaatgga agaatattta gatttattta aagttcttaa 3181 tgccaacagt ttaaaaaaaa attaaaacat ttgaatgaac tgtaaagtac agccatacct 3241 tggacatgca aatataaatc tatggagcat tctcaagaca gtttgtcatg gctctgttga 3301 ttgcaactcc ttgtatagct tgtattttga tttagtttat attctgctta ttatgtatac 3361 tgtgttctta tatatgagaa agcacaaatg cgaaagaggt catgtcttct caaaatctag 3421 caaaggaagt agtctgcatt ggtgtgcatt acagtatttt gcttaatgaa agcctcagtt 3481 ctgaatgttg atatgagtag ttaaaaggaa gtggggccat tttatgtgtt tatctgtgtc 3541 aagtatttct ggtaataaga agcacttaat ttacacatat tttaatcctg tgaaagattc 3601 cacatagaga aaagaaagat acctaacctt caacaaatgt tatttttgga aacacaattt 3661 ttgtcattaa atgttatatt atttcacata tataaaacag atgttatgta agaatgttgt 3721 atattttaac ataaatcatt tagagaaatt atctagattc attaattttc atagtgcctt 3781 tttcacatga gtcagctgga aagtctgcaa taaacagtat ttgctgtctg tt // LOCUS HSU12596 2828 bp mRNA PRI 16-FEB-1996 DEFINITION Human tumor necrosis factor type 1 receptor associated protein (TRAP2) mRNA, complete cds. ACCESSION U12596 NID g687238 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2828) AUTHORS Song,H.Y., Dunbar,J.D., Zhang,Y.X., Guo,D. and Donner,D.B. TITLE Identification of a protein with homology to hsp90 that binds the type 1 tumor necrosis factor receptor JOURNAL J. Biol. Chem. 270 (8), 3574-3581 (1995) MEDLINE 95181307 REFERENCE 2 (bases 1 to 2828) AUTHORS Song,H.Y. TITLE Direct Submission JOURNAL Submitted (21-JUL-1994) Ho Y. Song, Physiology and Walther Oncology Center, Indiana University School of Medicine, 975 W. Walnut St., Indianapolis, IN 46202, USA FEATURES Location/Qualifiers source 1..2828 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Hela S3" gene 113..2674 /gene="TRAP2" CDS 113..2674 /gene="TRAP2" /note="TNF type 1 receptor associated protein" /codon_start=1 /function="binds to the intracellular domain of human tumor necrosis factor type 1 receptor" /product="tumor necrosis factor type 1 receptor associated protein" /db_xref="PID:g687239" /translation="MLVERLGEKDTSLYRPALEELRRQIRSSTTSMTSVPKPLKFLRP HYGKLKEIYENMAPGENKRFAADIISVLAMTMSGERECLKYRLVGSQEELASWGHEYV RHLAGEVAKEWQELDDAEKVQREPLLTLVKEIVPYNMAHNAEHEACDLLMEIEQVDML EKDIDENASAKVCLYLTSCVNYVPEPENSALLRCALGVFRKFSRFPEALRLALMLNDM ELVEDSSSCKDVVVQKQMAFMLGRHGVFLELSEDVEEYEDLTEIMSNVQLNSNFLALA RELDIMEPKVPDDIYKTHLENNRFGGSGSQVDSARMNLASSFVNGFVNAAFGQDKLLT DDGNKWLYKNKDHGMLSAAASLGMILLWDVDGGLTQIDKYLYSSEDYIKSGALLACGI VNSGVRNECDPALALLSDYVLHNSNTMRLGSIFGLGLAYAGSNREDVLTLLLPVMGDS KSSMEVAGVTALACGMIAVGSCNGDVTSTILQTIMEKSETELKDTYARWLPLGLGLNH LGKGEAIEAILAALEVVSEPFRSFANTLVDVCAYAGSGNVLKVQQLLHICSEHFDSKE KEEDKDKKEKKDKDKKEAPADMGAHQGVAVLGIALIAMGEEIGAEMALRTFGHLLRYG EPTLRRAVPLALALISVSNPRLNILDTLSKFSHDADPEVSYNSIFAMGMGMVGSGTNN ARLAAMLRQLAQYHAKDPNNLFMVRLAQGLTHLGKGTLTLCPYHSDRQLMSQVAVAGL LTVLVSFLDVRNIILGKSHYVLYGLVAAMQPRMLVTFDEELRPLPVSVRVGQAVDVVG QAGKPKTITGFQTHTTPVLLAHGERAELATEEFLPVTPILEGFVIFGRTPIMISK" BASE COUNT 675 a 665 c 811 g 677 t ORIGIN 1 ttccggcggc acggacgaga agccgagcgg caaggggcgg cgggatgccg gggacaagga 61 caaagaactg gagctgtctg aagaggataa acagcttcaa gatgaactgg tgatgctcgt 121 ggaacgacta ggggagaagg atacatccct gtatcgacca gcgctggagg aattgcgaag 181 gcagattcgt tcttctacaa cttccatgac ttcagtgccc aagcctctca aatttctgcg 241 tccacactat ggcaaactga aggaaatcta tgagaacatg gcccctgggg agaataagcg 301 ttttgctgct gacatcatct ccgttttggc catgaccatg agtggggagc gtgagtgcct 361 caagtatcgg ctagtgggct cccaggagga attggcatca tggggtcatg agtatgtcag 421 gcatctggca ggagaagtgg ctaaggagtg gcaggagctg gatgacgcag agaaggtcca 481 gcgggagcct ctgctcactc tggtgaagga aatcgtcccc tataacatgg cccacaatgc 541 agagcatgag gcttgcgacc tgcttatgga aattgagcag gtggacatgc tggagaagga 601 cattgatgaa aatgcatctg caaaggtctg cctttatctc accagttgtg taaattacgt 661 gcctgagcct gagaactcag ccctactgcg ttgtgccctg ggtgtgttcc gaaagtttag 721 ccgcttccct gaagctctga gattggcatt gatgctcaat gacatggagt tggtagaaga 781 ctcttcctcc tgcaaggatg tggtagtaca gaaacagatg gcattcatgc taggccggca 841 tggggtgttc ctggagctga gtgaagatgt cgaggagtat gaggacctga cagagatcat 901 gtccaatgta cagctcaaca gcaacttctt ggccttagct cgggagctgg acatcatgga 961 gcccaaggtg cctgatgaca tctacaaaac ccacctagag aacaacaggt ttgggggcag 1021 tggctctcag gtggactctg cccgcatgaa cctggcctcc tcttttgtga atggctttgt 1081 gaatgcagct tttggccaag acaagctgct aacagatgat ggcaacaaat ggctttacaa 1141 gaacaaggac cacggaatgt tgagtgcagc tgcatctctt gggatgattc tgctgtggga 1201 tgtggatggt ggcctcaccc agattgacaa gtacctgtac tcctctgagg actacattaa 1261 gtcaggagct cttcttgcct gtggcatagt gaactctggg gtccggaatg agtgtgaccc 1321 tgctctggca ctgctctcag actatgttct ccacaacagc aacaccatga gacttggttc 1381 catctttggg ctaggcttgg cttatgctgg ctcaaatcgt gaagatgtcc taacactgct 1441 gctgcctgtg atgggagatt caaagtccag catggaggtg gcaggtgtca cagctttagc 1501 ctgtggaatg atagcagtag ggtcctgcaa tggagatgta acttccacta tccttcagac 1561 catcatggag aagtcagaga ctgagctcaa ggatacttat gctcgttggc ttcctcttgg 1621 actgggtctc aaccacctgg ggaagggtga ggccatcgag gcaatcctgg ctgcactgga 1681 ggttgtgtca gagccattcc gcagttttgc caacacactg gtggatgtgt gtgcatatgc 1741 aggctctggg aatgtgctga aggtgcagca gctgctccac atttgtagcg aacactttga 1801 ctccaaagag aaggaggaag acaaagacaa gaaggaaaag aaagacaagg acaagaagga 1861 agcccctgct gacatgggag cacatcaggg agtggctgtt ctggggattg cccttattgc 1921 tatgggggag gagattggtg cagagatggc attacgaacc tttggccact tgctgagata 1981 tggggagcct acactccgga gggctgtacc tttagcactg gccctcatct ctgtttcaaa 2041 tccacgactc aacatcctgg ataccctaag caaattctct catgatgctg atccagaagt 2101 ttcctataac tccatttttg ccatgggcat gggcatggtg ggcagtggta ccaataatgc 2161 ccgtctggct gcaatgctgc gccagttagc tcaatatcat gccaaggacc caaacaacct 2221 cttcatggtg cgcttggcac agggcctgac acatttaggg aagggcaccc ttaccctctg 2281 cccctaccac agcgaccggc agcttatgag ccaggtggcc gtggctggac tgctcactgt 2341 gcttgtctct ttcctggatg ttcgaaacat tattctaggc aaatcacact atgtattgta 2401 tgggctggtg gctgccatgc agccccgaat gctggttacg tttgatgagg agctgcggcc 2461 attgccagtg tctgtccgtg tgggccaggc agtggatgtg gtgggccagg ctggcaagcc 2521 gaagactatc acagggttcc agacgcatac aaccccagtg ttgttggccc acggggaacg 2581 ggcagaattg gccactgagg agtttcttcc tgttaccccc attctggaag gttttgttat 2641 cttcggaaga accccaatta tgatctctaa gtgaccacca ggggctctga actgtagctg 2701 atgttatcag caggccatgc atcctgctgc caagggtgga cacggctgca gacttctggg 2761 ggaattgtcg cctcctgctc ttttgttact gagtgagata aggttgttca ataaagactt 2821 ttatcccc // LOCUS HSU12597 2262 bp mRNA PRI 16-FEB-1996 DEFINITION Human tumor necrosis factor type 2 receptor associated protein (TRAP3) mRNA, complete cds. ACCESSION U12597 NID g975272 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2262) AUTHORS Song,H.Y. and Donner,D.B. TITLE Association of a RING finger protein with the cytoplasmic domain of the human type-2 tumour necrosis factor receptor JOURNAL Biochem. J. 309 (Pt 3), 825-829 (1995) MEDLINE 95366958 REFERENCE 2 (bases 655 to 1560) AUTHORS Rothe,M., Wong,S.C., Henzel,W.J. and Goeddel,D.V. TITLE A novel family of putative signal transducers associated with the cytoplasmic domain of the 75 kDa tumor necrosis factor receptor JOURNAL Cell 78 (4), 681-692 (1994) MEDLINE 94349371 REFERENCE 3 (bases 1 to 2262) AUTHORS Song,H.Y. TITLE Direct Submission JOURNAL Submitted (21-JUL-1994) Ho Y. Song, Physiology and Walther Oncology Center, Indiana University School of Medicine, 975 W. Walnut St., Indianapolis, IN 46202, USA FEATURES Location/Qualifiers source 1..2262 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Hela S3" gene 55..1560 /gene="TRAP3" CDS 55..1560 /gene="TRAP3" /note="TNF type 2 receptor binding protein" /codon_start=1 /evidence=experimental /product="tumor necrosis factor type 2 receptor associated protein 3" /db_xref="PID:g975273" /translation="MAAASVTPPGSLELLQPGFSKTLLGTKLEAKYLCSACRNVLRRP FQAQCGHRYCSFCLASILSSGPQNCAACVHEGIYEEGISILESSSAFPDNAARREVES LPAVCPSDGCTWKGTLKEYESCHEGRCPLMLTECPACKGLVRLGEKERHLEHECPERS LSCRHCRAPCCGADVKAHHEVCPKFPLTCDGCGKKKIPREKFQDHVKTCGKCRVPCRF HAIGCLETVEGEKQQEHEVQWLREHLAMLLSSVLEAKPLLGDQSHAGSELLQRCESLE KKTATFENIVCVLNREVERVAMTAEACSRQHRLDQDKIEALSSKVQQLERSIGLKDLA MADLEQKVRPFQAQCGHRYCSFCLASILRKLQEAVAGRIPAIFSPAFYTSRYGYKMCL RIYLNGDGTGRGTHLSLFFVVMKGPNDALLRWPFNQKVTLMLLDQNNREHVIDAFRPD VTSSSFQRPVNDMNIASGCPLFCPVSKMEAKNSYVRDDAIFIKAIVDLTGL" misc_feature 154..330 /gene="TRAP3" /note="encodes ring finger motif" misc_feature 445..819 /gene="TRAP3" /note="cysteine-histidine rich region" misc_feature 679..723 /gene="TRAP3" /note="encodes zinc finger-like motif" misc_feature 928..1680 /note="encodes TRAF domain" /citation=[2] BASE COUNT 452 a 664 c 727 g 419 t ORIGIN 1 gaattccggc gcgctgcgac cgttggggct ttgttcgcgg gggtcacagc tctcatggct 61 gcagctagcg tgaccccccc tggctccctg gagttgctac agcccggctt ctccaagacc 121 ctcctgggga ccaagctgga agccaagtac ctgtgctccg cctgcagaaa cgtcctccgc 181 aggcccttcc aggcgcagtg tggccaccgg tactgctcct tctgcctggc cagcatcctc 241 agctctgggc ctcagaactg tgctgcctgt gttcacgagg gcatatatga agaaggcatt 301 tctattttag aaagcagttc ggccttccca gataatgctg cccgcaggga ggtggagagc 361 ctgccggccg tctgtcccag tgatggatgc acctggaagg ggaccctgaa agaatacgag 421 agctgccacg aaggccgctg cccgctcatg ctgaccgaat gtcccgcgtg taaaggcctg 481 gtccgccttg gtgaaaagga gcgccacctg gagcacgagt gcccggagag aagcctgagc 541 tgccggcatt gccgggcacc ctgctgcgga gcagacgtga aggcgcacca cgaggtctgc 601 cccaagttcc ccttaacttg tgacggctgc ggcaagaaga agatcccccg ggagaagttt 661 caggaccacg tcaagacttg tggcaagtgt cgagtccctt gcagattcca cgccatcggc 721 tgcctcgaga cggtagaggg tgagaaacag caggagcacg aggtgcagtg gctgcgggag 781 cacctggcca tgctactgag ctcggtgctg gaggcaaagc ccctcttggg agaccagagc 841 cacgcggggt cagagctcct gcagaggtgc gagagcctgg agaagaagac ggccactttt 901 gagaacattg tctgcgtcct gaaccgggag gtggagaggg tggccatgac tgccgaggcc 961 tgcagccggc agcaccggct ggaccaagac aagattgaag ccctgagtag caaggtgcag 1021 cagctggaga ggagcattgg cctcaaggac ctggcgatgg ctgacttgga gcagaaggtc 1081 aggcccttcc aggcgcagtg tggccaccgg tactgctcct tctgcctggc cagcatcctc 1141 aggaagctcc aggaagctgt ggctggccgc atacccgcca tcttctcccc agccttctac 1201 accagcaggt acggctacaa gatgtgtctg cgtatctacc tgaacggcga cggcaccggg 1261 cgaggaacac acctgtccct cttctttgtg gtgatgaagg gcccgaatga cgccctgctg 1321 cggtggccct tcaaccagaa ggtgacctta atgctgctcg accagaataa ccgggagcac 1381 gtgattgacg ccttcaggcc cgacgtgact tcatcctctt ttcagaggcc agtcaacgac 1441 atgaacatcg caagcggctg ccccctcttc tgccccgtct ccaagatgga ggcaaagaat 1501 tcctacgtgc gggacgatgc catcttcatc aaggccattg tggacctgac agggctctaa 1561 ctgcccccta ctggtgtctg ggggttgggg gcagccaggc acagccggct cacggagggg 1621 ccaccacgct gggccagggt ctcactgtac aagtgggcag gggccccgct tgggcgcttg 1681 ggagggtgtc ggcctgcagc caagttcact gtcacggggg aaggagccac cagccagtcc 1741 tcagatttca gagactgcgg aggggcttgg cagacggtct tagccaaggg ctgtggtggc 1801 attggccgag ggtcttcggg tgcttcccag cacaagctgc ccttgctgtc ctgtgcagtg 1861 aagggagagg ccctgggtgg gggacactca gagtgggagc acatcccagc agtgcccatg 1921 tagcaggagc acagtggatg gccttgtgtc cctcgggcat gacaggcaga aacgagggct 1981 gctccaggag aagggcctcc tgctggccag agcaaggaag gctgagcagc ttggttctcc 2041 cctctggccc ctggagagaa gggagcattc ctagacccct gggtgcttgt ctgcacagag 2101 ctctggtctg tgccaccttg gccaggctgg ctgtgggagg gtctggtccc acgccgcctc 2161 tgctcagaca ctgtgtggga gggcacagca cagctgcggg taaagtgtga gagcttgcca 2221 tccagctcac gaagacagag ttattaaacc attacaaatc tc // LOCUS HSU12707 1806 bp mRNA PRI 07-MAR-1995 DEFINITION Human Wiskott-Aldrich syndrome protein (WASP) mRNA, complete cds. ACCESSION U12707 NID g695150 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1806) AUTHORS Derry,J.M., Ochs,H.D. and Francke,U. TITLE Isolation of a novel gene mutated in Wiskott-Aldrich syndrome [published erratum appears in Cell 1994 Dec 2;79(5):following 922] JOURNAL Cell 78 (4), 635-644 (1994) MEDLINE 94349367 REFERENCE 2 (bases 1 to 1806) AUTHORS Derry,J.M. TITLE Direct Submission JOURNAL Submitted (28-JUL-1994) Jonathan M. Derry, Howard Hughes Medical Institute, Beckman Center, Stanford University Medical Center, Stanford, CA 94395, USA FEATURES Location/Qualifiers source 1..1806 /organism="Homo sapiens" /db_xref="taxon:9606" gene 35..1543 /gene="WASP" CDS 35..1543 /gene="WASP" /codon_start=1 /product="Wiskott-Aldrich syndrome protein" /db_xref="PID:g695151" /translation="MSGGPMGGRPGGRGAPAVQQNIPSTLLQDHENQRLFEMLGRKCL TLATAVVQLYLALPPGAEHWTKEHCGAVCFVKDNPQKSYFIRLYGLQAGRLLWEQELY SQLVYSTPTPFFHTFAGDDCQAGLNFADEDEAQAFRALVQEKIQKRNQRQSGDRRQLP PPPTPANEERRGGLPPLPLHPGGDQGGPPVGPLSLGLATVDIQNPDITSSRYRGLPAP GPSPADKKRSGKKKISKADIGAPSGFKHVSHVGWDPQNGFDVNNLDPDLRSLFSRAGI SEAQLTDAETSKLIYDFIEDQGGLEAVRQEMRRQEPLPPPPPPSRGGNQLPRPPIVGG NKGRSGPLPPVPLGIAPPPPTPRGPPPPGRGGPPPPPPPATGRSGPLPPPPPGAGGPP MPPPPPPPPPPPSSGNGPAPPPLPPALVPAGGLAPGGGRGALLDQIRQGIQLNKTPGA PESSALQPPPQSSEGLVGALMHVMQKRSRAIHSSDEGEDQAGDEDEDDEWDD" polyA_signal 1774..1779 BASE COUNT 396 a 614 c 484 g 312 t ORIGIN 1 agcctcgcca gagaagacaa gggcagaaag caccatgagt gggggcccaa tgggaggaag 61 gcccgggggc cgaggagcac cagcggttca gcagaacata ccctccaccc tcctccagga 121 ccacgagaac cagcgactct ttgagatgct tggacgaaaa tgcttgacgc tggccactgc 181 agttgttcag ctgtacctgg cgctgccccc tggagctgag cactggacca aggagcattg 241 tggggctgtg tgcttcgtga aggataaccc ccagaagtcc tacttcatcc gcctttacgg 301 ccttcaggct ggtcggctgc tctgggaaca ggagctgtac tcacagcttg tctactccac 361 ccccaccccc ttcttccaca ccttcgctgg agatgactgc caagcggggc tgaactttgc 421 agacgaggac gaggcccagg ccttccgggc cctcgtgcag gagaagatac aaaaaaggaa 481 tcagaggcaa agtggagaca gacgccagct acccccacca ccaacaccag ccaatgaaga 541 gagaagagga gggctcccac ccctgcccct gcatccaggt ggagaccaag gaggccctcc 601 agtgggtccg ctctccctgg ggctggcgac agtggacatc cagaaccctg acatcacgag 661 ttcacgatac cgtgggctcc cagcacctgg acctagccca gctgataaga aacgctcagg 721 gaagaagaag atcagcaaag ctgatattgg tgcacccagt ggattcaagc atgtcagcca 781 cgtggggtgg gacccccaga atggatttga cgtgaacaac ctcgacccag atctgcggag 841 tctgttctcc agggcaggaa tcagcgaggc ccagctcacc gacgccgaga cctctaaact 901 tatctacgac ttcattgagg accagggtgg gctggaggct gtgcggcagg agatgaggcg 961 ccaggagcca cttccgccgc ccccaccgcc atctcgagga gggaaccagc tcccccggcc 1021 ccctattgtg gggggtaaca agggtcgttc tggtccactg ccccctgtac ctttggggat 1081 tgccccaccc ccaccaacac cccggggacc cccaccccca ggccgagggg gccctccacc 1141 accaccccct ccagctactg gacgttctgg accactgccc cctccacccc ctggagctgg 1201 tgggccaccc atgccaccac caccgccacc accgccaccg ccgcccagct ccgggaatgg 1261 accagcccct cccccactcc ctcctgctct ggtgcctgcc gggggcctgg cccctggtgg 1321 gggtcgggga gcgcttttgg atcaaatccg gcagggaatt cagctgaaca agacccctgg 1381 ggccccagag agctcagcgc tgcagccacc acctcagagc tcagagggac tggtgggggc 1441 cctgatgcac gtgatgcaga agagaagcag agccatccac tcctccgacg aaggggagga 1501 ccaggctggc gatgaagatg aagatgatga atgggatgac tgagtggctg agttacttgc 1561 tgccctgtgc tcctccccgc aggacatggc tccccctcca cctgctctgt gcccaccctc 1621 cactctcctc ttccagggcc cccaaccccc catttcttcc ccaccaaccc ctccaatgct 1681 gttatccctg cctggtcctc acactcaccc aacaatccca aggccctttt tatacaaaaa 1741 ttctcagttc tcttcactca aggattttta aagaaaaata aaagaattgt ctttctgtct 1801 ctctat // LOCUS HSU12778 2682 bp mRNA PRI 16-AUG-1995 DEFINITION Human acyl-CoA dehydrogenase mRNA, complete cds. ACCESSION U12778 NID g531390 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2682) AUTHORS Rozen,R., Vockley,J., Zhou,L., Milos,R., Willard,J., Fu,K., Vicanek,C., Low-Nang,L., Torban,E. and Fournier,B. TITLE Isolation and expression of a cDNA encoding the precursor for a novel member (ACADSB) of the acyl-CoA dehydrogenase gene family JOURNAL Genomics 24 (2), 280-287 (1994) MEDLINE 95213018 REFERENCE 2 (bases 1 to 2682) AUTHORS Rozen,R. TITLE Direct Submission JOURNAL Submitted (29-JUL-1994) Rima Rozen, Pediatrics, Human Genetics and Biology, McGill University - Montreal Children's Hospital, 2300 Tupper St., Montreal, Quebec H3H 1P3, Canada FEATURES Location/Qualifiers source 1..2682 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /dev_stage="adult" CDS 16..1314 /codon_start=1 /product="acyl-CoA dehydrogenase" /db_xref="PID:g531391" /translation="MEGLAVRLLRGSRLLRRNFLTCLSSWKIPPHVSKSSQSEALLNI TNNGIHFAPLQTFTDEEMMIKSSVKKFAQEQIAPLVSTMDENSKMEKSVIQGLFQQGL MGIEVDPEYGGTGASFLSTVLVIEELAKVDASVAVFCEIQNTLINTLIRKHGTEEQKA TYLPQLTTEKVGSFCLSEAGAGSDSFALKTRADKEGDYYVLNGSKMWISSAEHAGLFL VMANVDPTIGYKGITSFLVDRDTPGLHIGKPENKLGLRASSTCPLTFENVKVPEANIL GQIGHGYKYAIGSLNEGRIGIAAQMLGLAQGCFDYTIPYIKERIQFGKRLFDFQGLQH QVAHVATQLEAARLLTYNAARLLEAGKPFIKEASMAKYYASEIAGQTTSKCIEWMGGV GYTKDYPVEKYFRDAKIGTIYEGASNIQLNTIAKHIDAEY" polyA_site 2682 /note="33 A nucleotides" BASE COUNT 823 a 472 c 576 g 811 t ORIGIN 1 attcctgcgc cgaggatgga gggcctggca gtgcggttgc tgcgcggcag caggctgcta 61 agaagaaatt tcctgacttg tttgtcttct tggaagattc ctcctcatgt ctcaaaatct 121 tcccagtcag aagctctact caatataaca aataatggaa tacactttgc tcccctgcaa 181 acatttacag atgaggaaat gatgataaag agttcagtta aaaaatttgc tcaggaacaa 241 attgcacctt tggtttcaac catggatgaa aattcgaaaa tggagaaatc agtaatacaa 301 ggattatttc aacaagggtt gatgggtatt gaagttgacc cagaatatgg aggcacagga 361 gcttcttttt tatccactgt gctcgtgata gaggaattag ccaaagttga tgcatctgtg 421 gctgtctttt gtgagatcca gaacacatta attaacacac tgattagaaa acatggaaca 481 gaagaacaaa aggccaccta tttgcctcag ctcactacag aaaaagtagg aagtttctgc 541 ctttcagagg ctggagcagg tagtgactca tttgctttga agaccagagc tgataaagag 601 ggagattatt atgtcctcaa tggatcaaag atgtggatca gcagtgctga gcatgcaggg 661 ctctttctgg tgatggcaaa tgtagaccct accattggat ataagggaat tacctccttc 721 ttagtagatc gtgatactcc gggccttcat atagggaaac ctgaaaacaa attggggctc 781 agagcttctt ccacctgccc gttaacattc gaaaatgtca aggttccaga agccaatatc 841 ttgggacaaa ttggacatgg ctataagtat gccataggga gtctcaatga aggtagaata 901 ggaattgctg cacagatgct gggactggcg caaggatgtt ttgactacac tattccatat 961 attaaagaaa ggatacaatt tggcaaaaga ctatttgatt ttcagggcct ccaacaccaa 1021 gtggctcacg tggccaccca gctggaagct gcaagattac taacatacaa tgctgctagg 1081 cttttagaag ctggaaagcc attcataaaa gaagcgtcaa tggccaaata ctatgcatca 1141 gagattgcag gacaaacaac gagtaaatgt atcgagtgga tggggggagt aggctacacc 1201 aaagattacc ctgtggagaa atacttccga gatgcaaaga ttggtacgat atatgaagga 1261 gcttccaaca tccagttgaa caccattgca aagcatatcg atgcagaata ctgacgtcta 1321 taggagtggg acccctccct ggtgtcactg ctgtaaaatt ttaaacggtt gtgtcttgtt 1381 gggagtaagt gccttgcgtg ggaataaact tccacagcat tcgaatattt taatgaagcc 1441 cttagtcagg gtcctggtgt tggccttttt ggttttctct tttcaggctg tttaacttag 1501 gcacaggaga tccactttta aacttgggaa ataagcacct gtattttttt ccaaaactgt 1561 ttttaaagct gtatacgcat acatatatat atttttactc tgtcttactc tgtcacccag 1621 gctagagtgc agtggcgcga tctcagctca ctgcagcctt gacctcctgg gttccggtga 1681 ttctcatgcc tcatcctccc aagtagctgg aactacaggt gtgcaccacc atgcctggtt 1741 catttttgtg tttttagtag agatggggtt ttaccatatt gcccaggctg gtcttctggc 1801 ttctggatat cgcccacctt ggcctcccaa agtgctggga ttacagggat gagccaccat 1861 gcctggctgg gtatttatat tatcattcta gtttcagagt atacagaagt ttcatcccat 1921 catttggaaa aataaaggca tctgaagtac aatattactt atagaaatag tttatattcc 1981 tattaaatct taatcttgtg gcatcaggga aatatttgtt acatagtagg caatttttat 2041 ccgtacttta tagattcaac tctaagttgc aagcgaagtc aaaactgatg aaaatttatt 2101 tagaaaaatc taaaaattct ggttttaaat atgagaatca gtggaaaata agggtataat 2161 tttgtaggtc atatgattga agaaaatatt attttaacaa tgtaaagcaa tatgatttgt 2221 tactataatt aacctgtata aaagatacat tttatggtgg tttcagtagg tcattttaaa 2281 aaccaatgtg cattagtttt caagtataag gtttaagtaa tttggtttaa taatcagaaa 2341 atattcaata cagtgttgga tatctgtcat gcactatttt cagttgacaa tttctgtatt 2401 ttaattgaat actgtttctt cagtcatggt tattattgca ctttatcctg aataatagtt 2461 cagaaattgg gttttggttc agtgattctc aagaaaaaga tctcttgccc attaagaagt 2521 gtatcaaaat ctcataagga atgagggaga gaagggggct gtacagtttg aaaaagcata 2581 ttcaatatta aaatagtttt atatttggga aggcaaaaat gaatctattg ttttgcaata 2641 taggttatac aggaaaatgc aataaaatat atatctggtt gt // LOCUS HSU12779 2258 bp mRNA PRI 17-AUG-1994 DEFINITION Human MAP kinase activated protein kinase 2 mRNA, complete cds. ACCESSION U12779 NID g530089 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2258) AUTHORS Zu,Y.L., Wu,F., Gilchrist,A., Ai,Y., Labadia,M.E. and Huang,C.K. TITLE The primary structure of a human MAP kinase activated protein kinase 2 JOURNAL Biochem. Biophys. Res. Commun. 200 (2), 1118-1124 (1994) MEDLINE 94235003 REFERENCE 2 (bases 1 to 2258) AUTHORS Huang,C. TITLE Direct Submission JOURNAL Submitted (29-JUL-1994) Chi-Kuang Huang, Pathology, University of Connecticut Health Center, 263 Farmington Ave., Farmington, CT 06030, USA FEATURES Location/Qualifiers source 1..2258 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="324" /cell_line="HL-60" CDS 379..1491 /codon_start=1 /product="MAP kinase activated protein kinase 2" /db_xref="PID:g530090" /translation="MLSNSQGQSPPVPFPAPAPPPQPPTPALPHPPAQPPPPPPQQFP QFHVKSGLQIKKNAIIDDYKVTSQVLGLGINGKVLQIFNKRTQEKFALKMLQDCPKAR REVELHWRASQCPHIVRIVDVYENLYAGRKCLLIVMECLDGGELFSRIQDRGDQAFTE REASEIMKSIGEAIQYLHSINIAHRDVKPENLLYTSKRPNAILKLTDFGFAKETTSHN SLTTPCYTPYYVAPEVLGPEKYDKSCDMWSLGVIMYILLCGYPPFYSNHGLAISPGMK TRIRMGQYEFPNPEWSEVSEEVKMLIRNLLKTEPTQRMTITEFMNHPWIMQSTKVPQT PLHTSRVLKEDKERWEDVKGCLHDKNSDQATWLTRL" BASE COUNT 496 a 711 c 618 g 433 t ORIGIN 1 gatatcacag caacattgaa atgctaaaaa gtttttaaac actctcaatt tctaattcac 61 catgtcacag actggtgaaa aaaaaaaaaa aagcggccgc ttccccccgg ccgggccccc 121 gccgccccgc ggtccccaga gcgccaggcc cccgggggga gggagggagg gcgccgggcc 181 ggtgggagcc agcggcgcgc ggtgggaccc acggagcccc gcgacccgcc gagcctggag 241 ccgggccggc tcggggaagc cggctccagc ccggagcgaa cttcgcagcc cgtcgggggg 301 cggcggggag ggggcccgga gccggaggag ggggcggccg cgggcacccc cgcctgtgcc 361 ccggcgtccc cgggcaccat gctgtccaac tcccagggcc agagcccgcc ggtgccgttc 421 cccgccccgg ccccgccgcc gcagcccccc acccctgccc tgccgcaccc cccggcgcag 481 ccgccgccgc cgcccccgca gcagttcccg cagttccacg tcaagtccgg cctgcagatc 541 aagaagaacg ccatcatcga tgactacaag gtcaccagcc aggtcctggg gctgggcatc 601 aacggcaaag ttttgcagat cttcaacaag aggacccagg agaaattcgc cctcaaaatg 661 cttcaggact gccccaaggc ccgcagggag gtggagctgc actggcgggc ctcccagtgc 721 ccgcacatcg tacggatcgt ggatgtgtac gagaatctgt acgcagggag gaagtgcctg 781 ctgattgtca tggaatgttt ggacggtgga gaactcttta gccgaatcca ggatcgagga 841 gaccaggcat tcacagaaag agaagcatcc gaaatcatga agagcatcgg tgaggccatc 901 cagtatctgc attcaatcaa cattgcccat cgggatgtca agcctgagaa tctcttatac 961 acctccaaaa ggcccaacgc catcctgaaa ctcactgact ttggctttgc caaggaaacc 1021 accagccaca actctttgac cactccttgt tatacaccgt actatgtggc tccagaagtg 1081 ctgggtccag agaagtatga caagtcctgt gacatgtggt ccctgggtgt catcatgtac 1141 atcctgctgt gtgggtatcc ccccttctac tccaaccacg gccttgccat ctctccgggc 1201 atgaagactc gcatccgaat gggccagtat gaatttccca acccagaatg gtcagaagta 1261 tcagaggaag tgaagatgct cattcggaat ctgctgaaaa cagagcccac ccagagaatg 1321 accatcaccg agtttatgaa ccacccttgg atcatgcaat caacaaaggt ccctcaaacc 1381 ccactgcaca ccagccgggt cctgaaggag gacaaggagc ggtgggagga tgtcaagggg 1441 tgtcttcatg acaagaacag cgaccaggcc acttggctga ccaggttgtg agcagaggat 1501 tctgtgttcc tgtccaaact cagtgctgtt tcttagaatc cttttattcc ctgggtctct 1561 aatgggacct taaagaccat ctggtatcat cttctcattt tgcagaagag aaactgaggc 1621 ccagaggcgg agggcagtct gctcaaggtc acgcagctgg tgactggttg gggcagaccg 1681 gacccaggtt tcctgactcc tggcccaagt ctcttcctcc tatcctgcgg gatcactggg 1741 gggctctcag ggaacagcag cagtgccata gccaggctct ctgctgccca gcgctggggt 1801 gaggctgccg ttgtcagcgt ggaccactaa ccagcccgtc ttctctctct gctcccaccc 1861 ctgccgcctc acctgccctt gttgtctctg tctctcactg tctcttctgc tgtctctcta 1921 ctgtcttctg gctctctctg tacccttcct ggtgctgccg tgcccccagg aggagatgac 1981 cagtgccttg gccacaatgc gcgttgacta cgagcagatc aagataaaaa agattgaaga 2041 tgcatccaac cctctgctgc tgaagaggcg gaagaaagct cgggccctgg aggctgcggc 2101 tctggcccac tgagccaccg cgccctcctg cccacgggag gacaagcaat aactctctac 2161 aggaatatat tttttaaacg aagagacaga actgtccaca tctgcctcct ctcctcctca 2221 gctgcatgga gcctggaact gcatcagtga ctgaattc // LOCUS HSU12918 2016 bp mRNA PRI 22-AUG-1994 DEFINITION Human syntaxin mRNA, complete cds. ACCESSION U12918 NID g531457 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2016) AUTHORS Zhang,R. and Simpson,L. TITLE Human Syntaxin is homologous with Rat Syntaxin A and digested by BoNT C JOURNAL Unpublished REFERENCE 2 (bases 1 to 2016) AUTHORS Zhang,R. TITLE Direct Submission JOURNAL Submitted (03-AUG-1994) Ren-de Zhang, Medicine, Thomas Jefferson University, 1020 Locust Street, Philadelphia, PA 19107, USA FEATURES Location/Qualifiers source 1..2016 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pcDNA.HS.26" /clone_lib="human brain 5'-stretch plus cDNA" /sex="male" /tissue_type="whole cerebral brain" /dev_stage="adult" CDS 47..826 /note="similar to rat syntaxin A, Swiss-Prot Accession Number P32851" /codon_start=1 /product="syntaxin" /db_xref="PID:g531458" /translation="MDEFFEQVEEIRGFIDKIAENVEEVKRKHSAILASPNPDEKTKV ELEELMSDIKKTANKVRSKLKSIEQSIEQEEGLNRSSADLRIRKTQHSTLSRKFVEVM SEYNATQSVYRERCKGRIQRQLEITGRTTTSEELEDMLESGNPAIFASGIIMDSSISK QALSEIETRHSEIIKLENSIRELHDMFMDMAMLVESQGEMIDRIEYNVEHAVDYVERA VSDTKKAVKYQSKARRKKIMIIICCVILGIVIASTVGGIFA" BASE COUNT 411 a 618 c 589 g 398 t ORIGIN 1 cgatgatgat gatgatgtcg ctgtcaccgt ggaccgagac cgcttcatgg atgagttctt 61 tgagcaggtg gaggagattc gaggcttcat tgacaagatc gcagagaacg tggaggaggt 121 gaagcggaag cacagtgcca tcctggcatc ccccaacccc gatgagaaga cgaaggtgga 181 gctggaagaa ctcatgtccg acataaagaa gacagcaaac aaagttcgtt ccaagttaaa 241 gagcatcgag cagtccatcg agcaagagga aggcctgaac cgctcctccg ctgacctgag 301 gatccggaag acacagcact ccacgctgtc cagaaagttt gtggaggtca tgtcggagta 361 caacgccacg cagtccgtct accgcgagcg ctgcaaaggc cgcatccaga ggcagctgga 421 gatcaccggc aggaccacga ccagtgagga gctggaggac atgctggaga gtgggaaccc 481 cgccatcttt gcctctggga tcatcatgga ctccagcatc tcgaagcagg ctctgagcga 541 gattgagacg cggcacagtg agatcatcaa gctggagaac agcatccgtg agctacacga 601 catgttcatg gacatggcca tgctcgtgga gagccaggga gagatgattg acaggatcga 661 gtacaatgtg gaacacgcgg tagactatgt ggagagggcc gtgtctgaca ccaagaaggc 721 cgtcaagtac cagagcaagg cgcgccggaa gaaaatcatg atcatcatct gctgtgtgat 781 cctgggcatc gtcatcgcct ccactgttgg gggcatcttc gcctagaagc cacccaatct 841 gccactccac tccaggtggg ccactccaag gaggccctgg ctgctgccac ctggctgggc 901 tgccctccca acccccgcct ctggctcaga gcaccctccc tcccggcccc catgctccct 961 tctctgccat gggccctccg tccccgcccc gtgtcgtgtg catgatctct gtgagtgtgc 1021 gtctgtacgg gaagaggcag agggaggcag ccagcggggc gtgatgcagt gtgcacagcg 1081 aggagcagac ccaggcaggg ccgccagggt gacacaggcc acgcttcctt gccttcagta 1141 actcggtggg cccaggttct gctcttgccc tggggaccct aacctcgcct ccagctgacc 1201 tgccctgtcc tctccagctg tccccacaag cagagccctg aggggtgggg accagctggc 1261 cacatggtgc tgcttttcag gttaggggag aggtggccct gagggtcagc ccagctctga 1321 gtctcagtcg ctgatcactg ccagggaggc tcaggctgcc atggctccag gctccctccc 1381 ctgcctaggg gcaaagtcca tcgggtcctg ggcctcagct tcccttccca cattcctccg 1441 gccccaggag caaccccttg ggctaggtct gaccccaggt gtccctctgg aaggggctgg 1501 ctggtgccct atttccagcc accccagcag ctagggaggc aaagcaggct gatgtcagtc 1561 cctcaagcca gcgttgcatg tttgggatgg tggctcctgt tgtcttgcgc tctgggaagt 1621 cagatgtcat ttcaggcctg cagtctcatc ctgcccttgc catcctccca tcgatgtgcc 1681 acgtgggtgt cacgtgtccc agatgcagta ttcggcagcc agccggggag ggctacctcc 1741 tcctcctcac caccttgggg cttctcatgg gaaatgtgcc cccgccccag gaccctctcc 1801 cttgtggaca ggcagggaga tgcatgcgag tgcatgcagc aggggatggg gccgtgtccg 1861 tgtgccccac cctccctcgg ctttactcct gcccagtgac tgtgaccact gtccgtgttg 1921 ccttcttgaa cagcgattcc ccccaacccc ttcaccaaag gtcttggtac aaccagctgc 1981 ccattttgtg aaatttttat gtagaataaa caattc // LOCUS HSU13021 1456 bp mRNA PRI 06-JAN-1995 DEFINITION Human positive regulator of programmed cell death ICH-1L (Ich-1) mRNA, complete cds. ACCESSION U13021 NID g537291 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1456) AUTHORS Wang,L., Miura,M., Bergeron,L., Zhu,H. and Yuan,J. TITLE Ich-1, an Ice/ced-3-related gene, encodes both positive and negative regulators of programmed cell death JOURNAL Cell 78 (5), 739-750 (1994) MEDLINE 94373811 REFERENCE 2 (bases 1 to 1456) AUTHORS Yuan,J. TITLE Direct Submission JOURNAL Submitted (05-AUG-1994) Junying Yuan, CVRC CNY-4, Massachusetts General Hospital, 149 13th Street, Charlestown, MA 02129, USA FEATURES Location/Qualifiers source 1..1456 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="brain" misc_feature 1..35 /note="deleted in the alternatively spliced form ICH-1S, GenBank Accession Number U13022" gene 14..1321 /gene="Ich-1" CDS 14..1321 /gene="Ich-1" /note="similar to human interleukin-1beta converting enzyme (ICE), Swiss-Prot Accession Number P29466 and nematode Ced-3 gene, PIR Accession Number A49429; alternatively spliced form" /codon_start=1 /function="positive regulator of programmed cell death" /product="ICH-1L" /db_xref="PID:g537292" /translation="MAADRGRRILGVCGMHPHHQETLKKNRVVLAKQLLLSELLEHLL EKDIITLEMRELIQAKVGSFSQNVELLNLLPKRGPQAFDAFCEALRETKQGHLEDMLL TTLSGLQHVLPPLSCDYDLSLPFPVCESCPLYKKLRLSTDTVEHSLDNKDGPVCLQVK PCTPEFYQTHFQLAYRLQSRPRGLALVLSNVHFTGEKELEFRSGGDVDHSTLVTLFKL LGYDVHVLCDQTAQEMQEKLQNFAQLPAHRVTDSCIVALLSHGVEGAIYGVDGKLLQL QEVFQLFDNANCPSLQNKPKMFFIQACRGDETDRGVDQQDGKNHAGSPGCEESDAGKE KLPKMRLPTRSDMICGYACLKGTAAMRNTKRGSWYIEALAQVFSERACDMHVADMLVK VNALIKDREGYAPGTEFHRCKEMSEYCSTLCRHLYLFPGHPPT" BASE COUNT 353 a 381 c 384 g 338 t ORIGIN 1 gcacaaggag ctgatggccg ctgacagggg acgcaggata ttgggagtgt gtggcatgca 61 tcctcatcat caggaaactc taaaaaagaa ccgagtggtg ctagccaaac agctgttgtt 121 gagcgaattg ttagaacatc ttctggagaa ggacatcatc accttggaaa tgagggagct 181 catccaggcc aaagtgggca gtttcagcca gaatgtggaa ctcctcaact tgctgcctaa 241 gaggggtccc caagcttttg atgccttctg tgaagcactg agggagacca agcaaggcca 301 cctggaggat atgttgctca ccaccctttc tgggcttcag catgtactcc caccgttgag 361 ctgtgactac gacttgagtc tcccttttcc ggtgtgtgag tcctgtcccc tttacaagaa 421 gctccgcctg tcgacagata ctgtggaaca ctccctagac aataaagatg gtcctgtctg 481 ccttcaggtg aagccttgca ctcctgaatt ttatcaaaca cacttccagc tggcatatag 541 gttgcagtct cggcctcgtg gcctagcact ggtgttgagc aatgtgcact tcactggaga 601 gaaagaactg gaatttcgct ctggagggga tgtggaccac agtactctag tcaccctctt 661 caagcttttg ggctatgacg tccatgttct atgtgaccag actgcacagg aaatgcaaga 721 gaaactgcag aattttgcac agttacctgc acaccgagtc acggactcct gcatcgtggc 781 actcctctcg catggtgtgg agggcgccat ctatggtgtg gatgggaaac tgctccagct 841 ccaagaggtt tttcagctct ttgacaacgc caactgccca agcctacaga acaaaccaaa 901 aatgttcttc atccaggcct gccgtggaga tgagactgat cgtggggttg accaacaaga 961 tggaaagaac cacgcaggat cccctgggtg cgaggagagt gatgccggta aagaaaagtt 1021 gccgaagatg agactgccca cgcgctcaga catgatatgc ggctatgcct gcctcaaagg 1081 gactgccgcc atgcggaaca ccaaacgagg ttcctggtac atcgaggctc ttgctcaagt 1141 gttttctgag cgggcttgtg atatgcacgt ggccgacatg ctggttaagg tgaacgcact 1201 tatcaaggat cgggaaggtt atgctcctgg cacagaattc caccggtgca aggaaatgtc 1261 tgaatactgc agcactctgt gccgccacct ctacctgttc ccaggacacc ctcccacatg 1321 atgtcacctc cccatcatcc acgccaagtg gaagccactg gaccacagga ggtgtgatag 1381 agcctttgat cttcaggatg cacggtttct gttctgcccc ctcagggatg tgggaatctc 1441 ccagacttgt ttcctg // LOCUS HSU13044 1970 bp mRNA PRI 03-MAY-1995 DEFINITION Human nuclear respiratory factor-2 subunit alpha mRNA, complete cds. ACCESSION U13044 NID g531892 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1970) AUTHORS Gugneja,S., Virbasius,J.V. and Scarpulla,R.C. TITLE Four structurally distinct, non-DNA-binding subunits of human nuclear respiratory factor 2 share a conserved transcriptional activation domain JOURNAL Mol. Cell. Biol. 15 (1), 102-111 (1995) MEDLINE 95097980 REFERENCE 2 (bases 1 to 1970) AUTHORS Gugneja,S. TITLE Direct Submission JOURNAL Submitted (05-AUG-1994) Sajiv Gugneja, Cell and Molecular Biology, Northwestern University, 303 E. Chicago Ave, Chicago, IL 60611, USA FEATURES Location/Qualifiers source 1..1970 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 186..1550 /citation=[1] /codon_start=1 /product="nuclear respiratory factor-2 subunit alpha" /db_xref="PID:g531893" /translation="MTKREAEELIEIEIDGTEKAECTEESIVEQTYAPAECVSQAIDI NEPIGNLKKLLEPRLQCSLDAHEICLQDIQLDPERSLFDQGVKTDGTVQLSVQVISYQ GIEPKLNILEIVKPADTVEVVIDPDAHHAESEAHLVEEAQVITLDGTKHITTISDETS EQVTRWAAALEGYRKEQERLGIPYDPIQWSTDQVLHWVVWVMKEFSMTDIDLTTLNIS GRELCSLNQEDFFQRVPRGEILWSHLELLRKYVLASQEQQMNEIVTIDQPVQIIPASV QSATPTTIKVINRCAKAAKVQRAPRISGEDRSSPGNRTGNNGQIQLWQFLLELLTDKD ARDCISWVGDEGEFKLNQPELVAQKWGQRKNKPTMNYEKLSRALRYYYDGDMICKVQG KRFVYKFVCDLKTLIGYSAAELNRLVTECEQKKLAKMQLHGIAQPVTVVALATASLQT EKDN" BASE COUNT 618 a 382 c 457 g 513 t ORIGIN 1 agcgccgatt ccgcgggaag ggccctggga cctcacactt ctagtcgcgg gagctgcagg 61 tcttacccgg agagacgctg cacgtggagc cctcgccgct gccgttcttc agccggccct 121 ggagtgcggg cgggggcgac agggccgatt ccggagtggg actgatcctt tgaaatactc 181 cagccatgac taaaagagaa gcagaggagc tgatagaaat tgagattgat ggaacagaga 241 aagcagagtg cacagaagaa agcattgtag aacaaaccta cgcgccagct gaatgtgtaa 301 gccaggccat agacatcaat gaaccaatag gcaatttaaa gaaactgcta gaaccaagac 361 tacagtgttc tttggatgct catgaaattt gtctgcaaga tatccagctg gatccagaac 421 gaagtttatt tgaccaagga gtaaaaacag atggaactgt acagcttagt gtacaggtaa 481 tttcttacca aggaattgaa ccaaagttaa acatccttga aattgttaaa cctgcggaca 541 ctgttgaggt tgttattgat ccagatgccc accatgctga atcagaagca catcttgttg 601 aagaagctca agtgataact cttgatggca caaaacacat cacaaccatt tcagatgaaa 661 cttcagaaca agtgacaaga tgggctgctg cactggaagg ctataggaaa gaacaagaac 721 gccttgggat accctatgat cccatacagt ggtccacaga ccaagtcctg cattgggtgg 781 tttgggtaat gaaggaattc agcatgaccg atatagacct caccacactc aacatttcgg 841 ggagagaatt atgtagtctc aaccaagaag atttttttca gcgggttcct cggggagaaa 901 ttctctggag tcatctggaa cttctccgaa aatatgtatt ggcaagtcaa gaacaacaga 961 tgaatgaaat agttacaatt gatcaacctg tgcaaattat tccagcatca gtgcaatctg 1021 ctacacctac taccattaaa gttataaata gatgtgcgaa agcagccaaa gtacaaagag 1081 cgccgaggat ttcaggagaa gatagaagct cacctgggaa cagaacagga aacaatggcc 1141 aaatccaact atggcagttt ttgctagaac ttcttactga taaggacgct cgagactgca 1201 tttcttgggt tggtgatgaa ggtgaattta agctaaatca gcctgaactg gttgcacaga 1261 aatggggaca gcgtaaaaat aagcctacga tgaactatga gaaactcagt cgtgcattaa 1321 gatattatta cgatggggac atgatttgta aagttcaagg caagagattt gtgtacaagt 1381 ttgtctgtga cttgaagact cttattggat acagtgcagc ggagttgaac cgtttggtca 1441 cagaatgtga acagaagaaa cttgcaaaga tgcagctcca tggaattgcc cagccagtca 1501 cagtagtagc tctggctact gcttctctgc aaacggaaaa ggataattga gccccaggac 1561 attctgagac tccaaagtct ttcttaaaat gtttagagca agtatagctc ttacctttat 1621 tactgaattt gaatcttctt ttatttctag gctgtacagt ctgatgcatg atttttttat 1681 aaatatttca tactcttgtg aatttggatc tttttacttt gagcatatat tttagaatat 1741 gtgtatgtta aaggatctcc acaatgtctg cagtgtgaag gcaggttcat tgtggaatag 1801 tttaacagtc aggaaggcta aactggtcag tattaatgtg tagccctacc aaaaatagcc 1861 agtagtatct gaaaatgaaa aataaatgaa gtatctctag gaaacagtct ggcttaacta 1921 tttttgaaaa tataactgtt tcccctctct gctgctttag atgttgcttt // LOCUS HSU13045 2727 bp mRNA PRI 03-MAY-1995 DEFINITION Human nuclear respiratory factor-2 subunit beta 1 mRNA, complete cds. ACCESSION U13045 NID g531894 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2727) AUTHORS Gugneja,S., Virbasius,J.V. and Scarpulla,R.C. TITLE Four structurally distinct, non-DNA-binding subunits of human nuclear respiratory factor 2 share a conserved transcriptional activation domain JOURNAL Mol. Cell. Biol. 15 (1), 102-111 (1995) MEDLINE 95097980 REFERENCE 2 (bases 1 to 2727) AUTHORS Gugneja,S. TITLE Direct Submission JOURNAL Submitted (05-AUG-1994) Sajiv Gugneja, Cell and Molecular Biology, Northwestern University, 303 E. Chicago Ave, Chicago, IL 60611, USA FEATURES Location/Qualifiers source 1..2727 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 170..1357 /citation=[1] /codon_start=1 /product="nuclear respiratory factor-2 subunit beta 1" /db_xref="PID:g531895" /translation="MSLVDLGKKLLEAARAGQDDEVRILMANGAPFTTDWLGTSPLHL AAQYGHYSTTEVLLRAGVSRDARTKVDRTPLHMAASEGHASIVEVLLKHGADVNAKDM LKMTALHWATEHNHQEVVELLIKYGADVHTQSKFCKTAFDISIDNGNEDLAEILQIAM QNQINTNPESPDTVTIHAATPQFIIGPGGVVNLTGLVSSENSSKATDETGVSAVQFGN SSTSVLATLAALAEASAPLSNSSETPVVATEEVVTAESVDGAIQQVVSSGGQQVITIV TDGIQLGNLHSIPTSGIGQPIIVTMPDGQQVLTVPATDIAEETVISEEPPAKRQCIEI IENRVESAEIEEREALQKQLDEANREAQKYRQQLLKKEQEAEAYRQKLEAMTRLQTNK EAV" BASE COUNT 847 a 508 c 561 g 811 t ORIGIN 1 ggggattttg ggcgccgacc agacgcggca ttttcggaaa atagcgccct gtgcagctga 61 agcgctttgt gtgtagcggg gccgcgtcag cccggccggg tacgaggcgc ctcgggtccc 121 cgcaccacct cctgctgcct tcccgtcgcc gctcccgaag cttttccaga tgtccctggt 181 agatttggga aagaagcttt tagaagcggc acgagcaggt caagatgatg aagttcgtat 241 tttgatggca aatggagctc cctttactac agactggctg ggaacttctc cacttcatct 301 agcagcacag tatggtcatt attccaccac agaggtactg ctgcgagctg gtgtgagcag 361 agatgccaga accaaagtgg accgaacacc attacatatg gcagcttctg agggccatgc 421 cagcatagta gaggttttac ttaagcatgg tgctgatgtc aatgcaaagg acatgttaaa 481 gatgacagct ctccattggg ccacagaaca caatcatcaa gaggtggtgg aacttttaat 541 caaatatggt gctgatgtac acacgcaaag taaattttgt aaaactgcat ttgatatttc 601 aatagacaat ggaaatgaag atttagcaga gatattacag attgctatgc agaaccaaat 661 caacacaaac ccagagagtc ctgacactgt gacaatacat gctgcaacac cacagtttat 721 cattggacct ggaggggtgg tgaacctaac aggtctggta tcttcagaaa attcatccaa 781 ggcaacagat gaaacgggtg tatctgctgt tcagtttgga aactcttcta catcagtatt 841 agctacatta gctgccttag ctgaagcatc tgctccattg tccaattctt cagaaactcc 901 agtagtggcc acagaagaag tagttactgc agaatctgtg gatggtgcca ttcagcaagt 961 agttagttca gggggtcagc aagtcatcac aatagttaca gatggaattc agcttggaaa 1021 tttgcactct attccaacca gtggaattgg tcagcccatc attgtgacca tgccagatgg 1081 acaacaagta ttaacagtac cagcaacaga cattgctgaa gaaactgtta taagtgaaga 1141 accaccagct aagagacaat gtatcgaaat aattgaaaac cgggtggaat ctgcagaaat 1201 agaagagaga gaagctcttc agaaacagct ggatgaagca aatcgagaag cacaaaaata 1261 tcgacagcag ctcctaaaga aagaacagga agcagaggcc tacagacaga agttggaagc 1321 tatgactcgt cttcagacta ataaagaagc tgtttaattg aaatgaacat gtagtttgat 1381 tttacttttg gtcaagaaag aatacaatct tgaactgtac acaacaaagg tacagccatg 1441 ggaatacaga atgatagaag agactacaga tggataattg gacttaagcc atgagctctg 1501 agttcttgta acataaaact ttactttaga agttgtgaaa tgtatttaaa actgaattct 1561 gtaaatagtt tttttttttt tacagttcca aatgagttga taaagattgt tgaagagatc 1621 caaaaccaga ataagccact gtttttgtga attctttttg attttagtac aaaccttaat 1681 ttctcagaaa cggaacagtt ttaagggtga tcgttgttgg ttaggccaaa tgttgtgtaa 1741 taattatggt ggactgatgc tggaattact cctgtaggta taaacctctg tatgaagaga 1801 agatttctcc caggaaatct ttgtacagct ttaagttgtg tcagattctc tgaaaacatt 1861 ttttagaaag caaaattttt atatttgttc aatttcagct atacccaagt agatttacat 1921 gtatatgaag caaatatttt taaaaatttc tgtttgtaca tattctgcat gttttataat 1981 ttcaaaatgc atcacttaca taggtatttc tcccacagaa atgatgaaag tgaccagaaa 2041 aaaacaaaaa caaaacccct ttactctgta ggtcattgaa acgaagtaag ctggcagctg 2101 gttttattgg aatgacagtg ttctcggaag gagcagccta caagataact tgaatttgcc 2161 aattctgcaa aatctgtgct tttttgaaaa tttaagagtg gggacgtgaa actgtattct 2221 gtgccttcca tcatgatttc cacatgaaag cactttaagg cactgatttt aagataatgt 2281 ttttggaaaa cccaatgcat atgggtttct gaaatatttt atggacttat ttctccccag 2341 gaaacgattc ttacggaaaa aaattgcttt tgtatgtaga acaggaactt tttgtattac 2401 agtgatgcaa tagacatgtc taatgtaact tctacttttc cttttgaaag ctcagtgtct 2461 gtgctatgac ttgctctcat cacaatattg ttgaattcca caatgtatgg acattaaaca 2521 ctggcagact gttcactttt tctttttttt tttttggtaa aatattactt caaacccctt 2581 tttcttgctt tatttttcag tgttttattg ctttatgaac tgtttaaccc tgaaatccct 2641 ctaggttatc tatactgtat aaaaaagcaa ttacccttaa aactgtactc tggcctactt 2701 ttctattttg caattaaata tcttttt // LOCUS HSU13219 2510 bp mRNA PRI 13-MAR-1996 DEFINITION Human forkhead protein FREAC-1 mRNA, complete cds. ACCESSION U13219 NID g1223839 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2510) AUTHORS Pierrou,S., Hellqvist,M., Samuelsson,L., Enerback,S. and Carlsson,P. TITLE Cloning and characterization of seven human forkhead proteins: binding site specificity and DNA bending JOURNAL EMBO J. 13 (20), 5002-5012 (1994) MEDLINE 95045392 REFERENCE 2 (bases 1 to 2510) AUTHORS Hellqvist,M., Mahlapuu,M., Samuelsson,L., Enerback,S. and Carlsson,P. TITLE Differential activation of lung-specific genes by two forkhead proteins, FREAC-1 and FREAC-2 JOURNAL J. Biol. Chem. 271 (8), 4482-4490 (1996) MEDLINE 96224034 REFERENCE 3 (bases 1 to 2510) AUTHORS Carlsson,P. TITLE Direct Submission JOURNAL Submitted (10-AUG-1994) Peter Carlsson, Molecular Biology, Goteborg University, Medicinaregatan 9C, Goteborg S-413 90, Sweden FEATURES Location/Qualifiers source 1..2510 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 94..1158 /codon_start=1 /product="FREAC-1" /db_xref="PID:g1223840" /translation="MDPASSGPSKAKKTNAGIRRPEKPPYSYIALIVMAIQSSPTKRL TLSEIYQFLQSRFPFFRGSYQGWKNSVRHNLSLNECFIKLPKGLGRPGKGHYWTIDPA SEFMFEEGSFRRRPRGFRRKCQALKPMYSMMNGLGFNHLPDTYGFQGSAGGLSCPPNS LALEGGLGMMNGHLPGNVDGMALPSHSVPHLPSNGGHSYMGGCGGAAAGEYPHHDSSV PASPLLPTGAGGVMEPHAVYSGSAAAWPPSASAALNSGASYIKQQPLSPCNPAANPLS GSLSTHSLEQPYLHQNSHNAPAELQGIPRYHSQSPSMCDRKEFVFSFNAMASSSMHSA GGGSYYHQQVTYQDIKPCVM" misc_feature 160..459 /note="forkhead-motif" polyA_signal 2460..2465 BASE COUNT 538 a 759 c 638 g 575 t ORIGIN 1 ggcggcagca gccacccgat gtcttcggcg cccgagaagc agcagccacc gcacggcggc 61 ggcggcggcg gcggcggggg aggcggcgcg gccatggacc ccgcgtcgtc cggcccgtcc 121 aaggccaaga agaccaacgc cggcatccgg cgcccggaga agccgcccta ttcctacatc 181 gcgctcatcg tcatggccat ccagagttca cccaccaagc gcctgacgct gagcgagatc 241 taccagttcc tgcagagccg cttccccttc ttccggggct cctaccaggg ctggaagaac 301 tccgtgcgcc acaacctctc gctcaacgag tgcttcatca agctacccaa gggccttggg 361 cggcccggca agggccacta ctggaccatc gacccggcca gcgagttcat gttcgaggag 421 ggctcctttc ggcggcggcc gcgcggcttc cgaaggaaat gccaggcgct caagcccatg 481 tacagcatga tgaacgggct cggcttcaac cacctcccgg acacctacgg cttccagggc 541 tcggccggcg gcctctcgtg cccgcccaac agcctggcgc tggagggcgg cctgggcatg 601 atgaacggcc acttgccggg caacgtggac ggcatggccc tgcccagcca ctcggtgccc 661 cacctgcctt ccaacggcgg ccactcgtac atgggcggct gcggcggcgc ggcggccggc 721 gagtacccgc accacgacag ctcggtgccc gcctccccgc tgctgcccac cggcgccggt 781 ggggtcatgg agccgcacgc cgtctactcg ggctcggcgg cggcctggcc gccctcggcg 841 tccgcggcgc tcaacagcgg cgcctcttat atcaagcagc agcccctgtc cccctgtaac 901 cccgcggcca accccctgtc cggcagcctc tccacgcact ccctggagca gccgtatctg 961 caccagaaca gccacaacgc cccagccgag ctgcaaggca tcccgcggta tcactcgcag 1021 tcgcccagca tgtgtgaccg aaaggagttt gtcttctctt tcaacgccat ggcgtcctct 1081 tccatgcact cggccggcgg gggctcctac taccaccagc aggtcaccta ccaagacatc 1141 aagccttgcg tgatgtgagg ctgccgccgc aggccctcct ggtgcaggca ggcgggtcac 1201 agggaccctg gaccggcaca agaaactgct ttcttctcga ggtataaccg tcggcagaag 1261 aaaagggttc cacctctccc caaccggagt ttttggcaag gagtccccaa tgcaaagaca 1321 cagcgctgcg gttggcacct ccttcctcac tccctcaaaa ttgttaagaa atgttagtgg 1381 tgggtctgat ctgactgcag ccatcggtaa ataaaagttt ttgatcctgt tgaacccgcc 1441 tgagacggtg ctgtgcaggg gaaagccccg cacccacaca ggaattctgc tgaggtcccc 1501 ctccttccgg ccaatggcag aagtggggga aaatttttag aagaaaagca acatgtgaga 1561 ccaatcatta tcaaatactt ttatttttgg gtgagtatta accttttaat ttttaatttt 1621 tttttgaaag aatgtcttgg aatgcgcaag tctcccttta gagccgtctt ttgcagggag 1681 cgggaagtga caagagctca gatctccctc ccgatctccc tccccacctc cgaagtctcc 1741 tccgtggacc acaggtggat ctttgtgcga acaacttgca tttcggaagc cactgtccgt 1801 ctttaaacag aaagtcaaag gagccacgaa gcaagcggcc gtccgggcgt ccgcctccgt 1861 ccccttccat gttcctcctc ttccttcgct tcagcctctt ctgttatgtt ttgtcttgaa 1921 ttttatttag actttttcag tgggtatttt tctgtctccc aacctctact gtaaactttc 1981 tggtccgaga acgagccgaa cacagcgcga cgcagggact aggacggccc ggtgaccgcg 2041 cggattcagg attgcgggga cgcagaaagg ttaaggcact tttaaaaact atagcaaggc 2101 tcctgtttat ttattctact ttctttccct aataatcaaa acaccgcgta ggctcctccg 2161 tttatcagta ttaatggtgt aactttgttg gcaatatttg ccgtgtagaa ttttttttag 2221 atatccattg taaatttgaa acaaagaccg atctgtgtaa aaacaaattt ccatatgttt 2281 tatataaata tatatataat atgaaggact accctccttt tttttttttt gtattttggc 2341 tgctagagtg cagcatttgt gacacgtatt tgaaatttga aatttccttc tgcactgtat 2401 aaaaggacca tttgaggatg ttttgccttt tgtgtatttt ttcctaaaaa aagaacaaaa 2461 ataaaaatgt ataacatttg tacatggcct ttaaaattgt atcaactaga // LOCUS HSU13261 1908 bp mRNA PRI 15-JUL-1995 DEFINITION Human eIF-2-associated p67 homolog mRNA, complete cds. ACCESSION U13261 NID g687242 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1908) AUTHORS Li,X. and Chang,Y. TITLE Molecular cloning of a human complementary DNA encoding an initiation factor 2-associated protein (p67) JOURNAL Biochim et Biophys Acta 1260, 333-336 (1995) REFERENCE 2 (bases 1 to 1908) AUTHORS Li,X. TITLE Direct Submission JOURNAL Submitted (11-AUG-1994) X. Li, Biochemistry & Molecular Biology, St. Louis University, 1402 S. Grand Blvd., St. Louis, MO 63104, USA FEATURES Location/Qualifiers source 1..1908 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" CDS 35..1471 /note="similar to rat eIF-2 associated p67: PIR Accession Number A46702" /codon_start=1 /db_xref="PID:g687243" /translation="MAGVEEVAASGSHLNGDLDPDDREEGAASTAEEAAKKKRRKKKK SKGPSAAGEQEPDKESGASVDEVARQLERSALEDKERDEDDEDGDGDGDGATGKKKKK KKKKRGPKVQTDPPSVPICDLYPNGVFPKGQECEYPPTQDGRTAAWRTTSEEKKALDQ ASEEIWNDFREAAEAHRQVRKYVMSWIKPGMTMIEICEKLEDCSRKLIKENGLNAGLA FPTGCSLNNCAAHYTPNAGDTTVLQYDDICKIDFGTHISGRIIDCAFTVTFNPKYDTL LKAVKDATNTGIKCAGIDVRLCDVGEAIQEVMESYEVEIDGKTYQVKPIRNLNGHSIG QYRIHAGKTVPIVKGGEATRMEEGEVYAIETFGSTGKGVVHDDMECSHYMKNFDVGHV PIRLPRTKHLLNVINENFGTLAFCRRWLDRLGESKYLMALKNLCDLGIVDPYPPLCDI KGSYTAQFEHTILLRPTCKEVVSRGDDY" polyA_site 1908 /note="19 A residues" BASE COUNT 644 a 333 c 440 g 491 t ORIGIN 1 ctctgtctca ttccctcgcg ctctctcggg caacatggcg ggtgtggagg aggtagcggc 61 ctccgggagc cacctgaatg gcgacctgga tccagacgac agggaagaag gagctgcctc 121 tacggctgag gaagcagcca agaaaaaaag acgaaagaag aagaagagca aagggccttc 181 tgcagcaggg gaacaggaac ctgataaaga atcaggagcc tcagtggatg aagtagcaag 241 acagttggaa agatcagcat tggaagataa agaaagagat gaagatgatg aagatggaga 301 tggcgatgga gatggagcaa ctggaaagaa gaagaaaaag aagaagaaga agagaggacc 361 aaaagttcaa acagaccctc cctcagttcc aatatgtgac ctgtatccta atggtgtatt 421 tcccaaagga caagaatgcg aatacccacc cacacaagat gggcgaacag ctgcttggag 481 aactacaagt gaagaaaaga aagcattaga tcaggcaagt gaagagattt ggaatgattt 541 tcgagaagct gcagaagcac atcgacaagt tagaaaatac gtaatgagct ggatcaagcc 601 tgggatgaca atgatagaaa tctgtgaaaa gttggaagac tgttcacgca agttaataaa 661 agagaatgga ttaaatgcag gcctggcatt tcctactgga tgttctctca ataattgtgc 721 tgcccattat actcccaatg ccggtgacac aacagtatta cagtatgatg acatctgtaa 781 aatagacttt ggaacacata taagtggtag gattattgac tgtgctttta ctgtcacttt 841 taatcccaaa tatgatacgt tattaaaagc tgtaaaagat gctactaaca ctggaataaa 901 gtgtgctgga attgatgttc gtctgtgtga tgttggtgag gccatccaag aagttatgga 961 gtcctatgaa gttgaaatag atgggaagac atatcaagtg aaaccaatcc gtaatctaaa 1021 tggacattca attgggcaat atagaataca tgctggaaaa acagtgccga ttgtgaaagg 1081 aggggaggca acaagaatgg aggaaggaga agtatatgca attgaaacct ttggtagtac 1141 aggaaaaggt gttgttcatg atgatatgga atgttcacat tacatgaaaa attttgatgt 1201 tggacatgtg ccaataaggc ttccaagaac aaaacacttg ttaaatgtca tcaatgaaaa 1261 ctttggaacc cttgccttct gccgcagatg gctggatcgc ttgggagaaa gtaaatactt 1321 gatggctctg aagaatctgt gtgacttggg cattgtagat ccatatccac cattatgtga 1381 cattaaagga tcatatacag cgcaatttga acataccatc ctgttgcgtc caacatgtaa 1441 agaagttgtc agcagaggag atgactatta aacttagtcc aaagccacct caacaccttt 1501 attttctgag ctttgttgga aaacatgata ccagaattaa tttgccacat gttgtctgtt 1561 ttaacagtgg acccatgtaa tacttttatc catgtttaaa aagaaggaat ttggacaaag 1621 gcaaaccgtc taatgtaatt aaccaacgaa aaagctttcc ggacttttaa atgctaactg 1681 tttttcccct tcctgtctag gaaaatgcta taaagctcaa attagttagg aatgacttat 1741 acgttttgtt ttgaatacct aagagatact ttttggatat ttatattgcc atattcttac 1801 ttgaatgctt tgaatgacta catccagttc tgcacctata ccctctggtg ttgcttttta 1861 accttcctgg aatccatttc taaaaaataa agacattttc agatctga // LOCUS HSU13395 1475 bp mRNA PRI 15-SEP-1994 DEFINITION Human oxidoreductase (HHCMA56) mRNA, complete cds. ACCESSION U13395 NID g538131 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1475) AUTHORS Gmerek,R.E. and Medford,J.I. TITLE The complete sequence of a human hippocampus gene (HHCMA56) shows homology to developmental genes from Arabidopsis and Brassica napus JOURNAL Unpublished REFERENCE 2 (bases 1 to 1475) AUTHORS Gmerek,R.E. TITLE Direct Submission JOURNAL Submitted (11-AUG-1994) Ronald E. Gmerek, Biology, Eberly College of Science, The Pennsylvania State University, 506 Wartik Laboratory, University Park, PA 16802, USA FEATURES Location/Qualifiers source 1..1475 /organism="Homo sapiens" /isolate="two year old female" /note="DSEG number: D16S432E" /db_xref="taxon:9606" /clone_lib="hippocampus library, Stratagene catalog number 936205" /sex="female" /tissue_type="hippocampus" /dev_stage="juvenile" /chromosome="16" gene 19..1131 /gene="HHCMA56" CDS 19..1131 /gene="HHCMA56" /codon_start=1 /product="oxidoreductase" /db_xref="PID:g538132" /translation="MARASEAVSRILEEWHKAKVEAMTLDLALLRSVQHFAEAFKAKN VPLHVLVCNAATFALPWSLTKDGLETTFQVNHLGHFYLVQLLPGMFCAAQLLPVSLWS PQSPIDLQILTTPWENWTSVASLQQKTTIGRCWLITGPSSATSSSPTSCTVASPTRGH VERSDRSWKYDVLQHSSQLVGVHTAVYLGEAFHQVHATGSCHHRVLCCCPRTGGSRRD VLQQLLPLHALTRSSERRDGPDPVGLSERLIQERLAASPAKWSSERMGTHTRPVCVPS RKCQAGPLPNVPPTQIRKSKGNKSIHNRVKNLKYQWEAGNSWGKVSLFWGWARHRSLC FLVVACLKVKTWLACRFRISLEKHQQFSSFYCYRIA" BASE COUNT 368 a 401 c 358 g 348 t ORIGIN 1 atcttggcct gcaggaacat ggcaagggcg agtgaagcag tgtcacgcat tttagaagaa 61 tggcataaag ccaaggtaga agcaatgacc ctggacctcg ctctgctccg tagcgtgcag 121 cattttgctg aagcattcaa ggccaagaat gtgcctcttc atgtgcttgt gtgcaacgca 181 gcaacttttg ctctaccctg gagtctcacc aaagatggcc tggagaccac ctttcaagtg 241 aatcatctgg ggcacttcta ccttgtccag ctcctcccag ggatgttttg tgccgctcag 301 ctcctgcccg tgtcattgtg gtctcctcag agtcccatcg atttacagat attaacgact 361 ccttgggaaa actggacttc agtcgcctct ctccaacaaa aaacgactat tgggcgatgc 421 tggcttataa caggtccaag ctctgcaaca tcctcttctc caacgagctg caccgtcgcc 481 tctcccacgc ggggtcacgt cgaacgcagt gatcgatcct ggaaatatga tgtactccaa 541 cattcatcgc agctggtggg tgtacacact gctgtttacc ttggcgaggc ctttcaccaa 601 gtccatgcaa cagggagctg ccaccaccgt gtactgtgct gctgtcccag aactggaggg 661 tctaggaggg atgtacttca acaactgctg ccgctgcatg ccctcaccag aagctcagag 721 cgaagagacg gcccggaccc tgtgggcctc agcgagaggc tgatccaaga acgcttggca 781 gccagtccgg ctaagtggag ctcagagcgg atgggcacac acacccgccc tgtgtgtgtc 841 ccctcacgca agtgccaggc tgggcccctt ccaaatgtcc ctccaacaca gatccgcaag 901 agtaaaggaa ataagagcat tcacaacaga gtgaaaaatc ttaagtacca atgggaagca 961 gggaattcct ggggtaaagt atcacttttc tggggctggg ctaggcatag gtctctttgc 1021 tttctggtgg tggcctgttt gaaagtaaaa acctggttgg cgtgtaggtt ccgtatctcc 1081 ctggagaagc accagcaatt ctcttccttt tactgttata gaatagcctg aggtcccctc 1141 gtccatccag ctaccaccac caccaccact gcagccaggg gctggccttc tcctacttag 1201 ggaagaaaaa gcaagtgttc actgctcctt gctgcattga tccaggagat aattgtttca 1261 ttcatcctga ccaagactga gccagcttag caactgctgg ggagacaaat ctcagaacct 1321 tgtcccagcc agtgaggatg acagtgacac ccagagggag tagaatacgc agaactacca 1381 ggtggcaaag tacttgtcat agactccttt gctaatgcta tacaaaaaat tctttagaga 1441 ttataacaaa tttttcaaat cattccttag atacc // LOCUS HSU13616 14770 bp mRNA PRI 06-APR-1995 DEFINITION Human ankyrin G (ANK-3) mRNA, complete cds. ACCESSION U13616 NID g608024 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 14770) AUTHORS Kordeli,E., Lambert,S. and Bennett,V. TITLE AnkyrinG. A new ankyrin gene with neural-specific isoforms localized at the axonal initial segment and node of Ranvier JOURNAL J. Biol. Chem. 270 (5), 2352-2359 (1995) MEDLINE 95138209 REFERENCE 2 (bases 1 to 14770) AUTHORS Carpenter,S.S. TITLE Direct Submission JOURNAL Submitted (12-AUG-1994) Stanley S. Carpenter, Cell Biology & Biochemistry, Howard Hughes Med. Institute, Duke University Medical Center, 363 Carl Building, Durham, NC 27710, USA FEATURES Location/Qualifiers source 1..14770 /organism="Homo sapiens" /note="multiple clones" /db_xref="taxon:9606" /tissue_type="brain stem" /dev_stage="fetus" 5'UTR 1..192 gene 193..13326 /gene="ANK-3" CDS 193..13326 /gene="ANK-3" /note="480 kDa; antibodies that recognize this sequence stain ankyrin isoform(s) present at the node of Ranvier and axonal initial segment by immunofluorescence" /codon_start=1 /function="peripheral proteins believed to act as membrane-cytoskeleton linker molecules" /product="ankyrin G" /db_xref="PID:g608025" /translation="MAHAASQLKKNRDLEINAEEEPEKKRKHRKRSRDRKKKSDANAS YLRAARAGHLEKALDYIKNGVDINICNQNGLNALHLASKEGHVEVVSELLQREANVDA ATKKGNTALHIASLAGQAEVVKVLVTNGANVNAQSQNGFTPLYMAAQENHLEVVKFLL DNGASQSLATEDGFTPLAVALQQGHDQVVSLLLENDTKGKVRLPALHIAARKDDTKAA ALLLQNDNNADVESKSGFTPLHIAAHYGNINVATLLLNRAAAVDFTARNDITPLHVAS KRGNANMVKLLLDRGAKIDAKTRDGLTPLHCGARSGHEQVVEMLLDRAAPILSKTKNG LSPLHMATQGDHLNCVQLLLQHNVPVDDVTNDYLTALHVAAHCGHYKVAKVLLDKKAN PNAKALNGFTPLHIACKKNRIKVMELLLKHGASIQAVTESGLTPIHVAAFMGHVNIVS QLMHHGASPNTTNVRGETALHMAARSGQAEVVRYLVQDGAQVEAKAKDDQTPLHISAR LGKADIVQQLLQQGASPNAATTSGYTPLHLSAREGHEDVAAFLLDHGASLSITTKKGF TPLHVAAKYGKLEVANLLLQKSASPDAAGKSGLTPLHVAAHYDNQKVALLLLDQGASP HAAAKNGYTPLHIAAKKNQMDIATTLLEYGADANAVTRQGIASVHLAAQEGHVDMVSL LLGRNANVNLSNKSGLTPLHLAAQEDRVNVAEVLVNQGAHVDAQTKMGYTPLHVGCHY GNIKIVNFLLQHSAKVNAKTKNGYTPLHQAAQQGHTHIINVLLQNNASPNELTVNGNT ALGIARRLGYISVVDTLKIVTEETMTTTTVTEKHKMNVPETMNEVLDMSDDEVRKANA PEMLSDGEYISDVEEGEDAMTGDTDKYLGPQDLKELGDDSLPAEGYMGFSLGARSASL RSFSSDRSYTLNRSSYARDSMMIEELLVPSKEQHLTFTREFDSDSLRHYSWAADTLDN VNLVSSPIHSGFLVSFMVDARGGSMRGSRHHGMRIIIPPRKCTAPTRITCRLVKRHKL ANPPPHGERRGISSRLVEMGPAGAQFLGPVIVEIPHFGSMRGKERELIVLRSENGETW KEHQFDSKNEDLTELLNGMDEELDSPEELGKKRICRIITKDFPQYFAVVSRIKQESNQ IGPEGGILSSTTVPLVQASFPEGALTKRIRVGLQAQPVPDEIVKKILGNKATFSPIVT VEPRRRKFHKPITMTIPVPPPSGEGVSNGYKGDTTPNLRLLCSITGGTSPAQWEDITG TTPLTFIKDCVSFTTNVSARFWLADCHQVLETVGLATQLYRELICVPYMAKFVVFAKM NDPVESSLRCFCMTDDKVDKTLEQQENFEEVARSKDIEVLEGKPIYVDCYGNLAPLTK GGQQLVFNFYSFKENRLPFSIKIRDTSQEPCGRLSFLKERKTTKGLPQTAVCNLNITL PAHKKETESDQDDEIEKTDRRQSFASLALRKRYSYLTEPGMIERSTGATRSLPTTYSY KPFFSTRPYQSWTTAPITVPGPAKSGFTSLSSSSSNTPSASPLKSIWSVSTPSPIKST LGASTTSSVKSISDVASPIRSLRTMSSPIKTVVSQSPYNIQVSSGTLARAPAVTEATP LKGLASNSTFSSRTSPVTTAGSLLERSSITMTPPASPKSNINMYSSSLPFKSIITSAA PLISSPLKSVVSPVKSRVDVISSAKITMASSLSSPVKQMPGHAEVALVNGSISPLKYA SSSTLINGCKATATLQEKISSATNSVSSVVSAATDTVEKVFSTTTAMPFSPLRSYVSA APSAFQSLRTPSASALYTSLGSSISATTSSVTSSIITVPVYSVVNVLPEPALKKLPDS NSFTKSAAALLSPIKTLTTETHPQPHFSRTSSPVKSSLFLAPSALKLSTPSSLSSSQE ILKDVAEMKEDLMRMTAILQTDVPEEKPFQPELPKEGRIDDEEPFKIVEKVKEDLVKV SEILKKDVCVDNKGSPKSPKSDKGHSPEDDWIEFSSEEIREARQQAAASQSPSLPERV QVKAKAASEKDYNLTKVIDYLTNDIGSSSLTNLKYKFEDAKKDGEGGQKRVLKPAIAL QEHKLKMPPASMRTSTSEKELCKMADSFFGTDTILESPDDFSQHDQDKSPLSDSGFET RSEKTPSAPQSAETTGPKPLFHEVPIPPVITETRTEVVHVIRSYDPSAGDVPQTQPEE PVSPKPSPTFMELEPKPTTSSIKEKVKAFQMKASSEEDDHNRVLSKGMRVKEETHITT TTRMVYHSPPGGEGASERIEETMSVHDIMKAFQSGRDPSKELAGLFEHKSAVSPDVHK SAAETSAQHAEKDNQMKPKLERIIEVHIEKGNQAEPTEVIIRETKKHPEKEMYVYQKD LSRGDINLKDFLPEKHDAFPCSEEQGQQEEEELTAEESLPSYLESSRVNTPVSQEEDS RPSSAQLISDDSYKTLKLLSQHSIEYHDDELSELRGESYRFAEKMLLSEKLDVSHSDT EESVTDHAGPPSSELQGSDKRSREKIATAPKKEILSKIYKDVSENGVGKVSKDEHFDK VTVLHYSGNVSSPKHAMWMRFTEDRLDRGREKLIYEDRVDRTVKEAEEKLTEVSQFFR DKTEKLNDELQSPEKKARPKNGKEYSSQSPTSSSPEKVLLTELLASNDEWVKARQHGP DGQGFPKAEEKAPSLPSSPEKMVLSQQTEDSKSTVEAKGSISQSKAPDGPQSGFQLKQ SKLSSIRLKFEQGTHAKSKDMSQEDRKSDGQSRIPVKKIQESKLPVYQVFAREKQQKA IDLPDESVSVQKDFMVLKTKDEHAQSNEIVVNDSGSDNVKKQRTEMSSKAMPDSFSEQ QAKDLACHITSDLATRGPWDKKVFRTWESSGATNNKSQKEKLSHVLVHDVRENHIGHP ESKSVDQKNEFMSVTERERKLLTNGSLSEIKEMTVKSPSKKVLYREYVVKEGDHPGGL LDQPSRRSESSAVSHIPVRVADERRMLSSNIPDGFCEQSAFPKHELSQKLSQSSMSKE TVETQHFNSIEDEKVTYSEISKVSKHQSYVGLCPPLEETETSPTKSPDSLEFSPGKES PSSDVFDHSPIDGLEKLAPLAQTEGGKEIKTLPVYVSFVQVGKQYEKEIQQGGVKKII SQECKTVQETRGTFYTTRQQKQPPSPQGSPEDDTLEQVSFLDSSGKSPLTPETPSSEE VSYEFTSKTPDSLIAYIPGKPSPIPEVSEESEEEEQAKSTSLKQTTVEETAVEREMPN DVSKDSNQRPKNNRVAYIEFPPPPPLDADQIESDKKHHYLPEKEVDMIEVNLQDEHDK YQLAEPVIRVQPPSPVPPGADVSDSSDDESIYQPVPVKKYTFKLKEVDDEQKEKPKAS AEKASNQKELESNGSGKDNEFGLGLDSPQNEIAQNGNNDQSITECSIATTAEFSHDTD ATEIDSLDGYDLQDEDDGLTESDSKLPIQAMEIKKDIWNTEGILKPADRSFSQSKLEV IEEEGKVGPDEDKPPSKSSSSEKTPDKTDQKSGAQFFTLEGRHPDRSVFPDTYFSYKV DEEFATPFKTVATKGLDFDPWSNNRGDDEVFDSKSREDETKPFGLAVEDRSPATTPDT TPARTPTDESTPTSEPNPFPFHEGKMFEMTRSGAIDMSKRDFVEERLQFFQIGEHTSE GKSGDQGEGDKSMVTATPQPQSGDTTVETNLERNVETPTVEPNPSIPTSGECQEGTSS SGSLEKSAAATNTSKVDPKLRTPIKMGISASTMTMKKEGPGEITDKIEAVMTSCQGLE NETITMISNTANSQMGVRPHEKHDFQKDNFNNNNNLDSSTIQTDNIMSNIVLTEHSAP TCTTEKDNPVKVSSGKKTGVLQGHCVRDKQKVLGEQQKTKELIGIRQKSKLPIKATSP KDTFPPNHMSNTKASKMKQVSQSEKTKALTTSSCVDVKSRIPVKNTPRDNIIAVRKAC ATQKQGQPEKGKAKQLPSKLPVKVRSTCVTTTTTTATTTTTTTTTTTTSCTVKVRKSQ LKEVCKHSIEYFKGISGETLKLVDRLSEEEKKMQSELSDEEESTSRNTSLSETSRGGQ PSVTTKSARDKKTEAAPLKSKSEKAGSEKRSSRRTGPQSPCERTDIRMAIVADHLGLS WTELARELNFSVDEINQIRVENPNSLISQSFMLLKKWVTRDGKNATTDALTSVLTKIN RIDIVTLLEGPIFDYGNISGTRSFADENNVFHDPVDGWQNETSSGNLESCAQARRVTG GLLDRLDDSPDQCRDSITSYLKGEAGKFEANGSHTEITPEAKTKSYFPESQNDVGKQS TKETLKPKIHGSGHVEEPASPLAAYQKSLEETSKLIIEETKPCVPVSMKKMSRTSPAD GKPRLSLHEEEGSSGSEQKQGEGFKVKTKKEIRHVEKKSHS" repeat_region 408..2679 /note="encodes ankyrin repeat region" misc_feature 4625..12436 /gene="ANK-3" /note="This region is removed by tissue-specific alternative mRNA processing" misc_feature 4625..5916 /gene="ANK-3" /note="encodes unusual serine-rich domain" 3'UTR 13327..14770 polyA_signal 14745..14750 BASE COUNT 4829 a 3240 c 3211 g 3490 t ORIGIN 1 gggacaccat catttgggtt tttaaaagct tcagcttctc cagcatcttt gcattgtgaa 61 tatcttcctg cttctagtca aagctttctc tctagtgatt ggaatgctgc tgtaagccta 121 ttagaggtgc ctgaggatat caccttttac aaggaagccg tgtgtgcttg agaggatctt 181 tttaaatgca ttatggctca tgcagcctca caattaaaga aaaacaggga tttagaaatc 241 aatgctgaag aagagcctga gaaaaaaagg aaacaccgca aacggtcccg ggatcggaag 301 aaaaagtctg atgccaatgc aagttactta agagcagctc gagctggaca ccttgaaaag 361 gccctcgact acataaaaaa tggagttgac atcaacattt gcaatcagaa tgggttgaac 421 gctctccacc ttgcttccaa agaaggccat gtagaggttg tttctgagct gctgcagaga 481 gaagccaatg tggatgcagc tacaaagaaa ggaaacacag cattgcacat cgcatctttg 541 gctgggcaag cagaggtggt aaaagtcttg gttacaaatg gagccaatgt caatgcacaa 601 tctcagaatg gtttcacgcc attgtatatg gcagcccagg aaaatcacct ggaagttgtc 661 aagtttcttc ttgacaatgg tgcaagccag agcctagcca cagaggatgg cttcacacca 721 ttggcagtgg ctttgcaaca aggtcacgac caagtcgttt cgctcctgct agagaatgac 781 accaaaggaa aagtgcgtct cccagctctt catatcgcgg cccgcaaaga cgacacaaaa 841 gccgccgccc tgctgctgca gaatgacaac aatgcagatg tggaatcaaa gagtggcttc 901 actccgctcc acatagctgc tcactatgga aatatcaatg tagccacgtt gctgttaaac 961 cgagcggctg ctgtggattt caccgcaagg aatgacatca ctcctttaca tgttgcatca 1021 aaaagaggaa atgcaaatat ggtaaaacta ttgctcgatc gaggagctaa aatcgatgcc 1081 aaaaccaggg atggtctgac accactgcac tgtggagcaa ggagtggcca cgagcaggtg 1141 gtagaaatgt tgcttgatcg agctgccccc attctttcaa aaaccaagaa tggattatct 1201 ccattgcaca tggccacaca aggggatcat ttaaactgcg tccagcttct cctccagcat 1261 aatgtacccg tggatgatgt caccaatgac tacctgactg ccctacacgt ggctgcccac 1321 tgtggccatt acaaagttgc caaggttctc ttggataaga aagctaaccc caatgccaaa 1381 gccctgaatg gctttacccc tcttcatatc gcctgcaaga agaatcgaat taaagtaatg 1441 gaactccttc tgaaacacgg tgcatccatc caagctgtaa ccgagtcggg ccttacccca 1501 atccatgttg ctgccttcat ggggcatgta aatattgtat cacaactaat gcatcatgga 1561 gcctcaccaa acaccaccaa tgtgagagga gaaacagcac tgcacatggc agctcgctcc 1621 ggccaagctg aagttgtgcg gtatctggta caagacggag ctcaggtaga agctaaagct 1681 aaggatgacc aaacaccact ccacatttca gcccgactgg ggaaagcaga catagtacaa 1741 cagctgttgc agcaaggggc atctccaaat gcagccacaa cttctgggta caccccactt 1801 cacctttccg cccgagaggg gcatgaggat gtggccgcgt tccttttgga tcatggagcg 1861 tctttatcta taacaacaaa gaaaggattt actcctcttc atgtggcagc aaaatatgga 1921 aagcttgaag tcgccaatct cctgctacag aaaagtgcat ctccagatgc tgctgggaag 1981 agcgggctaa caccactgca tgtagctgca cattacgata atcagaaagt ggcccttctg 2041 cttttggacc aaggagcctc acctcacgca gccgcaaaga atggttatac gccactgcac 2101 atcgctgcca aaaagaacca gatggacata gcgacaactc tgctggaata tggtgctgat 2161 gccaacgcag ttacccggca aggaattgct tccgtccatc tcgcagctca ggaagggcac 2221 gtggacatgg tgtcgctgct cctcggtaga aatgcgaatg tgaacctgag caataagagc 2281 ggcctgaccc cactccattt ggctgctcaa gaagatcgag tgaatgtggc agaagtcctc 2341 gtaaaccaag gggctcatgt ggacgcccag acaaagatgg gatacacacc actgcatgtg 2401 ggctgccact atggaaatat caagattgtt aatttcctgc tccagcattc tgcaaaagtt 2461 aatgccaaaa caaagaatgg gtatacgcca ttacatcaag cagcacagca ggggcatacg 2521 catataataa atgtcttact tcagaacaac gcctccccca atgaactcac tgtgaatggg 2581 aatactgccc ttggcattgc ccggcgcctc ggctacatct cagtagtgga caccctgaag 2641 atagtgaccg aagaaaccat gaccacaact actgtcacag agaagcacaa aatgaatgtt 2701 ccagaaacga tgaatgaagt tcttgatatg tctgatgatg aagttcgtaa agccaatgcc 2761 cctgaaatgc tcagtgatgg cgaatatatc tcagatgttg aagaaggtga agatgcaatg 2821 accggggaca cagacaaata tcttgggcca caggacctta aggaattggg tgatgattcc 2881 ctgcctgcag agggttacat gggctttagt ctcggagcgc gttctgccag cctccgctcc 2941 ttcagttcgg ataggtctta caccttgaac agaagctcct atgcacggga cagcatgatg 3001 attgaagaac tccttgtgcc atccaaagag cagcatctaa cattcacaag ggaatttgat 3061 tcagattctc ttagacatta cagctgggct gcagacacct tagacaatgt caatcttgtt 3121 tcaagcccca ttcattctgg gtttctggtt agctttatgg tggacgcgag agggggctcc 3181 atgagaggaa gccgtcatca cgggatgaga atcatcattc ctccacgcaa gtgtacggcc 3241 cccactcgaa tcacctgccg tttggtaaag agacataaac tggccaaccc acccccacat 3301 ggtgaaagga gagggattag cagtaggctg gtagaaatgg gtcctgcagg ggcacaattt 3361 ttaggccctg tcatagtgga aatccctcac tttgggtcca tgagaggaaa agagagagaa 3421 ctcattgttc ttcgaagtga aaatggtgaa acttggaagg agcatcagtt tgacagcaaa 3481 aatgaagatt taaccgagtt acttaatggc atggatgaag aacttgatag cccagaagag 3541 ttagggaaaa agcgtatctg caggattatc acgaaagatt tcccccagta ttttgcagtg 3601 gtttcccgga ttaagcagga aagcaaccag attggtcctg aaggtggaat tctgagcagc 3661 accacagtgc cccttgttca agcatctttc ccagagggtg ccctaactaa aagaattcga 3721 gtgggcctcc aggcccagcc tgttccagat gaaattgtga aaaagatcct tggaaacaaa 3781 gcaactttta gcccaattgt cactgtggaa ccaagaagac ggaaattcca taaaccaatc 3841 acaatgacca ttccggtgcc cccgccctca ggagaaggtg tatccaatgg atacaaaggg 3901 gacactacac ccaatctgcg tcttctctgt agcattacag ggggcacttc gcctgctcag 3961 tgggaagaca tcacaggaac aactcctttg acgtttataa aagattgtgt ctcctttaca 4021 accaatgttt cagccagatt ttggcttgca gactgccatc aagttttaga aactgtgggg 4081 ttagccacgc aactgtacag agaattgata tgtgttccat atatggccaa gtttgttgtt 4141 tttgccaaaa tgaatgatcc cgtagaatct tccttgcgat gtttctgcat gacagatgac 4201 aaagtggaca aaactttaga gcaacaagag aattttgagg aagtcgcaag aagcaaagat 4261 attgaggttc tggaaggaaa acctatttat gttgattgtt atggaaattt ggccccactt 4321 accaaaggag gacagcaact tgtttttaac ttttattctt tcaaagaaaa tagactgcca 4381 ttttccatca agattagaga caccagccaa gagccctgtg gtcgtctgtc ttttctgaaa 4441 gaacgaaaga caacaaaagg actgcctcaa acagcggttt gcaacttaaa tatcactctg 4501 ccagcacata aaaaggagac agagtcagat caagatgatg agattgagaa aacagataga 4561 cgacagagct tcgcatcctt agctttacgt aagcgctaca gctacttgac tgagcctgga 4621 atgattgaac ggagtacagg agcaacaaga tccctcccca ccacttactc atacaagcca 4681 ttcttttcta caagaccata ccagtcctgg acaacagctc cgattacagt gcctgggcca 4741 gccaagtcag gcttcacttc cttatcaagt tcttcctcta atacgccatc agcttctccg 4801 ttaaaatcaa tatggtctgt ttcgacacct tctccaatca aatccacatt aggcgcgtca 4861 actacatctt cagttaaatc cattagtgac gtggcatctc caattagatc cttacggaca 4921 atgtcttcgc cgataaaaac tgtggtgtca caatctccat acaatatcca ggtttcctct 4981 ggtaccctgg ctagagctcc agcagtcacg gaagctacgc ccttaaaagg gctggcatcc 5041 aattctacgt tttcctctcg aacctctcca gtgactacag cagggtctct tttggagagg 5101 tcatcaatta ctatgacacc ccctgcctcc cccaaatcaa acattaatat gtattcctca 5161 agtttgccat ttaagtcaat tattacatca gcagcaccgc taatatcttc acctttaaag 5221 tcagtggtgt ctccagttaa atcacgagtt gatgtcattt catcagccaa aattacaatg 5281 gcatcttctc tctcatcacc tgtgaagcag atgcctggac atgcagaggt agcattagtc 5341 aatggatcta tttcccctct aaaatatgca tcatcctcaa ctttaattaa tggatgcaaa 5401 gccactgcca cgttacagga aaaaatttct tctgctacaa actctgtgag ctctgtggtc 5461 agtgcagcca ctgacacagt tgagaaagtg ttttctacca cgactgcaat gccattttcc 5521 ccactcaggt catatgtttc tgcagcacca tcagcttttc agtctctaag aactccttcc 5581 gcaagtgcac tctatacatc ccttgggtcg tcaatatctg caactacctc atctgtaact 5641 tcatcaatta taacagtgcc agtatactct gtagtcaatg ttttgccaga accagcatta 5701 aagaaacttc cagactctaa ttcatttaca aaatcagcag cagccttgct gtcacccatt 5761 aaaacattga ctacggagac acatcctcag cctcacttca gtcgaacttc atctccagtt 5821 aagtcatctt tgttccttgc accctctgcc cttaagttgt ctacaccatc ttctttatct 5881 tccagtcagg agatactaaa agatgtagct gaaatgaaag aggacctaat gcggatgacc 5941 gcaatactac agacagatgt gcctgaggag aagccattcc aacctgaact cccaaaggaa 6001 gggagaatag atgatgaaga acctttcaaa attgtagaga aagtaaagga agacttagtg 6061 aaagttagtg aaatccttaa aaaggatgta tgtgtagata ataaaggatc acccaaatca 6121 ccaaagagtg acaaaggaca ctctcctgaa gatgactgga tagaatttag ttcggaagaa 6181 atccgggaag ccagacaaca agctgctgcg agccagtctc catctctgcc agagagagtg 6241 caagtaaaag caaaagccgc ctccgaaaag gattataact tgaccaaagt tattgattac 6301 ctaacaaatg atattgggag tagttcactg acaaacttaa aatacaagtt tgaggatgca 6361 aagaaggatg gtgagggagg acagaaaaga gttttaaaac cagcaattgc tttgcaggaa 6421 cacaaactca aaatgcctcc agcctccatg aggacttcca cctctgagaa agaattgtgt 6481 aaaatggctg attccttttt tggaacagat actattttag agtctcctga tgacttttct 6541 caacacgacc aagataaaag tcccttgtct gacagtggct ttgaaacaag aagtgaaaag 6601 acaccttcag ccccacaaag cgctgaaacg actggtccta aaccactttt tcatgaagtt 6661 cccatccctc ctgttattac agaaacaaga actgaagtgg ttcatgttat caggagctat 6721 gatccctcag ctggggatgt tccccagacc caaccagagg agcctgtgtc acctaaacct 6781 tcacctactt ttatggaatt ggaaccaaag cccaccacct ctagtattaa agaaaaggtt 6841 aaagcatttc aaatgaaagc cagtagtgaa gaagatgacc acaatcgggt tttaagcaaa 6901 ggcatgcgtg ttaaagaaga gactcacata accacaacca ccagaatggt ttatcattct 6961 ccaccaggcg gtgaaggtgc atctgaaaga attgaagaaa ccatgtcagt ccatgacatc 7021 atgaaggcct ttcagtccgg gcgggatcct tccaaagaac tggcaggtct gtttgaacat 7081 aagtcggcag tgtctccaga tgttcacaag tctgctgctg aaacctcagc ccagcatgca 7141 gagaaggaca accaaatgaa acccaaactg gagcgtataa tagaagtcca catcgaaaaa 7201 ggtaaccaag ctgagcccac tgaagtcatt attagagaaa ccaaaaagca tccagaaaaa 7261 gaaatgtatg tatatcagaa agacttatcc cggggagata ttaacctaaa agattttctg 7321 ccagaaaaac acgatgcttt tccttgttca gaggaacagg gtcagcaaga agaagaagaa 7381 cttactgctg aagagtcatt gccttcttat ctggagtctt ccagagtaaa cactcctgtg 7441 tcccaagaag aagatagccg ccctagttct gctcaactca tatctgatga ctcttataaa 7501 acattgaagc ttttgagtca acactcaata gaataccatg acgatgagtt gtcagaacta 7561 agaggggagt cttacaggtt tgctgagaaa atgcttctgt cagaaaagct agatgtgtct 7621 cattctgata ctgaggaatc ggttacagac catgcaggac cccctagctc agagttacag 7681 gggtctgata agcggtccag agaaaaaata gccactgccc ccaaaaaaga aattctctcc 7741 aaaatctata aagatgtttc tgaaaatggt gtaggtaaag tgtctaaaga tgagcatttt 7801 gataaagtga cagtgttgca ctattctggc aatgttagta gtccaaaaca tgccatgtgg 7861 atgcgcttta ctgaggacag attagacaga ggtagagaga agttgatata tgaagatagg 7921 gtggacagga ctgtgaagga ggctgaagaa aaactgactg aagtgtcaca gttttttcgt 7981 gacaaaactg aaaagctaaa tgatgaactg cagtccccag agaaaaaggc acgccctaaa 8041 aatggcaaag aatattcttc tcaaagccct accagtagca gccctgagaa agtgctactg 8101 acagaactgc tggcatccaa tgatgagtgg gttaaggcaa gacagcatgg ccctgatgga 8161 caaggcttcc ccaaggccga ggagaaggca cccagtctgc ccagcagccc agagaagatg 8221 gttctctccc aacagactga ggacagcaag tccacagtgg aagccaaagg aagtatttca 8281 cagagcaaag caccagatgg gccccagtct ggattccagc tcaaacaatc taaactcagt 8341 tccattagat taaaatttga acaaggcaca cacgcaaaaa gtaaggacat gtctcaagaa 8401 gacagaaagt cagatggcca gtccagaatc ccagttaaaa aaatacagga gagcaagcta 8461 cccgtctacc aagtttttgc tagagaaaaa cagcagaagg ccatagacct cccagatgaa 8521 agtgtatctg tgcaaaaaga ttttatggta ttaaaaacca aagatgagca tgcccaaagc 8581 aacgaaattg ttgtaaatga ttctggctct gataatgtga aaaaacagag aactgaaatg 8641 tcaagtaaag caatgcctga ctctttttct gagcagcagg ctaaagactt ggcatgtcat 8701 ataacctcag atttagcaac taggggacca tgggacaaaa aggtctttag aacatgggag 8761 agttcgggag ccactaacaa taagtctcag aaagaaaaac tttcgcatgt acttgttcat 8821 gatgtaagag agaatcacat tggtcaccct gagagtaaaa gtgttgatca aaagaatgaa 8881 tttatgtctg tgactgagag agaacgcaaa ttgttaacaa acggctctct ctcagaaatt 8941 aaagaaatga ctgtaaaatc tccctccaaa aaagtcttat atagggaata tgttgtgaaa 9001 gaaggggacc atccaggcgg attgcttgat cagccttcca ggaggagcga gagctcagca 9061 gtgtcacaca ttcccgtcag agttgctgat gagaggagaa tgctgtcttc taatattccc 9121 gatggttttt gtgaacagtc ggcatttcca aaacatgaac tatcacaaaa attgtcccag 9181 tcaagcatga gtaaagagac agttgagaca cagcacttta attctataga agatgaaaaa 9241 gttacctatt cagaaatcag caaagtttcc aaacaccaga gttatgtagg tttatgccca 9301 cctctcgagg aaaccgaaac ctcccccacc aaatctcctg attctttaga gtttagccca 9361 ggaaaggaat ctccctctag tgatgtattc gaccacagtc ccattgatgg attggaaaaa 9421 ctcgcaccac tagcccagac agagggaggg aaagagataa aaactttacc cgtttatgtc 9481 agttttgtac aagtggggaa gcaatatgaa aaggagatac aacaaggagg tgtaaaaaaa 9541 atcataagtc aggaatgtaa gacagtacaa gaaaccaggg ggacctttta tacaactaga 9601 cagcaaaagc aacctccttc tccccaaggt agtccagaag atgatactct agagcaagta 9661 tcctttctag acagctctgg gaaaagccct ttaaccccag aaacacccag ttcagaggaa 9721 gtgagttatg aatttacatc taagacacct gactcgctca tagcttatat accaggcaaa 9781 cccagcccaa ttcccgaggt ttctgaggag tcagaggagg aggaacaggc caagtcaacc 9841 tcccttaagc agactacagt ggaggaaaca gcagttgagc gtgaaatgcc taatgacgtg 9901 agcaaagact ctaaccaaag acccaaaaat aacagagttg cctatattga atttccccct 9961 cctccaccac tggatgcgga ccagattgag tcagataaga agcatcatta tctcccagaa 10021 aaagaggttg acatgattga agtcaatctg caagatgagc atgacaagta ccagctggct 10081 gaacctgtca ttagagtgca gccaccttca ccagttcctc ccggggcaga cgtcagtgat 10141 tcaagcgatg acgaatctat ttatcagcca gtcccagtta aaaaatatac cttcaaatta 10201 aaggaagtgg acgatgaaca aaaagaaaaa cccaaagctt ctgctgaaaa ggcttccaac 10261 cagaaagaac tggaaagtaa tggatctgga aaagataatg aatttggcct tggccttgat 10321 tcacctcaga atgaaattgc ccagaatggg aacaacgacc agtccatcac agagtgttcc 10381 attgccacca cagcagagtt ttctcatgac acggatgcca cagagatcga ctctctggat 10441 ggctatgacc tgcaagatga agatgatggc ttgacagaga gtgattctaa actcccaatt 10501 caagccatgg aaattaagaa agatatctgg aacacagagg gcattctgaa gccagctgac 10561 cgctctttta gccaaagtaa acttgaagtt atcgaggagg agggaaaggt gggaccagat 10621 gaggacaagc caccttctaa aagttcttca tctgaaaaga ctcctgataa gactgatcag 10681 aagtcagggg cccagttctt cacactggaa ggcagacatc ctgacagatc agtgtttcct 10741 gatacttact tcagttacaa agtagatgaa gaatttgcca ctccttttaa aacagtagct 10801 accaaaggtc tagattttga cccttggtct aataaccgag gggatgatga agtttttgac 10861 agtaaatcac gggaagatga aactaagcca tttgggctgg cggtagaaga ccgctctcca 10921 gcaacaaccc ctgatacaac gccagccaga acgccaactg atgaaagtac cccaactagt 10981 gagcctaacc ccttcccatt tcatgaagga aaaatgtttg agatgactcg cagtggtgca 11041 attgacatga gcaagaggga ttttgttgaa gagaggctcc aatttttcca gattggtgag 11101 catacttctg aagggaagtc aggggaccag ggggaagggg ataaaagtat ggtcactgcc 11161 acaccacagc cacagtcagg ggacaccact gtagaaacca atctagagag aaatgtagag 11221 acacctacag tggaacctaa ccccagcatc ccgaccagcg gagagtgtca ggaaggcaca 11281 tccagtagtg gctccctgga gaaatcagca gcagccacta acacctctaa agttgacccc 11341 aagttgcgca cgcctataaa aatgggaatt tctgcatcca ccatgaccat gaagaaagaa 11401 ggccctggag aaataacaga taagatagaa gcggtgatga ccagttgtca gggattagaa 11461 aatgaaacta taacaatgat ttcaaataca gccaatagcc agatgggcgt taggccccat 11521 gaaaaacatg attttcaaaa agataacttt aataacaaca acaatttgga ttcttccact 11581 atacagacag ataacattat gagtaatata gttctgacag aacattctgc acccacttgt 11641 accacagaga aagataaccc agtgaaagtc tcatcaggaa aaaagacagg ggtactacaa 11701 ggacactgtg taagagataa gcagaaagtt cttggagaac agcaaaaaac aaaggaattg 11761 atagggatta ggcaaaaatc caaacttccc ataaaggcca cttcaccaaa agataccttc 11821 ccaccgaacc atatgtcaaa cactaaagca agtaaaatga agcaggttag tcaatccgag 11881 aaaaccaaag cccttactac ttcttcatgt gtagatgtaa agtccagaat tccagtgaaa 11941 aacacaccca gggataacat aattgcagtt agaaaagcat gtgccacaca aaagcaaggg 12001 cagccagaga aaggcaaggc caaacagctt ccatccaagt tgccagtaaa ggtaagatcc 12061 acctgtgtca ctaccaccac caccactgcc accaccacca ccactaccac cactaccacc 12121 accaccagct gcacagttaa agttaggaaa agtcagctca aggaagtatg taaacattcc 12181 attgaatatt ttaagggaat tagtggtgag accttaaagc ttgtggaccg cctctctgaa 12241 gaagaaaaaa agatgcagtc cgagttgtcc gatgaggaag aaagtacctc aagaaacacg 12301 tcgttgtccg agacttcccg gggtggccag ccttcggtta caacgaagtc tgctagagat 12361 aagaaaacag aggcagcacc tttaaaatca aagagtgaaa aggccggcag tgagaaaagg 12421 agcagtagaa ggactggtcc acagagtcca tgtgaacgga cagatatcag gatggcaata 12481 gtagccgatc acctgggact tagttggaca gaactggcaa gggaactgaa tttttcagtg 12541 gatgaaatca atcaaatacg tgtggaaaat ccaaattctt taatttctca gagcttcatg 12601 ttattaaaaa aatgggttac cagagacgga aaaaatgcca caactgatgc cttaacttcg 12661 gtcttgacaa aaattaatcg aatagatata gtgacactgc tagaaggacc aatatttgat 12721 tatggaaata tttcaggcac cagaagtttt gcagatgaga acaatgtttt ccatgaccct 12781 gttgatggtt ggcagaatga gacatcaagt ggaaacctag agtcctgcgc tcaagctcga 12841 agagtaactg gtgggttact agatcgactg gatgacagcc ctgaccagtg tagagattcc 12901 attacctcat atctcaaagg agaagctggc aaatttgaag caaatggaag ccatacagaa 12961 atcactccag aagcaaagac aaaatcttac tttccagaat cccaaaatga tgtaggaaaa 13021 cagagtacca aggaaactct gaaaccaaaa atacatggat ctggtcatgt tgaagaacca 13081 gcatcaccac tagcagcata tcagaaatct ctagaagaaa ccagcaagct tataatagaa 13141 gagactaaac cctgtgtgcc tgtcagtatg aaaaagatga gtaggacttc tccagcagat 13201 ggcaagccaa ggcttagcct ccatgaagaa gaggggtcca gtgggtctga gcaaaagcag 13261 ggagaaggtt ttaaggtgaa aacgaagaaa gaaatccggc atgtggaaaa gaagagccac 13321 tcgtaacagc gaacggtcag tcaaggatca taagttttta ctgccagtat tgagaaattc 13381 gtggaagaaa tgtcagcagg aagtaaaaat tcaccgagaa gtgtgtgtgt gttcgctgct 13441 tccacacatt aatggcatga ttttttttat gcaaaaagaa aagaaaagaa aaccacccac 13501 attttaattt agcatgagcc aatttacaga gcatggaaat cactttcatt tccggattgg 13561 cgcgtgtgca attagcaatg cagtgtatat acaagaagcc atgctgttaa cagtttatct 13621 caagtaattt ggtgtcattt acaaaaggaa gaaattcttg ccatcagaag ggacagaagg 13681 agatggaatg atcaaatcac agagaaacac taaaattact aaatctacaa cagctcgcct 13741 tatttttctt ggacccaaac tgtcagggta taaacactat ttgtttcttt tttaaaacat 13801 cgatagtttt gctgtaaata ataacactgt ataaattcta acagaaaaat agaataaaat 13861 gttgataagg cactgcctct tagaacacaa acagaatatt gcaaatgcat ttaacaaata 13921 ggggtctcaa gtgacttcac cctagaagag gagggaggga ttctgggtgg gggtgactct 13981 tcactgattc ttgtatatta gcaattataa tgtttgtata tgcaaaactg ctagcttgat 14041 tttaaaaaaa aacaaatctt taaaatctgc tgcaaattta aataattccc aaaatgagga 14101 attctgtagt tctgtgaaaa atttttttct tcagaatact aatattttga tgcatgtgta 14161 tcaccttatt ttgattaaat cttttccaat tttgcaaaca ctacagtata ctgcaacaca 14221 aaagatttta tctgtaaaca caaagccctt catctaatat ttgttgctat tgccaatttt 14281 tcaatgaaat gacctaaaaa caacaaaaaa aaataaccta tacggtagtt gctttagggg 14341 gtggggggat gctatctgtt agtgcttaaa agggggtaaa tgcttgccgc tttagaggtg 14401 gatggtgctc ataaaaggcc ccagtcgggg gtatttaaaa aggactgaac agaaatcctt 14461 agctagtaga atggcagcac gctgtaaaat tattactgta ttgtgtactg gctataagat 14521 gtagacacct ttcagtaagc caatcatttg taaccattct agcagtgtca tattaggtta 14581 ataaggctgc tgtgttttaa agggcatttt tatttgggtt ttggtgaaat tctttaattt 14641 gttgattata ttcacataaa atcagcattc attgacacat agctctaatg acatatgtat 14701 gaaaaaccat acactggatg acctagtcga ttatttaagc ataaaataaa ttgtgttaaa 14761 ctcttcacct // LOCUS HSU13660 2323 bp mRNA PRI 10-DEC-1994 DEFINITION Human cartilage-derived morphogenetic protein 1 (CDMP-1) mRNA, complete cds. ACCESSION U13660 NID g600731 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2323) AUTHORS Chang,S., Hoang,B., Thomas,J.T., Vukicevic,S., Luyten,F.P., Ryba,N.J.P., Kozak,C.A., Reddi,A.H. and Moos,M. TITLE Cartilage-derived morphogenetic proteins: New members of the TGF-superfamily predominantly expressed in long bones during human embryonic development JOURNAL J. Biol. Chem. 269, 28227-28234 (1994) MEDLINE 95050604 REFERENCE 2 (bases 1 to 2323) AUTHORS Moos,M. TITLE Direct Submission JOURNAL Submitted (16-AUG-1994) Malcolm Moos, FDA/CBER, 1401 Rockville Pike (HFM-527), Rockville, MD 20852-1448, USA FEATURES Location/Qualifiers source 1..2323 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="articular cartilage" /dev_stage="adolescent" gene 263..1768 /gene="CDMP-1" CDS 263..1768 /gene="CDMP-1" /codon_start=1 /product="cartilage-derived morphogenetic protein 1 precursor" /db_xref="PID:g600732" /translation="MRLPKLLTFLLWYLAWLDLEFICTVLGAPDLGQRPQGSRPGLAK AEAKERPPLARNVFRPGGHSYGGGATNANARAKGGTGQTGGLTQPKKDEPKKLPPRPG GPEPKPGHPPQTRQATARTVTPKGQLPGGKAPPKAGSVPSSFLLKKAREPGPPREPKE PFRPPPITPHEYMLSLYRTLSDADRKGGNSSVKLEAGLANTITSFIDKGQDDRGPVVR KQRYVFDISALEKDGLLGAELRILRKKPSDTAKPAVPRSRRAAQLKLSSCPSGRQPAA LLDVRSVPGLDGSGWEVFDIWKLFRNFKNSAQLCLELEAWERGRTVDLRGLGFDRAAR QVHEKALFLVFGRTKKRDLFFNEIKARSGQDDKTVYEYLFSQRRKRRAPSATRQGKRP SKNLKARCSRKALHVNFKDMGWDDWIIAPLEYEAFHCEGLCEFPLRSHLEPTNHAVIQ TLMNSMDPESTPPTCCVPTRLSPISILFIDSANNVVYKQYEDMVVESCGCR" sig_peptide 266..319 /gene="CDMP-1" /note="putative" misc_feature 320..1393 /gene="CDMP-1" /note="encodes Pro region" misc_feature 827..835 /gene="CDMP-1" /note="encodes putative N-linked glycosylation site" misc_feature 1394..1405 /gene="CDMP-1" /note="encodes putative RXXR proteolysis site" mat_peptide 1406..1765 /gene="CDMP-1" /product="cartilage-derived morphogenetic protein 1" polyA_site 2323 /note="16 A residues" BASE COUNT 507 a 688 c 682 g 446 t ORIGIN 1 tcaagaacga gttattttca gctgctgact ggagacggtg cacgtctgga tacgagagca 61 tttccactat gggactggat acaaacacac acccggcaga cttcaagagt ttcagactga 121 ggagaaaacc tttccttctg ctgctactgc tgctgccgct gcttttgaaa gtccactcct 181 ttcatggttt ttcctgccaa accagaggca ccttcgctgc tgccgctgtt ctctttggtg 241 tcattcagcg gctggccaga ggatgagact ccccaaactc ctcactttct tgctttggta 301 cctggcttgg ctggacctgg aattcatctg cactgtgttg ggtgcccctg acttgggcca 361 gagaccccag gggtccaggc caggattggc caaagcagag gccaaggaga ggccccccct 421 ggcccggaac gtcttcaggc cagggggtca cagctatggt gggggggcca ccaatgccaa 481 tgccagggca aagggaggca ccgggcagac aggaggcctg acacagccca agaaggatga 541 acccaaaaag ctgcccccca gaccgggcgg ccctgaaccc aagccaggac accctcccca 601 aacaaggcag gctacagccc ggactgtgac cccaaaagga cagcttcccg gaggcaaggc 661 acccccaaaa gcaggatctg tccccagctc cttcctgctg aagaaggcca gggagcccgg 721 gcccccacga gagcccaagg agccgtttcg cccacccccc atcacacccc acgagtacat 781 gctctcgctg tacaggacgc tgtccgatgc tgacagaaag ggaggcaaca gcagcgtgaa 841 gttggaggct ggcctggcca acaccatcac cagctttatt gacaaagggc aagatgaccg 901 aggtcccgtg gtcaggaagc agaggtacgt gtttgacatt agtgccctgg agaaggatgg 961 gctgctgggg gccgagctgc ggatcttgcg gaagaagccc tcggacacgg ccaagccagc 1021 ggtcccccgg agccggcggg ctgcccagct gaagctgtcc agctgcccca gcggccggca 1081 gccggccgcc ttgctggatg tgcgctccgt gccaggcctg gacggatctg gctgggaggt 1141 gttcgacatc tggaagctct tccgaaactt taagaactcg gcccagctgt gcctggagct 1201 ggaggcctgg gaacggggca ggaccgtgga cctccgtggc ctgggcttcg accgcgccgc 1261 ccggcaggtc cacgagaagg ccctgttcct ggtgtttggc cgcaccaaga aacgggacct 1321 gttctttaat gagattaagg cccgctctgg ccaggacgat aagaccgtgt atgagtacct 1381 gttcagccag cggcgaaaac ggcgggcccc atcggccact cgccagggca agcgacccag 1441 caagaacctt aaggctcgct gcagtcggaa ggcactgcat gtcaacttca aggacatggg 1501 ctgggacgac tggatcatcg caccccttga gtacgaggct ttccactgcg aggggctgtg 1561 cgagttccca ttgcgctccc acctggagcc cacgaatcat gcagtcatcc agaccctgat 1621 gaactccatg gaccccgagt ccacaccacc cacctgctgt gtgcccacgc ggctgagtcc 1681 catcagcatc ctcttcattg actctgccaa caacgtggtg tataagcagt atgaggacat 1741 ggtcgtggag tcgtgtggct gcaggtagca gcactggccc tctgtcttcc tgggtggcac 1801 atcccaagag ccccttcctg cactcctgga atcacagagg ggtcaggaag ctgtggcagg 1861 agcatctaca cagcttggtg aagggattca ataagcttgc tcgctctctg agtgtgactt 1921 gggctaaagg ccccctttta tccacaagtt cccctggctg aggattgctg cccgtctgct 1981 gatgtgacca gtggcaggca caggtccagg gagacagact ctgaatggga ctgagtccca 2041 ggaaacagtg ctttccgatg agactcagcc caccatttct cctcacctgg gccttctcag 2101 cctctggact ctcctaagca cctctcagga gagccacagg tgccactgcc tcctcaaatc 2161 acatttgtgc ctggtgactt cctgtccctg ggacagttga gaagctgact gggcaagagt 2221 gggagagaag aggagagggc ttggatagag ttgaggagtg tgaggctgtt agactgttag 2281 atttaaatgt atattgatga gataaaaagc aaaactgtgc cta // LOCUS HSU13666 1438 bp DNA PRI 01-APR-1995 DEFINITION Human G protein-coupled receptor (GPR1) gene, complete cds. ACCESSION U13666 L35539 NID g577412 KEYWORDS G protein-coupled receptor; intronless. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1438) AUTHORS Marchese,A., Docherty,J.M., Nguyen,T., Heiber,M., Cheng,R., Heng,H.H., Tsui,L.C., Shi,X., George,S.R. and O'Dowd,B.F. TITLE Cloning of human genes encoding novel G protein-coupled receptors JOURNAL Genomics 23 (3), 609-618 (1994) MEDLINE 95154831 REFERENCE 2 (bases 1 to 1438) AUTHORS O'Dowd,B.F. TITLE Direct Submission JOURNAL Submitted (15-AUG-1994) Brian F. O'Dowd, Pharmacology, University of Toronto, 8 Taddle Creek Rd., Toronto, Ontario, M5S 1A8, Canada FEATURES Location/Qualifiers source 1..1438 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="15" /map="15q21.6" gene 227..1294 /gene="GPR1" CDS 227..1294 /gene="GPR1" /codon_start=1 /product="G protein-coupled receptor" /db_xref="PID:g577413" /translation="MEDLEETLFEEFENYSYDLDYYSLESDLEEKVQLGVVHWVSLVL YCLAFVLGIPGNAIVIWFTGLKWKKTVTTLWFLNLAIADFIFLLFLPLYISYVAMNFH WPFGIWLCKANSFTAQLNMFASVFFLTVISLDHYIHLIHPVLSHRHRTLKNSLIVIIF IWLLASLIGGPALYFRDTVEFNNHTLCYNNFQKHDPDLTLIRHHVLTWVKFIIGYLFP LLTMSICYLCLIFKVKKRTVLISSRHFWTILVVVVAFVVCWTPYHLFSIWELTIHHNS YSHHVMQAGIPLSTGLAFLNSCLNPILYVLISKKFQARFRSSVAEILKYTLWEVSCSG TVSEQLRNSETKNLCLLETAQ" BASE COUNT 340 a 333 c 278 g 487 t ORIGIN 1 gggctgcagt gagccaaaag catgccattg cactccagct tgggcaacag agtgagaccc 61 tgtctcaaaa aaaagaaaaa ataatactat gtctggtcca taacctgaaa tatttttatc 121 ttcacgttcc ttatcattca ctgaactttt atttttcttt taaaattttt tcctttcttt 181 ttaaatttgc ttctacagat ttcttcattc tccatttagc aaggtcatgg aagatttgga 241 ggaaacatta tttgaagaat ttgaaaacta ttcctatgac ctagactatt actctctgga 301 gtctgatttg gaggagaaag tccagctggg agttgttcac tgggtctccc tggtgttata 361 ttgtttggct tttgttctgg gaattccagg aaatgccatc gtcatttggt tcacggggct 421 caagtggaag aagacagtca ccactctgtg gttcctcaat ctagccattg cggatttcat 481 ttttcttctc tttctgcccc tgtacatctc ctatgtggcc atgaatttcc actggccctt 541 tggcatctgg ctgtgcaaag ccaattcctt cactgcccag ttgaacatgt ttgccagtgt 601 ttttttcctg acagtgatca gcctggacca ctatatccac ttgatccatc ctgtcttatc 661 tcatcggcat cgaaccctca agaactctct gattgtcatt atattcatct ggcttttggc 721 ttctctaatt ggcggtcctg ccctgtactt ccgggacact gtggagttca ataatcatac 781 tctttgctat aacaattttc agaagcatga tcctgacctc actttgatca ggcaccatgt 841 tctgacttgg gtgaaattta tcattggcta tctcttccct ttgctaacaa tgagtatttg 901 ctacttgtgt ctcatcttca aggtgaagaa gcgaacagtc ctgatctcca gtaggcattt 961 ctggacaatt ctggttgtgg ttgtggcctt tgtggtttgc tggactcctt atcacctgtt 1021 tagcatttgg gagctcacca ttcaccacaa tagctattcc caccatgtga tgcaggctgg 1081 aatccccctc tccactggtt tggcattcct caatagttgc ttgaacccca tcctttatgt 1141 cctaattagt aagaagttcc aagctcgctt ccggtcctca gttgctgaga tactcaagta 1201 cacactgtgg gaagtcagct gttctggcac agtgagtgaa cagctcagga actcagaaac 1261 caagaatctg tgtctcctgg aaacagctca ataagttatt acttttccac aaatcagtat 1321 atggcttttt atgtgggtcc tctgactgat gctttcagat taaaattgtt tccaagatag 1381 agagccgact ccactttcat agttattgtt tctggtcaca tatatggcat cacatttt // LOCUS HSU13680 1254 bp mRNA PRI 07-SEP-1994 DEFINITION Human lactate dehydrogenase-C (LDH-C) mRNA, complete cds. ACCESSION U13680 NID g535359 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1254) AUTHORS Wu,K. and Li,S.S.-L. TITLE Human testicular lactate dehydrogenase-C gene: cDNA sequence and putative alternative splicing at the 5' noncoding region JOURNAL J. Genet. Mol. Biol 1, 72-76 (1990) REFERENCE 2 (bases 1 to 1254) AUTHORS Li,S.S.-L. TITLE Direct Submission JOURNAL Submitted (17-AUG-1994) Steven S.-L. Li, National Institute of Environmental Health Science, 111 Alexander Dr., Md D3-05, Research Triangle Park, NC 27709, USA FEATURES Location/Qualifiers source 1..1254 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda GT10 testis library" /chromosome="11" gene 142..1140 /gene="LDH-C" CDS 142..1140 /gene="LDH-C" /EC_number="1.1.1.27" /codon_start=1 /product="lactate dehydrogenase-C" /db_xref="PID:g535360" /translation="MSTVKEQLIEKLIEDDENSQCKITIVGTGAVGMACAISILLKDL ADELALVDVALDKLKGEMMDLQHGSLFFSTSKITSGKDYSVSANSRIVIVTAGARQQE GETRLALVQRNVAIMKSIIPAIVHYSPDCKILVVSNPVDILTYIVWKISGLPVTRVIG SGCNLDSARFRYLIGEKLGVHPTSCHGWIIGEHGDSSVPLWSGVNVAGVALKTLDPKL GTDSDKEHWKNIHKQVIQSAYEIIKLKGYTSWAIGLSVMDLVGSILKNLRRVHPVSTM VKGLYGIKEELFLSIPCVLGRNGVSDVVKINLNSEEEALFKKSAETLWNIQKDLIF" BASE COUNT 352 a 216 c 299 g 387 t ORIGIN 1 cggcaaccgt cgacgggctt agcgcctcaa ctgtcgttgg tgtatttttc tggtgtcact 61 tctgtgcctt ccttcaaagg tggtgctttg tccctgtggg tcatctgtac tgattgcgcc 121 aagaaagcat ttgttctcca aatgtcaact gtcaaggagc agctaattga gaagctaatt 181 gaggatgatg aaaactccca gtgtaaaatt actattgttg gaactggtgc cgtaggcatg 241 gcttgtgcta ttagtatctt actgaaggat ttggctgatg aacttgccct tgttgatgtt 301 gcattggaca aactgaaggg agaaatgatg gatcttcagc atggcagtct tttctttagt 361 acttcaaaga ttacttctgg aaaagattac agtgtatctg caaactccag aatagttatt 421 gtcacagcag gtgcaaggca gcaggaggga gaaactcgcc ttgccctggt ccaacgtaat 481 gtggctataa tgaaatcaat cattcctgcc atagtccatt atagtcctga ttgtaaaatt 541 cttgttgttt caaatccagt ggatattttg acatatatag tctggaagat aagtggctta 601 cctgtaactc gtgtaattgg aagtggttgt aatctagact ctgcccgttt ccgttaccta 661 attggagaaa agttgggtgt ccaccccaca agctgccatg gttggattat tggagaacat 721 ggtgattcta gtgtgccctt atggagtggg gtgaatgttg ctggtgttgc tctgaagact 781 ctggacccta aattaggaac ggattcagat aaggaacact ggaaaaatat ccataaacaa 841 gttattcaaa gtgcctatga aattatcaag ctgaaggggt atacctcttg ggctattgga 901 ctgtctgtga tggatctggt aggatccatt ttgaaaaatc ttaggagagt gcacccagtt 961 tccaccatgg ttaagggatt atatggaata aaagaagaac tctttctcag tatcccttgt 1021 gtcttggggc ggaatggtgt ctcagatgtt gtgaaaatta acttgaattc tgaggaggag 1081 gcccttttca agaagagtgc agaaacactt tggaatattc aaaaggatct aatattttaa 1141 attaaagcct tctaatgttc cactgtttgg agaacagaag atagcaggct gtgtatttta 1201 aattttgaaa gtattttcat tgatcttaaa aaataaaaac aaattggaga cctg // LOCUS HSU13695 3063 bp DNA PRI 23-MAR-1995 DEFINITION Human homolog of yeast mutL (hPMS1) gene, complete cds. ACCESSION U13695 NID g535512 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3063) AUTHORS Nicolaides,N.C., Papadopoulos,N., Liu,B., Wei,Y.-F., Carter,K.C., Ruben,S.M., Rosen,C.A., Haseltine,W.H., Fleischmann,R.D., Fraser,C.M., Adams,M.D., Venter,J.C., Dunlop,M.G., Hamilton,S.R., Petersen,G.M.,de la Chapelle, Vogelstein,B. and Kinzler,K.W. TITLE Mutations of two PMS homologues in hereditary nonpolyposis colon cancer JOURNAL Nature 371 (6492), 75-80 (1994) MEDLINE 94352394 REFERENCE 2 (bases 1 to 3063) AUTHORS Wei,Y.-F. TITLE Direct Submission JOURNAL Submitted (16-AUG-1994) Ying-Fei Wei, Molecular Biology, Human Genome Sciences, Inc., 9620 Medical Center Drive, Rockville, MD 20850, USA FEATURES Location/Qualifiers source 1..3063 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="3" /map="3p2" /tissue_type="gall bladder" /dev_stage="adult" gene 81..2879 /gene="hPMS1" CDS 81..2879 /gene="hPMS1" /note="homolog of yeast mutL gene" /codon_start=1 /function="DNA mismatch repair" /db_xref="PID:g535513" /translation="MKQLPAATVRLLSSSQIITSVVSVVKELIENSLDAGATSVDVKL ENYGFDKIEVRDNGEGIKAVDAPVMAMKYYTSKINSHEDLENLTTYGFRGEALGSICC IAEVLITTRTAADNFSTQYVLDGSGHILSQKPSHLGQGTTVTALRLFKNLPVRKQFYS TAKKCKDEIKKIQDLLMSFGILKPDLRIVFVHNKAVIWQKSRVSDHKMALMSVLGTAV MNNMESFQYHSEESQIYLSGFLPKCDADHSFTSLSTPERSFIFINSRPVHQKDILKLI RHHYNLKCLKESTRLYPVFFLKIDVPTADVDVNLTPDKSQVLLQNKESVLIALENLMT TCYGPLPSTNSYENNKTDVSAADIVLSKTAETDVLFNKVESSGKNYSNVDTSVIPFQN DMHNDESGKNTDDCLNHQISIGDFGYGHCSSEISNIDKNTKNAFQDISMSNVSWENSQ TEYSKTCFISSVKHTQSENGNKDHIDESGENEEEAGLENSSEISADEWSRGNILKNSV GENIEPVKILVPEKSLPCKVSNNNYPIPEQMNLNEDSCNKKSNVIDNKSGKVTAYDLL SNRVIKKPMSASALFVQDHRPQFLIENPKTSLEDATLQIEELWKTLSEEEKLKYEEKA TKDLERYNSQMKRAIEQESQMSLKDGRKKIKPTSAWNLAQKHKLKTSLSNQPKLDELL QSQIEKRRSQNIKMVQIPFSMKNLKINFKKQNKVDLEEKDEPCLIHNLRFPDAWLMTS KTEVMLLNPYRVEEALLFKRLLENHKLPAEPLEKPIMLTESLFNGSHYLDVLYKMTAD DQRYSGSTYLSDPRLTANGFKIKLIPGVSITENYLEIEGMANCLPFYGVADLKEILNA ILNRNAKEVYECRPRKVISYLEGEAVRLSRQLPMYLSKEDIQDIIYRMKHQFGNEIKE CVHGRPFFHHLTYLPETT" BASE COUNT 1100 a 503 c 580 g 880 t ORIGIN 1 ggcacgagtg gctgcttgcg gctagtggat ggtaattgcc tgcctcgcgc tagcagcaag 61 ctgctctgtt aaaagcgaaa atgaaacaat tgcctgcggc aacagttcga ctcctttcaa 121 gttctcagat catcacttcg gtggtcagtg ttgtaaaaga gcttattgaa aactccttgg 181 atgctggtgc cacaagcgta gatgttaaac tggagaacta tggatttgat aaaattgagg 241 tgcgagataa cggggagggt atcaaggctg ttgatgcacc tgtaatggca atgaagtact 301 acacctcaaa aataaatagt catgaagatc ttgaaaattt gacaacttac ggttttcgtg 361 gagaagcctt ggggtcaatt tgttgtatag ctgaggtttt aattacaaca agaacggctg 421 ctgataattt tagcacccag tatgttttag atggcagtgg ccacatactt tctcagaaac 481 cttcacatct tggtcaaggt acaactgtaa ctgctttaag attatttaag aatctacctg 541 taagaaagca gttttactca actgcaaaaa aatgtaaaga tgaaataaaa aagatccaag 601 atctcctcat gagctttggt atccttaaac ctgacttaag gattgtcttt gtacataaca 661 aggcagttat ttggcagaaa agcagagtat cagatcacaa gatggctctc atgtcagttc 721 tggggactgc tgttatgaac aatatggaat cctttcagta ccactctgaa gaatctcaga 781 tttatctcag tggatttctt ccaaagtgtg atgcagacca ctctttcact agtctttcaa 841 caccagaaag aagtttcatc ttcataaaca gtcgaccagt acatcaaaaa gatatcttaa 901 agttaatccg acatcattac aatctgaaat gcctaaagga atctactcgt ttgtatcctg 961 ttttctttct gaaaatcgat gttcctacag ctgatgttga tgtaaattta acaccagata 1021 aaagccaagt attattacaa aataaggaat ctgttttaat tgctcttgaa aatctgatga 1081 cgacttgtta tggaccatta cctagtacaa attcttatga aaataataaa acagatgttt 1141 ccgcagctga catcgttctt agtaaaacag cagaaacaga tgtgcttttt aataaagtgg 1201 aatcatctgg aaagaattat tcaaatgttg atacttcagt cattccattc caaaatgata 1261 tgcataatga tgaatctgga aaaaacactg atgattgttt aaatcaccag ataagtattg 1321 gtgactttgg ttatggtcat tgtagtagtg aaatttctaa cattgataaa aacactaaga 1381 atgcatttca ggacatttca atgagtaatg tatcatggga gaactctcag acggaatata 1441 gtaaaacttg ttttataagt tccgttaagc acacccagtc agaaaatggc aataaagacc 1501 atatagatga gagtggggaa aatgaggaag aagcaggtct tgaaaactct tcggaaattt 1561 ctgcagatga gtggagcagg ggaaatatac ttaaaaattc agtgggagag aatattgaac 1621 ctgtgaaaat tttagtgcct gaaaaaagtt taccatgtaa agtaagtaat aataattatc 1681 caatccctga acaaatgaat cttaatgaag attcatgtaa caaaaaatca aatgtaatag 1741 ataataaatc tggaaaagtt acagcttatg atttacttag caatcgagta atcaagaaac 1801 ccatgtcagc aagtgctctt tttgttcaag atcatcgtcc tcagtttctc atagaaaatc 1861 ctaagactag tttagaggat gcaacactac aaattgaaga actgtggaag acattgagtg 1921 aagaggaaaa actgaaatat gaagagaagg ctactaaaga cttggaacga tacaatagtc 1981 aaatgaagag agccattgaa caggagtcac aaatgtcact aaaagatggc agaaaaaaga 2041 taaaacccac cagcgcatgg aatttggccc agaagcacaa gttaaaaacc tcattatcta 2101 atcaaccaaa acttgatgaa ctccttcagt cccaaattga aaaaagaagg agtcaaaata 2161 ttaaaatggt acagatcccc ttttctatga aaaacttaaa aataaatttt aagaaacaaa 2221 acaaagttga cttagaagag aaggatgaac cttgcttgat ccacaatctc aggtttcctg 2281 atgcatggct aatgacatcc aaaacagagg taatgttatt aaatccatat agagtagaag 2341 aagccctgct atttaaaaga cttcttgaga atcataaact tcctgcagag ccactggaaa 2401 agccaattat gttaacagag agtcttttta atggatctca ttatttagac gttttatata 2461 aaatgacagc agatgaccaa agatacagtg gatcaactta cctgtctgat cctcgtctta 2521 cagcgaatgg tttcaagata aaattgatac caggagtttc aattactgaa aattacttgg 2581 aaatagaagg aatggctaat tgtctcccat tctatggagt agcagattta aaagaaattc 2641 ttaatgctat attaaacaga aatgcaaagg aagtttatga atgtagacct cgcaaagtga 2701 taagttattt agagggagaa gcagtgcgtc tatccagaca attacccatg tacttatcaa 2761 aagaggacat ccaagacatt atctacagaa tgaagcacca gtttggaaat gaaattaaag 2821 agtgtgttca tggtcgccca ttttttcatc atttaaccta tcttccagaa actacatgat 2881 taaatatgtt taagaagatt agttaccatt gaaattggtt ctgtcataaa acagcatgag 2941 tctggtttta aattatcttt gtattatgtg tcacatggtt attttttaaa tgaggattca 3001 ctgacttgtt tttatattga aaaaagttcc acgtattgta gaaaacgtaa ataaactaat 3061 aac // LOCUS HSU13696 2771 bp DNA PRI 23-MAR-1995 DEFINITION Human homolog of yeast mutL (hPMS2) gene, complete cds. ACCESSION U13696 NID g535514 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2771) AUTHORS Nicolaides,N.C., Papadopoulos,N., Liu,B., Wei,Y.-F., Carter,K.C., Ruben,S.M., Rosen,C.A., Haseltine,W.H., Fleischmann,R.D., Fraser,C.M., Adams,M.D., Venter,J.C., Dunlop,M.G., Hamilton,S.R., Petersen,G.M., de la Chapelle, Vogelstein,B. and Kinzler,K.W. TITLE Mutations of two PMS homologues in hereditary nonpolyposis colon cancer JOURNAL Nature 371 (6492), 75-80 (1994) MEDLINE 94352394 REFERENCE 2 (bases 1 to 2771) AUTHORS Wei,Y.-F. TITLE Direct Submission JOURNAL Submitted (16-AUG-1994) Ying-Fei Wei, Molecular Biology, Human Genome Sciences, Inc., 9620 Medical Center Drive, Rockville, MD 20850, USA FEATURES Location/Qualifiers source 1..2771 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="3" /map="3p2" /tissue_type="endometrial tumor" /dev_stage="adult" gene 25..2613 /gene="hPMS2" CDS 25..2613 /gene="hPMS2" /note="homolog of yeast mutL gene" /codon_start=1 /function="DNA mismatch repair" /db_xref="PID:g535515" /translation="MERAESSSTEPAKAIKPIDRKSVHQICSGQVVLSLSTAVKELVE NSLDAGATNIDLKLKDYGVDLIEVSDNGCGVEEENFEGLTLKHHTSKIQEFADLTQVE TFGFRGEALSSLCALSDVTISTCHASAKVGTRLMFDHNGKIIQKTPYPRPRGTTVSVQ QLFSTLPVRHKEFQRNIKKEYAKMVQVLHAYCIISAGIRVSCTNQLGQGKRQPVVCTG GSPSIKENIGSVFGQKQLQSLIPFVQLPPSDSVCEEYGLSCSDALHNLFYISGFISQC THGVGRSSTDRQFFFINRRPCDPAKVCRLVNEVYHMYNRHQYPFVVLNISVDSECVDI NVTPDKRQILLQEEKLLLAVLKTSLIGMFDSDVNKLNVSQQPLLDVEGNLIKMHAADL EKPMVEKQDQSPSLRTGEEKKDVSISRLREAFSLRHTTENKPHSPKTPEPRRSPLGQK RGMLSSSTSGAISDKGVLRPQKEAVSSSHGPSDPTDRAEVEKDSGHGSTSVDSEGFSI PDTGSHCSSEYAASSPGDRGSQEHVDSQEKAPETDDSFSDVDCHSNQEDTGCKFRVLP QPTNLATPNTKRFKKEEILSSSDICQKLVNTQDMSASQVDVAVKINKKVVPLDFSMSS LAKRIKQLHHEAQQSEGEQNYRKFRAKICPGENQAAEDELRKEISKTMFAEMEIIGQF NLGFIITKLNEDIFIVDQHATDEKYNFEMLQQHTVLQGQRLIAPQTLNLTAVNEAVLI ENLEIFRKNGFDFVIDENAPVTERAKLISLPTSKNWTFGPQDVDELIFMLSDSPGVMC RPSRVKQMFASRACRKSVMIGTALNTSEMKKLITHMGEMDHPWNCPHGRPTMRHIANL GVISQN" BASE COUNT 826 a 603 c 664 g 678 t ORIGIN 1 cgaggcggat cgggtgttgc atccatggag cgagctgaga gctcgagtac agaacctgct 61 aaggccatca aacctattga tcggaagtca gtccatcaga tttgctctgg gcaggtggta 121 ctgagtctaa gcactgcggt aaaggagtta gtagaaaaca gtctggatgc tggtgccact 181 aatattgatc taaagcttaa ggactatgga gtggatctta ttgaagtttc agacaatgga 241 tgtggggtag aagaagaaaa cttcgaaggc ttaactctga aacatcacac atctaagatt 301 caagagtttg ccgacctaac tcaggttgaa acttttggct ttcgggggga agctctgagc 361 tcactttgtg cactgagcga tgtcaccatt tctacctgcc acgcatcggc gaaggttgga 421 actcgactga tgtttgatca caatgggaaa attatccaga aaacccccta cccccgcccc 481 agagggacca cagtcagcgt gcagcagtta ttttccacac tacctgtgcg ccataaggaa 541 tttcaaagga atattaagaa ggagtatgcc aaaatggtcc aggtcttaca tgcatactgt 601 atcatttcag caggcatccg tgtaagttgc accaatcagc ttggacaagg aaaacgacag 661 cctgtggtat gcacaggtgg aagccccagc ataaaggaaa atatcggctc tgtgtttggg 721 cagaagcagt tgcaaagcct cattcctttt gttcagctgc cccctagtga ctccgtgtgt 781 gaagagtacg gtttgagctg ttcggatgct ctgcataatc ttttttacat ctcaggtttc 841 atttcacaat gcacgcatgg agttggaagg agttcaacag acagacagtt tttctttatc 901 aaccggcggc cttgtgaccc agcaaaggtc tgcagactcg tgaatgaggt ctaccacatg 961 tataatcgac accagtatcc atttgttgtt cttaacattt ctgttgattc agaatgcgtt 1021 gatatcaatg ttactccaga taaaaggcaa attttgctac aagaggaaaa gcttttgttg 1081 gcagttttaa agacctcttt gataggaatg tttgatagtg atgtcaacaa gctaaatgtc 1141 agtcagcagc cactgctgga tgttgaaggt aacttaataa aaatgcatgc agcggatttg 1201 gaaaagccca tggtagaaaa gcaggatcaa tccccttcat taaggactgg agaagaaaaa 1261 aaagacgtgt ccatttccag actgcgagag gccttttctc ttcgtcacac aacagagaac 1321 aagcctcaca gcccaaagac tccagaacca agaaggagcc ctctaggaca gaaaaggggt 1381 atgctgtctt ctagcacttc aggtgccatc tctgacaaag gcgtcctgag acctcagaaa 1441 gaggcagtga gttccagtca cggacccagt gaccctacgg acagagcgga ggtggagaag 1501 gactcggggc acggcagcac ttccgtggat tctgaggggt tcagcatccc agacacgggc 1561 agtcactgca gcagcgagta tgcggccagc tccccagggg acaggggctc gcaggaacat 1621 gtggactctc aggagaaagc gcctgaaact gacgactctt tttcagatgt ggactgccat 1681 tcaaaccagg aagataccgg atgtaaattt cgagttttgc ctcagccaac taatctcgca 1741 accccaaaca caaagcgttt taaaaaagaa gaaattcttt ccagttctga catttgtcaa 1801 aagttagtaa atactcagga catgtcagcc tctcaggttg atgtagctgt gaaaattaat 1861 aagaaagttg tgcccctgga cttttctatg agttctttag ctaaacgaat aaagcagtta 1921 catcatgaag cacagcaaag tgaaggggaa cagaattaca ggaagtttag ggcaaagatt 1981 tgtcctggag aaaatcaagc agccgaagat gaactaagaa aagagataag taaaacgatg 2041 tttgcagaaa tggaaatcat tggtcagttt aacctgggat ttataataac caaactgaat 2101 gaggatatct tcatagtgga ccagcatgcc acggacgaga agtataactt cgagatgctg 2161 cagcagcaca ccgtgctcca ggggcagagg ctcatagcac ctcagactct caacttaact 2221 gctgttaatg aagctgttct gatagaaaat ctggaaatat ttagaaagaa tggctttgat 2281 tttgttatcg atgaaaatgc tccagtcact gaaagggcta aactgatttc cttgccaact 2341 agtaaaaact ggaccttcgg accccaggac gtcgatgaac tgatcttcat gctgagcgac 2401 agccctgggg tcatgtgccg gccttcccga gtcaagcaga tgtttgcctc cagagcctgc 2461 cggaagtcgg tgatgattgg gactgctctt aacacaagcg agatgaagaa actgatcacc 2521 cacatggggg agatggacca cccctggaac tgtccccatg gaaggccaac catgagacac 2581 atcgccaacc tgggtgtcat ttctcagaac tgaccgtagt cactgtatgg aataattggt 2641 tttatcgcag atttttatgt tttgaaagac agagtcttca ctaacctttt ttgttttaaa 2701 atgaaacctg ctacttaaaa aaaatacaca tcacacccat ttaaaagtga tcttgagaac 2761 cttttcaaac c // LOCUS HSU13737 2635 bp mRNA PRI 14-APR-1995 DEFINITION Human cysteine protease CPP32 isoform alpha mRNA, complete cds. ACCESSION U13737 NID g561665 KEYWORDS cysteine protease; interleukin 1-beta converting enzyme; apoptotic protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2635) AUTHORS Fernandes-Alnemri,T., Litwack,G. and Alnemri,E.S. TITLE CPP32, a novel human apoptotic protein with homology to Caenorhabditis elegans cell death protein Ced-3 and mammalian interleukin-1 beta-converting enzyme JOURNAL J. Biol. Chem. 269 (49), 30761-30764 (1994) MEDLINE 95074098 REFERENCE 2 (bases 1 to 2635) AUTHORS Fernandes-Alnemri,T. TITLE Direct Submission JOURNAL Submitted (17-AUG-1994) Teresa Fernandes-Alnemri, Pharmacology, Thomas Jefferson University, Jefferson Cancer Institute, 233, S. Tenth Street, Philadelphia, PA 19107, USA FEATURES Location/Qualifiers source 1..2635 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="J1.1 EcoRI/XhoI fragment" /clone_lib="Human Jurkat T-lymphocyte lambda Uni-ZAP XR cDNA library" /cell_line="Jurkat" /cell_type="T-lymphocyte" CDS 225..1058 /note="new member of the interleukin 1-beta converting enzyme gene family of cysteine proteases" /codon_start=1 /product="cysteine protease CPP32 isoform alpha" /db_xref="PID:g561666" /translation="MENTENSVDSKSIKNLEPKIIHGSESMDSGISLDNSYKMDYPEM GLCIIINNKNFHKSTGMTSRSGTDVDAANLRETFRNLKYEVRNKNDLTREEIVELMRD VSKEDHSKRSSFVCVLLSHGEEGIIFGTNGPVDLKKITNFFRGDRCRSLTGKPKLFII QACRGTELDCGIETDSGVDDDMACHKIPVDADFLYAYSTAPGYYSWRNSKDGSWFIQS LCAMLKQYADKLEFMHILTRVNRKVATEFESFSFDATFHAKKQIPCIVSMLTKELYFY H" polyA_site 2635 /note="19 A residues" BASE COUNT 815 a 465 c 569 g 786 t ORIGIN 1 gaattcggca cgaggggtgc tattgtgagg cggttgtaga agagtttcgt gagtgctcgc 61 agctcatacc tgtggctgtg tatccgtggc cacagctggt tggcgtcgcc ttgaaatccc 121 aggccgtgag gagttagcga gccctgctca cactcggcgc tctggttttc ggtgggtgtg 181 ccctgcacct gcctcttccc ccattctcat taataaaggt atccatggag aacactgaaa 241 actcagtgga ttcaaaatcc attaaaaatt tggaaccaaa gatcatacat ggaagcgaat 301 caatggactc tggaatatcc ctggacaaca gttataaaat ggattatcct gagatgggtt 361 tatgtataat aattaataat aagaattttc ataaaagcac tggaatgaca tctcggtctg 421 gtacagatgt cgatgcagca aacctcaggg aaacattcag aaacttgaaa tatgaagtca 481 ggaataaaaa tgatcttaca cgtgaagaaa ttgtggaatt gatgcgtgat gtttctaaag 541 aagatcacag caaaaggagc agttttgttt gtgtgcttct gagccatggt gaagaaggaa 601 taatttttgg aacaaatgga cctgttgacc tgaaaaaaat aacaaacttt ttcagagggg 661 atcgttgtag aagtctaact ggaaaaccca aacttttcat tattcaggcc tgccgtggta 721 cagaactgga ctgtggcatt gagacagaca gtggtgttga tgatgacatg gcgtgtcata 781 aaataccagt ggatgccgac ttcttgtatg catactccac agcacctggt tattattctt 841 ggcgaaattc aaaggatggc tcctggttca tccagtcgct ttgtgccatg ctgaaacagt 901 atgccgacaa gcttgaattt atgcacattc ttacccgggt taaccgaaag gtggcaacag 961 aatttgagtc cttttccttt gacgctactt ttcatgcaaa gaaacagatt ccatgtattg 1021 tttccatgct cacaaaagaa ctctattttt atcactaaag aaatggttgg ttggtggttt 1081 tttttagttt gtatgccaag tgagaagatg gtatatttgg tactgtattt ccctctcatt 1141 ttgacctact ctcatgctgc agagggtact ttaagacata ctccttccat caaatagaac 1201 cactatgaag ctacctcaaa cttccagtca ggtagttgca attgaattaa attaggaata 1261 aataaaaatg gatactggtg cagtcattat gagaggcaat gattgttaat ttacagcttt 1321 catgattagc aagttacagt gatgctgtgc tatgaatttt caagtaattg tgaaaaagtt 1381 aaacattgaa gtaatgaatt tttatgatat tccccccact taagactgtg tattctagtt 1441 ttgtcaaact gtagaaatga tgatgtggaa gaacttaggc atctgtgggc atggtcaaag 1501 gctcaaacct ttattttaga attgatatac acggatgact taactgcatt ttagaccatt 1561 tatctgggat tatggttttg tgatgtttgt cctgaacact tttgttgtaa aaaaataata 1621 ataataatgt ttaatattga gaaagaaact aatattttat gtgagagaaa gtgtgagcaa 1681 actaacttga cttttaaggc taaaacttaa cattcataga ggggtggagt tttaactgta 1741 aggtgctaca atgcccctgg atctaccagc ataaatatct tctgatttgt ccctatgcat 1801 atcagttgag cttcatatac cagcaatata tctgaagagc tattatataa aaaccccaaa 1861 ctgttgatta ttagccaggt aatgtgaata aattctatag gaacatatga aaatacaact 1921 taaataataa acagtggaat ataaggaaag caataaatga atgggctgag ctgcctgtaa 1981 cttgagagta gatggtttga gcctgagcag agacatgact cagcctgttc catgaaggca 2041 gagccatgga ccacgcagga agggcctaca gcccatttct ccatacgcac tggtatgtgt 2101 ggatgatgct gccagggcgc catcgccaag taagaaagtg aagcaaatca gaaacttgtg 2161 aagtggaaat gttctaaagg tggtgaggca ataaaaatca tagtactctt tgtagcaaaa 2221 ttcttaagta tgttattttc tgttgaagtt tacaatcaaa ggaaaatagt aatgttttat 2281 actgtttact gaaagaaaaa gacctatgag cacataggac tctagacggc atccagccgg 2341 aggccagagc tgagcactca gcccgggagg caggctccag gcctcagcag gtgcggagcc 2401 gtcactgcac caagtctcac tggctgtcag tatgacattt cacgggagat ttcttgttgc 2461 tcaaaaaatg agctcgcatt tgtcaatgac agtttctttt ttcttactag acctgtaact 2521 tttgtaaata cacacagcat gtaatggtat cttaaagtgt gtttctatgt gacaattttg 2581 tacaaatttg ttattttcca tttttatttc aaaatataca ttcaaactta aaatt // LOCUS HSU13831 405 bp mRNA PRI 21-JUL-1995 DEFINITION Human cellular retinol binding protein II (CRBPII) mRNA, complete cds. ACCESSION U13831 NID g535389 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 405) AUTHORS Loughney,A.D., Kumarendran,M.K., Thomas,E.J. and Redfern,C.P. TITLE Variation in the expression of cellular retinoid binding proteins in human endometrium throughout the menstrual cycle JOURNAL Hum. Reprod. 10 (5), 1297-1304 (1995) MEDLINE 95386663 REFERENCE 2 (bases 1 to 405) AUTHORS Redfern,C.P. TITLE Direct Submission JOURNAL Submitted (19-AUG-1994) Christopher P. Redfern, Medical Molecular Biology Group, University of Newcastle Medical School, Framlington Place, Newcastle upon Tyne, Ne2 4Hh, United Kingdom FEATURES Location/Qualifiers source 1..405 /organism="Homo sapiens" /macronuclear /db_xref="taxon:9606" /clone="pHCRBP2" /cell_line="Caco-2" primer_bind 1..20 /gene="CRBPII" /note="5' PCR amplification primer" CDS 1..405 /gene="CRBPII" /codon_start=1 /function="cytoplasmic binding protein for retinol in enterocytes" /product="cellular retinol binding protein II" /db_xref="PID:g535390" /translation="MTRDQNGTWEMESNENFEGYMKALDIDFATPKIAVRLTQTKVID QDGDNFKTKTTSTFRNYDVDFTVGVEFDEYTKSLDNRHVKALVTWEGDVLVCVQKGEK ENRGWKQWIEGDKLYLELTCGDQVCRQVFKKK" gene 1..405 /gene="CRBPII" primer_bind complement(385..405) /gene="CRBPII" /note="3' PCR amplification primer" BASE COUNT 119 a 74 c 124 g 88 t ORIGIN 1 atgacgaggg accagaatgg aacctgggag atggagagta atgaaaactt tgagggctac 61 atgaaggccc tggatattga ttttgccacc cccaagattg cagtacgtct tactcagacg 121 aaggttattg atcaagatgg tgataacttc aagacaaaaa ccactagcac attccgcaac 181 tatgatgtgg atttcactgt tggagtagag tttgacgagt acacaaagag cctggacaac 241 cggcatgtta aggcactggt cacctgggaa ggtgatgtcc ttgtgtgtgt gcaaaagggg 301 gagaaggaga accgcggctg gaagcagtgg attgaggggg acaagctgta cctggagctg 361 acctgtggtg accaggtgtg ccgtcaagtg ttcaaaaaga agtga // LOCUS HSU13896 3046 bp mRNA PRI 18-OCT-1994 DEFINITION Human homolog of Drosophila discs large protein, isoform 2 (hdlg-2) mRNA, complete cds. ACCESSION U13896 NID g558435 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3046) AUTHORS Lue,R.A., Marfatia,S.M., Branton,D. and Chishti,A.H. TITLE Cloning and characterization of hdlg: the human homologue of the Drosophila discs large tumor suppressor binds to protein 4.1 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91, 9818-9822 (1994) MEDLINE 95024052 REFERENCE 2 (bases 1 to 3046) AUTHORS Lue,R.A. TITLE Direct Submission JOURNAL Submitted (22-AUG-1994) Robert A. Lue, Harvard University, Molecular and Cellular Biology, 16 Divinity Avenue, Cambridge, MA 02138, USA FEATURES Location/Qualifiers source 1..3046 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hdlg-2" /cell_line="BM14" /cell_type="B-cell" CDS 189..2969 /standard_name="hdlg-2" /note="sequences encoding: dlg homology repeats (DHR): DHR1 855..1118, DHR2 1143..1403, DHR3 1563..1826; SH3 Domain 1935..2132; guanylate kinase-related domain 2397..2924" /codon_start=1 /product="homolog of Drosophila discs large protein, isoform 2" /db_xref="PID:g558436" /translation="MPVRKQDTQRALHLLEEYRSKLSQTEDRQLRSSIERVINIFQSN LFQALIDIQEFYEVTLLDNPKCIDRSKPSEPIQPVNTWEISSLPSSTVTSETLPSSLS PSVEKYRYQDEDTPPQEHISPQITNEVIGPELVHVSEKNLSEIENVHGFVSHSHISPI KPTEAVLPSPPTVPVIPVLPVPAENTVILPTIPQANPPPVLVNTDSLETPTYVNGTDA DYEYEEITLERGNSGLGFSIAGGTDNPHIGDDSSIFITKIITGGAAAQDGRLRVNDCI LQVNEVDVRDVTHSKAVEALKEAGSIVRLYVKRRKPVSEKIMEIKLIKGPKGLGFSIA GGVGNQHIPGDNSIYVTKIIEGGAAHKDGKLQIGDKLLAVNNVCLEEVTHEEAVTALK NTSDFVYLKVAKPTSMYMNDGYAPPDITNSSSQPVDNHVSPSSFLGQTPASPARYSPV SKAVLGDDEITREPRKVVLHRGSTGLGFNIVGGEDGEGIFISFILAGGPADLSGELRK GDRIISVNSVDLRAASHEQAAAALKNAGQAVTIVAQYRPEEYSRFEAKIHDLREQMMN SSISSGSGSLRTSQKRSLYVRALFDYDKTKDSGLPSQGLNFKFGDILHVINASDDEWW QARQVTPDGESDEVGVIPSKRRVEKKERARLKTVKFNSKTRDKGQSFNDKRKKNLFSR KFPFYKNKDQSEQETSDADQHVTSNASDSESSYRGQEEYVLSYEPVNQQEVNYTRPVI ILGPMKDRINDDLISEFPDKFGSCVPHTTRPKRDYEVDGRDYHFVTSREQMEKDIQEH KFIEAGQYNNHLYGTSVQSVREVAGKGKHCILDVSGNAIKRLQIAQLYPISIFIKPKS MENIMEMNKRLTEEQARKTFERAMKLEQEFTEHFTAIVQGDTLEDIYNQVKQIIEEQS GSYIWVPAKEKL" exon 672..770 /standard_name="I1" /note="alternatively spliced sequence insertion as determined by PCR" exon 2193..2294 /standard_name="I3" /note="alternatively spliced sequence insertion as determined by PCR" BASE COUNT 975 a 612 c 681 g 778 t ORIGIN 1 gttggaaacg gcactgctga gtgaggttga ggggtgtctc ggtatgtgcg ccttggatct 61 ggtgtaggcg aggtcacgcc tctcttcaga cagcccgagc cttcccggcc tggcgcgttt 121 agttcggaac tgcgggacgc cggtgggcta gggcaaggtg tgtgccctct tcctgattct 181 ggagaaaaat gccggtccgg aagcaagata cccagagagc attgcacctt ttggaggaat 241 atcgttcaaa actaagccaa actgaagaca gacagctcag aagttccata gaacgggtta 301 ttaacatatt tcagagcaac ctctttcagg ctttaataga tattcaagaa ttttatgaag 361 tgaccttact ggataatcca aaatgtatag atcgttcaaa gccgtctgaa ccaattcaac 421 ctgtgaatac ttgggagatt tccagccttc caagctctac tgtgacttca gagacactgc 481 caagcagcct tagccctagt gtagagaaat acaggtatca ggatgaagat acacctcctc 541 aagagcatat ttccccacaa atcacaaatg aagtgatagg tccagaattg gttcatgtct 601 cagagaagaa cttatcagag attgagaatg tccatggatt tgtttctcat tctcatattt 661 caccaataaa gccaacagaa gctgttcttc cctctcctcc cactgtccct gtgatccctg 721 tcctgccagt ccctgctgag aatactgtca tcctacccac cataccacag gcaaatcctc 781 ccccagtact ggtcaacaca gatagcttgg aaacaccaac ttacgttaat ggcacagatg 841 cagattatga atatgaagaa atcacacttg aaaggggaaa ttcagggctt ggtttcagca 901 ttgcaggagg tacggacaac ccacacattg gagatgactc aagtattttc attaccaaaa 961 ttatcacagg gggagcagcc gcccaagatg gaagattgcg ggtcaatgac tgtatattac 1021 aagtaaatga agtagatgtt cgtgatgtaa cacatagcaa agcagttgaa gcgttgaaag 1081 aagcagggtc tattgtacgc ttgtatgtaa aaagaaggaa accagtgtca gaaaaaataa 1141 tggaaataaa gctcattaaa ggtcctaaag gtcttgggtt tagcattgct ggaggtgttg 1201 gaaatcagca tattcctggg gataatagca tctatgtaac caaaataatt gaaggaggtg 1261 cagcacataa ggatggcaaa cttcagattg gagataaact tttagcagtg aataacgtat 1321 gtttagaaga agttactcat gaagaagcag taactgcctt aaagaacaca tctgattttg 1381 tttatttgaa agtggcaaaa cccacaagta tgtatatgaa tgatggctat gcaccacctg 1441 atatcaccaa ctcttcttct cagcctgttg ataaccatgt tagcccatct tccttcttgg 1501 gccagacacc agcatctcca gccagatact ccccagtttc taaagcagta cttggagatg 1561 atgaaattac aagggaacct agaaaagttg ttcttcatcg tggctcaacg ggccttggtt 1621 tcaacattgt aggaggagaa gatggagaag gaatatttat ttcctttatc ttagccggag 1681 gacctgctga tctaagtgga gagctcagaa aaggagatcg tattatatcg gtaaacagtg 1741 ttgacctcag agctgctagt catgagcagg cagcagctgc attgaaaaat gctggccagg 1801 ctgtcacaat tgttgcacaa tatcgacctg aagaatacag tcgttttgaa gctaaaatac 1861 atgatttacg ggagcagatg atgaatagta gtattagttc agggtcaggt tctcttcgaa 1921 ctagccagaa gcgatccctc tatgtcagag ccctttttga ttatgacaag actaaagaca 1981 gtgggcttcc cagtcaggga ctgaacttca aatttggaga tatcctccat gttattaatg 2041 cttctgatga tgaatggtgg caagccaggc aggttacacc agatggtgag agcgatgagg 2101 tcggagtgat tcccagtaaa cgcagagttg agaagaaaga acgagcccga ttaaaaacag 2161 tgaaattcaa ttctaaaacg agagataaag ggcagtcatt caatgacaag cgtaaaaaga 2221 acctcttttc ccgaaaattc cccttctaca agaacaagga ccagagtgag caggaaacaa 2281 gtgatgctga ccagcatgta acttctaatg ccagcgatag tgaaagtagt taccgtggtc 2341 aagaagaata cgtcttatct tatgaaccag tgaatcaaca agaagttaat tatactcgac 2401 cagtgatcat attgggacct atgaaagaca ggataaatga tgacttgatc tcagaatttc 2461 ctgacaaatt tggatcctgt gttcctcata caactagacc aaaacgagat tatgaggtag 2521 atggaagaga ttatcatttt gtgacttcaa gagagcagat ggaaaaagat atccaggaac 2581 ataaattcat tgaagctggc cagtataaca atcatctata tggaacaagt gttcagtctg 2641 tacgagaagt agcaggaaag ggcaaacact gtatccttga tgtgtctgga aatgccataa 2701 agagattaca gattgcacag ctttacccta tctccatttt tattaaaccc aaatccatgg 2761 aaaatatcat ggaaatgaat aagcgtctaa cagaagaaca agccagaaaa acatttgaga 2821 gagccatgaa actggaacag gagtttactg aacatttcac agctattgta cagggggata 2881 cgctggaaga catttacaac caagtgaaac agatcataga agaacaatct ggttcttaca 2941 tctgggttcc ggcaaaagaa aagctatgaa aactcatgtt tctctgtttc tcttttccac 3001 aattccattt tctttggcat ctctttgccc tttcctctgg aaaaaa // LOCUS HSU13948 3842 bp mRNA PRI 22-OCT-1995 DEFINITION Human zinc finger/leucine zipper protein (AF10) mRNA, complete cds. ACCESSION U13948 NID g538276 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3842) AUTHORS Chaplin,T., Ayton,P., Bernard,O.A., Saha,V., Della Valle,V., Hillion,J., Gregorini,A., Lillington,D., Berger,R. and Young,B.D. TITLE A novel class of zinc finger/leucine zipper genes identified from the molecular cloning of the t(10;11) translocation in acute leukemia JOURNAL Blood 85 (6), 1435-1441 (1995) MEDLINE 95195207 REFERENCE 2 (bases 1 to 3842) AUTHORS Saha,V. TITLE Direct Submission JOURNAL Submitted (23-AUG-1994) Vaskar Saha, ICRF Department of Medical Oncology, St. Bartholomews Hospital, 45 Little Britain, London, Ec1A 7Be, UK FEATURES Location/Qualifiers source 1..3842 /organism="Homo sapiens" /isolate="patient A" /db_xref="taxon:9606" /clone="c-3, c-10, c-19" /clone_lib="Jurkatt T-cell leukaemia" /chromosome="11" /map="11q23" /cell_type="myeloblast" gene 184..3267 /gene="AF10" CDS 184..3267 /gene="AF10" /codon_start=1 /product="zinc finger/leucine zipper protein" /db_xref="PID:g538277" /translation="MVSSDRPVSLEDEVSHSMKEMIGGCCVCSDERGWAENPLVYCDG HGCSVAVHQACYGIVQVPTGPWFCRKCESQERAARVRCELCPHKDGALKRTDNGGWAH VVCALYIPEVQFANVSTMEPIVLQSVPHDRYNKTCYICDEQGRESKAATGACMTCNKH GCRQAFHVTCAQFAGLLCEEEGNGADNVQYCGYCKYHFSKLKKSKRGSNRSYDQSLSD SSSHSQDKHHEKEKKKYKEKDKHKQKHKKQPEPSPALVPSLTVTTEKTYTSTSNNSIS GSLKRLEDTTARFTNANFQEVSAHTSSGKDVSETRGSEGKGKKSSAHSSGQRGRKPGG GRNPGTTVSAASPFPQGSFSGTPGSVKSSSGSSVQSPQDFLSFTDSDLRNDSYSHSQQ SSATKDVHKGESGSQEGGVNSFSTLIGLPSTSAVTSQPKSFENSPGDLGNSSLPTAGY KRAQTSGIEEETVKEKKRKGNKQSKHGPGRPKGNKNQENVSHLSVSSASPTSSVASAA GSITSSSLQKSPTLLRNGSLQSLSVGSSPVGSEISMQYRHDGACPTTTFSELLNAIHN DRGDSSTLTKQELKFIGIYNSNDVAVSFPNVVSGSGSSTPVSSSHLPQQSSGHLQQVG ALSPSAVSSAAPAVATTQANTLSGSSLSQAPSHMYGNRSNSSMAALIAQSENNQTDQD LGDNSRNLVGRGSSPRGSLSPRSPVSSLQIRYDQPGNSSLENLPPVAASIEQLLERQW SEGQQFLLEQGTPSDILGMLKSLHQLQVENRRLEEQIKNLTAKKERLQLLNAQLSVPF PTITANPSPSHQIHTFSAQTAPTTDSLNSSKSPHIGNSFLPDNSLPVLNQDLTSSGQS TSSSSALSTPPPAGQSPAQQGSGVSGVQQVNGVTVGALASGMQPVTSTIPAVSAVGGI IGALPGNQLAINGIVGALNGVMQTPVTMSQNPTPLTHTTVPPNATHPMPATLTNSASG LGLLSDQQRQILIHQQQFQQLLNSQQLTPVHRHPHFTQLPPTHFSPSMEIMQVRK" BASE COUNT 1159 a 869 c 853 g 961 t ORIGIN 1 gccctcttga ttatgtgtgc cctctccggg cgcccgcgtt agcggccggg tggaggtggg 61 gagggaagac gctgaggagg aggaggaggc ggaggaggcg gtggagggga ggtgggggga 121 atcagcaagg acatggctcc tgactcctgt gcggaacgtg agtgactgag cggcaaagcc 181 cgaatggtct ctagcgaccg gcccgtgtca ctggaggacg aggtctccca tagtatgaag 241 gagatgattg gaggctgttg cgtttgctca gacgagagag gctgggccga gaacccgctg 301 gtttattgcg acgggcacgg ctgcagcgtc gcggtgcatc aagcttgcta tggcattgtt 361 caagtaccca ctggaccgtg gttttgcagg aaatgtgaat ctcaggagag agcagccaga 421 gtgagatgtg aactttgtcc ccataaggat ggagctttaa aaagaacaga taatgggggt 481 tgggcccatg tggtttgtgc cctgtatatt ccagaggtac aatttgccaa tgtttccaca 541 atggaaccaa ttgttttaca gtctgttccg catgatcgtt ataataagac ttgctacatt 601 tgtgatgaac aaggaagaga aagcaaagca gccactggtg cttgcatgac atgtaataaa 661 catggatgtc gacaggcttt ccatgtaaca tgcgctcagt ttgccggact gctttgtgaa 721 gaagaaggta atggtgccga taatgtccaa tactgtggct actgtaaata ccattttagt 781 aagctgaaaa agagcaaacg gggatctaat aggtcatatg atcaaagttt aagtgattct 841 tcctctcact ctcaggataa acatcatgag aaagagaaaa aaaaatataa agagaaggac 901 aaacacaaac agaaacacaa gaagcagcca gaaccatcac ctgcattggt tccatccttg 961 actgttacta cagaaaaaac ttatacaagc actagcaaca actctatatc tggatcattg 1021 aagcgcttgg aagatactac tgcacgattt acaaatgcaa atttccagga agtctctgca 1081 cacacctcta gtggaaaaga tgtttcagag actagagggt cagagggcaa agggaagaaa 1141 tcttcagctc acagctcagg tcaaagggga agaaagcctg gtggtggaag aaatccagga 1201 acaactgtgt cagcagctag cccttttcct caaggcagtt tttcaggaac tccaggcagt 1261 gtaaagtcat cttctggaag ttcagtgcag tctccccagg atttcctgag ctttacagac 1321 tcagatctgc gtaatgacag ttactctcac tcccaacagt catcagcaac caaagatgta 1381 cataaaggag agtctggaag ccaggaaggg ggggtaaata gttttagtac cttaattggc 1441 ctcccttcaa cctcagctgt tacttcacag cctaaaagct ttgaaaattc acctggagat 1501 ttgggtaatt ccagccttcc tacagcagga tataagcggg ctcaaacttc tggcatagaa 1561 gaagaaactg taaaggaaaa gaaaaggaaa ggaaataaac aaagtaagca tgggcctggc 1621 agacccaaag gaaacaaaaa tcaagagaat gtttctcatc tctcagtttc ttctgcttca 1681 ccaacatcat ctgtagcatc agctgcagga agcataacaa gctctagtct gcagaaatct 1741 cctacattgc tcaggaatgg aagtttacag agcctcagtg ttggctcatc tccagttggt 1801 tcagaaattt ccatgcagta tcggcatgat ggagcttgcc caacaactac gttctcagag 1861 ttgctgaatg caatacacaa cgacagaggt gacagttcta cactaacaaa gcaagaactt 1921 aaattcatag gtatttataa cagcaatgat gtagcagtat cgtttccaaa tgtagtatct 1981 ggctcgggat ctagtactcc tgtctccagc tctcacttac ctcagcagtc ttctgggcat 2041 ttgcaacaag taggagcgct ctctccctca gctgtgtcat ctgcagcccc tgctgttgct 2101 acaactcagg caaatactct atctggatct tctctcagtc aggcaccatc tcatatgtat 2161 ggcaatagat caaattcatc aatggcagct cttatagctc agtctgaaaa caatcaaaca 2221 gatcaagatc ttggagacaa tagccgcaac ctagttggca gaggaagctc accccgagga 2281 agtctctcgc cacgatcccc tgtaagcagc ttacagattc gctatgatca accaggcaac 2341 agcagtttgg aaaatctgcc tccagtagca gccagcatag aacagctttt ggagaggcag 2401 tggagtgaag gacagcaatt tttactagaa cagggtactc ctagtgacat tttaggaatg 2461 ctgaagtcat tacaccaact tcaagttgaa aaccgaagat tagaggaaca aattaaaaac 2521 ttgactgcca aaaaggaacg gcttcagtta ttgaatgcac agctttcagt gccttttcca 2581 acaataacag caaatcctag tccgtctcat caaatacaca cattttcagc acagactgct 2641 cctactactg attccttgaa cagcagtaag agccctcata taggaaacag ctttttacct 2701 gataattctc ttcctgtatt aaatcaggac ttaacctcca gtggacaaag taccagcagc 2761 tcatcagctc tttctacccc acctcctgct gggcagagtc cggctcaaca aggctcagga 2821 gtgagtggag ttcagcaggt caatggcgtg acagtggggg cactagctag tggaatgcag 2881 cctgtaactt ccaccattcc tgccgtgtct gcagtgggtg gaataattgg agctttgcca 2941 ggtaaccaac tggcaattaa tggcattgta ggagctttaa atggggttat gcagactcct 3001 gtcacaatgt cccagaaccc tacccctctc acccacacaa ccgtaccacc taatgcaaca 3061 catccaatgc cagctacact gactaacagt gcctcaggac taggattact ttctgaccag 3121 caacgacaaa tacttattca tcaacagcag tttcagcagt tgttaaattc tcaacagctc 3181 acaccagtac acaggcaccc ccacttcaca cagctaccac caacccattt ctcaccatcc 3241 atggagataa tgcaagtcag aaagtagcaa gacttagtga taaaactggg cctgtagctc 3301 aagagaaaag ttgacacctg agaaacatct agaaattgcc tatcctgctg ttctagcact 3361 tcatctggct gcctttgcag tccttttact acagctatga agaaacgcaa caagaaactc 3421 aatgcacaac aaaggattaa ttgctgcaag gacattcttg taaggctttg attagttttc 3481 ttgttgcttt gttgcactga aatggaattc ccatgcccct accccttacc ccagtttttt 3541 gaacatggaa agaaaattta ataacttttt aaagtgacat aatttacatg caatatgttt 3601 atcaactcaa gaatttaata tagttggtac acaactagtt ttgtttataa attggagatg 3661 caaatagcaa aactaaatac ttgctccatt tacaaactac ttgattttat tgtacaagtt 3721 gaaatatgct cttttgtttg ggttacagta tgcttgctct aagtcaaatt ccaaggaact 3781 aatttcttct cctggagttg cattgattca gtattacaaa tatatagcac atcacctggg 3841 ac // LOCUS HSU14108 1085 bp mRNA PRI 01-NOV-1996 DEFINITION Human Mel-1a melatonin receptor mRNA, complete cds. ACCESSION U14108 NID g602129 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1085) AUTHORS Reppert,S.M., Weaver,D.R. and Ebisawa,T. TITLE Cloning and characterization of a mammalian melatonin receptor that mediates reproductive and circadian responses JOURNAL Neuron 13 (5), 1177-1185 (1994) MEDLINE 95033233 REFERENCE 2 (bases 1 to 1085) AUTHORS Reppert,S.M. TITLE Direct Submission JOURNAL Submitted (29-AUG-1994) Steven M. Reppert, Chronobiology, Children's Service, Massachusetts General Hospital, 32 Fruit St, Boston, MA 02114, USA FEATURES Location/Qualifiers source 1..1085 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male and female" /dev_stage="adult" CDS 33..1085 /note="high-affinity receptor" /codon_start=1 /product="Mel-1a melatonin receptor" /db_xref="PID:g602130" /translation="MQGNGSALPNASQPVLRGDGARPSWLASALACVLIFTIVVDILG NLLVILSVYRNKKLRNAGNIFVVSLAVADLVVAIYPYPLVLMSIFNNGWNLGYLHCQV SGFLMGLSVIGSIFNITGIAINRYCYICHSLKYDKLYSSKNSLCYVLLIWLLTLAAVL PNLRAGTLQYDPRIYSCTFAQSVSSAYTIAVVVFHFLVPMIIVIFCYLRIWILVLQVR QRVKPDRKPKLKPQDFRNFVTMFVVFVLFAICWAPLNFIGLAVASDPASMVPRIPEWL FVASYYMAYFNSCLNAIIYGLLNQNFRKEYRRIIVSLCTARVFFVDSSNDVADRVKWK PSPLMTNNNVVKVDSV" BASE COUNT 226 a 323 c 279 g 257 t ORIGIN 1 atggccctgc ggccgggacg cgaacaggga ccatgcaggg caacggcagc gcgctgccca 61 acgcctccca gcccgtgctc cgcggggacg gcgcgcggcc ctcgtggctg gcgtccgccc 121 tagcctgcgt cctcatcttc accatcgtgg tggacatcct gggcaacctc ctggtcatcc 181 tgtcggtgta tcggaacaag aagctcagga acgcaggaaa catctttgtg gtgagcttag 241 cggtggcaga cctggtggtg gccatttatc cgtacccgtt ggtgctgatg tcgatattta 301 acaacgggtg gaacctgggc tatctgcact gccaagtcag tgggttcctg atgggcctga 361 gcgtcatcgg ctccatattc aacatcaccg gcatcgccat caaccgctac tgctacatct 421 gccacagtct caagtacgac aaactgtaca gcagcaagaa ctccctctgc tacgtgctcc 481 tcatatggct cctgacgctg gcggccgtcc tgcccaacct ccgtgcaggg actctccagt 541 acgacccgag gatctactcg tgcaccttcg cccagtccgt cagctccgcc tacaccatcg 601 ccgtggtggt tttccacttc ctcgtcccca tgatcatagt catcttctgt tacctgagaa 661 tatggatcct ggttctccag gtcagacaga gggtgaaacc tgaccgcaaa cccaaactga 721 aaccacagga cttcaggaat tttgtcacca tgtttgtggt ttttgtcctc tttgccattt 781 gctgggctcc tctgaacttc attggcctgg ccgtggcctc tgaccccgcc agcatggtgc 841 ctaggatccc agagtggctg tttgtggcca gttactacat ggcgtatttc aacagctgcc 901 tcaatgccat tatatacggg ctactgaacc aaaatttcag gaaggaatac aggagaatta 961 tagtctcgct ctgtacagcc agggtgttct ttgtggacag ctctaacgac gtggccgata 1021 gggttaaatg gaaaccgtct ccactgatga ccaacaataa tgtagtaaag gtggactccg 1081 tttaa // LOCUS HSU14187 987 bp mRNA PRI 04-FEB-1995 DEFINITION Human receptor tyrosine kinase ligand LERK-3 (EPLG3) mRNA, complete cds. ACCESSION U14187 NID g642832 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 987) AUTHORS Kozlosky,C.J., Maraskovsky,E., McGrew,J.T., VandenBos,T., Teepe,M., Lyman,S.D., Srinivasan,S., Fletcher,F.A., Gayle,R.B. III., Cerretti,D.P. and Beckmann,M.P. TITLE Ligands for the receptor tyrosine kinases hek and elk: isolation of cDNAs encoding a family of proteins JOURNAL Oncogene 10 (2), 299-306 (1995) MEDLINE 95140419 REFERENCE 2 (bases 1 to 987) AUTHORS Cerretti,D.P. TITLE Direct Submission JOURNAL Submitted (01-SEP-1994) Immunex Corp., 51 University St., Seattle, WA 98101, USA FEATURES Location/Qualifiers source 1..987 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..987 /gene="EPLG3" sig_peptide 58..123 /gene="EPLG3" /note="putative" CDS 58..774 /gene="EPLG3" /note="membrane-bound protein; glycosyl-phosphatidylinositol (GPI) anchored" /codon_start=1 /product="LERK-3" /db_xref="PID:g642833" /translation="MAAAPLLLLLLLVPVPLLPLLAQGPGGALGNRHAVYWNSSNQHL RREGYTVQVNVNDYLDIYCPHYNSSGVGPGAGPGPGGGAEQYVLYMVSRNGYRTCNAS QGFKRWECNRPHAPHSPIKFSEKFQRYSAFSLGYEFHAGHEYYYISTPTHNLHWKCLR MKVFVCCASTSHSGEKPVPTLPQFTMGPNVKINVLEDFEGENPQVPKLEKSISGTSPK REHLPLAVGIAFFLMTFLAS" mat_peptide 124..771 /gene="EPLG3" /note="putative" BASE COUNT 176 a 329 c 323 g 159 t ORIGIN 1 ggagaagccg ggagcgcggg gctcagtcgg ggggcggcgg cggcggcggc tccggggatg 61 gcggcggctc cgctgctgct gctgctgctg ctcgtgcccg tgccgctgct gccgctgctg 121 gcccaagggc ccggaggggc gctgggaaac cggcatgcgg tgtactggaa cagctccaac 181 cagcacctgc ggcgagaggg ctacaccgtg caggtgaacg tgaacgacta tctggatatt 241 tactgcccgc actacaacag ctcgggggtg ggccccgggg cgggaccggg gcccggaggc 301 ggggcagagc agtacgtgct gtacatggtg agccgcaacg gctaccgcac ctgcaacgcc 361 agccagggct tcaagcgctg ggagtgcaac cggccgcacg ccccgcacag ccccatcaag 421 ttctcggaga agttccagcg ctacagcgcc ttctctctgg gctacgagtt ccacgccggc 481 cacgagtact actacatctc cacgcccact cacaacctgc actggaagtg tctgaggatg 541 aaggtgttcg tctgctgcgc ctccacatcg cactccgggg agaagccggt ccccactctc 601 ccccagttca ccatgggccc caatgtgaag atcaacgtgc tggaagactt tgagggagag 661 aaccctcagg tgcccaagct tgagaagagc atcagcggga ccagccccaa acgggaacac 721 ctgcccctgg ccgtgggcat cgccttcttc ctcatgacgt tcttggcctc ctagctctgc 781 cccctcccct ggggggggag agatggggcg gggcttggaa ggagcaggga gcctttggcc 841 tctccaaggg aagcctagtg ggcctagacc cctcctccca tggctagaag tggggcctgc 901 accatacatc tgtgtccgcc ccctctaccc cttcccccca cgtagggcac tgtagtggac 961 caagcacggg gacagccatg ggtcccg // LOCUS HSU14188 636 bp mRNA PRI 04-FEB-1995 DEFINITION Human receptor tyrosine kinase LERK-4 (EPLG4) mRNA, complete cds. ACCESSION U14188 NID g642834 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 636) AUTHORS Kozlosky,C.J., Maraskovsky,E., McGrew,J.T., VandenBos,T., Teepe,M., Lyman,S.D., Srinivasan,S., Fletcher,F.A., Gayle,R.B. III., Cerretti,D.P. and Beckmann,M.P. TITLE Ligands for the receptor tyrosine kinases hek and elk: isolation of cDNAs encoding a family of proteins JOURNAL Oncogene 10 (2), 299-306 (1995) MEDLINE 95140419 REFERENCE 2 (bases 1 to 636) AUTHORS Cerretti,D.P. TITLE Direct Submission JOURNAL Submitted (01-SEP-1994) Immunex Corp., 51 University St., Seattle, WA 98101, USA FEATURES Location/Qualifiers source 1..636 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..636 /gene="EPLG4" sig_peptide 28..93 /gene="EPLG4" /note="putative" CDS 28..633 /gene="EPLG4" /note="membrane-bound protein; glycosyl-phosphatidylinositol (GPI) anchored" /codon_start=1 /product="LERK-4" /db_xref="PID:g642835" /translation="MRLLPLLRTVLWAAFLGSPLRGGSSLRHVVYWNSSNPRLLRGDA VVELGLNDYLDIVCPHYEGPGPPEGPETFALYMVDWPGYESCQAEGPRAYKRWVCSLP FGHVQFSEKIQRFTPFSLGFEFLPGETYYYISVPTPESSGQCLRLQVSVCCKERKSES AHPVGSPGESGTSGWRGGDTPSPLCLLLLLLLLILRLLRIL" mat_peptide 94..630 /gene="EPLG4" /note="putative" BASE COUNT 102 a 202 c 186 g 146 t ORIGIN 1 gccagaccaa accggacctc gggggcgatg cggctgctgc ccctgctgcg gactgtcctc 61 tgggccgcgt tcctcggctc ccctctgcgc gggggctcca gcctccgcca cgtagtctac 121 tggaactcca gtaaccccag gttgcttcga ggagacgccg tggtggagct gggcctcaac 181 gattacctag acattgtctg cccccactac gaaggcccag ggccccctga gggccccgag 241 acgtttgctt tgtacatggt ggactggcca ggctatgagt cctgccaggc agagggcccc 301 cgggcctaca agcgctgggt gtgctccctg ccctttggcc atgttcaatt ctcagagaag 361 attcagcgct tcacaccttt ctccctcggc tttgagttct tacctggaga gacttactac 421 tacatctcgg tgcccactcc agagagttct ggccagtgct tgaggctcca ggtgtctgtc 481 tgctgcaagg agaggaagtc tgagtcagcc catcctgttg ggagccctgg agagagtggc 541 acatcagggt ggcgaggggg ggacactccc agccccctct gtctcttgct attactgctg 601 cttctgattc ttcgtcttct gcgaattctg tgagcc // LOCUS HSU14391 4666 bp mRNA PRI 07-MAR-1995 DEFINITION Human myosin-IC mRNA, complete cds. ACCESSION U14391 NID g557467 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1109) AUTHORS Bement,W.M., Wirth,J.A. and Mooseker,M.S. TITLE Cloning and mRNA expression of human unconventional myosin-IC. A homologue of amoeboid myosins-I with a single IQ motif and an SH3 domain JOURNAL J. Mol. Biol. 243 (2), 356-363 (1994) MEDLINE 95018277 REFERENCE 2 (bases 1 to 4666) AUTHORS Bement,W.M. TITLE Direct Submission JOURNAL Submitted (03-SEP-1994) William M. Bement, Zoology, University of Wisconsin, Madison, 1117 West Johnson Street, Madison, WI 53706, USA FEATURES Location/Qualifiers source 1..4666 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 376..3705 /codon_start=1 /product="myosin-IC" /db_xref="PID:g557468" /translation="MGSKGVYQYHWQSHNVKHSGVDDMVLLSKITENSIVENLKKRYM DDYIFTYIGSVLISVNPFKQMPYFGEKEIEMYQGAAQYENPPHIYALADNMYRNMIID RENQCVIISGESGAGKTVAAKYIMSYISRVSGGGTKVQHVKDIILQSNPLLEAFGNAK TVRNNNSSRFGKYFEIQFSPGGEPDGGKISNFLLEKSRVVMRNPGERSFHIFYQLIEG ASAEQKHSLGITSMDYYYYLSLSGSYKVDDIDDRREFQETLHAMNVIGIFAEEQTLVL QIVAGILHLGNISFKEVGNYAAVESEEFLAFPAYLLGINQDRLKEKLTSRQMDSKWGG KSESIHVTLNVEQACYTRDALAKALHARVFDFLVDSINKAMEKDHEEYNIGVLDIYGF EIFQKNGFEQFCINFVNEKLQQIFIELTLKAEQEEYVQEGIRWTPIEYFNNKIVCDLI ENKVNPPGIMSILDDVCATMHAVGEGADQTLLQKLQMQIGSHEHFNSWNQGFIIHHYA GKVSYDMDGFCERNRDVLFMDLIELMQSSELPFIKSLFPENLQADKKGRPTTAGSKIK KQANDLVSTLMKCTPHYIRCIKPNETKKPRDWEESRVKHQVEYLGLKENIRVRRAGYA YRRIFQKFLQRYAILTKATWPSWQGEEKQGVLHLLQSVNMDSDQFQLGRSKVFIKAPE SLFLLEEMRERKYDGYARVIQKSWRKFVARKKYVQMREEASDLLLNKKERRRNSINRN FIGDYIGMEEHPELQQFVGKREKIDFADTVTKYDRRFKGVKRDLLLTPKCLYLIGREK VKQGPDKGLVKEVLKRKIEIERILSVSLSTMQDDIFILHEQEYDSLLESVFKTEFLSL LAKRYEEKTQKQLPLKFSNTLELKLKKENWGPGVQGAGSRQVQFHQGFGDLAVLKPSN KVLQVSIGPGLPKNSRPTRRNTTQNTGYSSGTQNANYPVRAAPPPPGYHQNGVIRNQY VPYPHAPGSQRSIQKSLYTSMARPPLPRQQSTSSDRVSQTPESLDFLKVPDQGAAGVR RQTTSRPPPAGGRPKPQPKPKPQVPQCKALYAYDAQDTDELSFNANDIIDIIKEDPSG WWTGRLRGKPGLFPNNYVTKI" BASE COUNT 1309 a 1114 c 1213 g 1030 t ORIGIN 1 cgagactcag ccagaggagt acggattaag gggccgaaga aagaggagtg aggcagggga 61 gcgaggcgga cgctccgagc gcatcggact cacctgggcc ggcggcgagc ggcgagtacg 121 agtggactct tgccctccgc agcatgcgcc ccgcctgcta agtcgccgcc gtctcgccct 181 cacctagcac cgccaatccc gcctggatgg agtcccctct agggttccca ttggctcctg 241 tgagtgcttg ggtgttcggg agcctgtttt tggcggaggg gaaccaactt ttgaagttcg 301 cccagaagac tgcgttccag ccccagttcc ccggcagcgg gacccggcga agacgcgacc 361 gcggcgcgag tcaccatggg aagcaaaggt gtctaccagt accactggca aagccacaat 421 gtcaagcaca gtggtgtgga cgacatggtg ctactgtcca agatcacaga gaactccatc 481 gtggagaatc tgaagaagag atacatggat gactacattt ttacatatat aggatctgta 541 ttaatctcag tcaacccttt caagcagatg ccatattttg gggaaaagga aattgaaatg 601 taccaaggag cggcacagta tgaaaaccca ccacatatct atgcccttgc agataatatg 661 tacagaaaca tgatcattga cagagagaac cagtgcgtca ttatcagtgg tgaaagtggt 721 gctggaaaaa cagtggctgc caaatatatc atgagctaca tctccagagt gtctggagga 781 gggaccaaag tccagcacgt gaaggacatt atcctgcagt ccaacccgct gctggaggcc 841 ttcgggaacg ccaagaccgt ccggaacaac aactccagcc gatttggaaa atactttgaa 901 atccagttca gtccaggtgg ggaaccagat ggtggaaaga tctccaactt ccttctggaa 961 aaatctaggg tggtgatgag gaacccagga gagcggagtt ttcacatatt ttaccagctc 1021 atcgagggcg cctctgcaga gcagaaacac agccttggca tcaccagcat ggactattat 1081 tactacctga gcctctcggg ctcatacaag gttgatgaca ttgacgacag gcgggagttt 1141 caggaaactc tgcacgccat gaatgtgatt gggatctttg cagaagagca aacgctggtg 1201 ttgcagatag tggcgggtat tctccacctg ggaaacatca gcttcaaaga agttggcaac 1261 tacgcggctg tggagagtga agagttttta gcttttcctg catatctgct agggataaac 1321 caggaccggt tgaaagaaaa gctaacaagc cggcagatgg atagcaagtg gggaggcaaa 1381 tccgaatcca tccacgtgac cctcaacgta gagcaggcct gttacacccg ggatgcgctc 1441 gccaaggccc tgcacgcccg ggtctttgat ttcttggtag attccatcaa taaagccatg 1501 gagaaagacc atgaagaata caacattggc gtcctagaca tctatggctt tgaaatattc 1561 cagaaaaatg gctttgaaca gttttgtatc aattttgtta atgaaaaact gcagcagatt 1621 tttattgaac tgacattaaa ggcagaacag gaagaatatg ttcaagaggg aataagatgg 1681 acacccattg agtactttaa taataaaatc gtatgtgacc tcatagagaa caaagtgaac 1741 cctcctggca tcatgagcat cctggatgac gtgtgcgcca cgatgcatgc ggtgggtgag 1801 ggggcagatc agacgctgct ccagaaactt cagatgcaga ttgggagtca tgagcacttc 1861 aacagttgga accaaggctt catcattcat cattatgctg ggaaggtatc ctatgacatg 1921 gatggctttt gtgaaaggaa ccgggatgtg ctttttatgg atctcatcga gcttatgcag 1981 agcagcgagc tgcctttcat aaagtcttta tttccggaaa atctgcaggc tgacaagaaa 2041 gggcgcccaa ctactgccgg aagcaaaata aagaaacaag ccaatgacct tgtgagcacc 2101 ctgatgaaat gtacgcccca ctacattcgc tgcatcaagc caaacgaaac caagaagccc 2161 agagactggg aggaaagcag ggtaaagcat caagtcgaat atttgggtct gaaagagaac 2221 attcgagtga gaagagctgg ctatgcctat cggcgcatct tccaaaaatt cctacagagg 2281 tatgccattc tgaccaaagc cacctggcct tcttggcagg gagaggagaa gcaaggcgtc 2341 ctgcacctgc tgcagtcggt caacatggac agcgaccagt tccagctggg gaggagtaaa 2401 gtgttcatca aagcccccga gtctctattt cttttagaag agatgagaga gagaaagtat 2461 gatgggtatg ctcgagtgat acagaaatca tggaggaaat tcgtggcccg gaagaaatac 2521 gttcaaatga gagaagaagc ctcagacctc ttattgaaca agaaggagag aaggagaaac 2581 agtattaaca ggaactttat aggggattat attgggatgg aagagcaccc agaactccag 2641 cagttcgtgg gcaagaggga gaagattgat ttcgcagaca cagtcaccaa gtatgacagg 2701 aggttcaagg gtgtaaagcg agacctgctc cttaccccaa agtgcttgta cttaatcgga 2761 cgagaaaaag tcaaacaggg cccagacaag ggcctggtga aagaagtcct gaagcggaaa 2821 atcgagatag aacggatctt gtctgtgtcc ctcagtacta tgcaggatga catttttatt 2881 ctccatgagc aagagtatga cagtttgctt gaatctgtct tcaaaactga attcctaagc 2941 ctcttagcaa agcgttacga ggagaagacc cagaagcaac tacctctgaa attcagcaat 3001 acgcttgaac tgaagttgaa aaaggaaaac tggggccctg gagtgcaggg tgcgggctcc 3061 cggcaagtgc agttccacca agggtttggg gacctggctg tcctcaagcc cagtaacaaa 3121 gtgctgcagg tcagcatcgg acctggactg cccaagaact cccgtcctac cagaaggaac 3181 actacccaaa atacaggtta ttccagtggg actcaaaatg ccaactaccc agtgagagct 3241 gcccctcctc ccccaggata ccatcagaac ggagtcatca gaaaccagta tgtgccatat 3301 ccccatgctc ctggaagcca gaggtccatt cagaaaagcc tgtacacctc catggcccgc 3361 ccgcccttgc ctcggcagca gtctaccagt tcagaccgag tgtcacagac gccagagagc 3421 ctggatttcc tcaaggtccc ggaccaggga gctgcagggg tcaggagaca aacaaccagt 3481 cggcctcccc cagctggggg cagacccaag ccccagccca agcccaagcc tcaggttcca 3541 cagtgcaagg ctttgtatgc ctatgacgct caggacacag acgaactcag ctttaatgcc 3601 aatgacatta ttgatattat caaagaagat ccttctggct ggtggacggg tcgactacga 3661 ggcaagccag ggctgttccc caacaactat gtgaccaaga tctgaggtgc ccgtgactct 3721 gacacatggg gcagaggagc tccaggcaca gaccaggtga ggggatattt aggtgctccc 3781 cttacaatcc acaatgagca attgcttctc caaggcctgg agctattctg gtaccttccc 3841 catggaggac actgaaaagg ctgggttggg gacagggagt atcactccat aagtgatcct 3901 aaaaggtagc ctcttcatag gaaccaggag gacaaaacca ccatgcatta agatttattt 3961 attgtattta aacctggtga gaggacaagt gaggtctgct cagaccttgt aggcttctat 4021 caaaacagca ccctgcttgc tcaccaggcc tagagaatgg ctgtaggtgg cggctgagaa 4081 gtgcctttag ttgaagagca catttctttc atctctcttg tctatacctg atagacacat 4141 tcctctctgc caccttcctt cagggaggac ccgccctctg cagactgggc ttagcgtgag 4201 caggcacttc ccatgtacgt gccaagggta agctggcctg ctgagcccag ggcgacagag 4261 gggcactggt ttacactttg ccgggaccat cagggccgcc aagcaggtca ggggctgggg 4321 gctggtcctg ctggctttga tttctctggg tcttcaatta gaatgtggct gggcccatat 4381 tggtttgtgt taaatgctgt acttactaca agaatctttt ttcaagcatg tacatttata 4441 aaaacaggat catatactgt atatataaaa atcttgagat ggtagaaaca tgtatgaatg 4501 tactaagtag tattccactg tactcattca taaggtaggt tttcttacaa aactcacacc 4561 aggtacttaa agatgtgctc tgcttttttc caactacgga gtgtcactgc tttctaggtc 4621 agtccctgca gactcttctc aactctttcc ctataggaaa cttact // LOCUS HSU14407 1202 bp mRNA PRI 21-SEP-1994 DEFINITION Human interleukin 15 (IL15) mRNA, complete cds. ACCESSION U14407 NID g540098 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1202) AUTHORS Grabstein,K.K., Eisenman,J., Sheanebeck,K., Rauch,C., Srinivasan,S., Fung,V., Beers,C., Richardson,J., Schoenborn,M.A., Ahdieh,M., Johnson,L., Alderson,M.R., Watson,J.D., Anderson,D.M. and Giri,J.G. TITLE Cloning of a T cell growth factor that interacts with the beta chain of the interleukin-2 receptor JOURNAL Science 264 (5161), 965-968 (1994) MEDLINE 94233380 REFERENCE 2 (bases 1 to 1202) AUTHORS Anderson,D.M. TITLE Direct Submission JOURNAL Submitted (06-SEP-1994) Dirk M. Anderson, Immunex Research and Development Corp., 51, University St., Seattle, WA 98101, USA FEATURES Location/Qualifiers source 1..1202 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="IMTLH" /cell_type="stromal" /tissue_type="Bone marrow" sig_peptide 314..460 /evidence=experimental gene 317..805 /gene="IL15" CDS 317..805 /gene="IL15" /note="cytokine; T cell growth factor; secreted protein" /codon_start=1 /product="Interleukin 15" /db_xref="PID:g540099" /translation="MRISKPHLRSISIQCYLCLLLNSHFLTEAGIHVFILGCFSAGLP KTEANWVNVISDLKKIEDLIQSMHIDATLYTESDVHPSCKVTAMKCFLLELQVISLES GDASIHDTVENLIILANNSLSSNGNVTESGCKECEELEEKNIKEFLQSFVHIVQMFIN TS" mat_peptide 461..802 /gene="IL15" /note="putative" misc_feature 671..678 /gene="IL15" /note="encodes N-glycosylation site" misc_feature 695..703 /gene="IL15" /note="encodes N-glycosylation site" misc_feature 794..802 /gene="IL15" /note="encodes N-glycosylation site" BASE COUNT 355 a 219 c 249 g 379 t ORIGIN 1 tgtccggcgc cccccgggag ggaactgggt ggccgcaccc tcccggctgc ggtggctgtc 61 gccccccacc ctgcagccag gactcgatgg agaatccatt ccaatatatg gccatgtggc 121 tctttggagc aatgttccat catgttccat gctgctgctg acgtcacatg gagcacagaa 181 atcaatgtta gcagatagcc agcccataca agatcgtatt gtattgtagg aggcatcgtg 241 gatggatggc tgctggaaac cccttgccat agccagctct tcttcaatac ttaaggattt 301 accgtggctt tgagtaatga gaatttcgaa accacatttg agaagtattt ccatccagtg 361 ctacttgtgt ttacttctaa acagtcattt tctaactgaa gctggcattc atgtcttcat 421 tttgggctgt ttcagtgcag ggcttcctaa aacagaagcc aactgggtga atgtaataag 481 tgatttgaaa aaaattgaag atcttattca atctatgcat attgatgcta ctttatatac 541 ggaaagtgat gttcacccca gttgcaaagt aacagcaatg aagtgctttc tcttggagtt 601 acaagttatt tcacttgagt ccggagatgc aagtattcat gatacagtag aaaatctgat 661 catcctagca aacaacagtt tgtcttctaa tgggaatgta acagaatctg gatgcaaaga 721 atgtgaggaa ctggaggaaa aaaatattaa agaatttttg cagagttttg tacatattgt 781 ccaaatgttc atcaacactt cttgattgca attgattctt tttaaagtgt ttctgttatt 841 aacaaacatc actctgctgc ttagacataa caaaacactc ggcatttaaa atgtgctgtc 901 aaaacaagtt tttctgtcaa gaagatgatc agaccttgga tcagatgaac tcttagaaat 961 gaaggcagaa aaatgtcatt gagtaatata gtgactatga acttctctca gacttacttt 1021 actcattttt ttaatttatt attgaaattg tacatatttg tggaataatg taaaatgttg 1081 aataaaaata tgtacaagtg ttgtttttta agttgcactg atattttacc tcttattgca 1141 aaatagcatt tgtttaaggg tgatagtcaa attatgtatt ggtggggctg ggtaccaatg 1201 ct // LOCUS HSU14510 3680 bp mRNA PRI 31-JAN-1996 DEFINITION Human transcription factor NFATx mRNA, complete cds. ACCESSION U14510 NID g780373 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3680) AUTHORS Masuda,E.S., Naito,Y., Tokumitsu,H., Campbell,D., Saito,F., Hannum,C., Arai,K. and Arai,N. TITLE NFATx, a novel member of the nuclear factor of activated T cells family that is expressed predominantly in the thymus JOURNAL Mol. Cell. Biol. 15 (5), 2697-2706 (1995) MEDLINE 95257951 REFERENCE 2 (bases 1 to 3680) AUTHORS Arai,N. TITLE Direct Submission JOURNAL Submitted (07-SEP-1994) Naoko Arai, Molecular Biology, DNAX Research Institute of Molecular and Cellular Biology, 901 California Avenue, Palo Alto, CA 94304-1104, USA FEATURES Location/Qualifiers source 1..3680 /organism="Homo sapiens" /note="in LambdaZipox vector" /db_xref="taxon:9606" /chromosome="16" /map="16q21-q22" /cell_line="Jurkat" /cell_type="T-cell" CDS 25..3252 /codon_start=1 /function="transcription factor" /product="NFATx" /db_xref="PID:g780374" /translation="MTTANCGAHDELDFKLVFGEDGAPAPPPPGSRPADLEPDDCASI YIFNVDPPPSTLTTPLCLPHHGLPSHSSVLSPSFQLQSHKNYEGTCEIPESKYSPLGG PKPFECPSIQITSISPNCHQELDAHEDDLQINDPEREFLERPSRDHLYLPLEPSYRES SLSPSPASSISSRSWFSDASSCESLSHIYDDVDSELNEAAARFTLGSPLTSPGGSPGG CPGEETWHQQYGLGHSLSPRQSPCHSPRSSVTDENWLSPRPASGPSSRPTSPCGKRRH SSAEVCYAGSLSPHHSPVPSPGHSPRGSVTEDTWLNASVHGGSGLGPAVFPFQYCVET DIPLKTRKTSEDQAAILPGKLELCSDDQGSLSPARETSIDDGLGSQYPLKKDSCGDQF LSVPSPFTWSKPKPGHTPIFRTSSLPPLDWPLPAHFGQCELKIEVQPKTHHRAHYETE GSRGAVKASTGGHPVVKLLGYNEKPINLQMFIGTADDRYLRPHAFYQVHRITGKTVAT ASQEIIIASTKVLEIPLLPENNMSASIDCAGILKLRNSDIELRKGETDIGRKNTRVRL VFRVHIPQPSGKVLSLQIASIPVECSQRSAQELPHIEKYSINSCSVNGGHEMVVTGSN FLPESKIIFLEKGQDGRPQWEVEGKIIREKCQGAHIVLEVPPYHNPAVTAAVQVHFYL CNGKRKKSQSQRFTYTPVLMKQEHREEIDLSSVPSLPVPHPAQTQRPSSDSGCSHDSV LSGQRSLICSIPQTYASMVTSSHLPQLQCRDESVSKEQHMIPSPIVHQPFQVTPTPPV GSSYQPMQTNVVYNGPTCLPINAASSQEFDSVLFQQDATLSGLVNLGCQPLSSIPFHS SNSGSTGHLLAHTPHSVHTLPHLQSMGYHCSNTGQRSLSSPVADQITGQPSSQLQPIT YGPSHSGSATTASPAASHPLASSPLSGPPSPQLQPMPYQSPSSGTASSPSPATRMHSG QHSTQAQSTGQGGLSAPSSLICHSLCDPASFPPDGATVSIKPEPEDREPNFATIGLQD ITLDDVNEIIGRDMSQISVSQGAGVSRQAPLPSPESLDLGRSDGL" BASE COUNT 948 a 926 c 781 g 1025 t ORIGIN 1 gctgcagcac cctgggccac gccgatgact actgcaaact gtggcgccca cgacgagctc 61 gacttcaaac tcgtctttgg cgaggacggg gcgccggcgc cgccgccccc gggctcgcgg 121 cctgcagatc ttgagccaga tgattgtgca tccatttaca tctttaatgt agatccacct 181 ccatctactt taaccacacc actttgctta ccacatcatg gattaccgtc tcactcttct 241 gttttgtcac catcgtttca gctccaaagt cacaaaaact atgaaggaac ttgtgagatt 301 cctgaatcta aatatagccc attaggtggt cccaaaccct ttgagtgccc aagtattcaa 361 attacatcta tctctcctaa ctgtcatcaa gaattagatg cacatgaaga tgacctacag 421 ataaatgacc cagaacggga atttttggaa aggccttcta gagatcatct ctatcttcct 481 cttgagccat cctaccggga gtcttctctt agtcctagtc ctgccagcag catctcttct 541 aggagttggt tctctgatgc atcttcttgt gaatcgcttt cacatattta tgatgatgtg 601 gactcagagt tgaatgaagc tgcagcccga tttacccttg gatcccctct gacttctcct 661 ggtggctctc cagggggctg ccctggagaa gaaacttggc atcaacagta tggacttgga 721 cactcattat cacccaggca atctccttgc cactctccta gatccagtgt cactgatgag 781 aattggctga gccccaggcc agcctcagga ccctcatcaa ggcccacatc cccctgtggg 841 aaacggaggc actccagtgc tgaagtttgt tatgctgggt ccctttcacc ccatcactca 901 cctgttcctt cacctggtca ctcccccagg ggaagtgtga cagaagatac gtggctcaat 961 gcttctgtcc atggtgggtc aggccttggc cctgcagttt ttccatttca gtactgtgta 1021 gagactgaca tccctctcaa aacaaggaaa acttctgaag atcaagctgc catactacca 1081 ggaaaattag agctgtgttc agatgaccaa gggagtttat caccagcccg ggagacttca 1141 atagatgatg gccttggatc tcagtatcct ttaaagaaag attcatgtgg tgatcagttt 1201 ctttcagttc cttcaccctt tacctggagc aaaccaaagc ctggccacac ccctatattt 1261 cgcacatctt cattacctcc actagactgg cctttaccag ctcattttgg acaatgtgaa 1321 ctgaaaatag aagtgcaacc taaaactcat catcgagccc attatgaaac tgaaggtagc 1381 cgaggggcag taaaagcatc tactggggga catcctgttg tgaagctcct gggctataac 1441 gaaaagccaa taaatctaca aatgtttatt gggacagcag atgatcgata tttacgacct 1501 catgcatttt accaggtgca tcgaatcact gggaagacag tcgctactgc aagccaagag 1561 ataataattg ccagtacaaa agttctggaa attccacttc ttcctgaaaa taatatgtca 1621 gccagtattg attgtgcagg tattttgaaa ctccgcaatt cagatataga acttcgaaaa 1681 ggagaaactg atattggcag aaagaatact agagtacgac ttgtgtttcg tgtacacatc 1741 ccacagccca gtggaaaagt cctttctctg cagatagcct ctatacccgt tgagtgctcc 1801 cagcggtctg ctcaagaact tcctcatatt gagaagtaca gtatcaacag ttgttctgta 1861 aatggaggtc atgaaatggt tgtgactgga tctaattttc ttccagaatc caaaatcatt 1921 tttcttgaaa aaggacaaga tggacgacct cagtgggagg tagaagggaa gataatcagg 1981 gaaaaatgtc aaggggctca cattgtcctt gaagttcctc catatcataa cccagcagtt 2041 acagctgcag tgcaggtgca cttttatctt tgcaatggca agaggaaaaa aagccagtct 2101 caacgtttta cttatacacc agttttgatg aagcaagaac acagagaaga gattgatttg 2161 tcttcagttc catctttgcc tgtgcctcat cctgctcaga cccagaggcc ttcctctgat 2221 tcagggtgtt cacatgacag tgtactgtca ggacagagaa gtttgatttg ctccatccca 2281 caaacatatg catccatggt gacctcatcc catctgccac agttgcagtg tagagatgag 2341 agtgttagta aagaacagca tatgattcct tctccaattg tacaccagcc ttttcaagtc 2401 acaccaacac ctcctgtggg gtcttcctat cagcctatgc aaactaatgt tgtgtacaat 2461 ggaccaactt gtcttcctat taatgctgcc tctagtcaag aatttgattc agttttgttt 2521 cagcaggatg caactctttc tggtttagtg aatcttggct gtcaaccact gtcatccata 2581 ccatttcatt cttcaaattc aggctcaaca ggacatctct tagcccatac acctcattct 2641 gtgcataccc tgcctcatct gcaatcaatg ggatatcatt gttcaaatac aggacaaaga 2701 tctctttctt ctccagtggc tgaccagatt acaggtcagc cttcgtctca gttacaacct 2761 attacatatg gtccttcaca ttcagggtct gctacaacag cttccccagc agcttctcat 2821 cccttggcta gttcaccgct ttctgggcca ccatctcctc agcttcagcc tatgccttac 2881 caatctccta gctcaggaac tgcctcatca ccgtctccag ccaccagaat gcattctgga 2941 cagcactcaa ctcaagcaca aagtacgggc caggggggtc tttctgcacc ttcatcctta 3001 atatgtcaca gtttgtgtga tccagcgtca tttccacctg atggggcaac tgtgagcatt 3061 aaacctgaac cagaagatcg agagcctaac tttgcaacca ttggtctgca ggacatcact 3121 ttagatgatg tgaacgagat aattgggaga gacatgtccc agatttctgt ttcccaagga 3181 gcaggggtga gcaggcaggc tcccctcccg agtcctgagt ccctggattt aggaagatct 3241 gatgggctct aacagtgctt actgcagcct tgtgtccacc accaacttct cagcatgttt 3301 ctctccttgg accttgggtt tccaactctg cagccttcag gtctggggcc aggagtggga 3361 cccaccattt gtggggaaag tagcattcct ccacctcagg ccttgggtag atttggcaaa 3421 agaacaggag cagcataggc tgtttgagct ttggggaaat gaactttgct ttttatattt 3481 aactaggata cttttatatg atgggtgctt tgagtgtgaa tgcagcaggc tctcttgttt 3541 ccgaggtgct gcttttgcag gtgacctggt tacttagcta ggattggtga tttgtactgc 3601 tttatggtca tttgaagggc cctttagttt ttatgataat ttttaaaata ggaacttttg 3661 ataagacctt ctagaagcaa // LOCUS HSU14518 1388 bp mRNA PRI 22-DEC-1994 DEFINITION Human centromere protein-A (CENP-A) mRNA, complete cds. ACCESSION U14518 NID g602413 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1388) AUTHORS Sullivan,K.F., Hechenberger,M. and Masri,K. TITLE Human CENP-A contains a histone H3 related histone fold domain that is required for targeting to the centromere JOURNAL J. Cell Biol. 127, 581-592 (1994) MEDLINE 95050934 REFERENCE 2 (bases 1 to 1388) AUTHORS Sullivan,K.F. TITLE Direct Submission JOURNAL Submitted (07-SEP-1994) Sullivan K.F., The Scripps Research Institute, Cell Biology, 10666 N. Torrey Pines Rd., La Jolla, CA 93037, USA FEATURES Location/Qualifiers source 1..1388 /organism="Homo sapiens" /db_xref="taxon:9606" 5'UTR 1..143 gene 144..566 /gene="CENP-A" CDS 144..566 /gene="CENP-A" /note="centromere-specific histone H3-related protein" /codon_start=1 /evidence=experimental /product="centromere protein-A" /db_xref="PID:g602414" /translation="MGPRRRSRKPEAPRRRSPSPTPTPGPSRRGPSLGASSHQHSRRR QGWLKEIRKLQKSTHLLIRKLPFSRLAREICVKFTRGVDFNWQAQALLALQEAAEAFL VHLFEDAYLLTLHAGRVTLFPKDVQLARRIRGLEEGLG" 3'UTR 567..1388 polyA_signal 1372..1377 polyA_site 1388 /note="17 A residues" BASE COUNT 349 a 326 c 331 g 382 t ORIGIN 1 gcggacttct gccaagcacc ggctcatgtg aggctcgcgg cacagcgttc tctgggctcc 61 ccagaagcca gcctttcgct cccggacccg gcagcccgag caggagccgt gggaccgggc 121 gccagcaccc tctgcggcgt gtcatgggcc cgcgccgccg gagccgaaag cccgaggccc 181 cgaggaggcg cagcccgagc ccgaccccga cccccggccc ctcccggcgg ggcccctcct 241 taggcgcttc ctcccatcaa cacagtcggc ggagacaagg ttggctaaag gagatccgaa 301 agcttcagaa gagcacacac ctcttgataa ggaagctgcc cttcagccgc ctggcaagag 361 aaatatgtgt taaattcact cgtggtgtgg acttcaattg gcaagcccag gccctattgg 421 ccctacaaga ggcagcagaa gcatttctag ttcatctctt tgaggacgcc tatctcctca 481 ccttacatgc aggccgagtt actctcttcc caaaggatgt gcaactggcc cggaggatcc 541 ggggccttga ggagggactc ggctgagctc ctgcacccag tgtttctgtc agtctttcct 601 gctcagccag gggggatgat accggggact ctccagagcc atgactagat ccaatggatt 661 ctgcgatgct gtctggactt tgctgtctct gaacagtatg tgtgtgttgc tttaaatatt 721 tttctttttt ttgagaagga gaagactgca tgactttcct ctgtaacaga ggtaatatat 781 gagacaatca acaccgttcc aaaggcctga aaataatttt cagataaaga gactccaagg 841 ttgactttag tttgtgagtt actcatgtga ctatttgagg attttgaaaa catcagattt 901 gctgtggtat gggagaaaag gttatgtact tattatttta gctctttctg taatatttac 961 attttttacc atatgtacat ttgtactttt attttacaca taagggaaaa aataagacca 1021 ctttgagcag ttgcctggaa ggctgggcat ttccatcata tagacctctg cccttcagag 1081 tagcctcacc attagtggca gcatcatgta actgagtgga ctgtgcttgt caacggatgt 1141 gtagcttttc agaaacttaa ttggggatga atagaaaacc tgtaagcttt gatgttctgg 1201 ttacttctag taaattcctg tcaaaatcaa ttcagaaatt ctaacttgga gaatttaaca 1261 ttttactctt gtaaatcata gaagatgtat cataacagtt cagaatttta aagtacattt 1321 tcgatgcttt tatgggtatt tttgtagttt ctttgtagag agataataaa aatcaaaata 1381 tttaatga // LOCUS HSU14528 2832 bp mRNA PRI 20-JUL-1995 DEFINITION Human sulfate transporter (DTD) mRNA, complete cds. ACCESSION U14528 NID g549987 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2082) AUTHORS Hastbacka,J., de la Chapelle,A., Mahtani,M.M., Clines,G., Reeve-Daly,M., Daly,M., Hamilton,B.A., Kusumi,K., Trivedi,B., Weaver,A., Coloma,A., Lovett,M., Buckler,A. and Lander,E.S. TITLE The diastrophic dysplasia gene encodes a novel sulfate transporter: positional cloning by fine-structure linkage disequilibrium mapping JOURNAL Cell 78 (6), 1073-1087 (1994) MEDLINE 95007757 REFERENCE 2 (bases 1 to 2832) AUTHORS Hastbacka,J. TITLE Direct Submission JOURNAL Submitted (08-SEP-1994) Johanna Hastbacka, Whitehead Institute for Biomedical Research, 9 Cambridge Center, Cambridge, MA 02142, USA FEATURES Location/Qualifiers source 1..2832 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="5" /map="5q31-q34" /clone="clones P2 and P8" gene 28..2247 /gene="DTD" CDS 28..2247 /gene="DTD" /note="DTDST; diastrophic dysplasia gene" /codon_start=1 /product="sulfate transporter" /db_xref="PID:g549988" /translation="MSSESKEQHNVSPRDSAEGNDSYPSGIHLELQRESSTDFKQFET NDQCRPYHRILIERQEKSDTNFKEFVIKKLQKNCQCSPAKAKNMILGFLPVLQWLPKY DLKKNILGDVMSGLIVGILLVPQSIAYSLLAGQEPVYGLYTSFFASIIYFLLGTSRHI SVGIFGVLCLMIGETVDRELQKAGYDNAHSAPSLGMVSNGSTLLNHTSDRICDKSCYA IMVGSTVTFIAGVYQVAMGFFQVGFVSVYLSDALLSGFVTGASFTILTSQAKYLLGLN LPRTNGVGSLITTWIHVFRNIHKTNLCDLITSLLCLLVLLPTKELNEHFKSKLKAPIP IELVVVVAATLASHFGKLHENYNSSIAGHIPTGFMPPKVPEWNLIPSVAVDAIAISII GFAITVSLSEMFAKKHGYTVKANQEMYAIGFCNIIPSFFHCFTTSAALAKTLVKESTG CHTQLSGVVTALVLLLVLLVIAPLFYSLQKSVLGVITIVNLRGALRKFRDLPKMWSIS RMDTVIWFVTMLSSALLSTEIGLLVGVCFSIFCVILRTQKPKSSLLGLVEESEVFESV SAYKNLQTKPGIKIFRFVAPLYYINKECFKSALYKQTVNPILIKVAWKKAAKRKIKEK VVTLGGIQDEMSVQLSHDPLELHTIVIDCSAIQFLDTAGIHTLKEVRRDYEAIGIQVL LAQCNPTVRDSLTNGEYCKKEEENLLFYSVYEAMAFAEVSKNQKGVCVPNGLSLSSD" intron 726 /gene="DTD" BASE COUNT 795 a 576 c 596 g 865 t ORIGIN 1 aggaagctga accatctatc tccagaaatg tcttcagaaa gtaaagagca acataacgtt 61 tcacccagag actcagctga aggaaatgac agttatccat ctgggatcca tctggaactt 121 caaagggaat caagtactga cttcaagcaa tttgagacca atgatcaatg cagaccttat 181 cataggatcc ttattgagcg tcaagagaaa tcagatacaa acttcaagga gtttgttatt 241 aaaaagctgc agaagaattg ccagtgcagt ccagccaaag ccaaaaatat gattttaggt 301 ttccttcctg ttttgcagtg gctcccaaaa tacgacctaa agaaaaacat tttaggggat 361 gtgatgtcag gcttgattgt gggcatatta ttggtgcccc agtccattgc ttattccctg 421 ctggctggcc aagaacctgt ctatggtctg tacacatctt tttttgccag catcatttat 481 tttctcttgg gtacctcccg tcacatctct gtgggcattt ttggagtact gtgccttatg 541 attggtgaga cagttgaccg agaactacag aaagctggct atgacaatgc ccatagtgct 601 ccttccttag gaatggtttc aaatgggagc acattattaa atcatacatc agacaggata 661 tgtgacaaaa gttgctatgc aattatggtt ggcagcactg taacctttat agctggagtt 721 tatcaggtag cgatgggctt ctttcaagtg ggttttgttt ctgtctacct ctcagatgcc 781 ttgctgagtg gatttgtcac tggtgcctcc ttcactattc ttacatctca ggccaagtat 841 cttcttgggc tcaaccttcc tcggactaat ggtgtgggct cactcatcac tacctggata 901 catgtcttca gaaacatcca taagaccaat ctctgtgatc ttatcaccag ccttttgtgc 961 cttttggttc ttttgccaac caaagaactc aatgaacact tcaaatccaa gcttaaggca 1021 ccgattccta ttgaacttgt tgttgttgta gcagccacat tagcctctca ttttggaaaa 1081 ctacatgaaa attataattc tagtattgct ggacatattc ccactgggtt tatgccaccc 1141 aaagtaccag aatggaacct aattcctagt gtggctgtag atgcaatagc tatttccatc 1201 attggttttg ctatcactgt atcactttct gagatgtttg ccaagaaaca tggttacaca 1261 gtcaaagcaa accaggaaat gtatgccatt ggcttttgta atatcatccc ttccttcttc 1321 cactgtttta ctactagtgc agctcttgca aagacattgg ttaaagaatc aacaggctgc 1381 catactcagc tttctggtgt ggtaacagcc ctggttcttt tgttggtcct cctagtaata 1441 gctcctttgt tctattccct tcaaaaaagt gtccttggtg tgatcacaat tgtaaatcta 1501 cggggagccc ttcgtaaatt tagggatctt cccaaaatgt ggagtattag tagaatggat 1561 acagttatct ggtttgttac tatgctgtcc tctgcactgc taagtactga aataggccta 1621 cttgttgggg tttgtttttc tatattttgt gtcatcctcc gcactcagaa gccaaagagt 1681 tcactgcttg gcttggtgga agagtctgag gtctttgaat ctgtgtctgc ttacaagaac 1741 cttcagacta agccaggcat caagattttc cgctttgtag cccctctcta ctacataaac 1801 aaagaatgct ttaaatctgc tttatacaaa caaactgtca acccaatctt aataaaggtg 1861 gcttggaaga aggcagcaaa gagaaagatc aaagaaaaag tagtgactct tggtggaatc 1921 caggatgaaa tgtcagtgca actttcccat gatcccttgg agctgcatac tatagtgatt 1981 gactgcagtg caattcaatt tttagataca gcagggatcc acacactgaa agaagttcgc 2041 agagattatg aagccattgg aatccaggtt ctgctggctc agtgcaatcc cactgtgagg 2101 gattccctaa ccaacggaga atattgcaaa aaggaagaag aaaaccttct cttctatagt 2161 gtgtatgaag cgatggcttt tgcagaagta tctaaaaatc agaaaggagt atgtgttccc 2221 aatggtctga gtcttagtag tgattaattg agaaggtaga tagaagaatg tctagccaat 2281 aggttaaaat ttcaagtgtc caacatttcc cagttccaca gtgggaaatt ttgcacactt 2341 gaaattttaa ccaagtggct agatattatt cctcctttga agctaatggc atttgtatat 2401 acacactgca gcagagcttg tagctggaca gagtcaaaaa gaagaaaata cggtttcagg 2461 ctttcttgca gatatgaagt attcttggaa tgcaataagt atgtattgaa ctgtactgta 2521 aagtagctcc aaaacttaat tactctcctg ttttaggggt tatacatttg gactgtgcat 2581 tctccaagag atgaagcggt gaagttggga tttacattgg aagtgctgta gacttcttta 2641 tgtggctcag tggagagagg gaaagaatgt tgcacctgct ctagtaccat aggtcaagag 2701 gcttctggat cacaaagtca taactagaca ggtttgttct tgtagttttc tatccccagt 2761 ctttgctccc cagatggcag tagtttttag taggaaagtg ccattcctgt ccttaaggca 2821 cagtctcatc ag // LOCUS HSU14550 1908 bp mRNA PRI 08-NOV-1994 DEFINITION Human sialyltransferase SThM (sthm) mRNA, complete cds. ACCESSION U14550 NID g565079 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1908) AUTHORS Sotiropoulou,G., Anisowicz,A. and Sager,R. TITLE Isolation and cloning from human mammary epithelial cells of a complete cDNA sequence homologous to other known sialyltransferases JOURNAL Unpublished REFERENCE 2 (bases 1 to 1908) AUTHORS Sotiropoulou,G. TITLE Direct Submission JOURNAL Submitted (08-SEP-1994) Georgia Sotiropoulou, Cancer Genetics, Dana-Farber Cancer Institute, 44 Binney Street, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..1908 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda ZAP II 76N" /sex="female" /cell_type="epithelial" /tissue_type="mammary" 5'UTR <1..39 gene 40..1164 /gene="sthm" CDS 40..1164 /gene="sthm" /codon_start=1 /function="sialyltransferase" /product="SThM" /db_xref="PID:g565080" /translation="MGLPRGSFFWVLLLLTAACSGLLFALYFSAVQRYPGPAAGARDT TSFEAFFQSKASNSWTGKGQACRHLLHLAIQRHPHFRGLFNLSIPVLLWGDLFTPALW DRLSQHKAPYGWRGLSHQVIASTLSLLNGSESAKLFAPPRDTPPKCIRCAVVGNGGIL NGSRQGPNIDAHDYVFRLNGAVIKGFERDVGTKTSFYGFTVNTMKNSLVSYWNLGFTS VPQGQDLQYIFIPSDIRDYVMLRSAILGVPVPEGLDKGDRPHAYFGPEASASKFKLLH PDFISYLTERFLKSKLINTHFGDLYMPSTGALMLLTALHTCDQVSAYGFITSNYWKFS DHYFERKMKPLIFYANHDLSLEAALWRDLHKAGILQLYQR" misc_feature 487..624 /gene="sthm" /note="encodes sialyl motif" 3'UTR 1165..>1908 polyA_signal 1890..1895 BASE COUNT 449 a 534 c 471 g 454 t ORIGIN 1 gggacgtcag cggacggggc gctcgcgggc cggggctgta tggggctccc gcgcgggtcg 61 ttcttctggg tgctgctcct gctcacggct gcctgctcgg ggctcctctt tgccctgtac 121 ttctcggcgg tgcagcggta cccggggcca gcggccggag ccagggacac cacatcattt 181 gaagcattct ttcaatccaa ggcatcgaat tcttggacag gaaagggcca ggcctgccga 241 cacctgcttc acctggccat tcagcggcac ccccacttcc gtggcctgtt caatctctcc 301 attccagtgc tgctgtgggg ggacctcttc accccagcgc tctgggaccg cctgagccaa 361 cacaaagccc cgtatggctg gcgggggctc tctcaccaag tcatcgcctc caccctgagc 421 cttctgaacg gctcagagag tgccaagctg tttgccccgc ccagggacac ccctccaaag 481 tgtatccggt gtgccgtggt gggcaacgga ggcattctga atgggtcccg ccagggtccc 541 aacatcgatg cccatgacta tgtattcaga ctcaatggag ctgtgatcaa aggcttcgag 601 cgcgatgtgg gcaccaagac ttccttctat ggtttcactg tgaacacgat gaagaactcc 661 ctcgtctcct actggaatct gggcttcacc tccgtgccac aaggacagga cctgcagtat 721 atcttcatcc cctcagacat ccgcgactat gtgatgctga gatcggccat tctgggcgtg 781 cctgtccctg agggcctaga taaaggggac aggccgcacg cctattttgg accagaagcc 841 tctgccagta aattcaagct gctacatccg gacttcatca gctacctgac agaaaggttc 901 ttgaaatcaa agttgattaa cacacatttt ggagacctat atatgcctag taccggggct 961 ctcatgctgc tgacagcttt gcatacctgt gaccaggtca gtgcctatgg attcatcaca 1021 agcaactact ggaaattttc cgaccactat ttcgaacgaa aaatgaagcc attgatattt 1081 tatgcaaacc acgatctgtc cctggaagct gccctgtgga gggacctgca caaggccggc 1141 atccttcagc tgtaccagcg ctgaccccaa tgcactgagc gctttgcttc ttcaagagtt 1201 gcggccctga tcctctcaag tggccaaaag cttttttaac ttttcaatct tcaccttccc 1261 ttgccaacag agggcactgg ggtgaattca agattttcat cgaggtctgt tcaatatagg 1321 acaccccagc ttgtccttgg ctcatccaag aactcttctg tatctaaaac aatacatctc 1381 aatcttggcc aagggaaaat ggactgcttt gctggattgg cactgagcaa ctttaggaaa 1441 tgtcggtgga gtgttcagca agatcagaca gcagtccagg tcaaaggcaa acacacacgc 1501 tccagcccaa atcctcctgg tggcacatcc taccccagat gctaaagtga ttcaaggact 1561 ccaggacacc tcttaagagc ctttctaaga acatgatagg cttacttctg ctccataata 1621 aagtgggaga aaaaagccag aatataactt aagactagat aactgcgtac atgatggacc 1681 attttttttt tttttggctg ggtagagaaa tcatataaaa cgcaggctgt ttagcatgga 1741 gatgactctc agaacactgg gagggtctgg cacttgatgg gggttagttg cttggcagcc 1801 tgcctgccac tgagggaagt cccattagag atgtatcacc accttgtcac caacaggatg 1861 atgtcaccaa caggatgatg tcaccaggta ataaaccttc atcctcac // LOCUS HSU14575 2401 bp mRNA PRI 05-APR-1995 DEFINITION Human (ard-1) mRNA, complete cds. ACCESSION U14575 NID g559771 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2401) AUTHORS Wang,M. and Cohen,S.N. TITLE ard-1: a human gene that reverses the effects of temperature-sensitive and deletion mutations in the Escherichia coli rne gene and encodes an activity producing RNase E-like cleavages JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91 (22), 10591-10595 (1994) MEDLINE 95024160 REFERENCE 2 (bases 1 to 2401) AUTHORS Wang,M.W. TITLE Direct Submission JOURNAL Submitted (09-SEP-1994) Maureen W Wang, Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA FEATURES Location/Qualifiers source 1..2401 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda YES library" /cell_type="B-lymphocyte" 5'UTR 1..935 gene 936..1319 /gene="ard-1" mat_peptide 936..1316 /gene="ard-1" CDS 936..1319 /gene="ard-1" /standard_name="activator of RNA decay" /codon_start=1 /db_xref="PID:g559772" /translation="MVQTAVVPVKKKRVEGPGSLGLEESGSRRMQNFAFSGGLYGGLP PTHSEAGSQPHGIHGTALIGGLPMPYPNLAPDVDLTPVVPSAVNMNPAPNPAVYNPEA VNEPKKKKYAKEAWPGKKPTPSLLI" 3'UTR 1320..2401 polyA_signal 2382..2387 polyA_site 2401 /note="eight A residues" BASE COUNT 592 a 581 c 603 g 625 t ORIGIN 1 gccacgaagg ccggcggcag ccgcgaactc cggctctagc ctcccgctgt tcgactgccc 61 aacctggtga gtggcggggc ggccagggct agagtggccc ggccggagct agcctgggct 121 ggaagggcgg ctcttttttt acttttctgc tgcgagccga acggtcagaa accccggaat 181 ggttgaggaa aaactgtttg ctgcaccggg ccgggcgacg tgttgaagaa ccgagagcct 241 ggagcccagg cccaggaact gaagaaaccc ggggttgggg gctcaaaggc gctcacttag 301 gcagcccctt tgagcgatta gccagtcgcc ggagcgcttc gaggccttgg cccgaactta 361 cgcccaactc ttgactgagt gcctggtgct ctcgtggagc atcgcatctg gccccttcct 421 gtacgtcccg agcgcgctcg agccagcccc ggccccaacc ctacctccaa gccccgcatc 481 cctctgtggt tgctgcatcc ctcgtgcggc acttgtctgt ctgccacaga gaatacgagg 541 ggcaggtaag ccccctcccg gtttacatct ggatgtagtc aaaggagaca aactaattga 601 gaaactgatt attgatgaga agaagtatta cttatttggg agaaaccctg atttgtgtga 661 ctttaccatt gaccaccagt cttgctctcg ggtccatgct gcacttgtct accacaagca 721 tctgaagaga gttttcctga tagatctcaa cagtaaacct gacagagttc aacactgccc 781 acaacaagcg gatttctacc cttaccattg aggagggaaa tctggacatt caaagaccaa 841 agaggaagag gaagaactca cgggtgacat tcagtgagga tgatgagatc atcaacccag 901 aggatgtgga tccctcagtt ggtcgattca ggaacatggt gcaaactgca gtggtcccag 961 tcaagaagaa gcgtgtggag ggccctggct ccctgggcct ggaggaatca gggagcaggc 1021 gcatgcagaa ctttgccttc agcggaggac tctacggggg cctgcccccc acacacagtg 1081 aagcaggctc ccagccacat ggcatccatg ggacagcact catcggtggc ttgcccatgc 1141 catacccaaa ccttgcccct gatgtggact tgactcctgt tgtgccgtca gcagtgaaca 1201 tgaaccctgc accaaaccct gcagtctata accctgaagc tgtaaatgaa cccaagaaga 1261 agaaatatgc aaaagaggct tggccaggca agaagcccac accttccttg ctgatttgat 1321 atttttggtc atggagaagg gtgggcttgg gtgggaatgg ggtggaaggg tgatggggag 1381 ctaatgaact agggagaaaa actttccatg tgtgcggtat cgtctttcag aatgtctcct 1441 ggcatcctaa ccatgtaata tgacaattgg gggtggggtt gaaatagccc ataaagacct 1501 gtcttcacaa cacttgcatt gtagagaaag gcttcttata tccttttcaa tagactgccc 1561 tggctctttc ctaggccttc cactacctcc tttctttctc ccactttcta ggatcatttt 1621 tatgtaaagt cacatatccc aggccctcag gttgaatcca gagctgtaga ggttacagta 1681 gcatcaccag ccttgggggt ccagagccta atttatattc actatccttc caagtcccgg 1741 gtagcagaag ggttgccata gatctcagtt tgatcaaaaa gaaggcttag aattctgcag 1801 ttaagctgag gtttaaacta aaaaatgttt ccttgggtca gtggttttga ggtccagtag 1861 ctaggctttt ctcttttgtc cttcctgttg gaatgaaaac atttcgattt tccttcatct 1921 gtgactggtg ccatagacac aggtttatag ttttaactta cagtattgtt tgaaatttac 1981 ctgtttttct tgtcaaacct gagcactcct cctgctgaag tttcttattt aattccagag 2041 tactgtcctc tactctaagg cattactttt aagggtatta tgaaggcagt ttttcaaagg 2101 atatgaccag ttggggtaat tcaaattaaa aaggaaaaga tttgtttgga gtaactggtg 2161 tctctaaggg ggatttttag tgtcaagtat ggcggctctt tcacccctcc attgagagcc 2221 cttgttattc agagctccaa gactagacct ggctaacaaa cataggagac aaagttagga 2281 aacattgata caagctttgt acagagattt gtacatttgt gtaataggcc ttttcatgct 2341 ttatgtgtag ctttttacct gtaaccttta ttacattgta aattaaacgt aacttttgtc 2401 a // LOCUS HSU14577 1575 bp mRNA PRI 15-NOV-1995 DEFINITION Human microtubule-associated protein 1A (MAP1A) mRNA, complete cds. ACCESSION U14577 NID g642451 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1575) AUTHORS Fukuyama,R. and Rapoport,S.I. TITLE Brain-specific expression of human microtubule-associated protein 1A (MAP1A) gene and its assignment to human chromosome 15 JOURNAL J. Neurosci. Res. 40 (6), 820-825 (1995) MEDLINE 95356255 REFERENCE 2 (bases 1 to 1575) AUTHORS Fukuyama,R. TITLE Direct Submission JOURNAL Submitted (09-SEP-1994) Ryuichi Fukuyama, Laboratory of Neurosciences, National Institute on Aging, National Institutes of Health, 9000 Rockville Pike, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..1575 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="BG5i-4" /clone_lib="Clontech lambda gt11 human amygdala cDNA library" /tissue_type="brain" /chromosome="15" gene 375..1208 /gene="MAP1A" CDS 375..1208 /gene="MAP1A" /codon_start=1 /product="microtubule-associated protein 1A" /db_xref="PID:g642452" /translation="MEEKLEALLEKTKALGLEESLVQEGRAREQEEKYWRGQDVVQEW QETSPTREEPAGEQKELAPAWEDTSPEQDNRYWRGREDVALEQDTYWRELSCERKVWF PHELDGQGARPHYTEERESTFLDEGPDDEQEVPLREHATRSPWASDFKDFQESSPQKG LEVERWLAESPVGLPPEEEDKLTRSPFEIISPPASPPEMVGQRVPSAPGQESPIPDPK LMPHMKNEPTTPSWLADIPPWVPKDRPLPPAPLSPAPGPPTPAPASHTPAPFSWAHSR V" BASE COUNT 461 a 418 c 460 g 236 t ORIGIN 1 ggaattccgg gaacagaagg agaagatccc agaagagaaa gacaaagcct tagatcaaaa 61 agtcagaagt gttgaacata aggctccgga ggacacggtc gctgaaatga aggacagaga 121 cctagaacag acagacaaag cccctgaaca gaaacaccag gcccaggaac aaaaggataa 181 agtctcagaa aagaaggatc aggccttaga acaaaaatac tgggctttgg gacagaagga 241 tgaagccctg gaacaaaaca ttcaggctct ggaagagaac caccaaactc aggagcagga 301 gagcctagtg caggaggata aaaccaggaa accaaagatg ctagaggaaa aattccccag 361 aaaaggtcaa ggccatggaa gagaagttag aagctcttct ggagaagacc aaagctctgg 421 gcctggaaga gagcctagtg caggagggca gggccagaga gcaggaagaa aagtactgga 481 gggggcagga tgtggtccag gagtggcaag aaacatctcc taccagagag gagccggctg 541 gagaacagaa agagcttgcc ccggcatggg aggacacatc tcctgagcag gacaatcggt 601 attggagggg cagagaggat gtggccttgg aacaggacac atactggagg gagctaagct 661 gtgagcggaa ggtctggttc cctcacgagc tggatggcca gggggcccgc ccacactaca 721 ctgaggaacg ggaaagcact ttcctagatg agggcccaga tgatgagcaa gaagtacccc 781 tgcgggaaca cgcaacccgg agcccctggg cctcagactt caaggatttc caggaatcct 841 caccacagaa ggggctagag gtggagcgct ggcttgctga atcaccagtt gggttgccac 901 cagaggaaga ggacaaactg acccgctctc cctttgagat catctcccct ccagcttccc 961 cacctgagat ggttggacaa agggttcctt cagccccagg acaagagagt cctatcccag 1021 accctaagct catgccacac atgaagaatg aacccactac tccctcatgg ctggctgaca 1081 tcccaccctg ggtgcccaag gacagacccc tcccccctgc acccctctcc ccagctcctg 1141 gtccccccac acctgccccg gcatcccata ctcctgcacc cttctcttgg gcgcacagtc 1201 gagtatgaca gtgtggtggc tgcagtgcag gagggggcag ctgagttgga aggtgggcca 1261 tactcccccc tggggaagga ctaccgcaag gctgaagggg aaagggaaga agaaggtagg 1321 gctgaggctc ctgacaaaag ctcacacagc tcaaaggtac cagaggccag caaaagccat 1381 gccaccacgg agcctgagca gactgagccg gagcagagag agcccacacc ctatcctgat 1441 gagagaagct ttcagtatgc agacatctat gagcagatga tgcttactgg gcttggccct 1501 gcatgcccca ctagagagcc tccacttggt tgagaggcat gggggccaat ccccagctgc 1561 tccaagccgg aattc // LOCUS HSU14588 3595 bp mRNA PRI 10-MAR-1995 DEFINITION Human paxillin mRNA, complete cds. ACCESSION U14588 NID g704347 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3595) AUTHORS Salgia,R., Li,J.L., Lo,S.H., Brunkhorst,B., Kansas,G.S., Sobhany,E.S., Sun,Y., Pisick,E., Hallek,M., Ernst,T. et,al. TITLE Molecular cloning of human paxillin, a focal adhesion protein phosphorylated by P210BCR/ABL JOURNAL J. Biol. Chem. 270 (10), 5039-5047 (1995) MEDLINE 95197488 REFERENCE 2 (bases 1 to 3595) AUTHORS Salgia,R. TITLE Direct Submission JOURNAL Submitted (09-SEP-1994) Ravi Salgia, Hematologic Malignancies, Dana-Farber Cancer Institute, 44 Binney Street, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..3595 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 75..1748 /codon_start=1 /product="paxillin" /db_xref="PID:g704348" /translation="MDDLDALLADLESTTSHISKRPVFLSEETPYSYPTGNHTYQEIA VPPPVPPPPSSEALNGTILDPLDQWQPSGSRFIHQQPQSSSPVYGSSAKTSSVSNPQD SVGSPCSRVGEEEHVYSFPNKQKSAEPSPTVMSTSLGSNLSELDRLLLELNAVQHNPP GFPADEANSSPPLPGALSPLYGVPETNSPLGGKAGPLTKEKPKRNGGRGLEDVRPSVE SLLDELESSVPSPVPAITVNQGEMSSPQRVTSTQQQTRISASSATRELDELMASLSDF KFMAQGKTGSSSPPGGPPKPGSQLDSMLGSLQSDLNKLGVATVAKGVCGACKKPIAGQ VVTAMGKTWHPEHFVCTHCQEEIGSRNFFERDGQPYCEKDYHNLFSPRCYYCNGPILD KVVTALDRTWHPEHFFCAQCGAFFGPEGFHEKDGKAYCRKDYFDMFAPKCGGCARAIL ENYISALNTLWHPECFVCRECFTPFVNGSFFEHDGQPYCEVHYHERRGSLCSGCQKPI TGRCITAMAKKFHPEHFVCAFCLKQLNKGTFKEQNDKPYCQNCFLKLFC" BASE COUNT 702 a 1133 c 941 g 819 t ORIGIN 1 aaaagttgcg gggcatagac gagcgccccg ggacgcagct agcgcgaccc tgagccggcg 61 cccgtggtcc ggccatggac gacctcgacg ccctgctggc ggacttggag tctaccacct 121 cccacatctc caaacggcct gtgttcttgt cggaggagac cccctactca tacccaactg 181 gaaaccacac ataccaggag attgccgtgc caccccccgt ccccccaccc ccgtccagcg 241 aggccctcaa tggcacaatc cttgacccct tagaccagtg gcagcccagc ggctcccgat 301 tcatccacca gcagcctcag tcctcatcac ctgtgtacgg ctccagtgcc aaaacttcca 361 gtgtctccaa ccctcaggac agtgttggct ctccgtgctc ccgagtgggt gaggaggagc 421 acgtctacag cttccccaac aagcagaaat cagctgagcc ttcacccacc gtaatgagca 481 cgtccctggg cagcaacctt tctgaactcg accgcctgct gctggaactg aacgctgtac 541 agcataaccc gccaggcttc cctgcagatg aggccaactc aagccccccg cttcctgggg 601 ccctgagccc cctctatggt gtcccagaga ctaacagccc cttgggaggc aaagctgggc 661 ccctgacgaa agagaagcct aagcggaatg ggggccgggg cctggaggac gtgcggccca 721 gtgtggagag tctcttggat gaactggaga gctccgtgcc cagccccgtc cctgccatca 781 ctgtgaacca gggcgagatg agcagcccgc agcgcgtcac ctccacccaa cagcagacac 841 gcatctcggc ctcctctgcc accagggagc tggacgagct gatggcttcg ctgtcggatt 901 tcaagttcat ggcccagggg aagacaggga gcagctcacc ccctgggggg cccccgaagc 961 ccgggagcca gctggacagc atgctgggga gcctgcagtc tgacctgaac aagctggggg 1021 tcgccacagt cgccaaagga gtctgcgggg cctgcaagaa gcccatcgcc gggcaggttg 1081 tgaccgccat ggggaagacg tggcaccccg agcacttcgt ctgcacccac tgccaggagg 1141 agatcggatc ccggaacttc ttcgagcggg atggacagcc ctactgtgaa aaggactacc 1201 acaacctctt ctccccgcgc tgctactact gcaacggccc catcctggat aaagtggtga 1261 cagcccttga ccggacgtgg caccctgaac acttcttctg tgcacagtgt ggagccttct 1321 ttggtcccga agggttccac gagaaggacg gcaaggccta ctgtcgcaag gactacttcg 1381 acatgttcgc acccaagtgt ggcggctgcg cccgggccat cctggagaac tatatctcag 1441 ccctcaacac gctgtggcat cctgagtgct ttgtgtgccg ggaatgcttc acgccattcg 1501 tgaacggcag cttcttcgag cacgacgggc agccctactg tgaggtgcac taccacgagc 1561 ggcgcggctc gctgtgttct ggctgccaga agcccatcac cggccgctgc atcaccgcca 1621 tggccaagaa gttccacccc gagcacttcg tctgtgcctt ctgcctcaag cagctcaaca 1681 agggcacctt caaggagcag aacgacaagc cttactgtca gaactgcttc ctcaagctct 1741 tctgctaggt gccctgcccc tgtctctgcc ccccttcccc agccagcatc cccaactgcg 1801 actgtgacct agagacttca cccgggggtg aaggggtaaa cccgactgaa actggaaccc 1861 ttgtcctccg ctggtgcggg atggacagag ggccgtgagg ggtccccctg cttgtcttca 1921 cccctgccag agcctctggg ccccctcctc cctcctgtag ctctccctag gctgcccact 1981 ctccatcctc cccaggggta gaggctgggg gctccacccc agcccatgta cgtccccacg 2041 aactggcctg gccagcaccc cacactggag ccatctcttc ctcatatttc agcagtgcag 2101 ccggggggca gggaagggca ggcagggtct gttggggtct ctttttatcc ttattcctcc 2161 cccgacctaa ttgtctttgt tctgtgatta ttgggggaca cccggctccc tccagacaat 2221 gccagcataa atccatccat ccaaaggcag agaaccaaag gggccatgga aggttctctg 2281 tgctcctcct acccttccag tgccctaggc ctggcgactg cccctgcctt ttagacccgc 2341 actccccttt tatacctgct cttgttctac tgagaaaagc ctctccagca ataatgtttt 2401 ctagtcactt cctccgtctc cgggacggcg tgcctggaca ctgtccgact ttgatagatt 2461 tctacactga ggttgaattc atatcgcctg agatgtttta cttctctata caccatgatt 2521 ttgtagagat attaaagacg ttcccttttg tatctcttct tcatcaccgc cactgggcct 2581 tcactgatgg tgtctggtgt gtagattgct ttgtctgtgg ggtggggtgg ggaagcaata 2641 tgtattttat tgtttcttag cacaagcagg tgtgctggga gcagctctgt gactccccct 2701 ctttcacttc atagctccca ggactgtttt ataaactgct gctatttgga aaccccttct 2761 ttacttccca ggccagcaag ccttcactga aactggttga agagtgttgc acccttttgg 2821 gcctagaatt ctgaacttta tctgttctgt ttctgtggga ggagaagggg aagtatgttt 2881 tggggggctg cttcctgtct gagtaagccc tcaggagcct cttgctcccc tgtgaacccc 2941 tgaaccttct gagcccccct gcttctatgg ggctctctct tctgccttct caggaaagct 3001 gtgtctgatt ttggccatca ggactctgac gtctctttgg tcttgttgat ttacctctgg 3061 gcatatccct tccccagatc tgctcctccc ctttcacagg tgggatcggc actcaggggg 3121 tctggaaaga aggtcataag ggagcatgat aggatttggg gcagagggac aggctcctct 3181 ggggaaaccc cccagagctc tttaccaagg atgaaagagg agccaggcct tgggctcctg 3241 atgaccagaa aggggccacc ggggtctaat ggtgacagtc caaaccactc cactggcctc 3301 ctggcagaag ccgagtgtgc tggggtctcc gaagagggtc cctccttttt gggggaaggt 3361 cagcccagcc cctccaaagg tctgatgtct ccactttcac ccgcaggcct taccgctctg 3421 tttatagtga cccaccctag atcttcccca agagggactg gggtttctgg ggtccattct 3481 ctgagtcagt ggttatttga aaatttgatt ttgattttat tttttctctg taaacttcca 3541 agctggcttt tcccatttca attcctgtga tttatgccaa taaagtttgc ccatg // LOCUS HSU14603 1526 bp mRNA PRI 01-MAR-1996 DEFINITION Human protein-tyrosine phosphatase (HU-PP-1) mRNA, partial sequence. ACCESSION U14603 NID g894158 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Montagna,M., Serova,O., Sylla,B.S., Feunteun,J. and Lenoir,G.M. TITLE A 100-kb physical and transcriptional map around the EDH17B2 gene: identification of three novel genes and a pseudogene of a human homologue of the rat PRL-1 tyrosine phosphatase JOURNAL Hum. Genet. 96 (5), 532-538 (1995) MEDLINE 96070985 REFERENCE 2 (bases 1 to 1526) AUTHORS Lenoir,G.M. TITLE Direct Submission JOURNAL Submitted (12-SEP-1994) Gilbert M. Lenoir, Internatioal Agency for Research on Cancer, VHC/MCA, 150 Cours Albert Thomas, 69372 Lyon CEDEX 08, France FEATURES Location/Qualifiers source 1..1526 /organism="Homo sapiens" /note="the chromosomal location of the functional gene is unknown; a pseudogene corresponding to cDNA OV-1 maps at 17q12-21" /db_xref="taxon:9606" /clone="pov1" /sex="female" /tissue_type="ovary" /dev_stage="adult" gene 424..927 /gene="HU-PP-1" CDS 424..927 /gene="HU-PP-1" /note="similar to rat tyrosine phosphatase encoded by GenBank Accession Number L27843" /codon_start=1 /evidence=experimental /product="protein-tyrosine phosphatase" /db_xref="PID:g894159" /translation="MNRPAPVEISYENMRFLITHNPTNATLNKFTEELKKYGVTTLVR VCDATYDKAPVEKEGIHVLDWPFDDGAPPPNQIVDDWLNLLKTKFREEPGCCVAVHCV AGLGRAPVLVALALIECGMKYEDAVQFIRQKRRGAFNSKQLLYLEKYRPKMRLRFRDT NGHCCVQ" BASE COUNT 441 a 274 c 318 g 493 t ORIGIN 1 gttttttttt ttttttttaa ttgcaagcat atttctttta atgactccag taaaattaag 61 catcaagtaa acaagtggaa agtgacctac acttttaact tgtctcacta gtgcctaaat 121 gtagtaaagg ctgcttaagt tttgtatgta gttggatttt ttggagtccg aaggtatcca 181 tctgcagaaa ttgaggccca aattgaattt ggattcaagt ggattctaaa tactttgctt 241 atcttgaaga gagaagcttc ataaggaata aacaagttga atagagaaaa cactgattga 301 taataggcat tttagtggtc tttttaatgt tttctgctgt gaaacatttc aagatttatt 361 gatttttttt tttcactttc cccatcacac tcacacgcac gctcacactt tttatttgcc 421 ataatgaacc gtccagcccc tgtggagatc tcctatgaga acatgcgttt tctgataact 481 cacaacccta ccaatgctac tctcaacaag ttcacagagg aacttaagaa gtatggagtg 541 acgactttgg ttcgagtttg tgatgctaca tatgataaag ctccagttga aaaagaagga 601 atccacgttc tagattggcc atttgatgat ggagctccac cccctaatca gatagtagat 661 gattggttaa acctgttaaa aaccaaattt cgtgaagagc caggttgctg tgttgcagtg 721 cattgtgttg caggattggg aagggcacct gtgctggttg cacttgcttt gattgaatgt 781 ggaatgaagt acgaagatgc agttcagttt ataagacaaa aaagaagggg agcgttcaat 841 tccaaacagc tgctttattt ggagaaatac cgacctaaga tgcgattacg cttcagagat 901 accaatgggc attgctgtgt tcagtagaag gaaatgtaaa cgaaggctga cttgattgtg 961 ccatttagag ggaactcttg gtacctggaa atgtgaatct ggaatattac ctgtgtcatc 1021 aaagtagtga tggattcagt actcctcaac cactctccta atgattggaa caaaagcaaa 1081 caaaaaagaa atctctctat aaaatgaata aaatgtttaa gaaaagagaa agagaaaagg 1141 aattaattca gtgaaggatg attttgctcc tagttttgga gtttgaattt ctgccaggat 1201 tgaattattt tgaaatctcc tgtcttttta aactttttca aaataggtct ctaaggaaaa 1261 ccagcagaac attagcctgt gcaaaaccat ctgtttgggg agcacactct tccattatgc 1321 ttggcacata gatctccctg tggtgggatt ttttttttcc ctttttttgt gggggagggt 1381 tggtggtata tttttcccct cttttttcct tcctctccta catctccctt ttcccccgat 1441 ccaagttgta gatggaatag aagcccttgt tgctgtagat gtgcgtgcag tctggcagcc 1501 ttaagcccac ctgggcactt ttagat // LOCUS HSU14631 1873 bp mRNA PRI 12-MAR-1996 DEFINITION Human 11 beta-hydroxysteroid dehydrogenase type II mRNA, complete cds. ACCESSION U14631 NID g1222521 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1873) AUTHORS Albiston,A.L., Obeyesekere,V.R., Smith,R.E. and Krozowski,Z.S. TITLE Cloning and tissue distribution of the human 11 beta-hydroxysteroid dehydrogenase type 2 enzyme JOURNAL Mol. Cell. Endocrinol. 105 (2), R11-R17 (1994) MEDLINE 95163772 REFERENCE 2 (bases 1 to 1873) AUTHORS Krozowski,Z., Albiston,A.L., Obeyesekere,V.R., Andrews,R.K. and Smith,R.E. TITLE The human 11 beta-hydroxysteroid dehydrogenase type II enzyme: comparisons with other species and localization to the distal nephron JOURNAL J. Steroid Biochem. Mol. Biol. 55 (5-6), 457-464 (1995) MEDLINE 96135290 REFERENCE 3 (bases 1 to 1873) AUTHORS Krozowski,Z. TITLE Direct Submission JOURNAL Submitted (13-SEP-1994) Zygmunt S. Krozowski, Baker Medical Research Institute, Prahran, Victoria, Australia FEATURES Location/Qualifiers source 1..1873 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="female" /tissue_type="kidney" CDS 111..1328 /note="member of the short chain alcohol dehydrogenase superfamily" /codon_start=1 /product="11 beta-hydroxysteroid dehydrogenase type II" /db_xref="PID:g565082" /translation="MERWPWPSGGAWLLVAARALLQLLRSDLRLGRPLLAALALLAAL DWLCQRLLPPPAALAVLAAAGWIALSRLARPQRLPVATRAVLITGCDSGFGKETAKKL DSMGFTVLATVLELNSPGAIELRTCCSPRLRLLQMDLTKPGDISRLLEFTKAHTTSTG LWGLVNNAGHNEVVADAELSPVATFRSCMEVNFFGALELTKGLLPLLRSSRGRIVTVG SPAGDMPYPCLGAYGTSKAAVALLMDTFSCELLPWGVKVSIIQPGCFKTESVRNVGQW EKRKQLLLANLPQELLQAYGKDYIEHLHGQFLHSLRLAMSDLTPVVDAITDALLAARP RRRYYPGQGLGLMYFIHYYLPEGLRRRFLQAFFISHCLPRALQPGQPGTTPPQDAAQD PNLSPGPSPAVAR" polyA_site 1873 /note="10 A residues" BASE COUNT 330 a 668 c 518 g 357 t ORIGIN 1 cgcgccccag gccggtgtac ccccgcactc cgcgccccgg cctagaagct ctctctcccc 61 gctccccggc ccggcccccg ccccgccccg ccccagcccg ctgggccgcc atggagcgct 121 ggccttggcc gtcgggcggc gcctggctgc tcgtggctgc ccgcgcgctg ctgcagctgc 181 tgcgctcaga cctgcgtctg ggccgcccgc tgctggcggc gctggcgctg ctggccgcgc 241 tcgactggct gtgccagcgc ctgctgcccc cgccggccgc actcgccgtg ctggccgccg 301 ccggctggat cgcgttgtcc cgcctggcgc gcccgcagcg cctgccggtg gccactcgcg 361 cggtgctcat caccggctgt gactctggtt ttggcaagga gacggccaag aaactggact 421 ccatgggctt cacggtgctg gccaccgtat tggagttgaa cagccccggt gccatcgagc 481 tgcgtacctg ctgctcccct cgcctaaggc tgctgcagat ggacctgacc aaaccaggag 541 acattagccg cttgctagag ttcaccaagg cccacaccac cagcaccggc ctgtggggcc 601 tcgtcaacaa cgcaggccac aatgaagtag ttgctgatgc ggagctgtct ccagtggcca 661 ctttccgtag ctgcatggag gtgaatttct ttggcgcgct cgagctgacc aagggcctcc 721 tgcccctgct gcgcagctca aggggccgca tcgtgactgt ggggagccca gcgggggaca 781 tgccatatcc gtgcttgggg gcctatggaa cctccaaagc ggccgtggcg ctactcatgg 841 acacattcag ctgtgaactc cttccctggg gggtcaaggt cagcatcatc cagcctggct 901 gcttcaagac agagtcagtg agaaacgtgg gtcagtggga aaagcgcaag caattgctgc 961 tggccaacct gcctcaagag ctgctgcagg cctacggcaa ggactacatc gagcacttgc 1021 atgggcagtt cctgcactcg ctacgcctgg ccatgtccga cctcacccca gttgtagatg 1081 ccatcacaga tgcgctgctg gcagctcggc cccgccgccg ctattacccc ggccagggcc 1141 tggggctcat gtacttcatc cactactacc tgcctgaagg cctgcggcgc cgcttcctgc 1201 aggccttctt catcagtcac tgtctgcctc gagcactgca gcctggccag cctggcacta 1261 ccccaccaca ggacgcagcc caggacccaa acctgagccc cggcccttcc ccagcagtgg 1321 ctcggtgagc catgtgcacc tatggcccag ccactgcagc acaggaggct ccgtgagcct 1381 tggttcctcc ccgaaaaccc ccagcattac gatcccccaa gtgtcctgga ccctggccta 1441 aagaatccca cccccacttc atgcccactg ccgatgccca atccaggccc ggtgaggcca 1501 aggtttccca gtgagcctct gcgcctctcc actgtttcat gagcccaaac accctcctgg 1561 cacaacgctc taccctgcag cttggagaac tccgctggat gggagtctca tgcaagactt 1621 cactgcagcc tttcacagga ctctgcagat agtgcctctg caaactaagg agtgactagg 1681 tgggttgggg accccctcag gattgtttct cggcaccagt gcctcagtgc tgcaattgag 1741 ggctaaatcc caagtgtctc ttgactggct caagaattag ggccccaact acacaccccc 1801 aagccacagg gaagcatgta ctgtacttcc caattgccac attttaaata aagacaaatt 1861 tttatttctt cta // LOCUS HSU14650 1458 bp mRNA PRI 10-FEB-1996 DEFINITION Human platelet-endothelial tetraspan antigen 3 mRNA, complete cds. ACCESSION U14650 NID g541612 KEYWORDS platelet-endothelial tetraspan antigen; PETA-3; transmembrane glycoprotein; TM4SF protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1458) AUTHORS Fitter,S., Tetaz,T.J., Berndt,M.C. and Ashman,L.K. TITLE Molecular cloning of cDNA encoding a novel platelet-endothelial cell tetra-span antigen, PETA-3 JOURNAL Blood 86 (4), 1348-1355 (1995) MEDLINE 95359431 REFERENCE 2 (bases 1 to 1458) AUTHORS Fitter,S. TITLE Direct Submission JOURNAL Submitted (14-SEP-1994) Steve Fitter, Institute of Medical and Veterinary Science, Hanson Centre For Cancer Research, Rundle Mall, Adelaide, South Australia 5000, Australia FEATURES Location/Qualifiers source 1..1458 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pGP27.1 and pGP27.2" /clone_lib="M07e cDNA library in Bluescript KS (+)" /cell_line="megakaryoblastic cell line M07e" CDS 60..821 /standard_name="PETA-3" /note="cell surface glycoprotein; PETA-3 is similar to members of the newly defined Transmembrane 4 superfamily (TM4SF) which are characterized by the presence of 4 putative transmembrane domains" /codon_start=1 /product="platelet-endothelial tetraspan antigen 3" /db_xref="PID:g541613" /translation="MGEFNEKKTTCGTVCLKYLLFTYNCCFWLAGLAVMAVGIWTLAL KSDYISLLASGTYLATAYILVVAGTVVMVTGVLGCCATFKERRNLLRLYFILLLIIFL LEIIAGILAYAYYQQLNTELKENLKDTMTKRYHQPGHEAVTSAVDQLQQEFHCCGSNN SQDWRDSEWIRSQEAGGRVVPDSCCKTVVALCGQRDHASNIYKVEGGCITKLETFIQE HLRVIGAVGIGIACVQVFGMIFTCCLYRSLKLEHY" polyA_signal 1440..1445 polyA_site 1458 /note="8 A residues" BASE COUNT 269 a 459 c 433 g 297 t ORIGIN 1 tcgcccccgc agctgccgcc gccgccaggg cccggactcg gacgcgtggt agccccagga 61 tgggtgagtt caacgagaag aagacaacat gtggcaccgt ttgcctcaag tacctgctgt 121 ttacctacaa ttgctgcttc tggctggctg gcctggctgt catggcagtg ggcatctgga 181 cgctggccct caagagtgac tacatcagcc tgctggcctc aggcacctac ctggccacag 241 cctacatcct ggtggtggcg ggcactgtcg tcatggtgac tggggtcttg ggctgctgcg 301 ccaccttcaa ggagcgtcgg aacctgctgc gcctgtactt catcctgctc ctcatcatct 361 ttctgctgga gatcatcgct ggtatcctcg cctacgccta ctaccagcag ctgaacacgg 421 agctcaagga gaacctgaag gacaccatga ccaagcgcta ccaccagccg ggccatgagg 481 ctgtgaccag cgctgtggac cagctgcagc aggagttcca ctgctgtggc agcaacaact 541 cacaggactg gcgagacagt gagtggatcc gctcacagga ggccggtggc cgtgtggtcc 601 cagacagctg ctgcaagacg gtggtggctc tttgtggaca gcgagaccat gcctccaaca 661 tctacaaggt ggagggcggc tgcatcacca agttggagac cttcatccag gagcacctga 721 gggtcattgg ggctgtgggg atcggcattg cctgtgtgca ggtctttggc atgatcttca 781 cgtgctgcct gtacaggagt ctcaagctgg agcactactg accctgcctt gggccttgct 841 gctgctgcac ccaactactg agctgagacc actgagtacc aggggctggg ctccctgatg 901 acacccaccc tgtgccatca ccataacctc tggggacccc aacctcagag gcagcttcaa 961 gtgccttttg ctgcgcacca atgcccagca ggggaggtga ggggggctgg cggggcgaag 1021 tttggggggt gttttgtggg gctccccgga catactctct gcctggtggt cagatgcagg 1081 ttggaagggg ccttgctgag tggcgcaagg ccgagcgttc ccagcagggg gagaaaccct 1141 tcacacccca ggcccttcag gaactggggc tttgccttgc agccacatgg ccccatccca 1201 gttggggaag ccaggtgagc tctgaccctt gggcctgggc ctctgcccct cccaacccag 1261 ccgtcgtctc cctcgacagc gcccctgctg tcttccccac cgcagtcacc accacccgaa 1321 atgccacgtg gtcactgtgc actgccctgt tcatgtgcct ctgcggggca gggccttcct 1381 ggttttgtac actgctgtac ccagatgcct acaaccatcc ctgccacata caggtgctca 1441 ataaacactt gtagagca // LOCUS HSU14680 5711 bp mRNA PRI 05-AUG-1995 DEFINITION Human breast and ovarian cancer susceptibility (BRCA1) mRNA, complete cds. ACCESSION U14680 NID g555931 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5711) AUTHORS Miki Y., Swensen J., Shattuck-Eidens D., Futreal P.A., Harshman K., Tavtigian S., Liu Q., Cochran C., Bennett L.M., Ding W., Bell R., Rosenthal J., Hussey C., Tran T., McClure M., Frye C., Hattier T., Phelps R., Haugen-Strano A., Katcher H., Yakumo K., Gholami Z., Shaffer D., Stone S., Bayer S., Wray C., Bogden R., Dayananth P., Ward J., Tonin P., Narod S., Bristow P.K., Norris F.H., Helvering L., Morrison P., Rosteck P., Lai M., Barrett J.C., Lewis C., Neuhausen S., Cannon-Albright L., Goldgar D., Wiseman R., Kamb A. and Skolnick M.H. TITLE A strong candidate for the breast and ovarian cancer susceptibility gene BRCA1 JOURNAL Science 266 (5182), 66-71 (1994) MEDLINE 95025896 REFERENCE 2 (bases 1 to 5711) AUTHORS Skolnick,M.H. TITLE Direct Submission JOURNAL Submitted (14-SEP-1994) Mark H. Skolnick, Myriad Genetics Inc. and the University of Utah, 421 Wakara Way, Suite 201, Salt Lake City, UT 84108, USA FEATURES Location/Qualifiers source 1..5711 /organism="Homo sapiens" /note="For sequence of alternatively spliced exon 4, see GenBank Accession Number U15595" /db_xref="taxon:9606" /chromosome="17" /map="17q21; spans D17S855" 5'UTR 1..119 exon 1..100 /number=1 exon 101..199 /number=2 gene 120..5711 /gene="BRCA1" CDS 120..5711 /gene="BRCA1" /note="influences susceptibility to breast and ovarian cancer" /codon_start=1 /db_xref="PID:g555932" /translation="MDLSALRVEEVQNVINAMQKILECPICLELIKEPVSTKCDHIFC KFCMLKLLNQKKGPSQCPLCKNDITKRSLQESTRFSQLVEELLKIICAFQLDTGLEYA NSYNFAKKENNSPEHLKDEVSIIQSMGYRNRAKRLLQSEPENPSLQETSLSVQLSNLG TVRTLRTKQRIQPQKTSVYIELGSDSSEDTVNKATYCSVGDQELLQITPQGTRDEISL DSAKKAACEFSETDVTNTEHHQPSNNDLNTTEKRAAERHPEKYQGSSVSNLHVEPCGT NTHASSLQHENSSLLLTKDRMNVEKAEFCNKSKQPGLARSQHNRWAGSKETCNDRRTP STEKKVDLNADPLCERKEWNKQKLPCSENPRDTEDVPWITLNSSIQKVNEWFSRSDEL LGSDDSHDGESESNAKVADVLDVLNEVDEYSGSSEKIDLLASDPHEALICKSERVHSK SVESNIEDKIFGKTYRKKASLPNLSHVTENLIIGAFVTEPQIIQERPLTNKLKRKRRP TSGLHPEDFIKKADLAVQKTPEMINQGTNQTEQNGQVMNITNSGHENKTKGDSIQNEK NPNPIESLEKESAFKTKAEPISSSISNMELELNIHNSKAPKKNRLRRKSSTRHIHALE LVVSRNLSPPNCTELQIDSCSSSEEIKKKKYNQMPVRHSRNLQLMEGKEPATGAKKSN KPNEQTSKRHDSDTFPELKLTNAPGSFTKCSNTSELKEFVNPSLPREEKEEKLETVKV SNNAEDPKDLMLSGERVLQTERSVESSSISLVPGTDYGTQESISLLEVSTLGKAKTEP NKCVSQCAAFENPKGLIHGCSKDNRNDTEGFKYPLGHEVNHSRETSIEMEESELDAQY LQNTFKVSKRQSFAPFSNPGNAEEECATFSAHSGSLKKQSPKVTFECEQKEENQGKNE SNIKPVQTVNITAGFPVVGQKDKPVDNAKCSIKGGSRFCLSSQFRGNETGLITPNKHG LLQNPYRIPPLFPIKSFVKTKCKKNLLEENFEEHSMSPEREMGNENIPSTVSTISRNN IRENVFKEASSSNINEVGSSTNEVGSSINEIGSSDENIQAELGRNRGPKLNAMLRLGV LQPEVYKQSLPGSNCKHPEIKKQEYEEVVQTVNTDFSPYLISDNLEQPMGSSHASQVC SETPDDLLDDGEIKEDTSFAENDIKESSAVFSKSVQKGELSRSPSPFTHTHLAQGYRR GAKKLESSEENLSSEDEELPCFQHLLFGKVNNIPSQSTRHSTVATECLSKNTEENLLS LKNSLNDCSNQVILAKASQEHHLSEETKCSASLFSSQCSELEDLTANTNTQDPFLIGS SKQMRHQSESQGVGLSDKELVSDDEERGTGLEENNQEEQSMDSNLGEAASGCESETSV SEDCSGLSSQSDILTTQQRDTMQHNLIKLQQEMAELEAVLEQHGSQPSNSYPSIISDS SALEDLRNPEQSTSEKAVLTSQKSSEYPISQNPEGLSADKFEVSADSSTSKNKEPGVE RSSPSKCPSLDDRWYMHSCSGSLQNRNYPSQEELIKVVDVEEQQLEESGPHDLTETSY LPRQDLEGTPYLESGISLFSDDPESDPSEDRAPESARVGNIPSSTSALKVPQLKVAES AQSPAAAHTTDTAGYNAMEESVSREKPELTASTERVNKRMSMVVSGLTPEEFMLVYKF ARKHHITLTNLITEETTHVVMKTDAEFVCERTLKYFLGIAGGKWVVSYFWVTQSIKER KMLNEHDFEVRGDVVNGRNHQGPKRARESQDRKIFRGLEICCYGPFTNMPTDQLEWMV QLCGASVVKELSSFTLGTGVHPIVVVQPDAWTEDNGFHAIGQMCEAPVVTREWVLDSV ALYQCQELDTYLIPQIPHSHY" exon 200..253 /gene="BRCA1" /number=3 exon 254..331 /gene="BRCA1" /number=5 exon 332..420 /gene="BRCA1" /number=6 exon 421..560 /gene="BRCA1" /number=7 exon 561..665 /gene="BRCA1" /number=8 exon 666..712 /gene="BRCA1" /number=9 exon 713..788 /gene="BRCA1" /number=10 exon 789..4215 /gene="BRCA1" /number=11 exon 4216..4302 /gene="BRCA1" /number=12 exon 4303..4476 /gene="BRCA1" /number=13 exon 4477..4603 /gene="BRCA1" /number=14 exon 4604..4794 /gene="BRCA1" /number=15 exon 4795..5105 /gene="BRCA1" /number=16 exon 5106..5193 /gene="BRCA1" /number=17 exon 5194..5273 /gene="BRCA1" /number=18 exon 5274..5310 /gene="BRCA1" /number=19 exon 5311..5396 /gene="BRCA1" /number=20 exon 5397..5451 /gene="BRCA1" /number=21 exon 5452..5526 /gene="BRCA1" /number=22 exon 5527..5586 /gene="BRCA1" /number=23 exon 5587..5711 /gene="BRCA1" /number=24 BASE COUNT 1956 a 1099 c 1274 g 1382 t ORIGIN 1 agctcgctga gacttcctgg accccgcacc aggctgtggg gtttctcaga taactgggcc 61 cctgcgctca ggaggccttc accctctgct ctgggtaaag ttcattggaa cagaaagaaa 121 tggatttatc tgctcttcgc gttgaagaag tacaaaatgt cattaatgct atgcagaaaa 181 tcttagagtg tcccatctgt ctggagttga tcaaggaacc tgtctccaca aagtgtgacc 241 acatattttg caaattttgc atgctgaaac ttctcaacca gaagaaaggg ccttcacagt 301 gtcctttatg taagaatgat ataaccaaaa ggagcctaca agaaagtacg agatttagtc 361 aacttgttga agagctattg aaaatcattt gtgcttttca gcttgacaca ggtttggagt 421 atgcaaacag ctataatttt gcaaaaaagg aaaataactc tcctgaacat ctaaaagatg 481 aagtttctat catccaaagt atgggctaca gaaaccgtgc caaaagactt ctacagagtg 541 aacccgaaaa tccttccttg caggaaacca gtctcagtgt ccaactctct aaccttggaa 601 ctgtgagaac tctgaggaca aagcagcgga tacaacctca aaagacgtct gtctacattg 661 aattgggatc tgattcttct gaagataccg ttaataaggc aacttattgc agtgtgggag 721 atcaagaatt gttacaaatc acccctcaag gaaccaggga tgaaatcagt ttggattctg 781 caaaaaaggc tgcttgtgaa ttttctgaga cggatgtaac aaatactgaa catcatcaac 841 ccagtaataa tgatttgaac accactgaga agcgtgcagc tgagaggcat ccagaaaagt 901 atcagggtag ttctgtttca aacttgcatg tggagccatg tggcacaaat actcatgcca 961 gctcattaca gcatgagaac agcagtttat tactcactaa agacagaatg aatgtagaaa 1021 aggctgaatt ctgtaataaa agcaaacagc ctggcttagc aaggagccaa cataacagat 1081 gggctggaag taaggaaaca tgtaatgata ggcggactcc cagcacagaa aaaaaggtag 1141 atctgaatgc tgatcccctg tgtgagagaa aagaatggaa taagcagaaa ctgccatgct 1201 cagagaatcc tagagatact gaagatgttc cttggataac actaaatagc agcattcaga 1261 aagttaatga gtggttttcc agaagtgatg aactgttagg ttctgatgac tcacatgatg 1321 gggagtctga atcaaatgcc aaagtagctg atgtattgga cgttctaaat gaggtagatg 1381 aatattctgg ttcttcagag aaaatagact tactggccag tgatcctcat gaggctttaa 1441 tatgtaaaag tgaaagagtt cactccaaat cagtagagag taatattgaa gacaaaatat 1501 ttgggaaaac ctatcggaag aaggcaagcc tccccaactt aagccatgta actgaaaatc 1561 taattatagg agcatttgtt actgagccac agataataca agagcgtccc ctcacaaata 1621 aattaaagcg taaaaggaga cctacatcag gccttcatcc tgaggatttt atcaagaaag 1681 cagatttggc agttcaaaag actcctgaaa tgataaatca gggaactaac caaacggagc 1741 agaatggtca agtgatgaat attactaata gtggtcatga gaataaaaca aaaggtgatt 1801 ctattcagaa tgagaaaaat cctaacccaa tagaatcact cgaaaaagaa tctgctttca 1861 aaacgaaagc tgaacctata agcagcagta taagcaatat ggaactcgaa ttaaatatcc 1921 acaattcaaa agcacctaaa aagaataggc tgaggaggaa gtcttctacc aggcatattc 1981 atgcgcttga actagtagtc agtagaaatc taagcccacc taattgtact gaattgcaaa 2041 ttgatagttg ttctagcagt gaagagataa agaaaaaaaa gtacaaccaa atgccagtca 2101 ggcacagcag aaacctacaa ctcatggaag gtaaagaacc tgcaactgga gccaagaaga 2161 gtaacaagcc aaatgaacag acaagtaaaa gacatgacag cgatactttc ccagagctga 2221 agttaacaaa tgcacctggt tcttttacta agtgttcaaa taccagtgaa cttaaagaat 2281 ttgtcaatcc tagccttcca agagaagaaa aagaagagaa actagaaaca gttaaagtgt 2341 ctaataatgc tgaagacccc aaagatctca tgttaagtgg agaaagggtt ttgcaaactg 2401 aaagatctgt agagagtagc agtatttcat tggtacctgg tactgattat ggcactcagg 2461 aaagtatctc gttactggaa gttagcactc tagggaaggc aaaaacagaa ccaaataaat 2521 gtgtgagtca gtgtgcagca tttgaaaacc ccaagggact aattcatggt tgttccaaag 2581 ataatagaaa tgacacagaa ggctttaagt atccattggg acatgaagtt aaccacagtc 2641 gggaaacaag catagaaatg gaagaaagtg aacttgatgc tcagtatttg cagaatacat 2701 tcaaggtttc aaagcgccag tcatttgctc cgttttcaaa tccaggaaat gcagaagagg 2761 aatgtgcaac attctctgcc cactctgggt ccttaaagaa acaaagtcca aaagtcactt 2821 ttgaatgtga acaaaaggaa gaaaatcaag gaaagaatga gtctaatatc aagcctgtac 2881 agacagttaa tatcactgca ggctttcctg tggttggtca gaaagataag ccagttgata 2941 atgccaaatg tagtatcaaa ggaggctcta ggttttgtct atcatctcag ttcagaggca 3001 acgaaactgg actcattact ccaaataaac atggactttt acaaaaccca tatcgtatac 3061 caccactttt tcccatcaag tcatttgtta aaactaaatg taagaaaaat ctgctagagg 3121 aaaactttga ggaacattca atgtcacctg aaagagaaat gggaaatgag aacattccaa 3181 gtacagtgag cacaattagc cgtaataaca ttagagaaaa tgtttttaaa gaagccagct 3241 caagcaatat taatgaagta ggttccagta ctaatgaagt gggctccagt attaatgaaa 3301 taggttccag tgatgaaaac attcaagcag aactaggtag aaacagaggg ccaaaattga 3361 atgctatgct tagattaggg gttttgcaac ctgaggtcta taaacaaagt cttcctggaa 3421 gtaattgtaa gcatcctgaa ataaaaaagc aagaatatga agaagtagtt cagactgtta 3481 atacagattt ctctccatat ctgatttcag ataacttaga acagcctatg ggaagtagtc 3541 atgcatctca ggtttgttct gagacacctg atgacctgtt agatgatggt gaaataaagg 3601 aagatactag ttttgctgaa aatgacatta aggaaagttc tgctgttttt agcaaaagcg 3661 tccagaaagg agagcttagc aggagtccta gccctttcac ccatacacat ttggctcagg 3721 gttaccgaag aggggccaag aaattagagt cctcagaaga gaacttatct agtgaggatg 3781 aagagcttcc ctgcttccaa cacttgttat ttggtaaagt aaacaatata ccttctcagt 3841 ctactaggca tagcaccgtt gctaccgagt gtctgtctaa gaacacagag gagaatttat 3901 tatcattgaa gaatagctta aatgactgca gtaaccaggt aatattggca aaggcatctc 3961 aggaacatca ccttagtgag gaaacaaaat gttctgctag cttgttttct tcacagtgca 4021 gtgaattgga agacttgact gcaaatacaa acacccagga tcctttcttg attggttctt 4081 ccaaacaaat gaggcatcag tctgaaagcc agggagttgg tctgagtgac aaggaattgg 4141 tttcagatga tgaagaaaga ggaacgggct tggaagaaaa taatcaagaa gagcaaagca 4201 tggattcaaa cttaggtgaa gcagcatctg ggtgtgagag tgaaacaagc gtctctgaag 4261 actgctcagg gctatcctct cagagtgaca ttttaaccac tcagcagagg gataccatgc 4321 aacataacct gataaagctc cagcaggaaa tggctgaact agaagctgtg ttagaacagc 4381 atgggagcca gccttctaac agctaccctt ccatcataag tgactcttct gcccttgagg 4441 acctgcgaaa tccagaacaa agcacatcag aaaaagcagt attaacttca cagaaaagta 4501 gtgaataccc tataagccag aatccagaag gcctttctgc tgacaagttt gaggtgtctg 4561 cagatagttc taccagtaaa aataaagaac caggagtgga aaggtcatcc ccttctaaat 4621 gcccatcatt agatgatagg tggtacatgc acagttgctc tgggagtctt cagaatagaa 4681 actacccatc tcaagaggag ctcattaagg ttgttgatgt ggaggagcaa cagctggaag 4741 agtctgggcc acacgatttg acggaaacat cttacttgcc aaggcaagat ctagagggaa 4801 ccccttacct ggaatctgga atcagcctct tctctgatga ccctgaatct gatccttctg 4861 aagacagagc cccagagtca gctcgtgttg gcaacatacc atcttcaacc tctgcattga 4921 aagttcccca attgaaagtt gcagaatctg cccagagtcc agctgctgct catactactg 4981 atactgctgg gtataatgca atggaagaaa gtgtgagcag ggagaagcca gaattgacag 5041 cttcaacaga aagggtcaac aaaagaatgt ccatggtggt gtctggcctg accccagaag 5101 aatttatgct cgtgtacaag tttgccagaa aacaccacat cactttaact aatctaatta 5161 ctgaagagac tactcatgtt gttatgaaaa cagatgctga gtttgtgtgt gaacggacac 5221 tgaaatattt tctaggaatt gcgggaggaa aatgggtagt tagctatttc tgggtgaccc 5281 agtctattaa agaaagaaaa atgctgaatg agcatgattt tgaagtcaga ggagatgtgg 5341 tcaatggaag aaaccaccaa ggtccaaagc gagcaagaga atcccaggac agaaagatct 5401 tcagggggct agaaatctgt tgctatgggc ccttcaccaa catgcccaca gatcaactgg 5461 aatggatggt acagctgtgt ggtgcttctg tggtgaagga gctttcatca ttcacccttg 5521 gcacaggtgt ccacccaatt gtggttgtgc agccagatgc ctggacagag gacaatggct 5581 tccatgcaat tgggcagatg tgtgaggcac ctgtggtgac ccgagagtgg gtgttggaca 5641 gtgtagcact ctaccagtgc caggagctgg acacctacct gataccccag atcccccaca 5701 gccactactg a // LOCUS HSU14722 1518 bp mRNA PRI 08-OCT-1994 DEFINITION Human activin type I receptor mRNA, complete cds. ACCESSION U14722 NID g555933 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1518) AUTHORS Carcamo,J., Weis,F.M., Ventura,F., Wieser,R., Wrana,J.L., Attisano,L. and Massague,J. TITLE Type I receptors specify growth-inhibitory and transcriptional responses to transforming growth factor beta and activin JOURNAL Mol. Cell. Biol. 14 (6), 3810-3821 (1994) MEDLINE 94254839 REFERENCE 2 (bases 1 to 1518) AUTHORS Attisano,L. TITLE Direct Submission JOURNAL Submitted (15-SEP-1994) Liliana Attisano, Cell Biology and Genetics Program, Memorial Sloan-Kettering Cancer Center, 1275 York Ave., New York City, NY 10021, USA FEATURES Location/Qualifiers source 1..1518 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="ActR-IB" /tissue_type="kidney" CDS 1..1518 /codon_start=1 /product="activin type I receptor" /db_xref="PID:g555934" /translation="MAESAGASSFFPLVVLLLAGSGGSGPRGVQALLCACTSCLQANY TCETDGACMVSIFNLDGMEHHVRTCIPKVELVPAGKPFYCLSSEDLRNTHCCYTDYCN RIDLRVPSGHLKEPEHPSMWGPVELVGIIAGPVFLLFLIIIIVFLVINYHQRVYHNRQ RLDMEDPSCEMCLSKDKTLQDLVYDLSTSGSGSGLPLFVQRTVARTIVLQEIIGKGRF GEVWRGRWRGGDVAVKIFSSREERSWFREAEIYQTVMLRHENILGFIAADNKDNGTWT QLWLVSDYHEHGSLFDYLNRYTVTIEGMIKLALSAASGLAHLHMEIVGTQGKPGIAHR DLKSKNILVKKNGMCAIADLGLAVRHDAVTDTIDIAPNQRVGTKRYMAPEVLDETINM KHFDSFKCADIYALGLVYWEIARRCNSGGVHEEYQLPYYDLVPSDPSIEEMRKVVCDQ KLRPNIPNWWQSYEALRVMGKMMRECWYANGAARLTALRIKKTLSQLSVQEDVKI" BASE COUNT 339 a 394 c 425 g 360 t ORIGIN 1 atggcggagt cggccggagc ctcctccttc ttcccccttg ttgtcctcct gctcgccggc 61 agcggcgggt ccgggccccg gggggtccag gctctgctgt gtgcgtgcac cagctgcctc 121 caggccaact acacgtgtga gacagatggg gcctgcatgg tttccatttt caatctggat 181 gggatggagc accatgtgcg cacctgcatc cccaaagtgg agctggtccc tgccgggaag 241 cccttctact gcctgagctc ggaggacctg cgcaacaccc actgctgcta cactgactac 301 tgcaacagga tcgacttgag ggtgcccagt ggtcacctca aggagcctga gcacccgtcc 361 atgtggggcc cggtggagct ggtaggcatc atcgccggcc cggtgttcct cctgttcctc 421 atcatcatca ttgttttcct tgtcattaac tatcatcagc gtgtctatca caaccgccag 481 agactggaca tggaagatcc ctcatgtgag atgtgtctct ccaaagacaa gacgctccag 541 gatcttgtct acgatctctc cacctcaggg tctggctcag ggttacccct ctttgtccag 601 cgcacagtgg cccgaaccat cgttttacaa gagattattg gcaagggtcg gtttggggaa 661 gtatggcggg gccgctggag gggtggtgat gtggctgtga aaatattctc ttctcgtgaa 721 gaacggtctt ggttcaggga agcagagata taccagacgg tcatgctgcg ccatgaaaac 781 atccttggat ttattgctgc tgacaataaa gataatggca cctggacaca gctgtggctt 841 gtttctgact atcatgagca cgggtccctg tttgattatc tgaaccggta cacagtgaca 901 attgagggga tgattaagct ggccttgtct gctgctagtg ggctggcaca cctgcacatg 961 gagatcgtgg gcacccaagg gaagcctgga attgctcatc gagacttaaa gtcaaagaac 1021 attctggtga agaaaaatgg catgtgtgcc atagcagacc tgggcctggc tgtccgtcat 1081 gatgcagtca ctgacaccat tgacattgcc ccgaatcaga gggtggggac caaacgatac 1141 atggcccctg aagtacttga tgaaaccatt aatatgaaac actttgactc ctttaaatgt 1201 gctgatattt atgccctcgg gcttgtatat tgggagattg ctcgaagatg caattctgga 1261 ggagtccatg aagaatatca gctgccatat tacgacttag tgccctctga cccttccatt 1321 gaggaaatgc gaaaggttgt atgtgatcag aagctgcgtc ccaacatccc caactggtgg 1381 cagagttatg aggcactgcg ggtgatgggg aagatgatgc gagagtgttg gtatgccaac 1441 ggcgcagccc gcctgacggc cctgcgcatc aagaagaccc tctcccagct cagcgtgcag 1501 gaagacgtga agatctaa // LOCUS HSU14755 1713 bp mRNA PRI 25-SEP-1994 DEFINITION Human LIM domain transcription factor LIM-1 (hLIM-1) mRNA, complete cds. ACCESSION U14755 NID g549845 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1713) AUTHORS Dong,W., Wu,H., Xu,Y., Heng,H., Tsui,L. and Minden,M. TITLE Cloning, Characterization and Chromosome Mapping of Human LIM-1, the Human Homologue of LIM-1 JOURNAL Unpublished REFERENCE 2 (bases 1 to 1713) AUTHORS Dong,W. TITLE Direct Submission JOURNAL Submitted (16-SEP-1994) Weifeng Dong, Ontario Cancer Institute, Department of Medicine, 500 Sherbourne Street, Toronto, Ontario, M4X 1K9, Canada FEATURES Location/Qualifiers source 1..1713 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hu-Lim-1" /clone_lib="human brain cDNA library" /tissue_type="brain" /chromosome="11" /map="11p12" gene 433..1647 /gene="hLIM-1" CDS 433..1647 /gene="hLIM-1" /codon_start=1 /product="LIM domain transcription factor LIM-1" /db_xref="PID:g549846" /translation="MVHCAGCKRPILDRFLLNVLDRAWHVKCVQCCECKCNLTEKCFS REGKLYCKNDFFRCFGTKCAGCRQGISPSDLVRRARSKVFHLNCFTCMMCNKQLSTGE ELYIIDENKFVCKEDYLSNSSVAKENSLHSATTGSDPSLSPDSQDPSQDDAKDSESAN VSDKEAGSNENDDQNLGAKRRGPGTTIKAKQLETLKAAFAATPKPTRHIREQLAQETG LNMRVIQVWFQNRRSKERRMKQLSALAGHAFFRSPRRMRPLVDRLEPGELIPNGPFSF YGDYQSEYYGPGGNYDFFPQGPPSSQAQTPVDLPFVPSSGPSGTPLGGLEHPLPGHHP SSEAQRFTDILAHPPGDSPSPEPSLPGPLHSMSAEVFGPSPPFSSLSVNGGASYGNHL SHPPEMNEAAVW" BASE COUNT 321 a 581 c 479 g 332 t ORIGIN 1 gaattcccgg cgctttcctc gcaacccgag ctcggcgagt cgtcgtcttc ttcttctccg 61 tttttattta tttatttccg ttcccgccgc cgttctcgct gaccttcact cctccgcggg 121 ctctgagcag aagggtcgca ttctctcccg cctgagactt cttttcctcg ccccgggagc 181 tcaggcggcg cgctccagcc cggggccccg gactccccgg ctgcacactt cactgagacg 241 cccccaggcc cgatcagcct cgttctccac cctactttga tttcctggtg cgagttttgg 301 cttgcacggc cgagtgtgtg tcctcttttt ggagagactg gggagctcgt gccgattgtc 361 ttcaggagtc atcccctggg ctctactttg cccctctctc tctctgggcc tcatcagacc 421 aaaccaaaga ccatggttca ctgtgccggc tgcaaaaggc ccatcctgga ccgctttctc 481 ttgaacgtgc tggacagggc ctggcacgtc aagtgcgtcc agtgctgtga atgtaaatgc 541 aacctgaccg agaagtgctt ctccagggaa ggcaaactct actgcaagaa cgacttcttc 601 cggtgtttcg gtaccaaatg cgcaggctgc cgtcagggca tctcccctag cgacctggtg 661 cggagagcgc ggagcaaagt gtttcacctg aactgcttca cctgcatgat gtgtaacaag 721 cagctctcca ctggcgagga actctacatc atcgacgaga ataagttcgt ctgcaaagag 781 gattacctaa gtaacagcag tgttgccaaa gagaacagcc ttcactcggc caccacgggc 841 agtgacccca gtttgtctcc ggattcccaa gacccgtcgc aggacgacgc caaggactcg 901 gagagcgcca acgtgtcgga caaggaagcg ggtagcaacg agaatgacga ccagaacctg 961 ggcgccaagc ggcggggacc cggcaccacc atcaaagcca agcagctgga gacgctgaag 1021 gccgccttcg ctgctacacc caagcccacc cgccacatcc gcgagcagct ggcgcaggag 1081 accggcctca acatgcgcgt cattcaggtc tggttccaga accggcgctc caaggagcgg 1141 aggatgaagc agctgagcgc cctggccggc cacgccttct tccgcagtcc gcgccggatg 1201 cggccgctgg tggaccgcct ggagccgggc gagctcatcc ccaatggtcc cttctccttc 1261 tacggagatt accagagcga gtactacggg cccgggggca actacgactt cttcccgcaa 1321 ggccccccgt cctcgcaggc ccagacacca gtggacctac ccttcgtgcc gtcatctggg 1381 ccgtccggga cgcccctggg tggcctggag cacccgctgc cgggccacca cccgtcgagc 1441 gaggcgcagc ggtttaccga catcctggcg cacccacccg gggactcgcc cagccccgag 1501 cccagcctgc ccgggcctct gcactccatg tcggccgagg tcttcggacc cagcccgccc 1561 ttctcgtcgc tgtcggtcaa cggtggggcg agctacggaa accacctgtc ccaccccccc 1621 gaaatgaacg aggcggccgt gtggtagcgg ggtctcgcac ggtctgcgga gttcgtggtt 1681 gtacagaaat gaacctttat ttaagaaaaa tag // LOCUS HSU14910 1415 bp mRNA PRI 07-DEC-1994 DEFINITION Human RPE-retinal G protein-coupled receptor (rgr) mRNA, complete cds. ACCESSION U14910 NID g595826 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1415) AUTHORS Shen,D., Jiang,M., Hao,W., Tao,L., Salazar,M. and Fong,H.K.W. TITLE A human opsin-related gene that encodes a retinaldehyde-binding protein JOURNAL Biochemistry (1994) In press REFERENCE 2 (bases 1 to 1415) AUTHORS Fong,H.K. TITLE Direct Submission JOURNAL Submitted (17-SEP-1994) Henry K. Fong, Departments of Ophthalmology and Microbiology, University of Southern California/Doheny Eye Institute, 1355 San Pablo Street, Los Angeles, CA 90033, USA FEATURES Location/Qualifiers source 1..1415 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HRGR1-2" /clone_lib="human retina cDNA library" /tissue_type="retina" /dev_stage="mature" gene 39..914 /gene="rgr" CDS 39..914 /gene="rgr" /note="putative RPE-retinal G protein-coupled receptor" /codon_start=1 /db_xref="PID:g595827" /translation="MAETSALPTGFGELEVLAVGMVLLVEALSGLSLNTLTIFSFCKT PELRTPCHLLVLSLALADSGISLNALVAATSSLLRRWPYGSDGCQAHGFQGFVTALAS ICSSAAIAWGRYHHYCTRSQLAWNSAVSLVLFVWLSSAFWAALPLLGWGHYDYEPLGT CCTLDYSKGDRNFTSFLFTMSFFNFAMPLFITITSYSLMEQKLGKSGHLQVNTTLPAR TLLLGWGPYAILYLYAVIADVTSISPKLQMVPALIAKMVPTINAINYALGNEMVCRGI WQCLSPQKREKDRTK" polyA_site 1415 /note="11 A residues" BASE COUNT 297 a 433 c 361 g 324 t ORIGIN 1 agagacagct gggccactgg cagtgaggga gagtgaggat ggcagagacc agtgccctgc 61 ccactggctt cggggagctc gaggtgctgg ctgtggggat ggtgctactg gtggaagctc 121 tctccggtct cagcctcaat accctgacca tcttctcttt ctgcaagacc ccggagctgc 181 ggactccctg ccacctactg gtgctgagct tggctcttgc ggacagtggg atcagcctga 241 atgccctcgt tgcagccaca tccagccttc tccggcgctg gccctacggc tcggacggct 301 gccaggctca cggcttccag ggctttgtga cagcgttggc cagcatctgc agcagtgcag 361 ccatcgcatg ggggcgttat caccactact gcacccgtag ccagctggcc tggaactcag 421 ccgtctctct ggtgctcttc gtgtggctgt cttctgcctt ctgggcagct ctgccccttc 481 tgggttgggg tcactatgac tatgagccac tggggacatg ctgcaccctg gactactcca 541 agggggacag aaacttcacc agcttcctct tcaccatgtc cttcttcaac ttcgccatgc 601 ccctcttcat cacgatcact tcctacagtc tcatggagca gaaactgggg aagagtggcc 661 atctccaggt aaacaccact ctgccagcaa ggacgctgct gctcggctgg ggcccctatg 721 ccatcctgta tctatacgca gtcatcgcag acgtgacttc catctccccc aaactgcaga 781 tggtgcccgc cctcattgcc aaaatggtgc ccacgatcaa tgccatcaac tatgccctgg 841 gcaatgagat ggtctgcagg ggaatctggc agtgcctctc accgcagaag agggagaagg 901 accgaaccaa gtgagcctgc caccctggag tgagccccag gccaggaggc tgttccagga 961 gtcctgccca gcagcctcgg tggccaagcc cagacactca cccaccttcc ccagtggccc 1021 cgtggatcct ggtcctaggc tggacacagg attcagaaag acaccaggct gcacagaaag 1081 agccagatgg acctgagtgt cggtcacagc cccctacact caaggctgag aggcctcagg 1141 aaagtcattc ctttttaaaa ataataataa atgtaagggg gtacagtgca gttttgttac 1201 atggatagat tgcctagtgg tgaagtctgg gcttttagtg taaccatcac cctaataata 1261 tacgttgtac ccattaagtt atttctcatc cctcaccccc tcccaccttg tcacccttct 1321 gagtctccaa tgtctattat tccacactcc atgtccacgt gtacacatta tttagctccc 1381 acttacaagt gagaacatgt ggtatttgac tttca // LOCUS HSU14957 1463 bp mRNA PRI 06-APR-1995 DEFINITION Human 53K isoform of Type II phosphatidylinositol-4-phosphate 5-kinase (PIPK) mRNA, complete cds. ACCESSION U14957 NID g758696 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1463) AUTHORS Boronenkov,I.V. and Anderson,R.A. TITLE The sequence of phosphatidylinositol-4-phosphate 5-kinase defines a novel family of lipid kinases JOURNAL J. Biol. Chem. 270 (7), 2881-2884 (1995) MEDLINE 95155363 REFERENCE 2 (bases 1 to 1463) AUTHORS Boronenkov,I. TITLE Direct Submission JOURNAL Submitted (20-SEP-1994) Igor V. Boronenkov, Pharmacology, Medical School, University of Wisconsin-Madison, 3780 MSC, 1300 University Av., Madison, WI 53706, USA FEATURES Location/Qualifiers source 1..1463 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda gt11 library (Clontech HL1075b)" /tissue_type="placenta" gene 23..1240 /gene="PIPK" CDS 23..1240 /gene="PIPK" /codon_start=1 /product="53K isoform of Type II phosphatidylinositol-4-phosphate 5-kinase" /db_xref="PID:g758697" /translation="MATPGNLGSSVLASKTKTKKKHFVAQKVKLFRASDPLLSVLMWG VNHSINELSHVQIPVMLMPDDFKAYSKIKVDNHLFNKENMPSHFKFKEYCPMVFRNLR ERFGIDDQDFQNSLTRSAPLPNDSQARSGARFHTSYDKRYIIKTITSEDVAEMHNILK KYHQYIVECHGITLLPQFLGMYRLNVDGVEIYVIVTRNVFSHRLSVYRKYDLKGSTVA REASDKEKAKELPTLKDNDFINEGQKIYIDDNNKKVFLEKLKKDVEFLAQLKLMDYSL LVGIHDVERAEQEEVECEENDWGGGGRDGWHPPGGTPPDSPGNTLNSSPPLAPGEFDP NIDVYGIKCHENSPRKEVYFMAIIDILTHYDAKKKAAHAAKTVKHGCAEISTVNPEQY SKRFLDFIGHILT" BASE COUNT 428 a 338 c 363 g 334 t ORIGIN 1 ggaggggaca taggaggcgg ccatggcgac ccccggcaac ctagggtcct ctgtcctggc 61 gagcaagacc aagaccaaga agaagcactt cgtagcgcag aaagtgaagc tgtttcgggc 121 cagcgacccg ctgctcagcg tcctcatgtg gggggtaaac cactcgatca atgaactgag 181 ccatgttcaa atccctgtta tgttgatgcc agatgacttc aaagcctatt caaaaataaa 241 ggtggacaat caccttttta acaaagaaaa catgccgagc catttcaagt ttaaggaata 301 ctgcccgatg gtcttccgta acctgcggga gaggtttgga attgatgatc aagatttcca 361 gaattccctg accaggagcg cacccctccc caacgactcc caggcccgca gtggagctcg 421 ttttcacact tcctacgaca aaagatacat catcaagact attaccagtg aagacgtggc 481 cgaaatgcac aacatcctga agaaatacca ccagtacata gtggaatgtc atgggatcac 541 ccttcttccc cagttcttgg gcatgtaccg gcttaatgtt gatggagttg aaatatatgt 601 gatagttaca agaaatgtat tcagccaccg tttgtctgtg tataggaaat acgacttaaa 661 gggctctaca gtggctagag aagctagtga caaagaaaag gccaaagaac tgccaactct 721 gaaagataat gatttcatta atgagggcca aaagatttat attgatgaca acaacaagaa 781 ggtcttcctg gaaaaactaa aaaaggatgt tgagtttctg gcccagctga agctcatgga 841 ctacagtctg ctggtgggaa ttcatgatgt ggagagagcc gaacaggagg aagtggagtg 901 tgaggagaac gattggggag gaggagggcg agacggatgg cacccacccg gtggaacccc 961 cccagatagc cccgggaata cactgaacag ctcaccaccc ctggctcccg gggagttcga 1021 tccgaacatc gacgtctatg gaattaagtg ccatgaaaac tcgcctagga aggaggtgta 1081 cttcatggca attattgaca tccttactca ttatgatgca aaaaagaaag ctgcccatgc 1141 tgcaaaaact gttaaacatg gctgcgcgga gatctccacc gtgaacccag aacagtattc 1201 aaagcgcttt ttggacttta ttggccacat cttgacgtaa cctcctgcgc agcctcggac 1261 agacatgaac attggatgga cagaggtggc ttcggtgtag gaaaaatgaa aaccaaactc 1321 agtgaagtac tcatcttgca ggaagcaaac ctccttgttt acatcttcag gccaagatga 1381 ctgatttggg ggctactcgc tttacagcta cctgattttc ccagcatcgt tgtagctatt 1441 tctgactttg tgtatatgtg tgt // LOCUS HSU14966 987 bp mRNA PRI 26-JAN-1996 DEFINITION Human ribosomal protein L5 mRNA, complete cds. ACCESSION U14966 NID g550012 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 987) AUTHORS Frigerio,J.M., Dagorn,J.C. and Iovanna,J.L. TITLE Cloning, sequencing and expression of the L5, L21, L27a, L28, S5, S9, S10 and S29 human ribosomal protein mRNAs JOURNAL Biochim. Biophys. Acta 1262 (1), 64-68 (1995) MEDLINE 95290496 REFERENCE 2 (bases 1 to 987) AUTHORS Iovanna,J.L. TITLE Direct Submission JOURNAL Submitted (21-SEP-1994) Juan L. Iovanna, Inserm U315, 46, Bd de la Gaye, Marseille 13009, France FEATURES Location/Qualifiers source 1..987 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /tissue_type="colon tumor" /dev_stage="adult" CDS 31..924 /codon_start=1 /product="ribosomal protein L5" /db_xref="PID:g550013" /translation="MGFVKVVKNKAYFKRYQVKFRRRREGKTDYYARKRLVIQDKNKY NTPKYRMIVRVTNRDIICQIAYARIEGDMIVCARYAHELPKYGVKVGLTNYAAAYCTG LLLARRLLNRFGMDKIYEGQVEVTGDEYNVESIDGQPGAFTCYLDAGLARTTTGNKVF GALKGAVDGGLSIPHSTKRFPGYDSESKEFNAEVHRKHIMGQNVADYMRYLMEEDEDA YKKQFSQYIKNSVTPDMMEEMYKKAHAAIRENPVYEKKPKKEVKKKRWNRPKMSLAQK KDRVAQKKASFLRAQERAAES" polyA_signal 961..966 BASE COUNT 322 a 184 c 247 g 234 t ORIGIN 1 gagcagcgga cgccggtctc tgttccgcag atggggtttg ttaaagttgt taagaataag 61 gcctacttta agagatacca agtgaaattt agaagacgac gagagggtaa aactgattat 121 tatgctcgga aacgcttggt gatacaagat aaaaataaat acaacacacc caaatacagg 181 atgatagttc gtgtgacaaa cagagatatc atttgtcaga ttgcttatgc ccgtatagag 241 ggggatatga tagtctgcgc acgttatgca cacgaactgc caaaatatgg tgtgaaggtt 301 ggcctgacaa attatgctgc agcatattgt actggcctgc tgctggcccg caggcttctc 361 aataggtttg gcatggacaa gatctatgaa ggccaagtgg aggtgactgg tgatgaatac 421 aatgtggaaa gcattgatgg tcagccaggt gccttcacct gctatttgga tgcaggcctt 481 gccagaacta ccactggcaa taaagttttt ggtgccctga agggagctgt ggatggaggc 541 ttgtctatcc ctcacagtac caaacgattc cctggttatg attctgaaag caaggaattt 601 aatgcagaag tacatcggaa gcacatcatg ggccagaatg ttgcagatta catgcgctac 661 ttaatggaag aagatgaaga tgcttacaag aaacagttct ctcaatacat aaagaacagc 721 gtaactccag acatgatgga ggagatgtat aagaaagctc atgctgctat acgagagaat 781 ccagtctatg aaaagaagcc caagaaagaa gttaaaaaga agaggtggaa ccgtcccaaa 841 atgtcccttg ctcagaagaa ggatcgggta gctcaaaaga aggcaagctt cctcagagct 901 caggagcggg ctgctgagag ctaaacccag caattttcta tgattttttc agatatagat 961 aataaactta tgaacagcaa ctaaaaa // LOCUS HSU14967 552 bp mRNA PRI 26-JAN-1996 DEFINITION Human ribosomal protein L21 mRNA, complete cds. ACCESSION U14967 NID g550014 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 552) AUTHORS Frigerio,J.M., Dagorn,J.C. and Iovanna,J.L. TITLE Cloning, sequencing and expression of the L5, L21, L27a, L28, S5, S9, S10 and S29 human ribosomal protein mRNAs JOURNAL Biochim. Biophys. Acta 1262 (1), 64-68 (1995) MEDLINE 95290496 REFERENCE 2 (bases 1 to 552) AUTHORS Iovanna,J.L. TITLE Direct Submission JOURNAL Submitted (21-SEP-1994) Juan L. Iovanna, Inserm U315, 46, Bd de la Gaye, Marseille 13009, France FEATURES Location/Qualifiers source 1..552 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /tissue_type="colon tumor" /dev_stage="adult" CDS 30..512 /codon_start=1 /product="ribosomal protein L21" /db_xref="PID:g550015" /translation="MTNTKGKRRGTRYMFSRPFRKHGVVPLATYMRIYKKGDIVDIKG MGTVQKGMPHKCYHGKTGRVYNVTQHAVGIVVNKQVKGKILAKRINVRIEHIKHSKSR DSFLKRVKENDQKKKEAKEKGTWVQLKRQPAPPREAHFVRTNGKEPELLEPIPYEFMA " polyA_signal 530..534 BASE COUNT 198 a 106 c 129 g 119 t ORIGIN 1 gaaccgccat cttccagtaa ttcgccaaaa tgacgaacac aaagggaaag aggagaggca 61 cccgatatat gttctctagg ccttttagaa aacatggagt tgttcctttg gccacatata 121 tgcgaatcta taagaaaggt gatattgtag acatcaaggg aatgggtact gttcaaaaag 181 gaatgcccca caagtgttac catggcaaaa ctggaagagt ctacaatgtt acccagcatg 241 ctgttggcat tgttgtaaac aaacaagtta agggcaagat tcttgccaag agaattaatg 301 tgcgtattga gcacataaag cactctaaga gccgagatag cttcctgaaa cgtgtgaagg 361 aaaatgatca gaaaaagaaa gaagccaaag agaaaggtac ctgggttcaa ctaaagcgcc 421 agcctgctcc acccagagaa gcacactttg tgagaaccaa tgggaaggag cctgagctgc 481 tggaacctat tccctatgaa ttcatggcat aataggtgtt aaaaaaaaaa ataaaggacc 541 tctgggctaa aa // LOCUS HSU14968 507 bp mRNA PRI 26-JAN-1996 DEFINITION Human ribosomal protein L27a mRNA, complete cds. ACCESSION U14968 NID g550016 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 507) AUTHORS Frigerio,J.M., Dagorn,J.C. and Iovanna,J.L. TITLE Cloning, sequencing and expression of the L5, L21, L27a, L28, S5, S9, S10 and S29 human ribosomal protein mRNAs JOURNAL Biochim. Biophys. Acta 1262 (1), 64-68 (1995) MEDLINE 95290496 REFERENCE 2 (bases 1 to 507) AUTHORS Iovanna,J.L. TITLE Direct Submission JOURNAL Submitted (21-SEP-1994) Juan L. Iovanna, Inserm U315, 46, Bd de la Gaye, Marseille 13009, France FEATURES Location/Qualifiers source 1..507 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /tissue_type="colon tumor" /dev_stage="adult" CDS 17..463 /codon_start=1 /product="ribosomal protein L27a" /db_xref="PID:g550017" /translation="MPSRLRKTRKLRGHVSHGHGRIGKHRKHPGGRGNAGGLHHHRIN FDKYHPGYFGKVGMKHYHLKRNQSFCPTVNLDKLWTLVSEQTRVNAAKNKTGAAPIID VVRSGYYKVLGKGKLPKQPVIVKAKFFSRRAEEKIKSVGGACVLVA" BASE COUNT 140 a 122 c 143 g 102 t ORIGIN 1 cgtctgggct gccaacatgc catccagact gaggaagacc cggaaactta ggggccacgt 61 gagccacggc cacggccgca taggcaagca ccggaagcac cccggcggcc gcggtaatgc 121 tggtggtctg catcaccacc ggatcaactt cgacaaatac cacccaggct actttgggaa 181 agttggtatg aagcattacc acttaaagag gaaccagagc ttctgcccaa ctgtcaacct 241 tgacaaattg tggactttgg tcagtgaaca gacacgggtg aatgctgcta aaaacaagac 301 tggggctgct cccatcattg atgtggtgcg atcgggctac tacaaagttc tgggaaaggg 361 aaagctccca aagcagcctg tcatcgtgaa ggccaaattc ttcagcagaa gagctgagga 421 gaagattaag agtgttgggg gggcctgtgt cctggtggct tgaagcacat ggagggagtt 481 tcattaaatg ctaactactt ttaaaaa // LOCUS HSU14969 485 bp mRNA PRI 26-JAN-1996 DEFINITION Human ribosomal protein L28 mRNA, complete cds. ACCESSION U14969 NID g550018 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 485) AUTHORS Frigerio,J.M., Dagorn,J.C. and Iovanna,J.L. TITLE Cloning, sequencing and expression of the L5, L21, L27a, L28, S5, S9, S10 and S29 human ribosomal protein mRNAs JOURNAL Biochim. Biophys. Acta 1262 (1), 64-68 (1995) MEDLINE 95290496 REFERENCE 2 (bases 1 to 485) AUTHORS Iovanna,J.L. TITLE Direct Submission JOURNAL Submitted (21-SEP-1994) Juan L. Iovanna, Inserm U315, 46, Bd de la Gaye, Marseille 13009, France FEATURES Location/Qualifiers source 1..485 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /tissue_type="colon tumor" /dev_stage="adult" CDS 28..441 /codon_start=1 /product="ribosomal protein L28" /db_xref="PID:g550019" /translation="MSAHLQWMVVRNCSSFLIKRNKQTYSTEPNNLKARNSFRYNGLI HRKTVGVEPAADGKGVVVVIKRRSGQRKPATSYVRTTINKNARATLSSIRHMIRKNKY RPDLRMAAIRRASAILRTQKPVMVKRKRTRPTKSS" polyA_signal 460..465 BASE COUNT 117 a 158 c 135 g 75 t ORIGIN 1 gtcgccgctg cggagggagc cgccgccatg tctgcgcatc tgcaatggat ggtcgtgcgg 61 aactgctcca gtttcctgat caagaggaat aagcagacct acagcactga gcccaataac 121 ttgaaggccc gcaattcctt ccgctacaac ggactgattc accgcaagac tgtgggcgtg 181 gagccggcag ccgacggcaa aggtgtcgtg gtggtcatta agcggagatc cggccagcgg 241 aagcctgcca cctcctatgt gcggaccacc atcaacaaga atgctcgcgc cacgctcagc 301 agcatcagac acatgatccg caagaacaag taccgccccg acctgcgcat ggcagccatc 361 cgcagggcca gcgccatcct gcgcacgcag aagcctgtga tggtgaagag gaagcggacc 421 cgccccacca agagctcctg agccccctgc ccccagagca ataaagtcag ctggctttct 481 caaaa // LOCUS HSU14970 705 bp mRNA PRI 26-JAN-1996 DEFINITION Human ribosomal protein S5 mRNA, complete cds. ACCESSION U14970 NID g550020 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 705) AUTHORS Frigerio,J.M., Dagorn,J.C. and Iovanna,J.L. TITLE Cloning, sequencing and expression of the L5, L21, L27a, L28, S5, S9, S10 and S29 human ribosomal protein mRNAs JOURNAL Biochim. Biophys. Acta 1262 (1), 64-68 (1995) MEDLINE 95290496 REFERENCE 2 (bases 1 to 705) AUTHORS Iovanna,J.L. TITLE Direct Submission JOURNAL Submitted (21-SEP-1994) Juan L. Iovanna, Inserm U315, 46, Bd de la Gaye, Marseille 13009, France FEATURES Location/Qualifiers source 1..705 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /tissue_type="colon tumor" /dev_stage="adult" CDS 38..652 /codon_start=1 /product="ribosomal protein S5" /db_xref="PID:g550021" /translation="MTEWETAAPAVAETPDIKLFGKWSTDDVQINDISLQDYIAVKEK YAKYLPHSAGRYAANAFRKAQCPIVERLTNSMMMHGRNNGKKLMTVRIVKHAFEIIHL LTGENPLQVLVNAIINSGPREDSTRIGRAGTVRRQAVDVSPLRRVNQAIWLLCTGARE AAFRNIKTIAECLADELINAAKGSSNSYAIKKKDELERVAKSNR" polyA_signal 670..675 BASE COUNT 165 a 205 c 199 g 136 t ORIGIN 1 cgccgagtga cagagacgct caggctgtgt tctcaggatg accgagtggg agacagcagc 61 accagcggtg gcagagaccc cagacatcaa gctctttggg aagtggagca ccgatgatgt 121 gcagatcaat gacatttccc tgcaggatta cattgcagtg aaggagaagt atgccaagta 181 cctccctcac agtgcagggc ggtatgccgc aaacgctttc cgcaaagctc agtgtcccat 241 tgtggagcgc ctcactaact ccatgatgat gcacggccgc aacaacggca agaagctcat 301 gactgtgcgc atcgtcaagc atgccttcga gatcatacac ctgctcacgg gcgagaaccc 361 tctgcaggtc ctggtgaacg ccatcatcaa cagtggtccc cgggaggact ccacacgcat 421 tgggcgcgcc gggactgtga gacgacaggc tgtggatgtg tcccccctgc gccgtgtgaa 481 ccaggccatc tggctgctgt gcacaggcgc tcgtgaggct gccttccgga acattaagac 541 cattgctgag tgcctggcag atgagctcat caatgctgcc aagggctcct cgaactccta 601 tgccattaag aagaaggacg agctggagcg tgtggccaag tccaaccgct gattttccag 661 ctgctgccca ataaacctgt ctgccctttg ggatcccagc caaaa // LOCUS HSU14971 692 bp mRNA PRI 26-JAN-1996 DEFINITION Human ribosomal protein S9 mRNA, complete cds. ACCESSION U14971 NID g550022 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 692) AUTHORS Frigerio,J.M., Dagorn,J.C. and Iovanna,J.L. TITLE Cloning, sequencing and expression of the L5, L21, L27a, L28, S5, S9, S10 and S29 human ribosomal protein mRNAs JOURNAL Biochim. Biophys. Acta 1262 (1), 64-68 (1995) MEDLINE 95290496 REFERENCE 2 (bases 1 to 692) AUTHORS Iovanna,J.L. TITLE Direct Submission JOURNAL Submitted (21-SEP-1994) Juan L. Iovanna, Inserm U315, 46, Bd de la Gaye, Marseille 13009, France FEATURES Location/Qualifiers source 1..692 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /tissue_type="colon tumor" /dev_stage="adult" CDS 36..620 /codon_start=1 /product="ribosomal protein S9" /db_xref="PID:g550023" /translation="MPVARSWVCRKTYVTPRRPFEKSRLDQELKLIGEYGLRNKREVW RVKFTLAKIRKAARELLTLDEKDPRRLFEGNALLRRLVRLGVLDEGKMKLDYILGLKI EDFLERRLQTQVFKLGLAKSIHHARVLIRQRHIRVRKQVVNIPSFIVRLDSQKHIDFS LRSPTGVGRPGRVKRKNAKKGQGGAGAGDDEEED" polyA_signal 668..672 BASE COUNT 148 a 190 c 222 g 132 t ORIGIN 1 cccgcgcagg cgcagacggt ggaagcggac gcaacatgcc agtggcccgg agctgggttt 61 gtcgcaaaac ttatgtgacc ccgcggagac ccttcgagaa atctcgtctc gaccaagagc 121 tgaagctgat cggcgagtat gggctccgga acaaacgtga ggtctggagg gtcaaattta 181 ccctggccaa gatccgcaag gccgcccggg aactgctgac gcttgatgag aaggacccac 241 ggcgtctgtt cgaaggcaac gccctgctgc ggcggctggt ccgattgggg gtgctggatg 301 agggcaagat gaagctggat tacatcctgg gcctgaagat agaggatttc ttagagagac 361 gcctgcagac ccaggtcttc aagctgggct tggccaagtc catccaccac gctcgcgtgc 421 tgatccgcca gcgccatatc agggtccgca agcaggtggt gaacatcccg tccttcattg 481 tccgcctgga ttcccagaag cacattgact tctctctgcg ctctcctacg ggggttggcc 541 gcccgggccg cgtgaagagg aagaatgcca agaagggcca gggtggggct ggggctggag 601 acgacgagga ggaggattaa gtccacctgt ccctcctggg ctgctggatt gtctcgtttt 661 cctgccaaat aaacaggatc agcgctttaa aa // LOCUS HSU14972 570 bp mRNA PRI 26-JAN-1996 DEFINITION Human ribosomal protein S10 mRNA, complete cds. ACCESSION U14972 NID g550024 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 570) AUTHORS Frigerio,J.M., Dagorn,J.C. and Iovanna,J.L. TITLE Cloning, sequencing and expression of the L5, L21, L27a, L28, S5, S9, S10 and S29 human ribosomal protein mRNAs JOURNAL Biochim. Biophys. Acta 1262 (1), 64-68 (1995) MEDLINE 95290496 REFERENCE 2 (bases 1 to 570) AUTHORS Iovanna,J.L. TITLE Direct Submission JOURNAL Submitted (21-SEP-1994) Juan L. Iovanna, Inserm U315, 46, Bd de la Gaye, Marseille 13009, France FEATURES Location/Qualifiers source 1..570 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /tissue_type="colon tumor" /dev_stage="adult" CDS 17..514 /codon_start=1 /product="ribosomal protein S10" /db_xref="PID:g550025" /translation="MLMPKKNRIAIYELLFKEGVMVAKKDVHMPKHPELADKNVPNLH VMKAMQSLKSRGYVKEQFAWRHFYWYLTNEGIQYLRDYLHLPPEIVPATLRRSRPETG RPRPKGLEGERPARLTRGEADRDTYRRSAVPPGADKKAEAGAGSATEFQFRGGFGRGR GQPPQ" polyA_signal 540..545 BASE COUNT 150 a 143 c 158 g 119 t ORIGIN 1 gcctgcagcc gcagagatgt tgatgcctaa gaagaaccgg attgccatct atgaactcct 61 ttttaaggag ggagtcatgg tggccaagaa ggatgtccac atgcctaagc acccggagct 121 ggcagacaag aatgtgccca accttcatgt catgaaggcc atgcagtctc tcaagtcccg 181 aggctacgtg aaggaacagt ttgcctggag acatttctac tggtacctta ccaatgaggg 241 tatccagtat ctccgtgatt accttcatct gcccccggag attgtgcctg ccaccctacg 301 ccgtagccgt ccagagactg gcaggcctcg gcctaaaggt ctggagggtg agcgacctgc 361 gagactcaca agaggggaag ctgacagaga tacctacaga cggagtgctg tgccacctgg 421 tgccgacaag aaagccgagg ctggggctgg gtcagcaacc gaattccagt ttagaggcgg 481 atttggtcgt ggacgtggtc agccacctca gtaaaattgg agaggattct tttgcattga 541 ataaacttac agccaaaaaa ccttaaaaaa // LOCUS HSU14973 268 bp mRNA PRI 26-JAN-1996 DEFINITION Human ribosomal protein S29 mRNA, complete cds. ACCESSION U14973 NID g550026 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 268) AUTHORS Frigerio,J.M., Dagorn,J.C. and Iovanna,J.L. TITLE Cloning, sequencing and expression of the L5, L21, L27a, L28, S5, S9, S10 and S29 human ribosomal protein mRNAs JOURNAL Biochim. Biophys. Acta 1262 (1), 64-68 (1995) MEDLINE 95290496 REFERENCE 2 (bases 1 to 268) AUTHORS Iovanna,J.L. TITLE Direct Submission JOURNAL Submitted (21-SEP-1994) Juan L. Iovanna, Inserm U315, 46, Bd de la Gaye, Marseille 13009, France FEATURES Location/Qualifiers source 1..268 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /tissue_type="colon tumor" /dev_stage="adult" CDS 31..201 /codon_start=1 /product="ribosomal protein S29" /db_xref="PID:g550027" /translation="MGHQQLYWSHPRKFGQGSRSCRVCSNRHGLIRKYGLNMCRQCFR QYAKDIGFIKLD" BASE COUNT 63 a 65 c 64 g 76 t ORIGIN 1 tttttacctc gttgcactgc tgagagcaag atgggtcacc agcagctgta ctggagccac 61 ccgcgaaaat tcggccaggg ttctcgctct tgtcgtgtct gttcaaaccg gcacggtctg 121 atccggaaat atggcctcaa tatgtgccgc cagtgtttcc gtcagtacgc gaaggatatc 181 ggtttcatta agttggacta aatgctcttc cttcagagga ttatccgggg catctactca 241 atgaaaaacc atgataattc tttgtata // LOCUS HSU15008 479 bp mRNA PRI 10-DEC-1994 DEFINITION Human SnRNP core protein Sm D2 mRNA, complete cds. ACCESSION U15008 NID g600747 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 479) AUTHORS Lehmeier,T., Raker,V.A., Hermann,H. and Luehrmann,R. TITLE cDNA cloning of the Sm proteins D2 and D3 from human small nuclear ribonucleoproteins: evidence for a direct D1-D2 interaction JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91, 12317-12321 (1994) MEDLINE 95083692 REFERENCE 2 (bases 1 to 479) AUTHORS Raker,V.A. TITLE Direct Submission JOURNAL Submitted (22-SEP-1994) Veronica A. Raker, Institut fuer Molekularbiologie und Tumorforschung, Emil-Mannkopff Strasse 2, Marburg, 35037, Germany FEATURES Location/Qualifiers source 1..479 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pBLSD2" /cell_line="HeLa S3" terminator one-of(19,385) CDS 31..387 /note="SnRNP core protein" /codon_start=1 /product="Sm D2" /db_xref="PID:g600748" /translation="MSLLNKPKSEMTPEELQKREEEEFNTGPLSVLTQSVKNNTQVLI NCRNNKKLLGRVKAFDRHCNMVLENVKEMWTEVPKSGKGKKKSKPVNKDRYISKMFLR GDSVIVVLRNPLIAGK" BASE COUNT 131 a 124 c 132 g 92 t ORIGIN 1 cgggagtgaa cggagagcgt agtgaccatc atgagcctcc tcaacaagcc caagagtgag 61 atgaccccag aggagctgca gaagcgagag gaggaggaat ttaacaccgg tccactctct 121 gtgctcacac agtcagtcaa gaacaatacc caagtgctca tcaactgccg caacaataag 181 aaactcctgg gccgcgtgaa ggccttcgat aggcactgca acatggtgct ggagaacgtg 241 aaggagatgt ggactgaggt acccaagagt ggcaagggca agaagaagtc caagccagtc 301 aacaaagacc gctacatctc caagatgttc ctgcgcgggg actcagtcat cgtggtcctg 361 cggaacccgc tcatcgccgg caagtagggg ccgcctgtct gttgacagaa ctcactcctc 421 tgtcctatga agaccgctgc cattggtgtt gagaataata aagctctgtg tttttttct // LOCUS HSU15009 626 bp mRNA PRI 10-DEC-1994 DEFINITION Human SnRNP core protein Sm D3 mRNA, complete cds. ACCESSION U15009 NID g600749 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 626) AUTHORS Lehmeier,T., Raker,V.A., Hermann,H. and Luehrmann,R. TITLE cDNA cloning of the Sm proteins D2 and D3 from human small nuclear ribonucleoproteins: evidence for a direct D1-D2 interaction JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91, 12317-12321 (1994) MEDLINE 95083692 REFERENCE 2 (bases 1 to 626) AUTHORS Raker,V.A. TITLE Direct Submission JOURNAL Submitted (22-SEP-1994) Veronica A. Raker, Institut fuer Molekularbiologie und Tumorforschung, Emil-Mannkopff Strasse 2, Marburg, 35037, Germany FEATURES Location/Qualifiers source 1..626 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pBLSD2" /cell_line="HeLa S3" CDS 88..468 /note="SnRNP core protein" /codon_start=1 /product="Sm D3" /db_xref="PID:g600750" /translation="MSIGVPIKVLHEAEGHIVTCETNTGEVYRGKLIEAEDNMNCQMS NITVTYRDGRVAQLEQVYIRGSKIRFLILPDMLKNAPMLKSMKNKNQGSGAGRGKAAI LKAQVAARGRGRGMGRGNIFQKRR" terminator 466..468 BASE COUNT 175 a 130 c 165 g 156 t ORIGIN 1 tcttgactca cgccttcgcc gtagcatctt tcgcagcgga ccgaagagaa gaaaagtagg 61 ccagagccga actctcttcc tgccaagatg tctattggtg tgccgattaa agtactgcat 121 gaggccgagg gccacattgt gacatgtgag acgaacaccg gtgaggtata tcgggggaag 181 ctcattgaag cagaggacaa catgaactgc cagatgtcca acatcacagt cacatacaga 241 gatggccgag tggcacagct ggagcaggta tacatccgtg gcagcaaaat ccgctttctg 301 attttgcctg acatgctgaa gaacgcaccc atgttaaaga gcatgaaaaa taaaaaccaa 361 ggctcagggg ctggccgagg aaaagctgcc attctcaagg cccaagtggc cgcaagagga 421 agaggacgtg gaatgggacg tggaaacatc tttcaaaagc gaagataatt ttctaagttg 481 aacagaactt tgtccttttt tctttcaggt tatctgagtt cattggagtg ggtgcttgtg 541 catatatcta ggtatctttt gccatctttc tctttagatc aggggaaatg tttaagctaa 601 ataaatctgg ggggtttttt gttctg // LOCUS HSU15128 3414 bp DNA PRI 09-FEB-1996 DEFINITION Human beta-1,2-N-acetylglucosaminyltransferase II (MGAT2) gene, complete cds. ACCESSION U15128 L36537 NID g902744 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3414) AUTHORS Tan,J., D'Agostaro,A.F., Bendiak,B., Reck,F., Sarkar,M., Squire,J.A., Leong,P. and Schachter,H. TITLE The human UDP-N-acetylglucosamine: alpha-6-D-mannoside-beta-1,2- N-acetylglucosaminyltransferase II gene (MGAT2). Cloning of genomic DNA, localization to chromosome 14q21, expression in insect cells and purification of the recombinant protein JOURNAL Eur. J. Biochem. 231 (2), 317-328 (1995) MEDLINE 95361854 REFERENCE 2 (bases 1 to 3414) AUTHORS Schachter,H. TITLE Direct Submission JOURNAL Submitted (23-SEP-1994) Harry Schachter, Biochemistry, Hospital for Sick Children, 555 University Avenue, Toronto, Ontario M5G 1X8, Canada FEATURES Location/Qualifiers source 1..3414 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pHG30 and pHG36" /clone_lib="Clontech library in lambdaEMBL-3 SP6/T7 (catalog #HL1111j)" /chromosome="14" /map="14q21" /cell_type="leukocyte" /dev_stage="adult" gene 683..2026 /gene="MGAT2" CDS 683..2026 /gene="MGAT2" /EC_number="2.4.1.143" /note="GlcNAc-transferase II" /codon_start=1 /product="beta-1,2-N-acetylglucosaminyltransferase II" /db_xref="PID:g902745" /translation="MRFRIYKRKVLILTLVVAACGFVLWSSNGRQRKNEALAPPLLDA EPARGAGGRGGDHPSVAVGIRRVSNVSAASLVPAVPQPEADNLTLRYRSLVYQLNFDQ TLRNVDKAGTWAPRELVLVVQVHNRPEYLRLLLDSLRKAQGIDNVLVIFSHDFWSTEI NQLIAGVNFCPVLQVFFPFSIQLYPNEFPGSDPRDCPRDLPKNAALKLGCINAEYPDS FGHYREAKFSQTKHHWWWKLHFVWERVKILRDYAGLILFLEEDHYLAPDFYHVFKKMW KLKQQECPECDVLSLGTYSASRSFYGMADKVDVKTWKSTEHNMGLALTRNAYQKLIEC TDTFCTYDDYNWDWTLQYLTVSCLPKFWKVLVPQIPRIFHAGDCGMHHKKTCRPSTQS AQIESLLNNNKQYMFPETLTISEKFTVVAISPPRKNGGWGDIRDHELCKSYRRLQ" polyA_signal 2092 polyA_signal 2712 polyA_signal 2870 repeat_region 3329..>3414 /note="Alu fragment 1-192 (numbering according to Alu general consensus)" /rpt_type=tandem /rpt_family="Alu-J" BASE COUNT 900 a 756 c 863 g 895 t ORIGIN 1 cccttcgcac gtctcgcctt tcgcacgtct cgcctaacag gaaagggaag aaagaggcgg 61 aagtgggaac tgcacctgag cgacagtact gcaaaccaat aggcagccgg ccacggcggt 121 caggcgcctt cggtcgcgtc tggaaagcac caaccaacgg tctaaggggc gggccggagg 181 ggtgtgggcc ggagggcgcg gtgtgccgcg gggcagttgc gggttgtcat aacggtcccc 241 gccggagtga ggcgaggccg cgtcgctcag ttctggccgt ctagggcccc tgtaaggatg 301 agagcgcaga ggacgcaggg ccgctggagg cgcaggtaac gaagctaggg tgcggttggg 361 accgcggctg agctttttcc gggacccgtg gtgctgaatg gagaggacgg agacgaagcc 421 gagccgcggc tcctagcggc ggcgccgatg ctcgagctgt agctgccagg cgaggatgtg 481 tggagcgcag gcggcgcggg gtaaatgaga ggtctcgggc cccaggaccc ccggggcccg 541 ggatgagtta gcgagggcag ccgcgggggc cagttccgac cgtgacaggc caaggcgacg 601 gccgccgccc gcccgcccct tccgtgcaga agcagctgct cctttccgcg cccgcccgcc 661 tgcgctcccg gccctggaga ccatgaggtt ccgcatctac aaacggaagg tgctaatcct 721 gacgctcgtg gtggccgcct gcggcttcgt cctctggagc agcaatgggc gacaaaggaa 781 gaacgaggcc ctcgccccac cgttgctgga cgccgaaccc gcgcggggtg ccggcggccg 841 cggtggggac cacccctctg tggctgtggg catccgcagg gtctccaacg tgtcggcggc 901 ttccctggtc ccggcggtcc cccagcccga ggcggacaac ctgacgctgc ggtaccggtc 961 cctggtgtac cagctgaact ttgatcagac cctgaggaat gtagataagg ctggcacctg 1021 ggccccccgg gagctggtgc tggtggtcca ggtgcataac cggcccgaat acctcagact 1081 gctgctggac tcacttcgaa aagcccaggg aattgacaac gtcctcgtca tctttagcca 1141 tgacttctgg tcgaccgaga tcaatcagct gatcgccggg gtgaatttct gtccggttct 1201 gcaggtgttc tttcctttca gcattcagtt gtaccctaac gagtttccag gtagtgaccc 1261 tagagattgt cccagagacc tgccgaagaa tgccgctttg aaattggggt gcatcaatgc 1321 tgagtatccc gactccttcg gccattatag agaggccaaa ttctcccaga ccaaacatca 1381 ctggtggtgg aagctgcatt ttgtgtggga aagagtgaaa attcttcgag attatgctgg 1441 ccttatactt ttcctagaag aggatcacta cttagcccca gacttttacc atgtcttcaa 1501 aaagatgtgg aaactgaagc agcaagagtg ccctgaatgt gatgttctct ccctggggac 1561 ctatagtgcc agtcgcagtt tctatggcat ggctgacaag gtagatgtga aaacttggaa 1621 atccacagag cacaatatgg gtctagcctt gacccggaat gcctatcaga agctgatcga 1681 gtgcacagac actttctgta cttatgatga ttataactgg gactggactc ttcaatactt 1741 gactgtatct tgtcttccaa aattctggaa agtgctggtt cctcaaattc ctaggatctt 1801 tcatgctgga gactgtggta tgcatcacaa gaaaacctgt agaccatcca ctcagagtgc 1861 ccaaattgag tcactcttaa ataataacaa acaatacatg tttccagaaa ctctaactat 1921 cagtgaaaag tttactgtgg tagccatttc cccacctaga aaaaatggag ggtggggaga 1981 tattagggac catgaactct gtaaaagtta tagaagactg cagtgaaaat cacagttaca 2041 aaagcgacag tcttctattt ttgatatttg tccaaacagg acatacaatt gaataaaaga 2101 gtttaggaac tggtttctgc tttaatacaa aaacaaaatc ttgtaaaagg tgtccaaata 2161 catagtaatc ttttccagtt atgtctgatt aagatttaaa actgaaggtt tcattttggg 2221 agtagggttt taaagctcaa tctgttatct gctaaaattg attattgttg atatgagaga 2281 agaggggaaa ttttatttaa attgcattta ttaatctttt tatctgaaac tttgtacact 2341 tttccacttt caaaacctat tttaagtaca gcaaaattta tttaaaactg tgatagcagt 2401 aaaaagtatt acgatgaaat tgttagggta ttaatggaac aaacccagtt tcactctctt 2461 gacacactta ttaggaaggg attgcttcac tggtttaata atttaaaagt tatgtttgtt 2521 aaacaccctg tcagaacagt cattttcagt attagattcc tgtactattg tgttttgagt 2581 gtgttttgga accttcatag aacacacttt cttttggaat gtatttgatt gataagaaag 2641 tttaaacatt gttttcacct caatgtagaa atacagtggt tttgtttttt tttttctttt 2701 agtgctgaca aaataaaata ctcatttttg cataaaaagg ttcctaatcc ttttgcagaa 2761 taagttttgt ttactcttta taccaaaatt cagtgaaggc attctacaag ttttgagtta 2821 gcattacatt ttaatattta ctattgctac attgtataat tgagtttgaa ataaaaccca 2881 gcttatgaca atgcattccc tgtgcaagaa actgtttggc tttcaaatta cccaggcatt 2941 gaaaatgaat gataaaaagt tgctgtgtaa gggaaataca gcctaaatgt tttgaaagcc 3001 agaaatgata caaagttcag tcatgccaaa gtgaaatact ttctagtgcc agctttaact 3061 taaatcatac gttttaaaag gacagataca gaaaattata ggaaacaggc ttaaattttg 3121 ctccatattt aatgtagacg tttatagaag tttcccttaa tttgtaattg cattcaaccg 3181 agaatttctc ataaaagact aatttctgtg taaagatatt acgggctggg tgtggtggct 3241 catgtctgta atccagcact ctgggaggtt gaggcaggac gattgcttga actcagagtt 3301 tgagaccagc ctgggcaaca tggcgaaaaa cccatctcta ctaaaaataa caaaaaatta 3361 gccgggcgta gtggtgactc tgtagtccca gctacttgag aggctgaggt ggga // LOCUS HSU15131 4277 bp mRNA PRI 09-JAN-1997 DEFINITION Human p126 (ST5) mRNA, complete cds. ACCESSION U15131 NID g1769466 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4277) AUTHORS Lichy,J.H., Majidi,M., Elbaum,J. and Tsai,M.M. TITLE Differential expression of the human ST5 gene in HeLa-fibroblast hybrid cell lines mediated by YY1: evidence that YY1 plays a part in tumor suppression JOURNAL Nucleic Acids Res. 24 (23), 4700-4708 (1996) MEDLINE 97128263 REFERENCE 2 (bases 1 to 4277) AUTHORS Lichy,J.H. TITLE Direct Submission JOURNAL Submitted (26-SEP-1994) Jack H. Lichy, Cellular Pathology, Armed Forces Institute of Pathology, Washington D.C. 20306-6000, USA FEATURES Location/Qualifiers source 1..4277 /organism="Homo sapiens" /db_xref="taxon:9606" /map="11p15" /chromosome="11" /cell_type="epithelial cell" 5'UTR 1..114 gene 115..3528 /gene="ST5" CDS 115..3528 /gene="ST5" /codon_start=1 /product="p126" /db_xref="PID:g1769467" /translation="MTMTANKNSSITHGAGGTKAPRGTLSRSQSVSPPPVLSPPRSPI YPLSDSETSACRYPSHSSSRVLLKDRHPPAPSPQNPQDPSPDTSPPTCPFKTASFGYL DRSPSACKRDTQKESVQGAAQDVAGVAACLPLAQSTPFPGPAAGPRGVLLTRTGTRSP QPGHPGEDIAWEGRREASPRMSMCGEKREGSGSEWAASEGCPSLGCPSVVPSPCSSEK TFDFKGLRRMSRTFSECSYPETEEEGEALPVRDSFYRLEKRLGRSEPSAFLRGHGSRK ESSAVLSRIQKIEQVLKEQPGRGLPQLPSSCYSVDRGKRKTGTLGSLEEPAGGASVSA GSRAVGVAGVAGEAGPPPEREGSGSTKPGTPGNSPSSQRLPSKSSLDPAVNPVPKPKR TFEYEAEKNPKSKPSNGLPPSPTPAAPPPLPSTPAPPVTRRPKKDMRGHRKSQSRKSF EFEDASSLQSLYPSSPTENGTENQPKFGSKSTLEENAYEDIVGDLPKENPYEDVDLKS RRAGRKSQQLSENSLDSLHRMWSPQDRKYNSPPTQLSLKPNSQSLRSGNWSERKSHRL PRLPKRHSHDDMLLLAQLSLPSSPSSLNEDSLSTTSELLSSRRARRIPKLVQRINSIY NAKRGKKRLKKLSMSSIETASLRDENSESESDSDDRFKAHTQRLVHIQSMLKRAPSYR TLELELLEWQERELFEYFVVVSLKKKPSRNTYLPEVSYQFPKLDRPTKQMREAEERLK AIPQFCFPDAKDWLPVSEYSSETFSFMLTGEDGSRRFGYCRRLLPSGKGPRLPEVYCV ISRLGCFGLFSKVLDEVERRRGISAALVYPFMRSLMESPFPAPGKTIKVKTFLPGAGN EVLELRRPMDSRLEHVDFECLFTCLSVRQLIRIFASLLLERRVIFVADKLSTLSSCSH AVVALLYPFSWQHTFIPVLPASMIDIVCCPTPFLVGLLSSSLPKLKELPVEEALMVNL GSDRFIRQMDDEDTLLPRKLQAALEQALERKNELISQDSDSDSDDECNTLNGLVSEVF IRFFVETVGHYSLFLTQSEKGERAFQREAFRKSVASKSIRRFLEVFMESQMFAGFIQD RELRKCRAKGLFEQRVEQYLEELPDTEQSGMNKFLRGLGNKMKFLHKKN" 3'UTR 3529..4277 polyA_site 4277 BASE COUNT 953 a 1271 c 1174 g 879 t ORIGIN 1 gaagatgaga gactgcttag gcgccaccac tagtaccatg agtccctgca ctggttaaag 61 ccatcgccac aacctggaca ggcagcaagg gctctgggtt tgcagagagc cgaaatgacc 121 atgactgcca acaagaattc cagcatcacc cacggagctg gtggcactaa agcccctcgg 181 gggactctga gcaggtctca gtcagtctct ccacctccag tcctctcccc accaaggagt 241 cccatctacc cgctcagtga tagtgaaacc tcagcctgca ggtaccccag ccactccagc 301 tcccgggtgc tcctcaagga ccggcacccc ccagctcctt caccccagaa tcctcaagat 361 ccctccccag atacttcccc acccacctgt cccttcaaga ccgccagctt cggttatttg 421 gacagaagcc cttcggcgtg caagagagac acccaaaagg aaagtgtcca aggcgcagcc 481 caggatgtag caggggtcgc tgcctgcctc ccccttgccc agagcacgcc attcccgggg 541 ccagcagctg gcccccgggg cgtcttgctg acccgtaccg gtacccgcag cccacagcct 601 gggcatccgg gagaagatat agcatgggaa ggtcgccgag aggcgtcgcc caggatgagc 661 atgtgtggag agaagcggga gggctctggg agcgagtggg cggccagtga gggctgcccc 721 agcctgggct gtcccagcgt ggtgccgtcc ccctgcagct ctgaaaagac ctttgatttc 781 aagggcctcc ggaggatgag caggaccttc tccgagtgtt cctacccaga gactgaggag 841 gagggagagg cgctccctgt ccgggactct ttctaccggc tggagaaacg gctgggccgg 901 agtgagccca gcgccttcct cagggggcat ggcagcagga aggagagctc agcagtgctg 961 agccggatcc agaaaattga acaggtcctg aaggagcagc cgggccgggg gctcccccag 1021 ctccccagca gctgctacag cgtcgaccgg gggaaaagga agactggaac cttgggctcc 1081 ttggaggagc cggcaggggg cgcgagtgtg agcgctggca gccgggcagt cggagtggct 1141 ggtgttgcgg gggaggcggg cccaccccca gagagggaag gcagtggttc cactaagccc 1201 gggacccctg gaaatagccc tagctcccag cggctgccat cgaagagttc cctcgatccc 1261 gctgtgaacc ctgtccccaa acccaagcgc acctttgaat acgaggctga gaagaacccc 1321 aagagtaagc ccagtaatgg tctacctcct tcacccacac ctgctgctcc acctcccttg 1381 ccctccaccc cagccccgcc agtcacccgg agacccaaga aggacatgcg tggtcaccgc 1441 aagtcccaga gcagaaaatc ctttgagttt gaggatgcat ccagtctcca gtccctgtac 1501 ccctcttctc ccactgagaa tggtactgag aaccaaccca agtttggatc caaaagcact 1561 ttagaagaaa atgcctatga agatattgtg ggagatctgc ccaaggagaa tccatatgag 1621 gatgtggact taaagagccg aagagcagga cgaaaatccc agcaactgtc tgagaactcc 1681 ttggactctt tgcacaggat gtggagtcct caggacagga agtacaacag cccgcccaca 1741 cagctttccc tgaaacccaa cagccagtcc ctgcgcagtg ggaactggtc agaaaggaag 1801 agccaccggc tgccacgatt acccaagagg cacagccatg acgacatgct gctgctggct 1861 cagctgagtc tgccgtcctc accctccagc ctcaatgaag acagcctcag caccaccagc 1921 gagctgctgt ccagccgccg ggcccgccgc attcccaagc ttgtccaaag aattaactcc 1981 atctacaatg ccaagagagg aaagaagaga ttaaaaaagt tgtctatgtc cagcattgaa 2041 acagcatcac tgagagatga aaacagtgag agcgagagcg actctgatga caggttcaaa 2101 gcccacacac agcgcctggt ccacatccag tcgatgctga agcgcgcccc cagctatcgc 2161 acgctggagc tggagctgct ggagtggcag gagcgggagc tttttgagta ctttgtggtg 2221 gtgtccctca agaagaagcc atcgcgaaac acctacctcc ccgaagtctc ctaccagttt 2281 cccaagctgg accgacccac caagcagatg cgagaggcag aggaaaggct caaagccatt 2341 ccccagtttt gcttccctga tgccaaggac tggcttcctg tgtcagagta tagcagtgag 2401 accttttctt tcatgctgac tggggaagat ggcagcagac gctttggcta ctgcaggcgc 2461 ttactgccaa gtgggaaagg gccccggttg ccagaggtgt actgtgtcat cagccgcctt 2521 ggctgcttcg gcttgttttc caaggtccta gatgaggtgg agcgccggcg tgggatctcc 2581 gctgcattgg tctatccttt catgagaagt ctcatggagt cgcccttccc agccccaggg 2641 aagaccatca aagtgaagac attcctgcca ggtgctggca atgaggtgtt agagctgcgg 2701 cggcccatgg actcaaggct ggagcacgtg gactttgagt gcctttttac ctgcctcagt 2761 gtgcgccagc tcatccgaat ctttgcctca ctgctgctgg agcgccgggt catttttgtg 2821 gcagataagc tcagtaccct ctccagctgc tcccacgcgg tggtggcctt gctctacccc 2881 ttctcctggc agcacacctt cattcctgtc ctcccggcct ccatgattga catcgtctgc 2941 tgtcccaccc ccttcctggt tggcctgctc tccagctccc tccccaaact gaaggagctg 3001 cctgtggagg aggcgctgat ggtgaatctg ggatctgacc gattcatccg acagatggac 3061 gacgaagaca cgttgttacc taggaagtta caggcagctc tggagcaggc tctggagagg 3121 aagaatgagc tgatctccca ggactctgac agcgactccg acgatgaatg taataccctc 3181 aatgggctgg tgtcggaggt gtttatccgg ttctttgtgg agaccgttgg gcactactcc 3241 ctctttctga cacagagtga gaagggagag agggcctttc agcgagaggc cttccgcaaa 3301 tctgtggcct ccaaaagcat ccgccgcttt cttgaggttt ttatggagtc tcagatgttt 3361 gctggcttca tccaagacag ggagctaaga aagtgtcggg caaagggcct ttttgagcag 3421 cgagtggagc agtacttaga agaactccca gacactgagc agagtggaat gaataagttt 3481 ctccgaggtt tgggcaacaa aatgaagttt ctccacaaga agaattaagc ctccttctca 3541 gtagcagagt ccagtgcctt gcagagcctg aagcctgggg agaaggccca gcctgggacc 3601 ctctgggctg ctgtggctcc tctgccccca cagatcctat cctccaagcc agcccacctc 3661 tgccttcatc atatcccagg atactgtttg taaataatct gctgtaagct ttcttaactg 3721 ttttttgtaa caagcaaaga gaatatggca aatatttgta tattcccaag gggccgggtg 3781 ctttcctgtc ctgccagagc atggatgaag tttcgctggg tgctcgtgac tggccagttt 3841 tgtgcagctg actgtctcag ccaaaccact gatcttccct ggaggccttc ggcctgcctg 3901 cctgcctgcc tgaggtcccc gctgccagtc ccgggccctg gagagcagat gctgtcttgt 3961 tatgtacagg aggacctttt aaaaaaatca agtttctatt ttttgctggt agtccgcata 4021 cccataccct ctgtttttga aaggcaaagg ccaatcagtc cccatttgta gcatggcacc 4081 agggtcttag gcctagtcct ctcattcctc ccaccctccg agatggtcag tgtgtcatgg 4141 gaagcccacc cccagctctg ccagtgctct ctgggcctgg ctcccagtca gtggtggcca 4201 cgatgcggta cagggcatcc ctccttccca tctacgggtg ttgtcaataa acaatgtaca 4261 gttgtttggg cccagag // LOCUS HSU15172 1100 bp mRNA PRI 02-FEB-1998 DEFINITION Homo sapiens BCL2/adenovirus E1B 19kD-interacting protein 1 (BNIP1) mRNA, complete cds. ACCESSION U15172 NID g558841 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 11 to 697) AUTHORS Boyd,J.M., Malstrom,S., Subramanian,T., Venkatesh,L.K., Schaeper,U., Elangovan,B., D'Sa-Eipper,C. and Chinnadurai,G. TITLE Adenovirus E1B 19 kDa and Bcl-2 proteins interact with a common set of cellular proteins JOURNAL Cell 79, 341-351 (1994) MEDLINE 95042730 REFERENCE 2 (bases 1 to 1100) AUTHORS Boyd,J.M. TITLE Direct Submission JOURNAL Submitted (28-SEP-1994) Janice M. Boyd, Institute for Molecular Virology, St. Louis University Health Sciences Center, 3681 Park Avenue, St. Louis, MO 63110, USA FEATURES Location/Qualifiers source 1..1100 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="B-cell library of S. Elledge, 5' RACE HeLa mRNA" /cell_type="B cells" /cell_line="HeLa" gene 1..1100 /gene="BNIP1" CDS 11..697 /gene="BNIP1" /codon_start=1 /product="BCL2/adenovirus E1B 19kD-interacting protein 1" /db_xref="PID:g558842" /translation="MAAPQDVHVRICNQEIVKFDLEVKALIQDIRDCSGPLSALTELN TKVKEKFQQLRHRIQDLEQLAKEQDKESEKQLLLQEVENHKKQMLSNQASWRKANLTC KIAIDNLEKAELLQGGDLLRQRKTTKESLAQTSSTITESLMGISRMMAQQVQQSEEAM QSLVTSSRTILDANEEFKSMSGTIQLGRKLITKYNRRELTDKLLIFLALRLFLATVLY IVKKRLFPFL" BASE COUNT 299 a 283 c 279 g 239 t ORIGIN 1 agtccccaac atggcggctc cccaagacgt ccacgtccgg atctgtaacc aagagattgt 61 caaatttgac ctggaggtga aggcgcttat tcaggatatc cgtgattgtt caggaccctt 121 aagtgctctt actgaactga atactaaagt aaaagagaaa tttcaacagt tgcgtcacag 181 aatacaggac ctggagcagt tggctaaaga gcaagacaaa gaatcagaga aacaacttct 241 actccaggaa gtggagaatc acaaaaagca gatgctcagc aatcaggcct catggaggaa 301 agctaatctc acctgcaaaa ttgcaatcga caatctagag aaagcagaac ttcttcaggg 361 aggagatctc ttaaggcaaa ggaaaaccac caaagagagc ctggcccaga catccagtac 421 catcactgag agcctcatgg ggatcagcag gatgatggcc cagcaggtcc agcagagcga 481 ggaggccatg cagtctctag tcacttcttc acgaacgatc ctggatgcaa atgaagaatt 541 taagtccatg tcgggcacca tccagctggg ccggaagctt atcacaaaat acaatcgccg 601 ggagctgacg gacaagcttc tcatcttcct tgcgctacgc ctgtttcttg ctacggtcct 661 ctatattgtg aaaaagcggc tctttccatt tttgtgagat cccaaaggtg ccagttctgg 721 ccctttcagc tcctgtttca ggatctgtcc tggttcctga gctctaggct gctaagctga 781 gccacacacc cctccgtttt gcaccagttg cctgcaggtt ggatggaaca cagtgcccca 841 cttttctgca agtagctggc ttgtaaaggg tgaacagagc catgggagga aggtctggca 901 ttgggatgcc gccctgggga catacgaacc gcctccttcc accattgtgc actatgggag 961 gccgctgctg cgtggagcac ttaaagtcca gcctccagga ccggatgccc ctcctgtctc 1021 ccgctcccat cgtgccctta aatgccagat ctggtggagg gaagagagaa gaggtaggaa 1081 gaaaggtgat gaaaactcct // LOCUS HSU15173 2382 bp mRNA PRI 02-FEB-1998 DEFINITION Homo sapiens BCL2/adenovirus E1B 19kD-interacting protein 2 (BNIP2) mRNA, complete cds. ACCESSION U15173 NID g558843 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 212 to 1156) AUTHORS Boyd,J.M., Malstrom,S., Subramanian,T., Venkatesh,L.K., Schaeper,U., Elangovan,B., D'Sa-Eipper,C. and Chinnadurai,G. TITLE Adenovirus E1B 19 kDa and Bcl-2 proteins interact with a common set of cellular proteins JOURNAL Cell 79, 341-351 (1994) MEDLINE 95042730 REFERENCE 2 (bases 1 to 2382) AUTHORS Boyd,J.M. TITLE Direct Submission JOURNAL Submitted (28-SEP-1994) Janice M. Boyd, Institute for Molecular Virology, St. Louis University Health Sciences Center, 3681 Park Avenue, St. Louis, MO 63110, USA FEATURES Location/Qualifiers source 1..2382 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="B-cell, S.Elledge; B-cell, J.Ambrus; fetal brain, M.Green" /cell_type="B cells; brain" /dev_stage="fetus (brain)" gene 1..2382 /gene="BNIP2" CDS 212..1156 /gene="BNIP2" /codon_start=1 /product="BCL2/adenovirus E1B 19kD-interacting protein 2" /db_xref="PID:g558844" /translation="MEGVELKEEWQDEDFPIPLPEDDSIEADILAITGPEDQPGSLEV NGNKVRKKLMAPDISLTLDPSDGSVLSDDLDESGEIDLDGLDTPSENSNEFEWEDDLP KPKTTEVIRKGSITEYTAAEEKEDGRRWRMFRIGEQDHRVDMKAIEPYKKVISHGGYY GDGLNAIVVFAVCFMPESSQPNYRYLMDNLFKYVIGTLELLVAENYMIVYLNGATTRR KMPSLGWLRKCYQQIDRRLRKNLKSLIIVHPSWFIRTLLAVTRPFISSKFSQKIRYVF NLAELAELVPMEYVGIPECIKQVDQELNGKQDEPKNEQ" BASE COUNT 734 a 393 c 511 g 744 t ORIGIN 1 agtaccgctg cggccggggg attgggccgg ggtctccacc gccgaccgag gggagcggcg 61 tccgctcggc cctgcttttt gcgacctgcc gtcagcccca cgtcgccggc ctggaggggc 121 gaagaggacg aggggcgcaa ggcttcctcc ggggacattg gctccctgga ttatcaagca 181 gtttgtagtt gacattgaat ccaggctgag gatggaaggt gtggaactta aagaagaatg 241 gcaagatgaa gattttccga tacctttacc agaagatgat agtattgaag cagatatact 301 agctataact ggaccagagg accagcctgg ctcactagaa gttaatggaa ataaagtgag 361 aaagaaacta atggctccag acattagcct gacactggat cctagtgatg gctctgtatt 421 gtcagatgat ttggatgaaa gtggggagat tgacttagat ggcttagaca caccgtcaga 481 gaatagtaat gagtttgagt gggaagatga tcttccaaaa cccaagacta ctgaagtaat 541 taggaaaggc tcaattactg aatacacagc agcagaggaa aaagaagatg gacgacgctg 601 gcgtatgttc aggattggag aacaggacca cagggttgat atgaaggcaa ttgaacccta 661 taaaaaagtt atcagccatg ggggatatta tggggatgga ttaaatgcca ttgttgtatt 721 tgctgtctgt ttcatgcctg aaagtagtca gcctaactat agatacctga tggacaatct 781 ttttaaatat gttattggca ctttggagct attagtagca gaaaactaca tgatagttta 841 tttaaatggt gcaacaactc gaagaaaaat gcccagtctg ggatggctca ggaaatgtta 901 tcagcaaatt gatagaaggt tacggaaaaa tctaaaatcc ctaatcattg tacatccttc 961 ttggtttatc agaacacttc tggctgttac aagaccattt attagctcga aattcagcca 1021 aaaaattaga tacgtgttta atttggcaga actagcagaa cttgtcccca tggaatacgt 1081 tggcatacca gaatgcataa aacaagttga tcaagaactt aatggaaaac aagatgaacc 1141 gaaaaatgaa cagtaagttt ggcatctagt ccaaacaaga ctgaagaatg tgctgatgga 1201 gcagtgctgt ttctgcattc ataatgcatt tattggccca tatttttatg taacctgtta 1261 caaaatagac ttgacttttt cataatggac ttttgtatta tacaagggac tgttcactgc 1321 tgtactggtt tgcaaatttc ttgaatttag ctctttaata gctaactgta ttattatcgt 1381 tttatatttt atattgctaa atagagaacc acactttata taaagtagtt tttgcatttg 1441 tttattgaat gatgcatctt cttcggtgaa atatttatat gcataaatgg caaaggaaag 1501 aaataatata tatttttatg tcattgagca atattttttc aatgtgtacc tgtcttatgg 1561 aagaaatatg caggtatata agaccacgat tttctaagct gccatataag aatttttgtt 1621 tttgtaaatg gttaaataca tttcctgggt aacttaggaa attaagcttt ttcataaggc 1681 aacagatggt aaactgattg tcatgaatac ccaaagatca tgtatataat cgaagtgtat 1741 tagtaccatc ccaaggtttt tttctcattt aacatatttg tttcataatt cagcaagtac 1801 agatgcaagc gcattgcaca ctttttcctt tctaaactta aagacaagtc aaaaagccat 1861 tcttagaact agaggattta agcagggtcg gaattacggg tttgtatata tgtatatact 1921 cgtttgtata tatgtatata ctgggacatt ttatcttctg gcccaaagtc agaactttat 1981 aaaaatcttg agtttgttca cttaatgtga aataagctat gtgtccaggg tattgctccc 2041 ctgagtgtat atgagtgctg agtagtattg cagagaatgt gatgagttat cactgtcaca 2101 actttttcta tagaaaacag gggctgcttt taaactctca ctatgggaca ctttaccaaa 2161 atacttccat atcaattatt tgaacccggt agtttgtttg acctagttag attgtggtgt 2221 ttattcaagt ttgaaatcat gtttgacaat actgtaaatt aggttaattt tgaagtctta 2281 gcatcatcat attgtgctgt tttggataac acgtttgttc aagaacattt aaactgtttc 2341 tttggtgtcc tttacattga aataaattgt gtttgtgcct cc // LOCUS HSU15306 3509 bp mRNA PRI 04-JUL-1995 DEFINITION Human cysteine-rich sequence-specific DNA-binding protein NFX1 mRNA, complete cds. ACCESSION U15306 NID g563216 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3509) AUTHORS Song,Z., Krishna,S., Thanos,D., Strominger,J.L. and Ono,S.J. TITLE A novel cysteine-rich sequence-specific DNA-binding protein interacts with the conserved X-box motif of the human major histocompatibility complex class II genes via a repeated Cys-His domain and functions as a transcriptional repressor JOURNAL J. Exp. Med. 180 (5), 1763-1774 (1994) MEDLINE 95053707 REFERENCE 2 (bases 1 to 3509) AUTHORS Ono,S.J. TITLE Direct Submission JOURNAL Submitted (30-SEP-1994) Santa J. Ono, Medicine & Grad. Prog. in Mol. Medicine, The Johns Hopkins University School of Medicine, 5501 Hopkins Bayview Circle, 2A.38, Baltimore, MD 21224, USA FEATURES Location/Qualifiers source 1..3509 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="NFX.1 cDNA #16" /cell_line="Raji" /cell_type="B lineage" /tissue_type="hematopoietic" /dev_stage="adult" CDS 1..3315 /note="cysteine rich protein; contains nfx repeats" /codon_start=1 /function="transcription repressor" /product="NFX1" /db_xref="PID:g563217" /translation="MEFSSICIEFKSTLRQEAPPPSRAAEPRSSCTVHHLPVTFPGRS LMMKSLLFISIVIIRQEGKPKSQQTSFQSSPCNKSPKSHGLQNQPWQKLRNEKHHIRV KKAQSLAEQTSDTAGLESSTRSESGTDLREHSPSESEKEVVGADPRGAKPKKATQFVY SYARGPKVKEKLKCEWSNRTTPKPEMLDPKVPNLWGFSTLTLQRHPLEKEYWMGMEPD EMSREDTHRKGLPGKWRGPGHDQAEIHQNRRATDIQTQDTETTWAPFQSDDLNERPAK STCDSENLAVINKSSRRVDPEKCTVRRQDPQVVSPFSRGKQNHVLKNVETHTGSLIEQ LTTEKYECMVCCELVRVTAPVWSCQSCYHVFHLNCIKKWARSPASQADGQSGWRCPAC QNVSAHVPNTFSCFCGKVKNPEWSRNEIPHSCGEVCRKKQPGQDCPHSCNLLCHPGPC PPCPAFMTKTCECGRTRHTVRCGQAVSVHCSNPCENILNCGQHQCAELCHGGQCQPCQ IILNQVCYCGSTSRDVLCGTDVGKSDGFGDFSCLKTCGKDLKCGNHTCSQVCHPQPCQ QCPRLPQLVRCCPCGQTPLSQLLELGSSSRKTCMDPVPSCGKVCGKPLPCGSLDFIHT CEKLCHEGDCGPVSRTSVISCRCSFRTKELPCTSLKSEDATFMCDKRCNKKRLCGRHK CNEICCVDKEHKCPLNCGRKLRCGLHRCEEPCHRGNCQTCWQASFDELTCHCGASVIY PPVPCGTRPPECTQTCARVHECDHPVYHSGHSEEKCPPCTFLTQKWCMGKHEFRSNIP CHLVDISCGLPCSATLPCGMHKCQRLCHKGECLVDEPCKQPCTTPRADCGHPCMAPCH TSSPCPVTACKAKVELQCECGRRKEMVICSEASSTYQRIAAISMASKITDMQLGGSVE ISKLITKKEVHQARLECDEECSALERKKRLAEAFHISEDSDPFNIRSSGSKFSDSLKE DARKDLKFVSDVEKEMETLVEAVNKGKNSKKSHSFPPMNRDHRRIIHDLAQVYGLESV SYDSEPKRNVVVTAIRGKSVCPPTTLTGVLEREMQARPPPPIPHHRHQSDKNPGSSNL QKITKEPIIDYFDVQD" misc_feature 1251..1419 /note="encodes LIM domain DNA binding motif" misc_feature 1599..1749 /note="encodes LIM domain DNA binding motif" misc_feature 1797..1941 /note="encodes LIM domain DNA binding motif" misc_feature 2064..2211 /note="encodes LIM domain DNA binding motif" misc_feature 2397..2517 /note="encodes LIM domain DNA binding motif" misc_feature 2520..2670 /note="encodes LIM domain DNA binding motif" BASE COUNT 993 a 821 c 883 g 812 t ORIGIN 1 atggaattca gcagcatctg tattgaattt aaaagtacct tgagacagga ggcgcctccg 61 ccatcccgtg ccgcagaacc tagatcgagc tgtacagttc accacctccc tgtcaccttt 121 ccaggcaggt cccttatgat gaaatctctg ctgttcatca gcatagttat catccgtcag 181 gaaggcaaac ctaagagtca gcagacgtct ttccagtcct ctccttgtaa taaatcgccc 241 aagagccatg gccttcagaa tcaaccttgg cagaaattga ggaatgagaa gcaccatatc 301 agagtcaaga aagcacagag tcttgctgag cagacctcag atacagctgg attagagagc 361 tcgaccagat cagagagtgg gacagacctc agagagcata gtccttctga gagtgagaag 421 gaagttgtgg gtgcagatcc caggggagca aaacccaaaa aagcaacaca gtttgtatac 481 agctatgcta gaggaccaaa agtcaaggag aaactcaaat gtgaatggag taaccgaaca 541 actccaaaac cggagatgct ggacccgaaa gtaccaaacc tgtgggggtt ttccaccctg 601 actcttcaga ggcatcctct agaaaaggag tattggatgg gtatggagcc agacgaaatg 661 agcagagaag atacccacag aaaaggcctc cctgggaagt ggagggggcc aggccacgac 721 caggcagaaa tccaccaaaa caggagggcc accgacatac aaacgcagga cacagaaaca 781 acatgggccc cattccaaag tgatgacctc aatgaaagac cagcaaaatc tacctgtgac 841 agtgagaact tggcagtcat caacaagtct tccaggaggg ttgacccaga gaaatgcact 901 gtacggaggc aggatcctca agtagtatct cctttctccc gaggcaaaca gaaccatgtg 961 ctaaagaatg tggaaacgca cacaggttct ctaattgaac aactaacaac agaaaaatac 1021 gagtgcatgg tgtgctgtga attggttcgt gtcacggccc cagtgtggag ttgtcagagc 1081 tgttaccatg tgtttcattt gaactgcata aagaaatggg caaggtctcc agcatctcaa 1141 gcagatggcc agagtggttg gaggtgccct gcctgtcaga atgtttctgc acatgttcct 1201 aataccttct cttgtttctg tggcaaggta aagaatcctg agtggagcag aaatgaaatt 1261 ccacatagct gtggtgaggt ttgtagaaag aaacagcctg gccaggactg cccacattcc 1321 tgtaaccttc tctgccatcc aggaccctgc ccaccctgcc ctgcctttat gacaaaaaca 1381 tgtgaatgtg gacgaaccag gcacacagtt cgctgtggtc aggctgtctc agtccactgt 1441 tctaacccat gtgagaatat tttgaactgt ggtcagcacc agtgtgctga gctgtgccat 1501 gggggtcagt gccagccttg ccagatcatt ttgaaccagg tatgctattg cggcagcacc 1561 tcccgagatg tgttatgtgg aaccgatgta ggaaagtctg atggatttgg ggatttcagc 1621 tgtttaaaga catgtggcaa ggacttgaaa tgcggtaacc atacatgttc gcaagtgtgc 1681 caccctcagc cctgccagca atgcccacgg ctcccccagc tggtgcgctg ttgcccctgt 1741 ggccaaactc ctctcagcca attgctagaa cttggaagta gtagtcggaa aacatgcatg 1801 gaccctgtgc cttcatgtgg aaaagtgtgc ggcaagcctc tgccttgtgg ttccttagat 1861 ttcattcata cctgtgaaaa gctctgccat gaaggagact gtggaccagt ctctcgcaca 1921 tcagttattt cctgcagatg ctctttcaga acaaaggagc ttccatgtac cagtctcaaa 1981 agtgaagatg ctacatttat gtgtgacaag cggtgtaaca agaaacggtt gtgtggacgg 2041 cataaatgta atgagatatg ctgtgtggat aaggagcaca agtgtccttt gaattgtggg 2101 aggaaactcc gttgtggcct tcataggtgt gaagaacctt gtcatcgtgg aaactgccag 2161 acatgctggc aagccagttt tgatgaatta acctgccatt gtggtgcatc agtgatttac 2221 cctccagttc cctgtggtac taggccccct gaatgtaccc aaacctgcgc tagagtccat 2281 gagtgtgacc atccagtata tcattctggt catagtgagg agaagtgtcc cccttgcact 2341 ttcctaactc agaagtggtg catgggcaag catgagtttc ggagcaacat cccctgtcac 2401 ctggttgata tctcttgcgg attaccctgc agtgccacgc taccatgtgg gatgcacaaa 2461 tgtcagagac tctgtcacaa aggggagtgt cttgtggatg agccctgcaa gcagccctgc 2521 accaccccca gagctgactg tgggcacccc tgtatggcac cctgccatac cagctcaccc 2581 tgccctgtga ctgcttgtaa agctaaggta gagctacagt gtgaatgtgg acgaagaaaa 2641 gagatggtga tttgctctga agcatctagt acttatcaaa gaatagctgc aatctccatg 2701 gcctctaaga taacagacat gcagcttgga ggttcagtgg agatcagcaa gttaattacc 2761 aaaaaggaag ttcatcaagc caggctggag tgtgatgagg agtgttcagc cttggaaagg 2821 aaaaagagat tagcagaggc atttcatatc agtgaggatt ctgatccttt caatatacgt 2881 tcttcagggt caaaattcag tgatagtttg aaagaagatg ccaggaagga cttaaagttt 2941 gtcagtgacg ttgagaagga aatggaaacc ctcgtggagg ccgtgaataa gggaaagaat 3001 agtaagaaaa gccacagctt ccctcccatg aacagagacc accgccggat catccatgac 3061 ttggcccaag tttatggcct ggagagcgtg agctatgaca gtgaaccgaa gcgcaatgtg 3121 gtggtcactg ccatcagggg gaagtccgtt tgtcctccta ccacgctgac aggtgtgctt 3181 gaaagggaaa tgcaggcacg gcctccacca ccgattcctc atcacagaca tcagtcagac 3241 aagaatcctg ggagcagtaa tttacagaaa ataaccaagg agccaataat tgactatttt 3301 gacgtccagg actaagaaga tcatgatgca cttagataaa agaatgatta ggtatagtgg 3361 agacttattt gccagcagat aaatcatgcc cgttcccctc tgcctggcag aatcacagtc 3421 tcacatactg tcttgtactg acacatccaa agcatgagtg tgtcagaaat cccttgtcta 3481 ttcctgtctg tataaagtgt ttcaggatg // LOCUS HSU15552 2366 bp mRNA PRI 18-OCT-1994 DEFINITION Human acidic 82 kDa protein mRNA, complete cds. ACCESSION U15552 NID g558457 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2366) AUTHORS Carlsson,P. TITLE cDNA encoding an acidic 82 kDa protein JOURNAL Unpublished REFERENCE 2 (bases 1 to 2366) AUTHORS Carlsson,P. TITLE Direct Submission JOURNAL Submitted (05-OCT-1994) Peter Carlsson, Molecular Biology, Goteborg University, Medicinaregatan 9C, Goteborg, S-413 90, Sweden FEATURES Location/Qualifiers source 1..2366 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HepG2" /cell_type="hepatocyte" /tissue_type="liver" CDS 25..2235 /note="acidic 82 kDa protein" /codon_start=1 /db_xref="PID:g558458" /translation="MVVTRSARAKASIQAASAESSGQKSFAANGIQAHPESSTGSDAR TTDESQTTGKQSLIPRTPKARKSKSRTTGSLPKGTEPSTDGETSEAESNYSVSEHHDT ILRVTRRRQILIACSPVSSVRKKPKVTPTKESYTEEIVSEAESHVSGISRIVLPTEKT TGARRSKAKSLTDPSQESHTEAISDAETSSSDISFSGIATRRTRSMQRKLKAQTEKKD SKIVPGNEKQIVGTPVNSEDSDTRQTSHLQARSLSEINKPNFYNNDFDDDFSHRSSEN ILTVHEQANVESLKETKQNCKDLDEDANGITDEGKEINEKSSQLKNLSELQDTSLQQL VSQRHSTPQNKNAVSVHSNLNSEAVMKSLTQTFATVEVGRWNNNKKSPIKASDLTKFG DCGGSDDEEESTVISVSEDMNSEGNVDFECDTKLYTSAPNTSQGKDNSVLLVLSSDES QQSENSENEEDTLCFVENSGQRESLSGDTGSLSCDNALFVIDTTPGMSADKNFYLEEE DKASEVAIEEEKEEEEDEKSEEDSSDHDENEDEFSDEEDFLNSTKAKLLKLTSSSIDP GLSIKQLGGLYINFNADKLQSNKRTLTQIKEKKKNELLQKAVITPDFEKNHCVPPYSE SKYQLQKKRRKERQKTAGDGWFGMKAPEMTNELKNDLKALKMRASMDPKRFYKKNDRD GFPKYFQIGTIVDNPADFYHSRIPKKQRKRTIVEDCWLILNSEIQPKEVLRDHG" polyA_signal 2342..2347 polyA_site 2363 BASE COUNT 882 a 418 c 527 g 539 t ORIGIN 1 ggcacgagcg agggagccgg aaagatggtg gttaccagat ctgcacgggc taaggccagc 61 atccaagccg cgtcggctga aagttccggg caaaagagtt ttgctgctaa tgggattcaa 121 gcgcatccag aaagtagtac tggatctgat gcccgaacta ctgatgaatc acagaccact 181 gggaagcaaa gtttaatccc tagaactcct aaagctagaa agagtaagag cagaactaca 241 ggctcactac caaaggggac tgaaccatct acggatggag agacctctga ggcagagtca 301 aattattctg tgtctgagca ccatgatacc attttaaggg taactaggag aaggcagatc 361 ttaattgcat gctccccagt gtccagtgtt aggaaaaagc cgaaagtaac tccaacaaag 421 gagtcttaca ctgaagaaat agtgtctgaa gcagaatctc atgtttcagg tatttctaga 481 attgtgcttc ctacagaaaa aactacagga gccagaagaa gtaaggctaa atctctgaca 541 gatccaagcc aagaatctca tacagaagct atatctgatg ctgagacatc aagctcagac 601 atttcattct ctggaattgc aactagaaga accaggagta tgcagaggaa attaaaggca 661 caaactgaaa agaaagatag taagattgta ccaggaaatg agaaacagat cgtgggtaca 721 cctgtgaatt cagaggattc agataccaga caaacttccc atttacaagc aagatctctt 781 tctgagataa ataagccaaa tttctataat aatgactttg atgatgattt ctcccacaga 841 agttcagaaa atatattaac agtgcacgaa caggccaatg ttgaatctct taaagaaaca 901 aaacagaatt gtaaggattt ggatgaagat gccaatggaa taacagatga ggggaaagaa 961 attaatgaga aaagttctca gctgaagaat ctttctgaac ttcaggacac tagccttcaa 1021 cagttagttt ctcagagaca ttcaaccccc caaaataaaa atgctgtatc agtgcactct 1081 aatctgaact ctgaggctgt aatgaaatca ttaactcaaa catttgcaac tgtggaagta 1141 ggcagatgga ataacaacaa aaagagcccc ataaaagcaa gtgacttgac aaagtttggt 1201 gattgtggtg gtagtgatga tgaagaagag tccacagtta taagtgtcag tgaagacatg 1261 aacagtgaag ggaatgtaga ttttgaatgt gataccaaac tatacacgtc tgcgcccaac 1321 acatctcagg gtaaagataa ttctgtctta ctagttctca gcagtgatga aagccaacag 1381 tctgaaaaca gtgagaatga agaggatact ttatgttttg ttgaaaatag tggccaaagg 1441 gagtcattaa gtggagacac aggaagtctg tcatgtgaca atgcattgtt tgtaattgac 1501 acaactcctg gaatgagtgc tgataaaaat ttttacttgg aagaggaaga caaggcaagt 1561 gaggttgcca ttgaggaaga aaaagaagag gaagaggatg aaaaaagtga agaagattca 1621 tcagaccatg acgaaaatga agatgagttt agtgatgaag aagacttcct aaatagcaca 1681 aaggctaaac ttctgaagtt gacaagcagc agcatagacc ctggtctgag tatcaagcag 1741 ttgggtggtt tgtatattaa ttttaatgca gataaactac agtctaacaa gagaacccta 1801 acacagatca aggagaaaaa gaaaaatgag cttctgcaga aagccgtcat tacacctgat 1861 tttgaaaaaa accactgtgt tccaccatat agtgaatcaa agtatcaact tcagaaaaaa 1921 cgcagaaaag aacgacaaaa aacagcaggg gatggctggt ttggtatgaa agctccagaa 1981 atgacaaatg aactgaaaaa tgatctcaaa gcactgaaga tgagagccag catggacccg 2041 aaaagatttt acaagaaaaa tgatagagat ggcttcccca agtacttcca gattggaacc 2101 attgttgaca atccagctga tttctaccat tcacgaattc ccaagaagca aaggaaaaga 2161 actattgtgg aagactgctg gctgattctg aattcagaga tacaaccgaa ggaagtactc 2221 agagatcatg gctgaaaaag cagcaaatgc agcaggaaaa aagttccgaa agaagaagaa 2281 atttcgcaat taagatttac caagcaaact gcaacatttt acattgctcc tttatttact 2341 tattaaagac gtttggaaaa ctaaaa // LOCUS HSU15637 2339 bp mRNA PRI 07-DEC-1994 DEFINITION Human CD40 binding protein (CD40bp) mRNA, complete cds. ACCESSION U15637 NID g595910 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2339) AUTHORS Hu,H.M., O'Rourke,K., Boguski,M.S. and Dixit,V.M. TITLE A novel RING finger protein interacts with the cytoplasmic domain of CD40 JOURNAL J. Biol. Chem. 269, 30069-30072 (1994) MEDLINE 95073988 REFERENCE 2 (bases 1 to 2339) AUTHORS Hu,H.M., O'Rourke,K., Boguski,M.S. and Dixit,V.M. TITLE Direct Submission JOURNAL Submitted (07-OCT-1994) Department of Pathology, University of Michigan Medical School, 1301 Catherine St., Ann Arbor, MI 48109-6092, USA COMMENT CD40 antigen is a cell surface transmembrane 45 kDa glycoprotein receptor expressed on B-lymphocytes. FEATURES Location/Qualifiers source 1..2339 /organism="Homo sapiens" /db_xref="taxon:9606" polyA_site 1..2339 /gene="CD40BP" /note="10 A bases" gene 1..2339 /gene="CD40BP" CDS 211..1914 /gene="CD40BP" /codon_start=1 /product="CD40 binding protein" /db_xref="PID:g595911" /translation="MESSKKMDSPGALQTNPPLKLHTDRSAGTPVFVPEQGGYKEKFV KTVEDKYKCEKCHLVLCSPKQTECGHRFCESCMAALLSSSSPKCTACQESIVKDKVFK DNCCKREILALQIYCRNESRGCAEQLMLGHLVHLKNDCHFEELPCVRPDCKEKVLRKD LRDHVEKACKYREATCSHCKSQVPMIALQKHEDTDCPCVVVSCPHKCSVQTLLRSELS AHLSECVNAPSTCSFKRYGCVFQGTNQQIKAHEASSAVQHVNLLKEWSNSLEKKVSLL QNESVEKNKSIQSLHNQICSFEIEIERQKEMLRNNESKILHLQRVIDSQAEKLKELDK EIRPFRQNWEEADSMKSSVESLQNRVTELESVDKSAGQVARNTGLLESQLSRHDQMLS VHDIRLADMDLGFQVLETASYNGVLIWKIRDYKRRKQEAVMGKTLSLYSQPFYTGYFG YKMCARVYLNGDGMGKGTHLSLFFVIMRGEYDALLPWPFKQKVTLMLMDQGSSRRHLG DAFKPDPNSSSFKKPTGEMNIASGCPVFVAQTVLENGTYIKDDTIFIKVIVDTSDLPD P" misc_feature 355..507 /gene="CD40BP" /note="encodes RING finger domain at amino acids 49-97" misc_feature 508..1006 /gene="CD40BP" /note="encodes Cys/His-rich region at amino acids 98-265" misc_feature 1007..1338 /gene="CD40BP" /note="encodes Coiled-coil domain at amino acids 266-376" misc_feature 1375..1911 /gene="CD40BP" /note="encodes TRAF domain at amino acids 389-567" BASE COUNT 656 a 543 c 644 g 496 t ORIGIN 1 acgaaggcca cgcgcccggc gcccctgagc cggccgagcg gcgacggacc gcgagatgag 61 gaaaatgagg cccaaagaag tgatgccact tggttaaggt cccagagcag gtcagaatca 121 gacctaggat cagaaacctg gctcctggct cctgctccct actcttctaa ggatcgctgt 181 cctgacagaa gagaactcct ctttcctaaa atggagtcga gtaaaaagat ggactctcct 241 ggcgcgctgc agactaaccc gccgctaaag ctgcacactg accgcagtgc tgggacgcca 301 gtttttgtcc ctgaacaagg aggttacaag gaaaagtttg tgaagaccgt ggaggacaag 361 tacaagtgtg agaagtgcca cctggtgctg tgcagcccga agcagaccga gtgtgggcac 421 cgcttctgcg agagctgcat ggcggccctg ctgagctctt caagtccaaa atgtacagcg 481 tgtcaagaga gcatcgttaa agataaggtg tttaaggata attgctgcaa gagagaaatt 541 ctggctcttc agatctattg tcggaatgaa agcagaggtt gtgcagagca gttaatgctg 601 ggacatctgg tgcatttaaa aaatgattgc cattttgaag aacttccatg tgtgcgtcct 661 gactgcaaag aaaaggtctt gaggaaagac ctgcgagacc acgtggagaa ggcgtgtaaa 721 taccgggaag ccacatgcag ccactgcaag agtcaggttc cgatgatcgc gctgcagaaa 781 cacgaagaca ccgactgtcc ctgcgtggtg gtgtcctgcc ctcacaagtg cagcgtccag 841 actctcctga ggagcgagtt gagtgcacac ttgtcagagt gtgtcaatgc ccccagcacc 901 tgtagtttta agcgctatgg ctgcgttttt caggggacaa accagcagat caaggcccac 961 gaggccagct ccgccgtgca gcacgtcaac ctgctgaagg agtggagcaa ctcgctcgaa 1021 aagaaggttt ccttgttgca gaatgaaagt gtagaaaaaa acaagagcat acaaagtttg 1081 cacaatcaga tatgtagctt tgaaattgaa attgagagac aaaaggaaat gcttcgaaat 1141 aatgaatcca aaatccttca tttacagcga gtgatagaca gccaagcaga gaaactgaag 1201 gagcttgaca aggagatccg gcccttccgg cagaactggg aggaagcaga cagcatgaag 1261 agcagcgtgg agtccctcca gaaccgcgtg accgagctgg agagcgtgga caagagcgcg 1321 gggcaagtgg ctcggaacac aggcctgctg gagtcccagc tgagccggca tgaccagatg 1381 ctgagtgtgc acgacatccg cctagccgac atggacctgg gcttccaggt cctggagacc 1441 gccagctaca atggagtgct catctggaag attcgcgact acaagcggcg gaagcaggag 1501 gccgtcatgg ggaagaccct gtccctttac agccagcctt tctacactgg ttactttggc 1561 tataagatgt gtgccagggt ctacctgaac ggggacggga tggggaaggg gacgcacttg 1621 tcgctgtttt ttgtcatcat gcgtggagaa tatgatgccc tgcttccttg gccgtttaag 1681 cagaaagtga cactcatgct gatggatcag gggtcctctc gacgtcattt gggagatgca 1741 ttcaagcccg accccaacag cagcagcttc aagaagccca ctggagagat gaatatcgcc 1801 tctggctgcc cagtctttgt ggcccaaact gttctagaaa atgggacata tattaaagat 1861 gatacaattt ttattaaagt catagtggat acttcggatc tgcccgatcc ctgataagta 1921 gctggggagg tggatttagc agaaggcaac tcctctgggg gatttgaacc ggtctgtctt 1981 cactgaggtc ctcgcgctca gaaaaggacc ttgtgagacg gaggaagcgg cagaaggcgg 2041 acgcgtgccg gcgggaggag ccacgcgaga gcacacctga cacgttttat aatagactag 2101 ccacacttca ctctgaagaa ttatttatcc ttcaacaaga taaatattgc tgtcagagaa 2161 ggttttcatt ttcattttta aagatctagt taattaaggt ggaaaacata tatgctaaac 2221 aaaagaaaca tgatttttct tccttaaact tgaacaccaa aaaaacacac acacacacac 2281 acgtggggat agctggacat gtcagcatgt taagtaaaag gagaatttat gaaatagta // LOCUS HSU15641 1332 bp mRNA PRI 05-APR-1995 DEFINITION Human transcription factor E2F-4 mRNA, complete cds. ACCESSION U15641 NID g758413 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1332) AUTHORS Sardet,C., Vidal,M., Cobrinik,D., Geng,Y., Onufryk,C., Chen,A. and Weinberg,R.A. TITLE E2F-4 and E2F-5, two members of the E2F family, are expressed in the early phases of the cell cycle JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (6), 2403-2407 (1995) MEDLINE 95199358 REFERENCE 2 (bases 1 to 1332) AUTHORS Sardet,C. TITLE Direct Submission JOURNAL Submitted (07-OCT-1994) Claude Sardet, Whitehead Institute, 9 Cambridge Center, Cambridge, MA 02142, USA FEATURES Location/Qualifiers source 1..1332 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 63..1304 /codon_start=1 /function="transcription factor" /product="E2F-4" /db_xref="PID:g758414" /translation="MAEAGPQAPPPPGTPSRHEKSLGLLTTKFVSLLQEAKDGVLDLK LAADTLAVRQKRRIYDITNVLEGIGLIEKKSKNSIQWKGVGPGCNTREIADKLIELKA EIEELQQREQELDQHKVWVQQSIRNVTEDVQNSCLAYVTHEDICRCFAGDTLLAIRAP SGTSLEVPIPEGLNGQKKYQIHLKSVSGPIEVLLVNKEAWSSPPVAVPVPPPEDLLQS PSAVSTPPPLPKPALAQSQEASRPNSPQLTPTAVPGSAEVQGMAGPAAEITVSGGPGT DSKDSGELSSLPLGPTTLDTRPLQSSALLDSSSSSSSSSSSSSNSNSSSSSGPNPSTS FEPIKADPTGVLELPKELSEIFDPTRECMSSELLEELMSSEVFAPLLRLSPPPGDHDY IYNLDESEGVCDLFDVPVLNL" BASE COUNT 297 a 396 c 393 g 246 t ORIGIN 1 gcgcggaagt ggcgcggcgc gcctggcctg gcctggctga ggggaggcgg cgggcgggcg 61 cgatggcgga ggccgggcca caggcgccgc cgcccccggg cactccaagc cggcacgaaa 121 agagcctggg actgctcacc accaagttcg tgtcccttct gcaggaggcc aaggacggcg 181 tgcttgacct caagctggca gctgacaccc tagctgtacg ccagaagcgg cggatttacg 241 acattaccaa tgttttggaa ggtatcgggc taatcgagaa aaagtccaag aacagcatcc 301 agtggaaggg tgtggggcct ggctgcaata cccgggagat tgctgacaaa ctgattgagc 361 tcaaggcaga gatcgaggag ctgcagcagc gggagcaaga actagaccag cacaaggtgt 421 gggtgcagca gagcatccgg aacgtcacag aggacgtgca gaacagctgt ttggcctacg 481 tcactcatga ggacatctgc agatgctttg ctggagatac cctcttggcc atccgggccc 541 catcaggcac cagcctggag gtgcccatcc cagagggtct caatgggcag aagaagtacc 601 agattcacct gaagagtgtg agtggtccca ttgaggttct gctggtgaac aaggaggcat 661 ggagctcacc ccctgtggct gtgcctgtgc caccacctga agatttgctc cagagcccat 721 ctgctgtttc tacacctcca cctctgccca agcctgccct agcccagtcc caggaagcct 781 cacgtccaaa tagtcctcag ctcactccca ctgctgtccc tggcagtgca gaagtccagg 841 gaatggctgg cccagcagct gagatcacag tgagtggcgg ccctgggact gatagcaagg 901 acagtggtga gctcagttca ctcccactgg gcccaacaac actggacacc cggccactgc 961 agtcttctgc cctgctggac agcagcagca gcagcagcag cagcagcagc agcagcagca 1021 acagtaacag cagcagttcg tccggaccca acccttctac ctcctttgag cccatcaagg 1081 cagaccccac aggtgttttg gaactcccca aagagctgtc agaaatcttt gatcccacac 1141 gagagtgcat gagctcggag ctgctggagg agttgatgtc ctcagaagtg tttgcccctc 1201 tgcttcgtct ttctccaccc ccgggagacc acgattatat ctacaacctg gacgagagtg 1261 aaggtgtctg tgacctcttt gatgtgcctg ttctcaacct ctgactgaca gggacatgcc 1321 ctgtgtggct gg // LOCUS HSU15655 2667 bp mRNA PRI 03-FEB-1996 DEFINITION Human ets domain protein ERF mRNA, complete cds. ACCESSION U15655 NID g1015336 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2667) AUTHORS Sgouras,D.N., Athanasiou,M.A., Beal,G.J. Jr., Fisher,R.J., Blair,D.G. and Mavrothalassitis,G.J. TITLE ERF: an ETS domain protein with strong transcriptional repressor activity, can suppress ets-associated tumorigenesis and is regulated by phosphorylation during cell cycle and mitogenic stimulation JOURNAL EMBO J. 14 (19), 4781-4793 (1995) MEDLINE 96030784 REFERENCE 2 (bases 1 to 2667) AUTHORS Mavrothalassitis,G.J. TITLE Direct Submission JOURNAL Submitted (07-OCT-1994) George J. Mavrothalassitis, DCE, LMO, NCI-FCRDC, NCI, NIH, Bldg 469, Frederick, MD 21702, USA FEATURES Location/Qualifiers source 1..2667 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 123..1769 /codon_start=1 /product="ERF" /db_xref="PID:g1015337" /translation="MKTPADTGFAFPDWAYKPESSPGSRQIQLWHFILELLRKEEYQG VIAWQGDYGEFVIKDPDEVARLWGVRKCKPQMNYDKLSRALRYYYNKRILHKTKGKRF TYKFNFNKLVLVNYPFIDVGLAGGAVPQSAPPVPSGGSHFRFPPSTPSEVLSPTEDPR SPPACSSSSSSLFSAVVARRLGRGSVSDCSDGTSELEEPLGEDPRARPPGPPDLGAFR GPPLARLPHDPGVFRVYPRPRGGPEPLSPFPVSPLAGPGSLLPPQLSPALPMTPTHLA YTPSPTLSPMYPSGGGGPSGSGGGSHFSFSPEDMKRYLQAHTQSVYNYHLSPRAFLHY PGLVVPQPQRPDKCPLPPMAPETPPVPSSASSSSSSSSSPFKFKLQRPPLGRRQRAAG EKAVAAADKSGGSAGGLAEGAGALAPPPPPPQIKVEPISEGESEEVEVTDISDEDEED GEVFKTPRAPPAPPKPEPGEAPGASQCMPLKLRFKRRWSEDCRLEGGGGPAGGFEDEG EDKKVRGEGPGEAGGPLTPRRVSSDLQHATAQLSLEHRDS" BASE COUNT 456 a 872 c 829 g 510 t ORIGIN 1 tctgagaggc gaggccgggt gaggcggcga gggcggcccg acgggcgcgg gacgggacgg 61 ggcagcgagg gcgccgggag ccgcggcccg gaatcggggc gcttcgcccc gggcccccca 121 gcatgaagac cccggcggac acagggtttg ccttcccgga ttgggcctac aagccagagt 181 cgtcccctgg ctcaaggcag atccagctgt ggcactttat cctggagctg ctgcggaagg 241 aggagtacca gggcgtcatt gcctggcagg gggactacgg ggaattcgtc atcaaagacc 301 ctgatgaggt ggcccggctg tggggcgttc gcaagtgcaa gccccagatg aattacgaca 361 agctgagccg ggccctgcgc tattactata acaagcgcat tctgcacaag accaagggga 421 aacggttcac ctacaagttc aatttcaaca aactggtgct ggtcaattac ccattcattg 481 atgtggggtt ggctgggggt gcagtgcccc agagtgcccc gccagtgccg tcgggtggta 541 gccacttccg cttccctccc tcaacgccct ccgaggtgct gtcccccacc gaggaccccc 601 gctcaccacc agcctgctct tcatcttcat cttccctctt ctcggctgtg gtggcccgcc 661 gcctgggccg aggctcagtc agtgactgta gtgatggcac gtcagagctg gaggaaccgc 721 tgggagagga tccccgcgcc cgaccacccg gccctccgga tctgggtgcc ttccgagggc 781 ccccgctggc ccgcctgccc catgaccctg gtgtcttccg agtctatccc cggcctcggg 841 gtggccctga acccctcagc cccttccctg tgtcgcctct ggccggtcct ggatccctgc 901 tgccccctca gctctccccg gctctgccca tgacgcccac ccacctggcc tacactccct 961 cgcccacgct gagcccgatg taccccagtg gtggcggggg gcccagcggc tcagggggag 1021 gctcccactt ctccttcagc cctgaggaca tgaaacggta cctgcaggcc cacacccaaa 1081 gcgtctacaa ctaccacctc agcccccgcg ccttcctgca ctaccctggg ctggtggtgc 1141 cccagcccca gcgccctgac aagtgcccgc tgccgcccat ggcacccgag accccaccgg 1201 tcccctcctc ggcctcgtca tcctcttctt cttcttcctc cccattcaag tttaagctcc 1261 agcggccccc actcggacgc cggcagcggg cagctgggga gaaggccgta gccgctgctg 1321 acaagagcgg tggcagtgca ggcgggctgg ctgagggggc aggggcgcta gccccaccgc 1381 ccccgccacc acagatcaag gtggagccca tctcggaagg cgagtcggag gaggtagagg 1441 tgactgacat cagtgatgag gatgaggaag acggggaggt gttcaagacg ccccgtgccc 1501 cacctgcacc ccctaagcct gagcccggcg aggcacccgg ggcatcccag tgcatgcccc 1561 tcaagctacg ctttaagcgg cgctggagtg aagactgtcg cctcgaaggg ggtgggggcc 1621 ccgctggggg ctttgaggat gagggtgagg acaagaaggt gcgtggggag gggcctgggg 1681 aggctggggg gcccctcacc ccaaggcggg tgagctctga cctccagcat gccacggccc 1741 agctctccct ggagcaccga gactcctgag ggctgtgggc aggggacctg tgtgccccgc 1801 accccccatg cttcttttgc tgccttaagc cccctatgcc ctggaggtga gggcagctct 1861 cttgtctctt ccctgcctcc tcccttttcc ctccccacat tttgtataaa actttaattt 1921 ctttttttta aaaatggtgg gggtgggtgg gtgcccaggg ctaggggcta ttccctgtct 1981 ctgtgggttt ctaagctctg ggcaaattgg tggtaggggg agggaggggg aagttaaggg 2041 ggtcacctcc attctgggga atttatattt gaattgaggc tttggcctta acacccagga 2101 acttttctat tacaatcgct taggaagtaa agccttgtct ccctccctgt tctctgcctc 2161 ttgtacccct ctgacccacc cgctctgccc cactcccagc cctcctcagc cccagccctg 2221 cctgccctgc ccctccaggg ggccatgagt gcctaggttt ctcatacccc acaaggtcac 2281 agcaggggag ggagggacaa ttttataatg aaccaaaaat tccatgtgtt ggggggtggg 2341 gggcggagga gggtgagggg tgccgcccat gggccacaaa tctctacaag tgcctgctat 2401 ccctctccca ctccccaccc cagcaccggt ccaacccctt catccccagc tgctcctagg 2461 actggcccat gggcaggcgg gtggggggat gggaaggggg tgccctgaaa ccaaactgga 2521 agccccctct gcctcccagc tggggcctct ggggtggggt ggggggctgt ggtcaagcct 2581 tattctgtat tggggactga gggtgggggg agtagagggg ccgctggaga atgtattcaa 2641 aacaataaac tttggacctt tggaaaa // LOCUS HSU15782 2766 bp mRNA PRI 24-JAN-1995 DEFINITION Human cleavage stimulation factor 77kDa subunit mRNA, complete cds. ACCESSION U15782 NID g632497 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2766) AUTHORS Takagaki,Y. and Manley,J.L. TITLE A subunit of the polyadenylation factor CstF is the human homologue of the Drosophila suppressor of forked protein JOURNAL Nature 372, 471-474 (1994) MEDLINE 95075460 REFERENCE 2 (bases 1 to 2766) AUTHORS Takagaki,Y. TITLE Direct Submission JOURNAL Submitted (12-OCT-1994) Yoshio Takagaki, Department of Biological Sciences, Columbia University, 116th Street, Broadway, New York, NY 10027, USA FEATURES Location/Qualifiers source 1..2766 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pG77-12" /tissue_type="placenta" 5'UTR 1..131 CDS 132..2285 /note="CstF 77kDa subunit" /codon_start=1 /function="formation of functional CstF which is required for cleavage of pre-mRNAs at the polyadenylation sites" /product="cleavage stimulation factor 77kDa subunit" /db_xref="PID:g632498" /translation="MSGDGATEQAAEYVPEKVKKAEKKLEENPYDLDAWSILIREAQN QPIDKARKTYERLVAQFPSSGRFWKLYIEAEIKAKNYDKVEKLFQRCLMKVLHIDLWK CYLSYVRETKGKLPSYKEKMAQAYDFALDKIGMEIMSYQIWVDYINFLKGVEAVGSYA ENQRITAVRRVYQRGCVNPMINIEQLWRDYNKYEEGINIHLAKKMIEDRSRDYMNARR VAKEYETVMKGLDRNAPSVPPQNTPQEAQQVDMWKKYIQWEKSNPLRTEDQTLITKRV MFAYEQCLLVLGHHPDIWYEAAQYLEQSSKLLAEKGDMNNAKLFSDEAANIYERAIST LLKKNMLLYFAYADYEESRMKYEKVHSIYNRLLAIEDIDPTLVYIQYMKFARRAEGIK SGRMIFKKAREDTRTRHHVYVTAALMEYYCSKDKSVAFKIFELGLKKYGDIPEYVLAY IDYLSHLNEDNNTRVLFERVLTSGSLPPEKSGEIWARFLAFESNIGDLASILKVEKRR FTAFKEEYEGKETALLVDRYKFMDLYPCSASELKALGYKDVSRAKLAAIIPDPVVAPS IVPVLKDEVDRKPEYPKPDTQQMIPFQPRHLAPPGLHPVPGGVFPVPPAAVVLMKLLP PPICFQGPFVQVDELMEIFRRCKIPNTVEEAVRIITGGAPELAVEGNGPVESNAVLTK AVKRPNEDSDEDEEKGAVVPPVHDIYRARQQKRIR" 3'UTR 2286..2766 BASE COUNT 862 a 499 c 626 g 779 t ORIGIN 1 cgattggtgt tggcggtctg gctcagctgg gcagggggta actttactga tttgggggtg 61 gtttttagtt taatttttct tttctagctt cccatcgacg gtcagtgcgc acgttgtaat 121 cagctgaggc catgtcagga gacggagcca cggagcaggc agctgagtat gtcccagaga 181 aggtgaagaa agcggaaaag aaattagaag agaatccata tgaccttgat gcttggagca 241 ttctcattcg agaggcacag aatcaaccta tagacaaagc acggaagact tatgaacgcc 301 ttgttgccca gttccccagt tctggcagat tctggaaact gtacattgaa gcagagatta 361 aagctaaaaa ttatgacaag gttgaaaagc tatttcagag atgccttatg aaggttttgc 421 acattgattt atggaagtgt tatctttcat atgtccgaga aaccaagggt aaactaccaa 481 gttacaaaga aaaaatggct caagcatatg actttgcact ggataaaatt ggaatggaaa 541 ttatgtccta tcagatttgg gtggattaca tcaatttcct aaaaggcgtg gaagctgtag 601 gatcttatgc agaaaatcaa agaataacag ctgtccgaag agtttatcaa cgaggttgtg 661 ttaatccgat gatcaacatt gaacagctct ggagagacta taacaagtat gaagagggta 721 tcaatattca tttagctaaa aaaatgattg aagatcggag tagagattat atgaatgcta 781 gacgtgtagc aaaggaatat gagacagtaa tgaaaggctt ggaccgtaat gctccctcgg 841 tgcctcctca gaatactcct caagaagctc aacaagtaga tatgtggaag aaatatatac 901 agtgggaaaa gagcaaccct cttcgtacag aggatcagac ccttataaca aaaagagtta 961 tgtttgctta tgaacagtgc ctgcttgtgc tgggccatca ccctgatatt tggtatgaag 1021 ctgcccagta tcttgagcag tcaagtaaac tgctcgcaga aaagggagat atgaataatg 1081 ccaaattatt tagtgatgaa gctgctaata tatatgaaag agccataagc actttattga 1141 agaagaatat gcttctttat tttgcatatg cagattatga agagagtcgc atgaagtatg 1201 aaaaggttca cagtatatat aacagacttc tggcaattga ggatattgac cctaccttgg 1261 tatatatcca atatatgaaa tttgcacgga gagcagaagg catcaaatct ggaagaatga 1321 tatttaaaaa agcaagagaa gataccagaa cccgccacca tgtctatgtt actgcagcac 1381 tcatggaata ttactgtagt aaggacaaat ctgttgcctt taagattttt gagctggggc 1441 taaaaaaata tggagacatt ccagagtatg tcctggccta tattgactat ctttctcacc 1501 tcaatgagga caataatacc cgagttttgt ttgaacgagt tttaacatct ggaagccttc 1561 ctcctgagaa gtctggagaa atctgggccc gatttctagc atttgaaagt aatattggtg 1621 atctagctag tatactcaaa gtggagaaaa gacggtttac agcattcaaa gaagagtatg 1681 aagggaaaga aacggcttta ctagtagata gatacaagtt catggattta tatccttgct 1741 ctgcaagtga attaaaagca cttggttata aggatgtctc ccgtgctaag ctagcagcta 1801 taattccgga cccagttgta gctccttcta tagtgcctgt tctgaaagat gaagtggata 1861 gaaaaccaga ataccctaaa ccagacactc agcagatgat tccatttcag ccacgacatt 1921 tagcacctcc aggtttacac cctgtacctg gtggagtgtt cccagtccct cctgcagctg 1981 ttgttttaat gaaacttctc cctcctccta tctgtttcca gggtcctttt gtacaagtgg 2041 atgaactgat ggaaattttc cgaagatgca agataccaaa tactgttgag gaagctgtga 2101 ggatcattac tggtggggcc ccagagctag ctgtagaagg caacggcccc gtggaaagta 2161 atgcagtact caccaaggcc gtcaaaaggc ccaacgagga ttcagatgaa gatgaagaaa 2221 agggagccgt tgtcccccct gttcatgaca tttacagagc acggcagcag aagcggattc 2281 ggtagggttt taaacgcctc tgcagaaaac tcctgtccag gattcctttt gcctcaagtg 2341 gtatgtttaa aagagacaac gctttgttac aaggttcttg gaaacaaagt tgtattgtca 2401 ttggtgcctc tatcacatgg ttcttgagaa aaaacaaacc aacctgtgtg aattttagaa 2461 tacggaacag acctatgctc taagcaaaat taggttttca aaaatgtgag aacagtacaa 2521 agtggcagaa ccacattttg ttccctcttc aagggtgtct tgtatgtgcc gcttgaagat 2581 ttgtgagttt ttcaacagtt ttattttaaa aactggatgg cttatgattg taaagcattt 2641 tatcacattt tctgaaaaca attgttcttg gtttgcttat gtagagtcct gccttattgt 2701 ttgtttttat ttatggcaga atgtatgaaa tccgttttgt agtttcaaat tttaaaagtc 2761 ctttaa // LOCUS HSU15932 2469 bp mRNA PRI 04-APR-1995 DEFINITION Human dual-specificity protein phosphatase mRNA, complete cds. ACCESSION U15932 NID g606971 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2469) AUTHORS Ishibashi,T., Bottaro,D.P., Michieli,P., Kelley,C.A. and Aaronson,S.A. TITLE A novel dual specificity phosphatase induced by serum stimulation and heat shock JOURNAL J. Biol. Chem. 269 (47), 29897-29902 (1994) MEDLINE 95050849 REFERENCE 2 (bases 1 to 2469) AUTHORS Bottaro,D.P. TITLE Direct Submission JOURNAL Submitted (13-OCT-1994) Donald P. Bottaro, Lab. Cellular and Molecular Biology, National Cancer Institute, Building 37, Room 1E24, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..2469 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="B23" /clone_lib="Lambda pCEV27 B5/589" /sex="female" /cell_line="B5/589" /cell_type="epithelial cell" /tissue_type="mammary gland" CDS 211..1404 /codon_start=1 /product="protein phosphatase" /db_xref="PID:g606972" /translation="MKVTSLDGGHVRKMLRKEAAARCVVLDCRPYLAFAASNVRGSLN VNLNSVVLRRARGGAVSARYVLPDEARRARLLQEGGGGVAAVVVLDQGSRHWQKLREE SAFVVLTSLLACLPAGPRVYFLKGGYETFYSEYPECCVDVKPISQEKIESERALISQC GKPVVNVSYRPAYDQGGPVEILPFLYLGSAYHASKCEFLANLHITALLNVSRRTSEAC MTHLHYKWIPVEDSHTADISSHFQEAIDFIDCVREKGGKVLVHCEAGISRSPTICMAY LMKTKQFRLKEAFDYIKQRRSMVSPNFGFMGQLLQYESEILPSTPNPQPPSCQGEAAG SSLIGHLQTLSPDMQGAYCTFPASVLARCLPTQQSQSSAEALWQRPNPAKTGMEESAQ PQEQL" BASE COUNT 515 a 679 c 689 g 586 t ORIGIN 1 aatcgcgaaa cccggcgagc ggcgcgctgg ctatcgagcg agcggggcgg aaccgggagt 61 tgcgccgccg ctcgggcgcc gggctccgtc gcggccgcag ccccgcgggt cgccctcccg 121 tgcctcgccc gcggacaccc tggccgtgga caccctggcc gtgggcaccc gcggggcgcg 181 gcgcgggcgc tgcgcggcgg cggcggcggc atgaaggtca cgtcgctcga cggcggccac 241 gtgcgcaaga tgctccgcaa ggaggcggcg gcgcgctgcg tggtgctcga ctgccggccc 301 tatctggcct tcgctgcctc gaacgtgcgc ggctcgctca acgtcaacct caactcggtg 361 gtgctgcggc gggcccgggg cggcgcggtg tcggcgcgct acgtgctgcc cgacgaggcg 421 cggcgcgcgc ggctcctgca ggagggcggc ggcggcgtcg cggccgtggt ggtgctggac 481 cagggcagcc gccactggca gaagctgcga gaggagagcg cgtttgtcgt cctcacctcg 541 ctactcgctt gcctacccgc cggcccgcgg gtctacttcc tcaaaggggg atatgagact 601 ttctactcgg aatatcctga gtgttgcgtg gatgtaaaac ccatttcaca agagaagatt 661 gagagtgaga gagccctcat cagccagtgt ggaaaaccag tggtaaatgt cagctacagg 721 ccagcttatg accagggtgg cccagttgaa atccttccct tcctctacct tggaagtgcc 781 taccatgcat ccaagtgcga gttcctcgcc aacttgcaca tcacagccct gctgaatgtc 841 tcccgacgga cctccgaggc ctgcatgacc cacctacact acaaatggat ccctgtggaa 901 gacagccaca cggctgacat tagctcccac tttcaagaag caatagactt cattgactgt 961 gtcagggaaa agggaggaaa ggtcctggtc cactgtgagg ctgggatctc ccgttcaccc 1021 accatctgca tggcttacct tatgaagacc aagcagttcc gcctgaagga ggccttcgat 1081 tacatcaagc agaggaggag catggtctcg cccaactttg gcttcatggg ccagctcctg 1141 cagtacgaat ctgagatcct gccctccacg cccaaccccc agcctccctc ctgccaaggg 1201 gaggcagcag gctcttcact gataggccat ttgcagacac tgagccctga catgcagggt 1261 gcctactgca cattccctgc ctcggtgctg gcacggtgcc tacccactca acagtctcag 1321 agctcagcag aagccctgtg gcaacggccc aatcctgcta aaactgggat ggaggaatcg 1381 gcccagcccc aagagcaact gtgatttttg tttttaagac tcatggacat ttcatacctg 1441 tgcaatactg aagacctcat tctgtcatgc tgccccagtg agatagtgag tggtcaccag 1501 gcttgcaaat gaacttcaga cggacctcag ggtaggttct cgggactgaa ggaaggccaa 1561 gccattacgg gagcacagca tgtgctgact actgtacttc cagacccctg ccctcttggg 1621 actgcccagt ccttgcacct cagagttcgc cttttcattt caagcataag ccaataaata 1681 cctgcagcaa cgtgggagaa agaagttgct ggaccaggag aaaaggcagt tatgaagcca 1741 attcattttg aaggaagcac aatttccacc ttattttttg aactttggca gtttcaatgt 1801 ctgtctctgt tgcttcgggg cataagctga tcaccgtcta gttgggaaag tcaccctaca 1861 gggtttgtag ggacatgatc agcatcctga tttgaaccct gaaatgttgt gtagacaccc 1921 tcttgggtcc aatgaggtag ttggttgaag tagcaagatg ttggcttttc tggatttttt 1981 ttgccatggg ttcttcactg accttggact ttggcatgat tcttagtcat acttgaactt 2041 gtctcattcc acctcttctc agagcaactc ttcctttggg aaaagagttc ttcagatcat 2101 agaccaaaaa agtcatacct tcgaggtggt agcagtagat tccaggagga gaagggtact 2161 tgctaggtat cctgggtcag tggcggtgca aactggtttc ctcagctgcc tgtccttctg 2221 tgtgcttatg tctcttgtga caattgtttt cctccctgcc cctggaggtt gtcttcaact 2281 gtggacttct gggatttgca gattttgcaa cgtggtacta cttttttttc tttttgtctg 2341 ttagttattt ctccagggga aaaggcaata attttctaag acccgtgtga atgtgaagaa 2401 aagcagtatg ttactggttg ttgttgttgt tcttgttttt tatatgtaaa ataaaaatag 2461 tgaaaggag // LOCUS HSU15939 2178 bp mRNA PRI 26-APR-1996 DEFINITION Human placental folate transporter (hFOLT1) mRNA, complete cds. ACCESSION U15939 NID g1222522 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2178) AUTHORS Prasad,P.D., Ramamoorthy,S., Leibach,F.H. and Ganapathy,V. TITLE Molecular cloning of the human placental folate transporter JOURNAL Biochem. Biophys. Res. Commun. 206 (2), 681-687 (1995) MEDLINE 95126971 REFERENCE 2 (bases 1 to 2178) AUTHORS Yang-Feng,T.L., Ma,Y.Y., Liang,R., Prasad,P.D., Leibach,F.H. and Ganapathy,V. TITLE Assignment of the human folate transporter gene to chromosome 21q22.3 by somatic cell hybrid analysis and in situ hybridization JOURNAL Biochem. Biophys. Res. Commun. 210 (3), 874-879 (1995) MEDLINE 95283551 REFERENCE 3 (bases 1 to 2178) AUTHORS Ganapathy,V. TITLE Direct Submission JOURNAL Submitted (13-OCT-1994) Vadivel Ganapathy, Biochemistry & Molecular Biology, Medical College of Georgia, 1120 15th Street, Augusta, GA 30912, USA FEATURES Location/Qualifiers source 1..2178 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /map="21q22.3" /chromosome="22" gene 95..1870 /gene="hFOLT1" CDS 95..1870 /gene="hFOLT1" /codon_start=1 /product="placental folate transporter" /db_xref="PID:g1222523" /translation="MVPSSPAVEKQVPVEPGPDPELRSWRRLVCYLCFYGFMAQIRPG ESFITPYLLGPDKNFTRDEVTNEITPVLSYSYLAVLVPVFLLTDYLRYTPVLLLQGLS FVSVWLLLLLGHSVAHMQLMELFYSVTMAARIAYSSYIFSLVRPARYQRVAGYSRAAV LLGVFTSSVLGQLLVTVGRVSFSTLNYISLAFLTFSVVLALFLKRPKRSLFFNRDDRG RCETSASELERMNPGPGGKLGHALRVACGDSVLARMLRELGDSLRRPQLPLWSLWWVF NSAGYYLVVYYVHILWNEVDPTTNSARVYNGAADAASTLLGAITSFAAGFVKIRWARW SKLLIAGVTATQAGLVFLLAHTRHPSSIWLCYAAFVLFRGSYQFLVPIATFQIASSLS KELCALVFGVNTFFATIVKTIITFIVSDVRGLGLPVRKQFQLYSVYFLILSIIYFLGA MLDGLRHCQRGHHPRQPPAQGLRSAAEEKAAQALSVQDKGLGGLQPAQSPPLSPEDSL GAVGPASLEQRQSDPYLAQAPAPQAAEFLSPVTTPSPCTLSSAQASGPEAADETCPQL AVHPPGVSKLGLQCLPSDGVQNVNQ" BASE COUNT 307 a 767 c 674 g 430 t ORIGIN 1 ggccgggtcc gggagcccca gggcagccgc cccgccgagt cgcaggcaca gtgtcacctt 61 cgtcccctcc ggagctgcac gtggcctgag caggatggtg ccctccagcc cagcggtgga 121 gaagcaggtg cccgtggaac ctgggcctga ccccgagctc cggtcctggc ggcgcctcgt 181 gtgctacctt tgcttctacg gcttcatggc gcagatacgg ccaggggaga gcttcatcac 241 cccctacctc ctggggcccg acaagaactt cacgcgggac gaggtcacga acgagatcac 301 gccggtgctg tcgtactcct acctggccgt gctggtgccc gtgttcctgc tcaccgacta 361 cctgcgctac acgccggtgc tgctgctgca ggggctcagc ttcgtgtcgg tgtggctgct 421 gctgctgctg ggccactcgg tggcgcacat gcagctcatg gagctcttct acagcgtcac 481 catggccgcg cgcatcgcct attcctccta catcttctct ctcgtgcggc ccgcgcgcta 541 ccagcgtgtg gccggctact cgcgcgctgc ggtgctgctg ggcgtgttca ccagctccgt 601 gctgggccag ctgctggtca ctgtgggccg agtctccttc tccacgctca actacatctc 661 gctggccttc ctcaccttca gcgtggtcct cgccctcttc ctgaagcgcc ccaagcgcag 721 cctcttcttc aaccgcgacg accgggggcg gtgcgaaacc tcggcttcgg agctggagcg 781 catgaatccc ggcccaggcg ggaagctggg acacgccctg cgggtggcct gtggggactc 841 agtgctggcg cggatgctgc gggagctggg ggacagcctg cggcggccgc agctgcccct 901 gtggtccctc tggtgggtct tcaactcggc cggctactac ctggtggtct actacgtgca 961 catcctgtgg aacgaggtgg accccaccac caacagtgcg cgggtctaca acggcgcggc 1021 agatgctgcc tccacgctgc tgggcgccat cacgtccttc gccgcgggct tcgtgaagat 1081 ccgctgggcg cgctggtcca agctgctcat cgcgggcgtc acggccacgc aggcggggct 1141 ggtcttcctt ctggcgcaca cgcgccaccc gagcagcatc tggctgtgct atgcggcctt 1201 cgtgctgttc cgcggctcct accagttcct cgtgcccatc gccacctttc agattgcatc 1261 ttctctgtct aaagagctct gtgccctggt cttcggggtc aacacgttct ttgccaccat 1321 cgtcaagacc atcatcactt tcattgtctc ggacgtgcgg ggcctgggcc tcccggtccg 1381 caagcagttc cagttatact ccgtgtactt cctgatcctg tccatcatct acttcttggg 1441 ggccatgctg gatggcctgc ggcactgcca gcggggccac cacccgcggc agcccccggc 1501 ccagggcctg aggagtgccg cggaggagaa ggcagcacag gcactgagcg tgcaggacaa 1561 gggcctcgga ggcctgcagc cagcccagag cccgccgctt tccccagaag acagcctggg 1621 ggctgtgggg ccagcctccc tggagcagag acagagcgac ccatacctgg cccaggcccc 1681 ggccccgcag gcagctgaat tcctgagccc agtgacaacc ccttccccct gcactctgtc 1741 gtccgcccaa gcctcaggcc ctgaggctgc agatgagact tgtccccagc tggctgtcca 1801 tcctcctggt gtcagcaagc tgggtttgca gtgtcttcca agcgacggtg ttcagaatgt 1861 gaaccagtga ctctcgggcg cccctgtggt aactttgcag gcggccctca gtgcatccca 1921 cgacccctgc ctcgagggcc gcctgcctta gcaatggggg cctccgctta tcctgctagc 1981 aggcccccta ggattccccc tgccctgtgc gcactctggc ggtggccaca gcgtgctggc 2041 gacactcagg gcagctgcct ggccatgctg tccctgcact gtgccccgcg ggctttgttg 2101 ctggaagagg tgggtggtgg gcttctgcgt ccaccaggcc tcactggctc atccgcttgg 2161 ggggcttgag acaaatcc // LOCUS HSU15979 1553 bp mRNA PRI 14-SEP-1995 DEFINITION Human (dlk) mRNA, complete cds. ACCESSION U15979 NID g562105 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Lee,Y.L., Helman,L., Hoffman,T. and Laborda,J. TITLE dlk, pG2 and Pref-1 mRNAs encode similar proteins belonging to the EGF-like superfamily. Identification of polymorphic variants of this RNA JOURNAL Biochim. Biophys. Acta 1261 (2), 223-232 (1995) MEDLINE 95226449 REFERENCE 2 (bases 1 to 1553) AUTHORS Laborda,J. TITLE Direct Submission JOURNAL Submitted (17-OCT-1994) Jorge Laborda, Center for Biologics Evaluation and Research, FDA, Immunoconjugates, Bldg 29, Room 232, 1401 Rockville Pike, Rockville, MD 20852, USA FEATURES Location/Qualifiers source 1..1553 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hdlkaag" /tissue_type="Placenta" gene 174..1322 /gene="dlk" CDS 174..1322 /gene="dlk" /standard_name="Drosophila delta-like" /codon_start=1 /function="Possibly involved in differentiation of several tissues" /evidence=experimental /db_xref="PID:g562106" /translation="MTATEALLRVLLLLLAFGHSTYGAECFPACNPQNGFCEDDNVCR CQPGWQGPLCDQCVTSPGCLHGLCGEPGQCICTDGWDGELCDRDVRACSSAPCANNGT CVSLDGGLYECSCAPGYSGKDCQKKDGPCVINGSPCQHGGTCVDDEGRASHASCLCPP GFSGNFCEIVANSCTPNPCENDGVCTDIGGDFRCRCPAGFIDKTCSRPVTNCASSPCQ NGGTCLQHTQVSYECLCKPEFTGLTCVKKRALSPQQVTRLPSGYGLAYRLTPGVHELP VQQPEHRILKVSMKELNKKTPLLTEGQAICFTILGVLTSLVVLGTVGIVFLNKCETWV SNLRYNHMLRKKNLLLQYNSGEDLAVNIIFPEKIDMTTFSKEAGDEEI" misc_difference 1213 /gene="dlk" /note="3 bp (aag) internal deletion in the CDS with respect to the parental human dlk, GenBank Accession Number Z12172" BASE COUNT 300 a 515 c 439 g 299 t ORIGIN 1 tctaaaggag gtggagagcg caccgcagcc cggtgcagcc cggtgcagcc ctggctttcc 61 cctcgctgcg gcccgtgccc cctttcgcgt ccgcaaccag aagcccagtg cggcgccagg 121 agccggaccc gcgcccgcac cgctcccggg accgcgaccc cggccgccca gagatgaccg 181 cgaccgaagc cctcctgcgc gtcctcttgc tcctgctggc tttcggccac agcacctatg 241 gggctgaatg cttcccggcc tgcaaccccc aaaatggatt ctgcgaggat gacaatgttt 301 gcaggtgcca gcctggctgg cagggtcccc tttgtgacca gtgcgtgacc tctcccggct 361 gccttcacgg actctgtgga gaacccgggc agtgcatttg caccgacggc tgggacgggg 421 agctctgtga tagagatgtt cgggcctgct cctcggcccc ctgtgccaac aacgggacct 481 gcgtgagcct ggacggtggc ctctatgaat gctcctgtgc ccccgggtac tcgggaaagg 541 actgccagaa aaaggacggg ccctgtgtga tcaacggctc cccctgccag cacggaggca 601 cctgcgtgga tgatgagggc cgggcctccc atgcctcctg cctgtgcccc cctggcttct 661 caggcaattt ctgcgagatc gtggccaaca gctgcacccc caacccatgc gagaacgacg 721 gcgtctgcac tgacattggg ggcgacttcc gctgccggtg cccagccggc ttcatcgaca 781 agacctgcag ccgcccggtg accaactgcg ccagcagccc gtgccagaac gggggcacct 841 gcctgcagca cacccaggtg agctacgagt gtctgtgcaa gcccgagttc acaggtctca 901 cctgtgtcaa gaagcgcgcg ctgagccccc agcaggtcac ccgtctgccc agcggctatg 961 ggctggccta ccgcctgacc cctggggtgc acgagctgcc ggtgcagcag ccggagcacc 1021 gcatcctgaa ggtgtccatg aaagagctca acaagaaaac ccctctcctc accgagggcc 1081 aggccatctg cttcaccatc ctgggcgtgc tcaccagcct ggtggtgctg ggcactgtgg 1141 gtatcgtctt cctcaacaag tgcgagacct gggtgtccaa cctgcgctac aaccacatgc 1201 tgcggaagaa gaacctgctg cttcagtaca acagcgggga ggacctggcc gtcaacatca 1261 tcttccccga gaagatcgac atgaccacct tcagcaagga ggccggcgac gaggagatct 1321 aagcagcgtt cccacagccc cctctagatt cttggagttc cgcagagctt actatacgcg 1381 gtctgtccta atctttgtgg tgttcgctat ctcttgtgtc aaatctggtg aacgctacgc 1441 ttacatatat tgtctttgtg ctgctgtgtg acaaacgcaa tgcaaaaaca atcctctttc 1501 tctctcttaa tgcatgatac agaataataa taagaatttc atctttaaat gag // LOCUS HSU16031 3046 bp mRNA PRI 15-DEC-1994 DEFINITION Human transcription factor IL-4 Stat mRNA, complete cds. ACCESSION U16031 NID g559854 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3046) AUTHORS Hou,J., Schindler,U., Henzel,W.J., Ho,T.C., Brasseur,M. and McKnight,S.L. TITLE An interleukin-4-induced transcription factor: IL-4 Stat JOURNAL Science 265 (5179), 1701-1706 (1994) MEDLINE 94367369 REFERENCE 2 (bases 1 to 3046) AUTHORS Schindler,U. TITLE Direct Submission JOURNAL Submitted (18-OCT-1994) Ulrike Schindler, Tularik Inc., 270 East Grand Ave., South San Francisco, CA 94080, USA FEATURES Location/Qualifiers source 1..3046 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="HUVE cells" CDS 166..2709 /codon_start=1 /function="interleukin-4-induced transcription factor" /product="IL-4 Stat" /db_xref="PID:g559855" /translation="MSLWGLVSKMPPEKVQRLYVDFPQHLRHLLGDWLESQPWEFLVG SDAFCCNLASALLSDTVQHLQASVGEQGEGSTILQHISTLESIYQRDPLKLVATFRQI LQGEKKAVMEQFRHLPMPFHWKQEELKFKTGLRRLQHRVGEIHLLREALQKGAEAGQV SLHSLIETPANGTGPSEALAMLLQETTGELEAAKALVLKRIQIWKRQQQLAGNGAPFE ESLAPLQERCESLVDIYSQLQQEVGAAGGELEPKTRASLTGRLDEVLRTLVTSCFLVE KQPPQVLKTQTKFQAGVRFLLGLRFLGAPAKPPLVRADMVTEKQARELSVPQGPGAGA ESTGEIINNTVPLENSIPGNCCSALFKNLLLKKIKRCERKGTESVTEEKCAVLFSASF TLGPGKLPIQLQALSLPLVVIVHGNQDNNAKATILWDNAFSEMDRVPFVVAERVPWEK MCETLNLKFMAEVGTNRGLLPEHFLFLAQKIFNDNSLSMEAFQHRSVSWSQFNKEILL GRGFTFWQWFDGVLDLTKRCLRSYWSDRLIIGFISKQYVTSLLLNEPDGTFLLRFSDS EIGGITIAHVIRGQDGSPQIENIQPFSAKDLSIRSLGDRIRDLAQLKNLYPKKPKDEA FRSHYKPEQMGKDGRGYVPATIKMTVERDQPLPTPELQMPTMVPSYDLGMAPDSSMSM QLGPDMVPQVYPPHSHSIPPYQGLSPEESVNVLSAFQEPHLQMPPSLGQMSLPFDQPH PQGLLPCQPQEHAVSSPDPLLCSDVTMVEDSCLSQPVTAFPQGTWIGEDIFPPLLPPT EQDLTKLLLEGQGESGGGSLGAQPLLQPSHYGQSGISMSHMDLRANPSW" BASE COUNT 670 a 904 c 860 g 612 t ORIGIN 1 atcttatttt tctttttggt ggtggtggtg gaagggggga ggtgctagca gggccagcct 61 tgaactcgct ggacagagct acagacctat ggggcctgga agtgcccgct gagaaaggga 121 gaagacagca gaggggttgc cgaggcaacc tccaagtccc agatcatgtc tctgtggggt 181 ctggtctcca agatgccccc agaaaaagtg cagcggctct atgtcgactt tccccaacac 241 ctgcggcatc ttctgggtga ctggctggag agccagccct gggagttcct ggtcggctcc 301 gacgccttct gctgcaactt ggctagtgcc ctactttcag acactgtcca gcaccttcag 361 gcctcggtgg gagagcaggg ggaggggagc accatcttgc aacacatcag cacccttgag 421 agcatatatc agagggaccc cctgaagctg gtggccactt tcagacaaat acttcaagga 481 gagaaaaaag ctgttatgga acagttccgc cacttgccaa tgcctttcca ctggaagcag 541 gaagaactca agtttaagac aggcttgcgg aggctgcagc accgagtagg ggagatccac 601 cttctccgag aagccctgca gaagggggct gaggctggcc aagtgtctct gcacagcttg 661 atagaaactc ctgctaatgg gactgggcca agtgaggccc tggccatgct actgcaggag 721 accactggag agctagaggc agccaaagcc ctagtgctga agaggatcca gatttggaaa 781 cggcagcagc agctggcagg gaatggcgca ccgtttgagg agagcctggc cccactccag 841 gagaggtgtg aaagcctggt ggacatttat tcccagctac agcaggaggt aggggcggct 901 ggtggggagc ttgagcccaa gacccgggca tcgctgactg gccggctgga tgaagtcctg 961 agaaccctcg tcaccagttg cttcctggtg gagaagcagc ccccccaggt actgaagact 1021 cagaccaagt tccaggctgg agttcgattc ctgttgggct tgaggttcct gggggcccca 1081 gccaagcctc cgctggtcag ggccgacatg gtgacagaga agcaggcgcg ggagctgagt 1141 gtgcctcagg gtcctggggc tggagcagaa agcactggag aaatcatcaa caacactgtg 1201 cccttggaga acagcattcc tgggaactgc tgctctgccc tgttcaagaa cctgcttctc 1261 aagaagatca agcggtgtga gcggaagggc actgagtctg tcacagagga gaagtgcgct 1321 gtgctcttct ctgccagctt cacacttggc cccggcaaac tccccatcca gctccaggcc 1381 ctgtctctgc ccctggtggt catcgtccat ggcaaccaag acaacaatgc caaagccact 1441 atcctgtggg acaatgcctt ctctgagatg gaccgcgtgc cctttgtggt ggctgagcgg 1501 gtgccctggg agaagatgtg tgaaactctg aacctgaagt tcatggctga ggtggggacc 1561 aaccgggggc tgctcccaga gcacttcctc ttcctggccc agaagatctt caatgacaac 1621 agcctcagta tggaggcctt ccagcaccgt tctgtgtcct ggtcgcagtt caacaaggag 1681 atcctgctgg gccgtggctt caccttttgg cagtggtttg atggtgtcct ggacctcacc 1741 aaacgctgtc tccggagcta ctggtctgac cggctgatca ttggcttcat cagcaaacag 1801 tacgttacta gccttcttct caatgagccc gacggaacct ttctcctccg cttcagcgac 1861 tcagagattg ggggcatcac cattgcccat gtcatccggg gccaggatgg ctctccacag 1921 atagagaaca tccagccatt ctctgccaaa gacctgtcca ttcgctcact gggggaccga 1981 atccgggatc ttgctcagct caaaaatctc tatcccaaga agcccaagga tgaggctttc 2041 cggagccact acaagcctga acagatgggt aaggatggca ggggttatgt cccagctacc 2101 atcaagatga ccgtggaaag ggaccaacca cttcctaccc cagagctcca gatgcctacc 2161 atggtgcctt cttatgacct tggaatggcc cctgattcct ccatgagcat gcagcttggc 2221 ccagatatgg tgccccaggt gtacccacca cactctcact ccatcccccc gtatcaaggc 2281 ctctccccag aagaatcagt caacgtgttg tcagccttcc aggagcctca cctgcagatg 2341 ccccccagcc tgggccagat gagcctgccc tttgaccagc ctcaccccca gggcctgctg 2401 ccgtgccagc ctcaggagca tgctgtgtcc agccctgacc ccctgctctg ctcagatgtg 2461 accatggtgg aagacagctg cctgagccag ccagtgacag cgtttcctca gggcacttgg 2521 attggtgaag acatattccc tcctctgctg cctcccactg aacaggacct cactaagctt 2581 ctcctggagg ggcaagggga gtcgggggga gggtccttgg gggcacagcc cctcctgcag 2641 ccctcccact atgggcaatc tgggatctca atgtcccaca tggacctaag ggccaacccc 2701 agttggtgat cccagctgga gggagaaccc aaagagacag ctcttctact acccccacag 2761 acctgctctg gacacttgct catgccctgc caagcagcag atggggaggg tgccctccta 2821 tccccaccta ctcctgggtc aggaggaaaa gactaacagg agaatgcaca gtgggtggag 2881 ccaatccact ccttcctttc tatcattccc ctgcccacct ccttccagca ctgactggaa 2941 gggaagttca ggctctgaga cacgccccaa catgcctgca cctgcagcgc gcacacgcac 3001 gcacacacac atacagagct ctctgagggt gatggggctg agcagg // LOCUS HSU16125 2718 bp mRNA PRI 04-APR-1996 DEFINITION Human glutamate/kainate receptor subunit (EEA3) mRNA, complete cds. ACCESSION U16125 NID g790529 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2718) AUTHORS Korczak,B., Nutt,S.L., Fletcher,E.J., Hoo,K.H., Elliott,C.E., Rampersad,V., McWhinnie,E.A. and Kamboj,R.K. TITLE cDNA cloning and functional properties of human glutamate receptor EAA3 (GluR5) in homomeric and heteromeric configuration JOURNAL Recept. Channels 3 (1), 41-49 (1995) MEDLINE 96172461 REFERENCE 2 (bases 1 to 2718) AUTHORS Kamboj,R.K. TITLE Direct Submission JOURNAL Submitted (19-OCT-1994) Rajender K. Kamboj, Allelix Biopharmaceuticals Inc., 6850 Goreway Drive, Mississauga, Ontario L4V 1V7, Canada FEATURES Location/Qualifiers source 1..2718 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="RKCSFG72, RKCS5F81, RKCS221, RKC41, RKS71" /clone_lib="Stratagene #936206" /tissue_type="brain" /dev_stage="fetus" gene 1..2718 /gene="EAA3" CDS 1..2718 /gene="EAA3" /function="glutamate/kainate receptor subunit" /codon_start=1 /evidence=experimental /product="EAA3" /db_xref="PID:g790530" /translation="MEHGTLLAQPGLWTRDTSWALLYFLCYILPQTAPQVLRIGGIFE TVENEPVNVEELAFKFAVTSINRNRTLMPNTTLTYDIQRINLFDSFEASRRACDQLAL GVAALFGPSHSSSVSAVQSICNALEVPHIQTRWKHPSVDNKDLFYINLYPDYAAISRA ILDLVLYYNWKTVTVVYEDSTGLIRLQELIKAPSRYNIKIKIRQLPSGNKDAKPLLKE MKKGKEFYVIFDCSHETAAEILKQILFMGMMTEYYHYFFTTLDLFALDLELYRYSGVN MTGFGLLNIDNPHVSSIIEKWSMERLQAPPRPETGLLDGMMTTEAALMYDAVYMVAIA SHRASQLTVSSLQCHRHKPWRLGPRFMNLIKEARWDGLTGHITFNKTNGLRKDFDLDI ISLKEEGTEKIGIWNSNSGLNMTDSNKDKSSNITDSLANRTLIVTTILEEPYVMYRKS DKPLYGNDRFEGYCLDLLKELSNILGFIYDVKLVPDGKYGAQNDKGEWNGMVKELIDH RADLAVAPLTITYVREKVIDFSKPFMTLGISILYRKPNGTNPGVFSFLNPLSPDIWMY VLLACLGVSCVLFVIARFTPYEWYNPHPCNPDSDVVENNFTLLNSFWFGVGALMQQGS ELMPKALSTRIVGGIWWFFTLIIISSYTANLAAFLTVERMESPIDSADDLAKQTKIEY GAVRDGSTMTFFKKSKISTYEKMWAFMSSRQQTALVRNSDEGIQRVLTTDYALLMEST SIEYVTQRNCNLTQIGGLIDSKGYGVGTPIGSPYRDKITIAILQLQEEGKLHMMKEKW WRGNGCPEEDNKEASALGVENIGGIFIVLAAGLVLSVFVAIGEFIYKSRKNNDIEQCL SFNAIMEELGISLKNQKKIKKKSRTKGKSSFTSILTCHQRRTQRKETVA" BASE COUNT 774 a 638 c 636 g 670 t ORIGIN 1 atggagcacg gcacactcct cgcccagccc gggctctgga ccagggacac cagctgggca 61 ctcctctatt tcctctgcta tatcctccct cagaccgccc cgcaagtact caggatcgga 121 gggatttttg aaacagtgga aaatgagcct gttaatgttg aagaattagc tttcaagttt 181 gcagtcacca gcattaacag aaaccgaacc ctgatgccta acaccacatt aacctatgac 241 atccagagaa ttaacctttt tgatagtttt gaagcctcgc ggagagcatg tgaccagctg 301 gctcttggtg tggctgctct ctttggccct tcccatagct cctccgtcag tgctgtgcag 361 tctatttgca atgctctcga agttccacac atacagaccc gctggaaaca cccctcggtg 421 gacaacaaag atttgtttta catcaacctt tacccagatt atgcagctat cagcagggcg 481 atcctggatc tggtcctcta ttacaactgg aaaacagtga cagtggtgta tgaagacagc 541 acaggtctaa ttcgtctaca agagctcatc aaagctccct ccagatataa tattaaaatc 601 aaaatccgcc agctgccctc tgggaataaa gatgccaagc ctttactcaa ggagatgaag 661 aaaggcaagg agttctatgt gatatttgat tgttcacatg aaacagccgc tgaaatcctt 721 aagcagattc tgttcatggg catgatgacc gaatactatc actacttttt cacaaccctg 781 gacttatttg ctttggatct ggaactctat aggtacagtg gcgtaaacat gaccgggttt 841 gggctgctta acattgacaa ccctcacgtg tcatccatca ttgagaagtg gtccatggag 901 agactgcagg ccccacccag gcccgagact ggccttttgg atggcatgat gacaactgaa 961 gcggctctga tgtacgatgc tgtgtacatg gtggccattg cctcgcaccg ggcatcccag 1021 ctgaccgtca gctccctgca gtgccataga cataagccat ggcgcctcgg acccagattt 1081 atgaacctga tcaaagaggc ccggtgggat ggcttgactg ggcatatcac ctttaataaa 1141 accaatggct tgaggaagga ttttgatctg gacattatta gtctcaaaga ggaaggaact 1201 gaaaagattg ggatttggaa ttccaacagt gggcttaaca tgacggacag caacaaagac 1261 aagtccagca atatcactga ttcattggcc aacagaacac tcattgtcac caccattctg 1321 gaagaaccct atgttatgta caggaaatct gataagcctc tatatggaaa tgacagattt 1381 gaaggatatt gcctagacct gttgaaagaa ttgtcaaaca tcctgggttt catttatgat 1441 gttaaactag ttcccgatgg caaatatggg gcccagaatg acaaagggga gtggaacggg 1501 atggttaaag aactcataga tcacagggct gacctggcag tggctcctct taccatcacc 1561 tacgtgcggg agaaagtcat tgacttctcc aaacccttca tgaccctagg catcagcatt 1621 ctctaccgga agcccaatgg taccaatcca ggcgttttct ccttcctcaa ccccctgtct 1681 ccagatattt ggatgtatgt gctcttagcc tgcttgggag tcagctgtgt actctttgtg 1741 attgcaaggt ttacacccta cgagtggtat aacccccacc catgcaaccc tgactcagac 1801 gtggtggaaa acaattttac tttactaaat agtttctggt ttggagttgg agctctcatg 1861 cagcaaggat cagagctgat gcccaaagct ctatcgacca gaatagttgg agggatatgg 1921 tggtttttca ccctaatcat catttcatcc tacacggcca atctggctgc cttcttgaca 1981 gtagagagaa tggaatcccc catagattcg gcagatgatc tggcaaagca aaccaagata 2041 gaatatgggg cggttagaga tggatcaaca atgaccttct tcaagaaatc aaaaatctcc 2101 acctatgaga agatgtgggc tttcatgagc agcaggcagc agaccgccct ggtaagaaac 2161 agtgatgagg ggatccagag agtgctcacc acagactacg cgctgctgat ggagtccacc 2221 agcattgagt atgtgacgca gagaaactgc aacctcactc agatcggggg cctcattgac 2281 tccaaaggtt acggagtggg aacacctatt ggttctcctt accgggataa aattactatt 2341 gctattcttc aactccaaga agaagggaag ctgcatatga tgaaagagaa gtggtggcgt 2401 gggaatggct gccccgagga agacaacaaa gaagccagtg ccctgggagt ggaaaatatt 2461 ggaggcatct tcattgttct ggctgccgga ctggtccttt ctgtatttgt agctattgga 2521 gaattcatat acaaatcacg gaagaataat gatattgaac agtgtctctc tttcaacgct 2581 atcatggaag aactgggaat ctcactgaag aatcagaaaa aaataaagaa aaagtcaaga 2641 actaagggga aatcttcctt cacaagtatc cttacttgtc atcagagacg aactcagaga 2701 aaagagactg tggcgtga // LOCUS HSU16126 2727 bp mRNA PRI 04-APR-1996 DEFINITION Human glutamate/kainate receptor subunit (EAA4) mRNA, complete cds. ACCESSION U16126 NID g790531 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2727) AUTHORS Hoo,K.H., Nutt,S.L., Fletcher,E.J., Elliott,C.E., Korczak,B., Deverill,R.M., Rampersad,V., Fantaske,R.P. and Kamboj,R.K. TITLE Functional expression and pharmacological characterization of the human EAA4 (GluR6) glutamate receptor: a kainate selective channel subunit JOURNAL Recept. Channels 2 (4), 327-337 (1994) MEDLINE 95236039 REFERENCE 2 (bases 1 to 2727) AUTHORS Kamboj,R.K. TITLE Direct Submission JOURNAL Submitted (19-OCT-1994) Rajender K. Kamboj, Allelix Biopharmaceuticals Inc., 6850 Goreway Drive, Mississauga, Ontario L4V 1V7, Canada FEATURES Location/Qualifiers source 1..2727 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="RKCS5F94" /clone_lib="Stratagene #936206" /tissue_type="brain" /dev_stage="fetus" gene 1..2727 /gene="EAA4" CDS 1..2727 /gene="EAA4" /function="glutamate/kainate receptor subunit" /codon_start=1 /evidence=experimental /product="EAA4" /db_xref="PID:g790532" /translation="MKIIFPILSNPVFRRTVKLLLCLLWIGYSQGTTHVLRFGGIFEY VESGPMGAEELAFRFAVNTINRNRTLLPNTTLTYDTQKINLYDSFEASKKACDQLSLG VAAIFGPSHSSSANAVQSICNALGVPHIQTRWKHQVSDNKDSFYVSLYPDFSSLSRAI LDLVQFFKWKTVTVVYDDSTGLIRLQELIKAPSRYNLRLKIRQLPADTKDAKPLLKEM KRGKEFHVIFDCSHEMAAGILKQALAMGMMTEYYHYIFTTLDLFALDVEPYRYSGVNM TGFRILNTENTQVSSIIEKWSMERLQAPPKPDSGLLDGFMTTDAALMYDAVHVVSVAV QQFPQMTVSSLQCNRHKPWRFGTRFMSLIKEAHWEGLTGRITFNKTNGLRTDFDLDVI SLKEEGLEKIGTWDPASGLNMTESQKGKPANITDSLSNRSLIVTTILEEPYVLFKKSD KPLYGNDRFEGYCIDLLRELSTILGFTYEIRLVEDGKYGAQDDANGQWNGMVRELIDH KADLAVAPLAITYVREKVIDFSKPFMTLGISILYRKPNGTNPGVFSFLNPLSPDIWMY ILLAYLGVSCVLFVIARFSPYEWYNPHPCNPDSDVVENNFTLLNSFWFGVGALMQQGS ELMPKALSTRIVGGIWWFFTLIIISSYTANLAAFLTVERMESPIDSADDLAKQTKIEY GAVEDGATMTFFKKSKISTYDKMWAFMSSRRQSVLVKSNEEGIQRVLTSDYAFLMEST TIEFVTQRNCNLTQIGGLIDSKGYGVGTPMGSPYRDKITIAILQLQEEGKLHMMKEKW WRGNGCPEEESKEASALGVQNIGGIFIVLAAGLVLSVFVAVGEFLYKSKKNAQLEKRS FCSAMVEELRMSLKCQRRLKHKPQAPVIVKTEEVINMHTFNDRRLPGKETMA" BASE COUNT 781 a 579 c 618 g 749 t ORIGIN 1 atgaagatta ttttcccgat tctaagtaat ccagtcttca ggcgcaccgt taaactcctg 61 ctctgtttac tgtggattgg atattctcaa ggaaccacac atgtattaag atttggtggt 121 atttttgaat atgtggaatc tggcccaatg ggagctgagg aacttgcatt cagatttgct 181 gtgaacacaa ttaacagaaa cagaacattg ctacccaata ctacccttac ctatgatacc 241 cagaagataa acctttatga tagttttgaa gcatccaaga aagcctgtga tcagctgtct 301 cttggggtgg ctgccatctt cgggccttca cacagctcat cagcaaacgc agtgcagtcc 361 atctgcaatg ctctgggagt tccccacata cagacccgct ggaagcacca ggtgtcagac 421 aacaaagatt ccttctatgt cagtctctac ccagacttct cttcactcag ccgtgccatt 481 ttagacctgg tgcagttttt caagtggaaa accgtcacgg ttgtgtatga tgacagcact 541 ggtctcattc gtttgcaaga gctcatcaaa gctccatcaa ggtataatct tcgactcaaa 601 attcgtcagt tacctgctga tacaaaggat gcaaaaccct tactaaaaga aatgaaaaga 661 ggcaaggagt ttcatgtaat ctttgattgt agccatgaaa tggcagcagg cattttaaaa 721 caggcattag ctatgggaat gatgacagaa tactatcatt atatctttac cactctggac 781 ctctttgctc ttgatgttga gccctaccga tacagtggtg ttaacatgac agggttcaga 841 atattaaata cagaaaatac ccaagtctcc tccatcattg aaaagtggtc gatggaacga 901 ttgcaggcac ctccgaaacc cgattcaggt ttgctggatg gatttatgac gactgatgct 961 gctctaatgt atgatgctgt gcatgtggtg tctgtggccg ttcaacagtt tccccagatg 1021 acagtcagtt ccttgcagtg taatcgacat aaaccctggc gcttcgggac ccgctttatg 1081 agtctaatta aagaggcaca ttgggaaggc ctcacaggca gaataacttt caacaaaacc 1141 aatggcttga gaacagattt tgatttggat gtgatcagtc tgaaggaaga aggtctagaa 1201 aagattggaa cgtgggatcc agccagtggc ctgaatatga cagaaagtca aaagggaaag 1261 ccagcgaaca tcacagattc cttatccaat cgttctttga ttgttaccac cattttggaa 1321 gagccttatg tcctttttaa gaagtctgac aaacctctct atggtaatga tcgatttgaa 1381 ggctattgca ttgatctcct cagagagtta tctacaatcc ttggctttac atatgaaatt 1441 agacttgtgg aagatgggaa atatggagcc caggatgatg ccaatggaca atggaatgga 1501 atggttcgtg aactaattga tcataaagct gaccttgcag ttgctccact ggctattacc 1561 tatgttcgag agaaggtcat cgacttttcc aagcccttta tgacacttgg aataagtatt 1621 ttgtaccgca agcccaatgg tacaaaccca ggcgtcttct ccttcctgaa tcctctctcc 1681 cctgatatct ggatgtatat tctgctggct tacttgggtg tcagttgtgt gctctttgtc 1741 atagccaggt ttagtcctta tgagtggtat aatccacacc cttgcaaccc tgactcagac 1801 gtggtggaaa acaattttac cttgctaaat agtttctggt ttggagttgg agctctcatg 1861 cagcaaggtt ctgagctcat gcccaaagca ctgtccacca ggatagtggg aggcatttgg 1921 tggtttttca cacttatcat catttcttcg tatactgcta acttagccgc ctttctgaca 1981 gtggaacgca tggaatcccc tattgactct gctgatgatt tagctaaaca aaccaagata 2041 gaatatggag cagtagagga tggtgcaacc atgacttttt tcaagaaatc aaaaatctcc 2101 acgtatgaca aaatgtgggc ctttatgagt agcagaaggc agtcagtgct ggtcaaaagt 2161 aatgaagaag gaatccagcg agtcctcacc tctgattatg ctttcctaat ggagtcaaca 2221 accatcgagt ttgttaccca gcggaactgt aacctgacac agattggcgg ccttatagac 2281 tctaaaggtt atggcgttgg cactcccatg ggttctccat atcgagacaa aattaccata 2341 gcaattcttc agctgcaaga ggaaggcaaa ctgcatatga tgaaggagaa atggtggagg 2401 ggcaatggtt gcccagaaga ggaaagcaaa gaggccagtg ccctgggggt tcagaatatt 2461 ggtggcatct tcattgttct ggcagccggc ttggtgcttt cagtttttgt ggcagtggga 2521 gaatttttat acaaatccaa aaaaaacgct caattggaaa agaggtcctt ctgtagtgcc 2581 atggtagaag aattgaggat gtccctgaag tgccagcgtc ggttaaaaca taagccacag 2641 gccccagtta ttgtgaaaac agaagaagtt atcaacatgc acacatttaa cgacagaagg 2701 ttgccaggta aagaaaccat ggcataa // LOCUS HSU16127 3614 bp mRNA PRI 04-APR-1996 DEFINITION Human glutamate/kainate receptor subunit (EAA5) mRNA, complete cds. ACCESSION U16127 NID g790533 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3614) AUTHORS Nutt,S.L., Hoo,K.H., Rampersad,V., Deverill,R.M., Elliott,C.E., Fletcher,E.J., Adams,S.L., Korczak,B., Foldes,R.L. and Kamboj,R.K. TITLE Molecular characterization of the human EAA5 (GluR7) receptor: a high-affinity kainate receptor with novel potential RNA editing sites JOURNAL Recept. Channels 2 (4), 315-326 (1994) MEDLINE 95236038 REFERENCE 2 (bases 1 to 3614) AUTHORS Kamboj,R.K. TITLE Direct Submission JOURNAL Submitted (19-OCT-1994) Rajender K. Kamboj, Allelix Biopharmaceuticals Inc., 6850 Goreway Drive, Mississauga, Ontario L4V 1V7, Canada FEATURES Location/Qualifiers source 1..3614 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="RKCS5F131, RKCS5F81, RKCAG132, RKCAG112" /clone_lib="Stratagene #936206" /tissue_type="brain" /dev_stage="fetus" 5'UTR <1..18 /evidence=experimental gene 19..2778 /gene="EAA5" CDS 19..2778 /gene="EAA5" /function="glutamate/kainate receptor" /codon_start=1 /evidence=experimental /product="EAA5" /db_xref="PID:g790534" /translation="MTAPWRRLRSLVWEYWAGLLVCAFWIPDSRGMPHVIRIGGIFEY ADGPNAQVMNAEEHAFRFSANIINRNRTLLPNTTLTYDIQRIHFHDSFEATKKACDQL ALGVVAIFGPSQGSCTNAVQSICNALEVPHIQLRWKHHPLDNKDTFYVNLYPDYASLS HAILDLVQYLKWRSATVVYDDSTGLIRLQELIMAPSRYNIRLKIRQLPIDSDDSRPLL KEMKRGREFRIIFDCSHTMAAQILKQAMAMGMMTEYYHFIFTTLDLYALDLEPYRYSG VNLTGFRILNVDNPHVSAIVEKWSMERLQAAPRAESGLLDGVMMTDAALLYDAVHIVS VCYQRAPQMTVNSLQCHRHKAWRFGGRFMNFIKEAQWEGLTGRIVFNKTSGLRTDFDL DIISLKEDGLEKVGVWSPADGLNITEVAKGRGPNVTDSLTNRSLIVTTVLEEPFVMFR KSDRTLYGNDRFEGYCIDLLKELAHILGFSYEIRLVEDGKYGAQDDKGQWNGMVKELI DHKADLAVAPLTITHVREKAIDFSKPFMTLGVSILYRKPNGTNPSVFSFLNPLSPDIW MYVLLAYLGVSCVLFVIARFSPYEWYDAHPCNPGSEVVENNFTLLNSFWFGMGSLMQQ GSVLMPKALSTRIIGGIWWFFTLIIISSYTANLAAFLTVERMESPIDSADDLAKQTKI EYGAVKDGATMTFFKKSKISTFEKMWAFMSRKPSALVKNNEEGIQRALTADYALLMES TTIEYVTQRNCNLTQIGGLIDSKGYGIGTPMGSPYRDKITIAILQLQEEDKLHIMKEK WWRGSGCPEEENKEASALGIQKIGGIFIVLAAGLVLSVLVAVGEFVYKLRKTAEREQR SFCSTVADEIRFSLTCQRRVKHKPQPPMMVKTDAVINMHTFNDRRLPGKDSMACSTSL APVFP" misc_difference 1073 /gene="EAA5" /note="RKCAG132" /replace="a" 3'UTR 2778..>3614 /evidence=experimental BASE COUNT 811 a 1077 c 988 g 738 t ORIGIN 1 cctcggcggc gcccaacgat gaccgctccc tggcggcgcc tccggagtct ggtttgggaa 61 tactgggccg ggctcctcgt gtgcgccttc tggatcccgg actcgcgcgg gatgccccac 121 gtcatccgga tcggaggaat cttcgagtat gcggacggcc ccaacgccca ggtcatgaat 181 gccgaggagc atgcctttcg attttctgcc aacatcatca acaggaacag gactctgctg 241 cccaacacaa ccttgaccta tgacatacag aggattcact tccatgacag cttcgaggcg 301 accaaaaagg cctgtgacca gctggcactg ggcgtggtgg cgatcttcgg cccatcacag 361 ggctcctgca ccaatgccgt ccagtccatc tgcaatgccc tggaggtgcc ccacatccag 421 ctgcgttgga agcaccaccc gctggacaac aaggacacct tctacgtgaa cctctacccc 481 gactacgcct cgctcagcca tgccatcctc gacctggtcc agtacctcaa gtggcggtca 541 gccaccgtgg tctatgacga cagtacaggg ctcatccgac tgcaggagct catcatggcc 601 ccatcaagat acaacatccg cctgaagatc cgtcagctcc ccatcgactc tgacgactcg 661 cgccccttgc tcaaggagat gaagcgaggc cgggaattcc gcattatctt cgactgcagc 721 cacactatgg cggcccagat cctcaagcag gccatggcca tgggcatgat gactgagtac 781 taccacttca tcttcaccac tctggatctc tacgctttag acctggagcc ctaccgctac 841 tcaggcgtga acctgacagg attccggatt ctcaatgtgg acaacccaca cgtctcggcc 901 attgtggaga agtggtccat ggagcggctg caggcagctc cccgggccga gtctggcctg 961 ctggatggag tgatgatgac tgatgcagcc ttactgtacg acgccgtcca tatcgtgtcc 1021 gtgtgctacc agcgggcacc acagatgacc gtgaactccc tgcagtgcca tcggcacaag 1081 gcctggcgct ttggcggccg cttcatgaac ttcatcaagg aggctcaatg ggaaggatta 1141 actggacgaa ttgttttcaa caaaactagt ggcttgcgga cggattttga tctggacatc 1201 atcagcctga aagaggatgg cctggagaag gttggggtgt ggagtcctgc cgacgggctc 1261 aacatcactg aggttgccaa aggccgaggc cctaatgtca ccgactctct gacaaacaga 1321 tcactcattg tcaccacagt gctggaggag cccttcgtca tgtttcggaa atcagacagg 1381 acgctatatg ggaatgaccg gttcgagggc tactgcatcg acctgctaaa ggagctggcc 1441 cacatccttg gtttctccta tgagatccgg ctggtggagg acggcaagta cggggcacag 1501 gatgacaagg gccagtggaa cggcatggtc aaggagctca tcgaccacaa ggcagatctg 1561 gccgtggccc ccctgaccat cacccatgtt cgagagaagg ccatcgactt ctccaagccc 1621 ttcatgacac ttggtgtgag catcctgtat cgaaagccca atggcaccaa ccccagcgtc 1681 ttctccttcc tcaatcccct gtccccagac atctggatgt atgttctcct cgcctacctg 1741 ggggtcagct gtgtcctctt cgtcatcgcc aggttcagcc cttatgagtg gtacgatgct 1801 cacccctgca accctggctc cgaggtggtg gaaaataact tcactctgct taacagcttc 1861 tggtttggaa tgggatccct gatgcagcaa gggtctgtgc tgatgcccaa agccctgtcc 1921 acacgcatca ttggtggcat ctggtggttc tttacgctca tcatcatctc ttcctacacg 1981 gccaacctgg ctgcctttct gaccgtggag cgcatggaat cacccattga ctctgctgat 2041 gacctggcca agcaaaccaa aatcgagtat ggggctgtca aggatggggc caccatgacc 2101 ttcttcaaga aatccaagat ctccaccttc gagaagatgt gggccttcat gagcaggaag 2161 ccatcggcgc tggtgaagaa caacgaggag ggcatccaga gggccctgac ggccgactac 2221 gcgctgctca tggagtccac caccatcgag tacgtcacgc agaggaactg caacctcacc 2281 cagatcgggg gcctcattga ctccaagggc tacggcatcg gcacgcccat gggctcccca 2341 taccgggaca agatcaccat cgccatcctg cagcttcagg aggaggacaa gctgcatatc 2401 atgaaggaga agtggtggcg gggcagcggg tgtcctgagg aggaaaacaa agaggccagt 2461 gccctgggga tccagaagat cgggggcatc ttcattgtcc tggccgccgg gctggtcctc 2521 tctgtgctgg tggccgtggg cgagtttgtg tacaagctcc gcaaaacagc agagagagag 2581 cagcgttcct tctgcagcac cgtggccgat gagatccgtt tctcccttac ctgccagcgt 2641 cgagtcaagc acaagcctca gcctcccatg atggtcaaga ctgacgccgt catcaacatg 2701 cacacattca atgaccgccg gcttcccggc aaggacagca tggcctgcag cacatcctta 2761 gcccctgtgt tcccctaggc acaactgggg tggggacctc aggcctgggg gctgggcaga 2821 ggaaagcaaa ggagattgga aggaacgtcc cctgtacccg cactgggctt ggggaccaga 2881 gctgccacct gcctgttggg ccaggagcct cctgccctta cctgccagga agccagcagg 2941 ctctcaggcc agctgcttgg gcttcatcct cctcagatct tctgtgggtt tctaaagctg 3001 ccagccgaga tagccaaggc caaaggaagc acatgcctct ctcaggccaa actcacctgc 3061 ccctcaactc tcctccagag tcagaagttt ctgccgcagc cctgcagagg gcacagaaaa 3121 tggaagacag ctcttatatt gccatttctt ccacaagagc ccaggcctcc tacagcttga 3181 ccgtgaggcc agagacacaa cttcggcgcc ttaaggatgt tctagcatgg ctgccaatgg 3241 gagctcatgg tgagggatac ccatcccata tgcctgggca gaaggaagac ttcatccctc 3301 tggggctgtt cacgtggtcc taatcttctg aacttggcgc tgcccctggc agcccctgtt 3361 ctggcagagt tgaagacaga gctacacagg ggaaaagagg agtttggggt atgggagaga 3421 agagaatgca caaacagagg ccgccatttt ggattcttat ggacaatgac ccagtggttc 3481 ctaatcctct aggaggtctc taagaatata agtgggggag tggccacaga aaattcttct 3541 ccactttcta gccagaggag agaggacccc ctgaatttct cacaaaggat gcccaaagat 3601 gcagccggta tttg // LOCUS HSU16153 1017 bp mRNA PRI 05-DEC-1995 DEFINITION Human Id-4H protein mRNA, complete cds. ACCESSION U16153 NID g625095 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1017) AUTHORS Pagliuca,A., Bartoli,P.C., Saccone,S., Della Valle,G. and Lania,L. TITLE Molecular cloning of ID4, a novel dominant negative helix-loop-helix human gene on chromosome 6p21.3-p22 JOURNAL Genomics 27 (1), 200-203 (1995) MEDLINE 95394461 REFERENCE 2 (bases 1 to 1017) AUTHORS Lania,L. TITLE Direct Submission JOURNAL Submitted (20-OCT-1994) Luigi Lania, Genetica, Biologia Generale e Molecolare, University of Naples, Via Mezzocannone 8, Naples, 80134 Italy FEATURES Location/Qualifiers source 1..1017 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6p21.3-p22" CDS 306..782 /codon_start=1 /product="Id-4H protein" /db_xref="PID:g625096" /translation="MKAVSPVRPRPLRPSGCGGGELALRCLAEHGHSLGGSQAAAAAA AARCKAAEAAADEPALCLQCDMNDCYSRLRWLPTIPPNKKVSKVEILQHVIDYILDLQ LALETHPALLRQPPPPAPPHHPAGTCPAAPPRTPLTALNTDPAGAVNKQGDSILCR" BASE COUNT 192 a 324 c 335 g 166 t ORIGIN 1 gcggccgcat cgggcttagt cggagctccg aaggagtgac taggacaccc gggtgggcta 61 cttttcttcc ggtgcttttg cttttttttc ctttgggctc gggctgagtg tcgcccactg 121 agcaaagatt ccctcgtaaa acccagagcg accctcccgt caattgttgg gctcgggagt 181 gtcgcggtgc cccgagcgcg ccgggccagg caaagggagc ggaccggccg cggacggggc 241 ccggagcttg cctgcctccc tcgctcgccc cagcgggttc gctcgcgcag ggcgagggcg 301 gcgcgatgaa ggcggtgagc ccggtgcgcc ctcggccgct aaggccgtcg ggctgcggcg 361 gcggggagct ggcgctgcgc tgcctggccg agcacggcca cagcctgggt ggctcgcagg 421 cggcggcggc ggcggcggca gcgcgctgta aggcggccga ggcggcggcc gacgagccgg 481 cgctgtgcct gcagtgcgat atgaacgact gctatagccg cctgaggtgg ttgcccacca 541 tcccgcccaa caagaaagtc agcaaagtgg agatcctgca gcacgttatc gactacatcc 601 tggacctgca gctggcgctg gagacgcacc cggccctgct gaggcagcca ccaccgcccg 661 ctccgccaca ccacccggcc gggacctgtc cagccgcgcc gccgcggacc ccgctcactg 721 cgctcaacac cgacccggcc ggcgcggtga acaagcaggg cgacagcatt ctgtgccgct 781 gagccgcgct gtccaggtgt gcggccgcct gagcccgagc caggagcact agagagggag 841 ggggaagagc agaagttaga gaaaaaaagc caccggagga aaggaaaaaa catcggccaa 901 cctagaaacg ttttcattcg tcattccaag agagagagag gaaagaaaaa tacaactttc 961 attctttctt tgcacgttca taaacattct acatacgtat tctcttttgt ctcttca // LOCUS HSU16258 1813 bp mRNA PRI 27-JAN-1996 DEFINITION Human I kappa BR mRNA, complete cds. ACCESSION U16258 NID g746414 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 325 to 1770) AUTHORS Ray,P., Zhang,D.H., Elias,J.A. and Ray,A. TITLE Cloning of a differentially expressed I kappa B-related protein JOURNAL J. Biol. Chem. 270 (18), 10680-10685 (1995) MEDLINE 95256234 REFERENCE 2 (bases 1 to 1813) AUTHORS Ray,A. TITLE Direct Submission JOURNAL Submitted (24-OCT-1994) Anuradha Ray, Internal Medicine/Pulmonary Section, School of Medicine, Yale University, 333 Cedar Street, LCI 105, New Haven, CT 06520-8057, USA FEATURES Location/Qualifiers source 1..1813 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa S3" 5'UTR 1..324 CDS 325..1770 /codon_start=1 /function="inhibitor of transcription factor NF kappa B" /product="I kappa BR" /db_xref="PID:g746415" /translation="MRTRLYLNLGLTFESLQQTALCNDYFRKSIFLADENHLYEDLFR ARYNLGTIHWRAGQHSQAMRCLEGARECAHTMSEAVHGERVLRGYCTGPPRPGRLFGC QASPEEALQAGLPEACAEGSHLSEPPACAAVVRLQQQLEEAEGRDPQGAMVICEQLGD LFSKAGDFPRAAEAYQKQLRFAELLDRPGAERAIIHVSLATTLGDMKDHHGAVRHYEE ELRLRSGNVLEEAKTWLNIALSREEARCLRAAGPVLPESAQLCPAGPASPAAEAGLAA SPYRAAEGCRPQEAPETETRLRELSVAEDEDEEEEAEEAAHSGERTPGGRRGGALRER GRHRWPDPAAGGGRGASGPPGAAKGSKWNRRNDMGETLLHRACIEGQLRRVQDLVRQG HPLNPRDYCGWTPLHEACNYGHLEIVRFLLDHGAAVDDPGGQGCEGITPLHDALNCGH FEVAELLLERGASVTLRTRKASARWRRCSSG" repeat_region 1435..1734 /rpt_family="ankyrin" 3'UTR 1771..>1813 BASE COUNT 382 a 530 c 591 g 310 t ORIGIN 1 aattcgcgta ctagccggac ttggattttc tggaaagatt tcagttgagg aacgggaaca 61 aagattatga tagctttccg accaccacca acttcaattt ccttagctgc cgtaatatca 121 gctccctgag ctgagccttg aggtccgagt tcatctccag ctccagaaga gcctgggaga 181 tgccggactc gaactcgtcc gcttctcgcc attgggcttc acgatcttgg cgctcgaact 241 gaacatggct tctcctttga gaagagcttg gctattgtgg atgaggagct ggaggggaca 301 ctggcgcagg gagagctgaa tgagatgagg acccgcctct atctcaacct gggcctcacc 361 tttgagagcc tgcagcagac agccctgtgc aacgattact tcaggaagag catcttcctt 421 gcggacgaga accaccttta cgaggaccta ttccgcgccc gctacaacct gggcaccatc 481 cactggcgcg cgggccagca ctcccaggct atgcgctgct tggagggtgc ccgggagtgt 541 gcgcacacca tgagcgaagc ggttcatgga gagcgagtgc tgcgtggtta ttgcacaggt 601 cctccaagac ctgggagact ttttggctgc caagcgagcc ctgaagaagc gctacaggct 661 gggctcccag aagcctgtgc agagggcagc catctgtcag aacctccagc atgtgctgca 721 gtggtccggc tgcagcaaca gctggaagag gctgagggca gagaccctca gggtgccatg 781 gtcatctgtg agcagctagg ggacctcttc tccaaggcag gagactttcc cagggcagct 841 gaggcttacc agaagcagct gcgttttgct gagctgctgg acagaccggg tgctgagcgg 901 gccatcatcc acgtgtccct ggccaccaca ctgggagaca tgaaggacca ccatggggcc 961 gtgcgccact atgaggagga actgaggctg cgcagcggca acgtgctgga ggaggccaag 1021 acctggctga acattgcact gtcccgcgag gaggcgcgat gcctacgagc tgctggcccc 1081 gtgcttccag aaagcgctca gctgtgccca gcaggcccag cgtccccagc tgcagaggca 1141 ggtcttgcag catctccata ccgtgcagct gagggctgca ggccccagga ggcccctgag 1201 accgaaacca gactgcggga gctcagtgta gctgaagatg aagatgagga ggaggaggcg 1261 gaggaggcgg cacacagcgg agagcgaacg cctggaggcc ggcgaggtgg agctctcaga 1321 gagcgaggac gacaccgatg gcctgacccc gcagctggag gaggacgagg agcttcaggg 1381 ccacctgggg ccgccaaggg gagcaagtgg aaccggcgaa acgacatggg ggagaccctg 1441 ctgcaccgag cctgcatcga gggccagctg cgccgcgtcc aggaccttgt gaggcagggc 1501 caccccctta accctcggga ctactgtggc tggacacctc tgcacgaggc ctgcaactac 1561 gggcatctag aaattgtccg cttcctgctg gaccacgggg ccgcagtgga cgacccaggt 1621 ggccagggct gcgaaggcat cacccccctc cacgatgccc tcaactgtgg ccacttcgag 1681 gtggctgagc tgctgcttga acggggggcg tccgtcaccc tccgcactcg aaaggcctca 1741 gcgcgctgga gacgctgcag cagtgggtga agctgtaccg cggagacctg gactggagac 1801 gcgggcggaa ttc // LOCUS HSU16261 1700 bp mRNA PRI 09-MAR-1996 DEFINITION Human MDA-7 (mda-7) mRNA, complete cds. ACCESSION U16261 NID g1141750 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1700) AUTHORS Jiang,H., Lin,J.J., Su,Z.Z., Goldstein,N.I. and Fisher,P.B. TITLE Subtraction hybridization identifies a novel melanoma differentiation associated gene, mda-7, modulated during human melanoma differentiation, growth and progression JOURNAL Oncogene 11 (12), 2477-2486 (1995) MEDLINE 96132699 REFERENCE 2 (bases 1 to 1700) AUTHORS Fisher,P.B. TITLE Direct Submission JOURNAL Submitted (24-OCT-1994) Paul B. Fisher, College of Physicians and Surgeons, Pathology / Urology, Columbia University, 630 W 168th Street, New York, NY 10032, USA FEATURES Location/Qualifiers source 1..1700 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HO-1" /cell_type="differentiated melanoma cells" /tissue_type="melanoma" gene 275..895 /gene="mda-7" CDS 275..895 /gene="mda-7" /codon_start=1 /evidence=experimental /product="MDA-7" /db_xref="PID:g1141751" /translation="MNFQQRLQSLWTLARPFCPPLLATASQMQMVVLPCLGFTLLLWS QVSGAQGQEFHFGPCQVKGVVPQKLWEAFWAVKDTMQAQDNITSARLLQQEVLQNVSD AESCYLVHTLLEFYLKTVFKNYHNRTVEVRTLKSFSTLANNFVLIVSQLQPSQENEMF SIRDSAHRRFLLFRRAFKQLDVEAALTKALGEVDILLTWMQKFYKL" polyA_site 1700 /note="18 A residues" BASE COUNT 434 a 418 c 391 g 457 t ORIGIN 1 cttgcctgca aacctttact tctgaaatga cttccacggc tgggacggga accttccacc 61 cacagctatg cctctgattg gtgaatggtg aaggtgcctg tctaactttt ctgtaaaaag 121 aaccagctgc ctccaggcag ccagccctca agcatcactt acaggaccag agggacaaga 181 catgactgtg atgaggagct gctttcgcca atttaacacc aagaagaatt gaggctgctt 241 gggaggaagg ccaggaggaa cacgagactg agagatgaat tttcaacaga ggctgcaaag 301 cctgtggact ttagccagac ccttctgccc tcctttgctg gcgacagcct ctcaaatgca 361 gatggttgtg ctcccttgcc tgggttttac cctgcttctc tggagccagg tatcaggggc 421 ccagggccaa gaattccact ttgggccctg ccaagtgaag ggggttgttc cccagaaact 481 gtgggaagcc ttctgggctg tgaaagacac tatgcaagct caggataaca tcacgagtgc 541 ccggctgctg cagcaggagg ttctgcagaa cgtctcggat gctgagagct gttaccttgt 601 ccacaccctg ctggagttct acttgaaaac tgttttcaaa aactaccaca atagaacagt 661 tgaagtcagg actctgaagt cattctctac tctggccaac aactttgttc tcatcgtgtc 721 acaactgcaa cccagtcaag aaaatgagat gttttccatc agagacagtg cacacaggcg 781 gtttctgcta ttccggagag cattcaaaca gttggacgta gaagcagctc tgaccaaagc 841 ccttggggaa gtggacattc ttctgacctg gatgcagaaa ttctacaagc tctgaatgtc 901 tagaccagga cctccctccc cctggcactg gtttgttccc tgtgtcattt caaacagtct 961 cccttcctat gctgttcact ggacacttca cgcccttggc catgggtccc attcttggcc 1021 caggattatt gtcaaagaag tcattcttta agcagcgcca gtgacagtca gggaaggtgc 1081 ctctggatgc tgtgaagagt ctacagagaa gattcttgta tttattacaa ctctatttaa 1141 ttaatgtcag tatttcaact gaagttctat ttatttgtga gactgtaagt tacatgaagg 1201 cagcagaata ttgtgcccca tgcttcttta cccctcacaa tccttgccac agtgtggggc 1261 agtggatggg tgcttagtaa gtacttaata aactgtggtg ctttttttgg cctgtctttg 1321 gattgttaaa aaacagagag ggatgcttgg atgtaaaact gaacttcaga gcatgaaaat 1381 cacactgtct gctgatatct gcagggacag agcattgggg tgggggtaag gtgcatctgt 1441 ttgaaaagta aacgataaaa tgtggattaa agtgcccagc acaaagcaga tcctcaataa 1501 acatttcatt tcccacccac actcgccagc tcaccccatc atccctttcc cttggtgccc 1561 tccttttttt tttatcctag tcattcttcc ctaatcttcc acttgagtgt caagctgacc 1621 ttgctgatgg tgacattgca cctggatgta ctatccaatc tgtgatgaca ttccctgcta 1681 ataaaagaca acataactca // LOCUS HSU16273 1146 bp mRNA PRI 24-JAN-1995 DEFINITION Human corticotropin releasing hormone receptor variant mRNA, complete cds. ACCESSION U16273 NID g606973 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1146) AUTHORS Ross,P.C., Kostas,C.M. and Ramabhadran,T.V. TITLE A variant of the human corticotropin-releasing factor (CRF) receptor: cloning, expression and pharmacology JOURNAL Biochem. Biophys. Res. Commun. 205 (3), 1836-1842 (1994) MEDLINE 95110332 REFERENCE 2 (bases 1 to 1146) AUTHORS Ross,P.C. TITLE Direct Submission JOURNAL Submitted (24-OCT-1994) Philip C. Ross, Neurogen Corporation, 35 N.E. Industrial Road, Branford, CT 06405, USA FEATURES Location/Qualifiers source 1..1146 /organism="Homo sapiens" /note="caucasian" /db_xref="taxon:9606" /clone="CRF-Rc" /clone_lib="Stratagene #936205" /sex="female" /tissue_type="brain hippocampus" /dev_stage="juvenile" primer_bind 1..21 /note="forward PCR primer; sequence derived from Genbank Accession Number L23332" CDS 19..1146 /note="start and stop codon assignment derived from Genbank Accession Number L23332" /codon_start=1 /product="corticotropin releasing hormone receptor variant" /db_xref="PID:g606974" /translation="MGGHPQLRLVKALLLLGLNPVSASLQDQHCESLSLASNISDNGY RECLANGSWAARVNYSECQEILNEEKKSKVHYHVAVIINYLGHCISLVALLVAFVLFL RLRSIRCLRNIIHWNLISAFILRNATWFVVQLTMSPEVHQSNVGWCRLVTAAYNYFHV TNFFWMFGEGCYLHTAIVLTYSTDRLRKWMFICIGWGVPFPIIVAWAIGKLYYDNEKC WFGKRPGVYTDYIYQGPMILVLLINFIFLFNIVRILMTKLRASTTSETIQYRKAVKAT LVLLPLLGITYMLFFVNPGEDEVSRVVFIYFNSFLESFQGFFVSVFYCFLNSEVRSAI RKRWHRWQDKHSIRARVARAMSIPTSPTRVSFHSIKQSTAV" primer_bind complement(1122..1146) /note="reverse PCR primer; sequence derived from Genbank Accession Number L23332" BASE COUNT 219 a 374 c 292 g 261 t ORIGIN 1 agccgagcga gcccgaggat gggagggcac ccgcagctcc gtctcgtcaa ggcccttctc 61 cttctggggc tgaaccccgt ctctgcctcc ctccaggacc agcactgcga gagcctgtcc 121 ctggccagca acatctcaga caatggctac cgggagtgcc tggccaatgg cagctgggcc 181 gcccgcgtga attactccga gtgccaggag atcctcaatg aggagaaaaa aagcaaggtg 241 cactaccatg tcgcagtcat catcaactac ctgggccact gtatctccct ggtggccctc 301 ctggtggcct ttgtcctctt tctgcggctc aggagcatcc ggtgcctgcg aaacatcatc 361 cactggaacc tcatctccgc cttcatcctg cgcaacgcca cctggttcgt ggtccagcta 421 accatgagcc ccgaggtcca ccagagcaac gtgggctggt gcaggttggt gacagccgcc 481 tacaactact tccatgtgac caacttcttc tggatgttcg gcgagggctg ctacctgcac 541 acagccatcg tgctcaccta ctccactgac cggctgcgca aatggatgtt catctgcatt 601 ggctggggtg tgcccttccc catcattgtg gcctgggcca ttgggaagct gtactacgac 661 aatgagaagt gctggtttgg caaaaggcct ggggtgtaca ccgactacat ctaccagggc 721 cccatgatcc tggtcctgct gatcaatttc atcttccttt tcaacatcgt ccgcatcctc 781 atgaccaagc tccgggcatc caccacgtct gagaccattc agtacaggaa ggctgtgaaa 841 gccactctgg tgctgctgcc cctcctgggc atcacctaca tgctgttctt cgtcaatccc 901 ggggaggatg aggtctcccg ggtcgtcttc atctacttca actccttcct ggaatccttc 961 cagggcttct ttgtgtctgt gttctactgt ttcctcaata gtgaggtccg ttctgccatc 1021 cggaagaggt ggcaccggtg gcaggacaag cactcgatcc gtgcccgagt ggcccgtgcc 1081 atgtccatcc ccacctcccc aacccgtgtc agctttcaca gcatcaagca gtccacagca 1141 gtctga // LOCUS HSU16282 2805 bp mRNA PRI 13-DEC-1994 DEFINITION Human ELL mRNA, complete cds. ACCESSION U16282 NID g601792 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2805) AUTHORS Thirman,M.J., Levitan,D.A., Kobayashi,H., Simon,M.C. and Rowley,J.D. TITLE Cloning of ELL, a gene that fuses to MLL in a t(11;19)(q23;p13.1) in acute myeloid leukemia JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91, 12110-12114 (1994) MEDLINE 95083651 REFERENCE 2 (bases 1 to 2805) AUTHORS Thirman,M.J. TITLE Direct Submission JOURNAL Submitted (25-OCT-1994) Michael J. Thirman, Hematology/Oncology, University of Chicago, 5841 South Maryland Avenue, Chicago, IL 60637, USA FEATURES Location/Qualifiers source 1..2805 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="fetal brain, Stratagene catalog number 936206" /chromosome="19" /map="19p13.1" /tissue_type="brain" /dev_stage="fetus" gene 13..1878 /gene="ELL" CDS 13..1878 /gene="ELL" /codon_start=1 /db_xref="PID:g601793" /translation="MAALKEDRSYGLSCGRVSDGSKVSVFHVKLTDSALRAFESYRAR QDSVSLRPSIRFQGSQGHISIPQPDCPAEARTFSFYLSNIGRDNPQGSFDCIQQYVSS HGEVHLDCLGSIQDKITVCATDDSYQKARQSMAQAEEETRSRSAIVIKAGGRYLGKKV QFRKPAPGATDAVPSRKRATPINLASAIRKSGASAVSGGSGVSQRPFRDRVLHLLALR PYRKAELLLRLQKDGLTQADKDALDGLLQQVANMSAKDGTCTLQDCMYKDVQKDWPGY SEGDQQLLKRVLVRKLCQPQSTGSLLGDPAASSPPGERGRSASPPQKRLQPPDFIDPL ANKKPRISHFTQRAQPAVNGKLGVPNGREALLPTPGPPASTDTLSSSTHLPPRLEPPR AHDPLADVSNDLGHSGRDCEHGEAAAPAPTVRLGLPLLTDCAQPSRPHGSPSRSKPKK KSKKHKDKERAAEDKPRAQLPDCAPATHATPGAPADTPGLNGTCSVSSVPTSTSETPD YLLKYAAISSSEQRQSYKNDFNAEYSEYRDLHARIERITRRFTQLDAQLRQLSQGSEE YETTRGQILQEYRKIKKTNTNYSQEKHRCEYLHSKLAHIKRLIAEYDQRQLQAWP" BASE COUNT 559 a 924 c 827 g 495 t ORIGIN 1 gatggtcgca agatggcggc gctgaaggag gataggagct acgggctgtc gtgcgggcgg 61 gttagcgacg gcagcaaggt gtcggtgttc cacgtgaagc tcaccgacag tgccctgagg 121 gccttcgaga gctaccgcgc cagacaggat tctgtttcac tgaggccatc tatccgattt 181 caaggaagcc aagggcacat ctccatcccc cagcctgact gccccgcaga ggcgcggacg 241 ttctccttct acctctccaa catcggccgc gacaaccccc agggcagctt cgactgcatc 301 cagcagtatg tctccagtca tggggaagtt cacctggact gcctgggcag catacaggac 361 aagatcacgg tgtgtgccac cgacgactcc taccagaagg cgcggcagag catggcccag 421 gcggaggagg agacgcggag ccgaagtgcc attgtcatca aggctggagg ccgctacctg 481 ggcaagaagg ttcagtttcg gaaaccagcc ccaggtgcaa cagacgcggt gccctcccgg 541 aagcgggcaa cccccatcaa cttggcgagt gccatcagga agagtggtgc cagtgccgtg 601 agtgggggca gcggggtgtc ccagaggccc ttccgtgacc gagtgctgca cctcctggca 661 ctacggccct accgcaaggc tgagctgctg ctgcgactgc agaaggacgg cctgacgcag 721 gcggacaagg acgcgctgga tggcctcctc cagcaggtgg ccaacatgag tgctaaggac 781 ggcacgtgta cactgcagga ctgcatgtac aaggatgtgc agaaggactg gcctggctac 841 tcggaggggg accagcagct gctgaagcgg gtgctcgtcc ggaagctgtg ccagccacag 901 agcactggca gcctccttgg agaccctgct gcctccagcc ccccaggcga gcgtgggcgc 961 tcggcctcgc ccccacagaa gcggctgcag cctcctgatt tcatcgaccc cctagccaac 1021 aagaaacccc ggatatcgca cttcactcag agagctcagc ctgccgtcaa cgggaagctg 1081 ggcgtgccca atggccgtga ggccttgctg cccaccccgg gcccaccagc cagcacggac 1141 accctcagct ccagcactca cctgcccccg cggctggagc ccccgagggc ccacgacccc 1201 ctggccgatg tcagcaatga cctgggccac agcggccgag actgtgagca cggagaggcg 1261 gctgccccag cccccactgt gcgcctcggc ctgcccctgc tgacggactg tgcccagccc 1321 agcaggccac acggcagccc ctcgcgcagc aagcccaaga agaagtccaa gaagcacaaa 1381 gacaaggaga gggcggctga ggacaagccc cgggcccagc ttccagactg tgcacctgcc 1441 acccatgcca cccccggagc cccagcagac accccaggtt taaacggaac ctgcagcgtt 1501 tccagtgttc ccacgtccac gtcggagacg cctgactact tgctgaagta cgcagccatc 1561 tcctcttcgg agcagcgcca gagctacaag aacgacttca atgccgagta cagcgagtac 1621 cgcgacctgc acgcccgcat tgagcgcatc acgcggcggt tcacccagct cgacgcccag 1681 ctccggcagc tctcccaggg ctccgaggag tatgagacta ctcgagggca gattttgcag 1741 gaatatcgaa aaatcaaaaa gaccaacacc aactacagcc aggagaagca ccgctgcgag 1801 tacctgcaca gcaagctggc ccacatcaag aggctcatcg ccgagtacga ccagcggcag 1861 ctgcaggctt ggccctagcc gccctccccg atggcgggga tctgggaggg tcgggggagc 1921 aaaaggcggt gagagaggat ttatttaaaa aaataaaccc gaggaagatg ctcatctgag 1981 ccagcaccgc cggctttcag ggcagcccct gcagacgtct ggccctggcg ggtggctgca 2041 agcccacctc ggccctccct gcgcttcctg agcagtccct gcttatgatg ggctccccag 2101 gaagcccact gcctcctccc tggctgcagc ctccggggtt cagcctcctg tctcgccaga 2161 agactcctag gcccttgggg tggcgcgcct gccttttcta gttttataca aagacagcca 2221 cttttagcta ctgctaatga gacttgagtc tatttttgta caaaagagaa gcaaaatctt 2281 ttttctaaac ctgtgcctcc ctctcctcgg acacccaggg tctagccgct gccctgggtc 2341 ctgcctgcag tattacagag gtcgtcgaaa ggtgcagctg cgttctgagg gcgtggggaa 2401 tgggcaggtg gcctctgctg gtctctggct ctacgtttag gcaccccttt ccccagcctc 2461 tcctccttgg gcagggtctc tgctccagca agcaaacagg gtggcccagg tggctatttt 2521 gagaactcca gctggtgccc ccagacagct gctcagagcc aagggggcag agggctttca 2581 gcgcccccag gcctgccctg ctatttcagg ccctcagctg tcgggggcca ctgtgtttct 2641 gtgctccaag ttgagactcg gccgcagcgg cgtcagactt ttctttgcga tgtcctcggt 2701 ttcccatttg ttgctgctgc tgctcattcc acactgttga gaccttgtgg tctcgatgct 2761 gctggcctcc ctccgtccct ctgtccactt gtgggtcctg gggtc // LOCUS HSU16296 5521 bp mRNA PRI 26-APR-1996 DEFINITION Human T-lymphoma invasion and metastasis inducing TIAM1 protein (TIAM1) mRNA, complete cds. ACCESSION U16296 NID g897556 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5521) AUTHORS Habets,G.G., van der Kammen,R.A., Stam,J.C., Michiels,F. and Collard,J.G. TITLE Sequence of the human invasion-inducing TIAM1 gene, its conservation in evolution and its expression in tumor cell lines of different tissue origin JOURNAL Oncogene 10 (7), 1371-1376 (1995) MEDLINE 95249246 REFERENCE 2 (bases 1 to 5521) AUTHORS Michiels,F., Habets,G.G., Stam,J.C., van der Kammen,R.A. and Collard,J.G. TITLE A role for Rac in Tiam1-induced membrane ruffling and invasion JOURNAL Nature 375 (6529), 338-340 (1995) MEDLINE 95272708 REFERENCE 3 (sites) AUTHORS Habets,G.G., van der Kammen,R.A., Jenkins,N.A., Gilbert,D.J., Copeland,N.G., Hagemeijer,A. and Collard,J.G. TITLE The invasion-inducing TIAM1 gene maps to human chromosome band 21q22 and mouse chromosome 16 JOURNAL Cytogenet. Cell Genet. 70 (1-2), 48-51 (1995) MEDLINE 95254877 REFERENCE 4 (bases 1 to 5521) AUTHORS Habets,G.G.M. TITLE Direct Submission JOURNAL Submitted (25-OCT-1994) Gaston G.M. Habets, Cell Biology, The Netherlands Cancer Institute, Plesmanlaan 121, Amsterdam, 1066 CX, The Netherlands FEATURES Location/Qualifiers source 1..5521 /organism="Homo sapiens" /note="syntenic to mouse chromosome 16" /db_xref="taxon:9606" /clone_lib="lambda ZAPII cDNA library of human female fetal brain, 17-18 weeks of gestation, (Stratagene catalog #936206)" /tissue_type="brain" /dev_stage="fetus; 17-18 weeks of gestation" /map="21q22" /chromosome="21" 5'UTR <1..473 gene 474..5249 /gene="TIAM1" CDS 474..5249 /gene="TIAM1" /function="T-lymphoma invasion and metastasis inducing" /note="similar to product encoded by mouse Tiam-1, GenBank Accession Number U05245" /codon_start=1 /product="TIAM1 protein" /db_xref="PID:g897557" /translation="MGNAESQHVEHEFYGEKHASLGRNDTSRSLRLSHKTRRTRHASS GKVIHRNSEVSTRSSSTPSIPQSLAENGLEPFSQDGTLEDFGSPIWVDRVDMGLRPVS YTDSSVTPSVDSSIVLTAASVQSMPDTEESRLYGDDATYLAEGGRRQHSYTSNGPTFM ETASFKKKRSKSADIWREDSLEFSLSDLSQEHLTSNEEILGSAEEKDCEEARGMETRA SPRQLSTCQRANSLGDLYAQKNSGVTANMGPGSKFAGYCRNLVSDIPNLANHKMPPAA AEETPPYSNYNTLPCRKSHCLSEGATNPQISHSNSMQGRRAKTTQDVNAGEGSEFADS GIEGATTDTDLLSRRSNATNSSYSPTTGRAFVGSDSGSSSTGDAARQGVYENFRRELE MSTTNSESLEEAGSAHSDEQSSGTLSSPGQSDILLTAAQGTVRKAGALAVKNFLVHKK NKKVESATRRKWKHYWVSLKGCTLFFYESDGRSGIDHNSIPKHAVWVENSIVQAVPEH PKKDFVFCLSNSLGDAFLFQTTSQTELENWITAIHSACATAVARHHHKEDTLRLLKSE IKKLEQKIDMDEKMKKMGEMQLSSVTDSKKKKTILDQIFVWEQNLEQFQMDLFRFRCY LASLQGGELPNPKRLLAFASRPTKVAMGRLGIFSVSSFHALVAARTGETGVRRRTQAM SRSASKRRSRFSSLWGLDTTSKKKQGRPSINQVFGEGTEAVKKSLEGIFDDIVPDGKR EKEVVLPNVHQHNPDCDIWVHEYFTPSWFCLPNNQPALTVVRPGDTARDTLELICKTH QLDHSAHYLRLKFLIENKMQLYVPQPEEDIYELLYKEIEICPKVTHSIHIEKSDTAAD TYGFSLSSVEEDGIRRLYVNSVKETGLASKKGLKAGDEILEINNRAADALNSSMLKDF LSQPSLGLLVRTYPELEEGVELLESPPHRVDGPADLDESPLAFLTSNPGHSLCSEQGS SAETAPEETEGPDLESSDETDHSSKSTEQVAAFCRSLHEMNPSDQNPSPQDSTGPQLA TMRQLSDADNVRKVICELLETERTYVKDLNCLMERYLKPLQKETFLTQDELDVLFGNL TEMVEFQVEFLKTLEDGVRLVPDLEKLEKVDQFKKVLFSLGGSFLYYADRFKLYSAFC AIHTKVPKVLVKAKTDTAFKAFLDAQNPKQQHSSTLESYLIKPIQRILKYPLLLRELF ALTDAESEEHYHLDVAIKTMNKVASHINEMQKIHEEFGAVFDQLIAEQTGEKKEVADL SMGDLLLHTTVIWLNPPASLGKWKKEPELAAFVFKTAVVLVYKDGSKQKKKLVGSHRL SIYEDWDPFRFRHMIPTEALQVRALASADAEANAVCEIVHVKSESEGRPERVFHLCCS SPESRKDFLKAVHSILRDKHRRQLLKTESLPSSQQYVPFGGKRLCALKGARPAMSRAV SAPSKSLGRRRRRLARNRFTIDSDAVSASSPEKESQQPPGGGDTDRWVEEQFDLAQYE EQDDIKETDILSDDDEFCESVKGASVDRDLQERLQATSISQRERGRKTLDSHASRMAQ LKKQAALSGINGGLESASEEVIWVRREDFAPSRKLNTEI" 3'UTR 5250..>5521 BASE COUNT 1446 a 1451 c 1488 g 1136 t ORIGIN 1 cgccccgcat cgtgcccggc cccgtcgcgg agatcccgga cgaccgtcgc gggttgatgg 61 tcgcattcca gatgtaaaca gcttcagaag cctgacggtc atatggtaga atcactgtgg 121 actgagaccc acctttctag acctgaagcc caggaggagg aagaggaggc tggttggtac 181 catgggcata atgctctgaa tcctagtctc tcacctagta tgtgagcagt ccctgcagat 241 ggcccatttg gagatcttga caaagcctct tctgtttcca atggggtttt tggcgcattc 301 tcacagactt agatgaaact gtgatggcca ccgcaggggg caggtgctga catcgtcccc 361 agccctgtgg ctgttcatcc ggacatcatt tccaacctca atatctaaat gccacagtgc 421 tcttggagca agttgggctg gggaccactg ttgcctttta agaccataaa accatgggaa 481 acgcagaaag tcaacatgta gagcacgagt tttatggaga aaagcatgcc agcctggggc 541 gcaacgacac ttcccgctcc ctgcgcctct cgcacaagac gcggaggacc aggcacgctt 601 cctcggggaa ggtgatccac aggaactccg aagtgagcac ccgatccagc agcaccccca 661 gcatccccca gtccctggct gaaaatggcc tggagccctt ctcccaagat ggtaccctag 721 aagacttcgg gagccccatc tgggtggacc gagtggacat gggcttgaga cctgtgtctt 781 acactgactc ttctgtcact cccagcgtag acagcagcat cgtcctcaca gcagcctctg 841 tgcagagcat gccagacact gaggagagca ggctttacgg ggatgacgct acatatttgg 901 ctgagggagg caggaggcag cattcctata catccaatgg gcccactttc atggagacgg 961 cgagctttaa gaagaaacgc tccaaatctg cagacatctg gcgggaggac agcctggaat 1021 tctcactctc tgatctgagc caagaacatt taacaagcaa cgaagaaatc ttgggttccg 1081 ccgaagagaa ggactgcgag gaggctcggg ggatggaaac gcgggcgagt ccgcggcagc 1141 tcagcacctg tcagagagcc aattccttgg gtgacttgta tgctcagaaa aactctggag 1201 tgacagcaaa catggggccg gggagcaaat ttgcaggcta ctgtcggaat ttggtgtctg 1261 atattcccaa tcttgcaaac cataagatgc caccagctgc tgctgaagag actcctccgt 1321 acagtaatta taacacactt ccctgtagga aatctcactg tctctctgaa ggtgccacca 1381 acccacaaat tagccatagc aacagcatgc aaggcagaag agctaaaaca actcaggatg 1441 ttaatgcagg cgagggcagt gagtttgcag acagtgggat tgaaggggcc actaccgaca 1501 cggacctcct gtccaggcga tctaatgcca ccaactccag ctactcaccc accacaggcc 1561 gggcctttgt gggcagcgac agcggcagca gctccaccgg ggatgcggct cgtcaggggg 1621 tgtacgagaa cttccggcgg gagctggaga tgagcaccac caacagcgag agcctggagg 1681 aggccggctc tgcgcacagc gatgagcaga gcagcggcac cctgagctct ccgggccagt 1741 cggacatcct gctgaccgcc gcacagggca cggtgcgcaa ggccggcgcc ctggccgtca 1801 agaacttcct ggtgcacaag aagaacaaga aggtggagtc agccacccgg aggaagtgga 1861 agcactactg ggtgtccctg aaaggatgca cgctattttt ctacgagagc gacggcaggt 1921 ctgggataga ccacaacagc atccccaaac acgccgtctg ggtggagaac agcattgtgc 1981 aggctgtgcc tgagcacccc aagaaggact ttgtcttctg cctcagcaat tccctgggtg 2041 atgccttcct ttttcagacc actagccaga cggagcttga aaactggatc accgccatcc 2101 actctgcctg cgccactgcg gtcgcgaggc accaccacaa ggaagacacg ctccgactcc 2161 tgaaatcaga gatcaaaaaa ctggaacaga agattgacat ggatgaaaag atgaagaaaa 2221 tgggtgaaat gcagctgtct tcagtcactg actcaaagaa aaagaaaaca atattagatc 2281 agatctttgt ctgggagcaa aatctcgagc agttccaaat ggacctgttt cgtttccgct 2341 gttatttagc cagccttcag ggtggggagc tgccaaaccc caaaaggctt ctcgcttttg 2401 caagtcgacc aacgaaagtg gccatgggcc gccttggaat cttttcggta tcatcgtttc 2461 atgccctggt ggcagcacgc actggtgaaa ctggagtgag aagacgtact caggccatgt 2521 ccagatccgc gagcaagcga aggagcaggt tttcttctct gtggggtctg gatactacct 2581 ccaaaaagaa gcagggacgg ccaagcatca atcaggtgtt tggagaggga accgaagctg 2641 taaagaaatc tttagaggga atatttgatg acattgttcc agatggcaag agggagaaag 2701 aagtggtctt acctaacgtt caccagcaca accctgactg cgacatttgg gtccacgagt 2761 atttcactcc atcctggttc tgtctgccca ataatcagcc tgccctgacg gtcgtccggc 2821 caggcgacac tgcacgggac accctggagc tgatttgcaa gacacatcaa ctggatcatt 2881 ctgctcatta cctgcgcctg aaatttctaa tagaaaacaa aatgcagctc tatgttccac 2941 agcccgagga agacatctat gagctgctgt acaaagaaat tgaaatctgt ccaaaagtca 3001 ctcacagcat ccacattgag aagtcagata cagctgctga tacttacggg ttttcacttt 3061 cttctgtgga agaagatggt attcgaaggc tgtacgtgaa tagtgtgaag gaaaccggtt 3121 tagcttccaa gaaaggcctg aaagcaggag atgagattct tgagatcaat aatcgtgctg 3181 ctgacgccct gaactcttct atgctcaaag atttcctctc acaaccctcg ctgggcctcc 3241 tggtgaggac ctaccccgag ctggaggaag gagtggagct gctggaaagc ccgccccacc 3301 gagtggacgg ccctgccgac cttgacgaga gccccctcgc ctttctcacc agcaacccag 3361 ggcacagcct ttgcagcgag cagggcagca gtgctgagac cgctccagag gagaccgagg 3421 ggccagactt ggaatcctca gatgagactg atcacagcag caagagtaca gaacaggtgg 3481 ccgcattttg ccgcagtttg catgagatga acccctctga ccagaaccca tctcctcagg 3541 actccacggg gcctcagctg gcgaccatga gacaactctc ggatgcagat aacgtgcgca 3601 aggtgatctg cgagctcctg gagacggagc gcacctacgt gaaggattta aactgtctta 3661 tggagagata cctaaagcct cttcaaaaag aaacttttct cacccaggat gagcttgacg 3721 tgctttttgg aaatttaacg gaaatggtag agtttcaagt agaattcctt aaaactctag 3781 aagatggagt gagactggta cctgatttgg aaaagcttga gaaggttgat caatttaaga 3841 aagtgctgtt ctctctgggg ggatcattcc tgtattatgc tgaccgcttc aagctctaca 3901 gtgccttctg cgccatccac acaaaagttc ccaaggtcct ggtgaaagcc aagacagaca 3961 cggctttcaa ggcattcttg gatgcccaga acccgaagca gcagcactca tccacgctgg 4021 agtcgtacct catcaagccc atccagagga tcctcaagta cccacttctg ctcagggagc 4081 tgttcgccct gaccgatgcg gagagcgagg agcactacca cctggacgtg gccatcaaga 4141 ccatgaacaa ggttgccagt cacatcaatg agatgcagaa aatccatgaa gagtttgggg 4201 ctgtgtttga ccagctgatt gctgaacaga ctggtgagaa aaaagaggtt gcagatctga 4261 gcatgggaga cctgcttttg cacactaccg tgatctggct gaacccgccg gcctcgctgg 4321 gcaagtggaa aaaggaacca gagttggcag cattcgtctt caaaactgct gtggtccttg 4381 tgtataaaga tggttccaaa cagaagaaga aacttgtagg atctcacagg ctttccattt 4441 atgaggactg ggaccccttc agatttcgac acatgatccc cacggaagcg ctgcaggttc 4501 gagctttggc gagtgcagat gcagaggcaa atgccgtgtg tgaaattgtc catgtaaaat 4561 ccgagtctga agggaggccg gagagggtct ttcacttgtg ctgcagctcc ccagagagcc 4621 gaaaggattt cctaaaggct gtgcattcaa tcctgcgtga taagcacaga agacagctcc 4681 tcaaaaccga gagccttccc tcatcccagc aatatgtccc ttttggaggc aaaagattgt 4741 gtgcactgaa gggggccagg ccggccatga gcagggcagt gtctgcccca agcaagtctc 4801 ttgggaggag gaggcggcgg ctggctcgaa acaggtttac cattgattct gatgccgtct 4861 ccgcaagcag cccggagaaa gagtcccagc agccccccgg tggtggggac actgaccgat 4921 gggtagagga gcagtttgat cttgctcagt atgaggagca agatgacatc aaggagacag 4981 acatcctcag tgacgatgat gagttctgtg agtccgtgaa gggtgcctca gtggacagag 5041 acctgcagga gcggcttcag gccacctcca tcagtcagcg ggaaagaggc cggaaaaccc 5101 tggatagtca cgcgtcccgc atggcacagc tcaagaagca agctgccctg tcggggatca 5161 atggaggcct ggagagcgca agcgaggaag tcatttgggt taggcgtgaa gactttgccc 5221 cctccaggaa actgaacact gagatctgac tgcgtcacct gccccgtaga gaatgtgtgt 5281 agatacttcc tgccctaact ctgcccaccc tcctgtaccg tcgacaagaa tgtcccctta 5341 ggtcgcgctc ttgcacacac ggttttggca gctgacttgg ttctgaagcc atgtagccac 5401 ccaactttgt cattttcaac aacatcagaa agaattgatc agaatcccaa ataaaaccca 5461 aaagtgtcta atgtattcat tcattagcta actaaaagcc caaaaaagac aagacaccca 5521 g // LOCUS HSU16306 11185 bp mRNA PRI 14-APR-1995 DEFINITION Human chondroitin sulfate proteoglycan versican V0 splice-variant precursor peptide mRNA, complete cds. ACCESSION U16306 NID g608514 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1309 to 4269) AUTHORS Dours-Zimmermann,M.T. and Zimmermann,D.R. TITLE A novel glycosaminoglycan attachment domain identified in two alternative splice variants of human versican JOURNAL J. Biol. Chem. 269 (52), 32992-32998 (1994) MEDLINE 95105187 REFERENCE 2 (bases 1 to 1308; 4270 to 11185) AUTHORS Zimmermann,D.R. and Ruoslahti,E. TITLE Multiple domains of the large fibroblast proteoglycan, versican JOURNAL EMBO J. 8 (10), 2975-2981 (1989) MEDLINE 90059882 REFERENCE 3 (bases 8398 to 11185) AUTHORS Krusius,T., Gehlsen,K.R. and Ruoslahti,E. TITLE A fibroblast chondroitin sulfate proteoglycan core protein contains lectin-like and growth factor-like sequences JOURNAL J. Biol. Chem. 262 (27), 13120-13125 (1987) MEDLINE 88007514 REFERENCE 4 (bases 1 to 11185) AUTHORS Zimmermann,D.R. TITLE Direct Submission JOURNAL Submitted (26-OCT-1994) Dieter R. Zimmermann, Department of Pathology, University of Zurich, Schmelzbergstrasse 12, Zurich 8091, Switzerland FEATURES Location/Qualifiers source 1..11185 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="G to O" /clone_lib="cDNA library, U251MG" /tissue_type="human glioma cell line" 5'UTR 1..266 sig_peptide 267..326 CDS 267..10457 /codon_start=1 /product="chondroitin sulfate proteoglycan versican V0 splice-variant precursor peptide" /db_xref="PID:g608515" /translation="MFINIKSILWMCSTLIVTHALHKVKVGKSPPVRGSLSGKVSLPC HFSTMPTLPPSYNTSEFLRIKWSKIEVDKNGKDLKETTVLVAQNGNIKIGQDYKGRVS VPTHPEAVGDASLTVVKLLASDAGLYRCDVMYGIEDTQDTVSLTVDGVVFHYRAATSR YTLNFEAAQKACLDVGAVIATPEQLFAAYEDGFEQCDAGWLADQTVRYPIRAPRVGCY GDKMGKAGVRTYGFRSPQETYDVYCYVDHLDGDVFHLTVPSKFTFEEAAKECENQDAR LATVGELQAAWRNGFDQCDYGWLSDASVRHPVTVARAQCGGGLLGVRTLYRFENQTGF PPPDSRFDAYCFKPKEATTIDLSILAETASPSLSKEPQMVSDRTTPIIPLVDELPVIP TEFPPVGNIVSFEQKATVQPQAITDSLATKLPTPTGSTKKPWDMDDYSPSASGPLGKL DISEIKEEVLQSTTGVSHYATDSWDGVVEDKQTQESVTQIEQIEVGPLVTSMEILKHI PSKEFPVTETPLVTARMILESKTEKKMVSTVSELVTTGHYGFTLGEEDDEDRTLTVGS DESTLIFDQIPEVITVSKTSEDTIHTHLEDLESVSASTTVSPLIMPDNNGSSMDDWEE RQTSGRITEEFLGKYLSTTPFPSQHRTEIELFPYSGDKILVEGISTVIYPSLQTEMTH RRERTETLIPEMRTDTYTDEIQEEITKSPFMGKTEEEVFSGMKLSTSLSEPIHVTESS VEMTKSFDFPTLITKLSAEPTEVRDMEEDFTATPGTTKYDENITTVLLAHGTLSVEAA TVSKWSWDEDNTTSKPLESTEPSASSKLPPALLTTVGMNGKDKDIPSFTEDGADEFTL IPDSTQKQLEEVTDEDIAAHGKFTIRFQPTTSTGIAEKSTLRDSTTEEKVPPITSTEG QVYATMEGSALGEVEDVDLSKPVSTVPQFAHTSEVEGLAFVSYSSTQEPTTYVDSSHT IPLSVIPKTDWGVLVPSVPSEDEVLGEPSQDILVIDQTRLEATISPETMRTTKITEGT TQEEFPWKEQTAEKPVPALSSTAWTPKEAVTPLDEQEGDGSAYTVSEDELLTGSERVP VLETTPVGKIDHSVSYPPGAVTEHKVKTDEVVTLTPRIGPKVSLSPGPEQKYETEGSS TTGFTSSLSPFSTHITQLMEETTTEKTSLEDIDLGSGLFEKPKATELIEFSTIKVTVP SDITTAFSSVDRLHTTSAFKPSSAITKKPPLIDREPGEETTSDMVIIGESTSHVPPTT LEDIVAKETETDIDREYFTTSSPPATQPTRPPTVEDKEAFGPQALSTPQPPASTKFHP DINVYIIEVRENKTGRMSDLSVIGHPIDSESKEDEPCSEETDPVHDLMAEILPEFPDI IEIDLYHSEENEEEEEECANATDVTTTPSVQYINGKHLVTTVPKDPEAAEARRGQFES VAPSQNFSDSSESDTHPFVIAKTELSTAVQPNESTETTESLEVTWKPETYPETSEHFS GGEPDVFPTVPFHEEFESGTAKKGAESVTERDTEVGHQAHEHTEPVSLFPEESSGEIA IDQESQKIAFARATEVTFGEEVEKSTSVTYTPTIVPSSASAYVSEEEAVTLIGNPWPD DLLSTKESWVEATPRQVVELSGSSSIPITEGSGEAEEDEDTMFTMVTDLSQRNTTDTL ITLDTSRIITESFFEVPATTIYPVSEQPSAKVVPTKFVSETDTSEWISSTTVEEKKRK EEEGTTGTASTFEVYSSTQRSDQLILPFELESPNVATSSDSGTRKSFMSLTTPTQSER EMTDSTPVFTETNTLENLGAQTTEHSSIHQPGVQEGLTTLPRSPASVFMEQGSGEAAA DPETTTVSSFSLNVEYAIQAEKEVAGTLSPHVETTFSTEPTGLVLSTVMDRVVAENIT QTSREIVISERLGEPNYGAEIRGFSTGFPLEEDFSGDFREYSTVSHPIAKEETVMMEG SGDAAFRDTQTSPSTVPTSVHISHISDSEGPSSTMVSTSAFPWEEFTSSAEGSGEQLV TVSSSVVPVLPSAVQKFSGTASSIIDEGLGEVGTVNEIDRRSTILPTAEVEGTKAPVE KEEVKVSGTVSTNFPQTIEPAKLWSRQEVNPVRQEIESETTSEEQIQEEKSFESPQNS PATEQTIFDSQTFTETELKTTDYSVLTTKKTYSDDKEMKEEDTSLVNMSTPDPDANGL ESYTTLPEATEKSHFFLATALVTESIPAEHVVTDSPIKKEESTKHFPKGMRPTIQESD TELLFSGLGSGEEVLPTLPTESVNFTEVEQINNTLYPHTSQVESTSSDKIEDFNRMEN VAKEVGPLVSQTDIFEGSGSVTSTTLIEILSDTGAEGPTVAPLPFSTDIGHPQNQTVR WAEEIQTSRPQTITEQDSNKNSSTAEINETTTSSTDFLARAYGFEMAKEFVTSAPKPS DLYYEPSGEGSGEVDIVDSFHTSATTQATRQESSTTFVSDGSLEKHPEVPSAKAVTAD GFPTVSVMLPLHSEQNKSSPDPTSTLSNTVSYERSTDGSFQDRFREFEDSTLKPNRKK PTENIIIDLDKEDKDLILTITESTILEILPELTSDKNTIIDIDHTKPVYEDILGMQTD IDTEVPSEPHDSNDESNDDSTQVQEIYEAAVNLSLTEETFEGSADVLASYTQATHDES MTYEDRSQLDHMGFHFTTGIPAPSTETELDVLLPTATSLPIPRKSATVIPEIEGIKAE AKALDDMFESSTLSDGQAIADQSEIIPTLGQFERTQEEYEDKKHAGPSFQPEFSSGAE EALVDHTPYLSIATTHLMDQSVTEVPDVMEGSNPPYYTDTTLAVSTFAKLSSQTPSSP LTIYSGSEASGHTEIPQPSALPGIDVGSSVMSPQDSFKEIHVNIEATFKPSSEEYLHI TEPPSLSPDTKLEPSEDDGKPELLEEMEASPTELIAVEGTEILQDFQNKTDGQVSGEA IKMFPTIKTPEAGTVITTADEIELEGATQWPHSTSASATYGVEAGVVPWLSPQTSERP TLSSSPEINPETQAALIRGQDSTIAASEQQVAARILDSNDQATVNPVEFNTEVATPPF SLLETSNETDFLIGINEESVEGTAIYLPGPDRCKMNPCLNGGTCYPTETSYVCTCVPG YSGDQCELDFDECHSNPCRNGATCVDGFNTFRCLCLPSYVGALCEQDTETCDYGWHKF QGQCYKYFAHRRTWDAAERECRLQGAHLTSILSHEEQMFVNRVGHDYQWIGLNDKMFE HDFRWTDGSTLQYENWRPNQPDSFFSAGEDCVVIIWHENGQWNDVPCNYHLTYTCKKG TVACGQPPVVENAKTFGKMKPRYEINSLIRYHCKDGFIQRHLPTIRCLGNGRWAIPKI TCMNPSAYQRTYSMKYFKNSSSAKDNSINTSKHDHRWSRRWQESRR" mat_peptide 327..10454 /product="versican V0 core protein" misc_feature 1309..4269 /note="encodes alternatively spliced GAG-alpha domain" misc_feature 4270..9531 /note="encodes alternatively spliced GAG-beta domain" 3'UTR 10458..11185 BASE COUNT 3530 a 2489 c 2377 g 2789 t ORIGIN 1 gctgccccga gcctttctgg ggaagaactc caggcgtgcg gacgcaacag ccgagaacat 61 taggtgttgt ggacaggagc tgggaccaag atcttcggcc agccccgcat cctcccgcat 121 cttccagcac cgtcccgcac cctccgcatc cttccccggg ccaccacgct tcctatgtga 181 cccgcctggg caacgccgaa cccagtcgcg cagcgctgca gtgaattttc cccccaaact 241 gcaataagcc gccttccaag gccaagatgt tcataaatat aaagagcatc ttatggatgt 301 gttcaacctt aatagtaacc catgcgctac ataaagtcaa agtgggaaaa agcccaccgg 361 tgaggggctc cctctctgga aaagtcagcc taccttgtca tttttcaacg atgcctactt 421 tgccacccag ttacaacacc agtgaatttc tccgcatcaa atggtctaag attgaagtgg 481 acaaaaatgg aaaagatttg aaagagacta ctgtccttgt ggcccaaaat ggaaatatca 541 agattggtca ggactacaaa gggagagtgt ctgtgcccac acatcccgag gctgtgggcg 601 atgcctccct cactgtggtc aagctgctgg caagtgatgc gggtctttac cgctgtgacg 661 tcatgtacgg gattgaagac acacaagaca cggtgtcact gactgtggat ggggttgtgt 721 ttcactacag ggcggcaacc agcaggtaca cactgaattt tgaggctgct cagaaggctt 781 gtttggacgt tggggcagtc atagcaactc cagagcagct ctttgctgcc tatgaagatg 841 gatttgagca gtgtgacgca ggctggctgg ctgatcagac tgtcagatat cccatccggg 901 ctcccagagt aggctgttat ggagataaga tgggaaaggc aggagtcagg acttatggat 961 tccgttctcc ccaggaaact tacgatgtgt attgttatgt ggatcatctg gatggtgatg 1021 tgttccacct cactgtcccc agtaaattca ccttcgagga ggctgcaaaa gagtgtgaaa 1081 accaggatgc caggctggca acagtggggg aactccaggc ggcatggagg aacggctttg 1141 accagtgcga ttacgggtgg ctgtcggatg ccagcgtgcg ccaccctgtg actgtggcca 1201 gggcccagtg tggaggtggt ctacttgggg tgagaaccct gtatcgtttt gagaaccaga 1261 caggcttccc tccccctgat agcagatttg atgcctactg ctttaaacct aaagaggcta 1321 caaccatcga tttgagtatc ctcgcagaaa ctgcatcacc cagtttatcc aaagaaccac 1381 aaatggtttc tgatagaact acaccaatca tccctttagt tgatgaatta cctgtcattc 1441 caacagagtt ccctcccgtg ggaaatattg tcagttttga acagaaagcc acagtccaac 1501 ctcaggctat cacagatagt ttagccacca aattacccac acctactggc agtaccaaga 1561 agccctggga tatggatgac tactcacctt ctgcttcagg acctcttgga aagctagaca 1621 tatcagaaat taaggaagaa gtgctccaga gtacaactgg cgtctctcat tatgctacgg 1681 attcatggga tggtgtcgtg gaagataaac aaacacaaga atcggttaca cagattgaac 1741 aaatagaagt gggtcctttg gtaacatcta tggaaatctt aaagcacatt ccttccaagg 1801 aattccctgt aactgaaaca ccattggtaa ctgcaagaat gatcctggaa tccaaaactg 1861 aaaagaaaat ggtaagcact gtttctgaat tggtaaccac aggtcactat ggattcacct 1921 tgggagaaga ggatgatgaa gacagaacac ttacagttgg atctgatgag agcaccttga 1981 tctttgacca aattcctgaa gtcattacgg tgtcaaagac ttcagaagac accatccaca 2041 ctcatttaga agacttggag tcagtctcag catccacaac tgtttcccct ttaattatgc 2101 ctgataataa tggatcatcc atggatgact gggaagagag acaaactagt ggtaggataa 2161 cggaagagtt tcttggcaaa tatctgtcta ctacaccttt tccatcacag catcgtacag 2221 aaatagaatt gtttccttat tctggtgata aaatattagt agagggaatt tccacagtta 2281 tttatccttc tctacaaaca gaaatgacac atagaagaga aagaacagaa acactaatac 2341 cagagatgag aacagatact tatacagatg aaatacaaga agagatcact aaaagtccat 2401 ttatgggaaa aacagaagaa gaagtcttct ctgggatgaa actctctaca tctctctcag 2461 agccaattca tgttacagag tcttctgtgg aaatgaccaa gtcttttgat ttcccaacat 2521 tgataacaaa gttaagtgca gagccaacag aagtaagaga tatggaggaa gactttacag 2581 caactccagg tactacaaaa tatgatgaaa atattacaac agtgcttttg gcccatggta 2641 ctttaagtgt tgaagcagcc actgtatcaa aatggtcatg ggatgaagat aatacaacat 2701 ccaagccttt agagtctaca gaaccttcag cctcttcaaa attgccccct gccttactca 2761 caactgtggg gatgaatgga aaggataaag acatcccaag tttcactgaa gatggagcag 2821 atgaatttac tcttattcca gatagtactc aaaagcagtt agaggaggtt actgatgaag 2881 acatagcagc ccatggaaaa ttcacaatta gatttcagcc aactacatca actggtattg 2941 cagaaaagtc aactttgaga gattctacaa ctgaagaaaa agttccacct atcacaagca 3001 ctgaaggcca agtttatgca accatggaag gaagtgcttt gggtgaagta gaagatgtgg 3061 acctctctaa gccagtatct actgttcccc aatttgcaca cacttcagag gtggaaggat 3121 tagcatttgt tagttatagt agcacccaag agcctactac ttatgtagac tcttcccata 3181 ccattcctct ttctgtaatt cccaagacag actggggagt gttagtacct tctgttccat 3241 cagaagatga agttctaggt gaaccctctc aagacatact tgtcattgat cagactcgcc 3301 ttgaagcgac tatttctcca gaaactatga gaacaacaaa aatcacagag ggaacaactc 3361 aggaagaatt cccttggaaa gaacagactg cagagaaacc agttcctgct ctcagttcta 3421 cagcttggac tcccaaggag gcagtaacac cactggatga acaagagggc gatggatcag 3481 catatacagt ctctgaagat gaattgttga caggttctga gagggtccca gttttagaaa 3541 caactccagt tggaaaaatt gatcacagtg tgtcttatcc accaggtgct gtaactgagc 3601 acaaagtgaa aacagatgaa gtggtaacac taacaccacg cattgggcca aaagtatctt 3661 taagtccagg gcctgaacaa aaatatgaaa cagaaggtag tagtacaaca ggatttacat 3721 catctttgag tccttttagt acccacatta cccagcttat ggaagaaacc actactgaga 3781 aaacatccct agaggatatt gatttaggct caggattatt tgaaaagccc aaagccacag 3841 aactcataga attttcaaca atcaaagtca cagttccaag tgatattacc actgccttca 3901 gttcagtaga cagacttcac acaacttcag cattcaagcc atcttccgcg atcactaaga 3961 aaccacctct catcgacagg gaacctggtg aagaaacaac cagtgacatg gtaatcattg 4021 gagaatcaac atctcatgtt cctcccacta cccttgaaga tattgtagcc aaggaaacag 4081 aaaccgatat tgatagagag tatttcacga cttcaagtcc tcctgctaca cagccaacaa 4141 gaccacccac tgtggaagac aaagaggcct ttggacctca ggcgctttct acgccacagc 4201 ccccagcaag cacaaaattt caccctgaca ttaatgttta tattattgag gtcagagaaa 4261 ataagacagg tcgaatgagt gatttgagtg taattggtca tccaatagat tcagaatcta 4321 aagaagatga accttgtagt gaagaaacag atccagtgca tgatctaatg gctgaaattt 4381 tacctgaatt ccctgacata attgaaatag acctatacca cagtgaagaa aatgaagaag 4441 aagaagaaga gtgtgcaaat gctactgatg tgacaaccac cccatctgtg cagtacataa 4501 atgggaagca tctcgttacc actgtgccca aggacccaga agctgcagaa gctaggcgtg 4561 gccagtttga aagtgttgca ccttctcaga atttctcgga cagctctgaa agtgatactc 4621 atccatttgt aatagccaaa acggaattgt ctactgctgt gcaacctaat gaatctacag 4681 aaacaactga gtctcttgaa gttacatgga agcctgagac ttaccctgaa acatcagaac 4741 atttttcagg tggtgagcct gatgttttcc ccacagtccc attccatgag gaatttgaaa 4801 gtggaacagc caaaaaaggg gcagaatcag tcacagagag agatactgaa gttggtcatc 4861 aggcacatga acatactgaa cctgtatctc tgtttcctga agagtcttca ggagagattg 4921 ccattgacca agaatctcag aaaatagcct ttgcaagggc tacagaagta acatttggtg 4981 aagaggtaga aaaaagtact tctgtcacat acactcccac tatagttcca agttctgcat 5041 cagcatatgt ttcagaggaa gaagcagtta ccctaatagg aaatccttgg ccagatgacc 5101 tgttgtctac caaagaaagc tgggtagaag caactcctag acaagttgta gagctctcag 5161 ggagttcttc gattccaatt acagaaggct ctggagaagc agaagaagat gaagatacaa 5221 tgttcaccat ggtaactgat ttatcacaga gaaatactac tgatacactc attactttag 5281 acactagcag gataatcaca gaaagctttt ttgaggttcc tgcaaccacc atttatccag 5341 tttctgaaca accttctgca aaagtggtgc ctaccaagtt tgtaagtgaa acagacactt 5401 ctgagtggat ttccagtacc actgttgagg aaaagaaaag gaaggaggag gagggaacta 5461 caggtacggc ttctacattt gaggtatatt catctacaca gagatcggat caattaattt 5521 taccctttga attagaaagt ccaaatgtag ctacatctag tgattcaggt accaggaaaa 5581 gttttatgtc cttgacaaca ccaacacagt ctgaaaggga aatgacagat tctactcctg 5641 tctttacaga aacaaataca ttagaaaatt tgggggcaca gaccactgag cacagcagta 5701 tccatcaacc tggggttcag gaagggctga ccactctccc acgtagtcct gcctctgtct 5761 ttatggagca gggctctgga gaagctgctg ccgacccaga aaccaccact gtttcttcat 5821 tttcattaaa cgtagagtat gcaattcaag ccgaaaagga agtagctggc actttgtctc 5881 cgcatgtgga aactacattc tccactgagc caacaggact ggttttgagt acagtaatgg 5941 acagagtagt tgctgaaaat ataacccaaa catccaggga aatagtgatt tcagagcgat 6001 taggagaacc aaattatggg gcagaaataa ggggcttttc cacaggtttt cctttggagg 6061 aagatttcag tggtgacttt agagaatact caacagtgtc tcatcccata gcaaaagaag 6121 aaacggtaat gatggaaggc tctggagatg cagcatttag ggacacccag acttcaccat 6181 ctacagtacc tacttcagtt cacatcagtc acatatctga ctcagaagga cccagtagca 6241 ccatggtcag cacttcagcc ttcccctggg aagagtttac atcctcagct gagggctcag 6301 gtgagcaact ggtcacagtc agcagctctg ttgttccagt gcttcccagt gctgtgcaaa 6361 agttttctgg tacagcttcc tccattatcg acgaaggatt gggagaagtg ggtactgtca 6421 atgaaattga tagaagatcc accattttac caacagcaga agtggaaggt acgaaagctc 6481 cagtagagaa ggaggaagta aaggtcagtg gcacagtttc aacaaacttt ccccaaacta 6541 tagagccagc caaattatgg tctaggcaag aagtcaaccc tgtaagacaa gaaattgaaa 6601 gtgaaacaac atcagaggaa caaattcaag aagaaaagtc atttgaatcc cctcaaaact 6661 ctcctgcaac agaacaaaca atctttgatt cacagacatt tactgaaact gaactcaaaa 6721 ccacagatta ttctgtacta acaacaaaga aaacttacag tgatgataaa gaaatgaagg 6781 aggaagacac ttctttagtt aacatgtcta ctccagatcc agatgcaaat ggcttggaat 6841 cttacacaac tctccctgaa gctactgaaa agtcacattt tttcttagct actgcattag 6901 taactgaatc tataccagct gaacatgtag tcacagattc accaatcaaa aaggaagaaa 6961 gtacaaaaca ttttccgaaa ggcatgagac caacaattca agagtcagat actgagctct 7021 tattctctgg actgggatca ggagaagaag ttttacctac tctaccaaca gagtcagtga 7081 attttactga agtggaacaa atcaataaca cattatatcc ccacacttct caagtggaaa 7141 gtacctcaag tgacaaaatt gaagacttta acagaatgga aaatgtggca aaagaagttg 7201 gaccactcgt atctcaaaca gacatctttg aaggtagtgg gtcagtaacc agcacaacat 7261 taatagaaat tttaagtgac actggagcag aaggacccac ggtggcacct ctccctttct 7321 ccacggacat cggacatcct caaaatcaga ctgtcaggtg ggcagaagaa atccagacta 7381 gtagaccaca aaccataact gaacaagact ctaacaagaa ttcttcaaca gcagaaatta 7441 acgaaacaac aacctcatct actgattttc tggctagagc ttatggtttt gaaatggcca 7501 aagaatttgt tacatcagca ccaaaaccat ctgacttgta ttatgaacct tctggagaag 7561 gatctggaga agtggatatt gttgattcat ttcacacttc tgcaactact caggcaacca 7621 gacaagaaag cagcaccaca tttgtttctg atgggtccct ggaaaaacat cctgaggtgc 7681 caagcgctaa agctgttact gctgatggat tcccaacagt ttcagtgatg ctgcctcttc 7741 attcagagca gaacaaaagc tcccctgatc caactagcac actgtcaaat acagtgtcat 7801 atgagaggtc cacagacggt agtttccaag accgtttcag ggaattcgag gattccacct 7861 taaaacctaa cagaaaaaaa cccactgaaa atattatcat agacctggac aaagaggaca 7921 aggatttaat attgacaatt acagagagta ccatccttga aattctacct gagctgacat 7981 cggataaaaa tactatcata gatattgatc atactaaacc tgtgtatgaa gacattcttg 8041 gaatgcaaac agatatagat acagaggtac catcagaacc acatgacagt aatgatgaaa 8101 gtaatgatga cagcactcaa gttcaagaga tctatgaggc agctgtcaac ctttctttaa 8161 ctgaggaaac atttgagggc tctgctgatg ttctggctag ctacactcag gcaacacatg 8221 atgaatcaat gacttatgaa gatagaagcc aactagatca catgggcttt cacttcacaa 8281 ctgggatccc tgctcctagc acagaaacag aattagacgt tttacttccc acggcaacat 8341 ccctgccaat tcctcgtaag tctgccacag ttattccaga gattgaagga ataaaagctg 8401 aagcaaaagc cctggatgac atgtttgaat caagcacttt gtctgatggt caagctattg 8461 cagaccaaag tgaaataata ccaacattgg gccaatttga aaggactcag gaggagtatg 8521 aagacaaaaa acatgctggt ccttcttttc agccagaatt ctcttcagga gctgaggagg 8581 cattagtaga ccatactccc tatctaagta ttgctactac ccaccttatg gatcagagtg 8641 taacagaggt gcctgatgtg atggaaggat ccaatccccc atattacact gatacaacat 8701 tagcagtttc aacatttgcg aagttgtctt ctcagacacc atcatctccc ctcactatct 8761 actcaggcag tgaagcctct ggacacacag agatccccca gcccagtgct ctgccaggaa 8821 tagacgtcgg ctcatctgta atgtccccac aggattcttt taaggaaatt catgtaaata 8881 ttgaagcaac tttcaaacca tcaagtgagg aataccttca cataactgag cctccctctt 8941 tatctcctga cacaaaatta gaaccttcag aagatgatgg taaacctgag ttattagaag 9001 aaatggaagc ttctcccaca gaacttattg ctgtggaagg aactgagatt ctccaagatt 9061 tccaaaacaa aaccgatggt caagtttctg gagaagcaat caagatgttt cccaccatta 9121 aaacacctga ggctggaact gttattacaa ctgccgatga aattgaatta gaaggtgcta 9181 cacagtggcc acactctact tctgcttctg ccacctatgg ggtcgaggca ggtgtggtgc 9241 cttggctaag tccacagact tctgagaggc ccacgctttc ttcttctcca gaaataaacc 9301 ctgaaactca agcagcttta atcagagggc aggattccac gatagcagca tcagaacagc 9361 aagtggcagc gagaattctt gattccaatg atcaggcaac agtaaaccct gtggaattta 9421 atactgaggt tgcaacacca ccattttccc ttctggagac ttctaatgaa acagatttcc 9481 tgattggcat taatgaagag tcagtggaag gcacggcaat ctatttacca ggacctgatc 9541 gctgcaaaat gaacccgtgc cttaacggag gcacctgtta tcctactgaa acttcctacg 9601 tatgcacctg tgtgccagga tacagcggag accagtgtga acttgatttt gatgaatgtc 9661 actctaatcc ctgtcgtaat ggagccactt gtgttgatgg ttttaacaca ttcaggtgcc 9721 tctgccttcc aagttatgtt ggtgcacttt gtgagcaaga taccgagaca tgtgactatg 9781 gctggcacaa attccaaggg cagtgctaca aatactttgc ccatcgacgc acatgggatg 9841 cagctgaacg ggaatgccgt ctgcagggtg cccatctcac aagcatcctg tctcacgaag 9901 aacaaatgtt tgttaatcgt gtgggccatg attatcagtg gataggcctc aatgacaaga 9961 tgtttgagca tgacttccgt tggactgatg gcagcacact gcaatacgag aattggagac 10021 ccaaccagcc agacagcttc ttttctgctg gagaagactg tgttgtaatc atttggcatg 10081 agaatggcca gtggaatgat gttccctgca attaccatct cacctatacg tgcaagaaag 10141 gaacagttgc ttgcggccag ccccctgttg tagaaaatgc caagaccttt ggaaagatga 10201 aacctcgtta tgaaatcaac tccctgatta gataccactg caaagatggt ttcattcaac 10261 gtcaccttcc aactatccgg tgcttaggaa atggaagatg ggctatacct aaaattacct 10321 gcatgaaccc atctgcatac caaaggactt attctatgaa atactttaaa aattcctcat 10381 cagcaaagga caattcaata aatacatcca aacatgatca tcgttggagc cggaggtggc 10441 aggagtcgag gcgctgatcc ctaaaatggc gaacatgtgt tttcatcatt tcagccaaag 10501 tcctaacttc ctgtgccttt cctatcacct cgagaagtaa ttatcagttg gtttggattt 10561 ttggaccacc gttcagtcat tttgggttgc cgtgctccca aaacatttta aatgaaagta 10621 ttggcattca aaaagacagc agacaaaatg aaagaaaatg agagcagaaa gtaagcattt 10681 ccagcctatc taatttcttt agttttctat ttgcctccag tgcagtccat ttcctaatgt 10741 ataccagcct actgtactat ttaaaatgct caatttcagc accgatggcc atgtaaataa 10801 gatgatttaa tgttgatttt aatcctgtat ataaaataaa aagtcacaat gagtttgggc 10861 atatttaatg atgattatgg agccttagag gtctttaatc attggttcgg ctgcttttat 10921 gtagtttagg ctggaaatgg tttcacttgc tctttgactg tcagcaagac tgaagatggc 10981 ttttcctgga cagctagaaa acacaaaatc ttgtaggtca ttgcacctat ctcagccata 11041 ggtgcagttt gcttctacat gatgctaaag gctgcgaatg ggatcctgat ggaactaagg 11101 actccaatgt cgaactcttc tttgctgcat tcctttttct tcacttacaa gaaaggcctg 11161 aatggaggac ttttctgtaa ccagg // LOCUS HSU16307 1597 bp mRNA PRI 04-DEC-1995 DEFINITION Human glioma pathogenesis-related protein (GliPR) mRNA, complete cds. ACCESSION U16307 NID g1100927 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1597) AUTHORS Murphy,E.V., Zhang,Y., Zhu,W. and Biggs,J. TITLE The human glioma pathogenesis-related protein is structurally related to plant pathogenesis-related proteins and its gene is expressed specifically in brain tumors JOURNAL Gene 159 (1), 131-135 (1995) MEDLINE 95331646 REFERENCE 2 (bases 1 to 1597) AUTHORS Murphy,E.V. TITLE Direct Submission JOURNAL Submitted (25-OCT-1994) Elizabeth V. Murphy, Preventive Medicine, University of Arizona, 1612 Mabel St., Tucson, AZ 85724, USA FEATURES Location/Qualifiers source 1..1597 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="Enh2-1" /clone_lib="Human Astrocytoma U251 (Stratagene)" gene 115..774 /gene="GliPR" CDS 115..774 /gene="GliPR" /citation=[1] /codon_start=1 /function="unknown" /product="glioma pathogenesis-related protein" /db_xref="PID:g847722" /translation="MVSFVSNYSHTANILPDIENEDFIKDCVRIHNKFRSEVKPTASD MLYMTWDPALAQIAKAWASNCQFSHNTRLKPPHKLHPNFTSLGENIWTGSVPIFSVSS AITNWYDEIQDYNFKTRICKKVCGHYTQVVWADSYKVGCAVQFCPKVSGFDALSNGAH FICNYGPGGNYPTWPYKRGATCSACPNNDKCLDNLCVNDSETKSNVTTMLYIRLAHIS T" polyA_site 1597 /note="16 A nucleotides" BASE COUNT 508 a 353 c 268 g 468 t ORIGIN 1 ggctgagcac tcaggcaatc acactctcag aaactgcggc ggctctggac tgcagcctcc 61 aaggctccat gccagacaaa gctatgcgta gtcacacttg ctacaatagc ctggatggtt 121 tcttttgtct ccaattattc acacacagca aatattttgc cagatatcga aaatgaagat 181 ttcatcaaag actgcgttcg aatccataac aagttccgat cagaggtgaa accaacagcc 241 agtgatatgc tatacatgac ttgggaccca gcactagccc aaattgcaaa agcatgggcc 301 agcaattgcc agttttcaca taatacacgg ctgaagccac cccacaagct gcacccaaac 361 ttcacttcac tgggagagaa catctggact gggtctgtgc ccattttttc tgtgtcttcc 421 gccatcacaa actggtatga cgaaatccag gactataact tcaagactcg gatatgcaaa 481 aaagtctgtg gccactacac tcaggttgtt tgggcagata gttacaaagt tggctgcgca 541 gttcaatttt gccctaaagt ttctggcttt gacgctcttt ccaatggagc acattttata 601 tgcaactacg gaccaggagg gaattaccca acttggccat ataagagagg agccacctgc 661 agtgcctgcc ccaataatga caagtgtttg gacaatctct gtgttaacga cagcgagacc 721 aagtcaaacg ttactactat gttgtatatc aggctggccc atatatccac gtaacagata 781 cacttctctc tttctcattg ttaattcagt aattctaata ctgtctgtta taattaccat 841 tttggtacag cacaagtacc ctaatttagt tcttttggac taatacaatt caggaaagaa 901 aaaacccaaa aaccaacctc attcacatat ggcttttttt ttaacccaat aacaattagg 961 tgtaccttcc tatttaaaca tttcataaaa aaatatatgt tatagcaata ctcttactca 1021 aaagaagaaa tttcctaact ctatcagata aactcatctt tagtataaat aagcattatt 1081 tgcaggttgc ctacaggtgg acttttagta agtaacctaa cccatgtttc agcttctaaa 1141 tctcgaaaat gagcaaggta caggtacagt agcacatttt taggtgattc ttagtaactc 1201 cagtagcctt cattagttaa aaacattatt attttttgca tgctgcttcg actctaaata 1261 tcttgttttc cctgtctttt tggtttacta cttccccaga ttcagaacag aggagtaact 1321 aggggatctg attttagagg cctaattttc tgttcatgga ctgttaaaag taaaaccaaa 1381 ctttcaaaag ggataaacct aaatatttac ttgttatcat tagagaggga acatcaaatg 1441 ctggcactat atacatacga tcagcctgat tatgatatca caatggtcgt aatgtataca 1501 aagacttata taccactttc tcgtataaat ttttcaaaaa atacaataat aatataattt 1561 ataaagaaca ctcttctatg aacaaccacc accacca // LOCUS HSU16660 1196 bp mRNA PRI 19-OCT-1995 DEFINITION Human peroxisomal enoyl-CoA hydratase-like protein (HPXEL) mRNA, complete cds. ACCESSION U16660 NID g564064 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1196) AUTHORS FitzPatrick,D.R., Germain-Lee,E. and Valle,D. TITLE Isolation and characterization of rat and human cDNAs encoding a novel putative peroxisomal enoyl-CoA hydratase JOURNAL Genomics 27 (3), 457-466 (1995) MEDLINE 96047331 REFERENCE 2 (bases 1 to 1196) AUTHORS Fitzpatrick,D.R. TITLE Direct Submission JOURNAL Submitted (27-OCT-1994) David R. Fitzpatrick, Howard Hughes Medical Institute, Johns Hopkins University School of Medicine, RM 802, PCTB, 725 N. Wolfe St., Baltimore, MD 21205, USA FEATURES Location/Qualifiers source 1..1196 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="19" /map="19q13.1" /tissue_type="retina" /clone_lib="cDNA library made by Dr. Jeremy Nathans, HHMI, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA" gene 28..1014 /gene="HPXEL" CDS 28..1014 /gene="HPXEL" /codon_start=1 /product="peroxisomal enoyl-CoA hydratase-like protein" /db_xref="PID:g564065" /translation="MAAGIVASRRLRDLLTRRLTGSNYPGLSISLRLTGSSAQEEASG VALGEAPDHSYESLRVTSAQKHVLHVQLNRPNKRNAMNKVFWREMVECFNKISRDADC RAVVISGAGKMFTAGIDLMDMASDILQPKGDDVARISWYLRDIITRYQETFNVIERCP KPVIAAVHGGCIGGGVDLVTACDIRYCAQDAFFQVKEVDVGLAADVGTLERLPKVIGN QSLVNELAFTAHKMMADEALDSGLVSRVFPDKEVMLDAALPLAPEISSKTTVLVQSTK VNLLYSRDHSVAESLNYVASWNMSMLQTQDLVKSVQPTTENKELKTVTFSKL" misc_feature 1003..1014 /gene="HPXEL" /note="encodes peroxisomal targeting sequence" polyA_signal 1186..1191 BASE COUNT 269 a 346 c 342 g 239 t ORIGIN 1 ggaactcagt agacgaaggc ggcggcgatg gcggcgggga tagtggcttc tcgcagactc 61 cgcgacctac tgacccggcg actgacaggc tccaactacc cgggactcag tattagcctt 121 cgcctcactg gctcctctgc acaagaggag gcttccggag tagccctcgg tgaagcccca 181 gaccacagct atgagtccct tcgtgtgacg tctgcgcaga aacatgttct gcatgtccag 241 ctcaaccggc ccaacaagag gaatgccatg aacaaggtct tctggagaga gatggtagag 301 tgcttcaaca agatttcgag agacgctgac tgtcgggcgg tggtgatctc tggtgcagga 361 aaaatgttca ctgcaggtat tgacctgatg gacatggctt cggacatcct gcagcccaaa 421 ggagatgatg tggcccggat cagctggtac ctccgtgaca tcatcactcg ataccaggag 481 accttcaacg tcatcgagag gtgccccaag cccgtgattg ctgccgtcca tgggggctgc 541 attggcggag gtgtggacct tgtcaccgcc tgtgacatcc ggtactgtgc ccaggatgct 601 ttcttccagg tgaaggaggt ggacgtgggt ttggctgccg atgtaggaac actggagcgc 661 ctgcccaagg tcatcgggaa ccagagcctg gtcaacgagc tggccttcac cgcccacaag 721 atgatggctg acgaggccct ggacagtggg ctggtcagcc gggtgttccc agacaaagag 781 gtcatgctgg atgctgcctt acccctggcg cccgagattt ccagcaagac caccgtgttg 841 gtgcagagca ccaaggtcaa cctgctgtat tcccgcgacc attcggtggc cgagagcctc 901 aactacgtgg cgtcctggaa catgagcatg ctgcagaccc aagacctcgt gaagtcggtc 961 cagcccacga ctgagaacaa ggaactgaaa accgtcacct tctccaagct ctgagagccc 1021 tcgcgtccca gccccagcca gggggccggc cttgtcccgc ctcatccaca gaaagggagg 1081 atgggcgatg acagttgttt ctatgccttc tgacccagtt tcccagttta taactttatg 1141 acaatgagtt tctcaagccc aaggccttat cttcacccca caaacaataa agcaaa // LOCUS HSU16752 3541 bp mRNA PRI 18-APR-1996 DEFINITION Human cytokine SDF-1-beta mRNA, complete cds. ACCESSION U16752 NID g1272194 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3541) AUTHORS Spotila,L.D. TITLE Novel sequences expressed by mineralizing human osteoblasts in culture JOURNAL Unpublished REFERENCE 2 (bases 1 to 3541) AUTHORS Spotila,L.D. TITLE Direct Submission JOURNAL Submitted (31-OCT-1994) Loretta D. Spotila, Biochemistry and Molecular Biology, Thomas Jefferson University, 233 South Tenth Street, Philadelphia, PA 19107, USA FEATURES Location/Qualifiers source 1..3541 /organism="Homo sapiens" /note="a shorter clone with an identical sequence was purified from a human osteoblast cDNA library" /db_xref="taxon:9606" /clone="sdf5-8.7" /clone_lib="lambda ZAP human fibroblast cDNA library" /cell_type="fibroblast" CDS 81..362 /codon_start=1 /product="cytokine SDF-1-beta" /db_xref="PID:g571508" /translation="MNAKVVVVLVLVLTALCLSDGKPVSLSYRCPCRFFESHVARANV KHLKILNTPNCALQIVARLKNNNRQVCIDPKLKWIQEYLEKALNKRFKM" BASE COUNT 912 a 875 c 799 g 935 t 20 others ORIGIN 1 cgcggccgca gccgcattgc ccgctcggcg tccggccccc gacccgcgct cgtccgcccg 61 cccgcccgcc cgcccgcgcc atgaacgcca aggtcgtggt cgtgctggtc ctcgtgctga 121 ccgcgctctg cctcagcgac gggaagcccg tcagcctgag ctacagatgc ccatgccgat 181 tcttcgaaag ccatgttgcc agagccaacg tcaagcatct caaaattctc aacactccaa 241 actgtgccct tcagattgta gcccggctga agaacaacaa cagacaagtg tgcattgacc 301 cgaagctaaa gtggattcag gagtacctgg agaaagcttt aaacaagagg ttcaagatgt 361 gagagggtca gacgcctgag gaacccttac agtaggagtc cagctctgaa accagtgtta 421 gggaagggcc tgccacagcc tcccctgcca gggcagggcc ccaggcattg ccaagggctt 481 tgttttggac actttgccat attttcacca tttgattatg tagcaaaata catgacattt 541 atttttcatt tagtttgatt attcagtgtc actggcgaca cgtagcagct tagactaagg 601 ccattattgt acttgcctta ttagagtgtc tttccacgga gccactcctc tgactcaggg 661 ctcctgggtt ttggattctc tgagctgtgc aggtggggag actgggctga gggagcctgg 721 ccccatggtc agccctaggg tggagagcca ccaagaggga cgcctggggg tgtcaggacc 781 agtcaacctg ggcaaagcct agtgaaggct tctctctgtg ggatgggatg gtggagggcc 841 acatgggagg ttcaccccct tctccatcca catggtgagc cgggtctgcc tcttctggga 901 gggcagcagg gctaccctga gctgaggcag cagtgtgagg ccagggcaga gtgagaccca 961 gccctcatcc cgagcacctc cacatcctcc acgttctgct catcattctc tgtctcatcc 1021 atcatcatgt gtgtccacga ctgtctccat ggccccgcaa aaggactctc aggaccaaag 1081 ctttcatgta aactgtgcac caagcaggaa atgaaaatgt cttgtgttac ctgaaaacac 1141 tgtgcacatc tgtgtcttgt ttggaatatt gtccattgtc caatcctatg tttttggtca 1201 aagccagcgt cctcctctgt gaccaatgtc ttgatgcatg cactgttccc cctgtgcagc 1261 cgctgagcga ggagatgctc cntgggccct ttgagtgcag tcctgatcag agccgtggtc 1321 ctttggggtg aactaccttg gttcccccac tgatcacaaa aacatggtgg gtccatgggc 1381 agagcccaag ggaattcggt gtgcaccagg gttgacccca gaggattgct gccccatcag 1441 tgctccctca catgtcagta ccttcaaact agggccaagc ccagcactgc ttgaggaaaa 1501 caagcattca caacttgttt tnggttttta aaacccagtc cacaaaataa ccaatcctgg 1561 acatgaagat tctttcccaa ttcacatcta acctcatctt cttcaccatt tggcaatgcc 1621 atcatctcct gccttcctcc tgggccctct ctgctctgcg tgtcacctgt gcttcgggcc 1681 cttcccacag gacatttctc taagagaaca atgtgctatg tgaagagtaa gtcaacctgc 1741 ctgacatttg gagtgttccc cttccactga gggcagtcga tagagctgta ttaagccact 1801 taaaatgttt gtcactttgc caaggcaagc acttgtgggn nttgnttgtt ntcantcagt 1861 cttncgaata ctttttcccc ttgataaaga ctccagttaa aanaaatttt aatgaagaaa 1921 gtggaaacaa ggaagtcaaa gcaaggaaac tatgtaacat gtaggaagta ggaagtaaat 1981 tatagtgatg taatcttgaa ttgtaactgt tcttgaattt aataatctgt agggtaatta 2041 gtaacatgtg ttaagtattt tcataagtat ttcaaattgg agcttcatgg cagaaggcaa 2101 acccatcanc aaaaattgtc ccttaaacaa aaattaaaat cctcaatcca gctatgttat 2161 attgaaaaaa tagagcctga gggatcttta ctagttataa agatacagaa ctctttcnaa 2221 accttttgaa attaacctct cactatacca gtataattga gttttcagtg gggcagtcat 2281 tatccaggta atccaagata ttttaaaatc tgtcacgtag aacttggatg tacctgcccc 2341 caatccatga accaagacca ttgaattctt ggttgaggaa acaaacatga ccctaaatct 2401 tgactacagt caggaaagga atcatttcta tttctcctcc atgggagaaa atagataaga 2461 gtagaaactg cagggnaaaa ttatttgnat aacaattcct ctactaacaa tcagctcctt 2521 cctggagact gcccagctaa agcaatatgc atttaaatac agtcttccat ttgnaaggga 2581 aaagtctctt gtaatccgaa tctctttttg gtttcgaact gctagtcaag tgcgtccacg 2641 agctgtttac tagggatccc tcatctgtcc ctccgggacc tggtgctgcc tctacctgac 2701 actcccttgg gctccctgta acctcttcag aggncctcgc tgccagctct gtntcaggac 2761 ccagaggaag gggncagagg ctcgttgact ggctgtgtgt tgggattgag tctgtgccac 2821 gtgtttgtgc tgtggtgtgt cccctctgtc caggcactga gataccagcg aggaggctcc 2881 agagggcgct ctgcttgtta ttagagatta cctcctgaga aaaaaggttc cgcttggagc 2941 agaggggctg aatagcagaa ggttgcacct cccccaacct tagatgttct aagtctttcc 3001 attggatctc attggaccct tccatggtgt gatcgtctga ctggtgttat caccgtgggc 3061 tccctgactg ggagttgatc gcctttccca ggtgctacac ccttttccag ctggatgaga 3121 atttgagtgc tctgatccct ctacagagct tccctgactc attctgaagg agccccattc 3181 ctgggaaata ttccctagaa acttccaaat cccctaagca gaccactgat aaaaccatgt 3241 agaaaatttg ttattttgna acctcgctgg actctcagtc tctgagcagt gaatgattca 3301 gtgttaaatg tgatgaatac tgtattttgt attgtttcaa ttgcatctcc cagataatgt 3361 gaaaatggtc caggagaagg ncaattccta tacgcagngt gctttaaaaa ataaataaga 3421 aacaactctt tgagaaacaa caatttctac tttgaagtca taccaatgaa aaaatgtata 3481 tgcacttata attttcctaa taaagttctg tactcaaatg taaaaaaaaa aaaaaaaaaa 3541 a // LOCUS HSU16797 1777 bp mRNA PRI 16-MAY-1996 DEFINITION Human LERK-5 (EPLG5) mRNA, complete cds. ACCESSION U16797 NID g902370 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1777) AUTHORS Cerretti,D.P., Vanden Bos,T., Nelson,N., Kozlosky,C.J., Reddy,P., Maraskovsky,E., Park,L.S., Lyman,S.D., Copeland,N.G., Gilbert,D.J., Jenkins,N.A. and Fletcher,R.A. TITLE Isolation of LERK-5: a ligand of the eph-related receptor tyrosine kinases JOURNAL Mol. Immunol. 32 (16), 1197-1205 (1995) MEDLINE 96145238 REFERENCE 2 (bases 1 to 1777) AUTHORS Cerretti,D.P. TITLE Direct Submission JOURNAL Submitted (01-NOV-1994) Douglas P. Cerretti, Immunex Corp., 51 University St., Seattle, WA 98101, USA FEATURES Location/Qualifiers source 1..1777 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Lambda gt10, Clontech catalog number HL3003a" /chromosome="13" /map="13q33" /tissue_type="brain" /dev_stage="fetus" sig_peptide 8..82 /gene="EPLG5" /note="putative" CDS 8..1009 /gene="EPLG5" /codon_start=1 /function="ligand for the receptor tyrosine kinases hek and elk" /product="LERK-5" /db_xref="PID:g902371" /translation="MAVRRDSVWKYCWGVLMVLCRTAISKSIVLEPIYWNSSNSKFLP GQGLVLYPQIGDKLDIICPKVDSKTVGQYEYYKVYMVDKDQADRCTIKKENTPLLNCA KPDQDIKFTIKFQEFSPNLWGLEFQKNKDYYIISTSNGSLEGLDNQEGGVCQTRAMKI LMKVGQDASSAGSTRNKDPTRRPELEAGTNGRSSTTSPFVKPNPGSSTDGNSAGHSGN NILGSEVALFAGIASGCIIFIVIIITLVVLLLKYRRRHRKHSPQHTTTLSLSTLATPK RSGNNNGSEPSDIIIPLRTADSVFCPHYEKVSGDYGHPVYIVQEMPPQSPANIYYKV" gene 8..1009 /gene="EPLG5" misc_feature 680..757 /gene="EPLG5" /note="encodes the LERK-5 transmembrane domain" BASE COUNT 458 a 469 c 462 g 388 t ORIGIN 1 cacagccatg gctgtgagaa gggactccgt gtggaagtac tgctggggtg ttttgatggt 61 tttatgcaga actgcgattt ccaaatcgat agttttagag cctatctatt ggaattcctc 121 gaactccaaa tttctacctg gacaaggact ggtactatac ccacagatag gagacaaatt 181 ggatattatt tgccccaaag tggactctaa aactgttggc cagtatgaat attataaagt 241 ttatatggtt gataaagacc aagcagacag atgcactatt aagaaggaaa atacccctct 301 cctcaactgt gccaaaccag accaagatat caaattcacc atcaagtttc aagaattcag 361 ccctaacctc tggggtctag aatttcagaa gaacaaagat tattacatta tatctacatc 421 aaatgggtct ttggagggcc tggataacca ggagggaggg gtgtgccaga caagagccat 481 gaagatcctc atgaaagttg gacaagatgc aagttctgct ggatcaacca ggaataaaga 541 tccaacaaga cgtccagaac tagaagctgg tacaaatgga agaagttcga caacaagtcc 601 ctttgtaaaa ccaaatccag gttctagcac agacggcaac agcgccggac attcggggaa 661 caacatcctc ggttccgaag tggccttatt tgcagggatt gcttcaggat gcatcatctt 721 catcgtcatc atcatcacgc tggtggtcct cttgctgaag taccggagga gacacaggaa 781 gcactcgccg cagcacacga ccacgctgtc gctcagcaca ctggccacac ccaagcgcag 841 cggcaacaac aacggctcag agcccagtga cattatcatc ccgctaagga ctgcggacag 901 cgtcttctgc cctcactacg agaaggtcag cggggactac gggcacccgg tgtacatcgt 961 ccaggagatg cccccgcaga gcccggcgaa catttactac aaggtctgag agggaccctg 1021 gtggtacctg tgctttccca gaggacacct aatgtcccga tgcctccctt gagggtttga 1081 gagcccgcgt gctggagaat tgactgaagc acagcaccgg gggagaggga cactcctcct 1141 cggaagagcc cgtcgcgctg gacagcttac ctagtcttgt agcattcggc cttggtgaac 1201 acacacgctc cctggaagct ggaagactgt gcagaagacg cccattcgga ctgctgtgcc 1261 gcgtcccacg tctcctcctc gaagccatgt gctgcggtca ctcaggcctc tgcagaagcc 1321 aagggaagac agtggtttgt ggacgagagg gctgtgagca tcctggcagg tgccccagga 1381 tgccacgcct ggaagggccg gcttctgcct ggggtgcatt tcccccgcag tgcataccgg 1441 acttgtcaca cggacctcgg gctagttaag gtgtgcaaag atctctagag tttagtcctt 1501 actgtctcac tcgttctgtt acccagggct ctgcagcacc tcacctgaga cctccactcc 1561 acatctgcat cactcatgga acactcatgt ctggagtccc ctcctccagc cgctggcaac 1621 aacagcttca gtccatgggt aatccgttca tagaaattgt gtttgctaac aaggtgccct 1681 ttagccagat gctaggctgt ctgcgaagaa ggctaggagt tcatagaagg gagtggggct 1741 ggggaaaggg ctggctgcaa ttgcagctca ctgctgc // LOCUS HSU16799 1476 bp mRNA PRI 16-MAY-1995 DEFINITION Human Na,K-ATPase beta-1 subunit mRNA, complete cds. ACCESSION U16799 NID g806753 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 301) AUTHORS Ruiz,A., Bhat,S.P. and Bok,D. TITLE Characterization and quantification of full-length and truncated Na,K-ATPase alpha 1 and beta 1 RNA transcripts expressed in human retinal pigment epithelium JOURNAL Gene 155 (2), 179-184 (1995) MEDLINE 95237606 REFERENCE 2 (bases 1 to 303) AUTHORS Kawakami,K., Nojima,H., Ohta,T. and Nagano,K. TITLE Molecular cloning and sequence analysis of human Na,K-ATPase beta-subunit JOURNAL Nucleic Acids Res. 14 (7), 2833-2844 (1986) MEDLINE 86176770 REFERENCE 3 (bases 1 to 1476) AUTHORS Bok,D. TITLE Direct Submission JOURNAL Submitted (01-NOV-1994) Dean Bok, Anatomy and Cell Biology, University of California at Los Angeles School of Medicine, 10833 Le Conte Avenue, CHS RM 73-235, Los Angeles, CA 90024, USA FEATURES Location/Qualifiers source 1..1476 /organism="Homo sapiens" /note="in Uni-ZAP XR vector" /db_xref="taxon:9606" /clone="B1-T1" /sex="male" /cell_type="epithelial cell" /tissue_type="retinal pigment epithelium" /dev_stage="fetus" CDS 46..951 /codon_start=1 /product="Na,K-ATPase beta subunit" /db_xref="PID:g806754" /translation="MARGKAKEEGSWKKFIWNSEKKEFLGRTGGSWFKILLFYVIFYG CLAGIFIGTIQVMLLTISEFKPTYQDRVAPPGLTQIPQIQKTEISFRPNDPKSYEAYV LNIVRFLEKYKDSAQRDDMIFEDCGDVPSEPKERGDFNHERGERKVCRFKLEWLGNCS GLNDETYGYKEGKPCIIIKLNRVLGFKPKPPKNESLETYPVMKYNPNVLPVQCTGKRD EDKDKVGNVEYFGLGNSPGFPLQYYPYYGKLLQPKYLQPLLAVQFTNLTMDTEIRIEC KAYGENIGYSEKDRFQGRFDVKIKF" polyA_signal 1459..1464 polyA_site 1476 BASE COUNT 426 a 299 c 333 g 418 t ORIGIN 1 gccacccacc ctccggaccg cggcagctgc tgacccgcca tcgccatggc ccgcgggaaa 61 gccaaggagg agggcagctg gaagaaattc atctggaact cagagaagaa ggagtttctg 121 ggcaggaccg gtggcagttg gtttaagatc cttctattct acgtaatatt ttatggctgc 181 ctggctggca tcttcatcgg aaccatccaa gtgatgctgc tcaccatcag tgaatttaag 241 cccacatatc aggaccgagt ggccccgcca ggattaacac agattcctca gatccagaag 301 actgaaattt cctttcgtcc taatgatccc aagagctatg aggcatatgt actgaacata 361 gttaggttcc tggaaaagta caaagattca gcccagaggg atgacatgat ttttgaagat 421 tgtggcgatg tgcccagtga accgaaagaa cgaggagact ttaatcatga acgaggagag 481 cgaaaggtct gcagattcaa gcttgaatgg ctgggaaatt gctctggatt aaatgatgaa 541 acttatggct acaaagaggg caaaccgtgc attattataa agctcaaccg agttctaggc 601 ttcaaaccta agcctcccaa gaatgagtcc ttggagactt acccagtgat gaagtataac 661 ccaaatgtcc ttcccgttca gtgcactggc aagcgagatg aagataagga taaagttgga 721 aatgtggagt attttggact gggcaactcc cctggttttc ctctgcagta ttatccgtac 781 tatggcaaac tcctgcagcc caaatacctg cagcccctgc tggccgtaca gttcaccaat 841 cttaccatgg acactgaaat tcgcatagag tgtaaggcgt acggtgagaa cattgggtac 901 agtgagaaag accgttttca gggacgtttt gatgtaaaaa ttaaatttta agtgacacta 961 cagaaaaaca caaaaaggtg atgggttgtg ttatgcttgt attgaatgct gtcttgacat 1021 ctcttgcctt gtcctccggt atgttctaaa gctgtgtctg agatctggat ctgcccatca 1081 ctttggctag tgacagggct aattaatttg ctttatacat tttcttttac tttccttttt 1141 tcctttctgg aggcatcaca tgctggtgct gtgtctttat gaatgtttta accattttca 1201 tggtggaaga attttatatt tatgcagttg tacaatttta tttttttctg caagaaaaag 1261 tgtaatgtat gaaataaacc aaagtcactt gtttgaaaat aaatctttat tttgaacttt 1321 ataaaaagca atgcagtacc ccatagactg gtgttaaatg ttgtctacag tgcaaaatcc 1381 atgttctaac atatgtaata attgccagga gtacagtgct cttgttgatc ttgtattcag 1441 tcaggttaaa acaacggtca ataaaagaat gaacac // LOCUS HSU16953 1560 bp mRNA PRI 08-APR-1995 DEFINITION Human potassium channel beta3 subunit mRNA, complete cds. ACCESSION U16953 NID g762887 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1331) AUTHORS Majumder,K., De Biasi,M., Wang,Z. and Wible,B.A. TITLE Molecular cloning and functional expression of a novel potassium channel beta-subunit from human atrium JOURNAL FEBS Lett. 361 (1), 13-16 (1995) MEDLINE 95196856 REFERENCE 2 (bases 1 to 1560) AUTHORS Majumder,K. TITLE Direct Submission JOURNAL Submitted (03-NOV-1994) Kumud Majumder, Dept. of Cell Biology, Baylor College of Medicine, 1 Baylor Plaza, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..1560 /organism="Homo sapiens" /note="isolated from tissue pooled from several patients undergoing cardiac surgeries" /db_xref="taxon:9606" /clone="pKMhKvb3" /tissue_type="atrial" /dev_stage="adult" CDS 68..1294 /note="hKvb3 or beta3" /codon_start=1 /product="potassium channel beta3 subunit" /db_xref="PID:g758699" /translation="MHLYKPACADIPSPKLGLPKSSESALKCRWHLAVTKTQPQAACK PVRPSGAAEQKYVEKFLRVHGISLQETTRAETGMAYRNLGKSGLRVSCLGLGTWVTFG GQISDEVAERLMTIAYESGVNLFDTAEVYAAGKAEVILGSIIKKKGWRRSSLVITTKL YWGGKAETERGLSRKHIIEGLKGSLQRLQLEYVDVVFANRPDSNTPMEEIVRAMTHVI NQGMAMYWGTSRWSAMEIMEAYSVARQFNMIPPVCEQAEYHLFQREKVEVQLPELYHK IGVGAMTWSPLACGIISGKYGNGVPESSRASLKCYQWLKERIVSEEGRKQQNKLKDLS PIAERLGCTLPQLAVAWCLRNEGVSSVLLGSSTPEQLIENLGAIQVLPKMTSHVVNEI DNILRNKPYSKKDYRS" BASE COUNT 451 a 341 c 397 g 371 t ORIGIN 1 gcaagataca gtgagtctta aagttaagca ccgtgcaatt agctttgctt ccttgggttt 61 ttgaaacatg catctgtata aacctgcctg tgcagacatc ccgagcccca agctgggtct 121 gccaaaatcc agtgaatcgg ctctaaaatg tagatggcac ctagcagtga ccaagactca 181 gcctcaggcg gcctgcaaac ctgtgaggcc cagtggagca gccgaacaga aatatgtgga 241 aaagtttcta cgtgttcatg gaatttcgtt gcaggaaacc accagagcag agacgggcat 301 ggcatacagg aatcttggaa aatcaggact cagagtttct tgcttgggtc ttggaacatg 361 ggtgacattt ggaggtcaaa tttcagatga ggttgctgaa cggctgatga ccatcgccta 421 tgaaagtggt gttaacctct ttgatactgc cgaagtctat gctgctggaa aggctgaagt 481 gattctgggg agcatcatca agaagaaagg ctggaggagg tccagtctgg tcataacaac 541 caaactctac tggggtggaa aagctgaaac agaaagaggg ctgtcaagaa agcatattat 601 tgaaggattg aagggctccc tccagaggct gcagctcgag tatgtggatg tggtctttgc 661 aaatcgaccg gacagtaaca ctcccatgga agaaattgtc cgagccatga cacatgtgat 721 aaaccaaggc atggcgatgt actggggcac ctcgagatgg agtgctatgg agatcatgga 781 agcctattct gtagcaagac agttcaatat gatcccaccg gtctgtgaac aagctgagta 841 ccatcttttc cagagagaga aagtggaggt ccagctgcca gagctctacc acaaaatagg 901 tgttggcgca atgacatggt ctccacttgc ctgtggaatc atctcaggaa aatacggaaa 961 cggggtgcct gaaagttcca gggcttcact gaagtgctac cagtggttga aagaaagaat 1021 tgtaagtgaa gaagggagaa aacagcaaaa caagctaaaa gacctttccc caattgcgga 1081 gcgtctggga tgcacactac ctcagctagc tgttgcgtgg tgcctgagaa atgaaggtgt 1141 gagttctgtg ctcctgggat catccactcc tgaacaactc attgaaaacc ttggtgccat 1201 tcaggttctc ccaaagatga catcacatgt ggtaaatgag attgataaca tactgcgcaa 1261 caagccctac agcaagaagg actatagatc ataaggcaat gcatgaacca cagaagctgc 1321 atggttaaaa tagcggcctg tgcccagtac agaaaggtgt tactaaccag tcttttgaat 1381 cacttagcag cttgctgcaa cctctagtgt ccctccctgg attctttgag gtgtctgact 1441 gtcgctacca ctgtgcacat ctgaaaactc acaaccaaga aaatccattc tattttctta 1501 tcttggactg gagtcaccta ttattgcatt gctgtataca cctcatgctt atgcaatggg // LOCUS HSU16954 1628 bp mRNA PRI 20-JUL-1995 DEFINITION Human (AF1q) mRNA, complete cds. ACCESSION U16954 NID g687589 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1628) AUTHORS Tse,W., Zhu,W., Chen,H.S. and Cohen,A. TITLE A novel gene, AF1q, fused to MLL in t(1;11) (q21;q23), is specifically expressed in leukemic and immature hematopoietic cells JOURNAL Blood 85 (3), 650-656 (1995) MEDLINE 95134895 REFERENCE 2 (bases 1 to 1628) AUTHORS Tse,W.W. TITLE Direct Submission JOURNAL Submitted (04-NOV-1994) William W. Tse, Division of Hospital Immunology and Cancer Research, Immunology and Cancer Research, 555 University Ave, Toronto, Ont. M5T 1X8, Canada FEATURES Location/Qualifiers source 1..1628 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1q21" /cell_type="fibroblast" gene 356..628 /gene="AF1q" CDS 356..628 /gene="AF1q" /note="transmembrane protein" /codon_start=1 /db_xref="PID:g687590" /translation="MRDPVSSQYSSFLFWRMPIPELDLSELEGLGLSDTATYKVKDSS VGKMIGQATAADQEKNPEGDGLLEYSTFNFWRAPIASIHSFELDLL" polyA_site 1628 /note="7 A bases" BASE COUNT 443 a 382 c 337 g 465 t 1 others ORIGIN 1 agtcagcacg ggggtgctgg aagagatcgg gaataatagc gcagaccaat gagcctaggg 61 agatgctttc atcgtctctc cttccctcaa gtgttctgga acctatcatt tganttagcc 121 gagtcaggca ggagggggcg gggaatcctt ccgcccttct taggaggggc tgcattgcag 181 ggggagagtg aactgacaga ctcagtcact gaagagggaa aaggagtgag aagacaaagc 241 cgtcaaagcc ccaacagctt tgtatttctc cagcccggcc ggcagacccc ggagctcccg 301 aggcactccc tccatctttg gaacgcgcca gtaattgaat tgataacagg aagctatgag 361 ggaccctgtg agtagccagt acagttcctt tcttttctgg aggatgccca tcccagaact 421 ggatctgtcg gagctggaag gcctgggtct gtcagataca gccacctaca aggtcaaaga 481 cagcagcgtt ggcaaaatga tcgggcaagc aactgcagca gaccaggaga aaaaccctga 541 aggtgatggc ctccttgagt acagcacctt caacttctgg agagctccca ttgccagcat 601 ccactccttc gaactggact tgctctaagg ccaagacttc tctctcccat caccttgccc 661 tcattgtctt ccctctcaag ccccttcctt tccactcctt tcccatttta atcttgttct 721 ctccctactg tgttggtggt gctgatgaat ctgccagagt tgagttctat gtatttattt 781 atctatctgt ctactccatt tctctcaaaa gccctcaagt cacaaagtaa atggttcaag 841 caatggagta ctgggtcaca gggattcctc ctttcccccc caaatattaa ctccagaaac 901 taggcctgac tggggacacc ctgagagtag tatagtagtg caaaatggaa gactgatttt 961 tgactctatt ataatcagct tcagagattc cttaaacctt cctaatttcc tgctccaggg 1021 cagtgaaaca caaatatttc ttcaaggggt gatgaaaacc tcggaagttt taatttgagg 1081 ttatctgcta cgaaacagta tttctaaaag gctaaagtga taagtctctt gctttttttt 1141 gatcctgctc ttatattctt ttttttcctc agagaaatca ggagggtagt tagaggtata 1201 aaacaggagg aaatattatg gaaaatgaaa atagggaaaa taattgaatc attttagaag 1261 tagctaattt cttttctcaa aagagtgtcc cttcttcaca cctactcact ttacaacttt 1321 gctcctaact gtgggttgaa aactctagct aaagaaagtt atcaaatctt aacatgcatt 1381 cctactatta tgatagtttt taaggtttca attcaatctt ctgaacggca taagtcctat 1441 tttagcctta cctcctgcat ttgcaatacg taatactgat cagtgggcac agttcttcag 1501 ctacattgag accctgaaat gaacaattat attctgactc gacatcttgt ccccaatcct 1561 tccaaaaata ttgatggtga tttgtgctac catttactcg tttatttaat aaagacattc 1621 aattccca // LOCUS HSU16997 1819 bp mRNA PRI 05-APR-1995 DEFINITION Human orphan receptor ROR gamma mRNA, complete cds. ACCESSION U16997 NID g758419 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1819) AUTHORS Hirose,T., Smith,R.J. and Jetten,A.M. TITLE ROR gamma: the third member of ROR/RZR orphan receptor subfamily that is highly expressed in skeletal muscle JOURNAL Biochem. Biophys. Res. Commun. 205 (3), 1976-1983 (1994) MEDLINE 95110350 REFERENCE 2 (bases 1 to 1819) AUTHORS Jetten,A.M. TITLE Direct Submission JOURNAL Submitted (07-NOV-1994) Anton M. Jetten, Cell Biology Section, National Institute Of Environmental Health Sciences, NIH, 111 T.W. Alexander Drive, Research Triangle Park, NC 27709, USA FEATURES Location/Qualifiers source 1..1819 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="RORg" /chromosome="1" /tissue_type="skeletal muscle" 5'UTR 1..69 CDS 70..1752 /codon_start=1 /function="orphan receptor" /product="ROR gamma" /db_xref="PID:g758420" /translation="MDRAPQRQHRASRELLAAKKTHTSQIEVIPCKICGDKSSGIHYG VITCEGCKGFFRRSQRCNAAYSCTRQQNCPIDRTSRNRCQHCRLQKCLALGMSRDAVK FGRMSKKQRDSLHAEVQKQLQQRQQQQQEPVVKTPPAGAQGADTLTYTLGLPDGQLPL GSSPDLPEASACPPGLLKASGSGPSYSNNLAKAGLNGASCHLEYSPERGKAEGRESFY STGSQLTPDRCGLRFEEHRHPGLGELGQGPDSYGSPSFRSTPEAPYASLTEIEHLVQS VCKSYRETCQLRLEDLLRQRSNIFSREEVTGYQRKSMWEMWERCAHHLTEAIQYVVEF AKRLSGFMELCQNDQIVLLKAGAMEVVLVRMCRAYNADNRTVFFEGKYGGMELFRALG CSELISSIFDFSHSLSALHFSEDEIALYTALVLINAHRPGLQEKRKVEQLQYNLELAF HHHLCKTHRQSILAKLPPKGKLRSLCSQHVERLQIFQHLHPIVVQAAFPPLYKELFST ETESPVGCPSDLEEGLLASPYGLLATSLDPVPPSPFSFPMNPGGWSPPALWK" 3'UTR 1753..1819 BASE COUNT 392 a 575 c 516 g 336 t ORIGIN 1 cccctgggcc ctgctccctg ccctcctggg cagccagggc agccaggacg gcaccaaggg 61 agctgcccca tggacagggc cccacagaga cagcaccgag cctcacggga gctgctggct 121 gcaaagaaga cccacacctc acaaattgaa gtgatccctt gcaaaatctg tggggacaag 181 tcgtctggga tccactacgg ggttatcacc tgtgaggggt gcaagggctt cttccgccgg 241 agccagcgct gtaacgcggc ctactcctgc acccgtcagc agaactgccc catcgaccgc 301 accagccgaa accgatgcca gcactgccgc ctgcagaaat gcctggcgct ggggatgtcc 361 cgagatgctg tcaagttcgg ccgcatgtcc aagaagcaga gggacagcct gcatgcagaa 421 gtgcagaaac agctgcagca gcggcaacag cagcaacagg aaccagtggt caagacccct 481 ccagcagggg cccaaggagc agataccctc acctacacct tggggctccc agacgggcag 541 ctgcccctgg gctcctcgcc tgacctgcct gaggcttctg cctgtccccc tggcctcctg 601 aaagcctcag gctctgggcc ctcatattcc aacaacttgg ccaaggcagg gctcaatggg 661 gcctcatgcc accttgaata cagccctgag cggggcaagg ctgagggcag agagagcttc 721 tatagcacag gcagccagct gacccctgac cgatgtggac ttcgttttga ggaacacagg 781 catcctgggc ttggggaact gggacagggc ccagacagct acggcagccc cagtttccgc 841 agcacaccgg aggcacccta tgcctccctg acagagatag agcacctggt gcagagcgtc 901 tgcaagtcct acagggagac atgccagctg cggctggagg acctgctgcg gcagcgctcc 961 aacatcttct cccgggagga agtgactggc taccagagga agtccatgtg ggagatgtgg 1021 gaacggtgtg cccaccacct caccgaggcc attcagtacg tggtggagtt cgccaagagg 1081 ctctcaggct ttatggagct ctgccagaat gaccagattg tgcttctcaa agcaggagca 1141 atggaagtgg tgctggttag gatgtgccgg gcctacaatg ctgacaaccg cacggtcttt 1201 tttgaaggca aatacggtgg catggagctg ttccgagcct tgggctgcag cgagctcatc 1261 agctccatct ttgacttctc ccactcccta agtgccttgc acttttccga ggatgagatt 1321 gccctctaca cagcccttgt tctcatcaat gcccatcggc cagggctcca agagaaaagg 1381 aaagtagaac agctgcagta caatctggag ctggcctttc atcatcatct ctgcaagact 1441 catcgccaaa gcatcctggc aaagctgcca cccaagggga agcttcggag cctgtgtagc 1501 cagcatgtgg aaaggctgca gatcttccag cacctccacc ccatcgtggt ccaagccgct 1561 ttccctccac tctacaagga gctcttcagc actgaaaccg agtcacctgt gggctgtcca 1621 agtgacctgg aagagggact ccttgcctct ccctatggcc tgctggccac ctccctggac 1681 cccgttccac cctcaccctt ttcctttccc atgaaccctg gagggtggtc cccaccagct 1741 ctttggaagt gagcagatgc tgcggctggc tttctgtcag caggccggcc tggcagtggg 1801 acaatcgcca gagggtggg // LOCUS HSU17032 4992 bp mRNA PRI 04-APR-1996 DEFINITION Human p190-B (p190-B) mRNA, complete cds. ACCESSION U17032 NID g687592 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4992) AUTHORS Burbelo,P.D., Miyamoto,S., Utani,A., Brill,S., Yamada,K.M., Hall,A. and Yamada,Y. TITLE p190-B, a new member of the Rho GAP family, and Rho are induced to cluster after integrin cross-linking JOURNAL J. Biol. Chem. 270 (52), 30919-30926 (1995) MEDLINE 96125066 REFERENCE 2 (bases 1 to 4992) AUTHORS Burbelo,P.D. TITLE Direct Submission JOURNAL Submitted (08-NOV-1994) Peter D. Burbelo, MRC, Univ. College London, Gower Street, London, WC1E 6BT, England FEATURES Location/Qualifiers source 1..4992 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="keratinocyte" gene 1..4802 /gene="p190-B" CDS 303..4802 /gene="p190-B" /note="member of the Rho GAP family" /codon_start=1 /product="p190-B" /db_xref="PID:g687593" /translation="MMAKNKEPRPPSYTISIVGLSGTEKDKGNCGVGKSCLCNRFVRS KADEYYPEHTSVLSTIDFGGRVVNNDHFLYWGDIIQNSEDGVECKIHVIEQTEFIDDQ TFLPHRSTNLQPYIKRAAASKLQSAEKLMYICTDQLGLEQDFEQKQMPEGKLNVDGFL LCIDVSQGCNRKFDDQLKFVNNLFVQLSKSKKPVIIAATKCDECVGHYLREVQAFASN KKNLLVVETLSAIKVNIETCFTALVQMLDKTRSKPKIIPYLDAYKTQRQLVVTATDKF EKLVQTVRDYHATWKTVSNKLKNHPDYEEYINLEGTRKARNTFSKHIEQLKQEHIRKR REEYINTLPRAFNTLLPNLEEIEHLNWSEALKLMEKRADFQLCFVVLEKTPWDETDHI DKINDRRIPFDLLSTLEAEKVYQNHVQHLISEKRRVEMKEKFKKTLEKIQFISPGQPW EEVMCFVMEDEAYKYITEADSKEVYGRHQREIVEKAKEEFQEMLFEHSELFYDLDLNA TPSSDKMSEIHTVLSEEPRYKALQKLAPDRESLLLKHIGFVYHPTKETCLSGQNCTDI KVEQLLASSLLQLDHGRLRLYHDSTNIDKVNLFILGKDGLAQELANEIRTQSTDDEYA LDGKIYELDLRPVDAKSPYFLSQLWTAAFKPHGCFCVFNSIESLSFIGEFIGKIRTEA SQIRKDKYMANLPFTLILANQRDSISKNLPILRHQGQQLANKLQCPFVDVPAGTYPRK FNETQIKQALRGVLESVKHNLDVVSPIPANKDLSEADLRIVMCAMCGDPFSVDLILSP FLDSHSCSAAQAGQNNSLMLDKIIGEKRRRIQITILSYHSSIGVRKDELVHGYILVYS AKRKASMGMLRAFLSEVQDTIPVQLVAVTDSQADFFENEAIKELMTEGEHIATEITAK FTALYSLSQYHRQTEVFTLFFSDVLEKKNMIENSYLSDNTRESTHQSEDVFLPSPRDC FPYNNYPDSDDDTEAPPPYSPIGDDVQLLPTPSDRSRYRLDLEGNEYPIHSTPNCHDH ERNHKVPPPIKPKPVVPKTNVKALVPNLLRAIEAGIGKNPRKQTSRVPFGPEDMDPSD NYAEPIDTIFKQKGYSDEIYVVPDDSQNRIKIRNSFVNNTQGDEENGFSDRPQKVMGN GGLQNTNINLKPCLVKPSHTIEEHIQMPVMMRLSPLLKPKRKGRHRGSEEDPLLSPVE TWKGGIDNPAITSDQELDDKKMKKKTHKVKEDKKKKTKNFNPPTRRNWESNYFGMPLQ DLVTAEKPIPLFVEKCVEFIEDTGLCTERLYRVSGNKTDQENIQKQFVQDHNINLVSM EVTVNAVAGALKAFFADLPDPLIPYSLHPELLEAAKIPDKTERLHALKEIVKKFHPVN YDVFRYVITHLNRVSQQHKINLMTADNLSICFGQPLMRPDLKSMEFLSTTKIHQSVVE TFIQQCQFFFYNGEIVETTNIVAPPPPSNPGQLVEPMVPLQLPPPLQPQLIQPQLQTD PLGII" BASE COUNT 1705 a 901 c 1011 g 1375 t ORIGIN 1 ccgcggtgag ccgcgaggaa gagaggcgag cgagagtgga ggaggaggcg gcggctgcgg 61 gacggtcccc aggaatgtcg ctgccccccc cccccctgcc gttgaggagg agacggagga 121 gaccgacgtt gttagggaag atgatcccta tgatctgccg ctgtttctgc acagaaatga 181 gggaaataca aagaaccaaa tacagttcta aatttgggat ctgtattttg agatgatttt 241 attttcagaa tgagaagcat atctggttac ctttatgaat gtagagacat gagaagagag 301 ttatgatggc aaaaaacaaa gagcctcgtc ccccatccta taccatcagt atagttggac 361 tctctgggac tgaaaaagac aaaggtaact gtggagttgg aaagtcttgt ttgtgcaata 421 gatttgtacg ctcaaaagca gatgaatatt atccagagca tacttctgtg cttagcacca 481 ttgactttgg aggacgagta gtaaacaatg atcacttttt gtactggggt gacataatac 541 aaaatagtga agatggagta gaatgcaaaa ttcatgtcat tgaacaaaca gagttcattg 601 atgaccagac tttcttgcct catcggagta cgaatttgca accatatata aaacgtgcag 661 ctgcatctaa attgcagtca gcagaaaaac taatgtacat ttgcactgat cagctaggct 721 tagaacaaga ctttgaacag aagcaaatgc ctgaagggaa gctcaacgta gatggatttt 781 tattatgcat tgatgtaagt caaggatgca ataggaagtt tgatgatcaa cttaaatttg 841 tgaataacct ttttgtccag ttatcaaaat caaaaaaacc tgtaataata gcagcaacta 901 aatgtgatga atgcgtgggt cattatctta gagaagttca ggcatttgct tcaaataaaa 961 agaaccttct tgtagtggaa acactcagcg caataaaagt caacattgaa acatgtttta 1021 ctgcactggt acaaatgttg gataaaactc gtagcaagcc taaaattatt ccctatttgg 1081 atgcttataa aacacagaga caacttgttg tcacagcaac agataagttt gaaaaacttg 1141 tgcagactgt gagagattat catgcaactt ggaaaactgt tagtaataaa ttaaaaaatc 1201 atcctgatta tgaagaatac atcaacttag agggaacaag aaaggccaga aatacattct 1261 caaaacatat agaacaactt aaacaggaac atataagaaa aaggagagaa gagtatataa 1321 atactttacc aagagctttt aacactcttt tgccaaatct agaagagatt gaacatttga 1381 attggtcaga agctttgaag ttaatggaaa agagagcaga tttccagtta tgttttgtgg 1441 tgctagaaaa aactccttgg gatgaaactg accatataga caaaattaat gataggcgga 1501 ttccatttga cctcctgagc actttagaag ctgaaaaagt ctatcagaac catgtacagc 1561 atctgatatc cgagaagagg agggtggaaa tgaaggaaaa attcaaaaag actttggaaa 1621 aaattcaatt catttcacca gggcagccat gggaggaagt tatgtgcttt gttatggagg 1681 atgaagccta caaatatatc actgaggctg atagcaaaga ggtatatggt aggcatcagc 1741 gagaaatagt tgaaaaagcc aaagaagagt ttcaagaaat gctttttgag cattctgaac 1801 ttttttatga tttagatctt aatgcaacac ctagttcaga taaaatgagt gaaattcata 1861 cagttctgag tgaagaacct agatataaag ctttacagaa acttgcacct gatagggaat 1921 cccttctact taagcatata ggatttgttt atcatcccac taaagaaaca tgtcttagtg 1981 gccaaaattg tacagacatt aaagtggagc agttacttgc tagtagtctt ttacagttgg 2041 atcatggccg cttaagatta tatcacgata gtaccaatat agataaagtt aaccttttta 2101 ttttagggaa ggatggcctt gcccaagaac tagcaaatga gataaggaca caatccactg 2161 atgatgagta tgccttagat ggaaaaattt atgaacttga tcttcggccg gttgatgcca 2221 aatcgcctta ctttttgagt cagttatgga ctgccgcctt taaaccacat gggtgcttct 2281 gtgtatttaa ttccattgag tcattgagtt ttattgggga atttattggg aaaataagaa 2341 ctgaagcttc tcagatcaga aaagataaat acatggctaa tcttccattt acattaattc 2401 tggctaatca gagagattcc attagtaaga atctaccaat tctcaggcac caagggcagc 2461 agttggcaaa caagttgcaa tgtccttttg tagatgtacc tgctggtaca tatcctcgta 2521 aatttaatga aacccaaata aagcaagctc tcagaggagt attggaatca gttaaacaca 2581 atttggatgt ggtgagccca attcctgcca ataaggactt atcagaagct gacttgagaa 2641 ttgtcatgtg cgccatgtgt ggagatccat ttagtgtgga tcttattctt tcacccttcc 2701 ttgattctca ttcttgcagt gctgctcaag ctggacagaa taattcccta atgcttgata 2761 aaatcattgg tgaaaaaagg aggcgaatac agatcacaat attatcatac cactcttcaa 2821 ttggagtaag aaaagatgaa ctagttcatg ggtatatatt agtttactct gcaaaacgga 2881 aagcttcgat gggaatgctt cgagcatttc tatcagaagt tcaagacacc attcctgtac 2941 agctggtggc agttactgac agccaagcag atttttttga aaatgaggct atcaaagagt 3001 taatgactga aggagaacac attgcaactg agatcactgc taaatttaca gcactgtatt 3061 ctttatctca gtatcatcgg caaactgagg tctttactct gttttttagt gatgttctag 3121 agaaaaaaaa tatgatagaa aattcttatt tgtctgataa tacaagggaa tcaacccatc 3181 aaagtgaaga tgtttttcta ccatctccca gagactgttt tccctataat aactaccctg 3241 attcagatga tgacacagaa gcaccacctc cttatagtcc aattggggat gatgtacagt 3301 tgcttccaac acctagtgac cgttccagat atagattaga tttggaagga aatgagtatc 3361 ctattcatag taccccaaac tgtcatgacc atgaacgcaa ccataaagtg cctccaccta 3421 ttaaacctaa accagttgta cctaagacaa atgtgaaagc gctcgttcca aaccttttaa 3481 gggcaattga agctggtatt ggtaaaaatc caagaaagca gacttcccgg gtgcctttcg 3541 gtcctgaaga tatggatcct tcagataact atgcggaacc cattgataca attttcaaac 3601 agaagggcta ttctgatgag atttatgttg tcccagatga tagtcaaaat cgtattaaaa 3661 ttcgaaactc atttgtaaat aacacccaag gagatgaaga aaatgggttt tctgatagac 3721 ctcaaaaagt catggggaac ggaggccttc aaaatacaaa tataaatcta aaaccttgtt 3781 tagtaaagcc aagtcatact atagaagaac acattcagat gccagtgatg atgaggcttt 3841 caccacttct aaaaccaaaa agaaaaggaa gacatcgtgg aagtgaagaa gatccacttc 3901 tttctcctgt tgaaacttgg aaaggtggta ttgataatcc tgcaatcact tctgaccagg 3961 agttagatga taagaagatg aagaagaaaa cccacaaagt gaaagaagat aaaaaaaaga 4021 aaactaagaa cttcaatcca ccaacacgta gaaattggga aagtaattac tttgggatgc 4081 ccctccagga tctggttaca gctgagaagc ccataccact atttgttgag aaatgtgtgg 4141 aatttattga agatacaggg ttatgtaccg agagactcta ccgtgtcagc gggaataaaa 4201 ctgaccaaga aaatattcaa aagcagtttg ttcaagatca taatatcaat ctagtgtcaa 4261 tggaagtaac agtaaatgct gtagctggag cccttaaagc tttctttgca gatctgccag 4321 atcctttaat tccatattct cttcatccag aactattgga agcagcaaaa atcccggata 4381 aaacagaacg tcttcatgcc ttgaaagaaa ttgttaagaa atttcatcct gtaaactatg 4441 atgtattcag atacgtgata acacatctaa acagggttag tcagcaacat aaaatcaacc 4501 taatgacagc agacaactta tccatctgtt ttggccaacc cttgatgaga cctgatttga 4561 aatcgatgga gtttctgtct actactaaga ttcatcaatc tgttgttgaa acattcattc 4621 agcagtgtca gtttttcttt tacaatggag aaattgtaga aacgacaaac attgtggctc 4681 ctccaccacc ttcaaaccca ggacagttgg tggaaccaat ggtgccactt cagttgccgc 4741 caccattgca acctcagctg atacaaccac aattacaaac ggatcctctt ggtattatat 4801 gagtaggaag tgattgcaaa caggctggat ttggacaaaa agcaaatcta gacatgcatg 4861 tttcagggtt cagtagtata cttcatgttt catacagata attcacattc aaaattacat 4921 tttctctttg aactagatgg tattccttat tcacttacat tacaaatcta agaccatgtg 4981 ataagcatga ct // LOCUS HSU17033 5633 bp mRNA PRI 21-JUL-1995 DEFINITION Human 180 kDa transmembrane PLA2 receptor mRNA, complete cds. ACCESSION U17033 NID g862374 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5633) AUTHORS Ancian,P., Lambeau,G., Mattei,M.G. and Lazdunski,M. TITLE The human 180-kDa receptor for secretory phospholipases A2. Molecular cloning, identification of a secreted soluble form, expression, and chromosomal localization JOURNAL J. Biol. Chem. 270 (15), 8963-8970 (1995) MEDLINE 95238395 REFERENCE 2 (bases 1 to 5633) AUTHORS Ancian,P. TITLE Direct Submission JOURNAL Submitted (08-NOV-1994) Philippe Ancian, Institut de Pharmacologie Moleculaire et Cellulaire, 660, Route des Lucioles Sophia-Antipolis, Valbonne, 06560, France FEATURES Location/Qualifiers source 1..5633 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="2" /map="2q23-24" /tissue_type="kidney" /dev_stage="adult" 5'UTR 1..206 CDS 207..4604 /codon_start=1 /product="180 kDa transmembrane PLA2 receptor precursor" /db_xref="PID:g862375" /translation="MLLSPSLLLLLLLGGAAGCAEGVAAALTPERLLEWQDKGIFVIQ SESLKKCIQAGKSVLTLGRTGSKQANKHMLWKWVSNHGLFNIGGSGCLGLNFSAPEQP LSLYECDSTLVSLRWRCNRKMITGPLQYSVQVAHDNTVVASRKYIHKWISYGSGGGDI CEYLHKDLHTIKGNTHGMPCMFPFQYNHQWHHECTREGREDDLLWCATTSRYERDEKW GFCPDPTSAEVGCDTIWEKDLNSHICYQFNLLSSLSWSEAHSSCQMQGGTLLSITDET EENFIREHMSSKTVEVWVGLNQLDEDAGWQWSDGTPLNYLNWSPEVNFEPFVEDHCGT FSSFMPSAWRSRDCESTLPYICKKYLNHIDHEIVEKDAWKYYATHCEPGWNPYNRNCY KLQKEEKTWHEALRSCQADNSALIDITSLAEVEFLVTLLGDENASETWIGLSSNKIPV SFEWSNDSSVIFTNWHTLEPHIFPNRSQLCVSAEQSEGHWKVKNCEERLFYICKKAGH VLSDAESGCQEGWEETCGFCYKIDTVLRSFDQASSGYYCPPALVTITNRFEQAFITSL ISSVVKMKDSYFWIALQDQNDTGEYTWKPVGQKPEPVQYTHWNTHQPRYSGGCVAMRG RHPLGRWEVKHCRHFKAMSLCKQPVENQEKAEYEERWPFHPCYLDWESEPGLASCFKV FHSEKVLMKRTWREAEAFCEEFGAHLASFAHIEEENFVNELLHPKFNWTEERQFWIGF NKRNPLNAGSWEWSDRTPVVSSFLDNTYFGEDARNCAVYKPNKTLLPLHCGSKREWIC KIPRDVKPKIPFWYQYDVPWLFYQDAEYLFHTFASEWLNFEFVCSWLHSDLLTIHSAH EQEFIHSKIKALSKYGASWWIGLQEERANDEFRWRDGTPVIYQNWDTGRERTVNNQSQ RCGFISSITGLWGSEECSVSMPSICKRKKVWLIEKKKDTPKQHGTCPKGWLYFNYKCL LLNIPKDPSSWKNWTHAQHFCAEEGGTLVAIESEVEQAFITMNLFGQTTSVWIGLQND DYETWLNGKPVVYSNWSPFDIINIPSHNTTEVQKHIPLCALLSSNPNFHFTGKWYFED CGKEGYGFVCEKMQDTSGHGVNTSDMYPMPNTLEYGNRTYKIINANMTWYAAIKTCLM HKAQLVSITDQYHQSFLTVVLNRLGYAHWIGLFTTDNGLNFDWSDGTKSSFTFWKDEE SSLLGDCVFADSNGRWHSTACDSFLQGAICHVPPETRQSEHPELCSETSIPWIKFKSN CYKFSTVLDSMSFEAAHEFCKKEGSNLLTIKDEAENAFLLEELFAFGSSVQMVWLNAQ FDGNNETIKWFDGTPTDQSNWGIRKPDTDYFKPHHCVALRIPEGLWQLSPCQEKKGFI CKMEADIHTAEALPEKGPSHSIIPLAVVLTLIVIVAICTLSFCIYKHNGGFFRRLAGF RNPYYPATNFSTVYLEENILISDLEKSDQ" mat_peptide 282..4601 /function="receptor for secretory phospholipases A2" /product="180 kDa transmembrane PLA2 receptor" misc_feature 4179 /note="An alternatively processed transcript encoding a soluble form of the human PLA2 receptor is generated by alternative polyadenylation within an intron located at nucleotide 4179, see GenBank Accession Number U17034" 3'UTR 4605..5633 polyA_signal 5616..5621 polyA_site 5633 /note="19 A nucleotides" BASE COUNT 1680 a 1103 c 1303 g 1547 t ORIGIN 1 cccgagtgtc ggttcactgt ggagacagcg gtggcggagt gggtctccag ggctctgggc 61 tggcaaggcc cccggagggg tggggcgcgg aggaggctac agatccgctt ccgcgcggcg 121 gggccgggtg cttgggacgc ggctctgggc tcccgggata aggggctccc gggacaaggg 181 gctcccggag agcccagtgg ttagcgatgc tgctgtcgcc gtcgctgctg ctgctgctgc 241 tgctgggggg cgccgcgggc tgcgccgagg gtgtggcggc ggcgcttacc cccgagcggc 301 tcctggagtg gcaggataaa ggaatatttg ttatccaaag tgagagtctc aagaaatgca 361 ttcaagcagg taaatcggtt ctgaccctcg gtagaactgg aagcaagcaa gcaaacaagc 421 acatgctgtg gaaatgggtt tcaaaccatg gcctctttaa cataggaggc agcggttgcc 481 tgggcctgaa tttctccgcc ccagagcagc cattaagctt atatgaatgt gactccaccc 541 tcgtttcctt acggtggcgc tgtaacagga agatgatcac aggcccgctg cagtactctg 601 tccaggtggc gcatgacaac acagtggtgg cctcacggaa gtatattcat aagtggattt 661 cttatgggtc aggtggtgga gacatttgtg aatatctaca caaagatttg catacaatca 721 aagggaacac ccacgggatg ccgtgtatgt ttcccttcca gtataaccat cagtggcatc 781 atgaatgtac ccgtgaaggt cgggaagatg acttactgtg gtgtgccacg acaagccgtt 841 atgaaagaga tgaaaagtgg ggattttgcc ctgatcccac ctctgcagaa gtaggttgtg 901 atactatttg ggagaaggac ctcaattcac acatttgcta ccagttcaac ctgctttcat 961 ctctctcttg gagtgaggca cattcttcat gccagatgca aggaggtacg ctgttaagta 1021 ttacagatga aactgaagaa aatttcataa gggagcacat gagcagtaaa acagtggagg 1081 tgtgggttgg cctcaatcag cttgatgaag acgctggctg gcagtggtct gatggaacgc 1141 cgctcaacta tctgaattgg agcccagagg taaattttga gccatttgtt gaagatcact 1201 gtggaacatt tagttcattt atgccaagtg cctggaggag tcgggattgt gagtccacct 1261 tgccatatat atgtaaaaaa tatctaaacc acattgatca tgaaatagtt gaaaaagatg 1321 cgtggaaata ttatgctacc cactgtgagc ctggctggaa tccctacaat cgtaattgct 1381 acaaacttca gaaagaagaa aagacctggc atgaggctct gcgttcttgt caggctgata 1441 acagtgcatt aatagacata acctcattag cagaggtgga gtttcttgta accctccttg 1501 gagatgaaaa tgcatcagaa acatggattg gtttgagcag caataaaatt ccagtttcct 1561 ttgaatggtc taatgactct tcagtcatct ttactaattg gcacacactt gagccccaca 1621 tttttccaaa tagaagccag ctgtgtgtct cagcagagca gtctgaggga cactggaaag 1681 tcaaaaattg tgaagaaaga cttttttaca tttgtaaaaa agcaggccat gtcctctctg 1741 atgctgaatc aggatgtcaa gagggatggg aggagacatg tggattctgt tacaaaattg 1801 acacagtcct tcgaagcttt gaccaagctt ccagcggtta ttactgtcct cctgcacttg 1861 taaccattac aaacaggttt gaacaggctt ttattaccag tttgatcagt agtgtggtaa 1921 aaatgaagga cagttatttt tggatagctc ttcaggacca aaatgatacg ggagaataca 1981 cttggaagcc agtagggcag aaacccgagc cggtgcagta cacacactgg aacacacacc 2041 aaccgcgcta cagtggtggc tgtgttgcca tgcgaggaag gcatccactt ggtcgctggg 2101 aagtgaagca ctgtcggcac tttaaggcaa tgtccttgtg caagcagcca gttgaaaatc 2161 aggaaaaagc agagtatgaa gagagatggc cctttcaccc ctgctatttg gactgggagt 2221 cagagcctgg tctggccagt tgcttcaagg tatttcatag tgaaaaagtt ctgatgaaaa 2281 gaacatggag agaagctgaa gcattttgcg aagaatttgg agctcatctt gcaagctttg 2341 cccatattga ggaagagaat tttgtgaatg agctcttaca cccaaaattt aattggacag 2401 aagaaaggca gttctggatt ggatttaata aaagaaaccc actgaatgcc ggctcatggg 2461 agtggtctga tagaactcct gttgtctctt cgtttttaga caacacttat tttggagaag 2521 atgcaagaaa ctgtgctgtt tataagccaa acaaaacatt gctgccctta cactgtggtt 2581 ccaaacgtga atggatatgc aaaatcccaa gagatgtgaa acccaagatt ccgttctggt 2641 accagtacga tgtaccctgg ctcttttatc aggatgcaga ataccttttt catacctttg 2701 cctcagaatg gttgaacttt gagtttgtct gtagctggct gcacagtgat cttctcacaa 2761 ttcattctgc acatgagcaa gaattcatcc acagcaaaat aaaagcgcta tccaagtatg 2821 gtgcaagttg gtggattgga cttcaagaag aaagagccaa tgatgaattt cgctggagag 2881 atggaacacc agtgatatac cagaactggg acacaggaag agaaagaact gtgaataatc 2941 agagccagag atgtggcttt atttcttcta taacaggact ctggggtagt gaagagtgtt 3001 cagtttctat gcctagtatc tgtaagcgaa aaaaggtttg gctcatagag aaaaagaaag 3061 atacaccaaa acaacatgga acgtgtccca aaggatggct atattttaac tataagtgcc 3121 ttctgctgaa tatccccaaa gacccaagca gttggaagaa ctggacgcat gcccaacatt 3181 tctgtgctga agaagggggg accctggtcg ccattgaaag tgaggtggag caagctttca 3241 ttactatgaa tctttttggc cagaccacca gtgtgtggat aggtttacaa aatgatgatt 3301 atgaaacatg gctaaatgga aaacctgtgg tatattctaa ctggtctcca tttgatataa 3361 taaatattcc aagtcacaat accactgaag ttcagaaaca cattcctctc tgtgccttac 3421 tctcaagtaa tcctaatttt catttcactg gaaaatggta ttttgaagac tgtggaaagg 3481 aaggctatgg gtttgtttgt gaaaaaatgc aagatacttc tggacacggt gtaaatacat 3541 ctgatatgta tccaatgccc aataccttag aatatggaaa cagaacttac aaaataatta 3601 atgcaaatat gacttggtat gcagcaataa aaacctgcct gatgcacaaa gcacaactgg 3661 tcagcatcac agaccagtat caccagtcct tcctcactgt tgtcctcaac cggctaggat 3721 atgcccactg gattggactg ttcaccacag ataatggtct taattttgac tggtctgatg 3781 gcaccaaatc ttctttcact ttttggaaag atgaggagtc ctccctcctt ggtgactgcg 3841 tttttgccga cagcaacgga cgctggcata gcacagcctg cgactcattt ctgcaaggtg 3901 ccatttgtca tgtaccacct gaaacaagac aatctgaaca cccagagttg tgctcagaaa 3961 catctattcc ctggataaaa tttaaaagta attgctacaa gttttctaca gtcctagaca 4021 gtatgagttt tgaggctgct catgaatttt gcaaaaagga aggttctaat cttttaacaa 4081 tcaaggatga ggctgaaaat gcatttctcc tagaagagct gtttgctttt ggttcttctg 4141 tccagatggt ttggttgaat gctcaatttg atggtaacaa tgaaaccata aagtggtttg 4201 atggaactcc cacagaccag tcaaactggg gcattcggaa gccagacaca gactacttca 4261 agccccatca ttgtgttgcc ttgaggatcc ctgaaggatt atggcagcta tccccgtgtc 4321 aagaaaaaaa aggctttata tgtaaaatgg aggcagatat tcacactgca gaggcgctgc 4381 cagaaaaagg accaagtcac agcatcattc ctcttgcggt tgtactgaca ctgatagtca 4441 ttgtggccat ttgcacactt tccttctgca tatacaagca taacggtggc ttcttcagga 4501 gacttgcagg gtttcggaat ccttactatc ctgcaaccaa ctttagtaca gtatatttag 4561 aagaaaatat tctcatttct gatcttgaga agagtgacca ataataatga ggtcagagaa 4621 tgccacagac accagggtaa gtaaagaaga ctaaacagga gtctcatctg tctttccctt 4681 tacagcacag atgccattag aatgtgaatt gggtcactat tttaattatt cttgaagtga 4741 ttactggttt tgaatcttaa ccaaatcaga tgggttttga tttattcatt tccctaaact 4801 gtgatccatt cttaaaaggg gtaaattatg cattggttat ttttcagaaa gacaagaact 4861 attaaaagaa actccctatt gaaaactctg aaatcaatgc gaataatagt ttttgcatta 4921 atgtatctct actaaaattt gggggaattt taaaactaat ctggtatcta ttcagacatt 4981 tacctgcact cgtaccatta agaaagacag aaagaagcca aaaaaaatta atcttgtata 5041 tgaggggaaa aggaaagggc ttctgagagg attcttagtt gtttcttttg aattcctttt 5101 aatagcagga tttggaaaat actaatttct gtgcttaagg gtcacaggtt ctgggctctc 5161 aactgatatt taaggtgaca ttcattttta ttaggtctaa catctcaagc taaaggagaa 5221 agaaaaatac ctccttttaa atggcaaaga ccttcattag cagcacactt ttataaacac 5281 ccatatagtt aaaatgtggc cttaaacttt caattactaa atgatgatta agttggatat 5341 tttaaaatgt cttatacata agcttaaagt aatatattga aactttaaca gttgtgctgt 5401 aaaaactcat ggactttctg ggattctaaa tatattatat aatatgttac actcttaata 5461 actggtagat ataaaatgta acttggattt aaaggagtag agctaaagat ctgtattata 5521 gtctcattag taccagacag atgttgttga gaagtactaa ataaattagt aatctagata 5581 tccttattat gtaaatgagt ttaggtgttc tatttaataa actattttct gga // LOCUS HSU17074 600 bp mRNA PRI 27-JAN-1995 DEFINITION Human CDK6 inhibitor p18 mRNA, complete cds. ACCESSION U17074 NID g639713 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 600) AUTHORS Guan,K.L., Jenkins,C.W., Li,Y., Nichols,M.A., Wu,X., O'Keefe,C.L., Matera,A.G. and Xiong,Y. TITLE Growth suppression by p18, a p16INK4/MTS1- and p14INK4B/MTS2-related CDK6 inhibitor, correlates with wild-type pRb function JOURNAL Genes Dev. 8 (24), 2939-2952 (1994) MEDLINE 95095079 REFERENCE 2 (bases 1 to 600) AUTHORS Guan,K. TITLE Direct Submission JOURNAL Submitted (09-NOV-1994) Kun-Liang Guan, Biological Chemistry, University of Michigan, 1301 East Catherine, Ann Arbor, MI 48109, USA FEATURES Location/Qualifiers source 1..600 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 94..600 /codon_start=1 /function="CDK6 inhibitor" /product="p18" /db_xref="PID:g639714" /translation="MAEPWGNELASAAARGDLEQLTSLLQNNVNVNAQNGFGRTALQV MKLGNPEIARRLLLRGANPDLKDRTGFAVIHDAARAGFLDTLQTLLEFQADVNIEDNE GNLPLHLAAKEGHLRVVEFLVKHTASNVGHRNHKGDTACDLARLYGRNEVVSLMQANG AGGATNLQ" BASE COUNT 163 a 129 c 177 g 131 t ORIGIN 1 ccgatgccat catgcagcct ggttaggagc aaaggaaagg ggaaaaagaa aaacgactaa 61 ttcatctttt cctgatcgtc aggaccctaa agaatggccg agccttgggg gaacgagttg 121 gcgtccgcag ctgccagggg ggacctagag caacttacta gtttgttgca aaataatgta 181 aacgtcaatg cacaaaatgg atttggaagg actgcgctgc aggttatgaa acttggaaat 241 cccgagattg ccaggagact gctacttaga ggtgctaatc ccgatttgaa agaccgaact 301 ggtttcgctg tcattcatga tgcggccaga gcaggtttcc tggacacttt acagactttg 361 ctggagtttc aagctgatgt taacatcgag gataatgaag ggaacctgcc cttgcacttg 421 gctgccaaag aaggccacct ccgggtggtg gagttcctgg tgaagcacac ggccagcaat 481 gtggggcatc ggaaccataa gggggacacc gcctgtgatt tggccaggct ctatgggagg 541 aatgaggttg ttagcctgat gcaggcaaac ggggctgggg gagccacaaa tcttcaataa // LOCUS HSU17075 738 bp mRNA PRI 27-JAN-1995 DEFINITION Human p14-CDK inhibitor mRNA, complete cds. ACCESSION U17075 NID g639715 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 738) AUTHORS Guan,K.L., Jenkins,C.W., Li,Y., Nichols,M.A., Wu,X., O'Keefe,C.L., Matera,A.G. and Xiong,Y. TITLE Growth suppression by p18, a p16INK4/MTS1- and p14INK4B/MTS2-related CDK6 inhibitor, correlates with wild-type pRb function JOURNAL Genes Dev. 8 (24), 2939-2952 (1994) MEDLINE 95095079 REFERENCE 2 (bases 1 to 738) AUTHORS Guan,K. TITLE Direct Submission JOURNAL Submitted (09-NOV-1994) Kun-Liang Guan, Biological Chemistry, University of Michigan, 1301 East Catherine, Ann Arbor, MI 48109, USA FEATURES Location/Qualifiers source 1..738 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 322..738 /codon_start=1 /product="p14-CDK inhibitor" /db_xref="PID:g639716" /translation="MREENKGMPSGGGSDEGLASAAARGLVEKVRQLLEAGADPNGVN RFGRRAIQVMMMGSARVAELLLLHGAEPNCADPATLTRPVHDAAREGFLDTLVVLHRA GARLDVRDAWGRLPVDLAEERGHRDVAGYLRTATGD" BASE COUNT 127 a 223 c 286 g 102 t ORIGIN 1 cccgcgacgc gtccgcaccc tgcggccaga gcggctttga gctcggctgc gtccgcgcta 61 ggcgcttttt cccagaagca atccaggcgc gcccgctggt tcttgagcgc caggaaaagc 121 ccggagctaa cgaccggccg ctcggccact gcacggggcc ccaagccgca gaaggacgac 181 ggggagggta atgaagctga gcccaggtct cctaggaagg agagagtgcg ccggagcagc 241 gtgggaaaga agggaagagt gtcgttaagt ttacggccaa cggtggatta tccgggccgc 301 tgcgcgtctg ggggctgcgg aatgcgcgag gagaacaagg gcatgcccag tgggggcggc 361 agcgatgagg gtctggccag cgccgcggcg cggggactag tggagaaggt gcgacagctc 421 ctggaagccg gcgcggatcc caacggagtc aaccgtttcg ggaggcgcgc gatccaggtc 481 atgatgatgg gcagcgcccg cgtggcggag ctgctgctgc tccacggcgc ggagcccaac 541 tgcgcagacc ctgccactct cacccgaccg gtgcatgatg ctgcccggga gggcttcctg 601 gacacgctgg tggtgctgca ccgggccggg gcgcggctgg acgtgcgcga tgcctggggt 661 cgtctgcccg tggacttggc cgaggagcgg ggccaccgcg acgttgcagg gtacctgcgc 721 acagccacgg gggactga // LOCUS HSU17195 2168 bp mRNA PRI 16-MAR-1996 DEFINITION Human A-kinase anchor protein (AKAP100) mRNA, complete cds. ACCESSION U17195 NID g687595 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2168) AUTHORS McCartney,S., Little,B.M., Langeberg,L.K. and Scott,J.D. TITLE Cloning and characterization of A-kinase anchor protein 100 (AKAP100). A protein that targets A-kinase to the sarcoplasmic reticulum JOURNAL J. Biol. Chem. 270 (16), 9327-9333 (1995) MEDLINE 95238446 REFERENCE 2 (bases 1 to 2168) AUTHORS McCartney,S. TITLE Direct Submission JOURNAL Submitted (16-NOV-1994) Shirley McCartney, Vollum Institute, 3181 S.W. Sam Jackson Park Road, Portland, OR 97201-3098, USA FEATURES Location/Qualifiers source 1..2168 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..1965 /gene="AKAP100" CDS 1..1965 /gene="AKAP100" /codon_start=1 /function="targets protein kinase A (PKA) to sarcoplasmic reticulum" /product="A-kinase anchor protein" /db_xref="PID:g687596" /translation="MSFTGQMSLDIASSINEDSAASLTELSSSDELSLCSEDIVLHKN KIPESNASFRKRLTRSVADESDVNVSMIVNVSCTSACTDDEDDSDLLSSSTLTLTEEE LCIKDEDDDSSIATDDEIYEDCTLMSGLDYIKNELQTWIRPKLSLTRDKKRCNVSDEM KGSKDISSSEMTNPSDTLNIETLLNGSVKRVSENNGNGKNSSHTHELGTKRENKKTIF KVNKDPYVADMENGNIEGIPERQKGKPNVTSKVSENLGSHGKEISESEHCKCKALMDS LDDSNTAGKEFVSQDVRHLPKKCPNHHHFENQSTASTPTEKSFSELALETRFNNRQDS DALKSSDDAPSMAGKSAGCCLALEQNGTEENASISDISCCNCEPDVFHQKDAEDCSVH NFVKEIIDMASTALKSKSQPENEVAAPTSLTQIKEKVLEHSHRPIQLRKGDFYSYLSL SSHDSDCGEVTNYIEEKSSTPLPLDTTDSGLDDKEDIECFFEACVEGDSDGEEPCFSS APPNESAVPSEAAMPLQATACSSEFSDSSLSADDADTVALSSPSSQERAEVGKEVNGL PQTSSGCAENLEFTPSKLDSEKESSGKPGESGMPEEHNAASAKSKVQDLSLKANQPTD KAALHPSPKTLTCEENLLNLHEKRHRNMHR" BASE COUNT 673 a 465 c 471 g 559 t ORIGIN 1 atgtccttta ctggccagat gtcattggac atagcatctt ctatcaatga agactcagcg 61 gcatctctaa cagaacttag cagcagtgac gagctctctc tttgctcaga ggatattgtg 121 ttacacaaga acaagatccc ggaatcgaat gcatcgttca ggaagcgtct gactcgttca 181 gtggctgatg aaagcgatgt caatgtcagc atgattgtta atgtctcttg cacctctgct 241 tgcactgatg atgaagatga cagcgacctg ctctccagct ctacccttac cttgactgaa 301 gaagagctgt gcatcaaaga tgaggatgac gactccagta ttgcaacaga tgatgaaatt 361 tatgaagact gcaccttgat gtcagggcta gactacataa agaatgaatt acagacctgg 421 attaggccaa aattgtcttt gacaagagat aagaaaaggt gcaatgtcag tgatgagatg 481 aagggcagta aagatataag tagcagtgag atgaccaatc cctctgatac tctgaatatt 541 gagacccttc taaatggctc tgtaaaacgt gtctctgaaa ataatggaaa tggtaagaat 601 tcatctcata cccatgagtt agggacaaag cgtgaaaata agaaaactat tttcaaagtt 661 aataaagatc catatgtggc tgacatggaa aatggcaata ttgaaggtat tccagaaagg 721 caaaagggca aaccgaatgt gacttcaaag gtatcagaaa atcttggttc acatgggaaa 781 gagatttcag agagtgagca ttgtaagtgt aaagcactta tggatagttt agatgattca 841 aatactgctg gcaaggaatt tgtttcccaa gatgttagac atcttccaaa gaaatgtcca 901 aatcaccacc attttgaaaa tcaaagcact gcctctactc ccactgagaa gtctttctca 961 gaactggctt tagaaaccag gtttaacaac agacaagact ctgatgcgct gaaatcatct 1021 gatgatgcac cgagtatggc tggaaaatct gctggttgtt gcctagcact tgaacaaaac 1081 ggaacagagg aaaatgcttc tatcagcgac atttcctgtt gcaactgtga gccagatgtt 1141 ttccatcaaa aagatgccga agattgttca gtacacaact ttgttaagga aatcattgac 1201 atggcttcga cagccctaaa aagtaaatct caacctgaaa acgaggtggc tgctcctact 1261 tcattaactc aaatcaagga gaaagtgttg gagcattctc accggcccat ccagctgaga 1321 aaaggggact tttattcgta cttatctctc tcatctcatg acagtgattg tggggaggtc 1381 accaattaca tagaagagaa aagcagcact ccattgccac tagacaccac tgactcgggc 1441 ttagatgaca aggaagatat tgaatgcttt tttgaggcct gtgttgaggg tgactctgat 1501 ggagaggagc cttgtttctc tagtgctcct ccaaatgaat ctgcagttcc cagcgaagct 1561 gcaatgccac tacaagcaac agcatgttct tctgagttca gtgatagttc tctttcagct 1621 gatgatgcag atacagtggc tctttcaagt ccttcctctc aggaaagagc tgaggttgga 1681 aaggaagtga atggtttgcc ccaaacttcc agtggctgtg cagaaaactt agagtttact 1741 ccttcaaagc ttgacagtga aaaggaaagt tccggaaaac caggtgaatc tggaatgcca 1801 gaagaacata atgctgcttc agccaaatct aaagttcaag acctctcctt gaaggcaaat 1861 cagccaacag acaaggccgc attgcatccc agccccaaaa ctttaacctg tgaagaaaat 1921 cttctaaacc ttcatgaaaa acgacataga aatatgcata ggtagaatgt accccctccc 1981 caagcatgaa aatcatctca ctgaaagata cgcctggctg caactcaggg gtggcctcat 2041 cctcccgccc tgggctggcc tctggttcca tcacgtttgt cactgccgtt tattacattg 2101 acttctccca agatgaatct tccttccaaa tgtgttttct ccacacaagc cttgtgatct 2161 gaatgtgg // LOCUS HSU17248 1100 bp mRNA PRI 13-NOV-1995 DEFINITION Human succinate dehydrogenase iron-protein subunit (sdhB) gene, complete cds. ACCESSION U17248 NID g665924 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1100) AUTHORS Au,H.C., Ream-Robinson,D., Bellew,L.A., Broomfield,P.L., Saghbini,M. and Scheffler,I.E. TITLE Structural organization of the gene encoding the human iron-sulfur subunit of succinate dehydrogenase JOURNAL Gene 159 (2), 249-253 (1995) MEDLINE 95347607 REFERENCE 2 (bases 1 to 1100) AUTHORS Scheffler,I.E. TITLE Direct Submission JOURNAL Submitted (16-NOV-1994) Immo E. Scheffler, Biology, University of California at San Diego, 9500 Gilman Drive, La Jolla, CA 92093, USA FEATURES Location/Qualifiers source 1..1100 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /tissue_type="whole liver" /dev_stage="adult" gene 135..977 /gene="sdhB" CDS 135..977 /gene="sdhB" /note="Ip complex II" /codon_start=1 /product="succinate dehydrogenase iron-protein subunit" /db_xref="PID:g665925" /translation="MAAVVALSLRRRLPATTLGGACLQASRGAQTAAATAPRIKKFAI YRWDPDKAGDKPHMQTYKVDLNKCGPMVLDALIKIKNEVDSTLTFRRSCREGICGSCA MNINGGNTLACTRRIDTNLNKVSKIYPLPHMYVIKDLVPDLSNFYAQYKSIEPYLKKK DESQEGKQQYLQSIEEREKLDGLYECILCACCSTSCPSYWWNGDKYLGPAVLMQAYRW MIDSRDDFTEERLAKLQDPFSLYRCHTIMNCTRTCPKGLNPGKAIAEIKKMMATYKEK KASV" misc_feature 410..437 /gene="sdhB" /note="encodes iron-sulfur cluster I" misc_feature 689..722 /gene="sdhB" /note="encodes iron-sulfur cluster II" misc_feature 860..893 /gene="sdhB" /note="encodes iron-sulfur cluster III" polyA_signal 1085..1090 polyA_site 1100 /note="5 A nucleotides" BASE COUNT 299 a 266 c 278 g 257 t ORIGIN 1 ggcctcccac ttggttgctc gtacgcggct agtgggtcct cagtggatgt aggctgggcg 61 ccgcgatgtt cgacgggaca ccggcggaga gcgacctcgg ggttaagggg tggggctgga 121 cgtcaggagc caagatggcg gcggtggtcg cactctcctt gaggcgccgg ttgccggcca 181 caacccttgg cggagcctgc ctgcaggcct cccgaggagc ccagacagct gcagccacag 241 ctccccgtat caagaaattt gccatctatc gatgggaccc agacaaggct ggagacaaac 301 ctcatatgca gacttataag gttgacctta ataaatgtgg ccccatggta ttggatgctt 361 taatcaagat taagaatgaa gttgactcta ctttgacctt ccgaagatca tgcagagaag 421 gcatctgtgg ctcttgtgca atgaacatca atggaggcaa cactctagct tgcacccgaa 481 ggattgacac caacctcaat aaggtctcaa aaatctaccc tcttccacac atgtatgtga 541 taaaggatct tgttcccgat ttgagcaact tctatgcaca gtacaaatcc attgagcctt 601 atttgaagaa gaaggatgaa tctcaggaag gcaagcagca gtatctgcag tccatagaag 661 agcgtgagaa actggacggg ctctacgagt gcattctctg tgcctgctgt agcaccagct 721 gccccagcta ctggtggaac ggagacaaat atctggggcc tgcagttctt atgcaggcct 781 atcgctggat gattgactcc agagatgact tcacagagga gcgcctggcc aagctgcagg 841 acccattctc tctataccgc tgccacacca tcatgaactg cacaaggacc tgtcctaagg 901 gtctgaatcc agggaaagct attgcagaga tcaagaaaat gatggcaacc tataaggaga 961 agaaagcttc agtttaactg tttccatgct aaacatgatt tataaccagc tcagagctga 1021 acataattta tatctaattt gagttccttt aaagatcttg gttttccatg aatacagcat 1081 gtataataaa aattttaaga // LOCUS HSU17278 1835 bp mRNA PRI 02-APR-1996 DEFINITION Human collapsin response mediator protein CRMP-1 mRNA, complete cds. ACCESSION U17278 NID g882148 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1835) AUTHORS Goshima,Y., Nakamura,F., Strittmatter,P. and Strittmatter,S.M. TITLE Collapsin-induced growth cone collapse mediated by an intracellular protein related to UNC-33 JOURNAL Nature 376 (6540), 509-514 (1995) MEDLINE 95364923 REFERENCE 2 (bases 1 to 1835) AUTHORS Strittmatter,S.M. TITLE Direct Submission JOURNAL Submitted (17-NOV-1994) Stephen M. Strittmatter, Neurology, Yale University School of Medicine, 333 Cedar Street, New Haven, CT 06520, USA FEATURES Location/Qualifiers source 1..1835 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /dev_stage="fetal" CDS 17..1546 /note="collapsin response mediator protein" /codon_start=1 /product="hCRMP-1" /db_xref="PID:g882149" /translation="MVIPGGIDVNTYLQKPSQGMTAADDFFQGTRAALVGGTTMIIDH VVPEPGSSLLTSFEKWHEAADTKSCCDYSLHVDITSWYDGVREELEVLVQDKGVNSFQ VYMAYKDVYQMSDSQLYEAFTFLKGLGAVILVHAENGDLIAQEQKRILEMGITGPEGH ALSRPEELEAEAVFRAITIAGRINCPVYITKVMSKSAADIIALARKKGPLVFGEPIAA SLGTDGTHYWSKNWAKAAAFVTSPPLSPDPTTPDYLTSLLACGDLQVTGSGHCPYSTA QKAVGKDNFTLIPEGVNGIEERMTVVWDKAVATGKMDENQFVAVTSTNAAKIFNLYPR KGRIAVGSDADVVIWDPDKLKTITAKSHKSAVEYNIFEGMECHGSPLVVISQGKIVFE DGNINVNKGMGRFIPRKAFPEHLYQRVKIRNKVFGLQGVSRGMYDGPVHEVPATPKYA TPAPSAKSSPSKHQPPPIRNLHQSNFSLSGAQIDDNNPRRTGHRIVAPPGGRSNITSL G" BASE COUNT 417 a 509 c 518 g 391 t ORIGIN 1 cgaagccaac gggcggatgg ttattcccgg aggtattgat gtcaacacgt acctgcagaa 61 gccctcccag gggatgactg cggctgatga cttcttccaa gggaccaggg cggcactggt 121 gggcgggacc acgatgatca ttgaccatgt tgttcctgaa cctgggtcca gcctactgac 181 ctctttcgag aagtggcacg aagcagctga caccaaatcc tgctgtgatt actccctcca 241 cgtggacatc acaagctggt acgatggcgt tcgggaggag ctggaggtgc tggtgcagga 301 caaaggcgtc aattccttcc aagtctacat ggcctataag gatgtctacc aaatgtccga 361 cagccagctc tatgaagcct ttaccttcct taagggcctg ggagctgtga tcttggtcca 421 tgcagaaaat ggagatttga tagctcagga acaaaagcgg atcctggaga tgggcatcac 481 gggtcccgag ggccatgccc tgagcagacc tgaagagctg gaggccgagg cggtgttccg 541 ggccatcacc attgcgggcc ggatcaactg ccctgtgtac atcaccaagg tcatgagcaa 601 gagtgcagcc gacatcatcg ctctggccag gaagaaaggg cccctagttt ttggagagcc 661 cattgccgcc agcctgggga ccgatggcac ccattactgg agcaagaact gggccaaggc 721 tgcggcgttc gtgacttccc ctcccctgag cccggaccct accacgcccg actacttgac 781 ctccctactg gcctgtgggg acttgcaggt cacaggcagc ggccactgtc cctacagcac 841 tgcccagaag gcggtgggca aggacaactt taccctgatc cccgagggtg tcaacgggat 901 agaggagcgg atgacggtcg tctgggacaa ggcggtggct actggcaaaa tggatgagaa 961 ccagtttgtc gctgtcacca gcaccaatgc agccaagatc tttaacctgt acccaaggaa 1021 agggcggatt gccgtgggct cggatgccga cgtggtcatc tgggaccccg acaagttgaa 1081 gaccataaca gccaaaagtc acaagtcggc ggtggagtac aacatcttcg agggtatgga 1141 gtgccacggc tccccactag tggtcatcag ccagggcaag atcgtctttg aagacggaaa 1201 catcaacgtc aacaagggca tgggccgctt cattccgcgg aaggcgttcc cggagcacct 1261 gtaccagcgc gtcaaaatca ggaataaggt ttttggattg caaggggttt ccaggggcat 1321 gtatgacggt cctgtgcacg aggtaccagc tacacccaaa tatgcaactc ccgctccttc 1381 agccaaatct tcgccttcta aacaccagcc cccacccatc agaaacctcc accagtccaa 1441 cttcagctta tcaggtgccc agatagatga caacaatccc aggcgcaccg gccaccgcat 1501 cgtggcgccc cctggtggcc gctccaacat caccagcctc ggttgaacgt ggatgcgcgg 1561 aggagctagc ctgaaggatt ctgggaatca tgtccatccc ttttcctgtc agtgtttttg 1621 aaacccacag ttttagttgg tgctgatgga gggaggggga agtcgaagga tgctctttcc 1681 cttttctgtt taggaagaag tggtactagt gtggtgtgtt tgcttggaaa ttccttgccc 1741 cacagttgtg ttcatgctga atccacctcg gagcatggtg ttttcattcc cccttcctag 1801 tgaaccacag gttttagcat tgtcttgttc tgtcg // LOCUS HSU17279 1829 bp mRNA PRI 02-APR-1996 DEFINITION Human collapsin response mediator protein hCRMP-2 mRNA, complete cds. ACCESSION U17279 NID g1244399 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1829) AUTHORS Goshima,Y., Nakamura,F., Strittmatter,P. and Strittmatter,S.M. TITLE Collapsin-induced growth cone collapse mediated by an intracellular protein related to UNC-33 JOURNAL Nature 376 (6540), 509-514 (1995) MEDLINE 95364923 REFERENCE 2 (bases 1 to 1829) AUTHORS Strittmatter,S.M. TITLE Direct Submission JOURNAL Submitted (17-NOV-1994) Stephen M. Strittmatter, Neurology, Yale University School of Medicine, 333 Cedar Street, New Haven, CT 06520, USA FEATURES Location/Qualifiers source 1..1829 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 72..1790 /note="collapsin response mediator protein" /codon_start=1 /product="hCRMP-2" /db_xref="PID:g1244400" /translation="MSYQGKKNIPRITSDRLLIKGGKIVNDDQSFYADIYMEDGLIKQ IGENLIVPGGVKTIEAHSRMVIPGGIDVHTRFQMPDQGMTSADDFFQGTKAALAGGTT MIIDHVVPEPGTSLLAAFDQWREWADSKSCCDYSLHVDISEWHKGIQEEMEALVKDHG VNSFLVYMAFKDRFQLTDCQIYEVLSVIRDIGAIAQVHAENGDIIAEEQQRILDLGIT GPEGHVLSRPEEVEAEAVNRAITIANQTNCPLYITKVMSKSSAEVIAQARKKGTVVYG EPITASLGTDGSHYWSKNWAKAAAFVTSPPLSPDPTTPDFLNSLLSCGDLQVTGSAHC TFNTAQKAVGKDNFTLIPEGTNGTEERMSVIWDKAVVTGKMDENQFVAVTSTNAAKVF NLYPRKGRIAVGSDADLVIWDPDSVKTISAKTHNSSLEYNIFEGMECRGSPLVVISQG KIVLEDGTLHVTEGSGRYIPRKPFPDFVYKRIKARSRLAELRGVPRGLYDGPVCEVSV TPKTVTPASSAKTSPAKQQAPPVRNLHQSGFSLSGAQIDDNIPRRTTQRIVAPPGGRA NITSLG" BASE COUNT 426 a 509 c 511 g 383 t ORIGIN 1 cccaagtccc cttcccggca gtttttgcct taaagctgcc ctcttgaaat taattttttc 61 ccaggagaga gatgtcttat caggggaaga aaaatattcc acgcatcacg agcgatcgtc 121 ttctgatcaa aggaggtaaa attgttaatg atgaccagtc gttctatgca gacatataca 181 tggaagatgg gttgatcaag caaataggag aaaatctgat tgtgccagga ggagtgaaga 241 ccatcgaggc ccactcccgg atggtgatcc ccggaggaat tgacgtccac actcgtttcc 301 agatgcctga tcagggaatg acgtctgctg atgatttctt ccaaggaacc aaggcggccc 361 tggctggggg aaccactatg atcattgacc acgttgttcc tgagcctggg acaagcctgc 421 tcgctgcctt tgaccagtgg agggaatggg ccgacagcaa gtcctgctgt gactactctc 481 tgcatgtgga catcagcgag tggcataagg gcatccagga ggagatggaa gcgcttgtga 541 aggatcacgg ggtaaattcc ttcctcgtgt acatggcttt caaagatcgc ttccagctaa 601 cggattgcca gatttatgaa gtactgagtg tgatccggga tattggcgcc atagcccaag 661 tccacgcaga aaatggcgac atcattgcag aggagcagca gaggatcctg gatctgggca 721 tcacgggccc cgagggacat gtgctgagcc gacctgagga ggtcgaggcc gaagccgtga 781 atcgtgccat caccatcgcc aaccagacca actgcccgct gtatatcacc aaggtgatga 841 gcaaaagctc tgctgaggtc atcgcccagg cacggaagaa gggaactgtg gtgtatggcg 901 agcccatcac tgccagcttg ggaacggacg gctcccatta ctggagcaag aactgggcca 961 aggctgctgc ctttgtcacc tccccaccct tgagccctga tccaaccact ccagactttc 1021 tcaactcctt gctgtcctgt ggagacctcc aggtcacggg cagtgcccat tgcacgttta 1081 acactgccca gaaggctgta ggaaaggaca acttcaccct gattccggag ggcaccaatg 1141 gcactgagga gcggatgtcc gtcatctggg acaaggctgt ggtcactggg aagatggatg 1201 agaaccagtt tgtggctgtg accagcacca atgcagccaa agtcttcaac ctttaccccc 1261 ggaaaggccg cattgctgtg ggatccgatg ccgacctggt catctgggac cccgacagcg 1321 ttaaaaccat ctctgccaag acacacaaca gctctctcga gtacaacatc tttgaaggca 1381 tggagtgccg cggctcccca ctggtggtca tcagccaggg gaagattgtc ctggaggacg 1441 gcaccctgca tgtcaccgaa ggctctggac gctacattcc ccggaagccc ttccctgatt 1501 ttgtttacaa gcgtatcaag gcaaggagca ggctggctga gctgagaggg gttcctcgtg 1561 gcctgtatga cggacccgtg tgtgaagtgt ctgtgacgcc caagacagtc actccagcct 1621 cctcggccaa gacgtctcct gccaagcagc aggccccacc tgtccggaac ctgcaccagt 1681 ctggattcag tttgtctggt gctcagattg atgacaacat tccccgccgc accacccagc 1741 gtatcgtggc gccccccggt ggccgtgcca acatcaccag cctgggctag agctcctggg 1801 ctgtgccgtc cactggggac tggggatgg // LOCUS HSU17280 1605 bp mRNA PRI 14-JUN-1995 DEFINITION Human steroidogenic acute regulatory protein (StAR) mRNA, complete cds. ACCESSION U17280 NID g727252 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1605) AUTHORS Sugawara,T., Holt,J.A., Driscoll,D., Strauss III,J.F., Lin,D., Miller,W.L., Patterson,D., Clancy,K.P., Hart,I.M., Clark,B.J. and Stocco,D.M. TITLE Human steroidogenic acute regulatory protein: functional activity in COS-1 cells, tissue-specific expression, and mapping of the structural gene to 8p11.2 and a pseudogene to chromosome 13 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (11), 4778-4782 (1995) MEDLINE 95281540 REFERENCE 2 (bases 1 to 1605) AUTHORS Strauss III,J. TITLE Direct Submission JOURNAL Submitted (18-NOV-1994) Jerome Strauss III, Obstetrics and Gynecology, University of Pensylvania, 422 Curie Blvd., Philadelphia, PA 19104-6142, USA FEATURES Location/Qualifiers source 1..1605 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="8" /map="8p11.2" /tissue_type="adrenal cortex" /dev_stage="adult" 5'UTR 1..126 gene 127..984 /gene="StAR" CDS 127..984 /gene="StAR" /note="expressed in gonads, adrenal cortex and kidney; stimulator of mitochondrial pregnenolone synthesis substrate translocator" /codon_start=1 /product="steroidogenic acute regulatory protein" /db_xref="PID:g727253" /translation="MLLATFKLCAGSSYRHMRNMKGLRQQAVMAISQELNRRALGGPT PSTWINQVRRRSSLLGSRLEETLYSDQELAYLQQGEEAMQKALGILSNQEGWKKESQQ DNGDKVMSKVVPDVGKVFRLEVVVDQPMERLYEELVERMEAMGEWNPNVKEIKVLQKI GKDTFITHELAAEAAGNLVGPRDFVSVRCAKRRGSTCVLAGMDTDFGNMPEQKGVIRA EHGPTCMVLHPLAGSPSKTKLTWLLSIDLKGWLPKSIINQVLSQTQVDFANHLRKRLE SHPASEARC" polyA_signal 1583..1588 BASE COUNT 408 a 412 c 468 g 317 t ORIGIN 1 gggactcaga ggcgaagctt gaggggctca ggaaggacga agaaccaccc ttgagagaag 61 aggcagcagc agcggcggca gcagcagcgg cagcgacccc accactgcca catttgccag 121 gaaacaatgc tgctagcgac attcaagctg tgcgctggga gctcctacag acacatgcgc 181 aacatgaagg ggctgaggca acaggctgtg atggccatca gccaggagct gaaccggagg 241 gccctggggg gccccacccc tagcacgtgg attaaccagg ttcggcggcg gagctctcta 301 ctcggttctc ggctggaaga gactctctac agtgaccagg agctggccta tctccagcag 361 ggggaggagg ccatgcagaa ggccttgggc atccttagca accaagaggg ctggaagaag 421 gagagtcagc aggacaatgg ggacaaagtg atgagtaaag tggtcccaga tgtgggcaag 481 gtgttccggc tggaggtcgt ggtggaccag cccatggaga ggctctatga agagctcgtg 541 gagcgcatgg aagcaatggg ggagtggaac cccaatgtca aggagatcaa ggtcctgcag 601 aagatcggaa aagatacatt cattactcac gagctggctg ccgaggcagc aggaaacctg 661 gtggggcccc gtgactttgt gagcgtgcgc tgtgccaagc gccgaggctc cacctgtgtg 721 ctggctggca tggacacaga cttcgggaac atgcctgagc agaagggtgt catcagggcg 781 gagcacggtc ccacttgcat ggtgcttcac ccgttggctg gaagtccctc taagaccaaa 841 cttacgtggc tactcagcat cgacctcaag gggtggctgc ccaagagcat catcaaccag 901 gtcctgtccc agacccaggt ggattttgcc aaccacctgc gcaagcgcct ggagtcccac 961 cctgcctctg aagccaggtg ttgaagacca gcctgctgtt cccaactgtg cccagctgca 1021 ctggtacaca cgctcatcag gagaatccct actggaagcc tgcaagtcta agatctccat 1081 ctggtgacag tgggatgggt ggggttcgtg tttagagtat gacactagga ttcagattgg 1141 tgaagttttt agtaccaaga aaacagggat gaggctcttg gattaaaagg taacttcatt 1201 cactgattag ctatgacatg agggttcagg cccctaaaat aattgtaaaa ctttttttct 1261 gggcccttat gtacccacct aaaaccatct ttaaaatgct agtggctgat atgggtgtgg 1321 gggatgctaa ccacagggcc tgagaagtct tgctttatgg gctcaagaat gccatgcgct 1381 ggcagtacat gtgcacaaag cagaatctca gagggtctcc tgcagccctc tgctcctccc 1441 ggccgctgca cagcaacacc acagaacaag cagcacccca cagtgggtgc cttccagaaa 1501 tatagtccaa gctttctctg tggaaaaaga caaaactcat tagtagacat gtttccctat 1561 tgctttcata ggcaccagtc agaataaaga atcataattc acacc // LOCUS HSU17327 7124 bp mRNA PRI 24-FEB-1995 DEFINITION Human neuronal nitric oxide synthase (NOS1) mRNA, complete cds. ACCESSION U17327 NID g642525 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7124) AUTHORS Hall,A.V., Antoniou,H., Wang,Y., Cheung,A.H., Arbus,A.M., Olson,S.L., Lu,W.C., Kau,C.L. and Marsden,P.A. TITLE Structural organization of the human neuronal nitric oxide synthase gene (NOS1) JOURNAL J. Biol. Chem. 269 (52), 33082-33090 (1994) MEDLINE 95105197 REFERENCE 2 (bases 1 to 7124) AUTHORS Marsden,P.A. TITLE Direct Submission JOURNAL Submitted (29-NOV-1994) Philip A. Marsden, Medicine, University of Toronto, Room 7358, Medical Sciences Building, 1 King's College Circle, Toronto, Ontario M5S 1A8, Canada FEATURES Location/Qualifiers source 1..7124 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="12" /map="12q24.2" gene 686..4990 /gene="NOS1" CDS 686..4990 /gene="NOS1" /EC_number="1.14.13.39" /codon_start=1 /product="neuronal nitric oxide synthase" /db_xref="PID:g642526" /translation="MEDHMFGVQQIQPNVISVRLFKRKVGGLGFLVKERVSKPPVIIS DLIRGGAAEQSGLIQAGDIILAVNGRPLVDLSYDSALEVLRGIASETHVVLILRGPEG FTTHLETTFTGDGTPKTIRVTQPLGPPTKAVDLSHQPPAGKEQPLAVDGASGPGNGPQ HAYDDGQEAGSLPHANGLAPRPPGQDPAKKATRVSLQGRGENNELLKEIEPVLSLLTS GSRGVKGGAPAKAEMKDMGIQVDRDLDGKSHKPLPLGVENDRVFNDLWGKGNVPVVLN NPYSEKEQPPTSGKQSPTKNGSPSKCPRFLKVKNWETEVVLTDTLHLKSTLETGCTEY ICMGSIMHPSQHARRPEDVRTKGQLFPLAKEFIDQYYSSIKRFGSKAHMERLEEVNKE IDTTSTYQLKDTELIYGAKHAWRNASRCVGRIQWSKLQVFDARDCTTAHGMFNYICNH VKYATNKGNLRSAITIFPQRTDGKHDFRVWNSQLIRYAGYKQPDGSTLGDPANVQFTE ICIQQGWKPPRGRFDVLPLLLQANGNDPELFQIPPELVLEVPIRHPKFEWFKDLGLKW YGLPAVSNMLLEIGGLEFSACPFSGWYMGTEIGVRDYCDNSRYNILEEVAKKMNLDMR KTSSLWKDQALVEINIAVLYSFQSDKVTIVDHHSATESFIKHMENEYRCRGGCPADWV WIVPPMSGSITPVFHQEMLNYRLTPSFEYQPDPWNTHVWKGTNGTPTKRRAIGFKKLA EAVKFSAKLMGQAMAKRVKATILYATETGKSQAYAKTLCEIFKHAFDAKVMSMEEYDI VHLEHETLVLVVTSTFGNGDPPENGEKFGCALMEMRHPNSVQEERKSYKVRFNSVSSY SDSQKSSGDGPDLRDNFESAGPLANVRFSVFGLGSRAYPHFCAFGHAVDTLLEELGGE RILKMREGDELCGQEEAFRTWAKKVFKAACDVFCVGDDVNIEKANNSLISNDRSWKRN KFRLTFVAEAPELTQGLSNVHKKRVSAARLLSRQNLQSPKSSRSTIFVRLHTNGSQEL QYQPGDHLGVFPGNHEDLVNALIERLEDAPPVNQMVKVELLEERNTALGVISNWTDEL RLPPCTIFQAFKYYLDITTPPTPLQLQQFASLATSEKEKQRLLVLSKGLQEYEEWKWG KNPTIVEVLEEFPSIQMPATLLLTQLSLLQPRYYSISSSPDMYPDEVHLTVAIVSYRT RDGEGPIHHGVCSSWLNRIQADELVPCFVRGAPSFHLPRNPQVPCILVGPGTGIAPFR SFWQQRQFDIQHKGMNPCPMVLVFGCRQSKIDHIYREETLQAKNKGVFRELYTAYSRE PDKPKKYVQDILQEQLAESVYRALKEQGGHIYVCGDVTMAADVLKAIQRIMTQQGKLS AEDAGVFISRMRDDNRYHEDIFGVTLRTYEVTNRLRSESIAFIEESKKDTDEVFSS" BASE COUNT 1699 a 2037 c 1822 g 1566 t ORIGIN 1 agagcggctc ttttaatgag ggttgcgacg tctccctccc cacacccata aaccagtcgg 61 gttggacgtc actgctaatt cgtttcagtg atgataggat aaaggaggga cattaagaaa 121 taaattcccc ctcacgaccc tcgctgagct cacggctcag tccctacata tttatgccgc 181 gtttccagcc gctgggtgag gagctactta gcgccgcggc tcctccgagg ggcggccggg 241 cagcgagcag cggccgagcg gacgggctca tgatgcctca gatctgatcc gcatctaaca 301 ggctggcaat gaagataccc agagaatagt tcacatctat catgcgtcac ttctagacac 361 agccatcaga cgcatctcct cccctttctg cctgacctta ggacacgtcc caccgcctct 421 cttgacgtct gcctggtcaa ccatcacttc cttagagaat aaggagagag gcggatgcag 481 gaaatcatgc caccgacggg ccaccagcca tgagtgggtg acgctgagct gacgtcaaag 541 acagagaggg ctgaagcctt gtcagcacct gtcaccccgg ctcctgctct ccgtgtagcc 601 tgaagcctgg atcctcctgg tgaaatcatc ttggcctgat agcattgtga ggtcttcaga 661 caggacccct cggaagctag ttaccatgga ggatcacatg ttcggtgttc agcaaatcca 721 gcccaatgtc atttctgttc gtctcttcaa gcgcaaagtt gggggcctgg gatttctggt 781 gaaggagcgg gtcagtaagc cgcccgtgat catctctgac ctgattcgtg ggggcgccgc 841 agagcagagt ggcctcatcc aggccggaga catcattctt gcggtcaacg gccggccctt 901 ggtggacctg agctatgaca gcgccctgga ggtactcaga ggcattgcct ctgagaccca 961 cgtggtcctc attctgaggg gccctgaagg tttcaccacg cacctggaga ccacctttac 1021 aggtgatggg acccccaaga ccatccgggt gacacagccc ctgggtcccc ccaccaaagc 1081 cgtggatctg tcccaccagc caccggccgg caaagaacag cccctggcag tggatggggc 1141 ctcgggtccc gggaatgggc ctcagcatgc ctacgatgat gggcaggagg ctggctcact 1201 cccccatgcc aacggcctgg cccccaggcc cccaggccag gaccccgcga agaaagcaac 1261 cagagtcagc ctccaaggca gaggggagaa caatgaactg ctcaaggaga tagagcctgt 1321 gctgagcctt ctcaccagtg ggagcagagg ggtcaaggga ggggcacctg ccaaggcaga 1381 gatgaaagat atgggaatcc aggtggacag agatttggac ggcaagtcac acaaacctct 1441 gcccctcggc gtggagaacg accgagtctt caatgaccta tgggggaagg gcaatgtgcc 1501 tgtcgtcctc aacaacccat attcagagaa ggagcagccc cccacctcag gaaaacagtc 1561 ccccacaaag aatggcagcc cctccaagtg tccacgcttc ctcaaggtca agaactggga 1621 gactgaggtg gttctcactg acaccctcca ccttaagagc acattggaaa cgggatgcac 1681 tgagtacatc tgcatgggct ccatcatgca tccttctcag catgcaagga ggcctgaaga 1741 cgtccgcaca aaaggacagc tcttccctct cgccaaagag tttattgatc aatactattc 1801 atcaattaaa agatttggct ccaaagccca catggaaagg ctggaagagg tgaacaaaga 1861 gatcgacacc actagcactt accagctcaa ggacacagag ctcatctatg gggccaagca 1921 cgcctggcgg aatgcctcgc gctgtgtggg caggatccag tggtccaagc tgcaggtatt 1981 cgatgcccgt gactgcacca cggcccacgg gatgttcaac tacatctgta accatgtcaa 2041 gtatgccacc aacaaaggga acctcaggtc tgccatcacc atattccccc agaggacaga 2101 cggcaagcac gacttccgag tctggaactc ccagctcatc cgctacgctg gctacaagca 2161 gcctgacggc tccaccctgg gggacccagc caatgtgcag ttcacagaga tatgcataca 2221 gcagggctgg aaaccgccta gaggccgctt cgatgtcctg ccgctcctgc ttcaggccaa 2281 cggcaatgac cctgagctct tccagattcc tccagagctg gtgttggaag ttcccatcag 2341 gcaccccaag tttgagtggt tcaaggacct ggggctgaag tggtacggcc tccccgccgt 2401 gtccaacatg ctcctagaga ttggcggcct ggagttcagc gcctgtccct tcagtggctg 2461 gtacatgggc acagagattg gtgtccgcga ctactgtgac aactcccgct acaatatcct 2521 ggaggaagtg gccaagaaga tgaacttaga catgaggaag acgtcctccc tgtggaagga 2581 ccaggcgctg gtggagatca atatcgcggt tctctatagc ttccagagtg acaaagtgac 2641 cattgttgac catcactccg ccaccgagtc cttcattaag cacatggaga atgagtaccg 2701 ctgccggggg ggctgccctg ccgactgggt gtggatcgtg ccccccatgt ccggaagcat 2761 cacccctgtg ttccaccagg agatgctcaa ctaccggctc accccctcct tcgaatacca 2821 gcctgatccc tggaacacgc atgtctggaa aggcaccaac gggaccccca caaagcggcg 2881 agccatcggc ttcaagaagc tagcagaagc tgtcaagttc tcggccaagc tgatggggca 2941 ggctatggcc aagagggtga aagcgaccat cctctatgcc acagagacag gcaaatcgca 3001 agcttatgcc aagaccttgt gtgagatctt caaacacgcc tttgatgcca aggtgatgtc 3061 catggaagaa tatgacattg tgcacctgga acatgaaact ctggtccttg tggtcaccag 3121 cacctttggc aatggagatc cccctgagaa tggggagaaa ttcggctgtg ctttgatgga 3181 aatgaggcac cccaactctg tgcaggaaga aaggaagagc tacaaggtcc gattcaacag 3241 cgtctcctcc tactctgact cccaaaaatc atcaggcgat gggcccgacc tcagagacaa 3301 ctttgagagt gctggacccc tggccaatgt gaggttctca gtttttggcc tcggctcacg 3361 agcataccct cacttttgcg ccttcggaca cgctgtggac accctcctgg aagaactggg 3421 aggggagagg atcctgaaga tgagggaagg ggatgagctc tgtgggcagg aagaggcttt 3481 caggacctgg gccaagaagg tcttcaaggc agcctgtgat gtcttctgtg tgggagatga 3541 tgtcaacatt gaaaaggcca acaattccct catcagcaat gatcgcagct ggaagagaaa 3601 caagttccgc ctcacctttg tggccgaagc tccagaactc acacaaggtc tatccaatgt 3661 ccacaaaaag cgagtctcag ctgcccggct ccttagccgt caaaacctcc agagccctaa 3721 atccagtcgg tcaactatct tcgtgcgtct ccacaccaac gggagccagg agctgcagta 3781 ccagcctggg gaccacctgg gtgtcttccc tggcaaccac gaggacctcg tgaatgccct 3841 gatcgagcgg ctggaggacg cgccgcctgt caaccagatg gtgaaagtgg aactgctgga 3901 ggagcggaac acggctttag gtgtcatcag taactggaca gacgagctcc gcctcccgcc 3961 ctgcaccatc ttccaggcct tcaagtacta cctggacatc accacgccac caacgcctct 4021 gcagctgcag cagtttgcct ccctagctac cagcgagaag gagaagcagc gtctgctggt 4081 cctcagcaag ggtttgcagg agtacgagga atggaaatgg ggcaagaacc ccaccatcgt 4141 ggaggtgctg gaggagttcc catctatcca gatgccggcc accctgctcc tgacccagct 4201 gtccctgctg cagccccgct actattccat cagctcctcc ccagacatgt accctgatga 4261 agtgcacctc actgtggcca tcgtttccta ccgcactcga gatggagaag gaccaattca 4321 ccacggcgta tgctcctcct ggctcaaccg gatacaggct gacgaactgg tcccctgttt 4381 cgtgagagga gcacccagct tccacctgcc ccggaacccc caagtcccct gcatcctcgt 4441 tggaccaggc accggcattg cccctttccg aagcttctgg caacagcggc aatttgatat 4501 ccaacacaaa ggaatgaacc cctgccccat ggtcctggtc ttcgggtgcc ggcaatccaa 4561 gatagatcat atctacaggg aagagaccct gcaggccaag aacaaggggg tcttcagaga 4621 gctgtacacg gcttactccc gggagccaga caaaccaaag aagtacgtgc aggacatcct 4681 gcaggagcag ctggcggagt ctgtgtaccg agccctgaag gagcaagggg gccacatata 4741 cgtctgtggg gacgtcacca tggctgctga tgtcctcaaa gccatccagc gcatcatgac 4801 ccagcagggg aagctctcgg cagaggacgc cggcgtattc atcagccgga tgagggatga 4861 caaccgatac catgaggata tttttggagt caccctgcga acgtacgaag tgaccaaccg 4921 ccttagatct gagtccattg ccttcattga agagagcaaa aaagacaccg atgaggtttt 4981 cagctcctaa ctggaccctc ttgcccagcc ggctgcaagt tttgtaagcg cggacagaca 5041 ctgctgaacc tttcctctgg gaccccctgt ggccctcgct ctgcctcctg tccttgtcgc 5101 tgtgccctgg tttccctcct cgggcttctc gcccctcagt ggtttcctcg gccctcctgg 5161 gtttactcct tgagttttcc tgctgcgatg caatgctttt ctaatctgca gtggctctta 5221 caaaactctg ttcccactcc ctctcttgcc gacaagggca actcacgggt gcatgaaacc 5281 actggaacat ggccgtcgct gtgggggttt ttttctctgg ggttcccctg gaaaggctgc 5341 aggaactagg cacaagctct ctgagccagt ccctcagcca ctgaagtccc cctttctcct 5401 tttttatgat gacattttgg ttgtgcgtgc ctgtgtgtgt gtgtgtgtgt gtgtgtgtgt 5461 gtgtgatggg ccaggtctct gtccgtcctc ttccctgcac aagtgtgtcg atcttagatt 5521 gccactgctt tcattgaaga ccctcaatgc caagaaacgt gtccctggcc catattaatc 5581 cctcgtgtgt ccataattag ggtccacgcc catgtacctg aaacatttgg aagccccata 5641 attgttctag ttagaaaggg ttcagggcat ggggagagga gtgggaaatt gattaaaggg 5701 gctgtctccc aatgaaagag gcattcccag aatttgctgc atttagattt tgataccagt 5761 gagcagagcc ctcatgtgac atgaacccat ccaatggatt gtgcaaatcc cctccccaaa 5821 cccacccata ccagctagaa tcacttgact ttgccacatc cattgactga ccccctcctc 5881 cagcaatagc atccaagggg cctggaagtt atgttgttca aagaagcctg gtggcaataa 5941 ggatcttccc actttgccac tggatgactt tggatgggtc acttgtcctc agtttttcct 6001 agtcataatg tcatacgaac ctaaagaata tgaatggatt aaatgttaaa gctttggtgc 6061 ctggaaacaa tatcaagtaa caatatgatt attatttttt tattccccca aagcgggctt 6121 gctgcttcac ccttggggat gaaataatgg aagctggtta aagtggatga ggttggaaag 6181 agttgccata atgaggtccc acgtggcttc ttcgatagga gccacaactt ggggtgggaa 6241 gaacttgtcc ctcaggcttg ttgccctctg cagttgatct ccaaagtttt aaacctgtta 6301 aattaatttt gacaaataag ttaccctcaa ctcagatcaa aaatgggcag ccaagtcttc 6361 ggtaggaatt ggagccggtg taattcctcc ctaagaggca acctgttgaa tttactctct 6421 cagagtaaat ggtgggaagg gatccctttg tatacttttt taaatactac aaattagtgt 6481 caggcagttc ccagaaagag acaagaaatc ctagtggcct cccagactgc agggtcccca 6541 aggatggaaa gggaatgttc tgctggttct accctgtttg ttgtgtcttg ctatacagaa 6601 aaaccacatt tcttttatat actgtacgtg ggcatatctt gttgttcagt ttgggtgtct 6661 gctaaagagg aagtgcactg gccctctttg aaagggcttt acagtggggg caccaagacc 6721 ccaaaggccc aggccaggag actgttaaag tgaaaaggca atctatgact caccttgctc 6781 tgccatccct ggcagccccc accggtgtcc tgttcctgcc acatggagct tgacttcatg 6841 ccagctataa tctcccctgc cttcctttaa tcccaatttc ccctgctcac tcttccacag 6901 atataaagaa caaacactta gcatcccaca ctcacccctt ctaatcctga agggaagccc 6961 attctaaact cctttcctgc aaacccattt ccagctccta gtagctttcc tcccaaaggc 7021 tttctttcca atcctttata gctttggaga cgcctcccca attccccagg gaaggaaact 7081 gttgtgtcca atccccatta aagacaaatt gatcagtgct tccc // LOCUS HSU17473 2187 bp mRNA PRI 11-FEB-1995 DEFINITION Human calcitonin-like receptor mRNA, complete cds. ACCESSION U17473 NID g662328 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2187) AUTHORS Fluehmann,B., Muff,R., Hunziker,W., Fischer,J.A. and Born,W. TITLE A human orphan calcitonin receptor-like structure JOURNAL Biochem. Biophys. Res. Comm. 206, 341-347 (1995) REFERENCE 2 (bases 1 to 2187) AUTHORS Born,W. TITLE Direct Submission JOURNAL Submitted (22-NOV-1994) Walter Born, Orthopaedic Surgery and Medicine, Research Laboratory for Calcium Metabolism, University of Zu, Zurich CH-8008, Switzerland FEATURES Location/Qualifiers source 1..2187 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="cerebellum" CDS 471..1856 /note="seven membrane receptor; orphan calcitonin-like receptor; G-protein coupled receptor" /codon_start=1 /product="calcitonin-like receptor" /db_xref="PID:g662329" /translation="MEKKCTLYFLVLLPFFMILVTAELEESPEDSIQLGVTRNKIMTA QYECYQKIMQDPIQQAEGVYCNRTWDGWLCWNDVAAGTESMQLCPDYFQDFDPSEKVT KICDQDGNWFRHPASNRTWTNYTQCNVNTHEKVKTALNLFYLTIIGHGLSIASLLISL GIFFYFKSLSCQRITLHKNLFFSFVCNSVVTIIHLTAVANNQALVATNPVSCKVSQFI HLYLMGCNYFWMLCEGIYLHTLIVVAVFAEKQHLMWYYFLGWGFPLIPACIHAIARSL YYNDNCWISSDTHLLYIIHGPICAALLVNLFFLLNIVRVLITKLKVTHQAESNLYMKA VRATLILVPLLGIEFVLIPWRPEGKIAEEVYDYIMHILMHFQGLLVSTIFCFFNGEVQ AILRRNWNQYKIQFGNSFSNSEALRSASYTVSTISDGPGYSHDCPSEHLNGKSIHDIE NVLLKPENLYN" BASE COUNT 668 a 444 c 411 g 664 t ORIGIN 1 aattccggag gaccatcaag ctctgctaac tgaatctcat cctaattgca ggatcacatt 61 gcaaagcttt cactctttcc caccttgctt gtgggtaaat ctcttctgcg gaatctcaga 121 aagtaaagtt ccatcctgag aatatttcac aaagaatttc cttaagagct ggactgggtc 181 ttgacccctg aatttaagaa attcttaaag acaatgtcaa atatgatcca agagaaaatg 241 tgatttgagt ctggagacca attgtgcata tcgtctaata ataaaaaccc atactagcct 301 atagaaaaca atatttgaaa gattgctacc actaaaaaga aaactactac aacttgacaa 361 gactgctgca aacttcaatt tgtcaaccac aacttgacaa ggttgctata aaacaagatt 421 gctacaactt ctagtttatg ttatacagca tatttcattt tggcttaatg atggagaaaa 481 agtgtaccct gtattttctg gttctcttgc ctttttttat gattcttgtt acagcagaat 541 tagaagagag tcctgaggac tcaattcagt tgggagttac tagaaataaa atcatgacag 601 ctcaatatga atgttaccaa aagattatgc aagaccccat tcaacaagca gaaggcgttt 661 actgcaacag aacctgggat ggatggctct gctggaacga tgttgcagca ggaactgaat 721 caatgcagct ctgccctgat tactttcagg actttgatcc atcagaaaaa gttacaaaga 781 tctgtgacca agatggaaac tggtttagac atccagcaag caacagaaca tggacaaatt 841 atacccagtg taatgttaac acccacgaga aagtgaagac tgcactaaat ttgttttacc 901 tgaccataat tggacacgga ttgtctattg catcactgct tatctcgctt ggcatattct 961 tttatttcaa gagcctaagt tgccaaagga ttaccttaca caaaaatctg ttcttctcat 1021 ttgtttgtaa ctctgttgta acaatcattc acctcactgc agtggccaac aaccaggcct 1081 tagtagccac aaatcctgtt agttgcaaag tgtcccagtt cattcatctt tacctgatgg 1141 gctgtaatta cttttggatg ctctgtgaag gcatttacct acacacactc attgtggtgg 1201 ccgtgtttgc agagaagcaa catttaatgt ggtattattt tcttggctgg ggatttccac 1261 tgattcctgc ttgtatacat gccattgcta gaagcttata ttacaatgac aattgctgga 1321 tcagttctga tacccatctc ctctacatta tccatggccc aatttgtgct gctttactgg 1381 tgaatctttt tttcttgtta aatattgtac gcgttctcat caccaagtta aaagttacac 1441 accaagcgga atccaatctg tacatgaaag ctgtgagagc tactcttatc ttggtgccat 1501 tgcttggcat tgaatttgtg ctgattccat ggcgacctga aggaaagatt gcagaggagg 1561 tatatgacta catcatgcac atccttatgc acttccaggg tcttttggtc tctaccattt 1621 tctgcttctt taatggagag gttcaagcaa ttctgagaag aaactggaat caatacaaaa 1681 tccaatttgg aaacagcttt tccaactcag aagctcttcg tagtgcgtct tacacagtgt 1741 caacaatcag tgatggtcca ggttatagtc atgactgtcc tagtgaacac ttaaatggaa 1801 aaagcatcca tgatattgaa aatgttctct taaaaccaga aaatttatat aattgaaaat 1861 agaaggatgg ttgtctcact gttttgtgct tctcctaact caaggacttg gacccatgac 1921 tctgtagcca gaagacttca atattaaatg actttttgaa tgtcataaag aagagccttc 1981 acatgaaatt agtagtgtgt tgataagagt gtaacatcca gctctatgtg ggaaaaaaga 2041 aatcctggtt tgtaatgttt gtcagtaaat actcccacta tgcctgatgt gacgctacta 2101 acctgacatc accaagtgtg gaattggaga aaagcacaat caacttttct gagctggtgt 2161 aagccagttc cagcacacca ttgcatg // LOCUS HSU17714 2932 bp mRNA PRI 03-MAR-1997 DEFINITION Human putative tumor suppressor (SNC6) mRNA, complete cds. ACCESSION U17714 NID g1549233 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2932) AUTHORS Zheng,S., Cai,X., Geng,L., Cao,J., Shi,Z. and Zheng,L. TITLE SNC6 mRNA sequence JOURNAL Unpublished REFERENCE 2 (bases 1 to 2932) AUTHORS Zheng,S. TITLE Direct Submission JOURNAL Submitted (29-NOV-1994) Shu Zheng, Cancer Institute, Zhejiang Medical University, Hangzhou, 310009, People's Republic of China FEATURES Location/Qualifiers source 1..2932 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="cDNA subtraction library between normal colon mucosa and colon cancer mucosa" gene 154..969 /gene="SCN6" CDS 154..969 /gene="SCN6" /function="putative tumor suppressor gene" /note="expressed less in colon cancer tissue than in normal colon mucosa" /codon_start=1 /db_xref="PID:g1857033" /translation="MGDENAEITEEMMDQANDKKVAAIEALNDGELQKAIDLFTDAIK LNPRLAILYAKRASVFVKLQKPNAAIRDCDRAIEINPDSAQPYKWRGKAHRLLGHWEE AAHDLALACKLDYDEDASAMLKEVQPRAQKIAEHRRKYERKREEREIKERIERVKKAR EEHERAQREEEARRQSGAQYGSFPGGFPGGMPGNFPGGMPGMGGGMPGMAGMPGLNEI LSDPEVLAAMQDPEVMVAFQDVAQNPANMSKYQSNPKVMNLISKLSAKFGGQA" BASE COUNT 900 a 515 c 615 g 902 t ORIGIN 1 tcagttaccg gcggaattcc ggccaaggaa gaaaaacctg atagtaagaa ggtggaggaa 61 gacttaaagg cagacgaacc atcaagtgag gaaagtgatc tagaaattga taaagaaggt 121 gtgattgagc cagacactga tgctcctcaa gaaatgggag atgaaaatgc ggagataacg 181 gaggagatga tggatcaggc aaatgataaa aaagtggctg ctattgaagc cctaaatgat 241 ggtgaactcc agaaagccat tgacttattc acagatgcca tcaagctgaa tcctcgcttg 301 gccattttgt atgccaagag ggccagtgtc ttcgtcaaat tacagaagcc aaatgctgcc 361 atccgagact gtgacagagc cattgaaata aatcctgatt cagctcagcc ttacaagtgg 421 cgggggaaag cacacagact tctaggccac tgggaagaag cagcccatga tcttgccctt 481 gcctgtaaat tggattatga tgaagatgct agtgcaatgc tgaaagaagt tcaacctagg 541 gcacagaaaa ttgcagaaca tcggagaaag tatgagcgaa aacgtgaaga gcgagagatc 601 aaagaaagaa tagaacgagt taagaaggct cgagaagagc atgagagagc ccagagggag 661 gaagaagcca gacgacagtc aggagctcag tatggctctt ttccaggtgg ctttcctggg 721 ggaatgcctg gtaattttcc cggaggaatg cctggaatgg gagggggcat gcctggaatg 781 gctggaatgc ctggactcaa tgaaattctt agtgatccag aggttcttgc agccatgcag 841 gatccagaag ttatggtggc tttccaggat gtggctcaga acccagcaaa tatgtcaaaa 901 taccagagca acccaaaggt tatgaatctc atcagtaaat tgtcagccaa atttggaggt 961 caagcgtaat gtccttctga taaataaagc ccttgctgaa ggaaaagcaa cctagatcac 1021 cttatggatg tcgcaataat acaaccagtg tacctctgac cttctcatca agagagctgg 1081 ggtgctttga agataatccc tacccctctc ccccaaatgc agctgaagca ttttacagtg 1141 gtttgccatt ggggtattca ttcagatatg ttttcctact aggaattaca aactttaaac 1201 actttttaaa tcttcaaaat atttaaaaca aatttaaagg gcctgttaat tcttatattt 1261 ttctttacta atcattttgg atttttttct ttgaattatt ggcagggaat atacttatgt 1321 atggaagatt actgctctga gtgaaataaa agttattagt gcgaggcaaa cataactcat 1381 ttgaggataa agtttgtgtt ggatatgtgg ttcctgatgc attttgactt gtctttttaa 1441 atgctttatc tttttcttta aagatttatt tcaataaaac taattgggac cacccgtatt 1501 tcagtaggac ctgggtaggg attggaagta cttggcaggg cagcagcaat cttgctgtgt 1561 ttgatataac atgcatccct tgggcaggtt gcccttaaat cttacactgt ggtgaaggga 1621 tgtttttttt gtaatgctgc agtagagttg gagtacttag ttctcttgtt gtccagtaca 1681 tctaataagt gtttttcata ttatttccac gtaagggaaa taaggtagta cttttctttt 1741 tatatttcta tgcttaaaat tctctttcct agtcaaaaat tgcccacatc tgtgtttgct 1801 ttctgcctgc tacatttgtc tcccttactt ttcttgagct aaagacaggc tttttccacc 1861 ggcatcatca ctgctatcat cattaacagc gtaattatac aagcatattt aatgctgagt 1921 ttaatttaat atgtaataca tatggtaatt gtagggtaat acccacaaca actgtagttt 1981 cttacttggc caagagaatg cttatttaag tgttagactt ccattctggc aaaatcttgc 2041 cttatcagaa gacattggaa agagggattc cctttggtgt ttggtcttct acttagaaaa 2101 acctattgca gttagtttat cttgtagtat tcatctttgt attctgaaga taaggtttga 2161 attaaattga tacacacaga ggggaaccga ttttttttat ccaatgtgaa ttataaatga 2221 gataatccac agttattcat tgtggagttg ttgagactat gaaagactca ttgtctttgt 2281 attcagctct taaatagtgt aactatatcc ccacctctgc ttgctttctt ccctcccctc 2341 caatgataaa gaaaatgata aattttctgt tgtgcattca attcttcatt ttaaataaga 2401 ctaagtatag gcattgtagc ctgacattgc tgaaacgttt caccaggtgt tcaaattaaa 2461 gtgctagtgt taaaaaaatt tcaggggata ggcccttctg taacttggct aattggagga 2521 tcagtggtag ggagcagtga agtaaattct atgggagaac atttctaaaa taccacattt 2581 ctgaaatcat aaataagttt attcaggttc taaccctttg ctgtacacaa gcagacagaa 2641 atgcatctgt tacataaatg agaaaaagct attatgctga tggagcatgc tttttaaatc 2701 ctttaaaaac actcaccata taaacttgca tttgagcttg tgtgttcttt tgttaatgtg 2761 tagagttctc ctttctcgaa attgccagtg tgtacttggc ttaactcaag aacagtttct 2821 tctggattcc ttatttgatt tatttaacct aattatattc taatattgca aatattacca 2881 taagtgggtg aaagtaaaat tcctcttctg aaaaaaaaaa aaaaaaaaaa aa // LOCUS HSU17743 1310 bp mRNA PRI 03-MAY-1995 DEFINITION Human JNK activating kinase (JNKK1) mRNA, complete cds. ACCESSION U17743 NID g791187 KEYWORDS JNKK; JNK-activating kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1310) AUTHORS Lin,A., Minden,A., Martinetto,H., Claret,F.X., Lange-Carter,C., Mercurio,F., Johnson,G.L. and Karin,M. TITLE Identification of a dual specificity kinase that activates the Jun kinases and p38-Mpk2 JOURNAL Science 268 (5208), 286-290 (1995) MEDLINE 95232504 REFERENCE 2 (bases 1 to 1310) AUTHORS Karin,M. TITLE Direct Submission JOURNAL Submitted (29-NOV-1994) Michael Karin, Pharmacology, University of California at San Diego, 9500 Gilman Dr., La Jolla, CA 92093, USA FEATURES Location/Qualifiers source 1..1310 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hJNKK1" gene 21..1220 /gene="JNKK1" CDS 21..1220 /gene="JNKK1" /standard_name="C-jun N-terminal kinase kinase 1" /codon_start=1 /product="JNK activating kinase" /db_xref="PID:g791188" /translation="MAAPSPSGGGGSGGGSGSGTPGPVGSPAPGHPAVSSMQGKRKAL KLNFANPPFKSTARFTLNPNPTGVQNPHIERLRTHSIESSGKLKISPEQHWDFTAEDL KDLGEIGRGAYGSVNKMVHKPSGQIMAVKRIRSTVDEKEQKQLLMDLDVVMRSSDCPY IVQFYGALFREGDCWICMELMSTSFDKFYKYVYSVLDDVIPEEILGKITLATVKALNH LKENLKIIHRDIKPSNILLDRSGNIKLCDFGISGQLVDSIAKTRDAGCRPYMAPERID PSASRQGYDVRSDVWSLGITLYELATGRFPYPKWNSVFDQLTQVVKGDPPQLSNSEER EFSPSFINFVNLCLTKDESKRPKYKELLKHPFILMYEERAVEVACYVCKILDQMPATP SSPMYVD" BASE COUNT 396 a 284 c 305 g 325 t ORIGIN 1 tgggctcttc actcccaaca atggcggctc cgagcccgag cggcggcggc ggctccgggg 61 gcggcagcgg cagcggcacc cccggccccg tagggtcccc ggcgccaggc cacccggccg 121 tcagcagcat gcagggtaaa cgcaaagcac tgaagttgaa ttttgcaaat ccacctttca 181 aatctacagc aaggtttact ctgaatccca atcctacagg agttcaaaac ccacacatag 241 agagactgag aacacacagc attgagtcat caggaaaact gaagatctcc cctgaacaac 301 actgggattt cactgcagag gacttgaaag accttggaga aattggacga ggagcttatg 361 gttctgtcaa caaaatggtc cacaaaccaa gtgggcaaat aatggcagtt aaaagaattc 421 ggtcaacagt ggatgaaaaa gaacaaaaac aacttcttat ggatttggat gtagtaatgc 481 ggagtagtga ttgcccatac attgttcagt tttatggtgc actcttcaga gagggtgact 541 gttggatctg tatggaactc atgtctacct cgtttgataa gttttacaaa tatgtatata 601 gtgtattaga tgatgttatt ccagaagaaa ttttaggcaa aatcacttta gcaactgtga 661 aagcactaaa ccacttaaaa gaaaacttga aaattattca cagagatatc aaaccttcca 721 atattcttct ggacagaagt ggaaatatta agctctgtga cttcggcatc agtggacagc 781 ttgtggactc tattgccaag acaagagatg ctggctgtag gccatacatg gcacctgaaa 841 gaatagaccc aagcgcatca cgacaaggat atgatgtccg ctctgatgtc tggagtttgg 901 ggatcacatt gtatgagttg gccacaggcc gatttcctta tccaaagtgg aatagtgtat 961 ttgatcaact aacacaagtc gtgaaaggag atcctccgca gctgagtaat tctgaggaaa 1021 gggaattctc cccgagtttc atcaactttg tcaacttgtg ccttacgaag gatgaatcca 1081 aaaggccaaa gtataaagag cttctgaaac atccctttat tttgatgtat gaagaacgtg 1141 ccgttgaggt cgcatgctat gtttgtaaaa tcctggatca aatgccagct actcccagct 1201 ctcccatgta tgtcgattga tatcgctgct acatcagact ctagaaaaaa gggctgagag 1261 gaagcaagac gtaaagaatt ttcatcccgt atcacagtgt ttttattgct // LOCUS HSU17894 2115 bp DNA PRI 02-MAR-1995 DEFINITION Human alpha(1,2)fucosyltransferase (FUT2) gene, complete cds. ACCESSION U17894 NID g687618 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2115) AUTHORS Lowe,J.B. TITLE Sequence and expression of a candidate for the human Secretor blood group alpha(1,2)fucosyltransferase gene (FUT2); homozygosity for an enzyme-inactivating nonsense mutation commonly correlates with the non-secretor phenotype JOURNAL J. Biol. Chem. (1995) In press REFERENCE 2 (bases 1 to 2115) AUTHORS Lowe,J.B. TITLE Direct Submission JOURNAL Submitted (30-NOV-1994) John B Lowe, Pathology, Howard Hughes Medical Institute, University of Michigan, MSRBI, Room 3510, 1150 West Medical Center Drive, Ann Arbor, MI 48109-0650, USA FEATURES Location/Qualifiers source 1..2115 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="19" /map="19q13.3" gene 64..1095 /gene="FUT2" CDS 64..1095 /gene="FUT2" /note="member of the secretor blood group" /codon_start=1 /product="alpha(1,2)fucosyltransferase" /db_xref="PID:g687619" /translation="MLVVQMPFSFPMAHFILFVFTVSTIFHVQQRLAKIQAMWELPVQ IPVLASTSKALGPSQLRGMWTINAIGRLGNQMGEYATLYALAKMNGRPAFIPAQMHST LAPIFRITLPVLHSATASRIPWQNYHLNDWMEEEYRHIPGEYVRFTGYPCSWTFYHHL RQEILQEFTLHDHVREEAQKFLRGLQVNGSRPGTFVGVHVRRGDYVHVMPKVWKGVVA DRRYLQQALDWFRARYSSLIFVVTSNGMAWCRENIDTSHGDVVFAGDGIEGSPAKDFA LLTQCNHTIMTIGTFGIWAAYLTGGDTIYLANYTLPDSPFLKIFKPEAAFLPEWTGIA ADLSPLLKH" BASE COUNT 505 a 606 c 552 g 452 t ORIGIN 1 ttcaccagcg ccccgggcct ccatctccca gctaacgtgt cccgttttcc tcccctgaca 61 gccatgctgg tcgttcagat gcctttctcc tttcccatgg cccacttcat cctctttgtc 121 tttacggttt ccactatatt tcacgttcag cagcggctag cgaagattca agccatgtgg 181 gagttaccgg tgcagatacc agtgctagcc tcaacatcaa aggcactggg acccagccag 241 ctcaggggga tgtggacgat caatgcaata ggccgcctgg ggaaccagat gggcgagtac 301 gccacactgt acgccctggc caagatgaac gggcggcccg ccttcatccc ggcccagatg 361 cacagcaccc tggcccccat cttcagaatc accctgccgg tgctgcacag cgccacggcc 421 agcaggatcc cctggcagaa ctaccacctg aacgactgga tggaggagga ataccgccac 481 atcccggggg agtacgtccg cttcaccggc tacccctgct cctggacctt ctaccaccac 541 ctccgccagg agatcctcca ggagttcacc ctgcacgacc acgtgcggga ggaggcccag 601 aagttcctgc ggggcctgca ggtgaacggg agccggccgg gcacctttgt aggggtccat 661 gttcgccgag gggactatgt ccatgtcatg ccaaaagtgt ggaagggggt ggtggccgac 721 cggcgatacc tacagcaggc cctggactgg ttccgagctc gctacagctc cctcatcttc 781 gtggtcacca gtaatggcat ggcctggtgt cgggagaaca ttgacacctc ccacggtgat 841 gtggtgtttg ctggcgatgg cattgagggc tcacctgcca aagattttgc tctactcaca 901 cagtgtaacc acaccatcat gaccattggg acgttcggga tctgggccgc atacctcacg 961 ggcggagaca ccatctacct ggccaattac accctccccg actccccttt cctcaaaatc 1021 tttaagccag aggcagcctt cctgccggag tggacaggga ttgccgcaga cctgtccccc 1081 ttactcaagc actaatgctg gcccgtcctt tgagaccttt tctccttctc tgcctccctc 1141 aagatgagtg cccgggcatg agaagcacat ggttccatga gcaggaccca tctctcttct 1201 gtgaagatgc gttgggctgc aagtaacaga aatctcagtg aacagtggcc tggcgtggtg 1261 gctcatgcct gtaatgctcg cactttggga ggccagggtg ggtggatcac ttgaggtcag 1321 gagttcaaga ctagcctggc caacatggtg aaaccccatc tcgactaaaa atacaaaaat 1381 tagccaggcg tggtggtgca cacttgtaat cccagctact cgggaggctg aggcaagaga 1441 atcacttgaa cccaggaggc ggaggttgca gtgagccaag atggtgccgc tgcactccag 1501 cctgggtgac acagcaagac tccatctcaa aaaaaaaaaa agaaaaagaa atgaacgggt 1561 tcaaagacca taatcatgca tatcacataa gaccagaagt ggcccaggtc cagggtcagt 1621 taatttagca gctccacaaa gtcatcagtc acctgagctc catccatctt cacatgctgt 1681 gctaccattt cttagctgta tcatcccatg gtcccaaaag ggctgctaca catccagcca 1741 tcacatgcag ataattcctt tcaaaaacag cagaaagagg ctcgttcttg tcttggtccc 1801 ttttgaagaa tgaatgaaac cttcctaagc cttccagcaa tttcccccca actccgatgg 1861 gtaggaattg tcacataccc atgtgacccg ataggaggca aaagaaatga gacttctggg 1921 attagtttag cctcagattc tgcagctgag aagttgatca gccacctctg aaggacatgc 1981 agcttgcaga aaattagggt ggtgttacca aggtgaaaag gggaaatggc tttagagtag 2041 acaacagaga tgccctgagg ggttgtgtag gttgttcact gcaggaagtc ccctggttaa 2101 gaaggcaagt ggggt // LOCUS HSU17970 3569 bp mRNA PRI 02-JUN-1995 DEFINITION Human heparan sulfate N-deacetylase/N-sulfotransferase mRNA, complete cds. ACCESSION U17970 NID g841163 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3269) AUTHORS LaBell,T.L., Milewicz,D.J., Bonadio,J., Edelhoff,S., Disteche,C.M. and Byers,P.H. TITLE Sequence, chromosome location, and expression of human fibroblast heparan sulfate N-deacetylase/N-sulfotransferase JOURNAL Unpublished REFERENCE 2 (bases 1 to 3569) AUTHORS LaBell,T.L. TITLE Direct Submission JOURNAL Submitted (01-DEC-1994) Terry L. LaBell, University of Washington, Pathology, Mail Stop SM-30, N.E. Pacific Ave, Seattle, WA 98195, USA FEATURES Location/Qualifiers source 1..3569 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="T1A-4/pBSII, P1A-12/pBSII, HHSS-1/pCRII, 9C1-C/pCRII" /clone_lib="Random Primed Human Fibroblast cDNA Library" /cell_line="A8496" /cell_type="fibroblast" /tissue_type="connective" /dev_stage="adult" 5'UTR <1..228 CDS 229..2877 /codon_start=1 /product="heparan sulfate N-deacetylase/N-sulfotransferase" /db_xref="PID:g841164" /translation="MPALACLRRLCRHVSPQAVLFLLFIFCLFSVFISAYYLYGWKRG LEPSADAPEPDCGDPAPVAPSRLLPLKPVQAATPSRTDPLVLVFVESLYSQLGQEVVA ILESSRFKYRTEIAPGKGDMPTLTDKGRGRFALIIYENILKYVNLDAWNRELLDKYCV AYGVGIIGFFKANENSLLSAQLKGFPLFLHSNLGLKDCSINPKSPLLYVTRPSEVEKG VLPGEDWTVFQSNHSTYEPVLLAKTRSSESIPHLGADAGLHAALHATVVQDLGLHDGI QRVLFGNNLNFWLHKLVFVDAVAFLTGKRLSLPLDRYILVDIDDIFVGKEGTRMKVED VKALFDTQNELRAHIPNFTFNLGYSGKFFHTGTNAEDAGDDLLLSYVKEFWWFPHMWS HMQPHLFHNQSVLAEQMALNKKFAVEHGIPTDMGYAVAPHHSGVYPVHVQLYEAWKQV WSIRVTSTEEYPHLKPARYRRGFIHNGIMVLPRQTCGLFTHTIFYNEYPGGSSELDKI INGGELFLTVLLNPISIFMTHLSNYGNDRLGLYTFKHLVRFLHSWTNLRLQTLPPVQL AQKYFQIFSEEKDPLWQDPCEDKRHKDIWSKEKTCDRFPKLLIIGPQKTGTTALYLFL GMHPDLSSNYPSSETFEEIQFFNGHNYHKGIDWYMEFFPIPSNTTSDFYFEKSANYFD SEVAPRGAAALLPKAKVLTILINPADRAYSWYQHQRAHDDPVALKYTFHEVITAGSDA SSRLRALQNRCLVPGWYATHIERWLSAYHANQILVLDGKLLRTEPAKVMDMVQKFLGV TNTIDYHKTLAFDPKKGFWCQLLEGGKTKCLGKSKGRKYPEMDLDSRAFLKDYYRDHN IELSKLLYKMGQTLPTWLREDLQNTR" 3'UTR 2878..>3569 BASE COUNT 715 a 1163 c 980 g 711 t ORIGIN 1 ggggcgcgga ggaaggaagg agcgtgacca gcctgtggac tgcgcccctg gctgggagga 61 aggactgggg gcccagatcc tccactccca gtgccccaca agggcgtcgc ttcctaagtc 121 tctgtgaatt tgttggtcag tggacgattc tcgtgtctcc tcctgtgtgg ggccttgggg 181 tagccagggc aggccgggcc tccggtggcc aaggtctcgg aggccaggat gcctgccctg 241 gcatgcctcc ggaggctgtg tcggcacgtg tccccgcagg ctgtcctttt cctgctgttc 301 atcttctgcc tgttcagcgt tttcatctcg gcctactacc tatatggctg gaagcgaggc 361 ctggagccct cggcggatgc ccccgagcct gactgcgggg accccgcgcc tgtggccccc 421 agtcgcctgc tgccactcaa gcctgtgcag gcagccaccc cttcccgcac agacccgttg 481 gtgctggtct ttgtggagag cctctactcg caactgggcc aggaggtggt ggccatcctg 541 gagtccagcc gcttcaaata ccgcacagag attgcgccgg gcaagggtga catgcccacg 601 ctcactgaca agggccgtgg ccgcttcgcc ctcatcatct atgagaacat cctcaagtat 661 gtcaacctgg acgcctggaa ccgggagctg ctggacaagt actgtgtggc ctacggcgtg 721 ggcatcattg gcttcttcaa ggccaatgag aacagcctgc tgagtgcgca gctcaagggc 781 ttccccctgt tcctgcactc aaacctgggc ctgaaggact gcagcatcaa ccccaagtcc 841 ccgctgctct acgtgacgcg acctagcgag gtggagaaag gtgtgctccc cggcgaggac 901 tggacggtgt tccagtcaaa tcactccacc tatgagccag tgctgctggc caagacgcgc 961 tcgtctgagt ccatcccaca cctgggcgca gacgccggcc tgcatgctgc actgcacgcc 1021 actgtggtcc aggacctggg cctgcacgac ggcatccagc gcgtgctgtt tggcaacaac 1081 ctgaacttct ggctgcacaa gcttgtcttc gtggatgccg tggccttcct cacggggaag 1141 cgcctctccc tgccattgga ccgctacatc ctggtggaca ttgatgacat cttcgtgggc 1201 aaggagggca cacgcatgaa ggtggaggac gtgaaggccc tgtttgacac acagaacgaa 1261 ctacgcgcac acatcccaaa cttcaccttc aacctgggct actcagggaa attcttccac 1321 acaggtacca atgctgagga cgctggggat gatctgctgc tgtcgtatgt gaaggagttc 1381 tggtggttcc cccacatgtg gagccacatg cagccccacc ttttccacaa ccagtccgtg 1441 ttggccgagc agatggcctt gaacaagaag ttcgctgtcg agcatggcat tcccacagac 1501 atggggtatg cagtggcgcc ccaccactcg ggcgtgtacc ccgtgcacgt gcagctgtac 1561 gaggcttgga agcaggtgtg gagcatccgc gtgaccagca cggaggagta cccccacctg 1621 aagccagccc gctaccgccg tggcttcatc cacaatggca tcatggttct cccacggcag 1681 acctgcggcc tcttcacaca caccatcttc tacaacgagt accctggcgg ctccagtgag 1741 ctggacaaga tcatcaacgg gggcgagctc ttcctcaccg tgctcctcaa tcctatcagc 1801 atcttcatga cgcacctgtc caactatggg aatgaccgcc tgggcctgta caccttcaag 1861 cacctggtgc gcttcctgca ctcctggacc aacctccggc tgcagacact gccccctgtg 1921 cagttggcgc agaagtactt ccagatcttc tccgaggaga aggacccgct ctggcaggac 1981 ccctgcgagg acaaacgtca caaagacatc tggtccaagg agaagacgtg tgaccgcttc 2041 ccaaagctcc tcatcatcgg cccccagaaa acaggcacca ctgccctcta cctgttcctg 2101 ggcatgcacc ctgacctaag cagcaactac cccagctctg agacctttga ggagatccag 2161 ttttttaatg gccacaacta tcacaaaggc atcgactggt acatggagtt cttccccatc 2221 ccttccaaca ccacctccga cttctacttt gagaaaagcg ccaactactt tgattcagaa 2281 gtggcgcccc gcggggcagc agccctcttg cccaaagcca aggtcctgac catcctcatc 2341 aaccccgcgg accgggccta ttcctggtac cagcaccagc gagcccatga cgacccagtg 2401 gccctaaagt acaccttcca tgaggtgatt accgccggct ctgacgcatc ctcgaggctg 2461 cgtgccctcc agaaccgctg cctggtccct ggctggtacg ccacccacat cgagcgctgg 2521 ctcagtgcct atcacgccaa ccagatcctg gtcttggatg gcaaactgct tcgcacagaa 2581 cctgccaaag tgatggacat ggtgcagaag ttccttgggg tgaccaacac cattgactac 2641 cacaaaacct tggcgtttga tccaaagaaa ggattttggt gccaactgct tgaaggagga 2701 aaaaccaagt gtctgggcaa aagcaagggc cggaaatatc ccgagatgga cttggattcc 2761 cgagccttcc tgaaggacta ttaccgggac cacaacatcg agctctccaa gctgctgtat 2821 aagatgggcc agacacttcc cacttggcta cgagaggacc tccagaacac caggtagccg 2881 tggccaccac agccagactg aacgtttgtg aaagctggga catcccacca cacgctgagc 2941 cagacctgca gagtgggaag ctggaccagg gcagctgcgc acttatgagc aatactctgt 3001 ggaggtctgg tggggctggg ggagcaccca ggcggatctg caagcacctc ggagcaccca 3061 ccgctgggtc tgcggcctaa gggacctccc tcgccagcag aggtccattc cgttcccagc 3121 tgctcctggg gaggccgctt cctggtagga gggagtccac gagactcttt tctgtccctc 3181 actgtgttcc gccgactgtc ccctctcgtc acccatcact ccctgcttcc gcaggcgccc 3241 ctcagtattc gctgccatat gtccctgtcc tccaggctgt aggggaggag agcctggccg 3301 ggggagacag actggacatt tccctgtttc gagccaggct cttccaaggg gccagctggg 3361 tccccggagt cagtcctagg ctggatggga gggtggcccc ctcaagagga ctcccagcct 3421 ccacatctgg ttcctacctt cacatctcac cctcccgttc tggggaagaa tttctggttc 3481 ctacagtatc cactccatcc tcaaggcttc ccgcagggcc ttggggcact gccttgccat 3541 cgggcccagt tctccgggcc ccacctgca // LOCUS HSU18242 1350 bp mRNA PRI 10-JAN-1995 DEFINITION Human calcium modulating cyclophilin ligand (CAMLG) mRNA, complete cds. ACCESSION U18242 NID g619669 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1350) AUTHORS Bram,R.J. and Crabtree,G.R. TITLE Calcium signalling in T cells stimulated by a cyclophilin B-binding protein JOURNAL Nature 371 (6495), 355-358 (1994) MEDLINE 94376879 REFERENCE 2 (bases 1 to 1350) AUTHORS Bram,R.J. TITLE Direct Submission JOURNAL Submitted (06-DEC-1994) Richard J. Bram, Experimental Oncology, St. Jude Children's Research Hospital, 332 N. Lauderdale, Memphis, TN 38105, USA FEATURES Location/Qualifiers source 1..1350 /organism="Homo sapiens" /note="the mRNA is expressed in all tissues with highest levels in brain, testis, and ovary" /db_xref="taxon:9606" /cell_type="B lymphocyte" gene 37..927 /gene="CAMLG" CDS 37..927 /gene="CAMLG" /note="integral membrane protein" /codon_start=1 /function="activates calcium influx in Jurkat T cells when overexpressed" /product="calcium modulating cyclophilin ligand" /db_xref="PID:g619670" /translation="MESMAVATDGGERPGVPAGSGLSASQRRAELRRRKLLMNSEQRI NRIMGFHRPGSGAEEESQTKSKQQDSDKLNSLSVPSVSKRVVLGDSVSTGTTDQQGGV AEVKGTQLGDKLDSFIKPPECSSDVNLELRQRNRGDLTADSVQRGSRHGLEQYLSRFE EAMKLRKQLISEKPSQEDGNTTEEFDSFRIFRLVGCALLALGVRAFVCKYLSIFAPFL TLQLAYMGLYKYFPKSEKKIKTTVLTAALLLSGIPAEVINRSMDTYSKMGEVFTDLCV YFFTFIFCHELLDYWGSEVP" BASE COUNT 376 a 280 c 332 g 362 t ORIGIN 1 cgccactgcc acccctccca gactgtggac gggaggatgg agtcgatggc cgtcgctacc 61 gacggcgggg agaggccggg ggtcccagcg ggctcaggtc tgtcggcttc ccagcgtcgg 121 gcggagctgc gtcggagaaa gctgctcatg aactcggaac agcgcatcaa ccggatcatg 181 ggctttcaca ggcccgggag cggcgcggaa gaagaaagtc aaacaaaatc aaagcagcag 241 gacagtgata aactgaactc cctcagcgtt ccttccgttt caaagcgagt agtgctgggt 301 gattcagtca gtacaggaac aactgaccag cagggtggtg tggccgaggt aaaggggacc 361 caactgggag acaaattgga ctcgttcatt aaaccacctg agtgcagtag tgatgtcaac 421 cttgagctcc ggcagcggaa cagaggggac ctgacagcgg actcggtcca gaggggttcc 481 cgccatggcc tagagcagta cctttccaga ttcgaagaag caatgaagct aaggaaacag 541 ctgattagtg aaaaacccag tcaagaggat ggaaatacaa cagaagaatt tgactctttt 601 cgaatattta gattggtggg atgtgctctt cttgctcttg gagtcagagc ttttgtttgc 661 aaatacttgt ccatatttgc tccatttctt actttacaac ttgcgtacat gggattatac 721 aaatattttc ccaagagtga aaagaagata aagacaacag tactaacagc tgcacttcta 781 ttgtcgggaa ttcctgccga agtgataaat cgatcaatgg atacctatag caaaatgggc 841 gaagtcttca cagatctctg tgtctacttt ttcactttta tcttttgtca tgaactgctt 901 gattattggg gctctgaagt accatgaagc ctgtagaact gagaaggaga agcttacgaa 961 aaaaatcctc ttctatattg cagtgtctct aaaggaggca aattggttta caccttcatg 1021 taattctttt actttagggg ttgtaaagct actttattag atatagaatg gcagattctc 1081 tgatttaaaa gggctgagtt tgtattatta ctgatatgaa gaatagagta ccaatgtcat 1141 taattgattt ttcttgttaa tcagaattcc tattctgtac ctttcctcta acttctcaga 1201 tttgtaattc ttcttttcgg gagctgagct agtgctttta ggagaacaga taaatgtggt 1261 ctcagccagc cctagagact gcttcttgtg tttgtgtcat tctgtcctga gaaatgaagt 1321 catctgaaaa ataaaaatgc agaaacccaa // LOCUS HSU18244 1719 bp mRNA PRI 13-SEP-1995 DEFINITION Human excitatory amino acid transporter 4 mRNA, complete cds. ACCESSION U18244 NID g930336 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1719) AUTHORS Fairman,W.A., Vandenberg,R.J., Arriza,J.L., Kavanaugh,M.P. and Amara,S.G. TITLE An excitatory amino-acid transporter with properties of a ligand-gated chloride channel JOURNAL Nature 375 (6532), 599-603 (1995) MEDLINE 95312081 REFERENCE 2 (bases 1 to 1719) AUTHORS Fairman,W.A. TITLE Direct Submission JOURNAL Submitted (07-DEC-1994) Wendy A. Fairman, The Vollum Institute, Oregon Health Sciences University, 3181 SW Sam Jackson Park Road, Portland, OR 97201, USA FEATURES Location/Qualifiers source 1..1719 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain: motor cortex" 5'UTR 1..8 CDS 9..1703 /codon_start=1 /product="excitatory amino acid transporter 4" /db_xref="PID:g930337" /translation="MSSHGNSLFLRESGQRLGRVGWLQRLQESLQQRALRTRLRLQTM TLEHVLRFLRRNAFILLTVSAVVIGVSLAFALRPYQLTYRQIKYFSFPGELLMRMLQM LVLPLIVSSLVTGMASLDNKATGRMGMRAAVYYMVTTIIAVFIGILMVTIIHPGKGSK EGLHREGRIETIPTADAFMDLIRNMFPPNLVEACFKQFKTQYSTRVVTRTMVRTENGS EPGASMPPPFSVENGTSFLENVTRALGTLQEMLSFEETVPVPGSANGINALGLVVFSV AFGLVIGGMKHKGRVLRDFFDSLNEAIMRLVGIIIWYAPVGILFLIAGKILEMEDMAV LGGQLGMYTLTVIVGLFLHAGIVLPLIYFLVTHRNPFPFIGGMLQALITAMGTSSSSA TLPITFRCLEEGLGVDRRITRFVLPVGATVNMDGTALYEALAAIFIAQVNNYELNLGQ ITTISITATAASVGAAGIPQAGLVTMVIVLTSVGLPTEDITLIIAVDWFLDRLRTMTN VLGDSIGAAVIEHLSQRELELQEAELTLPSLGKPYKSLMAQEKGASRGRGGNESAM" 3'UTR 1704..1719 BASE COUNT 328 a 524 c 506 g 361 t ORIGIN 1 gatagaccat gagcagccat ggcaacagcc tgttccttcg ggagagcggc cagcggctgg 61 gccgggtggg ctggctgcag cggctgcagg aaagcctgca gcagagagca ctgcgcacgc 121 gcctgcgcct gcagaccatg accctcgagc acgtgctgcg cttcctgcgc cgaaacgcct 181 tcattctgct gacggtcagc gccgtggtca ttggggtcag cctggccttt gccctgcgcc 241 catatcagct cacctaccgc cagatcaagt acttctcttt tcctggagag cttctgatga 301 ggatgctgca gatgctggtg ttacctctca ttgtctccag cctggtcaca ggtatggcat 361 ccctggacaa caaggccacg gggcggatgg ggatgcgggc agctgtgtac tacatggtga 421 ccaccatcat cgcggtcttc atcggcatcc tcatggtcac catcatccat cccgggaagg 481 gctccaagga ggggctgcac cgggagggcc ggatcgagac catccccaca gctgatgcct 541 tcatggacct gatcagaaat atgtttccac caaaccttgt ggaggcctgc ttcaaacagt 601 tcaagacgca gtacagcacg agggtggtaa ccaggaccat ggtgaggaca gagaacgggt 661 ctgagccggg tgcctccatg cctcctccat tctcagtgga gaacggaacc agcttcctgg 721 aaaatgtcac tcgggccttg ggtaccctgc aggagatgct gagctttgag gagactgtac 781 ccgtgcctgg ctccgccaat ggcatcaacg ccctgggcct cgtggtcttc tctgtggcct 841 ttgggctggt cattggtggc atgaaacaca agggcagagt cctcagggac ttcttcgaca 901 gcctcaatga ggctattatg aggctggtgg gcatcattat ctggtatgca cctgtgggca 961 tcctgttcct gattgctggg aagattctgg agatggaaga catggccgtc ctggggggtc 1021 agctgggcat gtacaccctg accgtcatcg tgggcctgtt cctccatgcc ggcattgtcc 1081 ttcccctcat ctacttcctc gtcactcacc ggaacccctt ccccttcatt gggggcatgc 1141 tacaagccct catcaccgct atgggcacgt cttccagctc ggcaacgctg cccatcacct 1201 tccgctgcct ggaggagggc ctgggtgtgg accgccgcat caccaggttc gtcctgcccg 1261 tgggcgccac ggtcaacatg gatggcactg ccctctacga ggccctggct gccatcttca 1321 ttgctcaagt taacaactac gagctcaacc tgggtcagat cacaaccatc agcatcacgg 1381 ccacagcagc cagtgttggg gctgctggca tcccccaggc gggtctggtc accatggtca 1441 ttgtgcttac gtcggtcggc ttgcccacgg aagacatcac gctcatcatc gccgtggact 1501 ggttccttga ccggcttcgc acaatgacca acgtactggg ggactcaatt ggagcggccg 1561 tcatcgagca cttgtctcag cgggagctgg agcttcagga agctgagctt accctcccca 1621 gcctggggaa accctacaag tccctcatgg cacaggagaa gggggcatcc cggggacggg 1681 gaggcaacga gagtgctatg tgaggggcct ccagctctg // LOCUS HSU18291 2034 bp mRNA PRI 14-SEP-1995 DEFINITION Human CDC16Hs mRNA, complete cds. ACCESSION U18291 NID g603230 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2034) AUTHORS Tugendreich,S., Tomkiel,J., Earnshaw,W. and Hieter,P. TITLE CDC27Hs colocalizes with CDC16Hs to the centrosome and mitotic spindle and is essential for the metaphase to anaphase transition JOURNAL Cell 81 (2), 261-268 (1995) MEDLINE 95254635 REFERENCE 2 (bases 1 to 2034) AUTHORS Tugendreich,S. TITLE Direct Submission JOURNAL Submitted (07-DEC-1994) Stuart Tugendreich, Molecular Biology and Genetics, Johns Hopkins University, 617 Hunterian Bldg, 725 North Wolfe Street, Baltimore, MD 21205, USA FEATURES Location/Qualifiers source 1..2034 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pSTU65" /tissue_type="brain" 5'UTR 1..24 CDS 25..1884 /codon_start=1 /product="CDC16Hs" /db_xref="PID:g603231" /translation="MNLERLRKRVRQYLDQQQYQSALFWADKVASLSREPQDIYWLAQ CLYLTAQYHRAAHALRSRKLDKLYEACRYLAARCHYAAKEHQQALDVLDMEEPINKRL FEKYLKDESGFKDPSSDWEMSQSSIKSSICLLRGQIYDALDNRTLATYSYKEALKLDV YCFEAFDLLTSHHMLTAQEEKELLESLPLSKLCNEEQELLRFLFENKLKKYNKPSETV IPESVDGLQENLDVVVSLAERHYYNCDFKMCYKLTSVVMEKDPFHASCLPVHIGTLVE LNKANELFYLSHKLVDLYPSNPVSWFAVGCYYLMVGHKNEHARRYLSKATTLEKTYGP AWIAYGHSFAVESEHDQAMAAYFTAAQLMKGCHLPMLYIGLEYGLTNNSKLAERFFSQ ALSIAPEDPFVMHEVGVVAFQNGEWKTAEKWFLDALEKIKAIGNEVTVDKWEPLLNNL GHVCRKLKKYAEALDYHRQALVLIPQNASTYSAIGYIHSLMGNFENAVDYFHTALGLR RDDTFSVTMLGHCIEMYIGDSEAYIGADIKDKLKCYDFDVHTMKTLKNIISPPWDFRE FEVEKQTAEETGLTPLETSRKTPDSRPSLEETFEIEMNESDMMLETSMSDHST" 3'UTR 1885..2034 polyA_signal 2015..2020 BASE COUNT 620 a 417 c 455 g 542 t ORIGIN 1 gcgtgaggcc gggcccgcgc cgccatgaac ctagagcggc tgcggaagcg cgtccggcag 61 tacctcgacc agcaacagta tcaaagtgct ctattttggg cagataaagt agcttcactc 121 tctcgtgaac cccaggacat ctattggttg gctcagtgtc tttacctgac agcacaatat 181 cacagagccg cccatgcact tcggtcacga aaactggaca aattgtatga agcatgtcgt 241 taccttgcag ctaggtgcca ttatgctgca aaagagcacc agcaggccct tgatgttctt 301 gacatggaag agcccatcaa taaaagatta tttgaaaaat acttgaagga tgaaagtggc 361 ttcaaagatc cctccagcga ctgggaaatg tcacagtctt caataaagag ttctatttgt 421 cttctacgcg ggcaaatcta tgatgctcta gataaccgaa ccctggctac ctacagctac 481 aaagaagctt tgaagcttga tgtctactgt tttgaagcgt tcgatctttt aacatcacat 541 cacatgctga cagcacaaga agaaaaagaa cttcttgaat cactacccct tagcaagctg 601 tgtaatgaag aacaggaatt gctgcgtttt ctatttgaga acaaattgaa aaaatataat 661 aagcctagtg aaacggtcat ccctgaatct gtagatggct tgcaagagaa tctggatgtg 721 gtagtgtctt tagctgagag acattattat aactgtgatt ttaaaatgtg ctacaagctt 781 acttctgtag taatggagaa agatcctttc catgcaagtt gtttacctgt acatataggg 841 acgcttgtag agctgaataa agccaatgaa cttttctatc tttctcataa actggtggat 901 ttatatccta gtaatcctgt gtcttggttt gcagtgggat gttactatct catggtcggt 961 cataaaaatg aacatgccag aagatatctc agcaaagcca caacacttga gaaaacctat 1021 ggacctgcat ggatagccta tggacattca tttgcggtgg agagtgagca cgaccaagcg 1081 atggctgctt acttcacagc agcacagctg atgaaagggt gtcatttgcc tatgctgtat 1141 attggattag aatatggttt gaccaataac tcaaaactag ctgaaaggtt cttcagccaa 1201 gctctgagca ttgcaccgga agaccctttt gttatgcatg aggtcggcgt ggttgcattt 1261 cagaatggag aatggaaaac agccgaaaaa tggtttcttg atgctttgga aaaaattaaa 1321 gcaattggga acgaggtaac agttgacaaa tgggaacctt tgttgaacaa cttggggcat 1381 gtctgcagaa aacttaaaaa gtatgctgag gccttggatt accaccgtca ggcactggtg 1441 ttgattcctc agaacgcatc cacctactct gctattggat atatccacag tctgatgggc 1501 aactttgaaa atgctgtgga ctacttccac acagcccttg gtcttaggcg agatgataca 1561 ttttctgtta caatgcttgg tcattgcatc gaaatgtaca ttggtgattc tgaagcttat 1621 attggagcag acattaaaga caaattaaaa tgttatgact ttgatgtgca tacaatgaag 1681 acactaaaaa acattatttc acctccgtgg gatttcaggg aatttgaagt agaaaaacag 1741 actgcagaag aaacggggct tacgccattg gaaacctcaa ggaaaactcc agattccaga 1801 ccttccttgg aagaaacctt tgaaattgaa atgaatgaaa gtgacatgat gttagagaca 1861 tctatgtcag accacagcac gtgactccag tcagtggtcc tggtcccact gtcccagtgt 1921 aggaacagag acccgcctta agagactgga tcgcacacct ttgcaacaga tgtgttctga 1981 ttctctgaac ctacaaaata gttatacata gtggaataaa gaaggtaaac catc // LOCUS HSU18297 1910 bp mRNA PRI 14-DEC-1995 DEFINITION Human MST1 (MST1) mRNA, complete cds. ACCESSION U18297 NID g1117790 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1910) AUTHORS Creasy,C.L. and Chernoff,J. TITLE Cloning and characterization of a human protein kinase with homology to Ste20 JOURNAL J. Biol. Chem. 270 (37), 21695-21700 (1995) MEDLINE 95394929 REFERENCE 2 (bases 1 to 1910) AUTHORS Creasy,C.L. TITLE Direct Submission JOURNAL Submitted (08-DEC-1994) Caretha L. Creasy, Fox Chase Cancer Center, 7701 Burholme Ave., Philadelphia, PA 19111, USA FEATURES Location/Qualifiers source 1..1910 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pYes-33" /clone_lib="lambda-Yes-R, S. Elledge" /cell_type="lymphocyte" gene 20..1483 /gene="MST1" CDS 20..1483 /gene="MST1" /note="protein kinase; similar to the product encoded by GenBank Accession Number L04655" /codon_start=1 /product="MST1" /db_xref="PID:g1117791" /translation="METVQLRNPPRRQLKKLDEDSLTKQPEEVFDVLEKLGEGSYGSV YKAIHKETGQIVAIKQVPVESDLQEIIKEISIMQQCDSPHVVKYYGSYFKNTDLWIVM EYCGAGSVSDIIRLRNKTLTEDEIATILQSTLKGLEYLHFMRKIHRDIKAGNILLNTE GHAKLADFGVAGQLTDTMAKRNTVIGTPFWMAPEVIQEIGYNCVADIWSLGITAIEMA EGKRPYADIHPMRAIFMIPTNPPPTFRKPELWSDNFTDFVKQCLVKSPEQRATATQLL QHPFVRSAKGVSILRDLINEAMDVKLKRQESQQREMDQDDEENSEEDEMDSGTMVRAV GDEMGTVRVASTMTDGANTMIEHDDTLPSQLGTMVINAEDEEEEGTMKRRDETMQPAK PSFLEYFEQKEKENQINSFGKSVPGPLKNSSDWKIPQDGDYEFLKSWTVEDLQKRLLA LDPMMEQEIEEIRQKYQSKRQPILDAIEAKKRRQQNF" BASE COUNT 594 a 387 c 467 g 462 t ORIGIN 1 ccggctgctg gcatcggcca tggagacggt acagctgagg aacccgccgc gccggcagct 61 gaaaaagttg gatgaagata gtttaaccaa acaaccagaa gaagtatttg atgtcttaga 121 gaaacttgga gaagggtcct atggcagcgt atacaaagct attcataaag agaccggcca 181 gattgttgct attaagcaag ttcctgtgga atcagacctc caggagataa tcaaagaaat 241 ctctataatg cagcaatgtg acagccctca tgtagtcaaa tattatggca gttattttaa 301 gaacacagac ttatggatcg ttatggagta ctgtggggct ggttctgtat ctgatatcat 361 tcgattacga aataaaacgt taacagaaga tgaaatagct acaatattac aatcaactct 421 taagggactt gaataccttc attttatgag aaaaatacac cgagatatca aggcaggaaa 481 tattttgcta aatacagaag gacatgcaaa acttgcagat tttggggtag caggtcaact 541 tacagatacc atggccaagc ggaatacagt gataggaaca ccattttgga tggctccaga 601 agtgattcag gaaattggat acaactgtgt agcagacatc tggtccctgg gaataactgc 661 catagaaatg gctgaaggaa agcgccctta tgctgatatc catccaatga gggcaatctt 721 catgattcct acaaatcctc ctcccacatt ccgaaaacca gagctatggt cagataactt 781 tacagatttt gtgaaacagt gtcttgtaaa gagccctgag cagagggcca cagccactca 841 gctcctgcag cacccatttg tcaggagtgc caaaggagtg tcaatactgc gagacttaat 901 taatgaagcc atggatgtga aactgaaacg ccaggaatcc cagcagcggg aaatggacca 961 ggacgatgaa gaaaactcag aagaggatga aatggattct ggcacgatgg ttcgagcagt 1021 gggtgatgag atgggcactg tccgagtagc cagcaccatg actgatggag ccaatactat 1081 gattgagcac gatgacacgt tgccatcaca actgggcacc atggtgatca atgcagagga 1141 tgaggaagag gaaggaacta tgaaaagaag ggatgagacc atgcagcctg cgaaaccatc 1201 ctttcttgaa tattttgaac aaaaagaaaa ggaaaaccag atcaacagct ttggcaagag 1261 tgtacctggt ccactgaaaa attcttcaga ttggaaaata ccacaggatg gagactacga 1321 gtttcttaag agttggacag tggaggacct tcagaagagg ctcttggccc tggaccccat 1381 gatggagcag gagattgaag agatccggca gaagtaccag tccaagcggc agcccatcct 1441 ggatgccata gaggctaaga agagacggca acaaaacttc tgagcaaggc caggctgtga 1501 gggccccagc tccacccagg ctttgggtga attctggatg gcttgctcat gtttgttagc 1561 cagcaccttc tgctctgtcg tctctccaca gcacctttgt gaactcagga atgtgcgcca 1621 gtgggaaggg ctctcttgac agtcagcgtg ccatcttgat gtgtgtatgt acattggtca 1681 ggtatattat ctcaaaggat ttatattggg cgacttttaa ctcagagttt taaaccccag 1741 gaacagagac tcctagttga gtgatagctg ggaaagtttt acattgtctg tttttcttct 1801 cccaatagct ttcaattgtt ctttctggaa gacttttaaa aaaatataaa tatgcatata 1861 tatatataaa ttataaatag attccccacg caggttggtg gcatctctgt // LOCUS HSU18299 4193 bp mRNA PRI 20-JAN-1996 DEFINITION Human damage-specific DNA binding protein DDBa p127 subunit (DDB1) mRNA, complete cds. ACCESSION U18299 NID g1052864 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4193) AUTHORS Dualan,R., Brody,T., Keeney,S., Nichols,A.F., Admon,A. and Linn,S. TITLE Chromosomal localization and cDNA cloning of the genes (DDB1 and DDB2) for the p127 and p48 subunits of a human damage-specific DNA binding protein JOURNAL Genomics 29 (1), 62-69 (1995) MEDLINE 96079092 REFERENCE 2 (bases 1 to 4193) AUTHORS Nichols,A.F. and Linn,S. TITLE Direct Submission JOURNAL Submitted (08-DEC-1994) Stuart Linn, University of California, Berkeley, 401 Barker Hall, Berkeley, CA 94720-3202, USA FEATURES Location/Qualifiers source 1..4193 /organism="Homo sapiens" /isolate="skin fibroblast CRL 1262; lung fibroblast IMR-90" /note="same sequence in both tissues" /db_xref="taxon:9606" /chromosome="11" /map="11q12-11q13.2" /cell_line="IMR-90" /cell_type="fibroblast" /tissue_type="epidermal; fetal lung" /dev_stage="fetal" gene 101..3523 /gene="DDB1" CDS 101..3523 /gene="DDB1" /note="damage-specific DNA binding protein p127 subunit; implicated in Xeroderma pigmentosum group E" /codon_start=1 /evidence=experimental /product="DDBa p127" /db_xref="PID:g1052865" /translation="MSYNYVVTAQKPTAVNGCVTGHFTSAEDLNLLIAKNTRLEIYVV TAEGLRPVKEVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIIT RAHGNVQDRIGRPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLE ELHVIDVKFLYGCQAPTICFVYQDPQGRHVKTYEVSLREKEFNKGPWKQENVEAEASM VIAVPEPFGGAIIIGQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDM EGRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLV KLNVDSNEQGSYVVAMETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRN GIGIHEHASIDLPGIKGLWPLRSDPNRETDDTLVLSFVGQTRVLMLNGEEVEETELMG FVDDQQTFFCGNVAHQQLIQITSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQV VVAVGRALYYLQIHPQELRQISHTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISA RILKLPSFELLHKEMLGGEIIPRSILMTTFESSHYLLCALGDGALFYFGLNIETGLLS DRKKVTLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPL NSDGYPDSLALANNSTLTIGTIDEIQKLHIRTVPLYESPRKICYQEVSQCFGVLSSRI EVQDTSGGTTALRPSASTQALSSSVSSSKLFSSSTAPHETSFGEEVEVHNLLIIDQHT FEVLHAHQFLQNEYALSLVSCKLGKDPNTYFIVGTAMVYPEEAEPKQGRIVVFQYSDG KLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTTEKELRTECNHYNNIMALYL KTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLGAENAF NLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGSVLF GTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTEPATGF IDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKVVEELTRIH" polyA_signal 4184..4189 BASE COUNT 979 a 1057 c 1127 g 1030 t ORIGIN 1 gtggagttcg ctgcggctgt tgggggccac ctgtcttttc gcttgtgccc ctctttctag 61 tgtcgcgctc gagtcccgac gggccgctcc aagcctcgac atgtcgtaca actacgtggt 121 aacggcccag aagcccaccg ccgtgaacgg ctgcgtgacc ggacacttta cttcggccga 181 agacttaaac ctgttgattg ccaaaaacac gagattagag atctatgtgg tcaccgccga 241 ggggcttcgg cccgtcaaag aggtgggcat gtatgggaag attgcggtca tggagctttt 301 caggcccaag ggggagagca aggacctgct gtttatcttg acagcgaagt acaatgcctg 361 catcctggag tataaacaga gtggcgagag cattgacatc attacgcgag cccatggcaa 421 tgtccaggac cgcattggcc gcccctcaga gaccggcatt attggcatca ttgaccctga 481 gtgccggatg attggcctgc gtctctatga tggccttttc aaggttattc cactagatcg 541 cgataataaa gaactcaagg ccttcaacat ccgcctggag gagctgcatg tcattgatgt 601 caagttccta tatggttgcc aagcacctac tatttgcttt gtctaccagg accctcaggg 661 gcggcacgta aaaacctatg aggtgtctct ccgagaaaag gaattcaata agggcccttg 721 gaaacaggaa aatgtcgaag ctgaagcttc catggtgatc gcagtcccag agccctttgg 781 gggggccatc atcattggac aggagtcaat cacctatcac aatggtgaca aatacctggc 841 tattgcccct cctatcatca agcaaagcac gattgtgtgc cacaatcgag tggaccctaa 901 tggctcaaga tacctgctgg gagacatgga aggccggctc ttcatgctgc ttttggagaa 961 ggaggaacag atggatggca ccgtcactct caaggatctc cgtgtagaac tccttggaga 1021 gacctctatt gctgagtgct tgacatacct tgataatggt gttgtgtttg tcgggtctcg 1081 cctgggtgac tcccagcttg tgaagctcaa cgttgacagt aatgaacaag gctcctatgt 1141 agtggccatg gaaaccttta ccaacttagg acccattgtc gatatgtgcg tggtggacct 1201 ggagaggcag gggcaggggc agctggtcac ttgctctggg gctttcaagg aaggttcttt 1261 gcggatcatc cggaatggaa ttggaatcca cgagcatgcc agcattgact taccaggcat 1321 caaaggatta tggccactgc ggtctgaccc taatcgtgag actgatgaca ctttggtgct 1381 ctcttttgtg ggccagacaa gagttctcat gttaaatgga gaggaggtag aagaaaccga 1441 actgatgggt ttcgtggatg atcagcagac tttcttctgt ggcaacgtgg ctcatcagca 1501 gcttatccag atcacttcag catcggtgag gttggtctct caagaaccca aagctctggt 1561 cagtgaatgg aaggagcctc aggccaagaa catcagtgtg gcctcctgca atagcagcca 1621 ggtggtggtg gctgtaggca gggccctcta ctatctgcag atccatcctc aggagctccg 1681 gcagatcagc cacacagaga tggaacatga agtggcttgc ttggacatca ccccattagg 1741 agacagcaat ggactgtccc ctctttgtgc cattggcctc tggacggaca tctcggctcg 1801 tatcttgaag ttgccctctt ttgaactact gcacaaggag atgctgggtg gagagatcat 1861 tcctcgctcc atcctgatga ccacctttga gagtagccat tacctccttt gtgccttggg 1921 agatggagcg cttttctact ttgggctcaa cattgagaca ggtctgttga gcgaccgtaa 1981 gaaggtgact ttgggcaccc agcccaccgt attgaggact tttcgttctc tttctaccac 2041 caacgtcttt gcttgttctg accgccccac tgtcatctat agcagcaacc acaaattggt 2101 cttctcaaat gtcaacctca aggaagtgaa ctacatgtgt cccctcaatt cagatggcta 2161 tcctgacagc ctggcgctgg ccaacaatag caccctcacc attggcacca tcgatgagat 2221 ccagaagctg cacattcgca cagttcccct ctatgagtct ccaaggaaga tctgctacca 2281 ggaagtgtcc cagtgtttcg gggtcctctc cagccgcatt gaagtccaag acacgagtgg 2341 gggcacgaca gccttgaggc ccagcgctag cacccaggct ctgtccagca gtgtaagctc 2401 cagcaagctg ttctccagca gcactgctcc tcatgagacc tcctttggag aagaggtgga 2461 ggtgcacaac ctacttatca ttgaccaaca cacctttgaa gtgcttcatg cccaccagtt 2521 tctgcagaat gaatatgccc tcagtctggt ttcctgcaag ctgggcaaag accccaacac 2581 ttacttcatt gtgggcacag caatggtgta tcctgaagag gcagagccca agcagggtcg 2641 cattgtggtc tttcagtatt cggatggaaa actacagact gtggctgaaa aggaagtgaa 2701 aggggccgtg tactctatgg tggaatttaa cgggaagctg ttagccagca tcaatagcac 2761 ggtgcggctc tatgagtgga caacagagaa ggagctgcgc actgagtgca accactacaa 2821 caacatcatg gccctctacc tgaagaccaa gggcgacttc atcctggtgg gcgaccttat 2881 gcgctcagtg ctgctgcttg cctacaagcc catggaagga aactttgaag agattgctcg 2941 agactttaat cccaactgga tgagtgctgt ggaaatcttg gatgatgaca attttctggg 3001 ggctgaaaat gcctttaact tgtttgtgtg tcaaaaggat agcgctgcca ccactgacga 3061 ggagcggcag cacctccagg aggttggtct tttccacctg ggcgagtttg tcaatgtctt 3121 ttgccacggc tctctggtaa tgcagaatct gggtgagact tccaccccca cacaaggctc 3181 ggtgctcttc ggcacggtca acggcatgat agggctggtg acctcactgt cagagagctg 3241 gtacaacctc ctgctggaca tgcagaatcg actcaataaa gtcatcaaaa gtgtggggaa 3301 gatcgagcac tccttctgga gatcctttca caccgagcgg aagacagaac cagccacagg 3361 tttcatcgac ggtgacttga ttgagagttt cctggatatt agccgcccca agatgcagga 3421 ggtggtggca aacctacagt atgacgatgg cagcggtatg aagcgagagg ccactgcaga 3481 cgacctcatc aaggttgtgg aggagctaac tcggatccat tagccaaggg cagggggccc 3541 cctttctgac cctccccaaa ggctttgccc tgctgccctc cccctcctct ccaccatcgt 3601 cttcttggcc atgggaggcc tttccctaag ccagctgccc ccagagccac agttccccta 3661 tgtggaagtg gggcgggctt catagagact tgggaatgag ctgaaggtga aacattttct 3721 ccctggattt ttaccagtct cacatgattc cagccatcac cttagaccac caagccttga 3781 ttggtgttgc cagttgtcct ccttccgggg aaggattttg cagttctttg gctgaaagga 3841 agctgtgcgt gtgtgtgtgt gtatgtgtgt gtgtgtatgt gtatctcaca ctcatgcaat 3901 gtcctctttt tatttagatt ggcagtgtag ggagttgtgg gtagtgggga agagggttag 3961 gagggtttca ttgtctgtga agtgagacct tccttttact tttcttctat tgcctctgag 4021 agcatcagcc tagaggcctg actgccaagc catgggtagc ctgggtgtaa aacctggaga 4081 tggtggatga tccccacgcc acagcccttt tgtctctgca aactgccttc ttcggaaaga 4141 agaaggtggg aggatgtgaa ttgttagttt ctgagtttta ccaaataaag tag // LOCUS HSU18300 1820 bp mRNA PRI 12-SEP-1996 DEFINITION Human damage-specific DNA binding protein p48 subunit (DDB2) mRNA, complete cds. ACCESSION U18300 NID g1536965 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1820) AUTHORS Dualan,R., Brody,T., Keeney,S., Nichols,A.F., Admon,A. and Linn,S. TITLE Chromosomal localization and cDNA cloning of the genes (DDB1 and DDB2) for the p127 and p48 subunits of a human damage-specific DNA binding protein JOURNAL Genomics 29 (1), 62-69 (1995) MEDLINE 96079092 REFERENCE 2 (bases 1 to 1820) AUTHORS Brody,T., Keeney,S., Nichols,A.F. and Linn,S. TITLE Direct Submission JOURNAL Submitted (08-DEC-1994) Stuart Linn, University of California, Berkeley, 401 Barker Hall, Berkeley, CA 94720-3202, USA FEATURES Location/Qualifiers source 1..1820 /organism="Homo sapiens" /isolate="fetal skin fibroblast CRL 1262; cervical carcinoma Hela S3" /note="same sequence in both tissues" /db_xref="taxon:9606" /chromosome="11" /map="11p11-p12" /cell_line="Hela S3" /cell_type="fibroblast" /tissue_type="epidermal; epithelial" gene 176..1459 /gene="DDB2" CDS 176..1459 /gene="DDB2" /note="damage-specific DNA binding protein p48 subunit; implicated in Xeroderma pigmentosum group E" /codon_start=1 /evidence=experimental /product="DDBb p48" /db_xref="PID:g1536966" /translation="MAPKKRPETQKTSEIVLRPRNKRSRSPLELEPEAKKLCAKGSGP SRRCDSDCLWVGLAGPQILPPCRSIVRTLHQHKLGRASWPSVQQGLQQSFLHTLDSYR ILQKAAPFDRRATSLAWHPTHPSTVAVGSKGGDIMLWNFGIKDKPTFIKGIGAGGSIT GLKFNPLNTNQFYASSMEGTTRLQDFKGNILRVFASSDTINIWFCSLDVSASSRMVVT GDNVGNVILLNMDGKELWNLRMHKKKVTHVALNPCCDWFLATASVDQTVKIWDLRQVR GKASFLYSLPHRHPVNAACFSPDGARLLTTDQKSEIRVYSASQWDCPLGLIPHPHRHF QHLTPIKAAWHPRYNLIVVGRYPDPNFKSCTPYELRTIDVFDGNSGKMMCQLYDPESS GISSLNEFNPMGDTLASAMGYHILIWSQEEARTRK" polyA_signal 1806..1811 BASE COUNT 435 a 498 c 485 g 402 t ORIGIN 1 gagctccaag ctggtttgaa caagccctgg gcatgtttgg cgggaagttg gcttagctcg 61 gctacctgtg gccccgcagt tttgtagtcc ccgccttgtt tctccccaga ggcctctcaa 121 tcctccctcc atgatcttcg catagagcac agtacccctt cacacggagg acgcgatggc 181 tcccaagaaa cgcccagaaa cccagaagac ctccgagatt gtattacgcc ccaggaacaa 241 gaggagcagg agtcccctgg agctggagcc cgaggccaag aagctctgtg cgaagggctc 301 cggtcctagc agaagatgtg actcagactg cctctgggtg gggctggctg gcccacagat 361 cctgccacca tgccgcagca tcgtcaggac cctccaccag cataagctgg gcagagcttc 421 ctggccatct gtccagcagg ggctccagca gtcctttttg cacactctgg attcttaccg 481 gatattacaa aaggctgccc cctttgacag gagggctaca tccttggcgt ggcacccaac 541 tcaccccagc accgtggctg tgggttccaa agggggagat atcatgctct ggaattttgg 601 catcaaggac aaacccacct tcatcaaagg gattggagct ggagggagca tcactgggct 661 gaagtttaac cctctcaata ccaaccagtt ttacgcctcc tcaatggagg gaacaactag 721 gctgcaagac tttaaaggca acattctacg agtttttgcc agctcagaca ccatcaacat 781 ctggttttgt agcctggatg tgtctgctag tagccgaatg gtggtcacag gagacaacgt 841 ggggaacgtg atcctgctga acatggacgg caaagagctt tggaatctca gaatgcacaa 901 aaagaaagtg acgcatgtgg ccctgaaccc atgctgtgat tggttcctgg ccacagcctc 961 cgtagatcaa acagtgaaaa tttgggacct gcgccaggtt agagggaaag ccagcttcct 1021 ctactcgctg ccgcacaggc atcctgtcaa cgcagcttgt ttcagtcccg atggagcccg 1081 gctcctgacc acggaccaga agagcgagat ccgagtttac tctgcttccc agtgggactg 1141 ccccctgggc ctgatcccgc accctcaccg tcacttccag cacctcacac ccatcaaggc 1201 agcctggcat cctcgctaca acctcattgt tgtgggccga tacccagatc ctaatttcaa 1261 aagttgtacc ccttatgaat tgaggacgat cgacgtgttc gatggaaact cagggaagat 1321 gatgtgtcag ctctatgacc cagaatcttc tggcatcagt tcgcttaatg aattcaatcc 1381 catgggggac acgctggcct ctgcaatggg ttaccacatt ctcatctgga gccaggagga 1441 agccaggaca cggaagtgag agacactaaa gaaggtgtgg gccagacaag gccttggagc 1501 ccacacatgg gatcaagtcc tgcaagcaga ggtggtgatt tgttaaaggg ccaaaagtat 1561 ccaaggttag ggttggagca ggggtgctgg gacctggggc actgtgggac tgggacactt 1621 ttatgttaat gctctggact tgcctccaga gactgctcca gagttggtga cacagctgtc 1681 ccaagggccc ctctgtatct agcctggaac caaggttatc ttggaactaa atgacttttc 1741 tcctctcagt gggtggtagc agagggatca agcagttatt tgatttgtgc tcttttgata 1801 tggccaataa aaccataccg // LOCUS HSU18423 1491 bp mRNA PRI 12-MAY-1995 DEFINITION Human spinal muscular atrophy gene product mRNA, complete cds. ACCESSION U18423 NID g624185 KEYWORDS spinal muscular atrophy determining gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1491) AUTHORS Lefebvre,S., Burglen,L., Reboullet,S., Clermont,O., Burlet,P., Viollet,L., Benichou,B., Cruaud,C., Millasseau,P., Zeviani,M. et al. TITLE Identification and characterization of a spinal muscular atrophy-determining gene [see comments] JOURNAL Cell 80 (1), 155-165 (1995) MEDLINE 95112343 REFERENCE 2 (bases 1 to 1491) AUTHORS Lefebvre,S. TITLE Direct Submission JOURNAL Submitted (13-DEC-1994) Suzie Lefebvre, INSERM U-393, Hopital Necker- Enfants Malades, 149 rue de Sevres, 75743, Paris, Cedex 15, France FEATURES Location/Qualifiers source 1..1491 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="541" /chromosome="5" /map="5q13" 5'UTR 1..33 /gene="spinal muscular atrophy determining gene" exon 1..114 /gene="spinal muscular atrophy determining gene" /number=1 gene 1..1491 /gene="spinal muscular atrophy determining gene" CDS 34..918 /gene="spinal muscular atrophy determining gene" /note="survival motor neuron" /codon_start=1 /db_xref="PID:g624186" /translation="MAMSSGGSGGGVPEQEDSVLFRRGTGQSDDSDIWDDTALIKAYD KAVASFKHALKNGDICETSGKPKTTPKRKPAKKNKSQKKNTAASLQQWKVGDKCSAIW SEDGCIYPATIASIDFKRETCVVVYTGYGNREEQNLSDLLSPICEVANNIEQNAQENE NESQVSTDESENSRSPGNKSDNIKPKSAPWNSFLPPPPPMPGPRLGPGKPGLKFNGPP PPPPPPPPHLLSCWLPPFPSGPPIIPPPPPICPDSLDDADALGSMLISWYMSGYHTGY YMGFRQNQKEGRCSHSLN" exon 115..306 /gene="spinal muscular atrophy determining gene" /number=2 exon 307..507 /gene="spinal muscular atrophy determining gene" /number=3 exon 508..660 /gene="spinal muscular atrophy determining gene" /number=4 exon 661..756 /gene="spinal muscular atrophy determining gene" /number=5 exon 757..867 /gene="spinal muscular atrophy determining gene" /number=6 exon 868..921 /gene="spinal muscular atrophy determining gene" /number=7 3'UTR 919..1491 /gene="spinal muscular atrophy determining gene" exon 922..1491 /gene="spinal muscular atrophy determining gene" /note="exon 8 is noncoding" /number=8 polyA_site 1491 /gene="spinal muscular atrophy determining gene" /note="91 A nucleotides" BASE COUNT 470 a 283 c 336 g 402 t ORIGIN 1 cggggcccca cgctgcgcat ccgcgggttt gctatggcga tgagcagcgg cggcagtggt 61 ggcggcgtcc cggagcagga ggattccgtg ctgttccggc gcggcacagg ccagagcgat 121 gattctgaca tttgggatga tacagcactg ataaaagcat atgataaagc tgtggcttca 181 tttaagcatg ctctaaagaa tggtgacatt tgtgaaactt cgggtaaacc aaaaaccaca 241 cctaaaagaa aacctgctaa gaagaataaa agccaaaaga agaatactgc agcttcctta 301 caacagtgga aagttgggga caaatgttct gccatttggt cagaagacgg ttgcatttac 361 ccagctacca ttgcttcaat tgattttaag agagaaacct gtgttgtggt ttacactgga 421 tatggaaata gagaggagca aaatctgtcc gatctacttt ccccaatctg tgaagtagct 481 aataatatag aacagaatgc tcaagagaat gaaaatgaaa gccaagtttc aacagatgaa 541 agtgagaact ccaggtctcc tggaaataaa tcagataaca tcaagcccaa atctgctcca 601 tggaactctt ttctccctcc accacccccc atgccagggc caagactggg accaggaaag 661 ccaggtctaa aattcaatgg cccaccaccg ccaccgccac caccaccacc ccacttacta 721 tcatgctggc tgcctccatt tccttctgga ccaccaataa ttcccccacc acctcccata 781 tgtccagatt ctcttgatga tgctgatgct ttgggaagta tgttaatttc atggtacatg 841 agtggctatc atactggcta ttatatgggt ttcagacaaa atcaaaaaga aggaaggtgc 901 tcacattcct taaattaagg agaaatgctg gcatagagca gcactaaatg acaccactaa 961 agaaacgatc agacagatct ggaatgtgaa gcgttataga agataactgg cctcatttct 1021 tcaaaatatc aagtgttggg aaagaaaaaa ggaagtggaa tgggtaactc ttcttgatta 1081 aaagttatgt aataaccaaa tgcaatgtga aatattttac tggactcttt tgaaaaacca 1141 tctgtaaaag actggggtgg gggtgggagg ccagcacggt ggtgaggcag ttgagaaaat 1201 ttgaatgtgg attagatttt gaatgatatt ggataattat tggtaatttt atggcctgtg 1261 agaagggtgt tgtagtttat aaaagactgt cttaatttgc atacttaagc atttaggaat 1321 gaagtgttag agtgtcttaa aatgtttcaa atggtttaac aaaatgtatg tgaggcgtat 1381 gtggcaaaat gttacagaat ctaactggtg gacatggctg ttcattgtac tgtttttttc 1441 tatcttctat atgtttaaaa gtatataata aaaatattta attttttttt a // LOCUS HSU18543 2424 bp mRNA PRI 02-FEB-1996 DEFINITION Human zinc-finger protein mRNA, complete cds. ACCESSION U18543 NID g758422 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2424) AUTHORS Fridell,R.A., Harding,L.S., Bogerd,H.P. and Cullen,B.R. TITLE Identification of a novel human zinc finger protein that specifically interacts with the activation domain of lentiviral Tat proteins JOURNAL Virology 209 (2), 347-357 (1995) MEDLINE 95297135 REFERENCE 2 (bases 1 to 2424) AUTHORS Fridell,R.A. TITLE Direct Submission JOURNAL Submitted (13-DEC-1994) Robert A. Fridell, Genetics, Howard Hughes Medical Institute at Duke Univ. Medical Center, DUMC, Durham, NC 27710, USA FEATURES Location/Qualifiers source 1..2424 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HT2A" CDS 111..2072 /codon_start=1 /product="zinc-finger protein" /db_xref="PID:g758423" /translation="MAAAAASHLNLDALREVLECPICMESITEEQLRPKLLHCGHTIC RQCLEKLLASSINGVRCPFCSKITRITSLTQLTDNLTVLKIIDTAGLSEAVGLLMCRS CGRRLPRQFCRSCGLVLCEPCREADHQPPGHCTLPVKEAAEERRRDFGEKLTRLRELM GELQRRKAALEGVSKDLQARYKAVLQEYGHEERRVQDELARSRKFFTGSLAEVEKSNS QVVEEQSYLLNIAEVQAVSRCDYFLAKIKQADVALLEETADEEEPELTASLPRELTLQ DVELLKVGHVGPLQIGQAVKKPRTVNVEDSWAMEATASAASTSVTFREMDMSPEEVVA SPRASPAKQRGPEAASNIQQCLFLKKMGAKGSTPGMFNLPVSLYVTSQGEVLVADRGN YRIQVFTRKGFLKEIRRSPSGIDSFVLSFLGADLPNLTPLSVAMNCQGLIGVTDSYDN SLKVYTLDGHCVACHRSQLSKPWGITALPSGQFVVTDVEGGKLWCFTVDRGSGVVKYS CLCSAVRPKFVTCDAEGTVYFTQGLGLNLENRQNEHHLEGGFSIGSVGPDGQLGRQIS HFFSENEDFRCIAGMCVDARGDLIVADSSRKEILHFPKGGGYSVLIREGLTCPVGIAL TPKGQLLVLDCWDHCIKIYSYHLRRYSTP" BASE COUNT 540 a 607 c 673 g 604 t ORIGIN 1 gtgggctcgt cggagccgcg ggcggtcagc aggaatttga ccctctaggg catgaatact 61 gtgctgttca gttctgagct gtgctagcaa tacccttcaa aggaagagca atggctgcag 121 cagcagcttc tcacctgaac ctggatgccc tccgggaagt gctagaatgc cccatctgca 181 tggagtccat cacagaagag cagctgcgtc ccaagcttct gcactgtggc cataccatct 241 gccgccagtg cctggagaag ctattggcca gtagcatcaa tggtgtccgc tgtccctttt 301 gcagcaagat tacccgcata accagcttga cccagctgac agacaatctg acagtgctaa 361 agatcattga tacagctggg ctcagcgagg ctgtggggct gctcatgtgt cggtcctgtg 421 ggcggcgtct gccccggcaa ttctgccgga gctgtggttt ggtgttatgt gagccctgcc 481 gggaggcaga ccatcagcct cctggccact gtacactccc tgtcaaagaa gcagctgagg 541 agcggcgtcg ggactttgga gagaagttaa ctcgtctgcg ggaacttatg ggggagctgc 601 agcggcggaa ggcagccttg gaaggtgtct ccaaggacct tcaggcaagg tataaagcag 661 ttctccagga gtatgggcat gaggagcgca gggtccagga tgagctggct cgctctcgga 721 agttcttcac aggctctttg gctgaagttg agaagtccaa tagtcaagtg gtagaggagc 781 agagttacct gcttaacatt gcagaggtgc aggctgtgtc tcgctgtgac tacttcctgg 841 ccaagatcaa gcaggcagat gtagcactac tggaggagac agctgatgag gaggagccag 901 agctcactgc cagcttgcct cgggagctca ccctgcaaga tgtggagctc cttaaggtag 961 gtcatgttgg ccccctccaa attggacaag ctgttaagaa gccccggaca gttaacgtgg 1021 aagattcctg ggccatggag gccacagcgt ctgctgcctc tacctctgtt acttttagag 1081 agatggacat gagcccggag gaagtggttg ccagccctag ggcctcacct gctaaacagc 1141 ggggtcctga ggcagcctcc aatatccagc agtgcctctt tctcaagaag atgggggcca 1201 aaggcagcac tccaggaatg ttcaatcttc cagtcagtct ctacgtgacc agtcaaggtg 1261 aagtactagt cgctgaccgt ggtaactatc gtatacaagt ctttacccgc aaaggctttt 1321 tgaaggaaat ccgccgcagc cccagtggca ttgatagctt tgtgctaagc ttccttgggg 1381 cagatctacc caacctcact cctctctcag tggcaatgaa ctgccagggg ctgattggtg 1441 tgactgacag ctatgataac tccctcaagg tatatacctt ggatggccac tgcgtggcct 1501 gtcacaggag ccagctgagc aaaccatggg gtatcacagc cttgccatct ggccagtttg 1561 tagtaaccga tgtggaaggt ggaaagcttt ggtgtttcac agttgatcga ggatcagggg 1621 tggtcaaata cagctgccta tgtagtgctg tgcggcccaa atttgtcacc tgtgatgctg 1681 agggcaccgt ctacttcacc cagggcttag gcctcaatct ggagaatcgg cagaatgagc 1741 accacctgga gggtggcttt tccattggct ctgtaggccc tgatgggcag ctgggtcgcc 1801 agattagcca cttcttctcg gagaatgagg atttccgctg cattgctggc atgtgtgtgg 1861 atgctcgtgg tgatctcatc gtggctgaca gtagtcgcaa ggaaattctc cattttccta 1921 agggcggggg ctatagtgtc cttattcgag agggacttac ctgtccggtg ggcatagccc 1981 taactcctaa ggggcagctg ctggtcttgg actgttggga tcattgcatc aagatctaca 2041 gctaccatct gagaagatat tccaccccat aggggatgag aaattatcag tttcttctgc 2101 tcccaagcca acttcccttc ccttagttct tggttgttag tggcacatgc agaatagact 2161 cagcctatgt cctgattcca gctgggtagt tctagaactt tagaagctcc atcttttaat 2221 gtttttattt gttatgtccc cctcccggct tcccacctaa atttagagct ttaaaagatg 2281 cactgcccaa ataggacaca cgatggtgtt agctgaagtt tgattagcaa ttaggcactt 2341 ccaaggcttt agtagagaga gccactttag ccctttgtgc catgtttgaa atttgccctt 2401 gtattaaatc cttgattttt tccc // LOCUS HSU18548 1230 bp DNA PRI 08-MAR-1996 DEFINITION Human GPR12 G protein coupled-receptor gene, complete cds. ACCESSION U18548 NID g604499 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1230) AUTHORS Song,Z.H., Modi,W. and Bonner,T.I. TITLE Molecular cloning and chromosomal localization of human genes encoding three closely related G protein-coupled receptors JOURNAL Genomics 28 (2), 347-349 (1995) MEDLINE 96015070 REFERENCE 2 (bases 1 to 1230) AUTHORS Bonner,T.I. TITLE Direct Submission JOURNAL Submitted (13-DEC-1994) Tom I. Bonner, Laboratory of Cell Biology, National Institute of Mental Health, Bldg 36, Room 3A-07, Bethesda, MD 20892-4090, USA FEATURES Location/Qualifiers source 1..1230 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="h6-7 c1" /clone_lib="human genomic in pWE15 cosmid vector (Stratagene)" /chromosome="13" /map="13q12" exon 130..1230 /note="based on discontinuity in similarity to rat G protein coupled-receptor cDNA, GenBank Accession Number X61496, and mouse cDNA, GenBank Accession Number D21061, 5' of this point; another rat cDNA, GenBank Accession Number U12184, has similarity extending 5' of this point to base 1 as though it is not spliced" CDS 145..1149 /note="putative GPR12 G protein coupled-receptor, ligand unknown" /codon_start=1 /db_xref="PID:g604500" /translation="MNEDLKVNLSGLPRDYLDAAAAENISAAVSSRVPAVEPEPELVV NPWDIVLCTSGTLISCENAIVVLIIFHNPSLRAPMFLLIGSLALADLLAGIGLITNFV FAYLLQSEATKLVTIGLIVASFSASVCSLLAITVDRYLSLYYALTYHSERTVTFTYVM LVMLWGTSICLGLLPVMGWNCLRDESTCSVVRPLTKNNAAILSVSFLFMFALMLQLYI QICKIVMRHAHQIALQHHFLATSHYVTTRKGVSTLAIILGTFAACWMPFTLYSLIADY TYPSIYTYATLLPATYNSIINPVIYAFRNQEIQKALCLICCGCIPSSLAQRARSPSDV " BASE COUNT 234 a 379 c 284 g 333 t ORIGIN 1 aagcttgtgg catttggtac tggtatctga gcaggggctg gctttctgtt tgtctgtgtg 61 ttttttgcat gatcttggat tgtcaccctg ctgtatttaa acattaaaaa gcctgtcttt 121 tcgttgaaga ggacaggggt taaaatgaat gaagacctga aggtcaattt aagcgggctg 181 cctcgggatt atttagatgc cgctgctgcg gagaacatct cggctgctgt ctcctcccgg 241 gttcctgccg tagagccaga gcctgagctc gtagtcaacc cctgggacat tgtcttgtgt 301 acctcgggaa ccctcatctc ctgtgaaaat gccattgtgg tccttatcat cttccacaac 361 cccagcctgc gagcacccat gttcctgcta ataggcagcc tggctcttgc agacctgctg 421 gccggcattg gactcatcac caattttgtt tttgcctacc tgcttcagtc agaagccacc 481 aagctggtca cgatcggcct cattgtcgcc tctttctctg cctctgtctg cagcttgctg 541 gctatcactg ttgaccgcta cctctcactg tactacgctc tgacgtacca ttcggagagg 601 acggtcacgt ttacctatgt catgctcgtc atgctctggg ggacctccat ctgcctgggg 661 ctgctgcccg tcatgggctg gaactgcctc cgagacgagt ccacctgcag cgtggtcaga 721 ccgctcacca agaacaacgc ggccatcctc tcggtgtcct tcctcttcat gtttgcgctc 781 atgcttcagc tctacatcca gatctgtaag attgtgatga ggcacgccca tcagatagcc 841 ctgcagcacc acttcctggc cacgtcgcac tatgtgacca cccggaaagg ggtctccacc 901 ctggctatca tcctggggac gtttgctgct tgctggatgc ctttcaccct ctattccttg 961 atagcggatt acacctaccc ctccatctat acctacgcca ccctcctgcc cgccacctac 1021 aattccatca tcaaccctgt catatatgct ttcagaaacc aagagatcca gaaagcgctc 1081 tgtctcattt gctgcggctg catcccgtcc agtctcgccc agagagcgcg ctcgcccagt 1141 gatgtgtagc acccttgcac ccaggaggac tctgcattta ccaagcactt ccactgcctg 1201 gccaaggttt gagatgcttc ccttgaattc // LOCUS HSU18549 2699 bp DNA PRI 08-MAR-1996 DEFINITION Human GPR6 G protein-coupled receptor gene, complete cds. ACCESSION U18549 NID g604501 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2699) AUTHORS Song,Z.H., Modi,W. and Bonner,T.I. TITLE Molecular cloning and chromosomal localization of human genes encoding three closely related G protein-coupled receptors JOURNAL Genomics 28 (2), 347-349 (1995) MEDLINE 96015070 REFERENCE 2 (bases 1 to 2699) AUTHORS Bonner,T.I. TITLE Direct Submission JOURNAL Submitted (13-DEC-1994) Tom I. Bonner, Laboratory of Cell Biology, National Institute of Mental Health, Bldg 36, Room 3A-07, Bethesda, MD 20892-4090, USA FEATURES Location/Qualifiers source 1..2699 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HCN3A c1" /clone_lib="human genomic in pWE15 cosmid vector (Stratagene)" /chromosome="6" /map="6q21" exon 1..66 /note="similar to part of 5'UTR of rat G protein-coupled receptor cDNA, GenBank Accession Number U12006" exon 694..2307 CDS 712..1800 /note="putative GPR6 G protein-coupled receptor, ligand unknown" /codon_start=1 /db_xref="PID:g604502" /translation="MNASAASLNDSQVVVVAAEGAAAAATAAGGPDTGEWGPPAAAAL GAGGGANGSLELSSQLSAGPPGLLLPAVNPWDVLLCVSGTVIAGENALVVALIASTPA LRTPMFVLVGSLATADLLAGCGLILHFVFQYLVPSETVSLLTVGFLVASFAASVSSLL AITVDRYLSLYNALTYYSRRTLLGVHLLLAATWTVSLGLGLLPVLGWNCLAERAACSV VRPLARSHVALLSAAFFMVFGIMLHLYVRICQVVWRHAHQIALQQHCLAPPHLAATRK GVGTLAVVLGTFGASWLPFAIYCVVGSHEDPAVYTYATLLPATYNSMINPIIYAFRNQ EIQRALWLLLCGCFQSKVPFRSRSPSEV" polyA_signal 2296..2301 polyA_site 2308 /note="by similarity with rat cDNA, GenBank Accession Number U12006" BASE COUNT 528 a 770 c 731 g 670 t ORIGIN 1 gagctcagcc cggcgccccg cctcggcgcc catgaccagc gacttccaga agtcctgacg 61 ccccaggtga gtggccgagt tcccagggaa ctttgaggat gtggggagag gaggggagaa 121 agatatccag gggacaggtc tttccagaag aggagaaatg gagccaacct tgactcccac 181 cctctgctgc ccccatacac acaccctagt ctctcctccc agaccccctt cgcaggggtc 241 tctgggctcc aggactctcc aggcttcgca atgccagaaa cttcgctttc agcaccgcgg 301 acagcgtctc tgctgcccag ccccatgggg atgtcctggt cgcgccgcca caacccaggc 361 gggactctcc caggatgaca ctcctagctt ggtgcactcg ggtgcgtgtg caaatacagg 421 cagccttgga gaattgatcc ttgtgcatgt gagtgcatgt tgtcctggtg tatatctgca 481 aaaactcaga agtccgcgtg ggcatttcac ttactcactc aacaggtatt tattttgggt 541 tcgcacttaa cccatgattc cccgatttgc tctggggtcc cagggtgggg tgggcaactc 601 aagtggggca atgcgagagg aggctctacg cgtggggaag tttcctggca gcctgaagtg 661 tacacctgac gcctgcactc cctccctatg cagggtgcaa atccggccgc gatgaacgcg 721 agcgccgcct cgctcaacga ctcccaggtg gtggtagtgg cggccgaagg agcggcggcg 781 gcggccacag cagcaggggg gccggacacg ggcgaatggg gaccccctgc tgcggcggct 841 ctaggagccg gcggcggagc taatgggtct ctggagctgt cctcgcagct gtcggctggg 901 ccaccgggac tcctgctgcc agcggtgaat ccgtgggacg tgctcctgtg cgtgtcgggg 961 acagtgatcg ctggagaaaa cgcgctggtg gtggcgctca tcgcgtccac tccggcgctg 1021 cgcacgccca tgttcgtgct ggtaggcagc ctggccaccg ctgacctgtt ggcgggctgt 1081 ggcctcatct tgcactttgt gttccagtac ttggtgccct cggagactgt gagtctgctc 1141 acggtgggct tcctcgtggc ctccttcgcc gcctctgtca gcagcctgct ggccattacg 1201 gtggaccgct acctgtccct gtataacgcg ctcacctatt actcgcgccg gaccctgttg 1261 ggcgtgcacc tcctgcttgc cgccacttgg accgtgtccc taggcctggg gctgctgccc 1321 gtgctgggct ggaactgcct ggcagagcgc gccgcctgca gcgtggtgcg cccgctggcg 1381 cgcagccacg tggctctgct ctccgccgcc ttcttcatgg tcttcggcat catgctgcac 1441 ctgtacgtgc gcatctgcca ggtggtctgg cgccacgcgc accagatcgc gctgcagcag 1501 cactgcctgg cgccacccca tctcgctgcc accagaaagg gtgtgggtac actggctgtg 1561 gtgctgggca ctttcggcgc cagctggctg cccttcgcca tctattgcgt ggtgggcagc 1621 catgaggacc cggcggtcta cacttacgcc accctgctgc ccgccaccta caactccatg 1681 atcaatccca tcatctatgc cttccgcaac caggagatcc agcgcgccct gtggctcctg 1741 ctctgtggct gtttccagtc caaagtgccc tttcgttcca ggtctcccag cgaggtctga 1801 agggctcgcc ccgtgtcctc tcaccaacac cacaccccaa caagccagcc tttggtaagc 1861 tcggtgcctg ctgacgaact ctgagatccc aatggtgtga gtctgacttt ggaaagaaaa 1921 agggactaaa gagaaatgta acaaacttac aaggacaaag aggcttgttg gcactttaca 1981 tatacagtgt atacatgtgt acatatatat acaaatattt gtatcttctg gaggtgttca 2041 ggatgtggag cttcctgttc tgtgaaaaac caagaaaaag atatggttgt atactcaaat 2101 tgtacatcac gtttgtcaaa cgaagacatt ccaatactgc ttaattatag cactttattt 2161 ttagctgctg aactgccaaa acagtgttgc cattttcaag ggcagggaaa agggagtaaa 2221 aggtgtattt ttgtcgtatg tgatagaata ttttgctgca catgcatcaa caaattacaa 2281 catgttttgt acacgaataa acccattaca agaatgtaat ttggggtatg tcactgacta 2341 cagaattaca attagctgaa ttgtaagtgt atgagtgtct ttctttcctt tctttcttcc 2401 tttctttctt tcttgcttgc ttgctttctc ttgctttctt tctttctctt tcgtttgttc 2461 gagatagagt ctcactctgt cgcccaggcc tgaatgcagt ggcacaatca tagctcagtg 2521 cagccttgaa ctcctgggcc caagaaatcc tgctttagca tccctagtag ctgggactac 2581 aggcatgcca cagcactcac cttactttat ttattttttt aagtttttaa aatttcagta 2641 gttttgaggg tacaggtgcc ttttttggtt acatagatga gttctttagt agtgaattc // LOCUS HSU18728 1729 bp mRNA PRI 17-JAN-1996 DEFINITION Human lumican mRNA, complete cds. ACCESSION U18728 NID g642533 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1729) AUTHORS Grover,J., Chen,X.N., Korenberg,J.R. and Roughley,P.J. TITLE The human lumican gene. Organization, chromosomal location, and expression in articular cartilage JOURNAL J. Biol. Chem. 270 (37), 21942-21949 (1995) MEDLINE 95394964 REFERENCE 2 (bases 1 to 1729) AUTHORS Grover,J. TITLE Direct Submission JOURNAL Submitted (15-DEC-1994) Judy Grover, Genetics Unit, Shriners' Hospital for Crippled Children, 1529 Cedar Ave., Montreal, Quebec, Canada, H3G 1A6 FEATURES Location/Qualifiers source 1..1729 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hp-1, hac-1, hac-2, hfi-2" /tissue_type="cartilage, placenta, intestine" 5'UTR 1..80 CDS 81..1097 /codon_start=1 /evidence=experimental /product="lumican" /db_xref="PID:g642534" /translation="MSLSAFTLFLALIGGTSGQYYDYDFPPSIYGQSSPNCAPECNCP ESYPSAMYCDELKLKSVPMVPPGIKYLYLRNNQIDHIDEKAFENVTDLQWLILDHNVL ENSKIKGRVFSKLKQLKKLHINHNNLTESVGPLPKSLEDLQLTHNKITKLGSFEGLVN LTFIHLQHNRLKEDAVSAAFKGLKSLEYLDLSFNQIARLPSGLPVSLLTLYLDNNKIS NIPDEYFKRFNALQYLRLSHNELADSGIPGNSFNVSSLVELDLSYNKLKNIPTVNENL ENYYLEVNQLEKFDIKSFCKILGPLSYSKIKHLRLDGNRISETSLPPDMYECLRVANE VTLN" 3'UTR 1098..1729 polyA_site 1693..1698 /note="This is the only consensus polyA signal in the sequence entry and correlates with mRNA size observed on Northern blot analysis." /label=polyA_signal /evidence=experimental BASE COUNT 552 a 336 c 297 g 544 t ORIGIN 1 attcttgtcc atagtgcatc tgctttaaga attaacgaaa gcagtgtcaa gacagtaagg 61 attcaaacca tttgccaaaa atgagtctaa gtgcatttac tctcttcctg gcattgattg 121 gtggtaccag tggccagtac tatgattatg attttccccc atcaatttat gggcaatcat 181 caccaaactg tgcaccagaa tgtaactgcc ctgaaagcta cccaagtgcc atgtactgtg 241 atgagctgaa attgaaaagt gtaccaatgg tgcctcctgg aatcaagtat ctttacctta 301 ggaataacca gattgaccat attgatgaaa aggcctttga gaatgtaact gatctgcagt 361 ggctcattct agatcacaac gttctagaaa actccaagat aaaagggaga gttttctcta 421 aattgaaaca actgaagaag ctgcatataa accacaacaa cctgacagag tctgtgggcc 481 cacttcccaa atctctggag gatctgcagc ttactcataa caagatcaca aagctgggct 541 cttttgaagg attggtaaac ctgaccttca tccatctcca gcacaatcgg ctgaaagagg 601 atgctgtttc agctgctttt aaaggtctta aatcactcga ataccttgac ttgagcttca 661 atcagatagc cagactgcct tctggtctcc ctgtctctct tctaactctc tacttagaca 721 acaataagat cagcaacatc cctgatgagt atttcaagcg ttttaatgca ttgcagtatc 781 tgcgtttatc tcacaacgaa ctggctgata gtggaatacc tggaaattct ttcaatgtgt 841 catccctggt tgagctggat ctgtcctata acaagcttaa aaacatacca actgtcaatg 901 aaaaccttga aaactattac ctggaggtca atcaacttga gaagtttgac ataaagagct 961 tctgcaagat cctggggcca ttatcctact ccaagatcaa gcatttgcgt ttggatggca 1021 atcgcatctc agaaaccagt cttccaccgg atatgtatga atgtctacgt gttgctaacg 1081 aagtcactct taattaatat ctgtatcctg gaacaatatt ttatggttat gtttttctgt 1141 gtgtcagttt tcatagtatc catattttat tactgtttat tacttccatg aattttaaaa 1201 tctgagggaa atgttttgta aacatttatt ttttttaaag aaaagatgaa aggcaggcct 1261 atttcatcac aagaacacac acatatacac gaatagacat caaactcaat gctttatttg 1321 taaatttagt gtttttttat ttctacggtc aaatgatgtg caaaaccttt tactggttgc 1381 atggaaatca gccaagtttt ataatcctta aatcttaatg ttcctcaaag cttggattaa 1441 atacatatgg atgttactct cttgcaccaa attatcttga tacttcaaat ttgtctggtt 1501 aaaaaatagg tggtagatat tgaggccaag aatattgcaa aatacatgaa ccttcatgca 1561 cttaaagaag tatttttaga ataagaattt gcatacttac ctagtgaaac ttttctagaa 1621 ttatttttca ctctaagtca tgtatgttcc tctttgatta tttgcatgtt atgtttaata 1681 agctactagc aaaataaaac atagcaaatg gcaaaaaaaa aaaaaaaaa // LOCUS HSU18914 3247 bp mRNA PRI 23-AUG-1995 DEFINITION Human 19.8 kDa protein mRNA, complete cds. ACCESSION U18914 NID g790224 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3247) AUTHORS Byrne,J.A., Tomasetto,C., Garnier,J.M., Rouyer,N., Mattei,M.G., Bellocq,J.P., Rio,M.C. and Basset,P. TITLE A screening method to identify genes commonly overexpressed in carcinomas and the identification of a novel complementary DNA sequence JOURNAL Cancer Res. 55 (13), 2896-2903 (1995) MEDLINE 95316866 REFERENCE 2 (bases 1 to 3247) AUTHORS Byrne,J.A. TITLE Direct Submission JOURNAL Submitted (20-DEC-1994) Jennifer A. Byrne, I.G.B.M.C., 1 rue Laurent Fries, Illkirch Cedex, 67404, France FEATURES Location/Qualifiers source 1..3247 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="D52" /clone_lib="breast carcinoma cDNA library in lambda ZAP II" /sex="female" /tissue_type="primary infiltrating ductal breast carcinoma" /dev_stage="adult" 5'UTR 1..91 CDS 92..646 /note="predicted 184 amino acid peptide with a molecular mass of 19.8 kDa" /codon_start=1 /function="unknown" /db_xref="PID:g790225" /translation="MDRGEQGLLRTDPVPEEGEDVAATISATETLSEEEQEELRRELA KVEEEIQTLSQVLAAKEKHLAEIKRKLGINSLQELKQNIAKGWQDVTATSAYKKTSET LSQAGQKASAAFSSVGSVITKKLEDVKNSPTFKSFEEKVENLKSKVGGTKPAGGDFGE VLNSAANASATTTEPLPEKTQESL" 3'UTR 647..3247 polyA_signal 2152..2157 /note="gives rise to a minor transcript" polyA_site 2171 variation 2251..2261 /note="varied in length between 2 cDNA clones sequenced" polyA_signal 3228..3233 /note="gives rise to major transcript" polyA_site 3247 BASE COUNT 1057 a 545 c 639 g 1006 t ORIGIN 1 gaggagctct gcgcggcgcg gcgggcgatc cgagccggga cgggctgcag gcgggggtgc 61 tgcagaggac acgaggcggc gggctggaga catggaccgc ggcgagcaag gtctgctgag 121 aacagaccca gtccctgagg aaggagaaga tgttgctgcc acgatcagtg ccacagagac 181 cctctcggaa gaggagcagg aagagctaag aagagaactt gcaaaggtag aagaagaaat 241 ccagactctg tctcaagtgt tagcagcaaa agagaagcat ctagcagaga tcaagcggaa 301 acttggaatc aattctctac aggaactaaa acagaacatt gccaaagggt ggcaagacgt 361 gacagcaaca tctgcttaca agaagacatc tgaaacctta tcccaggctg gacagaaggc 421 ctcagctgct ttttcgtctg ttggctcagt catcaccaaa aagctggaag atgtaaaaaa 481 ctccccaact tttaaatcat ttgaagaaaa ggtcgaaaac ttaaagtcta aagtaggggg 541 aaccaagcct gctggtggtg attttggaga agtcttgaat tcggctgcaa atgctagtgc 601 caccaccacg gagcctcttc cagaaaagac acaggagagc ctgtgagatt cctacctttg 661 ttctgctacc cactgccaga tgctgcaagc gaggtccaag cacatcttgt caacatgcat 721 tgccatgaat ttctaccaga tgtgctttta tttagcttta catattcctt tgaccaaata 781 gtttgtgggt taaacaaaat gaaaatatct tcacctctat tcttgggaaa caccctttag 841 tgtacattta tgttccttta tttaggaaac accattataa aaacacttat agtaaatggg 901 gacattcact ataatgatct aagaagctac agattgtcat agttgttttc ctgctttaca 961 aaattgctcc agatctggaa tgccagtttg acctttgtct tctataatat ttcctttttt 1021 tcccctcttt gaatctctgt atatttgatt cttaactaaa attgttctct taaatattct 1081 gaatcctggt aattaaaagt ttgggtgtat tttctttacc tccaaggaaa gaactactag 1141 ctacaaaaaa tattttggaa taagcattgt tttggtataa ggtacatatt ttggttgaag 1201 acaccagact gaagtaaaca gctgtgcatc caatttatta tagttttgta agtaacaata 1261 tgtaatcaaa cttctaggtg acttgagagt ggaacctcct atatcattat ttagcaccgt 1321 ttgtgacagt aaccatttca gtgtattgtt tattatacca cttatatcaa cttatttttc 1381 accaggttaa aattttaatt tctacaaaat aacattctga atcaagcaca ctgtatgttc 1441 agtaggttga actatgaaca ctgtcatcaa tgttcagttc aaaagcctga aagtttagat 1501 ctagaagctg gtaaaaatga caatatcaat cacattaggg gaaccattgt tgtcttcact 1561 taatccattt agcactattt aaaataagca caccaagtta tatgactaat ataacttgaa 1621 aattttttat actgaggggt tggtgataac tcttgaggat gtaatgcatt aataaaaatc 1681 aactcatcat tttctacttg ttttcaatgt gttggaaact gtaaaatgat actgtagaac 1741 ctgtctccta ctttgaaaac tgaatgtcag ggctgagtga atcaaagtgt ctagacatat 1801 ttgcatagag gccaaggtat tctattctaa taactgctta ctcaacacta ccaccttttc 1861 cttatactgt atatgattat ggcctacaat gttgtatttg ttatttatta aattgtgatt 1921 gttttattat tgtttatgcc aaatgttaac tgccaagctt ggagtgacct aaagcatttt 1981 ttaaaagcat ggctagattt acttcagtat aaattatctt atgaaaacca aattttaaaa 2041 gccacaggtg ttgattgtta taaaataaca tgctgccatt cttgattgct agagtttttg 2101 ttagtacttt ggatgcaatt aaaactatgt gctatcacat gtgaaaagct taataaattc 2161 catctatcag tagtataggt ctcaatattt attatgagac cagtggtctg gaaacagctt 2221 gttgtaccga atcaactgga gtctatgctt aaaaaaaaaa attttttttt aaccatcctt 2281 aaattattgc ttaatggtat catattaaca tattctaaat aagggcttta aggcacaggc 2341 tgttgaagca ttttctcaga ggagtggatc tgtagaagtc tgtctttcta tagaaatatt 2401 gtgcttactc aagtgttaaa ttattttttc tatgaactag tctacttctt aaaattcaaa 2461 catattcttt tgatcacatt gtttcttgag catcctgccc tgctactaac ttttcaacaa 2521 ggcaaaatgg agtaaagtgg caatttcttt agatgagtga aataccctca agtctctttt 2581 ctgcccaaaa agggaaaagt gatagaaatg ggggtggcaa gtggggtgag tggatgaagg 2641 tgggtattgg gggtggctgt gaaagaaaat aatggagaat cacttttcta gacatctacc 2701 tatacttaat ctaagaaaca aagtaatcta ctgtaaagta ctctgcccct tgaaagaagt 2761 attaaaaaga gtgaggatgg atttagaaaa aaacatgaat ttagaaatat tcaaaatggt 2821 ttttgtggca gattcaatat tatgaattca cagatattta aagaatgaga aacatagtaa 2881 ttagtagaaa tgccagaaac agttcctggt tcctcttgtg tttgacacta agaaaatagc 2941 aagagtgtga aatctcagat acttatgaaa tctcacagat gtaaggactc aagtgtagaa 3001 gaaaatatcc ccttcttaca aaaagaaatg tcaatttatg gagtttgtgg gaaatagggc 3061 aagaattctt atgcttatga gagccaagta gtcagtggaa gagagtagag ctcaaaactg 3121 gattatcacc ttagcaactt agaatagttt gaaatagaaa aaaagtattt aatttggatc 3181 tggatctgtt aagatatgca cagtctattt tttgtatagt attggaaaat aaaaatgcta 3241 taatttg // LOCUS HSU18937 2423 bp mRNA PRI 04-AUG-1995 DEFINITION Human histidyl-tRNA synthetase homolog (HO3) mRNA, complete cds. ACCESSION U18937 NID g899108 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS O'Hanlon,T.P., Raben,N. and Miller,F.W. TITLE A novel gene oriented in a head-to-head configuration with the human histidyl-tRNA synthetase (HRS) gene encodes an mRNA that predicts a polypeptide homologous to HRS JOURNAL Biochem. Biophys. Res. Commun. 210 (2), 556-566 (1995) MEDLINE 95275311 REFERENCE 2 (bases 1 to 2423) AUTHORS O'Hanlon,T.P. TITLE Direct Submission JOURNAL Submitted (20-DEC-1994) Terrance P. O'Hanlon, Laboratory of Molecular Immunology, Food and Drug Administration, National Institutes of Health, Bldg. 29B, Rm.2G11, 8800 Rockville Pike, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..2423 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HO3" /chromosome="5" /clone_lib="skeletal muscle cDNA library from Stratagene, Catalog No. 936215" 5'UTR 1..149 gene 150..1670 /gene="HO3" CDS 150..1670 /gene="HO3" /codon_start=1 /product="histidyl-tRNA synthetase homologue" /db_xref="PID:g899109" /translation="MPLLGLLPRRAWASLLSQLLRPPCASCTGAVRCQSQVAEAVLTS QLKAHQEKPNFIIKTPKGTRDLSPQHMVVREKILDLVISCFKRHGAKGMDTPAFELKE TLTEKYGEDSGLMYDLKDQGGELLSLRYDLTVPFARYLAMNKVKKMKRYHVGKVWRRE SPTIVQGRYREFCQCDFDIAGQFDPMIPDAECLKIMCEILSGLQLGDFLIKVNDRRIV DGMFAVCGVPESKFRAICSSIDKLDKMAWKDVRHEMVVKKGLAPEVADRIGDYVQCHG GVSLVEQMFQDPRLSQNKQALEGLGDLKLLFEYLTLFGIADKISFDLSLARGLDYYTG VIYEAVLLQTPTQAGEEPLNVGSVAAGGRYDGLVGMFDPKGHKVPCVGLSIGVERIFY IVEQRMKTKGEKVRTTETQVFVATPQKNFLQERLKLIAELWDSGIKAEMLYKNNPKLL TQLHYCESTGIPLVVIIGEQELKEGVIKIRSVASREEVAIKRENFVAEIQKRLSES" 3'UTR 1671..2423 polyA_signal 2406..2411 polyA_site 2423 /note="15 A nucleotides" BASE COUNT 598 a 529 c 677 g 619 t ORIGIN 1 cttcgtgact agtgaggtgc gcaaacgccc gagttttccc tggtgcgcgg gttccgcctt 61 tgcagtgccc tccacccttc ctggtgtctg acccgcctcc ttcccaggcc ttttgttcct 121 gtcccggaaa gccggcgtcc tgccgcgcga tgcccctgct cggacttctt cccaggaggg 181 cctgggcttc gctgctcagc cagctcctgc gaccgccctg cgcttcgtgc accggggcgg 241 tccgttgcca aagccaggtt gcagaggcag tgttaacatc ccaactgaaa gcacatcaag 301 agaaaccaaa ttttattatc aagaccccaa agggtaccag ggatcttagt cctcagcata 361 tggttgtgag ggagaaaatt cttgatttgg ttatcagctg ctttaaacgt catggagcaa 421 aggggatgga caccccagca tttgagctga aggaaaccct gactgagaag tatggagagg 481 actctgggct catgtatgat ctgaaggatc aaggtggaga gctgttgtcc ctccgctatg 541 accttactgt tccctttgct cgttatctgg ccatgaataa ggtgaagaag atgaaacgtt 601 atcatgttgg aaaggtgtgg cggcgagaga gcccaaccat agtccaaggc cgttataggg 661 agttctgcca gtgtgatttt gacattgctg gtcagtttga ccctatgatc cccgatgcag 721 agtgtttgaa gatcatgtgt gaaatcctaa gtggattgca gttgggagac tttctcatta 781 aggtaaatga ccggcggatt gtggatggga tgtttgctgt ctgtggtgtt cctgaaagca 841 agttccgtgc catctgctcc tccatagata aactagacaa gatggcttgg aaagatgtga 901 gacatgagat ggtggtgaag aaaggcctgg ctcctgaggt ggctgatcga attggggact 961 atgtccagtg tcatggtggg gtatccctag tagagcaaat gtttcaggat cccagactat 1021 cccagaacaa gcaggccctg gagggcctgg gagacctaaa gctgctattt gaatacctga 1081 ctttatttgg aattgctgat aagatctcct ttgacctcag cctggctcgg ggcctagact 1141 actatacagg agtgatctat gaagcagtgc tgctgcagac cccaactcag gctggggagg 1201 agcccctgaa tgtgggcagt gtggctgctg gtgggcgcta tgatgggctg gtgggcatgt 1261 ttgaccccaa gggccacaag gtgccatgtg tgggactcag cattggggtt gagcgaatct 1321 tctacattgt ggagcagagg atgaagacca aaggtgagaa ggtgcggact acagagactc 1381 aagtgtttgt ggccacacca cagaagaact ttctccaaga acggttgaag cttattgcag 1441 agctttggga ttctggaatc aaggcagaga tgctatacaa gaacaacccc aaactattaa 1501 cccagctgca ctattgtgag agcacaggca ttccactggt ggtcattatt ggtgagcaag 1561 aactgaaaga aggggtcatc aagatccgtt cagtggccag cagagaggag gtggccatta 1621 aacgggaaaa ttttgtggct gaaattcaga agcgactgtc tgagtcttga tccttgcctg 1681 attcccatct gctgctcttt gtagaaaagg tttcctctag aactgaattc ctctggaatt 1741 gagtgatgga cttcacaaca actagccagg aagggtgaca agtaccttct gcctcctcca 1801 ttcttcctgg gtgcagaact gtagaagacc ctgaggactc ctgggctagt gtgagcagct 1861 attctgcatg ggtgcagatg actggatgtg aaagagacat gctctagctg cagaggcaaa 1921 tttgaagtgc cacggaacgt tgtcaagagg tagtgagatt gttgctgtga gcaaggctct 1981 gggagagtca cctcaggctc aggattctga accattgaga tcagaaacca gatacttagc 2041 cttctactgt agctacggaa ccagattctg gtgaatttgt ccacaatcag ccattggcct 2101 gtctggggag tttttggaag acagcaaaga ggggcaatgg ctgctttcag ctttgaggac 2161 agatgctatc ctaagggcat ggcagtaccc atgttgattt gacatctctc tagcccatcc 2221 attgcttaca gtagaagagt ggggctgtat gcttgataat cttgaacatt gaacttaata 2281 gatgacttga caaggtagag atctgatatt ttgagatttt ggagaccatt tcctgtgcca 2341 gaatcactgc tctattccat accgtgccat ggaggctgtt ttagaagtgg ttggataaca 2401 tgttaaataa aatattaagt gta // LOCUS HSU18945 1635 bp mRNA PRI 08-MAR-1996 DEFINITION Human cyclic nucleotide gated channel gamma subunit (Gar-1) mRNA, complete cds. ACCESSION U18945 NID g1143724 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1635) AUTHORS Ardell,M.D., Makhija,A.K., Oliveira,L., Miniou,P., Viegas-Pequignot,E. and Pittler,S.J. TITLE cDNA, gene structure, and chromosomal localization of human GAR1 (CNCG3L), a homolog of the third subunit of bovine photoreceptor cGMP-gated channel JOURNAL Genomics 28 (1), 32-38 (1995) MEDLINE 96070429 REFERENCE 2 (bases 1 to 1635) AUTHORS Pittler,S.J. TITLE Direct Submission JOURNAL Submitted (20-DEC-1994) Steven J. Pittler, College of Medicine, Biochemistry & Molecular Biology, University of South Alabama, 307 University Blvd., Mobile, AL 36688-0002, USA FEATURES Location/Qualifiers source 1..1635 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="GAR1,GAR2,GAR3 (genomic) hgar17, hgar29 (cDNA)" /clone_lib="retina cDNA lambdagt10,leukocyte genomic lambdaEMBL3-sp6/T7" /map="16q13" /chromosome="16" /sex="male and female" /cell_type="mixed" /tissue_type="retina" /dev_stage="adult" 5'UTR <1..68 /evidence=experimental gene 69..968 /gene="Gar-1" CDS 69..968 /gene="Gar-1" /codon_start=1 /evidence=experimental /product="cyclic nucleotide gated channel gamma subunit" /db_xref="PID:g1143725" /translation="MLGWVQRVLPQPPGTPRKTKMQEEEEVEPEPEMEAEVEPEPNPE EAETESESMPPEESFKEEEVAVADPSPQETKEAALTSTISLRAQGAEISEMNSPSHRV LTWLMKGVEKVIPQPVHSITEDPAQILGHGSTGDTGCTDEPNEALEAQDTRPGLRLLL WLEQNLERVLPQPPKSSEVWRDEPAVATGAASDPAPPGRPQEMGPKLQARETPSLPTP IPLQPKEEPKEAPAPEPQPGSQAQTSSLPPTRDPARLVAWVLHRLEMALPQPVLHGKI GEQEPDSPGICDVQTRVMGAGGL" 3'UTR 969..1635 /evidence=experimental misc_feature 1150..1412 /note="Alu-like repetitive element; approximate location of an Alu-like segment found in numerous human genes and cDNAs" /evidence=experimental polyA_signal 1609..1614 polyA_site 1635 /note="10 A nucleotides" /evidence=experimental BASE COUNT 414 a 477 c 483 g 261 t ORIGIN 1 ccagctacga gtggcagcaa gaagaaggca attcctggct ggcggttggc atctaagcag 61 gcatcaggat gttgggctgg gtccagaggg tgctgcctca gcccccaggg acccctcgga 121 agaccaagat gcaggaggaa gaggaagtgg aaccagagcc agagatggag gcagaggtgg 181 aaccagaacc gaatcctgag gaggccgaga cagagtccga gtccatgccc cccgaagagt 241 cattcaagga ggaggaagtg gctgtggcag acccaagccc tcaggagacc aaggaggctg 301 cccttacttc caccatatcc ctccgggccc agggcgctga gatttctgaa atgaatagtc 361 ccagccacag ggtactgacc tggctcatga agggcgtaga gaaggtgatc ccgcagcctg 421 ttcacagcat cacggaggac ccggctcaga tcctggggca tggcagcact ggggacacag 481 ggtgcacaga tgaacccaat gaggcccttg aggcccaaga cactaggcct gggctgcggc 541 tgcttctgtg gctggagcag aatctggaaa gagtgcttcc tcagcccccc aaatcctctg 601 aggtctggag agatgagcct gcagttgcta caggtgctgc ctcagaccca gcgcctccag 661 gacgccccca ggaaatgggg cccaagctgc aggcccggga gaccccctcc ctgcccacac 721 ccatccccct gcagcccaag gaggaaccca aggaggcacc agctccagag ccccagcccg 781 gctcccaggc ccagacctcc tccctgccac caaccaggga ccctgccagg ctggtggcat 841 gggtcctgca caggctggag atggccttgc cgcagccagt gctacatggg aaaatagggg 901 aacaggagcc tgactcccct gggatatgtg atgtgcagac cagggtgatg ggagctggag 961 gtctctgaaa taaggaagaa agggaatctg ggagagctca gatggtcaca tggatggaag 1021 gaaagaagga tgccctgaag agaagacctt cccgggggag gtggccactg acaccccacc 1081 ctatttgaac agcgaacccc tccctctcca cacactccca ggcaggacaa ggggagccag 1141 agtcacctgc accagcccca gagcctccca gaccagaaca aggggaaggc cagccatggg 1201 cagcctgcaa gatttaggct agacagcagg acttaccccc agagggctgt ggaaaaggcc 1261 catagccggg cgtggtggct cacgcctgta acccctgacc tttgggaagc cagggtcgga 1321 agactctctt gagccccgaa gttcgagacc accgtgggca acacagtgag ccctgtctct 1381 aaaaaacctt ttttaaatta ataaattaaa aagccccatg gatggaggac tcagtattga 1441 gcatctcttt gagaggacgc gtgcacagcc cccagtgtgg tacctggcac cgtcagcacc 1501 tcgacaggat acagtttttc ccagaagagg ttctccctag gcctggccac actctctcct 1561 tccaaggctt ggggacacca atgagcagca acaaatgttt tcattaacat taaaagagtg 1621 taaatgagca catca // LOCUS HSU18985 2947 bp mRNA PRI 13-SEP-1995 DEFINITION Human triadin mRNA, complete cds. ACCESSION U18985 NID g882222 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2947) AUTHORS Taske,N.L., Eyre,H.J., O'Brian,R.O., Sutherland,G.R., Denborough,M.A. and Foster,P.S. TITLE Molecular cloning of the cDNA encoding human skeletal muscle triadin and its localisation to chromosome 6q22-6q23 JOURNAL Eur. J. Biochem. (1995) In press REFERENCE 2 (bases 1 to 2947) AUTHORS Taske,N.L. TITLE Direct Submission JOURNAL Submitted (22-DEC-1994) Nichole L Taske, John Curtin School of Medical Research, The Australian National University, ACTON, Canberra, ACT, 200, Australia FEATURES Location/Qualifiers source 1..2947 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="female" /tissue_type="skeletal muscle" CDS 23..2212 /codon_start=1 /product="triadin" /db_xref="PID:g882223" /translation="MTEITAEGNASTTTTVIDSKNGSVPKSPGKVLKRTVTEDIVTTF SSPAAWLLVIALIITWSAVAIVMFDLVDYKNFSASSIAKIGSDPLKLVRDAMEETTDW IYGFFSLLSDIISSEDEEDDDGDEDTDKGEIDEPPLRKKEIHKDKTEKQEKPERKIQT KVTHKEKEKGKEKVREKEKPEKKATHKEKIEKKEKPETKTVAKEQKKAKTAEKSEEKT KKEVKGGKQEKVKQTAAKVKEVQKTPSKPKEKEDKEKAAVSKHEQKDQYAFCRYMIDI FVHGDLKPGQSPAIPPPLPTEQASRPTPASPALEEKEGEKKKAEKKVTSETKKKEKED IKKKSEKETAIDVEKKEPGKASETKQGTVKIAAQAAAKKDEKKEDSKKTKKPAEVEQP KGKKQEKKEKHVEPAKSPKKEHSVPSDKQVKAKTERAKEEIGAVSSKKAVPGKKEEKT TKTVEQEIRKEKSGKTSSILKDKEPIKGKEEKVPASLKEKEPETKKDEKMSKAGKEVK PKPPQLQGKKEEKPEPQIKKEAKPAISEKVQIHKQDIVKPEKTVSHGKPEEKVLKQVK AVTIEKTAKPKPTKKAEHREREPPSIKTDKPKPTPKGTSEVTESGKKKTEISEKESKE KADMKHLREEKVSTRKESLQLHNVTKAEKPARVSKDVEDVPASKKAKEGTEDVSPTKQ KSPISFFQCVYLDGYNGYGFQFPFTPADRPGESSGQANSPGQKQQGQ" BASE COUNT 1209 a 503 c 611 g 624 t ORIGIN 1 acttttactt ttgacgacca ccatgactga gatcactgct gaaggaaatg catctacaac 61 cacaactgtg atagacagca aaaatggatc tgtgcccaaa tcccccggaa aagtgctgaa 121 gaggacagtc acagaagaca tagtgacgac gttcagctcc cctgcagcct ggcttctggt 181 cattgccctg ataatcacgt ggtcagctgt tgccatcgtt atgtttgatt tagtggatta 241 caaaaacttt tcagcaagct ctattgccaa gattggctca gatcctttaa aactggtacg 301 tgatgctatg gaggaaacca cggactggat ctatggcttc ttttctttgt tatctgacat 361 catctcatct gaagatgaag aagatgatga tggtgacgaa gatactgata aaggagaaat 421 agatgagcct cccttgagaa aaaaagaaat acacaaagat aagactgaaa aacaagagaa 481 acctgaaagg aaaatacaaa ctaaagttac acacaaagaa aaagaaaaag gaaaagaaaa 541 agtaagagaa aaagaaaaac ctgaaaagaa agcaactcac aaggaaaaaa ttgagaaaaa 601 agaaaaacca gaaacaaaga cagtggcgaa agaacagaag aaagctaaga ctgcagaaaa 661 gagtgaagaa aagactaaaa aggaagtgaa aggtggaaaa caggagaaag tgaagcaaac 721 agctgcaaaa gtaaaagaag tacagaaaac accatcaaaa cccaaagaaa aggaggacaa 781 agagaaagca gctgtgtcaa agcatgaaca gaaagatcag tatgcattct gtcgatatat 841 gattgacata tttgtccatg gggatttaaa accaggacaa agcccagcca ttccacctcc 901 cttaccgaca gaacaagctt ccagacccac tccggcatca cctgcccttg aagaaaaaga 961 aggggaaaag aagaaggctg agaagaaagt tacttctgaa acaaaaaaga aagaaaaaga 1021 agatatcaaa aagaaaagtg agaaggaaac tgccattgat gtggaaaaaa aagagccggg 1081 aaaagcttct gaaaccaaac aagggactgt aaaaattgca gcacaagcag cagctaagaa 1141 ggatgaaaag aaggaagatt ccaagaaaac aaaaaaacct gcagaagtag aacaacccaa 1201 gggaaaaaag caggaaaaga aagaaaaaca tgtggaacca gcaaagtcac caaagaaaga 1261 acactcagtt ccaagtgaca aacaagtaaa agcaaaaact gaacgagcca aagaggagat 1321 tggtgcggtt tcaagtaaaa aagctgtacc tggaaagaag gaagagaaaa caaccaagac 1381 agtggagcaa gaaattagaa aagaaaaatc tgggaagact tcttcaattc tgaaagataa 1441 agaacctatt aaagggaaag aagagaaagt tccagcttcc ctaaaggaaa aagaacctga 1501 aactaaaaaa gatgaaaaga tgtccaaagc aggcaaagaa gtcaagccta aacctccaca 1561 actacaagga aaaaaggaag agaagccaga gccccaaatt aaaaaagaag caaaaccagc 1621 tatatctgaa aaagtgcaaa tacacaaaca agacatagtg aaaccagaaa agactgtttc 1681 tcatggtaaa ccagaagaaa aagttctcaa gcaggtaaaa gctgtcacaa tagaaaaaac 1741 agccaagccc aaaccaacaa aaaaagctga acatcgagaa agagaacctc catctataaa 1801 aacagacaaa ccaaaaccaa ctccaaaagg aacatcagaa gtcacagaat caggaaagaa 1861 gaaaactgaa atatctgaaa aagaaagtaa agaaaaagca gacatgaagc atcttagaga 1921 agaaaaagtc tcaacaagaa aagaaagtct tcaattacac aatgtgacaa aagcagaaaa 1981 acctgcaaga gtatcaaaag atgttgaaga tgtaccagct tcaaagaaag ctaaagaagg 2041 aactgaagat gtgtctccca caaagcagaa aagtcccatc agtttcttcc agtgtgtcta 2101 cttggatggg tacaatggct atggatttca gtttcctttc actcctgcag accgccctgg 2161 agagagctct ggtcaagcaa attctccagg acagaagcaa caaggacagt aaacacacat 2221 gtatgaccct tacaagtgct ttaagatttt aaaaatgtga tgttttgtcc acagtagttc 2281 aggcaattaa gaatatgcaa cccagagaat ttctgtgaaa acattttgct ctttggcctg 2341 gtgtggacgg aaagggtggc caaatggatt gagtgatgag cagacatgtt taagggtcta 2401 agtctcaaga atctgttatg tgtgtttgct gcggtgggag ggggtgcttg tatttatctt 2461 atttccagtc actataaggt tgtacacaaa ctaatttaaa gtttacttaa taatggtatc 2521 tttaaaataa ttgacacaat tgcaaaatga attcctggct tcagttagct attatttttt 2581 taatgacaac atagactgtg ctctaagttt aaaagatggg gaagcttata taaaagtgac 2641 ccttttgcat catatgggta tctaaactta atttacccaa taagttgatg cttaatgatt 2701 ttattttatt tttgtctatt ttctatttta gttgtggctt tgctctaaga atgggtaata 2761 gttgtactac agactgctat aaatttcttg tgatactctt ttagagctca aaatatctct 2821 gagctttaga catggtaagg tggagagtaa atgcttgata aatctttaag atatgtcttg 2881 aatgataatt aggacattca gtccagtgga aatacaccat tcaattagtc aggtctggtg 2941 aatttcc // LOCUS HSU18991 2672 bp mRNA PRI 08-MAY-1996 DEFINITION Human retinal pigment epithelium-specific 61 kDa protein (RPE65) mRNA, complete cds. ACCESSION U18991 NID g675457 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2672) AUTHORS Nicoletti,A., Wong,D.J., Kawase,K., Gibson,L.H., Yang-Feng,T.L., Richards,J.E. and Thompson,D.A. TITLE Molecular characterization of the human gene encoding an abundant 61 kDa protein specific to the retinal pigment epithelium JOURNAL Hum. Mol. Genet. 4 (4), 641-649 (1995) MEDLINE 95359969 REFERENCE 2 (bases 1 to 2672) AUTHORS Thompson,D.A. TITLE Direct Submission JOURNAL Submitted (21-DEC-1994) Debra A. Thompson, Department of Ophthalmology, University of Michigan, 1000 Wall St., Ann Arbor, MI 48105, USA FEATURES Location/Qualifiers source 1..2672 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hE61.g1, hE61.g2" /tissue_type="retinal pigment epithelium" /map="1p31" /chromosome="1" 5'UTR <1..54 gene 55..1656 /gene="RPE65" CDS 55..1656 /gene="RPE65" /codon_start=1 /product="retinal pigment epithelium-specific 61 kDa protein" /db_xref="PID:g675458" /translation="MSIQVEHPAGGYKKLFETVEELSSPLTAHVTGRIPLWLTGSLLR CGPGLFEVGSEPFYHLFDGQALLHKFDFKEGHVTYHRRFIRTDAYVRAMTEKRIVITE FGTCAFPDPCKNIFSRFFSYFRGVEVTDNALVNVYPVGEDYYACTETNFITKINPETL ETIKQVDLCNYVSVNGATAHPHIENDGTVYNIGNCFGKNFSIAYNIVKIPPLQADKED PISKSEIVVQFPCSDRFKPSYVHSFGLTPNYIVFVETPVKINLFKFLSSWSLWGANYM DCFESNETMGVWLHIADKKRKKYLNNKYRTSPFNLFHHINTYEDNGFLIVDLCCWKGF EFVYNYLYLANLRENWEEVKKNARKAPQPEVRRYVLPLNIDKADTGKNLVTLPNTTAT AILCSDETIWLEPEVLFSGPRQAFEFPQINYQKYCGKPYTYAYGLGLNHFVPDRLCKL NVKTKETWVWQEPDSYPSEPIFVSHPDALEEDDGVVLSVVVSPGAGQKPAYLLILNAK DLSEVARAEVEINIPVTFHGLFKKS" 3'UTR 1657..>2672 variation replace(2382..2387,"cagc") /note="the sequence is CAGAGC in 14/44 chromosomes scored and is CAGC in 30/44 chromosomes scored" /frequency="0.36" BASE COUNT 829 a 510 c 499 g 834 t ORIGIN 1 tccttcttca ttctgcagtt ggtgccagaa ctctggatcc tgaactggaa gaaaatgtct 61 atccaggttg agcatcctgc tggtggttac aagaaactgt ttgaaactgt ggaggaactg 121 tcctcgccgc tcacagctca tgtaacaggc aggatccccc tctggctcac cggcagtctc 181 cttcgatgtg ggccaggact ctttgaagtt ggatctgagc cattttacca cctgtttgat 241 gggcaagccc tcctgcacaa gtttgacttt aaagaaggac atgtcacata ccacagaagg 301 ttcatccgca ctgatgctta cgtacgggca atgactgaga aaaggatcgt cataacagaa 361 tttggcacct gtgctttccc agatccctgc aagaatatat tttccaggtt tttttcttac 421 tttcgaggag tagaggttac tgacaatgcc cttgttaatg tctacccagt gggggaagat 481 tactacgctt gcacagagac caactttatt acaaagatta atccagagac cttggagaca 541 attaagcagg ttgatctttg caactatgtc tctgtcaatg gggccactgc tcacccccac 601 attgaaaatg atggaaccgt ttacaatatt ggtaattgct ttggaaaaaa tttttcaatt 661 gcctacaaca ttgtaaagat cccaccactg caagcagaca aggaagatcc aataagcaag 721 tcagagatcg ttgtacaatt cccctgcagt gaccgattca agccatctta cgttcatagt 781 tttggtctga ctcccaacta tatcgttttt gtggagacac cagtcaaaat taacctgttc 841 aagttccttt cttcatggag tctttgggga gccaactaca tggattgttt tgagtccaat 901 gaaaccatgg gggtttggct tcatattgct gacaaaaaaa ggaaaaagta cctcaataat 961 aaatacagaa cttctccttt caacctcttc catcacatca acacctatga agacaatggg 1021 tttctgattg tggatctctg ctgctggaaa ggatttgagt ttgtttataa ttacttatat 1081 ttagccaatt tacgtgagaa ctgggaagag gtgaaaaaaa atgccagaaa ggctccccaa 1141 cctgaagtta ggagatatgt acttcctttg aatattgaca aggctgacac aggcaagaat 1201 ttagtcacgc tccccaatac aactgccact gcaattctgt gcagtgacga gactatctgg 1261 ctggagcctg aagttctctt ttcagggcct cgtcaagcat ttgagtttcc tcaaatcaat 1321 taccagaagt attgtgggaa accttacaca tatgcgtatg gacttggctt gaatcacttt 1381 gttccagata ggctctgtaa gctgaatgtc aaaactaaag aaacttgggt ttggcaagag 1441 cctgattcat acccatcaga acccatcttt gtttctcacc cagatgcctt ggaagaagat 1501 gatggtgtag ttctgagtgt ggtggtgagc ccaggagcag gacaaaagcc tgcttatctc 1561 ctgattctga atgccaagga cttaagtgaa gttgcccggg ctgaagtgga gattaacatc 1621 cctgtcacct ttcatggact gttcaaaaaa tcttgagcat actccagcaa gatatgtttt 1681 tggtagcaaa actgagaaaa tcagcttcag gtctgcaatc aaattctgtt caattttagc 1741 ctgctatatg tcatggtttt aacttgcaga tgcgcacaat tttgcaatgt tttacagaaa 1801 gcactgagtt gagcaagcaa ttcctttatt taaaaaaaaa agtacgtatt tagataatca 1861 tacttcctct gtgagacagg ccataactga aaaactctta aatatttagc aatcaaatag 1921 gaaatgaatg tggacttact aaatggcttt taattcctat tataagagca tattttaggt 1981 acctatctgc tccaattata tttttaacat ttaaaaacca aagtcctcta cacttgattt 2041 atattatatg tggctttgct gagtcaagga agtatcatgc aataaggctt aattactaaa 2101 tgtcaaacca aactttttct caaaccaggg actatcatct aagattaatt acagtaatta 2161 ttttgcgtat acgtaactgc tcaaagatta tgaatcttat gaatgttaac ctttccgttt 2221 attacaagca agtactatta tttctgattt tataataaga aaatctatgt ttaatcaact 2281 gaggcctctc aaccaaataa catctcagag attaagttat atattaaaag cttatgtaac 2341 ataaaagcaa gtacatatag tagtgactat atttaaaaaa acagagcata aaatgcttaa 2401 aaatgtaata tttactaaaa tcagattatg ggataatgtt gcaggattat actttattgc 2461 atcttttttg tttaattgta tttaagcatt gtgcaatcac ttgggaaaaa tattaaatta 2521 ttaacattga ggtattaata cattttaagc cttttgtttt taaatttctt ttgttccaga 2581 gattgtttaa aaataaatat tgacaaaaat aatgttttat atcttaattc tagtatctgt 2641 tttatgcttg aaagcattac agatcatgat ac // LOCUS HSU19107 3965 bp DNA PRI 02-OCT-1995 DEFINITION Human ZNF127 (ZNF127) gene, complete cds. ACCESSION U19107 NID g1001958 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3965) AUTHORS Jong,M.T., Carey,A.H., Glenn,C.C., Saitoh,S., Stewart,C.L., Rinchik,E.M., Driscoll,D.J. and Nicholls,R.D. TITLE A novel imprinted zinc-finger gene and overlapping antisense transcript identify multiple candidate genes for Prader-Willi syndrome and a mouse genetic model JOURNAL Unpublished REFERENCE 2 (bases 1 to 3965) AUTHORS Jong,M.T. TITLE Direct Submission JOURNAL Submitted (25-DEC-1994) Michelle T. Jong, Division of Pediatric Genetics, College of Medicine, University of Florida, 1600 S.W. Archer Road, Gainesville, FL 32610-0296, USA FEATURES Location/Qualifiers source 1..3965 /organism="Homo sapiens" /db_xref="taxon:9606" /map="15q11-q13" /chromosome="15" promoter 1..968 5'UTR 969..1077 mRNA 969..>3770 gene 1078..2601 /gene="ZNF127" CDS 1078..2601 /gene="ZNF127" /codon_start=1 /product="ZNF127" /db_xref="PID:g1001959" /translation="MEEPAAPSEAHEAAGAQAGAEAAREGVSGPDLPVCEPSGESAAP DSALPHAARGWAPFPVAPVPAHLRRGGLRPAPASGGGAWPSPLPSRSSGIWTKQIICR YYIHGQCKEGENCRYSHDLSGRKMATEGGVSPPGASAGGGPSTAAHIEPPTQEVAEAP PAASSLSLPVIGSAAERGFFEAERDNADRGAAGGAGVESWADAIEFVPGQPYRGRWVA SAPEAPLQSSETERKQMAVGSGLRFCYYASRGVCFRGESCMYLHGDICDMCGLQTLHP MDAAQREEHMRACIEAHEKDMELSFAVQRGMDKVCGICMEVVYEKANPNDRRFGILSN CNHSFCIRCIRRWRSARQFENRIVKSCPQCRVTSELVIPSEFWVEEEEEKQKLIQQYK EAMSNKACRYFAEGRGNCPFGDTCFYKHEYPEGWGDEPPGPGGGSFSAYWHQLVEPVR MGEGNMLYKSIKKELVVLRLASLLFKRFLSLRDELPFSEDQWDLLHYELEEYFNLIL" polyA_signal 3765..3770 BASE COUNT 1168 a 800 c 1007 g 990 t ORIGIN 1 gaattcagta tgcagtcatg atacgtactc tcagcaaagt aggaatacaa aaaaatgatt 61 aatctactac atttatctgc aaaaatgaca aaacgctcat acatggaaat taaagaccat 121 tcctttgtgg ataatatggc acaggatatg ttatgaaagg ctgtcctgcc acaacacagc 181 acagatgggg ttgaaagaaa ggaagggaaa caaagggctg ccatgaatta aaccagtaga 241 gaaaataaaa gcctgggtca tgcaggggac agtgtcttat taggccagca tcaggactgg 301 ttgaagtctg gcgcttctaa cacagtggga aaatgagcat agtaatcaca ataaagaaag 361 acatctgaaa ggtgctaact tagcctcgga tcagaagtgc tctctgctaa tgccttgctg 421 gtgaaaagcg tatccagtgt aagccctcag aaatcccaga aaacaaagac caaaagaaac 481 ggcgccttgt caaccgaacg aattgaaaaa agcttcctgc cgcgatcggg cattaaaaga 541 aaaaaaccac agacataaga tggagtgaat aaaaaaagaa tttatataat tcatggatga 601 aatgtttcag aatctatagg attatatttt tttttaaggc agacagatac gaaaatacaa 661 cgagcgtgca tgaccgaaac cagaagagat taaagtaaaa cctcattctc ctgaggaaat 721 cgtgtgagaa gggacttagg gactgccgga acacagcgaa cgagaggcag aaggcagaca 781 aaaggggctg ggttggtccc cgcctgtgtg aacgaaaaaa tatgtcagat tggaaaattg 841 cggtaaaaac caggcagagc acgtacgttg cccccacagg aagtgtccgc catgctgcct 901 gtgcccggaa gtggtaggaa cacacagtca gagggaccca aaagcagggg ggaaggaaaa 961 agagatgcac acttccccca gagaagcctc cgagcgcggc cgccattccg ggcctcaagc 1021 ccataaagaa aaaataccgg agaggttctg gcaccatttc ggggtgccaa agcagccatg 1081 gaagagcctg cagctccctc agaagcccac gaggcagccg gggcccaggc aggtgctgag 1141 gcagcaaggg agggtgtgtc tgggccggac cttcccgtct gtgagccctc cggggaatct 1201 gctgctccag attcagccct gccacatgcg gcaaggggct gggccccctt ccctgtagct 1261 ccagtccctg cccacctccg cagaggaggc ctgaggcctg ccccagcctc aggaggagga 1321 gcctggccca gtccgttgcc aagccgaagc agcggcattt ggacaaagca gatcatctgc 1381 aggtattata tacatgggca gtgcaaggag ggggagaact gtcgctattc gcacgacctt 1441 tctggtcgga agatggccac tgagggtggc gtttcgccgc ctggggcctc tgcaggtgga 1501 ggccctagca cggctgcgca catcgagccc ccgactcagg aagtggcgga agcccccccg 1561 gctgcatcct ccctttcctt gcctgtgatt ggctcggctg ctgaaagggg tttctttgaa 1621 gccgagagag acaatgcaga ccgtggagct gctggaggag caggtgtaga aagctgggcg 1681 gatgccattg agtttgttcc agggcagccc taccggggcc gctgggttgc atctgccccc 1741 gaggctcctc tacagagctc agagactgag aggaagcaga tggctgtggg cagtgggttg 1801 cggttttgct attatgcttc caggggagtt tgctttcgtg gggagagctg tatgtacctc 1861 catggagaca tatgcgacat gtgtgggctg cagaccttgc accccatgga tgctgcccag 1921 agggaagaac atatgagggc ctgcattgaa gcacacgaga aagatatgga actctcgttt 1981 gctgtgcagc gtggtatgga caaggtgtgt ggcatctgca tggaggttgt ctatgagaag 2041 gccaacccca atgaccgccg ctttggcatt ctttccaatt gcaaccattc cttctgtatt 2101 aggtgtatcc gcaggtggag aagtgccaga cagtttgaga acaggatcgt caagtcttgc 2161 ccacagtgca gggtcacctc tgaattggtc attcccagtg agttctgggt ggaggaggag 2221 gaagagaagc agaaacttat tcagcaatac aaggaggcaa tgagcaacaa ggcctgcagg 2281 tattttgcgg aaggcagggg taactgccca tttggagaca catgctttta caagcatgaa 2341 taccctgagg gctggggaga tgagcctcct gggccaggtg gtgggtcatt cagcgcatac 2401 tggcatcaac ttgtggagcc tgtgcgaatg ggagagggca acatgctcta taaaagcatt 2461 aagaaggagc ttgtcgtgct tcggctggcc agtctgttgt ttaagcggtt tctttcactg 2521 agagatgagt tacccttctc tgaggaccag tgggacttgc ttcattatga gctggaagaa 2581 tatttcaatt tgattctgta gcatcgtgct gtggcatgtg gtctagtctg ctgaggttct 2641 gtcgtctgct attgcctgtt ttccctgtgt tgacactctt actgctttca ggggctgttg 2701 aggcagtgct tctgttttct tgtctattct gcatatcttt ccccctagga ttatggtgat 2761 tatctgtgtt aaaaaataag tccttaaagt tactgttttg gtgaaattaa tattaatgtc 2821 agcttatggc ttttttttgt catctctgtt gtcaacagga ttaactcagt tctagtgtag 2881 tgtttactga atttccacac ttattttgaa gaccctcaag agtaaatgtg gcagagtgaa 2941 aggagaagtt ttaattgaac tagtagcttt gtgctataat agccttaaca aatggaccct 3001 tgcagggctt tgcagctgct catctgtttg tttacagttt gttctttccc tccttcccct 3061 tcaagtgcac ttgttaaact gtgatgaact tgtgattttg tgttttatct gaccaaaacc 3121 aagtgtatat gtttacatgt ttttatcctg tttagcttga catgaaataa tttatatttg 3181 gaaatatata tttaagaatt atatatataa aaatatatat ggtataagga ggttatggta 3241 tttgaaaaaa atatataaaa gaatatacat cacaatataa tatttatgtt tatgtaataa 3301 agtaaataca gagctgaaag ctgaaggtca aagcctaaca ggactggctg ttgtgtggat 3361 gtgagttgtg tgaataatct ttctgtccct cgcacagaag ccagtaatta gcatctaatg 3421 aaaaggactg ttcaagtggg tctggccaaa tgtgacagat gcagatctta gaggacttac 3481 aaagcactat attggtaatt cttacaatgg cattagtagc ttactctata aatacagaga 3541 tggttttcct atgcagttta gccaccttct cattaattct ttgtaacagc aaatcctagg 3601 ctcagaggca cagtgctttg tatttgatat acaaagtctc tagactttcc caacaagggg 3661 cttttgacaa aagagttcaa cataaaagta acaagattat aaaagggaaa atagaacaaa 3721 aaaatattaa aaccaatgag aataaaagaa tggaaagata aaataataaa atatagaaag 3781 cactaaaaac taagtgtaag agtaatatag aaaatttaca tacaacttaa ttagattaaa 3841 tgaaagcagt ttaaagactg aaagtaaagg ctataagact tcattacaat cagaatgaaa 3901 catcctgttt aaaatacaca caacatgaaa atacagaaac actcaaagta gtaggatggg 3961 aaaag // LOCUS HSU19142 646 bp mRNA PRI 04-DEC-1995 DEFINITION Human GAGE-1 protein mRNA, complete cds. ACCESSION U19142 NID g914898 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 646) AUTHORS Van den Eynde,B., Peeters,O., De Backer,O., Gaugler,B., Lucas,S. and Boon,T. TITLE A new family of genes coding for an antigen recognized by autologous cytolytic T lymphocytes on a human melanoma JOURNAL J. Exp. Med. 182 (3), 689-698 (1995) MEDLINE 95378788 REFERENCE 2 (bases 1 to 646) AUTHORS Van Den Eynde,B.J. TITLE Direct Submission JOURNAL Submitted (28-DEC-1994) Benoit J. Van Den Eynde, Ludwig Institute For Cancer Research, 74 Avenue Hippocrate, Brussels B-1200, Belgium FEATURES Location/Qualifiers source 1..646 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="female" /cell_line="MZ2-MEL.43" /tissue_type="melanoma" /dev_stage="adult" CDS 49..465 /codon_start=1 /product="GAGE-1 protein" /db_xref="PID:g914899" /translation="MSWRGRSTYRPRPRRYVEPPEMIGPMRPEQFSDEVEPATPEEGE PATQRQDPAAAQEGEDEGASAGQGPKPEADSQEQGHPQTGCECEDGPDGQEMDPPNPE EVKTPEEEMRSHYVAQTGILWLLMNNCFLNLSPRKP" BASE COUNT 188 a 140 c 174 g 144 t ORIGIN 1 ctgccgtccg gactcttttt cctctactga gattcatctg tgtgaaatat gagttggcga 61 ggaagatcga cctatcggcc tagaccaaga cgctacgtag agcctcctga aatgattggg 121 cctatgcggc ccgagcagtt cagtgatgaa gtggaaccag caacacctga agaaggggaa 181 ccagcaactc aacgtcagga tcctgcagct gctcaggagg gagaggatga gggagcatct 241 gcaggtcaag ggccgaagcc tgaagctgat agccaggaac agggtcaccc acagactggg 301 tgtgagtgtg aagatggtcc tgatgggcag gagatggacc cgccaaatcc agaggaggtg 361 aaaacgcctg aagaagagat gaggtctcac tatgttgccc agactgggat tctctggctt 421 ttaatgaaca attgcttctt aaatctttcc ccacggaaac cttgagtgac tgaaatatca 481 aatggcgaga gaccgtttag ttcctatcat ctgtggcatg tgaagggcaa tcacagtgtt 541 aaaagaagac atgctgaaat gttgcaggct gctcctatgt tggaaaattc ttcattgaag 601 ttctcccaat aaagctttac agccttctgc aaagaaaaaa aaaaaa // LOCUS HSU19178 1654 bp mRNA PRI 23-MAR-1995 DEFINITION Human (Hin-3)/HIV1 promoter region chimeric mRNA, complete cds. ACCESSION U19178 NID g726035 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1654) AUTHORS Raineri,I., Soler,M. and Senn,H. TITLE Analysis of Human Immunodeficiency Virus type 1 promoter insertion in vivo JOURNAL Unpublished REFERENCE 2 (bases 1 to 1654) AUTHORS Senn,H. TITLE Direct Submission JOURNAL Submitted (28-DEC-1994) Hans-Peter Senn, Institut fuer Medizinische Mikrobiologie, Universitaet Basel, Petersplatz 10, Basel, CH-4003, Switzerland FEATURES Location/Qualifiers source 1..1654 /organism="Homo sapiens" /note="chimeric HIV-1/human gene transcript" /db_xref="taxon:9606" /clone="cl3/Hin3" source 1..147 /organism="Human immunodeficiency virus type 1" /proviral source 148..1654 /organism="Homo sapiens" gene 357..1190 /gene="Hin-3" CDS 357..1190 /gene="Hin-3" /codon_start=1 /db_xref="PID:g726036" /translation="MFEYQTLLEEPQYGENMEIYAGKKNNWTGEFSARFLLKLPVDFS NIPTYLLKDVNEDPGEDVALLSVSFEDTEATQVYPKLYLSPRIEHALGGSSALHIPAF PGGGCLIDYVPQVCHLLTNKVQYVIQGYHKRREYIAAFLSHFGTGVVEYDAEGFTKLT LLLMWKDFCFLVHIDLPLFFPRDQPTLTFQSVYHFTNSGQLYSQAQKNYPYSPRWDGN EMAERAKGCQGSRDACSPWEQVLAFAVAKTGCKLLQPQRNWPSSRGPPWRASEGERTA Q" polyA_site 1654 /evidence=experimental BASE COUNT 417 a 432 c 388 g 417 t ORIGIN 1 cctcagaccc tttaagtcag tgtggaaaat ctctagcagt ggcgcccgaa cagggacttg 61 aaagtgaaag agaagccaga ggagctctct cgacgcagga ctcggcttgc tgaagcgcgc 121 agagcaagag gcgaggggcg gcgactgggg atatcatttt caatgcccaa tacccagaac 181 tgcctcccga ttttatcttt ggagaagatg ctgaattcct gccagacccc tcagctttgc 241 agaatcttgc ctcctggaat ccttcaaatc ctgaatgtct cttacttgtg gtgaaggaac 301 ttgtgcaaca atatcaccaa ttccaatgta gccgcctcca ggagagctcc cgcctcatgt 361 ttgaatacca gacattactg gaggagccac agtatggaga gaacatggaa atttatgctg 421 ggaaaaaaaa caactggact ggtgaatttt cagctcgttt ccttttgaag ctgcccgtag 481 atttcagcaa tatccccaca taccttctca aggatgtaaa tgaagaccct ggagaagatg 541 tggccctcct ctctgttagt tttgaggaca ctgaagccac ccaggtgtac cccaagctgt 601 acttgtcacc tcgaattgag catgcacttg gaggctcctc agctcttcat atcccagctt 661 ttccaggagg aggatgtctc attgattacg ttcctcaagt atgccacctg ctcaccaaca 721 aggtgcagta cgtgattcaa gggtatcaca aaagaagaga gtatattgct gcttttctca 781 gtcactttgg cacaggtgtc gtggaatatg atgcagaagg ctttacaaaa ctcactctgc 841 tgctgatgtg gaaagatttt tgttttcttg tacacattga cctgcctctg tttttccctc 901 gagaccagcc aactctcaca tttcagtccg tttatcactt taccaacagt ggacagcttt 961 actcccaggc ccaaaaaaat tatccgtaca gccccagatg ggatggaaat gaaatggccg 1021 aaagagcaaa gggatgccaa gggagcagag atgcctgcag cccgtgggag caagtcctgg 1081 cctttgcagt tgcaaaaact ggctgcaagc tgctccagcc ccagaggaac tggccaagct 1141 ccagagggcc tccttggagg gcctcagagg gagagagaac tgctcagtaa ttttgatcac 1201 tttgggctta tttcaaaacc tttgtccctc agttccagga ggcagcattt gccaatggaa 1261 agctctagga aacaccagtc ttgagaggtg gccagccaga ctgcctgtcc acatgcgtgt 1321 cagcacatac agccgcttcc tggaagccgc ctggaatgtc ttcacggcag cgttttgctc 1381 acacagcagc ttttgcacgc cccaggcagc cccgactgct gaaatccaac ttgagctggc 1441 tggtggtccc tggatcctag agcccttcac ttcgggttac tccctctttc ttgcctctat 1501 ttcttagttg gaagaaataa actcacaaat tatggtgcag taattttccg gggaaagtaa 1561 agcctcagga atgcccacgc ctttcttcca aagcctttgt ctctgagacc tcttaagttc 1621 taagattaaa tgcccctcgc tgttcttcct ctga // LOCUS HSU19179 1916 bp mRNA PRI 23-MAR-1995 DEFINITION Human (Hin-2) mRNA, complete cds. ACCESSION U19179 NID g726037 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1916) AUTHORS Raineri,I., Soler,M. and Senn,H. TITLE Analysis of Human Immunodeficiency Virus type 1 promoter insertion in vivo JOURNAL Unpublished REFERENCE 2 (bases 1 to 1916) AUTHORS Senn,H. TITLE Direct Submission JOURNAL Submitted (28-DEC-1994) Hans-Peter Senn, Institut fuer Medizinische Mikrobiologie, Universitaet Basel, Petersplatz 10, Basel, CH-4003, Switzerland FEATURES Location/Qualifiers source 1..1916 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="ATCC TERA-1" gene 360..1610 /gene="Hin-2" CDS 360..1610 /gene="Hin-2" /codon_start=1 /db_xref="PID:g726038" /translation="MEERPNLYSQPYSSPSPTANLPSPFQGMVRQKPSLGTMPVQVTP PRGAFSPGMGMQPRQTLNRPPAAPNQLRLQLQQRLQGQQQLIHQNRQAILNQFAATAP VGINMRSGMQQQITPQPPLNAQMLAQRQRELYSQQHRQRQLIQQQRAMLMRQQSFGNN LPPSSGLPVQMGNPRLPQGAPQQFPYPPNYGTNPGTPPASTSPFSQLAANPEASLANR NSMVSRGMTGNIGGQFGTGINPQMQQNVFQYPGAGMVPQGEANFAPSLSPGSSMVPMP IPPPQSSLLQQTPPASGYQSPDMKAWQQGAIGNNNVFSQAVQNQPTPAQPGVYNNMSI TVSMAGGNTNVQNMNPMMAQMQMSSLQMPGMNTVCPEQINDPALRHTGLYCNHLSSTD LLKTEADGTQDKKTEEFFSVVTTD" polyA_site 1916 /evidence=experimental BASE COUNT 572 a 517 c 418 g 409 t ORIGIN 1 ggctaaatag attacctgag ctggaattgg aagcaattga taaccaattt ggacaaccag 61 gaacaggcga tcagattcca tggacaaata atacagtgac ggctataaat cagagtaaat 121 cagaagacca gtgtattagc tcacaattag atgagcttct ctgtccaccc acaacagtag 181 aagggagaaa tgatgagaag gctcttcttg aacagctggt atccttcctt agtggcaaag 241 atgaaactga gctagctgaa ctagacagag ctctgggaat tgacaaactt gttcaggggg 301 gtggattaga tgtattatca gagagatttc caccacaaca agcaacgcca cctttgatca 361 tggaagaaag acccaacctt tattcccagc cttactcttc tccttctcct actgccaatc 421 tccctagccc tttccaaggc atggtcaggc aaaaaccttc actggggacg atgcctgttc 481 aagtaacgcc tccccgaggt gctttttcac ctggcatggg catgcagccc aggcaaactc 541 taaacagacc tccggctgca cctaaccagc ttcgacttca actacagcag cgattacagg 601 gacaacagca gttgatacac caaaatcggc aagctatctt aaaccagttt gcagcaactg 661 ctcctgttgg catcaatatg agatcaggca tgcaacagca aattacacct cagccacccc 721 tgaatgctca aatgttggca caacgtcagc gggaactgta cagtcaacag caccgacaga 781 ggcagctaat acagcagcaa agagccatgc ttatgaggca gcaaagcttt gggaacaacc 841 tccctccctc atctggacta ccagttcaaa tggggaaccc ccgtcttcct cagggtgctc 901 cacagcaatt cccctatcca ccaaactatg gtacaaatcc aggaacccca cctgcttcta 961 ccagcccgtt ttcacaacta gcagcaaatc ctgaagcatc cttggccaac cgcaacagca 1021 tggtgagcag aggcatgaca ggaaacatag gaggacagtt tggcactgga atcaatcctc 1081 agatgcagca gaatgtcttc cagtatccag gagcaggaat ggttccccaa ggtgaggcca 1141 actttgctcc atctctaagc cctgggagct ccatggtgcc gatgccaatc cctcctcctc 1201 agagttctct gctccagcaa actccacctg cctccgggta tcagtcacca gacatgaagg 1261 cctggcagca aggagcgata ggaaacaaca atgtgttcag tcaagctgtc cagaaccagc 1321 ccacgcctgc acagccagga gtatacaaca acatgagcat caccgtttcc atggcaggtg 1381 gaaatacgaa tgttcagaac atgaacccaa tgatggccca gatgcagatg agctctttgc 1441 agatgccagg aatgaacact gtgtgccctg agcagataaa tgatcccgca ctgagacaca 1501 caggcctcta ctgcaaccac ctctcatcca ctgaccttct caaaacagaa gcagatggaa 1561 cccaggacaa gaagacagaa gagttcttct ctgtggtgac tacagactag aggaatgctc 1621 tagtgcaaca ggttcaggtg tttgctgacg tccagtgtac agtgaatctg gtaggcgggg 1681 acccttacct gaaccagcct ggtccactgg gaactcaaaa gcccacgtca ggaccacaga 1741 ccccccaggc ccagcagaag agcctccttc agcagctact gactgaataa ccacttttaa 1801 aggaatgtga aatttaaata atagacatac agagatatac aaatatatta tatatttttc 1861 tgagattttt gatatctcaa tctgcagcca ttcttcaggt cgtagcattt ggagca // LOCUS HSU19180 1004 bp mRNA PRI 13-APR-1995 DEFINITION Human B melanoma antigen (BAGE) mRNA, complete cds. ACCESSION U19180 NID g726039 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1004) AUTHORS Boel,P., Wildmann,C., Sensi,M.L., Brasseur,R., Renauld,J.C., Coulie,P., Boon,T. and van der Bruggen,P. TITLE BAGE: a new gene encoding an antigen recognized on human melanomas by cytolytic T lymphocytes JOURNAL Immunity 2 (2), 167-175 (1995) MEDLINE 95202592 REFERENCE 2 (bases 1 to 1004) AUTHORS Boel,P. TITLE Direct Submission JOURNAL Submitted (28-DEC-1994) Pascale Boel, Ludwig Institute for Cancer Research, Avenue Hippocrate 74, Brussels, B-1200, Belgium FEATURES Location/Qualifiers source 1..1004 /organism="Homo sapiens" /isolate="patient MZ2" /db_xref="taxon:9606" /sex="female" /cell_line="MZ2-MEL.43" /tissue_type="melanoma" /dev_stage="adult" gene 201..332 /gene="BAGE" CDS 201..332 /gene="BAGE" /standard_name="B melanoma antigen" /codon_start=1 /product="B melanoma antigen" /db_xref="PID:g726040" /translation="MAARAVFLALSAQLLQARLMKEESPVVSWRLEPEDGTALCFIF" repeat_region 384..485 /rpt_family="Alu" BASE COUNT 273 a 204 c 255 g 272 t ORIGIN 1 cgccaattta gggtctccgg tatctcccgc tgagctgctc tgttcccggc ttagaggacc 61 aggagaaggg ggagctggag gctggagcct gtaacaccgt ggctcgtctc actctggatg 121 gtggtggcaa cagagatggc agcgcagctg gagtgttagg agggcggcct gagcggtagg 181 agtggggctg gagcagtaag atggcggcca gagcggtttt tctggcattg tctgcccagc 241 tgctccaagc caggctgatg aaggaggagt cccctgtggt gagctggagg ttggagcctg 301 aagacggcac agctctgtgc ttcatcttct gaggttgtgg cagccacggt gatggagacg 361 gcagctcaac aggagcaata ggaggagatg gagtttcact gtgtcagcca ggatggtctc 421 gatctcctga cctcgtgatc cgcccgcctt ggccttccaa agtgccgaga ttacagcgat 481 gtgcattttg taagcacttt ggagccacta tcaaatgctg tgaagagaaa tgtacccaga 541 tgtatcatta tccttgtgct gcaggagccg gctcctttca ggatttcagt cacatcttcc 601 tgctttgtcc agaacacatt gaccaagctc ctgaaagatg taagtttact acgcatagac 661 ttttaaactt caaccaatgt atttactgaa aataacaaat gttgtaaatt ccctgagtgt 721 tattctactt gtattaaaag gtaataatac ataatcatta aaatctgagg gatcattgcc 781 agagattgtt ggggagggaa atgttatcaa cggtttcatt gaaattaaat ccaaaaagtt 841 atttcctcag aaaaatcaaa taaagtttgc atgtttttta ttcttaaaac attttaaaaa 901 ccactgtaga atgatgtaaa tagggactgt gcagtatttc tgacatatac tataaaatta 961 ttaaaaagtc aatcagtatt caacatcttt tacactaaaa agcc // LOCUS HSU19251 6124 bp mRNA PRI 24-NOV-1997 DEFINITION Homo sapiens neuronal apoptosis inhibitory protein mRNA, complete cds. ACCESSION U19251 NID g2642132 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6124) AUTHORS Roy,N., Mahadevan,M.S., McLean,M., Shutler,G., Yaraghi,Z., Farahini,R., Tamai,K., Ioannou,P., de Jong,P.J., Ikeda,J., Korneluk,R.G. and MacKenzie,A. TITLE The gene for neuronal apoptosis inhibitory protein is partially deleted in individuals with spinal muscular atrophy JOURNAL Cell 80 (1), 167-178 (1995) MEDLINE 95112344 REFERENCE 2 (bases 1 to 6124) AUTHORS Chen,Q., Baird,S.D., Mahadevan,M., Besner-Johnston,A., Farahani,R., Xuan,J.-Y., Kang,X., Lefebvre,C., Korneluk,R.G. and MacKenzie,A.E. TITLE Sequence of a 131 kb region of 5q13.1 containing the spinal muscular atrophy candidate genes SMN and NAIP JOURNAL Genomics (1997) In press REFERENCE 3 (bases 1 to 6124) AUTHORS Baird,S.D. TITLE Direct Submission JOURNAL Submitted (29-DEC-1994) Stephen D. Baird, Children's Hospital of Eastern Ontario, Molecular Genetics, 401 Smyth Rd., Ottawa, Ontario, K1H 8L1, Canada REFERENCE 4 (bases 1 to 6124) AUTHORS Baird,S.D. TITLE Direct Submission JOURNAL Submitted (24-NOV-1997) Stephen D. Baird, Children's Hospital of Eastern Ontario, Molecular Genetics, 401 Smyth Rd., Ottawa, Ontario, K1H 8L1, Canada FEATURES Location/Qualifiers source 1..6124 /organism="Homo sapiens" /db_xref="taxon:9606" /map="5q13.1" /chromosome="5" /tissue_type="brain" /dev_stage="fetus" exon 1..73 /number=1 exon 74..162 /number=2 exon 163..288 /number=3 exon 289..859 /number=4 CDS 292..4503 /function="inhibitor of apoptosis" /note="deleted in people with spinal muscular atrophy SMA; NAIP" /codon_start=1 /evidence=experimental /product="neuronal apoptosis inhibitory protein" /db_xref="PID:g2642133" /translation="MATQQKASDERISQFDHNLLPELSALLGLDAVQLAKELEEEEQK ERAKMQKGYNSQMRSEAKRLKTFVTYEPYSSWIPQEMAAAGFYFTGVKSGIQCFCCSL ILFGAGLTRLPIEDHKRFHPDCGFLLNKDVGNIAKYDIRVKNLKSRLRGGKMRYQEEE ARLASFRNWPFYVQGISPCVLSEAGFVFTGKQDTVQCFSCGGCLGNWEEGDDPWKEHA KWFPKCEFLRSKKSSEEITQYIQSYKGFVDITGEHFVNSWVQRELPMASAYCNDSIFA YEELRLDSFKDWPRESAVGVAALAKAGLFYTGIKDIVQCFSCGGCLEKWQEGDDPLDD HTRCFPNCPFLQNMKSSAEVTPDLQSRGELCELLETTSESNLEDSIAVGPIVPEMAQG EAQWFQEAKNLNEQLRAAYTSASFRHMSLLDISSDLATDHLLGCDLSIASKHISKPVQ EPLVLPEVFGNLNSVMCVEGEAGSGKTVLLKKIAFLWASGCCPLLNRFQLVFYLSLSS TRPDEGLASIICDQLLEKEGSVTEMCMRNIIQQLKNQVLFLLDDYKEICSIPQVIGKL IQKNHLSRTCLLIAVRTNRARDIRRYLETILEIKAFPFYNTVCILRKLFSHNMTRLRK FMVYFGKNQSLQKIQKTPLFVAAICAHWFQYPFDPSFDDVAVFKSYMERLSLRNKATA EILKATVSSCGELALKGFFSCCFEFNDDDLAEAGVDEDEDLTMCLMSKFTAQRLRPFY RFLSPAFQEFLAGMRLIELLDSDRQEHQDLGLYHLKQINSPMMTVSAYNNFLNYVSSL PSTKAGPKIVSHLLHLVDNKESLENISENDDYLKHQPEISLQMQLLRGLWQICPQAYF SMVSEHLLVLALKTAYQSNTVAACSPFVLQFLQGRTLTLGALNLQYFFDHPESLSLLR SIHFPIRGNKTSPRAHFSVLETCFDKSQVPTIDQDYASAFEPMNEWERNLAEKEDNVK SYMDMQRRASPDLSTGYWKLSPKQYKIPCLEVDVNDIDVVGQDMLEILMTVFSASQRI ELHLNHSRGFIESIRPALELSKASVTKCSISKLELSAAEQELLLTLPSLESLEVSGTI QSQDQIFPNLDKFLCLKELSVDLEGNINVFSVIPEEFPNFHHMEKLLIQISAEYDPSK LVKLIQNSPNLHVFHLKCNFFSDFGSLMTMLVSCKKLTEIKFSDSFFQAVPFVASLPN FISLKILNLEGQQFPDEETSEKFAYILGSLSNLEELILPTGDGIYRVAKLIIQQCQQL HCLRVLSFFKTLNDDSVVEIAKVAISGGFQKLENLKLSINHKITEEGYRNFFQALDNM PNLQELDISRHFTECIKAQATTVKSLSQCVLRLPRLIRLNMLSWLLDADDIALLNVMK ERHPQSKYLTILQKWILPFSPIIQK" exon 860..959 /number=5 exon 960..1041 /number=6 exon 1042..1093 /number=7 exon 1094..1213 /number=8 exon 1214..1313 /number=9 exon 1314..1395 /number=10 exon 1396..1453 /number=11 exon 1454..3565 /number=12 exon 3566..3733 /number=13 exon 3734..3889 /number=14 exon 3890..3973 /number=15 exon 3974..4138 /number=16 exon 4139..6124 /number=17 BASE COUNT 1803 a 1298 c 1285 g 1738 t ORIGIN 1 acaaaaggtc ctgtgctcac ctgggaccct tctggacgtt gccctgtgta cctcttcgac 61 tgcctgttca tctacgacga accccgggta ttgaccccag acaacaatgc cacttcatat 121 tggggacttc gtctgggatt ccaaggtgca ttcattgcaa agttccttaa atattttctc 181 actgcttcct actaaaggac ggacagagca tttgttcttc agccacatac tttccttcca 241 ctggccagca ttctcctcta ttagactaga actgtggata aacctcagaa aatggccacc 301 cagcagaaag cctctgacga gaggatctcc cagtttgatc acaatttgct gccagagctg 361 tctgctcttc tgggcctaga tgcagttcag ttggcaaagg aactagaaga agaggagcag 421 aaggagcgag caaaaatgca gaaaggctac aactctcaaa tgcgcagtga agcaaaaagg 481 ttaaagactt ttgtgactta tgagccgtac agctcatgga taccacagga gatggcggcc 541 gctgggtttt acttcactgg ggtaaaatct gggattcagt gcttctgctg tagcctaatc 601 ctctttggtg ccggcctcac gagactcccc atagaagacc acaagaggtt tcatccagat 661 tgtgggttcc ttttgaacaa ggatgttggt aacattgcca agtacgacat aagggtgaag 721 aatctgaaga gcaggctgag aggaggtaaa atgaggtacc aagaagagga ggctagactt 781 gcgtccttca ggaactggcc attttatgtc caagggatat ccccttgtgt gctctcagag 841 gctggctttg tctttacagg taaacaggac acggtacagt gtttttcctg tggtggatgt 901 ttaggaaatt gggaagaagg agatgatcct tggaaggaac atgccaaatg gttccccaaa 961 tgtgaatttc ttcggagtaa gaaatcctca gaggaaatta cccagtatat tcaaagctac 1021 aagggatttg ttgacataac gggagaacat tttgtgaatt cctgggtcca gagagaatta 1081 cctatggcat cagcttattg caatgacagc atctttgctt acgaagaact acggctggac 1141 tcttttaagg actggccccg ggaatcagct gtgggagttg cagcactggc caaagcaggt 1201 cttttctaca caggtataaa ggacatcgtc cagtgctttt cctgtggagg gtgtttagag 1261 aaatggcagg aaggtgatga cccattagac gatcacacca gatgttttcc caattgtcca 1321 tttctccaaa atatgaagtc ctctgcggaa gtgactccag accttcagag ccgtggtgaa 1381 ctttgtgaat tactggaaac cacaagtgaa agcaatcttg aagattcaat agcagttggt 1441 cctatagtgc cagaaatggc acagggtgaa gcccagtggt ttcaagaggc aaagaatctg 1501 aatgagcagc tgagagcagc ttataccagc gccagtttcc gccacatgtc tttgcttgat 1561 atctcttccg atctggccac ggaccacttg ctgggctgtg atctgtctat tgcttcaaaa 1621 cacatcagca aacctgtgca agaacctctg gtgctgcctg aggtctttgg caacttgaac 1681 tctgtcatgt gtgtggaggg tgaagctgga agtggaaaga cggtcctcct gaagaaaata 1741 gcttttctgt gggcatctgg atgctgtccc ctgttaaaca ggttccagct ggttttctac 1801 ctctccctta gttccaccag accagacgag gggctggcca gtatcatctg tgaccagctc 1861 ctagagaaag aaggatctgt tactgaaatg tgcatgagga acattatcca gcagttaaag 1921 aatcaggtct tattcctttt agatgactac aaagaaatat gttcaatccc tcaagtcata 1981 ggaaaactga ttcaaaaaaa ccacttatcc cggacctgcc tattgattgc tgtccgtaca 2041 aacagggcca gggacatccg ccgataccta gagaccattc tagagatcaa agcatttccc 2101 ttttataata ctgtctgtat attacggaag ctcttttcac ataatatgac tcgtctgcga 2161 aagtttatgg tttactttgg aaagaaccaa agtttgcaga agatacagaa aactcctctc 2221 tttgtggcgg cgatctgtgc tcattggttt cagtatcctt ttgacccatc ctttgatgat 2281 gtggctgttt tcaagtccta tatggaacgc ctttccttaa ggaacaaagc gacagctgaa 2341 attctcaaag caactgtgtc ctcctgtggt gagctggcct tgaaagggtt tttttcatgt 2401 tgctttgagt ttaatgatga tgatctcgca gaagcagggg ttgatgaaga tgaagatcta 2461 accatgtgct tgatgagcaa atttacagcc cagagactaa gaccattcta ccggttttta 2521 agtcctgcct tccaagaatt tcttgcgggg atgaggctga ttgaactcct ggattcagat 2581 aggcaggaac atcaagattt gggactgtat catttgaaac aaatcaactc acccatgatg 2641 actgtaagcg cctacaacaa ttttttgaac tatgtctcca gcctcccttc aacaaaagca 2701 gggcccaaaa ttgtgtctca tttgctccat ttagtggata acaaagagtc attggagaat 2761 atatctgaaa atgatgacta cttaaagcac cagccagaaa tttcactgca gatgcagtta 2821 cttaggggat tgtggcaaat ttgtccacaa gcttactttt caatggtttc agaacattta 2881 ctggttcttg ccctgaaaac tgcttatcaa agcaacactg ttgctgcgtg ttctccattt 2941 gttttgcaat tccttcaagg gagaacactg actttgggtg cgcttaactt acagtacttt 3001 ttcgaccacc cagaaagctt gtcattgttg aggagcatcc acttcccaat acgaggaaat 3061 aagacatcac ccagagcaca tttttcagtt ctggaaacat gttttgacaa atcacaggtg 3121 ccaactatag atcaggacta tgcttctgcc tttgaaccta tgaatgaatg ggagcgaaat 3181 ttagctgaaa aagaggataa tgtaaagagc tatatggata tgcagcgcag ggcatcacca 3241 gaccttagta ctggctattg gaaactttct ccaaagcagt acaagattcc ctgtctagaa 3301 gtcgatgtga atgatattga tgttgtaggc caggatatgc ttgagattct aatgacagtt 3361 ttctcagctt cacagcgcat cgaactccat ttaaaccaca gcagaggctt tatagaaagc 3421 atccgcccag ctcttgagct gtctaaggcc tctgtcacca agtgctccat aagcaagttg 3481 gaactcagcg cagccgaaca ggaactgctt ctcaccctgc cttccctgga atctcttgaa 3541 gtctcaggga caatccagtc acaagaccaa atctttccta atctggataa gttcctgtgc 3601 ctgaaagaac tgtctgtgga tctggagggc aatataaatg ttttttcagt cattcctgaa 3661 gaatttccaa acttccacca tatggagaaa ttattgatcc aaatttcagc tgagtatgat 3721 ccttccaaac tagtaaaatt aattcaaaat tctccaaacc ttcatgtttt ccatctgaag 3781 tgtaacttct tttcggattt tgggtctctc atgactatgc ttgtttcctg taagaaactc 3841 acagaaatta agttttcgga ttcatttttt caagccgtcc catttgttgc cagtttgcca 3901 aattttattt ctctgaagat attaaatctt gaaggccagc aatttcctga tgaggaaaca 3961 tcagaaaaat ttgcctacat tttaggttct cttagtaacc tggaagaatt gatccttcct 4021 actggggatg gaatttatcg agtggccaaa ctgatcatcc agcagtgtca gcagcttcat 4081 tgtctccgag tcctctcatt tttcaagact ttgaatgatg acagcgtggt ggaaattgcc 4141 aaagtagcaa tcagtggagg tttccagaaa cttgagaacc taaagctttc aatcaatcac 4201 aagattacag aggaaggata cagaaatttc tttcaagcac tggacaacat gccaaacttg 4261 caggagttgg acatctccag gcatttcaca gagtgtatca aagctcaggc cacaacagtc 4321 aagtctttga gtcaatgtgt gttacgacta ccaaggctca ttagactgaa catgttaagt 4381 tggctcttgg atgcagatga tattgcattg cttaatgtca tgaaagaaag acatcctcaa 4441 tctaagtact taactattct ccagaaatgg atactgccgt tctctccaat cattcagaaa 4501 taaaagattc agctaaaaac tgctgaatca ataatttgtc ttggggcata ttgaggatgt 4561 aaaaaaagtt gttgattaat gctaaaaacc aaattatcca aaattatttt attaaatatt 4621 gcatacaaaa gaaaatgtgt aaggcttgct aaaaaacaaa acaaaacaaa acacagtcct 4681 gcatactcac caccaagctc aagaaataaa tcatcaccaa tacctttgag gtccctgagt 4741 aatccacccc agctaaaggc aaacccttca atcaagttta tacagcaaac cctccattgt 4801 ccatggtcaa cagggaaggg gttggggaca ggtctgccaa tctatctaaa agccacaata 4861 tggaagaagt attcaattta tataataaat ggctaactta acggttgaat cactttcata 4921 catggatgaa acgggtttaa cacaggatcc acatgaatct tctgtgggcc aagagatgtt 4981 ccttaatcct tgtagaacct gttttctata ttgaactagc tttggtacag tagagttaac 5041 ttactttcca tttatccact gccaatataa agaggaaaca ggggttaggg aaaaatgact 5101 tcattccaga ggcttctcag agttcaacat atgctataat ttagaatttt cttatgaatc 5161 cactctactt gggtagaaaa tattttatct ctagtgattg catattattt ccatatcata 5221 gtatttcata gtattatatt tgatatgagt gtctatatca atgtcagtgt ccagaatttc 5281 gttcctacca gttaagtagt tttctgaacg gccagaagac cattcgaaat tcatgatact 5341 actataagtt ggtaaacaac catactttta tcctcatttt tattctcact aagaaaaaag 5401 tcaactcccc tccccttgcc caagtatgaa atatagggac agtatgtatg gtgtggtctc 5461 atttgtttag aaaaccactt atgactgggt gcggtggctc acacctgtaa tcccagcact 5521 ttgggaggct gaggcgggcg aatcatttga ggtgaggaat tcgagaccag cctggccagc 5581 atggtgaaac cccatctcta ctaaaaatac aaaaattagc caggtgtggt ggcacatgcc 5641 tgtagtccca gccactaggg cggctgagac gcaagacttg cttgaacccg ggaggcagag 5701 gttgcagtga gccaagatgg cgccactgca ttccagcctg ggcaacagag caagaccctg 5761 tctgtctcaa aacaaaaaac aaaaccactt atattgctag ctacattaag aatttctgaa 5821 tatgttactg agcttgcttg tggtaaccat ttataatatc agaaagtata tgtacaccaa 5881 aacatgttga acatccatgt tgtacaactg aaatataaat aattttgtca attataccta 5941 aataaaactg gaaaaaaatt tctggaagtt tatatctaaa aatgttaata gtgcgtacct 6001 ctaggaagtg ggcctggaag ccattcttac ttttcagtct ctcccattct gtactgtttt 6061 ttgttttact ttcgtgcctg cattattttt ctatttaaaa caaaaataaa tctagtttag 6121 cact // LOCUS HSU19252 5107 bp mRNA PRI 18-MAY-1995 DEFINITION Human putative transmembrane protein mRNA, complete cds. ACCESSION U19252 NID g808035 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5107) AUTHORS Yamakawa,K., Mitchell,S., Hubert,R., Chen,X.N., Colbern,S., Huo,Y.K., Gadomski,C., Kim,U.J. and Korenberg,J.R. TITLE Isolation and characterization of a candidate gene for progressive myoclonus epilepsy on 21q22.3 JOURNAL Hum. Mol. Genet. 4 (4), 709-716 (1995) MEDLINE 95359979 REFERENCE 2 (bases 1 to 5107) AUTHORS Yamakawa,K. TITLE Direct Submission JOURNAL Submitted (29-DEC-1994) Kazuhiro Yamakawa, Medical Genetics, Cedars-Sinai Medical Center, UCLA, 110 George Burns Road, Davis Building, Suite 2005, Los Angeles, CA 90048-1869, USA FEATURES Location/Qualifiers source 1..5107 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="cDF9" /map="21q22.3" /chromosome="21" /tissue_type="brain" /dev_stage="14 weeks fetus" CDS 138..3710 /note="putative transmembrane protein" /codon_start=1 /db_xref="PID:g808036" /translation="MDASEEPLPPVIYTMENKPIVTCAGDQNLFTSVYPTLSQQLPRE PMEWRRSYGRAPKMIHLESNFVQFKEELLPKEGNKALLTFPFLHIYWTECCDTEVYKA TVKDDLTKWQNVLKAHSSVDWLIVIVENDAKKKNKTNILPRTSIVDKIRNDFCNKQSD RCVVLSDPLKDSSRTQESWNAFLTKLRTLLLMSFTKNLGKFEDDMRTLREKRTEPGWS FCEYFMVQEELAFVFEMLQQFEDALVQYDELDALFSQYVVNFGAGDGANWLTFFCQPV KSWNGLILRKPIDMEKRESIQRREATLLDLRSYLFSRQCTLLLFLQRPWEVAQRALEL LHNCVQELKLLEVSVPPGALDCWVFLSCLEVLQRIEGCCDRAQIDSNIAHTVGLWSYA TEKLKSLGYLCGLVSEKGPNSEDLNRTVDLLAGLGAERPETANTAQSPYKKLKEALSS VEAFEKHYLDLSHATIEMYTSIGRIRSAKFVGKDLAEFYMRKKAPQKAEIYLQGALKN YLAEGWALPITHTRKQLAECQKHLGQIENYLQTSSLLASDHHLTEEERKHFCQEILDF ASQPSDSPGHKIVLPMHSFAQLRDLHFDPSNAVVHVGGVLCVEITMYSQMPVPVHVEQ IVVNVHFSIEKNSYRKTAEWLTKHKTSNGIINFPPETAPFPVSQNSLPALELYEMFER SPSDNSLNTTGIICRNVHMLLRRQESSSSLEMPSGVALEEGAHVLRCSHVTLEPGANQ ITFRTQAKEPGTYTLRQLCASVGSVWFVLPHIYPIVQYDVYSQEPQLHVEPLADSLLA GIPQRVKFTVTTGHDTIKNGDSLQLSNAEAMLILCQAESRAVVYSNTREQSSEAALRI QSSDKVTSISLPVAPAYHVIEFELEVLSLPSAPALGGESDMLGMAEPHRKHKDKQRTG RCMVTTDHKVSIDCPWSIYSTVIALTFSVPFRTTHSLLSSGTRKYVQVCVQNLSELDF QLSDSYLVDTGDSTDLQLVPLNTQSQQPIYSKQSVFFVWELKWTEEPPPSLHCRFSVG FSPASEEQLSISLKPYTYEFKVENFFTLYNVKAEIFPPSGMEYCRTGSLCSLEVLITR LSDLLEVDKDEALTESDEHFSTKLMYEVVDNSSNWAVCGKSCGVISMPVAARATHRVH MEVMPLFAGYLPLPDVRLFKYLPHHSAHSSQLDADSWIENAACQ" BASE COUNT 1237 a 1322 c 1324 g 1224 t ORIGIN 1 gcggcgcaac cggctccgga gctgcctggc gcggccgggc gggcggcgcc gctcaggctc 61 gggctccggc tgggcccggc gcggcctcgg ggctgcccat ggggcgcggg gggccgggcc 121 ggtgacgccg gacgcccatg gacgcctctg aggagccgct gccgccggtg atctacacca 181 tggagaacaa gcccatcgtc acctgtgctg gagatcagaa tttatttacc tctgtttatc 241 caacgctctc tcagcagctt ccaagagaac caatggaatg gagaaggtcc tatggccggg 301 ctccgaagat gattcaccta gagtctaact ttgttcaatt caaagaggag ctgctgccca 361 aagaaggaaa caaagctctg ctcacgtttc ccttcctcca tatttactgg acagagtgct 421 gtgataccga agtgtataaa gctacagtaa aagatgacct caccaagtgg cagaatgttc 481 tgaaggctca tagctctgtg gactggttaa tagtgatagt tgaaaatgat gccaagaaaa 541 aaaacaaaac caacatcctt ccccgaacct ctattgtgga caaaataaga aatgattttt 601 gtaataaaca gagtgacagg tgtgttgtgc tctccgaccc cttgaaggac tcttctcgaa 661 ctcaggaatc ctggaatgcc ttcctgacca aactcaggac attgcttctt atgtctttta 721 ccaaaaacct aggcaagttt gaggatgaca tgagaacctt gagggagaag aggactgagc 781 caggctggag cttttgtgaa tatttcatgg ttcaggagga gcttgccttt gttttcgaga 841 tgctgcagca gttcgaggac gccctggtgc agtacgacga actggacgcc ctcttctctc 901 agtatgtggt caacttcggg gccggggatg gtgccaactg gctgactttt ttctgccagc 961 cagtgaagag ctggaacgga ttgatcctcc gaaaacccat agatatggag aagcgggaat 1021 cgatccagag gcgagaagcc accctgttag atctgcgcag ttacctgttc tctcgccagt 1081 gcaccttgct gctcttcctg cagaggccgt gggaggtggc ccagcgcgcc ctagagctgc 1141 tgcacaactg cgtgcaggaa ctgaagctct tagaagtctc tgtcccacct ggtgctctgg 1201 actgctgggt gtttctgagc tgtctggagg tgttgcagag gatagaaggc tgctgtgacc 1261 gggcacagat cgactcaaac attgcccaca ctgtggggct atggagctat gccacagaaa 1321 agttaaagtc cttgggctat ctatgtggac ttgtgtcaga gaaaggacct aactcagaag 1381 atctcaacag gacagttgac cttttggcag gtttgggagc tgagcgacca gaaacagcca 1441 acacagctca gagtccttat aagaaactga aagaagcatt atcgtcagtg gaagcttttg 1501 aaaaacacta cttagatttg tcccatgcca ccattgaaat gtatacaagc attgggagga 1561 ttcgatctgc taagtttgtt ggaaaagatc tggcagagtt ttacatgagg aaaaaggctc 1621 cacaaaaggc agaaatctat cttcaaggag cactgaaaaa ctacctggct gagggctggg 1681 cactccccat cacacacaca aggaagcagc tggccgaatg tcaaaagcac cttggacaaa 1741 ttgaaaacta cctgcagacc agcagcctct tagccagtga ccaccacctc actgaagagg 1801 agcgcaagca cttctgccag gagatacttg actttgccag ccagccgtca gacagcccag 1861 gtcataagat agtgctaccc atgcattcct ttgcacaact gcgagatctc cattttgatc 1921 cctccaatgc cgtggtccac gtgggcggcg ttttgtgcgt tgagataacc atgtacagcc 1981 agatgcctgt gcctgttcac gtggagcaga ttgtggtcaa tgtccacttc agcattgaga 2041 aaaacagcta ccggaagact gcggagtggc ttaccaagca caagacgtcc aatgggatca 2101 ttaactttcc acccgagacc gcacctttcc ctgtatccca aaacagtttg cccgcgctgg 2161 agttgtatga aatgtttgag agaagcccat ctgataactc cttgaacacg actgggatta 2221 tctgcagaaa cgtccacatg ctcctgagaa ggcaggagag cagctcctct ctagagatgc 2281 cctcaggggt ggctctggag gagggtgccc acgtgctgag gtgcagccac gtgaccctgg 2341 aaccaggggc caaccagata acattcagga ctcaggccaa ggaacctgga acgtatacac 2401 tcaggcagct gtgcgcctcg gtgggctccg tgtggttcgt cctccctcac atctacccca 2461 ttgtgcagta cgacgtgtac tcacaggagc cccagctgca cgtggagccg ctggctgata 2521 gccttctggc aggcattcct cagagagtca agttcactgt cactaccggc catgatacga 2581 taaagaatgg agacagcctg cagcttagca atgccgaagc catgctcatc ctgtgccagg 2641 cggagagcag ggctgtggtc tactccaaca cgagagaaca gtcttctgag gccgcgctcc 2701 ggattcagtc ctccgacaag gtcacgagca tcagtctgcc tgttgcgcct gcgtaccacg 2761 tgatcgaatt tgaactggaa gttctctctt taccttcagc cccagcactc ggaggggaga 2821 gtgacatgct ggggatggca gagccccaca ggaagcataa ggacaaacag agaactggcc 2881 gctgcatggt taccacagac cacaaagtgt cgattgactg cccgtggtcc atctactcca 2941 cagtcatcgc actgaccttc agcgtaccct tcaggaccac acacagcctc ctgtcctcag 3001 gaacacggaa atatgttcaa gtttgtgtcc agaatttgtc agaacttgac tttcagctgt 3061 cagatagtta tcttgtagat accggtgata gtaccgacct gcaactagta ccactgaaca 3121 cgcagtccca gcagcccatc tacagcaagc agtcggtgtt cttcgtctgg gaactcaagt 3181 ggacagaaga gcctccccct tctctgcatt gccggttctc tgttggattt tccccagctt 3241 ctgaggaaca gctgtctatc tccttaaagc cgtatactta tgaatttaaa gtggaaaatt 3301 tttttacatt atacaacgtg aaggctgaga tctttccccc ttcgggaatg gagtattgca 3361 gaacaggctc cctctgctcc ctggaggttt tgatcacgag gctctcagac ctcttggagg 3421 tggataaaga tgaagcactg actgaatctg atgagcattt ttcgacaaag cttatgtatg 3481 aagttgtcga caacagtagc aactgggcag tgtgtgggaa aagctgcggt gtcatctcca 3541 tgccagtggc tgctcgggcc actcacaggg tccacatgga agtgatgccg ctcttcgccg 3601 ggtatctccc cctgcccgac gtcaggctgt tcaagtacct cccccatcat tctgcacact 3661 cctcccaact ggacgctgac agctggatag aaaacgcagc ctgtcagtag acaagcacgg 3721 ggacgaccag ccggacagca gcagcctcaa gagcaggggc agcgtgcatt cggcctgcag 3781 cagcgagcac aaaggcctac ccatgccccg gctgcaggca ctgccggccg gccaggtctt 3841 caactccagc tcgggcacac aagtcctggt catccccagc caagatgacc acgtcctgga 3901 agtcagtgta acatgacaac gccagggtga acacacgcca cttcccagct aggagtgcac 3961 tttatgggac tgtgactgga ctcttccgtt ctggctccag ccagaccttc agtggtcctg 4021 cctggccgtg gggacatcag agagtgtcat cacgcagctg gccagctgag ttctgttgtt 4081 gttttcatgc cgcctgtgat ctcagattcc tgcttttctc accccgtccc catgctggtg 4141 tccgacgccg cttactcaga gccctggcct ccctccccct acctcacacg ctgctcatga 4201 aagtttccac ccacgctgtc tccacggaac agcctccgtc tgctggctct tcgtggaagg 4261 ccatttgtct ttcaggtaga cactcagcag ccctcacggt cttagtgacg tgtgtgcctt 4321 tctggtcaca cagctgccca gtttcctgat cggggtggat ttgtgtcccc taaggggtaa 4381 aacagccgtt taccgcagat cctctcatac acccttctag gggaggcggg tgggggaggg 4441 agggatcata accccttctg tgccttggga tgccggagct gggggacctg gaggcccatc 4501 agccggagcc acgtgaaagg tactgaagaa agctgagacc cggctgtgag gagcgcctca 4561 gcggtgaggt ggtttaggga taaatgtttc tggaaccctg tggtccccca taatgttgat 4621 agatatcata tgcactggga gttaaatata tttaatttaa tgatcattat atatgtgggg 4681 gttaatatgt tgtttttctg tccctttaaa gtctttacat gtaattgtag ctgtataatc 4741 gttatttttc ttttgcatct taagtcttag aaattaagat attccatcgt gaggatgaga 4801 gaggtcctca gtgtgttttt ggtctggttg tagggaagga ctcaagtcct ggaatgtcct 4861 ccactggtct actgagttgc agtcacactg ttccaatgga ttatttgctt tcggttgtaa 4921 atttaattgt acatatggtt gatttattat ttttaaaaat acagactaac tgatgtaatg 4981 tttatgtata agttgcacca aaaatcaagg acaaaaataa gtgtgtttgt ttttacaggt 5041 gtgaaagtca cagcttgtaa ataagtgttg tatgtattaa accttttcca gttctccaaa 5101 gcgatgt // LOCUS HSU19261 2380 bp mRNA PRI 21-FEB-1995 DEFINITION Human Epstein-Barr virus-induced protein mRNA, complete cds. ACCESSION U19261 NID g675461 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2380) AUTHORS Mosialos,G., Birkenbach,M., Yalamanchili,R., VanArsdale,T., Ware,C. and Kieff,E. TITLE The Epstein-Barr virus transforming protein LMP1 engages signaling proteins for the tumor necrosis factor receptor family JOURNAL Cell (1994) In press REFERENCE 2 (bases 1 to 2380) AUTHORS Mosialos,G. TITLE Direct Submission JOURNAL Submitted (29-DEC-1994) George Mosialos, Microbiology and Molecular Genetics, Harvard Medical School, 75 Francis Street, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..2380 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="BL41/B95-8" /cell_type="B-cell" /tissue_type="lymphoid tumor" 5'UTR 1..75 CDS 76..1326 /note="EBV induced protein" /codon_start=1 /product="Epstein-Barr virus-induced protein" /db_xref="PID:g675462" /translation="MASSSGSSPRPAPDENEFPFGCPPTVCQDPKEPRALCCAGCLSE NPRNGEDQICPKCRGEDLQSISPGSRLRTQEKAHPEVAEAGIGCPFAGVGCSFKGSPQ SVQEHEVTSQTSHLNLLLGFMKQWKARLGCGLESGPMALEQNLSDLQLQAAVEVAGDL EVDCYRAPCSESQEELALQHFMKEKLLAELEGKLRVFENIVAVLNKEVEASHLALATS IHQSQLDRERILSLEQRVVELQQTLAQKDQALGKLEQSLRLMEEASFDGTFLWKITNV TRRCHESACGRTVSLFSPAFYTAKYGYKLCLRLYLNGDGTGKRTHLSLFIVIMRGEYD ALLPWPFRNKVTFMLLDQNNREHAIDAFRPDLSSASFQRPQSETNVASGCPLFFPLSK LQSPKHAYVKDDTMFLKCIVETST" 3'UTR 1327..2380 BASE COUNT 544 a 709 c 683 g 444 t ORIGIN 1 gccaggactc cacaaggctg gtcccctgcc ctggagcaac ttaaacaggc cctctggcca 61 gcctggaacc ctgagatggc ctccagctca ggcagcagtc ctcgcccggc ccctgatgag 121 aatgagtttc cctttgggtg ccctcccacc gtctgccagg acccaaagga gcccagggct 181 ctctgctgtg caggctgtct ctctgagaac ccgaggaatg gcgaggatca gatctgcccc 241 aaatgcagag gggaagacct ccagtctata agcccaggaa gccgtcttcg aactcaggag 301 aaggctcacc ccgaggtggc tgaggctgga attgggtgcc cctttgcagg tgtcggctgc 361 tccttcaagg gaagcccaca gtctgtgcaa gagcatgagg tcacctccca gacctcccac 421 ctaaacctgc tgttggggtt catgaaacag tggaaggccc ggctgggctg tggcctggag 481 tctgggccca tggccctgga gcagaacctg tcagacctgc agctgcaggc agccgtggaa 541 gtggcggggg acctggaggt cgattgctac cgggcaccct gctccgagag ccaggaggag 601 ctggccctgc agcacttcat gaaggagaag cttctggctg agctggaggg gaagctgcgt 661 gtgtttgaga acattgttgc tgtcctcaac aaggaggtgg aggcctccca cctggccctg 721 gccacctcta tccaccagag ccagctggac cgtgagcgca tcctgagctt ggagcagagg 781 gtggtggagc ttcagcagac cctggcccag aaagaccagg ccctgggcaa gctggagcag 841 agcttgcgcc tcatggagga ggcctccttc gatggcactt tcctgtggaa gatcaccaat 901 gtcaccaggc ggtgccatga gtcggcctgt ggcaggaccg tcagcctctt ctccccagcc 961 ttctacactg ccaagtatgg ctacaagttg tgcctgcggc tgtacctgaa tggagatggc 1021 actggaaaga gaacccatct gtcgctcttc atcgtgatca tgagagggga gtatgatgcg 1081 ctgctgccgt ggcccttccg gaacaaggtc accttcatgc tgctggacca gaacaaccgt 1141 gagcacgcca ttgacgcctt ccggcctgac ctaagctcag cgtccttcca gaggccccag 1201 agtgaaacca acgtggccag tggatgccca ctcttcttcc ccctcagcaa actgcagtca 1261 cccaagcacg cctacgtgaa ggacgacaca atgttcctca agtgcattgt ggagaccagc 1321 acttagggtg ggcggggctc ctgagggagc tccaactcag aagggagcta gccagaggac 1381 tgtgatgccc tgcccttggc acccaagacc tcagggcaca aagatgggtg aaggctggca 1441 tgatccaagc aagactgagg ggtcgacttc gggctggcca tctggttagg atggcaggac 1501 gtgggctggg cccacaaagg caaagggtcc agaaggagac aggcagagct gctcccctct 1561 gcacggacca tgcgacactg ggaggccagt gagccactcc ggccccgaat gttgaggtgg 1621 actctcacca aatgagaaga aaatggaacc aggcttggaa ccgtaggacc caagcagaga 1681 agctctcggg ctaggaagat ctctgcaggg ccgccaggga gacctggaca caggcctgct 1741 ctctttttct ccagggtcag aaacaggacc gggtggaagg gatggggtgc cagtttgaat 1801 gcagtctgtc caggctcgtc attggaggtg aacaagcaaa cccagacggc tccactagga 1861 cttcaaattg ggggttggat ttgaagactt ttaagtttcc ttccagccca gaaagtctct 1921 cattctagcc tcctggccca ggtgagtcct agagctacag gggttctgga aacattcagg 1981 agcttcctgt cctcccagct cctcactcac cttcagtaac ccccactgga ctgacctggt 2041 ccacagggca cctgccaccc tgggcctggc agctcagctt cccaacacgc aggagcacac 2101 ccagccccca catcctgtgc ctccatcagc taaacaccac gtcacttcat gcaggtgaaa 2161 cccagtcact gtgagctccc aggtgcagcc agaggcacct caagaagaag aggggcataa 2221 actttcctct tcctgcctag aggccccacc tttggtgctt tccagaatcc cgtaacacct 2281 gattaactga ggcatccact tctttcagca gactgatcag gacctccaag ccactgagca 2341 atgtataacc ccaaagggaa ttcaaaaaaa aaaaaaaaaa // LOCUS HSU19345 2856 bp mRNA PRI 23-MAR-1995 DEFINITION Human AR1 protein (AR) mRNA, complete cds. ACCESSION U19345 NID g726041 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2856) AUTHORS Rajadhyaksha,A., Babin,J., Riviere,M., Szpirer,J., Szpirer,C., Tesmer,V. and Bina,M. JOURNAL Unpublished REFERENCE 2 (bases 1 to 2856) AUTHORS Bina,M. TITLE Direct Submission JOURNAL Submitted (30-DEC-1994) Minou Bina, Department of Chemistry, Purdue University, 1393 Brown Blg., W. Lafayette, IN 47907, USA FEATURES Location/Qualifiers source 1..2856 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="22" /cell_line="Hela" gene 247..2397 /gene="AR" CDS 247..2397 /gene="AR" /codon_start=1 /function="exhibits DNA binding activity; possible regulator of gene expression" /product="AR1" /db_xref="PID:g726042" /translation="MSSDGLPNKGMELKHGSQKLQESCWDLSRQTSPAKSSGPPGMSS QKRYGPPHETDGHGLAEATQSSKPGSVMLRLPGQEDHSSQNPLIMRRRVRSFISPIPS KRQSQDVKNSSTEDKGRLLHSSKEGADKAFNSYAHLSHSQDIKSIPKRDSSKDLPSPD SRNCPAVTLTSPAKTKILPPRKGRGLKLEAIVQKITSPNIRRSASSNSAEAGGDTVTL DDILSLKSGPPEGGSVAVQDADIEKRKGEVASDLVSPANQELHVEKPLPRSSEEWRGS VDDKVKTETHAETVTAGKEPPGAMTSTTSQKPGSNQGRPDGSLGGTAPLIFPDSKNVP PVGILAPEANPKAEEKENDTVTISPKQEGFPPKGYFPSGKKKGRPIGSVNKQKKQQQP PPPPPQPPQIPEGSADGEPKPKKQRQRRERRKPGAQPRKRKTKQAVPIVEPQEPEIKL KYATQPLDKTDAKNKSFYPYIHVVNKCELGAVCTIINAEEEEQTKLVRGRKGQRSLTP PPSSTESKALPASSFMLQGPVVTESSVMGHLVCCLCGKWASYRNMGDLFGPFYPQDYA ATLPKNPPPKRATEMQSKVKVRHKSASNGSKTDSEEEEEQQQQQKEQRSLAAHPRFNG RHRSEDCGGGPRSCPGGSLVKKQPLRAAVKRLFWTRSPPCPPLQKVALSWSYKSLNYL LTAMNFGSMRVVFSGPMESTWFVAGSMACRKRWK" repeat_region 1411..1419 /rpt_type=tandem repeat_region 2059..2074 /rpt_type=tandem BASE COUNT 821 a 699 c 774 g 562 t ORIGIN 1 agctgcatta taagagacag atgtaccaac agcaaccaga ggagtataaa gactggagca 61 gcggttctgc tcagggagta attgctgcag cacagcacag gcaggagggg ccacggaaga 121 gtccaaggca gcagcagttt cttgacagag tacggagccc tctgaaaaat gacaaagatg 181 gtatgatgta tggcccacca gtgggactta ccatgaccca gtgcccagga ggctgggcgc 241 tgcctaatgt ctagtgatgg tctgcctaac aagggcatgg aattaaagca tggctcccag 301 aagttacaag aatcctgttg ggatctttct cggcaaactt ctccagccaa aagcagcggt 361 cctccaggaa tgtccagtca aaaaaggtat gggccgcccc atgagactga tggacatgga 421 ctagctgagg ctacacagtc atccaaacct ggtagtgtta tgctgagact tccaggccag 481 gaggatcatt cttctcaaaa ccccttaatc atgaggaggc gtgttcgttc ttttatctct 541 cccattccca gtaagagaca gtcacaagat gtaaagaaca gtagcactga agataaaggt 601 cgcctccttc actcatcaaa agaaggcgct gataaagcat tcaattccta tgcccatctt 661 tctcacagtc aggatatcaa gtctatccct aagagagatt cctccaagga ccttccaagt 721 ccagatagta gaaactgccc tgctgttacc ctcacaagcc ctgctaagac caaaatactg 781 cccccacgga aaggacgggg attgaaattg gaagctatag ttcagaagat tacatcccca 841 aatattagga ggagcgcatc ttcgaacagt gcggaggctg ggggagacac ggttacgctt 901 gatgatatac tgtctttgaa gagtggtcct cctgaaggtg ggagtgttgc tgttcaggat 961 gctgacatag agaagagaaa aggtgaggtg gcttcggacc tagtcagtcc agcaaaccag 1021 gagttgcacg tagagaaacc tcttccaagg tcttcagaag agtggcgtgg cagcgtggat 1081 gacaaagtga agacagagac acatgcagaa acagttactg ccggaaagga accccctggt 1141 gccatgacat ccacaacctc acagaagcct ggtagtaacc aagggagacc agatggttcc 1201 ctgggtggaa cagcaccttt aatctttcca gactcaaaga atgtacctcc agtgggcata 1261 ttggcccctg aggcaaaccc caaggctgaa gagaaggaga acgatacagt gacgatttca 1321 ccgaagcaag agggtttccc tccaaaggga tatttcccat caggaaagaa gaaggggaga 1381 cccattggta gtgtgaataa gcaaaagaaa cagcagcagc caccgcctcc accccctcag 1441 cccccacaga taccagaagg ttctgcagat ggagagccaa agccaaaaaa acagaggcaa 1501 aggagggaga gaaggaagcc tggggcccag ccgaggaagc gaaaaaccaa acaagcagtt 1561 cccattgtgg aaccccaaga acctgagatc aaactaaaat atgccaccca gccactggat 1621 aaaactgatg ccaagaacaa gtctttttac ccttacatcc atgtagtaaa taagtgtgaa 1681 cttggagccg tttgtacaat catcaatgct gaggaagaag aacagaccaa attagtgagg 1741 ggcaggaagg gtcagaggtc actgacccct ccacctagca gcactgaaag caaggcgctc 1801 ccggcctcgt cctttatgct gcagggacct gttgtgacag agtcttcggt tatggggcac 1861 ctggtttgct gtctgtgtgg caagtgggcc agttaccgga acatgggtga cctctttgga 1921 cctttttatc cccaagatta tgcagccact ctcccgaaga atccacctcc taagagggcc 1981 acagaaatgc agagcaaagt taaggtacgg cacaaaagtg cttctaatgg ctccaagacg 2041 gacagtgagg aggaggaaga gcagcagcag cagcagaagg agcagagaag cctggccgca 2101 caccccaggt ttaacgggcg ccaccgctcg gaagactgtg gtggaggccc tcggtcctgt 2161 ccagggggct cccttgtaaa aaagcagcca ctgagggcag cagtgaaaag actgttttgg 2221 actcgaagcc ctccgtgccc accacttcag aaggtggccc tgagctggag ttacaaatcc 2281 ctgaactacc tcttgacagc aatgaatttt gggtccatga gggttgtatt ctctgggcca 2341 atggaatcta cctggtttgt ggcaggctct atggcctgca ggaagcgctg gaaatagcca 2401 gagagatgaa atgttcccac tgccaggagg caggcgccac cttgggctgc tacaacaaag 2461 gctgctcctt ccgataccat tacccgtgtg ccattgatgc agattgtttg ctacatgagg 2521 agaacttctc ggtgaggtgc cctaagcaac aaggtgagac tgtggagatg agaaggtggt 2581 ggacactcgt gatggaatgg aaatcgtcct accgtgcagc cacaccctgc cctgccccgc 2641 cccgccccgc ccgcgtgcct gcccatgcca gcacttcctt aagttctcac atcacactca 2701 aaccagtgac accacaggaa agaaagaccc aagacgttgg aatggctgtt tccatggaca 2761 caatctccat agtgacaatg tggggggagg ggggaggggt gggatgatgg ggaaagggtg 2821 ggggggatta aaagggaggg ataaatatat atatat // LOCUS HSU19517 2041 bp mRNA PRI 26-JAN-1996 DEFINITION Human (apoargC) long mRNA, complete cds. ACCESSION U19517 NID g642943 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2041) AUTHORS Byrne,C.D., Schwartz,K. and Lawn,R.M. TITLE Loss of a splice donor site at a 'skipped exon' in a gene homologous to apolipoprotein(a) leads to an mRNA encoding a protein consisting of a single kringle domain JOURNAL Arterioscler. Thromb. Vasc. Biol. 15 (1), 65-70 (1995) MEDLINE 95268939 REFERENCE 2 (bases 1 to 2041) AUTHORS Lawn,R.M. TITLE Direct Submission JOURNAL Submitted (05-JAN-1995) Richard M. Lawn, Stanford University School of Medicine, Stanford University Medical Center, Falk Cardiovascular Research Center CV 267, 300 Pasteur Drive, Stanford, CA 94305-5246, USA FEATURES Location/Qualifiers source 1..2041 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6q27-ter" /tissue_type="liver" gene 118..516 /gene="apoargC" CDS 118..516 /gene="apoargC" /standard_name="apolipoprotein (a) related gene C" /codon_start=1 /db_xref="PID:g642944" /translation="MEHKEVVLLLLLFLKSAPTETGPSVQECYHSNGQSYRGTYFTTV TGRTCQAWSSMTPHQHSRTPEKYPNDGLISNYCRNPDCSAGPWCYTTDPNVRWEYCNL TRCSDDEGTVFVPLTVIPVPSLEDSFIQVA" misc_feature 1504 /note="site of the alternatively spliced variant, see GenBank Accession number U19518" BASE COUNT 565 a 525 c 489 g 462 t ORIGIN 1 tgagagaatc attaacttaa tttgactatc tggtttgtgg atgcgtttac tctcatgtaa 61 gtcaacaaca tcctgggatt gggacacact ttctgggcac tgctggccag tcccaaaatg 121 gaacataagg aagtggttct tctacttctg ttatttctga agtcagcacc gactgagaca 181 gggccttctg tgcaggagtg ctaccacagt aatggacaga gttatcgagg cacatacttc 241 accactgtca caggaagaac ctgccaagct tggtcatcta tgacgccaca ccagcacagt 301 agaaccccag aaaagtaccc aaatgatggc ttgatctcga actactgcag gaatccggat 361 tgttcggcag gcccttggtg ttatacgacg gatcccaatg tcaggtggga gtactgcaac 421 ctgacacggt gctcagacga tgaagggact gtgttcgtgc ctctgactgt tatcccagtt 481 ccaagcctag aggattcatt catacaagtg gcttgatctc caactactgc aggaatccgg 541 attgttcggc aggcccttgg tgttatacaa cggatcccaa agtcaggtgg gagtactgca 601 acctgacagg atgctcagac aagaataggg ctgtggccgc gcctctgact attatcccgg 661 ttccaagacg agaggatact tccaaacaag cactgattga tccaaggcct tcaatgtagg 721 agtgctacca tggaaatgga cagagttatc gaggcacata cttcaccacc gtcacaggaa 781 gcacttgcca agcttggtca tctatgacac cacaccagca cagtaggacc tcagaaaagt 841 acccaaatgc tgacttgatc atgaactact gcaggaatcc agatcctgtg gaaggccttg 901 gtgttacacg atggatccca aagtcagatg ggagtactgc aacctgacac gatgctcaga 961 cacagaaggg actgcagtcg tgcctctgac tgttatcccg gttccaagcc tagaggatcc 1021 ttccaaacca gcaccaacag agccaaggcc ttgggagcag cagtgctacc acggtaatgg 1081 acagagttat cgaggcacat acttcaccac tgtcacagga agaacctgcc aagcttggtc 1141 atctatgacg ccacatcagc acatggcctg accaggaaca ctgcaggaat ccagattctg 1201 ggaaacaacc ctggtgttac acaactgatc cgtgtgtgag gtgggagtac tgcaacctga 1261 cacaatgctc agaaacagaa tcagtgtcct agagactccc actgttgttc ccgttccaag 1321 catggaggct cattctgaag cagcaccaac tgagcaaacc cctgtggtcc ggcagtgcta 1381 ccatggtaat ggacagagtt atcaaggcac attctccacc actgtcacag gaaggacatg 1441 tcaatcttgg tcatccatga caccacaccg gtcatcagag gaccccagaa aactacccaa 1501 atgatggcct gacaatgaac tactgcagga atccagatgc cgatacaggc ccttggtgtt 1561 ttaccacaga ccccagcgtc aggtgggagt actgcaacct gacgcgatgc tcagacagaa 1621 gggactgtgg tcgctcctcc gactgttatc caggttccaa gcctagaggc tccttccgaa 1681 caagtaaaaa gtctgtatcc agacatctat gtgcttggac aacgggatga aaagacatga 1741 aaagccacac tgatgcagaa gcctttagtg ctgcacggga gctcgagtgt tggttgaggg 1801 tctgccgtga ccaaggaagc ctcaatgccg tccctgggaa agccagagct gtgatttttg 1861 gcacaacttg cgagtgtagt gactttagga ctggcgcaaa acctccaggg tgctcaactt 1921 aaccactcac cttgttctaa aataggttat ttcagtgtcc cagtcaaatt cctattctaa 1981 catgctgtca actatgtgat tctttccaag ccaataaaca tttccagtaa tttcttaaaa 2041 a // LOCUS HSU19523 2921 bp mRNA PRI 09-OCT-1996 DEFINITION Human GTP cyclohydrolase I mRNA, complete cds. ACCESSION U19523 NID g755461 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2921) AUTHORS Nomura,T., Ohtsuki,M., Matsui,S., Sumi-Ichinose,C., Nomura,H., Hagino,Y., Iwase,K., Ichinose,H., Fujita,K. and Nagatsu,T. TITLE Isolation of a full-length cDNA clone for human GTP cyclohydrolase I type 1 from pheochromocytoma JOURNAL J. Neural Transm. 101 (1-3), 237-242 (1995) MEDLINE 96274939 REFERENCE 2 (bases 1 to 2921) AUTHORS Nomura,T. TITLE Direct Submission JOURNAL Submitted (05-JAN-1995) Takahide Nomura, Dept. of Pharmacology, Fujita Health University, School of Medicine, 1-98 Dengakugakubo, Kutsukake-cho, Toyoake, Aichi, 470-11, Japan FEATURES Location/Qualifiers source 1..2921 /organism="Homo sapiens" /strain="Japanese" /db_xref="taxon:9606" /tissue_type="pheochromocytoma" CDS 149..901 /EC_number="3.5.4.16" /codon_start=1 /function="conversion of GTP to D-erythro-7,8-dihydroneopterin triphosphate" /product="GTP cyclohydrolase I" /db_xref="PID:g755462" /translation="MEKGPVRAPAEKPRGARCSNGFPERDPPRPGPSRPAEKPPRPEA KSAQPADGWKGERPRSEEDNELNLPNLAAAYSSILSSLGENPQRQGLLKTPWRAASAM QFFTKGYQETISDVLNDAIFDEDHDEMVIVKDIDMFSMCEHHLVPFVGKVHIGYLPNK QVLGLSKLARIVEIYSRRLQVQERLTKQIAVAITEALRPAGVGVVVEATHMCMVMRGV QKMNSKTVTSTMLGVFREDPKTREEFLTLIRS" BASE COUNT 776 a 589 c 651 g 905 t ORIGIN 1 ccggctcgga gtgtgatcta agcaggtcgc gtaccttcct caggtgactc cggccacagc 61 ccattgtccg cggccaccgg cggagtttag ccgcagacct cgaagcgccc cggggtcctt 121 cccgaacggc agcggctgcg gcgggtccat ggagaagggc cctgtgcggg caccggcgga 181 gaagccgcgg ggcgccaggt gcagcaatgg gttccccgag cgggatccgc cgcggcccgg 241 gcccagcagg ccggcggaga agcccccgcg gcccgaggcc aagagcgcgc agcccgcgga 301 cggctggaag ggcgagcggc cccgcagcga ggaggataac gagctgaacc tccctaacct 361 ggcagccgcc tactcgtcca tcctgagctc gctgggcgag aacccccagc ggcaagggct 421 gctcaagacg ccctggaggg cggcctcggc catgcagttc ttcaccaagg gctaccagga 481 gaccatctca gatgtcctaa acgatgctat atttgatgaa gatcatgatg agatggtgat 541 tgtgaaggac atagacatgt tttccatgtg tgagcatcac ttggttccat ttgttggaaa 601 ggtccatatt ggttatcttc ctaacaagca agtccttggc ctcagcaaac ttgcgaggat 661 tgtagaaatc tatagtagaa gactacaagt tcaggagcgc cttacaaaac aaattgctgt 721 agcaatcacg gaagccttgc ggcctgctgg agtcggggta gtggttgaag caacacacat 781 gtgtatggta atgcgaggtg tacagaaaat gaacagcaaa actgtgacca gcacaatgtt 841 gggtgtgttc cgggaggatc caaagactcg ggaagagttc ctgactctca ttaggagctg 901 agcttcattc agtgtgtgtg cgttggttgc cgatcgtact gccagtagca ttgtctgtct 961 gtccggtctt gtttgtacat tccattttca attgttacag atgtgaactt tattccttgt 1021 cactaattat atttaaaatt atttctagga agtcaaataa atataataaa gggttgagcc 1081 tctactttct tcttgccacc tttttgtggc aatattaaag tgaactgcta atagtgtaag 1141 tatgtgcaca aaaccactgc cagataacca gaggggcctg ggaagggaga agaattagtg 1201 tatttttttc aaatagtaca gtaatttgcc tcataagcat aggagcattg ggaatgagag 1261 ggaactgtgc ccagtatact gttttttttc ttcctccaat aaaagtggtg tagtgccgaa 1321 agtgctaaaa tatttagtgc ggtattgctc tgtgaattca agttcaacag acttcacttt 1381 ggtcatgttt attaaaccac cagtgacatt taaaaatata tttttagcag tcgtaatgtt 1441 agtcaccaag ggaaggtggt ggaatgtcta tgtttttgat tttactgtga gttaaaaagg 1501 cacatttcta ccttctattg tttttaaatt caagaatagg gaattagttc ctggtgttgt 1561 ttacgagtgt attctcgtgt caacatacag ggatttagac atttaactct ctgtgccttg 1621 ataagaatat catttagagt gtagatactt ttgccttttt aaaaaagcca ttattttatg 1681 agacttagta ctcacactgc aaataactag tcagctcagt tttaacttta taggtttatt 1741 gagtttcctt tgtgtgatcc atgtagatgc ctcaaaatgt ttcttcttct tctttttttt 1801 aatcttataa gatatttttc taagtatttc cagaaacatt tgagagtgcc catcattttc 1861 aggtctgcag aaccatagct tccacgcacc tgaacgagca cagaatgaac tgacggtgga 1921 agacattatg agctgtgtcc aacgttttaa ccaaagcgta tcgtaccaac gatctgtgaa 1981 aatgcactgg aagcttctgg tcccggtttc ctttgtggtc tatgtgggtc ttgtcctcat 2041 tgtaactccg tatagatggt ataggtattt taatcctgga agctgttgcc ttattaatga 2101 ttatcttaaa atttcctcca ttggggcagc gtgggccaaa ttaaaacaaa caaaaccgca 2161 actcctccac agaaacacaa acacagttat tccatgaagt ttagtatttg gttgacatag 2221 tgctcttcaa attcatccca ttaccctaaa agtaataact ttgatgcttg ctttaacttt 2281 agtcccatct ctgccacttt gatgctattt gggttatgat ggggcaagat ggcagaggta 2341 ttgggttttt ttgttttttt ccattcctct ctacttctgt ttcctagctt tttctttctg 2401 gagtttaagt acagtgatgg ttggcttgag taccttttta aatctagccc agtataaaca 2461 ttagcctgct taatatttag acatttatag gtagaattct gagcactcaa ctcatgtttg 2521 gcattttaaa gtaaaaacaa gtgtgacttc gaggaccaaa gaaattgtca gctatacatt 2581 tatctttatg aactcattta tattcctttt taatgactcg ttgttctaac atttcctaga 2641 agtgttctta taaaggtcta atgtatccac aggctgttgt cttattagta aatgcaaagt 2701 aatgactttg tctgttttac tctagtcttt agtacttcaa aattaccttt tcatatccat 2761 gatcttgagt ccatttgggg gatttttaag aatttgatgt atttcaatac actgttcaaa 2821 attaaattgt ttaattttat gtatgagtat gtatgttcct gaagttggtc ctatttaaat 2881 tattaaacta ttgtaacttt aaaaaaaaaa aaaaaaaaaa a // LOCUS HSU19556 1279 bp mRNA PRI 25-APR-1996 DEFINITION Human squamous cell carcinoma antigen 1 (SCCA1) mRNA, complete cds. ACCESSION U19556 NID g1276435 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1279) AUTHORS Schneider,S.S., Schick,C., Fish,K.E., Miller,E., Pena,J., Treter,S.D. and Silverman,G.A. TITLE A serine protease inhibitor locus at 18q21.3 contains a tandem duplication of the human squamous cell carcinoma antigen gene JOURNAL Unpublished REFERENCE 2 (bases 1 to 1279) AUTHORS Silverman,G.A. TITLE Direct Submission JOURNAL Submitted (06-JAN-1995) Gary A. Silverman, Pediatrics, Harvard Medical School, Joint Program in Neonatology, 300 Longwood Avenue Enders-9, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..1279 /organism="Homo sapiens" /db_xref="taxon:9606" /map="18q21.3" /chromosome="18" gene 30..1202 /gene="SCCA1" CDS 30..1202 /gene="SCCA1" /codon_start=1 /product="squamous cell carcinoma antigen 1" /db_xref="PID:g1276436" /translation="MNSLSEANTKFMFDLFQQFRKSKENNIFYSPISITSALGMVLLG AKDNTAQQIKKVLHFDQVTENTTGKAATYHVDRSGNVHHQFQKLLTEFNKSTDAYELK IANKLFGEKTYLFLQEYLDAIKKFYQTSVESVDFANAPEESRKKINSWVESQTNEKIK NLIPEGNIGSNTTLVLVNAIYFKGQWEKKFNKEDTKEEKFWPNKNTYKSIQMMRQYTS FHFASLEDVQAKVLEIPYKGKDLSMIVLLPNEIDGLQKLEEKLTAEKLMEWTSLQNMR ETRVDLHLPRFKVEESYDLKDTLRTMGMVDIFNGDADLSGMTGSRGLVLSGVLHKAFV EVTEEGAEAAAATAVVGFGSSPTSTNEEFHCNHPFLFFIRQNKTNSILFYGRFSSP" BASE COUNT 427 a 267 c 273 g 312 t ORIGIN 1 caggagttcc agatcacatc gagttcacca tgaattcact cagtgaagcc aacaccaagt 61 tcatgttcga cctgttccaa cagttcagaa aatcaaaaga gaacaacatc ttctattccc 121 ctatcagcat cacatcagca ttagggatgg tcctcttagg agccaaagac aacactgcac 181 aacagattaa gaaggttctt cactttgatc aagtcacaga gaacaccaca ggaaaagctg 241 caacatatca tgttgatagg tcaggaaatg ttcatcacca gtttcaaaag cttctgactg 301 aattcaacaa atccactgat gcatatgagc tgaagatcgc caacaagctc ttcggagaaa 361 aaacgtatct atttttacag gaatatttag atgccatcaa gaaattttac cagaccagtg 421 tggaatctgt tgattttgca aatgctccag aagaaagtcg aaagaagatt aactcctggg 481 tggaaagtca aacgaatgaa aaaattaaaa acctaattcc tgaaggtaat attggcagca 541 ataccacatt ggttcttgtg aacgcaatct atttcaaagg gcagtgggag aagaaattta 601 ataaagaaga tactaaagag gaaaaatttt ggccaaacaa gaatacatac aagtccatac 661 agatgatgag gcaatacaca tcttttcatt ttgcctcgct ggaggatgta caggccaagg 721 tcctggaaat accatacaaa ggcaaagatc taagcatgat tgtgttgctg ccaaatgaaa 781 tcgatggtct ccagaagctt gaagagaaac tcactgctga gaaattgatg gaatggacaa 841 gtttgcagaa tatgagagag acacgtgtcg atttacactt acctcggttc aaagtggaag 901 agagctatga cctcaaggac acgttgagaa ccatgggaat ggtggatatc ttcaatgggg 961 atgcagacct ctcaggcatg accgggagcc gcggtctcgt gctatctgga gtcctacaca 1021 aggcctttgt ggaggttaca gaggagggag cagaagctgc agctgccacc gctgtagtag 1081 gattcggatc atcacctact tcaactaatg aagagttcca ttgtaatcac cctttcctat 1141 tcttcataag gcaaaataag accaacagca tcctcttcta tggcagattc tcatccccgt 1201 agatgcaatt agtctgtcac tccatttgga aaatgttcac ctgcagatgt tctggtaaac 1261 tgattgctgg caacaacag // LOCUS HSU19599 433 bp mRNA PRI 15-JUN-1995 DEFINITION Human (BAX delta) mRNA, complete cds. ACCESSION U19599 NID g841237 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 433) AUTHORS Apte,S.S., Mattei,M.G. and Olsen,B.R. TITLE Mapping of the human BAX gene to chromosome 19q13.3-q13.4 and isolation of a novel alternatively spliced transcript, BAX delta JOURNAL Genomics 26 (3), 592-594 (1995) MEDLINE 95331797 REFERENCE 2 (bases 1 to 433) AUTHORS Apte,S.S. TITLE Direct Submission JOURNAL Submitted (09-JAN-1995) Suneel S. Apte, Cell Biology, Harvard Medical School, 25 Shattuck St., Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..433 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pSAhB8" /chromosome="19" /map="19q13.3-q13.4" /cell_line="K562 erythroleukemia" exon <1..34 /number=1 CDS 1..432 /note="this sequence is a new splice variant called BAX delta in which exon 3 is deleted and exons 2 and 4 are spliced to each other" /codon_start=1 /product="BAX delta" /db_xref="PID:g841238" /translation="MDGSGEQPRGGGPTSSEQIMKTGALLLQGMIAAVDTDSPREVFF RVAADMFSDGNFNWGRVVALFYFASKLVLKALCTKVPELIRTIMGWTLDFLRERLLGW IQDQGGWDGLLSYFGTPTWQTVTIFVAGVLTASLTIWKKMG" exon 35..86 /number=2 exon 87..222 /number=4 exon 223..327 /number=5 exon 328..>433 /number=6 BASE COUNT 79 a 123 c 140 g 91 t ORIGIN 1 atggacgggt ccggggagca gcccagaggc ggggggccca ccagctctga gcagatcatg 61 aagacagggg cccttttgct tcaggggatg attgccgccg tggacacaga ctccccccga 121 gaggtctttt tccgagtggc agctgacatg ttttctgacg gcaacttcaa ctggggccgg 181 gttgtcgccc ttttctactt tgccagcaaa ctggtgctca aggccctgtg caccaaggtg 241 ccggaactga tcagaaccat catgggctgg acattggact tcctccggga gcggctgttg 301 ggctggatcc aagaccaggg tggttgggac ggcctcctct cctactttgg gacgcccacg 361 tggcagaccg tgaccatctt tgtggcggga gtgctcaccg cctcgctcac catctggaag 421 aagatgggct gag // LOCUS HSU19718 1008 bp mRNA PRI 26-OCT-1995 DEFINITION Human microfibril-associated glycoprotein (MFAP2) mRNA, complete cds. ACCESSION U19718 NID g642031 KEYWORDS MAGP. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1008) AUTHORS Faraco,J., Bashir,M., Rosenbloom,J. and Francke,U. TITLE Characterization of the human gene for microfibril-associated glycoprotein (MFAP2), assignment to chromosome 1p36.1-p35, and linkage to D1S170 JOURNAL Genomics 25 (3), 630-637 (1995) MEDLINE 95278931 REFERENCE 2 (bases 1 to 1008) AUTHORS Faraco,J. TITLE Direct Submission JOURNAL Submitted (10-JAN-1995) Juliette Faraco, Genetics, Beckman Center, Stanford University Medical Center, Stanford, CA 94305, USA FEATURES Location/Qualifiers source 1..1008 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1p36.1-p35" /cell_type="fibroblast" /dev_stage="adult" gene 42..593 /gene="MFAP2" CDS 42..593 /gene="MFAP2" /note="MAGP" /codon_start=1 /product="microfibril-associated glycoprotein" /db_xref="PID:g642032" /translation="MRAAYLFLLFLPAGLLAQGQYDLDPLPPFPDHVQYTHYSDQIDN PDYYDYQEVTPRPSEEQFQFQSQQQVQQEVIPAPTPEPGNAELEPTEPGPLDCREEQY PCTRLYSIHRPCKQCLNEVCFYSLRRVYVINKEICVRTVCAHEELLRADLCRDKFSKC GVMASSGLCQSVAASCARSCGSC" BASE COUNT 188 a 333 c 261 g 226 t ORIGIN 1 ctgtcctctc tgacaccacc ccggcctgcc tctttgttgc catgagagct gcctacctct 61 tcctgctatt cctgcctgca ggcttgctgg ctcagggcca gtatgatctg gacccgctgc 121 cgccgttccc tgaccacgtc cagtacaccc actatagcga ccagatcgac aacccagact 181 actatgatta tcaagaggtg actcctcggc cctccgagga acagttccag ttccagtccc 241 agcagcaagt ccaacaggaa gtcatcccag ccccaacccc agaaccagga aatgcagagc 301 tggagcccac agagcctggg cctcttgact gccgtgagga acagtacccg tgcacccgcc 361 tctactccat acacaggcct tgcaaacagt gtctcaacga ggtctgcttc tacagcctcc 421 gccgtgtgta cgtcattaac aaggagatct gtgttcgtac agtgtgtgcc cacgaggagc 481 tcctccgagc tgacctctgt cgggacaagt tctccaaatg tggcgtgatg gccagcagcg 541 gcctgtgcca atccgtggcg gcctcctgtg ccaggagctg tgggagctgc tagggtggtg 601 ctggcatcct gagtcctggc cctcctggga tctggggccc tcgggctacc tgacctggtg 661 cttttttccc catccccatg ttccttttat tctgaaaaag ttagtggact gcagccctgg 721 gggttgcagg ctgcggtgcc tcaggcccct ccttcagcct gtggccacct ctggggcacg 781 atgggggctc cccactgccc agtctgcccc tcgggttggg ggagtatccc aggcctctct 841 gtgggacctg ggcctgacgg gcccttctca gcccgttttg aggacagaca gtcccccgag 901 gtaggctaca tccccccacc ccagctggtc tgcttggatt tcctacagcc cccgtgggca 961 tggaccacct ttattttata caaaattaaa aacaagtttt tacaaaaa // LOCUS HSU19727 4696 bp mRNA PRI 25-MAY-1995 DEFINITION Human microtubule-associated protein 4 (MAP4) mRNA, complete cds. ACCESSION U19727 NID g641915 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4696) AUTHORS Chapin,S.J., Lue,C.M., Yu,M.T. and Bulinski,J.C. TITLE Differential expression of alternatively spliced forms of MAP4: a repertoire of structurally different microtubule-binding domains JOURNAL Biochemistry 34 (7), 2289-2301 (1995) MEDLINE 95161404 REFERENCE 2 (bases 378 to 4696) AUTHORS Chapin,S.J. and Bulinski,J.C. TITLE Non-neuronal 210 x 10(3) Mr microtubule-associated protein (MAP4) contains a domain homologous to the microtubule-binding domains of neuronal MAP2 and tau JOURNAL J. Cell. Sci 98 (Pt 1), 27-36 (1991) MEDLINE 91277031 REFERENCE 3 (sites) AUTHORS Chapin,S.J. and Bulinski,J.C. TITLE Microtubule stabilization by assembly-promoting microtubule-associated proteins: a repeat performance JOURNAL Cell Motil. Cytoskeleton 23 (4), 236-243 (1992) MEDLINE 93121365 REFERENCE 4 (bases 1 to 4696) AUTHORS Chapin,S. J. TITLE Direct Submission JOURNAL Submitted (10-JAN-1995) Steven J. Chapin, Department of Anatomy, University of California at San Francisco, 513 Parnassus Avenue, San Francisco, CA 94143, USA FEATURES Location/Qualifiers source 1..4696 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Hep G2" /cell_type="hepatocyte" /tissue_type="brain" /dev_stage="fetus" gene 76..3534 /gene="MAP4" CDS 76..3534 /gene="MAP4" /note="alternative RNA splicing" /citation=[1] /citation=[2] /citation=[3] /codon_start=1 /evidence=experimental /product="microtubule-associated protein 4" /db_xref="PID:g641916" /translation="MADLSLADALTEPSPDIEGEIKRDFIATLEAEAFDDVVGETVGK TDYIPLLDVDEKTGNSESKKKPCSETSQIEDTPSSKPTLLANGGHGVEGSDTTGSPTE FLEEKMAYQEYPNSQNWPEDTNFCFQPEQVVDPIQTDPFKMYHDDDLADLVFPSSATA DTSIFAGQNDPLKDSYGMSPCNTAVVPQGWSVEALNSPHSESFVSPEAVAEPPQPTAV PLELAKEIEMASEERPPAQALEIMMGLKTTDMAPSKETEMALAKDMALATKTEVALAK DMESPTKLDVTLAKDMQPSMESDMALVKDMELPTEKEVALVKDVRWPTETDVSSAKNV VLPTETEVAPAKDVTLLKETERASPIKMDLAPSKDMGPPKENKKETERASPIKMDLAP SKDMGPPKENKIVPAKDLVLLSEIEVAQANDIISSTEISSAEKVALSSETEVALARDM TLPPETNVILTKDKALPLEAEVAPVKDMAQLPETEIAPAKDVAPSTVKEVGLLKDMSP LSETEMALGKDVTPPPETEVVLIKNVCLPPEMEVALTEDQVPALKTEAPLAKDGVLTL ANNVTPAKDVPPLSETEATPVPIKDMEIAQTQKGISEDSHLESLQDVGQSAAPTFMIS PETITGTGKKCSLPAEEDSVLEKLGERKPCNSQPSELSSETSGIARPEEGRPVVSGTG NDITTPPNKELPPSPEKKTKPLATTQPAKTSTSKAKTQPTSLPKQPAPTTIGGLNKKP MSLASGLVPAAPPKRPAVASARPSILPSKDVKPKPIADAKAPEKRASPSKPASAPASR SGSKSTQTVAKTTTAAAVASTGPSSRSPSTLLPKKPTAIKTEGKPAEVKKMTAKSVPA DLSRPKSTSTSSMKKTTTLSGTAPAAGVVPSRVKATPMPSRPSTTPFIDKKPTSAKPS STTPRLSRLATNTSAPDLKNVRSKVGSTENIKHQPGGGRAKVEKKTEAAATTRKPESN AVTKTAGPIASAQKQPAGKVQIVSKKVSYSHIQSKCGSKDNIKHVPGGGNVQIQNKKV DISKVSSKCGSKANIKHKPGGGDVKIESQKLNFKEKAQAKVGSLDNVGHLPAGGAVKT EGGGSEAPLCPGPPAGEEPAISEAAPEAGAPTSASGLNGHPTLSGGGDQREAQTLDSQ IQETSI" BASE COUNT 1251 a 1320 c 1183 g 942 t ORIGIN 1 tcttcccggc gctctcctgg ctcccttctg ccccagctcc gtctcggcgg cggcgggcag 61 ttgcagtggt gcagaatggc tgacctcagt cttgcagatg cattaacaga accatctcca 121 gacattgagg gagagataaa gcgggacttc attgccacac tagaggcaga ggcctttgat 181 gatgttgtgg gagaaactgt tggaaaaaca gactatattc ctctcctgga tgttgatgag 241 aaaaccggga actcagagtc aaagaagaaa ccgtgctcag aaactagcca gattgaagat 301 actccatctt ctaaaccaac actcctagcc aatggtggtc atggagtaga agggagcgat 361 actacagggt ctccaactga attccttgaa gagaaaatgg cctaccagga atacccaaat 421 agccagaact ggccagaaga taccaacttt tgtttccaac ctgagcaagt ggttgatcct 481 atccagactg atccctttaa gatgtaccat gatgatgacc tggcagattt ggtctttccc 541 tccagtgcga cagctgatac ttcaatattt gcaggacaaa atgatccctt gaaagacagt 601 tacggtatgt ctccctgcaa cacagctgtt gtacctcagg ggtggtctgt ggaagcctta 661 aactctccac actcagagtc ctttgtttcc ccagaggctg ttgcagaacc tcctcagcca 721 acggcagttc ccttagagct agccaaggag atagaaatgg catcagaaga gaggccacca 781 gcacaagcat tggaaataat gatgggactg aagactactg acatggcacc atctaaagaa 841 acagagatgg ccctcgccaa ggacatggca ctagctacaa aaaccgaggt ggcattggct 901 aaagatatgg aatcacccac caaattagat gtgacactgg ccaaggacat gcagccatcc 961 atggaatcag atatggccct agtcaaggac atggaactac ccacagaaaa agaagtggcc 1021 ctggttaagg atgtcagatg gcccacagaa acagatgtat cttcagccaa gaatgtggta 1081 ctgcccacag aaacagaggt agccccagcc aaggatgtga cactgttgaa agaaacagag 1141 agggcatctc ctataaaaat ggacttagcc ccttccaagg acatgggacc acccaaagaa 1201 aacaagaaag aaacagagag ggcatctcct ataaaaatgg acttggctcc ttccaaggac 1261 atgggaccac ccaaagaaaa caagatagtc ccagccaagg atttggtatt actctcagaa 1321 atagaggtgg cacaggctaa tgacattata tcatccacag aaatatcctc tgctgagaag 1381 gtggctttgt cctcagaaac agaggtagcc ctggccaggg acatgacact gcccccggaa 1441 accaacgtga tcttgaccaa ggataaagca ctacctttag aagcagaggt ggccccagtc 1501 aaggacatgg ctcaactccc agaaacagaa atagccccgg ccaaggatgt ggctccgtcc 1561 acagtaaaag aagtgggctt gttgaaggac atgtctccac tatcagaaac agaaatggct 1621 ctgggcaagg atgtgactcc acctccagaa acagaagtag ttctcatcaa gaacgtatgt 1681 ctgcctccag aaatggaggt ggccctgact gaggatcagg tcccagccct caaaacagaa 1741 gcacccctgg ctaaggatgg ggttctgacc ctggccaaca atgtgactcc agccaaagat 1801 gttccaccac tctcagaaac agaggcaaca ccagttccaa ttaaagacat ggaaattgca 1861 caaacacaaa aaggaataag tgaggattcc catttagaat ctctgcagga tgtggggcag 1921 tcagctgcac ctactttcat gatttcacca gaaaccatca caggaacggg gaaaaagtgc 1981 agcttgccgg ccgaggagga ttctgtgtta gaaaaactag gggaaaggaa accatgcaac 2041 agtcaacctt ctgagctttc ttcagagacc tcaggaatag ccaggccaga agaaggaagg 2101 cctgtggtga gtgggacagg aaatgacatc accaccccac cgaacaagga gctcccacca 2161 agcccagaga agaaaacaaa gcctttggcc accactcaac ctgcaaagac ttcaacatcg 2221 aaagccaaaa cacagcccac ttctctccct aagcagccag ctcccaccac cattggtggg 2281 ttgaataaaa aacccatgag ccttgcttca ggcttagtgc cagctgcccc acccaaacgc 2341 cctgccgtcg cctctgccag gccttccatc ttaccttcaa aagacgtgaa gccaaagccc 2401 attgcagatg caaaggctcc tgagaagcgg gcctcaccat ccaagccagc ttctgcccca 2461 gcctccagat ctgggtccaa gagcactcag actgttgcaa aaaccacaac agctgctgct 2521 gttgcctcaa ctggcccaag cagtaggagc ccctccacgc tcctgcccaa gaagcccact 2581 gccattaaga ctgagggaaa acctgcagaa gtcaagaaga tgactgcaaa gtctgtacca 2641 gctgacttga gtcgcccaaa gagcacctcc accagttcca tgaagaaaac caccactctc 2701 agtgggacag cccccgctgc aggggtggtt cccagccgag tcaaggccac acccatgccc 2761 tcccggccct ccacaactcc tttcatagac aagaagccca cctcggccaa acccagctcc 2821 accacccccc ggctcagccg cctggccacc aatacttctg ctcctgatct gaagaatgtc 2881 cgctccaagg ttggctccac ggaaaacatc aagcatcagc ctggaggagg ccgggccaaa 2941 gtagagaaaa aaacagaggc agctgctaca acccgaaagc ctgaatctaa tgcagtcact 3001 aaaacagccg gcccaattgc aagtgcacag aagcaacctg cggggaaagt tcagatagtc 3061 tccaaaaaag tgagctacag ccatattcag tccaagtgtg gttccaagga caatattaag 3121 catgtccctg gaggtggtaa tgttcagatt cagaacaaga aagtggacat ctctaaggtc 3181 tcctccaagt gtgggtctaa ggctaacatc aagcacaagc ctggtggagg agatgtcaag 3241 attgaaagtc agaagttgaa cttcaaggag aaggcccagg ccaaggtggg atccctcgat 3301 aatgtgggcc acctacctgc aggaggtgct gtgaagactg agggcggtgg cagcgaggct 3361 cctctgtgtc cgggtccccc tgctggggag gagccggcca tctctgaggc agcgcctgaa 3421 gctggcgccc ccacttcagc cagtggcctc aatggccacc ccaccctgtc agggggtggt 3481 gaccaaaggg aggcccagac cttggacagc cagatccagg agacaagcat ctaatgatga 3541 cattctggtc tcgtcttccg tctcccccgt gttcccctct tgtctcccct gttcccctct 3601 cccttccctc ctcccatgtc actgcagatt gagacctaca ggctgacgtt ccgggcaaat 3661 gccagggccc gcaccgacca cggggccgac attgtctccc gccccccaca cttccctggc 3721 ggccccaact cgggctcccg ggtccttggc cccctttccc gggctgtcca ctagaccagt 3781 gagcgcttgg gcgccgtgct gggcagcccg ctaggctcgc cttccctcct gctttgcgtg 3841 cccggggcag cagcagccct gccccacacc tcctctcact ccccagcctg ggcccatctc 3901 cctgctttgg tcttgcccca tcactgcgcc actgctccgt ggaggaggtt gggagggggt 3961 tggggtggtt gaggctaagt tgggatctag gagaggagaa ccagattcta tcctcatctt 4021 tttttggttc tttggtccaa acccaaaaga aactgacatg ccctcccttc tccctggatc 4081 tacctggagg gaagactgga ggtggattcc gagtggtgac aggacgctga ccgtggagct 4141 taagccactg cctctccctc tggtcccaca aatgggcgcc cccccctccc catgcaggtg 4201 gtgtcgggcc cttcttgctg ccctgcccca agttgggggt cagtgctgcc tgtcccatgc 4261 ttaacataac cgcctagctg ctgtcacatt tttcttgttt tgtcctttta tttttttcta 4321 ataacctaaa aactggcaaa atagttctgc aggttgaagc catgtctaca tgaaagtcct 4381 cagtaagtgt tagagggaac agggcggaga tatccttatg ccacccccgc tggaggatgt 4441 gggcagctta gggccctgga ggcggtgcgg cagggaagag gggtgcagag gctgtggctg 4501 gtgagccggt caggcacaca aggggccctt ggagcgtgga ctggttggtt ttgccatttt 4561 gttgtgggta tgctgctttt cttttctaac caagaggctg gttttggcat ctctgtccca 4621 ttccctggga tctggtggtc agccctagga taaaaagcca gggctggaag acaagaaagg 4681 gccaggagat gaattc // LOCUS HSU19769 10096 bp mRNA PRI 05-DEC-1995 DEFINITION Human CENP-F kinetochore protein mRNA, complete cds. ACCESSION U19769 NID g924600 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 10096) AUTHORS Liao,H., Winkfein,R.J., Mack,G., Rattner,J.B. and Yen,T.J. TITLE CENP-F is a protein of the nuclear matrix that assembles onto kinetochores at late G2 and is rapidly degraded after mitosis JOURNAL J. Cell Biol. 130 (3), 507-518 (1995) MEDLINE 95348175 REFERENCE 2 (bases 1 to 10096) AUTHORS Yen,T.J. TITLE Direct Submission JOURNAL Submitted (11-JAN-1995) Tim J. Yen, Institute for Cancer Research, Fox Chase Cancer Center, 7701 Burholme Avenue, Philadelphia, PA 19111, USA FEATURES Location/Qualifiers source 1..10096 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pCENP-F" /tissue_type="breast cancer" /cell_line="ZR-75-1 (ATCC#CRL 1500)" /clone_lib="oligo dT primed, in lambda gt11, Clonetech, Palo Alto, CA, Catalog no. HL1059b" CDS 171..9803 /codon_start=1 /product="CENP-F kinetochore protein" /db_xref="PID:g924601" /translation="MSWALEEWKEGLPTRTLQKIQELEGQLDKLKKEKQQRQFQLDSL EAAPQKQTQKVENEKTEGTNLKRENQRLMEICESLEKTKQKISHELQVKESQVNFQEG QLNSGKKQIEKLEQELKRCKSELERSQQAAQSADVSLNPCNTPQKIFTTPLTPSQYYS GSKYEDLKEKYNKEVEERKRLEAEVKALQAKKASQTLPQATMNHRDIARHQASSSVFS WQQEKTPSHLSSNSQRTPIRRDFSASYFSGELEVTPSRSTLQIGKRDANSSFFGNSSS PHLLDQLKAQNQELRNKINELELRLQGHEKEMKGQVNKFQELQLQLEKAKVELIEKEK VLNKCRDELVRTTAQYDQASTKYTALEQKLKKLTEDLSCQRQNAESARCSLEQKIKEK EKEFQEELSRQQRSFQTLDQECIQMKARLTQELQQAKNMHNVLQAELDKLTSVKQQLE NNLEEFKQKLCRAEQAFQASQIKENELRRSMEEMKKENNLLKSHSEQKAREVCHLEAE LKNIKQCLNQSQNFAEEMKAKNTSQETMLRDLQEKINQQENSLTLEKLKLAVADLEKQ RDCSQDLLKKREHHIEQLNDKLSKTEKESKALLSALELKKKEYEELKEEKTLFSCWKS ENEKLLTQMESEKENLQSKINHLETCLKTQQIKSHEYNERVRTLEMDRENLSVEIRNL HNVLDSKSVEVETQKLAYMELQQKAEFSDQKHQKEIENMCLKTSQLTGQVEDLEHKLQ LLSNEIMDKDRCYQDLHAEYESLRDLLKSKDASLVTNEDHQRSLLAFDQQPAMHHSFA NIIGEQGSMPSERSECRLEADQSPKNSAILQNRVDSLEFSLESQKQMNSDLQKQCEEL VQIKGEIEENLMKAEQMHQSFVAETSQRISKLQEDTSAHQNVVAETLSALENKEKELQ LLNDKVETEQAEIQELKKSNHLLEDSLKELQLLSETLSLEKKEMSSIISLNKREIEEL TQENGTLKEINASLNQEKMNLIQKSESFANYIDEREKSISELSDQYKQEKLILLQRCE ETGNAYEDLSQKYKAAQEKNSKLECLLNECTSLCENRKNELEQLKEAFAKEHQEFLTK LAFAEERNQNLMLELETVQQALRSEMTDNQNNSKSEAGGLKQEIMTLKEEQNKMQKEV NDLLQENEQLMKVMKTKHECQNLESEPIRNSVKERESERNQCNFKPQMDLEVKEISLD SYNAQLVQLEAMLRNKELKLQESEKEKECLQHELQTIRGDLETSNLQDMQSQEISGLK DCEIDAEEKYISGPHELSTSQNDNAHLQCSLQTTMNKLNELEKICEILQAEKYELVTE LNDSRSECITATRKMAEEVGKLLNEVKILNDDSGLLHGELVEDIPGGEFGEQPNEQHP VSLAPLDESNSYEHLTLSDKEVQMHFAELQEKFLSLQSEHKILHDQHCQMSSKMSELQ TYVDSLKAENLVLSTNLRNFQGDLVKEMQLGLEEGLVPSLSSSCVPDSSSLSSLGDSS FYRALLEQTGDMSLLSNLEGAVSANQCSVDEVFCSSLQTYVDSLKAENLVLSTNLRNF QGDLVKEMQLGLEEGLVPSLSSSCVPDSSSLSSLGDSSFYRALLEQTGDMSLLSNLEG VVSANQCSVDEVFCSSLQEENLTRKETPSAPAKGVEELESLCEVYRQSLEKLEEKMES QGIMKNKEIQELEQLLSSERQELDCLRKQYLSENEQWQQKLTSVTLEMESKLAAEKKQ TEQLSLELEVARLQLQGLDLSSRSLLGIDTEDAIQGRNESCDISKEHTSETTERTPKH DVHQICDKDAQQDLNLDIEKITETGAVKPTGECSGEQSPDTNYEPPGEDKTQGSSECI SELSFSGPNALVPMDFLGNQEDIHNLQLRVKETSNENLRLLHVIEDRDRKVESLLNEM KELDSKLHLQEVQLMTKIEACIELEKIVGELKKENSDLSEKLEYFSCDHQELLQRVET SEGLNSDLEMHADKSSREDIGDNVAKVNDSWKERFLDVENELSRIRSEKASIEHEALY LEADLEVVQTEKLCLEKDNENKQKVIVCLEEELSVVTSERNQLRGELDTMSKKTTALD QLSEKMKEKTQELESHQSECLHCIQVAEAEVKEKTELLQTLSSDVSELLKDKTHLQEK LQSLEKDSQALSLTKCELENQIAQLNKEKELLVKESESLQARLSESDYEKLNVSKALE AALVEKGEFALRLSSTQEEVHQLRRGIEKLRVRIEADEKKQLHIAEKLKERERENDSL KDKVENLERELQMSEENQELVILDAENSKAEVETLKTQIEEMARSLKVFELDLVTLRS EKENLTKQIQEKQGQLSELDKLLSSFKSLLEEKEQAEIQIKEESKTAVEMLQNQLKEL NEAVAALCGDQEIMKATEQSLDPPIEEEHQLRNSIEKLRARLEADEKKQLCVLQQLKE SEHHADLLKGRVENLERELEIARTNQEHAALEAENSKGEVETLKAKIEGMTQSLRGLE LDVVTIRSEKEDLTNELQKEQERISELEIINSSFENILQEKEQEKVQMKEKSSTAMEM LQTQLKELNERVAALHNDQEACKAKEQNLSSQVECLELEKAQLLQGLDEAKNNYIVLQ SSVNGLIQEVEDGKQKLEKKDEEISRLKNQIQDQEQLVSKLSQVEGEHQLWKEQNLEL RNLTVELEQKIQVLQSKNASLQDTLEVLQSSYKNLENELELTKMDKMSFVEKVNKMTA KETELQREMHEMAQKTAELQEELSGEKNRLAGELQLLLEEIKSSKDQLKELTLENSEL KKSLDCMHKDQVEKEGKVREEIAEYQLRLHEAEKKHQALLLDTNKQYEVEIQTYREKL TSKEECLSSQKLEIDLLKSSKEELNNSLKATTQILEELKKTKMDNLKYVNQLKKENER AQGKMKLLIKSCKQLEEEKEILQKELSQLQAAQEKQKTGTVMDTKVDELTTEIKELKE TLEEKTKEADEYLDKYCSLLISHEKLEKAKEMLETQVAHLCSQQSKQDSRGSPLLGPV VPGPSPIPSVTEKRLSSGQNKASGKRQRSSGIWENGGGPTPATPESFSKKSKKAVMSG IHPAEDTEGTEFEPEGLPEVVKKGFADIPTGKTSPYILRRTTMATRTSPRLAAQKLAL SPLSLGKENLAESSKPTAGGSRSQKVKVAQRSPVDSGTILREPTTKSVPVNNLPERSP TDSPREGLRVKRGRLVPSPKAGLESKGSENCKVQ" BASE COUNT 3721 a 1770 c 2399 g 2206 t ORIGIN 1 ggagaagcgg gcgaattggg caccggtggc ggctgcgggc agtttgaatt agactctggg 61 ctccagcccg ccgaagccgc gccagaactg tactctccga gaggtcgttt tcccgtcccc 121 gagagcaagt ttatttacaa atgttggagt aataaagaag gcagaacaaa atgagctggg 181 ctttggaaga atggaaagaa gggctgccta caagaactct tcagaaaatt caagagcttg 241 aaggacagct tgacaaactg aagaaggaaa agcagcaaag gcagtttcag cttgacagtc 301 tcgaggctgc gccgcagaag caaacacaga aggttgaaaa tgaaaaaacc gagggtacaa 361 acctgaaaag ggagaatcaa agattgatgg aaatatgtga aagtctggag aaaactaagc 421 agaagatttc tcatgaactt caagtcaagg agtcacaagt gaatttccag gaaggacaac 481 tgaattcagg caaaaaacaa atagaaaaac tggaacagga acttaaaagg tgtaaatctg 541 agcttgaaag aagccaacaa gctgcgcagt ctgcagatgt ctctctgaat ccatgcaata 601 caccacaaaa aatttttaca actccactaa caccaagtca atattatagt ggttccaagt 661 atgaagatct aaaagaaaaa tataataaag aggttgaaga acgaaaaaga ttagaggcag 721 aggttaaagc cttgcaggct aaaaaagcaa gccagactct tccacaagcc accatgaatc 781 accgcgacat tgcccggcat caggcttcat catctgtgtt ctcatggcag caagagaaga 841 ccccaagtca tctttcatct aattctcaaa gaactccaat taggagagat ttctctgcat 901 cttacttttc tggggaacta gaggtgactc caagtcgatc aactttgcaa atagggaaaa 961 gagatgctaa tagcagtttc tttggcaatt ctagcagtcc tcatcttttg gatcaattaa 1021 aagcgcagaa tcaagagcta agaaacaaga ttaatgagtt ggaactacgc ctgcaaggac 1081 atgaaaaaga aatgaaaggc caagtgaata agtttcaaga actccaactc caactggaga 1141 aagcaaaagt ggaattaatt gaaaaagaga aagttttgaa caaatgtagg gatgaactag 1201 tgagaacaac agcacaatac gaccaggcgt caaccaagta tactgcattg gaacaaaaac 1261 tgaaaaaatt gacggaagat ttgagttgtc agcgacaaaa tgcagaaagt gccagatgtt 1321 ctctggaaca gaaaattaag gaaaaagaaa aggagtttca agaggagctc tcccgtcaac 1381 agcgttcttt ccaaacactg gaccaggagt gcatccagat gaaggccaga ctcacccagg 1441 agttacagca agccaagaat atgcacaacg tcctgcaggc tgaactggat aaactcacat 1501 cagtaaagca acagctagaa aacaatttgg aagagtttaa gcaaaagttg tgcagagctg 1561 aacaggcgtt ccaggcgagt cagatcaagg agaatgagct gaggagaagc atggaggaaa 1621 tgaagaagga aaacaacctc cttaagagtc actctgagca aaaggccaga gaagtctgcc 1681 acctggaggc agaactcaag aacatcaaac agtgtttaaa tcagagccag aattttgcag 1741 aagaaatgaa agcgaagaat acctctcagg aaaccatgtt aagagatctt caagaaaaaa 1801 taaatcagca agaaaactcc ttgactttag aaaaactgaa gcttgctgtg gctgatctgg 1861 aaaagcagcg agattgttct caagaccttt tgaagaaaag agaacatcac attgaacaac 1921 ttaatgataa gttaagcaag acagagaaag agtccaaagc cttgctgagt gctttagagt 1981 taaaaaagaa agaatatgaa gaattgaaag aagagaaaac tctgttttct tgttggaaaa 2041 gtgaaaacga aaaactttta actcagatgg aatcagaaaa ggaaaacttg cagagtaaaa 2101 ttaatcactt ggaaacttgt ctgaagacac agcaaataaa aagtcatgaa tacaacgaga 2161 gagtaagaac gctggagatg gacagagaaa acctaagtgt cgagatcaga aaccttcaca 2221 acgtgttaga cagtaagtca gtggaggtag agacccagaa actagcttat atggagctac 2281 agcagaaagc tgagttctca gatcagaaac atcagaagga aatagaaaat atgtgtttga 2341 agacttctca gcttactggg caagttgaag atctagaaca caagcttcag ttactgtcaa 2401 atgaaataat ggacaaagac cggtgttacc aagacttgca tgccgaatat gagagcctca 2461 gggatctgct aaaatccaaa gatgcttctc tggtgacaaa tgaagatcat cagagaagtc 2521 ttttggcttt tgatcagcag cctgccatgc atcattcctt tgcaaatata attggagaac 2581 aaggaagcat gccttcagag aggagtgaat gtcgtttaga agcagaccaa agtccgaaaa 2641 attctgccat cctacaaaat agagttgatt cacttgaatt ttcattagag tctcaaaaac 2701 agatgaactc agacctgcaa aagcagtgtg aagagttggt gcaaatcaaa ggagaaatag 2761 aagaaaatct catgaaagca gaacagatgc atcaaagttt tgtggctgaa acaagtcagc 2821 gcattagtaa gttacaggaa gacacttctg ctcaccagaa tgttgttgct gaaaccttaa 2881 gtgcccttga gaacaaggaa aaagagctgc aacttttaaa tgataaggta gaaactgagc 2941 aggcagagat tcaagaatta aaaaagagca accatctact tgaagactct ctaaaggagc 3001 tacaactttt atccgaaacc ctaagcttgg agaagaaaga aatgagttcc atcatttctt 3061 taaataaaag ggaaattgaa gagctgaccc aagagaatgg gactcttaag gaaattaatg 3121 catccttaaa tcaagagaag atgaacttaa tccagaaaag tgagagtttt gcaaactata 3181 tagatgaaag ggagaaaagc atttcagagt tatctgatca gtacaagcaa gaaaaactta 3241 ttttactaca aagatgtgaa gaaaccggaa atgcatatga ggatcttagt caaaaataca 3301 aagcagcaca ggaaaagaat tctaaattag aatgcttgct aaatgaatgc actagtcttt 3361 gtgaaaatag gaaaaatgag ttggaacagc taaaggaagc atttgcaaag gaacaccaag 3421 aattcttaac aaaattagca tttgctgaag aaagaaatca gaatctgatg ctagagttgg 3481 agacagtgca gcaagctctg agatctgaga tgacagataa ccaaaacaat tctaagagcg 3541 aggctggtgg tttaaagcaa gaaatcatga ctttaaagga agaacaaaac aaaatgcaaa 3601 aggaagttaa tgacttatta caagagaatg aacagctgat gaaggtaatg aagactaaac 3661 atgaatgtca aaatctagaa tcagaaccaa ttaggaactc tgtgaaagaa agagagagtg 3721 agagaaatca atgtaatttt aaacctcaga tggatcttga agttaaagaa atttctctag 3781 atagttataa tgcgcagttg gtgcaattag aagctatgct aagaaataag gaattaaaac 3841 ttcaggaaag tgagaaggag aaggagtgcc tgcagcatga attacagaca attagaggag 3901 atcttgaaac cagcaatttg caagacatgc agtcacaaga aattagtggc cttaaagact 3961 gtgaaataga tgcggaagaa aagtatattt cagggcctca tgagttgtca acaagtcaaa 4021 acgacaatgc acaccttcag tgctctctgc aaacaacaat gaacaagctg aatgagctag 4081 agaaaatatg tgaaatactg caggctgaaa agtatgaact cgtaactgag ctgaatgatt 4141 caaggtcaga atgtatcaca gcaactagga aaatggcaga agaggtaggg aaactactaa 4201 atgaagttaa aatattaaat gatgacagtg gtcttctcca tggtgagtta gtggaagaca 4261 taccaggagg tgaatttggt gaacaaccaa atgaacagca ccctgtgtct ttggctccat 4321 tggacgagag taattcctac gagcacttga cattgtcaga caaagaagtt caaatgcact 4381 ttgccgaatt gcaagagaaa ttcttatctt tacaaagtga acacaaaatt ttacatgatc 4441 agcactgtca gatgagctct aaaatgtcag agctgcagac ctatgttgac tcattaaagg 4501 ccgaaaattt ggtcttgtca acgaatctga gaaactttca aggtgacttg gtgaaggaga 4561 tgcagctggg cttggaggag gggctcgttc catccctgtc atcctcttgt gtgcctgaca 4621 gctctagtct tagcagtttg ggagactcct ccttttacag agctctttta gaacagacag 4681 gagatatgtc tcttttgagt aatttagaag gggctgtttc agcaaaccag tgcagtgtag 4741 atgaagtatt ttgcagcagt ctgcagacct atgttgactc attaaaggcc gaaaatttgg 4801 tcttgtcaac gaatctgaga aactttcaag gtgacttggt gaaggagatg cagctgggct 4861 tggaggaggg gctcgttcca tccctgtcat cctcttgtgt gcctgacagc tctagtctta 4921 gcagtttggg agactcctcc ttttacagag ctcttttaga acagacagga gatatgtctc 4981 ttttgagtaa tttagaaggg gttgtttcag caaaccagtg cagtgtagat gaagtatttt 5041 gcagcagtct gcaggaggag aatctgacca ggaaagaaac cccttcggcc ccagcgaagg 5101 gtgttgaaga gcttgagtcc ctctgtgagg tgtaccggca gtccctcgag aagctagaag 5161 agaaaatgga aagtcaaggg attatgaaaa ataaggaaat tcaagagctc gagcagttat 5221 taagttctga aaggcaagag cttgactgcc ttaggaagca gtatttgtca gaaaatgaac 5281 agtggcaaca gaagctgaca agcgtgactc tggagatgga gtccaagttg gcggcagaaa 5341 agaaacagac ggaacaactg tcacttgagc tggaagtagc acgactccag ctacaaggtc 5401 tggacttaag ttctcggtct ttgcttggca tcgacacaga agatgctatt caaggccgaa 5461 atgagagctg tgacatatca aaagaacata cttcagaaac tacagaaaga acaccaaagc 5521 atgatgttca tcagatttgt gataaagatg ctcagcagga cctcaatcta gacattgaga 5581 aaataactga gactggtgca gtgaaaccca caggagagtg ctctggggaa cagtccccag 5641 ataccaatta tgagcctcca ggggaagata aaacccaggg ctcttcagaa tgcatttctg 5701 aattgtcatt ttctggtcct aatgctttgg tacctatgga tttcctgggg aatcaggaag 5761 atatccataa tcttcaactg cgggtaaaag agacatcaaa tgagaatttg agattacttc 5821 atgtgataga ggaccgtgac agaaaagttg aaagtttgct aaatgaaatg aaagaattag 5881 actcaaaact ccatttacag gaggtacaac taatgaccaa aattgaagca tgcatagaat 5941 tggaaaaaat agttggggaa cttaagaaag aaaactcaga tttaagtgaa aaattggaat 6001 atttttcttg tgatcaccag gagttactcc agagagtaga aacttctgaa ggcctcaatt 6061 ctgatttaga aatgcatgca gataaatcat cacgtgaaga tattggagat aatgtggcca 6121 aggtgaatga cagctggaag gagagatttc ttgatgtgga aaatgagctg agtaggatca 6181 gatcggagaa agctagcatt gagcatgaag ccctctacct ggaggctgac ttagaggtag 6241 ttcaaacaga gaagctatgt ttagaaaaag acaatgaaaa taagcagaag gttattgtct 6301 gccttgaaga agaactctca gtggtcacaa gtgagagaaa ccagcttcgt ggagaattag 6361 atactatgtc aaaaaaaacc acggcactgg atcagttgtc tgaaaaaatg aaggagaaaa 6421 cacaagagct tgagtctcat caaagtgagt gtctccattg cattcaggtg gcagaggcag 6481 aggtgaagga aaagacggaa ctccttcaga ctttgtcctc tgatgtgagt gagctgttaa 6541 aagacaaaac tcatctccag gaaaagctgc agagtttgga aaaggactca caggcactgt 6601 ctttgacaaa atgtgagctg gaaaaccaaa ttgcacaact gaataaagag aaagaattgc 6661 ttgtcaagga atctgaaagc ctgcaggcca gactgagtga atcagattat gaaaagctga 6721 atgtctccaa ggccttggag gccgcactgg tggagaaagg tgagttcgca ttgaggctga 6781 gctcaacaca ggaggaagtg catcagctga gaagaggcat cgagaaactg agagttcgca 6841 ttgaggccga tgaaaagaag cagctgcaca tcgcagagaa actgaaagaa cgcgagcggg 6901 agaatgattc acttaaggat aaagttgaga accttgaaag ggaattgcag atgtcagaag 6961 aaaaccagga gctagtgatt cttgatgccg agaattccaa agcagaagta gagactctaa 7021 aaacacaaat agaagagatg gccagaagcc tgaaagtttt tgaattagac cttgtcacgt 7081 taaggtctga aaaagaaaat ctgacaaaac aaatacaaga aaaacaaggt cagttgtcag 7141 aactagacaa gttactctct tcatttaaaa gtctgttaga agaaaaggag caagcagaga 7201 tacagatcaa agaagaatct aaaactgcag tggagatgct tcagaatcag ttaaaggagc 7261 taaatgaggc agtagcagcc ttgtgtggtg accaagaaat tatgaaggcc acagaacaga 7321 gtctagaccc accaatagag gaagagcatc agctgagaaa tagcattgaa aagctgagag 7381 cccgcctaga agctgatgaa aagaagcagc tctgtgtctt acaacaactg aaggaaagtg 7441 agcatcatgc agatttactt aagggtagag tggagaacct tgaaagagag ctagagatag 7501 ccaggacaaa ccaagagcat gcagctcttg aggcagagaa ttccaaagga gaggtagaga 7561 ccctaaaagc aaaaatagaa gggatgaccc aaagtctgag aggtctggaa ttagatgttg 7621 ttactataag gtcagaaaaa gaagatctga caaatgaatt acaaaaagag caagagcgaa 7681 tatctgaatt agaaataata aattcatcat ttgaaaatat tttgcaagaa aaagagcaag 7741 agaaagtaca gatgaaagaa aaatcaagca ctgccatgga gatgcttcaa acacaattaa 7801 aagagctcaa tgagagagtg gcagccctgc ataatgacca agaagcctgt aaggccaaag 7861 agcagaatct tagtagtcaa gtagagtgtc ttgaacttga gaaggctcag ttgctacaag 7921 gccttgatga ggccaaaaat aattatattg ttttgcaatc ttcagtgaat ggcctcattc 7981 aagaagtaga agatggcaag cagaaactgg agaagaagga tgaagaaatc agtagactga 8041 aaaatcaaat tcaagaccaa gagcagcttg tctctaaact gtcccaggtg gaaggagagc 8101 accaactttg gaaggagcaa aacttagaac tgagaaatct gacagtggaa ttggagcaga 8161 agatccaagt gctacaatcc aaaaatgcct ctttgcagga cacattagaa gtgctgcaga 8221 gttcttacaa gaatctagag aatgagcttg aattgacaaa aatggacaaa atgtcctttg 8281 ttgaaaaagt aaacaaaatg actgcaaagg aaactgagct gcagagggaa atgcatgaga 8341 tggcacagaa aacagcagag ctgcaagaag aactcagtgg agagaaaaat aggctagctg 8401 gagagttgca gttactgttg gaagaaataa agagcagcaa agatcaattg aaggagctca 8461 cactagaaaa tagtgaattg aagaagagcc tagattgcat gcacaaagac caggtggaaa 8521 aggaagggaa agtgagagag gaaatagctg aatatcagct acggcttcat gaagctgaaa 8581 agaaacacca ggctttgctt ttggacacaa acaaacagta tgaagtagaa atccagacat 8641 accgagagaa attgacttct aaagaagaat gtctcagttc acagaagctg gagatagacc 8701 ttttaaagtc tagtaaagaa gagctcaata attcattgaa agctactact cagattttgg 8761 aagaattgaa gaaaaccaag atggacaatc taaaatatgt aaatcagttg aagaaggaaa 8821 atgaacgtgc ccaggggaaa atgaagttgt tgatcaaatc ctgtaaacag ctggaagagg 8881 aaaaggagat actgcagaaa gaactctctc aacttcaagc tgcacaggag aagcagaaaa 8941 caggtactgt tatggatacc aaggtcgatg aattaacaac tgagatcaaa gaactgaaag 9001 aaactcttga agaaaaaacc aaggaggcag atgaatactt ggataagtac tgttccttgc 9061 ttataagcca tgaaaagtta gagaaagcta aagagatgtt agagacacaa gtggcccatc 9121 tgtgttcaca gcaatctaaa caagattccc gagggtctcc tttgctaggt ccagttgttc 9181 caggaccatc tccaatccct tctgttactg aaaagaggtt atcatctggc caaaataaag 9241 cttcaggcaa gaggcaaaga tccagtggaa tatgggagaa tggtggagga ccaacacctg 9301 ctaccccaga gagcttttct aaaaaaagca agaaagcagt catgagtggt attcaccctg 9361 cagaagacac ggaaggtact gagtttgagc cagagggact tccagaagtt gtaaagaaag 9421 ggtttgctga catcccgaca ggaaagacta gcccatatat cctgcgaaga acaaccatgg 9481 caactcggac cagcccccgc ctggctgcac agaagttagc gctatcccca ctgagtctcg 9541 gcaaagaaaa tcttgcagag tcctccaaac caacagctgg tggcagcaga tcacaaaagg 9601 tcaaagttgc tcagcggagc ccagtagatt caggcaccat cctccgagaa cccaccacga 9661 aatccgtccc agtcaataat cttcctgaga gaagtccgac tgacagcccc agagagggcc 9721 tgagggtcaa gcgaggccga cttgtcccca gccccaaagc tggactggag tccaagggca 9781 gtgagaactg taaggtccag tgaaggcact ttgtgtgtca gtacccctgg gaggtgccag 9841 tcattgaata gataaggctg tgcctacagg acttctcttt agtcagggca tgctttatta 9901 gtgaggagaa aacaattcct tagaagtctt aaatatattg tactctttag atctcccatg 9961 tgtaggtatt gaaaaagttt ggaagcactg atcacctgtt agcattgcca ttcctctact 10021 gcaatgtaaa tagtataaag ctatgtatat aaagcttttt ggtaatatgt tacaattaaa 10081 atgacaagca ctatat // LOCUS HSU19775 993 bp RNA PRI 29-DEC-1995 DEFINITION Human MAP kinase Mxi2 (MXI2) mRNA, complete cds. ACCESSION U19775 NID g1136797 KEYWORDS MAP Kinase; Max protein; stress signaling. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 993) AUTHORS Zervos,A.S., Faccio,L., Gatto,J.P., Kyriakis,J.M. and Brent,R. TITLE Mxi2, a mitogen-activated protein kinase that recognizes and phosphorylates Max protein JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (23), 10531-10534 (1995) MEDLINE 96068649 REFERENCE 2 (bases 1 to 993) AUTHORS Han,J., Lee,J.D., Bibbs,L. and Ulevitch,R.J. TITLE A MAP kinase targeted by endotoxin and hyperosmolarity in mammalian cells JOURNAL Science 265 (5173), 808-811 (1994) MEDLINE 94323764 REFERENCE 3 (bases 1 to 993) AUTHORS Zervos,A.S. TITLE Direct Submission JOURNAL Submitted (12-JAN-1995) Antonis S. Zervos, Cutaneous Biology Research Center, Massachusetts General Hospital, Charlestown, MA 02129, USA FEATURES Location/Qualifiers source 1..993 /organism="Homo sapiens" /db_xref="taxon:9606" gene 44..937 /gene="MXI2" CDS 44..937 /gene="MXI2" /note="new ERK-type kinase; interacts with Max and Myc proteins; can phosphorylate Max protein in vitro" /codon_start=1 /product="Mxi2" /db_xref="PID:g1136798" /translation="MSQERPTFYRQELNKTIWEVPERYQNLSPVGSGAYGSVCAAFDT KTGLRVAVKKLSRPFQSIIHAKRTYRELRLLKHMKHENVIGLLDVFTPARSLEEFNDV YLVTHLMGADLNNIVKCQKLTDDHVQFLIYQILRGLKYIHSADIIHRDLKPSNLAVNE DCELKILDFGLARHTDDEMTGYVATRWYRAPEIMLNWMHYNQTVDIWSVGCIMAELLT GRTLFPGTDHIDQLKLILRLVGTPGAELLKKISSESARNYIQSLTQMPKMNFANVFIG ANPLGKLTIYPHLMDIELVMI" BASE COUNT 274 a 204 c 246 g 269 t ORIGIN 1 gaattcggca cgaggcgcct tcttgcccgg cggctgctgg aaaatgtctc aggagaggcc 61 cacgttctac cggcaggagc tgaacaagac aatctgggag gtgcccgagc gttaccagaa 121 cctgtctcca gtgggctctg gcgcctatgg ctctgtgtgt gctgcttttg acacaaaaac 181 ggggttacgt gtggcagtga agaagctctc cagaccattt cagtccatca ttcatgcgaa 241 aagaacctac agagaactgc ggttacttaa acatatgaaa catgaaaatg tgattggtct 301 gttggacgtt tttacacctg caaggtctct ggaggaattc aatgatgtgt atctggtgac 361 ccatctcatg ggggcagatc tgaacaacat tgtgaaatgt cagaagctta cagatgacca 421 tgttcagttc cttatctacc aaattctccg aggtctaaag tatatacatt cagctgacat 481 aattcacagg gacctaaaac ctagtaatct agctgtgaat gaagactgtg agctgaagat 541 tctggatttt ggactggctc ggcacacaga tgatgaaatg acaggctacg tggccactag 601 gtggtacagg gctcctgaga tcatgctgaa ctggatgcat tacaaccaga cagttgatat 661 ttggtcagtg ggatgcataa tggccgagct gttgactgga agaacattgt ttcctggtac 721 agaccatatt gatcagttga agctcatttt aagactcgtt ggaaccccag gggctgagct 781 tttgaagaaa atctcctcag agtctgcaag aaactatatt cagtctttga ctcagatgcc 841 gaagatgaac tttgcgaatg tatttattgg tgccaatccc ctgggtaagt tgaccatata 901 tcctcacctc atggatattg aattggttat gatataaatt ggggatttga agaagagttt 961 ctccttttga ccaaataaag taccattagt tga // LOCUS HSU19796 809 bp mRNA PRI 22-AUG-1995 DEFINITION Human melanoma antigen p15 mRNA, complete cds. ACCESSION U19796 NID g836929 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 809) AUTHORS Robbins,P.F., el-Gamil,M., Li,Y.F., Topalian,S.L., Rivoltini,L., Sakaguchi,K., Appella,E., Kawakami,Y. and Rosenberg,S.A. TITLE Cloning of a new gene encoding an antigen recognized by melanoma-specific HLA-A24-restricted tumor-infiltrating lymphocytes JOURNAL J. Immunol. 154 (11), 5944-5950 (1995) MEDLINE 95270988 REFERENCE 2 (bases 1 to 809) AUTHORS Robbins,P.F. TITLE Direct Submission JOURNAL Submitted (12-JAN-1995) Paul F. Robbins, Surgery Branch/NCI, NIH, Building 10 Room 2B42, 10 Center Dr., Bethesda, MD 20882-1502, USA FEATURES Location/Qualifiers source 1..809 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="888 mel" /cell_type="melanoma" CDS 145..531 /codon_start=1 /product="melanoma antigen p15" /db_xref="PID:g836930" /translation="MRTLDLIDEAYGLDFYILKTPKEDLCSKFGMELKRGMLLRLARQ DPQLHPEDPERRAAIYDKYKEFAIPEEEAEWVGLTLEEAIEKQRLLEEKDPVPLFKIY VAELIQQLQQQALSEPAVVQKTASGQ" BASE COUNT 171 a 231 c 268 g 139 t ORIGIN 1 agcggcgagg gctggatcct gggccaaata tatgccaaca acgacaagct ctccaagagg 61 ctgaagaaag tgtggaagcc acagctgttt gagcgagagt tctacagtga gatcctggac 121 aagaagttca cagtgactgt gaccatgcgg accctggacc tcatcgatga ggcttacggg 181 ctcgactttt acatcctcaa gaccccgaag gaggacctgt gctccaagtt tgggatggag 241 ctgaagcgag ggatgctgct gcggcttgcc cggcaggacc cccagctgca ccccgaggac 301 cccgagcggc gggcagccat ctacgacaag tacaaggaat ttgccatccc agaggaggag 361 gcagagtggg tgggcctcac gctggaggag gccattgaga agcagagact tttggaggag 421 aaggaccctg tacccctgtt caagatctat gtggcggagc tgatccagca gctgcagcag 481 caggcactgt cagagccggc ggtggtgcag aagacagcca gtggccagtg accacacagc 541 tcctccatgc ctgaccaaca ggcccagctt tccctgccag gccctttgca ctgaggacac 601 agatcccggg gagctgtgag ggccaccggt gggcagtggg tggatcctgg tttcgtgtgc 661 tgcccatgca ccttccagcc cggggccagc ttggcaggga tccccaggag gcctgggccg 721 cccagaggct cctctcaggc tgggccccga cgtttgcggc agtgttcctt gtcccgtggg 781 gccgggagcg agtaaagtct gggccaggc // LOCUS HSU19822 7041 bp mRNA PRI 07-JUN-1995 DEFINITION Human acetyl-CoA carboxylase mRNA, complete cds. ACCESSION U19822 NID g849082 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7041) AUTHORS Abu-Elheiga,L., Jayakumar,A., Baldini,A., Chirala,S.S. and Wakil,S.J. TITLE Human acetyl-CoA carboxylase: characterization, molecular cloning, and evidence for two isoforms JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (9), 4011-4015 (1995) MEDLINE 95249602 REFERENCE 2 (bases 1 to 7041) AUTHORS Abu-Elheiga,L. TITLE Direct Submission JOURNAL Submitted (12-JAN-1995) Abu-Elheiga L., Baylor College of Medicine, Department of Biochemistry, One Baylor Plaza, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..7041 /organism="Homo sapiens" /db_xref="taxon:9606" /map="17q12" /chromosome="17" /cell_line="HepG2" /tissue_type="liver" CDS 1..7041 /codon_start=1 /function="synthesis of malonyl-CoA from acetyl-CoA and ATP" /evidence=experimental /product="acetyl-CoA carboxylase" /db_xref="PID:g849083" /translation="MDEPSPLAQPLELNQHSRFIIGSVSEDNSEDEISNLVKLDLLEE KEGSLSPASVGSDTLSDLGISALQDGLALHIRSSWSGLHLVKQGGDRKKIDSQRDFTV ASPAEFVTRFGGNKVIEKVLIANNGIAAVKCMRSIRRWSYEMFRNERAIRFVVMVTPE DLKANAEYIKMADHYVPVPGGANNNNYANVELILDIAKRIPVQAVWAGWGHASENPKL PELLLKNGIAFMGPPNQAMWALGDKIASSIVAQTAGIPTLPWSGSGLRVDWQENDFSK RILNVPQELYEKGYVKDVDDGLKAAEKVGYPVMIKASEGGGGKGIRKVNNADDFPNLF RQVQAEVPGSPIFVMRLAKQSRHLEVQILADQYGNAISLFGRDCSVQRRHQKIIEEAP ATIATPAVFEHMEQCAVKLAKMVGYVSAGTVEYLYSQDGSFYFLELNPRLQVEQPCTE MVADVNLPAAQLQIAMGIPLYRIKDIRMMYGVSPWGDSPIDFENSAHVPCPRGHVIAA RITSENPDEGFKPSSGTVQELNFRSNKNVWGYFSVAAAGGLHEFAGSQFGHCFSWGEN REEAISNMVVALKELSIRGDFRTTVEYLIKLLETESFQMNRIDTGWLDRLIAEKVRAE RPDTMLGVVCGALHVGDVSLRNSVSNFLHSLERGQVLPAHTLLNTVDVELIYEGVKYV LKVTRQSPNSYVVIMNGSCVEVDVHRLSDGGLLLSYDGSSYTTYMKEEVDRYRITIGN KTCVFEKENDPSVMRSPSAGKLIQYIVEDGGHVFAGQCYAEIEVMKMVMTLTAVESGC IHYVKRPGAALDPGCVLAKMQLDNPSKVQQAELHTGSLPRIQSTALRGEKLHRVFHYV LDNLVNVMNGYCLPDPFFSSKVKDWVERLMKTLRDPSLPLLELQDIMTSVSGRIPPNV EKSIKKEMAQYASNITSVLCQFPSQQIANILDSHAATLNRKSEREVFFMNTQSIVQLV QRYRSGIRGHMKAVVMDLLRQYLRVETQFQNGHYDKCVFALREENKSDMNTVLNYIFS HAQVTKKNLLVTMLIDQLCGRDPTLTDELLSILTELTQLSKTTNAKVALRARQVLIAS HLPSYDVRHNQVESIFLSAIDMYGHQFCIENLQKLILSETSIFDVLPNFFYHSNQVVR MAALEVYVRRAYIAYELNSVQHRQLKDNTCVVEFQFMLPTSHPNRGNIPTLNRMSFSS NLNHYGMTHVASVSDVLLDNAFTPPCQRMGGMVSFRTFEDFVRIFDEVMGCFCDSPPQ SPTFPEAGHTSLYDEDKVPRDEPIHILNVAIKTDGDIEDDRLAAMFREFTQQNKATLA DHGIRRLTFLVAQKDFRKQVNYEVDRRFHREFPKFFTFRARDKFEEDRIYRHLEPALA FQLELNRMRNFDLTAIPCANHKMHLYLGAAKVEVGTEVTDYRFFVRAIIRHSDLVTKE ASFEYLQSEGERLLLEAMDELEVAFNNTNVRTDCNHILLNFVPTVIMDPSKIEESVRS MVMRYGSRLWKLRVLQAELKINIRLTPTGKAIPIRLFLTNESGYYLDISLYKEVTDSR TAQIMFQAYGDKQGPLHGMLINTPYVTKDLLQSKRFQAQSLGTTYIYDIPEMFRQSLI KLWESMSTQAFLPSPPLPSDMLTYTELVLDDQGQLVHMNRLPGGNEIGMVAWKMSLKS PEYPEGRDVIVIGNDITYRIGSFGPQEDLLFLRASELARAEGIPRIYVSANSGARIGL AEEIRHMFHVAWVDSEDPYKGYRYLYLTPQDYKRVGALNSVHCEHVEDEGESRYKITD IIGKEEGIGPENLRGSGMIAGESSLAYNEIITISLVTSRAIGIGAYLVRLGQRTIQVE NSHLILTGAGALNKVLGREVYTSNNQLGGIQITHNNGVTHCTVCDGFEGVFTVLHWLS YMPKSVHSSVPLLNSKDPIDRIIEFVPTKTPYDPRWMLAGRPHPTQKGQWLSGFFDYG SFSEIMQPWAQTVVVGRARLGGIPVGVVAVETRTVELSVPADPANLDSEAKIIQHAGQ VWFPDSAFKTYQAIKDFNREGLPLMVFANWRGFSGGMKDMYHQVLKFGAYIVDGLREC SQPVLVYIPPQAELRGGSWVVIDPTINPRHMEMYADRESRGSVLEPEGTVEIKFRRKD LVKTMRRVDPVYIHLAERLGTPELSPTERKELESKLKEREEFLIPIYHQVAVQFADLH DTPGRMQEKGVISDILDWKTSRTFFYWRLRRLLLEDLVKKKIHSANPELTDGQIQAML RRWFVEVEGTVKAYVWDNNKDLAEWLEKQLTEEDGVHSVIEENIKCISRDYVLKQIRS LVQANPEVAMDSIIHMTQHISPTQRAEVIRILSTMDSPST" BASE COUNT 1897 a 1593 c 1779 g 1772 t ORIGIN 1 atggatgaac catctccctt ggcccaacct ctggagctga accagcactc tcgattcata 61 ataggttctg tgtcagaaga taactcagag gatgagatca gcaacctggt gaagctggac 121 ctactggagg agaaggaggg ctccttgtca cctgcttctg ttggctcaga tacactctct 181 gatttgggga tctctgccct acaggatggc ttggccttgc acataaggtc cagctggtct 241 ggcttgcacc tagtaaagca gggcggagac agaaagaaaa tagattctca acgagatttc 301 actgtggctt ctccagcaga atttgttact cgctttgggg gaaataaagt gattgagaag 361 gttctcatcg ctaacaatgg cattgcagca gtgaaatgca tgcggtctat ccgtaggtgg 421 tcttatgaaa tgtttcgaaa tgaacgtgca attagattcg ttgtcatggt cacacctgaa 481 gaccttaaag ccaatgcaga atacattaag atggcagatc actatgtgcc ggtgcctgga 541 ggagcaaaca acaacaacta tgcaaatgtg gaattaattc ttgatattgc taaaaggatc 601 ccagtgcaag cagtgtgggc tggctggggt catgcttctg agaatcccaa actaccggaa 661 cttctcttga aaaatggcat tgccttcatg ggtcctccaa accaggccat gtgggcttta 721 ggggataaga ttgcatcttc catagtggct caaactgcag gtatcccaac tcttccctgg 781 agcggcagtg gtcttcgtgt ggactggcag gaaaatgatt tttcaaaacg tatcttaaat 841 gttccccagg agctatatga aaaaggttat gtgaaagatg tggatgatgg gctaaaggca 901 gctgagaagg ttggatatcc agtaatgatc aaggcctcag agggaggagg agggaaggga 961 attagaaaag ttaacaatgc agatgacttc cctaatctct tcagacaggt tcaagctgaa 1021 gttcctggat ctcccatatt tgtgatgaga ctagccaaac aatctcgtca tctggaggtg 1081 cagatcttag cggaccaata tggcaatgct atctctttgt ttggtcgtga ttgctctgta 1141 caacgcaggc atcagaagat tattgaagaa gcacctgcta ctattgctac tccagcagta 1201 tttgaacaca tggaacagtg tgcggtgaaa cttgccaaaa tggtgggtta tgtgagtgct 1261 gggactgtgg aatacctgta cagccaggat ggcagcttct actttctaga attgaaccct 1321 cggctacagg ttgaacaacc ttgtacagag atggtggctg atgtcaatct ccccgcagca 1381 cagctccaga ttgccatggg gattcctcta tatagaatca aggatatccg tatgatgtat 1441 ggggtatctc cttggggtga ttctcccatt gattttgaaa attctgctca cgttccttgt 1501 ccaaggggcc atgttattgc tgctcggatc actagtgaaa atccagatga gggttttaag 1561 cccagctcag gaacagttca ggagctaaat ttccgcagca ataagaatgt ttggggatat 1621 ttcagtgttg ctgctgcagg gggacttcat gaatttgctg gttctcagtt tggtcactgc 1681 ttttcttggg gagaaaacag agaagaagca atttcaaaca tggtggtggc attgaaggag 1741 ctgtctattc ggggtgactt tcgaactaca gttgaatacc tgatcaaatt gttagagact 1801 gaaagctttc aaatgaacag aattgatact ggctggctgg acagactgat agcagaaaaa 1861 gtacgggctg agcgtcctga caccatgttg ggggttgtgt gtggtgccct ccacgtcgga 1921 gatgtgagcc tgcgaaatag cgtctctaac ttccttcact ccttagaaag gggtcaagtc 1981 cttccggctc atacacttct gaatacagta gatgttgaac ttatctatga gggagtcaag 2041 tatgtactta aggtgactcg acagtccccc aactcctatg tggtgatcat gaatggctca 2101 tgtgtagaag tagatgtaca tcggctgagt gacggtggac tgctcttgtc ctatgatggc 2161 agcagttaca ccacgtatat gaaggaggaa gtagacagat atcgcatcac aattggcaat 2221 aaaacctgtg tgtttgagaa ggaaaatgac ccatcggtga tgcgctcacc ttctgctggg 2281 aagttaatcc agtacattgt agaagatgga ggtcatgtgt ttgccggcca gtgctatgca 2341 gagattgagg taatgaagat ggtaatgact ttgacagctg tggagtctgg ctgtatccat 2401 tacgtcaagc gtcctggagc agctcttgac cctggctgtg tactcgccaa aatgcaactg 2461 gacaacccca gcaaggttca gcaggctgaa cttcacacag gtagtctgcc acggatccag 2521 agcaccgctc tccgaggcga gaagctccat cgagtgttcc actatgtcct ggataatctg 2581 gtcaatgtaa tgaatggata ctgccttcca gatcctttct ttagcagcaa ggtaaaagac 2641 tgggtagaac gattgatgaa aaccctcaga gatccctccc tgcctctcct agaattgcaa 2701 gatattatga ccagtgtgtc tggccgcatt ccccccaatg tggagaagtc tatcaagaag 2761 gaaatggctc agtatgctag caacatcaca tcagtcctct gtcagtttcc cagccagcag 2821 attgcaaaca tcctagatag ccatgcagct acattgaacc ggaaatctga acgggaagtc 2881 ttctttatga atactcagag cattgtccag ctggtacaga ggtaccgaag tggcatccga 2941 ggccacatga aggctgtggt gatggatctg ctccggcagt acctgcgagt agagacacaa 3001 ttccagaatg gtcactatga caaatgtgta ttcgcccttc gagaagagaa taagagtgac 3061 atgaacactg tactgaacta catcttctct cacgctcaag tcaccaagaa gaatcttctg 3121 gtcacaatgc ttattgatca gttgtgtggc cgggacccta ctctaactga tgagctgctg 3181 agtattctca cagagctaac tcaactcagt aagaccacca atgccaaagt agcacttcga 3241 gcacgccagg ttcttattgc ctcccatttg ccatcatatg acgttcgcca taaccaagta 3301 gagtctatct tcctatcagc tattgacatg tatggacatc aattttgcat tgagaacctg 3361 cagaaactca tcctatcaga aacatctatt tttgatgtcc taccaaactt cttctatcac 3421 agcaaccaag tagtgaggat ggcagctctg gaggtgtacg ttcgaagggc ttatattgcc 3481 tatgaactta acagcgtaca acaccgccag cttaaggaca acacctgcgt ggtggaattc 3541 cagttcatgc tgcccacatc tcatccaaac agagggaaca tccctacgct aaacagaatg 3601 tccttctcct ccaacctcaa ccactatggc atgacccatg tagctagtgt cagcgatgtt 3661 ctgttggaca acgccttcac accaccttgt caacgcatgg gcggaatggt ctcttttcgg 3721 acttttgaag attttgtcag gatctttgat gaagtaatgg gctgcttctg tgactcccca 3781 ccccagagtc ccacattccc tgaggcaggt cacacgtctc tttatgatga ggataaggtt 3841 cccagggatg aaccaattca cattctcaat gtggctatca agactgacgg tgatattgag 3901 gatgacaggc tggcagctat gttcagagaa ttcacccagc aaaataaagc taccctggct 3961 gaccatggga tccggcgcct gactttcctg gttgcacaaa aggatttcag aaagcaggtc 4021 aactatgagg tggatcggag atttcataga gaattcccta aattttttac attccgagca 4081 agggataagt ttgaggagga tcgtatctat cgtcatctgg agcctgctct ggctttccag 4141 ttagagctga accggatgag aaattttgac ctcactgcca ttccatgtgc taatcacaag 4201 atgcacctgt atctcggggc agccaaggtg gaagtgggca cagaagtgac agactacagg 4261 ttctttgttc gtgcaatcat caggcattct gatctggtca ccaaggaagc ttcttttgaa 4321 tatctgcaaa gtgaagggga gcggctactc ctggaagcca tggatgagtt ggaagttgct 4381 tttaacaata caaatgtccg cactgactgt aaccacatcc tcctcaactt tgtgcccacg 4441 gttatcatgg acccatcaaa gattgaggaa tccgtgcgga gcatggtaat gcggtatgga 4501 agtcgcctgt ggaaattgcg cgtcctccag gcagaactga aaatcaacat tcgcctgacg 4561 ccaactggaa aagcaattcc catccgcctt ttcctgacaa acgagtctgg ctattacttg 4621 gatatcagcc tatacaagga agtgactgac tccaggacag cacagatcat gtttcaggca 4681 tatggagaca agcagggacc actgcatgga atgttaatca atactccata tgtgaccaaa 4741 gacctgctgc aatcaaagag gttccaggca caatccttag ggacaacgta catatatgat 4801 atcccagaga tgtttcggca gtccctgatc aaactctggg agtctatgtc aactcaagca 4861 tttcttccat ctccccctct gccttctgac atgctgactt acactgaact ggtactggat 4921 gatcaaggtc agctggtcca catgaacagg cttccaggag gaaatgagat tggcatggta 4981 gcttggaaaa tgagccttaa aagtcctgaa tatccagaag gccgagatgt tattgttatt 5041 ggcaatgaca ttacataccg aattgggtcc tttgggcctc aagaagattt gttatttctc 5101 agagcttccg aacttgctag ggccgaaggc attccacgca tctatgtatc agccaacagt 5161 ggagcaagaa tcggactggc agaagaaatt cgccatatgt ttcatgtggc ctgggtagat 5221 tctgaggatc cttacaaggg atacaggtat ttatatctga ctcctcaaga ttataagaga 5281 gtcggtgctc tcaactctgt ccattgtgaa cacgtggaag atgaaggaga atccaggtac 5341 aagataacag atattattgg gaaagaagag ggaattggac ccgagaacct tcgaggttct 5401 ggaatgatcg ctggagaatc ctcattggcc tataatgaga tcattaccat cagcctggtg 5461 acgtcccggg ccattgggat tggggcttac cttgtccggc tgggacagag aaccatccag 5521 gttgagaatt ctcacttgat tctaacagga gctggagccc tcaacaaagt cctcgggcgg 5581 gaagtgtaca cctccaataa ccagctgggg ggcatccaga ttacgcacaa caatggggtg 5641 acccactgca ctgtgtgtga cggctttgaa ggggttttca ctgtcctgca ctggctgtct 5701 tacatgccca agagcgtaca cagttcagtt cctcttctga actcaaagga tcctatagac 5761 agaatcatcg agtttgttcc cacaaagacc ccatatgatc ctcgatggat gctagcaggc 5821 cgccctcacc caacccaaaa aggtcagtgg ttgagtggct tttttgacta tggatctttc 5881 tcagagatta tgcagccctg ggcacagacg gtggtggttg gtagagccag gttaggggga 5941 atacctgtgg gagttgttgc tgtagaaacc cggacagtag aactaagtgt accagctgat 6001 ccagcaaacc tggattctga agccaagata atccagcacg ccggccaagt ttggtttccg 6061 gattctgcgt ttaagacgta tcaggccatc aaggacttca accgggaagg gctacctcta 6121 atggtctttg ccaactggag aggcttctct ggtggaatga aagatatgta tcaccaagtg 6181 ctgaagtttg gtgcttacat tgtggatggc ttgcgggaat gttcccagcc tgtgctggtc 6241 tacattcctc cccaggctga gctgcggggt ggctcctggg tggtgatcga cccaaccatc 6301 aatcctcggc acatggagat gtatgctgac cgagaaagca ggggatctgt tctggagcca 6361 gaagggacgg tagaaatcaa attccgcaga aaggatctgg tgaaaaccat gcgtcgggtg 6421 gacccagtct acatccactt ggctgagcga ttggggaccc cagagctgag cccaactgag 6481 cggaaggagt tggagagcaa gttgaaggag cgggaggaat tcctaattcc aatttaccat 6541 caggtagccg tgcagtttgc tgacttgcac gacactccag gccggatgca ggagaagggt 6601 gttattagcg atatcctgga ttggaaaaca tcccgtacct tcttctactg gcgactgagg 6661 cgtcttctgc tggaggacct ggtcaagaag aaaatccaca gtgccaaccc tgagctgact 6721 gatggccaga ttcaagccat gttaagacgc tggtttgtgg aagtggaagg aacagtgaag 6781 gcttatgttt gggacaataa taaggatctg gcggagtggc tagagaaaca gctgacagag 6841 gaggatggtg ttcactcggt aatagaggaa aacatcaaat gcatcagcag agactacgtc 6901 ctcaagcaaa tccgcagctt ggtccaggcc aatccagagg ttgccatgga ttccatcatc 6961 catatgacgc agcacatatc acccactcag cgggcagaag tcataaggat cctttccact 7021 atggactccc cttctacgta g // LOCUS HSU19878 1672 bp mRNA PRI 02-APR-1995 DEFINITION Human transmembrane protein mRNA, complete cds. ACCESSION U19878 NID g755465 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1672) AUTHORS Eib,D.W. and Martens,G.J. TITLE A novel transmembrane protein containing two follistatin modules and an EGF domain JOURNAL Unpublished REFERENCE 2 (bases 1 to 1672) AUTHORS Eib,D.W. TITLE Direct Submission JOURNAL Submitted (17-JAN-1995) D. W. Eib, Molecular Animal Physiology, University of Nijmegen, Toernooiveld, Nijmegen, 6524 ED, The Netherlands FEATURES Location/Qualifiers source 1..1672 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" sig_peptide 167..286 CDS 167..1309 /codon_start=1 /product="transmembrane protein" /db_xref="PID:g755466" /translation="MGAAAAQAPLGLPAASARLLLLATSVLLLFAFSLPGSRASNQPP GGGGGTGGDCPGGKGKSINCSELNVRESDVRVCDESSCKYGGVCKEDGDGLKCACQFQ CHTNYIPVCGSNGDTYQNECFLRRAACKHQKEITVIARGPCYSDNGSGSGEGEEEGSG AEVHRKHSKCGPCKYKAECDEDAENVGCVCNIDCSGYSFNPVCASDGSSYNNPCFVRE ASCIKQEQIDIRHLGHCTDTDDTSLLGKKDDGLQYRPDVKDASDQREDVYIGNHMPCP ENLNGYCIHGKCEFIYLLRRASCRCESGYTGQHCEKTDFSILYVVPSRQKLTHVLIAA IIGAVQIAIIVAIVMCITRKCPKNNRGRRQKQNLGHFTSDTSSRMV" misc_feature 395..595 /note="encodes follistatin module; this is the first occurence of follistatin modules in a transmembrane protein" misc_feature 674..869 /note="encodes follistatin module" misc_feature 989..1096 /note="encodes EGF domain; has all required elements for EGF receptor binding except a single arginine" misc_feature 1151..1279 /note="encodes transmembrane domain" BASE COUNT 538 a 304 c 371 g 459 t ORIGIN 1 aaaaaaatta aaaaaaaaaa aaaaaacaga aaaaaaaaca tagtacatgc caagatatta 61 ttatgacaat tacaaataca aataaattat gatctttgac ctcagcatat ttattaacta 121 aaagggaaga taaaacaggc acataactat aacaggggca ccagtcatgg gcgccgcagc 181 cgctcaggcg cctctcgggc tgcctgcggc ctccgctcgc cttctgctgc tagcgacgtc 241 ggtgcttctg ctcttcgcct tctctctgcc cgggagccgc gcgtccaacc agcccccggg 301 tggtggcggc ggcacgggcg gggactgtcc cggcggcaaa ggcaagagca tcaactgctc 361 agaattaaat gtgagggagt ctgacgtaag agtttgtgat gagtcatcat gtaaatatgg 421 aggagtctgt aaagaagatg gagatggttt gaaatgtgca tgccaatttc agtgccatac 481 aaattatatt cctgtctgtg gatcaaatgg ggacacttat caaaatgaat gctttctcag 541 aagggctgct tgtaagcacc agaaagagat aacagtaata gcaagaggac catgctactc 601 tgataatgga tctggatctg gagaaggaga agaggaaggg tcaggggcag aagttcacag 661 aaaacactcc aagtgtggac cctgcaaata taaagctgag tgtgatgaag atgcagaaaa 721 tgttgggtgt gtatgtaata tagattgcag tggatacagt tttaatcctg tgtgtgcttc 781 tgatgggagt tcctataaca atccctgttt tgttcgagaa gcatcttgta taaagcaaga 841 acaaattgat ataaggcatc ttggtcattg cacagataca gatgacacta gtttgttggg 901 aaagaaagat gatggactac aatatcgacc agatgtgaaa gatgctagtg atcaaagaga 961 agatgtttat attggaaacc acatgccttg ccctgaaaac ctcaatggtt actgcatcca 1021 tggaaaatgt gaattcatct atctactcag aagggcttct tgtagatgtg aatctggcta 1081 cactggacag cactgtgaaa agacagactt tagtattctc tatgtagtgc caagtaggca 1141 aaagctcact catgttctta ttgcagcaat tattggagct gtacagattg ccatcatagt 1201 agcaattgta atgtgcataa caagaaaatg ccccaaaaac aatagaggac gtcgacagaa 1261 gcaaaaccta ggtcatttta cttcagatac gtcatccaga atggtttaaa ctgatgactt 1321 ttatatgtac actgaccatg tgtatgtaca tttattatgt ctttttttaa agaatggaaa 1381 tatttatttc agaaggcctt atttttggac attttatagt gtagtactgt tggctcgata 1441 tttgaatatt cagctacgac agttttggac tgtttagtag tctttgtttt atgtttttaa 1501 atacagaaat tgcttcacaa atttgtacca catggtaatt ctaagacttg ttctttaccc 1561 atggaatgta atatttttgc aaagatggac tacttcacaa atggttataa agtcatatcc 1621 acttcttcca caatgaccac agcaaatgac ccaagcatga actaaagaag ag // LOCUS HSU19948 1659 bp mRNA PRI 16-MAR-1996 DEFINITION Human protein disulfide isomerase (PDIp) mRNA, complete cds. ACCESSION U19948 NID g1161313 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1659) AUTHORS Desilva,M.G., Lu,J., Donadel,G., Modi,W.S., Xie,H., Notkins,A.L. and Lan,M.S. TITLE Characterization and chromosomal localization of a new protein disulfide isomerase, PDIp, highly expressed in human pancreas JOURNAL DNA Cell Biol. 15 (1), 9-16 (1996) MEDLINE 96152236 REFERENCE 2 (bases 1 to 1659) AUTHORS Lan,M.S. TITLE Direct Submission JOURNAL Submitted (18-JAN-1995) Michael S. Lan, Laboratory of Oral Medicine, NIDR/NIH, Building 30, Room 124, 30 Convent Dr. MSC 4322, Bethesda, MD 20892-4322, USA FEATURES Location/Qualifiers source 1..1659 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="human insulinoma subtraction library ISL-153; J. Biol. Chem. 267, 15252-15257 (1992)" /map="16p13.3" /chromosome="16" /cell_type="acinar cell" /tissue_type="pancreas" gene 28..1563 /gene="PDIp" CDS 28..1563 /gene="PDIp" /codon_start=1 /product="protein disulfide isomerase" /db_xref="PID:g1161314" /translation="MASCPWGQEQGARSPSEEPPEEEIPKEDGILVLSRHTLGLALRE HPALLVEFYAPWCGHCQALAPEYSKAAAVLAAESMVVTLAKVDGPAQRELAEEFGVTE YPTLKFFRNGNRTHPEEYTGPRDAEGIAEWLRRRVGPSAMRLEDEAAAQALIGGRDLV VIGFFQDLQDEDVATFLALAQDALDMTFGLTDRPRLFQQFGLTKDTVVLFKKFDEGRA DFPVDEELGLDLGDLSRFLVTHSMRLVTEFNSQTSAKIFAARILNHLLLFVNQTLAAH RELLAGFGEAAPRFRGQVLFVVVDVAADNEHVLQYFGLKAEAAPTLRLVNLETTKKYA PVDGGPVTAASITAFCHAVLNGQVKPYLLSQEIPPDWDQRPVKTLVGKNFEQVAFDET KNVFVKFYAPWCTHCKEMAPAWEALAEKYQDHEDIIIAELDATANELDAFAVHGFPTL KYFPAGPGRKVIEYKSTRDLETFSKFLDNGGVLPTEESPEEPAAPFPEPPANSTMGSK EEL" misc_feature 192..210 /gene="PDIp" /note="encodes thioredoxin-like catalytic site" misc_feature 364..372 /gene="PDIp" /note="encodes potential N-linked glycosylation site" misc_feature 835..843 /gene="PDIp" /note="encodes potential N-linked glycosylation site" misc_feature 1233..1251 /gene="PDIp" /note="encodes thioredoxin-like catalytic site" misc_feature 1531..1539 /gene="PDIp" /note="encodes potential N-linked glycosylation site" misc_feature 1548..1560 /gene="PDIp" /note="encodes putative ER retention site" BASE COUNT 317 a 495 c 539 g 308 t ORIGIN 1 agcagtacag gcagaagctg gcggctcatg gcttcgtgcc catggggtca ggaacaggga 61 gcgaggagcc cctcggagga gcctccagag gaggaaatcc ccaaggagga tgggatcttg 121 gtgctgagcc gccacaccct gggcctggcc ctgcgggagc accctgccct gctggtggaa 181 ttctatgccc cgtggtgtgg gcactgccag gccctggccc ccgagtacag caaggcagct 241 gccgtgctcg cggccgagtc aatggtggtc acgctggcca aggtggatgg gcccgcgcag 301 cgcgagctgg ctgaggagtt tggtgtgacg gagtacccta cgctcaagtt cttccgcaat 361 gggaaccgca cgcaccccga ggagtacaca ggaccacggg acgctgaggg cattgccgag 421 tggctgcgac ggcgggtggg gcccagtgcc atgcggctgg aggatgaggc ggccgcccag 481 gcgctgatcg gtggccggga cctagtggtc attggcttct tccaggacct gcaggacgag 541 gacgtggcca ccttcttggc cttggcccag gacgccctgg acatgacctt tggcctcaca 601 gaccggccgc ggctctttca gcagtttggc ctcaccaagg acactgtggt tctcttcaag 661 aagtttgatg aggggcgggc agacttcccc gtggacgagg agcttggcct ggacctgggg 721 gatctgtcgc gcttcctggt cacacacagc atgcgcctgg tcacggagtt caacagccag 781 acgtctgcca agatcttcgc ggccaggatc ctcaaccacc tgctgctgtt tgtcaaccag 841 acgctggctg cgcaccggga gctcctagcg ggctttgggg aggcagctcc ccgcttccgg 901 gggcaggtgc tgttcgtggt ggtggacgtg gcggccgaca atgagcacgt gctgcagtac 961 tttggactca aggctgaggc agcccccact ctgcgcttgg tcaaccttga aaccactaag 1021 aagtatgcgc ctgtggatgg gggccctgtc accgcagcgt ccatcactgc tttctgccat 1081 gcagtcctca acggccaagt caagccctat ctcctgagcc aggagatacc ccctgattgg 1141 gatcagcggc cagttaagac cctcgtgggc aagaattttg agcaggtggc ttttgacgaa 1201 accaagaatg tgtttgtcaa gttctatgcc ccgtggtgca cccactgcaa ggagatggcc 1261 cctgcctggg aggcattggc tgagaagtac caagaccacg aggacatcat cattgctgag 1321 ctggatgcca cggccaacga gctggatgcc ttcgctgtgc acggcttccc tactctcaag 1381 tacttcccag cagggccagg tcggaaggtg attgaataca aaagcaccag ggacctggag 1441 actttctcca agttcctgga caacgggggc gtgctgccca cggaggagtc cccggaggag 1501 ccagcagccc cgttcccgga gccaccggcc aactccacta tggggtccaa ggaggaactg 1561 tagctgcccc cgtgtcaccc ccgccatcac tgctggacag gagccacccc cttgggtacc 1621 agagggagct gtgcattgtg aataaagagt gagcttggt // LOCUS HSU19977 1306 bp mRNA PRI 16-AUG-1995 DEFINITION Human preprocarboxypeptidase A2 (proCPA2) mRNA, complete cds. ACCESSION U19977 NID g790226 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1306) AUTHORS Catasus,L., Vendrell,J., Aviles,F.X., Carreira,S., Puigserver,A. and Billeter,M. TITLE The sequence and conformation of human pancreatic procarboxypeptidase A2. cDNA cloning, sequence analysis, and three-dimensional model JOURNAL J. Biol. Chem. 270 (12), 6651-6657 (1995) MEDLINE 95204457 REFERENCE 2 (bases 1 to 1306) AUTHORS Aviles,F.X. TITLE Direct Submission JOURNAL Submitted (19-JAN-1995) Francesc X. Aviles, Bioquimica i Biologia Molecular, Universitat Autonoma de Barcelona, Facultat Ciencies, Campus Bellaterra. U.A.B., Bellaterra, Barcelona. Catalunya, 08193 Bellaterra, Spain FEATURES Location/Qualifiers source 1..1306 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="pancreas" 5'UTR <1..4 gene 5..1258 /gene="proCPA2" CDS 5..1258 /gene="proCPA2" /codon_start=1 /product="preprocarboxypeptidase A2" /db_xref="PID:g790227" /translation="MRLILFFGALFGHIYCLETFVGDQVLEIVPSNEEQIKNLLQLEA QEHLQLDFWKSPTTPGETAHVRVPFVNVQAVKVFLESQGIAYSIMIEDVQVLLDKENE EMLFNRRRERSGNFNFGAYHTLEEISQEMDNLVAEHPGLVSKVNIGSSFENRPMNVLK FSTGGDKPAIWLDAGIHAREWVTQATALWTANKIVSDYGKDPSITSILDALDIFLLPV TNPDGYVFSQTKNRMWRKTRSKVSGSLCVGVDPNRNWDAGFGGPGASSNPCSDSYHGP SANSEVEVKSIVDFIKSHGKVKAFIILHSYSQLLMFPYGYKCTKLDDFDELSEVAQKA AQSLRSLHGTKYKVGPICSVIYQASGGSIDWSYDYGIKYSFAFELRDTGRYGFLLPAR QILPTAEETWLGLKAIMEHVRDHPY" sig_peptide 5..52 /gene="proCPA2" mat_peptide 53..1255 /gene="proCPA2" /product="procarboxypeptidase A2" 3'UTR 1259..>1306 polyA_signal 1283..1288 BASE COUNT 348 a 313 c 325 g 320 t ORIGIN 1 ggccatgagg ttgatcctgt tttttggtgc cctttttggg catatctact gtctagaaac 61 atttgtggga gaccaagttc ttgagattgt accaagcaat gaagaacaaa ttaaaaatct 121 gctacaattg gaggctcaag aacatctcca gcttgatttt tggaaatcac ccaccacccc 181 aggggagaca gcccacgtcc gagttccctt cgtcaacgtc caggcagtca aagtgttctt 241 ggagtcccag ggaattgcct attccatcat gattgaagac gtgcaggtcc tgttggacaa 301 agagaatgaa gaaatgcttt ttaataggag aagagaacgg agtggtaact tcaattttgg 361 ggcctaccat accctggaag agatttccca agaaatggat aacctcgtgg ctgagcaccc 421 tggtctagtg agcaaagtga atattggctc ttcttttgag aaccggccta tgaacgtgct 481 caagttcagc accggaggag acaagccagc tatctggctg gatgctggga tccatgctcg 541 agagtgggtt acacaagcta cggcactttg gacagcaaat aagattgttt ctgattatgg 601 aaaggaccca tccatcactt ccattctgga cgccctggat atcttcctcc tgccagtcac 661 aaaccctgat ggatacgtgt tctctcaaac caaaaatcgt atgtggcgga agacccggtc 721 caaggtatct ggaagcctct gtgttggtgt ggatcctaac cggaactggg atgcaggttt 781 tggaggacct ggagccagca gcaacccttg ctctgattca taccacggac ccagtgccaa 841 ctctgaagtt gaagtgaaat ccatagtgga cttcatcaag agtcatggaa aagtcaaggc 901 cttcattatc ctccacagct attcccagct gctgatgttc ccctatgggt acaaatgtac 961 caagttagat gactttgatg agctgagtga agtggcccaa aaggctgccc aatctctgag 1021 aagcctgcat ggcaccaagt acaaagtggg accaatctgc tctgtcatct accaagccag 1081 tggaggaagc attgactggt cctatgatta tggcatcaag tactcatttg cctttgaact 1141 gagagacaca gggcgctacg gcttcctctt gccagcccgt cagatcctgc ccacagccga 1201 ggagacctgg cttggcttga aggcaatcat ggagcatgtg cgagaccacc cctattaggg 1261 ccctggggaa gaaacaagag ccattaaaat ctctttggtt tgaagc // LOCUS HSU1RNPA 1209 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for U1 small nuclear RNP-specific A protein. ACCESSION X06347 NID g37540 KEYWORDS U1 snRNP-specific protein A. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1209) AUTHORS Sillekens,P.T., Habets,W.J., Beijer,R.P. and van Venrooij,W.J. TITLE cDNA cloning of the human U1 snRNA-associated A protein: extensive homology between U1 and U2 snRNP-specific proteins JOURNAL EMBO J. 6 (12), 3841-3848 (1987) MEDLINE 88111575 COMMENT Data kindly reviewed (07-NOV-1988) by VAN VENROOIJ W.J. FEATURES Location/Qualifiers source 1..1209 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="teratocarcinoma." /clone_lib="lambda gt11" /clone="pHA-1, pHA-2, pHA-3, pHA-4" CDS 126..974 /note="A protein (AA 1-282)" /codon_start=1 /db_xref="PID:g37541" /db_xref="SWISS-PROT:P09012" /translation="MAVPETRPNHTIYINNLNEKIKKDELKKSLYAIFSQFGQILDIL VSRSLKMRGQAFVIFKEVSSATNALRSMQGFPFYDKPMRIQYAKTDSDIIAKMKGTFV ERDRKREKRKPKSQETPATKKAVQGGGATPVVGAVQGPVPGMPPMTQAPRIMHHMPGQ PPYMPPPGMIPPPGLAPGQIPPGAMPPQQLMPGQMPPAQPLSENPPNHILFLTNLPEE TNELMLSMLFNQFPGFKEVRLVPGRHDIAFVEFDNEVQAGAARDALQGFKITQNNAMK ISFAKK" misc_feature 279..302 /note="RNP consensus sequence" misc_feature 1162..1167 /note="polyA signal" BASE COUNT 269 a 380 c 307 g 253 t ORIGIN 1 gaattcctga cttccttttc ggaggaagat ccttgagcag ccgacgttgg gacaaaggat 61 ttggagaaac ccagggctaa agtcacgttt ttcctccttt aagacttacc tcaacacttc 121 actccatggc agttcccgag acccgcccta accacactat ttatatcaac aacctcaatg 181 agaagatcaa gaaggatgag ctaaaaaagt ccctgtacgc catcttctcc cagtttggcc 241 agatcctgga tatcctggta tcacggagcc tgaagatgag gggccaggcc tttgtcatct 301 tcaaggaggt cagcagcgcc accaacgccc tgcgctccat gcagggtttc cctttctatg 361 acaaacctat gcgtatccag tatgccaaga ccgactcaga tatcattgcc aagatgaaag 421 gcaccttcgt ggagcgggac cgcaagcggg agaagaggaa gcccaagagc caggagaccc 481 cggccaccaa gaaggctgtg caaggcgggg gagccacccc cgtggtgggg gctgtccagg 541 ggcctgtccc gggcatgccg ccgatgactc aggcgccccg cattatgcac cacatgccgg 601 gccagccgcc ctacatgccg ccccctggta tgatcccccc gccaggcctt gcacctggcc 661 agatcccacc aggggccatg cccccgcagc agcttatgcc aggacagatg ccccctgccc 721 agcctctttc tgagaatcca ccgaatcaca tcttgttcct caccaacctg ccagaggaga 781 ccaacgagct catgctgtcc atgcttttca atcagttccc tggcttcaag gaggtccgtc 841 tggtacccgg gcggcatgac atcgccttcg tggagtttga caatgaggta caggcagggg 901 cagctcgcga tgccctgcag ggctttaaga tcacgcagaa caacgccatg aagatctcct 961 ttgccaagaa gtagcacctt ttccccccat gcctgcccct tcccctgttc tggggccacc 1021 cctttccccc ttggctcagc cccctgaagg taagtccccc cttgggggcc ttcttggagc 1081 cgtgtgtgag tgagtggtcg ccacacagca ttgtacccag agtctgtccc cagacattgc 1141 acctggcgct gttaggccgg aattaaagtg gctttttgag gtttggtttt tcacaaaaaa 1201 aaggaattc // LOCUS HSU1RNPC 733 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for U1 small nuclear RNP-specific C protein. ACCESSION X12517 NID g37542 KEYWORDS U1 snRNP-specific protein C. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 733) AUTHORS Sillekens,P.T.G. TITLE Direct Submission JOURNAL Submitted (29-JUL-1988) Sillekens P.T.G., Department of Biochemistry, University of Nijmegen, St. Adelbertusplein 1, PO Box 9101, 6500 HB Nijmegen, The Netherlands REFERENCE 2 (bases 1 to 733) AUTHORS Sillekens,P.T., Beijer,R.P., Habets,W.J. and van Venrooij,W.J. TITLE Human U1 snRNP-specific C protein: complete cDNA and protein sequence and identification of a multigene family in mammals JOURNAL Nucleic Acids Res. 16 (17), 8307-8321 (1988) MEDLINE 88335591 COMMENT The sequence overlaps with that reported by Yamamoto et al.in J.Immunol. 140:311-317(1988), M18465. FEATURES Location/Qualifiers source 1..733 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="teratocarcinoma" /clone_lib="lambda gt11" /clone="pHC1" CDS 16..495 /note="C protein (AA 1-159)" /codon_start=1 /db_xref="PID:g37543" /db_xref="SWISS-PROT:P09234" /translation="MPKFYCDYCDTYLTHDSPSVRKTHCSGRKHKENVKDYYQKWMEE QAQSLIDKTTAAFQQGKIPPTPFSAPPPAGAMIPPPPSLPGPPRPGMMPAPHMGGPPM MPMMGPPPPGMMPVGPAPGMRPPMGGHMPMMPGPPMMRPPARPMMVPTRPGMTRPDR" misc_feature 88 /note="T is C in ref M18465" misc_feature 94 /note="G is A in ref M18465" misc_feature 295 /note="G is A in ref M18465" misc_feature 298 /note="C is a gap in ref M18465" misc_feature 306 /note="C is a gap in ref M18465" misc_feature 316 /note="C is a gap in ref M18465" misc_feature 399 /note="C is a gap in ref M18465" misc_feature 691..697 /note="poyA signal" BASE COUNT 202 a 186 c 175 g 170 t ORIGIN 1 gaattccaga gcaacatgcc caagttttat tgtgactact gcgatacata cctcacccat 61 gactctccat ctgtgagaaa gacacactgc agtggaagga aacacaaaga gaatgtgaaa 121 gactattatc agaaatggat ggaagagcag gctcagagcc tgattgacaa aacaacggct 181 gcatttcaac aaggaaagat acctcctact ccattctctg ctcctcctcc tgcaggggcg 241 atgataccac ctccccccag ccttccgggt cctcctcgcc ctggtatgat gccagcaccc 301 catatggggg gccctcccat gatgccaatg atgggccctc ctcctcctgg gatgatgcca 361 gtgggacctg ctcctggaat gaggccgccc atgggaggcc atatgccaat gatgcctggg 421 cccccaatga tgagacctcc tgcccgtccc atgatggtgc ccactcggcc cggaatgact 481 cgaccagaca gataaggata gaggggaggc cttattgtat cggttttata ttacctgttc 541 tgcttcacca ggagatcatg ctgctgtgat actgagtttt ctaaacagca taaggaagac 601 ttgctcccct gtcctatgaa agagaatagt tttggagggg agaagtggga caaaaaagat 661 gcagttttcc tttgtattgg gaaatgtgaa aataaaattg tcaactcttt cagttaaaaa 721 aaaaaaggaa ttc // LOCUS HSU20157 1505 bp mRNA PRI 22-APR-1995 DEFINITION Human platelet-activating factor acetylhydrolase mRNA, complete cds. ACCESSION U20157 NID g780132 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1505) AUTHORS Tjoelker,L.W., Wilder,C., Eberhardt,C., Stafforini,D.M., Dietsch,G., Schimpf,B., Hooper,S., Trong,H., Cousens,L.S., Zimmerman,G.A., Yamada,Y., McIntyre,T.M., Prescott,S.M. and Gray,P.W. TITLE Anti-inflammatory properties of a platelet-activating factor acetylhydrolase JOURNAL Nature 374 (6522), 549-553 (1995) MEDLINE 95214779 REFERENCE 2 (bases 1 to 1505) AUTHORS Tjoelker,L.W. TITLE Direct Submission JOURNAL Submitted (20-JAN-1995) Larry W. Tjoelker, ICOS Corporation, 22021 20th Ave. S.E., Bothell, WA 98021, USA FEATURES Location/Qualifiers source 1..1505 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="sAH 406-3" /clone_lib="in vitro differentiated macrophage cDNA library" /cell_type="macrophage" /tissue_type="myeloid" CDS 162..1487 /codon_start=1 /product="platelet-activating factor acetylhydrolase" /db_xref="PID:g780133" /translation="MVPPKLHVLFCLCGCLAVVYPFDWQYINPVAHMKSSAWVNKIQV LMAAASFGQTKIPRGNGPYSVGCTDLMFDHTNKGTFLRLYYPSQDNDRLDTLWIPNKE YFWGLSKFLGTHWLMGNILRLLFGSMTTPANWNSPLRPGEKYPLVVFSHGLGAFRTLY SAIGIDLASHGFIVAAVEHRDRSASATYYFKDQSAAEIGDKSWLYLRTLKQEEETHIR NEQVRQRAKECSQALSLILDIDHGKPVKNALDLKFDMEQLKDSIDREKIAVIGHSFGG ATVIQTLSEDQRFRCGIALDAWMFPLGDEVYSRIPQPLFFINSEYFQYPANIIKMKKC YSPDKERKMITIRGSVHQNFADFTFATGKIIGHMLKLKGDIDSNVAIDLSNKASLAFL QKHLGLHKDFDQWDCLIEGDDENLIPGTNINTTNQHIMLQNSSGIEKYN" BASE COUNT 438 a 311 c 333 g 423 t ORIGIN 1 gctggtcgga ggctcgcagt gctgtcggcg agaagcagtc gggtttggag cgcttgggtc 61 gcgttggtgc gcggtggaac gcgcccaggg accccagttc ccgcgagcag ctccgcgccg 121 cgcctgagag actaagctga aactgctgct cagctcccaa gatggtgcca cccaaattgc 181 atgtgctttt ctgcctctgc ggctgcctgg ctgtggttta tccttttgac tggcaataca 241 taaatcctgt tgcccatatg aaatcatcag catgggtcaa caaaatacaa gtactgatgg 301 ctgctgcaag ctttggccaa actaaaatcc cccggggaaa tgggccttat tccgttggtt 361 gtacagactt aatgtttgat cacactaata agggcacctt cttgcgttta tattatccat 421 cccaagataa tgatcgcctt gacacccttt ggatcccaaa taaagaatat ttttggggtc 481 ttagcaaatt tcttggaaca cactggctta tgggcaacat tttgaggtta ctctttggtt 541 caatgacaac tcctgcaaac tggaattccc ctctgaggcc tggtgaaaaa tatccacttg 601 ttgttttttc tcatggtctt ggggcattca ggacacttta ttctgctatt ggcattgacc 661 tggcatctca tgggtttata gttgctgctg tagaacacag agatagatct gcatctgcaa 721 cttactattt caaggaccaa tctgctgcag aaatagggga caagtcttgg ctctacctta 781 gaaccctgaa acaagaggag gagacacata tacgaaatga gcaggtacgg caaagagcaa 841 aagaatgttc ccaagctctc agtctgattc ttgacattga tcatggaaag ccagtgaaga 901 atgcattaga tttaaagttt gatatggaac aactgaagga ctctattgat agggaaaaaa 961 tagcagtaat tggacattct tttggtggag caacggttat tcagactctt agtgaagatc 1021 agagattcag atgtggtatt gccctggatg catggatgtt tccactgggt gatgaagtat 1081 attccagaat tcctcagccc ctctttttta tcaactctga atatttccaa tatcctgcta 1141 atatcataaa aatgaaaaaa tgctactcac ctgataaaga aagaaagatg attacaatca 1201 ggggttcagt ccaccagaat tttgctgact tcacttttgc aactggcaaa ataattggac 1261 acatgctcaa attaaaggga gacatagatt caaatgtagc tattgatctt agcaacaaag 1321 cttcattagc attcttacaa aagcatttag gacttcataa agattttgat cagtgggact 1381 gcttgattga aggagatgat gagaatctta ttccagggac caacattaac acaaccaatc 1441 aacacatcat gttacagaac tcttcaggaa tagagaaata caattaggat taaaataggt 1501 ttttt // LOCUS HSU20158 2032 bp mRNA PRI 18-MAY-1995 DEFINITION Human 76 kDa tyrosine phosphoprotein SLP-76 mRNA, complete cds. ACCESSION U20158 NID g806765 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2032) AUTHORS Jackman,J.K., Motto,D.G., Sun,Q., Tanemoto,M., Turck,C.W., Peltz,G.A., Koretzky,G.A. and Findell,P.R. TITLE Molecular cloning of SLP-76, a 76-kDa tyrosine phosphoprotein associated with Grb2 in T cells JOURNAL J. Biol. Chem. 270 (13), 7029-7032 (1995) MEDLINE 95221345 REFERENCE 2 (bases 1 to 2032) AUTHORS Findell,P.R. TITLE Direct Submission JOURNAL Submitted (20-JAN-1995) Paul R. Findell, Inflammation and Immunology, Syntex Discovery Research, 3401 Hillview Avenue, Palo Alto, CA 94303, USA FEATURES Location/Qualifiers source 1..2032 /organism="Homo sapiens" /db_xref="taxon:9606" misc_signal 250..258 /note="Kozak sequence" CDS 256..1857 /note="76 kDa tyrosine phosphoprotein" /codon_start=1 /product="SLP-76" /db_xref="PID:g806766" /translation="MALRNVPFRSEVLGWDPDSLADYFKKLNYKDCEKAVKKYHIDGA RFLNLTENDIQKFPKLRVPILSKLSQEINKNEERRSIFTRKPQVPRFPEETESHEEDN GGWSSFEEDDYESPNDDQDGEDDGDYESPNEEEEAPVEDDADYEPPPSNDEEALQNSI LPAKPFPNSNSMYIDRPPSGKTPQQPPVPPQRPMAALPPPPAGRNHSPLPPPQTNHEE PSRSRNHKTAKLPAPSIDRSTKPPLDRSLAPFDREPFTLGKKPPFSDKPSIPAGRSLG EHLPKIQKPPLPPTTERHERSSPLPGKKPPVPKHGWGPDRRENDEDDVHQRPLPQPAL LPMSSNTFPSRSTKPSPMNPLPSSHMPGAFSESNSSFPQSASLPPYFSQGPSNRPPIR AEGRNFPLPLPNKPRPPSPAEEENSLNEEWYVSYITRPEAEAALRKINQDGTFLVRDS SKKTTTNPYVLMVLYKDKVYNIQIRYQKESQVYLLGTGLRGKEDFLSVSDIIDYFRKM PLLLIDGKNRGSRYQCTLTHAAGYP" misc_feature 1519..1797 /note="encodes src homology 2 domain" polyA_signal 2009..2015 polyA_site 2032 BASE COUNT 581 a 579 c 468 g 404 t ORIGIN 1 ctggttcggc ccacctctga aggttccaga atcgatagtg aattcgtgga agagaccata 61 tttgttcgca gaggaagccg ttgctttctg ggatctggct acggcagaaa agacatcggc 121 tccaacaggg gtgttccaca gggtagctgg gagttggaag agccaagaac gcctccgagc 181 tctggatttg agcttctctg cccatgggtg aagcgcccat gctcagcttg tgagcttctt 241 cccgggagag cagccatggc actgaggaat gtgccctttc gctcagaggt cctgggctgg 301 gaccccgaca gccttgctga ctatttcaag aagctcaact ataaggactg tgagaaggca 361 gtgaagaagt accacatcga tggggctcgc ttcttgaacc tgacagaaaa tgacatccag 421 aagttcccca agctccgggt gccgattctc agtaagttaa gtcaggaaat caacaagaac 481 gaagagagga ggagcatctt cacacgcaaa ccccaagtcc cgcggtttcc tgaagagaca 541 gaaagccacg aagaggacaa tgggggttgg tcgtcctttg aagaagacga ttatgaaagt 601 cccaatgatg accaggatgg ggaggatgat ggagactatg agtcccccaa tgaggaggaa 661 gaggcacccg tggaagatga cgcggattat gagccgccac cctccaatga cgaggaagct 721 ctgcagaact ccatcctgcc tgccaagcct ttccccaact ccaactccat gtacatcgac 781 cggcccccct ctgggaaaac cccccagcag cctcctgtgc ccccccagag accgatggcc 841 gccctcccgc ccccaccagc cggccggaat cactcgccac tgcccccacc ccagaccaac 901 cacgaagaac ccagcagaag cagaaaccac aaaacggcaa agctccctgc tccttcaata 961 gacagaagca cgaaacctcc cctagatcgt tcattagctc cgtttgatag agaacccttc 1021 acactaggaa agaaaccacc attttctgac aagccctcga ttccagcggg aaggtcactc 1081 ggggagcatt tacccaagat tcaaaagcct cctttaccac cgaccacgga aagacatgaa 1141 aggagcagcc ccctgccagg gaagaagcca cctgtgccaa agcatggatg gggaccagac 1201 agaagagaga atgatgaaga tgatgtgcat cagagacctt tgccccagcc agcactactt 1261 cctatgagct ccaacacttt cccttcaaga tctactaagc caagtcccat gaaccctctc 1321 ccatcctctc acatgcctgg agcattctca gaaagtaaca gcagttttcc acagagtgcc 1381 tccctgccac catacttctc tcaaggccct agcaacagac cacctatcag agccgaaggc 1441 agaaacttcc ccttgccact tccaaacaaa cctcggcccc catcccccgc ggaggaagag 1501 aattcattaa atgaagagtg gtacgtttct tatattaccc gaccagaggc agaagctgct 1561 cttagaaaga taaaccagga tggcacattt ctggtcagag acagctctaa aaaaacaaca 1621 accaatccat atgtcctcat ggtgttgtac aaagataaag tttacaacat ccagatccgt 1681 tatcagaagg aaagtcaagt ttacttgttg ggaactggac tccgagggaa agaggacttt 1741 ctgtctgtgt cagatattat tgactacttc aggaaaatgc cacttctgct cattgatggg 1801 aaaaaccgag gttccagata ccagtgcaca ttaacgcatg ctgcagggta cccatagcaa 1861 gttatagccg agcaaatgaa ccgtcctcct gcctctgttg ccaacacgag atcaatcagc 1921 cttggtcaat ggacaaacac ttaggactga actgaacccc tccccatgaa cacaagggtt 1981 ttatcctttc ctttaaaaac agtgtttgaa atgaagactg tcaactatcc cc // LOCUS HSU20165 3167 bp mRNA PRI 10-MAR-1995 DEFINITION Human type II serine/threonine kinase receptor mRNA, complete cds. ACCESSION U20165 NID g704361 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3167) AUTHORS Kawabata,M., Chytil,A. and Moses,H.L. TITLE Cloning of a novel type II serine/threonine kinase receptor through interaction with the type I transforming growth factor-beta receptor JOURNAL J. Biol. Chem. 270 (10), 5625-5630 (1995) MEDLINE 95197572 REFERENCE 2 (bases 1 to 3167) AUTHORS Kawabata,M. TITLE Direct Submission JOURNAL Submitted (20-JAN-1995) Masahiro Kawabata, Cell Biology, Vanderbilt University School of Medicine, Nashville, TN 37232, USA FEATURES Location/Qualifiers source 1..3167 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 37..3153 /codon_start=1 /product="type II serine/threonine kinase receptor" /db_xref="PID:g704362" /translation="MTSSLQRPWRVPWLPWTILLVSTAAASQNQERLCAFKDPYQQDL GIGESRISHENGTILCSKGSTCYGLWEKSKGDINLVKQGCWSHIGDPQECHYEECVVT TTPPSIQNGTYRFCCCSTDLCNVNFTENFPPPDTTPLSPPHSFNRDETIIIALASVSV LAVLIVALCFGYRMLTGDRKQGLHSMNMMEAAASEPSLDLDNLKLLELIGRGRYGAVY KGSLDERPVAVKVFSFANRQNFINEKNIYRVPLMEHDNIARFIVGDERVTADGRMEYL LVMEYYPNGSLCKYLSLHTSDWVSSCRLAHSVTRGLAYLHTELPRGDHYKPAISHRDL NSRNVLVKNDGTCVISDFGLSMRLTGNRLVRPGEEDNAAISEVGTIRYMAPEVLEGAV NLRDCESALKQVDMYALGLIYWEIFMRCTDLFPGESVPEYQMAFQTEVGNHPTFEDMQ VLVSREKQRPKFPEAWKENSLAVRSLKETIEDCWDQDAEARLTAQCAEERMAELMMIW ERNKSVSPTVNPMSTAMQNERNLSHNRRVPKIGPYPDYSSSSYIEDSIHHTDSIVKNI SSEHSMSSTPLTIGEKNRNSINYERQQAQARIPSPETSVTSLSTNTTTTNTTGLTPST GMTTISEMPYPDETNLHTTNVAQSIGPTPVCLQLTEEDLETNKLDPKEVDKNLKESSD ENLMEHSLKQFSGPDPLSSTSSSLLYPLIKLAVEATGQQDFTQTANGQACLIPDVLPT QIYPLPKQQNLPKRPTSLPLNTKNSTKEPRLKFGSKHKSNLKQVETGVAKMNTINAAE PHVVTVTMNGVAGRNHSVNSHAATTQYANGTVLSGQTTNIVTHRAQEMLQNQFIGEDT RLNINSSPDEHEPLLRREQQAGHDEGVLDRLVDRRERPLEGGRTNSNNNNSNPCSEQD VLAQGVPSTAADPGPSKPRRAQRPNSLDLSATNVLDGSSIQIGESTQDGKSGSGEKIK KRVKTPYSLKRWRPSTWVISTESLDCEVNNNGSNRAVHSKSSTAVYLAEGGTATTMVS KDIGMNCL" BASE COUNT 967 a 727 c 714 g 759 t ORIGIN 1 ttttctttgc cctcctgatt cttggctggc ccagggatga cttcctcgct gcagcggccc 61 tggcgggtgc cctggctacc atggaccatc ctgctggtca gcactgcggc tgcttcgcag 121 aatcaagaac ggctatgtgc gtttaaagat ccgtatcagc aagaccttgg gataggtgag 181 agtagaatct ctcatgaaaa tgggacaata ttatgctcga aaggtagcac ctgctatggc 241 ctttgggaga aatcaaaagg ggacataaat cttgtaaaac aaggatgttg gtctcacatt 301 ggagatcccc aagagtgtca ctatgaagaa tgtgtagtaa ctaccactcc tccctcaatt 361 cagaatggaa cataccgttt ctgctgttgt agcacagatt tatgtaatgt caactttact 421 gagaattttc cacctcctga cacaacacca ctcagtccac ctcattcatt taaccgagat 481 gagacaataa tcattgcttt ggcatcagtc tctgtattag ctgttttgat agttgcctta 541 tgctttggat acagaatgtt gacaggagac cgtaaacaag gtcttcacag tatgaacatg 601 atggaggcag cagcatccga accctctctt gatctagata atctgaaact gttggagctg 661 attggccgag gtcgatatgg agcagtatat aaaggctcct tggatgagcg tccagttgct 721 gtaaaagtgt tttcctttgc aaaccgtcag aattttatca acgaaaagaa catttacaga 781 gtgcctttga tggaacatga caacattgcc cgctttatag ttggagatga gagagtcact 841 gcagatggac gcatggaata tttgcttgtg atggagtact atcccaatgg atctttatgc 901 aagtatttaa gtctccacac aagtgactgg gtaagctctt gccgtcttgc tcattctgtt 961 actagaggac tggcttatct tcacacagaa ttaccacgag gagatcatta taaacctgca 1021 atttcccatc gagatttaaa cagcagaaat gtcctagtga aaaatgatgg aacctgtgtt 1081 attagtgact ttggactgtc catgaggctg actggaaata gactggtgcg cccaggggag 1141 gaagataatg cagccataag cgaggttggc actatcagat atatggcacc agaagtgcta 1201 gaaggagctg tgaacttgag ggactgtgaa tcagctttga aacaagtaga catgtatgct 1261 cttggactaa tctattggga gatatttatg agatgtacag acctcttccc aggggaatcc 1321 gtaccagagt accagatggc ttttcagaca gaggttggaa accatcccac ttttgaggat 1381 atgcaggttc tcgtgtctag ggaaaaacag agacccaagt tcccagaagc ctggaaagaa 1441 aatagcctgg cagtgaggtc actcaaggag acaatcgaag actgttggga ccaggatgca 1501 gaggctcggc ttactgcaca gtgtgctgag gaaaggatgg ctgaacttat gatgatttgg 1561 gaaagaaaca aatctgtgag cccaacagtc aatccaatgt ctactgctat gcagaatgaa 1621 cgcaacctgt cacataatag gcgtgtgcca aaaattggtc cttatccaga ttattcttcc 1681 tcctcataca ttgaagactc tatccatcat actgacagca tcgtgaagaa tatttcctct 1741 gagcattcta tgtccagcac acctttgact ataggggaaa aaaaccgaaa ttcaattaac 1801 tatgaacgac agcaagcaca agctcgaatc cccagccctg aaacaagtgt caccagcctc 1861 tccaccaaca caacaaccac aaacaccaca ggactcacgc caagtactgg catgactact 1921 atatctgaga tgccataccc agatgaaaca aatctgcata ccacaaatgt tgcacagtca 1981 attgggccaa cccctgtctg cttacagctg acagaagaag acttggaaac caacaagcta 2041 gacccaaaag aagttgataa gaacctcaag gaaagctctg atgagaatct catggagcac 2101 tctcttaaac agttcagtgg cccagaccca ctgagcagta ctagttctag cttgctttac 2161 ccactcataa aacttgcagt agaagcaact ggacagcagg acttcacaca gactgcaaat 2221 ggccaagcat gtttgattcc tgatgttctg cctactcaga tctatcctct ccccaagcag 2281 cagaaccttc ccaagagacc tactagtttg cctttgaaca ccaaaaattc aacaaaagag 2341 ccccggctaa aatttggcag caagcacaaa tcaaacttga aacaagtcga aactggagtt 2401 gccaagatga atacaatcaa tgcagcagaa cctcatgtgg tgacagtcac catgaatggt 2461 gtggcaggta gaaaccacag tgttaactcc catgctgcca caacccaata tgccaatggg 2521 acagtactat ctggccaaac aaccaacata gtgacacata gggcccaaga aatgttgcag 2581 aatcagttta ttggtgagga cacccggctg aatattaatt ccagtcctga tgagcatgag 2641 cctttactga gacgagagca acaagctggc catgatgaag gtgttctgga tcgtcttgtg 2701 gacaggaggg aacggccact agaaggtggc cgaactaatt ccaataacaa caacagcaat 2761 ccatgttcag aacaagatgt tcttgcacag ggtgttccaa gcacagcagc agatcctggg 2821 ccatcaaagc ccagaagagc acagaggcct aattctctgg atctttcagc cacaaatgtc 2881 ctggatggca gcagtataca gataggtgag tcaacacaag atggcaaatc aggatcaggt 2941 gaaaagatca agaaacgtgt gaaaactccc tattctctta agcggtggcg cccctccacc 3001 tgggtcatct ccactgaatc gctggactgt gaagtcaaca ataatggcag taacagggca 3061 gttcattcca aatccagcac tgctgtttac cttgcagaag gaggcactgc tacaaccatg 3121 gtgtctaaag atataggaat gaactgtctg tgaaatgttt tcaagcc // LOCUS HSU20240 1001 bp mRNA PRI 14-SEP-1995 DEFINITION Human C/EBP gamma mRNA, complete cds. ACCESSION U20240 NID g727293 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1001) AUTHORS Davydov,I.V., Bohmann,D., Krammer,P.H. and Li-Weber,M. TITLE Cloning of the cDNA encoding human C/EBP gamma, a protein binding to the PRE-I enhancer element of the human interleukin-4 promoter JOURNAL Gene 161 (2), 271-275 (1995) MEDLINE 95394369 REFERENCE 2 (bases 1 to 1001) AUTHORS Davydov,I.V. TITLE Direct Submission JOURNAL Submitted (25-JAN-1995) Ilia V. Davydov, Tumor Immunology Program, German Cancer Research Center, Im Neuenheimer Feld 280, Heidelberg D-69120, Germany FEATURES Location/Qualifiers source 1..1001 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="C" /clone_lib="lambda gt11 cDNA library prepared from Jurkat T-cell leukemia line using the reagents of Pharmacia and Stratagene" /cell_line="Jurkat T-cell leukemia line" CDS 251..703 /note="CCAAT/enhancer binding protein gamma" /codon_start=1 /function="DNA-binding protein" /product="C/EBP gamma" /db_xref="PID:g727294" /translation="MSKISQQNSTPGVNGISVIHTQAHASGLQQVPQLVPAGPGGGGK AVAPSKQSKKSSPMDRNSDEYRQRRERNNMAVKKSRLKSKQKAQDTLQRVNQLKEENE RLEAKIKLLTKELSVLKDLFLEHAHNLADNVQSISTENTTADGDNAGQ" polyA_site 1001 /note="63 A nucleotides" BASE COUNT 281 a 229 c 275 g 216 t ORIGIN 1 gccgcggctg cggaacgggc ggaggctgcc ggtttcgtaa ccgtcgctcc tcctcgctga 61 ctcgcgggct gtgaggcctg ggtcggctcg ggccgcaccg cgcggggccg ctcggagtgg 121 aggccgcctg ggggcaggcg ggctagagga gcaggtacat gtgaagattt tttggcagct 181 tagcgtggaa accattgatc accctgctct catttctacc tgttctgtgt tggcaaggga 241 gagtgcccaa atgagcaaga tatcgcagca aaacagcact ccaggggtga acggaattag 301 tgttatccat acccaggcac atgccagcgg cttacagcag gttcctcagc tggtgcctgc 361 tggccctggg ggaggaggca aagctgtggc tcccagcaag cagagcaaaa agagttcgcc 421 catggatcga aacagtgacg agtatcggca acgccgagag aggaacaaca tggctgtgaa 481 aaagagccgg ttgaaaagca agcagaaagc acaagacaca ctgcagagag tcaatcagct 541 caaagaagag aatgaacggt tggaagcaaa aatcaaattg ctgaccaagg aattaagtgt 601 actcaaagat ttgtttcttg agcatgcaca caaccttgca gacaacgtac agtccattag 661 cactgaaaat acgacagcag atggcgacaa tgcaggacag tagacctcac cctttccaga 721 ctttagagct tgtggcttga atgttaaagg tgtgaccacc gacaccactc atgtcaatgg 781 ctgaaagttg tccatttcca tgactcaaag acccattgga ggctattttc tgggatcagc 841 actgaagagt tgattagcta aaaatgttag ccttgtaatt cgaatatctg gttttaaatg 901 atagaggttt ttgtgggaat caaaatcccc caaatgttaa ggtatatggt aaaaaaagaa 961 atatctggga tcccgatgtt cttaataaat cctgacttcc c // LOCUS HSU20285 1866 bp mRNA PRI 19-DEC-1996 DEFINITION Human Gps1 (GPS1) mRNA, complete cds. ACCESSION U20285 NID g644878 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1866) AUTHORS Spain,B.H., Bowdish,K.S., Pacal,A.R., Staub,S.F., Koo,D., Chang,C.Y., Xie,W. and Colicelli,J. TITLE Two human cDNAs, including a homolog of Arabidopsis FUS6 (COP11), suppress G-protein- and mitogen-activated protein kinase-mediated signal transduction in yeast and mammalian cells JOURNAL Mol. Cell. Biol. 16 (12), 6698-6706 (1996) MEDLINE 97098647 REFERENCE 2 (bases 1 to 1866) AUTHORS Bowdish,K.S. TITLE Direct Submission JOURNAL Submitted (26-JAN-1995) Katherine S. Bowdish, Lab. of Struct. Biol. and Mol. Med., University of California at Los Angeles, 900 Veteran Avenue, Los Angeles, CA 90024, USA FEATURES Location/Qualifiers source 1..1866 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="19" /clone_lib="yeast expression library" /cell_line="U118-MG" /cell_type="human glioblastoma" gene 21..1523 /gene="GPS1" CDS 21..1523 /gene="GPS1" /note="similar to product encoded by Arabidopsis thaliana FUS6 gene, GenBank Accession Number L26498" /codon_start=1 /product="Gps1" /db_xref="PID:g644879" /translation="MEVDGTPRRGGCKMPLPVQVFNLQGAVEPMQIDVDPQEDPQNAP DVNYVVENPSLDLEQYAASYSGLMRIERQLFIADHCPTLRVEALKMALSFVQRTFNVD MYEEIHRKLSEATRELQNAPDAIPESGVEPPALDTAWVEATRKKALLKLEKLDTDLKN YKGNSIKESIRRGHDDLGDHYLDCGDLSNALKCYSRARDYCTSAKHVINMCLNVIKVS VYLQNWSHVLSYVSKAESTPEIAEQRGERDSQTQAILTKLKCAAGLAELAARKYKQAA KCLLLASFDHCDFPELLSPSNVAIYGGLCALATFDRQELQRNVISSSSFKLFLELEPQ VRDIIFKFYESKYASCLKMLDEMKDNLLLDMYLAPHVRTLYTQIRNRALIQYFSPYVS ADMHRMAAAFNTTVAALEDELTQLILEGLISARVDSHSKILYARDVDQRSTTFEKSLL MGKEFQRRAKAMMLRAAVLRNQIHVKSPPREGSQGELTPANSQSRMSTNM" polyA_site 1866 BASE COUNT 393 a 594 c 552 g 327 t ORIGIN 1 tctctgaagt tccagaatcg atggaagtgg acggcacgcc gcggcggggt gggtgcaaga 61 tgccgctgcc ggttcaggtg tttaacttgc agggggccgt ggagcccatg cagatcgacg 121 tggaccccca ggaagacccg cagaatgcac ctgacgtcaa ctacgtggtg gagaacccca 181 gcctggatct ggaacagtac gcggccagct acagcggcct gatgcgcatc gaacggcagc 241 tgttcattgc tgatcactgc cccacgctgc gggtggaggc cctgaagatg gccctctcct 301 tcgtgcagag aacctttaac gtggacatgt acgaggagat ccaccgcaag ctctcagagg 361 ccaccaggga gctgcagaac gcacccgacg ccatccctga gagcggcgtg gagcccccag 421 ccctggacac ggcctgggtg gaggccacgc ggaagaaggc gctgctgaag ctggagaagc 481 tggacacaga cctgaagaac tacaagggca actccatcaa agagagcatc cggcgcggcc 541 acgacgacct gggcgaccac tacctggact gtggggacct cagcaacgcc ctcaagtgct 601 attcccgggc ccgggactac tgcaccagcg ccaaacacgt catcaacatg tgcctcaatg 661 tcatcaaggt cagcgtctac ttgcagaatt ggtctcatgt gctcagctac gtcagcaagg 721 ctgagtccac cccagagatt gccgagcagc gaggagagcg tgacagccag acccaggcca 781 tcctcaccaa gctcaagtgt gccgcaggct tggcagagct ggccgccagg aagtacaagc 841 aggctgccaa gtgcctcctg ctggcttcct ttgatcactg tgacttccct gagctgctgt 901 cccccagcaa cgtggccatc tacggtggcc tgtgcgcctt ggctaccttt gaccggcagg 961 agctgcagcg caatgtcatc tccagcagct ccttcaagtt gttcttggag ctggagccac 1021 aggtccgaga catcatcttc aaattctacg agtccaagta cgcctcatgt ctcaagatgc 1081 tggacgagat gaaggacaac ctgctcctgg acatgtatct ggccccccat gtcaggaccc 1141 tgtacaccca gattcgcaac cgtgccctca tccagtattt cagcccctac gtgtcagccg 1201 acatgcatag gatggcggca gccttcaata ccacggtggc cgccctggag gacgagctga 1261 cgcagctaat cctggagggg ctgatcagtg cccgtgtgga ctcacacagc aagatcctat 1321 acgcccggga cgtggatcag cgcagcacca cctttgagaa gtctctgttg atgggcaagg 1381 agttccagcg ccgcgccaag gccatgatgc tgcgggcagc tgtgctccgc aaccagatcc 1441 atgtcaagtc cccgcccaga gaagggagcc agggggagct gactccagcc aacagccagt 1501 cccggatgag caccaacatg tgaggggtga accttggcct ccaggacatc tgcaccccct 1561 ccccacctcc acggacctcg gacctccagg cggctcagtg ctgcctgcgg cccagctaag 1621 gggcctggcc actgggtgcc acccagcctg tgtgccctcc ctggggctga ggaggcaggc 1681 ggctgctagt tgtggccctt cctggaagga gaggcctgca gggctcgacc ctgtgggttt 1741 ctgtccccag ggagcagact gtgcggcacc caggcccagt ggcaccattt cccagacccc 1801 tcctgttccc gcctcagtca ggtgcagaca agtgggcggt gtccattaaa gagcagactc 1861 agcgtt // LOCUS HSU20324 630 bp mRNA PRI 02-MAR-1996 DEFINITION Human LIM domain protein (CLP) mRNA, complete cds. ACCESSION U20324 NID g790228 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 630) AUTHORS Fung,Y.W., Wang,R.X., Heng,H.H. and Liew,C.C. TITLE Mapping of a human LIM protein (CLP) to human chromosome 11p15.1 by fluorescence in situ hybridization JOURNAL Genomics 28 (3), 602-603 (1995) MEDLINE 96039282 REFERENCE 2 (bases 1 to 630) AUTHORS Liew,C.-C. TITLE Direct Submission JOURNAL Submitted (26-JAN-1995) Choong-Chin Liew, Clinical Biochemistry, University of Toronto, 100 College, Toronto, Ontario, M5G 1L5, Canada FEATURES Location/Qualifiers source 1..630 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="F2345" /tissue_type="heart" /dev_stage="fetus" /map="11p15.1" /chromosome="11" gene 46..630 /gene="CLP" CDS 46..630 /gene="CLP" /codon_start=1 /product="LIM domain protein" /db_xref="PID:g790229" /translation="MPNWGGGAKCGACEKTVYHAEEIQCNGRSFHKTCFHCMACRKAL DSTTVAAHESEIYCKVCYGRRYGPKGIGYGQGAGCLSTDTGEHLGLQFQQSPKPARSV TTSNPSKFTAKFGESEKCPRCGKSVYAAEKVMGGGKPWHKTCFRCAICGKSLESTNVT DKDGELYCKVCYAKNFGPTGIGFGGLTQQVEKKE" BASE COUNT 169 a 151 c 182 g 128 t ORIGIN 1 ggagtccaca caggcagact tgaccttgac cagatagtct tcaagatgcc aaactggggc 61 ggaggcgcaa aatgtggagc ctgtgaaaag accgtctacc atgcagaaga aatccagtgc 121 aatggaagga gtttccacaa gacgtgtttc cactgcatgg cctgcaggaa ggctcttgac 181 agcacgacag tcgcggctca tgagtcggag atctactgca aggtgtgcta tgggcgcaga 241 tatggcccca aagggatcgg gtatggacaa ggcgctggct gtctcagcac agacacgggc 301 gagcatctcg gcctgcagtt ccaacagtcc ccaaagccgg cacgctcagt taccaccagc 361 aacccttcca aattcactgc gaagtttgga gagtccgaga agtgccctcg atgtggcaag 421 tcagtctatg ctgctgagaa ggttatggga ggtggcaagc cttggcacaa gacctgtttc 481 cgctgtgcca tctgtgggaa gagtctggag tccacaaatg tcactgacaa agatggggaa 541 ctttattgca aagtttgcta tgccaaaaat tttggcccca cgggtattgg gtttggaggc 601 cttacacaac aagtggaaaa gaaagaatga // LOCUS HSU20350 3100 bp mRNA PRI 09-MAR-1996 DEFINITION Human G protein-coupled receptor V28 mRNA, complete cds. ACCESSION U20350 NID g665580 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3100) AUTHORS Raport,C.J., Schweickart,V.L., Eddy RL,J.R., Shows,T.B. and Gray,P.W. TITLE The orphan G-protein-coupled receptor-encoding gene V28 is closely related to genes for chemokine receptors and is expressed in lymphoid and neural tissues JOURNAL Gene 163 (2), 295-299 (1995) MEDLINE 96011651 REFERENCE 2 (bases 1 to 3100) AUTHORS Raport,C.J. TITLE Direct Submission JOURNAL Submitted (27-JAN-1995) Carol J. Raport, ICOS Corporation, 22021 20th Avenue S.E., Bothell, WA 98021, USA FEATURES Location/Qualifiers source 1..3100 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="3" /map="3p21-3pter" /cell_type="peripheral blood mononuclear cell" CDS 88..1155 /codon_start=1 /function="G protein-coupled receptor" /product="V28" /db_xref="PID:g665581" /translation="MDQFPESVTENFEYDDLAEACYIGDIVVFGTVFLSIFYSVIFAI GLVGNLLVVFALTNSKKPKSVTDIYLLNLALSDLLFVATLPFWTHYLINEKGLHNAMC KFTTAFFFIGFFGSIFFITVISIDRYLAIVLAANSMNNRTVQHGVTISLGVWAAAILV AAPQFMFTKQKENECLGDYPEVLQEIWPVLRNVETNFLGFLLPLLIMSYCYFRIIQTL FSCKNHKKAKAIKLILLVVIVFFLFWTPYNVMIFLETLKLYDFFPSCDMRKDLRLALS VTETVAFSHCCLNPLIYAFAGEKFRRYLYHLYGKCLAVLCGRSVHVDFSSSESQRSRH GSVLSSNFTYHTSDGDALLLL" BASE COUNT 802 a 786 c 644 g 868 t ORIGIN 1 actcgtctct ggtaaagtct gagcaggaca gggtggctga ctggcagatc cagaggttcc 61 cttggcagtc cacgccaggc cttcaccatg gatcagttcc ctgaatcagt gacagaaaac 121 tttgagtacg atgatttggc tgaggcctgt tatattgggg acatcgtggt ctttgggact 181 gtgttcctgt ccatattcta ctccgtcatc tttgccattg gcctggtggg aaatttgttg 241 gtagtgtttg ccctcaccaa cagcaagaag cccaagagtg tcaccgacat ttacctcctg 301 aacctggcct tgtctgatct gctgtttgta gccactttgc ccttctggac tcactatttg 361 ataaatgaaa agggcctcca caatgccatg tgcaaattca ctaccgcctt cttcttcatc 421 ggcttttttg gaagcatatt cttcatcacc gtcatcagca ttgataggta cctggccatc 481 gtcctggccg ccaactccat gaacaaccgg accgtgcagc atggcgtcac catcagccta 541 ggcgtctggg cagcagccat tttggtggca gcaccccagt tcatgttcac aaagcagaaa 601 gaaaatgaat gccttggtga ctaccccgag gtcctccagg aaatctggcc cgtgctccgc 661 aatgtggaaa caaattttct tggcttccta ctccccctgc tcattatgag ttattgctac 721 ttcagaatca tccagacgct gttttcctgc aagaaccaca agaaagccaa agccattaaa 781 ctgatccttc tggtggtcat cgtgtttttc ctcttctgga caccctacaa cgttatgatt 841 ttcctggaga cgcttaagct ctatgacttc tttcccagtt gtgacatgag gaaggatctg 901 aggctggccc tcagtgtgac tgagacggtt gcatttagcc attgttgcct gaatcctctc 961 atctatgcat ttgctgggga gaagttcaga agataccttt accacctgta tgggaaatgc 1021 ctggctgtcc tgtgtgggcg ctcagtccac gttgatttct cctcatctga atcacaaagg 1081 agcaggcatg gaagtgttct gagcagcaat tttacttacc acacgagtga tggagatgca 1141 ttgctccttc tctgaaggga atcccaaagc cttgtgtcta cagagaacct ggagttcctg 1201 aacctgatgc tgactagtga ggaaagattt ttgttgttat ttcttacagg cacaaaatga 1261 tggacccaat gcacacaaaa caaccctaga gtgttgttga gaattgtgct caaaatttga 1321 agaatgaaca aattgaactc tttgaatgac aaagagtaga catttctctt actgcaaatg 1381 tcatcagaac tttttggttt gcagatgaca aaaattcaac tcagactagt ttagttaaat 1441 gagggtggtg aatattgttc atattgtggc acaagcaaaa gggtgtctga gccctcaaag 1501 tgaggggaaa ccagggcctg agccaagcta gaattccctc tctctgactc tcaaatcttt 1561 tagtcattat agatccccca gactttacat gacacagctt tatcaccaga gagggactga 1621 cacccatgtt tctctggccc caagggaaaa ttcccaggga agtgctctga taggccaagt 1681 ttgtatcagg tgcccatccc tggaaggtgc tgttatccat ggggaaggga tatataagat 1741 ggaagcttcc agtccaatct catggagaag cagaaataca tatttccaag aagttggatg 1801 ggtgggtact attctgatta cacaaaacaa atgccacaca tcacccttac catgtgcctg 1861 atccagcctc tcccctgatt acaccagcct cgtcttcatt aagccctctt ccatcatgtc 1921 cccaaacctg caagggctcc ccactgccta ctgcatcgag tcaaaactca aatgcttggc 1981 ttctcatacg tccaccatgg ggtcctacca atagattccc cattgcctcc tccttcccaa 2041 aggactccac ccatcctatc agcctgtctc ttccatatga cctcatgcat ctccacctgc 2101 tcccaggcca gtaagggaaa tagaaaaacc ctgcccccaa ataagaaggg atggattcca 2161 accccaactc cagtagcttg ggacaaatca agcttcagtt tcctggtctg tagaagaggg 2221 ataaggtacc tttcacatag agatcatcct ttccagcatg aggaactagc caccaactct 2281 tgcaggtctc aacccttttg tctgcctctt agacttctgc tttccacacc tgcactgctg 2341 tgctgtgccc aagttgtggt gctgacaaag cttggaagag cctgcaggtg ccttggccgc 2401 gtgcatagcc cagacacaga agaggctggt tcttacgatg gcacccagtg agcactccca 2461 agtctacaga gtgatagcct tccgtaaccc aactctcctg gactgccttg aatatcccct 2521 cccagtcacc ttgtgcaagc ccctgcccat ctgggaaaat accccatcat tcatgctact 2581 gccaacctgg ggagccaggg ctatgggagc agcttttttt tcccccctag aaacgtttgg 2641 aacaatgtaa aactttaaag ctcgaaaaca attgtaataa tgctaaagaa aaagtcatcc 2701 aatctaacca catcaatatt gtcattcctg tattcacccg tccagacctt gttcacactc 2761 tcacatgttt agagttgcaa tcgtaatgta cagatggttt tataatctga tttgttttcc 2821 tcttaacgtt agaccacaaa tagtgctcgc tttctatgta gtttggtaat tatcatttta 2881 gaagactcta ccagactgtg tattcattga agtcagatgt ggtaactgtt aaattgctgt 2941 gtatctgata gctctttggc agtctatatg tttgtataat gaatgagaga ataagtcatg 3001 ttccttcaag atcatgtacc ccaatttact tgccattact caattgataa acatttaact 3061 tgtttccaat gtttagcaaa tacatatttt atagaacttc // LOCUS HSU20352 1213 bp mRNA PRI 04-APR-1995 DEFINITION Human malate dehydrogenase (MDHA) mRNA, complete cds. ACCESSION U20352 NID g755818 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1213) AUTHORS Lo,A.S.Y. and Waye,M.M.Y. TITLE Characterization of human heart cDNA clones coding for cytosolic malate dehydrogenase JOURNAL Unpublished REFERENCE 2 (bases 1 to 1213) AUTHORS Lo,A.S. TITLE Direct Submission JOURNAL Submitted (27-JAN-1995) Agnes S.Y. Lo, Biochemistry, The Chinese University of Hong Kong, Shatin, Hong Kong, Hong Kong FEATURES Location/Qualifiers source 1..1213 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HUMMDHA" /tissue_type="heart" /dev_stage="adult and fetal" gene 1..1005 /gene="MDHA" CDS 1..1005 /gene="MDHA" /codon_start=1 /product="malate dehydrogenase" /db_xref="PID:g755819" /translation="MSEPIRVLVTGAAGQIAYSLLYSIGNGSVFGKDQPIILVLLDIT PMMGVLDGVLMELQDCALPLLKDVIATDKEDVAFKDLDVAILVGSMPRREGMERKDLL KASVKIFKSQGAALDKYAKKSVKVIVVGNPANTNCLTASKSAPSIPKENFSCLTRLDH NRAKAQIALKLGVTANDVKNVIIWGNHSTTQYPDVNHAKVKLQGEEVGVYEALEDDSW LKGEFVTTVQQRSAAVIKARKLSSAMSAAKAICDHVRDIWFGTPEGEFVSMGVISDGN SYGVPDDLLYSFPVVIKNKTRKFVEGLPINDFSREKMDLTAKELTEEKESAFEFLSSA " 3'UTR 1006..1213 polyA_signal 1193..1198 polyA_site 1213 BASE COUNT 352 a 247 c 275 g 339 t ORIGIN 1 atgtctgaac caatcagagt ccttgtgact ggagcagctg gtcaaattgc atattcactg 61 ctgtacagta ttggaaatgg atctgtcttt ggtaaagatc agcctataat tcttgtgctg 121 ttggatatca cccccatgat gggtgtcctg gacggtgtcc taatggaact gcaagactgt 181 gcccttcccc tcctgaaaga tgtcatcgca acagataaag aagacgttgc cttcaaagac 241 ctggatgtgg ccattcttgt gggctccatg ccaagaaggg aaggcatgga gagaaaagat 301 ttactgaaag caagtgtgaa aatcttcaaa tcccagggtg cagccttaga taaatacgcc 361 aagaagtcag ttaaggttat tgttgtgggt aatccagcca ataccaactg cctgactgct 421 tccaagtcag ctccatccat ccccaaggag aacttcagtt gcttgactcg tttggatcac 481 aaccgagcta aagctcaaat tgctcttaaa cttggtgtga ctgctaatga tgtaaagaat 541 gtcattatct ggggaaacca ttccacgact cagtatccag atgtcaacca tgccaaggtg 601 aaattgcaag gagaggaagt tggtgtttat gaagctctgg aagatgacag ctggctcaag 661 ggagaatttg tcacgactgt gcagcagcgt agcgctgctg tcatcaaggc tcgaaaacta 721 tccagtgcca tgtctgctgc aaaagccatc tgtgaccacg tcagggacat ctggtttgga 781 accccagagg gagagtttgt gtccatgggt gttatctctg atggcaactc ctatggtgtt 841 cctgatgatc tgctctactc attccctgtt gtaatcaaga ataagacccg gaagtttgtt 901 gaaggtctcc ctattaatga tttctcacgt gagaagatgg atcttactgc aaaggaactg 961 acagaagaaa aagaaagtgc ttttgaattt ctttcctctg cctgactaga caatgatgtt 1021 actaaatgct tcaaagctga agaatctaaa tgtcgtcttt gactcaagta ccaaataata 1081 ataatgctat acttaaatta ctcgtgaaaa acaacacatt ttaaagatta cgtgcttctt 1141 ggtacaggtt tgtgaatgac agtttatcgt catgctgtta gtgtgcattc taaataaata 1201 tatattcaaa tga // LOCUS HSU20362 2853 bp mRNA PRI 03-FEB-1996 DEFINITION Human Tg737 mRNA, complete cds. ACCESSION U20362 NID g755485 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2853) AUTHORS Schrick,J.J., Onuchic,L.F., Reeders,S.T., Korenberg,J., Chen,X.N., Moyer,J.H., Wilkinson,J.E. and Woychik,R.P. TITLE Characterization of the human homologue of the mouse Tg737 candidate polycystic kidney disease gene JOURNAL Hum. Mol. Genet. 4 (4), 559-567 (1995) MEDLINE 95359958 REFERENCE 2 (bases 1 to 2853) AUTHORS Woychik,R.P. TITLE Direct Submission JOURNAL Submitted (27-JAN-1995) Richard P. Woychik, Biology Division, Oak Ridge National Laboratory, Bear Creek Rd., Oak Ridge, TN 37831, USA FEATURES Location/Qualifiers source 1..2853 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hTg737" /chromosome="13" /map="13q12.1" /tissue_type="liver" /dev_stage="adult" 5'UTR 1..191 gene 192..2666 /gene="Tg737" CDS 192..2666 /gene="Tg737" /note="mutations in the mouse Tg737 gene cause polycystic kidney disease" /codon_start=1 /db_xref="PID:g755486" /translation="MMQNVHLAPETDEDDLYSGYNDYNPIYDIEELENDAAFQQAVRT SHGRRPPITAKISSTAVTRPIATGYGSKTSLASSIGRPMTGAIQDGVTRPMTAVRAAG FTKAALRGSAFDPLSQSRGPASPLEAKKKDSPEEKIKQLEKEVNELVEESCIANSCGD LKLALEKAKDAGRKERVLVRQREQVTTPENINLDLTYSVLSNLASQYSVNEMYAEALN TYQVIVKNKMFSNAGILKMNMGNIYLKQRNYSKAIKFYRMALDQVPSVNKQMRIKIMQ NIGVTFIQAGQYSDAINSYEHIMSMAPNLKAGYNLTICYFAIGDREKMKKAFQKLITV PLEIDEDKYISPSDDPHTNLVTEAIKNDHLRQMERERKAMAEKYITTSAKLIAPVIET SFAAGCDWCVEVVKASQYVELANDLEINKAVTYLRQKDYNQAVEILKVLEKKDNRVKS AAATNLSALYYMGKDFAQASSYADIAVNSDRYNPAALTNKGNTVFANGDYEKAAEFYK EALRNDSSCTEALYNIGLTYEKLNRLDEALDCFLKLHAILRNSAEVLYQIANIYELME NPSQAIEWLMQVVSVIPTDPQVLSKLGELYDREGDKSQAFQYYYESYRYFPCNIEVIE WLGAYYIDTQFWEKAIQYFERASLIQPTQVKWQLMVASCFRRSGNYQKALDTYKDTHR KFPENVECLRFLVRLCTDLGLKDAQEYARKLKRLEKMKEIREQRIKSGRDGSGGSRGK REGSASGDSGQNYSASSKGERLSARLRALPGTNEPYESSSNKEIDASYVDPLGPQIER PKTAAKKRIDEDDFADEELGDDLLPE" misc_feature 781..1107 /gene="Tg737" /note="encodes tetratrico peptide repeats (TPR) 1-3" misc_feature 1539..2252 /gene="Tg737" /note="encodes tetratrico peptide repeats (TPR) 4-10" 3'UTR 2667..2853 polyA_signal 2835..2841 BASE COUNT 962 a 526 c 620 g 745 t ORIGIN 1 cgcaccaccc ccttggcgcc gcgtctctcc cgcgcccgcg cttccggccc cggcgcgccc 61 gcgcgaggac tgtgggagcg gcttccttgg attccgcgct tggcaacggc tcggcgtcgc 121 gctttggcca accgctgcgt cgtccctggg cccgaataac tgtcgcccgc ttccctcagc 181 gcgaggtaca aatgatgcaa aatgtgcacc tggctccaga gacagatgaa gatgatcttt 241 attccggcta taatgactac aatccaatct atgatatcga ggaattggag aatgatgcag 301 cttttcagca agctgtgagg actagtcatg gcagaagacc tccaataact gctaaaatat 361 caagcacggc agttactaga cctatagcta ctggatatgg gtccaagaca tctctggcat 421 catcaatagg aagaccaatg acaggggcta ttcaggatgg agttactaga cccatgacag 481 cagtgagagc agctggtttt accaaagcag ctttgagagg ctctgcattt gaccccctta 541 gtcagtcaag gggccctgct tcccctttgg aagccaagaa aaaagatagc ccagaggaaa 601 aaataaagca attagagaag gaagtaaatg agttggtaga agaaagctgt attgccaata 661 gttgtggaga cttaaaattg gccttagaaa aggcaaaaga tgcaggaaga aaagagagag 721 tcctggtgag acagcgagaa caagttacaa ctccagaaaa tatcaatttg gatttaactt 781 actcagttct ttccaatttg gccagtcagt attcagttaa tgaaatgtat gccgaagcac 841 ttaacactta tcaagttata gtcaaaaata agatgtttag caatgcagga atattgaaaa 901 tgaatatggg aaatatctat ttaaagcaaa gaaattattc caaagccatt aaattctacc 961 gaatggcatt agaccaagtt ccaagtgtca ataagcaaat gaggattaaa ataatgcaga 1021 atattggagt tacatttatt caggctggtc agtattcaga tgctattaat tcatatgagc 1081 acataatgag catggcacca aatctgaagg caggctacaa cctaactatc tgttattttg 1141 ctattggaga ccgagaaaaa atgaagaagg cattccaaaa attgattact gttccattag 1201 aaattgatga agataaatat atttcaccaa gtgatgatcc tcatactaac ttagtaactg 1261 aagctataaa aaatgatcac ctcaggcaaa tggaacgtga aaggaaagcc atggcagaaa 1321 aatatattac gacatctgca aaactcattg ctcctgtaat tgaaacatct tttgctgcag 1381 gttgtgattg gtgcgtggaa gtggtgaaag cttctcaata tgtagagcta gccaatgatc 1441 tggaaataaa caaagcagtt acatacttga gacaaaaaga ctataaccaa gctgtagaga 1501 tcttaaaagt gttggaaaaa aaggacaata gagtgaaaag tgcagctgca accaatctct 1561 cagccctgta ttatatggga aaggattttg cacaagccag cagctatgca gatatagctg 1621 tgaactctga tagatataat ccagcagctc ttactaataa agggaataca gtttttgcaa 1681 atggtgatta tgagaaggcc gctgaattct ataaagaggc tctaagaaat gattcttctt 1741 gtactgaagc actttataat attggcctta cctatgagaa actaaatcgg ctagatgagg 1801 ctttggactg tttcctgaaa cttcacgcaa tcctacgaaa cagtgccgaa gttctttacc 1861 agatagcaaa tatatatgaa ttaatggaaa atcccagtca agctattgaa tggctaatgc 1921 aggtggtcag tgttattcca accgatcctc aagttttatc taagctagga gaattatatg 1981 atcgtgaagg agataaatct caagcatttc aatattacta tgagtcatat aggtattttc 2041 cttgtaatat tgaagtcatt gagtggcttg gagcctatta cattgacacc caattttggg 2101 aaaaagctat tcagtacttt gaaagagctt ctcttataca gcctacacaa gtgaaatggc 2161 agctgatggt agctagttgt ttcagaagaa gtggtaacta ccaaaaagca ttagatactt 2221 acaaagatac tcacagaaaa tttccagaaa atgtcgaatg tctgcgtttc ttagttcgtc 2281 tctgcacaga tcttggatta aaagatgctc aagaatatgc cagaaaactg aagaggttgg 2341 aaaaaatgaa agaaataagg gaacagcgca taaagtcagg cagagatggc agtgggggct 2401 cccgtggcaa aagagaagga agtgctagcg gtgatagtgg ccagaactat agtgccagta 2461 gtaaaggtga acgactaagt gccagactca gagctttacc tgggacaaat gaaccttatg 2521 aaagtagcag taacaaagaa atagatgcct cctatgtgga cccacttggc cctcaaatag 2581 aacgaccaaa aactgcagcc aagaaaagga tcgatgagga tgattttgct gatgaagaat 2641 taggagatga tttgcttcca gaataatatt cactttaata tttattaaag gaaagaaatt 2701 gccttatgag atcatcctca tgttaaacct tggattaaat atctaacctg taattatttt 2761 ttttcactgt caaaacttaa gtaagtgtat tctattctgt atgtatgcat ttaagttgtt 2821 tttttctttt aaggaataaa aacaggtaaa act // LOCUS HSU20498 706 bp mRNA PRI 20-JAN-1996 DEFINITION Human p19 protein mRNA, complete cds. ACCESSION U20498 NID g1161921 KEYWORDS cell cycle inhibitor; Nur77 associating protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 706) AUTHORS Chan,F.K., Zhang,J., Cheng,L., Shapiro,D.N. and Winoto,A. TITLE Identification of human and mouse p19, a novel CDK4 and CDK6 inhibitor with homology to p16ink4 JOURNAL Mol. Cell. Biol. 15 (5), 2682-2688 (1995) MEDLINE 95257949 REFERENCE 2 (bases 1 to 706) AUTHORS Winoto,A. TITLE Direct Submission JOURNAL Submitted (30-JAN-1995) Astar Winoto, Molecular and Cell Biology, University of California at Berkeley, Berkeley, CA 94720-3200, USA FEATURES Location/Qualifiers source 1..706 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="4.1" /tissue_type="thymus" CDS 1..501 /codon_start=1 /product="p19 protein" /db_xref="PID:g1161922" /translation="MLLEEVRAGDRLSGAAARGDVQEVRRLLHRELVHPDALNRFGKT ALQVMMFGSTAIALELLKQGASPNVQDTSGTSPVHDAARTGFLDTLKVLVEHGADVNV PDGTGALPIHLAVQEGHTAVVSFLAAESDLHRRDARGLTPLELALQRGAQDLVDILPG HMVAPL" BASE COUNT 121 a 202 c 227 g 156 t ORIGIN 1 atgctgctgg aggaggttcg cgccggcgac cggctgagtg gggcggcggc ccggggcgac 61 gtgcaggagg tgcgccgcct tctgcaccgc gagctggtgc atcccgacgc cctcaaccgc 121 ttcggcaaga cggcgctgca ggtcatgatg tttggcagca ccgccatcgc cctggagctg 181 ctgaagcaag gtgccagccc caatgtccag gacacctccg gtaccagtcc agtccatgac 241 gcagcccgca ctggattcct ggacaccctg aaggtcctag tggagcacgg ggctgatgtc 301 aacgtgcctg atggcaccgg ggcacttcca atccatctgg cagttcaaga gggtcacact 361 gctgtggtca gctttctggc agctgaatct gatctccatc gcagggacgc caggggtctc 421 acacccttgg agctggcact gcagagaggg gctcaggacc tcgtggacat cctgccaggc 481 cacatggtgg ccccgctgtg atctggggtc accctctcca gcaagagaac ccccccgtgg 541 ttatgtatca gaagagaggg gaagaaacac tttctcttct tgtttctcct gcccactgct 601 gcagtagggg aggagcacag tttgtggctt ataggtgttg gttttggggg tgtgagtgtt 661 tgggggacgt tctcatttgt ttttctcact ccttttggtg tgttgg // LOCUS HSU20530 636 bp mRNA PRI 17-FEB-1996 DEFINITION Human bone phosphoprotein spp-24 precursor mRNA, complete cds. ACCESSION U20530 NID g1195462 KEYWORDS human spp24; human spp24 mature protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 636) AUTHORS Coulson,L., Hu,B. and Price,P.A. TITLE Isolation and molecular cloning of human spp-24, a bone phosphoprotein related in sequence to the cystatin family of thiol protease inhibitors JOURNAL Unpublished REFERENCE 2 (bases 1 to 636) AUTHORS Price,P.A. TITLE Direct Submission JOURNAL Submitted (31-JAN-1995) Paul A. Price, Biology, University of California at San Diego, 9500 Gilman Drive, La Jolla, CA 92093-0322, USA COMMENT This protein is the human version of bovine spp-24 (GenBank Accession number H92795) described in: Hu,B., Coulson,L., Moyer,B., Price,P.A., Isolation and molecular cloning of a novel bone phosphoprotein related in sequence to the cystatin family of thiol protease inhibitors, J. Biol. Chem. 270 (1), 431-436 (1995). FEATURES Location/Qualifiers source 1..636 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1..636 /note="secreted phosphoprotein, 24 kDa" /citation=[1] /codon_start=1 /product="spp-24 precursor" /db_xref="PID:g1195463" /translation="MISRMEKMTMMMKILIMFALGMNYWSCSGFPVYDYDPSSLRDAL SASVVKVNSQSLSPYLFRAFRSSLKRVEVLDENNLVMNLEFSIRETTCRKDSGEDPAT CAFQRDYYVSTAVCRSTVKVSAQQVQGVHARCSWSSSTSESYSSEEMIFGDMLGSHKW RNNYLFGLISDESISEQFYDRSLGIMRRVLPPGNRRYPNHRHRARINTDFE" sig_peptide 13..87 mat_peptide 88..633 /citation=[1] /product="spp-24" BASE COUNT 176 a 132 c 161 g 167 t ORIGIN 1 atgatttcca gaatggagaa gatgacgatg atgatgaaga tattgattat gtttgctctt 61 ggaatgaact actggtcttg ctcaggtttc ccagtgtacg actacgatcc atcctcctta 121 agggatgccc tcagtgcctc tgtggtaaaa gtgaattccc agtcactgag tccgtatctg 181 tttcgggcat tcagaagctc attaaaaaga gttgaggtcc tagatgagaa caacttggtc 241 atgaatttag agttcagcat ccgggagaca acatgcagga aggattctgg agaagatccc 301 gctacatgtg ccttccagag ggactactat gtgtccacag ctgtttgcag aagcaccgtg 361 aaggtatctg cccagcaggt gcagggcgtg catgctcgct gcagctggtc ctcctccacg 421 tctgagtctt acagcagcga agagatgatt tttggggaca tgttgggatc tcataaatgg 481 agaaacaatt atctatttgg tctcatttca gacgagtcca taagtgaaca attttatgat 541 cggtcacttg ggatcatgag aagggtattg cctcctggaa acagaaggta cccaaaccac 601 cggcacagag caagaataaa tactgacttt gagtaa // LOCUS HSU20536 1545 bp mRNA PRI 27-JUL-1995 DEFINITION Human cysteine protease Mch2 isoform alpha (Mch2) mRNA, complete cds. ACCESSION U20536 NID g882253 KEYWORDS Ced-3; ICE; Apoptosis. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1545) AUTHORS Fernandes-Alnemri,T., Litwack,G. and Alnemri,E.S. TITLE Mch2, a new member of the apoptotic Ced-3/Ice cysteine protease gene family JOURNAL Cancer Res. 55 (13), 2737-2742 (1995) MEDLINE 95316841 REFERENCE 2 (bases 1 to 1545) AUTHORS Fernandes-Alnemri,T., Litwack,G. and Alnemri,E.S. TITLE CPP32, a novel human apoptotic protein with homology to Caenorhabditis elegans cell death protein Ced-3 and mammalian interleukin-1 beta-converting enzyme JOURNAL J. Biol. Chem. 269 (49), 30761-30764 (1994) MEDLINE 95074098 REFERENCE 3 (bases 1 to 1545) AUTHORS Alnemri,E.S. TITLE Direct Submission JOURNAL Submitted (01-FEB-1995) Emad S. Alnemri, Pharmacology, Thomas Jefferson University, Jefferson Cancer Institute, 233 S. Tenth Street, Philadelphia, PA 19107, USA FEATURES Location/Qualifiers source 1..1545 /organism="Homo sapiens" /note="human" /db_xref="taxon:9606" source join(1..1488,1490..1537,1539..1545) /organism="Homo sapiens" /clone="Mch5b3" /cell_line="Jurkat" /cell_type="T-lymphocyte" gene 79..960 /gene="Mch2" CDS 79..960 /gene="Mch2" /note="The gene product of this sequence is a new member of the interleukin 1-beta converting enzyme gene family of cysteine proteases; cysteine protease Mch2 isoform alpha" /codon_start=1 /product="cysteine protease Mch2 isoform alpha" /db_xref="PID:g882254" /translation="MSSASGLRRGHPAGGEENMTETDAFYKREMFDPAEKYKMDHRRR GIALIFNHERFFWHLTLPERRRTCADRDNLTRRFSDLGFEVKCFNDLKAEELLLKIHE VSTVSHADADCFVCVFLSHGEGNHIYAYDAKIEIQTLTGLFKGDKCHSLVGKPKIFII QACRGNQHDVPVIPLDVVDNQTEKLDTNITEVDAASVYTLPAGADFLMCYSVAEGYYS HRETVNGSWYIQDLCEMLGKYGSSLEFTELLTLVNRKVSQRRVDFCKDPSAIGKKQVP CFASMLTKKLHFFPKSN" polyA_site 1545 /note="23 A nucleotides" BASE COUNT 458 a 320 c 383 g 384 t ORIGIN 1 ccgagggcgg ggccgggccc gggagcctgt ggcttcagga agaggagggc aaggtgtctg 61 gctgcgcgtt tggctgcaat gagctcggcc tcggggctcc gcagggggca cccggcaggt 121 ggggaagaaa acatgacaga aacagatgcc ttctataaaa gagaaatgtt tgatccggca 181 gaaaagtaca aaatggacca caggaggaga ggaattgctt taatcttcaa tcatgagagg 241 ttcttttggc acttaacact gccagaaagg cggcgcacct gcgcagatag agacaatctt 301 acccgcaggt tttcagatct aggatttgaa gtgaaatgct ttaatgatct taaagcagaa 361 gaactactgc tcaaaattca tgaggtgtca actgttagcc acgcagatgc cgattgcttt 421 gtgtgtgtct tcctgagcca tggcgaaggc aatcacattt atgcatatga tgctaaaatc 481 gaaattcaga cattaactgg cttgttcaaa ggagacaagt gtcacagcct ggttggaaaa 541 cccaagatat ttatcatcca ggcatgtcgg ggaaaccagc acgatgtgcc agtcattcct 601 ttggatgtag tagataatca gacagagaag ttggacacca acataactga ggtggatgca 661 gcctccgttt acacgctgcc tgctggagct gacttcctca tgtgttactc tgttgcagaa 721 ggatattatt ctcaccggga aactgtgaac ggctcatggt acattcaaga tttgtgtgag 781 atgttgggaa aatatggctc ctccttagag ttcacagaac tcctcacact ggtgaacagg 841 aaagtttctc agcgccgagt ggacttttgc aaagacccaa gtgcaattgg aaagaagcag 901 gttccctgtt ttgcctcaat gctaactaaa aagctgcatt tctttccaaa atctaattaa 961 ttaatagagg ctatctaatt tcacactctg tattgaaaat ggctttctca gccaggcgtg 1021 gttactcaca cctgtaatcc cagcactttg ggagtccaag gtgggcggat cacctgaggt 1081 cgggagttcg agaccagcct gaccaacatg gcagaagccc cgcctctact aaaaatgcaa 1141 aaaaaaattt agctaggcat ggcggcgcat gcctgcaatc ccagctactt ggaaggctga 1201 ggcaggagaa tcacttgaac ccaggaggtg gaggctgcgg tgagccgagc attgcgccat 1261 tgcactccag cctgggcaac gagtgaaact ccgtctcaaa aaaaaagaaa atgtctttct 1321 cttcctttta tataaatatc gttagggtga agcattatgg tctaatgatt caaatgtttt 1381 aaagtttaat gcctagcaga gaactgcctt aaaaaaaaaa agttcatgtt ggccatggtg 1441 aaagggtttg atatggagaa acaaaatcct caggaaatta gataaataaa aatttataag 1501 catttgtatt attttttaat aaactgcagg gttacacaaa aatct // LOCUS HSU20759 3783 bp mRNA PRI 02-FEB-1996 DEFINITION Human parathyroid cell calcium-sensing receptor mRNA, complete cds. ACCESSION U20759 NID g683744 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3783) AUTHORS Garrett,J.E., Capuano,I.V., Hammerland,L.G., Hung,B.C., Brown,E.M., Hebert,S.C., Nemeth,E.F. and Fuller,F. TITLE Molecular cloning and functional expression of human parathyroid calcium receptor cDNAs JOURNAL J. Biol. Chem. 270 (21), 12919-12925 (1995) MEDLINE 95279439 REFERENCE 2 (bases 1 to 3783) AUTHORS Garrett,J.E. TITLE Direct Submission JOURNAL Submitted (07-FEB-1995) James E. Garrett, Molecular Biology, NPS Pharmaceuticals, Inc., 420 Chipeta Way, Salt Lake City, UT 84108, USA FEATURES Location/Qualifiers source 1..3783 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="phPCaR-4.0" /clone_lib="lambdaZ-hPG2ss" /sex="male" /tissue_type="parathyroid gland" 5'UTR 1..372 CDS 373..3609 /codon_start=1 /function="seven transmembrane domain G protein-coupled receptor that senses changes in extracellular calcium ions" /product="parathyroid cell calcium-sensing receptor" /db_xref="PID:g683745" /translation="MAFYSCCWVLLALTWHTSAYGPDQRAQKKGDIILGGLFPIHFGV AAKDQDLKSRPESVECIRYNFRGFRWLQAMIFAIEEINSSPALLPNLTLGYRIFDTCN TVSKALEATLSFVAQNKIDSLNLDEFCNCSEHIPSTIAVVGATGSGVSTAVANLLGLF YIPQVSYASSSRLLSNKNQFKSFLRTIPNDEHQATAMADIIEYFRWNWVGTIAADDDY GRPGIEKFREEAEERDICIDFSELISQYSDEEEIQHVVEVIQNSTAKVIVVFSSGPDL EPLIKEIVRRNITGKIWLASEAWASSSLIAMPQYFHVVGGTIGFALKAGQIPGFREFL KKVHPRKSVHNGFAKEFWEETFNCHLQEGAKGPLPVDTFLRGHEESGDRFSNSSTAFR PLCTGDENISSVETPYIDYTHLRISYNVYLAVYSIAHALQDIYTCLPGRGLFTNGSCA DIKKVEAWQVLKHLRHLNFTNNMGEQVTFDECGDLVGNYSIINWHLSPEDGSIVFKEV GYYNVYAKKGERLFINEEKILWSGFSREVPFSNCSRDCLAGTRKGIIEGEPTCCFECV ECPDGEYSDETDASACNKCPDDFWSNENHTSCIAKEIEFLSWTEPFGIALTLFAVLGI FLTAFVLGVFIKFRNTPIVKATNRELSYLLLFSLLCCFSSSLFFIGEPQDWTCRLRQP AFGISFVLCISCILVKTNRVLLVFEAKIPTSFHRKWWGLNLQFLLVFLCTFMQIVICV IWLYTAPPSSYRNQELEDEIIFITCHEGSLMALGFLIGYTCLLAAICFFFAFKSRKLP ENFNEAKFITFSMLIFFIVWISFIPAYASTYGKFVSAVEVIAILAASFGLLACIFFNK IYIILFKPSRNTIEEVRCSTAAHAFKVAARATLRRSNVSRKRSSSLGGSTGSTPSSSI SSKSNSEDPFPQPERQKQQQPLALTQQEQQQQPLTLPQQQRSQQQPRCKQKVIFGSGT VTFSLSFDEPQKNAMAHGNSTHQNSLEAQKSSDTLTRHQPLLPLQCGETDLDLTVQET GLQGPVGGDQRPEVEDPEELSPALVVSSSQSFVISGGGSTVTENVVNS" 3'UTR 3607..3783 polyA_site 3783 BASE COUNT 892 a 1067 c 975 g 849 t ORIGIN 1 caacaggcac ctggctgcag ccaggaagga ccgcacgccc tttcgcgcag gagagtggaa 61 ggagggagct gtttgccagc accgaggtct tgcggcacag gcaacgcttg acctgagtct 121 tgcagaatga aaggcatcac aggaggcctc tgcatgatgt ggcttccaaa gactcaagga 181 ccacccacat tacaagtctg gattgaggaa ggcagaaatg gagattcaaa caccacgtct 241 tctattattt tattaatcaa tctgtagaca tgtgtcccca ctgcagggag tgaactgctc 301 caagggagaa acttctggga gcctccaaac tcctagctgt ctcatccctt gccctggaga 361 gacggcagaa ccatggcatt ttatagctgc tgctgggtcc tcttggcact cacctggcac 421 acctctgcct acgggccaga ccagcgagcc caaaagaagg gggacattat ccttgggggg 481 ctctttccta ttcattttgg agtagcagct aaagatcaag atctcaaatc aaggccggag 541 tctgtggaat gtatcaggta taatttccgt gggtttcgct ggttacaggc tatgatattt 601 gccatagagg agataaacag cagcccagcc cttcttccca acttgacgct gggatacagg 661 atatttgaca cttgcaacac cgtttctaag gccttggaag ccaccctgag ttttgttgct 721 caaaacaaaa ttgattcttt gaaccttgat gagttctgca actgctcaga gcacattccc 781 tctacgattg ctgtggtggg agcaactggc tcaggcgtct ccacggcagt ggcaaatctg 841 ctggggctct tctacattcc ccaggtcagt tatgcctcct ccagcagact cctcagcaac 901 aagaatcaat tcaagtcttt cctccgaacc atccccaatg atgagcacca ggccactgcc 961 atggcagaca tcatcgagta tttccgctgg aactgggtgg gcacaattgc agctgatgac 1021 gactatgggc ggccggggat tgagaaattc cgagaggaag ctgaggaaag ggatatctgc 1081 atcgacttca gtgaactcat ctcccagtac tctgatgagg aagagatcca gcatgtggta 1141 gaggtgattc aaaattccac ggccaaagtc atcgtggttt tctccagtgg cccagatctt 1201 gagcccctca tcaaggagat tgtccggcgc aatatcacgg gcaagatctg gctggccagc 1261 gaggcctggg ccagctcctc cctgatcgcc atgcctcagt acttccacgt ggttggcggc 1321 accattggat tcgctctgaa ggctgggcag atcccaggct tccgggaatt cctgaagaag 1381 gtccatccca ggaagtctgt ccacaatggt tttgccaagg agttttggga agaaacattt 1441 aactgccacc tccaagaagg tgcaaaagga cctttacctg tggacacctt tctgagaggt 1501 cacgaagaaa gtggcgacag gtttagcaac agctcgacag ccttccgacc cctctgtaca 1561 ggggatgaga acatcagcag tgtcgagacc ccttacatag attacacgca tttacggata 1621 tcctacaatg tgtacttagc agtctactcc attgcccacg ccttgcaaga tatatatacc 1681 tgcttacctg ggagagggct cttcaccaat ggctcctgtg cagacatcaa gaaagttgag 1741 gcgtggcagg tcctgaagca cctacggcat ctaaacttta caaacaatat gggggagcag 1801 gtgacctttg atgagtgtgg tgacctggtg gggaactatt ccatcatcaa ctggcacctc 1861 tccccagagg atggctccat cgtgtttaag gaagtcgggt attacaacgt ctatgccaag 1921 aagggagaaa gactcttcat caacgaggag aaaatcctgt ggagtgggtt ctccagggag 1981 gtgcccttct ccaactgcag ccgagactgc ctggcaggga ccaggaaagg gatcattgag 2041 ggggagccca cctgctgctt tgagtgtgtg gagtgtcctg atggggagta tagtgatgag 2101 acagatgcca gtgcctgtaa caagtgccca gatgacttct ggtccaatga gaaccacacc 2161 tcctgcattg ccaaggagat cgagtttctg tcgtggacgg agccctttgg gatcgcactc 2221 accctctttg ccgtgctggg cattttcctg acagcctttg tgctgggtgt gtttatcaag 2281 ttccgcaaca cacccattgt caaggccacc aaccgagagc tctcctacct cctcctcttc 2341 tccctgctct gctgcttctc cagctccctg ttcttcatcg gggagcccca ggactggacg 2401 tgccgcctgc gccagccggc ctttggcatc agcttcgtgc tctgcatctc atgcatcctg 2461 gtgaaaacca accgtgtcct cctggtgttt gaggccaaga tccccaccag cttccaccgc 2521 aagtggtggg ggctcaacct gcagttcctg ctggttttcc tctgcacctt catgcagatt 2581 gtcatctgtg tgatctggct ctacaccgcg cccccctcaa gctaccgcaa ccaggagctg 2641 gaggatgaga tcatcttcat cacgtgccac gagggctccc tcatggccct gggcttcctg 2701 atcggctaca cctgcctgct ggctgccatc tgcttcttct ttgccttcaa gtcccggaag 2761 ctgccggaga acttcaatga agccaagttc atcaccttca gcatgctcat cttcttcatc 2821 gtctggatct ccttcattcc agcctatgcc agcacctatg gcaagtttgt ctctgccgta 2881 gaggtgattg ccatcctggc agccagcttt ggcttgctgg cgtgcatctt cttcaacaag 2941 atctacatca ttctcttcaa gccatcccgc aacaccatcg aggaggtgcg ttgcagcacc 3001 gcagctcacg ctttcaaggt ggctgcccgg gccacgctgc gccgcagcaa cgtctcccgc 3061 aagcggtcca gcagccttgg aggctccacg ggatccaccc cctcctcctc catcagcagc 3121 aagagcaaca gcgaagaccc attcccacag cccgagaggc agaagcagca gcagccgctg 3181 gccctaaccc agcaagagca gcagcagcag cccctgaccc tcccacagca gcaacgatct 3241 cagcagcagc ccagatgcaa gcagaaggtc atctttggca gcggcacggt caccttctca 3301 ctgagctttg atgagcctca gaagaacgcc atggcccacg ggaattctac gcaccagaac 3361 tccctggagg cccagaaaag cagcgatacg ctgacccgac accagccatt actcccgctg 3421 cagtgcgggg aaacggactt agatctgacc gtccaggaaa caggtctgca aggacctgtg 3481 ggtggagacc agcggccaga ggtggaggac cctgaagagt tgtccccagc acttgtagtg 3541 tccagttcac agagctttgt catcagtggt ggaggcagca ctgttacaga aaacgtagtg 3601 aattcataaa atggaaggag aagactgggc tagggagaat gcagagaggt ttcttggggt 3661 cccagggatg aggaatcgcc ccagactcct ttcctctgag gaagaaggga taatagacac 3721 atcaaatgcc ccgaatttag tcacaccatc ttaaatgaca gtgaattgac ccatgttccc 3781 ttt // LOCUS HSU20972 1045 bp mRNA PRI 12-AUG-1995 DEFINITION Human 14-3-3 protein epsilon isoform mRNA, complete cds. ACCESSION U20972 NID g902786 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1045) AUTHORS Conklin,D.S., Galaktionov,K. and Beach,D. TITLE 14-3-3 proteins associate with cdc25 phosphatases JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (17), 7892-7896 (1995) MEDLINE 95372385 REFERENCE 2 (bases 1 to 1045) AUTHORS Conklin,D.S. TITLE Direct Submission JOURNAL Submitted (13-FEB-1995) Douglas S. Conklin, Cold Spring Harbor Laboratory, 1 Bungtown Rd, Cold Spring Harbor, NY 11724, USA FEATURES Location/Qualifiers source 1..1045 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="epithelial cell" CDS 85..852 /codon_start=1 /product="14-3-3 protein epsilon isoform" /db_xref="PID:g902787" /translation="MDDREDLVYQAKLAEQAERYDEMVESMKKVAGMDVELTVEERNL LSVAYKNVIGARRASWRIISSIEQKEENKGGEDKLKMIREYRQMVETELKLICCDILD VLDKHLIPAANTGESKVFYYKMKGDYHRYLAEFATGNDRKEAAENSLVAYKAASDIAM TELPPTHPIRLGLALNFSVFYYEILNSPDRACRLAKAAFDDAIAELDTLSEESYKDST LIMQLLRDNLTLWTSDMQGDGEEQNKEALQDVEDENQ" BASE COUNT 317 a 220 c 260 g 248 t ORIGIN 1 ggcacgaggg agcgagaggc tgagagagtc ggagcactat ccgcttccat ccgtcgcgca 61 gaccctgccg gagccgctgc cgctatggat gatcgagagg atctggtgta ccaggcgaag 121 ttggccgagc aggctgagcg atacgacgaa atggtggagt caatgaagaa agtagcaggg 181 atggatgtgg agctgacagt tgaagaaaga aacctcctat ctgttgcata taagaatgta 241 attggagcta gaagagcctc ctggagaata atcagcagca ttgaacagaa agaagaaaac 301 aagggaggag aagacaagct aaaaatgatt cgggaatatc ggcaaatggt tgagactgag 361 ctaaagttaa tctgttgtga catcctggat gtactggaca aacacctcat tccagcagct 421 aacactggcg agtccaaggt tttctattat aaaatgaaag gggactacca caggtatctg 481 gcagaatttg ccacaggaaa cgacaggaag gaggctgcgg agaacagcct agtggcttat 541 aaagctgcta gtgatattgc aatgacagaa cttccaccaa cgcatcctat tcgcttaggt 601 cttgctctca atttttccgt attctactac gaaattctta attcccctga ccgtgcctgc 661 aggttggcta aagcagcttt tgatgatgca attgcagaac tggatacgct gagtgaagaa 721 agctataagg actctacact tatcatgcag ttgttacgtg ataatctgac actatggact 781 tcagacatgc agggtgacgg tgaagagcag aataaagaag cgctgcagga cgtggaagac 841 gaaaatcagt gagacataag ccaacaagag aaaccatctc taaccacccc ctcctcccca 901 tcccaccctt tggaaactcc ccattgtcac tgagaaccac caaatctgac ttttacattt 961 ggtctcagaa tttaggttcc tgccctgttg gttttttttt ttttttttta aaagttttca 1021 aaagttctta aaggcaagag tgaat // LOCUS HSU20979 3111 bp mRNA PRI 29-SEP-1995 DEFINITION Human chromatin assembly factor-I p150 subunit mRNA, complete cds. ACCESSION U20979 NID g882257 KEYWORDS CAF-I. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3111) AUTHORS Kaufman,P.D., Kobayashi,R., Kessler,N. and Stillman,B. TITLE The p150 and p60 subunits of chromatin assembly factor I: a molecular link between newly synthesized histones and DNA replication JOURNAL Cell 81 (7), 1105-1114 (1995) MEDLINE 95323966 REFERENCE 2 (bases 1 to 3111) AUTHORS Stillman,B. TITLE Direct Submission JOURNAL Submitted (13-FEB-1995) Bruce Stillman, Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, NY 11724, USA FEATURES Location/Qualifiers source 1..3111 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 41..2857 /codon_start=1 /function="DNA replication-dependent assembly of nucleosomes, binds CAF-I p60 subunit" /evidence=experimental /product="chromatin assembly factor-I p150 subunit" /db_xref="PID:g882258" /translation="MDCKDRPAFPVKKLIQARLPFKRLNLVPKGKADDMSDDQGTSVQ SKSPDLEASLDTLENNCHVGSDIDFRPKLVNGKGPLDNFLRNRIETSIGQSTVIIDLT EDSNEQPDSLVDHNKLNSEASPSREAINGQREDTGDQQGLLKAIQNDKLAFPGETLSD IPCKTEEEGVGCGGAGRRGDSQECSPRSCPELTSGPRMCPRKEQDSWSEAGGILFKGK VPMVVLQDILAVRPPQIKSLPATPQGKNMTPESEVLESFPEEDSVLSHSSLSSPSSTS SPEGPPAPPKQHSSTSPFPTSTPLRRITKKFVKGSTEKNKLRLQRDQERLGKQLKLRA EREEKEKLKEEAKRAKEEAKKKKEEEKELKEKERREKREKDEKEKAEKQRLKEERRKE RQEALEAKLEEKRKKEEEKRLREEEKRIKAEKAEITRFFQKPKTPQAPKTLAGSCGKF APFEIKEHMVLAPRRRTAFHPDLCSQLDQLLQQQSGEFSFLKDLKGRQPLRSGPTHVS TRNADIFNSDVVIVERGKGDGVPERRKFGRMKLLQFCENHRPAYWGTWNKKTALIRAR DPWAQDTKLLDYEVDSDEEWEEEEPGESLSHSEGDDDDDMGEDEDEDDGFFVPHGYLS EDEGVTEECADPENHKVRQKLKAKEWDEFLAKGKRFRVLQPVKIGCVWAADRDCAGDD LKVLQQFAACFLETLPAQEEQTPKASKRERRDEQILAQLLPLLHGNVNGSKVIIREFQ EHCRRGLLSNHTGSPRTPSTTYLHTPTPSEDAAIPSKSRLKRLISENSVYEKRPDFRM CWYVHPQVLQSFQQEHLPVPCQWSYVTSVPSAPKEDSGSVPSTGPSQGTPISLKRKSA GSMCITQFMKKRRHDGQIGAEDMDGFQADTEEEEEEEGDCMIVDVPDAVEVQAPCGAA SGAGGGVGVDTGKATLTASPLGAS" polyA_site 3111 /note="41 A nucleotides" BASE COUNT 805 a 809 c 962 g 535 t ORIGIN 1 ggagtgcggg gcgcccggcg ccaggggagc cgccacagcc atggattgca aagatagacc 61 agcttttcca gttaagaagt taatacaagc ccgtctgccg tttaagcgcc tgaatcttgt 121 cccaaagggg aaagccgatg acatgtcaga cgatcagggt acttctgtgc aaagtaaaag 181 ccccgattta gaggcctctt tggacacctt ggaaaacaac tgtcatgtgg gttctgacat 241 agactttaga ccgaaacttg tcaacgggaa gggtccctta gataactttt taagaaatag 301 aatcgaaacc agtattggcc agagcacagt catcattgat ttgacagagg actcgaatga 361 gcagccagac agtcttgtgg accacaataa actaaattct gaagcctctc cctccaggga 421 ggcaataaat ggccagcgag aagacactgg ggatcagcag gggttgttga aggccattca 481 gaacgacaag ttggcatttc ctggagagac cctttcagac attccttgca aaacagagga 541 ggagggtgtt ggctgtggag gtgcagggag gagaggcgac tcccaggaat gttcgccacg 601 gagctgcccg gagctgacga gtggcccgag aatgtgcccc agaaaggagc aggacagttg 661 gagtgaagct gggggcatcc tgttcaaagg gaaggtgcct atggtggtct tgcaggacat 721 cttggctgtg agaccaccgc aaatcaagtc ccttccagcc acaccccaag gcaagaacat 781 gacccctgag agtgaggtgc tggaatcttt ccccgaagaa gactctgtac tcagccattc 841 gtccctgagc tctccctctt ccaccagctc gcccgagggg ccgcctgctc ccccaaagca 901 gcacagcagt accagtccct tccccacctc cacgcccctc cgcagaataa ctaagaaatt 961 cgtcaaaggc tctacagaga agaacaagct cagactgcaa agagatcagg agcgtctggg 1021 caagcagctc aagttacgtg cagaaaggga agaaaaggag aagctgaaag aggaggccaa 1081 gcgggccaag gaggaggcca agaagaagaa ggaggaagag aaggagctta aggaaaagga 1141 gaggcgggag aagcgggaga aggatgagaa ggagaaggcg gagaagcagc ggctcaagga 1201 ggagcggcgc aaggagagac aggaagccct ggaggctaaa cttgaggaaa aaaggaaaaa 1261 ggaagaagag aaacggttaa gagaagaaga gaagcgcatt aaagcagaga aggccgaaat 1321 cacgaggttc ttccagaaac caaagactcc acaggccccc aagaccctgg ccggctcctg 1381 tgggaagttt gccccctttg aaattaaaga gcacatggtc ctggcccctc ggcgtcggac 1441 cgctttccat ccagacctct gcagtcagct ggaccagctc ctccagcagc agagcggcga 1501 gttctccttc ttgaaagacc tcaaaggccg gcagcccctg aggtccggac ccacgcacgt 1561 ttccacccgg aatgcagata tttttaacag tgatgtcgtc atcgtggagc gtgggaaggg 1621 cgacggtgtt cccgagagga ggaagtttgg caggatgaag ctcctgcagt tctgtgagaa 1681 ccaccggcct gcctactggg gtacctggaa taagaagacg gcactcatcc gcgcgcgaga 1741 cccctgggcc caggacacga agctcctgga ctatgaggtg gacagtgatg aggagtggga 1801 agaagaggag cctggggagt ccctgtccca cagtgagggg gatgatgatg acgacatggg 1861 agaggatgaa gatgaggacg atggtttctt tgtgccccat gggtacctgt ctgaggacga 1921 aggtgtgaca gaggagtgtg ccgaccctga gaaccataag gtccgccaga aactgaaggc 1981 caaggagtgg gacgagttcc tggctaaggg gaagcgcttt cgcgtcctgc aacctgtgaa 2041 gatcggctgc gtgtgggcgg ctgacagaga ctgcgcaggc gatgacctga aggtactgca 2101 gcagttcgca gcctgcttcc tggagaccct gccggcccag gaggagcaga cgcccaaggc 2161 ctccaagcgg gagaggagag acgagcagat cctggcccag ctgctgccgc tcctgcacgg 2221 caatgtgaac gggagcaagg tcatcatccg ggagttccag gagcactgcc gccggggact 2281 gctcagcaac cacaccggca gcccgcggac gccctccacc acctacctgc acacccccac 2341 ccccagcgag gatgccgcca tcccctctaa gtcccggctc aagcggctca tttccgagaa 2401 ctcagtgtat gagaagcggc ctgacttcag gatgtgctgg tacgtgcacc cgcaggtgct 2461 acagagcttc cagcaggagc acctgcccgt gccgtgccag tggagctatg tgacatcggt 2521 gccctcggcc cccaaagagg acagtggcag cgtcccctcc acggggccca gccagggcac 2581 tcccatctcg ctgaagagga agtcagcggg cagcatgtgc atcacccaat tcatgaagaa 2641 gcgcaggcac gacggccaga ttggtgctga agacatggac ggcttccagg cagacacgga 2701 ggaggaggaa gaggaggagg gcgactgtat gatcgtggat gtcccggatg ctgtggaggt 2761 ccaagccccg tgtggagccg cttccggagc tgggggtggt gtgggggtgg acaccggcaa 2821 ggccaccctg accgcgagcc cactgggtgc atcctgagag caggggtgac gtatgtagaa 2881 cgcttagggt gtcctcccca cagagcagat acttgaaccg actcaattcc tgtgtaaaga 2941 gcactttgtc ctgcttcacg gacctcccca aagtgtgcag agttctatat aggatgctgg 3001 attagttcct ttgatatttg taaaaattcc cccaagagcc gcatatgaat ctgcccttta 3061 ataaagcatt attgagattg ctggcctatt ggggaagctg cgggcacagg a // LOCUS HSU20980 2197 bp mRNA PRI 29-SEP-1995 DEFINITION Human chromatin assembly factor-I p60 subunit mRNA, complete cds. ACCESSION U20980 NID g882259 KEYWORDS CAF-I. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2197) AUTHORS Kaufman,P.D., Kobayashi,R., Kessler,N. and Stillman,B. TITLE The p150 and p60 subunits of chromatin assembly factor I: a molecular link between newly synthesized histones and DNA replication JOURNAL Cell 81 (7), 1105-1114 (1995) MEDLINE 95323966 REFERENCE 2 (bases 1 to 2197) AUTHORS Stillman,B. TITLE Direct Submission JOURNAL Submitted (13-FEB-1995) Bruce Stillman, Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, NY 11724, USA FEATURES Location/Qualifiers source 1..2197 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 63..1742 /codon_start=1 /function="DNA replication-dependent assembly of nucleosomes, binds CAF-I p150 subunit" /evidence=experimental /product="chromatin assembly factor-I p60 subunit" /db_xref="PID:g882260" /translation="MKVITCEIAWHNKEPVYSLDFQHGTAGRIHRLASAGVDTNVRIW KVEKGPDGKAIVEFLSNLARHTKAVNVVRFSPTGEILASGGDDAVILLWKVNDNKEPE QIAFQDEDEAQLNKENWTVVKTLRGHLEDVYDICWATDGNLMASASVDNTAIIWDVSK GQKISIFNEHKSYVQGVTWDPLGQYVATLSCDRVLRVYSIQKKRVAFNVSKMLSGIGA EGEARSYRMFHDDSMKSFFRRLSFTPDGSLLLTPAGCVESGENVMNTTYVFSRKNLKR PIAHLPCPGKATLAVRCCPVYFELRPVVETGVELMSLPYRLVFAVASEDSVLLYDTQQ SFPFGYVSNIHYHTLSDISWSSDGAFLAISSTDGYCSFVTFEKDELGIPLKEKPVLNM RTPDTAKKTKSQTHRGSSPGPRPVEGTPASRTQDPSSPGTTPPQARQAPAPTVIRDPP SITPAVKSPLPGPSEEKTLQPSSQNTKAHPSRRVTLNTLQAWSKTTPRRINLTPLKTD TPPSSVPTSVISTPSTEEIQSETPGDAQGSPPELKRPRLDENKGGTESLDP" BASE COUNT 586 a 517 c 559 g 535 t ORIGIN 1 tcttgtcttg aagaagtaga acggtgcccg agaaacgttt ttccccttcg agactcagga 61 ggatgaaagt catcacttgt gaaatagcct ggcacaacaa ggagcccgtg tacagcctgg 121 acttccagca tgggacggct gggaggatcc acagactggc gtctgccggc gtggacacca 181 atgtcaggat ctggaaggta gaaaagggac cagatggaaa agccatcgtg gaatttttgt 241 ccaatcttgc tcgtcatacc aaagccgtca atgttgtgcg tttttctcca actggggaaa 301 ttttagcatc gggaggagat gatgctgtca tcctattgtg gaaggtgaat gataacaagg 361 agccggagca gatcgctttt caggatgagg acgaggccca gctgaacaag gagaactgga 421 cggttgtgaa gactctgcgg ggccacttag aagatgtgta tgatatttgc tgggcaactg 481 atgggaattt aatggcttct gcctctgtgg ataacacagc catcatatgg gatgtcagca 541 aaggacaaaa gatatcaatt tttaatgaac ataaaagtta tgtccaagga gtaacctggg 601 accctttggg tcaatatgtt gctactctga gctgtgacag ggtgctgcga gtatacagta 661 tacagaagaa gcgtgtggct ttcaatgttt cgaagatgct gtctggaata ggggctgaag 721 gagaggcaag aagctaccgg atgtttcacg acgacagcat gaagtctttc ttccgtagac 781 tgagtttcac tcccgacgga tctttgcttc tcacgccagc tggatgtgtg gaatctggtg 841 aaaatgtaat gaataccact tatgttttct ccaggaagaa tcttaaaagg cccatcgctc 901 atcttccatg tcctggaaaa gccactcttg ctgttcgctg ctgtccggtc tactttgaac 961 tgaggccagt ggtggaaaca ggtgtggagc tgatgagtct gccctaccgc ctggtgtttg 1021 ctgtggcctc ggaggattcc gtgcttctgt atgacaccca gcagtccttc ccttttggtt 1081 acgtgtctaa tatacattac cacaccctca gtgacatttc atggtccagc gatggtgcct 1141 tcctggccat ttcttccacg gacggttact gctcatttgt gacatttgag aaagatgaac 1201 ttggaattcc tttgaaagag aagccagttt tgaacatgag aactcctgat acagcaaaga 1261 aaaccaagag tcagacacat cgagggtctt cgccaggacc cagaccggta gagggaaccc 1321 ctgccagcag aacccaagac cccagcagcc ccggcacgac tccccctcag gccagacagg 1381 ccccagcccc aacagtcatc agggaccctc cctccatcac tcctgctgtc aaaagcccct 1441 tgccggggcc ttcggaggag aagaccctgc agcccagtag tcaaaacaca aaagcccacc 1501 catcccggag ggtcactctg aacacactgc aagcctggag caagacaaca ccccggagaa 1561 taaacttaac acccttaaag acggacactc caccaagttc tgtaccaacc agtgtgattt 1621 ccaccccttc tacagaagaa attcagtcag agacgcctgg agacgctcag ggcagtcccc 1681 cagagctaaa gcggcccaga ctcgatgaaa acaaaggagg cacggaaagt ctggaccctt 1741 gatgggacct cggcttctgc tcgaagccta ccaggctccc ggtgtgtgca gggagacggt 1801 aaagctggag gtgcctgaga ccagggcttc catggagcgg gacacactgt aaatggattt 1861 ctataacaga agtgacatgt gtactgattt ttctccagaa atatggatgc tgttgtattc 1921 agtatccatt tttaacttgg gacatgaacg ttttaacata gtaaatcctc tttttgatga 1981 gtttctgaaa ctggagcggt tcaacgttat ccagtgtgaa aatcagtgag tcctccctgg 2041 catcctcgtg aaagtgcaca cacttcatgg agggactcct tttcaataag aattaggaag 2101 atgagaaagt aatttgagat tttactctgt cgaattttag agtatttgaa gtgattgtta 2161 gatttcactt ctaaggagtt gattgattaa actttgg // LOCUS HSU20998 1466 bp mRNA PRI 22-JUL-1995 DEFINITION Homo sapiens signal recognition particle subunit 9 (SRP9) mRNA, complete cds. ACCESSION U20998 NID g897850 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1466) AUTHORS Hsu,K., Chang,D.Y. and Maraia,R.J. TITLE Human signal recognition particle (SRP) Alu-associated protein also binds Alu interspersed repeat sequence RNAs. Characterization of human SRP9 JOURNAL J. Biol. Chem. 270 (17), 10179-10186 (1995) MEDLINE 95247726 REFERENCE 2 (bases 1 to 1466) AUTHORS Hsu,K. TITLE Direct Submission JOURNAL Submitted (13-FEB-1995) Karl Hsu, UMCB/LMGR, NICHD, National Institutes of Health, 9000 Rockville Pike, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..1466 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="phSRP9" /cell_line="HeLa" gene 107..1456 /gene="SRP9" CDS 107..367 /gene="SRP9" /codon_start=1 /function="binds SRP RNA as heterodimer with SRP14 to comprise the translation arrest domain of SRP" /product="signal recognition particle subunit 9" /db_xref="PID:g897851" /translation="MPQYQTWEEFSRAAEKLYLADPMKARVVLKYRHSDGNLCVKVTD DLVCLVYKTDQAQDVKKIEKFHSQLMRLMVAKEARNVTMETE" polyA_signal 1451..1456 /gene="SRP9" polyA_site 1466 /gene="SRP9" /note="17 A nucleotides" BASE COUNT 442 a 227 c 299 g 498 t ORIGIN 1 ggggctgctg ggactcgtcg tcggttggcg actcccggac gttaggtagt ttgttgggcc 61 gggttctgag gccttgcttc tctttacttt tccactctag gccacgatgc cgcagtacca 121 gacctgggag gagttcagcc gcgctgccga gaagctttac ctcgctgacc ctatgaaggc 181 acgtgtggtt ctcaaatata ggcattctga tgggaacttg tgtgttaaag taacagatga 241 tttagtttgt ttggtgtata aaacagacca agctcaagat gtaaagaaga ttgagaaatt 301 ccacagtcaa ctaatgcgac ttatggtagc caaggaagcc cgcaatgtta ccatggaaac 361 tgagtgaatg gtttgaaatg aagactttgt cgtgtactta ggaagtaaat atcttttgaa 421 ttagagaaag gttgggacag aaagtacttt atgtaactaa gtgggctgtt cagaagctta 481 gaggtcattt tttgtaattt tctttttaat tactttagag agctagggat gcaaatgttt 541 tcagttagaa agcctttatt tacttttgga aattgaacaa gaaatgcatc tgtcttagaa 601 actggagatt atttgatgtt aggtaaaaca tgtaattgtt tctctggcaa atttgtatca 661 gtaatttgaa aatgagatat taggaaaaac caattcttct taaatttagt tcatctttct 721 ttaaaagaac attaaatgta accattttgt cagatccatg tattttggag cataaaatgt 781 atgctgttgt gaccaataaa tataaaatat ggtaattgga attaactcca caccatagta 841 tgcattgtta tacatactgt gtacctaatt atgtatagca gtgtagtctc aattatatct 901 gaaagtaatt gtgactaaca agtatgcttt gccttatttc cacatttaaa ctacctgtta 961 atataaggga tttgtagtat cagcttgttg agcaatgact ttgaatctag ttttcagtga 1021 tcagaagcag cagttatttg agtgtatgaa tggaatgatg atcactgtgc tataatgtac 1081 tgaaaccacc atattacaga aatatttact acatattttc catctgtagt ttctcagaag 1141 ggctatggat tagtttgaac tgtcaaatcc ttgcatactt ctgtgacacc cctgcccatt 1201 ttctgtcttt aattaaccaa ggtgttaggt gtgactgtca caactgttat gttttccagt 1261 aaactagaag cacgatattt gataattata tttgtatttc accacctaaa tgtaatgttg 1321 attcctcaag aatgaaatga aggcactaca ttgaaatatg ttttgtataa atttgtcatg 1381 ttgaacagca ttttagcatg gtaagttccc ttagctatat gaattttggc atgtttcaga 1441 gagatcagta aataaaatat tagata // LOCUS HSU21049 927 bp RNA PRI 23-MAR-1996 DEFINITION Human DD96 mRNA, complete cds. ACCESSION U21049 NID g722243 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 927) AUTHORS Kocher,O., Cheresh,P., Brown,L.F. and Lee,S.W. TITLE Identification of a gene selectively upregulated in human carcinomas using the differential display technique JOURNAL Clin. Cancer Res. 1, 1209-1215 (1995) REFERENCE 2 (bases 1 to 927) AUTHORS Kocher,O. TITLE Direct Submission JOURNAL Submitted (14-FEB-1995) Oliver Kocher, Pathology, Beth Israel Hospital, 330 Brookline Avenue, Boston, MA 02215, USA FEATURES Location/Qualifiers source 1..927 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" CDS 202..546 /codon_start=1 /product="DD96" /db_xref="PID:g722244" /translation="MSALSLLILGLLTAVPPASCQQGLGNLQPWMQGLIAVAVFLVLV AIAFAVNHFWCQEEPEPAHMILTVGNKADGVLVGTDGRYSSMAASFRSSEHENAYENV PEEEGKVRSTPM" BASE COUNT 196 a 276 c 248 g 207 t ORIGIN 1 ggaagtttag gttaactgtc ttaaatttcc aaagctgtaa tcattatttt cattctcaaa 61 gtgatggcct tgtgttttgc tcctctcctc cagggccaga ctgagcccag gttgatttca 121 ggcggacacc aatagactcc acagcagctc caggagccca gacaccggcg gccagaagca 181 aggctaggag ctgctgcagc catgtcggcc ctcagcctcc tcattctggg cctgctcacg 241 gcagtgccac ctgccagctg tcagcaaggc ctggggaacc ttcagccctg gatgcagggc 301 cttatcgcgg tggccgtgtt cctggtcctc gttgcaatcg cctttgcagt caaccacttc 361 tggtgccagg aggagccgga gcctgcacac atgatcctga ccgtcggaaa caaggcagat 421 ggagtcctgg tgggaacaga tggaaggtac tcttcgatgg cggccagttt caggtccagt 481 gagcatgaga atgcctatga gaatgtgccc gaggaggaag gcaaggtccg cagcaccccg 541 atgtaacctt ctctgtggct ccaaccccaa gactcccagg cacatgggat ggatgtccag 601 tgctaccacc caagccccct ccttctttgt gtggaatctg caatagtggg ctgactccct 661 ccagccccat gccggcccta cccgcccttg aagtatagcc agccaaggtt ggagctcaga 721 ccgtgtctag gttggggctc ggctgtggcc ctggggtctc ctgctcagct cagaagagcc 781 ttctggagag gacagtcagc tgagcacctc ccatcctgct cacacgtcct tccccataac 841 tatggaaatg gccctaattt ctgtgaaata aagacttttt gtatttctgg ggctgaggct 901 cagcaacagc ccctcaggct tccaaaa // LOCUS HSU21051 2932 bp DNA PRI 26-APR-1996 DEFINITION Human G protein-coupled receptor (GPR4) gene, complete cds. ACCESSION U21051 NID g687793 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2932) AUTHORS Mahadevan,M.S., Baird,S., Bailly,J.E., Shutler,G.G., Sabourin,L.A., Tsilfidis,C., Neville,C.E., Narang,M. and Korneluk,R.G. TITLE Isolation of a novel G protein-coupled receptor (GPR4) localized to chromosome 19q13.3 JOURNAL Genomics 30 (1), 84-88 (1995) MEDLINE 96129306 REFERENCE 2 (bases 1 to 2932) AUTHORS Baird,S. TITLE Direct Submission JOURNAL Submitted (14-FEB-1995) Stephen Baird, Molecular Genetics, Children's Hospital of Eastern Ontario, 401 Smyth Rd., Ottawa, Ontario, K1H 8L1, Canada FEATURES Location/Qualifiers source 1..2932 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="19" /map="19q13.3" mRNA 237..2932 mRNA 237..2052 gene 830..1918 /gene="GPR4" CDS 830..1918 /gene="GPR4" /codon_start=1 /product="G protein-coupled receptor" /db_xref="PID:g687794" /translation="MGNHTWEGCHVDSRVDHLFPPSLYIFVIGVGLPTNCLALWAAYR QVQQRNELGVYLMNLSIADLLYICTLPLWVDYFLHHDNWIHGPGSCKLFGFIFYTNIY ISIAFLCCISVDRYLAVAHPLRFARLRRVKTAVAVSSVVWATELGANSAPLFHDELFR DRYNHTFCFEKFPMEGWVAWMNLYRVFVGFLFPWALMLLSYRGILRAVRGSVSTERQE KAKIKRLALSLIAIVLVCFAPYHVLLLSRSAIYLGRPWDCGFEERVFSAYHSSLAFTS LNCVADPILYCLVNEGARSDVAKALHNLLRFLASDKPQEMANASLTLETPLTSKRNST AKAMTGSWAATPPSQGDQVQLKMLPPAQ" BASE COUNT 619 a 933 c 690 g 689 t 1 others ORIGIN 1 ctgcagtcag gcggtgaact gacttcatcc caatccctca gcccccacca ggaccagtct 61 ggagtccctc ccctgccccc antgaaattt cccttccgtc cccaaactta cctctgatct 121 agaccttact cacctccttc ctgtttccta agactccttc ctgccgtcca cagaccgagc 181 cttttatctt tgtccaccct gtgccagaca cctccttttc cagaaccttc tccttactgg 241 tgaccttact tatctctgtt gctttctggg gtcctaggaa atgccagcac tcccacccac 301 attgcctgaa ctttccaaca ctccctagct gcgctgtgtc ctatctcaac acttcctcat 361 gtatttcttg tgtcttctag aacattcccc cgccattatt acttcaatat ggctacacat 421 acttcctaat tgccctgcaa accatctcct tctcaccatt gcccagcgat gctttcgtct 481 cctccataaa cactcccgga gaccaatttt tgtgtcaccc ccatactccc tcgttgacac 541 actgactcca tacataacct ccttgaaaaa cctctttatt aatctcacca tcctccagac 601 ttccctcctg tcataattcc atccctcctc caacttttcc ctctcaagct ctgcccttcc 661 cagcccagcc cagcctaccc aacctcatct cttccctgta gaccacatcc caccatgttc 721 ccctgagcct ccaaggaagg ggctcagggg gccccatggc ctcccgctcc ctgtggcccc 781 acagcccccg tgggccaggg gaagcgcccc agaagccgaa gtgcccacca tgggcaacca 841 cacgtgggag ggctgccacg tggactcgcg cgtggaccac ctctttccgc catccctcta 901 catctttgtc atcggcgtgg ggctgcccac caactgcctg gctctgtggg cggcctaccg 961 ccaggtgcaa cagcgcaacg agctgggcgt ctacctgatg aacctcagca tcgccgacct 1021 gctgtacatc tgcacgctgc cgctgtgggt ggactacttc ctgcaccacg acaactggat 1081 ccacggcccc gggtcctgca agctctttgg gttcatcttc tacaccaata tctacatcag 1141 catcgccttc ctgtgctgca tctcggtgga ccgctacctg gctgtggccc acccactccg 1201 cttcgcccgc ctgcgccgcg tcaagaccgc cgtggccgtg agctccgtgg tctgggccac 1261 ggagctgggc gccaactcgg cgcccctgtt ccatgacgag ctcttccgag accgctacaa 1321 ccacaccttc tgctttgaga agttccccat ggaaggctgg gtggcctgga tgaacctcta 1381 tcgggtgttc gtgggcttcc tcttcccgtg ggcgctcatg ctgctgtcgt accggggcat 1441 cctgcgggcc gtgcggggca gcgtgtccac cgagcgccag gagaaggcca agatcaagcg 1501 gctggccctc agcctcatcg ccatcgtgct ggtctgcttt gcgccctatc acgtgctctt 1561 gctgtcccgc agcgccatct acctgggccg cccctgggac tgcggcttcg aggagcgcgt 1621 cttttctgca taccacagct cactggcttt caccagcctc aactgtgtgg cggaccccat 1681 cctctactgc ctggtcaacg agggcgcccg cagcgatgtg gccaaggccc tgcacaacct 1741 gctccgcttt ctggccagcg acaagcccca ggagatggcc aatgcctcgc tcaccctgga 1801 gaccccactc acctccaaga ggaacagcac agccaaagcc atgactggca gctgggcggc 1861 cactccgccc tcccaggggg accaggtgca gctgaagatg ctgccgccag cacaatgaac 1921 cccgagtggc acagaatccc cagttttccc ctctcatccc acagtccctt ctctcctggt 1981 ctggtgtatg caaattgtat ggaaaaaggg ctgtgttaat attcataaga atacaagaac 2041 ttaggaagag tgaggttggt gtgtcactgg tcaacctttg tgctcccaga tcccatcaca 2101 gtttggcgat tgtggagggc ctcctgaagg aggagatgag taaatatatt tttttggaga 2161 cagggtctca ctgtgttgcc caggctggag tgcagtagtg cagtcgtggc tcactgcagc 2221 ctccacctcc tgggctctcc agcgatcttc ccacatcagc ctcccgagta gctgggacca 2281 caaatgtgag cccacccatg cctggctaat ttttgtactt tttgtataaa tggagtctca 2341 ctatgtttcc ccaggctgat cttgaactcc tgggctcaag agatcctcct gccttggcct 2401 cccaaagtgc tcagattaga gatgtgagcc gccatgtctg gccagataaa ttaagtcaaa 2461 catttggttt ccagaaaata aagacaaata gagaaggtta gatttttttt tttccaacaa 2521 gtggataaaa gtctgtgact cgggggaaag tggaaggaga aatgcagccg atatagagtc 2581 attatgtttg caaagcccct ggtcatacag gccagggaac ataagaccgc aattctaagt 2641 ttctagataa acagcgatct ccaagtcaag actgaggatg aagagggaga atgtcagaac 2701 tcaagtgaag ggcaatcagg gcagactgcc tggaggagtg atgccagaag gtttgggaag 2761 aaggtgtggg acaagaagaa agggtattta ttcattcatt caacagaggt ttatgtaggg 2821 cactgtgctg ggtggggctg gggacacaac aatgactgag gcagcctggc cttgccttca 2881 cagggctcac catacacaag taaataaaaa atatgtaatg tttggaattg ct // LOCUS HSU21090 1584 bp mRNA PRI 05-OCT-1995 DEFINITION Human DNA polymerase delta small subunit mRNA, complete cds. ACCESSION U21090 NID g1008457 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1584) AUTHORS Zhang,J., Tan,C.K., McMullen,B., Downey,K.M. and So,A.G. TITLE Cloning of the cDNAs for the small subunits of bovine and human DNA polymerase delta and chromosomal location of the human gene (POLD2) JOURNAL Genomics 29 (1), 179-186 (1995) MEDLINE 96079106 REFERENCE 2 (bases 1 to 1584) AUTHORS So,A.G. TITLE Direct Submission JOURNAL Submitted (15-FEB-1995) Antero G. So, Dept. of Medicine (R-99), University of Miami School of Medicine, P.O. Box 016960, Miami, FL 33101, USA FEATURES Location/Qualifiers source 1..1584 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" /cell_line="HepG2" /cell_type="hepatoma" CDS 79..1488 /note="share similarity with EST sequences, GenBank Accession Numbers T03551 and T17451" /codon_start=1 /product="DNA polymerase delta small subunit" /db_xref="PID:g1008458" /translation="MFSEQAAQRAHTLLSPPSANNATFARVPVATYTNSSQPFRLGER SFSRQYAHIYATRLIQMRPFLENRAQQHWGSGVGVKKLCELQPEEKCCVVGTLFKAMP LQPSILREVSEEHNLLPQPPRSKYIHPDDELVLEDELQRIKLKGTIDVSKLVTGTVLA VFGSVRDDGKFLVEDYCFADLAPQKPAPPLDTDRFVLLVSGLGLGGGGGESLLGTQLL VDVVTGQLGDEGEQCSAAHVSRVILAGNLLSHSTQSRDSINKAKYLTKKTQAASVEAV KMLDEILLQLSASVPVDVMPGEFDPTNYTLPQQPLHPCMFPLATAYSTLQLVTNPYQA TIDGVRFLGTSGQNVSDIFRYSSMEDHLEILEWTLRVRHISPTAPDTLGCYPFYKTDP FIFPECPHVYFCGNTPSFGSKIIRGPEDQTVLLVTVPDFSATQTACLVNLRSLACQPI SFSGFGAEDDDLGGLGLGP" BASE COUNT 328 a 486 c 454 g 316 t ORIGIN 1 cgggatgcgg cgcgccgcgc gttgaacctc cttggcctgg gcgaagctgt gtggaccaag 61 caagtcagga gtgtggccat gttttctgag caggctgccc agagggccca cactctactg 121 tccccaccat cagccaacaa tgccaccttt gcccgggtgc cagtggcaac ctacaccaac 181 tcctcacaac ccttccggct aggagagcgc agctttagcc ggcagtatgc ccacatttat 241 gccacccgcc tcatccaaat gagacccttc ctggagaacc gggcccagca gcactggggc 301 agtggagtgg gagtgaagaa gctgtgtgaa ctgcagcctg aggagaagtg ctgtgtggtg 361 ggcactctgt tcaaggccat gccgctgcag ccctccatcc tgcgggaggt cagcgaggag 421 cacaacctgc tcccccagcc tcctcggagt aaatacatac acccagatga cgagctggtc 481 ttggaagatg aactgcagcg tatcaaacta aaaggcacca ttgacgtgtc aaagctggtt 541 acggggactg tcctggctgt gtttggctcc gtgagagacg acgggaagtt tctggtggag 601 gactattgct ttgctgacct tgctccccag aagcccgcac ccccacttga cacagatagg 661 tttgtgctac tggtgtccgg cctgggcctg ggtggcggtg gaggcgagag cctgctgggc 721 acccagctgc tggtggatgt ggtgacgggg cagcttgggg acgaagggga gcagtgcagc 781 gccgcccacg tctcccgggt tatcctcgct ggcaacctcc tcagccacag cacccagagc 841 agggattcta tcaataaggc caaatacctc accaagaaaa cccaggcagc cagcgtggag 901 gctgttaaga tgctggatga gatcctcctg cagctgagcg cctcagtgcc cgtggacgtg 961 atgccaggcg agtttgatcc caccaattac acgctccccc agcagcccct ccacccctgc 1021 atgttcccgc tggccactgc ctactccacg ctccagctgg tcaccaaccc ctaccaggcc 1081 accattgatg gagtcagatt tttggggaca tcaggacaga acgtgagtga cattttccga 1141 tacagcagca tggaggatca cttggagatc ctggagtgga ccctgcgggt ccgtcacatc 1201 agccccacag ccccggacac tctaggttgt taccccttct acaaaactga cccgttcatc 1261 ttcccagagt gcccgcatgt ctacttttgt ggcaacaccc ccagctttgg ctccaaaatc 1321 atccgaggtc ctgaggacca gacagtgctg ttggtgactg tccctgactt cagtgccacg 1381 cagaccgcct gccttgtgaa cctgcgcagc ctggcctgcc agcccatcag cttctcgggc 1441 ttcggggcag aggacgatga cctgggaggc ctggggctgg gcccctgact caaaaaagtg 1501 gttttgacca gagaggccca gatggaggct gttcattccc tgcagtgtcg gcattgtaaa 1561 taaagcctgg cacttgctga tgcg // LOCUS HSU21108 2234 bp mRNA PRI 05-JAN-1996 DEFINITION Human dual specific protein phosphatase mRNA, complete cds. ACCESSION U21108 NID g773354 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2234) AUTHORS Guan,K.L. and Butch,E. TITLE Isolation and characterization of a novel dual specific phosphatase, HVH2, which selectively dephosphorylates the mitogen-activated protein kinase JOURNAL J. Biol. Chem. 270 (13), 7197-7203 (1995) MEDLINE 95221370 REFERENCE 2 (bases 1 to 2234) AUTHORS Guan,K.-L. TITLE Direct Submission JOURNAL Submitted (15-FEB-1995) Kun-Liang Guan, Biological Chemistry, University of Michigan, 1301 East Catherine, Ann Arbor, MI 48109, USA FEATURES Location/Qualifiers source 1..2234 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 247..1431 /note="specific for tyrosine and serine" /codon_start=1 /product="dual specific protein phosphatase" /db_xref="PID:g773355" /translation="MVTMEELREMDCSVLKRLMNRDENGGGAGGSGSHGTLGLPSGGK CLLLDCRPFLAHSAGYILGSVNVRCNTIVRRRAKGSVSLEQILPAEEEVRARLRSGLY SAVIVYDERSPRAESLREDSTVSLVVQALRRNAERTDICLLKGGYERFSSEYPEFCSK TKALAAIPPPVPPSATEPLDLGCSSCGTPLHDQGGPVEILPFLYLGSAYHAARRDMLD ALGITALLNVSSDCPNHFEGHYQYKCIPVEDNHKADISSWFMEAIEYIDAVKDCRGRV LVHCQAGISRSATICLAYLMMKKRVRLEEAFEFVKQRRSIISPNFSFMGQLLQFESQV LATSCAAEAASPSGPLRERGKTPATPTSQFVFSFPVSVGVHSAPSSLPYLHSPITTSP SC" polyA_site 2234 /note="7 A nucleotides" BASE COUNT 474 a 644 c 657 g 459 t ORIGIN 1 ccccctccgc tctgctgcgc cgcccggctg ggccccgagg ccgctccgac tgctatgtga 61 ccgcgaggct gcgggaggaa ggggacaggg aagaagaggc tctcccgcgg gagcccttga 121 ggaccaagtt tgcggccact tctgcaggcg tcccttctta gctctcgcct gcccctttct 181 gcagcctagg cggcccaggt tctcttctct tcctcgcgcg cccagccgcc tcggttcccg 241 gcgaccatgg tgacgatgga ggagctgcgg gagatggact gcagtgtgct caaaaggctg 301 atgaaccggg acgagaatgg cggcggcgcg ggcggcagcg gcagccacgg caccctgggg 361 ctgccgagcg gcggcaagtg cctgctgctg gactgcagac cgttcctggc gcacagcgcg 421 ggctacatcc taggttcggt caacgtgcgc tgtaacacca tcgtgcggcg gcgggctaag 481 ggctccgtga gcctggagca gatcctgccc gccgaggagg aggtacgcgc ccgcttgcgc 541 tccggcctct actcggcggt catcgtctac gacgagcgca gcccgcgcgc cgagagcctc 601 cgcgaggaca gcaccgtgtc gctggtggtg caggcgctgc gccgcaacgc cgagcgcacc 661 gacatctgcc tgctcaaagg cggctatgag aggttttcct ccgagtaccc agaattctgt 721 tctaaaacca aggccctggc agccatccca cccccggttc cccccagcgc cacagagccc 781 ttggacctgg gctgcagctc ctgtgggacc ccactacacg accagggggg tcctgtggag 841 atccttccct tcctctacct cggcagtgcc taccatgctg cccggagaga catgctggac 901 gccctgggca tcacggctct gttgaatgtc tcctcggact gcccaaacca ctttgaagga 961 cactatcagt acaagtgcat cccagtggaa gataaccaca aggccgacat cagctcctgg 1021 ttcatggaag ccatagagta catcgatgcc gtgaaggact gccgtgggcg cgtgctggtg 1081 cactgccagg cgggcatctc gcggtcggcc accatctgcc tggcctacct gatgatgaag 1141 aaacgggtga ggctggagga ggccttcgag ttcgttaagc agcgccgcag catcatctcg 1201 cccaacttca gcttcatggg gcagctgctg cagttcgagt cccaggtgct ggccacgtcc 1261 tgtgctgcgg aggctgctag cccctcggga cccctgcggg agcggggcaa gacccccgcc 1321 acccccacct cgcagttcgt cttcagcttt ccggtctccg tgggcgtgca ctcggccccc 1381 agcagcctgc cctacctgca cagccccatc accacctctc ccagctgtta gagccgccct 1441 gggggcccca gaaccagagc tggctcccag caagggtagg acgggccgca tgcggcagaa 1501 agttgggact gagcagctgg gagcaggcga ccgagctcct tccccatcat ttctccttgg 1561 ccaacgacga ggccagccag aatggcaata aggactccga atacataata aaagcaaaca 1621 gaacactcca acttagagca ataaccggtg ccgcagcagc cagggaagac cttggtttgg 1681 tttatgtgtc agtttcactt ttccgataga aatttcttac ctcatttttt taagcagtaa 1741 ggcttgaagt gatgaaaccc acagatccta gcaaatgtgc ccaaccagct ttactaaagg 1801 gggaggaagg gagggcaaag ggatgagaag acaagtttcc cagaagtgcc tggttctggg 1861 tacttgtccc tttgttgtcg ttgttgtagt taaaggaatt tcatttttaa aagaaatctt 1921 cgaaggtgtg gttttcattt ctcagtcacc aacagatgaa taattatgct taataataaa 1981 gtatttatta agactttctt cagagtatga aagtacaaaa agtctagtta cagtggattt 2041 agaatatatt tatgttgatg tcaaacagct gagcaccgta gcatgcagat gtcaaggcag 2101 ttaggaagta aatggtgtct tgtagatatg tgcaaggtag catgatgagc aacttgagtt 2161 tgttgccact gagaagcagg cgggttgggt gggaggagga agaaagggaa gaattaggtt 2221 tgaattgctt ttta // LOCUS HSU21551 1155 bp mRNA PRI 20-SEP-1996 DEFINITION Human ECA39 mRNA, complete cds. ACCESSION U21551 NID g1036779 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1155) AUTHORS Schuldiner,O., Eden,A., Ben-Yosef,T., Yanuka,O., Simchen,G. and Benvenisty,N. TITLE ECA39, a conserved gene regulated by c-Myc in mice, is involved in G1/S cell cycle regulation in yeast JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (14), 7143-7148 (1996) MEDLINE 96293490 REFERENCE 2 (bases 1 to 1155) AUTHORS Ben-Yosef,T. TITLE Direct Submission JOURNAL Submitted (23-FEB-1995) Tamar Ben-Yosef, Genetics, The Hebrew University of Jerusalem, Givat Ram, Jerusalem, Israel FEATURES Location/Qualifiers source 1..1155 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="fetal brain" gene 1..1155 /gene="ECA39" CDS 1..1155 /gene="ECA39" /codon_start=1 /db_xref="PID:g1036780" /translation="MDCSNGSAECTGEGGSKEVVGTFKAKDLIVTPATILKEKPDPNN LVFGTVFTDHMLTVEWSSEFGWEKPHIKPLQNLSLHPGSSALHYAVELFEGLKAFRGV DNKIRLFQPNLNMDRMYRSAVRATLPVFDKEELLECIQQLVKLDQEWVPYSTSASLYI RPAFIGTEPSLGVKKPTKALLFVLLSPVGPYFSSGTFNPVSLWANPKYVRAWKGGTGD CKMGGNYGSSLFAQCEDVDNGCQQVLWLYGRDHQITEVGTMNLFLYWINEDGEEELAT PPLDGIILPGVTRRCILDLAHQWGEFKVSERYLTMDDLTTALEGNRVREMFSSGTACV VCPVSDILYKGETIHIPTMENGPKLASRILSKLTDIQYGREESDWTIVLS" BASE COUNT 326 a 232 c 298 g 299 t ORIGIN 1 atggattgca gtaacggatc ggcagagtgt accggagaag gaggatcaaa agaggtggtg 61 gggactttta aggctaaaga cctaatagtc acaccagcta ccattttaaa ggaaaaacca 121 gaccccaata atctggtttt tggaactgtg ttcacggatc atatgctgac ggtggagtgg 181 tcctcagagt ttggatggga gaaacctcat atcaagcctc ttcagaacct gtcattgcac 241 cctggctcat cagctttgca ctatgcagtg gaattatttg aaggattgaa ggcatttcga 301 ggagtagata ataaaattcg actgtttcag ccaaacctca acatggatag aatgtatcgc 361 tctgctgtga gggcaactct gccggtattt gacaaagaag agctcttaga gtgtattcaa 421 cagcttgtga aattggatca agaatgggtc ccatattcaa catctgctag tctgtatatt 481 cgtcctgcat tcattggaac tgagccttct cttggagtca agaagcctac caaagccctg 541 ctctttgtac tcttgagccc agtgggacct tatttttcaa gtggaacctt taatccagtg 601 tccctgtggg ccaatcccaa gtatgtaaga gcctggaaag gtggaactgg ggactgcaag 661 atgggaggga attacggctc atctcttttt gcccaatgtg aagacgtaga taatgggtgt 721 cagcaggtcc tgtggctcta tggcagagac catcagatca ctgaagtggg aactatgaat 781 ctttttcttt actggataaa tgaagatgga gaagaagaac tggcaactcc tccactagat 841 ggcatcattc ttccaggagt gacaaggcgg tgcattctgg acctggcaca tcagtggggt 901 gaatttaagg tgtcagagag atacctcacc atggatgact tgacaacagc cctggagggg 961 aacagagtga gagagatgtt tagctctggt acagcctgtg ttgtttgccc agtttctgat 1021 atactgtaca aaggcgagac aatacacatt ccaactatgg agaatggtcc taagctggca 1081 agccgcatct tgagcaaatt aactgatatc cagtatggaa gagaagagag cgactggaca 1141 attgtgctat cctga // LOCUS HSU21663 1849 bp mRNA PRI 14-JUN-1996 DEFINITION Human deleted in azoospermia protein (DAZ) mRNA, complete cds. ACCESSION U21663 NID g1045307 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1849) AUTHORS Reijo,R., Lee,T.-Y., Salo,P., Alagappan,R., Brown,L.G., Rosenberg,M., Rozen,S., Jaffe,T., Strauss,D., Hovatta,O., de la Chapell,A., Silber,S. and Page,D. TITLE Diverse spermatogenic defects in humans caused by Y chromosome deletions encompassing a novel RNA-binding protein gene JOURNAL Nature Genet. 10 (4), 383-393 (1995) MEDLINE 95400318 REFERENCE 2 (bases 1 to 1849) AUTHORS Reijo,R.A. TITLE Direct Submission JOURNAL Submitted (24-FEB-1995) Renee A. Reijo, Whitehead Institute for Biomedical Research, 9 Cambridge Center, Cambridge, MA 02142, USA FEATURES Location/Qualifiers source 1..1849 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pDP1577" /chromosome="Y" /map="Yq6D" /sex="Male" /tissue_type="testis" /dev_stage="adult" gene 209..1309 /gene="DAZ" CDS 209..1309 /gene="DAZ" /standard_name="Deleted in AZoospermia" /codon_start=1 /function="putative RNA binding" /product="DAZ protein" /db_xref="PID:g1045308" /translation="MSAANPETPNSTISREASTQSSSAAASQGWVLPEGKIVPNTVFV GGIDARMDETEIGSCFGRYGSVKEVKIITNRTGVSKGYGFVSFVNDVDVQKIVGSQIH FHGKKLKLGPAIRKQKLCARHVQPRPLVVNPPPPPQFQNVWRNPNTETYLQPQITPNP VTQHVQAYSAYPHSPGQVITGCQLLVYNYQEYPTYPDSPFQVTTGYQLPVYNYQPFPA YPSSPFQVTAGYQLPVYNYQAFPAYPSSPFQVTTGYQLPVYNYQAFPAYPSSPFQVTT GYQLPVYNYQAFPAYPSSPFQVTTGYQLPVYNYQAFPAYPNSAVQVTTGYQFHVYNYQ MPPQCPVGEQRRNLWTEAYKWWYLVCLIQRRD" misc_feature 302..554 /gene="DAZ" /note="encodes RNP/RRM domain (RNA binding protein domain/ RNA Recognition Motif)" repeat_region 695..1198 /note="DAZ repeats, 72 nucleotide repeats similar to DYS1 - a human Y chromosome polymorphic marker, GenBank Accession Number S86117" /rpt_type=tandem /rpt_unit=743..813 BASE COUNT 507 a 412 c 364 g 566 t ORIGIN 1 agtcggcctg cgctcctcag cctggcggtt ctacctccga gggttcgccc gcccttggtt 61 ttccttacac cttagccttt ggctcctttg accactcgaa gccccacagc gtgttccagc 121 ggacttcacc agcagaccca gaagtggtgg gtgaaacact gcctctgttc ctccttgagc 181 ctgtcgggag ctgctgcctg ccaccaccat gtctgctgca aatcctgaga ctccaaactc 241 aaccatctcc agagaggcca gcacccagtc ttcatcagct gcagctagcc aaggctgggt 301 gttaccagaa ggcaaaatcg tgccaaacac tgtttttgtt ggtggaattg atgctaggat 361 ggatgaaact gagattggaa gctgctttgg tagatacggt tcagtgaaag aagtgaagat 421 aatcacgaat cgaactggtg tgtccaaagg ctatggattt gtttcgtttg ttaatgacgt 481 ggatgtccag aagatagtag gatcacagat acatttccat ggtaaaaagc tgaagctggg 541 ccctgcaatc aggaaacaaa agttatgtgc tcgtcatgtg cagccacgtc ctttggtagt 601 taatcctcct cctccaccac agtttcagaa cgtctggcgg aatccaaaca ctgaaaccta 661 cctgcagccc caaatcacgc cgaatcctgt aactcagcac gttcaggctt attctgctta 721 tccacattca ccaggtcagg tcatcactgg atgtcagttg cttgtatata attatcagga 781 atatcctact tatcccgatt caccatttca ggtcaccact ggatatcagt tgcctgtata 841 taattatcag ccatttcctg cttatccaag ttcaccattt caggtcactg ctggatatca 901 gttgcctgta tataattatc aggcatttcc tgcttatcca agttcaccat ttcaggtcac 961 cactggatat cagttgcctg tatataatta tcaggcattt cctgcttatc caagttcacc 1021 atttcaggtc accactggat atcagttgcc tgtatataat tatcaggcat ttcctgctta 1081 tccaagttca ccatttcagg tcaccactgg atatcagttg cctgtatata attatcaggc 1141 atttcctgct tatccaaatt cagcagttca ggtcaccact ggatatcagt tccatgtata 1201 caattaccag atgccaccgc agtgccctgt tggggagcaa aggagaaatc tgtggaccga 1261 agcatacaaa tggtggtatc ttgtctgttt aatccagaga agagactgat aaattccgtt 1321 gttactcaag atgactgctt caagggtaaa agagtgcatc gctttagaag aagtttggca 1381 gtatttaaat ctgttggatc ctctcagcta tctagtttca tgggaagttg ctggttttga 1441 atattaagct aaaagttttc cactattaca gaaattctga attttggtaa atcacactga 1501 aactttctgt ataacttgta ttattagact ctctagtttt atcttaacac tgaaactgtt 1561 cttcattaga tgtttattta gaacctggtt ctgtgtttaa tatatagttt aaagtaacaa 1621 ataatcgaga ctgaaagaat gttaagattt atctgcaagg atttttaaaa aattgaaact 1681 tgcattttaa agtgtttaaa agcaaattac tgactttcaa aaaagttttt aaaacctgat 1741 ttgaaagcta acaattttgg atagtctgaa cacaagcatt tcacttctcc aagaagtacc 1801 tgtgaacagt acaatatttc agtattgagc tttgcattta tgatttatc // LOCUS HSU21847 2872 bp mRNA PRI 17-JAN-1996 DEFINITION Human TGF-beta inducible early protein (TIEG) mRNA, complete cds. ACCESSION U21847 NID g1155214 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2872) AUTHORS Subramaniam,M., Harris,S.A., Oursler,M.J., Rasmussen,K., Riggs,B.L. and Spelsberg,T.C. TITLE Identification of a novel TGF-beta-regulated gene encoding a putative zinc finger protein in human osteoblasts JOURNAL Nucleic Acids Res. 23 (23), 4907-4912 (1995) MEDLINE 96128307 REFERENCE 2 (bases 1 to 2872) AUTHORS Subramaniam,M. TITLE Direct Submission JOURNAL Submitted (27-FEB-1995) Malayannan Subramaniam, Biochemistry and Molecular Biology, Mayo Clinic, 1615 Guggenheim, Rochester, NY 55905, USA FEATURES Location/Qualifiers source 1..2872 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="55 year old female" /sex="female" /cell_line="osteoblast primary culture" gene 87..1529 /gene="TIEG" CDS 87..1529 /gene="TIEG" /codon_start=1 /product="TGF-beta inducible early protein" /db_xref="PID:g1155215" /translation="MLNFGASLQQTAEERMEMISERPKESMYSWNKTAEKSDFEAVEA LMSMSCSWKSDFKKYVENRPVTPVSDLSEEENLLPGTPDFHTIPAFCLTPPYSPSDFE PSQVSNLMAPAPSTVHFKSLSDTAKPHIAAPFKEEEKSPVSAPKLPKAQATSVIRHTA DAQLCNHQTCPMKAASILNYQNNSFRRRTHLNVEAARKNIPCAAVSPNRSKCERNTVA DVDEKASAALYDFSVPSSETVICRSQPAPVSPQQKSVLVSPPAVSAGGVPPMPVICQM VPLPANNPVVTTVVPSTPPSQPPAVCPPVVFMGTQVPKGAVMFVVPQPVVQSSKPPVV SPNGTRLSPIAPAPGFSPSAAKVTPQIDSSRIRSHICSHPGCGKTYFKSSHLKAHTRT HTGEKPFSCSWKGCERRFARSDELSRHRRTHTGEKKFACPMCDRRFMRSDHLTKHARR HLSAKKLPNWQMEVSKLNDIALPPTPAPTQ" misc_feature 234..359 /gene="TIEG" /note="encodes proline-rich region" misc_feature 247..346 /gene="TIEG" /note="encodes SH3 binding motifs" misc_feature 371..451 /gene="TIEG" /note="encodes zinc finger domain" polyA_signal 2852..2857 polyA_site 2872 /note="9 A nucleotides" BASE COUNT 830 a 630 c 606 g 806 t ORIGIN 1 gaattcggca cgagcgcccg tctgtggcca agcagccagc agcctagcag ccagtcagct 61 tgccgccggc ggccaagcag ccaaccatgc tcaacttcgg tgcctctctc cagcagactg 121 cggaggaaag aatggaaatg atttctgaaa ggccaaaaga gagtatgtat tcctggaaca 181 aaactgcaga gaaaagtgat tttgaagctg tagaagcact tatgtcaatg agctgcagtt 241 ggaagtctga ttttaagaaa tacgttgaaa acagacctgt tacaccagta tctgatttgt 301 cagaggaaga gaatctgctt ccgggaacac ctgattttca tacaatccca gcattttgtt 361 tgactccacc ttacagtcct tctgactttg aaccctctca agtgtcaaat ctgatggcac 421 cagcgccatc tactgtacac ttcaagtcac tctcagatac tgccaaacct cacattgccg 481 cacctttcaa agaggaagaa aagagcccag tatctgcccc caaactcccc aaagctcagg 541 caacaagtgt gattcgtcat acagctgatg cccagctatg taaccaccag acctgcccaa 601 tgaaagcagc cagcatcctc aactatcaga acaattcttt tagaagaaga acccacctaa 661 atgttgaggc tgcaagaaag aacataccat gtgccgctgt gtcaccaaac agatccaaat 721 gtgagagaaa cacagtggca gatgttgatg agaaagcaag tgctgcactt tatgactttt 781 ctgtgccttc ctcagagacg gtcatctgca ggtctcagcc agcccctgtg tccccacaac 841 agaagtcagt gttggtctct ccacctgcag tatctgcagg gggagtgcca cctatgccgg 901 tcatctgcca gatggttccc cttcctgcca acaaccctgt tgtgacaaca gtcgttccca 961 gcactcctcc cagccagcca ccagccgttt gcccccctgt tgtgttcatg ggcacacaag 1021 tccccaaagg cgctgtcatg tttgtggtac cccagcccgt tgtgcagagt tcaaagcctc 1081 cggtggtgag cccgaatggc accagactct ctcccattgc ccctgctcct gggttttccc 1141 cttcagcagc aaaagtcact cctcagattg attcatcaag gataaggagt cacatctgta 1201 gccacccagg atgtggcaag acatacttta aaagttccca tctgaaggcc cacacgagga 1261 cgcacacagg agaaaagcct ttcagctgta gctggaaagg ttgtgaaagg aggtttgccc 1321 gttctgatga actgtccaga cacaggcgaa cccacacggg tgagaagaaa tttgcgtgcc 1381 ccatgtgtga ccggcggttc atgaggagtg accatttgac caagcatgcc cggcgccatc 1441 tatcagccaa gaagctacca aactggcaga tggaagtgag caagctaaat gacattgctc 1501 tacctccaac ccctgctccc acacagtgac agaccggaaa gtgaagagtc agaactaact 1561 ttggtctcag cgggagccag tggtgatgta aaaatgcttc cactgcaagt ctgtggcccc 1621 acaacgtggg cttaaagcag aagccccaca gcctggcacg aaggccccgc ctgggttagg 1681 tgactaaaag ggcttcggcc acaggcaggt cacagaaagg caggtttcat ttcttatcac 1741 ataagagaga tgagaaagct tttattcctt tgaatatttt ttgaaggttt cagatgaggt 1801 caacacaggt agcacagatt ttgaatctgt gtgcatattt gttactttac ttttgctgtt 1861 tatacttgag accaactttt caatgtgatt cttctaaagc actggtttca agaatatgga 1921 agctggaagg aaataaacat tacggtacag acatggagat gtaaaatgag tttgtattat 1981 tacaaatatt gtcatctttt tctagagtta tcttctttat tattcctagt ctttccagtc 2041 aacatcgtgg atgtagtgat taaatatatc tagaactatc atttttacac tattgtgaat 2101 atttggaatt gaacgactgt atattgctaa gagggcccaa agaattggaa tcctccttaa 2161 tttaattgct ttgaagcata gctacaattt gtttttgcat ttttgttttg aaagtttaac 2221 aaatgactgt atctaggcat ttcattatgc tttgaacttt agtttgcctg cagtttcttg 2281 tgtagatttg aaaattgtat accaatgtgt tttctgtaga ctctaagata cactgcactt 2341 tgtttagaaa aaaaactgaa gatgaaatat atattgtaaa gaagggatat taagaatctt 2401 agataacttc ttgaaaaaga tggcttatgt catcagtaaa gtacctttat gttatgagga 2461 tataatgtgt gctttattga attagaaaat tagtgaccat tattcacagg tggacaaatg 2521 ttcgtcctgt taatttatag gagttttttg gggatgtgga ggtagttggg tagaaaaatt 2581 attagaacat tcacttttgt taacagtatt tctcttttat tctgttatat agtggatgat 2641 atacacagtg gcaaaacaaa agtacattgc ttaaaatata tagtgaaaaa tgtcactata 2701 tcttcccatt taacattgtt tttgtatatt gggtgtagat ttctgacatc aaaacttgga 2761 cccttggaaa acaaaagttt taattaaaaa aaatccttgt gacttacaat ttgcacaata 2821 tttcttttgt tgtactttat atcttgttta caataaagaa ttccctttgg ca // LOCUS HSU21858 1143 bp mRNA PRI 29-JUN-1995 DEFINITION Human transcriptional activation factor TAFII32 mRNA, complete cds. ACCESSION U21858 NID g841307 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1143) AUTHORS Klemm,R.D., Goodrich,J.A., Zhou,S. and Tjian,R. TITLE Molecular cloning and expression of the 32-kDa subunit of human TFIID reveals interactions with VP16 and TFIIB that mediate transcriptional activation JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (13), 5788-5792 (1995) MEDLINE 95320160 REFERENCE 2 (bases 1 to 1143) AUTHORS Tjian,R. TITLE Direct Submission JOURNAL Submitted (27-FEB-1995) Robert Tjian, Department of Molecular and Cell Biology, Howard Hughes Medical Institute, 401 Barker Hall, University of California, Berkeley, Berkeley CA 94720, USA FEATURES Location/Qualifiers source 1..1143 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 148..942 /codon_start=1 /product="TAFII32 precursor" /db_xref="PID:g841308" /translation="MESGKTASPKSMPKDAQMMAQILKDMGITEYEPRVINQMLEFAF RYVTTILDDAKIYSSHAKKATVDADDVRLAIQCRADQSFTSPPPRDFLLDIARQRNQT PLPLIKPYSGPRLPPDRYCLTAPNYRLKSLQKKASTSAGRITVPRLSVGSVTSRPSTP TLGTPTPQTMSVSTKVGTPMSLTGQRFTVQMPTSQSPAVKASIPATSAVQNVLINPSL IGSKNILITTNMMSSQNTANESSNALKRKREDDDDDDDDDDDYDNL" BASE COUNT 367 a 238 c 225 g 313 t ORIGIN 1 ccggggacca tgttgcttcc gaacatcctg ctcaccggta caccaggggt tggaaaaacc 61 acactaggca aagaacttgc gtcaaaatca ggactgaaat acattaatgt gggtgattta 121 gctcgagaag tctgatcatc ggatatcatg gagtctggca agacggcttc tcccaagagc 181 atgccgaaag atgcacagat gatggcacaa atcctgaagg atatggggat tacagaatat 241 gagccaagag ttataaatca gatgttggag tttgccttcc gatatgtgac cacaattcta 301 gatgatgcaa aaatttattc aagccatgct aagaaagcta ctgttgatgc agatgatgtg 361 cgattggcaa tccagtgccg cgctgatcag tcttttacct ctcctccccc aagagatttt 421 ttattagata ttgcaaggca aagaaatcaa acccctttgc cattgatcaa gccatattca 481 ggtcctaggt tgccacctga tagatactgc ttaacagctc caaactatag gctgaaatct 541 ttacagaaaa aggcatcaac ttctgcggga agaataacag tcccgcggtt aagtgttggt 601 tcagttacta gcagaccaag tactcccaca ctaggcacac caaccccaca gaccatgtct 661 gtttcaacta aagtagggac tcccatgtcc ctcacaggtc aaaggtttac agtacagatg 721 cctacttctc agtctccagc tgtaaaagct tcaattcctg caacctcagc agttcagaat 781 gttctgatta atccatcatt aatcgggtcc aaaaacattc ttattaccac taatatgatg 841 tcatcacaaa atactgccaa tgaatcatca aatgcattga aaagaaaacg tgaagatgat 901 gatgatgacg atgatgatga tgatgactat gataatctgt aatctagcct tgctgaatgt 961 aacatgtata cttggtcttg aattcattgt actgatatta aacatgcatg ctggatgttt 1021 tcaagttgtg ttttagaaaa ctaataataa tgagtaaaca cagttaccat acttttcaat 1081 tgaaatgaag gtttttcatc agccttaaaa gtgtaagaaa aataaagttg tcattcattc 1141 gat // LOCUS HSU21936 3105 bp mRNA PRI 19-MAR-1995 DEFINITION Human peptide transporter (HPEPT1) mRNA, complete cds. ACCESSION U21936 NID g717118 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3105) AUTHORS Leibach,F.H. JOURNAL J. Biol. Chem. (1995) In press REFERENCE 2 (bases 1 to 3105) AUTHORS Leibach,F.H. TITLE Direct Submission JOURNAL Submitted (28-FEB-1995) Frederick H. Leibach, Biochemistry & Molecular Biology, Medical College of Georgia, 1120 Fifteenth Street, Augusta, GA 30912, USA FEATURES Location/Qualifiers source 1..3105 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HPEPT1" /cell_type="epithelial cell" /tissue_type="intestine" /dev_stage="adult" /chromosome="13" gene 57..2183 /gene="HPEPT1" CDS 57..2183 /gene="HPEPT1" /codon_start=1 /function="oligopeptide and beta-lactam antibiotic transport" /evidence=experimental /product="peptide transporter" /db_xref="PID:g717119" /translation="MGMSKSHSFFGYPLSIFFIVVNEFCERFSYYGMRAILILYFTNF ISWDDNLSTAIYHTFVALCYLTPILGALIADSWLGKFKTIVSLSIVYTIGQAVTSVSS INDLTDHNHDGTPDSLPVHVVLSLIGLALIALGTGGIKPCVSAFGGDQFEEGQEKQRN RFFSIFYLAINAGSLLSTIITPMLRVQQCGIHSKQACYPLAFGVPAALMAVALIVFVL GSGMYKKFKPQGNIMGKVAKCIGFAIKNRFRHRSKAFPKREHWLDWAKEKYDERLISQ IKMVTRVMFLYIPLPMFWALFDQQGSRWTLQATTMSGKIGALEIQPDQMQTVNAILIV IMVPIFDAVLYPLIAKCGFNFTSLKKMAVGMVLASMAFVVAAIVQVEIDKTLPVFPKG NEVQIKVLNIGNNTMNISLPGEMVTLGPMSQTNAFMTFDVNKLTRINISSPGSPVTAV TDDFKQGQRHTLLVWAPNHYQVVKDGLNQKPEKGENGIRFVNTFNELITITMSGKVYA NISSYNASTYQFFPSGIKGFTISSTEIPPQCQPNFNTFYLEFGSAYTYIVQRKNDSCP EVKVFEDISANTVNMALQIPQYFLLTCGEVVFSVTGLEFSYSQAPSNMKSVLQAGWLL TVAVGNIIVLIVAGAGQFSKQWAEYILFAALLLVVCVIFAIMARFYTYINPAEIEAQF DEDEKKNRLEKSNPYFMSGANSQKQM" polyA_site 3105 /note="29 A nucleotides" BASE COUNT 796 a 735 c 709 g 865 t ORIGIN 1 ccacctgcca ggagcacgtc ccgccggcag tcgcaggagc cctgggagcc gccgccatgg 61 gaatgtccaa atcacacagt ttctttggtt atcccctgag catcttcttc atcgtggtca 121 atgagttttg cgaaagattt tcctactatg gaatgcgagc aatcctgatt ctgtacttca 181 caaatttcat cagctgggat gataacctgt ccaccgccat ctaccatacg tttgtggctc 241 tgtgctacct gacgccaatt ctcggagctc ttatcgccga ctcgtggctg ggaaagttca 301 agaccattgt gtcgctctcc attgtctaca caattggaca agcagtcacc tcagtaagct 361 ccattaatga cctcacagac cacaaccatg atggcacccc cgacagcctt cctgtgcacg 421 tggtgctgtc cttgatcggc ctggccctga tagctctcgg gactggagga atcaaaccct 481 gtgtgtctgc gtttggtgga gatcagtttg aagagggcca ggagaaacaa agaaacagat 541 ttttttccat cttttacttg gctattaatg ctggaagttt gctttccaca atcatcacac 601 ccatgctcag agttcaacaa tgtggaattc acagtaaaca agcttgttac ccactggcct 661 ttggggttcc tgctgctctc atggctgtag ccctgattgt gtttgtcctt ggcagtggga 721 tgtacaagaa gttcaagcca cagggcaaca tcatgggtaa agtggccaag tgcatcggtt 781 ttgccatcaa aaatagattt aggcatcgga gtaaggcatt tcccaagagg gagcactggc 841 tggactgggc taaagagaaa tacgatgagc ggctcatctc ccaaattaag atggttacga 901 gggtgatgtt cctgtatatt ccactcccaa tgttctgggc cttgtttgac cagcagggct 961 ccaggtggac actgcaggca acaactatgt ccgggaaaat cggagctctt gaaattcagc 1021 ccgatcagat gcagaccgtg aacgccatcc tgatcgtgat catggtcccg atcttcgatg 1081 ctgtgctgta ccctctcatt gcaaaatgtg gcttcaattt cacctccttg aagaagatgg 1141 cagttggcat ggtcctggcc tccatggcct ttgtggtggc tgccatcgtg caggtggaaa 1201 tcgataaaac tcttccagtc ttccccaaag gaaacgaagt ccaaattaaa gttttgaata 1261 taggaaacaa taccatgaat atatctcttc ctggagagat ggtgacactt ggcccaatgt 1321 ctcaaacaaa tgcatttatg acttttgatg taaacaaact gacaaggata aacatttctt 1381 ctcctggatc accagtcact gctgtaactg acgacttcaa gcagggccaa cgccacacgc 1441 ttctagtgtg ggcccccaat cactaccagg tggtaaagga tggtcttaac cagaagccag 1501 aaaaagggga aaatggaatc agatttgtaa atacttttaa cgagctcatc accatcacaa 1561 tgagtgggaa agtttatgca aacatcagca gctacaatgc cagcacatac cagttttttc 1621 cttctggcat aaaaggcttc acaataagct caacagagat tccgccacaa tgtcaaccta 1681 atttcaatac tttctacctt gaatttggta gtgcttatac ctatatagtc caaaggaaga 1741 atgacagctg ccctgaagtg aaggtgtttg aagatatttc agccaacaca gttaacatgg 1801 ctctgcaaat cccgcagtat tttcttctca cctgtggcga agtggtcttc tctgtcacgg 1861 gattggaatt ctcatattct caggctcctt ccaacatgaa gtcggtgctt caggcaggat 1921 ggctgctgac cgtggctgtt ggcaacatca ttgtgctcat cgtggcaggg gcaggccagt 1981 tcagcaaaca gtgggccgag tacattctat ttgccgcgtt gcttctggtc gtctgtgtaa 2041 tttttgccat catggctcgg ttctatactt acatcaaccc agcggagatc gaagctcaat 2101 ttgatgagga tgaaaagaaa aacagactgg aaaagagtaa cccatatttc atgtcagggg 2161 ccaattcaca gaaacagatg tgaaggtcag gaggcaagtg gaggatggac tgggcccgca 2221 gatgccctga cctctgcccc caggtagcag gacactccat tggatggccc ctgatgagga 2281 agacttcaga attgggaact aaaccatgaa tgctattttc ttttttcttt ttcttttctt 2341 tttttttttt ttttttttga gacagagttt tgctcttgtt gtccaggctg gagtgcaatg 2401 gcacgatctc agctcactgc aacctccgcc tcccaggttc aagtaattct cctgcctcag 2461 cctcccgagt ggctgggatt agcggcatgc accaccacgc ccagctattt ttgtattttt 2521 agtagagatg gggtttcacc atgttaacca ggatggtctc gatctcttga cctggtgatc 2581 tgcccacctc ggcctgccaa agtgctggga ttacaggctt gagctaccgc gcccggccgt 2641 gaacgctatt ttctaagcag ccagcagtga atctaaaact ctggaagaag tcttctgttt 2701 gaaaggctta tttaagccac acgtacacac actgtcttag agtactgtga gcccacccca 2761 cattggtcat cttccctatc acacaaatga tgttattttg gactagctta attttgaaat 2821 ggtaacaaag tttcctattc catactgttc atttctaata ctcttacgaa aactattcta 2881 aaggaggcag gagccaaggc caaaagtgaa cgtacaggtt taaaatggct gtgataaggg 2941 ccagctggta ttaactgata actttacctt tgggtttttg ttattttgtt tttctagtcc 3001 ctacctgtgt ttaaattatg gataactcga aagacaggct caggtgaagg ccgagtaatg 3061 attttttttg aagtttcaat ggtgtgaaat aaatttctgt tctta // LOCUS HSU21938 915 bp mRNA PRI 27-MAR-1995 DEFINITION Human alpha-tocopherol transfer protein mRNA, complete cds. ACCESSION U21938 NID g726181 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 915) AUTHORS Deng,H.-X., Hentati,A. and Siddique,T. JOURNAL Unpublished REFERENCE 2 (bases 1 to 915) AUTHORS Deng,H. TITLE Direct Submission JOURNAL Submitted (28-FEB-1995) Neurology, Northwestern University Medical School, 303 E. Chicago Ave./Tarry 13-715, Chicago, IL 60611-2712, USA FEATURES Location/Qualifiers source 1..915 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="8" /map="8q13" CDS 8..844 /codon_start=1 /product="alpha-tocopherol transfer protein" /db_xref="PID:g726182" /translation="MAEARSQPSAGPQLNALPDHSPLLQPGLAALRRRAREAGVPLAP LPLTDSFLLRFLRARDFDLDLAWRLLKNYYKWRAECPEISADLHPRSIIGLLKAGYHG VLRSRDPTGSKVLIYRIAHWDPKVFTAYDVFRVSLITSELIVQEVETQRNGIKAIFDL EGWQFSHAFQITPSVAKKIAAVLTDSFPLKVRGIHLINEPVIFHAVFSMIKPFLTEKI KERIHMHGNNYKQSLLQHFPDILPLEYGGEEFSMEDICQEWTNFIMKSEDYLRSISES IQ" BASE COUNT 238 a 220 c 221 g 236 t ORIGIN 1 ggcgggcatg gcagaggcgc gatcccagcc ctcggcgggg ccgcagctca acgcgctacc 61 ggaccactct ccgttgctgc agccgggcct ggcggcgctg cggcgccggg cccgggaagc 121 tggcgtcccg ctcgcgccgc tgccgctcac cgactccttc ctgctgcggt tcctgcgcgc 181 ccgggatttc gatctggacc tggcctggcg gttactaaaa aactattata agtggagagc 241 agaatgtcca gaaataagtg cagatctaca ccctagaagt attattggcc tcctaaaggc 301 tggctaccat ggagtcctga gatccaggga tcccactggc agcaaagttc ttatttacag 361 aatcgcacac tgggacccca aagtttttac agcttatgac gtatttcgag taagtctaat 421 cacatccgag cttattgtac aggaggtaga aactcagcgg aatggaatca aggctatctt 481 tgatctggaa ggttggcagt tttctcatgc ttttcaaatc actccatccg tagccaagaa 541 gattgctgct gtacttacgg attcatttcc attgaaagtt cgtggcatcc atttgataaa 601 tgaaccagta attttccatg ctgtcttttc catgatcaaa ccattcctga ctgaaaaaat 661 taaggaacgg attcacatgc atgggaacaa ctacaaacaa agcttgcttc agcatttccc 721 agacattctt cctctggaat atggtggtga agaattctcc atggaggaca tttgtcagga 781 atggacaaat tttataatga agtctgaaga ttatctcagg agcatttctg agagcattca 841 atgagaagtt atgtcatgtg aatgcgttcc taactaaaat catgagtgat atccaactgg 901 ttaattgatt gaaga // LOCUS HSU21943 2721 bp mRNA PRI 16-FEB-1996 DEFINITION Human organic anion transporting polypeptide (OATP) mRNA, complete cds. ACCESSION U21943 NID g885977 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2721) AUTHORS Kullak-Ublick,G.A., Hagenbuch,B., Stieger,B., Schteingart,C.D., Hofmann,A.F., Wolkoff,A.W. and Meier,P.J. TITLE Molecular and functional characterization of an organic anion transporting polypeptide cloned from human liver JOURNAL Gastroenterology 109 (4), 1274-1282 (1995) MEDLINE 96029330 REFERENCE 2 (bases 1 to 2721) AUTHORS Hagenbuch,B. TITLE Direct Submission JOURNAL Submitted (01-MAR-1995) Bruno Hagenbuch, Division of Clinical Pharmacology and Toxicology, Internal Medicine, University Hospital, Zurich Ch-8091, Switzerland FEATURES Location/Qualifiers source 1..2721 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="hepatocyte" /tissue_type="liver" /dev_stage="adult" misc_signal 45..57 /note="Kozak sequence" gene 54..2066 /gene="OATP" CDS 54..2066 /gene="OATP" /codon_start=1 /function="mediates transport of organic anions" /evidence=experimental /product="organic anion transporting polypeptide" /db_xref="PID:g885978" /translation="MGETEKRIETHRIRCLSKLKMFLLAITCAFVSKTLSGSYMNSML TQIERQFNIPTSLVGFINGSFEIGNLLLIIFVSYFGTKLHRPIMIGIGCVVMGLGCFL KSLPHFLMNQYEYESTVSVSGNLSSNSFLCMENGTQILRPTQDPSECTKEVKSLMWVY VLVGNIVRGMGETPILPLGISYIEDFAKFENSPLYIGLVETGAIIGPLIGLLLASFCA NVYVDTGFVNTDDLIITPTDTRWVGAWWFGFLICAGVNVLTAIPFFFLPNTLPKEGLE TNADIIKNENEDKQKEEVKKEKYGITKDFLPFMKSLSCNPIYMLFILVSVIQFNAFVN MISFMPKYLEQQYGISSSDAIFLMGIYNLPPICIGYIIGGLIMKKFKITVKQAAHIGC WLSLLEYLLYFLSFLMTCENSSVVGINTSYEGIPQDLYVENDIFADCNVDCNCPSKIW DPVCGNNGLSYLSACLAGCETSIGTGINMVFQNCSCIQTSGNSSAVLGLCDKGPDCSL MLQYFLILSAMSSFIYSLAAIPGYMVLLRCMKSEEKSLGVGLHTFCTRVFAGIPAPIY FGALMDSTCLHWGTLKCGESGACRIYDSTTFRYIYLGLPAALRGSSFVPALIILILLR KCHLPGENASSGTELIETKVKGKENECKDIYQKSTVLKDDELKTKL" polyA_site 2721 BASE COUNT 831 a 479 c 492 g 919 t ORIGIN 1 aagacatatc aagatttaaa acataaatag cttggaacac cctgaagagc aacatgggag 61 aaactgagaa aagaattgaa acccatagaa taagatgtct ttccaagttg aagatgtttc 121 tgttggcaat aacatgtgca tttgtatcca aaacactgtc tggatcttat atgaattcca 181 tgctcacaca aatagagaga caattcaaca tcccaacatc tctagttgga ttcattaatg 241 gaagctttga gattggaaat cttttgttga ttatatttgt gagttatttt ggaaccaaac 301 tgcatagacc tataatgatt ggcattggat gtgtggttat gggcttaggc tgtttcttaa 361 aatcactacc tcatttcctc atgaaccaat atgaatatga atctacagtt tcagtttcag 421 gcaacttgtc ctcaaacagt ttcttgtgta tggaaaatgg aacccagatt ttaagaccaa 481 cgcaggatcc atcagagtgt acaaaggaag ttaaatcatt aatgtgggtg tacgtcctag 541 taggcaatat tgtacgtgga atgggtgaaa ctcccatcct gcctttgggt atttcctata 601 tagaagattt tgccaaattt gaaaattctc ctttatatat tgggcttgta gaaacaggag 661 ctattattgg tcctttgatt ggacttttgt tggcatcatt ctgtgcaaat gtttatgttg 721 acactggatt tgtgaacaca gatgatctga tcataactcc cactgacact cgttgggtcg 781 gtgcatggtg gtttggcttt ctgatttgtg caggagttaa cgtgctcact gccattcctt 841 ttttcttttt gcccaacaca cttccaaagg aaggactaga gactaatgct gacatcatta 901 aaaatgaaaa tgaagacaaa caaaaagaag aggtcaagaa ggaaaaatat ggaatcacta 961 aagattttct acctttcatg aaaagtcttt cctgcaatcc aatttatatg cttttcatac 1021 ttgtaagtgt gatacagttc aatgcattcg ttaacatgat ctccttcatg cctaaatacc 1081 tagaacagca atatggaata tcatcttcag atgcaatctt tctaatgggt atttataact 1141 tacctccaat atgtattgga tatataattg gtggtttaat tatgaagaag ttcaagatta 1201 ctgtcaaaca agctgcccac ataggatgtt ggttatcctt acttgagtat cttctctatt 1261 ttttatcttt tctcatgact tgtgaaaatt cttcagttgt tggaataaat acctcttatg 1321 aaggaattcc acaagattta tatgtggaaa atgacatctt tgctgattgc aatgtggatt 1381 gcaactgtcc atctaaaata tgggatcctg tgtgtggaaa caatggcttg tcatatctgt 1441 cagcttgtct tgctggttgt gagacatcca ttggaacggg aataaacatg gtgttccaaa 1501 attgcagctg tattcaaaca tcaggaaatt catctgcagt tcttgggctg tgcgacaaag 1561 gacctgactg ttccttgatg ctccagtact tcctaatctt gtcagcgatg agcagtttca 1621 tttattcttt ggctgccata cctggatata tggttctctt gaggtgtatg aaatctgaag 1681 agaagtccct tggtgtggga ttacatacat tttgcacaag agtatttgct ggcattcctg 1741 cacctatata ttttggcgct ttaatggatt ccacatgttt acactgggga actttgaaat 1801 gtggtgagtc aggggcatgc aggatatatg attccaccac cttcagatac atctacctcg 1861 gattgccggc agcactaaga ggatcaagct ttgttccagc cttaatcatc ttaattcttt 1921 tgaggaagtg tcatctacct ggtgaaaatg cctcttcagg aacagagctt atagagacaa 1981 aagtcaaagg gaaggaaaat gagtgcaaag atatatacca aaagtccacg gttttgaaag 2041 atgatgaatt gaaaactaaa ttgtaattgt cctattatat tacttttttc agaattagag 2101 aacatgctgt acaacttaat tgttttaaaa atcagtagag atataataga taactttttc 2161 ttgtctttaa gaacctaaaa aacctcttaa ctcaaaataa taaaatgttc actaatgata 2221 tttctaaggt atcagtgaca cttgagtttt cctaggaggg acatcaataa cagcccccta 2281 aagaagattc ttagagccag ctttattttt atgttgaaac agcaatttcc cttaattcat 2341 cgaagtaagg gtgtacttcc tacatctcct tctactaata cttctaaaaa ttttctgtta 2401 tgaaaaccta tttaattcca ctaaatttgt tctttgatat tggaattatt cagatgccta 2461 aattctcatt ctgttatgtg aagatttaaa tattttattc aagtttatcg cttccatgtg 2521 agagaagcct acatcttctt attctattta ggaatcgttc tttaactctt cttattcatt 2581 ctaggcatga ctcctatata atagattact cataaatata ccctcctact ttcaattttt 2641 tcttttcttt attactcata catttgctca atttgtacag aatactgaca aacttaagca 2701 ggttattaaa catcatgagg c // LOCUS HSU22055 3480 bp mRNA PRI 03-NOV-1995 DEFINITION Human 100 kDa coactivator mRNA, complete cds. ACCESSION U22055 NID g799176 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3480) AUTHORS Tong,X., Drapkin,R., Yalamanchili,R., Mosialos,G. and Kieff,E. TITLE The Epstein-Barr virus nuclear protein 2 acidic domain forms a complex with a novel cellular coactivator that can interact with TFIIE JOURNAL Mol. Cell. Biol. 15 (9), 4735-4744 (1995) MEDLINE 95379816 REFERENCE 2 (bases 1 to 3480) AUTHORS Tong,X. TITLE Direct Submission JOURNAL Submitted (02-MAR-1995) Xiao Tong, Dept. of Microbiology and Molecular Genetics, Harvard University, 75 Francis St., Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..3480 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="EBV transformed B cells (IB4 cells)" CDS 268..2925 /codon_start=1 /function="associates with the EBV nuclear protein 2 acidic domain" /product="100 kDa coactivator" /db_xref="PID:g799177" /translation="MVLSGCAIIVRGQPRGGPPPERQINLSNIRAGNLARRAAATQPD AKDTPDEPWAFPAREFLRKKLIGKEVCFTIENKTPQGREYGMIYLGKDTNGENIAESL VAEGLATRREGMRANNPEQNRLSECEEQAKAAKKGMWSEGNGSHTIRDLKYTIENPRH FVDSHHQKPVNAIIEHVRDGSVVRALLLPDYYLVTVMLSGIKCPTFRREADGSETPEP FAAEAKFFTESRLLQRDVQIILESCHNQNIVGTILHPNGNITELLLKEGFARCVDWSI AVYTRGAEKLRAAERFAKERRLRIWRDYVAPTANLDQKDKQFVAKVMQVLNADAIVVK LNSGDYKTIHLSSIRPPRLEGENTQDKNKKLRPLYDIPYMFEAREFLRKKLIGKKVNV TVDYIRPASPATETVPAFSERTCATVTIGGINIAEALVSKGLATVIRYRQDDDQRSSH YDELLAAEARAIKNGKGLHSKKEVPIHRVADISGDTQKAKQFLPFLQRAGRSEAVVEY VFSGSRLKLYLPKETCLITFLLAGIECPRGARNLPGLVQEGEPFSEEATLFTKELVLQ REVEVEVESMDKAGNFIGWLHIDGANLSVLLVEHALSKVHFTAERSSYYKSLLSAEEA AKQKKEKVWAHYEEQPVEEVMPVLEEKERSASYKPVFVTEITDDLHFYVQDVETGTQF QKLMENMRNDIASHPPVEGSYAPRRGEFCIAKFVDGEWYRARVEKVESPAKIHVFYID YGNREVLPSTRLGTLSPAFSTRVLPAQATEYAFAFIQVPQDDDARTDAVDSVVRDIQN TQCLLNVEHLSAGCPHVTLQFADSKGDVGLGLVKEGLVMVEVRKEKQFQKVITEYLNA QESAKSARLNLWRYGDFRADDADEFGYSR" BASE COUNT 838 a 934 c 961 g 747 t ORIGIN 1 ggcggagatc gcgtctcttt cgctccgtgt ccgctgctgc tcctgtgagc gcccggcgag 61 tccgtcccgt ccaccgtccg cagctggtag ccagcctgcc cctcgcctcg actccctttc 121 accaacaccg acacccacat tgacacctcc agtccggcca gccgctccac tcgttgcctt 181 tgcatctcca cacatggcgt cctcgcgcag agcggcggct cctccggggg acccgcggtc 241 cccaccgtgc agcggggcat catcaagatg gtcctctcag ggtgcgccat cattgtccga 301 ggtcagcctc gtggtgggcc tcctcctgag cggcagatca acctcagcaa cattcgtgct 361 ggaaatcttg ctcgccgggc agccgccaca caacctgatg caaaggatac ccctgatgag 421 ccctgggcat ttccagctcg agagttcctt cgaaagaagc tgattgggaa ggaagtctgt 481 ttcacgatag aaaacaagac tccccagggg cgagagtatg gcatgatcta ccttggaaaa 541 gataccaatg gggaaaacat tgcagaatca ctggttgcag agggcttagc cacccggaga 601 gaaggcatga gagctaataa tcctgagcag aaccggcttt cagaatgtga agaacaagca 661 aaggcagcca agaaagggat gtggagtgag gggaacggtt cacatactat ccgggatctc 721 aagtatacca ttgaaaaccc aaggcacttt gtggactcac accaccagaa gcctgttaat 781 gctatcatcg agcatgtgcg ggacggcagt gtggtcaggg ccctgctcct cccagattac 841 tacctggtta cagtcatgct gtcaggcatc aagtgcccaa cttttcgacg ggaagcagat 901 ggcagtgaaa ctccagagcc ttttgctgca gaagccaaat ttttcactga gtcgcgactg 961 cttcagagag atgttcagat cattctggag agctgccaca accagaacat tgtgggtacc 1021 atccttcatc caaatggcaa catcacagag ctcctcctga aggaaggttt cgcacgctgt 1081 gtggactggt cgattgcagt ttacacccgg ggcgcagaaa agctgagggc ggcagagagg 1141 tttgccaaag agcgcaggct gagaatatgg agagactatg tggctcccac agctaatttg 1201 gaccaaaagg acaagcagtt tgttgccaag gtgatgcagg ttctgaatgc tgatgccatt 1261 gttgtgaagc tgaactcagg cgattacaag acgattcacc tgtccagcat ccgaccaccg 1321 aggctggagg gggagaacac ccaggataag aacaagaaac tgcgtcccct gtatgacatt 1381 ccttacatgt ttgaggcccg ggaatttctt cgaaaaaagc ttattgggaa gaaggtcaat 1441 gtgacggtgg actacattag accagccagc ccagccacag agacagtgcc tgccttttca 1501 gagcgtacct gtgccactgt caccattgga ggaataaaca ttgctgaggc tcttgtcagc 1561 aaaggtctag ccacagtgat cagataccgg caggatgatg accagagatc atcacactac 1621 gatgaactgc ttgctgcaga ggccagagct attaagaatg gcaaaggatt gcatagcaag 1681 aaggaagtgc ctatccaccg tgttgcagat atatctgggg atacccaaaa agcaaagcag 1741 ttcctgcctt ttcttcagcg ggcaggtcgt tctgaagctg tggtggaata cgtcttcagt 1801 ggttctcgtc tcaaactcta tttgccaaag gaaacttgcc ttatcacctt cttgcttgca 1861 ggcattgaat gccccagagg agcccgaaac ctcccaggct tggtgcagga aggagagccc 1921 ttcagcgagg aagctacact tttcaccaag gaactggtgc tgcagcgaga ggtggaggtg 1981 gaggtggaga gcatggacaa ggccggcaac tttatcggct ggctgcacat cgacggtgcc 2041 aacctgtccg tcctgctggt ggagcacgcg ctctccaagg tccacttcac cgccgaacgc 2101 agctcctact acaagtccct gctgtctgcc gaggaggccg caaagcagaa gaaagagaag 2161 gtctgggccc actatgagga gcagcccgtg gaggaggtga tgccagtgct ggaggagaag 2221 gagcgatctg ctagctacaa gcccgtgttt gtgaccgaga tcactgatga cctgcacttc 2281 tacgtgcagg atgtggagac cggcacccag ttccagaagc tgatggagaa catgcgcaat 2341 gacattgcca gtcacccccc tgtagagggc tcctatgccc cccgcagggg agagttctgc 2401 attgccaaat ttgtagatgg agaatggtac cgtgcccgag tagagaaagt cgagtctcct 2461 gccaaaatac atgtcttcta cattgactac ggcaacagag aggtcctgcc atccacccgc 2521 ctgggtaccc tatcacctgc cttcagcact cgggtgctgc cagctcaagc cacggagtat 2581 gccttcgcct tcatccaggt gccccaagat gatgatgccc gcacggacgc cgtggacagc 2641 gtagttcggg atatccagaa cactcagtgc ctgctcaacg tggaacacct gagtgccggc 2701 tgcccccatg tcaccctgca gtttgcagat tccaagggcg atgtggggct gggcttggtg 2761 aaggaagggc tggtcatggt ggaggtgcgc aaggagaaac agttccagaa agtgatcaca 2821 gaatacctga atgcccaaga gtcagccaag agcgccaggc tgaacctgtg gcgctatgga 2881 gactttcgag ctgatgatgc agacgaattt ggctacagcc gctaaggagg ggatcgggtt 2941 tggcccccag cccccgtcac gccagtccct cttcctctgc cgggagggtg ttttcaactc 3001 caaaccccag agaggggttg tacattgggt ccagctttgc ttcagtgtgt ggaaatgtct 3061 cgtggggtgg catcggggct gcggggtggg gaccccaagg ctttctgggg cagacccttg 3121 tcctctggga tgatgggcac tgctatccac agtctctgcc agttggtttt atttggaggt 3181 ttgtgggctt ttttaaaaaa aaaaaagtcc tcaaatcagg aagaaacatc aaagactatg 3241 tcctagtgga gggagtaatc ctaacaccca ggctggccgc cagctggcac ctgcctctat 3301 cccagactgc cctcgtccca gctctctgtc caactgttga ttatgtgatt tttctgatac 3361 gtccattctc aaatgccagt gtgttcacat cttcgctctg gccagcccat tctgtattta 3421 aagctttttg aggcccaata aaatagtacg tgctgctgca gcccttattg atcaaaaaaa // LOCUS HSU22178 360 bp mRNA PRI 19-DEC-1995 DEFINITION Human prostatic secretory protein 57 mRNA, complete cds. ACCESSION U22178 NID g885984 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 360) AUTHORS Xuan,J.W., Chin,J.L., Guo,Y., Chambers,A.F., Finkelman,M.A. and Clarke,M.W. TITLE Alternative splicing of PSP94 (prostatic secretory protein of 94 amino acids) mRNA in prostate tissue JOURNAL Oncogene 11 (6), 1041-1047 (1995) MEDLINE 96032566 REFERENCE 2 (bases 1 to 360) AUTHORS Xuan,J.W., Chin,J., Guo,Y., Chambers,A.F., Finkleman,M.A. and Clarke,M.W. TITLE Direct Submission JOURNAL Submitted (06-MAR-1995) Michael W. Clarke, Microbiology, University of Western Ontario, Health Sciences Centre, London, Ontario N0L 1R0, Canada FEATURES Location/Qualifiers source 1..360 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="prostate" CDS 12..245 /note="product of alternatively spliced gene encoding PSP94 (prostatic secretory protein 94), PIR Accession Number S16238" /codon_start=1 /product="PSP57" /db_xref="PID:g885985" /translation="MNVLLGSVVIFATFVTLCNASCYFIPNEGVPGDSTRMFLHLWVM TKTTAKESSRRRTASISWWRRRTQKRPVLSVNG" polyA_site 360 /note="16 A nucleotides" BASE COUNT 103 a 77 c 82 g 98 t ORIGIN 1 tgcttatcac aatgaatgtt ctcctgggca gcgttgtgat ctttgccacc ttcgtgactt 61 tatgcaatgc atcatgctat ttcataccta atgagggagt tccaggagat tcaaccagga 121 tgtttctaca cctgtgggtt atgacaaaga caactgccaa agaatcttca agaaggagga 181 ctgcaagtat atcgtggtgg agaagaagga cccaaaaaag acctgttctg tcagtgaatg 241 gataatctaa tgtgcttcta gtaggcacag ggctcccagg ccaggcctca ttctccctgg 301 cctctaatag tcaatgattg tgtagccatg cctatcagta aaaagatttt tgagcaaaca // LOCUS HSU22233 2269 bp mRNA PRI 25-NOV-1995 DEFINITION Human methylthioadenosine phosphorylase (MTAP) mRNA, complete cds. ACCESSION U22233 NID g847723 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2269) AUTHORS Olopade,O.I., Pomykala,H.M., Hagos,F., Sveen,L.W., Espinosa,R. III., Dreyling,M.H., Gursky,S., Stadler,W.M., Le Beau,M.M. and Bohlander,S.K. TITLE Construction of a 2.8-megabase yeast artificial chromosome contig and cloning of the human methylthioadenosine phosphorylase gene from the tumor suppressor region on 9p21 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (14), 6489-6493 (1995) MEDLINE 95327672 REFERENCE 2 (bases 1 to 2269) AUTHORS Olopade,O.I. TITLE Direct Submission JOURNAL Submitted (06-MAR-1995) Olufunmilayo I. Olopade, Medicine, University of Chicago Pritzker School of Medicine, 5841 S. Maryland Avenue, Chicago, IL 60637-1470, USA FEATURES Location/Qualifiers source 1..2269 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="18-11 cDNA" /map="9p21" /chromosome="9" /sex="male" /cell_line="primary culture" /cell_type="fibroblast" /tissue_type="epidermis" gene 122..973 /gene="MTAP" CDS 122..973 /gene="MTAP" /codon_start=1 /product="methylthioadenosine phosphorylase" /db_xref="PID:g847724" /translation="MASGTTTTAVKIGIIGGTGLDDPEILEGRTEKYVDTPFGKPSDA LILGKIKNVDCILLARHGRQHTIMPSKVNYQANIWALKEEGCTHVIVTTACGSLREEI QPGDIVIIDQFIDRTTMRPQSFYDGSHSCARGVCHIPMAEPFCPKTREVLIETAKKLG LRCHSKGTMVTIEGPRFSSRAESFMFRTWGADVINMTTVPEVVLAKEAGICYASIAMA TDYDCWKEHEEAVSVDRVLKTLKENANKAKSLLLTTIPQIGSTEWSETLHNLKNMAQF SVLLPRH" BASE COUNT 725 a 407 c 490 g 647 t ORIGIN 1 gaattccgct ccgcactgct cactcccgcg cagtgaggtt ggcacagcca ccgctctgtg 61 gctcgcttgg ttcccttagt cccgagcgct cgcccactgc agattccttt cccgtgcaga 121 catggcctct ggcaccacca ccaccgccgt gaagattgga ataattggtg gaacaggcct 181 ggatgatcca gaaattttag aaggaagaac tgaaaaatat gtggatactc catttggcaa 241 gccatctgat gccttaattt tggggaagat aaaaaatgtt gattgcatcc tccttgcaag 301 gcatggaagg cagcacacca tcatgccttc aaaggtcaac taccaggcga acatctgggc 361 tttgaaggaa gagggctgta cacatgtcat agtgaccaca gcttgtggct ccttgaggga 421 ggagattcag cccggcgata ttgtcattat tgatcagttc attgacagga ccactatgag 481 acctcagtcc ttctatgatg gaagtcattc ttgtgccaga ggagtgtgcc atattccaat 541 ggctgagccg ttttgcccca aaacgagaga ggttcttata gagactgcta agaagctagg 601 actccggtgc cactcaaagg ggacaatggt cacaatcgag ggacctcgtt ttagctcccg 661 ggcagaaagc ttcatgttcc gcacctgggg ggcggatgtt atcaacatga ccacagttcc 721 agaggtggtt cttgctaagg aggctggaat ttgttacgca agtatcgcca tggcgacaga 781 ttatgactgc tggaaggagc acgaggaagc agtttcggtg gaccgggtct taaagaccct 841 gaaagaaaac gctaataaag ccaaaagctt actgctcact accatacctc agatagggtc 901 cacagaatgg tcagaaaccc tccataacct gaagaatatg gcccagtttt ctgttttatt 961 accaagacat taaagtagca tggctgccca ggagaaaaga agacattcta attccagtca 1021 ttttgggaat tcctgcttaa cttgaaaaaa atatgggaaa gacatgcagc tttcatgccc 1081 ttgcctatca aagagtatgt tgtaagaaag acaagacatt gtgtgtatta gagactcctg 1141 aatgatttag acaacttcaa aatacagaag aaaagcaaat gactagtaaa catgtgggaa 1201 aaaatattac attttaaggg ggaaaaaaaa aaccccacca ttctcttctc cccctattaa 1261 atttgcaaca ataaagggtg gagggtaatc tctactttcc tatactgcca aagaatgtga 1321 ggaagaaatg ggactctttg gttatttatt gatgcgactg taaattggta cagtatttct 1381 ggagggcaat ttggtaaaat gcatcaaaag acttaaaaat acggacgtcc tttggtgctg 1441 ggaactctac atctagcaat ttctctttaa aaccatatca gagatgcata caaagaatta 1501 tatataaaga agggtgttta ataatgatag ttataataat aaataattga aacaatctga 1561 atcccttgca attggaggta aattatgtct tagttataat ctagattgtg aatcagccaa 1621 ctgaaaatcc tttttgcata tttcaatgtc ctaaaaagac acggttgctc tatatatgaa 1681 gtgaaaaaag gatatggtag cattttatag tactagtttt gctttaaaat gctatgtaaa 1741 tatacaaaaa aactagaaag aaatatatat aaccttgtta ttgtatttgg gggagggata 1801 ctgggataat ttttattttc tttgaatctt tctgtgtctt cacatttttc tacagtgaat 1861 ataatcaaat agtaaagggc cgtaaaaata aaagtggatt tagaaagatc cagttcttga 1921 aaacactgtt tctggtaatg aagcagaatt taagttggta atattaaggt gaatgtcatt 1981 taagggagtt acatctttat tctgctaaag aagaggatca ttgatttctg tacagtcaga 2041 acagtacttg ggtgtgcaac agctttctga gaaaagctag gtgtataata gtttaactga 2101 aagtttaact atttaaaaga ctaaatgcac attttatggt atctgatatt ttaaaaagta 2161 atgtgagctt ctccttttta tgagttaaat tattttatac gagttggtaa tttgtgcctt 2221 ttaataaagt ggaagcttgc tttttaaaaa aaaaaaaaaa gcggaattc // LOCUS HSU22377 6229 bp mRNA PRI 09-MAR-1996 DEFINITION Human Zn-15 related zinc finger protein (rlf) mRNA, complete cds. ACCESSION U22377 NID g1218027 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6228) AUTHORS Makela,T.P., Hellsten,E., Vesa,J., Hirvonen,H., Palotie,A., Peltonen,L. and Alitalo,K. TITLE The rearranged L-myc fusion gene (RLF) encodes a Zn-15 related zinc finger protein JOURNAL Oncogene 11 (12), 2699-2704 (1995) MEDLINE 96132723 REFERENCE 2 (bases 1 to 6229) AUTHORS Makela,T.P. TITLE Direct Submission JOURNAL Submitted (08-MAR-1995) Tomi P. Makela, Whitehead Institute for Biomedical Research, 9, Cambridge Center, Cambridge, MA 02142, USA FEATURES Location/Qualifiers source 1..6229 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="K562 leukemia cell line; HEL leukemia cell line" gene 13..5757 /gene="rlf" CDS 13..5757 /gene="rlf" /note="Zn-15 related zinc finger protein" /codon_start=1 /product="RLF" /db_xref="PID:g1218028" /translation="MADGKGDAAAVAGAGAEAPAVAGAGDGVETESMVRGHRPVSPAP GASGLRPCLWQLETELREQEVSEVSSLNYCRSFCQTLLQYASNKNASEHIVYLLEVYR LAIQSFASARPYLTTECEDVLLVLGRLVLSCFELLLSVSESELPCEVWLPFLQSLQES HDALLEFGNNNLQILVHVTKEGVWKNPVLLKILSQQPVETEEVNKLIAQEGPSFLQMR IKHLLKSNCIPQATALSKLCAESKEISNVSSFQQAYITCLCSMLPNEDAIKEIAKVDC KEVLDIICNLESEGQDNTAFVLCTTYLTQQLQTASVYCSWELTLFWSKLQRRIDPSLD TFLERCRQFGVIAKTQQHLFCLIRVIQTEAQDAGLGVSILLCVRALQLRSSEDEEMKA SVCKTIACLLPEDLEVRRACQLTEFLIEPSLDGFNMLEELYLQPDQKFDEENAPVPNS LRCELLLALKAHWPFDPEFWDWKTLKRHCHQLLGQEASDSDDDLSGYEMSINDTDVLE SFLSDYDEGKEDKQYRRRDLTDQHKEKRDKKPIGSSERYQRWLQYKFFCLLCKRECIE ARILHHSKMHMEDGIYTCPVCIKKFKRKEMFVPHVMEHVKMPPSRRDRSKKKLLLKGS QKGICPKSPSAIPEQNHSLNDQAKGESHEYVTFSKLEDCHLQDRDLYPCPGTDCSRVF KQFKYLSVHLKAEHQNNDENAKHYLDMKNRREKCTYCRRHFMSAFHLREHEQVHCGPQ PYMCVSIDCYARFGSVNELLNHKQKHDDLRYKCELNGCNIVFSDLGQLYHHEAQHFRD ASYTCNFLGCKKFYYSKIEYQNHLSMHNVENSNGDIKKSVKLEESATGEKQDCINQPH LLNQTDKSHLPEDLFCAESANSQIDTETAENLKENSDSNSSDQLSHSSSASMNEELID TLDHSETMQDVLLSNEKVFGPSSLKEKCSSMAVCFDGTKFTCGFDGCGSTYKNARGMQ KHLRKVHPYHFKPKKIKTKDLFPSLGNEHNQTTEKLDAEPKPCSDTNSDSPDEGLDHN IHIKCKREHQGYSSESSICASKRPCTEDTMLELLLRLKHLSLKNSITHGSFSGSLQGY PSSGAKSLQSVSSISDLNFQNQDENMPSQYLAQLAAKPFFCELQGCKYEFVTREALLM HYLKKHNYSKEKVLQLTMFQHRYSPFQCHICQRSFTRKTHLRIHYKNKHQIGSDRATH KLLDNEKCDHEGPCSVDRLKGDCSAELGGDPSSNSEKPHCHPKKDECSSETDLESSCE ETESKTSDISSPIGSHREEQEGREGRGSRRTVAKGNLCYILNKYHKPFHCIHKTCNSS FTNLKGLIRHYRTVHQYNKEQLCLEKDKARTKRELVKCKKIFACKYKECNKRFLCSKA LAKHCSDSHNLDHIEEPKVLSEAGSAARFSCNQPQCPAVFYTFNKLKHHLMEQHNIEG EIHSDYEIHCDLNGCGQIFTHRSNYSQHVYYRHKDYYDDLFRSQKVANERLLRSEKVC QTADTQGHEHQTTRRSFNAKSKKCGLIKEKKAPISFKTRAEALHMCVEHSELSLYPCM VQGCLSVVKLESSIVRHYKRTHQMSSAYLEQQMENLVVCVKYGTKIKEEPPSEADPCI KKEENRSCESERTEHSHSPGDSSAPIQNTDCCHSSERDGGQKGCIESSSVFDADTLLY RGTLKCNHSSKTTSLEQCNIVQPPPPCKIENSIPNPNGTESGTYFTSFQLPLPRIKES ETRQHSSGQENTVKNPTHVPKENFRKHSQPRSFDLKTYKPMGFESSFLKFIQESEEKE DDFDDWEPSEHLTLSNSSQSSNDLTGNVVANNMVNDSEPEVDIPHSSSDSTIHENLTA IPPLIVAETTTVPSLENLRVVLDKALTDCGELALKQLHYLRPVVVLERSKFSTPILDL FPTKKTDELCVGSS" misc_feature 249 /gene="rlf" /note="end of region involved in rlf-L-myc rearrangements" exon 961..1101 /gene="rlf" /note="alternative exon" polyA_signal 6218..6223 polyA_site 6229 BASE COUNT 2022 a 1173 c 1304 g 1730 t ORIGIN 1 cgccgtggga agatggcgga cggaaaggga gacgccgccg ctgtcgccgg ggctggggct 61 gaggctccgg cggtagcggg agccggagat ggagtcgaga ctgagtccat ggttcggggt 121 catcgccccg tatctccagc gccgggagcc tcgggactgc ggccgtgtct gtggcagctg 181 gagacagagc tgagggagca agaggtgtcg gaggtctcat ctttgaacta ctgccggagc 241 ttctgccaga ctttattgca atatgcaagc aacaagaatg catcagaaca tattgtgtat 301 cttctggagg tatatcgact tgccatccaa agctttgcca gtgcacgtcc atacttaact 361 actgaatgtg aagatgtcct cttagtgctt ggcagattag tactgagttg tttcgaatta 421 ctgctttcag tgtctgaaag tgaactgcca tgtgaagtct ggctaccatt ccttcagtct 481 ctacaggagt cacatgatgc attattggaa tttgggaata ataacctaca aatattggtt 541 catgttacca aggaaggggt gtggaaaaac ccagttcttc ttaaaattct gtctcaacag 601 ccagtagaaa cggaggaagt caataaattg attgcacaag aaggaccttc ctttctgcaa 661 atgcgaataa aacatttgtt gaaatctaac tgcatccccc aggctactgc tttatcaaaa 721 ctatgtgcag aatctaaaga aatttcaaat gtgtcatctt ttcagcaagc ctatatcaca 781 tgtttatgtt ctatgctccc taatgaagat gctattaagg agattgcaaa ggtcgactgc 841 aaggaagtac tagacatcat ttgtaatctg gaatctgagg ggcaggataa cacagcattt 901 gttctttgta cgacttacct tacccagcag ctccaaactg caagtgtata ttgttcttgg 961 gaactgactc ttttttggag taaactgcaa agaagaattg acccttcttt agatactttt 1021 ttggagcgct gtcgtcagtt tggtgtcata gctaaaacgc agcagcattt attttgcctc 1081 attagagtta tacaaactga agcacaagat gctggtcttg gggtgtcaat tttactgtgt 1141 gtcagagctc ttcaactcag atcaagtgaa gatgaggaaa tgaaggcatc agtttgtaaa 1201 acaattgcct gtcttttacc agaagattta gaagttagac gagcctgtca gcttacagaa 1261 ttcttaattg aacccagttt ggatggattt aatatgttag aagaactata tttgcaacca 1321 gatcaaaaat ttgatgaaga aaatgcaccg gttccaaatt ctcttcgatg tgagctctta 1381 ctagctttaa aagcccactg gccttttgat cctgagtttt gggactggaa aactttaaaa 1441 cgacactgcc accaactttt aggacaagaa gcctcagatt ctgatgatga tttaagtggc 1501 tatgaaatgt ccattaatga cacagatgtt ttagagtcat ttctcagtga ctatgatgag 1561 ggtaaagaag ataaacaata tagaagaaga gatttgacag atcagcataa ggagaaaaga 1621 gacaaaaaac ctattggctc ttctgaaaga tatcagaggt ggcttcagta caagtttttc 1681 tgtttgttat gtaagcggga atgtatagag gctagaattc ttcatcattc taagatgcat 1741 atggaagatg gaatttacac ctgtccagtt tgtattaaaa aatttaagag aaaagaaatg 1801 tttgttcctc atgtgatgga gcatgttaaa atgccaccaa gcagaaggga ccgctctaaa 1861 aagaaattac tgttaaaagg ctctcaaaag ggtatttgtc ctaagagccc ctctgcaatc 1921 ccagagcaaa accattcatt gaatgaccaa gccaaaggag agtctcatga atacgtcaca 1981 ttcagcaaat tagaagattg ccacctgcaa gacagagatt tgtatccatg tcccggtaca 2041 gactgttccc gtgtgtttaa gcaatttaaa tacttaagtg tgcatcttaa agctgaacac 2101 caaaataatg atgaaaatgc caagcactac ttggatatga aaaatagaag agagaagtgt 2161 acttactgtc gacgacattt tatgtctgct tttcaccttc gagagcacga acaagtgcat 2221 tgtgggcctc agccttatat gtgtgtatct atagattgct atgctaggtt tggatcagta 2281 aatgaactac ttaaccataa acaaaagcat gacgatctgc gttacaaatg tgaattaaat 2341 ggctgtaata ttgttttcag tgacttggga cagctttacc accatgaagc acaacacttt 2401 agggatgcat cttacacatg caacttcctt ggctgtaaaa agttctatta ctccaaaatt 2461 gaataccaga atcacctctc aatgcataat gttgaaaatt caaatggaga cataaagaaa 2521 tcagtgaaac ttgaggagtc tgcaacaggt gaaaagcaag attgtattaa tcagccccat 2581 ctacttaacc aaactgataa atcacattta cctgaagatc ttttctgtgc agaatcagct 2641 aattctcaaa tagatacaga aactgcagaa aacctgaaag aaaacagtga cagtaattct 2701 agtgatcagt taagtcatag ctcttcagct tcaatgaatg aagagctaat tgacacacta 2761 gatcactctg aaactatgca ggatgtattg ttatctaatg agaaagtctt tgggccctcc 2821 agtttaaaag aaaaatgttc cagtatggca gtttgttttg acgggactaa gtttacctgt 2881 ggttttgatg gctgtggttc cacatacaaa aatgcaagag gaatgcagaa acatttacgg 2941 aaggttcatc cataccattt caagcccaaa aagataaaga cgaaagatct gtttccctct 3001 ttgggtaatg aacataatca gacaactgaa aagttggatg cagaacctaa accctgctca 3061 gatacaaaca gtgactcccc agatgaaggt ctagatcaca atattcacat taaatgtaaa 3121 cgagaacatc aaggttattc ctcagaatcc tccatttgtg cttctaaaag gccctgtaca 3181 gaggatacca tgttggaact tctgttacgc ttgaaacatt taagcttgaa aaactcaata 3241 acacatggat ctttctcagg gtcattgcag gggtacccat ccagtggtgc taagtctctt 3301 cagtcagttt catctatctc agaccttaat tttcagaatc aagatgaaaa catgccaagt 3361 cagtaccttg cacagttggc ggctaagccg tttttctgtg agcttcaagg atgcaaatat 3421 gaatttgtga ccagagaggc tctgttaatg cattatctta aaaagcataa ttattcaaaa 3481 gaaaaagtcc ttcagttaac catgttccaa catcggtatt ccccatttca gtgtcatatt 3541 tgccaaaggt catttacaag aaaaacacac cttaggattc attataaaaa taaacatcaa 3601 attggcagtg acagagcaac tcacaaacta ttagataatg aaaagtgtga tcatgaaggc 3661 ccatgttcag tagataggtt gaaaggtgat tgttctgcag aacttggagg tgatcccagt 3721 agtaactctg agaaaccaca ctgtcatcct aaaaaggatg aatgtagttc tgaaacagat 3781 ttggaatcat cttgtgaaga aacagaaagt aaaacatctg acatttcatc accaataggc 3841 agccatagag aagaacaaga aggaagagag ggcagaggta gcaggcgaac tgttgctaaa 3901 ggaaatctgt gttatatttt gaataaatac cacaaaccat tccattgtat tcataaaact 3961 tgcaactcct cattcaccaa tctaaaaggc ttaattcgcc attacagaac tgtacatcag 4021 tacaacaaag aacagttatg tttggagaaa gacaaagcaa gaaccaaaag ggaacttgtc 4081 aaatgtaaaa agatatttgc ttgcaaatat aaggaatgta ataaacgctt cctgtgttcc 4141 aaagctcttg ctaagcactg tagtgattct cataacctag accatattga agagcctaaa 4201 gtactttccg aagctggatc tgcagcaagg ttttcttgta accagcctca gtgccctgct 4261 gttttttata cattcaacaa gttgaagcac cacttgatgg aacagcataa tattgaaggg 4321 gaaatacatt cagattatga aattcattgt gatcttaatg gctgtggcca gattttcacc 4381 catcgcagta attactcaca acatgtatat taccgacata aagactatta tgatgatttg 4441 tttagaagcc agaaagtagc aaatgagaga ctactaagga gtgaaaaggt atgtcaaaca 4501 gctgatactc aggggcatga acatcagacc accaggagat catttaatgc taagtctaaa 4561 aaatgtggct taatcaaaga aaagaaagcc ccaataagtt ttaaaaccag agctgaggcc 4621 ctccatatgt gtgtggagca ctctgagctc tctctgtacc cctgcatggt tcaaggatgc 4681 ttatctgtgg tgaagttgga gagcagcatt gtgaggcatt acaaacgcac tcatcagatg 4741 agtagtgcct atttagagca acagatggag aatcttgttg tttgcgttaa gtacggtacc 4801 aaaattaagg aggaaccccc ttctgaagca gatccctgta taaagaaaga agaaaataga 4861 agctgtgaat cagagcgcac agaacacagc cattccccgg gtgacagtag tgcacccatc 4921 cagaacactg attgctgtca ttcaagtgaa agggatggag gtcagaaagg gtgcatagaa 4981 agcagctcag tatttgatgc agatactctg ctctacaggg gaactttgaa atgtaatcat 5041 agttccaaaa ccacttccct agaacagtgt aatatagttc agcctcctcc tccttgtaaa 5101 atagaaaatt ccatacctaa tcccaatggg actgaaagtg ggacttattt cacaagtttc 5161 cagctgcctt taccaaggat caaagaatca gaaactaggc agcatagttc agggcaagaa 5221 aacactgtaa aaaatccaac ccatgtccca aaagagaatt ttaggaaaca ttcacagccc 5281 cggtcatttg atttgaagac ttacaaacct atgggatttg aatcttcatt tctgaaattt 5341 attcaggaaa gtgaagagaa agaagatgat tttgatgatt gggagccttc agagcactta 5401 acattaagta attcttcaca gtccagtaat gatttaacag ggaatgttgt ggcaaataat 5461 atggtgaatg acagtgaacc tgaagttgac atacctcatt cttccagtga ctctacaatt 5521 catgagaacc tgactgcaat cccaccttta atagtagctg aaacaacaac agttccttcc 5581 ttggaaaacc tgagggttgt attggacaaa gcattaacag actgtggaga gcttgcctta 5641 aaacagcttc attatcttcg gccagtggtg gttcttgaaa gatctaagtt ttccacacca 5701 attttagact tatttccaac aaaaaagaca gatgagcttt gtgtaggaag ttcataagta 5761 gcaattttgt tttagtaaca gactggctcc aacactgcaa catggggaca tttgccaact 5821 cgaacaaagg ctgagaagca gccacaccgt tgtttagggt agaataggct gtgtatttac 5881 atgaatgtat aatatctatg tcagcagtat tggctgagtc cattagctct ccagttggtt 5941 taatgattgg gtttattttt gtttgtttgt ttattaaaaa aatggaactg tacacttgtt 6001 tggtgctaat taatacatca aaatatactg gggcttcctt tttcaaatta agtgtgcatg 6061 attgtatatg gaacaaatac taaggtccca gggtgggagg gctagggaaa gggatatgga 6121 gttcttactt gacttgaatg tgcacctgag ggtgctttgt gtaatatatt gtacactaca 6181 gcatcttata ttttttgagt tgagtttcaa taaattacaa tttttcacc // LOCUS HSU22386 1370 bp mRNA PRI 06-APR-1995 DEFINITION Human macrophage-colony stimulating factor gamma precursor mRNA, complete cds. ACCESSION U22386 NID g758780 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1370) AUTHORS Cerretti,D.P., Wignall,J., Anderson,D., Tushinski,R.J., Gallis,B.M., Stya,M., Gillis,S., Urdal,D.L. and Cosman,D. TITLE Human macrophage-colony stimulating factor: alternative RNA and protein processing from a single gene JOURNAL Mol. Immunol. 25 (8), 761-770 (1988) MEDLINE 89039923 REFERENCE 2 (bases 1 to 1370) AUTHORS Cerretti,D.P. TITLE Direct Submission JOURNAL Submitted (08-MAR-1995) Douglas P. Cerretti, Immunex Corporation, 51 University St., Seattle, WA 98101, USA FEATURES Location/Qualifiers source 1..1370 /organism="Homo sapiens" /db_xref="taxon:9606" /map="1p13-p21" /chromosome="1" CDS 45..1361 /note="m-csf gamma precursor" /codon_start=1 /product="macrophage-colony stimulating factor gamma precursor" /db_xref="PID:g758781" /translation="MTAPGAAGRCPPTTWLGSLLLLVCLLASRSITEEVSEYCSHMIG SGHLQSLQRLIDSQMETSCQITFEFVDQEQLKDPVCYLKKAFLLVQYIMEDTMRFRDN TPNAIAIVQLQELSLRLKSCFTKDYEEHDKACVRTFYETPLQLLEKVKNVFNETKNLL DKDWNIFSKNCNNSFAECSSQDVVTKPDCNCLYPKAIPSSDPASVSPHQPLAPSMAPV AGLTWEDSEGTEGSSLLPGEQPLHTVDPGSAKQRPPRSTCQSFEPPETPVVKDSTIGG SPQPRPSVGAFNPGMEDILDSAMGTNWVPEEASGEASEIPVPQGTELSPSRPGGGSMQ TEPARPSNFLSASSPLPASAKGQQPADVTGHERQSEGSSSPQLQESVFHLLVPSVILV LLAVGGLLFYRWRRRSHQEPQRADSPLEQPEGSPLTQDDRQVELPV" sig_peptide 45..140 mat_peptide 141..1358 /note="m-csf gamma" /product="macrophage-colony stimulating factor gamma" misc_feature 1182..1250 /note="encodes transmembrane domain" BASE COUNT 305 a 416 c 382 g 267 t ORIGIN 1 gaattcgggc ggccgacgcg cccggccggg acccagctgc ccgtatgacc gcgccgggcg 61 ccgccgggcg ctgccctccc acgacatggc tgggctccct gctgttgttg gtctgtctcc 121 tggcgagcag gagtatcacc gaggaggtgt cggagtactg tagccacatg attgggagtg 181 gacacctgca gtctctgcag cggctgattg acagtcagat ggagacctcg tgccaaatta 241 catttgagtt tgtagaccag gaacagttga aagatccagt gtgctacctt aagaaggcat 301 ttctcctggt acaatacata atggaggaca ccatgcgctt cagagataac acccccaatg 361 ccatcgccat tgtgcagctg caggaactct ctttgaggct gaagagctgc ttcaccaagg 421 attatgaaga gcatgacaag gcctgcgtcc gaactttcta tgagacacct ctccagttgc 481 tggagaaggt caagaatgtc tttaatgaaa caaagaatct ccttgacaag gactggaata 541 ttttcagcaa gaactgcaac aacagctttg ctgaatgctc cagccaagat gtggtgacca 601 agcctgattg caactgcctg taccccaaag ccatccctag cagtgacccg gcctctgtct 661 cccctcatca gcccctcgcc ccctccatgg cccctgtggc tggcttgacc tgggaggact 721 ctgagggaac tgagggcagc tccctcttgc ctggtgagca gcccctgcac acagtggatc 781 caggcagtgc caagcagcgg ccacccagga gcacctgcca gagctttgag ccgccagaga 841 ccccagttgt caaggacagc accatcggtg gctcaccaca gcctcgcccc tctgtcgggg 901 ccttcaaccc cgggatggag gatattcttg actctgcaat gggcactaat tgggtcccag 961 aagaagcctc tggagaggcc agtgagattc ccgtacccca agggacagag ctttccccct 1021 ccaggccagg agggggcagc atgcagacag agcccgccag acccagcaac ttcctctcag 1081 catcttctcc actccctgca tcagcaaagg gccaacagcc ggcagatgta actggccatg 1141 agaggcagtc cgagggatcc tccagcccgc agctccagga gtctgtcttc cacctgctgg 1201 tgcccagtgt catcctggtc ttgctggccg tcggaggcct cttgttctac aggtggaggc 1261 ggcggagcca tcaagagcct cagagagcgg attctccctt ggagcaacca gagggcagcc 1321 ccctgactca ggatgacaga caggtggaac tgccagtgta gagggaattc // LOCUS HSU22431 3678 bp mRNA PRI 28-JUN-1995 DEFINITION Human hypoxia-inducible factor 1 alpha (HIF-1 alpha) mRNA, complete cds. ACCESSION U22431 NID g881345 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3678) AUTHORS Wang,G.L., Jiang,B.H., Rue,E.A. and Semenza,G.L. TITLE Hypoxia-inducible factor 1 is a basic-helix-loop-helix-PAS heterodimer regulated by cellular O2 tension JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (12), 5510-5514 (1995) MEDLINE 95296340 REFERENCE 2 (bases 1 to 3678) AUTHORS Wang,G.L., Jiang,B.-H., Rue,E.A. and Semenza,G.L. TITLE Direct Submission JOURNAL Submitted (09-MAR-1995) Gregg L. Semenza, Center for Medical Genetics, The Johns Hopkins University School of Medicine, 600 N. Wolfe St., Baltimore, MD 21287-3914, USA FEATURES Location/Qualifiers source 1..3678 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Hep3B" /cell_type="hepatoblastoma" gene 29..2509 /gene="HIF-1 alpha" CDS 29..2509 /gene="HIF-1 alpha" /standard_name="hypoxia-inducible factor 1, alpha subunit" /note="basic helix-loop-helix transcription factor" /codon_start=1 /product="hypoxia-inducible factor 1 alpha" /db_xref="PID:g881346" /translation="MEGAGGANDKKKISSERRKEKSRDAARSRRSKESEVFYELAHQL PLPHNVSSHLDKASVMRLTISYLRVRKLLDAGDLDIEDDMKAQMNCFYLKALDGFVMV LTDDGDMIYISDNVNKYMGLTQFELTGHSVFDFTHPCDHEEMREMLTHRNGLVKKGKE QNTQRSFFLRMKCTLTSRGRTMNIKSATWKVLHCTGHIHVYDTNSNQPQCGYKKPPMT CLVLICEPIPHPSNIEIPLDSKTFLSRHSLDMKFSYCDERITELMGYEPEELLGRSIY EYYHALDSDHLTKTHHDMFTKGQVTTGQYRMLAKRGGYVWVETQATVIYNTKNSQPQC IVCVNYVVSGIIQHDLIFSLQQTECVLKPVESSDMKMTQLFTKVESEDTSSLFDKLKK EPDALTLLAPAAGDTIISLDFGSNDTETDDQQLEEVPLYNDVMLPSPNEKLQNINLAM SPLPTAETPKPLRSSADPALNQEVALKLEPNPESLELSFTMPQIQDQTPSPSDGSTRQ SSPEPNSPSEYCFYVDSDMVNEFKLELVEKLFAEDTEAKNPFSTQDTDLDLEMLAPYI PMDDDFQLRSFDQLSPLESSSASPESASPQSTVTVFQQTQIQEPTANATTTTATTDEL KTVTKDRMEDIKILIASPSPTHIHKETTSATSSPYRDTQSRTASPNRAGKGVIEQTEK SHPRSPNVLSVALSQRTTVPEEELNPKILALQNAQRKRKMEHDGSLFQAVGIGTLLQQ PDDHAATTSLSWKRVKGCKSSEQNGMEQKTIILIPSDLACRLLGQSMDESGLPQLTSY DCEVNAPIQGSRNLLQGEELLRALDQVN" polyA_site 3678 /note="42 A nucleotides" BASE COUNT 1197 a 695 c 675 g 1111 t ORIGIN 1 gtgaagacat cgcggggacc gattcaccat ggagggcgcc ggcggcgcga acgacaagaa 61 aaagataagt tctgaacgtc gaaaagaaaa gtctcgagat gcagccagat ctcggcgaag 121 taaagaatct gaagtttttt atgagcttgc tcatcagttg ccacttccac ataatgtgag 181 ttcgcatctt gataaggcct ctgtgatgag gcttaccatc agctatttgc gtgtgaggaa 241 acttctggat gctggtgatt tggatattga agatgacatg aaagcacaga tgaattgctt 301 ttatttgaaa gccttggatg gttttgttat ggttctcaca gatgatggtg acatgattta 361 catttctgat aatgtgaaca aatacatggg attaactcag tttgaactaa ctggacacag 421 tgtgtttgat tttactcatc catgtgacca tgaggaaatg agagaaatgc ttacacacag 481 aaatggcctt gtgaaaaagg gtaaagaaca aaacacacag cgaagctttt ttctcagaat 541 gaagtgtacc ctaactagcc gaggaagaac tatgaacata aagtctgcaa catggaaggt 601 attgcactgc acaggccaca ttcacgtata tgataccaac agtaaccaac ctcagtgtgg 661 gtataagaaa ccacctatga cctgcttggt gctgatttgt gaacccattc ctcacccatc 721 aaatattgaa attcctttag atagcaagac tttcctcagt cgacacagcc tggatatgaa 781 attttcttat tgtgatgaaa gaattaccga attgatggga tatgagccag aagaactttt 841 aggccgctca atttatgaat attatcatgc tttggactct gatcatctga ccaaaactca 901 tcatgatatg tttactaaag gacaagtcac cacaggacag tacaggatgc ttgccaaaag 961 aggtggatat gtctgggttg aaactcaagc aactgtcata tataacacca agaattctca 1021 accacagtgc attgtatgtg tgaattacgt tgtgagtggt attattcagc acgacttgat 1081 tttctccctt caacaaacag aatgtgtcct taaaccggtt gaatcttcag atatgaaaat 1141 gactcagcta ttcaccaaag ttgaatcaga agatacaagt agcctctttg acaaacttaa 1201 gaaggaacct gatgctttaa ctttgctggc cccagccgct ggagacacaa tcatatcttt 1261 agattttggc agcaacgaca cagaaactga tgaccagcaa cttgaggaag taccattata 1321 taatgatgta atgctcccct cacccaacga aaaattacag aatataaatt tggcaatgtc 1381 tccattaccc accgctgaaa cgccaaagcc acttcgaagt agtgctgacc ctgcactcaa 1441 tcaagaagtt gcattaaaat tagaaccaaa tccagagtca ctggaacttt cttttaccat 1501 gccccagatt caggatcaga cacctagtcc ttccgatgga agcactagac aaagttcacc 1561 tgagcctaat agtcccagtg aatattgttt ttatgtggat agtgatatgg tcaatgaatt 1621 caagttggaa ttggtagaaa aactttttgc tgaagacaca gaagcaaaga acccattttc 1681 tactcaggac acagatttag acttggagat gttagctccc tatatcccaa tggatgatga 1741 cttccagtta cgttccttcg atcagttgtc accattagaa agcagttccg caagccctga 1801 aagcgcaagt cctcaaagca cagttacagt attccagcag actcaaatac aagaacctac 1861 tgctaatgcc accactacca ctgccaccac tgatgaatta aaaacagtga caaaagaccg 1921 tatggaagac attaaaatat tgattgcatc tccatctcct acccacatac ataaagaaac 1981 tactagtgcc acatcatcac catatagaga tactcaaagt cggacagcct caccaaacag 2041 agcaggaaaa ggagtcatag aacagacaga aaaatctcat ccaagaagcc ctaacgtgtt 2101 atctgtcgct ttgagtcaaa gaactacagt tcctgaggaa gaactaaatc caaagatact 2161 agctttgcag aatgctcaga gaaagcgaaa aatggaacat gatggttcac tttttcaagc 2221 agtaggaatt ggaacattat tacagcagcc agacgatcat gcagctacta catcactttc 2281 ttggaaacgt gtaaaaggat gcaaatctag tgaacagaat ggaatggagc aaaagacaat 2341 tattttaata ccctctgatt tagcatgtag actgctgggg caatcaatgg atgaaagtgg 2401 attaccacag ctgaccagtt atgattgtga agttaatgct cctatacaag gcagcagaaa 2461 cctactgcag ggtgaagaat tactcagagc tttggatcaa gttaactgag ctttttctta 2521 atttcattcc tttttttgga cactggtggc tcactaccta aagcagtcta tttatatttt 2581 ctacatctaa ttttagaagc ctggctacaa tactgcacaa acttggttag ttcaattttt 2641 gatccccttt ctacttaatt tacattaatg ctctttttta gtatgttctt taatgctgga 2701 tcacagacag ctcattttct cagttttttg gtatttaaac cattgcattg cagtagcatc 2761 attttaaaaa atgcaccttt ttatttattt atttttggct agggagttta tccctttttc 2821 gaattatttt taagaagatg ccaatataat ttttgtaaga aggcagtaac ctttcatcat 2881 gatcataggc agttgaaaaa tttttacacc ttttttttca cattttacat aaataataat 2941 gctttgccag cagtacgtgg tagccacaat tgcacaatat attttcttaa aaaataccag 3001 cagttactca tggaatatat tctgcgttta taaaactagt ttttaagaag aaattttttt 3061 tggcctatga aattgttaaa cctggaacat gacattgtta atcatataat aatgattctt 3121 aaatgctgta tggtttatta tttaaatggg taaagccatt tacataatat agaaagatat 3181 gcatatatct agaaggtatg tggcatttat ttggataaaa ttctcaattc agagaaatca 3241 tctgatgttt ctatagtcac tttgccagct caaaagaaaa caatacccta tgtagttgtg 3301 gaagtttatg ctaatattgt gtaactgata ttaaacctaa atgttctgcc taccctgttg 3361 gtataaagat attttgagca gactgtaaac aagaaaaaaa aaatcatgca ttcttagcaa 3421 aattgcctag tatgttaatt tgctcaaaat acaatgtttg attttatgca ctttgtcgct 3481 attaacatcc tttttttcat gtagatttca ataattgagt aattttagaa gcattatttt 3541 aggaatatat agttgtcaca gtaaatatct tgttttttct atgtacattg tacaaatttt 3601 tcattccttt tgctctttgt ggttggatct aacactaact gtattgtttt gttacatcaa 3661 ataaacatct tctgtgga // LOCUS HSU22491 1596 bp DNA PRI 06-SEP-1995 DEFINITION Human G protein-coupled receptor (GPR7), complete cds. ACCESSION U22491 NID g953232 KEYWORDS Opioid; somatostatin; intronless. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1596) AUTHORS O'Dowd,B.F., Scheideler,M.A., Nguyen,T., Cheng,R., Rasmussen,J.S., Zastawny,R., Marchese,A., Heng,H.H.Q., Tsui,L.-C., Shi,X., Asa,S., Puy,L. and George,S.R. TITLE The cloning and chromosomal mapping of two novel human opioid-somatostatin-like receptor genes, GPR7 and GPR8, expressed in discrete areas of the brain JOURNAL Genomics 28 (1), 84-91 (1995) MEDLINE 96070436 REFERENCE 2 (bases 1 to 1596) AUTHORS O'Dowd,B. TITLE Direct Submission JOURNAL Submitted (13-MAR-1995) Brian O'Dowd, Pharmacology, University of Toronto, 8 Taddle Creek, Toronto, ON M5S 1A8, Canada FEATURES Location/Qualifiers source 1..1596 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="10" /map="10q11.2-q21.1" gene 526..1512 /gene="GPR7" CDS 526..1512 /gene="GPR7" /codon_start=1 /product="G protein-coupled receptor" /db_xref="PID:g953233" /translation="MDNASFSEPWPANASGPDPALSCSNASTLAPLPAPLAVAVPVVY AVICAVGLAGNSAVLYVLLRAPRMKTVTNLFILNLAIADELFTLVLPINIADFLLRQW PFGELMCKLIVAIDQYNTFSSLYFLTVMSADRYLVVLATAESRRVAGRTYSAARAVSL AVWGIVTLVVLPFAVFARLDDEQGRRQCVLVFPQPEAFWWRASRLYTLVLGFAIPVST ICVLYTTLLCRLHAMRLDSHAKALERAKKRVTFLVVAILAVCLLCWTPYHLSTVVALT TDLPQTPLVIAISYFITSLTYANSCLNPFLYAFLDASFRRNLRQLITCRAAA" BASE COUNT 253 a 550 c 477 g 316 t ORIGIN 1 attctgcaga tatccatcac actggcggcc gctcgagcat gcatctagag cttgccactg 61 cgggattctg tggggtaacc tgggtctacg gaagtttcct gaaagagggg agaagggttt 121 gcatttttcc tatggaggat tcttctctct ctagcatttc gtttgatgta ttcaactggt 181 agaagtgaga tttcaacagg tagcagagag cgctcacgtg gaggaggttt ggggcgccgc 241 ggcacccccc acccctcctc gggaccgcgc ctatttctaa agttacacgt cgacgaacta 301 acctatgctt taaattcctc tttccgaccc cgtgagtccg cggcgacatt gggccgtggg 361 gtggctggga acggtcccct cctccggaaa aaccagagaa cggcttggag agctggaaac 421 gagcgtccgc gagcaggtcc gtgcagaacc gggcttcagg accgctgagc tccgtagggc 481 gtccttgggg gacgccaggt cgccggctcc tctgccctcg ttgagatgga caacgcctcg 541 ttctcggagc cctggcccgc caacgcatcg ggcccggacc cggcgctgag ctgctccaac 601 gcgtcgactc tggcgccgct gccggcgccg ctggcggtgg ctgtaccagt tgtctacgcg 661 gtgatctgcg ccgtgggtct ggcgggcaac tccgccgtgc tgtacgtgtt gctgcgggcg 721 ccccgcatga agaccgtcac caacctgttc atcctcaacc tggccatcgc cgacgagctc 781 ttcacgctgg tgctgcccat caacatcgcc gacttcctgc tgcggcagtg gcccttcggg 841 gagctcatgt gcaagctcat cgtggctatc gaccagtaca acaccttctc cagcctctac 901 ttcctcaccg tcatgagcgc cgaccgctac ctggtggtgt tggccactgc ggagtcgcgc 961 cgggtggccg gccgcaccta cagcgccgcg cgcgcggtga gcctggccgt gtgggggatc 1021 gtcacactcg tcgtgctgcc cttcgcagtc ttcgcccggc tagacgacga gcagggccgg 1081 cgccagtgcg tgctagtctt tccgcagccc gaggccttct ggtggcgcgc gagccgcctc 1141 tacacgctcg tgctgggctt cgccatcccc gtgtccacca tctgtgtcct ctataccacc 1201 ctgctgtgcc ggctgcatgc catgcggctg gacagccacg ccaaggccct ggagcgcgcc 1261 aagaagcggg tgaccttcct ggtggtggca atcctggcgg tgtgcctcct ctgctggacg 1321 ccctaccacc tgagcaccgt ggtggcgctc accaccgacc tcccgcagac gccgctggtc 1381 atcgctatct cctacttcat caccagcctg acgtacgcca acagctgcct caaccccttc 1441 ctctacgcct tcctggacgc cagcttccgc aggaacctcc gccagctgat aacttgccgc 1501 gcggcagcct gactccccca gcgtccggct ccgcaactgc gcgccactcc tggccagcga 1561 gggaggagcc ggcgccagag tgcgggacca gacagg // LOCUS HSU22492 1518 bp DNA PRI 06-SEP-1995 DEFINITION Human G protein-coupled receptor gene (GPR8), complete cds. ACCESSION U22492 NID g953234 KEYWORDS Opioid, somatostatin, intronless. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1518) AUTHORS O'Dowd,B.F., Scheideler,M.A., Nguyen,T., Cheng,R., Rasmussen,J.S., Zastawny,R., Marchese,A., Heng,H.H.Q., Tsui,L.-C., Shi,X., Asa,S., Puy,L. and George,S.R. TITLE The cloning and chromosomal mapping of two novel human opioid-somatostatin-like receptor genes, GPR7 and GPR8, expressed in discrete areas of the brain JOURNAL Genomics 28 (1), 84-91 (1995) MEDLINE 96070436 REFERENCE 2 (bases 1 to 1518) AUTHORS O'Dowd,B.F., Scheideler,M.A., Nguyen,T., Cheng,R., Rasmussen,J.S., Zastawny,R., Marchese,A., Heng,H.H.Q., Tsui,L.-C., Shi,X., Asa,S., Puy,L. and George,S.R. TITLE Direct Submission JOURNAL Submitted (13-MAR-1995) Brian O'Dowd, Pharmacology, University of Toronto, 8 Taddle Creek, Toronto, ON M5S 1A8, Canada FEATURES Location/Qualifiers source 1..1518 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="20" /map="20q13.3" gene 349..1350 /gene="GPR8" CDS 349..1350 /gene="GPR8" /codon_start=1 /product="G protein-coupled receptor" /db_xref="PID:g953235" /translation="MQAAGHPEPLDSRGSFSLPTMGANVSQDNGTGHNATFSEPLPFL YVLLPAVYSGICAVGLTGNTAVILVILRAPKMKTVTNVFILNLAVADGLFTLVLPVNI AEHLLQYWPFGELLCKLVLAVDHYNIFSSIYFLAVMSVDRYLVVLATVRSRHMPWRTY RGAKVASLCVWLGVTVLVLPFFSFAGVYSNELQVPSCGLSFPWPERVWFKASRVYTLV LGFVLPVCTICVLYTDLLRRLRAVRLRSGAKALGKARRKVTVLVLVVLAVCLLCWTPF HLASVVALTTDLPQTPLVISMSYVITSLTYANSCLNPFLYAFLDDNFRKNFRSILRC" BASE COUNT 269 a 554 c 382 g 313 t ORIGIN 1 tccactagta acggccgcca ggatccacat ctcttcccag gagggtggcc agcagctgct 61 ctctgcggga ggagggaact gatctgctga agtctcacca ggaagaggcg ggaaggcccc 121 cacacacccc accaggctcc ctctggcccc atgtccttga cctggcaaag tggccgcagt 181 ctctgccaga gaacctggag tggctgtgcc taacagacgg ctggatctca aagtctctgg 241 ttgtttttct ttcctagaat ccagcctaag gaggccccca accagatacc caactccaag 301 gcacctccca cctgcccagg gcgcaaatcg tcaacggtcc cagctacaat gcaggccgct 361 gggcacccag agccccttga cagcaggggc tccttctccc tccccacgat gggtgccaac 421 gtctctcagg acaatggcac tggccacaat gccaccttct ccgagccact gccgttcctc 481 tatgtgctcc tgcccgccgt gtactccggg atctgtgctg tggggctgac tggcaacacg 541 gccgtcatcc ttgtaatcct aagggcgccc aagatgaaga cggtgaccaa cgtgttcatc 601 ctgaacctgg ccgtcgccga cgggctcttc acgctggtac tgcccgtcaa catcgcggag 661 cacctgctgc agtactggcc cttcggggag ctgctctgca agctggtgct ggccgtcgac 721 cactacaaca tcttctccag catctacttc ctagccgtga tgagcgtgga ccgatacctg 781 gtggtgctgg ccaccgtgag gtcccgccac atgccctggc gcacctaccg gggggcgaag 841 gtcgccagcc tgtgtgtctg gctgggcgtc acggtcctgg ttctgccctt cttctctttc 901 gctggcgtct acagcaacga gctgcaggtc ccaagctgtg ggctgagctt cccgtggccc 961 gagcgggtct ggttcaaggc cagccgtgtc tacactttgg tcctgggctt cgtgctgccc 1021 gtgtgcacca tctgtgtgct ctacacagac ctcctgcgca ggctgcgggc cgtgcggctc 1081 cgctctggag ccaaggctct aggcaaggcc aggcggaagg tgaccgtcct ggtcctcgtc 1141 gtgctggccg tgtgcctcct ctgctggacg cccttccacc tggcctctgt cgtggccctg 1201 accacggacc tgccccagac cccactggtc atcagtatgt cctacgtcat caccagcctc 1261 acgtacgcca actcgtgcct gaaccccttc ctctacgcct ttctagatga caacttccgg 1321 aagaacttcc gcagcatatt gcggtgctga agggcctggg caccatcatc cccatcatca 1381 tcatcacccc catcatcatc acccccacca ttacccccat cgtcacgccc atcatcacgc 1441 ccatcatcac cccccatcat cacccccatc atcatgccca tcatcacccc ccatcatcat 1501 catgcccacc cctcatca // LOCUS HSU22526 3206 bp mRNA PRI 24-AUG-1995 DEFINITION Human 2,3-oxidosqualene-lanosterol cyclase mRNA, complete cds. ACCESSION U22526 NID g951313 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3206) AUTHORS Baker,C.H., Matsuda,S.P., Liu,D.R. and Corey,E.J. TITLE Molecular cloning of the human gene encoding lanosterol synthase from a liver cDNA library JOURNAL Biochem. Biophys. Res. Commun. 213 (1), 154-160 (1995) MEDLINE 95366991 REFERENCE 2 (bases 1 to 3206) AUTHORS Baker,C.H. TITLE Direct Submission JOURNAL Submitted (14-MAR-1995) Charles H. Baker, Chemistry, Harvard University, 12 Oxford Street, Cambridge, MA 02138, USA FEATURES Location/Qualifiers source 1..3206 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Clontech catalog number HL1115a" /sex="male" /tissue_type="liver" /dev_stage="adult" CDS 152..2350 /EC_number="5.4.99.7" /note="lanosterol synthase" /codon_start=1 /function="cholesterol biosynthesis" /product="2,3-oxidosqualene-lanosterol cyclase" /db_xref="PID:g951314" /translation="MTEGTCLRRRGGPYKTEPATDLGRWRLNCERGRQTWTYLQDERA GREQTGLEAYALGLDTKNYFKDLPKAHTAFEGALNGMTFYVGLQAEDGHWTGDYGGPL FLLPGLLITCHVARIPLPAGYREEIVRYLRSVQLPDGGWGLHIEDKSTVFGTALNYVS LRILGVGPDDPDLVRARNILHKKGGAVAIPSWGKFWLAVLNVYSWEGLNTLFPEMWLF PDWAPAHPSTLWCHCRQVYLPMSYCYAVRLSAAEDPLVQSLRQELYVEDFASIDWLAQ RNNVAPDELYTPHSWLLRVVYALLNLYEHHHSAHLRQRAVQKLYEHIVADDRFTKSIS IGPISKTINMLVRWYVDGPASTAFQEHVSRIPDYLWMGLDGMKMQGTNGSQIWDTAFA IQALLEAGGHHRPEFSSCLQKAHEFLRLSQVPDNPPDYQKYYRQMRKGGFSFSTLDCG WIVSDCTAEALKAVLLLQEKCPHVTEHIPRERLCDAVAVLLNMRNPDGGFATYETKRG GHLLELLNPSEVFGDIMIDYTYVECTSAVMQALKYFHKRFPEHRAAEIRETLTQGLEF CRRQQRADGSWEGSWGVCFTYGTWFGLEAFACMGQTYRDGTACAEVSRACDFLLSRQM ADGGWGEDFESCEERRYLQSAQSQIHNTCWAMMGLMAVRHPDIEAQERGVRCLLEKQL PNGDWPQENIAGVFNKSCAISYTSYRNIFPIWALGRFSQLYPERALAGHP" BASE COUNT 617 a 919 c 1005 g 665 t ORIGIN 1 cccttgccta ctgctcatgg gtgtggagac tgatattctg gaagactgat aggcagattt 61 actattaaca aacacatagt ctgtggccca gcaaagccac cccaatccct gcacaagggt 121 aaaaggccag cattagagca ctgcagcagc aatgacggag ggcacgtgtc tgcggcgccg 181 agggggcccc tacaagaccg agcccgccac cgacctcggc cgctggcgac tcaactgcga 241 gaggggccgg cagacgtgga cctacctgca ggacgagcgc gccggccgcg agcagaccgg 301 cctggaagcc tacgccctgg ggctggacac caagaattac tttaaggact tgcccaaagc 361 ccacaccgcc tttgaggggg ctctgaacgg gatgacattt tacgtggggc tgcaggctga 421 ggatgggcac tggacgggtg attatggtgg cccacttttc ctcctgccag gcctcctgat 481 cacttgccac gtggcacgca tccctctgcc agccggatac agagaagaga ttgtgcggta 541 cctgcggtca gtgcagctcc ctgacggtgg ctggggcctg cacattgagg ataagtccac 601 cgtgtttggg actgcgctca actatgtgtc tctcagaatt ctgggtgttg ggcctgacga 661 tcctgacctg gtacgagccc ggaacattct tcacaagaaa ggtggtgctg tggccatccc 721 ctcctggggg aagttctggc tggctgtcct gaatgtttac agctgggaag gcctcaatac 781 cctgttccca gagatgtggc tgtttcctga ctgggcaccg gcacacccct ccacactctg 841 gtgccactgc cggcaggtgt acctgcccat gagctactgc tacgccgttc ggctgagtgc 901 cgcggaagac ccgctggtcc agagcctccg ccaggagctc tatgtggagg acttcgccag 961 cattgactgg ctggcgcaga ggaacaacgt ggcccccgac gagctgtaca cgccccacag 1021 ctggctgctc cgcgtggtat atgcgctcct caacctgtat gagcaccacc acagtgccca 1081 cctgcggcag cgggccgtgc agaagctgta tgaacacatt gtggccgacg accgattcac 1141 caagagcatc agcatcggcc cgatctcgaa aaccatcaac atgcttgtgc gctggtatgt 1201 ggacgggccc gcctccactg ccttccagga gcatgtctcc agaatcccgg actatctctg 1261 gatgggcctt gacggcatga aaatgcaggg caccaacggc tcacagatct gggacaccgc 1321 attcgccatc caggctctgc ttgaggcggg cgggcaccac aggcccgagt tttcgtcctg 1381 cctgcagaag gctcatgagt tcctgaggct ctcacaggtc ccagataacc ctcccgacta 1441 ccagaagtac taccgccaga tgcgcaaggg tggcttctcc ttcagtacgc tggactgcgg 1501 ctggatcgtt tctgactgca cggctgaggc cttgaaggct gtgctgctcc tgcaggagaa 1561 gtgtccccat gtcaccgagc acatccccag agaacggctc tgcgatgctg tggctgtgct 1621 gctgaacatg agaaatccag atggagggtt cgccacctat gagaccaagc gtggggggca 1681 cttgctggag ctgctgaacc cctcggaggt cttcggggac atcatgattg actacaccta 1741 tgtggagtgc acctcagccg tgatgcaggc gcttaagtat ttccacaagc gtttcccgga 1801 gcacagggca gcggagatcc gggagaccct cacgcagggc ttagagttct gtcggcggca 1861 gcagagggcc gatggctcct gggaaggctc ctggggagtt tgcttcacct acggcacctg 1921 gtttggcctg gaggccttcg cctgtatggg gcagacctac cgagatggga ctgcctgtgc 1981 agaggtctcc cgggcctgtg acttcctgct gtcccggcag atggcagacg gaggctgggg 2041 ggaggacttt gagtcctgcg aggagcggcg ttatttgcag agtgcccagt cccagatcca 2101 taacacatgc tgggccatga tggggctgat ggccgttcgg catcctgaca tcgaggccca 2161 ggagagagga gtccggtgtc tacttgagaa acagctcccc aatggcgact ggccgcagga 2221 aaacattgct ggggtcttca acaagtcctg tgccatctcc tacacgagct acaggaacat 2281 cttccccatc tgggccctcg gccgcttctc ccagctgtac cctgagagag cccttgctgg 2341 ccacccctga gaacatgcct acctgctggg tgccgtctgt gcgttccagt gaggccaagg 2401 ggtcctggcc gggttgggga gccctcccat aaccctgtct tgggctccaa cccctcaacc 2461 tctatctcat agatgtgaat ctgggggcca ggctggaggc agggatgggg acagggtggg 2521 tggcttagac tcttgatttt tactgtaggt tcatttctga aagtagcttg tcgggcttgg 2581 gtgaggaagg gggcacagga gccgtgaccc ctgaggaggc acagcgcctt ctgccacctc 2641 tgggcacggc ctcaaggtag tgaggctagg aggttttttc tgaccaatag ctgagttctt 2701 gggagaggag cagctgtgcc tgtgtgattc cttagtgtcg agtgggctct gggctggggt 2761 cggccctggg caggcttctc ctgcaccttt tgtctgctgg gctgagggac acgagggcaa 2821 ccctgtgaca atggcaggta gtgtgcatcc gtgaatagcc cagtgcgggg gttgctcatg 2881 gagcatcctg aggccgtgca gcagggagcc ccatgcccct gggtcgtgag cttgcctgcg 2941 tatggggtgg tgtcatggag cctcatgccc ctgggtcgtg agctcgcctg agtatggggt 3001 ggtgtcatgg agccgcatac ccctgggttg tgagctcgcc tgcatatgca gggtctgtca 3061 tggaacatcc caagtctgtg cagcagggag ccccatgccc ctgggacatg aacccacctg 3121 cgtggaatgc tgtttgtgag gtgtctacag ggtttatagt agtcttgtgg acacagaaat 3181 gcacagggga cacttacgga cacaga // LOCUS HSU22662 1528 bp mRNA PRI 27-JAN-1996 DEFINITION Human nuclear orphan receptor LXR-alpha mRNA, complete cds. ACCESSION U22662 NID g726512 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1528) AUTHORS Willy,P.J., Umesono,K., Ong,E.S., Evans,R.M., Heyman,R.A. and Mangelsdorf,D.J. TITLE LXR, a nuclear receptor that defines a distinct retinoid response pathway JOURNAL Genes Dev. 9 (9), 1033-1045 (1995) MEDLINE 95262897 REFERENCE 2 (bases 1 to 1528) AUTHORS Mangelsdorf,D.J. TITLE Direct Submission JOURNAL Submitted (14-MAR-1995) David J. Mangelsdorf, Howard Hughes Medical Institute, University of Texas Southwestern Medical Center at Dallas, 5323 Harry Hines Blvd., Dallas, TX 75235-9050, USA FEATURES Location/Qualifiers source 1..1528 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pXR2-deltaRV" /tissue_type="liver" /dev_stage="adult" CDS 36..1379 /codon_start=1 /product="nuclear orphan receptor LXR-alpha" /db_xref="PID:g726513" /translation="MSLWLGAPVPDIPPDSAVELWKPGAQDASSQAQGGSSCILREEA RMPHSAGGTAGVGLEAAEPTALLTRAEPPSEPTEIRPQKRKKGPAPKMLGNELCSVCG DKASGFHYNVLSCEGCKGFFRRSVIKGAHYICHSGGHCPMDTYMRRKCQECRLRKCRQ AGMREECVLSEEQIRLKKLKRQEEEQAHATSLPPRRSSPPQILPQLSPEQLGMIEKLV AAQQQCNRRSFSDRLRVTPWPMAPDPHSREARQQRFAHFTELAIVSVQEIVDFAKQLP GFLQLSREDQIALLKTSAIEVMLLETSRRYNPGSESITFLKDFSYNREDFAKAGLQVE FINPIFEFSRAMNELQLNDAEFALLIAISIFSADRPNVQDQLQVERLQHTYVEALHAY VSIHHPHDRLMFPRMLMKLVSLRTLSSVHSEQVFALRLQDKKLPPLLSEIWDVHE" polyA_site 1528 /note="33 A nucleotides" BASE COUNT 340 a 454 c 439 g 295 t ORIGIN 1 cagtgccttg gtaatgacca gggctccaga aagagatgtc cttgtggctg ggggcccctg 61 tgcctgacat tcctcctgac tctgcggtgg agctgtggaa gccaggcgca caggatgcaa 121 gcagccaggc ccagggaggc agcagctgca tcctcagaga ggaagccagg atgccccact 181 ctgctggggg tactgcaggg gtggggctgg aggctgcaga gcccacagcc ctgctcacca 241 gggcagagcc cccttcagaa cccacagaga tccgtccaca aaagcggaaa aaggggccag 301 cccccaaaat gctggggaac gagctatgca gcgtgtgtgg ggacaaggcc tcgggcttcc 361 actacaatgt tctgagctgc gagggctgca agggattctt ccgccgcagc gtcatcaagg 421 gagcgcacta catctgccac agtggcggcc actgccccat ggacacctac atgcgtcgca 481 agtgccagga gtgtcggctt cgcaaatgcc gtcaggctgg catgcgggag gagtgtgtcc 541 tgtcagaaga acagatccgc ctgaagaaac tgaagcggca agaggaggaa caggctcatg 601 ccacatcctt gccccccagg cgttcctcac ccccccaaat cctgccccag ctcagcccgg 661 aacaactggg catgatcgag aagctcgtcg ctgcccagca acagtgtaac cggcgctcct 721 tttctgaccg gcttcgagtc acgccttggc ccatggcacc agatccccat agccgggagg 781 cccgtcagca gcgctttgcc cacttcactg agctggccat cgtctctgtg caggagatag 841 ttgactttgc taaacagcta cccggcttcc tgcagctcag ccgggaggac cagattgccc 901 tgctgaagac ctctgcgatc gaggtgatgc ttctggagac atctcggagg tacaaccctg 961 ggagtgagag tatcaccttc ctcaaggatt tcagttataa ccgggaagac tttgccaaag 1021 cagggctgca agtggaattc atcaacccca tcttcgagtt ctccagggcc atgaatgagc 1081 tgcaactcaa tgatgccgag tttgccttgc tcattgctat cagcatcttc tctgcagacc 1141 ggcccaacgt gcaggaccag ctccaggtgg agaggctgca gcacacatat gtggaagccc 1201 tgcatgccta cgtctccatc caccatcccc atgaccgact gatgttccca cggatgctaa 1261 tgaaactggt gagcctccgg accctgagca gcgtccactc agagcaagtg tttgcactgc 1321 gtctgcagga caaaaagctc ccaccgctgc tctctgagat ctgggatgtg cacgaatgac 1381 tgttctgtcc ccatattttc tgttttcttg gccggatggc tgaggcctgg tggctgcctc 1441 ctagaagtgg aacagactga gaagggcaaa cattcctggg agctgggcaa ggagatcctc 1501 ccgtggcatt aaaagagagt caaagggt // LOCUS HSU22680 6635 bp mRNA PRI 27-APR-1996 DEFINITION Human X2 box repressor mRNA, complete cds. ACCESSION U22680 NID g1142656 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6635) AUTHORS Scholl,T., Stevens,M.B., Mahanta,S. and Strominger,J.L. TITLE A zinc finger protein that represses transcription of the human MHC class II gene, DPA JOURNAL J. Immunol. 156 (4), 1448-1457 (1996) MEDLINE 96164573 REFERENCE 2 (bases 1 to 6635) AUTHORS Scholl,T., Stevens,M.B., Mahanta,S.K. and Strominger,J.L. TITLE Direct Submission JOURNAL Submitted (14-MAR-1995) Thomas Scholl, Biochemistry, Harvard University, 7 Divinity Avenue, Cambridge, MA 02138, USA FEATURES Location/Qualifiers source 1..6635 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="Daudi, B cell" CDS 166..3432 /function="transcriptional repressor of MHC class II gene DPA" /note="XBR; zinc-finger protein" /codon_start=1 /product="X2 box repressor" /db_xref="PID:g1142657" /translation="MATQVMGQSSGGGGLFTSSGNIGMALPNDMYDLHDLSKAELAAP QLIMLANVALTGEVNGSCCDYLVGEERQMAELMPVGDNNFSDSEEGEGLEESADIKGE PHGLENMELRSLELSVVEPQPVFEASGAPDIYSSNKDLPPETPGAEDKGKSSKTKPFR CKPCQYEAESEEQFVHHIRVHSAKKFFVGRECREAGKSRESLFHCRRGRFLKGPIRCD RCGYNTNRYDHYTAHLKHHTRAGDNERVYKCIICTYTTVSEYHWRKHLRNHFPRKVYT CGKCNYFSDRKNNYVQHVRTHTGERPYKCELCPYSSSQKTHLTRHMRTHSGEKPFKCD QCSYVASNQHEVTRHARQVHNGPKPLNCPHCDYKTADRSNFKKHVELHVNPRQFNCPV CDYAASKKCNLQYHFKSKHPTCPNKTMDVSKVKLKKTKKREADLPDNITNEKTEIEQT KIKGDVAGKKNEKSVKAEKRDVSKEKKPSNNVSVIQVTTRTRKSVTEVKEMDVHTGSN SEKFSKTKKSKRKLEVDSHSLHGPVNDEESSTKKKKKVESKSKNNSQEVPKGDSKVEE NKKQNTCMKKSTKKKTLKNKSSKKSSKPPQKEPVEKGSAQMDPPQMGPAPTEAVQKGP VQVEPPPPMEHAQMEGAQIRPAPDEPVQMEVVQEGPAQKELLPPVEPAQMVGAQIVLA HMELPPPMETAQTEVAQMGPAPMEPAQMEVAQVESAPMQVVQKEPVQMELSPPMEVVQ KEPVQIELSPPMEVVQKEPVKIELSPPIEVVQKEPVQMELSPPMGVVQKEPAQRELPP PREPPLHMEPISKKPPLRKDKKEKSNMQSERARKEQVLIEVGLVPVKDSWLLKESVST EDLSPPSPPLPKENLREEASGDQKLLNTGEGNKEAPLQKVGAEEADESLPGLAANINE STHISSSGQNLNTPEGETLNGKHQTDSIVCEMKMDTDQNTRENLTGINSTVEEPVSPM LPPSAVEEREAVSKTALASPPATMAANESQEIDEDEGIHSHEGSDLSDNMSEGSDDSG LHGARPVPQESSRKNAKEALAVKAAKGDFVCIFCDRSFRKGKDYSKHLNRHLVNVLLS " BASE COUNT 2111 a 1219 c 1468 g 1837 t ORIGIN 1 gaattcggca cgaggccttc ttggtccacg acggccccag cacccaactt taccaccctc 61 ccccacctct cccccgaaac tccagcaaca aagaaaagta gtcggagaag gagcggcgac 121 tcagggtcgc ccgcccctcc tcaccgagga aggccgaata cagttatggc cacccaggta 181 atggggcagt cttctggagg aggagggctg tttaccagca gtggcaacat tggaatggcc 241 ctgcctaacg acatgtatga cttgcatgac ctttccaaag ctgaactggc cgcacctcag 301 cttattatgc tggcaaatgt ggccttaact ggggaagtaa atggcagctg ctgtgattac 361 ctggtcggtg aagaaagaca gatggcagaa ctgatgccgg ttggggataa caacttttca 421 gatagtgaag aaggagaagg acttgaagag tctgctgata taaaaggtga acctcatgga 481 ctggaaaaca tggaactgag aagtttggaa ctcagcgtcg tagaacctca gcctgtattt 541 gaggcatcag gtgctccaga tatttacagt tcaaataaag atcttccccc tgaaacacct 601 ggagcggagg acaaaggcaa gagctcgaag accaaaccct ttcgctgtaa gccatgccaa 661 tatgaagcag aatctgaaga acagtttgtg catcacatca gagttcacag tgctaagaaa 721 ttttttgttg gaagagagtg cagagaagca ggcaaaagca gggaatctct cttccactgc 781 agaagaggga gatttctcaa gggccccatt cgctgtgacc gctgcggcta caatactaat 841 cgatatgatc actatacagc acacctgaaa caccacacca gagctgggga taatgagcga 901 gtctacaagt gtatcatttg cacatacaca acagtgagcg agtatcactg gaggaaacat 961 ttaagaaacc attttccaag gaaagtatac acatgtggaa aatgcaacta tttttcagac 1021 agaaaaaaca attatgttca gcatgttaga actcatacag gagaacgccc atataaatgt 1081 gaactttgtc cttactcaag ttctcagaag actcatctaa ctagacatat gcgtactcat 1141 tcaggtgaga agccatttaa atgtgatcag tgcagttatg tggcctctaa tcaacatgaa 1201 gtaacccgcc atgcaagaca ggttcacaat gggcctaaac ctcttaattg cccacactgt 1261 gattacaaaa cagcagatag aagcaacttc aaaaaacatg tagagctaca tgtgaaccca 1321 cggcagttca attgccctgt atgtgactat gcagcttcca agaagtgtaa tctacagtat 1381 cacttcaaat ctaagcatcc tacttgtcct aataaaacaa tggatgtctc aaaagtgaaa 1441 ctaaagaaaa ccaaaaaacg agaggctgac ttgcctgata atattaccaa tgaaaaaaca 1501 gaaatagaac aaacaaaaat aaaaggggat gtggctggaa agaaaaatga aaagtccgtc 1561 aaagcagaga aaagagatgt ctcaaaagag aaaaagcctt ctaataatgt gtcagtgatc 1621 caggtgacta ccagaactcg aaaatcagta acagaggtga aagagatgga tgtgcataca 1681 ggaagcaatt cagaaaaatt cagtaaaact aagaaaagca aaaggaagct ggaagttgac 1741 agccattctt tacatggtcc tgtgaatgat gaggaatctt caacaaaaaa gaaaaagaag 1801 gtagaaagca aatccaaaaa taatagtcag gaagtgccaa agggtgacag caaagtggag 1861 gagaataaaa agcaaaatac ttgcatgaaa aaaagtacaa agaagaaaac tctgaaaaat 1921 aaatcaagta agaaaagcag taagcctcct cagaaggaac ctgttgagaa gggatctgct 1981 cagatggacc ctcctcagat ggggcctgct cccacagagg cggttcagaa ggggcccgtt 2041 caggtggagc cgccacctcc catggagcat gctcagatgg agggtgccca gatacggcct 2101 gctcctgacg agcctgttca gatggaggtg gttcaggagg ggcctgctca gaaggagctg 2161 ctgcctcccg tggagcctgc tcagatggtg ggtgcccaaa ttgtacttgc tcacatggag 2221 ctgcctcctc ccatggagac tgctcagacg gaggttgccc aaatggggcc tgctcccatg 2281 gaacctgctc agatggaggt tgcccaggta gaatctgctc ccatgcaggt ggtccagaag 2341 gagcctgttc agatggagct gtctcctccc atggaggtgg tccagaagga gcctgttcag 2401 atagagctgt ctcctcccat ggaggtggtc cagaaggaac ctgttaagat agagctgtct 2461 cctcccatag aggtggtcca gaaggagcct gttcagatgg agttgtctcc tcccatgggg 2521 gtggttcaga aggagcctgc tcagagggag ctacctcctc ccagagagcc tccccttcac 2581 atggagccaa tttccaaaaa gcctcctctc cgaaaagata aaaaggaaaa gtctaacatg 2641 cagagtgaaa gggcacggaa ggagcaagtc cttattgaag ttggcttagt gcctgttaaa 2701 gatagctggc ttctaaagga aagtgtaagc acagaggatc tctcaccacc atcaccacca 2761 ctgccaaagg aaaatttaag agaagaggca tcaggagacc aaaaattact caacacaggt 2821 gaaggaaata aagaagcccc tcttcagaaa gtaggagcag aagaggcaga tgagagccta 2881 cctggtcttg ctgctaatat caacgaatct acccatattt catcctctgg acaaaacttg 2941 aatacgccag agggtgaaac tttaaatggt aaacatcaga ctgacagtat agtttgtgaa 3001 atgaaaatgg acactgatca gaacacaaga gagaatctca ctggtataaa ttcaacagtt 3061 gaagaaccag tttcaccaat gcttccccct tcagcagtag aagaacgtga agcagtgtcc 3121 aaaactgcac tggcatcacc tcctgctaca atggcagcaa atgagtctca ggaaattgat 3181 gaagatgaag gcatccacag ccatgaagga agtgacctaa gtgacaacat gtcagagggt 3241 agtgatgatt ctggattgca tggggctcgg ccagttccac aagaatctag cagaaaaaat 3301 gcaaaggaag ccttggcagt caaagcggct aagggagatt ttgtttgtat cttctgtgat 3361 cgttctttca gaaagggaaa agattacagc aaacacctca atcgccattt ggttaatgtg 3421 ttactatctt gaagaagcag ctcaagggca ggagtaatga aactttgaac aaggtttcag 3481 ttcttagttt gtaaggtata ttacatttta tattcattta tgatagcaga caacctttta 3541 agattgcttt aattagtatc tgatgttgat ttttaagtgg cattcttttc cttaggactt 3601 tttatgtata cctgttgatt gttgtgtaaa ttttagtaaa tctaagagag tgtactaaac 3661 cagcaggtat ctgttagctt atgtgtttaa ttgaaattag aaggctaaga tggtataaca 3721 gcattttatt gctttgtcca gctacaactt gtcatttttt tctccatgtc ttatcttcct 3781 gtttcacttt agtttattct tcgtttttta ttgagatcta taaaaaattg gcttacttaa 3841 tagcaaatta cttgaagaat ttgcctgctt tatataaagt tagcacttta agattttttt 3901 ttagagatga gaagacattt aaattgaaga aaaattcccc cagcaataga cagtctatca 3961 gtccaagtat ttacttcctg agttttgatc aatatttttt atttgtgtat gttaatcgtc 4021 ataaaaacag tgattttggt gtgtttttta ttttggtgct ttaatggctt aagatgttgc 4081 acattttttt tttcttttgg tttctgttta tgtttttttg cctatgcagt taaatttttc 4141 ctagaaatag catttgtgtt gaacagtaac actttataca tatatatatg catgtttatt 4201 ttttggcgtc tttggaggga tgcttttaga cttgtttgca aaagggcagt tttctttttc 4261 tttgctgcag ttgtctattt tgcagaataa tagtgtgtgc aagtttgtga gcaaatgaaa 4321 tatgcaggtt caatctattg attttgattt ttacatctta tatctatgcc agaatctgta 4381 tttcatataa cttatttatt tcgaatggat gtagtaaatt cacagctatc agttttgatt 4441 ttgcaataaa taaaccacta ggttgcatgt cgaacaaatt tttatctcaa ataccaacca 4501 tcagtttttt ttttcatgtg ttttggtaca gctaattcct aattgtagag tgttaaatgt 4561 ttgaggagaa ccttttctca tagatggttg gtgttcatat ggctacttta caataaagag 4621 aactgtaagt gatatttgga aactacaaac ctggaattag gagatataat tattccttca 4681 agtttataga atatcacttg ggagattgga aagccatagc tattacgcgg caaacctagg 4741 ataagaaagg tagtatgagt gctggtagac cagctgcaac tttcctatac agtgaaaaag 4801 gctggtgaaa caagtacagt ccagattttt taaaatcata ctttctcagg gatctccaca 4861 aactggtggg tgtcctggct gtctgtgtga tagcctcttt ctataggtga ggcctcaaat 4921 gaattgcagc tatcctggtg ttcctatgag ggcactttgt atgaaaaagc gcatgtactc 4981 caaaacattt ttgaggttct ttggccagtt gccaaagagt gtgaaagaat ccaatagagg 5041 atttttctta ctgatagcag tcattcattg cagtaaaata aaatatgatc ccattaggga 5101 atcttgaatt ctgacctccc atactccgtt ttgaaataac cactttatat ttcatttttt 5161 aaaaatctga tgatctcttt gaggcaggtt tcagatttgg cagtacaaca tgaaagatta 5221 ggaaaagcat taataacgtg tgggtggaaa gcttgttaaa aatctgagag tgaagtttga 5281 gttaaaagtt gtttgaacat ggcattgact gggaggccaa agatttaaag aagcggaaga 5341 ttcttctctt aagacatgag gagtaagttg tgtgataatg gtatgtgttt tgtgtgcatg 5401 aatggacatt gtaaatgttg aattctaggc tccgacaatc attgtcaaca gaagatcaag 5461 ctgcaaatat ttatgtttta aaacttaaat tataaagcta gttaagtctt tctaatgact 5521 agttttaatg ttcatgggta cattttacct aagttaccgt ttacattgta tagaaaaaga 5581 tacatcttaa gcacagattg gttattagga attagtttgg ggaagaggtt tttttgtgga 5641 ttatttcata ctgcaaagaa aaaccatttg ccttttgggg aattgagcta acttctaatc 5701 tagtcttaag actagaatgc taaaaacaaa aacatgaagg aaaataaaac cccttattat 5761 taaattgatt tgtaaaaaca ttgttactgg aaatttattg gacttgaggc cttcctccag 5821 aaaataagga cttgattgtc aggcctatat taggttctga accttaatgc catgtatttg 5881 tacttactaa aaattgtttc aatgaaaagt acattagcag tatgaacttc tggtccagtt 5941 ggaagttttt ccatttgaaa aatgtgatgt ttgcatggaa ctgtttgaaa gactggtgat 6001 cgagttttaa gataggcctc caaatccttt ttgaactgag gtggcattac tccagtgaaa 6061 ttggtgagaa tccggggagc aatgttaatt tcactcaaca tgtccacctt tagattagga 6121 gtgaatgggt cggggagcct catgtttctt ggaaaggaca ctcaggatca aatttcttaa 6181 ctggatacaa ttaggtggga tcacatcaca gaacccataa tggtaatcac aaaaggaact 6241 ctgggaaatc atgcaaaaga accagcagca ctcttaaagt gccttgtaga ggatttgcat 6301 aggtttggtg agttccacat ttctaaggaa aggcgctaaa tatttgaata aatcaatcag 6361 tagctgtgca tacataggcc accccttctg ctgtggcgta tgtgccagca ttcttgcaat 6421 aaatatccga tgggaaatca gttcaagcca ggcatataca aagccaggag ctttggtagg 6481 cctcaacatg tggaagtgta ttgcagaaag ctgtaagtgt ctggaaatta atggtttcca 6541 acacatgctc aggtgcattc acttccaaga gaagcataat aaaaattcga tggtagggaa 6601 gttgctgaaa aaaaaaaaaa aaaaaaaaag aattc // LOCUS HSU22815 3976 bp mRNA PRI 08-AUG-1995 DEFINITION Human LAR-interacting protein 1a mRNA, complete cds. ACCESSION U22815 NID g930340 KEYWORDS LIP1a. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3976) AUTHORS Serra-Pages,C., Kedersha,N.L., Fazikas,L., Medley,Q., Debant,A. and Streuli,M. TITLE The LAR transmembrane protein tyrosine phosphatase and a coiled-coil LAR-interacting protein co-localize at focal adhesions JOURNAL EMBO J. 14 (12), 2827-2838 (1995) MEDLINE 95317301 REFERENCE 2 (bases 1 to 3976) AUTHORS Serra-Pages,C., Kedersha,N.L., Fazicas,L., Medley,Q., Debant,A. and Streuli,M. TITLE Direct Submission JOURNAL Submitted (14-MAR-1995) Michel Streuli, Tumor Immunology, Dana-Farber Cancer Institute, 44 Binney St., Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..3976 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 230..3787 /note="LIP1a; serine phosphorylated; colocalizes with LAR at focal adhesions; LAR PTPase-associated; coiled-coil protein" /codon_start=1 /product="LAR-interacting protein 1a" /db_xref="PID:g930341" /translation="MMCEVMPTISEAEGPPGGGGGHGSGSPSQPDADSHFEQLMVSML EERDRLLDTLRETQETLALTQGKLHEVGHERDSLQRQLNTALPQEFAALTKELNVCRE QLLEREEEIAELKAERNNTRLLLEHLECLVSRHERSLRMTVVKRQAQSPAGVSSEVEV LKALKSLFEHHKALDEKVRERLRVALERCSLLEEELGATHKELMILKEQNNQKKTLTD GVLDINHEQENTPSTSGKRSSDGSLSHEEDLAKVIELQEIISKQSREQSQMKERLASL SSHVTELEEDLDTARKDLIKSEEMNTKLQRDVREAMAQKEDMEERITTLEKRYLAAQR EATSVHDLNDKLENEIANKDSMHRQTEDKNRQLQERLELAEQKLQQTLRKAETLPEVE AELAQRVAALSKAEERHGNIEERLRQMEAQLEEKNQELQRARQREKMNEEHNKRLSDT VDKLLSESNERLQLHLKERMAALEDKNSLLREVESAKKQLEETQHDKDQLVLNIEALR AELDHMRLRGASLHHGRPHLGSVPDFRFPMADGHTDSYSTSAVLRRPQKGRLAALRDE PSKVQTLNEQDWERAQQASVLANVAQAFESDADVSDGEDDRDTLLSSVDLLSPSGQAD AHTLAMMLQEQLDAINKEIRLIQEEKENTEQRAEEIESRVGSGSLDNLGRFRSMSSIP PYPASSLASSSPPGSGRSTPRRIPHSPAREVDRLGVMTLLPPSREEVRDDKTTIKCET SPPSSPRALRLDRLHKGALHTVSHEDIRDIRNSTGSQDGPVSNPSSSNSSQDSLHKAP KKKGIKSSIGRLFGKKEKGRPGQTGKEALGQAGVSETDNSSQDALGLSKLGGQAEKNR KLQKKHELLEEARRQGLPFAQWDGPTVVVWLELWVGMPAWYVAACRANVKSGAIMSAL SDTEIQREIGISNPLHRLKLRLAIQEIMSLTSPSAPPTSRTTLAYGDMNHEWIGNEWL PSLGLPQYRSYFMECLVDARMLDHLTKKDLRGQLKMVDSFHRNSFQCGIMCLRRLNYD RKELERKREESQSEIKDVLVWSNDRVIRWILSIGLKEYANNLIESGVHGALLALDETF DFSALALLLQIPTQNTQARAVLEREFNNLLVMGTDRRFDEDDDKSFRRAPSWRKKFRP KDIRGLAAGSAETLPANFRVTSSMSSPSMQPKKMQMDGM" polyA_site 3976 /note="28 A nucleotides" BASE COUNT 1170 a 951 c 1064 g 791 t ORIGIN 1 gaattccggg cccccggccg tccctcttaa ggctcttcct ccgctccgcc agtgtccggc 61 cgcgggccgg ccttagtgac tggggcggcg ggccccgggg ccgcggcgtg gggcgggcag 121 gcggacgccg gccgcgggct gctttcgtcg gctcccaagc tctcccggag cgagcagccg 181 cccgcgagcc gccgcggagc ctcctcgccc gctcccgccg gcgagcaaga tgatgtgcga 241 ggtgatgccg accatcagcg aagcagaagg cccccctgga ggaggtggag gccatggttc 301 cggctcccct tcacagccag atgcagattc acattttgaa cagttgatgg tctccatgct 361 agaagaaagg gaccgccttc ttgatacact gagagagact caagaaacgc tggccttaac 421 ccaggggaag ttacacgagg ttggtcatga aagagattcc ttgcagagac agctcaacac 481 ggcacttcca caggagttcg cagcacttac taaagaactc aatgtatgca gggaacagct 541 ccttgaaagg gaagaagaaa ttgctgaact gaaagcagaa aggaataaca ccaggctgct 601 gttagagcat ttggaatgcc ttgtctccag gcatgagcgg tctcttagga tgaccgtggt 661 gaagagacaa gcgcagtctc cagcaggcgt gtccagcgaa gtggaagtgc tgaaagcact 721 gaagtcctta tttgaacacc acaaagctct ggatgaaaag gtgagagagc gattacgagt 781 agcacttgaa agatgtagtt tgttagaaga ggaattaggt gccacacaca aagagctaat 841 gattcttaaa gaacagaata atcagaaaaa aactctaaca gatggagtgc tggacataaa 901 ccatgaacaa gaaaatacac caagcacgag tggaaagaga tcttctgatg gttctttaag 961 ccacgaggaa gaccttgcta aagtaattga gctccaagaa atcataagta agcagtcaag 1021 ggaacagagc caaatgaaag aacgcctggc ttccctttcc agtcatgtga cagaactgga 1081 agaggatctg gacacggcta gaaaagatct catcaaatct gaagaaatga acacaaaatt 1141 gcaacgagat gtccgtgaag ccatggccca aaaggaagat atggaagaga gaatcactac 1201 tcttgaaaaa cgctacctcg ctgcacagcg tgaagccaca tctgtgcatg acctcaatga 1261 taaacttgaa aatgaaattg caaataaaga ttctatgcat cgacagactg aagataaaaa 1321 ccgccagtta caggagcgct tggaattggc agagcaaaag ctgcaacaga cactgaggaa 1381 ggcagagacg ctcccggagg tggaggcgga gctggcccag agggtggcag cgctttccaa 1441 ggctgaagag agacacggca acattgaaga aaggttacga cagatggaag cacagttgga 1501 ggagaagaat caagaactgc agcgggcaag gcaaagagaa aaaatgaacg aagaacataa 1561 taaacgttta tcagacactg ttgacaagct gctttcagaa tctaatgaga ggcttcaact 1621 tcatcttaaa gagagaatgg ctgctttgga agataagaac tctcttttaa gagaagttga 1681 aagtgcaaaa aagcagttag aagaaacaca acacgataag gatcagcttg tcctaaacat 1741 tgaagcactg agggctgaac tagaccacat gagactaaga ggtgcttcac ttcatcatgg 1801 ccgaccccac ttgggcagtg tcccagattt caggttcccc atggcagacg gccacacaga 1861 ctcctacagc accagtgcag tgctgcggcg cccacagaaa ggccggctgg cagccctgcg 1921 agatgagcct tccaaggtac aaactcttaa tgagcaggat tgggaacgtg cccagcaagc 1981 tagtgtcttg gcaaatgtag cacaagcatt cgagagtgat gctgacgtgt ctgatggtga 2041 agatgacagg gacactctcc tcagctcagt tgacctgcta tcgcccagcg ggcaggccga 2101 cgcgcacaca ctagccatga tgcttcagga gcagctggac gccatcaaca aagagatcag 2161 gttgattcag gaagaaaaag aaaatacaga gcagcgggca gaggagattg aaagtcgagt 2221 tggcagtgga agtctagaca atcttggtcg ttttagatca atgagctcca ttccccccta 2281 ccctgcttcc tcgcttgcta gctcctcccc tccgggcagt gggcgctcca ccccacgaag 2341 gatccctcac agcccagctc gggaagtgga cagactgggc gtcatgaccc ttttgccacc 2401 ttccagagaa gaggtacgag atgacaagac aaccataaag tgtgaaacct ccccgccttc 2461 ctccccgaga gcccttcggt tagaccggct gcacaaaggg gcgctgcaca ccgtcagcca 2521 cgaggacatc agggacataa ggaactccac aggctcccag gatggtcccg tgagcaaccc 2581 cagcagtagc aacagtagcc aggactcgct ccacaaagcc ccaaagaaga aaggcattaa 2641 gtcctccatt ggccgcttgt ttggcaagaa agaaaagggc cgacctggac aaactggcaa 2701 agaagcatta ggacaagctg gtgtttccga gacggataac tcatctcagg atgccttggg 2761 acttagcaaa ttggggggac aggctgaaaa aaatcgtaaa cttcaaaaaa agcatgaatt 2821 gctggaggaa gcccggagac aaggtttacc ttttgcccaa tgggacgggc caacggttgt 2881 ggtctggcta gagctctggg ttgggatgcc agcctggtat gtggctgcct gccgagcaaa 2941 cgtgaaaagc ggggccatca tgtcggccct gtccgacaca gagatccagc gtgagattgg 3001 catcagcaac cccctgcaca ggctgaagct gaggctggcc atccaggaga tcatgtcgct 3061 gaccagcccg tctgccccgc ccacatctag aacgacactc gcctatgggg acatgaacca 3121 cgagtggatc ggcaacgagt ggctccccag cctgggcctc ccccagtacc gcagctactt 3181 catggagtgc cttgtagacg ccaggatgct ggaccacttg accaagaaag accttcgagg 3241 gcagctgaaa atggtcgaca gttttcacag aaacagtttc cagtgtggaa ttatgtgcct 3301 gagaaggtta aattatgacc ggaaagaact ggaaagaaaa agagaagaaa gtcagagtga 3361 aataaaagac gtgcttgttt ggagcaatga tcgagtgatt cgctggatcc tgtcaattgg 3421 ccttaaagaa tatgcaaaca atcttataga gagtggtgtt cacggagcac ttctggcctt 3481 agatgaaacc ttcgacttca gtgcactggc actgctgtta cagatcccga cgcagaacac 3541 acaggctcgt gctgtcttgg aaagagaatt taacaacctt ttggtcatgg ggactgatag 3601 aaggtttgat gaagatgatg ataaaagctt taggagagca ccttcatgga gaaaaaagtt 3661 tagaccaaag gacattcgtg gcttagctgc tgggtcagca gagactctcc ctgcaaactt 3721 ccgggtgact tcttctatgt cttccccctc tatgcagcca aagaagatgc agatggacgg 3781 tatgtgatgg gtcacactaa cctgtcactt gttgggagca tgagcagctt tctgtctgga 3841 acattaataa tgatctaaaa cggcctattt aatatgttac aaggcacttg agtatggttg 3901 catgtccaaa tataaatgtt tttaaattaa ctctaacatt tgtttataaa agtttaacca 3961 taataataga attttt // LOCUS HSU22897 2387 bp mRNA PRI 09-SEP-1995 DEFINITION Homo sapiens nuclear domain 10 protein (ndp52) mRNA, complete cds. ACCESSION U22897 NID g984286 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2387) AUTHORS Korioth,F., Gieffers,C., Maul,G.G. and Frey,J. TITLE Molecular characterization of NDP52, a novel protein of the nuclear domain 10, which is redistributed upon virus infection and interferon treatment JOURNAL J. Cell Biol. 130 (1), 1-13 (1995) MEDLINE 95310349 REFERENCE 2 (bases 1 to 2387) AUTHORS Frey,J. TITLE Direct Submission JOURNAL Submitted (15-MAR-1995) Juergen Frey, Fakultaet fuer Chemie - Biochemie II, Universitaet Bielefeld, Universitaetsstrasse 25, Bielefeld, Germany, 33615 FEATURES Location/Qualifiers source 1..2387 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pfk52c" /cell_line="MG63" /cell_type="osteosarcoma" 5'UTR 1..54 gene 55..1395 /gene="ndp52" CDS 55..1395 /gene="ndp52" /note="nuclear domain 10 protein as characterized by a monoclonal antibody recognizing the encoded protein" /codon_start=1 /product="NDP52" /db_xref="PID:g984287" /translation="MEETIKDPPTSAVLLDHCHFSQVIFNSVEKFYIPGGDVTCHYTF TQHFIPRRKDWIGIFRVGWKTTREYYTFMWVTLPIDLNNKSAKQQEVQFKAYYLPKDD EYYQFCYVDEDGVVRGASIPFQFRPENEEDILVVTTQGEVEEIEQHNKELCKENQELK DSCISLQKQNSDMQAELQKKQEELETLQSINKKLELKVKEQKDYWETELLQLKEQNQK MSSENEKMGIRVDQLQAQLSTQEKEMEKLVQGDQDKTEQLEQLKKENDHLFLSLTEQR KDQKKLEQTVEQMKQNETTAMKKQQELMDENFDLSKRLSENEIICNALQRQKERLEGE NDLLKRENSRLLSYMGLDFNSLPYQVPTSDEGGARQNPGLAYGNPYSGIQESSSPSPL SIKKCPICKADDICDHTLEQQQMQPLCFNCPICDKIFPATEKQIFEDHVFCHSL" 3'UTR 1396..2387 polyA_signal 2369..2374 polyA_site 2387 /note="14 A nucleotides" BASE COUNT 713 a 505 c 536 g 633 t ORIGIN 1 actctgccct gttgctgtcg cgccgctgct ggttgctgtc cctggacccc taccatggag 61 gagaccatca aagatccccc cacatcagct gtcttgctgg atcactgtca tttctctcag 121 gtcatcttta acagtgtgga gaagttctac atccctggag gggacgtcac atgtcattat 181 accttcaccc agcatttcat ccctcgtcga aaggattgga ttggcatctt tagagtgggg 241 tggaagacaa cccgtgagta ttacaccttc atgtgggtta ctttgcccat tgacctaaac 301 aacaaatcag ctaaacagca ggaagtccaa ttcaaagctt actacctgcc caaggatgat 361 gagtattacc agttctgcta tgtggatgag gatggtgtgg tccggggagc aagtattcct 421 ttccaattcc gtccagaaaa tgaggaagac atcctggttg ttaccactca gggagaggtg 481 gaagagattg agcagcacaa caaggagctt tgcaaagaaa accaggagct gaaggacagc 541 tgtatcagcc tccagaagca gaactcagac atgcaggctg agctccaaaa gaagcaggag 601 gagctagaaa ccctacagag catcaataag aagttggaac tgaaagtgaa agaacagaag 661 gactattggg agacagagct gcttcaactg aaagaacaaa accagaagat gtcctcagaa 721 aatgagaaga tgggaatcag agtggatcag cttcaggccc agctgtcaac tcaagagaaa 781 gaaatggaga agcttgttca gggagatcaa gataagacag agcagttaga gcagctgaaa 841 aaggaaaatg accacctctt tctcagttta actgaacaga ggaaggacca gaagaagctc 901 gagcagacag tggagcaaat gaagcagaat gaaactactg caatgaagaa acaacaggaa 961 ttaatggatg aaaactttga cctgtcaaaa agactgagtg agaacgaaat tatatgtaat 1021 gctctgcaga gacagaaaga gagattggaa ggagaaaatg atcttttgaa gagggagaac 1081 agcagattgc tcagttacat gggtctggat tttaattctt tgccgtatca agtacctact 1141 tcagatgaag gaggcgcaag acaaaatcca ggacttgcct atggaaaccc atattctggt 1201 atccaagaaa gttcttcccc cagcccgctc tccatcaaga aatgccctat ctgcaaagca 1261 gatgatattt gtgatcacac cttggagcaa cagcagatgc agcccctttg tttcaattgt 1321 ccaatttgtg acaagatctt cccagctaca gagaagcaga tctttgaaga ccacgtgttc 1381 tgccactctc tctgagtatc ccaacctctt ggatgtatac agagatttta tagaatagaa 1441 cctatagctt ctaccatgag ttatatgagt caagatcctg cctaacctga aattattagg 1501 gatttactca gccctgctgc cgctaacagt ggagttatgt cactgatctg aaggtcactg 1561 ttaagggctt ctgctgccat ccttgtgggt tgctaccttt aagtcgcata actctagctg 1621 tatcatcctc tcacctgtca ttcttctgag ggtctcagta caagggccct gggatggagc 1681 caacctgggt attcacaaca ggcctgactt gatactaagt gattagtttt ccaagttgtc 1741 ccactgccat tcaaagtcag cccttgagtg tatttgttct cagtcctaac cctggggcca 1801 gagattggtc cgaggttgag aattccttcc tcctcatcct tggtgttgct ttctccaaat 1861 gattgtttta gactagccaa aaatgccgtg gcaaagagct cagaaatcca atttggatac 1921 caaaggtttc tcatgttaat ttctcagccc ccaaagaagc atcttactcc tgaaccttag 1981 acaggaagta ttgtttcagt cacagaaagc ttttctgggt acctctggtt agcactttct 2041 actctctgat atttcctatg tacatagctt ttattgttgt aaatcctttc ttaatggtta 2101 aataggattg ttagcaacta tgggtttgca gttttctgag taggtgagtt ttgaatatgg 2161 gtaaatcaga ataatgagac aacttgttaa tctctttaat actaaaaata aattactctt 2221 ctatttcagg gacttaggta atttaaaata aaccttcaat ttatggtctt ctgttttgaa 2281 gctcatggga aaattgtgat caaaagggct atgggaaggg cagaccccgc caatgatttc 2341 tcttcacctg tcttaagatt aaataaaaaa gagtgtcctg gcagtta // LOCUS HSU22961 3239 bp mRNA PRI 11-APR-1995 DEFINITION Human mRNA clone with similarity to L-glycerol-3-phosphate:NAD oxidoreductase and albumin gene sequences. ACCESSION U22961 NID g763428 KEYWORDS alpha-glycerol-3-phosphate dehydrogenase; albumin; cDNA clone. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3239) AUTHORS Menaya,J., Parrilla,R. and Ayuso,M.S. TITLE A Human cDNA clone containing alpha-glycerol-3-phosphate dehydrogenase and albumin gene sequences JOURNAL Unpublished REFERENCE 2 (bases 1 to 3239) AUTHORS Menaya,J., Parrilla,R. and Ayuso,M.S. TITLE Direct Submission JOURNAL Submitted (17-MAR-1995) Matilde S. Ayuso, Centro de Investigaciones Biologicas, CSIC, Velazquez 144, Madrid, 28006, Spain FEATURES Location/Qualifiers source 1..3239 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /dev_stage="adult" /sex="male" /clone_lib="cDNA from Clontech HL1115a" exon 21..159 /note="similar to glycerol-3-phosphate dehydrogenase gene exon 4" CDS 90..440 /note="putative ORF; similar in part to the product encoded by human glycerol-3-phosphate dehydrogenase mRNA, GenBank Accession Number L34041; Method: conceptual translation supplied by author" /codon_start=1 /evidence=not_experimental /db_xref="PID:g763429" /translation="MSVLMGANIASEVADEKFCETTIGESPLAPAYTVHLVASPPQTA PTPLSHLFPIRQRKENTSIRGEGGWDTPREGASGPCLGKHSDEPSLRVGAVAHLWSQL LKRLRWEDRLSPGV" repeat_unit 401..499 /rpt_family="Alu" exon 538..650 /note="similar to glycerol-3-phosphate dehydrogenase gene exon 5" CDS 579..1574 /note="putative ORF; similar in part to the product encoded by human glycerol-3-phosphate dehydrogenase mRNA, GenBank Accession Number L34041; Method: conceptual translation supplied by author" /codon_start=1 /evidence=not_experimental /db_xref="PID:g763430" /translation="MQTPNFRITVVQEVDTVEICGALKVRGAQRQLWGEEKAPKEVWL SSARLQVLQALTIEQWFPFPTMSHAYRRSCRSKARAIQAADYSSSDRGPPCSCPILAP CGTPHPLAPGGGQGDKEMPRLMGDKTSLTPRQTNGQNYGSWTAVDLGLYNGMVNRWED QLERRVVFRKCPRGGSMGQGLRKGVQQGGKPRGGSRLLKGHTLYLSSAVGVTADEMSM RGLHMGPIGGGLFLTYDLHSFKNVVAVGAGFCDGLGFGDNTKAAVIRLGLMEMIAFAK LFCSGPVSSATFLESCGVADLITTCYGGRNRKVAEAFARTGKVGPGRRENRGAAL" exon 1298..1503 /note="similar to glycerol-3-phosphate dehydrogenase gene exon 6" CDS 1873..>3239 /note="similar to human albumin, Swiss-Prot Accession Number P02768; Method: conceptual translation supplied by author" /codon_start=1 /evidence=not_experimental /db_xref="PID:g763431" /translation="MKWVTFISLLFLFSSAYSRGVFRRDAHKSEVAHRFKDLGEENFK ALVLIAFAQYLQQCPFEDHVKLVNEVTEFAKTCVADESAENCDKSLHTLFGDKLCTVA TLRETYGEMADCCAKQEPGRNECFLQHKDDNPNLPRLVRPEVDVMCTAFHDNEETFLK KYLYEIARRHPYFYAPELLFFAKRYKAAFTECCQAADKAACLLPKLDELRDEGKASSA KQRLKCASLQKFGERAFKAWAVARLSQRFPKAEFAEVSKLVTDLTKVHTECCHGDLLE CADDRADLAKYICENQDSISSKLKECCEKPLLEKSHCIAEVENDEMPADLPSLAADFV ESKDVCKNYAEAKDVFLGMFLYEYARRHPDYSVVLLLRLAKTYETTLEKCCAAADPHE CYAKVFDEFKPLVEEPQNLIKQNCELFEQLGEYKFQNALLVRYTKKVPQVSTPTLVEV SRNLG" BASE COUNT 845 a 770 c 843 g 781 t ORIGIN 1 gagctccatc ctgtgctcag ggggtagacg agggccccaa tgggctgaag ctcatctcgg 61 aagtgattgg ggagcgcctc ggcatcccca tgagtgtgct gatgggggcc aacattgcca 121 gcgaggtggc tgatgagaag ttctgtgaga caaccattgg tgagagcccc ctggcacctg 181 catacacagt gcatctagtt gcatcccctc cccaaactgc cccaacccca cttagccatc 241 tctttcccat aaggcagaga aaggaaaata caagcatcag aggtgaagga ggctgggaca 301 cccccaggga aggggcttca gggccttgct tagggaaaca cagtgatgag ccctccctca 361 gagttggtgc agtggcacac ctgtggtccc agctacttaa gaggctgagg tgggaggatc 421 gcttaagccc aggagtttaa gtccagcctg ggcaacagag agagactccc atctctataa 481 aataaatatt ttttaaaaag agtccttccc tcaaagcctt gccccctcct cactttaggc 541 tgcaaggacc cggcccagga caactcctga aagagctgat gcagacacca aacttccgta 601 tcacagtggt gcaagaggtg gacactgtag agatctgtgg agccttaaag gtgagagggg 661 cacagaggca gctatggggt gaggagaagg ccccaaagga ggtctggctg agctctgcaa 721 ggctgcaggt actccaggct ctcactattg agcagtggtt cccctttcct accatgtccc 781 atgcatatag gaggagctgc agaagcaagg ccagggccat tcaggcggct gattattcat 841 catcagaccg tggaccccct tgctcctgtc ccattttagc cccgtgtggg actccccacc 901 ctctggctcc aggaggtggt cagggggaca aggagatgcc caggctaatg ggagacaaaa 961 catccttgac acccagacag actaatggcc agaactatgg atcctggaca gctgttgact 1021 tagggcttta caatggaatg gtcaatagat gggaagatca actagagaga agagtggttt 1081 tcagaaaatg tcctcgagga gggagtatgg ggcagggctt aagaaagggg gtgcagcaag 1141 gggggaaacc aagaggcgga agtaggcttc taaaaggcca cacactatac ctctcttctg 1201 cagtgggtgt cacggctgat gaaatgagta tgagggggct ccacatgggg cctataggag 1261 ggggtctttt tctcacctat gacctccact ccttcaagaa tgtagtggcc gtgggggctg 1321 gcttctgtga tggcctgggc tttggcgaca acaccaaggc ggcagtgatc cggctgggtc 1381 tcatggagat gatagccttc gccaagctct tctgcagtgg ccctgtgtcc tctgccacct 1441 tcttggagag ctgtggtgtt gctgacctga tcactacctg ctatggaggg cggaaccgga 1501 aagtggctga ggcctttgcg cgtacaggaa aggtgggccc cgggagaagg gagaacagag 1561 gggcggctct gtaggcatcc aggtagaggt gcttggcggg aggcatctct ggagcacaaa 1621 cattaagact gttgtgcaca tccccatccc tcttttcctc ccaagacccc actcccatct 1681 gagctccagt ctctccaccc cctactgaca acccagctgc tcccttttcc aagctttacc 1741 ctgaccagtc atgagtggat attctgaacc tgtttcatac tgtcttcttc taggagtctt 1801 tctcagacca gcacagctct gctctgatca gcccaatcac ccccgctgtc aaccccacac 1861 gcctttggca caatgaagtg ggtaaccttt atttcccttc tttttctctt tagctcggct 1921 tattccaggg gtgtgtttcg tcgagatgca cacaagagtg aggttgctca tcggtttaaa 1981 gatttgggag aagaaaattt caaagccttg gtgttgattg cctttgctca gtatcttcag 2041 cagtgtccat ttgaagatca tgtaaaatta gtgaatgaag taactgaatt tgcaaaaaca 2101 tgtgttgctg atgagtcagc tgaaaattgt gacaaatcac ttcataccct ttttggagac 2161 aaattatgca cagttgcaac tcttcgtgaa acctatggtg aaatggctga ctgctgtgca 2221 aaacaagaac ctgggagaaa tgaatgcttc ttgcaacaca aagatgacaa cccaaacctc 2281 ccccgattgg tgagaccaga ggttgatgtg atgtgcactg cttttcatga caatgaagag 2341 acatttttga aaaaatactt atatgaaatt gccagaagac atccttactt ttatgccccg 2401 gaactccttt tctttgctaa aaggtataaa gctgctttta cagaatgttg ccaagctgct 2461 gataaagctg cctgcctgtt gccaaagctc gatgaacttc gggatgaagg gaaggcttcg 2521 tctgccaaac agagactcaa gtgtgccagt ctccaaaaat ttggagaaag agctttcaaa 2581 gcatgggcag tagctcgcct gagccagaga tttcccaaag ctgagtttgc agaagtttcc 2641 aagttagtga cagatcttac caaagtccac acggaatgct gccatggaga tctgcttgaa 2701 tgtgctgatg acagggcgga ccttgccaag tatatctgtg aaaatcaaga ttcgatctcc 2761 agtaaactga aggaatgctg tgaaaaacct ctgttggaaa aatcccactg cattgccgaa 2821 gtggaaaatg atgagatgcc tgctgacttg ccttcattag ctgctgattt tgttgaaagt 2881 aaggatgttt gcaaaaacta tgctgaggca aaggatgtct tcttgggcat gtttttgtat 2941 gaatatgcaa gaaggcatcc tgattactct gtcgtgctgc tgctgagact tgccaagaca 3001 tatgaaacca ctctagagaa gtgctgtgcc gctgcagatc ctcatgaatg ctatgccaaa 3061 gtgttcgatg aatttaaacc tcttgtggaa gagcctcaga atttaatcaa acaaaattgt 3121 gagctttttg agcagcttgg agagtacaaa ttccagaatg cgctattagt tcgttacacc 3181 aagaaagtac cccaagtgtc aactccaact cttgtagagg tctcaagaaa cctaggaaa // LOCUS HSU22963 1263 bp mRNA PRI 11-AUG-1995 DEFINITION Human class I histocompatibility antigen-like protein mRNA, complete cds. ACCESSION U22963 NID g940353 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1263) AUTHORS Hashimoto,K., Hirai,M. and Kurosawa,Y. TITLE A gene outside the human MHC related to classical HLA class I genes JOURNAL Science 269 (5224), 693-695 (1995) MEDLINE 95350662 REFERENCE 2 (bases 1 to 1263) AUTHORS Hashimoto,K. TITLE Direct Submission JOURNAL Submitted (16-MAR-1995) Keiichiro Hashimoto, Fujita Health University, Toyoake, Aichi, 470-11, Japan FEATURES Location/Qualifiers source 1..1263 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="C7" /tissue_type="thymus" CDS 6..1031 /codon_start=1 /product="class I histocompatibility antigen-like protein" /db_xref="PID:g940354" /translation="MGELMAFLLPLIIVLMVKHSDSRTHSLRYFRLGVSDPIHGVPEF ISVGYVDSHPITTYDSVTRQKEPRAPWMAENLAPDHWERYTQLLRGWQQMFKVELKRL QRHYNHSGSHTYQRMIGCELLEDGSTTGFLQYAYDGQDFLIFNKDTLSWLAVDNVAHT IKQAWEANQHELLYQKNWLEEECIAWLKRFLEYGKDTLQRTEPPLVRVNRKETFPGVT ALFCKAHGFYPPEIYMTWMKNGEEIVQEIDYGDILPSGDGTYQAWASIELDPQSSNLY SCHVEHCGVHMVLQVPQESETIPLVMKAVSGSIVLVIVLAGVGVLVWRRRPREQNGAI YLPTPDR" BASE COUNT 322 a 303 c 321 g 317 t ORIGIN 1 ggactatggg ggaactgatg gcgttcctgt tacctctcat cattgtgtta atggtgaagc 61 acagcgattc ccggacgcac tctctgagat attttcgcct gggcgtttcg gatcccatcc 121 atggggtccc tgaatttatt tcggttgggt acgtggactc gcaccctatc accacatatg 181 acagtgtcac tcggcagaag gagccacggg ccccatggat ggcagagaac ctcgcgcctg 241 atcactggga gaggtacact cagctgctga ggggctggca gcagatgttc aaggtggaac 301 tgaagcgcct acagaggcac tacaatcact cagggtctca cacttaccag agaatgattg 361 gctgtgagct gctggaggat ggaagcacca caggatttct gcagtatgca tatgacgggc 421 aggatttcct gatcttcaat aaagacaccc tctcctggct ggctgtagat aatgtggctc 481 acaccatcaa gcaggcatgg gaggccaatc agcatgagtt gctgtatcaa aagaattggc 541 tggaagaaga atgtattgcc tggctaaaga gattcctgga gtatgggaaa gacaccctac 601 aaagaacaga gcccccactg gtcagagtaa atcgcaaaga aacttttcca ggggttacag 661 ctctcttctg caaagctcat ggcttttacc ccccagaaat ttacatgaca tggatgaaaa 721 acggggaaga aattgtccaa gaaattgatt atggagacat tcttcccagt ggggatggaa 781 cctatcaggc gtgggcatca attgagcttg atcctcagag cagcaacctt tactcctgtc 841 atgtggagca ctgcggtgtc cacatggttc ttcaggtccc ccaggaatca gaaactatcc 901 ctcttgtgat gaaagctgtc tctgggtcca ttgtccttgt cattgtgctg gctggagttg 961 gtgttctagt ctggagaaga aggccccgag agcaaaatgg agccatctac cttccaacac 1021 cagatcgatg attgcagatc cctcttttcc agttctcctt cctctaggag ccatgttatc 1081 ctctgtcccc catagagtca agcctagtgc ttgaaggtcc tgacgacacc cacaacatac 1141 atgagagtaa tgggattgag catttatggc agcaacagag gagccacaaa atgttctttg 1201 ttctttggct ccaaaaagac tgtcagcttt cagtctcttt tgatggactg ttttatcaga 1261 gtt // LOCUS HSU23052 873 bp DNA PRI 31-MAR-1995 DEFINITION Human arylamine N-acetyltransferase (NAT2), allele NAT2*14A, complete cds. ACCESSION U23052 NID g747646 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 873) AUTHORS Bell,D.A., Taylor,J.A., Butler,M.A., Stephens,E.A., Wiest,J., Brubaker,L.H., Kadlubar,F.F. and Lucier,G.W. TITLE Genotype/phenotype discordance for human arylamine N-acetyltransferase (NAT2) reveals a new slow-acetylator allele common in African-Americans JOURNAL Carcinogenesis 14 (8), 1689-1692 (1993) MEDLINE 93358376 REFERENCE 2 (bases 1 to 873) AUTHORS Ferguson,R.J., Doll,M.A., Rustan,T.D., Gray,K. and Hein,D.W. TITLE Cloning, expression, and functional characterization of two mutant (NAT2(191) and NAT2(341/803)) and wild-type human polymorphic N-acetyltransferase (NAT2) alleles JOURNAL Drug Metab. Dispos. 22 (3), 371-376 (1994) MEDLINE 94349811 REFERENCE 3 (bases 1 to 873) AUTHORS Hein,D.W. TITLE Direct Submission JOURNAL Submitted (17-MAR-1995) David W. Hein, Univ. of North Dakota Sch. of Medicine, Pharmacology-Toxicology, 501 N. Columbia Road., Grand Forks, ND 58202-9037, USA FEATURES Location/Qualifiers source 1..873 /organism="Homo sapiens" /note="human" /db_xref="taxon:9606" gene 1..873 /gene="NAT2" source 1..870 /organism="Homo sapiens" /map="8p21.3-23.1" /tissue_type="colon surgical samples" /chromosome="8" CDS 1..873 /gene="NAT2" /EC_number="2.3.1.5" /note="Allele: NAT2*14A" /codon_start=1 /product="arylamine N-acetyltransferase" /db_xref="PID:g727413" /translation="MDIEAYFERIGYKNSRNKLDLETLTDILEHQIRAVPFENLNMHC GQAMELGLEAIFDHIVRRNQGGWCLQVNQLLYWALTTIGFQTTMLGGYFYIPPVNKYS TGMVHLLLQVTIDGRNYIVDAGSGSSSQMWQPLELISGKDQPQVPCIFCLTEERGIWY LDQIRREQYITNKEFLNSHLLPKKKHQKIYLFTLEPRTIEDFESMNTYLQTSPTSSFI TTSFCSLQTPEGVYCLVGFILTYRKFNYKDNTDLVEFKTLTEEEVEEVLKNIFKISLG RNLVPKPGDGSLTI" BASE COUNT 260 a 179 c 188 g 246 t ORIGIN 1 atggacattg aagcatattt tgaaagaatt ggctataaga actctaggaa caaattggac 61 ttggaaacat taactgacat tcttgagcac cagatccggg ctgttccctt tgagaacctt 121 aacatgcatt gtgggcaagc catggagttg ggcttagagg ctatttttga tcacattgta 181 agaagaaacc agggtgggtg gtgtctccag gtcaatcaac ttctgtactg ggctctgacc 241 acaatcggtt ttcagaccac aatgttagga gggtattttt acatccctcc agttaacaaa 301 tacagcactg gcatggttca ccttctcctg caggtgacca ttgacggcag gaattacatt 361 gtcgatgctg ggtctggaag ctcctcccag atgtggcagc ctctagaatt aatttctggg 421 aaggatcagc ctcaggtgcc ttgcattttc tgcttgacag aagagagagg aatctggtac 481 ctggaccaaa tcaggagaga gcagtatatt acaaacaaag aatttcttaa ttctcatctc 541 ctgccaaaga agaaacacca aaaaatatac ttatttacgc ttgaacctcg aacaattgaa 601 gattttgagt ctatgaatac atacctgcag acgtctccaa catcttcatt tataaccaca 661 tcattttgtt ccttgcagac cccagaaggg gtttactgtt tggtgggctt catcctcacc 721 tatagaaaat tcaattataa agacaataca gatctggtcg agtttaaaac tctcactgag 781 gaagaggttg aagaagtgct gaaaaatata tttaagattt ccttggggag aaatctcgtg 841 cccaaacctg gtgatggatc ccttactatt tag // LOCUS HSU23070 1521 bp mRNA PRI 12-APR-1996 DEFINITION Human putative transmembrane protein (nma) mRNA, complete cds. ACCESSION U23070 NID g1262172 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1521) AUTHORS Degen,W.G., Weterman,M.A., van Groningen,J.J., Cornelissen,I.M., Lemmers,J.P., Agterbos,M.A., Geurts van Kessel,A., Swart,G.W. and Bloemers,H.P. TITLE Expression of nma, a novel gene, inversely correlates with the metastatic potential of human melanoma cell lines and xenografts JOURNAL Int. J. Cancer 65 (4), 460-465 (1996) MEDLINE 96184146 REFERENCE 2 (bases 1 to 1521) AUTHORS Degen,W.G.J. TITLE Direct Submission JOURNAL Submitted (20-MAR-1995) Winfried G.J. Degen, Dept. of Biochemistry, University of Nijmegen, P.O. Box 9101, Nijmegen, 6500 HB, The Netherlands FEATURES Location/Qualifiers source 1..1521 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="MV1" gene 373..1155 /gene="nma" CDS 373..1155 /gene="nma" /note="expression in poorly metastatic human melanoma cell lines; no expression in highly metastatic human melanoma cell lines; similar to EST encoded by GenBank Accession Number T73919; putative transmembrane protein" /codon_start=1 /evidence=experimental /db_xref="PID:g1262173" /translation="MDRHSSYIFIWLQLELCAMAVLLTKGEIRCYCDAAHCVATGYMC KSELSACFSRLLDPQNSNSPLTHGCLDSLASTTDICQAKQARNHSGTTIPTLECCHED MCNYRGLHDVLSPPRGEASGQGNRYQHDGSRNLITKVQELTSSKELWFRAAVIAVPIA GGLILVLLIMLALRMLRSENKRLQDQRQQMLSRLHYSFHGHHSKKGQVAKLDLECMVP VSGHENCCLTCDKMRQADLSNDKILSLVHWGMYSGHGKLEFV" BASE COUNT 364 a 395 c 414 g 348 t ORIGIN 1 ctggcgcggg cgggagctgc ggcggatacc cttgcgtgct gtggagaccc tactctcttc 61 gctgagaacg gccgctagcg gggactgaag gccgggagcc cactcccgac ccggggctag 121 cgtgcgtccc tagagtcgag cggggcaagg gagccagtgg ccgccgacgg gggaccggga 181 aacttttctg ggctcctgga gagccctgta gccgcgctcc atgctccggc agcggcccga 241 aacccagccc cgccgctgac ggagcccgcc gctccgggca gggcccatgc cctgcgcgct 301 ccgggggtcg tagctgccgc cgagccgggg ctccggaagc cggcgggggc gccgcggccg 361 tgcggggcgt caatggatcg ccactccagc tacatcttca tctggctgca gctggagctc 421 tgcgccatgg ccgtgctgct caccaaaggt gaaattcgat gctactgtga tgctgcccac 481 tgtgtagcca ctggttatat gtgtaaatct gagctcagcg cctgcttctc tagacttctt 541 gatcctcaga actcaaattc cccactcacc catggctgcc tggactctct tgcaagcacg 601 acagacatct gccaagccaa acaggcccga aaccactctg gcaccaccat acccacattg 661 gaatgctgtc atgaagacat gtgcaattac agagggctgc acgatgttct ctctcctccc 721 aggggtgagg cctcaggaca aggaaacagg tatcagcatg atggtagcag aaaccttatc 781 accaaggtgc aggagctgac ttcttccaaa gagttgtggt tccgggcagc ggtcattgcc 841 gtgcccattg ctggagggct gattttagtg ttgcttatta tgttggccct gaggatgctt 901 cgaagtgaaa ataagaggct gcaggatcag cggcaacaga tgctctcccg tttgcactac 961 agctttcacg gacaccattc caaaaagggg caggttgcaa agttagactt ggaatgcatg 1021 gtgccggtca gtgggcacga gaactgctgt ctgacctgtg ataaaatgag acaagcagac 1081 ctcagcaacg ataagatcct ctcgcttgtt cactggggca tgtacagtgg gcacgggaag 1141 ctggaattcg tatgacggag tcttatctga actacactta ctgaacagct tgaaggcctt 1201 ttgagttctg ctggacagga gcactttatc tgaagacaaa ctcatttaat catctttgag 1261 agacaaaatg acctctgcaa acagaatctt ggatatttct tctgaaggat tatttgcaca 1321 gacttaaata cagttaaatg tgttatttgc ttttaaaatt ataaaaagca aagagaagac 1381 tttgtacaca ctgtcaccag ggttatttgc atccaaggga gctggaattg agtacctaaa 1441 taaacaaaaa tgtgccctat gtaagcttct acatcttgat ttattgtaaa gatttaaaag 1501 aaatatatat attttgtctg a // LOCUS HSU23157 962 bp mRNA PRI 02-JUL-1995 DEFINITION Human pro-alpha-S1-casein mRNA, complete cds. ACCESSION U23157 NID g886015 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 962) AUTHORS Yu,D.-Y., Jeong,S., Lee,K.-K. and Lonnerdal,B. TITLE Cloning and sequencing of the human alpha-S1-casein cDNA JOURNAL Unpublished REFERENCE 2 (bases 1 to 962) AUTHORS Yu,D. TITLE Direct Submission JOURNAL Submitted (22-MAR-1995) Dae-Yeul Yu, Korea Reaserch Institute of Bioscience and Biotechnology, Bioresorces Research Group, Yusong, Taejon, Republic of Korea, 305-333 FEATURES Location/Qualifiers source 1..962 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hcAS1-3" /sex="female" /tissue_type="mammary gland" /dev_stage="adult" 5'UTR 1..28 CDS 29..586 /note="nutritional protein" /codon_start=1 /product="pro-alpha-S1-casein" /db_xref="PID:g886016" /translation="MRLLILTCLVAVALARPKLPLRYPERLQNPSESSEPIPLESREE YMNGMNRQRNILREKQTDEIKDTRNESTQNCVVAEPEKMESSISSSSEEMSLSKCAEQ FCRLNEYNQLQLQAAHAQEQIRRMNENSHVQVPFQQLNQLAAYPYAVWYYPQIMQYVP FPPFSDISNPTAHENYEKNNVMLQW" sig_peptide 29..74 mat_peptide 75..583 /note="nutritional protein" /product="alpha-S1-casein" 3'UTR 587..962 polyA_signal 942..947 polyA_site 962 /note="12 A nucleotides" BASE COUNT 288 a 194 c 184 g 296 t ORIGIN 1 cagacttggg cttaaggctc tgataaccat gaggcttctc attctcacct gtcttgtggc 61 tgttgctctt gccaggccta aacttcctct tagataccca gaacgccttc agaatccatc 121 agagagcagt gagcctatac cattagaatc aagagaggaa tacatgaatg gtatgaacag 181 gcagagaaac attctgagag aaaaacagac tgatgaaatc aaggatacta ggaatgagtc 241 tactcagaac tgtgttgtgg cagagcctga gaagatggaa tccagcatca gttcatcgag 301 tgaggaaatg tctctcagta agtgtgcgga acagttttgt agactgaacg aatacaacca 361 acttcagctg caagctgccc atgcccagga gcaaattcgc agaatgaatg aaaacagcca 421 tgtccaagtg cctttccagc agctcaacca acttgctgcc tacccctatg ctgtttggta 481 ctatccacaa atcatgcagt atgttccttt cccaccgttt tccgacatct ccaatcccac 541 tgctcatgaa aattatgaaa aaaataacgt catgctacag tggtgatatg attgaaaatt 601 tcattctctg aatttctcct ctcaaggaaa accatcttat ctgaagactg gactgttttt 661 tttagaatag taaaatccca tattgaagga aattgttctt tttgagttat ctacttaata 721 gcatatcatt ctttttctta agctaaattt tcctagagag tttattgtct aaatttcagt 781 tgtgtcttgc catatggagg gcacctaatc agagggtatt aaagtgttta ctaagttttc 841 tagtggacat tttgtttaaa aagtctttga attgccagtt ctgtaagtgc catcaattaa 901 aatagttttg tgcagtgaca gagattttct tttttctttt caataaatta cactttaagg 961 ca // LOCUS HSU23435 1430 bp mRNA PRI 15-MAR-1996 DEFINITION Human Abl interactor 2 (Abi-2) mRNA, complete cds. ACCESSION U23435 NID g915310 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1430) AUTHORS Dai,Z. and Pendergast,A.M. TITLE Abi-2, a novel SH3-containing protein interacts with the c-Abl tyrosine kinase and modulates c-Abl transforming activity JOURNAL Genes Dev. 9 (21), 2569-2582 (1995) MEDLINE 96067151 REFERENCE 2 (bases 1 to 1430) AUTHORS Dai,Z. and Pendergast,A.M. TITLE Direct Submission JOURNAL Submitted (24-MAR-1995) Zonghan Dai, Department of Pharmacology, Duke University Medical Center, Room C-C230, LSRC Building, Box 3813, Durham, NC 27708, USA FEATURES Location/Qualifiers source 1..1430 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lymphocyte" CDS 49..1254 /note="Aip-1; SH3-containing protein" /codon_start=1 /product="Abl interactor 2" /db_xref="PID:g915311" /translation="MSCRCWISRHPSYEGWNLQSIIFHKQIRGVDLESTFVTKFGNNC SLRLNETVDIHKEKVARREIGILTTNKNTSRTHKIIAPANLERPVRYIRKPIDYTILD DIGHGVKVSTQNMKMGGLPRTTPPTQKPPSPPMSGKGTLGRHSPYRTLEPVRPPVVPN DYVPSPTRNMAPSQQSPVRTASVNQRNRTYSSSGSSGPSHPSSRSSSRENSGSGSVGV PIAVPTPSPPSVFPGHPVQFYSMNRPASRHTPPTIGGSLPYRRPPSITSQTSLQNQMN GGPFYSQNPVSDTPPPPPPVEEPVFDESPPPPPPPEDYEEEEAAVVEYSDPYAEEDPP WAPRSYLEKVVAIYDYTKDKEDELSFQEGAIIYVIKKNDDGWYEGVMNGVTGLFPGNY VESIMHYSE" misc_feature 421..450 /gene="Abi-2" /note="encodes SH3 binding site" gene 421..1239 /gene="Abi-2" misc_feature 1084..1239 /gene="Abi-2" /note="encodes SH3 domain" BASE COUNT 413 a 334 c 323 g 360 t ORIGIN 1 cccaatcctt agcaagtgtt gcctatctga taaacacctt ggccaacaat gtcctgcaga 61 tgctggatat ccaggcatcc cagctacgaa ggatggaatc ttcaatcaat catatttcac 121 aagcaaatta gaggcgttga tcttgagtcg acttttgtga ccaaatttgg aaacaattgc 181 agtttgagat tgaatgagac agttgatatt cataaagaga aagttgcaag aagagaaatt 241 ggtattttga ctaccaataa aaacacttca aggacacata agattattgc tccagccaac 301 cttgaacgac cagttcgtta tattagaaaa cctattgact atacaattct agatgatatt 361 ggacatggag taaaggtgag tacccagaac atgaagatgg gtgggctgcc gcgtacaaca 421 cctccaactc agaagccccc tagtccccct atgtcaggga aagggacact tgggcggcac 481 tccccctatc gcacactgga gccagtgcgt cctccagtgg taccaaatga ttacgtacct 541 agcccaaccc gtaatatggc tccctcgcag cagagccctg tgaggacagc ttctgtgaat 601 caaagaaatc gaacttacag cagcagtggg agtagtggac ccagccaccc aagtagtcgg 661 agcagcagtc gagagaacag tggaagtggt agtgtggggg ttcctattgc tgttcctact 721 ccatctcctc ccagtgtctt tccaggtcat cctgtacagt tctacagcat gaataggcct 781 gcctctcgcc atactccccc aacaataggg ggctcgttgc cctatagacg ccctccttcc 841 attacttcac aaacaagcct tcagaatcag atgaatggag gaccttttta tagccagaat 901 ccagtttcag atacaccacc tccaccgcca cctgtggaag aaccagtctt tgatgagtct 961 cccccacctc ctcctcctcc agaagattac gaagaggagg aagctgctgt ggttgagtat 1021 agtgatcctt atgctgaaga ggacccaccg tgggctccac gttcttactt ggaaaaggtt 1081 gtggcaattt atgactatac aaaagacaag gaagatgagc tgtcctttca ggaaggagcc 1141 attatttatg tcatcaagaa gaatgacgat ggttggtatg agggagttat gaatggagtg 1201 actgggcttt ttcctgggaa ttacgttgag tctatcatgc attattctga gtaaagctca 1261 gcagggctgt gcttgcctca caggaatagt caggtcttcc cagattatct gaaggccctg 1321 gggattccac tccagtaaag tagaatgaag gatacaaatg ataaaaatta cacttttttt 1381 tttggtttat tccccagtat taaaaacaaa gcaagctgag tctgaacaaa // LOCUS HSU23731 1245 bp mRNA PRI 19-JUL-1995 DEFINITION Human TAR DNA-binding protein-43 mRNA, complete cds. ACCESSION U23731 NID g901997 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1245) AUTHORS Ou,S.H., Wu,F., Harrich,D., Garcia-Martinez,L.F. and Gaynor,R.B. TITLE Cloning and characterization of a novel cellular protein, TDP-43, that binds to human immunodeficiency virus type 1 TAR DNA sequence motifs JOURNAL J. Virol. 69 (6), 3584-3596 (1995) MEDLINE 95264449 REFERENCE 2 (bases 1 to 1245) AUTHORS Ou,S.-H.I. TITLE Direct Submission JOURNAL Submitted (27-MAR-1995) S.-H.I. Ou, Internal Medicine, University of Texas Southwestern, 5323 Harry Hines Blvd., Dallas, TX 75209, USA FEATURES Location/Qualifiers source 1..1245 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa cells" CDS 1..1245 /note="TDP-43 binds to HIV-1 LTR TAR DNA region and can repress HIV-1 transcription; TDP-43" /codon_start=1 /product="TAR DNA-binding protein-43" /db_xref="PID:g901998" /translation="MSEYIRVTEDENDEPIEIPSEDDGTVLLSTVTAQFPGACGLRYR NPVSQCMRGVRLVEGILHAPDAGWGNLVYVVNYPKDNKRKMDETDASSAVKVKRAVQK TSDLIVLGLPWKTTEQDLKEYFSTFGEVLMVQVKKDLKTGHSKGFGFVRFTEYETQVK VMSQRHMIDGRWCDCKLPNSKQSQDEPLRSRKVFVGRCTEDMTEDELREFFSQYGDVM DVFIPKPFRAFAFVTFADDQIAQSLCGEDLIIKGISVHISNAEPKHNSNRQLERSGRF GGNPGGFGNQGGFGNSRGGGAGLGNNQGSNMGGGMNFGAFSINPAMMAAAQAALQSSW GMMGMLASQQNQSGPSGNNQNQGNMQREPNQAFGSGNNSYSGSNSGAAIGWGSASNAG SGSGFNGGFGSSMDSKSSGWGM" BASE COUNT 352 a 227 c 359 g 307 t ORIGIN 1 atgtctgaat atattcgggt aaccgaagat gagaacgatg agcccattga aataccatcg 61 gaagacgatg ggacggtgct gctctccacg gttacagccc agtttccagg ggcgtgtggg 121 cttcgctaca ggaatccagt gtctcagtgt atgagaggtg tccggctggt agaaggaatt 181 ctgcatgccc cagatgctgg ctggggaaat ctggtgtatg ttgtcaacta tccaaaagat 241 aacaaaagaa aaatggatga gacagatgct tcatcagcag tgaaagtgaa aagagcagtc 301 cagaaaacat ccgatttaat agtgttgggt ctcccatgga aaacaaccga acaggacctg 361 aaagagtatt ttagtacctt tggagaagtt cttatggtgc aggtcaagaa agatcttaag 421 actggtcatt caaaggggtt tggctttgtt cgttttacgg aatatgaaac acaagtgaaa 481 gtaatgtcac agcgacatat gatagatgga cgatggtgtg actgcaaact tcctaattct 541 aagcaaagcc aagatgagcc tttgagaagc agaaaagtgt ttgtggggcg ctgtacagag 601 gacatgactg aggatgagct gcgggagttc ttctctcagt acggggatgt gatggatgtc 661 ttcatcccca agccattcag ggcctttgcc tttgttacat ttgcagatga tcagattgcg 721 cagtctcttt gtggagagga cttgatcatt aaaggaatca gcgttcatat atccaatgcc 781 gaacctaagc acaatagcaa tagacagtta gaaagaagtg gaagatttgg tggtaatcca 841 ggtggctttg ggaatcaggg tggatttggt aatagcagag ggggtggagc tggtttggga 901 aacaatcaag gtagtaatat gggtggtggg atgaactttg gtgcgttcag cattaatcca 961 gccatgatgg ctgccgccca ggcagcacta cagagcagtt ggggtatgat gggcatgtta 1021 gccagccagc agaaccagtc aggcccatcg ggtaataacc aaaaccaagg caacatgcag 1081 agggagccaa accaggcctt cggttctgga aataactctt atagtggctc taattctggt 1141 gcagcaattg gttggggatc agcatccaat gcagggtcgg gcagtggttt taatggaggc 1201 tttggctcaa gcatggattc taagtcttct ggctggggaa tgtag // LOCUS HSU23803 1714 bp mRNA PRI 18-APR-1995 DEFINITION Human heterogeneous ribonucleoprotein A0 mRNA, complete cds. ACCESSION U23803 NID g773643 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1714) AUTHORS Myer,V.E. and Steitz,J.A. TITLE Isolation and characterization of a novel, low abundance hnRNP protein: A0 JOURNAL RNA 1 (1995) In press REFERENCE 2 (bases 1 to 1714) AUTHORS Myer,V.E. and Steitz,J.A. TITLE Direct Submission JOURNAL Submitted (30-MAR-1995) Vic E. Myer, HHMI/MB&B, Yale University, bCMM/295 Congress Ave., New Haven, CT 06526, USA FEATURES Location/Qualifiers source 1..1714 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" CDS 265..1182 /note="hnRNP protein; hnRNA binding protein" /codon_start=1 /product="heterogeneous ribonucleoprotein A0" /db_xref="PID:g773644" /translation="MENSQLCKLFIGGLNVQTSESGLRGHFEAFGTLTDCVVVVNPQT KRSRCFGFVTYSNVEEADAAMAASPHAVDGNTVELKRAVSREDSARPGAHAKVKKLFV GGLKGDVAEGDLIEHFSQFGTVEKAEIIADKQSGKKRGFGFVYFQNHDAADKAAVVKF HPIQGHRVEVKKAVPKEDIYSGGGGGGSRSSRGGRGGRGRGGGRDQNGLSKGGGGGYN SYGGYGGGGGGGYNAYGGGGGGSSYGGSDYGNGFGGFGSYSQHQSSYGPMKSGGGGGG GGSSWGGRSNSGPYRGGYGGGGGYGGSSF" polyA_site 1714 /note="20 A nucleotides" BASE COUNT 306 a 457 c 550 g 401 t ORIGIN 1 ggggcggcgg cggcggtagc ggtggccttg gttgtcttcc agtctcctcg gctcgccctt 61 tagccggcac cgctcccctt ccctccccct tcctctcttc cttccttccc tccccttccc 121 tttttccctt ccccgtcggt gagcggcggg ggtggctcca gcaacggctg ggcccaagct 181 gtgtagaggc cttaaccaac gataacggcg gcgacggcga aacctcggag ctcgcagggc 241 gggggcaagg cccgggcctt ggagatggag aattctcagt tgtgtaagct gttcatcggc 301 ggcctcaatg tgcagacgag tgagtcgggc ctgcgcggcc actttgaggc ctttgggact 361 ctgacggact gcgtggtggt ggtgaatccc cagaccaagc gctcccgttg ctttggcttc 421 gtgacctact ccaatgtgga ggaggcggac gccgccatgg ccgcctcgcc ccatgccgtg 481 gacggcaaca ctgtggagct gaagcgggcg gtgtcccggg aggattcggc gcggcccggt 541 gcccacgcca aggttaagaa gctctttgtc ggaggcctta aaggagacgt ggctgagggc 601 gacctgatcg agcacttctc gcagtttggc accgtggaaa aggccgagat tattgccgac 661 aagcagtccg gcaagaagcg tggattcggc ttcgtgtatt tccagaatca cgacgcggca 721 gacaaggccg cggtggtcaa gttccatccg attcagggcc atcgcgtgga ggtgaagaaa 781 gcagtcccca aggaggatat ctactccggt gggggtggag gcggctcccg atcctcccgg 841 ggcggccgag gcggccgggg gcgcggcggt ggtcgagacc agaacggcct ttccaagggc 901 ggcggcggcg gttacaacag ctacggtggt tacggcggcg gcggaggcgg cggctacaat 961 gcctacggag gcggcggcgg cggttcgtcc tacggtggga gcgactacgg taacggcttc 1021 ggcggcttcg gcagctacag ccagcatcag tcctcctatg ggcccatgaa gagcggcggc 1081 ggcggcggcg gtggaggcag tagctggggc ggtcgcagta atagtggacc ttacagaggc 1141 ggctatggcg gtgggggtgg ctatggaggc agctccttct aaaagaaaat ttaaaatgcc 1201 tgggagtggc tataggggta gctctttcca acagcccaag tggggtcaac tcctaagccc 1261 caccccctca cacacaccgc cttccctgtt ttgcccttgg gggagccact tctaaggctg 1321 cttacccttg ggggtgttcc tctatttgcc tgccacctct cttgtctctc cctctgaaga 1381 tggactcggc cccacataca catttttgtg ttacagtcat tgatggactc tattttttta 1441 ttattacttg gaccttggtc gtttttatac tagcaaaatg tcttgtttta atttgtgttt 1501 tttgggggga gggagggagt gaacttgctg attctgtagc aaaacctggg tgggggttgg 1561 ggtggggggt agtttacttt gttgtaagga cttgataacc tggctacagc gttttctatg 1621 aaatctactt ggatcccatg cctgaaattt ggaagcatat gtacacaaat catttttacg 1681 ttttattttt aataaatcat tgtgtttgac cgta // LOCUS HSU23851 4168 bp mRNA PRI 25-MAR-1997 DEFINITION Human atrophin-1 mRNA, complete cds. ACCESSION U23851 NID g915325 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4168) AUTHORS Margolis,R.L., Li,S.H., Young,W.S., Wagster,M.V., Stine,O.C., Kidwai,A.S., Ashworth,R.G. and Ross,C.A. TITLE DRPLA gene (atrophin-1) sequence and mRNA expression in human brain JOURNAL Brain Res. Mol. Brain Res. 36 (2), 219-226 (1996) MEDLINE 96262314 REFERENCE 2 (bases 1 to 4168) AUTHORS Li,S., McInnis,M.G., Margolis,R.L., Antonarakis,S.E. and Ross,C.A. TITLE Novel triplet repeat containing genes in human brain: cloning, expression, and length polymorphisms JOURNAL Unpublished REFERENCE 3 (bases 1 to 4168) AUTHORS Margolis,R.L. TITLE Direct Submission JOURNAL Submitted (28-MAR-1995) Russell L. Margolis, Psychiatry, Johns Hopkins University School of Medicine, 720 Rutland Ave., Baltimore, MD 21205-2196, USA FEATURES Location/Qualifiers source 1..4168 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="frontal cortex, cerebellum, and caudate cDNA libraries (Stratagene)" /chromosome="12" /dev_stage="adult" gene 74..3628 /gene="DRPLA" CDS 74..3628 /gene="DRPLA" /codon_start=1 /product="atrophin-1" /db_xref="PID:g915326" /translation="MKTRQNKDSMSMRSGRKKEAPGPREELRSRGRASPGGVSTSSSD GKAEKSRQTAKKARVEEASTPKVNKQGRSEEISESESEETNAPKKTKTEELPRPQSPS DLDSLDGRSLNDDGSSDPRDIDQDNRSTSPSIYSPGSVENDSDSSSGLSQGPARPYHP PPLFPPSPQPPDSTPRQPEASFEPHPSVTPTGYHAPMEPPTSRMFQAPPGAPPPHPQL YPGGTGGVLSGPPMGPKGGGAASSVGGPNGGKQHPPPTTPISVSSSGASGAPPTKPPT TPVGGGNLPSAPPPANFPHVTPNLPPPPALRPLNNASASPPGLGAQPLPGHLPSPHAM GQGIGGLPPGPEKGPTLAPSPHSLPPASSSAPAPPMRFPYSSSSSSSAAASSSSSSSS SSASPFPASQALPSYPHSFPPPTSLSVSNQPPKYTQPSLPSQAVWSQGPPPPPPYGRL LANSNAHPGPFPPSTGAQSTAHPPVSTHHHHHQQQQQQQQQQQQQQHHGNSGPPPPGA FPHPLEGGSSHHAHPYAMSPSLGSLRPYPPGPAHLPPPHSQVSYSQAGPNGPPVSSSS NSSSSTSQGSYPCSHPSPSQGPQGAPYPFPPVPTVTTSSATLSTVIATVASSPAGYKT ASPPGPPPYGKRAPSPGAYKTATPPGYKPGSPPSFRTGTPPGYRGTSPPAGPGTFKPG SPTVGPGPLPPAGPSGLPSLPPPPAAPASGPPLSATQIKQEPAEEYETPESPVPPARS PSPPPKVVDVPSHASQSARFNKHLDRGFNSCARSDLYFVPLEGSKLAKKRADLVEKVR REAEQRAREEKEREREREREKEREREKERELERSVKLAQEGRAPVECPSLGPVPHRPP FEPGSAVATVPPYLGPDTPALRTLSEYARPHVMSPGNRNHPFYVPLGAVDPGLLGYNV PALYSSDPAAREREREARERDLRDRLKPGFEVKPSELEPLHGVPGPGLDPFPRHGGLA LQPGPPGLHPFPFHPSLGPLERERLALAAGPALRPDMSYAERLAAERQHAERVAALGN DPLARLQMLNVTPHHHQHSHIHSHLHLHQQDAIHAASASVHPLIDPLASGSHLTRIPY PAGTLPNPLLPHPLHENEVLRHQLFAAPYRDLPASLSAPMSAAHQLQAMHAQSAELQR LALEQQQWLHAHHPLHSVPLPAQEDYYSHLKKESDKPL" repeat_region 1532..1561 /note="microsatellite; polymorphic CAG repeat encoding glutamine, long expansion causes DRPLA" /rpt_type=tandem /rpt_unit=cag polyA_site 4168 /note="49 A nucleotides" BASE COUNT 827 a 1554 c 1046 g 741 t ORIGIN 1 ttggggtgga gcagagaagt ttctgtattc agctgcccag gcagaggaga atggggtctc 61 cacagcctga agaatgaaga cacgacagaa taaagactcg atgtcaatga ggagtggacg 121 gaagaaagag gcccctgggc cccgggaaga actgagatcg aggggccggg cctcccctgg 181 aggggtcagc acgtccagca gtgatggcaa agctgagaag tccaggcaga cagccaagaa 241 ggcccgagta gaggaagcct ccaccccaaa ggtcaacaag cagggtcgga gtgaggagat 301 ctcagagagt gaaagtgagg agaccaatgc accaaaaaag accaaaactg aggaactccc 361 tcggccacag tctccctccg atctggatag cttggacggg cggagcctta atgatgatgg 421 cagcagcgac cctagggata tcgaccagga caaccgaagc acgtccccca gtatctacag 481 ccctggaagt gtggagaatg actctgactc atcttctggc ctgtcccagg gcccagcccg 541 cccctaccac ccacctccac tctttcctcc ttcccctcaa ccgccagaca gcacccctcg 601 acagccagag gctagctttg aaccccatcc ttctgtgaca cccactggat atcatgctcc 661 catggagccc cccacatctc gaatgttcca ggctcctcct ggggcccctc cccctcaccc 721 acagctctat cccgggggca ctggtggagt tttgtctgga cccccaatgg gtcccaaggg 781 gggaggggct gcctcatcag tggggggccc taatgggggt aagcagcacc ccccacccac 841 tactcccatt tcagtatcaa gctctggggc tagtggtgct cccccaacaa agccgcctac 901 cactccagtg ggtggtggga acctaccttc tgctccacca ccagccaact tcccccatgt 961 gacaccgaac ctgcctcccc cacctgccct gagacccctc aacaatgcat cagcctctcc 1021 ccctggcctg ggggcccaac cactacctgg tcatctgccc tctccccacg ccatgggaca 1081 gggtatcggt ggacttcctc ctggcccaga gaagggccca actctggctc cttcacccca 1141 ctctctgcct cctgcttcct cttctgctcc agcgcccccc atgaggtttc cttattcatc 1201 ctctagtagt agctctgcag cagcctcctc ttccagttct tcctcctctt cctctgcctc 1261 ccccttccca gcttcccagg cattgcccag ctacccccac tctttccctc ccccaacaag 1321 cctctctgtc tccaatcagc cccccaagta tactcagcct tctctcccat cccaggctgt 1381 gtggagccag ggtcccccac cacctcctcc ctatggccgc ctcttagcca acagcaatgc 1441 ccatccaggc cccttccctc cctctactgg ggcccagtcc accgcccacc caccagtctc 1501 aacacatcac catcaccacc agcaacagca acagcagcag cagcagcagc agcagcagca 1561 gcatcacgga aactctgggc cccctcctcc tggagcattt ccccacccac tggagggcgg 1621 tagctcccac cacgcacacc cttacgccat gtctccctcc ctggggtctc tgaggcccta 1681 cccaccaggg ccagcacacc tgcccccacc tcacagccag gtgtcctaca gccaagcagg 1741 ccccaatggc cctccagtct cttcctcttc caactcttcc tcttccactt ctcaagggtc 1801 ctacccatgt tcacacccct ccccttccca gggccctcaa ggggcgccct accctttccc 1861 accggtgcct acggtcacca cctcttcggc taccctttcc acggtcattg ccaccgtggc 1921 ttcctcgcca gcaggctaca aaacggcctc cccacctggg cccccaccgt acggaaagag 1981 agccccgtcc ccgggggcct acaagacagc caccccaccc ggatacaaac ccgggtcgcc 2041 tccctccttc cgaacgggga ccccaccggg ctatcgagga acctcgccac ctgcaggccc 2101 agggaccttc aagccgggct cgcccaccgt gggacctggg cccctgccac ctgcggggcc 2161 ctcaggcctg ccatcgctgc caccaccacc tgcggcccct gcctcagggc cgcccctgag 2221 cgccacgcag atcaaacagg agccggctga ggagtatgag acccccgaga gcccggtgcc 2281 cccagcccgc agcccctcgc cccctcccaa ggtggtagat gtacccagcc atgccagtca 2341 gtctgccagg ttcaacaaac acctggatcg cggcttcaac tcgtgcgcgc gcagcgacct 2401 gtacttcgtg ccactggagg gctccaagct ggccaagaag cgggccgacc tggtggagaa 2461 ggtgcggcgc gaggccgagc agcgcgcgcg cgaagaaaag gagcgcgagc gcgagcggga 2521 acgcgagaaa gagcgcgagc gcgagaagga gcgcgagctt gaacgcagcg tgaagttggc 2581 tcaggagggc cgtgctccgg tggaatgccc atctctgggc ccagtgcccc atcgccctcc 2641 atttgaaccg ggcagtgcgg tggctacagt gcccccctac ctgggtcctg acactccagc 2701 cttgcgcact ctcagtgaat atgcccggcc tcatgtcatg tctcctggca atcgcaacca 2761 tccattctac gtgcccctgg gggcagtgga cccggggctc ctgggttaca atgtcccggc 2821 cctgtacagc agtgatccag ctgcccggga gagggaacgg gaagcccgtg aacgagacct 2881 ccgtgaccgc ctcaagcctg gctttgaggt gaagcctagt gagctggaac ccctacatgg 2941 ggtccctggg ccgggcttgg atccctttcc ccgacatggg ggcctggctc tgcagcctgg 3001 cccacctggc ctgcaccctt tcccctttca tccgagcctg gggcccctgg agcgagaacg 3061 tctagcgctg gcagctgggc cagccctgcg gcctgacatg tcctatgctg agcggctggc 3121 agctgagagg cagcacgcag aaagggtggc ggccctgggc aatgacccac tggcccggct 3181 gcagatgctc aatgtgactc cccatcacca ccagcactcc cacatccact cgcacctgca 3241 cctgcaccag caagatgcta tccatgcagc ctctgcctcg gtgcaccctc tcattgaccc 3301 cctggcctca gggtctcacc ttacccggat cccctaccca gctggaactc tccctaaccc 3361 cctgcttcct caccctctgc acgagaacga agttcttcgt caccagctct ttgctgcccc 3421 ttaccgggac ctgccggcct ccctttctgc cccgatgtca gcagctcatc agctgcaggc 3481 catgcacgca cagtcagctg agctgcagcg cttggcgctg gaacagcagc agtggctgca 3541 tgcccatcac ccgctgcaca gtgtgccgct gcctgcccag gaggactact acagtcacct 3601 gaagaaggaa agcgacaagc cactgtagaa cctgcgatca agagagcacc atggctccta 3661 cattggacct tggagcaccc ccaccctccc cccaccgtgc ccttggcctg ccacccagag 3721 ccaagagggt gctgctcagt tgcagggcct ccgcagctgg acagagagtg ggggagggag 3781 ggacagacag aaggccaagg cccgatgtgg tgtgcagagg tggggaggtg gcgaggatgg 3841 ggacagaaag cgcacagaat cttggaccag gtctctcttc cttgtccccc ctgcttttct 3901 cctcccccat gcccaacccc tgtggccgcc gcccctcccc tgccccgttg gtgtgattat 3961 ttcatctgtt agatgtggct gttttgcgta gcatcgtgtg ccacccctgc ccctccccga 4021 tccctgtgtg cgcgccccct ctgcaatgta tgccccttgc cccttcccca cactaataat 4081 ttatatatat aaatatctat atgacgctct taaaaaaaca tcccaaccaa aaccaaccaa 4141 acaaaaacat cctcacaact ccccagga // LOCUS HSU23942 3172 bp mRNA PRI 08-JAN-1997 DEFINITION Human lanosterol 14-demethylase cytochrome P450 (CYP51) mRNA, complete cds. ACCESSION U23942 NID g1698395 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3172) AUTHORS Stromstedt,M., Rozman,D. and Waterman,M.R. TITLE The ubiquitously expressed human CYP51 encodes lanosterol 14 alpha-demethylase, a cytochrome P450 whose expression is regulated by oxysterols JOURNAL Arch. Biochem. Biophys. 329 (1), 73-81 (1996) MEDLINE 96201125 REFERENCE 2 (bases 1 to 3172) AUTHORS Rozman,D., Stromstedt,M. and Waterman,M.R. TITLE The three human cytochrome P450 lanosterol 14 alpha-demethylase (CYP51) genes reside on chromosomes 3, 7, and 13: structure of the two retrotransposed pseudogenes, association with a line-1 element, and evolution of the human CYP51 family JOURNAL Arch. Biochem. Biophys. 333 (2), 466-474 (1996) MEDLINE 96404948 REFERENCE 3 (bases 1 to 3172) AUTHORS Stromstedt,M., Rozman,D. and Waterman,M.R. TITLE Direct Submission JOURNAL Submitted (31-MAR-1995) Department of Biochemistry, Vanderbilt University School of Medicine, 607 Light Hall, Nashville, TN 37232-0146, USA FEATURES Location/Qualifiers source 1..3172 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /chromosome="7" gene 123..1652 /gene="CYP51" CDS 123..1652 /gene="CYP51" /codon_start=1 /product="lanosterol 14-demethylase cytochrome P450" /db_xref="PID:g1698396" /translation="MAAAAGMLLLGLLQAGGSVLGQAMEKVTGGNLLSMLLIACAFTL SLVYLIRLAAGHLVQLPAGVKSPPYIFSPIPFLGHAIAFGKSPIEFLENAYEKYGPVF SFTMVGKTFTYLLGSDAAALLFNSKNEDLNAEDVYSRLTTPVFGKGVAYDVPNPVFLE QKKMLKSGLNIAHFKQHVSIIEKETKEYFESWGESGEKNVFEALSELIILTASHCLHG KEIRSQLNEKVAQLYADLDGGFSHAAWLLPGWLPLPSFRRRDRAHREIKDIFYKAIQK RRQSQEKIDDILQTLLDATYKDGRPLTDDEVAGMLIGLLLAGQHTSSTTSAWMGFFLA RDKTLQKKCYLEQKTVCGENLPPLTYDQLKDLNLLDRCIKETLRLRPPIMIMMRMART PQTVAGYTIPPGHQVCVSPTVNQRLKDSWVERLDFNPDRYLQDNPASGEKFAYVPFGA GRHRCIGENFAYVQIKTIWSTMLRLYEFDLIDGYFPTVNYTTMIHTPENPVIRYKRRS K" BASE COUNT 910 a 607 c 698 g 957 t ORIGIN 1 gctgggttta gtaggagacc tggggcaagg ccccctgtgg acgaccatct gccagcttct 61 ctcgttccgt cgattgggag gagcggtggc gacctcggcc ttcagtgttt ccgacggagt 121 gaatggcggc ggcggctggg atgctgctgc tgggcttgct gcaggcgggt gggtcggtgc 181 tgggccaggc gatggagaag gtgacaggcg gcaacctctt gtccatgctg ctgatcgcct 241 gcgccttcac cctcagcctg gtctacctga tccgtctggc cgccggccac ctggtccagc 301 tgcccgcagg ggtgaaaagt cctccataca ttttctcccc aattccattc cttgggcatg 361 ccatagcatt tgggaaaagt ccaattgaat ttctagaaaa tgcatatgag aagtatggac 421 ctgtatttag ttttaccatg gtaggcaaga catttactta ccttctgggg agtgatgctg 481 ctgcactgct ttttaatagt aaaaatgaag acctgaatgc agaagatgtc tacagtcgcc 541 tgacaacacc tgtgtttggg aagggagttg catacgatgt gcctaatcca gttttcttgg 601 agcagaagaa aatgttaaaa agtggcctta acatagccca ctttaaacag catgtttcta 661 taattgaaaa agaaacaaag gaatactttg agagttgggg agaaagtgga gaaaaaaatg 721 tgtttgaagc tctttctgag ctcataattt taacagctag ccattgtttg catggaaagg 781 aaatcagaag tcaactcaat gaaaaggtag cacagctgta tgcagatttg gatggaggtt 841 tcagccatgc agcctggctc ttaccaggtt ggctgccttt gcctagtttc agacgcaggg 901 acagagctca tcgggaaatc aaggatattt tctataaggc aatccagaaa cgcagacagt 961 ctcaagaaaa aattgatgac attctccaaa ctttactaga tgctacatac aaggatgggc 1021 gtcctttgac tgatgatgaa gtagcaggga tgcttattgg attactcttg gcagggcagc 1081 atacatcctc aactactagt gcttggatgg gcttcttttt ggccagagac aaaacacttc 1141 aaaaaaaatg ttatttagaa cagaaaacag tctgtggaga gaatctgcct cctttaactt 1201 atgaccagct caaggatcta aatttacttg atcgctgtat aaaagaaaca ttaagactta 1261 gacctcctat aatgatcatg atgagaatgg ccagaactcc tcagactgtg gcagggtata 1321 ccattcctcc aggacatcag gtgtgtgttt ctcccactgt caatcaaaga cttaaagact 1381 catgggtaga acgcctggac tttaatcctg atcgctactt acaggataac ccagcatcag 1441 gggaaaagtt tgcctatgtg ccatttggag ctgggcgtca tcgttgtatt ggggaaaatt 1501 ttgcctatgt tcaaattaag acaatttggt ccactatgct tcgtttatat gaatttgatc 1561 tcattgatgg atactttccc actgtgaatt atacaactat gattcacacc cctgagaacc 1621 cagttatccg ttacaaacga agatcaaaat gaaaaaggtt gcaaggaacg aatatatgtg 1681 attatcactg taagccacaa aggcattcga agagaatgaa gtgtacaaaa caactcttgt 1741 agtttactgt ttttttaagt gtgtaattct aaaagccagt ttatgattta ggattttgtt 1801 aactgaatgg ttctatcaaa tataatagca tttgacacat tttctaatag ttatgatact 1861 tatacatgtg ctttcaggaa gttccttggt gaaacaattg ttgagggggg atctaggtaa 1921 ttggcagatt ctaaataata taatttccag atagtaattt taagagtact catcgctctt 1981 gccaaataag ttcagggtat tcaaatcttg gactagtcct gcaaggtata aagaataaaa 2041 atcccagtga gatacttgga aaccacagtt tattattatt tatctgggca attattgtgt 2101 gtgtgaggat ggaagggtag ggaataatcg aacatctaaa gccttgaata agagaatact 2161 aattgttttg gtatgatgat actcagaaat ggagatatta taggaaaaag aaatcctttg 2221 gaattttaac taaaatcact gcatatggga aattaagaga tccaggacca tatttgataa 2281 gagttcctaa aaataatgta attattaatg ctaaagactg ctcatgtatc ttgatctaat 2341 tactaaataa attacatatt tatttacctg ataaatatgt atctagttct acaaggtcac 2401 atttatgtgg aagtccaaag tcaagtcctt aggggataat tttgttttgg gctcagttgt 2461 tccctgcttc cttttttttt tttttttttt tttgagatgg agtctcgctc tgttgcccag 2521 gctggagtgc agtggtgcga tctcagctca ctgcatcctc tgcctcccgg gttcaagcaa 2581 ttctctgcct cagcctccca agtagttggg attacaggca cctgccacca tgcctggcta 2641 attttttgta tttttagtag agacgggggt ttcactatgt tggctaggct ggtcttgaac 2701 tcctgagcct cgtgagtcca cccgccttgg cctcccaaag tgctgggatt acaggcatga 2761 gccaccgcac ctggccttcc ctgcttcctc tctagaatcc aattagggat gtttgttact 2821 actcatattg attaaaacag ttaacaaact tttttctttt taaaatgtga gatcagtgaa 2881 ctctggtttt aagataatct gaaacaaggt ccttgggagt aataaaattg gtcacattct 2941 gtaaagcaca ttctgtttag gaatcaactt atctcaaatt gtaactcggg gcctaactat 3001 atgagatggc tgaaaaaata ccacatcgtc tgttttcact aggtgatgcc aaaatatttt 3061 gctttatgta tattacagtt ctttttaaaa cactggaaga ctcatgttaa actctaattg 3121 tgaaggcaga atctctgcta atttttcaga ttaaaattct ctttgaaaaa at // LOCUS HSU23946 2575 bp mRNA PRI 16-MAY-1996 DEFINITION Human putative tumor suppressor (LUCA15) mRNA, complete cds. ACCESSION U23946 NID g1244403 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2575) AUTHORS Bader,S., Latif,F., Duh,F., Wei,M., Kashuba,V., Sekido,Y., Lee,C., Koonin,E., Zabarofsky,E., Klein,G., Minna,J.D. and Lerman,M. TITLE A putative tumor suppressor gene LUCA15 on 3p21.3 encodes two RNA recognizing motifs and is related to the Drosophila tumor suppresorgene Sxl JOURNAL Unpublished REFERENCE 2 (bases 1 to 2575) AUTHORS Duh,F. TITLE Direct Submission JOURNAL Submitted (31-MAR-1995) Fuh-Mei Duh, BCDP, PRI/ Dyncorp, NCI-FCRDC, Building 560, Rm 12-71, Frederick, MD 21702, USA FEATURES Location/Qualifiers source 1..2575 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="LUCA15" /chromosome="3" /map="3p21.3" /cell_type="islet beta-cells" 5'UTR 1..69 gene 70..2517 /gene="LUCA15" CDS 70..2517 /gene="LUCA15" /note="putative tumor suppressor" /codon_start=1 /db_xref="PID:g1244404" /translation="MGSDKRVSRTERSGRYGSIIDRDDRDERESRSRRRDSDYKRSSD DRRGDRYDGSRDYDSPERERERRNSDRSEDGYHSDGDYGEHDYRHDISDERESKTIML RGLPITITESDIREMMESFEGPQPADVRLMKRKTGVSRGFAFVEFYHLQDATSWMEAN QKKLVIQGKHIAMHYSNPRPKFEDWLCNKCCLNNFRKRLKCFRCGADKFDSEQEVPPG TTESVQSVDYYCDTIILRNIAPHTVVDSIMTALSPYASLAVNNIRLIKDKQTQQNRGF AFVQLSSAMDASQLLQILQSLHPPLKIDGKTIGVDFAKSARKDLVLSDGNRVSAFSVA STAIAAAQWSSTQSQSGEGVSVDYSYLQPGQDGYAQYAQYSQDYQQFYQQQAGGLESD ASSASGTAVTTTSAAVVSQSPQLYNQTSNPPGSPTEEAQPSTSTSTQAPAASPTGVVP GTKYAVPDTSTYQYDESSGYYYDPTTGLYYDPNSQYYYNSLTQQYLYWDGEKETYVPA AESSSHQQSGLPPAKEGKEKKEKPKSKTAQQIAKDMERWAKSLNKQKENFKNSFQPVN SLREEERRESAAADAGFALFEKKGALAERQQLIPELVRNGDEENPLKRGLVAAYSGDS DNEEELVERLESEEEKLADWKKMACLLCRRQFPNKDALVRHQQLSDLHKQNMDIYRRS RLSEQELEALELREREMKYRDRAAERREKYGIPEPPEPKRKKQFDAGTVNYEQPTKDG IDHSNIGNKMLQAMGWREGSGLGRKCQGITAPIEAQVRLKGAGLGAKGSAYGLSGADS YKDAVRKAMFARFIEME" 3'UTR 2518..2575 BASE COUNT 753 a 588 c 689 g 545 t ORIGIN 1 tcggtctctc cttgggaaaa aataaaattt gaaccttttg gagctgtgtg ctaaatcttc 61 agtgggacaa tgggttcaga caaaagagtg agtagaacag agcgtagtgg aagatacggt 121 tccatcatag acagggatga ccgtgatgag cgtgaatccc gaagcaggcg gagggactca 181 gattacaaaa gatctagtga tgatcggagg ggtgatagat atgatggctc ccgagactat 241 gacagtccag agagagagcg tgaaagaagg aacagtgacc gatccgaaga tggctaccat 301 tcagatggtg actatggtga gcacgactat aggcatgaca tcagtgacga gagggagagc 361 aagaccatca tgctgcgcgg ccttcccatc accatcacag agagcgatat tcgagaaatg 421 atggagtcct tcgaaggccc tcagcctgcg gatgtgaggc tgatgaagag gaaaacaggt 481 gtaagccgtg gtttcgcctt cgtggagttt tatcacttgc aagatgctac cagctggatg 541 gaagccaatc agaaaaagtt ggtgattcaa ggaaagcaca ttgcaatgca ttatagcaat 601 cccagaccta agtttgaaga ttggctttgt aacaagtgct gccttaacaa tttcaggaaa 661 agactaaaat gcttccgatg tggagcagac aagtttgact ctgaacagga agtgcctcct 721 ggaaccacag agtcggttca gtctgtggat tactactgtg atacgatcat tcttcggaac 781 atagctccgc acactgtggt ggattccatc atgacagcac tgtctcctta cgcgtcttta 841 gctgtcaata acatccgcct cataaaagac aaacagaccc agcagaacag aggcttcgca 901 tttgtgcagc tgtcctctgc aatggatgct tctcagctgc ttcagatatt acagagtctc 961 catcctcctt tgaaaattga tggcaaaact attggggttg attttgcaaa aagtgccaga 1021 aaagacttgg tcctctcaga tggtaaccgc gtcagcgcct tctctgtagc tagtacggct 1081 attgctgctg ctcagtggtc atccacccag tctcaaagtg gtgaaggagt cagtgttgac 1141 tacagttatc tgcaaccagg tcaagatggc tatgcccaat atgctcagta ttcacaggat 1201 tatcagcagt tttatcaaca acaagctgga ggattggaat ctgatgcatc atctgcatca 1261 ggcacagcag tgaccaccac ctcagcggct gtagtgtccc agagtcctca gctgtataat 1321 caaacctcca atccacctgg ctctccgact gaggaagcac agcctagcac tagcacaagt 1381 acacaggccc cagccgcttc ccctactggt gtagttcctg gtaccaaata tgcagtacct 1441 gacacgtcca cttaccagta tgatgaatct tcaggatatt actatgatcc gacaacaggg 1501 ctctattatg accccaactc gcaatactac tataattcct tgacccagca gtacctttac 1561 tgggatgggg aaaaagagac ctacgtgcca gctgcagagt ctagctccca ccagcagtcg 1621 ggcctgcctc ctgcaaaaga ggggaaagag aagaaggaga aacccaagag caaaacagcc 1681 cagcagattg ccaaagacat ggaacgctgg gctaagagtt tgaataagca gaaagaaaac 1741 tttaaaaata gctttcagcc tgtcaattcc ttgagggaag aagaaaggag agaatctgct 1801 gcagcagacg ctggctttgc tctctttgag aagaagggag ccttagctga aaggcagcag 1861 ctcatcccag aattggtgcg aaatggagat gaggagaatc ccctcaaaag gggtctggtt 1921 gctgcttaca gtggtgacag tgacaatgag gaggagctgg tggagagact tgagagtgag 1981 gaagagaagc tagctgactg gaagaagatg gcctgtctgc tctgccggcg ccagttcccg 2041 aacaaagatg ccctagtcag gcaccagcaa ctctcagacc ttcacaagca aaacatggac 2101 atctaccgac gatccaggct gagcgagcag gagctggaag ccttggagct aagggagaga 2161 gagatgaaat accgagaccg agctgcagaa agacgggaga agtacggcat tccagaacct 2221 ccagagccca agcgcaagaa gcagtttgat gccggcactg tgaattacga gcaacccacc 2281 aaagatggca ttgaccacag taacattggc aacaagatgc tgcaggccat gggctggcgg 2341 gaaggctctg gcttgggacg aaagtgtcaa ggcattacgg ctcccattga ggctcaagtt 2401 cggctaaagg gagctggcct aggagccaaa ggcagcgcat atggtttgtc gggcgccgat 2461 tcctacaaag atgctgtccg gaaagccatg tttgcccggt tcattgagat ggagtgagag 2521 agagagagag agagagatga caaggagcac aagaagtggt ccatctcccg aattc // LOCUS HSU24074 1579 bp mRNA PRI 12-JAN-1996 DEFINITION Human p58 natural killer cell receptor precursor mRNA, clone cl-6, complete cds. ACCESSION U24074 NID g897900 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1579) AUTHORS Wagtmann,N., Biassoni,R., Cantoni,C., Verdiani,S., Malnati,M.S., Vitale,M., Bottino,C., Moretta,L., Moretta,A. and Long,E.O. TITLE Molecular clones of the p58 NK cell receptor reveal immunoglobulin-related molecules with diversity in both the extra- and intracellular domains JOURNAL Immunity 2 (5), 439-449 (1995) MEDLINE 95269128 REFERENCE 2 (bases 1 to 1579) AUTHORS Wagtmann,N. TITLE Direct Submission JOURNAL Submitted (03-APR-1995) Nicolai Wagtmann, Laboratory of Immunogenetics, National Institute of Allergy and Infectious Diseases, Twinbrook II Facility, 12441 Parklawn Drive, Rockville, MD 20852-1727, USA FEATURES Location/Qualifiers source 1..1579 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="cl-6" /chromosome="19" /cell_line="NK3.3" /cell_type="natural killer cell" sig_peptide 22..84 /evidence=experimental CDS 22..1047 /note="p58 NK receptor" /codon_start=1 /product="p58 natural killer cell receptor precursor" /db_xref="PID:g897901" /translation="MSLMVVSMVCVGFFLLQGAWPHEGVHRKPSLLAHPGPLVKSEET VILQCWSDVRFQHFLLHREGKFKDTLHLIGEHHDGVSKANFSIGPMMQDLAGTYRCYG SVTHSPYQLSAPSDPLDIVITGLYEKPSLSAQPGPTVLAGESVTLSCSSRSSYDMYHL SREGEAHERRFSAGPKVNGTFQADFPLGPATHGGTYRCFGSFRDSPYEWSNSSDPLLV SVTGNPSNSWPSPTEPSSETGNPRHLHVLIGTSVVIILFILLLFFLLHRWCCNKKNAV VMDQEPAGNRTVNREDSDEQDPQEVTYAQLNHCVFTQRKITRPSQRPKTPPTDIIVYT ELPNAEP" mat_peptide 85..1044 /note="p58 NK receptor" /product="p58 natural killer cell receptor" misc_feature 756..816 /note="encodes transmembrane region" misc_feature 817..1044 /note="encodes cytoplasmic tail" BASE COUNT 379 a 484 c 334 g 382 t ORIGIN 1 cctgtctgca cagacagcac catgtcgctc atggtcgtca gcatggtgtg tgttgggttc 61 ttcttgctgc agggggcctg gccacatgag ggagtccaca gaaaaccttc cctcctggcc 121 cacccaggtc ccctggtgaa atcagaagag acagtcatcc tgcaatgttg gtcagatgtc 181 aggtttcagc acttccttct gcacagagaa gggaagttta aggacacttt gcacctcatt 241 ggagagcacc atgatggggt ctccaaggcc aacttctcca tcggtcccat gatgcaagac 301 cttgcaggga cctacagatg ctacggttct gttactcact ccccctatca gttgtcagct 361 cccagtgacc ctctggacat cgtcatcaca ggtctatatg agaaaccttc tctctcagcc 421 cagccgggcc ccacggttct ggcaggagag agcgtgacct tgtcctgcag ctcccggagc 481 tcctatgaca tgtaccatct atccagggag ggggaggccc atgaacgtag gttctctgca 541 gggcccaagg tcaacggaac attccaggcc gactttcctc tgggccctgc cacccacgga 601 ggaacctaca gatgcttcgg ctctttccgt gactctccat acgagtggtc aaactcgagt 661 gacccactgc ttgtttctgt cacaggaaac ccttcaaata gttggccttc acccactgaa 721 ccaagctccg aaaccggtaa ccccagacac ctgcatgttc tgattgggac ctcagtggtc 781 atcatcctct tcatcctcct cctcttcttt ctccttcatc gctggtgctg caacaaaaaa 841 aatgctgttg taatggacca agagcctgca gggaacagaa cagtgaacag ggaggactct 901 gatgaacaag accctcagga ggtgacatat gcacagttga atcactgcgt tttcacacag 961 agaaaaatca ctcgcccttc tcagaggccc aagacacccc caacagatat catcgtgtac 1021 acggaacttc caaatgctga gccctgatcc aaagttgtct cctgcccatg agcaccacag 1081 tcaggccttg aggggatctt ctagggagac aacagccctg tctcaaaact gggttgccag 1141 ctccaatgta ccagcagctg gaatctgaag gcgtgagtct gcatcttagg gcatcgctct 1201 tcctcacacc acaaatctga acgtgcctct cccttgctta caaatgtcta aggtccccac 1261 tgcctgctgg agagaaaaca cactcctttg cttagcccac aattctccat ttcacttgac 1321 ccctgcccac ctctccaacc taactggctt acttcctagt ctacttgagg ctgcaatcac 1381 actgaggaac tcacaattcc aaacatacaa gaggctccct cttaacacgg cacttagaca 1441 cgtgctgttc caccttccct catgctgttc cacctcccct cagactagct ttcagccttc 1501 tgtcagcagt aaaacttata tattttttaa aataatttca atgtagtttt ccctccttca 1561 aataaacatg tctgccctc // LOCUS HSU24105 4692 bp mRNA PRI 22-SEP-1997 DEFINITION Homo sapiens coatomer protein (COPA) mRNA, complete cds. ACCESSION U24105 NID g1638873 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 360 to 4692) AUTHORS Chow,V.T. and Quek,H.H. TITLE HEP-COP, a novel human gene whose product is highly homologous to the alpha-subunit of the yeast coatomer protein complex JOURNAL Gene 169 (2), 223-227 (1996) MEDLINE 96194806 REFERENCE 2 (bases 1 to 359) AUTHORS Quek,H.H. and Chow,V.T. TITLE Genomic organization and mapping of the human HEP-COP gene (COPA) to 1q JOURNAL Cytogenet. Cell Genet. 76 (3-4), 139-143 (1997) MEDLINE 97330048 REFERENCE 3 (bases 1 to 4692) AUTHORS Quek,H.H. and Chow,V.T. TITLE Molecular and cellular studies of the human homolog of the 160-kD alpha-subunit of the coatomer protein complex JOURNAL DNA Cell Biol. 16 (3), 275-280 (1997) MEDLINE 97238622 REFERENCE 4 (bases 1 to 4692) AUTHORS Quek,H.-H. TITLE Direct Submission JOURNAL Submitted (04-APR-1995) Hung-Hiang Quek, The National University of Singapore, Microbiology, Lower Kent Ridge Road, Kent Ridge, Singapore, 0511, Singapore FEATURES Location/Qualifiers source 1..4692 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /cell_line="Hep 3B hepatocellular carcinoma" 5'UTR 1..466 /gene="COPA" /evidence=experimental gene 1..4692 /note="HEP-COP" /gene="COPA" exon 1..506 /gene="COPA" /number=1 CDS 467..4141 /gene="COPA" /note="similar to alpha subunit of yeast coatomer complex, PIR Accession Number A55288" /codon_start=1 /product="coatomer protein" /db_xref="PID:g1002369" /translation="MLTKFETKSARVKGLSFHPKRPWILTSLHNGVIQLWDYRMCTLI DKFDEHDGPVRGIDFHKQQPLFVSGGDDYKIKVWNYKLRRCLFTLLGHLDYIRTTFFH HEYPWILSASDDQTIRVWNWQSRTCVCVLTGHNHYVMCAQFHPTEDLVVSASLDQTVR VWDISGLRKKNLSPGAVESDVRGITGVDLFGTTDAVVKHVLEGHDRGVNWAAFHPTMP LIVSGADDRQVKIWRMNESKAWEVDTCRGHYNNVSCAVFHPRQELILSNSEDKSIRVW DMSKRTGVQTFRRDHDRFWVLAAHPNLNLFAAGHDGGMIVFKLERERPAYAVHGNMLH YVKDRFLRQLDFNSSKDVAVMQLRSGSKFPVFNMSYNPAENAVLLCTRASNLENSTYD LYTIPKDADSQNPDAPEGKRSSGLTAVWVARNRFAVLDRMHSLLIKNLKNEITKKVQV PNCDEIFYAGTGNLLLRDADSITLFDVQQKRTLASVKISKVKYVIWSADMSHVALLAK HAIVICNRKLDALCNIHENIRVKSGAWDESGVFIYTTSNHIKYAVTTGDHGIIRTLDL PIYVTRVKGNNVYCLDRECRPRVLTIDPTEFKFKLALINRKYDEVLHMVRNAKLVGQS IIAYLQKKGYPEVALHFVKDEKTRFSLALECGNIEIALEAAKALDDKNCWEKLGEVAL LQGNHQIVEMCYQRTKNFDKVSFLYLITGNLEKLRKMMKIAEIRKDMSGHYQNALYLG DVSERVRILKNCGQKSLAYLTAATHGLDEEAESLKETFDPEKETIPDIDPNAKLLQPP APIMPLDTNWPLLTVSKGFFEGTIASKGKGGALAADIDIDTVGTEGWGEDAELQLDED GFVEATEGLGDDALGKGQEEGGGWDVEEDLELPPELDISPGAAGGAEDGFFVPPTKGT SPTQIWCNNSQLPVDHILAGSFETAMRLLHDQVGVIQFGPYKQLFLQTYARGRTTYQA LPCLPSMYGYPNRNWKDAGLKNGVPAVGLKLNDLIQRLQLCYQLTTVGKFEEAVEKFR SILLSVPLLVVDNKQEIAEAQQLITICREYIVGLSVETERKKLPKETLEQQKRICEMA AYFTHSNLQPVHMILVLRTALNLFFKLKNFKTAATFARRLLELGPKPEVAQQTRKILS ACEKNPTDAYQLNYDMHNPFDICAASYRPIYRGKPVEKCPLSGACYSPEFKGQICRVT TVTEIGKDVIGLRISPLQFR" repeat_unit 470..595 /gene="COPA" /rpt_family="WD-40" repeat_region 470..1315 /note="total of 6 WD-40 repeats within this repeat region" /rpt_type=tandem /rpt_family="WD-40" /rpt_unit=111..236 exon 507..620 /gene="COPA" /number=2 repeat_unit 596..721 /gene="COPA" /rpt_family="WD-40" exon 621..694 /gene="COPA" /number=3 exon 695..775 /gene="COPA" /number=4 repeat_unit 722..847 /gene="COPA" /rpt_family="WD-40" exon 776..852 /gene="COPA" /number=5 repeat_unit 848..973 /gene="COPA" /rpt_family="WD-40" exon 853..962 /gene="COPA" /number=6 exon 963..1072 /gene="COPA" /number=7 repeat_unit 1058..1183 /gene="COPA" /rpt_family="WD-40" exon 1073..1172 /gene="COPA" /number=8 exon 1173..1308 /gene="COPA" /number=9 repeat_unit 1190..1315 /gene="COPA" /rpt_family="WD-40" exon 1309..1391 /gene="COPA" /number=10 exon 1392..1542 /gene="COPA" /number=11 exon 1543..1609 /gene="COPA" /number=12 exon 1610..1685 /gene="COPA" /number=13 exon 1686..1768 /gene="COPA" /number=14 exon 1769..1908 /gene="COPA" /number=15 exon 1909..1994 /gene="COPA" /number=16 exon 1995..2133 /gene="COPA" /number=17 exon 2134..2296 /gene="COPA" /number=18 exon 2297..2443 /gene="COPA" /number=19 exon 2444..2633 /gene="COPA" /number=20 exon 2634..2729 /gene="COPA" /number=21 exon 2730..2818 /gene="COPA" /number=22 exon 2819..2942 /gene="COPA" /number=23 exon 2943..3032 /gene="COPA" /number=24 exon 3033..3142 /gene="COPA" /number=25 exon 3143..3220 /gene="COPA" /number=26 exon 3221..3289 /gene="COPA" /number=27 exon 3290..3426 /gene="COPA" /number=28 exon 3427..3613 /gene="COPA" /number=29 exon 3614..3724 /gene="COPA" /number=30 exon 3725..3886 /gene="COPA" /number=31 exon 3887..4081 /gene="COPA" /number=32 exon 4082..4692 /gene="COPA" /number=33 BASE COUNT 1213 a 1125 c 1230 g 1124 t ORIGIN 1 gagaagggga ccttcaggtc caggcaaagg gggaacttct gtcgtgggaa cgaaaaagaa 61 agaggattta cagggtgggg ggacagaggg gcagcaggaa ccagaaggga gacagtggcg 121 gtcgcaccgg ggccgatccg agagttcccc ttagagaacg gagctcacgg gcggggaggc 181 ctcacctgct agtaggacgc agaaagacag aaggcgaagg agaccccctg ccgtagccat 241 cttgcctctc tgctgagcgg aagcccccgt tcggctcctg tctgttagcg gcctctctag 301 gctaccactg acaccgtctc tgtggcccgg agcctaagag accggaagtt cgtgtttcca 361 ggcgcttccg gaaaccgcgg gagagggtcg ctgacgtgga ggcgtccgaa gggcagcagg 421 gtgtgtcggg gctcggatta agacatcgga gtcggagacc tgagagatgt taaccaaatt 481 cgagaccaag agcgcgcggg tcaaagggct cagctttcac cccaaaagac cttggatcct 541 gactagttta cataatgggg tcatccagtt atgggactat cggatgtgca ctctcattga 601 caagtttgat gaacatgatg gtccagtgcg aggcattgac ttccataagc agcagccact 661 gttcgtctct ggaggagatg actataagat taaggtttgg aattacaagc ttcggcgctg 721 tcttttcaca ttgcttgggc acttagatta tattcgcacc acgttttttc atcatgaata 781 tccctggatt ctgagtgcct ccgatgatca gaccatccga gtgtggaatt ggcaatctag 841 aacctgtgtt tgtgtgttaa cagggcacaa ccattatgtg atgtgtgctc agttccaccc 901 cacagaagac ttggtagtat cagccagcct ggaccagact gtgcgcgttt gggatatttc 961 tggtctgagg aaaaaaaacc tgtcccctgg tgcggtggaa tcggatgtga gaggaataac 1021 tggggttgat ctatttggaa ctacagatgc agtggtgaag catgtactag agggtcacga 1081 tcgtggagta aactgggctg ccttccaccc cactatgccc cttattgtat ctggggcaga 1141 tgatcgtcaa gtgaagatct ggcgcatgaa tgaatcaaag gcatgggagg ttgatacctg 1201 ccggggccat tacaacaatg tatcttgtgc cgtcttccac cctcgccaag agttgatcct 1261 cagcaattct gaggacaaga gtattcgagt ctgggatatg tctaagcgga ctggggttca 1321 gactttccgc agagaccatg atcgtttctg ggtcctagct gctcacccta accttaacct 1381 ctttgcagca ggccatgatg gtggtatgat tgtgtttaag ctggaacggg aacggccagc 1441 ctatgctgtt catggcaata tgctacacta tgtcaaggac cgattcttac gacagctgga 1501 tttcaacagc tccaaagatg tagctgtgat gcagttgcgg agtggttcca agtttccagt 1561 attcaatatg tcatacaatc cagcagaaaa tgcagtcctg ctttgtacaa gagctagcaa 1621 tctagagaat agtacctatg acctgtacac catccctaaa gatgctgact cccagaatcc 1681 tgatgcgcct gaagggaaac gatcctcagg cctgacagcc gtttgggtcg ctcgaaatcg 1741 gtttgctgtc ctagatcgga tgcattcgct tctgatcaag aatctgaaga atgagatcac 1801 caaaaaggta caggtgccca actgtgatga gatcttctat gctggcacag gcaatctcct 1861 gcttcgagat gcggactcta tcacactctt tgacgtacag cagaagcgga ctctggcatc 1921 tgtgaagatt tctaaagtga aatacgttat ctggtcagca gacatgtcac atgtagcact 1981 actagccaaa cacgccattg tgatctgtaa ccgcaaactg gatgctttat gtaacattca 2041 tgagaacatt cgtgtcaaga gtggggcctg ggatgagagt ggggtattta tctataccac 2101 aagcaaccac atcaaatatg ctgtcaccac tggggaccac gggatcattc gaactctgga 2161 tttacccatc tatgtcacac gggtgaaggg caacaatgta tactgcctag acagggagtg 2221 tcgtccccgg gtactcacca ttgatcccac tgagttcaaa ttcaagctgg ccctgatcaa 2281 cagaaaatat gatgaggtac tgcacatggt gaggaatgcc aaactagttg gccagtctat 2341 tattgcttat ctccagaaga agggctatcc tgaagtggca ctgcattttg tcaaggatga 2401 gaaaactcgc tttagtctgg cactggagtg tggaaacatt gagattgctc tggaagcagc 2461 caaagcactg gatgacaaga actgctggga aaagctggga gaagtggccc tgctgcaggg 2521 gaaccaccag attgtggaaa tgtgctatca gcgtaccaaa aactttgaca aagtttcctt 2581 cctgtatctt atcactggca acttagaaaa acttcgcaag atgatgaaga ttgctgagat 2641 cagaaaggac atgagtggcc actatcagaa tgccctatac ctgggtgatg tgtcagagcg 2701 tgtgcggatc ctgaagaact gtggacagaa gtccctggcc tatctcacag ctgctaccca 2761 tggcttagat gaagaagctg agagcctaaa ggagacattt gacccagaga aggagacaat 2821 cccagacatt gaccctaatg ccaagctgct ccagccacct gcacctatca tgccattgga 2881 taccaattgg cctttattga ctgtatccaa aggatttttt gaaggcacca ttgccagcaa 2941 agggaaggga ggagcactgg ctgctgacat tgacattgac actgttggta cagagggctg 3001 gggagaggat gcagagctgc agttggatga agatgggttt gtggaggcta cagaaggttt 3061 gggggatgat gctcttggca agggacagga agaaggaggt ggctgggatg tagaagaaga 3121 tctggagctc cctcctgagc tggatatatc ccctggggca gctggtgggg ctgaagatgg 3181 tttctttgtg cccccaacca agggaacaag tccaactcag atctggtgta ataactctca 3241 gcttccagtt gatcacatcc tggcaggctc tttcgaaaca gccatgcggc tccttcatga 3301 ccaagtaggg gtaatccagt ttggccccta caagcaactg ttcctacaga catacgcccg 3361 aggccgcaca acctatcagg ctctgccctg cctaccctcc atgtatggct atcctaatcg 3421 caactggaag gatgcagggc tgaagaatgg tgtaccagct gtgggcctga agcttaatga 3481 cctcatccaa cggttgcagc tgtgctacca gctcaccaca gttggcaaat ttgaggaggc 3541 tgtggaaaaa ttccgttcca tccttctcag tgtgccactt cttgttgtgg acaataaaca 3601 agagattgca gaggcccagc agctcatcac catttgccgt gagtacattg tgggtttgtc 3661 cgtggagaca gaaaggaaga agctgcccaa agagactcta gaacagcaga agcgcatctg 3721 tgagatggca gcctatttca cccactcaaa cctgcagcct gtgcacatga tcctggtgct 3781 gcgtacagcc ctcaatctgt tcttcaagct caagaacttc aagacagctg ccacctttgc 3841 tcggcgccta ctagaactcg ggcccaagcc tgaggtggcc caacagaccc gaaaaatcct 3901 gtctgcctgt gagaagaatc ccacagatgc ctaccagctc aattatgaca tgcacaaccc 3961 ctttgacatt tgtgctgcat catatcggcc catctaccgt ggaaagccag tagaaaagtg 4021 tccactcagt ggggcctgct attcccctga gttcaaaggt caaatctgca gggtcaccac 4081 agtgacagag attggcaaag atgtgattgg tttaaggatc agtcctctgc agtttcgcta 4141 aggccccctt tgtgtgcatg ggtcagtcac catatgttcc ccccagagaa tgtgtctata 4201 tcctccttct aacagcacct tccccctgca gctactcttc agatctggct ctctgtaccc 4261 taaaacctag tatctttttc tcttctatgg aaaatccgaa ggtctaaact tgactttttt 4321 gaggtcttct caacttgact acagttgtgc tcataattgt ccttgccttt ccagcttaat 4381 tattttaagg aacaaatgaa aactctgggc tgggtggagt ggctcatacc tgtaatccca 4441 gcactttggg aggctacggt gggcagatca tctgaggcca ggagttcgag acctgcctgg 4501 ccaacatggc aacaccccgt ctctaataaa aatataaaaa ttagcctggc atggtagcat 4561 gcgcctatag tcccagctgc tcaggaggct gaggcatgag aatcgcttga acctaggagg 4621 tggaggttgc attcaactga gatcatacca cttcattcca gcctgggtga cagagcaaga 4681 ctctgtctca aa // LOCUS HSU24152 2318 bp mRNA PRI 26-APR-1995 DEFINITION Human p21-activated protein kinase (Pak1) gene, complete cds. ACCESSION U24152 NID g780805 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2318) AUTHORS Sells,M., Knause,U.J., Bagrodia,S., Ambrose,D., Bokoch,G.M. and Chernoff,J. TITLE Human p21-activated protein kinases regulate actin organization in mammalian cells JOURNAL Unpublished REFERENCE 2 (bases 1 to 2318) AUTHORS Chernoff,J. TITLE Direct Submission JOURNAL Submitted (05-APR-1995) Jonathan Chernoff, Fox Chase Cancer Center, 7701 Burholme Ave, Philadelphia, PA 19111, USA FEATURES Location/Qualifiers source 1..2318 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="21" /clone_lib="lambda-YESR human lymphocyte cDNA library; Elledge and Spottswood, EMBO 10:2653(1991)" /cell_type="EBV-transformed peripheral lymphocytes" gene 394..2031 /gene="Pak1" CDS 394..2031 /gene="Pak1" /codon_start=1 /product="p21-activated protein kinase" /db_xref="PID:g780806" /translation="MSNNGLDIQDKPPAPPMRNTSTMIGVGSKDAGTLNHGSKPLPPN PEEKKKKDRFYRSILPGDKTNKKKEKERPEISLPSDFEHTIHVGFDAVTGEFTGMPEQ WARLLQTSNITKSEQKKNPQAVLDVLEFYNSKKTSNSQKYMSFTDKSAEDYNSSNALN VKAVSETPAVPPVSEDEDDDDDDATPPPVIAPRPEHTKSVYTRSVIEPLPVTPTRDVA TSPISPTENNTTPPDALTRNTEKQKKKPKMSDEEILEKLRSIVSVGDPKKKYTRFEKI GQGASGTVYTAMDVATGQEVAIKQMNLQQQPKKELIINEILVMRENKNPNIVNYLDSY LVGDELWVVMEYLAGGSLTDVVTETCMDEGQIAAVCRECLQALEFLHSNQVIHRDIKS DNILLGMDGSVKLTDFGFCAQITPEQSKRSTMVGTPYWMAPEVVTRKAYGPKVDIWSL GIMAIEMIEGEPPYLNENPLRALYLIATNGTPELQNPEKLSAIFRDFLNRCLDMDVEK RGSAKELLQHQFLKIAKPLSSLTPLIAAAKEATKNNH" BASE COUNT 640 a 604 c 560 g 514 t ORIGIN 1 gccacgaagg ccacagacgc cttccccctt ggactctcat tcccttttcc acggagcccc 61 gcgctttcgt gagccccctc gaggaacctg gtctccgcat ccagttacca cctcctgcct 121 cagaggccat ctgagccctt cgcacctcgc ccctcagtcc ccccttgccc ccccgcggag 181 atcgcctcgc tccctcccgc ccccccatca tcccttccct cgcagttccc ctgtcctgag 241 gggagccccg ccacggcagc gacagcgggc aggagggaga aagtgaaggt tgggcgacac 301 ttggcctcac tcccggctag gcgcacccac ggggaggaga ggaggagccg agagagctga 361 gcagcgcgga agtagctgct gctggtggtg acaatgtcaa ataacggcct agacattcaa 421 gacaaacccc cagcccctcc gatgagaaat accagcacta tgattggagt cggcagcaaa 481 gatgctggaa ccctaaacca tggttctaaa cctctgcctc caaacccaga ggagaagaaa 541 aagaaggacc gattttaccg atccatttta cctggagata aaacaaataa aaagaaagag 601 aaagagcggc cagagatttc tctcccttca gattttgaac acacaattca tgtcggtttt 661 gatgctgtca caggggagtt tacgggaatg ccagagcagt gggcccgctt gcttcagaca 721 tcaaatatca ctaagtcgga gcagaagaaa aacccgcagg ctgttctgga tgtgttggag 781 ttttacaact cgaagaagac atccaacagc cagaaataca tgagctttac agataagtca 841 gctgaggatt acaattcttc taatgccttg aatgtgaagg ctgtgtctga gactcctgca 901 gtgccaccag tttcagaaga tgaggatgat gatgatgatg atgctacccc accaccagtg 961 attgctccac gcccagagca cacaaaatct gtatacacac ggtctgtgat tgaaccactt 1021 cctgtcactc caactcggga cgtggctaca tctcccattt cacctactga aaataacacc 1081 actccaccag atgctttgac ccggaatact gagaagcaga agaagaagcc taaaatgtct 1141 gatgaggaga tcttggagaa attacgaagc atagtgagtg tgggcgatcc taagaagaaa 1201 tatacacggt ttgagaagat tggacaaggt gcttcaggca ccgtgtacac agcaatggat 1261 gtggccacag gacaggaggt ggccattaag cagatgaatc ttcagcagca gcccaagaaa 1321 gagctgatta ttaatgagat cctggtcatg agggaaaaca agaacccaaa cattgtgaat 1381 tacttggaca gttacctcgt gggagatgag ctgtgggttg ttatggaata cttggctgga 1441 ggctccttga cagatgtggt gacagaaact tgcatggatg aaggccaaat tgcagctgtg 1501 tgccgtgagt gtctgcaggc tctggagttc ttgcattcga accaggtcat tcacagagac 1561 atcaagagtg acaatattct gttgggaatg gatggctctg tcaagctaac tgactttgga 1621 ttctgtgcac agataacccc agagcagagc aaacggagca ccatggtagg aaccccatac 1681 tggatggcac cagaggttgt gacacgaaag gcctatgggc ccaaggttga catctggtcc 1741 ctgggcatca tggccatcga aatgattgaa ggggagcctc catacctcaa tgaaaaccct 1801 ctgagagcct tgtacctcat tgccaccaat gggaccccag aacttcagaa cccagagaag 1861 ctgtcagcta tcttccggga ctttctgaac cgctgtctcg atatggatgt ggagaagaga 1921 ggttcagcta aagagctgct acagcatcaa ttcctgaaga ttgccaagcc cctctccagc 1981 ctcactccac tgattgctgc agctaaggag gcaacaaaga acaatcacta aaaccacact 2041 caccccagcc tcattgtgcc aagctctgtg agataaatgc acatttcaga aattccaact 2101 cctgatgccc tcttctcctt gccttgcttc tcccatttcc tgatctagca ctcctcaaga 2161 ctttgatcct tggaaaccgt gtgtccagca ttgaagagaa ctgcaactga atgactaatc 2221 agatgatggc catttctaaa taaggaattt cctcccaatt catggatatg agggtggttt 2281 atgattaagg gtttatataa ataaatgttt ctagtctt // LOCUS HSU24153 2019 bp mRNA PRI 26-APR-1995 DEFINITION Human p21-activated protein kinase (Pak2) gene, complete cds. ACCESSION U24153 NID g780807 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2019) AUTHORS Sells,M., Knause,U.J., Bagrodia,S., Ambrose,D., Bokoch,G.M. and Chernoff,J. TITLE Human p21-activated protein kinases regulate actin organization in mammalian cells JOURNAL Unpublished REFERENCE 2 (bases 1 to 2019) AUTHORS Chernoff,J. TITLE Direct Submission JOURNAL Submitted (05-APR-1995) Jonathan Chernoff, Fox Chase Cancer Center, 7701 Burholme Ave, Philadelphia, PA 19111, USA FEATURES Location/Qualifiers source 1..2019 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="212" /clone_lib="lambdaYESR human lymphocyte cDNA library; Elledge and Spottswood, EMBO 10:2653(1991)" /cell_type="EBV-transformed peripheral lymphocytes" gene 40..1617 /gene="Pak2" CDS 40..1617 /gene="Pak2" /codon_start=1 /product="p21-activated protein kinase" /db_xref="PID:g780808" /translation="MSDNGELEDKPPAPPVRMSSTIFSTGGKDPLSANHSLKPLPSVP EEKKPRHKIISIFSGTEKGSKKKEKERPEISPPSDFEHTIHVGFDAVTGEFTGMPEQW ARLLQTSNITKLEQKKNPQAVLDVLKFYDSNTVKQKYLSFTPPEKDGLPSGTPALNAK GTEAPAVVTEEEDDDEETAPPVIAPRPDHTKSIYTRSVIDPVPAPVGDSHVDGAAKSL DKQKKKPKMTDEEIMEKLRTIVSIGDPKKKYTRYEKIGQGASGTVFTATDVALGQEVA IKQINLQKQPKKELIINEILVMKELKNPNIVNFLDSYLVGDELFVVMEYLAGGSLTDV VTETACMDEAQIAAVCRECLQALEFLHANQVIHRDIKSDNVLLGMEGSVKLTDFGFCA QITPEQSKRSTMVGTPYWMAPEVVTRKAYGPKVDIWSLGIMAIEMVEGEPPYLNENPL RALYLIATNGTPELQNPEKLSPIFRDFLNRCLEMDVEKRGSAKELLQHPFLKLAKPLS SLTPLIMAAKEAMKSNR" BASE COUNT 615 a 413 c 481 g 510 t ORIGIN 1 gaccttggct tgcccggggc catttcataa ttctgaatca tgtctgataa cggagaactg 61 gaagataagc ctccagcacc tcctgtgcga atgagcagca ccatctttag cactggaggc 121 aaagaccctt tgtcagccaa tcacagtttg aaacctttgc cctctgttcc agaagagaaa 181 aagcccaggc ataaaatcat ctccatattc tcaggcacag agaaaggaag taaaaagaaa 241 gaaaaggaac ggccagaaat ttctcctcca tctgattttg agcacaccat ccatgttggc 301 tttgatgctg ttactggaga attcactggc atgccagaac agtgggctcg attactacag 361 acctccaata tcaccaaact agagcaaaag aagaatcctc aggctgtgct ggatgtccta 421 aagttctacg actccaacac agtgaagcag aaatatctga gctttactcc tcctgagaaa 481 gatggccttc cttctggaac gccagcactg aatgccaagg gaacagaagc acccgcagta 541 gtgacagagg aggaggatga tgatgaagag actgctcctc ccgttattgc cccgcgaccg 601 gatcatacga aatcaattta cacacggtct gtaattgacc ctgttcctgc accagttggt 661 gattcacatg ttgatggtgc tgccaagtct ttagacaaac agaaaaagaa gcctaagatg 721 acagatgaag agattatgga gaaattaaga actatcgtga gcataggtga ccctaagaaa 781 aaatatacaa gatatgaaaa aattggacaa ggggcttctg gtacagtttt cactgctact 841 gacgttgcac tgggacagga ggttgctatc aaacaaatta atttacagaa acagccaaag 901 aaggaactga tcattaacga gattctggtg atgaaagaat tgaaaaatcc caacatcgtt 961 aactttttgg acagttacct ggtaggagat gaattgtttg tggtcatgga ataccttgct 1021 ggggggtcac tcactgatgt ggtaacagaa acagcttgca tggatgaagc acagattgct 1081 gctgtatgca gagagtgttt acaggcattg gagtttttac atgctaatca agtgatccac 1141 agagacatca aaagtgacaa tgtacttttg ggaatggaag gatctgttaa gctcactgac 1201 tttggtttct gtgcccagat cacccctgag cagagcaaac gcagtaccat ggtcggaacg 1261 ccatactgga tggcaccaga ggtggttaca cggaaagctt atggccctaa agtcgacata 1321 tggtctctgg gtatcatggc tattgagatg gtagaaggag agcctccata cctcaatgaa 1381 aatcccttga gggccttgta cctaatagca actaatggaa ccccagaact tcagaatcca 1441 gagaaacttt ccccaatatt tcgggatttc ttaaatcgat gtttggaaat ggatgtggaa 1501 aaaaggggtt cagccaaaga attattacag catcctttcc tgaaactggc caaaccgtta 1561 tctagcttga caccactgat catggcagct aaagaagcaa tgaagagtaa ccgttaacat 1621 cactgctgtg ggctcatact cttttttcca ttttctacaa gaagcctttt agtatatgaa 1681 aatgatgact ctgttggggg tttaaagaaa tggtctgcat aacctgaatg aaagaaggaa 1741 atgactattc tctgaagaca accaagagaa aattggaaaa gacaaggtat gactttgtta 1801 tgaacccctg cttttagggg tccaggaagg gatttgtggg acttgaattc actaggctta 1861 ggtctttcag gaaacaggct atcaggggca tttatcatgt gtgagattgg attctacttg 1921 ggtgatttgg tggatagacc catgaatggc ccctgggggt tttcaatctt ggattggagg 1981 tgggggtttc agagtgttgc cacgtctagc tcctctccc // LOCUS HSU24163 1476 bp mRNA PRI 18-OCT-1996 DEFINITION Human Frizzled related protein Frzb precursor (fzrb) mRNA, complete cds. ACCESSION U24163 NID g1620540 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1476) AUTHORS Hoang,B., Moos,M. Jr., Vukicevic,S. and Luyten,F.P. TITLE Primary structure and tissue distribution of FRZB, a novel protein related to Drosophila frizzled, suggest a role in skeletal morphogenesis JOURNAL J. Biol. Chem. 271 (42), 26131-26137 (1996) MEDLINE 96421609 REFERENCE 2 (bases 1 to 1476) AUTHORS Hoang,B., Moos,M., Vukicevic,S. and Luyten,F.P. TITLE Direct Submission JOURNAL Submitted (05-APR-1995) Malcolm Moos, Cellular & Gene Therapies, FDA/CBER, 1401 Rockville Pike, Suite 200N, HFM-527, Rockville, MD 20852-1448, USA FEATURES Location/Qualifiers source 1..1476 /organism="Homo sapiens" /db_xref="taxon:9606" RBS 206..212 gene 209..1186 /gene="frzb" CDS 209..1186 /gene="frzb" /note="protein present in highly purified fraction from mammalian articular cartilage associated with in vivo chondrogenic/osteogenic activity; it is expressed during human skeletogenesis in a graded pattern suggesting a role in polarity specification; Frizzled related protein; Method: conceptual translation supplied by author" /codon_start=1 /product="Frzb precursor" /db_xref="PID:g1620541" /translation="MVCGSPGGMLLLRAGLLALAALCLLRVPGARAAACEPVRIPLCK SLPWNMTKMPNHLHHSTQANAILAIEQFEGLLGTHCSPDLLFFLCAMYAPICTIDFQH EPINPCKSVCERARQGCEPILIKYRHSWPENLACEELPVYDRGVCISPEAIVTADGAD FPMDSSNGNCRGASSERCKCKPIRATQKTYFRNNYNYVIRAKVKEIKTKCHDVTAVVE VKEILKSSLVNIPRDTVNLYTSSGCLCPPLNVNEEYIIMGYEDEERSRLLLVEGSIAE KWKDRLGKKVKRWDMKLRHLGLSKSDSSNSDSTQSQKSGRNSNPRQARN" mat_peptide 284..1183 /gene="frzb" /note="Frizzled related protein" /product="Frzb" misc_feature 353..355 /gene="frzb" /note="encodes putative N-linked glycosylation site" misc_feature 402..404 /gene="frzb" /note="encodes putative N-linked glycosylation site" misc_feature 431..502 /gene="frzb" /note="encodes putative membrane spanning domain" misc_feature 480..551 /gene="frzb" /note="encodes putative membrane spanning domain" misc_feature 536..538 /gene="frzb" /note="encodes putative phosphorylation site" misc_feature 585..587 /gene="frzb" /note="encodes putative phosphorylation site" misc_feature 593..595 /gene="frzb" /note="encodes putative phosphorylation site" misc_feature 704..706 /gene="frzb" /note="encodes putative phosphorylation site" misc_feature 764..766 /gene="frzb" /note="encodes putative phosphorylation site" misc_feature 1025..1027 /gene="frzb" /note="encodes putative phosphorylation site" misc_feature 1103..1105 /gene="frzb" /note="encodes putative phosphorylation site" misc_feature 1118..1120 /gene="frzb" /note="encodes putative phosphorylation site" misc_feature 1139..1141 /gene="frzb" /note="encodes putative phosphorylation site" misc_feature 1148..1150 /gene="frzb" /note="encodes putative phosphorylation site" polyA_signal 1456..1461 polyA_site 1476 /note="8 A nucleotides" BASE COUNT 349 a 379 c 389 g 358 t 1 others ORIGIN 1 acggggcctg ggcggsaggg gcggtggctg gagctcggta aagctcgtgg gaccccattg 61 ggggaatttg atccaaggaa gcggtgattg ccgggggagg agaagctccc agatccttgt 121 gtccacttgc agcgggggag gcggagacgc ggagcgggcc ttttggcgtc cactgcgcgg 181 ctgcaccctg ccccatcctg ccgggatcat ggtctgcggc agcccgggag ggatgctgct 241 gctgcgggcc gggctgcttg ccctggctgc tctctgcctg ctccgggtgc ccggggctcg 301 ggctgcagcc tgtgagcccg tccgcatccc cctgtgcaag tccctgccct ggaacatgac 361 taagatgccc aaccacctgc accacagcac tcaggccaac gccatcctgg ccatcgagca 421 gttcgaaggt ctgctgggca cccactgcag ccccgatctg ctcttcttcc tctgtgccat 481 gtacgcgccc atctgcacca ttgacttcca gcacgagccc atcaacccct gtaagtctgt 541 gtgcgagcgg gcccggcagg gctgtgagcc catactcatc aagtaccgcc actcgtggcc 601 ggagaacctg gcctgcgagg agctgccagt gtacgacagg ggcgtgtgca tctctcccga 661 ggccatcgtt actgcggacg gagctgattt tcctatggat tctagtaacg gaaactgtag 721 aggggcaagc agtgaacgct gtaaatgtaa gcctattaga gctacacaga agacctattt 781 ccggaacaat tacaactatg tcattcgggc taaagttaaa gagataaaga ctaagtgcca 841 tgatgtgact gcagtagtgg aggtgaagga gattctaaag tcctctctgg taaacattcc 901 acgggacact gtcaacctct ataccagctc tggctgcctc tgccctccac ttaatgttaa 961 tgaggaatat atcatcatgg gctatgaaga tgaggaacgt tccagattac tcttggtgga 1021 aggctctata gctgagaagt ggaaggatcg actcggtaaa aaagttaagc gctgggatat 1081 gaagcttcgt catcttggac tcagtaaaag tgattctagc aatagtgatt ccactcagag 1141 tcagaagtct ggcaggaact cgaacccccg gcaagcacgc aactaaatcc cgaaatacaa 1201 aaagtaacac agtggacttc ctattaagac ttacttgcat tgctggacta gcaaaggaaa 1261 attgcactat tgcacatcat attctattgt ttactataaa aatcatgtga taactgatta 1321 ttacttctgt ttctcttttg gtttctgctt ctctcttctc tcaacccctt tgtaatggtt 1381 tgggggcaga ctcttaagta tattgtgagt tttctatttc actaatcatg agaaaaactg 1441 ttcttttgca ataataataa attaaacatg ctgtta // LOCUS HSU24169 1222 bp mRNA PRI 06-MAR-1996 DEFINITION Human JTV-1 (JTV-1) mRNA, complete cds. ACCESSION U24169 NID g1215668 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1222) AUTHORS Nicolaides,N.C., Kinzler,K.W. and Vogelstein,B. TITLE Analysis of the 5' region of PMS2 reveals heterogeneous transcripts and a novel overlapping gene JOURNAL Genomics 29 (2), 329-334 (1995) MEDLINE 96115582 REFERENCE 2 (bases 1 to 1222) AUTHORS Kinzler,K. W. TITLE Direct Submission JOURNAL Submitted (05-APR-1995) Kenneth W. Kinzler, Johns Hopkins Oncology Center, Johns Hopkins School of Medicine, Room 109, 424 North Bond St., Baltimore, MD 21231-1001 USA FEATURES Location/Qualifiers source 1..1222 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" gene 114..1052 /gene="JTV-1" CDS 114..1052 /gene="JTV-1" /codon_start=1 /product="JTV-1" /db_xref="PID:g1215669" /translation="MPMYQVKPYHGGGAPLRVELPTCMYRLPNVHGRSYGPAPGAGHV QEESNLSLQALESRQDDILKRLYELKAAVDGLSKMIQTPDADLDVTNIIQADEPTTLT TNALDLNSVLGKDYGALKDIVINANPASPPLSLLVLHRLLCEHFRVLSTVHTHSSVKS VPENLLKCFGEQNKKQPRQDYQLGFTLIWKNVPKTQMKFSIQTMCPIEGEGNIARFLF SLFGQKHNAVNATLIDSWVDIAIFQLKEGSSKEKAAVFRSMNSALGKSPWLAGNELTV ADVVLWSVLQQIGGCSVTVPANVQRWMRSCENLAPF" polyA_site 1222 /note="11 A nucleotides" BASE COUNT 293 a 311 c 319 g 299 t ORIGIN 1 ccgaacgccc gcagcagggt cagaagggag gtggccggtc tccgtcgtga cctctgacgg 61 tttctgagcg ttggcctttg gcacgcgcta cacccttttg ctttggttct gccatgccga 121 tgtaccaggt aaagccctat cacgggggcg gcgcgcctct ccgtgtggag cttcccacct 181 gcatgtaccg gctccccaac gtgcacggca ggagctacgg cccagcgccg ggcgctggcc 241 acgtgcagga agagtctaac ctgtctctgc aagctcttga gtcccgccaa gatgatattt 301 taaaacgtct gtatgagttg aaagctgcag ttgatggcct ctccaagatg attcaaacac 361 cagatgcaga cttggatgta accaacataa tccaagcgga tgagcccacg actttaacca 421 ccaatgcgct ggacttgaat tcagtgcttg ggaaggatta cggggcgctg aaagacatcg 481 tgatcaacgc aaacccggcc tcccctcccc tctccctgct tgtgctgcac aggctgctct 541 gtgagcactt cagggtcctg tccacggtgc acacgcactc ctcggtcaag agcgtgcctg 601 aaaaccttct caagtgcttt ggagaacaga ataaaaaaca gccccgccaa gactatcagc 661 tgggattcac tttaatttgg aagaatgtgc cgaagacgca gatgaaattc agcatccaga 721 cgatgtgccc catcgaaggc gaagggaaca ttgcacgttt cttgttctct ctgtttggcc 781 agaagcataa tgctgtcaac gcaaccctta tagatagctg ggtagatatt gcgatttttc 841 agttaaaaga gggaagcagt aaagaaaaag ccgctgtttt ccgctccatg aactctgctc 901 ttgggaagag cccttggctc gctgggaatg aactcaccgt agcagacgtg gtgctgtggt 961 ctgtactcca gcagatcgga ggctgcagtg tgacagtgcc agccaatgtg cagaggtgga 1021 tgaggtcttg tgaaaacctg gctccttttt aacacggccc tcaagctcct taagtgaatt 1081 gccgtaactg attttaaagg gtttagattt taagaatggt gctctttcat gcctattatc 1141 agtaagggga cttgtattag agtcagagtc tttttattta ggccagttgt caagtgtcaa 1201 taaaagcgca tcatgtaatt ta // LOCUS HSU24186 1565 bp DNA PRI 20-SEP-1996 DEFINITION Human replication protein A complex subunit homolog Rpa4 gene, complete cds. ACCESSION U24186 NID g887964 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1565) AUTHORS Keshav,K.F., Chen,C. and Dutta,A. TITLE Rpa4, a homolog of the 34-kilodalton subunit of the replication protein A complex JOURNAL Mol. Cell. Biol. 15 (6), 3119-3128 (1995) MEDLINE 95280910 REFERENCE 2 (bases 1 to 1565) AUTHORS Keshav,K.F. TITLE Direct Submission JOURNAL Submitted (05-APR-1995) Kylie F. Keshav, Department of Pathology, Brigham and Women's Hospital and Harvard Medical School, 75 Francis St, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..1565 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Acid activation-tagged HeLa cDNA library from R. Brent" CDS 405..1190 /note="replication protein A complex 34 kd subunit homolog" /codon_start=1 /product="Rpa4" /db_xref="PID:g887965" /translation="MSKSGFGSYASISAADGASGGSDQLCERDATPAIKTQRPKVRIQ DVVPCNVNQLLSSTVFDPVFKVRGIIVSQVSIVGVIRGAEKASNHICYKIDDMTAKPI EARQWFGREKVKQVTPLSVGVYVKVFGILKCPTGTKSLEVLKIHVLEDMNEFTVHILE TVNAHMMLDKARRDTTVESVPVSPSEVNDAGDNDESHRNFIQDEVLRLIHECPHQEGK SIHELRAQLCDLSVKAIKEAIDYLTVEGHIYPTVDREHFKSAD" BASE COUNT 441 a 320 c 403 g 401 t ORIGIN 1 tctagtaaaa atgcattttt atagagatgt tgggaaaggc ttcttgaaat tacacgtggg 61 acttttaata gataggcgct ttgaccagct aagcaacagg gctcccctcg tgtgggactt 121 ttagaatgta gcaaccactg acacgcaggg aaggattatg cgatcaggtg agaaggtggc 181 cgaccctgac tggctggaag cagatgcatt ctggtagttg attggtccac aggtagcgtg 241 acgcttgtca cgtcctcagc ctcccagcat tcaatcgtag cctttcggac agctcgaagc 301 ccttctgtgg agagctcgaa gccttctgtg gagaactcaa agccgtccgt ggagccccag 361 acgagccaaa gcccaccttc tcctcagcct gagctgtctt gaagatgagt aagagtgggt 421 ttgggagcta tgcgagcatt tctgctgctg atggagcgag tggaggcagt gaccaactgt 481 gtgagagaga tgcaactcct gctattaaga cccaaagacc taaggtccga attcaggacg 541 ttgtaccgtg taacgtgaac cagcttctca gctctactgt gtttgaccct gtgttcaagg 601 ttaggggaat tatagtttcc caggtctcca tcgtgggggt aatcagaggg gcagagaagg 661 cttcaaatca catttgttac aaaattgatg atatgaccgc gaaaccaatc gaggcccgac 721 agtggtttgg tagagagaaa gtcaagcagg tgactccatt gtcagtcgga gtatatgtca 781 aagtgtttgg tatcctcaaa tgtcccacgg gaacaaagag ccttgaggta ttgaaaattc 841 atgtcctaga ggacatgaac gagttcaccg tgcatattct ggaaacggtc aatgcacaca 901 tgatgctgga taaagcccgt cgtgatacca ctgtagaaag tgtgcctgtg tctccatcag 961 aagtgaatga tgctggggat aacgatgaga gtcaccgcaa tttcatccag gacgaagtgc 1021 tgcgtttgat tcatgagtgt cctcatcagg aagggaagag catccatgag ctccgggctc 1081 agctctgcga ccttagcgtc aaggccatca aggaagcgat tgattatctg accgttgagg 1141 gccacatcta tcccactgtg gatcgggagc attttaagtc tgctgattga ggcagggaaa 1201 acatcctttc atttttcgaa gacccttgca tccagctgtg agtaattttg acctgttgac 1261 tttttaggag taggactaaa aaaaaaaatc tcaagtggca ttctttgtca actcgctgct 1321 tttctaactg ctttgaactt ttcggatttt ctgtatttga agctcagaga gagacggtga 1381 tggataaatt gacaactctg taggatttac tagcaagcta atggaaacat gattttcggg 1441 gaagaaaaac tacagaaaat gtagaaattt attatttaat tgtgttggag cttctttttc 1501 caaaagaaaa actagttgca gtcagggagc cagcgaaaag acaaaaaaaa aaaaaaaaaa 1561 cacga // LOCUS HSU24266 3134 bp mRNA PRI 05-JUN-1996 DEFINITION Human pyrroline-5-carboxylate dehydrogenase (P5CDh) mRNA, long form, complete cds. ACCESSION U24266 NID g1353247 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3134) AUTHORS Hu,C.A., Lin,W.W. and Valle,D. TITLE Cloning, characterization, and expression of cDNAs encoding human delta 1-pyrroline-5-carboxylate dehydrogenase JOURNAL J. Biol. Chem. 271 (16), 9795-9800 (1996) MEDLINE 96199247 REFERENCE 2 (bases 1 to 3134) AUTHORS Hu,C.A., Lin,W. and Valle,D.L. TITLE Direct Submission JOURNAL Submitted (07-APR-1995) Chien-an A. Hu, Molecular Biology and Genetics, Johns Hopkins University, PCTB802, 725 N. Wolfe St, Baltimore, MD 21205, USA FEATURES Location/Qualifiers source 1..3134 /organism="Homo sapiens" /note="P5CDhL; this is the long form of the mRNA which contains a 1011-bp insert in the 3' untranslated region" /db_xref="taxon:9606" /chromosome="1" /tissue_type="kidney" gene 31..1722 /gene="P5CDh" CDS 31..1722 /gene="P5CDh" /EC_number="1.5.1.12" /note="Method: conceptual translation supplied by author" /codon_start=1 /function="P5C degradation" /product="pyrroline-5-carboxylate dehydrogenase" /db_xref="PID:g1353248" /translation="MLLPAPALRRALLSRPWTGAGLRWKHTSSLKVANEPVLAFTQGS PERDALQKALKDLKGRMEAIPCVMGDEEVWTSDVQYQVSPFNHGHKVAKFCYADKSLL NKAIEAALAARKEWDLKPIADRAQIFLKAADMLSGPRRAEILAKTMVGQGKTVIQAEI DAAAELIDFFRFNAKYAVELEGQQPISVPPSTNSTVYRGLEGFVAAISPFNFTAIGGN LAGAPALMGNVVLWKPSDTAMLASYAVYRILREAGLPPNIIQFVPADGPLFGDTVTSS EHLCGINFTGSVPTFKHLWKQVAQNLDRFHTFPRLAGECGGKNFHFVHRSADVESVVS GTLRSAFEYGGQKCSACSRLYVPHSLWPQIKGRLLEEHSRIKVGDPAEDFGTFFSAVI DAKSFARIKKWLEHARSSPSLTILAGGKCDDSVGYFVEPCIVESKDPQEPIMKEEIFG PVLSVYVYPDDKYKETLQLVDSTTSYGLTGAVFSQDKDVVQEATKVLRNAAGNFYIND KSTGSIVGQQPFGGARASGTNDKPGGPHYILRWTSPQVIKETHKPLGDWSYAYMQ" polyA_site 3134 /note="16 A nucleotides" BASE COUNT 573 a 959 c 946 g 656 t ORIGIN 1 tccagcgaac agccccgctt ctaacccgag atgctgctgc cggcgcccgc gctccgccgc 61 gccctgctgt cccgcccctg gaccggggcc ggcctgcggt ggaagcacac ctcctccctg 121 aaggtggcca acgagcccgt cttagccttc acgcagggca gccctgagcg agatgccctg 181 caaaaggcct tgaaggacct gaagggccgg atggaagcca tcccatgcgt gatgggggat 241 gaggaggtgt ggacgtcgga cgtgcagtac caagtgtcgc cttttaacca tggacataag 301 gtggccaagt tctgttatgc agacaagagc ctgctcaaca aagccattga ggctgccctg 361 gctgcccgga aagagtggga cctgaagcct attgcagacc gggcccagat cttcctgaag 421 gcggcagaca tgctgagtgg gccgcgcagg gctgagatcc tcgccaagac catggtggga 481 cagggtaaga ccgtgatcca agcggagatt gacgctgcag cggaactcat cgacttcttc 541 cggttcaatg ccaagtatgc ggtggagctg gaggggcagc agcccatcag cgtgcccccg 601 agcaccaaca gcacggtgta ccggggtctg gagggcttcg tggcggccat ctcgcccttt 661 aacttcactg caatcggcgg caacctggcg ggggcaccgg ccctgatggg caacgtggtc 721 ctatggaagc ccagtgacac tgccatgctg gccagctatg ctgtctaccg catccttcgg 781 gaggctggcc tgccccccaa catcatccag tttgtgccag ctgatgggcc cctatttggg 841 gacactgtca ccagctcaga gcacctctgt ggcatcaact tcacaggcag tgtgcccacc 901 ttcaaacacc tgtggaagca ggtggcccag aacctggacc ggttccacac cttcccacgc 961 ctggctggag agtgcggcgg aaagaacttc cacttcgtgc accgctcggc cgacgtggag 1021 agcgtggtga gcgggaccct ccgctcagcc ttcgagtacg gtggccagaa gtgttccgcc 1081 tgctcgcgtc tctacgtgcc gcactcgctg tggccgcaga tcaaagggcg gctgctggag 1141 gagcacagtc ggatcaaagt gggcgaccct gcagaggatt ttgggacctt cttctctgca 1201 gtgattgatg ccaagtcctt tgcccgtatc aagaagtggc tggagcacgc gcgctcctcg 1261 cccagcctca ccatcctggc tgggggcaag tgtgatgact ccgtgggcta ctttgtggag 1321 ccctgcatcg tggagagcaa ggaccctcag gagcccatca tgaaggagga gatcttcggg 1381 cctgtactgt ctgtgtacgt ctacccggac gacaagtaca aggagacgct gcagctggtt 1441 gacagcacca ccagctatgg cctcacgggg gcagtgttct cccaggataa ggacgtcgtg 1501 caggaggcca caaaggtgct gaggaatgct gccggcaact tctacatcaa cgacaagtcc 1561 actggctcga tagtgggcca gcagcccttt gggggggccc gagcctctgg aaccaatgac 1621 aagccagggg gcccacacta catcctgcgc tggacgtcgc cgcaggtcat caaggagaca 1681 cataagcccc tgggggactg gagctacgcg tacatgcagt gagcccctct cgggctccac 1741 cgtccagctg tctgtccgtc caggtggccg acctcactgc acagacccca ctccagcccc 1801 tccacccctt cttcatgcac agctgccttt ctataatccg ggcttgactc ccttcttacc 1861 actgtattct ggcctctccc atgcctcagg ctctggtttg agatcgtgct ggggaggaac 1921 atggccacta ccccttatcc catcggccat gtgggaggta tgaccctggt gcctggcagg 1981 ttctccctct gccctccact gggcccagtg gctcagggac ctggggaaag gagatggagc 2041 agctcttggg atcctttggg gaaaaggagg ccattctggg ccccttggca aacctcacca 2101 ctcacagagg ctcctggcct tgatccctgc ccctccaggt gtccagggta aagtgtaact 2161 cagactgacc tgtggggcac agggggcacc agctggcctt gccctctctg gtctgggctg 2221 tctaccttcc tcactgtatc tttgcccaga cccacctggg ccagtaggcc tgtccccagc 2281 cacacacctt agatgctggc atgccttact ccaggtgcct gtgtttggcc gaggcctgtg 2341 tgattcccgg tctgcaccac atggcggggt tggggggccg ctggaggcca cctgccaagg 2401 cgtgggatgg gatggtcctg ccggtttagg ccgtgattct ggaaaacctt ggatgggcct 2461 tcgtcctatg tcagccttcc ctttgatcct caggccctac ctgtagagac ctccactcct 2521 agagccagtc tcagggtctg ggatttccct gcaggagctc agccaccact gtgccatggt 2581 gacacaggcc aaggcagaca ttggccctcc cttctcccag cccccagagg cctggccttg 2641 ggttcgtcag catgggccga ggacgttgcc tgtagaatcc tcctctgcct gggagtggct 2701 ctgtgtggac cagtccctca ctggcccatt ctttttttga cgcagccaat ctgtgaccac 2761 gattcctccc acagatgcct cctgcttgga ttctgagtgg tcagagatct gtaaagcatg 2821 actttcaagg atggttctta ggggactgtg taaagtgttg ggtcttcctc caggatgcct 2881 gcatgggacc ccacccggag ctggtgtggc cattccccga agtgccactg gcccatggat 2941 gggggtgggt gctggtgcac ctgggctggg tgtgggttct gtgtccttcc aggatctgtg 3001 tcatttccca tgaggggccg gggcaggtgg ctgggtgggg gcacaggctg gagtattctt 3061 agttctactg gttctacact gtgaggtggc aatgggattt gctcagatgc cacccaataa 3121 aatgcctgtt actt // LOCUS HSU24497 14063 bp mRNA PRI 05-MAY-1995 DEFINITION Human autosomal dominant polycystic kidney disease protein 1 (PKD1) mRNA, complete cds. ACCESSION U24497 U24499 NID g799334 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 14063) AUTHORS The International Polycystic Kidney Disease Consortium,. TITLE Polycystic kidney disease: The complete structure of the PKD1 gene and its protein JOURNAL Cell 81, 289-298 (1995) REFERENCE 2 (sites) AUTHORS The European Polycystic Kidney Disease Consortium,. TITLE The polycystic kidney disease 1 gene encodes a 14 kb transcript and lies within a duplicated region on chromosome 16. The European Polycystic Kidney Disease Consortium [published erratum appears in Cell 1994 Aug 26;78(4):following 724] JOURNAL Cell 77 (6), 881-894 (1994) MEDLINE 94273192 REFERENCE 3 (bases 1 to 14063) AUTHORS Glucksmann-Kuis,M.A. TITLE Direct Submission JOURNAL Submitted (10-APR-1995) M. Alexandra Glucksmann-Kuis, Genomics, Millennium Pharmaceuticals, 640 Memorial Drive, Cambridge, MA 02139, USA FEATURES Location/Qualifiers source 1..14063 /organism="Homo sapiens" /db_xref="taxon:9606" /map="16p13.3" /chromosome="16" gene 136..13047 /gene="PKD1" CDS 136..13047 /gene="PKD1" /codon_start=1 /product="autosomal dominant polycystic kidney disease protein 1" /db_xref="PID:g799335" /translation="MPPAAPARLALALGLGLWLGALAGGPGRGCGPCEPPCLCGPAPG AACRVNCSGRGLRTLGPALRIPADATELDVSHNLLRALDVGLLANLSALAELDISNNK ISTLEEGIFANLFNLSEINLSGNPFECDCGLAWLPQWAEEQQVRVVQPEAATCAGPGS LAGQPLLGIPLLDSGCGEEYVACLPDNSSGTVAAVSFSAAHEGLLQPEACSAFCFSTG QGLAALSEQGWCLCGAAQPSSASFACLSLCSGPPAPPAPTCRGPTLLQHVFPASPGAT LVGPHGPLASGQLAAFHIAAPLPVTDTRWDFGDGSAEVDAAGPAASHRYVLPGRYHVT AVLALGAGSALLGTDVQVEAAPAALELVCPSSVQSDESLDLSIQNRGGSGLEAAYSIV ALGEEPARAVHPLCPSDTEIFPGNGHCYRLVVEKAAWLQAQEQCQAWAGAALAMVDSP AVQRFLVSRVTRSLDVWIGFSTVQGVEVGPAPQGEAFSLESCQNWLPGEPHPATAEHC VRLGPTGWCNTDLCSAPHSYVCELQPGGPVQDAENLLVGAPSGDLQGPLTPLAQQDGL SAPHEPVEVMVFPGLRLSREAFLTTAEFGTQELRRPAQLRLQVYRLLSTAGTPENGSE PESRSPDNRTQLAPACMPGGRWCPGANICLPLDASCHPQACANGCTSGPGLPGAPYAL WREFLFSVPAGPPAQYSVTLHGQDVLMLPGDLVGLQHDAGPGALLHCSPAPGHPGPRA PYLSANASSWLPHLPAQLEGTWGCPACALRLLAQREQLTVLLGLRPNPGLRLPGRYEV RAEVGNGVSRHNLSCSFDVVSPVAGLRVIYPAPRDGRLYVPTNGSALVLQVDSGANAT ATARWPGGSLSARFENVCPALVATFVPACPWETNDTLFSVVALPWLSEGEHVVDVVVE NSASRANLSLRVTAEEPICGLRATPSPEARVLQGVLVRYSPVVEAGSDMVFRWTINDK QSLTFQNVVFNVIYQSAAVFKLSLTASNHVSNVTVNYNVTVERMNRMQGLQVSTVPAV LSPNATLALTAGVLVDSAVEVAFLWTFGDGEQALHQFQPPYNESFPVPDPSVAQVLVE HNVTHTYAAPGEYLLTVLASNAFENLTQQVPVSVRASLPSVAVGVSDGVLVAGRPVTF YPHPLPSPGGVLYTWDFGDGSPVLTQSQPAANHTYASRGTYHVRLEVNNTVSGAAAQA DVRVFEELRGLSVDMSLAVEQGAPVVVSAAVQTGDNITWTFDMGDGTVLSGPEATVEH VYLRAQNCTVTVGAGSPAGHLARSLHVLVFVLEVLRVEPAACIPTQPDARLTAYVTGN PAHYLFDWTFGDGSSNTTVRGCPTVTHNFTRSGTFPLALVLSSRVNRAHYFTSICVEP EVGNVTLQPERQFVQLGDEAWLVACAWPPFPYRYTWDFGTEEAAPTRARGPEVTFIYR DPGSYLVTVTASNNISAANDSALVEVQEPVLVTSIKVNGSLGLELQQPYLFSAVGRGR PASYLWDLGDGGWLEGPEVTHAYNSTGDFTVRVAGWNEVSRSEAWLNVTVKRRVRGLV VNASRTVVPLNGSVSFSTSLEAGSDVRYSWVLCDRCTPIPGGPTISYTFRSVGTFNII VTAENEVGSAQDSIFVYVLQLIEGLQVVGGGRYFPTNHTVQLQAVVRDGTNVSYSWTA WRDRGPALAGSGKGFSLTVLEAGTYHVQLRATNMLGSAWADCTMDFVEPVGWLMVAAS PNPAAVNTSVTLSAELAGGSGVVYTWSLEEGLSWETSEPFTTHSFPTPGLHLVTMTAG NPLGSANATVEVDVQVPVSGLSIRASEPGGSFVAAGSSVPFWGQLATGTNVSWCWAVP GGSSKRGPHVTMVFPDAGTFSIRLNASNAVSWVSATYNLTAEEPIVGLVLWASSKVVA PGQLVHFQILLAAGSAVTFRLQVGGANPEVLPGPRFSHSFPRVGDHVVSVRGKNHVSW AQAQVRIVVLEAVSGLQVPNCCEPGIATGTERNFTARVQRGSRVAYAWYFSLQKVQGD SLVILSGRDVTYTPVAAGLLEIQVRAFNALGSENRTLVLEVQDAVQYVALQSGPCFTN RSAQFEAATSPSPRRVAYHWDFGDGSPGQDTDEPRAEHSYLRPGDYRVQVNASNLVSF FVAQATVTVQVLACREPEVDVVLPLQVLMRRSQRNYLEAHVDLRDCVTYQTEYRWEVY RTASCQRPGRPARVALPGVDVSRPRLVLPRLALPVGHYCFVFVVSFGDTPLTQSIQAN VTVAPERLVPIIEGGSYRVWSDTRDLVLDGSESYDPNLEDGDQTPLSFHWACVASTQR EAGGCALNFGPRGSSTVTIPRERLAAGVEYTFSLTVWKAGRKEEATNQTVLIRSGRVP IVSLECVSCKAQAVYEVSRSSYVYLEGRCLNCSSGSKRGRWAARTFSNKTLVLDETTT STGSAGMRLVLRRGVLRDGEGYTFTLTVLGRSGEEEGCASIRLSPNRPPLGGSCRLFP LGAVHALTTKVHFECTGWHDAEDAGAPLVYALLLRRCRQGHCEEFCVYKGSLSSYGAV LPPGFRPHFEVGLAVVVQDQLGAAVVALNRSLAITLPEPNGSATGLTVWLHGLTASVL PGLLRQADPQHVIEYSLALVTVLNEYERALDVAAEPKHERQHRAQIRKNITETLVSLR VHTVDDIQQIAAALAQCMGPSRELVCRSCLKQTLHKLEAMMLILQAETTAGTVTPTAI GDSILNITGDLIHLASSDVRAPQPSELGAESPSRMVASQAYNLTSALMRILMRSRVLN EEPLTLAGEEIVAQGKRSDPRSLLCYGGAPGPGCHFSIPEAFSGALANLSDVVQLIFL VDSNPFPFGYISNYTVSTKVASMAFQTQAGAQIPIERLASERAITVKVPNNSDWAARG HRSSANSANSVVVQPQASVGAVVTLDSSNPAAGLHLQLNYTLLDGHYLSEEPEPYLAV YLHSEPRPNEHNCSASRRIRPESLQGADHRPYTFFISPGSRDPAGSYHLNLSSHFRWS ALQVSVGLYTSLCQYFSEEDMVWRTEGLLPLEETSPRQAVCLTRHLTAFGASLFVPPS HVRFVFPEPTADVNYIVMLTCAVCLVTYMVMAAILHKLDQLDASRGRAIPFCGQRGRF KYEILVKTGWGRGSGTTAHVGIMLYGVDSRSGHRHLDGDRAFHRNSLDIFRIATPHSL GSVWKIRVWHDNKGLSPAWFLQHVIVRDLQTARSAFFLVNDWLSVETEANGGLVEKEV LAASDAALLRFRRLLVAELQRGFFDKHIWLSIWDRPPRSRFTRIQRATCCVLLICLFL GANAVWYGAVGDSAYSTGHVSRLSPLSVDTVAVGLVSSVVVYPVYLAILFLFRMSRSK VAGSPSPTPAGQQVLDIDSCLDSSVLDSSFLTFSGLHAEQAFVGQMKSDLFLDDSKSL VCWPSGEGTLSWPDLLSDPSIVGSNLRQLARGQAGHGLGPEEDGFSLASPYSPAKSFS ASDEDLIQQVLAEGVSSPAPTQDTHMETDLLSSLSSTPGEKTETLALQRLGELGPPSP GLNWEQPQAARLSRTGLVEGLRKRLLPAWCASLAHGLSLLLVAVAVAVSGWVGASFPP GVSVAWLLSSSASFLASFLGWEPLKVLLEALYFSLVAKRLHPDEDDTLVESPAVTPVS ARVPRVRPPHGFALFLAKEEARKVKRLHGMLRSLLVYMLFLLVTLLASYGDASCHGHA YRLQSAIKQELHSRAFLAITRSEELWPWMAHVLLPYVHGNQSSPELGPPRLRQVRLQE ALYPDPPGPRVHTCSAAGGFSTSDYDVGWESPHNGSGTWAYSAPDLLGAWSWGSCAVY DSGGYVQELGLSLEESRDRLRFLQLHNWLDNRSRAVFLELTRYSPAVGLHAAVTLRLE FPAAGRALAALSVRPFALRRLSAGLSLPLLTSVCLLLFAVHFAVAEARTWHREGRWRV LRLGAWARWLLVALTAATALVRLAQLGAADRQWTRFVRGRPRRFTSFDQVAHVSSAAR GLAASLLFLLLVKAAQHVRFVRQWSVFGKTLCRALPELLGVTLGLVVLGVAYAQLAIL LVSSCVDSLWSVAQALLVLCPGTGLSTLCPAESWHLSPLLCVGLWALRLWGALRLGAV ILRWRYHALRGELYRPAWEPQDYEMVELFLRRLRLWMGLSKVKEFRHKVRFEGMEPLP SRSSRGSKVSPDVPPPSAGSDASHPSTSSSQLDGLSVSLGRLGTRCEPEPSRLQAVFE ALLTQFDRLNQATEDVYQLEQQLHSLQGRRSSRAPAGSSRGPSPGLRPALPSRLARAS RGVDLATGPSRTPLRAKNKVHPSST" BASE COUNT 2148 a 4789 c 4614 g 2512 t ORIGIN 1 gctcagcagc aggtcgcggc cgcagcccca tccagccccg cgcccgccat gccgtccgcc 61 ggccccgcct gagccgcggc ctccgcgcgc gggcgggcct ggggacggcg gggccatgcg 121 cgcgctgccc taacgatgcc gcccgccgcg cccgcccgcc tggcgctggc cctgggcctg 181 ggcctgtggc tcggggcgct ggcggggggg cccgggcgcg gctgcgggcc ctgcgagccc 241 ccctgcctct gcgggccagc gcccggcgcc gcctgccgcg tcaactgctc gggccgcggg 301 ctgcggacgc tcggtcccgc gctgcgcatc cccgcggacg ccacagagct agacgtctcc 361 cacaacctgc tccgggcgct ggacgttggg ctcctggcga acctctcggc gctggcagag 421 ctggatataa gcaacaacaa gatttctacg ttagaagaag gaatatttgc taatttattt 481 aatttaagtg aaataaacct gagtgggaac ccgtttgagt gtgactgtgg cctggcgtgg 541 ctgccgcaat gggcggagga gcagcaggtg cgggtggtgc agcccgaggc agccacgtgt 601 gctgggcctg gctccctggc tggccagcct ctgcttggca tccccttgct ggacagtggc 661 tgtggtgagg agtatgtcgc ctgcctccct gacaacagct caggcaccgt ggcagcagtg 721 tccttttcag ctgcccacga aggcctgctt cagccagagg cctgcagcgc cttctgcttc 781 tccaccggcc agggcctcgc agccctctcg gagcagggct ggtgcctgtg tggggcggcc 841 cagccctcca gtgcctcctt tgcctgcctg tccctctgct ccgggccccc ggcacctcct 901 gcccccacct gtaggggccc caccctcctc cagcacgtct tccctgcctc cccaggggcc 961 accctggtgg ggccccacgg acctctggcc tctggccagc tagcagcctt ccacatcgct 1021 gccccgctcc ctgtcactga cacacgctgg gacttcggag acggctccgc cgaggtggat 1081 gccgctgggc cggctgcctc gcatcgctat gtgctgcctg ggcgctatca cgtgacggcc 1141 gtgctggccc tgggggccgg ctcagccctg ctggggacag acgtgcaggt ggaagcggca 1201 cctgccgccc tggagctcgt gtgcccgtcc tcggtgcaga gtgacgagag cctcgacctc 1261 agcatccaga accgcggtgg ttcaggcctg gaggccgcct acagcatcgt ggccctgggc 1321 gaggagccgg cccgagcggt gcacccgctc tgcccctcgg acacggagat cttccctggc 1381 aacgggcact gctaccgcct ggtggtggag aaggcggcct ggctgcaggc gcaggagcag 1441 tgtcaggcct gggccggggc cgccctggca atggtggaca gtcccgccgt gcagcgcttc 1501 ctggtctccc gggtcaccag gagcctagac gtgtggatcg gcttctcgac tgtgcagggg 1561 gtggaggtgg gcccagcgcc gcagggcgag gccttcagcc tggagagctg ccagaactgg 1621 ctgcccgggg agccacaccc agccacagcc gagcactgcg tccggctcgg gcccaccggg 1681 tggtgtaaca ccgacctgtg ctcagcgccg cacagctacg tctgcgagct gcagcccgga 1741 ggcccagtgc aggatgccga gaacctcctc gtgggagcgc ccagtgggga cctgcaggga 1801 cccctgacgc ctctggcaca gcaggacggc ctctcagccc cgcacgagcc cgtggaggtc 1861 atggtattcc cgggcctgcg tctgagccgt gaagccttcc tcaccacggc cgaatttggg 1921 acccaggagc tccggcggcc cgcccagctg cggctgcagg tgtaccggct cctcagcaca 1981 gcagggaccc cggagaacgg cagcgagcct gagagcaggt ccccggacaa caggacccag 2041 ctggcccccg cgtgcatgcc agggggacgc tggtgccctg gagccaacat ctgcttgccg 2101 ctggacgcct cctgccaccc ccaggcctgc gccaatggct gcacgtcagg gccagggcta 2161 cccggggccc cctatgcgct atggagagag ttcctcttct ccgttcccgc ggggcccccc 2221 gcgcagtact cggtcaccct ccacggccag gatgtcctca tgctccctgg tgacctcgtt 2281 ggcttgcagc acgacgctgg ccctggcgcc ctcctgcact gctcgccggc tcccggccac 2341 cctggtcccc gggccccgta cctctccgcc aacgcctcgt catggctgcc ccacttgcca 2401 gcccagctgg agggcacttg gggctgccct gcctgtgccc tgcggctgct tgcacaacgg 2461 gaacagctca ccgtgctgct gggcttgagg cccaaccctg gactgcggct gcctgggcgc 2521 tatgaggtcc gggcagaggt gggcaatggc gtgtccaggc acaacctctc ctgcagcttt 2581 gacgtggtct ccccagtggc tgggctgcgg gtcatctacc ctgccccccg cgacggccgc 2641 ctctacgtgc ccaccaacgg ctcagccttg gtgctccagg tggactctgg tgccaacgcc 2701 acggccacgg ctcgctggcc tgggggcagt ctcagcgccc gctttgagaa tgtctgccct 2761 gccctggtgg ccaccttcgt gcccgcctgc ccctgggaga ccaacgatac cctgttctca 2821 gtggtagcac tgccgtggct cagtgagggg gagcacgtgg tggacgtggt ggtggaaaac 2881 agcgccagcc gggccaacct cagcctgcgg gtgacggcgg aggagcccat ctgtggcctc 2941 cgcgccacgc ccagccccga ggcccgtgta ctgcagggag tcctagtgag gtacagcccc 3001 gtggtggagg ccggctcgga catggtcttc cggtggacca tcaacgacaa gcagtccctg 3061 accttccaga acgtggtctt caatgtcatt tatcagagcg cggcggtctt caagctctca 3121 ctgacggcct ccaaccacgt gagcaacgtc accgtgaact acaacgtaac cgtggagcgg 3181 atgaacagga tgcagggtct gcaggtctcc acagtgccgg ccgtgctgtc ccccaatgcc 3241 acgctagcac tgacggcggg cgtgctggtg gactcggccg tggaggtggc cttcctgtgg 3301 acctttgggg atggggagca ggccctccac cagttccagc ctccgtacaa cgagtccttc 3361 ccagttccag acccctcggt ggcccaggtg ctggtggagc acaatgtcac gcacacctac 3421 gctgccccag gtgagtacct cctgaccgtg ctggcatcta atgccttcga gaacctgacg 3481 cagcaggtgc ctgtgagcgt gcgcgcctcc ctgccctccg tggctgtggg tgtgagtgac 3541 ggcgtcctgg tggccggccg gcccgtcacc ttctacccgc acccgctgcc ctcgcctggg 3601 ggtgttcttt acacgtggga cttcggggac ggctcccctg tcctgaccca gagccagccg 3661 gctgccaacc acacctatgc ctcgaggggc acctaccacg tgcgcctgga ggtcaacaac 3721 acggtgagcg gtgcggcggc ccaggcggat gtgcgcgtct ttgaggagct ccgcggactc 3781 agcgtggaca tgagcctggc cgtggagcag ggcgcccccg tggtggtcag cgccgcggtg 3841 cagacgggcg acaacatcac gtggaccttc gacatggggg acggcaccgt gctgtcgggc 3901 ccggaggcaa cagtggagca tgtgtacctg cgggcacaga actgcacagt gaccgtgggt 3961 gcgggcagcc ccgccggcca cctggcccgg agcctgcacg tgctggtctt cgtcctggag 4021 gtgctgcgcg ttgaacccgc cgcctgcatc cccacgcagc ctgacgcgcg gctcacggcc 4081 tacgtcaccg ggaacccggc ccactacctc ttcgactgga ccttcgggga tggctcctcc 4141 aacacgaccg tgcgggggtg cccgacggtg acacacaact tcacgcggag cggcacgttc 4201 cccctggcgc tggtgctgtc cagccgcgtg aacagggcgc attacttcac cagcatctgc 4261 gtggagccag aggtgggcaa cgtcaccctg cagccagaga ggcagtttgt gcagctcggg 4321 gacgaggcct ggctggtggc atgtgcctgg cccccgttcc cctaccgcta cacctgggac 4381 tttggcaccg aggaagccgc ccccacccgt gccaggggcc ctgaggtgac gttcatctac 4441 cgagacccag gctcctatct tgtgacagtc accgcgtcca acaacatctc tgctgccaat 4501 gactcagccc tggtggaggt gcaggagccc gtgctggtca ccagcatcaa ggtcaatggc 4561 tcccttgggc tggagctgca gcagccgtac ctgttctctg ctgtgggccg tgggcgcccc 4621 gccagctacc tgtgggatct gggggacggt gggtggctcg agggtccgga ggtcacccac 4681 gcttacaaca gcacaggtga cttcaccgtt agggtggccg gctggaatga ggtgagccgc 4741 agcgaggcct ggctcaatgt gacggtgaag cggcgcgtgc gggggctcgt cgtcaatgca 4801 agccgcacgg tggtgcccct gaatgggagc gtgagcttca gcacgtcgct ggaggccggc 4861 agtgatgtgc gctattcctg ggtgctctgt gaccgctgca cgcccatccc tgggggtcct 4921 accatctctt acaccttccg ctccgtgggc accttcaata tcatcgtcac ggctgagaac 4981 gaggtgggct ccgcccagga cagcatcttc gtctatgtcc tgcagctcat agaggggctg 5041 caggtggtgg gcggtggccg ctacttcccc accaaccaca cggtacagct gcaggccgtg 5101 gttagggatg gcaccaacgt ctcctacagc tggactgcct ggagggacag gggcccggcc 5161 ctggccggca gcggcaaagg cttctcgctc accgtgctcg aggccggcac ctaccatgtg 5221 cagctgcggg ccaccaacat gctgggcagc gcctgggccg actgcaccat ggacttcgtg 5281 gagcctgtgg ggtggctgat ggtggccgcc tccccgaacc cagctgccgt caacacaagc 5341 gtcaccctca gtgccgagct ggctggtggc agtggtgtcg tatacacttg gtccttggag 5401 gaggggctga gctgggagac ctccgagcca tttaccaccc atagcttccc cacacccggc 5461 ctgcacttgg tcaccatgac ggcagggaac ccgctgggct cagccaacgc caccgtggaa 5521 gtggatgtgc aggtgcctgt gagtggcctc agcatcaggg ccagcgagcc cggaggcagc 5581 ttcgtggcgg ccgggtcctc tgtgcccttt tgggggcagc tggccacggg caccaatgtg 5641 agctggtgct gggctgtgcc cggcggcagc agcaagcgtg gccctcatgt caccatggtc 5701 ttcccggatg ctggcacctt ctccatccgg ctcaatgcct ccaacgcagt cagctgggtc 5761 tcagccacgt acaacctcac ggcggaggag cccatcgtgg gcctggtgct gtgggccagc 5821 agcaaggtgg tggcgcccgg gcagctggtc cattttcaga tcctgctggc tgccggctca 5881 gctgtcacct tccgcctaca ggtcggcggg gccaaccccg aggtgctccc cgggccccgt 5941 ttctcccaca gcttcccccg cgtcggagac cacgtggtga gcgtgcgggg caaaaaccac 6001 gtgagctggg cccaggcgca ggtgcgcatc gtggtgctgg aggccgtgag tgggctgcag 6061 gtgcccaact gctgcgagcc tggcatcgcc acgggcactg agaggaactt cacagcccgc 6121 gtgcagcgcg gctctcgggt cgcctacgcc tggtacttct cgctgcagaa ggtccagggc 6181 gactcgctgg tcatcctgtc gggccgcgac gtcacctaca cgcccgtggc cgcggggctg 6241 ttggagatcc aggtgcgcgc cttcaacgcc ctgggcagtg agaaccgcac gctggtgctg 6301 gaggttcagg acgccgtcca gtatgtggcc ctgcagagcg gcccctgctt caccaaccgc 6361 tcggcgcagt ttgaggccgc caccagcccc agcccccggc gtgtggccta ccactgggac 6421 tttggggatg ggtcgccagg gcaggacaca gatgagccca gggccgagca ctcctacctg 6481 aggcctgggg actaccgcgt gcaggtgaac gcctccaacc tggtgagctt cttcgtggcg 6541 caggccacgg tgaccgtcca ggtgctggcc tgccgggagc cggaggtgga cgtggtcctg 6601 cccctgcagg tgctgatgcg gcgatcacag cgcaactact tggaggccca cgttgacctg 6661 cgcgactgcg tcacctacca gactgagtac cgctgggagg tgtatcgcac cgccagctgc 6721 cagcggccgg ggcgcccagc gcgtgtggcc ctgcccggcg tggacgtgag ccggcctcgg 6781 ctggtgctgc cgcggctggc gctgcctgtg gggcactact gctttgtgtt tgtcgtgtca 6841 tttggggaca cgccactgac acagagcatc caggccaatg tgacggtggc ccccgagcgc 6901 ctggtgccca tcattgaggg tggctcatac cgcgtgtggt cagacacacg ggacctggtg 6961 ctggatggga gcgagtccta cgaccccaac ctggaggacg gcgaccagac gccgctcagt 7021 ttccactggg cctgtgtggc ttcgacacag agggaggctg gcgggtgtgc gctgaacttt 7081 gggccccgcg ggagcagcac ggtcaccatt ccacgggagc ggctggcggc tggcgtggag 7141 tacaccttca gcctgaccgt gtggaaggcc ggccgcaagg aggaggccac caaccagacg 7201 gtgctgatcc ggagtggccg ggtgcccatt gtgtccttgg agtgtgtgtc ctgcaaggca 7261 caggccgtgt acgaagtgag ccgcagctcc tacgtgtact tggagggccg ctgcctcaat 7321 tgcagcagcg gctccaagcg agggcggtgg gctgcacgta cgttcagcaa caagacgctg 7381 gtgctggatg agaccaccac atccacgggc agtgcaggca tgcgactggt gctgcggcgg 7441 ggcgtgctgc gggacggcga gggatacacc ttcacgctca cggtgctggg ccgctctggc 7501 gaggaggagg gctgcgcctc catccgcctg tcccccaacc gcccgccgct ggggggctct 7561 tgccgcctct tcccactggg cgctgtgcac gccctcacca ccaaggtgca cttcgaatgc 7621 acgggctggc atgacgcgga ggatgctggc gccccgctgg tgtacgccct gctgctgcgg 7681 cgctgtcgcc agggccactg cgaggagttc tgtgtctaca agggcagcct ctccagctac 7741 ggagccgtgc tgcccccggg tttcaggcca cacttcgagg tgggcctggc cgtggtggtg 7801 caggaccagc tgggagccgc tgtggtcgcc ctcaacaggt ctttggccat caccctccca 7861 gagcccaacg gcagcgcaac ggggctcaca gtctggctgc acgggctcac cgctagtgtg 7921 ctcccagggc tgctgcggca ggccgatccc cagcacgtca tcgagtactc gttggccctg 7981 gtcaccgtgc tgaacgagta cgagcgggcc ctggacgtgg cggcagagcc caagcacgag 8041 cggcagcacc gagcccagat acgcaagaac atcacggaga ctctggtgtc cctgagggtc 8101 cacactgtgg atgacatcca gcagatcgct gctgcgctgg cccagtgcat ggggcccagc 8161 agggagctcg tatgccgctc gtgcctgaag cagacgctgc acaagctgga ggccatgatg 8221 ctcatcctgc aggcagagac caccgcgggc accgtgacgc ccaccgccat cggagacagc 8281 atcctcaaca tcacaggaga cctcatccac ctggccagct cggacgtgcg ggcaccacag 8341 ccctcagagc tgggagccga gtcaccatct cggatggtgg cgtcccaggc ctacaacctg 8401 acctctgccc tcatgcgcat cctcatgcgc tcccgcgtgc tcaacgagga gcccctgacg 8461 ctggcgggcg aggagatcgt ggcccagggc aagcgctcgg acccgcggag cctgctgtgc 8521 tatggcggcg ccccagggcc tggctgccac ttctccatcc ccgaggcttt cagcggggcc 8581 ctggccaacc tcagtgacgt ggtgcagctc atctttctgg tggactccaa tccctttccc 8641 tttggctata tcagcaacta caccgtctcc accaaggtgg cctcgatggc attccagaca 8701 caggccggcg cccagatccc catcgagcgg ctggcctcag agcgcgccat caccgtgaag 8761 gtgcccaaca actcggactg ggctgcccgg ggccaccgca gctccgccaa ctccgccaac 8821 tccgttgtgg tccagcccca ggcctccgtc ggtgctgtgg tcaccctgga cagcagcaac 8881 cctgcggccg ggctgcatct gcagctcaac tatacgctgc tggacggcca ctacctgtct 8941 gaggaacctg agccctacct ggcagtctac ctacactcgg agccccggcc caatgagcac 9001 aactgctcgg ctagcaggag gatccgccca gagtcactcc agggtgctga ccaccggccc 9061 tacaccttct tcatttcccc ggggagcaga gacccagcgg ggagttacca tctgaacctc 9121 tccagccact tccgctggtc ggcgctgcag gtgtccgtgg gcctgtacac gtccctgtgc 9181 cagtacttca gcgaggagga catggtgtgg cggacagagg ggctgctgcc cctggaggag 9241 acctcgcccc gccaggccgt ctgcctcacc cgccacctca ccgccttcgg cgccagcctc 9301 ttcgtgcccc caagccatgt ccgctttgtg tttcctgagc cgacagcgga tgtaaactac 9361 atcgtcatgc tgacatgtgc tgtgtgcctg gtgacctaca tggtcatggc cgccatcctg 9421 cacaagctgg accagttgga tgccagccgg ggccgcgcca tccctttctg tgggcagcgg 9481 ggccgcttca agtacgagat cctcgtcaag acaggctggg gccggggctc aggtaccacg 9541 gcccacgtgg gcatcatgct gtatggggtg gacagccgga gcggccaccg gcacctggac 9601 ggcgacagag ccttccaccg caacagcctg gacatcttcc ggatcgccac cccgcacagc 9661 ctgggtagcg tgtggaagat ccgagtgtgg cacgacaaca aagggctcag ccctgcctgg 9721 ttcctgcagc acgtcatcgt cagggacctg cagacggcac gcagcgcctt cttcctggtc 9781 aatgactggc tttcggtgga gacggaggcc aacgggggcc tggtggagaa ggaggtgctg 9841 gccgcgagcg acgcagccct tttgcgcttc cggcgcctgc tggtggctga gctgcagcgt 9901 ggcttctttg acaagcacat ctggctctcc atatgggacc ggccgcctcg tagccgtttc 9961 actcgcatcc agagggccac ctgctgcgtt ctcctcatct gcctcttcct gggcgccaac 10021 gccgtgtggt acggggctgt tggcgactct gcctacagca cggggcatgt gtccaggctg 10081 agcccgctga gcgtcgacac agtcgctgtt ggcctggtgt ccagcgtggt tgtctatccc 10141 gtctacctgg ccatcctttt tctcttccgg atgtcccgga gcaaggtggc tgggagcccg 10201 agccccacac ctgccgggca gcaggtgctg gacatcgaca gctgcctgga ctcgtccgtg 10261 ctggacagct ccttcctcac gttctcaggc ctccacgctg agcaggcctt tgttggacag 10321 atgaagagtg acttgtttct ggatgattct aagagtctgg tgtgctggcc ctccggcgag 10381 ggaacgctca gttggccgga cctgctcagt gacccgtcca ttgtgggtag caatctgcgg 10441 cagctggcac ggggccaggc gggccatggg ctgggcccag aggaggacgg cttctccctg 10501 gccagcccct actcgcctgc caaatccttc tcagcatcag atgaagacct gatccagcag 10561 gtccttgccg agggggtcag cagcccagcc cctacccaag acacccacat ggaaacggac 10621 ctgctcagca gcctgtccag cactcctggg gagaagacag agacgctggc gctgcagagg 10681 ctgggggagc tggggccacc cagcccaggc ctgaactggg aacagcccca ggcagcgagg 10741 ctgtccagga caggactggt ggagggtctg cggaagcgcc tgctgccggc ctggtgtgcc 10801 tccctggccc acgggctcag cctgctcctg gtggctgtgg ctgtggctgt ctcagggtgg 10861 gtgggtgcga gcttcccccc gggcgtgagt gttgcgtggc tcctgtccag cagcgccagc 10921 ttcctggcct cattcctcgg ctgggagcca ctgaaggtct tgctggaagc cctgtacttc 10981 tcactggtgg ccaagcggct gcacccggat gaagatgaca ccctggtaga gagcccggct 11041 gtgacgcctg tgagcgcacg tgtgccccgc gtacggccac cccacggctt tgcactcttc 11101 ctggccaagg aagaagcccg caaggtcaag aggctacatg gcatgctgcg gagcctcctg 11161 gtgtacatgc tttttctgct ggtgaccctg ctggccagct atggggatgc ctcatgccat 11221 gggcacgcct accgtctgca aagcgccatc aagcaggagc tgcacagccg ggccttcctg 11281 gccatcacgc ggtctgagga gctctggcca tggatggccc acgtgctgct gccctacgtc 11341 cacgggaacc agtccagccc agagctgggg cccccacggc tgcggcaggt gcggctgcag 11401 gaagcactct acccagaccc tcccggcccc agggtccaca cgtgctcggc cgcaggaggc 11461 ttcagcacca gcgattacga cgttggctgg gagagtcctc acaatggctc ggggacgtgg 11521 gcctattcag cgccggatct gctgggggca tggtcctggg gctcctgtgc cgtgtatgac 11581 agcgggggct acgtgcagga gctgggcctg agcctggagg agagccgcga ccggctgcgc 11641 ttcctgcagc tgcacaactg gctggacaac aggagccgcg ctgtgttcct ggagctcacg 11701 cgctacagcc cggccgtggg gctgcacgcc gccgtcacgc tgcgcctcga gttcccggcg 11761 gccggccgcg ccctggccgc cctcagcgtc cgcccctttg cgctgcgccg cctcagcgcg 11821 ggcctctcgc tgcctctgct cacctcggtg tgcctgctgc tgttcgccgt gcacttcgcc 11881 gtggccgagg cccgtacttg gcacagggaa gggcgctggc gcgtgctgcg gctcggagcc 11941 tgggcgcggt ggctgctggt ggcgctgacg gcggccacgg cactggtacg cctcgcccag 12001 ctgggtgccg ctgaccgcca gtggacccgt ttcgtgcgcg gccgcccgcg ccgcttcact 12061 agcttcgacc aggtggcgca cgtgagctcc gcagcccgtg gcctggcggc ctcgctgctc 12121 ttcctgcttt tggtcaaggc tgcccagcac gtacgcttcg tgcgccagtg gtccgtcttt 12181 ggcaagacat tatgccgagc tctgccagag ctcctggggg tcaccttggg cctggtggtg 12241 ctcggggtag cctacgccca gctggccatc ctgctcgtgt cttcctgtgt ggactccctc 12301 tggagcgtgg cccaggccct gttggtgctg tgccctggga ctgggctctc taccctgtgt 12361 cctgccgagt cctggcacct gtcacccctg ctgtgtgtgg ggctctgggc actgcggctg 12421 tggggcgccc tacggctggg ggctgttatt ctccgctggc gctaccacgc cttgcgtgga 12481 gagctgtacc ggccggcctg ggagccccag gactacgaga tggtggagtt gttcctgcgc 12541 aggctgcgcc tctggatggg cctcagcaag gtcaaggagt tccgccacaa agtccgcttt 12601 gaagggatgg agccgctgcc ctctcgctcc tccaggggct ccaaggtatc cccggatgtg 12661 cccccaccca gcgctggctc cgatgcctcg cacccctcca cctcctccag ccagctggat 12721 gggctgagcg tgagcctggg ccggctgggg acaaggtgtg agcctgagcc ctcccgcctc 12781 caagccgtgt tcgaggccct gctcacccag tttgaccgac tcaaccaggc cacagaggac 12841 gtctaccagc tggagcagca gctgcacagc ctgcaaggcc gcaggagcag ccgggcgccc 12901 gccggatctt cccgtggccc atccccgggc ctgcggccag cactgcccag ccgccttgcc 12961 cgggccagtc ggggtgtgga cctggccact ggccccagca ggacacccct tcgggccaag 13021 aacaaggtcc accccagcag cacttagtcc tccttcctgg cgggggtggg ccgtggagtc 13081 ggagtggaca ccgctcagta ttactttctg ccgctgtcaa ggccgagggc caggcagaat 13141 ggctgcacgt aggttcccca gagagcaggc aggggcatct gtctgtctgt gggcttcagc 13201 actttaaaga ggctgtgtgg ccaaccagga cccagggtcc cctccccagc tcccttggga 13261 aggacacagc agtattggac ggtttctagc ctctgagatg ctaatttatt tccccgagtc 13321 ctcaggtaca gcgggctgtg cccggcccca ccccctgggc agatgtcccc cactgctaag 13381 gctgctggct tcagggaggg ttagcctgca ccgccgccac cctgccccta agttattacc 13441 tctccagttc ctaccgtact ccctgcaccg tctcactgtg tgtctcgtgt cagtaattta 13501 tatggtgtta aaatgtgtat atttttgtat gtcactattt tcactagggc tgaggggcct 13561 gcgcccagag ctggcctccc ccaacacctg ctgcgcttgg taggtgtggt ggcgttatgg 13621 cagcccggct gctgcttgga tgcgagcttg gccttgggcc ggtgctgggg gcacagctgt 13681 ctgccaggca ctctcatcac cccagaggcc ttgtcatcct cccttgcccc aggccaggta 13741 gcaagagagc agcgcccagg cctgctggca tcaggtctgg gcaagtagca ggactaggca 13801 tgtcagagga ccccagggtg gttagaggaa aagactcctc ctgggggctg gctcccaggg 13861 tggaggaagg tgactgtgtg tgtgtgtgtg tgcgcgcgcg acgcgcgagt gtgctgtatg 13921 gcccaggcag cctcaaggcc ctcggagctg gctgtgcctg cttctgtgta ccacttctgt 13981 gggcatggcc gcttctagag cctcgacacc cccccaaccc ccgcaccaag cagacaaagt 14041 caataaaaga gctgtctgac tgc // LOCUS HSU24576 2080 bp mRNA PRI 31-MAR-1997 DEFINITION Human breast tumor autoantigen mRNA, complete sequence. ACCESSION U24576 NID g1914876 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2080) AUTHORS Racevskis,J., Dill,A. and Ruan,H. TITLE Human breast tumor autoantigen JOURNAL Unpublished REFERENCE 2 (bases 1 to 2080) AUTHORS Racevskis,J. TITLE Direct Submission JOURNAL Submitted (31-MAR-1997) Montefiore Medical Center, Oncology, 111 East 210th Street, Bronx, NY 10467, USA FEATURES Location/Qualifiers source 1..2080 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="female" /tissue_type="ductal breast tumor" CDS 781..1278 /codon_start=1 /evidence=not_experimental /product="breast tumor autoantigen" /db_xref="PID:g1914877" /translation="MVNPGSSSQPPPVTAGSLSWKRCAGCGGKIADRFLLYAMDSYWH SRCLKCSCCQAQLGDIGTSCYTKSGMILCRNDYIRLFGNSGACSACGQSIPASELVMR AQGNVYHLKCFTCSTCRNRLVPGDRFHYINGSLFCEHDRPTALINGHLNSLQSNPLLP DQKVC" BASE COUNT 529 a 550 c 561 g 440 t ORIGIN 1 gctctgtcag taacacatgt gtaagagccg cggagggagc gagcgagccg gctagaggcc 61 agcgccgccg ccgccgccgc ctccgagccg ggcagcaaca gccccggcag cggcgcaggc 121 tccagcgcgc cgggcccggc cggccgcagc ccccgacgcc tgggtgcgcc tgcctgccgg 181 cctccgcacc gtccgccgcc gctcccgggg ctgttgtgtc tgcgactgct cccggccgga 241 ggtgcaggga gctcagccga gccgccgctg ccatcccgga gcgagcaagc gagcgagcgc 301 gcgggaggga ggaaggcggc ggcggaggag gaggaggagc gggaggagcg cgggcggggg 361 cgggggccgc cgggcggggg aatatacaaa gtgaagccac attgccaaac ttgcagcagc 421 gattgcagca gttgctgccg ctgcgccgcg cctgaagccg cgccgcgcgg gccgagggct 481 cctgcagctg ctcgcgcgca gtcggaggcg gagaaggacg aagactgaga ctgacacttc 541 tgctcccggc cgcccggcac ttacgcgggg gccccccaac ccgccccaga gcaacgcgat 601 ttaaaaaaaa aaaaaaagcc gcccttagcc ccctcctctc ctttcctgct tctgcgagaa 661 ctccctccct ccctccagct ccgccagccc aggcgcccct tccctggaag ccgagcggct 721 tcgctcgcat ttcaccgccg ccgcctctcg caatattgca atatagggga aaagcagacc 781 atggtgaatc cgggcagcag ctcgcagccg cccccggtga cggccggctc cctctcctgg 841 aagcggtgcg caggctgcgg gggcaagatt gcggaccgct ttctgctcta tgccatggac 901 agctattggc acagccggtg cctcaagtgc tcctgctgcc aggcgcagct gggcgacatc 961 ggcacgtcct gttacaccaa aagtggcatg atcctttgca gaaatgacta cattaggtta 1021 tttggaaata gcggtgcttg cagcgcttgc ggacagtcga ttcctgcgag tgaactcgtc 1081 atgagggcgc aaggcaatgt gtatcatctt aagtgtttta catgctctac ctgccggaat 1141 cgcctggtcc cgggagatcg gtttcactac atcaatggca gtttattttg tgaacatgat 1201 agacctacag ctctcatcaa tggccatttg aattcacttc agagcaatcc actactgcca 1261 gaccagaagg tctgctaaaa ggtcagagta atgcagaatg cgtgccttca tctcagattt 1321 gttcatcaca ggtggatccc atgtgtcttc agtagacaag tcacctttgt agctagcacc 1381 agtgccagct ccatgccatt gcaccttctt tagtcttgat tgcccttccc gcatttattg 1441 gtgtattaaa atgactgaat atgaacatta aggactccat gaacctgggc taatgggaga 1501 ctgtagagaa aatgaaaaaa gatccaccag aggacatctt ggggaggggg agggagctgg 1561 gggggaggga aatgactaat gaagctaatt aaaagaagca ttcaaatctg ctttctaccc 1621 tcattaacaa ttagcagggc actggccaga gtttgtaccc tgtgttttac cttaacaaca 1681 ttctatttgc tctttgtata tttaagtgtt gtaaggaaac gtgtttcaat caaaactgac 1741 catgagataa aggaaagaga tgtggctttt gtgatattct atcacaaaca cttattgtat 1801 ctctgtaaaa tacaatgtat gtatgcatgt aagtgttttt gtcctaatgt tgctactccc 1861 atggcaaaga aaaaaaaaag aatgaaaaaa agaaaaaaaa tttggaaaaa aaaatcaggc 1921 tcatagcagc tactgtgtag aaaattcccc ctacttctaa tttgctgaat gaagaaaaaa 1981 aaaaatcttt tatttgtgat attttcagag acatttgctc tagtatggtg tatttaaata 2041 ataaaaactt aaaagaaaaa ataaaaaaaa aaaaaaaaaa // LOCUS HSU24660 2074 bp mRNA PRI 08-NOV-1995 DEFINITION Human G protein coupled inward rectifier potassium channel 2 (hiGIRK2) mRNA, complete cds. ACCESSION U24660 NID g1052874 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2074) AUTHORS Ferrer,J., Nichols,C.G., Makhina,E.N., Salkoff,L., Bernstein,J., Gerhard,D., Wasson,J., Ramanadham,S. and Permutt,A. TITLE Pancreatic islet cells express a family of inwardly rectifying K+ channel subunits which interact to form G-protein-activated channels JOURNAL J. Biol. Chem. 270 (44), 26086-26091 (1995) MEDLINE 96064672 REFERENCE 2 (bases 1 to 2074) AUTHORS Permutt,M. TITLE Direct Submission JOURNAL Submitted (12-APR-1995) M. Alan Permutt, Internal Medicine, Washington University School of Medicine, 660 S. Euclid, St. Louis, MO 63110, USA FEATURES Location/Qualifiers source 1..2074 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pK1.8" /cell_type="pancreatic islet" /tissue_type="insulinoma" gene 213..1484 /gene="hiGIRK2" CDS 213..1484 /gene="hiGIRK2" /note="similar to Mus musculus G protein coupled inward rectifier K+ channel 2, encoded by Genbank Accession Number U11859" /codon_start=1 /db_xref="PID:g1052875" /translation="MAKLTESMTNVLEGDSMDQDVESPVAIHQPKLPKQARDDLPRHI SRDRTKRKIQRYVRKDGKCNVHHGNVRETYRYLTDIFTTLVDLKWRFNLLIFVMVYTV TWLFFGMIWWLIAYIRGDMDHIEDPSWTPCVTNLNGFVSAFLFSIETETTIGYGYRVI TDKCPEGIILLLIQSVLGSIVNAFMVGCMFVKISQPKKRAETLVFSTHAVISMRDGKL CLMFRVGDLRNSHIVEASIRAKLIKSKQTSEGEFIPLNQTDINVGYYTGDDRLFLVSP LIISHEINQQSPFWEISKAQLPKEELEIVVILEGMVEATGMTCQARSSYITSEILWGY RFTPVLTLEDGFYEVDYNSFHETYETSTPSLSAKELAELASRAELPLSWSVSSKLNQH AELETEEEEKNLEEQTERNGDVANLENESKV" BASE COUNT 558 a 495 c 475 g 546 t ORIGIN 1 atggagtctc ctaacagcct ctcggtgctg atgtgaaatt tgaccatctg attccagttt 61 ttttcttttc cttttctttt ttgcatttcc ttccctcgcc atccgtcgtg tagtgaattg 121 ttcagtcttg ctccgtttca agagaggaga tcatgattga gtgaagccac cccgtccgca 181 gccaggaaaa gcacaaagaa gaaactgcaa caatggccaa gctgacagaa tccatgacta 241 acgtcctgga gggcgactcc atggatcagg acgtcgaaag cccagtggcc attcaccagc 301 caaagttgcc taagcaggcc agggatgacc tgccaagaca catcagccga gatcggacca 361 aaaggaaaat ccagaggtac gtgaggaaag acggaaagtg caatgttcat cacggcaacg 421 tgagggagac ctatcgctac ctgaccgata tcttcaccac attagtggac ctgaagtgga 481 gattcaacct attgattttt gtcatggttt acacagtgac ctggctcttt tttggaatga 541 tctggtggtt gatcgcatac atacggggag acatggacca catagaggac ccctcctgga 601 ctccttgtgt taccaacctc aacgggttcg tctctgcttt tttattctca atagagacag 661 aaaccaccat tggttatggc taccgggtca tcacagataa atgcccggag ggaattattc 721 ttctcttaat ccaatctgtg ttggggtcca ttgtcaatgc attcatggtg ggatgcatgt 781 ttgtaaaaat ctctcaaccc aagaagaggg cagagaccct ggtcttttcc acccatgcag 841 tgatctccat gcgggatggg aaactgtgcc tgatgttccg ggtaggggac cttaggaatt 901 cccacattgt ggaggcttcc atcagagcca agttgatcaa atccaaacag acctcggagg 961 gggagttcat cccgttgaac cagacggata tcaacgtagg gtattacacg ggggatgacc 1021 gtctgtttct ggtgtcaccg ctgatcatta gccatgaaat taaccaacag agtcctttct 1081 gggagatctc caaagcccag ctgcccaaag aggaactgga aattgtggtc atcctagaag 1141 gaatggtgga agccacaggg atgacatgcc aagctcgaag ctcctacatc accagtgaga 1201 tcctgtgggg ttaccggttc acacctgtcc tgaccctgga ggatgggttc tacgaagttg 1261 actacaacag cttccatgag acctatgaga ccagcacccc atcccttagt gccaaagagc 1321 tggccgagtt agccagcagg gcagagctgc ccctgagttg gtctgtatcc agcaaactca 1381 accaacatgc agaactggag actgaagagg aagaaaagaa cctcgaagag caaacagaaa 1441 gaaatggtga tgtggcaaac ctggagaatg aatccaaagt ttagtgccct agctgggcaa 1501 acccttctct tctcccccca acacaatctt tccttgtctc tcattctctt tctttttctg 1561 tctctcttgc tttgttcttt atttgtttat atttaatttt tacatgacca gaaaacaaat 1621 cttcaaggtg taaaatatct acctgccctc tctcagttat tcagattgac aaggtagaca 1681 tggatttgat gaaagtgcaa agtgccctca tttgtggccc aagcctggtc tcctcccaaa 1741 atactacaca tccaactcct ggagatttca gttacttacc tgcatgtgtt gtacaatacc 1801 agatcactca aaaaggtgtg tcaaagattt tacctgggat atgacaagca aggtttctgg 1861 tgcctattta ttcattcagt gagacacaga gtggagccct cagttttatg gatcccaatt 1921 catttcatct actacagggt gaggtgcttg cccccatgtg ggtgtggcag ttacagggcc 1981 caggtgagct gaagacaaac cactgtacat atatatgcct tatgtaatta ttttcttttt 2041 gtaattagta ataaaaccca gcatgtacaa aagt // LOCUS HSU25033 1237 bp mRNA PRI 02-APR-1996 DEFINITION Human neuronatin alpha mRNA, complete cds. ACCESSION U25033 NID g1244407 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1237) AUTHORS Dou,D. and Joseph,R. TITLE Molecular cloning of human neuronatin alpha cDNA JOURNAL Unpublished REFERENCE 2 (bases 1 to 1237) AUTHORS Dou,D. TITLE Direct Submission JOURNAL Submitted (13-APR-1995) Dexian Dou, Laboratory of Molecular Neuroscience, Henry Ford Hospital, 2799 West Grand Boulevard, Detroit, MI 48202, USA FEATURES Location/Qualifiers source 1..1237 /organism="Homo sapiens" /strain="Caucasian" /db_xref="taxon:9606" /tissue_type="brain" /dev_stage="fetus" 5'UTR 1..69 CDS 70..315 /codon_start=1 /function="brain development" /product="neuronatin alpha" /db_xref="PID:g1244408" /translation="MAAVAAASAELLIIGWYIFRVLLQVFLECCIYWVGFAFRNPPGT QPIARSEVFRYSLQKLAYTVSRTGRQVLGERRQRAPN" 3'UTR 316..1237 BASE COUNT 272 a 371 c 337 g 257 t ORIGIN 1 gcggactccg agaccagcgg atctcggcaa accctctttc tcgaccaccc acctaccatt 61 cttggaacca tggcggcagt ggcggcggcc tcggctgaac tgctcatcat cggctggtac 121 atcttccgcg tgctgctgca ggtgttcctg gaatgctgca tttactgggt aggattcgct 181 tttcgaaatc ctccagggac acagcccatt gcgagaagtg aggtgttcag gtactccctg 241 cagaagctgg catacacggt gtcgcggacc gggcggcagg tgttggggga gcgcaggcag 301 cgagccccca actgaggccc cagctcccag cctgggcggc cgtatatagt gctcctgtgc 361 atctcggcca gcacgggagc cagtgccgcg caggaatgtg gggtcccctg tgttccctcg 421 ccagaggagc acttggcaag gtcagtgagg ggccagtaga cccccggaga agcagtaccg 481 acaatgacga agataccaga tcccttccca acccctttgc accggtccca ctaaggggca 541 gggtcgagag aggagggggg atagggggag cagaccctga gatctgggca taggcaccgc 601 attctgatct ggacaaagtc gggacagcac catcccagcc ccgaagccag ggccatgcca 661 gcaggcccca ccatggaaat caaaacaccg caccagccag cagaatggac attctgacat 721 cgccagccga cgccctgaat cttggtgcag cacccaccgc gtgcctgtgt ggcgggactg 781 gagggcacag ttgaggaagg agggtggtta agaaatacag tggggccctc tcgctgtccc 841 ttgcccaggg cacttgtatt ccagcctcgc tgcatttgct ctctcgattg cccctttcct 901 cctacatgcc tcccaagccc accctactcc aaaagtaatg tgtcacttga tttggaacta 961 ttcaagcagt aaaagtaaat gaatcccacc tttactaaaa cactttctct gaacccccct 1021 tgcccctcac tgatcttgct tttccctggt ctcatgcagt tgtggtcaat attgtggtaa 1081 tcgctaattg tactgattgt ttaagtgtgc attagttgtc tctccccagc tagattgtaa 1141 gctcctggag gacagggacc acctctacaa aaaataaaaa aagtacctcc cctgtctcgc 1201 acagtgtccc aggaccctgc ggtgcagtag aggcgca // LOCUS HSU25128 2641 bp mRNA PRI 07-JUL-1995 DEFINITION Human PTH2 parathyroid hormone receptor mRNA, complete cds. ACCESSION U25128 NID g887966 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2641) AUTHORS Usdin,T.B., Gruber,C. and Bonner,T.I. TITLE Identification and functional expression of a receptor selectively recognizing parathyroid hormone, the PTH2 receptor JOURNAL J. Biol. Chem. 270 (26), 15455-15458 (1995) MEDLINE 95318121 REFERENCE 2 (bases 1 to 2641) AUTHORS Bonner,T.I. TITLE Direct Submission JOURNAL Submitted (17-APR-1995) Tom I. Bonner, Lab of Cell Biology, NIMH, Bldg. 36, Room 3A-17, Bethesda, MD 20892-4090, USA FEATURES Location/Qualifiers source 1..2641 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="cerebral cortex and hippocampus" CDS 143..1795 /note="G protein-coupled receptor" /codon_start=1 /function="stimulates cAMP production in response to parathyroid hormone but not parathyroid hormone-related peptide" /product="PTH2 parathyroid hormone receptor" /db_xref="PID:g887967" /translation="MAGLGASLHVWGWLMLGSCLLARAQLDSDGTITIEEQIVLVLKA KVQCELNITAQLQEGEGNCFPEWDGLICWPRGTVGKISAVPCPPYIYDFNHKGVAFRH CNPNGTWDFMHSLNKTWANYSDCLRFLQPDISIGKQEFFERLYVMYTVGYSISFGSLA VAILIIGYFRRLHCTRNYIHMHLFVSFMLRATSIFVKDRVVHAHIGVKELESLIMQDD PQNSIEATSVDKSQYIGCKIAVVMFIYFLATNYYWILVEGLYLHNLIFVAFFSDTKYL WGFILIGWGFPAAFVAAWAVARATLADARCWELSAGDIKWIYQAPILAAIGLNFILFL NTVRVLATKIWETNAVGHDTRKQYRKLAKSTLVLVLVFGVHYIVFVCLPHSFTGLGWE IRMHCELFFNSFQGFFVSIIYCYCNGEVQAEVKKMWSRWNLSVDWKRTPPCGSRRCGS VLTTVTHSTSSQSQVAASTRMVLISGKAAKIASRQPDSHITLPGYVWSNSEQDCLPHS FHEETKEDSGRQGDDILMEKPSRPMESNPDTEGCQGETEDVL" polyA_signal 2619..2625 polyA_site 2641 BASE COUNT 670 a 564 c 623 g 784 t ORIGIN 1 ggccggtggc ccgggcccga ccaccccagc tgcgcgtcgt tactggccac aagtttgctc 61 tgggccagcc aagttggcaa cttggaagct tctcccgggc tctggaggag ggtccctgct 121 tcttcctaca gccgttccgg gcatggccgg gctgggggcg tcgctccacg tctggggttg 181 gctaatgctc ggcagctgcc tcctggccag agcccagctg gattctgatg gcaccattac 241 tatagaggag cagattgtcc ttgtgctgaa agcgaaagta caatgtgaac tcaacatcac 301 agctcaactc caggagggag aaggtaattg tttccctgaa tgggatggac tcatttgttg 361 gcccagagga acagtgggga aaatatcggc tgttccatgc cctccttata tttatgactt 421 caaccataaa ggagttgctt tccgacactg taaccccaat ggaacatggg attttatgca 481 cagcttaaat aaaacatggg ccaattattc agactgcctt cgctttctgc agccagatat 541 cagcatagga aagcaagaat tctttgaacg cctctatgta atgtataccg ttggctactc 601 catctctttt ggttccttgg ctgtggctat tctcatcatt ggttacttca gacgattgca 661 ttgcactagg aactatatcc acatgcactt atttgtgtct ttcatgctga gagctacaag 721 catctttgtc aaagacagag tagtccatgc tcacatagga gtaaaggagc tggagtccct 781 aataatgcag gatgacccac aaaattccat tgaggcaact tctgtggaca aatcacaata 841 tatcgggtgc aagattgctg ttgtgatgtt tatttacttc ctggctacaa attattattg 901 gatcctggtg gaaggtctct acctgcataa tctcatcttt gtggctttct tttcggacac 961 caaatacctg tggggcttca tcttgatagg ctgggggttt ccagcagcat ttgttgcagc 1021 atgggctgtg gcacgagcaa ctctggctga tgcgaggtgc tgggaactta gtgctggaga 1081 catcaagtgg atttatcaag caccgatctt agcagctatt gggctgaatt ttattctgtt 1141 tctgaatacg gttagagttc tagctaccaa aatctgggag accaatgcag ttgggcatga 1201 cacaaggaag caatacagga aactggccaa atcgacactg gtcctggtcc tagtctttgg 1261 agtgcattac atcgtgttcg tatgcctgcc tcactccttc actgggctcg ggtgggagat 1321 ccgcatgcac tgtgagctct tcttcaactc ctttcagggt ttctttgtgt ctatcatcta 1381 ctgctactgc aatggagagg ttcaggcaga ggtgaagaag atgtggagtc ggtggaatct 1441 ctccgtggac tggaaaagga caccgccatg tggcagccgc agatgcggct cagtgctcac 1501 caccgtgacg cacagcacca gcagccagtc acaggtggcg gccagcacac gcatggtgct 1561 tatctctggc aaagctgcca agatcgccag cagacagcct gacagccaca tcactttacc 1621 tggctatgtc tggagtaact cagagcagga ctgcctgcca cactctttcc acgaggagac 1681 caaggaagat agtgggaggc agggagatga tattctaatg gagaagcctt ccaggcctat 1741 ggaatctaac ccagacactg aaggatgcca aggagaaact gaggatgttc tctgaatgga 1801 catttgtggc tgactttcat gggctggtcc aatggctggt tgtgtgagag ggcttggctg 1861 atactcctat gcttgagttc aaaggctgaa aattcagtta aggtgttact taataatagt 1921 ttttaggctc catgaattgg ctcctgtaaa tactaacgac atgaaaatgc aagtgtcaat 1981 ggagtagttt attaccttct attggcatca agttttcctc taaattaatg tatggtattt 2041 gctctgtgat tgttcatttt tttctgctac ttttgggtag aaaaaagatt caattgcttg 2101 gctgtagctt tctctcatat atatcaccct aaatataatg aagatctttt agtgtgtatc 2161 attttccttt tagaaactag tattctctta tttcttactt taatgtactt ctatcactgc 2221 atttattttg cctgtgcata ggagcaatta ggatctaaaa aaatatatgg gaagataaaa 2281 gatctaagaa caagtacttg ctggaaaatt agttggctgg acattgataa aataatgcat 2341 ttataacaat tacatgtgtt tttgggaaca aggaaaattt ctcaaaaaag aatatttcac 2401 acatcccttc ttttgaatgg cctctttgtg accagccaga cctcaggtct tcactctttc 2461 ttctttgtaa accatgtcat gtggaaagat ttcctcagtt agtgagcttg tgtctgcaaa 2521 ttgattttgt ttgtaatgta ttttgatagc aaatcatgct gcatctatat ctttttcttg 2581 tttgagctgt tactacattg tacatggcat gtgggatcaa ttaaaaattt gttttaaaaa 2641 t // LOCUS HSU25147 1533 bp mRNA PRI 20-SEP-1996 DEFINITION Human citrate transporter protein mRNA, nuclear gene encoding mitochondrial protein, complete cds. ACCESSION U25147 NID g950003 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1533) AUTHORS Heisterkamp,N., Mulder,M.P., Langeveld,A., ten Hoeve,J., Wang,Z., Roe,B.A. and Groffen,J. TITLE Localization of the human mitochondrial citrate transporter protein gene to chromosome 22Q11 in the DiGeorge syndrome critical region JOURNAL Genomics 29 (2), 451-456 (1995) MEDLINE 96115597 REFERENCE 2 (bases 1 to 1533) AUTHORS Roe,B.A. TITLE Direct Submission JOURNAL Submitted (18-APR-1995) Bruce A. Roe, Chemistry and Biochemistry, University of Oklahoma, 620 Parrington Oval, Norman, OK 73019, USA FEATURES Location/Qualifiers source 1..1533 /organism="Homo sapiens" /db_xref="taxon:9606" /map="22q11" /chromosome="22" CDS 75..1010 /codon_start=1 /product="citrate transporter protein" /db_xref="PID:g950004" /translation="MPAPRAPRALAAAAPASGKAKLTHPEKAILAGGLAGGIEICITF PTEYVKTQLQLDERSHPPRYRGIGDCVRQTVRSHGVLGLYRGLSSLLYGSIPKAAVRF GMFEFLSNHMRDAQGRLDSTRGLLCGLGAGVAEAVVVVCPMETIKVKFIHDQTSPNPK YRGFFHGVREIVREQGLKGTYQGLTATVLKQGSNQAIRFFVMTSLRNWYRGDNPNKPM NPLITGVFGAIAGAASVFGNTPLDVIKTRMQGLEAHKYRNTWDCGLQILKKEGLKAFY KGTVPRLGRVCLDVAIVFVIYDEVVKLLNKVWKTD" BASE COUNT 256 a 524 c 470 g 283 t ORIGIN 1 caccgcggac cgagcgcgga gttctggagt ctcggacccg aagccgccac agggcgcccc 61 gcctcccgcc cgccatgccc gcgccccgcg ccccgcgcgc tctggcggcc gccgcgcccg 121 cgtccgggaa ggccaagctg acgcacccgg agaaggcgat cctggcaggc ggcctggcgg 181 gtggcatcga gatctgcatc accttcccca ccgagtacgt gaagacgcag ctgcagctgg 241 acgagcgctc gcacccgccg cggtaccggg gcatcgggga ctgcgtgcgg cagacggttc 301 gcagccatgg cgtcctgggc ctgtaccgcg gccttagctc gctgctctac ggttccatcc 361 ccaaggcggc cgtcaggttt ggaatgttcg agttcctcag caaccacatg cgggatgccc 421 agggacggct ggacagcacg cgtgggctgc tgtgcggcct gggcgctggc gtggccgagg 481 ccgtggtggt cgtgtgcccc atggagacca tcaaggtgaa gttcatccac gaccagacct 541 ccccaaaccc caagtacaga ggattcttcc acggggttag ggagattgtg cgggaacaag 601 ggctgaaggg gacgtaccag ggcctcacag ccactgtcct gaagcagggc tcgaaccagg 661 ccatccgctt cttcgtcatg acctccctgc gcaactggta ccgaggggac aaccccaaca 721 agcccatgaa ccctctgatc actggggtct tcggagctat tgcaggcgca gccagtgtct 781 ttggaaacac tcctctggat gtgattaaga cccggatgca gggcctggag gcgcacaaat 841 accggaacac gtgggactgc ggcttgcaga tcctgaagaa ggaggggctc aaggcattct 901 acaagggcac tgtcccccgc ctgggccggg tctgcctgga tgtggccata gtgtttgtca 961 tctatgatga agtggtgaag ctgctcaaca aagtgtggaa gacggactaa gcctagagag 1021 gccgcaaggg gaccgcccca ggcaccgcca gagtgtcctg ctacctttgt ctcacgattc 1081 cagtgcagta gtgccaaaag gccccttccc acgtccctcg agctctgtag cctggtctgt 1141 gcattgtggc tgtcaaatcc atgtgtcccc cctgtggtct gtgtgtgaca ccaccactgt 1201 gtcccagtgt ctggcccagc catggctgga tgtgcatctg gcctatgacc ctgtgcctgt 1261 gtttcatgtt ctgtgtcacg tgaccctgtg ccccgcctcc cggggtgccc gtgtggcctg 1321 ggtcctcggc cctgtagccc tggcccggtc ccagtccggt gccttccacc ctgccctggc 1381 ctaccacagc tgcctccggg cctcggcctg gcttcaccgc attccagggg ctgcagcccc 1441 ctgcttctcc cgccattggc cttaactggc cctcgggccc tctctccgcc ccggacaggg 1501 tggcacccac cactctcagg accaccctgc caa // LOCUS HSU25165 2132 bp mRNA PRI 04-JUL-1995 DEFINITION Human fragile X mental retardation protein 1 homolog FXR1 mRNA, complete cds. ACCESSION U25165 NID g887792 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2132) AUTHORS Siomi,M.C., Siomi,H., Sauer,W.H., Srinivasan,S., Nussbaum,R.L. and Dreyfuss,G. TITLE FXR1, an autosomal homolog of the fragile X mental retardation gene JOURNAL EMBO J. 14 (11), 2401-2408 (1995) MEDLINE 95300772 REFERENCE 2 (bases 1 to 2132) AUTHORS Dreyfuss,G. TITLE Direct Submission JOURNAL Submitted (18-APR-1995) Gideon Dreyfuss, HHMI, School of Medicine, University of Pennsylvania, 422 Curie Boulevard, Philadelphia, PA 19104, USA FEATURES Location/Qualifiers source 1..2132 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="HeLa cell" /chromosome="12" /map="12q13" CDS 13..1878 /note="FMR1 homolog; similar to human fragile X mental retardation protein 1 FMR-1, Swiss-Prot Accession Number Q06787" /codon_start=1 /product="FXR1" /db_xref="PID:g887793" /translation="MADVTVEVRGSNGAFYKGFIKDVHEDSLTVVFENNWQPERQVPF NEVRLPPPPDIKKEISEGDEVEVYSRANDQEPCGWWLAKVRMMKGEFYVIEYAACDAT YNEIVTFERLRPVNQNKTVKKNTFFKCTVDVPEDLREACANENAHKDFKKAVGACRIF YHPETTQLMILSASEATVKRVNILSDMHLRSIRTKLMLMSRNEEATKHLECTKQLAAA FHEEFVVREDLMGLAIGTHGSNIQQARKVPGVTAIELDEDTGTFRIYGESADAVKKAR GFLEFVEDFIQVPRNLVGKVIGKNGKVIQEIVDKSGVVRVRIEGDNENKLPREDGMVP FVFVGTKESIGNVQVLLEYHIAYLKEVEQLRMERLQIDEQLRQIGSRSYSGRGRGRRG PNYTSGYGTNSELSNPSETESERKDELSDWSLAGEDNRDSRHQRDSRRRPGGRGRSVS GGRGRGGPRGGKSSISSVLKDPDSNPYSLLDNTESDQTADTDASESHHSTNRRRRSRR RRTDEDAVLMDGMTESDTASVNENGLVTVADYISRAESQSRQRNLPRETLAKNKKEMA KDVIEEHGPSEKAINGPTSASGDDISKLQRTPGEEKINTLKEENTQEAAVLNGVS" polyA_site 2132 /note="19 A nucleotides" BASE COUNT 710 a 367 c 517 g 538 t ORIGIN 1 tgcggttcca acatggcgga cgtgacggtg gaggttcgcg gctctaacgg ggctttctac 61 aagggattta tcaaagatgt tcatgaagac tcccttacag ttgtttttga aaataattgg 121 caaccagaac gccaggttcc atttaatgaa gttagattac caccaccacc tgatataaaa 181 aaagaaatta gtgaaggaga tgaagtagag gtatattcaa gagcaaatga ccaagagcca 241 tgtgggtggt ggttggctaa agttcggatg atgaaaggag aattttatgt cattgaatat 301 gctgcttgtg acgctactta caatgaaata gtcacatttg aacgacttcg gcctgtcaat 361 caaaataaaa ctgtcaaaaa aaataccttc tttaaatgca cagtggatgt tcctgaggat 421 ttgagagagg cgtgtgctaa tgaaaatgca cataaagatt ttaagaaagc agtaggagca 481 tgcagaattt tttaccatcc agaaacaaca cagctaatga tactgtctgc cagtgaagca 541 actgtgaaga gagtaaacat cttaagtgac atgcatttgc gaagtattcg tacgaagttg 601 atgcttatgt ccagaaatga agaggccact aagcatttag aatgcacaaa acaacttgca 661 gcagcttttc atgaggaatt tgttgtgaga gaagatttaa tgggcctggc aataggaaca 721 catggtagta acatccagca agctaggaag gttcctggag ttaccgccat tgagctagat 781 gaagatactg gaacattcag aatctacgga gagagtgctg atgctgtaaa aaaggctaga 841 ggtttcttgg aatttgtgga ggattttatt caggttccta ggaatctcgt tggaaaagta 901 attggaaaaa atggcaaagt tattcaagaa atagtggaca aatctggtgt ggttcgagtg 961 agaattgaag gggacaatga aaataaatta cccagagaag acggtatggt tccatttgta 1021 tttgttggca ctaaagaaag cattggaaat gtgcaggttc ttctagagta tcatattgcc 1081 tatctaaagg aagtagaaca gctaagaatg gaacgcctac agattgatga acagctgcga 1141 cagattggtt ctaggtctta tagcggaaga ggcagaggtc gtcggggacc taattacacc 1201 tccggttatg gtacaaattc tgagctgtct aacccctctg aaacggaatc tgagcgtaaa 1261 gacgagctga gtgattggtc attggcagga gaagataatc gagacagccg acatcagcgt 1321 gacagcagga gacgcccagg aggaagaggc agaagtgttt cagggggtcg aggtcgtggt 1381 ggaccacgtg gtggcaaatc ctccatcagt tctgtgctca aagatccaga cagcaatcca 1441 tacagcttac ttgataatac agaatcagat cagactgcag acactgatgc cagcgaatct 1501 catcacagta ctaaccgtcg taggcggtct cgtagacgaa ggactgatga agatgctgtt 1561 ctgatggatg gaatgactga atctgataca gcttcagtta atgaaaatgg gctagtcaca 1621 gttgcagatt atatttctag agctgagtct cagagcagac aaagaaacct cccaagggaa 1681 actttggcta aaaacaagaa agaaatggca aaagatgtga ttgaagagca tggtccttca 1741 gaaaaggcaa taaacggccc aactagtgct tctggcgatg acatttctaa gctacagcgt 1801 actccaggag aagaaaagat taatacctta aaagaagaaa acactcaaga agcagcagtc 1861 ctgaatggtg tttcataaac tgaagaagtt cctagtttac agttctttta cattacattt 1921 acaatagtgc ttgtacaagc ttgccaaaga tagaatatgg atcgccagtc tttacatcgc 1981 actttcagtt cctccatttg gaattcaaaa aggggaggga tcctgaagaa atcatatgtt 2041 aaacatactt tgacacctac tgtgttataa aatatatcat cagatgtgcc ttgagaatag 2101 tatatgtaac attaaaaaaa agttgctggc ta // LOCUS HSU25182 921 bp mRNA PRI 07-JAN-1998 DEFINITION Human antioxidant enzyme AOE37-2 mRNA, complete cds. ACCESSION U25182 NID g799380 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 921) AUTHORS Jin,D.Y., Chae,H.Z., Rhee,S.G. and Jeang,K.T. TITLE Regulatory role for a novel human thioredoxin peroxidase in NF-kappaB activation JOURNAL J. Biol. Chem. 272 (49), 30952-30961 (1997) MEDLINE 98049564 REFERENCE 2 (bases 1 to 921) AUTHORS Jin,D.-Y. TITLE Direct Submission JOURNAL Submitted (18-APR-1995) Dong-Yan Jin, Laboratory of Molecular Microbiology, NIAID, National Institutes of Health, 9000 Rockville Pike, Bethesda, MD 20892-0460, USA FEATURES Location/Qualifiers source 1..921 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa S3, exponentially growing, ATCC #CCL 2.2" /clone_lib="Clontech cDNA library cat.#: HL4000A1, lot#: 39042a" CDS 44..859 /note="putative peroxidase; member of peroxiredoxin family" /codon_start=1 /product="antioxidant enzyme AOE37-2" /db_xref="PID:g799381" /translation="MEALPLLAATTPDHGRHRRLLLLPLLLFLLPAGAVQGWETEERP RTREEECHFYAGGQVYPGEASRVSVADHSLHLSKAKISKPAPYWEGTAVIDGEFKELK LTDYRGKYLVFFFYPLDFTFVCPTEIIAFGDRLEEFRSINTEVVACSVDSQFTHLAWI NTPRRQGGLGPIRIPLLSDLTHQISKDYGVYLEDSGHTLRGLFIIDDKGILRQITLND LPVGRSVDETLRLVQAFQYTDKHGEVCPAGWKPGSETIIPDPAGKLKYFDKLN" polyA_site 921 /note="13 A nucleotides" BASE COUNT 236 a 221 c 235 g 229 t ORIGIN 1 gcggcgctcg cgccaaggga cgtgtttctg cgctcgcgtg gtcatggagg cgctgccgct 61 gctagccgcg acaactccgg accacggccg ccaccgaagg ctgcttctgc tgccgctact 121 gctgttcctg ctgccggctg gagctgtgca gggctgggag acagaggaga ggccccggac 181 tcgcgaagag gagtgccact tctacgcggg tggacaagtg tacccgggag aggcatcccg 241 ggtatcggtc gccgaccact ccctgcacct aagcaaagcg aagatttcca agccagcgcc 301 ctactgggaa ggaacagctg tgatcgatgg agaatttaag gagctgaagt taactgatta 361 tcgtgggaaa tacttggttt tcttcttcta cccacttgat ttcacatttg tgtgtccaac 421 tgaaattatc gcttttggcg acagacttga agaattcaga tctataaata ctgaagtggt 481 agcatgctct gttgattcac agtttaccca tttggcctgg attaataccc ctcgaagaca 541 aggaggactt gggccaataa ggattccact tctttcagat ttgacccatc agatctcaaa 601 ggactatggt gtatacctag aggactcagg ccacactctt agaggtctct tcattattga 661 tgacaaagga atcctaagac aaattactct gaatgatctt cctgtgggta gatcagtgga 721 tgagacacta cgtttggttc aagcattcca gtacactgac aaacacggag aagtctgccc 781 tgctggctgg aaacctggta gtgaaacaat aatcccagat ccagctggaa agctgaagta 841 tttcgataaa ctgaattgag aaatacttct tcaagttatg atgcttgaaa gttctcaata 901 aagttcacgg tttcattacc a // LOCUS HSU25265 2083 bp mRNA PRI 05-APR-1996 DEFINITION Human MEK5 mRNA, complete cds. ACCESSION U25265 NID g1255719 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2083) AUTHORS Zhou,G., Bao,Z.Q. and Dixon,J.E. TITLE Components of a new human protein kinase signal transduction pathway JOURNAL J. Biol. Chem. 270 (21), 12665-12669 (1995) MEDLINE 95279403 REFERENCE 2 (bases 1 to 2083) AUTHORS Zhou,G. TITLE Direct Submission JOURNAL Submitted (19-APR-1995) Gaochao Zhou, Biological Chemistry, The University of Michigan Medical School, Room 4433 Medical Science I, Ann Arbor, MI 48109, USA FEATURES Location/Qualifiers source 1..2083 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="3-2" /clone_lib="cDNA Lambda ZAP (Invitrogen)" /tissue_type="brain" /dev_stage="fetal" CDS 297..1613 /codon_start=1 /product="MEK5" /db_xref="PID:g1255720" /translation="MLWLALGPFPAMENQVLVIRIKIPNSGAVDWTVHSGPQLLFRDV LDVIGQVLPEATTTAFEYEDEDGDRITVRSDEEMKAMLSYYYSTVMEQQVNGQLIEPL QIFPRACKPPGERNIHGLKVNTRAGPSQHSSPAVSDSLPSNSLKKSSAELKKILANGQ MNEQDIRYRDTLGHGNGGTVYKAYHVPSGKILAVKVILLDITLELQKQIMSELEILYK CDSSYIIGFYGAFFVENRISICTEFMDGGSLDVYRKMPEHVLGRIAVAVVKGLTYLWS LKILHRDVKPSNMLVNTRGQVKLCDFGVSTQLVNSIAKTYVGTNAYMAPERISGEQYG IHSDVWSLGISFMEIQKNQGSLMPLQLLQCIVDEDSPVLPVGEFSEPFVHFITQCMRK QPKERPAPEELMGHPFIVQFNDGNAAVVSMWVCRALEERRSQQGPP" BASE COUNT 518 a 532 c 512 g 521 t ORIGIN 1 agcgttcgct caactccaga accttccgac ctccgctagt tcctgcgggc ctttgcccgc 61 ttcccggtgc accctccccg ggagacacct cagacccccg acagcctggg caggctcggt 121 gcctgcgggt gcgttcctga tcacccctcc cctcttccct ccccctcatc ctccattccc 181 ttgttttcac cctctgtcct ctgcccgtca ctccccttgt cacctcttgg agccccctcc 241 taaccagcgg ccagtgggtt tcccataccc caggatgtga gcctctttaa cctgtaatgc 301 tgtggctagc ccttggcccc tttcctgcca tggagaacca ggtgctggta attcgcatca 361 agatcccaaa tagtggcgcg gtggactgga cagtgcactc cgggccgcag ttactcttca 421 gggatgtgct ggatgtgata ggccaggttc tgcctgaagc aacaactaca gcatttgaat 481 atgaagatga agatggtgat cgaattacag tgagaagtga tgaggaaatg aaggcaatgc 541 tgtcatatta ttattccaca gtaatggaac agcaagtaaa tggacagtta atagagcctc 601 tgcagatatt tccaagagcc tgcaagcctc ctggggaacg gaacatacat ggcctgaagg 661 tgaatactcg ggccggaccc tctcaacaca gcagcccagc agtctcagat tcacttccaa 721 gcaatagctt aaagaagtct tctgctgaac tgaaaaaaat actagccaat ggccagatga 781 atgaacaaga catacgatat cgggacactc ttggtcatgg caacggaggc acagtctaca 841 aagcatatca tgtcccgagt gggaaaatat tagctgtaaa ggtcatacta ctagatatta 901 cactggaact tcagaagcaa attatgtctg aattggaaat tctttataag tgcgattcat 961 catatatcat tggattttat ggagcatttt ttgtagaaaa caggatttca atatgtacag 1021 aattcatgga tgggggatct ttggatgtat ataggaaaat gccagaacat gtccttggaa 1081 gaattgcagt agcagttgtt aaaggcctta cttatttgtg gagtttaaag attttacata 1141 gagacgtgaa gccctccaat atgctagtaa acacaagagg acaggttaag ctgtgtgatt 1201 ttggagttag cactcagctg gtgaattcta tagccaagac gtatgttgga acaaatgctt 1261 atatggcgcc tgaaaggatt tcaggggagc agtatggaat tcattctgat gtctggagct 1321 taggaatctc ttttatggag attcagaaaa accagggatc tttaatgcct ctccagcttc 1381 tgcagtgcat tgttgatgag gattcgcccg tccttccagt tggagagttc tcggagccat 1441 ttgtacattt catcactcag tgtatgcgaa aacagccaaa agaaaggcca gcacctgaag 1501 aattgatggg ccacccgttc atcgtgcagt tcaatgatgg aaatgccgcc gtggtgtcca 1561 tgtgggtgtg ccgggcgctg gaggagaggc ggagccagca ggggcccccg tgaggctgcc 1621 gcagggcact gaaagcccag gaccagtaac caaggagaac aacccacccg tcgcccttct 1681 ccgtatgctg cctgcgccag aagagctttg ctgggccctg gcttccctgc cctcgccttc 1741 acctctgtca gcaggtggcc ttgcctgggg agccccatgt gtggcccacc ccaccaggcc 1801 atccccatac cttctggttt gaaggcgctg acactggcag agaggtaaag ggtggggcat 1861 tgagaatgga ggctcccagg gtccctgccc acttctgttt tcctaatgtt tttctctata 1921 aagggtcagg cccgtcagca tcactgatgg gaataaaagt attaatgctt tgtgacagcc 1981 tctgcctgaa aactggacag aaggacccag aggtgttctt tcattttctc tcttacctcc 2041 aatctttccc ctttcaagct acaggtaaag gctctaccac cat // LOCUS HSU25278 2828 bp mRNA PRI 16-NOV-1995 DEFINITION Human ERK5 mRNA, complete cds. ACCESSION U25278 NID g837260 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2828) AUTHORS Zhou,G., Bao,Z.Q. and Dixon,J.E. TITLE Components of a new human protein kinase signal transduction pathway JOURNAL J. Biol. Chem. 270 (21), 12665-12669 (1995) MEDLINE 95279403 REFERENCE 2 (bases 1 to 2828) AUTHORS Zhou,G. TITLE Direct Submission JOURNAL Submitted (19-APR-1995) Gaochao Zhou, Biological Chemistry, The University of Michigan Medical School, R4433 Medical Science I, Ann Arbor, MI 48109, USA FEATURES Location/Qualifiers source 1..2828 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="3-1" /clone_lib="cDNA Lambda ZAP (Invitrogen)" /tissue_type="brain" /dev_stage="fetal" CDS 84..2531 /codon_start=1 /product="ERK5" /db_xref="PID:g837261" /translation="MAEPLKEEDGEDGSAEPPAREGRTRPHRCLCSAKNLALLKARSF DVTFDVGDEYEIIETIGNGAYGVVSSARRRLTGQQVAIKKIPNAFDVVTNAKRTLREL KILKHFKHDNIIAIKDILRPTVPYGEFKSVYVVLDLMESDLHQIIHSSQPLTLEHVRY FLYQLLRGLKYMHSAQVIHRDLKPSNLLVNENCELKIGDFGMARGLCTSPAEHQYFMT EYVATRWYRAPELMLSLHEYTQAIDLWSVGCIFGEMLARRQLFPGKNYVHQLQLIMMV LGTPSPAVIQAVGAERVRAYIQSLPPRQPVPWETVYPGADRQALSLLGRMLRFEPSAR ISAAAALRHPFLAKYHDPDDEPDCAPPFDFAFDREALTRERIKEAIVAEIEDFHARRE GIRQQIRFQPSLQPVASEPGCPDVEMPSPWAPSGDCAMESPPPAPPPCPGPAPDTIDL TLQPPPPVSEPAPPKKDGAISDNTKAALKAALLKSLRSRLRDGPSAPLEAPEPRKPVT AQERQREREEKRRRRQERAKEREKRRQERERKERGAGASGGPSTDPLAGLVLSDNDRS LLERWTRMARPAAPALTSVPAPAPAPTPTPTPVQPTSPPPGPLAQPTGPQPQSAGSTS GPVPQPACPPPGPAPHPTGPPGPIPVPAPPQIATSTSLLAAQSLVPPPGLPGSSTPGV LPYFPPGLPPPDAGGAPQSSMSESPDVNLVTQQLSKSQVEDPLPPVFSGTPKGSGAGY GVGFDLEEFLNQSFDMGVADGPQDGQADSASLSASLLADWLEGHGMNPADIESLQREI QMDSPMLLADLPDLQDP" source 321..2828 /organism="Homo sapiens" /clone="A1" /clone_lib="Human Hela S3 Matchmaker cDNA" /cell_line="Hela" BASE COUNT 558 a 959 c 759 g 552 t ORIGIN 1 gaattccgga gacccccgcg ctggggacgg gaggccggcg agcctcggga cctctgaaag 61 ccttgaggag gcccggggac accatggccg agcctctgaa ggaggaagac ggcgaggacg 121 gctctgcgga gcccccggcc cgtgaaggtc gaacccgccc acaccgctgc ctctgtagcg 181 ccaagaacct ggccctgctt aaagcccgct ccttcgatgt gacctttgac gtgggcgacg 241 agtacgagat catcgagacc ataggcaacg gggcctatgg agtggtgtcc tccgcccgcc 301 gccgcctcac cggccagcag gtggccatca agaagatccc taatgctttc gatgtggtga 361 ccaatgccaa gcggaccctc agggagctga agatcctcaa gcactttaaa cacgacaaca 421 tcatcgccat caaggacatc ctgaggccca ccgtgcccta tggcgaattc aaatctgtct 481 acgtggtcct ggacctgatg gaaagcgacc tgcaccagat catccactcc tcacagcccc 541 tcacactgga acacgtgcgc tacttcctgt accaactgct gcggggcctg aagtacatgc 601 actcggctca ggtcatccac cgtgacctga agccctccaa cctattggtg aatgagaact 661 gtgagctcaa gattggtgac tttggtatgg ctcgtggcct gtgcacctcg cccgctgaac 721 atcagtactt catgactgag tatgtggcca cgcgctggta ccgtgcgccc gagctcatgc 781 tctctttgca tgagtataca caggctattg acctctggtc tgtgggctgc atctttggtg 841 agatgctggc ccggcgccag ctcttcccag gcaaaaacta tgtacaccag ctacagctca 901 tcatgatggt gctgggtacc ccatcaccag ccgtgattca ggctgtgggg gctgagaggg 961 tgcgggccta tatccagagc ttgccaccac gccagcctgt gccctgggag acagtgtacc 1021 caggtgccga ccgccaggcc ctatcactgc tgggtcgcat gctgcgtttt gagcccagcg 1081 ctcgcatctc agcagctgct gcccttcgcc accctttcct ggccaagtac catgatcctg 1141 atgatgagcc tgactgtgcc ccgccctttg actttgcctt tgaccgcgaa gccctcactc 1201 gggagcgcat taaggaggcc attgtggctg aaattgagga cttccatgca aggcgtgagg 1261 gcatccgcca acagatccgc ttccagcctt ctctacagcc tgtggctagt gagcctggct 1321 gtccagatgt tgaaatgccc agtccctggg ctcccagtgg ggactgtgcc atggagtctc 1381 caccaccagc cccgccacca tgccccggcc ctgcacctga caccattgat ctgaccctgc 1441 agccacctcc accagtcagt gagcctgccc caccaaagaa agatggtgcc atctcagaca 1501 atactaaggc tgcccttaaa gctgccctgc tcaagtcttt gaggagccgg ctcagagatg 1561 gccccagcgc acccctggag gctcctgagc ctcggaagcc ggtgacagcc caggagcgcc 1621 agcgggagcg ggaggagaag cggcggaggc ggcaagaacg agccaaggag cgggagaaac 1681 ggcggcagga gcgggagcga aaggaacggg gggctggggc ctctgggggc ccctccactg 1741 accccttggc tggactagtg ctcagtgaca atgacagaag cctgttggaa cgctggactc 1801 gaatggcccg gcccgcagcc ccagccctca cctctgtgcc ggcccctgcc ccagcgccaa 1861 cgccaacccc aaccccagtc caacctacca gtcctcctcc tggccctcta gcccagccca 1921 ctggcccgca accacaatct gcgggctcta cctctggccc tgtaccccag cctgcctgcc 1981 caccccctgg ccctgcaccc caccccactg gccctcctgg gcccatccct gtccccgcgc 2041 caccccagat tgccacctcc accagcctcc tggctgccca gtcacttgtg ccaccccctg 2101 ggctgcctgg ctccagcacc ccaggagttt tgccttactt cccacctggc ctgccgcccc 2161 cagacgccgg gggagcccct cagtcttcca tgtcagagtc acctgatgtc aaccttgtga 2221 cccagcagct atctaagtca caggtggagg accccctgcc ccctgtgttc tcaggcacac 2281 caaagggcag tggggctggc tacggtgttg gctttgacct ggaggaattc ttaaaccagt 2341 ctttcgacat gggcgtggct gatgggccac aggatggcca ggcagattca gcctctctct 2401 cagcctccct gcttgctgac tggctcgaag gccatggcat gaaccctgcc gatattgagt 2461 ccctgcagcg tgagatccag atggactccc caatgctgct ggctgacctg cctgacctcc 2521 aggacccctg aggcccccag cctgtgcctt gctgccacag tagacctagt tccaggatcc 2581 atgggagcat tctcaaaggc tttagccctg gacccagcag gtgaggctcg gcttggatta 2641 ttctgcaggt tcatctcaga cccacctttc agccttaagc agccacctga gccaccaccg 2701 agccatggca ggatcgggag accccaactc cccctgaaca atccttttca gtattatatt 2761 tttattatta ttatgttatt attacactgt cttttgccat caaaatgagg cctgtgaaat 2821 acaaggtt // LOCUS HSU25341 1105 bp mRNA PRI 26-JUL-1996 DEFINITION Human Mel1b-melatonin receptor (MTNR1B) mRNA, complete cds. ACCESSION U25341 NID g971193 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1105) AUTHORS Reppert,S.M., Godson,C., Mahle,C.D., Weaver,D.R., Slaugenhaupt,S.A. and Gusella,J.F. TITLE Molecular characterization of a second melatonin receptor expressed in human retina and brain: the Mel1b melatonin receptor JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (19), 8734-8738 (1995) MEDLINE 96004613 REFERENCE 2 (bases 1 to 1105) AUTHORS Reppert,S.M., Weaver,D.R., Ebisawa,T., Mahle,C.D. and Kolakowski,L.F. Jr. TITLE Cloning of a melatonin-related receptor from human pituitary JOURNAL FEBS Lett. 386 (2-3), 219-224 (1996) MEDLINE 96228068 REFERENCE 3 (bases 1 to 1105) AUTHORS Reppert,S.M., Godson,C., Mahle,C.D., Weaver,D.R., Slaugenhaupt,S.A. and Gusella,J.F. TITLE Direct Submission JOURNAL Submitted (19-APR-1995) Steven M. Reppert, Chronobiology, Mass. Gen. Hospital, GRJ 1226, 32 Fruit Street, Boston, MA 02114, USA FEATURES Location/Qualifiers source 1..1105 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11q21-22" gene 13..1101 /gene="MTNR1B" CDS 13..1101 /gene="MTNR1B" /note="member of G protein coupled receptor family" /codon_start=1 /product="Mel1b-melatonin receptor" /db_xref="PID:g971194" /translation="MSENGSFANCCEAGGWAVRPGWSGAGSARPSRTPRPPWVAPALS AVLIVTTAVDVVGNLLVILSVLRNRKLRNAGNLFLVSLALADLVVAFYPYPLILVAIF YDGWALGEEHCKASAFVMGLSVIGSVFNITAIAINRYCYICHSMAYHRIYRRWHTPLH ICLIWLLTVVALLPNFFVGSLEYDPRIYSCTFIQTASTQYTAAVVVIHFLLPIAVVSF CYLRIWVLVLQARRKAKPESRLCLKPSDLRSFLTMFVVFVIFAICWAPLNCIGLAVAI NPQEMAPQIPEGLFVTSYLLAYFNSCLNAIVYGLLNQNFRREYKRILLALWNPRHCIQ DASKGSHAEGLQSPAPPIIGVQHQADAL" BASE COUNT 188 a 363 c 303 g 251 t ORIGIN 1 ggagagtctg cgatgtcaga gaacggctcc ttcgccaact gctgcgaggc gggcgggtgg 61 gcagtgcgcc cgggctggtc gggggctggc agcgcgcggc cctccaggac ccctcgacct 121 ccctgggtgg ctccagcgct gtccgcggtg ctcatcgtca ccaccgccgt ggacgtcgtg 181 ggcaacctcc tggtgatcct ctccgtgctc aggaaccgca agctccggaa cgcaggtaat 241 ttgttcttgg tgagtctggc attggctgac ctggtggtgg ccttctaccc ctacccgcta 301 atcctcgtgg ccatcttcta tgacggctgg gccctggggg aggagcactg caaggccagc 361 gcctttgtga tgggcctgag cgtcatcggc tctgtcttca atatcactgc catcgccatt 421 aaccgctact gctacatctg ccacagcatg gcctaccacc gaatctaccg gcgctggcac 481 acccctctgc acatctgcct catctggctc ctcaccgtgg tggccttgct gcccaacttc 541 tttgtggggt ccctggagta cgacccacgc atctattcct gcaccttcat ccagaccgcc 601 agcacccagt acacggcggc agtggtggtc atccacttcc tcctccctat cgctgtcgtg 661 tccttctgct acctgcgcat ctgggtgctg gtgcttcagg cccgcaggaa agccaagcca 721 gagagcaggc tgtgcctgaa gcccagcgac ttgcggagct ttctaaccat gtttgtggtg 781 tttgtgatct ttgccatctg ctgggctcca cttaactgca tcggcctcgc tgtggccatc 841 aacccccaag aaatggctcc ccagatccct gaggggctat ttgtcactag ctacttactg 901 gcttatttca acagctgcct gaatgccatt gtctatgggc tcttgaacca aaacttccgc 961 agggaataca agaggatcct cttggccctt tggaacccac ggcactgcat tcaagatgct 1021 tccaagggca gccacgcgga ggggctgcag agcccagctc cacccatcat tggtgtgcag 1081 caccaggcag atgctctcta gcctg // LOCUS HSU25433 1306 bp mRNA PRI 12-JUL-1995 DEFINITION Human protein associated with tumorigenic conversion (CATR1.3) mRNA, complete cds. ACCESSION U25433 NID g896044 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Li,D., Noyes,I., Shuler,C. and Milo,G.E. TITLE Cloning and sequencing of CATR1.3, a human gene associated with tumorigenic conversion JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (14), 6409-6413 (1995) MEDLINE 95327656 REFERENCE 2 (bases 1 to 1306) AUTHORS Li,D. TITLE Direct Submission JOURNAL Submitted (20-APR-1995) Dawei E. Li, The Ohio State University, Medical Biochemistry, 1645 Neil Ave., Columbus, OH 43210, USA FEATURES Location/Qualifiers source 1..1306 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="CATR1.3" /cell_line="SCC83-01-82" /cell_type="epithelial cell" /tissue_type="oral carcinoma" gene 46..285 /gene="CATR1.3" CDS 46..285 /gene="CATR1.3" /codon_start=1 /product="protein associated with tumorigenic conversion" /db_xref="PID:g896045" /translation="MVLNEEIPRHLLLTQNNDIIPKHHILILPAVDSYQKSVNDLRAL TFSKFQELKHAHELRNLCVSQSRFLAIMWFGTNTN" polyA_signal 1288..1293 BASE COUNT 468 a 249 c 225 g 364 t ORIGIN 1 accagcacac cactgtgtaa tttctatacg aggtttggct tggatatggt gctaaatgaa 61 gagattcctc gacatttgct tctcactcaa aataatgaca taattccgaa gcaccatatc 121 ttaatcttac cagcagtaga cagttatcaa aaaagtgtta atgatttaag agctctaaca 181 ttttctaagt ttcaagaatt aaagcatgcc catgaattaa gaaacctttg tgtctcccaa 241 tcaaggtttc tagctattat gtggtttggg actaacacca actgatgatg acaatgcaca 301 aaaaattcca ccattcattc cattatacta aaggctaatt gcatgggcct attattggaa 361 tatgctttcc tagttcaact agctgcattt caatagagta aagagggttt tctggagaaa 421 ccctactgtg aaaagatgaa ctttgtctta acaactttag tttcaaaaac tattcattta 481 tagatgccta tttcacgtct ctgaagcaaa atggttcatt tgttatgtag attactaagc 541 agtcagtcac ttaagaataa aaagtttctt ctttagaggc tccagctaac tgtctgcata 601 ggttcaatct aaaaaccagc aaagcatact gctaaatatg atagcaaata attgtttaaa 661 cacaaatgag caccaccttc aaattttcca atccactttc caagggccaa tctatgatta 721 tccccaacaa agactggagc ccctctctct cagagaagga atacaaaaca caggagaaag 781 atcataaaga actatgtaat ataaggagca gggagaaaat gtcaggtggg aaaaatggcc 841 ggaaatggga agaagaaaca tgtacaagaa tcaccaggag agtgacattc cccgccccac 901 tgatacctag agattgattc cccatcttaa tgacttctat aatataatct caagaaaatt 961 cttgaaactc aaatacctat aaatccagag gaaaaaggaa aaggtaacac atatgcacat 1021 ataaataatg aatttgctct tataaagagg ttagaaccca ctagtctaaa gctctaatca 1081 tagtctgctc gttctccagt tatcttgcat aatatgaatg gctggcccct gctgctccct 1141 ttactacttc aagttcatta ggcatagcag gtgtcaaata ttaagtggca ctaatatcaa 1201 tttaaccttg atttcataaa acttaaaaag gggagaaaaa gagataaaga tgtaagtaga 1261 tagactgaca atgtaacaaa ggtctgtaat agaaaaggct gcagca // LOCUS HSU25435 3780 bp mRNA PRI 11-SEP-1996 DEFINITION Human transcriptional repressor (CTCF) mRNA, complete cds. ACCESSION U25435 NID g924759 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3780) AUTHORS Filippova,G.N., Fagerlie,S., Klenova,E.M., Myers,C., Dehner,Y., Goodwin,G., Neiman,P.E., Collins,S.J. and Lobanenkov,V.V. TITLE An exceptionally conserved transcriptional repressor, CTCF, employs different combinations of zinc fingers to bind diverged promoter sequences of avian and mammalian c-myc oncogenes JOURNAL Mol. Cell. Biol. 16 (6), 2802-2813 (1996) MEDLINE 96220465 REFERENCE 2 (bases 1 to 3780) AUTHORS Filippova,G.N., Neiman,P.E., Collins,S. and Lobanenkov,V.V. TITLE Direct Submission JOURNAL Submitted (20-APR-1995) Victor V. Lobanenkov, Basic Sciences, Fred Hutchinson Ctr, 1124 Columbia Street, Seattle, WA 98104, USA FEATURES Location/Qualifiers source 1..3780 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HL-60" /chromosome="16" gene 292..2475 /gene="CTCF" CDS 292..2475 /gene="CTCF" /note="11-Zn-finger transcription factor" /codon_start=1 /function="transcriptional repressor binding to promoters of vertebrate c-myc genes" /product="CTCF" /db_xref="PID:g924760" /translation="MEGDAVEAIVEESETFIKGKERKTYQRRREGGQEEDACHLPQNQ TDGGEVVQDVNSSVQMVMMEQLDPTLLQMKTEVMEGTVAPEAEAAVDDTQIITLQVVN MEEQPINIGELQLVQVPVPVTVPVATTSVEELQGAYENEVSKEGLAESEPMICHTLPL PEGFQVVKVGANGEVETLEQGELPPQEDPSWQKDPDYQPPAKKTKKTKKSKLRYTEEG KDVDVSVYDFEEEQQEGLLSEVNAEKVVGNMKPPKPTKIKKKGVKKTFQCELCSYTCP RRSNLDRHMKSHTDERPHKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCPDCDMAFVTS GELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTY KLKRHMRTHSGEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPHCDTVIARKS DLGVHLRKQHSYIEQGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDQCDYACRQER HMIMHKRTHTGEKPYACSHCDKTFRQKQLLDMHFKRYHDPNFVPAAFVCSKCGKTFTR RNTMARHADNCAGPDGVEGENGGETKKSKRGRKRKMRSKKEDSSDSENAEPDLDDNED EEEPAVEIEPEPEPQPVTPAPPPAKKRRGRPPGRTNQPKQNQPTAIIQVEDQNTGAIE NIIVEVKKEPDAEPAEGEEEEAQPAATDAPNGDLTPEMILSMMDR" polyA_site 3780 /note="30 A nucleotides" BASE COUNT 1098 a 823 c 960 g 899 t ORIGIN 1 tgtgtctgag cctgtggagc gattaaaccg tgcgcggagc tgcttctttg gcggcagcgg 61 cggcggcggt ggccggtgcg gacgcgcgga gctcgccgga gacgccgggt ggccggagcc 121 gtggagcggc ggcggagcgg gcgccgcggg gggtgtggcg cggagaatga ttacggacct 181 gaagccaaag aacaagatgc gctagtggac agattgctga ccaggggctt gagagctggg 241 ttctattttc cctcctcaaa ctgactttgc agccacggag aggcagggga aatggaaggt 301 gatgcagtcg aagccattgt ggaggagtcc gaaactttta ttaaaggaaa ggagagaaag 361 acttaccaga gacgccggga agggggccag gaagaagatg cctgccactt accccagaac 421 cagacggatg ggggtgaggt ggtccaggat gtcaacagca gtgtacagat ggtgatgatg 481 gaacagctgg accccaccct tcttcagatg aagactgaag taatggaggg cacagtggct 541 ccagaagcag aggctgctgt ggacgatacc cagattataa ctttacaggt tgtaaatatg 601 gaggaacagc ccataaacat aggagaactt cagcttgttc aagtacctgt tcctgtgact 661 gtacctgttg ctaccacttc agtagaagaa cttcaggggg cttatgaaaa tgaagtgtct 721 aaagagggcc ttgcggaaag tgaacccatg atatgccaca ccctaccttt gcctgaaggg 781 tttcaggtgg ttaaagtggg ggccaatgga gaggtggaga cactagaaca aggggaactt 841 ccaccccagg aagatcctag ttggcaaaaa gacccagact atcagccacc agccaaaaaa 901 acaaagaaaa ccaaaaagag caaactgcgt tatacagagg agggcaaaga tgtagatgtg 961 tctgtctacg attttgagga agaacagcag gagggtctgc tatcagaggt taatgcggag 1021 aaagtggttg gtaatatgaa gcctccaaag ccaacaaaaa ttaaaaagaa aggtgtaaag 1081 aagacattcc agtgtgagct ttgcagttac acgtgtccac ggcgttcaaa tttggatcgt 1141 cacatgaaaa gccacactga tgagagacca cacaagtgcc atctctgtgg cagggcattc 1201 agaacagtca ccctcctgag gaatcacctt aacacacaca caggtactcg tcctcacaag 1261 tgcccagact gcgacatggc ctttgtgacc agtggagaat tggttcggca tcgtcgttac 1321 aaacacaccc acgagaagcc attcaagtgt tccatgtgcg attacgccag tgtagaagtc 1381 agcaaattaa aacgtcacat tcgctctcat actggagagc gtccgtttca gtgcagtttg 1441 tgcagttatg ccagcaggga cacatacaag ctgaaaaggc acatgagaac ccattcaggg 1501 gaaaagcctt atgaatgtta tatttgtcat gctcggttta cccaaagtgg taccatgaag 1561 atgcacattt tacagaagca cacagaaaat gtggccaaat ttcactgtcc ccactgtgac 1621 acagtcatag cccgaaaaag tgatttgggt gtccacttgc gaaagcagca ttcctatatt 1681 gagcaaggca agaaatgccg ttactgtgat gctgtgtttc atgagcgcta tgccctcatc 1741 cagcatcaga agtcacacaa gaatgagaag cgctttaagt gtgaccagtg tgattacgct 1801 tgtagacagg agaggcacat gatcatgcac aagcgcaccc acaccgggga gaagccttac 1861 gcctgcagcc actgcgataa gaccttccgc cagaagcagc ttctcgacat gcacttcaag 1921 cgctatcacg accccaactt cgtccctgcg gcttttgtct gttctaagtg tgggaaaaca 1981 tttacacgtc ggaataccat ggcaagacat gctgataatt gtgctggccc agatggcgta 2041 gagggggaaa atggaggaga aacgaagaag agtaaacgtg gaagaaaaag aaagatgcgc 2101 tctaagaaag aagattcctc tgacagtgaa aatgctgaac cagatctgga cgacaatgag 2161 gatgaggagg agcctgccgt agaaattgaa cctgagccag agcctcagcc tgtgacccca 2221 gccccaccac ccgccaagaa gcggagagga cgaccccctg gcagaaccaa ccagcccaaa 2281 cagaaccagc caacagctat cattcaggtt gaagaccaga atacaggtgc aattgagaac 2341 attatagttg aagtaaaaaa agagccagat gctgagcccg cagagggaga ggaagaggag 2401 gcccagccag ctgccacaga tgcccccaac ggagacctca cgcccgagat gatcctcagc 2461 atgatggacc ggtgatggcg gagccttgtg cgtcgccagg acttctctgg gctgtgttta 2521 aacggcccgc atcttaattt ttctcccttc tttctttttt tggctttggg aaaagcatca 2581 ttttaccaaa cataccgaga acgaaaactt caaggatgat gttagaaaaa aatgtgattt 2641 aactagaact tgctgtctga tgttagcaaa tcatggaatg ttctgagtcc ctgagggttt 2701 actgtgaagt gctgaggaca gtgttgacaa ctaactcgtt ttcctagatg gaaacggaga 2761 cattgacccc tccctccatg tggtaaacca ctccagaatg gccaccaggc ttcccagagt 2821 tctatggtct tcttcccaag agagttttta attgtaaatg catacttggg aaggacttag 2881 agttttaaac tgttttttgc ttttgctttt ccctgactcc ctttgcttgg agtcagctgc 2941 acaccagtag tatggcatgc tacgatcagg ttctgtcctg aaagctttgc ctctttcttg 3001 gcaaagtttc tggtatggtc aagcttgtaa ataacttttt ttacatttta atcttttcca 3061 ttaattaaga ggttgaaaag aagtgcagtg taagaaaacc cagcatttta attacttgca 3121 aattaagtta ccacagactc tgtagtgtgt aaatgttgac aaggaattgg atcacaatca 3181 tgtagcagaa tggcacccag accactgccc accagtgacg gacatgcacg tggcagatca 3241 tgatttccag cccacggagc cagcatttga accttgtata attaactttc agttatgatt 3301 tcccatcgac attttctttg ccctgtttgt agctgattgt tgtgttttat aaatcttctg 3361 ttaaggcaga agggtgatta tgagtggttc acagcagccc ttataagctg ggccagaaaa 3421 tttcactagg tcagtaattt aaaccttgga tcttcaaaaa ataaaataat gtgaagcaaa 3481 accaactaaa aagtgattct tgcacatgaa ctgtcacatg tttaaaaatg tgttttttag 3541 agagcctcag tcttactgat ttcaaacact tttttcttct gtgtattgct tttaagagag 3601 ccatcagtta gctatcagac tctaggttga tgcattttgt acttagctgt actgtgtgat 3661 atttttcatt attttaggac gccaacatga gacctgtaat aaaatatgta atggggttga 3721 aagctgggga ggaggatcta ctgctgtaca gctaataaat cataacggat taacaagtga // LOCUS HSU25677 456 bp mRNA PRI 20-JUL-1995 DEFINITION Human lysozyme precursor, mRNA, complete cds. ACCESSION U25677 NID g847819 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 456) AUTHORS Huang,B., Zhao,C., Lei,X. and Cai,L. TITLE The cloning, sequencing and analysis of Chinese human lysozyme gene cDNA amplified with RT-PCR from human placental total RNA JOURNAL Sheng Wu Hua Hsueh Tsa Chi 9 (No.3), 269-273 (1993) REFERENCE 2 (bases 1 to 456) AUTHORS Xu,L. TITLE Direct Submission JOURNAL Submitted (21-APR-1995) Lin Xu, Institute of Biophysics, Academia Sinica, Dept. of Protein Engineering, 15 Datun Road, Chaoyang District, Beijing 100101, Peoples Republic of China FEATURES Location/Qualifiers source 1..456 /organism="Homo sapiens" /strain="Chinese" /db_xref="taxon:9606" sig_peptide 1..54 CDS 1..447 /codon_start=1 /product="lysozyme precursor" /db_xref="PID:g847820" /translation="MKALIVLGLALLSVTVQGKVFERCELARTLKRLGMDGYRGISLA NWMCLAKWESGYNTRATNYNAGDRSTDYGIFQINSRYWCNDGKTPGAVNACHLSCSAL LQDNIADAAACAKRVVRDPQGVRAWAAWRNRCQDRDVRQYVQGCGV" mat_peptide 55..444 /label=JC1075 /product="lysozyme" BASE COUNT 121 a 88 c 133 g 114 t ORIGIN 1 atgaaggctc tcattgttct ggggcttgcc ctcctttctg ttacggtcca gggcaaggtc 61 tttgaaaggt gtgagttggc cagaactctg aaaagattgg gaatggatgg ctacagggga 121 atcagcctag caaactggat gtgtttggcc aaatgggaga gtggctacaa cacacgagct 181 acaaactaca atgctggaga cagaagcact gattatggga tatttcagat caatagccgc 241 tactggtgta atgatggcaa aaccccagga gcagttaatg cctgtcattt atcctgcagt 301 gctttgctgc aagataacat cgctgatgct gcagcttgtg caaagagggt tgtccgtgat 361 ccacaaggcg ttagagcatg ggcggcatgg agaaatcgtt gtcaagacag agatgtccgt 421 cagtatgttc aaggttgtgg agtgtaatga gtcgac // LOCUS HSU25997 3757 bp mRNA PRI 17-FEB-1996 DEFINITION Human stanniocalcin precursor (STC) mRNA, complete cds. ACCESSION U25997 NID g975297 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3757) AUTHORS Chang,A.C., Janosi,J., Hulsbeek,M., de Jong,D., Jeffrey,K.J., Noble,J.R. and Reddel,R.R. TITLE A novel human cDNA highly homologous to the fish hormone stanniocalcin JOURNAL Mol. Cell. Endocrinol. 112 (2), 241-247 (1995) MEDLINE 96077825 REFERENCE 2 (bases 1 to 3757) AUTHORS Chang,A.C.-M. TITLE Direct Submission JOURNAL Submitted (02-MAY-1995) Andy C.-M. Chang, Childrens Medical Research Institute, 214 Hawkesbury Road, Westmead, NSW 2145, Australia FEATURES Location/Qualifiers source 1..3757 /organism="Homo sapiens" /note="similar to the nucleotide sequence in GenBank Accession Number S50179" /db_xref="taxon:9606" /cell_type="fibroblast" /cell_line="HT1080" gene 160..903 /gene="STC" CDS 160..903 /gene="STC" /note="similar to SwissProt Accession Number Q08264" /codon_start=1 /function="putative calcium responsive hormone" /product="stanniocalcin precursor" /db_xref="PID:g975298" /translation="MLQNSAVLLVLVISASATHEAEQNDSVSPRKSRVAAQNSAEVVR CLNSALQVGCGAFACLENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANG VTSKVFLAIRRCSTFQRMIAEVQEECYSKLNVCSIAKRNPEAITEVVQLPNHFSNRYY NRLVRSLLECDEDTVSTIRDSLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNE PQKLKVLLRNLRGEEDSPSHIKRTSHESA" sig_peptide 160..258 /gene="STC" /note="pre-pro-peptide" mat_peptide 259..900 /gene="STC" /function="putative calcium responsive hormone" /product="stanniocalcin" polyA_site 3757 /note="19 A nucleotides" BASE COUNT 1099 a 875 c 735 g 1048 t ORIGIN 1 catcaccagc aacaacaaca aaaaaaaatc ctcatcaaat cctcacctaa gctttcagtg 61 tatccagatc cacatcttca ctcaagccag gagagggaaa gaggaaaggg gggcaggaaa 121 aaaaaaaaac ccaacaactt agcggaaact tctcagagaa tgctccaaaa ctcagcagtg 181 cttctggtgc tggtgatcag tgcttctgca acccatgagg cggagcagaa tgactctgtg 241 agccccagga aatcccgagt ggcggctcaa aactcagctg aagtggttcg ttgcctcaac 301 agtgctctac aggtcggctg cggggctttt gcatgcctgg aaaactccac ctgtgacaca 361 gatgggatgt atgacatctg taaatccttc ttgtacagcg ctgctaaatt tgacactcag 421 ggaaaagcat tcgtcaaaga gagcttaaaa tgcatcgcca acggggtcac ctccaaggtc 481 ttcctcgcca ttcggaggtg ctccactttc caaaggatga ttgctgaggt gcaggaagag 541 tgctacagca agctgaatgt gtgcagcatc gccaagcgga accctgaagc catcactgag 601 gtcgtccagc tgcccaatca cttctccaac agatactata acagacttgt ccgaagcctg 661 ctggaatgtg atgaagacac agtcagcaca atcagagaca gcctgatgga gaaaattggg 721 cctaacatgg ccagcctctt ccacatcctg cagacagacc actgtgccca aacacaccca 781 cgagctgact tcaacaggag acgcaccaat gagccgcaga agctgaaagt cctcctcagg 841 aacctccgag gtgaggagga ctctccctcc cacatcaaac gcacatccca tgagagtgca 901 taaccaggga gaggttattc acaacctcac caaactagta tcattttagg ggtgttgaca 961 caccaatttt gagtgtactg tgcctggttt gattttttta aagtagttcc tattttctat 1021 cccccttaaa gaaaattgca tgaaactagg cttctgtaat caatatccca acattctgca 1081 atggcagcat tcccaccaac aaaatccatg tgatcattct gcctctcctc aggagaaagt 1141 accctctttt accaacttcc tctgccatgt cttttcccct gctcccctga gaccaccccc 1201 aaacacaaaa cattcatgta actctccagc cattgtaatt tgaagatgtg gatcccttta 1261 gaacggttgc cccagtagag ttagctgata aggaaacttt atttaaatgc atgtcttaaa 1321 tgctcataaa gatgttaaat ggaattcgtg ttatgaatct gtgctggcca tggacgaata 1381 tgaatgtcac atttgaattc ttgatctcta atgagctagt gtcttatggt cttgatcctc 1441 caatgtctaa ttttctttcc gacacattta ccaaattgct tgagcctggc tgtccaacca 1501 gactttgagc ctgcatcttc ttgcatctaa tgaaaaacaa aaagctaaca tctttacgta 1561 ctgtaactgc tcagagcttt aaaagtatct ttaacaattg tcttaaaacc agagaatctt 1621 aaggtctaac tgtggaatat aaatagctga aaactaatgt actgtacata aattccagag 1681 gactctgctt aaacaaagca gtatataata actttattgc atatagattt agttttgtaa 1741 cttagcttta tttttctttt cctgggaatg gaataactat ctcacttcca gatatccaca 1801 taaatgctcc ttgtggcctt ttttataact aagggggtag aagtagtttt aattcaacat 1861 caaaacttaa gatgggcctg tatgagacag gaaaaaccaa caggtttatc tgaaggaccc 1921 caggtaagat gttaatctcc cagcccacct caacccagag gctactcttg acttagacct 1981 atactgaaag atctctgtca catccaactg gaaattccag gaaccaaaaa gagcatccct 2041 atgggcttgg accacttaca gtgtgataag gcctactata cattaggaag tggtagttct 2101 ttactcgtcc cctttcatcg gtgcctggta ctctggcaaa tgatgatggg gtgggagact 2161 ttccattaaa tcaatcagga atgagtcaat cagcctttag gtctttagtc cgggggactt 2221 ggggctgaga gagtataaat aaccctgggc tgtccagcct taatagactt ctcttacatt 2281 ttcgtcctgt agcacgctgc ctgccaaagt agtcctggca gctggaccat ctctgtagga 2341 tcgtaaaaaa atagaaaaaa agaaaaaaaa aagaaagaaa gagggaaaaa gagctggtgg 2401 tttgatcatt tctgccatga tgtttacaag atggcgacca ccaaagtcaa acgactaacc 2461 tatctatgaa caacagtagt ttctcagggt cactgtcctt gaacccaaca gtcccttatg 2521 agcgtcactg cccaccaaag gtcaatgtca agagaggaag agagggagga ggggtaggac 2581 tgcaggggcc actccaaact cgcttaggta gaaactattg gtgctcgact ctcactaggc 2641 taaactcaag atttgaccaa atcgagtgat agggatcctg gtgggaggag agagggcaca 2701 tctccagaaa aatgaaaagc aatacaactt taccataaag cctttaaaac cagtaacgtg 2761 ctgctcaagg accaagagca attgcagcag acccagcagc agcagcagca gcacaaacat 2821 tgctgccttt gtccccacac agcctctaag cgtgctgaca tcagattgtt aagggcattt 2881 ttatactcag aactgtccca tccccaggtc cccaaactta tggacactgc cttagcctct 2941 tggaaatcag gtagaccata ttctaagtta gactcttccc ctccctccca cacttcccac 3001 ccccaggcaa ggctgacttc tctgaatcag aaaagctatt aaagtttgtg tgttgtgtcc 3061 attttgcaaa cccaactaag ccaggacccc aatgcgacaa gtagttcatg agtattccta 3121 gcaaatttct ctctttcttc agttcagtag atttcctttt ttcttttctt tttttttttt 3181 tttttttttt ggctgtgacc tcttcaaacc gtggtacccc cccttttctc cccacgatga 3241 tatctatata tgtatctaca atacatatat ctacacatac agaaagaagc agttctcaca 3301 tgttgctagt tttttgcttc tctttccccc accctactcc ctccaattcc ccccttaaac 3361 ttccaaagct tcgtcttgtg tttgctgcag agtgattcgg gggctgacct agaccagttt 3421 gcatgattct tctcttgtga tttggttgca ctttagacat ttttgtgcca ttatatttgc 3481 attatgtatt tataatttaa atgatattta ggtttttggc tgagtactgg aataaacagt 3541 gagcatatct ggtatatgtc attatttatt gttaaattac attttttaag ctccatgtgc 3601 atataaaggt tatgaaacat atcatggtaa tgacagatgc aagttatttt atttgcttat 3661 tttttataat taaagatgcc atagcataat atgaagcctt tggtgaattc cttctaagat 3721 aaaaataata ataaagtgtt acgttttatt ggtttca // LOCUS HSU26174 1040 bp mRNA PRI 23-AUG-1995 DEFINITION Human pre-granzyme 3 mRNA, complete cds. ACCESSION U26174 NID g829637 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1040) AUTHORS Przetak,M.M., Yoast,S. and Schmidt,B.F. TITLE Cloning of cDNA for human granzyme 3 JOURNAL FEBS Lett. 364 (3), 268-271 (1995) MEDLINE 95278340 REFERENCE 2 (bases 1 to 1040) AUTHORS Schmidt,B.F. TITLE Direct Submission JOURNAL Submitted (02-MAY-1995) Brian F. Schmidt, Khepri Pharmaceuticals, Inc., 260 Littlefield Avenue, South San Francisco, CA 94080, USA FEATURES Location/Qualifiers source 1..1040 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Uni-ZAP XR" /tissue_type="ascites" 5'clip 1..21 5'UTR 1..40 sig_peptide 31..119 CDS 41..835 /codon_start=1 /product="pre-granzyme 3" /db_xref="PID:g829638" /translation="MTKFSSFSLFFLIVGAYMTHVCFNMEIIGGKEVSPHSRPFMASI QYGGHHVCGGVLIDPQWVLTAAHCQYRFTKGQSPTVVLGAHSLSKNEASKQTLEIKKF IPFSRVTSDPQSNDIMLVKLQTAAKLNKHVKMLHIRSKTSLRSGTKCKVTGWGATDPD SLRPSDTLREVTVTVLSRKLCNSQSYYNGDPFITKDMVCAGDAKGQKDSCKGDSGGPL ICKGVFHAIVSGGHECGVATKPGIYTLLTKKYQTWIKSNLVPPHTN" mat_peptide 119..832 /product="granzyme 3" 3'UTR 836..1040 polyA_signal 1023..1028 polyA_site 1040 BASE COUNT 300 a 231 c 204 g 305 t ORIGIN 1 aacacatttc atctgggctt cttaaatcta aatctttaaa atgactaagt tttcttcctt 61 ttctctgttt ttcctaatag ttggggctta tatgactcat gtgtgtttca atatggaaat 121 tattggaggg aaagaagtgt cacctcattc caggccattt atggcctcca tccagtatgg 181 cggacatcac gtttgtggag gtgttctgat tgatccacag tgggtgctga cagcagccca 241 ctgccaatat cggtttacca aaggccagtc tcccactgtg gttttaggcg cacactctct 301 ctcaaagaat gaggcctcca aacaaacact ggagatcaaa aaatttatac cattctcaag 361 agttacatca gatcctcaat caaatgatat catgctggtt aagcttcaaa cagccgcaaa 421 actcaataaa catgtcaaga tgctccacat aagatccaaa acctctctta gatctggaac 481 caaatgcaag gttactggct ggggagccac cgatccagat tcattaagac cttctgacac 541 cctgcgagaa gtcactgtta ctgtcctaag tcgaaaactt tgcaacagcc aaagttacta 601 caacggcgac ccttttatca ccaaagacat ggtctgtgca ggagatgcca aaggccagaa 661 ggattcctgt aagggtgact cagggggccc cttgatctgt aaaggtgtct tccacgctat 721 agtctctgga ggtcatgaat gtggtgttgc cacaaagcct ggaatctaca ccctgttaac 781 caagaaatac cagacttgga tcaaaagcaa ccttgtcccg cctcatacaa attaagttac 841 aaataatttt attggatgca cttgcttctt ttttcctaat atgctcgcag gttagagttg 901 ggtgtaagta aagcagagca catatggggt ccatttttgc acttgtaagt cattttatta 961 aggaatcaag ttctttttca cttgtatcac tgatgtattt ctaccatgct ggttttattc 1021 taaataaaat ttagaagact // LOCUS HSU26209 1953 bp mRNA PRI 27-APR-1996 DEFINITION Human renal sodium/dicarboxylate cotransporter (NADC1) mRNA, complete cds. ACCESSION U26209 NID g1098556 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1943) AUTHORS Pajor,A.M. TITLE Sequence and chromosomal localization of a human renal sodium/dicarboxylate cotransporter JOURNAL Am. J. Physiol. 270, 642-648 (1996) REFERENCE 2 (bases 1 to 1953) AUTHORS Pajor,A. M. TITLE Direct Submission JOURNAL Submitted (03-MAY-1995) Ana M. Pajor, Physiology, College of Medicine, University of Arizona, Tucson, AZ 85724, USA FEATURES Location/Qualifiers source 1..1953 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hu51" /clone_lib="renal lambda-gt10 library, Clontech" /sex="female" /tissue_type="kidney" /chromosome="17" gene 69..1847 /gene="NADC1" CDS 69..1847 /gene="NADC1" /note="huNaDC-1" /codon_start=1 /function="cotransport of sodium and dicarboxylates (succinate,citrate)" /evidence=experimental /product="renal sodium/dicarboxylate cotransporter" /db_xref="PID:g1098557" /translation="MATCWQALWAYRSYLIVFFVPILLLPLPILVPSKEAYCAYAIIL MALFWCTEALPLAVTALFPLILFPMMGIVDASEVAVEYLKDSNLLFFGGLLVAIAVEH WNLHKRIALRVLLIVGVRPAPLILGFMLVTAFLSMWISNTATSAMMVPIAHAVLDQLH SSQASSNVEEGSNNPTFELQEPSPQKEVTKLDNGQALPVTSASSEGRAHLSQKHLHLT QCMSLCVCYSASIGGIATLTGTAPNLVLQGQINSLFPQNGNVVNFASWFSFAFPTMVI LLLLAWLWLQILFLGFNFRKNFGIGEKMQEQQQAAYCVIQTEHRLLGPMTFAEKAISI LFVILVLLWFTREPGFFLGWGNLAFPNAKGESMVSDGTVAIFIGIIMFIIPSKFPGLT QDPENPGKLKAPLGLLDWKTVNQKMPWNIVLLLGGGYALAKGSERSGLSEWLGNKLTP LQSVPAPAIAIILSLLVATFTECTSNVATTTIFLPILASMAQAICLHPLYVMLPCTLA TSLAFMLPVATPPNAIVFSFGDLKVLDMARAGFLLNIIGVLIIALAINSWGIPLFSLH SFPSWAQSNTTAQCLPSLANTTTPSP" BASE COUNT 356 a 680 c 496 g 421 t ORIGIN 1 gatctttggt ccttctgtta cccagctcct ggaggcagtg gctgtagcag cccttgctgc 61 tccacaccat ggccacctgc tggcaggccc tgtgggccta tcgctcctac ctgatcgtgt 121 tcttcgtgcc cattctcctg ctgcctctgc ccatcctcgt ccccagtaag gaggcctact 181 gcgcgtatgc catcatcctc atggcgctct tctggtgcac tgaggccctg cccctggccg 241 tcactgccct cttcccctta atcctgttcc ctatgatggg catcgtggat gcctctgagg 301 ttgccgtcga gtatcttaag gactccaacc tcctgttctt cggggggctg ctggtggcca 361 tcgcggtgga acactggaac ctgcataaac gcatcgccct ccgtgtcctc ctcatcgttg 421 gggtgcggcc tgccccgcta atcctgggct tcatgctggt cacggccttc ctgtccatgt 481 ggatcagcaa cacggccacc tcagccatga tggtgcccat cgcacatgcc gtcctggacc 541 agctgcacag ctcgcaagcc agcagcaacg tcgaggaggg cagcaacaac cccaccttcg 601 agctccagga accaagtccc cagaaggagg tgaccaagct tgataatggg caggccctcc 661 ctgtcacgtc tgcctcttcg gaggggaggg cacatctcag ccagaagcat ctccacctca 721 cccagtgcat gagcctgtgc gtgtgctact ccgccagcat cgggggcatc gccacgctga 781 ctggcaccgc acccaacctg gtgctgcaag gccagatcaa ctcgctcttc ccccaaaacg 841 gcaacgtggt gaacttcgcc tcctggttca gcttcgcctt ccccaccatg gtcatcttgc 901 tgctgctggc ctggttgtgg ctgcagatcc tcttcctggg cttcaacttc cggaagaact 961 ttggcattgg ggaaaagatg caggagcaac agcaggcagc ctactgcgtc atccagaccg 1021 agcacaggct gctgggcccc atgacctttg cagaaaaggc catcagcatc ctattcgtca 1081 tcctggtgct gctctggttc acccgggagc cgggcttttt tcttggctgg ggcaatttgg 1141 cttttcccaa tgccaagggg gagagcatgg tgtccgatgg gacagtggcc atcttcatcg 1201 gcataattat gttcatcata ccctccaagt tcccagggct gacccaggac ccagaaaacc 1261 cagggaagct gaaggcccct cttggcctcc tcgactggaa gacggtgaac cagaagatgc 1321 cgtggaatat cgtgttattg ctgggtggtg gctatgccct ggccaagggc agtgagcgat 1381 cgggcctgtc agagtggctg ggaaacaagc tgaccccact gcagagtgtg ccagctccag 1441 ccattgccat catcctctcc ctcctggtgg ccaccttcac cgagtgcact agcaacgtgg 1501 ccaccactac gatcttcctg cccatcctag cctccatggc ccaggccatc tgcctccacc 1561 ctctctacgt catgctcccc tgcactctgg ccacctccct ggccttcatg ttgcctgtgg 1621 ccaccccgcc caatgccatc gtcttctctt tcggggacct caaagtgttg gatatggccc 1681 gggcaggatt cctcctcaac atcattggag tcctgatcat cgcactggcc atcaacagct 1741 ggggcatccc cctcttcagc ctgcactctt tcccctcctg ggcacagtcc aacaccacag 1801 cccagtgcct gccaagcctg gccaacacca ccacaccaag cccctaggct ggggcacagc 1861 ctggccatgc ccaggaagac ccaccccatt cccactcctc tgagcccgga ggggacaccc 1921 caagctccaa gctccaagct ccaggccgga att // LOCUS HSU26312 747 bp mRNA PRI 27-FEB-1997 DEFINITION Human heterochromatin protein HP1Hs-gamma mRNA, complete cds. ACCESSION U26312 NID g1773226 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 747) AUTHORS Ye,Q. and Worman,H.J. TITLE Interaction between an integral protein of the nuclear envelope inner membrane and human chromodomain proteins homologous to Drosophila HP1 JOURNAL J. Biol. Chem. 271 (25), 14653-14656 (1996) MEDLINE 96278941 REFERENCE 2 (bases 1 to 747) AUTHORS Ye,Q. and Worman,H.J. TITLE Direct Submission JOURNAL Submitted (04-MAY-1995) Howard J. Worman, Medicine, Columbia University, 630 W. 168th St., 10th Fl., Rm. 508, New York, NY 10032, USA REFERENCE 3 (bases 1 to 747) AUTHORS Ye,Q. and Worman,H.J. TITLE Direct Submission JOURNAL Submitted (13-JAN-1997) Howard J. Worman, Medicine, Columbia University, 630 W. 168th St., 10th Fl., Rm. 508, New York, NY 10032, USA REMARK Sequence and feature updates by submitter FEATURES Location/Qualifiers source 1..747 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Hela" CDS 1..522 /note="similar to Drosophila heterochromatin protein HP1 Swiss-Prot Accession Number P29227, and to human heterochromatin protein HP1Hs-alpha encoded by GenBank Accession Number U26311; contains chromo domain; recognized by autoantibodies from some patients with scleroderma; heterochromatin protein" /codon_start=1 /product="HP1Hs-gamma" /db_xref="PID:g1773227" /translation="MGKKQNGKSKKVEEAEPEEFVVEKVLDRRVVNGKVEYFLKWKGF TDADNTWEPEENLDCPELIEAFLNSQKAGKEKDGTKRKSLSDSESDDSKSKKKRDAAD KPRGFARGLDPERIIGAIDSSGELMFLMKWKDSDEADLVLAKEANMKCPQIVIAFYEE RLTWHSCPEDEAQ" polyA_site 747 /note="24 A nucleotides" BASE COUNT 264 a 92 c 172 g 219 t ORIGIN 1 atgggaaaaa aacagaatgg aaagagtaaa aaagttgaag aggcagagcc tgaagaattt 61 gtcgtggaaa aagtactaga tcgacgtgta gtgaatggga aagtggaata tttcctgaag 121 tggaagggat ttacagatgc tgacaatact tgggaacctg aagaaaattt agattgtcca 181 gaattgattg aagcgtttct taactctcag aaagctggca aagaaaaaga tggtacaaaa 241 agaaaatctt tatctgacag tgaatctgat gacagcaaat caaagaagaa aagagatgct 301 gctgacaaac caagaggatt tgccagaggt cttgatcctg aaagaataat tggtgccata 361 gacagcagtg gagaattgat gtttctcatg aaatggaaag attcagatga ggcagacttg 421 gtgctggcga aagaggcaaa tatgaagtgt cctcaaattg taattgcttt ttatgaagag 481 agactaactt ggcattcttg tccagaagat gaagctcaat aattgttcac attgttcttt 541 tatatatatt tatatatata tattaaaaat tgggtcttag attttgattt actagtgtga 601 caaaataact acatcctaat gaaaatcaag tttgatatgt ttgttttgaa agtagcgttg 661 gaagagttgt tgggggtttt ttgcatccat agcactggtt actttgaaca aataataaaa 721 gctttctgta gttgcttcct ttatcag // LOCUS HSU26398 3217 bp mRNA PRI 30-MAY-1996 DEFINITION Human inositol polyphosphate 4-phosphatase mRNA, complete cds. ACCESSION U26398 NID g1294811 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3217) AUTHORS Norris,F.A., Auethavekiat,V. and Majerus,P.W. TITLE The isolation and characterization of cDNA encoding human and rat brain inositol polyphosphate 4-phosphatase JOURNAL J. Biol. Chem. 270 (27), 16128-16133 (1995) MEDLINE 95332315 REFERENCE 2 (bases 1 to 3217) AUTHORS Norris,F.A., Authavekiat,V. and Majerus,P.W. TITLE Direct Submission JOURNAL Submitted (04-MAY-1995) Philip W. Majerus, Division of Hematology, Wash. Univ. Med. School, Box 8125 660 So. Euclid, St. Louis, MO 63110, USA FEATURES Location/Qualifiers source 1..3217 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" CDS 295..3111 /codon_start=1 /product="inositol polyphosphate 4-phosphatase" /db_xref="PID:g944911" /translation="MTAREHSPRHGARARAMQRASTIDVAADMLGLSLAGNIQDPDEP ILEFSLACSELHTPSLDRKPNSFVAVSVTTPPQAFWTKHAQTEIIEGTNNPIFLSSIA FFQDSLINQMTQVKLSVYDVKDRSQGTMYLLGSGTFIVKDLLQDRHHRLHLTLRSAES DRVGNITVIGWQMEEKSDQRPPVTRSVDTVNGRMVLPVDESLTEALGIRSKYASLRKD TLLKSVFGGAICRMYRFPTTDGNHLRILEQMAESVLSLHVPRQFVKLLLEEDAARVCE LEELGELSPCWESLRRQIVTQYQTIILTYQENLTDLHQYRGPSFKASSLKADKKLEFV PTNLHIQRMRVQDDGGSDQNYDIVTIGAPAAHCQGFKSGGLRKKLHKFEETKKHFEEC CTSSGCQSIIYIPQDVVRAKEIIAQINTLKTQVSYYAERLSRAAKDRSATGLERTLAI LADKTRQLVTVCDCKLLANSIHGLNAARPDYIASKASPTSTEEEQVMLRNDQDTLMAR WTGRNSRSSLQVDWHEEEWEKVWLNVDKSLECIIQRVDKLLQKERLHGEGCEDVFPCA GSCTSKKDCSPPPEESSPGEWSEALYPLLTTLTDCVAMMSDKAKKAMVFLLMQDSAPT IATYLSLQYRRDVVFCQTLTALICGFIIKLRNCLHDDGFLRQLYTIGLLAQFESLLST YGEELAMLEDMSLGIMDLRNVTFKVTQATSSASADMLPVITGNRDGFNVRVPLPGPLF DALPREIQSGMLLRVQPVLFNVGINEQQTLAERFGDTSLQEVINVESLVRLNSYFEQF KEVLPEDCLPRSRSQTCLPELLRFLGQNVHARKNKNVDILWQAAEICRRLNGVRFTSC KSAKDRTAMSVTLEQCLILQHEHGMAPQVFTQALECMRSEGCRRENTMKNVGSRKYAF NSLQLKAFPKHYRPPEGTYGKVET" BASE COUNT 781 a 856 c 906 g 674 t ORIGIN 1 cgaggaccag cacctgaggc cgggccgcag gccgacttca cagccaggcg cgcgcgtcgc 61 tgctgctgct gcggggctct ggcgcgccag ccggcgggac agcgggggct ggcgggggcc 121 ggggagcgcc agatgatgga tttggacatg cttctatgaa gaaatgccac atggtccgag 181 agcaaacatg tccaggctac tcactttagt gactagggct cggtgccagc acttcccggg 241 taatcagacg tggtctgacc gaggatcaag aagcacatca tcaccaatga catcatgaca 301 gcaagagagc acagccctcg ccatggtgcc agggcccgtg caatgcagcg ggcttccacc 361 atcgacgtgg cggccgacat gctgggcctc tctctggcag gaaatataca agacccagat 421 gagcccattt tagaatttag cttagcttgc agtgagctgc atactccatc gctagatcga 481 aagccaaata gttttgttgc ggtgagtgtc accacccctc ctcaggcatt ctggacgaag 541 catgcacaga cggagatcat tgagggaacc aacaatccta tatttctaag cagtattgcc 601 ttctttcaag actctcttat caatcagatg acacaagtca aactctccgt gtatgatgtc 661 aaagatagat ctcagggaac aatgtattta ctgggctctg gaacgttcat tgtcaaagat 721 ctgctccagg acaggcatca taggttgcat ttaacactaa ggtctgcaga gagtgaccgt 781 gtaggtaaca tcaccgtgat tggctggcag atggaggaga agtcagacca acggccccct 841 gtgacccggt ctgtggacac tgtcaatggg aggatggttc ttcctgtcga tgagagcttg 901 acggaggcgt taggaatccg atccaaatac gcttcattgc gaaaggacac tttgctgaaa 961 tcggtgttcg gtggtgccat ctgccgcatg taccggtttc caaccactga tggtaaccat 1021 ttgcggatcc tggagcagat ggcagagagc gtgctctccc tgcacgtgcc ccggcagttc 1081 gtgaagctcc tactagagga agatgcagcc agagtgtgtg agctggagga gctgggagag 1141 ctgtcccctt gctgggagag cctccggcgc caaattgtca cccagtacca gaccatcatc 1201 ctcacatacc aggagaacct gaccgacctc catcagtaca gagggccctc gtttaaagca 1261 agcagtttga aagcagataa aaagttagaa tttgttccca caaacttgca catacaaagg 1321 atgagagttc aagacgatgg aggatcagat cagaactacg acatcgtcac cattggggcg 1381 ccagcagcac actgccaagg ttttaagtca ggaggtctcc gcaaaaagct gcacaaattt 1441 gaagagacca agaaacattt tgaggagtgt tgtacatcat ctggctgcca gtccataatc 1501 tacatacccc aggatgttgt cagagccaag gagatcatcg cccagatcaa caccctgaaa 1561 acccaagtga gttactacgc agagcggctg tcaagggcag ccaaggacag gtctgccact 1621 ggccttgaga ggacactcgc catcttggca gacaagacac ggcagctggt cacggtctgc 1681 gactgcaagc tcctggccaa ctccatccat gggctgaacg ctgcacggcc tgactacatt 1741 gcctccaagg cctctcccac ttcgactgag gaggagcagg tgatgcttag aaatgaccag 1801 gacaccctca tggcccggtg gacagggaga aacagccgat cttccctgca ggtggactgg 1861 cacgaggagg agtgggagaa agtgtggctg aacgtggaca agagcctaga gtgcatcatt 1921 cagcgtgtgg acaagctgct gcagaaggag cggctgcatg gcgagggctg tgaggatgtc 1981 ttcccctgtg caggcagctg caccagcaag aaagattgca gtccccctcc tgaagagtcc 2041 agcccaggtg aatggagtga ggccctttac ccgctgctga ccactctcac cgactgcgtg 2101 gccatgatga gtgacaaggc caagaaggcc atggtattcc tgctcatgca ggacagcgcg 2161 cccaccatag ccacctacct gagcctgcag taccgccgtg acgtggtctt ctgccagacg 2221 ctgaccgccc tcatctgcgg cttcatcatt aagctgagga actgcctgca tgacgacggc 2281 ttcctgcgcc agctctacac catcgggctg ctggcccagt tcgagagcct gctgagcacc 2341 tacggggagg agctggcaat gctggaggac atgagccttg ggatcatgga cttgaggaac 2401 gtgaccttca aagtcactca ggccacttcc agcgcctccg cagacatgct gcccgtcatc 2461 acaggaaatc gcgacgggtt taacgtgcgg gtccctctgc cgggcccgct gtttgacgcc 2521 ttgccccggg agatccagag tggcatgctg ctgcgagtgc agcccgtcct cttcaacgtg 2581 ggcatcaatg agcagcagac actggccgag aggtttggcg atacgtcttt acaagaagtc 2641 atcaacgtgg agagtttggt gcggttaaat tcctactttg agcagtttaa ggaagttttg 2701 cctgaggatt gcctgcctcg gtctcgcagt cagacgtgcc tgccagagct gctgcggttt 2761 ctgggtcaga acgtgcatgc ccggaagaat aagaacgtcg acattctctg gcaagctgct 2821 gagatctgcc gccgccttaa tggggtccgg ttcaccagct gcaagagcgc taaggaccgt 2881 acagccatgt cggtgacact ggagcagtgc ctgatcctgc aacacgagca tggcatggcc 2941 ccgcaggtct tcacccaggc cctggagtgc atgcgcagtg agggttgtcg aagagaaaat 3001 acaatgaaga atgttggaag tcgcaaatat gcatttaatt ccctgcagct gaaggctttc 3061 cccaagcatt acaggcctcc cgaagggact tacggaaaag ttgaaacgtg aacacacggt 3121 ttcctctaat tagctgttac ataataaatg tgggtacctc tagtgtcata tatgaattct 3181 tcaggagact gaaggattgg tttattttgt gttcttt // LOCUS HSU26403 1574 bp mRNA PRI 14-OCT-1995 DEFINITION Human receptor tyrosine kinase ligand LERK-7 precursor (EPLG7) mRNA, complete cds. ACCESSION U26403 NID g1019430 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1574) AUTHORS Kozlosky,C.J., VandenBos,T., Park,L.S., Cerretti,D.P. and Carpenter,M.K. TITLE Direct Submission JOURNAL Submitted (04-MAY-1995) Douglas P. Cerretti, Molecular Biology, Immunex Corp., 51 University St., Seattle, WA 98101, USA FEATURES Location/Qualifiers source 1..1574 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda GT10 cDNA library; Clontech cat. #HL3003a" /dev_stage="fetus" /tissue_type="brain" sig_peptide 283..342 /gene="EPLG7" /note="putative" gene 283..969 /gene="EPLG7" CDS 283..969 /gene="EPLG7" /codon_start=1 /function="ligand for tyrosine kinases hek, elk, and eck" /product="LERK-7 precursor" /db_xref="PID:g1019431" /translation="MLHVEMLTLVFLVLWMCVFSQDPGSKAVADRYAVYWNSSNPRFQ RGDYHIDVCINDYLDVFCPHYEDSVPEDKTERYVLYMVNFDGYSACDHTSKGFKRWEC NRPHSPNGPLKFSEKFQLFTPFSLGFEFRPGREYFYISSAIPDNGRRSCLKLKVFVRP TNSCMKTIGVHDRVFDVNDKVENSLEPADDTVHESAEPSRGENAAQTPRIPSRLLAIL LFLLAMLLTL" mat_peptide 343..966 /gene="EPLG7" /function="ligand for tyrosine kinases hek, elk, and eck" /product="LERK-7" BASE COUNT 344 a 446 c 346 g 438 t ORIGIN 1 gcttctctcc atcttgtgat tcctttttcc tcctgaaccc tccagtgggg gtgcgagttt 61 gtctttatca ccccccatcc caccgccttc ttttcttctc gctctcctac ccctccccag 121 cttggtgggc gcctctttcc tttctcgccc cctttcattt ttatttattc atatttattt 181 ggcgcccgct ctctctctgt ccctttgcct gcctccctcc ctccggatcc ccgctctctc 241 cccggagtgg cgcgtcgggg gctccgccgc tggccaggcg tgatgttgca cgtggagatg 301 ttgacgctgg tgtttctggt gctctggatg tgtgtgttca gccaggaccc gggctccaag 361 gccgtcgccg accgctacgc tgtctactgg aacagcagca accccagatt ccagaggggt 421 gactaccata ttgatgtctg tatcaatgac tacctggatg ttttctgccc tcactatgag 481 gactccgtcc cagaagataa gactgagcgc tatgtcctct acatggtgaa ctttgatggc 541 tacagtgcct gcgaccacac ttccaaaggg ttcaagagat gggaatgtaa ccggcctcac 601 tctccaaatg gaccgctgaa gttctctgaa aaattccagc tcttcactcc cttttctcta 661 ggatttgaat tcaggccagg ccgagaatat ttctacatct cctctgcaat cccagataat 721 ggaagaaggt cctgtctaaa gctcaaagtc tttgtgagac caacaaatag ctgtatgaaa 781 actataggtg ttcatgatcg tgttttcgat gttaacgaca aagtagaaaa ttcattagaa 841 ccagcagatg acaccgtaca tgagtcagcc gagccatccc gcggcgagaa cgcggcacaa 901 acaccaagga tacccagccg ccttttggca atcctactgt tcctcctggc gatgcttttg 961 acattatagc acagtctcct cccatcactt gtcacagaaa acatcagggt cttggaacac 1021 cagagatcca cctaactgct catcctaaga agggacttgt tattgggttt tggcagatgt 1081 cagatttttg ttttctttct ttcagcctga attctaagca acaacttcag gttgggggcc 1141 taaacttgtt cctgcctccc tcaccccacc ccgccccacc cccagccctg gcccttggct 1201 tctctcaccc ctcccaaatt aaatggactc cagatgaaaa tgccaaattg tcatagtgac 1261 accagtggtt cgtcagctcc tgtgcattct cctctaagaa ctcacctccg ttagcgcact 1321 gtgtcagcgg gctatggaca aggaagaata gtggcagatg cagccagcgc tggctagggc 1381 tgggagggtt ttgctctcct atgcaatatt tatgccttct cattcagaac tgtaagatga 1441 tcgcgcaggg catcatgtca ccatgtcagg tccggagggg aggtattaag aatagatacg 1501 atattacacc atttcctata ggagtatgta aatgaacagg cttctaaaag gttgagacac 1561 tggttttttt tttt // LOCUS HSU26424 2820 bp mRNA PRI 27-FEB-1996 DEFINITION Human Ste20-like kinase (MST2) mRNA, complete cds. ACCESSION U26424 NID g1203795 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2820) AUTHORS Creasy,C.L. and Chernoff,J. TITLE Cloning and characterization of a member of the MST subfamily of Ste20-like kinases JOURNAL Gene 167 (1-2), 303-306 (1995) MEDLINE 96144292 REFERENCE 2 (bases 1 to 2820) AUTHORS Creasy,C.L. TITLE Direct Submission JOURNAL Submitted (05-MAY-1995) Caretha L. Creasy, Fox Chase Cancer Center, 7701 Burholme Avenue, Philadelphia, PA 19111, USA FEATURES Location/Qualifiers source 1..2820 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pBS-4-11" /clone_lib="lambda-ZAP HeLa cDNA library, Stratagene" /cell_line="HeLa" gene 139..1614 /gene="MST2" CDS 139..1614 /gene="MST2" /note="Ste20-like kinase" /codon_start=1 /product="MST2" /db_xref="PID:g1203796" /translation="MEQPPAPKSKLKKLSEDSLTKQPEEVFDVLEKLGEGSYGSVFKA IHKESGQVVAIKQVPVESDLQEIIKEISIMQQCDSPYVVKYYGSYFKNTDLWIVMEYC GAGSVSDIIRLRNKTLIEDEIATILKSTLKGLEYLHFMRKIHRDIKAGNILLNTEGHA KLADFGVAGQLTDTMAKRNTVIGTPFWMAPEVIQEIGYNCVADIWSLGITSIEMAEGK PPYADIHPMRAIFMIPTNPPPTFRKPELWSDDFTDFVKKCLVKNPEQRATATQLLQHP FIKNAKPVSILRDLITEAMEIKAKRHDEQQRELEEEEENSDEDELDSHTMVKTSVGEC GTMRATSTMSEGAQTMIEHNSTMLESDLGTMVINSEDEEEEDGTMKRNATSPQVQRPS FMDYFDKQDFKNKSHENCNQNMHEPFPMSKNVFPDNWKVPQDGDFDFLKNLSLEELQM RLKALDPMMEREIEELRQRYTAKRQPILDAMDAKKRRQQNF" BASE COUNT 925 a 498 c 579 g 818 t ORIGIN 1 ccgcggagtt acgggaaagt tggtccgagt tcccagagtt tccctctgtg gtgccctagg 61 cttcggcccg gtgccccggc tcctttcctc ctttcggcct tcgccgtcca ccaggtccct 121 ctctctgtcc cggccgccat ggagcagccg ccggcgccta agagtaaact aaaaaagctg 181 agtgaagaca gtttgactaa gcagcctgaa gaagtttttg atgtattaga gaagcttgga 241 gaagggtctt atggaagtgt atttaaagca atacacaagg aatccggtca agttgtcgca 301 attaaacaag tacctgttga atcagatctt caggaaataa tcaaagaaat ttccataatg 361 cagcaatgtg acagcccata tgttgtaaag tactatggca gttattttaa gaatacagac 421 ctctggattg ttatggagta ctgtggcgct ggctctgtct cagacataat tagattacga 481 aacaagacat taatagaaga tgaaattgca accattctta aatctacatt gaaaggacta 541 gaatatttgc actttatgag aaaaatacac agagatataa aagctggaaa tattctcctc 601 aatacagaag gacatgcaaa attggcagat tttggagtgg ctggtcagtt aacagataca 661 atggcaaaac gcaatactgt aataggaact ccattttgga tggctcctga ggtgattcaa 721 gaaataggct ataactgtgt ggccgacatc tggtcccttg gcattacttc tatagaaatg 781 gctgaaggaa aacctcctta tgctgatata catccaatga gggctatttt tatgattccc 841 acaaatccac caccaacatt cagaaagcca gaactttggt ccgatgattt caccgatttt 901 gttaaaaagt gtttggtgaa gaatcctgag cagagagcta ctgcaacaca acttttacag 961 catcctttta tcaagaatgc caaacctgta tcaatattaa gagacctgat cacagaagct 1021 atggagatca aagctaaaag acatgacgaa cagcaacgag aattggaaga ggaagaagaa 1081 aattcggatg aagatgagct ggattcccac accatggtga agactagtgt gggagagtgt 1141 ggcaccatgc gggccacaag cacgatgagt gaaggggccc agaccatgat tgaacataat 1201 agcacgatgt tggaatccga cttggggacc atggtgataa acagtgagga tgaggaagaa 1261 gaagatggaa ctatgaaaag aaatgcaacc tcaccacaag tacaaagacc atctttcatg 1321 gactactttg ataagcaaga cttcaagaat aagagtcacg aaaactgtaa tcagaacatg 1381 catgaaccct tccctatgtc caaaaacgtt tttcctgata actggaaagt tcctcaagat 1441 ggagactttg actttttgaa aaatctaagt ttagaagaac tacagatgcg gttaaaagca 1501 ctggacccca tgatggaacg ggagatagaa gaacttcgtc agagatacac tgcgaaaaga 1561 cagcccattc tggatgcgat ggatgcaaag aaaagaaggc agcaaaactt ttgagtctaa 1621 tttcctctct gtttttaact attctggaga ccaagaaacc actaggaatt gaaggaatat 1681 ttggatattt ttaatcctaa gattttgccc tacaattagg cagaggtcaa aaagtgacaa 1741 tggtacatgc ccaggtaaat tcccaaaagg cagaattgac agttgtatct gctgtgcatt 1801 cactctaaga tgaggagaac aaaagaagtg tattctcttg ttctgtcagc tgcataccag 1861 taataaaact gttatgaaat ggattttcaa ggtctctaaa ccttgaaaat ccaaaggcta 1921 ttgttgcatt gtacagcact gaaagggctt tatgttacaa tattctttat tcctatctag 1981 tatactaggc tatttattgt ccccttaggt aaacttattt atttatgcta ttttggcttt 2041 gtttcatttt ttaaggacaa gatcaggata gctttggtga aggtagggtc atattaatat 2101 gatgataatg tgcaaccaat ttatactttc tgcagggagc tatggggtac attccttgat 2161 ttccaggata gtttttcaaa taggaaagca ataatggcag tagttctcaa atgggctagg 2221 ccttttttat attgaagcaa taattccatt tttacccttt gaaattttgt ttttttgatt 2281 tttgatgttt ggtacaaata gaactatata tatttaggta aaatagatct atcgtgttta 2341 aaaccaaaga aatcaatgga acccttgcac aaaaaagtgt gataaatatt tttaaataaa 2401 aacttaatac aaatgtaatt tgttaatatt gtttcatgtt ttatgtgtag atctaatagc 2461 tgaactgatt caaactgtaa taagctcatc aatttcattt ctatgaaaat gtgctctgtt 2521 gtcacaggat gtttctgttg attttattca tttcctggga attggtaaac atcatgttcc 2581 tgatgataac ccagtagcaa aaacatttgt actgagtggt acaagccttg gggactgaaa 2641 aaaaaaaaag attaaaacca ttaaaaagaa actcattttt acgctgaatg aacatttata 2701 tgattgcatt gggaccagtc atttcctaag ctacatatgg ccatcttgac agtgtttttt 2761 cttttgtgtg tttaattatt atgtgtaaat cataaagaca aataaatttc actgtgccac // LOCUS HSU26446 1667 bp mRNA PRI 01-JUN-1995 DEFINITION Human protoporphyrinogen oxidase mRNA, complete cds. ACCESSION U26446 NID g837327 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1667) AUTHORS Dailey,T.A., Meissner,P. and Dailey,H.A. TITLE Expression of a cloned protoporphyrinogen oxidase JOURNAL J. Biol. Chem. 269 (2), 813-815 (1994) MEDLINE 94117488 REFERENCE 2 (bases 1 to 1667) AUTHORS Dailey,T.A. and Dailey,H.A. TITLE Direct Submission JOURNAL Submitted (05-MAY-1995) Harry A. Dailey, Microbiology, University of Georgia, Athens, GA 30602-2605, USA FEATURES Location/Qualifiers source 1..1667 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" CDS 192..1625 /EC_number="1.3.3.4" /codon_start=1 /product="protoporphyrinogen oxidase" /db_xref="PID:g837328" /translation="MGRTVVVLGGGISGLAASYHLSRAPCPPKVVLVESSERLGGWIR SVRGPNGAIFELGPRGIRPAGALGARTLLLVSELGLDSEVLPVRGDHPAAQNRFLYVG GALHALPTGLRGLLRPSPPFSKPLFWAGLRELTKPRGKEPDETVHSFAQRRLGPEVAS LAMDSLCRGVFAGNSRELSIRSCFPSLFQAEQTHRSILLGLLLGAGRTPQPDSALIRQ ALAERWSQWSLRGGLEMLPQALETHLTSRGVSVLRGQPVCGLSLQAEGRWKVSLRDSS LEADHVISAIPASVLSELLPAEAAPLARALSAITAVSVAVVNLQYQGAHLPVQGFGHL VPSSEDPGVLGIVYDSVAFPEQDGSPPGLRVTVMLGGSWLQTLEASGCVLSQELFQQR AQEAAATQLGLKEMPSHCLVHLHKNCIPQYTLGHWQKLESARQFLTAHRLPLTLAGAS YEGVAVNDCIESGRQAAVSVLGTEPNS" BASE COUNT 320 a 472 c 516 g 359 t ORIGIN 1 cggagagtag gagagaccga aaaggctggg ggtgggagta gcggatttga agcacttgtt 61 ggcctacaga ggtgtggcaa gcagagcacc tcagaactca gtcgtactgc ccgccgcccg 121 agccgtgcga gggccgatag cgagggtgtg gcccttatct gcacccagca gagcgccggc 181 ggggtttccg catgggccgg accgtggtcg tgctgggcgg aggcatcagc ggcttggccg 241 ccagttacca cctgagccgg gccccctgcc cccctaaggt ggtcctagtg gagagcagtg 301 agcgtctggg aggctggatt cgctccgttc gaggccctaa tggtgctatc tttgagcttg 361 gacctcgggg aattaggcca gcgggagccc taggggcccg gaccttgctc ctggtttctg 421 agcttggctt ggattcagaa gtgctgcctg tccggggaga ccacccagct gcccagaaca 481 ggttcctcta cgtgggcggt gccctgcatg ccctacccac tggcctcagg gggctactcc 541 gcccttcacc ccccttctcc aaacctctgt tttgggctgg gctgagggag ctgaccaagc 601 cccggggcaa agagcctgat gagactgtgc acagttttgc ccagcgccgc cttggacctg 661 aggtggcgtc tctagccatg gacagtctct gccgtggagt gtttgcaggc aacagccgtg 721 agctcagcat caggtcctgc tttcccagtc tcttccaagc tgagcaaacc catcgttcca 781 tattactggg cctgctgctg ggggcagggc ggaccccaca gccagactca gcactcattc 841 gccaggcctt ggctgagcgc tggagccagt ggtcacttcg tggaggtcta gagatgttgc 901 ctcaggccct tgaaacccac ctgactagta ggggggtcag tgttctcaga ggccagccgg 961 tctgtgggct cagcctccag gcagaagggc gctggaaggt atctctaagg gacagcagtc 1021 tggaggctga ccacgttatt agtgccattc cagcttcagt gctcagtgag ctgctccctg 1081 ctgaggctgc ccctctggct cgtgccctga gtgccatcac tgcagtgtct gtagctgtgg 1141 tgaatctgca gtaccaagga gcccatctgc ctgtccaggg atttggacat ttggtgccat 1201 cttcagaaga tccaggagtc ctgggaatcg tgtatgactc agttgctttc cctgagcagg 1261 acgggagccc ccctggcctc agagtgactg tgatgctggg aggttcctgg ttacagacac 1321 tggaggctag tggctgtgtc ttatctcagg agctgtttca acagcgggcc caggaagcag 1381 ctgctacaca attaggactg aaggagatgc cgagccactg cttggtccat ctacacaaga 1441 actgcattcc ccagtataca ctaggtcact ggcaaaaact agagtcagct aggcaattcc 1501 tgactgctca caggttgccc ctgactctgg ctggagcctc ctatgaggga gttgctgtta 1561 atgactgtat agagagtggg cgccaggcag cagtcagtgt cctgggcaca gaacctaaca 1621 gctgatcccc aactctcatt catgaaaata aaaattgctg gagcttg // LOCUS HSU26455 5912 bp mRNA PRI 02-FEB-1996 DEFINITION Human phosphatidylinositol 3-kinase homolog (ATM) mRNA, complete cds. ACCESSION U26455 NID g870785 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5912) AUTHORS Savitsky,K., Bar-Shira,A., Gilad,S., Rotman,G., Ziv,Y., Vanagaite,L., Tagle,D.A., Smith,S., Uziel,T., Sfez,S., Ashkenazi,M., Pecker,I., Frydman,M., Harnik,R., Patanjali,S.R., Simmons,A., Clines,G.A., Sartiel,A., Gatti,R.A., Chessa,L., Sanal,O., Lavin,M.F., Jaspers,N.J., Taylor,A.R., Arlett,C.F., Miki,T., Weissman,S.M., Lovett,M., Collins,F.S. and Shiloh,Y. TITLE A single ataxia telangiectasia gene with a product similar to PI-3 kinase JOURNAL Science 268 (5218), 1749-1753 (1995) MEDLINE 95312868 REFERENCE 2 (bases 1 to 5912) AUTHORS Shiloh,Y., Savitsky,K., Bar-Shira,A., Gilad,S., Rotman,G., Ziv,Y., Vanagaite,L., Pecker,I., Uziel,T., Ashkenazi,M., Sfez,S., Smith,S., Sartiel,A. and Harnik,R. TITLE Direct Submission JOURNAL Submitted (08-MAY-1995) Yossi Shiloh, Human Genetics, Sackler School of Medicine, Tel-Aviv University, Ramat-Aviv, Tel-Aviv 69978, Israel COMMENT This clone may represent either splicing intermediate or alternatively spliced transcript of this gene, in which sequences of the adjacent intron were left to serve as an untranslated leader. FEATURES Location/Qualifiers source 1..5912 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="7-9" /chromosome="11" /map="11q22-23" misc_feature 1..259 /note="unspliced adjacent intron" gene 260..5386 /gene="ATM" CDS 260..5386 /gene="ATM" /note="similar to yeast phosphatidylinositol 3-kinase Tor1p, Swiss-Prot Accession Number P35169, and Tor2p, Swiss-Prot Accession Number P32600, and Esr1p, Swiss-Prot Accession Number P38111" /codon_start=1 /product="phosphatidylinositol 3-kinase homolog" /db_xref="PID:g870786" /translation="MTLHEPANSSASQSTDLCDFSGDLDPAPNPPHFPSHVIKATFAY ISNCHKTKLKSILEILSKSPDSYQKILLAICEQAAETNNVYKKHRILKIYHLFVSLLL KDIKSGLGGAWAFVLRDVIYTLIHYINQRPSCIMDVSLRSFSLCCDLLSQVCQTAVTY CKDALENHLHVIVGTLIPLVYEQVEVQKQVLDLLKYLVIDNKDNENLYITIKLLDPFP DHVVFKDLRITQQKIKYSRGPFSLLEEINHFLSVSVYDALPLTRLEGLKDLRRQLELH KDQMVDIMRASQDNPQDGIMVKLVVNLLQLSKMAINHTGEKEVLEAVGSCLGEVGPID FSTIAIQHSKDASYTKALKLFEDKELQWTFIMLTYLNNTLVEDCVKVRSAAVTCLKNI LATKTGHSFWEIYKMTTDPMLAYLQPFRTSRKKFLEVPRFDKENPFEGLDDINLWIPL SENHDIWIKTLTCAFLDSGGTKCEILQLLKPMCEVKTDFCQTVLPYLIHDILLQDTNE SWRNLLSTHVQGFFTSCLRHFSQTSRSTTPANLDSESEHFFRCCLDKKSQRTMLAVVD YMRRQKRPSSGTIFNDAFWLDLNYLEVAKVAQSCAAHFTALLYAEIYADKKSMDDQEK RSLAFEEGSQSTTISSLSEKSKEETGISLQDLLLEIYRSIGEPDSLYGCGGGKMLQPI TRLRTYEHEAMWGKALVTYDLETAIPSSTRQAGIIQALQNLGLCHILSVYLKGLDYEN KDWCPELEELHYQAAWRNMQWDHCTSVSKEVEGTSYHESLYNALQSLRDREFSTFYES LKYARVKEVEEMCKRSLESVYSLYPTLSRLQAIGELESIGELFSRSVTHRQLSEVYIK WQKHSQLLKDSDFSFQEPIMALRTVILEILMEKEMDNSQRECIKDILTKHLVELSILA RTFKNTQLPERAIFQIKQYNSVSCGVSEWQLEEAQVFWAKKEQSLALSILKQMIKKLD ASCAANNPSLKLTYTECLRVCGNWLAETCLENPAVIMQTYLEKAVEVAGNYDGESSDE LRNGKMKAFLSLARFSDTQYQRIENYMKSSEFENKQALLKRAKEEVGLLREHKIQTNR YTVKVQRELELDELALRALKEDRKRFLCKAVENYINCLLSGEEHDMWVFRLCSLWLEN SGVSEVNGMMKRDGMKIPTYKFLPLMYQLAARMGTKMMGGLGFHEVLNNLISRISMDH PHHTLFIILALANANRDEFLTKPEVARRSRITKNVPKQSSQLDEDRTEAANRIICTIR SRRPQMVRSVEALCDAYIILANLDATQWKTQRKGINIPADQPITKLKNLEDVVVPTME IKVDHTGEYGNLVTIQSFKAEFRLAGGVNLPKIIDCVGSDGKERRQLVKGRDDLRQDA VMQQVFQMCNTLLQRNTETRKRKLTICTYKVVPLSQRSGVLEWCTGTVPIGEFLVNNE DGAHKRYRPNDFSAFQCQKKMMEVQKKSFEEKYEVFMDVCQNFQPVFRYFCMEKFLDP AIWFEKRLAYTRSVATSSIVGYILGLGDRHVQNILINEQSAELVHIDLGVAFEQGKIL PTPETVPFRLTRDIVDGMGITGVEGVFRRCCEKTMEVMRNSQETLLTIVEVLLYDPLF DWTMNPLKALYLQQRPEDETELHPTLNADDQECKRNLSDIDQSFDKVAERVLMRLQEK LKGVEEGTVLSVGGQVNLLIQQAIDPKNLSRLFPGWKAWV" repeat_region 5610..5912 /note="Alu-Sq subfamily" /rpt_type=dispersed /rpt_family="Alu" BASE COUNT 1885 a 1085 c 1281 g 1661 t ORIGIN 1 catacttttt cctcttagtc tacaggttgg ctgcatagaa gaaaaaggta gagttattta 61 taatcttgta aatcttggac tttgagtcat ctattttctt ttacagtcat cgaatacttt 121 tggaaataag gtaatatatg ccttttgagc tgtcttgacg ttcacagata taaaatatta 181 aatatatttt aattttgtgc ccttgcagat tgatcactta ttcattagta atttaccaga 241 gattgtggtg gagttattga tgacgttaca tgagccagca aattctagtg ccagtcagag 301 cactgacctc tgtgactttt caggggattt ggatcctgct cctaatccac ctcattttcc 361 atcgcatgtg attaaagcaa catttgccta tatcagcaat tgtcataaaa ccaagttaaa 421 aagcatttta gaaattcttt ccaaaagccc tgattcctat cagaaaattc ttcttgccat 481 atgtgagcaa gcagctgaaa caaataatgt ttataagaag cacagaattc ttaaaatata 541 tcacctgttt gttagtttat tactgaaaga tataaaaagt ggcttaggag gagcttgggc 601 ctttgttctt cgagacgtta tttatacttt gattcactat atcaaccaaa ggccttcttg 661 tatcatggat gtgtcattac gtagcttctc cctttgttgt gacttattaa gtcaggtttg 721 ccagacagcc gtgacttact gtaaggatgc tctagaaaac catcttcatg ttattgttgg 781 tacacttata ccccttgtgt atgagcaggt ggaggttcag aaacaggtat tggacttgtt 841 gaaatactta gtgatagata acaaggataa tgaaaacctc tatatcacga ttaagctttt 901 agatcctttt cctgaccatg ttgtttttaa ggatttgcgt attactcagc aaaaaatcaa 961 atacagtaga ggaccctttt cactcttgga ggaaattaac cattttctct cagtaagtgt 1021 ttatgatgca cttccattga caagacttga aggactaaag gatcttcgaa gacaactgga 1081 actacataaa gatcagatgg tggacattat gagagcttct caggataatc cgcaagatgg 1141 gattatggtg aaactagttg tcaatttgtt gcagttatcc aagatggcaa taaaccacac 1201 tggtgaaaaa gaagttctag aggctgttgg aagctgcttg ggagaagtgg gtcctataga 1261 tttctctacc atagctatac aacatagtaa agatgcatct tataccaagg cccttaagtt 1321 atttgaagat aaagaacttc agtggacctt cataatgctg acctacctga ataacacact 1381 ggtagaagat tgtgtcaaag ttcgatcagc agctgttacc tgtttgaaaa acattttagc 1441 cacaaagact ggacatagtt tctgggagat ttataagatg acaacagatc caatgctggc 1501 ctatctacag ccttttagaa catcaagaaa aaagttttta gaagtaccca gatttgacaa 1561 agaaaaccct tttgaaggcc tggatgatat aaatctgtgg attcctctaa gtgaaaatca 1621 tgacatttgg ataaagacac tgacttgtgc ttttttggac agtggaggca caaaatgtga 1681 aattcttcaa ttattaaagc caatgtgtga agtgaaaact gacttttgtc agactgtact 1741 tccatacttg attcatgata ttttactcca agatacaaat gaatcatgga gaaatctgct 1801 ttctacacat gttcagggat ttttcaccag ctgtcttcga cacttctcgc aaacgagccg 1861 atccacaacc cctgcaaact tggattcaga gtcagagcac tttttccgat gctgtttgga 1921 taaaaaatca caaagaacaa tgcttgctgt tgtggactac atgagaagac aaaagagacc 1981 ttcttcagga acaattttta atgatgcttt ctggctggat ttaaattatc tagaagttgc 2041 caaggtagct cagtcttgtg ctgctcactt tacagcttta ctctatgcag aaatctatgc 2101 agataagaaa agtatggatg atcaagagaa aagaagtctt gcatttgaag aaggaagcca 2161 gagtacaact atttctagct tgagtgaaaa aagtaaagaa gaaactggaa taagtttaca 2221 ggatcttctc ttagaaatct acagaagtat aggggagcca gatagtttgt atggctgtgg 2281 tggagggaag atgttacaac ccattactag actacgaaca tatgaacacg aagcaatgtg 2341 gggcaaagcc ctagtaacat atgacctcga aacagcaatc ccctcatcaa cacgccaggc 2401 aggaatcatt caggccttgc agaatttggg actctgccat attctttccg tctatttaaa 2461 aggattggat tatgaaaata aagactggtg tcctgaacta gaagaacttc attaccaagc 2521 agcatggagg aatatgcagt gggaccattg cacttccgtc agcaaagaag tagaaggaac 2581 cagttaccat gaatcattgt acaatgctct acaatctcta agagacagag aattctctac 2641 attttatgaa agtctcaaat atgccagagt aaaagaagtg gaagagatgt gtaagcgcag 2701 ccttgagtct gtgtattcgc tctatcccac acttagcagg ttgcaggcca ttggagagct 2761 ggaaagcatt ggggagcttt tctcaagatc agtcacacat agacaactct ctgaagtata 2821 tattaagtgg cagaaacact cccagcttct caaggacagt gattttagtt ttcaggagcc 2881 tatcatggct ctacgcacag tcattttgga gatcctgatg gaaaaggaaa tggacaactc 2941 acaaagagaa tgtattaagg acattctcac caaacacctt gtagaactct ctatactggc 3001 cagaactttc aagaacactc agctccctga aagggcaata tttcaaatta aacagtacaa 3061 ttcagttagc tgtggagtct ctgagtggca gctggaagaa gcacaagtat tctgggcaaa 3121 aaaggagcag agtcttgccc tgagtattct caagcaaatg atcaagaagt tggatgccag 3181 ctgtgcagcg aacaatccca gcctaaaact tacatacaca gaatgtctga gggtttgtgg 3241 caactggtta gcagaaacgt gcttagaaaa tcctgcggtc atcatgcaga cctatctaga 3301 aaaggcagta gaagttgctg gaaattatga tggagaaagt agtgatgagc taagaaatgg 3361 aaaaatgaag gcatttctct cattagcccg gttttcagat actcaatacc aaagaattga 3421 aaactacatg aaatcatcgg aatttgaaaa caagcaagct ctcctgaaaa gagccaaaga 3481 ggaagtaggt ctccttaggg aacataaaat tcagacaaac agatacacag taaaggttca 3541 gcgagagctg gagttggatg aattagccct gcgtgcactg aaagaggatc gtaaacgctt 3601 cttatgtaaa gcagttgaaa attatatcaa ctgcttatta agtggagaag aacatgatat 3661 gtgggtattc cgactttgtt ccctctggct tgaaaattct ggagtttctg aagtcaatgg 3721 catgatgaag agagacggaa tgaagattcc aacatataaa tttttgcctc ttatgtacca 3781 attggctgct agaatgggga ccaagatgat gggaggccta ggatttcatg aagtcctcaa 3841 taatctaatc tctagaattt caatggatca cccccatcac actttgttta ttatactggc 3901 cttagcaaat gcaaacagag atgaatttct gactaaacca gaggtagcca gaagaagcag 3961 aataactaaa aatgtgccta aacaaagctc tcagcttgat gaggatcgaa cagaggctgc 4021 aaatagaata atatgtacta tcagaagtag gagacctcag atggtcagaa gtgttgaggc 4081 actttgtgat gcttatatta tattagcaaa cttagatgcc actcagtgga agactcagag 4141 aaaaggcata aatattccag cagaccagcc aattactaaa cttaagaatt tagaagatgt 4201 tgttgtccct actatggaaa ttaaggtgga ccacacagga gaatatggaa atctggtgac 4261 tatacagtca tttaaagcag aatttcgctt agcaggaggt gtaaatttac caaaaataat 4321 agattgtgta ggttccgatg gcaaggagag gagacagctt gttaagggcc gtgatgacct 4381 gagacaagat gctgtcatgc aacaggtctt ccagatgtgt aatacattac tgcagagaaa 4441 cacggaaact aggaagagga aattaactat ctgtacttat aaggtggttc ccctctctca 4501 gcgaagtggt gttcttgaat ggtgcacagg aactgtcccc attggtgaat ttcttgttaa 4561 caatgaagat ggtgctcata aaagatacag gccaaatgat ttcagtgcct ttcagtgcca 4621 aaagaaaatg atggaggtgc aaaaaaagtc ttttgaagag aaatatgaag tcttcatgga 4681 tgtttgccaa aattttcaac cagttttccg ttacttctgc atggaaaaat tcttggatcc 4741 agctatttgg tttgagaagc gattggctta tacgcgcagt gtagctactt cttctattgt 4801 tggttacata cttggacttg gtgatagaca tgtacagaat atcttgataa atgagcagtc 4861 agcagaactt gtacatatag atctaggtgt tgcttttgaa cagggcaaaa tccttcctac 4921 tcctgagaca gttcctttta gactcaccag agatattgtg gatggcatgg gcattacggg 4981 tgttgaaggt gtcttcagaa gatgctgtga gaaaaccatg gaagtgatga gaaactctca 5041 ggaaactctg ttaaccattg tagaggtcct tctatatgat ccactctttg actggaccat 5101 gaatcctttg aaagctttgt atttacagca gaggccggaa gatgaaactg agcttcaccc 5161 tactctgaat gcagatgacc aagaatgcaa acgaaatctc agtgatattg accagagttt 5221 cgacaaagta gctgaacgtg tcttaatgag actacaagag aaactgaaag gagtggaaga 5281 aggcactgtg ctcagtgttg gtggacaggt gaatttgctc atacagcagg ccatagaccc 5341 caaaaatctc agccgacttt tcccaggatg gaaagcttgg gtgtgatctt cagtatatga 5401 attacccttt cattcagcct ttagaaatta tattttagcc tttattttta acctgccaac 5461 atactttaag tagggattaa tatttaagtg aactattgtg ggtttttttg aatgttggtt 5521 ttaatacttg atttaatcac cactcaaaaa tgttttgatg gtcttaagga acatctctgc 5581 tttcactctt tagaaataat ggtcattcgg gctgggcgca gcggctcacg cctgtaatcc 5641 cagcactttg ggaggccgag gtgagcggat cacaaggtca ggagttcgag accagcctgg 5701 ccaagagacc agcctggcca gtatggtgaa accctgtctc tactaaaaat acaaaaatta 5761 gccgagcatg gtggcgggca cctgtagtcc cagctactcg agaggctgag gcaggagaat 5821 ctcttgaacc tgggaggtga aggttgctgt gggccaaaat catgccattg cactccagcc 5881 tgggtgacaa gagcgaaact ccatctcaaa aa // LOCUS HSU26553 1530 bp mRNA PRI 14-DEC-1995 DEFINITION Human calcitonin receptor mRNA, complete cds. ACCESSION U26553 NID g1117794 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1530) AUTHORS Albrandt,K., Brady,E.M., Moore,C.X., Mull,E., Sierzega,M.E. and Beaumont,K. TITLE Molecular cloning and functional expression of a third isoform of the human calcitonin receptor and partial characterization of the calcitonin receptor gene JOURNAL Endocrinology 136 (12), 5377-5384 (1995) MEDLINE 96079881 REFERENCE 2 (bases 1 to 1530) AUTHORS Albrandt,K.A. TITLE Direct Submission JOURNAL Submitted (08-MAY-1995) Keith A. Albrandt, Pharmacology, Amylin Pharmaceuticals, Inc., 9373 Towne Centre Drive, Suite 250, San Diego, CA 92121-3027, USA FEATURES Location/Qualifiers source 1..1530 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="1154" /cell_line="MCF-7" /tissue_lib="ATCC HTB22" /tissue_type="breast carcinoma" CDS 73..1497 /note="hC1a" /codon_start=1 /evidence=experimental /product="calcitonin receptor" /db_xref="PID:g1117795" /translation="MRFTFTSRCLALFLLLNHPTPILPAFSNQTYPTIEPKPFLYVVG RKKMMDAQYKCYDRMQQLPAYQGEGPYCNRTWDGWLCWDDTPAGVLSYQFCPDYFPDF DPSEKVTKYCDEKGVWFKHPENNRTWSNYTMCNAFTPEKLKNAYVLYYLAIVGHSLSI FTLVISLGIFVFFRSLGCQRVTLHKNMFLTYILNSMIIIIHLVEVVPNGELVRRDPVS CKILHFFHQYMMACNYFWMLCEGIYLHTLIVVAVFTEKQRLRWYYLLGWGFPLVPTTI HAITRAVYFNDNCWLSVETHLLYIIHGPVMAALVVNFFFLLNIVRVLVTKMRETHEAE SHMYLKAVKATMILVPLLGIQFVVFPWRPSNKMLGKIYDYVMHSLIHFQGFFVATIYC FCNNEVQTTVKRQWAQFKIQWNQRWGRRPSNRSARAAAAAAEAGDIPIYICHQEPRNE PANNQGEESAEIIPLNIIEQESSA" BASE COUNT 388 a 372 c 351 g 419 t ORIGIN 1 ttgcttctat tgagctgtgc ccagccgccc agtgacagaa ttccaggaca aagagatctt 61 caaaaaccaa aaatgaggtt cacatttaca agccggtgct tggcactgtt tcttcttcta 121 aatcacccaa ccccaattct tcctgccttt tcaaatcaaa cctatccaac aatagagccc 181 aagccatttc tttacgtcgt aggacgaaag aagatgatgg atgcacagta caaatgctat 241 gaccgaatgc agcagttacc cgcataccaa ggagaaggtc catattgcaa tcgcacctgg 301 gatggatggc tgtgctggga tgacacaccg gctggagtat tgtcctatca gttctgccca 361 gattattttc cggattttga tccatcagaa aaggttacaa aatactgtga tgaaaaaggt 421 gtttggttta aacatcctga aaacaatcga acctggtcca actatactat gtgcaatgct 481 ttcactcctg agaaactgaa gaatgcatat gttctgtact atttggctat tgtgggtcat 541 tctttgtcaa ttttcaccct agtgatttcc ctggggattt tcgtgttttt caggagcctt 601 ggctgccaaa gggtaaccct gcacaagaac atgtttctta cttacattct gaattctatg 661 attatcatca tccacctggt tgaagtagta cccaatggag agctcgtgcg aagggacccg 721 gtgagctgca agattttgca ttttttccac cagtacatga tggcctgcaa ctatttctgg 781 atgctctgtg aagggatcta tcttcataca ctcattgtcg tggctgtgtt tactgagaag 841 caacgcttgc ggtggtatta tctcttgggc tgggggttcc cgctggtgcc aaccactatc 901 catgctatta ccagggccgt gtacttcaat gacaactgct ggctgagtgt ggaaacccat 961 ttgctttaca taatccatgg acctgtcatg gcggcacttg tggtcaattt cttctttttg 1021 ctcaacattg tccgggtgct tgtgaccaaa atgagggaaa cccatgaggc ggaatcccac 1081 atgtacctga aggctgtgaa ggccaccatg atccttgtgc ccctgctggg aatccagttt 1141 gtcgtctttc cctggagacc ttccaacaag atgcttggga agatatatga ttacgtgatg 1201 cactctctga ttcatttcca gggcttcttt gttgcgacca tctactgctt ctgcaacaat 1261 gaggtccaaa ccaccgtgaa gcgccaatgg gcccaattca aaattcagtg gaaccagcgt 1321 tgggggaggc gcccctccaa ccgctctgct cgcgctgcag ccgctgctgc ggaggctggc 1381 gacatcccaa tttacatctg ccatcaggag ccgaggaatg aaccagccaa caaccaaggc 1441 gaggagagtg ctgagatcat ccctttgaat atcatagagc aagagtcatc tgcttgaatg 1501 tgaagcaaac acagtatcgt gatcactgag // LOCUS HSU26596 700 bp mRNA PRI 25-MAR-1996 DEFINITION Human ribosomal protein L23-related mRNA, complete cds. ACCESSION U26596 NID g903322 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 700) AUTHORS Tsang,P., Gilles,F., Yuan,L., Kuo,Y., Lupu,F., Samara,G., Moosikasuwan,J., Goy,A., Zelenetz,A.D., Selleri,L. and Tycko,B. TITLE A novel L23-related gene 40 kb downstream of the imprinted H19 gene is biallelically expressed in mid-fetal and adult human tissues JOURNAL Hum. Mol. Genet. 4 (9), 1499-1507 (1995) MEDLINE 96081210 REFERENCE 2 (bases 1 to 700) AUTHORS Tsang,P. TITLE Direct Submission JOURNAL Submitted (08-MAY-1995) Patricia Tsang, Pathology, Columbia University College of Physicians and Surgeons, 630 W. 168th St., PS 14-503, New York, NY 10032, USA FEATURES Location/Qualifiers source 1..700 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11p15.5" exon 1..71 /number=1 5'UTR 1..54 CDS 55..516 /note="similar to Y. enterocolitica ribosomal protein L23, Swiss-Prot Accession Number P41278" /codon_start=1 /product="ribosomal protein L23-related product" /db_xref="PID:g903323" /translation="MARNVVYPLYRLGGPQLRVFRTNFFIQLVRPGVAQPEDTVQFRI PMEMTRVDLRNYLEGIYNVPVAAVRTRVQHGSNKRRDHRNVRIKKPDYKVAYVQLAHG QTFTFPDLFPEKDESPEGSAADDLYSMLEEERQQRQSSDPRRGGVPSWFGL" exon 72..194 /number=2 exon 195..277 /number=3 exon 278..354 /number=4 exon 355..700 /number=5 3'UTR 517..700 BASE COUNT 142 a 211 c 246 g 101 t ORIGIN 1 gggcgggcgc gctgctcctc cgcctcgcgg accccggaag cgcgcgtggc cgccatggcg 61 cggaatgtgg tgtaccccct gtaccggctg ggtggcccac aacttcgggt gttccgaacc 121 aacttcttca ttcagctggt gcggcccggt gtggcccagc ccgaggacac cgtgcagttc 181 cggatcccca tggaaatgac aagggtggac ctcaggaatt acctcgaggg catctataac 241 gtgcccgtgg ctgctgtgcg gacacgggtg cagcatggct ctaacaagag aagagatcac 301 agaaacgtga ggatcaagaa gccggactac aaggtcgcct acgtgcagct ggcccatgga 361 cagaccttca cgttcccaga tctgtttccc gagaaagacg agagccctga aggcagcgct 421 gccgacgacc tctacagcat gctcgaggag gagaggcagc agaggcagag cagcgacccg 481 cggcggggcg gcgtccccag ctggttcggg ctgtgacggg gtggccagca gggacgcgcc 541 ccaggtgggc agctgtggca gagcagcgac ccgcggcggg gcggcatccc cagctggttc 601 gggccgtgac ggggcggcca gcagggacgc gccccaggtg ggcagctgtg gcagagcagt 661 cccgacacct aaataaaagt cttgctgcag gagaaagaaa // LOCUS HSU26644 7515 bp mRNA PRI 08-NOV-1995 DEFINITION Human fatty acid synthase (fas) mRNA, complete cds. ACCESSION U26644 NID g1049052 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7515) AUTHORS Jayakumar,A., Tai,M.H., Huang,W.Y., al-Feel,W., Hsu,M., Abu-Elheiga,L., Chirala,S.S. and Wakil,S.J. TITLE Human fatty acid synthase: properties and molecular cloning JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (19), 8695-8699 (1995) MEDLINE 96004605 REFERENCE 2 (bases 1 to 7515) AUTHORS Jayakumar,A. TITLE Direct Submission JOURNAL Submitted (08-MAY-1995) Arumugam Jayakumar, Biochemistry, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..7515 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17" /map="17q25" /sex="male" /tissue_type="whole cerebral brain" gene 1..7515 /gene="fas" CDS 1..7515 /gene="fas" /EC_number="2.3.1.85" /note="encodes region of fatty acid synthase activity; FAS; multifunctional protein" /codon_start=1 /function="palmitate synthesis" /evidence=experimental /db_xref="PID:g1049053" /translation="MEEVVIAGMFGKLPESENLQEFWDNLIGGVDMVTDDDRRWKAGL YGLPRRSGKLKDLSRFDASFFGVHPKQAHTMDPQLRLLLEATYEAIVDGGINPDSLRG THTGVWVGVSGSETSEALSRDPETLVGYSMVGCQRAMMANRLSFFFDFRGPSIALDTA CSSSLMALQNAYQAIHSGQCPAAIVGGINVLLKPNTSVQFLRLGMLSPEGTCKAFDTA GNGYCRSEGVVAVLLTKKSLARKVYTTILNKGTNTDGFKEQGVTFPQDIQEQPIRSLY QSAGVAPESFEYIEAHGPGTKVGDPQERNGITRALCATRQEPLLIGSTKSNMGHPEPA SGLDALAKVLLSLEHGLWAPNLHFHSPNPEIPALLDGRLQVVDQPLPVRGGNVGINSF GFGGSNMHIILRPNTQSAPAPAPHATLPRLLRASGRTPEAVQKLLEQGLRHSQGLAFL SMLNDIAAVPATAMPFRGYAVLGGETRWPRVQQVPAGERPLWFICSGMGTQWRGMGLS LMRLDRFRDSILRSDEAVNRFGLKVSQLLLSTDESTFDDIVHSFVSLTAIQIGLIDLL SCMGPEADGIVGHSLGEWLSVRDGCLSQEEAVLAAYWRGQCIKEAPLPAGAMAAVGLS WEECKQRCPPAVVPACHNSKDTVTISGPQAPVFEFVEQLRKEGVFAKEVRTGGMAFHS YFMEAIAPPLLQELKKVIREPKPRSARWLSTSIPEAQWHSSLARTSSAEYNVNNLVSP VLFQEALWHVPEHAVVLEIAPTPCPQAVLKRVRKPSCTIIPRMKKDHRDNLEFFLAGI GRLHLSGIDANPNALFPPVESPAPRGTPLISPLIKWDHSLAWDAPAAEDFPNGSGSPS ATIYTCTPSSESPDRYLVDHTIDGRVLFPATGYLSIVWKTLARAWAGLEQLPVVFEDV VQHQATILPKTGTVSLEVRLLEATGAFEVSENGNLVVSGKVYQWDDPDPRLFDHPESP HPNSPRSPLFLAQAEVYKELRLRGYDYGPHFQGILEASLEGDSGRLLWKDNWVSFMDT MLQMSILGSAKHGLYLPTRVTAIHIDPATHRQKLYTLQDKAQVADVVVSRWPRVTVAG GVHISGLHTESAPRRHEEQQVPILEKFCFTPHTEEGCLSEHAALEEELQLCKGLVEAL ETKVTQQGLKMVVPDWTGPRSPRDPSQQELPRLLSAACRLQLNGNLQLELAQVLAQER PKLPEDPLLSGLLDSPALKACLDTAVENMPSLKMKVVEVLAGHGHLYSRIPGLLSPHP LLQLSYTATDRHPQALEAAQAELQQHDVAQGQWDPADPAPSALGSADLLVCNCAVAAL GDPASALSNMVAALREGGFLLLHTLLRGHPRDIVAFLTSTEPQYGQGILSQDAWESLF SRVSLRLVGLKKSFYGATLFLCRRPTPQDSPIFLPVDDTSFRWVESLKGILADEDSSR PVWLKAINCATSGVVGLVNCLRREPGGTVRCVLLSNLSSTSHVPEVDPGSAELQKVLQ GDLVMNVYRDGAWGVFRHFLLEDKPEEPTAHAFVSTLTRGDLSSIRWVCSSLRHAQPT CPGAQLCTVYYASLNFRDIMLATGKLSPDAIPGKWTSQDSLLGMEFSGRDASGKRVMG LVPAKGLATSVLLSPDFLWDVPSNWTLEEAASVPVVYSTAYYALVVRGRVRPGETLLI HSGSGGVGQAAIAIALSLGCRVFTTVGSAEKRAYLQARFPQLDSTSFANSRDTSFEQH VLWHTGGKGVDLVLNSLAEEKLQASVRCFGTHGRFLEIGKFDLSQNHPLGMAIFLKNV TFHGVLLDAFFNESSADWREVWALVEAAIRDGVVRPLKCTVFHGAQVEDAFRYMAQGK HIGKVVVQVLAEEPAVLKGAKPKLMSAISKTFCPAHKSYIIAGGLGGFGLELAQWLIQ RGVQKLVLTSRSGIRTGYQAKQVRRWRRQGLQVQVSTSNISSLEGARGLIAEAAQLGP VGGVFNLAVVLRDGLLENQTPEFFQDVCKPKYSGTLNLDRVTREACPELDYFVVFSSV SCGRGNAGQSNYGFANSAMERICEKRRHEGLPGLAVQWGAIGTVGILVETMSTNDTIV SGTLPTRIGVLGLEVLDLFLNQPHMVLSSFVLAEKAAAYRDRDSQRDLVEAVAHILGI RDLAAVNLGGSLADLGLDSLMSAPVRQTLERELNLVLSVREVRQLTLRKLQELSSKAD EASELACPTPKEDGLAQQQTQLNLRSLLVKPEGPTLMRLNSVQSSERPLFLVHPIEAT TVFHSLGPGLSIPTYGLQCTPAAPLDSIHSLAAYYIDCIRQVQPEGPYRVAGYSYGAC VAFEMCSQLQAQQSPAPTHNSLFLFDGSPTYVLAYTQSYRAKLTPGCKAEAETEAICF FVQQFTDMEHNRVLEALLPLKGLEERVAAAVDLIIKSHQGLDRQELSFAARSFYYRLR AADQYTPKAKYSGNVMLLRAKTGGRYGEDLGADYNLSQVCDGKVSVHIIEGDHRTLLE GSGLESIISIIHSSLAEPRVSREG" misc_feature 1..1218 /gene="fas" /note="encodes region of beta-ketoacyl-synthase activity" /function="condensing reaction" /evidence=not_experimental misc_feature 1284..2445 /gene="fas" /note="encodes region of acetyl/malonyl transacylase activity" /evidence=not_experimental misc_feature 2490..2910 /gene="fas" /note="encodes region of dehydratase activity" /evidence=not_experimental misc_feature 4890..5550 /gene="fas" /note="encodes region of enoyl reductase activity" /evidence=not_experimental misc_feature 5610..6300 /gene="fas" /note="encodes region of beta-ketoacyl-reductase activity" /evidence=not_experimental misc_feature 6342..6570 /gene="fas" /note="encodes region of acyl carrier protein activity" /evidence=not_experimental misc_feature 6600..7512 /gene="fas" /note="encodes region of thioesterase activity" /evidence=not_experimental BASE COUNT 1328 a 2491 c 2385 g 1311 t ORIGIN 1 atggaggagg tggtgattgc cggcatgttc gggaagctgc cagagtcgga gaacttgcag 61 gagttctggg acaacctcat cggcggtgtg gacatggtca cggacgatga ccgtcgctgg 121 aaggctgggc tctacggcct gccccggcgg tccggcaagc tgaaggacct gtctaggttt 181 gatgcctcct tcttcggagt ccaccccaag caggcacaca cgatggaccc tcagctgcgg 241 ctgctgctgg aagctaccta tgaagccatc gtggacggag gcatcaaccc agattcactc 301 cgaggaacac acactggcgt ctgggtgggc gtgagcggct ctgagacctc ggaggccctg 361 agccgagacc ccgagacact cgtgggctac agcatggtgg gctgccagcg agcgatgatg 421 gccaaccggc tctccttctt cttcgacttc agagggccca gcatcgcact ggacacagcc 481 tgctcctcca gcctgatggc cctgcagaac gcctaccagg ccatccacag cgggcagtgc 541 cctgccgcca tcgtgggggg catcaacgtc ctgctgaagc ccaacacctc cgtgcagttc 601 ttgaggctgg ggatgctcag ccccgagggc acctgcaagg ccttcgacac agcggggaat 661 gggtactgcc gctcggaggg tgtggtggct gtcctgctga ccaagaagtc cctggcccgg 721 aaggtctaca ccaccatcct gaacaaaggc accaatacag atggcttcaa ggagcaaggc 781 gtgaccttcc ctcaggatat ccaggagcag cctatccgct cgttgtacca gtcggccgga 841 gtggcccctg agtcatttga atacatcgaa gcccacggac caggcaccaa ggtgggcgac 901 ccccaggagc gtaatggcat cacccgagcc ctgtgcgcca cccgccagga gccgctgctc 961 atcggctcca ccaagtccaa catggggcac ccggagccag cctcggggct cgacgccctg 1021 gccaaggtgc tgctgtccct ggagcacggg ctctgggccc ccaacctgca cttccatagc 1081 cccaaccctg agatcccagc gctgttggat gggcggctgc aggtggtgga ccagcccctg 1141 cccgtccgtg gcggcaacgt gggcatcaac tcctttggct tcgggggctc caacatgcac 1201 atcatcctga ggcccaacac gcagtccgcc cccgcacccg ccccacatgc caccctgccc 1261 cgtctgctgc gggccagcgg acgcacccct gaggccgtgc agaagctgct ggagcagggc 1321 ctccggcaca gccagggcct ggctttcctg agcatgctga acgacatcgc ggctgtcccc 1381 gccaccgcca tgcccttccg tggctacgct gtgctgggtg gtgagacgcg gtggcccaga 1441 gtgcagcagg tgcccgctgg cgagcgcccg ctctggttca tctgctctgg gatgggcaca 1501 cagtggcgtg gaatggggct gagccttatg cgcctggacc gcttccgaga ttccatccta 1561 cgctccgatg aggctgtgaa ccgattcggc ctgaaggtgt cacagctgct gctgagcaca 1621 gacgagagca cctttgatga catcgtccat tcgtttgtga gcctgactgc catccagata 1681 ggcctcatag acctgctgag ctgcatggga cctgaggcag atggcatcgt cggccactcc 1741 ctgggggagt ggctgtcggt acgcgacggc tgcctgtccc aggaggaggc cgtcctcgct 1801 gcctactgga ggggacagtg catcaaagaa gccccacttc ccgccggcgc catggcagcc 1861 gtgggcttgt cctgggagga gtgtaaacag cgctgccccc ctgcggtggt gcccgcctgc 1921 cacaactcca aggacacagt caccatctcg ggacctcagg ccccggtgtt tgagttcgtg 1981 gagcagctga ggaaggaggg tgtgtttgcc aaggaggtgc ggaccggcgg tatggccttc 2041 cactcctact tcatggaggc catcgcaccc ccactgctgc aggagctcaa gaaggtgatc 2101 cgggagccga agccacgttc agcccgctgg ctcagcacct ctatccccga ggcccagtgg 2161 cacagcagcc tggcacgcac gtcttccgcc gagtacaatg tcaacaacct ggtgagccct 2221 gtgctgttcc aggaggccct gtggcacgtg cctgagcacg cggtggtgct ggagatcgcc 2281 ccgaccccgt gccctcaggc tgtcctgaag cgggtccgta agccgagctg caccatcatc 2341 ccccgtatga agaaggatca cagggacaac ctggagttct tcctggccgg catcggcagg 2401 ctgcacctct caggcatcga cgccaacccc aatgccttgt tcccacctgt ggagtcccca 2461 gctccccgag gaactcccct catctcccca ctcatcaagt gggaccacag cctggcctgg 2521 gacgcgccgg ccgccgagga cttccccaac ggttcaggtt ccccctcagc caccatctac 2581 acatgcacac caagctccga gtctcctgac cgctacctgg tggaccacac catcgacggt 2641 cgcgtcctct tccccgccac tggctacctg agcatagtgt ggaagacgct ggcccgcgcc 2701 tgggctgggc tcgagcagct gcctgtggtg tttgaggatg tggtgcagca ccaggccacc 2761 atcctgccca agactgggac agtgtccttg gaggtacggc tcctggaggc caccggtgcc 2821 ttcgaggtgt cagagaacgg caacctggta gtgagtggga aggtgtacca gtgggatgac 2881 cctgacccca ggctcttcga ccacccggaa agtccccacc ccaattcccc acggagtccc 2941 ctcttcctgg cccaggcaga agtttacaag gagctgcgtc tgcgtggcta cgactacggc 3001 cctcatttcc agggcatcct ggaggccagc ctggaaggtg actcggggag gctgctgtgg 3061 aaggataact gggtgagctt catggacacc atgctgcaga tgtccatcct gggctcggcc 3121 aagcacggcc tgtacctacc cacccgtgtc accgccatcc acatcgaccc tgccacccac 3181 aggcagaagc tgtacacact gcaggacaag gcccaagtgg ctgacgtggt ggtgagcagg 3241 tggccgaggg tcacagtggc gggaggcgtc cacatctccg ggctccacac tgagtcggcc 3301 ccgcggcggc acgaggagca gcaggtgccc atcctggaga agttttgctt cactccccac 3361 acggaggagg ggtgcctgtc tgagcacgct gccctcgagg aggagctgca actgtgcaag 3421 gggctggtcg aggcactcga gaccaaggtg acccagcagg ggctgaagat ggtggtgccg 3481 gactggacgg ggcccagatc cccccgggac ccctcacagc aggaactgcc ccggctgttg 3541 tcggctgcct gcaggcttca gctcaacggg aacctgcagc tggagctggc gcaggtgctg 3601 gcccaggaga ggcccaagct gccagaggac cctctgctca gcggcctcct ggactccccg 3661 gcactcaagg cctgcctgga cactgccgtg gagaacatgc ccagcctgaa gatgaaggtg 3721 gtggaggtgc tggccggcca cggtcacctg tattcccgca tcccaggcct gctcagcccc 3781 catcccctgc tgcagctgag ctacacggcc accgaccgcc acccccaggc cctggaggct 3841 gcccaggccg agctgcagca gcacgacgtt gcccagggcc agtgggatcc cgcagaccct 3901 gcccccagcg ccctgggcag cgcggacctc ctggtgtgca actgtgctgt ggctgccctc 3961 ggggacccgg cctcagctct cagcaacatg gtggctgccc tgagagaagg gggctttctg 4021 ctcctgcaca cactgctccg ggggcaccct cgggacatcg tggccttcct cacctccact 4081 gagccgcagt atggccaggg catcctgagc caggacgcgt gggagagcct cttctccagg 4141 gtgtcgctgc gcctggtggg cctgaagaag tccttctacg gcgccacgct cttcctgtgc 4201 cgccggccca ccccgcagga cagccccatc ttcctgccgg tggacgatac cagcttccgc 4261 tgggtggagt ctctgaaggg catcctggct gacgaagact cttcccggcc tgtgtggctg 4321 aaggccatca actgtgccac ctcgggcgtg gtgggcttgg tgaactgtct ccgccgagag 4381 cccggcggaa ccgtccggtg tgtgctgctc tccaacctca gcagcacctc ccacgtcccg 4441 gaggtggacc cgggctccgc agaactgcag aaggtgttgc agggagacct ggtgatgaac 4501 gtctaccgcg acggggcctg gggggttttc cgccacttcc tgctggagga caagcctgag 4561 gagccgacgg cacatgcctt tgtgagcacc ctcacccggg gggacctgtc ctccatccgc 4621 tgggtctgct cctcgctgcg ccatgcccag cccacctgcc ctggcgccca gctctgcacg 4681 gtctactacg cctccctcaa cttccgcgac atcatgctgg ccactggcaa gctgtcccct 4741 gatgccatcc cagggaagtg gacctcccag gacagcctgc taggtatgga gttctcgggc 4801 cgagacgcca gcggcaagcg tgtgatggga ctggtgcctg ccaagggcct ggccacctct 4861 gtcctgctgt caccggactt cctctgggat gtgccttcca actggacgct ggaggaggcg 4921 gcctcggtgc ctgtcgtcta cagcacggcc tactacgcgc tggtggtgcg tgggcgggtg 4981 cgccccgggg agacgctgct catccactcg ggctcgggcg gcgtgggcca ggccgccatc 5041 gccatcgccc tcagtctggg ctgccgcgtc ttcaccaccg tggggtcggc tgagaagcgg 5101 gcgtacctcc aggccaggtt cccccagctc gacagcacca gcttcgccaa ctcccgggac 5161 acatccttcg agcagcatgt gctgtggcac acgggcggga agggcgttga cctggtcttg 5221 aactccttgg cggaagagaa gctgcaggcc agcgtgaggt gcttcggtac gcacggtcgc 5281 ttcctggaaa ttggcaaatt cgacctttct cagaaccacc cgctcggcat ggctatcttc 5341 ctgaagaacg tgacattcca cggggtccta ctggatgcgt tcttcaacga gagcagtgct 5401 gactggcggg aggtgtgggc gcttgtcgag gccgccatcc gggatggggt ggtacggccc 5461 ctcaagtgca cggtgttcca tggggcccag gtggaggacg ccttccgcta catggcccaa 5521 gggaagcaca ttggcaaagt cgtcgtgcag gtgcttgcgg aggagccggc agtgctgaag 5581 ggggccaaac ccaagctgat gtcggccatc tccaagacct tctgcccggc ccacaagagc 5641 tacatcatcg ctggtggtct gggtggcttc ggcctggagt tggcgcagtg gctgatacag 5701 cgtggggtgc agaagctcgt gttgacttct cgctccggga tccggacagg ctaccaggcc 5761 aagcaggtcc gccggtggag gcgccagggg ctacaggtgc aggtgtccac cagcaacatc 5821 agctcactgg agggggcccg gggcctcatt gccgaggcgg cgcagcttgg gcccgtgggg 5881 ggcgtcttca acctggccgt ggtcttgaga gatggcttgc tggagaacca gaccccagag 5941 ttcttccagg acgtctgcaa gcccaagtac agcggcaccc tgaacctgga cagggtgacc 6001 cgagaggcgt gccctgagct ggactacttt gtggtcttct cctctgtgag ctgcgggcgt 6061 ggcaatgcgg gacagagcaa ctacggcttt gccaattccg ccatggagcg tatctgtgag 6121 aaacgccggc acgaaggcct cccaggcctg gccgtgcagt ggggcgccat cggcaccgtg 6181 ggcattttgg tggagacgat gagcaccaac gacacgatcg tcagtggcac gctgcccacg 6241 cgcattggcg tccttggcct ggaggtgctg gacctcttcc tgaaccagcc ccacatggtc 6301 ctgagcagct ttgtgctggc tgagaaggct gcggcctata gggacaggga cagccagcgg 6361 gacctggtgg aggccgtggc acacatcctg ggcatccgcg acttggctgc tgtcaacctg 6421 ggcggctcac tggcggacct gggcctggac tcgctcatga gcgcgccggt gcgccagacg 6481 ctggagcgtg agctcaacct ggtgctgtcc gtgcgcgagg tgcggcaact cacgctccgg 6541 aaactgcagg agctgtcctc aaaggcggat gaagccagcg agctggcatg ccccacgccc 6601 aaggaggatg gtctggccca gcagcagact cagctgaacc tgcgctccct gctggtgaaa 6661 ccggagggcc ccaccctgat gcggctcaac tccgtgcaga gctcggagcg gcccctgttc 6721 ctggtgcacc caatcgaggc taccaccgtg ttccacagcc tcggtcccgg tctcagcatc 6781 cccacctatg gcctgcagtg caccccggct gcgccccttg acagcatcca cagcctggct 6841 gcctactaca tcgactgcat caggcaggtg cagcccgagg gcccctaccg cgtggccggc 6901 tactcctacg gggcctgcgt ggcctttgaa atgtgctccc agctgcaggc ccagcagagc 6961 ccagccccca cccacaacag cctcttcctg ttcgacggct cgcccaccta cgtactggcc 7021 tacacccaga gctaccgggc aaagctgacc ccaggctgta aggctgaggc tgagacggag 7081 gccatatgct tcttcgtgca gcagttcacg gacatggagc acaacagggt gctggaggcg 7141 ctgctgccgc tgaagggcct agaggagcgt gtggcagccg ccgtggacct gatcatcaag 7201 agccaccagg gcctggaccg ccaggagctg agctttgcgg cccggtcctt ctactacagg 7261 ctgcgtgccg ctgaccagta tacacccaag gccaagtaca gtggcaacgt gatgctactg 7321 cgggccaaga cgggtggccg ctacggcgag gacctgggcg cggactacaa cctctcccag 7381 gtatgcgacg ggaaagtatc cgtccatatc atcgagggtg accaccgcac gctgctggag 7441 ggcagcggcc tggagtccat catcagcatc atccacagct ccctggctga gccacgtgtg 7501 agtcgggagg gctag // LOCUS HSU26648 1507 bp mRNA PRI 02-JUL-1995 DEFINITION Human syntaxin 5 mRNA, complete cds. ACCESSION U26648 NID g886070 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1507) AUTHORS Ravichandran,V. and Roche,P.A. TITLE Isolation and sequence analysis of the cDNA encoding human syntaxin 5 JOURNAL Unpublished REFERENCE 2 (bases 1 to 1507) AUTHORS Ravichandran,V. and Roche,P.A. TITLE Direct Submission JOURNAL Submitted (09-MAY-1995) Paul A. Roche, NCI, NIH, 9000 Rockville Pike, Bethesda, MD 21045, USA FEATURES Location/Qualifiers source 1..1507 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="EBV-transformed peripheral blood lymphocyte; B cell population" /clone_lib="lymphocyte Matchmaker library (Clontech)" CDS 27..932 /codon_start=1 /product="syntaxin 5" /db_xref="PID:g886071" /translation="MSCRDRTQEFLSACKSLQTRQNGIQTNKPALRAVRQRSEFTLMA KRIGKDLSNTFAKLEKLTILAKRKSLFDDKAVEIEELTYIIKQDINSLNKQIAQLQDF VRAKGSQSGRHLQTHSNTIVVSLQSKLASMSNDFKSVLEVRTENLKQQRSRREQFSRA PVSALPLAPNHLGGGAVVLGAESHASKDVAIDMMDSRTSQQLQLIDEQDSYIQSRADT MQNIESTIVELGSIFQQLAHMVKEQEETIQRIDENVLGAQLDVEAAHSEILKYFQSVT SNRWLMVKIFLILIVFFIIFVVFLA" polyA_site 1507 /note="32 A nucleotides" /evidence=not_experimental BASE COUNT 356 a 437 c 383 g 331 t ORIGIN 1 ctcgaggcca cgaaggcccc gacaccatgt cctgccggga tcggacccag gagtttctgt 61 ctgcctgcaa gtcgctgcag acccgtcaga atggaatcca gacaaataag ccagctttgc 121 gtgctgtccg acaacgcagt gaattcaccc tcatggccaa gcgcattggg aaagacctta 181 gcaacacatt tgccaagctg gagaagctga caatcttggc aaagcgcaag tccctctttg 241 atgataaagc agtggaaatt gaagagctaa catatatcat caaacaggac atcaatagcc 301 tcaacaaaca aattgctcag ctccaggatt tcgtgagagc caagggcagc cagagtggcc 361 ggcacctgca gacccactcc aacaccattg tggtctcctt gcagtcgaaa ctggcttcta 421 tgtccaatga cttcaaatcg gttttagaag tgaggacaga gaacctgaag cagcagagga 481 gccggagaga gcagttctcc cgggcacctg tgtcagccct gccccttgcc cctaaccacc 541 tgggcggtgg tgctgtggtt ctgggggcag agtcccatgc ctccaaggat gtcgccatcg 601 acatgatgga ctctcggacc agccagcagc tgcagctcat tgacgagcag gattcctaca 661 tccagagtcg ggcagacacc atgcagaaca ttgagtcgac aattgttgag ttgggctcca 721 tctttcagca gttggcacac atggttaagg aacaggagga aaccattcag aggatcgacg 781 agaacgtgct aggagcccag ctggacgttg aggccgccca ttcagagatc ctcaagtact 841 tccagtctgt cacctccaac cggtggctca tggtcaaaat cttcctcatc ctcattgtct 901 tcttcatcat ctttgtggtc ttccttgctt gaaccctctc tactctgagg cactctgttg 961 gggtttggga ccctcctggg aaggcaagtg gccagtgctg ccactgagcc tgtgcagggt 1021 acttgggaga aaggccctgt ttccctggaa ctgctaagaa tgaccactgc ccctgatccc 1081 ccaccccttg cctctggcca ccctgtcctc cccccaccac cctcaggcct atgaaacaca 1141 cagggttcta gatttgaact ctgctgtgaa gtgactggaa gggagcagag gccagctggg 1201 ggccagtggg ggaggttgtt tccactagga gatttttata aaccctctcc agcctctccc 1261 ggaaaggaag cgttggcagc aaagggagat gatgccctta cccaccttcc tgtgagtgaa 1321 gagaggaagc agccccaggg accaattttc ccaattgacc tctttcttcc tctttcacca 1381 tgtgaggcag ggagccctga gcccttcagc tgcctgcaca acccctgaca ttggctgctg 1441 gtgactcaat ctgccaaatg tgctgcagct cgttttctcc caattacagc aagactgtca 1501 gcctcca // LOCUS HSU26710 3982 bp mRNA PRI 30-SEP-1996 DEFINITION Human cbl-b mRNA, complete cds. ACCESSION U26710 NID g862406 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3982) AUTHORS Keane,M.M., Rivero-Lezcano,O.M., Mitchell,J.A., Robbins,K.C. and Lipkowitz,S. TITLE Cloning and characterization of cbl-b: a SH3 binding protein with homology to the c-cbl proto-oncogene JOURNAL Oncogene 10 (12), 2367-2377 (1995) MEDLINE 95303504 REFERENCE 2 (bases 1 to 3982) AUTHORS Lipkowitz,S., Keane,M.M. and Mitchell,J.A. TITLE Direct Submission JOURNAL Submitted (10-MAY-1995) Stan Lipkowitz, Navy Medical Oncology Branch, National Cancer Institute, Blg 8, Rm 5101, Bethesda Naval Hospital, Bethesda, MD 20889, USA FEATURES Location/Qualifiers source 1..3982 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="breast cancer cell line ZR75-1" /map="3q" /chromosome="3" CDS 323..3271 /note="interacts with SH3 proteins; similar to c-cbl proto-oncogene product, Swiss-Prot Accession Number P22681" /codon_start=1 /product="cbl-b" /db_xref="PID:g862407" /translation="MANSMNGRNPGGRGGNPRKGRILGIIDAIQDAVGPPKQAAADRR TVEKTWKLMDKVVRLCQNPKLQLKNSPPYILDILPDTYQHLRLILSKYDDNQKLAQLS ENEYFKIYIDSLMKKSKRAIRLFKEGKERMYEEQSQDRRNLTKLSLIFSHMLAEIKAI FPNGQFQGDNFRITKADAAEFWRKFFGDKTIVPWKVFRQCLHEVHQISSSLEAMALKS TIDLTCNDYISVFEFDIFTRLFQPWGSILRNWNFLAVTHPGYMAFLTYDEVKARLQKY STKPGSYIFRLSCTRLGQWAIGYVTGDGNILQTIPHNKPLFQALIDGSREGFYLYPDG RSYNPDLTGLCEPTPHDHIKVTQEQYELYCEMGSTFQLCKICAENDKDVKIEPCGHLM CTSCLTAWQESDGQGCPFCRCEIKGTEPIIVDPFDPRDEGSRCCSIIDPFGMPMLDLD DDDDREESLMMNRLANVRKCTDRQNSPVTSPGSSPLAQRRKPQPDPLQIPHLSLPPVP PRLDLIQKGIVRSPCGSPTGSPKSSPCMVRKQDKPLPAPPPPLRDPPPPPPERPPPIP PDNRLSRHIHHVESVPSRDPPMPLEAWCPRDVFGTNQLVGCRLLGEGSPKPGITASSN VNGRHSRVGSDPVLMRKHRRHDLPLEGAKVFSNGHLGSEEYDVPPRLSPPPPVTTLLP SIKCTGPLANSLSEKTRDPVEEDDDEYKIPSSHPVSLNSQPSHCHNVKPPVRSCDNGH CMLNGTHGPSSEKKSNIPDLSIYLKGDVFDSASDPVPLPPARPPTRDNPKHGSSLNRT PSDYDLLIPPLGEDAFDALPPSLPPPPPPARHSLIEHSKPPGSSSRPSSGQDLFLLPS DPFVDLASGQVPLPPARRLPGENVKTNRTSQDYDQLPSCSDGSQAPARPPKPRPRRTA PEIHHRKPHGPEAALENVDAKIAKLMGEGYAFEEVKRALEIAQNNVEVARSILREFAF PPPVSPRLNL" misc_feature 668..679 /note="encodes nuclear localization signal" misc_feature 1439..1558 /note="encodes ring finger" misc_feature 3116..3223 /note="encodes leucine zipper" BASE COUNT 1093 a 969 c 877 g 1043 t ORIGIN 1 ctgggtcctg tgtgtgccac aggggtgggg tgtccagcga gcggtctcct cctcctgcta 61 gtgctgctgc ggcgtcccgc ggcctccccg agtcgggcgg gaggggagag cgggtgtgga 121 tttgtcttga cggtaattgt tgcgtttcca cgtctcggag gcctgcgcgc tgggttgctc 181 cttcttcggg agcgagctgt tctcagcgat cccactccca gccggggctc cccacacaca 241 ctgggctgcg tgcgtgtgga gtgggacccg cgcacacgcg tgtctctgga cagctacggc 301 gccgaaagaa ctaaaattcc agatggcaaa ctcaatgaat ggcagaaacc ctggtggtcg 361 aggaggaaat ccccgaaaag gtcgaatttt gggtattatt gatgctattc aggatgcagt 421 tggaccccct aagcaagctg ccgcagatcg caggaccgtg gagaagactt ggaagctcat 481 ggacaaagtg gtaagactgt gccaaaatcc caaacttcag ttgaaaaata gcccaccata 541 tatacttgat attttgcctg atacatatca gcatttacga cttatattga gtaaatatga 601 tgacaaccag aaacttgccc aactcagtga gaatgagtac tttaaaatct acattgatag 661 ccttatgaaa aagtcaaaac gggcaataag actctttaaa gaaggcaagg agagaatgta 721 tgaagaacag tcacaggaca gacgaaatct cacaaaactg tcccttatct tcagtcacat 781 gctggcagaa atcaaagcaa tctttcccaa tggtcaattc cagggagata actttcgtat 841 cacaaaagca gatgctgctg aattctggag aaagtttttt ggagacaaaa ctatcgtacc 901 atggaaagta ttcagacagt gccttcatga ggtccaccag attagctcta gcctggaagc 961 aatggctcta aaatcaacaa ttgatttaac ttgcaatgat tacatttcag tttttgaatt 1021 tgatattttt accaggctgt ttcagccttg gggctctatt ttgcggaatt ggaatttctt 1081 agctgtgaca catccaggtt acatggcatt tctcacatat gatgaagtta aagcacgact 1141 acagaaatat agcaccaaac ccggaagcta tattttccgg ttaagttgca ctcgattggg 1201 acagtgggcc attggctatg tgactgggga tgggaatatc ttacagacca tacctcataa 1261 caagccctta tttcaagccc tgattgatgg cagcagggaa ggattttatc tttatcctga 1321 tgggaggagt tataatcctg atttaactgg attatgtgaa cctacacctc atgaccatat 1381 aaaagttaca caggaacaat atgaattata ttgtgaaatg ggctccactt ttcagctctg 1441 taagatttgt gcagagaatg acaaagatgt caagattgag ccttgtgggc atttgatgtg 1501 cacctcttgc cttacggcat ggcaggagtc ggatggtcag ggctgccctt tctgtcgttg 1561 tgaaataaaa ggaactgagc ccataatcgt ggaccccttt gatccaagag atgaaggctc 1621 caggtgttgc agcatcattg acccctttgg catgccgatg ctagacttgg acgacgatga 1681 tgatcgtgag gagtccttga tgatgaatcg gttggcaaac gtccgaaagt gcactgacag 1741 gcagaactca ccagtcacat caccaggatc ctctcccctt gcccagagaa gaaagccaca 1801 gcctgaccca ctccagatcc cacatctaag cctgccaccc gtgcctcctc gcctggatct 1861 aattcagaaa ggcatagtta gatctccctg tggcagccca acaggttcac caaagtcttc 1921 tccttgcatg gtgagaaaac aagataaacc actcccagca ccacctcctc ccttaagaga 1981 tcctcctcca ccgccacctg aaagacctcc accaatccca ccagacaata gactgagtag 2041 acacatccat catgtggaaa gcgtgccttc cagagacccg ccaatgcctc ttgaagcatg 2101 gtgccctcgg gatgtgtttg ggactaatca gcttgtggga tgtcgactcc taggggaggg 2161 ctctccaaaa cctggaatca cagcgagttc aaatgtcaat ggaaggcaca gtagagtggg 2221 ctctgaccca gtgcttatgc ggaaacacag acgccatgat ttgcctttag aaggagctaa 2281 ggtcttttcc aatggtcacc ttggaagtga agaatatgat gttcctcccc ggctttctcc 2341 tcctcctcca gttaccaccc tcctccctag cataaagtgt actggtccgt tagcaaattc 2401 tctttcagag aaaacaagag acccagtaga ggaagatgat gatgaataca agattccttc 2461 atcccaccct gtttccctga attcacaacc atctcattgt cataatgtaa aacctcctgt 2521 tcggtcctgt gataatggtc actgtatgct gaatggaaca catggtccat cttcagagaa 2581 gaaatcaaac atccctgact taagcatata tttaaaggga gatgtttttg attcagcctc 2641 tgatcccgtg ccattaccac ctgccaggcc tccaactcgg gacaatccaa agcatggttc 2701 ttcactcaac aggacgccct ctgattatga tcttctcatc cctccattag gtgaagatgc 2761 ttttgatgcc ctccctccat ctctcccacc tcccccacct cctgcaaggc atagtctcat 2821 tgaacattca aaacctcctg gctccagtag ccggccatcc tcaggacagg atctttttct 2881 tcttccttca gatccctttg ttgatctagc aagtggccaa gttcctttgc ctcctgctag 2941 aaggttacca ggtgaaaatg tcaaaactaa cagaacatca caggactatg atcagcttcc 3001 ttcatgttca gatggttcac aggcaccagc cagaccccct aaaccacgac cgcgcaggac 3061 tgcaccagaa attcaccaca gaaaacccca tgggcctgag gcggcattgg aaaatgtcga 3121 tgcaaaaatt gcaaaactca tgggagaggg ttatgccttt gaagaggtga agagagcctt 3181 agagatagcc cagaataatg tcgaagttgc ccggagcatc ctccgagaat ttgccttccc 3241 tcctccagta tccccacgtc taaatctata gcagccagaa ctgtagacac caaaatggaa 3301 agcaatcgat gtattccaag agtgtggaaa taaagagaac tgagatggaa ttcaagagag 3361 aagtgtctcc tcctcgtgta gcagcttgag aagaggcttg ggagtgcagc ttctcaaagg 3421 agaccgatgc ttgctcagga tgtcgacagc tgtggcttcc ttgtttttgc tagccatatt 3481 tttaaatcag ggttgaactg acaaaaataa tttaaagacg tttacttccc ttgaactttg 3541 aacctgtgaa atgctttacc ttgtttacaa tttggcaaag ttgcagtttg ttcttgtttt 3601 tagtttagtt ttgttttggt gttttgatac ctgtactgtg ttcttcacag accctttgta 3661 gcgtggtcag gtctgctgta acatttccca ccaactctct tgctgtccac atcaacagct 3721 aaatcattta ttcatatgga tctctaccat ccccatgcct tgcccaggtc cagttccatt 3781 tctctcattc acaagatgct ttgaaggttc tgattttcaa ctgatcaaac taatgcaaaa 3841 aaaaaaagta tgtattcttc actactgagt ttcttctttg gaaaccatca ctattgagag 3901 atgggaaaaa cctgaatgta taaagcattt atttgtcaat aaactgcctt ttgtaagggg 3961 ttttcacata aaaaaaaaaa aa // LOCUS HSU26742 1707 bp mRNA PRI 06-APR-1996 DEFINITION Human dystrobrevin-delta mRNA, complete cds. ACCESSION U26742 NID g1255988 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1707) AUTHORS Sadoulet-Puccio,H.M., Khurana,T.S., Cohen,J.B. and Kunkel,L.M. TITLE Cloning and characterization of the human homologue of a dystrophin related phosphoprotein found at the Torpedo electric organ post-synaptic membrane JOURNAL Hum. Mol. Genet. 5 (4), 489-496 (1996) MEDLINE 96254978 REFERENCE 2 (bases 1 to 1707) AUTHORS Sadoulet-Puccio,H.M., Khurana,T.S. and Kunkel,L.M. TITLE Direct Submission JOURNAL Submitted (10-MAY-1995) Helene M. Sadoulet-Puccio, Genetics, Howard Hughes Medical Institute, Children's Hospital, 300 Longwood Ave., Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..1707 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="891A" /clone_lib="human adult brain cDNA library cloned in lambda gt10" /tissue_type="brain" /map="18q12.1-21.2" /chromosome="18" CDS 325..1449 /note="similar to the 87 kDA Torpedo acetylcholine receptor-associated protein; similar to rat apodystrophin-3, PIR Accession Number S32011" /codon_start=1 /product="dystrobrevin-delta" /db_xref="PID:g1255989" /translation="MIEDSGKRGNTMAERRQLFAEMRAQDLDRIRLSTYRTACKLRFV QKKCNLHLVDIWNVIEALRENALNNLDPNTELNVSRLEAVLSTIFYQLNKRMPTTHQI HVEQSISLLLNFLLAAFDPEGHGKISVFAVKMALATLCGGKIMDKLRYIFSMISDSSG VMVYGRYDQFLREVLKLPTAVFEGPSFGYTEQSARSCFSQQKKVTLNGFLDTLMSDPP PQCLVWLPLLHRLANVENVFHPVECSYCHSESMMGFRYRCQQCHNYQLCQDCFWRGHA GGSHSNQHQMKEYTSWKSPAKKLTNALSKSLSCASSREPLHPMFPDQPEKPLNLAHIV DTWPPRPVTSMNDTLFSHSVPSSGSPFITRSSDGAFGGCV" BASE COUNT 451 a 421 c 367 g 468 t ORIGIN 1 caggaaaccc tggtactggc agcagccagc ctctgctgtg cccacatgac ccacaactct 61 ggcagcggac ccggcacttc caacattatt aaataataag aaagcggctc ctactccagg 121 ctcaaacctc cctgcagacc aatggacacc ttctaagagt ttggcgagtc agtgactgaa 181 gcgcccgtcc attccaagat aaataggatt taccaatcct tggatgaagt gcttgggaag 241 tctttaagtg ccataatcaa ctgccatttc aaagaatata gatggttttg aaaagttcat 301 gctgtccctt cattgaattt tagaatgatt gaagatagtg ggaaaagagg aaataccatg 361 gcagaaagaa gacagctgtt tgcagagatg agggctcaag atctggatcg catccgactc 421 tccacctaca gaacagcatg caagcttagg tttgttcaga agaaatgcaa tttgcacctg 481 gtggacatat ggaatgtcat agaagcattg cgggaaaatg ctctgaacaa cctggaccca 541 aacactgaac tcaacgtgtc ccgcttagag gctgtgctct ccactatttt ttaccagctc 601 aacaaacgga tgccaaccac tcaccaaatc catgtggagc agtccatcag cctcctcctt 661 aacttcctgc ttgcagcgtt tgatccggaa ggccatggta aaatttcagt atttgctgtc 721 aaaatggctt tagccacatt gtgtggaggg aagatcatgg acaaattaag atatattttc 781 tcaatgattt ctgactccag tggggtgatg gtttatggac gatatgacca attccttcgg 841 gaagttctca aactacccac ggcagttttt gaaggtcctt catttggtta cacagaacag 901 tcagccagat cctgtttctc ccaacagaaa aaagtcacgt taaatggttt cttggacacg 961 cttatgtcag atcctccccc gcagtgtctg gtctggttgc ctcttctgca tcgactagca 1021 aatgtggaaa atgtcttcca tccggttgag tgttcctact gccacagtga gagtatgatg 1081 ggatttcgct accgatgcca acagtgtcac aattaccagc tctgtcagga ctgcttctgg 1141 aggggacatg ccggtggttc tcatagcaac cagcaccaaa tgaaagagta cacgtcatgg 1201 aaatcacctg ctaagaagct gactaatgca ttaagcaagt ccctgagctg tgcttccagc 1261 cgtgaacctt tgcaccccat gttcccagat cagcctgaga agccactcaa cttggctcac 1321 atcgttgata cttggcctcc cagacctgta accagcatga acgacaccct gttctcccac 1381 tctgttccct cctcaggaag tccttttatt accaggagct cggacggtgc ttttggtgga 1441 tgcgtctaga tggataacat gacttcttct accctaaaat attcctataa tactttgagc 1501 tgttctggtt cctccagggt gcatggtacc cattaaccca aaatatgatt atttcccttt 1561 tttcccattt tcagtcattt tggaatgttc tctgtgaacc acagttgggt tgtttaaagc 1621 tcacatttct ttctgtcacc acagagattg gcctacggtt tctgttttga gggtgctgtt 1681 caataaagct gtgtacacta aatgtcc // LOCUS HSU27109 4198 bp mRNA PRI 04-AUG-1995 DEFINITION Human prepromultimerin mRNA, complete cds. ACCESSION U27109 NID g927595 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4198) AUTHORS Hayward,C.P., Hassell,J.A., Denomme,G.A., Rachubinski,R.A., Brown,C. and Kelton,J.G. TITLE The cDNA sequence of human endothelial cell multimerin. A unique protein with RGDS, coiled-coil, and epidermal growth factor-like domains and a carboxyl terminus similar to the globular domain of complement C1q and collagens type VIII and X JOURNAL J. Biol. Chem. 270 (31), 18246-18251 (1995) MEDLINE 95355440 REFERENCE 2 (bases 1 to 4198) AUTHORS Hayward,C.P. TITLE Direct Submission JOURNAL Submitted (15-MAY-1995) Catherine P. M. Hayward, Pathology, McMaster University, HSC 2N32, 1200 Main St. W., Hamilton, Ontario L8N 3Z5, Canada FEATURES Location/Qualifiers source 1..4198 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="overlapping clones: mmlambda 4, 5, 7, 11 & 17" /clone_lib="Clonetech library and libraries VII-91-4 and VII-91-5 (J Biol Chem 262 page 3718)" /cell_type="endothelial" 5'UTR 1..71 sig_peptide 72..128 CDS 72..3758 /codon_start=1 /product="prepromultimerin" /db_xref="PID:g927596" /translation="MKGARLFVLLSSLWSGGIGLNNSKHSWTIPEDGNSQKTMPSASV PPNKIQSLQILPTTRVMSAEIATTPEARTSEDSLLKSTLPPSETSAPAEGVRNQTLTS TEKAEGVVKLQNLTLPTNASIKFNPGAESVVLSNSTLKFLQSFARKSNEQATSLNTVG GTGGIGGVGGTGGVGNRAPRETYLSRGDSSSSQRTDYQKSNFETTRGKNWCAYVHTRL SPTVTLDNQVTYVPGGKGPCGWTGGSCPQRSQKISNPVYRMQHKIVTSLDWRCCPGYS GPKCQLRAQEQQSLIHTNQAESHTAVGRGVAEQQQQQGCGDPEVMQKMTDQVNYQAMK LTLLQKKIDNISLTVNDVRNTYSSLEGKVSEDKSREFQSLLKGLKSKSINVLIRDIVR EQFKIFQNDMQETVAQLFKTVSSLSEDLESTRQIIQKVNESVVSIAAQQKFVLVQENR PTLTDIVELRNHIVNVRQEMTLTCEKPIKELEVKQTHLEGALEQEHSRSILYYESLNK TLSKLKEVHEQLLSTEQVSDQKNAPAAESVSNNVTEYMSTLHENIKKQSLMMLQMFED LHIQESKINNLTVSLEMEKESLRGECEDMLSKCRNDFKFQLKDTEENLHVLNQTLAEV LFPMDNKMDKMSEQLNDLTYDMEILQPLLEQGASLRQTMTYEQPKEAIVIRKKIENLT SAVNSLNFIIKELTKRHNLLRNEVQGRDDALERRINEYALEMEDGLNKTMTIINNAID FIQDNYALKETLSTIKDNSEIHHKCTSDMETILTFIPQFHRLNDSIQTLVNDNQRYNF VLQVAKTLAGIPRDEKLNQSNFQKMYQMFNETTSQVRKYQQNMSHLEEKLLLTTKISK NFETRLQDIESKVTQTLIPYYISVKKGSVVTNERDQALQLQVLNSRFKALEAKSIHLS INFFSLNKTLHEVLTMCHNASTSVSELNATIPKWIKHSLPDIQLLQKGLTEFVEPIIQ IKTQAALSNSTCCIDRSLPGSLANVVKSQKQVKSLPKKINALKKPTVNLTTVLIGRTQ RNTDNIIYPEEYSSCSRHPCQNGGTCINGRTSFTCACRHPFTGDNCTIKLVEENALAP DFSKGSYRYAPMVAFFASHTYGMTIPGPILFNNLDVNYGASYTPRTGKFRIPYLGVYV FKYTIESFSAHISGFLVVDGIDKLAFESENINSEIHCDRVLTGDALLELNYGQEVWLR LAKGTIPAKFPPVTTFSGYLLYRT" mat_peptide 129..3755 /product="multimerin" misc_structure 627..638 /note="encodes RGDS domain" misc_structure 876..911 /note="encodes partial EGF-like domain" misc_structure one-of(1020..1196,1269..1406,2073..2285,2523..2690) /note="encodes putative coiled-coil structures in protein" misc_feature 1173..1199 /note="encodes sequence confirmed from protein sequencing" misc_feature 3183..3185 /note="encodes putative tyrosine sulfation site" misc_feature 3243..3245 /note="encodes putative asparagine hydroxylation site" misc_structure 3264..3299 /note="encodes EGF-like domain" misc_structure 3420..3755 /note="encodes putative globular head domain in protein" 3'UTR 3759..4198 polyA_signal 4179..4184 BASE COUNT 1436 a 791 c 808 g 1163 t ORIGIN 1 ctgctatcaa aaaggccata aggattttgt ccccaaattt cacatgagct accttgcttc 61 aaactactga gatgaagggg gcaagattat ttgtccttct ttctagttta tggagtgggg 121 gcattgggct taacaacagt aagcattctt ggactatacc tgaggatggg aactctcaga 181 agactatgcc ttctgcttca gttcctccaa ataaaataca aagtttgcaa atactgccaa 241 ccactcgggt catgtcggcg gagatagcta caactccaga ggcaagaact tctgaagaca 301 gtcttcttaa atcaacactg cctccctcag aaacaagtgc acctgctgag ggtgtgagaa 361 atcaaactct cacatccaca gagaaagcag aaggagtggt caagttacag aatcttaccc 421 tcccaaccaa cgctagcatc aagttcaatc ctggagcaga atcagtggtc ctttccaatt 481 ctacactgaa atttcttcag agctttgcca gaaagtcaaa tgaacaagca acttctctaa 541 acacagttgg aggcactgga ggcattggag gcgttggagg cactggaggc gtgggaaatc 601 gagccccacg ggaaacatac ctcagccggg gtgacagcag ttccagccaa agaactgact 661 accaaaaatc aaatttcgaa acaactagag gaaagaattg gtgtgcttat gtacatacca 721 ggttatctcc cacagtgaca ttggacaacc aggtcactta tgtcccaggt gggaaaggac 781 cttgtggctg gaccggtgga tcctgtcctc agagatctca gaagatatcc aatcctgtct 841 ataggatgca acataaaatt gtcacctcat tggattggag gtgctgtcct ggatacagtg 901 ggccgaaatg tcaactaaga gcccaggaac agcaaagttt gatacacacc aaccaggctg 961 aaagtcatac agctgttggc agaggagtag ctgagcagca gcagcagcaa ggctgtggtg 1021 acccagaagt gatgcaaaaa atgactgatc aggtgaacta ccaggcaatg aaactgactc 1081 ttctgcagaa gaagattgac aatatttctt tgactgtgaa tgatgtaagg aacacttact 1141 cctccctaga aggaaaagtc agcgaagata aaagcagaga atttcaatct cttctaaaag 1201 gtctaaaatc caaaagcatt aatgtactga taagagacat agtaagagaa caatttaaaa 1261 tttttcaaaa tgacatgcaa gagactgtag cacagctctt caagactgta tcaagtctat 1321 cagaggacct cgaaagcacc aggcaaataa ttcaaaaagt taatgaatct gtggtttcaa 1381 tagcagccca gcaaaagttt gttttggtgc aagagaatcg gcccactttg actgatatag 1441 tggaactaag gaatcacatt gtgaatgtaa ggcaagaaat gactcttaca tgtgagaagc 1501 ctattaaaga actagaagta aagcagactc atttagaagg tgctctagaa caggaacact 1561 caagaagcat tctgtattat gaatccctca ataaaactct ttctaaattg aaggaagtac 1621 atgagcagct tttatcaact gaacaggtat cagaccagaa gaatgctcca gctgctgagt 1681 cagttagcaa taatgtcact gagtacatgt ctactttaca tgaaaatata aagaagcaga 1741 gtttgatgat gctgcaaatg tttgaagatt tgcacattca agaaagcaag attaacaatc 1801 tcaccgtctc tttggagatg gagaaagagt ctctcagagg tgaatgtgaa gacatgttat 1861 ccaaatgcag aaatgatttt aaatttcaac ttaaggacac agaagagaat ttacatgtgt 1921 taaatcaaac attggctgaa gttctctttc caatggacaa taagatggac aaaatgagtg 1981 agcaactaaa tgatttgact tatgatatgg agatccttca acccttgctt gagcagggag 2041 catcactcag acagacaatg acatatgaac aaccaaagga agcaatagtg ataaggaaaa 2101 agatagaaaa tctgactagt gctgtcaata gtctaaattt tattatcaaa gaacttacaa 2161 aaagacacaa cttacttaga aatgaagtac agggtcgtga tgatgcctta gaaagacgta 2221 tcaatgaata tgccttagaa atggaagatg gcctcaataa gacaatgact attataaata 2281 atgctattga tttcattcaa gataactatg ccctaaaaga gactttaagt actattaagg 2341 ataatagtga gatccatcat aaatgtacct ccgatatgga aactattttg acatttattc 2401 ctcagttcca ccgtctgaat gattctattc agactttggt caatgacaat cagagatata 2461 actttgtttt gcaagtcgcc aagacccttg caggtattcc cagagatgag aaactaaatc 2521 agtccaactt ccaaaagatg tatcaaatgt tcaatgaaac cacttcccaa gtgagaaaat 2581 accagcaaaa tatgagtcat ttggaagaaa aactactctt aactaccaag atttccaaaa 2641 attttgagac tcggttgcaa gacattgagt ctaaagttac ccagacgctc ataccttatt 2701 atatttcagt taaaaaaggc agtgtagtta caaatgagag agatcaggct cttcaactgc 2761 aagtattaaa ttccagattt aaggcgttgg aagcaaaatc tatccatctt tcaattaact 2821 tcttttcgct taacaaaact ctccacgaag ttttaacaat gtgtcacaat gcttctacaa 2881 gtgtgtcaga actgaatgct accatcccta agtggataaa acattccctg ccagatattc 2941 aacttcttca gaaaggtcta acagaatttg tggaaccaat aattcaaata aaaactcaag 3001 ctgccctatc taattcaact tgttgtatag atcgatcgtt gcctggtagt ctggcaaatg 3061 ttgtcaagtc tcagaagcaa gtaaaatcat tgccaaagaa aattaacgca cttaagaaac 3121 caacggtaaa tcttaccaca gtcctgatag gccggactca aagaaacacg gacaacataa 3181 tatatcctga ggagtattca agctgtagtc ggcatccgtg ccaaaatggg ggcacgtgca 3241 taaatggaag aactagcttt acctgtgcct gcagacatcc ttttactggt gacaactgca 3301 ctatcaagct tgtggaagaa aatgctttag ctccagattt ttccaaagga tcttacagat 3361 atgcacccat ggtggcattt tttgcatctc atacgtatgg aatgactata cctggtccta 3421 tcctgtttaa taacttggat gtcaattatg gagcttcata taccccaaga actggaaaat 3481 ttagaattcc gtatcttgga gtatatgttt tcaagtacac catcgagtca tttagtgctc 3541 atatttctgg atttttagtg gttgatggaa tagacaagct tgcatttgag tctgaaaata 3601 ttaacagtga aatacactgt gatagggttt taactgggga tgccttatta gaattaaatt 3661 atgggcagga agtctggtta cgacttgcaa aaggaacaat tccagccaag tttccccctg 3721 ttactacatt tagtggctat ttattatatc gtacataagt tagtatgaaa aacagactat 3781 cacctttatt gagaaacagc cagtgttttc atttatcttt gcttgcacat ctgctctgtt 3841 ttggtttttc tacaggaaat gaaaatcaac ttgttttttt aatatgagta aacttgtatg 3901 tctattttat aaaattattt gaatattgtt taatgtctga atatgaaaga gttcttgatc 3961 ctaaagaaat ttagtggcac agaaaacaaa gtgaatttgt tagcataatt attcctattc 4021 ttatttcttc attttaagtc attgcaatgg aaagtaatat tataaaacgg taattacaac 4081 atattatcag tcacagtttt ctttccaatt aaacacttaa cttttgttat tccctgtata 4141 taaatatata acacacattt tctagattca caaatttaaa taaattactc aaaaaatg // LOCUS HSU27143 580 bp mRNA PRI 05-DEC-1995 DEFINITION Human protein kinase C inhibitor-I cDNA, complete cds. ACCESSION U27143 NID g862932 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 580) AUTHORS Brzoska,P.M., Chen,H., Zhu,Y., Levin,N.A., Disatnik,M.H., Mochly-Rosen,D., Murnane,J.P. and Christman,M.F. TITLE The product of the ataxia-telangiectasia group D complementing gene, ATDC, interacts with a protein kinase C substrate and inhibitor JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (17), 7824-7828 (1995) MEDLINE 95372371 REFERENCE 2 (bases 1 to 580) AUTHORS Christman,M.F. TITLE Direct Submission JOURNAL Submitted (16-MAY-1995) Michael F. Christman, Radiation Oncology, University of California, San Francisco, 1855 Folsom St. MCB200, San Francisco, CA 94103 USA FEATURES Location/Qualifiers source 1..580 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HELA" CDS 44..424 /note="; PKCI-1; interacts with product of the Ataxia Telangiectasia complementing gene, ATDC" /codon_start=1 /product="protein kinase C inhibitor-I" /db_xref="PID:g862933" /translation="MADEIAKAQVARPGGDTIFGKIIRKEIPAKIIFEDDRCLAFHDI SPQAPTHFLVIPKKHISQISVAEDDDESLLGHLMIVGKKCAADLGLNKGYRMVVNEGS DGGQSVYHVHLHVLGGRQMHWPPG" BASE COUNT 154 a 116 c 147 g 163 t ORIGIN 1 ggcacgaggc gctcctctgg gcccgcgcgg gagagaggcc gagatggcag atgagattgc 61 caaggctcag gtcgctcggc ctggtggcga cacgatcttt gggaagatca tccgcaagga 121 aataccagcc aaaatcattt ttgaggatga ccggtgcctt gctttccatg acatttcccc 181 tcaagcacca acacattttc tggtgatacc caagaaacat atatcccaga tttctgtggc 241 agaagatgat gatgaaagtc ttcttggaca cttaatgatt gttggcaaga aatgtgctgc 301 tgatctgggc ctgaataagg gttatcgaat ggtggtgaat gaaggttcag atggtggaca 361 gtctgtctat cacgttcatc tccatgttct tggaggtcgg caaatgcatt ggcctcctgg 421 ttaagcacgt tttggggata attttctctt ctttaggcaa tgattaagtt aggcaatttc 481 cagtatgtta agtaacacac ttatttttgc ctgtgtatgg agagattcaa gaaataattt 541 taaaaccgca tacataataa aagacattgt tgcatggcta // LOCUS HSU27185 846 bp mRNA PRI 16-MAY-1996 DEFINITION Human RAR-responsive (TIG1) mRNA, complete cds. ACCESSION U27185 NID g942584 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 846) AUTHORS Nagpal,S., Patel,S., Asano,A.T., Johnson,A.T., Duvic,M. and Chandraratna,R.A. TITLE Tazarotene-induced gene 1 (TIG1), a novel retinoic acid receptor-responsive gene in skin JOURNAL J. Invest. Dermatol. 106 (2), 269-274 (1996) MEDLINE 96179739 REFERENCE 2 (bases 1 to 846) AUTHORS Patel,S.K. TITLE Direct Submission JOURNAL Submitted (16-MAY-1995) Sheetal K. Patel, Retinoid Research, Allergan Pharmaceuticals, 2525 Dupont Drive, Irvine, CA 92713, USA FEATURES Location/Qualifiers source 1..846 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="skin raft culture" gene 37..723 /gene="TIG1" CDS 37..723 /gene="TIG1" /codon_start=1 /db_xref="PID:g942585" /translation="MQPRRQRLPAPWSGPRGPRPTAPLLALLLLLAPVAAPAGSGGPD DPGQPQDAGVPRRLLQQKARAALHFFNFRSGSPSALRVLAEVQEGRAWINPKEGCKVH VVFSTERYNPESLLQEGEGRLGKCSARVFFKNQKPRPTINVTCTRLIEKKKRQQEDYL LYKQMKQLKNPLEIVSIPDNHGHIDPSLRLIWDLAFLGSSYVMWEMTTQVSHYYLAQL TSVRQWVRKT" polyA_site 846 BASE COUNT 203 a 234 c 225 g 184 t ORIGIN 1 ccacgtccgg ggtgccgagc caactttcct gcgtccatgc agccccgccg gcaacggctg 61 cccgctccct ggtccgggcc caggggcccg cgccccaccg ccccgctgct cgcgctgctg 121 ctgttgctcg ccccggtggc ggcgcccgcg gggtccgggg gccccgacga ccctgggcag 181 cctcaggatg ctggggtccc gcgcaggctc ctgcagcaga aggcgcgcgc ggcgcttcac 241 ttcttcaact tccggtccgg ctcgcccagc gcgctgcgag tgctggccga ggtgcaggag 301 ggccgcgcgt ggattaatcc aaaagaggga tgtaaagttc acgtggtctt cagcacagag 361 cgctacaacc cagagtcttt acttcaggaa ggtgagggac gtttggggaa atgttctgct 421 cgagtgtttt tcaagaatca gaaacccaga ccaaccatca atgtaacttg tacacggctc 481 atcgagaaaa agaaaagaca acaagaggat tacctgcttt acaagcaaat gaagcaactg 541 aaaaacccct tggaaatagt cagcatacct gataatcatg gacatattga tccctctctg 601 agactcatct gggatttggc tttccttgga agctcttacg tgatgtggga aatgacaaca 661 caggtgtcac actactactt ggcacagctc actagtgtga ggcagtgggt aagaaaaacc 721 tgaaaattaa cttgtgccac aagagttaca atcaaagtgg tctccttaga ctgaattcat 781 gtgaacttct aatttcatat caagagttgt aatcacattt atttcaataa atatgtgagt 841 tcctgc // LOCUS HSU27193 2377 bp mRNA PRI 09-DEC-1995 DEFINITION Human protein-tyrosine phosphatase mRNA, complete cds. ACCESSION U27193 NID g1109781 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2377) AUTHORS Martell,K.J., Seasholtz,A.F., Kwak,S.P., Clemens,K.K. and Dixon,J.E. TITLE hVH-5: a protein tyrosine phosphatase abundant in brain that inactivates mitogen-activated protein kinase JOURNAL J. Neurochem. 65 (4), 1823-1833 (1995) MEDLINE 96009533 REFERENCE 2 (bases 1 to 2377) AUTHORS Martell,K.J. TITLE Direct Submission JOURNAL Submitted (16-MAY-1995) Karen J. Martell, Biochemistry, University of Michigan, M5416 Medical Science I, 1301 Catherine Street, Ann Arbor, MI 48109-0606, USA FEATURES Location/Qualifiers source 1..2377 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hVH-5" /tissue_type="brain" /dev_stage="fetal" CDS 135..2012 /codon_start=1 /product="protein-tyrosine phosphatase" /db_xref="PID:g1109782" /translation="MAGDRLPRKVMDAKKLASLLRGGPGGPLVIDSRSFVEYNSWHVL SSVNICCSKLVKRRLQQGKVTIAELIQPAARSQVEATEPQDVVVYDQSTRDASVLAAD SFLSILLSKLDGCFDSVAILTGGFATFSSCFPGLCEGKPAALLPMSLSQPCLPVPSVG LTRILPHLYLGSQKDVLNKDLMTQNGISYVLNASNSCPKPDFICESRFMRVPINDNYC EKLLPWLDKSIEFIDKAKLSSCQVIVHCLAGISRSATIAIAYIMKTMGMSSDDAYRFV KDRRPSISPNFNFLGQLLEYERTLKLLAALQGDPGTPSGTPEPPPSPAAGAPLPRLPP PTSESAATGNAAAREGGLSAGGEPPAPPTPPATSALQQGLRGLHLSSDRLQDTNRLKR SFSLDIKSAYAPSRRPDGPGPPDPGEAPKLCKLDSPSGAALGLSSPSPDSPDAAPEAR PRPRRRPRPPAGSPARSPAHSLGLNFGDAARQTPRHGLSALSAPGLPGPGQPAGPGAW APPLDSPGTPSPDGPWCFSPEGAQGAGGVLFAPFGRAGAPGPGGGSDLRRREAARAEP RDARTGWPEEPAPETQFKRRSCQMEFEEGMVEGRARGEELAALGKQASFSGSVEVIEV S" BASE COUNT 397 a 862 c 736 g 382 t ORIGIN 1 ggggagccga ggcgagcgcg agcgaggtcc agcaccatgt gctaggtcac tcccagcgcg 61 aggccacacc tgggccgtcg gagcagcccc tcctcacttc aggggtcacc ctccccagca 121 cccattgccc caccatggct ggggaccggc tcccgaggaa ggtgatggat gccaagaagc 181 tggccagcct gctgcggggc gggcctgggg ggccgctggt catcgacagc cgctccttcg 241 tggagtacaa cagctggcat gtgctcagct ccgtcaacat ctgctgctcc aagctggtga 301 agcggcggct gcagcagggc aaggtgacca ttgcggagct catccagccg gctgcacgca 361 gccaggtgga ggctacggag ccacaggacg tggtggtcta tgaccagagc acgcgggacg 421 ccagcgtgct ggccgcagac agcttcctct ccatcctgct gagcaagctg gacggctgct 481 tcgacagcgt ggccatcctc actgggggct tcgccacctt ctcctcctgc ttccccggcc 541 tctgcgaggg caagcctgct gccctgctac ccatgagcct ctcccagccc tgcctgcctg 601 tgcccagcgt gggcctgacc cgcatcctgc ctcacctcta cctgggctcg cagaaggacg 661 tcctaaacaa ggatctgatg acgcaaaatg gaataagcta cgtcctcaac gccagcaact 721 cctgccccaa gcctgacttc atctgcgaga gccgcttcat gcgggtcccc atcaacgaca 781 actactgtga aaaactgctg ccctggctgg acaagtccat cgagttcatc gataaagcca 841 agctctccag ctgccaagtc atcgtccact gtctggctgg catctcccgc tctgccacca 901 tcgccatcgc ctacatcatg aagaccatgg gcatgtcctc cgacgacgcc tacaggttcg 961 tgaaggacag gcgcccgtcc atctcgccca acttcaactt cctgggccag ctgctggagt 1021 acgagcgcac gctgaagctg ctggccgccc tgcagggcga cccgggcacc ccctcaggga 1081 cgccggagcc tccgcccagt cctgccgccg gggccccgct gccacggctg ccaccaccta 1141 cctcagagag cgctgccaca gggaatgcgg ctgccaggga gggcggcctg agcgcgggcg 1201 gggagccccc cgcgcccccc acgcccccgg cgaccagcgc actgcagcag ggcctgcgcg 1261 gcctgcacct ctcctcggac cgcctgcagg acactaaccg cctcaagcgc tccttctccc 1321 tggacatcaa gtctgcctac gcccctagca ggcggcccga cggccccggg ccccccgacc 1381 ccggcgaggc cccgaagctc tgcaagctgg acagcccgtc gggggccgcg ctgggcctgt 1441 cctcgcccag cccggacagc ccggacgccg cgcctgaggc gcgcccacgg ccccgccggc 1501 ggccccggcc ccccgccggc tcccccgcgc gctcccccgc gcacagcctc ggcctgaact 1561 tcggcgatgc ggcccggcag actccgcggc acggcctctc ggccctgtcg gcgcccgggc 1621 tgcccggccc tggccagccg gccggccccg gggcctgggc accgccgctt gactccccag 1681 gcacgccgtc gcccgacggg ccctggtgct tcagccccga gggcgcacag ggggcgggcg 1741 gggtgctgtt tgcgcccttc ggccgggcgg gcgccccggg accaggcggc ggcagcgacc 1801 tgcggcggcg ggaggcagcg agggctgagc cccgggacgc gcggaccggc tggcccgagg 1861 agccggcccc ggagacgcag ttcaagcgcc gcagctgcca gatggagttc gaggagggca 1921 tggtggaggg gcgcgcgcgc ggcgaggagc tggccgccct gggcaagcag gcgagcttct 1981 cgggcagcgt ggaggtcatc gaggtgtcct gacccctccg ctgccctcgg ccccgccgcc 2041 cgcagccagg cccgttataa atgtatatta tatataatgc aaagaaaggt aaatggtttt 2101 actgggattt ttatcgagaa gtaaatattt cgatttttta tttatttaag ctgttcattc 2161 tggcaatgat ttggcaacag tgcgggtggt cctcgagctc tatttttact gtctggtatt 2221 taaactgaaa catacgtttc taagcaatac gaggccacct tcagtcgcaa gctgggtgcc 2281 aggcctgggg ccctcccagt tcccccgccc caggaaacac tgctgacctt tgcaaaggct 2341 gccgagcttt cgtgcacttt ttacataaca aaaaggg // LOCUS HSU27325 1323 bp mRNA PRI 21-JUN-1995 DEFINITION Human thromboxane A2 receptor mRNA, complete cds. ACCESSION U27325 NID g862993 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1323) AUTHORS D'Angelo,D.D., Davis,M.G., Ali,S. and Dorn,G.W. 2nd. TITLE Cloning and pharmacologic characterization of a thromboxane A2 receptor from K562 (human chronic myelogenous leukemia) cells JOURNAL J. Pharmacol. Exp. Ther. 271 (2), 1034-1041 (1994) MEDLINE 95055139 REFERENCE 2 (bases 1 to 1323) AUTHORS Dorn,G.G. TITLE Direct Submission JOURNAL Submitted (19-MAY-1995) Gerald W. Dorn II, Division of Cardiology, University of Cincinnati, 231 Bethesda Ave, Cincinnati, OH 45267-0542, USA FEATURES Location/Qualifiers source 1..1323 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="K562" CDS 124..1155 /codon_start=1 /product="thromboxane A2 receptor" /db_xref="PID:g862994" /translation="MWPNGSSLGPCFRPTNITLEERRLIASPWFAASFCVVGLASNLL ALSVLAGARQGGSHTRSSFLTFLCGLVLTDFLGLLVTGTIVVSQHAALFEWHAVDPGC RLCRFMGVVMIFFGLSPLLLGAAMASERYLGITRPFSRPAVASQRRAWATVGLVWAAA LALGLLPLLGVGRYTVQYPGSWCFLTLGAESGDVAFGLLFSMLGGLSVGLSFLLNTVS VATLCHVYHGQEAAQQRPRDSEVEMMAQLLGIMVVASVCWLPLLVFIAQTVLRNPPAM SPAGQLSRTTEKELLIYLRVATWNQILDPWVYILFRRAVLRRLQPRLSTRPRSLSLQP QLTQRSGLQ" BASE COUNT 157 a 474 c 432 g 260 t ORIGIN 1 atccagccag gtgggagccc cgcagatgag gtctctgaag gtgtgcctga accagtgcca 61 acctgccctg tctgcagcat cggcctgatg gggtggtgac tgatccctca gggctccgga 121 gccatgtggc ccaacggcag ttccctgggg ccctgtttcc ggcccacaaa cattaccctg 181 gaggagagac ggctgatcgc ctcgccctgg ttcgccgcct ccttctgcgt ggtgggcctg 241 gcctccaacc tgctggccct gagcgtgctg gcgggcgcgc ggcagggggg ttcgcacacg 301 cgctcctcct tcctcacctt cctctgcggc ctggtcctca ccgacttcct ggggctgctg 361 gtgaccggta ccatcgtggt gtcccagcac gccgcgctct tcgagtggca cgccgtggac 421 cctggctgcc gtctctgtcg cttcatgggc gtcgtcatga tcttcttcgg cctgtccccg 481 ctgctgctgg gggccgccat ggcctcagag cgctacctgg gtatcacccg gcccttctcg 541 cgcccggcgg tcgcctcgca gcgccgcgcc tgggccaccg tggggctggt gtgggcggcc 601 gcgctggcgc tgggcctgct gcccctgctg ggcgtgggtc gctacaccgt gcaatacccg 661 gggtcctggt gcttcctgac gctgggcgcc gagtccgggg acgtggcctt cgggctgctc 721 ttctccatgc tgggcggcct ctcggtcggg ctgtccttcc tgctgaacac ggtcagcgtg 781 gccaccctgt gccacgtcta ccacgggcag gaggcggccc agcagcgtcc ccgggactcc 841 gaggtggaga tgatggctca gctcctgggg atcatggtgg tggccagcgt gtgttggctg 901 ccccttctgg tcttcattgc ccagacagtg ctgcgaaacc cgcctgccat gagccccgcc 961 gggcagctgt cccgcaccac ggagaaggag ctgctcatct acttgcgcgt ggccacctgg 1021 aaccagatcc tggacccctg ggtgtatatc ctgttccgcc gcgccgtgct ccggcgtctc 1081 cagcctcgcc tcagcacccg gcccaggtcg ctgtccctcc agccccagct cacgcagcgc 1141 tccgggctgc agtaggaagt ggacagagcg cccctcccgc gcctttccgc ggagcccttg 1201 gcccctcgga cagcccatct gcctgttctg aggattcagg ggctgggggt gctggatgga 1261 cagtgggcat cagcagcagg gttttgggtt gaccccaatc caacccgggg acccccaact 1321 cct // LOCUS HSU27326 2180 bp mRNA PRI 31-AUG-1995 DEFINITION Human alpha (1,3/1,4) fucosyltransferase (FUT3) mRNA, major transcript I, complete cds. ACCESSION U27326 NID g967188 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2180) AUTHORS Cameron,H.S., Szczepaniak,D. and Weston,B.W. TITLE Expression of human chromosome 19p alpha(1,3)-fucosyltransferase genes in normal tissues. Alternative splicing, polyadenylation, and isoforms JOURNAL J. Biol. Chem. 270 (34), 20112-20122 (1995) MEDLINE 95378269 REFERENCE 2 (bases 1 to 2180) AUTHORS Weston,B.W. TITLE Direct Submission JOURNAL Submitted (19-MAY-1995) Brent W. Weston, Pediatrics, University of North Carolina at Chapel Hill, Division of Hematology/Oncology, 418 MacNider Building, Chapel Hill, NC 27599-7220, USA FEATURES Location/Qualifiers source 1..2180 /organism="Homo sapiens" /db_xref="taxon:9606" /map="19p" /chromosome="19" /clone="major transcript I" /tissue_type="colon, kidney" gene 230..1315 /gene="FUT3" CDS 230..1315 /gene="FUT3" /codon_start=1 /product="alpha (1,3/1,4) fucosyltransferase" /db_xref="PID:g967189" /translation="MDPLGAAKPQWPWRRCLAALLFQLLVAVCFFSYLRVSRDDATGS PRAPSGSSRQDTTPTRPTLLILLWTWPFHIPVALSRCSEMVPGTADCHITADRKVYPQ ADTVIVHHWDIMSNPKSRLPPSPRPQGQRWIWFNLEPPPNCQHLEALDRYFNLTMSYR SDSDIFTPYGWLEPWSGQPAHPPLNLSAKTELVAWAVSNWKPDSARVRYYQSLQAHLK VDVYGRSHKPLPKGTMMETLSRYKFYLAFENSLHPDYITEKLWRNALEAWAVPVVLGP SRSNYERFLPPDAFIHVDDFQSPKDLARYLQELDKDHARYLSYFRWRETLRPRSFSWA LDFCKACWKLQQESRYQTVRSIAAWFT" BASE COUNT 437 a 697 c 604 g 442 t ORIGIN 1 gttcccactt cctcccgccc caggaaacct gccatggcct cctggtgagc tgtcctcatc 61 cactgctcgc tgcctctcca ggtgacctac aggctccgct cgacactgca aggcttagac 121 cagttcggtc caacagagaa agcaggcaac caccatgtca tttgaaaaca gtttcatcgg 181 gatataattc gcaacccata cagtgaatcc atttaagata ctctgaccca tggatcccct 241 gggtgcagcc aagccacaat ggccatggcg ccgctgtctg gccgcactgc tatttcagct 301 gctggtggct gtgtgtttct tctcctacct gcgtgtgtcc cgagacgatg ccactggatc 361 ccctagggct cccagtgggt cctcccgaca ggacaccact cccacccgcc ccaccctcct 421 gatcctgcta tggacatggc ctttccacat ccctgtggct ctgtcccgct gttcagagat 481 ggtgcccggc acagccgact gccacatcac tgccgaccgc aaggtgtacc cacaggcaga 541 cacggtcatc gtgcaccact gggatatcat gtccaaccct aagtcacgcc tcccaccttc 601 cccgaggccg caggggcagc gctggatctg gttcaacttg gagccacccc ctaactgcca 661 gcacctggaa gccctggaca gatacttcaa tctcaccatg tcctaccgca gcgactccga 721 catcttcacg ccctacggct ggctggagcc gtggtccggc cagcctgccc acccaccgct 781 caacctctcg gccaagaccg agctggtggc ctgggcggtg tccaactgga agccggactc 841 agccagggtg cgctactacc agagcctgca ggctcatctc aaggtggacg tgtacggacg 901 ctcccacaag cccctgccca aggggaccat gatggagacg ctgtcccggt acaagttcta 961 cctggccttc gagaactcct tgcaccccga ctacatcacc gagaagctgt ggaggaacgc 1021 cctggaggcc tgggccgtgc ccgtggtgct gggccccagc agaagcaact acgagaggtt 1081 cctgccaccc gacgccttca tccacgtgga cgacttccag agccccaagg acctggcccg 1141 gtacctgcag gagctggaca aggaccacgc ccgctacctg agctactttc gctggcggga 1201 gacgctgcgg cctcgctcct tcagctgggc actggatttc tgcaaggcct gctggaaact 1261 gcagcaggaa tccaggtacc agacggtgcg cagcatagcg gcttggttca cctgagaggc 1321 cggcatggtg cctgggctgc cgggaacctc atctgcctgg ggcctcacct gctggagtcc 1381 tttgtggcca accctctctc ttacctggga cctcacacgc tgggcttcac ggctgccagg 1441 agcctctccc ctccagaaga cttgcctgct agggacctcg cctgctgggg acctcgcctg 1501 ttggggacct cacctgctgg ggacctcacc tgctggggac cttggctgct ggaggctgca 1561 cctactgagg atgtcggcgg tcggggactt tacctgctgg gacctgctcc cagagacctt 1621 gccacactga atctcacctg ctggggacct caccctggag ggccctgggc cctggggaac 1681 tggcttactt ggggccccac ccgggagtga tggttctggc tgatttgttt gtgatgttgt 1741 tagccgcctg tgaggggtgc agagagatca tcacggcacg gtttccagat gtaatactgc 1801 aaggaaaaat gatgacgtgt ctcctcactc tagaggggtt ggtcccatgg gttaagagct 1861 caccccaggt tctcacctca ggggttaaga gctcagagtt cagacaggtc caagttcaag 1921 cccaggacca ccacttatag ggtacaggtg ggatcgactg taaatgagga cttctggaac 1981 attccaaata ttctggggtt gagggaaatt gctgctgtct acaaaatgcc aagggtggac 2041 aggcgctgtg gctcacgcct gtaattccag cactttggga ggctgaggta ggaggattga 2101 ttgaggccaa gagttaaaga ccagcctggt caatatagca agaccacgtc tctaaataaa 2161 aaataatagg ccggccagca // LOCUS HSU27467 737 bp mRNA PRI 29-NOV-1995 DEFINITION Human Bcl-2 related (Bfl-1) mRNA, complete cds. ACCESSION U27467 NID g1079557 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 737) AUTHORS Choi,S.S., Park,I.C., Yun,J.W., Sung,Y.C., Hong,S.I. and Shin,H.S. TITLE A novel Bcl-2 related gene, Bfl-1, is overexpressed in stomach cancer and preferentially expressed in bone marrow JOURNAL Oncogene 11 (9), 1693-1698 (1995) MEDLINE 96068895 REFERENCE 2 (bases 1 to 737) AUTHORS Choi,S.S., Park,I.C., Yun,J.W., Sung,Y.C., Hong,S.I. and Shin,H.S. TITLE Direct Submission JOURNAL Submitted (23-MAY-1995) Hee-Sup Shin, Life Science, Pohang University of Science and Technology, San31, Pohang, Kyungbuk 790-784, Korea FEATURES Location/Qualifiers source 1..737 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="22 week old fetus" /tissue_type="liver" gene 35..562 /gene="Bfl-1" CDS 35..562 /gene="Bfl-1" /note="Bcl-2 related; similar to mouse hemopoietic-specific early-response protein, Swiss-Prot Accession Number Q07440; similar to mouse transforming protein bcl-2-beta, Swiss-Prot Accession Number P10418; Method: conceptual translation supplied by author." /codon_start=1 /db_xref="PID:g1079558" /translation="MTDCEFGYIYRLAQDYLQCVLQIPQPGSGPSKTSRVLQNVAFSV QKEVEKNLKSCLDNVNVVSVDTARTLFNQVMEKEFEDGIINWGRIVTIFAFEGILIKK LLRQQIAPDVDTYKEISYFVAEFIMNNTGEWIRQNGGWENGFVKKFEPKSGWMTFLEV TGKICEMLSLLKQYC" polyA_site 737 /note="19 A nucleotides" BASE COUNT 239 a 137 c 157 g 204 t ORIGIN 1 ccagctcaag actttgctct ccaccaggca gaagatgaca gactgtgaat ttggatatat 61 ttacaggctg gctcaggact atctgcagtg cgtcctacag ataccacaac ctggatcagg 121 tccaagcaaa acgtccagag tgctacaaaa tgttgcgttc tcagtccaaa aagaagtgga 181 aaagaatctg aagtcatgct tggacaatgt taatgttgtg tccgtagaca ctgccagaac 241 actattcaac caagtgatgg aaaaggagtt tgaagacggc atcattaact ggggaagaat 301 tgtaaccata tttgcatttg aaggtattct catcaagaaa cttctacgac agcaaattgc 361 cccggatgtg gatacctata aggagatttc atattttgtt gcggagttca taatgaataa 421 cacaggagaa tggataaggc aaaacggagg ctgggaaaat ggctttgtaa agaagtttga 481 acctaaatct ggctggatga cttttctaga agttacagga aagatctgtg aaatgctatc 541 tctcctgaag caatactgtt gaccagaaag gacactccat attgtgaaac cggcctaatt 601 tttctgactg atatggaaac gattgccaac acatacttct acttttaaat aaacaacttt 661 gatgatgtaa cttgaccttc cagagttatg gaaattttgt ccccatgtaa tgaataaatt 721 gtatgtattt ttctcta // LOCUS HSU27655 2638 bp mRNA PRI 07-MAR-1996 DEFINITION Human RGP3 mRNA, complete cds. ACCESSION U27655 NID g1216368 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2638) AUTHORS Druey,K.M., Blumer,K.J., Kang,V.H. and Kehrl,J.H. TITLE Inhibition of G-protein-mediated MAP kinase activation by a new mammalian gene family JOURNAL Nature 379 (6567), 742-746 (1996) MEDLINE 96178495 REFERENCE 2 (bases 1 to 2638) AUTHORS Druey,K. TITLE Direct Submission JOURNAL Submitted (25-MAY-1995) Kirk Druey, Intramural Research/NIAID/LIR, Rm 11B13, National Institutes of Health, 10 Center Drive, MSC 1876, Bethesda, MD 20892-1876, USA FEATURES Location/Qualifiers source 1..2638 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="B lymphocytes" CDS 288..1847 /codon_start=1 /product="RGP3" /db_xref="PID:g1216369" /translation="MFETEADEKREMALEEGKGPGAEDSPPSKEPSPGQELPPGQDLP PNKDSPSGQEPAPSQEPLSSKDSATSEGSPPGPDAPPSKDVPPCQEPPPAQDLSPCQD LPAGQEPLPHQDPLLTKDLPAIQESPTRDLPPCQDLPPSQVSLPAKALTEDTMSSGDL LAATGDPPAAPRPAFVIPEVRLDSTYSQKAGAEQGCSGDEEDAEEAEEVEEGEEGEED EDEDTSDDNYGERSEAKRSSMIETGQGAEGGLSLRVQNSLRRRTHSEGSLLQEPRGPC FASDTTLHCSDGEGAASTWGMPSPSTLKKELGRNGGSMHHLSLFFTGHRKMSGADTVG DDDEASRKRKSKNLAKDMKNKLGIFRRRNESPGAPPAGKADKMMKSFKPTSEEALKWG ESLEKLLVHKYGLAVFQAFLRTEFSEENLEFWLACEDFKKVKSQSKMASKAKKIFAEY IAIQACKEVNLDSYTREHTKDNLQSVTRGCFDLAQKRIFGLMEKDSYPRFLRSDLYLD LINQKKMSPPL" BASE COUNT 593 a 801 c 780 g 464 t ORIGIN 1 gaggaaaggg gaaatgcggc ccgctcccca ctcagtgcca ctctgtgcca ctccgtgcca 61 ggccctgagg gcacccggtt gctgcttcct tccgtctttc cccaaggact atcagagatg 121 ccagcgtgac ccctgacacg tgtgtgcagc agcctgcagc tgccccaagc catggctgaa 181 cactgactcc cagctgtggg cttcaccatt acagactccc cagggcttca aagacttctc 241 agcttcgagc atggcttttg gctgtcaggg cagctgtaca atagtggatg tttgagacgg 301 aggcagatga gaagagggag atggccttgg aggaagggaa ggggcctggt gccgaggatt 361 ccccacccag caaggagccc tctcctggcc aggagcttcc tccaggacaa gaccttccac 421 ccaacaagga ctccccttct gggcaggaac ccgctcccag ccaagaacca ctgtccagca 481 aagactcagc tacctctgaa ggatcccctc caggcccaga tgctccgccc agcaaggatg 541 tgccaccatg ccaggaaccc cctccagccc aagacctctc accctgccag gacctacctg 601 ctggtcaaga acccctgcct caccaggacc ctctactcac caaagacctc cctgccatcc 661 aggaatcccc cacccgggac cttccaccct gtcaagatct gcctcctagc caggtctccc 721 tgccagccaa ggcccttact gaggacacca tgagctccgg ggacctacta gcagctactg 781 gggacccacc tgcggccccc aggccagcct tcgtgatccc tgaggtccgg ctggatagca 841 cctacagcca gaaggcaggg gcagagcagg gctgctcggg agatgaggag gatgcagaag 901 aggccgagga ggtggaggag ggggaggaag gggaggagga cgaggatgag gacaccagcg 961 atgacaacta cggagagcgc agtgaggcca agcgcagcag catgatcgag acgggccagg 1021 gggctgaggg tggcctctca ctgcgtgtgc agaactcgct gcggcgccgg acgcacagcg 1081 agggcagcct gctgcaggag ccccgagggc cctgctttgc ctccgacacc accttgcact 1141 gctcagacgg tgagggcgcc gcctccacct ggggcatgcc ttcgcccagc accctcaaga 1201 aagagctggg ccgcaatggt ggctccatgc accacctttc cctcttcttc acaggacaca 1261 ggaagatgag cggggctgac accgttgggg atgatgacga agcctcccgg aagagaaaga 1321 gcaaaaacct agccaaggac atgaagaaca agctggggat cttcagacgg cggaatgagt 1381 cccctggagc ccctcccgcg ggcaaggcag acaaaatgat gaagtcattc aagcccacct 1441 cagaggaagc cctcaagtgg ggcgagtcct tggagaagct gctggttcac aaatacgggt 1501 tagcagtgtt ccaagccttc cttcgcactg agttcagtga ggagaatctg gagttctggt 1561 tggcttgtga ggacttcaag aaggtcaagt cacagtccaa gatggcatcc aaggccaaga 1621 agatctttgc tgaatacatc gcgatccagg catgcaagga ggtcaacctg gactcctaca 1681 cgcgggagca caccaaggac aacctgcaga gcgtcacgcg gggctgcttc gacctggcac 1741 agaagcgcat cttcgggctc atggaaaagg actcgtaccc tcgctttctc cgttctgacc 1801 tctacctgga ccttattaac cagaagaaga tgagtccccc gctttagggg ccactggagt 1861 cgagctcagc gttcacacca ggcgggctgg gtcccctgcc cacctgcctc cctgccccct 1921 gtgacggagg gggcaagcaa gcccccagag gccgtgtctc tggacagacg gatagacata 1981 cggaagcgag gcctggacca agagaggccc aggctactgg aggagtagaa ggatgggccc 2041 cgtggggtcc ccactgcccc ggtacgaggg ggcccaagac cctggcaggt caggggccct 2101 ggccaagcca gatctggagc tgctgctccc tgctgcggag accgcggagg cttcgcgttg 2161 accaagttcc ttaaagaact ggctgatggg gcaggaggtc caggcctggg ctctcgggcc 2221 ctcctagagg gccattggag cttgcagctc agacccccac tttgagtttt atttatttaa 2281 atagtagttg gatgcttggc acgtcgtcct gtaataggaa acccttgcct catcagtttt 2341 cctgatttac aagtgcaata ttttagccaa tgccttggga gaagctgcca tgcaaaggtg 2401 gacaccattc tccagcttca ggggatatgc tcgtcccggg caccggtggc aggcagctgg 2461 ccttctggac taaggcagcc tggggggaca ctgcagtctg gctacacaca gagatctggc 2521 accccctggg tggagtgtcc ctcgggggct ttgggaaagc atggcaccct cagaccacac 2581 agtagccaag ttctggagca aataaaaggc ctgtgttatt tcttgttctt gaaaaaaa // LOCUS HSU27699 3410 bp mRNA PRI 09-FEB-1996 DEFINITION Human pephBGT-1 betaine-GABA transporter mRNA, complete cds. ACCESSION U27699 NID g881474 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3410) AUTHORS Rasola,A., Galietta,L.J., Barone,V., Romeo,G. and Bagnasco,S. TITLE Molecular cloning and functional characterization of a GABA/betaine transporter from human kidney JOURNAL FEBS Lett. 373 (3), 229-233 (1995) MEDLINE 96033979 REFERENCE 2 (bases 1 to 3410) AUTHORS Rasola,A. TITLE Direct Submission JOURNAL Submitted (25-MAY-1995) Andrea Rasola, Laboratorio Di Genetica Molecolare, Istituto Giannina Gaslini, Largo G. Gaslini 5, Genova 16148, Italia FEATURES Location/Qualifiers source 1..3410 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="B18" /map="12p13" /chromosome="12" /tissue_type="kidney inner medulla" CDS 587..2431 /codon_start=1 /product="pephBGT-1 betaine-GABA transporter" /db_xref="PID:g881475" /translation="MDGKVAVQERGPPAVSWVPEEGEKLDQEDEDQVKDRGQWTNKME FVLSVAGEIIGLGNVWRFPYLCYKNGGGAFFIPYFIFFFVCGIPVFFLEVALGQYTSQ GSVTAWRKICPLFQGIGLASVVIESYLNVYYIIILAWALFYLFSSFTSELPWTTCNNF WNTEHCTDFLNHSGAGTVTPFENFTSPVMEFWERRVLGITSGIHDLGSLRWELALCLL LAWVICYFCIWKGVKSTGKVVYFTATFPYLMLVILLIRGVTLPGAYQGIIYYLKPDLF RLKDPQVWMDAGTQIFFSFAICQGCLTALGSYNKYHNNCYKDCIALCFLNSATSFVAG FVVFSILGFMSQEQGVPISEVAESGPGLAFIAFPKAVTMMPLSQLWSCLFFIMLIFLG LDSQFVCVECLVTASIDMFPRQLRKSGRRELLILTIAVMCYLIGLFLVTEGGMYIFQL FDYYASSGICLLFLSLFEVVCISWVYGADRFYDNIEDMIGYRPWPLVKISWLFLTPGL CLATFLFSLSKYTPLKYNNVYVYPPWGYSIGWFLALSSMVCVPLFVVITLLKTRGPFR KRLRHVITPDSSLPQPKQHPCLDGSAGRNFGPSPTREGLIAGEKETHL" BASE COUNT 682 a 990 c 899 g 839 t ORIGIN 1 gtaccggttc ggaattcccg ggtcgaccca cgcgtccgga aggctacaga gagagccagg 61 ttttggtgcc atgcacacag ggaaacttag agttcagaga gggggtgtga tttgcctgac 121 ctcacacagc aagttagaga cccagctcca cgactcattg tcttgctgcc cagagctgct 181 ggctcccctg tttactctga gctgatcgat caccttagca cacagctggc taggagagaa 241 ccatgcagtc acttcggcca cacctgcccg ttgacccttg ctacctcggc aggctttgat 301 cccttctgac ctggaggcca gaggctaggc tgaggtcact cagcagacat caaggacctg 361 ggcagatggg ccggctggga tggtggcgag ctgtacagat aaaaagggac atgaaaatga 421 aaagcccgag cctgagtttt catcacggtt ccactcctga gtggtcttgg gtgaatcact 481 tcatctgcca aggcctggat ttcctcatct gcaaactcag aaaactaagg ctttggccct 541 cgtcatcctg cccacccagc ggggcttccc aacccaccac acagccatgg acgggaaggt 601 ggcagtgcaa gagcgtgggc ctcctgcggt ctcctgggtc cccgaggagg gagagaagtt 661 ggaccaggaa gacgaggacc aggtgaagga tcggggccaa tggaccaaca agatggagtt 721 tgtgctgtca gtggccgggg agatcattgg gctgggcaat gtctggaggt ttccctatct 781 ctgctacaaa aacggaggtg gagccttctt catcccctac ttcatcttct tctttgtctg 841 cggcatcccg gtgttcttcc tggaggtggc gttgggccaa tacaccagcc aagggagtgt 901 cacagcctgg aggaagatct gccccctctt ccagggcatt ggtctggcat ctgtggtcat 961 cgagtcatat ttgaatgtct actacatcat catccttgcc tgggctctct tctacctgtt 1021 cagctccttc acttctgagc tgccctggac gacctgcaac aacttttgga acacagagca 1081 ttgcacggac tttctgaacc actcaggagc cggcacagtg accccatttg agaattttac 1141 ctcacctgtc atggaattct gggagagacg agttctgggc atcacctcgg gcatccatga 1201 cctgggctcc ctgcgctggg agctggccct gtgcctcctg ctcgcctggg tcatctgcta 1261 tttctgcatc tggaaggggg tcaagtccac aggcaaggtg gtttatttca cagccacgtt 1321 tccgtacctg atgcttgtca ttttgctgat cagaggtgtc acccttcccg gagcctacca 1381 gggcatcatc tactacttga agccagattt gttccgcctc aaggaccctc aggtgtggat 1441 ggatgcgggc acccagatct tcttctcctt tgccatctgc caggggtgcc tgacagccct 1501 gggcagctac aacaagtatc acaacaactg ctacaaggac tgcatcgccc tctgcttcct 1561 gaacagtgcc accagctttg tggctgggtt tgttgtcttc tccatcctgg gcttcatgtc 1621 ccaagagcaa ggggtgccca tttctgaagt ggccgagtca ggtcctgggc tggccttcat 1681 cgccttcccc aaggctgtga ctatgatgcc cttatcccag ctgtggtcct gcctgttctt 1741 tatcatgctc atattcctag ggctggacag ccagtttgtc tgtgtggagt gcctggtgac 1801 agcctccata gacatgttcc ccaggcagct ccggaagagc gggcggcgcg agctcctcat 1861 cctcaccatc gccgtcatgt gctacctgat agggcttttc ctggtcaccg agggcgggat 1921 gtacatcttc cagctgtttg actactatgc ttccagtggc atatgcctgc tgttcctgtc 1981 attgtttgaa gtggtctgca taagctgggt gtatggggcg gaccgtttct atgacaacat 2041 tgaggacatg attggctacc ggccatggcc cctggtgaag atctcctggc tcttcctgac 2101 ccctggactt tgcctggcca ctttcctctt ctccttgagc aagtacaccc ccctcaagta 2161 caacaacgtc tatgtgtacc cgccctgggg atactccatt ggctggttcc tggctctgtc 2221 ctccatggtc tgtgtcccac tcttcgtcgt catcaccctc ctgaagactc ggggtccttt 2281 caggaagcgt ctgcgtcacg tcatcacccc tgactccagt ctgccacagc ccaagcaaca 2341 tccctgcttg gatggcagtg ctggccggaa ctttgggccc tccccaacaa gggaaggact 2401 gatagccggg gagaaggaga cccatttgta gggtgtggcc agagcgaggc ggctcctaag 2461 ccgggaacct aggtcagggc caccctccat tctcagcgga cagcctctgc ctctgtctcc 2521 tgccacaatc ctgctgggaa ctctggagag ccacaggcac ccccagctgg aggccagact 2581 cctctcttgt gctagctgga gcagctcctt cccctttgct gataacacca ccactgggac 2641 gtgccatgtt gggacgccac tccctgtgga aggcaccatc gtttttataa aggggggtct 2701 ttttggaggc cgccatctga ttgcaacacc tcgagttatg aggattccac tgtggggatg 2761 cctcttgtta gagcgtactg catttgtaca cggggagagg agctataatt ggaacgcaca 2821 ctgccgtcca atgtggagag cctgatggga caataccctg ttggaagtga caactgaaca 2881 cactgtgttg gatcggaggt tccgttaggg gatccttcct taggcttaac gacagaggca 2941 agcctttgca tgccgtcagt ctggagtttc ctccgagtct ctcatggcat ctccagctcc 3001 tgccctagtt ccgcactgtt cttgcagtgt ttcatcaact cctggagcat tggaatggaa 3061 ggggcttggg agatgattcc tagacttcac aaacactcgg catgcctccc tgcactgtcc 3121 gttcctctgc ccaaggccga tattgctaac tgatcacaga ttctttccca cctcacaatc 3181 ctccgaatgt gctccaggcg acaccatttg ccatcctgct tctaacgcaa acccctgact 3241 tcatggatga ggaacctgga gaccaaagag acaaagggac tttttcaagt tcacatgggg 3301 acccccttct tgggggccag agatatgact aaaaccttat ctccttgtgc tcaggccagt 3361 gtcttcccat taaccccctg ccttagttaa caagtgtgta tggattgcca // LOCUS HSU27768 800 bp mRNA PRI 07-MAR-1996 DEFINITION Human RGP4 mRNA, complete cds. ACCESSION U27768 NID g1216372 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 800) AUTHORS Druey,K.M., Blumer,K.J., Kang,V.H. and Kehrl,J.H. TITLE Inhibition of G-protein-mediated MAP kinase activation by a new mammalian gene family JOURNAL Nature 379 (6567), 742-746 (1996) MEDLINE 96178495 REFERENCE 2 (bases 1 to 800) AUTHORS Druey,K. TITLE Direct Submission JOURNAL Submitted (25-MAY-1995) Kirk Druey, Intramural Research/NIAID/LIR, Rm 11B13, National Institutes of Health, 10 Center Drive, MSC 1876, Bethesda, MD 20892-1876, USA FEATURES Location/Qualifiers source 1..800 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" CDS 98..715 /codon_start=1 /product="RGP4" /db_xref="PID:g1216373" /translation="MCKGLAGLPASCLRSAKDMKHRLGFLLQKSDSCEHNSSHNKKDK VVICQRVSQEEVKKWAESLENLISHECGLAAFKAFLKSEYSEENIDFWISCEEYKKIK SPSKLSPKAKKIYNEFISVQATKEVNLDSCTREETSRNMLEPTITCFDEAQKKIFNLM EKDSYRRFLKSRFYLDLVNPSSCGAEKQKGAKSSADCASLVPQCA" BASE COUNT 241 a 181 c 195 g 183 t ORIGIN 1 gcaacatagc acgttcgcaa agtacgctca aagccgaagc cacagctcct cctgccgcat 61 ttctttcctg cttgcgaatt ccaagctgtt aaataagatg tgcaaagggc ttgcaggtct 121 gccggcttct tgcttgagga gtgcaaaaga tatgaaacat cggctaggtt tcctgctgca 181 aaaatctgat tcctgtgaac acaattcttc ccacaacaag aaggacaaag tggttatttg 241 ccagagagtg agccaagagg aagtcaagaa atgggctgaa tcactggaaa acctgattag 301 tcatgaatgt gggctggcag ctttcaaagc tttcttgaag tctgaatata gtgaggagaa 361 tattgacttc tggatcagct gtgaagagta caagaaaatc aaatcaccat ctaaactaag 421 tcccaaggcc aaaaagatct ataatgaatt catctcagtc caggcaacca aagaggtgaa 481 cctggattct tgcaccaggg aagagacaag ccggaacatg ctagagccta caataacctg 541 ctttgatgag gcccagaaga agattttcaa cctgatggag aaggattcct accgccgctt 601 cctcaagtct cgattctatc ttgatttggt caacccgtcc agctgtgggg cagaaaagca 661 gaaaggagcc aagagttcag cagactgtgc ttccctggtc cctcagtgtg cctaattctc 721 acctgaaggg agagggatga aaagccgaat ctgagatatc catcacactg gggggcgtcg 781 agcatgatct agagggccta // LOCUS HSU28042 3204 bp mRNA PRI 18-NOV-1996 DEFINITION Human DEAD box RNA helicase-like protein mRNA, complete cds. ACCESSION U28042 NID g1142709 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3204) AUTHORS Savitsky,K., Ziv,Y., Bar-Shira,A., Gilad,S., Tagle,D.A., Smith,S., Uziel,T., Sfez,S., Nahmias,J., Sartiel,A., Eddy,R.L., Shows,T.B., Collins,F.S., Shiloh,Y. and Rotman,G. TITLE A human gene (DDX10) encoding a putative DEAD-box RNA helicase at 11q22-q23 JOURNAL Genomics 33 (2), 199-206 (1996) MEDLINE 96301396 REFERENCE 2 (bases 1 to 3204) AUTHORS Savitsky,K., Rotman,G., Ziv,Y., Bar-Shira,A., Gilad,S., Sartiel,A. and Shiloh,Y. TITLE Direct Submission JOURNAL Submitted (25-MAY-1995) Yossi Shiloh, Human Genetics, Tel-Aviv University, Ramat-Aviv, Tel-Aviv 69978, Israel FEATURES Location/Qualifiers source 1..3204 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11q22-23" CDS 66..2693 /note="similar to DEAD box RNA helicases" /codon_start=1 /db_xref="PID:g1142710" /translation="MGKTANSPGSGARPDPVRSFNRWKKKHSHRQNKKKQLRKQLKKP EWQVERESISRLMQNYEKINVNEITRFSDFPLSKKTLKGLQEAQYRLVTEIQKQTIGL ALQGKDVLGAAKTGSGKTLAFLVPVLEALYRLQWTSTDGLGVLIISPTRELAYQTFEV LRKVGKNHDFSAGLIIGGKDLKHEAERINNINILVCTPGRLLQHMDETVSFHATDLQM LVLDEADRILDMGFADTMNAVIENLPKKRQTLLFSATQTKSVKDLARLSLKNPEYVWV HEKAKYSTPATLEQNYIVCELQQKISVLYSFLRSHLKKKSIVFFSSCKEVQYLYRVFC RLRPGVSILALHGRQQQMRRMEVYNEFVRKRAAVLFATDIAARGLDFPAVNWVLQFDC PEDANTYIHRAGRTARYKEDGEALLILLPSEKAMVQQLLQKKVPVKEIKINPEKLIDV QKKLESILAQDQDLKERAQRCFVSYVRSVYLMKDKEVFDVSKLPIPEYALSLGLAVAP RVRFLQKMQKQPTKELVRSQADKVIEPRAPSLTNDEVEEFRAYFNEKMSILQKGGKRL EGTEHRQDNDTGNEEQEEEEDDEEEMEEKLAKAKGSQAPSLPNTSEAQKIKEVPTQFL DRDEEEEDADFLKVKRHNVFGLALKDEKTLQKKDPSNSSIKKKMTKVAEAKKVMKRNF KVNKKITFTDEGELVQQWPQMQKSAIKDAEEDDDTGGINLHKAKERLQEEDKFDKEEY RKKIKAKHREKRLKEREARREANKRQAKAKDEEEAFLDWSDDDDDDDDGFDPSTLPDP DKYRSSEDSDSEDMENKISDTKKKQGMKKRSNSEVEDVGPTSHNRKKARWDTLEPLDT GLSLAEDEELVLHLLRSQS" BASE COUNT 1062 a 612 c 760 g 770 t ORIGIN 1 ccgtgagtct gaccttaggt gtctcgtgtc tggggttgat ccgagctgtc gccgccgccg 61 ccgcaatggg caaaacggcc aactctccgg gttcgggagc ccgacccgac ccggtgcgga 121 gcttcaatcg ctggaagaaa aaacacagcc ataggcagaa caaaaagaag cagttgagga 181 agcaactgaa gaaacccgaa tggcaggtcg agcgcgagag tatcagccgc ctcatgcaga 241 actatgaaaa gataaatgta aatgaaatca caagattttc agattttccc ttgtccaaaa 301 aaacattgaa aggtttgcaa gaagctcagt accgtttggt gactgagata cagaagcaga 361 ccattggatt ggctttgcaa ggtaaagatg tacttggagc ggccaaaact ggatctggca 421 agactctggc ttttcttgtt ccagtgctgg aagccttata tcgtctgcaa tggacttcaa 481 cagatgggct gggggttctc ataatatcac ctacgagaga actggcctat cagacctttg 541 aggttctccg aaaagtagga aagaatcatg acttctcagc tggtctcatc attggtggaa 601 aggatctaaa acacgaagct gagaggatca acaacataaa tatactcgtg tgcacaccag 661 gtcggcttct tcaacacatg gatgaaacag tatcttttca tgctaccgac ctccaaatgt 721 tagttcttga tgaagcagat agaatcttgg atatgggctt tgctgatacc atgaatgctg 781 ttattgaaaa tctccccaag aaacgtcaga ctttactttt ctcagcaaca caaactaaat 841 ctgtaaagga ccttgcacgc ttgagtttga aaaaccctga gtatgtctgg gttcatgaaa 901 aagcaaaata tagcacccct gccactttgg aacagaacta catagtctgt gagctgcagc 961 aaaaaataag tgtgctgtat tcctttttga gaagccatct gaagaagaag agcattgtat 1021 ttttttccag ttgcaaagag gtccagtatc tgtaccgagt gttttgccgg ctacgtcctg 1081 gtgtttctat ccttgcactc catggtcgac agcagcaaat gagaagaatg gaagtctata 1141 atgagtttgt ccgtaagaga gctgcagtac tctttgctac tgatattgca gccaggggtc 1201 tggatttccc ggccgtgaat tgggttcttc agtttgattg tcctgaggat gccaacacat 1261 atattcacag agcaggtaga actgccaggt acaaagagga tggtgaagct ttgctaattt 1321 tgctaccctc agaaaaagct atggtgcagc agcttcttca gaagaaagta cctgtgaagg 1381 aaatcaaaat caatccagaa aaacttatag atgtccagaa aaaattggaa tctattttag 1441 ctcaagatca agatttaaaa gaaagagctc aaaggtgttt cgtctcctat gtacgatctg 1501 tatatctgat gaaggataaa gaagtatttg atgtgagcaa gttacctata cctgaatatg 1561 ccctgtctct tggtcttgct gtggcaccac gcgtaagatt tcttcagaaa atgcagaaac 1621 aacccaccaa agaattggta aggagccaag ccgataaagt aattgagcca agggctccct 1681 ccctcaccaa tgacgaagtg gaagaattta gagcctactt caatgagaaa atgtccatcc 1741 ttcaaaaagg tggaaaaaga ctcgaaggga cagagcacag acaggataat gatactggta 1801 atgaagaaca ggaagaagaa gaagacgatg aagaagaaat ggaagagaaa ctggcaaaag 1861 caaaaggatc tcaagcccca tctcttccta acaccagtga ggcacagaag atcaaggaag 1921 ttcctacaca gttcttggac agagatgagg aggaagaaga tgctgatttc ttgaaggtga 1981 agcggcataa tgtgtttgga ttggccctta aagacgagaa aacattacag aagaaagacc 2041 cttctaactc cagcatcaag aaaaaaatga ccaaagttgc agaagcaaaa aaagtaatga 2101 agagaaattt taaagtgaat aagaagataa catttactga tgaaggggag ttggttcagc 2161 agtggccaca aatgcagaaa tctgccatca aggatgctga ggaagatgat gacacaggtg 2221 gtatcaactt acataaagca aaggaaagac ttcaggaaga ggacaaattt gacaaagaag 2281 aatataggaa aaaaattaag gcaaagcatc gggagaaaag actgaaagaa agggaagcca 2341 gaagagaagc caacaagaga caagcaaagg ccaaagatga agaggaagcc tttctggatt 2401 ggagtgatga tgatgatgat gatgatgatg gatttgatcc aagcacactc ccagatccag 2461 ataaatacag aagctctgaa gattcagata gtgaagatat ggaaaataaa ataagtgata 2521 ccaagaagaa gcaggggatg aagaagagga gcaacagtga agtggaagac gtgggaccaa 2581 caagtcataa cagaaagaag gccaggtggg acactttaga gcctttggat accggcctgt 2641 ctttagcaga ggatgaagag ctggtgttac atctgctaag aagtcaaagc taaatacttc 2701 ctgcgcctgc cttctccttg aaaccttggt tatgactgcg taggcaagaa gttgaaaaac 2761 agttgatttg ggggcactta ggtaccatat gccccattcc caaagggcac atttctggat 2821 agaagcgatc gtatctccaa gtccctctca caggacatgc tttgtgccat cactgagcat 2881 actcagatcg agggtggatg ataccatttc ctgaccccgt tttccagcat gtgttctgtt 2941 agatttttat ccatgggttc ctacgccttg tcattggaaa cactgccttt gtcttactgg 3001 caagttctgg agctcttgtg tcattgttag aaatccctgt cttgcttact gtacagaagt 3061 ttctgttgct gttaaaattg ctcatgattt cgacgtattt aatattttca aagagactat 3121 gatggaccag ccctgagaaa gaatgagtat ttttgaattg agatgatcaa taataaacat 3181 atttcctata aaaaaaaaaa aaaa // LOCUS HSU28043 2584 bp mRNA PRI 10-MAR-1997 DEFINITION Human plasma membrane Na+/H+ exchanger isoform 3 (NHE3) mRNA, complete cds. ACCESSION U28043 NID g971208 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2584) AUTHORS Brant,S.R., Yun,C.H., Donowitz,M. and Tse,C.M. TITLE Cloning, tissue distribution, and functional analysis of the human Na+/N+ exchanger isoform, NHE3 JOURNAL Am. J. Physiol. 269 (1 Pt 1), C198-C206 (1995) MEDLINE 95358265 REFERENCE 2 (bases 1 to 2584) AUTHORS Brant,S.R. TITLE Direct Submission JOURNAL Submitted (30-MAY-1995) Steve R. Brant, Medicine, Gastroenterology Division, The Johns Hopkins University School of Medicine, 720 Rutland Avenue, Baltimore, MD 21205, USA FEATURES Location/Qualifiers source 1..2584 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="23-3, HKC-3, and HKC-5" /clone_lib="lamda gt10 human kidney cortex cDNA library of Graeme I. Bell, The University of Chicago, Chicago, IL" /tissue_type="kidney, cortex" gene 11..2515 /gene="NHE3" CDS 11..2515 /gene="NHE3" /codon_start=1 /evidence=experimental /product="plasma membrane Na+/H+ exchanger isoform 3" /db_xref="PID:g971209" /translation="MWGLGARGPDRGLLLALALGGLARAGGVEVEPGGAHGESGGFQV VTFEWAHVQDPYVIALWILVASLAKIGFHLSHKVTSVVPESALLIVLGLVLGGIVWAA DHIASFTLTPTVFFFYLLPPIVLDAGYFMPNRLFFGNLGTILLYAVVGTVWNAATTGL SLYGVFLSGLMGDLQIGLLDFLLFGSLMAAVDPVAVLAVFEEVHVNEVLFIIVFGESL LNDAVTVVLYNVFESFVALGGDNVTGVDCVKGIVSFFVVSLGGTLVGVVFAFLLSLVT RFTKHVRIIEPGFVFIISYLSYLTSEMLSLSAILAITFCGICCQKYVKANISEQSATT VRYTMKMLASSAETIIFMFLGISAVNPFIWTWNTAFVLLTLVFISVYRAIGVVLQTWL LNRYRMVQLEPIDQVVLSYGGLRGAVAFALVVLLDGDKVKEKNLFVSTTIIVVFFTVI FQGLTIKPLVQWLKVKRSEHREPRLNEKLHGRAFDHILSAIEDISGQIGHNYLRDKWS HFDRKFLSRVLMRRSAQKSRDRILNVFHELNLKDAISYVAEGERRGSLAFIRSPSTDN VVNVDFTPRSSTVEASVSYLLRENVSAVCLDMQSLEQRRRSIRDAEDMVTHHTLQQYL YKPRQEYKHLYSRHELTPTEDEKQDREIFHRTMRKRLESFKSTKLGLNQNKKAAKLYK RERAQKRRNSSIPNGKLPMESPAQNFTIKEKDLELSDTEEPPNYDEEMSGGIEFLASV TKDTASDSPAGIDNPVFSPDEALDRSLLARLPPWLSPGETVVPSQRARTQIPYSPGTF RRLMPFRLSSKSVDSFLQADGPEERPPAALPESTHM" BASE COUNT 455 a 846 c 789 g 494 t ORIGIN 1 gcaggcggca atgtggggac tcggggcccg gggccccgac cgggggctgc tgctggcgct 61 ggcgctgggc gggctggcgc gggccggggg cgtcgaggtg gagcccggcg gcgcgcacgg 121 cgagagcggg ggcttccagg tggtcacctt cgagtgggcc cacgtgcagg atccctacgt 181 catcgcgctc tggatcctcg tggccagctt ggccaagatc gggttccacc tgtcccacaa 241 ggtcaccagc gtggttcccg agagcgccct gctcatcgtg ctgggcctgg tgctgggcgg 301 catcgtctgg gcggccgacc acatcgcgtc cttcacactg acgcccaccg tcttcttctt 361 ctacctgctg ccccccatcg tgctggacgc cggctacttc atgcccaacc gcctcttctt 421 cggcaacctg gggaccatcc tgttgtacgc cgtcgtgggt accgtgtgga acgcggccac 481 caccgggctg tccctctacg gcgtcttcct cagtgggctc atgggcgacc tgcagattgg 541 gctgctggac ttcctcctgt ttggcagcct catggcggct gtggacccgg tggccgtcct 601 ggccgtgttt gaggaggtcc atgtcaacga ggtcctgttc atcatcgtct tcggggagtc 661 gctgctgaac gacgcagtca ccgtggttct gtacaatgtg tttgaatctt tcgtggcgct 721 gggaggtgac aacgtgactg gcgtggactg cgtgaagggc atagtgtcct tcttcgtggt 781 gagcctgggg ggcacgctgg tgggggtggt cttcgccttc ctgctgtcgc tggtgacgcg 841 cttcaccaag catgtgcgta tcatcgagcc cggcttcgtg ttcatcatct cctacctgtc 901 ctacctgacg tccgagatgc tgtcgctgtc ggccatcctc gccatcacct tctgtggcat 961 ctgctgtcag aagtatgtga aggccaacat ctcggagcag tcggccacca ccgtgcgcta 1021 caccatgaag atgctggcca gcagcgccga gaccatcatc ttcatgttcc tgggtatctc 1081 ggccgtgaac ccgttcatct ggacctggaa cacggccttc gtgctcctga cgctggtctt 1141 catctccgtg taccgggcca tcggtgtggt cctgcagacc tggcttctga accgctaccg 1201 catggtgcag ctggagccca ttgaccaggt ggtcctgtcc tacgggggcc tgcgcggggc 1261 cgtggccttt gccctggtgg tgcttctgga tggagacaag gtcaaggaga agaacctgtt 1321 cgtcagcacc accatcatcg tagtgttctt caccgtcatc ttccagggcc tgaccatcaa 1381 gcctctggtg cagtggctga aggtgaagag gagcgagcac cgggaacctc ggctcaacga 1441 gaagctgcac ggccgcgctt tcgaccacat cctctcggcc atcgaggaca tatccggaca 1501 gatcgggcac aattatctca gagacaagtg gtcccacttc gacaggaagt tcctcagcag 1561 ggtcctcatg agacggtcgg cccagaagtc tcgagaccgg atcctgaatg tcttccacga 1621 gctgaacctg aaggatgcca tcagctacgt ggctgaggga gagcgccgcg ggtccctggc 1681 cttcatccgc tcccccagca ccgacaacgt ggtcaacgtg gacttcacgc cacgatcgtc 1741 caccgtggag gcctctgtct cctacctcct gagagaaaat gtcagcgctg tctgcctgga 1801 catgcagtct ctggagcagc gacggcggag catccgggac gcggaggaca tggtcacgca 1861 ccacacgcta cagcagtacc tgtacaagcc gcggcaggag tacaagcatc tgtacagccg 1921 acacgagctc acgcccacgg aggacgagaa acaggaccgg gaaatcttcc acaggaccat 1981 gcggaagcgc ctggagtcct tcaagtcgac caagctgggg ctcaaccaga acaagaaggc 2041 agccaagctg tacaagcggg agcgtgccca gaagcggaga aacagcagca tccccaatgg 2101 gaagctgccc atggagagcc ctgcgcagaa tttcaccatc aaggagaaag acttggaact 2161 ttcagacacc gaggagcccc ccaactatga tgaggagatg agtgggggga tcgagttcct 2221 ggctagtgtc accaaggaca cagcgtccga ctcccctgca ggaattgaca accctgtgtt 2281 ttctccggac gaggccctgg accgcagcct cctggccagg ctgccgccct ggctgtctcc 2341 cggggagacg gtggtcccct cgcagagggc ccgcacgcag attccctact ctcccggcac 2401 cttccgccgc ctgatgccct tccgcctcag cagcaagtcc gtggactcct tcctgcaggc 2461 agacggcccc gaggagcggc cccccgccgc cctccccgag tccacacaca tgtgacaccg 2521 gctccgacac gccgctaacc ggccgctcgt ccccgcgcca cggtccgccc accgccgccg 2581 ccgc // LOCUS HSU28049 2279 bp mRNA PRI 02-AUG-1995 DEFINITION Human TBX2 (TXB2) mRNA, complete cds. ACCESSION U28049 NID g924927 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2279) AUTHORS Campbell,C.E., Goodrich,K., Casey,G. and Beatty,B. TITLE Cloning and mapping of a human gene sharing a highly conserved protein motif with the Drosophila omb gene JOURNAL Genomics (1995) In press REFERENCE 2 (bases 1 to 2279) AUTHORS Campbell,C.E. TITLE Direct Submission JOURNAL Submitted (30-MAY-1995) Christine E. Campbell, Cancer Biology, Research Institute, Cleveland Clinic Foundation, 9500 Euclid Avenue, Cleveland, OH 44195, USA FEATURES Location/Qualifiers source 1..2279 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="fetal kidney" /chromosome="17" /map="17q23" gene 48..2156 /gene="TBX2" CDS 48..2156 /gene="TBX2" /note="similar to DNA binding motif in Drosophila melanogaster lethal(1)optomotor-blind, PIR Accession Number A40213, and in Xenopus laevis brachyury protein, Swiss-Prot Accession Number P24781" /codon_start=1 /product="TBX2" /db_xref="PID:g924928" /translation="MAYHPFHAPRPADFPMSAFLAAAQPSFFPALALPPGALAKPLPD PGLAGAAAAAAAAAAAAEAGLHVSALGPHPPAAHLRSLKSLEPEDEVEDDPKVTLEAK ELWDQFHKLGTEMVITKSGRRMFPPFKVRVSGLDKKAKYILLMDIVAADDCRYKFHNS RWMVAGKADPEMPKRMYIHPDSPATGEQWMAKPVAFHKLKLTNNISDKHGFTILNSMH KYQPRFHIVRANDILKLPYSTFRTYVFPETDFIAVTAYQNDKITQLKIDNNPFAKGFR DTGNGRREKRKQLTLPSLRLYEEHCKPERDGAESDASSCDPPPAREPPTSPGAAPSPL RLHRARAEEKSCAADSDPEPERLSEERARAPLGRSPAPDSASPTRLTEPERARERRCP ERGKEPAESGGDGPFGLRSLEKERPEARRKDEGRKEAAEGKEQGLAPLVVQTDSASPL GAGHLPGLAFSSHLHGQQFFGPLGAGQPLFLHPGQFTMGPGAFSAMGMGHLLASVAGG GNGGGGGPGTAAGLDAGGLGPAASAASTAAPFPFHLSQHMLASQGIPMPTFGGLFPYP YTYMAAAAAAASALPATSAAAAAAAAAGSLSRSPFLGSARPRLRFSPYQIPVTIPPST SLLTTGLASEGSKAAGGNSREPSPLPELALRKVGAPSRGALSPSGSAKEAANELLSIQ RLVSGLESQRALSPGRESPK" 3'UTR 2157..>2279 BASE COUNT 400 a 822 c 738 g 319 t ORIGIN 1 cctgggccgg atgtcccgat gagagagccg cgctgacggc cagcgccatg gcttaccacc 61 cgttccacgc gccacggccc gccgacttcc ccatgtccgc ctttctggcg gcggcgcagc 121 cctccttctt cccggcactc gcgctgccgc ccggcgcgct ggccaagccg ctgcccgacc 181 cgggcctggc gggggcggcg gccgcggcgg cggcggcggc agcagcggcc gaggcggggc 241 tgcacgtctc ggcactgggc ccgcacccgc ccgccgcgca tctgcgctcc ctcaagagcc 301 tggagcccga ggacgaggtg gaggacgacc ccaaggtgac gctggaggcc aaggagctgt 361 gggaccagtt ccacaagcta ggcacggaga tggtcatcac caagtccggg aggcggatgt 421 tccccccctt caaggtgcga gtcagcggcc tggacaagaa ggccaagtat atcctgctga 481 tggacattgt agccgctgac gattgccgct ataagttcca caactcgcgc tggatggtgg 541 cgggcaaggc cgaccctgag atgcccaaac gcatgtacat ccacccagac agcccagcca 601 cgggggagca gtggatggct aagcctgtgg ccttccacaa gctgaagctg accaacaaca 661 tctctgacaa gcacggcttc accatcctaa actccatgca caagtaccag ccgcgattcc 721 acatagtgcg agccaacgac atcctgaagc tgccttacag caccttccgc acctacgtgt 781 tcccggagac cgacttcatc gccgtcactg cctaccagaa tgacaagatc acacagctga 841 agatcgacaa caacccgttt gccaagggct tccgggacac cgggaacggc cggcgggaga 901 aaaggaagca gctgacgctg ccgtctctac gcttgtacga ggagcactgc aaacccgagc 961 gcgatggcgc ggagtcagac gcctcgtcgt gcgaccctcc ccccgcgcgg gaaccaccca 1021 cctccccggg cgcagcgccc agtccgctgc gcctgcaccg ggcccgagct gaggagaagt 1081 cgtgcgccgc ggacagcgac ccggagcctg agcggttgag cgaggagcgt gcgcgggcgc 1141 cgctaggccg cagcccggct ccagacagcg ccagccccac tcgcttgacc gaacccgagc 1201 gcgcccggga gcggcgttgt cccgagaggg gcaaggagcc ggccgagagc ggcggggacg 1261 gcccgttcgg cctgaggagc ctggagaagg agcgccccga agctcggagg aaggacgagg 1321 ggcgcaagga ggcggccgag ggcaaggagc agggcctggc gccgctggtg gtgcagacag 1381 acagtgcgtc ccccctgggc gccggacacc tgcccggcct ggccttttcc agccacttgc 1441 acgggcagca gttctttggg ccgctgggag ccggccagcc gctcttcctg caccctggac 1501 agttcaccat gggccctggc gccttctccg ccatgggcat gggtcaccta ctggcctcgg 1561 tggcaggcgg cggcaacggc ggaggtggcg ggcctgggac cgccgcgggg ctggacgcag 1621 gcgggctggg tcccgcggcc agcgcagcaa gcaccgccgc gcccttcccg ttccacctct 1681 cccagcacat gctggcatct cagggaattc caatgcccac tttcggaggc ctcttcccct 1741 acccctacac ctacatggca gcagcagccg cagccgcctc ggctttgccc gccactagtg 1801 ctgcagctgc cgccgccgca gccgccggct ccctctcccg gagccccttc ctgggcagtg 1861 cccggccccg actgcgtttc agcccctatc agatcccggt caccatcccg cctagcacta 1921 gcctcctcac caccgggctg gcctctgagg gctccaaggc cgctggtgga aacagccggg 1981 agcctagccc cctgcccgag ctggctctcc gcaaagtagg ggccccatcc cgcggtgccc 2041 tgtcgcccag tggctcggcc aaggaggcgg ccaatgaact gctgagcatc cagagactgg 2101 tgagtgggct ggagagccag cgagccctct ccccaggccg ggagtcgccc aagtgagggg 2161 ctgcccagct gctcccctgc cacgcaggcc acccgggctg cctgcccctg ctgcttggga 2221 cgtgtacagc acagaatgag tatttattta aataaaggag aaaagtgggc tgcagccgg // LOCUS HSU28369 2919 bp mRNA PRI 14-AUG-1996 DEFINITION Human semaphorin V mRNA, complete cds. ACCESSION U28369 NID g974283 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2919) AUTHORS Sekido,Y., Bader,S., Latif,F., Chen,J.Y., Duh,F.M., Wei,M.H., Albanesi,J.P., Lee,C.C., Lerman,M.I. and Minna,J.D. TITLE Human semaphorins A(V) and IV reside in the 3p21.3 small cell lung cancer deletion region and demonstrate distinct expression patterns JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (9), 4120-4125 (1996) MEDLINE 96210603 REFERENCE 2 (bases 1 to 2919) AUTHORS Sekido,Y. TITLE Direct Submission JOURNAL Submitted (02-JUN-1995) Yoshikata Sekido, Simmons Cancer Center, University of Texas Southwestern Medical Center, 5323 Harry Hines Blvd., Dallas, TX 75235-8590, USA FEATURES Location/Qualifiers source 1..2919 /organism="Homo sapiens" /db_xref="taxon:9606" /map="3p21.3" /chromosome="3" CDS 236..2485 /codon_start=1 /product="semaphorin V" /db_xref="PID:g974284" /translation="MGRAGAAAVIPGLALLWAVGLGSAAPSPPRLRLSFQELQAWHGL QTFSLERTCCYQALLVDEERGRLFVGAENHVASLNLDNISKRAKKLAWPAPVEWREEC NWAGKDIGTECMNFVKLLHAYNRTHLLACGTGAFHPTCAFVEVGHRAEEPVLRLDPGR IEDGKGKSPYDPRHRAASVLVGEELYSGVAADLMGRDFTIFRSLGQRPSLRTEPHDSR WLNEPKFVKVFWIPESENPDDDKIYFFFRETAVEAAPALGRLSVSRVGQICRNDVGGQ RSLVNKWTTFLKARLVCSVPGVEGDTHFDQLQDVFLLSSRDHRTPLLYAVFSTSSSIF QGSAVCVYSMNDVRRAFLGPFAHKEGPMHQWVSYQGRVPYPRPGMCPSKTFGTFSSTK DFPDDVIQFARNHPLMYNSVLPTGGRPLFLQVGANYTFTQIAADRVAAADGHYDVLFI GTDVGTVLKVISVPKGSRPSAEGLLLEELHVFEDSAAVTSMQISSKRHQLYVASRSAV AQIALHRCAAHGRVCTECCLARDPYCAWDGVACTRFQPSAKRRFRRQDVRNGDPSTLC SGDSSRPALLEHKVFGVEGSSAFLECEPRSLQARVEWTFQRAGVTAHTQVLAEERTER TARGLLLRRLRRRDSGVYLCAAVEQGFTQPLRRLSLHVLSATQAERLARAEEAAPAAP PGPKLWYRDFLQLVEPGGGGSANSLRMCRPQPALQSLPLESRRKGRNRRTHAPEPRAE RGPRSATHW" BASE COUNT 531 a 926 c 967 g 495 t ORIGIN 1 tctgtgattg tggccaggcg gggcaccctc ggaggggagg gttcggaagt ggaatgcgac 61 cccccagcct ctttccccta ggggctgtaa tctgatccct ggggactccc cccctagcct 121 cccgccctcg ccctcactgc tgactcctct tccagatcct ggggcagagt ccagggcagc 181 tcaaggctcc tccacacaca cacccgctga accctgagca ccctgagctg ctgagatggg 241 gcgggccggg gctgccgccg tgatcccggg cctggccctg ctctgggcag tggggctggg 301 gagtgccgcc cccagccccc cacgccttcg gctctccttc caagagctcc aggcctggca 361 tggtctccag actttcagcc tggagcgaac ctgctgctac caggccttgc tggtggatga 421 ggagcgtgga cgcctgtttg tgggtgccga gaaccatgtg gcctccctca acctggacaa 481 catcagcaag cgggccaaga agctggcctg gccggcccct gtggaatggc gagaggagtg 541 caactgggca gggaaggaca ttggtactga gtgcatgaac ttcgtgaagt tgctgcatgc 601 ctacaaccgc acccatttgc tggcctgtgg cacgggagcc ttccacccaa cctgtgcctt 661 tgtggaagtg ggccaccggg cagaggagcc cgtcctccgg ctggacccag gaaggataga 721 ggatggcaag gggaagagtc cttatgaccc caggcatcgg gctgcctccg tgctggtggg 781 ggaggagcta tactcagggg tggcagcaga cctcatggga cgagacttta ccatctttcg 841 cagcctaggg caacgtccaa gtctccgaac agagccacac gactcccgct ggctcaatga 901 gcccaagttt gtcaaggtat tttggatccc ggagagcgag aacccagacg acgacaaaat 961 ctacttcttc tttcgtgaga cggcggtaga ggcggcgccg gcactgggac gcctgtccgt 1021 gtcccgcgtt ggccagatct gccggaacga cgtgggcggc cagcgcagcc tggtcaacaa 1081 gtggacgacg ttcctgaagg cgcggctggt gtgctcggtg cccggcgtcg agggcgacac 1141 ccacttcgat cagctccagg atgtgtttct gttgtcctcg cgggaccacc ggaccccgct 1201 gctctatgcc gtcttctcca cgtccagcag catcttccag ggctctgcgg tgtgcgtgta 1261 cagcatgaac gacgtgcgcc gggccttctt gggacccttt gcacacaagg aggggcccat 1321 gcaccagtgg gtgtcatacc agggtcgcgt cccctacccg cggccaggca tgtgccccag 1381 caagaccttt ggcaccttca gttccaccaa ggacttccca gacgatgtca tccagtttgc 1441 gcggaaccac cccctcatgt acaactctgt cctgcccact ggggggcgcc ctcttttcct 1501 acaagttgga gccaattaca ccttcactca aattgccgcg gaccgggttg cagccgctga 1561 cggacactat gacgtcctct tcattggcac agacgttggc acggtgctga aggtgatctc 1621 ggtccccaag ggcagtaggc ccagcgcaga ggggctgctc ctggaggagc tgcacgtgtt 1681 tgaggactcg gccgctgtca ccagcatgca aatttcttcc aagaggcacc agctgtacgt 1741 agcctcgcgg agcgcggtgg cccagatcgc gttgcaccgc tgcgctgccc acggccgcgt 1801 ctgcaccgaa tgctgtctgg cgcgtgaccc ctactgcgcc tgggacgggg tcgcgtgcac 1861 gcgcttccag cccagtgcca agaggcggtt ccggcggcaa gacgtaagga atggcgaccc 1921 cagcacgttg tgctccggag actcgtctcg tcccgcgctg ctggaacaca aggtgttcgg 1981 cgtggagggc agcagcgcct ttctggagtg tgagccccgc tcgctgcagg cgcgcgtgga 2041 gtggactttc cagcgcgcag gggtgacagc ccacacccag gtgctggcag aggagcgcac 2101 cgagcgcacc gcccggggac tactgctgcg caggctgcgg cgccgggact cgggcgtgta 2161 cttgtgcgcc gccgtcgagc agggctttac gcaaccgctg cgtcgcctgt cgctgcacgt 2221 gttgagtgct acgcaggccg aacgactggc gcgggccgag gaggctgcgc ccgccgcgcc 2281 gccgggcccc aaactctggt accgggactt tctgcagctg gtggagccgg gcggaggtgg 2341 cagcgcgaac tccctgcgca tgtgccgccc gcagcctgcg ctgcagtcac tgcccctgga 2401 gtcgcggaga aagggccgta accggaggac ccacgcccct gagcctcgcg ctgagcgggg 2461 gccgcgcagc gcaacgcact ggtgaccaga ctgtccccac gccgggaacc aagcaggaga 2521 cgacaggcga gagaggagcc agacagaccc tgaaaagaag gacgggttgg ggccgggcac 2581 attgggggtc accggccgat ggagacacca accgacaggc cctggctgag ggcagctgcg 2641 cgggcttatt tattaacagg ataacccttg aatgtagcag ccccgggagg gcggcacagg 2701 tcgggcgcag gattcagccg gagggaaggg acggggaagc cgagctccag agcaacgacc 2761 agggccgagg aggtgcctgg agtgcccacc ctgggagaca gaccccacct ccttgggtag 2821 tgagcagtga gcagaaagct gtgaacaggc tgggctgctg gaggtggggc gaggcaggcc 2881 gactgtacta aagtaacgca ataaacgcat tatcagcca // LOCUS HSU28389 2727 bp mRNA PRI 24-OCT-1995 DEFINITION Human dematin 52 kDa subunit mRNA, complete cds. ACCESSION U28389 NID g899540 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2727) AUTHORS Azim,A.C., Knoll,J.H., Beggs,A.H. and Chishti,A.H. TITLE Isoform cloning, actin binding, and chromosomal localization of human erythroid dematin, a member of the villin superfamily JOURNAL J. Biol. Chem. 270 (29), 17407-17413 (1995) MEDLINE 95340535 REFERENCE 2 (bases 1 to 2727) AUTHORS Chishti,A.H. TITLE Direct Submission JOURNAL Submitted (05-JUN-1995) Athar H. Chishti, Biomedical Research, St. Elizabeths Medical Center, 736 Cambridge Street, Boston, MA 02135, USA FEATURES Location/Qualifiers source 1..2727 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="8" /cell_type="blood reticulocytes" CDS 451..1668 /note="; similar to product encoded by dematin 48 kDa subunit gene, GenBank Accession Number L19713" /codon_start=1 /product="dematin 52 kDa subunit" /db_xref="PID:g899541" /translation="MERLQKQPLTSPGSVSPSRDSSVPGSPSSIVAKMDNQVLGYKDL AAIPKDKAILDIERPDLMIYEPHFTYSLLEHVELPRQRERSLSPKSTSPPPSPEVWAD SRSPGIISQASAPRTTGTPRTSLPHFHHPETSRPDSNIYKKPPIYKQRESVGGSPQTK HLIEDLIIESSKFPAAQPPDPNQPAKIETDYWPCPPSLAVVETEWRKRKASRRGAEEE EEEEDDDSGEEMKALRERQREELSKVTSNLGKMILKEEMEKSLPIRRKTRSLPDRTPF HTSLHQGTSKSSSLPRYGRTTLSRLQSTEFSPSGSETGSPGLQNGEGQRGRMDRGNSL PCVLEQKIYPYEMLVVTNKGRTKLPPGVDRMRLERHLSAEDFSRVFAMSPEEFGKLAL WKRNELKKKASLF" polyA_site 2727 /note="29 A nucleotides" BASE COUNT 561 a 890 c 766 g 510 t ORIGIN 1 cacgagaaga caggaggaag aaagggagag agggccaggc agtcgcactg tgaacagaac 61 aggagaaggc gaagcggggc aaagttccct gcccaccgac gccagcctgc ttggatgact 121 tgcctcgttt cataattcac ttactgtctg caccagccgg cctcagcctg gctggaccct 181 gctgcctgtg tggcccggag ccagaggccc ccacactccc agctgctctt ctacagatgc 241 catcaacgag caggactctg ggtggctcca ctgtctaagc ctggagagtc accgccgagg 301 gatgaggacg cgccagcccg ggggaacgcg ccagctgctt tcgcggcccc aagcgcgcag 361 tgcccagcag ccgcgccgag cctgacacgc tgtcctctcc cctcgcgcac agggctctgc 421 gagtgacccg gcgggcgagc tccgtgctgc atggaacggc tgcagaagca accacttacc 481 tcccccggga gcgtgagccc ctcccgagat tccagtgtgc ctggctctcc ctccagcatc 541 gtggccaaga tggacaatca ggtgctgggc tacaaggacc tggctgccat ccccaaggac 601 aaggccatcc tggacatcga gcggcccgac ctcatgatct acgagcctca cttcacttat 661 tccctcctgg aacacgtgga gctgcctcgc cagcgcgagc gctcgctgtc acccaaatcc 721 acatcccccc caccatcccc agaggtgtgg gcggacagcc ggtcgcctgg aatcatctct 781 caggcctcgg cccccagaac cactggaacc ccccggacca gcctgcccca tttccaccac 841 cctgagacct cccgcccaga ttccaacatc tacaagaagc ctcccatcta taagcagaga 901 gagtccgtgg gaggcagccc tcagaccaag cacctcatcg aggatctcat catcgagtca 961 tccaagtttc ctgcagccca gcccccagac cccaaccagc cagccaaaat cgaaaccgac 1021 tactggccat gccccccgtc tctggctgtt gtggagacag aatggaggaa gcggaaggcg 1081 tctcggaggg gagcagagga agaggaggag gaggaagatg acgactctgg agaggagatg 1141 aaggctctca gggagcgtca gagagaggaa ctcagtaagg ttacttccaa cttgggaaag 1201 atgatcttga aagaagagat ggaaaagtca ttgccgatcc gaaggaaaac ccgctctctg 1261 cctgaccgga cacccttcca tacctccttg caccagggaa cgtctaaatc ttcctctctc 1321 ccccgctatg gcaggaccac cctgagccgg ctacagtcca cagagttcag cccatcaggg 1381 agtgagactg gaagcccagg cctgcagaac ggagagggcc agagggggag gatggaccgg 1441 gggaactccc tgccctgtgt gctggagcag aagatctatc cctatgaaat gctagtggtg 1501 accaacaagg ggcgaaccaa gctgccaccg ggggtggatc ggatgcggct tgagaggcat 1561 ctgtctgccg aggacttctc aagggtattt gccatgtccc ctgaagagtt tggcaagctg 1621 gctctgtgga agcggaatga gctcaagaag aaggcctctc tcttctgatg gcccccacct 1681 gctccgggac ggccccctta cccctgctgc ttcagggttt ttccccggcg ggttgggagg 1741 ggcaggaggt ggggtggaaa tagggtgggc tcctttcctc aggtagagtg gggggccaaa 1801 acctctgcag tccccggcag tgagctatgg actttcttcc ccctcacgag gctgggggcc 1861 tcctgctctc gtccctggcc ctccctgtac agggcaaagc cagtctgggc tctggcacac 1921 agagttcatg tttgccgccc tctccctgcc cctcacccca gaggtgagag gaatgagggg 1981 cattggtggt taggccggtt ggctgtcttg aacagctgga gggaagatgc aggggtggga 2041 agcggccagg cagaaagagc tccaggctct tgtgtcgccc acccagccct cccatactca 2101 ctcctgacag ctttcctgca ctgcagcttc ctgctcctct gactctagtg ggaacaggcc 2161 ccagctcagc ctccgcgagg gaggtcaccc ctccacttca gcttgccctg acctccgctc 2221 gcaaaccccg agcttccaag ccttttgctc cagccctgcg gcttccccag aagcctgggc 2281 ttagggtgga gatgccgcct acccgatcct ggccctccac ctgcctccag gccacgaaat 2341 gggaattcca gcactaagcc aggcaccggg cagaagctgg gccttccgcc tcccttggat 2401 ggggtcaaga ggccaggcct ggcacatttt ggagtgtcct ggctaccagc tctcacacct 2461 acacccacgc accccctcac acactatgct ctctcaagaa tgtaatttat tggggccccc 2521 ccagctgctt tcctcacctg cccctgccct accttacacc cccagcttga cttctttcca 2581 gtccacgtgg atataatgat atctatattt ttgcccaggt ctgggtattg ctcctgccca 2641 gaccctgaca tccctttcca ctgtgtgtgt gaccatgctg ggggaggggg actctgcttg 2701 gaattaaaag gttgctttgg gtcccta // LOCUS HSU28413 2011 bp mRNA PRI 02-DEC-1995 DEFINITION Human Cockayne syndrome complementation group A CSA protein (CSA) mRNA, complete cds. ACCESSION U28413 NID g975301 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2011) AUTHORS Henning,K.A., Li,L., Iyer,N., McDaniel,L.D., Reagan,M.S., Legerski,R., Schultz,R.A., Stefanini,M., Lehmann,A.R., Mayne,L.V. and Friedberg,E.C. TITLE The Cockayne syndrome group A gene encodes a WD repeat protein that interacts with CSB protein and a subunit of RNA polymerase II TFIIH JOURNAL Cell 82 (4), 555-564 (1995) MEDLINE 95393468 REFERENCE 2 (bases 1 to 2011) AUTHORS Henning,K.A. TITLE Direct Submission JOURNAL Submitted (05-JUN-1995) Karla A. Henning, Laboratory of Gene Transfer, National Center for Human Genome Research, NIH, 9000 Rockville Pike, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..2011 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pCSA5" /clone_lib="9N fraction 2 library of R. Legerski" gene 37..1227 /gene="CSA" CDS 37..1227 /gene="CSA" /note="Cockayne syndrome group A" /codon_start=1 /function="WD-repeat protein" /product="CSA protein" /db_xref="PID:g975302" /translation="MLGFLSARQTGLEDPLRLRRAESTRRVLGLELNKDRDVERIHGG GINTLDIEPVEGRYMLSGGSDGVIVLYDLENSSRQSYYTCKAVCSIGRDHPDVHRYSV ETVQWYPHDTGMFTSSSFDKTLKVWDTNTLQTADVFNFEETVYSHHMSPVSTKHCLVA VGTRGPKVQLCDLKSGSCSHILQGHRQEILAVSWSPRYDYILATASADSRVKLWDVRR ASGCLITLDQHNGKKSQAVESANTAHNGKVNGLCFTSDGLHLLTVGTDNRMRLWNSSN GENTLVNYGKVCNNSKKGLKFTVSCGCSSEFVFVPYGSTIAVYTVYSGEQITMLKGHY KTVDCCVFQSNFQELYSGSRDCNILAWVPSLYEPVPDDDETTTKSQLNPAFEDAWSSS DEEG" polyA_signal 1986..1991 polyA_site 2011 BASE COUNT 596 a 368 c 413 g 634 t ORIGIN 1 cgacgtccag tgctccagcc ggtgtgagga cacgatatgc tggggttttt gtccgcacgc 61 caaacgggtt tggaggaccc tcttcgcctt cggagagcag agtcaacacg gagagttttg 121 ggactggaat taaataaaga cagagatgtt gaaagaatcc acggcggtgg aattaacacc 181 cttgacattg aacctgttga agggagatac atgttatcag gtggttcaga tggtgtgatt 241 gtactttatg accttgagaa ctccagcaga caatcttatt acacatgtaa agcagtgtgt 301 tccattggca gagatcatcc tgatgttcac agatacagtg tggagactgt acagtggtat 361 cctcatgaca ctggcatgtt cacatcaagc tcatttgata aaactctgaa agtatgggat 421 acaaatacat tacaaactgc agatgtattt aattttgagg aaacagttta tagtcatcat 481 atgtctccag tctccaccaa gcactgtttg gtagcagttg gtactagagg acccaaagta 541 caactttgtg acttgaagtc tggatcctgt tctcacattc tacagggtca cagacaagaa 601 atattagcag tttcctggtc tccacgttat gactatatct tggcaacagc aagtgctgac 661 agtagagtaa aattatggga tgtgagaaga gcatcaggat gtttgattac tcttgatcaa 721 cataatggga aaaagtcaca agctgttgaa tcagcaaaca ctgctcataa tgggaaagtt 781 aatggcttat gttttacaag tgatggactt cacctcctca ctgttggtac agataatcga 841 atgaggctct ggaatagttc caatggagaa aacacacttg tgaactatgg aaaagtttgt 901 aataacagta aaaaaggatt gaaattcact gtctcctgtg gctgcagttc agaatttgtt 961 tttgtaccat atggtagcac cattgctgtt tatacagttt actcaggaga acagataact 1021 atgcttaagg gacattataa aactgttgac tgctgtgtat ttcagtcaaa tttccaggaa 1081 ctttatagtg gtagcagaga ctgcaacatt ctggcttggg ttccatcctt atatgaacca 1141 gttcctgatg atgatgagac tacaacaaaa tcacaattaa atccggcctt tgaagatgcc 1201 tggagcagca gtgatgaaga aggatgaata tcatctttag tacctttttg tctctgctga 1261 aactttttaa atgagactgt gtttttttca actgtatggt ctattcctga cagctaaatt 1321 agccctaaat gcgggtaata tttttcctca tgttttaaaa tgaggttaat atttgcataa 1381 aatcctaaaa cagacttctg tatagtttat ttagtcaaaa tgtgttcctt gatcccagat 1441 gttgtggcct gggaaagccc tcattgctac agtacaagta acacaagtcg ttgtacctca 1501 gttgtgacct tcagcagatt ttatgaacta taagatgcag tctcagagga tcagcaagtg 1561 gaggccatca gtattgactt tctcttactt gctgtactat cagcctgctc gtttccacct 1621 ttaagaatga ttttgccaag aatgattata tcaaaaatag tagttgaaat ggtaacatca 1681 aaattatttt attctttctt cttcatgtat tcacattttt cagtggtttc atttaattaa 1741 ccatgcttta tgttaaacat tttggggctc aatgtctcct actatccaaa atgtgcatca 1801 caggaggctc ttaactttgt gaaaatccca tgtttgcttt attttatttt aatgtcagaa 1861 ggcagtttgc gctaatgctt gaactctttt tctgtgaaac tcattaaggt atgaccaaat 1921 cctgcctcat taattcaagc agaaaatatc ctggcaggga atctggctta aacatgaaat 1981 gctgtaataa aatttctatg ttattgtctc a // LOCUS HSU28424 2205 bp mRNA PRI 05-JUN-1996 DEFINITION Human protein kinase inhibitor p58 mRNA, complete cds. ACCESSION U28424 NID g1353269 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2205) AUTHORS Korth,M.J., Lyons,C.N., Wambach,M. and Katze,M.G. TITLE Cloning, expression, and cellular localization of the oncogenic 58-kDa inhibitor of the RNA-activated human and mouse protein kinase JOURNAL Gene 170 (2), 181-188 (1996) MEDLINE 96235132 REFERENCE 2 (bases 1 to 2205) AUTHORS Korth,M.J. TITLE Direct Submission JOURNAL Submitted (05-JUN-1995) Marcus J. Korth, Microbiology, University of Washington, Box 357242, Seattle, WA 98195-7242, USA FEATURES Location/Qualifiers source 1..2205 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 691..2205 /function="inhibitor of the interferon-induced, dsRNA-activated protein kinase (PKR)" /codon_start=1 /product="p58" /db_xref="PID:g1353270" /translation="MVAPGSVTSRLGSVFPFLLVLVDLQYEGAECGVNADVEKHLELG KKLLAAGQLADALSQFHAAVDGDPDNYIAYYRRATVFLAMGKSKAALPDLTKVIQLKM DFTAARLQRGHLLLKQGKLDEAEDDFKKVLKSNPSENEEKEAQSQLIKSDEMQRLRSQ ALNAFGSGDYTAAIAFLDKILEVCVWDAELRELRAECFIKEGEPRKAISDLKAASKLK NDNTEAFYKISTLYYQLGDHELSLSEVRECLKLDQDHKRCFAHYKQVKKLNKLIESAE ELIRDGRYTDATSKYESVMKTEPSIAEYTVRSKERICHCFSKDEKPVEAIRVCSEVLQ MEPDNVNALKDRAEAYLIEEMYDEAIQDYETAQEHNENDQQIREGLEKAQRLLKQSQK RDYYKILGVKRNAKKQEIIKAYRKLALQWHPDNFQNEEEKKKAEKKFIDIAAAKEVLS DPEMRKKFDDGEDPLDAESQQGGGGNPFHRSWNSWQGFNPFSSGGPFRFKFHFN" BASE COUNT 691 a 498 c 490 g 526 t ORIGIN 1 aaattattca caaaaagcag ctggaaatca gaaaggcagc tgccgctcca gggtctcaaa 61 tccattaaaa ccaccacaga acaattaaaa caatcagaga gagaaaagga aaaggaaaca 121 acgtccaaca tttggcattt ggttttatcc acaggttttc ccagagctcc tcttggaatt 181 gaaagcaaaa tatattggca aacactgctt ctcaatacac aggaagggga gcctcagtgc 241 ctcctgccaa cagaggcggc tctgtgcagt cagactgtgg ggtgcacagc agtctctcag 301 gctggggttt ctggcctctt tccatccaat caccccttcc aggaaattca gcagcctacc 361 agacactggg cctctgcctg gctgcccttt cccccaggca gagcggcctt gggcataata 421 ccaggcaacc tgccagcccc agagcagaaa ccctgctggg atcccttcct tcatcaacct 481 tggtgtggag gatgctcaat agcgcttcca cctccccctc cccagcccct agctacatct 541 ctccaccctt ccctggacag tcctacttcc cagctcaccc acccaccagc agcttaagtc 601 tgggtgggat ctatcatcag ctcctccccc tgtaacctct tccctccaca gatccaccct 661 gtgactcctc ttcactcgcg agcctcggac atggtggccc ccggctccgt gaccagccgg 721 ctgggctcgg tattcccctt cctgctagtc ctggtggatc tgcagtacga aggtgctgaa 781 tgtggagtaa atgcagatgt tgagaaacat cttgaattgg gcaagaaatt acttgcagct 841 ggacagctag ctgatgcttt atctcagttt catgctgccg tagatggtga ccctgataac 901 tatattgctt attatcggag ggctactgtc tttttagcta tgggcaaatc aaaagctgca 961 cttcctgatt taactaaagt gattcaattg aagatggact tcactgcagc aagattacag 1021 agaggtcact tattactcaa acaaggaaaa cttgatgaag cagaagatga ttttaaaaaa 1081 gtgctcaaat ctaatccaag tgaaaatgaa gaaaaggaag cacagtctca acttataaaa 1141 tctgatgaaa tgcagcgttt gcgttcacaa gcacttaacg cttttggaag tggagattat 1201 actgctgcta tagccttcct tgataagatt ttagaggttt gtgtttggga tgcagaacta 1261 cgggaacttc gagctgaatg ttttataaaa gaaggagaac ctaggaaagc tataagtgac 1321 ttaaaagctg cgtcaaagtt gaagaatgat aatactgaag cgttttataa aataagcaca 1381 ctgtactacc aactaggaga ccacgaactg tccctcagtg aagttcggga atgtcttaaa 1441 cttgaccagg atcataaaag gtgttttgca cactataaac aagtaaagaa acttaataag 1501 ctgattgagt cagctgaaga gctcatcaga gatggcagat acacagatgc taccagcaaa 1561 tatgaatctg tcatgaaaac agagccaagc attgctgaat atacagttcg ttcaaaggag 1621 aggatttgcc actgcttttc taaggacgag aagcctgttg aagctattag ggtttgttct 1681 gaagttttac agatggaacc tgacaatgtg aatgccctga aagatcgagc tgaggcctat 1741 ttgatagagg aaatgtatga tgaagctatt caggattatg aaactgctca ggaacacaat 1801 gaaaatgatc agcagattcg agaaggtcta gagaaagcac aaagattatt gaaacagtcg 1861 cagaaacgag attattataa aatcttggga gtaaaaagaa atgccaaaaa gcaagaaatt 1921 attaaagcat accgaaaatt agcactgcag tggcacccag ataacttcca gaatgaagaa 1981 gaaaagaaaa aagctgagaa aaagttcatt gatatagcag ctgctaaaga agtcctctct 2041 gatccagaaa tgagaaagaa gtttgacgac ggagaagatc ctttggatgc agagagccag 2101 caaggaggcg gcggcaaccc tttccacaga agctggaact catggcaagg gttcaatccc 2161 ttcagctcag gcggaccatt tagatttaaa ttccacttca attaa // LOCUS HSU28480 924 bp mRNA PRI 17-JAN-1996 DEFINITION Human uncoupling protein (UCP) mRNA, complete cds. ACCESSION U28480 NID g1155218 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 924) AUTHORS Bouillaud,F., Ricquier,D. and Raimbault,S. TITLE Sequence of the cDNA coding for the human uncoupling protein UCP JOURNAL Unpublished (1995) REFERENCE 2 (bases 1 to 924) AUTHORS Bouillaud,F. TITLE Direct Submission JOURNAL Submitted (06-JUN-1995) Frederic Bouillaud, CEREMOD, CNRS, 9 rue Jules Hetzel, Meudon, Bellevue 92190, France FEATURES Location/Qualifiers source 1..924 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brown adipose tissue" /dev_stage="infant" gene 1..924 /gene="UCP" CDS 1..924 /gene="UCP" /codon_start=1 /product="uncoupling protein" /db_xref="PID:g1155219" /translation="MGGLTASDVHPTLGVQLFSAGIAACLADVITFPLDTAKVRLQVQ GECPTSSVIRYKGVLGTITAVVKTEGRMKLYSGLPAGLQRQISSASLRIGLYDTVQEF LTAGKETAPSLGSKILAGLTTGGVAVFIGQPTEVVKVRLQAQSHLHGIKPRYTGTYNA YRIIATTEGLTGLWKGTTPNLMRSVIINCTELVTYDLMKEAFVKNNILADDVPCHLVS ALIAGFCATAMSSPVDVVKTRFINSPPGQYKSVPNCAMKVFTNEGPTAFFKGLVPSFL RLGSWNVIMFVCFEQLKRELSKSRQTMDCAT" BASE COUNT 249 a 228 c 240 g 207 t ORIGIN 1 atggggggcc tgacagcctc ggacgtacac ccgaccctgg gggtccagct cttctcagct 61 ggaatagcgg cgtgcttggc ggacgtgatc accttcccgc tggacacggc caaagtccgg 121 ctccaggtcc aaggtgaatg cccgacgtcc agtgttatta ggtataaagg tgtcctggga 181 acaatcaccg ctgtggtaaa aacagaaggg cggatgaaac tctacagcgg gctgcctgcg 241 gggcttcagc ggcaaatcag ctccgcctct ctcaggatcg gcctctacga cacggtccag 301 gagttcctca ccgcagggaa agaaacagca cctagtttag gaagcaagat tttagctggt 361 ctaacgactg gaggagtggc agtattcatt gggcaaccca cagaggtcgt gaaagtcaga 421 cttcaagcac agagccatct ccacggaatc aaacctcgct acacggggac ttataatgcg 481 tacagaataa tagcaacaac cgaaggcttg acgggtcttt ggaaagggac tactcccaat 541 ctgatgagaa gtgtcatcat caattgtaca gagctagtaa catatgatct aatgaaggag 601 gcctttgtga aaaacaacat attagcagat gacgtcccct gccacttggt gtcggctctt 661 atcgctggat tttgcgcaac agctatgtcc tccccggtgg atgtagtaaa aaccagattt 721 attaattctc caccaggaca gtacaaaagt gtgcccaact gtgcaatgaa agtgttcact 781 aacgaaggac caacggcttt cttcaagggg ttggtacctt ccttcttgcg acttggatcc 841 tggaacgtca ttatgtttgt gtgctttgaa caactgaaac gagaactgtc aaagtcaagg 901 cagactatgg actgtgccac ataa // LOCUS HSU28488 1956 bp mRNA PRI 22-FEB-1996 DEFINITION Human putative G protein-coupled receptor (AZ3B) mRNA, complete cds. ACCESSION U28488 NID g1199577 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1956) AUTHORS Roglic,A., Prossnitz,E.R., Cavanagh,S.L., Pan,Z., Zou,A. and Ye,R.D. TITLE cDNA cloning of a novel G protein-coupled receptor with a large extracellular loop structure JOURNAL Biochim. Biophys. Acta 1305 (1-2), 39-43 (1996) MEDLINE 96180983 REFERENCE 2 (bases 1 to 1956) AUTHORS Ye,R.D. TITLE Direct Submission JOURNAL Submitted (06-JUN-1995) Richard D. Ye, Immunology, IMM-25, Scripps Research Institute, 10666 N. Torrey Pines Road, La Jolla, CA 92037, USA FEATURES Location/Qualifiers source 1..1956 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HL-60 promyelocytic cell line" gene 82..1530 /gene="AZ3B" CDS 82..1530 /gene="AZ3B" /note="putative G protein-coupled receptor; similar to human C5A anaphylatoxin chemotactic receptor, Swiss-Prot Accession Number P21730" /codon_start=1 /db_xref="PID:g1199578" /translation="MASFSAETNSTDLLSQPWNEPPVILSMVILSLTFLLGLPGNGLV LWVAGLKMQRTVNTIWFLHLTLADLLCCLSLPFSLAHLALQGQWPYGRFLCKLIPSII VLNMFASVFLLTAISLDRCLVVFKPIWCQNHRNVGMACSICGCIWVVAFVMCIPVFVY REIFTTDNHNRCGYKFGLSSSLDYPDFYGDPLENRSLENIVQRPGEMNDRLDPSSFQT NDHPWTVPTVFQPQTFQRPSADSLPRGSARLTSQNLYSNVFKPADVVSPKIPSGFPIE DHETSPLDNSDAFLSTHLKLFPSASSNSFYESELPQGFQDYYNLGQFTDDDQVPTPLV AITITRLVVGFLLPSVIMIACYSFIVFRMQRGRFAKSQSKTFRVAVVVVAVFLVCWTP YHIFGVLSLLTDPETPLGKTLMSWDHVCIALASANSCFNPFLYALLGKDFRKKARQSI QGILEAAFSEELTRSTHCPSNNVISERNSTTV" polyA_site 1956 /note="14 A nucleotides" BASE COUNT 488 a 479 c 418 g 571 t ORIGIN 1 cggtggggac cagacaggac tcgtggagac atccaggtgc tgaagccttc agctactgtc 61 tcagtttttt gaagtttagc aatggcgtct ttctctgctg agaccaattc aactgaccta 121 ctctcacagc catggaatga gcccccagta attctctcca tggtcattct cagccttact 181 tttttactgg gattgccagg caatgggctg gtgctgtggg tggctggcct gaagatgcag 241 cggacagtga acacaatttg gttcctccac ctcaccttgg cggacctcct ctgctgcctc 301 tccttgccct tctcgctggc tcacttggct ctccagggac agtggcccta cggcaggttc 361 ctatgcaagc tcatcccctc catcattgtc ctcaacatgt ttgccagtgt cttcctgctt 421 actgccatta gcctggatcg ctgtcttgtg gtattcaagc caatctggtg tcagaatcat 481 cgcaatgtag ggatggcctg ctctatctgt ggatgtatct gggtggtggc ttttgtgatg 541 tgcattcctg tgttcgtgta ccgggaaatc ttcactacag acaaccataa tagatgtggc 601 tacaaatttg gtctctccag ctcattagat tatccagact tttatggaga tccactagaa 661 aacaggtctc ttgaaaacat tgttcagcgg cctggagaaa tgaatgatag gttagatcct 721 tcctctttcc aaacaaatga tcatccttgg acagtcccca ctgtcttcca acctcaaaca 781 tttcaaagac cttctgcaga ttcactccct aggggttctg ctaggttaac aagtcaaaat 841 ctgtattcta atgtatttaa acctgctgat gtggtctcac ctaaaatccc cagtgggttt 901 cctattgaag atcacgaaac cagcccactg gataactctg atgcttttct ctctactcat 961 ttaaagctgt tccctagcgc ttctagcaat tccttctacg agtctgagct accacaaggt 1021 ttccaggatt attacaattt aggccaattc acagatgacg atcaagtgcc aacacccctc 1081 gtggcaataa cgatcactag gctagtggtg ggtttcctgc tgccctctgt tatcatgata 1141 gcctgttaca gcttcattgt cttccgaatg caaaggggcc gcttcgccaa gtctcagagc 1201 aaaacctttc gagtggccgt ggtggtggtg gctgtctttc ttgtctgctg gactccatac 1261 cacatttttg gagtcctgtc attgcttact gacccagaaa ctcccttggg gaaaactctg 1321 atgtcctggg atcatgtatg cattgctcta gcatctgcca atagttgctt taatcccttc 1381 ctttatgccc tcttggggaa agattttagg aagaaagcaa ggcagtccat tcagggaatt 1441 ctggaggcag ccttcagtga ggagctcaca cgttccaccc actgtccctc aaacaatgtc 1501 atttcagaaa gaaatagtac aactgtgtga aaatgtggag cagccaacaa gcaggggctc 1561 ttaggcaatc acatagtgaa agtttataag aggatgaagt gatatggtga gcagcggact 1621 tcaaaaactg tcaaagaatc aatccagcgg ttctcaaacg gtacacagac tattgacatc 1681 agcatcacct agaaacttgt tagaaatgca aattctcaag ccgcatccca gacttgctga 1741 atcggaatct ctgggggttg ggacccagca agggcactta acaaaccctc gtttctgatt 1801 aatgctaaat gtaagaatca ttgtaaacat tagttctatt tctatcccaa actaagctat 1861 gtgaaataag agaagctact ttgtttttaa atgatgttga atatttgtcg atatttccat 1921 cattaaattt ttccttagca ttgtctaagt cttcca // LOCUS HSU28687 2317 bp mRNA PRI 25-APR-1996 DEFINITION Human zinc finger containing protein ZNF157 (ZNF157) mRNA, complete cds. ACCESSION U28687 NID g881563 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2317) AUTHORS Derry,J.M., Jess,U. and Francke,U. TITLE Cloning and characterization of a novel zinc finger gene in Xp11.2 JOURNAL Genomics 30 (2), 361-365 (1995) MEDLINE 96163894 REFERENCE 2 (bases 1 to 2317) AUTHORS Derry,J.M. TITLE Direct Submission JOURNAL Submitted (07-JUN-1995) Jonathan M. Derry, Molecular Biology, Immunex Corporation, 51 University St, Seattle, WA 98101, USA FEATURES Location/Qualifiers source 1..2317 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Xp11.2" /chromosome="X" gene 87..1607 /gene="ZNF157" CDS 87..1607 /gene="ZNF157" /note="zinc finger containing protein" /codon_start=1 /product="ZNF157" /db_xref="PID:g881564" /translation="MPANGTSPQRFPALIPGEPGRSFEGSVSFEDVAVDFTRQEWHRL DPAQRTMHKDVMLETYSNLASVGLCVAKPEMIFKLERGEELWILEEESSGHGYSGSLS LLCGNGSVGDNALRHDNDLLHHQKIQTLDQNVEYNGCRKAFHEKTGFVRRKRTPRGDK NFECNECGKAYCRKSNLVEHLRIHTGERPYECGECAKTFSARSYLIAHQKTHTGERPF ECNECGKSFGRKSQLILHTRTHTGERPYECTECGKTFSEKATLTIHQRTHTGEKPYEC SECGKTFRVKISLTQHHRTHTGEKPYECGECGKNFRAKKSLNQHQRIHTGEKPYECGE CGKFFRMKMTLNNHQRTHTGEKPYQCNECGKSFRVHSSLGIHQRIHTGEKPYECNECG NAFYVKARLIEHQRMHSGEKPYECSECGKIFSMKKSLCQHRRTHTGEKPYECSECGNA FYVKVRLIEHQRIHTGERPFECQECGKAFCRKAHLTEHQRTHIGWSWRCTMKKASH" misc_feature 555..638 /gene="ZNF157" /note="encodes zinc finger region" repeat_region 2009..2317 /rpt_family="Alu" BASE COUNT 656 a 497 c 541 g 623 t ORIGIN 1 cagccttacc ccttactgga ggctgcagga ccctctacac acagacagct gtccagcacg 61 gaaggtgggc tgaggccagg gtgaacatgc cagctaatgg gacatcaccc cagagattcc 121 ctgccctgat tccaggagaa cctggcagat cttttgaggg gtccgtgtca ttcgaggatg 181 tggctgtcga tttcacccga caggagtggc acagactgga ccctgctcag aggaccatgc 241 acaaggatgt gatgctggag acctacagca acctggcatc tgtgggcctc tgcgtggcca 301 aaccagagat gatcttcaag ttggagcgag gagaagagct gtggatatta gaggaggaat 361 cctcaggcca tggttactca ggatctctct cactgctgtg tggcaatggt tctgttgggg 421 ataatgccct caggcatgat aatgaccttc ttcaccatca gaagattcaa acattggatc 481 aaaatgttga atataatgga tgcaggaaag ccttccatga gaaaacaggc tttgttagac 541 gtaaaagaac acccagagga gataaaaact ttgaatgtaa tgaatgtggg aaagcttact 601 gtaggaagtc aaaccttgtt gaacatctga gaatacacac aggagagaga ccctatgaat 661 gcggtgaatg tgcaaaaacc ttcagtgcaa gatcatacct cattgctcat cagaaaactc 721 acacagggga gaggcccttt gaatgtaatg aatgtgggaa atcttttggc aggaagtcac 781 aactcatcct acatacaaga acacacactg gagagagacc ctatgaatgt actgaatgtg 841 ggaaaacctt ttctgagaag gcaaccctca cgattcatca gagaactcac acaggggaga 901 aaccctatga atgtagtgaa tgtgggaaaa catttcgtgt aaagatatcc cttacccaac 961 accacagaac tcatacaggg gagaaacctt atgaatgtgg ggagtgtggg aaaaacttcc 1021 gtgcaaagaa atccctaaat cagcatcaaa gaattcacac aggtgagaaa ccctatgagt 1081 gtggtgaatg tgggaaattc ttccgaatga agatgactct caataatcat caaagaactc 1141 acacaggtga aaagccctat cagtgtaatg aatgtgggaa atctttcagg gtgcactcat 1201 ctcttgggat ccatcagaga attcacacag gagagaaacc ttacgaatgt aatgagtgtg 1261 gtaatgcttt ctatgtgaaa gcacgcctaa ttgaacatca gaggatgcat tcaggagaga 1321 aaccctacga atgtagtgaa tgtgggaaaa tcttcagtat gaagaaatcc ctttgtcaac 1381 accggagaac tcacacagga gagaaacctt atgaatgtag tgaatgtgga aatgccttct 1441 atgtgaaagt acgcctcatt gaacatcagc gaattcacac aggagagaga ccctttgagt 1501 gtcaagaatg tgggaaagct ttctgccgga aagcacacct cacagaacat cagagaactc 1561 acataggctg gtcctggcgt tgtacaatga agaaagcctc tcactgaaga cttccctcac 1621 cattggatca agctccttgg gggcatatga tcaccagggc accacagtgt gctgtgaaaa 1681 tttggcacct acatttgtat caaagtatgt cctgatcccc ttaggatagc tcattgtttt 1741 tattcagatt tccctaatta gtggtgtgtg aacatcttat ctgttgattg gctatttggg 1801 ttttttcttc tgtgcattga ctgttcatgt cctctgctca ttgttttcaa atgggctgtt 1861 catctcttgc tttttggttc acatgttgtt ttttaacata tgagaatata taaaaatcct 1921 ttttttttca gtcatatgtg ttgcaaatat cttctcccag actgtcgttg gttttttaaa 1981 ttttgccatc tgttttcttc tgctgaagtt ttatatttta tttatttatt ttttattttt 2041 tgagatggag tctcactctg tcacccaggc tggagtgcaa tggcatgacc tcagctcatt 2101 tcaaccttcg cctcccgggt tcaagcgatt ctcctgcctc agtctcccaa gtagctggga 2161 ttacaggtgt gcaccaccat gcctggctaa tttttgtatt tttagtaaag acgggtttca 2221 ccatgttggc caggctggtc tctaactcct gacctcaggt gatccacctg cctcggcctc 2281 caaaagtgtt gggattacag gtgtgagcca ccgtgcg // LOCUS HSU28694 1201 bp mRNA PRI 16-MAY-1996 DEFINITION Human eosinophil CC chemokine receptor 3 mRNA, complete cds. ACCESSION U28694 NID g1199579 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1200) AUTHORS Combadiere,C., Ahuja,S.K. and Murphy,P.M. TITLE Cloning and functional expression of a human eosinophil CC chemokine receptor JOURNAL J. Biol. Chem. 270 (28), 16491-16494 (1995) MEDLINE 95348056 REFERENCE 2 (bases 1 to 1201) AUTHORS Combadiere,C., Ahuja,S.K. and Murphy,P.M. TITLE Cloning and functional expression of a human eosinophil CC chemokine receptor JOURNAL J. Biol. Chem. 271 (18), 11034 (1996) MEDLINE 96210048 REFERENCE 3 (bases 1 to 1201) AUTHORS Combadiere,C. TITLE Direct Submission JOURNAL Submitted (07-JUN-1995) Christophe Combadiere, NIAID, National Institutes of Health, Building 10, Room 11N111, Bethesda, MD 20892, USA COMMENT [Erratum J. Biol. Chem. 270 (1995) 30235]. FEATURES Location/Qualifiers source 1..1201 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="eosinophil" CDS 32..1099 /codon_start=1 /product="CC chemokine receptor 3" /db_xref="PID:g1199580" /translation="MTTSLDTVETFGTTSYYDDVGLLCEKADTRALMAQFVPPLYSLV FTVGLLGNVVVVMILIKYRRLRIMTNIYLLNLAISDLLFLVTLPFWIHYVRGHNWVFG HGMCKLLSGFYHTGLYSEIFFIILLTIDRYLAIVHAVFALRARTVTFGVITSIVTWGL AVLAALPEFIFYETEELFEETLCSALYPEDTVYSWRHFHTLRMTIFCLVLPLLVMAIC YTGIIKTLLRCPSKKKYKAIRLIFVIMAVFFIFWTPYNVAILLSSYQSILFGNDCERS KHLDLVMLVTEVIAYSHCCMNPVIYAFVGERFRKYLRHFFHRHLLMHLGRYIPFLPSE KLERTSSVSPSTAEPELSIVF" BASE COUNT 278 a 320 c 267 g 336 t ORIGIN 1 tttttcttct tctatcacag ggagaagtga aatgacaacc tcactagata cagttgagac 61 ctttggtacc acatcctact atgatgacgt gggcctgctc tgtgaaaaag ctgataccag 121 agcactgatg gcccagtttg tgcccccgct gtactccctg gtgttcactg tgggcctctt 181 gggcaatgtg gtggtggtga tgatcctcat aaaatacagg aggctccgaa ttatgaccaa 241 catctacctg ctcaacctgg ccatttcgga cctgctcttc ctcgtcaccc ttccattctg 301 gatccactat gtcagggggc ataactgggt ttttggccat ggcatgtgta agctcctctc 361 agggttttat cacacaggct tgtacagcga gatctttttc ataatcctgc tgacaatcga 421 caggtacctg gccattgtcc atgctgtgtt tgcccttcga gcccggactg tcacttttgg 481 tgtcatcacc agcatcgtca cctggggcct ggcagtgcta gcagctcttc ctgaatttat 541 cttctatgag actgaagagt tgtttgaaga gactctttgc agtgctcttt acccagagga 601 tacagtatat agctggaggc atttccacac tctgagaatg accatcttct gtctcgttct 661 ccctctgctc gttatggcca tctgctacac aggaatcatc aaaacgctgc tgaggtgccc 721 cagtaaaaaa aagtacaagg ccatccggct catttttgtc atcatggcgg tgtttttcat 781 tttctggaca ccctacaatg tggctatcct tctctcttcc tatcaatcca tcttatttgg 841 aaatgactgt gagcggagca agcatctgga cctggtcatg ctggtgacag aggtgatcgc 901 ctactcccac tgctgcatga acccggtgat ctacgccttt gttggagaga ggttccggaa 961 gtacctgcgc cacttcttcc acaggcactt gctcatgcac ctgggcagat acatcccatt 1021 ccttcctagt gagaagctgg aaagaaccag ctctgtctct ccatccacag cagagccgga 1081 actctctatt gtgttttagg tcagatgcag aaaattgcct aaagaggaag gaccaaggag 1141 atgaagcaaa cacattaagc cttccacact cacctctaaa acagtccttc aaacttccag 1201 t // LOCUS HSU28727 8400 bp mRNA PRI 19-JUN-1996 DEFINITION Human pregnancy-associated plasma protein-A preproform (PAPPA) mRNA, complete cds. ACCESSION U28727 NID g1142969 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 3443 to 8400) AUTHORS Oxvig,C., Sand,O., Kristensen,T., Gleich,G.J. and Sottrup-Jensen,L. TITLE Circulating human pregnancy-associated plasma protein-A is disulfide-bridged to the proform of eosinophil major basic protein JOURNAL J. Biol. Chem. 268 (17), 12243-12246 (1993) MEDLINE 93286045 REFERENCE 2 (bases 1 to 8400) AUTHORS Kristensen,T., Oxvig,C., Sand,O., Moller,N.P. and Sottrup-Jensen,L. TITLE Amino acid sequence of human pregnancy-associated plasma protein-A derived from cloned cDNA JOURNAL Biochemistry 33 (6), 1592-1598 (1994) MEDLINE 94146014 REFERENCE 3 (bases 1 to 8400) AUTHORS Haaning,J., Oxvig,C., Overgaard,M.T., Ebbesen,P., Kristensen,T. and Sottrup-Jensen,L. TITLE Complete cDNA sequence of the preproform of human pregnancy-associated plasma protein-A. Evidence for expression in the brain and induction by cAMP JOURNAL Eur. J. Biochem. 237 (1), 159-163 (1996) MEDLINE 96203921 REFERENCE 4 (bases 1 to 8400) AUTHORS Haaning,J., Oxvig,C., Overgaard,M.T., Ebbesen,P., Kristensen,T. and Sottrup-Jensen,L. TITLE Direct Submission JOURNAL Submitted (08-JUN-1995) Jesper Haaning, Molecular Biology, Univ. of Aarhus, C.F. Mollers Alle, Bldg. 130, Aarhus, DK-8000, Denmark FEATURES Location/Qualifiers source 1..8400 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /map="9q33.1" /chromosome="9" sig_peptide 3215..3280 /gene="PAPPA" CDS 3215..8098 /gene="PAPPA" /note="putative metalloproteinase" /codon_start=1 /product="pregnancy-associated plasma protein-A preproform" /db_xref="PID:g1142970" /translation="MRLWSWVLHLGLLSAALGCGLAERPRRARRDPRAGRPPRPAAGP ATCATRGPRPPRLAAAAAAAGRAWEAVRVPRRRQQREARGATEEPSPPSRALYFSGRG EQLRVLRADLELPRDAFTLQVWLRAEGGQRSPAVITGLYDKCSYISRDRGWVVGIHTI SDQDNKDPRYFFSLKTDRARQVTTINAHRSYLPGQWVYLAATYDGQFMKLYVNGAQVA TSGEQVGGIFSPLTQKCKVLMLGGSALNHNYRGYIEHFSLWKVARTQREILSDMETHG AHTALPQLLLQENWDNVKHAWSPMKDGSSPKVEFSNAHGFLLDTSLEPPLCGQTLCDN TEVIASYNQLSSFRQPKVVRYRVVNLYEDDHKNPTVTREQVDFQHHQLAEAFKQYNIS WELDVLEVSNSSLRRRLILANCDISKIGDENCDPECNHTLTGHDGGDCRHLRHPAFVK KQHNGVCDMDCNYERFNFDGGECCDPEITNVTQTCFDPDSPHRAYLDVNELKNILKLD GSTHLNIFFAKSSEEELAGVATWPWDKEALMHLGGIVLNPSFYGMPGHTHTMIHEIGH SLGLYHVFRGISEIQSCSDPCMETEPSFETGDLCNDTNPAPKHKSCGDPGPGNDTCGF HSFFNTPYNNFMSYADDDCTDSFTPNQVARMHCYLDLVYQGWQPSRKPAPVALAPQVL GHTTDSVTLEWFPPIDGHFFERELGSACHLCLEGRILVQYASNASSPMPCSPSGHWSP REAEGHPDVEQPCKSSVRTWSPNSAVNPHTVPPACPEPQGCYLELEFLYPLVPESLTI WVTFVSTDWDSSGAVNDIKLLAVSGKNISLGPQNVFCDVPLTIRLWDVGEEVYGIQIY TLDEHLEIDAAMLTSTADTPLCLQCKPLKYKVVRDPPLQMDVASILHLNRKFVDMDLN LGSVYQYWVITISGTEESEPSPAVTYIHGRGYCGDGIIQKDQGEQCDDMNKINGDGCS LFCRQEVSFNCIDEPSRCYFHDGDGVCEEFEQKTSIKDCGVYTPQGFLDQWASNASVS HQDQQCPGWVIIGQPAASQVCRTKVIDLSEGISQHAWYPCTISYPYSQLAQTTFWLRA YFSQPMVAAAVIVHLVTDGTYYGDQKQETISVQLLDTKDQSHDLGLHVLSCRNNPLII PVVHDLSQPFYHSQAVRVSFSSPLVAISGVALRSFDNFDPVTLSSCQRGETYSPAEQS CVHFACEKTDCPELAVENASLNCSSSDRYHGAQCTVSCRTGYVLQIRRDDELIKSQTG PSVTVTCTEGKWNKQVACEPVDCSIPDHHQVYAASFSCPEGTTFGSQCSFQCRHPAQL KGNNSLLTCMEDGLWSFPEALCELMCLAPPPVPNADLQTARCRENKHKVGSFCKYKCK PGYHVPGSSRKSKKRAFKTQCTQDGSWQEGACVPVTCDPPPPKFHGLYQCTNGFQFNS ECRIKCEDSDASQGLGSNVIHCRKDGTWNGSFHVCQEMQGQCSVPNELNSNLKLQCPD GYAIGSECATSCLDHNSESIILPMNVTVRDIPHWLNPTRVERVVCTAGLKWYPHPALI HCVKGCEPFMGDNYCDAINNRAFCNYDGGDCCTSTVKTKKVTPFPMSCDLQGDCACRD PQAQEHSRKDLRGYSHG" gene 3215..8098 /gene="PAPPA" 3'UTR 8099..8400 BASE COUNT 2040 a 2218 c 2332 g 1810 t ORIGIN 1 ctgtccaggt tgagtgaggg tgtccaggat gaggaatgag ggtgtccagg gtgaggagtg 61 agggagtcca gggtgaggag tgagggagtc cagggtgagg attgagggtg tcagggtgag 121 tgagagtgtc cagggtgagg agtgagggta tccagggtga gtgagggtgt ccagggtgag 181 gagtgagggt atccagggtg agtgagggtg tccagggtga gtgagggtgt cagggtgagt 241 gaacgtgtcc agggtgtgtg agggtgtcca gggtgcagag tgaggtgtcc agggtgagga 301 gtgacggtgt ctggggtgag tgagggtgtc cagggtgagg agtgagggtg tcagggtgag 361 tgagggtgtc cagggttgag tgcacatgtg tggtgaggag gtgtttgcag tgcttcaggc 421 gcagcaactc tttcatctag tttaaaattg tgctctgagg ttagatttta gtagaacaaa 481 ggccttacaa agaatgtgaa aacattgtgc ttccctgctt acaggcaatt aaaaaggaga 541 atcaagctga gggtgcctgg tgtggggtgg ggtggagaag accacagaga ctattgtgtg 601 ttttattcaa cagtgtcctg ggctgctttc tccagaaatg tccctgacac atggatgtaa 661 gtgtggctag tttactggga gatgatccca gtgatgcagg acacgcgagc cctaagattg 721 aagcatagcc cgggagggtt cttagctttg cccaggaagg aactcaaggc aagccagtgg 781 tgttagcaac ttttattgaa gcggccggct gtgcacagca gcagcagagg cgctgctcct 841 tgcaaagcag ggctgcccta caggctgtgc gcccacagta gcagctcaga ggcagttctg 901 cagtggtatt tgtatccact tttaattata tgcaaatgaa ggggcagttt atgcagacat 961 ttccagggtg agggtggtaa cttctgggtg ctgccagagc catcgtgaac tgacttgaca 1021 caggtcggtg tgtcctatgg aaactagcat ctgccctgga cctattttag ctagtgctca 1081 gtttggtctg agtgcctgag ccccacttcc agagttgagt cccacctcct acctcattcc 1141 cccttcagag attagatact cctccttaat cttaaggggg cctcgagaag ggcggagtcc 1201 tgcttttccg taactacttc ctgctgagtt tatggcacgt aggccctgcc tggcactgga 1261 ggacgtaaaa atttctggat acctgatcta aggagcccag aggcaggacg atttcattct 1321 ccgtgtcagt ggacaggatg ggctggaagc cttgtgccag cattgtctct ggaactgtgg 1381 taatctagaa tacacaaact ttactaagag gttaaagaag caaggaccaa acatttgtaa 1441 caagacagtt gtcaaaggtc ctagaagagg tgaaaaacag gtgagacttg ggaaggcact 1501 tttgatggtt gaccagatat agttggggca gtgccctggt tatatctatg taactaggta 1561 gcttgctcat agatcttttg aatgttaacc tcaacctgtc cagagttaat atatgtgcag 1621 caggttttat taataactgc acaagacccc accttgttca gctagtaaat aatccaatgc 1681 tagtctgtta tcaacaacta cattttccag agtctgggga actcttgaat tctctttaat 1741 gcctgatctc cgttggtggc taaggattct aggatttgag ccaagttctt tagcgttaac 1801 tcatggtagg caaagcaccc cagggtgctg ctagtcctat tgccaccctg attcctgcca 1861 gaaataagta agcaagcaac aggacaatga actccatgtt gcccagatcc cactgagagt 1921 gaacgtgcag tcatgcccat aaccgacaca catcccagtc catgtgggtc agtccttcat 1981 caccctccct gccttctgac aacagcagac tccagccatt ccattatcat tcacagccca 2041 acccaagcag tcagtggctg aagaaagaga atcaggtata ctctatgtcc acatatacct 2101 tcctgccaca gggcttcacc aactggcagg attcccccca gcccccaact cctcctcatc 2161 ccttgttccc aggcgggtca attggagagt gagcagagat agatggcttg cagaagggta 2221 tgcaagatgg ggagactcca ccagacctta gggagagaat ggcatctttc tccacctttt 2281 aagagagaca gagcctccct gaggtctcaa agatttttca gggcaacaca agaaacgttt 2341 ccaagcctca gctgcaggtg ttcacctttc cactctgggg tgtgaggaga tggtaagctg 2401 aggacaagtg tgttccagat gcaggccaac tctgttctcc aacagtttga accaacttgg 2461 ccttggggtt ccaagggtag gatgaagggt gtcacggcag gacggggcct ggcgtgaggc 2521 aacagcaagc agagcgagga cacaacagtg gcagcagtgt acagacacag gacgtcagct 2581 tcaaacgatg caacagcagg gatatcttgg gcacggctct gattcgagcc actcccaact 2641 ctcttgcctc caggatggag gcactgtaca tgcaacagtc cagagaaaga ttctagccag 2701 tccaagccca ggcacatcca gagaaggtgg gagctcttcg ggttgactcc accgaggaaa 2761 acaggcattt gggtatttca gattcccgct ccacaataag gcaactttta aaaaaatatt 2821 atttccaaaa acaaaacaaa aattccaggg ggagggaatt cagcggatca gtcttaagag 2881 gagctttttt ttggagcgag aaatcatata aaataaaatg aaataaaaca aggaggaagg 2941 caaccagctg ttagggggaa aataaggcag ataaaggagc ggggagagaa attaattgcc 3001 aaccaggagg agttgggctg tatttttcaa aggtggggag agtggagcac acaccttgag 3061 gaggaaagcg agaaagaaaa gaaaaaagca agtgaagggg ggctcgccca agaagggtga 3121 agaagcgaag aaagtcgagg cgccgaggct cccaaagctg gcagctccgg gtggcggtgc 3181 aggggcgaag gggggggcgg ggggaacgtc ggacatgcgg ctctggagtt gggtgctgca 3241 cctggggctg ctgagcgccg cgctgggctg cgggctggcc gagcgtcccc gccgggcccg 3301 gagagacccg cgggccggcc gacccccgcg ccccgccgcc ggcccggcca cctgcgccac 3361 ccgcggcccg cggccgccgc gcctcgccgc cgccgccgcc gccgccgggc gtgcctggga 3421 agccgtgcgc gtcccccggc ggcggcagca gcgggaggcg aggggcgcca ccgaggagcc 3481 gagcccgccg agccgggcgc tctatttcag cgggcgaggc gagcagctgc gagtcctccg 3541 ggccgacctc gagctgcccc gggacgcgtt cacgctgcaa gtgtggctgc gagcggaggg 3601 gggccagagg tctccggcag tgatcacagg gctgtatgac aaatgttctt atatctcacg 3661 tgaccgagga tgggtcgtgg gcattcacac catcagtgac caagacaaca aagacccacg 3721 ctactttttc tccttgaaga cagaccgagc ccggcaagtg accaccatca atgcccaccg 3781 cagctacctc ccaggccagt gggtatacct agctgccacc tatgatgggc agttcatgaa 3841 gctctatgtg aatggtgccc aggtggccac ctctggggaa caagtgggtg gcatattcag 3901 cccactgacc cagaagtgca aagtgctcat gttagggggc agtgccctga atcacaacta 3961 ccggggctac atcgagcact tcagtctgtg gaaggtggcc aggactcagc gggagatact 4021 gtctgacatg gaaacccatg gcgcccacac tgctctacct cagctcctcc tccaggagaa 4081 ctgggacaat gtgaagcatg cctggtcccc catgaaggat ggcagcagcc ccaaagtgga 4141 attcagcaat gcccacggct ttctgctgga cacgagtctg gagcctcctc tgtgcggaca 4201 gacattgtgt gacaacacag aggtcattgc cagctacaat cagctctcaa gtttccgcca 4261 gcccaaggtg gtgcgctacc gcgtggtcaa cctctatgaa gatgatcata agaacccgac 4321 ggtgacgcgc gagcaggtgg acttccagca ccatcagctg gctgaggcct tcaagcaata 4381 caacatctcc tgggagctgg acgtgctgga ggtgagcaac tcctcccttc gccgccgcct 4441 catcctggcc aactgtgaca tcagcaagat tggggatgag aactgtgacc ccgagtgcaa 4501 ccacacgctg acgggccacg acggcgggga ttgccgccac ctgcgccacc ctgccttcgt 4561 gaagaagcag cacaacgggg tgtgtgacat ggactgcaac tatgaacggt tcaactttga 4621 tggtggagag tgctgtgacc ctgaaatcac caatgtcact cagacttgct ttgaccccga 4681 ctctccacac agagcctact tggatgttaa tgagctgaag aacattctta aattggatgg 4741 atcaacacat ctcaatattt tctttgcaaa atcctcagag gaggagttgg caggagtagc 4801 aacttggcca tgggacaagg aggccctgat gcacttaggt ggcattgtct tgaacccatc 4861 tttctatggc atgcctgggc acacccacac catgatccat gagattggtc acagcctggg 4921 cctctatcac gtcttccgag gcatctcaga aatccagtcc tgcagtgacc cctgcatgga 4981 gacagagccc tccttcgaga ctggagacct ctgcaatgat accaacccag cccctaaaca 5041 caagtcctgt ggtgacccag ggccaggaaa tgacacctgt ggctttcata gcttcttcaa 5101 cactccttac aacaacttca tgagctatgc agatgacgac tgtacggact ccttcacgcc 5161 caatcaagtc gccagaatgc actgttacct ggacctggtc taccagggct ggcagccctc 5221 caggaaacca gcgcctgttg ccctcgcccc ccaagttctg ggccacacaa cggactctgt 5281 gacactggag tggttcccac ctatagatgg ccatttcttt gaaagagaat tgggatcagc 5341 atgtcatctt tgcctggaag ggagaatcct ggtgcagtat gcttccaacg cttcctcccc 5401 aatgccctgc agcccatcag gacactggag ccctcgtgaa gcagaaggtc atcctgatgt 5461 tgaacagccc tgtaagtcca gtgtccgcac ctggagccca aattcagctg tcaacccaca 5521 cacggttcct ccagcctgcc ctgagcctca aggctgctac ctcgagctgg agttcctcta 5581 ccccttggtc cctgagtctc tgaccatttg ggtgaccttt gtctccactg actgggactc 5641 tagtggagct gtcaatgaca tcaaactgtt ggctgtcagt gggaagaaca tctccctggg 5701 tcctcagaat gtcttctgtg atgtcccact gaccatcaga ctctgggacg tgggcgagga 5761 ggtgtatggc atccaaatct acacgctgga tgagcacctg gagatcgatg ctgccatgtt 5821 gacctccact gcagacaccc cactctgtct acagtgtaag cccctgaagt ataaggtggt 5881 ccgggaccct cctctccaga tggatgtggc ctccatccta catctcaata ggaaattcgt 5941 agacatggat ctaaatcttg gcagtgtgta ccagtattgg gtcataacta tttcaggaac 6001 tgaagagagt gagccatcac ctgctgtcac atacatccat ggacgtgggt actgtggcga 6061 tggcattata caaaaagacc aaggtgaaca atgcgacgac atgaataaga tcaatggtga 6121 tggctgctcc cttttctgcc gacaagaagt ctccttcaat tgtattgatg aacccagccg 6181 gtgctatttc catgatggtg atggggtatg tgaggagttt gaacaaaaaa ccagcattaa 6241 ggactgtggt gtctacacgc cccagggatt cctggatcag tgggcatcca atgcttcagt 6301 atctcatcaa gaccagcaat gcccaggctg ggtcatcatc ggacagccag cagcatccca 6361 ggtgtgtcga accaaggtga tagatctcag tgaaggcatt tcccagcatg cctggtaccc 6421 ttgcaccatc agctacccat attcccagct ggctcagacc actttttggc tccgggcgta 6481 tttttctcaa ccaatggttg ccgcagctgt cattgtccac ctggtgacgg atgggacata 6541 ttatggggac caaaagcagg agaccatcag cgtgcagctg cttgatacca aagatcagag 6601 ccacgatcta ggcctccatg tcctgagctg caggaacaat cccctgatta tccctgtggt 6661 ccatgacctc agccagccct tctaccacag ccaggcggta cgtgtgagct tcagttcgcc 6721 cctggtcgcc atctcggggg tggccctccg ttccttcgac aactttgacc ccgtcaccct 6781 gagcagctgc cagagagggg agacctacag ccctgccgag cagagctgcg tgcacttcgc 6841 atgtgagaaa actgactgtc cagagctggc tgtggagaat gcttctctca attgctccag 6901 cagcgaccgc taccacggtg cccagtgtac tgtgagctgc cggacaggct acgtgctcca 6961 gatacggcgg gatgatgagc tgatcaagag ccagacggga cccagcgtca cagtgacctg 7021 tacagagggc aagtggaata agcaggtggc ctgtgagcca gtcgactgca gcatcccaga 7081 tcaccatcaa gtctatgctg cctccttctc ctgccctgag ggcaccacct ttggcagtca 7141 atgttccttc cagtgccgtc accctgcaca attgaaaggc aacaacagcc tcctgacctg 7201 catggaggat gggctgtggt ccttcccaga ggccctgtgt gagctcatgt gcctcgctcc 7261 accccctgtg cccaatgcag acctccagac cgcccggtgc cgagagaata agcacaaggt 7321 gggctccttc tgcaaataca aatgcaagcc tggataccat gtgcctggat cctctcggaa 7381 gtcaaagaaa cgggccttca agactcagtg tacccaggat ggcagctggc aggagggagc 7441 ttgtgttcct gtgacctgtg acccacctcc accaaaattc catgggctct accagtgtac 7501 taatggcttc cagttcaaca gtgagtgtag gatcaagtgt gaagacagtg atgcctccca 7561 gggacttggg agcaatgtca ttcattgccg gaaagatggc acctggaacg gctccttcca 7621 tgtctgccag gagatgcaag gccagtgctc ggttccaaac gagctcaaca gcaacctcaa 7681 actgcagtgc cctgatggct atgccatagg gtcggagtgt gccacctcgt gcctggacca 7741 caacagcgag tccatcatcc tgccaatgaa cgtgaccgtg cgtgacatcc cccactggct 7801 gaaccccaca cgggtagaga gagttgtctg cactgctggt ctcaagtggt atcctcaccc 7861 tgctctgatt cactgtgtca aaggctgtga gcccttcatg ggagacaatt attgtgatgc 7921 catcaacaac cgagcctttt gcaactatga cggtggggat tgctgcacct ccacagtgaa 7981 gaccaaaaag gtcaccccat tccctatgtc ctgtgaccta caaggtgact gtgcttgtcg 8041 ggacccccag gcccaagaac acagccggaa agacctccgg ggatacagcc atggctaagg 8101 aaggacaaga agttgtcaaa gaattcccaa cgccaggacc cacatccctt tggtattgat 8161 ttcacagtca gctgctcaac ggaatggcct ctccacacca gggatcctta gcacccaacc 8221 ggtctgcctt taattttacc caggaaggac tcacattggg gcgaatgaac caagtttcgc 8281 catgctggat gatgaaatgg attcccatcc caaagtctga gatggattgc atatacagtg 8341 tgcagtccca gagcctccta aaattctagc catttgtcac acaaccacag caaaaaaaaa // LOCUS HSU28811 3909 bp mRNA PRI 12-JUN-1996 DEFINITION Human cysteine-rich fibroblast growth factor receptor (CFR-1) mRNA, complete cds. ACCESSION U28811 NID g1373018 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Burrus,L.W., Zuber,M.E., Lueddecke,B.A. and Olwin,B.B. TITLE Identification of a cysteine-rich receptor for fibroblast growth factors JOURNAL Mol. Cell. Biol. 12 (12), 5600-5609 (1992) MEDLINE 93078761 REFERENCE 2 (sites) AUTHORS Steegmaier,M., Levinovitz,A., Isenmann,S., Borges,E., Lenter,M., Kocher,H.P., Kleuser,B. and Vestweber,D. TITLE The E-selectin-ligand ESL-1 is a variant of a receptor for fibroblast growth factor JOURNAL Nature 373 (6515), 615-620 (1995) MEDLINE 95157635 REFERENCE 3 (bases 1 to 3909) AUTHORS Wu,M., Chen,J., Tan,Y,H., Hong,W.J. and Ting,R. TITLE Direct Submission JOURNAL Submitted (09-JUN-1995) Mian Wu, IMCB, NUS, 10 Kent Ridge Crescent, Singapore 0511, Singapore FEATURES Location/Qualifiers source 1..3909 /organism="Homo sapiens" /db_xref="taxon:9606" gene 27..3560 /gene="CFR-1" CDS 27..3560 /gene="CFR-1" /codon_start=1 /product="cysteine-rich fibroblast growth factor receptor" /db_xref="PID:g1373019" /translation="MAACGRVRRMFRLSAALHLLLLFAAGGRNSPARASHSQGQGPGA NFVSFVGQAGGGGPAGQQLPQLPQSSQLQQQQQQQQQQQQPQPPQPPFPAGGPPRRGG AGAGGGWKLAEEESCREDVTRVCPKHTWSNNLAVLECLQDVREPENEISSDCNHLLWN YKLNLTTDPKFESVAREVCKSTITEIKECADEPVGKGYMVSCLVDHRGNITEYQCHQY ITKMTAIIFSDYRLICGFMDDCKNDINILKCGSIRLGEKDAHSQGEVVSCLEKGLVKE AEEREPKIQVSELCKKAILRVAELSSDDFHLDRHLYFACRDDRERFCENTQAGEGRVY KCLFNHKFEESMSEKCREALTTRQKLIAQDYKVSYSLAKSCKSDLKKYRCNVENLPRS REARLSYLLMCLESAVHRGRQVSSECQGEMLDYRRMLMEDFSLSPEIILSCRGEIEHH CSGLHRKGRTLHCLMKVVRGEKGNLGMNCQQALQTLIQETDPGADYRIDRALNEACES VIQTACKHIRSGDPMISSCLMEHLYTEKMVEDCEHRLLELQYFISRDWKLDPVLYRKC QGDASRLCHTHGWNETSEFMPQGAVFSCLYRHAYRTEEQGRRLSRECRAEVQRILHQR AMDVKLDPALQDKCLIDLGKWCSEKTETGQELECLQDHLDDLVVECRDIVGNLTELES EDIQIEALLMRACEPIIQTFCHDADNQIDSGDLMECLIQNKHQKDMNEKCAIGVTHFQ LVQMKDFRFSYKFKMACKEDVLKLCPNIKKKVDVVICLSTTVRNDTLQEAKEHRVSLK CRRQLRVEELEMTEDIRLEPDLYEACKSDIKNFCSAVQYGNAQIIECLKENKKQLSTR CHQKVFKLQETEMMDPELDYTLMRVCKQMIKRFCPEADSKTMLQCLKQNKNSELMDPK CKQMITKRQITQNTDYRLNPMLRKACKADIPKFCHGILTKAKDDSELEGQVISCLKLR YADQRLSSDCEDQIRIIIQESALDYRLDPQLQLHCSDEISSLCAEEAAAQEQTGQVEE CLKVNLLKIKTELCKKEVLNMLKESKADIFVDPVLHTACALDIKHHCAALTPGRGRQM SCLMEALEDKRVRLQPECKKRLNDRIEMWSYAAKVAPADGFSDLAMQVMTSPSKNYIL SVISGSICILFLIGLMCGRITKRVTRELKDR" polyA_site 3909 /note="27 A nucleotides" BASE COUNT 1060 a 944 c 1052 g 853 t ORIGIN 1 ggcacgaggc tcgccgcgga ctcaagatgg cggcgtgtgg acgtgtacgg aggatgttcc 61 gcttgtcggc ggcgctgcat ctgctgctgc tattcgcggc cgggggcaga aactccccgg 121 ccagggcgtc ccacagccag ggccagggtc ccggggccaa ctttgtgtcc ttcgtagggc 181 aggccggagg cggcggcccg gcgggtcagc agctgcccca gctgcctcag tcatcgcagc 241 ttcagcagca acagcagcag cagcaacagc aacagcagcc tcagccgccg cagccgcctt 301 tcccggcggg tgggcctccg cggcggggag gagcgggggc tggtgggggc tggaagctgg 361 cggaggaaga gtcctgcagg gaggacgtga cccgcgtgtg ccctaagcac acctggagca 421 acaacctggc ggtgctcgag tgcctgcagg atgtgaggga gcctgaaaat gaaatttctt 481 cagactgcaa tcatttgttg tggaattata agctgaacct aactacagat cccaaatttg 541 aatctgtggc cagagaggtt tgcaaatcta ctataacaga gattaaagaa tgtgctgatg 601 aaccggttgg aaaaggttac atggtttcct gcttagtgga tcaccgaggc aacatcactg 661 agtatcagtg tcaccagtac attaccaaga tgacggccat catttttagt gattaccgtt 721 taatctgtgg cttcatggat gactgcaaaa atgacatcaa cattctgaaa tgtggcagta 781 ttcggcttgg agaaaaggat gcacattcac aaggtgaggt ggtatcatgc ttggagaaag 841 gcctggtgaa agaagcagaa gaaagagaac ccaagattca agtttctgaa ctctgcaaga 901 aagccattct ccgggtggct gagctgtcat cggatgactt tcacttagac cggcatttat 961 attttgcttg ccgagatgat cgggagcgtt tttgtgaaaa tacacaagct ggtgagggca 1021 gagtgtataa gtgcctcttt aaccataaat ttgaagaatc catgagtgaa aagtgtcgag 1081 aagcacttac aacccgccaa aagctgattg cccaggatta taaagtcagt tattcattgg 1141 ccaaatcctg taaaagtgac ttgaagaaat accggtgcaa tgtggaaaac cttccgcgat 1201 cgcgtgaagc caggctctcc tacttgttaa tgtgcctgga gtcagctgta cacagagggc 1261 gacaagtcag cagtgagtgc cagggggaga tgctggatta ccgacgcatg ttgatggaag 1321 acttttctct gagccctgag atcatcctaa gctgtcgggg ggagattgaa caccattgtt 1381 ccggattaca tcgaaaagga cggaccctac actgtctgat gaaggtagtt cgaggggaga 1441 aggggaacct tggaatgaac tgccagcagg cgcttcaaac actgattcag gagactgacc 1501 ctggtgcaga ttaccgcatt gatcgagctt tgaatgaagc ttgtgaatct gtaatccaga 1561 cagcctgcaa acatataaga tctggagacc caatgatctc gtcgtgcctg atggaacatt 1621 tatacacaga gaagatggta gaagactgtg aacaccgtct cttagagctg cagtatttca 1681 tctcccggga ttggaagctg gaccctgtcc tgtaccgcaa gtgccaggga gacgcttctc 1741 gtctttgcca cacccacggt tggaatgaga ccagtgaatt tatgcctcag ggagctgtgt 1801 tctcttgttt atacagacac gcctaccgca ctgaagaaca gggaaggagg ctctcacggg 1861 agtgccgagc tgaagtccaa aggatcctac accagcgtgc catggatgtc aagctggatc 1921 ctgccctcca ggataagtgc ctgattgatc tgggaaaatg gtgcagtgag aaaacagaga 1981 ctggacagga gctggagtgc cttcaggacc atctggatga cttggtggtg gagtgtagag 2041 atatagttgg caacctcact gagttagaat cagaggatat ccaaatagaa gccttgctga 2101 tgagagcctg tgagcccata attcagacat tctgccacga tgcggataac cagatagact 2161 ctggggacct gatggagtgt ctgatacaga acaaacacca gaaggacatg aacgagaagt 2221 gtgccatcgg agttacccac ttccagctgg tgcagatgaa ggattttcgg ttttcttaca 2281 agtttaaaat ggcctgcaag gaggacgtgt tgaagctttg cccaaacata aaaaagaagg 2341 tggacgtggt gatctgcctg agcacgaccg tgcgcaatga cactctgcag gaagccaagg 2401 agcacagggt gtccctgaag tgccgcaggc agctccgtgt ggaggagctg gagatgacgg 2461 aggacatccg cttggagcca gatctatacg aagcctgcaa gagtgacatc aaaaacttct 2521 gttccgctgt gcaatatggc aacgctcaga ttatcgaatg tctgaaagaa aacaagaagc 2581 agctaagcac ccgctgccac caaaaagtat ttaagctgca ggagacagag atgatggacc 2641 cagagctaga ctacaccctc atgagggtct gcaagcagat gataaagagg ttctgtccgg 2701 aagcagattc taaaaccatg ttgcagtgct tgaagcaaaa taaaaacagt gaattgatgg 2761 atcccaaatg caaacagatg ataaccaagc gccagatcac ccagaacaca gattaccgct 2821 taaaccccat gttaagaaaa gcctgtaaag ctgacattcc taaattctgt cacggtatcc 2881 tgactaaggc caaggatgat tcagaattag aaggacaagt catctcttgc ctgaagctga 2941 gatatgctga ccagcgcctg tcttcagact gtgaagacca gatccgaatc attatccagg 3001 agtccgccct ggactaccgc ctggatcctc agctccagct gcactgctca gacgagatct 3061 ccagtctatg tgctgaagaa gcagcagccc aagagcagac aggtcaggtg gaggagtgcc 3121 tcaaggtcaa cctgctcaag atcaaaacag aattgtgtaa aaaggaagtg ctaaacatgc 3181 tgaaggaaag caaagcagac atctttgttg acccggtact tcatactgct tgtgccctgg 3241 acattaaaca ccactgcgca gcactcaccc ctggccgcgg gcgtcaaatg tcctgtctca 3301 tggaagcact ggaggataag cgggtgaggt tacagcccga gtgcaaaaag cgcctcaatg 3361 accggattga gatgtggagt tacgcagcaa aggtggcccc agcagatggc ttctctgatc 3421 ttgccatgca agtaatgacg tctccatcta agaactacat tctctctgtg atcagtggga 3481 gcatctgtat attgttcctg attggcctga tgtgtggacg gatcaccaag cgagtgacac 3541 gagagctcaa ggacaggtag agccaccttg accaccaaag gaactaccta tccagtgccc 3601 agtttgtaca gccctcttgt atagcatccc cactcacctc gctcttctca gaagtgacac 3661 caaccccgtg ttagagcatt agcagatgtc cactgcgttg tcccatccag cctccactcg 3721 tgtccatggt gtcctcctcc tcctcaccgt gcagcagcag cagctggtcg ctggggttac 3781 tgcctttgtt tggcaaactt gggtttacct gcctgtagac aagtctctct cataccaaca 3841 gaacttccgg tacttccaga accaactcac ctgacctgca actcaaaggc ttttttaaga 3901 aaaccacca // LOCUS HSU28833 2160 bp mRNA PRI 23-OCT-1997 DEFINITION Human Down syndrome candidate region protein (DSCR1) mRNA, complete cds. ACCESSION U28833 NID g1125051 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2160) AUTHORS Fuentes,J.J., Pritchard,M.A., Planas,A.M., Bosch,A., Ferrer,I. and Estivill,X. TITLE A new human gene from the Down syndrome critical region encodes a proline-rich protein highly expressed in fetal brain and heart JOURNAL Hum. Mol. Genet. 4 (10), 1935-1944 (1995) MEDLINE 96121593 REFERENCE 2 (bases 1 to 2160) AUTHORS Estivill,X. TITLE Direct Submission JOURNAL Submitted (09-JUN-1995) Xavier Estivill, Molecular Genetics Department, Cancer Research Institute, Hospital Duran i Reynals, Avia. Castelldefels Km 2.7, Hospitalet, Barcelona, Catalonia 08907, Spain FEATURES Location/Qualifiers source 1..2160 /organism="Homo sapiens" /db_xref="taxon:9606" /map="21q22.1-q22.2" /chromosome="21" gene 49..564 /gene="DSCR1" CDS 49..564 /gene="DSCR1" /codon_start=1 /product="Down syndrome candidate region protein 1" /db_xref="PID:g1125052" /translation="MVYAKFESLFRTYDKDITFQYFKSFKRVRINFSNPFSAADARLQ LHKTEFLGKEMKLYFAQTLHIGSSHLAPPNPDKQFLISPPASPPVGWKQVEDATPVIN YDLLYAISKLGPGEKYELHAATDTTPSVVVHVCESDQEKEEEEEMERMRRPKPKIIQT RRPEYTPIHLS" polyA_site 2160 /note="14 A nucleotides" BASE COUNT 590 a 435 c 503 g 631 t 1 others ORIGIN 1 gaactatagt tgaaggctgc tgccaataca acaccactgt gaaacagaat ggtgtatgcc 61 aaatttgagt ccctctttag gacgtatgac aaggacatca cctttcagta ttttaagagc 121 ttcaaacgag tcagaataaa cttcagcaac cccttctccg cagcagatgc caggctccag 181 ctgcataaga ctgagtttct gggaaaggaa atgaagttat attttgctca gaccttacac 241 ataggaagct cacacctggc tccgccaaat ccagacaagc agtttctgat ctcccctccc 301 gcctctccgc cagtgggatg gaaacaagtg gaagatgcga ccccagtcat aaactatgat 361 ctcttatatg ccatctccaa gctggggcca ggggaaaagt atgaattgca cgcagcgact 421 gacaccactc ccagcgtggt ggtccatgta tgtgagagtg atcaagagaa ggaggaagaa 481 gaggaaatgg aaagaatgag gagacctaag ccaaaaatta tccagaccag gaggccggag 541 tacacgccga tccacctcag ctgaactggc acgcgacgag gacgcattcc aaatcatact 601 cacgggagga atcttttact gtggaggtgg ctggtcacga cttcttcgga ggtggcagcc 661 gagatcgggg tggcagaaat cccagttcat gttgctcaga agagaatcaa ggccgtgtcc 721 ccttgttcta atgctgcaca ccagttactg ttcatggcac ccgggaatga cttgggccaa 781 tcactgagtt tgtggtgatc gcacaaggac atttgggact gtcttgagaa aacagataat 841 gatagtgttt tgtacttgtt cttttctggt aggttctgtc tgtgccaagg gcaggttgat 901 cagtgagctc aggagagagc ttcctgtttc taagtggcct gcaggggcca ctctctactg 961 gtaggaagag gtaccacagg aagccgccta gtgcagagag gttgtgaaaa cagcagcaat 1021 gcaatgtgga aattgtagcg tttcctttct tccctcatgt tctcatgttt gtgcatgtat 1081 attactgatt tacaagacta acctttgttc gtatataaag ttacaccgtt gttgttttac 1141 atcttttggg aagccaggaa agcgtttgga aaacgtatca cctttcccag attctcggat 1201 tctcgactct ttgcaacagc acttgcttgc ggaactcttc ctggaatgca ttcactcagc 1261 atccccaacc gtgcaacgtg taacttgtgc ttttgcaaaa gaagttgatc tgaaattcct 1321 ctgtagaatt tagcttatac aattcagaga atagcagttt cactgccaac ttttagtggg 1381 tgagaaattt tagtttaggt gtttgggatc ggacctcagt ttctgttgtt tcttttatgt 1441 ggtggtttct atacatgaat catagccaaa aactttttcg gaaactgttg gttgagatag 1501 ttggttcttt taccccacga agacatcaag atacacttgt aaataaagct gatagcatat 1561 attcatacct gttgtacact tgggtgaaaa gtatggcagt gggagactaa gatgtattaa 1621 cctacctgtg aatcatatgt tgtaggaaaa gctgttccca tgtctaacag gacttgaatt 1681 caaagcatgt caagtggata gtagatctgt ggcgatatga gagggatgca gtgcctttcc 1741 ccattcattc ctgatggaat tgttatacta ggttaacatt tgtaattttt ttctagttgt 1801 aatgtgtatg tctggtaaat aggtattata ttttggcctt acaataccgt aacaatgttt 1861 gtcattttga aatacttaat gccaagtaac aatgcatgct ttggaaattt ggaacatggt 1921 tttattcttt gagaagcaaa tatgtttgca ttaaatgctt tgattgttcg tatcaagaaa 1981 ttgattgaac gttctcaaac cctgtttacg gtacttggta agagggagcc ggtttgggag 2041 agaccattgc atcgctgngc caagtgtttc ttgttaagtt cctttaaact ggagaggcta 2101 acctcaaaat acttttttta cctgcattct ataataaatg ggcacagtat gctccttaca // LOCUS HSU28838 3649 bp mRNA PRI 04-AUG-1995 DEFINITION Human transcription factor TFIIIB 90 kDa subunit (hTFIIIB90) mRNA, complete cds. ACCESSION U28838 NID g927597 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3649) AUTHORS Wang,Z. and Roeder,R.G. TITLE Structure and function of a human transcription factor TFIIIB subunit that is evolutionarily conserved and contains both TFIIB- and high-mobility-group protein 2-related domains JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (15), 7026-7030 (1995) MEDLINE 95350204 REFERENCE 2 (bases 1 to 3649) AUTHORS Wang,Z. TITLE Direct Submission JOURNAL Submitted (08-JUN-1995) Zhengxin Wang, Biochemistry and Molecular Biology Laboratory, Rockefeller University, 1230 York Avenue, New York, NY 10021, USA FEATURES Location/Qualifiers source 1..3649 /organism="Homo sapiens" /db_xref="taxon:9606" gene 367..2394 /gene="hTFIIIB90" CDS 367..2394 /gene="hTFIIIB90" /codon_start=1 /function="transcription factor" /product="TFIIIB 90 kDa subunit" /db_xref="PID:g927598" /translation="MTGRVCRGCGGTDIELDAARGDAVCTACGSVLEDNIIVSEVQFV ESSGGGSSAVGQFVSLDGAGKTPTLGGGFHVNLGKESRAQTLQDGRRHIHHLGNQLQL NQHCLDTAFNFFKMAVSRHLTRGRKMAHVIAACLYLVCRTEGTPHMLLVLSDLLQVNV YVLGKTFLLLARELCINAPAIDPCLYIPRFAHLLEFGEKNHEVSMTALRLLQRMKRDW MHTGRGPSGLCGGALLVAARMHDFRRTVKEVISVVKVCESTLRKRLTEFEDTPTSQLT IDEFMKIDLEEECDPPSYTAGQRKLRMKQLEQVLSKKLEEVEGEISSYQDAIEIELEN SRPKRGGLQPGKRWLHRGHRVQLVWRGGHRGRGAGSRGQPPEQRLIPGAPWWCPRQLG SSRKPRVGRQTPALGSLLDPLPTAASLGISDSIRECISSQSSDPKDASGDGELDLSGI DDLEIDRYILNESEARVKAELWMRENAEYLREQREKEARIAKEKELGIYKEHKPKKSC KRREPIQASTAREAIEKMLEQKKISSKINYSVLRGLSSAGGGSPHREDAQPEHSASAR KLSRRRTPASRSGADPVTSVGKRLRTLVSTQPAKKVATGEALLPSSPTLEAEPARPQA VLVESGPVSYHADEEADEEEPDEEDGEPCVSALQMMGSNDYGCDGDEDDGY" polyA_site 3649 /note="13 A nucleotides" BASE COUNT 709 a 1107 c 1241 g 592 t ORIGIN 1 cggccgcgtc gaccggctgc gctcaccggt aggccccgct cgggttccgc cgaagcccag 61 cccccgcagg tcggcccctc cgacgccggc cgcgccgcaa gggaggccag ctcgctcgca 121 gtggggaggt cgcggctcca gtcctcgcgt ccccgccgtg gtcccggtgc ctgtcccatc 181 ccgcgggcgg ggccgttgcg gggccgggcc cgggccgggg cgaatctgcg gctgcgaatc 241 ggctggagcg gggcctcgcg agaggccgag gctgggcggc tgggctgggc gggcggccgg 301 ggctgctccg gaggctcggg tggcttgaga gtcttgggag gctccgcctg cccgccggtc 361 gccggcatga cgggccgcgt gtgccgcggt tgcggcggca cggacatcga gctggacgcg 421 gcccgcgggg acgcggtgtg caccgcctgc ggctcagtgc tggaggacaa catcatcgtg 481 tccgaggtgc agttcgtgga gagcagcggc ggcggctcct cggccgtggg ccagttcgtg 541 tccctggacg gtgctggcaa aaccccgact ctgggtggcg gcttccacgt gaatctgggg 601 aaggagtcga gagcgcagac cctgcaggat gggaggcgcc acatccacca cctggggaac 661 cagctgcagc tgaaccagca ctgcctggac accgccttca acttcttcaa gatggccgtg 721 agcaggcacc tgacccgcgg ccggaagatg gcccacgtga ttgctgcctg cctctacctg 781 gtctgccgta cggagggcac gccgcacatg ctcctggtcc tcagcgacct gctccaggtg 841 aatgtgtacg tgcttggaaa gacgtttctt ctcttggcaa gagagctctg catcaatgcg 901 ccggccatag acccgtgcct gtatattcca cgctttgcgc acctgctgga attcggggag 961 aagaaccacg aggtgtccat gactgccctg aggctcctac agaggatgaa gcgggactgg 1021 atgcacacag gccggggccc ctcgggcctc tgcggaggag cgctcctggt tgcagccaga 1081 atgcatgact tcaggaggac tgtgaaggag gtcatcagtg tggtcaaagt gtgtgagtcc 1141 acgctgcgga agaggctcac ggaatttgaa gacaccccca ccagtcagtt gaccattgat 1201 gagttcatga agatcgacct ggaggaggag tgcgaccccc cctcgtacac agctgggcag 1261 aggaagctgc ggatgaagca gcttgaacaa gtcctgtcaa aaaaactgga ggaggttgaa 1321 ggtgaaatat ccagttacca ggatgcaatt gagattgaac tagaaaacag ccggccaaaa 1381 cggggggggc tgcagcctgg caaaagatgg ctccaccgag gacaccgcgt ccagcttgtg 1441 tggcgaggag gacacagagg acgaggagct ggaagccgcg gccagccacc tgaacaaaga 1501 cttataccgg gagctccttg gtggtgcccc cggcagctcg gaagcagcag gaagccccga 1561 gtggggcggc agactccggc cctggggtcc ctgctggacc ccctccccac tgcagccagc 1621 ctgggcatct cagactccat ccgcgaatgc atctcctctc agagcagcga ccccaaagat 1681 gcttcaggag acggtgagct ggacctcagt ggcattgatg acctggagat tgacaggtac 1741 atcctgaatg agtcggaagc ccgcgtgaag gccgagctgt ggatgaggga gaacgccgag 1801 tacctgcggg aacagaggga aaaagaagca agaatagcga aagagaagga gctcggcatc 1861 tacaaggaac acaagcccaa gaagtcttgc aagcgacggg agccaattca ggccagtacc 1921 gccagggagg ccatcgagaa gatgctggag cagaagaaga tctccagcaa gatcaattat 1981 agcgtgctcc ggggcctcag cagcgccggc gggggcagtc cgcacaggga ggatgcacag 2041 cccgagcata gcgccagtgc caggaagctg tcacgaagga ggacgccggc cagcagaagt 2101 ggggctgacc ctgtgaccag tgtggggaaa aggttgagga ctctggtgtc tacgcagcca 2161 gcaaagaagg tggccacggg agaggctttg ctcccaagct ctcccaccct cgaagctgag 2221 cctgccaggc cccaggcggt gctggtggag agcgggcccg tgtcatacca cgccgacgag 2281 gaggctgacg aggaggagcc tgacgaggag gacggggagc cctgcgtcag tgccctgcag 2341 atgatgggca gcaacgacta tggctgtgat ggcgatgagg acgacggcta ctgaagtgtg 2401 gcctccaggc aggtgatgtc ctggcagggg gcctcgcggg tctcctcagc atcagacggg 2461 cttccaggac cgcagcaggc aggccccagc gccgagactc ctggtgacag gtggcacctg 2521 tcccacagcc ctcggtccca tgtggaactt accattggga ttgtgtttct attcagcaag 2581 ggaaaccgga ccaacgtctg catgtgtgtg atcagatgtg ggccgggtgt gtgcagggct 2641 gggtcccgct gcctgccgtc gactcatcca aggaccctcc aaggctggca gtgtggtgtt 2701 gctactatta aggaaacagg cttggggcag ccccactgct ggtccaagtg tgtggagggc 2761 tgagtgtgct ggccctgtga ctcaggacca gctctggagt ctccagccca ccctccgcac 2821 cgtcccctcc tgagcaggac tcggcgccca ggcctctgcc agagtggaag ccagagccct 2881 gcaggtgtgg cgcagccgtg ggagctgagg atctggcact tgagaggcag cagctccttg 2941 aaggtcctct gcctccagct gtggccctgc atccagatac ctgcctcgtc caaggcagac 3001 acccccaccc ctgcctcctc cagacccccc tccccgctgc ctgcaccgcc tggagcagca 3061 tgggggtcag accctgctcc agggccactt gagttgtggg cccaggagcc tgcggctgcc 3121 ggcagtgaac tgagtgcccg acagctgaga ccggcgccca cccgtcctga gcatagctct 3181 gtaggcagtg cgggcatagc ctgcatagtg tcctggcgct gggagttgcc cgtggacaga 3241 gccagagggc agtggcgctc cctgtcagag ctggatcagg ccccccatcg aggagggagg 3301 gcagacgagg cccgagagcc tccccaggcc tcttcgtggg aaggccccag taccactcgt 3361 aggaggtctc agctctgcat ggctgccccg gatgtggccg aaggggcttc accctgtgtc 3421 cttaggaggg ggtggccttg aggcagagcc gtgcctcact gacccccagg ggcctcatcc 3481 tccccatgga atgggctgta tgtcctgccc caacttggcc cgcagcaggc cagacccccc 3541 tacccccgcc cagagctcag tagccagcct gggtcctgcc agggcttctc gagggcttgg 3601 gggaagaata gatttagtaa agcaggaaga tctgttgtta cttaacaga // LOCUS HSU28946 4264 bp mRNA PRI 04-MAY-1996 DEFINITION Human G/T mismatch binding protein (GTBP) mRNA, complete cds. ACCESSION U28946 NID g1294812 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4264) AUTHORS Palombo,F., Gallinari,P., Iaccarino,I., Lettieri,T., Hughes,M., D'Arrigo,A., Truong,O., Hsuan,J.J. and Jiricny,J. TITLE GTBP, a 160-kilodalton protein essential for mismatch-binding activity in human cells JOURNAL Science 268 (5219), 1912-1914 (1995) MEDLINE 95327934 REFERENCE 2 (bases 1 to 4264) AUTHORS Nicolaides,N.C., Palombo,F., Kinzler,K.W., Vogelstein,B. and Jiricny,J. TITLE Molecular cloning of the N-terminus of GTBP JOURNAL Genomics 31 (3), 395-397 (1996) MEDLINE 96435440 REFERENCE 3 (bases 1 to 4264) AUTHORS Jiricny,J. TITLE Direct Submission JOURNAL Submitted (12-JUN-1995) Josef Jiricny, Genetics Department, Istituto di Ricerche di Biol. Molecolare P. Angeletti (IRBM), Via Pontina Km 30.600, Pomezia, 00040, Italy FEATURES Location/Qualifiers source 1..4264 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="C1" /cell_line="HeLa" /clone_lib="HeLa S3 cDNA in lambda Uni-ZAP XR (Stratagene)" /chromosome="2" /map="2p16" gene 88..4170 /gene="GTBP" CDS 88..4170 /gene="GTBP" /note="homolog of bacterial MutS proteins; binds to G/T mismatches through heterodimerization with hMSH2; similar to ORF YD8557.04c, probable DNA repair protein, from S. cerevisiae chromosome IV cosmid 8557, PIR Accession Number S51246; somatic mutations found in colon cancer" /codon_start=1 /product="G/T mismatch binding protein" /db_xref="PID:g1294813" /translation="MSRQSTLYSFFPKSPALSDANKASARASREGGRAAAAPGASPSP GGDAAWSEAGPGPRPLARSASPPKAKNLNGGLRRSVAPAAPTSCDFSPGDLVWAKMEG YPWWPCLVYNHPFDGTFIREKGKSVRVHVQFFDDSPTRGWVSKRLLKPYTGSKSKEAQ KGGHFYSAKPEILRAMQRADEALNKDKIKRLELAVCDEPSEPEEEEEMEVGTTYVTDK SEEDNEIESEEEVQPKTQGSRRSSRQIKKRRVISDSESDIGGSDVEFKPDTKEEGSSD EISSGVGDSESEGLNSPVKVARKRKRMVTGNGSLKRKSSRKETPSATKQATSISSETK NTLRAFSAPQNSESQAHVSGGGDDSSRPTVWYHETLEWLKEEKRRDEHRRRPDHPDFD ASTLYVPEDFLNSCTPGMRKWWQIKSQNFDLVICYKVGKFYELYHMDALIGVSELGLV FMKGNWAHSGFPEIAFGRYSDSLVQKGYKVARVEQTETPEMMEARCRKMAHISKYDRV VRREICRIITKGTQTYSVLEGDPSENYSKYLLSLKEKEEDSSGHTRAYGVCFVDTSLG KFFIGQFSDDRHCSRFRTLVAHYPPVQVLFEKGNLSKETKTILKSSLSCSLQEGLIPG SQFWDASKTLRTLLEEEYFREKLSDGIGVMLPQVLKGMTSESDSIGLTPGEKSELALS ALGGCVFYLKKCLIDQELLSMANFEEYIPLDSDTVSTTRSGAIFTKAYQRMVLDAVTL NNLEIFLNGTNGSTEGTLLERVDTCHTPFGKRLLKQWLCAPLCNHYAINDRLDAIEDL MVVPDKISEVVELLKKLPDLERLLSKIHNVGSPLKSQNHPDSRAIMYEETTYSKKKII DFLSALEGFKVMCKIIGIMEEVADGFKSKILKQVISLQTKNPEGRFPDLTVELNRWDT AFDHEKARKTGLITPKAGFDSDYDQALADIRENEQSLLEYLEKQRNRIGCRTIVYWGI GRNRYQLEIPENFTTRNLPEEYELKSTKKGCKRYWTKTIEKKLANLINAEERRDVSLK DCMRRLFYNFDKNYKDWQSAVECIAVLDVLLCLANYSRGGDGPMCRPVILLPEDTPPF LELKGSRHPCITKTFFGDDFIPNDILIGCEEEEQENGKAYCVLVTGPNMGGKSTLMRQ AGLLAVMAQMGCYVPAEVCRLTPIDRVFTRLGASDRIMSGESTFFVELSETASILMHA TAHSLVLVDELGRGTATFDGTAIANAVVKELAETIKCRTLFSTHYHSLVEDYSQNVAV RLGHMACMVENECEDPSQETITFLYKFIKGACPKSYGFNAARLANLPEEVIQKGHRKA REFEKMNQSLRLFREVCLASERSTVDAEAVHKLLTLIKEL" BASE COUNT 1249 a 840 c 1076 g 1099 t ORIGIN 1 atttcccgcc agcaggagcc gcgcggtaga tgcggtgctt ttaggagctc cgtccgacag 61 aacggttggg ccttgccggc tgtcggtatg tcgcgacaga gcaccctgta cagcttcttc 121 cccaagtctc cggcgctgag tgatgccaac aaggcctcgg ccagggcctc acgcgaaggc 181 ggccgtgccg ccgctgcccc cggggcctct ccttccccag gcggggatgc ggcctggagc 241 gaggctgggc ctgggcccag gcccttggcg cgatccgcgt caccgcccaa ggcgaagaac 301 ctcaacggag ggctgcggag atcggtagcg cctgctgccc ccaccagttg tgacttctca 361 ccaggagatt tggtttgggc caagatggag ggttacccct ggtggccttg tctggtttac 421 aaccacccct ttgatggaac attcatccgc gagaaaggga aatcagtccg tgttcatgta 481 cagttttttg atgacagccc aacaaggggc tgggttagca aaaggctttt aaagccatat 541 acaggttcaa aatcaaagga agcccagaag ggaggtcatt tttacagtgc aaagcctgaa 601 atactgagag caatgcaacg tgcagatgaa gccttaaata aagacaagat taagaggctt 661 gaattggcag tttgtgatga gccctcagag ccagaagagg aagaagagat ggaggtaggc 721 acaacttacg taacagataa gagtgaagaa gataatgaaa ttgagagtga agaggaagta 781 cagcctaaga cacaaggatc taggcgaagt agccgccaaa taaaaaaacg aagggtcata 841 tcagattctg agagtgacat tggtggctct gatgtggaat ttaagccaga cactaaggag 901 gaaggaagca gtgatgaaat aagcagtgga gtgggggata gtgagagtga aggcctgaac 961 agccctgtca aagttgctcg aaagcggaag agaatggtga ctggaaatgg ctctcttaaa 1021 aggaaaagct ctaggaagga aacgccctca gccaccaaac aagcaactag catttcatca 1081 gaaaccaaga atactttgag agctttctct gcccctcaaa attctgaatc ccaagcccac 1141 gttagtggag gtggtgatga cagtagtcgc cctactgttt ggtatcatga aactttagaa 1201 tggcttaagg aggaaaagag aagagatgag cacaggagga ggcctgatca ccccgatttt 1261 gatgcatcta cactctatgt gcctgaggat ttcctcaatt cttgtactcc tgggatgagg 1321 aagtggtggc agattaagtc tcagaacttt gatcttgtca tctgttacaa ggtggggaaa 1381 ttttatgagc tgtaccacat ggatgctctt attggagtca gtgaactggg gctggtattc 1441 atgaaaggca actgggccca ttctggcttt cctgaaattg catttggccg ttattcagat 1501 tccctggtgc agaagggcta taaagtagca cgagtggaac agactgagac tccagaaatg 1561 atggaggcac gatgtagaaa gatggcacat atatccaagt atgatagagt ggtgaggagg 1621 gagatctgta ggatcattac caagggtaca cagacttaca gtgtgctgga aggtgatccc 1681 tctgagaact acagtaagta tcttcttagc ctcaaagaaa aagaggaaga ttcttctggc 1741 catactcgtg catatggtgt gtgctttgtt gatacttcac tgggaaagtt tttcataggt 1801 cagttttcag atgatcgcca ttgttcgaga tttaggactc tagtggcaca ctatccccca 1861 gtacaagttt tatttgaaaa aggaaatctc tcaaaggaaa ctaaaacaat tctaaagagt 1921 tcattgtcct gttctcttca ggaaggtctg atacccggct cccagttttg ggatgcatcc 1981 aaaactttga gaactctcct tgaggaagaa tattttaggg aaaagctaag tgatggcatt 2041 ggggtgatgt taccccaggt gcttaaaggt atgacttcag agtctgattc cattgggttg 2101 acaccaggag agaaaagtga attggccctc tctgctctag gtggttgtgt cttctacctc 2161 aaaaaatgcc ttattgatca ggagctttta tcaatggcta attttgaaga atatattccc 2221 ttggattctg acacagtcag cactacaaga tctggtgcta tcttcaccaa agcctatcaa 2281 cgaatggtgc tagatgcagt gacattaaac aacttggaga tttttctgaa tggaacaaat 2341 ggttctactg aaggaaccct actagagagg gttgatactt gccatactcc ttttggtaag 2401 cggctcctaa agcaatggct ttgtgcccca ctctgtaacc attatgctat taatgatcgt 2461 ctagatgcca tagaagacct catggttgtg cctgacaaaa tctccgaagt tgtagagctt 2521 ctaaagaagc ttccagatct tgagaggcta ctcagtaaaa ttcataatgt tgggtctccc 2581 ctgaagagtc agaaccaccc agacagcagg gctataatgt atgaagaaac tacatacagc 2641 aagaagaaga ttattgattt tctttctgct ctggaaggat tcaaagtaat gtgtaaaatt 2701 atagggatca tggaagaagt tgctgatggt tttaagtcta aaatccttaa gcaggtcatc 2761 tctctgcaga caaaaaatcc tgaaggtcgt tttcctgatt tgactgtaga attgaaccga 2821 tgggatacag cctttgacca tgaaaaggct cgaaagactg gacttattac tcccaaagca 2881 ggctttgact ctgattatga ccaagctctt gctgacataa gagaaaatga acagagcctc 2941 ctggaatacc tagagaaaca gcgcaacaga attggctgta ggaccatagt ctattggggg 3001 attggtagga accgttacca gctggaaatt cctgagaatt tcaccactcg caatttgcca 3061 gaagaatacg agttgaaatc taccaagaag ggctgtaaac gatactggac caaaactatt 3121 gaaaagaagt tggctaatct cataaatgct gaagaacgga gggatgtatc attgaaggac 3181 tgcatgcggc gactgttcta taactttgat aaaaattaca aggactggca gtctgctgta 3241 gagtgtatcg cagtgttgga tgttttactg tgcctggcta actatagtcg agggggtgat 3301 ggtcctatgt gtcgcccagt aattctgttg ccggaagata cccccccctt cttagagctt 3361 aaaggatcac gccatccttg cattacgaag actttttttg gagatgattt tattcctaat 3421 gacattctaa taggctgtga ggaagaggag caggaaaatg gcaaagccta ttgtgtgctt 3481 gttactggac caaatatggg gggcaagtct acgcttatga gacaggctgg cttattagct 3541 gtaatggccc agatgggttg ttacgtccct gctgaagtgt gcaggctcac accaattgat 3601 agagtgttta ctagacttgg tgcctcagac agaataatgt caggtgaaag tacatttttt 3661 gttgaattaa gtgaaactgc cagcatactc atgcatgcaa cagcacattc tctggtgctt 3721 gtggatgaat taggaagagg tactgcaaca tttgatggga cggcaatagc aaatgcagtt 3781 gttaaagaac ttgctgagac tataaaatgt cgtacattat tttcaactca ctaccattca 3841 ttagtagaag attattctca aaatgttgct gtgcgcctag gacatatggc atgcatggta 3901 gaaaatgaat gtgaagaccc cagccaggag actattacgt tcctctataa attcattaag 3961 ggagcttgtc ctaaaagcta tggctttaat gcagcaaggc ttgctaatct cccagaggaa 4021 gttattcaaa agggacatag aaaagcaaga gaatttgaga agatgaatca gtcactacga 4081 ttatttcggg aagtttgcct ggctagtgaa aggtcaactg tagatgctga agctgtccat 4141 aaattgctga ctttgattaa ggaattatag actgactaca ttggaagctt tgagttgact 4201 tctgaccaaa ggtggtaaat tcagacaaca ttatgatcta ataaacttta ttttttaaaa 4261 atga // LOCUS HSU28963 1168 bp mRNA PRI 19-DEC-1996 DEFINITION Human Gps2 (GPS2) mRNA, complete cds. ACCESSION U28963 NID g1049069 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1168) AUTHORS Spain,B.H., Bowdish,K.S., Pacal,A.R., Staub,S.F., Koo,D., Chang,C.Y., Xie,W. and Colicelli,J. TITLE Two human cDNAs, including a homolog of Arabidopsis FUS6 (COP11), suppress G-protein- and mitogen-activated protein kinase-mediated signal transduction in yeast and mammalian cells JOURNAL Mol. Cell. Biol. 16 (12), 6698-6706 (1996) MEDLINE 97098647 REFERENCE 2 (bases 1 to 1168) AUTHORS Bowdish,K.S. TITLE Direct Submission JOURNAL Submitted (12-JUN-1995) Katherine S. Bowdish, Lab. of Struct. Biol. and Mol. Med., University of California at Los Angeles, 900 Veteran Avenue, Los Angeles, CA 90024, USA COMMENT D. Jin and K. Jeang, Laboratory of Molecular Microbiology, National Institutes of Allergy and Infectious Diseases, Bethesda, MD 20892, USA; The previous mentioned are credited for discovering the silent mutation and the addition of a 't' at position 741. FEATURES Location/Qualifiers source 1..1168 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="17" /cell_line="U118-MG" /cell_type="glioblastoma" gene 91..1074 /gene="GPS2" CDS 91..1074 /gene="GPS2" /codon_start=1 /product="Gps2" /db_xref="PID:g1049070" /translation="MPALLERPKLSNAMARALHRHIMMERERKRQEEEEVDKMMEQKM KEEQERRKKKEMEERMSLEETKEQILKLEEKLLALQEEKHQLFLQLKKVLHEEEKRRR KEQSDLTTLTSAAYQQSLTVHTGTHLLSMQGSPGGHNRPGTLMAADRAKQMFGPQVLT TRHYVGSAAAFAGTPEHGQFQGSPGGAYGTAQPPPHYGPTQPAYSPSQQLRAPSAFPA VQYLSQPQPQPYAVHGHFQPTQTGFLQPGGALSLQKQMEHANQQTGFSDSSSLRPMHP QALHPAPGLLASPQLPVQMQPAGKSGFAATSQPGPRLPFIQHSQNPRFYHK" mutation 423 /gene="GPS2" /note="silent mutation" /replace="g" BASE COUNT 306 a 362 c 302 g 198 t ORIGIN 1 gagaaagcgc agagaaggac gggaccccgt ctgaggtctg gcagtcagag acagccgggc 61 gcccacggcc cgagcgccca cggcagcacc atgcccgcac tcctggagcg ccccaagctt 121 tccaacgcca tggccagggc gctgcaccgg cacattatga tggagcggga gcgcaagcgg 181 caggaggaag aagaggtgga taagatgatg gaacagaaga tgaaggaaga acaggagaga 241 aggaagaaaa aggagatgga agagagaatg tcattagagg agaccaagga acaaattctg 301 aagttggagg agaagctttt ggctctacag gaagagaagc accagctttt cctgcagctc 361 aagaaagttt tacatgagga agaaaaacgg aggcgaaagg aacagagtga cctgaccacc 421 ctaacatcag ctgcatacca gcagagcctg actgttcaca caggaactca tctcctcagc 481 atgcagggga gccctggagg acacaatcgc ccaggcaccc tcatggcagc tgacagagcc 541 aaacaaatgt ttggacccca agtgcttacg acccggcact acgtgggctc agcagctgct 601 tttgcaggga caccagagca tggacaattc caaggcagtc ctggtggtgc ctatgggact 661 gctcagcccc cacctcacta tgggcccaca cagccagctt atagtcctag tcagcagctc 721 agagctcctt cggcattccc tgcagtgcag tacctatctc agccacagcc acagccctat 781 gctgtgcatg gccactttca gcccactcag acaggtttcc tccagcctgg tggtgccctg 841 tccttgcaaa agcagatgga acatgctaac cagcagactg gcttctccga ctcatcctct 901 ctgcgcccca tgcaccccca ggctctgcat ccagcccctg gactccttgc ttccccccag 961 ctccctgtgc agatgcagcc agcaggaaag tcgggctttg cagctaccag ccaacctggc 1021 cctcggctcc ccttcatcca acacagccag aacccgcgat tctaccacaa gtgaccatca 1081 gattatatct tcaacaccac accccccacc ccatcgtggg tgagggtatc ccctgtgtgt 1141 cccaggccaa taaaatctac ctgccaat // LOCUS HSU28964 1030 bp mRNA PRI 17-JUL-1995 DEFINITION Human 14-3-3 protein mRNA, complete cds. ACCESSION U28964 NID g899458 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1030) AUTHORS Seluja,G.A., Pietromonaco,S.F. and Elias,L. TITLE Cloning and characterization of a novel 14-3-3 cDNA of human hematopoietic cells JOURNAL Unpublished REFERENCE 2 (bases 1 to 1030) AUTHORS Seluja,G.A. TITLE Direct Submission JOURNAL Submitted (12-JUN-1995) Gustavo A. Seluja, University of New Mexico, Cell Biology, 900 Camino de Salud, NE, Albuquerque, NM 87131, USA FEATURES Location/Qualifiers source 1..1030 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hBM5" /tissue_type="bone marrow" misc_signal 123..130 /note="Kozak sequence" CDS 127..864 /codon_start=1 /product="14-3-3 protein" /db_xref="PID:g899459" /translation="MDKNELVQKAKLAEQAERYDDMAACMKSVTEQGAELSNEERNLL SVAYKNVVGARRSSWRVVSSIEQKTEGAEKKQQMAREYREKIETELRDICNDVLSLLE KFLIPNASQAESKVFYLKMKGDYYRYLAEVAAGDDKKGIVDQSQQAYQEAFEISKKEM QPTHPIRLGLALNFSVFYYEILNSPEKACSLAKTAFDEAIAELDTLSEESYKDSTLIM QLLRDNLTLWTSDTQGDEAEAGEGGEN" BASE COUNT 309 a 222 c 242 g 257 t ORIGIN 1 gcctgtgagc agcgagatcc agggacagag tctcagcctc gccgctgctg ccgccgccgc 61 cgcccagaga ctgctgagcc cgtccgtccg ccgccaccac ccactccgga cacagaacat 121 ccagtcatgg ataaaaatga gctggttcag aaggccaaac tggccgagca ggctgagcga 181 tatgatgaca tggcagcctg catgaagtct gtaactgagc aaggagctga attatccaat 241 gaggagagga atcttctctc agttgcttat aaaaatgttg taggagcccg taggtcatct 301 tggagggtcg tctcaagtat tgaacaaaag acggaaggtg ctgagaaaaa acagcagatg 361 gctcgagaat acagagagaa aattgagacg gagctaagag atatctgcaa tgatgtactg 421 tctcttttgg aaaagttctt gatccccaat gcttcacaag cagagagcaa agtcttctat 481 ttgaaaatga aaggagatta ctaccgttac ttggctgagg ttgccgctgg tgatgacaag 541 aaagggattg tcgatcagtc acaacaagca taccaagaag cttttgaaat cagcaaaaag 601 gaaatgcaac caacacatcc tatcagactg ggtctggccc ttaacttctc tgtgttctat 661 tatgagattc tgaactcccc agagaaagcc tgctctcttg caaagacagc ttttgatgaa 721 gccattgctg aacttgatac attaagtgaa gagtcataca aagacagcac gctaataatg 781 caattactga gagacaactt gacattgtgg acatcggata cccaaggaga cgaagctgaa 841 gcaggagaag gaggggaaaa ttaaccggcc ttccaacttt tgtctgcctc attctaaaat 901 ttacacagta gaccatttgt catccatgct gtcccacaaa tagttttttg tttacgattt 961 atgacaggtt tatgttactt ctatttgaat ttctatattt cccatgtggt ttttatgttt 1021 aatattaggg // LOCUS HSU29089 1560 bp mRNA PRI 27-OCT-1995 DEFINITION Human proline- arginine-rich end leucine-rich repeat protein PRELP mRNA, complete cds. ACCESSION U29089 NID g886135 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1560) AUTHORS Bengtsson,E., Neame,P.J., Heinegard,D. and Sommarin,Y. TITLE The primary structure of a basic leucine-rich repeat protein, PRELP, found in connective tissues JOURNAL J. Biol. Chem. 270 (43), 25639-25644 (1995) MEDLINE 96029653 REFERENCE 2 (bases 1 to 1560) AUTHORS Sommarin,Y. TITLE Direct Submission JOURNAL Submitted (13-JUN-1995) Yngve Sommarin, Cell and Molecular Biology, Lund University, P.O. Box 94, Lund, 221 00, Sweden FEATURES Location/Qualifiers source 1..1560 /organism="Homo sapiens" /note="in lambdaZAP library" /db_xref="taxon:9606" /cell_type="articular chondrocyte" CDS 129..1277 /note="PRELP; connective tissue protein; similar to extracellular matrix family of leucine-rich repeat proteins, including human fibromodulin, Swiss-Prot Accession Number Q06828" /codon_start=1 /product="proline- arginine-rich end leucine-rich repeat protein" /db_xref="PID:g886136" /translation="MRSPLCWLLPLLILASVAQGQPTRRPRPGTGPGRRPRPRPRPTP SFPQPDEPAEPTDLPPPLPPGPPSIFPDCPRECYCPPDFPSALYCDSRNLRKVPVIPP RIHYLYLQNNFITELPVESFQNATGLRWINLDNNRIRKIDQRVLEKLPGLVFLYMEKN QLEEVPSALPRNLEQLRLSQNHISRIPPGVFSKLENLLLLDLQHNRLSDGVFKPDTFH GLKNLMQLNLAHNILRKMPPRVPTAIHQLYLDSNKIETIPNGYFKSFPNLAFIRLNYN KLTDRGLPKNSFNISNLLVLHLSHNRISSVPAINNRLEHLYLNNNSIEKINGTQICPN DLVAFHDFSSDLENVPHLRYLRLDGNYLKPPIPLDLMMCFRLLQSVVI" BASE COUNT 354 a 540 c 353 g 313 t ORIGIN 1 tctgggagat cagatcttct agctggctct ctgctgccac agctccgccg aagggagggg 61 gtggaagagg aggactaaac tcagagctga gaggagaggc aggtgtgtgc aggtgcatca 121 cctggatcat gaggtcaccc ctctgctggc tcctcccact tctcatcttg gcctcagtgg 181 cccaaggcca gccaacaaga cgaccaagac ccgggactgg gcccgggcgc agacccaggc 241 ccaggcccag gcccacaccc agctttcctc agcctgatga accagcagag ccaacagacc 301 tgcctcctcc cctccctcca ggccctccat ctatcttccc tgactgtccc cgcgaatgct 361 actgcccccc tgatttccca tctgccctct actgtgatag ccgcaacctg cgaaaggtcc 421 ctgtcatccc gccccgcatc cattacctct atctccagaa caacttcatc actgagctcc 481 cggtggagtc cttccagaat gccacaggcc tgcgatggat taacctggac aacaaccgaa 541 tccgcaagat agaccagagg gtgctggaga aactgcccgg cctggtgttc ctctacatgg 601 agaagaacca gttggaagag gtcccctcgg ccctgccccg gaacctggag cagctgaggc 661 tgagccagaa ccacatctcc agaatcccgc ctggtgtctt cagcaagctg gagaacctgc 721 tgctcctgga tctccagcac aacaggctga gcgacggcgt cttcaagccc gacaccttcc 781 atggcctcaa gaacctcatg cagctcaacc tggcccacaa catcctgaga aagatgccgc 841 ccagggtccc caccgccatt caccagctct acctggacag taacaagatt gagaccatcc 901 ctaacggata cttcaagagc tttcccaatc ttgccttcat tcggcttaac tacaacaagc 961 tgacagacag gggactcccc aagaactcct ttaatatctc caacctgctt gtgctccacc 1021 tgtcccacaa caggatcagc agtgtgcccg ccatcaacaa caggctggaa cacctgtacc 1081 tcaacaacaa tagcatcgag aaaatcaacg gaacccagat ttgccccaac gacctagtgg 1141 cgttccatga cttctcctcg gacctggaga acgtgccaca cctgcgctac ctgcggctgg 1201 atggaaacta cttgaagccg cccatcccgc tggacctcat gatgtgcttc cgcctcctgc 1261 agtccgtggt catctaggcc ctactccgcc accggatctg ctctgaccgc acttgaaggc 1321 tggggcccag gcacctgtgc cggccattcg ttttctctct ctccccttct ttctcccagc 1381 tttgcctccc ttatcccacc ctcgaggcag ggaaaagcca tctattcttc tgcagcctca 1441 ggagcgagac ttcaaggact cagtttggtt ccacccagtt gaaagacaac cagtgcacac 1501 ccaaactcct ggccttctgt ggtttccctt tgctccagaa acacagatgt gtctaaaaaa // LOCUS HSU29091 1429 bp mRNA PRI 14-JUN-1996 DEFINITION Human selenium-binding protein (hSBP) mRNA, complete cds. ACCESSION U29091 NID g1374791 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1429) AUTHORS Chang,P.W.G., Tsui,S. and Fung,K. TITLE Sequence determination of human fetal heart selenium-binding protein JOURNAL Unpublished REFERENCE 2 (bases 1 to 1429) AUTHORS Chang,P.W.G. TITLE Direct Submission JOURNAL Submitted (13-JUN-1995) P.W.G. Chang, Biochemistry, The Chinese University of Hong Kong, Rm414, B.M.S.B., Shatin, Hong Kong FEATURES Location/Qualifiers source 1..1429 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="M239" /tissue_type="heart" 5'UTR <1..4 /gene="hSBP" gene 1..1429 /gene="hSBP" CDS 5..1423 /gene="hSBP" /codon_start=1 /product="selenium-binding protein" /db_xref="PID:g1374792" /translation="MATKCGNCGPGYSTPLEAMKGPREEIVYLPCIYRNTGTEAPDYL ATVDVDPKSPQYCQVIHRLPMPNLKDELHHSGWNTYSSCFGDSTKSRNKLVLPSLISS RIYVVDVGSEPGPQKLHKVIEPKDIHAKCELACLHTSHCLASGEVMISSLGDVKGNGK GGFVLLDGETFEVKGTWERPGGAAPLGYDFWYQPRHNVMISTEWAAPNVLRDGFNPAD VEAGLYGSHLYVWDWQRHEIVQTLSLKDGLIPLEIRFLHNPSATQGFVGCASAPNIQR FYKTREGTWSVEKVIQVPPKKVKGWLLPGVPGLITDILLSLDDRFLYFSNWLHGDLRQ YDISDPQRPRLTGQLFLGGSIVKGGPVQVLEDEELKSQPEPLVVKGKRVAGGPQMIQL SLDGKRLYITTSLYSAWEKQFYPDLIREGSVMLQVDVDTVKGGLKLNPNCLVDFGKEP LGPALAHELRYPGGDCSSDIWI" 3'UTR 1424..>1429 /gene="hSBP" BASE COUNT 322 a 403 c 419 g 285 t ORIGIN 1 cagcatggct acgaaatgtg ggaattgtgg acccggctac tccacccctc tggaggccat 61 gaaaggaccc agggaagaga tcgtctacct gccctgcatt taccgaaaca caggcactga 121 ggccccagat tatctggcca ctgtggatgt tgaccccaag tctccccagt attgccaggt 181 catccaccgg ctgcccatgc ccaacctgaa ggacgagctg catcactcag gatggaacac 241 ctacagcagc tgcttcggtg atagcaccaa gtcgcgcaac aagctggtct tgcccagtct 301 catctcctct cgcatctatg tggtggacgt gggctctgag cccgggcccc aaaagctgca 361 caaggtcatt gagcccaagg acatccatgc caagtgcgaa ctggcctgtc tccacaccag 421 ccactgcctg gccagcgggg aagtgatgat cagctccctg ggggacgtca agggcaatgg 481 caaagggggt tttgtgctgc tggatgggga gacgttcgag gtgaagggga catgggagag 541 acctgggggt gctgcaccgt tgggctatga cttctggtac cagcctcgac acaatgtcat 601 gatcagcact gagtgggcag ctcccaatgt cttacgagat ggctttaacc ccgctgatgt 661 ggaggctgga ctgtacggga gccacttata tgtatgggac tggcagcgcc atgagattgt 721 gcagaccctg tctctaaaag atgggctgat acccttggag atccgcttcc tgcacaaccc 781 aagtgccacc cagggttttg taggctgtgc ctcagctcca aacatccagc gcttctacaa 841 aacgagggaa ggtacatggt cagtggagaa ggtgatccag gtgcccccca agaaagtgaa 901 gggctggctg ctgccagggg tgccaggcct gatcaccgac atcctgctct ccctggacga 961 ccgcttcctc tacttcagca actggctgca tggggacctg aggcagtatg acatctctga 1021 cccacagaga ccccgcctca caggacagct cttcctcgga ggcagcattg ttaagggagg 1081 ccctgtgcaa gtgctggagg acgaggaact aaagtcccag ccagagcccc tagtggtcaa 1141 gggaaaacgg gtggctggag gccctcagat gatccagctc agcctggatg gcaagcgcct 1201 ctacatcacc acgtcgctgt acagtgcctg ggaaaagcag ttttaccctg atctcatcag 1261 ggaaggctct gtaatgctgc aggttgatgt agacacagta aaaggagggc tgaagttgaa 1321 ccccaactgc ctggtggact tcgggaagga gccccttggc ccagccctgg ctcacgagct 1381 tcgctaccct gggggcgatt gtagctctga catctggatt tgaaggctc // LOCUS HSU29343 2805 bp mRNA PRI 05-NOV-1996 DEFINITION Human hyaluronan receptor (RHAMM) mRNA, complete cds. ACCESSION U29343 NID g1657697 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2805) AUTHORS Wang,C., Entwistle,J., Hou,G., Li,Q. and Turley,E.A. TITLE The characterization of a human RHAMM cDNA: conservation of the hyaluronan-binding domains JOURNAL Gene 174 (2), 299-306 (1996) MEDLINE 97045829 REFERENCE 2 (bases 1 to 2805) AUTHORS Wang,C., Hou,G., Entwistle,J. and Turley,E.A. TITLE Direct Submission JOURNAL Submitted (16-JUN-1995) Chao Wang, Cell Biology/Physiology, University of Manitoba, 100 Olivia Street, Winnipeg, Manitoba R3E 0v9, Canada FEATURES Location/Qualifiers source 1..2805 /organism="Homo sapiens" /note="normal human breast 5'-stretch cDNA expression library (Clontech Laboratories, Inc., Palo Alto, CA)" /db_xref="taxon:9606" /tissue_type="normal breast" /map="5q33.2-qter" /chromosome="5" gene 110..2287 /gene="RHAMM" CDS 110..2287 /gene="RHAMM" /codon_start=1 /product="hyaluronan receptor" /db_xref="PID:g1657698" /translation="MSFPKAPLKRFNDPSGCAPSPGAYDVKTLEVLKGPVSFQKSQRF KQQKESKQNLNVDKDTTLPASARKVKSSESKKESQKNDKDLKILEKEIRVLLQERGAQ DRRIQDLETELEKMEARLNAALREKTSLSANNATLEKQLIELTRTNELLKSKFSENGN QKNLRILSLELMKLRNKRETKMRGMMAKQEGMEMKLQVTQRSLEESQGKIAQLEGKLV SIEKEKIDEKSETEKLLEYIEEISCASDQVEKYKLDIAQLEENLKEKNDEILSLKQSL EDNIVILSKQVEDLNVKCQLLETEKEDHVNRNREHNENLNAEMQNLEQKFILEQREHE KLQQKELQIDSLLQQEKELSSSLHQKLCSFQEEMVKEKNLFEEELKQTLDELDKLQQK EEQAERLVKQLEEEAKSRAEELKLLEEKLKGKEAELEKSSAAHTQATLLLQEKYDSMV QSLEDVTAQFESYKALTASEIEDLKLENSSLQEKAAKAGKNAEDVQHQILATESSNQE YVRMLLDLQTKSALKETEIKEITVSFLQKITDLQNQLKQQEEDFRKQLEDEEGRKAEK ENTTAELTEEINKWRLLYEELYNKTKPFQLQLDAFEVEKQALLNEHGAAQEQLNKIRD SYAKLLGHQNLKQKIKHVVKLKDENSQLKSEVSKLRCQLAKKKQSETKLQEELNKVLG IKHFDPSKAFHHESKENFALKTPLKEGNTNCYRAPMECQESWK" BASE COUNT 1032 a 484 c 563 g 726 t ORIGIN 1 ccgccagtgt gatggatatc tgcagaattc ggcttactca ctatagggct cgagcggccg 61 cccgggcagg tgtgccagtc accttcagtt tctggagctg gccgtcaaca tgtcctttcc 121 taaggcgccc ttgaaacgat tcaatgaccc ttctggttgt gcaccatctc caggtgctta 181 tgatgttaaa actttagaag tattgaaagg accagtatcc tttcagaaat cacaaagatt 241 taaacaacaa aaagaatcta aacaaaatct taatgttgac aaagatacta ccttgcctgc 301 ttcagctaga aaagttaagt cttcggaatc aaagaaggaa tctcaaaaga atgataaaga 361 tttgaagata ttagagaaag agattcgtgt tcttctacag gaacgtggtg cccaggacag 421 gcggatccag gatctggaaa ctgagttgga aaagatggaa gcaaggctaa atgctgcact 481 aagggaaaaa acatctctct ctgcaaataa tgctacactg gaaaaacaac ttattgaatt 541 gaccaggact aatgaactac taaaatctaa gttttctgaa aatggtaacc agaagaattt 601 gagaattcta agcttggagt tgatgaaact tagaaacaaa agagaaacaa agatgagggg 661 tatgatggct aagcaagaag gcatggagat gaagctgcag gtcacccaaa ggagtctcga 721 agagtctcaa gggaaaatag cccaactgga gggaaaactt gtttcaatag agaaagaaaa 781 gattgatgaa aaatctgaaa cagaaaaact cttggaatac atcgaagaaa ttagttgtgc 841 ttcagatcaa gtggaaaaat acaagctaga tattgcccag ttagaagaaa atttgaaaga 901 gaagaatgat gaaattttaa gccttaagca gtctcttgag gacaatattg ttatattatc 961 taaacaagta gaagatctaa atgtgaaatg tcagctgctt gaaacagaaa aagaagacca 1021 tgtcaacagg aatagagaac acaacgaaaa tctaaatgca gagatgcaaa acttagaaca 1081 gaagtttatt cttgaacaac gggaacatga aaagcttcaa caaaaagaat tacaaattga 1141 ttcacttctg caacaagaga aagaattatc ttcgagtctt catcagaagc tctgttcttt 1201 tcaagaggaa atggttaaag agaagaatct gtttgaggaa gaattaaagc aaacactgga 1261 tgagcttgat aaattacagc aaaaggagga acaagctgaa aggctggtca agcaattgga 1321 agaggaagca aaatctagag ctgaagaatt aaaactccta gaagaaaagc tgaaagggaa 1381 ggaggctgaa ctggagaaaa gtagtgctgc tcatacccag gccaccctgc ttttgcagga 1441 aaagtatgac agtatggtgc aaagccttga agatgttact gctcaatttg aaagctataa 1501 agcgttaaca gccagtgaga tagaagatct taagctggag aactcatcat tacaggaaaa 1561 agcggccaag gctgggaaaa atgcagagga tgttcagcat cagattttgg caactgagag 1621 ctcaaatcaa gaatatgtaa ggatgcttct agatctgcag accaagtcag cactaaagga 1681 aacagaaatt aaagaaatca cagtttcttt tcttcaaaaa ataactgatt tgcagaacca 1741 actcaagcaa caggaggaag actttagaaa acagctggaa gatgaagaag gaagaaaagc 1801 tgaaaaagaa aatacaacag cagaattaac tgaagaaatt aacaagtggc gtctcctcta 1861 tgaagaacta tataataaaa caaaaccttt tcagctacaa ctagatgctt ttgaagtaga 1921 aaaacaggca ttgttgaatg aacatggtgc agctcaggaa cagctaaata aaataagaga 1981 ttcatatgct aaattattgg gtcatcagaa tttgaaacaa aaaatcaagc atgttgtgaa 2041 gttgaaagat gaaaatagcc aactcaaatc ggaagtatca aaactccgct gtcagcttgc 2101 taaaaaaaaa caaagtgaga caaaacttca agaggaattg aataaagttc taggtatcaa 2161 acactttgat ccttcaaagg cttttcatca tgaaagtaaa gaaaattttg ccctgaagac 2221 cccattaaaa gaaggcaata caaactgtta ccgagctcct atggagtgtc aagaatcatg 2281 gaagtaaaca tctgagaaac ctgttgaaga ttatttcatt cgtcttgttg ttattgatgt 2341 tgctgttatt atatttgaca tgggtatttt ataatgttgt atttaatttt aactgccaat 2401 ccttaaatat gtgaaaggaa cattttttac caaagtgtct tttgacattt tattttttct 2461 tgcaaatacc tcctccctaa tgctcacctt tatcacctca ttctgaaccc tttcgctggc 2521 tttccagctt agaatgcatc tcatcaactt aaaagtcagt atcatattat tatcctcctg 2581 ttctgaaacc ttagtttcaa gagtctaaac cccagattct tcagcttgat cctggaggct 2641 tttctagtct gagcttcttt agctaggcta aaacaccttg gcttgttatt gcctctactt 2701 tgattcttga taatgctcac ttggtcctac ctattatcct ttctacttgt ccagttcaaa 2761 taagaaataa ggacaagcct aacttcatag taacctctct atttt // LOCUS HSU29538 966 bp mRNA PRI 13-MAY-1997 DEFINITION Human heart protein with four and a half LIM domains (FHL-1) mRNA, complete cds. ACCESSION U29538 NID g2078479 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 966) AUTHORS Tsui,S.K.W., Lim,N.K., Fung,K.P., Waye,M.M.Y. and Lee,C.Y. TITLE The cloning, sequencing and characterization of two human heart proteins, FHL-1 and FHL-2, which contain four and a half LIM domains JOURNAL Unpublished REFERENCE 2 (bases 1 to 966) AUTHORS Waye,M.M.Y. TITLE Direct Submission JOURNAL Submitted (20-JUN-1995) Biochemistry Department, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong REFERENCE 3 (bases 1 to 966) AUTHORS Waye,M.M.Y. TITLE Direct Submission JOURNAL Submitted (12-MAY-1997) Biochemistry Department, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong REMARK Sequence update by submitter FEATURES Location/Qualifiers source 1..966 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="heart" /dev_stage="fetal" gene 22..864 /gene="FHL-1" CDS 22..864 /gene="FHL-1" /note="heart protein with four and a half LIM domains" /codon_start=1 /db_xref="PID:g2078480" /translation="MAEKFDCHYCRDPLQGKKYVQKDGHHCCLKCFDKFCANTCVECR KPIGADSKEVHYKNRFWHDTCFRCAKCLHPLANETFVAKDNKILCNKCTTREDSPKCK GCFKAIVAGDQNVEYKGTVWHKDCFTCSNCKQVIGTGSFFPKGEDFYCVTCHETKFAK HCVKCNKAITSGGITYQDQPWHADCFVCVTCSKKLAGQRFTAVEDQYYCVDCYKNFVA KKCAGCKNPITGFGKGSSVVAYEGQSWHDYCFHCKKCSVNLANKRFVFHQEQVYCPDC AKKL" BASE COUNT 236 a 247 c 259 g 224 t ORIGIN 1 tccagctaca aggtgggcac catggcggag aagtttgact gccactactg cagggatccc 61 ttgcagggga agaagtatgt gcaaaaggat ggccaccact gctgcctgaa atgctttgac 121 aagttctgtg ccaacacctg tgtggaatgc cgcaagccca tcggtgcgga ctccaaggag 181 gtgcactata agaaccgctt ctggcatgac acctgcttcc gctgtgccaa gtgccttcac 241 cccttggcca atgagacctt tgtggccaag gacaacaaga tcctgtgcaa caagtgcacc 301 actcgggagg actcccccaa gtgcaagggg tgcttcaagg ccattgtggc aggagatcaa 361 aacgtggagt acaaggggac cgtctggcac aaagactgct tcacctgtag taactgcaag 421 caagtcatcg ggactggaag cttcttccct aaaggggagg acttctactg cgtgacttgc 481 catgagacca agtttgccaa gcattgcgtg aagtgcaaca aggccatcac atctggagga 541 atcacttacc aggatcagcc ctggcatgcc gattgctttg tgtgtgttac ctgctctaag 601 aagctggctg ggcagcgttt caccgctgtg gaggaccagt attactgcgt ggattgctac 661 aagaactttg tggccaagaa gtgtgctgga tgcaagaacc ccatcactgg gtttggtaaa 721 ggctccagtg tggtggccta tgaaggacaa tcctggcacg actactgctt ccactgcaaa 781 aaatgctccg tgaatctggc caacaagcgc tttgttttcc accaggagca agtgtattgt 841 cccgactgtg ccaaaaagct gtaaactgac aggggctcct gtcctgtaaa atggcatttg 901 aatctcgttc tttgtgtcct tactttctgc cctataccat caatagggga agagtggtcc 961 ttccct // LOCUS HSU29589 3906 bp DNA PRI 24-JUL-1995 DEFINITION Human m3 muscarinic acetylcholine receptor (CHRM3) gene, complete cds. ACCESSION U29589 NID g903978 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3906) AUTHORS Bonner,T.I., Young,A.C., Brann,M.R. and Buckley,N.J. TITLE Cloning and expression of the human and rat m5 muscarinic acetylcholine receptor genes JOURNAL Neuron 1 (5), 403-410 (1988) MEDLINE 90166521 REFERENCE 2 (bases 1 to 3906) AUTHORS Bonner,T.I. TITLE Direct Submission JOURNAL Submitted (20-JUN-1995) Tom I. Bonner, Lab of Cell Biology, NIMH, National Institutes of Health, Bldg. 36, Room 3A-17, Bethesda, MD 20892-4090, USA FEATURES Location/Qualifiers source 1..3906 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1q43-44" /clone_lib="partial HaeIII-AluI genomic library of R.M. Lawn, E.F. Fritsch, R.C. Parker, G. Blake, and T. Maniatis, Cell 15, 1157-1174, 1978." intron <1..182 exon 183..3713 /note="based on comparison to the rat cDNA rather than a human cDNA" gene 202..1974 /gene="CHRM3" CDS 202..1974 /gene="CHRM3" /codon_start=1 /product="m3 muscarinic acetylcholine receptor" /db_xref="PID:g903979" /translation="MTLHNNSTTSPLFPNISSSWIHSPSDAGLPPGTVTHFGSYNVSR AAGNFSSPDGTTDDPLGGHTVWQVVFIAFLTGILALVTIIGNILVIVSFKVNKQLKTV NNYFLLSLACADLIIGVISMNLFTTYIIMNRWALGNLACDLWLAIDYVASNASVMNLL VISFDRYFSITRPLTYRAKRTTKRAGVMIGLAWVISFVLWAPAILFWQYFVGKRTVPP GECFIQFLSEPTITFGTAIAAFYMPVTIMTILYWRIYKETEKRTKELAGLQASGTEAE TENFVHPTGSSRSCSSYELQQQSMKRSNRRKYGRCHFWFTTKSWKPSSEQMDQDHSSS DSWNNNDAAASLENSASSDEEDIGSETRAIYSIVLKLPGHSTILNSTKLPSSDNLQVP EEELGMVDLERKADKLQAQKSVDDGGSFPKSFSKLPIQLESAVDTAKTSDVNSSVGKS TATLPLSFKEATLAKRFALKTRSQITKRKRMSLVKEKKAAQTLSAILLAFIITWTPYN IMVLVNTFCDSCIPKTFWNLGYWLCYINSTVNPVCYALCNKTFRTTFKMLLLCQCDKK KRRKQQYQQRQSVIFHKRAPEQAL" polyA_site 3714 /note="based on comparison to the rat cDNA rather than a human cDNA" BASE COUNT 1073 a 897 c 875 g 1061 t ORIGIN 1 atcgatgtct gtctgcccta gactccactt atttaaaata agagaatgaa cttgatgttt 61 ggcttcatag agattcagca ccctgtaata ggccttccat gtcttttaac gtatgtaatg 121 caaagaacaa acaaataaag gcagaaattt ttctaactct gtctcttctc tctttccccc 181 agactatgtc agagagtcac aatgaccttg cacaataaca gtacaacctc gcctttgttt 241 ccaaacatca gctcctcctg gatacacagc ccctccgatg cagggctgcc cccgggaacc 301 gtcactcatt tcggcagcta caatgtttct cgagcagctg gcaatttctc ctctccagac 361 ggtaccaccg atgaccctct gggaggtcat accgtctggc aagtggtctt catcgctttc 421 ttaacgggca tcctggcctt ggtgaccatc atcggcaaca tcctggtaat tgtgtcattt 481 aaggtcaaca agcagctgaa gacggtcaac aactacttcc tcttaagcct ggcctgtgcc 541 gatctgatta tcggggtcat ttcaatgaat ctgtttacga cctacatcat catgaatcga 601 tgggccttag ggaacttggc ctgtgacctc tggcttgcca ttgactacgt agccagcaat 661 gcctctgtta tgaatcttct ggtcatcagc tttgacagat acttttccat cacgaggccg 721 ctcacgtacc gagccaaacg aacaacaaag agagccggtg tgatgatcgg tctggcttgg 781 gtcatctcct ttgtcctttg ggctcctgcc atcttgttct ggcaatactt tgttggaaag 841 agaactgtgc ctccgggaga gtgcttcatt cagttcctca gtgagcccac cattactttt 901 ggcacagcca tcgctgcttt ttatatgcct gtcaccatta tgactatttt atactggagg 961 atctataagg aaactgaaaa gcgtaccaaa gagcttgctg gcctgcaagc ctctgggaca 1021 gaggcagaga cagaaaactt tgtccacccc acgggcagtt ctcgaagctg cagcagttac 1081 gaacttcaac agcaaagcat gaaacgctcc aacaggagga agtatggccg ctgccacttc 1141 tggttcacaa ccaagagctg gaaacccagc tccgagcaga tggaccaaga ccacagcagc 1201 agtgacagtt ggaacaacaa tgatgctgct gcctccctgg agaactccgc ctcctccgac 1261 gaggaggaca ttggctccga gacgagagcc atctactcca tcgtgctcaa gcttccgggt 1321 cacagcacca tcctcaactc caccaagtta ccctcatcgg acaacctgca ggtgcctgag 1381 gaggagctgg ggatggtgga cttggagagg aaagccgaca agctgcaggc ccagaagagc 1441 gtggacgatg gaggcagttt tccaaaaagc ttctccaagc ttcccatcca gctagagtca 1501 gccgtggaca cagctaagac ttctgacgtc aactcctcag tgggtaagag cacggccact 1561 ctacctctgt ccttcaagga agccactctg gccaagaggt ttgctctgaa gaccagaagt 1621 cagatcacta agcggaaaag gatgtccctg gtcaaggaga agaaagcggc ccagaccctc 1681 agtgcgatct tgcttgcctt catcatcact tggaccccat acaacatcat ggttctggtg 1741 aacacctttt gtgacagctg catacccaaa accttttgga atctgggcta ctggctgtgc 1801 tacatcaaca gcaccgtgaa ccccgtgtgc tatgctctgt gcaacaaaac attcagaacc 1861 actttcaaga tgctgctgct gtgccagtgt gacaaaaaaa agaggcgcaa gcagcagtac 1921 cagcagagac agtcggtcat ttttcacaag cgcgcacccg agcaggcctt gtagaatgag 1981 gttgtatcaa tagcagtgac aaaacgcaca catcaaccca cagaccttag gaggaggaag 2041 gcgagggcgg ggtgacttct ggtgatgata aaaatggttt tatcacccag atgtgaaaga 2101 agctgcctgt ttactgatcc attgaataaa cccattttaa tagaaaaagt caataccaat 2161 tcagcaaaaa gaaaaaaaaa acatactact gaatataaag aaatttattc tgaaatagac 2221 tttacgtgtt tttttcttaa agaggagaaa aatattgctt gacggcaatt atatacccaa 2281 agtgatttgc ctgggtcctt taattcccat tagctttgga atctcagatg agcatagctg 2341 acccagttcc cacattcttc ccaaggatcc aaaagtggga atccagaccc caagtggaac 2401 actgcaggct tacgaatctg tggttccaaa attatttcat acgttgcaaa gctgaatctt 2461 cttgtcccaa tagagcttcc tgtcttttct ttggtgtgtt gttaaactct atttgtggac 2521 ttgattcttg attcttgcaa agtactgttt tgtgcagttc aagtttcgta caaataaaat 2581 acttaagtat atatatatgt gtgagttctg cacgcacaca catagtgtat ataatatcat 2641 gggaaacact gaactggcaa attattcctg caacatacgc tttcagtact ttggtaactg 2701 aagttctcta ggatcctaat gcaacattaa cgtgaaataa gcccagtgta atgtttttgc 2761 aaaccagggc tgttttccac agagagcagc caggccttcc cagcaggtct gtgcagagcg 2821 gacaggctcg tgagtcagct gagcgccgtg gcttcgccag acttggtgtt aagcaacctc 2881 ctttgttgat gtctcaacag agctaaatcg gggcccctct gagctcaaag aatgaaccac 2941 atccacacgt ttgaatttaa tcatctaaat ctgaatgttt cagaacaaaa tttctgctat 3001 ctaaactgct tgaaactcaa taatagtgtc acgtttgaat gtcatacaca gcaatatata 3061 tatatgtgta tatatatata tatggcaaag caaaaaaaaa aacatggtaa gagagaatga 3121 aggagaacat tgtgtttgat tcttgctgaa tggcaccttc tcaaagaaaa tagggcttgc 3181 acctttgtta atcagctgtg gccagtgctt tctggtgttc attgtgtaac cttcacccag 3241 gaataggtga ggttttagga agttacatgt cctctgaaga aagaattaca ctctgaaaag 3301 taatgcttca aattgatttc cttacctttt gggaaaaaaa aaaaattgtt tttttgcatt 3361 ctcccttgaa ttgaccaaaa tgttaactgt ttcatttggg gaggggatgg ggtgctgcca 3421 tcattgtcgt tgttgttgct gctgtagctg ttggggtttc ttttcctgtt gccggggctg 3481 tttggggaga gggaggggag ggaggtggga gggccgcgga gatatcttcc cctttgtaca 3541 gggcattctg tgttgtgaac ccagagctgg gtagaagctg cttttgtatt cagtgtgagg 3601 tggtgtttac agacgacttt gacaacagta gaagtgtact cagtggtgtc tgtgtatctg 3661 aactatttaa tttcgtgtta tgtttatatg cagaaatatt tatggatact acaccaagtg 3721 tttatttatt gttgataaat atgactcttc agtcgtcagc catggtgtcc tttcaaatga 3781 ttctttaagg tccacttgag caatgaatag agtatattgg agctttcctg tggctaagaa 3841 gaagaaacat gtcatcctgt tgccatcacc aagcacctaa ctctttctag gtaataaaaa 3901 gtcaac // LOCUS HSU29615 1633 bp mRNA PRI 04-NOV-1995 DEFINITION Human chitotriosidase precursor mRNA, complete cds. ACCESSION U29615 NID g1050957 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1633) AUTHORS Boot,R.G., Renkema,G.H., Strijland,A., van Zonneveld,A.J. and Aerts,J.M. TITLE Cloning of a cDNA encoding chitotriosidase, a human chitinase produced by macrophages JOURNAL J. Biol. Chem. 270 (44), 26252-26256 (1995) MEDLINE 96064695 REFERENCE 2 (bases 1 to 1633) AUTHORS Boot,R.G. TITLE Direct Submission JOURNAL Submitted (21-JUN-1995) Rolf G. Boot, Department of Biochemistry, AMC, E.C. Slater, Institute, University of Amsterdam, Meibergdreef 15, 1105 AZ, Amsterdam, The Netherlands FEATURES Location/Qualifiers source 1..1633 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="Ch1.1" /clone_lib="macrophage-1 cDNA library of Rolf G. Boot" /cell_type="macrophage" sig_peptide 13..75 CDS 13..1413 /codon_start=1 /product="chitotriosidase precursor" /db_xref="PID:g1050958" /translation="MVRSVAWAGFMVLLMIPWGSAAKLVCYFTNWAQYRQGEARFLPK DLDPSLCTHLIYAFAGMTNHQLSTTEWNDETLYQEFNGLKKMNPKLKTLLAIGGWNFG TQKFTDMVATANNRQTFVNSAIRFLRKYSFDGLDLDWEYPGSQGSPAVDKERFTTLVQ DLANAFQQEAQTSGKERLLLSAAVPAGQTYVDAGYEVDKIAQNLDFVNLMAYDFHGSW EKVTGHNSPLYKRQEESGAAASLNVDAAVQQWLQKGTPASKLILGMPTYGRSFTLASS SDTRVGAPATGSGTPGPFTKEGGMLAYYEVCSWKGATKQRIQDQKVPYIFRDNQWVGF DDVESFKTKVSYLKQKGLGGAMVWALDLDDFAGFSCNQGRYPLIQTLRQELSLPYLPS GTPELEVPKPGQPSEPEHGPSPGQDTFCQGKADGLYPNPRERSSFYSCAAGRLFQQSC PTGLVFSNSCKCCTWN" mat_peptide 76..1410 /note="human chitinase" /product="chitotriosidase" polyA_signal 1605..1610 polyA_site 1633 /note="10 A nucleotides" BASE COUNT 354 a 489 c 443 g 347 t ORIGIN 1 ctgagctgca tcatggtgcg gtctgtggcc tgggcaggtt tcatggtcct gctgatgatc 61 ccatggggct ctgctgcaaa actggtctgc tacttcacca actgggccca gtacagacag 121 ggggaggctc gcttcctgcc caaggacttg gaccccagcc tttgcaccca cctcatctac 181 gccttcgctg gcatgaccaa ccaccagctg agcaccactg agtggaatga cgagactctc 241 taccaggagt tcaatggcct gaagaagatg aatcccaagc tgaagaccct gttagccatc 301 ggaggctgga atttcggcac tcagaagttc acagatatgg tagccacggc caacaaccgt 361 cagacctttg tcaactcggc catcaggttt ctgcgcaaat acagctttga cggccttgac 421 cttgactggg agtacccagg aagccagggg agccctgccg tagacaagga gcgcttcaca 481 accctggtac aggacttggc caatgccttc cagcaggaag cccagacctc agggaaggaa 541 cgccttcttc tgagtgcagc ggttccagct gggcagacct atgtggatgc tggatacgag 601 gtggacaaaa tcgcccagaa cctggatttt gtcaacctta tggcctacga cttccatggc 661 tcttgggaga aggtcacggg acataacagc cccctctaca agaggcaaga agagagtggt 721 gcagcagcca gcctcaacgt ggatgctgct gtgcaacagt ggctgcagaa ggggacccct 781 gccagcaagc tgatccttgg catgcctacc tacggacgct ccttcacact ggcctcctca 841 tcagacacca gagtgggggc cccagccaca gggtctggca ctccaggccc cttcaccaag 901 gaaggaggga tgctggccta ctatgaagtc tgctcctgga agggggccac caaacagaga 961 atccaggatc agaaggtgcc ctacatcttc cgggacaacc agtgggtggg ctttgatgat 1021 gtggagagct tcaaaaccaa ggtcagctat ctgaagcaga agggactggg cggggccatg 1081 gtctgggcac tggacttaga tgactttgcc ggcttctcct gcaaccaggg ccgatacccc 1141 ctcatccaga cgctacggca ggaactgagt cttccatact tgccttcagg caccccagag 1201 cttgaagttc caaaaccagg tcagccctct gaacctgagc atggccccag ccctggacaa 1261 gacacgttct gccagggcaa agctgatggg ctctatccca atcctcggga acggtccagc 1321 ttctacagct gtgcagcggg gcggctgttc cagcaaagct gcccgacagg cctggtgttc 1381 agcaactcct gcaaatgctg cacctggaat tgagtcgtaa agcccctcca gtccagcttt 1441 gaggctgggc ccaggatcac tctacagcct gcctcctggg ttttcctggg ggccgcaatc 1501 tggctcctgc aggcctttct gtggtcttcc tttatccagg ctttctgctc tcagccttgc 1561 cttccttttt tctgggtctc ctgggctgcc cctttcactt gcaaaataaa tctttggttt 1621 gtgcccctct tca // LOCUS HSU29656 849 bp mRNA PRI 05-JAN-1996 DEFINITION Human DR-nm23 mRNA, complete cds. ACCESSION U29656 NID g1051255 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 840) AUTHORS Venturelli,D., Martinez,R., Melotti,P., Casella,I., Peschle,C., Cucco,C., Spampinato,G., Darzynkiewicz,Z. and Calabretta,B. TITLE Overexpression of DR-nm23, a protein encoded by a member of the nm23 gene family, inhibits granulocyte differentiation and induces apoptosis in 32Dc13 myeloid cells JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (16), 7435-7439 (1995) MEDLINE 95365382 REFERENCE 2 (bases 1 to 849) AUTHORS Calabretta,B. TITLE Direct Submission JOURNAL Submitted (20-JUN-1995) Bruno Calabretta, Genetics, Thomas Jefferson University, 233 S. 10th Street, Philadelphia, PA 19107, USA FEATURES Location/Qualifiers source 1..849 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="126" gene 19..525 /gene="DR-nm23" CDS 19..525 /gene="DR-nm23" /codon_start=1 /db_xref="PID:g1051256" /translation="MICLVLTIFANLFPAACTGAHERTFLAVKPDGVQRRLVGEIVRR FERKGFKLVALKLVQSSEELLREHYAELRERPFYGRLVKYMASGPVVAMVWQGLDVVR TSRALIGATNPADAPPGTIRGDFCIEVGNLIHGSDSVESARREIALWFRADELLCWED SAGHWLYE" polyA_signal 809..815 BASE COUNT 145 a 272 c 269 g 163 t ORIGIN 1 cgctcccgca ccgccatcat gatctgcctg gtgctgacca tcttcgctaa cctcttcccc 61 gcggcctgca ccggcgcaca cgaacgcacc ttcctggccg tgaagccgga cggcgtgcag 121 cggcggctgg tgggcgagat tgtgcggcgc ttcgagagga agggcttcaa gttggtggcg 181 ctgaagctgg tgcagtcctc cgaggagctg ctgcgtgagc actacgccga gctgcgtgaa 241 cgcccgttct acggccgcct tgtcaagtat atggcctccg ggccggtggt ggccatggtt 301 tggcaggggc tggacgtggt gcgcacctcg cgggcgctca tcggagccac gaacccggcc 361 gacgccccgc ccggcaccat ccgcggggat ttctgcatcg aggttggcaa cctgattcac 421 ggcagcgact cggtggagag tgcccgccgc gagatcgctc tctggttccg cgcagacgag 481 ctcctctgct gggaggacag cgctgggcac tggctgtatg agtagcccgg cagatgcgcg 541 tcacagaggc tctcacattc cagcctcctc cagggcccag gtgggcggct tctggcccca 601 ccccacagcg cttggagcat ccctttggac gggctgctga acatccacct gtctggacgt 661 tgcatggagg gtggcgcagc ctctccaatc cctggcgtac agggtttcct gcccgaggac 721 ctgctccagg agcctgcgcg gctcgcctgg aaacgtgcca ggagcactgt cctggtgccc 781 agcccaacgt ggtccaaggt ttttttataa ttaaagtcct cgttttcgtt aaaaaaaaaa 841 aaaaaaaaa // LOCUS HSU2AF 1707 bp RNA PRI 11-AUG-1992 DEFINITION H.sapiens mmRNA for large subunit of splicing factor U2AF. ACCESSION X64044 NID g37544 KEYWORDS splicing factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1707) AUTHORS Zamore,P.D., Patton,J.G. and Green,M.R. TITLE Cloning and domain structure of the mammalian splicing factor U2AF JOURNAL Nature 355 (6361), 609-614 (1992) MEDLINE 92168111 REFERENCE 2 (bases 1 to 1707) AUTHORS Zamore,P.D. TITLE Direct Submission JOURNAL Submitted (29-JUL-1992) P.D. Zamore, Univ. of Massachusetts Medical Center, Program in Molecular Medicine, 373 Plantation Street, Worcester, 01605 Massachusetts, USA FEATURES Location/Qualifiers source 1..1707 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 41..1468 /note="large subunit" /codon_start=1 /product="splicing factor U2AF" /db_xref="PID:g37545" /db_xref="SWISS-PROT:P26368" /translation="MSDFDEFERQLNENKQERDKENRHRKRSHSRSRSRDRKRRSRSR DRRNRDQRSASRDRRRRSKPLTRGAKEEHGGLIRSPRHEKKKKVRKYWDVPPPGFEHI TPMQYKAMQAAGQIPATALLPTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGI TEEAMMDFFNAQMRLGGLTQAPGNPVLAVQINQDKNFAFLEFRSVDETTQAMAFDGII FQGQSLKIRRPHDYQPLPGMSENPSVYVPGVVSTVVPDSAHKLFIGGLPNYLNDDQVK ELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQR ASVGAKNATLVSPPSTINQTPVTLQVPGLMSSQVQMGGHPTEVLCLMNMVLPEELLDD EEYEEIVEDVRDECSKYGLVKSIEIPRPVDGVEVPGCGKIFVEFTSVFDCQKAMQGLT GRKFANRVVVTKYCDPDSYHRRDFW" BASE COUNT 366 a 509 c 548 g 284 t ORIGIN 1 gaggcgaaag ctgcacaggg ccctacgcgg ccgcctcagc atgtcggact tcgacgagtt 61 cgagcggcag ctcaacgaga ataaacaaga gcgggacaag gagaaccggc atcggaagcg 121 cagccacagc cgctctcgga gccgggaccg caaacgccgg agccggagcc gcgaccggcg 181 caaccgggac cagcggagcg cctcccggga caggcgacga cgcagcaaac ctttgaccag 241 aggcgctaaa gaggagcacg gtggactgat tcgttccccc cgccacgaga agaagaagaa 301 ggtccgtaaa tactgggacg tgccaccccc aggctttgag cacatcaccc caatgcagta 361 caaggccatg caagctgcgg gtcagattcc agccactgct cttctcccca ccatgacccc 421 tgacggtctg gctgtgaccc caacgccggt gcccgtggtc gggagccaga tgaccagaca 481 agcccggcgc ctctacgtgg gcaacatccc ctttggcatc actgaggagg ccatgatgga 541 tttcttcaac gcccagatgc gcctgggggg gctgacccag gcccctggca acccagtgtt 601 ggctgtgcag attaaccagg acaagaattt tgcctttttg gagttccgct cagtggacga 661 gactacccag gctatggcct ttgatggcat catcttccag ggccagtcac taaagatccg 721 caggcctcac gactaccagc cgcttcctgg catgtcagag aacccctccg tctatgtgcc 781 tggggttgtg tccactgtgg tccccgactc tgcccacaag ctgttcatcg ggggcttacc 841 caactacctg aacgatgacc aggtcaaaga gctgctgaca tcctttgggc ccctcaaggc 901 cttcaacctg gtcaaggaca gtgccacggg gctctccaag ggctacgcct tctgtgagta 961 cgtggacatc aacgtcacgg atcaggccat tgcggggctg aacggcatgc agctggggga 1021 taagaagctg ctggtccaga gggcgagtgt gggagccaag aatgccacgc tggtgagccc 1081 cccgagcacc atcaatcaga cgcctgtgac cctgcaagtg ccgggcttga tgagctccca 1141 ggtgcagatg ggcggccacc cgactgaggt cctgtgcctc atgaacatgg tgctgcctga 1201 ggagctgctg gacgacgagg agtatgagga gatcgtggag gatgtgcggg acgagtgcag 1261 caagtacggg cttgtcaagt ccatcgagat cccccggcct gtggacggcg tcgaggtgcc 1321 cggctgcgga aagatctttg tggagttcac ctctgtgttt gactgccaga aagccatgca 1381 gggcctgacg ggccgcaagt tcgccaacag agtggttgtc acaaaatact gtgaccccga 1441 ctcttatcac cgccgggact tctggtagag gcggctgggg gagggtgggg gcagggctgg 1501 ctgggggctt ctccccactc ccgccccccc ccttatcccc ctctgaagac gatgggcaga 1561 ggagtgacag ccgcagacac acgacagccg gcagcaactg gaatggcagc aattaagggt 1621 ggggagcggg ggttgggggg ttggggggtt agggcaggga ggggactggg gaagtgcgca 1681 cacagcccca cacagacaac acgcacg // LOCUS HSU2AR 1054 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for U2 snRNP-specific A' protein. ACCESSION X13482 NID g37546 KEYWORDS A' protein; small nuclear ribonucleic particle; U2 small nuclear RNP; U2 snRNP-specific A' protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1054) AUTHORS Sillekens,P.T.G. TITLE Direct Submission JOURNAL Submitted (05-NOV-1988) Sillekens P.T.G., Dept of Biochemistry, University of Nijmegen, St Adelbertusplein 1, PO Box 9101, 6500 HB Nijmegen, The Netherlands REFERENCE 2 (bases 1 to 1054) AUTHORS Sillekens,P.T., Beijer,R.P., Habets,W.J. and van Verooij,W.J. TITLE Molecular cloning of the cDNA for the human U2 snRNA-specific A' protein JOURNAL Nucleic Acids Res. 17 (5), 1893-1906 (1989) MEDLINE 89183600 FEATURES Location/Qualifiers source 1..1054 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="teratocarcinoma" /clone_lib="lambda gt11" CDS 57..824 /note="U2 snRNP-specific A' protein (AA 1-255)" /codon_start=1 /db_xref="PID:g37547" /db_xref="SWISS-PROT:P09661" /translation="MVKLTAELIEQAAQYTNAVRDRELDLRGYKIPVIENLGATLDQF DAIDFSDNEIRKLDGFPLLRRLKTLLVNNNRICRIGEGLDQALPCLTELILTNNSLVE LGDLDPLASLKSLTYLSILRNPVTNKKHYRLYVIYKVPQVRVLDFQKVKLKERQEAEK MFKGKRGAQLAKDIARRSKTFNPGAGLPTDKKRGGPSPGDVEAIKNAIANASTLAEVE RLKGLLQSGQIPGRERRSGPTDDGEEEMEEDTVTNGS" misc_feature 1012..1017 /note="polyA signal" BASE COUNT 326 a 202 c 274 g 252 t ORIGIN 1 gaattccgcg ggaggccacg ggctttccac agcgcggggg aacgggaggc tgcaggatgg 61 tcaagctgac ggcggagctg atcgagcagg cggcgcagta caccaacgcg gtgcgcgacc 121 gggagctgga cctccggggg tataaaattc ccgtcattga aaatctaggt gctacgttag 181 accagtttga tgctattgat ttttctgaca atgagatcag gaaactggat ggttttcctt 241 tgttgagaag actgaaaaca ttgttagtga acaacaacag aatatgccgt ataggtgagg 301 gacttgatca ggctctgccc tgtctgacag aactcattct caccaataat agtctcgtgg 361 aactgggtga tctggaccct ctggcatctc tcaaatcgct gacttaccta agtatcctaa 421 gaaatccggt aaccaataag aagcattaca gattgtatgt gatttataaa gttccgcaag 481 tcagagtact ggatttccag aaagtgaaac taaaagagcg tcaggaagca gagaaaatgt 541 tcaagggcaa acggggtgca cagcttgcaa aggatattgc caggagaagc aaaactttta 601 atccaggtgc tggtttgcca actgacaaaa agagaggtgg gccatctcca ggggatgtag 661 aagcaatcaa gaatgccata gcaaatgctt caactctggc tgaagtggag aggctgaagg 721 ggttgctgca gtctggtcag atccctggca gagaacgcag atcagggccc actgatgatg 781 gtgaagaaga gatggaagaa gacacagtca caaacgggtc ctgagcagtg aggcagatgt 841 ataataatag gccctcttgg aacaagtctt gcttttcgaa catggtataa tagccttgtt 901 tgtgttagca aagtggaatc tatcagcatt gttgaaatgc ttaagactgc tgctgataat 961 tttgtaatat aagttttgaa atctaaatgt caattttcta caaattataa aaataaactc 1021 cactctctat gctaaaaaaa aaaaaaagga attc // LOCUS HSU30246 4098 bp mRNA PRI 05-JUL-1996 DEFINITION Human bumetanide-sensitive Na-K-Cl cotransporter (NKCC1) mRNA, complete cds. ACCESSION U30246 NID g903681 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4098) AUTHORS Payne,J.A., Xu,J.C., Haas,M., Lytle,C.Y., Ward,D. and Forbush,B. 3rd. TITLE Primary structure, functional expression, and chromosomal localization of the bumetanide-sensitive Na-K-Cl cotransporter in human colon JOURNAL J. Biol. Chem. 270 (30), 17977-17985 (1995) MEDLINE 95355397 REFERENCE 2 (bases 1 to 1212) AUTHORS Gillen,C.M., Brill,S., Payne,J.A. and Forbush,B. 3rd. TITLE Molecular cloning and functional expression of the K-Cl cotransporter from rabbit, rat, and human. A new member of the cation-chloride cotransporter family JOURNAL J. Biol. Chem. 271 (27), 16237-16244 (1996) MEDLINE 96279170 REFERENCE 3 (bases 1 to 4098) AUTHORS Payne,J.A. TITLE Direct Submission JOURNAL Submitted (23-JUN-1995) John A. Payne, Human Physiology, University of California, School of Medicine, Medical Sciences Bldg.1A #4138, Davis, CA 95616, USA FEATURES Location/Qualifiers source 1..4098 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="TEF 1-1 and TEF 11a" /clone_lib="Stratagene T84 library of J. Riordan" /cell_line="T84 cell line" /tissue_type="colon tumor" gene 165..3803 /gene="NKCC1" CDS 165..3803 /gene="NKCC1" /codon_start=1 /product="bumetanide-sensitive Na-K-Cl cotransporter" /db_xref="PID:g903682" /translation="MEPRPTAPSSGAPGLAGVGETPSAAALAAARVELPGTAVPSVPE DAAPASRDGGGVRDEGPAAAGDGLGRPLGPTPSQSRFQVDLVSENAGRAAAAAAAAAA AAAAAGAGAGAKQTPADGEASGESEPAKGSEEAKGRFRVNFVDPAASSSAEDSLSDAA GVGVDGPNVSFQNGGDTVLSEGSSLHSGGGGGSGHHQHYYYDTHTNTYYLRTFGHNTM DAVPRIDHYRHTAAQLGEKLLRPSLAELHDELEKEPFEDGFANGEESTPTRDAVVTYT AESKGVVKFGWIKGVLVRCMLNIWGVMLFIRLSWIVGQAGIGLSVLVIMMATVVTTIT GLSTSAIATNGFVRGGGAYYLISRSLGPEFGGAIGLIFAFANAVAVAMYVVGFAETVV ELLKEHSILMIDEINDIRIIGAITVVILLGISVAGMEWEAKAQIVLLVILLLAIGDFV IGTFIPLESKKPKGFFGYKSEIFNENFGPDFREEETFFSVFAIFFPAATGILAGANIS GDLADPQSAIPKGTLLAILITTLVYVGIAVSVGSCVVRDATGNVNDTIVTELTNCTSA ACKLNFDFSSCESSPCSYGLMNNFQVMSMVSGFTPLISAGIFSATLSSALASLVSAPK IFQALCKDNIYPAFQMFAKGYGKNNEPLRGYILTFLIALGFILIAELNVIAPIISNFF LASYALINFSVFHASLAKSPGWRPAFKYYNMWISLLGAILCCIVMFVINWWAALLTYV IVLGLYIYVTYKKPDVNWGSSTQALTYLNALQHSIRLSGVEDHVKNFRPQCLVMTGAP NSRPALLHLVHDFTKNVGLMICGHVHMGPRRQAMKEMSIDQAKYQRWLIKNKMKAFYA PVHADDLREGAQYLMQAAGLGRMKPNTLVLGFKKDWLQADMRDVDMYINLFHDAFDIQ YGVVVIRLKEGLDISHLQGQEELLSSQEKSPGTKDVVVSVEYSKKSDLDTSKPLSEKP ITHKVEEEDGKTATQPLLKKESKGPIVPLNVADQKLLEASTQFQKKQGKNTIDVWWLF DDGGLTLLIPYLLTTKKKWKDCKIRVFIGGKINRIDHDRRAMATLLSKFRIDFSDIMV LGDINTKPKKENIIAFEEIIEPYRLHEDDKEQDIADKMKEDEPWRITDNELELYKTKT YRQIRLNELLKEHSSTANIIVMSLPVARKGAVSSALYMAWLEALSKDLPPILLVRGNH QSVLTFYS" polyA_signal 4075..4080 BASE COUNT 1090 a 890 c 1037 g 1081 t ORIGIN 1 ggtggcctct gtggccgtcc aggctagcgg cggcccgcag gcggcgggga gaaagactct 61 ctcacctggt cttgcggctg tggccaccgc cggccagggg tgtggagggc gtgctgccgg 121 agacgtccgc cgggctctgc agttccgccg ggggtcgggc agctatggag ccgcggccca 181 cggcgccctc ctccggcgcc ccgggactgg ccggggtcgg ggagacgccg tcagccgctg 241 cgctggccgc agccagggtg gaactgcccg gcacggctgt gccctcggtg ccggaggatg 301 ctgcgcccgc gagccgggac ggcggcgggg tccgcgatga gggccccgcg gcggccgggg 361 acgggctggg cagacccttg gggcccaccc cgagccagag ccgtttccag gtggacctgg 421 tttccgagaa cgccgggcgg gccgctgctg cggcggcggc ggcggcggcg gcagcggcgg 481 cggctggtgc tggggcgggg gccaagcaga cccccgcgga cggggaagcc agcggcgaga 541 gcgagccagc taaaggcagc gaggaagcca agggccgctt ccgcgtgaac ttcgtggacc 601 cagctgcctc ctcgtcggct gaagacagcc tgtcagatgc tgccggggtc ggagtcgacg 661 ggcccaacgt gagcttccag aacggcgggg acacggtgct gagcgagggc agcagcctgc 721 actccggcgg cggcggcggc agtgggcacc accagcacta ctattatgat acccacacca 781 acacctacta cctgcgcacc ttcggccaca acaccatgga cgctgtgccc aggatcgatc 841 actaccggca cacagccgcg cagctgggcg agaagctgct ccggcctagc ctggcggagc 901 tccacgacga gctggaaaag gaaccttttg aggatggctt tgcaaatggg gaagaaagta 961 ctccaaccag agatgctgtg gtcacgtata ctgcagaaag taaaggagtc gtgaagtttg 1021 gctggatcaa gggtgtatta gtacgttgta tgttaaacat ttggggtgtg atgcttttca 1081 ttagattgtc atggattgtg ggtcaagctg gaataggtct atcagtcctt gtaataatga 1141 tggccactgt tgtgacaact atcacaggat tgtctacttc agcaatagca actaatggat 1201 ttgtaagagg aggaggagca tattatttaa tatctagaag tctagggcca gaatttggtg 1261 gtgcaattgg tctaatcttc gcctttgcca acgctgttgc agttgctatg tatgtggttg 1321 gatttgcaga aaccgtggtg gagttgctta aggaacattc catacttatg atagatgaaa 1381 tcaatgatat ccgaattatt ggagccatta cagtcgtgat tcttttaggt atctcagtag 1441 ctggaatgga gtgggaagca aaagctcaga ttgttctttt ggtgatccta cttcttgcta 1501 ttggtgattt cgtcatagga acatttatcc cactggagag caagaagcca aaagggtttt 1561 ttggttataa atctgaaata tttaatgaga actttgggcc cgattttcga gaggaagaga 1621 ctttcttttc tgtatttgcc atcttttttc ctgctgcaac tggtattctg gctggagcaa 1681 atatctcagg tgatcttgca gatcctcagt cagccatacc caaaggaaca ctcctagcca 1741 ttttaattac tacattggtt tacgtaggaa ttgcagtatc tgtaggttct tgtgttgttc 1801 gagatgccac tggaaacgtt aatgacacta tcgtaacaga gctaacaaac tgtacttctg 1861 cagcctgcaa attaaacttt gatttttcat cttgtgaaag cagtccttgt tcctatggcc 1921 taatgaacaa cttccaggta atgagtatgg tgtcaggatt tacaccacta atttctgcag 1981 gtatattttc agccactctt tcttcagcat tagcatccct agtgagtgct cccaaaatat 2041 ttcaggctct atgtaaggac aacatctacc cagctttcca gatgtttgct aaaggttatg 2101 ggaaaaataa tgaacctctt cgtggctaca tcttaacatt cttaattgca cttggattca 2161 tcttaattgc tgaactgaat gttattgcac caattatctc aaacttcttc cttgcatcat 2221 atgcattgat caatttttca gtattccatg catcacttgc aaaatctcca ggatggcgtc 2281 ctgcattcaa atactacaac atgtggatat cacttcttgg agcaattctt tgttgcatag 2341 taatgttcgt cattaactgg tgggctgcat tgctaacata tgtgatagtc cttgggctgt 2401 atatttatgt tacctacaaa aaaccagatg tgaattgggg atcctctaca caagccctga 2461 cttacctgaa tgcactgcag cattcaattc gtctttctgg agtggaagac cacgtgaaaa 2521 actttaggcc acagtgtctt gttatgacag gtgctccaaa ctcacgtcca gctttacttc 2581 atcttgttca tgatttcaca aaaaatgttg gtttgatgat ctgtggccat gtacatatgg 2641 gtcctcgaag acaagccatg aaagagatgt ccatcgatca agccaaatat cagcgatggc 2701 ttattaagaa caaaatgaag gcattttatg ctccagtaca tgcagatgac ttgagagaag 2761 gtgcacagta tttgatgcag gctgctggtc ttggtcgtat gaagccaaac acacttgtcc 2821 ttggatttaa gaaagattgg ttgcaagcag atatgaggga tgtggatatg tatataaact 2881 tatttcatga tgcttttgac atacaatatg gagtagtggt tattcgccta aaagaaggtc 2941 tggatatatc tcatcttcaa ggacaagaag aattattgtc atcacaagag aaatctcctg 3001 gcaccaagga tgtggtagta agtgtggaat atagtaaaaa gtccgattta gatacttcca 3061 aaccactcag tgaaaaacca attacacaca aagttgagga agaggatggc aagactgcaa 3121 ctcaaccact gttgaaaaaa gaatccaaag gccctattgt gcctttaaat gtagctgacc 3181 aaaagcttct tgaagctagt acacagtttc agaaaaaaca aggaaagaat actattgatg 3241 tctggtggct ttttgatgat ggaggtttga ccttattgat accttacctt ctgacgacca 3301 agaaaaaatg gaaagactgt aagatcagag tattcattgg tggaaagata aacagaatag 3361 accatgaccg gagagcgatg gctactttgc ttagcaagtt ccggatagac ttttctgata 3421 tcatggttct aggagatatc aataccaaac caaagaaaga aaatattata gcttttgagg 3481 aaatcattga gccatacaga cttcatgaag atgataaaga gcaagatatt gcagataaaa 3541 tgaaagaaga tgaaccatgg cgaataacag ataatgagct tgaactttat aagaccaaga 3601 cataccggca gatcaggtta aatgagttat taaaggaaca ttcaagcaca gctaatatta 3661 ttgtcatgag tctcccagtt gcacgaaaag gtgctgtgtc tagtgctctc tacatggcat 3721 ggttagaagc tctatctaag gacctaccac caatcctcct agttcgtggg aatcatcaga 3781 gtgtccttac cttctattca taaatgttct atacagtgga cagccctcca gaatggtact 3841 tcagtgccta gtgtagtaac ctgaaatctt caatgacaca ttaacatcac aatggcgaat 3901 ggtgactttt ctttcacgat ttcattaatt tgaaagcaca caggaaagct tgctccattg 3961 ataacgtgta tggagacttc ggttttagtc aattccatat ctcaatctta atggtgattc 4021 ttctctgttg aactgaagtt tgtgagagta gttttccttt gctacttgaa tagcaataaa 4081 agcgtgttaa ctttttgg // LOCUS HSU30255 1536 bp mRNA PRI 09-SEP-1995 DEFINITION Human phosphogluconate dehydrogenase (hPGDH) gene, complete cds. ACCESSION U30255 NID g984324 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1536) AUTHORS Chan,J.Y.W., Fung,K.P. and Lee,C.Y. TITLE Sequence determination of human phosphogluconate dehydrogenase JOURNAL Unpublished REFERENCE 2 (bases 1 to 1536) AUTHORS Chan,J.Y.W. TITLE Direct Submission JOURNAL Submitted (23-JUN-1995) Judy Y.W. Chan, Biochemistry, Rm. 414, B.M.S.B., The Chinese University of Hong Kong, Shatin, Hong Kong. FEATURES Location/Qualifiers source 1..1536 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="A460" /tissue_type="heart" 5'UTR 1..6 /gene="hPGDH" gene 1..1536 /gene="hPGDH" CDS 7..1458 /gene="hPGDH" /codon_start=1 /product="phosphogluconate dehydrogenase" /db_xref="PID:g984325" /translation="MAQADIALIGLAVMGQNLILNMNDHGFVVCAFNRTVSKVDDFLA NEAKGTKVVGAQSLKEMVSKLKKPRRIILLVKAGQAVDDFIEKLVPLLDTGDIIIDGG NSEYRDTTRRCRDLKGKGILFVGSGVSGGEEGPRYGPSLMPGGNKEAWPHIKTIFQGI AAKVGTGEPCCDWVGDEGAGHFVKMVHNGIEYGDMQLICEAYHLMKDVLGMAQDEMAQ AFEDWNKTELDSFLIEITANILKFQDTDGKHLLPKIRDSAGQKGTGKWTAISALEYGV PVTLIGEAVFARCLSSLKDERIQASKKLKGPQKFQFDGDKKSFLEDIRKALYASKIIS YAQGFMLLRQAATEFGWTLNYGGIALMWRGGCIIRSVFLGKIKDAFDRNPELQNLLLD DFFKSAVENCQDSWRRAVSTGVQAGIPMPCFTTALSFYDGYRHEMLPASLIQAQRDYF GAHTYELLAKPGQFIHTNWTGHGGTVSSSSYNA" 3'UTR 1459..1536 /gene="hPGDH" BASE COUNT 385 a 384 c 433 g 334 t ORIGIN 1 gccgccatgg cccaagctga catcgcgctg atcggattgg ccgtcatggg ccagaactta 61 attctgaaca tgaatgacca cggctttgtg gtctgtgctt ttaataggac tgtctccaaa 121 gttgacgatt tcttggccaa tgaggcaaag ggaaccaaag tggtgggtgc ccagtccctg 181 aaagagatgg tctccaagct gaagaagccc cggcggatca tcctcctggt gaaggctggg 241 caagctgtgg atgatttcat cgagaaattg gtaccattgt tggatactgg tgacatcatc 301 attgacggag gaaattctga atatagggac accacaagac ggtgccgaga cctcaaaggc 361 aagggaattt tatttgtggg gagcggagtc agtggtggag aggaagggcc ccggtatggc 421 ccatcgctca tgccaggagg gaacaaagaa gcgtggcccc acatcaagac catcttccaa 481 ggcattgctg caaaagtggg aactggagaa ccctgctgtg actgggtggg agatgaggga 541 gcaggccact ttgtgaagat ggtgcacaac gggatagagt atggggacat gcagctgatc 601 tgtgaggcat accacctgat gaaagacgtg ctgggcatgg cgcaggacga gatggcccag 661 gcctttgagg attggaataa gacagagcta gactcattcc tgattgaaat cacagccaat 721 attctcaagt tccaagacac cgatggcaaa cacctgctgc caaagatcag ggacagcgcg 781 gggcagaagg gcacagggaa gtggaccgcc atctccgccc tggaatacgg cgtacccgtc 841 accctcattg gagaagctgt ctttgctcgg tgcttatcat ctctgaagga tgagagaatt 901 caagctagca aaaagctgaa gggtccccag aagttccagt ttgatggtga taagaaatca 961 ttcctggagg acattcggaa ggcactctac gcttccaaga tcatctctta cgctcaaggc 1021 tttatgctgc taaggcaggc agccaccgag tttggctgga ctctcaatta tggtggcatc 1081 gccctgatgt ggagaggggg ctgcatcatt agaagtgtat tcctaggaaa gataaaggat 1141 gcatttgatc gaaacccgga acttcagaac ctcctactgg acgacttctt taagtcagct 1201 gttgaaaact gccaggactc ctggcggcgg gcagtcagca ctggggtcca ggctggcatt 1261 cccatgccct gttttaccac tgccctctcc ttctatgacg ggtacagaca tgagatgctt 1321 ccagccagcc tcatccaggc tcagcgggat tacttcgggg ctcacaccta tgaactcttg 1381 gccaaaccag ggcagtttat ccacaccaac tggacaggcc atggtggcac cgtgtcatcc 1441 tcgtcataca atgcctgatg ggctcctgtc accctccacg tctccacaga ccaggacatt 1501 ccatgtgcct catggcactg ccacctgggc ctttgg // LOCUS HSU30313 877 bp mRNA PRI 15-NOV-1995 DEFINITION Human diadenosine tetraphosphatase mRNA, complete cds. ACCESSION U30313 NID g1050959 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 877) AUTHORS Thorne,N.M., Hankin,S., Wilkinson,M.C., Nunez,C., Barraclough,R. and McLennan,A.G. TITLE Human diadenosine 5',5''-P1,P4-tetraphosphate pyrophosphohydrolase is a member of the MutT family of nucleotide pyrophosphatases JOURNAL Biochem. J. 311 (Pt 3), 717-721 (1995) MEDLINE 96067583 REFERENCE 2 (bases 1 to 877) AUTHORS Thorne,N.M.H, Hankin,S., Wilkinson,M.C., Nunez,C., Barraclough,R. and McLennan,A.G. TITLE Direct Submission JOURNAL Submitted (26-JUN-1995) Alexander G. McLennan, Biochemistry, University of Liverpool, P.O. Box 147, Liverpool, L69 3BX, UK COMMENT This is the complete cDNA sequence of IMAGE clone 108448 of the Washington University-Merck EST project. The sequences of the 5' and 3' ends of this clone have accession nos. T77765 and T77766 respectively and the GDB ID is G00-464-065. It is available from the IMAGE Consortium (info@image.llnl.gov). For other information contact A.G. McLennan (agmclen@liv.ac.uk). FEATURES Location/Qualifiers source 1..877 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver and spleen" /dev_stage="20-week fetus" /sex="male" /clone="108448" CDS 64..123 /note="probably untranslated; short upstream ORF; Method: conceptual translation supplied by author." /codon_start=1 /db_xref="PID:g1050960" /translation="MPALFYREPWRSWDRGHID" CDS 175..618 /EC_number="3.6.1.17" /note="diadenosine 5',5'''-P1,P4-tetraphosphate (asymmetrical) pyrophosphohydrolase; Ap4A hydrolase; Ap4Aase; The sequence contains a novel modification (Gx5Ex5AxRETxEE) of the MutT signature motif found in several nucleotide pyrophosphatases (Koonin, E.V., Nucleic Acids Res. 21, 4847 (1993))" /codon_start=1 /function="Asymmetrically hydrolyzes Ap4A to yield AMP and ATP." /product="diadenosine tetraphosphatase" /db_xref="PID:g1050961" /translation="MALRACGLIIFRRCLIPKVDNNAIEFLLLQASDGIHHWTPPKGH VEPGEDDLETALRETQEEAGIEAGQLTIIEGFKRELNYVARNKPKTVIYWLAEVKDYD VEIRLSHEHQAYRWLGLEEACQLAQFKEMKAALQEGHQFLCSIEA" polyA_site 877 /note="16 A nucleotides" BASE COUNT 239 a 205 c 242 g 191 t ORIGIN 1 cctcctacct ccttctgctt cggtgcgttt gcttctgagg ttctccagtg tcacaacaaa 61 cacatgccag ccctgtttta cagggagccc tggaggagtt gggatagagg ccacattgac 121 tgagggtagt tgccagggtc ctgcagttat acacaaagtc cttaggataa gaccatggcc 181 ttgagagcat gtggcttgat catcttccga agatgcctca ttcccaaagt ggacaacaat 241 gcaattgagt ttttactgct gcaggcatca gatggcattc atcactggac tcctcccaaa 301 ggccatgtgg aaccaggaga ggatgacttg gaaacagccc tgagggagac ccaagaggaa 361 gcaggcatag aagcaggcca gctgaccatt attgaggggt tcaaaaggga actcaattat 421 gtggccagga acaagcctaa aacagtcatt tactggctgg cggaggtgaa ggactatgac 481 gtggagatcc gcctctccca tgagcaccaa gcctaccgct ggctggggct ggaggaggcc 541 tgccagttgg ctcagttcaa ggagatgaag gcagcgctcc aagaaggaca ccagtttctt 601 tgctccatag aggcctgagc tgactggagc agagtcattt gcttcagcag gatccttgtg 661 ggccttctaa gatgaagcca ccctcaggtc cagggaaggt tgtgctggta tttggctcat 721 gacagccaag agcagatttg tgaaatcggc tcaactccca ggtgagagca agcaaaaatc 781 ttggctgggt ggaaaggaag gcaaaagagt aaaaattaaa aaggccaggc ccaagtaagt 841 gtaccttgta ctttataaat aaacctcaag cagctca // LOCUS HSU30461 1679 bp mRNA PRI 18-APR-1997 DEFINITION Human GABAA receptor subunit alpha4 mRNA, complete cds. ACCESSION U30461 NID g905392 KEYWORDS . SOURCE Human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1679) AUTHORS Yang,W., Drewe,J.A. and Lan,N.C. TITLE Cloning and characterization of the human GABAA receptor alpha 4 subunit: identification of a unique diazepam-insensitive binding site JOURNAL Eur. J. Pharmacol. 291 (3), 319-325 (1995) MEDLINE 96360044 REFERENCE 2 (bases 1 to 1679) AUTHORS Yang,W. TITLE Direct Submission JOURNAL Submitted (27-JUN-1995) Wu Yang, Molecular Biology, CoCensys Inc., 213 Technology Drive, Irvine, CA 92718, USA FEATURES Location/Qualifiers source 1..1679 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /tissue_type="cerebral cortex" /dev_stage="adult" CDS 15..1679 /codon_start=1 /product="GABAA receptor subunit alpha4" /db_xref="PID:g905393" /translation="MVSAKKVPAIALSAGVSFALLRFLCLAVCLNESPGQNQKEEKLC TENFTRILDSLLDGYDNRLRPGFGGPVTEVKTDIYVTSFGPVSDVEMEYTMDVFFRQT WIDKRLKYDGPIEILRLNNMMVTKVWTPDTFFRNGKKSVSHNMTAPNKLFRIMRNGTI LYTMRLTISAECPMRLVDFPMDGHACPLKFGSYAYPKSEMIYTWTKGPEKSVEVPKES SSLVQYDLIGQTVSSETIKSITGEYIVMTVYFHLRRKMGYFMIQTYIPCIMTVILSQV SFWINKESVPARTVFGITTVLTMTTLSISARHSLPKVSYLTAMDWFIAVCFAFVFSAL IEFAAVNYFTNIQMEKAKRKTSKPPQEVPAAPVQREKHPEAPLQDTNANLNMRKRTNA LVHSESDVGNRTEVGNHSSKSSTVVQESSKGTPRSYLASSPNPFSRANAAETISAARA LPSASPTSIRTGHMPRKASVGSASTRHVFGSRLQRIKTTVNTIGATGKLSATPPPSAP PPSGSGTSKIDKYARILFPVTFGAFNMVYWVVYLSKDTMEKSESLM" BASE COUNT 483 a 369 c 359 g 468 t ORIGIN 1 ggcatgttgc aaagatggtt tctgccaaga aggtacccgc gatcgctctg tccgccgggg 61 tcagtttcgc cctcctgcgc ttcctgtgcc tggcggtttg tttaaacgaa tccccaggac 121 agaaccaaaa ggaggagaaa ttgtgcacag aaaatttcac ccgcatcctg gacagtttgc 181 tcgatggtta tgacaacagg ctgcgtcctg gatttggggg tcctgttaca gaagtgaaaa 241 ctgacatata tgtcaccagc tttggacctg tttctgatgt tgaaatggaa tacacaatgg 301 atgtgttctt caggcagaca tggattgaca aaagattaaa atatgacggc cccattgaaa 361 ttttgagatt gaacaatatg atggtaacga aagtgtggac ccctgatact ttcttcagga 421 atggaaagaa atctgtctca cataatatga cagctccaaa taagcttttt agaattatga 481 gaaatggtac tattttatac acaatgagac tcaccataag tgcggagtgt cccatgagat 541 tggtggattt tcccatggat ggtcatgcat gccctttgaa attcgggagt tatgcctatc 601 caaagagtga gatgatctat acctggacaa aaggtcctga gaaatcagtt gaagttccga 661 aggagtcttc cagcttagtt caatatgatt tgattgggca aaccgtatca agtgaaacca 721 tcaaatcaat tacgggcgaa tatattgtta tgacggttta cttccacctc agacggaaga 781 tgggttattt tatgattcag acctatattc cgtgcattat gacagtgatt ctttctcaag 841 tttcattttg gataaataaa gaatcagttc ccgctaggac tgtatttgga ataacaactg 901 tcctcaccat gaccacacta agcatcagtg cacgacattc tttgcccaaa gtgtcctact 961 tgaccgccat ggactggttc atagctgtct gctttgcttt tgtattttcg gcccttatcg 1021 agtttgctgc tgtcaactat ttcaccaata ttcaaatgga aaaagccaaa agaaagacat 1081 caaagccccc tcaggaagtt cccgctgctc cagtgcagag agagaagcat cctgaagccc 1141 ctctgcagga tacaaatgcc aatttgaaca tgagaaaaag aacaaatgct ttggttcact 1201 ctgaatctga tgttggcaac agaactgagg tgggaaacca ttcaagcaaa tcttccacag 1261 ttgttcaaga atcttctaaa ggcacacctc ggtcttactt agcttccagt ccaaacccat 1321 tcagccgtgc aaatgcagct gaaaccatat ctgcagcaag agcacttcca tctgcttctc 1381 ctacttctat ccgaactgga catatgcctc gaaaggcttc agttggatct gcttctactc 1441 gtcacgtgtt tggatcaaga ctgcagagga taaagaccac agttaatacc ataggggcta 1501 ctgggaagtt gtcagctact cctcctccat cggctccacc accttctgga tctggcacaa 1561 gtaaaataga caaatatgcc cgtattctct ttccagtcac atttggggca tttaacatgg 1621 tttattgggt tgtttattta tctaaggaca ctatggagaa atcagaaagt ctaatgtaa // LOCUS HSU30473 1076 bp mRNA PRI 02-FEB-1996 DEFINITION Human putative src-like adapter protein (SLAP) mRNA, complete cds. ACCESSION U30473 NID g1173538 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1076) AUTHORS Angrist,M., Wells,D.E., Chakravarti,A. and Pandey,A. TITLE Chromosomal localization of the mouse Src-like adapter protein (Slap) gene and its putative human homolog SLA JOURNAL Genomics 30 (3), 623-625 (1995) MEDLINE 96423054 REFERENCE 2 (bases 1 to 1076) AUTHORS Angrist,M.H., Wells,D., Chakravarti,A. and Pandey,A. TITLE Direct Submission JOURNAL Submitted (27-JUN-1995) Misha H. Angrist, Genetics, Case Western Reserve, 10900 Euclid Ave, Cleveland, OH 44106-4955, USA FEATURES Location/Qualifiers source 1..1076 /organism="Homo sapiens" /note="Sequence is partially derived from two overlapping ESTs: T08339 and F05292; These sequences were aligned and novel sequence was obtained by further sequencing of these EST clones" /db_xref="taxon:9606" /chromosome="8" /map="8q" gene 166..996 /gene="SLAP" CDS 166..996 /gene="SLAP" /note="putative src-like adapter protein; non-catalytic src-like adapter protein containing SH3 and SH2 domains; homolog of mouse SLAP; Method: conceptual translation supplied by author" /codon_start=1 /db_xref="PID:g1173539" /translation="MGNSMKSTPAPAERPLPNPEGLDSDFLAVLSDYPSPDISPPIFR RGEKLRVISDEGGWWKAISLSTGRESYIPGICVARVYHGWLFEGLGRDKAEELLQLPD TKVGSFMIRESETKKGFYSLSVRHRQVKHYRIFRLPNNWYYISPRLTFQCLEDLVNHY SEVADGLCCVLTTPCLTQSTAAPAVRASSSPVTLRQKTVDWRRVSRLQEDPEGTENPL GVDESLFSYGLRESIASYLSLTSEDNTSFDRKKKSISLMYGGSKRKSSFFSSPPYFED " BASE COUNT 277 a 284 c 295 g 220 t ORIGIN 1 cacgaggggg agaaattccc catgacagcg actgatgaag aatttcaata gaaagctgct 61 acttcagaaa ataagatcat ttgctgcgaa tggagaacat ctcaggcagc cctgatgctc 121 caccggctct gggcatcacc agcggcccca gggaaaaaga aagaaatggg aaacagcatg 181 aaatccaccc ctgcgcctgc cgagaggccc ctgcccaacc cggagggact ggatagcgac 241 ttccttgccg tgctaagtga ctacccgtct cctgacatca gccccccgat attccgccga 301 ggggagaaac tgcgtgtgat ttctgatgaa gggggctggt ggaaagctat ttctcttagc 361 actggtcgag agagttacat ccctggaata tgtgtggcca gagtttacca tggctggctg 421 tttgagggcc tgggcagaga caaggccgag gagctgctgc agctgccaga cacaaaggtc 481 ggctccttca tgatcagaga gagtgagacc aagaaagggt tttactcact gtcggtgaga 541 cacaggcagg taaagcatta ccgcattttc cgtctgccga acaactggta ctacatttcc 601 ccgaggctca ccttccagtg cctggaggac ctggtgaacc actattctga ggtggctgat 661 ggcctgtgct gtgtgctcac cacgccctgc ctgacacaaa gcacggctgc cccagcagtg 721 agggcctcca gctcacctgt caccttgcgt cagaagactg tggactggag gagagtgtcc 781 agactgcagg aggaccccga gggaacagag aacccgcttg gggtagacga gtcccttttc 841 agctatggcc ttcgagagag cattgcctct tacctgtccc tgaccagtga ggacaacacc 901 tcctttgatc gaaagaagaa aagcatctcc ctgatgtatg gtggcagcaa gagaaagagc 961 tcattcttct catcaccacc ttactttgag gactagccaa gaacagacac aatggttcat 1021 gccaaaagga acagaagttc caactattgc ctgggatctt tgcggaaaag gaggtt // LOCUS HSU30498 2187 bp mRNA PRI 07-AUG-1995 DEFINITION Human retinoic acid-inducible E3 protein mRNA, complete cds. ACCESSION U30498 NID g929952 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2187) AUTHORS Scott,L.M. and Collins,S.J. JOURNAL Unpublished REFERENCE 2 (bases 1 to 2187) AUTHORS Scott,L.M. TITLE Direct Submission JOURNAL Submitted (28-JUN-1995) Linda M. Scott, Program in Molecular Medicine, Fred Hutchinson Cancer Research Center, 1124 Columbia Street, Seattle, WA 98104, USA FEATURES Location/Qualifiers source 1..2187 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="acute promyelocytic leukemia HL60" CDS 23..811 /codon_start=1 /product="E3 protein" /db_xref="PID:g929953" /translation="MDPRLSTVRQTCCCFNVRIATTALAIYHVIMSVLLFIEHSVEVA HGKASCKLSQMGYLRIADLISSFLLITMLFIISLSLLIGVVKNREKYLLPFLSLQIMD YLLCLLTLLGSYIELPAYLKLASRSRASSSKFPLMTLQLLDFCLSILTLCSSYMEVPT YLNFKSMNHMNYLPSQEDMPHNQFIKMMIIFSIAFITVLIFKVYMFKCVWRCYRLIKC MNSVEEKRNSKMLQKVVLPSYEEALSLPSKDPEGGPAPPPYSEV" BASE COUNT 544 a 580 c 513 g 550 t ORIGIN 1 cagaggaggg gacggcagca ccatggaccc ccgcttgtcc actgtccgcc agacctgctg 61 ctgcttcaat gtccgcatcg caaccaccgc cctggccatc taccatgtga tcatgagcgt 121 cttgttgttc atcgagcact cagtagaggt ggcccatggc aaggcgtcct gcaagctctc 181 ccagatgggc tacctcagga tcgctgacct gatctccagc ttcctgctca tcaccatgct 241 cttcatcatc agcctgagcc tactgatcgg ggtagtcaag aaccgggaga agtacctgct 301 gcccttcctg tccctgcaaa tcatggacta tctcctgtgc ctgctcacac tgctgggctc 361 ctacattgag ctgcccgcct acctcaagtt ggcctcccgg agccgtgcta gctcctccaa 421 gttccccctg atgacgctgc agctgctgga cttctgcctg agcatcctga ccctctgcag 481 ctcctacatg gaagtgccca cctatctcaa cttcaagtcc atgaaccaca tgaattacct 541 ccccagccag gaggatatgc ctcataacca gttcatcaag atgatgatca tcttttccat 601 cgccttcatc actgtcctta tcttcaaggt ctacatgttc aagtgcgtgt ggcggtgcta 661 cagattgatc aagtgcatga actcggtgga ggagaagaga aactccaaga tgctccagaa 721 ggtggtcctg ccgtcctacg aggaagccct gtctttgcca tcgaaggacc cagagggggg 781 gccagcacca cccccatact cagaggtgtg accctcggca gggcccaggc ccagtgttgg 841 gaggggtgga gctgcctcat aatctgcttt tttggtttgg tggcccctgt ggcctgggtg 901 ggccctcccg cccctccctg gcaggacaat ctgcttgtgt ctccctcgct ggcctgctcc 961 tcctgcaggg gctgtgagct gctcacaact gggtcaacgc tttaggctga gtcactcctc 1021 gggtctctcc ataattcagc ccaacaatgt ttggtttatt tcaatcagct ctgacacttg 1081 tttagacgat tgggcattct aaagttggtg agtttgtcaa gcaactatcg acttgatcag 1141 ttcagcaagc aactgacaaa tcaaaaaccc acttgtcagt tcaataaaat aatttgggca 1201 aacaacagac tattggattg atttataaat aggtggcagt tcacatagga atttaatcaa 1261 gtaatcatta attagttacc ccctatatat aaatatatgt aatcaatttc ttcaaatagc 1321 ttggttacat gataatcaat tagccaacca tgagtcattt agaatagtga taaatagaat 1381 acacagaata gtgatgaaat tcaatttaaa aaatcacgtt agcctccaaa ccatttaatt 1441 caaatgaacc catcaactgg atgccaactc tggcgaatgt aggacctctg agtgggtgta 1501 taattgttaa ttcaaatgaa attcatttaa acagttgaca aactgtcatt caacaattag 1561 gtccaggaaa taacagttat ttcatcataa aacagtccct tcaaacacac aattggtctg 1621 ctgaagggtt gtcatcaaca atccaatgct cacctattca gttgctctgt ggtcagtgtg 1681 gctgcataac agtggattcc atgaaaggag tcattttagt gatgagctgc cagtccattc 1741 ccaggccagg ctgtcgctgg ccatccattc agtcgattca gtcataggcg aatctgttct 1801 gcccgaggct tgtggtcaag caaaaattca gccctgaaat caggcacatc tgttcggtgg 1861 actaaaccca caggttagtt cagtcaaagc agggaacccc cttgtgggca ctgaccctgc 1921 cactggggtc atggcggttg tgacagctgg ggaggtttgc ccccaacagc cctcctgtgc 1981 ctgcttccct gtgtgtcggg gtcctccagg gagctgaccc agaggtggag gccacggagg 2041 cagggtctct ggggactgtc ggggggtaca gagggagaag gctcttcaag agctccctgg 2101 caataccccc ttgtgtaatt gctttgtgtg cgacagggag gaagtttcaa taaagcagca 2161 acaagcttcc aaaaaaaaaa aaaaaaa // LOCUS HSU30521 2036 bp mRNA PRI 28-AUG-1995 DEFINITION Human P311 HUM (3.1) mRNA, complete cds. ACCESSION U30521 NID g963091 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2036) AUTHORS Studler,J.M., Glowinski,J. and Levi-Strauss,M. TITLE An abundant mRNA of the embryonic brain persists at a high level in cerebellum, hippocampus and olfactory bulb during adulthood JOURNAL Eur. J. Neurosci. 5 (6), 614-623 (1993) MEDLINE 94084289 REFERENCE 2 (bases 1 to 1814) AUTHORS Studler,J.M. TITLE Direct Submission JOURNAL Submitted (29-JUN-1995) Jeanne-Marie Studler, INSERM U-114, College de France, 11 Place Marcelin Berthelot, Paris cedex 05, 75231, France FEATURES Location/Qualifiers source 1..2036 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="2 year-old" /sex="female" /tissue_type="cerebellum" gene 203..409 /gene="3.1" CDS 203..409 /gene="3.1" /note="putative" /codon_start=1 /product="P311 HUM" /db_xref="PID:g963092" /translation="MVYYPELFVWVSQEPFPNKDMEGRLPKGRLPVPKEVNRKKNDET NAASLTPLGSSELRSPRISYLHFF" polyA_signal 2005..2010 BASE COUNT 532 a 446 c 406 g 652 t ORIGIN 1 tttcctcttt ctctaagagt ctctctctcc ctttccctct ctctcccccc aatctgtctt 61 tctagcatgt tgcccttttt caaccacatt tgtgtttcag gtgtagagag gagagagagt 121 gaacagggag cggggctttt gtctgttggt ctccctggac tgaagagagg gagaatagaa 181 gcccaagact aagattctca aaatggttta ttacccagaa ctctttgtct gggtcagtca 241 agaaccattt ccaaacaagg acatggaggg aaggcttcct aagggaagac ttcctgtccc 301 aaaggaagtg aaccgcaaga agaacgatga gacaaacgct gcctccctga ctccactggg 361 cagcagtgaa ctccgctccc caagaatcag ttacctccac tttttttaat cgtaacacct 421 ccatttgtat tacatatggt gtatgggtat tgatgaggtc atggtatcat atatgggatt 481 tttttctgtg taaatcatca agtataagaa gaaactatgg gactctgagc cttgctttag 541 agaatttaca gtggacaaat aggtgtcatc aaaccagttt ttaatcattc tgactcaagt 601 gaaaacgctc agaatttcac actgtgaatc cacgtttaca acccttacag gtgggccttc 661 aggcctggtt cgctacaaca atgtcttcca caactcaaac tcccaccgcg ctcacacaac 721 cggtccactc ctgccttttc actcacacag ctcccgactg cttcttgcag aggctgagag 781 tccccccccc cacctttttt tttcatttag atgtaacaaa cctagtagtt tatgttcatc 841 aattgtctgt atatctctat attttatcca tgtactcttt tgatgtatag aagtagtttg 901 aaactcattg tttccttgtg gtaagtgacc gagatgctgc cacaggacct gagacactga 961 tgaatggtgc tattttggac tttcaacatg ctccttggcg aggtagctct gatggagtta 1021 ttttttattt ccatgttcta agaaggtgtt ggtactctgt ttccctgaat gttgttctct 1081 agactggatt gacttgtttt ccttgtgtct tcagtgtggc tttcttcctc agtgttgtag 1141 gttgagcgaa tgctaccaga gtgtgagaga ccattgtctc gttggctggc gctcacggac 1201 atgcagtcac ggtagcggga gcaatcacaa aactgtaatt tacttaccaa atctcttcct 1261 ttccgtagcc tcgcctgcct gacttagaga aagaaaagca ataattttac aggcattttg 1321 aggtgtctct ttgggttctt tctgtttgaa aggatatttg tcgaaaaaaa gagcaaaacc 1381 gttttaaata aactccccct ggaaaaaaac ccaaaacact ggcatctgag taggaatatg 1441 aaaatgacac cttttccaaa tattaaattg gaaaacaagg tctacaaaat catgatactt 1501 ttttaaaagg cagagcattc ttttttcggc aattttgata agcaaggtgt agatttacat 1561 ttttgtcctt gctcccaacg aaatggataa acaaaaataa attaccatct actcatggaa 1621 tgttgttgtg ttagccagtc tgaaagccca ccttaatttt tatataactg tctttagctc 1681 ttcttttgac agggcaggcc ttgttctgaa ctgtttcgct tctgactgtt aaacaccgat 1741 gacgcatgca ctgcacttct tcgttttctt cttgctcccc cattggcctg agtttcttgt 1801 gcattactcc tctccctcct tcgttagaat aggtatatca gctgtgtaaa tagagcaaga 1861 aaacagtatt ctgcatctgt ggcatttatg tagagttgca gttgtgtact gctgaaaatg 1921 caggcttttg taacagtgtg atctttactg atgcactcat gacaagtacc caatgtattt 1981 tagctatttt agtagtattt gttcaataaa tacgcaagct gtaaggtaac tgtctg // LOCUS HSU30610 834 bp mRNA PRI 02-DEC-1995 DEFINITION Human CD94 protein mRNA, complete cds. ACCESSION U30610 NID g1098616 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 834) AUTHORS Chang,C., Rodriguez,A., Carretero,M., Lopez-Botet,M., Phillips,J.H. and Lanier,L.L. TITLE Molecular characterization of human CD94: a type II membrane glycoprotein related to the C-type lectin superfamily JOURNAL Eur. J. Immunol. 25 (9), 2433-2437 (1995) MEDLINE 96011848 REFERENCE 2 (bases 1 to 834) AUTHORS Lanier,L.L. TITLE Direct Submission JOURNAL Submitted (29-JUN-1995) Lewis L. Lanier, Human Immunology, DNAX Research Institute of Molecular and Cellular Biology, 901 California Avenue, Palo Alto, CA 94304, USA FEATURES Location/Qualifiers source 1..834 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="plasmid LL288" /chromosome="12" /cell_type="NK cell" /tissue_type="blood" /dev_stage="adult" 5'UTR 1..84 CDS 85..624 /codon_start=1 /product="CD94 protein" /db_xref="PID:g1098617" /translation="MAVFKTTLWRLISGTLGIICLSLMATLGILLKNSFTKLSIEPAF TPGPNIELQKDSDCCSCQEKWVGYRCNCYFISSEQKTWNESRHLCASQKSSLLQLQNT DELDFMSSSQQFYWIGLSYSEEHTAWLWENGSALSQYLFPSFETFNTKNCIAYNPNGN ALDESCEDKNRYICKQQLI" 3'UTR 625..834 BASE COUNT 256 a 157 c 159 g 262 t ORIGIN 1 aaaggcttca acaattcaac gctgttcttt ctgaaaaagt acacatcgtg ccttctctac 61 ttcgctcttg gaacataatt tctcatggca gtgtttaaga ccactctgtg gaggttaatt 121 tctgggacct tagggataat atgcctttcg ttgatggcta cgttgggaat tttgttgaaa 181 aattctttta ctaaactgag tattgagcca gcatttactc caggacccaa catagaactc 241 cagaaagact ctgactgctg ttcttgccaa gaaaaatggg ttgggtaccg gtgcaactgt 301 tacttcattt ccagtgaaca gaaaacttgg aacgaaagtc ggcatctctg tgcttctcag 361 aaatccagcc tgcttcagct tcaaaacaca gatgaactgg attttatgag ctccagtcaa 421 caattttact ggattggact ctcttacagt gaggagcaca ccgcctggtt gtgggagaat 481 ggctctgcac tctcccagta tctatttcca tcatttgaaa cttttaatac aaagaactgc 541 atagcgtata atccaaatgg aaatgcttta gatgaatcct gtgaagataa aaatcgttat 601 atctgtaagc aacagctcat ttaaatgttt cttggggcag agaaggtgga gagtaaagac 661 ccaacattac taacaatgat acagttgcat gttatattat tactaattgt ctacttctgg 721 agtctataaa atgtttttaa acagtgtcat atacaattgt catgtatgtg aaacaatgtg 781 ttttaaaatt gatgaaattc gttcacctac atttgagaat tataaaatta acat // LOCUS HSU30826 1454 bp mRNA PRI 29-MAR-1996 DEFINITION Human splicing factor SRp40-1 (SRp40) mRNA, complete cds. ACCESSION U30826 NID g1049079 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1454) AUTHORS Screaton,G.R., Caceres,J.F., Mayeda,A., Bell,M.V., Plebanski,M., Jackson,D.G., Bell,J.I. and Krainer,A.R. TITLE Identification and characterization of three members of the human SR family of pre-mRNA splicing factors JOURNAL EMBO J. 14 (17), 4336-4349 (1995) MEDLINE 96016206 REFERENCE 2 (bases 1 to 1454) AUTHORS Screaton,G.R. TITLE Direct Submission JOURNAL Submitted (03-JUL-1995) Gavin R. Screaton, Immunology Group Nuffield Dept. of Medicine, Institute of Molecular Medicine, John Radcliffe Hospital, Headington, Oxford 0X3 9DU, United Kingdom FEATURES Location/Qualifiers source 1..1454 /organism="Homo sapiens" /db_xref="taxon:9606" gene 90..908 /gene="SRp40" CDS 90..908 /gene="SRp40" /function="splicing factor" /note="member of the family of SR protein pre-mRNA splicing factors" /codon_start=1 /product="SRp40-1" /db_xref="PID:g1049080" /translation="MSGCRVFIGRLNPAAREKDVERFFKGYGRIRDIDLKRGFGFVEF EDPRDADDAVYELDGKELCSERVTIEHARARSRGGRGRGRYSDRFSSRRPRNDRRNAP PVRTENRLIVENLSSRVSWQDLKDFMRQAGEVTFADAHRPKLNEGVVEFASYGDLKNA IEKLSGKEINGRKIKLIEGSKRHSRSRSRSRSRTRSSSRSRSRSRSRSRKSYSRSRSR SRSRSRSKSRSVSRSPVPEKSQKRGSSSRSKSPASVDRQRSRSRSRSRSVDSGN" polyA_site 1454 /note="39 A nucleotides" BASE COUNT 432 a 255 c 365 g 402 t ORIGIN 1 gcgtggaggt cgacgactcc gtcgcagact acggacctgt ctgggtctca gccgccaaag 61 accccgtccg gaagtactag ccggacatca tgagtggctg tcgggtattc atcgggagac 121 taaatccagc ggccagggag aaggacgtgg aaagattctt caagggatat ggacggataa 181 gagatattga tctgaaaaga ggctttggtt ttgtggaatt tgaggatcca agggatgcag 241 atgatgctgt gtatgagctt gatggaaaag aactctgtag tgaaagggtt actattgaac 301 atgctagggc tcggtcacga ggtggaagag gtagaggacg atactctgac cgttttagta 361 gtcgcagacc tcgaaatgat agacgaaatg ctccacctgt aagaacagaa aatcgtctta 421 tagttgagaa tttatcctca agagtcagct ggcaggatct caaagatttc atgagacaag 481 ctggggaagt aacgtttgcg gatgcacacc gacctaaatt aaatgaaggg gtggttgagt 541 ttgcctctta tggtgactta aagaatgcta ttgaaaaact ttctggaaag gaaataaatg 601 ggagaaaaat aaaattaatt gaaggcagca aaaggcacag taggtcaaga agcaggtctc 661 gatcccggac cagaagttcc tctaggtctc gtagccgatc ccgttcccgt agtcgcaaat 721 cttacagccg gtcaagaagc aggagcagga gccggagccg gagcaagtcc cgttctgtta 781 gtaggtctcc cgtgcctgag aagagccaga aacgtggttc ttcaagtaga tctaagtctc 841 cagcatctgt ggatcgccag aggtcccggt cccgatcaag gtccagatca gttgacagtg 901 gcaattaaac tgtaaataac ttgccctggg ggcctttttt tttaaaaaac aaaaaccaca 961 aaaattccca aaccatactt gctaaaaatt ctggtaagta tgtgcttttc tgtgggggtg 1021 ggatttggaa ggggggttgg gttgggctgg atatctttgt agatgtggac caccaagggg 1081 ttgttgaaaa ctaattgtat taaatgtctt ttgataagcc ttctgctcac atttttgtga 1141 atgtctgaag tatatagttt gtgtatattg acagagctct tttataacta aagcaaattt 1201 aatttttttg tactagaaaa aaatttgaac attttagttc ttggttataa aaatgttaat 1261 tcagaattag tttaatgcct taattaaact aattaatagc tttggacact taaaagagct 1321 ctaaatttgc ttgtacataa aggcttaatt tgttttcctt gttagggtca agggtgtcct 1381 ccactcttta acagctgctg gacagacaca ttagagcagc tgtttgttat tgataataaa 1441 atattataaa acta // LOCUS HSU30828 1679 bp mRNA PRI 29-MAR-1996 DEFINITION Human splicing factor SRp55-2 (SRp55) mRNA, complete cds. ACCESSION U30828 NID g1049083 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1679) AUTHORS Screaton,G.R., Caceres,J.F., Mayeda,A., Bell,M.V., Plebanski,M., Jackson,D.G., Bell,J.I. and Krainer,A.R. TITLE Identification and characterization of three members of the human SR family of pre-mRNA splicing factors JOURNAL EMBO J. 14 (17), 4336-4349 (1995) MEDLINE 96016206 REFERENCE 2 (bases 1 to 1679) AUTHORS Screaton,G.R. TITLE Direct Submission JOURNAL Submitted (03-JUL-1995) Gavin R. Screaton, Immunology Group Nuffield Dept. of Medicine, Institute of Molecular Medicine, John Radcliffe Hospital, Headington, Oxford 0X3 9DU, United Kingdom FEATURES Location/Qualifiers source 1..1679 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HT-29 carcinoma" /tissue_type="colon" /clone="2" gene 106..513 /gene="SRp55" CDS 106..513 /gene="SRp55" /function="splicing factor" /note="member of the family of SR protein pre-mRNA splicing factors; alternatively spliced" /codon_start=1 /product="SRp55-2" /db_xref="PID:g1049084" /translation="MPRVYIGRLSYNVREKDIQRFFSGYGRLLEVDLKNGYGFVEFED SRDADDAVYELNGKELCGEHVIVEHARGPRRDRDGYSYGSRMTNGAEAVSTEAKMTAF PDWPWLFHTLCDPCPMTLWLTLPEAMTTAAFCH" misc_feature 361..629 /note="sequence insertion, which results in premature translational termination between RRM1 and RRM2" BASE COUNT 444 a 419 c 426 g 390 t ORIGIN 1 gtgaggcgcg tgttcgggct cttgccgtcc ccgcacccgc accgcgttac tggcttgcgg 61 tccgccgttc gacaaccagc ccttgggtcc ccgcccgcca cggacatgcc gcgcgtctac 121 ataggacgcc tgagctacaa cgtccgggag aaggacatcc agcgcttttt cagtggctat 181 ggccgcctcc tcgaagtaga cctcaaaaat gggtacggct tcgtggagtt cgaggactcc 241 cgcgacgccg acgacgccgt ttacgagctg aacggcaagg agctctgcgg cgagcacgtg 301 atcgtagagc acgcccgggg cccgcgtcgc gatcgcgacg gctacagcta cggaagccgc 361 atgaccaatg gggctgaggc tgtgtccact gaggctaaga tgactgcctt tcctgattgg 421 ccttggcttt tccatacatt gtgtgaccct tgccctatga ccctttggct gaccttaccg 481 gaagccatga cgacagcagc cttttgccat tagacgcagg gtgatggtga ggattccaag 541 ggttagacaa aactggttaa tctgaactag gtgactgtta ccttgcgtgt tttgtggcca 601 aaccaccacc aaaaacctca cactgtgatg tggtggaggt ggatacagca gtcggagaac 661 atctggcaga gacaaatacg gaccacctgt tcgtacagaa tacaggctta ttgtagaaaa 721 tctttctagt cggtgcagtt ggcaagattt aaaggatttt atgcgacaag caggtgaagt 781 aacctatgcg gatgcccaca aggaacgaac aaatgagggt gtaattgagt ttcgctccta 841 ctctgacatg aagcgtgctt tggacaaact ggatggcaca gaaataaatg gcagaaatat 901 taggcttatt gaagataagc cacgcacaag ccataggcga tcttactctg gaagcagatc 961 caggtctcga tctagaagac ggtcacgaag taggagtcgc aggagcagcc gcagtagatc 1021 tcgaagtatc tcaaaaagtc gctcccgttc caggtcgcgg agcaaaggtc gatcacgttc 1081 tcgatcaaaa ggcaggaaat ctagatcaaa gagcaaatct aagcccaagt ctgatcgggg 1141 ctcccattca cattctcgaa gcagatctaa ggatgagtat gagaaatctc gaagcaggtc 1201 tcggtcccga tcccccaaag aaaatggaaa gggtgatata aagtcaaaat ccagatcaag 1261 gagccagtcc cgttccaatt cgccgctacc tgttccaccc tcaaaggccc gttctgtgtc 1321 ccctccacca aaaagagcta cttcaagatc ccgttctaga tctcgctcaa agtcaagatc 1381 aaggtccagg tcgagttcca gagattaact cagaactcct tgtttgcaca ttattatgga 1441 acactttcct acttaggcag ttactcttcc atgtttatac ttggcctctt ctgcaagagg 1501 aatctcttga aaacaggggc acacagaaat ttgatttgtg gccaaattgg atgaaaaaga 1561 tgaggctcta aggaaatggt ggcatgaaga ccctctccct tctttgtaga attaagataa 1621 ctttgatttt atagcttttg agctaacgta acttttgtaa agattaagct catttagtg // LOCUS HSU30888 2481 bp mRNA PRI 10-AUG-1995 DEFINITION Human tRNA-guanine transglycosylase mRNA, complete cds. ACCESSION U30888 NID g940181 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2481) AUTHORS Deshpande,K.L. and Katze,J.R. TITLE tRNA-Guanine Transglycosylase cDNA from human placenta JOURNAL Unpublished REFERENCE 2 (bases 1 to 2481) AUTHORS Katze,J.R. TITLE Direct Submission JOURNAL Submitted (04-JUL-1995) Jon R. Katze, Microbiology and Immunology, The University of Tennessee, Memphis, 858 Madison Avenue, Memphis, TN 38163, USA FEATURES Location/Qualifiers source 1..2481 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="A1" /clone_lib="lambda gt11; ClonTech Laboratories, Inc. catalog number: HL3007b" /map="18p" /chromosome="18" /tissue_type="placenta" CDS 92..1576 /codon_start=1 /product="tRNA-Guanine Transglycosylase" /db_xref="PID:g940182" /translation="MPLYSVTVKWGKEKFEGVELNTDEPPMVFKAQLFALTGVQPARQ KVMVKGGTLKDDDWGNIKIKNGMTLLMMGSADALPEEPSAKTVFVEDMTEEQLASAME LPCGLTNLGNTCYMNATVQCIRSVPELKDALKRYAGALRASGEMASAQYITAALRDLF DSMDKTSSSIPPIILLQFLHMAFPQFAEKGEQGQYLQQDANECWIQMMRVLQQKLEAI EDDSVKETDSSSASAATPSKKKSLIDQFFGVEFETTMKCTESEEEEVTKGKENQLQLS CFINQEVKYLFTGLKLRLQEEITKQSPTLQRNALYIKSSKISRLPAYLTIQMVRFFYK EKESVNAKVLKDVKFPLMLDMYELCTPELQEKMVSFRSKFKDLEDKKVNQQPNTSDKK SSPQKEVKYEPFSFADDIGSNNCGYYDLQAVLTHQGRSSSSGHYVSWVKRKQDEWIKF DDDKVSIVTPEDILRLSGGGDWHIAYVLLYGPRRVEIMEEESEQ" misc_feature 1984..2321 /note="similar to the complement of GenBank Accession Number G07320 which has been linked to chromosome map position 18p." misc_difference 2028 /note="GenBank Accession Number G07320" /replace="" BASE COUNT 756 a 475 c 514 g 736 t ORIGIN 1 gccgccgccg cagctgctcc tggtccccgt ccctttgccg ccctcgtcag gcccagctct 61 cctgcgccgc cgcctcccgc cgcgccccgc catgccgctc tactccgtta ctgtaaaatg 121 gggaaaggag aaatttgaag gtgtagaatt gaatacagat gaacctccaa tggtattcaa 181 ggctcagctg tttgcgttga ctggagtcca gcctgccaga cagaaagtta tggtgaaagg 241 aggaacgcta aaggatgatg attggggaaa catcaaaata aaaaacggaa tgactctact 301 aatgatgggg tcagcagatg ctcttccaga agaaccctca gccaaaactg tcttcgtaga 361 agacatgaca gaagaacagt tagcatctgc tatggagtta ccatgtggat tgacaaacct 421 tggtaacact tgttacatga atgccacagt tcagtgtatt cgttctgtgc ctgaactcaa 481 agatgccctt aaaaggtatg caggtgcctt gagagcttca ggggaaatgg cttcagcgca 541 gtatattact gcagccctta gagatttgtt tgattccatg gataaaactt cttccagtat 601 tccacctatt attctactgc agtttttgca catggctttc ccacagtttg ccgagaaagg 661 tgaacaagga cagtatcttc aacaggatgc taatgaatgt tggatacaaa tgatgcgagt 721 attgcaacag aaattggaag caatagagga tgattctgtt aaagagacag actcctcatc 781 tgcatcggca gcgacacctt ctaaaaagaa aagtttaatc gatcagttct tcggtgttga 841 gtttgaaact accatgaaat gtacagaatc tgaagaagaa gaagtcacca aaggaaagga 901 aaatcaactt cagcttagct gttttatcaa tcaggaagtc aagtatcttt ttacaggact 961 taaattgcga cttcaggaag aaatcaccaa acagtctcca acgttgcaaa gaaatgcctt 1021 gtatatcaaa tcttccaaga tcagccggct gcctgcttac ttgaccattc agatggttcg 1081 atttttttat aaagagaagg aatctgtgaa tgccaaagtt cttaaggatg ttaaatttcc 1141 tcttatgttg gatatgtatg aactgtgtac accagaactt caagagaaaa tggtgtcttt 1201 tcgatccaaa ttcaaggatc tagaagataa aaaagtgaat cagcagccaa atacaagtga 1261 caaaaagagt agtccccaga aagaagttaa gtatgaaccc ttttcttttg ctgatgatat 1321 tggctccaat aattgtggat actatgactt acaagcagta ctaacacacc agggaaggtc 1381 tagttcttca ggtcattatg tatcatgggt gaaaaggaaa caagatgaat ggattaagtt 1441 tgatgatgac aaagtcagca tcgtaacacc agaagatatc ttacggcttt ctggtggtgg 1501 agactggcat atcgcttacg ttctactcta tgggcctcgc agagttgaaa taatggaaga 1561 ggaaagtgaa cagtaatctt cattttagta tttatgctta gatgtgaaaa taaatgttat 1621 ttgttgatca tttctataat ccagagcttt agaggaagac acataggtgg gtttatgttt 1681 cacctcattt ggaacaaaag aggacagaag cagaccactc tgtgcaccaa cctaaaaaat 1741 tacagagaag agaaaattat ctttggattg tgctgcccta tataaaggtg gcagaaagac 1801 atttttaaaa agcttattat ttcttgcatt attttaaaaa gttcagagtt gaaatgcctt 1861 tcaaccattt ccttctgtgg tcatttttct tgctgccttt ttcacccaag attcagcagt 1921 cagatgttta ctgcacacct attacctatt atttgctgtt cttgcatggt tcaaaccacc 1981 attctgtagc cacccatcct ttgccttatc taacaaacat ttttccagga aggtggaaaa 2041 ggaagtgttg ctctcattgt gtgactcagt gctgctgtcc atcccatgga aacatgggca 2101 caatcaagta tttgtccagc ctattgcagg cttttcctga ctttaaaata aattgtgatc 2161 aataatagta cctttgatta tacatttatt attgtgtctc tctctgatgt actgtggatt 2221 gtacatttaa ctttggaatg gctttgtaat aatcagtctt aagaaaatgt tgacaagctc 2281 tggttgctta tttttagaaa atgaggacat ttaataataa taaaaaaaaa gggattaata 2341 gcttttgacc tcaagtcttt tgtcttctga gtgttggagc ttggctgaag acatgtttaa 2401 tactgtacaa tttctgaaga tggttattaa cactgtgctg ttaagcatcc atttaaaaat 2461 atgttatctt ctttgcctgc c // LOCUS HSU30894 2657 bp mRNA PRI 02-FEB-1996 DEFINITION Human N-sulphoglucosamine sulphohydrolase mRNA, complete cds. ACCESSION U30894 NID g1173542 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2657) AUTHORS Scott,H.S., Blanch,L., Guo,X.H., Freeman,C., Orsborn,A., Baker,E., Sutherland,G.R., Morris,C.P. and Hopwood,J.J. TITLE Cloning of the sulphamidase gene and identification of mutations in Sanfilippo A syndrome JOURNAL Nature Genet. 11 (4), 465-467 (1995) MEDLINE 96083602 REFERENCE 2 (bases 1 to 2657) AUTHORS Scott,H.S., Blanch,L., Guo,X.-H., Freeman,C., Orsborn,A., Baker,E., Sutherland,G.R., Morris,C.P. and Hopwood,J.J. TITLE Direct Submission JOURNAL Submitted (03-JUL-1995) Donald Anson, Chemical Pathology, Women and Children's Hospital, 72 King William Road, North Adelaide, SA 5006, Australia FEATURES Location/Qualifiers source 1..2657 /organism="Homo sapiens" /db_xref="taxon:9606" /map="17q25.3" /chromosome="17" CDS 13..1521 /EC_number="3.10.1.1" /note="sulphamidase" /codon_start=1 /product="N-sulphoglucosamine sulphohydrolase" /db_xref="PID:g1173543" /translation="MSCPVPACCALLLVLGLCRARPRNALLLLADDGGFESGAYNNSA IATPHLDALARRSLLFRNAFTSVSSCSPSRASLLTGLPQHQNGMYGLHQDVHHFNSFD KVRSLPLLLSQAGVRTGIIGKKHVGPETVYPFDFAYTEENGSVLQVGRNITRIKLLVR KFLQTQDDRPFFLYVAFHDPHRCGHSQPQYGTFCEKFGNGESGMGRIPDWTPQAYDPL DVLVPYFVPNTPAARADLAAQYTTVGRMDQGVGLVLQELRDAGVLNDTLVIFTSDNGI PFPSGRTNLYWPGTAEPLLVSSPEHPKRWGQVSEAYVSLLDLTPTILDWFSIPYPSYA IFGSKTIHLTGRSLLPALEAEPLWATVFGSQSHHEVTMSYPMRSVQHRHFRLVHNLNF KMPFPIDQDFYVSPTFQDLLNRTTAGQPTGWYKDLRHYYYRARWELYDRSRDPHETQN LATDPRFAQLLEMLRDQLAKWQWETHDPWVCAPDGVLEEKLSPQCQPLHNEL" BASE COUNT 460 a 897 c 759 g 541 t ORIGIN 1 gaattccggg ccatgagctg ccccgtgccc gcctgctgcg cgctgctgct agtcctgggg 61 ctctgccggg cgcgtccccg gaacgcactg ctgctcctcg cggatgacgg aggctttgag 121 agtggcgcgt acaacaacag cgccatcgcc accccgcacc tggacgcctt ggcccgccgc 181 agcctcctct ttcgcaatgc cttcacctcg gtcagcagct gctctcccag ccgcgccagc 241 ctcctcactg gcctgcccca gcatcagaat gggatgtacg ggctgcacca ggacgtgcac 301 cacttcaact ccttcgacaa ggtgcggagc ctgccgctgc tgctcagcca agctggtgtg 361 cgcacaggca tcatcgggaa gaagcacgtg gggccggaga ccgtgtaccc gtttgacttt 421 gcgtacacgg aggagaatgg ctccgtcctc caggtggggc ggaacatcac tagaattaag 481 ctgctcgtcc ggaaattcct gcagactcag gatgaccggc ctttcttcct ctacgtcgcc 541 ttccacgacc cccaccgctg tgggcactcc cagccccagt acggaacctt ctgtgagaag 601 tttggcaacg gagagagcgg catgggtcgt atcccagact ggacccccca ggcctacgac 661 ccactggacg tgctggtgcc ttacttcgtc cccaacaccc cggcagcccg agccgacctg 721 gccgctcagt acaccaccgt cggccgcatg gaccaaggag ttggactggt gctccaggag 781 ctgcgtgacg ccggtgtcct gaacgacaca ctggtgatct tcacgtccga caacgggatc 841 cccttcccca gcggcaggac caacctgtac tggccgggca ctgctgaacc cttactggtg 901 tcatccccgg agcacccaaa acgctggggc caagtcagcg aggcctacgt gagcctccta 961 gacctcacgc ccaccatctt ggattggttc tcgatcccgt accccagcta cgccatcttt 1021 ggctcgaaga ccatccacct cactggccgg tccctcctgc cggcgctgga ggccgagccc 1081 ctctgggcca ccgtctttgg cagccagagc caccacgagg tcaccatgtc ctaccccatg 1141 cgctccgtgc agcaccggca cttccgcctc gtgcacaacc tcaacttcaa gatgcccttt 1201 cccatcgacc aggacttcta cgtctcaccc accttccagg acctcctgaa ccgcaccaca 1261 gctggtcagc ccacgggctg gtacaaggac ctccgtcatt actactaccg ggcgcgctgg 1321 gagctctacg accggagccg ggacccccac gagacccaga acctggccac cgacccgcgc 1381 tttgctcagc ttctggagat gcttcgggac cagctggcca agtggcagtg ggagacccac 1441 gacccctggg tgtgcgcccc cgacggcgtc ctggaggaga agctctctcc ccagtgccag 1501 cccctccaca atgagctgtg accatcccag gaggcctgtg cacacatccc aggcatgtcc 1561 cagacacatc ccacacgtgt ccgtgtggcc ggccagcctg gggagtagtg gcaacagccc 1621 ttccgtccac actcccatcc aaggagggtt cttccttcct gtggggtcac tcttgccatt 1681 gcctggaggg ggaccagagc atgtgaccag agcatgtgcc cagcccctcc accaccaggg 1741 gcactgccgt catggcaggg gacacagttg tccttgtgtc tgaaccatgt cccagcacgg 1801 gaattctaga catacgtggt ctgcggacag ggcagcgccc ccagcccatg acaagggagt 1861 cttgttttct ggcttggttt ggggacctgc aaatgggagg cctgaggccc tcttcaggct 1921 ttggcagcca cagatacttc tgaacccttc acagagagca ggcaggggct tcggtgccgc 1981 gtgggcagta cgcaggtccc accgacactc acctgggagc acggcgcctg gctcttacca 2041 gcgtctggcc tagaggaagc ctttgagcga cctttgggca ggtttctgct tcttctgttt 2101 tgcccatggt caagtccctg ttccccaggc aggtttcagc tgattggcag caggctccct 2161 gagtgatgag cttgaacctg tggtgtttct gggcagaagc ttatcttttt tgagagtgtc 2221 cgaagatgaa ggcatggcga tgcccgtcct ctggcttggg ttaattcttc ggtgacactg 2281 gcattgctgg gtggtgatgc ccgtcctctg gcttgggtta attcttcggt gacactggcg 2341 ttgctgggtg gcaatgcccg tcctctggct tgggttaatt cttcggtgac actggcgttg 2401 ctgggtggcg atgcccgtcc tctggcttgg gttaattctt ggatgacgtc ggcgttgctg 2461 ggagaatgtg ccgttcctgc cctgcctcca cccacctcgg gagcagaagc ccggcctgga 2521 cacccctcgg cctggacacc cctcgaagga gagggcgctt ccttgagtag gtgggctccc 2581 cttgcccttc cctccctatc actccatact ggggtgggct ggaggaggcc acaggccagc 2641 tattgtaaaa gcttttt // LOCUS HSU30930 2448 bp mRNA PRI 10-JUL-1996 DEFINITION Human UDP-Galactose ceramide galactosyl transferase (CGT) mRNA, complete cds. ACCESSION U30930 NID g1407589 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2448) AUTHORS Bosio,A., Binczek,E., Le Beau,M.M., Fernald,A.A. and Stoffel,W. TITLE The human gene CGT encoding the UDP-galactose ceramide galactosyl transferase (cerebroside synthase): cloning, characterization, and assignment to human chromosome 4, band q26 JOURNAL Genomics 34 (1), 69-75 (1996) MEDLINE 96299661 REFERENCE 2 (bases 1 to 2448) AUTHORS Stoffel,W. TITLE Direct Submission JOURNAL Submitted (06-JUL-1995) Wilhelm Stoffel, Department of Biochemistry, Medical Faculty University of Cologne, Joseph-Stelzmann-Strasse 52, Cologne, Nordrhein-Westfalen 50931, Federal Republic of Germany FEATURES Location/Qualifiers source 1..2448 /organism="Homo sapiens" /db_xref="taxon:9606" /map="4q26" /chromosome="4" gene 516..2141 /gene="CGT" CDS 516..2141 /gene="CGT" /EC_number="2.4.1.45" /note="cerebroside synthase" /codon_start=1 /product="UDP-Galactose ceramide galactosyl transferase" /db_xref="PID:g1407590" /translation="MKSYTPYFILLWSAVGIAKAAKIIIVPPIMFESHMYIFKTLASA LHERGHHTVFLLSEGRDIAPSNHYSLQRYPGIFNSTTSDAFLQSKMRNIFSGRLTAIE LFDILDHYTKNCDLMVGNHALIQGLKKEKFDLLLVDPNDMCGFVIAHLLGVKYAVFST GLWYPAEVGAPAPLAYVPEFNSLLTDRMNLLQRMKNTGVYLISRLGVSFLVLPKYERI MQKYNLLPEKSMYDLVHGSSLWMLCTDVALEFPRPTLPNVVYVGGILTKPASPLPEDL QRWVNGANEHGFVLVSFGAGVKYLSEDIANKLAGALGRLPQKVIWRFSGPKPKNLGNN TKLIEWLPQNDLLGHSKIKAFLSHGGLNSIFETMYHGVPVVGIPLFGDHYDTMTRVQA KGMGILLEWKTVTEKELYEALVKVINNPSYRQRAQKLSEIHKDQPGHPVNRTIYWIDY IIRHNGAHHLRAAVHQISFCQYFLLDIAFVLLLGAALLYFLLSWVTKFIYRKIKSLWS RNKHSTVNGHYHNGILNGKYKRNGHIKHEKKVK" BASE COUNT 683 a 482 c 522 g 761 t ORIGIN 1 gggttggttc caagtctttg ctattgtgaa tagtgccgca ataaacatac gtgtgcatgt 61 gtctttatag cagcatgatt tatagtcctt tgggtatata cccagtaatg ggatggctgg 121 gttctagatc cctgaggaat cgccacattg acttgcacaa tggttgaact agtttacagt 181 cccaccaaca gtgtaaaagt gttcctattt ctccacatcc tctccggcac ctgttgtttc 241 ctgacttttt aatgattgcc attctaactg gtgtgagatg atatctcatt gtggttttga 301 tttgcatttc tctgatggcc agtgatggtg agcatttttt catgtgtttt ttggctgcat 361 aaatgtcttc ttttgagaag tgtctgttca tgattttttt ttttagacaa agtattagat 421 cagatatttt tggggaaaaa tgctttgtga ttgcttgttt tgaatggtga gcattgtatt 481 ttgttttaag ttgttttctg gttgttatta cagctatgaa gtcttacact ccatatttca 541 ttctcctgtg gagtgctgtt gggatagcga aggctgccaa aatcatcatc gtgccgccaa 601 ttatgtttga aagccatatg tacattttca agacgctagc ctcagccttg cacgagagag 661 gccaccatac agtgttcctc ctctctgaag gcagagacat cgccccatct aatcattaca 721 gcctccagcg ctacccaggg atctttaaca gtaccacctc agatgctttc ctacagtcca 781 agatgcggaa tattttctct gggagattga cagcaatcga actgtttgac atactggatc 841 actatactaa gaactgtgac ctgatggttg gcaaccatgc cctgatccag ggtctgaaga 901 aagaaaaatt tgacctgctg ctggtggacc ctaatgatat gtgtggattt gtgatagctc 961 atcttttagg ggttaaatat gctgtatttt caactggcct ttggtatcct gctgaagtgg 1021 gtgctcctgc tccattagca tacgtcccag agtttaactc actcctcaca gaccgcatga 1081 acttgctgca aaggatgaaa aataccggtg tttacctcat ttccagatta ggggtcagct 1141 ttctggttct tcccaaatat gaaaggataa tgcagaagta caacctgctg ccggagaagt 1201 ccatgtatga tttggttcat gggtccagcc tgtggatgct gtgtactgac gtagcactgg 1261 aattcccaag acccactctg cctaatgttg tttatgtagg aggaatccta accaaaccag 1321 ccagcccact accagaagat ctccaaagat gggtaaatgg tgctaatgaa catggctttg 1381 tcttggtgtc ttttggagct ggtgtcaagt atctgtcaga agacattgct aacaaactgg 1441 caggagctct ggggagattg cctcaaaaag tgatttggag gttttctgga cccaaaccaa 1501 agaatctagg aaacaacact aaactcatag aatggttacc acaaaatgac ctgcttgggc 1561 attcaaagat taaagccttc ctgagccatg gtggtttgaa cagtattttt gaaactatgt 1621 atcatggtgt gcctgtagtg ggaattccac tctttggaga ccattatgat actatgacca 1681 gagtacaggc aaaaggcatg gggatattgc tagaatggaa gacagttact gaaaaagagc 1741 tctatgaagc actagtgaag gttatcaata atcccagcta ccgtcagagg gctcagaagc 1801 tttcggaaat tcacaaggat caacctggtc accctgtcaa tcgaactatc tattggatag 1861 attatattat tcgtcacaat ggagcccatc acctacgtgc cgctgtccat cagatctcct 1921 tttgtcagta ttttttactg gatattgcct ttgtgctttt gcttggtgct gccttgttat 1981 actttctctt gtcttgggtg acaaaattta tctacagaaa aatcaaaagt ctgtggtcta 2041 gaaataagca tagcacagtt aatggacatt accacaatgg aatcctcaat ggcaagtaca 2101 aaagaaatgg ccatattaaa catgaaaaga aagtgaaatg agccaacagc ccaggtgata 2161 gaaataaatt ggttcactca ttgaattttt attgctatta tttagtctaa cagctactaa 2221 aagtaaaaca tcagtaaaca attctaacat gcccttatga gactactaat gaaattctgt 2281 ggaattaaga tggctgtaaa aagcacaaac ctaaaatgca gaaatgtatt ttattcaaat 2341 actgatgtag agagttttgg cactgaacct tttagaagcc ttaattattt aaatcaattc 2401 agtgactgtg tcagacctta gttttaaatc ttgatatgtg cgtgtccc // LOCUS HSU31110 3312 bp mRNA PRI 02-APR-1996 DEFINITION Human alternatively spliced trp-1 protein and unspliced trp-1 protein (trp-1) mRNA, complete cds. ACCESSION U31110 NID g1072042 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3312) AUTHORS Zhu,X., Chu,P.B., Peyton,M. and Birnbaumer,L. TITLE Molecular cloning of a widely expressed human homologue for the Drosophila trp gene JOURNAL FEBS Lett. 373 (3), 193-198 (1995) MEDLINE 96033971 REFERENCE 2 (bases 1 to 3312) AUTHORS Zhu,X., Chu,P.B. and Birnbaumer,L. TITLE Direct Submission JOURNAL Submitted (06-JUL-1995) Xi Zhu, Anesthesiology, UCLA School of Medicine, 10833 Le Conte Ave., Los Angeles, CA 90095-1778, USA FEATURES Location/Qualifiers source 1..3312 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA 1..3312 /gene="trp-1" /product="unspliced trp-1 protein" mRNA join(1..1062,1165..3312) /gene="trp-1" /note="This alternatively spliced mRNA is often found in clones from several libraries" /product="alternatively spliced trp-1 protein" gene 1..3312 /gene="trp-1" CDS join(736..1062,1165..3117) /gene="trp-1" /note="This is a shorter human trp-1 product; similar to Drosophila transient receptor, Swiss-Prot Accession Number P19334; Method: conceptual translation supplied by author" /codon_start=1 /product="alternatively spliced trp-1 protein" /db_xref="PID:g1072043" /translation="MMAALYPSTDLSGASSSSLPSSPSSSSPNEVMALKDVREVKEEN TLNEKLFLLACDKGDYYMVKKILEENSSGDLNINCVDVLGRNAVTITIENENLDILQL LLDYGCQKLMERIQNPEYSTTMDVAPVILAAHRNNYEILTMLLKQDVSLPKPHAVGCE CTLCSAKNKKDSLRHSRFRLDIYRCLASPALIMLTEEDPILRAFELSADLKELSLVEV EFRNDYEELARQCKMFAKDLLAQARNSRELEVILNHTSSDEPLDKRGLLEERMNLSRL KLAIKYNQKEFVSQSNCQQFLNTVWFGQMSGYRRKPTCKKIMTVLTVGIFWPVLSLCY LIAPKSQFGRIIHTPFMKFIIHGASYFTFLLLLNLYSLVYNEDKKNTMGPALERIDYL LILWIIGMIWSDIKRLWYEGLEDFLEESRNQLSFVMNSLYLATFALKVVAHNKFHDFA DRKDWDAFHPTLVAEGLFAFANVLSYLRLFFMYTTSSILGPLQISMGQMLQDFGKFLG MFLLVLFSFTIGLTQLYDKGYTSKEQKDCVGIFCEQQSNDTFHSFIGTCFALFWYIFS LAHVAIFVTRFSYGEELQSFVGAVIVGTYNVVVVIVLTKLLVAMLHKSFQLIANHEDK EWKFARAKLWLSYFDDKCTLPPPFNIIPSPKTICYMISSLSKWICSHTSKGKVKRQNS LKEWRNLKQKRDENYQKVMCCLVHRYLTSMRQKMQSTDQATVENLNELRQDLSKFRNE IRDLLGFRTSKYAMFYPRN" CDS 736..3117 /gene="trp-1" /note="This is the predicted product from the complete human trp-1 mRNA; similar to Drosophila transient receptor, Swiss-Prot Accession Number P19334; Method: conceptual translation supplied by author" /codon_start=1 /product="unspliced trp-1 protein" /db_xref="PID:g1072044" /translation="MMAALYPSTDLSGASSSSLPSSPSSSSPNEVMALKDVREVKEEN TLNEKLFLLACDKGDYYMVKKILEENSSGDLNINCVDVLGRNAVTITIENENLDILQL LLDYGCQSADALLVAIDSEVVGAVDILLNHRPKRSSRPTIVKLMERIQNPEYSTTMDV APVILAAHRNNYEILTMLLKQDVSLPKPHAVGCECTLCSAKNKKDSLRHSRFRLDIYR CLASPALIMLTEEDPILRAFELSADLKELSLVEVEFRNDYEELARQCKMFAKDLLAQA RNSRELEVILNHTSSDEPLDKRGLLEERMNLSRLKLAIKYNQKEFVSQSNCQQFLNTV WFGQMSGYRRKPTCKKIMTVLTVGIFWPVLSLCYLIAPKSQFGRIIHTPFMKFIIHGA SYFTFLLLLNLYSLVYNEDKKNTMGPALERIDYLLILWIIGMIWSDIKRLWYEGLEDF LEESRNQLSFVMNSLYLATFALKVVAHNKFHDFADRKDWDAFHPTLVAEGLFAFANVL SYLRLFFMYTTSSILGPLQISMGQMLQDFGKFLGMFLLVLFSFTIGLTQLYDKGYTSK EQKDCVGIFCEQQSNDTFHSFIGTCFALFWYIFSLAHVAIFVTRFSYGEELQSFVGAV IVGTYNVVVVIVLTKLLVAMLHKSFQLIANHEDKEWKFARAKLWLSYFDDKCTLPPPF NIIPSPKTICYMISSLSKWICSHTSKGKVKRQNSLKEWRNLKQKRDENYQKVMCCLVH RYLTSMRQKMQSTDQATVENLNELRQDLSKFRNEIRDLLGFRTSKYAMFYPRN" BASE COUNT 884 a 700 c 791 g 937 t ORIGIN 1 gggacgatta ttaggaacta gtcagttgca aagctcatta tgatggtatt actatgaaga 61 agattattac aaatgcatgg gctgtgacga taacgttgta gatgtggtcg ttacccagaa 121 ggttgcctgg ctggcccagc tcggctcgaa taaggaggct tagggctgtg cctaggactc 181 cagctcatgc gccgaataat aggtatagtg ttccaatgtc tttgtggttt gtagagaata 241 gtcaacggtc ggcgaacatc tcctcggagc gcagctgggc cagcggttcc cacagccctg 301 gagcccaacg tgcgcagaag cgcctcttgg agctcctctc ccacagatct ctcgtcctct 361 tcctgggcta ggccgcccca ggcgcgggcc ctgcgactcc tggcacggcc ccgtgctcgg 421 ctgccgccct ggcgcgcgcc acactgtcgt ccccggacgg gcgcggaccg gctcggccgg 481 ggcgccggcg gctggggagg ggtcgctggc cccggggccg cgcatgcgcc gccaccaact 541 tggggctgtc agtggagggc gagtgctggt tctcagggga ggcgacgcct tcgggccaac 601 gggcctcgag ccgaggcagc agtgggaacg actcatcctt tttccagccc tggggcgtgg 661 ctggggtcgg ggtcggggtc ggggccggtg ggggccccgc ccccgtctcc tggcctgccc 721 ccttcatggg ccgcgatgat ggcggccctg tacccgagca cggacctctc gggcgcctcc 781 tcctcctccc tgccttcctc tccatcctct tcctcgccga acgaggtgat ggcgctgaag 841 gatgtgcggg aggtgaagga ggagaatacg ctgaatgaga agcttttctt gctggcgtgc 901 gacaagggtg actattatat ggttaaaaag attttggagg aaaacagttc aggtgacttg 961 aacataaatt gcgtagatgt gcttgggaga aatgctgtta ccataactat tgaaaacgaa 1021 aacttggata tactgcagct tcttttggac tacggttgtc agtctgcaga tgcacttttg 1081 gtggcaatcg actctgaagt agtgggagct gttgatatac tacttaatca tcgaccaaaa 1141 cgatcatcaa gaccaactat agtaaaacta atggaacgaa ttcagaatcc tgagtattca 1201 acaactatgg atgttgcacc tgtcatttta gctgctcatc gtaacaacta tgaaattctt 1261 acaatgctct taaaacagga tgtatctcta cccaagcccc atgcagttgg ctgtgaatgc 1321 acattgtgtt ctgcaaaaaa caaaaaggat agcctccggc attccaggtt tcgtcttgat 1381 atatatcgat gtttggccag tccagctcta ataatgttaa cagaggagga tccaattctg 1441 agagcatttg aacttagtgc tgatttaaaa gaactaagtc ttgtggaggt ggaattcagg 1501 aatgattatg aggaactagc ccggcaatgt aaaatgtttg ctaaggattt acttgcacaa 1561 gcccggaatt ctcgtgaatt ggaagttatt ctaaaccata cgtctagtga cgagcctctt 1621 gacaaacggg gattattaga agaaagaatg aatttaagtc gtctaaaact tgctatcaaa 1681 tataaccaga aagagtttgt ctcccagtct aactgccagc agttcctgaa cactgtttgg 1741 tttggacaga tgtcgggtta ccgacgcaag cccacctgta agaagataat gactgttttg 1801 acagtaggca tcttttggcc agttttgtca ctttgttatt tgatagctcc caaatctcag 1861 tttggcagaa tcattcacac accttttatg aaatttatca ttcatggagc atcatatttc 1921 acatttctgc tgttgcttaa tctatactct cttgtctaca atgaggataa gaaaaacaca 1981 atggggccag cccttgaaag aatagactat cttcttattc tgtggattat tgggatgatt 2041 tggtcagaca ttaaaagact ctggtatgaa gggttggaag actttttaga agaatctcgt 2101 aatcaactca gttttgtcat gaattctctt tatttggcaa cctttgccct caaagtggtt 2161 gctcacaaca agtttcatga ttttgctgat cggaaggatt gggatgcatt ccatcctaca 2221 ctggtggcag aagggctttt tgcatttgca aatgttctaa gttatcttcg tctctttttt 2281 atgtatacaa ccagctctat cttgggtcca ttacagattt caatgggaca gatgttacaa 2341 gattttggaa aatttcttgg gatgtttctt cttgttttgt tttctttcac aattggactg 2401 acacaactgt atgataaagg atatacttca aaggagcaga aggactgtgt aggcatcttc 2461 tgtgaacagc aaagcaatga taccttccat tcgttcattg gcacctgctt tgctttgttc 2521 tggtatattt tctccttagc gcatgtggca atctttgtca caagatttag ctatggagaa 2581 gaactgcagt cctttgtggg agctgtcatt gttggtacat acaatgtcgt ggttgtgatt 2641 gtgcttacca aactgctggt ggcaatgctt cataaaagct ttcagttgat agcaaatcat 2701 gaagacaaag aatggaagtt tgctcgagca aaattatggc ttagctactt tgatgacaaa 2761 tgtacgttac ctccaccttt caacatcatt ccctcaccaa agactatctg ctatatgatt 2821 agtagcctca gtaagtggat ttgctctcat acatcaaaag gcaaggtcaa acggcaaaac 2881 agtttaaagg aatggaggaa tttgaaacag aagagagatg aaaactatca aaaagtgatg 2941 tgctgcctag tgcatcgtta cttgacttcc atgagacaga agatgcaaag tacagatcag 3001 gcaactgtgg aaaatctaaa cgaactgcgc caagatctgt caaaattccg aaatgaaata 3061 agggatttac ttggctttcg gacttctaaa tatgctatgt tttatccaag aaattaacca 3121 ttttctaaat catggagcga ataattttca ataacagatc caaaagacta tattgcataa 3181 cttgcaatga aattaatgag atatatattg aaataaagaa ttatgtaaaa gccgttcctt 3241 taaaatattt atagcattaa atatatgtta tgtaaagtgt gtatatagaa ttagtttttt 3301 aaaccttctg tt // LOCUS HSU31176 2248 bp mRNA PRI 09-APR-1996 DEFINITION Human hERV1 mRNA, complete cds. ACCESSION U31176 NID g950170 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Lisowsky,T. TITLE Dual function of a new nuclear gene for oxidative phosphorylation and vegetative growth in yeast JOURNAL Mol. Gen. Genet. 232 (1), 58-64 (1992) MEDLINE 92204135 REFERENCE 2 (bases 1 to 2248) AUTHORS Lisowsky,T., Weinstat-Saslow,D.L., Barton,N., Reeders,S.T. and Schneider,M.C. TITLE A new human gene located in the PKD1 region of chromosome 16 is a functional homologue to ERV1 of yeast JOURNAL Genomics 29 (3), 690-697 (1995) MEDLINE 96121380 REFERENCE 3 (bases 1 to 2248) AUTHORS Schneider,M.C. TITLE Direct Submission JOURNAL Submitted (06-JUL-1995) Michael C. Schneider, Renal Division, Brigham and Women's, 75 Francis Street, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..2248 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="renal cyst lining" /clone="cosmid cDeb4" /chromosome="16" /map="16p13.3" gene 550..981 /gene="hERV1" CDS 550..981 /gene="hERV1" /note="similar to yeast Erv1p, Swiss-Prot Accession Number P27882" /codon_start=1 /db_xref="PID:g950171" /translation="MGQLARHGWIGAGHSLKCITARACCVSVGWSPGHCLWGPALPLL GLEQKWRCPQQVPTGPLLSGDPKELQLNCRGGKEEQPGLPLDIQDVASCPPHTLAPHS SHGKTAGLAGHPCACPWRPGHCLPTMQPGMPLPPMALCCSL" BASE COUNT 444 a 666 c 715 g 423 t ORIGIN 1 aagcgcgacc gacgcggggc cggggcgcgg ggcggagaga cgcggccgcc tcggcctcga 61 cgccagccca ggcgccgacc tccgattctc ctgtcgccga ggacgcctcc cggaggcggc 121 gtgccgggcc tgcgtcgact tcaagacgtg gatgcggacg cagcagaagc gggacaccaa 181 gtttagggag gactgcccgc cggatcgcga ggaactgggc cgccacagct gggctgtcct 241 ccacaccctg gccgcctact accccgacct gcccacccca gaacagcagc aagacatggc 301 ccagttcata catttatttt ctaagtttta cccctgtgag gagtgtgctg aagacctaag 361 aaaaaggctg tgcaggaacc acccagacac ccgcacccgg gcatgcttca cacagtggct 421 gtgccacctg cacaatgaag tgaaccgcaa gctgggcaag cctgacttcg actgctcaaa 481 agtggatgag cgctggcgcg acggctggaa ggatggctcc tgtgactaga gggtggtcag 541 ccagagctca tgggacagct agccaggcat ggttggatag gggcagggca ctcattaaag 601 tgcatcacag ccagagcctg ttgtgtctca gttgggtggt ccccaggaca ctgcctgtgg 661 ggacctgccc tgcccctctt aggtttggag cagaagtgga ggtgcccaca gcaggtaccc 721 actggccccc tcctcagtgg agaccccaag gagctgcagc tgaactgcag gggagggaag 781 gaggagcagc ctgggctgcc ccttgacatt caggatgtag cttcctgccc accgcatacc 841 ctggcgcctc actcctcaca cgggaagaca gcgggcctgg ctgggcatcc ctgtgcctgt 901 ccctggcggc caggccattg ccttcccact atgcagccag ggatgcccct gccccccatg 961 gctctgtgct gctcacttta gggggctcaa ttctccactc tgctcagtcc tacagggaaa 1021 gctcaggtcg ggtctttctg agggtccacc agccatccta ccctctccct gcctggcaca 1081 tgcctgccag cgttgtgtca tgcctgtcca caggggattc gtggggctca cttcatcaga 1141 gtttgaagcc caaatgaaac gctgaagtga ctgagaacct ggcttcagta tattttctgc 1201 tggggcttaa taaagcagta gacagggctt gttccatccc tctgtgctca gctgcatttc 1261 ctgctggggt cctggttcct caggagagag agaccacagg gtgagagtga gccaggaaca 1321 gcaaggacgt tgattggttg gggcaggggg gccagagtag ctgatgtagg agtactggga 1381 ggccagacgg cacgaggtct ccaaggcccc agcaaagcca tggcttctac ccctagttcc 1441 cctgacagga agttcttggc gggtttggag ccaggggatg gcatggagtg atgtggcttt 1501 gaagggtcct ctggctgctg agctgggatg aggcaggtaa gggtggaaca ggaggggtgg 1561 ggaggaagcc ggggcagtca ccgagtgacc accaagagga agacccaccc cacggcgggg 1621 acagatgcgg ggtacgttaa agggagagcc agagaactca tggggtgagg atggagtccg 1681 aggagactgc tgggagccgc cgtgtgggtc agagatggag aaggctgagt gcagcaaggt 1741 ggggggtgac tgggacccag cctttgggcc tccccagcca gagcagccca gcaacagtgt 1801 gtcctgtggt cataaaactc cagggacctc tatcctccag gagtctcagc ctttccctgg 1861 gcgcaggccc accttggcat ggccgcctca ggcctccatg gagggagctg ctatgtcccc 1921 accagattgg ccccgtgcgg ctgctggctt ctgtagaggc tgcccagagg ggccaggtgg 1981 cacaaataag agaggggaga tggggggcag ccaggagagg aggtgtccct tcctcgccca 2041 gacacagcgc gcttctctct ggcctttccc gaggcctgtg agtgcctcag gaagcagctg 2101 ggccctctgg gaaggctgtg ttcagcttag gaacataccg cctgtatctg ctgtccctcc 2161 cctgcccccc tgcccccccc accgccttcc ctttttccct gtcttcctta aagtttcact 2221 cctgaataaa acttcacttt gccttaaa // LOCUS HSU31202 1557 bp DNA PRI 14-DEC-1995 DEFINITION Human noggin (NOGGIN) gene, complete cds. ACCESSION U31202 NID g1117816 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1557) AUTHORS Valenzuela,D.M., Economides,A.N., Rojas,E., Lamb,T.M., Nunez,L., Jones,P., Ip,N.Y., Espinosa,R., Brannan,C.I., Gilbert,D.J., Copeland,N.G., Jenkins,N.A., LeBeau,M.M., Harland,R.M. and Yancopoulos,G.D. TITLE Identification of mammalian noggin and its expression in the adult nervous system JOURNAL J. Neurosci. 15 (9), 6077-6084 (1995) MEDLINE 95395592 REFERENCE 2 (bases 1 to 1557) AUTHORS Valenzuela,D.M. TITLE Direct Submission JOURNAL Submitted (07-JUL-1995) David M. Valenzuela, Discovery Group, Regeneron Pharmaceuticals, Inc., 777 Old Saw Mill River Rd., Tarrytown, NY 10591, USA FEATURES Location/Qualifiers source 1..1557 /organism="Homo sapiens" /db_xref="taxon:9606" /map="17q22" /chromosome="17" gene 812..1510 /gene="NOGGIN" CDS 812..1510 /gene="NOGGIN" /codon_start=1 /product="noggin" /db_xref="PID:g1117817" /translation="MERCPSLGVTLYALVVVLGLRATPAGGQHYLHIRPAPSDNLPLV DLIEHPDPIFDPKEKDLNETLLRSLLGGHYDPGFMATSPPEDRPGGGGGAAGGAEDLA ELDQLLRQRPSGAMPSEIKGLEFSEGLAQGKKQRLSKKLRRKLQMWLWSQTFCPVLYA WNDLGSRFWPRYVKVGSCFSKRSCSVPEGMVCKPSKSVHLTVLRWRCQRRGGQRCGWI PIQYPIISECKCSC" BASE COUNT 242 a 521 c 580 g 213 t 1 others ORIGIN 1 gagctccggc gggtcagccg gactgtcggc ttcccggggc atctgggtcc ggcggggcac 61 agccctgggc gctgccgaag ccgccgccgc cgcctccgcg gcgagtacag gcggcttccc 121 ccggagcctg tgcagctcca gctcctcggg ggtggagaag tggggggtgg gggtgatgta 181 tggggggaag aagggggagg ggccaacccc gagagagtca gtggtttcca tggtgatgga 241 gctgaaagtg caggaaattt aaaggcttgg accctgcgag acagacaaac cggtgccaac 301 gtgcgcggac gccgccgccg ccgccgccgc tggagtccgc cgggcagagc cggccgcgga 361 gcccggagca ggcggaggga agtgccccta gaaccagctc agccagcggc gcttgcacag 421 agcggccggn cgaagagcag cgagaggagg aggggagagc ggctcgtcca cgcgccctgc 481 gccgccgccg gcccgggaag gcagcgagga gccggcgcct cccgcgcccc gcggtcgccc 541 tggagtaatt tcggatgccc agccgcggcc gccttcccca gtagacccgg gagaggagtt 601 gcggccaact tgtgtgcctt tcttccgccc cggtgggagc cggcgctgcg cgaagggctc 661 tcccggcggc tcatgctgcc ggccctgcgc ctgcccagcc tcgggtgagc cgcctccgga 721 gagacggggg agcgcggcgg cgccgcgggc tcggcgtgct ctcctccggg gacgcgggac 781 gaagcagcag ccccgggcgc gcgccagagg catggagcgc tgccccagcc taggggtcac 841 cctctacgcc ctggtggtgg tcctggggct gcgggcgaca ccggccggcg gccagcacta 901 tctccacatc cgcccggcac ccagcgacaa cctgcccctg gtggacctca tcgaacaccc 961 agaccctatc tttgacccca aggaaaagga tctgaacgag acgctgctgc gctcgctgct 1021 cgggggccac tacgacccag gcttcatggc cacctcgccc cccgaggacc ggcccggcgg 1081 gggcgggggt gcagctgggg gcgcggagga cctggcggag ctggaccagc tgctgcggca 1141 gcggccgtcg ggggccatgc cgagcgagat caaagggcta gagttctccg agggcttggc 1201 ccagggcaag aagcagcgcc taagcaagaa gctgcggagg aagttacaga tgtggctgtg 1261 gtcgcagaca ttctgccccg tgctgtacgc gtggaacgac ctgggcagcc gcttttggcc 1321 gcgctacgtg aaggtgggca gctgcttcag taagcgctcg tgctccgtgc ccgagggcat 1381 ggtgtgcaag ccgtccaagt ccgtgcacct cacggtgctg cggtggcgct gtcagcggcg 1441 cgggggccag cgctgcggct ggattcccat ccagtacccc atcatttccg agtgcaagtg 1501 ctcgtgctag aactcggggg ccccctgccc gcacccggac acttgatcct cgagctc // LOCUS HSU31214 459 bp mRNA PRI 28-FEB-1996 DEFINITION Human (nmc) mRNA, partial cds. ACCESSION U31214 NID g1205981 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 459) AUTHORS van Groningen,J.J., Bloemers,H.P. and Swart,G.W. TITLE Identification of melanoma inhibitory activity and other differentially expressed messenger RNAs in human melanoma cell lines with different metastatic capacity by messenger RNA differential display JOURNAL Cancer Res. 55 (24), 6237-6243 (1995) MEDLINE 96105048 REFERENCE 2 (bases 1 to 459) AUTHORS von Groningen,J.J.M. TITLE Direct Submission JOURNAL Submitted (07-JUL-1995) Jan J.M. von Groningen, Biochemistry, University of Nijmegen, P.O. Box 9101, 6500 HB Nijmegen, The Netherlands FEATURES Location/Qualifiers source 1..459 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="melanoma 530" gene 100..219 /gene="nmc" CDS 100..219 /gene="nmc" /codon_start=1 /db_xref="PID:g1205982" /translation="MMEVFRQFIIVSPCDPLTYPETSLYYKIFPNTQMSLQAR" BASE COUNT 165 a 131 c 68 g 95 t ORIGIN 1 ccgaagaatg aaaagagagc tctaaccaga tggaacactg gaacattcca gtggaccctg 61 gaccattcca ggaaaactgg gacataggat cgtcccgcta tgatggaagt gttcagacag 121 tttataatag taagcccctg tgaccctctc acttaccccg agacctcact ttattacaag 181 atctttccaa atacccaaat gtccctgcaa gcccgttaaa taattcccta tgctaccctt 241 aataacatac aatgaccaca tagtgtgaga acttccaaca agcctcaaag tcccttgaga 301 ctccccaata cctaataagg catgcgaaat gttctcatga actaccccac aacacgccta 361 aaactcaaaa cacccaaaaa tacctcctcc aatgtcctga aacatgaacc caaaaagaga 421 cccacaataa actcgtgact tgtccccaaa aaaaaaaaa // LOCUS HSU31215 4074 bp mRNA PRI 16-FEB-1996 DEFINITION Human metabotropic glutamate receptor 1 alpha (mGluR1alpha) mRNA, complete cds. ACCESSION U31215 NID g945096 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4074) AUTHORS Desai,M.A., Burnett,J.P., Mayne,N.G. and Schoepp,D.D. TITLE Cloning and expression of a human metabotropic glutamate receptor 1 alpha: enhanced coupling on co-transfection with a glutamate transporter JOURNAL Mol. Pharmacol. 48 (4), 648-657 (1995) MEDLINE 96029774 REFERENCE 2 (bases 1 to 4074) AUTHORS Burnett,J.P. TITLE Direct Submission JOURNAL Submitted (07-JUL-1995) J. Paul Burnett, Technology Core, Eli Lilly and Company, Lilly Research Laboratories, Lilly Corporate Center, Indianapolis, IN 46285, USA FEATURES Location/Qualifiers source 1..4074 /organism="Homo sapiens" /db_xref="taxon:9606" gene 236..3820 /gene="mGluR1alpha" CDS 236..3820 /gene="mGluR1alpha" /codon_start=1 /product="metabotropic glutamate receptor 1 alpha" /db_xref="PID:g945097" /translation="MVGLLLFFFPAIFLEVSLLPRSPGRKVLLAGASSQRSVARMDGD VIIGALFSVHHQPPAEKVPERKCGEIREQYGIQRVEAMFHTLDKINADPVLLPNITLG SEIRDSCWHSSVALEQSIEFIRDSLISIRDEKDGINRCLPDGQSLPPGRTKKPIAGVI GPGSSSVAIQVQNLLQLFDIPQIAYSATSIDLSDKTLYKYFLRVVPSDTLQARAMLDI VKRYNWTYVSAVHTEGNYGESGMDAFKELAAQEGLCIAHSDKIYSNAGEKSFDRLLRK LRERLPKARVVVCFCEGMTVRGLLSAMRRLGVVGEFSLIGSDGWADRDEVIEGYEVEA NGGITIKLQSPEVRSFDDYFLKLRLDTNTRNPWFPEFWQHRFQCRLPGHLLENPNFKR ICTGNESLEENYVQDSKMGFVINAIYAMAHGLQNMHHALCPGHVGLCDAMKPIDGSKL LDFLIKSSFIGVSGEEVWFDEKGDAPGRYDIMNLQYTEANRYDYVHVGTWHEGVLNID DYKIQMNKSGVVRSVCSEPCLKGQIKVIRKGEVSCCWICTACKENEYVQDEFTCKACD LGWWPNADLTGCEPIPVRYLEWSNIEPIIAIAFSCLGILVTLFVTLIFVLYRDTPVVK SSSRELCYIILAGIFLGYVCPFTLIAKPTTTSCYLQRLLVGLSSAMCYSALVTKTNRI ARILAGSKKKICTRKPRFMSAWAQVIIASILISVQLTLVVTLIIMEPPMPILSYPSIK EVYLICNTSNLGVVAPLGYNGLLIMSCTYYAFKTRNVPANFNEAKYIAFTMYTTCIIW LAFVPIYFGSNYKIITTCFAVSLSVTVALGCMFTPKMYIIIAKPERNVRSAFTTSDVV RMHVGDGKLPCRSNTFLNIFRRKKAGAGNANSNGKSVSWSEPGGGQVPKGQHMWHRLS VHVKTNETACNQTAVIKPLTKSYQGSGKSLTFSDTSTKTLYNVEEEEDAQPIRFSPPG SPSMVVHRRVPSAATTPPLPPHLTAEETPLFLAEPALPKGLPPPLQQQQQPPPQQKSL MDQLQGVVSNFSTAIPDFHAVLAGPGGPGNGLRSLYPPPPPPQHLQMLPLQLSTFGEE LVSPPADDDDDSERFKLLQEYVYEHEREGNTEEDELEEEEEDLQAASKLTPDDSPALT PPSPFRDSVASGSSVPSSPVSESVLCTPPNVSYASVILRDYKQSSSTL" BASE COUNT 938 a 1137 c 1131 g 868 t ORIGIN 1 gaattccctt acaaacgcct ccagcttgta gaggcggtcg tggaggaccc agaggaggag 61 acgaagggga aggaggcggt ggtggaggag gcaaaggcct tggacgacca ttgttggcga 121 ggggcaccac tccgggagag gcggcgctgg gcgtcttggg ggtgcgcgcc gggagcctgc 181 agcgggacca gcgtgggaac gcggctggca ggctgtggac ctcgtcctca ccaccatggt 241 cgggctcctt ttgttttttt tcccagcgat ctttttggag gtgtcccttc tccccagaag 301 ccccggcagg aaagtgttgc tggcaggagc gtcgtctcag cgctcggtgg ccagaatgga 361 cggagatgtc atcattggag ccctcttctc agtccatcac cagcctccgg ccgagaaagt 421 gcccgagagg aagtgtgggg agatcaggga gcagtatggc atccagaggg tggaggccat 481 gttccacacg ttggataaga tcaacgcgga cccggtcctc ctgcccaaca tcaccctggg 541 cagtgagatc cgggactcct gctggcactc ttccgtggct ctggaacaga gcattgagtt 601 cattagggac tctctgattt ccattcgaga tgagaaggat gggatcaacc ggtgtctgcc 661 tgacggccag tccctccccc caggcaggac taagaagccc attgcgggag tgatcggtcc 721 cggctccagc tctgtagcca ttcaagtgca gaacctgctc cagctcttcg acatccccca 781 gatcgcttat tcagccacaa gcatcgacct gagtgacaaa actttgtaca aatacttcct 841 gagggttgtc ccttctgaca ctttgcaggc aagggccatg cttgacatag tcaaacgtta 901 caattggacc tatgtctctg cagtccacac ggaagggaat tatggggaga gcggaatgga 961 cgctttcaaa gagctggctg cccaggaagg cctctgtatc gcccattctg acaaaatcta 1021 cagcaacgct ggggagaaga gctttgaccg actcttgcgc aaactccgag agaggcttcc 1081 caaggctaga gtggtggtct gcttctgtga aggcatgaca gtgcgaggac tcctgagcgc 1141 catgcggcgc cttggcgtcg tgggcgagtt ctcactcatt ggaagtgatg gatgggcaga 1201 cagagatgaa gtcattgaag gttatgaggt ggaagccaac gggggaatca cgataaagct 1261 gcagtctcca gaggtcaggt catttgatga ttatttcctg aaactgaggc tggacactaa 1321 cacgaggaat ccctggttcc ctgagttctg gcaacatcgg ttccagtgcc gccttccagg 1381 acaccttctg gaaaatccca actttaaacg aatctgcaca ggcaatgaaa gcttagaaga 1441 aaactatgtc caggacagta agatggggtt tgtcatcaat gccatctatg ccatggcaca 1501 tgggctgcag aacatgcacc atgccctctg ccctggccac gtgggcctct gcgatgccat 1561 gaagcccatc gacggcagca agctgctgga cttcctcatc aagtcctcat tcattggagt 1621 atctggagag gaggtgtggt ttgatgagaa aggagacgct cctggaaggt atgatatcat 1681 gaatctgcag tacactgaag ctaatcgcta tgactatgtg cacgttggaa cctggcatga 1741 aggagtgctg aacattgatg attacaaaat ccagatgaac aagagtggag tggtgcggtc 1801 tgtgtgcagt gagccttgct taaagggcca gattaaggtt atacggaaag gagaagtgag 1861 ctgctgctgg atttgcacgg cctgcaaaga gaatgaatat gtgcaagatg agttcacctg 1921 caaagcttgt gacttgggat ggtggcccaa tgcagatcta acaggctgtg agcccattcc 1981 tgtgcgctat cttgagtgga gcaacatcga acccattata gccatcgcct tttcatgcct 2041 gggaatcctt gttaccttgt ttgtcaccct aatctttgta ctgtaccggg acacaccagt 2101 ggtcaaatcc tccagtcggg agctctgcta catcatccta gctggcatct tccttggtta 2161 tgtgtgccca ttcactctca ttgccaaacc tactaccacc tcctgctacc tccagcgcct 2221 cttggttggc ctctcctctg cgatgtgcta ctctgcttta gtgactaaaa ccaatcgtat 2281 tgcacgcatc ctggctggca gcaagaagaa gatctgcacc cggaagccca ggttcatgag 2341 tgcctgggct caggtgatca ttgcctcaat tctgattagt gtgcaactaa ccctggtggt 2401 aaccctgatc atcatggaac cccctatgcc cattctgtcc tacccaagta tcaaggaagt 2461 ctaccttatc tgcaatacca gcaacctggg tgtggtggcc cctttgggct acaatggact 2521 cctcatcatg agctgtacct actatgcctt caagacccgc aacgtgcccg ccaacttcaa 2581 cgaggccaaa tatatcgcgt tcaccatgta caccacctgt atcatctggc tagcttttgt 2641 gcccatttac tttgggagca actacaagat catcacaact tgctttgcag tgagtctcag 2701 tgtaacagtg gctctggggt gcatgttcac tcccaagatg tacatcatta ttgccaagcc 2761 tgagaggaat gtccgcagtg ccttcaccac ctctgatgtt gtccgcatgc atgttggcga 2821 tggcaagctg ccctgccgct ccaacacttt cctcaacatc ttccgaagaa agaaggcagg 2881 ggcagggaat gccaattcta atggcaagtc tgtgtcatgg tctgaaccag gtggaggaca 2941 ggtgcccaag ggacagcata tgtggcaccg cctctctgtg cacgtgaaga ccaatgagac 3001 ggcctgcaac caaacagccg tcatcaaacc cctcactaaa agttaccaag gctctggcaa 3061 gagcctgacc ttttcagata ccagcaccaa gaccctttac aacgtagagg aggaggagga 3121 tgcccagccg attcgcttta gcccgcctgg tagcccttcc atggtggtgc acaggcgcgt 3181 gccaagcgcg gcgaccactc cgcctctgcc gccccacctg accgcagagg agacccccct 3241 cttcctggcc gaaccagccc tccccaaggg cttgccccct cctctccagc agcagcagca 3301 accccctcca cagcagaaat cgctgatgga ccagctccag ggagtggtca gcaacttcag 3361 taccgcgatc ccggattttc acgcggtgct ggcaggcccc gggggtcccg ggaacgggct 3421 gcggtccctg tacccgcccc cgccacctcc gcagcacctg cagatgctgc cgctgcagct 3481 gagcaccttt ggggaggagc tggtctcccc gcccgcggac gacgacgacg acagcgagag 3541 gtttaagctc ctccaggagt acgtgtatga gcacgagcgg gaagggaaca ccgaagaaga 3601 cgaactggaa gaggaggagg aggacctgca ggcggccagc aaactgaccc cggatgattc 3661 gcctgcgctg acgcctccgt cgcctttccg cgactcggtg gcctcgggca gctcggtgcc 3721 cagctcccca gtgtccgagt cggtgctctg cacccctccc aacgtatcct acgcctctgt 3781 cattctgcgg gactacaagc aaagctcttc caccctgtaa gggggaaggg tccacataga 3841 aaagcaagac aagccagaga tctcccacac ctccagagat gtgcaaacag ctgggaggaa 3901 aagcctggga gtggggggcc tcgtcgggag gacaggagac cgctgctgct gctgccgcta 3961 ctgctgctgc tgccttaagt aggaagagag ggaaggacac caagcaaaaa atgttcaggc 4021 caggattcgg attcttgaat tactcgaagc cttctctggg aagaaaggga attc // LOCUS HSU31248 2264 bp mRNA PRI 25-JUL-1996 DEFINITION Human zinc finger protein (ZNF174) mRNA, complete cds. ACCESSION U31248 NID g1045453 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2264) AUTHORS Williams,A.J., Khachigian,L.M., Shows,T. and Collins,T. TITLE Isolation and characterization of a novel zinc-finger protein with transcription repressor activity JOURNAL J. Biol. Chem. 270 (38), 22143-22152 (1995) MEDLINE 95403401 REFERENCE 2 (bases 1 to 2264) AUTHORS Williams,A.J., Khachigian,L.M., Shows,T. and Collins,T. TITLE Direct Submission JOURNAL Submitted (07-JUL-1995) Amy J. Williams, Pathology, Brigham & Women's Hospital, 221 Longwood Avenue, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..2264 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="10-12 week fetus" /chromosome="16" /map="16p13.3" gene 586..1809 /gene="ZNF174" CDS 586..1809 /gene="ZNF174" /codon_start=1 /product="zinc finger protein" /db_xref="PID:g1045454" /translation="MAAKMEITLSSNTEASSKQERHIIAKLEEKRGPPLQKNCPDPEL CRQSFRRFCYQEVSGPQEALSQLRQLCRQWLQPELHTKEQILELLVMEQFLTILPPEI QARVRHRCPMSSKEIVTLVEDFHRASKKPKQWVAVCMQGQKVLLEKTGSQLGEQELPD FQPQTPRRDLRESSPAEPSQAGAYDRLSPHHWEKSPLLQEPTPKLAGTEAPRMRSDNK ENPQQEGAKGAKPCAVSAGRSKGNGLQNPEPRGANMSEPRLSRRQVSSPNAQKPFAHY QRHCRVEYISSPLKSHPLRELKKSKGGKRSLSNRLQHLGHQPTRSAKKPYKCDDCGKS FTWNSELKRHKRVHTGERPYTCGECGNCFGRQSTLKLHQRIHTGEKPYQCGQCGKSFR QSSNLHQHHRLHHGD" misc_feature 1549..1795 /gene="ZNF174" /note="encodes zinc finger domain" polyA_signal 2230..2235 BASE COUNT 632 a 590 c 574 g 468 t ORIGIN 1 ccgccctgga ggccggagcc actgggcctg cgcgcctcgg cagcgagcag ccgctttgct 61 cgcgtgcagg aggctgttcg ctacctcaca cccccggctg gcgctgtggc ctcgcttagc 121 tctaccgttt agcacccggc gacatgcacc cggtcggttt ccgccaggat gcgggagagt 181 tggggcaagc tacctgcgac agcttgaact tttcctaggg attccgctcc acccgccggt 241 tagagcgtat tgctcattaa atccgagacc tgtgtgcttg ctactgaaat aaaagaagta 301 ttttttgccc cagagtcctt ttaggacgtt atgacttttc tcctttgcaa gactgcaaaa 361 aactgacaag aagacaaaac tcttcccact cccagggacc gcgttgtctt gagtttggct 421 agtaaaggct agcaagtgag gctttgtctc tgcatcccgt tccccctaac atcctcagag 481 aaccttcgtt tctagaatct ttctagtatt cagagacttc tccagggtca tgatcccaaa 541 ggcttaaccc gtttacaagg agagagttgt ctcctgacgc ccaaaatggc agctaaaatg 601 gagataactt taagctccaa cactgaagct tcctccaagc aagagagaca cataatagcc 661 aaactagaag agaaacgggg ccctcctctg caaaaaaact gcccagatcc tgagctctgc 721 cgccagagct tcagacgctt ttgttatcaa gaggtgtctg gaccccaaga ggcgctctcc 781 cagctccgac agctctgccg tcagtggttg caacccgagc tgcacaccaa ggagcagatt 841 ttggagcttc tggtgatgga gcagttcctg accatcctgc ccccggagat ccaggctcgg 901 gtcaggcatc gatgtccaat gagcagcaag gagattgtga ccctcgtgga agattttcac 961 agagcatcca agaaaccaaa gcagtgggtg gccgtttgta tgcaggggca aaaggtgctc 1021 ttggagaaaa ctggatctca gcttggagaa caggaactgc cagactttca accgcagact 1081 cctaggagag atctcaggga gagctctcca gcagagcctt cccaggcagg agcttatgac 1141 cggctgagcc cccatcattg ggagaaatcc ccactcctcc aagaaccaac ccccaaattg 1201 gctgggacag aggcccccag aatgagaagt gacaacaagg aaaatccaca acaggaaggg 1261 gctaaaggag caaagccatg tgcagtgtca gctggcagat ccaaagggaa tggtctgcag 1321 aatcctgaac caagaggggc aaatatgagt gaacctcggt tgtcacggag gcaggtcagc 1381 tccccaaatg ctcaaaagcc atttgctcac taccagagac attgcagggt ggaatacatc 1441 agcagccccc taaaaagcca cccactgaga gagctaaaga aaagcaaagg aggtaaacgg 1501 agtctgagca accgtttgca acatcttggt caccagccca cccgctcagc aaagaaaccc 1561 tacaaatgtg atgactgtgg gaaaagcttc acgtggaatt cagagctgaa gagacacaag 1621 agagtccaca caggagagag accctacacg tgcggagagt gtggaaactg ctttgggcgg 1681 cagtcaaccc tgaagctgca ccagaggatc cacactggag agaagccata ccagtgtggc 1741 cagtgtggga aaagctttcg ccagagctca aaccttcacc agcatcaccg acttcaccat 1801 ggggactaaa aggagcactc catgctttag attcacacgg aaggtgtttg tgtttctcct 1861 cccccttgct tgcatgtaaa tcacaaaaac tgtgtgactt acaaggaaag cacgaggccc 1921 ttgaggaatg atgatgcaca ttctgctgtg aggaggccca gaaaaggcca accagggccc 1981 aaccgtgcat gatgacaggg tgcagagaag gcaggtctgg gcactggggc aaagagaact 2041 taagtctctg cagagagcaa ggagtaacta ctgagagaga atcaggacaa tcctgcaggt 2101 ggcccgctta ctgttaaatc gtccctctgt tgctttatcc tctaaaatat gttaagggat 2161 aaattctata tatatagtta tgcatttgct gtcataccag acaatttttt atcatgagca 2221 cacttcttta ataaagatga gtgatctacc agaaaaaaaa aaaa // LOCUS HSU31278 1390 bp mRNA PRI 22-AUG-1995 DEFINITION Human mitotic feedback control protein Madp2 homolog mRNA, complete cds. ACCESSION U31278 NID g950198 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1390) AUTHORS Jin,D.-Y. and Jeang,K.-T. TITLE Direct Submission JOURNAL Submitted (09-JUL-1995) Dong-Yan Jin, Molecular Virology Section, Laboratory of Molecular Microbiology, National Institute of Allergy and Infectious Diseases, 9000 Rockville Pike, Bethesda, MD 20892-0460, USA FEATURES Location/Qualifiers source 1..1390 /organism="Homo sapiens" /note="ATCC #CCL 2.2'" /db_xref="taxon:9606" /cell_line="HeLa S3" /clone_lib="Clontech catalog #HL4000A1; lot #39042a'" CDS 75..692 /note="CP10625; Method: conceptual translation supplied by author" /codon_start=1 /product="mitotic feedback control protein Madp2 homolog" /db_xref="PID:g950199" /translation="MALQLSREQGITLRGSAEIVAEFFSFGINSILYQRGIYPSETFT RVQKYGLTLLVTTDLELIKYLNNVVEQLKDWLYKCSVQKLVVVISNIESGEVLERWQF DIECDKTAKDDSAPREKSQKAIQDEIRSVIRQITATVTFLPLLEVSCSFDLLIYTDKD LVVPEKWEESGPQFITNSEEVRLRSFTTTIHKVNSMVAYKIPVND" BASE COUNT 427 a 221 c 290 g 452 t ORIGIN 1 gggaagtgct gttggagccg ctgtggttgc tgtccgcgga gtggaagcgc gtgcttttgt 61 ttgtgtccct ggccatggcg ctgcagctct cccgggagca gggaatcacc ctgcgcggga 121 gcgccgaaat cgtggccgag ttcttctcat tcggcatcaa cagcatttta tatcagcgtg 181 gcatatatcc atctgaaacc tttactcgag tgcagaaata cggactcacc ttgcttgtaa 241 ctactgatct tgagctcata aaatacctaa ataatgtggt ggaacaactg aaagattggt 301 tatacaagtg ttcagttcag aaactggttg tagttatctc aaatattgaa agtggtgagg 361 tcctggaaag atggcagttt gatattgagt gtgacaagac tgcaaaagat gacagtgcac 421 ccagagaaaa gtctcagaaa gctatccagg atgaaatccg ttcagtgatc agacagatca 481 cagctacggt gacatttctg ccactgttgg aagtttcttg ttcatttgat ctgctgattt 541 atacagacaa agatttggtt gtacctgaaa aatgggaaga gtcgggacca cagtttatta 601 ccaattctga ggaagtccgc cttcgttcat ttactactac aatccacaaa gtaaatagca 661 tggtggccta caaaattcct gtcaatgact gaggatgaca tgaggaaaat aatgtaattg 721 taattttgaa atgtggtttt cctgaaatca ggtcatctat agttgatatg ttttatttca 781 ttggttaatt tttacatgga gaaaaccaaa atgatactta ctgaactgtg tgtaattgtt 841 cctttatttt tttggtacct atttgactta ccatggagtt aacatcatga atttattgca 901 cattgttcaa aaggaaccag gaggtttttt tgtcaacatt gtgatgtata ttcctttgaa 961 gatagtaact gtagatggaa aaacttgtgc tataaagcta gatgctttcc taaatcagat 1021 gttttggtca agtagtttga ctcagtatag gtagggagat atttaagtat aaaatacaac 1081 aaaggaagtc taaatattca gaatctttgt taaggtcctg aaagtaactc ataatctata 1141 aacaatgaaa tattgctgta tagctccttt tgaccttcat ttcatgtata gttttcccta 1201 ttgaatcagt ttccaattat ttgactttaa tttatgtaac ttgaacctat gaagcaatgg 1261 atatttgtac tgtttaatgt tctgtgatac agaactctta aaaatgtttt ttcatgtgtt 1321 ttataaaatc aagttttaag tgaaagtgag gaaataaagt taagtttgtt ttaaaaaaaa 1381 aaaaaaaaaa // LOCUS HSU31382 670 bp mRNA PRI 26-SEP-1995 DEFINITION Human G protein gamma-4 subunit mRNA, complete cds. ACCESSION U31382 NID g995916 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 670) AUTHORS Ray,K., Kunsch,C., Bonner,L.M. and Robishaw,J.D. TITLE Isolation of cDNA clones encoding eight different human G protein gamma subunits, including three novel forms designated the gamma 4, gamma 10, and gamma 11 subunits JOURNAL J. Biol. Chem. 270 (37), 21765-21771 (1995) MEDLINE 95394940 REFERENCE 2 (bases 1 to 670) AUTHORS Kunsch,C. TITLE Direct Submission JOURNAL Submitted (11-JUL-1995) Charles Kunsch, Molecular Biology, Human Genome Sciences, Inc., 9410 Key West Ave., Rockville, MD 20850, USA FEATURES Location/Qualifiers source 1..670 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 99..326 /codon_start=1 /product="G protein gamma-4 subunit" /db_xref="PID:g995917" /translation="MKEGMSNNSTTSISQARKAVEQLKMEACMDRVKVSQAAADLLAY CEAHVREDPLIIPVPASENPFREKKFFCTIL" BASE COUNT 177 a 165 c 140 g 188 t ORIGIN 1 ggcacgagct catctgacga ctgacagctg atggcaccgc cagcctctgt cccttggcca 61 ggactgtcac acggctgact ctcagcaggg gcagtagaat gaaagagggc atgtctaata 121 acagcaccac tagcatctcc caagccagga aagctgtgga gcagctaaag atggaagcct 181 gtatggacag ggtcaaggtc tcccaggcag ccgcggacct cctggcctac tgtgaagctc 241 acgtgcggga agatcctctc atcattccag tgcctgcatc agaaaacccc tttcgcgaga 301 agaagttctt ttgtaccatt ctctaactcc gtgtgtgatg aaaacgcctc cttttctgac 361 cttcaaagtc ccctgtagag accatgcatg ctctaagcct tagggagtga gaccaacacc 421 catccctgcc cagccaacag tggccggggc ttgtcttatg tttccatctg ttttcttcgt 481 ggcattcaat ttcatttttt tccttttcat tttcatgtta ttttcattat tggcaaagaa 541 aatcaaaatg tttatagcca aataacaaat gtgccatgta aaagtaagtc tggacttaag 601 agtttaaaat ttttaaacat cagtttccaa gtttatatca tattaataca tttcagtgga 661 taatttattt // LOCUS HSU31383 1194 bp mRNA PRI 26-SEP-1995 DEFINITION Human G protein gamma-10 subunit mRNA, complete cds. ACCESSION U31383 NID g995918 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1194) AUTHORS Ray,K., Kunsch,C., Bonner,L.M. and Robishaw,J.D. TITLE Isolation of cDNA clones encoding eight different human G protein gamma subunits, including three novel forms designated the gamma 4, gamma 10, and gamma 11 subunits JOURNAL J. Biol. Chem. 270 (37), 21765-21771 (1995) MEDLINE 95394940 REFERENCE 2 (bases 1 to 1194) AUTHORS Kunsch,C. TITLE Direct Submission JOURNAL Submitted (11-JUL-1995) Charles Kunsch, Molecular Biology, Human Genome Sciences, Inc., 9410 Key West Ave., Rockville, MD 20850, USA FEATURES Location/Qualifiers source 1..1194 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 24..230 /codon_start=1 /product="G protein gamma-10 subunit" /db_xref="PID:g995919" /translation="MSSGASASALQRLVEQLKLEAGVERIKVSQAAAELQQYCMQNAC KDALLVGVPAGSNPFREPRSCALL" BASE COUNT 360 a 210 c 237 g 387 t ORIGIN 1 ggcacgagcc cagcgccgcc gccatgtcct ccggggctag cgcgagcgcc ctgcagcgct 61 tggtagagca gctcaagttg gaggctggcg tggagaggat caaggtctct caggcagctg 121 cagagcttca acagtactgt atgcagaatg cctgcaagga tgccctgctg gtgggtgttc 181 cagctggaag taaccccttc cgggagccta gatcctgtgc tttactctga agactctagg 241 agagaagttt gctgaggaat gccttcaagc acaaagtgat gaatgactgc cttcaagtct 301 caagaaaaca cttttcccta acttttagag atatttcagc cctttcctgt ggcctggtcc 361 tatagccaaa atcacagata ttcatgagtt tctacttgag tgagaaaact gggtgaagga 421 atagaatttt aaatagtaat aactgcttgt tttttgtgtg caagtacttt tatacataag 481 ataaacaaaa accttaccac caaacatacc aaaatgcacc tctttcataa gtgagttact 541 aagatttcta tacctggaat atcatgtatg tttcatttac tggatgttta cattttagga 601 aggaaaatag ttttgtttat ttaaacaact gaatacttat aaactgttgt tcctggaagt 661 tatttattcc ataaaaaatt tgttcttttc tcatgaattt ataattccta aatgaagacc 721 agaaagtaca aattgctggg aggaagaata ggctttatta atcaactgat gtcttgattt 781 ttctaaatgg gaagattgct ttatttttaa cactaattat gggagcagat tcttaccaaa 841 cttctttgga aaagttaatg ttatgatgtg cattaggctg ccccatcgtg tatataaatg 901 aaggcagatt tgatttttgt attcttacgt ttactctgct ttgtagttgt ggctgtactt 961 aaagcaatac agaatttcat atatttaaaa atgtttaaaa tgtgacccac agaacattgt 1021 aaatgattaa aaactaacat gaaaatatta caacctaaaa gaattcttaa cttcacaagt 1081 gttttacttc gacgatgtgc ctttgattta atttgggaca cttttttaga aggatacatt 1141 attcgtgttt gcaacggtct ttgaagagct tggaaataaa atttctgctt aatt // LOCUS HSU31384 622 bp mRNA PRI 26-SEP-1995 DEFINITION Human G protein gamma-11 subunit mRNA, complete cds. ACCESSION U31384 NID g995920 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 622) AUTHORS Ray,K., Kunsch,C., Bonner,L.M. and Robishaw,J.D. TITLE Isolation of cDNA clones encoding eight different human G protein gamma subunits, including three novel forms designated the gamma 4, gamma 10, and gamma 11 subunits JOURNAL J. Biol. Chem. 270 (37), 21765-21771 (1995) MEDLINE 95394940 REFERENCE 2 (bases 1 to 622) AUTHORS Kunsch,C. TITLE Direct Submission JOURNAL Submitted (11-JUL-1995) Charles Kunsch, Molecular Biology, Human Genome Sciences, Inc., 9410 Key West Ave., Rockville, MD 20850, USA FEATURES Location/Qualifiers source 1..622 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 108..329 /codon_start=1 /product="G protein gamma-11 subunit" /db_xref="PID:g995921" /translation="MPALHIEDLPEKEKLKMEVEQLRKEVKLQRQQVSKCSEEIKNYI EERSGEDPLVKGIPEDKNPFKEKGSCVIS" BASE COUNT 199 a 116 c 142 g 165 t ORIGIN 1 ggcacgagct cgtgccggcc ttcagttgtt tcgggacgcg ccgagcttcg ccgctcttcc 61 agcggctccg ctgccagagc tagcccgagc ccggttctgg ggcgaaaatg cctgcccttc 121 acatcgaaga tttgccagag aaggaaaaac tgaaaatgga agttgagcag cttcgcaaag 181 aagtgaagtt gcagagacaa caagtgtcta aatgttctga agaaataaag aactatattg 241 aagaacgttc tggagaggat cctctagtaa agggaattcc agaagacaag aaccccttta 301 aagaaaaagg cagctgtgtt atttcataaa taacttggga gaaactgcat cctaagtgga 361 agaactagtt tgttttagtt ttcccagata aaaccaacat gctttttaag gaaggaagaa 421 tgaaattaaa aggagacttt cttaagcacc atatagatag ggttatgtat aaaagcatat 481 gtgctactca tctttgctca ctatgcagtc ttttttaaga gagcagagag tatcagatgt 541 acaattatgg aaataagaac attacttgag catgacactt ctttcagtat attgcttgat 601 gcttcaaata aagttttgtc tt // LOCUS HSU31449 1362 bp mRNA PRI 25-AUG-1995 DEFINITION Human intestinal and liver tetraspan membrane protein (il-TMP) mRNA, complete cds. ACCESSION U31449 NID g953238 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1362) AUTHORS Wice,B.M. and Gordon,J.I. TITLE A Tetraspan Membrane Glycoprotein Produced in the Human Intestinal Epithelium and Liver That Can Regulate Cell Density-dependent Proliferation JOURNAL J. of Biol. Chem. 270, 21907-21918 (1995) REFERENCE 2 (bases 1 to 1362) AUTHORS Wice,B.M. and Gordon,J.I. TITLE Direct Submission JOURNAL Submitted (11-JUL-1995) Burton M. Wice, Molecular Biology and Pharmacology, Washington University School of Medicine, 4566 Scott Ave., Saint Louis, MO 63110, USA FEATURES Location/Qualifiers source 1..1362 /organism="Homo sapiens" /note="Cloned from a cDNA library prepared from proliferating HT-29 human colon adenocarcinoma cells cultured in the absence of glucose, using inosine as the carbon source. These cells are committed to differentiate into intestinal enterocytes and goblet cells, but have not yet expressed the differentiated phenotype." /db_xref="taxon:9606" /tissue_type="intestine and liver" gene 166..774 /gene="il-TMP" CDS 166..774 /gene="il-TMP" /note="glycoprotein containing 4 transmembrane domains; novel member of tetraspan membrane superfamily" /codon_start=1 /product="tetraspan membrane protein" /db_xref="PID:g953239" /translation="MCTGGCARCLGGTLIPLAFFGFLANILLFFPGGKVIDDNDHLSQ EIWFFGGILGSGVLMIFPALVFLGLKNNDCCGCCGNEGCGKRFAMFTSTIFAVVGFLG AGYSFIISAISINKGPKCLMANSTWGYPFHDGDYLNDEALWNKCREPLNVVPWNLTLF SILLVVGGIQMVLCAIQVVNGLLGTLCGDCQCCGCCGGDGPV" misc_feature 385..402 /gene="il-TMP" /note="encodes CysCysGlyCysCysGly motif" misc_feature 535..537 /gene="il-TMP" /note="encodes potential N-glycosylation site, Asn" misc_feature 631..633 /gene="il-TMP" /note="encodes potential N-glycosylation site, Asn" misc_feature 739..756 /gene="il-TMP" /note="encodes CysCysGlyCysCysGly motif" BASE COUNT 347 a 279 c 304 g 432 t ORIGIN 1 agcaactcca aggacacagt tcacagaaat ttggttctca gccccaaaat actgattgaa 61 ttggagacaa ttacaaggac tctctggcca aaaacccttg aagaggcccc gtgaaggagg 121 cagtgaggag cttttgattg ctgacctgtg tcgtaccacc ccagaatgtg cactgggggc 181 tgtgccagat gcctgggggg gaccctcatt ccccttgctt tttttggctt cctggctaac 241 atcctgttat tttttcctgg aggaaaagtg atagatgaca acgaccacct ttcccaagag 301 atctggtttt tcggaggaat attaggaagc ggtgtcttga tgatcttccc tgcgctggtg 361 ttcttgggcc tgaagaacaa tgactgctgt gggtgctgcg gcaacgaggg ctgtgggaag 421 cgatttgcga tgttcacctc cacgatattt gctgtggttg gattcttggg agctggatac 481 tcgtttatca tctcagccat ttcaatcaac aagggtccta aatgcctcat ggccaatagt 541 acatggggct accccttcca cgacggggat tatctcaatg atgaggcctt atggaacaag 601 tgccgagagc ctctcaatgt ggttccctgg aatctgaccc tcttctccat cctgctggtc 661 gtaggaggaa tccagatggt tctctgcgcc atccaggtgg tcaatggcct cctggggacc 721 ctctgtgggg actgccagtg ttgtggctgc tgtgggggag atggacccgt ttaaacctcc 781 gagatgagct gctcagactc tacagcatga cgactacaat ttcttttcat aaaacttctt 841 ctcttcttgg aattattaat tcctatctgc ttcctagctg ataaagctta gaaaaggcag 901 ttattccttc tttccaacca gctttgctcg agttagaatt ttgttatttt caaataaaaa 961 atagtttggc cacttaacaa atttgattta taaatctttc aaattagttc ctttttagaa 1021 tttaccaaca ggttcaaagc atacttttca tgattttttt attacaaatg taaaatgtat 1081 aaagtcacat gtactgccat actacttctt tgtatataaa gatgtttata tctttggaag 1141 ttttacataa atcaaaggaa gaaagcacat ttaaaatgag aaactaagac caatttctgt 1201 ttttaagagg aaaaagaatg attgatgtat cctaagtatt gttatttgtt gtcttttttt 1261 gctgccttgc ttgagttgct tgtgactgat cttttgaggc tgtcatcatg gctagggttc 1321 ttttatgtat gttaaattaa aacctgaatt cagaggtaac gt // LOCUS HSU31501 2902 bp mRNA PRI 02-DEC-1995 DEFINITION Human fragile X mental retardation syndrome related protein (FXR2) mRNA, complete cds. ACCESSION U31501 NID g1098636 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2902) AUTHORS Zhang,Y., O'Connor,J.P., Siomi,M.C., Srinivasan,S., Dutra,A., Nussbaum,R.L. and Dreyfuss,G. TITLE The fragile X mental retardation syndrome protein interacts with novel homologs FXR1 and FXR2 JOURNAL EMBO J. 14 (21), 5358-5366 (1995) MEDLINE 96080171 REFERENCE 2 (bases 1 to 2902) AUTHORS Zhang,Y. and Dreyfuss,G. TITLE Direct Submission JOURNAL Submitted (12-JUL-1995) Yan Zhang, Biochemistry and Biophysics, Howard Hughes Medical Institute, University of Pennsylvania, Clinical Research Building, Room 330, 422 Curie Boulevard, Philadelphia, PA 19104-6148, USA FEATURES Location/Qualifiers source 1..2902 /organism="Homo sapiens" /db_xref="taxon:9606" /map="17p31.1" /chromosome="17" /tissue_type="brain" /dev_stage="fetus" gene 228..2249 /gene="FXR2" CDS 228..2249 /gene="FXR2" /codon_start=1 /product="fragile X mental retardation syndrome related protein" /db_xref="PID:g1098637" /translation="MGGLASGGDVEPGLPVEVRGSNGAFYKGFVKDVHEDSVTIFFEN NWQSERQIPFGDVRLPPPADYNKEITEGDEVEVYSRANEQEPCGWWLARVRMMKGDFY VIEYAACDATYNEIVTLERLRPVNPNPLATKGSFFKVTMAVPEDLREACSNENVHKEF KKALGANCIFLNITNSELFILSTTEAPVKRASLLGDMHFRSLRTKLLLMSRNEEATKH LETSKQLAAAFQEEFTVREDLMGLAIGTHGANIQQARKVPGVTAIELGEETCTFRIYG ETPEACRQARSYLEFSEDSVQVPRNLVGKVIGKNGKVIQEIVDKSGVVRVRVEGDNDK KNPREEGMVPFIFVGTRENISNAQALLEYHLSYLQEVEQLRLERLQIDEQLRQIGLGF RPPGSGRGSGGSDKAGYSTDESSSSSLHATRTYGGSYGGRGRGRRTGGPAYGPSSDVS TASETESEKREEPNRAGPGDRDPPTRGEESRRRPTGGRGRGPPPAPRPTSRYNSSSIS SVLKDPDSNPYSLLDTSEPEPPVDSEPGEPPPASARRRRSRRRRTDEDRTVMDGGLES DGPNMTENGLEDESRPQRRNRSRRRRNRGNRTDGSISGDRQPVTVADYISRAESQSRQ SAPLERTKPSEDSLSGQKGDSVSKLPKGPSENGELSAPLELGSMVNGVS" BASE COUNT 677 a 808 c 860 g 557 t ORIGIN 1 gaattccggt ggcggagacc aaggcggcgg cggcggacgg ggagcggccc ggccccggcc 61 ccctgctcgt tggctgtggc agggccgccg tggggccggc ccggctcccg ccccccgcgg 121 ctccccctcc ggctcctcct ccggggagac gccgggggcc tggcccggcc cgcactcaga 181 ctgctgctgc agccgccgcc gggggagtcg gaggcggtgg cggcgccatg ggcggcctgg 241 cctctggggg ggatgtggag ccgggactgc ccgtcgaggt gcgcggctcc aacggggcct 301 tctacaaggg ctttgtgaag gatgtccatg aagactctgt caccatcttc tttgaaaaca 361 actggcagag tgagagacaa attccttttg gggatgtccg gctaccacct ccagctgact 421 ataataagga gatcacagaa ggggatgaag tggaggttta ttctcgagcc aatgaacagg 481 aaccttgtgg ctggtggctg gcccgggtgc ggatgatgaa gggagatttc tatgtcattg 541 aatatgctgc ctgtgatgcc acctacaatg aaattgttac cctggagcga cttcggccag 601 ttaatcccaa tccccttgca accaaaggca gcttcttcaa ggttaccatg gctgtgcccg 661 aggatctgag agaagcctgc tccaatgaaa acgtccataa agagttcaag aaagccctgg 721 gagccaactg catctttctc aacatcacaa acagtgagct cttcattctg tcaaccacag 781 aagcccctgt gaagcgagca tctctgctgg gtgatatgca tttccgaagc ctgcgcacca 841 aactgctact tatgtcccgc aatgaagaag ctaccaagca cctagagaca agcaagcagt 901 tggcagcagc cttccaagag gagttcacag tgcgagagga cctgatggga ctggcaattg 961 ggactcacgg tgccaacatc cagcaggccc gaaaagtacc tggggtgacc gccattgagt 1021 tgggtgaaga gacctgcact ttccgcatct atggggagac tcccgaggct tgccgacagg 1081 cccgaagcta ccttgagttt tctgaggact cagtgcaagt gcccaggaac ctggttggca 1141 aagtgattgg aaagaacggg aaagtgatcc aggagattgt ggataaatct ggtgtggtga 1201 gggttcgagt ggaaggtgat aatgacaaga agaaccccag ggaggaggga atggttccct 1261 tcatttttgt tggcacccga gagaacatca gcaatgccca ggctttgctg gagtatcacc 1321 tctcctacct gcaggaggta gagcagcttc gcttggagag gctacaaatt gatgagcagc 1381 ttcggcagat tgggctgggc tttcgccctc ctgggagtgg gcggggcagc ggtggcagcg 1441 acaaggctgg atatagcact gatgagagct cctcctcctc cctccatgcg actcgaacct 1501 atgggggcag ctatgggggc cgtggccgtg gccggaggac aggcggtcct gcctatggcc 1561 ccagctcaga tgtgtctaca gcttcagaga ctgagtcaga gaagagagag gagcccaacc 1621 gagctgggcc tggcgacagg gatcccccaa cccgagggga agaaagccgg aggcggccga 1681 ctgggggccg gggtagggga cccccacctg ccccccggcc cacttcgaga tacaattctt 1741 catctattag ctcagtgctg aaggatccag acagtaatcc ctacagccta ttggacacgt 1801 ctgaaccaga gcccccggtt gattcagaac ctggggaacc ccccccagca agtgccaggc 1861 gccgccgctc ccgccgccgc cgcactgatg aagacaggac cgtcatggat ggaggcctgg 1921 aatcagatgg gcccaacatg acagagaatg gcctggaaga tgaatcaaga cctcaacgtc 1981 gtaatcgcag ccgccgccgc cgtaaccgtg gtaatcggac tgatggctct atcagtggag 2041 accgccagcc agtgactgtg gctgactata tctcacgagc agagtctcag agccgccaga 2101 gcgcacccct ggaacgcact aaaccctcag aagactctct ttcaggacag aagggtgact 2161 ctgtcagcaa gcttcctaag ggcccctcgg agaatgggga gctctccgcc cccttggagt 2221 tgggtagtat ggtgaatggg gtttcataaa acctccaacc tgcacccctc ccttctccat 2281 ctcgcttgct gcccaacacc atggccctca caggcccaac tgacctgcgc tggagctgct 2341 cttatctagg ggggaggggg gtggcacagc agcttgggta ccccccaacc tccaggagct 2401 agtggagggg tgtgtaacag ggtcataccc cctccctctt gtccacccta cccccagggt 2461 aaggggagcc tctctccttc cccatcagac tggatgtgcc tttatcctct aatgccccaa 2521 tctctctctg aacaccccca ttctccacct gttggtgggg ggtgctcctc gacccaccca 2581 gatttgaccg ttcagggggc ctcccctgct atccctcctc ccatcctgta ccccccattt 2641 ctggggcctc atcactgtgg aagacgggga tagtaagaga taagtgggtg ggaggcacgg 2701 ggaaggtttt ggagtagaac caggggtgtg tatgaagggg ggtgacaagg tccccctggg 2761 gaggggacca accttgtctg gtggatgaga aggcgtattt atttttcact gtacagtatt 2821 taaaaagaga ataaaaaaat ccaaatggca aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2881 aaaaaaaaaa aaaaaggaat tc // LOCUS HSU31520 3599 bp mRNA PRI 14-DEC-1995 DEFINITION Human alpha mannosidase II mRNA, complete cds. ACCESSION U31520 NID g1117826 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3599) AUTHORS Misago,M., Liao,Y.F., Kudo,S., Eto,S., Mattei,M.G., Moremen,K.W. and Fukuda,M.N. TITLE Molecular cloning and expression of cDNAs encoding human alpha-mannosidase II and a previously unrecognized alpha-mannosidase IIx isozyme JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (25), 11766-11770 (1995) MEDLINE 96102195 REFERENCE 2 (bases 1 to 3599) AUTHORS Moremen,K.W. and Robbins,P.W. TITLE Isolation, characterization, and expression of cDNAs encoding murine alpha-mannosidase II, a Golgi enzyme that controls conversion of high mannose to complex N-glycans JOURNAL J. Cell Biol. 115 (6), 1521-1534 (1991) MEDLINE 92098565 REFERENCE 3 (bases 1 to 3599) AUTHORS Moremen,K. W. TITLE Direct Submission JOURNAL Submitted (12-JUL-1995) Kelley W. Moremen, Biochemistry, University of Gerogia, Life Sciences Building, Athens, GA 30602, USA FEATURES Location/Qualifiers source 1..3599 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /clone="HM-4 and HM-1" /cell_line="HepG2" /map="5q21-22" /chromosome="5" CDS 4..3438 /EC_number="3.2.1.114" /note="mannosyl-oligosaccharide 1,3/1,6-alpha-mannosidase; Golgi transmembrane protein, Asn-linked oligosaccharide processing hydrolase" /codon_start=1 /product="alpha mannosidase II" /db_xref="PID:g1117827" /translation="MKLSRQFTVFGSAIFCVVIFSLYLMLDRGHLDYPRNPRREGSFP QGQLSMLQEKIDHLERLLAENNEIISNIRDSVINLSESVEDGPKSSQSNFSQGAGSHL LPSQLSLSVDTADCLFASQSGSHNSDVQMLDVYSLISFDNPDGGVWKQGFDITYESNE WDTEPLQVFVVPHSHNDPGWLKTFNDYFRDKTQYIFNNMVLKLKEDSRRKFIWSEISY LSKWWDIIDIQKKDAVKSLIENGQLEIVTGGWVMPDEATPHYFALIDQLIEGHQWLEN NIGVKPRSGWAIDPFGHSPTMAYLLNRAGLSHMLIQRVHYAVKKHFALHKTLEFFWRQ NWDLGSVTDILCHMMPFYSYDIPHTCGPDPKICCQFDFKRLPGGRFGCPWGVPPETIH PGNVQSRARMLLDQYRKKSKLFRTKVLLAPLGDDFRYCEYTEWDLQFKNYQQLFDYMN SQSKFKVKIQFGTLSDFFDALDKADETQRDKGQSMFPVLSGDFFTYADRDDHYWSGYF TSRPFYKRMDRIMESHLRAAEILYYFALRQAHKYKINKFLSSSLYTALTEARRNLGLF QHHDAITGTAKDWVVVDYGTRLFHSLMVLEKIIGNSAFLLIGKDKLTYDSYSPDTFLE MDLKQKSQDSLPQKNIIRLSAEPRYLVVYNPLEQDRISLVSVYVSSPTVQVFSASGKP VEVQVSAVWDTANTISETAYEISFRAHIPPLGLKVYKILESASSNSHLADYVLYKNKV EDSGIFTIKNMINTEEGITLENSFVLLRFDQTGLMKQMMTKEDGKHHEVNVQFSWYGT TIKRDKSGAYLFLPDGNAKPYVYTTPPFVRVTHGRIYSEVTCFFDHVTHRVRLYHIQG IEGQSVEVSNIVDIRKVYNREIAMKISSDIKSQNRFYTDLNGYQIQPRMTLSKLPLQA NVYPMTTMAYIQDAKHRLTLLSAQSLGVSSLNSGQIEVIMDRRLMQDDNRGLEQGIQD NKITANLFRILLEKRSAVNTEEEKKSVSYPSLLSHITSSLMNHPVIPMANKFSSPTLE LQGEFSPLQSSLPCDIHLVNLRTIQSKVGNGHSNEAALILHRKGFDCRFSSKGTGLFC STTQGKILVQKLLNKFIVESLTPSSLSLMHSPPGTQNISEINLSPMEISTFRIQLR" polyA_site 3599 /note="22 A nucleotides" BASE COUNT 1075 a 716 c 746 g 1062 t ORIGIN 1 aaaatgaagt taagccgcca gttcaccgtg ttcggcagtg cgatcttctg tgtggtgatt 61 ttctcgctct acctgatgct ggaccggggt cacttagact accccaggaa cccgcgccgc 121 gagggctcct tccctcaggg ccagctctca atgttgcaag aaaaaataga ccatttggag 181 cgtttgctag ctgagaataa tgagatcatc tcaaatatta gagactcagt catcaatttg 241 agtgagtctg tggaggatgg tccgaaaagt tcacaaagca atttcagcca aggtgctggc 301 tcacatcttc tgccctcaca attatccctc tcagttgaca ctgcagactg tctgtttgct 361 tcacaaagtg gaagtcacaa ttcagatgtg cagatgttgg atgtttacag tctaatttct 421 tttgacaatc cagatggtgg agtttggaag caaggatttg acattactta tgaatctaat 481 gaatgggaca ctgaacccct tcaagtcttt gtggtgcctc attcccataa cgacccaggt 541 tggttgaaga ctttcaatga ctactttaga gacaagactc agtatatttt taataacatg 601 gtcctaaagc tgaaagaaga ctcacggagg aagtttattt ggtctgagat ctcttacctt 661 tcaaagtggt gggatattat agatattcag aagaaggatg ctgttaaaag tttaatagaa 721 aatggtcagc ttgaaattgt gacaggtggc tgggttatgc ctgatgaagc tactccacat 781 tattttgcct taattgatca actaattgaa ggacatcagt ggctggaaaa taatatagga 841 gtgaaacctc ggtccggctg ggctattgat ccctttggac actcaccaac aatggcttat 901 cttctaaacc gtgctggact ttctcacatg cttatccaga gagttcatta tgcagttaaa 961 aaacactttg cactgcataa aacattggag tttttttgga gacagaattg ggatctggga 1021 tctgtcacag atattttatg ccacatgatg cccttctaca gctatgacat ccctcacact 1081 tgtggacctg atcctaaaat atgctgccag tttgatttta aacgtcttcc tggaggcaga 1141 tttggttgtc cctggggagt ccccccagaa acaatacatc ctggaaatgt ccaaagcagg 1201 gctcggatgc tactagatca gtaccgaaag aagtcaaagc tttttcgaac caaagttctc 1261 ctggctccac taggagatga tttccgctac tgtgaataca cggaatggga tttacagttt 1321 aagaattatc agcagctttt tgattatatg aattctcagt ccaagtttaa agttaagata 1381 cagtttggaa ctttatcaga tttttttgat gcgctggata aagcagatga aactcagaga 1441 gacaagggcc aatcgatgtt ccctgtttta agtggagatt ttttcactta tgccgatcga 1501 gatgatcatt actggagtgg ctattttaca tccagaccct tttacaaacg aatggacaga 1561 atcatggaat ctcatttaag ggctgctgaa attctttact atttcgccct gagacaagct 1621 cacaaataca agataaataa atttctctca tcatcacttt acacggcact gacagaagcc 1681 agaaggaatt tgggactgtt tcaacatcat gatgctatca caggaactgc aaaagactgg 1741 gtggttgtgg attatggtac cagacttttt cattcgttaa tggttttgga gaagataatt 1801 ggaaattctg catttcttct tattgggaag gacaaactca catacgactc ttactctcct 1861 gataccttcc tggagatgga tttgaaacaa aaatcacaag attctctgcc acaaaaaaat 1921 ataataaggc tgagtgcgga gccaaggtac cttgtggtct ataatccttt agaacaagac 1981 cgaatctcgt tggtctcagt ctatgtgagt tccccgacag tgcaagtgtt ctctgcttca 2041 ggaaaacctg tggaagttca agtcagcgca gtttgggata cagcaaatac tatttcagaa 2101 acagcctatg agatctcttt tcgagcacat ataccgccat tgggactgaa agtgtataag 2161 attttggaat cagcaagttc aaattcacat ttagctgatt atgtcttgta taagaataaa 2221 gtagaagata gcggaatttt caccataaag aatatgataa atactgaaga aggtataaca 2281 ctagagaact cctttgtttt acttcggttt gatcaaactg gacttatgaa gcaaatgatg 2341 actaaagaag atggtaaaca ccatgaagta aatgtgcaat tttcatggta tggaaccaca 2401 attaaaagag acaaaagtgg tgcctacctc ttcttacctg atggtaatgc caagccttat 2461 gtttacacaa caccgccctt tgtcagagtg acacatggaa ggatttattc ggaagtgact 2521 tgcttttttg accatgttac tcatagagtc cgactatacc acatacaggg aatagaagga 2581 cagtctgtgg aagtttccaa tattgtggac atccgaaaag tatataaccg tgagattgca 2641 atgaaaattt cttctgatat aaaaagccaa aatagatttt atactgacct aaatgggtac 2701 cagattcaac ctagaatgac actgagcaaa ttgcctcttc aagcaaatgt ctatcccatg 2761 accacaatgg cctatatcca ggatgccaaa catcgtttga cactgctctc tgctcagtca 2821 ttaggggttt cgagtttgaa tagtggtcag attgaagtta tcatggatcg aagactcatg 2881 caagatgata atcgtggcct tgagcaaggt atccaggata acaagattac agctaatcta 2941 tttcgaatac tactagaaaa aagaagtgct gttaatacgg aagaagaaaa gaagtcggtc 3001 agttatcctt ctctccttag ccacataact tcttctctca tgaatcatcc agtcattcca 3061 atggcaaata agttctcctc acctaccctt gagctgcaag gtgaattctc tccattacag 3121 tcatctttgc cttgtgacat tcatctggtt aatttgagaa caatacagtc aaaggtgggc 3181 aatgggcact ccaatgaggc agccttgatc ctccacagaa aagggtttga ttgtcggttc 3241 tctagcaaag gcacagggct gttttgttct actactcagg gaaagatatt ggtacagaaa 3301 cttttaaaca agtttattgt cgaaagtctc acaccttcat cactatcctt gatgcattca 3361 cctcccggca ctcagaatat aagtgagatc aacttgagtc caatggaaat cagcacattc 3421 cgaatccagt tgaggtgaac ctgactttca catttggatt gagaatcatt ggcttttata 3481 cctttcttgg tttgacgtgc aataaagaag cacattattt tagcttctgg ctactgtgag 3541 aacatgaatt ctgtgattct gtgggttttt tctttttttc ttttaccagt acagtaaga // LOCUS HSU31628 1610 bp mRNA PRI 19-DEC-1995 DEFINITION Human interleukin-15 receptor alpha chain precursor (IL15RA) mRNA, complete cds. ACCESSION U31628 NID g1125055 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1610) AUTHORS Anderson,D.M., Kumaki,S., Ahdieh,M., Bertles,J., Tometsko,M., Loomis,A., Giri,J., Copeland,N.G., Gilbert,D.J., Jenkins,N.A., Valentine,V., Shapiro,D.N., Morris,S.W., Park,L.S. and Cosman,D. TITLE Functional characterization of the human interleukin-15 receptor alpha chain and close linkage of IL15RA and IL2RA genes JOURNAL J. Biol. Chem. 270 (50), 29862-29869 (1995) MEDLINE 96102040 REFERENCE 2 (bases 1 to 1610) AUTHORS Anderson,D.M. TITLE Direct Submission JOURNAL Submitted (14-JUL-1995) Dirk M. Anderson, Molecular Biology, Immunex Corporation, 51 University St., Seattle, WA 98101, USA FEATURES Location/Qualifiers source 1..1610 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="WI-26 VA4" /tissue_type="stromal bone marrow" /chromosome="10" /map="10p15-p14" exon 1..171 /gene="IL15RA" /number=1 gene 1..1610 /gene="IL15RA" CDS 83..886 /gene="IL15RA" /note="cell surface receptor" /codon_start=1 /product="interleukin-15 receptor alpha chain precursor" /db_xref="PID:g1125056" /translation="MAPRRARGCRTLGLPALLLLLLLRPPATRGITCPPPMSVEHADI WVKSYSLYSRERYICNSGFKRKAGTSSLTECVLNKATNVAHWTTPSLKCIRDPALVHQ RPAPPSTVTTAGVTPQPESLSPSGKEPAASSPSSNNTAATTAAIVPGSQLMPSKSPST GTTEISSHESSHGTPSQTTAKNWELTASASHQPPGVYPQGHSDTTVAISTSTVLLCGL SAVSLLACYLKSRQTPPLASVEMEAMEALPVTWGTSSRDEDLENCSHHL" sig_peptide 83..172 /gene="IL15RA" exon 172..365 /gene="IL15RA" /number=2 misc_feature 173..697 /gene="IL15RA" /note="encodes extracellular domain" mat_peptide 173..883 /gene="IL15RA" /product="interleukin-15 receptor alpha chain" exon 366..464 /gene="IL15RA" /number=3 exon 465..665 /gene="IL15RA" /number=4 misc_feature 491..499 /gene="IL15RA" /note="encodes N-linked glycosylation site" exon 666..698 /gene="IL15RA" /number=5 misc_feature 698..760 /gene="IL15RA" /note="encodes transmembrane domain" exon 699..774 /gene="IL15RA" /number=6 misc_feature 761..883 /gene="IL15RA" /note="encodes cytoplasmic domain" exon 775..1610 /gene="IL15RA" /number=7 polyA_signal 1576..1581 /gene="IL15RA" polyA_site 1610 /gene="IL15RA" /note="30 A nucleotides" BASE COUNT 381 a 501 c 403 g 325 t ORIGIN 1 cccagagcag cgctcgccac ctccccccgg cctgggcagc gctcgcccgg ggagtccagc 61 ggtgtcctgt ggagctgccg ccatggcccc gcggcgggcg cgcggctgcc ggaccctcgg 121 tctcccggcg ctgctactgc tgctgctgct ccggccgccg gcgacgcggg gcatcacgtg 181 ccctcccccc atgtccgtgg aacacgcaga catctgggtc aagagctaca gcttgtactc 241 cagggagcgg tacatttgta actctggttt caagcgtaaa gccggcacgt ccagcctgac 301 ggagtgcgtg ttgaacaagg ccacgaatgt cgcccactgg acaaccccca gtctcaaatg 361 cattagagac cctgccctgg ttcaccaaag gccagcgcca ccctccacag taacgacggc 421 aggggtgacc ccacagccag agagcctctc cccttctgga aaagagcccg cagcttcatc 481 tcccagctca aacaacacag cggccacaac agcagctatt gtcccgggct cccagctgat 541 gccttcaaaa tcaccttcca caggaaccac agagataagc agtcatgagt cctcccacgg 601 caccccctct cagacaacag ccaagaactg ggaactcaca gcatccgcct cccaccagcc 661 gccaggtgtg tatccacagg gccacagcga caccactgtg gctatctcca cgtccactgt 721 cctgctgtgt gggctgagcg ctgtgtctct cctggcatgc tacctcaagt caaggcaaac 781 tcccccgctg gccagcgttg aaatggaagc catggaggct ctgccggtga cttgggggac 841 cagcagcaga gatgaagact tggaaaactg ctctcaccac ctatgaaact cggggaaacc 901 agcccagcta agtccggagt gaaggagcct ctctgcttta gctaaagacg actgagaaga 961 ggtgcaagga agcgggctcc aggagcaagc tcaccaggcc tctcagaagt cccagcagga 1021 tctcacggac tgccgggtcg gcgcctcctg cgcgagggag caggttctcc gcattcccat 1081 gggcaccacc tgcctgcctg tcgtgccttg gacccagggc ccagcttccc aggagagacc 1141 aaaggcttct gagcaggatt tttatttcat tacagtgtga gctgcctgga atacatgtgg 1201 taatgaaata aaaaccctgc cccgaatctt ccgtccctca tcctaacttg cagttcacag 1261 agaaaagtga catacccaaa gctctctgtc aattacaagg cttctcctgg cgtgggagac 1321 gtctacaggg aagacaccag cgtttgggct tctaaccacc ctgtctccag ctgctctgca 1381 cacatggaca gggacctggg aaaggtggga gagatgctga gcccagcgaa tcctctccat 1441 tgaaggattc aggaagaaga aaactcaact cagtgccatt ttacgaatat atgcgtttat 1501 atttatactt ccttgtctat tatatctata cattatatat tatttgtatt ttgacattgt 1561 accttgtata aacaaaataa aacatctatt ttcaatattt ttaaaatgca // LOCUS HSU31659 2510 bp mRNA PRI 27-DEC-1995 DEFINITION Human TBP-associated factor TAFII80 mRNA, complete cds. ACCESSION U31659 NID g1136305 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2510) AUTHORS Hisatake,K., Ohta,T., Takada,R., Guermah,M., Horikoshi,M., Nakatani,Y. and Roeder,R.G. TITLE Evolutionary conservation of human TATA-binding-polypeptide-associated factors TAFII31 and TAFII80 and interactions of TAFII80 with other TAFs and with general transcription factors JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (18), 8195-8199 (1995) MEDLINE 95396764 REFERENCE 2 (bases 1 to 2510) AUTHORS Hisatake,K., Ohta,T., Takada,R., Guermah,M., Horikoshi,M., Nakatani,Y. and Roeder,G.R. TITLE Direct Submission JOURNAL Submitted (16-JUL-1995) Tsutomu Ohta, Laboratory of Biochemistry and Molecular Biology, The Rockefeller University, 1230 York Avenue, New York, NY 10021, USA FEATURES Location/Qualifiers source 1..2510 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" CDS 309..2342 /note="a subunit of TFIID" /codon_start=1 /product="TAFII80" /db_xref="PID:g1136306" /translation="MAEEKKLKLSNTVLPSESMKVVAESMGIAQIQEETCQLLTDEVS YRIKEIAQDALKFMHMGKRQKLTTSDIDYALKLKNVEPLYGFHAQEFIPFRFASGGGR ELYFYEEKEVDLSDIINTPLPRVPLDVCLKAHWLSIEGCQPAIPENPPPAPKEQQKAE ATEPLKSAKPGQEEDGPLKGKGQGATTADGKGKEKKAPPLLEGAPLRLKPRSIHELSV EQQLYYKEITEACVGSCEAKRAEALQSIATDPGLYQMLPRFSTFISEGVRVNVVQNNL ALLIYLMRMVKALMDNPTLYLEKYVHELIPAVMTCIVSRQLCLRPDVDNHWALRDFAA RLVAQICKHFSTTTNNIQSRITKTFTKSWVDEKTPWTTRYGSIAGLAELGHDVIKTLI LPRLQQEGERIRSVLDGPVLSNIDRIGADHVQSLLLKHCAPVLAKLRPPPDNQDAYRA EFGSLGPLLCSQVVKARAQAALQAQQVNRTTLTITQPRPTLTLSQAPQPGPRTPGLLK VPGSIALPVQTLVSARAAAPPQPSPPPTKFIVMSSSSSAPSTQQVLSLSTSAPGSGST TTSPVTTTVPSVQPIVKLVSTATTAPPSTAPSGPGSVQKYIVVSLPPTGEGKGGPTSH PSPVPPPASSPSPLSGSALCGGKQEAGDSPPPAPGTPKANGSQPNSGSPQPAP" polyA_site 2510 /note="20 A nucleotides" BASE COUNT 521 a 832 c 657 g 500 t ORIGIN 1 gaattcctga cctcaagtga tccgcccgcc tcgtcctccc aaggtgctgg tgggattaca 61 ggcgtgagcc actgaacctg gccccagccc agatttgtat ctttctctgt tacacctcag 121 ggctgactca cgtggggatg gacataagac actcggagtt cctcccctgt ctccactgtc 181 ctccctttgt tggccctctc cctgtgtctc acgtttcttt ttcttctgcc tgccccgttc 241 ctctgccagg gtctccgtct ctccaccggg ggcttcatcc ttccagggag gagaagaggg 301 actccagaat ggctgaggag aagaagctga agcttagcaa cactgtgctg ccctcggagt 361 ccatgaaggt ggtggctgaa tccatgggca tcgcccagat tcaggaggag acctgccagc 421 tgctaacgga tgaggtcagc taccgcatca aagagatcgc acaggatgcc ttgaagttca 481 tgcacatggg gaagcggcag aagctcacca ccagtgacat tgactacgcc ttgaagctaa 541 agaatgtcga gccactctat ggcttccacg cccaggagtt cattcctttc cgcttcgcct 601 ctggtggggg ccgggagctt tacttctatg aggagaagga ggttgatctg agcgacatca 661 tcaatacccc tctgccccgg gtgcccctgg acgtctgcct caaagctcat tggctgagca 721 tcgagggctg ccagccagct atccccgaga acccgccccc agctcccaaa gagcaacaga 781 aggctgaagc cacagaaccc ctgaagtcag ccaagccagg ccaggaggaa gacggacccc 841 tgaagggcaa aggtcaaggg gccaccacag ccgacggcaa agggaaagag aagaaggcgc 901 cgcccttgct ggagggggcc cccttgcgac tgaagccccg gagcatccac gagttgtctg 961 tggagcagca gctctactac aaggagatca ccgaggcctg cgtgggctcc tgcgaggcca 1021 agagggcgga agccctgcaa agcattgcca cggaccctgg actgtatcag atgctgccac 1081 ggttcagtac ctttatctcg gagggggtcc gtgtgaacgt ggttcagaac aacctggccc 1141 tactcatcta cctgatgcgt atggtgaaag cgctgatgga caaccccacg ctctatctag 1201 aaaaatacgt ccatgagctg attccagctg tgatgacctg catcgtgagc agacagttgt 1261 gcctgcgacc agatgtggac aatcactggg cactccgaga ctttgctgcc cgcctggtgg 1321 cccagatctg caagcatttt agcacaacca ctaacaacat ccagtcccgg atcaccaaga 1381 ccttcaccaa gagctgggtg gacgagaaga cgccctggac gactcgttat ggctccatcg 1441 caggcttggc tgagctggga cacgatgtta tcaagactct gattctgccc cggctgcagc 1501 aggaagggga gcggatccgc agtgtgctgg acggccctgt gctgagcaac attgaccgga 1561 ttggagcaga ccatgtgcag agcctcctgc tgaaacactg tgctcctgtt ctggcaaagc 1621 tgcgcccacc gcctgacaat caggacgcct atcgggcaga attcgggtcc cttgggcccc 1681 tcctctgctc ccaggtggtc aaggctcggg cccaggctgc tctgcaggct cagcaggtca 1741 acaggaccac tctgaccatc acgcagcccc ggcccacgct gaccctctcg caggccccac 1801 agcctggccc tcgcacccct ggcttgctga aggttcctgg ctccatcgca cttcctgtcc 1861 agacactggt gtctgcacga gcggctgccc caccacagcc ttcccctcct ccaaccaagt 1921 ttattgtaat gtcatcgtcc tccagcgccc catccaccca gcaggtcctg tccctcagca 1981 cctcggcccc cggctcaggt tccaccacca cttcgcccgt caccaccacc gtccccagcg 2041 tgcagcccat cgtcaagttg gtctccaccg ccaccaccgc accccccagc actgctccct 2101 ctggtcctgg gagtgtccag aagtacatcg tggtctcact tcccccaaca ggggagggca 2161 aaggaggccc cacctcccat ccttctccag ttcctccccc ggcatcgtcc ccgtccccac 2221 tcagcggcag tgccctttgt ggggggaagc aggaggctgg ggacagtccc cctccagctc 2281 cagggactcc aaaagccaat ggctcccagc ccaactccgg ctcccctcag cctgctccgt 2341 gatgctccac ctgccagccc ccggattccc acacatgcag acatgtacac acgtgcacgt 2401 acacacatgc atgctcgcta agcggaagga agttgtagat tgcttccttc atgtcacttt 2461 ctttttagat attgtacagc cagtttctca gaataaaagt ttggtttgta // LOCUS HSU31814 1985 bp mRNA PRI 14-NOV-1996 DEFINITION Human transcriptional regulator homolog RPD3 mRNA, complete cds. ACCESSION U31814 NID g1667393 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1985) AUTHORS Yang,W.M., Inouye,C., Zeng,Y., Bearss,D. and Seto,E. TITLE Transcriptional repression by YY1 is mediated by interaction with a mammalian homolog of the yeast global regulator RPD3 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (23), 12845-12850 (1996) MEDLINE 97075080 REFERENCE 2 (bases 1 to 1985) AUTHORS Seto,E. TITLE Direct Submission JOURNAL Submitted (17-JUL-1995) Edward Seto, Molecular Medicine, University of Texas Health Science Center at San Antonio, 15355 Lambda Drive, San Antonio, TX 78245, USA FEATURES Location/Qualifiers source 1..1985 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="HeLa" CDS 205..1671 /note="similar to yeast RPD3, encoded by GenBank Accession Number X78454" /codon_start=1 /product="transcriptional regulator homolog RPD3" /db_xref="PID:g1667394" /translation="MAYSQGGGKKKVCYYYDGDIGNYYYGQGHPMKPHRIRMTHNLLL NYGLYRKMEIYRPHKATAEEMTKYHSDEYIKFLRSIRPDNMSEYSKQMHIFNVGEDCP AFDGLFEFCQLSTGGSVAGAVKLNRQQTDMAVNWAGGLHHAKKYEASGFCYVNDIVLA ILELLKYHQRVLYIDIDIHHGDGVEEAFYTTDRVMTVSFHKYGEYFPGTGDLRDIGAG KGKYYAVNFPMCDGIDDESYGQIFKPIISKVMEMYQPSAVVLQCGADSLSGDRLGCFN LTVKGHAKCVEVVKTFNLPLLMLGGGGYTIRNVARCWTYETAVALDCEIPNELPYNDY FEYFGPDFKLHISPSNMTNQNTPEYMEKIKQRLFENLRMLPHAPGVQMQAIPEDAVHE DSGDEDGEDPDKRISIRASDKRIACDEEFSDSEDEGEGGRRNVADHKKGAKKARIEED KKETEDKKTDVKEEDKSKDNSGEKTDTKGTKSEQLSNP" BASE COUNT 626 a 360 c 454 g 545 t ORIGIN 1 cgccgagctt tcggcacctc tgccgggtgg taccgagcct tcccggcgcc ccctcctctc 61 ctcccaccgg cctgcccttc cccgcgggac tatcgccccc acgtttccct cagccctttt 121 ctctcccggc cgagccgcgg cggcagcagc agcagcagca gcagcaggag gaggagcccg 181 gtggcggcgg tggccgggga gcccatggcg tacagtcaag gaggcggcaa aaaaaaagtc 241 tgctactact acgacggtga tattggaaat tattattatg gacagggtca tcccatgaag 301 cctcatagaa tccgcatgac ccataacttg ctgttaaatt atggcttata cagaaaaatg 361 gaaatatata ggccccataa agccactgcc gaagaaatga caaaatatca cagtgatgag 421 tatatcaaat ttctacggtc aataagacca gataacatgt ctgagtatag taagcagatg 481 catatattta atgttggaga agattgtcca gcgtttgatg gactctttga gttttgtcag 541 ctctcaactg gcggttcagt tgctggagct gtgaagttaa accgacaaca gactgatatg 601 gctgttaatt gggctggagg attacatcat gctaagaaat acgaagcatc aggattctgt 661 tacgttaatg atattgtgct tgccatcctt gaattactaa agtatcatca gagagtctta 721 tatattgata tagatattca tcatggtgat ggtgttgaag aagcttttta tacaacagat 781 cgtgtaatga cggtatcatt ccataaatat ggggaatact ttcctggcac aggagacttg 841 agggatattg gtgctggaaa aggcaaatac tatgctgtca attttccaat gtgtgatggt 901 atagatgatg agtcatatgg gcagatattt aagcctatta tctcaaaggt gatggagatg 961 tatcaaccta gtgctgtggt attacagtgt ggtgcagact cattatctgg tgatagactg 1021 ggttgtttca atctaacagt caaaggtcat gctaaatgtg tagaagttgt aaaaactttt 1081 aacttaccat tactgatgct tggaggaggt ggctacacaa tccgtaatgt tgctcgatgt 1141 tggacatatg agactgcagt tgcccttgat tgtgagattc ccaatgagtt gccatataat 1201 gattactttg agtattttgg accagacttc aaactgcata ttagtccttc aaacatgaca 1261 aaccagaaca ctccagaata tatggaaaag ataaaacagc gtttgtttga aaatttgcgc 1321 atgttacctc atgcacctgg tgtccagatg caagctattc cagaagatgc tgttcatgaa 1381 gacagtggag atgaagatgg agaagatcca gacaagagaa tttctattcg agcatcagac 1441 aagcggatag cttgtgatga agaattctca gattctgagg atgaaggaga aggaggtcga 1501 agaaatgtgg ctgatcataa gaaaggagca aagaaagcta gaattgaaga agataagaaa 1561 gaaacagagg acaaaaaaac agacgttaag gaagaagata aatccaagga caacagtggt 1621 gaaaaaacag ataccaaagg aaccaaatca gaacagctca gcaacccctg aatttgacag 1681 tctcaccaat ttcagaaaat cattaaaaag aaaatattga aaggaaaatg ttttcttttt 1741 gaagacttct ggcttcattt tatactactt tggcatggac tgtatttatt ttcaaatggg 1801 actttttcgt ttttgttttt ctgggcaagt tttattgtga gattttctaa ttatgaagca 1861 aaatttcttt tctccaccat gctttatgtg atagtattta aaattgatgt gagttattat 1921 gtcaaaaaaa ctgatctatt aaagaagtaa ttggcctttc tgagctgaaa aaaaaaaaaa 1981 aaaag // LOCUS HSU31875 1442 bp mRNA PRI 29-NOV-1995 DEFINITION Human Hep27 protein mRNA, complete cds. ACCESSION U31875 NID g1079565 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1442) AUTHORS Gabrielli,F., Donadel,G., Bensi,G., Heguy,A. and Melli,M. TITLE A nuclear protein, synthesized in growth-arrested human hepatoblastoma cells, is a novel member of the short-chain alcohol dehydrogenase family JOURNAL Eur. J. Biochem. 232 (2), 473-477 (1995) MEDLINE 96035881 REFERENCE 2 (bases 1 to 1442) AUTHORS Gabrielli,F. TITLE Direct Submission JOURNAL Submitted (19-JUL-1995) Franco Gabrielli, Physiology and Biochemistry, University of Pisa, Via Roma 55, Pisa 56126, Italy FEATURES Location/Qualifiers source 1..1442 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="hepetoblastoma HepG2" CDS 434..1276 /note="similar to Streptomyces violaceoruber granaticin polyketide synthase putative ketoacyl, Swiss-Prot Accession Number P16542" /codon_start=1 /product="Hep27 protein" /db_xref="PID:g1079566" /translation="MLSAVARGYQGWFHPCARLSVRMSSTGIDRKGVLANRVAVVTGS TSGIGFAIARRLARDGAHVVISSRKQQNVDRAMAKLQGEGLSVAGIVCHVGKAEDREQ LVAKALEHCGGVDFLVCSAGVNPLVGSTLGTSEQIWDKILSVNVKSPALLLSQLLPYM ENRRGAVILVSSIAAYNPVVALGVYNVSKTALLGLTRTLALELAPKDIRVNCVVPGII KTDFSKVFHGNESLWKNFKEHHQLQRIGESEDCAGIVSFLCSPDASYVNGENIAVAGY STRL" BASE COUNT 322 a 369 c 440 g 311 t ORIGIN 1 ggttcccttc cacgctgtga agctttgttc ttttggtctt catgataaat cttgctgctg 61 ctcactcgtt gggtccgtgc cacctttaag agctgtaaca ctcaccgcga aggtctgcaa 121 cttcactcct ggggccagca agaccacgaa tgcaccgaga ggaatgaaca actctggaca 181 caccatcttt aagaaccgta atactcaccg caagggtctg caacttcatt cttgaagtca 241 gtgaggccaa gaacccatca attccgtaca cattttggtg actttgaaga gactgtcacc 301 tatcaccaag tggtgagact attgccaagc agtgagacta ttgccaagtg gtgagaccat 361 caccaagcgg tgagactatc acctatcgcc aagtggcctg attcagcagg aagcatctca 421 gacaccaacc actatgctgt cagcagttgc ccggggctac cagggctggt ttcatccctg 481 tgctaggctt tctgtgagga tgagcagcac cgggatagac aggaagggcg tcctggctaa 541 ccgggtagcc gtggtcacgg ggtccaccag tgggatcggc tttgccatcg cccgacgtct 601 ggcccgggac ggggcccacg tggtcatcag cagccggaag cagcagaacg tggaccgggc 661 catggccaag ctgcaggggg aggggctgag tgtggcgggc attgtgtgcc acgtggggaa 721 ggctgaggac cgggagcagc tggtggccaa ggccctggag cactgtgggg gcgtcgactt 781 cctggtgtgc agcgcagggg tcaaccctct ggtagggagc actctgggga ccagtgagca 841 gatctgggac aagatcctaa gtgtgaacgt gaagtcccca gccctgctgc tgagccagtt 901 gctgccctac atggagaaca ggaggggtgc tgtcatcctg gtctcttcca ttgcagctta 961 taatccagta gtggcgctgg gtgtctacaa tgtcagcaag acagcgctgc tgggtctcac 1021 tagaacactg gcattggagc tggcccccaa ggacatccgg gtaaactgcg tggttccagg 1081 aattataaaa actgacttca gcaaagtgtt tcatgggaat gagtctctct ggaagaactt 1141 caaggaacat catcagctgc agaggattgg ggagtcagag gactgtgcag gaatcgtgtc 1201 cttcctgtgc tctccagatg ccagctacgt caacggggag aacattgcgg tggcaggcta 1261 ctccactcgg ctctgagagg agtgggggcg gctgcgtagc tgtggtccca gcccaggagc 1321 ctgagggggt gtctaggtga tcatttggat ctggagcaga gtctgccatt ctgccagact 1381 agcaatttgg gggcttactc atgctaggct tgaggaagaa gaaaaacgct tcggcattct 1441 cc // LOCUS HSU31905 2983 bp mRNA PRI 12-JUN-1996 DEFINITION Human prostate-specific transglutaminase (hTGP) mRNA, complete cds. ACCESSION U31905 NID g1353349 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2983) AUTHORS Dubbink,H.J., Verkaik,N.S., Faber,P.W., Trapman,J., Schroder,F.H. and Romijn,J.C. TITLE Tissue specific and androgen-regulated expression of human prostate-specific transglutaminase JOURNAL Biochem. J. 315 (Pt 3), 901-908 (1996) MEDLINE 96220705 REFERENCE 2 (bases 1 to 2983) AUTHORS Dubbink,H.J. TITLE Direct Submission JOURNAL Submitted (20-JUL-1995) Hendrikus J. Dubbink, Urology, Erasmus University Rotterdam, Dr. Molewaterplein 50, Rotterdam 3000 DR, The Netherlands FEATURES Location/Qualifiers source 1..2983 /organism="Homo sapiens" /db_xref="taxon:9606" /map="3p21.33-p22" /chromosome="3" gene 45..2099 /gene="TGM4" CDS 45..2099 /gene="TGM4" /EC_number="2.3.2.13" /note="expressed mRNA is termed hTGP" /codon_start=1 /product="prostate-specific transglutaminase" /db_xref="PID:g1353350" /translation="MMDASKELQVLHIDFLNQDNAVSHHTWEFQTSSPVFRRGQVFHL RLVLNQPLQSYHQLKLEFSTGPNPSIAKHTLVVLDPRTPSDHYNWQATLQNESGKEVT VAVTSSPNAILGKYQLNVKTGNHILKSEENILYLLFNPWCKEDMVFMPDEDERKEYIL NDTGCHYVGAARSIKCKPWNFGQFEKNVLDCCISLLTESSLKPTDRRDPVLVCRAMCA MMSFEKGQGVLIGNWTGDYEGGTAPYKWTGSAPILQQYYNTKQAVCFGQCWVFAGILT TVLRALGIPARSVTGFDSAHDTERNLTVDTYVNENGKKITSMTHDSVWNFHVWTDAWM KRPDLPKGYDGWQAVDATPQERSQGVFCCGPSPLTAIRKGDIFIVYDTRFVFSEVNGD RLIWLVKMVNGQEELHVISMETTSIGKNISTKAVGQDRRRDITYEYKYPEGSSEERQV MDHAFLLLSSEREHRRPVKENFLHMSVQSDDVLLGNSVNFTVILKRKTAALQNVNILG SFELQLYTGKKMAKLCDLNKTSQIQGQVSEVTLTLDSKTYINSLAILDDEPVIRGFII AEIVESKEIMASEVFTSFQYPEFSIELPNTGRIGQLLVCNCIFKNTLAIPLTDVKFSL ESLGISSLQTSDHGTVQPGETIQSQIKCTPIKTGPKKFIVKLSSKQVKEINAQKIVLI TK" BASE COUNT 800 a 778 c 712 g 693 t ORIGIN 1 agagatagag tcttccctgg cattgcagga gagaatctga agggatgatg gatgcatcaa 61 aagagctgca agttctccac attgacttct tgaatcagga caacgccgtt tctcaccaca 121 catgggagtt ccaaacgagc agtcctgtgt tccggcgagg acaggtgttt cacctgcggc 181 tggtgctgaa ccagccccta caatcctacc accaactgaa actggaattc agcacagggc 241 cgaatcctag catcgccaaa cacaccctgg tggtgctcga cccgaggacg ccctcagacc 301 actacaactg gcaggcaacc cttcaaaatg agtctggcaa agaggtcaca gtggctgtca 361 ccagttcccc caatgccatc ctgggcaagt accaactaaa cgtgaaaact ggaaaccaca 421 tccttaagtc tgaagaaaac atcctatacc ttctcttcaa cccatggtgt aaagaggaca 481 tggttttcat gcctgatgag gacgagcgca aagagtacat cctcaatgac acgggctgcc 541 attacgtggg ggctgccaga agtatcaaat gcaaaccctg gaactttggt cagtttgaga 601 aaaatgtcct ggactgctgc atttccctgc tgactgagag ctccctcaag cccacagata 661 ggagggaccc cgtgctggtg tgcagggcca tgtgtgctat gatgagcttt gagaaaggcc 721 agggcgtgct cattgggaat tggactgggg actatgaagg tggcacagcc ccatacaagt 781 ggacaggcag tgccccgatc ctgcagcagt actacaacac gaagcaggct gtgtgctttg 841 gccagtgctg ggtgtttgct gggatcctga ctacagtgct gagagcgttg ggcatcccag 901 cacgcagtgt gacaggcttc gattcagctc acgacacaga aaggaacctc acggtggaca 961 cctatgtgaa tgagaatggc aagaaaatca ccagtatgac ccacgactct gtctggaatt 1021 tccatgtgtg gacggatgcc tggatgaagc gaccggatct gcccaagggc tacgacggct 1081 ggcaggctgt ggacgcaacg ccgcaggagc gaagccaggg tgtcttctgc tgtgggccat 1141 caccactgac cgccatccgc aaaggtgaca tctttattgt ctatgacacc agattcgtct 1201 tctcagaagt gaatggtgac aggctcatct ggttggtgaa gatggtgaat gggcaggagg 1261 agttacacgt aatttcaatg gagaccacaa gcatcgggaa aaacatcagc accaaggcag 1321 tgggccaaga caggcggaga gatatcacct atgagtacaa gtatccagaa ggctcctctg 1381 aggagaggca ggtcatggat catgccttcc tccttctcag ttctgagagg gagcacagac 1441 gacctgtaaa agagaacttt cttcacatgt cggtacaatc agatgatgtg ctgctgggaa 1501 actctgttaa tttcaccgtg attcttaaaa ggaagaccgc tgccctacag aatgtcaaca 1561 tcttgggctc ctttgaacta cagttgtaca ctggcaagaa gatggcaaaa ctgtgtgacc 1621 tcaataagac ctcgcagatc caaggtcaag tatcagaagt gactctgacc ttggactcca 1681 agacctacat caacagcctg gctatattag atgatgagcc agttatcaga ggtttcatca 1741 ttgcggaaat tgtggagtct aaggaaatca tggcctctga agtattcacg tctttccagt 1801 accctgagtt ctctatagag ttgcctaaca caggcagaat tggccagcta cttgtctgca 1861 attgtatctt caagaatacc ctggccatcc ctttgactga cgtcaagttc tctttggaaa 1921 gcctgggcat ctcctcacta cagacctctg accatgggac ggtgcagcct ggtgagacca 1981 tccaatccca aataaaatgc accccaataa aaactggacc caagaaattt atcgtcaagt 2041 taagttccaa acaagtgaaa gagattaatg ctcagaagat tgttctcatc accaagtagc 2101 cttgtctgat gctgtggagc cttagttgag atttcagcat ttcctacctt gtgcttagct 2161 ttcagattat ggatgattaa atttgatgac ttatatgagg gcagattcaa gagccagcag 2221 gtcaaaaagg ccaacacaac cataagcagc cagacccaca aggccaggtc ctgtgctatc 2281 acagggtcac ctcttttaca gttagaaaca ccagccgagg ccacagaatc ccatcccttt 2341 cctgagtcat ggcctcaaaa atcagggcca ccattgtctc aattcaaatc catagatttc 2401 gaagccacag agtctctccc tggagcagca gactatgggc agcccagtgc tgccacctgc 2461 tgacgaccct tgagaagctg ccatatcttc aggccatggg ttcaccagcc ctgaaggcac 2521 ctgtcaactg gagtgctctc tcagcactgg gatgggcctg atagaagtgc attctcctcc 2581 tattgcctcc attctcctct ctctatccct gaaatccagg aagtccctct cctggtgctc 2641 caagcagttt gaagcccaat ctgcaaggac atttctcaag ggccatgtgg ttttgcagac 2701 aaccctgtcc tcaggcctga actcaccata gagacccatg tcagcaaacg gtgaccagca 2761 aatcctcttc ccttattcta aagctgcccc ttgggagact ccagggagaa ggcattgctt 2821 cctccctggt gtgaactctt tctttggtat tccatccact atcctggcaa ctcaaggctg 2881 cttctgttaa ctgaagcctg ctccttcttg ttctgccctc cagagatttg ctcaaatgat 2941 caataagctt taaattaaac tctacttcaa gaaaaaaaaa ccg // LOCUS HSU31913 1120 bp mRNA PRI 08-JAN-1997 DEFINITION Human HBV-X associated (XAP2) mRNA, complete cds. ACCESSION U31913 NID g1765935 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1120) AUTHORS Kuzhandaivelu,N., Cong,Y.S., Inouye,C., Yang,W.M. and Seto,E. TITLE XAP2, a novel hepatitis B virus X-associated protein that inhibits X transactivation JOURNAL Nucleic Acids Res. 24 (23), 4741-4750 (1996) MEDLINE 97128268 REFERENCE 2 (bases 1 to 1120) AUTHORS Kuzhandaivelu,N., Inouye,C. and Seto,E. TITLE Direct Submission JOURNAL Submitted (19-JUL-1995) Edward Seto, Cellular & Structure Biology, Molecular Medicine, University of Texas Health Science Center at San Antonio, 15355 Lambda Drive, San Antonio, TX 78245, USA FEATURES Location/Qualifiers source 1..1120 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="lymphoma" /cell_type="B-cell" gene 10..1002 /gene="XAP2" CDS 10..1002 /gene="XAP2" /codon_start=1 /product="HBV-X associated protein" /db_xref="PID:g1765936" /translation="MADIIARLREDGIQKRVIQEGRGELPDFQDGTKATFHYRTLHSD DEGTVLDDSRARGKPMELIIGKKFKLPVWETIVCTMREGEIAQFLCDIKHVVLYPLVA KSLRNIAVGKDPLEGQRHCCGVAQMREHSSLGHADLDALQQNPQPLIFHMEMLKVESP GTYQQDPWAMTDEEKAKAVPLIHQEGNRLYREGHVKEAAAKYYDAIACLKNLQMKEQP GSPEWIQLDQQITPLLLNYCQCKLVVEEYYEVLDHCSSILNKYDDNVKAYFKRGKAHA AVWNAQEAQADFAKVLELDPALAPVVSRELRALEARIRQKDEEDKARFRGIFSH" BASE COUNT 257 a 329 c 344 g 190 t ORIGIN 1 ggaaggagga tggcggatat catcgcaaga ctccgggagg acgggatcca aaaacgtgtg 61 atacaggaag gccgaggaga gctcccggac tttcaagatg ggaccaaggc cacgttccac 121 taccggacgc tgcacagtga cgacgagggc accgtgctgg acgacagccg ggctcgtggc 181 aagcccatgg agctcatcat tggcaagaag ttcaagctgc ctgtgtggga gaccatcgtg 241 tgcaccatgc gagaagggga gattgcccag ttcctctgtg acatcaagca tgtggtcctg 301 tacccgctgg tggccaagag tctccgcaac atcgcggtgg gcaaggaccc cctggagggc 361 cagcggcact gctgcggtgt tgcacagatg cgtgaacaca gctccctggg ccatgctgac 421 ctggacgccc tgcagcagaa cccccagccc ctcatcttcc acatggagat gctgaaggtg 481 gagagccctg gcacgtacca gcaggaccca tgggccatga cagacgaaga gaaggcaaag 541 gcagtgccac ttatccacca ggagggcaac cggttgtacc gcgaggggca tgtgaaggag 601 gctgctgcca agtactacga tgccattgcc tgcctcaaga acctgcagat gaaggaacag 661 cctgggtccc ctgaatggat ccagctggac cagcagatca cgccgctgct gctcaactac 721 tgccagtgca agctggtggt cgaggagtac tacgaggtgc tggaccactg ttcttccatc 781 ctcaacaagt acgacgacaa cgtcaaggcc tacttcaagc ggggcaaggc ccacgcggcc 841 gtgtggaatg cccaggaggc ccaggctgac tttgccaaag tgctggagct ggacccagcc 901 ctggcgcctg tggtgagccg agagctgcgg gccctggagg cacggatccg gcagaaggac 961 gaagaggaca aagcccggtt ccgggggatc ttctcccatt gacaggagca cttggccctg 1021 ccttacctgc caagcccact gctgcagctg ccaccccccc tgcccgtgct gcgtcatgct 1081 tctgtgtata taaaggcctt tatttatctc tcaaaaaaaa // LOCUS HSU31930 1006 bp mRNA PRI 29-MAR-1996 DEFINITION Human deoxyuridine nucleotidohydrolase mRNA, complete cds. ACCESSION U31930 NID g1144331 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1006) AUTHORS Ladner,R.D., McNulty,D.E., Carr,S.A., Roberts,G.D. and Caradonna,S.J. TITLE Characterization of distinct nuclear and mitochondrial forms of human deoxyuridine triphosphate nucleotidohydrolase JOURNAL J. Biol. Chem. 271 (13), 7745-7751 (1996) MEDLINE 96205967 REFERENCE 2 (bases 1 to 1006) AUTHORS Ladner,R.D. TITLE Direct Submission JOURNAL Submitted (20-JUL-1995) Robert D. Ladner, Molecular Biology, University of Medicine and Dentistry of New Jersey, 2 Medical Center Drive, Stratford, NJ 08084, USA FEATURES Location/Qualifiers source 1..1006 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Jurkat" CDS 30..524 /EC_number="3.6.1.23" /codon_start=1 /product="deoxyuridine nucleotidohydrolase" /db_xref="PID:g1144332" /translation="MPCSEETPAISPSKRARPAEVGGMQLRFARLSEHATAPTRGSAR AAGYDLYSAYDYTIPPMEKAVVKTDIQIALPSGCYGRVAPRSGLAAKHFIDVGAGVID EDYRGNVGVVLFNFGKEKFEVKKGDRIAQLICERIFYPEIEEVQALDDTERGSGGFGS TGKN" mat_peptide 33..524 /note="the nuclear form of this protein is phosphorylated on Ser11" /evidence=experimental polyA_site 1006 /note="31 A nucleotides" BASE COUNT 306 a 196 c 208 g 296 t ORIGIN 1 cgtctcctcg ctcgccttct ggctctgcca tgccctgctc tgaagagaca cccgccattt 61 cacccagtaa gcgggcccgg cctgcggagg tgggcggcat gcagctccgc tttgcccggc 121 tctccgagca cgccacggcc cccacccggg gctccgcgcg cgccgcgggc tacgacctgt 181 acagtgccta tgattacaca ataccaccta tggagaaagc tgttgtgaaa acggacattc 241 agatagcgct cccttctggg tgttatggaa gagtggctcc acggtcaggc ttggctgcaa 301 aacactttat tgatgtagga gctggtgtca tagatgaaga ttatagagga aatgttggtg 361 ttgtactgtt taattttggc aaagaaaagt ttgaagtcaa aaaaggtgat cgaattgcac 421 agctcatttg cgaacggatt ttttatccag aaatagaaga agttcaagcc ttggatgaca 481 ccgaaagggg ttcaggaggt tttggttcca ctggaaagaa ttaaaattta tgccaagaac 541 agaaaacaag aagtcatacc tttttcttaa aaaaaaaaaa agtttttgct tcaagtgttt 601 tggtgttttg cacttctgta aacttactag ctttaccttc taaaagtact gcatttttta 661 ctttttttta tgatcaagga aaagatcatt aaaaaaaaac acaaaagaag tttttctttg 721 tgtttggatc aaaaagaaac tttgtttttc cgcaattgaa ggttgtatgt aaatctgctt 781 tgtggtgacc tgatgtaaac agtgtcttct taaaatcaaa tgtaaatcaa ttacagatta 841 aaaaaaaaaa gcctgtattt aactcatatg atctcccttc agcaacttat tttgctttaa 901 ttgctttaaa tcttaagcaa tattttttat tcagtaaaca aattctttca caaggtacaa 961 aatcttgcat aagctgaact aaaataaaaa tgaaaaggag agatta // LOCUS HSU31973 2969 bp mRNA PRI 09-APR-1996 DEFINITION Human phosphodiesterase A' subunit (PDE6C) mRNA, complete cds. ACCESSION U31973 NID g940230 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2969) AUTHORS Piriev,N.I., Viczian,A.S., Ye,J., Kerner,B., Korenberg,J.R. and Farber,D.B. TITLE Gene structure and amino acid sequence of the human cone photoreceptor cGMP-phosphodiesterase alpha' subunit (PDEA2) and its chromosomal localization to 10q24 JOURNAL Genomics 28 (3), 429-435 (1995) MEDLINE 96039253 REFERENCE 2 (bases 1 to 2969) AUTHORS Viczian,A.S., Piriev,N.I. and Farber,D.B. TITLE Isolation and characterization of a cDNA encoding the alpha' subunit of human cone cGMP-phosphodiesterase JOURNAL Gene 166 (2), 205-211 (1995) MEDLINE 96125191 REFERENCE 3 (bases 1 to 2969) AUTHORS Viczian,A.S. TITLE Direct Submission JOURNAL Submitted (21-JUL-1995) Andrea S. Viczian, Jules Stein Eye Institute, UCLA School of Medicine, 100 Stein Plaza, Los Angeles, CA 90095, USA FEATURES Location/Qualifiers source 1..2969 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="retina" /cell_type="photoreceptor cell" /map="10q24" /chromosome="10" gene 139..2715 /gene="PDE6C" CDS 139..2715 /gene="PDE6C" /note="cGMP-phosphodiesterase activity; membrane associated protein" /codon_start=1 /product="phosphodiesterase A' subunit" /db_xref="PID:g940231" /translation="MGEINQVAVEKYLEENPQFAKEYFDRKLRVEVLGEIFKNSQVPV QSSMSFSELTQVEESALCLELLWTVQEEGGTPEQGVHRALQRLAHLLQADRCSMFLCR SRNGIPEVASRLLVVTPTSKFEDNLVGPDKEVVFPLDIGIVGWAAHTKKTHNVPDVKK NSHFSDFMDKQTGYVTKNLLATPIVVGKEVLAVIMAVNKVNASEFSKQDEEVFSKYLN FVSIILRLHHTSYMYNIESRRSQILMWSANKVFEELTDVERQFHKALYTVRSYLNCER YSIGLLDMTKEKEFYDEWPIKLGEVEPYKGPKTPDGREVNFYKIIDYILHGKEEIKVI PTPPADHWTLISGLPTYVAENGFICNMMNAPADEYFTFPKGPVDETGWVIKNVLSLPI VNKKEDIVGVATFYNRKDGKPFDEHDEYITETLTQFLGWSLLNTDTYDKMNKLENRKD IAQEMLMNQTKATLEEIKSILKFQEKLNVDVIDDCEEKQLVAILKEDLPDPRSAELYE FRFSDFPLTEHGLIKCGIRLFFEINVVEKFKVPVEVLTRWMYTVRKGYRAVTYHNWQH GFNVGQTMFTLLMTGRLKKYYTDLEAFAMLAAAFCHDIDHRGTNNLYQMKSTSPLARL HGSSILERHHLEYSKTLLQDESLNIFQNLNKRQFETVIHLFEVAIIATDLALYFKKRT MFQKIVDACEQMQTEEEAIKYVTVDPTKKEIIMAMMMTACDLSAITKPWEVQSQVALM VANEFWEQGDLERTVLQQQPIPMMDRNKRDELPKLQVGFIDFVCTFVYKEFSRFHKEI TPMLSGLQNNRVEWKSLADEYDAKMKVIEEEAKKQEGGAEKAAEDSGGGDDKKSKTCL ML" polyA_signal 2880..2885 polyA_signal 2947..2952 polyA_site 2969 /note="11 A nucleotides" BASE COUNT 943 a 593 c 683 g 750 t ORIGIN 1 ctttggaagt cctatgaggg accatttacg gtttcctcag taatttccac caggatgaat 61 ttccttctca tcactctgcc tcaggtagtg ctctgaaggt cgtcctttct gaacaaacgc 121 agcaaagcaa gccacaccat gggtgagatc aaccaagttg ccgtggagaa atacctggag 181 gagaaccctc agtttgccaa ggagtacttt gacaggaagt tgcgggtgga ggtgctggga 241 gaaatcttca agaacagcca ggtgccagtc cagtccagca tgtccttctc tgagctgacc 301 caggtggagg agtcagccct gtgcttggag ctgctgtgga ccgtgcagga ggaggggggc 361 accccagagc agggggttca cagggcccta cagaggctgg cccacctgct ccaggctgac 421 cgctgcagca tgttcctgtg ccggtcccgg aacggcatac ctgaggtggc ctctaggttg 481 ctggtagtca cccccacctc caagtttgag gacaacctgg tgggccctga caaagaagtt 541 gtgtttccat tggacattgg gatagtgggt tgggctgctc acacgaagaa aactcataat 601 gtcccagatg tgaaaaagaa cagccatttt tctgacttca tggacaagca aactgggtat 661 gtcactaaga acctgctggc aaccccgatc gtggtgggca aggaggttct tgctgtgatc 721 atggcagtta acaaagtaaa tgcatctgaa ttttccaaac aggatgaaga ggtcttttcc 781 aaatacctca actttgtgtc tatcatccta aggcttcatc acaccagcta catgtacaat 841 attgaatccc gaagaagcca gatccttatg tggtcagcca ataaagtatt tgaagaactc 901 acagatgttg agcgacagtt tcacaaagcg ctctacacgg ttagatcata tctgaactgt 961 gaacgatact ccattggact gctggacatg accaaggaga aggaattcta cgatgaatgg 1021 ccaatcaagc ttggagaagt agagccttat aaaggtccaa agacacctga tggcagggaa 1081 gtcaactttt ataaaatcat tgattacatt ttacatggaa aagaagagat caaagtgatt 1141 ccgacgcctc ctgcagacca ctggacactc attagtgggt tgccaacata tgttgctgaa 1201 aatggattta tctgtaacat gatgaatgcc cctgcggatg aatacttcac atttccgaaa 1261 ggacctgtag acgaaactgg ttgggtcatt aagaatgttt tgtccctgcc tattgtcaac 1321 aagaaagaag atattgtggg agtggctaca ttttacaaca ggaaggatgg aaaacctttc 1381 gatgagcatg atgaatacat taccgagact ctcacacaat ttcttggatg gtctctttta 1441 aatactgaca cctacgataa gatgaataag ctagaaaaca gaaaggacat tgctcaggaa 1501 atgctcatga accaaaccaa agccactctt gaagaaatta agtccatttt gaaatttcaa 1561 gagaagttaa atgttgatgt aattgacgac tgtgaagaaa aacaacttgt tgcaattttg 1621 aaagaggact tgccagaccc acgctcagca gaactgtacg aattccgctt cagtgacttc 1681 ccccttacag agcacggatt gattaaatgt ggaatacgac tgttttttga aataaatgtg 1741 gtggagaaat tcaaagtacc tgtagaggtt cttaccagat ggatgtacac tgtgaggaaa 1801 gggtaccgag ctgtcactta ccacaattgg cagcatgggt tcaacgtggg gcagaccatg 1861 tttactttgc tgatgacagg aagattaaag aagtactaca cagatctcga agcctttgcc 1921 atgcttgctg ctgctttctg ccatgatatt gaccacagag gcaccaataa tttgtaccag 1981 atgaaatcca cgtctccatt agcaagactt catggttctt ctattttgga gaggcaccac 2041 ctggagtaca gtaagactct gttgcaggat gagagtttaa acatcttcca gaacctaaat 2101 aagcggcagt ttgaaacagt tattcatttg ttcgaggtcg caataatagc aactgacctg 2161 gctttatatt tcaagaagag gaccatgttt caaaaaattg ttgatgcctg tgaacaaatg 2221 caaacggaag aagaagccat caaatatgta actgttgatc caaccaagaa agagattatc 2281 atggcaatga tgatgacggc atgtgacttg tctgctatta ccaagccctg ggaggtgcaa 2341 agtcaggtag cacttatggt tgcaaatgaa ttttgggaac aaggagatct ggagagaaca 2401 gtgttgcagc aacaacccat tcctatgatg gacagaaaca aaagagatga attacctaaa 2461 cttcaagttg gatttattga ttttgtttgt acttttgtat ataaggagtt ctcacggttt 2521 cacaaagaaa tcacacctat gctgagtggt cttcagaata acagagtaga atggaaatca 2581 ctagctgatg agtatgatgc aaagatgaag gtcattgaag aggaggcaaa aaagcaagaa 2641 ggaggagccg aaaaagctgc tgaagattca ggaggtggtg atgacaaaaa gtccaaaaca 2701 tgtttaatgt tgtaatatta tctaactggt ctaaacttca aatatcattt tacctttgaa 2761 gaaaaccaga aaacattcaa aagaacttca acaaatcatc acgtaacagg atcttcagaa 2821 aaactaccct gtgactatga agaaaatata tattgctagc ccaaaaatcc caggggcaaa 2881 ataaagttca acaaaagtgc aaaatatgac aaaaataggt acatttttgg tgccaattta 2941 ttttaaatta aaaaatcatg caatcctga // LOCUS HSU31986 1442 bp mRNA PRI 26-SEP-1996 DEFINITION Human cartilage-specific homeodomain protein Cart-1 mRNA, complete cds. ACCESSION U31986 NID g1098653 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1442) AUTHORS Gordon,D.F., Wagner,J., Atkinson,B.L., Chiono,M., Berry,R., Sikela,J. and Gutierrez-Hartmann,A. TITLE Human Cart-1: structural organization, chromosomal localization, and functional analysis of a cartilage-specific homeodomain cDNA JOURNAL DNA Cell Biol. 15 (7), 531-541 (1996) MEDLINE 96326288 REFERENCE 2 (bases 1 to 1442) AUTHORS Gordon,D.F., Wagner,J., Atkinson,B., Chiono,M., Berry,R., Sikela,J. and Gutierrez-Hartmann,A. TITLE Direct Submission JOURNAL Submitted (20-JUL-1995) David F. Gordon, Medicine/Endocrinology, University of Colorado Health Sciences Center, 4200 E. Ninth Avenue, Denver, CO 80262, USA FEATURES Location/Qualifiers source 1..1442 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /map="12q21.3-q22" /chromosome="12" CDS 156..1136 /note="cartilage-specific homeodomain protein" /codon_start=1 /product="Cart-1" /db_xref="PID:g1098654" /translation="MEFLSEKFALKSPPSKNSDFYMGAGGPLEHVMETLDNESFYSKA SAGKCVQAFGPLPRAEHHVRLERTSPCQDSSVNYGITKVEGQPLHTELNRAMDNCNSL RMSPVKGMQEKGELDELGDKCDTNVSSSKKRRHRTTFTSLQLEELEKVFQKTHYPDVY VREQLALRTELTEARVQVWFQNRRAKWRKRERYGQIQQAKSHFAATYDISVLPRTDSY PQIQNNLWAGNASGGSVVTSCMLPRDTSSCMTPYSHSPRTDSSYTGFSNHQNQFSHVP LNNFFTDSLLTGATNGHAFETKPEFERRSSSIAVLRMKAKEHTANISWAM" misc_feature 549..731 /note="encodes homeodomain" BASE COUNT 406 a 350 c 328 g 358 t ORIGIN 1 gtattaaagg gaccggagcg gggcagttcc cgacgccgca gcgctcgctt tcccttccct 61 cctctctggc cctccctcct cccacccact ggctccctcc cccagcgctc tccagtttct 121 gtgccccagg agctacgcga cagtcttcca ggattatgga gtttctgagc gagaagtttg 181 ccctcaagag ccctccgagt aaaaacagtg acttttacat gggcgcagga ggtcctctgg 241 agcacgttat ggagacgctg gacaatgagt ccttttacag caaagcgtct gcaggcaaat 301 gcgtgcaggc cttcggaccc ctgccccgcg ccgagcatca cgtgcgcttg gagaggacct 361 cgccctgtca ggacagcagc gtgaactatg ggatcactaa agtagaagga cagccccttc 421 acaccgaact gaatagagct atggacaact gtaacagtct ccgaatgtct cccgtgaaag 481 ggatgcaaga gaagggagag ctggatgaac ttggggataa atgtgatacg aatgtatcca 541 gcagtaagaa acggaggcac cgaaccacct tcaccagttt gcagctagag gagctggaga 601 aagtctttca gaaaactcat tacccggatg tgtatgtcag agaacagctt gctctgagga 661 cagagctcac tgaggccagg gtccaggttt ggtttcaaaa tcgaagggcc aaatggagaa 721 aaagggaacg ttatggccaa atacaacaag cgaaaagcca ttttgctgcc acctatgata 781 tatcagtttt gccaaggact gacagctacc cacagattca gaacaatttg tgggcaggaa 841 atgcaagtgg tggttctgtg gttacttcat gcatgttacc acgtgacact tcctcctgta 901 tgacacctta ttctcactcg cctcggacag attccagtta cacggggttt tcaaaccacc 961 agaaccagtt cagccacgtg cccctcaaca attttttcac tgactctctt cttactgggg 1021 caaccaatgg acatgcattt gaaacaaagc cagagtttga aaggaggtct tccagtatcg 1081 cagttcttcg aatgaaagcc aaggagcaca ccgccaatat ttcatgggcc atgtaacata 1141 cagtactctt ttatttttct tttaatagca aagttaaaca ttcttatttc tcatatttaa 1201 aggataccac aataagctgc tgtgtgtgga attgctaaag gtcaagatat tcagtgagac 1261 cagcttaaat gaatagttgt tatttaacat taaaatctaa gaatgaacct ctgaaaagac 1321 taaataggtt taccatgtgc cagtctccac aaaccctgtt ttagtagtaa ggttttcttt 1381 ttctattgta caagtcaatg aaatatgatc acgcaactta ttaaagaata aatgtgttaa 1441 ac // LOCUS HSU32331 2569 bp mRNA PRI 07-SEP-1995 DEFINITION Human RIG mRNA, complete sequence. ACCESSION U32331 NID g975878 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2569) AUTHORS Ligon,A.H., Pershouse,M.A., Jassar,S., Yung,W.K.A. and Steck,P.A. TITLE Identification of a novel gene product (RIG) differentially expressed in human glioblastoma JOURNAL Unpublished (1995) REFERENCE 2 (bases 1 to 2569) AUTHORS Steck,P.A. TITLE Direct Submission JOURNAL Submitted (26-JUL-1995) Peter A. Steck, University of Texas, M.D. Anderson Cancer Center, 1515 Holcombe Blvd, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..2569 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /chromosome="11" /map="11p15.1" CDS 26..358 /note="Regulated In Glioma; ; expressed in brain with lower quantities in lung and heart" /codon_start=1 /function="unknown" /product="RIG" /db_xref="PID:g975879" /translation="MPFFSSCLCPSHYSGPSLPSSTSSSLPTGPENQLGFVLLQAMVH HANSSCVRNAFWLQITEKLTPALSIIISVVYLRCPEMVENRIGFLLNVKDSKTLSVVG PHPKPCIL" repeat_region 2273..2508 BASE COUNT 685 a 601 c 537 g 740 t 6 others ORIGIN 1 aattcccaga ccacaggaat acctaatgcc ttttttctct tcctgtcttt gtccctcaca 61 ctacagcggc ccctcccttc cctcttcaac ctcatcctcc ctccccacag gcccagagaa 121 ccagttgggc tttgttctcc tgcaggctat ggttcatcat gcaaatagct cctgtgtcag 181 aaatgctttt tggcttcaaa taacagaaaa gctaacacca gctttatcaa taataatatc 241 ggtggtttac ttaaggtgtc cagagatggt ggagaacagg attggtttcc tcctcaatgt 301 caaggactca aagactcttt ctgtggtagg gccacatcct aaaccctgta tcctgtgatt 361 atttacccga caggcaaaag agattttgca gatgcaatta aggttaagga ccttgacgtg 421 ggaagattgt gattatttac ctgacagggc aaaagagatt ttgcagatgc aattaaggtt 481 aaggaccttg acgtgggaag attatcctgg attatctagg tgggcgcaat ttgatcacat 541 gggtccccag aagtggagaa cctttcccac ctgtagaaag ccagagagct ggcacctgag 601 aaggacagaa ctgtcattgc aggatttgaa gatgaagggg cccatgagcc aaggaatgcc 661 agtgacctat agaggctaaa aaacagcaag gaaatggact ctccccagag cctccagagg 721 aatgcagccc tgttgatcac atgatcacca gatggctgcc ccagagccaa atgtcgcttc 781 ctgagcacca tactcaaagg caggggaagt ggatggaggg caggagctcc attcttgttt 841 gccactctcc ttttgtcaat tgggaaaaaa ttccagaaac tctgggagcc ctccccttac 901 atttcctggg tcatggggcc agccctagct gctggaggga ctgagaacag ctgttgagca 961 gtttacctga cggcatctgc catggcttgg caggaactct ggctttggga gagagcagca 1021 gcaaggtatt caagcaccac ctccacccag cccctcccac atttcactca ggactgagta 1081 aaggagacac tcagatgcta ctcagatgct ggcttcagct aagtattttg caaagcctct 1141 cgtgtcttac aagtttgtgg ctatcatgac aaaatggagc agcctactat atctacatat 1201 acaactatgg gggacctagt tttatctcat ttaccacaat gttttcaatc attttttgga 1261 tgacataatt tttagcctct tctctaaatg cttcctcaag ctttccttgc cttccagcca 1321 ctgcaaatga cttgcagttt cccctacatg ncacctgacc cttgtgcctc cctccctctg 1381 cccatgncca gaaagccctt tnctgtgccc tctggcttcc tgataaactc ctatcatctt 1441 caagagccag ttcccatgcc agctctcccc aagtgctcca ctgaggcttc cgtaacacct 1501 ctgttcccac atcgggttga ctgtctttgt tttgtcattg cttgctctgg ctgtgtctcc 1561 ctcattagac tgggatgcct tcaaggtagg gaccctatct gggtcagctt ggcaccccaa 1621 agcgtaccac agcacctgat nctgaggagg ctctcagtag atatctgttg agtaaccaga 1681 atgtagggtg gtcctgatgg tttctgacat tgaatagaaa acagctccct atttgatctt 1741 aaaataatca ctataacctg gacatactgt actagatgct gtttttgtct gacttctact 1801 ctgtcaatct ctttgcacct ccatttgttc atctgtgaaa tgaagaaaat gctcatggag 1861 ttcagtgaag attaaatgaa tgaatatagg tagactgcct aatctggcac ttgccacgca 1921 gctgacttca atatagtagc tctaatatta tggtccttga ggatcttact gtcttatggc 1981 ccagaactgc atttgattaa agaaggctnn cctaaaaaaa gagtcataca tattccattt 2041 gtcctttcag aaggccgtga agcatttaca ctctttaaga caaattccca tccaaaaata 2101 gttaagattt ctaaaatatt ttgatgctga aagaggtgtg cttcagttgg gtggcaaatt 2161 tgcttctatg gaagattttt aatacaggtt gtttctattt tactttttct ggctgaaagg 2221 attttacatt tattcaaagt caaaagggaa aagaaatcca agaactacag aagagcagtt 2281 gaagtgattt atgcttgatt tctaaatgca acttatgttt atacataatt taaaactcaa 2341 agaaagcatg cttatacaat catgtgcaac tttaaacttt aagaactctg gatgaataca 2401 tggtggcaac agtccatgac acctgaaaac atcatttgtg gagtggcgta gagttcagtg 2461 ttcgcagtcg catattacaa ccatgtttca cacagccctg ctcggtttga ttttctccac 2521 gtggttgata attgtcttca gttgctgcta agtgattttg caaatttcg // LOCUS HSU32499 1203 bp mRNA PRI 03-AUG-1995 DEFINITION Human d3 dopamine receptor mRNA, complete cds. ACCESSION U32499 NID g927341 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1203) AUTHORS Fishburn,C.S., Park,B.-H. and Fuchs,S. TITLE Direct Submission JOURNAL Submitted (27-JUL-1995) C.Simone Fishburn, Dept. of Chemical Immunology, The Weizmann Institute of Science, Rehovot 76100, Israel FEATURES Location/Qualifiers source 1..1203 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="TE671" /chromosome="3" CDS 1..1203 /codon_start=1 /product="d3 dopamine receptor" /db_xref="PID:g927342" /translation="MASLSQLSSHLNYTCGAENSTGASQARPHAYYALSYCALILAIV FGNGLVCMAVLKERALQTTTNYLVVSLAVADLLVATLVMPWVVYLEVTGGVWNFSRIC CDVFVTLDVMMCTASILNLCAISIDRYTAVVMPVHYQHGTGQSSCRRVALMITAVWVL AFAVSCPLLFGFNTTGDPTVCSISNPDFVIYSSVVSFYLPFGVTVLVYARIYVVLKQR RRKRILTRQNSQCNSVRPGFPQQTLSPDPAHLELKRYYSICQDTALGGPGFQERGGEL KREEKTRNSLSPTIAPKLSLEVRKLSNGRLSTSLKLGPLQPRGVPLREKKATQMVAIV LGAFIVCWLPFFLTHVLNTHCQTCHVSPELYSATTWLGYVNSALNPVIYTTFNIEFRK AFLKILSC" BASE COUNT 254 a 351 c 309 g 289 t ORIGIN 1 atggcatctc tgagtcagct gagtagccac ctgaactaca cctgtggggc agagaactcc 61 acaggtgcca gccaggcccg cccacatgcc tactatgccc tctcctactg cgcgctcatc 121 ctggccatcg tcttcggcaa tggcctggtg tgcatggctg tgctgaagga gcgggccctg 181 cagactacca ccaactactt agtagtgagc ctggctgtgg cagacttgct ggtggccacc 241 ttggtgatgc cctgggtggt atacctggag gtgacaggtg gagtctggaa tttcagccgc 301 atttgctgtg atgtttttgt caccctggat gtcatgatgt gtacagccag catccttaat 361 ctctgtgcca tcagcataga caggtacact gcagtggtca tgcccgttca ctaccagcat 421 ggcacgggac agagctcctg tcggcgcgtg gccctcatga tcacggccgt ctgggtactg 481 gcctttgctg tgtcctgccc tcttctgttt ggctttaata ccacagggga ccccactgtc 541 tgctccatct ccaaccctga ttttgtcatc tactcttcag tggtgtcctt ctacctgccc 601 tttggagtga ctgtccttgt ctatgccaga atctatgtgg tgctgaaaca aaggagacgg 661 aaaaggatcc tcactcgaca gaacagtcag tgcaacagtg tcaggcctgg cttcccccaa 721 caaaccctct ctcctgaccc ggcacatctg gagctgaagc gttactacag catctgccag 781 gacactgcct tgggtggacc aggcttccaa gaaagaggag gagagttgaa aagagaggag 841 aagactcgga attccctgag tcccaccata gcgcccaagc tcagcttaga agttcgaaaa 901 ctcagcaatg gcagattatc gacatctttg aagctggggc ccctgcaacc tcggggagtg 961 ccacttcggg agaagaaggc aacccaaatg gtggccattg tgcttggggc cttcattgtc 1021 tgctggctgc ccttcttctt gacccatgtt ctcaataccc actgccagac atgccacgtg 1081 tccccagagc tttacagtgc cacgacatgg ctgggctacg tgaatagcgc cctcaaccct 1141 gtgatctata ccaccttcaa tatcgagttc cggaaagcct tcctcaagat cctgtcttgc 1201 tga // LOCUS HSU32500 3400 bp mRNA PRI 30-MAR-1996 DEFINITION Human type 2 neuropeptide Y receptor mRNA, complete cds. ACCESSION U32500 NID g1000750 KEYWORDS Neuropeptide Y; neuropeptide Y Y2 receptor; G-protein coupled receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3400) AUTHORS Rose,P.M., Fernandes,P., Lynch,J.S., Frazier,S.T., Fisher,S.M., Kodukula,K., Kienzle,B. and Seethala,R. TITLE Cloning and functional expression of a cDNA encoding a human type 2 neuropeptide Y receptor JOURNAL J. Biol. Chem. 270 (39), 22661-22664 (1995) MEDLINE 96032678 REFERENCE 2 (bases 1 to 3400) AUTHORS Rose,P.M. TITLE Direct Submission JOURNAL Submitted (27-JUL-1995) Patricia M. Rose, Biomolecular Screening, Bristol-Myers Squibb, Box 4000, Princeton, NJ 08543-4000, USA FEATURES Location/Qualifiers source 1..3400 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="SMS-KAN" /tissue_type="neuroblastoma" CDS 249..1394 /codon_start=1 /product="type 2 neuropeptide Y receptor" /db_xref="PID:g1000751" /translation="MGPIGAEADENQTVEEMKVEQYGPQTTPRGELVPDPEPELIDST KLIEVQVVLILAYCSIILLGVIGNSLVIHVVIKFKSMRTVTNFFIANLAVADLLVNTL CLPFTLTYTLMGEWKMGPVLCHLVPYAQGLAAQVSTITLTVIALDRHRCIVYHLESKI SKRISFLIIGLAWGISALLASPLAIFREYSLIEIIPDFEIVACTEKWPGEEKSIYGTV YSLSSLLILYVLPLGIISFSYTRIWSKLKNHVSPGAANDHYHQRRQKTTKMLVCVVVV FAVSWLPLHAFQLAVDIDSQVLDLKEYKLIFTVFHIIAMCSTFANPLLYGWMNSNYRK AFLSAFRCEQRLDAIHSEVSVTFKAKKNLEVRKNSGPNDSFTEATNV" BASE COUNT 1012 a 660 c 697 g 1031 t ORIGIN 1 gcagcgccaa ccgcccagcc gctctgactg ctccggctgc ccgcccgcgc ggcgcgggct 61 gtcctggacc ctaggagggg acggaaccgg acttgccttt gggcaccttc cagggccctc 121 tccaggtcgg ctggctaatc atcggacaga cggactgcac acatcttgtt tccgcgtctc 181 cgcaaaaacg cgaggtccag gttgtagact cttgtgctgg ttgcaggcca agtggacctg 241 tactgaaaat gggtccaata ggtgcagagg ctgatgagaa ccagacagtg gaagaaatga 301 aggtggaaca atacgggcca caaacaactc ctagaggtga actggtccct gaccctgagc 361 cagagcttat agatagtacc aagctgattg aggtacaagt tgttctcata ttggcctact 421 gctccatcat cttgcttggg gtaattggca actccttggt gatccatgtg gtgatcaaat 481 tcaagagcat gcgcacagta accaactttt tcattgccaa tctggctgtg gcagatcttt 541 tggtgaacac tctgtgtcta ccgttcactc ttacctatac cttaatgggg gagtggaaaa 601 tgggtcctgt cctgtgccac ctggtgccct atgcccaggg cctggcagca caagtatcca 661 caatcacctt gacagtaatt gccctggacc ggcacaggtg catcgtctac cacctagaga 721 gcaagatctc caagcgaatc agcttcctga ttattggctt ggcctggggc atcagtgccc 781 tgctggcaag tcccctggcc atcttccggg agtattcgct gattgagatc attccggact 841 ttgagattgt ggcctgtact gaaaagtggc ctggcgagga gaagagcatc tatggcactg 901 tctatagtct ttcttccttg ttgatcttgt atgttttgcc tctgggcatt atatcatttt 961 cctacactcg catttggagt aaattgaaga accatgtcag tcctggagct gcaaatgacc 1021 actaccatca gcgaaggcaa aaaaccacca aaatgctggt gtgtgtggtg gtggtgtttg 1081 cggtcagctg gctgcctctc catgccttcc agcttgccgt tgacattgac agccaggtcc 1141 tggacctgaa ggagtacaaa ctcatcttca cagtgttcca cattatcgcc atgtgctcca 1201 cttttgccaa tccccttctc tatggctgga tgaacagcaa ctacagaaag gctttcctct 1261 cggccttccg ctgtgagcag cggttggatg ccattcactc tgaggtgtcc gtgacattca 1321 aggctaaaaa gaacctggag gtcagaaaga acagtggccc caatgactct ttcacagagg 1381 ctaccaatgt ctaaggaagc tgtggtgtga aaatgtatgg atgaattctg accagagcta 1441 tgaatctggt tgatggcggc tcacaagtga aaactgattt cccattttaa agaagaagtg 1501 gatctaaatg gaagcatctg ctgtttaatt cctggaaaac tggctgggca gagcctgtgt 1561 gaaaatactg gaattcaaag ataaggcaac aaaatggttt acttaacagt tggttgggta 1621 gtaggttgca ttatgagtaa aagcagagag aagtactttt gattattttc ctggagtgaa 1681 gaaaacttga acaagaaatt ggtattatca aagcattgct gagagacggt gggaaaataa 1741 gttgactttc aaatcacgtt aggacctgga ttgaggaggt gtgcagttcg ctgctccctg 1801 cttggcttat gaaaacacca ctgaacagaa atttctccag ggagccacag gctctccttc 1861 atcgcatttt gatttttttg ttcattctct agacaaaatc catcagggga atgctgcagg 1921 aaacgattgc caactatacg aatggcttcg aggagataaa ctgaaatttg gctatataat 1981 taatattttg gcagatgata ggggaactcc tcaacactca gtgggccaat tgttcttaaa 2041 accaattgca cgtttggtga aagtttcttc aactctgaat caaaagctga aattctcaga 2101 attacaggaa atgcaaacca tcatttaatt tctaatttca agttacatcc gctttatgga 2161 gatactattt agataacaag aatacaactt gatactttta ttgttatacc tttttgaaca 2221 tgtatgattt ctgttgttat tcctattgga gctaagtttg tctacactaa aatttaaatc 2281 agactagaga ataatttttg tggcatgttg taacatttca cagtatttac aagctatttt 2341 tgcacaggta catagctctc atgtatttaa agaacactgc agtgttattt tctttgaaat 2401 tcatcctcca cggacccatt catactaaat aaaacaatgt aattacatta aaatggacct 2461 atctgtaaga ggtactaaaa acactggatt catttcatct tgcaaatgtt gtatttcaaa 2521 ccagtttcac ataagttatt tgtcttcttt tcaaaataat tagctatatt tttatataat 2581 atgaatatat acataaaaat tgtttctata aattgtagaa catagatgct acagtatttt 2641 ttatttaatt atattatgaa taaaattgtt atttcaatag tacccaacca aagatgctta 2701 aaaaccttct tatgttcata aaaaataaca actgagatgt taaaatagtc atacgtcttt 2761 agatgctatt aaagtttcat tagtcatatt tttgtaaata tgacagaatt tgtgaatata 2821 tttttaaagc aaaaaacttc aacatgcata tgatatatag ttacaacatt aattttatga 2881 actggagagc tttactttgt ggatatattt aaaattcata ttatagctcc tattaaattc 2941 cttccatgat agatataaag gactggtttt taagtgcact gcacttctgg aatactgaaa 3001 aagaatgaaa acaatatgtt agattaggtg taagacttta agaagcgaac aaaaagtaat 3061 gtatatctgt aatatataat caaatgattc atttttctgt tagactaggc aaattgttca 3121 aaaataacct ttttgtcttt taagtagcag tcactttgct taagatgcta atagaaaact 3181 gtggttaaag atttaccctc cctcttggtg aattattaca ctgtaagaaa tgtatatgct 3241 actgtgttac atgttgtatt agtaaattat tagaatccaa ttaatgattc aattaacata 3301 tatcttatcc aattcattat gtcaattcat caataaaata ccttttatgt agaggcttta 3361 tgttgcaatt aaaaagttgg gaaaatgaga aaaaaaaaaa // LOCUS HSU32519 1769 bp mRNA PRI 11-SEP-1996 DEFINITION Human GAP SH3 binding protein mRNA, complete cds. ACCESSION U32519 NID g1051169 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1769) AUTHORS Parker,F., Maurier,F., Delumeau,I., Duchesne,M., Faucher,D., Debussche,L., Dugue,A., Schweighoffer,F. and Tocque,B. TITLE A Ras-GTPase-activating protein SH3-domain-binding protein JOURNAL Mol. Cell. Biol. 16 (6), 2561-2569 (1996) MEDLINE 96220439 REFERENCE 2 (bases 1 to 1769) AUTHORS Tocque,B. TITLE Direct Submission JOURNAL Submitted (27-JUL-1995) Bruno Tocque, Molecular Oncology, Gencell. Rhone-Poulenc Rorer, 13 Quai Jules Guesde, Vitry Sur Seine, F94400, France FEATURES Location/Qualifiers source 1..1769 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" CDS 122..1522 /codon_start=1 /product="GAP SH3 binding protein" /db_xref="PID:g1051170" /translation="MVMEKPSPLLVGREFVRQYYTLLNQAPDMLHRFYGKNSSYVHGG LDSNGKPADAVYGQKEIHRKVMSQNFTNCHTKIRHVDAHATLNDGVVVQVMGLLSNNN QALRRFMQTFVLAPEGSVANKFYVHNDIFRYQDEVFGGFVTEPQEESEEEVEEPEERQ QTPEVVPDDSGTFYDQAVVSNDMEEHLEEPVAEPEPDPEPEPEQEPVSEIQEEKPEPV LEETAPEDAQKSSSPAPADIAQTVQEDLRTFSWASVTSKNLPPSGAVPVTGIPPHVVK VPASQPRPESKPESQIPPQRPQRDQRVREQRINIPPQRGPRPIREAGEQGDIEPRRMV RHPDSHQLFIGNLPHEVDKSELKDFFQSYGNVVELRINSGGKLPNFGFVVFDDSEPVQ KVLSNRPIMFRGEVRLNVEEKKTRAAREGDRRDNRLRGPGGPRGGLGGGMRGPPRGGM VQKPGFGVGRGLAPRQ" BASE COUNT 475 a 395 c 449 g 450 t ORIGIN 1 gaattcgggc ggggtttgta ctatcctcgg tgctgtggtg cagagctagt tcctctccag 61 ctcagccgcg taggtttgga catatttact cttttccccc caggttgaat tgaccaaagc 121 aatggtgatg gagaagccta gtcccctgct ggtcgggcgg gaatttgtga gacagtatta 181 cacactgctg aaccaggccc cagacatgct gcatagattt tatggaaaga actcttctta 241 tgtccatggg ggattggatt caaatggaaa gccagcagat gcagtctacg gacagaaaga 301 aatccacagg aaagtgatgt cacaaaactt caccaactgc cacaccaaga ttcgccatgt 361 tgatgctcat gccacgctaa atgatggtgt ggtagtccag gtgatggggc ttctctctaa 421 caacaaccag gctttgagga gattcatgca aacgtttgtc cttgctcctg aggggtctgt 481 tgcaaataaa ttctatgttc acaatgatat cttcagatac caagatgagg tctttggtgg 541 gtttgtcact gagcctcagg aggagtctga agaagaagta gaggaacctg aagaaagaca 601 gcaaacacct gaggtggtac ctgatgattc tggaactttc tatgatcagg cagttgtcag 661 taatgacatg gaagaacatt tagaggagcc tgttgctgaa ccagagcctg atcctgaacc 721 agaaccagaa caagaacctg tatctgaaat ccaagaggaa aagcctgagc cagtattaga 781 agaaactgcc cctgaggatg ctcagaagag ttcttctcca gcacctgcag acatagctca 841 gacagtacag gaagacttga ggacattttc ttgggcatct gtgaccagta agaatcttcc 901 acccagtgga gctgttccag ttactgggat accacctcat gttgttaaag taccagcttc 961 acagccccgt ccagagtcta agcctgaatc tcagattcca ccacaaagac ctcagcggga 1021 tcaaagagtg cgagaacaac gaataaatat tcctccccaa aggggaccca gaccaatccg 1081 tgaggctggt gagcaaggtg acattgaacc ccgaagaatg gtgagacacc ctgacagtca 1141 ccaactcttc attggcaacc tgcctcatga agtggacaaa tcagagctta aagatttctt 1201 tcaaagttat ggaaacgtgg tggagttgcg cattaacagt ggtgggaaat tacccaattt 1261 tggttttgtt gtgtttgatg attctgagcc tgttcagaaa gtccttagca acaggcccat 1321 catgttcaga ggtgaggtcc gtctgaatgt cgaagagaag aagactcgag ctgccaggga 1381 aggcgaccga cgagataatc gccttcgggg acctggaggc cctcgaggtg ggctgggtgg 1441 tggaatgaga ggccctcccc gtggaggcat ggtgcagaaa ccaggatttg gagtgggaag 1501 ggggcttgcg ccacggcagt aatcttcatg gatcttcatg cagccataca aaccctggtt 1561 ccaacagaat ggtgaatttt cgacagcctt tggtatcttg gagtatgacc ccagtctgtt 1621 ataaactgct taagtttgta taattttact ttttttgtgt gttaatggtg tgtgctccct 1681 ctccctctct tccctttcct gacctttagt ctttcacttc caattttgtg gaatgatatt 1741 ttaggaataa cggactttta cccgaattc // LOCUS HSU32581 3017 bp mRNA PRI 02-FEB-1996 DEFINITION Human lambda/iota-protein kinase C-interacting protein mRNA, complete cds. ACCESSION U32581 NID g1173575 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3017) AUTHORS Diaz-Meco,M.T., Municio,M.M., Sanchez,P., Lozano,J. and Moscat,J. TITLE Lambda-interacting protein, a novel protein that specifically interacts with the zinc finger domain of the atypical protein kinase C isotype lambda/iota and stimulates its kinase activity in vitro and in vivo JOURNAL Mol. Cell. Biol. 16 (1), 105-114 (1996) MEDLINE 96104559 REFERENCE 2 (bases 1 to 3017) AUTHORS Moscat,J., Diaz-Meco,M.T., Municio,M.M. and Sanchez,P. TITLE Direct Submission JOURNAL Submitted (28-JUL-1995) Jorge Moscat, Centro de Biologia Molecular 'Severo Ochoa', (CSIC-UAM), Canto Blanco, Madrid 28049, Spain FEATURES Location/Qualifiers source 1..3017 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 25..2166 /codon_start=1 /product="lambda/iota protein kinase C-interacting protein" /db_xref="PID:g1173576" /translation="MLHELDGLIEQTTDGVPLQTLVESLQAYLRNAAMGLEEETHAHY IDVARLLHAQYGELIQPRNGSVDETPKMSAGQMLLVAFDGMFAQVETAFSLLVEKLNK MEIPIAWRKIDIIREARSTQVNFFDDDNHRQVLEEIFFLKRLQTIKEFFRLCGTFSKT LSGSSSLEDQNTVNGPAQIVNVKTLFRNSCFSEDQMAKPIKAFTADFVRQLLIGLPNQ ALGLTLCSFISALGVDIIAQVEAKDFGAESKVSVDDLCKKAVEHNIQIGKFSQLVMNR ATVLASSYDTAWKKHDLVRRLETSISSCKTSLQRVQLHIAMFQWQHEDLLINRPQAMS VTPPPRSAILTSMKKKLHTLSQIETSIATVQEKLAALESSIEQRLKWAGGANPALAPV LQDFEATIAERRNLVLKESQRASQVTFLCSNIIHFESLRTRTAEALNLDAALFELIKR CQQMCSFASQFNSSVSELELRLLQRVDTGLEHPIGSSEWLLSAHKQLTQDMSTQRAIQ TEKEQQIETVCETIQNLVDNIKTVLTGHNRQLGDVKHLLKAMAKDEEAALADGEDVPY ENSVRQFLGEYKSWQDNIQTVLFTLVQAMGQVRSQEHVEMLQEITPTLKELKTQSQSI YNNLVSFASPLVTDATNECSSPTSSATYQPSFAAAVRSNTGQKTQPDVMSQNARKLIQ KNLATSADTPPSTVPGTARVLLVVLKRQSET" BASE COUNT 930 a 643 c 721 g 723 t ORIGIN 1 aattcgtgat acacgttaga aagtatgctg catgaactgg acggtcttat tgagcagacc 61 accgatggcg ttcccctgca gactctagtg gaatctcttc aggcctactt aagaaacgca 121 gctatgggac tggaagaaga aacacatgct cattacatcg atgttgccag actactacat 181 gctcagtacg gtgaattaat ccaaccgaga aatggttcag ttgatgaaac acccaaaatg 241 tcagctggcc agatgctttt ggtagcattc gatggcatgt ttgctcaagt tgaaactgct 301 ttcagcttat tagttgaaaa gttgaacaag atggaaattc ccatagcttg gcgaaagatt 361 gacatcataa gggaagccag gagtactcaa gttaattttt ttgatgatga taatcaccgg 421 caggtgctag aagagatttt ctttctaaaa agactacaga ctattaagga gttcttcagg 481 ctctgtggta ccttttctaa aacattgtca ggatcaagtt cacttgaaga tcagaatact 541 gtgaatgggc ctgcacagat tgtcaatgtg aaaacccttt ttagaaactc ttgtttcagt 601 gaagaccaaa tggccaaacc tatcaaggca ttcacagctg actttgtgag gcagctcttg 661 atagggctac ccaaccaagc cctcggactc acactgtgca gttttatcag tgctctgggt 721 gtagacatca ttgctcaagt agaggcaaag gactttggtg ccgaaagcaa agtttctgtt 781 gatgatctct gtaagaaagc ggtggaacat aacatccaga tagggaagtt ctctcagctg 841 gttatgaaca gggcaactgt gttagcaagt tcttacgaca ctgcctggaa gaagcatgac 901 ttggtgcgaa ggctagaaac cagtatttct tcttgtaaga caagcctgca gcgggttcag 961 ctgcatattg ccatgtttca gtggcaacat gaagatctac ttatcaatag accacaagcc 1021 atgtcagtca cacctccccc acggtctgct atcctaacca gcatgaaaaa gaagctgcat 1081 accctgagcc agattgaaac ttctattgcg acagttcagg agaagctagc tgcacttgaa 1141 tcaagtattg aacagcgact caagtgggca ggtggtgcca accctgcatt ggcccccgta 1201 ctacaagatt ttgaagcaac gatagctgaa agaagaaatc ttgtccttaa agagagccaa 1261 agagcaagtc aggtcacatt tctctgcagc aatatcattc attttgaaag tttacgaaca 1321 agaactgcag aagccttaaa cctggatgcg gcgttatttg aactaatcaa gcgatgtcag 1381 cagatgtgtt cgtttgcatc acagtttaac agttcagtgt ctgagttaga gcttcgttta 1441 ttacagagag tggacactgg tcttgaacat cctattggca gctctgaatg gcttttgtca 1501 gcacacaaac agttgaccca ggatatgtct actcagaggg caattcagac agagaaagag 1561 cagcagatag aaacggtctg tgaaacaatt cagaatctgg ttgataatat aaagactgtg 1621 ctcactggtc ataaccgaca gcttggagat gtcaaacatc tcttgaaagc tatggctaag 1681 gatgaagaag ctgctctggc agatggtgaa gatgttccct atgagaacag tgttaggcag 1741 tttttgggtg aatataaatc atggcaagac aacattcaaa cagttctatt tacattagtc 1801 caggctatgg gtcaggttcg aagtcaagaa cacgttgaaa tgctccagga aatcactccc 1861 accttgaaag aactgaaaac acaaagtcag agtatctata ataatttagt gagttttgca 1921 tcacccttag tcaccgatgc aacaaatgaa tgttcgagtc caacgtcatc tgctacttat 1981 cagccatcct tcgctgcagc agtccggagt aacactggcc agaagactca gcctgatgtc 2041 atgtcacaga atgctagaaa gctgatccag aaaaatcttg ctacatcagc tgatactcca 2101 ccaagcaccg ttccaggaac tgcaagagtg ttgcttgtag tcctaaaaag gcagtcagag 2161 acctaaaact gggaaagcgg tgcaagagag aaactcctat gcagtgagtg tgtggaagag 2221 agtgaaagcc aagttagagg gccgagatgt tgatccgaat aggaggatgt cagttgctga 2281 acaggttgac tatgtcatta aggaggcaac taatctagat aacttggctc agctgtatga 2341 aggttggaca gcctgggtgt gaatggcaag acagtagatg agtctggtta agcgaggtca 2401 gacatccacc agaatcaact cagcctcagg catccaaagc cacaccacag tcggtggtga 2461 tgcaactggg ggcttactct gaggaaacct aggaaatctc ggtgcactag gaagtgaatc 2521 ccgcaggaca gctgcactca gggatacgcc acacaccatg gcctgcaacc ccagggtcaa 2581 gggtgaagga aagcaagctc accgcctgaa cacggagatt gtctttctgc cacagaacag 2641 cagcagacgt gtcgggaggt tagctgcgga aagaaatcgg gatgccgcgg agcacagagt 2701 gatttggaac tccattccac ctgaccctgt gttgacaatc caggaaaaaa aacaaacccc 2761 actcagaaac agagaaaact ggggtcgcga agaaatcaca gccggaagat ttgatgcatt 2821 cagattctcg tgtaacactt gttgcttggc aacagtactg gttgggctga ccagtaatga 2881 gaaaaaggtc aaaggctatg cgatatgaat ttcagaaatg gactgaaaat ggagagctat 2941 gtaacagata cactacagta gaagacttac ttctgaaatg aagggaaaaa aaccacccca 3001 tcgttcccta ctcctcc // LOCUS HSU32645 4190 bp mRNA PRI 06-MAY-1997 DEFINITION Human myeloid elf-1 like factor (MEF) mRNA, complete cds. ACCESSION U32645 NID g1761934 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4190) AUTHORS Miyazaki,Y., Sun,X., Uchida,H., Zhang,J. and Nimer,S. TITLE MEF, a novel transcription factor with an Elf-1 like DNA binding domain but distinct transcriptional activating properties JOURNAL Oncogene 13 (8), 1721-1729 (1996) MEDLINE 97050779 REFERENCE 2 (bases 1 to 4190) AUTHORS Nimer,S.D., Miyazaki,Y. and Sun,X. TITLE Direct Submission JOURNAL Submitted (29-JUL-1995) Stephen D. Nimer, Department of Medicine, Memorial Sloan-Kettering Cancer Center, New York, NY 10021, USA FEATURES Location/Qualifiers source 1..4190 /organism="Homo sapiens" /db_xref="taxon:9606" gene 383..2374 /gene="MEF" CDS 383..2374 /gene="MEF" /codon_start=1 /product="myeloid elf-1 like factor" /db_xref="PID:g1761935" /translation="MAITLQPSDLIFEFASNGMDDDIHQLEDPSVFPAVIVEQVPYPD LLHLYSGLELDDVHNGIITDGTLCMTQDQILEGSFLLTDDNEATSHTMSTAEVLLNME SPSDILDEKQIFSTSEMLPDSDPAPAVTLPNYLFPASEPDALNRAGDTSDQEGHSLEE KASREESAKKTGKSKKRIRKTKGNRSTSPVTDPSIPIRKKSKDGKGSTIYLWEFLLAL LQDRNTCPKYIKWTQREKGIFKLVDSKAVSKLWGKQKNKPDMNYETMGRALRYYYQRG ILAKVEGQRLVYQFKEMPKDLVVIEDEDESSEATAAPPQASTASVASASTTRRTSSRV SSRSAPQGKGSSSWEKPKIQHVGLQPSASLELGPSLDEEIPTTSTMLVSPAEGQVKLT KAVSASSVPSNIHLGVAPVGSGSALTLQTIPLTTVLTNGPPASTTAPTQLVLQSVPAA STFKDTFTLQASFPLNASFQDSQVAAPGAPLILSGLPQLLAGANRPTNPAPPTVTGAG PAGPSSQPPGTVIAAFIRTSGTTAAPRVKEGPLRSSSYVQGMVTGAPMEGLLVPEETL RELLRDQAHLQPLPTQVVSRGSHNPSLLGNQTLSPPSRPTVGLTPVAELELSSGSGSL LMAEPSVTTSGSLLTRSPTPAPFSPFNPTSLIKMEPHDI" BASE COUNT 925 a 1252 c 1081 g 932 t ORIGIN 1 gaattccctt tcgccggcgc cgagttcctg gcgccgctcg cccggcccgg cttccgaggg 61 gagaggacgg gctggcgggg ctggggaccc gcgtctcggc ccccggagcg gggaccacgg 121 agacagaccc cggcccggcg accgagctgg gcccgtgagc cactcggcct caggtcgctc 181 ctgtggttgg tccagcccag aatgcagcct tgagcctggc ttaggccacc acctactcca 241 gctctctcca ccccctattt tactgcagct cagggggtag gctctaggct ccaaagtacc 301 tgggtattgt cccttcatca agaaagcccc acagctctgg agggctctga taatcccgtt 361 gtcagctctc tgaaaagaca gcatggctat taccctacag cccagtgacc tgatctttga 421 gttcgcaagc aacgggatgg atgatgatat ccaccagctg gaagacccct ctgtgttccc 481 agctgtgatc gtggagcagg taccctaccc tgatttactg catctgtact cgggactgga 541 gttggacgac gttcacaatg gcatcataac agacgggacc ttgtgcatga cccaggatca 601 gatcctggaa ggcagttttt tgctgacaga tgacaatgag gccacctcgc acaccatgtc 661 aaccgcggaa gtcttactca atatggagtc tcccagcgat atcctggatg agaagcagat 721 cttcagtacc tccgaaatgc ttccagactc ggaccctgca ccagctgtca ctctgcccaa 781 ctacctgttt cctgcctctg agcccgatgc cctgaacagg gcgggtgaca ctagtgacca 841 ggaggggcat tctctggagg agaaggcctc cagagaggaa agtgccaaga agactgggaa 901 atcaaagaag agaatccgga agaccaaggg caaccgaagt acctcacctg tcactgaccc 961 cagcatcccc attaggaaga aatcaaagga tggcaaaggc agcaccatct atctgtggga 1021 gttcctcctg gctcttctgc aagacagaaa cacctgtccc aagtacatca agtggaccca 1081 gcgagagaaa ggcatcttca aactggtgga ctccaaagct gtgtccaagc tgtgggggaa 1141 gcagaaaaac aagcctgaca tgaactatga gacaatgggg cgggcactaa gatactacta 1201 ccaaagaggc atactggcca aagtggaagg gcagaggctg gtgtaccagt ttaaggagat 1261 gcccaaggac ctggtggtca ttgaagatga ggatgagagc agcgaagcca cagcagcccc 1321 acctcaggcc tccacggcct ctgtggcctc tgccagtacc acccggcgaa ccagctccag 1381 ggtctcatcc agatctgccc cccagggcaa gggcagctct tcttgggaga agccaaaaat 1441 tcagcatgtc ggtctccagc catctgcgag tctggaattg ggaccgtcgc tagacgagga 1501 gatccccact acctccacca tgctcgtctc tccagcagag ggccaggtca agctcaccaa 1561 agctgtgagt gcatcttcag tgcccagcaa catccaccta ggagtggccc ccgtggggtc 1621 gggctcggcc ctgaccctgc agacgatccc actgaccacg gtgctgacca atgggcctcc 1681 tgccagtact actgctccca ctcagctcgt tctccagagt gttccagcgg cctctacttt 1741 caaggacacc ttcactttgc aggcctcttt ccccctgaac gccagtttcc aagacagcca 1801 ggtggcagcc ccaggggctc cactgattct cagtggcctc ccccaacttc tggctggggc 1861 caaccgtccg accaacccgg cgccacccac ggtcacaggg gctggaccag cagggcccag 1921 ctctcagccc cctgggactg tcattgctgc cttcatcagg acttctggca ctacagcagc 1981 ccctagggtc aaggaggggc cactgaggtc ctcctcctat gttcagggta tggtgacggg 2041 ggcccccatg gaggggctgc tggttcctga agagaccctg agggagctcc tgagagatca 2101 ggctcatctt cagccacttc caacccaggt ggtttccagg ggttcccaca atccgagcct 2161 tctgggcaac cagactttgt ctcctcccag ccgccccact gttgggctga ccccagtggc 2221 tgaacttgag ctctcctcag gctcagggtc cctgctgatg gctgagccta gtgtgaccac 2281 atctgggagc cttctgacaa gatcccccac cccagcccct ttctccccat tcaaccctac 2341 ttccctcatt aagatggagc cccatgacat ataagcaaag gggtcagggc aagtgtgacc 2401 caccaggcaa aattgagcag cattttcata gggaccgact tcagtagcac acctgcccct 2461 gcatttcagt gggatgtcaa tacacttgac cccaagtccc ccggccctgc ctggtgtcac 2521 tgtggccaaa cagtgcccag cttaagcatc cctggcatca gactatggcc ttcaagagca 2581 ctagggcata tgcttttggc agcataacgg gctgacttgg tgatggaggg aaaaagcctt 2641 gagccaggca gaagtttgtg gccagggttt gtgcagcagc tttgtgagaa gagcccttct 2701 acctggctct atctcactgg ctgcattccc tacacaggga atttactacc ctatatgtga 2761 atatcccctg tatgtacttg tgtgtacttg ttggtctgta tcttagtttc tttggggagg 2821 acagggctgt agctgtgagg tcttgtctcc aagggtgtgt gtatgtctcc gtggatcagc 2881 cacagggata gggattttgt ttttaaggga aagcattctc taattccctt tgttcatgcc 2941 gagattcagt tgctctgaga ctatggggta caagtttgat cctccgaatc tggagatgtt 3001 gtagagctgg aacgagtgca gagtaggaac gctttgatgc gcatgcacat tggggaagat 3061 gcgctcctca gggacacaaa ggccgagtgg ggtaaaacca cgaagggagg gaagggaagt 3121 cagctctggg agcagccctc actggctgga ccaaggtact cttcctggag tttgccgtgt 3181 tagcaaccac agtcaccttg cagtcaggct ggaatcttgg gccaccccac agtgctttgc 3241 tgtaggattt agacggggat gaagtgccct ccagcctcag agctagccac aaagccccca 3301 gagctgaatt cattgagtat ttgtgcctag ggcttgggct gtttgtgtga taccggcccc 3361 ccgacagaca ataggctgtg atgacacccc agtctacttc cccgatcctg ggctccctct 3421 tgattagtag gtgacatttt ccactgtcag gcatcactgg ggctagtccg gcagcgacct 3481 agatggggtc cacccccatt cctgctcaag catgggcacc taccacatgg tttctgctgc 3541 tcagcctgac tgcaactcac ctcgaaggcg gaccagcctg cctctgtgat gactgcagaa 3601 gacctccttg ggtgtaccaa tgcccctcat ctcccacttt cacacctaac cctgactcct 3661 tcaccaagaa gacgggagtc ggcagccagg agttcccgtg gcacctctct ctcttcgtgg 3721 ctccctgctt cccccttccc tctttccgag gaagggtcaa cctattctct ctcaaaacca 3781 acccctaggc caattgcctg gatctcctcc cctctccctt ctttaaacga gcttgcctcc 3841 ctcctgccaa gtttgagggc aaggctaaga aatgtcagcc acggaaacaa ctctaatatc 3901 tggtgacttt gggtaatgtg aatcagtgcc tgaggacctt tgctgtgtcc ttggtacaga 3961 accatccact tgacctaact acctcccctg gccgcgctct cgctcttctc ttctttgtta 4021 agccaacaac tatcaccctc tcctactctt ctttctccct gccccctgga gggcactgtg 4081 tttggttgtg caaatgtatt tactatgcgt gtttccagca gttggcatta aagtgccttt 4141 ttctaataaa atcagtttat tatgaccaaa aaaaaaaaaa aaaggaattc // LOCUS HSU32659 1874 bp mRNA PRI 17-JAN-1996 DEFINITION Human IL-17 mRNA, complete cds. ACCESSION U32659 NID g1155222 KEYWORDS cytokine; CTLA8, IL-17. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1874) AUTHORS Yao,Z., Painter,S.L., Fanslow,W.C., Ulrich,D., Macduff,B.M., Spriggs,M.K. and Armitage,R.J. TITLE Human IL-17: a novel cytokine derived from T cells JOURNAL J. Immunol. 155 (12), 5483-5486 (1995) MEDLINE 96094436 REFERENCE 2 (bases 1 to 1874) AUTHORS Yao,Z., Painter,S., Fanslow,W., Macduff,B., Ulrich,D., Spriggs,M. and Armitage,R. TITLE Direct Submission JOURNAL Submitted (28-JUL-1995) Zhengbin Yao, Molecular Biology, Immunex R & D, University Street, Seattle, WA 98101, USA FEATURES Location/Qualifiers source 1..1874 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="T cell clone 22" CDS 54..521 /note="interleukin 17" /codon_start=1 /product="IL-17" /db_xref="PID:g1155223" /translation="MTPGKTSLVSLLLLLSLEAIVKAGITIPRNPGCPNSEDKNFPRT VMVNLNIHNRNTNTNPKRSSDYYNRSTSPWNLHRNEDPERYPSVIWEAKCRHLGCINA DGNVDYHMNSVPIQQEILVLRREPPHCPNSFRLEKILVSVGCTCVTPIVHHVA" BASE COUNT 575 a 418 c 356 g 525 t ORIGIN 1 gaattccggc aggcacaaac tcatccatcc ccagttgatt ggaagaaaca acgatgactc 61 ctgggaagac ctcattggtg tcactgctac tgctgctgag cctggaggcc atagtgaagg 121 caggaatcac aatcccacga aatccaggat gcccaaattc tgaggacaag aacttccccc 181 ggactgtgat ggtcaacctg aacatccata accggaatac caataccaat cccaaaaggt 241 cctcagatta ctacaaccga tccacctcac cttggaatct ccaccgcaat gaggaccctg 301 agagatatcc ctctgtgatc tgggaggcaa agtgccgcca cttgggctgc atcaacgctg 361 atgggaacgt ggactaccac atgaactctg tccccatcca gcaagagatc ctggtcctgc 421 gcagggagcc tccacactgc cccaactcct tccggctgga gaagatactg gtgtccgtgg 481 gctgcacctg tgtcaccccg attgtccacc atgtggccta agagctctgg ggagcccaca 541 ctccccaaag cagttagact atggagagcc gacccagccc ctcaggaacc ctcatccttc 601 aaagacagcc tcatttcgga ctaaactcat tagagttctt aaggcagttt gtccaattaa 661 agcttcagag gtaacacttg gccaagatat gagatctgaa ttacctttcc ctctttccaa 721 gaaggaaggt ttgactgagt accaatttgc ttcttgttta cttttttaag ggctttaagt 781 tatttatgta tttaatatgc cctgagataa ctttggggta taagattcca ttttaatgaa 841 ttacctactt tattttgttt gtctttttaa agaagataag attctgggct tgggaatttt 901 attatttaaa aggtaaaacc tgtatttatt tgagctattt aaggatctat ttatgtttaa 961 gtatttagaa aaaggtgaaa aagcactatt atcagttctg cctaggtaaa tgtaagatag 1021 aattaaatgg cagtgcaaaa tttctgagtc tttacaacat acggatatag tatttcctcc 1081 tctttgtttt taaaagttat aacatggctg aaaagaaaga ttaaacctac tttcatatgt 1141 attaatttaa attttgcaat ttgttgaggt tttacaagag atacagcaag tctaactctc 1201 tgttccatta aacccttata ataaaatcct tctgtaataa taaagtttca aaagaaaatg 1261 tttatttgtt ctcattaaat gtattttagc aaactcagct cttccctatt gggaagagtt 1321 atgcaaattc tcctataagc aaaacaaagc atgtctttga gtaacaatga cctggaaata 1381 cccaaaattc caagttctcg atttcacatg ccttcaagac tgaacaccga ctaaggtttt 1441 catactatta gccaatgctg tagacagaag cattttgata ggaatagagc aaataagata 1501 atggccctga ggaatggcat gtcattatta aagatcatat ggggaaaatg aaaccctccc 1561 caaaatacaa gaagttctgg gaggagacat tgtcttcaga ctacaatgtc cagtttctcc 1621 cctagactca ggcttccttt ggagattaag gcccctcaga gatcaacaga ccaacatttt 1681 tctcttcctc aagcaacact cctagggcct ggcttctgtc tgatcaaggc accacacaac 1741 ccagaaagga gctgatgggg cagaatgaac tttaagtatg agaaaagttc agcccaagta 1801 aaataaaaac tcaatcacat tcaattccag agtagtttca agtttcacat cgtaaccatt 1861 ttcgcccgga attc // LOCUS HSU32672 1535 bp DNA PRI 05-JUN-1996 DEFINITION Human orphan receptor GPR10 (GPR10) gene, complete cds. ACCESSION U32672 NID g1002738 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1535) AUTHORS Marchese,A., Heiber,M., Nguyen,T., Heng,H.H.Q., Saldivia,V.R., Cheng,R., Murphy,P.M., Tsui,L.-C., Shi,X., George,S.R., O'Dowd,B.F. and Docherty,J.M. TITLE Cloning and chromosomal mapping of three novel genes, GPR9, GPR10, and GPR14, encoding receptors related to interleukin 8, neuropeptide Y, and somatostatin receptors JOURNAL Genomics 29 (2), 335-344 (1995) MEDLINE 96115583 REFERENCE 2 (bases 1 to 1535) AUTHORS Marchese,A., Heiber,M., Nguyen,T., Heng,H.H.Q., Saldivia,V.R., Cheng,R., Murphy,P.M., Tsui,L.-C., Shi,X., George,S.R., O'Dowd,B.F. and Docherty,J.M. TITLE Direct Submission JOURNAL Submitted (31-JUL-1995) B.F. O'Dowd, Department of Pharmacology, University of Toronto, 8 Taddle Creek Rd., Toronto, Ontario M5S 1A8, Canada FEATURES Location/Qualifiers source 1..1535 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="10" /map="10q25.3-q26" gene 308..1417 /gene="GPR10" CDS 308..1417 /gene="GPR10" /note="orphan receptor; G protein-coupled receptor" /codon_start=1 /product="GPR10" /db_xref="PID:g1002739" /translation="MASSTTRGPRVSDLFSGLPPAVTTPANQSAEASAGNGSVAGADA PAVTPFQSLQLVHQLKGLIVLLYSVVVVVGLVGNCLLVLVIARVPRLHNVTNFLIGNL ALSDVLMCTACVPLTLAYAFEPRGWVFGGGLCHLVFFLQPVTVYVSVFTLTTIAVDRY VVLVHPLRRASRCASAYAVLAIWALSAVLALPPAVHTYHVELKPHDVRLCEEFWGSQE RQRQLYAWGLLLVTYLLPLLVILLSYVRVSVKLRNRVVPGCVTQSQADWDRARRRRTF CLLVVVVVVFAVCWLPLHVFNLLRDLDPHAIDPYAFGLVQLLCHWLAMSSACYNPFIY AWLHDSFREELRKLLVAWPRKIAPHGQNMTVSVVI" BASE COUNT 228 a 493 c 479 g 335 t ORIGIN 1 gcgagtgctt tcccgctctc caaaccccac tcccaggtcg gatcgcgctc ctgagtctgc 61 ctgcgtggac tgcgaggacc gtaaatagag gcggaagcgt ttaaataaac cgtatgttct 121 aaccgtgctt aggtttattt tacagaggtg ataggacaac actttttgct acttttgctg 181 ttgttctgtg gccggttatt cccaagggag gactcaggcg atgggggagg ggcgcggctg 241 gtgtaggagg tcggatggag tatggcaagg gaagtgacgg actttgatta cctttgaaca 301 ggtggccatg gcctcatcga ccactcgggg ccccagggtt tctgacttat tttctgggct 361 gccgccggcg gtcacaactc ccgccaacca gagcgcagag gcctcggcgg gcaacgggtc 421 ggtggctggc gcggacgctc cagccgtcac gcccttccag agcctgcagc tggtgcatca 481 gctgaagggg ctgatcgtgc tgctctacag cgtcgtggtg gtcgtggggc tggtgggcaa 541 ctgcctgctg gtgctggtga tcgcgcgggt gccgcggctg cacaacgtga cgaacttcct 601 catcggcaac ctggccttgt ccgacgtgct catgtgcacc gcctgcgtgc cgctcacgct 661 ggcctatgcc ttcgagccac gcggctgggt gttcggcggc ggcctgtgcc acctggtctt 721 cttcctgcag ccggtcaccg tctatgtgtc ggtgttcacg ctcaccacca tcgcagtgga 781 ccgctacgtc gtgctggtgc acccgctgag gcgcgcatct cgctgcgcct cagcctacgc 841 tgtgctggcc atctgggcgc tgtccgcggt gctggcgctg ccgcccgccg tgcacaccta 901 tcacgtggag ctcaagccgc acgacgtgcg cctctgcgag gagttctggg gctcccagga 961 gcgccagcgc cagctctacg cctgggggct gctgctggtc acctacctgc tccctctgct 1021 ggtcatcctc ctgtcttacg tccgggtgtc agtgaagctc cgcaaccgcg tggtgccggg 1081 ctgcgtgacc cagagccagg ccgactggga ccgcgctcgg cgccggcgca ccttctgctt 1141 gctggtggtg gtcgtggtgg tgttcgccgt ctgctggctg ccgctgcacg tcttcaacct 1201 gctgcgggac ctcgaccccc acgccatcga cccttacgcc tttgggctgg tgcagctgct 1261 ctgccactgg ctcgccatga gttcggcctg ctacaacccc ttcatctacg cctggctgca 1321 cgacagcttc cgcgaggagc tgcgcaaact gttggtcgct tggccccgca agatagcccc 1381 ccatggccag aatatgaccg tcagcgtggt catctgatgc cacttagcca ggccttggtc 1441 aaggagctcc acttcaactg gcctcctagg gcaccactcg aggtcaatct ggtgcttatt 1501 cttcagcacc agagctagct aagccaacat agggc // LOCUS HSU32849 1426 bp mRNA PRI 07-NOV-1996 DEFINITION Human Hou mRNA, complete cds. ACCESSION U32849 NID g1322219 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1426) AUTHORS Bao,J. and Zervos,A.S. TITLE Isolation and characterization of Nmi, a novel partner of Myc proteins JOURNAL Oncogene 12 (10), 2171-2176 (1996) MEDLINE 96218666 REFERENCE 2 (bases 1 to 1426) AUTHORS Bao,J. TITLE Direct Submission JOURNAL Submitted (01-AUG-1995) Cutaneous Biology Research Center, Massachusetts General Hospital, Charlestown, MA 02129, USA FEATURES Location/Qualifiers source 1..1426 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="22 weeks" /tissue_type="brain" /chromosome="22" /map="22q13.3" gene 281..1204 /gene="Hou" CDS 281..1204 /gene="Hou" /note="interacts with N-myc, C-myc, Max, and other transcription factors carrying leucine zipper (Zip), helix-loop-helix (HLH) or HLH-Zip domains" /codon_start=1 /db_xref="PID:g1322220" /translation="MEADKDDTQQILKEHSPDEFIKDEQNKGLIDEITKKNIQLKKEI QKLETELQEATKEFQIKEDIPETKMKFLSVETPENDSQLSNISCSFQVSSKVPYEIQK GQALITFEKEEVAQNVVSMSKHHVQIKDVNLEVTAKPVPLNSGVRFQVYVEVSKMKIN VTEIPDTLREDQMRDKLELSFSKFRNGGGEVDRVDYDRQSGSAVITFVEIGVADKILK KKEYPLYINQTCHRVTVSPYTEIHLKKYQIFSGTSKRTVLLTGMEGIQMDEEIVEDLI NIHFQRAKNGGGEVDVVKCSLGQPHIAYFEE" BASE COUNT 462 a 266 c 322 g 376 t ORIGIN 1 ggatccgcgt gctaaagaaa aatcgccgtt aaagcagttt tctttttcac tgtctttttc 61 ttttcgcggg gaacccagct gttcctgcga gggccacctc ctcaggaaga ccccgcagct 121 ctcccgcggc gcttctgcag gaggcagcga cagtttcgag aacccgggcc ttcccctccc 181 agtgcctccg ggggttcgcg tttcaggcgc tcgtgttttc cgggaagggc aggcgcgctg 241 ggccttgggg agctgcgctc ggcgggcgga ccgggggatc atggaagctg ataaagatga 301 cacacaacaa attcttaagg agcattcgcc agatgaattt ataaaagatg aacaaaataa 361 gggactaatt gatgaaatta caaagaaaaa tattcagcta aagaaggaga tccaaaagct 421 tgaaacggag ttacaagagg ctaccaaaga attccagatt aaagaggata ttcctgaaac 481 aaagatgaaa ttcttatcag ttgaaactcc tgagaatgac agccagttgt caaatatctc 541 ctgttcgttt caagtgagct cgaaagttcc ttatgagata caaaaaggac aagcacttat 601 cacctttgaa aaagaagaag ttgctcaaaa tgtggtaagc atgagtaaac atcatgtaca 661 gataaaagat gtaaatctgg aggttacggc caagccagtt ccattaaatt caggagtcag 721 attccaggtt tatgtagaag tttctaaaat gaaaatcaat gttactgaaa ttcctgacac 781 actgcgtgaa gatcaaatga gagacaaact agagctgagc ttttcaaagt tccgaaatgg 841 aggcggagag gtggaccgcg tggactatga cagacagtcc gggagtgcag tcatcacgtt 901 tgtggagatt ggagtggctg acaagatttt gaaaaagaaa gaataccctc tttatataaa 961 tcaaacctgc catagagtta ctgtttctcc atacacagaa atacacttga aaaagtatca 1021 gatattttca ggaacatcta agaggacagt gcttctgaca ggaatggaag gcattcaaat 1081 ggatgaagaa attgtggagg atttaattaa cattcacttt caacgggcaa agaatggagg 1141 tggagaagta gatgtggtca agtgttctct aggtcaacct cacatagcat actttgaaga 1201 atagacttaa cagaatcatg aaaactatag ctttttaacc cggattactg taaatgtttg 1261 acaagaatga atatgctttt ccttaaaaaa tgaaaacttt aatttttacc atccatttat 1321 gtttagatac aaaacttatt tccatgtttc tgaatcttct ttgtttcaaa tggtgctgca 1381 tgttttcaac tacaataagt gcactgtaat aaaaagtttt gtttat // LOCUS HSU32907 2002 bp mRNA PRI 30-JAN-1997 DEFINITION Human p37NB mRNA, complete cds. ACCESSION U32907 NID g1236328 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2002) AUTHORS Kim,D., LaQuaglia,M.P. and Yang,S.Y. TITLE A cDNA encoding a putative 37 kDa leucine-rich repeat (LRR) protein, p37NB, isolated from S-type neuroblastoma cell has a differential tissue distribution JOURNAL Biochim. Biophys. Acta 1309 (3), 183-188 (1996) MEDLINE 97136875 REFERENCE 2 (bases 1 to 2002) AUTHORS Kim,D., LaQuaglia,M. and Yang,S.Y. TITLE Direct Submission JOURNAL Submitted (01-AUG-1995) Biochemical Immunogenetics, Memorial Sloan-Kettering Cancer Center, 1275 York Ave., #K425, New York, NY 10021, USA FEATURES Location/Qualifiers source 1..2002 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="neuroblastoma" /cell_line="LA1-5S" CDS 282..1223 /codon_start=1 /product="p37NB" /db_xref="PID:g1236329" /translation="MRVVTIVILLCFCKAAELRKASPGSVRSRVNHGRAGGGRRGSNP VKRYAPGLPCDVYTYLHEKYLDCQERKLVYVLPGWPQDLLHMLLARNKIRTLKNNMFS KFKKLKSLDLQQNEISKIESEAFFGLNKLTTLLLQHNQIKVLTEEVFIYTPLLSYLRL YDNPWHCTCEIETLISMLQIPRNRNLANYAKCESPQEQKNKKLRQIKSEQLCNEEEKE QLDPKPQVSGRPPVIKPEVDSTFCHNYVFPIQTLDCKRKELKKVPNNIPPDIVKLDLS YNKINQLRPKEFEDVHELKKLNLSSNGIEFIDPGSLR" repeat_unit 525..770 /note="leucine-rich repeat, LRR-1" repeat_unit 1086..1220 /note="leucine-rich repeat, LRR-2" polyA_site 2002 /note="15 A nucleotides" BASE COUNT 637 a 407 c 429 g 529 t ORIGIN 1 tataacgtga gggctgaatg cagcccattc tctggagaac ttcctcacac accgcagcaa 61 agagaagact gaaagacaaa cctgggtgca gccagagagg tccagataga tgagcttgtg 121 gcatccattc cccaagttca gcctagggac tccacgtacc ccagctgggt ctcattgttc 181 cagaactgca ttagttaaga ttacccagac ttggatttca aaggaatact ttcattgttc 241 cgtctgtaac acgaagtaat tggggccagc tggatgtcag gatgcgtgtg gttaccattg 301 taatcttgct ctgcttttgc aaagcggctg agctgcgcaa agcaagccca ggcagtgtga 361 gaagccgagt gaatcatggc cgggcgggtg gaggccggag aggctccaac ccggtcaaac 421 gctacgcacc aggcctcccg tgtgacgtgt acacatatct ccatgagaaa tacttagatt 481 gtcaagaaag aaaattagtt tatgtgctgc ctggttggcc tcaggatttg ctgcacatgc 541 tgctagcaag aaacaagatc cgcacattga agaacaacat gttttccaag tttaaaaagc 601 tgaaaagcct ggatctgcag cagaatgaga tctctaaaat tgagagtgag gcgttctttg 661 gtttaaacaa actcaccacc ctcttactgc agcacaacca gatcaaagtc ttgacggagg 721 aagtgttcat ttacacacct ctcttgagct acctgcgtct ttatgacaac ccctggcact 781 gtacttgtga gatagaaacg cttatttcaa tgttgcagat tcccaggaac cggaatttgg 841 cgaactacgc caagtgtgaa agtccacaag aacaaaaaaa taaaaaactg cggcagataa 901 aatctgaaca gttgtgtaat gaagaagaaa aggaacaatt ggacccgaaa ccccaagtgt 961 cagggagacc cccagtcatc aagcctgagg tggactcaac tttttgccac aattatgtgt 1021 ttcccataca aacactggac tgcaaaagga aagagttgaa aaaagtgcca aacaacatcc 1081 ctccagatat tgttaaactt gacttgtcat acaataaaat caaccaactt cgacccaagg 1141 aatttgaaga tgttcatgag ctgaagaaat taaacctcag cagcaatggc attgaattca 1201 tcgatcctgg gtctttgaga tgaaaccctg caagtagact tacgtgaatg atttttgctg 1261 tgccgctttt ttagggctca cacatttaga agaattagat ttatcaaaca acagtctgca 1321 aaactttgac tatggcgtat tagaagactt gtattttttg aaactcttgt ggctcagaga 1381 taacccttgg agatgtgact acaacattca ctacctctac tactggttaa agcaccacta 1441 caatgtccat tttaatggcc tggaatgcaa aacgcctgaa gaatacaaag gatggtctgt 1501 gggaaaatat attagaagtt actatgaaga atgccccaaa gacaagttac cagcatatcc 1561 tgagtcattt gaccaagaca cagaagatga tgaatgggaa aaaaaacata gagatcacac 1621 cgcaaagaag caaagcgtaa taattactat agtaggataa ggtagaaatt gttctgattg 1681 taattagttt tgtattttct atactggtgt tagaaaacat atgtttacat ttgattaact 1741 gtgttgccta tttatgcagg gtaatccagc taaaggaagc tttctttaat tataagtatt 1801 attgtgacta ttatagtaat caagagaatg ctatcatcct gcttgcctgt ccatttgtgg 1861 aacagcatct ggtgatatgc aattccacac tggtaacctg cagcagttgg gtcctaatga 1921 tggcattaga ctttcataat gtcctgtata aatgttttta ctgcttttag aaaataaaga 1981 aaaaaaactt ggttcatgtt ta // LOCUS HSU32944 643 bp mRNA PRI 23-JUL-1996 DEFINITION Human cytoplasmic dynein light chain 1 (hdlc1) mRNA, complete cds. ACCESSION U32944 NID g1209060 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 643) AUTHORS Dick,T., Ray,K., Salz,H.K. and Chia,W. TITLE Cytoplasmic dynein (ddlc1) mutations cause morphogenetic defects and apoptotic cell death in Drosophila melanogaster JOURNAL Mol. Cell. Biol. 16 (5), 1966-1977 (1996) MEDLINE 96189078 REFERENCE 2 (bases 1 to 643) AUTHORS Dick,T., Ray,K., Salz,H. and Chia,W. TITLE Direct Submission JOURNAL Submitted (02-AUG-1995) Institute of Molecular and Cell Biology, National University of Singapore, 10 Kent Ridge Crescent, Singapore 0511, Republic of Singapore FEATURES Location/Qualifiers source 1..643 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HUT-78 T cell library" /chromosome="14" /map="14q24" gene 94..363 /gene="hdlc1" CDS 94..363 /gene="hdlc1" /note="Chlamydomonas 8 kDa outer arm dynein light chain homolog" /codon_start=1 /product="cytoplasmic dynein light chain 1" /db_xref="PID:g1209061" /translation="MCDRKAVIKNADMSEEMQQDSVECATQALEKYNIEKDIAAHIKK EFDKKYNPTWHCIVGRNFGSYVTHETKHFIYFYLGQVAILLFKSG" BASE COUNT 179 a 156 c 143 g 165 t ORIGIN 1 ggtagcgacg gtagctctag ccgggcctga gctgtgctag cacctccccc aggagaccgt 61 tgcagtcggc cagccccctt ctccacggta accatgtgcg accgaaaggc cgtgatcaaa 121 aatgcggaca tgtcggaaga gatgcaacag gactcggtgg agtgcgctac tcaggcgctg 181 gagaaataca acatagagaa ggacattgcg gctcatatca agaaggaatt tgacaagaag 241 tacaatccca cctggcattg catcgtgggg aggaacttcg gtagttatgt gacacatgaa 301 accaaacact tcatctactt ctacctgggc caagtggcca ttcttctgtt caaatctggt 361 taaaagcatg gactgtgcca cacacccagt gatccatcca gaaacaagga ctgcagccta 421 aattccaaat accagagact gaaattttca gccttgctaa gggaacatct cgatgtttga 481 acctttgttg tgttttgtac agggcattct ctgtactagt ttgtcgtggt tataaaacaa 541 ttagcagaat agcctacatt tgtatttatt ttctattcca tacttctgcc cacgttgttt 601 tctctcaaaa tccattcctt taaaaaataa atctgatgca ccg // LOCUS HSU32989 1712 bp mRNA PRI 20-SEP-1996 DEFINITION Human tryptophan oxygenase (TDO) mRNA, complete cds. ACCESSION U32989 NID g993045 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1712) AUTHORS Comings,D.E., Muhleman,D., Dietz,G., Sherman,M. and Forest,G.L. TITLE Sequence of human tryptophan 2,3-dioxygenase (TDO2): presence of a glucocorticoid response-like element composed of a GTT repeat and an intronic CCCCT repeat JOURNAL Genomics 29 (2), 390-396 (1995) MEDLINE 96115589 REFERENCE 2 (bases 1 to 1712) AUTHORS Comings,D.E. TITLE Direct Submission JOURNAL Submitted (01-AUG-1995) David E. Comings, City of Hope, Medical Genetics, 1500 E. Duarte Rd., Duarte, CA 91010, USA FEATURES Location/Qualifiers source 1..1712 /organism="Homo sapiens" /db_xref="taxon:9606" /map="4q31" /chromosome="4" /clone="pHTO3" /tissue_type="brain" gene 65..1285 /gene="TDO" CDS 65..1285 /gene="TDO" /codon_start=1 /product="tryptophan oxygenase" /db_xref="PID:g993046" /translation="MSGCPFLGNNFGYTFKKLPVEGSEEDKSQTGVNRASKGGLIYGN YLHLEKVLNAQELQSETKGNKIHDEHLFIITHQAYELWFKQILWELDSVREIFQNGHV RDERNMLKVVSRMHRVSVILKLLVQQFSILETMTALDFNDFREYLSPASGFQSLQFRL LENKIGVLQNMRVPYNRRHYRDNFKGEENELLLKSEQEKTLLELVEAWLERTPGLEPH GFNFWGKLEKNITRGLEEEFIRIQAKEESEEKEEQVAEFQKQKEVLLSLFDEKRHEHL LSKGERRLSYRALQGALMIYFYREEPRFQVPFQLLTSLMDIDSLMTKWRYNHVCMVHR MLGSKAGTGGSSGYHYLRSTVSDRYKVFVDLFNLSTYLIPRHWIPKMNPTIHKFLYTA EYCDSSYFSSDESD" BASE COUNT 576 a 306 c 340 g 490 t ORIGIN 1 aaggtcaatg atagcatctg cctagagtca aacctccgtg cttctcagac agtgcctttt 61 caccatgagt gggtgcccat ttttaggaaa caactttgga tatactttta aaaaactccc 121 cgtagaaggc agcgaagaag acaaatcaca aactggtgtg aatagagcca gcaaaggagg 181 tcttatctat gggaactacc tgcatttgga aaaagttttg aatgcacaag aactgcaaag 241 tgaaacaaaa ggaaataaaa tccatgatga acatcttttt atcataactc atcaagctta 301 tgaactctgg tttaagcaaa tcctctggga gttggattct gttcgagaga tctttcagaa 361 tggccatgtc agagatgaaa ggaacatgct taaggttgtt tctcggatgc accgagtgtc 421 agtgatcctg aaactgctgg tgcagcagtt ttccattctg gagacgatga cagccttgga 481 cttcaatgac ttcagagagt acttatctcc agcatcaggc ttccagagtt tgcaattccg 541 actattagaa aacaagatag gtgttcttca gaacatgaga gtcccttata acagaagaca 601 ttatcgtgat aacttcaaag gagaagaaaa tgaactgcta cttaaatctg agcaggaaaa 661 gacacttctg gaattagtgg aggcatggct ggaaagaact ccaggtttag agccacatgg 721 atttaacttc tggggaaagc ttgaaaaaaa tatcaccaga ggcctggaag aggaattcat 781 aaggattcag gctaaagaag agtctgaaga aaaagaggaa caggtggctg aatttcagaa 841 gcaaaaagag gtgctactgt ccttatttga tgagaaacgt catgaacatc tccttagtaa 901 aggtgaaaga cggctgtcat acagagcact tcagggagca ttgatgatat atttttacag 961 ggaagagcct aggttccagg tgccttttca gttgctgact tctcttatgg acatagattc 1021 actgatgacc aaatggagat ataaccatgt gtgcatggtg cacagaatgc tgggcagcaa 1081 agctggcacc ggtggttcct caggctatca ctacctgcga tcaactgtga gtgataggta 1141 caaggtattt gtagatttat ttaatctttc aacatacctg attccccgac actggatacc 1201 gaagatgaac ccaaccattc acaaatttct atatacagca gaatactgtg atagctccta 1261 cttcagcagt gatgaatcag attaaaatcg tctgcaaaat ctatgaagaa tactggtttc 1321 acagcctatt ttttattttc tatggatttt cataaataca gtttgaatat atgtatgcat 1381 atattgttca gcaccacgat gctctgattt aattctagaa acaatttgat tacctcttgt 1441 ttgtgacaag actaagcatt aagatgagaa agaatacatt taaatagtaa cattgtacat 1501 agggtgtttt cctattaaaa atcagtttcc cctgagactt aatgtaacca cttaatgtaa 1561 tcactatctc attgtttcat ctttataaac ttgtaaactt catctatttc aaatatttta 1621 tgcagtacat tatattattc tgtacaaagg ctttcaaaca aaatttttaa aataataaag 1681 tattaatctt tcaaaaaaaa aaaaaaaaaa aa // LOCUS HSU33017 1789 bp mRNA PRI 15-SEP-1995 DEFINITION Human signaling lymphocytic activation molecule (SLAM) mRNA, complete cds. ACCESSION U33017 NID g984968 KEYWORDS . SOURCE Homo sapiens. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 235) AUTHORS Cocks,B.G., Chang,C.C., Carballido,J.M., Yssel,H., de Vries,J.E. and Aversa,G. TITLE A novel receptor involved in T-cell activation JOURNAL Nature 376 (6537), 260-263 (1995) MEDLINE 95342241 REFERENCE 2 (bases 1 to 1789) AUTHORS Cocks,B.G. TITLE Direct Submission JOURNAL Submitted (02-AUG-1995) Benjamin G. Cocks, Human Immunology, DNAX Research Institute of Molecular and Cellular Biology, 901 California Ave, Palo Alto, CA 94304, USA FEATURES Location/Qualifiers source 1..1789 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pSURSLAM1" /cell_type="A10 CD8+ T-cells" gene 134..1141 /gene="SLAM" CDS 134..1141 /gene="SLAM" /codon_start=1 /product="signaling lymphocytic activation molecule" /db_xref="PID:g984969" /translation="MDPKGLLSLTFVLFLSLAFGASYGTGGRMMNCPKILRQLGSKVL LPLTYERINKSMNKSIHIVVTMAKSLENSVENKIVSLDPSEAGPPRYLGDRYKFYLEN LTLGIRESRKEDEGWYLMTLEKNVSVQRFCLQLRLYEQVSTPEIKVLNKTQENGTCTL ILGCTVEKGDHVAYSWSEKAGTHPLNPANSSHLLSLTLGPQHADNIYICTVSNPISNN SQTFSPWPGCRTDPSETKPWAVYAGLLGGVIMILIMVVILQLRRRGKTNHYQTTVEKK SLTIYAQVQKPGPLQKKLDSFPAQDPCTTIYVAATEPVPESVQETNSITVYASVTLPE S" BASE COUNT 517 a 440 c 412 g 420 t ORIGIN 1 ggttcaggaa cctgctggtt ctgatacata aatcagacag cctctgctgc atgacacgaa 61 gcttgcttct gcctggcatc tgtgagcagc tgccaggctc cggccaggat cccttccttc 121 tcctcattgg ctgatggatc ccaaggggct cctctccttg accttcgtgc tgtttctctc 181 cctggctttt ggggcaagct acggaacagg tgggcgcatg atgaactgcc caaagattct 241 ccggcagttg ggaagcaaag tgctgctgcc cctgacatat gaaaggataa ataagagcat 301 gaacaaaagc atccacattg tcgtcacaat ggcaaaatca ctggagaaca gtgtcgagaa 361 caaaatagtg tctcttgatc catccgaagc aggccctcca cgttatctag gagatcgcta 421 caagttttat ctggagaatc tcaccctggg gatacgggaa agcaggaagg aggatgaggg 481 atggtacctt atgaccctgg agaaaaatgt ttcagttcag cgcttttgcc tgcagttgag 541 gctttatgag caggtctcca ctccagaaat taaagtttta aacaagaccc aggagaacgg 601 gacctgcacc ttgatactgg gctgcacagt ggagaagggg gaccatgtgg cttacagctg 661 gagtgaaaag gcgggcaccc acccactgaa cccagccaac agctcccacc tcctgtccct 721 caccctcggc ccccagcatg ctgacaatat ctacatctgc accgtgagca accctatcag 781 caacaattcc cagaccttca gcccgtggcc cggatgcagg acagacccct cagaaacaaa 841 accatgggca gtgtatgctg ggctgttagg gggtgtcatc atgattctca tcatggtggt 901 aatactacag ttgagaagaa gaggtaaaac gaaccattac cagacaacag tggaaaaaaa 961 aagccttacg atctatgccc aagtccagaa accaggtcct cttcagaaga aacttgactc 1021 cttcccagct caggaccctt gcaccaccat atatgttgct gccacagagc ctgtcccaga 1081 gtctgtccag gaaacaaatt ccatcacagt ctatgctagt gtgacacttc cagagagctg 1141 acaccagaga ccaacaaagg gactttctga aggaaaatgg aaaaaccaaa atgaacactg 1201 aacttggcca caggcccaag tttcctctgg cagacatgct gcacgtctgt acccttctca 1261 gatcaactcc ctggtgatgt ttcttccaca tacatctgtg aaatgaacaa ggaagtgagg 1321 cttcccaaga atttagcttg ctgtgcagtg gctgcaggcg cagaacagag cgttacttga 1381 taacagcgtt ccatctttgt gttgtagcag atgaaatgga cagtaatgtg agttcagact 1441 ttgggcatct tgctcttggc tggaactgat aataaaaatc agactgaaag ccaggacatc 1501 tgagtaccta tctcacacac tgaccaccag tcacaaagtc tggaaaagtt tacattttgg 1561 ctatctttac tttgttctgg gagctgatca tgataacctg cagacctgat caagcctctg 1621 tgcctcagtt tctctctcag gataaagagt gaatagaggc cgaagggtga atttcttatt 1681 atacataaaa cactctgata ttattgtata aaggaagcta agaatattat tttatttgca 1741 aaacccagaa gctaaaaagt caataaacag aaagaatgat tttgagaaa // LOCUS HSU33052 3255 bp mRNA PRI 28-SEP-1995 DEFINITION Human lipid-activated, protein kinase PRK2 mRNA, complete cds. ACCESSION U33052 NID g1000124 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3255) AUTHORS Palmer,R.H., Ridden,J. and Parker,P.J. TITLE Identification of multiple, novel, protein kinase C-related gene products JOURNAL FEBS Lett. 356 (1), 5-8 (1994) MEDLINE 95080426 REFERENCE 2 (bases 1 to 3255) AUTHORS Palmer,R.H., Ridden,J. and Parker,P.J. TITLE Cloning and expression patterns of two members of a novel protein-kinase-C-related kinase family JOURNAL Eur. J. Biochem. 227 (1-2), 344-351 (1995) MEDLINE 95154310 REFERENCE 3 (bases 1 to 3255) AUTHORS Palmer,R.H. TITLE Direct Submission JOURNAL Submitted (02-AUG-1995) Ruth H. Palmer, Protein Phosphorylation, ICRF, 44 Lincoln's Inn Fields, London, WC2A 3PX, UK FEATURES Location/Qualifiers source 1..3255 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="cDNA library from human DX3 cell line (B-cell lineage)" CDS 10..2964 /note="lipid-activated, protein kinase C-related, serine/threonine protein kinase" /codon_start=1 /product="PRK2" /db_xref="PID:g1000125" /translation="MASNPERGEILLTELQGDSRSLPFSENVSAVQKLDFSDTMVQQK LDDIKDRIKREIRKELKIKEGAENLRKVTTDKKSLAYVDNILKKSNKKLEELHHKLQE LNAHIVVSDPEDITDCPRTPDTPNNDPRCSTSNNRLKALQKQLDIELKVKQGAENMIQ MYSNGSSKDRKLHGTAQQLLQDSKTKIEVIRMQILQAVQTNELAFDNAKPVISPLELR MEELRHHFRIEFAVAEGAKNVMKLLGSGKVTDRKALSEAQARFNESSQKLDLLKYSLE QRLNEVPKNHPKSRIIIEELSLVAASPTLSPRQSMISTQNQYSTLSKPAALTGTLEVR LMGCQDILENVPGRSKATSVALPGWSPSETRSSFMSRTSKSKSGSSRNLLKTDDLSND VCAVLKLDNTVVGQTSWKPISNQSWDQKFTLELDRSRELEISVYWRDWRSLCAVKFLR LEDFLDNQRHGMCLYLEPQGTLFAEVTFFNPVIERRPKLQRQKKIFSKQQGKTFLRAP QMNINIATWGRLVRRAIPTVNHSGTFSPQAPVPTTVPVVDVRIPQLAPPASDSTVTKL DFDLEPEPPPAPPRASSLGEIDESSELRVLDIPGQDSETVFDIQNDRNSILPKSQSEY KPDTPQSGLEYSGIQELEDRRSQQRFQFNLQDFRCCAVLGRGHFGKVLLAEYKNTNEM FAIKALKKGDIVARDEVDSLMCEKRIFETVNSVRHPFLVNLFACFQTKEHVCFVMEYA AGGDLMMHIHTDVFSEPRAVFYAACVVLGLQYLHEHKIVYRDLKLDNLLLDTEGFVKI ADFGLCKEGMGYGDRTSTFCGTPEFLAPEVLTETSYTRAVDWWGLGVLIYEMLVGESP FPGDDEEEVFDSIVNDEVRYPRFLSTEAISIMRRLLRRNPERRLGASEKDAEDVKKHP FFRLIDWSALMDKKVKPPFIPTIRGREDVSNFDDEFTSEAPILTPPREPRILSEEEQE MFRDFDYIADWC" BASE COUNT 1070 a 582 c 700 g 903 t ORIGIN 1 ggagcgcaaa tggcgtccaa ccccgaacgg ggggagattc tgctcacgga actgcagggg 61 gattcccgaa gtcttccgtt ttctgagaat gtgagtgctg ttcaaaaatt agacttttca 121 gatacaatgg tgcagcagaa attggatgat atcaaggatc gaattaagag agaaataagg 181 aaagaactga aaatcaaaga aggagctgaa aatctgagga aagtcacaac agataaaaaa 241 agtttggctt atgtagacaa cattttgaaa aaatcaaata aaaaattaga agaactacat 301 cacaagctgc aggaattaaa tgcacatatt gttgtatcag atccagaaga tattacagat 361 tgcccaagga ctccagatac tccaaataat gaccctcgtt gttctactag caacaataga 421 ttgaaggcct tacaaaaaca attggatata gaacttaaag taaaacaagg tgcagagaat 481 atgatacaga tgtattcaaa tggatcttca aaggatcgga aactccatgg tacagctcag 541 caactgctcc aggacagcaa gacaaaaata gaagtcatac gaatgcagat tcttcaggca 601 gtccagacta atgaattggc ttttgataat gcaaaacctg tgataagtcc tcttgaactt 661 cggatggaag aattaaggca tcattttagg atagagtttg cagtagcaga aggtgcaaag 721 aatgtaatga aattacttgg ctcaggaaaa gtaacagaca gaaaagcact ttcagaagct 781 caagcaagat ttaatgaatc aagtcagaag ttggaccttt taaagtattc attagagcaa 841 agattaaacg aagtccccaa gaatcatccc aaaagcagga ttattattga agaactttca 901 cttgttgctg catcaccaac actaagtcca cgtcaaagta tgatatctac gcaaaatcaa 961 tatagtacac tatccaaacc agcagcacta acaggtactt tggaagttcg tcttatgggc 1021 tgccaagata tcctagagaa tgtccctgga cggtcaaaag caacatcagt tgcactgcct 1081 ggttggagtc caagtgaaac cagatcatct ttcatgagca gaacgagtaa aagtaaaagc 1141 ggaagtagtc gaaatcttct aaaaaccgat gacttgtcca atgatgtctg tgctgttttg 1201 aagctcgata atactgtggt tggccaaact agctggaaac ccatttccaa tcagtcatgg 1261 gaccagaagt ttacactgga actggacagg tcacgtgaac tggaaatttc agtttattgg 1321 cgtgattggc ggtctctgtg tgctgtaaaa tttctgaggt tagaagattt tttagacaac 1381 caacggcatg gcatgtgtct ctatttggaa ccacagggta ctttatttgc agaggttacc 1441 ttttttaatc cagttattga aagaagacca aaacttcaaa gacaaaagaa aattttttca 1501 aagcaacaag gcaaaacatt tctcagagct cctcaaatga atattaatat tgccacttgg 1561 ggaaggctag taagaagagc tattcctaca gtaaatcatt ctggcacctt cagccctcaa 1621 gctcctgtgc ctactacagt gccagtggtt gatgtacgca tccctcaact agcacctcca 1681 gctagtgatt ctacagtaac caaattggac tttgatcttg agcctgaacc tcctccagcc 1741 ccaccacgag cttcttctct tggagaaata gatgaatctt ctgaattaag agttttggat 1801 ataccaggac aggattcaga gactgttttt gatattcaga atgacagaaa tagtatactt 1861 ccaaaatctc aatctgaata caagcctgat actcctcagt caggcctaga atatagtggt 1921 attcaagaac ttgaggacag aagatctcag caaaggtttc agtttaatct acaagatttc 1981 aggtgttgtg ctgtcttggg aagaggacat tttggaaagg tgcttttagc tgaatataaa 2041 aacacaaatg agatgtttgc tataaaagcc ttaaagaaag gagatattgt ggctcgagat 2101 gaagtagaca gcctgatgtg tgaaaaaaga atttttgaaa ctgtgaatag tgtaaggcat 2161 ccctttttgg tgaacctttt tgcatgtttc caaaccaaag agcatgtttg ctttgtaatg 2221 gaatatgctg ccggtgggga cctaatgatg cacattcata ctgatgtctt ttctgaacca 2281 agagctgtat tttatgctgc ttgtgtagtt cttgggttgc agtatttaca tgaacacaaa 2341 attgtttata gagatttgaa attggataac ttattgctag atacagaggg ctttgtgaaa 2401 attgctgatt ttggtctttg caaagaagga atgggatatg gagatagaac aagcacattt 2461 tgtggcactc ctgaatttct tgccccagaa gtattaacag aaacttctta tacaagggct 2521 gtagattggt ggggccttgg cgtgcttata tatgaaatgc ttgttggtga gtctcccttt 2581 cctggtgatg atgaagagga agtttttgac agtattgtaa atgatgaagt aaggtatcca 2641 aggttcttat ctacagaagc catttctata atgagaaggc tgttaagaag aaatcctgaa 2701 cggcgccttg gggctagcga gaaagatgca gaggatgtaa aaaagcaccc atttttccgg 2761 ctaattgatt ggagcgctct gatggacaaa aaagtaaagc caccatttat acctaccata 2821 agaggacgag aagatgttag taattttgat gatgaattta cctcagaagc acctattctg 2881 actccacctc gagaaccaag gatactttcg gaagaggagc aggaaatgtt cagagatttt 2941 gactacattg ctgattggtg ttaagttgct agacactgcg aaaccaagct gactcacaag 3001 aagacctctt aaaaatagca acccttcatt tgctctctgt gccaccaata gcttctgagt 3061 tttttgttgt tgttgttttt attgaaacac gtgaagattt gtttaaaagt accattctaa 3121 tacttcttca aaagtggctc ctcattgtac ttcagcgtaa atatgagcac tggaaacagt 3181 ttcatggagt ttaagttgag tgaacatcgg ccatgaaaat ccatcacgaa tacttttgga 3241 tcaatagtct atttt // LOCUS HSU33054 2113 bp mRNA PRI 24-MAR-1996 DEFINITION Human G protein-coupled receptor kinase GRK4 mRNA, alpha splice variant, complete cds. ACCESSION U33054 NID g971254 KEYWORDS IT11 kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2113) AUTHORS Premont,R.T., Macrae,A.D., Stoffel,R.H., Chung,N., Pitcher,J.A., Ambrose,C., Inglese,J., MacDonald,M.E. and Lefkowitz,R.J. TITLE Characterization of the G protein-coupled receptor kinase GRK4. Identification of four splice variants JOURNAL J. Biol. Chem. 271 (11), 6403-6410 (1996) MEDLINE 96198106 REFERENCE 2 (bases 1 to 2113) AUTHORS Premont,R.T. TITLE Direct Submission JOURNAL Submitted (02-AUG-1995) Richard T. Premont, Medicine (Cardiology), Duke University Medical Center, Durham, NC 27710, USA FEATURES Location/Qualifiers source 1..2113 /organism="Homo sapiens" /db_xref="taxon:9606" /map="4p16.3" /chromosome="4" /sex="male" /tissue_type="testis" /dev_stage="adult" CDS 255..1991 /note="beta and gamma splice variants are located in GenBank Accession Numbers U33055 and U33056, respectively; delta splice variant is located in GenBank Accession Number L03718" /codon_start=1 /product="G protein-coupled receptor kinase GRK4, alpha splice variant" /db_xref="PID:g971255" /translation="MELENIVANSLLLKARQGGYGKKSGRSKKWKEILTLPPVSQCSE LRHSIEKDYSSLCDKQPIGRRLFRQFCDTKPTLKRHIEFLDAVAEYEVADDEDRSDCG LSILDRFFNDKLAAPLPEIPPDVVTECRLGLKEENPSKKAFEECTRVAHNYLRGEPFE EYQESSYFSQFLQWKWLERQPVTKNTFRHYRVLGKGGFGEVCACQVRATGKMYACKKL QKKRIKKRKGEAMALNEKRILEKVQSRFVVSLAYAYETKDALCLVLTIMNGGDLKFHI YNLGNPGFDEQRAVFYAAELCCGLEDLQRERIVYRDLKPENILLDDRGHIRISDLGLA TEIPEGQRVRGRVGTVGYMAPEVVNNEKYTFSPDWWGLGCLIYEMIQGHSPFKKYKEK VKWEEVDQRIKNDTEEYSEKFSEDAKSICRMLLTKNPSKRLGCRGEGAAGVKQHPVFK DINFRRLEANMLEPPFCPDPHAVYCKDVLDIEQFSAVKGIYLDTADEDFYARFATGCV SIPWQNEMIESGCFKDINKSESEEALPLDLDKNIHTPVSRPNRGFFYRLFRRGGCLTM VPSEKEVEPKQC" BASE COUNT 593 a 467 c 590 g 463 t ORIGIN 1 gcagccgccg cggtcgggct gccccctccc ctcgccccga ccgctcccct gctggtgagg 61 gcctgcgcag gcggcggcgg cggcgccctt ggtggcagtg gtggcggcgg agcagcctcc 121 cgggatcgtg tctggagctc gaggagaggg tagtgcccgg cgagctatgc acgggggcgg 181 cggcgtctcc tcctgttccg cctcctcagt ctcctcggtc tcgcagaatc cgccggcggc 241 ggcggcgcca ggacatggag ctcgagaaca tcgtggccaa ctcgctgctg ctgaaagcgc 301 gtcaaggagg atatggcaaa aaaagtggtc gtagtaaaaa atggaaggag atactgacac 361 tgcctcctgt cagccagtgc agtgagctta gacattccat tgaaaaggat tatagcagtc 421 tttgtgacaa gcaaccgata ggaagacgtc tcttcaggca gttctgtgat accaaaccca 481 ctctaaagag gcacattgaa ttcttggatg cagtggcaga atatgaagtt gccgatgatg 541 aggaccgaag tgattgtgga ctgtcaatct tagatagatt cttcaatgat aagttggcag 601 cccctttacc agaaatacct ccagatgttg tgacagaatg tagattggga ctgaaggagg 661 agaacccttc caaaaaagcc tttgaggaat gtactagagt tgcccataac tacctaagag 721 gggaaccatt tgaagaatac caagaaagct catatttttc tcagttttta caatggaaat 781 ggctggaaag gcaacccgta acaaagaaca catttagaca ttacagagtt ctaggaaaag 841 gcggatttgg agaggtttgc gcctgtcaag tgcgagccac aggaaaaatg tatgcctgca 901 aaaagctaca aaaaaaaaga ataaagaaga ggaaaggtga agctatggct ctaaatgaga 961 aaagaattct ggagaaagtg caaagtagat tcgtagttag tttagcctac gcttatgaaa 1021 ccaaagatgc cttgtgcttg gtgctcacca ttatgaatgg aggggatttg aagtttcaca 1081 tttacaacct gggcaatccc ggctttgatg agcagagagc cgttttctat gctgcagagc 1141 tgtgttgcgg cttggaagat ttacagaggg aaagaattgt atacagagac ttgaagcctg 1201 agaatattct ccttgatgat cgtggacaca tccggatttc agacctcggt ttggccacag 1261 agatcccaga aggacagagg gttcgaggaa gagttggaac agtcggctac atggcacctg 1321 aagttgtcaa taatgaaaag tatacgttta gtcccgattg gtggggactt ggctgtctga 1381 tctatgaaat gattcaggga cattctccat tcaaaaaata caaagagaaa gtcaaatggg 1441 aggaggtcga tcaaagaatc aagaatgata ccgaggagta ttctgagaag ttttcagagg 1501 atgccaaatc tatctgcagg atgttactca ccaagaatcc aagcaagcgg ctgggctgca 1561 ggggcgaggg agcggctggg gtgaagcagc accccgtgtt caaggacatc aacttcagga 1621 ggctggaggc aaacatgctg gagccccctt tctgtcctga tcctcatgcc gtttactgta 1681 aggacgtcct ggatatcgag cagttctcgg cggtgaaagg gatctacctg gacaccgcag 1741 atgaagactt ctatgctcgg tttgctaccg ggtgtgtctc catcccctgg cagaatgaga 1801 tgatcgaatc cgggtgtttc aaagacatca acaaaagtga aagtgaggaa gctttgccat 1861 tagatctaga caagaacata cataccccgg tttccagacc aaacagaggc ttcttctata 1921 gactcttcag aagagggggc tgcctgacca tggtccccag tgagaaggaa gtggaaccca 1981 agcaatgctg agcaccccgg tgcggaccac agagcagacc ctggcgccag gaaggagcat 2041 gtgttagcgt ctcgtcccac ctggaattgt aataaataca tctaaataaa acatgccttg 2101 ggagtgtaca gac // LOCUS HSU33147 503 bp mRNA PRI 22-FEB-1996 DEFINITION Human mammaglobin mRNA, complete cds. ACCESSION U33147 NID g1199595 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 503) AUTHORS Watson,M.A. and Fleming,T.P. TITLE Mammaglobin, a mammary-specific member of the uteroglobin gene family, is overexpressed in human breast cancer JOURNAL Cancer Res. 56 (4), 860-865 (1996) MEDLINE 96223698 REFERENCE 2 (bases 1 to 503) AUTHORS Watson,M.A. TITLE Direct Submission JOURNAL Submitted (03-AUG-1995) Mark A. Watson, Washington Univ. School of Medicine, Dept. Ophthalmology and Visual Sciences, Dept. Genetics, Div. Laboratory Medicine, Box 8118, 660 S. Euclid Avenue, St. Louis, MO 63110, USA FEATURES Location/Qualifiers source 1..503 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="breast" CDS 61..342 /note="similar to uteroglobin: SwissProt Accession Number P02779 and rat prostatic steroid binding protein subunit C3: SwissProt Accession Number P02780" /codon_start=1 /product="mammaglobin" /db_xref="PID:g1199596" /translation="MKLLMVLMLAALSQHCYAGSGCPLLENVISKTINPQVSKTEYKE LLQEFIDDNATTNAIDELKECFLNQTDETLSNVEVFMQLIYDSSLCDLF" BASE COUNT 146 a 118 c 97 g 142 t ORIGIN 1 gacagcggct tccttgatcc ttgccacccg cgactgaaca ccgacagcag cagcctcacc 61 atgaagttgc tgatggtcct catgctggcg gccctctccc agcactgcta cgcaggctct 121 ggctgcccct tattggagaa tgtgatttcc aagacaatca atccacaagt gtctaagact 181 gaatacaaag aacttcttca agagttcata gacgacaatg ccactacaaa tgccatagat 241 gaattgaagg aatgttttct taaccaaacg gatgaaactc tgagcaatgt tgaggtgttt 301 atgcaattaa tatatgacag cagtctttgt gatttatttt aactttctgc aagacctttg 361 gctcacagaa ctgcagggta tggtgagaaa ccaactacgg attgctgcaa accacacctt 421 ctctttctta tgtcttttta ctacaaacta caagacaatt gttgaaacct gctatacatg 481 tttattttaa taaattgatg gca // LOCUS HSU33267 2106 bp mRNA PRI 04-DEC-1996 DEFINITION Human glycine receptor beta subunit (GLRB) mRNA, complete cds. ACCESSION U33267 NID g992686 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2106) AUTHORS Handford,C.A., Lynch,J.W., Baker,E., Webb,G.C., Ford,J.H., Sutherland,G.R. and Schofield,P.R. TITLE The human glycine receptor beta subunit: primary structure, functional characterisation and chromosomal localisation of the human and murine genes JOURNAL Brain Res. Mol. Brain Res. 35 (1-2), 211-219 (1996) MEDLINE 96352561 REFERENCE 2 (bases 1 to 2106) AUTHORS Handford,C.A. and Schofield,P.R. TITLE Direct Submission JOURNAL Submitted (06-AUG-1995) Peter R. Schofield, Garvan Institute of Medical Research, 384 Victoria Street, Darlinghurst, Sydney, NSW 2010, Australia FEATURES Location/Qualifiers source 1..2106 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="hippocampal" /chromosome="4" /map="4q32; Murine 3E3-F1" gene 77..1570 /gene="GLRB" CDS 77..1570 /gene="GLRB" /codon_start=1 /product="glycine receptor beta subunit" /db_xref="PID:g992687" /translation="MKFLLTTAFLILISLWVEEAYSKEKSSKKGKGKKKQYLCPSQQS AEDLARVPANSTSNILNRLLVSYDPRIRPNFKGIPVDVVVNIFINSFGSIQETTMDYR VNIFLRQKWNDPRLKLPSDFRGSDALTVDPTMYKCLWKPDLFFANEKSANFHDVTQEN ILLFIFRDGDVLVSMRLSITLSCPLDLTLFPMDTQRCKMQLESFGYTTDDLRFIWQSG DPVQLEKIALPQFDIKKEDIEYGNCTKYYKGTGYYTCVEVIFTLRRQVGFYMMGVYAP TLLIVVLSWLSFWINPDASAARVPLGIFSVLSLASECTTLAAELPKVSYVKALDVWLI ACLLFGFASLVEYAVVQVMLNNPKRVEAEKARIAKAEQADGKGGNVAKKNTVNGTGTP VHISTLQVGETRCKKVCTSKSDLRSNDFSIVGSLPRDFELSNYDCYGKPIEVNNGLGK SQAKNNKKPPPAKPVIPTAAKRIDLYARALFPFCFLFFNVIYWSIYL" BASE COUNT 622 a 409 c 424 g 651 t ORIGIN 1 gagcctccac gatctcgccc ggcgattgtg ggcaggggcg cctccggatc gatcttctga 61 aattcaagtt ttcaagatga agtttttatt gacaactgcc tttttaattt taatttcctt 121 gtgggtggaa gaagcctatt ctaaggaaaa gtcttcaaag aaagggaagg ggaaaaagaa 181 gcagtatcta tgcccatctc agcagtcagc agaggacctt gcccgagtac ctgccaactc 241 cactagcaat atcttgaaca ggttattggt cagttatgat cccaggataa gaccaaactt 301 caaaggcatt cctgttgatg tagtagtcaa catttttatt aacagttttg gatccattca 361 agaaacaaca atggactata gagttaacat cttcctgaga caaaaatgga atgaccccag 421 gctgaagctc cccagtgatt ttaggggttc agatgcactg acagtggatc caacaatgta 481 caagtgttta tggaaacctg atttattttt tgcaaatgaa aaaagtgcca attttcatga 541 tgtgacccag gaaaacatcc tcctctttat ttttcgtgat ggagatgtcc ttgtcagcat 601 gaggttatct attactcttt catgcccttt ggacttgaca ttgtttccca tggatacaca 661 acgttgcaag atgcaactgg agagctttgg ttacacaact gatgatttac gatttatctg 721 gcagtcagga gatcctgtgc aattagaaaa aattgccttg cctcaatttg atatcaaaaa 781 ggaagatatt gaatatggta actgtacaaa atactataaa ggcacgggct actacacatg 841 cgtggaagtc atcttcaccc tgaggaggca ggtcggcttt tacatgatgg gggtctacgc 901 cccaaccctg ctcattgttg ttctctcctg gctttccttc tggatcaacc cggacgcgag 961 tgctgccaga gtgcccctgg gtatcttctc agtcctcagc ttggcctctg agtgcacaac 1021 ccttgccgct gagcttccca aagtttccta tgtgaaggct cttgatgttt ggcttattgc 1081 ttgccttctc tttgggtttg cttccctggt ggagtatgca gttgtccagg tgatgctgaa 1141 caaccccaaa agggttgaag ctgaaaaagc cagaattgct aaggctgagc aagcagatgg 1201 aaaaggtgga aatgtggcta aaaagaatac tgtgaatgga acagggactc ctgttcatat 1261 tagcactttg caggttggtg agaccagatg caaaaaagtt tgtacttcta agtctgatct 1321 gagatctaat gacttcagca ttgttggaag cttaccaaga gattttgaac tatccaatta 1381 tgactgctat ggaaaaccca ttgaagttaa caacggactt gggaaatctc aggctaagaa 1441 caacaagaag cctccccctg cgaaacctgt tattccaaca gcagcaaagc gaattgatct 1501 ttatgcaaga gcattgtttc ctttctgctt cttgttcttc aatgttatat attggtctat 1561 atatttatga taaatctttt ccatttgtac aaaataaaat tccatttcat tgtgacctac 1621 tcctttcata aatgccaatc tgtgagaact tttgaatttt catagcaaca ttgcattttg 1681 gatgccattt gattgtaata aaactgtggc accttaattt tgaatggcag catgatcatg 1741 taatatctgt gctctaataa cgatgtatat atgtatagtg aacatattgc ttagtaacaa 1801 atgaaggaca agcatactac ataatataat ccatacaatt ctcttcagtt agtgtaaact 1861 gcaaatacta cagataattc tgataataaa atgatatgca cgctgaatcc tgctatggta 1921 ccattctaat gtatgtagta tttcaaattt ccttccttgt aactttcaaa gaaagccatc 1981 ttattcttgt aaaattttag atggtattat cacagattta aaaaggttgt attacatatt 2041 gtttaaactt tgtaagtaga aatatatctg ttataattat acaggctctg tggagaaata 2101 aagttc // LOCUS HSU33284 3416 bp mRNA PRI 22-SEP-1995 DEFINITION Human protein tyrosine kinase PYK2 mRNA, complete cds. ACCESSION U33284 NID g988304 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3416) AUTHORS Lev,S., Moreno,H., Martinez,R., Canoll,P., Peles,E., Musacchio,J.M., Plowman,G.D., Rudy,B. and Schlessinger,J. TITLE Protein tyrosine kinase PYK2 involved in Ca(2+)-induced regulation of ion channel and MAP kinase functions JOURNAL Nature 376 (6543), 737-745 (1995) MEDLINE 95379967 REFERENCE 2 (bases 1 to 3416) AUTHORS Lev,S.S. TITLE Direct Submission JOURNAL Submitted (07-AUG-1995) Sima S. Lev, Neurobiology, SUGEN Inc., 515 Galveston Drive, Redwood City, CA 94063, USA FEATURES Location/Qualifiers source 1..3416 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="PYK2" /tissue_type="brain" /dev_stage="fetal" 5'UTR 1..107 CDS 108..3137 /EC_number="2.7.1.112" /note="protein tyrosine kinase; non-receptor tyrosine kinase" /codon_start=1 /product="PYK2" /db_xref="PID:g988305" /translation="MSGVSEPLSRVKLGTLRRPEGPAEPMVVVPVDVEKEDVRILKVC FYSNSFNPGKNFKLVKCTVQTEIREIITSILLSGRIGPNIRLAECYGLRLKHMKSDEI HWLHPQMTVGEVQDKYECLHVEAEWRYDLQIRYLPEDFMESLKEDRTTLLYFYQQLRN DYMQRYASKVSEGMALQLGCLELRRFFKDMPHNALDKKSNFELLEKEVGLDLFFPKQM QENLKPKQFRKMIQQTFQQYASLREEECVMKFFNTLAGFANIDQETYRCELIQGWNIT VDLVIGPKGIRQLTSQDAKPTCLAEFKQIRSIRCLPLEEGQAVLQLGIEGAPQALSIK TSSLAEAENMADLIDGYCRLQGEHQGSLIIHPRKDGEKRNSLPQIPMLNLEARRSHLS ESCSIESDIYAEIPDETLRRPGGPQYGIAREDVVLNRILGEGFFGEVYEGVYTNHKGE KINVAVKTCKKDCTLDNKEKFMSEAVIMKNLDHPHIVKLIGIIEEEPTWIIMELYPYG ELGHYLERNKNSLKVLTLVLYSLQICKAMAYLESINCVHRDIAVRNILVASPECVKLG DFGLSRYIEDEDYYKASVTRLPIKWMSPESINFRRFTTASDVWMFAVCMWEILSFGKQ PFFWLENKDVIGVLEKGDRLPKPDLCPPVLYTLMTRCWDYDPSDRPRFTELVCSLSDV YQMEKDIAMEQERNARYRTPKILEPTAFQEPPPKPSRPKYRPPPQTNLLAPKLQFQVP EGLCASSPTLTSPMEYPSPVNSLHTPPLHRHNVFKRHSMREEDFIQPSSREEAQQLWE AEKVKMRQILDKQQKQMVEDYQWLRQEEKSLDPMVYMNDKSPLTPEKEVGYLEFTGPP QKPPRLGAQSIQPTANLDRTDDLVYLNVMELVRAVLELKNELCQLPPEGYVVVVKNVG LTLRKLIGSVDDLLPSLPSSSRTEIEGTQKLLNKDLAELINKMRLAQQNAVTSLSEEC KRQMLTASHTLAVDAKNLLDAVDQAKVLANLAHPPAE" 3'UTR 3138..3146 BASE COUNT 796 a 988 c 984 g 648 t ORIGIN 1 cggtacaggt aagtcggccg ggcaggtagg ggtgcccgag gagtagtcgc tggagtccgc 61 gcctccctgg gactgcaatg tgccggtctt agctgctgcc tgagaggatg tctggggtgt 121 ccgagcccct gagccgagta aagttgggca cattacgccg gcctgaaggc cctgcagagc 181 ccatggtggt ggtaccagta gatgtggaaa aggaggacgt gcgtatcctc aaggtctgct 241 tctatagcaa cagcttcaat cctgggaaga acttcaaact ggtcaaatgc actgtccaga 301 cggagatccg ggagatcatc acctccatcc tgctgagcgg gcggatcggg cccaacatcc 361 ggttggctga gtgctatggg ctgaggctga agcacatgaa gtccgatgag atccactggc 421 tgcacccaca gatgacggtg ggtgaggtgc aggacaagta tgagtgtctg cacgtggaag 481 ccgagtggag gtatgacctt caaatccgct acttgccaga agacttcatg gagagcctga 541 aggaggacag gaccacgctg ctctattttt accaacagct ccggaacgac tacatgcagc 601 gctacgccag caaggtcagc gagggcatgg ccctgcagct gggctgcctg gagctcaggc 661 ggttcttcaa ggatatgccc cacaatgcac ttgacaagaa gtccaacttc gagctcctag 721 aaaaggaagt ggggctggac ttgtttttcc caaagcagat gcaggagaac ttaaagccca 781 aacagttccg gaagatgatc cagcagacct tccagcagta cgcctcgctc agggaggagg 841 agtgcgtcat gaagttcttc aacactctcg ccggcttcgc caacatcgac caggagacct 901 accgctgtga actcattcaa ggatggaaca ttactgtgga cctggtcatt ggccctaaag 961 ggatccgcca gctgactagt caggacgcaa agcccacctg cctggccgag ttcaagcaga 1021 tcaggtccat caggtgcctc ccgctggagg agggccaggc agtacttcag ctgggcattg 1081 aaggtgcccc ccaggccttg tccatcaaaa cctcatccct agcagaggct gagaacatgg 1141 ctgacctcat agacggctac tgccggctgc agggtgagca ccaaggctct ctcatcatcc 1201 atcctaggaa agatggtgag aagcggaaca gcctgcccca gatccccatg ctaaacctgg 1261 aggcccggcg gtcccacctc tcagagagct gcagcataga gtcagacatc tacgcagaga 1321 ttcccgacga aaccctgcga aggcccggag gtccacagta tggcattgcc cgtgaagatg 1381 tggtcctgaa tcgtattctt ggggaaggct tttttgggga ggtctatgaa ggtgtctaca 1441 caaatcacaa aggggagaaa atcaatgtag ctgtcaagac ctgcaagaaa gactgcactc 1501 tggacaacaa ggagaagttc atgagcgagg cagtgatcat gaagaacctc gaccacccgc 1561 acatcgtgaa gctgatcggc atcattgaag aggagcccac ctggatcatc atggaattgt 1621 atccctatgg ggagctgggc cactacctgg agcggaacaa gaactccctg aaggtgctca 1681 ccctcgtgct gtactcactg cagatatgca aagccatggc ctacctggag agcatcaact 1741 gcgtgcacag ggacattgct gtccggaaca tcctggtggc ctcccctgag tgtgtgaagc 1801 tgggggactt tggtctttcc cggtacattg aggacgagga ctattacaaa gcctctgtga 1861 ctcgtctccc catcaaatgg atgtccccag agtccattaa cttccgacgc ttcacgacag 1921 ccagtgacgt ctggatgttc gccgtgtgca tgtgggagat cctgagcttt gggaagcagc 1981 ccttcttctg gctggagaac aaggatgtca tcggggtgct ggagaaagga gaccggctgc 2041 ccaagcctga tctctgtcca ccggtccttt ataccctcat gacccgctgc tgggactacg 2101 accccagtga ccggccccgc ttcaccgagc tggtgtgcag cctcagtgac gtttatcaga 2161 tggagaagga cattgccatg gagcaagaga ggaatgctcg ctaccgaacc cccaaaatct 2221 tggagcccac agccttccag gaacccccac ccaagcccag ccgacctaag tacagacccc 2281 ctccgcaaac caacctcctg gctccaaagc tgcagttcca ggttcctgag ggtctgtgtg 2341 ccagctctcc tacgctcacc agccctatgg agtatccatc tcccgttaac tcactgcaca 2401 ccccacctct ccaccggcac aatgtcttca aacgccacag catgcgggag gaggacttca 2461 tccaacccag cagccgagaa gaggcccagc agctgtggga ggctgaaaag gtcaaaatgc 2521 ggcaaatcct ggacaaacag cagaagcaga tggtggagga ctaccagtgg ctcaggcagg 2581 aggagaagtc cctggacccc atggtttata tgaatgataa gtccccattg acgccagaga 2641 aggaggtcgg ctacctggag ttcacagggc ccccacagaa gcccccgagg ctgggcgcac 2701 agtccatcca gcccacagct aacctggacc ggaccgatga cctggtgtac ctcaatgtca 2761 tggagctggt gcgggccgtg ctggagctca agaatgagct ctgtcagctg ccccccgagg 2821 gctacgtggt ggtggtgaag aatgtggggc tgaccctgcg gaagctcatc gggagcgtgg 2881 atgatctcct gccttccttg ccgtcatctt cacggacaga gatcgagggc acccagaaac 2941 tgctcaacaa agacctggca gagctcatca acaagatgcg gctggcgcag cagaacgccg 3001 tgacctccct gagtgaggag tgcaagaggc agatgctgac ggcttcacac accctggctg 3061 tggacgccaa gaacctgctc gacgctgtgg accaggccaa ggttctggcc aatctggccc 3121 acccacctgc agagtgacgg agggtggggg ccacctgcct gcgtcttccg cccctgcctg 3181 ccatgtacct cccctgcctt gctgttggtc atgtgggtct tccagggaga aggccaaggg 3241 gagtcacctt cccttgccac tttgcacgac gccctctccc cacccctacc cctggctgta 3301 ctgctcaggc tgcagctgga cagaggggac tctgggctat ggacacaggg tgacggtgac 3361 aaagatggct cagaggggga ctgctgctgc ctggccactg ctccctaagc cagcct // LOCUS HSU33286 3147 bp mRNA PRI 14-FEB-1996 DEFINITION Human chromosome segregation gene homolog CAS mRNA, complete cds. ACCESSION U33286 NID g1050964 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3147) AUTHORS Brinkmann,U., Brinkmann,E., Gallo,M. and Pastan,I. TITLE Cloning and characterization of a cellular apoptosis susceptibility gene, the human homologue to the yeast chromosome segregation gene CSE1 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (22), 10427-10431 (1995) MEDLINE 96036098 REFERENCE 2 (bases 1 to 3147) AUTHORS Brinkmann,U., Brinkmann,E. and Pastan,I. TITLE Expression cloning of cDNAs that render cancer cells resistant to Pseudomonas and Diphtheria toxin and immunotoxins JOURNAL Mol. Med. (Camb. Mass.) 1, 206-216 (1995) REFERENCE 3 (bases 1 to 3147) AUTHORS Brinkmann,U., Gallo,M., Polymeropoulos,M.H. and Pastan,I. TITLE The human CAS gene maps on chromosome 20q13, is amplified in BT474 breast cancer cells and part of aberrant chromosomes in breast and colon cancer cells JOURNAL Genome Res. (1996) In press REFERENCE 4 (bases 1 to 3147) AUTHORS Brinkmann,U. TITLE Direct Submission JOURNAL Submitted (07-AUG-1995) Ulrich Brinkman, Laboratory of Molecular Biology, DCBDC, National Cancer Institute, National Institutes of Health, Building 37, Room 4E16, 37 Convent Dr., MSC 4255, Bethesda, MD 20892-4255, USA FEATURES Location/Qualifiers source 1..3147 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /chromosome="20" /map="20q13" CDS 124..3039 /note="yeast chromosome segregation gene CSE1 homolog" /codon_start=1 /product="CAS" /db_xref="PID:g951338" /translation="MELSDANLQTLTEYLKKTLDPDPAIRRPAEKFLESVEGNQNYPL LLLTLLEKSQDNVIKVCASVTFKNYIKRNWRIVEDEPNKICEADRVAIKANIVHLMLS SPEQIQKQLSDAISIIGREDFPQKWPDLLTEMVNRFQSGDFHVINGVLRTAHSLFKRY RHEFKSNELWTEIKLVLDAFALPLTNLFKATIELCSTHANDASALRILFSSLILISKL FYSLNFQDLPEFWEGNMETWMNNFHTLLTLDNKLLQTDDEEEAGLLELLKSQICDNAA LYAQKYDEEFQRYLPRFVTAIWNLLVTTGQEVKYDLLVSNAIQFLASVCERPHYKNLF EDQNTLTSICEKVIVPNMEFRAADEEAFEDNSEEYIRRDLEGSDIDTRRRAACDLVRG LCKFFEGPVTGIFSGYVNSMLQEYAKNPSVNWKHKDAAIYLVTSLASKAQTQKHGITQ ANELVNLTEFFVNHILPDLKSANVNEFPVLKADGIKYIMIFRNQVPKEHLLVSIPLLI NHLQAGSIVVHTYAAHALERLFTMRGPNNATLFTAAEIAPFVEILLTNLFKALTLPGS SENEYIMKAIMRSFSLLQEAIIPYIPTLITQLTQKLLAVSKNPSKPHFNHYMFEAICL SIRITCKANPAAVVNFEEALFLVFTEILQNDVQEFIPYVFQVMSLLLETHKNDIPSSY MALFPHLLQPVLWERTGNIPALVRLLQAFLERGSNTIASAAADKIPGLLGVFQKLIAS KANDHQGFYLLNSIIEHMPPESVDQYRKQIFILLFQRLQNSKTTKFIKSFLVFINLYC IKYGALALQEIFDGIQPKMFGMVLEKIIIPEIQKVSGNVEKKICAVGITNLLTECPPM MDTEYTKLWTPLLQSLIGLFELPEDDTIPDEEHFIDIEDTPGYQTAFSQLAFAGKKEH DPVGQMVNNPKIHLAQSLHMLSTACPGRVPSMVSTSLNAEALQYLQGYLQAASVTLL" polyA_site 3147 /note="37 A nucleotides" BASE COUNT 959 a 639 c 621 g 928 t ORIGIN 1 gtcgcgccat tttgccgggg tttgaatgtg aggcggagcg gcggcaggag cggatagtgc 61 cagctacggt ccgcggctgg ggttccctcc tccgtttctg tatccccacg agatcctata 121 gcaatggaac tcagcgatgc aaatctgcaa acactaacag aatatttaaa gaaaacactt 181 gatcctgatc ctgccatccg acgtccagct gagaaatttc ttgaatctgt tgaaggaaat 241 cagaattatc cactgttgct tttgacatta ctggagaagt cccaggataa tgttatcaaa 301 gtatgtgctt cagtaacatt caaaaactat attaaaagga actggagaat tgttgaagat 361 gaaccaaaca aaatttgtga agccgatcga gtggccatta aagccaacat agtgcacttg 421 atgcttagca gcccagagca aattcagaag cagttaagtg atgcaattag cattattggc 481 agagaagatt ttccacagaa atggcctgac ttgctgacag aaatggtgaa tcgctttcag 541 agtggagatt tccatgttat taatggagtc ctccgtacag cacattcatt atttaaaaga 601 taccgtcatg aatttaagtc aaacgagtta tggactgaaa ttaagcttgt tctggatgcc 661 tttgctttgc ctttgactaa tctttttaag gccactattg aactctgcag tacccatgca 721 aatgatgcct ctgccctgag gattctgttt tcttccctga tcctgatctc aaaattgttc 781 tatagtttaa actttcagga tctccctgaa ttttgggaag gtaatatgga aacttggatg 841 aataatttcc atactctctt aacattggat aataagcttt tacaaactga tgatgaagag 901 gaagccggct tattggagct cttaaaatcc cagatttgtg ataatgccgc actctatgca 961 caaaagtacg atgaagaatt ccagcgatac ctgcctcgtt ttgttacagc catctggaat 1021 ttactagtta caacgggtca agaggttaaa tatgatttgt tggtaagtaa tgcaattcaa 1081 tttctggctt cagtttgtga gagacctcat tataagaatc tatttgagga ccagaacacg 1141 ctgacaagta tctgtgaaaa ggttattgtg cctaacatgg aatttagagc tgctgatgaa 1201 gaagcatttg aagataattc tgaggagtac ataaggagag atttggaagg atctgatatt 1261 gatactagac gcagggctgc ttgtgatctg gtacgaggat tatgcaagtt ttttgaggga 1321 cctgtgacag gaatcttctc tggttatgtt aattccatgc tgcaggaata cgcaaaaaat 1381 ccatctgtca actggaaaca caaagatgca gccatctacc tagtgacatc tttggcatca 1441 aaagcccaaa cacagaagca tggaattaca caagcaaatg aacttgtaaa cctaactgag 1501 ttctttgtga atcacatcct ccctgattta aaatcagcta atgtgaatga atttcctgtc 1561 cttaaagctg acggtatcaa atatattatg atttttagaa atcaagtgcc aaaagaacat 1621 cttttagtct cgattcctct cttgattaat catcttcaag ctggaagtat tgttgttcat 1681 acttacgcag ctcatgctct tgaacggctc tttactatgc gagggcctaa caatgccact 1741 ctctttacag ctgcagaaat cgcaccgttt gttgagattc tgctaacaaa ccttttcaaa 1801 gctctcacac ttcctggctc ttcagaaaat gaatatatta tgaaagctat catgagaagt 1861 ttttctctcc tacaagaagc cataatcccc tacatcccta ctctcatcac tcagcttaca 1921 cagaagctat tagctgttag taagaaccca agcaaacctc actttaatca ctacatgttt 1981 gaagcaatat gtttatccat aagaataact tgcaaagcta accctgctgc tgttgtaaat 2041 tttgaggagg ctttgttttt ggtgtttact gaaatcttac aaaatgatgt gcaagaattt 2101 attccatacg tctttcaagt gatgtctttg cttctggaaa cacacaaaaa tgacatcccg 2161 tcttcctata tggccttatt tcctcatctc cttcagccag tgctttggga aagaacagga 2221 aatattcctg ctctagtgag gcttcttcaa gcattcttag aacgcggttc aaacacaata 2281 gcaagtgctg cagctgacaa aattcctggg ttactaggtg tctttcagaa gctgattgca 2341 tccaaagcaa atgaccacca aggtttttat cttctaaaca gtataataga gcacatgcct 2401 cctgaatcag ttgaccaata taggaaacaa atcttcattc tgctattcca gagacttcag 2461 aattccaaaa caaccaagtt tatcaagagt tttttagtct ttattaattt gtattgcata 2521 aaatatgggg cactagcact acaagaaata tttgatggta tacaaccaaa aatgtttgga 2581 atggttttgg aaaaaattat tattcctgaa attcagaagg tatctggaaa tgtagagaaa 2641 aagatctgtg cggttggcat aaccaactta ctaacagaat gtcccccaat gatggacact 2701 gagtatacca aactgtggac tccattatta cagtctttga ttggtctttt tgagttaccc 2761 gaagatgata ccattcctga tgaggaacat tttattgaca tagaagatac accaggatat 2821 cagactgcct tctcacagtt ggcatttgct gggaaaaaag agcatgatcc tgtaggtcaa 2881 atggtgaata accccaaaat tcacctggca cagtcacttc acatgttgtc taccgcctgt 2941 ccaggaaggg ttccatcaat ggtgagcacc agcctgaatg cagaagcgct ccagtatctc 3001 caagggtacc ttcaggcagc cagtgtgaca ctgctttaaa ctgcattttt ctaatgggct 3061 aaacccagat ggtttcctag gaaatcacag gcttctgagc acagctgcat taaaacaaag 3121 gaagttttcc ttttgaactt gtcacga // LOCUS HSU33551 1128 bp mRNA PRI 30-SEP-1995 DEFINITION Human sialyltransferase (STX) mRNA, complete cds. ACCESSION U33551 NID g995770 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1128) AUTHORS Scheidegger,E.P., Sternberg,L.R., Roth,J. and Lowe,J.B. TITLE A human STX cDNA confers polysialic acid expression in mammalian cells JOURNAL J. Biol. Chem. 270 (39), 22685-22688 (1995) MEDLINE 96032684 REFERENCE 2 (bases 1 to 1128) AUTHORS Scheidegger,E.P., Sternberg,L.R., Roth,J. and Lowe,J.B. TITLE Direct Submission JOURNAL Submitted (09-AUG-1995) Eugen Paul Scheidegger, University of Michigan, Howard Hughes Medical Institute, 1150 W.Medical Center Dr., Ann Arbor, MI 48150-0650, USA FEATURES Location/Qualifiers source 1..1128 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="small cell lung carcinoma" gene 1..1128 /gene="STX" CDS 1..1128 /gene="STX" /codon_start=1 /product="sialyltransferase" /db_xref="PID:g995771" /translation="MQLQFRSWMLAALTLLVVFLIFADISEIEEEIGNSGGRGTIRSA VNSLHSKSNRAEVVINGSSSPAVVDRSNESIKHNIQPASSKWRHNQTLSLRIRKQILK FLDAEKDISVLKGTLKPGDIIHYIFDRDSTMNVSQNLYELLPRTSPLKNKHFGTCAIV GNSGVLLNSGCGQEIDAHSFVIRCNLAPVQEYARDVGLKTDLVTMNPSVIQRAFEDLV NATWREKLLQRLHSLNGSILWIPAFMARGGKERVEWVNELILKHHVNVRTAYPSLRLL HAVRGYWLTNKVHIKRPTTGLLMYTLATRFCKQIYLYGFWPFPLDQNQNPVKYHYYDS LKYGYTSQASPHTMPLEFKALKSLHEQGALKLTVGQCDGAT" BASE COUNT 281 a 321 c 291 g 235 t ORIGIN 1 atgcagctgc agttccggag ctggatgctg gccgcgctca cgctgctcgt ggtcttcctc 61 atcttcgcag acatctcaga gatcgaagaa gaaatcggga attcgggagg cagaggtaca 121 atcagatcag ctgtgaacag cttacatagc aaatctaata gagctgaagt tgtaataaac 181 ggctcctcat caccagctgt tgttgacaga agtaatgaaa gcatcaagca caacatccag 241 ccagcctcgt ccaaatggag acataaccag acgctctctc tgaggatcag gaagcagatt 301 ttaaagttct tggatgctga aaaggacatt tctgtcctaa agggaaccct gaagcctgga 361 gatattattc attacatctt cgatcgagac agcaccatga atgtgtccca gaacctctac 421 gagctcctcc ccaggacttc gccactgaag aataagcact ttgggacttg tgccatcgtg 481 ggcaactcgg gggtcttgct gaacagcggc tgtgggcagg agattgacgc ccacagcttc 541 gtcatcaggt gcaacctggc cccagtacag gagtatgccc gggatgtggg gctcaagaca 601 gacctggtaa ccatgaaccc ctcggtcatc cagcgggcct ttgaggactt ggtcaatgcc 661 acgtggcggg agaagctgct gcaacggctg cacagcctca atggcagcat cctgtggatc 721 cctgccttca tggcccgggg cggcaaggag cgtgttgagt gggtcaacga gcttatcctg 781 aagcaccacg tcaacgtgcg cactgcatac ccctcgctgc gcctgctgca cgccgttcgc 841 ggatactggc tgaccaacaa agtccacatc aaaagaccca ccaccggcct cttgatgtat 901 accctggcca cacgtttctg caaacaaatc tacctctacg gcttctggcc ctttccgctg 961 gatcagaacc agaacccagt caagtaccac tattatgaca gcctcaagta tggctacacc 1021 tcccaggcca gcccgcatac catgcccttg gagtttaagg ccctcaagag cctacatgag 1081 cagggggctt tgaaactgac tgtcggccag tgcgatgggg ccacgtag // LOCUS HSU33632 1882 bp mRNA PRI 05-JUN-1996 DEFINITION Human two P-domain K+ channel TWIK-1 mRNA, complete cds. ACCESSION U33632 NID g1086490 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1882) AUTHORS Lesage,F., Guillemare,E., Fink,M., Duprat,F., Lazdunski,M., Romey,G. and Barhanin,J. TITLE TWIK-1, a ubiquitous human weakly inward rectifying K+ channel with a novel structure JOURNAL EMBO J. 15 (5), 1004-1011 (1996) MEDLINE 96183184 REFERENCE 2 (bases 1 to 1882) AUTHORS Lesage,F. TITLE Direct Submission JOURNAL Submitted (11-AUG-1995) Florian Lesage, Institut de Pharmacologie Moleculaire et Cellulaire-CNRS, 660 Route des Lucioles, Valbonne 06560, France FEATURES Location/Qualifiers source 1..1882 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pBS-HORK" /tissue_type="kidney" /dev_stage="adult" 5'UTR 1..182 CDS 183..1193 /codon_start=1 /function="two P-domain K+ channel" /product="TWIK-1" /db_xref="PID:g1086491" /translation="MLQSLAGSSCVRLVERHRSAWCFGFLVLGYLLYLVFGAVVFSSV ELPYEDLLRQELRKLKRRFLEEHECLSEQQLEQFLGRVLEASNYGVSVLSNASGNWNW DFTSALFFASTVLSTTGYGHTVPLSDGGKAFCIIYSVIGIPFTLLFLTAVVQRITVHV TRRPVLYFHIRWGFSKQVVAIVHAVLLGFVTVSCFFFIPAAVFSVLEDDWNFLESFYF CFISLSTIGLGDYVPGEGYNQKFRELYKIGITCYLLLGLIAMLVVLETFCELHELKKF RKMFYVKKDKDEDQVHIIEHDQLSFSSITDQAAGMKEDQKQNEPFVATQSSACVDGPA NH" 3'UTR 1194..1882 BASE COUNT 449 a 435 c 512 g 486 t ORIGIN 1 gggcaggaag acggcgctgc ccggaggagc ggggcgggcg ggcgcgcggg ggagcgggcg 61 gcgggcggga gccaggcccg ggcgggggcg ggggcggcgg ggccagaaga ggcggcgggc 121 cgcgctccgg ccggtctgcg gcgttggcct tggctttggc tttggcggcg gcggtggaga 181 agatgctgca gtccctggcc ggcagctcgt gcgtgcgcct ggtggagcgg caccgctcgg 241 cctggtgctt cggcttcctg gtgctgggct acttgctcta cctggtcttc ggcgcagtgg 301 tcttctcctc ggtggagctg ccctatgagg acctgctgcg ccaggagctg cgcaagctga 361 agcgacgctt cttggaggag cacgagtgcc tgtctgagca gcagctggag cagttcctgg 421 gccgggtgct ggaggccagc aactacggcg tgtcggtgct cagcaacgcc tcgggcaact 481 ggaactggga cttcacctcc gcgctcttct tcgccagcac cgtgctctcc accacaggtt 541 atggccacac cgtgcccttg tcagatggag gtaaggcctt ctgcatcatc tactccgtca 601 ttggcattcc cttcaccctc ctgttcctga cggctgtggt ccagcgcatc accgtgcacg 661 tcacccgcag gccggtcctc tacttccaca tccgctgggg cttctccaag caggtggtgg 721 ccatcgtcca tgccgtgctc cttgggtttg tcactgtgtc ctgcttcttc ttcatcccgg 781 ccgctgtctt ctcagtcctg gaggatgact ggaacttcct ggaatccttt tatttttgtt 841 ttatttccct gagcaccatt ggcctggggg attatgtgcc tggggaaggc tacaatcaaa 901 aattcagaga gctctataag attgggatca cgtgttacct gctacttggc cttattgcca 961 tgttggtagt tctggaaacc ttctgtgaac tccatgagct gaaaaaattc agaaaaatgt 1021 tctatgtgaa gaaggacaag gacgaggatc aggtgcacat catagagcat gaccaactgt 1081 ccttctcctc gatcacagac caggcagctg gcatgaaaga ggaccagaag caaaatgagc 1141 cttttgtggc cacccagtca tctgcctgcg tggatggccc tgcaaaccat tgagcgtagg 1201 atttgttgca ttatgctaga gcaccagggt cagggtgcaa ggaagaggct taagtatgtt 1261 catttttatc agaatgcaaa agcgaaaatt atgtcacttt aagaaatagc tactgtttgc 1321 aatgtcttat taaaaaacaa caaaaaaaga cacatggaac aaagaagctg tgaccccagc 1381 aggatgtcta atatgtgagg aaatgagatg tccacctaaa attcatatgt gacaaaatta 1441 tctcgacctt acataggagg agaatacttg aagcagtatg ctgctgtggt tagaagcaga 1501 ttttatactt ttaactggaa actttggggt ttgcatttag atcatttagc tgatggctaa 1561 atagcaaaat ttatatttag aagcaaaaaa aaaaagcata gagatgtgtt ttataaatag 1621 gtttatgtgt actggtttgc atgtacccac ccaaaatgat tatttttgga gaatctaagt 1681 caaactcact atttataatg cataggtaac cattaactat gtacatataa agtataaata 1741 tgtttatatt ctgtacatat ggtttaggtc accagatcct agtgtagttc tgaaactaag 1801 actatagata ttttgtttct tttgatttct ctttatacta aagaatccag agttgctaca 1861 ataaaataag gggaataata aa // LOCUS HSU33635 4236 bp mRNA PRI 14-FEB-1996 DEFINITION Human colon carcinoma kinase-4 (CCK4) mRNA, complete cds. ACCESSION U33635 NID g1016701 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4236) AUTHORS Mossie,K., Jallal,B., Alves,F., Sures,I., Plowman,G.D. and Ullrich,A. TITLE Colon carcinoma kinase-4 defines a new subclass of the receptor tyrosine kinase family JOURNAL Oncogene 11 (10), 2179-2184 (1995) MEDLINE 96074849 REFERENCE 2 (bases 1 to 4236) AUTHORS Plowman,G.D. TITLE Direct Submission JOURNAL Submitted (10-AUG-1995) Greg D. Plowman, Molecular Biology, SUGEN, Inc, 515 Galveston Drive, Redwood City, CA 94063-4720, USA FEATURES Location/Qualifiers source 1..4236 /organism="Homo sapiens" /note="pool of 13 Human colon carcinoma cell lines" /db_xref="taxon:9606" /clone_lib="Lambda ZAP" /cell_line="SW480, SW1463, SW1417, SW837, SW948, SW620, SW403, SW116, T84, HTC15, LS123, HT29, Caco-2" gene 1..4236 /gene="CCK4" 5'UTR 1..192 /gene="CCK4" CDS 193..3405 /gene="CCK4" /note="receptor tyrosine kinase" /codon_start=1 /product="colon carcinoma kinase-4" /db_xref="PID:g1016702" /translation="MGAARGSPARPRRLPLLSVLLLPLLGGTQTAIVFIKQPSSQDAL QGRRALLRCEVEAPGPVHVYWLLDGAPVQDTERRFAQGSSLSFAAVDPLQDSGTFQCV ARDDVTGEEARSANASFNIKWIEAGPVVLKHPASEAEIQPQTQVKLRCHIDGHPRPTY QWFRDGTPLSDGQSNHTVSSKERNLTLRPAGPEHSGLYSCCAHSAFSQACSSQNFTLS IADESFARVVLAPQDVVVARYEEAMFHCQFSAQPPPSLQWLFEDETPITNRSRPPHLR RATVFANGSLLLTQVRPRNAGIYRCIGQGQRGPPIILEATLHLAEIEDMPLFEPRVFT AGSEERVTCLPPKGLPEPSVWWEHAGVRLPTHGRVYQKGHELVLANIAESDAGVYTCH AANLAGQRRQDVNITVATVPSWLKKPQDSQLEEGKPGYLDCLTQATPKPTVVWYRNQM LISEDSRFEVFKNGTLRINSVEVYDGTWYRCMSSTPAGSIEAQAVLQVLEKLKFTPPP QPQQCMGFDKEATVPCSATGREKPTIKWERADGSSLPEWVTDNAGTLHFARVTRDDAG NYTCIASNGPQGQIRAHVQLTVAVFITFKVEPERTTVYQGHTALLQCEAQGDPKPLIQ WKGKDRILDPTKLGPRMHIFQNGSLVIHDVAPEDSGRYTCIAGNSCNIKHTEAPLYVV DKPVPEESEGPGSPPPYKMIQTIGLSVGAAVAYIIAVLGLMFYCKKRCKAKRLQKQPE GEEPEMECLNGGPLQNGQPSAEIQEEVALTSLGSGPAATNKRHSTSDKMHFPRSSLQP ITTLGKSEFGEVFLAKAQGLEEGVAETLVLVKSLQSKDEQQQLDFRRELEMFGKLNHA NVVRLLGLCREAEPHYMVLEYVDLEDLKQFLRISKSKDEKLKSQPLSTKQKVALCTQV ALGMEHLSNNRFVHKDLAARNCLVSAQRQVKVSALGLSKDVYNSEYYHFRQAWVALRW MSPEAILEGDFSTKSDVWASGVLMWEVFTHGEMPHGGQADDEVLADLQAGKARLPQPE GCPSKLYRLMQRCWALSPKDRPSFSEIASALGDSTVDSKP" sig_peptide 193..270 /gene="CCK4" misc_structure 193..2304 /gene="CCK4" /note="encodes extracellular domain" mat_peptide 271..3402 /gene="CCK4" misc_structure 2305..2370 /gene="CCK4" /note="encodes transmembrane domain" misc_structure 2371..3402 /gene="CCK4" /note="encodes cytoplasmic domain" 3'UTR 3405..4236 /gene="CCK4" BASE COUNT 845 a 1254 c 1283 g 854 t ORIGIN 1 cggggactcg gaggtactgg gcgcgcgcgg ctccggctcg ggacgcctcg ggacgcctcg 61 gggtcgggtt ccggttgcgg ctgctgctgc ggcgcccgcg cttccgtagc gttccgcctc 121 ctgtgcccgc cgcggagcaa gtctgcgcgc ccgccgtgcg cccctaagct ccttttacct 181 gagcccgccg cgatgggagc tgcgcgggga tccccggcca gaccccgccg gttgcctctg 241 ctcagcgtcc tgctgctgcc gctgctgggc ggtacccaga cagccattgt cttcatcaag 301 cagccgtcct cccaggatgc actgcagggg cgccgggcgc tgcttcgctg tgaggttgag 361 gctccgggcc cggtacatgt gtactggctg ctcgatgggg cccctgtcca ggacacggag 421 cggcgtttcg cccagggcag cagcctgagc tttgcagctg tggacccgct gcaggactct 481 ggcaccttcc agtgtgtggc tcgggatgat gtcactggag aagaagcccg cagtgccaac 541 gcctccttca acatcaaatg gattgaggca ggtcctgtgg tcctgaagca tccagcctcg 601 gaagctgaga tccagccaca gacccaggtc aaacttcgtt gccacattga tgggcaccct 661 cggcccacct accaatggtt ccgagatggg accccccttt ctgatggtca gagcaaccac 721 acagtcagca gcaaggagcg gaacctgacg ctccggccag ctggtcctga gcatagtggg 781 ctgtattcct gctgcgccca cagtgctttt agccaggctt gcagcagcca gaacttcacc 841 ttgagcattg ctgatgaaag ctttgccagg gtggtgctgg caccccagga cgtggtagta 901 gcgaggtatg aggaggccat gttccattgc cagttctcag cccagccacc cccgagcctg 961 cagtggctct ttgaggatga gactcccatc actaaccgca gtcgcccccc acacctccgc 1021 agagccacag tgtttgccaa cgggtctctg ctgctgaccc aggtccggcc acgcaatgca 1081 gggatctacc gctgcattgg ccaggggcag aggggcccac ccatcatcct ggaagccaca 1141 cttcacctag cagagattga agacatgccg ctatttgagc cacgggtgtt tacagctggc 1201 agcgaggagc gtgtgacctg ccttcccccc aagggtctgc cagagcccag cgtgtggtgg 1261 gagcacgcgg gagtccggct gcccacccat ggcagggtct accagaaggg ccacgagctg 1321 gtgttggcca atattgctga aagtgatgct ggtgtctaca cctgccacgc ggccaacctg 1381 gctggtcagc ggagacagga tgtcaacatc actgtggcca ctgtgccctc ctggctgaag 1441 aagccccaag acagccagct ggaggagggc aaacccggct acttggattg cctgacccag 1501 gccacaccaa aacctacagt tgtctggtac agaaaccaga tgctcatctc agaggactca 1561 cggttcgagg tcttcaagaa tgggaccttg cgcatcaaca gcgtggaggt gtatgatggg 1621 acatggtacc gttgtatgag cagcacccca gccggcagca tcgaggcgca agccgtgctc 1681 caagtgctgg aaaagctcaa gttcacacca ccaccccagc cacagcagtg catggggttt 1741 gacaaggagg ccacggtgcc ctgttcagcc acaggccgag agaagcccac tattaagtgg 1801 gaacgggcag atgggagcag cctcccagag tgggtgacag acaacgctgg gaccctgcat 1861 tttgcccggg tgactcgaga tgacgctggc aactacactt gcattgcctc caacgggccg 1921 cagggccaga ttcgtgccca tgtccagctc actgtggcag tttttatcac cttcaaagtg 1981 gaaccagagc gtacgactgt gtaccagggc cacacagccc tactgcagtg cgaggcccag 2041 ggggacccca agccgctgat tcagtggaaa ggcaaggacc gcatcctgga ccccaccaag 2101 ctgggaccca ggatgcacat cttccagaat ggctccctgg tgatccatga cgtggcccct 2161 gaggactcag gccgctacac ctgcattgca ggcaacagct gcaacatcaa gcacacggag 2221 gcccccctct atgtcgtgga caagcctgtg ccggaggagt cggagggccc tggcagccct 2281 cccccctaca agatgatcca gaccattggg ttgtcggtgg gtgccgctgt ggcctacatc 2341 attgccgtgc tgggcctcat gttctactgc aagaagcgct gcaaagccaa gcggctgcag 2401 aagcagcccg agggcgagga gccagagatg gaatgcctca acggtgggcc tttgcagaac 2461 gggcagccct cagcagagat ccaagaagaa gtggccttga ccagcttggg ctccggcccc 2521 gcggccacca acaaacgcca cagcacaagt gataagatgc acttcccacg gtctagcctg 2581 cagcccatca ccacgctggg gaagagtgag tttggggagg tgttcctggc aaaggctcag 2641 ggcttggagg agggagtggc agagaccctg gtacttgtga agagcctgca gagcaaggat 2701 gagcagcagc agctggactt ccggagggag ttggagatgt ttgggaagct gaaccacgcc 2761 aacgtggtgc ggctcctggg gctgtgccgg gaggctgagc cccactacat ggtgctggaa 2821 tatgtggatc tggaagacct caagcagttc ctgaggattt ccaagagcaa ggatgaaaaa 2881 ttgaagtcac agcccctcag caccaagcag aaggtggccc tatgcaccca ggtagccctg 2941 ggaatggagc acctgtccaa caaccgcttt gtgcataagg acttggctgc gcgtaactgc 3001 ctggtcagtg cccagagaca agtgaaggtg tctgccctgg gcctcagcaa ggatgtgtac 3061 aacagtgagt actaccactt ccgccaggcc tgggtggcgc tgcgctggat gtcccccgag 3121 gccatcctgg agggtgactt ctctaccaag tctgatgtct gggcctccgg tgtgctgatg 3181 tgggaagtgt ttacacatgg agagatgccc catggtgggc aggcagatga tgaagtactg 3241 gcagatttgc aggctgggaa ggctagactt cctcagcccg agggctgccc ttccaaactc 3301 tatcggctga tgcagcgctg ctgggccctc agccccaagg accggccctc cttcagtgag 3361 attgccagcg ccctgggaga cagcaccgtg gacagcaagc cgtgaggagg gagcccgctc 3421 aggatggcct gggcagggga ggacatctct agagggaagc tcacagcatg atgggcaaga 3481 tccctgtcct cctgggccct gaggcccctg ccctagtgca acaggcattg ctgaggtctg 3541 agcagggcct ggcctttcct cctcttcctc accctcatcc tttgggaggc tgacttggac 3601 ccaaactggg cgactagggc tttgagctgg gcagttttcc ctgccacctc ttcctctatc 3661 agggacagtg tgggtgccac aggtaacccc aatttctggc cttcaacttc tccccttgac 3721 cgggtccaac tctgccactc atctgccaac tttgcctggg gagggctagg cttgggatga 3781 gctgggtttg tggggagttc cttaatattc tcaagttctg ggcacacagg gttaatgagt 3841 ctcttggccc actggtccca cttgggggtc tagaccagga ttatagagga cacagcaagt 3901 gagtcctccc cactctgggc ttgtgcacac tgacccagac ccacggtctt ccccaccctt 3961 ctctcctttc ctcatcctaa gtgcctggca gatgaaggag ttttcaggag cttttgacac 4021 tatataaacc gccctttttg tatgcaccac gggcggcttt tatatgtaat tgcagcgtgg 4081 ggtgggtggg catgggaggt aggggtgggc cctggagatg aggagggtgg gccatcctta 4141 ccccacactt ttattgttgt cgttttttgt gtgtttgtgt ttttttgttt ttgtttttgt 4201 ttttacactc gctgctctca ataaataagc cttttt // LOCUS HSU33761 1600 bp mRNA PRI 01-NOV-1995 DEFINITION Human cyclin A/CDK2-associated p45 (Skp2) mRNA, complete cds. ACCESSION U33761 NID g995825 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1600) AUTHORS Zhang,H., Kobayashi,R., Galaktionov,K. and Beach,D. TITLE p19Skp1 and p45Skp2 are essential elements of the cyclin A-CDK2 S phase kinase JOURNAL Cell 82 (6), 915-925 (1995) MEDLINE 96016087 REFERENCE 2 (bases 1 to 1600) AUTHORS Zhang,H., Kobayashi,R., Galaktionov,K. and Beach,D. TITLE Direct Submission JOURNAL Submitted (10-AUG-1995) Hui Zhang, Cold Spring Harbor Laboratory, 1 Bungtown Road, P. O. Box 100, Cold Spring Harbor, NY 11724, USA FEATURES Location/Qualifiers source 1..1600 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /chromosome="5" /map="5p13" gene 148..1455 /gene="Skp2" CDS 148..1455 /gene="Skp2" /codon_start=1 /product="cyclin A/CDK2-associated p45" /db_xref="PID:g995826" /translation="MHVFKTPGPADAMHRKHLQEIPDLSSNVATSFTWGWDSSKTSEL LSGMGVSALEKEEPDSENIPQELLSNLGHPESPPRKRLKSKGSDKDFVIVRRPKLNRE NFPGVSWDSLPDELLLGIFSCLCLPELLKVSGVCKRWYRLASDESLWQTLDLTGKNLH PDVTGRLLSQGVIAFRCPRSFMDQPLAEHFSPFRVQDMDLSNSVIEVSTLHGILSQCS KLQNLSLELRLSDPIVNTLAKNSNLVRLNLPGCPGFPKFPLQTFLSSCPRLDELNLSW CFNFTEKHVQVAVAHVSETMTQLNLSGYRKNLQKSDLSTLVRRCPNLVHLDLSNSVML KNDCFQEFSQLNYLQHLSLSRCYDIIPETLLELGEIPTLKTLQVFGIVPDGTLQLLKE ALPHLQINCSHFTTIARPTIGNKKNQEIWGIKCRLTLQKPSCL" BASE COUNT 405 a 406 c 383 g 406 t ORIGIN 1 gaattccggg ctgtagagct tgcccgcgca gtggggatgg aacgttgcta ggcttagcgg 61 gtctggctgc tggaggcccg agcagcacgc tcgagccgac gcgcgccaaa gcgggaatct 121 ggaaggcgaa gcagctctgc aagtttaatg cacgtattta aaactcccgg gcctgcggac 181 gctatgcaca ggaagcacct ccaggagatt ccagacctga gtagcaacgt tgccaccagc 241 ttcacgtggg gatgggattc cagcaagact tctgaactgc tgtcaggcat gggggtctcc 301 gccctggaga aagaggagcc cgacagtgag aacatccccc aggaactgct ctcaaacctg 361 ggccacccgg agagcccccc acggaaacgg ctgaagagca aagggagtga caaagacttt 421 gtaattgtcc gcaggcctaa gctaaatcgg gagaactttc caggtgtttc atgggactct 481 cttccggatg agctgctctt gggaatcttt tcctgtctgt gcctccctga gctgctaaag 541 gtctctggtg tttgtaagag gtggtatcgc ctagcgtctg atgagtctct atggcagacc 601 ttagacctta caggtaaaaa tctgcacccg gatgtgactg gtcggttgct gtctcaaggg 661 gtgattgcct tccgctgccc acgatcattt atggaccaac cattggctga acatttcagc 721 ccttttcgtg tacaggacat ggacctatcg aactcagtta tagaagtgtc caccctccac 781 ggcatactgt ctcagtgttc caagttgcag aatctaagcc tggaactgcg gctttcggat 841 cccattgtca atactctcgc aaaaaactca aatttagtgc gacttaacct tcctgggtgt 901 cctggattcc ctaaatttcc cctgcagact ttcctaagca gctgtcccag actggatgag 961 ctgaacctct cctggtgttt taatttcact gaaaagcatg tacaggtggc tgttgcgcat 1021 gtctcagaga ccatgaccca gctgaatcta agcggctaca gaaagaatct ccagaaatca 1081 gatctctcta ctttagttag aagatgcccc aatcttgtcc atctagactt aagtaatagt 1141 gtcatgctaa agaatgactg ctttcaggaa ttttcccagc tcaactacct ccaacaccta 1201 tcactcagtc ggtgctatga tataatacct gaaactttac ttgaacttgg agaaattccc 1261 acactaaaaa cactacaagt ttttggaatc gtgccagatg gtacccttca actgttaaag 1321 gaagcccttc ctcatctaca gattaattgc tcccatttca ccaccattgc caggccaact 1381 attggcaaca aaaagaacca ggagatatgg ggcatcaaat gccgactgac actgcaaaag 1441 cccagttgtc tatgaagtat ttattgcagg atggtgtctc ttctttagaa cagggaaaat 1501 aggcaggaag cccaattgct ggagtactta gctagtttta ttcttggttt tcccttttgc 1561 ctgtcattct gcaagtatac tagggagccc attttgagag // LOCUS HSU33818 2394 bp mRNA PRI 23-JAN-1996 DEFINITION Human inducible poly(A)-binding protein mRNA, complete cds. ACCESSION U33818 NID g1163176 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2394) AUTHORS Yang,H., Duckett,C.S. and Lindsten,T. TITLE iPABP, an inducible poly(A)-binding protein detected in activated human T cells JOURNAL Mol. Cell. Biol. 15 (12), 6770-6776 (1995) MEDLINE 96069385 REFERENCE 2 (bases 1 to 2394) AUTHORS Lindsten,T. TITLE Direct Submission JOURNAL Submitted (11-AUG-1995) Tullia Lindsten, Dept. of Medicine and the Gwen Knapp Center for Lupus and Immunology Research, University of Chicago, 924 E. 57th Street, Chicago, IL 60637, USA FEATURES Location/Qualifiers source 1..2394 /organism="Homo sapiens" /note="T cell cDNA library activated with PMA, ionomycin, and anti-CD28" /db_xref="taxon:9606" /cell_type="T cell" CDS 154..2088 /note="iPABP" /codon_start=1 /product="inducible poly(A)-binding protein" /db_xref="PID:g1163177" /translation="MNAAASSYPMASLYVGDLHSDVTEAMLYEKFSPAGPVLSIRVCR DMITRRSLGYAYVNFQQPADAERALDTMNFDVIKGKPIRIMWSQRDPSLRKSGVGNVF IKNLDKSIDNKALYDTFSAFGNILSCKVVCDENGSKGYAFVHFETQEAADKAIEKMNG MLLNDRKVFVGRFKSRKEREAELGAKAKEFTNVYIKNFGEEVDDESLKELFSQFGKTL SVKVMRDPNGKSKGFGFVSYEKHEDANKAVEEMNGKEISGKIIFVGRAQKKVERQAEL KRKFEQLKQERISRYQGVNLYIKNLDDTIDDEKLRKEFSPFGSITSAKVMLEDGRSKG FGFVCFSSPEEATKAVTEMNGRIVGSKPLYVALAQRKEERKAHLTNQYMQRVAGMRAL PANAILNQFQPAAGGYFVPAVPQAQGRPPYYTPNQLAQMRPNPRWQQGGRPQGFQGMP SAIRQSGPRPTLRHLAPTGSECPDRLAMDFGGAGAAQQGLTDSCQSGGVPTAVQNLAP RAAVAAAAPRAVAPYKYASSVRSPHPAIQPLQAPQPAVHVQGQEPLTASMLAAAPPQE QKQMLGERLFPLIQTMHSNLAGKITGMLLEIDNSELLHMLESPESLRSKVDEAVAVLQ AHHAKKEAAQKVGAVAAATS" BASE COUNT 624 a 579 c 635 g 556 t ORIGIN 1 gcatgtattc cccagccagc cgtccgtccg tcctggtcaa cggctagtcc tgcaggattc 61 cctaatgggc ctccatggga ctcagccaag agtaagagca tgaagtgggg gtgtggactc 121 ctggcggggc tcggggtggt ggggggcggg gagatgaacg ctgcggccag cagctacccc 181 atggcctccc tgtacgtggg cgacctgcat tcggacgtca ccgaggccat gctgtacgaa 241 aagttcagcc ccgcggggcc tgtgctgtcc atccgggtct gccgcgatat gatcacccgc 301 cgctccctgg gctatgccta cgtcaacttc cagcagccgg ccgacgctga gcgggctttg 361 gacaccatga actttgatgt gattaaggga aagccaatcc gcatcatgtg gtctcagagg 421 gatccctctt tgagaaaatc tggtgtggga aacgtcttca tcaagaacct ggacaaatct 481 atagataaca aggcacttta tgatactttt tctgcttttg gaaacatact gtcctgcaag 541 gtggtgtgtg atgagaacgg ctctaagggt tatgcctttg tccacttcga gacccaagag 601 gctgccgaca aggccatcga gaagatgaat ggcatgctcc tcaatgaccg caaagtattt 661 gtgggcagat tcaagtctcg caaagagcgg gaagctgagc ttggagccaa agccaaggaa 721 ttcaccaatg tttatatcaa aaactttggg gaagaggtgg atgatgagag tctgaaagag 781 ctattcagtc agtttggtaa gaccctaagt gtcaaggtga tgagagatcc caatgggaaa 841 tccaaaggct ttggctttgt gagttacgaa aaacacgagg atgccaataa ggctgtggaa 901 gagatgaatg gaaaagaaat aagtggtaaa atcatatttg taggccgtgc acaaaagaaa 961 gtagaacggc aggcagagtt aaaacggaaa tttgaacagt tgaaacagga gagaattagt 1021 cgatatcagg gggtgaatct ctacattaag aacttggatg acactattga tgatgagaaa 1081 ttaaggaaag aattttctcc ttttggatca attaccagtg ctaaggtaat gctggaggat 1141 ggaagaagca aagggtttgg cttcgtctgc ttctcatctc ctgaagaagc aaccaaagca 1201 gtcactgaga tgaatggacg cattgtgggc tccaagccac tatatgttgc cctggcccag 1261 aggaaggaag agagaaaggc tcacctgacc aaccagtata tgcaacgagt ggctggaatg 1321 agagcacttc ctgccaatgc catcttaaat cagttccagc ctgcagcggg tggctacttt 1381 gtgccagcag tcccacaggc tcagggaagg cctccatatt atacacctaa ccagttagca 1441 cagatgaggc ctaatccacg ctggcagcaa ggtgggagac ctcaaggctt ccaaggaatg 1501 ccaagtgcta tacgccagtc tgggcctcgt ccaactcttc gccatctggc tccaactggg 1561 tctgagtgcc cggaccgctt ggctatggac tttggtgggg ctggtgccgc ccagcaaggg 1621 ctgactgaca gctgccagtc tggaggcgtt cccacagctg tgcagaactt agcgccacgc 1681 gctgctgttg ctgctgctgc tccccgggct gttgccccct acaaatacgc ctccagtgtc 1741 cgcagccctc atcctgccat acagcctctg caggcacccc agcctgcggt ccatgtgcag 1801 gggcaggagc cactgactgc ctccatgctg gctgcagcac ccccccagga acagaagcag 1861 atgctgggag aacgcttgtt cccactcatc caaacaatgc attcaaatct ggctgggaag 1921 atcacgggaa tgctgctgga gatagacaac tctgagctgc tgcacatgtt agagtccccc 1981 gagtctctcc gctccaaggt ggatgaagct gtagcagttc tacaggctca tcatgccaag 2041 aaagaagctg cccagaaggt gggcgctgtt gctgctgcta cctcttagac aaggaaaaac 2101 cgattcaaaa gccaaataac cccttatgga attcaactca aggtttgaag acttcctagc 2161 ttgtcctatg gacctcaaca ccaaggatta caaattgcaa atttaatagg tcattttgta 2221 tcaaaaggtc aattatgaag cacctagaat ttttcaatta tacgaatatg ttctttgggt 2281 tctgctgtgg cccagacagt gttaactttt tttttattgt gggttttgat tttttccccc 2341 agaaattggt tttatttgat gtacccaagt cttacgtttc ccaataaaga aaaa // LOCUS HSU33821 1734 bp mRNA PRI 25-SEP-1995 DEFINITION Human tax1-binding protein TXBP151 mRNA, complete cds. ACCESSION U33821 NID g995834 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1734) AUTHORS Jin,D.Y. and Jeang,K.T. TITLE Direct Submission JOURNAL Submitted (13-AUG-1995) Dong-Yan Jin, Laboratory of Molecular Microbiology, NIAID, NIH, 9000 Rockville Pike, Bethesda, MD 20892-0460, USA FEATURES Location/Qualifiers source 1..1734 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa S3, exponentially growing, ATCC #CCL 2.2" /clone_lib="Clontech cDNA library cat.# HL4000A1, lot# 39042a" CDS 6..1697 /note="tax1-binding protein" /codon_start=1 /product="TXBP151" /db_xref="PID:g995835" /translation="MERELNHEKERCDQLQAEQKGLTEVTQSLKMENEEFKKRFSDAT SKAHHVEEDIVSVTHKAIEKETELDSLKDKLKKAQHEREQLECQLKTEKDEKELYKVH LKNTEIENTKLMSEVQTLKNLDGNKESVITHFKEEIGRLQLCLAEKENLQRTFLLTTS SKEDTCFLKEQLRKAEEQVQATRQEVVFLAKELSDAVNVRDRTMADLHTARLENEKVK KQLADAVAELKLNAMKKDQDKTDTLEHELRREVEDLKLRLQMAADHYKEKFKECQRLQ KQINKLSDQSANNNNVFTKKTGNQQKVNDASVNTDPATSASTVDVKPSPSAAEADFDI VTKGQVCEMTKEIADKTEKYNKCKQLLQDEKAKCNKYADELAKMELKWKEQVKIAENV KLELAEVQDNYKELKRSLENPAERKMEDGADGAFYPDEIQRPPVRVPSWGLEDNVVCS QPARNFSRPDGLEDSEDSKEDENVPTAPDPPSQHLRGHGTGFCFDSSFDVHKKCPLCE LMFPPNYDQSKFEEHVESHWKVCPMCSEQFPPDYDQQVFERHVQTHFDQNVLNFD" BASE COUNT 655 a 294 c 389 g 396 t ORIGIN 1 ggagaatgga aagagaactt aaccatgaga aagaaagatg tgaccaactg caagcagaac 61 aaaagggtct tactgaagta acacaaagct taaaaatgga aaatgaagag tttaagaaga 121 ggttcagtga tgctacatcc aaagcccatc acgttgagga agatattgtg tcagtaacac 181 ataaagcaat tgaaaaagaa accgaattag acagtttaaa ggacaaactc aagaaggcac 241 aacatgaaag agaacaactt gaatgtcagt tgaagacaga gaaggatgaa aaggaacttt 301 ataaggtaca tttgaagaat acagaaatag aaaataccaa gcttatgtca gaggtccaga 361 ctttaaaaaa tttagatggg aacaaagaaa gcgtgattac tcatttcaaa gaagagattg 421 gcaggctgca gttatgtttg gctgaaaagg aaaatctgca aagaactttc ctgcttacaa 481 cctcaagtaa agaagatact tgttttttaa aggagcaact tcgtaaagca gaggaacagg 541 ttcaggcaac tcggcaagaa gttgtctttc tggctaaaga actcagtgat gctgtcaacg 601 tacgagacag aacgatggca gacctgcata ctgcacgctt ggaaaacgag aaagtgaaaa 661 agcagttagc tgatgcagtg gcagaactta aactaaatgc tatgaaaaaa gatcaggaca 721 agactgatac actggaacac gaactaagaa gagaagttga agatctgaaa ctccgtcttc 781 agatggctgc agaccattat aaagaaaaat ttaaggaatg ccaaaggctc caaaaacaaa 841 taaacaaact ttcagatcaa tcagctaata ataataatgt cttcacaaag aaaacgggga 901 atcagcagaa agtgaatgat gcttcagtaa acacagaccc agccacttct gcctctactg 961 tagatgtaaa gccatcacct tctgcagcag aggcagattt tgacatagta acaaaggggc 1021 aagtctgtga aatgaccaaa gaaattgctg acaaaacaga aaagtataat aaatgtaaac 1081 aactcttgca ggatgagaaa gcaaaatgca ataaatatgc tgatgaactt gcaaaaatgg 1141 agctgaaatg gaaagaacaa gtgaaaattg ctgaaaatgt aaaacttgaa ctagctgaag 1201 tacaggataa ttataaagaa cttaaaagga gtctagaaaa tccagcagaa aggaaaatgg 1261 aagatggagc agatggtgct ttttacccag atgaaataca aaggccacct gtcagagtcc 1321 cctcttgggg actggaagac aatgttgtct gcagccagcc tgctcgaaac tttagtcggc 1381 ctgatggctt agaggactct gaggatagca aagaagatga gaatgtgcct actgctcctg 1441 atcctccaag tcaacattta cgtgggcatg ggacaggctt ttgctttgat tccagctttg 1501 atgttcacaa gaagtgtccc ctctgtgagt taatgtttcc tcctaactat gatcagagca 1561 aatttgaaga acatgttgaa agtcactgga aggtgtgccc gatgtgcagc gagcagttcc 1621 ctcctgacta tgaccagcag gtgtttgaaa ggcatgtgca gacccatttt gatcagaatg 1681 ttctaaattt tgactagtta ctttttatta tgagttaata tagtttagca gtaa // LOCUS HSU33822 1466 bp mRNA PRI 25-SEP-1995 DEFINITION Human tax1-binding protein TXBP181 mRNA, complete cds. ACCESSION U33822 NID g995836 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1466) AUTHORS Jin,D.Y. and Jeang,K.T. TITLE Direct Submission JOURNAL Submitted (13-AUG-1995) Dong-Yan Jin, Laboratory of Molecular Microbiology, NIAID, NIH, 9000 Rockville Pike, Bethesda, MD 20892-0460, USA FEATURES Location/Qualifiers source 1..1466 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa S3, exponentially growing, ATCC# CCL 2.2" /clone_lib="Clontech cDNA library cat.# HL4000A1, lot# 39042a" CDS 6..1451 /note="tax1-binding protein" /codon_start=1 /product="TXBP181" /db_xref="PID:g995837" /translation="MGLSIRTPEDLSRFVVELQQRELALKDKNSAVTSSARGLEKARQ QLQEELRQVSGQLLEERKKRETHEALARRLQKRVLLLTKERDGMRAILGSYDSELTPA EYSPQLTRRMREAEDMVQKVHSHSAEMEAQLSQALEELGGQKQRADMLEMELKMLKSQ SSSAEQSFLFSREEADTLRLKVEELEGERSRLEEEKRMLEAQLERRALQGDYDQSRTK VLHMSLNPTSVARQRLREDHSQLQAECERLRGLLRAMERGGTVPADLEAAAASLPSSK EVAELKKQVESAELKNQRLKEVFQTKIQEFRKACYTLTGYQIDITTENQYRLTSLYAE HPGDCSSSRPPAPRVPRCSYWRQSSHTPWASSSRCTCGARTASLPSSARSPSSSSAAR PWRSLQARGHSRSHSAWPDLQVPCPASHRLGARPASPAPQGSSMTDRHAGTYVGLPAG AASTLSTCRPHASRSLVCGRRPPAWVPHLVK" BASE COUNT 312 a 454 c 489 g 211 t ORIGIN 1 agaccatggg cctgagcatc aggactccag aagacctttc cagattcgtg gttgagctgc 61 agcagaggga gcttgccttg aaggacaaga acagcgccgt caccagcagc gcccgggggc 121 tggagaaggc caggcagcag ctgcaggagg agctccggca ggtcagcggc cagctgttgg 181 aggagaggaa gaagcgcgag acccacgagg cgctggcccg gaggctccag aaacgggtcc 241 tgctgctcac caaggagcgg gacggtatgc gggccatcct ggggtcctac gacagcgagc 301 tgaccccggc cgagtactca ccccagctga cgcggcgcat gcgggaggct gaggatatgg 361 tgcagaaggt gcacagccac agcgccgaga tggaggctca gctgtcgcag gccctggagg 421 agctgggagg ccagaaacaa agagcagaca tgctggagat ggagctgaag atgctgaagt 481 ctcagtccag ctctgccgaa cagagcttcc tgttctccag ggaggaggcg gacacgctca 541 ggttgaaggt cgaggagctg gaaggcgagc ggagtcggct ggaggaggaa aagaggatgc 601 tggaggcaca gctggagcgg cgagctctgc agggtgacta tgaccagagc aggaccaaag 661 tgctgcacat gagcctgaac cccaccagtg tggccaggca gcgcctgcgc gaggaccaca 721 gccagctgca ggcggagtgc gagcgactgc gcgggctcct gcgcgccatg gagagaggag 781 gcaccgtccc agccgacctt gaggctgccg ccgcgagtct gccatcgtcc aaggaggtgg 841 cagagctgaa gaagcaggtg gagagtgccg agctgaagaa ccagcggctc aaggaggttt 901 tccagaccaa gatccaggag ttccgcaagg cctgctacac gctcaccggc taccagatcg 961 acatcaccac ggagaaccag taccggctga cctcgctgta cgccgagcac ccaggcgact 1021 gctcatcttc aaggccacca gcccctcggg ttccaagatg cagctactgg agacagagtt 1081 ctcacacacc gtgggcgagc tcatcgaggt gcacctgcgg cgccaggaca gcatccctgc 1141 cttcctcagc tcgctcaccc tcgagctctt cagccgccag accgtggcgt agcctgcagg 1201 ctcgggggca tagccggagc cactctgctt ggcctgacct gcaggtcccc tgccccgcca 1261 gccacaggct gggtgcacgt cctgcctctc cagccccaca gggcagcagc atgactgaca 1321 gacacgctgg gacctacgtc gggcttcctg ctggggcggc cagcaccctc tccacgtgca 1381 gaccccatgc gtcccggagc ctggtgtgtg ggcgtcggcc accagcctgg gttcctcacc 1441 ttgtgaaata aaatcttctc ccctaa // LOCUS HSU33839 795 bp mRNA PRI 25-SEP-1995 DEFINITION Human potassium channel mRNA, complete cds. ACCESSION U33839 NID g995832 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 795) AUTHORS Li,Q., Hornby,D.P. and White,S.J. TITLE Direct Submission JOURNAL Submitted (12-AUG-1995) Quan Li, Molecular Biology, Sheffield University, West Bank, Sheffield, S10 2TN, UK FEATURES Location/Qualifiers source 1..795 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="cloning vector plasmid (pGex-KG)" /tissue_type="kidney, distal tubules" CDS 1..795 /codon_start=1 /product="potassium channel" /db_xref="PID:g995833" /translation="MPFIRSEPRLRVLSWHARKYQTPGPMFRAMTPRRRMISGLLEIS CGRNTMRLASNPYCRKSCAGNRPRLSGWRRWPADLAGFSKCTTDSCSTSECIINGGIT GFSPRRAITALLILPTPLWSGSSDSGGIRPCAISCDKNVTTSSAICWHMGFWLSSILP ASGKLVSTTATIFFGSDLNIAGTDAIHCMEDRHAIAPRWRDRNHNIRQFWNGRIVIVI ESMITFFPRSSQVGALPIPLVNQTRPSGASADASMTKHRCRRRNRN" BASE COUNT 188 a 211 c 210 g 186 t ORIGIN 1 atgccgttca tcagatccga accccggcta cgtgtgttga gttggcatgc ccgtaaatat 61 caaactccag gcccgatgtt cagagcgatg acgcccagac gacggatgat ttccgggtta 121 ttggaaatct cctgcggacg caacacaatg cggttggcaa gtaatccata ttgtcgcaaa 181 tcttgcgcag ggaatcggcc gagattgtca ggctggaggc gctggcccgc tgatttagcc 241 ggtttcagta aatgcaccac cgattcctgt agcacttccg aatgcatcat aaacggagga 301 attaccgggt tttcccccag acgcgccatt accgcattat tgatattgcc cacgccactt 361 tggagcggta gcagtgattc aggcggaata cgcccatgcg ccatttcctg tgataagaac 421 gtgaccacgt catcggcaat ctgctggcac atgggatttt ggttatccag catattaccg 481 gcgtcgggta agttggtttc cacgacggcg acaatctttt tcggatccga tttgaacata 541 gcgggtaccg acgcgatcca ttgtatggaa gatcgacacg ctattgcgcc gcggtggcgc 601 gacaggaatc acaatatccg ccagttctgg aacgggcgga tcgtgatagt gattgaatca 661 atgatcactt tcttcccccg gagtagccag gtcggcgcat taccgatccc gctggttaac 721 cagactcgac catccggtgc cagtgccgat gcttcaatga cgaaacatcg atgtcgccga 781 agaaaccgta attga // LOCUS HSU33920 2719 bp mRNA PRI 10-JUL-1996 DEFINITION Human clone lambda 5 semaphorin mRNA, complete cds. ACCESSION U33920 NID g1000206 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2719) AUTHORS Roche,J., Boldog,F., Robinson,M., Robinson,L., Varella-Garcia,L., Swanton,M., Waggoner,B., Fishel,R., Franklin,W., Gemmill,R. and Drabkin,H. TITLE Distinct 3p21.3 deletions in lung cancer and identification of a new human semaphorin JOURNAL Oncogene 12 (6), 1289-1297 (1996) MEDLINE 96226360 REFERENCE 2 (bases 1 to 2719) AUTHORS Drabkin,H.A. TITLE Direct Submission JOURNAL Submitted (14-AUG-1995) Harry A. Drabkin, Division of Medical Oncology, University of Colorado Health Sciences Center, 4200 East 9th Ave., Denver, CO 80262, USA FEATURES Location/Qualifiers source 1..2719 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="lambda 5" /clone_lib="Stragene fetal brain" /chromosome="3" /map="3p21.3" /tissue_type="brain" /dev_stage="fetus" CDS 79..2436 /codon_start=1 /product="semaphorin" /db_xref="PID:g1000207" /translation="MLVAGLLLWASLLTGAWPSFPTQDHLPATPRVRLSFKELKATGT AHFFNFLLNTTDYRILLKDEDHDRMYVGSKDYVLSLDLHDINREPLIIHWAASPQRIE ECVLSGKDVNGECGNFVRLIQPWNRTHLYVCGTGAYNPMCTYVNRGRRAQATPWTQTQ AVRGRGSRATDGALRPMPTAPRQDYIFYLEPERLESGKGKCPYDPKLDTASALINEEL YAGVYIDFMGTDAAIFRTLGKQTAMRTDQYNSRWLNDPSFIHAELIPDSAERNDDKLY FFFRERSAEAPQSPAVYARIGRICLNDDGGHCCLVNKWSTFLKARLVCSVPGEDGIET HFDELQDVFVQQTQDVRNPVIYAVFTSSGSVFRGSAVCVYSMADIRMVFNGPFAHKEG PNYQWMPFSGKMPYPRPGTCPGGTFTPSMKSTKDYPDEVINFMRSHPLMYQAVYPLQR RPLVVRTGAPYRLTTIAVDQVDAGDGRYEVLFLGTDRGTVQKVIVLPKDDQEMEELML EEVEVFKDPAPVKTMTISSKRQQLYVASAVGVTHLSLHRCQAYGAACADCCLARDPYC AWDGQACSRYTASSKRRSRRQDVRHGNPIRQCRGFNSNANKNAVESVQYGVAGSAAFL ECQPRSPQATVKWLFQRDPGDRRREIRAEDRFLRTEQGLLLRALQLSDRGLYSCTATE NNFKHVVTRVQLHVLGRDAVHAALFPPLSMSAPPPPGAGPPTPPYQELAQLLAQPEVG LIHQYCQGYWRHVPPSPREAPGAPRSPEPQDQKKPRNRRHHPPDT" BASE COUNT 551 a 872 c 768 g 528 t ORIGIN 1 cggggcccag gccccgccgc tgcggaagag gtttctagag agtggagcct gcttcctggg 61 ccctaggccc ctcccacaat gcttgtcgcc ggtcttcttc tctgggcttc cctactgacc 121 ggggcctggc catccttccc cacccaggac cacctcccgg ccacgccccg ggtccggctc 181 tcattcaaag agctgaaggc cacaggcacc gcccacttct tcaacttcct gctcaacaca 241 accgactacc gaatcttgct caaggacgag gaccacgacc gcatgtacgt gggcagcaag 301 gactacgtgc tgtccctgga cctgcacgac atcaaccgcg agcccctcat tatacactgg 361 gcagcctccc cacagcgcat cgaggaatgc gtgctctcag gcaaggatgt caacggcgag 421 tgtgggaact tcgtcaggct catccagccc tggaaccgaa cacacctgta tgtgtgcggg 481 acaggtgcct acaaccccat gtgcacctat gtgaaccgcg gacgccgcgc ccaggccaca 541 ccatggaccc agactcaggc ggtcagaggc cgcggcagca gagccacgga tggtgccctc 601 cgcccgatgc ccacagcccc acgccaggat tacatcttct acctggagcc tgagcgactc 661 gagtcaggga agggcaagtg tccgtacgat cccaagctgg acacagcatc ggccctcatc 721 aatgaggagc tctatgctgg tgtgtacatc gattttatgg gcactgatgc agccatcttc 781 cgcacacttg gaaagcagac agccatgcgc acggatcagt acaactcccg gtggctgaac 841 gacccgtcgt tcatccatgc tgagctcatt cctgacagtg cggagcgcaa tgatgataag 901 ctttacttct tcttccgtga gcggtcggca gaggcgccgc agagccccgc ggtgtacgcc 961 cgcatcgggc gcatttgcct gaacgatgac ggtggtcact gttgcctggt caacaagtgg 1021 agcacattcc tgaaggcgcg gctcgtctgc tctgtcccgg gcgaggatgg cattgagact 1081 cactttgatg agctccagga cgtgtttgtc cagcagaccc aggacgtgag gaaccctgtc 1141 atttacgctg tctttacctc ctctggctcc gtgttccgag gctctgccgt gtgtgtctac 1201 tccatggctg atattcgcat ggtcttcaac gggccctttg cccacaaaga ggggcccaac 1261 taccagtgga tgcccttctc agggaagatg ccctacccac ggccgggcac gtgccctggt 1321 ggaaccttca cgccatctat gaagtccacc aaggattatc ctgatgaggt gatcaacttc 1381 atgcgcagcc acccactcat gtaccaggcc gtgtaccctc tgcagcggcg gcccctggta 1441 gtccgcacag gtgctcccta ccgccttacc actattgccg tggaccaggt ggatgcaggc 1501 gacgggcgct atgaggtgct tttcctgggc acagaccgcg ggacagtgca gaaggtcatt 1561 gtgctgccca aggatgacca ggagatggag gagctcatgc tggaggaggt ggaggtcttc 1621 aaggatccag cacccgtcaa gaccatgacc atctcttcta agaggcaaca actctacgtg 1681 gcgtcagccg tgggtgtcac acacctgagc ctgcaccgct gccaggcgta tggggctgcc 1741 tgtgctgact gctgccttgc ccgggaccct tactgtgcct gggatggcca ggcctgctcc 1801 cgctatacag catcctccaa gaggcggagc cgccggcagg acgtccggca cggaaacccc 1861 atcaggcagt gccgtgggtt caactccaat gccaacaaga atgccgtgga gtctgtgcag 1921 tatggcgtgg ccggcagcgc agccttcctt gagtgccagc cccgctcgcc ccaagccact 1981 gttaagtggc tgttccagcg agatcctggt gaccggcgcc gagagattcg tgcagaggac 2041 cgcttcctgc gcacagagca gggcttgttg ctccgtgcac tgcagctcag cgatcgtggc 2101 ctctactcct gcacagccac tgagaacaac tttaagcacg tcgtcacacg agtgcagctg 2161 catgtactgg gccgggacgc cgtccatgct gccctcttcc caccactgtc catgagcgcc 2221 ccgccacccc caggcgcagg ccccccaacg cctccttacc aggagttagc ccagctgctg 2281 gcccagccag aagtgggcct catccaccag tactgccagg gttactggcg ccatgtgccc 2341 cccagcccca gggaggctcc aggggcaccc cggtctcctg agccccagga ccagaaaaag 2401 ccccggaacc gccggcacca ccctccggac acatgaggcc agctgcctgt gcctgccatg 2461 ggccaggcta ggccttggtc ccttttaata taaaagatat atatatatat atatatatat 2521 attaaaatat cggggtgggg ggtgattgga agggagggag gtggccttcc caatgcgcgt 2581 tattcggggt tattgaagaa taatattgca agtgacagcc agaagtagac tttctgtcct 2641 cacaccgaag aacccgagtg agcaggaggg agggagagac gcgaagagac cttttttcct 2701 ttttggagac cttgtccgc // LOCUS HSU34038 1451 bp mRNA PRI 21-FEB-1997 DEFINITION Human proteinase-activated receptor-2 mRNA, complete cds. ACCESSION U34038 NID g1041728 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1451) AUTHORS Bohm,S.K., Kong,W., Bromme,D., Smeekens,S.P., Anderson,D.C., Connolly,A., Kahn,M., Nelken,N.A., Coughlin,S.R., Payan,D.G. and Bunnett,N.W. TITLE Molecular cloning, expression and potential functions of the human proteinase-activated receptor-2 JOURNAL Biochem. J. 314 (Pt 3), 1009-1016 (1996) MEDLINE 96177879 REFERENCE 2 (bases 1 to 1451) AUTHORS Bohm,S. TITLE Direct Submission JOURNAL Submitted (15-AUG-1995) Stephan Bohm, Department of Surgery, School of Medicine, University of California at San Francisco, 521 Parnassus Avenue, San Francisco, CA 94143-0660, USA FEATURES Location/Qualifiers source 1..1451 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" 5'UTR 1..147 CDS 148..1341 /note="G-protein-coupled receptor" /codon_start=1 /product="proteinase-activated receptor-2" /db_xref="PID:g1041729" /translation="MRSPSAAWLLGAAILLAASLSCSGTIQGTNRSSKGRSLIGKVDG TSHVTGKGVTVETVFSVDEFSASVLTGKLTTVFLPIVYTIVFVVGLPSNGMALWVFLF RTKKKHPAVIYMANLALADLLSVIWFPLKIAYHIHANNWIYGEALCNVLIGFFYGNMY CSILFMTCLSVQRYWVIVNPMGHSRKKANIAIGISLAIWLLILLVTIPLYVVKQTIFI PALNITTCHDVLPEQLLVGDMFNYFLSLAIGVFLFPAFLTASAYVLMIRMLRSSAMDE NSEKKRKRAIKLIVTVLAMYLICFTPSNLLLVVHYFLIKSQGQSHVYALYIVALCLST LNSCIDPFVYYFVSHDFRDHAKNALLCRSVRTVKQMQVSLTSKKHSRKSSSYSSSSTT VKTSY" 3'UTR 1342..1451 BASE COUNT 310 a 389 c 346 g 406 t ORIGIN 1 cggcccgccc tggggaggcg cgcagcagag gctccgattc ggggcaggtg agaggctgac 61 tttctctcgg tgcgtccagt ggagctctga gtttcgaatc ggtggcggcg gattccccgc 121 gcgcccggcg tcggggcttc caggaggatg cggagcccca gcgcggcgtg gctgctgggg 181 gccgccatcc tgctagcagc ctctctctcc tgcagtggca ccatccaagg aaccaataga 241 tcctctaaag gaagaagcct tattggtaag gttgatggca catcccacgt cactggaaaa 301 ggagttacag ttgaaacagt cttttctgtg gatgagtttt ctgcatctgt cctcactgga 361 aaactgacca cggtcttcct tccaattgtc tacacaattg tgtttgtggt gggtttgcca 421 agtaacggca tggccctgtg ggtctttctt ttccgaacta agaagaagca ccctgctgtg 481 atttacatgg ccaatctggc cttggctgac ctcctctctg tcatctggtt ccccttgaag 541 attgcctatc acatacatgc caacaactgg atttatgggg aagctctttg taatgtgctt 601 attggctttt tctatggcaa catgtactgt tccattctct tcatgacctg cctcagtgtg 661 cagaggtatt gggtcatcgt gaaccccatg gggcactcca ggaagaaggc aaacattgcc 721 attggcatct ccctggcaat atggctgctg attctgctgg tcaccatccc tttgtatgtc 781 gtgaagcaga ccatcttcat tcctgccctg aacatcacga cctgtcatga tgttttgcct 841 gagcagctct tggtgggaga catgttcaat tacttcctct ctctggccat tggggtcttt 901 ctgttcccag ccttcctcac agcctctgcc tatgtgctga tgatcagaat gctgcgatct 961 tctgccatgg atgaaaactc agagaagaaa aggaagaggg ccatcaaact cattgtcact 1021 gtcctggcca tgtacctgat ctgcttcact cctagtaacc ttctgcttgt ggtgcattat 1081 tttctgatta agagccaggg ccagagccat gtctatgccc tgtacattgt agccctctgc 1141 ctctctaccc ttaacagctg catcgacccc tttgtctatt actttgtttc acatgatttc 1201 agggatcatg caaagaacgc tctcctttgc cgaagtgtcc gcactgtaaa gcagatgcaa 1261 gtatccctca cctcaaagaa acactccagg aaatccagct cttactcttc aagttcaacc 1321 actgttaaga cctcctattg agttttccag gtcctcagat gggaattgca cagtaggatg 1381 tggaacctgt ttaatgttat gaggacgtgt ctgttatttc ctaatcaaaa aggtctcacc 1441 acataccacc g // LOCUS HSU34044 1672 bp mRNA PRI 14-FEB-1996 DEFINITION Human selenium donor protein (selD) mRNA, complete cds. ACCESSION U34044 NID g1000283 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1672) AUTHORS Low,S.C., Harney,J.W. and Berry,M.J. TITLE Cloning and functional characterization of human selenophosphate synthetase, an essential component of selenoprotein synthesis JOURNAL J. Biol. Chem. 270 (37), 21659-21664 (1995) MEDLINE 95394923 REFERENCE 2 (bases 1 to 1672) AUTHORS Low,S.C. TITLE Direct Submission JOURNAL Submitted (15-AUG-1995) Susan C. Low, Brigham and Women's Hospital, 75 Francis Street, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..1672 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" gene 268..1419 /gene="selD" CDS 268..1419 /gene="selD" /codon_start=1 /product="selenium donor protein" /db_xref="PID:g1000284" /translation="MSTRESFNPESYELDKSFRLTRFTELKGTGCKVPQDVLQKLLES LQENHFQEDEQFLGAVMPRLGIGMDTCVIPLRHGGLSLVQTTDYIYPIVDDPYMMGRI ACANVLSDLYAMGVTECDNMLMLLGVSNKMTDRERDKVMPLIIQGFKDAAEEAGTSVT GGQTVLNPWIVLGGVATTVCQPNEFIMPDNAVPGDVLVLTKPLGTQVAVAVHQWLDIP EKWNKIKLVVTQEDVELAYQEAMMNMARLNRTAAGLMHTFNTHAATDITGFGILGHAQ NLAKQQRNEVSFVIHNLPVLAKMAAVSKACGNMFGLMHGTCPETSGGLLICLPREQAA RFCAEIKSPKYGEGHQAWIIGIVEKGNRTARIIDKPRIIEVAHKWPLKT" BASE COUNT 436 a 418 c 472 g 346 t ORIGIN 1 ggcggcggcc ggctcccgca ggcgccgagg caggcgagcc cccggcccca ggcgcccggg 61 ccccgccgcc gcccgcgcgc ggcgcggcat tttattcagg cgacgcttaa gggagccggc 121 cgcgcccggt gcattgtggg aggcccgcgg ccgttttcgg gaggaaggcg gaggggccaa 181 agcgagccgg tggatccata aagaacccag ccaacccgca gagggagggg aggggctgag 241 ctgtgaggag agcggggccc aagaaccatg tctacgcggg agtcctttaa cccggaaagt 301 tacgaattgg acaaaagctt ccggctaacc agattcactg aactgaaggg cacaggctgc 361 aaagtgcccc aagatgtcct gcaaaaattg ctggaatctt tacaggagaa ccacttccaa 421 gaagatgagc agtttctggg agccgttatg ccaaggcttg gcattggaat ggatacttgt 481 gtcattcctt tgaggcacgg tgggctttcc ttggttcaaa ccacagatta catttacccg 541 atcgtagacg acccttacat gatgggcagg atagcgtgtg ccaatgtcct cagtgacctc 601 tatgcaatgg gggtcacgga atgtgacaat atgctgatgc tccttggagt cagtaataaa 661 atgaccgaca gggaaaggga taaagtgatg cctctgatta tccaaggttt taaagacgca 721 gctgaggaag caggaacgtc tgtaacaggc ggccaaacag tactaaaccc ctggattgtc 781 ctgggaggag tggctaccac tgtctgccaa cccaatgaat ttatcatgcc agacaatgca 841 gtgccagggg acgtgctggt gctgacaaaa cccctgggga cacaggtggc agtggctgtg 901 caccagtggc tggatatccc tgagaaatgg aataagatta aactagtggt cacccaagaa 961 gatgtagagc tggcctacca ggaggcgatg atgaacatgg cgaggctcaa caggacagct 1021 gcaggactca tgcacacgtt caatacccac gccgccactg acatcacggg cttcgggatt 1081 ttgggccatg cgcagaacct ggccaagcag cagaggaacg aggtgtcgtt tgtaattcac 1141 aacctcccgg tgctggccaa gatggctgcg gtgagcaagg cctgcggaaa catgttcggc 1201 ctcatgcacg ggacctgccc ggagacttca ggcggccttc tgatctgttt accacgtgag 1261 caagcagctc ggttctgtgc agagataaag tcccccaaat atggtgaagg ccaccaagca 1321 tggattattg ggattgtaga gaagggcaac cgcacagcca gaatcataga caaaccccgg 1381 atcatcgagg tcgcacacaa gtggccactc aaaacgtgaa tcccacaccc ggggccacct 1441 cttaatctag acagaaatag ctgtttggtt ttgtttttaa atagatctat ttcccttatc 1501 acttcaatta aagactataa acaacaaaaa tctcattgtg tctacacatc ggggtgacct 1561 taggtcggtt tgtaagtgga tacaattaat aaaataaaat ccattgcctt tttttcctgt 1621 tacattaact gaagatcgac ctaatcttga ggcagcttct gagttgagaa tt // LOCUS HSU34051 1186 bp mRNA PRI 15-NOV-1995 DEFINITION Human cyclin-dependent kinase 5 activator isoform p39i mRNA, complete cds. ACCESSION U34051 NID g1063622 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1186) AUTHORS Tang,D., Yeung,J., Lee,K.Y., Matsushita,M., Matsui,H., Tomizawa,K., Hatase,O. and Wang,J.H. TITLE An isoform of the neuronal cyclin-dependent kinase 5 (Cdk5) activator JOURNAL J. Biol. Chem. 270 (45), 26897-26903 (1995) MEDLINE 96070784 REFERENCE 2 (bases 1 to 1186) AUTHORS Tang,D. TITLE Direct Submission JOURNAL Submitted (16-AUG-1995) Damu Tang, Biochemistry, Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong FEATURES Location/Qualifiers source 1..1186 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="hippocampus" CDS 70..1173 /codon_start=1 /product="cyclin-dependent kinase 5 activator isoform p39i" /db_xref="PID:g1063623" /translation="MGTVLSLSPASSAKGRRPGGLPEEKKKAPPAGDEALGGYGAPPV GKGGKGESRLKRPSVLISALTWKRLVAASAKKKKGSKKVTPKPASTGPDPLVQQRNRE NLLRKGRDPPDGGGTAKPLAVPVPTVPAAAATCEPPSGGSAAAQPPGSGGGKPPPPPP PAPQVAPPVPGGSPRRVIVQASTGELLRCLGDFVCRRCYRLKELSPGELVGWFRGVDR SLLLQGWQDQAFITPANLVFVYLLCRESLRGDELASAAELQAAFLTCLYLAYSYMGNE ISYPLKPFLVEPDKERFWQRCLRLIQRLSPQMLRLNADPHFFTQVFQDLKNEGEAAAS GGGPPSGGAPAASSAARDSCAAGTKHWTMNLDR" BASE COUNT 172 a 441 c 418 g 155 t ORIGIN 1 ggggctgcag tagcagcggc gccgcccgcg gctcccgctg gggcctgggc gccggccccg 61 ctctgcagga tgggcacagt gctgtctctt tcgcctgcct cctcggccaa gggccggagg 121 cccggcgggc tgcccgagga gaagaagaag gcgccgcccg cgggggacga ggcgctgggg 181 ggctacgggg cgccgccagt gggcaagggc ggcaaaggcg agagccgact caagcggccg 241 tccgtgctca tctcggcgct cacctggaag cgcctggtgg ccgcgtccgc caagaagaag 301 aaaggcagca agaaggtgac acccaagccg gcatccacgg gccccgaccc cctggtccag 361 caacgcaacc gcgagaacct tctccgcaag ggccgggatc cccccgacgg cggcggcacc 421 gccaagcccc tggcggtgcc agtgcccacc gtgcccgcgg ctgccgccac ctgcgagcca 481 ccgtcggggg gcagcgcggc cgctcagccg ccgggctcgg gcgggggaaa gcctccgccg 541 ccgcctcccc cagccccgca ggtggcgccg ccggtgcctg gcggctcgcc gcggcgggtc 601 atcgtgcagg cgtccaccgg cgagctgctg cgctgtctgg gcgacttcgt gtgccgacgc 661 tgctatcgcc tcaaggagct gagcccgggc gagctggtgg gctggttccg cggtgtggac 721 cgctcgctgc tgctgcaggg ctggcaagac caggccttca ttacgcctgc aaacctggtg 781 ttcgtgtacc tgctgtgccg cgagtcgctg cgtggggacg agctggcgtc ggccgccgag 841 ctgcaggccg ccttcctcac ctgcctctac ctcgcctact cctacatggg caacgagatc 901 tcctacccac tcaagccctt cctcgtggag cccgacaagg agcgcttctg gcagcgctgc 961 ctgcgcctca tccagcggct cagcccgcag atgctgcggc tcaacgccga cccccacttc 1021 ttcacgcagg tctttcaaga cctcaagaac gagggcgagg ccgccgccag cggtgggggc 1081 ccaccgagcg ggggcgcgcc cgccgcctcc tcggccgcca gggacagctg cgcggccgga 1141 accaagcact ggactatgaa cctggaccgc tagggatacc cagggg // LOCUS HSU34070 3318 bp DNA PRI 22-OCT-1995 DEFINITION Human CCAAT/enhancer binding protein alpha gene, complete cds. ACCESSION U34070 NID g1041732 KEYWORDS Transcription factor; DNA binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3318) AUTHORS Antonson,P. and Xanthopoulos,K.G. TITLE Molecular cloning, sequence, and expression patterns of the human gene encoding CCAAT/enhancer binding protein alpha (C/EBP alpha) JOURNAL Biochem. Biophys. Res. Commun. 215 (1), 106-113 (1995) MEDLINE 96003748 REFERENCE 2 (bases 1 to 3318) AUTHORS Antonson,P. and Xanthopoulos,K.G. TITLE Direct Submission JOURNAL Submitted (16-AUG-1995) Per Antonson, Department of Bioscience at Novum, Karolinska Institute, Huddinge S-141 57, Sweden FEATURES Location/Qualifiers source 1..3318 /organism="Homo sapiens" /db_xref="taxon:9606" /map="19q13.1" /chromosome="19" /tissue_type="umbilical cord" promoter 1..471 TATA_signal 442..448 CDS 592..1668 /note="bZIP transcription factor; C/EBPa" /codon_start=1 /product="CCAAT/enhancer binding protein alpha" /db_xref="PID:g1041733" /translation="MESADFYEAEPRPPMSSHLQSPPHAPSSAAFGFPRGAGPPKPPA PPAAPEPLGGICEHETSIDISAYIDPAAFNDEFLADLFQHSRQQEKAKAAVGPTGGGG GGDFDYPGAPAGPGGAVMPGGAHGPPPGYGCAAAGYLDGRLEPLYERVGAPALRPLVI KQEPREEDEAKQLALAGLFPYQPPPPPPPSHPHPHPPPAHLAAPHLQFQIAHCGQTTM HLQPGHPTPPPTPVPSPHPAPALGAAGLPGPGSALKGLGAAHPDLRASGGTGAGKAKK SVDKNSNEYRVRRERNNIAVRKSRDKAKQRNVETQQKVLELTSDNDRLRKRVEQLSRE LDTLRGIFRQLPESSLVKAMGNCA" polyA_signal 3249..3254 BASE COUNT 566 a 1099 c 1096 g 557 t ORIGIN 1 ctgcagcctc cccgggacgc gggtccggga caggcctggt tctggctttg aaagagaatc 61 cgcgccccag cagctcaaga ccaagactcg ccctccgccc cccaccccta ccccgtgcag 121 cctcgggata ctcctgggct cccggccgtg gctggatacg ggcgcctagg gcaggcagga 181 ggagggggcc cccgctaccg accacgtggg cgcgggggcg acggccgggc cgggggcgga 241 gcttggagcg agcgccgcgg ctctgctggg cgcgctggag gcggtgggcg ttgcgccgcg 301 gcctgcctgg ggagcgcggc gctgtgccgc gtggttcgcc gccccatgcc ggccgcgcgc 361 taggacccag caggcgccgc gccgccgcag cccggggaca gaggccgcct cggactctag 421 ggggcgacgc ggcctgccgg gtataaaagc tgggccggcg cgggccgggc cattcgcgac 481 ccggaggtgc gcgggcgcgg gcgagcaggg tctccgggtg ggcggcggcg acgccccgcg 541 caggctggag gccgccgagg ctcgccatgc cgggagaact ctaactcccc catggagtcg 601 gccgacttct acgaggcgga gccgcggccc ccgatgagca gccacctgca gagccccccg 661 cacgcgccca gcagcgccgc cttcggcttt ccccggggcg cgggcccgcc gaagcctccc 721 gccccacctg ccgccccgga gccgctgggc ggcatctgcg agcacgagac gtccatcgac 781 atcagcgcct acatcgaccc ggccgccttc aacgacgagt tcctggccga cctgttccag 841 cacagccggc agcaggagaa ggccaaggcg gccgtgggcc ccacgggcgg cggcggcggc 901 ggcgactttg actacccggg cgcgcccgcg ggccccggcg gcgccgtcat gcccggggga 961 gcgcacgggc ccccgcccgg ctacggctgc gcggccgccg gctacctgga cggcaggctg 1021 gagcccctgt acgagcgcgt cggggcgccg gcgctgcggc cgctggtgat caagcaggag 1081 ccccgcgagg aggatgaagc caagcagctg gcgctggccg gcctcttccc ttaccagccg 1141 ccgccgccgc cgccgccctc gcacccgcac ccgcacccgc cgcccgcgca cctggccgcc 1201 ccgcacctgc agttccagat cgcgcactgc ggccagacca ccatgcacct gcagcccggt 1261 caccccacgc cgccgcccac gcccgtgccc agcccgcacc ccgcgcccgc gctcggtgcc 1321 gccggcctgc cgggccctgg cagcgcgctc aaggggctgg gcgccgcgca ccccgacctc 1381 cgcgcgagtg gcggcacggg cgcgggcaag gccaagaagt cggtggacaa gaacagcaac 1441 gagtaccggg tgcggcgcga gcgcaacaac atcgcggtgc gcaagagccg cgacaaggcc 1501 aagcagcgca acgtggagac gcagcagaag gtgctggagc tgaccagtga caatgaccgc 1561 ctgcgcaagc gggtggaaca gctgagccgc gaactggaca cgctgcgggg catcttccgc 1621 cagctgccag agagctcctt ggtcaaggcc atgggcaact gcgcgtgagg cgcgcggctg 1681 tgggaccgcc ctgggccagc ctccggcggg gacccaggga gtggtttggg gtcgccggat 1741 ctcgaggctt gcccgagccg tgcgagccag gactaggaga ttccggtgcc tcctgaaagc 1801 ctggcctgct ccgcgtgtcc cctcccttcc tctgcgccgg acttggtgcg tctaagatga 1861 gggggccagg cggtggcttc tccctgcgag gaggggagaa ttcttggggc tgagctggga 1921 gcccggcaac tctagtattt aggataacct tgtgccttgg aaatgcaaac tcaccgctcc 1981 aatgcctact gagtaggggg agcaaatcgt gccttgtcat tttatttgga ggtttcctgc 2041 ctccttcccg aggctacagc agacccccat gagagaagga aggggagcag gcccgtggca 2101 ggaggagggc tcagggagct gagatcccga caagcccgcc agccccagcc gctcctccac 2161 gcctgtcctt agaaaggggt ggaaacatag ggacttgggg cttggaacct aaggttgttc 2221 ccctagttct acatgaaggt ggagggtctc tagttccacg cctctcccac ctccctccgc 2281 acacacccca cccccagcct gctataggct gggcttccct tgggcggaac tcactgcgat 2341 gggggtcacc aggtgaccag tgggagcccc caccccgagt cacaccagaa agctaggtcg 2401 tgggtcagct ctgaggatgt atacccctgg tgggagaggg agacctagag atctggctgt 2461 ggggcgggca tggggggtga agggccactg ggaccctcag ccttgtttgt actgtatgcc 2521 ttcagcattg cctaggaaca cgaagcacga tcagtccatc ccagagggac cggagttatg 2581 acaagctttc caaatatttt gctttatcag ccgatatcaa cacttgtatc tggcctctgt 2641 gccccagcag tgccttgtgc aatgtgaatg tgcgcgtctc tgctaaacca ccattttatt 2701 tgggttttgt tttgttttgg ttttgctcgg atacttgcca aaatgagact ctccgtcggc 2761 agctggggga agggtctgag actccctttc cttttggttt tgggattact tttgatcctg 2821 ggggaccaat gaggtgaggg gggttctcct ttgccctcag ctttccccag cccctccggc 2881 ctgggctgcc cacaaggctt gtcccccaga ggccctggct cctggtcggg aagggaggtg 2941 gcctcccgcc aacgcatcac tggggctggg agcagggaag gacggcttgg ttctcttctt 3001 ttggggagaa cgtagagtct cactctagat gttttatgta ttatatctat aatataaaca 3061 tatcaaagtc aatgtcggtg tctttttaaa accagaaaga agctacttcc aaggttgtct 3121 gtgggccagg tcacatttgt aaataataca gcattttccc tggcggcaat cctgactttc 3181 atgagctctc catccatcct gagcccctct taccctaagg gggtgactta cttcccccag 3241 gcaagacaaa taaatagcag aggacaaggc tccaaatgga gtatgtccag agcctgaagg 3301 cagtctcttg gcgtcagg // LOCUS HSU34074 2380 bp mRNA PRI 15-NOV-1995 DEFINITION Human A kinase anchor protein S-AKAP84 mRNA, nuclear gene encoding mitochondrial protein, complete cds. ACCESSION U34074 NID g1049216 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2380) AUTHORS Lin,R.Y., Moss,S.B. and Rubin,C.S. TITLE Characterization of S-AKAP84, a novel developmentally regulated A kinase anchor protein of male germ cells JOURNAL J. Biol. Chem. 270 (46), 27804 (1995) MEDLINE 96070913 REFERENCE 2 (bases 1 to 2380) AUTHORS Rubin,C.S. TITLE Direct Submission JOURNAL Submitted (16-AUG-1995) Charles S. Rubin, Molecular Pharmacology, Albert Einstein College of Medicine, 1300 Morris Park Ave., Bronx, NY 10461, USA FEATURES Location/Qualifiers source 1..2380 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pGT7" /clone_lib="human testis 5-prime stretch cDNA library in lambda gt10" /sex="male" /cell_type="spermatid" /tissue_type="testis" /dev_stage="adult" 5'UTR 1..62 CDS 63..1844 /note="AKAP" /codon_start=1 /function="A Kinase Anchor Protein" /evidence=experimental /product="S-AKAP84" /db_xref="PID:g1049217" /translation="MAIQFRSLFPLALPGMLALLGWWWFFSRKKGHVSSHDEQQVEAG AVQVRADPAIKEPLPVEDVCPKVVSTPPSVTEPPEKELSTVSKLPAEPPALLHPHPPC RRSESSGILPNTTDMRLRPGTRRDDSTKLELALTGGEAKSIPLECPLSSPKGVLFSSK SAEVCKQDSPFSRVPRKVQPGYPVVPAEKRSSGERARETGGAEGTGDAVLGEKVLEEA LLSREHVLELENSKGPSLASLEGEEDKGKSSSSQVVGPVQEEEYVAEKLPSRFIESAH TELAKDDAAPAPPVADAKAQDRGVEGELGNEESLDRNEEGLDRNEEGLDRNEESLDRN EEGLDRNEEIKRAAFQIISQVISEATEQVLATTVGKVAGRVCQASQLQGQKEESCVPV HQKTVLGPDTAEPATAEAAVAPPDAGLPLPGLPAEGSPPPKTYVSCLKSLLSSPTKDS KPNISAHHISLASCLALTTPSEELPDRAGILVEDATCVTCMSDSSQSVPLVASPGHCS DSFSTSGLEDSCTETSSSPRDKAITPPLPESTVPFSNGVLKGELSDLGAEDGWTMDAE ADHSGVAAPPPGKRGTLITRCPGFFEC" misc_feature 1053..1230 /note="encodes tethering region; binding RII subunits of protein kinase A" /evidence=experimental 3'UTR 1842..2380 BASE COUNT 584 a 655 c 687 g 454 t ORIGIN 1 cgcacccttg gtggcagaac cgagcactac ccagcaaggt gtaattactt caagcctcca 61 ggatggcaat ccagttccgt tcgctcttcc ccttggcatt gcctgggatg ctggcgctcc 121 tcggctggtg gtggtttttc tctcgtaaaa aaggccatgt cagcagccat gatgagcagc 181 aggtggaggc tggtgctgtg caggtgaggg ctgaccctgc catcaaggaa cctctccccg 241 tggaagacgt ctgtcccaaa gtagtgtcca caccccccag tgtcacagag cctccagaaa 301 aggaactgtc caccgtgagc aagctgcctg cagagccccc agcattgctc cacccacacc 361 caccttgccg aagatcagag tcctcgggca ttcttcctaa caccacagac atgagattgc 421 gaccaggaac acgcagagat gacagtacaa agctggagct agccctgaca ggtggtgaag 481 ccaaatcgat tcctctagag tgcccccttt catccccaaa gggtgtacta ttctccagca 541 aatcagctga ggtgtgtaag caagattccc ccttcagcag ggtgccaagg aaggtccagc 601 caggctaccc cgtagtcccc gcagagaagc gtagctctgg ggagagggca agagagacag 661 gtggggccga agggactggt gatgccgtgt tgggggaaaa ggtgcttgaa gaagctctgt 721 tgtctcggga gcatgtcttg gaattggaga acagcaaggg ccccagcctg gcctctttag 781 agggggaaga agataagggg aagagcagct catcccaggt ggtggggcca gtgcaggagg 841 aagagtatgt agcagagaag ttgccaagta ggttcatcga gtcggctcac acagagctgg 901 caaaggacga tgcggcgcca gcacccccag tcgcagacgc caaagcccag gacagaggtg 961 tcgagggaga actgggcaat gaggagagct tggatagaaa tgaggagggc ttggatagaa 1021 atgaggaggg cttggataga aatgaggaga gcttggatag aaatgaggag ggcttggata 1081 gaaatgagga gattaagcgg gctgccttcc agataatctc ccaagtgatc tcagaagcaa 1141 ccgaacaggt gctggccacc acggttggca aggttgcagg tcgtgtgtgt caggccagtc 1201 agctccaagg gcagaaggaa gagagctgtg tcccagttca ccagaaaact gtcttgggcc 1261 cagacactgc ggagcctgcc acagcagagg cagctgttgc cccgccggat gctggcctcc 1321 ccttgccagg cctaccagca gagggctcac caccaccaaa gacctacgtg agctgcctga 1381 agagccttct gtccagcccc accaaggaca gtaagccaaa tatctctgca caccacatct 1441 ccctggcctc ctgcctggca ctgaccaccc ccagtgaaga gttgccggac cgggcaggca 1501 tcctggtgga agatgccacc tgtgtcacct gcatgtcaga cagcagccaa agtgtccctt 1561 tggtggcttc tccaggacac tgctcagatt ctttcagcac ttcagggctt gaagactctt 1621 gcacagagac cagctcgagc cccagggaca aggccatcac cccgccactg ccagaaagta 1681 ctgtgccctt cagcaatggg gtgctgaagg gggagttgtc agacttgggg gctgaggatg 1741 gatggaccat ggatgcggaa gcagatcatt caggagttgc agctccaccc ccgggaaagc 1801 ggggcacttt gataacgagg tgtcctgggt ttttcgagtg ctgaccccag cgtagacagt 1861 tcagaccgct aacttacagg ttctgacagg aacagcatgg attccgtgga tagctgttgc 1921 agtctcaaga agactgagag cttccaaaat gcccaggcag gctccaaccc taagaaggtc 1981 gacctcatca tctgggagat cgaggtgcca aagcacttag tcggtcggct aattggcaag 2041 caggggcgct atgtgagttt tctgaagcaa acatctggtg ccaagatcta catttcaacc 2101 ctgccttaca cccagagcgt ccagatctgc cacatagaag gctctcaaca tcatgtagac 2161 aaagcgctga acttgattgg gaagaagttc aaagagctga acctcaccaa tatctacgct 2221 cccccattgc cttcactggc actgccttct ctgccgatga catcctggct catgctgcct 2281 gatggcatca ccgtggaggt cattgtggtc aaccaggtca atgccgggca cctgttcgtg 2341 cagcagcaca cacaccctac cttccacgcg ctgcgcagcc // LOCUS HSU34252 2712 bp mRNA PRI 14-NOV-1996 DEFINITION Human gamma-aminobutyraldehyde dehydrogenase mRNA, complete cds. ACCESSION U34252 NID g1049218 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2712) AUTHORS Lin,S.W., Chen,J.C., Hsu,L.C., Hsieh,C.L. and Yoshida,A. TITLE Human gamma-aminobutyraldehyde dehydrogenase (ALDH9): cDNA sequence, genomic organization, polymorphism, chromosomal localization, and tissue expression JOURNAL Genomics 34 (3), 376-380 (1996) MEDLINE 96374830 REFERENCE 2 (bases 1 to 2712) AUTHORS Chen,J.C. TITLE Direct Submission JOURNAL Submitted (17-AUG-1995) James C. Chen, Biochemical Genetics, Beckman Research Institute of City of Hope, 1500 E. Duarte Road, Duarte, CA 91010, USA FEATURES Location/Qualifiers source 1..2712 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 378..1859 /codon_start=1 /product="gamma-aminobutyraldehyde dehydrogenase" /db_xref="PID:g1049219" /translation="MSTGTFVVSQPLNYRGGAAGAGGRSGTEKAFEPATGRVIATFTC SGEKEVNLAVQNAKAAFKIWSQKSGMERCRILLEAARIIREREDEIATMECINNGKSI FEARLDIDISWQCLEYYAGLAASMAGEHIQLPGGSFGYTRREPLGVCVGIGAWNYPFQ IASWKSAPALACGNAMVFKPSPFTPVSALLLAEIYSEAGVPPGLFNVVQGGAATGQFL CQHPDVAKVSFTGSVPTGMKIMEMSAKGIKPVTLELGGKSPLIIFSDCDMNNAVKGAL MANFLTQGQVCCNGTRVFVQKEILDKFTEEVVKQTQRIKIGDPLLEDTRMGPLINRPH LERVLGFVKVAKEQGAKVLCGGDIYVPEDPKLKDGYYMRPCVLTNCRDDMTCVKEEIF GPVMSILSFDTEAEVLERANDTTFGLAAGVFTRDIQRAHRVVAELQAGTCFINNYNVS PVELPFGGYKKSGFGRENGRVTIEYYSQLKTVCVEMGDVESAF" BASE COUNT 719 a 595 c 670 g 728 t ORIGIN 1 ggatcctagg atgcttacat gcaatgatga acccgaaaac acttgtaaag tgctacgtaa 61 atattgatca cgaagaagga agtcctcttc ccgcctggag actgtgtggg gtatggcggc 121 gtggtggaga gaatgtggtg tcttgttcca ccctcctgga gaggggaggg cctggcctgg 181 accgcagagg aatcgagtga ctgcccctaa aatctcctag aaccgatccc gtggacccgt 241 ccctcccgag ggtcccgccc ctcccgtggt ccgtcagcct ctgccgcgga gctgcgtccg 301 ccactcattt tctccgagca ggcctggccg cgctctcccc gcttcttcgc agtcttcggc 361 cctctcctgt cgccgccatg agcactggca ccttcgtcgt gtcgcagccg ctcaattacc 421 gcggcggggc cgctggagcc ggcggacgct ccggtaccga gaaagctttc gagccagcaa 481 ccggccgagt gatagctact ttcacatgtt caggagaaaa ggaagtaaat ttggctgttc 541 aaaatgcaaa ggctgctttt aaaatatgga gtcaaaaatc tggcatggag cgttgccgaa 601 tccttttgga ggctgccagg ataataaggg aacgggagga tgaaattgct actatggagt 661 gcatcaacaa tggcaagtcc atctttgagg cccgcttgga cattgacatt tcctggcagt 721 gcctggagta ttatgcgggc ttggctgcat ccatggctgg tgaacacatc cagctcccag 781 gtggatcgtt tggttatacc agaagagaac cacttggggt atgtgtggga ataggagcat 841 ggaactaccc ctttcagatt gcctcttgga agtcggctcc agcattagcc tgtggtaatg 901 ccatggtctt taaaccttct ccctttacac ctgtttctgc attgctactg gctgaaatct 961 acagtgaggc tggtgtacct cctgggctct tcaatgtggt gcagggaggg gctgccacag 1021 gccagtttct gtgtcagcat cccgatgtgg ccaaagtctc cttcactgga agtgtgccca 1081 ctggcatgaa gatcatggag atgtcagcta aaggaatcaa acctgttacc ttggaacttg 1141 gaggcaaatc tccactcatc atcttctcag actgtgatat gaacaatgct gtaaaggggg 1201 cgctgatggc caacttcctc acacaaggcc aggtttgctg taatggcaca agagtatttg 1261 tgcagaaaga aattcttgat aaatttacag aggaagtggt gaaacagacc caaaggatta 1321 aaattggaga tccccttctg gaagatacaa ggatgggtcc actcatcaac cgaccacacc 1381 tggagcgagt ccttgggttt gtcaaagtgg caaaggagca gggtgctaaa gtgttatgtg 1441 gtggagatat atatgtacct gaagatccca aattaaagga tggatattac atgagacctt 1501 gtgtattaac taattgcaga gacgacatga cctgtgtgaa ggaagagatc tttgggcctg 1561 ttatgtccat tttatcattt gacactgaag ctgaggttct agaaagagcc aatgatacca 1621 cttttggact agcagctggc gtctttacca gggacatcca acgggctcat agagtggtag 1681 ctgagcttca ggctgggacg tgcttcatta acaactataa cgtcagccca gtggagttgc 1741 cctttggtgg atataagaag tcaggatttg gcagagagaa cggccgtgtg acaatcgaat 1801 attattcaca gctgaagact gtgtgtgtgg agatgggtga tgtggaatct gctttttgaa 1861 aacctgcagt gaaacctatt gacatggcca cgctgtgaat gatgtgaatt ggccctgttt 1921 acagaggcag tacaactgaa tgttatttta catccagaat tttggcgttc agtataagag 1981 aatggttcat gttactcttt ctctctccat cagcttcctc actgaaaatg tgcattaagt 2041 gccttgtaga tactaatcaa gaaagctgtg attctcctca aagcgtattt ttgtgaaatc 2101 ttttaagagc cagtaacata cttctagaga acaggaaaga gactaggata atacatcttc 2161 cacacatttg gcccactgat aatgttaatt ctctggcgta tttcaaagaa cttgttcctg 2221 gctgatccaa gtgcagtggt atttacaact aattgatcac aaccagtttg tagatttctt 2281 tgttccttct ccattcccac tgcttcactt gcctagtctt gaagaaaaaa aacaaaaaac 2341 aaaaaaaacc ttgttccttt ataggttcct ggtagaatca gtagagatga tttcagctca 2401 ttgacatttt taagctgtat ccccttgtca ttccattgag aaagctgaca actgggatag 2461 ggaggggatt agataataga tggggtcaaa ttctgtgtga atgtgaactt gcctagtaag 2521 cactttgtct ctgttcacta ctgcgataga ggaaatctac tccctatctt gggtccttga 2581 actacagcct gctgtcttac accagtggag ctacccttta aatgtacaaa ttaatttgta 2641 tgctaatgta atatggtgaa attaaaataa atcacactgt taattgttaa aaaaaaaaaa 2701 aaaaaggaat tc // LOCUS HSU34355 2070 bp mRNA PRI 21-FEB-1996 DEFINITION Human skeletal muscle insulin receptor binding protein (Grb-IR) mRNA, complete cds. ACCESSION U34355 NID g1079573 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2070) AUTHORS Liu,F. and Roth,R.A. TITLE Grb-IR: a SH2-domain-containing protein that binds to the insulin receptor and inhibits its function JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (22), 10287-10291 (1995) MEDLINE 96036069 REFERENCE 2 (bases 1 to 2070) AUTHORS Liu,F. TITLE Direct Submission JOURNAL Submitted (18-AUG-1995) Feng Liu, Mol. Pharmacology, Stanford University, Stanford, CA 94305, USA FEATURES Location/Qualifiers source 1..2070 /organism="Homo sapiens" /db_xref="taxon:9606" gene 32..1678 /gene="Grb-IR" CDS 32..1678 /gene="Grb-IR" /note="SH2 domain containing protein" /codon_start=1 /function="insulin receptor inhibitor" /db_xref="PID:g1079574" /translation="MALAGCPDSFLHHPYYQDKVEQTPRSQQDPAGPGLPAQSDRLAN HQEDDVDLEALVNDMNASLESLYSACSMQSDTVPLLQNGQHARSQPRASGPPRSIQPQ VSPRQRVQRSQPVHILAVRRLQEEDQQFRTSSLPAIPNPFPELCGPGSPPVLTPGSLP PSQAAAKQDVKVFSEDGTSKVVEILADMTARDLCQLLVYKSHCVDDNSWTLVEHHPHL GLERCLEDHELVVQVESTMASESKFLFRKNYAKYEFFKNPMNFFPEQMVTWCQQSNGS QTQLLQEPRHLQLLADLEDSNIFSLIAGRKQYNAPTDHGLCIKPNKVRNETKELRLLC AEDEQTRTCWMTAFRLLKYGMLLYQNYRIPQQRKALLSPFSTPVRSVSENSLVAMDFS GQTGRVIENPAEAQSAALEEGHAWRKRSTRMNILGSQSPLHPSTLSTVIHRTQHWFHG RISREESHRIIKQQGLVDGLFLLRDSQSNPKAFVLTLCHHQKIKNFQILPCEDDGQTF FSLDDGNTKFSDLIQLVDFYQLNKGVLPCKLKHHCIRVAL" BASE COUNT 554 a 549 c 548 g 419 t ORIGIN 1 aaatgtaatt tgaagaaggc agaaggaacc catggcttta gccggctgcc cagattcctt 61 tttgcaccat ccgtactacc aggacaaggt ggagcagaca cctcgcagtc aacaagaccc 121 ggcaggacca ggactccccg cacagtctga ccgacttgcg aatcaccagg aggatgatgt 181 ggacctggaa gccctggtga acgatatgaa tgcatccctg gagagcctgt actcggcctg 241 cagcatgcag tcagacacgg tgcccctcct gcagaatggc cagcatgccc gcagccagcc 301 tcgggcttca ggccctcctc ggtccatcca gccacaggtg tccccgaggc agagggtgca 361 gcgctcccag cctgtgcaca tcctcgctgt caggcgcctt caggaggaag accagcagtt 421 tagaacctca tctctgccgg ccatccccaa tccttttcct gaactctgtg gccctgggag 481 cccccctgtg ctcacgccgg gttctttacc tccgagccag gccgccgcaa agcaggatgt 541 taaagtcttt agtgaagatg ggacaagcaa agtggtggag attctagcag acatgacagc 601 cagagacctg tgccaattgc tggtttacaa aagtcactgt gtggatgaca acagctggac 661 actagtggag caccacccgc acctaggatt agagaggtgc ttggaagacc atgagctggt 721 ggtccaggtg gagagtacca tggccagtga gagtaaattt ctattcagga agaattacgc 781 aaaatacgag ttctttaaaa atcccatgaa tttcttccca gaacagatgg ttacttggtg 841 ccagcagtca aatggcagtc aaacccagct tttgcaggaa cccagacacc tgcagctgct 901 ggccgacctg gaggacagca acatcttctc cctgatcgct ggcaggaagc agtacaacgc 961 ccctacagac cacgggctct gcataaagcc aaacaaagtc aggaatgaaa ctaaagagct 1021 gaggttgctc tgtgcagagg acgagcaaac caggacgtgc tggatgacag cgttcagact 1081 cctcaagtat ggaatgctcc tttaccagaa ttaccgaatc cctcagcaga ggaaggcctt 1141 gctgtccccg ttctcgacgc cagtgcgcag tgtctccgag aactccctcg tggcaatgga 1201 tttttctggg caaacaggac gcgtgataga gaatccggcg gaggcccaga gcgcagccct 1261 ggaggagggc cacgcctgga ggaagcgaag cacacggatg aacatcctag gtagccaaag 1321 tcccctccac ccttctaccc taagtacagt gattcacagg acacagcact ggtttcacgg 1381 gaggatctcc agggaggaat cccacaggat cattaaacag caagggctcg tggatgggct 1441 ttttctcctc cgtgacagcc agagtaatcc aaaggcattt gtactcacac tgtgtcatca 1501 ccagaaaatt aaaaatttcc agatcttacc ttgcgaggac gacgggcaga cgttcttcag 1561 cctagatgac gggaacacca aattctctga cctgatccag ctggttgact tttaccagct 1621 gaacaaagga gtcctgcctt gcaaactcaa gcaccactgc atccgagtgg ccttatgacc 1681 gcagatgtcc tctcggctga agactggagg aagtgaacac tggagtgaag aagcggtctg 1741 tgcgttggtg aagaacacac atcgattctg cacctgggga cccagagcga gatgggtttg 1801 ttcggtgcca gccgaccaag attgactagt ttgttggact taaacgacga tttgctgctg 1861 tgaacccagc agggtcgcct ccctctgcgt cggccaaatt ggggagggca tggaagatcc 1921 agcggaaagt tgaaaataaa ctggaatgat catcttggct tgggccgctt aggaacaaga 1981 accggagaga agtgattgga aatgaactct tgccctggaa taatcttgac aattaaaact 2041 gatatgttta aaaaaaaaaa aaaaaaaact // LOCUS HSU34360 3857 bp mRNA PRI 03-MAY-1996 DEFINITION Human lymphoid nuclear protein (LAF-4) mRNA, complete cds. ACCESSION U34360 NID g1144492 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3857) AUTHORS Ma,C. and Staudt,L.M. TITLE LAF-4 encodes a lymphoid nuclear protein with transactivation potential that is homologous to AF-4, the gene fused to MLL in t(4;11) leukemias JOURNAL Blood 87 (2), 734-745 (1996) MEDLINE 96141096 REFERENCE 2 (bases 1 to 3857) AUTHORS Ma,C. and Staudt,L.M. TITLE Direct Submission JOURNAL Submitted (19-AUG-1995) Louis Staudt, Metabolism Branch, National Cancer Institute, NIH, 9000 Rockville Pike, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..3857 /organism="Homo sapiens" /db_xref="taxon:9606" gene 55..3738 /gene="LAF-4" CDS 55..3738 /gene="LAF-4" /note="lymphoid nuclear protein; similar to AF-4" /codon_start=1 /product="LAF-4" /db_xref="PID:g1144493" /translation="MDSFDLALLQEWDLESLCVYEPDRNALRRKERERRNQETQQDDG TFNSSYSLFSEPYKTNKGDELSNRIQNTLGNYDEMKDFLTDRTNQSHLVGVPKPGVPQ TPVNKIDEHFVADSRAQNQPSSICSTTTSTPAAVPVQQSKRGTMGWQKAGHPPSDGQQ RATQQGSLRTLLGDGVGRQQPRAKQVCNVEVGLQTQERPPAMAAKHSSSGHCVQNFPP SLASKPSLVQQKPTAYVRPMDGQDQAPDESPKLKSSSETSVHCTSYRGVPASKPEPAR AKAKLSKFSIPKQGEESRSGETNSCVEEIIREMTWLPPLSAIQAPGKVEPTKFPFPNK DSQLVSSGHNNPKKGDAEPESPDNGTSNTSMLEDDLKLSSDEEENEQQAAQRTALRAL SDSAVVQQPNCRTSVPSSKGSSSSSSSGTSSSSSDSESSSGSDSETESSSSESEGSKP PHFSSPEAEPASSNKWQLDKWLNKVNPHKPPILIQNESHGSESNQYYNPVKEDVQDCG KVPDVCQPSLREKEIKSTCKEEQRPRTANKAPGSKGVKQKSPPAAVAVAVSAAAPPPA VPCAPAENAPAPARRSAGKKPTRRTERTSAGDGANCHRPEEPAAADALGTSVVVPPEP TKTRPCGNNRASHRKELRSSVTCEKRRTRGLSRIVPKSKEFIETESSSSSSSSDSDLE SEQEEYPLSKAQTVAASASSGNDQRLKEAAANGGSGPRAPVGSINARTTSDIAKELEE QFYTLVPFGRNELLSPLKDSDEIRSLWVKIDLTLLSRIPEHLPQEPGVLSAPATKDSE SAPPSHTSDTPAEKALPKSKRKRKCDNEDDYREIKKSQGEKDSSSRLATSTSNTLSAN HCNMNINSVAIPINKNEKMLRSPISPLSDASKHKYTSEDLTSSSRPNGNSLFTSASSS KKPKADSQLQPHGGDLTKAAHNNSENIPLHKSRPQTKPWSPGSNGHRDCKRQKLVFDD MPRSADYFMQEAKRMKHKADAMVEKFGKALNYAEAALSFIECGNAMEQGPMESKSPYY LMYSETVELIRYAMRLKTHSGPNATPEDKQLAALCYRCLALLYWRMFRLKRDHAVKYS KALIDYFKNSSKAAQAPSPWGASGKSTGTPSPISPNPFPGSSVGSQGSLSNASALSPS TIVSIPQRIHQMAANHVSITNSILHSYDYWEMADNLAKENREFFNDLDLLMGPVTLHS SMEHLVQYSQQGLHWLRNSAHLS" BASE COUNT 1043 a 1165 c 994 g 655 t ORIGIN 1 ggccgagcct cggcggcggc ggtagcggcg gcggcgacgc tgacacctcc caccatggac 61 agcttcgact tagccctgct ccaggaatgg gacctcgagt cactgtgtgt ctatgaacca 121 gatagaaatg cattacggag gaaagaacga gaaagaagaa atcaagaaac tcaacaggat 181 gatggcacgt ttaattctag ttactctctc ttcagtgagc cctacaagac taacaagggg 241 gatgaactct ccaaccggat ccagaacact ttaggcaatt atgatgaaat gaaagacttt 301 ttaactgata gaaccaatca gagtcatctc gttggagttc ccaaacctgg ggttcctcag 361 actcctgtga acaagatcga tgaacatttt gttgcagatt caagagccca gaaccagccc 421 tcgtctatct gtagcactac aacttccaca ccagcagctg tccccgtgca gcagagtaaa 481 agaggcacta tgggctggca gaaggctggg cacccaccct ctgacggcca acagagagca 541 acacaacagg gctctctcag gaccttgctt ggagatggtg ttggcagaca gcagcctcgg 601 gccaaacaag tgtgcaatgt ggaggtgggc cttcagaccc aggagaggcc acctgccatg 661 gcggccaagc acagcagcag cggacactgt gttcagaact ttcctccatc cctagcttca 721 aaacccagcc tggtccagca gaaaccgacc gcgtatgtga ggccaatgga cggccaagat 781 caggcccctg atgagtctcc taagctgaag tcgtcttcgg aaaccagcgt gcactgcaca 841 tcatacaggg gagtccctgc cagcaagccg gagcctgcca gagccaaggc caagctctcc 901 aagttcagca tccccaagca gggggaggag agtagatctg gagaaaccaa cagctgtgtt 961 gaagaaataa tccgggagat gacctggctt ccaccacttt ctgctattca agcacctggc 1021 aaagtggaac caaccaaatt tccatttcca aataaggact ctcagcttgt atcctctgga 1081 cacaataatc caaagaaagg tgatgcagag ccagagagtc cagacaatgg cacatcgaat 1141 acatcaatgc tggaagatga ccttaagcta agcagtgatg aagaggagaa tgaacagcag 1201 gcagctcaga gaacggctct ccgcgctctc tctgacagcg ccgtggtcca gcagcccaac 1261 tgcagaacct cggtgccttc cagcaagggc agcagcagca gcagcagcag cggcacgagc 1321 agctcctcca gcgactcaga gagcagctcc ggatctgact cggagaccga gagcagctcc 1381 agcgagagtg agggcagcaa gcccccccac ttctccagcc ccgaggctga accggcatcc 1441 tctaacaagt ggcagctgga taaatggcta aacaaagtta atccccacaa gcctcctatt 1501 ctgatccaaa atgaaagcca cgggtcagag agcaatcagt actacaaccc ggtgaaagag 1561 gacgtccagg actgtgggaa agtccccgac gtttgccagc ccagcctgag agagaaggag 1621 atcaagagca cttgcaagga ggagcaaagg ccaaggacag ccaacaaggc ccctgggagt 1681 aaaggcgtga agcagaagtc cccgcccgcg gccgtggccg tggcggtgag cgcagccgcc 1741 ccgccacccg cagtgccctg tgcgcccgcg gagaacgcgc ccgcgcctgc ccggaggtcc 1801 gcgggcaaga agcccaccag gcgcaccgag aggacctcag ccggggacgg cgccaactgc 1861 caccggcccg aggagcccgc ggccgcggac gcgctgggga cgagcgtggt ggtccccccg 1921 gagcccacca aaaccaggcc ctgtggcaac aacagagcga gccaccgcaa ggagctgcgc 1981 tcctccgtga cctgcgagaa gcgccgcacg cgggggctaa gcaggatcgt ccccaaatcc 2041 aaggagttca ttgagacaga gtcgtcatct tcatcctcct cctcggactc cgacctggag 2101 tccgagcagg aggagtaccc tctgtccaaa gcacagaccg tggctgcctc tgcctcctcc 2161 gggaatgatc agaggctgaa ggaggccgct gccaacgggg gcagtggtcc tagggcccct 2221 gtaggctcca tcaacgccag gaccaccagt gacatcgcca aggagctgga ggagcagttc 2281 tacacactgg tcccctttgg ccggaacgaa cttctctccc ctctaaagga cagtgatgag 2341 atcaggtctc tctgggtcaa aatcgacctg accctcctgt ccaggatccc agaacacctg 2401 ccccaggagc caggggtatt gagcgcccct gccaccaagg actctgagag cgcaccgccc 2461 agccacacct cggacacacc tgcagaaaag gctttgccaa aatccaagag gaaacgcaag 2521 tgtgacaacg aagacgacta cagggagatc aagaagtccc agggagagaa agacagctct 2581 tcaagactgg ccacctccac cagtaatact ttgtctgcaa accactgcaa catgaacatc 2641 aacagtgtgg caataccaat aaataaaaat gaaaaaatgc ttcggtcgcc catctcaccc 2701 ctctctgatg catctaaaca caaatacacc agcgaggact taacttcttc cagccgacct 2761 aatggcaaca gtttgtttac ttcagcctct tccagcaaaa agcctaaggc cgacagccag 2821 ctgcagcctc acggcggaga cctcacgaaa gcagctcaca acaattctga aaacattccc 2881 ctccacaagt cacggccgca gacgaagccg tggtctccag gctccaacgg ccacagggac 2941 tgcaagaggc agaaacttgt cttcgatgat atgcctcgca gtgccgatta ttttatgcaa 3001 gaagctaaac gaatgaagca taaagcagat gcaatggtgg aaaagtttgg aaaggctttg 3061 aactatgctg aagcagcatt gtcgtttatc gagtgtggaa atgcaatgga acaaggcccc 3121 atggaatcca aatctcctta ttacctgatg tattcagaaa cagtagagct catcaggtat 3181 gctatgagac taaaaaccca ctcaggcccc aatgccacac cagaagacaa acaactggct 3241 gcattatgtt accgatgcct ggccctcctg tactggcgga tgtttcgact caaaagggac 3301 cacgctgtaa agtattcaaa agcactaatc gactatttca agaactcatc taaagccgcc 3361 caagccccat ctccgtgggg ggccagtgga aagagcactg gaaccccatc ccccatttct 3421 cccaacccct ttcccggcag ctccgtgggg tctcagggca gcctctccaa cgccagcgcc 3481 ctgtccccgt cgaccatcgt cagcatccca cagcgcatcc accagatggc ggccaaccac 3541 gtcagcatca ccaacagcat cctgcacagc tacgactact gggagatggc cgacaacctg 3601 gccaaggaaa accgagaatt cttcaacgac ctggatctgc tcatggggcc ggtcaccctg 3661 cacagcagca tggagcacct ggtccagtac tcccaacagg gcctgcactg gctgcggaac 3721 agcgcccacc tgtcataggg acctcaccct ggggccagag tgggctctgg tctccacaga 3781 tggctcaacg tttttggaca ctgtgctact gaaactccca gccacagcat ttatagactg 3841 cggtgaacat ttcctca // LOCUS HSU34587 2110 bp mRNA PRI 06-MAR-1996 DEFINITION Human corticotropin-releasing factor receptor 2 mRNA, complete cds. ACCESSION U34587 NID g1144507 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2110) AUTHORS Liaw,C.W., Lovenberg,T.W., Barry,G., Oltersdorf,T., Grigoriadis,D.E. and de Souza,E.B. TITLE Cloning and characterization of the human corticotropin-releasing factor-2 receptor complementary deoxyribonucleic acid JOURNAL Endocrinology 137 (1), 72-77 (1996) MEDLINE 96107120 REFERENCE 2 (bases 1 to 2110) AUTHORS Liaw,C.W. TITLE Direct Submission JOURNAL Submitted (21-AUG-1995) Chen W. Liaw, Molecular Neurobiology, Neurocrine Biosciences Inc, 3050 Science Park Rd, San Diego, CA 92121, USA FEATURES Location/Qualifiers source 1..2110 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1..1236 /note="CRF2 receptor" /codon_start=1 /product="corticotropin-releasing factor receptor 2" /db_xref="PID:g1144508" /translation="MDAALLHSLLEANCSLALAEELLLDGWGPPLDPEGPYSYCNTTL DQIGTCWPRSAAGALVERPCPEYFNGVKYNTTRNAYRECLENGTWASKINYSQCEPIL DDKQRKYDLHYRIALVVNYLGHCVSVAALVAAFLLFLALRSIRCLRNVIHWNLITTFI LRNVMWFLLQLVDHEVHESNEVWCHCITTIFNYFVVTNFFWMFVEGCYLHTAIVMTYS TERLRKCLFLFIGWCIPFPIIVAWAIGKLYYENEQCWFGKEPGDLVDYIYQGPIILVL LINFVFLFNIVRILMTKLRASTTSETIQYRKAVKATLVLLPLLGITYMLFFVNPGEDD LSQIMFIYFNSFLQSFQGFFVSVFYCFFNGEVRSAVRKRWHRWQDHHSLRVPMARAMS IPTSPTRISFHSIKQTAAV" BASE COUNT 432 a 661 c 562 g 455 t ORIGIN 1 atggacgcgg cactgctcca cagcctgctg gaggccaact gcagcctggc gctggctgaa 61 gagctgctct tggacggctg ggggccaccc ctggaccccg agggtcccta ctcctactgc 121 aacacgacct tggaccagat cggaacgtgc tggccccgca gcgctgccgg agccctcgtg 181 gagaggccgt gccccgagta cttcaacggc gtcaagtaca acacgacccg gaatgcctat 241 cgagaatgct tggagaatgg gacgtgggcc tcaaagatca actactcaca gtgtgagccc 301 attttggatg acaagcagag gaagtatgac ctgcactacc gcatcgccct tgtcgtcaac 361 tacctgggcc actgcgtatc tgtggcagcc ctggtggccg ccttcctgct tttcctggcc 421 ctgcggagca ttcgctgtct gcggaatgtg attcactgga acctcatcac cacctttatc 481 ctgcgaaatg tcatgtggtt cctgctgcag ctcgttgacc atgaagtgca cgagagcaat 541 gaggtctggt gccactgcat caccaccatc ttcaactact tcgtggtgac caacttcttc 601 tggatgtttg tggaaggctg ctacctgcac acggccattg tcatgaccta ctccactgag 661 cgcctgcgca agtgcctctt cctcttcatc ggatggtgca tccccttccc catcatcgtc 721 gcctgggcca tcggcaagct ctactatgag aatgaacagt gctggtttgg caaggagcct 781 ggcgacctgg tggactacat ctaccaaggc cccatcattc tcgtgctcct gatcaatttc 841 gtatttctgt tcaacatcgt caggatccta atgacaaagt tacgcgcgtc caccacatcc 901 gagacaatcc agtacaggaa ggcagtgaag gccaccctgg tgctcctgcc cctcctgggc 961 atcacctaca tgctcttctt cgtcaatccc ggggaggacg acctgtcaca gatcatgttc 1021 atctatttca actccttcct gcagtcgttc cagggtttct tcgtgtctgt cttctactgc 1081 ttcttcaatg gagaggtgcg ctcagccgtg aggaagaggt ggcaccgctg gcaggaccat 1141 cactcccttc gagtccccat ggcccgggcc atgtccatcc ctacatcacc cacacggatc 1201 agcttccaca gcatcaagca gacggccgct gtgtgacccc tcggtcgccc acctgcacag 1261 ctcccctgtc ctcctccacc ttcttcctct gggttctctg tgctgggcag gctctcgtgg 1321 ggcaggagat gggaggggag agaccagctc tccagcctgg caggaaagag ggggtgcggc 1381 agccaagggg gactgcaagg gacagggatg agtgggggcc accaggctca gcgcaagagg 1441 aagcagaggg aattcacagg accccctgag aagagccagt cagatgtctg caggcatttg 1501 cccatcccag cctctctggc cagggcctta ctgggcccag agcagagaag gacctgtcca 1561 acacacacag ctatttatag tagcagacac agggctcccc tgccctactc atggagccag 1621 cagccaggca atggtgtggc cctgcactgg cccttggact ccacactcag tggtgccctg 1681 cagttgggtg ggttaacgcc aagcaaagga tcagtttggc tgccttatcc cagggctgtc 1741 acctagagag gctcacttgt accccaccct gttcctgtgt cccctcccca gccatcctcc 1801 ccgccttggg ggctccatga aggatgcagg cttccaggcc tggcttcctc tcttgggaga 1861 ccccttctct gcctagtcca cagattaggc aatcaaggaa gacgccatca gggaagccac 1921 atccttagtc aaccagttgc atcgtgcggg gcaaaatgag gagcagaggc atggaggagg 1981 gaggcgtggg atgggaatag cagaaccacc atgtcttcag tgattgaaac tcatacccca 2041 ttgccctttg ccctccagtc tccccttcag aaacatctct gctctctgtg aaataaacca 2101 tgcctcttgg // LOCUS HSU34605 3906 bp mRNA PRI 02-JAN-1996 DEFINITION Human retinoic acid- and interferon-inducible 58K protein RI58 mRNA, complete cds. ACCESSION U34605 NID g1144510 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3906) AUTHORS Niikura,T., Hirata,R. and Weil,S.C. TITLE A novel interferon-inducible gene expressed during myeloid differentiation JOURNAL Unpublished (1995) REFERENCE 2 (bases 1 to 3906) AUTHORS Weil,S.C. TITLE Direct Submission JOURNAL Submitted (22-AUG-1995) Susan C. Weil, Pathology, Anatomy and Cell Biology, Thomas Jefferson University, 1020 Locust St., Philadelphia, PA 19107, USA FEATURES Location/Qualifiers source 1..3906 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="acute promyelocytic leukemia NB4" CDS 118..1566 /note="interferon-inducible, TPR motif" /codon_start=1 /product="retinoic acid- and interferon-inducible 58K protein RI58" /db_xref="PID:g1144511" /translation="MSEIRKDTLKAILLELECHFTWNLLKEDIDLFEVEDTIGQQLEF LTTKSRLALYNLLAYVKHLKGQNKDALECLEQAEEIIQQEHSDKEEVRSLVTWGNYAW VYYHMDQLEEAQKYTGKIGNVCKKLSSPSNYKLECPETDCEKGWALLKFGGKYYQKAK AAFEKALEVEPDNPEFNIGYAITVYRLDDSDREGSVKSFSLGPLRKAVTLNPDNSYIK VFLALKLQDVHAEAEGEKYIEEILDQISSQPYVLRYAAKFYRRKNSWNKALELLKKAL EVTPTSSFLHHQMGLCYRAQMIQIKKATHNRPKGKDKLKVDELISSAIFHFKAAMERD SMFAFAYTDLANMYAEGGQYSNAEDIFRKALRLENITDDHKHQIHYHYGRFQEFHRKS ENTAIHHYLEALKVKDRSPLRTKLTSALKKLSTKRLCHNALDVQSLSALGFVYKLEGE KRQAAEYYEKAQKIDPENAEFLTALCELRLSI" polyA_site 3906 /note="19 A nucleotides" BASE COUNT 1229 a 708 c 736 g 1233 t ORIGIN 1 ctggcgcgcg cacgcgcacg cgcacgccca ccgcgcggct tcccccgctc cccggtgctg 61 aggagagagc gatccgaggg actgcgccgc ccggacggcc tgcagagcgc tgccatcatg 121 agtgaaattc gtaaggacac cttgaaggcc attctgttgg agttagaatg tcattttaca 181 tggaatttac ttaaggaaga cattgatctg tttgaggtag aagatacaat tgggcaacag 241 cttgaatttc ttaccacaaa atctagactt gctctttata acctattggc ctatgtgaaa 301 cacctaaaag gccaaaataa agacgccctt gagtgcttgg aacaagcaga agaaataatc 361 cagcaagaac actcagacaa agaagaagta cgaagcctgg tcacttgggg aaactatgcc 421 tgggtgtatt atcacatgga ccagcttgaa gaagctcaga agtatacagg taagataggg 481 aatgtctgta agaaattgtc cagtccttct aactacaagt tggagtgtcc tgagactgac 541 tgtgagaaag gctgggcact cttgaaattt ggaggaaagt attatcaaaa ggctaaagcg 601 gcttttgaga aggctctgga agtggagcct gacaatccag aatttaacat cggctatgct 661 atcacagtgt atcggctgga tgattctgat agagaagggt ctgtaaagag cttttctctg 721 gggcctttga gaaaggctgt taccctgaac ccagataaca gctatattaa ggtttttctg 781 gcactgaagc ttcaagatgt acatgcagaa gctgaagggg aaaagtatat tgaagaaatc 841 ctggaccaaa tatcatccca gccttacgtc cttcgttatg cagccaagtt ctataggaga 901 aaaaattcct ggaacaaagc tctcgaactt ttaaaaaagg ccttggaggt gacaccaact 961 tcttctttcc tgcatcacca gatgggactt tgctacaggg cacaaatgat ccaaatcaag 1021 aaggccacac acaacagacc taaaggaaag gataaactaa aggttgatga gctgatttca 1081 tctgctatat ttcatttcaa agcagccatg gaacgagact ctatgtttgc atttgcctac 1141 acagacctgg ccaacatgta tgctgaagga ggccagtata gcaatgctga ggacattttc 1201 cggaaagctc ttcgtctgga gaacataacc gatgatcaca aacatcagat ccattaccac 1261 tatggccgct ttcaggaatt tcaccgtaaa tcagaaaata ctgccatcca tcattattta 1321 gaagccttaa aggtcaaaga cagatcaccc cttcgcacca aactgacaag tgctctgaag 1381 aaattgtcta ccaagagact ttgtcacaat gctttagatg tgcagagttt aagtgcccta 1441 gggtttgttt acaagctgga aggagaaaag aggcaagctg ctgagtacta tgagaaggca 1501 caaaagatag atccagaaaa tgcagaattc ctgactgctc tctgtgagct ccgactttcc 1561 atttaaatac atactctagg aaattagctc taagtttttc ccttcatttt gggttctcct 1621 gtttgttttt tttttattat tttaatccct tgtttattat agagctaata tttattgaat 1681 agttattgtg taccaagcat tgtgctaaat actttatatg cattatgatg aatcttgtgc 1741 ggttttcttt ctttttttct ttttaattaa aatactataa tccattgaga aatagcaata 1801 ttctagctat tgtaacttct aaaaatggta tggccattag atctgtgctt tttatctctg 1861 ctctttgaat ttctcatatt atatagtaaa tatattccta cgtaaacctt tgatacctag 1921 atcaggaata ctcttccagg agtacaaaat tacattattg atagttaagc tcttaattgt 1981 gtagcttgca aaagacagca ctttttagtt acagatgttt tgactttgat gaggatattt 2041 agctatcaat ctaatagtca cctaaaatat cttttttgtt ggaaaaaagt ttataataaa 2101 aaagtttgtc atctctagtg acttcaataa agaaaaaact agaagaggag aaaaaggatt 2161 tcctcaaatt ttaaatatgt aacttcaggg attcaatccc caaatgttta ttaagtagct 2221 agaaataatt atgtggaaaa aaatgaataa tggaaaatag tgagtctcaa attgtttttt 2281 tttaactaaa atctgcaatg aatctagatg caattaattt tattccttcc aactaaaatt 2341 acaatatttt taggttaaaa ttattgagat ataaagcagc cattgggaaa ttgggagaaa 2401 tgataaacaa atggaaaaag aagatgtccc taacctacac ccatagatta ccaaggtttc 2461 agtgtactag ttttgaatct gttctgaatg gagtttttat accctcaatt tctggccttt 2521 ggctatttta gcatttcaaa gtgacttcta tgaagctttt tttttaatgt gaaattttca 2581 gaatgttgtt tttttcatgt agatactcca ggaagagtta agcactgctt tcagttttaa 2641 tatccacctt gaggggtcgc tgcttgaggg ctcttatccc aggggacttt ttaattcgga 2701 tgttacttaa tgtggcttct ctaatgtagt ttctttgatt accgactaca caattatgta 2761 ccatcacagt attagtggaa aagtaccatg tgatttaatt ctccattcct ccaatgtaac 2821 tcttaaaatt attatgtatg tgtatgtgtt ttactttttg ttttttatca tctttaaaat 2881 ttctattatg gtttgattat tataaaaata atgaattctc actgtaaatt tcaaaaaaaa 2941 aattacaaaa gtatgtgaat ttaaaaatga gagcagtccc ctcaccctac cacagttcca 3001 caccctcaag gtaaacttat aacttataat ttgatatgta aacttccaga tcttttttct 3061 atgcgtaatc agacatacat atatactgca gtgtatctca cgtattaatt tttaaaaatc 3121 ttttgtttta cttaattctg tttttattat tattattatt ttgtttgatc tattaaggaa 3181 gaacaaggaa gggaatgatc tttactcaag aatttcagaa agtcagcact gaagtcctga 3241 cctatcagta gacacatttg tccctttcag atattttagg atattctagc aaagcaggcc 3301 atttctccca cctgaaagta cataacttct atcacttgcc acataattaa aagaactcac 3361 attaagcggt tactcagaca gttaatcata gaaaagatta tttgcttcat cagttcatag 3421 aaaagattat ttgcttcatc agttaacttg tttttataaa tcagggcctg tgttcataca 3481 cagaaggggc ctgagatttc tgcactttaa acaagctcct cctaggtgag gatgctgtgg 3541 ctgttctaat tacattttga gtagtaaggt ctacagcatt gttcctcaaa cttggctacg 3601 tattggaatc acctaaaaag ttaaaacaaa acatggatgt ctgggtcccg ccccatagag 3661 aatgacttaa ttggcatggg gtgcagtcca ggcatcatga tttttagatt tcccagttgg 3721 aacttgtgca gcaaagtttg ggagctactg atggacatgt gaaaagtaag tataaatgga 3781 ataaaattaa ttaggctaat aggcttaacc caggaaatcc taagttcctt gaatatccag 3841 tttgcatttg ggactcctca tcatatactt ggtatataat actctaataa aagctgcctg 3901 agttga // LOCUS HSU34623 2817 bp DNA PRI 31-JAN-1996 DEFINITION Human T cell surface glycoprotein CD-6 mRNA, complete cds. ACCESSION U34623 NID g1015963 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2817) AUTHORS Robinson,W.H., Neuman de Vegvar,H.E., Prohaska,S.S., Rhee,J.W. and Parnes,J.R. TITLE Human CD6 possesses a large, alternatively spliced cytoplasmic domain JOURNAL Eur. J. Immunol. 25 (10), 2765-2769 (1995) MEDLINE 96062022 REFERENCE 2 (bases 1 to 2817) AUTHORS Parnes,J.R. TITLE Direct Submission JOURNAL Submitted (22-AUG-1995) Jane Parnes, Medicine, Stanford Universiy, MSLS Bldg. P-306, 1201 Welch Rd., Stanford, CA, 94305, USA COMMENT Aruffo, A. et al. J. Exp. Med. 174,949-952,1991. FEATURES Location/Qualifiers source 1..2817 /organism="Homo sapiens" /note="hybrid CD6-PB1" /db_xref="taxon:9606" CDS 121..2127 /codon_start=1 /product="T cell surface glycoprotein CD6" /db_xref="PID:g1015964" /translation="MWLFFGITGLLTAALSGHPSPAPPDQLNTSSAESELWEPGERLP VRLTNGSSSCSGTVEVRLEASWEPACGALWDSRAAEAVCRALGCGGAEAASQLAPPTP ELPPPPAAGNTSVAANATLAGAPALLCSGAEWRLCEVVEHACRSDGRRARVTCAENRA LRLVDGGGACAGRVEMLEHGEWGSVCDDTWDLEDAHVVCRQLGCGWAVQALPGLHFTP GRGPIHRDQVNCSGAEAYLWDCPGLPGQHYCGHKEDAGVVCSEHQSWRLTGGADRCEG QVEVHFRGVWNTVCDSEWYPSEAKVLCQSLGCGTAVERPKGLPHSLSGRMYYSCNGEE LTLSNCSWRFNNSNLCSQSLAARVLCSASRSLHNLSTPEVPASVQTVTIESSVTVKIE NKESRELMLLIPSIVLGILLLGSLIFIAFILLRIKGKYALPVMVNHQHLPTTIPAGSN SYQPVPITIPKEVFMLPIQVQAPPPEDSDSGSDSDYEHYDFSAQPPVALTTFYNSQRH RVTDEEVQQSRFQMPPLEEGLEELHASHIPTANPGHCITDPPSLGPQYHPRSNSESST SSGEDYCNSPKSKLPPWNPQVFSSERSSFLEQPPNLELAGTQPAFSAGPPADDSSSTS SGEWYQNFQPPPQPPSEEQFGCPGSPSPQPDSTDNDDYDDISAA" BASE COUNT 543 a 930 c 845 g 499 t ORIGIN 1 gaacagcaaa gggtagagca gacctgcgcc aggggcgcac aacggccgtg tccacctccc 61 ggccccaaga tggtgcttcc cacaggcagc cacgcgtagc agccagagac agctccagac 121 atgtggctct tcttcgggat cactggattg ctgacggcag ccctctcagg tcatccatct 181 ccagccccac ctgaccagct caacaccagc agtgcagaga gtgagctctg ggagccaggg 241 gagcggcttc cggtccgtct gacaaacggg agcagcagct gcagcgggac ggtggaggtg 301 cggctcgagg cgtcctggga gcccgcgtgc ggggcgctct gggacagccg cgccgccgag 361 gccgtgtgcc gagcactggg ctgcggcggg gcggaggccg cctctcagct cgccccgccg 421 acccctgagc tgccgccccc gcctgcagcc gggaacacca gcgtagcagc taatgccact 481 ctggccgggg cgcccgccct cctgtgcagc ggcgccgagt ggcggctctg cgaggtggtg 541 gagcacgcgt gccgcagcga cgggaggcgg gcccgtgtca cctgtgcaga gaaccgcgcg 601 ctgcgcctgg tggacggtgg cggcgcctgc gccggccgcg tggagatgct ggagcatggc 661 gagtggggat cagtgtgcga tgacacttgg gacctggagg acgcccacgt ggtgtgcagg 721 caactgggct gcggctgggc agtccaggcc ctgcccggct tgcacttcac gcccggccgc 781 gggcctatcc accgggacca ggtgaactgc tcgggggccg aagcttacct gtgggactgc 841 ccggggctgc caggacagca ctactgcggc cacaaagagg acgcgggcgt ggtgtgctca 901 gagcaccagt cctggcgcct gacagggggc gctgaccgct gcgaggggca ggtggaggta 961 cacttccgag gggtctggaa cacagtgtgt gacagtgagt ggtacccatc ggaggccaag 1021 gtgctctgcc agtccttggg ctgtggaact gcggttgaga ggcccaaggg gctgccccac 1081 tccttgtccg gcaggatgta ctactcatgc aatggggagg agctcaccct ctccaactgc 1141 tcctggcggt tcaacaactc caacctctgc agccagtcgc tggcagccag ggtcctctgc 1201 tcagcttccc ggagtttgca caatctgtcc actcccgaag tccctgcaag tgttcagaca 1261 gtcactatag aatcttctgt gacagtgaaa atagagaaca aggaatctcg ggagctaatg 1321 ctcctcatcc cctccatcgt tctgggaatt ctcctccttg gctccctcat cttcatagcc 1381 ttcatcctct tgagaattaa aggaaaatat gccctccccg taatggtgaa ccaccagcac 1441 ctacccacca ccatcccggc agggagcaat agctatcaac cggtccccat caccatcccc 1501 aaagaagttt tcatgctgcc catccaggtc caggccccgc cccctgagga ctcagactct 1561 ggctcggact cagactatga gcactatgac ttcagcgccc agcctcctgt ggccctgacc 1621 accttctaca attcccagcg gcatcgggtc acagatgagg aggtccagca aagcaggttc 1681 cagatgccac ccttggagga aggacttgaa gagttgcatg cctcccacat cccaactgcc 1741 aaccctggac actgcattac agacccgcca tccctgggcc ctcagtatca cccgaggagc 1801 aacagtgagt cgagcacctc ttcaggggag gattactgca atagtcccaa aagcaagctg 1861 cctccatgga acccccaggt gttttcttca gagaggagtt ccttcctgga gcagccccca 1921 aacttggagc tggccggcac ccagccagcc ttttcagcag ggcccccggc tgatgacagc 1981 tccagcacct catccgggga gtggtaccag aacttccagc caccacccca gcccccttcg 2041 gaggagcagt ttggctgtcc agggtccccc agccctcagc ctgactccac cgacaacgat 2101 gactacgatg acatcagcgc agcctaggcc ggggccagcc gaggctcctg gggtggctct 2161 gaccctctgg cctcctgctc tacctactcc ctttcccctt tcccaccctc ccagctcacc 2221 tccccatgga gctgagaggc ctcccttgga gagatggaag gaaacgttat accttgtacc 2281 cctcggtctc catccatcaa gccaaacctg ctgccacagc cctcccccgg ccccagatag 2341 cagccccagg gaggatgctg cctccaagag gtgtgagccc tctgtctcgg ggatgaacaa 2401 gcagagtctg ggctacctct tgacagctgg tggaggggag ttggggagct ggactggatg 2461 actctggagg ccccttccaa acctcaagtg tccggcgctt tgattgcctg agtttctgac 2521 acttcagggc ccagaggtcc tgcgaggggc agaactggac ccccatgcca gtgctgctgc 2581 aggagggccc atatactagg gtctgctgag ctgttgtcac tgatcggtgg gcgctggggg 2641 ggtagggtag cacaccagct gtcccaggct ttgctccggg tggtaactgc acttgggcag 2701 ggaatatagc cttcctgggc acaactagct gacaatgaca ggttgactgt gtacccccaa 2761 ccaaggagct ggggcccaag gccagtcctg ccccagagac actccaagtc cgccagg // LOCUS HSU34802 1299 bp DNA PRI 02-OCT-1995 DEFINITION Human intrinsic membrane protein MP70 (Cx50) gene, complete cds. ACCESSION U34802 NID g1002998 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1299) AUTHORS Church,R.L., Wang,J.H. and Steele,E. TITLE The human lens intrinsic membrane protein MP70 (Cx50) gene: clonal analysis and chromosome mapping JOURNAL Curr. Eye Res. 14 (3), 215-221 (1995) MEDLINE 95317073 REFERENCE 2 (bases 1 to 1299) AUTHORS Church,R.L., Wang,J.H. and Steele,E. TITLE Direct Submission JOURNAL Submitted (24-AUG-1995) Robert L. Church, Ophthalmology, Emory Eye Center, 1327 Clifton Road, N.E., Atlanta, GA 30322, USA FEATURES Location/Qualifiers source 1..1299 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" gene 1..1299 /gene="Cx50" CDS 1..1299 /gene="Cx50" /codon_start=1 /product="intrinsic membrane protein MP70" /db_xref="PID:g1002999" /translation="MGDWSFLGNILEEVNEHSTVIGRVWLTVLFIFRILILGTAAEFV WGDEQSDFVCNTQQPGCENVCYDEAFPISHIRLWVLQIIFVSTPSLMYVGHAVHYVRM EEKRKSRDEELGQQAGTNGGPDQGSVKKSSGSKGTKKFRLEGTLLRTYICHIIFKTLF EVGFIVGHYFLYGFRILPLYRCSRWPCPNVVDCFVSRPTEKTIFILFMLSVASVSLFL NVMELSHLGLKGIRSALKRPVEQPLGEIPEKSLHSIAVSSIQKAKGYQLLEEEKIVSH YFPLTEVGMVETSPLPAKPFNQFEEKISTGPLGDLSRGYQETLPSYAQVGAQEVEGEG PPAEEGAEPEVGEKKEEAERLTTEEQEKVAVPEGEKVETPGVDKEGEKEEPQSEKVSK QGLPAEKTPSLCPELTTDDARPLSRLSKASSRARSDDLTV" BASE COUNT 294 a 363 c 406 g 236 t ORIGIN 1 atgggcgact ggagtttcct ggggaacatc ttggaggagg tgaatgagca ctccaccgtc 61 atcggcagag tctggctcac cgtgcttttc atcttccgga tcctcatcct tggcacggcc 121 gcagagttcg tgtgggggga tgagcaatcc gacttcgtgt gcaacaccca gcagcctggc 181 tgcgagaacg tctgctacga cgaggccttt cccatctccc acattcgcct ctgggtgctg 241 cagatcatct tcgtctccac cccgtccctg atgtacgtgg ggcacgcggt gcactacgtc 301 cgcatggagg agaagcgcaa aagccgcgac gaggagctgg gccagcaggc ggggactaac 361 ggcggcccgg accagggcag cgtcaagaag agcagcggca gcaaaggcac taagaagttc 421 cggctggagg ggaccctgct gaggacctac atctgccaca tcatcttcaa gaccctcttt 481 gaagtgggct tcatcgtggg ccactacttc ctgtacgggt tccggatcct gcctctgtac 541 cgctgcagcc ggtggccctg ccccaatgtg gtggactgct tcgtgtcccg gcccacggag 601 aaaaccatct tcatcctgtt catgttgtct gtggcctctg tgtccctatt cctcaacgtg 661 atggagttga gccacctggg cctgaagggg atccggtctg ccttgaagag gcctgtagag 721 cagcccctgg gggagattcc tgagaaatcc ctccactcca ttgctgtctc ctccatccag 781 aaagccaagg gctatcagct tctagaagaa gagaaaatcg tttcccacta tttccccttg 841 accgaggttg ggatggtgga gaccagccca ctgcctgcca agcctttcaa tcagttcgag 901 gagaagatca gcacaggacc cctgggggac ttgtcccggg gctaccaaga gacactgcct 961 tcctacgctc aggtgggggc acaagaagtg gagggcgagg ggccgcctgc agaggaggga 1021 gccgaacccg aggtgggaga gaagaaggag gaagcagaga ggctgaccac ggaggagcag 1081 gagaaggtgg ccgtgccaga gggggagaaa gtagagaccc ccggagtgga taaggagggt 1141 gaaaaagaag agccgcagtc ggagaaggtg tcaaagcaag ggctgccagc tgagaagaca 1201 ccttcactct gtccagagct gacaacagat gatgccagac ccctgagcag gctaagcaaa 1261 gccagcagcc gagccaggtc agacgatcta accgtatga // LOCUS HSU34806 1232 bp DNA PRI 19-NOV-1996 DEFINITION Human G protein-coupled receptor (GPR15) gene, complete cds. ACCESSION U34806 NID g1171145 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1232) AUTHORS Heiber,M., Marchese,A., Nguyen,T., Heng,H.H., George,S.R. and O'Dowd,B.F. TITLE A novel human gene encoding a G-protein-coupled receptor (GPR15) is located on chromosome 3 JOURNAL Genomics 32 (3), 462-465 (1996) MEDLINE 96435926 REFERENCE 2 (bases 1 to 1232) AUTHORS O'Dowd,B.F. TITLE Direct Submission JOURNAL Submitted (24-AUG-1995) Brian F. O'Dowd, Pharmacology, University of Toronto, 8 Taddle Creek Rd., Toronto, Ontario M5S 1A8, Canada FEATURES Location/Qualifiers source 1..1232 /organism="Homo sapiens" /db_xref="taxon:9606" /map="3q11.2-q13.1" /chromosome="3" gene 83..1165 /gene="GPR15" CDS 83..1165 /gene="GPR15" /codon_start=1 /product="G protein-coupled receptor" /db_xref="PID:g1171146" /translation="MDPEETSVYLDYYYATSPNSDIRETHSHVPYTSVFLPVFYTAVF LTGVLGNLVLMGALHFKPGSRRLIDIFIINLAASDFIFLVTLPLWVDKEASLGLWRTG SFLCKGSSYMISVNMHCSVLLLTCMSVDRYLAIVWPVVSRKFRRTDCAYVVCASIWFI SCLLGLPTLLSRELTLIDDKPYCAEKKATPIKLIWSLVALIFTFFVPLLSIVTCYCCI ARKLCAHYQQSGKHNKKLKKSIKIIFIVVAAFLVSWLPFNTFKFLAIVSGLRQEHYLP SAILQLGMEVSGPLAFANSCVNPFIYYIFDSYIRRAIVHCLCPCLKNYDFGSSTETSD SHLTKALSTFIHAEDFARRRKRSVSL" BASE COUNT 291 a 302 c 270 g 369 t ORIGIN 1 aagcttcctt gaggtttcta aaatttatac aaaaacatca tatgtaagta aactcaccag 61 atttggcatc tgctctttgg tgatggaccc agaagaaact tcagtttatt tggattatta 121 ctatgctacg agcccaaact ctgacatcag ggagacccac tcccatgttc cttacacctc 181 tgtcttcctt ccagtctttt acacagctgt gttcctgact ggagtgctgg ggaaccttgt 241 tctcatggga gcgttgcatt tcaaacccgg cagccgaaga ctgatcgaca tctttatcat 301 caatctggct gcctctgact tcatttttct tgtcacattg cctctctggg tggataaaga 361 agcatctcta ggactgtgga ggacgggctc cttcctgtgc aaagggagct cctacatgat 421 ctccgtcaat atgcactgca gtgtcctcct gctcacttgc atgagtgttg accgctacct 481 ggccattgtg tggccagtcg tatccaggaa attcagaagg acagactgtg catatgtagt 541 ctgtgccagc atctggttta tctcctgcct gctggggttg cctactcttc tgtccaggga 601 gctcacgctg attgatgata agccatactg tgcagagaaa aaggcaactc caattaaact 661 catatggtcc ctggtggcct taattttcac cttttttgtc cctttgttga gcattgtgac 721 ctgctactgt tgcattgcaa ggaagctgtg tgcccattac cagcaatcag gaaagcacaa 781 caaaaagctg aagaaatcta taaagatcat ctttattgtc gtggcagcct ttcttgtctc 841 ctggctgccc ttcaatactt tcaagttcct ggccattgtc tctgggttgc ggcaagaaca 901 ctatttaccc tcagctattc ttcagcttgg tatggaggtg agtggaccct tggcatttgc 961 caacagctgt gtcaaccctt tcatttacta tatcttcgac agctacatcc gccgggccat 1021 tgtccactgc ttgtgccctt gcctgaaaaa ctatgacttt gggagtagca ctgagacatc 1081 agatagtcac ctcactaagg ctctctccac cttcattcat gcagaagatt ttgccaggag 1141 gaggaagagg tctgtgtcac tctaaaggga actgtgacat ttcaagctct gttggtgggt 1201 ttaggagtta atttttgtca gcaacaaaga aa // LOCUS HSU34845 1665 bp mRNA PRI 25-NOV-1995 DEFINITION Human mercurial-insensitive water channel mRNA, form 1, complete cds. ACCESSION U34845 NID g1072052 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1665) AUTHORS Yang,B., Ma,T. and Verkman,A.S. TITLE cDNA cloning, gene organization, and chromosomal localization of a human mercurial insensitive water channel. Evidence for distinct transcriptional units JOURNAL J. Biol. Chem. 270 (39), 22907-22913 (1995) MEDLINE 96032721 REFERENCE 2 (bases 1 to 1665) AUTHORS Yang,B., Ma,T. and Verkman,A.S. TITLE Direct Submission JOURNAL Submitted (24-AUG-1995) Alan S. Verkman, CVRI, UCSF, 505 Parnassus St., San Francisco, CA 94143-0521, USA FEATURES Location/Qualifiers source 1..1665 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="fetal brain" /chromosome="18" /clone="hMIWC1" /map="18q22" CDS 285..1190 /codon_start=1 /product="mercurial-insensitive water channel" /db_xref="PID:g1072053" /translation="MVAFKGVWTQAFWKAVTAEFLAMLIFVLLSLGSTINWGGTEKPL PVDMVLISLCFGLSIATMVQCFGHISGGHINPAVTVAMVCTRKISIAKSVFYIAAQCL GAIIGAGILYLVTPPSVVGGLGVTMVHGNLTAGHGLLVELIITFQLVFTIFASCDSKR TDVTGSIALAIGFSVAIGHLFAINYTGASMNPARSFGPAVIMGNWENHWIYWVGPIIG AVLAGALYEYVFCPDVEFKRRFKEAFSKAAQQTKGSYMEVEDNRSQAKTDDLILKLGV VHVIDVDRGEEKKGKDQSGEVLSSV" BASE COUNT 455 a 348 c 375 g 487 t ORIGIN 1 cctttctagg gacagtttgg ataattttcc gtgaaactgg atgatctttg aatattaaat 61 gaaaggagac tagaataaca gtctcttgac tattcatgaa ggagctttag cagaagcatt 121 ctttcttggt gtgaaatcac gcctgcagcc cctgctcgac agtacccgaa gatgcaggcc 181 ttgttccctt cacctaaatt cataaacctg ggtgtagtgg cttctgatgc tgatttgttt 241 ctcttttcag taagtgtgga cctttgtgta ccagagagaa catcatggtg gctttcaaag 301 gggtctggac tcaagctttc tggaaagcag tcacagcgga atttctggcc atgcttattt 361 ttgttctcct cagcctggga tccaccatca actggggtgg aacagaaaag cctttaccgg 421 tcgacatggt gctcatctcc ctttgctttg gactcagcat tgcaaccatg gtgcagtgct 481 ttggccatat cagcggtggc cacatcaacc ctgcagtgac tgtggccatg gtgtgcacca 541 ggaagatcag catcgccaag tctgtcttct acatcgcagc ccagtgcctg ggggccatca 601 ttggagcagg aatcctctat ctggtcacac ctcccagtgt ggtgggaggc ctgggagtca 661 ccatggttca tggaaatctt accgctggtc atggtctcct ggttgagttg ataatcacat 721 ttcaattggt gtttactatc tttgccagct gtgattccaa acggactgat gtcactggct 781 caatagcttt agcaattggg ttttctgttg caattggaca tttatttgca atcaattata 841 ctggtgccag catgaatccc gcccgatcct ttggacctgc agttatcatg ggaaattggg 901 aaaaccattg gatatattgg gttgggccca tcataggagc tgtcctcgct ggtgcccttt 961 atgagtatgt cttctgtcca gatgttgaat tcaaacgtcg ttttaaagaa gccttcagca 1021 aagctgccca gcaaacaaaa ggaagctaca tggaggtgga ggacaacagg agtcaggcaa 1081 agacggatga cctgattcta aaacttggag tggtgcatgt gattgacgtt gaccggggag 1141 aggagaagaa ggggaaagac caatctggag aggtattgtc ttcagtatga ctagaagatc 1201 gcactgaaag ccagacaaga ctccttagaa ctgtcctcag atttccttcc gcccattaag 1261 gaaacagatt tgttataaat tagaatgtgc aggtttgttg tttcatgtca tattactcag 1321 tctaaacaat aaatatttca taatttacaa ggaggaacgg aagaaaccta ttgtgaattc 1381 caaatctaaa aaaagaaata tttttaaaat gttcttaagc aaatatatac ctattttatc 1441 tagttacctt tcattaacaa ccaattttaa ccgtgtgtca agatttggtt aagtcttgcc 1501 tgacagaact caaagacagc tctatcagct tattccttct ctactggaat attggtatag 1561 tcaattctat tgaatattat cctttttaat gcccaaggta atgtggacta aagcccagaa 1621 atttgaaaag aatattcaga aatccttccc aaatcataag ggccc // LOCUS HSU34880 2234 bp mRNA PRI 15-AUG-1996 DEFINITION Human DPH2L mRNA, complete cds. ACCESSION U34880 NID g1490414 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2234) AUTHORS Phillips,N.J., Ziegler,M.R. and Deaven,L.L. TITLE Similar to diphthamide biosynthesis gene DPH2 of Saccharomyces cerevisiae (human locus DPH2L) JOURNAL Unpublished REFERENCE 2 (bases 1 to 2234) AUTHORS Phillips,N.J. TITLE Direct Submission JOURNAL Submitted (25-AUG-1995) Nancy J. Phillips, Pathology, Saint Louis University School of Medicine, 1402 S. Grand Ave., Saint Louis, MO 63110, USA FEATURES Location/Qualifiers source 1..2234 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="B2-3.1" /clone_lib="FP4 (ATCC #77434, Swaroop et al, Cyto. Cell Genet.64:292, 1993) vector lambdaSHK" /chromosome="17" /map="17p13.3" /tissue_type="fetus and placenta" /dev_stage="9 weeks gestation" CDS 294..1385 /note="similar to yeast hypothetical 48.3 kDa protein Swiss-Prot Accession Number P40487" /codon_start=1 /product="DPH2L" /db_xref="PID:g1490415" /translation="MPEGLLLFACTIVDILERFTEAEVMVMGDVTYGACCVDDFTARA LGADFLVHYGHSCLIPMDTSAQDFRVLYVFVDIRIDTTHLLDSLRLTFPPATALALVS TIQFVSTLQAAAQELKAEYRVSVPQCKPLSPGEILGCTSPRLSKEVEAVVYLGDGRFH LESVMIANPNVPAYRYDPYSKVLSREHYDHQRMQAARQEAIATARSAKSWGLILGTLG RQGSPKILEHLESRLRALGLSFVRLLLSEIFPSKLSLLPEVDVWVQVACPRLSIDWGT ASPKPLLTPYEAAVALRDISWQQPYPMDFYAGSSLGPWTVNHGQDRRPHAPGRPARGK VQEGSARPPSAVACEDCSCRDEKVAPLAP" BASE COUNT 433 a 690 c 638 g 473 t ORIGIN 1 gcttgaatta gcctgcgctc tccgcgttct tccagcgctg tctttttagt accacatgcg 61 caggcaggtg atggcggcgc tggtcgtatc cggggcagcg gagcagggcg ccgagacggc 121 cctggcagag gtcgggcccc tcgggccgcg tggccaatca gatcccccct gagatcctga 181 agaaccctca gctgcaggca gcaatccggg tcctgccttc caactacaac tttgagatcc 241 ccaagaccat ctggaggatc caacaagccc aggccaagaa ggtggccttg caaatgccgg 301 aaggcctcct cctctttgcc tgtaccattg tggatatctt ggaaaggttc acggaggccg 361 aagtgatggt gatgggtgac gtgacctacg gggcttgctg tgtggatgac ttcacagcga 421 gggccctggg agctgacttc ttggtgcact acggccacag ttgcctgatt cccatggaca 481 cctcggccca agacttccgg gtgctgtacg tctttgtgga catccggata gacactacac 541 acctcctgga ctctctccgc ctcacctttc ccccagccac tgcccttgcc ctggtcagca 601 ccattcagtt tgtgtcgacc ttgcaggcag ccgcccagga gctgaaagcc gagtatcgtg 661 tgagtgtccc acagtgcaag cccctgtccc ctggagagat cctgggctgc acatcccccc 721 gactgtccaa agaggtggag gccgttgtgt atcttggaga tggccgcttc catctggagt 781 ctgtcatgat tgccaacccc aatgtccccg cttaccggta tgacccatat agcaaagtcc 841 tatccagaga acactatgac caccagcgca tgcaggctgc tcgccaagaa gccatagcca 901 ctgcccgctc agctaagtcc tggggcctta ttctgggcac tttgggccgc cagggcagtc 961 ctaagatcct ggagcacctg gaatctcgac tccgagcctt gggcctttcc tttgtgaggc 1021 tgctgctctc tgagatcttc cccagcaagc ttagcctact tcccgaggtg gatgtgtggg 1081 tgcaggtggc atgtccacgt ctctccattg actggggcac agcctccccc aagccgctgc 1141 tgacacccta tgaggcggcc gtggctctga gggacatttc ctggcagcag ccctacccga 1201 tggacttcta cgctggcagc tccttggggc cctggacggt gaaccacggc caggaccgcc 1261 gtccccacgc cccgggccgg cccgcgcggg ggaaggtgca ggaggggtcc gcgcgtcccc 1321 cttcggccgt ggcttgcgag gactgcagct gcagggacga gaaggtggcg ccgctggctc 1381 cttgacgcgc tcccgggcct cagggtcctg ccctccggag gagcagcctc gaggctggtg 1441 gttttcagag caggaggccg acgttttctc cgcattggaa gagcccgccg tctgcagggg 1501 cctggaggaa tcactgggga tggtggcaca ggcactgaac aggctggggc cttttgacgg 1561 ccttcttggt ttcagccaag gggctgcgct agcagccctt gtgtgtgccc tgggccaggc 1621 aggcgatccc cgcttcccct tgccacggtt tatcctcttg gtgtctggtt tctgtccccg 1681 gggcattggg ttcaaggaat ccatcctgca aaggcccttg tcattgcctt cgctccatgt 1741 ttttggggac actgacaaag tcatcccctc tcaggagagt gtgcaactgg ccagccaatt 1801 tcccggagcc atcaccctca cccactctgg tggccacttc attccagcag ctgcacccca 1861 gcgtcaggcc tacctcaagt tcttggacca gtttgcagag tgaaagatca agaaatgtct 1921 ctgctcctac atccagctcc tctaggggca gcctccgtca tccatgccct cccaggaccc 1981 tccactcact gctgtgagtg cgcctcacca gaaccagtta agagacaact atcaattctt 2041 gagacccaaa ttataagggc cctgcctgta ctgaagaaaa ggggagcaca aggccttaat 2101 ggacattgac ttgtgaaaac gcaaacatga atatggttgg agagccctgg attaggaggg 2161 tgacatgggg aaggcagagg ctggcaccat ggtgactgcc acataataaa gtggtgattt 2221 ccaaaaaaaa aaaa // LOCUS HSU34919 2745 bp mRNA PRI 20-FEB-1997 DEFINITION Human white homolog (white) mRNA, complete cds. ACCESSION U34919 NID g1314276 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2745) AUTHORS Croop,J.M., Tiller,G.E., Fletcher,J.A., Lux,M.L., Raab,E., Goldenson,D., Son,D., Arciniegas,S. and Wu,R.L. TITLE Isolation and characterization of a mammalian homolog of the Drosophila white gene JOURNAL Gene 185 (1), 77-85 (1997) MEDLINE 97186700 REFERENCE 2 (bases 1 to 2745) AUTHORS Croop,J.M., Tiller,G.E., Fletcher,J.A., Lux,M.L., Raab,E., Goldenson,D., Arciniegas,S., Son,D. and Wu,R.L. TITLE Direct Submission JOURNAL Submitted (27-AUG-1995) James M. Croop, Pediatric Oncology, Dana-Farber Cancer Institute, 44 Binney Street, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..2745 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="fetal brain" /chromosome="21" /map="21q22.3" gene 378..2294 /gene="white" CDS 378..2294 /gene="white" /note="ABC transporter; homolog of Drosophila white gene product, Swiss-Prot Accession Number P10090" /codon_start=1 /product="white homolog" /db_xref="PID:g1314277" /translation="MEATETDLLNGHLKKVDNNLTEAQRFSSLPRRAAVNIEFRDLSY SVPEGPWWRKKGYKTLLKGISGKFNSGELVAIMGPSGAGKSTLMNILAGYRETGMKGA VLINGLPRDLRCFRKVSCYIMQDDMLLPHLTVQEAMMVSAHLKLQEKDEGRREMVKEI LTALGLLSCANTRTGSLSGGQRKRLAIALELVNNPPVMFFDEPTSGLDSASCFQVVSL MKGLAQGGRSIICTIHQPSAKLFELFDQLYVLSQGQCVYRGKVCNLVPYLRDLGLNCP TYHNPADFVMEVASGEYGDQNSRLVRAVREGMCDSDHKRDLGGDAEVNPFLWHRPSEE VKQTKRLKGLRKDSSSMEGCHSFSASCLTQFCILFKRTFLSIMRDSVLTHLRITSHIG IGLLIGLLYLGIGNEAKKVLSNSGFLFFSMLFLMFAALMPTVLTFPLEMGVFLREHLN YWYSLKAYYLAKTMADVPFQIMFPVAYCSIVYWMTSQPSDAVAFVLFAALGTMTSLVA QSLGLLIGAASTSLQVATFVGPVTAIPVLLFSGFFVSFDTIPTYLQWMSYISYVRYGF EGVILSIYGLDREDLHCDIDETCHFQKSEAILRELDVENAKLYLDFIVLGIFFISLRL IAYFVLRYKIRAER" BASE COUNT 595 a 740 c 751 g 659 t ORIGIN 1 gaattccggg atgtggaacg gtcgcaggag gctgctacaa gccccatgag caaggctgtt 61 cccactgaca gagctttccc aggatgacag agagtgcgct ctgcctctct ggggtgtgct 121 agcctacgag gggcaatcgt aaggcgaatg tcactgaaag aacacaagtg tccttaaaca 181 tggactatct gggctttcta gtgctgaaat tcttcccact cccactgccc acttcccatt 241 atataaaaaa cacagttgtt tcatgttttt gtttctttac tgtttttctt tgtttttgtt 301 aagaatgcat tcatttattc aaaattgttt attgtagaat aatcaggcat tgcgtggatg 361 aggtggtgtc cagcaacatg gaggccactg agacggacct gctgaatgga catctgaaaa 421 aagtagataa taacctcacg gaagcccagc gcttctcctc cttgcctcgg agggcagctg 481 tgaacattga attcagggac ctttcctatt cggttcctga aggaccctgg tggaggaaga 541 aaggatacaa gaccctcctg aaaggaattt ccgggaagtt caatagtggt gagttggtgg 601 ccattatggg tccttccggg gccgggaagt ccacgctgat gaacatcctg gctggataca 661 gggagacggg catgaagggg gccgtcctca tcaacggcct gccccgggac ctgcgctgct 721 tccggaaggt gtcctgctac atcatgcagg atgacatgct gctgccgcat ctcactgtgc 781 aggaggccat gatggtgtcg gcacatctga agcttcagga gaaggatgaa ggcagaaggg 841 aaatggtcaa ggagatactg acagcgctgg gcttgctgtc ttgcgccaac acgcggaccg 901 ggagcctgtc aggtggtcag cgcaagcgcc tggccatcgc gctggagctg gtgaacaacc 961 ctccagtcat gttcttcgat gagcccacca gcggcctgga cagcgcctcc tgcttccagg 1021 tggtctcgct gatgaaaggg ctcgctcaag ggggtcgctc catcatttgc accatccacc 1081 agcccagcgc caaactcttc gagctgttcg accagcttta cgtcctgagt caaggacaat 1141 gtgtgtaccg gggaaaagtc tgcaatcttg tgccatattt gagggatttg ggtctgaact 1201 gcccaaccta ccacaaccca gcagattttg tcatggaggt tgcatccggc gagtacggtg 1261 atcagaacag tcggctggtg agagcggttc gggagggcat gtgtgactca gaccacaaga 1321 gagacctcgg gggtgatgcc gaggtgaacc cttttctttg gcaccggccc tctgaagagg 1381 taaagcagac aaaacgatta aaggggttga gaaaggactc ctcgtccatg gaaggctgcc 1441 acagcttctc tgccagctgc ctcacgcagt tctgcatcct cttcaagagg accttcctca 1501 gcatcatgag ggactcggtc ctgacacacc tgcgcatcac ctcgcacatt gggatcggcc 1561 tcctcattgg cctgctgtac ttggggatcg ggaacgaagc caagaaggtc ttgagcaact 1621 ccggcttcct cttcttctcc atgctgttcc tcatgttcgc ggccctcatg cctactgttc 1681 tgacatttcc cctggagatg ggagtctttc ttcgggaaca cctgaactac tggtacagcc 1741 tgaaggccta ctacctggcc aagaccatgg cagacgtgcc ctttcagatc atgttcccag 1801 tggcctactg cagcatcgtg tactggatga cgtcgcagcc gtccgacgcc gtggcctttg 1861 tgctgtttgc cgcgctgggc accatgacct ccctggtggc acagtccctg ggcctgctga 1921 tcggagccgc ctccacgtcc ctgcaggtgg ccactttcgt gggcccagtg acagccatcc 1981 cggtgctcct gttctcgggg ttcttcgtca gcttcgacac catccccacg tacctacagt 2041 ggatgtccta catctcctat gtcaggtatg ggttcgaagg ggtcatcctc tccatctatg 2101 gcttagaccg ggaagatctg cactgtgaca tcgacgagac gtgccacttc cagaagtcgg 2161 aggccatcct gcgggagctg gacgtggaaa atgccaagct gtacctggac ttcatcgtac 2221 tcgggatttt cttcatctcc ctccgcctca ttgcctattt tgtcctcagg tacaaaatcc 2281 gggcagagag gtaaaacacc tgaatgccag gaaacaggaa gattagacac tgtggccgag 2341 ggcacgtcta gaatcgagga ggcaagcctg tgcccgaccg acgacacaga gactcttctg 2401 atccaacccc tagaaccgcg ttgggtttgt gggtgtctcg tgctcagcca ctctgcccag 2461 ctgggttgga tcttctctcc attccccttt ctagctttaa ctaggaagat gtaggcagat 2521 tggtggtttt tttttttttt taacatacag aattttaaat accacaactg gggcagaatt 2581 taaagctgca acacagctgg tgatgagagg cttcctcagt ccagtcgctc cttagcacca 2641 ggcaccgtgg gtcctggatg gggaactgca agcagcctct cagctgatgg ctgcgcagtc 2701 agatgtctgg tggcagagag tccgagcatg gagcgattcc atttt // LOCUS HSU34962 1585 bp mRNA PRI 16-MAY-1996 DEFINITION Human transcription factor HCSX (hCsx) mRNA, complete cds. ACCESSION U34962 NID g1314280 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1585) AUTHORS Turbay,D., Wechsler,S.B., Blanchard,K.M. and Izumo,S. TITLE Molecular cloning, chromosomal mapping, and characterization of the human cardiac-specific homeobox gene hCsx JOURNAL Mol. Med. 2 (1), 86-96 (1996) MEDLINE 97056197 REFERENCE 2 (bases 1 to 1585) AUTHORS Turbay,D. TITLE Direct Submission JOURNAL Submitted (28-AUG-1995) David Turbay, Cardiovascular Research Center, Department of Internal Medicine, The University of Michigan Medical Center, 7220 MSRB III, 1150 West Medical Center Drive, Ann Arbor, MI 48109, USA FEATURES Location/Qualifiers source 1..1585 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="heart" /clone="phCsx 1313" /chromosome="5" /map="5q35" /clone_lib="lambda gt10 (Tamkun, et al., FASEB J. 5, 331-337)" gene 177..1151 /gene="hCsx" CDS 177..1151 /gene="hCsx" /note="homeodomain containing protein" /codon_start=1 /function="transcription factor" /product="HCSX" /db_xref="PID:g1314281" /translation="MFPSPALTPTPFSVKDILNLEQQQRSLAAAGELSARLEATLAPS SCMLAAFKPEAYAGPEAAAPGLPELRAELGRAPSPAKCASAFPAAPAFYPRAYSDPDP AKDPRAEKKELCALQKAVELEKTEADNAERPRARRRRKPRVLFSQAQVYELERRFKQQ RYLSAPERDQLASVLKLTSTQVKIWFQNRRYKCKRQRQDQTLELVGLPPPPPPPARRI AVPVLVRDGKPCLGDSAPYAPAYGVGLNPYGYNAYPAYPGYGGAACSPGYSCTAAYPA GPSPAQPATAAANNNFVNFGVGDLNAVQSPGIPQSNSGVSTLHGIRAW" misc_feature 588..768 /gene="hCsx" /note="encodes helix-turn-helix motif" BASE COUNT 256 a 569 c 497 g 263 t ORIGIN 1 gcctggtccc gcctctcctg ccccttgtgc tcagcgctac ctgctgcccg gacacatcca 61 gagctggccg acgggtgcgc gggcgggcgg cggcaccatg cagggaagct gccaggggcc 121 gtgggcagcg ccgctttctg ccgcccacct ggcgctgtga gactggcgct gccaccatgt 181 tccccagccc tgctctcacg cccacgccct tctcagtcaa agacatccta aacctggaac 241 agcagcagcg cagcctggct gccgccggag agctctctgc ccgcctggag gcgaccctgg 301 cgccctcctc ctgcatgctg gccgccttca agccagaggc ctacgctggg cccgaggcgg 361 ctgcgccggg cctcccagag ctgcgcgcag agctgggccg cgcgccttca ccggccaagt 421 gtgcgtctgc ctttcccgcc gcccccgcct tctatccacg tgcctacagc gaccccgacc 481 cagccaagga ccctagagcc gaaaagaaag agctgtgcgc gctgcagaag gcggtggagc 541 tggagaagac agaggcggac aacgcggagc ggccccgggc gcgacggcgg aggaagccgc 601 gcgtgctctt ctcgcaggcg caggtctatg agctggagcg gcgcttcaag cagcagcggt 661 acctgtcggc ccccgaacgc gaccagctgg ccagcgtgct gaaactcacg tccacgcagg 721 tcaagatctg gttccagaac cggcgctaca agtgcaagcg gcagcggcag gaccagactc 781 tggagctggt ggggctgccc ccgccgccgc cgccgcctgc ccgcaggatc gcggtgccag 841 tgctggtgcg cgatggcaag ccatgcctag gggactcggc gccctacgcg cctgcctacg 901 gcgtgggcct caatccctac ggttataacg cctaccccgc ctatccgggt tacggcggcg 961 cggcctgcag ccctggctac agctgcactg ccgcttaccc cgccgggcct tccccagcgc 1021 agccggccac tgccgccgcc aacaacaact tcgtgaactt cggcgtcggg gacttgaatg 1081 cggttcagag ccccgggatt ccgcagagca actcgggagt gtccacgctg catggtatcc 1141 gagcctggta gggaagggac ccgcgtggcg cgaccctgac cgatcccacc tcaacagctc 1201 cctgactctc gtggggagaa ggggctccca acatgaccct gagtcccctg gattttgcat 1261 tcactcctgc ggagacctag gaactttttc tgtcccacgc gcgtttgttc ttgcgcacgg 1321 gagagtttgt ggcggcgatt atgcagcgtg caatgagtga tcctgcagcc tggtgtctta 1381 gctgtccccc caggagtgcc ctccgagagt ccatgggcac ccccggttgg aactgggact 1441 gagctcgggc acgcagggcc tgagatctgg ccgcccattc cgcgagccag ggccgggcgc 1501 ccgggccttt gctatctcgc cgtcgcccgc ccacgcaccc acccgtattt atgtttttac 1561 ctattgctgt aagaaatgac gatcc // LOCUS HSU34976 1630 bp mRNA PRI 09-NOV-1995 DEFINITION Human gamma-sarcoglycan mRNA, complete cds. ACCESSION U34976 NID g1054902 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1630) AUTHORS Noguchi,S., McNally,E.M., Othmane,K.B., Hagiwara,Y., Mizuno,Y., Yoshida,M., Yamamoto,H., Bonnemann,C.G., Gussoni,E., Denton,P.H., Kyriakides,T., Middleton,L., Hentati,F., Hamida,M.B., Nonaka,I., Vance,J.M., Kunkel,L.M. and Ozawa,E. TITLE Mutations in the dystrophin-associated protein gamma-sarcoglycan in chromosome 13 muscular dystrophy JOURNAL Science 270 (5237), 819-822 (1995) MEDLINE 96055122 REFERENCE 2 (bases 1 to 1630) AUTHORS McNally,E.M. TITLE Direct Submission JOURNAL Submitted (28-AUG-1995) Elizabeth M. McNally, Genetics, Children's Hospital, 300 Longwood, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..1630 /organism="Homo sapiens" /db_xref="taxon:9606" /map="13q12" /chromosome="13" CDS 125..1000 /note="mutated in SCARMD (LGMD 2C); 35 kDa dystrophin-associated protein" /codon_start=1 /product="gamma-sarcoglycan" /db_xref="PID:g1054903" /translation="MVREQYTTATEGICIERPENQYVYKIGIYGWRKRCLYLFVLLLL IILVVNLALTIWILKVMWFSPAGMGHLCVTKDGLRLEGESEFLFPLYAKEIHSRVDSS LLLQSTQNVTVNARNSEGEVTGRLKVGPKMVEVQNQQFQINSNDGKPLFTVDEKEVVV GTDKLRVTGPEGALFEHSVETPLVRADPFQDLRLESPTRSLSMDAPRGVHIQAHAGKI EALSQMDILFHSSDGMLVLDAETVCLPKLVQGTWGPSGSSQSLYEICVCPDGKLYLSV AGVSTTCQEHSHICL" BASE COUNT 446 a 350 c 386 g 448 t ORIGIN 1 gttgctgaag cttcatcctt tgctctcatt ctgtaagtca tagaaaagtt tgaaacattc 61 tgtctgtggt agagctcggg ccagctgtag ttcattcgcc agtgtgcttt tcttaatatc 121 taagatggtg cgtgagcagt acactacagc cacagaaggc atctgcatag agaggccaga 181 gaatcagtat gtctacaaaa ttggcattta tggctggaga aagcgctgtc tctacttgtt 241 tgttcttctt ttactcatca tcctcgttgt gaatttagct cttacaattt ggattcttaa 301 agtgatgtgg ttttctccag caggaatggg ccacttgtgt gtaacaaaag atggactgcg 361 cttggaaggg gaatcagaat ttttattccc attgtatgcc aaagaaatac actccagagt 421 ggactcatct ctgctgctac aatcaaccca gaatgtgact gtaaatgcgc gcaactcaga 481 aggggaggtc acaggcaggt taaaagtcgg tcccaaaatg gtagaagtcc agaatcaaca 541 gtttcagatc aactccaacg acggcaagcc actatttact gtagatgaga aggaagttgt 601 ggttggtaca gataaacttc gagtaactgg gcctgaaggg gctctttttg aacattcagt 661 ggagacaccc cttgtcagag ccgacccgtt tcaagacctt agattagaat cccccactcg 721 gagtctaagc atggatgccc caaggggtgt gcatattcaa gctcacgctg ggaaaattga 781 ggcgctttct caaatggata ttctttttca tagtagtgat ggaatgcttg tgcttgatgc 841 tgaaactgtg tgcttaccca agctggtgca ggggacgtgg ggtccctctg gcagctcaca 901 gagcctctac gaaatctgtg tgtgtccaga tgggaagctg tacctgtctg tggccggtgt 961 gagcaccacg tgccaggagc acagccacat ctgcctctga gctgcctgcg tcctctcggt 1021 gagctgtgca gtgccggccc cagatcctca cacccaggga gcagctgcac atcgtgaaag 1081 actgaggcag cgtggatggg aagtaaacgc ttccagagga actcagaaaa aattatgtgc 1141 cagtgaaagt gtttggacaa aaactacatg atctcaaaat gcacgtggat gtgagacaca 1201 aaagttgaca aaatggaaaa gcaatgtgtt tttccactgg attaattttc accggaacaa 1261 ttgcgaattc tctctgcctc gcctccccct atcttgtccg tgtgggcaca cactgagtgt 1321 tgagttgccg tgtggagtta atgtatgacg ctccactgtg gatatctaat gccctgttga 1381 gagtagcctt gctcagtact aaaatgcccc aaagttctat acagcatttc ctttatagca 1441 ttcaaacctc acatcctccc ttcagtttaa tgcaagtaag tcaggtttca caagaaaatt 1501 ttcaagtttt gaagggaatt tgaggttgat ctggttttca agatgtagtt aaaggaataa 1561 atcactcaaa attaaacttt ctgtatatag tcaataagca ataaaaacct catttttcag 1621 agttaaaaaa // LOCUS HSU35100 940 bp mRNA PRI 26-OCT-1995 DEFINITION Human complexin II mRNA, complete cds. ACCESSION U35100 NID g1040920 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 940) AUTHORS McMahon,H.T., Missler,M., Li,C. and Sudhof,T.C. TITLE Complexins: cytosolic proteins that regulate SNAP receptor function JOURNAL Cell 83 (1), 111-119 (1995) MEDLINE 96006530 REFERENCE 2 (bases 1 to 940) AUTHORS McMahon,H.T. TITLE Direct Submission JOURNAL Submitted (30-AUG-1995) Harvey T. McMahon, University of Texas Southwestern Medical Center, Howard Hughes Medical Institute, 5323 Harry Hines Blvd, Dallas, TX 75235, USA FEATURES Location/Qualifiers source 1..940 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pBlue-HTC3" /tissue_type="brain" CDS 346..750 /codon_start=1 /evidence=experimental /product="complexin II" /db_xref="PID:g1040921" /translation="MDFVMKQALGGATKDMGKMLGGEEEKDPDAQKKEEERQEALRQQ EEERKAKHARMEAEREKVRQQIRDKYGLKKKEEKEAEEKAALEQPCEGSLTRPKKAIP AGCGDEEEEEEESILDTVLKYLPGPLQDMFKK" BASE COUNT 214 a 264 c 319 g 143 t ORIGIN 1 cgggagacat tcgaggcgga cagaaacggg gcttggcgcc ccccggtgca cgtgtgctag 61 cccaggcagg agggagcgcc tcggcggagg agtcaaggaa gagggggagg gagaaacgcg 121 ccagaacctc ggcccgggcg ccctcgtcgg ccgcggagga gctgcagcct ccaacaggaa 181 ggtgtgatcc ctgccatgct atctgctctg ctcagcgact gaaggtgccc gcatcccagc 241 tctgccagga agcaaaggtt gtcacatctt cccaagccag gccagccagg agcgctgcat 301 gcaaattctg ccgtgggcta aggcacgcta accagagccg gcggcatgga cttcgtcatg 361 aagcaggccc ttggaggggc cacaaaggac atggggaaga tgctgggggg agaggaggag 421 aaggaccccg acgcgcagaa aaaggaggag gagcggcagg aggcgctgcg gcagcaggag 481 gaggagcgta aggccaagca cgcgcgcatg gaggcggagc gggagaaggt ccggcagcag 541 atccgagata agtatgggct gaagaagaag gaggagaagg aagcagagga gaaagcagcc 601 ctggagcagc cctgcgaggg gagcctgacc cggcccaaga aggccatccc tgcgggctgc 661 ggggacgagg aggaggagga agaggagagc atcctggaca cggtgctcaa atacctgccc 721 gggccgctgc aggacatgtt caagaagtaa ccaggcctcc tgccccagcc tactccacct 781 gttactactt ctttttggtt ctttcttttc tttttattag gttaagtctc aattctgaag 841 gggaaaacct cagttggcct ctgcccctct tccctggcca ggggcttctc cccctcagct 901 ctccctcaca cctcccttca tcccagggta tccggaattc // LOCUS HSU35113 2640 bp mRNA PRI 05-OCT-1995 DEFINITION Human metastasis-associated mta1 mRNA, complete cds. ACCESSION U35113 NID g1008543 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2640) AUTHORS Toh,Y., Pencil,S.D. and Nicolson,G.L. TITLE A novel candidate metastasis-associated gene, mta1, differentially expressed in highly metastatic mammary adenocarcinoma cell lines. cDNA cloning, expression, and protein analyses JOURNAL J. Biol. Chem. 269 (37), 22958-22963 (1994) MEDLINE 94364985 REFERENCE 2 (bases 1 to 2640) AUTHORS Lin,P., Toh,Y. and Nicolson,G.L. TITLE Direct Submission JOURNAL Submitted (30-AUG-1995) NaWa Akihiro, Tumor Biology, U. Texas M.D. Anderson Cancer Center, 1515 Holcombe Blvd., Box 108, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..2640 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="A2058 malignant melanoma cells" gene 33..2180 /gene="mta1" CDS 33..2180 /gene="mta1" /note="metastasis-associated gene" /codon_start=1 /db_xref="PID:g1008544" /translation="MAANMYRVGDYVYFENSSSNPYLIRRIEELNKTANGNVEAKVVC FYRRRDISSTLIALADKHATLSVCYKAGPGADNGEEGEIEEEMENPEMVDLPEKLKHQ LRHRELFLSRQLESLPATHIRGKCSVTLLNETESLKSYLEREDFFFYSLVYDPQQKTL LADKGEIRVGNRYQADITDLLKEGEEDGRDQSRLETQVWEAHNPLTDKQIDQFLVVAR SVGTFARALDCSSSVRQPSLHMSAAAASRDITLFHAMDTLHKNIYDISKAISALVPQG GPVLCRDEMEEWSASEANLFEEALEKYGKDFTDIQQDFLPWKSLTSIIEYYYMWKTTD RYVQQKRLKAAEAESKLKQVYIPNYNKPNPNQISVNNVKAGVVNGTGAPGQSPGAGRA CESCYTTQSYQWYSWGPPNMQCRLCASCWTYWKKYGGLKMPTRLDGERPGPNRSNMSP HGLPARSSGSPKFAMKTRQAFYLHTTKLTRIARRLCREILRPWHAARNPYLPINSAAI KAECTARLPEASQSPLVLKQAVRKPLEAVLRYLETHPRPPKPDPVKSVSSVLSSLTPA KVAPVINNGSPTILGKRSYEQHNGVDGNMKKRLLMPSRGLANHGQTRHMGPSRNLLLN GKSYPTKVRLIRGGSLPPVKRRRMNWIDAPGDVFYMPKEETRKIRKLLSSSETKRAAR RPYKPIALRQSQALPPRPPPPAPVNDEPIVIED" BASE COUNT 581 a 843 c 762 g 454 t ORIGIN 1 aaacattctc ctccgccgcc gccggcccgg acatggccgc caacatgtac agggtcggag 61 actacgtcta ctttgagaac tcctccagca acccatacct gatccggaga atcgaggagc 121 tcaacaagac ggccaatggg aacgtggagg ccaaagtggt gtgcttctac cggaggcggg 181 acatctccag caccctcatc gccctggccg acaagcacgc aaccctgtca gtctgctata 241 aggccggacc gggggcggac aacggcgagg aaggggaaat agaagaggaa atggagaatc 301 cggaaatggt ggacctgccc gagaaactaa agcaccagct gcggcatcgg gagctgttcc 361 tctcccggca gctggagtct ctgcccgcca cgcacatcag gggcaagtgc agcgtcaccc 421 tgctcaacga gaccgagtcg ctcaagtcct acctggagcg ggaggatttc ttcttctatt 481 ctctagtcta cgacccacag cagaagaccc tgctggcaga taaaggagag attcgagtag 541 gaaaccggta ccaggcagac atcaccgact tgttaaaaga aggcgaggag gatggccgag 601 accagtccag gttggagacc caggtgtggg aggcgcacaa cccactcaca gacaagcaga 661 tcgaccagtt cctggtggtg gcccgctctg tgggcacctt cgcacgggcc ctggactgca 721 gcagctccgt ccgacagccc agcctgcaca tgagcgccgc agctgcctcc cgagacatca 781 ccctgttcca cgccatggat actctccaca agaacatcta cgacatctcc aaggccatct 841 cggcgctggt gccgcagggc gggcccgtgc tctgcaggga cgagatggag gagtggtctg 901 catcagaggc caaccttttc gaggaagccc tggaaaaata tgggaaggat ttcacggaca 961 ttcagcaaga ttttctcccg tggaagtcgc tgaccagcat cattgagtac tactacatgt 1021 ggaagaccac cgacagatac gtgcagcaga aacgcttgaa agcagctgaa gctgagagca 1081 agttaaagca agtttatatt cccaactata acaagccaaa tccgaaccaa atcagcgtca 1141 acaacgtcaa ggccggtgtg gtgaacggca cgggggcgcc gggccagagc cctggggctg 1201 gccgggcctg cgagagctgt tacaccacac agtcttacca gtggtattct tggggtcccc 1261 ctaacatgca gtgtcgtctc tgcgcatctt gttggacata ttggaagaaa tatggtggct 1321 tgaaaatgcc aacccggtta gatggagaga ggccaggacc aaaccgcagt aacatgagtc 1381 cccacggcct cccagcccgg agcagcggga gccccaagtt tgccatgaag accaggcagg 1441 ctttctatct gcacacgacg aagctgacgc ggatcgcccg gcgcctgtgc cgtgagatcc 1501 tgcgcccgtg gcacgctgcg cggaacccct acctgcccat caacagcgcg gccatcaagg 1561 ccgagtgcac ggcgcggctg cccgaagcct cccagagccc gctggtgctg aagcaggcgg 1621 tacgcaagcc gctggaagcc gtgcttcggt atcttgagac ccacccccgc ccccccaagc 1681 ctgaccccgt gaaaagcgtg tccagcgtgc tcagcagcct gacgcccgcc aaggtggccc 1741 ccgtcatcaa caacggctcc cccaccatcc tgggcaagcg cagctacgag cagcacaacg 1801 gggtggacgg caacatgaag aagcgcctct tgatgcccag taggggtctg gcaaaccacg 1861 gacagaccag gcacatggga ccaagccgga acctcctgct caacgggaag tcctacccca 1921 ccaaagtgcg cctgatccgg gggggctccc tgcccccagt caagcggcgg cggatgaact 1981 ggatcgacgc cccgggtgac gtgttctaca tgcccaaaga ggagaccagg aagatccgca 2041 agctgctctc atcctcggaa accaagcgtg ctgcccgccg gccctacaag cccatcgccc 2101 tgcgccagag ccaggccctg ccgccgcggc caccgccacc tgcgcccgtc aacgacgagc 2161 ccatcgtcat cgaggactag gggccgcccc cacctgcggc cgccccccgc ccctcgcccg 2221 cccacacggc cccttcccag ccagcccgcc gcccgcccct cagtttggta gtgccccacc 2281 tcccgccctc acctgaagag aaacgcgctc cttggcggac actgggggag gagaggaaga 2341 agcgcggcta acttattccg agaatgccga ggagttgtcg tttttagctt tgtgtttact 2401 ttttggctgg agcggagatg aggggccacc ccgtgcccct gtgctgcggg gccttttgcc 2461 cggaggccgg gccctaaggt tttgttgtgt tctgttgaag gtgccatttt aaattttatt 2521 tttattactt tttttgtaga tgaacttgag ctctgtaact tacacctgga atgttaggat 2581 cgtgcggccg cggccggccg agctgcctgg cggggttggc ccttgtcttt tcacccgccc // LOCUS HSU35146 1993 bp DNA PRI 31-DEC-1996 DEFINITION Human p56 KKIAMRE protein kinase (KKIAMRE), complete cds. ACCESSION U35146 NID g1517819 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1993) AUTHORS Taglienti,C.A., Wysk,M. and Davis,R.J. TITLE Molecular cloning of the epidermal growth factor-stimulated protein kinase p56 KKIAMRE JOURNAL Oncogene 13 (12), 2563-2574 (1996) MEDLINE 97152547 REFERENCE 2 (bases 1 to 1993) AUTHORS Taglienti,C.A. TITLE Direct Submission JOURNAL Submitted (31-AUG-1995) Cherie A. Taglienti, Molecular Medicine, University of Massachusetts Medical School, 373 Plantation Street, Worcester, MA 01605, USA FEATURES Location/Qualifiers source 1..1993 /organism="Homo sapiens" /db_xref="taxon:9606" gene 376..1857 /gene="KKIAMRE" CDS 376..1857 /gene="KKIAMRE" /note="similar to human p42 KKIALRE gene, GenBank Accession Number X66358; these protein kinases have mutually exclusive expression in testis (p56 KKIAMRE) and ovary (p42 KKIALRE)" /codon_start=1 /product="p56 KKIAMRE protein kinase" /db_xref="PID:g1517820" /translation="MEKYENLGLVGEGSYGMVMKCRNKDTGRIVAIKKFLESDDDKMV KKIAMREIKLLKQLRHENLVNLLEVCKKKKRWYLVFEFVDHTILDDLELFPNGLDYQV VQKYLFQIINGIGFCHSHNIIHRDIKPENILVSQSGVVKLCDFGFARTLAAPGEVYTD YVATRWYRAPELLVGDVKYGKAVDVWAIGCLVTEMFMGEPLFPGDSDIDQLYHIMMCL GNLIPRHQELFNKNPVFAGVRLPEIKEREPLERRYPKLSEVVIDLAKKCLHIDPDKRP FCAELLHHDFFQMDGFAERFSQELQLKVQKDARNVSLSKKSQNRKKEKEKDDSLVEER KTLVVQDTNADPKIKDYKLFKIKGSKIDGEKAEKGNRASNASCLHDSRTSHNKIVPST SLKDCSNVSVDHTRNPSVAIPPLTHNLSAVAPSINSGMGTETIPIQGYRVDEKTKKCS IPFVKPNRHSPSGIYNINVTTLVSGPPLSDDSGADLPQMEHQH" BASE COUNT 668 a 371 c 439 g 515 t ORIGIN 1 caggtgttgg tgcctgccgt gaacgcattc tgacctgggc cgtatctgtc tcccaagact 61 ttgtgcctat ggttggggac agagtgaggt cgttgccttg acgacgacag catgcggccc 121 gtggtcctcc taagtgtgag cttgcggcgg accgaggccc acctgcctcc ctgcctgctt 181 cgccctggac tcgtgactgc gtccgcagaa gaaatcacaa cagcgctgga attgctagtt 241 tgctaggcag catcttttgg acctgcgaac catatgcatt tcacctcaaa tttgtttcca 301 agttgaaaac ctttgggtct ttctatgcga acggattgaa gaaacgcaaa aagtttctac 361 ggactttaaa ttaaaatgga aaaatatgaa aacctgggtt tggttggaga agggagttat 421 ggaatggtga tgaagtgtag gaataaagat actggaagaa ttgtggccat aaagaagttc 481 ttagaaagtg acgatgacaa aatggttaaa aagattgcaa tgcgagaaat caagttacta 541 aagcaactta ggcatgaaaa cttggtgaat ctcttggaag tgtgtaagaa aaaaaaacga 601 tggtacctag tctttgaatt tgttgaccac acaattcttg atgacttgga gctctttcca 661 aatggactag actaccaagt agttcaaaag tatttgtttc agattattaa tggaattgga 721 ttttgtcaca gtcacaatat catacacaga gatataaagc cagagaatat attagtctcc 781 cagtctggcg ttgtcaagct atgcgatttt ggatttgcgc gaacattggc agctcctggg 841 gaggtttata ctgattatgt ggcaacccga tggtacagag ctccagaact attggttggt 901 gatgtcaagt atggcaaggc tgttgatgtg tgggccattg gttgtctggt aactgaaatg 961 ttcatggggg aacccctatt tcctggagat tctgatattg atcagctata tcatattatg 1021 atgtgtttag gtaatctaat tccaaggcat caggagcttt ttaataaaaa tcctgtgttt 1081 gctggagtaa ggttgcctga aatcaaggaa agagaacctc ttgaaagacg ctatcctaag 1141 ctctctgaag tggtgataga tttagcaaag aaatgcttac atattgaccc cgacaaaaga 1201 cccttctgtg ctgagctcct acaccatgat ttctttcaaa tggatggatt tgctgagagg 1261 ttttcccaag aactacagtt aaaagtacag aaagatgcca gaaatgtttc tttatctaaa 1321 aaatcccaaa acagaaagaa ggaaaaagaa aaagatgatt ccttagttga agaaagaaaa 1381 acacttgtgg tacaggatac caatgctgat cccaaaatta aggattataa actatttaaa 1441 ataaaaggct caaaaattga tggagaaaaa gctgaaaaag gcaatagagc ttcaaatgcc 1501 agctgtctcc atgacagtag gacaagccac aacaaaatag tgccttcaac aagcctcaaa 1561 gactgcagca atgtcagcgt ggaccacaca aggaatccaa gcgtggcaat tcccccactt 1621 acacacaatc tttctgcagt tgctcccagc attaattctg gaatggggac tgagactata 1681 ccaattcagg gttacagagt ggatgagaaa actaagaagt gttctattcc atttgttaaa 1741 ccgaacagac attccccatc aggcatttat aacattaatg tgaccacatt agtatcagga 1801 cctcccctgt cagatgattc aggggctgat ttgcctcaaa tggaacacca gcactgagaa 1861 ccattttggt tctgaactgg atgatgctct tgcacttgag atgacatctt cttgcagcaa 1921 gaaaaaaaaa aaaaaaaaaa aaaaaaaaac aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1981 aaaaaaaaaa aaa // LOCUS HSU35232 1180 bp DNA PRI 15-NOV-1995 DEFINITION Human neuropeptide Y4 receptor protein gene, complete cds. ACCESSION U35232 NID g1063629 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1180) AUTHORS Bard,J.A., Walker,M.W., Branchek,T.A. and Weinshank,R.L. TITLE Cloning and functional expression of a human Y4 subtype receptor for pancreatic polypeptide, neuropeptide Y, and peptide YY JOURNAL J. Biol. Chem. 270 (45), 26762-26765 (1995) MEDLINE 96070761 REFERENCE 2 (bases 1 to 1180) AUTHORS Bard,J.A. TITLE Direct Submission JOURNAL Submitted (31-AUG-1995) Jonathan A. Bard, Molecular Biology, Synaptic Pharmaceutical Corporation, 215 College Rd., Paramus, NJ 07652, USA FEATURES Location/Qualifiers source 1..1180 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hp25a" /tissue_type="placenta" 5'UTR 1..27 CDS 28..1155 /codon_start=1 /product="neuropeptide Y4 receptor protein" /db_xref="PID:g1063630" /translation="MNTSHLLALLLPKSPQGENRSKPLGTPYNFSEHCQDSVDVMVFI VTSYSIETVVGVLGNLCLMCVTVRQKEKANVTNLLIANLAFSDFLMCLLCQPLTAVYT IMDYWIFGETLCKMSAFIQCMSVTVSILSLVLVALERHQLIINPTGWKPSISQAYLGI VLIWVIACVLSLPFLANSILENVFHKNHSKALEFLADKVVCTESWPLAHHRTIYTTFL LLFQYCLPLGFILVCYARIYRRLQRQGRVFHKGTYSLRAGHMKQVNVVLVVMVVAFAV LWLPLHVFNSLEDWHHEAIPICHGNLIFLVCHLLAMASTCVNPFIYGFLNTNFKKEIK ALVLTCQQSAPLEESEHLPLSTVHTEVSKGSLRLSGRSNPI" 3'UTR 1156..1180 BASE COUNT 234 a 382 c 285 g 279 t ORIGIN 1 gagtcctgga atcttttcac atccactatg aacacctctc acctcctggc cttgctgctc 61 ccaaaatctc cacaaggtga aaacagaagc aaacccctgg gcaccccata caacttctct 121 gaacattgcc aggattccgt ggacgtgatg gtcttcatcg tcacttccta cagcattgag 181 actgtcgtgg gggtcctggg taacctctgc ctgatgtgtg tgactgtgag gcagaaggag 241 aaagccaacg tgaccaacct gcttatcgcc aacctggcct tctctgactt cctcatgtgc 301 ctcctctgcc agccgctgac cgccgtctac accatcatgg actactggat ctttggagag 361 accctctgca agatgtcggc cttcatccag tgcatgtcgg tgacggtctc catcctctcg 421 ctcgtcctcg tggccctgga gaggcatcag ctcatcatca acccaacagg ctggaagccc 481 agcatctcac aggcctacct ggggattgtg ctcatctggg tcattgcctg tgtcctctcc 541 ctgcccttcc tggccaacag catcctggag aatgtcttcc acaagaacca ctccaaggct 601 ctggagttcc tggcagataa ggtggtctgt accgagtcct ggccactggc tcaccaccgc 661 accatctaca ccaccttcct gctcctcttc cagtactgcc tcccactggg cttcatcctg 721 gtctgttatg cacgcatcta ccggcgcctg cagaggcagg ggcgcgtgtt tcacaagggc 781 acctacagct tgcgagctgg gcacatgaag caggtcaatg tggtgctggt ggtgatggtg 841 gtggcctttg ccgtgctctg gctgcctctg catgtgttca acagcctgga agactggcac 901 catgaggcca tccccatctg ccacgggaac ctcatcttct tagtgtgcca cttgcttgcc 961 atggcctcca cctgcgtcaa cccattcatc tatggctttc tcaacaccaa cttcaagaag 1021 gagatcaagg ccctggtgct gacttgccag cagagcgccc ccctggagga gtcggagcat 1081 ctgcccctgt ccacagtaca tacggaagtc tccaaagggt ccctgaggct aagtggcagg 1141 tccaatccca tttaaccagg tctaggtctt ctccctgcca // LOCUS HSU35246 1987 bp mRNA PRI 15-JAN-1997 DEFINITION Human vacuolar protein sorting homolog h-vps45 mRNA, complete cds. ACCESSION U35246 NID g1477465 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1987) AUTHORS Pevsner,J., Hsu,S.C., Hyde,P.S. and Scheller,R.H. TITLE Mammalian homologues of yeast vacuolar protein sorting (vps) genes implicated in Golgi-to-lysosome trafficking JOURNAL Gene 183 (1-2), 7-14 (1996) MEDLINE 97149272 REFERENCE 2 (bases 1 to 1987) AUTHORS Pevsner,J. TITLE Direct Submission JOURNAL Submitted (31-AUG-1995) Jonathan Pevsner, Dept. Mol. Cell. Physiol., Stanford University Medical Center, Beckman Center, Stanford, CA 94305, USA FEATURES Location/Qualifiers source 1..1987 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="brain cDNA library" CDS 44..1756 /note="homolog of yeast Vps45p" /codon_start=1 /product="vacuolar protein sorting homolog h-vps45" /db_xref="PID:g1477466" /translation="MNVVFAVKQYISKMIEDSGPGMKVLLMDKETTGIVSMVYTQSEI LQKEVYLFERIDSQNREIMKHLKAICFLRPTKENVDSLIQELRRPKYSIYFIYFSNVI SKSDVKSLAEADEQEVVAEVQEFYGDYIAVNPHLFSLNILGCCQGRNWDPAQLSRTTQ GLTALLLSLKKCPMIRYQLSSEAAKRLGECVKQVISKEYELFEFRRTEVPPLLLILDR CDDAITPLLNQWTYQAMVHELLDINNNRIDLSRVPGISKDLREVVLSAENDEFYANNM YLNFAEIGSNIKNLMEDFQKKRPKEQQKLESIADMKAFVENYPQFKKMSGTVSKHVTV VGELSRLVSERNLLEVSEVEQELACQNDHSSALQNVKRLLQNPKVTEFDAVRLVMLYA LHYERHSSNSLPGLIVDLRSKGVAEKYRKLVSAVVEYGGKRVRGSDLFSPKDAVAITK QFLKGLKGVENVYTQHQPFLHETLDHLIKGKLKENLYPYLGPSTLRDRPQDIIVFVIG GATYEEALTVYNLNRTTPGVRIVLGGTTIHNTKSFLEEVLASGLHSRSRESSQATSRS ASRR" BASE COUNT 557 a 445 c 496 g 489 t ORIGIN 1 ttcggcacga gaagggctgt agggtacttg tcaattcgcc gccatgaacg tggtttttgc 61 tgtgaagcag tacatttcca aaatgataga ggacagcggg cctggtatga aagtacttct 121 catggataaa gagacgactg gcatagtgag tatggtatac acacaatcgg agattctaca 181 gaaggaagtg tacctctttg aacgcattga ttctcaaaat cgagagatca tgaaacacct 241 gaaggcaatt tgtttccttc gacctacaaa ggagaatgtg gattctctga tccaggagct 301 ccgaagaccc aagtatagca tatattttat ttatttcagt aatgtgatca gcaagagtga 361 cgtgaagtcc ttggctgaag ctgacgagca ggaagttgtg gctgaagttc aggaatttta 421 tggagattat attgctgtga atccacattt gttttccctc aatatcttgg gctgctgtca 481 gggtcgaaat tgggatccag cccagctatc cagaaccact caagggctga ccgctctcct 541 tttgtctctg aagaagtgtc ccatgattcg ttatcagctt tcatcagagg ctgcaaagag 601 actgggagag tgtgttaagc aagtgataag taaagagtat gaactctttg agttccggcg 661 gacagaggtt cctccactgc ttctcattct ggatcgctgc gatgatgcca tcaccccact 721 gctcaaccag tggacatatc aggccatggt ccatgaacta ctggacataa acaacaaccg 781 gattgatctt tccagagtgc caggaatcag caaagactta agagaggtgg tcctgtccgc 841 tgaaaatgat gaattctatg ctaataacat gtacctgaac tttgccgaga ttggtagcaa 901 tataaagaat ctcatggaag atttccagaa gaagagaccg aaagagcagc aaaagctaga 961 gtccatagcg gacatgaagg cctttgttga aaattatcca caattcaaga agatgtctgg 1021 gactgtctca aagcatgtga cagtcgttgg ggaactgtct cggttggtca gtgaacgaaa 1081 cctgctggag gtttcagagg ttgagcaaga actggcctgt cagaatgacc attctagtgc 1141 tcttcagaat gtgaagagac tcctgcagaa tccgaaagtt acagaatttg atgcagttcg 1201 cctggtgatg ctttatgctc tacattatga gcgccacagc agcaacagcc tgccagggct 1261 catagtggac ctcaggagta aaggtgtcgc tgagaaatat cggaagcttg tgtctgcagt 1321 tgttgaatat ggtggtaaac gggttagagg aagtgacctc ttcagcccca aagatgctgt 1381 ggctattacc aaacagttcc tcaaaggcct gaagggagtg gaaaatgtgt acacccagca 1441 ccagcctttt ctgcatgaga ccctggacca tctcatcaaa gggaagctta aggaaaacct 1501 ctatccgtat ttaggcccca gcacactcag agacaggcct caggacatca tcgtgttcgt 1561 tattggagga gccacctatg aagaggcact gacagtctat aacctcaacc gtaccactcc 1621 tggagtgagg atcgttctgg gaggaacaac aatacacaac acaaaaagtt tcctagagga 1681 agtcctggct tctgggctgc acagccgcag cagagagagc tcacaggcca cctcaaggtc 1741 agcaagcaga agatgagatg gcagttgaga acagagagaa tgtcacggcc tccctcttat 1801 cccgtctcgg ccttccctgt gaccaggggc ttcagggtag ctcttgtgtt tgctctaact 1861 cactccaggt aaccctgatt cctgaactgc tggcacacac gccggctggg actgccaatg 1921 atgagcccac cccaacccac ttgtgattct gtatcaaata aatatgttct ttgtgtattt 1981 gacagca // LOCUS HSU35376 2320 bp mRNA PRI 12-OCT-1995 DEFINITION Human repressor transcriptional factor (ZNF85) mRNA, complete cds. ACCESSION U35376 NID g1017721 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2320) AUTHORS Poncelet,D.A., Marine,J.C., Demoitie,M.A., Pendeville-Samain,H., Lecocq,P.J. and Martial,J.A. TITLE The human ZNF85 gene encodes for a nuclear repressor protein that binds multiple factors with its KRAB domain JOURNAL Unpublished REFERENCE 2 (bases 1 to 2320) AUTHORS Poncelet,D.A. TITLE Direct Submission JOURNAL Submitted (05-SEP-1995) Dominique A. Poncelet, Laboratory of Molecular Biology and Genetic Engineering, Batiment de Chimie B6, Institut de Chimie, University of Liege, Allee du Six Aout, Sart-Tilman, Liege B-4000, Belgium FEATURES Location/Qualifiers source 1..2320 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="placenta" /chromosome="19" /map="19p12-p13.1" gene 119..1906 /gene="ZNF85" misc_feature 119..247 /gene="ZNF85" /note="encodes KRAB A domain" CDS 119..1906 /gene="ZNF85" /note="nuclear factor; DNA binding protein; Znf-91 related gene family" /codon_start=1 /product="repressor transcriptional factor" /db_xref="PID:g1017722" /translation="MGPLTFRDVAIEFSLKEWQCLDTAQRNLYRNVMLENYRNLVFLG ITVSKPDLITCLEQGKEAWSMKRHEIMVAKPTVMCSHFARDLWPEQNIKDSFQKVTLK RYGKCRHENLPLRKGCESMDECKMHKGGCNGLNQCLTATQSKIFQCDKYVKVAHKFSN SNRHEIRHTKKKPFKCTKCGKSFGMISCLTEHSRIHTRVNFYKCEECGKAFNWSSTLT KHKRIHTGEKPYKCEECGKAFNQSSNLIKHKKIHTGEKPYKCEECGKAFNRFSTLTTH KIIHTGEKPYKCKECGKAFNRSSTLTTHRKIHTGEKPYKCEECGKAFKQSSNLTTHKI IHTGEKPYKCKKCGKAFNQSAHLTTHEVIHTGEKPYKCEKCGKAFNHFSHLTTHKIIH TGEKPYKCKECGKAFKHSSTLTKHKIIHTGEKPYKSKECEKAFNQSSKLTEHKKIHTG EKPYECEKCGKAFNQSSNLTRHKKSHTEEKPYKCEECGKGFKWPSTLTIHKIIHTGEK PYKCEECGKAFNQSSKLTKHKKIHTGEKPYTCEECGKAFNQSSNLTKHKRIHTGEKPY KCEECDKAFKWSSVLTKHKIIHTGEKLQI" misc_feature 248..346 /gene="ZNF85" /note="encodes KRAB B domain" misc_feature 347..622 /gene="ZNF85" /note="Znf91 spacer type region" misc_feature 623..1906 /gene="ZNF85" /note="encodes zinc finger domain" /evidence=experimental BASE COUNT 903 a 423 c 412 g 582 t ORIGIN 1 ggggggtttt ccctgctttg tgttttctgc tcgtggacgc ccagcctctg tggccctgtg 61 gcctgcaggt attgggagat ccacagctaa gacgccggga ccccctggaa gcctagaaat 121 gggaccattg acatttaggg atgtggccat agaattctct ctgaaggagt ggcaatgcct 181 ggacactgca cagcggaatt tatatagaaa tgtgatgtta gagaactaca gaaacctggt 241 cttcctgggt attactgttt ctaagccaga cctgatcact tgtctggagc aagggaaaga 301 ggcctggagt atgaagagac atgagatcat ggtggccaaa cccacagtta tgtgttctca 361 ttttgcccga gacctttggc cggagcagaa tataaaagat tctttccaaa aagtgacact 421 gaaaagatat ggaaaatgta gacatgaaaa tttaccatta agaaaaggct gtgaaagtat 481 ggatgagtgt aagatgcaca aaggaggttg taatggactt aaccaatgtc tcacagctac 541 ccagagcaaa atatttcaat gtgataaata tgtaaaagtc gctcataaat tttcaaattc 601 aaacagacat gagataagac atactaaaaa gaaacctttc aaatgtacaa aatgtggcaa 661 atcatttggc atgatttcat gcctaactga acatagcaga attcatacta gagtaaattt 721 ctacaaatgt gaagaatgtg gaaaagcctt taactggtcc tcaaccctta ctaaacataa 781 gagaattcat acgggagaga aaccttacaa atgtgaagaa tgtggtaaag cctttaacca 841 gtcctcaaac cttattaaac ataagaaaat tcatactgga gagaaaccct acaaatgtga 901 agaatgtggc aaagctttta accgattctc aactcttact acccataaga taattcatac 961 tggagagaaa ccctacaaat gtaaagaatg tggtaaagct tttaaccgat cttcaaccct 1021 tactacccat agaaaaattc atactggaga gaaaccttac aaatgtgaag aatgtggcaa 1081 agcctttaag cagtcctcaa accttactac acataagata attcatactg gagagaaacc 1141 ctacaaatgt aaaaaatgtg gaaaagcctt taaccagtct gcacacctta ccacacatga 1201 ggtaattcat actggagaga aaccctacaa atgtgaaaaa tgtggaaaag cctttaatca 1261 tttctcacac cttactacac ataagataat tcatactgga gagaaacctt acaaatgtaa 1321 agaatgtggt aaagctttta aacactcttc aacccttact aaacataaga taattcatac 1381 tggagagaag ccttacaaat ctaaagaatg tgaaaaagct tttaaccaat cctcaaaact 1441 tactgaacat aagaaaattc atactggaga gaaaccctat gaatgtgaaa aatgtggcaa 1501 agcttttaac cagtcctcaa atcttactag acataagaaa agtcatacag aagagaaacc 1561 ttacaaatgt gaagaatgtg gcaaaggttt taaatggccc tcaaccctta ctatccataa 1621 gataattcat actggagaga aaccatacaa atgtgaagaa tgtggcaaag cttttaacca 1681 atcctcaaaa cttaccaaac ataagaaaat tcatactgga gagaaaccct acacatgtga 1741 agaatgtggc aaagccttta accagtcctc aaaccttact aaacataaga gaattcatac 1801 tggagaaaaa ccttacaaat gtgaagaatg tgacaaagct tttaaatggt cctcagtcct 1861 tactaaacat aagataattc ataccggaga aaaattacaa atatgaaaat tatggcaaag 1921 ctttaatcaa tttacaagtc ttactaaaca taagaaaatt tatactggag agaaactact 1981 aacctgaaag atgtgacaat aattttgaca acacctcaga cttataaaag taatcatact 2041 ggtgagaaat tctaaaaatg tgaagactat ggcaaagtct ttggttgtca cactttaggt 2101 aagataattc atattggaac aaactacaag tgcaaacaat gtggcaaaac ttaatttatg 2161 ctcacacctt actgcacaga aaagaatttt tagttgagaa aaagtataca aatataaaga 2221 atgtggaaaa gccgttaata tctgctcaca tcttactcag catcagaaag tacttaataa 2281 aagcattata aatggaaaaa aaaaaaaaaa aaaaaaaaaa // LOCUS HSU35398 1280 bp mRNA PRI 08-OCT-1995 DEFINITION Human G protein-coupled receptor mRNA, complete cds. ACCESSION U35398 NID g1015418 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 101 to 1195) AUTHORS An,S. and Goetzl,E.J. TITLE Cloning, sequencing and tissue distribution of two related G protein-coupled receptor candidates JOURNAL Unpublished REFERENCE 2 (bases 1 to 1280) AUTHORS An,S. TITLE Direct Submission JOURNAL Submitted (02-SEP-1995) Songzhu An, Medicine, University of California, San Francisco, 533 Parnassus, San Francisco, CA 94143, USA FEATURES Location/Qualifiers source 1..1280 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="12A" CDS 101..1198 /codon_start=1 /product="G protein-coupled receptor" /db_xref="PID:g1015419" /translation="MGNITADNSSMSCTIDHTIHQTLAPVVYVTVLVVGFPANCLSLY FGYLQIKARNELGVYLCNLTVADLFYICSLPFWLQYVLQHDNWSHGDLSCQVCGILLY ENIYISVGFLCCISVDRYLAVAHPFRFHQFRTLKAAVRVTVVIWAKELLTSIYFLMHE EVIEDENQHRVCFEHYPIQAWQRAINYYRFLVGFLFPICLLLASYQGILRAVRRSHGT QKSRKDQIQRLVLSTVVIFLACFLPYHVLLLVRSVWEASCDFAKGVFNAYHFSLLLTS FNCVADPVLYCFVSETTHRDLARLRGACLAFLTCSRTGRAREAYPLGAPEASGKSGAQ GEEPELLTKLHPAFQTPNSPGSGGFPTGRLA" BASE COUNT 202 a 464 c 366 g 248 t ORIGIN 1 ccttggtaaa gcttgaacca ccttctataa acaggatggc ggtggagaga caggcccagt 61 ccctgagccc gtgaggagtg tggccccttc aggcccaaag atggggaaca tcactgcaga 121 caactcctcg atgagctgta ccatcgacca taccatccac cagacgctgg ccccggtggt 181 ctatgttacc gtgctggtgg tgggcttccc ggccaactgc ctgtccctct acttcggcta 241 cctgcagatc aaggcccgga acgagctggg cgtgtacctg tgcaacctga cggtggccga 301 cctcttctac atctgctcgc tgcccttctg gctgcagtac gtgctgcagc acgacaactg 361 gtctcacggc gacctgtcct gccaggtgtg cggcatcctc ctgtacgaga acatctacat 421 cagcgtgggc ttcctctgct gcatctccgt ggaccgctac ctggctgtgg cccatccctt 481 ccgcttccac cagttccgga ccctgaaggc ggccgtccgc gtcacggtgg tcatctgggc 541 caaggagctg ctgaccagca tctacttcct gatgcacgag gaggtcatcg aggacgagaa 601 ccagcaccgc gtgtgctttg agcactaccc catccaggca tggcagcgcg ccatcaacta 661 ctaccgcttc ctggtgggct tcctcttccc catctgcctg ctgctggcgt cctaccaggg 721 catcctgcgc gccgtgcgcc ggagccacgg cacccagaag agccgcaagg accagatcca 781 gcggctggtg ctcagcaccg tggtcatctt cctggcctgc ttcctgccct accacgtgtt 841 gctgctggtg cgcagcgtct gggaggccag ctgcgacttc gccaagggcg ttttcaacgc 901 ctaccacttc tccctcctgc tcaccagctt caactgcgtc gccgaccccg tgctctactg 961 cttcgtcagc gagaccaccc accgggacct ggcccgcctc cgcggggcct gcctggcctt 1021 cctcacctgc tccaggaccg gccgggccag ggaggcctac ccgctgggtg cccccgaggc 1081 ctccgggaaa agcggggccc agggtgagga gcccgagctg ttgaccaagc tccacccggc 1141 cttccagacc cctaactcgc cagggtcggg cgggttcccc acgggcaggt tggcctagcc 1201 tgggtcctcc gcgggtggct ccacgtgagg cctgagcttc agcccacggc ctcagggctg 1261 ccgcctcctg cttccctcgc // LOCUS HSU35451 2148 bp mRNA PRI 23-OCT-1997 DEFINITION Homo sapiens heterochromatin protein p25 mRNA, complete cds. ACCESSION U35451 NID g1177844 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2148) AUTHORS Furuta,K., Chan,E.K.L., Kiyosawa,K., Reimer,G., Luderschmidt,C. and Tan,E.M. TITLE Heterochromatin protein HP1Hsbeta (p25beta) and its localization with centromeres in mitosis JOURNAL Chromosoma 106 (1), 11-19 (1997) MEDLINE 97313386 REFERENCE 2 (bases 1 to 2148) AUTHORS Furuta,K., Chan,E.K.L., Kiyosawa,K. and Tan,E.M. TITLE Direct Submission JOURNAL Submitted (05-SEP-1995) Edward K.L. Chan, Molecular and Experimental Medicine, The Scripps Research Institute, 10550 N. Torrey Pines Road, La Jolla, CA 92037, USA FEATURES Location/Qualifiers source 1..2148 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="HepG2 cells" gene 1..2148 /gene="p25beta" CDS 217..774 /gene="p25beta" /codon_start=1 /product="heterochromatin protein p25" /db_xref="PID:g1177845" /translation="MGKKQNKKKVEEVLEEEEEEYVVEKVLDRRVVKGKVEYLLKWKG FSDEDNTWEPEENLDCPDLIAEFLQSQKTAHETDKSEGGKRKADSDSEDKGEESKPKK KKEESEKPRGFARGLEPERIIGATDSSGELMFLMKWKNSDEADLVPAKEANVKCPQVV ISFYEERLTWHSYPSEDDDKKDDKN" polyA_signal 2130..2135 /gene="p25beta" polyA_site 2148 /gene="p25beta" BASE COUNT 632 a 423 c 563 g 530 t ORIGIN 1 ttatgggagg ctgaggggag ggccgttggc cggggcctgc ggtacgccgc ttcagtgagg 61 gacgccactg cggccacccg gcttgctgcc ttcctgggcg ccactccccc aggcgacccg 121 acgcgacgcg ccagtagcgc agcaccgatt cctctcgggc tcttgggcgc tgctctgagc 181 agcgtcaccc tttacaccag aaagctggcg ggcactatgg ggaaaaaaca aaacaagaag 241 aaagtagagg aggtgctaga agaggaggaa gaggaatatg tggtggaaaa agttctcgac 301 cgtcgagtgg taaagggcaa agtggagtac ctcctaaagt ggaagggatt ctcagatgag 361 gacaacacat gggagccaga agagaacctg gattgccccg acctcattgc tgagtttctg 421 cagtcacaga aaacagcaca tgagacagat aaatcagagg gaggcaagcg caaagctgat 481 tctgattctg aagataaggg agaggagagc aaaccaaaga agaagaaaga agagtcagaa 541 aagccacgag gctttgctcg aggtttggag ccggagcgga ttattggagc tacagactcc 601 agtggagagc tcatgttcct gatgaaatgg aaaaactctg atgaggctga cctggtccct 661 gccaaggaag ccaatgtcaa gtgcccacag gttgtcatat ccttctatga ggaaaggctg 721 acgtggcatt cctacccctc ggaggatgat gacaaaaaag atgacaagaa ctaacgctcc 781 tgagtaccag cccctgtcac atctgactgt gggtttcaag tgggaaggga aggagttcta 841 cttgtcttga caccatagag gtggcttgag aagatgtcct ttgaagagcc agtatagttt 901 ctgttccctg cagcagccca agtgctttaa agccgtttca agctgtatag tttgcacacc 961 catcccagtg gaggggaaag gggataagtg tttcaaggca accttttctg cactttgctg 1021 cgaaaagcaa agggccttct atgaaggaca acccttgcag aattgggtgt gtgggagagc 1081 aaaaaaatac tgtagatctt caaagagcat ctccacaacc cacagccttc ttcccaatag 1141 tgttaactct gcatttttac agcgtagcat gtgtgtagtt tttggctatt actggtgtat 1201 tatttggggg agggagggat ggggagggga gaaagggaga tgggtagcat cattttgatt 1261 aacatttggg gcctgatagg ggaaatggtg aagcaatgga aaagaacaga caactaatga 1321 tttgcttcta tgtccagaat attttacctt taaaaaaatg tcattggcac cataaataag 1381 gactgtgaga gactgtttaa aagctgtgaa agtctgaaac ctataagcca aggtgttccc 1441 tgcctaaact tattgctgtt cccacaaagg actaagcctg ttcataagtt accaaagttg 1501 ccattttgga gatggaaatt gacgaggagg gaaggtcttt tattggagag tatacaggac 1561 aagcagatca ttctgcctta gaggtgctaa ttcccgaaat tagaagaccc tttcttttcc 1621 agtaatgaag ttataaatat cagcttgttc atccaagcca ctggctgagg tgttaggaag 1681 aggaagaggg tggtagagga ggtaagacag tagggaaaga caagggccca tgctcttagt 1741 ggggaaaact cttggacccg tttactttga gctttgaaca ctgaaaccat tgttggcagg 1801 gttcagtcac tgacagcaca agtttcactg aattgatcca agagtttagt gatttcaaaa 1861 gccttggtct caggagaaga ttaaactttc atattgggca gtggttcact ttaaaacaca 1921 cacatacaca cacaaaacaa ttttttaaga aatcctaata agtaacatac ccaaaatgct 1981 ctgtcttgag tcatgagaac catcagttct tgatattgtc tagacttgca tctagagcta 2041 cgttgtaaaa ttcttttagg catgtgttag atttctgtgt aaactttgtt taaatgtaaa 2101 cttcatacta cattgtcagt ttttgtctta ataaaactat agatttat // LOCUS HSU35459 1194 bp mRNA PRI 16-NOV-1995 DEFINITION Human bomapin mRNA, complete cds. ACCESSION U35459 NID g1065408 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1194) AUTHORS Riewald,M. and Schleef,R.R. TITLE Molecular cloning of bomapin (protease inhibitor 10), a novel human serpin that is expressed specifically in the bone marrow JOURNAL J. Biol. Chem. 270 (45), 26754-26757 (1995) MEDLINE 96070759 REFERENCE 2 (bases 1 to 1194) AUTHORS Riewald,M. TITLE Direct Submission JOURNAL Submitted (06-SEP-1995) Matthias Riewald, Department of Vascular Biology, The Scripps Research Institute, 10666 North Torrey Pines Road, La Jolla, CA 92037, USA FEATURES Location/Qualifiers source 1..1194 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="bone marrow" /dev_stage="adult" CDS 1..1194 /codon_start=1 /evidence=experimental /product="bomapin" /db_xref="PID:g1065409" /translation="MDSLATSINQFALELSKKLAESAQGKNIFFSSWSISTSLTIVYL GAKGTTAAQMAQVLQFNRDQGVKCDPESEKKRKMEFNLSNSEEIHSDFQTLISEILKP NDDYLLKTANAIYGEKTYAFHNKYLEDMKTYFGAEPQPVNFVEASDQIRKDINSWVER QTEGKIQNLLPDDSVDSTTRMILVNALYFKGIWEHQFLVQNTTEKPFRINETTSKPVQ MMFMKKKLHIFHIEKPKAVGLQLYYKSRDLSLLILLPEDINGLEQLEKAITYEKLNEW TSADMMELYEVQLHLPKFKLEDSYDLKSTLSSMGMSDAFSQSKADFSGMSSARNLFLS NVFHKAFVEINEQGTEAAAGSGSEIDIRIRVPSIEFNANHPFLFFIRHNKTNTILFYG RLCSP" BASE COUNT 402 a 259 c 239 g 294 t ORIGIN 1 atggactctc tagcaacatc aatcaaccag tttgccctgg agttgagcaa aaagctagct 61 gaatctgctc agggtaaaaa tatcttcttt tcttcctgga gcatctcaac ttccttgacc 121 atagtgtatt tgggcgccaa aggtaccact gcagcccaaa tggcccaggt gcttcaattt 181 aacagagacc agggagtcaa atgtgaccct gaaagtgaaa aaaaaaggaa aatggaattc 241 aacttgagca actcggaaga aatacactct gatttccaaa cacttatctc agaaatcctc 301 aagcccaacg atgactactt acttaaaaca gccaatgcga tatatggaga gaaaacgtat 361 gcatttcaca ataaatattt agaagacatg aaaacatatt ttggtgcaga acctcagcct 421 gttaactttg tggaagcttc tgatcaaatc agaaaggaca tcaactcttg ggttgaaaga 481 cagaccgagg gtaaaatcca gaatctcctg cctgatgact ctgtggattc cacaaccagg 541 atgattctgg tgaacgccct atactttaaa ggaatctggg aacatcaatt cttagtgcaa 601 aacaccacag aaaagccttt tagaataaac gagactacaa gcaaaccagt gcaaatgatg 661 tttatgaaga aaaagcttca catttttcac atagaaaagc caaaagcagt gggccttcaa 721 ctctactaca aaagccgtga cctcagcctg cttatactac tgccagaaga cattaatggg 781 ctggaacagc tggaaaaggc catcacctat gagaagctga atgagtggac cagtgcagac 841 atgatggagt tgtatgaagt gcagctacac cttcccaagt tcaagctgga agacagttat 901 gatctcaagt caaccctgag cagtatgggg atgagtgatg ccttcagcca aagcaaagct 961 gatttctcag gaatgtcttc agcaagaaac ctatttttgt ccaatgtttt ccataaggct 1021 tttgtggaaa taaatgaaca aggtactgaa gctgcagctg gcagtgggag tgagatagat 1081 atacgaatta gagtcccatc cattgaattc aatgcaaatc acccattcct cttcttcatc 1141 aggcacaata aaaccaacac cattcttttt tatggaagat tatgctcccc ctaa // LOCUS HSU355K 1521 bp mRNA PRI 03-FEB-1998 DEFINITION Homo sapiens mRNA for U3 snoRNP associated 55 kDa protein. ACCESSION AJ001340 NID g2832330 KEYWORDS 55 kDa protein; U3 snoRNP; U3-55k. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1521) AUTHORS Pluk,H., Soffner,J., Luhrmann,R. and van Venrooij,W.J. TITLE cDNA cloning and characterization of the human U3 snoRNP associated 55 kDa protein JOURNAL Unpublished REFERENCE 2 (bases 1 to 1521) AUTHORS Pluk,H. TITLE Direct Submission JOURNAL Submitted (01-SEP-1997) Pluk H., Department of Biochemistry, University of Nijmegen, P.O. Box 9101, 6500 HB Nijmegen, THE NETHERLANDS FEATURES Location/Qualifiers source 1..1521 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="teratocarcinoma" gene 35..1462 /gene="U3-55k" CDS 35..1462 /gene="U3-55k" /codon_start=1 /product="U3 snoRNP associated 55 kDa protein" /db_xref="PID:e1249330" /db_xref="PID:g2832331" /translation="MSATAAARKRGKPASGAGAGAGAGKRRRKADSAGDRGKSKGGGK MNEEISSDSESESLAPRKPEEEEEEELEETAQEKKLRLAKLYLEQLRQQEEEKAEARA FEEDQVAGRLKEDVLEQRGRLQKLVAKEIQAPASADIRVLRGHQLSITCLVVTPDDSA IFSAAKDCSIIKWSVESGRKLHVIPRAKKGAEGKPPGHSSHVLCMAISSDGKYLASGD RSKLILIWEAQSCQHLYTFTGHRDAVSGLAFRRGTHQLYSTSHDRSVKVWNVAENSYV ETLFGHQDAVAALDALSRECCVTAGGRDGTVRVWKIPEESQLVFYGHQGSIDCIHLIN EEHMVSGADDGSVALWGLSKKRPLALQREAHGLRGEPGLEQPFWISSVAALLNTDLVA TGSHSSCVRLWQCGEGFRQLDLLCDIPLVGFINSLKFSSSGDFLVAGVGQEHRLGRWW RIKEARNSVCIIPLRRVPVPPAAGS" BASE COUNT 305 a 432 c 493 g 291 t ORIGIN 1 gaattccgcc gctgctacac gcctggtggg cagcatgtcg gcaacagcgg ctgctcgtaa 61 gcggggaaag ccggcctctg gggccggggc tggcgcgggg gccggcaagc ggcggcgaaa 121 ggccgactct gcgggggaca ggggcaaatc caagggtggc ggcaagatga atgaggagat 181 ctccagcgac tctgagagcg agagcctagc tccaaggaag cctgaggagg aggaggagga 241 ggagctggag gaaactgcac aggaaaagaa gctgcgcttg gccaagctct acctagagca 301 gctccgtcag caagaggagg agaaggctga ggcccgtgca tttgaggagg accaggtggc 361 ggggcgcctg aaggaggatg tgcttgagca gaggggcagg ctgcagaagt tggtggcaaa 421 agagatccag gccccagcct cagctgacat tcgcgtttta cgggggcacc agctctctat 481 cacatgtttg gtcgtcaccc ccgatgactc agccatcttc tctgctgcca aagactgcag 541 catcattaag tggagcgtgg agagtggacg gaagctgcat gtgattcctc gagccaagaa 601 gggtgccgag ggaaagcccc ctggccacag cagccacgtc ctctgcatgg ccatctcctc 661 cgacggcaag taccttgcct ctggtgaccg cagcaagctc attctcattt gggaggccca 721 gagctgccag cacttgtaca ccttcacagg acaccgggat gcagtgtcgg gtctggcatt 781 ccgcagaggc acccaccagc tctacagcac atcccacgat cgctccgtga aggtgtggaa 841 tgtggcagag aactcctacg tggagacgct cttcggacac caggacgctg tggctgcact 901 ggatgccttg agccgggagt gctgtgtgac ggctgggggc cgggatggga ctgtacgtgt 961 gtggaagatc cccgaggagt cccagcttgt cttctatggc caccagggct ccatcgactg 1021 catccaccta atcaatgagg agcacatggt gtccggcgcg gacgatggct ctgtggcctt 1081 gtggggtctc tccaagaagc gaccacttgc cctgcagcgt gaagctcacg ggctgcgggg 1141 agagccaggc ctggagcagc ccttctggat atcgtcggtg gcagccctcc tcaacacaga 1201 ccttgtggcc acaggctccc acagctcctg tgtgcggctt tggcagtgtg gggaaggctt 1261 ccggcagctt gaccttctct gtgacatccc cctggtgggt tttatcaaca gcctcaagtt 1321 ctccagctct ggggacttcc tggtggctgg ggtagggcag gagcacaggc ttggccgatg 1381 gtggagaatc aaagaggctc ggaattctgt ctgcatcatc ccactccgca gggtccctgt 1441 acccccagct gctggttcct gacactctta tcctccttat ttaagtcctt cccaggctat 1501 gccccaccct ctttgaagct t // LOCUS HSU35735 2725 bp mRNA PRI 18-MAY-1996 DEFINITION Human RACH1 (RACH1) mRNA, complete cds. ACCESSION U35735 NID g1322221 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2725) AUTHORS Davey,S. and Beach,D. TITLE RACH2, a novel human gene that complements a fission yeast cell cycle checkpoint mutation JOURNAL Mol. Biol. Cell 6 (10), 1411-1421 (1995) MEDLINE 96117053 REFERENCE 2 (bases 1 to 2725) AUTHORS Davey,S. and Beach,D. TITLE Direct Submission JOURNAL Submitted (08-SEP-1995) Scott Davey, Cold Spring Harbor Laboratory, P.O. Box 100, Cold Spring Harbor, NY 11724, USA FEATURES Location/Qualifiers source 1..2725 /organism="Homo sapiens" /db_xref="taxon:9606" gene 169..1338 /gene="RACH1" CDS 169..1338 /gene="RACH1" /codon_start=1 /product="RACH1" /db_xref="PID:g1322222" /translation="MEDSPTMVRVDSPTMVRGENQVSPCQGRRCFPKALGYVTGDMKE LANQLKDKPVVLQFIDWILRGISQVVFVNNPVSGILILVGLLVQNPWWALTGWLGTVV STLMALLLSQDRSLIASGLYGYNATLVGVLMAVFSDKGDYFWWLLLPVCAMSMTCPIF SSALNSMLSKWDLPVFTLPFNMALSMYLSATGHYNPFFPAKLVIPITTAPNISWSDLS ALELLKSIPVGVGQIYGCDNPWTGGIFLGAILLSSPLMCLHAAIGSLLGIAAGLSLSA PFENIYFGLWGFNSSLACIAMGGMFMALTWQTHLLALGCALFTAYLGVGMANFMAEVG LPACTWPFCLATLLFLIMTTKNSNIYKMPLSKVTYPEENRIFYLQAKKRMVESPL" BASE COUNT 732 a 631 c 602 g 760 t ORIGIN 1 agtaagcact ctcccttgtc gtggaggtgg gcaaatcttt atcagccact gccttctgct 61 gccaggaagc cagctagagt ggtctttaaa gaaaactggg catctcctgc tacttaaaat 121 caaaaactac ctaaaataaa gattatagag ccagaggaag agatagccat ggaggacagc 181 cccactatgg ttagagtgga cagccccact atggttaggg gtgaaaacca ggtttcgcca 241 tgtcaaggga gaaggtgctt ccccaaagct cttggctatg tcaccggtga catgaaagaa 301 cttgccaacc agcttaaaga caaacccgtg gtgctccagt tcattgactg gattctccgg 361 ggcatatccc aagtggtgtt cgtcaacaac cccgtcagtg gaatcctgat tctggtagga 421 cttcttgttc agaacccctg gtgggctctc actggctggc tgggaacagt ggtctccact 481 ctgatggccc tcttgctcag ccaggacagg tcattaatag catctgggct ctatggctac 541 aatgccaccc tggtgggagt actcatggct gtcttttcgg acaagggaga ctatttctgg 601 tggctgttac tccctgtatg tgctatgtcc atgacttgcc caattttctc aagtgcattg 661 aattccatgc tcagcaaatg ggacctcccc gtcttcaccc tccctttcaa catggcgttg 721 tcaatgtacc tttcagccac aggacattac aatccgttct ttccagccaa actggtcata 781 cctataacta cagctccaaa tatctcctgg tctgacctca gtgccctgga gttgttgaaa 841 tctataccag tgggagttgg tcagatctat ggctgtgata atccatggac agggggcatt 901 ttcctgggag ccatcctact ctcctcccca ctcatgtgcc tgcatgctgc cataggatca 961 ttgctgggca tagcagcggg actcagtctt tcagccccat ttgagaacat ctactttgga 1021 ctctggggtt tcaacagctc tctggcctgc attgcaatgg gaggaatgtt catggcgctc 1081 acctggcaaa cccacctcct ggctcttggc tgtgccctgt tcacggccta tcttggagtc 1141 ggcatggcaa actttatggc tgaggttgga ttgccagctt gtacctggcc cttctgtttg 1201 gccacgctat tgttcctcat catgaccaca aaaaattcca acatctacaa gatgcccctc 1261 agtaaagtta cttatcctga agaaaaccgc atcttctacc tgcaagccaa gaaaagaatg 1321 gtggaaagcc ctttgtgaga acaagcccca tttgcagcca tggtcacgag tcatttctgc 1381 ctgactgctc cagctaactt ccagggtctc agcaaactgc tgtttttcac gagtatcaac 1441 tttcatactg acgcgtctgt aatctgttct tatgctcatt ttgtattttc ctttcaactc 1501 caggaatatc cttgagcata tgagagtcac atccaggtga tgtgctctgg tatggaattt 1561 gaaaccccaa tggggccttg gcactaagac tggaatgtat ataaagtcaa agtgctccaa 1621 cagaaggagg aagtgaaaac aaactattag tatttattga tattcttggt gtttagctgg 1681 ctcgatgatg ttaacagtat taaaaattaa acccccataa acccaaccta agcctatgga 1741 atccacagtc acaaaatcga agttaaccca gaatctgtga taagcagctt ggcttttttt 1801 ttaaatcaat gcaagtacac cattatagcc agaatctgta tcacagaggt gcaagctgac 1861 agcagagctc agtccccact tcctgcaaac aatggcctgc accctatccc ttgtgtgtgt 1921 gacattctct catgggacaa tgttggggtt tttcagactg acaggactgc aagagggaga 1981 aaggaatttt gtcaatcaaa attattctgt attgcaactt ttctcagaga ttgcaaagga 2041 ttttttaggt agagattatt tttccttatg aaaaatgatc tgttttaaat gagataaaat 2101 aggagaagtt cctggcttaa cctgttctta catattaaag aaaagttact tactgtattt 2161 atgaaatact cagcttaggc atttttactt taacccctaa attgattttg taaatgccac 2221 aaatgcatag aattgttacc aacctccaaa gggctcttta aaatcatatt tttttattca 2281 tttgaggatg tcttataaag actgaaggca aaggtcagaa tgcttacggg tgttattttt 2341 ataagttgtt gaattcctta atttagaaaa gctcattatt ttttgcacac tcacaatatt 2401 ctctctcaga aatcaatggc atttgaacca ccaaaaagaa ataaagggct gagtgtggtg 2461 ctcacgcctg taatcccagc actttgggga gcccaggcgg gcagattgct tgaacccagg 2521 agttcaagac cagcctgggc agcatggtga aaccctgtat ctacaaaaaa tacaaaaatt 2581 agccaggcat ggtggtgggt gcctgtagtt ccagctactt gggaggctga ggtgggaaaa 2641 tgacttgagc ccaggaggag gaggctgcag tgagctaaga ttgcaccact gcactccaac 2701 ctgggtgaca agagtgaaac tgaat // LOCUS HSU36188 1828 bp mRNA PRI 02-APR-1996 DEFINITION Human clathrin assembly protein 50 (AP50) mRNA, complete cds. ACCESSION U36188 NID g1244507 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1828) AUTHORS Tsui,S.K.W., Waye,M.M.Y., Liew,C.C., Fung,K. and Lee,C.Y. TITLE Molecular cloning and sequence analysis of the cDNA for human 50 kDa subunit of the clathrin assembly complex AP-2 (AP50) JOURNAL Unpublished (1995) REFERENCE 2 (bases 1 to 1828) AUTHORS Waye,M.M.Y. TITLE Direct Submission JOURNAL Submitted (12-SEP-1995) M.M.Y. Waye, The Chinese University of Hong Kong, Biochemistry Department, Shatin, N.T., Hong Kong FEATURES Location/Qualifiers source 1..1828 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="heart" /dev_stage="adult" gene 47..1354 /gene="AP50" CDS 47..1354 /gene="AP50" /note="50 kDa subunit of the clathrin assembly complex AP-2" /codon_start=1 /product="assembly protein 50" /db_xref="PID:g1244508" /translation="MIGGLFIYNHKGEVLISRVYRDDIGRNAVDAFRVNVIHARQQVR SPVTNIARTSFFHVKRSNIWLAAVTKQNVNAAMVFEFLYKMCDVMAAYFGKISEENIK NNFLLIYELLDEILDFGYPQNSETGALKTFITQQGIKSQHQTKEEQSQITSQVTGQIG WRREGIKYRRNELFLDVLESVNLLMSPQGQVLSAHVSGRVVMKSYLSGMPECKFGMND KIVIEKQGKGTADETSKSGKQSIAIDDCTFHQCVRLSKFDSERSISFIPPDGEFELMR YRTTKDIILPFRVIPLVREVGRTKLEVKVVIKSNFKPSLLAQKIEVRIPTPLNTSGVQ VICMKGKAKYKASENAIVWKIKRMAGMKESQISAEIELLPTNDKKKWARPPISMNFEV PFAPSGLKVRYLKVFEPKLNYSDHDVIKWVRYIGRSGIYETRC" BASE COUNT 434 a 504 c 492 g 398 t ORIGIN 1 caggtctgtt ctcagagcga tgggccgcag agactgatct gccgccatga ttggaggctt 61 attcatctat aatcacaagg gggaggtgct catctcccga gtctaccgag atgacatcgg 121 gaggaacgca gtggatgcct ttcgggtcaa tgttatccat gcccggcagc aggtgcgcag 181 ccccgtcacc aacattgctc gcaccagctt cttccacgtt aagcggtcca acatttggct 241 ggcagcagtc accaagcaga atgtcaacgc tgccatggtc ttcgaattcc tctataagat 301 gtgtgacgtg atggccgctt actttggcaa gatcagcgag gaaaacatca agaacaattt 361 tttgctcata tatgagctgc tggatgagat tctagacttt ggctacccac agaattccga 421 gacaggcgcg ctgaaaacct tcatcacgca gcagggcatc aagagtcagc atcagacaaa 481 agaagagcag tcacagatca ccagccaggt aactgggcag attggctggc ggcgagaggg 541 catcaagtat cgtcggaatg agctcttcct ggatgtgctg gagagtgtga acctgctcat 601 gtccccacaa gggcaggtgc tgagtgccca tgtgtcgggc cgggtggtga tgaagagcta 661 cctgagtggc atgcctgaat gcaagtttgg gatgaatgac aagattgtta ttgaaaagca 721 gggcaaaggc acagctgatg aaacaagcaa gagcgggaag caatcaattg ccattgatga 781 ctgcaccttc caccagtgtg tgcgactcag caagtttgac tctgaacgca gcatcagctt 841 tatcccgcca gatggagagt ttgagcttat gaggtatcgc acaaccaagg acatcatcct 901 tcccttccgg gtgatcccgc tagtgcgaga agtgggacgc accaaactgg aggtcaaggt 961 ggtcatcaag tccaacttta aaccctcact gctggctcag aagattgagg tgaggatccc 1021 aaccccactg aacacaagcg gggtgcaggt gatctgcatg aaggggaagg ccaagtacaa 1081 ggccagcgag aatgccatcg tgtggaagat caagcgcatg gcaggcatga aggaatcgca 1141 gatcagcgca gagattgagc ttctgcctac caacgacaag aagaaatggg ctcgaccccc 1201 catttccatg aactttgagg tgccattcgc gccctctggc ctcaaggtgc gctacttgaa 1261 ggtgtttgaa ccgaagctga actacagcga ccatgatgtc atcaaatggg tgcgctacat 1321 tggccgcagt ggcatttatg aaactcgctg ctagctgcca ctaggcagct agcccacctc 1381 cccagccacc ctcctccaca ggtccaggtg ccgctccctc ccccaccaca catcagtgtc 1441 tcctccctcc tgctttgctg ccttcccttt gcaccagccc gagtctaggt ctgggccaag 1501 cacattacaa gtgggaccgg tggagcagcc cctgggctcc ctgggcaggg gagttctgag 1561 gctcctgctc tcccatccac ctgtctgtcc tggcctaatg ccaggctctg agttctgtga 1621 ccaaagccag gtgggttccc tttccttccc acccctgtgg ccacagctct ggagtgggag 1681 ggttggttgc ccctcacctc agagctcccc caaaggccag taatggatcc ccggcctcag 1741 tccctactct gctttgggat agtgtgagct tcattttgta cacgtgttgc ttcgtccagt 1801 tacaaaccca ataaactctg tagagtgg // LOCUS HSU36190 815 bp mRNA PRI 01-JUL-1996 DEFINITION Human cysteine-rich protein 2 (hCRP2) mRNA, complete cds. ACCESSION U36190 NID g1399027 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 815) AUTHORS Tsui,S.K.W., Chan,P.P.K., Cheuk,C.W., Waye,M.M.Y., Fung,K.P. and Lee,C.Y. TITLE Molecular cloning, sequencing, expression and chromosomal mapping of the human cysteine-rich protein 2 (hCRP2) JOURNAL Unpublished (1995) REFERENCE 2 (bases 1 to 815) AUTHORS Waye,M.M.Y. TITLE Direct Submission JOURNAL Submitted (12-SEP-1995) M.M.Y. Waye, The Chinese University of Hong Kong, Department of Biochemistry, Shatin, N.T., Hong Kong FEATURES Location/Qualifiers source 1..815 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="heart" /dev_stage="fetal" gene 37..663 /gene="hCRP2" CDS 37..663 /gene="hCRP2" /codon_start=1 /evidence=experimental /product="cysteine-rich protein 2" /db_xref="PID:g1399028" /translation="MASKCPKCDKTVYFAEKVSSLGKDWHKFCLKCERCSKTLTPGGH AEHDGKPFCHKPCYATLFGPKGVNIGGAGSYIYEKPLAEGPQVTGPIEVPAARAEERK ASGPPKGPSRASSVTTFTGEPNTCPRCSKKVYFAEKVTSLGKDWHRPCLRCERCGKTL TPGGHAEHDGQPYCHKPCYGILFGPKGVNTGAVGSYIYDRDPEGKVQP" BASE COUNT 158 a 277 c 277 g 103 t ORIGIN 1 cgggcggagg gcgcgggccg accgggcgca ccgaccatgg cctccaaatg ccccaagtgc 61 gacaagaccg tgtacttcgc cgagaaggtg agctccctgg ggaaggactg gcacaagttc 121 tgcctcaagt gcgagcgctg cagcaagacg ctgacgcccg ggggccacgc cgagcatgac 181 gggaagccgt tctgccacaa gccgtgctac gccaccctgt tcggacccaa aggcgtgaac 241 atcgggggcg cgggctccta catctacgag aagcccctgg cggaggggcc gcaggtcacc 301 ggccccatcg aggtccccgc ggcccgagca gaggagcgga aggcgagcgg ccccccgaag 361 gggcccagca gagcctccag tgtcaccact ttcaccgggg agcccaacac gtgcccgcgc 421 tgcagcaaga aggtgtactt cgctgagaag gtgacgtctc tgggcaagga ttggcaccgg 481 ccctgcctgc gctgcgagcg ctgcgggaag acactgaccc ccggcgggca cgcggagcac 541 gacggccagc cctactgcca caagccctgc tatggaatcc tcttcggacc caagggagtg 601 aacaccggtg cggtgggcag ctacatctat gaccgggacc ccgaaggcaa ggtccagccc 661 taggctacag cggctctcat gatgtgggct cacctgcgcc ccagaccctg cagggccccc 721 ctgcttggct ctgctgggag agtgctagcc gcccagtcct gcctgcaagc ccaggggcga 781 gtattggagg aggggcagcc acgggcagag cacca // LOCUS HSU36221 2129 bp mRNA PRI 21-NOV-1996 DEFINITION Human pancreatic zymogen granule membrane protein GP-2 mRNA, complete cds. ACCESSION U36221 NID g1244511 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2129) AUTHORS Wong,S.M. and Lowe,A.W. TITLE Sequence of the cDNA encoding human GP-2, the major membrane protein in the secretory granule of the exocrine pancreas JOURNAL Gene 171 (2), 311-312 (1996) MEDLINE 96257244 REFERENCE 2 (bases 1 to 2129) AUTHORS Wong,S.M.E. and Lowe,A.W. TITLE Direct Submission JOURNAL Submitted (12-SEP-1995) Shirley M.E. Wong, Medicine, Stanford University, MSLS Building, Rm. P-308, Stanford, CA 94305-5487, USA FEATURES Location/Qualifiers source 1..2129 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 72..1664 /codon_start=1 /product="pancreatic zymogen granule membrane protein GP-2" /db_xref="PID:g1244512" /translation="MERMVGSGLLWLALVSCILTQASAVQRGYGNPIEASSYGLDLDC GAPGTPEAHVCFDPCQNYTLLDEPFRSTENSAGSQGCDKNMSGWYRFVGEGGVRMSET CVQVHRCQTDAPMWLNGTHPALGDGITNHTACAHWSGNCCFWKTEVLVKACPGGYHVY RLEGTPWCNLRYCTDPSTVEDKCEKACRPEEECLALNSTWGCFCRQDLNSSDVHSLQP QLDCGPREIKVKVDKCLLGGLGLGEEVIAYLRDPNCSSILQTEERNWVSVTSPVQASA CRNILERNQTHAIYKNTLSLVNDFIIRDTILNINFQCAYPLDMKVSLQAALQPIVSSL NVSVDGNGEFIVRMALFQDQNYTNPYEGDAVELSVESVLYVGAILEQGDTSRFNLVLR NCYATPTEDKADLVKYFIIRNSCSNQRDSTIHVEENGQSSESRFSVQMFMFAGHYDLV FLHCEIHLCDSLNEQCQPSCSRSQVRSEVPAIDLARVLDLGPITRRGAQSPGVMNGTP STAGFLVAWPMVLLTVLLAWLF" BASE COUNT 493 a 556 c 592 g 488 t ORIGIN 1 ggcgctttgt gttctcatcg gagctgcatg ggaagtctgc gtacagcaaa gtgacctgca 61 tgcctcacct tatggaaagg atggtgggct ctggcctcct gtggctggcc ttggtctcct 121 gcattctgac ccaggcatct gcagtgcagc gaggttatgg aaaccccatt gaagccagtt 181 cgtatgggct ggacctggac tgcggagctc ctggcacccc agaggctcat gtctgttttg 241 acccctgtca gaattacacc ctcctggatg aacccttccg aagcacagag aactcagcag 301 ggtcccaggg gtgcgataaa aacatgagcg gctggtaccg ctttgtaggg gaaggaggag 361 taaggatgtc ggagacctgt gtccaggtgc accgatgcca gacagacgct cccatgtggc 421 tgaatgggac ccaccctgcc cttggggatg gcatcaccaa ccacactgcc tgtgcccatt 481 ggagtggcaa ctgctgtttc tggaaaacag aggtgctggt gaaggcctgc ccaggcgggt 541 accatgtgta ccggttggaa ggcactccct ggtgtaatct gagatactgc acagacccat 601 ccactgtgga ggacaagtgt gagaaggcct gccgccccga ggaggagtgc cttgccctca 661 acagcacctg gggctgtttc tgcagacagg acctcaatag ttctgatgtc cacagtttgc 721 agcctcagct agactgtggg cccagggaga tcaaggtgaa ggtggacaaa tgtttgctgg 781 gaggcctggg tttgggggag gaggtcattg cctacctgcg agacccaaac tgcagcagca 841 tcttgcagac agaggagagg aactgggtat ctgtgaccag ccccgtccag gctagtgcct 901 gcaggaacat tctggagaga aatcaaaccc atgccatcta caaaaacacc ctctccttgg 961 tcaatgattt catcatcaga gacaccatcc tcaacatcaa cttccaatgt gcctacccac 1021 tggacatgaa agtcagcctc caagctgcct tgcagcccat tgtaagttcc ctgaacgtca 1081 gtgtggacgg gaatggagag ttcattgtca ggatggccct cttccaagac cagaactaca 1141 cgaatcctta cgaaggggat gcagttgaac tgtctgttga gtccgtgctg tatgtgggtg 1201 ccatcttgga acaaggggac acctcccggt ttaacctggt gttgaggaac tgctatgcca 1261 cccccactga agacaaggct gaccttgtga agtatttcat catcagaaac agctgctcaa 1321 atcagcgtga ttccaccatc cacgtggagg agaatgggca gtcctcggaa agccggttct 1381 cagttcagat gttcatgttt gctggacatt atgacctagt tttcctgcat tgtgagattc 1441 atctctgtga ttctcttaat gaacagtgcc agccttcttg ctcaagaagt caagtccgca 1501 gtgaagtacc ggccatcgac ctagcccggg ttctagattt ggggcccatc actcggagag 1561 gtgcacagtc tcccggtgtc atgaatggaa cccctagcac tgcagggttc ctggtggcct 1621 ggcctatggt cctcctgact gtcctcctgg cttggctgtt ctgagagctc cgctgagcat 1681 ctggccttga agtttgtgtt cttccctctg gcaatggctc ccttcagcac ttctgctttc 1741 cactccaatt cacacaggct tggtattaac agaatcaagg ccaggctagg ttaggaaaag 1801 ggaagagctt tcaccttctt tggggctctc ggctgggcgc agtggctcat gcctgtaatc 1861 ccagcatttt gggaggctga ggcaggtgga tcacctgagg tcagcagttc aaaatcagcc 1921 tggccaaaat gctgaaactc cgtctctact aaaaatacaa aaattagcca ggcatggtgg 1981 caggcgcctg taatcccagc tactcgggag gccaaggcag gagaattgct cgaactcagg 2041 gggtggaggt tgcagtgagt tgagattgtg ccattgcact ccagcctggg caacagagca 2101 agactctgtc tcaggaaaaa aaaaaaaaa // LOCUS HSU36336 4006 bp mRNA PRI 02-MAR-1996 DEFINITION Human lysosome-associated membrane protein-2b (LAMP2) mRNA, alternatively spliced form h-lamp-2b, complete cds. ACCESSION U36336 NID g1209628 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4006) AUTHORS Konecki,D.S., Foetisch,K., Zimmer,K.-P., Schlotter,M. and Lichter-Konecki,U. TITLE An alternatively spliced form of the human lysosome-associated membrane protein-2 gene is expressed in a tissue specific manner JOURNAL Unpublished REFERENCE 2 (bases 1 to 4006) AUTHORS Konecki,D.S., Foetisch,K., Zimmer,K.-P., Schlotter,M. and Lichter-Konecki,U. TITLE Direct Submission JOURNAL Submitted (14-SEP-1995) D.S. Konecki, Marshfield Medical Research Foundation, Center for Medical Genetics, 2R7, 1000 North Oak Avenue, Marshfield, WI 54449, USA FEATURES Location/Qualifiers source 1..4006 /organism="Homo sapiens" /note="patient SD3 (4 year old male infant) liver cDNA library-lambda1956 and human phaeochromocytoma cDNA library-lambda496" /db_xref="taxon:9606" /map="Xq24-25" /chromosome="X" /clone="lambda1956-19, lambda1956-36, lambda496-58b" gene 138..1370 /gene="LAMP2" CDS 138..1370 /gene="LAMP2" /note="h-lamp-2b; integral lysosomal membrane protein" /codon_start=1 /function="postulated to help protect lysosomal membrane integrity; possible adhesion molecule bound by galectin3" /product="lysosome-associated membrane protein-2b" /db_xref="PID:g1209629" /translation="MVCFRLFPVPGSGLVLVCLVLGAVRSYALELNLTDSENATCLYA KWQMNFTVRYETTNKTYKTVTISDHGTVTYNGSICGDDQNGPKIAVQFGPGFSWIANF TKAASTYSIDSVSFSYNTGDNTTFPDAEDKGILTVDELLAIRIPLNDLFRCNSLSTLE KNDVVQHYWDVLVQAFVQNGTVSTNEFLCDKDKTSTVAPTIHTTVPSPTTTPTPKEKP EAGTYSVNNGNDTCLLATMGLQLNITQDKVASVININPNTTHSTGSCRSHTALLRLNS STIKYLDFVFAVKNENRFYLKEVNISMYLVNGSVFSIANNNLSYWDAPLGSSYMCNKE QTVSVSGAFQINTFDLRVQPFNVTQGKYSTAQECSLDDDTILIPIIVGAGLSGLIIVI VIAYVIGRRKSYAGYQTL" exon 1231..1370 /gene="LAMP2" /note="h-lamp-2b" /number=9 polyA_signal 1932..1937 polyA_signal 3668..3673 polyA_signal 3994..3999 BASE COUNT 1156 a 742 c 772 g 1336 t ORIGIN 1 ccgattcctg gcttttgcaa ggctgtggtc ggtggtcatc agtgctcttg acccaggtcc 61 agcgagcctt ttccctggtg ttgcagctgt tgttgtaccg ccgccgtcgc cgccgtcgcc 121 gcctgctctg cggggtcatg gtgtgcttcc gcctcttccc ggttccgggc tcagggctcg 181 ttctggtctg cctagtcctg ggagctgtgc ggtcttatgc attggaactt aatttgacag 241 attcagaaaa tgccacttgc ctttatgcaa aatggcagat gaatttcaca gtacgctatg 301 aaactacaaa taaaacttat aaaactgtaa ccatttcaga ccatggcact gtgacatata 361 atggaagcat ttgtggggat gatcagaatg gtcccaaaat agcagtgcag ttcggacctg 421 gcttttcctg gattgcgaat tttaccaagg cagcatctac ttattcaatt gacagcgtct 481 cattttccta caacactggt gataacacaa catttcctga tgctgaagat aaaggaattc 541 ttactgttga tgaacttttg gccatcagaa ttccattgaa tgaccttttt agatgcaata 601 gtttatcaac tttggaaaag aatgatgttg tccaacacta ctgggatgtt cttgtacaag 661 cttttgtcca aaatggcaca gtgagcacaa atgagttcct gtgtgataaa gacaaaactt 721 caacagtggc acccaccata cacaccactg tgccatctcc tactacaaca cctactccaa 781 aggaaaaacc agaagctgga acctattcag ttaataatgg caatgatact tgtctgctgg 841 ctaccatggg gctgcagctg aacatcactc aggataaggt tgcttcagtt attaacatca 901 accccaatac aactcactcc acaggcagct gccgttctca cactgctcta cttagactca 961 atagcagcac cattaagtat ctagactttg tctttgctgt gaaaaatgaa aaccgatttt 1021 atctgaagga agtgaacatc agcatgtatt tggttaatgg ctccgttttc agcattgcaa 1081 ataacaatct cagctactgg gatgcccccc tgggaagttc ttatatgtgc aacaaagagc 1141 agactgtttc agtgtctgga gcatttcaga taaatacctt tgatctaagg gttcagcctt 1201 tcaatgtgac acaaggaaag tattctacag cccaagagtg ttcgctggat gatgacacca 1261 ttctaatccc aattatagtt ggtgctggtc tttcaggctt gattatcgtt atagtgattg 1321 cttacgtaat tggcagaaga aaaagttatg ctggatatca gactctgtaa cactaatcaa 1381 tacgtgatct ctgttacaaa agaaaaaagc aagtacaagt tccaacatgc aatactggtc 1441 aacttaaggt atatttagtt gcagtccagc tctttagaat gggtggtatg ggggatttca 1501 aacttaaaca aaaaactatc aactacaaat tagttgcctg actttggttt ttccaaccaa 1561 ggaatttaaa actgttattt ttacagcaaa agatgtgcaa aatcactgga ttataagttc 1621 tattttactg tcttgaatta gtatttcagt gttttcattt tagacattca gactaaaaat 1681 acaccgttta gaaaaaacaa tttttgaaaa agagattttt tttccctgca ggtagttgag 1741 ttgaacaaca tgttctaccg tggatttgta cttgctcctt ttgctctttt tgtgtgtgtg 1801 tgtgtgtgtg tgtgtgtgtg tgtgattttt gtttgcaggt taacttagct actttggcat 1861 tgctgcatat ttgacctttg agagatataa tagtagattt gaacaggggc tggtattatt 1921 atgttcttag caataaatgc ttttctaatg ccttttgaat acatttgtat ttatgtggct 1981 gtaatgacaa aagatacaaa agctttttaa aatttagagt aggtattaat cttattgttt 2041 aatctttttt ttaaaaaaac tggatatttc aatcttttaa attgcaatat ataagactat 2101 tccaactggg catttcaatc cattttttag gtgctttaga gataattgct tgccagtgcc 2161 aattgagggc attagtactt tgtgctcata aattggcctc tgtatgcagt actaaaatta 2221 atgcagattt ctctttagcc ttccaacatt tcttgttgat agtgatgtat tttattattt 2281 tctttttctt aagaaatgcc agtgtgtcct agaacctaga taacgaagtg cacttacact 2341 tataaaataa cttgcatcta ggctgggcgt ggcggctcac gcctgtaatc ccagcacttt 2401 gggaggccga agtgggtgga tcacttgagg ccaggagttt gagaccagcc tggccaacat 2461 ggtgaaaccc catctctatc agaaatacaa aaaattagct gggcatggtg gtgggcgcct 2521 gtaatcccag ttactcggga ggctgaggca ggagaatcac ttgaacccgg gaggcagagg 2581 ttgcggtgag ccaagagcgc accattgcac tccagccttg ggcgacaaaa acgaaactcc 2641 atcttcaaaa caaaacaaaa caaaacaaac aaacaaacaa aacttgcatc ttaaccaaaa 2701 gtcttggttt tatcttaatc cattaaaagt tggtcttgtt tccagcttgc attgattgct 2761 acaacatcac taatttggct ttcacattta aatggttctg tgctaatcaa aactttcgtt 2821 gttattattc gttatggtag aatcattttt aattcacgtg ctttgtgttc agttttgtgg 2881 tctgagagat gtaccaattg tcaaattacc gtgtaccacc taatgtttat aggagaaagc 2941 aaaatacatc agcttggtag ttaacacatc aaatatttct tgctgcttct aggagaactt 3001 ttttggtgtg tgttggaatg gctgagcaaa tattaaaatt gttaatatgc agccatatat 3061 ggaaggttcc tgtggggttg ttttttcgtg tttttttttt ttgtggtggg attatgtgcc 3121 tcccattcac tagaaaatga gaaaattgtc tgggttccaa aatattgaca ttgaatggat 3181 caatacacac acacagacat atatatatat atatgcacac atatataggc agttgcatgc 3241 ctagcatggg tattttataa ccatataact gagttatatt ggaattataa atattttccg 3301 tcacttaaat ttgttctttg tttagcctga aaacctttat ggctcaagat cagattcctg 3361 actaacccct ctcttagagc tacagcgagc tgcattacca gcttaaaaca cttcttaggg 3421 attaaatata gatgtaattt ttcaaaatcg tttttaattt aaactgtgtt ttagtgtaaa 3481 attgttaacc ttgtaagatg gataatgtgt ataagaatgt aggccttaac tatttcacat 3541 gagtcaaaac aaagcagctt taaaaaaata attggaagca caatgcatgg cactgactga 3601 atgctgttaa tatttctaaa agtttctaca ttcagattat atgcctgatt catagtaaaa 3661 tacctctaat aaacactgtt ttatagaaaa cctgacttca gtgaatattt ttgtatttta 3721 catgggccag tttatatact gctatttaca ctattatttc ctatagctac atgttctttg 3781 taccttttgt agttttattt gtattactag attcatacct tgatggtaac gctctatctg 3841 gttttgggtg tttttcatgt tttagcattt gtataaagaa actggtccat gtaaatactt 3901 tccatgtttt ttcttcaaat gtttaaacca ctagttgatg tatggtatct ttagatattt 3961 gcctgtctgt ttgctcaaaa ttgcttctaa aacaataaag attctt // LOCUS HSU36448 1817 bp mRNA PRI 19-OCT-1995 DEFINITION Human Ca2+-dependent activator protein for secretion mRNA, complete cds. ACCESSION U36448 NID g1022781 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1817) AUTHORS Walent,J.H., Porter,B.W. and Martin,T.F. TITLE A novel 145 kd brain cytosolic protein reconstitutes Ca(2+)-regulated secretion in permeable neuroendocrine cells JOURNAL Cell 70 (5), 765-775 (1992) MEDLINE 92386596 REFERENCE 2 (bases 1 to 1817) AUTHORS Ann,K., Yom,H-C., Kowalchyk,J.A., Porter,B.W. and Martin,T.F.J. TITLE A novel Ca2+-regulated cytoskeletal protein (CAPS) required for Ca2+-activated secretion JOURNAL Unpublished REFERENCE 3 (bases 1 to 1817) AUTHORS Martin,T.F.J. TITLE Direct Submission JOURNAL Submitted (15-SEP-1995) Thomas F.J. Martin, Biochemistry, University of Wisconsin, 420 Henry Mall, Madison, WI 53706, USA FEATURES Location/Qualifiers source 1..1817 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="IB227; constructed by Bento Soares, Columbia University" /dev_stage="infant" /tissue_type="brain" /chromosome="3" CDS 103..774 /note="CAPS" /codon_start=1 /product="Ca2+-dependent activator protein for secretion" /db_xref="PID:g1022782" /translation="MFNVMVDAKAQSTKLCSMEMGQEFAKMWHQYHSKIDELIEETVK EMITLLVAKFVTILEGVLAKLYRYDEGTLFSSFLSFTVKAASKYVDVPKPGMDVPDAY VTFGRHSQDVLRDKVNEEMYIERLFDQWYNSSMNVICTWLTDRMDLQLHIYQLKTLIR MVKKTYRDFRLQGGPGLHLKQARPDEPIRNRLTVEEATASVSEGGGLQGISMKDSDEE DEEDD" BASE COUNT 559 a 353 c 366 g 539 t ORIGIN 1 aagcttggca cgagggtcaa aagaaccagg attgcatttg aagttaagct gcaaaaaacc 61 agtcgatcaa cagattttcg agtcccacag tcaatatgca ccatgtttaa tgttatggtt 121 gatgccaaag ctcaatcaac aaaactttgc agcatggaaa tgggccaaga gtttgctaaa 181 atgtggcatc aataccattc aaaaatagac gaactaattg aagaaactgt taaagaaatg 241 ataacactct tggttgcaaa gttcgttact atcttggaag gagtactggc aaaattatac 301 agatatgacg aagggacttt gttttcttct tttctgtcat ttaccgtgaa ggcagcttcc 361 aaatatgtgg atgtacctaa acccgggatg gacgtgcccg acgcctacgt gactttcggc 421 cgccattctc aggatgtcct gcgtgataag gtcaatgagg agatgtacat agaaaggtta 481 tttgatcaat ggtacaacag ctccatgaac gtgatctgca cctggttgac ggaccggatg 541 gacttacagc ttcatattta tcagttgaaa acactaatta ggatggtaaa gaaaacctac 601 agagatttcc gattgcaagg gggccctgga ctccacctta aacaggcaag acctgatgaa 661 ccgatccgga accgtctcac tgtggaggaa gccacagcat cagtgagtga aggtgggggg 721 ctgcagggca tcagcatgaa ggacagcgat gaggaagacg aagaagacga ttagaccatt 781 tggtcctaga gtctgctgga cagagtcctg taatcagtgc atgtccttag tctgttagtt 841 aaacccatta ccaatttctg tcaactacca tgagatgtta tcaatacaac tgccatttta 901 gctatgtggt accaagatta gcaaatgacc ttcatatcca ctgatttcct gatgtccatg 961 tctatatgtt tacaagcaat atggagcacc attctttaaa tactgttcat ggagaataca 1021 tagtctaacc actaggcgtg tccctgttat cagcaaagat caatgatgct tcattcatgt 1081 actatgtatg cattggtggt aaatggatgt gagggcaagt acatcaagta cattcactct 1141 gtttcacgta tgtggatgcc agttaattaa atgagtacgt aaataaatta attaaaacac 1201 atagatctgc tttgtgtttt tatttttatt ttttgaaaaa caaaaggcaa gtctccaaca 1261 attaactttt gatgcttctg ttcccctaaa accaaaaaat gaaccccttg tgtcgttgtt 1321 aacccatcct tcattttact catattaata gccaaaaaaa aaattatggc tactaccaat 1381 ggattgattc tcttaattgc cacggcaagg ggggcgacct atagacttac atcaagcgcg 1441 ccaggttcaa acctaccggc ttcctgtcaa gttgtcctct aaatgttatt ttgcttttac 1501 gtctcaactg tgtatgtaaa aaaaacgaat atttaaatta caaccctaga ctaaaaatgt 1561 gtttataata agatgtggat attccttcag tagattgtaa ccataattta aattattttg 1621 gttccacact gtttttatat ctgtcatgta cattgcattt tgatctgtaa ctgcacaacc 1681 ctggggtttg ctgcagagct atttctttcc atgtaaagta gtggatccat cttgcttttg 1741 ccttatataa agcctacagt tatggaagtg tggaaaactg tggcttctca ataaatattc 1801 agatgtccta agaatat // LOCUS HSU36499 2979 bp mRNA PRI 07-NOV-1996 DEFINITION Human lymphoid-specific SP100 homolog (LYSP100-A) mRNA, complete cds. ACCESSION U36499 NID g1173651 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2979) AUTHORS Dent,A.L., Yewdell,J., Puvion-Dutilleul,F., Koken,M.H., de The,H. and Staudt,L.M. TITLE LYSP100-associated nuclear domains (LANDs): description of a new class of subnuclear structures and their relationship to PML nuclear bodies JOURNAL Blood 88 (4), 1423-1426 (1996) MEDLINE 96329578 REFERENCE 2 (bases 1 to 2979) AUTHORS Dent,A. and Staudt,L.M. TITLE Direct Submission JOURNAL Submitted (18-SEP-1995) Alex Dent, Metabolism Branch, National Cancer Institute, 9000 Rockville Pike, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..2979 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Raji Burkitt's lymphoma" gene 117..1355 /gene="LYSP100-A" CDS 117..1355 /gene="LYSP100-A" /note="Lymphoid-specific SP100 homolog" /codon_start=1 /product="LYSP100-A" /db_xref="PID:g1173652" /translation="MAQQGQQGQMASGDSNLNFRMVAEIQNVEGQNLQEQVCPEPIFR FFRENKVEIASAITRPFPFLMGLRDRSFISEQMYEHFQEAFRNLVPVTRVMYCVLSEL EKTFGWSHLEALFSRINLMAYPDLNEIYRSFQNVCYEHSPLQMNNVNDLEDRPRLLPY GKQENSNACHEMDDIAVPQEALSSSARCEPGFSSESCEQLALPKAGGGDAEDAPSLLP GGGVSCKLAIQIDEGESEEMPKLLPYDTEVLESNGMIDAARTYSTAPGEKQGEEEGRN SPRKRNQDKEKYQESPEGRDKETFDLKTPQVTNEGEPEKGLCLLPGEGEEGSDDCSEM CDGEEPQEASSSLARCGSEGSDDCSEMCDGEERQEAQKQGRKVIKRVAQWILWILQTT PLWENPRGKEEKRGGMAGAE" BASE COUNT 912 a 620 c 794 g 653 t ORIGIN 1 tcttttcctt ttcctctttg actgagcacc gaggggcagt tggcagcttc acctcagagc 61 tgcaggaagg aacggggcag tgaaaatcga atcgggtgtg atcctaggcc aagctcatgg 121 cccagcaggg ccagcagggg cagatggcaa gtggagacag caatctcaac ttcaggatgg 181 tcgcagagat ccagaacgta gagggtcaga acctgcagga gcaggtttgc cctgagccca 241 ttttcaggtt cttcagagaa aacaaggtgg agattgcaag tgcaataaca aggccatttc 301 ctttccttat gggcctccga gaccgctcct tcatctccga gcagatgtat gaacattttc 361 aagaagcttt tagaaacctg gtcccagtga caagagtgat gtattgtgta ctcagtgaac 421 tggagaagac atttggctgg tcacatctgg aagcattgtt cagcaggatt aacctgatgg 481 cctatcctga tttaaacgag atttacagaa gcttccagaa tgtatgctat gaacactcac 541 ctctccaaat gaataatgta aacgatttag aagatagacc cagattacta ccatatggta 601 aacaagagaa cagcaatgcc tgtcatgaaa tggatgatat agcagtgcct caggaagcct 661 tgagctcctc ggcaaggtgt gagccaggtt tctcttcaga gtcttgtgag cagttagctc 721 tcccaaaggc tggtggagga gatgctgaag atgcacccag cctactacca ggtgggggag 781 tgtcctgtaa acttgctata caaatagatg aaggagaatc agaagaaatg cccaagttac 841 tgccttatga tacagaagtt ctagaaagca acgggatgat agatgcggca aggacataca 901 gcacagcacc aggggagaaa cagggagagg aggaaggcag gaacagtccc agaaaaagaa 961 accaagacaa ggagaagtac caagagagtc cagagggaag agacaaagag acctttgatc 1021 taaaaactcc ccaagtcact aatgaaggag aaccagagaa ggggctctgt ctactaccag 1081 gtgaaggaga agagggcagt gatgactgtt cagaaatgtg tgatggagaa gagccccagg 1141 aagcctctag ctccctagca agatgtgggt cagagggcag tgatgactgt tcggaaatgt 1201 gtgatggaga agagcgccag gaagcccaga agcaaggacg gaaagtgatc aagcgtgtgg 1261 cacaatggat actgtggata ttgcaaacaa ctccactttg ggaaaaccca agaggaaaag 1321 aagaaaaaag agggggcatg gctggagcag aatgagaatg agaaggcaga aaaacagcca 1381 acaaaatgat aatagcaaag ccgacggcca ggtggtctcc agtgaaaaga aggcgaacgt 1441 gaatctgaaa gacctttcca agattagggg gagaaagaga ggcaaacctg gaacccgctt 1501 cactcagagt gacagagctg cacagaaaag agtccgatca agagcttcaa gaaagcacaa 1561 agatgaaact gtggatttta aggctccttt gcttccagtg acctgtggtg gggtgaaggg 1621 aattttacat aagaagaaat tgcagcaagg aatcttggtg aagtgtatac agactgagga 1681 tggaaaatgg ttcaccccca cggaatttga aatcaaagga ggccatgcaa gatcaaagaa 1741 ctggaggctg agtgtgcgct gtggcgggtg gcccctacga tggctgatgg agaatggatt 1801 tctgcctgat cctccaagaa tacgttacag gaaaaaaaag agaatactga agtctcaaaa 1861 caatagctca gttgaccctt gtatgagaaa cctggatgag tgtgaggtgt gccgggacgg 1921 aggggagctg ttctgttgcg acacttgttc aagagtcttc catgaggact gtcacatccc 1981 gcctgtggaa gctgagagga ccccgtggaa ttgcatcttc tgcaggatga aggagtctcc 2041 gggaagccaa cagtgttgtc aggaatctga ggtcctggag aggcagatgt gtcctgagga 2101 acagttgaaa tgtgagttcc tcctcttgaa agtctattgc tgttctgaga gctccttttt 2161 tgccaagatt ccatactatt attatattag agaggcgtgt caaggcctga aggagcccat 2221 gtggttggat aaaatcaaga aaaggctgaa tgagcacggt tacccccaag tggaggggtt 2281 tgtacaagac atgcgcctca tcttccagaa ccacagggcc tcttacaagt acaagggttt 2341 tggccaaatg ggatttagac tggaggctga gtttgagaag aatttcaagg aagtgtttgc 2401 tattcaggga acaaatgggg acaattgact ggattagtgg atgctgaaag cattcagcaa 2461 atggcaccct aaaatatgcc gctggtttgc cactgacttc aaaatgaggt cacttgggca 2521 cagcacatgc agggaggggc ttttctctga gcctccttca tctgcccaaa gacaaatcct 2581 caaaaggaaa ttcaatcatc atgaatcaca accccaagta tctcatcagc cagggaagag 2641 taagtgggat cacagggaag gatgttggca gcgacaccat cccatacagg ctcttacctc 2701 ttctcctgag ggctgctcca gacaacattt attacccaga agaccttttg tctgaaaacc 2761 agccaagctt tattcaggac acacttcttg ccttcacttt cccacttccg tggccacctc 2821 catgcagaag ccctaagccc acattctttc gatagctcac ggtggtgcat gagtgtccat 2881 catctgactc ttctcggagt ctcatatttt gtgggactcc tgtgcaaaca tatgttatta 2941 aaattttttt cctcctgtta aaaaaaaaaa aaaaaaaaa // LOCUS HSU36759 1080 bp mRNA PRI 15-AUG-1996 DEFINITION Human pre-T cell receptor alpha-type chain precursor, mRNA, complete cds. ACCESSION U36759 NID g1127580 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1080) AUTHORS Del Porto,P., Bruno,L., Mattei,M.G., von Boehmer,H. and Saint-Ruf,C. TITLE Cloning and comparative analysis of the human pre-T-cell receptor alpha-chain gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (26), 12105-12109 (1995) MEDLINE 96109214 REFERENCE 2 (bases 1 to 1080) AUTHORS Saint-Ruf,C. TITLE Direct Submission JOURNAL Submitted (20-SEP-1995) Claude Saint-Ruf, INSERM 373, Institut Necker, 156 rue Vaugirard, Paris, 75015, France FEATURES Location/Qualifiers source 1..1080 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="thymus" /clone_lib="human thymus cDNA library Clontech #HL1127A" /map="6p21.2-6p12" /chromosome="6" 5'UTR 1..81 sig_peptide 82..129 CDS 82..927 /codon_start=1 /product="pre-T cell receptor alpha-type chain precursor" /db_xref="PID:g1127581" /translation="MAGTWLLLLLALGCPALPTGVGGTPFPSLAPPIMLLVDGKQQMV VVCLVLDVAPPGLDSPIWFSAGNGSALDAFTYGPSPATDGTWTNLAHLSLPSEELASW EPLVCHTGPGAEGHSRSTQPMHLSGEASTARTCPQEPLRGTPGGALWLGVLRLLLFKL LLFDLLLTCSCLCDPAGPLPSPATTTRLRALGSHRLHPATETGGREATSSPRPQPRDR RWGDTPPGRKPGSPVWGEGSYLSSYPTCPAQAWCSRSRLRAPSSSLGAFFRGDLPPPL QAGAA" mat_peptide 130..924 /note="pTalpha" /product="pre-T cell receptor alpha-type chain" misc_feature 130..522 /note="encodes extracellular region, immunoglobulin superfamily C-type domain" misc_feature 220..222 /note="encodes Cys potentially involved in intramolecular disulphide linkage" misc_feature 280..291 /note="encodes potential N-linked glycosylation sites" misc_feature 400..402 /note="encodes Cys potentially involved in intramolecular disulphide linkage" misc_feature 484..486 /note="encodes Cys potentially involved in intermolecular disulphide linkage to the TCR-beta chain" misc_feature 523..582 /note="encodes transmembrane region" misc_feature 583..924 /note="encodes cytoplasmic region" 3'UTR 927..1080 polyA_signal 1053..1062 polyA_site 1080 BASE COUNT 179 a 366 c 316 g 219 t ORIGIN 1 tagaaggcag tcttgtgggt gcctcctccc ccagccgcaa ctcaggtctg cagctgggtc 61 ctgcctcctt ccgagtgggc catggccggt acatggctgc tacttctcct ggcccttggg 121 tgtccagccc tacccacagg tgtgggcggc acaccctttc cttctctggc cccaccaatc 181 atgctgctgg tggatggaaa gcagcagatg gtggtggtct gcctggtcct tgatgttgca 241 ccccctggcc ttgacagccc catctggttc tcagccggca atggcagtgc actggatgcc 301 ttcacctatg gcccttcccc agcaacggat ggcacctgga ccaacttggc ccatctctcc 361 ctgccttctg aggagctggc atcctgggag cctttggtct gccacactgg gcctggggct 421 gagggtcaca gcaggagtac acagcccatg catctgtcag gagaggcttc tacagccagg 481 acctgccccc aggagcctct cagggggaca ccgggtgggg cgctgtggct gggggtcctg 541 cggctgctgc tcttcaagct gctgctgttt gacctgctcc tgacctgcag ctgcctgtgc 601 gaccccgcgg gcccgctgcc ttcccccgca accaccaccc gcctgcgagc cctcggctcc 661 catcgactgc acccggccac ggagactggg ggacgagagg ccaccagctc acccagaccc 721 cagcctcggg accgccgctg gggtgacacc cctccgggtc ggaagcccgg gagcccagta 781 tggggggaag ggtcttacct cagcagttac cccacttgcc cagcacaggc ctggtgctca 841 agatctcgcc tcagggctcc ttcctccagt cttggagcat tttttcgagg tgacctgcct 901 cctcctctgc aggctggagc tgcctgaggg cagggctcta cctcccctgc gtcacactgt 961 gtgaggctgt gtctctgcca tccaaaaggg ggccccttga gaatggtgat ccacccagtt 1021 acaggggcat ttagggagca gatgactgag aacattaaaa aagaacttaa atgacacagc // LOCUS HSU36764 1328 bp mRNA PRI 24-OCT-1995 DEFINITION Human TGF-beta receptor interacting protein 1 mRNA, complete cds. ACCESSION U36764 NID g1036804 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1328) AUTHORS Chen,R.H., Miettinen,P.J., Maruoka,E.M., Choy,L. and Derynck,R. TITLE A WD-domain protein that is associated with and phosphorylated by the type II TGF-beta receptor JOURNAL Nature 377 (6549), 548-552 (1995) MEDLINE 96013749 REFERENCE 2 (bases 1 to 1328) AUTHORS Chen,R.-H., Miettinen,P.J., Maruka,E.M., Choy,L. and Derynck,R. TITLE Direct Submission JOURNAL Submitted (20-SEP-1995) Ruey-Hwa Chen, Growth and Development, University of California at San Francisco, San Francisco, CA 94143, USA FEATURES Location/Qualifiers source 1..1328 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 38..1015 /note="TRIP-1; WD domain protein" /codon_start=1 /product="TGF-beta receptor interacting protein 1" /db_xref="PID:g1036805" /translation="MKPILLQGHERSITQIKYNREGDLLFTVAKDPIVNVWYSVNGER LGTYMGHTGAVWCVDADWDTKHVLTGSADNSCRLWDCETGKQLALLKTNSAVRTCGFD FGGNIIMFSTDKQMGYQCFVSFFDLRDPSQIDNNEPYMKIPCNDSKITSAVWGPLGEC IIAGHESGELNQYSAKSGEVLVNVKEHSRQINDIQLSRDMTMFVTASKDNTAKLFDST TLEHQKTFRTERPVNSAALSPNYDHVVLGGGQEAMDVTTTSTRIGKFEARFFHLAFEE EFGRVKGHFGPINSVAFHPDGKSYSSGGEDGYVRIHYFDPQYFEFEFEA" BASE COUNT 338 a 343 c 349 g 298 t ORIGIN 1 ggcacgaggt tgcggccttc ctcgcgtcac cgccgggatg aagccgatcc tactgcaggg 61 ccatgagcgg tccattacgc agattaagta taaccgcgaa ggagacctcc tctttactgt 121 ggccaaggac cctatcgtca atgtatggta ctctgtgaat ggtgagaggc tgggcaccta 181 catgggccat accggagctg tgtggtgtgt ggacgctgac tgggacacca agcatgtcct 241 cactggctca gctgacaaca gctgtcgtct ctgggactgt gaaacaggaa agcagctggc 301 ccttctcaag accaattcgg ctgtccggac ctgcggtttt gactttgggg gcaacatcat 361 catgttctcc acggacaagc agatgggcta ccagtgcttt gtgagctttt ttgacctgcg 421 ggatccgagc cagattgaca acaatgagcc ctacatgaag atcccttgca atgactctaa 481 aatcaccagt gctgtttggg gacccctggg ggagtgcatc atcgctggcc atgagagtgg 541 agagctcaac cagtatagtg ccaagtctgg agaggtgttg gtgaatgtta aggagcactc 601 ccggcagatc aacgacatcc agttatccag ggacatgacc atgtttgtga ccgcgtccaa 661 ggacaacaca gccaagcttt ttgactccac aactcttgaa catcagaaga ctttccggac 721 agaacgtcct gtcaactcag ctgccctctc ccccaactat gaccatgtgg tcctgggcgg 781 tggtcaggaa gccatggatg taaccacaac ctccaccagg attggcaagt ttgaggccag 841 gttcttccat ttggcctttg aagaagagtt tggaagagtc aagggtcact ttggacctat 901 caacagtgtt gccttccatc ctgatggcaa gagctacagc agcggcggcg aagatggtta 961 cgtccgtatc cattacttcg acccacagta cttcgaattt gagtttgagg cttaagaagc 1021 tggatctcct gccgggcgtg gtggctcatg cctgtaatcc caccactttt ttttaaggca 1081 ggcggatcac ctgaggtcag gagtttaaga ccagcctgac caacatggag aaactcgtct 1141 ctactaaaaa tacaaaaata caaaaattag ccaggcatgg tggcacacgc ctatagtccc 1201 agctactcag gaggctgagg caggagaatc acttgaaccc aggaggcata ggttgcagtg 1261 agctgagatc acgtcattgc actccatcct gagccacaag agcaaaactc cgtctcaaaa 1321 aaaaaaaa // LOCUS HSU36787 1086 bp mRNA PRI 18-NOV-1996 DEFINITION Human putative holocytochrome c-type synthetase mRNA, complete cds. ACCESSION U36787 NID g1209634 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1086) AUTHORS Schaefer,L., Ballabio,A. and Zoghbi,H.Y. TITLE Cloning and characterization of a putative human holocytochrome c-type synthetase gene (HCCS) isolated from the critical region for microphthalmia with linear skin defects (MLS) JOURNAL Genomics 34 (2), 166-172 (1996) MEDLINE 96299705 REFERENCE 2 (bases 1 to 1086) AUTHORS Schaefer,L., Ballabio,A. and Zoghbi,H.Y. TITLE Direct Submission JOURNAL Submitted (20-SEP-1995) Laura Schaefer, Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..1086 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Xp22.2-22.3" /chromosome="X" CDS 200..1006 /note="putative" /codon_start=1 /product="holocytochrome c-type synthetase" /db_xref="PID:g1209635" /translation="MGLSPSAPAVAVQASNASASPPSGCPMHEGKMKGCPVNTEPSGP TCEKKTYSVPAHQERAYEYVECPIRGTAAENKENLDPSNLMPPPNQTPAPDQPFALST VREESSIPRADSEKKWVYPSEQMFWNAMLKKGWKWKDEDISQKDMYNIIRIHNQNNEQ AWKEILKWEALHAAECPCGPSLIRFGGKAKEYSPRARIRSWMGYELPFDRHDWIINRC GTEVRYVIDYYDGGEVNKDYQFTILDVRPALDSLSAVWDRMKVAWWRWTS" BASE COUNT 294 a 230 c 303 g 259 t ORIGIN 1 cggggcggcg gcggcggcgt gaagtcactg ctgctctggg ttcgggttgg cgactgaagg 61 cggtaccggc ctcccggaac agcccggggg agggcttagg tgcagaaggg caggctggcc 121 gcggccggtt tggtctgggg accacgggct ggagcaggtg gaaatttaaa attgtttaca 181 gtcaacactg tttccagcca tgggtttgtc tccatctgct cctgctgttg cagttcaggc 241 ctcaaatgct tcagcgtccc caccttcagg atgcccgatg catgaaggga aaatgaaagg 301 ctgtccagtg aatacagagc catctggccc aacctgtgag aagaaaacat actctgtgcc 361 tgcccaccag gaacgcgcct atgagtacgt ggagtgtccc attaggggca ctgcggctga 421 gaataaggag aacctagatc cttcaaatct gatgccacca ccaaatcaaa caccagctcc 481 agatcagcca tttgcattgt ctactgtcag agaagagtca tccattccga gagcagattc 541 agagaaaaag tgggtttacc cttctgagca gatgttctgg aatgcaatgt taaagaaagg 601 gtggaagtgg aaggatgagg atatcagtca gaaggatatg tataatatca ttagaattca 661 caatcagaat aacgagcagg cttggaagga gattttgaag tgggaagccc ttcatgctgc 721 agagtgtcct tgtggtccat cattgatccg gtttggaggg aaagcaaaag agtattcacc 781 aagggcacga attcgttcct ggatggggta tgagttgcct tttgataggc acgattggat 841 cataaaccgt tgcgggacag aagttagata tgtgattgat tattatgatg gtggtgaagt 901 caacaaggac taccagttca ccatcctgga cgtccgtcct gccttagatt cactttcggc 961 agtatgggac agaatgaaag tcgcttggtg gcgttggacc tcgtaaagca ctgtttcaga 1021 tggaaaaata taaactattt ttttctgagc gatacattaa actattttcc ccagaaaaaa 1081 aaaaaa // LOCUS HSU37012 4463 bp mRNA PRI 05-DEC-1995 DEFINITION Human cleavage and polyadenylation specificity factor mRNA, complete cds. ACCESSION U37012 NID g1045573 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4463) AUTHORS Murthy,K.G. and Manley,J.L. TITLE The 160-kD subunit of human cleavage-polyadenylation specificity factor coordinates pre-mRNA 3'-end formation JOURNAL Genes Dev. 9 (21), 2672-2683 (1995) MEDLINE 96067159 REFERENCE 2 (bases 1 to 4463) AUTHORS Murthy,K.G. TITLE Direct Submission JOURNAL Submitted (25-SEP-1995) Kanneganti G. Murthy, Biological Sciences, Columbia University, 715 Fairchild Bldg, New York, New York, 10027, USA FEATURES Location/Qualifiers source 1..4463 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa cell subclone D98/AH2" /cell_type="uterine cervical carcinoma" 5'UTR 1..51 CDS 52..4380 /note="160 kDa subunit" /codon_start=1 /product="cleavage and polyadenylation specificity factor" /db_xref="PID:g1045574" /translation="MYAVYKQAHPPPGLEFSMYCNFFNNSERNLVVAGTSQLYVYRLN RDAEALTKNDRSTEGKAHREKLELAASFSFFGNVMSMASVQLAGAKRDALLLSFKDAK LSVVEYDPGTHDLKTLSLHYFEEPELRDGFVQNVHTPRVRVDPDGRCAAMLVYGTRLV VLPFRRESLAEEHEGLVGEGQRSSFLPSYIIDVRALDEKLLNIIDLQFLHGYYEPTLL ILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIG GVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLRTQEGVRITLDCAQATFISYDK MVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSL LLKYTEKLQEPPASAVREAADKEEPPSKKKRVDATAGWSAAGKSVPQDEVDEIEVYGS EAQSGTQLATYSFEVCDSILNIGPCANAAVGEPAFLSEEFQNSPEPDLEIVVCSGHGK NGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDNPKGEGTEQEPSTTPEAD DDGRRHGFLILSREDSTMILQTGQEIMELDTSGFATQGPTVFAGNIGDNRYIVQVSPL GIRLLEGVNQLHFIPVDLGAPIVQCAVADPYVVIMSAEGHVTMFLLKSDSYGGRHHRL ALHKPPLHHQSKVITLCLYRDLSGMFTTESRLGGARDELGGRSGPEAEGLGSETSPTV DDEEEMLYGDSGSLFSPSKEEARRSSQPPADRDPAPFRAEPTHWCLLVRENGTMEIYQ LPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEARREEATRQGELPLVKEVLLVALGS RQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREKKPKPSKKKAE GGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMAIDGPV DSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVE SKVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSWEAIP NARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIE VVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMV DNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRATE GLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR AFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDDLLETD RVTAHF" 3'UTR 4381..4463 BASE COUNT 879 a 1414 c 1364 g 806 t ORIGIN 1 ctgtcccggt tcctctcgag tcggctccaa ctgccagccc gggttggcgc catgtacgcc 61 gtgtacaaac aggcgcatcc gccacccggt ctggagttct ccatgtactg caacttcttc 121 aacaacagcg agcgcaacct ggtagtggcc gggacctcgc agctctacgt gtaccgcctc 181 aaccgcgacg ccgaggctct gaccaagaat gacaggagca cagaggggaa ggcccaccgg 241 gagaagctcg agcttgctgc ctccttctcc ttctttggca acgtcatgtc catggccagc 301 gtgcagctgg caggagccaa gcgggatgcc ctgctcctaa gcttcaagga tgccaagctg 361 tctgtggtgg agtacgaccc gggcacccat gacctgaaga ccctgtcact gcactacttt 421 gaggagcctg agcttcggga cgggtttgtg cagaatgtac acacgccgcg agtgcgggtg 481 gaccccgacg ggcgctgtgc agccatgctt gtctacggca cgcggctggt ggtcctgccc 541 ttccgcaggg agagcctggc tgaggagcac gaggggctcg tgggtgaggg gcagaggtcc 601 agcttcctgc ccagctacat catcgacgtg cgggccctag acgagaagct gctcaacatc 661 atcgacctgc agttcctgca tggctactac gagcctaccc tcctcatcct gtttgagccc 721 aaccagacct ggcctgggcg cgtggccgtg cggcaggaca cgtgctccat tgtggccatc 781 tcactgaaca tcacgcagaa ggtgcacccc gtcatctggt ccctcaccag cctgcccttt 841 gactgcaccc aggctctggc tgtgcccaag cccataggtg gggtggtggt gtttgccgtc 901 aactcgctgt tgtacctgaa ccagagcgtc cccccgtatg gcgtggctct caacagtctc 961 accacaggaa ccacggcttt cccgcttcgc acccaggagg gtgtgcggat caccctggac 1021 tgcgcccagg ccaccttcat ctcctacgac aagatggtca tctccctcaa gggcggcgag 1081 atctacgtgc tgaccctcat caccgacggc atgcgcagtg tccgagcgtt ccactttgac 1141 aaggcggccg ccagcgtcct caccaccagc atggtcacca tggagcccgg gtacctgttc 1201 ctgggttctc gcctgggcaa ttccctcctc ctcaagtaca cggagaagct gcaggagccc 1261 ccggccagtg ctgtccgtga ggctgccgac aaggaagagc ctccctcaaa gaagaagcga 1321 gtggatgcga cggccggctg gtcagctgcg ggtaagtcgg tgccgcagga tgaggtggac 1381 gagattgaag tgtacggcag cgaggcccag tcgggaacac agctggccac ctactccttt 1441 gaggtgtgtg acagcatcct gaacattgga ccctgtgcca atgccgccgt gggcgagcct 1501 gccttcctct ctgaagagtt tcagaacagc cccgagccgg acctggagat tgtggtttgc 1561 tccggccacg ggaagaacgg ggctttgtcg gtgctgcaga agagcatccg gccccaggtg 1621 gtgacaacct ttgagcttcc cggctgctat gacatgtgga cagtcatcgc cccggtgcgt 1681 aaggaggagg aggacaatcc caagggggag ggcacagagc aggaacccag caccacccct 1741 gaagcagacg acgacggccg cagacacgga ttcctgattc tgagccggga agactccacc 1801 atgatcctgc agacggggca ggagatcatg gagctggaca ccagtggctt cgccactcag 1861 ggccccacgg tctttgctgg gaacatcggg gacaaccgct acattgtcca agtgtcacca 1921 ctgggcatcc gcctgctgga aggagtgaat cagctgcact tcatccccgt ggacctgggc 1981 gcccccatcg tgcagtgcgc cgtggccgac ccctatgtgg tcatcatgag tgccgagggc 2041 cacgtcacca tgttcctgct gaagagtgac tcctacggtg gccgccacca ccgcctggcg 2101 ctgcacaagc ccccgctgca ccatcagtcc aaggtgatta cgctgtgcct gtaccgagac 2161 ctcagcggca tgttcaccac tgagagccgc ctgggtgggg cccgtgacga gctcgggggc 2221 cgcagtggcc cggaggccga gggcctgggc tcagagacta gccccacagt ggatgacgag 2281 gaggagatgc tgtatgggga ttcgggctcc ctcttcagcc ccagcaagga ggaggcccga 2341 agaagcagcc agccccctgc tgaccgggac cctgcaccct tccgggcaga gcctacccac 2401 tggtgcctgc tggtgcggga gaatggcacc atggagatct accagcttcc cgactggcgg 2461 ctggtgttcc tggtgaagaa cttccctgtg gggcagcggg tccttgtgga cagctccttt 2521 ggacagccca ctacacaggg cgaggcccgc agggaggagg ccacgcgcca gggggagctg 2581 cccctcgtca aggaggtgct gctggtggcg ctgggcagcc gccagagcag gccctacctg 2641 ctggtgcatg tggaccaaga gctgcttatc tacgaggcct tcccccacga ctctcagctc 2701 ggccagggca atctcaaagt ccgctttaag aaggtccctc acaacatcaa cttccgtgag 2761 aagaagccaa agccatccaa gaagaaagca gaaggtggcg gcgcagagga gggggctggg 2821 gcccggggcc gcgtggcgcg tttccgctac ttcgaggata tttatggcta ctcaggggtc 2881 ttcatctgcg gcccctcccc tcactggctc ttggtgaccg gccgaggggc tctgcggcta 2941 caccccatgg ccatcgacgg cccggtcgac tctttcgctc cattccacaa tgtcaactgt 3001 ccccgcggct tcctgtactt caacagacag ggcgagctga ggatcagtgt cctgcctgcc 3061 tacctgtcct atgatgcccc atggcctgtc aggaagatcc cgctgcgctg cacggcccac 3121 tatgtggctt accacgtgga gtctaaggtg tatgctgtgg ccaccagcac caacacgccg 3181 tgtgcccgca tcccacgcat gactggcgag gagaaggagt ttgagaccat cgagagagat 3241 gagcggtaca tccaccccca gcaggaggcc ttctccatcc agctcatctc cccggtcagc 3301 tgggaggcta ttcccaatgc caggatcgag ctgcaggagt gggagcatgt gacctgcatg 3361 aagacagtgt ctctgcgcag tgaggagacc gtgtcgggcc tcaaaggcta cgtggccgcc 3421 gggacctgcc tcatgcaggg ggaggaggtc acgtgccgag ggcggatctt gatcatggat 3481 gtgattgagg tggtgcccga gcctggccag cccttgacca agaacaagtt caaagtcctt 3541 tacgagaagg agcagaaggg gcccgtgacc gccctgtgcc actgcaatgg ccacctggtg 3601 tcggccatcg gccagaagat tttcctgtgg agcctgcggg ccagcgagct gacgggcatg 3661 gccttcatcg acacgcagct ctacatacac cagatgatca gcgtcaagaa cttcatcctg 3721 gcagccgacg tcatgaagag catttcgctg ctgcgctacc aggaggaaag caagacgctg 3781 agcctggtgt cgcgggatgc caagcccctg gaggtgtaca gcgtggactt catggtggac 3841 aatgcccagc tgggttttct ggtgtctgac cgcgaccgca acctcatggt gtacatgtac 3901 ctgcccgaag ccaaggagag tttcgggggc atgcgcctgc tgcgtcgggc agacttccac 3961 gtgggtgccc acgtgaacac gttctggagg accccgtgcc gggccactga agggctcagc 4021 aaaaagtcgg tcgtgtggga gaataagcac atcacgtggt ttgccaccct ggacggcggc 4081 atcgggctgc tgctgcccat gcaggagaag acctaccggc ggctgctgat gctgcagaac 4141 gcgctgacca ccatgctgcc acaccacgcc ggcctcaacc cccgcgcctt ccggatgctg 4201 cacgtggacc gccgcaccct ccagaatgcc gtgcgcaacg tgctggatgg ggagctgctc 4261 aaccgctacc tgtacctgag caccatggag cgcagcgagc tagccaagaa gatcggcacc 4321 acaccagaca taatcctgga cgacttgctg gagacggacc gcgtcaccgc ccacttctag 4381 ccccgtggat gccgtcacca ccagcacacg gaactacctc ccaccccctt tttgtacaaa 4441 acacaaggaa aaacattttt tgc // LOCUS HSU37139 1041 bp mRNA PRI 05-DEC-1995 DEFINITION Human beta 3-endonexin mRNA, long form and short form, complete cds. ACCESSION U37139 NID g1065438 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1041) AUTHORS Shattil,S.J., O'Toole,T., Eigenthaler,M., Thon,V., Williams,M., Babior,B.M. and Ginsberg,M.H. TITLE Beta 3-endonexin, a novel polypeptide that interacts specifically with the cytoplasmic tail of the integrin beta 3 subunit JOURNAL J. Cell Biol. 131 (3), 807-816 (1995) MEDLINE 96042322 REFERENCE 2 (bases 1 to 1041) AUTHORS Shattil,S. TITLE Direct Submission JOURNAL Submitted (27-SEP-1995) Sanford Shattil, Vascular Biology, The Scripps Research Institute, 10666 N. Torrey Pines Rd., La Jolla, CA 92037, USA FEATURES Location/Qualifiers source 1..1041 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA 1..1041 /product="beta 3-endonexin long form" mRNA join(1..463,558..614,665..1041) /product="beta 3-endonexin short form" CDS 131..643 /note="The long form does not bind to the integrin beta 3 subunit cytoplasmic domain" /codon_start=1 /product="beta 3-endonexin long form" /db_xref="PID:g1065440" /translation="MPVKRSLKLDGLLEENSFDPSKITRKKSVITYSPTTGTCQMSLF ASPTSSEEQKHRNGLSNEKRKKLNHPSLTESKESTTKDNDEFMMLLSKVEKLSEEIME IMQNLSSIQALEGSRELENLIGISCASHFLKREMQKTKELMTKVNKQKLFEKSTGLPH KGQPQMSQPL" CDS join(131..463,558..560) /note="binds to the integrin beta 3 subunit cytoplasmic domain" /codon_start=1 /product="beta 3-endonexin short form" /db_xref="PID:g1065439" /translation="MPVKRSLKLDGLLEENSFDPSKITRKKSVITYSPTTGTCQMSLF ASPTSSEEQKHRNGLSNEKRKKLNHPSLTESKESTTKDNDEFMMLLSKVEKLSEEIME IMQNLSSIQ" BASE COUNT 348 a 200 c 200 g 293 t ORIGIN 1 ctggttcggc ccacctctga aggttccaga atcgatagtg aattcgtggt ttcctttggc 61 ggattttctg ttttcggaag ttgctgggtt cgttttattc agcggcagtg gtgctttccc 121 gaatctcaga atgcctgtta aaagatcact gaagttggat ggtctgttag aagaaaattc 181 atttgatcct tcaaaaatca caaggaagaa aagtgttata acttattctc caacaactgg 241 aacttgtcaa atgagtctat ttgcttctcc cacaagttct gaagagcaaa agcacagaaa 301 tggactatca aatgaaaaga gaaaaaaatt gaatcacccc agtttaactg aaagcaaaga 361 atctacaaca aaagacaatg atgaattcat gatgttgcta tcaaaagttg agaaattgtc 421 agaagaaatc atggagataa tgcaaaattt aagtagtata caggctttgg agggcagtag 481 agagcttgaa aatctcattg gaatctcctg tgcatcacat ttcttaaaaa gagaaatgca 541 gaaaaccaaa gaactaatga caaaagtgaa taaacaaaaa ctgtttgaaa agagtacagg 601 acttcctcac aaaggtcagc ctcagatgtc acaacctctg tgaagctctc cccagctctc 661 ctagcatcac gtcatcttga cagctatgaa ttccttaaag ccattttaaa ctgaggcatt 721 aagaagaaat gcactcacca tgagcaccaa cttctgcatc tgcctgatca tatttaaagg 781 aacagagaaa tatttgtaat taatctgccc agtaaatacc agctcgtagc agttggcagg 841 tgcatgtcta gataaaattt cttgcagcta atttaaactt tctacacgca ccagtagata 901 atctcaatgt aaataataca tttcttcttg gctctttaat gtaagccaac atggagagga 961 agatcttgac ttatattctg taccacatac acttctgtgg acttttagca tttgtgggta 1021 gacttaatgg ccttcgtggc c // LOCUS HSU37143 1857 bp mRNA PRI 16-FEB-1996 DEFINITION Human cytochrome P450 monooxygenase CYP2J2 mRNA, complete cds. ACCESSION U37143 NID g1185451 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1857) AUTHORS Wu,S., Moomaw,C.R., Tomer,K.B., Falck,J.R. and Zeldin,D.C. TITLE Molecular cloning and expression of CYP2J2, a human cytochrome P450 arachidonic acid epoxygenase highly expressed in heart JOURNAL J. Biol. Chem. 271 (7), 3460-3468 (1996) MEDLINE 96216439 REFERENCE 2 (bases 1 to 1857) AUTHORS Wu,S., Zeldin,D.C., Moomaw,C., Tomer,K.B. and Falck,J.R. TITLE Direct Submission JOURNAL Submitted (26-SEP-1995) Shu Wu, Pulmonary Pathobiol., NIH/NIEHS, 111 TW Alexander Drive, Research Triangle Park, NC 27709, USA FEATURES Location/Qualifiers source 1..1857 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="sw2-14" CDS 6..1514 /note="arachidonic acid monooxygenase; hemoprotein" /codon_start=1 /product="cytochrome P450 monooxygenase CYP2J2" /db_xref="PID:g1185452" /translation="MLAAMGSLAAALWAVVHPRTLLLGTVAFLLAADFLKRRRPKNYP PGPWRLPFLGNFFLVDFEQSHLEVQLFVKKYGNLFSLELGDISAVLITGLPLIKEALI HMDQNFGNRPVTPMREHIFKKNGLIMSSGQAWKEQRRFTLTALRNFGLGKKSLEERIQ EEAQHLTEAIKEENGQPFDPHFKINNAVSNIICSITFGERFEYQDSWFQQLLKLLDEV TYLEASKTCQLYNVFPWIMKFLPGPHQTLFSNWKKLKLFVSHMIDKHRKDWNPAETRD FIDAYLKEMSKHTGNPTSSFHEENLICSTLDLFFAGTETTSTTLRWALLYMALYPEIQ EKVQVEIDRVIGQGQQPSTAARESMPYTNAVIHEVQRMGNIIPQNVPREVTVDTTLAG YHLPKGTMILTNLTALHRDPTEWATPDTFNPDHFLENGQFKKREAFMPFSIGKRACLG EQLARTELFIFFTSLMQKFTFRPPNNEKLSLKFRMGITISPVSHRLCAVPQV" polyA_site 1857 /note="20 A nucleotides" BASE COUNT 535 a 449 c 419 g 454 t ORIGIN 1 gagccatgct cgcggcgatg ggctctctgg cggctgccct ctgggcagtg gtccatcctc 61 ggactctcct actgggcact gtcgcctttc tgctcgctgc tgactttctc aaaagacggc 121 gcccaaagaa ctacccgccg gggccctggc gcctgccctt ccttggcaac ttcttccttg 181 tggacttcga gcagtcgcac ctggaggttc agctgtttgt gaagaaatat gggaaccttt 241 ttagcttgga gcttggtgac atatctgcag ttcttattac tggcttgccc ttaatcaaag 301 aagcccttat ccacatggac caaaactttg ggaaccggcc cgtgacccct atgcgagaac 361 atatctttaa gaaaaatgga ttgattatgt caagtggcca ggcatggaag gagcaaagaa 421 ggttcactct gacagcacta aggaactttg gtttaggaaa gaagagctta gaggaacgca 481 ttcaggagga ggcccaacac ctcactgaag caataaaaga ggagaacgga cagccttttg 541 accctcattt caagatcaac aatgcagttt ccaatatcat ttgctccatc accttcggag 601 aacgctttga gtaccaggat agttggtttc agcagctgct gaagttacta gatgaagtca 661 catacttgga ggcttcaaag acatgccagc tctacaatgt ctttccatgg ataatgaaat 721 tcctgcctgg accccaccaa actctcttca gcaactggaa aaaactgaaa ttgtttgttt 781 ctcatatgat tgacaaacac agaaaggatt ggaatcctgc agaaacaaga gactttattg 841 atgcttacct taaagaaatg tcaaagcaca caggcaatcc tacttcaagt ttccatgaag 901 aaaacctcat ctgcagcacc ctggacctct tctttgccgg aaccgagaca acttccacaa 961 ctctgcgatg ggctctgctt tatatggccc tctacccaga aatccaagaa aaagtacaag 1021 tcgagattga cagagtgatt ggccaggggc agcagccgag cacagccgcc cgggagtcca 1081 tgccctacac caatgctgtc atccatgagg tgcagagaat gggcaacatc atcccccaga 1141 acgttcccag ggaagtgaca gttgatacca ctttggctgg gtaccacctg cccaagggta 1201 ccatgatcct gaccaatttg acggcgctgc acagggaccc cacagagtgg gccacccctg 1261 acacattcaa tccggaccat tttctggaga atggacagtt taagaaaagg gaagccttta 1321 tgcctttctc aataggaaag cgggcatgcc tcggagaaca gttggccagg actgagctgt 1381 ttattttctt cacttccctt atgcaaaaat ttaccttcag gcccccaaac aatgagaagc 1441 tgagcctgaa gtttagaatg ggtatcacca tttccccagt cagtcaccgc ctctgcgctg 1501 ttcctcaggt gtaatattgt taagaaagaa aggggcaagg aaagcaagaa gacatggcac 1561 gtgttctgaa accactggtg tctgctcaga tgtgttggga caaaatgaaa gtgactttca 1621 agaaagatca gaggaatttg actcagagaa aactagatcc aaatcccagc tctactgtct 1681 ctcccgaatt agccttggga aaatcattta tatgctaaat aatttacctt tttatctagg 1741 agatgaaaag aggataatgt ttccttccat aaagaaagtt cttgtaagaa tcaaaagaaa 1801 tggtgagctt taagtggttt gtaaaccata aaacacatca taaaagttct atctata // LOCUS HSU37146 5989 bp mRNA PRI 31-OCT-1995 DEFINITION Human silencing mediator of retinoid and thyroid hormone action (SMRT) mRNA, complete cds. ACCESSION U37146 NID g1045654 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5989) AUTHORS Chen,J.D. and Evans,R.M. TITLE A transcriptional co-repressor that interacts with nuclear hormone receptors JOURNAL Nature 377 (6548), 454-457 (1995) MEDLINE 96008552 REFERENCE 2 (bases 1 to 5989) AUTHORS Chen,J.D. and Evans,R.M. TITLE Direct Submission JOURNAL Submitted (27-SEP-1995) J. Don Chen, Gene Expression Lab, The Salk Institute, 10010 N. Torrey Pines Road, La Jolla, CA 92037, USA FEATURES Location/Qualifiers source 1..5989 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" gene 496..4983 /gene="SMRT" CDS 496..4983 /gene="SMRT" /note="transcriptional co-repressor" /codon_start=1 /product="silencing mediator of retinoid and thyroid hormone action" /db_xref="PID:g1045655" /translation="MEAWDAHPDKEAFAAEAQKLPGDPPCWTSGLPFPVPPREVIKAS PHAPDPSAFSYAPPGHPLPLGLHDTARPVLPRPPTISNPPPLISSAKHPSVLERQIGA ISQGMSVQLHVPYSEHAKAPVGPVTMGLPLPMDPKKLAPFSGVKQEQLSPRGQAGPPE SLGVPTAQEASVLRGTALGSVPGGSITKGIPSTRVPSDSAITYRGSITHGTPADVLYK GTITRIIGEDSPSRLDRGREDSLPKGHVIYEGKKGHVLSYEGGMSVTQCSKEDGRSSS GPPHETAAPKRTYDMMEGRVGRAISSASIEGLMGRAIPPERHSPHHLKEQHHIRGSIT QGIPRSYVEAQEDYLRREAKLLKREGTPPPPPPSRDLTEAYKTQALGPLKLKPAHEGL VATVKEAGRSIHEIPREELRHTPELPLAPRPLKEGSITQGTPLKYDTGASTTGSKKHD VRSLIGSPGRTFPPVHPLDVMADARALERACYEESLKSRPGTASSSGGSIARGAPVIV PELGKPRQSPLTYEDHGAPFAGHLPRGSPVTMREPTPRLQEGSLSSSKASQDRKLTST PREIAKSPHSTVPEHHPHPISPYEHLLRGVSGVDLYRSHIPLAFDPTSIPRGIPLDAA AAYYLPRHLAPNPTYPHLYPPYLIRGYPDTAALENRQTIINDYITSQQMHHNTATAMA QRADMLRGLSPRESSLALNYAAGPRGIIDLSQVPHLPVLVPPTPGTPATAMDRLAYLP TAPQPFSSRHSSSPLSPGGPTHLTKPTTTSSSERERDRDRERDRDREREKSILTSTTT VEHAPIWRPGTEQSSGSSGSSGGGGGSSSRPASHSHAHQHSPISPRTQDALQQRPSVL HNTGMKGIITAVEPSKPTVLRSTSTSSPVRPAATFPPATHCPLGGTLDGVYPTLMEPV LLPKEAPRVARPERPRADTGHAFLAKPPARSGLEPASSPSKGSEPRPLVPPVSGHATI ARTPAKNLAPHHASPDPPAPPASASDPHREKTQSKPFSIQELELRSLGYHGSSYSPEG VEPVSPVSSPSLTHDKGLPKHLEELDKSHLEGELRPKQPGPVKLGGEAAHLPHLRPLP ESQPSSSPLLQTAPGVKGHQRVVTLAQHISEVITQDYTRHHPQQLSAPLPAPLYSFPG ASCPVLDLRRPPSDLYLPPPDHGAPARGSPHSEGGKRSPEPNKTSVLGGGEDGIEPVS PPEGMTEPGHSRSAVYPLLYRDGEQTEPSRMGSKSPGNTSQPPAFFSKLTESNSAMVK SKKQEINKKLNTHNRNEPEYNISQPGTEIFNMPAITGTGLMTYRSQAVQEHASTNMGL EAIIRKALMGKYDQWEESPPLSANAFNPLNASASLPAAMPITAADGRSDHTLTSPGGG GKAKVSGRPSSRKAKSPAPGLASGDRPPSVSSVHSEGDCNRRTPLTNRVWEDRPSSAG STPFPYNPLIMRLQAGVMASPPPPGLPAGSGPLAGPHHAWDEEPKPLLCSQYETLSDS E" exon 4483..4620 /gene="SMRT" /note="alternatively spliced insert" BASE COUNT 1208 a 2200 c 1665 g 916 t ORIGIN 1 gtgttcctgt aatggtaatg atccgatctt cggatccttc taaaggctca tcaattttga 61 tcgaagctcc cgactcatga cggatttgtt taatccgctg accacctttg ccaataatag 121 atccagccaa atctttggga atagttactt gtgtagtaat aataggtcca ccaagatcac 181 catatgagcc acgaccccct gcataggaat aatcatatcc ggagccaccc tgtggttcat 241 aagccatctg ccattctgat gggctccatg tatctattgc agagtcccaa gtttcatcag 301 cactgaaacc aaccatgccg tcgtaacggt ctccaggtct ccctcttctg tcataggcca 361 tgaggtctcc ccctctaggt ggtggtggtg gaggaagagg aagattccga gctctgctac 421 caccccggcc gcctcgtccg ggaggagggg gaggtggtcc tcgacgaggg ctcatatcat 481 cataatctct tctagatgga ggcatgggac gcccaccccg acaaggaggc cttcgcagcc 541 gaggcccaga agctgcctgg ggacccccct tgctggactt ccggcctgcc cttccccgtg 601 cccccccgtg aggtgatcaa ggcctccccg catgccccgg acccctcagc cttctcctac 661 gctccacctg gtcacccact gcccctgggc ctccatgaca ctgcccggcc cgtcctgccg 721 cgcccaccca ccatctccaa cccgcctccc ctcatctcct ctgccaagca ccccagcgtc 781 ctcgagaggc aaataggtgc catctcccaa ggaatgtcgg tccagctcca cgtcccgtac 841 tcagagcatg ccaaggcccc ggtgggccct gtcaccatgg ggctgcccct gcccatggac 901 cccaaaaagc tggcaccctt cagcggagtg aagcaggagc agctgtcccc acggggccag 961 gctgggccac cggagagcct gggggtgccc acagcccagg aggcgtccgt gctgagaggg 1021 acagctctgg gctcagttcc gggcggaagc atcaccaaag gcattcccag cacacgggtg 1081 ccctcggaca gcgccatcac ataccgcggc tccatcaccc acggcacgcc agctgacgtc 1141 ctgtacaagg gcaccatcac caggatcatc ggcgaggaca gcccgagtcg cttggaccgc 1201 ggccgggagg acagcctgcc caagggccac gtcatctacg aaggcaagaa gggccacgtc 1261 ttgtcctatg agggtggcat gtctgtgacc cagtgctcca aggaggacgg cagaagcagc 1321 tcaggacccc cccatgagac ggccgccccc aagcgcacct atgacatgat ggagggccgc 1381 gtgggcagag ccatctcctc agccagcatc gaaggtctca tgggccgtgc catcccgccg 1441 gagcgacaca gcccccacca cctcaaagag cagcaccaca tccgcgggtc catcacacaa 1501 gggatccctc ggtcctacgt ggaggcacag gaggactacc tgcgtcggga ggccaagctc 1561 ctaaagcggg agggcacgcc tccgccccca ccgccctcac gggacctgac cgaggcctac 1621 aagacgcagg ccctgggccc cctgaagctg aagccggccc atgagggcct ggtggccacg 1681 gtgaaggagg cgggccgctc catccatgag atcccgcgcg aggagctgcg gcacacgccc 1741 gagctgcccc tggccccgcg gccgctcaag gagggctcca tcacgcaggg caccccgctc 1801 aagtacgaca ccggcgcgtc caccactggc tccaaaaagc acgacgtacg ctccctcatc 1861 ggcagccccg gccggacgtt cccacccgtg cacccgctgg atgtgatggc cgacgcccgg 1921 gcactggaac gtgcctgcta cgaggagagc ctgaagagcc ggccagggac cgccagcagc 1981 tcggggggct ccattgcgcg cggcgccccg gtcattgtgc ctgagctggg taagccgcgg 2041 cagagccccc tgacctatga ggaccacggg gcaccctttg ccggccacct cccacgaggt 2101 tcgcccgtga ccatgcggga gcccacgccg cgcctgcagg agggcagcct ttcgtccagc 2161 aaggcatccc aggaccgaaa gctgacgtcg acgcctcgtg agatcgccaa gtccccgcac 2221 agcaccgtgc ccgagcacca cccacacccc atctcgccct atgagcacct gcttcggggc 2281 gtgagtggcg tggacctgta tcgcagccac atccccctgg ccttcgaccc cacctccata 2341 ccccgcggca tccctctgga cgcagccgct gcctactacc tgccccgaca cctggccccc 2401 aaccccacct acccgcacct gtacccaccc tacctcatcc gcggctaccc cgacacggcg 2461 gcgctggaga accggcagac catcatcaat gactacatca cctcgcagca gatgcaccac 2521 aacacggcca ccgccatggc ccagcgagct gatatgctga ggggcctctc gccccgcgag 2581 tcctcgctgg cactcaacta cgctgcgggt ccccgaggca tcatcgacct gtcccaagtg 2641 ccacacctgc ctgtgctcgt gcccccgaca ccaggcaccc cagccaccgc catggaccgc 2701 cttgcctacc tccccaccgc gccccagccc ttcagcagcc gccacagcag ctccccactc 2761 tccccaggag gtccaacaca cttgacaaaa ccaaccacca cgtcctcgtc cgagcgggag 2821 cgagaccggg atcgagagcg ggaccgggat cgggagcggg aaaagtccat cctcacgtcc 2881 accacgacgg tggagcacgc acccatctgg agacctggta cagagcagag cagcggcagc 2941 agcggcagca gcggcggggg tgggggcagc agcagccgcc ccgcctccca ctcccatgcc 3001 caccagcact cgcccatctc ccctcggacc caggatgccc tccagcagag acccagtgtg 3061 cttcacaaca caggcatgaa gggtatcatc accgctgtgg agcccagcaa gcccacggtc 3121 ctgaggtcca cctccacctc ctcacccgtt cgcccagctg ccacattccc acctgccacc 3181 cactgcccac tgggcggcac cctcgatggg gtctacccta ccctcatgga gcccgtcttg 3241 ctgcccaagg aggccccccg ggtcgcccgg ccagagcggc cccgagcaga caccggccat 3301 gccttcctcg ccaagccccc agcccgctcc gggctggagc ccgcctcctc ccccagcaag 3361 ggctcggagc cccggcccct agtgcctcct gtctctggcc acgccaccat cgcccgcacc 3421 cctgcgaaga acctcgcacc tcaccacgcc agcccggacc cgccggcgcc acctgcctcg 3481 gcctcggacc cgcaccggga aaagactcaa agtaaaccct tttccatcca ggaactggaa 3541 ctccgttctc tgggttacca cggcagcagc tacagccccg aaggggtgga gcccgtcagc 3601 cctgtgagct cacccagtct gacccacgac aaggggctcc ccaagcacct ggaagagctc 3661 gacaagagcc acctggaggg ggagctgcgg cccaagcagc caggccccgt gaagcttggc 3721 ggggaggccg cccacctccc acacctgcgg ccgctgcctg agagccagcc ctcgtccagc 3781 ccgctgctcc agaccgcccc aggggtcaaa ggtcaccagc gggtggtcac cctggcccag 3841 cacatcagtg aggtcatcac acaggactac acccggcacc acccacagca gctcagcgca 3901 cccctgcccg cccccctcta ctccttccct ggggccagct gccccgtcct ggacctccgc 3961 cgcccaccca gtgacctcta cctcccgccc ccggaccatg gtgccccggc ccgtggctcc 4021 ccccacagcg aagggggcaa gaggtctcca gagccaaaca agacgtcggt cttgggtggt 4081 ggtgaggacg gtattgaacc tgtgtcccca ccggagggca tgacggagcc agggcactcc 4141 cggagtgctg tgtacccgct gctgtaccgg gatggggaac agacggagcc cagcaggatg 4201 ggctccaagt ctccaggcaa caccagccag ccgccagcct tcttcagcaa gctgaccgag 4261 agcaactccg ccatggtcaa gtccaagaag caagagatca acaagaagct gaacacccac 4321 aaccggaatg agcctgaata caatatcagc cagcctggga cggagatctt caatatgccc 4381 gccatcaccg gaacaggcct tatgacctat agaagccagg cggtgcagga acatgccagc 4441 accaacatgg ggctggaggc cataattaga aaggcactca tgggtaaata tgaccagtgg 4501 gaagagtccc cgccgctcag cgccaatgct tttaaccctc tgaatgccag tgccagcctg 4561 cccgctgcta tgcccataac cgctgctgac ggacggagtg accacacact cacctcgcca 4621 ggtggcggcg ggaaggccaa ggtctctggc agacccagca gccgaaaagc caagtccccg 4681 gccccgggcc tggcatctgg ggaccggcca ccctctgtct cctcagtgca ctcggaggga 4741 gactgcaacc gccggacgcc gctcaccaac cgcgtgtggg aggacaggcc ctcgtccgca 4801 ggttccacgc cattccccta caaccccctg atcatgcggc tgcaggcggg tgtcatggct 4861 tccccacccc caccgggcct ccccgcgggc agcgggcccc tcgctggccc ccaccacgcc 4921 tgggacgagg agcccaagcc actgctctgc tcgcagtacg agacactctc cgacagcgag 4981 tgactcagaa cagggcgggg gggggcgggc ggtgtcaggt cccagcgagc cacaggaacg 5041 gccctgcagg agcggggcgg ctgccgactc ccccaaccaa ggaaggagcc cctgagtccg 5101 cctgcgcctc catccatctg tccgtccaga gccggcatcc ttgcctgtct aaagccttaa 5161 ctaagactcc cgccccgggc tggccctgtg cagaccttac tcaggggatg tttacctggt 5221 gctcgggaag ggaggggaag gggccgggga gggggcacgg caggcgtgtg gcagccacac 5281 acaggcggcc agggcggcca gggacccaaa gcaggatgac cacgcacctc cacgccactg 5341 cctcccccga atgcatttgg aaccaaagtc taaactgagc tcgcagcccc cgcgccctcc 5401 ctccgcctcc catcccgctt agcgctctgg acagatggac gcaggccctg tccagccccc 5461 agtgcgctcg ttccggtccc cacagactgc cccagccaac gagattgctg gaaaccaagt 5521 caggccaggt gggcggacaa aagggccagg tgcggcctgg ggggaacgga tgctccgagg 5581 actggactgt ttttttcaca catcgttgcc gcagcggtgg gaaggaaagg cagatgtaaa 5641 tgatgtgttg gtttacaggg tatatttttg ataccttcaa tgaattaatt cagatgtttt 5701 acgcaaggaa ggacttaccc agtattactg ctgctgtgct tttgatctct gcttaccgtt 5761 caagaggcgt gtgcaggccg acagtcggtg accccatcac tcgcaggacc aagggggcgg 5821 ggactgctcg tcacgccccg ctgtgtcctc cctccctccc ttccttgggc agaatgaatt 5881 cgatgcgtat tctgtggccg ccatttgcgc agggtggtgg tattctgtca tttacacacg 5941 tcgttctaat taaaaagcga attatactcc aaaaaaaaaa aaaaaaaaa // LOCUS HSU37219 2589 bp mRNA PRI 22-FEB-1996 DEFINITION Human cyclophilin-like protein CyP-60 mRNA, complete cds. ACCESSION U37219 NID g1199597 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2589) AUTHORS Wang,B.B., Hayenga,K.J., Payan,D.G. and Fisher,J.M. TITLE Identification of a nuclear-specific cyclophilin which interacts with the proteinase inhibitor eglin c JOURNAL Biochem. J. 314 (Pt 1), 313-319 (1996) MEDLINE 96195145 REFERENCE 2 (bases 1 to 2589) AUTHORS Wang,B., Hayenga,K.J., Payan,D.G. and Fisher,J.M. TITLE Direct Submission JOURNAL Submitted (27-SEP-1995) Bruce Wang, Khepri Pharmaceuticals, 260 Littlefield Avenue, South San Francisco, CA 94080, USA FEATURES Location/Qualifiers source 1..2589 /organism="Homo sapiens" /note="Raji B lymphocyte cDNA library (Clontech)" /db_xref="taxon:9606" /clone="6.111" 5'UTR <1..92 CDS 93..1655 /codon_start=1 /product="cyclophilin-like protein CyP-60" /db_xref="PID:g1199598" /translation="MGKRQHQKDKMYITCAEYTHFYGGKKPDLPQTNFRRLPFDHCSL SLQPFVYPVCTPDGIVFDLLNIVPWLKKYGTNPSNGEKLDGRSLIKLNFSKNSEGKYH CPVLFTVFTNNTHIVAVRTTGNVYAYEAVEQLNIKAKNFRDLLTDEPFSRQDIITLQD PTNLDKFNVSNFYHVKNNMKIIDPDEEKAKQDPSYYLKNTNAETRETLQELYKEFKGD EILAATMKAPEKKKVDKLNAAHYSTGKVSASFTSTAMVPETTHEAAAIDEDVLRYQFV KKKGYVRLHTNKGDLNLELHCDLTPKTCENFIRLCKKHYYDGTIFHRSIRNFVIQGGD PTGTGTGGESYWGKPFKDEFRPNLSHTGRGILSMANSGPNSNRSQFFITFRSCAYLDK KHTIFGRVVGGFDVLTAMENVESDPKTDRPKEEIRIDATTVFVDPYEEADAQIAQERK TQLKVAPETKVKSSQPQAGSQGPQTFRQGVGKYINPAATKRAAEEEPSTSATVPMSKK KPSRGFGDFSSW" 3'UTR 1656..2589 polyA_site 1757..1762 misc_feature complement(2000..2151) /note="human Alu J subfamily similarity" BASE COUNT 584 a 795 c 668 g 542 t ORIGIN 1 cccccccccc ccccgaactc ggctgcggct ccatggtctg agttgtcagc cgttgttttt 61 tcgtgctcgc tagtcgccgc cgccgctccg ccatggggaa gcgacagcac caaaaggaca 121 aaatgtacat tacctgtgct gaatacactc acttttatgg tggcaagaag ccagatctcc 181 cacaaacaaa ttttcgtcgt ttaccttttg accactgcag tctctctctg cagccctttg 241 tctacccagt ctgcactccc gatggcatcg tctttgactt actgaacatt gttccatggc 301 ttaagaagta cgggaccaac cccagcaatg gagagaagct ggacgggagg tccctgatca 361 agctgaactt ttccaagaac agtgagggga agtaccactg cccagtgctg tttaccgtgt 421 tcaccaacaa cacccacatc gtggctgtga ggacgaccgg caacgtctac gcctatgagg 481 cagtggaaca gctaaatatc aaggccaaga acttccggga cctgctgacc gacgagccct 541 tctcccggca ggacatcatc accctccagg accccaccaa tttggacaag ttcaatgtct 601 ctaacttcta tcatgtgaag aataacatga aaataataga cccagatgaa gagaaggcca 661 aacaggaccc gtcttattat ctgaaaaata caaatgccga gacccgagag accctgcagg 721 agctctacaa ggagttcaaa ggggacgaga ttctggcagc caccatgaag gccccggaga 781 agaagaaagt ggacaagctg aatgctgccc actattccac agggaaggtc agcgcttcct 841 tcacctccac cgcgatggtc ccggagacca cacatgaagc agctgccatc gacgaggatg 901 tgctgcgcta ccagtttgtg aagaagaagg gctacgtgcg gctgcacacc aacaagggcg 961 acctcaacct ggagctgcac tgcgacctga caccaaaaac ctgcgaaaac ttcatcaggc 1021 tttgcaagaa gcattattac gatggcacca tcttccacag atccatccgg aactttgtga 1081 tccaaggggg cgaccccaca ggcacaggca cgggtgggga gtcatactgg gggaagccct 1141 tcaaagacga gttccggccc aacctctcgc acacgggccg cggcatcctc agcatggcca 1201 actccgggcc caacagcaac aggtctcaat tcttcatcac gtttcgctcc tgtgcctacc 1261 tggacaagaa gcataccatc tttggacggg ttgttggggg ctttgacgta ctgacagcca 1321 tggagaatgt ggagagtgac cccaaaactg accgccctaa ggaggagatc cgcattgatg 1381 ccactacagt gttcgtggac ccctatgagg aggccgatgc ccagattgcg caggagcgga 1441 agacacagct caaggtagcc ccggagacca aagtgaagag cagccagccc caggcaggga 1501 gccagggccc ccagaccttc cgccagggcg tgggcaagta catcaaccca gcagccacga 1561 agcgagcagc agaggaagag ccctcaacca gtgccactgt ccccatgtcc aagaagaagc 1621 ccagtcgggg ttttggggac ttcagctcct ggtagcagca ggttggccgc tgtggacctt 1681 ggtggggttg cagggctggg ggcccatgtc cacatctcca tttccagcct ttctagcctg 1741 ccctctgctg ccagccaata aattgcttgc ctgctgcctg catccccttt cctggcccct 1801 gggagcccac agccttccca tcccttaacc tgttgccaag ggccttggcc ctgtttccag 1861 gacctggccc agccagagcc cactgctggg accttcaagc acaaggcctg ccctacaccc 1921 aggctggtgc ctcaggcctc tcctctagta ggcaggccag gttagtgagg aaggactgtg 1981 tctccagatt gtggtttcct ctttaagaca gggtcttgct ctgttaccca ggctccagtg 2041 cagtggtgtg atcatggctc actgcagcct cgacctcctg ggctcaagca atcctcctgc 2101 ctcagcctcg caagtagctg ggactacagc cgtgcaccac tacatccagc tgtatatgtc 2161 tggttttctt acccctactt ctgtcatctt ctcagggaca gcctatttat acaaccagtg 2221 tggtcccctg accaacgcca ttacctggga caagttttca gaccccagac ttactgagcc 2281 taagcctctg cagggtgggc ttctcggtct gttttgacaa aacttcaggg gcttctgaag 2341 gctggtgttg gacggcagca ttgagtttcc tgccgtgccc tgcctgagct ctcagggccc 2401 tgctcacctg ctctggctgt gaaccacctg ggcttcatct caagcctgcc tggcgtctct 2461 gtgcccctgt gagaatcttg aggggaccca cactgggttg aggccagtgt ctcctgctgt 2521 gagaacaagt ggatgtccct ctccccgccc tcctgctgaa gtggccttgc tgctctcagg 2581 cccggccag // LOCUS HSU37251 2397 bp mRNA PRI 04-OCT-1996 DEFINITION Human KRAB zinc finger protein (ZNF177) mRNA, splicing variant, complete cds. ACCESSION U37251 NID g1049294 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2397) AUTHORS Baban,S., Freeman,J.D. and Mager,D.L. TITLE Transcripts from a novel human KRAB zinc finger gene contain spliced Alu and endogenous retroviral segments JOURNAL Genomics 33 (3), 463-472 (1996) MEDLINE 96299641 REFERENCE 2 (bases 1 to 2397) AUTHORS Baban,S., Freeman,J.D. and Mager,D.L. TITLE Direct Submission JOURNAL Submitted (28-SEP-1995) Dixie L. Mager, Terry Fox Laboratory, B.C. Cancer Agency, 601 West 10th Avenue, Vancouver, B.C. V5Z1L3, Canada FEATURES Location/Qualifiers source 1..2397 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="NTera2D1 cDNA library" /clone="SB2" 5'UTR <1..447 repeat_region complement(57..143) /note="similar to HERV-H endogenous retrovirus env gene" repeat_region complement(287..360) /note="similar to Alu repeat" gene 448..810 /gene="ZNF177" CDS 448..810 /gene="ZNF177" /note="Description: KRAB zinc finger protein; this is a splicing variant that contains a stop codon and frame shift between the KRAB box and the zinc finger region; Method: conceptual translation supplied by author." /codon_start=1 /db_xref="PID:g1049295" /translation="MAAGWLTTWSQNSVTFQEVAVDFSQEEWGLLDPAQKNLYKDVML ENFRNLASVGYQLCRHSLISKVDQEQLKTDERGILQGDCADWETQLKPKDTIAMQNIP GGKTSNGINMNPHGREFL" misc_feature 478..672 /gene="ZNF177" /note="encodes KRAB domain" misc_feature 1361..1948 /note="encodes zinc finger motifs in longer coding region variant" BASE COUNT 739 a 494 c 532 g 632 t ORIGIN 1 gaccttcgct tggtgtcctc ctggcctcag caacctgaca attctgtcgt gtcccgggct 61 gggtataagt aagcaagaag agggcctggg agaagagtct gacgagcaag gggaaggtag 121 ccaaggatgg agtgaaatac agggtgacct gtgtttttcc cctggctcct ttggaaatta 181 tgggaaaaaa tcaaatacca aggaagaaaa atctataatg aaattagatg tggatttctt 241 tttatgtatt cagcttgggg gcttatgata cttctagtac ccaaagagat gcggtcttgc 301 tgtgttgcct aggctggtct caaactcctg ctctcaagtg atcctcctgc ctcagcctcc 361 tgagtacatt tatatttaaa gtaattattg atggctctgc ctgcttagga ctctgcccag 421 ccaggaagga aacctacagg aggaagaatg gctgcagggt ggctgacaac ctggtcacag 481 aactcagtaa ccttccagga agtggcagtg gacttttccc aggaggagtg gggattgctg 541 gaccctgctc aaaaaaatct atacaaagat gtgatgctgg agaactttag gaacctggcc 601 tcagtagggt atcagctctg cagacacagt ctgatctcca aggtggatca agaacagctg 661 aagacagatg aaagaggaat tttacaaggt gactgtgcag actgggaaac tcaacttaaa 721 ccaaaagata caattgctat gcagaacatt cctgggggaa aaacatccaa tggcataaac 781 atgaatcctc atggaagaga attcctgtga ataaggtgaa agtgaaaaag cctctagtca 841 ccacttactc ccttttcaac aggcagaaaa tcaacctggt gagcactccc tggagtgtaa 901 ccattgtggg aaattcagaa agaacactcg ctttatttgt acaagatatt gcaagggaga 961 gaaatgctat aaatatataa agtatagcaa agtcttcaac catccctcaa ctcttaggag 1021 tcatgtgagc attcacattg gagagaaaac tcttgaattt actgattgta gaaaagcttt 1081 caatcaagag tcatccctca ggaaacactt aagaactccc acaggacaga agtttcagga 1141 gtatgagcaa tgtgatatgt ccttcagcct acactcttcc tgctcagtac gtgagcaaat 1201 acctactgga gagaaaggtg atgaatgcag tgactatggc aaaatatctc cccttagtgt 1261 ccacacaaaa actggtagtg tggaggaggg tttggaatgt aatgaacatg agaaaacttt 1321 cactgaccct ttgtcccttc agaactgtgt cagaactcac tctggagaga tgccctatga 1381 atgcagtgac tgtgggaaag ccttcatttt tcagtcttcc cttaagaaac acatgagatc 1441 tcatactgga gagaagcctt atgagtgtga tcactgtgga aaatccttta gccagagctc 1501 tcatctgaat gtgcacaaaa gaactcacac tggagagaaa ccctatgact gtaaggaatg 1561 tgggaaggct ttcactgttc cttcatccct tcagaaacat gtgagaaccc acactggaga 1621 gaaaccctat gaatgcagtg actgtggaaa agccttcatt gatcagtcat cccttaagaa 1681 acacacacgc tctcacactg gagagaagcc ttatgagtgt aaccagtgtg gaaagtcctt 1741 cagcacaggc tcttacctta ttgtgcacaa gagaactcac actggtgaga aaacctatga 1801 gtgtaaagaa tgtgggaagg cctttaggaa ttcctcttgc ctgagggtac acgtgagaac 1861 tcacactgga gagaagcctt ataaatgttt tcagtgtgaa aaagccttta gcacaagcac 1921 taaccttata atgcacaagc gaatccacaa tggccagaaa ctccatgaat gaaatgactc 1981 agggaagtgt ttgttgcccc tcatgcctcc tttctcactt tagaacatat attggagaga 2041 agccctgtta tggtcacctg gaaacagcct tctggcccaa ctctggatgc ctgttatact 2101 gggacaaacc ttataaatgt tatagctatg attgttttta tcagtgagta tttcattctt 2161 atatgtgtca caactgtaga tcctatgaat attttagtta tcttttcagt ttaattgcat 2221 tgtgaacaga acacatggtc ttcaatatgt agattctgtg tgacatagat ttgaacataa 2281 ttgctggaca ctgactgatg tactgtggca gacagaacta gttgctgtct caatatccat 2341 tcctcccttc tttcttaaag taataaaatt taccatttag ttcaaaaaaa aaaaaaa // LOCUS HSU37283 911 bp mRNA PRI 12-APR-1996 DEFINITION Human microfibril-associated glycoprotein-2 MAGP-2 mRNA, complete cds. ACCESSION U37283 NID g1165211 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 911) AUTHORS Gibson,M.A., Hatzinikolas,G., Kumaratilake,J.S., Sandberg,L.B., Nicholl,J.K., Sutherland,G.R. and Cleary,E.G. TITLE Further characterization of proteins associated with elastic fiber microfibrils including the molecular cloning of MAGP-2 (MP25) JOURNAL J. Biol. Chem. 271 (2), 1096-1103 (1996) MEDLINE 96132851 REFERENCE 2 (bases 1 to 911) AUTHORS Gibson,M.A., Hatzinikolas,G., Kumaratilake,J.S., Sandberg,L.B., Nicholl,J.K., Sutherland,G.R. and Cleary,E.G. TITLE Direct Submission JOURNAL Submitted (28-SEP-1995) Mark A. Gibson, Pathology, University of Adelaide, North Terrace, Adelaide, SA 5005, Australia FEATURES Location/Qualifiers source 1..911 /organism="Homo sapiens" /db_xref="taxon:9606" /map="12p12.3-12p13" /chromosome="12" CDS 187..708 /codon_start=1 /product="microfibril-associated glycoprotein-2 MAGP-2" /db_xref="PID:g1165212" /translation="MSLLGPKVLLFLAAFIITSDWIPLGVNSQRGDDVTQATPETFTE DPNLVNDPATDETVLAVLADIAPSTDDLASLSEKNTTAECWDEKFTCTRLYSVHRPVK QCIHQLCFTSLRRMYIVNKEICSRLVCKEHEAMKDELCRQMAGLPPRRLRRSNYFRLP PCENVDLQRPNGL" BASE COUNT 248 a 225 c 196 g 242 t ORIGIN 1 cgggctagcc tggctttctt gctctccctc atctcattgt ttcagccgga ggccaaatct 61 gaagtccttt ccagggagtg gctctgttca tcttattcgc cagccaaagt aggaacagcg 121 taagaggaga gagacacatt cagcagccaa aggactcggt ggaaagagca gaacaccata 181 gacaatatgt cgctcttggg acccaaggtg ctgctgtttc ttgctgcatt catcatcacc 241 tctgactgga tacccctggg ggtcaatagt caacgaggag acgatgtgac tcaagcgact 301 ccagaaacat tcacagaaga tcctaatctg gtgaatgatc ccgctacaga tgaaacagtt 361 ttggctgttt tggctgatat tgcaccttcc acagatgact tggcctccct cagtgaaaaa 421 aataccactg cagagtgctg ggatgagaaa tttacctgca caaggctcta ctctgtgcat 481 cggccggtta aacaatgcat tcatcagtta tgcttcacca gtttacgacg tatgtacatc 541 gtcaacaagg agatctgctc tcgtcttgtc tgtaaggaac acgaagctat gaaagatgag 601 ctttgccgtc agatggctgg tctgccccct aggagactcc gtcgctccaa ttacttccga 661 cttcctccct gtgaaaatgt ggatttgcag agacccaatg gtctgtgatc attgaaaaag 721 aggaaagaag aaaaaatgta tgggtgagag gaaggaggat ctccttcttc tccaaccatt 781 gacagctaac ccttagacag tatttcttaa accaatcctt ttgcaatgtc cagcttttac 841 ccctactctc tactttttca cccaaactga taacatttat ctcattttct agcacttaaa 901 atacaaagtc t // LOCUS HSU37352 4064 bp mRNA PRI 27-FEB-1996 DEFINITION Human protein phosphatase 2A B'alpha1 regulatory subunit mRNA, complete cds. ACCESSION U37352 NID g1203811 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4064) AUTHORS Tehrani,M.A., Mumby,M.C. and Kamibayashi,C. TITLE Identification of a novel protein phosphatase 2A regulatory subunit highly expressed in muscle JOURNAL J. Biol. Chem. 271 (9), 5164-5170 (1996) MEDLINE 96214950 REFERENCE 2 (bases 1 to 4064) AUTHORS Ahmadian-Tehrani,M., Mumby,M.C. and Kamibayashi,C. TITLE Direct Submission JOURNAL Submitted (29-SEP-1995) Craig Kamibayashi, Pharmacology, UTSWMC, 5323 Harry Hines Blvd., Dallas, TX 75235-9041, USA FEATURES Location/Qualifiers source 1..4064 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="umbilical vein" /cell_type="epithelial cell" CDS 89..1633 /note="Method: conceptual translation supplied by author" /codon_start=1 /product="protein phosphatase 2A B'alpha1 regulatory subunit" /db_xref="PID:g1203812" /translation="MVVDAANSNGPFQPVVLLHIRDVPPADQEKLFIQKLRQCCVLFD FVSDPLSDLKWKEVKRAALSEMVEYITHNRNVITEPIYPEVVHMFAVNMFRTLPPSSN PTGAEFDPEEDEPTLEAAWPHLQLVYEFFLRFLESPDFQPNIAKKYIDQKFVLQLLEL FDSEDPRERDFLKTTLHRIYGKFLGLRAYIRKQINNIFYRFIYETEHHNGIAELLEIL GSIINGFALPLKEEHKIFLLKVLLPLHKVKSLSVYHPQLAYCVVQFLEKDSTLTEPVV MALLKYWPKTHSPKEVMFLNELEEILDVIEPSEFVKIMEPLFRQLAKCVSSPHFQVAE RALYYWNNEYIMSLISDNAAKILPIMFPSLYRNSKTHWNKTIHGLIYNALKLFMEMNQ KLFDDCTQQFKAEKLKEKLKMKEREEAWVKIENLAKANPQYTVYSQASTMSIPVAMET DGPLFEDVQMLRKTVKDEAHQAQKDPKKDRPLALRKSELPQDPHTKKALEAHCRADEL ASQDGR" BASE COUNT 1152 a 848 c 856 g 1208 t ORIGIN 1 ctgcggccgc ctggtttctt gccttaagga gcccattgcc tttcccgctg aagtctagat 61 gttgacatgt aataaagcgg gcagcaggat ggtggtggat gcggccaact ccaatgggcc 121 tttccagccc gtggtccttc tccatattcg agatgttcct cctgctgatc aagagaagct 181 ttttatccag aagttacgtc agtgttgcgt cctctttgac tttgtttctg atccactaag 241 tgacctaaag tggaaggaag taaaacgagc tgctttaagt gaaatggtag aatatatcac 301 ccataatcgg aatgtgatca cagagcctat ttacccagaa gtagtccata tgtttgcagt 361 taacatgttt cgaacattac caccttcctc caatcctacg ggagcggaat ttgacccgga 421 ggaagatgaa ccaacgttag aagcagcctg gcctcatcta cagcttgttt atgaattttt 481 cttaagattt ttagagtctc cagatttcca acctaatata gcgaagaaat atattgatca 541 gaagtttgta ttgcagcttt tagagctctt tgacagtgaa gatcctcggg agagagattt 601 tcttaaaacc acccttcaca gaatctatgg gaaattccta ggcttgagag cttacatcag 661 aaaacagata aataatatat tttataggtt tatttatgaa acagagcatc ataatggcat 721 agcagagtta ctggaaatat tgggaagtat aattaatgga tttgccttac cactaaaaga 781 agagcacaag attttcttat tgaaggtgtt actacctttg cacaaagtga aatctctgag 841 tgtctaccat ccccagctgg catactgtgt agtgcagttt ttagaaaagg acagcaccct 901 cacggaacca gtggtgatgg cacttctcaa atactggcca aagactcaca gtccaaaaga 961 agtaatgttc ttaaacgaat tagaagagat tttagatgtc attgaaccat cagaatttgt 1021 gaagatcatg gaacccctct tccggcagtt ggccaaatgt gtctccagcc cacacttcca 1081 ggtggcagag cgagctctct attactggaa taatgaatac atcatgagtt taatcagtga 1141 caacgcagcg aagattctgc ccatcatgtt tccttccttg taccgcaact caaagaccca 1201 ttggaacaag acaatacatg gcttgatata caacgccctg aagctcttca tggagatgaa 1261 ccaaaagcta tttgatgact gtacacaaca gttcaaagca gagaaactaa aagagaagct 1321 aaaaatgaaa gaacgggaag aagcatgggt taaaatagaa aatctagcca aagccaatcc 1381 ccagtacaca gtgtatagtc aagccagcac catgagcatt ccggttgcaa tggagacaga 1441 tgggccttta tttgaagatg tgcagatgct gagaaagaca gtgaaggacg aggctcatca 1501 ggcacagaaa gatccgaaga aggaccgtcc tcttgcactc cgcaagtccg agctgcctca 1561 ggacccccac accaagaaag ccttggaagc tcactgcagg gccgatgagc tggcctccca 1621 ggacggccgc tagcctccgg ggcgccgcgt cggggccggg cccgccagtt cttttccgga 1681 ttctgtagaa aatacatact tcctgtgcca taccaatcag ttacactcaa agctttcttg 1741 gaccccgttc cgtaggcaat aacgtgcgtc cgcctcagcg cgagattagg agttcaaaca 1801 atggtgactt cccagagccc gctggcagag ccgcgggttg acgacggtgt cctcgcagtg 1861 tcgccgccac cccagcgtag tccaagtcag actatttcac aaagtcagag cgataggaaa 1921 gcaccctgcc cttcatcttc atgttctccc aaatggaact taggatcttt taacataggt 1981 ggttctgtga taacatcagt gttttccaaa tcaaaggaac gctttaaaaa ataggaccta 2041 ttttttaaga ctttacagcc tttgaaatgg tttccacgtg attgttacgc cagcagttct 2101 tttgtttgtt tttcaatctc agtgaaatgg ctctttgctt tcgagttctc acgcaacgta 2161 ctgggcaaat gacaatcctc agccgctggt attttctaag gggtctcttc actttgatga 2221 gtgacatgaa caccgtgtct ccttctcttg tgtgtaccta aagccatatt tccaagtctg 2281 tggtactcca ggattccagg agtaagcctg tagaagagat ttattttaaa agagattgct 2341 ctgaaattta tcttaaaaga gcttgctctg tctaccttga cagaaattgg agttttaaaa 2401 ttatgtgtta atatttttat ttgcagattt cgtttccgtc aacttaaaca ttgttgccct 2461 tcaacaaggc tcttgaatta ataaaattat agtctctaag aattccacat tttatggaaa 2521 gttagagcaa aatcattttg agttaagcca gttcttagcc taatgcaaac tgcagcgcct 2581 ttaagcataa agtaacacaa cagcattgca cggggccggc actgccgctg ccttcactga 2641 aggctgcagt gctgttctga gagcttggag gaggcaccag cgaggatgac gtttagtgga 2701 gctctttctg ttgaaaagag ctcacgttat caacaccttg taaggaaaat acagtgtctg 2761 agttttcatc ggtcttcaca tgctgctata tattccacag agttccttgc atgtactgag 2821 cttttgtttt agatggaata gcacaaggag aaaaatcttt aaacttagtg ctttgtctat 2881 tctttatttc tctcagggtg gccagtattt tgacttattt atcctgcttg aaagctactt 2941 gagatgtgta ctgctattct aaacacgtga tctagtttct ttcatctctg gcataagatt 3001 atataactta atgttaagtg tcttgaggca taaaagacaa aatgtggctt attttaggat 3061 ctgttttttc atcgaggtct cgggtatcct ttcaaagata gtgagaagca gacactgctc 3121 cttgtgcagc tctggtacct cctgcccact gctgtcactt caagccactg gcaatgcttc 3181 tgtcctcgtg tcttggagga aaatcacctg gggggagggg acttcttgtg gtaagagcaa 3241 gtgcaggtat gaaatgcgaa gattgcccca gctaaaagtg gacaagtccg ctttgtgaga 3301 tgaatacttc ctgagaaact tgacaagtat ctctccattt taccattatg aaaactatca 3361 ttaaaaaaaa cagtttagat gccttctcct tttgagggaa aaagggtgct ttttattgta 3421 taaagcagcg tcttatgtat tttgatatac cattgtttga acttccgtct ttagctgata 3481 gattctcaaa tatccttgat tttggatgtt cagtatgttt gtgagagagg tttctgggaa 3541 gactctcttt ttgccctcgg gaaaaagcaa aatatcaatg tttgggtgac tgtgtaaagc 3601 tcagtgtgta agaacatctt tttgtctagg ttttctttct gctctttatt gaagacaaac 3661 actcaccaaa aagaaaaata aaagttttca gagaaactaa ttttctttgg caagagtatt 3721 acttaatatt ttggcctcct aaagtttccc tagttagtac tcggactcct gtgctaattg 3781 tcagcttaca tatcattgta tagagactgt ttattctgta ccaaactgat ttcaaaagta 3841 ctacattgaa aataaaccgg tgactgtttt tcttcataaa gttctgcgtt tggcatcttc 3901 actctttcca aaatgtatct gtacatcaga aatgtcacta ttccaagtgt ctttttagtg 3961 tggcctttag tatggcttcc ttttaatatt gtacatacat tgtatctttg ttttatggta 4021 ataagtaata aaaatgtaga cttcaaaaaa aaaagcggcc gcag // LOCUS HSU37359 2464 bp mRNA PRI 02-NOV-1995 DEFINITION Human MRE11 homolog hMRE11 mRNA, complete cds. ACCESSION U37359 NID g1049319 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2464) AUTHORS Petrini,J.H., Walsh,M.E., DiMare,C., Chen,X.N., Korenberg,J.R. and Weaver,D.T. TITLE Isolation and characterization of the human MRE11 homologue JOURNAL Genomics 29 (1), 80-86 (1995) MEDLINE 96079094 REFERENCE 2 (bases 1 to 2464) AUTHORS Petrini,J.H.J., Walsh,M.E., DiMare,C., Chen,X-N., Korenberg,J.R. and Weaver,D.T. TITLE Direct Submission JOURNAL Submitted (29-SEP-1995) John H. J. Petrini, Medical Genetics, University of Wisconsin, 425 Henry Mall, Madison, WI 53706, USA FEATURES Location/Qualifiers source 1..2464 /organism="Homo sapiens" /db_xref="taxon:9606" /map="11q14-q22" /chromosome="11" CDS 173..2299 /codon_start=1 /function="DNA repair and recombination protein" /product="MRE11 homologue hMre11" /db_xref="PID:g1049320" /translation="MSTADALDDENTFKILVATDIHLGFMEKDAARGNDTFVTLDEIL RLAQENEVDFILLGGDLFHENKPSRKTLHTCLELLRKYCMGDRPVQFEILSDQSVNFG FSKFPWVNYQDGNLNISIPVFSIHGNHDDPTGADALCALDILSCAGFVNHFGRSMSVE KIDISPVLLQKGSTKIALYGLGSIPDERLYRMFVNKKVTMLRPKEDENSWFNLFVIHQ NRSKHGSTNFIPEQFLDDFIDLVIWGHEHECKIAPTKNEQQLFYISQPGSSVVTSLSP GEAVKKHVGLLRIKGRKMNMHKIPLHTVRQFFMEDIVLANHPDIFNPDNPKVTQAIQS FCLEKIEEMLENAERERLGNSHQPEKPLVRLRVDYSGGFEPFSVLRFSQKFVDRVANP KDIIHFFRHREQKEKTGEEINFGKLITKPSEGTTLRVEDLVKQYFQTAEKNVQLSLLT ERGMGEAVQEFVDKEEKDAIEELVKYQLEKTQRFLKERHIDALEDKIDEEVRRFRETR QKNTNEEDDEVREAMTRARALRSQSEESASAFSADDLMSIDLAEQMANDSDDSISAAT NKGRGRGRGRRGGRGQNSASRGGSQRGRAFKSTRQQPSRNVTTKNYSEVIEVDESDVE EDIFPTTSKTDQRWSSTSSSKIMSQSQVSKGVDFESSEDDDDDPFMNTSSLRRNRRLI YLLALRNMQDTGKMKCYKLRVYSLRF" polyA_site 2464 /note="54 A nucleotides" BASE COUNT 820 a 415 c 559 g 670 t ORIGIN 1 gaattcgggc cgaaaagaag acagccttgg gtcgcgattg tggggcttcg aagagtccag 61 cagtgggaat ttctagaatt tggaatcgag tgcattttct gacatttgag tacagtaccc 121 aggggttctt ggagaagaac ctggtcccag aggagcttga ctgaccataa aaatgagtac 181 tgcagatgca cttgatgatg aaaacacatt taaaatatta gttgcaacag atattcatct 241 tggatttatg gagaaagatg cagccagagg aaatgatacg tttgtaacac tcgatgaaat 301 tttaagactt gcccaggaaa atgaagtgga ttttattttg ttaggtggtg atctttttca 361 tgaaaataag ccctcaagga aaacattaca tacctgcctc gagttattaa gaaaatattg 421 tatgggtgat cggcctgtcc agtttgaaat tctcagtgat cagtcagtca actttggttt 481 tagtaagttt ccatgggtga actatcaaga tggcaacctc aacatttcaa ttccagtgtt 541 tagtattcat ggcaatcatg acgatcccac aggggcagat gcactttgtg ccttggacat 601 tttaagttgt gctggatttg taaatcactt tggacgttca atgtctgtgg agaagataga 661 cattagtccg gttttgcttc aaaaaggaag cacaaagatt gcgctatatg gtttaggatc 721 cattccagat gaaaggctct atcgaatgtt tgtcaataaa aaagtaacaa tgttgagacc 781 aaaggaagat gagaactctt ggtttaactt atttgtgatt catcagaaca ggagtaaaca 841 tggaagtact aacttcattc cagaacaatt tttggatgac ttcattgatc ttgttatctg 901 gggccatgaa catgagtgta aaatagctcc aaccaaaaat gaacaacagc tgttttatat 961 ctcacaacct ggaagctcag tggttacttc tctttcccca ggagaagctg taaagaaaca 1021 tgttggtttg ctgcgtatta aagggaggaa gatgaatatg cataaaattc ctcttcacac 1081 agtgcggcag tttttcatgg aggatattgt tctagctaat catccagaca tttttaaccc 1141 agataatcct aaagtaaccc aagccataca aagcttctgt ttggagaaga ttgaagaaat 1201 gcttgaaaat gctgaacggg aacgtctggg taattctcac cagccagaga agcctcttgt 1261 acgactgcga gtggactata gtggaggttt tgaacctttc agtgttcttc gctttagcca 1321 gaaatttgtg gatcgggtag ctaatccaaa agacattatc cattttttca ggcatagaga 1381 acaaaaggaa aaaacaggag aagagatcaa ctttgggaaa cttatcacaa agccttcaga 1441 aggaacaact ttaagggtag aagatcttgt aaaacagtac tttcaaaccg cagagaagaa 1501 tgtgcagctc tcactgctaa cagaaagagg gatgggtgaa gcagtacaag aatttgtgga 1561 caaggaggag aaagatgcca ttgaggaatt agtgaaatac cagttggaaa aaacacagcg 1621 atttcttaaa gaacgtcata ttgatgccct cgaagacaaa atcgatgagg aggtacgtcg 1681 tttcagagaa accagacaaa aaaatactaa tgaagaagat gatgaagtcc gtgaggctat 1741 gaccagggcc agagcactca gatctcagtc agaggagtct gcttctgcct ttagtgctga 1801 tgaccttatg agtatagatt tagcagaaca gatggctaat gactctgatg atagcatctc 1861 agcagcaacc aacaaaggaa gaggccgagg aagaggtcga agaggtggaa gagggcagaa 1921 ttcagcatcg agaggagggt ctcaaagagg aagagccttt aaatctacaa gacagcagcc 1981 ttcccgaaat gtcactacta agaattattc agaggtgatt gaggtagatg aatcagatgt 2041 ggaagaagac atttttccta ccacttcaaa gacagatcaa aggtggtcca gcacatcatc 2101 cagcaaaatc atgtcccaga gtcaagtatc gaaaggggtt gattttgaat caagtgagga 2161 tgatgatgat gatcctttta tgaacactag ttctttaaga agaaatagaa gattaatata 2221 tttactggca ctgagaaaca tgcaagatac aggaaaaatg aaatgttaca agctaagagt 2281 ttacagttta agattttaag tattgtttcc tgagcataac tccataagta agaaatttct 2341 agttcacaga catacaatag cattgattca ccttgttttt ttaacctggt tgttgtagta 2401 agagctttgt ttcaatatca ctcttgagta aagattaaaa taaagctacc attttacatt 2461 tcta // LOCUS HSU37408 2103 bp mRNA PRI 15-NOV-1995 DEFINITION Human CtBP mRNA, complete cds. ACCESSION U37408 NID g1063637 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 47 to 1363) AUTHORS Schaeper,U., Boyd,J.M., Verma,S., Uhlmann,E., Subramanian,T. and Chinnadurai,G. TITLE Molecular cloning and characterization of a cellular phosphoprotein that interacts with a conserved C-terminal domain of adenovirus E1A involved in negative modulation of oncogenic transformation JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92, 10667-10671 (1995) REFERENCE 2 (sites) AUTHORS Boyd,J.M., Subramanian,T., Schaeper,U., La Regina,M., Bayley,S. and Chinnadurai,G. TITLE A region in the C-terminus of adenovirus 2/5 E1a protein is required for association with a cellular phosphoprotein and important for the negative modulation of T24-ras mediated transformation, tumorigenesis and metastasis JOURNAL EMBO J. 12 (2), 469-478 (1993) MEDLINE 93178421 REFERENCE 3 (bases 1 to 2103) AUTHORS Chinnadurai,G. TITLE Direct Submission JOURNAL Submitted (29-SEP-1995) G. Chinnadurai, Institute for Molecular Virology, St. Louis University Health Sciences Center, 3681 Park Avenue, St. Louis, MO 63110, USA FEATURES Location/Qualifiers source 1..2103 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="B-cell library of S. Elledge" CDS 47..1366 /citation=[1] /citation=[2] /codon_start=1 /function="Interacts with C-terminus of Adenovirus E1A" /evidence=experimental /product="CtBP" /db_xref="PID:g1063638" /translation="MGSSHLLNKGLPLGVRPPIMNGPLHPRPLVALLDGRDCTVEMPI LKDVATVAFCDAQSTQEIHEKVLNEAVGALMYHTITLTREDLEKFKALRIIVRIGSGF DNIDIKSAGDLGIAVCNVPAASVEETADSTLCHILNLYRRATGCTRRCGRAHESRASS RSARWRPRCQDPRGDLGHHRTWSRGAGSGAAGQRVGFNVLFYDPYLSDGVERALGLQR VSTLQDLLFHSDCVTLHCGLNEHNHHLINDFTVKQMRQGAFLVNTARGGLVDEKALAQ ALKEGRIRGAALDVHESEPFSFSQGPLKDAPNLICTPHAAWYSEQASIEMREEAAREI RRAITGRIPDSLKNCVNKDHLTAATHWASMDPAVVHPELNGAAYRYPPGVVGVAPTGI PAAVEGIVPSAMSLSHGLPPVAHPPHAPSPGQTVKPEADRDHASDQL" BASE COUNT 402 a 635 c 636 g 430 t ORIGIN 1 gcgcaggccg ccgagggtcg gggcccgcgc cggctcgcgc ctctcgatgg gcagctcgca 61 cttgctcaac aagggcctgc cgcttggcgt ccgacctccg atcatgaacg ggcccctgca 121 cccgcggccc ctggtggcat tgctggatgg ccgggactgc acagtggaga tgcccatcct 181 gaaggacgtg gccactgtgg ccttctgcga cgcgcagtcc acgcaggaga tccatgagaa 241 ggtcctgaac gaggctgtgg gggccctgat gtaccacacc atcactctca ccagggagga 301 cctggagaag ttcaaagccc tccgcatcat cgtccggatt ggcagtggtt ttgacaacat 361 cgacatcaag tcggccgggg atttaggcat tgccgtctgc aacgtgcccg cggcgtctgt 421 ggaggagacg gccgactcga cgctgtgcca catcctgaac ctgtaccggc gggccactgg 481 ctgcaccagg cgctgcggga gggcacacga gtccagagcg tcgagcagat ccgcgaggtg 541 gcgtccgcgc tgccaggatc cgcggggaga ccttgggcat catcggactt ggtcgcgtgg 601 ggcaggcagt ggcgctgcgg gccaacgtgt cggcttcaac gtgctcttct acgaccctta 661 cttgtcggat ggcgtggagc gggcgctggg gctgcagcgt gtcagcaccc tgcaggacct 721 gctcttccac agcgactgcg tgaccctgca ctgcggcctc aacgagcaca accaccacct 781 catcaacgac ttcaccgtca agcagatgag acaaggggcc ttcctggtga acacagcccg 841 gggtggcctg gtggatgaga aggcgctggc ccaggccctg aaggagggcc ggatccgcgg 901 cgcggccctg gatgtgcacg agtcggaacc cttcagcttt agccagggcc ctctgaagga 961 tgcacccaac ctcatctgca ccccccatgc tgcatggtac agcgagcagg catccatcga 1021 gatgcgagag gaggcggcac gggagatccg cagagccatc acaggccgga tcccagacag 1081 cctgaagaac tgtgtcaaca aggaccatct gacagccgcc acccactggg ccagcatgga 1141 ccccgccgtc gtgcaccctg agctcaatgg ggctgcctat aggtaccctc cgggcgtggt 1201 gggcgtggcc cccactggca tcccagctgc tgtggaaggt atcgtcccca gcgccatgtc 1261 cctgtcccac ggcctgcccc ctgtggccca cccgccccac gccccttctc ctggccaaac 1321 cgtcaagccc gaggcggata gagaccacgc cagtgaccag ttgtagcccg ggaggagctc 1381 tccagcctcg gcgcctgggg cagcgggccc ggaaaccctc gaccagagtg tgtgagagca 1441 tgtgtgtggt ggcccctggc actgcagaga ctggtccggg ctgtcaggag ggcgggaggg 1501 cgcagcgctg ggcctcgtgt cgcttgtcgt ccgtcctgtg ggcgctctgc cctgtgtcct 1561 tcgcgttcct cgttaagcag aagaagtcag tagttattct cccatgaacg ttcttgtctg 1621 tgtacagttt ttagaacatt acaaaggatc tgtttgctta gctgtcaaca aaaagaaaac 1681 ctgaaggagc atttggaagt caatttgagg tttttttttt tggttttttt ttttttgtat 1741 tttggaacgt gccccagaat gaggcagttg gcaaacttct caggacaatg aatcttcccg 1801 tttttctttt tatgccacac agtgcattgt tttttctacc tgcttgtctt atttttagca 1861 taatttagaa aaacaaaaca aaggctgttt ttcctaattt tggcatgaac ccccccttgt 1921 tccaaaatga agacggcatc atcacgaagc agctccaaaa ggaaaagctt ggcaggtgcc 1981 ctcgtcctgg ggacgtggag ggtggcacgg tccccgcctg caccagtgcc gtcctgctga 2041 tgtggtaggc tagcaatatt ttggttaaaa tcatgtttgt ggccgaacgg gcccctgcac 2101 ccg // LOCUS HSU37436 1776 bp mRNA PRI 17-APR-1996 DEFINITION Human AICAR formyltransferase/IMP cyclohydrolase bifunctional enzyme (purH) mRNA, complete cds. ACCESSION U37436 NID g1263195 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1776) AUTHORS Rayl,E.A., Moroson,B.A. and Beardsley,G.P. TITLE The human purH gene product, 5-aminoimidazole-4-carboxamide ribonucleotide formyltransferase/IMP cyclohydrolase. Cloning, sequencing, expression, purification, kinetic analysis, and domain mapping JOURNAL J. Biol. Chem. 271 (4), 2225-2233 (1996) MEDLINE 96147205 REFERENCE 2 (bases 1 to 1776) AUTHORS Rayl,E.A., Moroson,B.A. and Beardsley,G.P. TITLE Direct Submission JOURNAL Submitted (02-OCT-1995) Elizabeth A. Rayl, Pediatrics/LMP 3094, Yale University, 333 Cedar Street, New Haven, CT 06510, USA FEATURES Location/Qualifiers source 1..1776 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="hepatoma" gene 1..1776 /gene="purH" CDS 1..1776 /gene="purH" /codon_start=1 /product="AICAR formyltransferase/IMP cyclohydrolase bifunctional enzyme" /db_xref="PID:g1263196" /translation="MSSLSALFSVSDKTGLVEFARNLTALGLNLVASGGTAKALRDAG LAVRDVSELTGFPEMLGGRVKTLHPAVHAGILARNIPEDNADMARLDFNLIRVVACNL YPFVKTVASPGVTVEEAVEQIDIGGVTLLRAAAKNHARVTVVCEPEDYVVVSTEMQSS ESKGTSLETRRQLALKAFTHTAQYDEAISDYFRKQYSKGVSQMPLRYGMNPHQTPAQL YTLQPKLPITVLNGAPGFINLCDALNAWQLVKELKEALGIPAAASFKHVSPAGAAVGI PLSEDEAKVCMVYDLYKTLTPISAAYARARGADRMSSFGDFVALSDVCDVPTAKIISR EVSDGIIAPGYEEEALTILSKKKNGNYCVLQMDQSYKPDENEVRTLFGLHLSQKRNNG VVDKSLFSNVVTKNKDLPESALRDLIVATIAVKYTQSNSVCYAKNGQVIGIGAGQQSR IHCTRLAGDKANYWWLRHHPQVLSMKFKTGVKRAEISNAIDQYVTGTIGEDEDLIKWK ALFEEVPELLTEAEKKEWVEKLTEVSISSDAFFPFRDNVDRAKRSGVAYIAAPSGSAA DKVVIEACDELGIILAHTNLRLFHH" BASE COUNT 487 a 394 c 437 g 458 t ORIGIN 1 atgtcttctc tctcagcctt atttagtgtc tctgacaaaa ccggccttgt ggaatttgca 61 agaaacctga ccgctcttgg tttgaacctg gtcgcttccg gagggactgc aaaagctctc 121 agggatgctg gtctggcagt cagagatgtc tctgagttga cgggatttcc tgaaatgttg 181 gggggacgtg tgaaaacttt gcatcctgca gtccatgctg gaatcctagc tcgtaatatt 241 ccagaagata atgctgacat ggccagactt gatttcaatc ttataagagt tgtcgcctgc 301 aatctctatc cctttgtaaa gacagtggct tctccaggtg taactgttga ggaggctgtg 361 gagcaaattg acattggtgg agtaacctta ctgagagctg cagccaaaaa ccacgctcga 421 gtgacagtgg tgtgtgaacc agaggactat gtggtggtgt ccacggagat gcagagctcc 481 gagagtaagg gcacctcctt ggagactaga cgccagttag ccttgaaggc attcactcat 541 acggcacaat atgatgaagc aatttcagat tatttcagga aacagtacag caaaggcgta 601 tctcagatgc ccttgagata tggaatgaac ccacatcaga cccctgccca gctgtacaca 661 ctgcagccca agcttcccat cacagttcta aatggagccc ctggatttat aaacttgtgc 721 gatgctttga acgcctggca gctggtgaag gaactcaagg aggctttagg tattccagcc 781 gctgcctctt tcaaacatgt cagcccagca ggtgctgctg ttggaattcc actcagtgaa 841 gatgaggcca aagtctgcat ggtttatgat ctctataaaa ccctcacacc catctcagcg 901 gcatatgcaa gagcaagagg ggctgatagg atgtcttcat ttggtgattt tgttgcattg 961 tctgatgttt gtgatgtacc aactgcaaaa attatttcca gagaagtatc tgatggtata 1021 attgccccag gatatgaaga agaagccttg acaatacttt ccaaaaagaa aaatggaaac 1081 tattgtgtcc ttcagatgga ccaatcttac aaaccagatg aaaatgaagt tcgaactctc 1141 tttggtcttc atttaagcca gaagagaaat aatggtgtcg tcgacaagtc attatttagc 1201 aatgttgtta ccaaaaataa agatttgcca gagtctgccc tccgagacct catcgtagcc 1261 accattgctg tcaagtacac tcagtctaac tctgtgtgct acgccaagaa cgggcaggtt 1321 atcggcattg gagcaggaca gcagtctcgt atacactgca ctcgccttgc aggagataag 1381 gcaaactatt ggtggcttag acaccatcca caagtgcttt cgatgaagtt taaaacagga 1441 gtgaagagag cagaaatctc caatgccatc gatcaatatg tgactggaac cattggcgag 1501 gatgaagatt tgataaagtg gaaggcactg tttgaggaag tccctgagtt actcactgag 1561 gcagagaaga aggaatgggt tgagaaactg actgaagttt ctatcagctc tgatgccttc 1621 ttccctttcc gagataacgt agacagagct aaaaggagtg gtgtggcgta cattgcggct 1681 ccctccggtt ctgctgctga caaagttgtg attgaggcct gcgacgaact gggaatcatc 1741 ctcgctcata cgaaccttcg gctcttccac cactga // LOCUS HSU37448 2309 bp mRNA PRI 14-DEC-1995 DEFINITION Human Mch3 isoform alpha (Mch3) mRNA, complete cds. ACCESSION U37448 NID g1117846 KEYWORDS Mch3 isoform alpha. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2309) AUTHORS Fernandes-Alnemri,T., Litwack,G. and Alnemri,E.S. TITLE CPP32, a novel human apoptotic protein with homology to Caenorhabditis elegans cell death protein Ced-3 and mammalian interleukin-1 beta-converting enzyme JOURNAL J. Biol. Chem. 269 (49), 30761-30764 (1994) MEDLINE 95074098 REFERENCE 2 (bases 1 to 2309) AUTHORS Fernandes-Alnemri,T., Takahashi,A., Armstrong,R., Krebs,J., Fritz,L., Tomaselli,K.J., Wang,L., Yu,Z., Croce,C.M., Salveson,G., Earnshaw,W.C., Litwack,G. and Alnemri,E.S. TITLE Mch3, a novel human apoptotic cysteine protease highly related to CPP32 JOURNAL Cancer Res. 55 (24), 6045-6052 (1995) MEDLINE 96105019 REFERENCE 3 (bases 1 to 2309) AUTHORS Alnemri,E.S. TITLE Direct Submission JOURNAL Submitted (03-OCT-1995) Emad S. Alnemri, Pharmacology, Thomas Jefferson University, Jefferson Cancer Institute, 233, S. Tenth Street, Philadelphia, PA 19107, USA FEATURES Location/Qualifiers source 1..2309 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="T50-8A.1" /cell_line="Jurkat" /cell_type="T-lymphocyte" gene 44..955 /gene="Mch3" CDS 44..955 /gene="Mch3" /note="new member of ICE-like cysteine protease family" /codon_start=1 /function="cysteine protease" /evidence=experimental /product="Mch3 isoform alpha" /db_xref="PID:g1117847" /translation="MADDQGCIEEQGVEDSANEDSVDAKPDRSSFVPSLFSKKKKNVT MRSIKTTRDRVPTYQYNMNFEKLGKCIIINNKNFDKVTGMGVRNGTDKDAEALFKCFR SLGFDVIVYNDCSCAKMQDLLKKASEEDHTNAACFACILLSHGEENVIYGKDGVTPIK DLTAHFRGDRCKTLLEKPKLFFIQACRGTELDDGIQADSGPINDTDANPRYKIPVEAD FLFAYSTVPGYYSWRSPGRGSWFVQALCSILEEHGKDLEIMQILTRVNDRVARHFESQ SDDPHFHEKKQIPCVVSMLTKELYFSQ" polyA_site 2309 BASE COUNT 680 a 453 c 531 g 645 t ORIGIN 1 gagagactgt gccagtccca gccgccctac cgccgtggga acgatggcag atgatcaggg 61 ctgtattgaa gagcaggggg ttgaggattc agcaaatgaa gattcagtgg atgctaagcc 121 agaccggtcc tcgtttgtac cgtccctctt cagtaagaag aagaaaaatg tcaccatgcg 181 atccatcaag accacccggg accgagtgcc tacatatcag tacaacatga attttgaaaa 241 gctgggcaaa tgcatcataa taaacaacaa gaactttgat aaagtgacag gtatgggcgt 301 tcgaaacgga acagacaaag atgccgaggc gctcttcaag tgcttccgaa gcctgggttt 361 tgacgtgatt gtctataatg actgctcttg tgccaagatg caagatctgc ttaaaaaagc 421 ttctgaagag gaccatacaa atgccgcctg cttcgcctgc atcctcttaa gccatggaga 481 agaaaatgta atttatggga aagatggtgt cacaccaata aaggatttga cagcccactt 541 taggggggat agatgcaaaa cccttttaga gaaacccaaa ctcttcttca ttcaggcttg 601 ccgagggacc gagcttgatg atggcatcca ggccgactcg gggcccatca atgacacaga 661 tgctaatcct cgatacaaga tcccagtgga agctgacttc ctcttcgcct attccacggt 721 tccaggctat tactcgtgga ggagcccagg aagaggctcc tggtttgtgc aagccctctg 781 ctccatcctg gaggagcacg gaaaagacct ggaaatcatg cagatcctca ccagggtgaa 841 tgacagagtt gccaggcact ttgagtctca gtctgatgac ccacacttcc atgagaagaa 901 gcagatcccc tgtgtggtct ccatgctcac caaggaactc tacttcagtc aatagccata 961 tcaggggtac attctagctg agaagcaatg ggtcactcat taatgaatca cattttttta 1021 tgctcttgaa atattcagaa attctccagg attttaattt caggaaaatg tattgattca 1081 acagggaaga aactttctgg tgctgtcttt tgttctctga attttcagag acttttttat 1141 aatgttattc atttggtgac tgtgtaactt tctcttaaga ttaattttct ctttgtatgt 1201 ctgttacctt gttaatagac ttaatacatg caacagaagt gacttctgga gaaagctcat 1261 ggctgtgtcc actgcaattg gtggtaacag tggtagagtc atgtttgcac ttggcaaaaa 1321 gaatcccaat gtttgacaaa acacagccaa ggggatattt actgctcttt attgcagaat 1381 gtgggtattg agtgtgattt gaatgatttt tcattggctt agggcagatt ttcatgcaaa 1441 agttctcata tgagttagag gagaaaaagc ttaatgattc tgatatgtat ccatcaggat 1501 ccagtctgga aaacagaaac cattctaggt gtttcaacag agggagttta atacaggaaa 1561 ttgacttaca tagatgataa aagagaagcc aaacagcaag aagctgttac cacacccagg 1621 gctatgagga taatgggaag aggtttggtt tcctgtgtcc agtagtggga tcatccagag 1681 gagctggaac catggtgggg gctgcctagt gggagttagg accaccaatg gattgtggaa 1741 aatggagcca tgacaagaac aaagccactg actgagatgg agtgagctga gacagataag 1801 agaatacctt gtctcaccta tcctgccctc acatcttcca ccagcacctt actgcccagg 1861 cctatctgga agccacctca ccaaggacct tggaagagca agggacagtg aggcaggaga 1921 agaacaagaa atggatgtaa gcctggccca taatgtgaac ataagtaatc actaatgctc 1981 aacaatttat ccattcaatc atttattcat tgggttgtca gatagtctat gtatgtgtaa 2041 aacaatctgt tttggcttta tgtgcaaaat ctgttatagc tttaaaatat atctggaact 2101 ttttagatta ttccaagcct tattttgagt aaatatttgt tacttttagt tctataagtg 2161 aggaagagtt tatggcaaag atttttggca ctttgttttc aagatggtgt tatcttttga 2221 attcttgata aatgactgtt tttttctgcc taatagtaac tggttaaaaa acaaatgttc 2281 atatttattg attaaaaatg tggttgctt // LOCUS HSU37518 1769 bp mRNA PRI 07-JAN-1996 DEFINITION Human TNF-related apoptosis inducing ligand TRAIL mRNA, complete cds. ACCESSION U37518 NID g1149557 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1769) AUTHORS Wiley,S.R., Schooley,K., Smolak,P., Din,W.S., Huang,C.-P., Nicholl,J.K., Sutherland,G.R., Davis-Smith,T., Rauch,C., Smith,C.A. and Goodwin,R.G. TITLE Identification and characterization of a new member of the TNF family that induces apoptosis JOURNAL Immunity 3 (6), 673-682 (1995) MEDLINE 96111955 REFERENCE 2 (bases 1 to 1769) AUTHORS Wiley,S.R., Schooley,K., Smolak,P., Din,W.S., Huang,C-P., Nicholl,J.K., Sutherland,G.R., Davis-Smith,T., Rauch,C., Smith,C.A. and Goodwin,R.G. TITLE Direct Submission JOURNAL Submitted (03-OCT-1995) Steven R. Wiley, Molecular Biology, Immunex Corporation, 51 University Street, Seattle, WA 98101, USA FEATURES Location/Qualifiers source 1..1769 /organism="Homo sapiens" /db_xref="taxon:9606" /map="3q26" /chromosome="3" CDS 88..933 /codon_start=1 /product="TNF-related apoptosis inducing ligand TRAIL" /db_xref="PID:g1149558" /translation="MAMMEVQGGPSLGQTCVLIVIFTVLLQSLCVAVTYVYFTNELKQ MQDKYSKSGIACFLKEDDSYWDPNDEESMNSPCWQVKWQLRQLVRKMILRTSEETIST VQEKQQNISPLVRERGPQRVAAHITGTRGRSNTLSSPNSKNEKALGRKINSWESSRSG HSFLSNLHLRNGELVIHEKGFYYIYSQTYFRFQEEIKENTKNDKQMVQYIYKYTSYPD PILLMKSARNSCWSKDAEYGLYSIYQGGIFELKENDRIFVSVTNEHLIDMDHEASFFG AFLVG" BASE COUNT 611 a 361 c 375 g 422 t ORIGIN 1 cctcactgac tataaaagaa tagagaagga agggcttcag tgaccggctg cctggctgac 61 ttacagcagt cagactctga caggatcatg gctatgatgg aggtccaggg gggacccagc 121 ctgggacaga cctgcgtgct gatcgtgatc ttcacagtgc tcctgcagtc tctctgtgtg 181 gctgtaactt acgtgtactt taccaacgag ctgaagcaga tgcaggacaa gtactccaaa 241 agtggcattg cttgtttctt aaaagaagat gacagttatt gggaccccaa tgacgaagag 301 agtatgaaca gcccctgctg gcaagtcaag tggcaactcc gtcagctcgt tagaaagatg 361 attttgagaa cctctgagga aaccatttct acagttcaag aaaagcaaca aaatatttct 421 cccctagtga gagaaagagg tcctcagaga gtagcagctc acataactgg gaccagagga 481 agaagcaaca cattgtcttc tccaaactcc aagaatgaaa aggctctggg ccgcaaaata 541 aactcctggg aatcatcaag gagtgggcat tcattcctga gcaacttgca cttgaggaat 601 ggtgaactgg tcatccatga aaaagggttt tactacatct attcccaaac atactttcga 661 tttcaggagg aaataaaaga aaacacaaag aacgacaaac aaatggtcca atatatttac 721 aaatacacaa gttatcctga ccctatattg ttgatgaaaa gtgctagaaa tagttgttgg 781 tctaaagatg cagaatatgg actctattcc atctatcaag ggggaatatt tgagcttaag 841 gaaaatgaca gaatttttgt ttctgtaaca aatgagcact tgatagacat ggaccatgaa 901 gccagttttt tcggggcctt tttagttggc taactgacct ggaaagaaaa agcaataacc 961 tcaaagtgac tattcagttt tcaggatgat acactatgaa gatgtttcaa aaaatctgac 1021 caaaacaaac aaacagaaaa cagaaaacaa aaaaacctct atgcaatctg agtagagcag 1081 ccacaaccaa aaaattctac aacacacact gttctgaaag tgactcactt atcccaagaa 1141 aatgaaattg ctgaaagatc tttcaggact ctacctcata tcagtttgct agcagaaatc 1201 tagaagactg tcagcttcca aacattaatg caatggttaa catcttctgt ctttataatc 1261 tactccttgt aaagactgta gaagaaagcg caacaatcca tctctcaagt agtgtatcac 1321 agtagtagcc tccaggtttc cttaagggac aacatcctta agtcaaaaga gagaagaggc 1381 accactaaaa gatcgcagtt tgcctggtgc agtggctcac acctgtaatc ccaacatttt 1441 gggaacccaa ggtgggtaga tcacgagatc aagagatcaa gaccatagtg accaacatag 1501 tgaaacccca tctctactga aagtgcaaaa attagctggg tgtgttggca catgcctgta 1561 gtcccagcta cttgagaggc tgaggcagga gaatcgtttg aacccgggag gcagaggttg 1621 cagtgtggtg agatcatgcc actacactcc agcctggcga cagagcgaga cttggtttca 1681 aaaaaaaaaa aaaaaaaaaa cttcagtaag tacgtgttat ttttttcaat aaaattctat 1741 tacagtatgt caaaaaaaaa aaaaaaaaa // LOCUS HSU37529 1102 bp mRNA PRI 12-OCT-1995 DEFINITION Human substance P beta-PPT-A mRNA, complete cds. ACCESSION U37529 NID g1017792 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1102) AUTHORS Tan,A. and Too,H.-P. TITLE Direct Submission JOURNAL Submitted (04-OCT-1995) Aileen Tan, Biochemistry, National University of Singapore, 10 Kent Ridge Crescent, Singapore 119260, Singapore FEATURES Location/Qualifiers source 1..1102 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain cortex" CDS 146..535 /codon_start=1 /product="substance P beta-PPT-A" /db_xref="PID:g1017793" /translation="MKILVALAVFFLVSTQLFAEEIGANDDLNYWSDWYDSDQIKEEL PEPFEHLLQRIARRPKPQQFFGLMGKRDADSSIEKQVALLKALYGHGQISHKRHKTDS FVGLMGKRALNSVAYERSAMQNYERRR" BASE COUNT 350 a 200 c 235 g 317 t ORIGIN 1 gcgccgcaag gcactgagca ggcgaaagag cgcgctcgga cctccttccc ggcggcagct 61 accgagagtg cggagcgacc agcgtgcgct cggaggaacc agagaaactc agcaccccgc 121 gggactgtcc gtcgcaaaat ccaacatgaa aatcctcgtg gccttggcag tcttttttct 181 tgtctccact cagctgtttg cagaagaaat aggagccaat gatgatctga attactggtc 241 cgactggtac gacagcgacc agatcaagga ggaactgccg gagccctttg agcatcttct 301 gcagagaatc gcccggagac ccaagcctca gcagttcttt ggattaatgg gcaaacggga 361 tgctgattcc tcaattgaaa aacaagtggc cctgttaaag gctctttatg gacatggcca 421 gatctctcac aaaagacata aaacagattc ctttgttgga ctaatgggca aaagagcttt 481 aaattctgtg gcttatgaaa ggagtgcaat gcagaattat gaaagaagac gttaataaac 541 tacctaacat tatttattca gcttcatttg tgtcaatggg caatgacagg taaattaaga 601 catgcactat gaggaataat tatttattta ataacaattg tttggggttg aaaattcaaa 661 aagtgtttat ttttcatatt gtgccaatat gtattgtaaa catgtgtttt aattccaata 721 tgatgactcc cttaaaatag aaataagtgg ttatttctca acaaagcaca gtgttaaatg 781 aaattgtaaa acctgtcaat gatacagtcc ctaaagaaaa aaaatcattg ctttgaagca 841 gttgtgtcag ctactgcgga aaaggaagga aactcctgac agtcttgtgc ttttcctatt 901 tgttttcatg gtgaaaatgt actgagattt tggtattaca ctgtatttgt atctctgaag 961 catgtttcat gttttgtgac tatatagaga tgtttttaaa agtttcaatg tgattctaat 1021 gtcttcattt cattgtatga tgtgttgtga tagctaacat tttaaataaa agaaaaaata 1081 tcttgaaaaa aaaaaaaaaa aa // LOCUS HSU37673 3443 bp mRNA PRI 16-OCT-1995 DEFINITION Human neuron-specific vesicle coat protein and cerebellar degeneration antigen (beta-NAP) mRNA, complete cds. ACCESSION U37673 NID g1019901 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3443) AUTHORS Newman,L.S., McKeever,M.O., Okano,H.J. and Darnell,R.B. TITLE Beta-NAP, a cerebellar degeneration antigen, is a neuron-specific vesicle coat protein JOURNAL Cell 82 (5), 773-783 (1995) MEDLINE 95401267 REFERENCE 2 (sites) AUTHORS Darnell,R.B., Furneaux,H.M. and Posner,J.B. TITLE Antiserum from a patient with cerebellar degeneration identifies a novel protein in Purkinje cells, cortical neurons, and neuroectodermal tumors JOURNAL J. Neurosci. 11 (5), 1224-1230 (1991) MEDLINE 91225752 REFERENCE 3 (bases 1 to 3443) AUTHORS Darnell,R.B. TITLE Direct Submission JOURNAL Submitted (04-OCT-1995) Robert B. Darnell, Head, Laboratory of Molecular Neuro-Oncology, Rockefeller University, 1230 York Avenue, New York, NY 10021, USA FEATURES Location/Qualifiers source 1..3443 /organism="Homo sapiens" /note="initial clone: human cerebellar cDNA library (ATCC);Overlapping clones obtained from human cerebellar, hippocampal libraries (Stratagene)" /db_xref="taxon:9606" /tissue_type="cerebellum; hippocampus" CDS 62..3307 /note="similar to beta-adaptin and beta-COP proteins; phosphoprotein, with membrane (including vesicular) and cytoplasmic pools; protein and mRNA expression is entirely neuron-specific; human autoimmune cerebellar degeneration antigen; neuron-specific vesicle coat protein" /codon_start=1 /product="beta-NAP" /db_xref="PID:g1019902" /translation="MSAAPAYSEDKGGSAGPGEPEYGHDPASGGIFSSDYKRHDDLKE MLDTNKDSLKLEAMKRIVAMIARGKNASDLFPAVVKNVACKNIEVKKLVYVYLVRYAE EQQDLALLSISTFQRGLKDPNQLIRASALRVLSSIRVPIIVPIMMLAIKEAASDMSPY VRKTAAHAIPKLYSLDSDQKDQLIEVIEKLLADKTTLVAGSVVMAFEEVCPERIDLIH KNYRKLCNLLIDVEEWGQVVIISMLTRYARTQFLSPTQNESLLEENAEKAFYGSEEDE AKGAGSEETAAAAAPSRKPYVMDPDHRLLLRNTKPLLQSRSAAVVMAVAQLYFHLGPR RKWRHRQGAGALLRSHSEVQYVVLQNVATMSIKRRGMFEPYLKSFYIRSTDPTQIKIL KLEVLTNLANETNIPTVLREFQTYIRSMDKDFVAATIQAIGRCATNIGRVRDTCLNGL VQLLSNRDELVVAESVVVIKKLLQMQPAQHGEIIKHLAKLTDNIQVPMARASILWLIG EYCEHVPRIAPDVLRKMAKSFTAEEDIVKLQVINLAAKLYLTNSKQTKLLTQYVLSLA KYDQNYDIRDRARFTRQLIVPSEQGGALSRHAKKLFLAPKPAPVLESSFKDRDHFQLG SLSHLLNAKATGYQELPDWPEEAPDPSVRNVEVPEWTKCSNREKRKEKEKPFYSDSEG ESGPTESADSDPESESESDSKSSSESGSGESSSESDNEDQDEDEEKGRGSESEQSEED GKRKTKKKVPERKGEASSSDEGSDSSSSSSESEMTSESEEEQLEPASWSRKTPPSSKS APATKEISLLDLEDFTPPSVQPVSPPAIVSTSLAADLEGLTLTDSTLVPSLLSPVSGV GRQELLHRVAGEGLAVDYTFSRQPFSGDPHMVSVHIHFSNSSDTPIKGLHVGTPKLPA GISIQEFPEIESLAPGESATAVMGINFCDSTQAANFQLCTQTRQFYVSIQPPVGELMA PVFMSENEFKKEQGKLMGMNEITEKLMLPDTCRSDHIVVQKVTATANLGRVPCGTSDE YRFAGRTLTGGSLVLLTLDARPAGAAQLTVNSEKMVIGTMLVKDVIQALTQ" misc_feature 1574..1591 /note="encodes WLIGEY; adaptin & COP coat protein consensus sequence" misc_feature 2000..2452 /note="encodes autoimmune epitope (hydrophilic domain)" 3'UTR 3308..3443 BASE COUNT 811 a 1016 c 953 g 663 t ORIGIN 1 cgggctccag cggcctcccg cgccgcaacc tcctcctcgg cgaagtctcc ctggccgccc 61 catgtcggcc gcccccgcct acagcgaaga caagggcggc tccgctggcc ccggggagcc 121 cgagtacggc cacgaccccg cgagcggcgg catcttctcc tccgactaca agcggcatga 181 tgacctgaag gagatgctgg acaccaacaa ggattctctc aagctggagg ccatgaagag 241 gattgtggcg atgattgccc gaggaaagaa tgcttcagac ctgtttcccg cggtggtgaa 301 gaacgtggcc tgtaagaaca tagaggtgaa gaagcttgtc tatgtgtacc tggtacgcta 361 cgctgaggag cagcaagacc tggccctgct gtccatctcc accttccaac gtggcctaaa 421 ggatcccaac cagctgattc gtgccagtgc cctccgtgtc ctctctagca tccgtgtgcc 481 catcatagtg cccatcatga tgctagctat caaggaagcc gcctcggaca tgtcacccta 541 tgtgcggaaa acagctgccc acgccatccc taaactctac agtttggact ctgaccagaa 601 ggatcagctg atagaagtca ttgagaagct tctggctgac aagaccacgc tggtggcggg 661 cagtgtggtg atggcctttg aggaggtctg cccggagcgc atcgacctga ttcacaaaaa 721 ctaccggaaa ctctgtaacc tgctgatcga cgtggaggag tggggccagg tggtcatcat 781 cagcatgctc acccgctacg cccgcacgca gttcctgagc cccacccaga acgaatccct 841 actagaggag aacgcggaaa aagccttcta cggctcagag gaggacgagg ccaagggcgc 901 ggggtctgag gagacggccg ccgcggccgc cccctcccga aagccctatg tcatggaccc 961 cgaccaccgg ctgctgctgc gcaacaccaa acccctgctg cagagccgca gcgccgcggt 1021 ggtgatggcg gtggcgcagc tctacttcca cctgggccca aggcggaagt ggcgtcatcg 1081 ccaaggcgct ggtgcgctgc tgcgcagcca cagtgaggtg cagtacgttg tgctccagaa 1141 cgtggccacc atgtccatca agcgccgggg tatgtttgag ccctacctga agagcttcta 1201 catcaggtcc accgacccca cccagattaa gatcctgaag ctggaagtgc tgaccaacct 1261 ggccaatgag accaacattc ctactgtcct acgggaattc cagacctata ttcgcagcat 1321 ggacaaggac tttgtggcag ccacaatcca ggccattgga cgctgtgcaa ctaacatcgg 1381 ccgagtccgt gacacctgcc tcaatggcct ggtgcagctg ctgtccaacc gtgatgagct 1441 tgtggttgca gagtcagtgg tcgtcattaa gaaattgcta cagatgcagc cagcacaaca 1501 tggagagatc atcaaacact tggcaaagct tacagacaac atccaggtgc ccatggcccg 1561 agccagcatc ctgtggctca tcggagagta ctgtgagcat gtccccagga ttgcacctga 1621 tgtcttaaga aaaatggcca agtcattcac agcagaggag gatattgtca agctgcaggt 1681 catcaacctg gcagccaagc tctacctgac caactctaaa cagaccaagc tgctgaccca 1741 gtatgtgctg agtctggcca aatatgacca gaactatgat attcgcgacc gggcgcgctt 1801 cacccggcag ctcatcgtcc cttccgagca gggtggggcc ctcagccgcc atgccaagaa 1861 gctcttcctg gcacccaaac cagctccagt cttggagtca tccttcaaag accgggacca 1921 cttccagctg ggctcactgt cccacctgct taatgccaag gccacaggct accaggagct 1981 cccagactgg ccggaggaag ccccagaccc atctgtgcgc aacgtggagg tacctgaatg 2041 gaccaagtgc tcaaatcggg agaagagaaa ggagaaggaa aaacccttct actcggactc 2101 tgagggggag tcaggcccca cggagtccgc agacagtgac cctgagtctg agagtgaatc 2161 ggacagtaag agcagcagtg agagcggctc tggggagtcc agcagtgagt ccgacaatga 2221 agaccaggat gaggatgagg agaaagggag aggcagtgag agtgaacaga gtgaggagga 2281 tggtaagagg aagacaaaga agaaggtgcc agagagaaaa ggagaagcgt catcctctga 2341 tgagggcagc gattccagca gtagctcatc agagtccgag atgacatcgg agtccgagga 2401 ggagcagtta gaacctgcct cctggagcag gaaaacacct cccagcagca aaagtgctcc 2461 tgcaaccaag gagatctccc tgcttgatct agaggatttc acccctccca gtgtccagcc 2521 tgtgtctccc ccagcaattg tgtctaccag tctggctgct gacctggagg gcctgacact 2581 cacagactcc accctggtac cgtcgcttct gagtccagta tcgggtgttg ggcggcagga 2641 gctgctgcac cgggtagctg gcgaggggct ggctgtggac tacaccttca gccgccaacc 2701 tttctccggg gatccccaca tggtgtccgt gcacatccac ttctccaaca gctctgatac 2761 ccccatcaag ggcctgcatg tgggcactcc caaactgcct gctggcatca gcatccaaga 2821 atttcccgaa attgagtccc tggcacctgg agaatctgcc actgctgtaa tgggcattaa 2881 tttctgtgac tcaacccagg cagccaactt ccagctgtgc acccagaccc gacagttcta 2941 cgtctccatt cagccacctg ttggggagct gatggcccct gtgttcatga gtgaaaatga 3001 gtttaagaag gaacagggaa agctgatggg catgaatgag atcacagaga aactcatgct 3061 gccagacacc tgtcggagtg accacattgt ggtgcagaaa gtgactgcca ctgccaacct 3121 gggtcgtgtt ccttgtggga catctgatga gtacaggttt gcagggagga cactgactgg 3181 tggaagcctc gttctgctga ccctggatgc ccggccagct ggagctgccc agctgactgt 3241 caacagcgag aaaatggtga ttggcaccat gctggtaaag gatgtgatac aggctctgac 3301 ccagtgactt ccaaatgctg tgacctgttt ggctcccatc tatacctccc catgacacct 3361 aggctgtcag tctctctcat ctttctctct ctctctcatc atcctcctca tgccagatag 3421 cattcagggt gtcctctctc ccc // LOCUS HSU37689 867 bp mRNA PRI 07-MAR-1996 DEFINITION Human RNA polymerase II subunit (hsRPB8) mRNA, complete cds. ACCESSION U37689 NID g1017822 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 867) AUTHORS McKune,K., Moore,P.A., Hull,M.W. and Woychik,N.A. TITLE Six human RNA polymerase subunits functionally substitute for their yeast counterparts JOURNAL Mol. Cell. Biol. 15 (12), 6895-6900 (1995) MEDLINE 96069399 REFERENCE 2 (bases 1 to 867) AUTHORS Woychik,N.A. TITLE Direct Submission JOURNAL Submitted (04-OCT-1995) Nancy A. Woychik, Roche Institute of Molecular Biology, 340 Kingsland St., Nutley, NJ 07110, USA FEATURES Location/Qualifiers source 1..867 /organism="Homo sapiens" /db_xref="taxon:9606" gene 166..618 /gene="hsRPB8" CDS 166..618 /gene="hsRPB8" /codon_start=1 /product="RNA polymerase II subunit" /db_xref="PID:g1017823" /translation="MAGILFEDIFDVKDIDPEAKKFDRVSRLHCESESFKMDLILDVN IQIYPVDLGDKFRLVIASTLYEDGTLDDGEYNPTDDRPSRADQFEYVMYGKVYRIEGD ETSTEAATRLSAYVSYGGLLMRLQGDANNLHGFEVDSRVYLLMKKLAF" polyA_site 867 BASE COUNT 201 a 212 c 232 g 222 t ORIGIN 1 ccctcactaa agggacaaaa gctggagctc caccgcggtg cggccgctct agaactagtg 61 gatcccccgg gctgcaggtc cggtcttgtc cacgctaggg ggtgcacgta ctcccaactg 121 tggtcgcgct ctcacccctt ctgctgctct cgtggccccc tcgcgatggc gggcatcctg 181 tttgaggata ttttcgatgt gaaggatatt gacccggagg ccaagaagtt tgaccgagtg 241 tctcgactgc attgtgagag tgaatctttc aagatggatc taatcttaga tgtaaacatt 301 caaatttacc ctgtagactt gggtgacaag tttcggttgg tcatagctag taccttgtat 361 gaagatggta ccctggatga tggtgaatac aaccccactg atgataggcc ttccagggct 421 gaccagtttg agtatgtaat gtatggaaaa gtgtacagga ttgagggaga tgaaacttct 481 actgaagcag caacacgcct ctctgcgtac gtgtcctatg ggggcctgct catgaggctg 541 cagggggatg ccaacaacct gcatggattc gaggtggact ccagagttta tctcctgatg 601 aagaagctag ccttctgaac ctcgcctgaa gccagcctct ctgccaagtc actcaggtca 661 tgggcattgt tcaagcctga gtggcagccg ctcttgctca cctgttgagg aagggctggc 721 tcactgtcca ccgtggcggc atctttaact ggcctccact caatgggaaa ctgactcgcc 781 tgtgaaagac acagtgggag agctgaaaat gaatcagaag ctttatgtat atgattttta 841 aattaaactt tactttttca gactgcc // LOCUS HSU37690 384 bp mRNA PRI 07-MAR-1996 DEFINITION Human RNA polymerase II subunit (hsRPB10) mRNA, complete cds. ACCESSION U37690 NID g1017824 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 384) AUTHORS McKune,K., Moore,P.A., Hull,M.W. and Woychik,N.A. TITLE Six human RNA polymerase subunits functionally substitute for their yeast counterparts JOURNAL Mol. Cell. Biol. 15 (12), 6895-6900 (1995) MEDLINE 96069399 REFERENCE 2 (bases 1 to 384) AUTHORS Woychik,N.A. TITLE Direct Submission JOURNAL Submitted (04-OCT-1995) Nancy A. Woychik, Roche Institute of Molecular Biology, 340 Kingsland St., Nutley, NJ 07110, USA FEATURES Location/Qualifiers source 1..384 /organism="Homo sapiens" /db_xref="taxon:9606" gene 14..217 /gene="hsRPB10" CDS 14..217 /gene="hsRPB10" /codon_start=1 /product="RNA polymerase II subunit" /db_xref="PID:g1017825" /translation="MIIPVRCFTCGKIVGNKWEAYLGLLQAEYTEGDALDALGLKRYC CRRMLLAHVDLIEKLLNYAPLEK" polyA_site 384 BASE COUNT 77 a 122 c 114 g 71 t ORIGIN 1 acgagccgcc gccatgatca tccctgtacg ctgcttcact tgtggcaaga tcgtcggcaa 61 caagtgggag gcttacctgg ggctgctgca ggccgagtac accgaggggg acgcgctgga 121 tgccctgggc ctgaagcgct actgctgccg ccggatgctg ctggcccacg tggacctgat 181 cgagaagctg ctcaattatg cacccctgga gaagtgacca cgctgaaacc cacccacccg 241 ctgtgctgac catgggccct gagcgtccta ccccgaattc acgaggctga ggcatccggg 301 agctggcgta atgcctggcc gcagtgtgtg tgtatcccat accccactct ggaaggaacc 361 atccagtaaa ggtctttcag aacc // LOCUS HSU37707 3012 bp mRNA PRI 29-NOV-1996 DEFINITION Human dlg3 mRNA, complete cds. ACCESSION U37707 NID g1022812 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3012) AUTHORS Smith,S.A., Holik,P., Stevens,J., Mazoyer,S., Melis,R., Williams,B., White,R. and Albertsen,H. TITLE Isolation of a gene (DLG3) encoding a second member of the discs-large family on chromosome 17q12-q21 JOURNAL Genomics 31 (2), 145-150 (1996) MEDLINE 96422178 REFERENCE 2 (bases 1 to 3012) AUTHORS Albertsen,H. TITLE Direct Submission JOURNAL Submitted (05-OCT-1995) Hans Albertsen, Eccles Institute of Human Genetics, University of Utah, Salt Lake City, UT 84112, USA FEATURES Location/Qualifiers source 1..3012 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17" /map="17q12-q21" 5'UTR 300..336 /gene="dlg3" gene 300..2995 /gene="dlg3" CDS 337..2094 /gene="dlg3" /note="human homolog of Drosophila lethal discs large 1; Method: conceptual translation supplied by author." /codon_start=1 /db_xref="PID:g1022813" /translation="MPVLSEDSGLHETLALLTSQLRPDSNHKEEMGFLRDDFSEKSLS YLMKIHEKLRYYERQSPTPVLHSAVALAEDVMEELQAASVHSDERELLQLLSTPHLRA VLMVHDTVAQKNFDPVLPPLPDNIDEDFDEESVKIVRLVKNKEPLGATIRRDEHSGAV VVARIMRGGAADRSGLVHVGDELREVNGIAVLHKRPDEISQILAQSQGSITLKIIPAT QEEDRLKESKVFMRALFHYNPREDRAIPCQEAGLPFQRRQVLEVVSQDDPTWWQAKRV GDTNLRAGLIPSKGFQERRLSYRRAAGTLPSPQSLRKPPYDQPCDKETCDCEGYLKGH YVAGLRRSFRLGCRERLGGSQEGKMSSGAESPELLTYEEVARYQHQPGERPRLVVLIG SLGARLHELKQKVVAENPQHFGVAVPHTTRPRKSHEKEGVEYHFVSKQAFEADLHHNK FLEHGEYKENLYGTSLEAIQAVMAKNKVCLVDVEPEALKQLRTSEFKPYIIFVKPAIQ EKRKTPPMSPACEDTAAPFDEQQQEMAASAAFIDRHYGHLVDAVLVKEDLQGAYSQLK VVLEKLSKDTHWVPVSWVR" misc_feature 745..972 /gene="dlg3" /note="encodes Drosophila homology region" misc_feature 1024..1218 /gene="dlg3" /note="encodes src oncogene homology motif 3" misc_feature 1489..2034 /gene="dlg3" /note="encodes guanylate kinase domain (GK)" 3'UTR 2095..2995 /gene="dlg3" polyA_signal 2963..2968 /gene="dlg3" polyA_site 2994..2995 /gene="dlg3" BASE COUNT 774 a 752 c 770 g 714 t 2 others ORIGIN 1 agcttccacg ccgctgtttc ttcatcaata aaatgggaac attcgcagtc atttctcagg 61 actttgtgca gacaaaaagt ttatcataga aatcacccca cacaatgctt ggcacatcga 121 ggacactcag cacatcttag ctttatttcc ttcctctctg aaaaaagtaa gcagagttat 181 ttgttcccat cctctctgtc cacagcctgg tctcagagaa cacagtagat gtgtagtcag 241 tttgttcaat aaatatgctt gatggagctt caatacccac ctccaccccc ttctnccaga 301 atctgcaggg agaggtcggg aggtgacaac gccagcatgc cagtgctatc ggaggactct 361 ggtttgcatg aaaccctggc cctgctgacc tcccagctca gacctgactc caaccacaag 421 gaggagatgg gcttcctgag ggatgatttc agtgaaaaaa gcctcagtta cttaatgaag 481 attcatgaga agcttcgcta ttatgaaagg caaagtccaa ccccagttct gcacagcgct 541 gtggccctcg ctgaggacgt gatggaggag ttgcaggccg cctccgtgca cagtgatgag 601 agggagctgc tccagctgct gtccaccccg cacctgaggg ctgtgctcat ggtacatgac 661 acggttgccc agaagaattt tgaccccgtt ctcccgcctc tgcctgacaa tatcgatgag 721 gattttgatg aggaatcggt gaagatcgtc cgcttggtga agaacaagga acccctgggt 781 gccaccatcc ggcgggacga gcactcaggg gctgttgtgg tggccaggat catgcgagga 841 ggcgcagcag acaggagcgg cctggtccac gttggagatg agctccgaga agtgaacggg 901 atcgcagtcc tgcacaagcg gcccgacgag atcagccaga ttctggccca gtcccaggga 961 tccatcaccc taaaaatcat cccagccacc caggaggaag atcgcttaaa ggagagcaag 1021 gtgttcatgc gcgccctctt ccactacaac cctcgggagg accgggccat cccttgccag 1081 gaggcgggcc tgcccttcca gcgcaggcag gtcctggagg tggtgagcca ggacgacccc 1141 acgtggtggc aggccaagcg agtcggggac accaaccttc gagccggsct catcccctcc 1201 aaggggttcc aggagagacg actaagctac cggagagccg cgggcaccct gccgagcccc 1261 cagagcctca ggaagccccc ctatgatcag ccttgtgaca aagagacctg tgactgtgag 1321 ggctacctca aagggcacta tgtggctggt cttcggagga gcttccggct gggctgtagg 1381 gagagactgg gtggctcgca ggaaggaaag atgtcctccg gagctgagtc tccggagctg 1441 ctgacttacg aagaggtggc caggtaccaa caccagcccg gagagcggcc ccgcctggtg 1501 gttctgatcg ggtctctggg agcccgactg cacgagctga agcaaaaggt ggtggctgag 1561 aacccacagc actttggcgt cgctgttcca cataccacca ggccccgaaa gagccatgag 1621 aaggaaggag tggaatatca ctttgtgtct aagcaagcat ttgaggccga cttacatcac 1681 aacaagttcc tggaacatgg tgaatataag gaaaatctgt atggaaccag cctggaggcc 1741 attcaggctg ttatggccaa aaacaaagtt tgtttggtgg atgtggagcc agaagcactg 1801 aaacaactga ggacctcaga atttaaaccc tatattatat ttgtaaagcc tgcaattcag 1861 gaaaaaagaa aaacgccacc tatgtcccca gcttgtgagg acacagcagc cccatttgat 1921 gagcagcagc aagagatggc cgcttctgcc gccttcatag accggcatta cgggcacctg 1981 gtagacgccg tgctggtgaa ggaggatctc cagggtgcct acagccagct caaagtggtc 2041 ttagagaagc tgagcaagga cactcactgg gtacctgtta gttgggtcag gtaactttat 2101 cccagaacat ccaagctgga cgggaccttg aagatcatct agtccagact ccctcatttt 2161 accatcaagg aatctcaagc gcagagaggg agagaattct ccacaaattc catcatcgag 2221 aagagtataa gtgggaagtc ttgtttgttg ttggtttttg tctgttgttt ttcactgcac 2281 ctctttggat catgatttga aaggggcata tcagaaaaca acacatttca tttattaaag 2341 tatcacaggc aagctgaccc tgattctttg taccaaagtt aagtagccac tgtcttttgt 2401 gggtggtagt ggttaattta tacagtactg attcgcagaa tgtttaagct ttttaaacat 2461 agtgacgctt agtagttttt ttggaagcta acttgtttta tccaggggga ttttacatgt 2521 aactgaagtt cccctgtctt caagcactaa aacgttgatc ttaacctttt ttttgaagtg 2581 cttgcctggt aatagaaaac gggttctctg cctattttaa aatagtgaat atacgtaaat 2641 tttctctgga aggctgaggc acacttcacc atcaacatga attactgtac tatcctgtac 2701 tgcagtggtg ccttcaggga ctcgaggaat gtaaggttgc ctttcccctt tctaaatacc 2761 ctcagattcc taacatcgag cccatgcttt gtttgatttg ttctattcca tccattgtcc 2821 cttttgttac tgacagttgc cttggtccta gccagtccct gccatgagat cataggggtt 2881 cccattgtgc tagatcttgg gaaaccagat gactctccct gtcaaaacta tggctacgtc 2941 actgtaaacc atttctgtca agaataaaag tatgtagacc cagagtgtgg gcctaaaaaa 3001 aaaaaaaaaa aa // LOCUS HSU38175 1230 bp mRNA PRI 04-FEB-1997 DEFINITION Human HuR RNA binding protein (HuR) mRNA, complete cds. ACCESSION U38175 NID g1022960 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1230) AUTHORS Ma,W.J., Cheng,S., Campbell,C., Wright,A. and Furneaux,H. TITLE Cloning and characterization of HuR, a ubiquitously expressed Elav-like protein JOURNAL J. Biol. Chem. 271 (14), 8144-8151 (1996) MEDLINE 96215210 REFERENCE 2 (bases 1 to 1230) AUTHORS Ma,W.J., Cheng,S., Campbell,C. and Furneaux,H.M. TITLE Direct Submission JOURNAL Submitted (10-OCT-1995) Henry M. Furneaux, Prog. Mol. Pharm. & Therap., Sloan-Kettering Inst., 1275 York Ave, Box 20, New York, NY 10021, USA FEATURES Location/Qualifiers source 1..1230 /organism="Homo sapiens" /db_xref="taxon:9606" gene 119..1099 /gene="HuR" CDS 119..1099 /gene="HuR" /note="Member of Elav-like family; binds specifically to AU rich elements in mRNA" /codon_start=1 /product="HuR RNA binding protein" /db_xref="PID:g1022961" /translation="MSNGYEDHMAEDCRGDIGRTNLIVNYLPQNMTQDELRSLFSSIG EVESAKLIRDKVAGHSLGYGFVNYVTAKDAERAINTLNGLRLQSKTIKVSYARPSSEV IKDANLYISGLPRTMTQKDVEDMFSRFGRIINSRVLVDQTTGLSRGVAFIRFDKRSEA EEAITSFNGHKPPGSSEPIAVKFAANPNQNKNVALLSQLYHSPARRFGGPVHHQAQRF RFSPMGVDHMSGLSGVNVPGNASSGWCIFIYNLGQDADEGILWQMFGPFGAVTNVKVI RDFNTNKCKGFGFVTMTNYEEAAMAIASLNGYRLGDKILQVSFKTNKSHK" BASE COUNT 307 a 336 c 327 g 260 t ORIGIN 1 ccgccgccac cgctaccgag gccgagcgga gccgttagcg ccgcgccgcc gccgcctccc 61 gcccgccccg gagcagcccc gggcccgccc gcccgcatcc agatttttga aaaatacaat 121 gtctaatggt tatgaagacc acatggccga agactgcagg ggtgacatcg ggagaacgaa 181 tttgatcgtc aactacctcc ctcagaacat gacccaggat gagttacgaa gcctgttcag 241 cagcattggt gaagttgaat ctgcaaaact tattcgggat aaagtagcag gacacagctt 301 gggctacggc tttgtgaact acgtgaccgc gaaggatgca gagagagcga tcaacacgct 361 gaacggcttg aggctccagt caaaaaccat taaggtgtcg tatgctcgcc cgagctcaga 421 ggtgatcaaa gacgccaact tgtacatcag cgggctcccg cggaccatga cccagaagga 481 cgtagaagac atgttctctc ggtttgggcg gatcatcaac tcgcgggtcc tcgtggatca 541 gactacaggt ttgtccagag gggttgcgtt tatccggttt gacaaacggt cggaggcaga 601 agaggcaatt accagtttca atggtcataa acccccaggt tcctctgagc ccatcgcagt 661 gaagtttgca gccaacccca accagaacaa aaacgtggca ctcctctcgc agctgtacca 721 ctcgccagcg cgacggttcg gaggccccgt tcaccaccag gcgcagagat tcaggttctc 781 ccccatgggc gtcgatcaca tgagcgggct ctctggcgtc aacgtgccag gaaacgcctc 841 ctccggctgg tgcattttca tctacaacct ggggcaggat gccgacgagg ggatcctctg 901 gcagatgttt gggccgtttg gtgccgtcac caatgtgaaa gtgatccgcg acttcaacac 961 caacaagtgc aaagggtttg gctttgtgac catgacaaac tatgaagaag ccgcgatggc 1021 catagccagc ctgaacggct accgcctggg ggacaaaatc ttacaggttt ccttcaaaac 1081 caacaagtcc cacaaataac tcgctcatgc tttttttgta cggaatagat aattaagagt 1141 gaaggagttg aaacttttct tgttagtgta caactcattt tgcgccaatt ttcacaagtg 1201 tttgtctttg tctgaatgag aagtgagaag // LOCUS HSU38254 1917 bp mRNA PRI 21-NOV-1995 DEFINITION Human amiloride sensitive sodium channel delta subunit (dNaCh) mRNA, complete cds. ACCESSION U38254 NID g1066456 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1917) AUTHORS Waldmann,R., Champigny,G., Bassilana,F., Voilley,N. and Lazdunski,M. TITLE Molecular cloning and functional expression of a novel amiloride-sensitive Na+ channel JOURNAL J. Biol. Chem. 270 (46), 27411-27414 (1995) MEDLINE 96070858 REFERENCE 2 (bases 1 to 1917) AUTHORS Waldmann,R., Champigny,G., Bassilana,F., Voilley,N. and Lazdunski,M. TITLE Direct Submission JOURNAL Submitted (10-OCT-1995) Rainer Waldmann, IPMC, CNRS, 660 Route des Lucioles, Valbonne 06560, France FEATURES Location/Qualifiers source 1..1917 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..1917 /gene="dNaCh" CDS 1..1917 /gene="dNaCh" /codon_start=1 /product="amiloride sensitive sodium channel delta subunit" /db_xref="PID:g1066457" /translation="MAEHRSMDGRMEAATRGGSHLQAAAQTPPRPGPPSAPPPPPKEG HQEGLVELPASFRELLTFFCTNATIHGAIRLVCSRGNRLKTTSWGLLSLGALVALCWQ LGLLFERHWHRPVLMAVSVHSERKLLPLVTLCDGNPRRPSPVLRHLELLDEFARENID SLYNVNLSKGRAALSATVPRHEPPFHLDREIRLQRLSHSGSRVRVGFRLCNSTGGDCF YRGYTSGVAAVQDWYHFHYVDILALLPAAWEDSHGSQDGHFVLSCSYDGLDCQARQFR TFHHPTYGSCYTVDGVWTAQRPGITHGVGLVLRVEQQPHLPLLSTLAGIRVMVHGRNH TPFLGHHSFSVRPGTEATISIREDEVHRLGSPYGHCTAGGEGVEVELLHNTSYTRQAC LVSCFQQLMVETCSCGYYLHPLPAGAEYCSSARHPAWGHCFYRLYQDLETHRLPCTSR CPRPCRESAFKLSTGTSRWPSAKSAGWTLATLGEQGLPHQSHRQRSSLAKINIVYQEL NYRSVEEAPVYSVPQLLSAMGSLYSLWFGASVLSLLELLELLLDASALTLVLGGRRLR RAWFSWPRASPASGASSIKPEASQMPPPAGGTSDDPEPSGPHLPRVMLPGVLAGVSAE ESWAGPQPLETLDT" BASE COUNT 316 a 698 c 583 g 320 t ORIGIN 1 atggctgagc accgaagcat ggacgggaga atggaagcag ccacacgggg gggctctcac 61 ctccaggctg cagcccagac gccccccagg ccggggccac catcagcacc accaccacca 121 cccaaggagg ggcaccagga ggggctggtg gagctgcccg cctcgttccg ggagctgctc 181 accttcttct gcaccaatgc caccatccac ggcgccatcc gcctggtctg ctcccgcggg 241 aaccgcctca agacgacgtc ctgggggctg ctgtccctgg gagccctggt cgcgctctgc 301 tggcagctgg ggctcctctt tgagcgtcac tggcaccgcc cggtcctcat ggccgtctct 361 gtgcactcgg agcgcaagct gctcccgctg gtcaccctgt gtgacgggaa cccacgtcgg 421 ccgagtccgg tcctccgcca tctggagctg ctggacgagt ttgccaggga gaacattgac 481 tccctgtaca acgtcaacct cagcaaaggc agagccgccc tctccgccac tgtcccccgc 541 cacgagcccc ccttccacct ggaccgggag atccgtctgc agaggctgag ccactcgggc 601 agccgggtca gagtggggtt cagactgtgc aacagcacgg gcggcgactg cttttaccga 661 ggctacacgt caggcgtggc ggctgtccag gactggtacc acttccacta tgtggatatc 721 ctggccctgc tgcccgcggc atgggaggac agccacggga gccaggacgg ccacttcgtc 781 ctctcctgca gttacgatgg cctggactgc caggcccgac agttccggac cttccaccac 841 cccacctacg gcagctgcta cacggtcgat ggcgtctgga cagctcagcg ccccggcatc 901 acccacggag tcggcctggt cctcagggtt gagcagcagc ctcacctccc tctgctgtcc 961 acgctggccg gcatcagggt catggttcac ggccgtaacc acacgccctt cctggggcac 1021 cacagcttca gcgtccggcc agggacggag gccaccatca gcatccgaga ggacgaggtg 1081 caccggctcg ggagccccta cggccactgc accgccggcg gggaaggcgt ggaggtggag 1141 ctgctacaca acacctccta caccaggcag gcctgcctgg tgtcctgctt ccagcagctg 1201 atggtggaga cctgctcctg tggctactac ctccaccctc tgccggcggg ggctgagtac 1261 tgcagctctg cccggcaccc tgcctgggga cactgcttct accgcctcta ccaggacctg 1321 gagacccacc ggctcccctg tacctcccgc tgccccaggc cctgcaggga gtctgcattc 1381 aagctctcca ctgggacctc caggtggcct tccgccaagt cagctggatg gactctggcc 1441 acgctaggtg aacaggggct gccgcatcag agccacagac agaggagcag cctggccaaa 1501 atcaacatcg tctaccagga gctcaactac cgctcagtgg aggaggcgcc cgtgtactcg 1561 gtgccgcagc tgctctccgc catgggcagc ctctacagcc tgtggtttgg ggcctccgtc 1621 ctctccctcc tggagctcct ggagctgctg ctcgatgctt ctgccctcac cctggtgcta 1681 ggcggccgcc ggctccgcag ggcgtggttc tcctggccca gagccagccc tgcctcaggg 1741 gcgtccagca tcaagccaga ggccagtcag atgcccccgc ctgcaggcgg cacgtcagat 1801 gacccggagc ccagcgggcc tcatctccca cgggtgatgc ttccaggggt tctggcggga 1861 gtctcagccg aagagagctg ggctgggccc cagccccttg agactctgga cacctga // LOCUS HSU38545 3609 bp mRNA PRI 10-MAR-1997 DEFINITION Human ARF-activated phosphatidylcholine-specific phospholipase D1a (hPLD1) mRNA, complete cds. ACCESSION U38545 NID g1185462 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3609) AUTHORS Hammond,S.M., Altshuller,Y.M., Sung,T.C., Rudge,S.A., Rose,K., Engebrecht,J., Morris,A.J. and Frohman,M.A. TITLE Human ADP-ribosylation factor-activated phosphatidylcholine-specific phospholipase D defines a new and highly conserved gene family JOURNAL J. Biol. Chem. 270 (50), 29640-29643 (1995) MEDLINE 96102003 REFERENCE 2 (bases 1 to 3609) AUTHORS Hammond,S.M. and Frohman,M.A. TITLE Direct Submission JOURNAL Submitted (13-OCT-1995) Michael A. Frohman, Pharmacology, SUNY Stony Brook, BST, T7-172, Stony Brook, NY 11794-8651, USA FEATURES Location/Qualifiers source 1..3609 /organism="Homo sapiens" /note="Cloned from HeLa cell library." /db_xref="taxon:9606" /chromosome="3" /map="3q21.1-28" gene 96..3320 /gene="hPLD1" CDS 96..3320 /gene="hPLD1" /note="PLD1a; phosphatidylcholine-specific; activated by PIP2, ARF; inhibited by oleate; membrane associated" /codon_start=1 /function="Cleaves phosphatidylcholine to phosphatidic acid and choline" /product="phospholipase D1a" /db_xref="PID:g1185463" /translation="MSLKNEPRVNTSALQKIAADMSNIIENLDTRELHFEGEEVDYDV SPSDPKIQEVYIPFSAIYNTQGFKEPNIQTYLSGCPIKAQVLEVERFTSTTRVPSINL YTIELTHGEFKWQVKRKFKHFQEFHRELLKYKAFIRIPIPTRRHTFRRQNVREEPREM PSLPRSSENMIREEQFLGRRKQLEDYLTKILKMPMYRNYHATTEFLDISQLSFIHDLG PKGIEGMIMKRSGGHRIPGLNCCGQGRACYRWSKRWLIVKDSFLLYMKPDSGAIAFVL LVDKEFKIKVGKKETETKYGIRIDNLSRTLILKCNSYRHARWWGGAIEEFIQKHGTNF LKDHRFGSYAAIQENALAKWYVNAKGYFEDVANAMEEANEEIFITDWWLSPEIFLKRP VVEGNRWRLDCILKRKAQQGVRIFIMLYKEVELALGINSEYTKRTLMRLHPNIKVMRH PDHVSSTVYLWAHHEKLVIIDQSVAFVGGIDLAYGRWDDNEHRLTDVGSVKRVTSGPS LGSLPPAAMESMESLRLKDKNEPVQNLPIQKSIDDVDSKLKGIGKPRKFSKFSLYKQL HRHHLHDADSISSIDSTSSYFNHYRSHHNLIHGLKPHFKLFHPSSESEQGLTRPHADT GSIRSLQTGVGELHGETRFWHGKDYCNFVFKDWVQLDKPFADFIDRYSTPRMPWHDIA SAVHGKAARDVARHFIQRWNFTKIMKSKYRSLSYPFLLPKSQTTAHELRYQVPGSVHA NVQLLRSAADWSAGIKYHEESIHAAYVHVIENSRHYIYIENQFFISCADDKVVFNKIG DAIAQRILKAHRENQKYRVYVVIPLLPGFEGDISTGGGNALQAIMHFNYRTMCRGENS ILGQLKAELGNQWINYISFCGLRTHAELEGNLVTELIYVHSKLLIADDNTVIIGSANI NDRSMLGKRDSEMAVIVQDTETVPSVMDGKEYQAGRFARGLRLQCFRVVLGYLDDPSE DIQDPVSDKFFKEVWVSTAARNATIYDKVFRCLPNDEVHNLIQLRDFINKPVLAKEDP IRAEEELKKIRGFLVQFPFYFLSEESLLPSVGTKEAIVPMEVWT" BASE COUNT 1085 a 791 c 837 g 896 t ORIGIN 1 ggcacgagga gccctgagag tccgccgcca acgcgcaggt gctagcggcc ccttcgccct 61 gcagcccctt tgcttttact ctgtccaaag ttaacatgtc actgaaaaac gagccacggg 121 taaatacctc tgcactgcag aaaattgctg ctgacatgag taatatcata gaaaatctgg 181 acacgcggga actccacttt gagggagagg aggtagacta cgacgtgtct cccagcgatc 241 ccaagataca agaagtgtat atccctttct ctgctattta taacactcaa ggatttaagg 301 agcctaatat acagacgtat ctctccggct gtccaataaa agcacaagtt ctggaagtgg 361 aacgcttcac atctacaaca agggtaccaa gtattaatct ttacactatt gaattaacac 421 atggggaatt taaatggcaa gttaagagga aattcaagca ttttcaagaa tttcacagag 481 agctgctcaa gtacaaagcc tttatccgca tccccattcc cactagaaga cacacgttta 541 ggaggcaaaa cgtcagagag gagcctcgag agatgcccag tttgccccgt tcatctgaaa 601 acatgataag agaagaacaa ttccttggta gaagaaaaca actggaagat tacttgacaa 661 agatactaaa aatgcccatg tatagaaact atcatgccac aacagagttt cttgatataa 721 gccagctgtc tttcatccat gatttgggac caaagggcat agaaggtatg ataatgaaaa 781 gatctggagg acacagaata ccaggcttga attgctgtgg tcagggaaga gcctgctaca 841 gatggtcaaa aagatggtta atagtgaaag attccttttt attgtatatg aaaccagaca 901 gcggtgccat tgccttcgtc ctgctggtag acaaagaatt caaaattaag gtggggaaga 961 aggagacaga aacgaaatat ggaatccgaa ttgataatct ttcaaggaca cttattttaa 1021 aatgcaacag ctatagacat gctcggtggt ggggaggggc tatagaagaa ttcatccaga 1081 aacatggcac caactttctc aaagatcatc gatttgggtc atatgctgct atccaagaga 1141 atgctttagc taaatggtat gttaatgcca aaggatattt tgaagatgtg gcaaatgcaa 1201 tggaagaggc aaatgaagag atttttatca cagactggtg gctgagtcca gaaatcttcc 1261 tgaaacgccc agtggttgag ggaaatcgtt ggaggttgga ctgcattctt aaacgaaaag 1321 cacaacaagg agtgaggatc ttcataatgc tctacaaaga ggtggaactc gctcttggca 1381 tcaatagtga atacaccaag aggactttga tgcgtctaca tcccaacata aaggtgatga 1441 gacacccgga tcatgtgtca tccaccgtct atttgtgggc tcaccatgag aagcttgtca 1501 tcattgacca atcggtggcc tttgtgggag ggattgacct ggcctatgga aggtgggacg 1561 acaatgagca cagactcaca gacgtgggca gtgtgaagcg ggtcacttca ggaccgtctc 1621 tgggttccct cccacctgcc gcaatggagt ctatggaatc cttaagactc aaagataaaa 1681 atgagcctgt tcaaaaccta cccatccaga agagtattga tgatgtggat tcaaaactga 1741 aaggaatagg aaagccaaga aagttctcca aatttagtct ctacaagcag ctccacaggc 1801 accacctgca cgacgcagat agcatcagca gcattgacag cacctccagt tattttaatc 1861 actatagaag tcatcacaat ttaatccatg gtttaaaacc ccacttcaaa ctctttcacc 1921 cgtccagtga gtctgagcaa ggactcacta gacctcatgc tgataccggg tccatccgta 1981 gtttacagac aggtgtggga gagctgcatg gggaaaccag attctggcat ggaaaggact 2041 actgcaattt cgtcttcaaa gactgggttc aacttgataa accttttgct gatttcattg 2101 acaggtactc cacgccccgg atgccctggc atgacattgc ctctgcagtc cacgggaagg 2161 cggctcgtga tgtggcacgt cacttcatcc agcgctggaa cttcacaaaa attatgaaat 2221 caaaatatcg gtccctttct tatccttttc tgcttccaaa gtctcaaaca acagcccatg 2281 agttgagata tcaagtgcct gggtctgtcc atgctaacgt acagttgctc cgctctgctg 2341 ctgattggtc tgctggtata aagtaccatg aagagtccat ccacgccgct tacgtccatg 2401 tgatagagaa cagcaggcac tatatctata tcgaaaacca gtttttcata agctgtgctg 2461 atgacaaagt tgtgttcaac aagataggcg atgccattgc ccagaggatc ctgaaagctc 2521 acagggaaaa ccagaaatac cgggtatatg tcgtgatacc acttctgcca gggttcgaag 2581 gagacatttc aaccggcgga ggaaatgctc tacaggcaat catgcacttc aactacagaa 2641 ccatgtgcag aggagaaaat tccatccttg gacagttaaa agcagagctt ggtaatcagt 2701 ggataaatta catatcattc tgtggtctta gaacacatgc agagctcgaa ggaaacctag 2761 taactgagct tatctatgtc cacagcaagt tgttaattgc tgatgataac actgttatta 2821 ttggctctgc caacataaat gaccgcagca tgctgggaaa gcgtgacagt gaaatggctg 2881 tcattgtgca agatacagag actgttcctt cagtaatgga tggaaaagag taccaagctg 2941 gccggtttgc ccgaggactt cggctacagt gctttagggt tgtccttggc tatcttgatg 3001 acccaagtga ggacattcag gatccagtga gtgacaaatt cttcaaggag gtgtgggttt 3061 caacagcagc tcgaaatgct acaatttatg acaaggtttt ccggtgcctt cccaatgatg 3121 aagtacacaa tttaattcag ctgagagact ttataaacaa gcccgtatta gctaaggaag 3181 atcccattcg agctgaggag gaactgaaga agatccgtgg atttttggtg caattcccct 3241 tttatttctt gtctgaagaa agcctactgc cttctgttgg gaccaaagag gccatagtgc 3301 ccatggaggt ttggacttaa gagatattca ttggcagctc aaagacttcc accctggaga 3361 ccacactgca cacagtgact tcctggggat gtcatagcca aagccaggcc tgacgcattc 3421 tcgtatccaa cccaaggacc ttttggaatg actggggagg gctgcagtca cattgatgta 3481 aggactgtaa acatcagcaa gactttataa ttccttctgc ctaacttgta aaaagggggc 3541 tgcattcttg ttggtagcat gtactctgtt gagtaaaaca catattcaaa ttccgctcgt 3601 gccgaattc // LOCUS HSU38654 944 bp mRNA PRI 13-NOV-1995 DEFINITION Human Rab27 mRNA, complete cds. ACCESSION U38654 NID g1055280 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 944) AUTHORS Seabra,M.C., Ho,Y.K. and Anant,J.S. TITLE Deficient geranylgeranylation of Ram/Rab27 in choroideremia JOURNAL J. Biol. Chem. 270 (41), 24420-24427 (1995) MEDLINE 96025837 REFERENCE 2 (bases 1 to 944) AUTHORS Anant,J.S. and Seabra,M.C. TITLE Direct Submission JOURNAL Submitted (16-OCT-1995) Janmeet S. Anant, Molecular Genetics, UT Southwestern Medical Center, 5323 Harry Hines Blvd., Dallas, TX 75235, USA FEATURES Location/Qualifiers source 1..944 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="retinal pigment epithelium, spleen" CDS 246..911 /note="similar to rat ram p25, Swiss-Prot Accession Number P23640; Low Molecular Weight GTP-binding protein" /codon_start=1 /product="Rab27" /db_xref="PID:g1055281" /translation="MSDGDYDYLIKFLALGDSGVGKTSVLYQYTDGKFNSKFITTVGI DFREKRVVYRASGPDGATGRGQRIHLQLWDTAGQERFRSLTTAFFRDAMGFLLLCDLT NEQSFLNVRNWISQLQMHAYCENPDIVLCGNKSDLEDQRVVKEEEAIALAEKYGIPYF ETSAANGTNISQAIEMLLDLIMKRMERCVDKSWIPEGVVRSNGHASTDQLSEEKEKGA CGC" BASE COUNT 289 a 162 c 243 g 250 t ORIGIN 1 gttttgaaag ttgatggagc gaactgcttt tccaaagact cttttgaaaa actttttaag 61 taggccattc tgactttaac atttctcttt gtcttaacat tagacaaaaa gtaaccttcc 121 tgaagaggac atgtgattgg aagttgtcaa ttgttgaagc attggtaact ccagtctcta 181 acgttttaga aaatcataac aagcggttct ctaccctgta aaggtgaact actgagttct 241 tcattatgtc tgatggagat tatgattacc tcatcaagtt tttagctttg ggagactctg 301 gtgtagggaa gaccagtgta ctttaccaat atacagatgg taaatttaac tccaaattta 361 tcacaacagt gggcattgat ttcagggaaa aaagagtggt gtacagagcc agtgggccgg 421 atggagccac tggcagaggc cagagaatcc acctgcagtt atgggacaca gcagggcagg 481 agaggtttcg tagcttaacg acagcgttct tcagagatgc tatgggtttt cttctacttt 541 gtgatctgac aaatgagcaa agtttcctca atgtcagaaa ctggataagc cagctacaga 601 tgcatgcata ttgtgaaaac ccagatatag tgctgtgtgg aaacaagagt gatctggagg 661 accagagagt agtgaaagag gaggaagcca tagcactcgc agagaaatat ggaatcccct 721 actttgaaac tagtgctgcc aatgggacaa acataagcca agcaattgag atgcttctgg 781 acctgataat gaagcgaatg gaacggtgtg tggacaagtc ctggattcct gaaggagtgg 841 tgcgatcaaa tggtcatgcc tctacggatc agttaagtga agaaaaggag aaaggggcat 901 gtggctgttg agaagtcaag taagtgacat agtaggtcag gtgg // LOCUS HSU38810 2771 bp mRNA PRI 18-FEB-1997 DEFINITION Human mab-21 cell fate-determining protein homolog (CAGR1) mRNA, complete cds. ACCESSION U38810 NID g1209668 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2771) AUTHORS Margolis,R.L., Stine,Q.C., Mcinnis,M.G., Ranen,N.G., Rubinsztein,D.C., Leggo,J., Jones Brando,L.V., Kidwai,A.S., Loev,S.J., Breschel,T.S., Callahan,C., Simpson,S.G., DePaulo,J.R., McMahon,F.J., Jain,S., Paykel,E.S., Walsh,C., DeLisi,L.E., Crow,T.J., Torrey,E.F., Ashworth,R.G., Macke,J.P., Nathans,J. and Ross,C.A. TITLE cDNA cloning of a human homologue of the Caenorhabditis elegans cell fate-determining gene mab-21: expression, chromosomal localization and analysis of a highly polymorphic (CAG)n trinucleotide repeat JOURNAL Hum. Mol. Genet. 5 (5), 607-616 (1996) MEDLINE 96311555 REFERENCE 2 (bases 1 to 2771) AUTHORS Margolis,R.L., Kidwai,A.K., Breschel,T.B., McInnis,M.G., Mackie,J., Nathans,J. and Ross,C.A. TITLE Direct Submission JOURNAL Submitted (17-OCT-1995) Russell L. Margolis, Psychiatry, Johns Hopkins U. School of Med., 600 N. Wolfe St, Meyer 4-163, Baltimore, MD 21287, USA FEATURES Location/Qualifiers source 1..2771 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="13" /map="13q13, near D13s220" repeat_region 598..660 /note="polymorphic CAG trinucleotide repeat" gene 819..1898 /gene="CAGR1" CDS 819..1898 /gene="CAGR1" /note="homolog to C.elegans mab-21 cell fate-determining protein, encoded by GenBank Accession Number U19861" /codon_start=1 /product="CAGR1" /db_xref="PID:g1209669" /translation="MIAAQAKLVYHLNKYYNEKCQARKAAIAKTIREVCKVVSDVLKE VEVQEPRFISSLNEMDNRYEGLEVISPTEFEVVLYLNQMGVFNFVDDGSLPGCAVLKL SDGRKRSMSLWVEFITASGYLSARKIRSRFQTLVAQAVDKCSYRDVVKMVADTSEVKL RIRDRYVVQITPAFKCTGIWPRSAAHWPLPHIPWPGPNRVAEVKAEGFNLLSKECHSL AGKQSSAESDAWVLQFAEAENRLQMGGCRKKCLSILKTLRDRHLELPGQPLNNYHMKT LVSYECEKHPRESDWDESCLGDRLNGILLQLISCLQCRRCPHYFLPNLDLFQGKPHSA LENAAKQTWRLAREILTNPKSLEKL" BASE COUNT 785 a 612 c 628 g 746 t ORIGIN 1 gaattccaca ataaggtaat tagatttaga agtactcagt cactttaagt ggataaatgt 61 attagttaaa actttagggt ttgctttttt gctgtttaga tcaaagtttt ttctgattct 121 tctgtcctca ttgtgaacat aaccgtgtag ttgaaacagt caaacttatt tttgtaatgt 181 atgttattgt gtgatgcagt tttttgcttc tgtctccaat attaaaccat tttcctaata 241 cttgtttctc tctctgcgtg ttgtattgtt ggtagtcatt atatgttggt gatacatctg 301 cacactcacc ccggacacac actcagcaca cttttcctcc atttgattaa cagtgctgca 361 cacacaatga ttacgggaaa gcgcaaataa atacggaaag gggtgcttat tttgactact 421 ggaagagctt tgctgggtct cagcgcaact tttgtttttt attcctgaga aggtgatctc 481 tccatgcggt tctctcacac aaggattctt taaaagagga agagagacaa gcagaggggg 541 gaggacagtc tttcacttta agaacggctg ggctcaaaga taaaaggaag ggaaaagcag 601 cagcagcagc agcagcagca gcagcagcag cagcagcagc agcagcagca gcagcagcag 661 ggaaaccaac gctgcagcac ttccgaaagg catttttgat ccatttctga gtgttgcggc 721 ccgtttctcc accgaagttg gctccagctc tagcagccgc attggatccc acagcttact 781 gcgagactcc ggtgtacaat ccggatctct gccccaacat gattgcggcc caggccaagc 841 tggtctacca tctgaataaa tactacaacg aaaaatgcca agccaggaaa gctgccattg 901 ccaaaactat ccgggaagtc tgcaaagtag tttccgacgt actgaaggaa gtggaagtgc 961 aggagccgcg gttcatcagc tctctcaacg agatggacaa tcgctacgag ggcctcgagg 1021 tcatctcccc caccgaattt gaagtggtgc tttatctcaa ccaaatgggg gtgttcaact 1081 tcgtggacga tggctcactg cccggctgcg cggtgctgaa gttgagcgac gggcgcaaga 1141 ggagcatgtc cctctgggtg gaattcatta ccgcctccgg ctacctctcg gcgcgcaaaa 1201 tccggtccag gtttcagacg ctggtggctc aagcggtaga caaatgtagc taccgggatg 1261 tggtaaagat ggtggcagac accagcgaag tgaaactgag aatccgagat aggtacgtgg 1321 tgcagatcac gccggccttt aaatgcaccg ggatctggcc gaggagtgct gcccactggc 1381 cacttcccca catcccctgg ccgggaccca accgggtggc ggaggtcaag gcggaaggtt 1441 tcaatctctt gtccaaggag tgccactcct tggccggcaa gcagagctcg gcggagagcg 1501 acgcctgggt gctgcagttc gcggaggcag agaacagact gcagatgggg ggctgcagaa 1561 agaagtgcct ctccatcctc aaaaccttaa gggatcgtca ccttgaactg ccgggccagc 1621 ccttgaacaa ttaccatatg aagactctgg tttcctacga gtgtgaaaag catccccgag 1681 agtcggactg ggacgagtct tgcctgggtg atcggctgaa cgggattttg ctgcaactta 1741 tctcctgcct gcagtgccgg cggtgtcccc actactttct accgaactta gatctgtttc 1801 aaggcaaacc tcactcagct ctggaaaacg ctgccaaaca aacgtggcga ctggcaagag 1861 agatcctgac caacccgaaa agtttggaaa aactttagag gatgatttaa tcaagagccg 1921 aaattattac ccttctcaaa gtccttatta agtgtaaact tctgttcaat tcctaatatt 1981 ccactccgca gtgcaaacaa tctcttcctt taaaaaggaa taataataca atatttaaac 2041 atcatctcac ccacccccac aaggggagaa aaagtagggg aagcggatgg agaaaaaccc 2101 aaagccacta gtattagaag acttctttcc acacgatttc ctatctccct tgaaaagtac 2161 accgtaacac tccgtaaaca gcccagctgt aacgccagac cgagacgaac actctgccta 2221 actatcaaag gattatagca atcctggtga tttaggtgca tctgtctgtg agtaaacacg 2281 atttggatat gccatctgaa agaaactgta atgtatattt tgatttgtaa caaatattgt 2341 gatctcacat tgtctttgaa agtgtggatg ttggtgtttt gtgatttggt gaacagaact 2401 taaattgcca ttctggatac ttccagacat tttccactaa caaagatatc atttaaaggt 2461 agatttcttc ctggtacttt tatctgtctt tgaaagtgtc tgaactttaa aaagtttaca 2521 ttttgtttca aatattgctt gttctatttc taacattcca taaatatact tgaaatgtta 2581 tttaaatata ttcaaagaaa tttgaattca gcttatataa taacgcttga atatctgaat 2641 tatatatttg aaaaatgcac ttgaaataca ctggataatt acttttgtga tttagatttt 2701 aatttgttgc tggtttttat ttaattagat ggtaataaat gaagtaaaat aaaaaaaaaa 2761 aaaaggaatt c // LOCUS HSU38817 744 bp mRNA PRI 13-NOV-1996 DEFINITION Human SUPT4H mRNA, complete cds. ACCESSION U38817 NID g1401054 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 744) AUTHORS Chiang,P.-W., Wang,S.-Q., Smithivas,P., Song,W.-J., Crombez,E., Akhtar,A., Im,R., Greenfield,J., Ramamoorthy,S., Van Keuren,M.L., Blackburn,C.C., Tsai,C.-H. and Kurnit,D.M. TITLE Isolation and characterization of the human and mouse homologues (SUPT4H and Supt4h) of the yeast SPT4 gene JOURNAL Genomics 34 (3), 368-375 (1996) MEDLINE 96374829 REFERENCE 2 (bases 1 to 744) AUTHORS Chiang,P.-W. TITLE Direct Submission JOURNAL Submitted (17-OCT-1995) Pei-Wen Chiang, Pediatrics, University of Michigan, MSRBI, Rm 3520, 1150 W. Medical Center Drive, Ann Arbor, MI 48109-0650, USA FEATURES Location/Qualifiers source 1..744 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17" /map="17q" gene 43..396 /gene="SUPT4H" CDS 43..396 /gene="SUPT4H" /codon_start=1 /product="SUPT4H" /db_xref="PID:g1401055" /translation="MALETVPKDLRHLRACLLCSLVKTIDQFEYDGCDNCDAYLQMKG NREMVYDCTSSSFDGIIAMMSPEDSWVSKWQRVSNFKPGVYAVSVTGRLPQGIVRELK SRGVAYKSRDTAIKT" BASE COUNT 188 a 167 c 196 g 193 t ORIGIN 1 gggcgggcgt ctatctccct gttgttcttc ccatcggcga agatggccct ggagacggtg 61 ccgaaggacc tgcggcatct gcgggcctgt ttgctgtgtt cgctggtcaa gactatagac 121 cagtttgaat atgatggttg tgacaattgt gatgcatatc tacaaatgaa gggtaaccga 181 gagatggtat atgactgcac tagctcttcc tttgatggaa tcattgcgat gatgagtcca 241 gaggacagct gggtctccaa gtggcagcga gtcagtaact ttaagccagg tgtatatgcg 301 gtgtcagtca ctggtcgcct gccccaagga atcgtgcggg agctgaaaag tcgaggagtg 361 gcctacaaat ccagagacac agctataaag acctagcaag atgcaaggct gccagcatct 421 ttgctctcca cctcctgcct ctgcttattt cttgttctgg aactaaatga acagaacttc 481 aaatacttcc taccctccaa ttcagactca gctgactgtt gagagagcag cacatcattt 541 tatcatttta tcttctttgg actacaggtg gggtgggagg gatttgggtt ggtggattaa 601 cagatggaat tgaggagaga gtaggatgct gattttccta cccgtggccc aggtctgtgc 661 cttccccatg ccaaggactc taggtcaaat gtcaataaat atgaacctcg agaaagttct 721 gaaggccaaa aaaaacccga attc // LOCUS HSU38846 1883 bp mRNA PRI 24-FEB-1996 DEFINITION Human stimulator of TAR RNA binding (SRB) mRNA, complete cds. ACCESSION U38846 NID g1200183 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1883) AUTHORS Wu-Baer,F., Lane,W.S. and Gaynor,R.B. TITLE Identification of a group of cellular cofactors that stimulate the binding of RNA polymerase II and TRP-185 to human immunodeficiency virus 1 TAR RNA JOURNAL J. Biol. Chem. 271 (8), 4201-4208 (1996) MEDLINE 96223995 REFERENCE 2 (bases 1 to 1883) AUTHORS Wu-Baer,F. and Gaynor,R.B. TITLE Direct Submission JOURNAL Submitted (17-OCT-1995) Foon Wu-Baer, Internal Medicine, UT Southwestern Medical Center, 5323 Harry Hines Blvd., Dallas, TX 75235-8594, USA FEATURES Location/Qualifiers source 1..1883 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" gene 1..1620 /gene="SRB" CDS 1..1620 /gene="SRB" /codon_start=1 /product="stimulator of TAR RNA binding" /db_xref="PID:g1200184" /translation="MPENVAPRSGATAGAAGGRGKGAYQDRDKPAQIRFSNISAAKAV ADAIRTSLGPKGMDKMIQDGKGDVTITNDGATILKQMQVLHPAARMLVELSKAQDIEA GDGTTSVVIIAGSLLDSCTKLLQKGIHPTIISESFQKALEKGIEILTDMSRPVELSDR ETLLNSATTSLNSKVVSQYSSLLSPMSVNAVMKVIDPATATSVDLRDIKIVKKLGGTI DDCELVEGLVLTQKVSNSGITRVEKAKIGLIQFCLSAPKTDMDNQIVVSDYAQMDRVL REERAYILNLVKQIKKTGCNVLLIQKSILRDALSDLALHFLNKMKIMVIKDIEREDIE FICKTIGTKPVAHIDQFTADMLGSAELAEEVNLNGSGKLLKITGCASPGKTVTIVVRG SNKLVIEEAERSIHDALCVIRCLVKKRALIAGGGAPEIELALALTEYSRTLSGMESYC VRAFADAMEVIPSTLAENAGLNPISTVTELRNRHAQGEKTAGINVRKGGISNILEELV VQPLLVSVSALTLATETVRSILKIDDVVNTR" polyA_site 1883 /note="18 A nucleotides" BASE COUNT 563 a 365 c 439 g 516 t ORIGIN 1 atgcccgaga atgtggcacc ccggagcggg gcgactgccg gggctgccgg cggccgcggg 61 aaaggcgcct atcaggaccg cgacaagcca gcccagatcc gcttcagcaa catttccgcc 121 gccaaagcgg ttgctgatgc tattagaaca agccttggac caaaaggaat ggataaaatg 181 attcaagatg gaaaaggtga tgtaaccatt acaaatgatg gtgctaccat tctgaaacaa 241 atgcaagtat tacatccagc agccagaatg ctggtggagc tgtctaaggc tcaagatata 301 gaagcaggag atggcaccac atcagtagtc atcattgctg gctccctctt agattcttgt 361 accaagcttc ttcagaaagg gattcatcca accatcattt ctgagtcatt ccagaaggcc 421 ctggaaaagg gcattgaaat cttgactgac atgtctcgac ctgtggaact gagtgacaga 481 gaaactttgt taaatagtgc aaccacttca ctgaactcaa aggtggtttc tcagtattca 541 agtctgcttt ctccaatgag tgtaaatgca gtgatgaaag tgattgaccc agccacagcc 601 accagtgtag atcttagaga tattaaaata gttaagaagc ttggtgggac aattgatgac 661 tgtgagttgg tggaagggct ggttctcacc caaaaagtgt caaattctgg cataaccaga 721 gttgaaaagg ccaagattgg gcttattcag ttttgcttat ctgctcccaa aacagacatg 781 gataatcaaa tagtggtttc tgactatgcc cagatggacc gagtgctgcg agaagagaga 841 gcctatattt taaatttagt gaagcaaatt aaaaaaacag gatgtaatgt ccttctcata 901 cagaaatcta ttctaagaga tgctcttagt gatcttgcat tacactttct gaataaaatg 961 aagatcatgg tgattaagga tattgaaaga gaagacattg aattcatttg taagacaatt 1021 ggaaccaagc cagttgctca tattgaccaa tttactgctg acatgctggg ttctgctgag 1081 ttagctgagg aggtcaattt aaatggttct ggcaaactgc tcaagattac aggctgtgcc 1141 agccctggaa aaacagttac aattgttgtt cgtggttcta acaaactggt gattgaagaa 1201 gctgagcgct ccattcatga tgccctatgt gttattcgtt gtttagtgaa gaagagggct 1261 cttattgcag gaggtggtgc tccagaaata gagttggccc tagcattaac tgaatattca 1321 cgaacactga gtggtatgga atcctactgc gttcgtgctt ttgcagatgc tatggaggtc 1381 attccatcta cactagctga aaatgccggc ctgaatccca tttctacagt aacagaacta 1441 agaaaccggc atgcccaggg agaaaaaact gcaggcatta atgtccgaaa gggtggtatt 1501 tccaacattt tggaggaact ggttgtccag cctctgttgg tatcagtcag tgctctgact 1561 cttgcaactg aaactgttcg gagcattctg aaaatagatg atgtggtaaa cactcgataa 1621 tctggataac tgactagcac cattatgatc accagtattg tggctggaat ggaagaagat 1681 caccttggtg ttccttgttt ggaagattat ttcctctgaa tttctgggct tggtcttcca 1741 gttggcattt gcctgaagtt gtattgaaac aatttaatga aaatattaaa tatttggttt 1801 caaaaggcag atttatcttc tcccaacatt ctgttatttc tgatactttt gaaaaactaa 1861 taaaaactaa taaaagaagc gta // LOCUS HSU38847 5173 bp mRNA PRI 22-FEB-1996 DEFINITION Human TAR RNA loop binding protein (TRP-185) mRNA, complete cds. ACCESSION U38847 NID g1184691 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5173) AUTHORS Wu-Baer,F., Lane,W.S. and Gaynor,R.B. TITLE The cellular factor TRP-185 regulates RNA polymerase II binding to HIV-1 TAR RNA JOURNAL EMBO J. 14 (23), 5995-6009 (1995) MEDLINE 96112814 REFERENCE 2 (bases 1 to 5173) AUTHORS Wu-Baer,F. and Gaynor,R.B. TITLE Direct Submission JOURNAL Submitted (17-OCT-1995) Foon Wu-Baer, Internal Medicine, UT Southwestern Medical Center, 5323 Harry Hines Blvd., Dallas, TX 75235-8594, USA FEATURES Location/Qualifiers source 1..5173 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..4866 /gene="TRP-185" CDS 1..4866 /gene="TRP-185" /note="; TRP-185" /codon_start=1 /product="TAR RNA loop binding protein" /db_xref="PID:g1184692" /translation="MEWVLAEALLSQSRDPRALLGALCQGEASAERVETLRFLLQRLE DEEARGSGGAGALPEAAREVAAGYLVPLLRSLRGRPAGGPDPSLQPRHRRRVLRAAGA ALRSCVRLAGRPQLAAALAEEALRDLLAGWRAPGAEAAVEVLAAVGPCLRPREDGPLL ERVAGTAVALALGGGGDGDEAGPAEDAAALVAGRLLPVLVQCGGAALRAVWGGLAAPG ASLGSGRVEEKLLVLSALAEKLLPEPGGDRARGAREAGPDARRCWRFWRTVQAGLGQA DALTRKRARYLLQRAVEVSAELGADCTCGPQEGNGPSLFWWSERKKDELLKFWENYIL IMETLEGNQIHVIKPVLPKLNNLFEYAVSEENGCWLFHPSWHMCIYKRMFESENKILS KEGVIHFLELYETKILPFSPEFSEFIIGPLMDALSESSLYSRSPGQPIGSCSPLGLKL QKFLVTYISLLPEEIKSSFLLKFIRKMTSRHWCAVPILFLSKALANVPRHKALGIDGL LALRDVIHCTMITHQILLRGAAQCYLLQTAMNLLDVEKVSLSDVSTFLMSLRQEESLG RGTSLWTELCDWLRVNESYFKPSPTCSSIGLHKTSLNAYVKSIVQEYVKSSAWETGEN CFMPDWFEAKLVSLMVLLAVDVEGMKTQYSGKQRTENVLRIFLDPLLDVLMKFSTNAY MPLLKTDRCLQLLLKLLNTCRLKGSSAQDDEVSTVLQNFFMSTTESISEFILRRLTMN ELNSVSDLDRCHLYLMVLTELINLHLKVGWKRGNPIWRVISLLKNASIQHLQEMDSGQ EPTVGSQIQRVVSMAALAMVCEAIDQKPELQLDSLHAGPLESFLSSLQLNQTLQKPHA EEQSSYAHPLECSSVLEESSSSQGWGKIVAQYIHDQWVCLSFLLKKYHTLIPTTGSEI LEPFLPAVQMPIRTLQSALEALTVLSSDQVLPVFHCLKVLVPKLLTSSESLCIESFDM AWKIISSLSNTQLIFWANLKAFVQFVFDNKVLTIAAKIKGQAYFKIKEIMYKIIEMSA IKTGVFNTLISYCCQSWIVSASNVSQGSLSSAKNYSELILEACIFGTVFRRDQRLVQD VQTFIENLGHDCAANIVMENTKREDHYVRICAVKFLCLLDGSNMSHKLFIEDLAIKLL DKDELVSKSKKRYYVNSLQHRVKNRVWQTLLVLFPRLDQNFLNGIIDRIFQAGFTNNQ ASIKYFIEWIIILILHKFPQFLPKFWDCFSYGEENLKTSICTFLAVLSHLDIITQNIP EKKLILKQALIVVLQWCFNHNFSVRLYALVALKKLWTVCKVLSVEEFDALTPVIESSL HQVESMHGAGNAKKNWQRIQEHFFFATFHPLKDYCLETIFYILPRLSGLIEDEWITID KFTRFTDVPLAAGFQWYLSQTQLSKLKPGDWSQQDIGTNLVEADNQAEWTDVQKKIIP WNSRVSDLDLELLFQDRAARLGKSISRLIVVASLIDKPTNLGGLCRTCEVFGASVLVV GSLQCISDKQFQHLSVSAEQWLPLVEVKPPQLIDYLQQKKTEGYTIIGVEQTAKSLDL TQYCFPEKSLLLLGNEREGIPANLIQQLDVCVEIPQQGIIRSLNVHVSGALLIWEYTR QQLLSHGDTKP" BASE COUNT 1348 a 1116 c 1309 g 1400 t ORIGIN 1 atggagtggg tgctcgcgga agcgctgctc tcgcagagcc gggacccccg ggccctgctt 61 ggggcgctgt gccaagggga ggcatccgcg gagcgcgtgg agacgctgcg cttccttctg 121 cagcggctcg aggacgagga ggcgcgcggc agcgggggcg caggcgcgct cccggaggcg 181 gcgcgcgagg tggctgcagg gtacctcgtg ccactgctgc ggagcctgcg cggacgcccc 241 gcgggcggcc cggaccccag tctgcagcct cgccaccgcc ggcgcgtgct gagggcggcg 301 ggcgcggccc tgcgctcgtg cgtccgcctg gccgggcgtc cgcagctggc ggccgcgctg 361 gctgaggagg cgctgcgcga tctgctcgcc gggtggcgcg cgcctggcgc cgaggctgcc 421 gtggaagtgc tagcagccgt cgggccatgt ttgcggcccc gcgaggacgg gccgctactg 481 gagcgggtgg cggggaccgc cgtcgccctg gcgctgggcg ggggcgggga cggggatgag 541 gccgggcctg ccgaggacgc ggcggcgctg gtggccgggc gactgctgcc agtgctggtc 601 caatgtggcg gggcggcgct gcgggccgtg tggggcgggc tggccgcgcc tggggcgtcc 661 ctggggtccg gccgcgtaga ggagaagctg ctggtcctga gcgccctggc cgagaagctg 721 ttgcccgagc ccggcggcga ccgcgcccgc ggcgcgcgcg aggcgggccc ggacgcccgg 781 cgctgctggc gcttctggag gacggtgcag gcggggctgg gccaggcgga cgccctgacg 841 cgcaagcgag cgcgctacct gctgcagagg gcggtggagg tgtcggcgga gctgggggcc 901 gactgcacct gcgggcccca ggaaggaaac ggcccaagtc tgttttggtg gtctgagagg 961 aaaaaagatg agcttctaaa gttttgggaa aattatattt taattatgga gactttagaa 1021 ggaaatcaga tacatgttat aaagccagtt ttaccaaagc taaacaatct gtttgaatat 1081 gcggtgtcag aggaaaatgg atgttggctc tttcacccat cctggcatat gtgtatttat 1141 aaaagaatgt ttgaaagtga aaacaaaatc ctgtccaaag aaggtgttat ccattttttg 1201 gagctgtatg aaacaaagat tcttccattt tcaccagaat tttctgagtt tattattgga 1261 ccattaatgg atgcgctttc agagagctct ctgtatagca ggtccccagg ccagccaata 1321 ggaagctgtt ctccattggg actgaaatta cagaagtttt tagtcactta tatttctctt 1381 cttccagaag aaataaagag tagcttccta ttgaagttta ttcggaagat gacaagtagg 1441 cattggtgtg ctgttcccat tttgtttcta tctaaggctt tggcaaatgt cccaagacat 1501 aaggccctgg gtatagatgg gcttcttgct ctcagggatg ttattcattg cactatgatc 1561 acacatcaga ttctcctgag aggggcagcc caatgctacc ttcttcaaac agctatgaat 1621 ttgctagatg tggagaaagt gtcactttct gatgtctcaa cttttctcat gtctctgaga 1681 caagaggaat ccttaggacg aggaacttca ttgtggacag agctgtgtga ctggctacgt 1741 gttaatgaaa gctattttaa gccatcccct acgtgtagct ccattggact tcacaagaca 1801 tctttaaatg cttatgtaaa gagcattgtt caagagtatg ttaagtcatc tgcttgggaa 1861 acaggagaaa actgctttat gcctgattgg tttgaagcca agcttgtttc tctgatggtc 1921 ttgctggctg tggatgtgga aggaatgaag actcagtata gcggaaagca gagaacagag 1981 aatgtattgc ggatattctt agaccctctt ctggatgtgc ttatgaagtt tagtaccaat 2041 gcctacatgc ccttgctgaa gactgacaga tgcctccagc tgctgttgaa gctgttgaac 2101 acatgcaggt tgaaaggttc cagtgcccaa gatgatgagg tgtctactgt tcttcagaac 2161 tttttcatgt ctactacaga gagcatttct gaatttattc tcagaagact tactatgaat 2221 gagctaaata gtgtttcaga tctggatcgt tgccatttat acctgatggt gttaactgag 2281 cttataaatc tgcatttgaa ggttgggtgg aaaaggggta accctatctg gagagttatt 2341 tctcttttga aaaatgcatc cattcagcat cttcaagaga tggacagtgg acaggagcca 2401 acagttggaa gtcagattca gagagtagtg agcatggctg ccttggccat ggtgtgtgag 2461 gccatagacc agaagcctga gctgcagctg gactctctcc atgctgggcc cctggaaagc 2521 ttcctttcct ctcttcagct caatcagacg ctgcagaagc cccacgcaga ggagcagagc 2581 agttatgctc accccttgga gtgcagcagt gttttggaag aatcgtcatc ttcccaagga 2641 tggggaaaaa tagttgcaca atatattcat gatcaatggg tgtgcctctc tttcctgttg 2701 aaaaaatatc acacccttat accaaccaca gggagtgaaa ttctggaacc gtttctacct 2761 gccgttcaga tgccaataag gactttgcag tctgcactag aagccctcac agttctttct 2821 tctgatcaag ttttaccagt gttccattgc ttgaaagtgt tggttcccaa gcttctgact 2881 tcctctgaat cactctgcat agagtctttt gacatggcgt ggaaaattat atcttcttta 2941 agcaacactc agctgatatt ctgggctaat ttaaaagctt ttgttcagtt tgtttttgat 3001 aacaaagttc ttaccattgc tgccaaaatc aagggccagg catatttcaa aataaaagag 3061 attatgtaca agataattga aatgtctgct ataaagactg gagtcttcaa tacactgata 3121 agttactgct gtcagtcttg gatagtgtct gcttcaaatg tgtcccaagg atctttatca 3181 agtgctaaaa attatagcga acttatcctt gaggcttgta tatttggaac tgtgtttagg 3241 cgtgatcaaa gacttgttca ggatgtacag accttcatag aaaaccttgg acatgactgt 3301 gcggcaaata ttgttatgga aaatactaag agagaagacc attatgtgag aatttgtgct 3361 gtcaaattcc tgtgtttatt agatggctcc aatatgtccc acaagttgtt tattgaggat 3421 cttgcaatca agctattaga taaagatgaa ttagtgtcca agtccaaaaa acgctactat 3481 gtgaattctc tacagcacag agtgaaaaac cgagtctggc agactctgct ggtacttttc 3541 cctagacttg accagaattt cttgaatgga attattgaca ggattttcca ggctggtttc 3601 accaacaatc aagcatccat aaaatatttt atagaatgga ttattatatt gattcttcat 3661 aaattccctc aatttcttcc aaagttctgg gattgttttt cttatggtga agaaaatctt 3721 aaaacaagca tttgtacatt tttagcagtt ttatcacatt tagacattat tactcaaaat 3781 attccagaaa agaaactaat tctgaagcaa gcccttatag ttgtgctgca gtggtgtttc 3841 aatcacaatt ttagtgttcg actgtatgct ttagttgctc ttaagaaact ctggactgtg 3901 tgtaaagtgt taagtgttga agaatttgat gccctgactc ctgtgattga atccagcctc 3961 catcaagtgg aaagcatgca cggagcaggg aatgccaaga agaattggca acgcattcag 4021 gagcatttct tttttgcaac atttcaccca ctcaaggatt attgtctaga gaccatattt 4081 tacatccttc cacgcctttc aggccttatt gaagatgaat ggatcaccat tgataaattt 4141 accagattca ctgatgttcc tttagctgcg ggatttcagt ggtacctttc tcaaactcaa 4201 cttagtaaac taaaaccagg tgactggtct cagcaagaca taggtactaa tttggttgaa 4261 gcagataacc aagcagagtg gaccgacgtt cagaagaaga ttatcccgtg gaacagtcgt 4321 gtttccgact tagacctgga gctcctgttt caggatcgtg ctgccagact tggaaagtca 4381 attagtagac tcatcgttgt ggcctcgctc atcgacaaac cgaccaattt aggaggactg 4441 tgcaggacct gtgaggtatt tggggcttca gtgctcgttg ttggcagcct tcagtgtatc 4501 agcgacaaac agtttcagca cctcagtgtc tctgcagaac agtggcttcc tctagtggag 4561 gtaaaaccac ctcagctaat tgattatctg cagcagaaga aaacagaagg ttataccatc 4621 attggagtgg aacaaactgc caaaagttta gacctaaccc aatattgctt tcctgagaaa 4681 tctctgctct tgttgggaaa tgaacgtgag ggaattccag caaatctgat ccaacagttg 4741 gacgtttgtg tggaaattcc tcaacagggc attatccgct ccctgaatgt ccatgtgagt 4801 ggagccctgc tgatctggga gtacaccagg cagcagctgc tctcgcacgg agataccaag 4861 ccatgatgtg ccttccttag tgaactgctg ctgctgttca gactttttta aaaaaaacta 4921 tttggactaa agaaacagat tctgaaattt attgtgataa tttgtatttc ttttttcttg 4981 caatttaatg ccaaaagttt gccatgtgcc ttaaacatat tactatatat tttccccttt 5041 aataaacact ttttgttaaa ttgtattctt cctttaataa aatattttaa gcaattgtcc 5101 aataaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 5161 aaaaaaaaaa aaa // LOCUS HSU38864 2235 bp mRNA PRI 13-MAY-1997 DEFINITION Human zinc-finger protein C2H2-150 mRNA, complete cds. ACCESSION U38864 NID g1055340 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2235) AUTHORS Becker,K.G., Nagle,J.W., Canning,R.D., Biddison,W.E., Ozato,K. and Drew,P.D. TITLE Rapid isolation and characterization of 118 novel C2H2-type zinc finger cDNAs expressed in human brain JOURNAL Hum. Mol. Genet. 4 (4), 685-691 (1995) MEDLINE 95359976 REFERENCE 2 (bases 1 to 2235) AUTHORS Becker,K.G., Nagle,J.W., Canning,R.D., Dehejia,A.M., Polymeropoulos,M.H., Gado,A.M., Biddison,W.E. and Drew,P.D. TITLE Molecular cloning and mapping of a novel human KRAB domain-containing C2H2-type zinc finger to chromosome 7q36.1 JOURNAL Genomics 41 (3), 502-504 (1997) MEDLINE 97312716 REFERENCE 3 (bases 1 to 2235) AUTHORS Becker,K.G. TITLE Direct Submission JOURNAL Submitted (18-OCT-1995) Kevin G. Becker, Neuroimmunology Branch, NINDS/NIH, Bldg10, Rm 5B04, 9000 Rockville Pike, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..2235 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" /map="7q35-36.1" /tissue_type="hippocampus" /clone_lib="Stratagene cat.# 936205" CDS 221..1066 /note="KRAB box-containing zinc finger protein" /codon_start=1 /product="C2H2-150" /db_xref="PID:g1055341" /translation="MIKQELQYTQEGPADLPGEFSCIAEEQAFLSPEQTELWGGQGSS VLLETGPGDSTLEEPVGSRVPSSSRTVGCPKQKSYRQVQLDQECGQGLKLKKDTSRPY ECSECEITFRYKQQLAAHLRSHSGWGSCTPEEPEESLRPRPRLKPQTKKAKLHQCDVC LRSFSCKVSLVTHQRCHLQEGPSAGQHVQERFSPNSLVALPGHIPWRKSRSSLICGYC GKSFSHPSDLVRHQRIHTGERPYSCTECEKSFVQKQHLLQHQKIHQRERGGLALEPGR PNGLL" misc_feature 530..590 /note="encodes zinc finger" misc_feature 689..751 /note="encodes zinc finger" misc_feature 863..925 /note="encodes zinc finger" misc_feature 947..997 /note="encodes zinc finger" BASE COUNT 494 a 535 c 658 g 548 t ORIGIN 1 ggtcactgga gaatgatggc gtctgtttca ccgagcagga atgggagaat ctggaggatt 61 ggcagaagga gctctacaga aacgtgatgg agagtaacta tgagacactg gtctctctga 121 aggtccttgg ccagacagag ggagaagcgg agttgggtac agagatgctg ggtgacttgg 181 aagaggaagg tcctgtggtg cccacccagc aggtggggtc atgatcaaac aggagctaca 241 gtatacacag gaaggccctg cggatcttcc tggagagttc tcatgcattg ctgaagagca 301 ggctttcctg agcccagagc agaccgaact ctggggtggt cagggcagtt ctgtcctctt 361 ggaaacaggt cctggggact ctactctaga ggagcctgtt ggtagtagag ttcctagcag 421 cagcagaact gtgggctgcc cgaagcagaa atcttatagg caggtacagc tggaccagga 481 atgtgggcag ggcctgaagc tgaaaaagga cacttcccgc ccctacgaat gttctgagtg 541 tgagatcacc ttccgctata agcagcagct ggccgcacat ctgcgcagcc actctgggtg 601 ggggtcttgt acacctgagg agccagagga gagccttagg cccaggccac ggctgaaacc 661 acagaccaaa aaggccaagc tgcatcagtg tgatgtgtgc ctgaggagct tcagctgcaa 721 ggtgagcctg gtgacccatc agcgttgcca cctgcaggag gggcccagtg ccggccagca 781 tgtccaagag aggttctcac ccaacagcct ggttgccctg cctggccaca tcccttggag 841 gaaaagccgg agttccctca tctgtggtta ctgtggcaag agcttcagtc acccatctga 901 cttggtgcgg caccagcgca tccacacggg tgagcggccc tacagctgca ctgagtgtga 961 gaagagcttt gtccagaagc agcacctcct gcagcaccag aagatccacc agcgggagcg 1021 gggtgggctg gccctggagc ccggaaggcc caatggcctg ctttaagggt gcagcccctc 1081 gcccgtctgg gggatggagg ggggtggcat tggttccccc gaagagacac tgcagtcagg 1141 gactgagttc ttcctgaggg cagttgtttg tgattgcctt cccttgtccc agtaccaagc 1201 caagcccaaa ggctgtcctg aaaaccctgt ggaagaagag tccaggccag gtcttcatcc 1261 tgctgccaag tttgctgttt cttggcacct tcaggtctct ggttttctca ttcatgccaa 1321 tgcttgtggg ctggggttgg cgttctgacc ccacagggac tggtggctgg ttccagggct 1381 cgtcccggca tttcatgtct tcccacgggg ttgagtcggg ccataggggt gagcagctgc 1441 ctggaagagt tctgggaagt ataaccctcc attttttctt gttttataat ctctttgttt 1501 aataataagt agaagaaata atttaaatga actgcttagc cctgctctga ataacctttt 1561 ttggaattag attttagttg attttttaaa gaatacagcc caacacttgt tttttacatt 1621 ttaagagttg tagaagttgt tcctaacttg gggactggac ctaccctcta ggagggagtt 1681 gttaatgggg ctcttttagc ccactgatgc ttacttaggc cggagagcag gggacacggt 1741 gctaggttcc ctcgtgcagt gcctggtgct ctcaaattgt ctcaaaagga ccaagaggaa 1801 aagagtcgga ggggtagacc ctgcagcccc tgttgagaag aaagattcca gtgaagttgc 1861 tgatggtatg gctgtggtct gggacttgcg gtgtctcggc atacccctct cctcttccca 1921 cctcctcctg gctatgttct gcagcctccc agagtagaaa actactttgt tacttaaggt 1981 tgttcacctt gtaccagtgg ttatttgagt ttgttcctat tacaccaatc cttacttgag 2041 gtggttcgga ttacagtttc caaatgcatt ctgggattgc ctattccaga gagggtacag 2101 aaaaagcaca cagatggctt gtctcaggag tgttgaatgt gtgccccgct gctgtctggg 2161 gggatgggag tgggctctgg ggtcatatgt gaacatcccc ttggatgatt tgcggttgct 2221 tagaataaaa cttgc // LOCUS HSU38894 2373 bp mRNA PRI 03-OCT-1996 DEFINITION Human protein tyrosine kinase t-Ror1 (Ror1) mRNA, complete cds. ACCESSION U38894 NID g1589739 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2373) AUTHORS Reddy,U.R., Phatak,S. and Pleasure,D. TITLE Human neural tissues express a truncated Ror1 receptor tyrosine kinase, lacking both extracellular and transmembrane domains JOURNAL Oncogene 13 (7), 1555-1559 (1996) MEDLINE 97030043 REFERENCE 2 (bases 1 to 2373) AUTHORS Reddy,U.R., Phatak,S. and Pleasure,D. TITLE Direct Submission JOURNAL Submitted (18-OCT-1995) Usha R. Reddy, Neurology Research, Children's Hospital of Philadelphia, 34th Civic Center Blvd., Philadelphia, PA 19104, USA FEATURES Location/Qualifiers source 1..2373 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="NT2" gene 990..2156 /gene="Ror1" CDS 990..2156 /gene="Ror1" /function="protein kinase" /note="alternatively spliced form t-Ror1" /codon_start=1 /product="tyrosine kinase t-Ror1" /db_xref="PID:g1589740" /translation="MLFEYINQGDLHEFLIMRSPHSDVGCSSDEDGTVKSSLDHGDFL HIAIQIAAGMEYLSSHFFVHKDLAARNILIGEQLHVKISDLGLSREIYSADYYRVQSK SLLPIRWMPPEAIMYGKFSSDSDIWSFGVVLWEIFSFGLQPYYGFSNQEVIEMVRKRQ LLPCSEDCPPRMYSLMTECWNEIPSRRPRFKDIHVRLRSWEGLSSHTSSTTPSGGNAT TQTTSLSASPVSNLSNPRYPNYMFPSQGITPQGQIAGFIGPPIPQNQRFIPINGYPIP PGYAAFPAAHYQPTGPPRVIQHCPPPKSRSPSSASGSTSTGHVTSLPSSGSNQEANIP LLPHMSIPNHPGGMGITVFGNKSQKPYKIDSKQASLLGDANIHGHTESMISAEL" BASE COUNT 661 a 546 c 495 g 671 t ORIGIN 1 gcacgagcgg ttctgagcat tagtttgaga actcgttccc gaatgtgctt tcctccctct 61 cccctgccca cctcaagttt aataaataag gttgtacttt tcttactata aaataaatgt 121 ctgtaactgc tgtgcactgc tgtaaacttg ttagagaaaa aaataacctg catgtgggct 181 cctcagttat tgagtttttg tgatcctatc tcagtctggg ggggaacatt ctcaagaggt 241 gaaatacaga aagccttttt ttcttgatct tttcccgaga ttcaaatctc cgattcccat 301 ttgggggcaa gtttttttct tcaccttcaa tatgagaatt cagcgaactt gaaagaaaaa 361 tcatctgtga gttccttcag gttctcactc atagtcatga tccttcagag ggaatatgca 421 ctggcgagtt taaagtaagg gctatgatat ttgatggtcc caaagtacgg cagctgcaaa 481 aagtagtgga aggaaattgt ctacgtgtct tggaaaaatt agttaggaat ttggatgggt 541 aaaaggtacc cttgccttac tccatcttat tttcttagcc ccctttgagt gttttaactg 601 gtttcatgtc ctagtaggaa gtgcattctc catcctcatc ctctgccctc ccaggaagtc 661 agtgattgtc tttttgggct tcccctccaa aggaccttct gcagtggaag tgccacatcc 721 agttcttttc ttttgttgct gctgtgttta gataattgaa gagatctttg tgccacacag 781 gatttttttt ttttttaaga aaaacctata gatgaaaaat tactaatgaa actgtgtgta 841 cgtgtctgtg cgtgcaacat aaaaatacag tagcacctaa ggagcttgaa tcttggttcc 901 tgtaaaattt caaattgatg tggtattaat aaaaaaaaaa aaaaaaaact cgctcgtgcc 961 gaattcggca cgaggaacaa cctgtgtgca tgctttttga gtatattaat cagggggatc 1021 tccatgagtt cctcatcatg agatccccac actctgatgt tggctgcagc agtgatgaag 1081 atgggactgt gaaatccagc ctggaccacg gagattttct gcacattgca attcagattg 1141 cagctggcat ggaatacctg tctagtcact tctttgtcca caaggacctt gcagctcgca 1201 atattttaat cggagagcaa cttcatgtaa agatttcaga cttggggctt tccagagaaa 1261 tttactccgc tgattactac agggtccaga gtaagtcctt gctgcccatt cgctggatgc 1321 cccctgaagc catcatgtat ggcaaattct cttctgattc agatatctgg tcctttgggg 1381 ttgtcttgtg ggagattttc agttttggac tccagccata ttatggattc agtaaccagg 1441 aagtgattga gatggtgaga aaacggcagc tcttaccatg ctctgaagac tgcccaccca 1501 gaatgtacag cctcatgaca gagtgctgga atgagattcc ttctaggaga ccaagattta 1561 aagatattca cgtccggctt cggtcctggg agggactctc aagtcacaca agctctacta 1621 ctccttcagg gggaaatgcc accacacaga caacctccct cagtgccagc ccagtgagta 1681 atctcagtaa ccccagatat cctaattaca tgttcccgag ccagggtatt acaccacagg 1741 gccagattgc tggtttcatt ggcccgccaa tacctcagaa ccagcgattc attcccatca 1801 atggataccc aatacctcct ggatatgcag cgtttccagc tgcccactac cagccaacag 1861 gtcctcccag agtgattcag cactgcccac ctcccaagag tcggtcccca agcagtgcca 1921 gtgggtcgac tagcactggc catgtgacta gcttgccctc atcaggatcc aatcaggaag 1981 caaatattcc tttactacca cacatgtcaa ttccaaatca tcctggtgga atgggtatca 2041 ccgtttttgg caacaaatct caaaaaccct acaaaattga ctcaaagcaa gcatctttac 2101 taggagacgc caatattcat ggacacaccg aatctatgat ttctgcagaa ctgtaaaatg 2161 cacaactttt gtaaatgtgg tatacaggac aaactagacg gccgtagaaa agatttatat 2221 tcaaatgttt ttattaaagt aaggttctca tttagcagac atcgcaacaa gtaccttctg 2281 tgaagtttca ctgtgtctta ccaagcagga cagacactcg gccagaaaaa aaaaaaaaaa 2341 aaaaaactcg agggggggcc cgtacccgat cgc // LOCUS HSU38904 1805 bp mRNA PRI 02-APR-1996 DEFINITION Human zinc finger protein C2H2-25 mRNA, complete cds. ACCESSION U38904 NID g1199603 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1805) AUTHORS Becker,K.G., Nagle,J.W., Canning,R.D., Biddison,W.E., Ozato,K. and Drew,P.D. TITLE Rapid isolation and characterization of 118 novel C2H2-type zinc finger cDNAs expressed in human brain JOURNAL Hum. Mol. Genet. 4 (4), 685-691 (1995) MEDLINE 95359976 REFERENCE 2 (bases 1 to 1805) AUTHORS Becker,K.G., Canning,R.D., Nagle,J.W., Gado,A. and Biddison,W.E. TITLE C2H2-25:Isolation of an eleven fingered zinc finger gene from human brain JOURNAL Unpublished REFERENCE 3 (bases 1 to 1805) AUTHORS Becker,K.G. TITLE Direct Submission JOURNAL Submitted (18-OCT-1995) Kevin G. Becker, Neuroimmunology Branch, NINDS/NIH, Building 10, Rm. 5B04, 9000 Rockville Pike, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..1805 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" CDS 115..1218 /codon_start=1 /product="zinc finger protein C2H2-25" /db_xref="PID:g1199604" /translation="MRFLHQDATQTGEKPNNSNKCAVAFYSGKSHHNWGKCSKAFSHI DTLVQDQRILTREGLFECSKCGKACTRRCNLIQHQKVHSEERPYECNECGKFFTYYSS FIIHQRVHTGERPYACPECGKSFSQIYSLNSHRKVHTGERPYECGECGKSFSQRSNLM QHRRVHTGERPYECSECGKSFSQNFSLIYHQRVHTGERPHECNECGKSFSRSSSLIHH RRLHTGERPYECSKCGKSFKQSSSFSSHRKVHTGERPYVCGECGKSFSHSSNLKNHQR VHTGERPVECSECSKSFSCKSNLIKHLRVHTGERPYECSECGKSFSQSSSLIQHRRVH TGKRPYQCSQCGKSFGCKSVLIQHQRVHIGEKP" misc_feature 298..360 /note="encodes zinc finger" misc_feature 382..444 /note="encodes zinc finger" misc_feature 466..528 /note="encodes zinc finger" misc_feature 550..612 /note="encodes zinc finger" misc_feature 634..696 /note="encodes zinc finger" misc_feature 718..780 /note="encodes zinc finger" misc_feature 802..864 /note="encodes zinc finger" misc_feature 886..948 /note="encodes zinc finger" misc_feature 970..1032 /note="encodes zinc finger" misc_feature 1054..1116 /note="encodes zinc finger" misc_feature 1138..1200 /note="encodes zinc finger" BASE COUNT 537 a 396 c 405 g 467 t ORIGIN 1 gaggcacctt tcagaagtta tgtggacact gcctcgttta cacagagttg catagtccat 61 gtgtcggaga aaccctttac ctgcagggag atcaggaaag acttcctggc caacatgagg 121 tttctccatc aagacgccac tcaaacaggg gagaagccaa ataacagtaa caagtgtgcg 181 gtggcctttt acagtggaaa aagtcatcac aactggggaa aatgcagtaa agcctttagc 241 cacatagaca cacttgttca ggaccagaga atcctcacta gagaaggact ttttgagtgc 301 agtaaatgtg ggaaagcatg tacgcgaaga tgtaacctca ttcagcacca gaaagtccac 361 agtgaagaaa ggccttatga atgcaatgaa tgtggaaaat tctttaccta ctactccagt 421 ttcattatac atcagagagt tcatactgga gaaaggcctt atgcgtgccc tgaatgtggg 481 aaatcgttta gtcagatata cagcctcaat agccatagga aagttcacac tggagaaagg 541 ccttatgaat gtggggaatg tgggaaatct tttagccaaa ggtccaacct catgcagcat 601 cgcagagttc acactggaga aaggccttat gaatgcagcg aatgtgggaa atcttttagc 661 caaaacttta gcctgatcta ccaccagaga gttcacactg gagaaagacc tcatgagtgc 721 aatgaatgtg gaaaatcctt tagccgaagc tccagcctca ttcaccaccg gagacttcac 781 actggagaaa gaccctatga gtgcagtaaa tgtgggaagt catttaagca aagctccagc 841 ttcagttcac atcggaaagt ccacacaggg gaaaggcctt atgtgtgtgg ggaatgtggg 901 aaatccttta gccatagctc caaccttaag aaccaccaga gagttcacac tggagaaaga 961 cctgttgagt gcagtgaatg tagcaaatcc tttagctgta aatctaacct cattaaacac 1021 ctgagagttc acactggaga aaggccttat gagtgcagtg aatgtgggaa atcctttagc 1081 caaagttcta gcctcattca acaccgcaga gttcacacgg gaaaaaggcc ttatcagtgc 1141 agtcaatgtg ggaaatcctt tggctgcaaa tctgtcctca ttcaacacca gagagttcac 1201 attggagaaa agccttagct gtactgagaa tatgcaattt ccttttagtg taattatact 1261 gaaggagtac acctgtgaga gagacaagta cctgatttgg aagccccaac atctaaggat 1321 atacagtggg cggattcccc ttaagttcca ggtatgtgtt acactttcta acatgccatt 1381 tagaaagtgt tagactttct cacctgccat ttatggctct tgccgtttat gtcactgaca 1441 gtttctgagg cagaagccgt atcatgtcta ccacctgtga ggtccacaca gtgtgtatca 1501 tttacctcct gaacctgctc aaggaagcag acctctgctt ctccccattt gctagaagaa 1561 atcatgaata gtctgagtct tcctctctga caagttaggg catggacttg acccagcttt 1621 gtgccagaga acccaatatg agtgttgttg gcagcttgcc aagaaggact gtctttttca 1681 agacatactg gtttcatgtg acacctccat ggattttttt ccagcctcta agtcaccaac 1741 ttgggaactg cttgtctcac gttgctttgt tttttacaat aataaaagca ttattattta 1801 agcta // LOCUS HSU39196 3181 bp mRNA PRI 28-APR-1997 DEFINITION Human clone hGIRK1 G-protein coupled inwardly rectifying potassium channel mRNA, complete cds. ACCESSION U39196 NID g1055027 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Chan,K.W., Langan,M.N., Sui,J.L., Kozak,J.A., Pabon,A., Ladias,J.A. and Logothetis,D.E. TITLE A recombinant inwardly rectifying potassium channel coupled to GTP-binding proteins JOURNAL J. Gen. Physiol. 107 (3), 381-397 (1996) MEDLINE 97021689 REFERENCE 2 (bases 1 to 3181) AUTHORS Chan,K.W. TITLE Direct Submission JOURNAL Submitted (23-OCT-1995) Kim W. Chan, Mount Sinai School of Medicine, Physiology and Biophysics, One Gustave L. Levy Place, New York, NY 10029-6574, USA FEATURES Location/Qualifiers source 1..3181 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hGIRK1" /tissue_type="brain" /dev_stage="fetus" CDS 1361..2866 /codon_start=1 /evidence=experimental /product="G-protein coupled inwardly rectifying potassium channel" /db_xref="PID:g1055028" /translation="MSALRRKFGDDYQVVTTSSSGSGLQPQGPGQDPQQQLVPKKKRQ RFVDKNGRCNVQHGNLGSETSRYLSDLFTTLVDLKWRWNLFIFILTYTVAWLFMASMW WVIAYTRGDLNKAHVGNYTPCVANVYNFPSAFLFFIETEATIGYGYRYITDKCPEGII LFLFQSILGSIVDAFLIGCMFIKMSQPKKRAETLMFSEHAVISMRDGKLTLMFRVGNL RNSHMVSAQIRCKLLKSRQTPEGEFLPLDQLELDVGFSTGADQLFLVSPLTICHVIDA KSPFYDLSQRSMQTEQFEIVVILEGIVETTGMTCQARTSYTEDEVLWGHRFFPVISLE EGFFKVDYSQFHATFEVPTPPYSVKEQEEMLLMSSPLIAPAITNSKERHNSVECLDGL DDITTKLPSKLQKITGREDFPKKLLRMSSTTSEKAYSLGDLPMKLQRISSVPGNSEEK LVSKTTKMLSDPMSQSVADLPPKLQKMAGGAARMEGNLPAKLRKMNSDRFT" BASE COUNT 908 a 748 c 734 g 791 t ORIGIN 1 ctttgattta gtaacataag gtatagatgc atgaaaacat aaacagtaag gccaggcaca 61 gtggctcacg cctgtaatcc caacactttg ggaggccgag gccgggggat cacgaggtca 121 ggagatggag accatcctgg ctaacacggt gaccccccat ctctactaaa agtacaaaat 181 attagccggg cgtggtggtg ggcgcctgta gtcccaggta ctggggaggc tgaggcagaa 241 gcatggcgtg aaccgggagg cggagcttgc agtgagccga gattgcacca ctgcactcca 301 gcctgggtga cagagcaaga cttcgtctaa aaaaaaaaca aaaaaaaaca aaaaaaccat 361 aaacaatata ggagtgtcat aggattataa atagatggca tggataataa aatctacaag 421 tttagagaaa attcgtaatt ataggtgaga agtattaatg aaaggacaaa gcttgttctt 481 aaaggatcag aataggccaa gatgaaaaaa ggcatccatg ggaaaagaac tgttcatttc 541 tataatttta acaaaataga taatctatgt tgaataaaat ccataaatac tatttccaaa 601 tttgattaag gtatttactt tgaaaacctg aaacaggaac atatttacag tgttaagtga 661 tttgataatt ttgtagttct gccctaaaaa tcaatatcct taattaaccc ttcatcagtt 721 tcttaagtca tctgtcatcc tgaaagcaag ttaattattc taaatgttaa ataattttgg 781 tagtgtctaa tctataaaac atacatacac taccagtttt tttctacctt taacaaattc 841 gttcccataa aagacaattt tatcactaac aaaggataac tggcaggcag gtaaaggcag 901 cttcttaggc agccagcgag ttggcaggtg ggttgggact cgcttgtcaa ctcattaatt 961 ttaagcagtc agtttggggg tactgcagga taatcaggag ggccgtgggg agtcaagagg 1021 tgacccggga tgccggtggt ggggaaagaa aagaggagcc tctggaagct tggaggcaaa 1081 attgcgcttg ggttcctgtt ccttgcatcc ctcctggctt gagtgcggga gaacactttt 1141 taaagactca ccttggaaag aaggcctccg tcccagggga gaaggagagg cgtctgcagg 1201 gggcagagac cgcagctacc tgccgggtgc gccccccacc caggagcgct cgcttcgccc 1261 cctttcctcc cccgccccca cctccttatt ggtgctagtt tgcagcgccc agctcctgcg 1321 ccttcgcttc gcgtttgaat ctggctcgcc ccttcgtatt atgtctgcac tccgaaggaa 1381 atttggggac gattatcagg tagtgaccac atcgtccagc ggctcgggct tgcagcccca 1441 ggggccaggc caggaccctc agcagcagct tgtgcccaag aagaagcggc agcggttcgt 1501 ggacaagaac ggccggtgca atgtacagca cggcaacctg ggcagcgaga caagccgcta 1561 cctctcggac ctcttcacca cgctggtgga cctcaagtgg cgctggaacc tcttcatctt 1621 cattctcacc tacaccgtgg cctggctttt catggcgtcc atgtggtggg tgatcgccta 1681 cactcggggc gacctgaaca aagcccacgt cggtaactac acgccttgcg tggccaatgt 1741 ctataacttc ccttctgcct tcctcttctt catcgagacg gaggccacca tcggctatgg 1801 ctaccgatac atcacagaca agtgccccga gggcatcatc ctcttcctct tccagtccat 1861 cctgggctcc atcgtggacg ccttcctcat cggctgcatg ttcatcaaga tgtcccagcc 1921 caagaagcgc gccgagaccc tcatgttcag cgagcacgcg gtgatctcca tgagggacgg 1981 aaaactcacg cttatgttcc gggtgggcaa cctgcgcaac agccacatgg tctccgcgca 2041 gattcgctgc aagctgctca aatctcggca gacacctgag ggtgagttcc ttccccttga 2101 ccaacttgaa ctggatgtag gttttagtac aggggcagat caactttttc ttgtgtcccc 2161 cctcacaatt tgccacgtga tcgatgccaa aagccccttt tatgacctat cccagcgaag 2221 catgcaaact gaacagttcg agattgtcgt catcctagaa ggcattgtgg aaacaactgg 2281 gatgacttgt caagctcgaa catcatatac tgaagatgaa gttctttggg gtcatcgttt 2341 ttttcctgta atttccttag aagagggatt ctttaaagtt gattattccc agttccatgc 2401 aacatttgaa gtccccaccc caccttacag tgtgaaagag caggaggaaa tgcttctcat 2461 gtcgtccccc ttaatagcac cagccataac taacagcaaa gaaagacata attctgtgga 2521 atgcttagat ggactagatg atattactac aaaactacca tctaagctgc agaaaattac 2581 tggaagagaa gactttccca aaaaactctt gaggatgagt tctacaactt cagaaaaagc 2641 ctacagcttg ggagacttgc ccatgaaact tcaacgaata agttcagttc cgggcaactc 2701 agaagaaaaa ctggtatcta aaaccaccaa gatgttatct gatcccatga gccagtctgt 2761 ggctgatttg ccaccaaagc ttcaaaagat ggctggagga gcagctagga tggaagggaa 2821 ccttccagcc aaattaagaa aaatgaactc tgatcgcttc acataacaaa gcactccctt 2881 aggcattatt taatgtttga tttagtaata gtccaatatt tggcgatgag gtaattctcc 2941 ctaaggaatc tgaaagtata ttttcctccc agttctacaa gcatatttga gaacccttcc 3001 tttcccaagt attgcgaatg tgcagaaagc aacagttacg gagggaggac atcataagga 3061 agttattaac gggcatgtat tatcacatca agcatgcaat aatgtgcaaa ttttgcattt 3121 agttttatgg catgatttat atatggcata tttatattgt atattctgga aaaaaaaaaa 3181 a // LOCUS HSU39317 509 bp mRNA PRI 07-MAR-1996 DEFINITION Human E2 ubiquitin conjugating enzyme UbcH5B (UBCH5B) mRNA, complete cds. ACCESSION U39317 NID g1216392 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 509) AUTHORS Jensen,J.P., Bates,P.W., Yang,M., Vierstra,R.D. and Weissman,A.M. TITLE Identification of a family of closely related human ubiquitin conjugating enzymes JOURNAL J. Biol. Chem. 270 (51), 30408-30414 (1995) MEDLINE 96107191 REFERENCE 2 (bases 1 to 509) AUTHORS Jensen,J.P., Bates,P.W., Yang,M., Vierstra,R.D. and Weissman,A.M. TITLE Direct Submission JOURNAL Submitted (24-OCT-1995) Allan M. Weissman, Laboratory of Immune Cell Biology, NCI/NIH, Bldg. 10 Rm. 1B34, 9000 Rockville Pike, Bethesda, MD 20892-1152, USA FEATURES Location/Qualifiers source 1..509 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="peripheral blood lymphocyte" gene 23..466 /gene="UBCH5B" CDS 23..466 /gene="UBCH5B" /note="Transcript is widely expressed. Related to S. cerevisiae UBC4 and UBC5. Closely related to human UbcH5(A) and to UbcH5C" /codon_start=1 /function="E2 ubiquitin conjugating enzyme; functions with E6-AP in ubiquitin conjugation" /product="UbcH5B" /db_xref="PID:g1145689" /translation="MALKRIHKELNDLARDPPAQCSAGPVGDDMFHWQATIMGPNDSP YQGGVFFLTIHFPTDYPFKPPKVAFTTRIYHPNINSNGSICLDILRSQWSPALTISKV LLSICSLLCDPNPDDPLVPEIARIYKTDREKYNRIAREWTQKYAM" BASE COUNT 160 a 111 c 102 g 136 t ORIGIN 1 gcacgggtca ccgcatcaca ccatggctct gaagagaatc cacaaggaat tgaatgatct 61 ggcacgggac cctccagcac agtgttcagc aggtcctgtt ggagatgata tgttccattg 121 gcaagctaca ataatggggc caaatgacag tccctatcag ggtggagtat ttttcttgac 181 aattcatttc ccaacagatt accccttcaa accacctaag gttgcattta caacaagaat 241 ttatcatcca aatattaaca gtaatggcag catttgtctt gatattctac gatcacagtg 301 gtctccagca ctaactattt caaaagtact cttgtccatc tgttctctgt tgtgtgatcc 361 caatccagat gatcctttag tgcctgagat tgctcggatc tacaaaacag atagagaaaa 421 gtacaacaga atagctcggg aatggactca gaagtatgcg atgtaattaa acaaattatt 481 ggataacctc tacaaataaa gatagggga // LOCUS HSU39360 1982 bp mRNA PRI 03-OCT-1997 DEFINITION Homo sapiens DNA-binding protein (CROC-1A) mRNA, complete cds. ACCESSION U39360 NID g1066079 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1982) AUTHORS Rothofsky,M.L. and Lin,S.L. TITLE CROC-1 encodes a protein which mediates transcriptional activation of the human FOS promoter JOURNAL Gene 195 (2), 141-149 (1997) MEDLINE 97449289 REFERENCE 2 (bases 1 to 1982) AUTHORS Lin,S.L. TITLE Direct Submission JOURNAL Submitted (25-OCT-1995) Stanley L. Lin, Tumor Biology, Schering-Plough Research Institute, 2015 Galloping Hill Road, Kenilworth, NJ 07033, USA FEATURES Location/Qualifiers source 1..1982 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="CROC-1A" /tissue_type="brain" gene 1..1982 /gene="CROC-1A" CDS 70..582 /gene="CROC-1A" /codon_start=1 /function="transcriptional activation of c-fos proto-oncogene promoter" /product="DNA-binding protein" /db_xref="PID:g1066080" /translation="MPGEVQASYLKSQSKLSDEGRLEPRKFHCKGSKSPSQFRLLEEL EEGQKGVGDGTVSWGLEDDEDMTLTRWTGMIIGPPRTIYENRIYSLKIECGPKYPEAP PFVRFVTKINMNGVNSSNGVVDPRAISVLAKWQNSYSIKVVLQELRRLMMSKENMKLP QPPEGQCYSN" polyA_signal 1959..1964 /gene="CROC-1A" BASE COUNT 548 a 486 c 415 g 533 t ORIGIN 1 aggaaagcat tttatctcca cagcaatcct atgaggttga tactactatc ctcatagaag 61 gggaaactga tgccaggaga ggttcaagcg tcttacctga agtcacaaag caaactgagt 121 gatgaaggaa gacttgaacc tagaaaattt cactgcaaag ggagtaaaag tccctcgcaa 181 tttcgactgt tggaagaact cgaagaaggc cagaaaggag taggagatgg cacagttagc 241 tggggtctag aagatgacga agacatgaca cttacaagat ggacagggat gataattggg 301 cctccaagaa caatttatga aaaccgaata tacagcctta aaatagaatg tggacctaaa 361 tacccagaag cacccccctt tgtaagattt gtaacaaaaa ttaatatgaa tggagtaaat 421 agttctaatg gagtggtgga cccaagagcc atatcagtgc tagcaaaatg gcagaattca 481 tatagcatca aagttgtcct gcaagagctt cggcgcctaa tgatgtctaa agaaaatatg 541 aaactccctc agccgcccga aggacagtgt tacagcaatt aatcaaaaag aaaaaccaca 601 ggcccttccc cttcccccca attcgattta atcagtcttc attttccaca gtagtaaatt 661 ttctagatac gtcttgtaga cctcaaagta ccggaaagga agctcccatt caaaggaaat 721 ttatcttaag atactgtaaa tgatactaat tttttgtcca tttgaaatat ataagttgtg 781 ctataacaaa tcatcctgtc aagtgtaacc actgtccacg tagttgaact tctgggatca 841 agaaagtcta tttaaattga ttcccatcat aactggtggg gcacatctaa ctcaactgtg 901 aaaagacaca tcacacaatc accttgctgc tgattacacg gcctggggtc tctgccttct 961 ccctttaccc tcccgcctcc caccctccct gcaacaacag ccctctagcc tggggggctt 1021 gttagagtag atgtgaaggt ttcaggtcgc agcctgtggg actactgcta ggtgtgtggg 1081 gtgtttcgcc tgcacccctg gttcctttaa gtcttaagtg atgccccttc caaaccatca 1141 tcctgtcccc acgctcctcc actcccgccc ttggccgaag catagattgt aacccctcca 1201 ctcccctctg agattggctt cggtgaggaa ttcagggctt tccccatatc ttctctcccc 1261 ccacctttat cgaggggtgc tgctttttct ccctcctcct caagttcctt tttgcaccgt 1321 caccacccaa caccttccat gacacttcct tgctttggcc agaagccatc aggtaaggtt 1381 ggaaagagcc tctgacctcc cttgtttagt tttggaacca tactcactca ctctccacca 1441 gcctgggaaa tgaatattgg gtcctcagcc ctgccaccct ctgctgtcat cagctgatgc 1501 attgttttta gctcaggttt tgataaggtg aaaagaatag tcaccagggt tactcagacc 1561 tgccagctct cggagtcctt ggtggttgaa cttggagaaa gaccgcatga agatacttgt 1621 aagcacacat gatccctctg aattgtttta ctttcctgta actgcttttg cttttaaaaa 1681 ttgaagaagt tttaaacagg gctttcattt ggtcatcctt gcaatccatt ggggtctagt 1741 ttggaatctg acaactggaa caaaaagaac cttgaatccg gtgcatgcct tggttttggt 1801 gctgctgctg cttcccaaga tcctcagcag ggattaagaa ggaacccggt gtgcacagca 1861 gatccccgaa attggtgggc ttgacctcct ggcaaattgc tgcgtctttc cacttgctgt 1921 tcaggaccac taaatgctga aatgtggatg cataccgaaa taaaagcaat tcattgtgta 1981 ct // LOCUS HSU39400 2008 bp mRNA PRI 14-NOV-1996 DEFINITION Human NOF1 mRNA, complete cds. ACCESSION U39400 NID g1234796 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2008) AUTHORS Kas,K., Lemahieu,V., Meyen,E., Van de Ven,W.J.M. and Merregaert,J. TITLE Isolation, cDNA, and genomic structure of a conserved gene (NOF) at chromosome 11q13 next to FAU and oriented in the opposite transcriptional orientation JOURNAL Genomics 34 (1), 433-436 (1996) MEDLINE 96299700 REFERENCE 2 (bases 1 to 2008) AUTHORS Kas,K., Lemahieu,V., Meyen,E., Van de Ven,W.J.M. and Merregaert,J. TITLE Direct Submission JOURNAL Submitted (25-OCT-1995) Koen Kas, Lab for Molecular Oncology, Center for Human Genetics, Herestraat 49, Leuven 3000, Belgium FEATURES Location/Qualifiers source 1..2008 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11q13" CDS 14..514 /note="neighbor of FAU" /codon_start=1 /product="NOF1" /db_xref="PID:g1234797" /translation="MAATMFRATLRGWRTGVQRGCGLRLLSQTQGPPDYPRFVESVDE YQFVERLLPATRIPDPPKHEHYPTPSGWQPPRDPPPNLPYFVRRSRMHNIPVYKDITH GNRQMTVIRKVEGDIWALQKDVEDFLSPLLGKTPVTQVNEVTGTLRIKGYFDQELKAW LLEKGF" repeat_region complement(778..1064) /note="Alu-Sx subfamily" /rpt_family="Alu" BASE COUNT 463 a 528 c 531 g 486 t ORIGIN 1 gtagcaggta aagatggcag ctaccatgtt ccgggctacg ctgcggggat ggagaaccgg 61 tgtccagcgg ggctgcgggc tacggctgtt gagccagacc cagggccctc cagattaccc 121 caggtttgtg gagtctgtgg atgaatatca gtttgtggag cgcctgttac cggctaccag 181 gatcccagat cccccaaagc atgaacatta tcctacccct agtggctggc agcctcccag 241 agacccccca cccaacctgc cttactttgt acgacgctct cggatgcaca acatccccgt 301 ctacaaggac atcacgcatg gcaaccggca gatgactgtg atccggaaag tggaagggga 361 catctgggcc ctgcagaaag acgtggaaga ttttctgagc ccgctgctgg ggaagacacc 421 tgtcacccag gtcaatgagg tgacaggtac cctacggatc aagggctact ttgaccagga 481 gcttaaagcc tggctcttgg agaaaggctt ctgaggccca gccgagcagc ctgcttgtca 541 gcatgccctg tggatcaagt ctagggggcc tcaggaggag ggaggtgggt gttggagccc 601 ctgagacagg ggatacagaa actagggcta aaggactttg gggtcaggcc ttgcttgcat 661 aaaggagaaa acaactctat gtacatgctg ggggagagtg cctaatgtgg gagaccaaat 721 agggatcacc aggctaatgg ggggcgtcac gagctttctc tccctcctat cttggcctgt 781 tcttttttgt tttttgagac ggagtctcac tctgttgccc agggtggagt gcagtggcat 841 gatcttggct cactgtcaac ttccacctct gggatcaagg attctcctgc ctcagcctct 901 tgagtagctg ggattacagg cgcccaccac cacagcctgc taatttttgt atttttagta 961 gagatggggt ttcaccatgt tggccaggct ggtctcaaac tcctgactcg aagtgatccg 1021 cccaccttgg cctcccaaag cgttgggatt ataggcatga gccatgtgcc tggtccacct 1081 tggcctgttt tgtttttctt tccttgggct cagcaattca aattctagtt gttatttggt 1141 ggaagcagta gcccaacccc agtttagggg aaggtagcac agggcagagc cactgggcac 1201 tttgtttcct tggccctccg aagctcactg ttgcaaatac ccccaagcct ttgctctagg 1261 ccagatcttg tttggtgcag gtgatggaga acacagatga ctcgggcatg ggtcttggag 1321 atcttctgtt caaagtacag tgctggcact ggcacagagt gcccacgtta gccccgggct 1381 ctgatagaga ggtaggaggc acgttcttgg tcactgttcc attgcagacc agacttgctg 1441 gcctgaccac aagggagtgg ctgggaactc acagccagca tagggacatc cccctgcagc 1501 cttctgacct gcaatcaagg ctggggaggg gtttgcaggc aggaatatgc tgacctttca 1561 ccctgccatc ccatcccaac cccagctcac tagccttcat atatgcctta tacttggagt 1621 cacaggggcc aaaggcctga gaccccaccc tgcccccaaa ctggctaaga cagctttcag 1681 ttcctgactc cccaacttgg tctctgccct gaagcagggc actgaactct gggctgcttc 1741 tctgtgtgta aaatgggcac atcttcctaa tctgttaatg gtcagtggtg tccccaagga 1801 tagtgctggc ttccatggaa accctcactc ctggagattc cattccattt tcaagtgtac 1861 agccacagca aggagcccga cactgatttg atcgattctg tgacacaaac cccaccaatt 1921 gttaatgcaa gtttttattt ggctgtatat acaatttaag ctattaaaat ttgtacaata 1981 tttacaaatt aaaaaaaaaa aaaaaaaa // LOCUS HSU39412 1279 bp mRNA PRI 09-SEP-1997 DEFINITION Homo sapiens alpha SNAP mRNA, complete cds. ACCESSION U39412 NID g1066083 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1279) AUTHORS Lemons,P.P., Chen,D., Bernstein,A.M., Bennett,M.K. and Whiteheart,S.W. TITLE Regulated secretion in platelets: identification of elements of the platelet exocytosis machinery JOURNAL Blood 90 (4), 1490-1500 (1997) MEDLINE 97413351 REFERENCE 2 (bases 1 to 1279) AUTHORS Chen,D., Shao,H.P. and Whiteheart,S.W. TITLE Direct Submission JOURNAL Submitted (25-OCT-1995) Dong Chen, Biochemistry, University of Kentucky, 800 Rose Street, Lexington, KY 40536, USA FEATURES Location/Qualifiers source 1..1279 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="Platelet" CDS 68..955 /codon_start=1 /product="alpha SNAP" /db_xref="PID:g1066084" /translation="MDNSGKEAEAMALLAEAERKVKNSQSFFSGLFGGSSKIEEACEI YARAANMFKMAKNWSAAGNAFCQAAQLHLQLQSKHDAATCFVDAGNRFKKADPQEAIN CLMRAIEIYTDMGRFTIAAKHHISIAEIYEIELVDIEKAIAHYEQSADYYKGEESNSS ANKCLLKVAGYAALLEQYQKAIDIYEQVGTNAMDTPLLKYSAKDYFFKAALCHFCIDM LNAKLAVQKYEELFPAFSDSRECKLMKKLLEAHEEQNVDSYTESVKEYDSISRLDQWL TTMLLRIKKTIQDDEEDLR" BASE COUNT 322 a 377 c 337 g 243 t ORIGIN 1 gcggcccggc ggctgagtct tcccagggtc agggtcaggc gctttgctga gtccctttgt 61 ggccgccatg gacaattccg ggaaggaagc ggaggcgatg gcgctgttgg ccgaggcgga 121 gcgcaaagtg aagaactcgc agtccttctt ctctggcctc tttggaggct catccaaaat 181 agaggaagca tgcgaaatct acgccagagc agcaaacatg ttcaaaatgg ccaaaaactg 241 gagtgctgct ggaaacgcgt tctgccaggc tgcacagctg cacctgcagc tccagagcaa 301 gcacgacgca gccacctgct ttgtggacgc tggcaaccga ttcaagaaag ccgaccccca 361 agaggccatt aactgtttga tgcgagcaat cgagatctac acagacatgg gccgattcac 421 gattgcggcc aagcaccaca tctccattgc tgagatctat gagatagagt tggtggacat 481 cgagaaggcc attgcccact acgagcagtc tgcagactac tacaaaggcg aggagtccaa 541 cagctcagcc aacaagtgtc tgctgaaggt ggctggttac gctgcgctgc tggagcagta 601 tcagaaggcc attgacatct acgaacaggt ggggaccaat gccatggaca cccccctcct 661 caagtacagc gccaaagact acttcttcaa ggcggccctc tgccacttct gcatcgacat 721 gctcaacgcc aagctggctg tccaaaagta tgaggagctg ttcccagctt tctctgattc 781 ccgggaatgc aagttgatga aaaaattgct agaggcccac gaggagcaga atgtggacag 841 ctacaccgag tcggtgaagg aatacgactc catctcccgg ctggaccagt ggctcaccac 901 catgctgctg cgcatcaaga agaccatcca ggatgacgag gaagacctgc gctaaggcca 961 acccagcccc ccagtgcccg tcttcgctgt cccatctgct cagagagagc caagctctaa 1021 agcacatgta gccgctgaga cctgctgttt ctgctggggg caggctcctc ttcccccagc 1081 cccgggaggc tcccggagct tcctgcagcc ccgacctctc aggttagacc ctgggccctg 1141 gagcttaggg attctcccca ccccagcccc acacctgctc cttccctaat gctttgaggt 1201 tttcttggtt ggaagctgca gctggcccaa gaaagaaaat aaaaaacaac acttttgcaa 1261 aaaaaaaaaa aaaaaaccc // LOCUS HSU39573 2768 bp mRNA PRI 04-OCT-1996 DEFINITION Human salivary peroxidase mRNA, complete cds. ACCESSION U39573 NID g1209684 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2768) AUTHORS Kiser,C., Caterina,C.K., Engler,J.A., Rahemtulla,B. and Rahemtulla,F. TITLE Cloning and sequence analysis of the human salivary peroxidase-encoding cDNA JOURNAL Gene 173 (2), 261-264 (1996) MEDLINE 97082979 REFERENCE 2 (bases 1 to 2768) AUTHORS Kiser,C., Caterina,J., Engler,J.A., Rahemtulla,B. and Rahemtulla,F. TITLE Direct Submission JOURNAL Submitted (27-OCT-1995) Jeffrey A. Engler, Biochemistry and Molecular Genetics, University of Alabama at Birmingham, 1918 University Blvd., room 460 BHSB, Birmingham, AL 35294-0005, USA FEATURES Location/Qualifiers source 1..2768 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="submandibular gland" CDS 82..2220 /codon_start=1 /product="salivary peroxidase" /db_xref="PID:g1209685" /translation="MRVLLHLPALLASLILLQAAASTTRAQTTRTSAISDTVSQAKVQ VNKAFLDSRTRLKTAMSSETPTSRQLSEYLKHAKGRTRTAIRNGQVWEESLKRLRQKA SLTNVTDPSLDLTSLSLEVGCGAPAPVVRCDPCSPYRTITGDCNNRRKPALGAANRAL ARWLPAEYEDGLSLPFGWTPGKTRNGFPLPLAREVSNKIVGYLNEEGVLDQNRSLLFM QWGQIVDHDLDFAPDTELGSSEYSKAQCDEYCIQGDNCFPIMFPPNDPKAGTQGKCMP FFRAGFVCPTPPYKSLAREQINALTSFLDASFVYSSEPSLASRLRNLSSPLGLMAVNQ EVSDHGLPYLPYDSKKPSPCEFINTTARVPCFLAGDSRASEHILLATSHTLFLREHNR LARELKRLNPQWDGEKLYQEARKILGAFVQIITFRDYLPILLGDHMQKWIPPYQGYSE SVDPRISNVFTFAFRFGHLEVPSSMFRLDENYQPWGPEPELPLHTLFFNTWRMVKDGG IDPLVRGLLAKKSKLMKQNKMMTGELRNKLFQPTHRIHGFDLAAINTQRCRDHGQPGY NSWRAFCDLSQPQTLEELNTVLKSKMLAKKLLGLYGTPDNIDIWIGAIAEPLVERGRV GPLLACLLGKQFQQIRDGDRFWWENPGVFTNEQKDSLQKMSFSRLVCDNTRITKVPRD PFWANSYPYDFVDCSAIDKLDLSPWASVKN" BASE COUNT 632 a 857 c 697 g 582 t ORIGIN 1 ttgctggaag gtataaaaga ccagctcctc caagcagagc aactccctgg ctgccgtgaa 61 aagacaaggc actgggcagt gatgagggtc cttctccatc tcccagccct cctggcttcc 121 ctcatcttgc ttcaggctgc agcatctacc acaagagcgc agactaccag aacctctgcc 181 atctccgata ctgtgagtca ggccaaggtc caagtcaaca aggccttcct ggactcccga 241 accaggctga agaccgccat gagctctgag actcccacca gccgacagct ctcagaatac 301 ctcaagcatg ccaaaggccg gacgcgcaca gccatccgca atggacaggt gtgggaggag 361 tctttaaaga gactgaggca gaaggcatcc ttgaccaatg tcacagatcc cagcctggac 421 ttgacttcac tgtctctgga ggtgggctgt ggtgctcctg ctcccgtggt gagatgcgac 481 ccgtgcagcc cttaccgcac cattacggga gactgcaata acaggaggaa gcctgcgctg 541 ggcgccgcca acagggctct ggcgcgctgg ctgcccgcgg agtacgagga cgggctctcc 601 ctgcccttcg gctggacgcc ggggaagacg cgcaacggct tccctctccc gctggcccgg 661 gaggtatcta acaagattgt tggctatctg aatgaggagg gtgttctgga ccaaaacagg 721 tccctgctct tcatgcagtg gggtcagatt gtggatcatg acctggactt tgcccctgac 781 accgagctgg ggagtagcga gtactccaaa gcccagtgtg atgagtactg tatccaggga 841 gacaactgct tccccatcat gttcccaccc aatgacccca aggcggggac tcaagggaaa 901 tgcatgcctt tcttccgagc tgggttcgtc tgccccactc caccctacaa gtccctggcc 961 cgagagcaga tcaacgctct gacctccttc ctggatgcca gctttgtgta cagctccgag 1021 ccaagcctgg ccagccgcct ccgcaacctc agcagccccc tgggcctcat ggctgtcaac 1081 caggaggtct cagaccatgg actaccctac ctgccctatg acagcaagaa gccaagcccc 1141 tgtgagttca tcaacaccac tgcccgtgtg ccctgcttcc tggcaggaga ttctcgagcc 1201 tcagagcata ttctgctggc cacatcccac accctctttc tccgcgagca taaccggctg 1261 gccagagaac taaagagact caaccctcag tgggatggag agaagctcta ccaggaagcc 1321 cggaaaatcc tgggagcctt cgtgcagatt atcaccttta gggactacct acccattttg 1381 ctaggtgacc acatgcagaa gtggataccc ccatatcaag gctacagtga atctgtggat 1441 cccagaattt ccaatgtctt caccttcgcc ttccgctttg gccacttgga ggtcccctct 1501 agtatgttcc gcctggatga gaattatcag ccatgggggc cagaaccaga actccccctc 1561 cacaccctct tcttcaacac ttggaggatg gtcaaagatg gtggaattga tcctctggtg 1621 cggggcctgc tggccaagaa atccaagctg atgaaacaga ataaaatgat gactggagag 1681 ctgcgcaaca agcttttcca gccaactcac aggatccatg gctttgacct ggctgccatc 1741 aacacacagc gttgccggga ccatgggcaa cctgggtaca attcctggag agccttctgt 1801 gacctctcac agccgcagac actagaggag ttgaacacag tgctgaagag caagatgctg 1861 gccaagaagt tactgggtct ctacgggacc cctgacaaca tcgacatctg gataggggcc 1921 attgctgagc cgctggtgga aaggggtcgg gtggggcctc tcctggcctg cctcttgggc 1981 aagcagttcc agcagatccg tgatggagac aggttctggt gggaaaaccc tggggtcttc 2041 acgaacgagc agaaggactc tctacagaaa atgtccttct cacgccttgt ctgtgacaac 2101 acccgcatca ccaaggtccc acgggaccca ttctgggcca acagctaccc ctatgacttc 2161 gtggattgct cagccatcga caagctggac ctgtcaccct gggcctcagt gaagaattag 2221 gggcccgcgc tgcacaggaa agttcccttt ggtccacagg gccatttcaa gcaagttcaa 2281 tgacctggtc ccttagagct ccatatccca gtcccagccc ttctttgcag ctgggcctct 2341 ctatacccct ggatgaacag cttgctcagg ccccagggtg gctgcctcgg ccctcccagc 2401 tcttacactc agctccagtg gcttctcctt tctgtcaaga cttagccccg ctgagatgcc 2461 cttctgctcc agcttgctgg atgttacctg tcctcttccc tccacaagtc ttggccctta 2521 acctttatct ttcttcctgt cctctcacct agattgtaag ctccctgggc caggacttca 2581 gcctgcctcc gagggtcccc tgtggcacct agcatgtggc acagcacatt agaagtgctc 2641 aaaaacatct gatgactgac taaagatgct aggcgtgaca ccgttccctc caaaagcaga 2701 cctcggaatc actgccaaat aagtaactag acgtttacag gccaaaaaaa aaaaaaaaca 2761 ggaattcc // LOCUS HSU39576 2901 bp mRNA PRI 23-MAY-1996 DEFINITION Human butyrophilin precursor mRNA, complete cds. ACCESSION U39576 NID g1326082 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2901) AUTHORS Taylor,M.R., Peterson,J.A., Ceriani,R.L. and Couto,J.R. TITLE Cloning and sequence analysis of human butyrophilin reveals a potential receptor function JOURNAL Biochim. Biophys. Acta 1306 (1), 1-4 (1996) MEDLINE 96201696 REFERENCE 2 (bases 1 to 2901) AUTHORS Taylor,M.R., Peterson,J.A., Ceriani,R.L. and Couto,J.R. TITLE Direct Submission JOURNAL Submitted (27-OCT-1995) Michael R. Taylor, Cancer Research Fund of Contra Costa, 2055 N. Broadway, Walnut Creek, CA 94596, USA FEATURES Location/Qualifiers source 1..2901 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="breast" CDS 73..1653 /note="HuBut; milk fat globule (HMFG) glycoprotein; immunoglobulin fold; transmembrane protein" /codon_start=1 /product="butyrophilin precursor" /db_xref="PID:g1326083" /translation="MAVFPSSGLPRCLLTLILLQLPKLDSAPFDVIGPPEPILAVVGE DAELPCRLSPNASAEHLELRWFRKKVSPAVLVHRDGREQEAEQMPEYRGRATLVQDGI AKGRVALRIRGVRVSDDGEYTCFFREDGSYEEALVHLKVAALGSDPHISMQVQENGEI CLECTSVGWYPEPQVQWRTSKGEKFPSTSESRNPDEEGLFTVAASVIIRDTSTKNVSC YIQNLLLGQEKKVEISIPASSLPRLTPWIVAVAVILMVLGLLTIGSIFFTWRLYNERP RERRNEFSSKERLLEELKWKKATLHAVDVTLDPDTAHPHLFLYEDSKSVRLEDSRQKL PEKTERFDSWPCVLGRETFTSGRHYWEVEVGDRTDWAIGVCRENVMKKGFDPMTPENG FWAVELYGNGYWALTPLRTPLPLAGPPRRVGIFLDYESGDISFYNMNDGSDIYTFSNV TFSGPLRPFFCLWSSGKKPLTICPIADGPERVTVIANAQDLSKEIPLSPMGEESAPRD ADTLHSKLIPTQPSQGAP" sig_peptide 73..150 /product="butyrophilin" mat_peptide 151..1650 /product="butyrophilin" misc_feature 151..798 /note="encodes extracellular domain" /product="butyrophilin" misc_feature 151..495 /note="encodes Ig-like V-type domain" /product="butyrophilin" misc_feature 235..237 /note="encodes potential carbohydrate site" /product="butyrophilin" misc_feature 502..765 /note="encodes Ig-like C1-type domain" /product="butyrophilin" misc_feature 715..717 /note="encodes potential carbohydrate site" /product="butyrophilin" misc_feature 799..879 /note="encodes potential transmembrane" /product="butyrophilin" misc_feature 880..1650 /note="encodes cytoplasmic domain" /product="butyrophilin" BASE COUNT 684 a 747 c 719 g 751 t ORIGIN 1 ctccaagatc acccaggctg aagctcctga gggactcaca tcagttatct tgctgctcca 61 gaagggtggg agatggcagt tttcccaagc tccggtctcc ccagatgtct gctcaccctc 121 attctcctcc agctgcccaa actggattca gctccctttg acgtgattgg acccccggag 181 cccatcctgg ccgttgtggg tgaggacgcc gagctgccct gtcgcctgtc cccgaacgcg 241 agcgccgagc acttggagct acgctggttc cgaaagaagg tttcgccggc cgtgctggtg 301 catagggacg ggcgcgagca ggaagccgag cagatgcccg agtaccgcgg gcgggcgacg 361 ctggtccagg acggcatcgc caaggggcgc gtggccttga ggatccgtgg cgtcagagtc 421 tctgacgacg gggagtacac gtgctttttc agggaggatg gaagctacga agaagccctg 481 gtgcatctga aggtggctgc tctgggctct gaccctcaca tcagtatgca agttcaagag 541 aatggagaaa tctgtctgga gtgcacctca gtgggatggt acccagagcc ccaggtgcag 601 tggagaactt ccaagggaga gaagtttcca tctacatcag agtccaggaa tcctgatgaa 661 gaaggtttgt tcactgtggc tgcttcagtg atcatcagag acacttctac gaaaaatgtg 721 tcctgctaca tccagaatct ccttcttgga caggagaaga aagtagaaat atccatacca 781 gcttcctccc tcccaaggct gactccctgg atagtggctg tggctgtcat cctgatggtt 841 ctaggacttc tcaccattgg gtccatattt ttcacttgga gactatacaa cgaaagaccc 901 agagagagga ggaatgaatt cagctctaaa gagagactcc tggaagaact caaatggaaa 961 aaggctacct tgcatgcagt tgatgtgact ctggacccag acacagctca tccccacctc 1021 tttctttatg aggattcaaa atctgttcga ctggaagatt cacgtcagaa actgcctgag 1081 aaaacagaga gatttgactc ctggccctgt gtgttgggcc gtgagacctt cacctcagga 1141 aggcattact gggaggtgga ggtgggagac aggactgact gggcaatcgg cgtgtgtagg 1201 gagaatgtga tgaagaaagg atttgacccc atgactcctg agaatgggtt ctgggctgta 1261 gagttgtatg gaaatgggta ctgggccctc actcctctcc ggacccctct cccattggca 1321 gggcccccac gccgggttgg gattttccta gactatgaat caggagacat ctccttctac 1381 aacatgaatg atggatctga tatctatact ttctccaatg tcactttctc tggccccctc 1441 cggcccttct tttgcctatg gtctagcggt aaaaagcccc tgaccatctg cccaattgct 1501 gatgggcctg agagggtcac agtcattgct aatgcccagg acctttctaa ggagatccca 1561 ttgtccccca tgggggagga gtctgcccct agggatgcag acactctcca ttctaagcta 1621 atccctaccc aacccagcca aggggcacct taaggaatat ctcagctcat ctgttttcct 1681 ttcctctaac ccctctcctc catagccttc tgaggcttca cctgctagct ttacccagtc 1741 tgtttcttcc tgttgggtgg caattaatta atcctgtgaa ggttacattg ctgctgctag 1801 agagggtggg gattgcacct tccaaatctg tttctgtacc aatatttggg ggatggaggg 1861 gtgactcaaa ctgcttctag tgttctccta atcccttaag actagaacct ataggaaact 1921 acttggagca aactcaaagg acagattagg gatcgagatt gggtcaggtt agcatggggt 1981 tgtggttgaa atatcttggt atccaggata agggtatgtg gaaaaacagg ctttaggcaa 2041 gtggaaaatt caaaatgtgc tgtgaaagga caatctcagg ctgaaatccc ataaaggaac 2101 ttggagggaa tattatgatg gagggaagtg aggtgaatcc aggcacatga tgaacacctg 2161 gctcatccat agagttttca cagcctatat cgcaaatttt ctaagccacg tcctatagga 2221 cagaggagac tggccccact tctatgggtc tgagctgtgg aaaagggaga gcagagagga 2281 actgagatga gcagggatga agggtcaggc agaaagcgtg atagaggaga gaatttttga 2341 caaaactcaa aagttgtttg cacagctgtt ctttgtaccc tgttccttcc tctgcgccct 2401 cctgtttctc ccttgcctgg aagtcattcc accctcaatt tgttgatcca caagtttcca 2461 gttgtcctct tctttttgtt atagcatctc tctatttcaa agacattcct agaagtcatc 2521 cttcagtgat atcaccactt gctcagtcac catctcaacc ttatgtcacc tcagccctca 2581 tctcaatgcc caaacccctt acacacacct tcagttagct tcaactgcct ccgtttccac 2641 actgtgcacc tttcactttc cctacccagc tttcctacat gctgcctctc ctcagggtcc 2701 cctgaatgct gcatcattgt gttcagtgca gctggactga ttgcacctgt gtatttgccc 2761 ctgagcactt tcctttacac atgtggcttg tcttgccaat agactccagg cttacacctt 2821 ccatttccat cgtattctcc agtttccagg atagatgttg ctcatcgtct ttacctaata 2881 aataagtttg tctgattgct g // LOCUS HSU39656 1699 bp mRNA PRI 27-FEB-1996 DEFINITION Human MAP kinase kinase 6 (MKK6) mRNA, complete cds. ACCESSION U39656 NID g1203815 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1699) AUTHORS Raingeaud,J., Whitmarsh,A.J., Barrett,T., Derijard,B. and Davis,R.J. TITLE MKK3- and MKK6-regulated gene expression is mediated by the p38 mitogen-activated protein kinase signal transduction pathway JOURNAL Mol. Cell. Biol. 16 (3), 1247-1255 (1996) MEDLINE 96182129 REFERENCE 2 (bases 1 to 1699) AUTHORS Raingeaud,J., Barrett,T. and Davis,R.J. TITLE Direct Submission JOURNAL Submitted (27-OCT-1995) Roger J. Davis, Molecular Medicine, HHMI and UMASS Medical School, 373 Plantation Street, Worcester, MA 01605, USA FEATURES Location/Qualifiers source 1..1699 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="skeletal muscle" gene 341..1345 /gene="MKK6" CDS 341..1345 /gene="MKK6" /note="protein kinase" /codon_start=1 /function="MAP kinase kinase 6 phosphorylates and activates p38 MAP kinase" /product="MAP kinase kinase 6" /db_xref="PID:g1203816" /translation="MSQSKGKKRNPGLKIPKEAFEQPQTSSTPPRDLDSKACISIGNQ NFEVKADDLEPIMELGRGAYGVVEKMRHVPSGQIMAVKRIRATVNSQEQKRLLMDLDI SMRTVDCPFTVTFYGALFREGDVWICMELMDTSLDKFYKQVIDKGQTIPEDILGKIAV SIVKALEHLHSKLSVIHRDVKPSNVLINALGQVKMCDFGISGYLVDSVAKTIDAGCKP YMAPERINPELNQKGYSVKSDIWSLGITMIELAILRFPYDSWGTPFQQLKQVVEEPSP QLPADKFSAEFVDFTSQCLKKNSKERPTYPELMQHPFFTLHESKGTDVASFVKLILGD " BASE COUNT 544 a 352 c 380 g 423 t ORIGIN 1 ggcttctggt tcggcccacc tctgaaggtt ccagaatcga tagtgaattc gtggttccaa 61 gtttggagct tttagctgcc agccctggcc catcatgtag ctgcagcaca gccttcccta 121 acgttgcaac tgggggaaaa atcactttcc agtctgtttt gcaaggtgtg catttccatc 181 ttgattccct gaaagtccat ctgctgcatc ggtcaagaga aactccactt gcatgaagat 241 tgcacgcctg cagcttgcat ctttgttgca aaactagcta cagaagagaa gcaaggcaaa 301 gtcttttgtg ctcccctccc ccatcaaagg aaaggggaaa atgtctcagt cgaaaggcaa 361 gaagcgaaac cctggcctta aaattccaaa agaagcattt gaacaacctc agaccagttc 421 cacaccacct agagatttag actccaaggc ttgcatttct attggaaatc agaactttga 481 ggtgaaggca gatgacctgg agcctataat ggaactggga cgaggtgcgt acggggtggt 541 ggagaagatg cggcacgtgc ccagcgggca gatcatggca gtgaagcgga tccgagccac 601 agtaaatagc caggaacaga aacggctact gatggatttg gatatttcca tgaggacggt 661 ggactgtcca ttcactgtca ccttttatgg cgcactgttt cgggagggtg atgtgtggat 721 ctgcatggag ctcatggata catcactaga taaattctac aaacaagtta ttgataaagg 781 ccagacaatt ccagaggaca tcttagggaa aatagcagtt tctattgtaa aagcattaga 841 acatttacat agtaagctgt ctgtcattca cagagacgtc aagccttcta atgtactcat 901 caatgctctc ggtcaagtga agatgtgcga ttttggaatc agtggctact tggtggactc 961 tgttgctaaa acaattgatg caggttgcaa accatacatg gcccctgaaa gaataaaccc 1021 agagctcaac cagaagggat acagtgtgaa gtctgacatt tggagtctgg gcatcacgat 1081 gattgagttg gccatccttc gatttcccta tgattcatgg ggaactccat ttcagcagct 1141 caaacaggtg gtagaggagc catcgccaca actcccagca gacaagttct ctgcagagtt 1201 tgttgacttt acctcacagt gcttaaagaa gaattccaaa gaacggccta catacccaga 1261 gctaatgcaa catccatttt tcaccctaca tgaatccaaa ggaacagatg tggcatcttt 1321 tgtaaaactg attcttggag actaaaaagc agtggactta atcggttgac cctactgtgg 1381 attggtgggt ttcggggtga agcaagttca ctacagcatc aatagaaagt catctttgag 1441 ataatttaac cctgcctctc agagggtttt ctctcccaat tttcttttta ctccccctct 1501 taagggggcc ttggaatcta tagtatagaa tgaactgtct agatggatga attatgataa 1561 aggcttagga cttcaaaagg tgattaaata tttaatgatg tgtcatatga gtcctcaaaa 1621 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1681 aaaaaaaaaa aaaaaaaaa // LOCUS HSU39817 4437 bp mRNA PRI 16-FEB-1996 DEFINITION Human Bloom's syndrome protein (BLM) mRNA, complete cds. ACCESSION U39817 NID g1072121 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4437) AUTHORS Ellis,N.A., Groden,J., Ye,T.Z., Straughen,J., Lennon,D.J., Ciocci,S., Proytcheva,M. and German,J. TITLE The Bloom's syndrome gene product is homologous to RecQ helicases JOURNAL Cell 83 (4), 655-666 (1995) MEDLINE 96069866 REFERENCE 2 (bases 1 to 4437) AUTHORS Ellis,N. TITLE Direct Submission JOURNAL Submitted (01-NOV-1995) Nathan A. Ellis, Human Genetics, New York Blood Center, 310 E 67th St., New York, NY 10021, USA FEATURES Location/Qualifiers source 1..4437 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="15" /map="15q26.1" /cell_type="HeLa" /cell_line="lymphoblastoid; normal diploid fibroblast" gene 75..4328 /gene="BLM" CDS 75..4328 /gene="BLM" /note="putative DNA helicase; contain motifs similar to the seven helicase motifs in RecQ helicases" /codon_start=1 /product="Bloom's syndrome protein" /db_xref="PID:g1072122" /translation="MAAVPQNNLQEQLERHSARTLNNKLSLSKPKFSGFTFKKKTSSD NNVSVTNVSVAKTPVLRNKDVNVTEDFSFSEPLPNTTNQQRVKDFFKNAPAGQETQRG GSKSLLPDFLQTPKEVVCTTQNTPTVKKSRDTALKKLEFSSSPDSLSTINDWDDMDDF DTSETSKSFVTPPQSHFVRVSTAQKSKKGKRNFFKAQLYTTNTVKTDLPPPSSESEQI DLTEEQKDDSEWLSSDVICIDDGPIAEVHINEDAQESDSLKTHLEDERDNSEKKKNLE EAELHSTEKVPCIEFDDDDYDTDFVPPSPEEIISASSSSSKCLSTLKDLDTSDRKEDV LSTSKDLLSKPEKMSMQELNPETSTDCDARQISLQQQLIHVMEHICKLIDTIPDDKLK LLDCGNELLQQRNIRRKLLTEVDFNKSDASLLGSLWRYRPDSLDGPMEGDSCPTGNSM KELNFSHLPSNSVSPGDCLLTTTLGKTGFSATRKNLFERPLFNTHLQKSFVSSNWAET PRLGKKNESSYFPGNVLTSTAVKDQNKHTASINDLERETQPSYDIDNFDIDDFDDDDD WEDIMHNLAASKSSTAAYQPIKEGRPIKSVSERLSSAKTDCLPVSSTAQNINFSESIQ NYTDKSAQNLASRNLKHERFQSLSFPHTKEMMKIFHKKFGLHNFRTNQLEAINAALLG EDCFILMPTGGGKSLCYQLPACVSPGVTVVISPLRSLIVDQVQKLTSLDIPATYLTGD KTDSEATNIYLQLSKKDPIIKLLYVTPEKICASNRLISTLENLYERKLLARFVIDEAH CVSQWGHDFRQDYKRMNMLRQKFPSVPVMALTATANPRVQKDILTQLKILRPQVFSMS FNRHNLKYYVLPKKPKKVAFDCLEWIRKHHPYDSGIIYCLSRRECDTMADTLQRDGLA ALAYHAGLSDSARDEVQQKWINQDGCQVICATIAFGMGIDKPDVRFVIHASLPKSVEG YYQESGRAGRDGEISHCLLFYTYHDVTRLKRLIMMEKDGNHHTRETHFNNLYSMVHYC ENITECRRIQLLAYFGENGFNPDFCKKHPDVSCDNCCKTKDYKTRDVTDDVKSIVRFV QEHSSSQGMRNIKHVGPSGRFTMNMLVDIFLGSKSAKIQSGIFGKGSAYSRHNAERLF KKLILDKILDEDLYINANDQAIAYVMLGNKAQTVLNGNLKVDFMETENSSSVKKQKAL VAKVSQREEMVKKCLGELTEVCKSLGKVFGVHYFNIFNTVTLKKLAESLSSDPEVLLQ IDGVTEDKLEKYGAEVISVLQKYSEWTSPAEDSSPGISLSSSRGPGRSAAEELDEEIP VSSHYFASKTRNERKRKKMPASQRSKRRKTASSGSKAKGGSATCRKISSKTKSSSIIG SSSASHTSQATSGANSKLGIMAPPKPINRPFLKPSYAFS" BASE COUNT 1465 a 865 c 913 g 1194 t ORIGIN 1 gcgcggcggc cgtggttgcg gcgcgggaag tttggatcct ggttccgtcc gctaggagtc 61 tgcgtgcgag gattatggct gctgttcctc aaaataatct acaggagcaa ctagaacgtc 121 actcagccag aacacttaat aataaattaa gtctttcaaa accaaaattt tcaggtttca 181 cttttaaaaa gaaaacatct tcagataaca atgtatctgt aactaatgtg tcagtagcaa 241 aaacacctgt attaagaaat aaagatgtta atgttaccga agacttttcc ttcagtgaac 301 ctctacccaa caccacaaat cagcaaaggg tcaaggactt ctttaaaaat gctccagcag 361 gacaggaaac acagagaggt ggatcaaaat cattattgcc agatttcttg cagactccga 421 aggaagttgt atgcactacc caaaacacac caactgtaaa gaaatcccgg gatactgctc 481 tcaagaaatt agaatttagt tcttcaccag attctttaag taccatcaat gattgggatg 541 atatggatga ctttgatact tctgagactt caaaatcatt tgttacacca ccccaaagtc 601 actttgtaag agtaagcact gctcagaaat caaaaaaggg taagagaaac ttttttaaag 661 cacagcttta tacaacaaac acagtaaaga ctgatttgcc tccaccctcc tctgaaagcg 721 agcaaataga tttgactgag gaacagaagg atgactcaga atggttaagc agcgatgtga 781 tttgcatcga tgatggcccc attgctgaag tgcatataaa tgaagatgct caggaaagtg 841 actctctgaa aactcatttg gaagatgaaa gagataatag cgaaaagaag aagaatttgg 901 aagaagctga attacattca actgagaaag ttccatgtat tgaatttgat gatgatgatt 961 atgatacgga ttttgttcca ccttctccag aagaaattat ttctgcttct tcttcctctt 1021 caaaatgcct tagtacgtta aaggaccttg acacatctga cagaaaagag gatgttctta 1081 gcacatcaaa agatcttttg tcaaaacctg agaaaatgag tatgcaggag ctgaatccag 1141 aaaccagcac agactgtgac gctagacaga taagtttaca gcagcagctt attcatgtga 1201 tggagcacat ctgtaaatta attgatacta ttcctgatga taaactgaaa cttttggatt 1261 gtgggaacga actgcttcag cagcggaaca taagaaggaa acttctaacg gaagtagatt 1321 ttaataaaag tgatgccagt cttcttggct cattgtggag atacaggcct gattcacttg 1381 atggccctat ggagggtgat tcctgcccta cagggaattc tatgaaggag ttaaattttt 1441 cacaccttcc ctcaaattct gtttctcctg gggactgttt actgactacc accctaggaa 1501 agacaggatt ctctgccacc aggaagaatc tttttgaaag gcctttattc aatacccatt 1561 tacagaagtc ctttgtaagt agcaactggg ctgaaacacc aagactagga aaaaaaaatg 1621 aaagctctta tttcccagga aatgttctca caagcactgc tgtgaaagat cagaataaac 1681 atactgcttc aataaatgac ttagaaagag aaacccaacc ttcctatgat attgataatt 1741 ttgacataga tgactttgat gatgatgatg actgggaaga cataatgcat aatttagcag 1801 ccagcaaatc ttccacagct gcctatcaac ccatcaagga aggtcggcca attaaatcag 1861 tatcagaaag actttcctca gccaagacag actgtcttcc agtgtcatct actgctcaaa 1921 atataaactt ctcagagtca attcagaatt atactgacaa gtcagcacaa aatttagcat 1981 ccagaaatct gaaacatgag cgtttccaaa gtcttagttt tcctcataca aaggaaatga 2041 tgaagatttt tcataaaaaa tttggcctgc ataattttag aactaatcag ctagaggcga 2101 tcaatgctgc actgcttggt gaagactgtt ttatcctgat gccgactgga ggtggtaaga 2161 gtttgtgtta ccagctccct gcctgtgttt ctcctggggt cactgttgtc atttctccct 2221 tgagatcact tatcgtagat caagtccaaa agctgacttc cttggatatt ccagctacat 2281 atctgacagg tgataagact gactcagaag ctacaaatat ttacctccag ttatcaaaaa 2341 aagacccaat cataaaactt ctatatgtca ctccagaaaa gatctgtgca agtaacagac 2401 tcatttctac tctggagaat ctctatgaga ggaagctctt ggcacgtttt gttattgatg 2461 aagcacattg tgtcagtcag tggggacatg attttcgtca agattacaaa agaatgaata 2521 tgcttcgcca gaagtttcct tctgttccgg tgatggctct tacggccaca gctaatccca 2581 gggtacagaa ggacatcctg actcagctga agattctcag acctcaggtg tttagcatga 2641 gctttaacag acataatctg aaatactatg tattaccgaa aaagcctaaa aaggtggcat 2701 ttgattgcct agaatggatc agaaagcacc acccatatga ttcagggata atttactgcc 2761 tctccaggcg agaatgtgac accatggctg acacgttaca gagagatggg ctcgctgctc 2821 ttgcttacca tgctggcctc agtgattctg ccagagatga agtgcagcag aagtggatta 2881 atcaggatgg ctgtcaggtt atctgtgcta caattgcatt tggaatgggg attgacaaac 2941 cggacgtgcg atttgtgatt catgcatctc tccctaaatc tgtggagggt tactaccaag 3001 aatctggcag agctggaaga gatggggaaa tatctcactg cctgcttttc tatacctatc 3061 atgatgtgac cagactgaaa agacttataa tgatggaaaa agatggaaac catcatacaa 3121 gagaaactca cttcaataat ttgtatagca tggtacatta ctgtgaaaat ataacggaat 3181 gcaggagaat acagcttttg gcctactttg gtgaaaatgg atttaatcct gatttttgta 3241 agaaacaccc agatgtttct tgtgataatt gctgtaaaac aaaggattat aaaacaagag 3301 atgtgactga cgatgtgaaa agtattgtaa gatttgttca agaacatagt tcatcacaag 3361 gaatgagaaa tataaaacat gtaggtcctt ctggaagatt tactatgaat atgctggtcg 3421 acattttctt ggggagtaag agtgcaaaaa tccagtcagg tatatttgga aaaggatctg 3481 cttattcacg acacaatgcc gaaagacttt ttaaaaagct gatacttgac aagattttgg 3541 atgaagactt atatatcaat gccaatgacc aggcgatcgc ttatgtgatg ctcggaaata 3601 aagcccaaac tgtactaaat ggcaatttaa aggtagactt tatggaaaca gaaaattcca 3661 gcagtgtgaa aaaacaaaaa gcgttagtag caaaagtgtc tcagagggaa gagatggtta 3721 aaaaatgtct tggagaactt acagaagtct gcaaatctct ggggaaagtt tttggtgtcc 3781 attacttcaa tatttttaat accgtcactc tcaagaagct tgcagaatct ttatcttctg 3841 atcctgaggt tttgcttcaa attgatggtg ttactgaaga caaactggaa aaatatggtg 3901 cggaagtgat ttcagtatta cagaaatact ctgaatggac atcgccagct gaagacagtt 3961 ccccagggat aagcctgtcc agcagcagag gccccggaag aagtgccgct gaggagcttg 4021 acgaggaaat acccgtatct tcccactact ttgcaagtaa aaccagaaat gaaaggaaga 4081 ggaaaaagat gccagcctcc caaaggtcta agaggagaaa aactgcttcc agtggttcca 4141 aggcaaaggg ggggtctgcc acatgtagaa agatatcttc caaaacgaaa tcctccagca 4201 tcattggatc cagttcagcc tcacatactt ctcaagcgac atcaggagcc aatagcaaat 4261 tggggattat ggctccaccg aagcctataa atagaccgtt tcttaagcct tcatatgcat 4321 tctcataaca accgaatctc aatgtacata gaccctcttt cttgtttgtc agcatctgac 4381 catctgtgac tataaagctg ttattcttgt tataccaaaa aaaaaaaaaa aaaaaaa // LOCUS HSU39905 2749 bp mRNA PRI 16-MAY-1996 DEFINITION Human vesicular monoamine transporter VMAT1 mRNA, complete cds. ACCESSION U39905 NID g1314289 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2749) AUTHORS Erickson,J.D., Schafer,M.K., Bonner,T.I., Eiden,L.E. and Weihe,E. TITLE Distinct pharmacological properties and distribution in neurons and endocrine cells of two isoforms of the human vesicular monoamine transporter JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (10), 5166-5171 (1996) MEDLINE 96209876 REFERENCE 2 (bases 1 to 2749) AUTHORS Bonner,T.I. and Erickson,J.D. TITLE Direct Submission JOURNAL Submitted (01-NOV-1995) Tom I. Bonner, Lab of Cell Biology, NIMH, Bldg. 36, Room 3A-17, Bethesda, MD 20892-4090, USA FEATURES Location/Qualifiers source 1..2749 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="pheochromocytoma" CDS 268..1845 /codon_start=1 /product="vesicular monoamine transporter VMAT1" /db_xref="PID:g1314290" /translation="MLRTILDAPQRLLKEGRASRQLVLVVVFVALLLDNMLFTVVVPI VPTFLYDMEFKEVNSSLHLGHAGSSPHALASPAFSTIFSFFNNNTVAVEESVPSGIAW MNDTASTIPPPATEAISAHKNNCLQGTGFLEEEITRVGVLFASKAVMQLLVNPFVGPL TNRIGYHIPMFAGFVIMFLSTVMFAFSGTYTLLFVARTLQGIGSSFSSVAGLGMLASV YTDDHERGRAMGTALGGLALGLLVGAPFGSVMYEFVGKSAPFLILAFLALLDGALQLC ILQPSKVSPESAKGTPLFMLLKDPYILVAAGSICFANMGVAILEPTLPIWMMQTMCSP KWQLGLAFLPASVSYLIGTNLFGVLANKMGRWLCSLIGMLVVGTSLLCVPLAHNIFGL IGPNAGLGLAIGMVDSSMMPIMGHLVDLRHTSVYGSVYAIADVAFCMGFAIGPSTGGA IVKAIGFPWLMVITGVINIVYAPLCYYLRSPPAKEEKLAILSQDCPMETRMYATQKPT KEFPLGEDSDEEPDHEE" polyA_signal 2731..2736 polyA_site 2749 BASE COUNT 586 a 734 c 655 g 774 t ORIGIN 1 cacacacaca catacacaga atcctcagat aacaggaggc aataaatcca acagcacatc 61 cacgttcaga gaacagtgtc cctgctgtct tgctaacagc tgccaatacc tcactgagtg 121 cctcacacca acatgggctc caagtgagtt tccttcgtct gggcagactc cctcccctct 181 tccataaagg ctgcaggaga cctgtagctg tcacaggacc ttccctaaga gcccgcaggg 241 aaagactgcc ccagtccggc catcaccatg ctccggacca ttctggatgc tccccagcgg 301 ttgctgaagg aggggagagc gtcccggcag ctggtgctgg tggtggtatt cgtcgctttg 361 ctcctggaca acatgctgtt tactgtggtg gtgccaattg tgcccacctt cctatatgac 421 atggagttca aagaagtcaa ctcttctctg cacctcggcc atgccggaag ttccccacat 481 gccctcgcct ctcctgcctt ttccaccatc ttctccttct tcaacaacaa caccgtggct 541 gttgaagaaa gcgtacctag tggaatagca tggatgaatg acactgccag caccatccca 601 cctccagcca ctgaagccat ctcagctcat aaaaacaact gcttgcaagg cacaggtttc 661 ttggaggaag agattacccg ggtcggggtt ctgtttgctt caaaggctgt gatgcaactt 721 ctggtcaacc cattcgtggg ccctctcacc aacaggattg gatatcatat ccccatgttt 781 gctggctttg ttatcatgtt tctctccaca gttatgtttg ctttttctgg gacctatact 841 ctactctttg tggcccgaac ccttcaaggc attggatctt cattttcatc tgttgcaggt 901 cttggaatgc tggccagtgt ctacactgat gaccatgaga gaggacgagc catgggaact 961 gctctggggg gcctggcctt ggggttgctg gtgggagctc cctttggaag tgtaatgtac 1021 gagtttgttg ggaagtctgc acccttcctc atcctggcct tcctggcact actggatgga 1081 gcactccagc tttgcatcct acagccttcc aaagtctctc ctgagagtgc caaggggact 1141 cccctcttta tgcttctcaa agacccttac atcctggtgg ctgcagggtc catctgcttt 1201 gccaacatgg gggtggccat cctggagccc acactgccca tctggatgat gcagaccatg 1261 tgctccccca agtggcagct gggtctagct ttcttgcctg ccagtgtgtc ctacctcatt 1321 ggcaccaacc tctttggtgt gttggccaac aagatgggtc ggtggctgtg ttccctaatc 1381 gggatgctgg tagtaggtac cagcttgctc tgtgttcctc tggctcacaa tatttttggt 1441 ctcattggcc ccaatgcagg gcttggcctt gccataggca tggtggattc ttctatgatg 1501 cccatcatgg ggcacctggt ggatctacgc cacacctcgg tgtatgggag tgtctacgcc 1561 atcgctgatg tggctttttg catgggcttt gctataggtc catccaccgg tggtgccatt 1621 gtaaaggcca tcggttttcc ctggctcatg gtcatcactg gggtcatcaa catcgtctat 1681 gctccactct gctactacct gcggagcccc ccggcaaagg aagagaagct tgctattctg 1741 agtcaggact gccccatgga gacccggatg tatgcaaccc agaagcccac gaaggaattt 1801 cctctggggg aggacagtga tgaggagcct gaccatgagg agtagcagca gaaggtgctc 1861 cttgaattca tgatgcctca gtgaccacct ctttccctgg gaccagatca ccatggctga 1921 gcccacggct cagtgggctt cacatacctc tgcctgggaa tcttctttcc tcccctccca 1981 tggacactgt ccctgatact cttctcacct gtgtaacttg tagctcttcc tctatgcctt 2041 ggtgccgcag tggcccatct tttatgggaa gacagagtga tgcaccttcc cgctgctgtg 2101 aggttgatta aacttgagct gtgacgggtt ctgcaagggg tgactcattg catagaggtg 2161 gtagtgagta atgtgcccct gaaaccagtg gggtgactga caagcctctt taatctgttg 2221 cctgattttc tctggcatag tcccaacaga tcggaagagt gttaccctct tttcctcaac 2281 gtgttctttc ccgggttttc ccagccgagt tgagaaaatg ttctcagcat tgtcttgctg 2341 ccaaatgcca gcttgaagag ttttgttttg ttttttttcc atttattttt tttttttaat 2401 aaagtgagtg atttttctgt ggctaaatct agagctgcta aaagggcttt accctcagtg 2461 aaaagtgtct tctattttca ttatctttca gaaacaggag cccatttctc ttctgctgga 2521 gttattgaca ttctcctgac ctcccctgtg tgttcctacc ttttctgaac ctcttagact 2581 cttagaaata aaagtagaag aaagacagaa aaaataactg attagaccca agatttcatg 2641 ggaagaagtt aaaagaaact gccttgaaat ccctcctgat tgtagatttc ctaataggag 2701 gggtgtaatg tgacattgtt catacttgct aataaataca ttattgcct // LOCUS HSU39945 889 bp mRNA PRI 09-DEC-1996 DEFINITION Human adenylate kinase 2 (adk2) mRNA, complete cds. ACCESSION U39945 NID g1209686 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 889) AUTHORS Lee,Y., Kim,J.W., Lee,I.A., Kang,H.B., Choe,Y.K., Lee,H.G., Lim,J.S., Kim,H.J., Park,C. and Choe,I.S. TITLE Cloning and characterization of cDNA for human adenylate kinase 2A JOURNAL Biochem. Mol. Biol. Int. 39 (4), 833-842 (1996) MEDLINE 97000211 REFERENCE 2 (bases 1 to 889) AUTHORS Lee,Y., Kim,J., Lee,I., Moon,J., Song,J. and Choe,I. TITLE Isolation and characterization of human gene homologous with bovine adenylate kinase 2a JOURNAL Thesis (1995) Mol. and Cell. Biol. Research group, Korean Reasearch Institute of Bioscience and Biotechnology REFERENCE 3 (bases 1 to 889) AUTHORS Choe,I. TITLE Direct Submission JOURNAL Submitted (03-NOV-1995) Inseong Choe, Korean Reasearch Institute of Bioscience and Biotechnology, Mol. and Cell. Biol. Research group, Yoosung, Taejon 305-600, Korea FEATURES Location/Qualifiers source 1..889 /organism="Homo sapiens" /isolate="Korean" /db_xref="taxon:9606" /clone="A5" /tissue_type="liver" /dev_stage="fetus" gene 34..753 /gene="adk2" CDS 34..753 /gene="adk2" /EC_number="2.7.4.3" /codon_start=1 /function="phosphotransferase" /product="adenylate kinase 2" /db_xref="PID:g1209687" /translation="MAPSVPAAEPEYPKGIRAVLLGPPGAGKGTQAPRLAENFCVCHL ATGDMLRAMVASGSELGKKLKATMDAGKLVSDEMVVELIEKNLETPLCKNGFLLDGFP RTVRQAEMLDDLMEKRKEKLDSVIEFSIPDSLLIRRITGRLIHPKSGRSYHEEFNPPK EPMKDDITGEPLIRRSDDNEKALKIRLQAYHTQTTPLIEYYRKRGIHSAIDASQTPDV VFASILAAFSKATCKDLVMFI" polyA_signal 866..871 polyA_site 889 BASE COUNT 242 a 215 c 239 g 193 t ORIGIN 1 gcgaactggt ggcagtgaga gacttcggcg gacatggctc ccagcgtgcc agcggcagaa 61 cccgagtatc ctaaaggcat ccgggccgtg ctgctggggc ctcccggggc cggtaaaggg 121 acccaggcac ccagattggc tgaaaacttc tgtgtctgcc atttagctac tggggacatg 181 ctgagggcca tggtggcttc tggctcagag ctaggaaaaa agctgaaggc aactatggat 241 gctgggaaac tggtgagtga tgaaatggta gtggagctca ttgagaagaa tttggagacc 301 cccttgtgca aaaatggttt tcttctggat ggcttccctc ggactgtgag gcaggcagaa 361 atgctcgatg acctcatgga gaagaggaaa gagaagcttg attctgtgat tgaattcagc 421 atcccagact ctctgctgat ccgaagaatc acaggaaggc tgattcaccc caagagtggc 481 cgttcctacc acgaggagtt caaccctcca aaagagccca tgaaagatga catcaccggg 541 gaacccttga tccgtcgatc agatgataat gaaaaggcct tgaaaatccg cctgcaagcc 601 taccacactc aaaccacccc actcatagag tactacagga aacgggggat ccactccgcc 661 atcgatgcat cccagacccc cgatgtcgtg ttcgcaagca tcctagcagc cttctccaaa 721 gccacatgta aagacttggt tatgtttatc taatgttggg tccaagaagg aatttctttc 781 catccctgtg aggcaatggg tgggaatgat aggacaggca aagagaagct tcctcaggct 841 agcaaaaata tcatttgatg tattgattaa aaaagcactt gcttgatgt // LOCUS HSU40002 3804 bp mRNA PRI 06-SEP-1996 DEFINITION Human hormone-sensitive lipase testicular isoform mRNA, complete cds. ACCESSION U40002 NID g1488676 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3804) AUTHORS Holst,L.S., Langin,D., Mulder,H., Laurell,H., Grober,J., Bergh,A., Mohrenweiser,H.W., Edgren,G. and Holm,C. TITLE Molecular cloning, genomic organization, and expression of a testicular isoform of hormone-sensitive lipase JOURNAL Genomics 35 (3), 441-447 (1996) MEDLINE 97001144 REFERENCE 2 (bases 1 to 3804) AUTHORS Stenson Holst,L., Langin,D., Laurell,H., Grober,J., Edgren,G. and Holm,C. TITLE Direct Submission JOURNAL Submitted (02-NOV-1995) Lena Stenson Holst, Cell and Molecular Biology, Lund University, P.O. Box 94, Lund, S-22100, Sweden FEATURES Location/Qualifiers source 1..3804 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="19" /map="19q13.1-13.2" /tissue_type="testis" CDS 278..3508 /codon_start=1 /product="hormone-sensitive lipase testicular isoform" /db_xref="PID:g1488677" /translation="MEPGSKSVSRSDWQPEPHQRPITPLEPGPEKTPIAQPESKTLQG SNTQQKPASNQRPLTQQETPAQHDAESQKEPRAQQKSASQEEFLAPQKPAPQQSPYIQ RVLLTQQEAASQQGPGLGKESITQQEPALRQRHVAQPGPGPGEPPPAQQEAESTPAAQ AKPGAKREPSAPTESTSQETPEQSDKQTTPVQGAKSKQGSLTELGFLTKLQELSIQRS ALEWKALSEWVADSESESDVGSSSDTDSPATMGGMVAQGVKLGFKGKSGYKVMSGYSG TSPHEKTSARNHRHYQDTASRLIHNMDLRTMTQSLVTLAEDNIAFFSSQGPGETAQRL SGVFAGVREQALGLEPALGRLLGVAHLFDLDPETPANGYRSLVHTARCCLAHLLHKSR YVASNRRSIFFRTSHNLAELEAYLAALTQLRALVYYAQRLLVTNRPGVLFFEGDEGLT ADFLREYVTLHKGCFYGRCLGFQFTPAIRPFLQTISIGLVSFGEHYKRNETGLSVAAS SLFTSGRFAIDPELRGAEFERITQNLDVHFWKAFWNITEMEVLSSLANMASATVRVSR LLSLPPEAFEMPLTADPTLTVTISPPLAHTGPGPVLVRLISYDLREGQDSEELSSLIK SNGQRSLELWPRPQQAPRSRSLIVHFHGGGFVAQTSRSHEPYLKSWAQELGAPIISID YSLAPEAPFPRALEECFFAYCWAIKHCALLGSTGERICLAGDSAGGNLCFTVALRAAA YGVRVPDGIMAAYPATMLQPAASPSRLLSLMDPLLPLSVLSKCVSAYAGAKTEDHSNS DQKALGMMGLVRRDTALLLRDFRLGASSWLNSFLELSGRKSQKMSEPIAEPMRRSVSE AALAQPQGPLGTDSLKNLTLRDLSLRGNSETSSDTPEMSLSAETLSPSTPSDVNFLLP PEDAGEEAEAKNELSPMDRGLGVRAAFPEGFHPRRSSQGATQMPLYSSPIVKNPFMSP LLAPDSMLKSLPPVHIVACALDPMLDDSVMLARRLRNLGQPVTLRVVEDLPHGFLTLA ALCRETRQAAELCVERIRLVLTPPAGAGPSGETGAAGVDGGCGGRH" BASE COUNT 828 a 1225 c 1091 g 660 t ORIGIN 1 cttcttgtaa gagagtgcta ggcacatagc cccctcctat tcctaatcct cccaccaaag 61 aaagaggcac agagttcatt acttagtggg ggccagctgt gatcggccaa ctgccagctg 121 ccttaaaaag gaagaccagt gatgctagga tggagtgaaa cccaagagga agtgccatca 181 tgaggaatca atgagagatc tgtgaagaga gagggctggg tgggagccca gaaggataga 241 acctggaaga tcaatatctc ccgtgaggga aataacaatg gagccaggtt ctaagtcagt 301 gtctaggtca gactggcaac ctgaaccaca ccagaggcct ataaccccgc tagagcctgg 361 gccagaaaag acacccatag cccagccaga atcgaagact ctgcagggat ccaataccca 421 acagaagcct gcttcaaacc aaagacccct cacccagcag gagacccctg cacaacatga 481 tgctgaatcc cagaaggaac ctagagccca acaaaaatct gcttcacaag aggaatttct 541 tgccccacag aagcccgcac cacagcaatc accttacatc caaagggtgc tgctcactca 601 acaggaagct gcctcccagc agggacctgg gctaggaaaa gaatctataa ctcaacagga 661 gccagcattg agacaaagac atgtagccca gccagggcct gggccaggag agccacctcc 721 agctcaacaa gaagctgaat caacacctgc ggcccaggct aaacctggag ccaaaaggga 781 gccatctgcc ccgactgaat ctacatccca agagacacct gaacagtcag acaagcaaac 841 aacgccagtc cagggagcca aatccaagca gggatctttg acagagctgg gatttctaac 901 aaaacttcag gaactatcca tacagcgatc agccctagag tggaaggcac tttctgagtg 961 ggtcgcagat tctgagtcag aatcagatgt gggatcatct tcagacacag attctccagc 1021 cacgatgggt ggaatggtgg cccagggagt gaagctaggc ttcaaaggaa aatctggtta 1081 taaagtgatg tcaggataca gtgggacgtc gccacatgag aaaaccagtg ctcggaatca 1141 cagacactac caggatacag cctcaaggct catccacaac atggacctgc gcacaatgac 1201 acagtcgctg gtgactctgg cggaggacaa catagccttc ttctcgagcc agggtcctgg 1261 ggaaacggcc cagcggctgt caggcgtttt tgccggtgta cgggagcagg cgctggggct 1321 ggagccggcc ctgggccgcc tgctgggtgt ggcgcacctc tttgacctgg acccagagac 1381 accggccaac gggtaccgca gcctagtgca cacagcccgc tgctgcctgg cgcacctcct 1441 gcacaaatcc cgctatgtgg cctccaaccg ccgcagcatc ttcttccgca ccagccacaa 1501 cctggccgag ctggaggcct acctggctgc cctcacccag ctccgcgctc tggtctacta 1561 cgcccagcgc ctgctggtta ccaatcggcc gggggtactc ttctttgagg gcgacgaggg 1621 gctcaccgcc gacttcctcc gggagtatgt cacgctgcat aagggatgct tctatggccg 1681 ctgcctgggc ttccagttca cgcctgccat ccggccattc ctgcagacca tctccattgg 1741 gctggtgtcc ttcggggagc actacaaacg caacgagaca ggcctcagtg tggccgccag 1801 ctctctcttc accagcggcc gctttgccat cgaccccgag ctgcgtgggg ctgagtttga 1861 gcggatcaca cagaacctgg acgtgcactt ctggaaagcc ttctggaaca tcaccgagat 1921 ggaagtgcta tcgtctctgg ccaacatggc atcggccacc gtgagggtaa gccgcctgct 1981 cagcctgcca cccgaagcct ttgagatgcc actgactgcc gaccccacgc tcacggtcac 2041 catctcaccc ccactggccc acacaggccc tgggcccgtc ctcgtcaggc tcatctccta 2101 tgacctgcgt gaaggacagg acagtgagga gctcagcagc ctgataaagt ccaacggcca 2161 acggagcctg gagctgtggc cgcgccccca gcaggcaccc cgctcgcggt ccctgatagt 2221 gcacttccac ggcggtggct ttgtggccca gacctccaga tcccacgagc cctacctcaa 2281 gagctgggcc caggagctgg gcgcccccat catctccatc gactactccc tggcccctga 2341 ggcccccttc ccccgtgcgc tggaggagtg cttcttcgcc tactgctggg ccatcaagca 2401 ctgcgccctc cttggctcaa caggggaacg aatctgcctt gcgggggaca gtgcaggcgg 2461 gaacctctgc ttcaccgtgg ctcttcgggc agcagcctac ggggtgcggg tgccagatgg 2521 catcatggca gcctacccgg ccacaatgct gcagcctgcc gcctctccct cccgcctgct 2581 gagcctcatg gaccccttgc tgcccctcag tgtgctctcc aagtgtgtca gcgcctatgc 2641 tggtgcaaag acggaggacc actccaactc agaccagaaa gccctcggca tgatggggct 2701 ggtgcggcgg gacacagccc tgctcctccg agacttccgc ctgggtgcct cctcatggct 2761 caactccttc ctggagttaa gtgggcgcaa gtcccagaag atgtcggagc ccatagcaga 2821 gccgatgcgc cgcagtgtgt ctgaagcagc actggcccag ccccagggcc cactgggcac 2881 ggattccctc aagaacctga ccctgaggga cttgagcctg aggggaaact ccgagacgtc 2941 gtcggacacc cccgagatgt cgctgtcagc tgagacactt agcccctcca caccctccga 3001 tgtcaacttc ttattaccac ctgaggatgc aggggaagag gctgaggcca aaaatgagct 3061 gagccccatg gacagaggcc tgggcgtccg tgccgccttc cccgagggtt tccacccccg 3121 acgctccagc cagggtgcca cacagatgcc cctctactcc tcacccatag tcaagaaccc 3181 cttcatgtcg ccgctgctgg cacccgacag catgctcaag agcctgccac ctgtgcacat 3241 cgtggcgtgc gcgctggacc ccatgctgga cgactcggtc atgctcgcgc ggcgactgcg 3301 caacctgggc cagccggtga cgctgcgcgt ggtggaggac ctgccgcacg gcttcctgac 3361 cctagcggcg ctgtgccgcg agacgcgcca ggccgcagag ctgtgcgtgg agcgcatccg 3421 cctcgtcctc actcctcccg ccggagccgg gccgagcggg gagacggggg ctgcgggggt 3481 agacgggggc tgcggggggc gacactaaaa gcctgttgtt cccatctgcg ccggcctccg 3541 tcatgaatgc cttccgggcc gggcggaagg ggacgcgggc tgtgcttact taagtcgggg 3601 gtggcaaggg ggcggggcgg gggccgaaag ctgagaccct cgccacgggg agggggacgc 3661 gcacacacac cggtcaccga gacggctgga cctgcacgcc accgctgcct tttgctgctg 3721 ctgctgcggc gaccgccgca gggacgggga ctggccctcc cttgcaggtc ggtttggttt 3781 gttgtaaata aaagtattta atta // LOCUS HSU40038 1450 bp mRNA PRI 07-FEB-1996 DEFINITION Human GTP-binding protein alpha q subunit (GNAQ) mRNA, complete cds. ACCESSION U40038 NID g1181670 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1450) AUTHORS Dong,Q., Shenker,A., Way,J., Haddad,B.R., Lin,K., Hughes,M.R., McBride,O.W., Spiegel,A.M. and Battey,J. TITLE Molecular cloning of human G alpha q cDNA and chromosomal localization of the G alpha q gene (GNAQ) and a processed pseudogene JOURNAL Genomics 30 (3), 470-475 (1995) MEDLINE 96423032 REFERENCE 2 (bases 1 to 1450) AUTHORS Battey,J. and Way,J. TITLE Direct Submission JOURNAL Submitted (02-NOV-1995) J. Battey, NIDDK, NIH, Bldg 10/8C101, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..1450 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="9" /map="9q21" gene 43..1122 /gene="GNAQ" CDS 43..1122 /gene="GNAQ" /codon_start=1 /product="GTP-binding protein alpha q subunit" /db_xref="PID:g1181671" /translation="MTLESIMACCLSEEAKEARRINDEIERHVRRDKRDARRELKLLL LGTGESGKSTFIKQMRIIHGSGYSDEDKRGFTKLVYQNIFTAMQAMIRAMDTLKIPYK YEHNKAHAQLVREVDVEKVSAFENPYVDAIKSLWNDPGIQECYDRRREYQLSDSTKYY LNDLDRVADPAYLPTQQDVLRVRVPTTGIIEYPFDLQSVIFRMVDVGGQRSERRKWIH CFENVTSIMFLVALSEYDQVLVESDNENRMEESKALFRTIITYPWFQNSSVILFLNKK DLLEEKIMYSHLVDYFPEYDGPQRDAQAAREFILKMFVDLNPDSDKIIYSHFTCATDT ENIRFVFAAVKDTILQLNLKEYNLV" BASE COUNT 423 a 323 c 348 g 356 t ORIGIN 1 ggggggttgc cggcggggct gcagcggagg cactttggaa gaatgactct ggagtccatc 61 atggcgtgct gcctgagcga ggaggccaag gaagcccggc ggatcaacga cgagatcgag 121 cggcacgtcc gcagggacaa gcgggacgcc cgccgggagc tcaagctgct gctgctcggg 181 acaggagaga gtggcaagag tacgtttatc aagcagatga gaatcatcca tgggtcagga 241 tactctgatg aagataaaag gggcttcacc aagctggtgt atcagaacat cttcacggcc 301 atgcaggcca tgatcagagc catggacaca ctcaagatcc catacaagta tgagcacaat 361 aaggctcatg cacaattagt tcgagaagtt gatgtggaga aggtgtctgc ttttgagaat 421 ccatatgtag atgcaataaa gagtttatgg aatgatcctg gaatccagga atgctatgat 481 agacgacgag aatatcaatt atctgactct accaaatact atcttaatga cttggaccgc 541 gtagctgacc ctgcctacct gcctacgcaa caagatgtgc ttagagttcg agtccccacc 601 acagggatca tcgaataccc ctttgactta caaagtgtca ttttcagaat ggtcgatgta 661 gggggccaaa ggtcagagag aagaaaatgg atacactgct ttgaaaatgt cacctctatc 721 atgtttctag tagcgcttag tgaatatgat caagttctcg tggagtcaga caatgagaac 781 cgaatggagg aaagcaaggc tctctttaga acaattatca catacccctg gttccagaac 841 tcctcggtta ttctgttctt aaacaagaaa gatcttctag aggagaaaat catgtattcc 901 catctagtcg actacttccc agaatatgat ggaccccaga gagatgccca ggcagcccga 961 gaattcattc tgaagatgtt cgtggacctg aacccagaca gtgacaaaat tatctactcc 1021 cacttcacgt gcgccacaga caccgagaat atccgctttg tctttgctgc cgtcaaggac 1081 accatcctcc agttgaacct gaaggagtac aatctggtct aattgtgcct cctagacacc 1141 cgccctgccc ttccctggtg ggctattgaa gatacacaag agggactgta ttttctgtgg 1201 aaaacaattt gcataatact aatttattgc cgtccggact ctgtgtgtat atgtgcaatt 1261 tttcaacaaa tgcaaaaaaa atacagcaca tgtattgaca gcttctgtca gcagcttgag 1321 ttgaaatttg atttaagaaa ataaatcatg attgttcaaa gctgctggga cgttagaatt 1381 aggccatgat actggtctca tttaactaca gtggtatttg gcactagtgt aaacttccat 1441 ataaatcact // LOCUS HSU40152 3153 bp mRNA PRI 22-DEC-1995 DEFINITION Human origin recognition complex 1 (HsORC1) mRNA, complete cds. ACCESSION U40152 NID g1113100 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3153) AUTHORS Gavin,K.A., Hidaka,M. and Stillman,B. TITLE Conserved initiator proteins in eukaryotes JOURNAL Science 270 (5242), 1667-1671 (1995) MEDLINE 96099401 REFERENCE 2 (bases 1 to 3153) AUTHORS Stillman,B. and Gavin,K.A. TITLE Direct Submission JOURNAL Submitted (05-NOV-1995) Bruce Stillman, Director, Cold Spring Harbor Laboratory, 1 Bungtown R, PO Box 100, Cold Spring Harbor, NY 11724, USA FEATURES Location/Qualifiers source 1..3153 /organism="Homo sapiens" /db_xref="taxon:9606" gene 220..2805 /gene="HsORC1" CDS 220..2805 /gene="HsORC1" /note="origin recognition complex 1; similar to the Saccharomyces cerevisiae ORC1-related protein; Method: conceptual translation supplied by author" /codon_start=1 /product="HsORC1" /db_xref="PID:g1113101" /translation="MAHYPTRLKTRKTYSWVGRPLLDRKLHYQTYREMCVKTEGCSTE IHIQIGQFVLIEGDDDENPYVAKLLELFEDDSDPPPKKRARVQWFVRFCEVPACKRHL LGRKPGAQEIFWYDYPACDSNINAETIIGLVRVIPLAPKDVVPTNLKNEKTLFVKLSW NEKKFRPLSSELFAELNKPQESAAKCQKPVRAKSKSAESPSWTPAEHVAKRIESRHSA SKSRQTPTHPLTPRARKRLELGNLGNPQMSQQTSCASLDSPGRIKRKVAFSEITSPSK RSQPDKLQTLSPALKAPEKTRETGLSYTEDDKKASPEHRIILRTRIAASKTIDIREER TLTPISGGQRSSVVPSVILKPENIKKRDAKEAKAQNEATSTPHRIRRKSSVLTMNRIR QQLRFLGNSKSDQEEKEILPAAEISDSSSDEEEASTPPLPRRAPRTVSRNLRSSLKSS LHTLTKVPKKSLKPRTPRCAAPQIRSRSLAAQEPASVLEEARLRLHVSAVPESLPCRE QEFQDIYNFVESKLLDHTGGCMYISGVPGTGKTATVHEVIRCLQQAAQANDVPPFQYI EVNGMKLTEPHQVYVHILQKLTGQKATANHAAELLAKQFCTRGSPQETTVLLVDELDL LWTHKQDIMYNLFDWPTHKEARLVVLAIANTMDLPERIMMNRVSSRLGLTRMCFQPYT YSQLQQILRSRLKHLKAFEDDAIQLVARKVAALSGDARRCLDICRRATEICEFSQQKP DSPGLVTIAHSMEAVDEMFSSSYITAIKNSSVLEQSFLRAILAEFRRSGLEEATFQQI YSQHVALCRMEGLPYPTMSETMAVCSHLGSCRLLLVEPSRNDLLLRVRLNVSQDDVLY ALKDE" BASE COUNT 836 a 808 c 792 g 717 t ORIGIN 1 ccggggccac gcgattggcg cgaagttttc ttttctcctt ccaccttctt ttcatttcta 61 gtgagacaca cgctttggtc ctggctttcg gcccgtagtt gtagaaggag ccctgctggt 121 gcaggttaga ggtgccgcat cccccggagc tctcgaagtg gaggcggtag gaaacggagg 181 gcttgcggct agccggagga agctttggag ccggaagcca tggcacacta ccccacaagg 241 ctgaagacca gaaaaactta ttcatgggtt ggcaggccct tgttggatcg aaaactgcac 301 taccaaacct atagagaaat gtgtgtgaaa acagaaggtt gttccaccga gattcacatc 361 cagattggac agtttgtgtt gattgaaggg gatgatgatg aaaacccgta tgttgctaaa 421 ttgcttgagt tgttcgaaga tgactctgat cctcctccta agaaacgtgc tcgagtacag 481 tggtttgtcc gattctgtga agtccctgcc tgtaaacggc atttgttggg ccggaagcct 541 ggtgcacagg aaatattctg gtatgattac ccggcctgtg acagcaacat taatgcggag 601 accatcattg gccttgttcg ggtgatacct ttagccccaa aggatgtggt accgacgaat 661 ctgaaaaatg agaagacact ctttgtgaaa ctatcctgga atgagaagaa attcaggcca 721 ctttcctcag aactatttgc ggagttgaat aaaccacaag agagtgcagc caagtgccag 781 aaacccgtga gagccaagag taagagtgca gagagccctt cttggacccc agcagaacat 841 gtggccaaaa ggattgaatc aaggcactcc gcctccaaat ctcgccaaac tcctacccat 901 cctcttaccc caagagccag aaagaggctg gagcttggca acttaggtaa ccctcagatg 961 tcccagcaga cttcatgtgc ctccttggat tctccaggaa gaataaaacg gaaagtggcc 1021 ttctcggaga tcacctcacc ttctaagaga tctcagcctg ataaacttca aaccttgtct 1081 ccagctctga aagccccaga gaaaaccaga gagactggac tctcttatac tgaggatgac 1141 aagaaggctt cacctgaaca tcgcataatc ctgagaaccc gaattgcagc ttcgaaaacc 1201 atagacatta gagaggagag aacacttacc cctatcagtg ggggacagag atcttcagtg 1261 gtgccatccg tgattctgaa accagaaaac atcaaaaaga gggatgcaaa agaagcaaaa 1321 gcccagaatg aagcgacctc tactccccat cgtatccgca gaaagagttc tgtcttgact 1381 atgaatcgga ttaggcagca gcttcggttt ctaggtaata gtaaaagtga ccaagaagag 1441 aaagagattc tgccagcagc agagatttca gactctagca gtgacgaaga agaggcttcc 1501 acaccgcccc ttccaaggag agcacccaga actgtgtcca ggaacctgcg atcttccttg 1561 aagtcatcct tacataccct cacgaaggtg ccaaagaaga gtctcaagcc tagaacgcca 1621 cgttgtgccg ctcctcagat ccgtagtcga agcctggctg cccaggagcc agccagtgtg 1681 ctggaggaag cccgactgag gctgcatgtt tctgctgtac ctgagtctct tccctgtcgg 1741 gaacaggaat tccaagacat ctacaatttt gtggaaagca aactccttga ccataccgga 1801 gggtgcatgt acatctccgg tgtccctggg acagggaaga ctgccactgt tcatgaagtg 1861 atacgctgcc tgcagcaggc agcccaagcc aatgatgttc ctccctttca atacattgag 1921 gtcaatggca tgaagctgac ggagccccac caagtctatg tgcacatctt gcagaagcta 1981 acaggccaaa aagcaacagc caaccatgcg gcagaactgc tggcaaagca attctgcacc 2041 cgagggtcac ctcaggaaac caccgtcctg cttgtggatg agctcgacct tctgtggact 2101 cacaaacaag acataatgta caatctcttt gactggccca ctcataagga ggcccggctt 2161 gtggtcctgg caattgccaa cacaatggac ctgccagagc gaatcatgat gaaccgggtg 2221 tccagccgac tgggtcttac caggatgtgc ttccagccct atacatatag ccagctgcag 2281 cagatcctaa ggtcccggct caagcatcta aaggcctttg aagatgatgc catccagctg 2341 gtagccagga aggtagcagc actgtctgga gatgcacgac ggtgcctgga catctgcagg 2401 cgtgccacag agatctgtga gttctcccag cagaagcctg actcccctgg cctggtcacc 2461 atagcccact caatggaagc tgtggatgag atgttttcat catcatacat cacggccatc 2521 aaaaattcct ctgttctgga acagagcttc ctgagagcca tcctcgcaga gttccgtcga 2581 tcaggactgg aggaagccac gtttcaacag atatatagtc aacatgtggc actgtgcaga 2641 atggagggac tgccgtaccc caccatgtca gagaccatgg ccgtgtgttc tcacctgggc 2701 tcctgtcgcc tcctgcttgt ggagcccagc aggaacgatc tgctccttcg ggtgcggctc 2761 aacgtcagcc aggatgatgt gctgtatgcg ctgaaagacg agtaaagggg cttcacaagt 2821 taaaagactg gggtcttgct gggttttgtt ttttgagaca gggtcttgct ctgtcgccca 2881 ggctggagtg cagtggcacg atcatggctc actgcagcct tgacttctca ggcttaggtg 2941 accccccaac ctcatcctcc caggtggctg aaactacagg cacatgccac catgcccagc 3001 tgattttttg tagagacagg gcttcaccat gttgccaagc tagtctacaa agcatctgat 3061 tttggaagta catggaattg ttgtaacaaa gtatattgaa tggaaatggc tctcatgtat 3121 tttggaattt tccattaaat aatttgcttt tta // LOCUS HSU40215 2074 bp mRNA PRI 05-OCT-1996 DEFINITION Human synapsin IIb mRNA, complete cds. ACCESSION U40215 NID g1594276 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2074) AUTHORS Xie,Y. TITLE Cloning and sequencing analysis of a human synapsin IIb-encoding brain cDNA JOURNAL Gene 173 (2), 289-290 (1996) MEDLINE 97082985 REFERENCE 2 (bases 1 to 2074) AUTHORS Xie,Y. TITLE Direct Submission JOURNAL Submitted (06-NOV-1995) Yong Xie, Biology, Hong Kong University of Science and Technology, Clear Water Bay Rd., Kowloon, Hong Kong, Hong Kong FEATURES Location/Qualifiers source 1..2074 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" CDS 57..1493 /note="peripheral membrane proteins of synaptic vesicles" /codon_start=1 /product="synapsin IIb" /db_xref="PID:g1594277" /translation="MMNFLRRRLSDSSFIANLPNGYMTDLQRPEPQQPPPPPPPGPGA ASASAAPPTASPGPERRPPPASAPAAQPAPTPSVGSSFFSSLSQAVKQTAASAGLVDA PAPAPAAARKAKVLLVVDEPHADWAKCFRGKKVLGDYDIKVEQAEFSELNLVAHADGT YAVDMQVLRNGTKVVRSFRPDFVLIRQHAFGMAENEDFRHLIIGMQYAGLPSINSLES IYNFCDKPWVFAQLVAIYKTLGGEKFPLIEQTYYPNHKEMLTLPTFPVVVKIGHAHSG MGKVKVENHYDFQDIASVVALTQTYATAEPFIDSKYDIRVQKIGNNYKAYMRTSISGN WKTNTGSAMLEQIAMSDRYKLWVDTCSEMFGGLDICAVKAVHGKDGKDYIFEVMDCSM PLIGEHQVEDRQLITELVISKMNQLLSRTPALSPQRPLTTQQPQSGTLKDPDSSKTPP QRPPPQGCLQYILDCNGIAVGPKQVQAS" BASE COUNT 488 a 604 c 530 g 452 t ORIGIN 1 ctccctccgc gccaccagac cccgtagccc cgcgcgcccc cagcccttta agccagatga 61 tgaacttcct gcggcgccgg ctgtcggaca gcagcttcat cgccaacctg cccaacggct 121 acatgaccga cctgcagcgg cccgagcccc agcagccgcc gccgccgccg ccccccggtc 181 cgggcgccgc ctcggcctcg gcggcgcccc cgaccgcctc gccgggcccg gagcggaggc 241 cgccgcccgc ctcggcgccc gccgcgcagc ccgcgccgac gccgtcggtg ggcagcagct 301 tcttcagctc gctgtcccaa gccgtgaagc agacggccgc ctcggctggc ctggtggacg 361 cgcccgctcc cgcgcccgca gccgccagga aggccaaggt gctgctggtg gtcgacgagc 421 cgcacgccga ctgggccaag tgctttcggg gcaaaaaagt ccttggagat tatgatatca 481 aggtggaaca ggcagaattt tcagagctca acctggtggc ccatgcagat ggcacctatg 541 ctgtggatat gcaggttctc cggaatggca caaaggttgt ccggtccttc cggccagact 601 tcgtgctcat ccggcagcat gcatttggca tggcggagaa tgaggacttc cgccacctga 661 tcattggtat gcagtatgca ggcctcccca gcatcaactc actggaatcc atatacaact 721 tctgtgacaa gccatgggtg tttgcccagc tggtcgctat ctataagaca ctgggaggag 781 aaaagttccc tctcattgaa cagacatact accccaacca caaagagatg ctgacactgc 841 ccacgttccc tgtggtggtg aagattggcc acgctcactc aggcatgggc aaggtcaaag 901 tggaaaacca ctacgacttc caggacattg ccagcgtggt ggctctcacc cagacctatg 961 ccactgcaga gcctttcatt gactccaagt atgacatccg ggtccagaag attggcaaca 1021 actacaaggc ttatatgagg acatcgatct cagggaactg gaagacgaac actggctctg 1081 cgatgctgga gcagattgcc atgtcagaca ggtacaaact gtgggtggac acctgctctg 1141 agatgtttgg cggcctggac atctgtgctg tcaaagctgt acatggcaaa gatgggaaag 1201 actacatttt tgaggtcatg gactgtagca tgccactgat tggggaacat caggtggagg 1261 acaggcaact catcaccgaa ctagtcatca gcaagatgaa ccagctgctg tccaggactc 1321 ctgccctgtc tcctcagaga cccctaacaa cccagcagcc acagagcgga acacttaagg 1381 atccggactc aagcaagacc ccacctcagc ggccaccccc tcaaggttgt ttacagtata 1441 ttctcgactg taatggcatt gcagtagggc caaaacaagt ccaagcttct taaaatgatt 1501 ggtggttaat ttttcaaagc agaaatttta agccaaaaac aaacgaaagg aaagcgggga 1561 ggggaaaaca gaccctccca ctggtgccgt tgctgcgttc tttcaatgct gactggactg 1621 tgtttttcct atgacgtgtc agctcctctg tctggttgtt tacctgttcc tgttcgtgct 1681 tgtaatgctc acttatgttt tctctgtata acttgtgatt ccagggctgt ttgtcaacag 1741 tatacaaaag aattgtgcct ctcccaagtc cagtgtgact ttatcttctg ggtggtttga 1801 tagtgttttt aaaagtaata tataatgtgg ggtgaaatgg gagtaggggg tggacagggg 1861 agaaacgaaa accacaaaaa gaaacccaac tcctctcctc cccaagctca gttaaatccc 1921 ccaccttccc aacctttccc tccaccagtg tgcttgggat cttcaatgaa ctgtgctttc 1981 gcttcttctg catgactatt ataactagat agaacattaa gagatttcaa gatcaactcc 2041 atagcttcta ccactgaatt gaggcatcac cttt // LOCUS HSU40223 1651 bp DNA PRI 19-JAN-1996 DEFINITION Human uridine nucleotide receptor (UNR) gene, complete cds. ACCESSION U40223 NID g1117912 KEYWORDS G protein-coupled receptor; purinoceptor; PCR; intronless; UTP. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1651) AUTHORS Nguyen,T., Erb,L., Weisman,G.A., Marchese,A., Heng,H.H., Garrad,R.C., George,S.R., Turner,J.T. and O'Dowd,B.F. TITLE Cloning, expression, and chromosomal localization of the human uridine nucleotide receptor gene JOURNAL J. Biol. Chem. 270 (52), 30845-30848 (1995) MEDLINE 96125054 REFERENCE 2 (bases 1 to 1651) AUTHORS O'Dowd,B.F., Nguyen,T., Marchese,A. and George,S.R. TITLE Direct Submission JOURNAL Submitted (07-NOV-1995) Brian F. O'Dowd, Department of Pharmacology, University of Toronto, 8 Taddle Creek Road, Toronto, Ontario M5S 1A8, Canada FEATURES Location/Qualifiers source 1..1651 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /map="Xq13" gene 391..1488 /gene="UNR" CDS 391..1488 /gene="UNR" /note="intronless coding region; Purinoceptor" /codon_start=1 /product="uridine nucleotide receptor" /db_xref="PID:g1117913" /translation="MASTESSLLRSLGLSPGPGSSEVELDCWFDEDFKFILLPVSYAV VFVLGLGLNAPTLWLFIFRLRPWDATATYMFHLALSDTLYVVSLPTLIYYYAAHNHWP FGTEICKFVRFLFYWNLYCSVLFLTCISVHRYLGICHPLRALRWGRPRLAGLLCLAVW LVVAGCLVPNLFFVTTSTKGTTVLCHDTTRPEEFDHYVHFSSAVMGLLFGVPCLVTLV CYGLMARRLYQPLPGAAQSSSRLRSLRTIAVVLTVFAVCFVPFHITRTIYYLARLLEA DCRVLNIVNVVYKVTRPLASANSCLDPVLYLLTGDKYRRQLRQLCGGGKPQPRTAASS LALVSLPEDSSCRWAATPQDSSCSTPRADRL" BASE COUNT 290 a 522 c 429 g 410 t ORIGIN 1 tcctcttcca ggatatagct gtgatgacga gtcagaagac acttggtctg gtatcttccc 61 acttgatagt gctgggaggc ctccaccctc ttcagccagc caggctctta gggacagagt 121 gagctgcaga gtcagtacaa cccaaataca cgggctgcct gcctgagccc cagcactgcc 181 tgctgcccac caacttccca agctggacca agggaggctt gggtaggggc caggctagcc 241 tgagtgcacc cagatgcgct tctgtcagct ctccctagtg cttcaaccac tgctctccct 301 gctctacttt ttttgctcca gctcagggat gggggtgggc agggaaatcc tgccaccctc 361 acttctcccc ttcccatctc caggggggcc atggccagta cagagtcctc cctgttgaga 421 tccctaggcc tcagcccagg tcctggcagc agtgaggtgg agctggactg ttggtttgat 481 gaggatttca agttcatcct gctgcctgtg agctatgcag ttgtctttgt gctgggcttg 541 ggccttaacg ccccaaccct atggctcttc atcttccgcc tccgaccctg ggatgcaacg 601 gccacctaca tgttccacct ggcattgtca gacaccttgt atgtcgtgtc gctgcccacc 661 ctcatctact attatgcagc ccacaaccac tggccctttg gcactgagat ctgcaagttc 721 gtccgctttc ttttctattg gaacctctac tgcagtgtcc ttttcctcac ctgcatcagc 781 gtgcaccgct acctgggcat ctgccaccca cttcgggcac tacgctgggg ccgccctcgc 841 ctcgcaggcc ttctctgcct ggcagtttgg ttggtcgtag ccggctgcct cgtgcccaac 901 ctgttctttg tcacaaccag caccaaaggg accaccgtcc tgtgccatga caccactcgg 961 cctgaagagt ttgaccacta tgtgcacttc agctcggcgg tcatggggct gctctttggc 1021 gtgccctgcc tggtcactct tgtttgctat ggactcatgg ctcgtcgcct gtatcagccc 1081 ttgccaggcg ctgcacagtc gtcttctcgc ctccgatctc tccgcaccat agctgtggtg 1141 ctgactgtct ttgctgtctg cttcgtgcct ttccacatca cccgcaccat ttactacctg 1201 gccaggctgt tggaagctga ctgccgagta ctgaacattg tcaacgtggt ctataaagtg 1261 actcggcccc tggccagtgc caacagctgc ctggatcctg tgctctactt gctcactggg 1321 gacaaatatc gacgtcagct ccgtcagctc tgtggtggtg gcaagcccca gccccgcacg 1381 gctgcctctt ccctggcact agtgtccctg cctgaggata gcagctgcag gtgggcggcc 1441 accccccagg acagtagctg ctctactcct agggcagata gattgtaaca cgggaagccg 1501 ggaagtgaga gaaaagggga tgagtgcagg gcagaggtga gggaacccaa tagtgatacc 1561 tggtaaggtg cttcttccct cttttcccag ggctcctgga gagaagccct caccctgagg 1621 ttgcatttat tgatttatat catgggtgac c // LOCUS HSU40268 2729 bp mRNA PRI 22-DEC-1995 DEFINITION Human origin recognition complex second largest subunit (hsOrc2) mRNA, complete cds. ACCESSION U40268 NID g1113106 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2729) AUTHORS Gavin,K.A., Hidaka,M. and Stillman,B. TITLE Conserved initiator proteins in eukaryotes JOURNAL Science 270 (5242), 1667-1671 (1995) MEDLINE 96099401 REFERENCE 2 (bases 1 to 2729) AUTHORS Hidaka,M. and Stillman,B. TITLE Direct Submission JOURNAL Submitted (07-NOV-1995) Bruce Stillman, Cold Spring Harbor Laboratory, PO Box 100, Cold Spring Harbor, NY 11724, USA FEATURES Location/Qualifiers source 1..2729 /organism="Homo sapiens" /db_xref="taxon:9606" gene 187..1920 /gene="hsOrc2" CDS 187..1920 /gene="hsOrc2" /note="human origin recognition complex 2 homolog; Method: conceptual translation supplied by author" /codon_start=1 /product="hsOrc2p" /db_xref="PID:g1113107" /translation="MSKPELKEDKMLEVHFVGDDDVLNHILDREGGAKLKKERAHVLV NPKKIIKKPEYDLEEDDQEVLKDQNYVEIMGRDVQESLKNGSATGGGNKVYSFQNRKH SEKMAKLASELAKTPQKSVSFSLKNDPEITINVPQSSKGHSASDKVQPKNNDKSEFLS TAPRSLRKRLIVPRSHSDSESEYSASNSEDDEGVAQEHEEDTNAVIFSQKIQAQNRVV SAPVGKETPSKRMKRDKTSDLVEEYFEAHSSSKVLTSDRTLQKLKRAKLDQQTLRNLL SKVSPSFSAELKQLNQQYEKLFHKWMLQLHLGFNIVLYGLGSKRDLLERFRTTMLQDS IHVVINGFFPGISVKSVLNSITEEVLDHMGTFRSILDQLDWIVNKFKEDSSLELFLLI HNLDSQMLRGEKSQQIIGQLSSLHNIYLIASIDHLNAPLMWDHAKQSLFNWLWYETTT YSPYTEETSYENSLLVKQSGSLPLSSLTHVLRSLTPNARGIFRLLIKYQLDNQDNPSY IGLSFQDFYQQCREAFLVNSDLTLRAQLTEFRDHKLIRTKKGTDGVEYLLIPVDNGTL TDFLEKEEEEA" BASE COUNT 849 a 502 c 608 g 770 t ORIGIN 1 ggcgcgaatt actggaaatt ggcttttccc gttggggccg aaggtacctt ccctgcggcg 61 gcgactcagc ggggtgtcgt tcggccggcg tgacgcagcc ggatcggcgc cagacggaaa 121 cctagcggtg actgtatctg aattttgcag ctgcagaatg tgtagtacct taaaaggttg 181 gcaacaatga gtaaaccaga attaaaggaa gacaagatgc tggaggttca ctttgtggga 241 gatgatgatg ttcttaatca cattctagat agagaaggag gagctaaatt gaagaaggag 301 cgagcgcacg ttttggtcaa ccccaaaaaa ataataaaga agccagaata tgatttggag 361 gaagatgacc aggaggtctt aaaagatcag aactatgtgg aaattatggg aagagatgtt 421 caagaatcat tgaaaaatgg ctctgctaca ggtggtggaa ataaagttta ttcttttcag 481 aatagaaaac actctgaaaa gatggctaaa ttagcttcag aactagcaaa aacaccacaa 541 aaaagtgttt cattcagttt gaagaatgat cctgagatta cgataaacgt tcctcaaagt 601 agcaagggcc attctgcttc agacaaggtt caaccgaaga acaatgacaa aagtgaattt 661 ctgtcaacag cacctcgtag tctaagaaaa agattaatag ttccaaggtc tcattctgac 721 agtgaaagcg aatattctgc ttccaactca gaggatgatg aaggggttgc acaggaacat 781 gaagaggaca ctaatgcagt catattcagc caaaagattc aagctcagaa tagagtagtt 841 tcagctcctg ttggcaaaga aacaccttct aagagaatga aaagagataa aacaagtgac 901 ttagtagaag aatattttga agctcacagc agttcaaaag ttttaacctc tgatagaaca 961 ctgcagaagc taaagagagc taaactggat cagcaaactt tgcgtaactt attgagcaag 1021 gtttcccctt ccttttctgc cgaacttaaa caactaaatc aacagtatga aaaattattt 1081 cataaatgga tgctgcaatt acaccttggg ttcaacattg tgctttatgg tttgggttct 1141 aagagagatt tactagaaag gtttcgaacc actatgctgc aagattccat tcacgttgtc 1201 atcaatggct tctttcctgg aatcagtgtg aaatcagtcc tgaattctat aacagaagaa 1261 gtcctcgatc atatgggtac tttccgcagt atactggatc agctagactg gatagtaaac 1321 aaatttaaag aagattcttc tttagaactc ttccttctca tccacaattt ggatagccag 1381 atgttgagag gagagaagag ccagcaaatc attggtcagt tgtcatcttt gcataacatt 1441 taccttatag catccattga ccacctcaat gctcctctca tgtgggatca tgcaaagcag 1501 agtcttttta actggctctg gtatgaaact actacataca gtccttatac tgaagaaacc 1561 tcctatgaga actctcttct ggtaaagcag tctggatccc tgccacttag ctcccttact 1621 catgtcttac gaagccttac ccctaatgca aggggaattt tcaggctact aataaaatac 1681 cagctggaca accaggataa cccttcttac attggccttt cttttcaaga tttttaccag 1741 cagtgtcggg aggcattcct cgtcaatagt gatctgacac tccgggccca gttaactgaa 1801 tttagggacc acaagcttat aagaacaaag aagggaactg atggagtaga gtatttatta 1861 attcctgttg ataatggaac attgactgat ttcttggaaa aggaagaaga ggaggcttga 1921 agctttcctt tattcttgaa tctcccatgg aagggttgta ccccagctgc cactcctcta 1981 gttgaaagtg ttgtgtttac atctgacatt aaattatttt tccagcatac aagatttaaa 2041 tttgggaagg gggggatgtc ctcaattaga actttttgat cagcctggct ggtaccgtct 2101 agtactatgc agcggtcctc aagttggaga aaatgtgcct ttcattcatt acctctctgg 2161 agacttcttg ctggaatgaa cagtgtgctc agggactatt tggaactgga tgtttttgaa 2221 ttattttata cttagagata ttctgaattt tttgagggcc ttttaacact ccccgagctg 2281 attgtttgca agtgtgtttg ttccagagtg tggaagtata aagacatggg catcacgtaa 2341 attggttttg tttgctattc tgtgtgtcag aaccaacgag tgtaatggag agggcaggtc 2401 atctcttatt gtttctaaaa caacttaaaa ggtgtagatt gggaagaggt gagtgatcca 2461 gctttctcct tttggattga ggctatgtac ttggtggggg caggggaggg aatatattat 2521 aatactattc agttgggata atgggaaaaa cagagtatat agggtatcta cccagcctag 2581 aaagcacagg aacaatacgt catatatttg gaacagttat tgtctgtgcc atgaccttca 2641 tgataccagt gagaagccag gctagagaaa taaaatcctg aattacattt tagtaattgt 2701 tttcaagaca acaaaaaata aaacatttc // LOCUS HSU40282 1786 bp mRNA PRI 28-NOV-1997 DEFINITION Homo sapiens integrin-linked kinase (ILK) mRNA, complete cds. ACCESSION U40282 NID g2648173 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1786) AUTHORS Hannigan,G.E., Leung-Hagesteijn,C., Fitz-Gibbon,L., Coppolino,M.G., Radeva,G., Filmus,J., Bell,J.C. and Dedhar,S. TITLE Regulation of cell adhesion and anchorage-dependent growth by a new beta 1-integrin-linked protein kinase JOURNAL Nature 379 (6560), 91-96 (1996) MEDLINE 96135142 REFERENCE 2 (bases 1 to 1786) AUTHORS Dedhar,S. and Hannigan,G.E. TITLE Direct Submission JOURNAL Submitted (07-NOV-1995) Shoukat Dedhar, Cancer Biology Research, Sunnybrook Health Science Centre and University of Toronto, 2075 Bayview Avenue, North York, Ont. M4N 3M5, Canada FEATURES Location/Qualifiers source 1..1786 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11p15" /cell_line="HeLa" gene 1..1786 /gene="ILK" CDS 157..1512 /gene="ILK" /note="protein serine/threonine kinase" /codon_start=1 /product="integrin-linked kinase" /db_xref="PID:g2648174" /translation="MDDIFTQCREGNAVAVRLWLDNTENDLNQGDDHGFSPLHWACRE GRSAVVEMLIMRGARINVMNRGDDTPLHLAASHGHRDIVQKLLQYKADINAVNEHGNV PLHYACFWGQDQVAEDLVANGALVSICNKYGEMPVDKAKAPLRELLRERAEKMGQNLN RIPYKDTFWKGTTRTRPRNGTLNKHSGIDFKQLNFLTKLNENHSGELWKGRWQGNDIV VKVLKVRDWSTRKSRDFNEECPRLRIFSHPNVLPVLGACQSPPAPHPTLITHWMPYGS LYNVLHEGTNFVVDQSQAVKFALDMARGMAFLHTLEPLIPRHALNSRSVMIDEDMTAR ISMADVKFSFQCPGRMYAPAWVAPEALQKKPEDTNRRSADMWSFAVLLWELVTREVPF ADLSNMEIGMKVALEGLRTIPPGISPHVCKLMKICMNEDPAKRPKFDMIVPILEKMQD K" BASE COUNT 443 a 486 c 479 g 378 t ORIGIN 1 gaattcatct gtcgactgct accacgggag ttccccggag aaggatcctg cagcccgagt 61 cccgaggata aagcttgggg ttcatcctcc ttccctggat cactccacag tcctcaggct 121 tccccaatcc aggggactcg gcgccgggac gctgctatgg acgacatttt cactcagtgc 181 cgggagggca acgcagtcgc cgttcgcctg tggctggaca acacggagaa cgacctcaac 241 cagggggacg atcatggctt ctcccccttg cactgggcct gccgagaggg ccgctctgct 301 gtggttgaga tgttgatcat gcggggggca cggatcaatg taatgaaccg tggggatgac 361 acccccctgc atctggcagc cagtcatgga caccgtgata ttgtacagaa gctattgcag 421 tacaaggcag acatcaatgc agtgaatgaa cacgggaatg tgcccctgca ctatgcctgt 481 ttttggggcc aagatcaagt ggcagaggac ctggtggcaa atggggccct tgtcagcatc 541 tgtaacaagt atggagagat gcctgtggac aaagccaagg cacccctgag agagcttctc 601 cgagagcggg cagagaagat gggccagaat ctcaaccgta ttccatacaa ggacacattc 661 tggaagggga ccacccgcac tcggccccga aatggaaccc tgaacaaaca ctctggcatt 721 gacttcaaac agcttaactt cctgacgaag ctcaacgaga atcactctgg agagctatgg 781 aagggccgct ggcagggcaa tgacattgtc gtgaaggtgc tgaaggttcg agactggagt 841 acaaggaaga gcagggactt caatgaagag tgtccccggc tcaggatttt ctcgcatcca 901 aatgtgctcc cagtgctagg tgcctgccag tctccacctg ctcctcatcc tactctcatc 961 acacactgga tgccgtatgg atccctctac aatgtactac atgaaggcac caatttcgtc 1021 gtggaccaga gccaggctgt gaagtttgct ttggacatgg caaggggcat ggccttccta 1081 cacacactag agcccctcat cccacgacat gcactcaata gccgtagtgt aatgattgat 1141 gaggacatga ctgcccgaat tagcatggct gatgtcaagt tctctttcca atgtcctggt 1201 cgcatgtatg cacctgcctg ggtagccccc gaagctctgc agaagaagcc tgaagacaca 1261 aacagacgct cagcagacat gtggagtttt gcagtgcttc tgtgggaact ggtgacacgg 1321 gaggtaccct ttgctgacct ctccaatatg gagattggaa tgaaggtggc attggaaggc 1381 cttcgtacca tcccaccagg tatttcccct catgtgtgta agctcatgaa gatctgcatg 1441 aatgaagacc ctgcaaagcg acccaaattt gacatgattg tgcctatcct tgagaagatg 1501 caggacaagt aggactggaa ggtccttgcc tgaactccag aggtgtcggg acatggttgg 1561 gggaatgcac ctccccaaag cagcaggcct ctggttgcct cccccgcctc cagtcatggt 1621 actaccccag cctggggtcc atccccttcc cccatcccta ccactgtgcg caagaggggc 1681 gggctcagag ctttgtcact tgccacatgg tgtcttccaa catgggaggg atcagccccg 1741 cctgtcacaa taaagtttat tatgaaaaaa aaaaaaaaaa aaaaaa // LOCUS HSU40379 1392 bp mRNA PRI 12-AUG-1996 DEFINITION Human presenilin I-463 (AD3-3) mRNA, complete cds. ACCESSION U40379 NID g1244637 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1392) AUTHORS Sahara,N., Yahagi,Y., Takagi,H., Kondo,T., Okochi,M., Usami,M., Shirasawa,T. and Mori,H. TITLE Identification and characterization of presenilin I-467, I-463 and I-374 JOURNAL FEBS Lett. 381 (1-2), 7-11 (1996) MEDLINE 96193901 REFERENCE 2 (bases 1 to 1392) AUTHORS Shirasawa,T. TITLE Direct Submission JOURNAL Submitted (07-NOV-1995) Takuji Shirasawa, Molecular Pathology, Tokyo Metropolitan Institute of Gerontology, 35-2 Sakae-cho, Itabashi-ku, Tokyo, 175, Japan FEATURES Location/Qualifiers source 1..1392 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="AD3-3" /chromosome="14" /map="14q24.3" gene 1..1392 /gene="AD3-3" CDS 1..1392 /gene="AD3-3" /codon_start=1 /product="presenilin I-463" /db_xref="PID:g1244638" /translation="MTELPAPLSYFQNAQMSEDNHLSNTNDNRERQEHNDRRSLGHPE PLSNGRPQGNSRQVVEQDEEEDEELTLKYGAKHVIMLFVPVTLCMVVVVATIKSVSFY TRKDGQLIYTPFTEDTETVGQRALHSILNAAIMISVIVVMTILLVVLYKYRCYKVIHA WLIISSLLLLFFFSFIYLGEVFKTYNVAVDYITVALLIWNFGVVGMISIHWKGPLRLQ QAYLIMISALMALVFIKYLPEWTAWLILAVISVYDLVAVLCPKGPLRMLVETAQERNE TLFPALIYSSTMVWLVNMAEGDPEAQRRVSKNSKYNAESTERESQDTVAENDDGGFSE EWEAQRDSHLGPHRSTPESRAAVQELSSSILAGEDPEERGVKLGLGDFIFYSVLVGKA SATASGDWNTTIACFVAILIGLCLTLLLLAIFKKALPALPISITFGLVFYFATDYLVQ PFMDQLAFHQFYI" BASE COUNT 359 a 310 c 333 g 390 t ORIGIN 1 atgacagagt tacctgcacc gttgtcctac ttccagaatg cacagatgtc tgaggacaac 61 cacctgagca atactaatga caatagagaa cggcaggagc acaacgacag acggagcctt 121 ggccaccctg agccattatc taatggacga ccccagggta actcccggca ggtggtggag 181 caagatgagg aagaagatga ggagctgaca ttgaaatatg gcgccaagca tgtgatcatg 241 ctctttgtcc ctgtgactct ctgcatggtg gtggtcgtgg ctaccattaa gtcagtcagc 301 ttttataccc ggaaggatgg gcagctaatc tataccccat tcacagaaga taccgagact 361 gtgggccaga gagccctgca ctcaattctg aatgctgcca tcatgatcag tgtcattgtt 421 gtcatgacta tcctcctggt ggttctgtat aaatacaggt gctataaggt catccatgcc 481 tggcttatta tatcatctct attgttgctg ttcttttttt cattcattta cttgggggaa 541 gtgtttaaaa cctataacgt tgctgtggac tacattactg ttgcactcct gatctggaat 601 tttggtgtgg tgggaatgat ttccattcac tggaaaggtc cacttcgact ccagcaggca 661 tatctcatta tgattagtgc cctcatggcc ctggtgttta tcaagtacct ccctgaatgg 721 actgcgtggc tcatcttggc tgtgatttca gtatatgatt tagtggctgt tttgtgtccg 781 aaaggtccac ttcgtatgct ggttgaaaca gcccaggaga gaaatgaaac gctttttcca 841 gctctcattt actcctcaac aatggtgtgg ttggtgaata tggcagaagg agacccggaa 901 gctcaaagga gagtatccaa aaattccaag tataatgcag aaagcacaga aagggagtca 961 caagacactg ttgcagagaa tgatgatggc gggttcagtg aggaatggga agcccagagg 1021 gacagtcatc tagggcctca tcgctctaca cctgagtcac gagctgctgt ccaggaactt 1081 tccagcagta tcctcgctgg tgaagaccca gaggaaaggg gagtaaaact tggattggga 1141 gatttcattt tctacagtgt tctggttggt aaagcctcag caacagccag tggagactgg 1201 aacacaacca tagcctgttt cgtagccata ttaattggtt tgtgccttac attattactc 1261 cttgccattt tcaagaaagc attgccagct cttccaatct ccatcacctt tgggcttgtt 1321 ttctactttg ccacagatta tcttgtacag ccttttatgg accaattagc attccatcaa 1381 ttttatatct ag // LOCUS HSU40434 2114 bp mRNA PRI 19-JAN-1996 DEFINITION Human mesothelin or CAK1 antigen precursor mRNA, complete cds. ACCESSION U40434 NID g1145723 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2114) AUTHORS Chang,K. and Pastan,I. TITLE Molecular cloning of mesothelin, a differentiation antigen present on mesothelium, mesotheliomas, and ovarian cancers JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (1), 136-140 (1996) MEDLINE 96133892 REFERENCE 2 (bases 1 to 2114) AUTHORS Chang,K. TITLE Direct Submission JOURNAL Submitted (09-NOV-1995) Kai Chang, Laboratory of Molecular Biology, National Cancer Institute, Building 37, Room 4B19, 37 Convent Drive, MSC4255, Bethesda, MD MD20892-4255, USA FEATURES Location/Qualifiers source 1..2114 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pcD3CAK1-9" /chromosome="16" /cell_line="KB, HeLa, OVCAR-3" /cell_type="HeLa" /tissue_type="ovarian cancers and squamous cell carcinoma." CDS 100..1986 /codon_start=1 /product="mesothelin or CAK1 antigen precursor" /db_xref="PID:g1145724" /translation="MALQRLDPCWSCGDRPGSLLFLLFSLGWVHPARTLAGETGTESA PLGGVLTTPHNISSLSPRQLLGFPCAEVSGLSTERVRELAVALAQKNVKLSTEQLRCL AHRLSEPPEDLDALPLDLLLFLNPDAFSGPQACTRFFSRITKANVDLLPRGAPERQRL LPAALACWGVRGSLLSEADVRALGGLACDLPGRFVAESAEVLLPRLVSCPGPLDQDQQ EAARAALQGGGPPYGPPSTWSVSTMDALRGLLPVLGQPIIRSIPQGIVAAWRQRSSRD PSWRQPERTILRPRFRREVEKTACPSGKKAREIDESLIFYKKWELEACVDAALLATQM DRVNAIPFTYEQLDVLKHKLDELYPQGYPESVIQHLGYLFLKMSPEDIRKWNVTSLET LKALLEVDKGHEMSPQAPRRPLPQVATLIDRFVKGRGQLDKDTLDTLTAFYPGYLCSL SPEELSSVPPSSIWAVRPQDLDTCDPRQLDVLYPKARLAFQNMNGSEYFVKIQSFLGG APTEDLKALSQQNVSMDLATFMKLRTDAVLPLTVAEVQKLLGPHVEGLKAEERHRPVR DWILRQRQDDLDTLGLGLQGGIPNGYLVLDLSVQETLSGTPCLLGPGPVLTVLALLLA STLA" sig_peptide 100..192 mat_peptide 193..1857 /product="mesothelin" misc_feature 193..978 /note="encodes putative cleaved N-terminal portion of mesothelin" misc_feature 973..978 /note="encodes putative proteolytic site" misc_feature 979..1983 /note="encodes putative membrane bound portion of mesothelin, bearing epitope(s) recognized by MAb K1" misc_feature 1858..1983 /note="encodes putative hydrophobic region for GPI anchorage" polyA_signal 2087..2093 /note="putative" polyA_site 2114 BASE COUNT 361 a 722 c 661 g 370 t ORIGIN 1 aggaattccg gtggccggcc actcccgtct gctgtgacgc gcggacagag agctaccggt 61 ggacccacgg tgcctccctc cctgggatct acacagacca tggccttgca acggctcgac 121 ccctgttggt cctgtgggga ccgccctggc agcctcctgt tcctgctctt cagcctcgga 181 tgggtgcatc ccgcgaggac cctggctgga gagacaggga cggagtctgc ccccctgggg 241 ggagtcctga caacccccca taacatttcc agcctctccc ctcgccaact ccttggcttc 301 ccgtgtgcgg aggtgtccgg cctgagcacg gagcgtgtcc gggagctggc tgtggccttg 361 gcacagaaga atgtcaagct ctcaacagag cagctgcgct gtctggctca ccggctctct 421 gagccccccg aggacctgga cgccctccca ttggacctgc tgctattcct caacccagat 481 gcgttctcgg ggccccaggc ctgcacccgt ttcttctccc gcatcacgaa ggccaatgtg 541 gacctgctcc cgaggggggc tcccgagcga cagcggctgc tgcctgcggc tctggcctgc 601 tggggtgtgc gggggtctct gctgagcgag gctgatgtgc gggctctggg aggcctggct 661 tgcgacctgc ctgggcgctt tgtggccgag tcggccgaag tgctgctacc ccggctggtg 721 agctgcccgg gacccctgga ccaggaccag caggaggcag ccagggcggc tctgcagggc 781 gggggacccc cctacggccc cccgtcgaca tggtctgtct ccacgatgga cgctctgcgg 841 ggcctgctgc ccgtgctggg ccagcccatc atccgcagca tcccgcaggg catcgtggcc 901 gcgtggcggc aacgctcctc tcgggaccca tcctggcggc agcctgaacg gaccatcctc 961 cggccgcggt tccggcggga agtggagaag acagcctgtc cttcaggcaa gaaggcccgc 1021 gagatagacg agagcctcat cttctacaag aagtgggagc tggaagcctg cgtggatgcg 1081 gccctgctgg ccacccagat ggaccgcgtg aacgccatcc ccttcaccta cgagcagctg 1141 gacgtcctaa agcataaact ggatgagctc tacccacaag gttaccccga gtctgtgatc 1201 cagcacctgg gctacctctt cctcaagatg agccctgagg acattcgcaa gtggaatgtg 1261 acgtccctgg agaccctgaa ggctttgctt gaagtcgaca aagggcacga aatgagtcct 1321 caggctcctc ggcggcccct cccacaggtg gccaccctga tcgaccgctt tgtgaaggga 1381 aggggccagc tagacaaaga caccctagac accctgaccg ccttctaccc tgggtacctg 1441 tgctccctca gccccgagga gctgagctcc gtgcccccca gcagcatctg ggcggtcagg 1501 ccccaggacc tggacacgtg tgacccaagg cagctggacg tcctctatcc caaggcccgc 1561 cttgctttcc agaacatgaa cgggtccgaa tacttcgtga agatccagtc cttcctgggt 1621 ggggccccca cggaggattt gaaggcgctc agtcagcaga atgtgagcat ggacttggcc 1681 acgttcatga agctgcggac ggatgcggtg ctgccgttga ctgtggctga ggtgcagaaa 1741 cttctgggac cccacgtgga gggcctgaag gcggaggagc ggcaccgccc ggtgcgggac 1801 tggatcctac ggcagcggca ggacgacctg gacacgctgg ggctggggct acagggcggc 1861 atccccaacg gctacctggt cctagacctc agcgtgcaag agaccctctc ggggacgccc 1921 tgcctcctag gacctggacc tgttctcacc gtcctggcac tgctcctagc ctccaccctg 1981 gcctgagggc cccactccct tgctggcccc agccctgctg gggatccccg cctggccagg 2041 agcaggcacg ggtgatcccc gttccacccc aagagaactc gcgctcagta aacgggaaca 2101 tgccccctgc agac // LOCUS HSU40462 3629 bp mRNA PRI 30-APR-1996 DEFINITION Human Ikaros/LyF-1 homolog (hIk-1) mRNA, complete cds. ACCESSION U40462 NID g1289370 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3629) AUTHORS Nietfeld,W. and Meyerhans,A. TITLE Cloning and sequencing of hIk-1, a cDNA encoding a human homologue of mouse Ikaros/LyF-1 JOURNAL Immunol. Lett. 49 (1-2), 139-141 (1996) MEDLINE 96252222 REFERENCE 2 (bases 1 to 3629) AUTHORS Nietfeld,W. TITLE Direct Submission JOURNAL Submitted (10-NOV-1995) Wilfried Nietfeld, Department of Virology, University of Freiburg, Institute for Medical Microbiology and Hygiene, Hermann-Herder-Strasse 11, Freiburg 79104, Germany FEATURES Location/Qualifiers source 1..3629 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="bone marrow" gene 169..1728 /gene="hIk-1" CDS 169..1728 /gene="hIk-1" /note="similar to mouse LyF-1, encoded by GenBank Accession Number S74708; similar to mouse Ikaros DNA-binding protein, Swiss-Prot Accession Number Q03267" /codon_start=1 /product="Ikaros/LyF-1 homolog" /db_xref="PID:g1289371" /translation="MDADEGQDMSQVSGKESPPVSDTPDEGDEPMPIPEDLSTTSGGQ QSSKSDRVVASNVKVETQSDEENGRACEMNGEECAEDLRMLDASGEKMNGSHRDQGSS ALSGVGGIRLPNGKLKCDICGIICIGPNVLMVHKRSHTGERPFQCNQCGASFTQKGNL LRHIKLHSGEKPFKCHLCNYACRRRDALTGHLRTHSVGKPHKCGYCGRSYKQRSSLEE HKERCHNYLESMGLPGTLYPVIKEETNHSEMAEDLCKIGSERSLVLDRLASNVAKRKS SMPQKFLGDKGLSDTPYDSSASYEKENEMMKSHVMDQAINNAINYLGAESLRPLVQTP PGGSEVVPVISPMYQLHKPLAEGTPRSNHSAQDSAVENLLLLSKAKLVPSEREASPSN SCQDSTDTESNNEEQRSGLIYLTNHIAPHARNGLSLKEEHRAYDLLRAASENSQDALR VVSTSGEQMKVYKCEHCRVLFLDHVMYTIHMGCHGFRDPFECNMCGYHSQDRYEFSSH ITRGEHRFHMS" BASE COUNT 917 a 1002 c 936 g 773 t 1 others ORIGIN 1 gaattccggc gtcgcggacg catcccagtc tgggcgggac gctcggccgc ggcgaggcgg 61 gcaagcctgg cagggcagag ggagccccgg ctccgaggtt gctcttcgcc cccgaggatc 121 agtcttggcc ccaaagcgcg acgcacaaat ccacataacc tgaggaccat ggatgctgat 181 gagggtcaag acatgtccca agtttcaggg aaggaaagcc cccctgtaag cgatactcca 241 gatgagggcg atgagcccat gccgatcccc gaggacctct ccaccacctc gggaggacag 301 caaagctcca agagtgacag agtcgtggcc agtaatgtta aagtagagac tcagagtgat 361 gaagagaatg ggcgtgcctg tgaaatgaat ggggaagaat gtgcggagga tttacgaatg 421 cttgatgcct cgggagagaa aatgaatggc tcccacaggg accaaggcag ctcggctttg 481 tcgggagttg gaggcattcg acttcctaac ggaaaactaa agtgtgatat ctgtgggatc 541 atttgcatcg ggcccaatgt gctcatggtt cacaaaagaa gccacactgg agaacggccc 601 ttccagtgca atcagtgcgg ggcctcattc acccagaagg gcaacctgct ccggcacatc 661 aagctgcatt ccggggagaa gcccttcaaa tgccacctct gcaactacgc ctgccgccgg 721 agggacgccc tcactggcca cctgaggacg cactccgttg gtaaacctca caaatgtgga 781 tattgtggcc gaagctataa acagcgaagc tctttagagg aacataaaga gcgctgccac 841 aactacttgg aaagcatggg ccttccgggc acactgtacc cagtcattaa agaagaaact 901 aatcacagtg aaatggcaga agacctgtgc aagataggat cagagagatc tctcgtgctg 961 gacagactag caagtaacgt cgccaaacgt aagagctcta tgcctcagaa atttcttggg 1021 gacaagggcc tgtccgacac gccctacgac agcagcgcca gctacgagaa ggagaacgaa 1081 atgatgaagt cccacgtgat ggaccaagcc atcaacaacg ccatcaacta cctgggggcc 1141 gagtccctgc gcccgctggt gcagacgccc ccgggcggtt ccgaggtggt cccggtcatc 1201 agcccgatgt accagctgca caagccgctc gcggagggca ccccgcgctc caaccactcg 1261 gcccaggaca gcgccgtgga gaacctgctg ctgctctcca aggccaagtt ggtgccctcg 1321 gagcgcgagg cgtccccgag caacagctgt caagactcca cggacaccga gagcaacaac 1381 gaggagcagc gcagcggtct catctacctg accaaccaca tcgccccgca cgcgcgcaac 1441 ggcttgtcgc tcaaggagga gcaccgcgcc tacgacctgc tgcgcgccgc ctccgagaac 1501 tcgcaggacg cgctccgcgt ggtcagcacc agcggggagc agatgaaggt gtacaagtgc 1561 gaacactgcc gggtgctctt cctggatcac gtcatgtaca ccatccacat gggctgccac 1621 ggcttccgtg atccttttga gtgcaacatg tgcggctacc acagccagga ccggtacgag 1681 ttctcgtcgc acataacgcg aggggagcac cgcttccaca tgagctaaag ccctcccgcg 1741 cccccacccc agaccccgag ccaccccagg aaaagcacaa ggactgccgc cttctcgctc 1801 ccgccagcag catagactgg actggaccag acaatgttgt gtttggattt gtaactgttt 1861 tttgtttttt gtttgagttg gttgattggg gtttgatttg cttttgaaaa gatttttatt 1921 tttagaggca gggctgcatt gggagcatcc agaactgcta ccttcctaga tgtttcccca 1981 gacgctggct gagattccct cacctgtcgc ttcctagaat ccccttctcc aaacgattag 2041 tctaaatttt cagagagaaa tagataaaac acgccacagc ctgggaagga gcgtgctcta 2101 ccctgtgcta agcacggggt tcgcgcacca ggtgtctttt tccagtcccc agaagcagag 2161 agcacagccc ctgctgtgtg ggtctgcagg tgagcagaca ggacaggtgt gccgccaccc 2221 aagtgccaag acacagcagg gccaacaacc tgtgcccagg ccagcttcga gctacatgca 2281 tctagggcgg agaggctgca cttgtgagag aaaatactta tttcaagtca tattctgcgg 2341 taggaaaatg attgggttgg ggaaagtcgg tgtctgtcag actgccctgg gtggagggag 2401 acgccgggtt agagcctttg ggatcgtcct ggattcactg gcttggggga ggctgttcag 2461 atggcctgag cctcccgagg cttgctgccc cgtaggagga gactgtcttc ccgtgggcat 2521 atctggggag ccctgttccc cgctttttca ctcccatacc tttaatggcc cccaaaatct 2581 gtcactacaa tttaaacacc agtcccgaaa tttggatctt ctttcttttt gaatctctca 2641 aacggcaaca ttcctcagaa accaaagctt tatttcaaat ctcttccttc cctggctggt 2701 tccatctagt accagaggcc tcttttcctg aagaaatcca atcctagccc tcattttaat 2761 tatgtacatc tgtttgtagc cacaagcctg aatttctcag tgttggtaag tttctttacc 2821 taccctcact atatattatt ctcgttttaa aacccataaa ggagtgattt agaacatcat 2881 taattttcca actcaatgaa aatatgtgaa gcccagcatc tctgttgcta acacacagag 2941 ctcacctgtt gaaacccaag ctttcaaaca tgttgaagct ctttactgta aaggcaagcc 3001 agcatgtgtg tccacacata cataggatgg ctggctctgc acctgtagga tattggaatg 3061 cacagggcaa ttgagggnct gagccagacc ttcggagagt aatgccacca gatcccctag 3121 gaaagaggag gcaaatggca ctgcaggtga gaaccccgcc catccgtgct atgacatgga 3181 ggcactgaag cccgaggaag gtgtgtggag attctaatcc caacaagcaa gggtctcctt 3241 caagattaat gctatcaatc attaaggtca ttactctcaa ccacctaggc aatgaagaat 3301 ataccatttc aaatatttac agtacttgtc ttcaccaaca ctgtcccaag gtgaaatgaa 3361 gcaacagaga ggaaattgta cataagtacc tcagcattta atccaaacag gggttcttag 3421 tctcagcact atgacatttt gggctgacta cttatttgtt aggcgggagc tctcctgtgc 3481 attgtaggat aattagcagt atccctggtg gctacccaat agacgccagt agcaccccga 3541 attgacaacc caaactctcc agacatcacc aactgtcccc tgcgaggaga aatcactcct 3601 gggggagaac cactgaccca aatgaattc // LOCUS HSU40490 4232 bp mRNA PRI 16-DEC-1997 DEFINITION Human nicotinamide nucleotide transhydrogenase mRNA, nuclear gene encoding mitochondrial protein, complete cds. ACCESSION U40490 NID g1110519 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4232) AUTHORS Ware,J. and Zieger,B. TITLE Cloning and deduced amino acid sequence of human nicotinamide nucleotide transhydrogenase JOURNAL DNA Seq. 7, 369-374 (1997) REFERENCE 2 (bases 1 to 4232) AUTHORS Ware,J. and Zieger,B. TITLE Direct Submission JOURNAL Submitted (10-NOV-1995) Jerry Ware, Molecular and Experimental Medicine, The Scripps Research Institute, 10666 North Torrey Pines Road, La Jolla, CA 92037, USA FEATURES Location/Qualifiers source 1..4232 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 144..272 CDS 144..3404 /codon_start=1 /product="nicotinamide nucleotide transhydrogenase" /db_xref="PID:g1110520" /translation="MANLLKTVVTGCSCPLLSNLGSCKGLRVKKDFLRTFYTHQELWC KAPVKPGIPYKQLTVGVPKEIFQNEKRVALSPAGVQNLVKQGFNVVVESGAGEASKFS DDHYRVAGAQIQGAKEVLASDLVVKVRAPMVNPTLGVHEADLLKTSGTLISFIYPAQN PELLNKLSQRKTTVLAMDQVPRVTIAQGYDALSSMANIAGYKAVVLAANHFGRFFTGQ ITAAGKVPPAKILIVGGGVAGLASAGAEKSMGAIVRGFDTRAASLEQFKSLGAEPLEV DLKESGEGQGGYAKEMSKEFIEAEMKLFAQQCKEVDILISTALIPGKKAPVLFNKEMI ESMKEGSVVVDLAAEAGGNFETTKPGELYIHKGITHIGYTDLPSRMATQASTLYSNNI TKLLKAISPDKDNFYFDVKDDFDFGTMGHVIRGTVVMKDGKVIFPAPTPKNIPQGAPV KQKTVAELEAEKAATITPFRKTMSTASAYTAGLTGILGLGIAAPNLAFSQMVTTFGLA GIVGYHTVWGVTPALHSPLMSVTNAISGLTAVGGLALMGGHLYPSTTSQGLAALAAFI SSVNIAGGFLVTQRMLDMFKRPTDPPEYNYLYLLPAGTFVGGYLAALYSGYNIEQIMY LGSGLCCVGALAGLSTQGTARLGNALGMIGVAGGLAATLGVLKPGPELLAQMSGAMAL GGTIGLTIAKRIQISDLPQLVAASHSLVGLAAVLTCIAEYIIEYPHFAPDAAANLTKI VAYLGTYIGGVTFSGSLIAYGKLQGLLKSAPLLLPGRHLLNAGLLAASVGGIIPFMVD PSFTTGITCLGSVSALSAVMGVTLTAAIGGADMPVVITVLNSYSGWALCAEGFLLNNN LLTIVGALIGSSGAPLSYIMCVAMNRSLANVILGGYGTTSTAGGKPMEISGTHTEINL DNAIDMIREANSIIFTPGYGLCAAKAQYPIADLVKMLTEQGKKVRFGIHPVAGRMPGQ LNVLLAEAGVPYDIVLEMDEINHDFPDTDLVLVIGANDTVNSAAQEDPNSIIAGMPVL EVWKSKQVIVMKRSLGVGYAAVDNPIFYKPNTAMLLGDAKKTCDALQAKVRESYQK" mat_peptide 273..3401 /product="nicotinamide nucleotide transhydrogenase" BASE COUNT 1147 a 909 c 991 g 1185 t ORIGIN 1 gcgccgcggg gcccaagccc gggtctgcca gcgcgagctc ctctcgcggc cctcagggca 61 cagcccaagg ctgtcagcct cccggcccag tgatttgcct tcaaggaaac tggggagtca 121 gaaaattggg aactcatatc aacatggcaa acctattgaa aacagtggtg actggctgct 181 cgtgtcctct acttagcaat ttggggtcct gtaagggtct acgtgtgaag aaggattttt 241 tacgaacatt ttatactcac caagaactgt ggtgtaaagc gcctgtaaaa ccaggaattc 301 catataagca actgactgtt ggagtcccca aagagatatt ccaaaatgag aagcgagtgg 361 cattgtctcc tgctggtgtt cagaacttgg tcaagcaggg ttttaatgtt gtcgtggaat 421 cgggtgcggg cgaagcttcc aagttctcag atgatcacta tagagtggca ggtgcccaaa 481 tccaaggggc aaaggaagtg ctggcttctg atttggtggt caaagtgcga gcccctatgg 541 ttaatccaac attaggtgtt catgaagctg accttttaaa gacatcagga acgctgatta 601 gttttattta cccagcccaa aatccagagt tgctaaataa actttcccaa agaaaaacta 661 cagttctggc aatggaccag gttccaagag tcacaattgc tcagggatat gatgcgctaa 721 gctccatggc caacattgcg ggttataagg ctgttgtcct agcagcaaat cattttggac 781 gtttttttac tggtcagatc acagctgctg gaaaagttcc tccagctaag attctgatag 841 ttggtggtgg tgttgctggg cttgcttctg caggcgccga aaagtcgatg ggtgcaattg 901 ttcgaggatt tgacacaaga gctgccagtt tggaacagtt caagtctctt ggtgctgagc 961 ccttggaggt ggacttgaag gaatctggtg agggacaagg aggatatgca aaagagatgt 1021 ccaaagagtt cattgaagct gaaatgaaac tctttgctca acaatgcaag gaggtagaca 1081 tccttatcag cacagcactt attccaggta aaaaagctcc agttttattt aataaagaaa 1141 tgattgagtc aatgaaggaa ggttcagttg ttgtggattt agctgctgag gctggtggaa 1201 actttgaaac cactaagcca ggagaactct acattcataa gggaattact cacataggct 1261 acacagacct gcccagccga atggccactc aggccagcac cctatattcc aacaacatca 1321 ccaaactcct gaaggccatc agcccggaca aagataattt ttattttgat gtgaaagatg 1381 actttgactt tggtacgatg ggtcatgtca ttagaggaac tgtagtgatg aaagatggta 1441 aagtgatttt cccagctccc acaccgaaaa atattcctca aggtgcccca gtaaaacaga 1501 agacagtggc tgagctggaa gctgaaaaag cagctaccat tacacccttc aggaagacaa 1561 tgtcaacggc ttctgcatat acagcaggtc tcacagggat actgggtttg ggcattgcgg 1621 ctcccaatct agccttttct cagatggtga ccacttttgg cttggctggc attgtggggt 1681 atcataccgt ctggggagtg acccctgctc tccactcacc actgatgtct gtgacaaatg 1741 caatctcagg gctgactgca gttggtgggt tggcactgat gggaggacat ttgtatcctt 1801 ccacaacttc tcagggcctt gctgctcttg ctgcattcat atcctctgtc aacattgcag 1861 gtggctttct ggtgactcag agaatgctgg acatgttcaa gcgtcccact gaccccccag 1921 aatacaacta cctgtacctg ctccctgccg gcacctttgt tggtggatat ttagctgccc 1981 tctacagtgg ttataacatt gaacagatca tgtacctagg ctcgggtttg tgctgtgtcg 2041 gtgccttggc tggcctctcc acccagggaa cagcacgtct tggcaatgca ctgggcatga 2101 ttggggttgc tggaggactg gcagccaccc tcggagtcct aaaaccgggc ccagaattac 2161 tagctcagat gtctggagcg atggctttgg gtggtaccat tggattgaca attgccaaac 2221 gcatccagat ttctgattta cctcaattag ttgctgcttc acacagttta gtgggtttgg 2281 cagctgtact tacttgcata gctgagtaca ttatagaata tccacatttt gctccggatg 2341 cagcagcaaa tctcaccaag attgtggcct acctcggcac ttacattggt ggcgtcacct 2401 ttagtgggtc tctcattgcc tatggaaaat tgcagggtct cctgaaatct gcccctctcc 2461 tactgcctgg aaggcactta ctcaatgcag gcttactggc tgctagtgtg ggcgggataa 2521 tcccattcat ggtggaccca agctttacta ctggcatcac ctgtctgggt tcagtgtctg 2581 ctctctctgc tgtcatgggt gtgactttga cagctgctat tgggggtgct gacatgcccg 2641 tcgttatcac tgtgctgaac agctactcag gctgggccct gtgtgcagag ggcttcctgc 2701 tcaacaacaa tctgctgacc atcgtgggtg cactcatagg ctcgtctggt gctcccctgt 2761 catacatcat gtgtgtggca atgaatcgct ccctggctaa tgtgattctt ggaggctatg 2821 gcaccacttc aacagctggt ggaaaaccca tggaaatttc tggcacacat acggaaatca 2881 accttgacaa tgcaattgac atgattcgag aagctaatag cattattttt acaccaggct 2941 atggtctctg tgcagccaaa gctcaatacc ccattgctga tttggtaaag atgctcactg 3001 agcaaggcaa aaaagtcagg tttggaattc acccagttgc aggccgaatg cctggtcagc 3061 ttaatgtgct gctggctgag gctggtgtgc catatgacat tgtgttggaa atggatgaga 3121 tcaaccatga ttttccagat actgatttgg tccttgtaat tggagctaat gacactgtta 3181 attcagcagc tcaagaagat cccaactcta ttattgcagg catgccagtc cttgaggtct 3241 ggaaatcaaa gcaggtgatt gttatgaaga ggtctttggg tgttggctat gctgcagtgg 3301 acaatccaat cttctacaaa cctaacacgg ccatgcttct aggtgatgcc aagaaaacat 3361 gtgacgcgct ccaggcgaaa gttagagaat cctatcagaa gtaaatatta aggatcaagc 3421 tgttagctaa tatgccacct ctgcagtttt gggaacaggc aaataaagta tcagtataca 3481 tggtgatgta catctgtagc aaagctcttg gagcaaaatg aagactgaag aaagcaaagc 3541 aaaaactgta tagagagatt tttcaaaagc agtaatccct caattttaaa aaaggattga 3601 aaattctaaa tgtctttctg tgcatatttt ttgtgttagg aatcaaaagt attttataaa 3661 aggagaaaga acagcctcat tttagatgta gtcctgttgg attttttatg cctcctcagt 3721 aaccagaaat gttttaaaaa actaagtgtt taggatttca agacaacatt atacatggct 3781 ctgaaatatc tgacacaatg taaacattgc aggcacctgc attttatgtt ttttttttca 3841 acaaatgtga ctaatttgaa acttttatga acttctgagc tgtccccttg caattcaacc 3901 gcagtttgaa ttaatcatat caaatcagtt ttaatttttt aaattgtact tcagagtcta 3961 tatttcaagg gcacattttc tcactactat tttaatacat taaaggacta aataatcttt 4021 cagagatgct ggaaacaaat cacttgcttt atatgtttca ttagaatacc aatgaaacat 4081 acaacttgaa aattagtaat agtattttgg aagatcccat ttctaattgg agatctcttt 4141 aatttcgatc aacttataat gtgtagtact atattaagtg cacttgagtg ggattcaaca 4201 tttgactaat aaaatgagtt catcatgttg gc // LOCUS HSU40571 2110 bp mRNA PRI 25-APR-1996 DEFINITION Human alpha1-syntrophin (SNT A1) mRNA, complete cds. ACCESSION U40571 NID g1145727 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2110) AUTHORS Ahn,A.H., Freener,C.A., Gussoni,E., Yoshida,M., Ozawa,E. and Kunkel,L.M. TITLE The three human syntrophin genes are expressed in diverse tissues, have distinct chromosomal locations, and each bind to dystrophin and its relatives JOURNAL J. Biol. Chem. 271 (5), 2724-2730 (1996) MEDLINE 96162017 REFERENCE 2 (bases 1 to 2110) AUTHORS Ahn,A.H. TITLE Direct Submission JOURNAL Submitted (11-NOV-1995) Andrew H. Ahn, Division of Genetics, HHMI Childrens Hosp, 300 Longwood Avenue, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..2110 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="20" /map="20q11.2" gene 38..1555 /gene="SNT A1" CDS 38..1555 /gene="SNT A1" /note="contains two pleckstrin homology domains and a domain related to both the tumor discs-large protein and the zonula occuldens protein; dystrophin-binding intracellular membrane-associated muscle protein" /codon_start=1 /product="alpha1-syntrophin" /db_xref="PID:g1145728" /translation="MASGRRAPRTGLLELRAGAGSGAGGERWQRVLLSLAEDVLTVSP ADGDPGPEPGAPREQEPAQLNGAAEPGAGPPQLPEALLLQRRRVTVRKADAGGLGISI KGGRENKMPILISKIFKGLAADQTEALFVGDAILSVNGEDLSSATHDEAVQVLKKTGK EVVLEVKYMKDVSPYFKNSTGGTSVGWDSPPASPLQRQPSSPGPTPRNFSEAKHMSLK MAYVSKRCTPNDPEPRYLEICSADGQDTLFLRAKDEASARSWATAIQAQVNTLTPRVK DELQALLAATSTAGSQDIKQIGWLTEQLPSGGTAPTLALLTEKELLLYLSLPETREAL SRPARTAPLIATRLVHSGPSKGSVPYDAELSFALRTGTRHGVDTHLFSVESPQELAAW TRQLVDGCHRAAEGVQEVSTACTWNGRPCSLSVHIDKGFTLWAAEPGAARAVLLRQPF EKLQMSSDDGASLLFLDFGGAEGEIQLDLHSCPKTIVFIIHSFLSAKVTRLGLLA" BASE COUNT 395 a 669 c 664 g 382 t ORIGIN 1 agggcgagcg cggcggcccg ggggctcgga ggcgaagatg gcgtccggca ggcgcgcccc 61 gcgcaccggg ctgctggagc tgcgcgccgg ggcgggctcg ggggccggcg gcgagcgatg 121 gcagcgggtg ctgctgagtc tggcggagga cgtgctgacc gtgagccccg ccgacggcga 181 ccctggtccc gagcccggcg ctccgcggga gcaggagccc gcgcagctca acggcgccgc 241 ggagccgggc gccgggcccc cgcagctgcc agaggcgcta ctgctccagc ggcgccgcgt 301 gacggtgcgc aaggccgacg ccggtgggct gggcatcagc atcaaaggcg gccgggagaa 361 caagatgcct attctcattt ccaagatctt caagggattg gcagctgacc agacagaggc 421 cctttttgtg ggggatgcca tcctgtctgt gaatggggaa gacttgtcct ctgctaccca 481 tgatgaggcg gtgcaggtcc tcaagaagac aggcaaggag gtggtgctgg aggtcaagta 541 tatgaaggac gtctcaccgt atttcaagaa ctctactggt gggacctcgg tcggctggga 601 ctcacctcct gcctcacccc ttcagcggca gccttcctcc cctggcccca caccccggaa 661 cttcagcgag gccaaacaca tgtccttgaa gatggcatat gtctcgaaga ggtgcacccc 721 caatgacccg gagcccaggt atctggagat ctgctcggca gatggtcaag acaccctctt 781 cctgagggcc aaggatgagg ctagtgcgag gtcgtgggcg actgccatcc aagcccaggt 841 caatactctg acgccgcggg tcaaggatga gctgcaggca ctgttggcag ccaccagcac 901 agctgggagc caggacatca agcagattgg ctggctaact gagcagctgc ccagtggggg 961 cacagccccc accctggccc tgctaactga aaaggaactg ctcctctact tgtctctccc 1021 cgagacccgc gaggccctga gccggccagc ccgtactgcc ccactcatcg ccaccagact 1081 ggtgcactca ggcccctcca agggctcagt gccctacgat gcagagctct cttttgccct 1141 gcgcacgggc acgcgtcacg gtgtggacac tcacctgttc agcgtggagt caccgcagga 1201 gctggctgcc tggacccgcc agcttgtgga tggctgtcac cgggccgccg agggtgtgca 1261 ggaggtgtct acagcctgca cgtggaatgg gcgtccctgc agcctgtctg tgcacatcga 1321 caagggcttc acactgtggg cggctgagcc aggtgcagcc cgagctgtgc tcctgcgaca 1381 gcccttcgag aagctgcaga tgtcttcaga tgacggtgcc agtctccttt tcctggattt 1441 tggaggtgct gaaggcgaga tccagctgga cctgcactcg tgtcccaaaa ccatagtctt 1501 catcatccac tccttcctgt cggccaaagt cacccgcctc gggctgttgg cctagaagtc 1561 gccggatgca ctagccctga agaggggtgt ccatgacatg gcctgagctg ggcctccacc 1621 gactgcctgc tcacccctgg gctgagggaa gggagaggag aggaacaagg gcctccgaaa 1681 ccccaaccct gagggagact ggattggtct tggggcccag gacccagacg caggacagag 1741 tggactctgc ctgtgatggg gtggccttcc tgctgccccc ctccaccagt gccttttgca 1801 gagagatatt ttgtgtacac agaagccatt ccgagtctgg gacctgcccc tgtgcggatc 1861 ctgaccccag ccaacagctg agctgccggg cctcctcgag gcccctaagc cacccccaga 1921 ggtcccatct gaagctggag taccctgggg tcagcagcaa gagaaagaag aggagatttt 1981 ctgtttgttt ttcccctcag ccctgccacc gtggggagtc tggtttttct cttcatcctg 2041 tctctctcct ccttactctt ggataaataa acagcctgtg agcacacagg cagcccggcc 2101 cagtaaaaaa // LOCUS HSU40572 1700 bp mRNA PRI 25-APR-1996 DEFINITION Human beta2-syntrophin (SNT B2) mRNA, complete cds. ACCESSION U40572 NID g1145729 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1700) AUTHORS Ahn,A.H., Freener,C.A., Gussoni,E., Yoshida,M., Ozawa,E. and Kunkel,L.M. TITLE The three human syntrophin genes are expressed in diverse tissues, have distinct chromosomal locations, and each bind to dystrophin and its relatives JOURNAL J. Biol. Chem. 271 (5), 2724-2730 (1996) MEDLINE 96162017 REFERENCE 2 (bases 1 to 1700) AUTHORS Ahn,A.H. TITLE Direct Submission JOURNAL Submitted (11-NOV-1995) Andrew H. Ahn, Division of Genetics, HHMI Childrens Hosp, 300 Longwood Avenue, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..1700 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" /map="16q23-24" gene 21..1643 /gene="SNT B2" CDS 21..1643 /gene="SNT B2" /note="contains two pleckstrin homology domains and a domain related to both the tumor discs-large protein and the zonula occludens protein; dystrophin-binding intracellular membrane cytoskeletal protein" /codon_start=1 /product="beta2-syntrophin" /db_xref="PID:g1145730" /translation="MRVAAATAAAGAGPAMAVWTRATKAGLVELLLRERWVRVVAELS GESLSLTGDAAAAELEPALGPAAAAFNGLPNGGGAGDSLPGSPSRGLGPPSPPAPPRG PAGEAGASPPVRRVRVVKQEAGGLGISIKGGRENRMPILISKIFPGLAADQSRALRLG DAILSVNGTDLRQATHDQAVQALKRAGKEVLLEVKFIREVTPYIKKPSLVSDLPWEGA APQSPSFSGSEDSGSPKHQNSTKDRKIIPLKMCFAARNLSMPDLENRLIELHSPDSRN TLILRCKDTATAHSWFVAIHTNIMALLPQVLAELNAMLGATSTAGGSKEVKHIAWLAE QAKLDGGRQQWRPVLMAVTEKDLLLYDCMPWTRDAWASPCHSYPLVATRLVHSGSGCR SPSLGSDLTFATRTGSRQGIEMHLFRVETHRDLSSWTRILVQGCHAAAELIKEVSLGC MLNGQEVRLTIHYENGFTISRENGGSSSILYRYPFERLKMSADDGIRNLYLDFGGPEG ELTMDLHSCPKPIVFVLHTFLSAKVTRMGLLV" BASE COUNT 375 a 479 c 523 g 323 t ORIGIN 1 gccgaggctg cctgactgga atgagggtag ctgcggcgac tgcggcggct ggagcggggc 61 cggccatggc ggtgtggacg cgggccacca aagcggggct ggtggagctg ctcctgaggg 121 agcgctgggt ccgagtggtg gccgagctga gcggggagag cctgagcctg acgggcgacg 181 ccgccgcggc cgagctggag cccgctctgg gacccgcggc cgccgccttc aacggcctcc 241 caaacggcgg cggcgcgggc gactcgctgc ccgggagccc aagccgcggc ctggggcccc 301 cgagcccgcc ggcgccgcct cggggccccg cgggtgaggc gggcgcgtcg ccgcccgtgc 361 gccgggtgcg ggtggtgaag caagaggcgg gcggcctggg catcagcatc aagggcggcc 421 gcgagaaccg gatgccgatc ctcatctcca agatcttccc cgggctggct gccgaccaga 481 gccgggcgct gcggctgggc gacgccatcc tgtcggtgaa cggcaccgac ctgcgccagg 541 ccacccacga ccaggccgtg caggcgctga agcgcgcggg caaggaggtg ctgctggagg 601 tcaagttcat ccgagaagta acaccatata tcaagaagcc atcattagta tcagatctgc 661 cgtgggaagg tgcagccccc cagtcaccaa gctttagtgg cagtgaggac tctggttcgc 721 caaaacacca gaacagcacc aaggacagga agatcatccc tctcaaaatg tgctttgctg 781 ctagaaacct aagcatgccg gatctggaaa acagattgat agagctacat tctcctgata 841 gcaggaacac gttgatccta cgctgcaaag atacagccac agcacactcc tggttcgtag 901 ctatccacac caacataatg gctctcctcc cacaggtgtt ggctgaactc aacgccatgc 961 ttggggcaac cagtacagca ggaggcagta aagaggtgaa gcatattgcc tggctggcag 1021 aacaggcaaa actagatggt ggaagacagc aatggagacc tgtcctcatg gctgtgactg 1081 agaaggattt gctgctctat gactgtatgc cgtggacaag agatgcctgg gcgtcaccat 1141 gccacagcta cccacttgtt gccaccaggt tggttcattc tggctccgga tgtcgatccc 1201 cctcccttgg atctgacctt acatttgcta ccaggacagg ctctcgacag ggcattgaga 1261 tgcatctctt cagggtggag acacatcggg atctgtcatc ctggaccagg atacttgttc 1321 agggttgcca tgctgctgct gagctgatca aggaagtctc tctaggctgc atgttaaatg 1381 gccaagaggt gaggcttact attcactatg aaaatgggtt caccatctca agggaaaatg 1441 gaggctccag cagcatattg taccgctacc cctttgaaag gctgaagatg tctgctgatg 1501 atggcatccg aaatctatac ttggattttg gtggtcccga gggagaactg accatggacc 1561 tgcactcttg tccgaagccg attgtatttg tgttgcacac gtttttatcg gccaaagtca 1621 ctcgtatggg actgcttgta tgagcaacaa aaaatcagaa aagagccttg actgtcacaa 1681 gaaatatttc cacctccaaa // LOCUS HSU40579 1320 bp mRNA PRI 05-APR-1996 DEFINITION Human deoxyhypusine synthase mRNA, complete cds. ACCESSION U40579 NID g1113108 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1320) AUTHORS Bevec,D., Kappel,B., Jaksche,H., Csonga,R., Hauber,J., Klier,H. and Steinkasserer,A. TITLE Molecular characterization of a cDNA encoding functional human deoxyhypusine synthase and chromosomal mapping of the corresponding gene locus JOURNAL FEBS Lett. 378 (2), 195-198 (1996) MEDLINE 96140738 REFERENCE 2 (bases 1 to 1320) AUTHORS Werner,F. TITLE Direct Submission JOURNAL Submitted (13-NOV-1995) Fred-Jochen Werner, Sandoz Research Institute GmbH, IT, Brunner Strasse 59, Vienna, A-1235, Austria FEATURES Location/Qualifiers source 1..1320 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pdSYN-1" /cell_type="PBMC" /tissue_type="blood" /chromosome="19" CDS 84..1193 /codon_start=1 /evidence=experimental /product="deoxyhypusine synthase" /db_xref="PID:g1113109" /translation="MEGSLEREAPRGALAAVLKHSSTLPPESTQVRGYDFNRGVNYRA LLEAFGTTGFQATNFGRAVQQVNAMIEKKLEPLSQDEDQHADLTQSRRPLTSCTIFLG YTSNLISSGIRETIRYLVQHNMVDVLVTTAGGVEEDLIKCLAPTYLGEFSLRGKELRE NGINRIGNLLVPNENYCKFEDWLMPILDQMVMEQNTEGVKWTPSKMIARLGKEINNPE SVYYWAQKNHIPVFSPALTDGSLGDMIFFHSYKNPGLVLDIVEDLRLINTQAIFAKCT GMIILGGGVVKHHIANANLMRNGADYAVYINTAQEFDGSDSGARPDEAVSWGKIRVDA QPVKVYADASLVFPLLVAETFAQKMDAFMHEKNED" BASE COUNT 305 a 378 c 374 g 263 t ORIGIN 1 ggcgcgtcgg ggtttaacgc gtttctgggc cgccgtaagc ccggcctagg ggcagctttg 61 actcgagagc cggctatagg cgcatggaag gttccctgga acgggaggcg ccacgggggg 121 cgctggccgc cgtgctaaag cacagctcga ccttgccgcc cgaaagcacc caggtccggg 181 gctacgactt caaccgcggt gtgaattacc gcgcactgct ggaggccttc ggcaccaccg 241 gcttccaagc aaccaacttc gggcgcgctg tacagcaagt caatgccatg atcgagaaga 301 agctggaacc actgtcacag gatgaagacc agcacgcgga cctgacccag agccgccgcc 361 cacttaccag ctgcaccatt ttcctgggat atacatccaa cctcatcagt tcaggcatcc 421 gtgagaccat tcgctacctt gtgcagcaca acatggtgga cgtattggtg accacagctg 481 gcggcgtgga ggaagacctc atcaagtgcc tggcgcccac atacttgggc gagtttagcc 541 tcagggggaa ggagctccgg gagaacggga tcaataggat cggaaacctg ctggtgccca 601 atgagaatta ctgcaagttt gaggactggc tgatgcccat tctggatcag atggtgatgg 661 agcagaacac agagggtgta aagtggacgc cttctaagat gatcgcccgg ctgggcaagg 721 agatcaacaa cccagagtcc gtgtattact gggcccagaa gaaccacatc cctgtgttta 781 gtcccgcact tacagacggc tcgctgggcg acatgatctt cttccattcc tacaagaacc 841 cgggcctggt cctggacatc gttgaggacc tgaggctcat caacacacag gccatctttg 901 ccaagtgcac tgggatgatc attctgggcg gcggcgtggt gaagcaccac attgccaatg 961 ccaacctcat gcggaacggg gccgactacg ctgtttacat caacacagcc caggagtttg 1021 atggctctga ctcaggtgcc cgaccagacg aggctgtctc ctggggcaag atccgggtgg 1081 atgcacagcc cgtcaaggtc tatgctgacg cctccctggt cttccccctg cttgtggctg 1141 aaacctttgc ccagaagatg gatgccttca tgcatgagaa gaacgaggac tgagcggctg 1201 cgtcccagga aggtcttacc ccctcttcta tttattaatt tgcagaccca gcccctcccc 1261 tactttttgg tcagctacgt ctctagaata agatggtatc tgaagtcaaa aaaaaaaaaa // LOCUS HSU40583 1977 bp mRNA PRI 19-DEC-1995 DEFINITION Human alpha 7 neuronal nicotinic acetylcholine receptor mRNA, complete cds. ACCESSION U40583 NID g1125076 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1977) AUTHORS Logel,J., Drebing,C., Barnhart,M., Antle,C. and Leonard,S. TITLE Nucleotide Sequence and Transcript Size of the Alpha-7 Neuronal Nicotinic Acetylcholine Receptor in Human Postmortem Brain JOURNAL Unpublished REFERENCE 2 (bases 1 to 1977) AUTHORS Leonard,S. TITLE Direct Submission JOURNAL Submitted (13-NOV-1995) Sherry Leonard, University of Colorado Health Sciences Center, C-268-71 Pharmacology, 4200 E. Ninth Ave, Denver, CO 80262, USA FEATURES Location/Qualifiers source 1..1977 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pBShalpha7" /clone_lib="Clontech lambda gt10 cDNA library: human hippocampus" /sex="male" /tissue_type="hippocampus" /dev_stage="20 yr old adult" 5'UTR 1..7 sig_peptide 8..73 CDS 8..1516 /codon_start=1 /product="alpha 7 neuronal nicotinic acetylcholine receptor" /db_xref="PID:g1125077" /translation="MRCSPGGVWLALAASLLHVSLQGEFQRKLYKELVKNYNPLERPV ANDSQPLTVYFSLNLLQIMDVDEKNQVLTTNIWLQMSWTDHYLQWNVSEYPGVKTVRF PDGQIWKPDILLYNSADERFDATFHTNVLVNPSGHCQYLPPGIFKSSCYIDVRWFPFD VQHCKLKFGSWSYGGWSLDLQMQEADISGYIPNGEWDLVGIPGKRSERFYECCKEPYP DVTFTVTMRRRTLYYGLNLLIPCVLISALALLVFLLPADSGEKISLGITVLLSLTVFM LLVAEIMPATSDSVPLIAQYFASTMIIVGLSVVVTVIVLQYHHHDPDGGKMPKWTRVI LLNWCAWFLRMKRPGEDKVRPACQHKQRRCSLASVEMSAVAPPPASNGNLLYIGFRGL DGVHCVPTPDSGVVCGRMACSPTHDEHLLHGGQPPEGDPDLAKILEEVRYIANRFRCQ DESEAVCSEWKFAACVVDRLCLMAFSVFTIICTIGILMSAPNFVEAVSKDFA" 3'UTR 1514..1977 polyA_site 1954 BASE COUNT 426 a 567 c 524 g 460 t ORIGIN 1 actcaacatg cgctgctcgc cgggaggcgt ctggctggcg ctggccgcgt cgctcctgca 61 cgtgtccctg caaggcgagt tccagaggaa gctttacaag gagctggtca agaactacaa 121 tcccttggag aggcccgtgg ccaatgactc gcaaccactc accgtctact tctccctgaa 181 cctcctgcag atcatggacg tggatgagaa gaaccaagtt ttaaccacca acatttggct 241 gcaaatgtct tggacagatc actatttaca gtggaatgtg tcagaatatc caggggtgaa 301 gactgttcgt ttcccagatg gccagatttg gaaaccagac attcttctct ataacagtgc 361 tgatgagcgc tttgacgcca cattccacac taacgtgttg gtgaatcctt ctgggcattg 421 ccagtacctg cctccaggca tattcaagag ttcctgctac atcgatgtac gctggtttcc 481 ctttgatgtg cagcactgca aactgaagtt tgggtcctgg tcttacggag gctggtcctt 541 ggatctgcag atgcaggagg cagatatcag tggctatatc cccaatggag aatgggacct 601 agtgggaatc cccggcaaga ggagtgaaag gttctatgag tgctgcaaag agccctaccc 661 tgatgtcacc ttcacagtga ccatgcgccg caggacactc tactatggcc tcaacctgct 721 gatcccctgt gtgctcatct ccgccctcgc cctgctggtg ttcctgcttc ctgcagattc 781 cggggagaag atttccctgg ggataacagt cttactctct cttaccgtct tcatgctgct 841 cgtggctgag atcatgcccg caacatccga ttcggtacca ttgatagccc agtacttcgc 901 cagcaccatg atcatcgtgg gcctctcggt ggtggtgacg gtgatcgtgc tgcagtacca 961 ccaccacgac cccgacgggg gcaagatgcc caagtggacc agagtcatcc ttctgaactg 1021 gtgcgcgtgg ttcctgcgaa tgaagaggcc cggggaggac aaggtgcgcc cggcctgcca 1081 gcacaagcag cggcgctgca gcctggccag tgtggagatg agcgccgtgg cgccgccgcc 1141 cgccagcaac gggaacctgc tgtacatcgg cttccgcggc ctggacggcg tgcactgtgt 1201 cccgaccccc gactctgggg tagtgtgtgg ccgcatggcc tgctccccca cgcacgatga 1261 gcacctcctg cacggtgggc aaccccccga gggggacccg gacttggcca agatcctgga 1321 ggaggtccgc tacattgcca accgcttccg ctgccaggac gaaagcgagg cggtctgcag 1381 cgagtggaag ttcgccgcct gtgtggtgga ccgcctgtgc ctcatggcct tctcggtctt 1441 caccatcatc tgcaccatcg gcatcctgat gtcggctccc aacttcgtgg aggccgtgtc 1501 caaagacttt gcgtaaccac gcctggttct gtacatgtgg aaaactcaca gatgggcaag 1561 gcctttggct tggcgagatt tgggggtgct aatccaggac agcattacac gccacaactc 1621 cagtgttccc ttctggctgt cagtcgtgtt gcttacggtt tctttgttac tttaggtagt 1681 agaatctcag cactttgttt catattctca gatgggctga tagatatcct tggcacatcc 1741 gtaccatcgg tcagcagggc cactgagtag tcattttgcc cattagccca ctgcctggaa 1801 agcccttcgg agagctcccc atggctcctc accaccgaga cagttggttt tgcatgtctg 1861 catgaaggtc tacctgaaaa ttcaacattt gctttttgct tgtgtacaaa cccagattga 1921 agctaaaata aaccagactc actaaatcct ttccaaaaaa aaaaaaaaaa aaaaaaa // LOCUS HSU40622 1582 bp mRNA PRI 12-JAN-1996 DEFINITION Human XRCC4 mRNA, complete cds. ACCESSION U40622 NID g1151114 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1582) AUTHORS Li,Z., Otevrel,T., Gao,Y., Cheng,H.L., Seed,B., Stamato,T.D., Taccioli,G.E. and Alt,F.W. TITLE The XRCC4 gene encodes a novel protein involved in DNA double-strand break repair and V(D)J recombination JOURNAL Cell 83 (7), 1079-1089 (1995) MEDLINE 96128113 REFERENCE 2 (bases 1 to 1582) AUTHORS Li,Z. and Alt,F.W. TITLE Direct Submission JOURNAL Submitted (13-NOV-1995) Frederick W. Alt, Genetics/Center for Blood Research, Harvard Medical School, Warren Alpert Bldg, 200 Longwood Avenue, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..1582 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="5" /map="5q13" CDS 51..1055 /codon_start=1 /function="DNA double-strand break repair and V(D)J recombination" /product="XRCC4" /db_xref="PID:g1151115" /translation="MERKISRIHLVSEPSITHFLQVSWEKTLESGFVITLTDGHSAWT GTVSESEISQEADDMAMEKGKYVGELRKALLSGAGPADVYTFNFSKESCYFFFEKNLK DVSFRLGSFNLEKVENPAEVIRELICYCLDTIAENQAKNEHLQKENERLLRDWNDVQG RFEKCVSAKEALETDLYKRFILVLNEKKTKIRSLHNKLLNAAQEREKDIKQEGETAIC SEMTADRDPVYDESTDEESENQTDLSGLASAAVSKDDSIISSLDVTDIAPSRKRRQRM QRNLGTEPKMAPQENQLQEKEKPDSSLPETSKKEHISAENMSLETLRNSSPEDLFDEI " BASE COUNT 566 a 249 c 311 g 456 t ORIGIN 1 ccggaagtgg ggctgcctct ttaaataaca aaaatctgag gtattaagaa atggagagaa 61 aaataagcag aatccacctt gtttctgaac ccagtataac tcattttcta caagtatctt 121 gggagaaaac actggaatct ggttttgtta ttacacttac tgatggtcat tcagcatgga 181 ctgggacagt ttctgaatca gagatttccc aagaagctga tgacatggca atggaaaaag 241 ggaaatatgt tggtgaactg agaaaagcat tgttgtcagg agcaggacca gctgatgtat 301 acacgtttaa tttttctaaa gagtcttgtt atttcttctt tgagaaaaac ctgaaagatg 361 tctcattcag acttggttcc ttcaacctag agaaagttga aaacccagct gaagtcatta 421 gagaacttat ttgttattgc ttggacacca ttgcagaaaa tcaagccaaa aatgagcacc 481 tgcagaaaga aaatgaaagg cttctgagag attggaatga tgttcaagga cgatttgaaa 541 aatgtgtgag tgctaaggaa gctttggaga ctgatcttta taagcggttt attctggtgt 601 tgaatgagaa gaaaacaaaa atcagaagtt tgcataataa attattaaat gcagctcaag 661 aacgagaaaa ggacatcaaa caagaagggg aaactgcaat ctgttctgaa atgactgctg 721 accgagatcc agtctatgat gagagtactg atgaggaaag tgaaaaccaa actgatctct 781 ctgggttggc ttcagctgct gtaagtaaag atgattccat tatttcaagt cttgatgtca 841 ctgatattgc accaagtaga aaaaggagac agcgaatgca aagaaatctt gggacagaac 901 ctaaaatggc tcctcaggag aatcagcttc aagaaaagga aaagcctgat tcttcactac 961 ctgagacgtc gaaaaaggag cacatctcag ctgaaaacat gtctttagaa actctgagaa 1021 acagcagccc agaagacctc tttgatgaga tttaacagtc tcaaaaaata ctttgatgtt 1081 cactagacta tgttttctat tcatttcttt aaaatgaaaa aggagaattt caagtcagca 1141 gccgctatta ccgtatctta caatttaatt acatacacag tgaattgaaa ccattgtgca 1201 aaatggatta cacatgtata caaagatacg atttgatgat gacactggca cattgagttc 1261 taaactattc attcagcatg cctataatta cataaattgt atgagacttt ttgttgcaaa 1321 ggacacattt atcatattca ttcacacata ttatatgtga tagctgtcca acatcctgtc 1381 tgggaagatt ttgaaaacag gacaaagaaa acatcatttt aaaatgtctt cagctttttt 1441 tgaatagacg tattcaaaca tattctgaac attgatgttt gaacatttta atttgtgtga 1501 tgatgtagaa aatataattt tagtttgtac ataaacattg tgaaaatctg ataataaaat 1561 ttttgataca ttgaaaaaaa aa // LOCUS HSU40623 3324 bp mRNA PRI 28-MAR-1996 DEFINITION Human subtilisin-like protease PC8 precursor (PC8) mRNA, complete cds. ACCESSION U40623 NID g1236802 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3324) AUTHORS Bruzzaniti,A., Goodge,K., Jay,P., Taviaux,S.A., Lam,M.H., Berta,P., Martin,T.J., Moseley,J.M. and Gillespie,M.T. TITLE PC8 [corrected], a new member of the convertase family JOURNAL Biochem. J. 314 (Pt 3), 727-731 (1996) MEDLINE 96177840 REMARK Erratum:[[published erratum appears in Biochem J 1996 Jun 15;316(Pt 3):1007]] REFERENCE 2 (bases 1 to 3324) AUTHORS Gillespie,M.T., Bruzzaniti,A., Goodge,K., Jay,P., Taviaux,S.A., Lam,M.H.C., Berta,P., Martin,T.J. and Moseley,J.M. TITLE Direct Submission JOURNAL Submitted (14-NOV-1995) Matthew T. Gillespie, St Vincent's Institute of Medical Research, 41 Victoria Parade, Fitzroy, Victoria 3065, Australia FEATURES Location/Qualifiers source 1..3324 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11q23-q24" /cell_line="BEN cell line, Moseley, J.M. et al. (1987), Proc. Natl. Acad. Sci. USA 84: 5048-5052" sig_peptide 22..444 /gene="PC8" CDS 22..2379 /gene="PC8" /note="Prohormone converting enzyme, subtilisin-like protease, convertase" /codon_start=1 /product="PC8 precursor" /db_xref="PID:g1236803" /translation="MPKGRQKVPHLDAPLGLPTCLWLELAGLFLLVPWVMGLAGTGGP DGQGTGGASWAVHLESLEGDGEEETLEQQADALAQAAGLVNAGRIGELQGHYLFVQPA GHRPALEVEPIRQQVEAVLAGHEAVRWHSEQRLLRRAKRSVHFNDPKYPQQWHLNNRR SPGRDINVTGVWERNVTGRGVTVVVVDDGVEHTIQDIAPNYSPEGSYDLNSNDPDPMP HPDVENGNHHGTRCAGEIAAVPNNSFCAVGVAYGSRIAGIRVLDGPLTDSMEAVAFNK HYQINDIYSCSWGPDDDGKTVDGPHQLGKAALQHGVIAGRQGFGSIFVVASGNGGQHN DNCNYDGYANSIYTVTIGAVDEEGRMPFYAEECASMLAVTFSGGDKMLRSIVTTDWDL QKGTGCTEGHTGTSAAAPLAAGMIALMLQVRPCLTWRDVQHIIVFTATRYEDRRAEWV TNEAGFSHSHQHGFGLLNAWRLVNAAKIWTSVPYLASYVSPVLKENKAIPQSPRSLEV LWNVSRMDLEMSGLKTLEHVAVTVSITHPRRGSLELKLFCPSGMMSLIGAPRSMDSDP NGFNDWTFSTVRCWGERARGTYRLVIRDVGDESFQVGILRQWQLTLYGSVWSAVDIRD RQRLLESAMSGKYLHDDFALPCPPGLKIPEEDGYTITPNTLKTLVLVGCFTVFWTVYY MLEVYLSQRNVASNQVCRSGPCHWPHRSRKAKEEGTELESVPLCSSKDPDEVETESRG PPTTSDLLAPDLLEQGDWSLSQNKSALDCPHQHLDVPHGKEEQIC" gene 22..2379 /gene="PC8" mat_peptide 445..2376 /gene="PC8" /note="Prohormone converting enzyme, subtilisin-like protease, convertase" /product="PC8" BASE COUNT 755 a 930 c 976 g 663 t ORIGIN 1 agaatccagt ccactgctct gatgccgaag gggaggcaga aagtgccaca cttggatgcc 61 cccctgggcc tgcccacctg cctctggctg gaattagccg ggctcttctt actggttccc 121 tgggtcatgg gcctggcagg gacaggtggg cctgatggcc agggcacagg gggggcgagc 181 tgggctgtgc acctggaaag cctggaaggt gacggggagg aagagactct tgagcagcag 241 gcggatgcct tggcccaggc agcagggctg gtgaatgctg gacgcatcgg agagcttcag 301 gggcactacc tctttgtcca gcctgctggg cacaggccgg ccctggaggt ggagcccatc 361 cgccagcagg tggaggctgt gttggctggg catgaagctg tgcgctggca ctcagagcag 421 aggctgctaa ggcgggccaa gcgcagcgtc cacttcaacg accccaagta cccgcagcaa 481 tggcacctga ataaccgacg gagcccgggc agggacatca acgtgacggg tgtgtgggaa 541 cgcaatgtga ctgggcgagg ggtgacggtg gtggtagtgg atgacggagt ggaacacacc 601 atccaggaca ttgcacccaa ctatagccct gagggtagct atgacctcaa ctctaatgac 661 cctgacccca tgccccaccc ggatgtggag aatggcaacc accatggcac gcgatgtgca 721 ggagagatcg cggctgtgcc caacaacagc ttctgtgccg tgggcgtggc ctacgggagc 781 cgcatcgcag gtatccgggt actggatgga cctctcacag acagcatgga ggcagtggcg 841 ttcaacaagc actatcagat caatgacatc tacagctgca gctggggacc agatgacgat 901 gggaagacag tggatggccc ccatcagctt ggaaaggctg ccttacaaca tggggtgatt 961 gctggtcgcc agggctttgg gagcatcttt gtggtagcca gtggcaacgg aggccaacac 1021 aacgacaact gcaactacga tggctacgcc aactccatct acaccgtcac cataggagct 1081 gtggatgagg agggacgcat gcctttctat gcagaagaat gtgcctccat gctggcagtc 1141 accttcagtg gtggggacaa gatgcttcgg agcattgtga ccactgactg ggaccttcag 1201 aagggcactg gctgcactga gggccacaca gggacctcag ctgcagcgcc tctggcagct 1261 ggcatgatag ccttaatgct gcaggtgcgg ccctgcctca cgtggcgtga cgtccagcac 1321 atcattgtct tcacagccac ccggtatgag gatcgccgtg cagagtgggt caccaacgag 1381 gcaggcttca gccatagcca ccagcacggt ttcggcctcc tcaacgcctg gaggctcgtg 1441 aatgcagcca agatctggac atctgtccct tacttagcat cctacgtcag tcccgtgtta 1501 aaagaaaaca aggcgattcc gcagtccccc cgttccctgg aggtcctgtg gaatgtcagc 1561 aggatggacc tggagatgtc agggctgaag accctggagc atgtggcagt gacagtctcc 1621 atcactcacc cacggcgcgg cagcttggag ctgaagctgt tctgccccag tggcatgatg 1681 tccctcatcg gcgccccccg cagcatggac tcggatccca acggcttcaa tgactggacc 1741 ttctccactg tgcgatgctg gggggagaga gcccgaggga cctacaggct tgtcatcagg 1801 gatgtcgggg atgagtcatt ccaggtcggc atcctccggc aatggcagct gaccctatat 1861 ggctctgtgt ggagtgcagt agacatcagg gacagacaaa ggctgttaga gagtgccatg 1921 agtggaaaat acctgcacga tgacttcgcc ctgccctgcc caccggggct gaaaattcct 1981 gaggaagatg gttacaccat cacccccaac accctcaaga ccctggtgct ggtaggctgt 2041 ttcaccgtct tctggactgt ttactacatg ctggaagtat atttgagcca gaggaatgtg 2101 gcttccaatc aagtttgtag gagtggaccc tgccactggc cccatcggag ccggaaagcc 2161 aaggaggaag ggacagagct agaatcagtg ccactttgca gcagcaagga tccagacgaa 2221 gtggaaacag agagcagggg ccctcccacc acctctgacc tccttgcccc agacctgctg 2281 gagcaagggg actggagcct gtcccagaac aagagcgccc tggactgccc tcatcagcac 2341 ctagacgtac cgcacgggaa ggaggagcag atctgctgac ctcagggcct gacagtgtgg 2401 gacaggctct tctttcccaa aattagggag ctcttgacag aaagcagttc tgatgcttac 2461 atctggaatc tgaggcatcc tctgactcca ctcaaagagg gtgagggcct tcttaagata 2521 caaatggtgg aggattgctg ccagagaagt ctggtcagag ccacagggtc tgcctccagc 2581 caaacgggag cttttggtga gaaggtgttg gacaggggat tggcgccccc ctttggtttg 2641 gcctccatcc tcatctctct tgggccaagc cagctgccta ggtcccccaa gcatggggga 2701 ccccttccca catataagtt gagaaggtgc ctgccatagc caggagcgca tctcaatgga 2761 aacatcactg gggtcacttg ggaagaggac ttcggggtag aggctgggag gagcccctgg 2821 acatgcctgt cctgaaagcg gctgcctcca ttatccattc ccaagatgcc tgatcagaaa 2881 ccaaccatga atgaacccct ggctccttca ccacccccac gattggtatg atgctgccgg 2941 cacagctggg atacacacgg ctcccccagg cctgagctgc ttcactaggg aatcctgcgg 3001 caggactgca gagcagatgg cagatgcaca tgttggagga gagagccttg ggagccactg 3061 ccactccagt cctgccacca ccctgtcttc ctctgcaagt gctcagggaa atggccttcc 3121 cgccggaggc cagctatctg cctgacaggc tgtgactctt ctctcaacct tggccttctc 3181 ccctcttctg agctagttgg ttgaattttt tttaatgctt aagatttgtt tttctctttt 3241 cacagcaaca ttttcttgaa tttttttctg cacagctttt ccaaaataaa aaccttccaa 3301 acaaaaaaaa aaaaaaaaaa aaaa // LOCUS HSU40714 1167 bp mRNA PRI 30-DEC-1996 DEFINITION Human tyrosyl-tRNA synthetase mRNA, complete cds. ACCESSION U40714 NID g1184698 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1167) AUTHORS Ribas de Pouplana,L., Frugier,M., Quinn,C.L. and Schimmel,P. TITLE Evidence that two present-day components needed for the genetic code appeared after nucleated cells separated from eubacteria JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (1), 166-170 (1996) MEDLINE 96133898 REFERENCE 2 (bases 1 to 1167) AUTHORS Quinn,C.L. TITLE Direct Submission JOURNAL Submitted (15-NOV-1995) Cheryl L. Quinn, Cubist Pharmaceuticals, Inc., 24 Emily Street, Cambridge, MA 02139, USA FEATURES Location/Qualifiers source 1..1167 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1..1167 /codon_start=1 /product="tyrosyl-tRNA synthetase" /db_xref="PID:g1184699" /translation="MGDAPSPEEKLHLITRNLQEVLGEEKLKEILKERELKIYWGTAT TGKPHVAYFVPMSKIADFLKAGCEVTILFADLHAYLDNMKAPWELLELRVSYYENVIK AMLESIGVPLEKLKFIKGTDYQLSKEYTLDVYRLSSVVTQHDSKKAGAEVVKQVEHPL LSGLLYPGLQALDEEYLKVDAQFGGIDQRKIFTFAEKYLPALGYSKRVHLMNPMVPGL TGSKMSSSEEESKIDLLDRKEDVKKKLKKAFCEPGNVENNGVLSFIKHVLFPLKSEFV ILRDEKWGGNKTYTAYVDLEKDFAAEVVHPGDLKNSVEVALNKLLDPIREKFNTPALK KLASAAYPDPSKQKPMAKGLPRIQNQRRSSHPGWISVWGKSSLWRSTQMQTACM" BASE COUNT 340 a 263 c 308 g 256 t ORIGIN 1 atgggggacg ctcccagccc tgaagagaaa ctgcacctta tcacccggaa cctgcaggag 61 gttctggggg aagagaagct gaaggagata ctgaaggagc gggaacttaa aatttactgg 121 ggaacggcaa ccacgggcaa accacatgtg gcttactttg tgcccatgtc aaagattgca 181 gacttcttaa aggcagggtg tgaggtaaca attctgtttg cggacctcca cgcatacctg 241 gataacatga aagccccatg ggaacttcta gaactccgag tcagttacta tgagaatgtg 301 atcaaagcaa tgctggagag cattggtgtg cccttggaga agctcaagtt catcaaaggc 361 actgattacc agctcagcaa agagtacaca ctagatgtgt acagactctc ctccgtggtc 421 acacagcacg attccaagaa ggctggagct gaggtggtaa agcaggtgga gcaccctttg 481 ctgagtggcc tcttataccc cggactgcag gctttggatg aagagtattt aaaagtagat 541 gcccaatttg gaggcattga tcagagaaag attttcacct ttgcagagaa gtacctccct 601 gcacttggct attcaaaacg ggtccatctg atgaatccta tggttccagg attaacaggc 661 agcaaaatga gctcttcaga agaggagtcc aagattgatc tccttgatcg gaaggaggat 721 gtgaagaaaa aactgaagaa ggccttctgt gagccaggaa atgtggagaa caatggggtt 781 ctgtccttca tcaagcatgt cctttttccc cttaagtccg agtttgtgat cctacgagat 841 gagaaatggg gtggaaacaa aacctacaca gcttacgtgg acctggaaaa ggactttgct 901 gctgaggttg tacatcctgg agacctgaag aattctgttg aagtcgcact gaacaagttg 961 ctggatccaa tccgggaaaa gtttaatacc cctgccctga aaaaactggc cagcgctgcc 1021 tacccagatc cctcaaagca gaagccaatg gccaaaggcc tgccaagaat tcagaaccag 1081 aggaggtcat cccatcccgg ctggatatcc gtgtggggaa aatcatcact gtggagaagc 1141 acccagatgc agacagcctg tatgtag // LOCUS HSU40847 4789 bp mRNA PRI 19-DEC-1996 DEFINITION Human Treacher Collins syndrome (TCOF1) mRNA, complete cds. ACCESSION U40847 NID g1736916 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4789) AUTHORS Dixon,J., Edwards,S.J., Gladwin,A.J., Dixon,M.J., Loftus,S.K., Bonner,C.A., Koprivnikar,K. and Wasmuth,J.J. TITLE Positional cloning of a gene responsible for the pathogenesis of Treacher Collins syndrome JOURNAL Nat. Genet. 12, 130-136 (1996) REFERENCE 2 (bases 1 to 4789) AUTHORS Dixon,J., Edwards,S.J., Anderson,I., Brass,A., Scambler,P.J. and Dixon,M.J. TITLE Identification of the complete coding sequence and genomic organization of the Treacher Collins syndrome gene JOURNAL Unpublished REFERENCE 3 (bases 1 to 4789) AUTHORS Dixon,M.J. TITLE Direct Submission JOURNAL Submitted (17-NOV-1995) School of Biological Sciences and Departments of Dental Medicine and Surgery, University of Manchester, Oxford Road, Manchester M13 9PT, U.K. FEATURES Location/Qualifiers source 1..4789 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="5" /map="5q32-33.1" 5'UTR 1..93 gene 94..4329 /gene="TCOF1" CDS 94..4329 /gene="TCOF1" /note="Treacher Collins syndrome" /codon_start=1 /db_xref="PID:g1736917" /translation="MAEARKRRELLPLIYHHLLRAGYVRAAREVKEQSGQKCFLAQPV TLLDIYTHWQQTSELGRKRKAEEDAALQAKKTRVSDPISTSESSEEEEEAEAETAKAT PRLASTNSSVLGADLPSSMKEKAKAETEKAGKTGNSMPHPATGKTVANLLSGKSPRKS AEPSANTTLVSETEEEGSVPAFGAAAKPGMVSAGQADSSSEDTSSSSDETDVEVKASE KILQVRAASAPAKGTPGKGATPAPPGKAGAVASQTKAGKPEEDSESSSEESSDSEEET PAAKALLQAKASGKTSQVGAASAPAKESPRKGAAPAPPGKTGPAVAKAQAGKREEDSQ SSSEESDSEEEAPAQAKPSGKAPQVRAASAPAKESPRKGAAPAPPRKTGPAAAQVQVG KQEEDSRSSSEESDSDREALAAMNAAQVKPLGKSPQVKPASTMGMGPLGKGAGPVPPG KVGPATPSAQVGKWEEDSESSSEESSDSSDGEVPTAVAPAQEKSLGNILQAKPTSSPA KGPPQKAGPVAVQVKAEKPMDNSESSEESSDSADSEEAPAAMTAAQAKPALKIPQTKA CPKKTNTTASAKVAPVRVGTQPPRKAGTATSPAGSSPAVAGGTQRPAEDSSSSEESDS EEEKTGLAVTVGQAKSVGKGLQVKAASVPVKGSLGQGTAPVLPGKTGPTVTQVKAEKQ EDSESSEEESDSEEAAASPAQVKTSVKKTQAKANPAAARAPSAKGTISAPGKVVTAAA QAKQRSPSKVKPPVRNPQNSTVLARGPASVPSVGKAVATAAQAQTGPEEDSGSSEEES DSEEEAETLAQAKPSGKTHQIRAALAPAKESPRKGAAPTPPGKTGPSAAQAGKQDDSG SSSEESDSDGEAPAAVTSAQVIKPPLIFVDPNRSPAGPAATPAQAQAASTPRKARASE STARSSSSESEDEDVIPATQCLTPGIRTNVVTMPTAHPRIAPKASMAGASSSKESSRI SDGKKQEGPATQVSKKNPASLPLTQAALKVLAQKASEAQPPVARTQPSSGVDSAVGTL PATSPQSTSVQAKGTNKLRKPKLPEVQQATKAPESSDDSEDSSDSSSGSEEDGEGPQG AKSAHTLGPTPSRTETLVEETAAESSEDDVVAPSQSLLSGYMTPGLTPANSQASKATP KLDSSPSVSSTLAAKDDPDGKQEAKPQQAAGMLSPKTGGKEAASGTTPQKSRKPKKGA GNPQASTLALQSNITQCLLGQPWPLNEAQVQASVVKVLTELLEQERKKVVDTTKESSR KGWESRKRKLSGDQPAARTPRSKKKKKLGAGEGGEASVSPEKTSTTSKGKAKRDKASG DVKEKKGKGSLGSQGAKDEPEEELQKGMGTVEGGDQSNPKSKKEKKKSDKRKKDKEKK EKKKKAKKASTKDSESPSQKKKKKKKKTAEQTV" 3'UTR 4330..4789 BASE COUNT 1292 a 1369 c 1465 g 663 t ORIGIN 1 agtggggcgc gcgaggtcta agggcgcgag ggaagtggcg ggcggggact aaggcggggc 61 gtgcaggtag ccggccggcc gggggtcgcg ggtatggccg aggccaggaa gcggcgggag 121 ctacttcccc tgatctacca ccatctgctg cgggctggct atgtgcgtgc ggcgcgggaa 181 gtgaaggagc agagcggcca gaagtgtttc ctggctcagc ccgtaaccct tctggacatc 241 tatacacact ggcaacaaac ctcagagctt ggtcggaagc ggaaggcaga ggaagatgcg 301 gcactgcaag ctaagaaaac ccgtgtgtca gaccccatca gcacctcgga gagctcggaa 361 gaggaggaag aagcagaagc cgaaaccgcc aaagccaccc caagactagc atctaccaac 421 tcctcagtcc tgggggcgga cttgccatca agcatgaaag aaaaagccaa ggcagagaca 481 gagaaagctg gcaagactgg gaattccatg ccacaccctg ccactgggaa gacggtggcc 541 aaccttcttt ctgggaagtc tcccaggaag tcagcagagc cctcagcaaa tactacgttg 601 gtctcagaaa ctgaggagga gggcagcgtc ccggcctttg gagctgctgc caagcctggg 661 atggtgtcag cgggccaggc cgacagctcc agcgaggaca cctccagctc cagtgatgag 721 acagacgtgg aggtaaaggc ctctgaaaaa attctccagg tcagagctgc ctcagcccct 781 gccaagggga cccctgggaa aggggctacc ccagcacccc ctgggaaggc aggggctgta 841 gcctcccaga ccaaggcagg gaagccagag gaggactcag agagcagcag cgaggagtca 901 tctgacagtg aggaggagac gccagctgcc aaggccctgc ttcaggcgaa ggcctcagga 961 aaaacctctc aggtcggagc tgcctcagcc cctgccaagg agtcccccag gaaaggagct 1021 gccccagcgc cccctgggaa gacagggcct gcagttgcca aggcccaggc ggggaagcgg 1081 gaggaggact cgcagagcag cagcgaggaa tcggacagtg aggaggaggc gcctgctcag 1141 gcgaagcctt cagggaaggc cccccaggtc agagccgcct cggcccctgc caaggagtcc 1201 cccaggaaag gggctgcccc agcacctcct aggaaaacag ggcctgcagc cgcccaggtc 1261 caggtgggga agcaggagga ggactcaaga agcagcagcg aggagtcaga cagtgacaga 1321 gaagcactgg cagccatgaa tgcagctcag gtgaagccct tggggaaaag cccccaggtg 1381 aaacctgcct ctaccatggg catggggccc ttggggaaag gcgccggccc agtgccacct 1441 gggaaggtgg ggcctgcaac cccctcagcc caggtgggga agtgggagga ggactcagag 1501 agcagtagtg aggagtcatc agacagcagt gatggagagg tgcccacagc tgtggccccg 1561 gctcaggaaa agtccttggg gaacatcctc caggccaaac ccacctccag tcctgccaag 1621 gggccccctc agaaggcagg gcctgtagcc gtccaggtca aggctgaaaa gcccatggac 1681 aactcggaga gcagcgagga gtcgtcggac agtgcggaca gtgaggaggc accagcagcc 1741 atgactgcag ctcaggcaaa accagctctg aaaattcctc agaccaaggc ctgcccaaag 1801 aaaaccaata ccactgcatc tgccaaggtc gcccctgtgc gagtgggcac ccaacccccc 1861 cggaaagcag gaactgcgac ttctccagca ggctcatccc cagctgtggc tgggggcacc 1921 cagagaccag cagaggattc ttcaagcagt gaggaatcag atagtgagga agagaagaca 1981 ggtcttgcag taaccgtggg acaggcaaag tctgtgggga aaggcctcca ggtgaaagca 2041 gcctcagtgc ctgtcaaggg gtccttgggg caagggactg ctccagtact ccctgggaag 2101 acggggccta cagtcaccca ggtgaaagct gaaaagcagg aagactctga gagcagtgag 2161 gaggaatcag acagtgagga agcagctgca tctccagcac aggtgaaaac ctcagtaaag 2221 aaaacccagg ccaaagccaa cccagctgcc gccagagcac cttcagcaaa agggacaatt 2281 tcagcccctg gaaaagttgt cactgcagct gctcaagcca agcagaggtc tccatccaag 2341 gtgaagccac cagtgagaaa cccccagaac agtaccgtct tggcgagggg cccagcatct 2401 gtgccatctg tggggaaggc cgtggctaca gcagctcagg cccagacagg gccagaggag 2461 gactcaggga gcagtgagga ggagtcagac agtgaggagg aggcggagac gctggctcag 2521 gcgaagcctt cagggaagac ccaccagatc agagctgcct tggctcctgc caaggagtcc 2581 cccaggaaag gggctgcccc aacacctcct gggaagacag ggccttcggc tgcccaggca 2641 gggaagcagg atgactcagg gagcagcagc gaggaatcag acagtgatgg ggaggcaccg 2701 gcagctgtga cctctgccca ggtgattaaa ccccccctga tttttgtcga ccctaatcgt 2761 agtccagctg gcccagctgc tacacccgca caagcccagg ctgcaagcac cccgaggaag 2821 gcccgagcct cggagagcac agccaggagc tcctcctccg agagcgagga tgaggacgtg 2881 atccccgcta cacaatgctt gactcctggc atcagaacca atgtggtgac catgcccact 2941 gcccacccaa gaatagcccc caaagccagc atggctgggg ccagcagcag caaggagtcc 3001 agtcggatat cagatggcaa gaaacaggag ggaccagcca ctcaggtgtc aaagaagaac 3061 ccagcttccc tcccactgac ccaggctgcc ctgaaggtcc tcgcccagaa agccagtgag 3121 gctcagcctc ctgttgccag gacccagcct tcaagtgggg ttgacagtgc tgtgggaaca 3181 ctccctgcaa caagtcccca gagcacctcc gtccaggcca aagggaccaa caagctcaga 3241 aaacctaagc ttcctgaggt ccagcaggcc accaaagccc ctgagagctc agatgacagt 3301 gaggacagca gcgacagttc ttcagggagt gaggaagatg gtgaagggcc ccagggggcc 3361 aagtcagccc acacgctggg tcccaccccc tccaggacag agaccctggt ggaggagacc 3421 gcagcagagt ccagcgagga tgatgtggtg gcgccatccc agtctctcct ctcaggttat 3481 atgacccctg gactaacccc agccaattcc caggcctcaa aagccactcc caagctagat 3541 tccagcccct cagtttcctc tactctggcc gccaaagatg acccagatgg caagcaggag 3601 gcaaagcccc aacaggcagc aggcatgttg tcccctaaaa caggtggaaa agaggctgct 3661 tcaggcacca cacctcagaa gtcccggaag cccaagaaag gggctgggaa cccccaagcc 3721 tcaaccctgg cgctgcaaag caacatcacc cagtgcctcc tgggccaacc ctggcccctg 3781 aatgaggccc aggtgcaggc ctcagtggtg aaggtcctga ctgagctgct ggaacaggaa 3841 agaaagaagg tggtggacac caccaaggag agcagcagga agggctggga gagccgcaag 3901 cggaagctat cgggagacca gccagctgcc aggaccccca ggagcaagaa gaagaagaag 3961 ctgggggccg gggaaggtgg ggaggcctct gtttccccag aaaagacctc cacgacttcc 4021 aaggggaaag caaagagaga caaagcaagt ggtgatgtca aggagaagaa agggaagggg 4081 tctcttggct cccaaggggc caaggacgag ccagaagagg agcttcagaa ggggatgggg 4141 acggttgaag gtggagatca aagcaaccca aagagcaaga aggagaagaa gaaatccgac 4201 aagagaaaaa aagacaaaga aaaaaaagaa aagaagaaga aagcaaaaaa ggcctcaacc 4261 aaagattctg agtcaccgtc ccagaagaaa aagaagaaaa agaagaagac agcagagcag 4321 actgtatgac gagcaccagc accaggcaca gggatttcct agccgagcag tggccatccc 4381 catgcctctg acctccaccc accatgggtt ggaactaaac tgttaccttc cctcgctcca 4441 cagaagaaga cagccagctt caggggtccc aagccagtga gcctgcgggg aggctggtcc 4501 aaggagaaag tggaccagct cccatgacct caccccacta ggacgcttca tatagatgtg 4561 tacagtatat gtattttttt aagtgacctc ctctccttcc acagacccca ggcctcggga 4621 cttcccacca ccttgctcca cagatccagc taggcctgac ctgtgcctca tcccgtgccg 4681 gctgatcccg aggctttgtc ttcctctcgt cagttctttt ggttgtgttt tttgtttttt 4741 tttaataact aaaagacttg gaggaagggt gaaaaaaaaa aaaaaaaaa // LOCUS HSU40998 1381 bp mRNA PRI 04-FEB-1996 DEFINITION Human retinal protein (HRG4) mRNA, complete cds. ACCESSION U40998 NID g1161375 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1381) AUTHORS Higashide,T., Murakami,A., McLaren,M.J. and Inana,G. TITLE Cloning of the cDNA for a novel photoreceptor protein JOURNAL J. Biol. Chem. 271 (3), 1797-1804 (1996) MEDLINE 96139522 REFERENCE 2 (bases 1 to 1381) AUTHORS Higashide,T. and Inana,G. TITLE Direct Submission JOURNAL Submitted (20-NOV-1995) George Inana, University of Miami School of Medicine, Ophthalmology, 1638 N.W. 10th Avenue, Miami, FL 33136, USA FEATURES Location/Qualifiers source 1..1381 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pBS10-11-7" /tissue_type="retina" /dev_stage="adult" gene 55..777 /gene="HRG4" CDS 55..777 /gene="HRG4" /codon_start=1 /product="retinal protein" /db_xref="PID:g1161376" /translation="MKVKKGGGGAGTATESAPGPSGQSVAPIPQPPAESESGSESEPD AGPGPRPGPLQRKQPIGPEDVLGLQRITGDYLCSPEENIYKIDFVRFKIRDMDSGTVL FEIKKPPVSERLPINRRDLDPNAGRFVRYQFTPAFLRLRQVGATVEFTVGDKPVNNFR MIERHYFRNQLLKSFDFHFGFCIPSSKNTCEHIYDFPPLSEELISEMIRHPYETQSDS FYFVDDRLVMHNKADYSYSGTP" BASE COUNT 297 a 424 c 399 g 261 t ORIGIN 1 cagccggcgc aggcagcggc ggcagcagca ggcgagcctc ggccccgcaa ggccatgaag 61 gtgaagaagg gcggcggtgg ggccgggacg gcgacggagt ccgctccggg gccctcgggc 121 cagagcgtgg cccccatacc acagccgcct gcggaatccg aatctgggtc cgagtcggag 181 ccggacgcag gcccagggcc caggccgggg ccgctgcaga ggaagcagcc gatcgggccg 241 gaggacgtgc tggggctgca gcggatcacc ggtgactacc tctgctcccc tgaggagaat 301 atctacaaga tcgactttgt caggtttaag attcgggaca tggactcagg cactgtcctc 361 tttgaaatca agaagccccc agtctcagaa cggttgccca tcaaccggcg ggacctggac 421 cccaatgctg ggcgctttgt ccgctaccag ttcacgcctg ccttcctccg cctgaggcag 481 gtgggagcca cggtggagtt cacagtggga gacaagcctg tcaacaactt ccgcatgatc 541 gagaggcact acttccgcaa ccagctactc aaaagcttcg acttccactt tggcttctgc 601 atccccagca gcaagaacac ctgcgagcac atttacgact tcccccctct ctccgaggag 661 ctgatcagcg agatgatccg ccacccgtat gagacccagt ctgacagctt ctacttcgtg 721 gatgaccggc tggtgatgca caataaagca gactattcct acagcgggac accctgatcc 781 cacggctgcc ctgaccccag gaggctccag ttctgggctg ggagctgtga cctccccaac 841 gctcacccct caaccccaag tcctctgctt ggggagttct ccaggagctc cggaccctga 901 gtcaatgttg ggaggaaggg tacctggtgt ccccagtcaa gcccatgaag cccatgcggc 961 ctgctacatg gggtggggtc gtagggaggc tgtttgcctc cacgtctagg aaggcctgtg 1021 agaggagcag tcaggacttc cggacaactt agctgggccc tacttgggcc caagtttcag 1081 aatagtgttc ccctatcaag gctgtgacta gatcaggcag ggatccattc cctgtcccct 1141 gcccactacc ttcaggccat ttagagttgt aaatttacaa agatccacgg tgggctccag 1201 ctgccaagcc acccaaggga gtctgggccc taggcctagc cccatccctc cccatgaggg 1261 gccaagacac tgcctaaggt gtgggaggga ctggctgaga ttgcagccca tggtaggagc 1321 tggaccaact gtatatagtt ttcaataaac tttttccttt tctgttcaaa aaaaaaaaaa 1381 a // LOCUS HSU41060 3461 bp mRNA PRI 06-APR-1996 DEFINITION Human breast cancer, estrogen regulated LIV-1 protein (LIV-1) mRNA, partial cds. ACCESSION U41060 NID g1256000 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3461) AUTHORS Green,C., Gilhooly,E.M. and Walker,N.J. TITLE Direct Submission JOURNAL Submitted (21-NOV-1995) Chris Green, Biochemistry, University of Liverpool, P.O. Box 147, Liverpool L69 3BX, UK FEATURES Location/Qualifiers source 1..3461 /organism="Homo sapiens" /note="estrogen induced mRNA" /db_xref="taxon:9606" /cell_line="MCF-7 human breast cancer cell line" gene 138..2396 /gene="LIV-1" CDS 138..2396 /gene="LIV-1" /note="estrogen regulated mRNA; breast cancer" /codon_start=1 /product="LIV-1 protein" /db_xref="PID:g1256001" /translation="MARKLSVILILTFALSVTNPLHELKAAAFPQTTEKISPNWESGI NVDLAISTRQYHLQQLFYRYGENNSLSVEGFRKLLQNIGIDKIKRIHIHHDHDHHSDH EHHSDHERHSDHEHHSDHEHHSDHNHAASGKNKRKALCPDHDSDSSGKDPRNSQGKGA HRPEHASGRRNVKDSVSASEVTSTVYNTVSEGTHFLETIETPRPGKLFPKDVSSSTPP SVTSKSRVSRLAGRKTNESVSEPRKGFMYSRNTNENPQECFNASKLLTSHGMGIQVPL NATEFNYLCPAIINQIDARSCLIHTSEKKAEIPPKTYSLQIAWVGGFIAISIISFLSL LGVILVPLMNRVFFKFLLSFLVALAVGTLSGDAFLHLLPHSHASHHHSHSHEEPAMEM KRGPLFSHLSSQNIEESAYFDSTWKGLTALGGLYFMFLVEHVLTLIKQFKDKKKKNQK KPENDDDVEIKKQLSKYESQLSTNEEKVDTDDRTEGYLRADSQEPSHFDSQQPAVLEE EEVMIAHAHPQEVYNEYVPRGCKNKCHSHFHDTLGQSDDLIHHHHDYHHILHHHHHQN HHPHSHSQRYSREELKDAGVATLAWMVIMGDGLHNFSDGLAIGAAFTEGLSSGLSTSV AVFCHELPHELGDFAVLLKAGMTVKQAVLYNALSAMLAYLGMATGIFIGHYAENVSMW IFALTAGLFMYVALVDMVPEMLHNDASDHGCSRWGYFFLQNAGMLLGFGIMLLIPYLN IKSCSYKFLVKV" BASE COUNT 1057 a 679 c 727 g 998 t ORIGIN 1 ctcgtgccga attcggcacg agaccgcgtg ttcgcgcctg gtagagattt ctcgaagaca 61 ccagtgggcc cgtgtggaac caaacctgcg cgcgtggccg ggccgtggga caacgaggcc 121 gcggagacga aggcgcaatg gcgaggaagt tatctgtaat cttgatcctg acctttgccc 181 tctctgtcac aaatcccctt catgaactaa aagcagctgc tttcccccag accactgaga 241 aaattagtcc gaattgggaa tctggcatta atgttgactt ggcaatttcc acacggcaat 301 atcatctaca acagcttttc taccgctatg gagaaaataa ttctttgtca gttgaagggt 361 tcagaaaatt acttcaaaat ataggcatag ataagattaa aagaatccat atacaccatg 421 accacgacca tcactcagac cacgagcatc actcagacca tgagcgtcac tcagaccatg 481 agcatcactc agaccacgag catcactctg accataatca tgctgcttct ggtaaaaata 541 agcgaaaagc tctttgccca gaccatgact cagatagttc aggtaaagat cctagaaaca 601 gccaggggaa aggagctcac cgaccagaac atgccagtgg tagaaggaat gtcaaggaca 661 gtgttagtgc tagtgaagtg acctcaactg tgtacaacac tgtctctgaa ggaactcact 721 ttctagagac aatagagact ccaagacctg gaaaactctt ccccaaagat gtaagcagct 781 ccactccacc cagtgtcaca tcaaagagcc gggtgagccg gctggctggt aggaaaacaa 841 atgaatctgt gagtgagccc cgaaaaggct ttatgtattc cagaaacaca aatgaaaatc 901 ctcaggagtg tttcaatgca tcaaagctac tgacatctca tggcatgggc atccaggttc 961 cgctgaatgc aacagagttc aactatctct gtccagccat catcaaccaa attgatgcta 1021 gatcttgtct gattcataca agtgaaaaga aggctgaaat ccctccaaag acctattcat 1081 tacaaatagc ctgggttggt ggttttatag ccatttccat catcagtttc ctgtctctgc 1141 tgggggttat cttagtgcct ctcatgaatc gggtgttttt caaatttctc ctgagtttcc 1201 ttgtggcact ggccgttggg actttgagtg gtgatgcttt tttacacctt cttccacatt 1261 ctcatgcaag tcaccaccat agtcatagcc atgaagaacc agcaatggaa atgaaaagag 1321 gaccactttt cagtcatctg tcttctcaaa acatagaaga aagtgcctat tttgattcca 1381 cgtggaaggg tctaacagct ctaggaggcc tgtatttcat gtttcttgtt gaacatgtcc 1441 tcacattgat caaacaattt aaagataaga agaaaaagaa tcagaagaaa cctgaaaatg 1501 atgatgatgt ggagattaag aagcagttgt ccaagtatga atctcaactt tcaacaaatg 1561 aggagaaagt agatacagat gatcgaactg aaggctattt acgagcagac tcacaagagc 1621 cctcccactt tgattctcag cagcctgcag tcttggaaga agaagaggtc atgatagctc 1681 atgctcatcc acaggaagtc tacaatgaat atgtacccag agggtgcaag aataaatgcc 1741 attcacattt ccacgataca ctcggccagt cagacgatct cattcaccac catcatgact 1801 accatcatat tctccatcat caccaccacc aaaaccacca tcctcacagt cacagccagc 1861 gctactctcg ggaggagctg aaagatgccg gcgtcgccac tttggcctgg atggtgataa 1921 tgggtgatgg cctgcacaat ttcagcgatg gcctagcaat tggtgctgct tttactgaag 1981 gcttatcaag tggtttaagt acttctgttg ctgtgttctg tcatgagttg cctcatgaat 2041 taggtgactt tgctgttcta ctaaaggctg gcatgaccgt taagcaggct gtcctttata 2101 atgcattgtc agccatgctg gcgtatcttg gaatggcaac aggaattttc attggtcatt 2161 atgctgaaaa tgtttctatg tggatatttg cacttactgc tggcttattc atgtatgttg 2221 ctctggttga tatggtacct gaaatgctgc acaatgatgc tagtgaccat ggatgtagcc 2281 gctgggggta tttcttttta cagaatgctg ggatgctttt gggttttgga attatgttac 2341 ttattccata tttgaacata aaatcgtgtt cgtataaatt tctagttaag gtttaaatgc 2401 tagagtagct taaaaagttg tcatagtttc agtaggtcat agggagatga gtttgtatgc 2461 tgtactatgc agcgtttaaa gttagtgggt tttgtgattt ttgtattgaa tattgctgtc 2521 tgttacaaag tcagttaaag gtacgtttta atatttaagt tattctatct tggagataaa 2581 atctgtatgt gcaattcacc ggtattacca gtttattatg taaacaagag atttggcatg 2641 acatgttctg tatgtttcag ggaaaaatgt ctttaatgct ttttcaagaa ctaacacagt 2701 tattcctata ctggatttta ggtctctgaa gaactgctgg tgtttaggaa taagaatgtg 2761 catgaagcct aaaataccaa gaaagcttat actgaattta agcaaagaaa taaaggagaa 2821 aagagaagaa tctgagaatt ggggaggcat agattcttat aaaaatcaca aaatttgttg 2881 taaattagag gggagaaatt tagaattaag tataaaaagg cagaattagt atagagtaca 2941 ttcattaaac atttttgtca ggattatttc ccgtaaaaac gtagtgagca ctctcatata 3001 ctaattagtg tacatttaac tttgtataat acagaaatct aaatatattt aatgaattca 3061 agcaatatac acttgaccaa gaaattggaa tttcaaaatg ttcgtgcggg ttatatacca 3121 gatgagtaca gtgagtagtt tatgtatcac cagactgggt tattgccaag ttatatatca 3181 ccaaaagctg tatgactgga tgttctggtt acctggttta caaaattatc agagtagtaa 3241 aactttgata tatatgagga tattaaaact acactaagta tcatttgatt cgattcagaa 3301 agtactttga tatctctcag tgcttcagtg ctatcattgt gagcaattgt ctttatatac 3361 ggtactgtag ccatactagg cctgtctgtg gcattctcta gatgtttctt ttttacacaa 3421 taaattcctt atatcagctt gaaaaaaaaa aaaaaaaaaa a // LOCUS HSU41315 9358 bp DNA PRI 02-MAY-1996 DEFINITION Human ring zinc-finger protein (ZNF127-Xp) gene and 5' flanking sequence. ACCESSION U41315 NID g1304598 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 9358) AUTHORS Hendrich,B.D., Longstreet,M., Gustashaw,K., Nicholls,R.D. and Willard,H.F. TITLE An X-linked homologue of the autosomal imprinted gene ZNF127 escapes X chromosome inactivation JOURNAL Unpublished REFERENCE 2 (bases 1 to 9358) AUTHORS Hendrich,B.D., Longstreet,M., Gustashaw,K., Nicholls,R.D. and Willard,H.F. TITLE Direct Submission JOURNAL Submitted (27-NOV-1995) B.D. Hendrich, Genetics, Case Western Reserve University, 2109 Adelbert Road, Cleveland, OH 44106-4955, USA FEATURES Location/Qualifiers source 1..9358 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /map="Xp11.4" repeat_region 1070..1380 /rpt_family="Alu" misc_feature 3500..3800 /note="CpG island" repeat_region 3935..4295 /rpt_family="Alu" mRNA 4765..8450 /gene="ZNF127-Xp" gene 4765..8450 /gene="ZNF127-Xp" mRNA 4805..8450 /gene="ZNF127-Xp" /note="alternate transcription initiation site" misc_feature 4877..5373 /gene="ZNF127-Xp" /note="CpG Island" CDS 5107..6564 /gene="ZNF127-Xp" /note="ring zinc-finger protein; escapes X chromosome inactivation" /codon_start=1 /product="ZNF127-Xp" /db_xref="PID:g1304599" /translation="MAEAAAPGTTVTTSGAGAAAAEAAETAEAVSPTPIPTVTAPSPR AGGGVGGSDGSDGSGGRGDSGAYDGSGACGGSDACDGSGDSSGDSWTKQVTCRYFKYG ICKEGDNCRYSHDLSDRLCGVVCKYFQRGCCVYGDRCRCEHSKPLKQEEATATELTTK SSLAASSSLSSIVGPLVEMNTNEAESRNSNFATVVAGSEDWANAIEFVPGQPYCGRTV PSCTEAPLQGSVTKEESEEEQTAVETKKQLCPYAAVGQCRYGENCVYLHGDLCDMCGL QVLHPMDAAQRSQHIQACIEAHEKDMEFSFAVQRSKDKVCGICMEVVYEKANPNEHRF GILSNCNHTFCLKCIRKWRSAKEFESRIVKSCPQCRITSNFVIPSEYWVEEKEEKQKL IQKYKEAMSNKACKYFDEGRGSCPFGENCFYKHMYPDGRREEPQRQQVGTSSRNPGQQ RNHFWEFFEEGANSNPFDDEEEAVTFELGEMLLML" misc_feature 5392..5448 /gene="ZNF127-Xp" /note="C3H zinc finger motif (XRYFKYGICKEGDNCRYSH)" misc_feature 5479..5535 /gene="ZNF127-Xp" /note="encodes C3H zinc finger motif (XKYFQRGCCVYGDRCRCEH)" misc_feature 5849..5907 /gene="ZNF127-Xp" /note="encodes C3H zincfFinger motif (XAPMLQWDSADMGRTVCIST)" misc_feature 5917..6000 /gene="ZNF127-Xp" /note="encodes C2H2CH zinc finger motif (XDMCGLQVLHPMDAAQRSQHIQACIEAH)" misc_feature 6050..6216 /gene="ZNF127-Xp" /note="encodes RING zinc finger motif (XVGSAWRWSMRKPTPTSTASGSSPTATTPSVSSAFASGGVLRNL RAGSSSPAHNA)" misc_feature 6319..6381 /gene="ZNF127-Xp" /note="encodes C3H zinc finger motif (XKYFDEGRGSCPFGENCFYKH)" repeat_region 6870..7200 /rpt_family="Alu" repeat_region 7865..8165 /rpt_family="Alu" repeat_region 8415..8555 /rpt_family="Alu" repeat_region 9055..9270 /rpt_family="Alu" BASE COUNT 2229 a 2349 c 2316 g 2464 t ORIGIN 1 aagcttatgg tgcaaccaaa aatctttcag tgttttttcc tactcacact ttatactgac 61 tgaatagaga tcctgatggt tgacactacc aatttcagct ttggagaccc cttcactcag 121 tcatagctgc atgttttata gctgaagcta ttgggtttgt gacccctagc tgacttgacc 181 caggcttgca cagcagcaac cttctttgag aattccactt taattcaggg cctctgctgc 241 cccatcacca agataacatc tcttatgttc tcacctggtt tcaacacaag cgaagttttc 301 cagggcagca gcatctacct aggtttgatg gggtcagctt ctcattctcc agtgggtcac 361 ctttgccagc actgtaaacc caaattcaag gcagtgagag cctgaatact actcaatgcc 421 tcatctacgg tttcccatag gagtttgtct gtttcaggga aatctccctg aaaggcccac 481 agctccctta ttgcagccag gatctattgc aaaaaaaaaa aaccaaaaaa aaaaaaaaac 541 aaacaaaaaa aacctctaag accttctact gtctctccta gtggatctaa ggcagacacg 601 atggaccgca ggaggggctg agccataaga cccagtttac gttgccattc ttccttgaca 661 agcattacta gccgcacttc ctcatctccc aagtgaacca gccaaactcc taatgatgtg 721 cctggtttct gcccatagtg gtcatgaatt tccctccttc cacacatctc tgcaattgtt 781 gtctaaaaga gggcagaatc ttttatgtgc accagacaat acctagtact agttgttaca 841 gtggggcaga cctgagggta actccaaatt gttcaggaag ttctacacct gcattagagt 901 tagcaagaac cagggcacca tttgagcagt ggtctttgaa tctgcttgct ggtactcagc 961 ctgcatccac ctcttcaatc cccatcatga acaggcaata aagtgaagaa ctatttatgg 1021 cctagttgag cagataattt ggtcaacata tttcagcgac tttcttcaaa tttgttttct 1081 tttttttttg aaacagggtc tcgctctgtt gcccaggctg gagtgcagtg gcgcgctctt 1141 gctcactgca gcctcgactt cccaggttcg agcgattctc ccacctcagc ctcccgagca 1201 gctgggaaca caggcacaca tcaccatgcc cagctaattt ttgtattttt cgtagaggca 1261 aggttttgcc acgttgccca ggctggtctc aaactcctga gctcaagaaa tctgcttgcc 1321 ttggcctcac aaattgctgg gattacaggt gtgagccatc acgcccggcc ctcaaatttc 1381 attaaattta cccacaatat cctcatcctt ctccttctag aaagccatgt ctctgtcagt 1441 atcccgccct cctcactaaa gctgtcaaga actataaaag gtctgagatt ttgtcctact 1501 caacaagcta acaagtcagc ccactacagt ttcatggaat ctggtagaaa acatgacact 1561 cagagatggg cagtgttttg ctgtagaaag gtcagtcacc agatatcaac attttggcac 1621 cagtttcttg agccccgatt atgtagggga actcctctgt gctggtgccc tgaaaaccgt 1681 gcatatatcc tcataggcca tgtaagcaaa gctcaactgc aagctgatcc atgccgtggc 1741 agaaggctct gtgatcctcc ccaagcatga gaggataggc agtcttgtga ggagctgcca 1801 taccgccatt tctcacttgc ctttctattt ttttttcaca gtagcaagga catttattat 1861 ataaataggt ggataaaata agtgtgcttt aacttctgga tatatggcaa atgctactgt 1921 gatgcagttt ttcagtgcag acatgtgact cttcaatgat ggggatgggc cccctagtgt 1981 gatgtgcagc cactatactc catatggcaa acctcctggt tcaaggcaga gaggagacac 2041 cttattctat agaacaggat ggggcttcca ccctggggac catacaagtg gctcccatag 2101 gcagccacgc ctttctggca agctgtggca tacccaggtg caggctgggg gaggggctgg 2161 cacagcagcc ccagcgtgct ctggcttcag cctgatggag aagcaaaact aggacctccc 2221 tctcccagat gttgctgacc ctatcgccac ctccaccctc taccctgcgc tgggagttcg 2281 ctgtgacgca gcctgttcca tagccatagg gagaccaaag tgaacgatca cagtgcccct 2341 cacctattct aggggacttg gatgctcagt agagcttaat gccatggcat ctgagagcac 2401 ccaccctgtc cctgagggca gctgtgccgg gccagggggc cgcaagggca tgggtggaat 2461 cttggtcact ggcagcactc agccacgtgc ctgaggagct cttcaccttg ttcgtcactg 2521 aagcactgca ggcagtgagg gcactgaagt tccccctggc ctctctgggc tgcgccagga 2581 tgcccgccct ctgcaggggg ttccaaccgc ctgggaccca gtcccaggcc tccagccacc 2641 aggcacgata agctccactg catcggtggc caagtacttg gcagttttgc tcccagcgtg 2701 aatccagccg gcgtctggct cttgagaatc ctgtctccag gacacctggt gcagcaaaga 2761 ggtgaccttt tcctccagtt cctgaatcct accttgagcc tgttcccgat ctgccctttc 2821 tgacatgaag tcatccttgt aagcgagaat ctgctattcc agtatctgca cccattccag 2881 tgcagcatcc cgtgccgtcc tcgaggctgc cagcttctac gtcaattctg cacagtcgtt 2941 tattttctct tccaactatc tgttgagccg ggagatctcc ttcctcatca gctcgggctc 3001 atgggggggg gggggtctgc agccccctga gctgtgcatg gagccccctc atgtattcgt 3061 ccctgctgac atcgtatagc tgccacttgg catagaggtc ttcaacgtga gtcaccttct 3121 gttttaacag tcgattttct tcctcagtaa cactctggac agaggtgtcc ctacctgtat 3181 gttctgactt tcaggacttc tctcccccac gttcctttgt gcatgctgtc gttcatccag 3241 acacttggcc agatgctgac acatgtgggc agtggaggtc agcgtcctcc gcagctggtg 3301 ggtctcattg gccaaggagc tgcacagaac gtcactggcg gggtggggcg gttcgccctc 3361 cgccatgctc ctccgcatca gggcgacttc cttctctcgc tggtgctgtg gctggctcag 3421 cagctgctgc atctccctct ctttcgcttc tagtcgctca gtaagcctct caatttcctg 3481 gcgcatctgg gcctccgcgg cgccgttctc ctgccgctgc agctgctccc ggaagtgcgc 3541 cacctgctcc agcagcgcat ccaccaggga cggcgcagtg tacccctcca gcgcggccag 3601 cctggcgtgg aggcacgcga tgagggagtc gcgggcggcc agctggtcct gcaggcgccg 3661 cagccgctac ccgacctggt agcacaggcc gcagagctct gcagatgcgc gcggggcccc 3721 ctcccggccg tccgaccccg ggtccccgca cgtgactgtg ggcctgcctg ggaggccgca 3781 cggccgccgg caacttcggt gcccggaccc tgtcggctct ctatcttgag ggagtttctc 3841 atcgcctctt ggttccaatg aggtgtggga ctttccagct tgtcccctct cccttaggga 3901 ggctaagctg caggctataa aactacttag cagcagtata gaaatggctc ctctggcgcg 3961 gtggctcatg cctataatcc cagcactttg ggaggccgca gcgggtggat cacttgaggt 4021 caggagttcc agaccagcct ggccaacatg gtgaaaccct gtctctacta aaaatacaaa 4081 aattagcttg gcgtggtgat gtgtgcctgt aatcccagct actcgggaag ctgaggcagg 4141 agaatcactt gaacccggga ggtggaggtt acagtgagcc aagatcatgc cccactacac 4201 tgcagcctgg gcaacagagc gagactgtct cccaaaaaaa aaaaatatat agaaagaaag 4261 aaagagagaa agatatagaa agaaagaaaa atgaaatgac tcctcagcaa cagggtgccc 4321 caacactcat gtttcttgtc atttggcctc ttttgtttgg gtgtcctatt gtgggacaag 4381 gaatgcaggg agctgacaac atgctgttgt ttctttttct gtctatgtaa gtaataaact 4441 gtctgaatct aatcagtgct tgttgtcctc ttaccagcca aatctgtaag cctgctcaac 4501 acattctcac agggcaactc taaagaggac caggtaacac ctgcacgtgc agtggattac 4561 attataggag agaaaccctg gacttaggga atctgaacct ttcataatgg gcagtaatca 4621 ttcctgccct tcactccaga gggaggctat ttttattata gtggacagta agcatgcttg 4681 cccttttctt tggagggaga cactacctct gtctttcaag gatgcactgt atacaaatat 4741 cctggaaaag atagtccgga acaaaggaca gtacagtgcc ttacttgcaa gatgtgcaag 4801 cacacaagac ctatgaagaa ctgtctccca acaaaggata catgcagaag gagaaaaaac 4861 tctcttaaac aaaaagaggc agagccggca gggaccgagc gggtgcctca gtctccttcc 4921 cctcccctcg cctgtcctcg ccatcttctt ctcacagccg gaccggaact atgtgatccc 4981 ggaagttccg ggtcctttgg ccctatatga tcccggaagt tccggggctt ttggacctct 5041 gtgattccgg aagttccggg gcgttccggg gcgttctggg gcctttggag cgtgggataa 5101 gcagtaatgg cggaggctgc agctcccgga acaacagtca caacatcggg agcaggagca 5161 gcagcggcgg aggcggcgga gacggcggaa gcagtctccc cgactccgat ccccacagtc 5221 accgccccgt ccccgagggc gggcggaggg gtcggcggca gcgacggcag cgacggtagt 5281 ggcggcaggg gcgacagtgg cgcgtatgac ggcagcggtg cgtgcggcgg cagcgacgcg 5341 tgcgatggca gcggcgacag cagcggcgac agctggacta aacaggtcac ttgtagatat 5401 tttaagtatg ggatttgtaa ggaaggagat aactgtcgct actcgcatga cctctctgac 5461 cgtctgtgtg gtgtagtgtg caagtatttt cagcgagggt gctgtgttta tggagaccgc 5521 tgcagatgtg aacatagcaa gccattgaaa caggaagaag caactgctac agagctaact 5581 acaaagtcat cccttgctgc ttcctcaagt ctctcatcaa tagttggacc acttgttgaa 5641 atgaatacaa acgaagctga gtcaagaaat tcaaattttg caactgtagt agcaggttca 5701 gaggactggg cgaatgccat tgagtttgtt cctgggcaac cctactgtgg ccgtactgtg 5761 ccttcctgca ctgaagcacc cctgcagggc tcagtgacca aggaagaatc agaggaagag 5821 caaaccgccg tggaaacaaa gaagcagctg tgcccctatg ctgcagtggg acagtgccga 5881 tatggggaga actgtgtgta tctccacgga gatttatgtg acatgtgtgg gctgcaggtc 5941 ctgcatccga tggatgctgc ccagagatca cagcatatac aagcgtgcat tgaagcccat 6001 gagaaagaca tggagttctc atttgctgtg cagcgcagca aggacaaggt gtgtgggatc 6061 tgcatggagg tggtctatga gaaagccaac cccaacgagc accgcttcgg gatcctctcc 6121 aactgcaacc acaccttctg tctcaagtgc attcgcaagt ggaggagtgc taaggaattt 6181 gagagcagga tcgtcaagtc ctgcccacaa tgccgaatca catctaactt tgtcattcca 6241 agtgagtact gggtggagga gaaagaagag aagcagaaac tcattcagaa atacaaggag 6301 gcaatgagca acaaggcatg caagtatttt gatgaaggac gtgggagctg cccatttgga 6361 gagaactgtt tttacaagca tatgtaccct gatggccgca gagaggagcc acagagacag 6421 caagtgggaa catcaagcag aaacccaggc caacaaagga accacttctg ggaattcttt 6481 gaggaaggag cgaacagcaa cccctttgac gatgaagaag aggctgtcac ctttgagctg 6541 ggtgagatgt tgcttatgct ttaggctgca ggtggggacg acaaactgac agactctgaa 6601 aatgagtggg acttgttttg tgatgaagaa ttttatgtct tagatctata gaaaccttgc 6661 gtagtgtgtg agctggtctg ctgaccccag atagcagctg tcccctgtgg tggtgtggca 6721 gtgcctatgt tctctcctag gcagacctat caactccagg tgctgcggtt aagaatatgt 6781 acccagggcc tgtcttgtca acccctcacc tttccccaag gagtgtgttg ttttccctgt 6841 tggaaaaagt tacaaaaata aatgttaaag gttttttttg ttttttgttt tattgagaca 6901 gagtcccact ctgtcaccca ggctggagtg cagtggtgca atcttggctc actgcaacct 6961 ccgtcttccg ggttcaagcc attctcctgt ctcagcctcc caagttgctg ggactacagg 7021 tgcatgctac aatgcccagc taatttttct catttttagt agaaacgggg tttcaccata 7081 ttggtcaggc tggtctcgaa ctcctgacct caggtgatcc acctgccact accttccaaa 7141 gtgctgggat tacaggcgtc agccaccatg cccagcctta aagttagttt tttgtaacag 7201 gcatgagcca ccgcgcctgg ctttaaagtt cgttttttgt aacaggcata agccactgtg 7261 cctggcctta aagttagttt ttttgtaaca cgaatttaac tgttggacag ttagtttaga 7321 tgtgttgcat catctgtttt caaccagatt gtgtttatgg acttttcaca cactaatttt 7381 gaggacccca ggttcaaaag taaaagcagt ggccctgctt tggggtccaa taataggagt 7441 gatgggtgaa gggacctaag ctggccagta gccttctgct ccagacatgg gacgcggatc 7501 cttgaggttt ctggttaaat ctgcacatct gtgtttttat atctgttccc tacccctgta 7561 atccctaccg catgcactag ttctgtagtt ttggtctctc gtttaattgt atgcaagtag 7621 tactactggg taaccagagc caagcgtgaa tgtgttcaga tttctactgt tttgcatgat 7681 aggaaaattg agaaagaata catataaaag atatagaggc ataacatcaa tgcagagttg 7741 gaagttgacc tccacggggt gacatggtgt gtgtgagtgt gggtgtgtga gtgtgggtgt 7801 gtgataagct tctcaaacct gcatagatgc agtattcttg gctttggtag aaagccttgg 7861 tttaaggctg ggcgcggtgg ctcacgcctg taatcccagc actttgggag gctgaggcgg 7921 ccggatcaca aggccaggag atcgagacca tcctgtgaat ggcgaaaccc tgtctctact 7981 aaaaatacaa aaaattagcc gggcgtagtg gcgggcgcct gtagtcccag ctactctgga 8041 ggctgaggcg ggagaatggt gtgaacctgg gaggcggagc ttgcaatgaa ctgagaatgt 8101 gccactggac tcccagcctg ggtgacagag cgagactctg tctcaaacaa aaaaaaacaa 8161 aaacggaaac ccttggttta ggggtttaag tcttatgtgg tggttaagat cttaaaggac 8221 aaagcagtat attggtagtt atcaatatag cagtactagc tctgtttata taaatagaga 8281 aatggagtta gccatagagg ttaaaactac ctggttatcc catatattaa cccaaactgg 8341 gtcttggata cacagttgta tttaatgttt tacgatctag cctttccagt ataggcactt 8401 cctgaaaaac ctttgtcctc atttggggca ttttgttgtt gggtttcgcc atgttggcca 8461 ggctggtctc gaactcctga cctcaggtga tccacctgcc tcagcctccc aaagtgctgg 8521 gattacgggc atgagccacc acacccggcc aggaaaaggt attttatatt tatcattgct 8581 gtgctgttta ttcatttgta taaattcaag ttcccatctg gtattgtttt ccttcagcat 8641 gaataacttc tttgaacatt tcttgtggtt catgtctgct ggcaacaaat tctctcagct 8701 atttatctgg aaaagtcttt atttcacctt catatttcac ctgtattttc tttgtgtaca 8761 gaattctagg ttgacatttg ttcctgagtg ttttaaagat gtcatttcat attcttctgg 8821 tttgcataat ttctgatgag aagtctgcag tttattttct ttctatttct ctctcttttt 8881 gctcctctgc aaagccaatt cctcttgcta tttttaatat tttctcatta tcattgtttc 8941 ataggaattt gattatgatt ttccttggta tgatttcctt aatatttatt ctatttggga 9001 ccattgagct tctcatagct gcatatctgt atgtttataa ttttcatcca gtttaaaaaa 9061 aaagttttga gaaagagtct cactctgtca cccagtctaa agtacagtga cacgatcttg 9121 gctcactgca acctccgcct ccctggttca agtgattctt gtgcctcagt ctcctgagta 9181 gctggaatta caagtgcatg ccatcatgcc ctggtacttt ttgtttttac agtagagagg 9241 gcgttttggc cactactttc gtcctcaagt gtagccagct acctccggtt tcccaggcga 9301 tatcctttgg agagtttggg tgtgcagtgt ctctagtctc atttgcttca tcgagatc // LOCUS HSU41371 2839 bp mRNA PRI 18-APR-1996 DEFINITION Human spliceosome associated protein (SAP 145) mRNA, complete cds. ACCESSION U41371 NID g1173904 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2839) AUTHORS Gozani,O., Feld,R. and Reed,R. TITLE Evidence that sequence-independent binding of highly conserved U2 snRNP proteins upstream of the branch site is required for assembly of spliceosomal complex A JOURNAL Genes Dev. 10 (2), 233-243 (1996) MEDLINE 96154048 REFERENCE 2 (bases 1 to 2839) AUTHORS Reed,R. TITLE Direct Submission JOURNAL Submitted (27-NOV-1995) Robin Reed, Cell Biology, Harvard Medical School, 240 Longwood, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..2839 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="Hela" gene 49..2667 /gene="SAP 145" CDS 49..2667 /gene="SAP 145" /codon_start=1 /product="spliceosome associated protein" /db_xref="PID:g1173905" /translation="MAPGAAQELQAKLAEIGAPIQGNREELVERLQSYTRQTGIVLNR PVLRGEDGDKAAPPPMSAQLPGIPMPPPPLGLPPLQPPPPPPPPPPGLGLGFPMAHPP NLGPPPPLRVGEPVALSEEERLKLAQQQAALLMQQEERAKQQGDHSLKEHELLEQQKR AAVLLEQERQQEIAKMGTPVPRPPQDMGQIGVRTPLGPRVAAPVGPVGPTPTVLPMGA PVPRPRGPPPPPGDENREMDDPSVGPKIPQALEKILQLKESRQEEMNSQQEEEEMETD ARSSLGQSASETEEDTVSVSKKEKNRKRRNRKKKKKPQRVRGVSSESSGDREKDSTRS RGSDSPAADVEIEYVTEEPEIYEPNFIFFKRIFEAFKLTDDVKKEKEKEPEKLDKLEN SAAPKKKGFEEEHKDSDDDSSDDEQEKKPEAPKLSKKKLRRMNRFTVAELKQLVARPD VVEMHDVTAQDPKLLVHLKATRNSVPVPRHWCFKRKYLQGKRGIEKPPFELPDFIKRT GIQEMREALQEKEEQKTMKSKMREKVRPKMGKIDIDYQKLHDAFFKWQTKPKLTIHGD LYYEGKEFETRLKEKKPGDLSDELRISLGMPVGPNAHKVPPPWLIAMQRYGPPPSYPN LKIPGLNSPIPESCSFGYHAGGWGKPPVDETGKPLYGDVFGTNAAEFQTKTEEEEIDR TPWGELEPSDEESSEEEEEEESDEDKPDETGFITPADSGLITPGGFSSVPAGMETPEL IELRKKKIEEAMDGSETPQLFTVLPEKRTATVGGAMMGSTHIYDMSTVMSRKGPAPEL QGVEVALAPEELELDPMAMTQKYEEHVREQQAQVEKEDFSDMVAEHAAKQKQKKRKAQ PQDSRGGSKKYKEFKF" BASE COUNT 757 a 738 c 826 g 518 t ORIGIN 1 ctcccaaagc agaattgcag ctgccgccgc cgccacctcc aggccactat ggcgcctggg 61 gctgcccagg agcttcaggc caagttggca gagatcggag ctccgatcca gggtaatcgc 121 gaggagctgg tggagcggct gcagagctac acccgccaga ctggcatcgt gctgaatcgg 181 ccggttttga gaggggaaga tggggacaaa gccgctccac ctcccatgtc ggcacagctc 241 cctggaattc ccatgccacc accacctttg ggactccccc ctctgcagcc tcctccgcca 301 cccccaccac ctccaccagg ccttggcctt ggctttccta tggcccaccc accaaatttg 361 gggcccccgc ctcctctccg tgtgggtgag ccagtggcac tgtcagagga ggagcggctg 421 aagttggctc agcagcaggc ggcattgctg atgcagcagg aggagcgtgc caagcagcag 481 ggagatcatt cgctgaagga acatgagctc ttggagcagc agaagcgggc agctgtgtta 541 ctggagcagg aacgacagca ggagattgcc aagatgggca ccccagtccc tcggccccca 601 caagacatgg gccagattgg tgtgcgcact cctctgggtc ctcgagtagc tgctccagtg 661 ggcccagtgg gccccactcc tacagttttg cccatgggag cccctgttcc ccggcctcgt 721 ggtcccccac cgccccctgg agatgagaac agagagatgg atgacccctc tgtgggcccc 781 aagatccccc aggctttgga gaagatcctg cagctgaagg agagccgcca ggaagagatg 841 aattctcagc aggaggaaga ggaaatggaa acagatgctc gctcgtccct gggccagtca 901 gcgtcagaga ctgaggagga cacagtgtcc gtatctaaaa aggagaaaaa ccggaagcgt 961 aggaaccgaa agaagaagaa aaagccccag cgggtgcgag gggtgtcctc tgagagctct 1021 ggggaccggg agaaagactc aacccggtcc cgtggctctg attccccagc agctgatgtt 1081 gagattgagt atgtgactga agaacctgaa atttacgagc ccaactttat cttctttaag 1141 aggatctttg aggcttttaa gctcactgat gatgtgaaga aggagaaaga gaaagagcca 1201 gagaaacttg acaaactgga gaactctgca gcccccaaga agaagggatt tgaagaggag 1261 cacaaggaca gtgatgatga cagcagtgat gacgagcagg aaaagaagcc agaagccccc 1321 aagctgtcca agaagaagtt gcgccgaatg aaccgcttca ctgtggctga actcaagcag 1381 ctggtggctc ggcccgatgt cgtggagatg cacgatgtga cagcgcagga ccctaagctc 1441 ttggttcacc tcaaggccac tcggaactct gtgcctgtgc cacgccactg gtgttttaag 1501 cgcaaatacc tgcagggcaa acggggcatt gagaagcccc ccttcgagct gccagacttc 1561 atcaaacgca caggcatcca ggagatgcga gaggccctgc aggagaagga agaacagaag 1621 accatgaagt caaaaatgcg agagaaagtt cggcctaaga tgggcaaaat tgacatcgac 1681 taccagaaac tgcatgatgc cttcttcaag tggcagacca agccaaagct gaccatccat 1741 ggggacctgt actatgaggg gaaggagttc gagacacgac tgaaggagaa gaagccagga 1801 gatctgtctg atgagctaag gatttccttg gggatgccag taggaccaaa tgcccacaag 1861 gtccctcccc catggctgat tgccatgcag cgatatggac cacccccatc gtatcccaac 1921 ctgaaaatcc ctgggctgaa ctcgcccatc cctgagagct gttcctttgg gtaccatgct 1981 ggtggctggg gcaaacctcc agtggatgag actgggaaac cgctctatgg ggacgtgttt 2041 ggaaccaatg ctgctgaatt tcagaccaag actgaggaag aagagattga tcggacccct 2101 tggggggaac tggaaccatc tgatgaagaa tcctcagaag aagaggaaga ggaagaaagt 2161 gatgaagaca aaccagatga gacaggcttt attacccctg cagacagtgg ccttatcact 2221 cctggaggct tttcatcagt gcctgctgga atggagaccc ctgaactcat tgagctgagg 2281 aagaagaaga ttgaggaggc gatggacgga agtgagacac ctcagctctt cactgtgttg 2341 ccagagaaga gaacagccac tgttggaggg gccatgatgg gatcaaccca catttatgac 2401 atgtccacgg ttatgagccg gaagggcccg gctcctgagc tgcaaggtgt ggaagtggcg 2461 ctggcgcctg aagagttgga gctggatcct atggccatga cccagaagta tgaggagcat 2521 gtgcgggagc agcaggctca agtagagaag gaggacttca gtgacatggt ggctgagcac 2581 gctgccaaac agaagcaaaa aaaacggaaa gctcagcccc aggacagccg tgggggcagc 2641 aagaaatata aggagttcaa gttttaggtc ccctcacact agcccttttt ttggccctac 2701 gtctggatgc ctgggcttca cacaagaacc acctctcccg cagttcccaa ggacttgtca 2761 tttcatgttc ttattttaga cctgttttgt aaataaagct gtttcccaag gaaagagatg 2821 aaaaaaaaaa aaaaaaaaa // LOCUS HSU41515 509 bp mRNA PRI 02-MAR-1996 DEFINITION Human deleted in split hand/split foot 1 (DSS1) mRNA, complete cds. ACCESSION U41515 NID g1209723 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 509) AUTHORS Crackower,M.A., Scherer,S.W., Rommens,J.M., Hui,C.C., Poorkaj,P., Soder,S., Cobben,J.M., Buys,C., Hudgins,L., Evans,J.P. and Tsui,L.-C. TITLE Characterization of the split hand/split foot malformation locus SHFM1 at 7q21.3-q22.1 and analysis of a candidate gene for its expression during limb development JOURNAL Unpublished REFERENCE 2 (bases 1 to 509) AUTHORS Tsui,L.-C., Crackower,M.A. and Scherer,S.W. TITLE Direct Submission JOURNAL Submitted (29-NOV-1995) L.-C. Tsui, Genetics, Hospital for Sick Children, 555 University Ave., Toronto, ON M5G 1X8, Canada FEATURES Location/Qualifiers source 1..509 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" /map="7q21.3-q22.1" gene 129..341 /gene="DSS1" CDS 129..341 /gene="DSS1" /standard_name="deleted in split hand/split foot 1" /note="Method: conceptual translation supplied by author." /codon_start=1 /db_xref="PID:g1209724" /translation="MSEKKQPVDLGLLEEDDEFEEFPAEDWAGLDEDEDAHVWEDNWD DDNVEDDFSNQLRAELEKHGYKMETS" BASE COUNT 156 a 78 c 137 g 138 t ORIGIN 1 attctttccc caagtctcta tggtagcgtc agcgtcggag gcggtagtga cggtggcgtt 61 tccttgagga agagtgaggg ttccaacttt tctgcttatc tgggaggtgt tgggcgcgga 121 cagtcgagat gtcagagaaa aagcagccgg tagacttagg tctgttagag gaagacgacg 181 agtttgaaga gttccctgcc gaagactggg ctggcttaga tgaagatgaa gatgcacatg 241 tctgggagga taattgggat gatgacaatg tagaggatga cttctctaat cagttacgag 301 ctgaactaga gaaacatggt tataagatgg agacttcata gcatccagaa gaagtgttga 361 agtaacctaa acttgacctg cttaatacat tctagggcag agaacccagg atgggacact 421 aaaaaaatgt gtttatttca ttatctgctt ggatttattt gtgtttttgt aacacaaaaa 481 ataaatgttt tgatataaaa aaaaaaaaa // LOCUS HSU41737 586 bp mRNA PRI 18-NOV-1997 DEFINITION Human pancreatic beta cell growth factor (INGAP) mRNA, complete cds. ACCESSION U41737 NID g1514681 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 586) AUTHORS Rafaeloff,R., Pittenger,G.L., Barlow,S.W., Qin,X.F., Yan,B., Rosenberg,L., Duguid,W.P. and Vinik,A.I. TITLE Cloning and sequencing of the pancreatic islet neogenesis associated protein (INGAP) gene and its expression in islet neogenesis in hamsters JOURNAL J. Clin. Invest. 99 (9), 2100-2109 (1997) MEDLINE 97296198 REFERENCE 2 (bases 1 to 586) AUTHORS Vinik,A.I. TITLE Direct Submission JOURNAL Submitted (01-DEC-1995) Aaron I. Vinik, Eastern Virginia Medical School, The Diabetes Institutes, 855 W. Brambleton Ave, Norfolk, VA 23510, USA FEATURES Location/Qualifiers source 1..586 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="pancreas" gene 1..586 /gene="INGAP" CDS 6..521 /gene="INGAP" /codon_start=1 /product="pancreatic beta cell growth factor" /db_xref="PID:g1514682" /translation="MTLCRMSWMLLSCLMFLSWVEGEESQKKLPSSRITCPQGSVAYG SYCYSLILIPQTWSNAELSCQMHFSGHLAFLLSTGEITFVSSLVKNSLTAYQYIWIGL HDPSHGTLPNGSGWKWSSSNVLTFYNWERNPSIAADRGYCAVLSQKSGFQKWRDFNCE NELPYICKFKV" BASE COUNT 148 a 132 c 130 g 176 t ORIGIN 1 ttcccatgac cctctgtagg atgtcttgga tgctgctttc ctgcctgatg ttcctttctt 61 gggtggaagg tgaagaatct caaaagaaac tgccttcttc acgtataacc tgtcctcaag 121 gctctgtagc ctatgggtcc tattgctatt cactgatttt gataccacag acctggtcta 181 atgcagaact atcctgccag atgcatttct caggacacct ggcatttctt ctcagtactg 241 gtgaaattac cttcgtgtcc tcccttgtga agaacagttt gacggcctac cagtacatct 301 ggattggact ccatgatccc tcacatggta cactacccaa cggaagtgga tggaagtgga 361 gcagttccaa tgtgctgacc ttctataact gggagaggaa cccctctatt gctgctgacc 421 gtggttattg tgcagttttg tctcagaaat caggttttca gaagtggaga gattttaatt 481 gtgaaaatga gcttccctat atctgcaaat tcaaggtcta gggcagttct aatttcaaca 541 gcttgaaaat attatgaagc tcacatggac aaggaagcaa gtatga // LOCUS HSU41740 7695 bp mRNA PRI 10-APR-1996 DEFINITION Human trans-Golgi p230 mRNA, complete cds. ACCESSION U41740 NID g1213483 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7695) AUTHORS Erlich,R., Gleeson,P.A., Campbell,P., Dietzsch,E. and Toh,B.H. TITLE Molecular characterization of trans-Golgi p230. A human peripheral membrane protein encoded by a gene on chromosome 6p12-22 contains extensive coiled-coil alpha-helical domains and a granin motif JOURNAL J. Biol. Chem. 271 (14), 8328-8337 (1996) MEDLINE 96215236 REFERENCE 2 (bases 1 to 7695) AUTHORS Erlich,R., Gleeson,P.A., Campbell,P., Dietzsch,E. and Toh,B.H. TITLE Direct Submission JOURNAL Submitted (30-NOV-1995) Paul A. Gleeson, Pathology and Immunology, Monash University, Commercial Road, Melbourne, Victoria 3181, Australia FEATURES Location/Qualifiers source 1..7695 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6p12-22" /cell_type="hepatoma cells (HepG2)" CDS 286..6978 /note="peripheral membrane protein" /codon_start=1 /product="trans-Golgi p230" /db_xref="PID:g1213484" /translation="MFKKLKQKISEEQQQLQQALAPAQASSNSSTPTRMRSRTSSFTE QLDEGTPNRESGDTQSFAQKLQLRVPSVESLFRSPIKESLFRSSSKESLVRTSSRESL NRLDLDSSTASFDPPSDMDSEAEDLVGNSDSLNKEQLIQRLRRMERSLSSYRGKYSEL VTAYQMLQREKKKLQGILSQSQDKSLRRIAELREELQMDQQAKKHLQEEFDASLEEKD QYISVLQTQVSLLKQRLRNGPMNVDVLKPLPQLEPQAEVFTKEENPESDGEPVVEDGT SVKTLETLQQRVKRQENLLKRCKETIQSHKEQCTLLTSEKEALQEQLDERLQELEKIK DLHMAEKTKLITQLRDAKNLIEQLEQDKGMVIAETKRQMHETLEMKEEEIAQLRSRIK QMTTQGEELREQKEKSERAAFEELEKALSTAQKTEEARRKLKAEMDEQIKTIEKTSEE ERISLQQELSRVKQEVVDVMKKSSEEQIAKLQKLHEKELARKEQELTKKLQTREREFQ EQMKVALEKSQSEYLKISQEKEQQESLALEELELQKKAILTESENKLRDLQQEAETYR TRILELESSLEKSLQENKNQSKDLAVHLEAEKNKHNKEITVMVEKHKTELESLKHQQD ALWTEKLQVLKQQYQTEMEKLREKCEQEKETLLKDKEIIFQAHIEEMNEKTLEKLDVK QTELESLSSELSEVLKARHKLEEELSVLKDQTDKMKQELEAKMDEQKNHHQQQVDSII KEHEVSIQRTEKALKDQINQLELLLKERDKHLKEHQAHVENLEADIKRSEGELQQASA KLDVFQSYQSATHEQTKAYEEQLAQLQQKLLDLETERILLTKQVAEVEAQKKDVCTEL DAHKIQVQDLMQQLEKQNSEMEQKVKSLTQVYESKLEDGNKEQEQTKQILVEKENMIL QMREGQKKEIEILTQKLSAKEDSIHILNEEYETKFKNQEKKMEKVKQKAKEMQETLKK KLLDQEAKLKKELENTALELSQKEKQFNAKMLEMAQANSAGISDAVSRLETNQKEQIE SLTEVHRRELNDVISIWEKKLNQQAEELQEIHEIQLQEKEQEVAELKQKILLFGCEKE EMNKEITWLKEEGVKQDTTLNELQEQLKQKSAHVNSLAQDETKLKAHLEKLEVDLNKS LKENTFLQEQLVELKMLAEEDKRKVSELTSKLKTTDEEFQSLKSSHEKSNKSLEDKSL EFKKLSEELAIQLDICCKKTEALLEAKTNELINISSSKTNAILSRISHCQHRTTKVKE ALLIKTCTVSELEAQLRQLTEEQNTLNISFQQATHQLEEKENQIKSMKADIESLVTEK EALQKEGGNQQQAASEKESCITQLKKELSENINAVTLMKEELKEKKVEISSLSKQLTD LNVQLQNSISLSEKEAAISSLRKQYDEEKCELLDQVQDLSFKVDTLSKEKISALEQVD DWSNKFSEWKKKAQSRFTQHQNTVKELQIQLELKSKEAYEKDEQINLLKEELDQQNKR FDCLKGEMEDDKSKMEKKESNLETELKSQTARIMELEDHITQKTIEIESLNEVLKNYN QQKDIEHKELVQKLQHFQELGEEKDNRVKEAEEKILTLENQVYSMKAELETKKKELEH VNLSVKSKEEELKALEDRLESESAAKLAELKRKAEQKIAAIKKQLLSQMEEKEEQYKK GTESHLSELNTKLQEREREVHILEEKLKSVESSQSETLIVPRSAKNVAAYTEQEEADS QGCVQKTYEEKISVLQRNLTEKEKLLQRVGQEKEETVSSHFEMRCQYQERLIKLEHAE AKQHEDQSMIGHLQEELEEKNKKYSLIVAQHVEKEGGKNNIQAKQNLENVFDDVQKTL QEKELTCQILEQKIKELDSCLVRQKEVHRVEMEELTSKYEKLQALQQMDGRNKPTELL EENTEEKSKSHLVQPKLLSNMEAQHNDLEFKLAGAEREKQKLGKEIVRLQKDLRMLRK EHQQELEILKKEYDQEREEKIKQEQEDLELKHNSTLKQLMREFNTQLAQKEQELEMTI KETINKAQEVEAELLESHQEETNQLLKKIAEKDDDLKRTAKRYEEILDAREEEMTAKV RDLQTQLEELQKKYQQKLEQEENPGNDNVTIMELQTQLAQKTTLISDSKLKEQEFREQ IHNLEDRLKKYEKNVYATTVGTPYKGGNLYHTDVSLFGEPTEFEYLRKVLFEYMMGRE TKTMAKVITTVLKFPDDQTQKILEREDARLMFTSPRSGIF" BASE COUNT 2985 a 1277 c 1727 g 1706 t ORIGIN 1 gcaacgaagg taccatggcc gttgtcgtcg ccgccgcggc tcccggggct ggatgggggg 61 ccgaggccag ccagtggcac ccggaagaaa gagacgcggc ggcggcgacg ccgacaccct 121 caggacgagt gtccggactt gcccacagcc tcaaggagga gacggcgagg cccggccccc 181 gctgtccctg gtgtaaagaa gtcgccgtag ccgtcgcggc cgggactccc cgggctctcg 241 cccttcaggt ttcgttgaca ctcaggaccg tacgtacgct gcgccatgtt caagaaactg 301 aagcaaaaga tcagcgagga gcagcagcag ctccagcagg cgctggctcc tgctcaggcg 361 tcctccaatt cttcaacacc aacaagaatg aggagcagga catcttcatt tacagagcaa 421 cttgatgaag gtacacccaa tagagagtca ggtgacacac agtcttttgc acagaagctc 481 cagctccggg tgccctccgt ggagtctttg tttcgaagtc cgataaagga atctctattc 541 cggtcttctt ctaaagagtc tttggtacga acatcttcca gagaatccct gaatcgactt 601 gacctggaca gttctactgc cagttttgat ccaccctctg atatggatag cgaggctgaa 661 gacttggtag ggaattcaga cagtctcaac aaagaacagt tgattcagcg gttgcgaaga 721 atggaacgaa gcttaagtag ctacagggga aaatattctg agcttgttac agcttatcag 781 atgcttcaga gagagaagaa aaagctacaa ggtatattaa gtcagagtca ggataaatca 841 cttcggagaa tagcagaatt aagagaggag ctccaaatgg accagcaggc aaagaaacat 901 ctgcaagagg agtttgatgc atctttagag gagaaagatc agtatatcag tgttctccaa 961 actcaggttt ctctactgaa acaacgatta cgaaatggcc cgatgaatgt tgatgtactg 1021 aaaccacttc ctcagctgga accacaggct gaagtcttca ctaaagaaga gaatccagaa 1081 agtgatggag agccagtagt ggaagatgga acttctgtaa aaacactgga aacactccag 1141 caaagagtga agcgtcaaga gaacctactt aagcgttgta aggaaacaat tcagtcacat 1201 aaggaacaat gtacactatt aactagtgaa aaagaagctc tgcaagaaca actggatgaa 1261 agacttcaag aactagaaaa gataaaggac cttcatatgg ccgagaagac taaacttatc 1321 actcagttgc gtgatgcaaa gaacttaatt gaacagcttg aacaagataa gggaatggta 1381 atcgcagaga caaaacgtca gatgcatgaa accctggaaa tgaaagaaga agaaattgct 1441 caactccgta gtcgcatcaa acagatgact acccagggag aggaattacg ggaacagaaa 1501 gaaaagtccg aaagagctgc ttttgaggaa cttgaaaaag ctttgagtac agcccaaaaa 1561 acagaggaag cacggagaaa actgaaggca gaaatggatg aacaaataaa aactatcgaa 1621 aaaacaagtg aggaggaacg catcagtctt caacaggaat taagtcgggt gaaacaggag 1681 gttgttgatg taatgaaaaa atcctcagaa gaacaaattg ctaagctaca gaagcttcat 1741 gaaaaggagc tggccagaaa agagcaggaa ctgaccaaga agcttcagac ccgagaaagg 1801 gaatttcagg aacaaatgaa agtagctctt gaaaagagtc aatcagaata tttgaagatc 1861 agccaagaaa aagaacagca agaatctttg gccctagaag agttagagtt gcagaaaaaa 1921 gcaatcctca cagaaagtga aaataaactt cgggaccttc agcaagaagc agagacttac 1981 agaactagaa ttcttgaatt ggaaagttct ttggaaaaaa gcttacaaga aaacaaaaat 2041 cagtcaaaag atttggctgt tcatctggaa gctgaaaaaa ataagcacaa taaggagatt 2101 acagtcatgg ttgaaaaaca caagacagaa ttggaaagcc ttaagcatca gcaggatgcc 2161 ctttggactg aaaaactcca agtcttaaag caacaatatc agactgaaat ggaaaaactt 2221 agggaaaagt gtgaacaaga aaaagaaaca ttgttgaaag acaaagagat tatcttccag 2281 gcccacatag aagaaatgaa tgaaaagact ttagaaaagc ttgatgtgaa gcaaacagaa 2341 ctagaatcat tatcttctga actgtcagaa gtattaaaag cccgtcacaa actagaagag 2401 gaactttctg ttctgaaaga tcaaacagat aaaatgaagc aggaattaga ggccaagatg 2461 gatgaacaga aaaatcatca ccagcagcaa gttgacagta tcattaaaga acacgaggta 2521 tctatccaga ggactgagaa ggcattaaaa gatcaaatta atcaacttga gcttctcttg 2581 aaggaaaggg acaagcattt gaaagagcat caggctcatg tagaaaattt agaggcagat 2641 attaaaaggt ctgaagggga actccagcag gcatctgcta agctggacgt ttttcagtct 2701 taccagagtg ccacacatga gcagacaaaa gcatatgagg aacagttggc ccaattgcag 2761 cagaagttgt tggatttgga aacagaaaga attcttctta ccaaacaggt tgctgaagtt 2821 gaagcacaaa agaaagatgt ttgtactgag ttagatgctc acaaaatcca ggtgcaggac 2881 ttaatgcagc aacttgaaaa acaaaatagt gaaatggagc aaaaagtaaa atctttaacc 2941 caagtctatg agtccaaact tgaagatggt aacaaagaac aggaacagac aaagcaaatc 3001 ttggtggaaa aggaaaatat gattttacaa atgagagaag gacagaagaa agaaattgag 3061 atactcacac agaaattgtc agccaaggag gacagtattc atattttgaa tgaggaatat 3121 gaaaccaaat ttaaaaacca agaaaaaaag atggaaaaag ttaagcagaa agcaaaggag 3181 atgcaagaaa cgttaaagaa aaaattactg gatcaggaag ccaaacttaa gaaagagctt 3241 gaaaatactg ctctagagct tagtcagaaa gaaaaacagt ttaatgccaa aatgctggaa 3301 atggcacagg ctaactcagc tggaatcagt gatgcagtgt caagactgga aacaaaccaa 3361 aaagaacaaa tagaaagtct tactgaggtt catcgacgag aactcaatga tgtcatatca 3421 atctgggaaa agaaacttaa tcagcaagct gaagaacttc aggaaataca tgaaatccaa 3481 ttacaggaaa aagaacaaga ggtagcagaa ctgaaacaaa agatcctcct atttgggtgt 3541 gaaaaagaag agatgaacaa ggaaataaca tggctgaagg aagaaggtgt taagcaggat 3601 acaacattaa atgaattaca ggaacagtta aagcagaagt ctgcccatgt gaattctctt 3661 gcacaagatg aaactaaact gaaagctcat cttgaaaagc tagaggttga cttgaataag 3721 tctctgaagg aaaatacttt tcttcaagag cagctagttg aactgaagat gctggcagaa 3781 gaagataagc ggaaggtttc tgagttgact agcaagttga aaaccacaga tgaagaattc 3841 cagagtttga aatcttcaca tgaaaaaagt aacaaaagcc tagaggacaa gagcttggaa 3901 tttaaaaaac tgtctgagga actagcgatt cagctagata tttgctgtaa gaaaaccgaa 3961 gccttattag aagctaaaac aaatgagcta atcaacatta gtagtagtaa aactaatgcc 4021 attctttcta ggatttctca ttgtcagcac cgtacaacta aagttaagga ggcactgtta 4081 attaaaactt gcacagtttc tgaattagaa gcacaactta gacagttgac agaggagcaa 4141 aatacactaa atatttcttt tcaacaggct actcatcagt tagaagaaaa agaaaatcaa 4201 attaagagca tgaaggctga tattgaaagt cttgtaacag aaaaagaagc cttacagaag 4261 gaaggaggca atcagcaaca ggctgcttct gaaaaggagt cttgtataac acagttgaag 4321 aaagagttat ctgaaaacat caatgctgtc acattgatga aagaagagct taaagaaaaa 4381 aaagttgaga ttagcagtct tagtaaacaa ctaactgatt tgaatgttca gcttcaaaat 4441 agcatcagcc tatccgaaaa agaagcagcc atttcatcac taagaaagca gtatgatgaa 4501 gaaaaatgtg aattgctgga tcaggtgcaa gatttatctt ttaaagttga cactctgagt 4561 aaagagaaaa tttctgctct tgagcaggta gatgactggt ccaataaatt ctcagaatgg 4621 aagaagaaag cacagtcaag atttacacag catcaaaaca ctgttaaaga attgcagatc 4681 cagcttgagt taaaatcaaa ggaagcttat gaaaaggatg agcagataaa tttattgaag 4741 gaagagcttg atcagcaaaa taaaagattt gattgtttaa agggtgaaat ggaagacgac 4801 aagagcaaga tggagaaaaa ggagtctaat ttagaaacag agttaaagtc tcaaacagca 4861 agaattatgg aattagagga ccatattacc cagaaaacta ttgaaataga gtccttaaat 4921 gaagttctta aaaattacaa tcaacaaaag gatattgaac acaaagaatt ggttcagaaa 4981 cttcaacatt ttcaagagtt aggagaagaa aaggacaaca gggttaaaga agctgaagaa 5041 aaaatcttaa cacttgaaaa ccaagtttat tccatgaaag ctgaacttga aactaagaag 5101 aaagaattag aacatgtgaa tttaagtgtg aaaagcaaag aggaggagtt aaaggcattg 5161 gaagataggc ttgagtcaga aagtgctgca aaattagcag agttgaagag aaaagctgaa 5221 caaaaaattg ctgccattaa gaagcagttg ttatctcaaa tggaagagaa agaagaacag 5281 tataaaaaag gtacagaaag ccatttgagt gagctaaata caaaattgca ggaaagagaa 5341 agggaagttc acatcttgga agaaaaactt aagtcagtgg aaagttcaca gtcagaaaca 5401 ttaattgtac ccagatcagc aaaaaatgtg gcagcatata ctgaacaaga agaagcagat 5461 tcccaaggct gtgtgcagaa gacatatgaa gaaaaaatca gtgttttaca aagaaactta 5521 actgaaaaag aaaagctatt gcagagggta gggcaggaaa aagaagagac agtttcttct 5581 cattttgaaa tgcgatgcca ataccaggag cgcttaataa agctagaaca tgctgaggca 5641 aagcaacatg aagatcaaag tatgataggt catcttcaag aggagcttga agaaaaaaac 5701 aagaaatatt ccttgatagt agcccagcat gtggaaaaag aaggaggtaa aaataacata 5761 caggcaaagc aaaacttgga aaatgtgttt gacgacgtcc agaaaaccct ccaggagaag 5821 gaactaacct gtcagatttt ggagcaaaag ataaaagagc tggattcctg cttagtaaga 5881 cagaaagaag tacatagagt tgaaatggaa gagttgacct caaaatatga aaaattacag 5941 gctttacaac agatggatgg aagaaataaa cccacagaac ttttggaaga aaacactgaa 6001 gaaaagtcca aatcacattt ggtccaaccc aaattgctta gtaacatgga agcccagcac 6061 aatgatctgg agtttaaatt agccggggca gaacgggaga aacagaaact gggcaaggag 6121 attgttagat tgcagaaaga ccttcgaatg ttgagaaagg agcatcagca agaattggaa 6181 atactaaaga aagaatatga tcaagaaagg gaagagaaaa tcaaacagga gcaggaagat 6241 cttgaactga agcacaattc cacattaaaa cagctgatga gggagtttaa tacacagctg 6301 gcacaaaagg aacaagagct ggaaatgacc ataaaagaaa ctatcaataa ggcccaggag 6361 gtggaggctg aacttttaga aagccatcaa gaagagacaa atcagttact taaaaaaatt 6421 gctgagaaag atgatgatct aaaacgaaca gccaaaagat atgaagaaat ccttgatgct 6481 cgtgaagaag aaatgactgc aaaagtaagg gacctgcaga ctcaacttga ggagctgcag 6541 aagaaatacc agcaaaagct agagcaggag gagaaccctg gcaatgataa tgtaacaatt 6601 atggagctac agacacagct agcacagaag acgactttaa tcagtgattc gaaattgaaa 6661 gagcaagagt tcagagaaca gattcacaat ttagaagacc gtttgaagaa atatgaaaag 6721 aatgtatatg caacaactgt ggggacacct tacaaaggtg gcaatttgta ccatacggat 6781 gtctcactct ttggagaacc taccgaattt gagtatttgc gaaaagtgct ttttgagtat 6841 atgatgggtc gtgagactaa gaccatggca aaagttataa ccaccgtact gaagttccct 6901 gatgatcaga ctcagaaaat tttggaaaga gaagatgctc ggctgatgtt tacttcacct 6961 cgcagtggta tcttctgagt aaaccatcag tctgtgctta gttaacatgt gtcatggctc 7021 cgatcttcat cttgaagaag agtgacattg ggtgactgct gcttggaaaa ctgtccacac 7081 ttgctactct ttgagaatga agttgtcatt cagggcccct catgtagcca aaagaccaag 7141 aaaaatctgg cccacagata agttgcagac tgcctttaaa atagatttta tcagtggaga 7201 aatggtgata gttttttctt cagttttctc ttgggaagga gttttatgtt gtttaaaaga 7261 tattttgata acttaacctg ctttatgggc ttacataata ttcctttcat ccattctttt 7321 taaagaacgg cttacctttc ctatttattt ttagggtgat tttttaaaaa gacttgtgca 7381 atacattttg aggtgaaact tagtggattt tttctgataa attagagcat ttaattgact 7441 attttattca ggttgatctg ttgaatattt gctaaagacc agttctttaa gctaagacat 7501 gtaaaaaatc ccaaatggca gtacctcatt gtttacttag cttttgtact tatatttttc 7561 agaggaaaaa acactactgt aaattgtgaa tagccaatac ataactgtat tgtatgcaaa 7621 tctgtgattg ttggcagtgt catctctgag aaacagataa ataaagttta tttactataa 7681 aaaaaaaaaa aaaag // LOCUS HSU41745 837 bp mRNA PRI 07-MAY-1996 DEFINITION Human PDGF associated protein mRNA, complete cds. ACCESSION U41745 NID g1136583 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 837) AUTHORS Fischer,W.H. and Schubert,D. TITLE Characterization of a novel platelet-derived growth factor-associated protein JOURNAL J. Neurochem. 66 (5), 2213-2216 (1996) MEDLINE 96373766 REFERENCE 2 (bases 1 to 837) AUTHORS Fischer,W.H. and Schubert,D. TITLE Direct Submission JOURNAL Submitted (01-DEC-1995) Wolfgang H. Fischer, PBL, The Salk Institute, 10010 N. Torrey Pines Road, La Jolla, CA 92037, USA FEATURES Location/Qualifiers source 1..837 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="neural retina" CDS 22..567 /codon_start=1 /product="PDGF associated protein" /db_xref="PID:g1136584" /translation="MPKGGRKGGHKGRARQYTSPEEIDAQLQAEKQKAREEEEQKEGG DGAAGDPKKEKKSLDSDESEDEEDDYQQKRKGVEGLIDIENPNRVAQTTKKVTQLDLD GPKELSRREREEIEKQKAKERYMKMHLAGKTEQAKADLARLAIIRKQREEAARKKEEE RKAKDDATLSGKRMQSLSLNK" BASE COUNT 247 a 193 c 271 g 126 t ORIGIN 1 gaattccgcg gcggcgcctc aatgcctaaa ggaggaagaa agggaggcca caaaggccgg 61 gcgaggcagt atacaagccc tgaggagatc gacgcgcagc tgcaggctga gaagcagaag 121 gccagggaag aagaggagca aaaagaaggt ggagatgggg ctgcaggtga ccccaaaaag 181 gagaagaaat ctctagactc agatgagagt gaggatgaag aagatgacta ccagcaaaag 241 cgcaaaggcg ttgaagggct catcgacatc gagaacccca accgggtggc acagacaacc 301 aaaaaggtca cacaactgga tctggacggg ccaaaggagc tttcgaggag agaacgagaa 361 gagattgaga agcagaaggc aaaagagcgt tacatgaaaa tgcacttggc cgggaagaca 421 gagcaagcca aggctgacct ggcccggctg gccatcatcc ggaaacagcg ggaggaggct 481 gcccggaaga aggaagagga aaggaaagca aaagacgatg ccacattgtc aggaaaacga 541 atgcagtcac tctccctgaa taagtaactg cgacccgtgg gaggagatgc cggggacctg 601 ggccgcgctg ccaggacctc tgctgtgtct cgcccaccct gtgccctggc gccgctgcaa 661 cagcccctca tggccaggag ccccccatgc ctgggcctcc tcttcatctt ggcacagaaa 721 ttgtttgggg gatggggggg ggactggggg aggggtagct gctatctttg agacagaaag 781 atgcaggaca gcatttcata tgtaaccatt tgaatgtttt tgctgttttt agaattc // LOCUS HSU41763 5564 bp mRNA PRI 30-MAY-1996 DEFINITION Human muscle specific clathrin heavy chain (CLTD) mRNA, complete cds. ACCESSION U41763 NID g1335853 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5564) AUTHORS Sirotkin,H., Morrow,B., DasGupta,R., Goldberg,R., Patanjali,S.R., Shi,G., Cannizzaro,L., Shprintzen,R., Weissman,S.M. and Kucherlapati,R. TITLE Isolation of a new clathrin heavy chain gene with muscle-specific expression from the region commonly deleted in velo-cardio-facial syndrome JOURNAL Hum. Mol. Genet. 5 (5), 617-624 (1996) MEDLINE 96311556 REFERENCE 2 (bases 1 to 5564) AUTHORS Sirotkin,H, Morrow,B, DasGupta,R., Patangali,S, Weissman,S and Kucherlapati,R. TITLE Direct Submission JOURNAL Submitted (01-DEC-1995) Howard Sirotkin, Molecular Genetics, AECOM, 1300 Morris Park Ave., Bronx, NY 10461, USA FEATURES Location/Qualifiers source 1..5564 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="22" /map="22q11" gene 98..5020 /gene="CLTD" CDS 98..5020 /gene="CLTD" /codon_start=1 /product="muscle clathrin heavy chain" /db_xref="PID:g1335854" /translation="MAQILPVRFQEHFQLQNLGINPANIGFSTLTMESDKFICIREKV GEQAQVTIIDMSDPMAPIRRPISAESAIMNPASKVIALKAGKTLQIFNIEMKSKMKAH TMAEEVIFWKWVSVNTVALVTETAVYHWSMEGDSQPMKMFDRHTSLVGCQVIHYRTDE YQKWLLLVGISAQQNRVVGAMQLYSVDRKVSQHIEGHAAAFAEFKMEGNAKPATHFCF AVRNPTGGKLHIIEVGQPAAGNQPFVKKAVDVFFPPEAQNDFPVAMQIGAKHGVIYLI TKYGYLHLYDLESGVCICMNRISADTIFVTAPHKPTSGIIGVNKKGQVLSVCVEEDNI VNYATNVLQNPDLGLRLAVRSNLAGAEKLFVRKFNTLFAQGSYAEAAKVAASAPKGIL RTRETVQKFQSIPAQSGQASPLLQYFGILLDQGQLNKLESLELCHLVLQQGRKQLLEK WLKEDKLECSEELGDLVKTTDPMLALSVYLRANVPSKVIQCFAETGQFQKIVLYAKKV GYTPDWIFLLRGVMKISPEQGQQFSRMLVQDEEPLANISQIVDIFMENSLIQQCTSFL LDALKNNRPAEGLLQTWLLEMNLVHAPQVADAILGNKMFTHYDRAHIAQLCEKAGLLQ QALEHYTDLYDIKRAVVHTHLLNPEWLVNFFGSLSVEDSVECLHAMLSANIRQNLQLC VQVASKYHEQLGTQALVELFESFKSYKGLFYFLGSIVNFSQDPDVHLKYIQAACKTGQ IKEVERICRESSCYNPERVKNFLKEAKLTDQLPLIIVCDRFGFVHDLVLYLYRNNLQR YIEIYVQKVNPSRTPAVIGGLLDVDCSEEVIKHLIMAVRGQFSTDELVAEVEKRNRLK LLLPWLESQIQEGCEEPATHNALAKIYIDSNNSPECFLRENAYYDSSVVGRYCEKRDP HLACVAYERGQCDLELIKVCNENSLFKSEARYLVCRKDPELWAHVLEETNPSRRQLID QVVQTALSETRDPEEISVTVKAFMTADLPNELIELLEKIVLDNSVFSEHRNLQNLLIL TAIKADRTRVMEYISRLDNYDALDIASIAVSSALYEEAFTVFHKFDMNASAIQVLIEH IGNLDRAYEFAERCNEPAVWSQLAQAQLQKDLVKEAINSYIRGDDPSSYLEVVQSASR SNNWEDLVKFLQMARKKGRESYIETELIFALAKTSRVSELEDFINGPNNAHIQQVGDR CYEEGMYEAAKLLYSNVSNFARLASTLVHLGEYQAAVDNSRKASSTRTWKEVCFACMD GQEFRFAQLCGLHIVIHADELEELMCYYQDRGYFEELILLLEAALGLERAHMGMFTEL AILYSKFKPQKMLEHLELFWSRVNIPKVLRAAEQAHLWAELVFLYDKYEEYDNAVLTM MSHPTEAWKEGQFKDIITKVANVELCYRALQFYLDYKPLLINDLLLVLSPRLDHTWTV SFFSKAGQLPLVKPYLRSVQSHNNKSVNEALNHLLTEEEDYQGLRASIDAYDNFDNIS LAQQLEKHQLMEFRCIAAYLYKGNNWWAQSVELCKKDHLYKDAMQHAAESRDAELAQK LLQWFLEEGKRECFAACLFTCYDLLRPDMVLELAWRHNLVDLAMPYFIQVMREYLSKV DKLDALESLRKQEEHVTEPAPLVFDFDGHE" BASE COUNT 1400 a 1397 c 1485 g 1282 t ORIGIN 1 ggcttctggt tcggcccacc tctgaaggtt ccagaatcga tagtgaattc gtgattcctg 61 ccgctgccgc cgccgccgcc gaggtcccgc accagccatg gcgcagatcc tccctgttcg 121 ctttcaggag cacttccagc tccaaaacct tggaattaat ccagctaaca ttggattcag 181 cacactgacc atggaatctg acaagttcat atgtatccga gagaaagttg gtgagcaggc 241 acaggtcacg atcattgaca tgagtgaccc aatggctccg atccgacggc ctatctctgc 301 agagagtgcc atcatgaatc cagcctctaa ggtgatagct ctgaaagctg ggaagacact 361 tcagatcttt aatattgaga tgaagagtaa aatgaaggct catactatgg cagaagaagt 421 gattttctgg aaatgggttt ctgtgaacac tgttgccttg gtgaccgaga ccgcggtcta 481 ccactggagc atggaaggtg actcccagcc catgaagatg tttgatagac ataccagtct 541 ggtgggctgc caggtgattc actaccggac tgatgagtac cagaagtggc tgctgctcgt 601 aggcatctcg gctcagcaaa accgtgtggt tggagcaatg cagctctact ctgtggatag 661 gaaggtttca caacacatag aaggccatgc tgcggctttt gcagagttca agatggaggg 721 gaatgccaag cctgccaccc atttctgctt tgctgtacgt aatcccacag gaggcaagtt 781 gcacatcatt gaagttggac agcctgcagc gggaaaccaa ccttttgtaa agaaagcagt 841 agatgtgttt tttcctccag aggcacagaa tgattttcca gtggctatgc agattggagc 901 taaacatggt gttatttact tgatcacaaa gtatggctat cttcatctgt acgacctaga 961 gtctggcgtg tgcatctgca tgaaccgtat tagtgctgac acaatatttg tcactgctcc 1021 acacaaacca acctctggaa ttattggtgt caacaaaaag ggacaggtac tgtcagtttg 1081 tgttgaggaa gataacattg tgaattatgc aaccaacgtg cttcagaatc cagaccttgg 1141 tctgcgtttg gccgttcgta gtaacctggc tggcgcagag aagttgtttg tgagaaaatt 1201 caataccctc tttgcacagg gcagctatgc tgaagccgcc aaagttgcag cgtctgcacc 1261 aaagggaatc ctgcgtacca gagagacggt ccagaaattc cagagtatac ccgctcagtc 1321 tggccaggct tctccattgc tgcagtactt cggaatcctg ctcgaccagg gtcagctcaa 1381 taaacttgaa tccttagaac tttgccatct ggttcttcag caggggcgta agcaactcct 1441 agagaagtgg ctgaaagaag ataagctgga gtgctcagag gagctcggag acttggtcaa 1501 aaccactgac cccatgctcg ctctgagtgt gtaccttcgg gcaaatgtgc caagcaaagt 1561 gatccagtgt tttgcagaaa caggccaatt ccagaaaatt gtgctctatg ccaaaaaggt 1621 tgggtacacc ccagactgga tctttctgct gaggggtgta atgaagatca gtccggaaca 1681 gggccagcag ttttctcgaa tgctagtgca ggacgaggag ccgctggcca acattagcca 1741 gattgtggac attttcatgg aaaacagttt aattcagcag tgtacttcct tcttattgga 1801 tgccttgaag aataatcgcc cagctgaggg actcctgcag acatggctgt tggagatgaa 1861 ccttgttcat gcaccccagg ttgcagatgc catccttgga aataaaatgt ttactcatta 1921 cgaccgggcc cacattgccc agctctgtga gaaggcaggc ctcctgcagc aagcactgga 1981 gcactacacc gacctctatg acatcaagag ggctgtggtc cacactcacc tcctcaatcc 2041 cgagtggctt gtcaatttct ttggctcctt atcggtggag gattctgtgg agtgtctgca 2101 tgccatgctg tctgctaaca tcagacagaa ccttcagctg tgtgtgcagg tggcctctaa 2161 gtaccacgag cagctgggca cgcaggccct ggtggagctc tttgaatcct tcaagagtta 2221 caaaggcctc ttctacttcc tgggctcaat cgtgaacttc agccaagacc cagatgtgca 2281 tctgaaatac attcaggctg cctgtaagac agggcagatc aaggaggtgg agaggatatg 2341 ccgagagagc agctgctaca acccagagcg tgtgaagaac ttcctgaagg aggccaagct 2401 cacagaccag cttcccctca tcatcgtgtg tgatcgtttt ggctttgtcc atgaccttgt 2461 cctatattta taccgcaaca acctgcagag gtacattgag atctacgtgc agaaggtcaa 2521 ccctagccgg accccagctg tgattggagg gctgcttgat gtggattgtt ctgaggaagt 2581 gattaaacac ttaatcatgg cagtgagagg acagttctct actgatgagt tggtggctga 2641 agtagaaaaa agaaataggc tcaagctgct gcttccctgg ctggagtccc agattcagga 2701 aggctgtgag gagcctgcca ctcacaatgc actggctaaa atctacatcg acagcaacaa 2761 cagccccgag tgcttcctga gagagaatgc ctactatgac agcagcgtgg tgggccgcta 2821 ctgtgagaag cgagaccccc atctggcctg tgttgcctat gagcgggggc agtgtgacct 2881 tgagctcatc aaggtgtgca atgagaattc tctgttcaaa agcgaggccc gctacctggt 2941 atgcagaaag gatccggagc tctgggctca cgtccttgag gagaccaacc catccaggag 3001 acagctaatt gaccaggtgg tacagacagc attgtcagaa acacgggatc ctgaagagat 3061 ttcggtcact gtcaaagcct ttatgacagc cgacctgcct aatgaactga ttgaactgct 3121 ggagaagata gttctggata actctgtctt cagcgagcac aggaatctac agaatctgtt 3181 gatcctgact gccatcaagg cagaccgcac acgggtcatg gagtacatca gccgcctgga 3241 caactatgac gcactggaca tcgcgagcat cgctgtcagc agcgcgctgt atgaggaggc 3301 cttcaccgtt ttccacaagt ttgatatgaa tgcctcagca atccaggtcc tgatcgagca 3361 cattggaaac ctggaccggg catatgagtt tgcggagaga tgcaatgagc ctgctgtgtg 3421 gagtcagctg gcccaagccc agctccagaa agatttggtg aaggaagcta tcaactccta 3481 tatcagaggg gacgaccctt cctcttacct ggaagttgtt cagtcagcca gcaggagcaa 3541 caactgggag gatctagtta aatttctgca gatggccagg aaaaagggcc gtgagtccta 3601 tatagagact gaacttattt ttgccttggc taaaaccagc cgtgtttctg agctagaaga 3661 ttttattaat ggacccaaca atgcccacat ccagcaggtt ggagaccgct gttacgagga 3721 gggaatgtac gaggctgcca agctgctcta tagcaatgtt tctaactttg cccgcctggc 3781 ttccaccttg gttcacctcg gtgagtatca ggcagcagtg gacaacagcc gcaaggccag 3841 cagcacccgg acgtggaagg aggtgtgctt tgcctgcatg gatggacaag agttccgctt 3901 cgcacagctg tgtggtcttc acatcgtcat tcatgcagat gagctggagg agctgatgtg 3961 ctattaccag gatcgtggct actttgagga gctgatcttg ctgttggaag cggccctggg 4021 cctggagcgg gcccacatgg gcatgttcac tgagctggcc atcctctact ccaaattcaa 4081 gccacagaag atgctggagc atctggagct tttctggtcc cgtgtcaaca tcccaaaggt 4141 gctgagggct gcagagcagg cacacctgtg ggctgagctg gtgttcctct atgacaagta 4201 cgaggagtat gacaatgctg tgctcaccat gatgagccac cccactgagg cctggaagga 4261 gggtcagttc aaggacatca ttaccaaggt tgccaacgtc gagctctgtt acagagccct 4321 gcagttctat ttggattaca aaccactgct catcaatgac ctgctgctgg tgctttcacc 4381 ccggctggac cacacctgga cagtcagttt cttttcaaag gcaggtcagc tgcccctggt 4441 gaagccttac ctgcggtcag tccagagcca caacaacaag agtgtgaatg aggcactcaa 4501 ccacctgctg acagaggagg aggactatca gggcttaagg gcatctatcg atgcctatga 4561 caactttgac aacatcagcc tggctcagca gctggagaag catcagctga tggagttcag 4621 gtgcattgcg gcctatctgt acaagggcaa taactggtgg gcccagagcg tggagctctg 4681 caagaaggat catctctaca aggatgccat gcagcatgct gcagagtcgc gggatgctga 4741 gctggcccag aagttgctgc agtggttcct ggaggaaggc aagagggagt gcttcgcagc 4801 ttgtctcttc acctgctatg acctgcttcg cccagacatg gtgcttgagc tggcctggag 4861 gcacaacctc gtggacttgg ccatgcccta cttcatccag gtgatgaggg agtacctgag 4921 caaggtggac aaactggatg ccttggagag tctgcgcaag caagaggagc atgtgacaga 4981 gcctgcccct ctcgtgtttg attttgatgg gcatgaatga gacccagctg attgcactaa 5041 gccctgccgt gggcccagcc cctgccagct tcccctatgg atatgcctct gctcccaact 5101 tcgccagcct ccagtgtaca acttccgcgt gtagtgggcg ttgtcaccac ccaccctacc 5161 tgcagagtta ctaacttctc caaggagcat gtcactccag cagcacaggg gacgcaatgg 5221 gaggcaggga cacctggaca atatttattt ttgctgaaac ccaatgacgg caacctctga 5281 gccatcccag agcctgggga ggccagggta gaggctgacg gcgcaagacc agctttagcc 5341 gacaacagag actggactgt gggccctcct gctggagcca ggccttcctc ctgggcgcct 5401 ccgactggct ggagctgccc cctccaggcc agtttgaaga ctacatgaac acgtcttgtt 5461 tggaggtacc ggacctcata aaaggactct cagcctcttg gcaatcataa atattaaagt 5521 cggtttatcc aggcaaaaaa aaaaaaaaaa aaaaaaaaaa aaaa // LOCUS HSU41766 3865 bp mRNA PRI 22-MAR-1996 DEFINITION Human metalloprotease/disintegrin/cysteine-rich protein precursor (MDC9) mRNA, complete cds. ACCESSION U41766 NID g1235671 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3865) AUTHORS Weskamp,G., Kratzschmar,J., Reid,M.S. and Blobel,C.P. TITLE MDC9, a widely expressed cellular disintegrin containing cytoplasmic SH3 ligand domains JOURNAL J. Cell Biol. 132 (4), 717-726 (1996) MEDLINE 96178079 REFERENCE 2 (bases 1 to 3865) AUTHORS Blobel,C.P. TITLE Direct Submission JOURNAL Submitted (04-DEC-1995) Carl P. Blobel, Cellular Biochemistry and Biophysics Program, Memorial Sloan-Kettering Cancer Center, 1275 York Avenue, New York, NY 10021, USA FEATURES Location/Qualifiers source 1..3865 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="MDA-MB-468 mammary epithelial carcinoma" gene 79..2538 /gene="MDC9" sig_peptide 79..166 /gene="MDC9" /product="metalloprotease/disintegrin/cysteine-rich protein precursor" CDS 79..2538 /gene="MDC9" /note="Method: conceptual translation supplied by author" /codon_start=1 /product="metalloprotease/disintegrin/cysteine-rich protein precursor" /db_xref="PID:g1235672" /translation="MGSGARFPSGTLRVRWLLLLGLVGPVLGAARPGFQQTSHLSSYE IITPWRLTRERREAPRPYSKQVSYVIQAEGKEHIIHLERNKDLLPEDFVVYTYNKEGT LITDHPNIQNHCHYRGYVEGVHNSSIALSDCFGLRGLLHLENASYGIEPLQNSSHFEH IIYRMDDVYKEPLKCGVSNKDIEKETAKDEEEEPPSMTQLLRRRRAVLPQTRYVELFI VVDKERYDMMGRNQTAVREEMILLANYLDSMYIMLNIRIVLVGLEIWTNGNLINIVGG AGDVLGNFVQWREKFLITRRRHDSAQLVLKKGFGGTAGMAFVGTVCSRSHAGGINVFG QITVETFASIVAHELGHNLGMNHDDGRDCSCGAKSCIMNSGASGSRNFSSCSAEDFEK LTLNKGGNCLLNIPKPDEAYSAPSCGNKLVDAGEECDCGTPKECELDPCCEGSTCKLK SFAECAYGDCCKDCRFLPGGTLCRGKTSECDVPEYCNGSSQFCQPDVFIQNGYPCQNN KAYCYNGMCQYYDAQCQVIFGSKAKAAPKDCFIEVNSKGDRFGNCGFSGNEYKKCATG NALCGKLQCENVQEIPVFGIVPAIIQTPSRGTKCWGVDFQLGSDVPDPGMVNEGTKCG AGKICRNFQCVDASVLNYDCDVQKKCHGHGVCNSNKNCHCENGWAPPNCETKGYGGSV DSGPTYNEMNTALRDGLLVFFFLIVPLIVCAIFIFIKRDQLWRSYFRKKRSQTYESDG KNQANPSRQPGSVPRHVSPVTPPREVPIYANRFAVPTYAAKQPQQFPSRPPPPQPKVS SQGNLIPARPAPAPPLYSSLT" mat_peptide 167..2535 /gene="MDC9" /product="metalloprotease/disintegrin/cysteine-rich protein" misc_feature 167..694 /gene="MDC9" /note="encodes the pro-domain" misc_feature 695..1315 /gene="MDC9" /note="encodes the metalloproteinase domain" misc_feature 1316..1588 /gene="MDC9" /note="encodes the disintegrin domain" misc_feature 1589..2008 /gene="MDC9" /note="encodes the cysteine-rich domain" misc_feature 2009..2098 /gene="MDC9" /note="encodes the EGF-like domain" misc_feature 2174..2233 /gene="MDC9" /note="encodes the transmembrane domain" misc_feature 2234..2535 /gene="MDC9" /note="encodes the cytoplasmic domain" BASE COUNT 1201 a 670 c 842 g 1152 t ORIGIN 1 cggcagggtt ggaaaatgat ggaagaggcg gaggtggagg cgaccgagtg ctgagaggaa 61 cctgcggaat cggccgagat ggggtctggc gcgcgctttc cctcggggac ccttcgtgtc 121 cggtggttgc tgttgcttgg cctggtgggc ccagtcctcg gtgcggcgcg gccaggcttt 181 caacagacct cacatctttc ttcttatgaa attataactc cttggagatt aactagagaa 241 agaagagaag cccctaggcc ctattcaaaa caagtatctt atgttattca ggctgaagga 301 aaagagcata ttattcactt ggaaaggaac aaagaccttt tgcctgaaga ttttgtggtt 361 tatacttaca acaaggaagg gactttaatc actgaccatc ccaatataca gaatcattgt 421 cattatcggg gctatgtgga gggagttcat aattcatcca ttgctcttag cgactgtttt 481 ggactcagag gattgctgca tttagagaat gcgagttatg ggattgaacc cctgcagaac 541 agctctcatt ttgagcacat catttatcga atggatgatg tctacaaaga gcctctgaaa 601 tgtggagttt ccaacaagga tatagagaaa gaaactgcaa aggatgaaga ggaagagcct 661 cccagcatga ctcagctact tcgaagaaga agagctgtct tgccacagac ccggtatgtg 721 gagctgttca ttgtcgtaga caaggaaagg tatgacatga tgggaagaaa tcagactgct 781 gtgagagaag agatgattct cctggcaaac tacttggata gtatgtatat tatgttaaat 841 attcgaattg tgctagttgg actggagatt tggaccaatg gaaacctgat caacatagtt 901 gggggtgctg gtgatgtgct ggggaacttc gtgcagtggc gggaaaagtt tcttatcaca 961 cgtcggagac atgacagtgc acagctagtt ctaaagaaag gttttggtgg aactgcagga 1021 atggcatttg tgggaacagt gtgttcaagg agccacgcag gcgggattaa tgtgtttgga 1081 caaatcactg tggagacatt tgcttccatt gttgctcatg aattgggtca taatcttgga 1141 atgaatcacg atgatgggag agattgttcc tgtggagcaa agagctgcat catgaattca 1201 ggagcatcgg gttccagaaa ctttagcagt tgcagtgcag aggactttga gaagttaact 1261 ttaaataaag gaggaaactg ccttcttaat attccaaagc ctgatgaagc ctatagtgct 1321 ccctcctgtg gtaataagtt ggtggacgct ggggaagagt gtgactgtgg tactccaaag 1381 gaatgtgaat tggacccttg ctgcgaagga agtacctgta agcttaaatc atttgctgag 1441 tgtgcatatg gtgactgttg taaagactgt cggttccttc caggaggtac tttatgccga 1501 ggaaaaacca gtgagtgtga tgttccagag tactgcaatg gttcttctca gttctgtcag 1561 ccagatgttt ttattcagaa tggatatcct tgccagaata acaaagccta ttgctacaac 1621 ggcatgtgcc agtattatga tgctcaatgt caagtcatct ttggctcaaa agccaaggct 1681 gcccccaaag attgtttcat tgaagtgaat tctaaaggtg acagatttgg caattgtggt 1741 ttctctggca atgaatacaa gaagtgtgcc actgggaatg ctttgtgtgg aaagcttcag 1801 tgtgagaatg tacaagagat acctgtattt ggaattgtgc ctgctattat tcaaacgcct 1861 agtcgaggca ccaaatgttg gggtgtggat ttccagctag gatcagatgt tccagatcct 1921 gggatggtta acgaaggcac aaaatgtggt gctggaaaga tctgtagaaa cttccagtgt 1981 gtagatgctt ctgttctgaa ttatgactgt gatgttcaga aaaagtgtca tggacatggg 2041 gtatgtaata gcaataagaa ttgtcactgt gaaaatggct gggctccccc aaattgtgag 2101 actaaaggat acggaggaag tgtggacagt ggacctacat acaatgaaat gaatactgca 2161 ttgagggacg gacttctggt cttcttcttc ctaattgttc cccttattgt ctgtgctatt 2221 tttatcttca tcaagaggga tcaactgtgg agaagctact tcagaaagaa gagatcacaa 2281 acatatgagt cagatggcaa aaatcaagca aacccttcta gacagccggg gagtgttcct 2341 cgacatgttt ctccagtgac acctcccaga gaagttccta tatatgcaaa cagatttgca 2401 gtaccaacct atgcagccaa gcaacctcag cagttcccat caaggccacc tccaccacaa 2461 ccgaaagtat catctcaggg aaacttaatt cctgcccgtc ctgctcctgc acctccttta 2521 tatagttccc tcacttgatt tttttaacct tctttttgca aatgtcttca gggaactgag 2581 ctaatacttt ttttttttct tgatgttttc ttgaaaagcc tttctgttgc aactatgaat 2641 gaaaacaaaa caccacaaaa cagacttcac taacacagaa aaacagaaac tgagtgtgag 2701 agttgtgaaa tacaaggaaa tgcagtaaag ccagggaatt tacaataaca tttccgtttc 2761 catcattgaa taagtcttat tcagtcatcg gtgaggttaa tgcactaatc atggattttt 2821 tgaacatgtt attgcagtga ttctcaaatt aactgtattg gtgtaagatt tttgtcatta 2881 agtgtttaag tgttattctg aattttctac cttagttatc attaatgtag ttcctcattg 2941 aacatgtgat aatctaatac ctgtgaaaac tgactaatca gctgccaata atatctaata 3001 tttttcatca tgcacgaatt aataatcatc atactctaga atcttgtctg tcactcacta 3061 catgaataag caaatattgt cttcaaaaga atgcacaaga accacaatta agatgtcata 3121 ttattttgaa agtacaaaat atactaaaag agtgtgtgtg tattcacgca gttactcgct 3181 tccattttta tgacctttca actataggta ataactctta gagaaattaa tttaatatta 3241 gaatttctat tatgaatcat gtgaaagcat gacattcgtt cacaatagca ctattttaaa 3301 taaattataa gctttaaggt acgaagtatt taatagatct aatcaaatat gttgattcat 3361 ggctataata aagcaggagc aattataaaa tcttcaatca attgaacttt tacaaaacca 3421 cttgagaatt tcatgagcac tttaaaatct gaactttcaa agcttgctat taaatcattt 3481 agaatgttta catttactaa ggtgtgctgg gtcatgtaaa atattagaca ctaatatttt 3541 catagaaatt aggctggaga aagaaggaag aaatggtttt cttaaatacc tacaaaaaag 3601 ttactgtggt atctatgagt tatcatctta gctgtgttaa aaatgaattt ttactatggc 3661 agatatggta tggatcgtaa aattttaagc actaaaaatt ttttcataac ctttcataat 3721 aaagtttaat aataggttta ttaactgaat ttcattagtt ttttaaaagt gtttttggtt 3781 tgtgtatata tacatataca aatacaacat ttacaataaa taaaatactt gaaattctca 3841 aaaaaaaaaa aaaaaaaaaa aaaaa // LOCUS HSU41767 2740 bp mRNA PRI 22-MAR-1996 DEFINITION Human metargidin precursor mRNA, complete cds. ACCESSION U41767 NID g1235673 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2740) AUTHORS Kratzschmar,J., Lum,L. and Blobel,C.P. TITLE Metargidin, a membrane-anchored metalloprotease-disintegrin protein with an RGD integrin binding sequence JOURNAL J. Biol. Chem. 271 (9), 4593-4596 (1996) MEDLINE 96214870 REFERENCE 2 (bases 1 to 2740) AUTHORS Blobel,C.P. TITLE Direct Submission JOURNAL Submitted (04-DEC-1995) Carl P. Blobel, Cellular Biochemistry and Biophysics Program, Memorial Sloan-Kettering Cancer Center, 1275 York Avenue, New York, NY 10021, USA FEATURES Location/Qualifiers source 1..2740 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="MDA-MB-468 mammary epithelial carcinoma cells" sig_peptide 8..92 /product="metargidin precursor" CDS 8..2452 /note="Method: conceptual translation supplied by author" /codon_start=1 /product="metargidin precursor" /db_xref="PID:g1235674" /translation="MRLALLWALGLLGAGSPLPSWPLPNIGGTEEQQAESEKAPREPL EPQVLQDDLPISLKKVLQTSLPEPLRIKLELDGDSHILELLQNRELVPGRPTLVWYQP DGTRVVSEGHTLENCCYQGRVRGYAGSWVSICTCSGLRGLVVLTPERSYTLEQGPGDL QGPPIISRIQDLHLPGHTCALSWRESVHTQTPPEHPLGQRHIRRRRDVVTETKTVELV IVADHSEAQKYRDFQHLLNRTLEVALLLDTFFRPLNVRVALVGLEAWTQRDLVEISPN PAVTLENFLHWRRAHLLPRLPHDSAQLVTGTSFSGPTVGMAIQNSICSPDFSGGVNMD HSTSILGVASSIAHELGHSLGLDHDLPGNSCPCPGPAPAKTCIMEASTDFLPGLNFSN CSRRALEKALLDGMGSCLFERLPSLPPMAAFCGNMFVEPGEQCDCGFLDDCVDPCCDS LTCQLRPGAQCASDGPCCQNCQLRPSGWQCRPTRGDCDLPEFCPGDSSQCPPDVSLGD GEPCAGGQAVCMHGRCASYAQQCQSLWGPGAQPAAPLCLQTANTRGNAFGSCGRNPSG SYVSCTPRDAICGQLQCQTGRTQPLLGSIRDLLWETIDVNGTELNCSWVHLDLGSDVA QPLLTLPGTACGPGLVCIDHRCQRVDLLGAQECRSKCHGHGVCDSNRHCYCEEGWAPP DCTTQLKATSSLTTGLLLSLLVLLVLVMLGAGYWYRARLHQRLCQLKGPTCQYRAAQS GPSERPGPPQRALLARGTKSQGPAKPPPPRKPLPADPQGRCPSGDLPGPGAGIPPLVV PSRPAPPPPTVSSLYL" mat_peptide 93..2449 /product="metargidin" misc_feature 93..626 /note="encodes the pro_domain" misc_feature 627..1265 /note="encodes the metalloprotease domain" misc_feature 1266..1538 /note="encodes the disintegrin domain" misc_feature 1539..1976 /note="encodes the cysteine-rich domain" misc_feature 1977..2060 /note="encodes the EGF-like domain" misc_feature 2088..2141 /note="encodes the transmembrane domain" misc_feature 2142..2449 /note="encodes the cytoplasmic domain" BASE COUNT 522 a 859 c 824 g 535 t ORIGIN 1 cgctgccatg cggctggcgc tgctctgggc cctggggctc ctgggcgcgg gcagccctct 61 gccttcctgg ccgctcccaa atataggtgg cactgaggag cagcaggcag agtcagagaa 121 ggccccgagg gagcccttgg agccccaggt ccttcaggac gatctcccaa ttagcctcaa 181 aaaggtgctt cagaccagtc tgcctgagcc cctgaggatc aagttggagc tggacggtga 241 cagtcatatc ctggagctgc tacagaatag ggagttggtc ccaggccgcc caaccctggt 301 gtggtaccag cccgatggca ctcgggtggt cagtgaggga cacactttgg agaactgctg 361 ctaccaggga agagtgcggg gatatgcagg ctcctgggtg tccatctgca cctgctctgg 421 gctcagaggc ttggtggtcc tgaccccaga gagaagctat accctggagc aggggcctgg 481 ggaccttcag ggtcctccca ttatttcgcg aatccaagat ctccacctgc caggccacac 541 ctgtgccctg agctggcggg aatctgtaca cactcagacg ccaccagagc accccctggg 601 acagcgccac attcgccgga ggcgggatgt ggtaacagag accaagactg tggagttggt 661 gattgtggct gatcactcgg aggcccagaa ataccgggac ttccagcacc tgctaaaccg 721 cacactggaa gtggccctct tgctggacac attcttccgg cccctgaatg tacgagtggc 781 actagtgggc ctggaggcct ggacccagcg tgacctggtg gagatcagcc caaacccagc 841 tgtcaccctc gaaaacttcc tccactggcg cagggcacat ttgctgcctc gattgcccca 901 tgacagtgcc cagctggtga ctggtacttc attctctggg cctacggtgg gcatggccat 961 tcagaactcc atctgttctc ctgacttctc aggaggtgtg aacatggacc actccaccag 1021 catcctggga gtcgcctcct ccatagccca tgagttgggc cacagcctgg gcctggacca 1081 tgatttgcct gggaatagct gcccctgtcc aggtccagcc ccagccaaga cctgcatcat 1141 ggaggcctcc acagacttcc taccaggcct gaacttcagc aactgcagcc gacgggccct 1201 ggagaaagcc ctcctggatg gaatgggcag ctgcctcttc gaacggctgc ctagcctacc 1261 ccctatggct gctttctgcg gaaatatgtt tgtggagccg ggcgagcagt gtgactgtgg 1321 cttcctggat gactgcgtcg atccctgctg tgattctttg acctgccagc tgaggccagg 1381 tgcacagtgt gcatctgacg gaccctgttg tcaaaattgc cagctgcgcc cgtctggctg 1441 gcagtgtcgt cctaccagag gggattgtga cttgcctgaa ttctgcccag gagacagctc 1501 ccagtgtccc cctgatgtca gcctagggga tggcgagccc tgcgctggcg ggcaagctgt 1561 gtgcatgcac gggcgttgtg cctcctatgc ccagcagtgc cagtcacttt ggggacctgg 1621 agcccagccc gctgcgccac tttgcctcca gacagctaat actcggggaa atgcttttgg 1681 gagctgtggg cgcaacccca gtggcagtta tgtgtcctgc acccctagag atgccatttg 1741 tgggcagctc cagtgccaga caggtaggac ccagcctctg ctgggctcca tccgggatct 1801 actctgggag acaatagatg tgaatgggac tgagctgaac tgcagctggg tgcacctgga 1861 cctgggcagt gatgtggccc agcccctcct gactctgcct ggcacagcct gtggccctgg 1921 cctggtgtgt atagaccatc gatgccagcg tgtggatctc ctgggggcac aggaatgtcg 1981 aagcaaatgc catggacatg gggtctgtga cagcaacagg cactgctact gtgaggaggg 2041 ctgggcaccc cctgactgca ccactcagct caaagcaacc agctccctga ccacagggct 2101 gctcctcagc ctcctggtct tattggtcct ggtgatgctt ggtgccggct actggtaccg 2161 tgcccgcctg caccagcgac tctgccagct caagggaccc acctgccagt acagggcagc 2221 ccaatctggt ccctctgaac ggccaggacc tccgcagagg gccctgctgg cacgaggcac 2281 taagtctcag gggccagcca agcccccacc cccaaggaag ccactgcctg ccgaccccca 2341 gggccggtgc ccatcgggtg acctgcccgg cccaggggct ggaatcccgc ccctagtggt 2401 accctccaga ccagcgccac cgcctccgac agtgtcctcg ctctacctct gacctctccg 2461 gaggttccgc tgcctccaag ccggacttag ggcttcaaga ggcgggcgtg ccctctggag 2521 tcccctacca tgactgaagg cgccagagac tggcggtgtc ttaagactcc gggcaccgcc 2581 acgcgctgtc aagcaacact ctgcggacct gccggcgtag ttgcagcggg ggcttgggga 2641 ggggctgggg gttggacggg attgaggaag gtccgcacag cctgtctctg ctcagttgca 2701 ataaacgtga catcttggga gcgttaaaaa aaaaaaaaaa // LOCUS HSU41804 1303 bp mRNA PRI 02-APR-1996 DEFINITION Human putative T1/ST2 receptor binding protein precursor mRNA, complete cds. ACCESSION U41804 NID g1223889 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1303) AUTHORS Gayle,M.A., Slack,J.L., Bonnert,T.P., Renshaw,B.R., Sonoda,G., Taguchi,T., Testa,J.R., Dower,S.K. and Sims,J.E. TITLE Cloning of a putative ligand for the T1/ST2 receptor JOURNAL J. Biol. Chem. 271 (10), 5784-5789 (1996) MEDLINE 96215043 REFERENCE 2 (bases 1 to 1303) AUTHORS Sims,J.E. TITLE Direct Submission JOURNAL Submitted (04-DEC-1995) John E. Sims, Molecular Genetics, Immunex Corporation, 51 University Street, Seattle, WA 98101, USA FEATURES Location/Qualifiers source 1..1303 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="19" /map="19p13.2" CDS 88..771 /note="putative ligand" /codon_start=1 /product="putative T1/ST2 receptor binding protein precursor" /db_xref="PID:g1223890" /translation="MMAAGAALALALWLLMPPVEVGGAGPPPIQDGEFTFLLPAGRKQ CFYQSAPANASLETEYQVIGGAGLDVDFTLESPQGVLLVSESRKADGVHTVEPTEAGD YKLCFDNSFSTISEKLVFFELIFDSLQDDEEVEGWAEAVEPEEMLDVKMEDIKESIET MRTRLERSIQMLTLLRAFEARDRNLQEGNLERVNFWSAVNVAVLLLVAVLQVCTLKRF FQDKRPVPT" sig_peptide 88..159 /product="T1/ST2 receptor binding protein" mat_peptide 160..768 /product="T1/ST2 receptor binding protein" misc_feature 664..732 /note="encodes transmembrane domain" BASE COUNT 260 a 365 c 440 g 238 t ORIGIN 1 ctgccaatga gctccgccga gtagcaccgg ggcagggcta gcgcttaaag gagccgcgac 61 ccctttgcag accagagggt gacccggatg atggcggccg gcgcggccct agccctggcc 121 ttgtggctac taatgccacc agtggaggtg ggaggggcgg ggcccccgcc aatccaggac 181 ggtgagttca cgttcctgtt gccggcgggg aggaagcagt gtttctacca gtccgcgccg 241 gccaacgcaa gcctcgagac cgaataccag gtgatcggag gtgctggact ggacgtggac 301 ttcacgctgg agagccctca gggcgtgctg ttggtcagcg agtcccgcaa ggctgatggg 361 gtacacacgg tggagccaac ggaggccggg gactacaagc tgtgctttga caactccttc 421 agcaccatct ccgagaagct ggtgttcttt gaactgatct ttgacagcct ccaggatgac 481 gaggaggtcg aaggatgggc agaggctgtg gagcccgagg agatgctgga tgttaaaatg 541 gaggacatca aggagtccat tgagaccatg cggacccggc tggagcgcag catccagatg 601 ctcacgctac tgcgggcctt cgaggcacgt gaccgcaacc tgcaagaggg caacttggag 661 cgggtcaact tctggtcagc tgtcaacgtg gcggtgctgc tgctggtggc tgtgctgcag 721 gtctgcacgc tcaagcgctt cttccaggac aagcgcccgg tgcccacgta gcccctgcca 781 tggaaggaag aacgggacaa aggaggggca gcagggtgtg tgcatatgag acttgggggt 841 ccctccccaa ttttagtttc ctgccaaaac gggagtgtgc agtcagggcc tgcggtctgg 901 ccccatgagt ctccttccgt cctcagcggg cagggaacac ctctggcttg tagaagggac 961 ggctcagtgg ctgcaccgac ggtcctggaa atctcacatg gtgggcactg cagcgttgga 1021 acgtgagcct cggatttcct ggcccctcta ctgtaaatgt gccttagcct aagcctccca 1081 tcctgtgtta gcgttgcctg gtgcggggca gggcctaaca aggaaacctg ggccctccaa 1141 gccaggttga ggtctggtaa cagaatgcca ggaagggggc ctggaagacc acctgccccg 1201 gcccctctcc tgcaggggcc ccacacaggc atgagggatg gcccggccaa agtctaggca 1261 gaagcctcct ataacaaagg gtggtgtggc ctgggcattg gag // LOCUS HSU41806 1995 bp mRNA PRI 02-APR-1996 DEFINITION Human EBI3-associated protein p60 mRNA, complete cds. ACCESSION U41806 NID g1145798 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1995) AUTHORS Devergne,O., Hummel,M., Koeppen,H., Le Beau,M.M., Nathanson,E.C., Kieff,E. and Birkenbach,M. TITLE A novel interleukin-12 p40-related protein induced by latent Epstein-Barr virus infection in B lymphocytes JOURNAL J. Virol. 70 (2), 1143-1153 (1996) MEDLINE 96135230 REFERENCE 2 (bases 1 to 1995) AUTHORS Birkenbach,M. TITLE Direct Submission JOURNAL Submitted (01-DEC-1995) Mark Birkenbach, Pathology, University of Chicago, 910 E. 58th Street, Chicago, IL 60637, USA FEATURES Location/Qualifiers source 1..1995 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="BL41/B95-8" CDS 21..1343 /note="EBI3-associated protein" /codon_start=1 /product="p60" /db_xref="PID:g1145799" /translation="MASLTVKAYLLGKEDAAREIRRFSFCCSPEPEAEAEAAAGPGPC ERLLSRVAALFPALRPGGFQAHYRDEDGDLVAFSSDEELTMAMSYVKDDIFRIYIKEK KECRRDHRPPCAQEAPRNMVHPNVICDGCNGPVVGTRYKCSVCPDYDLCSVCEGKGLH RGHTKLAFPSPFGHLSEGFSHSRWLRKVKHGHFGWPGWEMGPPGNWSPRPPRAGEARP GPTAESASGPSEDPSVNFLKNVGESVAAALSPLGIEVDIDVEHGGKRSRLTPVSPESS STEEKSSSQPSSCCSDPSKPGGNVEGATQSLAEQMRKIALESEGAPEEQMESDNCSGG DDDWTHLSSKEVDPSTGELQSLQMPESEGPSSLDPSQEGPTGLKEAALYPHLPPEADP RLIESLSQMLSMGFSDEGGWLTRLLQTKNYDIGAALDTIQYSKHPPPL" BASE COUNT 409 a 570 c 588 g 428 t ORIGIN 1 gaattccctc gccgctcgct atggcgtcgc tcaccgtgaa ggcctacctt ctgggcaagg 61 aggacgcggc gcgcgagatt cgccgcttca gcttctgctg cagccccgag cctgaggcgg 121 aagccgaggc tgcggcgggt ccgggaccct gcgagcggct gctgagccgg gtggccgccc 181 tgttccccgc gctgcggcct ggcggcttcc aggcgcacta ccgcgatgag gacggggact 241 tggttgcctt ttccagtgac gaggaattga caatggccat gtcctacgtg aaggatgaca 301 tcttccgaat ctacattaaa gagaaaaaag agtgccggcg ggaccaccgc ccaccgtgtg 361 ctcaggaggc gccccgcaac atggtgcacc ccaatgtgat ctgcgatggc tgcaatgggc 421 ctgtggtagg aacccgctac aagtgcagcg tctgcccaga ctacgacttg tgtagcgtct 481 gcgagggaaa gggcttgcac cgggggcaca ccaagctcgc attccccagc cccttcgggc 541 acctgtctga gggcttctcg cacagccgct ggctccggaa ggtgaaacac ggacacttcg 601 ggtggccagg atgggaaatg ggtccaccag gaaactggag cccacgtcct cctcgtgcag 661 gggaggcccg ccctggcccc acggcagaat cagcttctgg tccatcggag gatccgagtg 721 tgaatttcct gaagaacgtt ggggagagtg tggcagctgc ccttagccct ctgggcattg 781 aagttgatat cgatgtggag cacggaggga aaagaagccg cctgaccccc gtctctccag 841 agagttccag cacagaggag aagagcagct cacagccaag cagctgctgc tctgacccca 901 gcaagccggg tgggaatgtt gagggcgcca cgcagtctct ggcggagcag atgaggaaga 961 tcgccttgga gtccgagggg gcccctgagg aacagatgga gtcggataac tgttcaggag 1021 gagatgatga ctggacccat ctgtcttcaa aagaagtgga cccgtctaca ggtgaactcc 1081 agtccctaca gatgccagaa tccgaagggc caagctctct ggacccctcc caggagggac 1141 ccacagggct gaaggaagct gccttgtacc cacatctccc gccagaggct gacccgcggc 1201 tgattgagtc cctctcccag atgctgtcca tgggcttctc tgatgaaggc ggctggctca 1261 ccaggctcct gcagaccaag aactatgaca tcggagcggc tctggacacc atccagtatt 1321 caaagcatcc cccgccgttg tgaccacttt tgcccacctc ttctgcgtgc ccctcttctg 1381 tctcatagtt gtgttaagct tgcgtagaat tggcaggtct ctgtacgggc cagtttctct 1441 gccttcttcc aggatcaggg gttagggtgc aagaagccat ttagggcagc aaaacaagtg 1501 acatgaaggg agggtccctg tgtgtgtgtg tgctgatgtt tcctgggtgc cctggctcct 1561 tgcagcaggg ctgggcctgc gagacccaag gctcactgca gcgcgctcct gacccctccc 1621 tgcaggggct acgttagcag cccagcacat agcttgccta atggctttca ctttctcttt 1681 tgttttaaat gactcatagg tccctgacat ttagttgatt attttctgct acagacctgg 1741 tacactctga ttttagataa agtaagccta ggtgttgtca gcaggcaggc tggggaggcc 1801 agtgttgtgg gcctcctgct gggactgaga aggcccacga aggcgtccgc aatgttggtt 1861 tcactgagag ctgcctcctg gtctcttcac cactgtagtt ctctcatttc caaaccatca 1921 gctgctttta aaataagatc tctttgtagc catcctgtta aatttgtaaa caatctaatt 1981 aaatggcatg cgcag // LOCUS HSU41815 3630 bp mRNA PRI 10-FEB-1996 DEFINITION Human nucleoporin 98 (NUP98) mRNA, complete cds. ACCESSION U41815 NID g1184172 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3630) AUTHORS Borrow,J., Shearman,A.M., Stanton,V.P., Becher,R., Collins,T., Williams,A.J., Dube,I., Katz,F., Kwong,Y.L., Morris,C., Ohyashiki,K., Toyama,K., Rowley,J. and Housman,D.E. TITLE The t(7;11)(p15;p15) translocation in acute myeloid leukaemia fuses the genes for nucleoporin NUP98 and class I homeoprotein HOXA9 JOURNAL Nature Genet. 12 (2), 159-167 (1996) MEDLINE 96154188 REFERENCE 2 (bases 1 to 3630) AUTHORS Borrow,J. TITLE Direct Submission JOURNAL Submitted (04-DEC-1995) Julian Borrow, Center for Cancer Research, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA FEATURES Location/Qualifiers source 1..3630 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11p15.5" /clone_lib="U937" gene 145..2907 /gene="NUP98" CDS 145..2907 /gene="NUP98" /codon_start=1 /product="nucleoporin 98" /db_xref="PID:g1184173" /translation="MFNKSFGTPFGGGTGGFGTTSTFGQNTGFGTTSGGAFGTSAFGS SNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTANTLFGTASTGTSLF SSQNNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGT TIKFNPPTGTDTMVKAGVSTNISTKHQCITAMKEYESKSLEELRLEDYQANRKGPQNQ VGAGTTTGLFGSSPATSSATGLFSSSTTNSGFAYGQNKTAFGTSTTGFGTNPGGLFGQ QNQQTTSLFSKPFGQATTTQNTGFSFGNTSTIGQPSTNTMGLFGVTQASQPGGLFGTA TNTSTGTAFGTGTGLFGQTNTGFGAVGSTLFGNNKLTTFGSSTTSAPSFGTTSGGLFG FGTNTSGNSIFGSKPAPGTLGTGLGAGFGTALGAGQASLFGNNQPKIGGPLGTGAFGA PGFNTTTATLGFGAPQAPVALTDPNASAAQQAVLQQHINSLTYSPFGDSPLFRNPMSD PKKKEERLKPTNPAAQKALTTPTHYKLTPRPATRVRPKALQTTGTAKSHLFDGLDDDE PSLANGAFMPKKSIKKLVLKNLNNSNLFSPVNRDSENLASPSEYPENGERFSFLSKPV DENHQQDGDEDSLVSHFYTNPIAKPIPQTPESAGNKHSNSNSVDDTIVALNMRAALRN GLEGSSEETSFHDESLQDDREEIENNSYHMHPAGIILTKVGYYTIPSMDDLAKITNEK GECIVSDFTIGRKGYGSIYFEGDVNLTNLNLDDIVHIRRKEVVVYLDDNQKPPVGEGL NRKAEVTLDGVWPTDKTSRCLIKSPDRLADINYEGRLEAVSRKQGAQFKEYRPETGSW VFKVSHFSKYGLQDSDEEEEEHPSKTSTKKLKTAPLPPASQTTPLQMALNGKPAPPPQ VEKKGQ" misc_feature 160..1735 /gene="NUP98" /note="encodes FG nucleoporin repeat domain" misc_feature 1552^1553 /gene="NUP98" /note="t(7;11)(p15;p15) acute myeloid leukaemia breakpoint position; fused to HOXA9" misc_feature 2398..2424 /gene="NUP98" /note="RNA-binding motif" BASE COUNT 1030 a 789 c 805 g 1006 t ORIGIN 1 cggtggcagg ggtggtagcg gcggcggcga cggtttcgtg ggggccgcgc gctgctctgt 61 gagcggcggg tggcagcagg ggactcctga cacttcccct tccccaccga accgcgcttt 121 ctgaaacaaa gactcatttt gaagatgttt aacaaatcat ttggaacacc ctttgggggt 181 ggcacaggtg gctttggcac aacttcaaca tttggacaga atactggctt tggcactact 241 agtggagggg catttggaac atctgcattt ggttctagca acaatactgg aggcctcttt 301 ggaaattcac agactaaacc aggaggattg tttggaacca gttcatttag ccagccagct 361 acctccacaa gcactggctt tgggtttggt acgtcaacag gaacagcaaa taccttgttt 421 ggaactgcaa gcacagggac cagtctcttc tcatcccaaa acaatgcctt tgcacaaaat 481 aaaccaactg gctttggcaa ttttggaacc agtactagca gtggaggact ctttggaacc 541 acaaatacca cctctaatcc ttttggcagc acatctggct ccctctttgg gccaagtagt 601 tttacagctg ctcctactgg gactactatt aaatttaacc ctccaactgg tacagatact 661 atggtcaaag ctggagttag cactaacata agtaccaagc accagtgtat tactgctatg 721 aaagaatatg aaagcaagtc actagaggaa cttcgtttag aggattatca ggctaacagg 781 aagggcccac agaaccaggt gggagcaggt accacaactg gcttgtttgg gtcttctcca 841 gccacttcca gcgcaacagg actcttcagc tcctccacca ctaattcagg ctttgcatat 901 ggtcagaaca aaactgcctt tggaactagt acaactggat ttggaacaaa tccaggtggt 961 ctctttggcc aacagaatca gcagactacc agcctcttca gcaaaccatt tggccaggct 1021 acaaccaccc agaacactgg cttttccttt ggtaatacca gcaccatagg acagccaagc 1081 accaacacca tgggattatt tggagtaacc caagcctcac agcctggagg tctttttggg 1141 acagctacaa acaccagcac tgggacagca tttggaacag gaacaggtct ctttgggcag 1201 accaatactg gatttggtgc tgttggttcg accctgtttg gcaataacaa gcttactaca 1261 tttggaagca gcacaaccag tgcaccttca tttggtacaa ccagtggcgg gctctttggt 1321 tttggcacaa ataccagtgg gaatagtatt tttggaagta aaccagcacc tgggactctt 1381 ggaactgggc ttggtgcagg atttggaaca gctcttggtg ctggacaggc atctttgttt 1441 gggaacaacc aacctaagat tggagggcct cttggtacag gagcctttgg ggcccctgga 1501 tttaatacta cgacagccac tttgggcttt ggagcccccc aggccccagt agctttgaca 1561 gatccaaatg cttctgctgc ccagcaggct gttctccagc agcacatcaa tagtctaaca 1621 tactcacctt ttggagactc tcctctcttc cggaatccga tgtcagaccc taagaagaag 1681 gaagagagat tgaaaccaac aaatccagca gcccagaagg ctcttactac acctactcat 1741 tataaactga caccccgccc tgccactaga gtccggccaa aggctttaca aacaacaggc 1801 acagccaagt cacatctctt tgatgggctg gatgacgatg aaccatccct agccaatgga 1861 gcattcatgc ccaagaagag cattaagaag ttggttttga agaaccttaa taatagcaat 1921 ctcttttctc ctgttaatcg tgattcagaa aatctagctt caccatctga atatccagaa 1981 aatggagaga gatttagttt cctaagcaaa cctgttgatg agaatcacca gcaggatgga 2041 gatgaagatt cccttgtttc acatttttat actaacccta ttgccaaacc tattcctcaa 2101 accccagaaa gtgctggaaa taaacacagc aacagcaaca gtgtggatga taccattgtt 2161 gcattaaaca tgcgtgctgc tttgcgaaat gggctggaag gaagcagtga agaaacgtct 2221 tttcatgatg agtcacttca ggatgaccga gaagaaatag aaaataattc ttaccatatg 2281 cacccagcag gtattattct cactaaggtt ggttactata ctattccatc tatggatgac 2341 cttgctaaaa ttaccaatga aaaaggagag tgcattgtct ctgatttcac tattggtcgg 2401 aaaggttatg gttcaatcta ttttgaagga gatgtgaatt tgacaaatct aaatttggat 2461 gatattgtgc atatccggag gaaagaagta gttgtctact tagatgataa ccaaaaacca 2521 cctgtgggtg aagggctaaa taggaaggct gaagttacat tggatggagt ttggccaaca 2581 gataaaacat ctcgttgttt aataaagagc ccagatcgcc ttgctgatat caactatgaa 2641 ggaagattgg aagcagtttc aaggaaacag ggagctcaat tcaaagaata ccggcctgaa 2701 actggttctt gggtgtttaa ggtctcccat ttttctaagt atggccttca ggattctgat 2761 gaagaggagg aggagcatcc gtctaaaact agtacaaaga agttgaagac tgctcctttg 2821 cctcctgcaa gccagactac gcccttgcag atggctctta atggcaaacc tgcacctcca 2881 cctcaggtag agaaaaaagg acagtgaatt tgaatggaat ccgtgatacc gaagttgaaa 2941 gcaagtcatt cagctaatac aaagctgttt tatgaccctt ggaactttga agagtacaaa 3001 cattggcaat cacgttgaaa caagtgcaag ggagggcgtg aggtcttgca ggcatctgtc 3061 tttttactgg agagatttaa agaattctct tgctgtttgg attattcctc tacagattgt 3121 cattttttaa accctttgtt ctctctcatt tgacttgctg aattctctgc tcagtgatta 3181 acttaagatt tgctcatgtg ggttcatgca cagtaaattc tgcctttatt gactacctga 3241 tgtgcagttt aatctttttc tttacctcca tggtttttta aaagttaaat tagctttctg 3301 aaagggtttt taatctccat ttttttaaag ttgtttgctt atacttcggg taaccttgat 3361 atttgtattt taatagtaca taatctttat gaaaaatagt ttgggaatgt aaatgaatta 3421 ttatttggct tggggagatt agggcctaca ttgtttatcg caattacttg tatcattgat 3481 acgggatttc tttgtaaagc atcctctacc tctcagctgc tgaaagctag acctttggta 3541 ttttccatgc tataattctt atggctgctg aatgtgtggt ttttatgatt tattaaataa 3601 tctcttagga ggcatttctg aaaaaaaaaa // LOCUS HSU41816 1241 bp mRNA PRI 18-OCT-1996 DEFINITION Human C-1 mRNA, complete cds. ACCESSION U41816 NID g1620560 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1241) AUTHORS Iijima,M., Kano,Y., Nohno,T. and Namba,M. TITLE Cloning of cDNA with possible transcription factor activity at the G1-S phase transition in human fibroblast cell lines JOURNAL Acta Medicinae Okayama 50 (2), 73-77 (1996) MEDLINE 96311295 REFERENCE 2 (bases 1 to 1241) AUTHORS Iijima,M. TITLE Direct Submission JOURNAL Submitted (02-DEC-1995) Mikio Iijima, Department of Cell Biology, Institute of Molecular and Cellular Biology, Okayama University Medical School, Okayama 700, Japan FEATURES Location/Qualifiers source 1..1241 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="SV40T transformed fibroblasts" gene 12..404 /gene="C-1" CDS 12..404 /gene="C-1" /codon_start=1 /product="C-1" /db_xref="PID:g1620561" /translation="MKKAAAEDVNVTFEDQQKINKFARNTSRITELKEEIEVKKKQLQ NLEDACDDIMLADDDCLMIPYQIGDVFISHSQEETQEMLEEAKKNLQEEIDALESRVE SIQRVLADLKVQLYAKFGSNINLEADES" BASE COUNT 463 a 156 c 209 g 413 t ORIGIN 1 tggcggccac catgaagaag gcggctgcag aagatgtcaa tgttactttc gaagatcaac 61 aaaagataaa caaatttgca cggaatacaa gtagaatcac agagctgaag gaagaaatag 121 aagtaaaaaa gaaacaactc caaaacctag aagatgcttg tgatgacatc atgcttgcag 181 atgatgattg cttaatgata ccttatcaaa ttggtgatgt cttcattagc cattctcaag 241 aagaaacgca agaaatgtta gaagaagcaa agaaaaattt gcaagaagaa attgacgcct 301 tagaatccag agtggaatca attcagcgag tgttagcaga tttgaaagtt cagttgtatg 361 caaaattcgg gagcaacata aaccttgaag ctgatgaaag ttaaacattt tataatactt 421 tttttatttg tttaataaac ttgaatattg tttaaaatga taatttcctt cttcaaatga 481 catggaaagc aaaactttct tttttaaaaa ttttcattta tttaatggaa acttgcccat 541 tttcacatgt ctgcttattt attttatatt tttaaaagaa gacagtattc acctatgtat 601 tttgcataac gattatatca agtctagggg cttcatgtca tgttattaaa atcagttaag 661 caatctttta tgtttctata ttatttagaa tatttgttgt tgcaattttc acataagaaa 721 atttaacagt tgtgtcatgt tgtttctgtc tgattttaat tgctgtctaa tgacggggaa 781 agcacgacga aaagatgtac aatcctgcat ccttgcttat ttcacaacta aagctttgtc 841 atagacttca aaatatatat gtatatattt tatttaaata tatgttacat attatattta 901 aacatacata tttaacattt tttacatatc tatcaatatc agagatttgg gtaaaagaat 961 gggtaatgtt taaacatgtg gaggcatgtg gagctttata caaacagggc agaaccacag 1021 aagacgtttt agaaaccaag agatgtgcag aaagaatgtt tagtgttttt tcgttttaaa 1081 ttttagattt tattttagtg ctttgtaatt aattggggtt tatattgata aagatgtgga 1141 agttaaacag ctatgtatgt aaaagtaagg cttatttctt aaataaagga tgcatttctt 1201 cccaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa a // LOCUS HSU41901 1017 bp mRNA PRI 11-JUL-1996 DEFINITION Human limbic system-associated membrane protein LAMP mRNA, complete cds. ACCESSION U41901 NID g1276898 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1017) AUTHORS Pimenta,A.F., Fischer,I. and Levitt,P. TITLE cDNA cloning and structural analysis of the human limbic-system-associated membrane protein (LAMP) JOURNAL Gene 170 (2), 189-195 (1996) MEDLINE 96235133 REFERENCE 2 (bases 1 to 1017) AUTHORS Levitt,P.R., Pimenta,A.F. and Fischer,I. TITLE Direct Submission JOURNAL Submitted (04-DEC-1995) Aurea F. Pimenta, Neuroscience and Cell Biology, UMDNJ-Robert Wood Johnson Medical School, 675 Hoes Lane, Piscataway, NJ 08854, USA FEATURES Location/Qualifiers source 1..1017 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1..1017 /standard_name="limbic system-associated membrane protein" /note="IgSF" /codon_start=1 /product="LAMP" /db_xref="PID:g1276899" /translation="MVGRVQPDRKQLPLVLLRLLCLLPTGLPVRSVDFNRGTDNITVR QGDTAILRCVLEDKNSKVAWLNRSGIIFAGHDKWSLDPRVELEKRHSLEYSLRIQKVD VYDEGSYTCSVQTQHEPKTSQVYLIVQVPPKISNISSDVTVNEGSNVTLVCMANGRPE PVITWRHLTPTGREFEGEEEYLEILGITREQSGKYECKAANEVSSADVKQVKVTVNYP PTITESKSNEATTGRQASLKCEASAVPAPDFEWYRDDTRINSANGLEIKSTEGQSSLT VTNVTEEHYGNYTCVAANKLGVTNASLVLFRPGSVRGINGSISLAVPLWLLAASLLCL LSKC" BASE COUNT 264 a 267 c 266 g 220 t ORIGIN 1 atggtcggga gagttcaacc ggatcggaaa cagttgccac tggtcctact gagattgctc 61 tgccttcttc ccacaggact gcctgttcgc agcgtggatt ttaaccgagg cacggacaac 121 atcaccgtga ggcaggggga cacagccatc ctcaggtgcg ttctagaaga caagaactca 181 aaggtggcct ggttgaaccg ttctggcatc atttttgctg gacatgacaa gtggtctctg 241 gacccacggg ttgagctgga gaaacgccat tctctggaat acagcctccg aatccagaag 301 gtggatgtct atgatgaggg ttcctacact tgctcagttc agacacagca tgagcccaag 361 acctcccaag tttacttgat cgtacaagtc ccaccaaaga tctccaatat ctcctcggat 421 gtcactgtga atgagggcag caacgtgact ctggtctgca tggccaatgg ccgtcctgaa 481 cctgttatca cctggagaca ccttacacca actggaaggg aatttgaagg agaagaagaa 541 tatctggaga tccttggcat caccagggag cagtcaggca aatatgagtg caaagctgcc 601 aacgaggtct cctcggcgga tgtcaaacaa gtcaaggtca ctgtgaacta tcctcccact 661 atcacagaat ccaagagcaa tgaagccacc acaggacgac aagcttcact caaatgtgag 721 gcctcggcag tgcctgcacc tgactttgag tggtaccggg atgacactag gataaatagt 781 gccaatggcc ttgagattaa gagcacggag ggccagtctt ccctgacggt gaccaacgtc 841 actgaggagc actacggcaa ctacacctgt gtggctgcca acaagctggg ggtcaccaat 901 gccagcctag tccttttcag acctgggtcg gtgagaggaa taaatggatc catcagtctg 961 gccgtaccac tgtggctgct ggcagcatct ctgctctgcc ttctcagcaa atgttaa // LOCUS HSU42029 1426 bp mRNA PRI 25-APR-1996 DEFINITION Human P2Y1 purinoceptor mRNA, short form, complete cds. ACCESSION U42029 NID g1147730 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1426) AUTHORS Ayyanathan,K., Webbs,T.E., Sandhu,A.K., Athwal,R.S., Barnard,E.A. and Kunapuli,S.P. TITLE Cloning and chromosomal localization of the human P2Y1 purinoceptor JOURNAL Biochem. Biophys. Res. Commun. 218 (3), 783-788 (1996) MEDLINE 96158962 REFERENCE 2 (bases 1 to 1426) AUTHORS Ayyanathan,K. and Kunapuli,S.P. TITLE Direct Submission JOURNAL Submitted (05-DEC-1995) Satya Kunapuli, Physiology, Temple University Medical School, 3420 North Broad Street, Philadelphia, PA 19140, USA FEATURES Location/Qualifiers source 1..1426 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="3" /cell_type="erythro leukemia cells" CDS 47..1168 /note="ATP receptor" /codon_start=1 /product="P2Y1 purinergic receptor, short form" /db_xref="PID:g1147731" /translation="MTEVLWPAVPNGTDAAFLAGPGSSWGNSTVASTAAVSSSFKCAL TKTGFQFYYLPAVYILVFIIGFLGNSVAIWMFVFHMKPWSGISVYMFNLALADFLYVL TLPALIFYYFNKTDWIFGDAMCKLQRFIFHVNLYGSILFLTCISAHRYSGVVYPLKSL GRLKKKNAICISVLVWLIVVVAISPILFYSGTGVRKNKTITCYDTTSDEYLRSYFIYS MCTTVAMFCVPLVLILGCYGLIVRALIYKDLDNSPLRRKSIYLVIIVLTVFAVSYIPF HVMKTMNLRARLDFQTPAMCAFNDRVYATYQVTRGLASLNSCVDPILYFLAGDTFRRR LSRATRKASRRSEANLQSKSEDMTLNILPEFKQNGDTSL" BASE COUNT 360 a 360 c 326 g 380 t ORIGIN 1 ccgcctccta cccctcggag ccgccgccta agtcgaggag gagagaatga ccgaggtgct 61 gtggccggct gtccccaacg ggacggacgc tgccttcctg gccggtccgg gttcgtcctg 121 ggggaacagc acggtcgcct ccactgccgc cgtctcctcg tcgttcaaat gcgccttgac 181 caagacgggc ttccagtttt actacctgcc ggctgtctac atcttggtat tcatcatcgg 241 cttcctgggc aacagcgtgg ccatctggat gttcgtcttc cacatgaagc cctggagcgg 301 catctccgtg tacatgttca atttggctct ggccgacttc ttgtacgtgc tgactctgcc 361 agccctgatc ttctactact tcaataaaac agactggatc ttcggggatg ccatgtgtaa 421 actgcagagg ttcatctttc atgtgaacct ctatggcagc atcttgtttc tgacatgcat 481 cagtgcccac cggtacagcg gtgtggtgta ccccctcaag tccctgggcc ggctcaaaaa 541 gaagaatgcg atctgtatca gcgtgctggt gtggctcatt gtggtggtgg cgatctcccc 601 catcctcttc tactcaggta ccggggtccg caaaaacaaa accatcacct gttacgacac 661 cacctcagac gagtacctgc gaagttattt catctacagc atgtgcacga ccgtggccat 721 gttctgtgtc cccttggtgc tgattctggg ctgttacgga ttaattgtga gagctttgat 781 ttacaaagat ctggacaact ctcctctgag gagaaaatcg atttacctgg taatcattgt 841 actgactgtt tttgctgtgt cttacatccc tttccatgtg atgaaaacga tgaacttgag 901 ggcccggctt gattttcaga ccccagcaat gtgtgctttc aatgacaggg tttatgccac 961 gtatcaggtg acaagaggtc tagcaagtct caacagttgt gtggacccca ttctctattt 1021 cttggcggga gatactttca gaaggagact ctcccgagcc acaaggaaag cttctagaag 1081 aagtgaggca aatttgcaat ccaagagtga agacatgacc ctcaatattt tacctgagtt 1141 caagcagaat ggagatacaa gcctgtgaag gcacaagaat ctccaaacac ctctctgttg 1201 taatatggta ggatgcttaa cagaatcaag tacttttccc ctctttaact ttctagttta 1261 gaaaaaaatc aaaccaagaa aatagtgagt taaaaaaata atagaagtag aaatgcccac 1321 atccacactt agcttgtttg ggtttgcttt cacagtctct cttccttctg actagaagta 1381 tgtataataa aacaatacta cctagttaaa aaaaaaaaaa aaaaaa // LOCUS HSU42068 1913 bp mRNA PRI 05-JAN-1996 DEFINITION Human liver endoplasmic reticulum P58 mRNA, complete cds. ACCESSION U42068 NID g1147738 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1913) AUTHORS Bourdi,M., Demady,D., Martin,J.L., Jabbour,S.K., Martin,B.M., George,J.W. and Pohl,L.R. TITLE cDNA cloning and baculovirus expression of the human liver endoplasmic reticulum P58: characterization as a protein disulfide isomerase isoform, but not as a protease or a carnitine acyltransferase JOURNAL Arch. Biochem. Biophys. 323 (2), 397-403 (1995) MEDLINE 96063616 REFERENCE 2 (bases 1 to 1913) AUTHORS Bourdi,M. TITLE Direct Submission JOURNAL Submitted (05-DEC-1995) Mohammed Bourdi, LMI/NHLBI, National Institues of Health, Building 10, Room 8N110, Bethesda, MD 20892-1760, USA FEATURES Location/Qualifiers source 1..1913 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver endoplasmic reticulum" CDS 79..1596 /note="protein disulfide isomerase isoform" /codon_start=1 /product="P58" /db_xref="PID:g1147739" /translation="MRLRRLALFPGVALLLAAARLAAASDVLELTDDNFESRISDTGS AGLMLVEFFAPWCGHCKRLAPEYEAAATRLKGIVPLAKVDCTANTNTCNKYGVSGYPT LKIFRDGEEAGAYDGPRTADGIVSHLKKQAGPASVPLRTEEEFKKFISDKDASIVGFF DDSFSEAHSEFLKAASNLRDNYRFAHTNVESLVNEYDDNGEGIILFRPSHLTNKFEDK TVAYTEQKMTSGKIKKFIQENIFGICPHMTEDNKDLIQGKDLLIAYYDVDYEKNAKGS NYWRNRVMMVAKKFLDAGHKLNFAVASRKTFSHELSDFGLESTAGEIPVVAIRTAKGE KFVMQEEFSRDGKALERFLQDYFDGNLKRYLKSEPIPESNDGPVKVVVAENFDEIVNN ENKDVLIEFYAPWCGHCKNLEPKYKELGEKLSKDPNIVIAKMDATANDVPSPYEVRGF PTIYFSPANKKLNPKKYEGGRELSDFISYLQREATNPPVIQEEKPKKKKKAQEDL" BASE COUNT 556 a 402 c 473 g 482 t ORIGIN 1 aattcggcac gaggcccgac ctccgcagtc ccagccgagc cgcgaccctt ccggccgtcc 61 ccaccccacc tcgccgccat gcgcctccgc cgcctagcgc tgttcccggg tgtggcgctg 121 cttcttgccg cggcccgcct cgccgctgcc tccgacgtgc tagaactcac ggacgacaac 181 ttcgagagtc gcatctccga cacgggctct gcgggcctca tgctcgtcga gttcttcgct 241 ccctggtgtg gacactgcaa gagacttgca cctgagtatg aagctgcagc taccagatta 301 aaaggaatag tcccattagc aaaggttgat tgcactgcca acactaacac ctgtaataaa 361 tatggagtca gtggatatcc aaccctgaag atatttagag atggtgaaga agcaggtgct 421 tatgatggac ctaggactgc tgatggaatt gtcagccact tgaagaagca ggcaggacca 481 gcttcagtgc ctctcaggac tgaggaagaa tttaagaaat tcattagtga taaagatgcc 541 tctatagtag gttttttcga tgattcattc agtgaggctc actccgagtt cctaaaagca 601 gccagcaact tgagggataa ctaccgattt gcacatacga atgttgagtc tctggtgaac 661 gagtatgatg ataatggaga gggtatcatc ttatttcgtc cttcacatct cactaacaag 721 tttgaggaca agactgtggc atatacagag caaaaaatga ccagtggcaa aattaaaaag 781 tttatccagg aaaacatttt tggtatctgc cctcacatga cagaagacaa taaagatttg 841 atacagggca aggacttact tattgcttac tatgatgtgg actatgaaaa gaacgctaaa 901 ggttccaact actggagaaa cagggtaatg atggtggcaa agaaattcct ggatgctggg 961 cacaaactca actttgctgt agctagccgc aaaaccttta gccatgaact ttctgatttt 1021 ggcttggaga gcactgctgg agagattcct gttgttgcta tcagaactgc taaaggagag 1081 aagtttgtca tgcaggagga gttctcgcgt gatgggaagg ctctggagag gttcctgcag 1141 gattactttg atggcaatct gaagagatac ctgaagtctg aacctatccc agagagcaat 1201 gatgggcctg tgaaggtagt ggtagcagag aattttgatg aaatagtgaa taatgaaaat 1261 aaagatgtgc tgattgaatt ttatgcccct tggtgtggtc attgtaagaa cctggagccc 1321 aagtataaag aacttggcga gaagctcagc aaagacccaa atatcgtcat agccaagatg 1381 gatgccacag ccaatgatgt gccttctcca tatgaagtca gaggttttcc taccatatac 1441 ttctctccag ccaacaagaa gctaaatcca aagaaatatg aaggtggccg tgaattaagt 1501 gattttatta gctatctaca aagagaagct acaaaccccc ctgtaattca agaagaaaaa 1561 cccaagaaga agaagaaggc acaggaggat ctctaaagca gtagccaaac accactttgt 1621 aaaaggactc ttccatcaga gatgggaaaa ccattgggga ggactaggac ccatatggga 1681 attattacct ctcagggccg agaggacaga atggatataa tctgaatcct gttaaatttt 1741 ctctaaactg tttcttagct gcactgttta tggaaatacc aggaccagtt tatgtttgtg 1801 gttttgggaa aaattatttg tgttggggga aatgttgtgg gggtggggtt gagttggggg 1861 tattttctaa ttttttttgt acatttggaa cagtgacaat aaatctcgtg ccg // LOCUS HSU42349 1342 bp mRNA PRI 07-NOV-1996 DEFINITION Human N33 mRNA, complete cds. ACCESSION U42349 NID g1353672 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1342) AUTHORS MacGrogan,D., Levy,A., Bova,G.S., Isaacs,W.B. and Bookstein,R. TITLE Structure and methylation-associated silencing of a gene within a homozygously deleted region of human chromosome band 8p22 JOURNAL Genomics 35 (1), 55-65 (1996) MEDLINE 96299740 REFERENCE 2 (bases 1 to 1342) AUTHORS Bookstein,R., MacGrogan,D., Levy,A., Bova,G.S. and Isaacs,W.B. TITLE Direct Submission JOURNAL Submitted (05-DEC-1995) Robert Bookstein, Molecular Biology, Canji, Inc., 3030 Science Park Rd., San Diego, CA 92121, USA FEATURES Location/Qualifiers source 1..1342 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="8" /map="8p22" gene 158..1204 /gene="N33" CDS 158..1204 /gene="N33" /note="39 kDa encoded by N33" /codon_start=1 /db_xref="PID:g1353673" /translation="MGARGAPSRRRQAGRRLRYLPTGSFPFLLLLLLLCIQLGGGQKK KENLLAEKVEQLMEWSSRRSIFRMNGDKFRKFIKAPPRNYSMIVMFTALQPQRQCSVC RQANEEYQILANSWRYSSAFCNKLFFSMVDYDEGTDVFQQLNMNSAPTFMHFPPKGRP KRADTFDLQRIGFAAEQLAKWIADRTDVHIRVFRPPNYSGTIALALLVSLVGGLLYLR RNNLEFIYNKTGWAMVSLCIVFAMTSGQMWNHIRGPPYAHKNPHNGQVSYIHGSSQAQ FVAESHIILVLNAAITMGMVLLNEAATSKGDVGKRRIICLVGLGLVVFFFSFLLSIFR SKYHGYPYSDLDFE" BASE COUNT 324 a 314 c 348 g 356 t ORIGIN 1 gaattcgggc ggccgcggcc cgggtccctc gcaaagccgc tgccatcccg gagggcccag 61 ccagcgggct cccggaggct ggccgggcag gcgtggtgcg cggtaggagc tgggcgcgca 121 cggctaccgc gcgtggagga gacactgccc tgccgcgatg ggggcccggg gcgctccttc 181 acgccgtagg caagcggggc ggcggctgcg gtacctgccc accgggagct ttcccttcct 241 tctcctgctg ctgctgctct gcatccagct cgggggagga cagaagaaaa aggagaatct 301 tttagctgaa aaagtagagc agctgatgga atggagttcc agacgctcaa tcttccgaat 361 gaatggtgat aaattccgaa aatttataaa ggcaccacct cgaaactatt ccatgattgt 421 tatgttcact gctcttcagc ctcagcggca gtgttctgtg tgcaggcaag ctaatgaaga 481 atatcaaata ctggcgaact cctggcgcta ttcatctgct ttttgtaaca agctcttctt 541 cagtatggtg gactatgatg aggggacaga cgtttttcag cagctcaaca tgaactctgc 601 tcctacattc atgcattttc ctccaaaagg cagacctaag agagctgata cttttgacct 661 ccaaagaatt ggatttgcag ctgagcaact agcaaagtgg attgctgaca gaacggatgt 721 tcatattcgg gttttcagac cacccaacta ctctggtacc attgctttgg ccctgttagt 781 gtcgcttgtt ggaggtttgc tttatttgag aaggaacaac ttggagttca tctataacaa 841 gactggttgg gccatggtgt ctctgtgtat agtctttgct atgacttctg gccagatgtg 901 gaaccatatc cgtggacctc catatgctca taagaaccca cacaatggac aagtgagcta 961 cattcatggg agcagccagg ctcagtttgt ggcagaatca cacattattc tggtactgaa 1021 tgccgctatc accatgggga tggttcttct aaatgaagca gcaacttcga aaggcgatgt 1081 tggaaaaaga cggataattt gcctagtggg attgggcctg gtggtcttct tcttcagttt 1141 tctactttca atatttcgtt ccaagtacca cggctatcct tatagtgatc tggactttga 1201 gtgagaagat gtgatttgga ccatggcact taaaaactct ataacctcag ccttttaatt 1261 aaatgaagcc aagtgggatt tgcataaagt gaatgtttac catgaagata aactgttcct 1321 gactttatac tattttgaat tc // LOCUS HSU42390 8906 bp mRNA PRI 03-JUN-1997 DEFINITION Homo sapiens Trio mRNA, complete cds. ACCESSION U42390 NID g1353702 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8906) AUTHORS Debant,A., Serra-Pages,C., Seipel,K., O'Brien,S., Tang,M., Park,S.H. and Streuli,M. TITLE The multidomain protein Trio binds the LAR transmembrane tyrosine phosphatase, contains a protein kinase domain, and has separate rac-specific and rho-specific guanine nucleotide exchange factor domains JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (11), 5466-5471 (1996) MEDLINE 96224308 REFERENCE 2 (bases 1 to 8906) AUTHORS Streuli,M. TITLE Direct Submission JOURNAL Submitted (06-DEC-1995) Michel Streuli, Tumor Immunology, Dana-Farber Cancer Institute, 44 Binney Street, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..8906 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="5" /map="5p14-p15.1" CDS 67..8652 /note="similar to protein kinase; rac guanine exchange factor; rho guanine exchange factor, spectrin-like repeats" /codon_start=1 /product="Trio" /db_xref="PID:g1353703" /translation="MKAMDVLPILKEKVAYLSGGRDKRGGPILTFPARSNHDRIRQED LRRLISYLACIPSEEVCKRGFTVIVDMRGSKWDSIKPLLKILQESFPCCIHVALIIKP DNFWQKQRTNFGSSKFEFETNMVSLEGLTKVVDPSQLTPEFDGCLEYNHEEWIEIRVA FEDYISNATHMLSRLEELQDILAKKELPQDLEGARNMIEEHSQLKKKVIKAPIEDLDL EGQKLLQRIQSSESFPKKNSGSGNADLQNLLPKVSTMLDRLHSTRQHLHQMWHVRKLK LDQCFQLRLFEQDAEKMFDWITHNKGLFLNSYTEIGTSHPHAMELQTQHNHFAMNCMN VYVNINRIMSVANRLVESGHYASQQIRQIASQLEQEWKAFAAALDERSTLLDMSSIFH QKAEKYMSNVDSWCKACGEVDLPSELQDLEDAIHHHQGIYEHITLAYSEVSQDGKSLL DKLQRPLTPGSSDSLTASANYSKAVHHVLDVIHEVLHHQRHVRTIWQHRKVRLHQRLQ LCVFQQEVQQVLDWIENHGEAFLSKHTGVGKSLHRARALQKRHEDFEEVAQNTYTNAD KLLEAAEQLAQTGECDPEEIYQAAHQLEDRIQDFVRRVEQRKILLDMSVSFHTHVKEL WTWLEELQKELLDDVYAESVEAVQDLIKRFGQQQQTTLQVTVNVIKEGEDLIQQLRDS AISSNKTPHNSSINHIETVLQQLDEAQSQMEELFQERKIKLELFLHVRIFERDAIDII SDLESWNDELSQQMNDFDTEDLTIAEQRLQHHADKALTMNNLTFDVIHQGQDLLQYVN EVQASGVELLCDRDVDMATRVQDLLEFLHEKQQELDLAAEQHRKHLEQCVQLRHLQAE VKQVLGWIRNGESMLNAGLITASSLQEAEQLQREHEQFQHAIEKTHQSALQVQQKAEA MLQANHYDMDMIRDCAEKVASHWQQLMLKMEDRLKLVNASVAFYKTSEQVCSVLESLE QEYKREEDWCGGADKLGPNSETDHVTPMISKHLEQKEAFLKACTLARRNADVFLKYLH RNSVNMPGMVTHIKAPEQQVKNILNELFQRENRVLHYWTMRKRRLDQCQQYVVFERSA KQALEWIHDNGEFYLSTHTSTGSSIQHTQELLKEHEEFQITAKQTKERVKLLIQLADG FCEKGHAHAAEIKKCVTAVDKRYRDFSLRMEKYRTSLEKALGISSDSNKSSKSLQLDI IPASIPGSEVKLRDAAHELNEEKRKSARRKEFIMAELIQTEKAYVRDLRECMDTYLWE MTSGVEEIPPGIVNKELIIFGNMQEIYEFHNNIFLKELEKYEQLPEDVGHCFVTWADK FQMYVTYCKNKPDSTQLILEHAGSYFDEIQQRHGLANSISSYLIKPVQRITKYQLLLK ELLTCCEEGKGEIKDGLEVMLSVPKRANDAMHLSMLEGFDENIESQGELILQESFQVW DPKTLIRKGRERHLFLFEMSLVFSKEVKDSSGRSKYLYKSKLFTSELGVTEHVEGDPC KFALWVGRTPTSDNKIVLKASSIENKQDWIKHIREVIQERTIHLKGALKEPIHIPKTA PATRQKGRRDGEDLDSQGDGSSQPDTISIASRTSQNTLDSDKLSGGCELTVVIHDFTA CNSNELTIRRGQTVEVLERPHDKPDWCLVRTTDRSPAAEGLVPCGSLCIAHSRSSMEM EGIFNHKDSLSVSSNDASPPASVASLQPHMIGAQSSPGPKRPGNTLRKWLTSPVRRLS SGKADGHVKKLAHKHKKSREVRKSADAGSQKDSDDSAATPQDETVEERGRNEGLSSGT LSKSSSSGMQSCGEEEGEEGADAVPLPPPMAIQQHSLLQPDSQDDKASSRLLVRPTSS ETPSAAELVSAIEELVKSKMALEDRPSSLLVDQGDSSSPSFNPSDNSLLSSSSPIDEM EERKSSSLKRRHYVLQELVETERDYVRDLGYVVEGYMALMKEDGVPDDMKGKDKIVFG NIHQIYDWHRDFFLGELEKCLEDPEKLGSLFVKHERRLHMYIAYCQNKPKSEHIVSEY IDTFFEDLKQRLGHRLQLTDLLIKPVQRIMKYQLLLKDFLKYSKKASLDTSELERAVE VMCIVPRRCNDMMNVGRLQGFDGKIVAQGKLLLQDTFLVTDQDAGLLPRCRERRIFLF EQIVIFSEPLDKKKGFSMPGFLFKNSIKVSCLCLEENVENDPCKFALTSRTGDVVETF ILHSSSPSVRQTWIHEINQILENQRNFLNALTSPIEYQRNHSGGGGGGGSGAAAGVGA AAAAGPPVAAAATVAAPAAAAAPPARAGAGPPGSPSLSDTTPPCWSPLQPRARQRQTR CQSESSSSSNISTMLVTHDYTAVKEDEINVYQGEVVQILASNQQNMFLVFRAATDQCP AAEGWIPGFVLGHTSAVIVENPDGTLKKSTSWHTALRLRKKSEKKDKDGKREGKLENG YRKSREGLSNKVSVKLLNPNYIYDVPPEFVIPLSEVTCETGETVVLRCRVCGRPKASI TWKGPEHNTLNNDGHYSISYSDLGEATLKIVGVTTEDDGIYTCIAVNDMGSASSSASL RVLGPGMDGIMVTWKDNFDSFYSEVAELGRGRFSVVKKCDQKGTKRAVATKFVNKKLM KRDQVTHELGILQSLQHPLLVGLLDTFETPTSYILVLEMADQGRLLDCVVRWGSLTEG KIRAHLGEVLEAVRYLHNCRIAHLDLKPENILVDESLAKPTIKLADFGDAVQLNTTYY IHQLLGNPEFAAPEIILGNPVSLTSDTWSVGVLTYVLLSGVSPFLDDSVEETCLNICR LDFSFPDDYFKGVSQKAKEFVCFLLQEDPAKRPSAALALQEQWLQAGNGRSTGVLDTS RLTSFIERRKHQNDVRPIRSIKNFLQSRLLPRV" BASE COUNT 2393 a 2260 c 2420 g 1833 t ORIGIN 1 gaggcggcca aggacctggc cgacatcgcg gccttcttcc gatccgggtt tcgaaaaaac 61 gatgaaatga aagctatgga tgttttacca attttgaagg aaaaagttgc atacctttca 121 ggtgggagag ataaacgtgg aggtcccatt ttaacgtttc cggcccgcag caatcatgac 181 agaatacgac aggaggatct caggagactc atttcctatc tagcctgtat tcccagcgag 241 gaggtctgca agcgtggctt cacggtgatc gtggacatgc gtgggtccaa gtgggactcc 301 atcaagcccc ttctgaagat cctgcaggag tccttcccct gctgcatcca tgtggccctg 361 atcatcaagc cagacaactt ctggcagaaa cagaggacta attttggcag ttctaaattt 421 gaatttgaga caaatatggt ctctttagaa ggccttacca aagtagttga tccttctcag 481 ctaactcctg agtttgatgg ctgcctggaa tacaaccacg aagaatggat tgaaatcaga 541 gttgcttttg aagactacat tagcaatgcc acccacatgc tgtctcggct ggaggaactt 601 caggacatcc tagctaagaa ggagctgcct caggatttag agggggctcg gaatatgatc 661 gaggaacatt ctcagctgaa gaagaaggtg attaaggccc ccatcgagga cctggatttg 721 gagggacaga agctgcttca gaggatacag agcagtgaaa gctttcccaa aaagaactca 781 ggctcaggca atgcggacct gcagaacctc ttgcccaagg tgtccaccat gctggaccgg 841 ctgcactcga cacggcagca tctgcaccag atgtggcatg tgaggaagct gaagctggac 901 cagtgcttcc agctgaggct gtttgaacag gatgctgaga agatgtttga ctggatcaca 961 cacaacaaag gcctgtttct aaacagctac acagagattg ggaccagcca ccctcatgcc 1021 atggagcttc agacgcagca caatcacttt gccatgaact gtatgaacgt gtatgtaaat 1081 ataaaccgca tcatgtcggt ggccaatcgt ctggtggagt ctggccacta tgcctcgcag 1141 cagatcaggc agatcgcgag tcagctggag caggagtgga aggcgtttgc ggcagccctg 1201 gatgagcgga gcaccttgct ggacatgtcc tccattttcc accagaaggc cgaaaagtat 1261 atgagcaacg tggattcatg gtgtaaagct tgcggtgagg tagaccttcc ctcagagctg 1321 caggacctag aagatgccat tcatcaccac cagggaatat atgaacatat cactcttgct 1381 tattctgagg tcagccaaga tgggaagtcg ctccttgaca agctccagcg gcccttgact 1441 cccggcagct ccgattccct gacagcctct gccaactact ccaaggccgt gcaccatgtc 1501 ctggatgtca tccacgaggt gctgcaccac cagcggcacg tgagaacaat ctggcaacac 1561 cgcaaggtcc ggctgcatca gaggctgcag ctgtgtgttt tccagcagga agttcagcag 1621 gtgctagact ggatcgagaa ccacggagaa gcatttctga gcaaacatac aggtgtgggg 1681 aaatctcttc atcgggccag agcattgcag aaacgtcatg aagattttga agaagtggca 1741 cagaacacat acaccaatgc ggataaatta ctggaagcag cagaacagct ggctcagact 1801 ggggaatgtg accccgaaga gatttatcag gctgcccatc agctggaaga ccggattcaa 1861 gatttcgttc ggcgtgttga gcagcgaaag atcctactgg acatgtcagt gtcctttcac 1921 acccatgtga aagagctgtg gacgtggctg gaggagctgc agaaggagct gctggacgac 1981 gtgtatgccg agtcggtgga ggccgtgcag gacctcatca agcgctttgg ccagcagcag 2041 cagaccaccc tgcaggtgac tgtcaacgtg atcaaggaag gggaggacct catccagcag 2101 ctcagggact ctgccatctc cagtaacaag accccccaca acagctccat caaccacatt 2161 gagacggtgc tgcagcagct ggacgaggcg cagtcgcaga tggaggagct cttccaggag 2221 cgcaagatca agctggagct cttcctgcac gtgcgcatct tcgagaggga cgccatcgac 2281 attatctcag acctcgagtc ttggaatgat gagctttctc agcaaatgaa tgacttcgac 2341 acagaagatc tcacgattgc agagcagcgc ctccagcacc atgcagacaa agccttgacc 2401 atgaacaact tgacttttga cgtcatccac caagggcaag atcttctgca gtatgtcaat 2461 gaggtccagg cctctggtgt ggagctgctg tgtgatagag atgtagacat ggcaactcgg 2521 gtccaggacc tgctggagtt tcttcatgaa aaacagcagg aattggattt agccgcagag 2581 cagcatcgga aacacctgga gcagtgcgtg cagctgcgcc acctgcaggc agaagtgaaa 2641 caggtgctgg gttggatccg caacggagag tccatgttaa atgccggact tatcacagcc 2701 agctcgttac aagaggcaga gcagctccag cgagagcacg agcagttcca gcatgccatt 2761 gagaaaacac atcagagcgc gctgcaggtg cagcagaagg cagaagccat gctacaggcc 2821 aaccactacg acatggacat gatccgggac tgcgccgaga aggtggcgtc tcactggcaa 2881 cagctcatgc tcaagatgga agatcgcctc aagctcgtca acgcctctgt cgctttctac 2941 aaaacctcag agcaggtctg cagcgtcctc gagagcctgg aacaggagta caagagagaa 3001 gaagactggt gtggcggggc ggataagctg ggcccaaact ctgagacgga ccacgtgacg 3061 cccatgatca gcaagcacct ggagcagaag gaggcattcc tgaaggcttg cacccttgct 3121 cggaggaatg cagacgtctt cctgaaatac ctgcacagga acagcgtgaa catgccagga 3181 atggtgacgc acatcaaagc tcctgaacag caagtgaaaa atatcttgaa tgaactcttc 3241 caacgggaga acagggtatt gcattactgg accatgagga agagacggct ggaccagtgt 3301 cagcagtacg tggtctttga gaggagtgcc aagcaggctt tggaatggat ccatgacaat 3361 ggcgagttct acctttccac acacacctcc acgggctcca gtatacagca cacccaggag 3421 ctcctgaaag agcacgagga gttccagata actgcaaagc aaaccaaaga gagagtgaag 3481 ctattgatac agctggctga tggcttttgt gaaaaagggc atgcccatgc ggcagagata 3541 aaaaaatgtg ttactgctgt ggataagagg tacagagatt tctctctgcg gatggagaag 3601 tacaggacct ctttggaaaa agccctgggg atttcttcag attccaacaa atcgagtaaa 3661 agtctccagc tagatatcat tccagccagt atccctggct cagaggtgaa acttcgagat 3721 gctgctcatg aacttaatga agagaagcgg aaatctgccc gcaggaaaga gttcataatg 3781 gctgagctca ttcaaactga aaaggcttat gtaagagacc tccgggaatg tatggatacg 3841 tacctgtggg aaatgaccag tggcgtggaa gagattccac ctggcattgt aaacaaagaa 3901 ctcatcatct tcggaaacat gcaagaaatc tacgaatttc ataataacat attcctaaag 3961 gagctggaaa aatatgaaca gttgccagag gatgttggac attgttttgt tacttgggca 4021 gacaagtttc agatgtatgt cacatattgc aaaaataagc ctgattctac tcagctgata 4081 ttggaacatg cagggtccta ttttgacgag atacagcagc gacatggatt agccaattcc 4141 atttcttcct accttattaa accagttcag cgaataacga aatatcagct ccttttaaaa 4201 gagctgctga cgtgctgtga ggaaggaaag ggagagatta aagatggcct ggaggtgatg 4261 ctcagcgtgc cgaagcgagc caatgacgcc atgcacctca gcatgctgga agggtttgat 4321 gaaaacattg agtctcaggg agaactcatc ctacaggaat ccttccaagt gtgggaccca 4381 aaaaccttaa ttcgaaaggg tcgagaacgg catctcttcc tttttgaaat gtccttagta 4441 tttagtaaag aagtgaaaga ttccagtggg agaagcaagt acctttataa aagcaaattg 4501 tttacctcag agttgggtgt cacagaacat gttgaaggag acccttgcaa atttgcactg 4561 tgggtgggga gaacaccaac ttcagataat aaaattgtcc ttaaggcttc cagcatagag 4621 aacaagcagg actggataaa gcatatccgc gaagtcatcc aggagcggac gatccacctg 4681 aagggagccc tgaaggagcc cattcacatc cctaagaccg ctcccgccac aagacagaag 4741 ggaaggaggg atggagagga tctggacagc caaggagacg gcagcagcca gcctgatacg 4801 atttccatcg cctcacggac gtctcagaac acgctggaca gcgataagct ctctggtggc 4861 tgtgagctga cagtggtgat ccatgacttc accgcttgca acagcaacga gctgaccatc 4921 cgacggggcc agaccgtgga agttctggag cggccgcatg acaagcctga ctggtgtctg 4981 gtgcggacca ctgaccgctc cccagcggca gaaggcctgg tcccctgtgg ttcactgtgc 5041 atcgcccact ccagaagtag catggaaatg gagggcatct tcaaccacaa agactcgctc 5101 tccgtctcca gcaatgacgc cagtccaccc gcatccgtgg cttccctcca gccccacatg 5161 atcggggccc agagctcgcc gggccccaag cggccgggca acaccctgcg caagtggctc 5221 accagccccg tgcggcggct cagcagcggc aaggccgacg ggcacgtgaa gaagctggcg 5281 cacaagcaca agaagagccg cgaggtccgc aagagcgccg acgccggctc gcagaaggac 5341 tccgacgaca gtgcggccac cccgcaggac gagacggtcg aggagagagg ccggaacgag 5401 ggcctgagca gcggtactct ctccaaatcc tcctcctcgg ggatgcagag ctgtggagaa 5461 gaggaaggcg aggagggggc cgacgccgtg cccctgccgc cacccatggc catccagcag 5521 cacagcctcc tccagccaga ctcacaggat gacaaggcct cttctcggtt attagtccgc 5581 cccaccagct ccgaaacacc gagtgcagcc gagctcgtca gtgcaattga ggaactcgtg 5641 aaaagcaaga tggcactgga ggatcgcccc agctcactcc ttgttgacca gggagatagt 5701 agcagccctt ccttcaaccc ttcggataat tcccttctct cttcctcctc gcccattgat 5761 gagatggaag aaaggaaatc cagctcttta aagagaagac actacgtttt gcaagaacta 5821 gtggagacag agcgtgacta tgtgcgggac cttggctatg tggttgaggg ctacatggca 5881 cttatgaaag aagatggtgt tcctgatgac atgaaaggaa aagacaaaat tgtgttcggc 5941 aacatccatc agatttacga ctggcacaga gacttttttt taggagagtt agagaagtgc 6001 cttgaagatc cagaaaaact aggatccctt tttgttaaac acgagagaag gttgcacatg 6061 tacatagctt attgtcaaaa taaaccaaag tctgagcaca ttgtctcaga atacattgat 6121 accttttttg aggacttaaa gcagcgtctt ggccacaggt tacagctcac agatctgttg 6181 atcaaaccag tgcagagaat catgaagtat cagctgttac tgaaggactt cctcaagtat 6241 tccaaaaagg ccagcctgga tacatcagaa ttagagagag ctgtggaagt catgtgcata 6301 gtacccaggc ggtgcaacga catgatgaac gtggggcggc tgcaaggatt cgacgggaaa 6361 atcgttgccc agggtaaact gctcttgcag gacacattct tggtcacaga ccaagatgca 6421 ggacttctgc ctcgctgcag agagaggcgc atcttcctct ttgagcagat cgtcatattc 6481 agcgaaccac ttgataaaaa gaagggcttc tccatgccgg gattcctgtt taagaacagt 6541 atcaaggtga gttgcctttg cctggaggaa aatgtggaaa atgatccctg taaatttgct 6601 ctgacatcga ggacgggtga cgtggtagag accttcattt tgcattcatc tagtccaagt 6661 gtccggcaaa cttggatcca tgaaatcaac caaattttag aaaaccagcg caatttttta 6721 aatgccttga catcgccaat cgagtaccag aggaaccaca gcgggggcgg cggcggcggc 6781 ggcagcgggg cagcggcggg ggtgggggca gcggcggcgg cggggccccc agtggcggca 6841 gcggccacag tggcggcccc agcagctgcg gcggcgcccc cagcacgagc aggagccggc 6901 cctcccggat cccccagcct gtccgacacc acccccccgt gctggtctcc tctgcagcct 6961 cgagccaggc agaggcagac aagatgtcag agtgaaagca gcagcagtag caacatctcc 7021 accatgttgg tgacacacga ttacacggca gtgaaggagg atgagatcaa cgtctaccaa 7081 ggagaggtcg ttcaaattct ggccagcaac cagcagaaca tgtttctggt gttccgagcc 7141 gccactgacc agtgccccgc agctgagggc tggattccag gctttgtcct gggccacacc 7201 agtgcagtca tcgtggagaa cccggacggg actctcaaga agtcaacatc ttggcacaca 7261 gcactccgtt taaggaaaaa atctgagaaa aaagataaag acggcaaaag ggaaggcaag 7321 ttagagaacg gttatcggaa gtcacgggaa ggactcagca acaaggtatc tgtgaagctt 7381 ctcaatccca actacattta tgacgttccc ccagaattcg tcattccatt gagtgaggtc 7441 acgtgtgaga caggggagac cgttgttctt agatgtcgag tctgtggccg ccccaaagcc 7501 tcaattacct ggaagggccc tgaacacaac accttgaaca acgatggtca ctacagcatc 7561 tcctacagtg acctgggaga ggccacgctg aagattgtgg gcgtgaccac ggaagatgac 7621 ggcatctaca cgtgcatcgc tgtcaatgac atgggttcag cctcatcatc ggccagcctg 7681 agggtcctag gtccagggat ggatgggatc atggtgacct ggaaagacaa ctttgactcc 7741 ttctacagtg aagtggctga gcttggcagg ggcagattct ctgtcgttaa gaaatgtgat 7801 cagaaaggaa ccaagcgagc agtggccact aagtttgtga acaagaagtt gatgaagcgc 7861 gaccaggtca cccatgagct tggcatcctg cagagcctcc agcaccccct gcttgtcggc 7921 ctcctcgaca cctttgagac ccccaccagc tacatcctgg tcttagaaat ggctgaccag 7981 ggtcgcctcc tggactgcgt ggtgcgatgg ggaagcctca ctgaagggaa gatcagggcg 8041 cacctggggg aggttctgga agctgtccgg tacctgcaca actgcaggat agcacacctg 8101 gacctaaagc ctgagaatat cctggtggat gagagtttag ccaagccaac catcaaactg 8161 gctgactttg gagatgctgt tcagctcaac acgacctact acatccacca gttactgggg 8221 aaccctgaat tcgcagcccc tgaaatcatc ctcgggaacc ctgtctccct gacctcggat 8281 acgtggagtg ttggagtgct cacatacgta cttcttagtg gcgtgtcccc cttcctggat 8341 gacagtgtgg aagagacctg cctgaacatt tgccgcttag actttagctt cccagatgac 8401 tactttaaag gagtgagcca gaaggccaag gagttcgtgt gcttcctcct gcaggaggac 8461 cccgccaagc gtccctcggc tgcgctggcc ctccaggagc agtggctgca ggccggcaac 8521 ggcagaagca cgggcgtcct cgacacgtcc agactgactt ccttcattga gcggcgcaaa 8581 caccagaatg atgttcgacc tatccgtagc attaaaaact ttctgcagag caggcttctg 8641 cctagagttt gacctatcca gaagttcttt ctcattctct ttcacctgcc aatcagctgt 8701 taatctgaat tttcaagaga aaacaagcaa acataactga tcagctgccg gtatgttcat 8761 cgtgtgaaat tgcattccaa gtgagctgtg ctcagcagtg cttggacaca gagctgcaag 8821 ctgcgctggg gtggaggacc gtcacttaca ctctgccaag gacggaggtc gcattgctgt 8881 atcacagtat tttttacgga tttctg // LOCUS HSU42408 2442 bp mRNA PRI 06-JUN-1997 DEFINITION Human ladinin (LAD) mRNA, complete cds. ACCESSION U42408 NID g2160516 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2442) AUTHORS Megahed,M., Motoki,K., McGrath,J., LaForgia,S. and Uitto,J. TITLE Cloning of a Gene (LAD) Underlining Linear IgA Desease and Encoding a Novel Anchoring Filament of the Basement Membrane Zone JOURNAL Unpublished REFERENCE 2 (bases 1 to 2442) AUTHORS Megahed,M.M. TITLE Direct Submission JOURNAL Submitted (06-DEC-1995) Mosaad M. Megahed, Dermatology, Thomas Jefferson University, 233 South 10th Street, Philadelphia, PA 19107-5541, USA FEATURES Location/Qualifiers source 1..2442 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="MU1" /cell_type="keratinocyte" /tissue_type="skin" /dev_stage="adult" gene 220..1773 /gene="LAD" CDS 220..1773 /gene="LAD" /codon_start=1 /product="ladinin" /db_xref="PID:g2160517" /translation="MAVSRKDWSALSSLARQRTLEDEEEQERERRRRHRNLSSTTDDE APRLSQNGDRQASASERLPSVEEAEVPKPLPPASKDEDEDIQSILRTRQERRQRRQVV EAAQAPIQERLEAEEGRNSLSPVQATQKPLVSKKELEIPPRRRLSREQRGPWPLEEES LVGREPEERKKGVPEKSPVLEKSSMPKKTAPEKSLVSDKTSISEKVLASEKTSLSEKI AVSEKRNSSEKKSVLEKTSVSEKSLAPGMALGSGRRLVSEKASIFEKALASEKSPTAD AKPAPKRATASEQPLAQEPPASGGSPATTKEQRGRALPGKNLPSLAKQGASDPPTVAS RLPPVTLQVKIPSKEEEADMSSPTQRTYSSSLKRSSPRTISFRMKPKKENSETTLTRS ASMKLPDNTVKLGEKLERYHTAIRRSESVKSRGLPCTELFVAPVGVASKRHLFEKELA GQSRAEPASSRKENLRLSGVVTSRLNLWISRTQESGDQDPQEAQKASSATERTQWGQK SDSSLDAEV" BASE COUNT 562 a 746 c 731 g 403 t ORIGIN 1 gcgggattcc gggccgggcc ggcctgggct gcaatcaatg cggctttgtc tgggacgccc 61 acatcccaga ggccattccc gggtcggcaa atcggagcgc ggcggggcgc gcgggggtga 121 gataagcggc catgtgatcc cacctgggct ggaaggggag gggcgccagg tgaggcggcg 181 gccggtgggg cgcgggcggc cacgcggggc tcctgcagca tggctgtcag caggaaggac 241 tggtccgcgc tgtccagcct tgcccggcag aggactctgg aggatgagga ggaacaggag 301 cgcgagcgca ggcggcggca ccgcaacctg agctccacca cggacgatga ggctcccagg 361 ctcagccaga atggagaccg gcaggcctct gcttctgaga gactaccgag cgtggaagaa 421 gcagaggtgc ccaagccact gcccccagcc tccaaagatg aggacgagga catccagagc 481 atcctcagaa cacggcagga gcggaggcag aggcggcagg tggtggaggc tgcacaggcc 541 cccatccagg agaggctgga ggcagaggag gggaggaaca gcttgagccc tgtgcaggcc 601 acacagaaac ccctagtctc caagaaggaa ctggaaatcc cacctcgccg gagactgagt 661 cgggaacagc ggggcccctg gcccctggag gaggagagct tggtgggcag ggagccagaa 721 gagaggaaga aaggggttcc agaaaagtcc ccagtcttgg agaagtcctc catgccaaag 781 aagacggcac ctgaaaagag cctggtctcc gataaaacct ccatctctga gaaggtgctg 841 gcctcagaga agacatctct atcagagaag atagcagtgt cagagaaaag aaacagctca 901 gagaagaagt ctgttctaga aaaaaccagt gtctctgaga agtcgctggc cccagggatg 961 gcactgggct caggaaggag gctggtgtct gagaaagctt ccatctttga gaaggcactg 1021 gcctcagaga agagcccaac tgcagatgct aagccggccc caaagagggc cacagcctca 1081 gagcagcccc tggcgcagga gccgccagcc tctgggggaa gcccagccac caccaaggag 1141 cagagaggaa gggccctccc tgggaagaac ctgccctctt tggcaaagca gggggcttca 1201 gaccctccga ctgtggcctc ccgcctccca cccgtcacac tccaggtgaa aatccccagc 1261 aaggaggaag aggcagatat gtcctcaccc acacagcgaa cctacagcag ctccctcaaa 1321 cgctccagcc ccaggaccat ctcctttcgg atgaaaccca agaaagaaaa ctcggaaaca 1381 accctaactc gcagtgccag catgaagctc ccagacaaca cagtgaagtt gggagagaag 1441 ctggagagat accacacggc catacggaga tcagaatctg tcaagtctcg gggtctgcct 1501 tgcactgagt tattcgtggc tcctgtgggt gtagccagca agcgccacct ctttgagaag 1561 gaactggcgg gccagagccg agcagaacca gcctccagcc ggaaggagaa cttgaggctc 1621 tcaggggttg tgacatcaag gctcaacctg tggatcagca ggacccagga atctggagat 1681 caggaccccc aggaggcaca gaaagcatca tctgcaaccg agaggactca gtggggacag 1741 aaatctgact cctcgctgga cgctgaggtg tgacaagccc cgccaagaca gacctgcaag 1801 tcttcgtctc aagggacctc cctcatgcca ggcccctgcc tctcacagca gcaccctttc 1861 ctctcattgt ccctgttccc ttgttggctg tggatctgtt tggccagggt ccctggggtc 1921 aggaatattt gcaagactca gccagctcct tcccagccca gcctcttggg gctgggactt 1981 tctcaccctg cggcaggcac aacagatgct gggacccagt ctctgcccag gtcacagcac 2041 aagtgcacat cagcactatg gggcctatgt cctgcccaga gacctctgct ccttcctgct 2101 cacatccaca gtcagggcac ggcgcccctc aagaactcca gagtcacctg tctcatcggc 2161 tcccaacaag tgcctctttg tctatgatgt cccccttctc tgaggcctgg acccacccat 2221 ctttgtccct gggggctgct cccagccact gaggcccgct ctggccaggg gagaaggagc 2281 tgccgtgcgt cttccctgtg ccccgtctcc ctgcttggtt ctcccctccc ttccctggcc 2341 ggctgccatg gccaggagct aagtgccttt ttgtgtgcaa ccacttaccc tttctctgaa 2401 aaacctgttc tcaggaagga tctgataaac tcatttactc tc // LOCUS HSU42412 1578 bp mRNA PRI 30-MAY-1996 DEFINITION Human 5'-AMP-activated protein kinase, gamma-1 subunit mRNA, complete cds. ACCESSION U42412 NID g1335855 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1578) AUTHORS Gao,G., Fernandez,C.S., Stapleton,D., Auster,A.S., Widmer,J., Dyck,J.R., Kemp,B.E. and Witters,L.A. TITLE Non-catalytic beta- and gamma-subunit isoforms of the 5'-AMP-activated protein kinase JOURNAL J. Biol. Chem. 271 (15), 8675-8681 (1996) MEDLINE 96224074 REFERENCE 2 (bases 1 to 1578) AUTHORS Fernandez,C.S., Stapleton,D.S., Gao,G., Widmer,J., Auster,A., Kemp,B.E. and Witters,L.A. TITLE Direct Submission JOURNAL Submitted (07-DEC-1995) Lee A. Witters, Medicine/Biochemistry, Dartmouth Medical School, N. College St., Hanover, NH 03755-3833, USA FEATURES Location/Qualifiers source 1..1578 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 95..1090 /codon_start=1 /product="5'-AMP-activated protein kinase, gamma-1 subunit" /db_xref="PID:g1335856" /translation="METVISSDSSPAVENEHPQETPESNNSVYTSFMKSHRCYDLIPT SSKLVVFDTSLQVKKAFFALVTNGVRAAPLWDSKKQSFVGMLTITDFINILHRYYKSA LVQIYELEEHKIETWREVYLQDSFKPLVCISPNASLFDAVSSLIRNKIHRLPVIDPES GNTLYILTHKRILKFLKLFITEFPKPEFMSKSLEELQIGTYANIAMVRTTTPVYVALG IFVQHRVSALPVVDEKGRVVDIYSKFDVINLAAEKTYNNLDVSVTKALQHRSHYFEGV LKCYLHETLETIINRLVEAEVHRLVVVDENDVVKGIVSLSDILQALVLTGGEKKP" BASE COUNT 400 a 377 c 379 g 422 t ORIGIN 1 gcgcccttaa agatggtgag ggggctcatg ctctgagtag aaggtggtga cctccaggag 61 cggtgggatg atgagggccc gggcgcctct tgcaatggag acggtcattt cttcagatag 121 ctccccagct gtggaaaatg agcatcctca agagacccca gaatccaaca atagcgtgta 181 tacttccttc atgaagtctc atcgctgcta tgacctgatt cccacaagct ccaaattggt 241 tgtatttgat acgtccctgc aggtgaagaa agcttttttt gctttggtga ctaacggtgt 301 acgagctgcc cctttatggg atagtaagaa gcaaagtttt gtgggcatgc tgaccatcac 361 tgatttcatc aatatcctgc accgctacta taaatcagcc ttggtacaga tctatgagct 421 agaagaacac aagatagaaa cttggagaga ggtgtatctc caggactcct ttaaaccgct 481 tgtctgcatt tctcctaatg ccagcttgtt tgatgctgtc tcttcattaa ttcggaacaa 541 gatccacagg ctgccagtta ttgacccaga atcaggcaat actttgtaca tcctcaccca 601 caagcgcatt ctgaagttcc tcaaattgtt tatcactgag ttccccaagc cagagttcat 661 gtccaagtct ctggaagagc tacagattgg cacctatgcc aatattgcta tggttcgcac 721 taccaccccc gtctatgtgg ctctggggat ttttgtacag catcgagtct cagccctgcc 781 agtggtggat gagaaggggc gtgtggtgga catctactcc aagtttgatg ttatcaatct 841 ggcagcagaa aagacctaca acaacctaga tgtatctgtg actaaagcct tgcaacatcg 901 atcacattac tttgagggtg ttctcaagtg ctacctgcat gagactctgg agaccatcat 961 caacaggcta gtggaagcag aggttcaccg acttgtagtg gtggatgaaa atgatgtggt 1021 caagggaatt gtatcactgt ctgacatcct gcaggccctg gtgctcacag gtggagagaa 1081 gaagccctga gctgggggaa ggggtcatgc agcaccaggg gatatgccca actcactgcc 1141 tgctggaagc tctgtgggaa tcagatgaaa cttgagggaa ttgtgactct gttccctgtt 1201 cagggtcccc tgcccttcta tctgggagct agggaaggta tgggggagga aagagaatgg 1261 atttatagct acccttaccc tcacacatac acttgaaaaa actttcagcc tagccagttc 1321 tagcccctgt cctcttagat atatccccct ttctgggtga actataggct ctgtgcctct 1381 cagacaaatt ctgatctcta agagatcccc agacctcact tgcctctgcc tccatcttgg 1441 ccctgattca accctaagat aatagcacaa caaaattctt cataaagata tttttattca 1501 cctgttccgt gctatatgga ggaggccaag tccatttagt gacatttctt cccataatgt 1561 gagtggggag gattgtgg // LOCUS HSU42604 1244 bp DNA PRI 02-FEB-1996 DEFINITION Human UDP-glucuronosyltransferase (UGT1H), exon 1. ACCESSION U42604 NID g1174043 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1244) AUTHORS Cho,J.W., Gholami,N. and Owens,I.S. TITLE Extension of UGT1 Locus JOURNAL Unpublished REFERENCE 2 (bases 1 to 1244) AUTHORS Owens,I.S., Cho,J.W. and Gholami,N. TITLE Direct Submission JOURNAL Submitted (08-DEC-1995) Ida S. Owens, HDB, NICHD, 9000 Rockville Pike, Bldg. 10/8D43, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..1244 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" TATA_signal 136..151 gene 264..1196 /gene="UGT1H" exon 264..1193 /gene="UGT1H" /number=1 CDS 264..1196 /gene="UGT1H" /codon_start=1 /product="UDP-glucuronosyltransferase" /db_xref="PID:g1174044" /translation="MARTGWTSPIPLCVSLLLTCGFAEAGKLLVVPMDGSHWFTMQSV VEKLILRGHEVVVVMPEVSWQLGKSLNCTVKTYSTSYTLEDLDREFMDFADAQWKAQV RSLFSLFLSSSNGFFNLFFSHCRSLFNDRKLVEYLKESSFDAVFLDPFDACALIVAKY FSLPSVVFARGIGCHYLEEGAQCPAPLSYVPRILLGFSDAMTFKERVRNHIMHLEEHL FCQYFSKNALEIASEILQTPVTAYDLYSHTSIWLLRTDFVLDYPKPVMPNMIFIGGIN CHQGKPLPMVSHLSFSTLGIILALEIKKRFLTEL" intron 1194..1244 /number=1 BASE COUNT 302 a 268 c 274 g 400 t ORIGIN 1 gggcatgatc tgtccaaggc agagactata agctactctt atagtactct tatgagatac 61 atacaagtag gtatctcaaa aaatgatact catgtattcc tgttcttatg agtaaatcat 121 tggcagtgag tgtgattttt ttttttttta tgacaggatc cctacacgcc ctctattggg 181 gtcaggtttt gtgcctgtag ttcttccgcc tacgtatcat agcagttaga atcccagctg 241 ctggctcggg ctgcagttct ctcatggctc gcacagggtg gaccagcccc attcccctat 301 gtgtttctct gctgctgacc tgtggctttg ctgaggcagg gaagctgctg gtagtgccca 361 tggatgggag tcactggttc accatgcagt cggtggtgga gaaacttatc ctcagggggc 421 atgaggtggt tgtagtcatg ccagaggtga gttggcaact gggaaaatca ctgaattgca 481 cagtgaagac ttactcaacc tcatacactc tggaggatct ggaccgggaa ttcatggatt 541 tcgccgatgc tcaatggaaa gcacaagtac gaagtttgtt ttctctattt ctgagttcat 601 ccaatggttt ttttaactta tttttttcgc attgcaggag tttgtttaat gaccgaaaat 661 tagtagaata cttaaaggag agttcttttg atgcggtgtt tcttgatcct tttgatgcct 721 gtgcgttaat tgttgccaaa tatttctccc tcccctctgt ggtcttcgcc aggggaatag 781 gttgccacta tcttgaagaa ggtgcacagt gccctgctcc tctttcctat gtccccagaa 841 ttctcttagg gttctcagat gccatgactt tcaaggagag agtacggaac cacatcatgc 901 acttggagga acatttattt tgccagtatt tttccaaaaa tgccctagaa atagcctctg 961 aaattctcca aacacctgtc acagcatatg atctctacag ccacacatca atttggttgt 1021 tgcgaacaga ctttgttttg gactatccca aacccgtgat gcccaatatg atcttcattg 1081 gtggtatcaa ctgccatcag ggaaagccat tgcctatggt aagtcacctc tcctttagca 1141 cattaggaat aatcttggct ttggaaatta aaaaaagatt ccttactgaa ttgtgatttg 1201 acattttcat ttgttgcatt tcaaatttct ttccagttta caga // LOCUS HSU43030 1539 bp mRNA PRI 12-JAN-1996 DEFINITION Human cardiotrophin-1 (CTF1) mRNA, complete cds. ACCESSION U43030 NID g1151149 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1539) AUTHORS Pennica,D., Swanson,T.A., Shaw,K.J., Kuang,W.-J., Gray,C., Beatty,B.G. and Wood,W.I. TITLE Human Cardiotrophin-1 Protein and Gene Structure, Biological Activities, and Chromosomal Localization JOURNAL Cytokine (1996) In press REFERENCE 2 (bases 1 to 1539) AUTHORS Wood,W.I. TITLE Direct Submission JOURNAL Submitted (11-DEC-1995) William I. Wood, Molecular Biology, Genentech, Inc., 460 Pt. San Bruno Blvd., S. San Francisco, CA 94402, USA FEATURES Location/Qualifiers source 1..1539 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" /map="16p11.1-16p11.2" gene 33..638 /gene="CTF1" CDS 33..638 /gene="CTF1" /note="CT-1; IL-6 family member" /codon_start=1 /product="cardiotrophin-1" /db_xref="PID:g1151150" /translation="MSRREGSLEDPQTDSSVSLLPHLEAKIRQTHSLAHLLTKYAEQL LQEYVQLQGDPFGLPSFSPPRLPVAGLSAPAPSHAGLPVHERLRLDAAALAALPPLLD AVCRRQAELNPRAPRLLRRLEDAARQARALGAAVEALLAALGAANRGPRAEPPAATAS AASATGVFPAKVLGLRVCGLYREWLSRTEGDLGQLLPGGSA" BASE COUNT 183 a 579 c 434 g 343 t ORIGIN 1 gtgaagggag ccgggatcag ccaggggcca gcatgagccg gagggaggga agtctggaag 61 acccccagac tgattcctca gtctcacttc ttccccactt ggaggccaag atccgtcaga 121 cacacagcct tgcgcacctc ctcaccaaat acgctgagca gctgctccag gaatatgtgc 181 agctccaggg agaccccttc gggctgccca gcttctcgcc gccgcggctg ccggtggccg 241 gcctgagcgc cccggctccg agccacgcgg ggctgccagt gcacgagcgg ctgcggctgg 301 acgcggcggc gctggccgcg ctgcccccgc tgctggacgc agtgtgtcgc cgccaggccg 361 agctgaaccc gcgcgcgccg cgcctgctgc gccgcctgga ggacgcggcg cgccaggccc 421 gggccctggg cgccgccgtg gaggccttgc tggccgcgct gggcgccgcc aaccgcgggc 481 cccgggccga gccccccgcc gccaccgcct cagccgcctc cgccaccggg gtcttccccg 541 ccaaggtgct ggggctccgc gtttgcggcc tctaccgcga gtggctgagc cgcaccgagg 601 gcgacctggg ccagctgctg cccgggggct cggcctgagc gccgcggggc agctcgcccc 661 gcctcctccc gctgggttcc gtctctcctt ccgcttcttt gtctttctct gccgctgtcg 721 gtgtctgtct gtctgctctt agctgtctcc attgcctcgg ccttctttgc tttttgtggg 781 ggagagggga ggggacgggc agggtctctg tcgcccaggc tggggtgcag tggcgcgatc 841 ccagcactgc agcctcaacc tcctgggctc aagccatcct tccgcctcag cttccccagc 901 agctgggact acaggcacgc gccaccacag ccggctaatt ttttatttaa ttttttgtag 961 agacgaggtt tcgccatgtt gcccaggctg gtcttgaact ccggggctca agcgatcctc 1021 ccgcttcagc ctccctaagt gctgggattg caggcgtgag ccactttccc agcctctctt 1081 tgctttgcct gccccgttct cttaactctt ggaccctcct cgtctgcatg gtaactccgt 1141 ctgagtctac cattttcttg ctctccctcc ttccttgggc ctgcctcagt tccctttggc 1201 ctcccccttt acccagctct tggggtgtct ctgttttttc catccccact tcctgccttc 1261 tcgtggccct gtgtgagcac atgtgtacat ctcagcctta tctcaaggag gtgacacctt 1321 ctctccttgt ccccatctgg ccgtctctct gtgcttccct ggccaggggc gtgcctgctg 1381 gtcctatggg gggaaggcta ctccgcatct cagccacctt cctcaggctc actccaccta 1441 catccccagt ctgccacacc ccatcccttt gggcctcagc cctgtccctt tgatgtcctc 1501 ctttccttca gcccctctgc cctgtccctg cacacctcc // LOCUS HSU43077 1559 bp mRNA PRI 24-JUL-1997 DEFINITION Human CDC37 homolog mRNA, complete cds. ACCESSION U43077 NID g1375484 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1559) AUTHORS Stepanova,L., Leng,X., Parker,S.B. and Harper,J.W. TITLE Mammalian p50Cdc37 is a protein kinase-targeting subunit of Hsp90 that binds and stabilizes Cdk4 JOURNAL Genes Dev. 10 (12), 1491-1502 (1996) MEDLINE 96258250 REFERENCE 2 (bases 1 to 1559) AUTHORS Harper,J.W. TITLE Direct Submission JOURNAL Submitted (11-DEC-1995) J. Wade Harper, Biochemistry, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..1559 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T-cells" CDS 9..1145 /note="similar to S. cerevisiae Cdc37p" /codon_start=1 /product="CDC37 homolog" /db_xref="PID:g1375485" /translation="MVDYSVWDHIEVSDDEDETHPNIDTASLFRWRHQARVERMEQFQ KEKEELDRGCRECKRKVAECQRKLKELEVAEGGKAELERLQAEAQQLRKEERSWEQKL EEMRKKEKSMPWNVDTLSKDGFSKSMVNTKPEKTEEDSEEVREQKHKTFVEKYEKQIK HFGMLRRWDDSQKYLSDNVHLVCEETANYLVIWCIDLEVEEKCALMEQVAHQTIVMQF ILELAKSLKVDPRACFRQFFTKIKTADRQYMEGFNDELEAFKERVRGRAKLRIEKAMK EYEEEERKKRLGPGGLDPVEVYESLPEELQKCFDVKDVQMLQDAISKMDPTDAKYHMQ RCIDSGLWVPNSKASEAKEGEEAGPGDPLLEAVPKTGDEKDVSV" polyA_site 1559 /note="13 A nucleotides" BASE COUNT 375 a 429 c 491 g 264 t ORIGIN 1 aaggaaagat ggtggactac agcgtgtggg accacattga ggtgtctgat gatgaagacg 61 agacgcaccc caacatcgac acggccagtc tcttccgctg gcggcatcag gcccgggtgg 121 aacgcatgga gcagttccag aaggagaagg aggaactgga caggggctgc cgcgagtgca 181 agcgcaaggt ggccgagtgc cagaggaaac tgaaggagct ggaggtggcc gagggcggca 241 aggcagagct ggagcgcctg caggccgagg cacagcagct gcgcaaggag gagcggagct 301 gggagcagaa gctggaggag atgcgcaaga aggagaagag catgccctgg aacgtggaca 361 cgctcagcaa agacggcttc agcaagagca tggtaaatac caagcccgag aagacggagg 421 aggactcaga ggaggtgagg gagcagaaac acaagacctt cgtggaaaaa tacgagaaac 481 agatcaagca ctttggcatg cttcgccgct gggatgacag ccaaaagtac ctgtcagaca 541 acgtccacct ggtgtgcgag gagacagcca attacctggt catttggtgc attgacctag 601 aggtggagga gaaatgtgca ctcatggagc aggtggccca ccagacaatc gtcatgcaat 661 ttatcctgga gctggccaag agcctaaagg tggacccccg ggcctgcttc cggcagttct 721 tcactaagat taagacagcc gatcgccagt acatggaggg cttcaacgac gagctggaag 781 ccttcaagga gcgtgtgcgg ggccgtgcca agctgcgcat cgagaaggcc atgaaggagt 841 acgaggagga ggagcgcaag aagcggctcg gccccggcgg cctggacccc gtcgaggtct 901 acgagtccct ccctgaggaa ctccagaagt gcttcgatgt gaaggacgtg cagatgctgc 961 aggacgccat cagcaagatg gaccccaccg acgcaaagta ccacatgcag cgctgcattg 1021 actctggcct ctgggtcccc aactctaagg ccagcgaggc caaggaggga gaggaggcag 1081 gtcctgggga cccattactg gaagctgttc ccaagacggg cgatgagaag gatgtcagtg 1141 tgtgacctgc cccagctacc accgccacct gcttccagcc cctatgtgcc ccttttcaga 1201 aaacagatag atgccatctc gcccgctcct gacttcctct acttgcgctg ctcggcccag 1261 cctgggggcc cgcccagccc tccctggcct ctccactgtc tccactctcc agcgcccatt 1321 caagtctctg ctttgagtca aggggcttca ctgcctgcag ccccccatca gcattatgcc 1381 aaaggccggg ggtccggaag ggcagaggtc accaggctgg tctaccaggt agttggggag 1441 ggtccccagc caaggggccg gctctcgtca ctgggctctg ttttcactgt tcgtctgctg 1501 tctgtgtctt ctatttggca aacagcaatg atcttccaat aaaagatttc agatgctca // LOCUS HSU43142 2015 bp mRNA PRI 10-JAN-1996 DEFINITION Human vascular endothelial growth factor related protein VRP mRNA, complete cds. ACCESSION U43142 NID g1150988 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2015) AUTHORS Lee,J., Gray,A., Yuan,J., Luoh,S.-M., Avraham,H. and Wood,W.I. TITLE Vascular Endothelial Growth Factor Related Protein (VRP): A Ligand and Specific Activator of the Tyrosine Kinase Receptor Flt4 JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1996) In press REFERENCE 2 (bases 1 to 2015) AUTHORS Wood,W.I. TITLE Direct Submission JOURNAL Submitted (12-DEC-1995) William I. Wood, Molecular Biology, Genentech, Inc., 460 Pt. San Bruno Blvd., S. San Francisco, CA 94080, USA FEATURES Location/Qualifiers source 1..2015 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="vh1.4" /cell_line="human glioma cell line G61" source 206..2012 /organism="Homo sapiens" /clone="vh1.6" CDS 372..1631 /note="name is abbreviated as VRP or vh1" /codon_start=1 /function="ligand and activator of the receptor tyrosine kinase Flt4" /product="vascular endothelial growth factor related protein" /db_xref="PID:g1150989" /translation="MHLLGFFSVACSLLAAALLPGPREAPAAAAAFESGLDLSDAEPD AGEATAYASKDLEEQLRSVSSVDELMTVLYPEYWKMYKCQLRKGGWQHNREQANLNSR TEETIKFAAAHYNTEILKSIDNEWRKTQCMPREVCIDVGKEFGVATNTFFKPPCVSVY RCGGCCNSEGLQCMNTSTSYLSKTLFEITVPLSQGPKPVTISFANHTSCRCMSKLDVY RQVHSIIRRSLPATLPQCQAANKTCPTNYMWNNHICRCLAQEDFMFSSDAGDDSTDGF HDICGPNKELDEETCQCVCRAGLRPASCGPHKELDRNSCQCVCKNKLFPSQCGANREF DENTCQCVCKRTCPRNQPLNPGKCACECTESPQKCLLKGKKFHHQTCSCYRRPCTNRQ KACEPGFSYSEEVCRCVPSYWKRPQMS" polyA_site 2015 BASE COUNT 525 a 532 c 499 g 459 t ORIGIN 1 cgcggggtgt tctggtgtcc cccgccccgc ctctccaaaa agctacaccg acgcggaccg 61 cggcggcgtc ctccctcgcc ctcgcttcac ctcgcgggct ccgaatgcgg ggagctcgga 121 tgtccggttt cctgtgaggc ttttacctga cacccgccgc ctttccccgg cactggctgg 181 gagggcgccc tgcaaagttg ggaacgcgga gccccggacc cgctcccgcc gcctccggct 241 cgcccagggg gggtcgccgg gaggagcccg ggggagaggg accaggaggg gcccgcggcc 301 tcgcaggggc gcccgcgccc ccacccctgc ccccgccagc ggaccggtcc cccacccccg 361 gtccttccac catgcacttg ctgggcttct tctctgtggc gtgttctctg ctcgccgctg 421 cgctgctccc gggtcctcgc gaggcgcccg ccgccgccgc cgccttcgag tccggactcg 481 acctctcgga cgcggagccc gacgcgggcg aggccacggc ttatgcaagc aaagatctgg 541 aggagcagtt acggtctgtg tccagtgtag atgaactcat gactgtactc tacccagaat 601 attggaaaat gtacaagtgt cagctaagga aaggaggctg gcaacataac agagaacagg 661 ccaacctcaa ctcaaggaca gaagagacta taaaatttgc tgcagcacat tataatacag 721 agatcttgaa aagtattgat aatgagtgga gaaagactca atgcatgcca cgggaggtgt 781 gtatagatgt ggggaaggag tttggagtcg cgacaaacac cttctttaaa cctccatgtg 841 tgtccgtcta cagatgtggg ggttgctgca atagtgaggg gctgcagtgc atgaacacca 901 gcacgagcta cctcagcaag acgttatttg aaattacagt gcctctctct caaggcccca 961 aaccagtaac aatcagtttt gccaatcaca cttcctgccg atgcatgtct aaactggatg 1021 tttacagaca agttcattcc attattagac gttccctgcc agcaacacta ccacagtgtc 1081 aggcagcgaa caagacctgc cccaccaatt acatgtggaa taatcacatc tgcagatgcc 1141 tggctcagga agattttatg ttttcctcgg atgctggaga tgactcaaca gatggattcc 1201 atgacatctg tggaccaaac aaggagctgg atgaagagac ctgtcagtgt gtctgcagag 1261 cggggcttcg gcctgccagc tgtggacccc acaaagaact agacagaaac tcatgccagt 1321 gtgtctgtaa aaacaaactc ttccccagcc aatgtggggc caaccgagaa tttgatgaaa 1381 acacatgcca gtgtgtatgt aaaagaacct gccccagaaa tcaaccccta aatcctggaa 1441 aatgtgcctg tgaatgtaca gaaagtccac agaaatgctt gttaaaagga aagaagttcc 1501 accaccaaac atgcagctgt tacagacggc catgtacgaa ccgccagaag gcttgtgagc 1561 caggattttc atatagtgaa gaagtgtgtc gttgtgtccc ttcatattgg aaaagaccac 1621 aaatgagcta agattgtact gttttccagt tcatcgattt tctattatgg aaaactgtgt 1681 tgccacagta gaactgtctg tgaacagaga gacccttgtg ggtccatgct aacaaagaca 1741 aaagtctgtc tttcctgaac catgtggata actttacaga aatggactgg agctcatctg 1801 caaaaggcct cttgtaaaga ctggttttct gccaatgacc aaacagccaa gattttcctc 1861 ttgtgatttc tttaaaagaa tgactatata atttatttcc actaaaaata ttgtttctgc 1921 attcattttt atagcaacaa caattggtaa aactcactgt gatcaatatt tttatatcat 1981 gcaaaatatg tttaaaataa aatgaaaatt gtatt // LOCUS HSU43148 6568 bp mRNA PRI 30-MAY-1996 DEFINITION Human patched homolog (PTC) mRNA, complete cds. ACCESSION U43148 NID g1335863 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6568) AUTHORS Hahn,H., Christiansen,J., Wicking,C., Zaphiropolous,P.G., Chidambaram,A., Gerrard,B., Vorechovsky,I., Bale,A.E., Toftgard,R., Dean,M. and Wainwright,B. TITLE A mammalian patched homolog is expressed in target tissues of sonic hedgehog and maps to a region associated with developmental abnormalities JOURNAL J. Biol. Chem. 271 (21), 12125-12128 (1996) MEDLINE 96218118 REFERENCE 2 (bases 1 to 6568) AUTHORS Hahn,H., Christiansen,J., Wicking,C., Zaphiropolous,P.G., Chidambaram,A., Gerrard,B., Vorechovsky,I., Bale,A., Toftgard,R., Dean,M. and Wainwright,B. TITLE Direct Submission JOURNAL Submitted (12-DEC-1995) Michael Dean, NCI-FCRDC, P.O. Box B, Frederick, MD 21702, USA FEATURES Location/Qualifiers source 1..6568 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="9" /map="9q22.3" gene 54..6568 /gene="PTC" CDS 442..4332 /gene="PTC" /note="patched gene homolog; similar to Drosophila patched protein, Swiss-Prot Accession Number P18502; transmembrane protein; Method: conceptual translation supplied by author" /codon_start=1 /db_xref="PID:g1335864" /translation="MFNPQLMIQTPKEEGANVLTTEALLQHLDSALQASRVHVYMYNR QWKLEHLCYKSGELITETGYMDQIIEYLYPCLIITPLDCFWEGAKLQSGTAYLLGKPP LRWTNFDPLEFLEELKKINYQVDSWEEMLNKAEVGHGYMDRPCLNPADPDCPATAPNK NSTKPLDMALVLNGGCHGLSRKYMHWQEELIVGGTVKNSTGKLVSAHALQTMFQLMTP KQMYEHFKGYEYVSHINWNEDKAAAILEAWQRTYVEVVHQSVAQNSTQKVLSFTTTTL DDILKSFSDVSVIRVASGYLLMLAYACLTMLRWDCSKSQGAVGLAGVLLVALSVAAGL GLCSLIGISFNAATTQVLPFLALGVGVDDVFLLAHAFSETGQNKRIPFEDRTGECLKR TGASVALTSISNVTAFFMAALIPIPALRAFSLQAAVVVVFNFAMVLLIFPAILSMDLY RREDRRLDIFCCFTSPCVSRVIQVEPQAYTDTHDNTRYSPPPPYSSHSFAHETQITMQ STVQLRTEYDPHTHVYYTTAEPRSEISVQPVTVTQDTLSCQSPESTSSTRDLLSQFSD SSLHCLEPPCTKWTLSSFAEKHYAPFLLKPKAKVVVIFLFLGLLGVSLYGTTRVRDGL DLTDIVPRETREYDFIAAQFKYFSFYNMYIVTQKADYPNIQHLLYDLHRSFSNVKYVM LEENKQLPKMWLHYFRDWLQGLQDAFDSDWETGKIMPNNYKNGSDDGVLAYKLLVQTG SRDKPIDISQLTKQRLVDADGIINPSAFYIYLTAWVSNDPVAYAASQANIRPHRPEWV HDKADYMPETRLRIPAAEPIEYAQFPFYLNGLRDTSDFVEAIEKVRTICSNYTSLGLS SYPNGYPFLFWEQYIGLRHWLLLFISVVLACTFLVCAVFLLNPWTAGIIVMVLALMTV ELFGMMGLIGIKLSAVPVVILIASVGIGVEFTVHVALAFLTAISDKNRRAVLALEHMF APVLDGAVSTLLGVLMLAGSDFDFIVRYFFAVLAILTILGVLNGLVLLPVLWSFFGPY PEVSPANGLNRLPTPSPEPPPSVVRFAMPPGHTHSGSDSSDSEYSSQTTVSGLSEELR HYEAQQGAGGPAHQVIVEATENPVFAHSTVVHPESRHHPPSNPKQQPHLDSGSLPPGR QGQQPRRDPPRKGLWPPLYRPRRDAFEISTEGHSGPSNRARWGPRGARSHNPRNPTST AMGSSVPGYCQPITTVTASASVTVAVHPPPVPGPGRNPRGGLCPGYPETDHGLFEDPH VPFHVRCERRDSKVEVIELQDVECEERPRGSSSN" repeat_region 6198..6241 BASE COUNT 1499 a 1781 c 1676 g 1609 t 3 others ORIGIN 1 gaaggcgagc acccagacgg gggcccgccg gggtcgcggc cagcgccggg gaaatgccgc 61 gccggggagc agcatgcgcc ggcctgagcc cttccctttg cactcggctg ttttttacgt 121 ttaaccagaa aggaagggag aggagggaaa gatccatgtg gctgccctct tccgatcaca 181 aatattgtcg ggaaggctac tggccggaaa gcgccgctgt ggctgagagc gaagtttcag 241 agactcttat ttaaactggg ttgttacatt caaaaaaact gcggcaagtt cttggttgtg 301 ggcctcctca tatttggggc cttcgcggtg ggattaaaag cagcgaacct cgagaccaac 361 gtggaggagc tgtgggtgga agttggagga cgagtaagtc gtgaattaaa ttatactcgc 421 cagaagattg gagaagaggc tatgtttaat cctcaactca tgatacagac ccctaaagaa 481 gaaggtgcta atgtcctgac cacagaagcg ctcctacaac acctggactc ggcactccag 541 gccagccgtg tccatgtata catgtacaac aggcagtgga aattggaaca tttgtgttac 601 aaatcaggag agcttatcac agaaacaggt tacatggatc agataataga atatctttac 661 ccttgtttga ttattacacc tttggactgc ttctgggaag gggcgaaatt acagtctggg 721 acagcatacc tcctaggtaa acctcctttg cggtggacaa acttcgaccc tttggaattc 781 ctggaagagt taaagaaaat aaactatcaa gtggacagct gggaggaaat gctgaataag 841 gctgaggttg gtcatggtta catggaccgc ccctgcctca atccggccga tccagactgc 901 cccgccacag cccccaacaa aaattcaacc aaacctcttg atatggccct tgttttgaat 961 ggtggatgtc atggcttatc cagaaagtat atgcactggc aggaggagtt gattgtgggt 1021 ggcacagtca agaacagcac tggaaaactc gtcagcgccc atgccctgca gaccatgttc 1081 cagttaatga ctcccaagca aatgtacgag cacttcaagg ggtacgagta tgtctcacac 1141 atcaactgga acgaggacaa agcggcagcc atcctggagg cctggcagag gacatatgtg 1201 gaggtggttc atcagagtgt cgcacagaac tccactcaaa aggtgctttc cttcaccacc 1261 acgaccctgg acgacatcct gaaatccttc tctgacgtca gtgtcatccg cgtggccagc 1321 ggctacttac tcatgctcgc ctatgcctgt ctaaccatgc tgcgctggga ctgctccaag 1381 tcccagggtg ccgtggggct ggctggcgtc ctgctggttg cactgtcagt ggctgcagga 1441 ctgggcctgt gctcattgat cggaatttcc tttaacgctg caacaactca ggttttgcca 1501 tttctcgctc ttggtgttgg tgtggatgat gtttttcttc tggcccacgc cttcagtgaa 1561 acaggacaga ataaaagaat cccttttgag gacaggaccg gggagtgcct gaagcgcaca 1621 ggagccagcg tggccctcac gtccatcagc aatgtcacag ccttcttcat ggccgcgtta 1681 atcccaattc ccgctctgcg ggcgttctcc ctccaggcag cggtagtagt ggtgttcaat 1741 tttgccatgg ttctgctcat ttttcctgca attctcagca tggatttata tcgacgcgag 1801 gacaggagac tggatatttt ctgctgtttt acaagcccct gcgtcagcag agtgattcag 1861 gttgaacctc aggcctacac cgacacacac gacaataccc gctacagccc cccacctccc 1921 tacagcagcc acagctttgc ccatgaaacg cagattacca tgcagtccac tgtccagctc 1981 cgcacggagt acgaccccca cacgcacgtg tactacacca ccgctgagcc gcgctccgag 2041 atctctgtgc agcccgtcac cgtgacacag gacaccctca gctgccagag cccagagagc 2101 accagctcca caagggacct gctctcccag ttctccgact ccagcctcca ctgcctcgag 2161 cccccctgta cgaagtggac actctcatct tttgctgaga agcactatgc tcctttcctc 2221 ttgaaaccaa aagccaaggt agtggtgatc ttcctttttc tgggcttgct gggggtcagc 2281 ctttatggca ccacccgagt gagagacggg ctggacctta cggacattgt acctcgggaa 2341 accagagaat atgactttat tgctgcacaa ttcaaatact tttctttcta caacatgtat 2401 atagtcaccc agaaagcaga ctacccgaat atccagcact tactttacga cctacacagg 2461 agtttcagta acgtgaagta tgtcatgttg gaagaaaaca aacagcttcc caaaatgtgg 2521 ctgcactact tcagagactg gcttcaggga cttcaggatg catttgacag tgactgggaa 2581 accgggaaaa tcatgccaaa caattacaag aatggatcag acgatggagt ccttgcctac 2641 aaactcctgg tgcaaaccgg cagccgcgat aagcccatcg acatcagcca gttgactaaa 2701 cagcgtctgg tggatgcaga tggcatcatt aatcccagcg ctttctacat ctacctgacg 2761 gcttgggtca gcaacgaccc cgtcgcgtat gctgcctccc aggccaacat ccggccacac 2821 cgaccagaat gggtccacga caaagccgac tacatgcctg aaacaaggct gagaatcccg 2881 gcagcagagc ccatcgagta tgcccagttc cctttctacc tcaacggctt gcgggacacc 2941 tcagactttg tggaggcaat tgaaaaagta aggaccatct gcagcaacta tacgagcctg 3001 gggctgtcca gttaccccaa cggctacccc ttcctcttct gggagcagta catcggcctc 3061 cgccactggc tgctgctgtt catcagcgtg gtgttggcct gcacattcct cgtgtgcgct 3121 gtcttccttc tgaacccctg gacggccggg atcattgtga tggtcctggc gctgatgacg 3181 gtcgagctgt tcggcatgat gggcctcatc ggaatcaagc tcagtgccgt gcccgtggtc 3241 atcctgatcg cttctgttgg cataggagtg gagttcaccg ttcacgttgc tttggccttt 3301 ctgacggcca tcagcgacaa gaaccgcagg gctgtgcttg ccctggagca catgtttgca 3361 cccgtcctgg atggcgccgt gtccactctg ctgggagtgc tgatgctggc gggatctgac 3421 ttcgacttca ttgtcaggta tttctttgct gtgctggcaa tcctcaccat cctcggcgtt 3481 ctcaatgggc tggttttgct tcccgtgctt tggtctttct ttggaccata tcctgaggtg 3541 tctccagcca acggcttgaa ccgcctgccc acaccctccc ctgagccacc ccccagcgtg 3601 gtccgcttcg ccatgccgcc cggccacacg cacagcgggt ctgattcctc cgactcggag 3661 tatagttccc agacgacagt gtcaggcctc agcgaggagc ttcggcacta cgaggcccag 3721 cagggcgcgg gaggccctgc ccaccaagtg atcgtggaag ccacagaaaa ccccgtcttc 3781 gcccactcca ctgtggtcca tcccgaatcc aggcatcacc caccctcgaa cccgaaacag 3841 cagccccacc tggactcagg gtccctgcct cccggacggc aaggccagca gccccgcagg 3901 gaccccccca gaaaaggctt gtggccaccc ctctacagac cgcgcagaga cgcttttgaa 3961 atttctactg aagggcattc tggccctagc aatagggccc gctggggccc tcgcggggcc 4021 cgttctcaca accctcggaa cccaacgtcc actgccatgg gcagctccgt gcccggctac 4081 tgccagccca tcaccactgt gacggcttct gcctccgtga ctgtcgccgt gcacccgccg 4141 cctgtccctg ggcctgggcg gaacccccga gggggactct gcccaggcta ccctgagact 4201 gaccacggcc tgtttgagga cccccacgtg cctttccacg tccggtgtga gaggagggat 4261 tcgaaggtgg aagtcattga gctgcaggac gtggaatgcg aggagaggcc ccggggaagc 4321 agctccaact gagggtgatt aaaatctgaa gcaaagaggc caaagattgg aaacccccca 4381 cccccacctc tttccagaac tgcttgaaga gaactggttg gagttatgga aaagatgccc 4441 tgtgccagga cagcagttca ttgttactgt aaccgattgt attattttgt taaatatttc 4501 tataaatatt taagagatgt acacatgtgt aatataggaa ggaaggatgt aaagtggtat 4561 gatctgggcc ttctccactc ctgccccaga gtgtggaggc cacagtgggg cctctccgta 4621 tttgtgcatt gggctccgtg ccacaaccaa gcttcattag tcttaaattt cagcatatgt 4681 tgctgctgct taaatattgt ataatttact tgtataattc tatgcaaata ttgcttatgt 4741 aataggatta ttttgtaaag gtttctgttt aaaatatttt aaatttgcat atcacaaccc 4801 tgtggtagta tgaaatgtta ctgttaactt tcaaacacgc tatgcgtgat aatttttttg 4861 tttaatgagc agatatgaag aaagcacgtt aatcctggtg gcttctctag gtgtcgttgt 4921 gtgcggtcct cttgtttggc tgtgcgtgtg aacacgtgtg tgagttcacc atgtactgta 4981 ctgtgatttt tttttttgtc ttgttttgtt tctctacact gtctgtaacc tgtagtaggc 5041 tctgacctat tcaggctgga aagcgtcagg atatcttttc ttcgtgctgg tgagggctgg 5101 ccctaaacat ccacctaatc ctttcaaatc agcccggcaa aagctaaact ctcctcgtgt 5161 ctacgggcat ctgttatgat cattggctgc catccaggac cccaatttgt gcttcagggg 5221 gataatctcc ttctctcgga tcattgtgat ggatgctgga acctcagggt atggagctca 5281 catcagttca tcatggtggg tgttagagaa ttcggtgaca tgcctagtgc tgagccttgg 5341 ctgggccatg agagtctgta taataaaaaa agcatgcagc atggtgcccc tcttttgacc 5401 aacacacaca agacccctcc cccaacaccc ccaaattcaa gagtggatgt ggccctgtca 5461 caggtagaaa aacctattta gttaattctt tcttggccca cagtctccca gaaatgatgt 5521 tttgagtccc tatagtttaa agtccctctc ttaaatggag cagctggttt gaggtttcta 5581 aatctgtttg cattttcttt aaaattaagt ggtgagcatg cattgtggtg tagaggcagg 5641 cattatgtag gataagagct ccggggggat tcttcatgca ccagtgttta gggtacgtgc 5701 ttcctaagta aatccaaaca ttgtctccat cctccccgtc attagtgctc tttcaatgtg 5761 atgtgggaaa gcaggaggat ggacacaccc cactgaaaga tgtaggcagg ggcaggtctc 5821 tcaaccaggc atatttttaa aagttgcttc tgtactggtt ctcttctttt gctctgaggt 5881 gtgggctccc tcatctcgta accagagacc agcacatgtc agggaagcac ccagtgtcgg 5941 ctccccatcc caatccacac cagcaccttg ttacagacaa gaagtcagag gaaagggcgg 6001 ggtccctgca gggctgaagc ctaagctact gtgaggtgct cacaagtggc agctcctgta 6061 atccctttta aattacgtgg gaatcttaac agaaagtaat gggcccccag aaatacccac 6121 agcataggac ntcagaccct gaactcacca caaaatttta agatgctgat tgggagccgc 6181 ttgtggctgc tggatgngtg tgtgtgtgtg tgtgtgtgcg tgcgtgcgtg tgtgtgtgtg 6241 tctgntgggg accctggcca cccccctgct gctgtcttgg tgcctgtcac ccacatggtc 6301 tgccatccta acacccagct ctgctcagaa aacgtcctgc gtggaggagg gatgatgcag 6361 aattctgaag tcgacttccc tctggctcct ggcgtgccct cgctcccttc ctgagcccag 6421 ctcgtgttgc gccggaggct gcgcggcccc tgatttctgc atggtgtaga actttctcca 6481 atagtcacat tggcaaaggg agaactgggg tgggcggggg gtggggctgg cagggaatta 6541 gcatttctct ctctctttta atagttaa // LOCUS HSU43168 3800 bp mRNA PRI 28-MAR-1996 DEFINITION Human leptin receptor (Ob-r) mRNA, complete cds. ACCESSION U43168 NID g1139594 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3800) AUTHORS Tartaglia,L.A., Dembski,M., Weng,X., Deng,N., Culpepper,J., Devos,R., Richards,G.J., Campfield,L.A., Clark,F.T., Deeds,J., Muir,C., Sanker,S., Moriarty,A., Moore,K.J., Smutko,J.S., Mays,G.G., Woolf,E.A., Selent-Munro,C. and Tepper,R.I. TITLE Identification and expression cloning of a leptin receptor, OB-R JOURNAL Cell 83 (7), 1263-1271 (1995) MEDLINE 96128129 REFERENCE 2 (bases 1 to 3800) AUTHORS Tartaglia,L.A. TITLE Direct Submission JOURNAL Submitted (12-DEC-1995) Louis A. Tartaglia, Millennium Pharmaceuticals, 640 Memorial Drive, Cambridge, MA 02139 FEATURES Location/Qualifiers source 1..3800 /organism="Homo sapiens" /db_xref="taxon:9606" gene 194..3691 /gene="0b-r" CDS 194..3691 /gene="0b-r" /note="OB-R" /codon_start=1 /product="leptin receptor" /db_xref="PID:g1139595" /translation="MICQKFCVVLLHWEFIYVITAFNLSYPITPWRFKLSCMPPNSTY DYFLLPAGLSKNTSNSNGHYETAVEPKFNSSGTHFSNLSKTTFHCCFRSEQDRNCSLC ADNIEGKTFVSTVNSLVFQQIDANWNIQCWLKGDLKLFICYVESLFKNLFRNYNYKVH LLYVLPEVLEDSPLVPQKGSFQMVHCNCSVHECCECLVPVPTAKLNDTLLMCLKITSG GVIFQSPLMSVQPINMVKPDPPLGLHMEITDDGNLKISWSSPPLVPFPLQYQVKYSEN STTVIREADKIVSATSLLVDSILPGSSYEVQVRGKRLDGPGIWSDWSTPRVFTTQDVI YFPPKILTSVGSNVSFHCIYKKENKIVPSKEIVWWMNLAEKIPQSQYDVVSDHVSKVT FFNLNETKPRGKFTYDAVYCCNEHECHHRYAELYVIDVNINISCETDGYLTKMTCRWS TSTIQSLAESTLQLRYHRSSLYCSDIPSIHPISEPKDCYLQSDGFYECIFQPIFLLSG YTMWIRINHSLGSLDSPPTCVLPDSVVKPLPPSSVKAEITINIGLLKISWEKPVFPEN NLQFQIRYGLSGKEVQWKMYEVYDAKSKSVSLPVPDLCAVYAVQVRCKRLDGLGYWSN WSNPAYTVVMDIKVPMRGPEFWRIINGDTMKKEKNVTLLWKPLMKNDSLCSVQRYVIN HHTSCNGTWSEDVGNHTKFTFLWTEQAHTVTVLAINSIGASVANFNLTFSWPMSKVNI VQSLSAYPLNSSCVIVSWILSPSDYKLMYFIIEWKNLNEDGEIKWLRISSSVKKYYIH DHFIPIEKYQFSLYPIFMEGVGKPKIINSFTQDDIEKHQSDAGLYVIVPVIISSSILL LGTLLISHQRMKKLFWEDVPNPKNCSWAQGLNFQKPETFEHLFIKHTASVTCGPLLLE PETISEDISVDTSWKNKDEMMPTTVVSLLSTTDLEKGSVCISDQFNSVNFSEAEGTEV TYEAESQRQPFVKYATLISNSKPSETGEEQGLINSSVTKCFSSKNSPLKDSFSNSSWE IEAQAFFILSDQHPNIISPHLTFSEGLDELLKLEGNFPEENNDKKSIYYLGVTSIKKR ESGVLLTDKSRVSCPFPAPCLFTDIRVLQDSCSHFVENNINLGTSSKKTFASYMPQFQ TCSTQTHKIMENKMCDLTV" BASE COUNT 1154 a 715 c 778 g 1153 t ORIGIN 1 ggcacgagcc ggtctggctt gggcaggctg cccgggccgt ggcaggaagc cggaagcagc 61 cgcggcccca gttcgggaga catggcgggc gttaaagctc tcgtggcatt atccttcagt 121 ggggctattg gactgacttt tcttatgctg ggatgtgcct tagaggatta tgggtgtact 181 tctctgaagt aagatgattt gtcaaaaatt ctgtgtggtt ttgttacatt gggaatttat 241 ttatgtgata actgcgttta acttgtcata tccaattact ccttggagat ttaagttgtc 301 ttgcatgcca ccaaattcaa cctatgacta cttccttttg cctgctggac tctcaaagaa 361 tacttcaaat tcgaatggac attatgagac agctgttgaa cctaagttta attcaagtgg 421 tactcacttt tctaacttat ccaaaacaac tttccactgt tgctttcgga gtgagcaaga 481 tagaaactgc tccttatgtg cagacaacat tgaaggaaag acatttgttt caacagtaaa 541 ttctttagtt tttcaacaaa tagatgcaaa ctggaacata cagtgctggc taaaaggaga 601 cttaaaatta ttcatctgtt atgtggagtc attatttaag aatctattca ggaattataa 661 ctataaggtc catcttttat atgttctgcc tgaagtgtta gaagattcac ctctggttcc 721 ccaaaaaggc agttttcaga tggttcactg caattgcagt gttcatgaat gttgtgaatg 781 tcttgtgcct gtgccaacag ccaaactcaa cgacactctc cttatgtgtt tgaaaatcac 841 atctggtgga gtaattttcc agtcacctct aatgtcagtt cagcccataa atatggtgaa 901 gcctgatcca ccattaggtt tgcatatgga aatcacagat gatggtaatt taaagatttc 961 ttggtccagc ccaccattgg taccatttcc acttcaatat caagtgaaat attcagagaa 1021 ttctacaaca gttatcagag aagctgacaa gattgtctca gctacatccc tgctagtaga 1081 cagtatactt cctgggtctt cgtatgaggt tcaggtgagg ggcaagagac tggatggccc 1141 aggaatctgg agtgactgga gtactcctcg tgtctttacc acacaagatg tcatatactt 1201 tccacctaaa attctgacaa gtgttgggtc taatgtttct tttcactgca tctataagaa 1261 ggaaaacaag attgttccct caaaagagat tgtttggtgg atgaatttag ctgagaaaat 1321 tcctcaaagc cagtatgatg ttgtgagtga tcatgttagc aaagttactt ttttcaatct 1381 gaatgaaacc aaacctcgag gaaagtttac ctatgatgca gtgtactgct gcaatgaaca 1441 tgaatgccat catcgctatg ctgaattata tgtgattgat gtcaatatca atatctcatg 1501 tgaaactgat gggtacttaa ctaaaatgac ttgcagatgg tcaaccagta caatccagtc 1561 acttgcggaa agcactttgc aattgaggta tcataggagc agcctttact gttctgatat 1621 tccatctatt catcccatat ctgagcccaa agattgctat ttgcagagtg atggttttta 1681 tgaatgcatt ttccagccaa tcttcctatt atctggctac acaatgtgga ttaggatcaa 1741 tcactctcta ggttcacttg actctccacc aacatgtgtc cttcctgatt ctgtggtgaa 1801 gccactgcct ccatccagtg tgaaagcaga aattactata aacattggat tattgaaaat 1861 atcttgggaa aagccagtct ttccagagaa taaccttcaa ttccagattc gctatggttt 1921 aagtggaaaa gaagtacaat ggaagatgta tgaggtttat gatgcaaaat caaaatctgt 1981 cagtctccca gttccagact tgtgtgcagt ctatgctgtt caggtgcgct gtaagaggct 2041 agatggactg ggatattgga gtaattggag caatccagcc tacacagttg tcatggatat 2101 aaaagttcct atgagaggac ctgaattttg gagaataatt aatggagata ctatgaaaaa 2161 ggagaaaaat gtcactttac tttggaagcc cctgatgaaa aatgactcat tgtgcagtgt 2221 tcagagatat gtgataaacc atcatacttc ctgcaatgga acatggtcag aagatgtggg 2281 aaatcacacg aaattcactt tcctgtggac agagcaagca catactgtta cggttctggc 2341 catcaattca attggtgctt ctgttgcaaa ttttaattta accttttcat ggcctatgag 2401 caaagtaaat atcgtgcagt cactcagtgc ttatccttta aacagcagtt gtgtgattgt 2461 ttcctggata ctatcaccca gtgattacaa gctaatgtat tttattattg agtggaaaaa 2521 tcttaatgaa gatggtgaaa taaaatggct tagaatctct tcatctgtta agaagtatta 2581 tatccatgat cattttatcc ccattgagaa gtaccagttc agtctttacc caatatttat 2641 ggaaggagtg ggaaaaccaa agataattaa tagtttcact caagatgata ttgaaaaaca 2701 ccagagtgat gcaggtttat atgtaattgt gccagtaatt atttcctctt ccatcttatt 2761 gcttggaaca ttattaatat cacaccaaag aatgaaaaag ctattttggg aagatgttcc 2821 gaaccccaag aattgttcct gggcacaagg acttaatttt cagaagccag aaacgtttga 2881 gcatcttttt atcaagcata cagcatcagt gacatgtggt cctcttcttt tggagcctga 2941 aacaatttca gaagatatca gtgttgatac atcatggaaa aataaagatg agatgatgcc 3001 aacaactgtg gtctctctac tttcaacaac agatcttgaa aagggttctg tttgtattag 3061 tgaccagttc aacagtgtta acttctctga ggctgagggt actgaggtaa cctatgaggc 3121 cgaaagccag agacaaccct ttgttaaata cgccacgctg atcagcaact ctaaaccaag 3181 tgaaactggt gaagaacaag ggcttataaa tagttcagtc accaagtgct tctctagcaa 3241 aaattctccg ttgaaggatt ctttctctaa tagctcatgg gagatagagg cccaggcatt 3301 ttttatatta tcagatcagc atcccaacat aatttcacca cacctcacat tctcagaagg 3361 attggatgaa cttttgaaat tggagggaaa tttccctgaa gaaaataatg ataaaaagtc 3421 tatctattat ttaggggtca cctcaatcaa aaagagagag agtggtgtgc ttttgactga 3481 caagtcaagg gtatcgtgcc cattcccagc cccctgttta ttcacggaca tcagagttct 3541 ccaggacagt tgctcacact ttgtagaaaa taatatcaac ttaggaactt ctagtaagaa 3601 gacttttgca tcttacatgc ctcaattcca aacttgttct actcagactc ataagatcat 3661 ggaaaacaag atgtgtgacc taactgtgta atttcactga agaaaccttc agatttgtgt 3721 tataatgggt aatataaagt gtaatagatt atagttgtgg gtgggagaga gaaaagaaac 3781 cagagtccaa atttgaaaat // LOCUS HSU43177 542 bp DNA PRI 16-OCT-1996 DEFINITION Human urocortin gene, complete cds. ACCESSION U43177 NID g1292909 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 542) AUTHORS Donaldson,C.J., Sutton,S.W., Perrin,M.H., Corrigan,A.Z., Lewis,K.A., Rivier,J.E., Vaughan,J.M. and Vale,W.W. TITLE Cloning and characterization of human urocortin JOURNAL Endocrinology 137 (5), 2167-2170 (1996) MEDLINE 96198824 REMARK Erratum:[Endocrinology 1996;137(9):3896] REFERENCE 2 (bases 1 to 542) AUTHORS Donaldson,C. TITLE Direct Submission JOURNAL Submitted (12-DEC-1995) Cynthia Donaldson, Peptide Biology Laboratory, The Salk Institute, 10010 N. Torrey Pines Road, La Jolla, CA 92037, USA FEATURES Location/Qualifiers source 1..542 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="2" CDS 41..415 /codon_start=1 /product="urocortin" /db_xref="PID:g1292910" /translation="MRQAGRAALLAALLLLVQLCPGSSQRSPEAAGVQDPSLRWSPGA RNQGGGARALLLLLAERFPRRAGPGRLGLGTAGERPRRDNPSLSIDLTFHLLRTLLEL ARTQSQRERAEQNRIIFDSVGK" BASE COUNT 81 a 181 c 195 g 85 t ORIGIN 1 caccctgcgc tgcccctgtg tgtccagggc cggcggcacc atgaggcagg cgggacgcgc 61 agcgctgctg gccgcgctgc tgctcctggt acagctgtgc cctgggagca gccagaggag 121 ccccgaggcg gccggggtcc aggacccgag tctgcgctgg agccccgggg cacggaacca 181 gggtggcggg gcccgcgcgc tcctcttgct gctggcggag cgcttcccgc gccgcgcggg 241 gcccggccga ttgggactcg ggacggcagg cgagcggccg cggcgggaca acccttctct 301 gtccattgac ctcacctttc acctgctgcg gaccctgctg gagctggcgc ggacgcagag 361 ccagcgggag cgcgccgagc agaaccgcat catattcgac tcggtgggca agtgatggcc 421 cggtttgggg ctgcgaaaac gttgacccct ttcccccacc ccagagttgg gatgcggggc 481 agagccacca gggcactgtc tgcgtgacta ttttttaata aaagtactga agacccgttg 541 gc // LOCUS HSU43188 3222 bp mRNA PRI 05-DEC-1996 DEFINITION Human Ets transcription factor (NERF-2) mRNA, complete cds. ACCESSION U43188 NID g1420888 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3222) AUTHORS Oettgen,P., Akbarali,Y., Boltax,J., Best,J., Kunsch,C. and Libermann,T.A. TITLE Characterization of NERF, a novel transcription factor related to the Ets factor ELF-1 JOURNAL Mol. Cell. Biol. 16 (9), 5091-5106 (1996) MEDLINE 96347578 REFERENCE 2 (bases 1 to 3222) AUTHORS Libermann,T.A., Oettgen,P., Kunsch,C., Akbarali,Y. and Boltax,J. TITLE Direct Submission JOURNAL Submitted (13-DEC-1995) Towia A. Libermann, Medicine, Beth Israel Hospital, 330 Brookline Avenue, Boston, MA 02215, USA FEATURES Location/Qualifiers source 1..3222 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="spleen; fetal liver; fetal brain" gene 207..1952 /gene="NERF-2" CDS 207..1952 /gene="NERF-2" /codon_start=1 /product="Ets transcription factor" /db_xref="PID:g1420889" /translation="MTSAVVDSGGTILELSSNGVENQEESEKVSEYPAVIVEPVPSAR LEQGYAAQVLVYDDETYMMQDVAEEQEVETENVETVEASVHSSNAHCTDKTIEAAEAL LHMESPTCLRDSRSPEFIHAAMRPDVITETVVEVSTEESEPMDTSPIPTSPDSHEPMK KKKVGRKPKTQQSPISNGSPELGIKKKPREGKGNTTYLWEFLLDLLQDKNTCPRYIKW TQREKGIFKLVDSKAVSKLWGKHKNKPDMNYETMGRALRYYYQRGILAKVEGQRLVYQ FKDMPKNIVVIDDDKSETCNEDLAGTTDEKSLERVSLSAESLLKAASSVRSGKNSSPI NCSRAEKGVARVVNITSPGHDASSRSPTTTASVSATAAPRTVRVAMQVPVVMTSLGQK ISTVAVQSVNAGAPLITSTSPTTATSPKVVIQTIPTVMPASTENGDKITMQPAKIITI PATQLAQCQLQTKSNLTGSGSINIVGTPLAVRALTPVSIAHGTPVMRLSMPTQQASGQ TPPRVISAVIKGPEVKSEAVAKKQEHDVKTLQLVEEKPADGNKTVTHVVVVSAPSAIA LPVTMKTEGLVTCEK" BASE COUNT 1055 a 592 c 658 g 917 t ORIGIN 1 gtgttagtga aggatgctta gactacttaa catacaaact gctttctggt taatcatctt 61 tagaagactg gatttctgga tatctactcc actccatctc tattgacttt taaaacatga 121 taatgcaaac ctataacact ggcaaccatc agtgaacctt taatttcatt gattaatagc 181 gtttgaagct tcctcaggga ataacaatga catcagcagt ggttgacagt ggaggtacta 241 ttttggagct ttccagcaat ggagtagaaa atcaagagga aagtgaaaag gtttctgaat 301 atccagcagt gattgtggag ccagttccaa gtgccagatt agagcagggg tatgcagccc 361 aggttctggt ttatgatgat gagacttata tgatgcaaga tgtggcagaa gaacaagaag 421 ttgagaccga gaatgtggaa acagtggaag catcagttca cagcagtaat gcacactgta 481 cagataagac aattgaagct gctgaagccc tgcttcatat ggaatctcct acctgcttga 541 gggattcaag aagtcctgaa ttcatccatg ctgctatgag gccagatgtc attacagaaa 601 ctgtagtgga ggtgtcaact gaagagtctg aacccatgga tacctctcct attccaacat 661 caccagatag ccatgaacca atgaaaaaga aaaaagttgg ccgtaaacca aagacccagc 721 aatcaccaat ttccaatggg tctcctgagt taggtataaa gaagaaacca agagaaggaa 781 aaggaaacac aacctatttg tgggagtttc ttttagatct acttcaagat aaaaatactt 841 gtcccaggta tattaaatgg actcagagag aaaaaggcat attcaagctg gtggattcaa 901 aggctgtctc taagctttgg ggaaagcata agaacaaacc agacatgaac tatgaaacca 961 tgggacgagc tttgagatac tactaccaaa ggggaattct tgcaaaggtt gaaggacaga 1021 ggcttgtata tcagttcaag gatatgccga aaaacatagt ggtcatagat gatgacaaaa 1081 gtgaaacctg taatgaagat ttagcaggaa ctactgatga aaaatcatta gaacgagtgt 1141 cactgtctgc agaaagtctc ctgaaagcag catcctctgt tcgcagtgga aaaaattcat 1201 cccctataaa ctgctccaga gcagagaagg gtgtagctag agttgtgaat atcacttccc 1261 ctgggcacga tgcttcatcc aggtctccta ctaccactgc atctgtgtca gcaacagcag 1321 ctccaaggac agttcgtgtg gcaatgcagg tacctgttgt aatgacatca ttgggtcaga 1381 aaatttcaac tgtggcagtt cagtcagtta atgcaggtgc accattaata accagcacta 1441 gtccaacaac agcgacctct ccaaaggtag tcattcagac aatccctact gtgatgccag 1501 cttctactga aaatggagac aaaatcacca tgcagcctgc caaaattatt accatcccag 1561 ctacacagct tgcacagtgt caactgcaga caaagtcaaa tctgactgga tcaggaagca 1621 ttaacattgt tggaacccca ttggctgtga gagcacttac ccctgtttca atagcccatg 1681 gtacacctgt aatgagacta tcaatgccta ctcagcaggc atctggccag actcctcctc 1741 gagttatcag tgcagtcata aaggggccag aggttaaatc ggaagcagtg gcaaaaaagc 1801 aagaacatga tgtgaaaact ttgcagctag tagaagaaaa accagcagat ggaaataaga 1861 cagtgaccca cgtagtggtt gtcagtgcgc cttcagctat tgcccttcct gtaactatga 1921 aaacagaagg actagtgaca tgtgagaaat aaaatagcag ctccaccatg gacttcaggc 1981 tgttagtggc agtactgaca taaacatttg caagggaagt catcaagaaa agtccaaaga 2041 agactttaaa acatttttaa tgcatataca aaaacaatca gacttactgg aaataaatta 2101 cctatcccat gtttcagtgg gaaatgaact acatattgag atgctgacag aaaactgcct 2161 cttacagtag gaaacaactg aacccatcaa taagaaaaag gatcgaaagg gaccaagcag 2221 ctcactacga tatcaagtta cactaagact tggaacacta acattctgta agaggttata 2281 tagtttttca gtgggagggg ttgggatggg taatctcatt gttacatata gcaatttttg 2341 atgcatttta tatgcatacc agcaattatt actgtgttcg cacagttctc acttaactgg 2401 tgctatgtga agactctgct aatataggta ttttagaatg tgaattgaag aatggatccc 2461 aaaaacttca gaaagaggat agcaaaaaaa gatctagtgc gattttatat atatatatat 2521 atatatatac atacatatat atatatcata tagcttaagc tgatttaaaa caaaggcctt 2581 agactaattt tcgattttct ttcttgaaat aagctaatgg cttgtttgtg taaagctttt 2641 ttattaaaag aaaaatttta aaaatcttgt acctagcaca gtattgttat agaatataca 2701 tgtaacattt tatatggtag tttaagtctg tcagtttctt aattgtggac aaattaacag 2761 ttggctctgg ccttttgctg taacatgcct gtgtcactca cttagccttg gcatttgtgc 2821 agacatacca ttttcagttc tgctgtcact tggaagttca ggctcagcat gaatttttgg 2881 caggtagctc taatacctgg agttttcttt gttttttttt ctttttttta gttgaagttt 2941 atgagggaaa taccagtgtt cagttttgaa ctataatagt ttgtatattc aacatttgaa 3001 gtatattcta ttttgttgta ctcttgtttc aaagtgtatt caagtaggtt ttctgaaata 3061 tagaaatgaa atttatcttc tgttttggtc tctggtgata ttttaaacaa tatttaaaag 3121 tcagtataga agtgttttag ttaggaagtg ataaaacatc tctcttctcc ttcccaacta 3181 ctgcatgaag aaattctact tccattatat taatatttgg gc // LOCUS HSU43195 4065 bp mRNA PRI 25-JUN-1996 DEFINITION Human Rho-associated, coiled-coil containing protein kinase p160ROCK mRNA, complete cds. ACCESSION U43195 NID g1276900 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4065) AUTHORS Ishizaki,T., Maekawa,M., Fujisawa,K., Okawa,K., Iwamatsu,A., Fujita,A., Watanabe,N., Saito,Y., Kakizuka,A., Morii,N. and Narumiya,S. TITLE The small GTP-binding protein Rho binds to and activates a 160 kDa Ser/Thr protein kinase homologous to myotonic dystrophy kinase JOURNAL EMBO J. 15 (8), 1885-1893 (1996) MEDLINE 96203110 REFERENCE 2 (bases 1 to 4065) AUTHORS Narumiya,S. and Ishizaki,T. TITLE Direct Submission JOURNAL Submitted (13-DEC-1995) Toshimasa Ishizaki, Pharmacology, Kyoto University Faculty of Medicine, Yoshida Konoe-chou, Sakyou-ku, Kyoto, Kyoto 606, Japan FEATURES Location/Qualifiers source 1..4065 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="platelet" CDS 1..4065 /codon_start=1 /function="serine/threonine protein kinase" /product="Rho-associated, coiled-coil containing protein kinase p160ROCK" /db_xref="PID:g1276901" /translation="MSTGDSFETRFEKMDNLLRDPKSEVNSDCLLDGLDALVYDLDFP ALRKNKNIDNFLSRYKDTINKIRDLRMKAEDYEVVKVIGRGAFGEVQLVRHKSTRKVY AMKLLSKFEMIKRSDSAFFWEERDIMAFANSPWVVQLFYAFQDDRYLYMVMEYMPGGD LVNLMSNYDVPEKWARFYTAEVVLALDAIHSMGFIHRDVKPDNMLLDKSGHLKLADFG TCMKMNKEGMVRCDTAVGTPDYISPEVLKSQGGDGYYGRECDWWSVGVFLYEMLVGDT PFYADSLVGTYSKIMNHKNSLTFPDDNDISKEAKNLICAFLTDREVRLGRNGVEEIKR HLFFKNDQWAWETLRDTVAPVVPDLSSDIDTSNFDDLEEDKGEEETFPIPKAFVGNQL PFVGFTYYSNRRYLSSANPNDNRTSSNADKSLQESLQKTIYKLEEQLHNEMQLKDEME QKCRTSNIKLDKIMKELDEEGNQRRNLESTVSQIEKEKMLLQHRINEYQRKAEQENEK RRNVENEVSTLKDQLEDLKKVSQNSQLANEKLSQLQKQLEEANDLLRTESDTAVRLRK SHTEMSKSISQLESLNRELQERNRILENSKSQTDKDYYQLQAILEAERRDRGHDSEMI GDLQARITSLQEEVKHLKHNLEKVEGERKEAQDMLNHSEKEKNNLEIDLNYKLKSLQQ RLEQEVNEHKVTKARLTDKHQSIEEAKSVAMCEMEKKLKEEREAREKAENRVVQIEKQ CSMLDVDLKQSQQKLEHLTGNKERMEDEVKNLTLQLEQESNKRLLLQNELKTQAFEAD NLKGLEKQMKQEINTLLEAKRLLEFELAQLTKQYRGNEGQMRELQDQLEAEQYFSTLY KTQVKELKEEIEEKNRENLKKIQELQNEKETLATQLDLAETKAESEQLARGLLEEQYF ELTQESKKAASRNRQEITDKDHTVSRLEEANSMLTKDIEILRRENEELTEKMKKAEEE YKLEKEEEISNLKAAFEKNINTERTLKTQAVNKLAEIMNRKDFKIDRKKANTQDLRKK EKENRKLQLELNQEREKFNQMVVKHQKELNDMQAQLVEECAHRNELQMQLASKESDIE QLRAKLLDLSDSTSVASFPSADETDGNLPESRIEGWLSVPNRGNIKRYGWKKQYVVVS SKKILFYNDEQDKEQSNPSMVLDIDKLFHVRPVTQGDVYRAETEEIPKIFQILYANEG ECRKDVEMEPVQQAEKTNFQNHKGHEFIPTLYHFPANCDACAKPLWHVFKPPPALECR RCHVKCHRDHLDKKEDLICPCKVSYDVTSARDMLLLACSQDEQKKWVTHLVKKIPKNP PSGFVRASPRTLSTRSTANQSFRKVVKNTSGKTS" BASE COUNT 1538 a 624 c 908 g 995 t ORIGIN 1 atgtcgactg gggacagttt tgagactcga tttgaaaaaa tggacaacct gctgcgggat 61 cccaaatcgg aagtgaattc ggattgtttg ctggatggat tggatgcttt ggtatatgat 121 ttggattttc ctgccttaag aaaaaacaaa aatattgaca actttttaag cagatataaa 181 gacacaataa ataaaatcag agatttacga atgaaagctg aagattatga agtagtgaag 241 gtgattggta gaggtgcatt tggagaagtt caattggtaa ggcataaatc caccaggaag 301 gtatatgcta tgaagcttct cagcaaattt gaaatgataa agagatctga ttctgctttt 361 ttctgggaag aaagggacat catggctttt gccaacagtc cttgggttgt tcagcttttt 421 tatgcattcc aagatgatcg ttatctctac atggtgatgg aatacatgcc tggtggagat 481 cttgtaaact taatgagcaa ctatgatgtg cctgaaaaat gggcacgatt ctatactgca 541 gaagtagttc ttgcattgga tgcaatccat tccatgggtt ttattcacag agatgtgaag 601 cctgataaca tgctgctgga taaatctgga catttgaagt tagcagattt tggtacttgt 661 atgaagatga ataaggaagg catggtacga tgtgatacag cggttggaac acctgattat 721 atttcccctg aagtattaaa atcccaaggt ggtgatggtt attatggaag agaatgtgac 781 tggtggtcgg ttggggtatt tttatacgaa atgcttgtag gtgatacacc tttttatgca 841 gattctttgg ttggaactta cagtaaaatt atgaaccata aaaattcact tacctttcct 901 gatgataatg acatatcaaa agaagcaaaa aaccttattt gtgccttcct tactgacagg 961 gaagtgaggt tagggcgaaa tggtgtagaa gaaatcaaac gacatctctt cttcaaaaat 1021 gaccagtggg cttgggaaac gctccgagac actgtagcac cagttgtacc cgatttaagt 1081 agtgacattg atactagtaa ttttgatgac ttggaagaag ataaaggaga ggaagaaaca 1141 ttccctattc ctaaagcttt cgttggcaat caactacctt ttgtaggatt tacatattat 1201 agcaatcgta gatacttatc ttcagcaaat cctaatgata acagaactag ctccaatgca 1261 gataaaagct tgcaggaaag tttgcaaaaa acaatctata agctggaaga acagctgcat 1321 aatgaaatgc agttaaaaga tgaaatggag cagaagtgca gaacctcaaa cataaaacta 1381 gacaagataa tgaaagaatt ggatgaagag ggaaatcaaa gaagaaatct agaatctaca 1441 gtgtctcaga ttgagaagga gaaaatgttg ctacagcata gaattaatga gtaccaaaga 1501 aaagctgaac aggaaaatga gaagagaaga aatgtagaaa atgaagtttc tacattaaag 1561 gatcagttgg aagacttaaa gaaagtcagt cagaattcac agcttgctaa tgagaagctg 1621 tcccagttac aaaagcagct agaagaagcc aatgacttac ttaggacaga atcggacaca 1681 gctgtaagat tgaggaagag tcacacagag atgagcaagt caattagtca gttagagtcc 1741 ctgaacagag agttgcaaga gagaaatcga attttagaga attctaagtc acaaacagac 1801 aaagattatt accagctgca agctatatta gaagctgaac gaagagacag aggtcatgat 1861 tctgagatga ttggagacct tcaagctcga attacatctt tacaagagga ggtgaagcat 1921 ctcaaacata atctcgaaaa agtggaagga gaaagaaaag aggctcaaga catgcttaat 1981 cactcagaaa aggaaaagaa taatttagag atagatttaa actacaaact taaatcatta 2041 caacaacggt tagaacaaga ggtaaatgaa cacaaagtaa ccaaagctcg tttaactgac 2101 aaacatcaat ctattgaaga ggcaaagtct gtggcaatgt gtgagatgga aaaaaagctg 2161 aaagaagaaa gagaagctcg agagaaggct gaaaatcggg ttgttcagat tgagaaacag 2221 tgttccatgc tagacgttga tctgaagcaa tctcagcaga aactagaaca tttgactgga 2281 aataaagaaa ggatggagga tgaagttaag aatctaaccc tgcaactgga gcaggaatca 2341 aataagcggc tgttgttaca aaatgaattg aagactcaag catttgaggc agacaattta 2401 aaaggtttag aaaagcagat gaaacaggaa ataaatactt tattggaagc aaagagatta 2461 ttagaatttg agttagctca gcttacgaaa cagtatagag gaaatgaagg acagatgcgg 2521 gagctacaag atcagcttga agctgagcaa tatttctcga cactttataa aacccaggta 2581 aaggaactta aagaagaaat tgaagaaaaa aacagagaaa atttaaagaa aatacaggaa 2641 ctacaaaatg aaaaagaaac tcttgctact cagttggatc tagcagaaac aaaagctgag 2701 tctgagcagt tggcgcgagg ccttctggaa gaacagtatt ttgaattgac gcaagaaagc 2761 aagaaagctg cttcaagaaa tagacaagag attacagata aagatcacac tgttagtcgg 2821 cttgaagaag caaacagcat gctaaccaaa gatattgaaa tattaagaag agagaatgaa 2881 gagctaacag agaaaatgaa gaaggcagag gaagaatata aactggagaa ggaggaggag 2941 atcagtaatc ttaaggctgc ctttgaaaag aatatcaaca ctgaacgaac ccttaaaaca 3001 caggctgtta acaaattggc agaaataatg aatcgaaaag attttaaaat tgatagaaag 3061 aaagctaata cacaagattt gagaaagaaa gaaaaggaaa atcgaaagct gcaactggaa 3121 ctcaaccaag aaagagagaa attcaaccag atggtagtga aacatcagaa ggaactgaat 3181 gacatgcaag cgcaattggt agaagaatgt gcacatagga atgagcttca gatgcagttg 3241 gccagcaaag agagtgatat tgagcaattg cgtgctaaac ttttggacct ctcggattct 3301 acaagtgttg ctagttttcc tagtgctgat gaaactgatg gtaacctccc agagtcaaga 3361 attgaaggtt ggctttcagt accaaataga ggaaatatca aacgatatgg ctggaagaaa 3421 cagtatgttg tggtaagcag caaaaaaatt ttgttctata atgacgaaca agataaggag 3481 caatccaatc catctatggt attggacata gataaactgt ttcacgttag acctgtaacc 3541 caaggagatg tgtatagagc tgaaactgaa gaaattccta aaatattcca gatactatat 3601 gcaaatgaag gtgaatgtag aaaagatgta gagatggaac cagtacaaca agctgaaaaa 3661 actaatttcc aaaatcacaa aggccatgag tttattccta cactctacca ctttcctgcc 3721 aattgtgatg cctgtgccaa acctctctgg catgttttta agccaccccc tgccctagag 3781 tgtcgaagat gccatgttaa gtgccacaga gatcacttag ataagaaaga ggacttaatt 3841 tgtccatgta aagtaagtta tgatgtaaca tcagcaagag atatgctgct gttagcatgt 3901 tctcaggatg aacaaaaaaa atgggtaact catttagtaa agaaaatccc taagaatcca 3961 ccatctggtt ttgttcgtgc ttcccctcga acgctttcta caagatccac tgcaaatcag 4021 tctttccgga aagtggtcaa aaatacatct ggaaaaacta gttaa // LOCUS HSU43279 3980 bp mRNA PRI 28-SEP-1996 DEFINITION Human nucleoporin nup 36 mRNA, complete cds. ACCESSION U43279 NID g1565299 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3980) AUTHORS Bora,N.S., Bora,P.S., Tandhasetti,M.T., Cirrito,T.P. and Kaplan,H.J. TITLE Molecular cloning, sequencing, and expression of the 36 kDa protein present in pars planitis. Sequence homology with yeast nucleopore complex protein JOURNAL Invest. Ophthalmol. Vis. Sci. 37 (9), 1877-1883 (1996) MEDLINE 96335095 REFERENCE 2 (bases 1 to 3980) AUTHORS Bors,N.S., Bora,P.S. and Kaplan,H.J. TITLE Direct Submission JOURNAL Submitted (13-DEC-1995) Nalini S. Bors, Opthalmology, Wash. Univ. Med. Sch., 660, South Euclid Avenue, St. Louis, MO 63110, USA FEATURES Location/Qualifiers source 1..3980 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 323..1291 /note="nucleoporin" /codon_start=1 /product="nup 36" /db_xref="PID:g1565300" /translation="MKRQFVDFPKYRLAPKPLFAPSSNGDAKFQKWGKTLERSDRGSS TSNSITDPESSYLNSNDLLFDPDRRYLKHLVIKNNKNLNVINHNDDEASKVKLVTFTT ESASKDDQASSSIAASKLTEKAHSPQTDLKDDHDESTPDPQSKAPNGSTSIPMIENEK ISSKVPGLLSNDVTFFKNNYYISPSIETLGNKSLIELRKINNLVIGHRHYGKVEFLEP VDLLNTPLDTLCGDLVTFGPKSCSIYENCSIKPEKGEGINVRCRVTLYSCFPIDKETR KPIKNITHPLLKRSIAKLKENPVYKFESYDPVTGTYSYTIDHPVLP" BASE COUNT 1241 a 729 c 683 g 1327 t ORIGIN 1 gaattcttcc acaatcggcc aaaacaaacc agcttttgga ggcacaaccc gaaatacagg 61 actctttggc gccaccggca cgaactcttc agcagttggt tcaactggtg gactttttgg 121 ccagaataat aatacgctta atgttggtac acaaaatgta ccacctgtga acaataccac 181 ccaaaacacc cttttgggta caacggcagt tccttcccta caacaagctc cagtaactaa 241 tgaacagctt ttttccaaaa tatcaatccc taactctatt acaaatccag tcaaagcaac 301 aacttcaaaa gtgaacgccg ccatgaaaag acaatttgtc gacttcccta agtatagact 361 tgccccaaag ccgttatttg ctccctcttc gaatggcgat gctaaatttc aaaagtgggg 421 caagacactg gaaagaagtg atagaggaag cagtaccagc aattctatta cggatccaga 481 atcaagctat ctaaattcaa atgacttgtt gtttgatcca gatagaagat atttgaaaca 541 tctggtgatt aaaaataata agaacttaaa tgtcattaac cataatgatg atgaagcaag 601 caaagttaaa ttagtgacgt ttacaacaga atcagcttca aaagatgacc aagcctcatc 661 aagcattgct gcttcaaaat taactgaaaa agcacattct cctcagaccg acctaaaaga 721 tgatcatgat gaaagcactc ctgatcctca atcgaaagct ccaaacggtt ccacctctat 781 accaatgatt gagaatgaaa agattagcag caaagttccc ggcctattga gcaacgacgt 841 tacctttttc aagaataact actacatttc accttccata gaaacgcttg gcaataagtc 901 attaattgaa cttcgtaaaa taaacaacct agtcattggt cacagacatt atggtaaagt 961 cgagtttctg gagcccgttg atttgttgaa tactcctttg gatactttat gcggggatct 1021 tgtcaccttt ggaccaaaat catgttcaat atatgaaaac tgttccataa agccagaaaa 1081 gggcgaaggc attaatgtac gttgtagagt gactttatat tcctgttttc ctattgacaa 1141 agaaacaagg aaacctataa agaatataac acatcctcta ctgaaaagaa gtatagccaa 1201 actaaaagaa aacccagtgt acaagtttga aagctacgac cccgtaacag gcacctatag 1261 ttacaccata gatcatccag ttttacctta aaccggaata atttttgtag agaatccttg 1321 tatcgtctaa gtagtctaga tgttcagctg atagattttc gttgtattgt atatataata 1381 attgtcggaa aacaaaaata cttaattata attgtgtgac cgaaaatgcc tgatcaacag 1441 ccatggcaca tttgaatggg attttgagga gacaaaaatg aagaggttct ataacctttg 1501 tagaagaaca tcgacctctt ttcaatgaag attggtggca tcgccagatg cagaaagttt 1561 gcctcatgaa ataaaaatga acagaccatg aaggctaaag aaagaaaaaa ataaaatgcc 1621 atctcctaga atcgaaccag ggtttcatcg gccacaacga tgtgtactaa ccactatact 1681 aagatggcaa acaactgtga aatatttggt tacatctgca catctggtgg aataaataat 1741 atgtactctt tgctttttta tgttaaacct acaagtggtg actgtaaaga agcattacaa 1801 cgtagaactg ataaaggaga gtagttacat aagctttccg taatggtgaa tttatagcag 1861 ttttcttctc gatgaaagaa agggaaagaa ctaaatatac tcgaatgctt gtaccactcc 1921 atttccccat ttatcacatt taaagttacg agtaaaaaag tgaccgatat agaatgtctg 1981 atgaaagtga gatatatgtg ggtaattaga taattgttgg gattccattg ttgataaagg 2041 ctataatatt aggtatacag aatatactag aagttctcct cgaggatata ggaatcctca 2101 aaatggaatc tatatttcta catactaata ttacgattat tcctcattcc gttttatatg 2161 tttatattca ttgatcctat tacattatca atccttgcgt ttcagcttcc tctaacatcg 2221 atgacagctt ctcataactt atgtcatcat attaacactg aatatgataa tatattgata 2281 atataactat tagttataga cgatagtgga tttttattcc aacataccac ccataaagta 2341 atagatctaa tgaatccatt tgtttgttta tagtttaaat gtttttatcg gaagaggttt 2401 tgtcatcaca tcagcaatgt tcttcttggt ctcgatgtag tatacgtata cattattacc 2461 tgatacttca tctctaagtc tcattgcctt tgtgaaaaaa aatctgtttc taaatttctc 2521 ttcatttgta gacttaatta tactgatcgt tgatctacta tcagtaagta agcctttaat 2581 aattggtttc ttgttaagtt cttgcacaag gtgactgagg ttattcaata gcggaatagc 2641 ttcactgact gcgtgtattt ctgcttctgt agttgaagtg catgttaacg aagcctttgt 2701 cgactttcct ccaatcactt ttccgttgag taggaaaatg ttaccaattt gtgacttgta 2761 atatggttgg ttaccatatg aagcatcgct tattgcgact agtttattat ctggcttggt 2821 aggtttgttt ttgtgccata ttaattgttt atctctagtg tcccacatga attgtattaa 2881 ctcatatgtc atgtctaaaa cttgcctaga ggggaatagt atatgttgag caagtgcgtt 2941 gatgtagtat agtaagccaa atctaaattt atatccaaca tatgaagcta gaccaatcaa 3001 cttttgcatt tcatgtacct tctctttgta ttcatcttca tctatttcta gttcatcctg 3061 gtctatataa agacctggtt gacctggagc gcgaagtttt cttccttttg gattcaaagg 3121 tacgtttaat ttgggtattt tctcagttaa tgagttttcc atacctaatt tcatgtattt 3181 acctctttga tatttgattt ctaagccaag tatgtcgtac tgaatttcgt tatcactttc 3241 acccagattt attatctttg tatcgtattg tttcttgagt gttgttatga ttttcttatt 3301 tgcatttaag tctttgctga acaatatcat atcatcaacg aataagcaaa ttgttactgg 3361 actattctta aatacgcatg accatccacg aacttcttcc ataccacact gttttatcag 3421 gtatgatttg atagtttcgt accagttcgc tccactttgt ttcaatccat aaagtgattt 3481 cttcaaacgt atcaacttat cattcattcc taaatgtggt ggaggtctta tgtataattc 3541 ttctttgatg tctgcataca aatatgccga agatatgtct aattgtgtat atagtagtta 3601 ttgtctaatg caagtgacag ggatgtcatt aatgcatagt gatgtacggt attggattgc 3661 atgcctgagt cgtaagtgtc aggatgctga atatcacctc ttgcaacaaa tctagcttta 3721 tgagtaccgt cacgtttcct gttgaagata aacattgaat ttattactct tttagggtct 3781 atttcttttc tgtcataata aatgtcagtg tcccaagtat tcattttcaa tagttggttg 3841 acttctttgt ggtatgcttc gatatatttt tccttttctt taatatcttt attataggtg 3901 attgcctcat cgtatcttaa ggttgtccgt attggtttga ttgattttac tgcttttaca 3961 gctgcaatca ggtcgaattc // LOCUS HSU43292 1277 bp mRNA PRI 09-AUG-1996 DEFINITION Human MDS1B (MDS1) mRNA, complete cds. ACCESSION U43292 NID g1294814 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1277) AUTHORS Fears,S., Mathieu,C., Zeleznik-Le,N., Huang,S., Rowley,J.D. and Nucifora,G. TITLE Intergenic splicing of MDS1 and EVI1 occurs in normal tissues as well as in myeloid leukemia and produces a new member of the PR domain family JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (4), 1642-1647 (1996) MEDLINE 96202331 REFERENCE 2 (bases 1 to 1277) AUTHORS Nucifora,G. TITLE Direct Submission JOURNAL Submitted (14-DEC-1995) Giuseppina Nucifora, Medicine, University of Chicago, 5841 S. Maryland, MC2115, Chicago, IL 60637, USA FEATURES Location/Qualifiers source 1..1277 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="3" /map="3q26" /tissue_type="pancreas" gene 308..817 /gene="MDS1" CDS 308..817 /gene="MDS1" /note="splice variant; normal mRNA; MDS1 gene is fused to AML1 in t(3;21) translocation associated with myeloid leukemia" /codon_start=1 /product="MDS1B" /db_xref="PID:g1294815" /translation="MRSKGRARKLATNNECVYGNYPEIPLEEMPDADGVASTPSLNIQ EPCSPATSSEAFTPKEGSPYKAPIYIPDDIPIPAEFELRESNMPGAGLGIWTKRKIEV GEKFGPYVGEQRSNLKDPSYGWEVHLPRSRRVSVHSWLYLGKRSSDVGIAFSQADVYM PGLQCAFLS" BASE COUNT 385 a 282 c 368 g 242 t ORIGIN 1 gcgcatgtgc aaggtgtcca aactgacaat gctggagaga tagcgagtgt ggattgagag 61 aaagggagag agggagggag agagagtgaa agaagaaaat acagagagtg agtgtgtgga 121 agagagagag aaacaggaga gaaacaggag ggggggagag agagagagag agagagagag 181 agagagagag agagagagag agagacagga gagagaggga gggagcgaga gggagagcaa 241 aagaaggaaa ggatccaaga aaaaaaagcc ccaaccacac accagaggct gcaggactgg 301 gcacagcatg agatccaaag gcagggcaag gaaactggcc acaaataatg agtgtgtata 361 tggcaactac cctgaaatac ctttggaaga aatgccagat gcagatggag tagccagcac 421 tccctccctc aatattcaag agccatgctc tcctgccaca tccagtgaag cattcactcc 481 aaaggagggt tctccttaca aagcccccat ctacatccct gatgatatcc ccattcctgc 541 tgagtttgaa cttcgagagt caaatatgcc tggggcagga ctaggaatat ggaccaaaag 601 gaagatcgaa gtaggtgaaa agtttgggcc ttatgtggga gagcagaggt caaacctgaa 661 agaccccagt tatggatggg aggtacatct tccaaggtct cggagggtaa gcgttcactc 721 ttggttgtat ttggggaaga gaagctcaga cgtaggaata gccttctctc aggctgatgt 781 ctacatgcct ggactgcagt gtgccttcct ctcgtagctc ggaaggacgc ggaacctggg 841 tggctggagc cgcccgctgc gctttatttc ggagcgcaat gccatctacc ggcgtcctgc 901 cgtacctgca agtatccaga ccttgaaagt cgctccgcct ccccccaccc gaagccaata 961 taggggaaaa aactcggagg ccctttccac gaaatcctaa tttagccagg acctgccaat 1021 gattccagac gaccttttgt tttcctaccg acgtttcctc gttttgaaag cagttttgta 1081 aagggcaagg aggtgggggc cgcagggttg gggcgctgag ctcccagacc ccctgatcag 1141 gcgcactgtc tgaagcaatc ggttccccag attacttgat atttaataca caatgcatca 1201 taaaacaaat ccctcatcct gacaggaaga aaatagaaca gctcatagct cgagccagtc 1261 caatttatgg ctaaatt // LOCUS HSU43318 2334 bp mRNA PRI 24-FEB-1996 DEFINITION Human putative transmembrane receptor (frizzled 5) mRNA, complete cds. ACCESSION U43318 NID g1151251 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2334) AUTHORS Wang,Y., Macke,J.P., Abella,B.S., Andreasson,K., Worley,P., Gilbert,D.J., Copeland,N.G., Jenkins,N.A. and Nathans,J. TITLE A large family of putative transmembrane receptors homologous to the product of the Drosophila tissue polarity gene frizzled JOURNAL J. Biol. Chem. 271 (8), 4468-4476 (1996) MEDLINE 96224032 REFERENCE 2 (bases 1 to 2334) AUTHORS Abella,B., Wang,Y., Macke,J.P. and Nathans,J. TITLE Direct Submission JOURNAL Submitted (14-DEC-1995) Jeremy Nathans, Molecular Biology and Genetics, Johns Hopkins Medical School, 725 N. Wolfe Street, Baltimore, MD 21205 FEATURES Location/Qualifiers source 1..2334 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="2" /map="2q33-34" /tissue_type="retina" gene 321..2078 /gene="frizzled 5" CDS 321..2078 /gene="frizzled 5" /note="putative transmembrane receptor" /codon_start=1 /product="transmembrane receptor" /db_xref="PID:g1151252" /translation="MARPDPSAPPSLLLLLLAQLVGRAAAASKAPVCQEITVPMCRGI GYNLTHMPNQFNHDTQDEAGLEVHQFWPLVEIQCSPDLRFFLCTMYTPICLPDYHKPL PPCRSVCERAKAGCSPLMRQYGFAWPERMSCDRLPVLGRDAEVLCMDYNRSEATTAPP RPFPAKPTLPGPPGAPASGGECPAGGPFVCKCREPFVPILKESHPLYNKVRTGQVPNC AVPCYQPSFSADERTFATFWIGLWSVLCFISTSTTVATFLIDMDTFRYPERPIIFLSA CYLCVSLGFLVRLVVGHASVACSREHNHIHYETTGPALCTIVFLLVYFFGMASSIWWV ILSLTWFLAAAMKWGNEAIAGYGQYFHLAAWLIPSVKSITALALSSVDGDPVAGICYV GNQNLNSLRRFVLGPLVLYLLVGTLFLLAGFVSLFRIRSVIKQGGTKTDKLEKLMIRI GIFTLLYTVPASIVVACYLYEQHYRESWEAALTCACPGHDTGQPRAKPEYWVLMLKYF MCLVVGITSGVWIWSGKTVESWRRFTSRCCCRPRRGHKSGGAMAAGDYPEASAALTGR TGPPGPAATYHKQVSLSHV" BASE COUNT 356 a 803 c 736 g 439 t ORIGIN 1 acccagggac ggaggaccca ggctggcttg gggactgtct gctcttctcg gcgggagccg 61 tggagagtcc tttccctgga atccgagccc taaccgtctc tccccagccc tatccggcga 121 ggagcggagc gctgccagcg gaggcagcgc cttcccgaag cagtttatct ttggacggtt 181 ttctttaaag gaaaaacgaa ccaacaggtt gccagccccg gcgccacaca cgagacgccg 241 gagggagaag ccccggcccg gattcctctg cctgtgtgcg tccctcgcgg gctgctggag 301 gcgaggggag ggagggggcg atggctcggc ctgacccatc cgcgccgccc tcgctgttgc 361 tgctgctcct ggcgcagctg gtgggccggg cggccgccgc gtccaaggcc ccggtgtgcc 421 aggaaatcac ggtgcccatg tgccgcggca tcggctacaa cctgacgcac atgcccaacc 481 agttcaacca cgacacgcag gacgaggcgg gcctggaggt gcaccagttc tggccgctgg 541 tggagatcca atgctcgccg gacctgcgct tcttcctatg cactatgtac acgcccatct 601 gtctgcccga ctaccacaag ccgctgccgc cctgccgctc ggtgtgcgag cgcgccaagg 661 ccggctgctc gccgctgatg cgccagtacg gcttcgcctg gcccgagcgc atgagctgcg 721 accgcctccc ggtgctgggc cgcgacgccg aggtcctctg catggattac aaccgcagcg 781 aggccaccac ggcgcccccc aggcctttcc cagccaagcc cacccttcca ggcccgccag 841 gggcgccggc ctcggggggc gaatgccccg ctgggggccc gttcgtgtgc aagtgtcgcg 901 agcccttcgt gcccattctg aaggagtcac acccgctcta caacaaggtg cggacgggcc 961 aggtgcccaa ctgcgcggta ccctgctacc agccgtcctt cagtgccgac gagcgcacgt 1021 tcgccacctt ctggataggc ctgtggtcgg tgctgtgctt catctccacg tccaccacag 1081 tggccacctt cctcatcgac atggacacgt tccgctatcc tgagcgcccc atcatcttcc 1141 tgtcagcctg ctacctgtgc gtgtcgctgg gcttcctggt gcgtctggtc gtgggccatg 1201 ccagcgtggc ctgcagccgc gagcacaacc acatccacta cgagaccacg ggccctgcac 1261 tgtgcaccat cgtcttcctc ctggtctact tcttcggcat ggccagctcc atctggtggg 1321 tcatcctgtc gctcacctgg ttcctggccg ccgcgatgaa gtggggcaac gaggccatcg 1381 cgggctacgg ccagtacttc cacctggctg cgtggctcat ccccagcgtc aagtccatca 1441 cggcactggc gctgagctcc gtggacgggg acccagtggc cggcatctgc tacgtgggca 1501 accagaacct gaactcgctg cggcgcttcg tgctgggccc gctggtgctc tacctgctgg 1561 tgggcacgct cttcctgctg gcgggcttcg tgtcgctctt ccgcatccgc agcgtcatca 1621 agcagggcgg caccaagacg gacaagctgg agaagctcat gatccgcatc ggcatcttca 1681 cgctgctcta cacggtcccc gccagcattg tggtggcctg ctacctgtac gagcagcact 1741 accgcgagag ctgggaggcg gcgctcacct gcgcctgccc gggccacgac accggccagc 1801 cgcgcgccaa gcccgagtac tgggtgctca tgctcaagta cttcatgtgc ctggtggtgg 1861 gcatcacgtc gggcgtctgg atctggtcgg gcaagacggt ggagtcgtgg cggcgtttca 1921 ccagccgctg ctgctgccgc ccgcggcgcg gccacaagag cgggggcgcc atggccgcag 1981 gggactaccc cgaggcgagc gccgcgctca caggcaggac cgggccgccg ggccccgccg 2041 ccacctacca caagcaggtg tccctgtcgc acgtgtagga ggctgccgcc gagggactcg 2101 gccggagagc tgaggggagg ggggcgtttt gtttggtagt tttgccaagg tcacttccgt 2161 ttaccttcat ggtgctgttg ccccctcccg cggcgacttg gagagaggga agaggggcgt 2221 tttcgaggaa gaacctgtcc caggtcttct ccaaggggcc cagctcacgt gtattctatt 2281 ttgcgtttct tacctgcctt ctttatggga accctctttt taatttatat gtat // LOCUS HSU43341 2986 bp mRNA PRI 04-DEC-1996 DEFINITION Human transcription factor NFAT1 isoform B (NFAT1) mRNA, complete cds. ACCESSION U43341 NID g1353773 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2986) AUTHORS Luo,C., Burgeon,E., Carew,J.A., McCaffrey,P.G., Badalian,T.M., Lane,W.S., Hogan,P.G. and Rao,A. TITLE Recombinant NFAT1 (NFATp) is regulated by calcineurin in T cells and mediates transcription of several cytokine genes JOURNAL Mol. Cell. Biol. 16 (7), 3955-3966 (1996) MEDLINE 96251346 REFERENCE 2 (bases 1 to 2986) AUTHORS Luo,C. TITLE Direct Submission JOURNAL Submitted (15-DEC-1995) Chun Luo, Cellular and Molecular Biology, Dana-Farber Cancer Institute, 44 Binney Street, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..2986 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="20" /map="20q13.1-q13.2" /cell_line="Jurkat T cell line" gene 221..2986 /gene="NFAT1" CDS 221..2986 /gene="NFAT1" /note="NFAT1-B; NFATp" /codon_start=1 /product="transcription factor NFAT1 isoform B" /db_xref="PID:g1353774" /translation="MNAPERQPQPDGGDAPGHEPGGSPQDELDFSILFDYEYLNPNEE EPNAHKVASPPSGPAYPDDVMDYGLKPYSPLASLSGEPPGRFGEPDRVGPQKFLSAAK PAGASGLSPRIEITPSHELIQAVGPLRMRDAGLLVEQPPLAGVAASPRFTLPVPGFEG YREPLCLSPASSGSSASFISDTFSPYTSPCVSPNNGGPDDLCPQFQNIPAHYSPRTSP IMSPRTSLAEDSCLGRHSPVPRPASRSSSPGAKRRHSCAEALVALPPGASPQRSRSPS PQPSSHVAPQDHGSPAGYPPVAGSAVIMDALNSLATDSPCGIPPKMWKTSPDPSPVSA APSKAGLPRHIYPAVEFLGPCEQGERRNSAPESILLVPPTWPKPLVPAIPICSIPVTA SLPPLEWPLSSQSGSYELRIEVQPKPHHRAHYETEGSRGAVKAPTGGHPVVQLHGYME NKPLGLQIFIGTADERILKPHAFYQVHRITGKTVTTTSYEKIVGNTKVLEIPLEPKNN MRATIDCAGILKLRNADIELRKGETDIGRKNTRVRLVFRVHIPESSGRIVSLQTASNP IECSQRSAHELPMVERQDTDSCLVYGGQQMILTGQNFTSESKVVFTEKTTDGQQIWEM EATVDKDKSQPNMLFVEIPEYRNKHIRTPVKVNFYVINGKRKRSQPQHFTYHPVPAIK TEPTDEYDPTLICSPTHGGLGSQPYYPQHPMVAESPSCLVATMAPCQQFRTGLSSPDA RYQQQNPAAVLYQRSKSLSPSLLGYQQPALMAAPLSLADAHRSVLVHAGSQGQSSALL HPSPTNQQASPVIHYSPTNQQLRCGSHQEFQHIMYCENFAPGTTRPGPPPVSQGQRLS PGSYPTVIQQQNATSQRAAKNGPPVSDQKEVLPAGVTIKQEQNLDQTYLDDELIDTHL SWIQNIL" BASE COUNT 623 a 1080 c 795 g 488 t ORIGIN 1 agcaggaagc tcgcgccgcc gtcgccgccg ccgctcagct tccccgggcg cgtccaggac 61 ccgctgcgcc aggcgcgccg tccccggacc cggcgtgcgt ccctacgagg aaagggaccc 121 cgccgctcga gccgcctccg ccagccccac tgcgaggggt cccagagcca gccgcgcccg 181 ccctcgcccc cggccccgca gccttcccgc cctgcgcgcc atgaacgccc ccgagcggca 241 gccccaaccc gacggcgggg acgccccagg ccacgagcct gggggcagcc cccaagacga 301 gcttgacttc tccatcctct tcgactatga gtatttgaat ccgaacgaag aagagccgaa 361 tgcacataag gtcgccagcc caccctccgg acccgcatac cccgatgatg taatggacta 421 tggcctcaag ccatacagcc cccttgctag tctctctggc gagccccccg gccgattcgg 481 agagccggat agggtagggc cgcagaagtt tctgagcgcg gccaagccag caggggcctc 541 gggcctgagc cctcggatcg agatcactcc gtcccacgaa ctgatccagg cagtggggcc 601 cctccgcatg agagacgcgg gcctcctggt ggagcagcct cccctggccg gggtggccgc 661 cagcccgagg ttcaccctgc ccgtgcccgg cttcgagggc taccgcgagc cgctttgctt 721 gagccccgct agcagcggct cctctgccag cttcatttct gacaccttct ccccctacac 781 ctcgccctgc gtctcgccca ataacggcgg gcccgacgac ctgtgtccgc agtttcaaaa 841 catccctgct cattattccc ccagaacctc gccaataatg tcacctcgaa ccagcctcgc 901 cgaggacagc tgcctgggcc gccactcgcc cgtgccccgt ccggcctccc gctcctcatc 961 gcctggtgcc aagcggaggc attcgtgcgc cgaggccttg gttgccctgc cgcccggagc 1021 ctcaccccag cgctcccgga gcccctcgcc gcagccctca tctcacgtgg caccccagga 1081 ccacggctcc ccggctgggt acccccctgt ggctggctct gccgtgatca tggatgccct 1141 gaacagcctc gccacggact cgccttgtgg gatccccccc aagatgtgga agaccagccc 1201 tgacccctcg ccggtgtctg ccgccccatc caaggccggc ctgcctcgcc acatctaccc 1261 ggccgtggag ttcctggggc cctgcgagca gggcgagagg agaaactcgg ctccagaatc 1321 catcctgctg gttccgccca cttggcccaa gccgctggtg cctgccattc ccatctgcag 1381 catcccagtg actgcatccc tccctccact tgagtggccg ctgtccagtc agtcaggctc 1441 ttacgagctg cggatcgagg tgcagcccaa gccacatcac cgggcccact atgagacaga 1501 aggcagccga ggggctgtca aagctccaac tggaggccac cctgtggttc agctccatgg 1561 ctacatggaa aacaagcctc tgggacttca gatcttcatt gggacagctg atgagcggat 1621 ccttaagccg cacgccttct accaggtgca ccgaatcacg gggaaaactg tcaccaccac 1681 cagctatgag aagatagtgg gcaacaccaa agtcctggag atccccttgg agcccaaaaa 1741 caacatgagg gcaaccatcg actgtgcggg gatcttgaag cttagaaacg ccgacattga 1801 gctgcggaaa ggcgagacgg acattggaag aaagaacacg cgggtgagac tggttttccg 1861 agttcacatc ccagagtcca gtggcagaat cgtctcttta cagactgcat ctaaccccat 1921 cgagtgctcc cagcgatctg ctcacgagct gcccatggtt gaaagacaag acacagacag 1981 ctgcctggtc tatggcggcc agcaaatgat cctcacgggg cagaacttta catccgagtc 2041 caaagttgtg tttactgaga agaccacaga tggacagcaa atttgggaga tggaagccac 2101 ggtggataag gacaagagcc agcccaacat gctttttgtt gagatccctg aatatcggaa 2161 caagcatatc cgcacacctg taaaagtgaa cttctacgtc atcaatggga agagaaaacg 2221 aagtcagcct cagcacttta cctaccaccc agtcccagcc atcaagacgg agcccacgga 2281 tgaatatgac cccactctga tctgcagccc cacccatgga ggcctgggga gccagcctta 2341 ctacccccag cacccgatgg tggccgagtc cccctcctgc ctcgtggcca ccatggctcc 2401 ctgccagcag ttccgcacgg ggctctcatc ccctgacgcc cgctaccagc aacagaaccc 2461 agcggccgta ctctaccagc ggagcaagag cctgagcccc agcctgctgg gctatcagca 2521 gccggccctc atggccgccc cgctgtccct tgcggacgct caccgctctg tgctggtgca 2581 cgccggctcc cagggccaga gctcagccct gctccacccc tctccgacca accagcaggc 2641 ctcgcctgtg atccactact cacccaccaa ccagcagctg cgctgcggaa gccaccagga 2701 gttccagcac atcatgtact gcgagaattt cgcaccaggc accaccagac ctggcccgcc 2761 cccggtcagt caaggtcaga ggctgagccc gggttcctac cccacagtca ttcagcagca 2821 gaatgccacg agccaaagag ccgccaaaaa cggacccccg gtcagtgacc aaaaggaagt 2881 attacctgcg ggggtgacca ttaaacagga gcagaacttg gaccagacct acttggatga 2941 tgagctgata gacacacacc ttagctggat acaaaacata ttatga // LOCUS HSU43368 1079 bp mRNA PRI 07-MAR-1996 DEFINITION Human VEGF related factor isoform VRF186 precursor (VRF) mRNA, complete cds. ACCESSION U43368 NID g1216395 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1079) AUTHORS Grimmond,S., Lagercrantz,J., Drinkwater,C., Silins,G., Townson,S., Pollock,P., Gotley,D., Carson,E., Rakar,S., Nordenskjold,M., Ward,L., Hayward,N. and Weber,G. TITLE Cloning and characterization of a novel human gene related to vascular endothelial growth factor JOURNAL Genome Res. 6 (2), 122-129 (1996) REFERENCE 2 (bases 1 to 1079) AUTHORS Silins,G.U. TITLE Direct Submission JOURNAL Submitted (15-DEC-1995) Ginters U. Silins, Human Genetics, Queensland Institute of Medical Research, Herston, Queensland, 4029, Australia FEATURES Location/Qualifiers source 1..1079 /organism="Homo sapiens" /note="GDB DSEG number=D11S750" /db_xref="taxon:9606" /clone_lib="Human fetal brain cDNA library from Stratagene" /map="11q13" /chromosome="11" /tissue_type="brain" /dev_stage="fetus" gene 3..626 /gene="VRF" CDS 3..626 /gene="VRF" /codon_start=1 /product="VEGF related factor isoform VRF186 precursor" /db_xref="PID:g1216396" /translation="MSPLLRRLLLAALLQLAPAQAPVSQPDAPGHQRKVVSWIDVYTR ATCQPREVVVPLTVELMGTVAKQLVPSCVTVQRCGGCCPDDGLECVPTGQHQVRMQIL MIRYPSSQLGEMSLEEHSQCECRPKKKDSAVKPDRAATPHHRPQPRSVPGWDSAPGAP SPADITHPTPAPGPSAHAAPSTTSALTPGPAAAAADAAASSVAKGGA" sig_peptide 3..65 /gene="VRF" /note="putative" mat_peptide 66..623 /gene="VRF" /product="VEGF related factor isoform VRF186" polyA_site 1079 BASE COUNT 230 a 359 c 300 g 190 t ORIGIN 1 ccatgagccc tctgctccgc cgcctgctgc tcgccgcact cctgcagctg gcccccgccc 61 aggcccctgt ctcccagcct gatgcccctg gccaccagag gaaagtggtg tcatggatag 121 atgtgtatac tcgcgctacc tgccagcccc gggaggtggt ggtgcccttg actgtggagc 181 tcatgggcac cgtggccaaa cagctggtgc ccagctgcgt gactgtgcag cgctgtggtg 241 gctgctgccc tgacgatggc ctggagtgtg tgcccactgg gcagcaccaa gtccggatgc 301 agatcctcat gatccggtac ccgagcagtc agctggggga gatgtccctg gaagaacaca 361 gccagtgtga atgcagacct aaaaaaaagg acagtgctgt gaagccagac agggctgcca 421 ctccccacca ccgtccccag ccccgttctg ttccgggctg ggactctgcc cccggagcac 481 cctccccagc tgacatcacc catcccactc cagccccagg cccctctgcc cacgctgcac 541 ccagcaccac cagcgccctg acccccggac ctgccgctgc cgctgccgac gccgcagctt 601 cctccgttgc caagggcggg gcttagagct caacccagac acctgcaggt gccggaagct 661 gcgaaggtga cacatggctt ttcagactca gcagggtgac ttgcctcaga ggctatatcc 721 cagtggggga acaaagggga gcctggtaaa aaacagccaa gcccccaaga cctcagccca 781 ggcagaagct gctctaggac ctgggcctct cagagggctc ttctgccatc ccttgtctcc 841 ctgaggccat catcaaacag gacagagttg gaagaggaga ctgggaggca gcaagagggg 901 tcacatacca gctcagggga gaatggagta ctgtctcagt ttctaaccac tctgtgcaag 961 taagcatctt acaactggct cttcctcccc tcactaagaa gacccaaacc tctgcataat 1021 gggatttggg ctttggtaca agaactgtga cccccaaccc tgataaaaga gatggaagg // LOCUS HSU43408 2771 bp mRNA PRI 06-APR-1996 DEFINITION Human tyrosine kinase (Tnk1) mRNA, complete cds. ACCESSION U43408 NID g1256002 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2771) AUTHORS Hoehn,G.T., Stokland,T., Amin,S., Ramirez,M., Hawkins,A.L., Griffin,C.A., Small,D. and Civin,C.I. TITLE Tnk1: a novel intracellular tyrosine kinase gene isolated from human umbilical cord blood CD34+/Lin-/CD38- stem/progenitor cells JOURNAL Oncogene 12 (4), 903-913 (1996) MEDLINE 96197771 REFERENCE 2 (bases 1 to 2771) AUTHORS Hoehn,G.T. TITLE Direct Submission JOURNAL Submitted (15-DEC-1995) Gerard T. Hoehn, Pediatric Oncology, Johns Hopkins University School of Medicine, 600 North Wolfe Street, Baltimore, MD 21117, USA FEATURES Location/Qualifiers source 1..2771 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pTnk1" /cell_line="K562" /cell_type="CD34+/Lin-/CD38- blood cells" /tissue_type="umbilical cord blood" gene 117..2117 /gene="Tnk1" CDS 117..2117 /gene="Tnk1" /codon_start=1 /product="tyrosine kinase" /db_xref="PID:g1256003" /translation="MLPEAGSLWLLKLLRDIQLAQFYWPILEELNVTRPGHFDFVKPE DLDGIGMGRPAQRRLSEALKSLRSGPKSKNWVYKILGGFAPEHKEPTLPTDSPRHLPE PEGGLKCLIPEGAVCRGELLGSGCFGVVHRGLWTLPSGKSVPVAVKSLRVGPEGPMGT ELGDFLREVSVMMNLEHPHVLRLHGLVLGQPLQMVMELAPLGSLHARLTAPAPTPPLL VALLCLFLRQLAGAMAYLGARGLVHRDLATRNLQLASPRTIKVADFGLVRPLGGARGR YVMGGPRPIPYTWCAPESLRHGAFSSASDVWMFGVTLWEMFSGGEEPWPGVPPYLILQ RLEDRARLPRPPPSSRALYSLALRCWAPHPADRPSFSHLEGLLQEAGPSEACCVRDAT EPGALRMETGDPITVIEGSSSFHSPDSTIWKDQNGRTFKVGSFPASAVTLTDAGGLPA TRPVHRGTPARGDQHPGSIDGDRKKANLWDAPPARGQRRNMPLERMKGISRSLESVLS LGPRPTGGGSSPPEIRQARAVPQGPPGLPPRPPLSSSSPQPSQPSRERLPWPERKPPH NHPMGMPGARKAAALSGGLLSDPELQRKIMEMELSVHWVTHQECQTALGATGGDVASA IRNLKVDQLFLLSSRSRADCWRILEHYQWDLSAASRYVLARP" BASE COUNT 534 a 885 c 814 g 538 t ORIGIN 1 gagtcgccgc ttccgccttg accaggtgga gctggagacc tggtctctct agggcctgcc 61 ctgagctcac catctgaagg agagtgccat catccttagg aactccttct ccagacatgc 121 ttcccgaggc tggctccctg tggctactga agctgctccg ggacatccag ttggcccagt 181 tttactggcc catcctagag gagcttaatg tcacccggcc agggcacttc gactttgtaa 241 agcctgagga cctggacggc attggcatgg gccggcctgc ccaacgcaga ctgtccgaag 301 ctctgaaaag cctacgttct gggcctaagt ctaagaactg ggtctacaag atccttggag 361 gttttgcccc tgagcacaag gagcccaccc tgcccacgga cagcccacgg cacctccctg 421 agccagaggg gggcctcaag tgtctgatcc cagagggtgc tgtttgcaga ggggagctgc 481 tgggttcagg ctgcttcggt gtggtgcacc gagggctgtg gacgctgccc agtggcaaga 541 gtgtcccagt ggctgtcaag tccctccggg taggtcccga aggcccgatg ggcacagaac 601 tgggggactt cctgcgagag gtatcggtca tgatgaactt ggagcaccca cacgtgctgc 661 gtctgcacgg ccttgtactg ggccagcctc tgcagatggt gatggagctg gcgccactgg 721 gctccctgca cgcgcgctta acggccccgg ccccgacacc cccgctgctc gtggccctgc 781 tctgcctctt cctgcggcag ctggcgggag ccatggcgta cctgggggcc cgcgggctgg 841 tgcaccgaga cctcgctacg cgcaacctac agctggcgtc gccgcgcacc atcaaggtgg 901 ctgacttcgg gctggtgcgg cctctgggcg gtgcccgggg ccgctacgtc atgggcgggc 961 cccgccctat cccctacacc tggtgtgccc cagagagcct gcgccacgga gccttctcgt 1021 ctgcctcgga cgtgtggatg tttggggtga cgctgtggga gatgttctcc gggggcgagg 1081 aaccctggcc cggggtccca ccgtacctca tcctgcagcg gctggaggac agagcccggc 1141 tgcctaggcc tcccccctcc tccagggccc tctactccct cgccttgcgc tgctgggccc 1201 cccaccctgc cgaccggcct agcttttccc acctggaggg gctgctgcaa gaggccgggc 1261 cttcggaagc atgttgtgtg agggatgcca cagaaccagg cgccctgagg atggagactg 1321 gtgaccccat cacagtcatc gagggcagct cctctttcca cagccccgac tccacaatct 1381 ggaaggacca gaatggtcgc accttcaaag tgggcagctt cccagcctcg gcagtgacgc 1441 tgacagatgc ggggggcttg ccagccaccc gtccagtcca cagaggcacc cctgcccggg 1501 gagatcaaca cccaggaagc atagatggag acagaaagaa ggcaaatctt tgggatgcgc 1561 ccccagcacg gggccagagg aggaacatgc ccctggagag gatgaaaggc atttccagga 1621 gtctggagtc agttctgtcc ctcggtcctc gtcccacagg gggtggttca agcccccctg 1681 aaattcgaca agccagagct gtgccccagg gacctccagg cctgcctcca cgcccacctt 1741 tatcctctag ctctcctcag cccagccagc cctctaggga gaggcttccc tggcccgaaa 1801 gaaaaccccc acacaatcac cccatgggaa tgcctggagc ccgtaaagcc gctgccctct 1861 ctggaggcct cttgtccgat cctgagttgc agaggaagat tatggaaatg gagctgagtg 1921 tgcattgggt cacccaccag gagtgccaga cagcactagg agccactggg ggagatgtgg 1981 cttctgccat ccggaacctc aaggtagatc agctcttcct cctgagtagc cggtccagag 2041 ctgactgctg gcgcatcctg gagcattacc agtgggacct ctcagctgcc agccgttatg 2101 tcctggccag gccctgagcc cagcttctgc gggcacagac accagcatga aaagcctagg 2161 tccctgaggg cctggccaca tgggaccaag tggaaccaga acaaggcccc gacaggggta 2221 gacgttccac ttgtggagat cccacctgcc gctaggcacg tggaggagga gcccagagtt 2281 gggcactggc aaatgtctcc tccctcccat gctccttggc ttctgaaggc tgaagcccct 2341 ttggctgggc caagaaggat ctagtctgcc cactacattc tcaaacaaga ggacttggag 2401 gaaaagagct actatacatc atatgcagag gaagcttcta cgcgctagag aggatcaagg 2461 ggccacactg gaccatgtga acagccatcc ggaactgcca tcagctacca cactggactc 2521 tgcagggcag ccatcctgga tgatggaagc caccatattg acctggggta taggcccaaa 2581 ctgccttcgt ttggtccagg gccatcgtgg gtgatgacga ttgctctctt gcactcatgg 2641 acatttgatg ctggtagtat ggattatgag atggactagc ccctgctcca gcccagttct 2701 cacattcccc tttgtttttt cccataccaa ctgcttctac cctcccctat tacatacatc 2761 tttcaatgtc c // LOCUS HSU43431 3755 bp mRNA PRI 12-JUL-1996 DEFINITION Human DNA topoisomerase III mRNA, complete cds. ACCESSION U43431 NID g1292911 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3755) AUTHORS Hanai,R., Caron,P.R. and Wang,J.C. TITLE Human TOP3: a single-copy gene encoding DNA topoisomerase III JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (8), 3653-3657 (1996) MEDLINE 96195027 REFERENCE 2 (bases 1 to 3755) AUTHORS Hanai,R. and Wang,J.C. TITLE Direct Submission JOURNAL Submitted (15-DEC-1995) Ryo Hanai, Molecular and Cellular Biology, Hrvard University, 7 Divinity Avenue, Cambridge, MA 02138, USA FEATURES Location/Qualifiers source 1..3755 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17" /map="17p11.2-12" /cell_line="Jurkat T-cell line, ATCC E6-1" gene 178..3741 /gene="TOP3" CDS 178..3183 /gene="TOP3" /note="putative; uses an alternative initiation codon" /codon_start=1 /product="DNA topoisomerase III" /db_xref="PID:g1292912" /translation="MIFPVARYALRWLRRPEDRAFSRAAMEMALRGVRKVLCVAEKND AAKGIADLLSNGRMRRREGLSKFNKIYEFDYHLYGQNVTMVMTSVSGHLLAHDFQMQF RKWQSCNPLVLFEAEIEKYCPENFVDIKKTLERETRQCQALVIWTDCDREGENIGFEI IHVCKAVKPNLQVLRARFSEITPHAVRTACENLTEPDQRVSDAVDVRQELDLRIGAAF TRFQTLRLQRIFPEVLAEQLISYGSCQFPTLGFVVERFKAIQAFVPEIFHRIKVTHDH KDGIVEFNWKRHRLFNHTACLVLYQLCVEDPMATVVEVRSKPKSKWRPQALDTVELEK LASRKLRINAKETMRIAEKLYTQGYISYPRTETNIFPRDLNLTVLVEQQTPDPRWGAF AQSILERGGPTPRNGNKSDQAHPPIHPTKYTNNLQGDEQRLYEFIVRHFLACCSQDAQ GQETTVEIDIAQERFVAHGLMILARNYLDVYPYDHWSDKILPVYEQGSHFQPSTVEMV DGETSPPKLLTEADLIALMEKHGIGTDATHAEHIETIKARMYVGLTPDKRFLPGHLGM GLVEGYDSMGYEMSKPDLRAELEADLKLICDGKKDKFVVLRQQVQKYKQVFIEAVAKA KKLDEALAQYFGNGTELAQQEDIYPAMPEPIRKCPQCNKDMVLKTKKNGGFYLSCMGF PECRSAVWLPDSVLEASRDSSVCPVCQPHPVYRLKLKFKRGSLPPTMPLEFVCCIGGC DDTLREILDLRFSGGPPRASQPSGRLQANQSLNRMDNSQHPQPADSRQTGSSKALAQT LPPPTAAGESNSVTCNCGQEAVLLTVRKEGPNRGRQFFKCNGGSCNFFLWADSPNPGA GGPPALAYRPLGASLGCPPGPGIHLGGFGNPGDGSGSGTSCLCSQPSVTRTVQKDGPN KGRQFHTCAKPREQQCGFFQWVDENTAPGTSGAPSWTGDRGRTLESEARSKRPRASSS DMGSTAKKPRKCSLCHQPGHTRPFCPQNR" CDS 253..3183 /gene="TOP3" /note="type I DNA topoisomerase, belonging to E. coli topoisomerase I/yeast topoisomerase III subfamily" /codon_start=1 /product="DNA topoisomerase III" /db_xref="PID:g1292913" /translation="MEMALRGVRKVLCVAEKNDAAKGIADLLSNGRMRRREGLSKFNK IYEFDYHLYGQNVTMVMTSVSGHLLAHDFQMQFRKWQSCNPLVLFEAEIEKYCPENFV DIKKTLERETRQCQALVIWTDCDREGENIGFEIIHVCKAVKPNLQVLRARFSEITPHA VRTACENLTEPDQRVSDAVDVRQELDLRIGAAFTRFQTLRLQRIFPEVLAEQLISYGS CQFPTLGFVVERFKAIQAFVPEIFHRIKVTHDHKDGIVEFNWKRHRLFNHTACLVLYQ LCVEDPMATVVEVRSKPKSKWRPQALDTVELEKLASRKLRINAKETMRIAEKLYTQGY ISYPRTETNIFPRDLNLTVLVEQQTPDPRWGAFAQSILERGGPTPRNGNKSDQAHPPI HPTKYTNNLQGDEQRLYEFIVRHFLACCSQDAQGQETTVEIDIAQERFVAHGLMILAR NYLDVYPYDHWSDKILPVYEQGSHFQPSTVEMVDGETSPPKLLTEADLIALMEKHGIG TDATHAEHIETIKARMYVGLTPDKRFLPGHLGMGLVEGYDSMGYEMSKPDLRAELEAD LKLICDGKKDKFVVLRQQVQKYKQVFIEAVAKAKKLDEALAQYFGNGTELAQQEDIYP AMPEPIRKCPQCNKDMVLKTKKNGGFYLSCMGFPECRSAVWLPDSVLEASRDSSVCPV CQPHPVYRLKLKFKRGSLPPTMPLEFVCCIGGCDDTLREILDLRFSGGPPRASQPSGR LQANQSLNRMDNSQHPQPADSRQTGSSKALAQTLPPPTAAGESNSVTCNCGQEAVLLT VRKEGPNRGRQFFKCNGGSCNFFLWADSPNPGAGGPPALAYRPLGASLGCPPGPGIHL GGFGNPGDGSGSGTSCLCSQPSVTRTVQKDGPNKGRQFHTCAKPREQQCGFFQWVDEN TAPGTSGAPSWTGDRGRTLESEARSKRPRASSSDMGSTAKKPRKCSLCHQPGHTRPFC PQNR" variation replace(1964,"a") /gene="TOP3" variation replace(3354,"c") /gene="TOP3" polyA_signal 3736..3741 /gene="TOP3" BASE COUNT 896 a 1020 c 1061 g 778 t ORIGIN 1 ggcggctgcg gcacgggaaa ggctcagtga ctgaagctcc aaaggccagc aggctggtgg 61 ggacgtgacc gaagcgaggc tctggttccc tttcggtggg cgccatttga gcctcatctc 121 tggcttcccc aggatgcgcc ggcagccggg gagcggctcc gggcgcgagg tctgaggatg 181 atctttcctg tcgcccgcta cgcgctccgg tggctgcgac ggcccgaaga ccgtgccttt 241 tcccgcgccg ccatggagat ggccctccga ggcgtgcgga aagtcctctg tgtggccgaa 301 aaaaacgacg cggccaaggg gatcgccgac ctgctgtcaa acggtcgcat gaggcggaga 361 gaaggacttt caaaattcaa caagatctat gaatttgatt atcatctgta tggccagaat 421 gttaccatgg taatgacttc agtttctgga catttactgg ctcatgattt ccagatgcag 481 tttcgaaaat ggcagagctg caaccctctt gtcctctttg aagcagaaat tgaaaagtac 541 tgcccagaga attttgtaga catcaagaaa actttggaac gagagactcg ccagtgccag 601 gctctggtga tctggactga ctgtgataga gaaggcgaaa acatcgggtt tgagattatc 661 cacgtgtgta aggctgtaaa gcccaatctg caggtgttgc gagcccgatt ctctgagatc 721 acaccccatg ccgtcaggac agcttgtgaa aacctgaccg agcctgatca gagggtgagc 781 gatgctgtgg atgtgaggca ggagctggac ctgaggattg gagctgcctt tactaggttc 841 cagaccctgc ggcttcagag gatttttcct gaggtgctgg cagagcagct catcagttac 901 ggcagctgcc agttccccac actgggcttt gtggtggagc ggttcaaagc cattcaggct 961 tttgtaccag aaatcttcca cagaattaaa gtaactcatg accacaaaga tggtatcgta 1021 gaattcaact ggaaaaggca tcgactcttt aaccacacgg cttgcctagt tctctatcag 1081 ttgtgtgtgg aggatcccat ggcaactgtg gtagaggtca gatctaagcc caagagcaag 1141 tggcggcctc aagccttgga cactgtggag cttgagaagc tggcttctcg aaagttgaga 1201 ataaatgcta aagaaaccat gaggattgct gagaagctct acactcaagg gtacatcagc 1261 tatccccgaa cagaaacaaa catttttccc agagacttaa acctgacggt gttggtggaa 1321 cagcagaccc ccgatccacg ctggggggcc tttgcccaga gcattctaga gcggggtggt 1381 cccaccccac gcaatgggaa caagtctgac caagctcacc ctcccattca ccccaccaaa 1441 tacaccaaca acttacaggg agatgaacag cgactgtacg agtttattgt tcgccatttc 1501 ctggcttgct gctcccagga tgctcagggg caggagacca cagtggagat cgacatcgct 1561 caggaacgct ttgtggccca tggcctcatg attctggccc gaaactatct ggatgtgtat 1621 ccatatgatc actggagtga caagatcctc cctgtctatg agcaaggatc ccactttcag 1681 cccagcaccg tggagatggt ggacggggag accagcccac ccaagctgct caccgaggcc 1741 gacctcattg ccctcatgga gaagcatggc attggtacgg atgccactca tgcggagcac 1801 atcgagacca tcaaagcccg gatgtacgtg ggcctcaccc cagacaagcg gttcctccct 1861 gggcacctgg gcatgggact tgtggaaggt tatgattcca tgggctatga aatgtctaag 1921 cctgacctcc gggctgaact ggaagctgat ctgaagctga tctgtgatgg caaaaaggac 1981 aaatttgtgg ttctaaggca gcaagtgcag aaatacaagc aggttttcat tgaagcggtg 2041 gctaaagcaa agaaattgga cgaggccttg gcccagtact ttgggaatgg gacagagttg 2101 gcccagcaag aagatatcta cccagccatg ccagagccca tcaggaagtg cccacagtgc 2161 aacaaggaca tggtccttaa gaccaagaag aatggcgggt tctacctcag ctgcatgggt 2221 ttcccagagt gtcgctcagc tgtgtggctt cctgactcgg tgctggaggc cagcagggac 2281 agcagtgtgt gtccagtttg tcagccacac cctgtgtaca ggttaaagtt aaagtttaag 2341 cgcggtagcc ttcccccgac catgcctctg gagtttgttt gctgcatcgg cggatgcgac 2401 gacaccctga gggagatcct ggacctgaga ttttcagggg gcccccccag ggctagccag 2461 ccctctggcc gcctgcaggc taaccagtcc ctgaacagga tggacaacag ccagcacccc 2521 cagcctgctg acagcagaca gactgggtcc tcaaaggctc tggcccagac cctcccacca 2581 cccacggctg ctggtgaaag caattctgtg acctgcaact gtggccagga ggctgtgctg 2641 ctcactgtcc gtaaggaggg ccccaaccgg ggccggcagt tctttaagtg caacggaggt 2701 agctgcaact tcttcctgtg ggcagacagc cccaatccgg gagcaggagg gcctcctgcc 2761 ttggcatata gacccctggg cgcctccctg ggatgcccac caggcccagg gatccaccta 2821 ggtgggtttg gcaaccctgg tgatggcagt ggtagtggca catcctgcct ttgcagccag 2881 ccctccgtca cacggactgt gcagaaggat ggacccaaca aggggcgcca gttccacaca 2941 tgtgccaagc cgagagagca gcagtgtggc tttttccagt gggtcgatga gaacaccgct 3001 ccagggactt ctggagcccc gtcctggaca ggagacagag gaagaaccct ggagtcggaa 3061 gccagaagca aaaggccccg ggccagttcc tcagacatgg ggtccacagc aaagaaaccc 3121 cggaaatgca gcctttgcca ccagcctgga cacacccgtc ccttttgtcc tcagaacaga 3181 tgagctcagg gtagggtaga gaacgccact ttctcagacc tgtccccttt gtgtttagaa 3241 atgagttaac caggaccaag tggccattta gtgtcctgga aacttagagg acagtgttgg 3301 cctttggagt cgggccttct tgtgttaagg ggcacaaggt ccagatcact ctggagcagg 3361 ccagctctgc tggacagtga ccctcttccc aggcctcagg agtgaccata gccactgctg 3421 aaaagtcacg cagctgctcc ctcggacccc ccaaggatgg ttgctgttag cagaggattg 3481 gtgcagtccc agctgaagcc cactgtgtgc caaaggaaga agctcccagg gctgcttcct 3541 tcacctgcag aaagccccaa gtgagccacc agcactcatg gggcagtccc tgtccaggct 3601 gcccagggct tctcatagac gtcctgagaa ggacggtgta atgcaaggaa atggctgtgg 3661 taacactgat ccttcagaag aagcttcatt ccctcttaat ctagttaagc caggacatcc 3721 agaattcatt gctttaataa agaacccagg ccggg // LOCUS HSU43519 3499 bp mRNA PRI 15-JUN-1996 DEFINITION Human dystrophin-related protein 2 (DRP2) mRNA, complete cds. ACCESSION U43519 NID g1353781 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 263 to 3189) AUTHORS Roberts,R.G., Freeman,T.C., Kendall,E., Vetrie,D.L., Dixon,A.K., Shaw-Smith,C., Bone,Q. and Bobrow,M. TITLE Characterization of DRP2, a novel human dystrophin homologue JOURNAL Nature Genet. 13 (2), 223-226 (1996) MEDLINE 96225452 REFERENCE 2 (bases 1 to 3499) AUTHORS Roberts,R.G. TITLE Direct Submission JOURNAL Submitted (16-DEC-1995) Roland G. Roberts, Medical Genetics, Addenbrooke's Hospital, 3rd Floor, Lab Block, Addenbrooke's Hospital, Hills Road, Cambridge, Cambs Cb2 2Qq, UK FEATURES Location/Qualifiers source 1..3499 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Xq22" /chromosome="X" 5'UTR <1..262 gene 328..3192 /gene="DRP2" CDS 328..3192 /gene="DRP2" /function="membrane cytoskeleton" /codon_start=1 /product="dystrophin-related protein 2" /db_xref="PID:g1353782" /translation="MVMQGCPYTLPRCHDWQAADQFHHSSSLRSTCPHPQVRAAVTSP APPQDGAGVPCLSLKLLNGSVGASGPLEPPAMNLCWNEIKKKSHNLRARLEAFSDHSG KLQLPLQEIIDWLSQKDEELSAQLPLQGDVALVQQEKETHAAFMEEVKSRAPYIYSVL ESAQAFLSQHPFEELEEPHSESKDTSPKQRIQNLSRFVWKQATVASELWEKLTARCVD QHRHIERTLEQLLEIQGAMEELSTTLSQAEGVRATWEPIGDLFIDSLPEHIQAIKLFK EEFSPMKDGVKLVNDLAHQLAISDVHLSMENSQALEQINVRWKQLQASVDERLKQLQD AHRDFGPGSQHFLSSSVQVPWERAISPNKVPYYINHQAQTTCWDHPKMTELYQTLADL NNIKFSAYRTAMKLRRVQKALRLDLVTLTTALEIFNEHDLQASEHVMDVVEVIHCLTA LYERLEEERGILVNVPLCVDMSLNWLLNVFDSGRSGKMRALSFKTGIACLCGTEVKEK LQYLFSQVANSGSQCDQRHLGVLLHEAIQVPRQLGEVAAFGGSNVEPSVRSCFRFSTG KPVIEASQFLEWVNLEPQSMVWLPVLHRVTIAEQVKHQTKCSICRQCPIKGFRYRSLK QFNVDICQTCFLTGRASKGNKLHYPIMEYYTPTTSSENMRDFATTLKNKFRSKHYFSK HPQRGYLPVQSVLEADYSETPASSPMWPHADTHSRIEHFASRLAEMESQNCSFFNDSL SPDDSIDEDQYLLRHSSPITDREPAFGQQAPCSVATESKGELQKILAHLEDENRILQG ELRRLKWQHEEAAEAPSLADGSTEAATDHRNEELLAEARILRQHKSRLETRMQILEDH NKQLESQLQRLRELLLQPPTESDGSGSAGSSLASSPQQSEGSHPREKGQTTPDTEAAD DVGSKSQDVSLCLEDIMEKLRHAFPSVRSSDVTANTLLAS" 3'UTR 3193..>3499 BASE COUNT 856 a 964 c 913 g 766 t ORIGIN 1 gggaagtggg agagacagag ggagagtgtg catacagcag ctgtcgggca tccctgtctc 61 agggacttct tcgctgattc acagaggcag cccagccccg gcctcccagc ttacactgcc 121 gcctgcttct cagaacctga cggaatcaga agctacatga tattcgccgt gtgagtggct 181 atcgtaatag ataacatttg aatatctact aaatgccaga aactatgaat taactcctct 241 caacaaagct aagaggtgct tgatgatcaa tagttggtgc actgcctatc ctcatccccc 301 ccatgagcct tggtttttat gcaacctatg gtcatgcagg gatgccctta caccctccca 361 cgatgtcatg actggcaggc agctgaccag ttccatcata gcagcagcct ccgaagcacc 421 tgcccccacc ctcaggttag agctgctgtc accagccctg cacctcctca agatggtgct 481 ggggttccct gcctaagcct aaagctgttg aacgggtctg ttggtgcctc tggacccctg 541 gaaccaccag ccatgaatct gtgttggaat gaaataaaaa agaagtctca caacctccgc 601 gctcgcctag aggccttctc agaccacagt ggaaagcttc agctccctct tcaagagatt 661 attgactggc tcagccaaaa ggatgaggag ttgtcagctc agctgcccct acagggggat 721 gtggccctgg tgcaacagga gaaggagaca catgcggcct ttatggaaga agtcaagtct 781 cgggccccct acatctattc tgtgctggag tcagctcagg ccttcctgtc ccagcaccca 841 tttgaggagt tagaggagcc tcattctgag agcaaagata cctccccgaa acagcggatc 901 cagaatctca gccgctttgt atggaagcag gcgacggtgg ccagtgaact gtgggagaag 961 ttgacagccc gctgtgtgga ccagcaccgt cacattgagc ggactctgga gcagctcttg 1021 gagattcaag gggcaatgga ggaactaagc actactctaa gccaagctga gggagtccga 1081 gccacttggg agcccattgg ggatctcttc attgattcac tcccagagca catccaggct 1141 attaagctgt tcaaagaaga attctccccc atgaaagatg gagtaaagtt ggtgaatgat 1201 ctggcccacc aacttgccat ttctgatgtg cacttgtcaa tggagaattc ccaggccctg 1261 gaacagatca acgtccgatg gaaacaacta caggcgtcag ttgatgagag gcttaagcag 1321 ctccaggatg cccaccggga ctttgggcct gggtcacagc actttctctc ctcctctgtc 1381 caggttccct gggaaagagc aatttcaccc aataaagttc cctactacat caaccaccag 1441 gctcagacca catgctggga ccatcccaag atgacagagt tataccaaac cctagctgat 1501 ctgaacaaca ttaagttctc agcttatcgc actgccatga aactccgcag agtccagaaa 1561 gccctgcgct tggacctggt aactttaacc acagccctgg aaatcttcaa tgagcatgat 1621 ctgcaggcca gtgagcacgt gatggatgtg gtagaggtca ttcactgcct gactgcctta 1681 tatgaacgtt tggaggagga aagaggcatc ctggtcaatg tgccactctg tgtggacatg 1741 agcctcaatt ggctcctcaa tgtttttgat agtggtcgca gcggaaagat gcgggcattg 1801 tcttttaaga ctggcattgc atgcttgtgt ggcacggaag tgaaggaaaa acttcagtac 1861 ctcttcagcc aagtggccaa ctcaggcagc cagtgtgacc agcgccacct tggtgtcctg 1921 cttcatgagg ccattcaggt gccccgtcag ctgggtgaag tggcagcctt tgggggcagc 1981 aatgtggagc ccagtgtccg tagttgcttc cgttttagca ccgggaagcc agtcattgaa 2041 gcatcccagt tcctggagtg ggtcaacctg gagccccagt ccatggtgtg gctgcctgtt 2101 ctgcatcggg taaccattgc tgagcaagtg aagcatcaga ccaagtgctc tatctgtagg 2161 cagtgcccca tcaaggggtt caggtaccgg agtctgaagc aattcaacgt tgacatctgc 2221 cagacctgct tcttgacagg cagggccagc aaaggcaata agctgcacta ccccatcatg 2281 gagtattaca caccgaccac atccagtgag aacatgaggg actttgccac aaccttaaag 2341 aacaaattcc gctccaagca ttatttcagc aaacaccctc agcgaggtta tctgcctgtg 2401 caatcagtgc tggaggctga ctacagtgag acgccggctt cttccccgat gtggccacac 2461 gccgacacac actcccgaat tgagcatttt gcgagcaggc ttgctgagat ggaaagtcaa 2521 aattgctcct tctttaatga cagcttgtcc ccagatgaca gcatagacga ggaccagtac 2581 ctgctgcggc actccagccc catcacagac cgggagccag cctttggaca gcaggctcca 2641 tgcagtgtgg ctacagaaag caaaggggag ctacagaaga tcctggccca cttggaagat 2701 gagaaccgga ttctccaggg agagctgagg cgcctgaagt ggcagcatga ggaggcagct 2761 gaggcaccca gtctggctga cggctccact gaggcagcaa cagaccaccg caatgaggag 2821 cttctggccg aggcccgtat ccttcggcaa cataagagcc gcctggagac gcgcatgcag 2881 atcctcgaag atcacaacaa gcagctagag tcccagctgc agcgtctgag ggagcttctc 2941 ctgcagccac ccaccgaatc agatggcagt ggctctgcag gctcgtccct agcttcctct 3001 ccacagcagt cagaaggcag tcacccccgg gagaagggac agaccactcc agataccgag 3061 gctgcagatg atgtggggtc aaagagccag gatgtcagcc tgtgcttgga ggacatcatg 3121 gagaaactcc gtcatgcctt ccccagtgtg cgaagttctg atgtgactgc caacaccctg 3181 ctggcctctt gatggagcca gatccccatc ctatagttca tagtcctctc ctggttccgg 3241 tcaaagcctt tcctcagcct tcacccaacc tttccagttt ccactggccc cacattcctc 3301 aactagtatt atttgggctc tgggcagcag cagggatctg gtggtatgtg aggtgcatgc 3361 gggcagtgat gggagaaggg gaggcatgat tcttctgacc ctagaaatgt tccctttaat 3421 cttcaagttc gagatcagcc ctttaagtac ctttctgttg cagcccaggc aaatatcact 3481 tggccattca gatggggag // LOCUS HSU43527 771 bp mRNA PRI 28-AUG-1997 DEFINITION Human malignant melanoma metastasis-suppressor (KiSS-1) gene, mRNA, complete cds. ACCESSION U43527 NID g2347057 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 771) AUTHORS Lee, J.-H., Welch and D.R. TITLE KiSS-1 and a novel human malignant melanoma metastasis-suppressor gene JOURNAL Unpublished (1996) REFERENCE 2 (bases 1 to 771) AUTHORS Welch,D.R. and Lee,J.-H. TITLE Direct Submission JOURNAL Submitted (18-DEC-1995) Danny R. Welch, Pathology, Penn State College of Medicine, 500 University Drive, Hershey, PA 77030-0850, USA FEATURES Location/Qualifiers source 1..771 /organism="Homo sapiens" /note="identified using subtraction hybridization; this clone was derived from a hybrid of the highly metastatic melanoma cell line C8161 and human chromosome 6" /db_xref="taxon:9606" /cell_line="neo6/C8161.1" /chromosome="1" /map="1q" gene 1..771 /gene="KiSS-1" CDS 212..649 /gene="KiSS-1" /codon_start=1 /product="malignant melanoma metastasis-suppressor" /db_xref="PID:g2347058" /translation="MNSLVSWQLLLFLCATHFGEPLEKVASVGNSRPTGQQLESLGLL APGEQSLPCTERKPAATARLSRRGTSLSPPPESSGSRQQPGLSAPHSRQIPAPQGAVL VQREKDLPNYNWNSFGLRFGKREAAPGNHGRSAGRGWGAGAGQ" polyA_site 771 /gene="KiSS-1" /note="16 A nucleotides" BASE COUNT 136 a 261 c 237 g 137 t ORIGIN 1 ctctctctct ctctctctct ctctctctct ctctctctct ctctctctct ctctctctct 61 cctcgtgccg aattcggcac gaggctgccc accctctgga cattcaccca gccaggtggt 121 ctcgtcacct cagaggctcc gcagactcct gcccaggcca ggactgaggc aagcctcaag 181 gcacttctag gacctggctc ttctcaccaa gatgaactca ctggtttctt ggcagctact 241 gcttttcctc tgtgccaccc actttgggga gccattagaa aaggtggcct ctgtggggaa 301 ttctagaccc acaggccagc agctagaatc cctgggcctc ctggcccccg gggagcagag 361 cctgccgtgc accgagagga agccagctgc tactgccagg ctgagccgtc gggggacctc 421 gctgtccccg ccccccgaga gctccgggag ccgccagcag ccgggcctgt ccgcccccca 481 cagccgccag atccccgcac cccagggcgc ggtgctggtg cagcgggaga aggacctgcc 541 gaactacaac tggaactcct tcggcctgcg cttcggcaag cgggaggcgg caccagggaa 601 ccacggcaga agcgctgggc ggggctgggg cgcaggtgcg gggcagtgaa cttcagaccc 661 caaaggagtc agagcatgcg gggcgggggc ggggtggggg ggacgtaggg ctaagggagg 721 gggcgctgga gcttccaacc cgaggcaata aaagaaatgt tgcgtaactc a // LOCUS HSU43559 1128 bp mRNA PRI 09-OCT-1996 DEFINITION Human 11-cis retinol dehydrogenase mRNA, complete cds. ACCESSION U43559 NID g1616653 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1128) AUTHORS Simon,A., Hellman,U., Wernstedt,C. and Eriksson,U. TITLE The retinal pigment epithelial-specific 11-cis retinol dehydrogenase belongs to the family of short chain alcohol dehydrogenases JOURNAL J. Biol. Chem. 270 (3), 1107-1112 (1995) MEDLINE 95138097 REFERENCE 2 (bases 1 to 1128) AUTHORS Simon,A., Lagercrantz,J., Bajalica-Lagercrantz,S. and Eriksson,U. TITLE Primary structure of human 11-cis retinol dehydrogenase and organization and chromosomal localization of the corresponding gene JOURNAL Genomics 36 (3), 424-430 (1996) MEDLINE 97038684 REFERENCE 3 (bases 1 to 1128) AUTHORS Simon,A. and Eriksson,U. TITLE Direct Submission JOURNAL Submitted (18-DEC-1995) Ulf Eriksson, Ludwig Institute for Cancer Research, Karolinska Institutet, Doktorsringen 12A, Stockholm S-171 77, Sweden FEATURES Location/Qualifiers source 1..1128 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="12" /map="12q13-14" CDS 77..1033 /function="oxidizes 11-cis retinol into 11-cis retinaldehyde" /note="11-cis RDH" /codon_start=1 /product="11-cis retinol dehydrogenase" /db_xref="PID:g1616654" /translation="MWLPLLLGALLWAVLWLLRDRQSLPASNAFVFITGCDSGFGRLL ALQLDQRGFRVLASCLTPSGAEDLQRVASSRLHTTLLDITDPQSVQQAAKWVEMHVKE AGLFGLVNNAGVAGIIGPTPWLTRDDFQRVLNVNTMGPIGVTLALLPLLQQARGRVIN ITSVLGRLAANGGGYCVSKFGLEAFSDSLRRDVAHFGIRVSIVEPGFFRTPVTNLESL EKTLQACWARLPPATQAHYGGAFLTKYLKMQQRIMNLICDPDLTKVSRCLEHALTARH PRTRYSPGWDAKLLWLPASYLPASLVDAVLTWVLPKPAQAVY" BASE COUNT 202 a 359 c 322 g 245 t ORIGIN 1 taagcttcgg gcgctgtagt acctgccagc tttcgccaca ggaggctgcc acctgtaggt 61 cacttgggct ccagctatgt ggctgcctct tctgctgggt gccttactct gggcagtgct 121 gtggttgctc agggaccggc agagcctgcc cgccagcaat gcctttgtct tcatcaccgg 181 ctgtgactca ggctttgggc gccttctggc actgcagctg gaccagagag gcttccgagt 241 cctggccagc tgcctgaccc cctccggggc cgaggacctg cagcgggtgg cctcctcccg 301 cctccacacc accctgttgg atatcactga tccccagagc gtccagcagg cagccaagtg 361 ggtggagatg cacgttaagg aagcagggct ttttggtctg gtgaataatg ctggtgtggc 421 tggtatcatc ggacccacac catggctgac ccgggacgat ttccagcggg tgctgaatgt 481 gaacacaatg ggtcccatcg gggtcaccct tgccctgctg cctctgctgc agcaagcccg 541 gggccgggtg atcaacatca ccagcgtcct gggtcgcctg gcagccaatg gtgggggcta 601 ctgtgtctcc aaatttggcc tggaggcctt ctctgacagc ctgaggcggg atgtagctca 661 ttttgggata cgagtctcca tcgtggagcc tggcttcttc cgaacccctg tgaccaacct 721 ggagagtctg gagaaaaccc tgcaggcctg ctgggcacgg ctgcctcctg ccacacaggc 781 ccactatggg ggggccttcc tcaccaagta cctgaaaatg caacagcgca tcatgaacct 841 gatctgtgac ccggacctaa ccaaggtgag ccgatgcctg gagcatgccc tgactgctcg 901 acacccccga acccgctaca gcccaggttg ggatgccaag ctgctctggc tgcctgcctc 961 ctacctgcca gccagcctgg tggatgctgt gctcacctgg gtccttccca agcctgccca 1021 agcagtctac tgaatccagc cttccagcaa gagattgttt ttcaaggaca aggactttga 1081 tttatttctg cccccaccct ggtactgcct ggtgcctgcc acaaaata // LOCUS HSU43628 1598 bp mRNA PRI 12-JUN-1996 DEFINITION Human mucosal addressin cell adhesion molecule-1 (MAdCAM-1) mRNA, complete cds. ACCESSION U43628 NID g1244767 KEYWORDS . SOURCE Homo sapiens. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1598) AUTHORS Shyjan,A.M., Bertagnolli,M., Kenney,C.J. and Briskin,M.J. TITLE Human mucosal addressin cell adhesion molecule-1 (MAdCAM-1) demonstrates structural and functional similarities to the alpha 4 beta 7-integrin binding domains of murine MAdCAM-1, but extreme divergence of mucin-like sequences JOURNAL J. Immunol. 156 (8), 2851-2857 (1996) MEDLINE 96183239 REFERENCE 2 (bases 1 to 1598) AUTHORS Briskin,M.J. TITLE Direct Submission JOURNAL Submitted (19-DEC-1995) Michael J. Briskin, Molecular Biology, LeukoSite Inc., 215 First St., Cambridge, MA 02142, USA FEATURES Location/Qualifiers source 1..1598 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="4" /clone_lib="lambda ziplox" /tissue_type="mesenteric lymph node" gene 1..1221 /gene="MAdCAM-1" CDS 1..1221 /gene="MAdCAM-1" /codon_start=1 /product="mucosal addressin cell adhesion molecule-1" /db_xref="PID:g1244768" /translation="MDFGLALLLAGLLGLLLGQSLQVKPLQVEPPEPVVAVALGASRQ LTCRLACADRGASVQWRGLDTSLGAVQSDTGRSVLTVRNASLSAAGTRVCVGSCGGRT FQHTVQLLVYAFPDQLTVSPAALVPGDPEVACTAHKVTPVDPNALSFSLLVGGQELEG AQALGPEVQEEEEEPQGDEDVLFRVTERWRLPPLGTPVPPALYCQATMRLPGLELSHR QAIPVLHSPTSPEPPDTTSPEPPNTTSPESPDTTSPESPDTTSQEPPDTTSQEPPDTT SQEPPDTTSPEPPDKTSPEPAPQQGSTHTPRSPGSTRTRRPEISQAGPTQGEVIPTGS SKPAGDQLPAALWTSSAVLGLLLLALPTYHLWKRCRHLAEDDTHPPASLRLLPQVSAW AGLRGTGQVGISPS" BASE COUNT 269 a 610 c 454 g 265 t ORIGIN 1 atggatttcg gactggccct cctgctggcg gggcttctgg ggctcctcct cggccagtcc 61 ctccaggtga agcccctgca ggtggagccc ccggagccgg tggtggccgt ggccttgggc 121 gcctcgcgcc agctcacctg ccgcctggcc tgcgcggacc gcggggcctc ggtgcagtgg 181 cggggcctgg acaccagcct gggcgcggtg cagtcggaca cgggccgcag cgtcctcacc 241 gtgcgcaacg cctcgctgtc ggcggccggg acccgcgtgt gcgtgggctc ctgcgggggc 301 cgcaccttcc agcacaccgt gcagctcctt gtgtacgcct tcccggacca gctgaccgtc 361 tccccagcag ccctggtgcc tggtgacccg gaggtggcct gtacggccca caaagtcacg 421 cccgtggacc ccaacgcgct ctccttctcc ctgctcgtcg ggggccagga actggagggg 481 gcgcaagccc tgggcccgga ggtgcaggag gaggaggagg agccccaggg ggacgaggac 541 gtgctgttca gggtgacaga gcgctggcgg ctgccgcccc tggggacccc tgtcccgccc 601 gccctctact gccaggccac gatgaggctg cctggcttgg agctcagcca ccgccaggcc 661 atccccgtcc tgcacagccc gacctccccg gagcctcccg acaccacctc cccggagcct 721 cccaacacca cctccccgga gtctcccgac accacctccc cggagtctcc cgacaccacc 781 tcccaggagc ctcccgacac cacctcccag gagcctcccg acaccacctc ccaggagcct 841 cccgacacca cctccccgga gcctcccgac aagacctccc cggagcccgc cccccagcag 901 ggctccacac acacccccag gagcccaggc tccaccagga ctcgccgccc tgagatctcc 961 caggctgggc ccacgcaggg agaagtgatc ccaacaggct cgtccaaacc tgcgggtgac 1021 cagctgcccg cggctctgtg gaccagcagt gcggtgctgg gactgctgct cctggccttg 1081 cccacgtatc acctctggaa acgctgccgg cacctggctg aggacgacac ccacccacca 1141 gcttctctga ggcttctgcc ccaggtgtcg gcctgggctg ggttaagggg gaccggccag 1201 gtcgggatca gcccctcctg agtggccagc ctttccccct gtgaaagcaa aatagcttgg 1261 accccttcaa gttgagaact ggtcagggca aacctgcctc ccattctact caaagtcatc 1321 cctctgctca cagagatgga tgcatgttct gattgcctct ttggagaagc tcatcagaaa 1381 ctcaaaagaa ggccactgtt tgtctcacct acccatgacc tgaagcccct ccctgagtgg 1441 tccccacctt tctggacgga accacgtact ttttacatac attgattcat gtctcacgtc 1501 tccctaaaaa tgcgtaagac caagctgtgc cctgaccacc ctgggcccct gtcgtcagga 1561 cctcctgagg ctttggcaaa taaacctcct aaaatgat // LOCUS HSU43672 3522 bp mRNA PRI 28-FEB-1996 DEFINITION Human putative transmembrane receptor IL-1Rrp mRNA, complete cds. ACCESSION U43672 NID g1206008 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3522) AUTHORS Parnet,P., Garka,K.E., Bonnert,T.P., Dower,S.K. and Sims,J.E. TITLE IL-1Rrp is a novel receptor-like molecule similar to the type I interleukin-1 receptor and its homologues T1/ST2 and IL-1R AcP JOURNAL J. Biol. Chem. 271 (8), 3967-3970 (1996) MEDLINE 96223957 REFERENCE 2 (bases 1 to 3522) AUTHORS Sims,J.E. TITLE Direct Submission JOURNAL Submitted (20-DEC-1995) John E. Sims, Molecular Genetics, Immunex Corporation, 51 University Street, Seattle, WA 98101, USA FEATURES Location/Qualifiers source 1..3522 /organism="Homo sapiens" /note="clones isolated from peripheral blood lymphocytes and from KB epidermal carcinoma cells" /db_xref="taxon:9606" /chromosome="2" /map="2q12-22" sig_peptide 25..81 CDS 25..1650 /note="homolog of type I IL-1 receptor; putative transmembrane receptor" /codon_start=1 /product="IL-1Rrp" /db_xref="PID:g1206009" /translation="MNCRELPLTLWVLISVSTAESCTSRPHITVVEGEPFYLKHCSCS LAHEIETTTKSWYKSSGSQEHVELNPRSSSRIALHDCVLEFWPVELNDTGSYFFQMKN YTQKWKLNVIRRNKHSCFTERQVTSKIVEVKKFFQITCENSYYQTLVNSTSLYKNCKK LLLENNKNPTIKKNAEFEDQGYYSCVHFLHHNGKLFNITKTFNITIVEDRSNIVPVLL GPKLNHVAVELGKNVRLNCSALLNEEDVIYWMFGEENGSDPNIHEEKEMRIMTPEGKW HASKVLRIENIGESNLNVLYNCTVASTGGTDTKSFILVRKADMADIPGHVFTRGMIIA VLILVAVVCLVTVCVIYRVDLVLFYRHLTRRDETLTDGKTYDAFVSYLKECRPENGEE HTFAVEILPRVLEKHFGYKLCIFERDVVPGGAVVDEIHSLIEKSRRLIIVLSKSYMSN EVRYELESGLHEALVERKIKIILIEFTPVTDFTFLPQSLKLLKSHRVLKWKADKSLSY NSRFWKNLLYLMPAKTVKPGRDEPEVLPVLSES" misc_feature 1012..1077 /note="encodes transmembrane region" BASE COUNT 1123 a 615 c 759 g 1025 t ORIGIN 1 gccatttgaa gcagaatcca aaccatgaat tgtagagaat tacccttgac cctttgggtg 61 cttatatctg taagcactgc agaatcttgt acttcacgtc cccacattac tgtggttgaa 121 ggggaacctt tctatctgaa acattgctcg tgttcacttg cacatgagat tgaaacaacc 181 accaaaagct ggtacaaaag cagtggatca caggaacatg tggagctgaa cccaaggagt 241 tcctcgagaa ttgctttgca tgattgtgtt ttggagtttt ggccagttga gttgaatgac 301 acaggatctt actttttcca aatgaaaaat tatactcaga aatggaaatt aaatgtcatc 361 agaagaaata aacacagctg tttcactgaa agacaagtaa ctagtaaaat tgtggaagtt 421 aaaaaatttt ttcagataac ctgtgaaaac agttactatc aaacactggt caacagcaca 481 tcattgtata agaactgtaa aaagctacta ctggagaaca ataaaaaccc aacgataaag 541 aagaacgccg agtttgaaga tcaggggtat tactcctgcg tgcatttcct tcatcataat 601 ggaaaactat ttaatatcac caaaaccttc aatataacaa tagtggaaga tcgcagtaat 661 atagttccgg ttcttcttgg accaaagctt aaccatgttg cagtggaatt aggaaaaaac 721 gtaaggctca actgctctgc tttgctgaat gaagaggatg taatttattg gatgttcggg 781 gaagaaaatg gatcggatcc taatatacat gaagagaaag aaatgagaat tatgactcca 841 gaaggcaaat ggcatgcttc aaaagtattg agaattgaaa atattggtga aagcaatcta 901 aatgttttat ataattgcac tgtggccagc acgggaggca cagacaccaa aagcttcatc 961 ttggtgagaa aagcagacat ggctgatatc ccaggccacg tcttcacaag aggaatgatc 1021 atagctgttt tgatcttggt ggcagtagtg tgcctagtga ctgtgtgtgt catttataga 1081 gttgacttgg ttctatttta tagacattta acgagaagag atgaaacatt aacagatgga 1141 aaaacatatg atgcttttgt gtcttaccta aaagaatgcc gacctgaaaa tggagaggag 1201 cacacctttg ctgtggagat tttgcccagg gtgttggaga aacattttgg gtataagtta 1261 tgcatatttg aaagggatgt agtgcctgga ggagctgttg ttgatgaaat ccactcactg 1321 atagagaaaa gccgaagact aatcattgtc ctaagtaaaa gttatatgtc taatgaggtc 1381 aggtatgaac ttgaaagtgg actccatgaa gcattggtgg aaagaaaaat taaaataatc 1441 ttaattgaat ttacacctgt tactgacttc acattcttgc cccaatcact aaagcttttg 1501 aaatctcaca gagttctgaa gtggaaggcc gataaatctc tttcttataa ctcaaggttc 1561 tggaagaacc ttctttactt aatgcctgca aaaacagtca agccaggtag agacgaaccg 1621 gaagtcttgc ctgttctttc cgagtcttaa tcttcagaaa cagtgaacgc caaaaagaac 1681 tcaagatatt ctggggactg agcatatgaa cctgttcata acaaaggctg tgactcgaaa 1741 taattaactt tgtcaaaatc ctgctcacaa tttgaagatg aaacttgtca ttaggttggc 1801 gggaatgaga ctaaagattg cgctgtgggc tgtggtcacg tgctcccaga agacctggaa 1861 ttcaaaagaa atggagctat tctttttctc cctctttcat aactggatgc agctgctcat 1921 actcaatccc atattcagca agtgtgaagc tggacgtgat gcaaaataac cgatgcccta 1981 caaaaagggc gcatctttaa gagttttaat gccagtgctt aattcgaatg aggggatttt 2041 aagtgtctga agaggcattt tctagggacc agtgggtgac tgagtaactg aaatgctgct 2101 ttcactccct aacaccatgg atctggttgt gcataggatg tgggaggagg ggctggcagg 2161 gccgccttca gaggctgcag ggcctcagcc tcaggatgca tttaatgtat cctggccaca 2221 gttgcagcca acggttcttg aaagctcggt aaggccctgc aacgcagagc ctgcttatgt 2281 ggatctattt atgggaactt cttaaaagga ccccagaata gctctttatc tttcacaaga 2341 gacacaaatt ctaattgagt taattatctg ggcctttcac tttggatgct ctgaaacatt 2401 tgttgatttt gtgtgaatgt ttatatcaaa atgtttgcca ggttgtatta gccattgaat 2461 agcaaaaaac tgatagttac ttgcttgttt tttaaaaatt acatattaaa aatgcccttg 2521 gcataaggca gcatggtgtg gcagttaaga gatgggctgt gcagcccatc ctgagctcca 2581 gtcctgagtt tgctacttac ttctgtggcc tctggaacct tatccaacct cttggtgctt 2641 cagtttcctc atctgtgaaa ttagaattta taataattgc acctacctcc caggggtaac 2701 taaatgaata aatataataa agtacttaca gtggttcctg acacagactc agcactccgt 2761 cagtgttgcc atgactattt ttattatcat tattaatgat tacttagatc aattatttag 2821 cagtggacta atggaagcta cagagcaggg aagggaagca gatctaggga ggaaggcagt 2881 tttgatttga ggaggtttgc acatgtagag aagcatactg gagaagcata tccagagggc 2941 gaaagatatc tctccattgt gcatctgcct cttttgacgt tggaagacac atgtcttact 3001 ccccaaaggg agcccagcac tgggagcctt cttgatgatc tcaaaaataa tagctattca 3061 agaaaatcac caagtgactg tgaaaccgtc agttcggaag gctggttaga acatgtggga 3121 gcaacatgaa tgttctacaa aagtttaaag cagagattgt ttcaaatggg tgtagtagat 3181 attactgaaa accaaaaaag agtgagattg tcagtgtaag aatgtgattt aatgtttgta 3241 gtgcttacaa ttttgtgtac caactggatg actaaaaaga gtaaaataac ttaattaata 3301 gctcatattt tatgtgtgaa aacatgttag tgaacatata taatcaaaat agatttcatt 3361 gctattgcat agtctctaat acatagaatg attttgcttt tctcttttat tatacttgct 3421 ttaaaatact tgaaatatat tttgcattaa atgcatttca agttaaatgt cttaaatgta 3481 tacattagat gtgtgtttta aaatgcataa aacacgttga aa // LOCUS HSU43746 10987 bp mRNA PRI 03-SEP-1996 DEFINITION Human breast cancer susceptibility (BRCA2) mRNA, complete cds. ACCESSION U43746 NID g1161383 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Teng,D.H., Bogden,R., Mitchell,J., Baumgard,M., Bell,R., Berry,S., Davis,T., Ha,P.C., Kehrer,R., Jammulapati,S., Chen,Q., Offit,K., Skolnick,M.H., Tavtigian,S.V., Jhanwar,S., Swedlund,B., Wong,A.K. and Kamb,A. TITLE Low incidence of BRCA2 mutations in breast carcinoma and other cancers JOURNAL Nature Genet. 13 (2), 241-244 (1996) MEDLINE 96225457 REFERENCE 2 (bases 1 to 10987) AUTHORS Tavtigian,S.V., Rommens,J.M., Couch,F.J., Neuhausen,S., Bell,R., Berry,S., Bogden,R., Chen,Q., Davis,T., Frye,C., Hattier,T., Jammulapati,S., Janecki,T., Jiang,P., Kehrer,R., Schroeder,M., Snyder,S., Stringfellow,M., Stroup,C., Swedlund,B., Teng,D., Thomas,A., Tran,T., Weaver-Feldhaus,J., Wong,A., Leblanc,J.-F., Belanger,C., Tranchant,M., Samson,C, Dumont,M., McArthur-Morrison,J., McSweeney,D., Peng,Y., Shizuya,H., Slepak,T., Simon,M.I., Labrie,F., Shattuck-Eidens,D., Skolnick,M., Goldgar,D., Weber,B.L., Simard,J. and Kamb,A. TITLE Direct Submission JOURNAL Submitted (21-DEC-1995) Sean Tavtigian, Myriad Genetics Inc., Department of Research, 390 Wakara Way, Salt Lake City, 84106, USA COMMENT Tavtigian,S.V., Bell,R., Berry,S., Bogden,R., Chen,Q., Davis,T., Frye,C., Hattier,T., Jammulapati,S., Janecki,T., Jiang,P., Kehrer,R., Schroeder,M., Snyder,S., Stringfellow,M., Stroup,C., Swedlund,B., Shattuck-Eidens,D., Skolnick,M., and Kamb,A. Myriad Genetics Inc. 390 Wakara Way Salt lake City, Utah 84106 Rommens,J.M., McArthur-Morrison,J., and McSweeney,D. Department of Genetics, Research Institute The Hospital for Sick Children Toronto, Canada Couch,F.J., Peng,Y. and Weber,B.L. Division of Hematology and Oncology Department of Medicine University of Pennsylvania Medical School Shizuya,H., Slepak,T., and Simon,M.I. Division of Biology 157-75 California Institute of Technology Dumont,M., Leblanc,J.-F., Belanger,C., Tranchant,M., Samson,C,. Labrie,F., and Simard,J. CHUL Research Center and Laval University Department of Molecular Endocrinology Quebec, Canada Neuhausen,S. and Goldgar,D. Department of Medical Informatics University of Utah Medical Center. FEATURES Location/Qualifiers source 1..10987 /organism="Homo sapiens" /db_xref="taxon:9606" /map="13q12-q13" /chromosome="13" gene 229..10485 /gene="BRCA2" CDS 229..10485 /gene="BRCA2" /codon_start=1 /product="BRCA2" /db_xref="PID:g1161384" /translation="MPIGSKERPTFFEIFKTRCNKADLGPISLNWFEELSSEAPPYNS EPAEESEHKNNNYEPNLFKTPQRKPSYNQLASTPIIFKEQGLTLPLYQSPVKELDKFK LDLGRNVPNSRHKSLRTVKTKMDQADDVSCPLLNSCLSESPVVLQCTHVTPQRDKSVV CGSLFHTPKFVKGRQTPKHISESLGAEVDPDMSWSSSLATPPTLSSTVLIVRNEEASE TVFPHDTTANVKSYFSNHDESLKKNDRFIASVTDSENTNQREAASHGFGKTSGNSFKV NSCKDHIGKSMPNVLEDEVYETVVDTSEEDSFSLCFSKCRTKNLQKVRTSKTRKKIFH EANADECEKSKNQVKEKYSFVSEVEPNDTDPLDSNVAHQKPFESGSDKISKEVVPSLA CEWSQLTLSGLNGAQMEKIPLLHISSCDQNISEKDLLDTENKRKKDFLTSENSLPRIS SLPKSEKPLNEETVVNKRDEEQHLESHTDCILAVKQAISGTSPVASSFQGIKKSIFRI RESPKETFNASFSGHMTDPNFKKETEASESGLEIHTVCSQKEDSLCPNLIDNGSWPAT TTQNSVALKNAGLISTLKKKTNKFIYAIHDETFYKGKKIPKDQKSELINCSAQFEANA FEAPLTFANADSGLLHSSVKRSCSQNDSEEPTLSLTSSFGTILRKCSRNETCSNNTVI SQDLDYKEAKCNKEKLQLFITPEADSLSCLQEGQCENDPKSKKVSDIKEEVLAAACHP VQHSKVEYSDTDFQSQKSLLYDHENASTLILTPTSKDVLSNLVMISRGKESYKMSDKL KGNNYESDVELTKNIPMEKNQDVCALNENYKNVELLPPEKYMRVASPSRKVQFNQNTN LRVIQKNQEETTSISKITVNPDSEELFSDNENNFVFQVANERNNLALGNTKELHETDL TCVNEPIFKNSTMVLYGDTGDKQATQVSIKKDLVYVLAEENKNSVKQHIKMTLGQDLK SDISLNIDKIPEKNNDYMNKWAGLLGPISNHSFGGSFRTASNKEIKLSEHNIKKSKMF FKDIEEQYPTSLACVEIVNTLALDNQKKLSKPQSINTVSAHLQSSVVVSDCKNSHITP QMLFSKQDFNSNHNLTPSQKAEITELSTILEESGSQFEFTQFRKPSYILQKSTFEVPE NQMTILKTTSEECRDADLHVIMNAPSIGQVDSSKQFEGTVEIKRKFAGLLKNDCNKSA SGYLTDENEVGFRGFYSAHGTKLNVSTEALQKAVKLFSDIENISEETSAEVHPISLSS SKCHDSVVSMFKIENHNDKTVSEKNNKCQLILQNNIEMTTGTFVEEITENYKRNTENE DNKYTAASRNSHNLEFDGSDSSKNDTVCIHKDETDLLFTDQHNICLKLSGQFMKEGNT QIKEDLSDLTFLEVAKAQEACHGNTSNKEQLTATKTEQNIKDFETSDTFFQTASGKNI SVAKESFNKIVNFFDQKPEELHNFSLNSELHSDIRKNKMDILSYEETDIVKHKILKES VPVGTGNQLVTFQGQPERDEKIKEPTLLGFHTASGKKVKIAKESLDKVKNLFDEKEQG TSEITSFSHQWAKTLKYREACKDLELACETIEITAAPKCKEMQNSLNNDKNLVSIETV VPPKLLSDNLCRQTENLKTSKSIFLKVKVHENVEKETAKSPATCYTNQSPYSVIENSA LAFYTSCSRKTSVSQTSLLEAKKWLREGIFDGQPERINTADYVGNYLYENNSNSTIAE NDKNHLSEKQDTYLSNSSMSNSYSYHSDEVYNDSGYLSKNKLDSGIEPVLKNVEDQKN TSFSKVISNVKDANAYPQTVNEDICVEELVTSSSPCKNKNAAIKLSISNSNNFEVGPP AFRIASGKIVCVSHETIKKVKDIFTDSFSKVIKENNENKSKICQTKIMAGCYEALDDS EDILHNSLDNDECSTHSHKVFADIQSEEILQHNQNMSGLEKVSKISPCDVSLETSDIC KCSIGKLHKSVSSANTCGIFSTASGKSVQVSDASLQNARQVFSEIEDSTKQVFSKVLF KSNEHSDQLTREENTAIRTPEHLISQKGFSYNVVNSSAFSGFSTASGKQVSILESSLH KVKGVLEEFDLIRTEHSLHYSPTSRQNVSKILPRVDKRNPEHCVNSEMEKTCSKEFKL SNNLNVEGGSSENNHSIKVSPYLSQFQQDKQQLVLGTKVSLVENIHVLGKEQASPKNV KMEIGKTETFSDVPVKTNIEVCSTYSKDSENYFETEAVEIAKAFMEDDELTDSKLPSH ATHSLFTCPENEEMVLSNSRIGKRRGEPLILVGEPSIKRNLLNEFDRIIENQEKSLKA SKSTPDGTIKDRRLFMHHVSLEPITCVPFRTTKERQEIQNPNFTAPGQEFLSKSHLYE HLTLEKSSSNLAVSGHPFYQVSATRNEKMRHLITTGRPTKVFVPPFKTKSHFHRVEQC VRNINLEENRQKQNIDGHGSDDSKNKINDNEIHQFNKNNSNQAAAVTFTKCEEEPLDL ITSLQNARDIQDMRIKKKQRQRVFPQPGSLYLAKTSTLPRISLKAAVGGQVPSACSHK QLYTYGVSKHCIKINSKNAESFQFHTEDYFGKESLWTGKGIQLADGGWLIPSNDGKAG KEEFYRALCDTPGVDPKLISRIWVYNHYRWIIWKLAAMECAFPKEFANRCLSPERVLL QLKYRYDTEIDRSRRSAIKKIMERDDTAAKTLVLCVSDIISLSANISETSSNKTSSAD TQKVAIIELTDGWYAVKAQLDPPLLAVLKNGRLTVGQKIILHGAELVGSPDACTPLEA PESLMLKISANSTRPARWYTKLGFFPDPRPFPLPLSSLFSDGGNVGCVDVIIQRAYPI QWMEKTSSGLYIFRNEREEEKEAAKYVEAQQKRLEALFTKIQEEFEEHEENTTKPYLP SRALTRQQVRALQDGAELYEAVKNAADPAYLEGYFSEEQLRALNNHRQMLNDKKQAQI QLEIRKAMESAEQKEQGLSRDVTTVWKLRIVSYSKKEKDSVILSIWRPSSDLYSLLTE GKRYRIYHLATSKSKSKSERANIQLAATKKTQYQQLPVSDEILFQIYQPREPLHFSKF LDPDFQPSCSEVDLIGFVVSVVKKTGLAPFVYLSDECYNLLAIKFWIDLNEDIIKPHM LIAASNLQWRPESKSGLLTLFAGDFSVFSASPKEGHFQETFNKMKNTVENIDILCNEA ENKLMHILHANDPKWSTPTKDCTSGPYTAQIIPGTGNKLLMSSPNCEIYYQSPLSLCM AKRKSVSTPVSAQMTSKSCKGEKEIDDQKNCKKRRALDFLSRLPLPPPVSPICTFVSP AAQKAFQPPRSCGTKYETPIKKKELNSPQMTPFKKFNEISLLESNSIADEELALINTQ ALLSGSTGEKQFISVSESTRTAPTSSEDYLRLKRRCTTSLIKEQESSQASTEECEKNK QDTITTKKYI" BASE COUNT 3984 a 1931 c 2053 g 3019 t ORIGIN 1 ggtggcgcga gcttctgaaa ctaggcggca gaggcggagc cgctgtggca ctgctgcgcc 61 tctgctgcgc ctcgggtgtc ttttgcggcg gtgggtcgcc gccgggagaa gcgtgagggg 121 acagatttgt gaccggcgcg gtttttgtca gcttactccg gccaaaaaag aactgcacct 181 ctggagcgga cttatttacc aagcattgga ggaatatcgt aggtaaaaat gcctattgga 241 tccaaagaga ggccaacatt ttttgaaatt tttaagacac gctgcaacaa agcagattta 301 ggaccaataa gtcttaattg gtttgaagaa ctttcttcag aagctccacc ctataattct 361 gaacctgcag aagaatctga acataaaaac aacaattacg aaccaaacct atttaaaact 421 ccacaaagga aaccatctta taatcagctg gcttcaactc caataatatt caaagagcaa 481 gggctgactc tgccgctgta ccaatctcct gtaaaagaat tagataaatt caaattagac 541 ttaggaagga atgttcccaa tagtagacat aaaagtcttc gcacagtgaa aactaaaatg 601 gatcaagcag atgatgtttc ctgtccactt ctaaattctt gtcttagtga aagtcctgtt 661 gttctacaat gtacacatgt aacaccacaa agagataagt cagtggtatg tgggagtttg 721 tttcatacac caaagtttgt gaagggtcgt cagacaccaa aacatatttc tgaaagtcta 781 ggagctgagg tggatcctga tatgtcttgg tcaagttctt tagctacacc acccaccctt 841 agttctactg tgctcatagt cagaaatgaa gaagcatctg aaactgtatt tcctcatgat 901 actactgcta atgtgaaaag ctatttttcc aatcatgatg aaagtctgaa gaaaaatgat 961 agatttatcg cttctgtgac agacagtgaa aacacaaatc aaagagaagc tgcaagtcat 1021 ggatttggaa aaacatcagg gaattcattt aaagtaaata gctgcaaaga ccacattgga 1081 aagtcaatgc caaatgtcct agaagatgaa gtatatgaaa cagttgtaga tacctctgaa 1141 gaagatagtt tttcattatg tttttctaaa tgtagaacaa aaaatctaca aaaagtaaga 1201 actagcaaga ctaggaaaaa aattttccat gaagcaaacg ctgatgaatg tgaaaaatct 1261 aaaaaccaag tgaaagaaaa atactcattt gtatctgaag tggaaccaaa tgatactgat 1321 ccattagatt caaatgtagc acatcagaag ccctttgaga gtggaagtga caaaatctcc 1381 aaggaagttg taccgtcttt ggcctgtgaa tggtctcaac taaccctttc aggtctaaat 1441 ggagcccaga tggagaaaat acccctattg catatttctt catgtgacca aaatatttca 1501 gaaaaagacc tattagacac agagaacaaa agaaagaaag attttcttac ttcagagaat 1561 tctttgccac gtatttctag cctaccaaaa tcagagaagc cattaaatga ggaaacagtg 1621 gtaaataaga gagatgaaga gcagcatctt gaatctcata cagactgcat tcttgcagta 1681 aagcaggcaa tatctggaac ttctccagtg gcttcttcat ttcagggtat caaaaagtct 1741 atattcagaa taagagaatc acctaaagag actttcaatg caagtttttc aggtcatatg 1801 actgatccaa actttaaaaa agaaactgaa gcctctgaaa gtggactgga aatacatact 1861 gtttgctcac agaaggagga ctccttatgt ccaaatttaa ttgataatgg aagctggcca 1921 gccaccacca cacagaattc tgtagctttg aagaatgcag gtttaatatc cactttgaaa 1981 aagaaaacaa ataagtttat ttatgctata catgatgaaa cattttataa aggaaaaaaa 2041 ataccgaaag accaaaaatc agaactaatt aactgttcag cccagtttga agcaaatgct 2101 tttgaagcac cacttacatt tgcaaatgct gattcaggtt tattgcattc ttctgtgaaa 2161 agaagctgtt cacagaatga ttctgaagaa ccaactttgt ccttaactag ctcttttggg 2221 acaattctga ggaaatgttc tagaaatgaa acatgttcta ataatacagt aatctctcag 2281 gatcttgatt ataaagaagc aaaatgtaat aaggaaaaac tacagttatt tattacccca 2341 gaagctgatt ctctgtcatg cctgcaggaa ggacagtgtg aaaatgatcc aaaaagcaaa 2401 aaagtttcag atataaaaga agaggtcttg gctgcagcat gtcacccagt acaacattca 2461 aaagtggaat acagtgatac tgactttcaa tcccagaaaa gtcttttata tgatcatgaa 2521 aatgccagca ctcttatttt aactcctact tccaaggatg ttctgtcaaa cctagtcatg 2581 atttctagag gcaaagaatc atacaaaatg tcagacaagc tcaaaggtaa caattatgaa 2641 tctgatgttg aattaaccaa aaatattccc atggaaaaga atcaagatgt atgtgcttta 2701 aatgaaaatt ataaaaacgt tgagctgttg ccacctgaaa aatacatgag agtagcatca 2761 ccttcaagaa aggtacaatt caaccaaaac acaaatctaa gagtaatcca aaaaaatcaa 2821 gaagaaacta cttcaatttc aaaaataact gtcaatccag actctgaaga acttttctca 2881 gacaatgaga ataattttgt cttccaagta gctaatgaaa ggaataatct tgctttagga 2941 aatactaagg aacttcatga aacagacttg acttgtgtaa acgaacccat tttcaagaac 3001 tctaccatgg ttttatatgg agacacaggt gataaacaag caacccaagt gtcaattaaa 3061 aaagatttgg tttatgttct tgcagaggag aacaaaaata gtgtaaagca gcatataaaa 3121 atgactctag gtcaagattt aaaatcggac atctccttga atatagataa aataccagaa 3181 aaaaataatg attacatgaa caaatgggca ggactcttag gtccaatttc aaatcacagt 3241 tttggaggta gcttcagaac agcttcaaat aaggaaatca agctctctga acataacatt 3301 aagaagagca aaatgttctt caaagatatt gaagaacaat atcctactag tttagcttgt 3361 gttgaaattg taaatacctt ggcattagat aatcaaaaga aactgagcaa gcctcagtca 3421 attaatactg tatctgcaca tttacagagt agtgtagttg tttctgattg taaaaatagt 3481 catataaccc ctcagatgtt attttccaag caggatttta attcaaacca taatttaaca 3541 cctagccaaa aggcagaaat tacagaactt tctactatat tagaagaatc aggaagtcag 3601 tttgaattta ctcagtttag aaaaccaagc tacatattgc agaagagtac atttgaagtg 3661 cctgaaaacc agatgactat cttaaagacc acttctgagg aatgcagaga tgctgatctt 3721 catgtcataa tgaatgcccc atcgattggt caggtagaca gcagcaagca atttgaaggt 3781 acagttgaaa ttaaacggaa gtttgctggc ctgttgaaaa atgactgtaa caaaagtgct 3841 tctggttatt taacagatga aaatgaagtg gggtttaggg gcttttattc tgctcatggc 3901 acaaaactga atgtttctac tgaagctctg caaaaagctg tgaaactgtt tagtgatatt 3961 gagaatatta gtgaggaaac ttctgcagag gtacatccaa taagtttatc ttcaagtaaa 4021 tgtcatgatt ctgttgtttc aatgtttaag atagaaaatc ataatgataa aactgtaagt 4081 gaaaaaaata ataaatgcca actgatatta caaaataata ttgaaatgac tactggcact 4141 tttgttgaag aaattactga aaattacaag agaaatactg aaaatgaaga taacaaatat 4201 actgctgcca gtagaaattc tcataactta gaatttgatg gcagtgattc aagtaaaaat 4261 gatactgttt gtattcataa agatgaaacg gacttgctat ttactgatca gcacaacata 4321 tgtcttaaat tatctggcca gtttatgaag gagggaaaca ctcagattaa agaagatttg 4381 tcagatttaa cttttttgga agttgcgaaa gctcaagaag catgtcatgg taatacttca 4441 aataaagaac agttaactgc tactaaaacg gagcaaaata taaaagattt tgagacttct 4501 gatacatttt ttcagactgc aagtgggaaa aatattagtg tcgccaaaga gtcatttaat 4561 aaaattgtaa atttctttga tcagaaacca gaagaattgc ataacttttc cttaaattct 4621 gaattacatt ctgacataag aaagaacaaa atggacattc taagttatga ggaaacagac 4681 atagttaaac acaaaatact gaaagaaagt gtcccagttg gtactggaaa tcaactagtg 4741 accttccagg gacaacccga acgtgatgaa aagatcaaag aacctactct gttgggtttt 4801 catacagcta gcgggaaaaa agttaaaatt gcaaaggaat ctttggacaa agtgaaaaac 4861 ctttttgatg aaaaagagca aggtactagt gaaatcacca gttttagcca tcaatgggca 4921 aagaccctaa agtacagaga ggcctgtaaa gaccttgaat tagcatgtga gaccattgag 4981 atcacagctg ccccaaagtg taaagaaatg cagaattctc tcaataatga taaaaacctt 5041 gtttctattg agactgtggt gccacctaag ctcttaagtg ataatttatg tagacaaact 5101 gaaaatctca aaacatcaaa aagtatcttt ttgaaagtta aagtacatga aaatgtagaa 5161 aaagaaacag caaaaagtcc tgcaacttgt tacacaaatc agtcccctta ttcagtcatt 5221 gaaaattcag ccttagcttt ttacacaagt tgtagtagaa aaacttctgt gagtcagact 5281 tcattacttg aagcaaaaaa atggcttaga gaaggaatat ttgatggtca accagaaaga 5341 ataaatactg cagattatgt aggaaattat ttgtatgaaa ataattcaaa cagtactata 5401 gctgaaaatg acaaaaatca tctctccgaa aaacaagata cttatttaag taacagtagc 5461 atgtctaaca gctattccta ccattctgat gaggtatata atgattcagg atatctctca 5521 aaaaataaac ttgattctgg tattgagcca gtattgaaga atgttgaaga tcaaaaaaac 5581 actagttttt ccaaagtaat atccaatgta aaagatgcaa atgcataccc acaaactgta 5641 aatgaagata tttgcgttga ggaacttgtg actagctctt caccctgcaa aaataaaaat 5701 gcagccatta aattgtccat atctaatagt aataattttg aggtagggcc acctgcattt 5761 aggatagcca gtggtaaaat cgtttgtgtt tcacatgaaa caattaaaaa agtgaaagac 5821 atatttacag acagtttcag taaagtaatt aaggaaaaca acgagaataa atcaaaaatt 5881 tgccaaacga aaattatggc aggttgttac gaggcattgg atgattcaga ggatattctt 5941 cataactctc tagataatga tgaatgtagc acgcattcac ataaggtttt tgctgacatt 6001 cagagtgaag aaattttaca acataaccaa aatatgtctg gattggagaa agtttctaaa 6061 atatcacctt gtgatgttag tttggaaact tcagatatat gtaaatgtag tatagggaag 6121 cttcataagt cagtctcatc tgcaaatact tgtgggattt ttagcacagc aagtggaaaa 6181 tctgtccagg tatcagatgc ttcattacaa aacgcaagac aagtgttttc tgaaatagaa 6241 gatagtacca agcaagtctt ttccaaagta ttgtttaaaa gtaacgaaca ttcagaccag 6301 ctcacaagag aagaaaatac tgctatacgt actccagaac atttaatatc ccaaaaaggc 6361 ttttcatata atgtggtaaa ttcatctgct ttctctggat ttagtacagc aagtggaaag 6421 caagtttcca ttttagaaag ttccttacac aaagttaagg gagtgttaga ggaatttgat 6481 ttaatcagaa ctgagcatag tcttcactat tcacctacgt ctagacaaaa tgtatcaaaa 6541 atacttcctc gtgttgataa gagaaaccca gagcactgtg taaactcaga aatggaaaaa 6601 acctgcagta aagaatttaa attatcaaat aacttaaatg ttgaaggtgg ttcttcagaa 6661 aataatcact ctattaaagt ttctccatat ctctctcaat ttcaacaaga caaacaacag 6721 ttggtattag gaaccaaagt ctcacttgtt gagaacattc atgttttggg aaaagaacag 6781 gcttcaccta aaaacgtaaa aatggaaatt ggtaaaactg aaactttttc tgatgttcct 6841 gtgaaaacaa atatagaagt ttgttctact tactccaaag attcagaaaa ctactttgaa 6901 acagaagcag tagaaattgc taaagctttt atggaagatg atgaactgac agattctaaa 6961 ctgccaagtc atgccacaca ttctcttttt acatgtcccg aaaatgagga aatggttttg 7021 tcaaattcaa gaattggaaa aagaagagga gagcccctta tcttagtggg agaaccctca 7081 atcaaaagaa acttattaaa tgaatttgac aggataatag aaaatcaaga aaaatcctta 7141 aaggcttcaa aaagcactcc agatggcaca ataaaagatc gaagattgtt tatgcatcat 7201 gtttctttag agccgattac ctgtgtaccc tttcgcacaa ctaaggaacg tcaagagata 7261 cagaatccaa attttaccgc acctggtcaa gaatttctgt ctaaatctca tttgtatgaa 7321 catctgactt tggaaaaatc ttcaagcaat ttagcagttt caggacatcc attttatcaa 7381 gtttctgcta caagaaatga aaaaatgaga cacttgatta ctacaggcag accaaccaaa 7441 gtctttgttc caccttttaa aactaaatca cattttcaca gagttgaaca gtgtgttagg 7501 aatattaact tggaggaaaa cagacaaaag caaaacattg atggacatgg ctctgatgat 7561 agtaaaaata agattaatga caatgagatt catcagttta acaaaaacaa ctccaatcaa 7621 gcagcagctg taactttcac aaagtgtgaa gaagaacctt tagatttaat tacaagtctt 7681 cagaatgcca gagatataca ggatatgcga attaagaaga aacaaaggca acgcgtcttt 7741 ccacagccag gcagtctgta tcttgcaaaa acatccactc tgcctcgaat ctctctgaaa 7801 gcagcagtag gaggccaagt tccctctgcg tgttctcata aacagctgta tacgtatggc 7861 gtttctaaac attgcataaa aattaacagc aaaaatgcag agtcttttca gtttcacact 7921 gaagattatt ttggtaagga aagtttatgg actggaaaag gaatacagtt ggctgatggt 7981 ggatggctca taccctccaa tgatggaaag gctggaaaag aagaatttta tagggctctg 8041 tgtgacactc caggtgtgga tccaaagctt atttctagaa tttgggttta taatcactat 8101 agatggatca tatggaaact ggcagctatg gaatgtgcct ttcctaagga atttgctaat 8161 agatgcctaa gcccagaaag ggtgcttctt caactaaaat acagatatga tacggaaatt 8221 gatagaagca gaagatcggc tataaaaaag ataatggaaa gggatgacac agctgcaaaa 8281 acacttgttc tctgtgtttc tgacataatt tcattgagcg caaatatatc tgaaacttct 8341 agcaataaaa ctagtagtgc agatacccaa aaagtggcca ttattgaact tacagatggg 8401 tggtatgctg ttaaggccca gttagatcct cccctcttag ctgtcttaaa gaatggcaga 8461 ctgacagttg gtcagaagat tattcttcat ggagcagaac tggtgggctc tcctgatgcc 8521 tgtacacctc ttgaagcccc agaatctctt atgttaaaga tttctgctaa cagtactcgg 8581 cctgctcgct ggtataccaa acttggattc tttcctgacc ctagaccttt tcctctgccc 8641 ttatcatcgc ttttcagtga tggaggaaat gttggttgtg ttgatgtaat tattcaaaga 8701 gcatacccta tacagtggat ggagaagaca tcatctggat tatacatatt tcgcaatgaa 8761 agagaggaag aaaaggaagc agcaaaatat gtggaggccc aacaaaagag actagaagcc 8821 ttattcacta aaattcagga ggaatttgaa gaacatgaag aaaacacaac aaaaccatat 8881 ttaccatcac gtgcactaac aagacagcaa gttcgtgctt tgcaagatgg tgcagagctt 8941 tatgaagcag tgaagaatgc agcagaccca gcttaccttg agggttattt cagtgaagag 9001 cagttaagag ccttgaataa tcacaggcaa atgttgaatg ataagaaaca agctcagatc 9061 cagttggaaa ttaggaaggc catggaatct gctgaacaaa aggaacaagg tttatcaagg 9121 gatgtcacaa ccgtgtggaa gttgcgtatt gtaagctatt caaaaaaaga aaaagattca 9181 gttatactga gtatttggcg tccatcatca gatttatatt ctctgttaac agaaggaaag 9241 agatacagaa tttatcatct tgcaacttca aaatctaaaa gtaaatctga aagagctaac 9301 atacagttag cagcgacaaa aaaaactcag tatcaacaac taccggtttc agatgaaatt 9361 ttatttcaga tttaccagcc acgggagccc cttcacttca gcaaattttt agatccagac 9421 tttcagccat cttgttctga ggtggaccta ataggatttg tcgtttctgt tgtgaaaaaa 9481 acaggacttg cccctttcgt ctatttgtca gacgaatgtt acaatttact ggcaataaag 9541 ttttggatag accttaatga ggacattatt aagcctcata tgttaattgc tgcaagcaac 9601 ctccagtggc gaccagaatc caaatcaggc cttcttactt tatttgctgg agatttttct 9661 gtgttttctg ctagtccaaa agagggccac tttcaagaga cattcaacaa aatgaaaaat 9721 actgttgaga atattgacat actttgcaat gaagcagaaa acaagcttat gcatatactg 9781 catgcaaatg atcccaagtg gtccacccca actaaagact gtacttcagg gccgtacact 9841 gctcaaatca ttcctggtac aggaaacaag cttctgatgt cttctcctaa ttgtgagata 9901 tattatcaaa gtcctttatc actttgtatg gccaaaagga agtctgtttc cacacctgtc 9961 tcagcccaga tgacttcaaa gtcttgtaaa ggggagaaag agattgatga ccaaaagaac 10021 tgcaaaaaga gaagagcctt ggatttcttg agtagactgc ctttacctcc acctgttagt 10081 cccatttgta catttgtttc tccggctgca cagaaggcat ttcagccacc aaggagttgt 10141 ggcaccaaat acgaaacacc cataaagaaa aaagaactga attctcctca gatgactcca 10201 tttaaaaaat tcaatgaaat ttctcttttg gaaagtaatt caatagctga cgaagaactt 10261 gcattgataa atacccaagc tcttttgtct ggttcaacag gagaaaaaca atttatatct 10321 gtcagtgaat ccactaggac tgctcccacc agttcagaag attatctcag actgaaacga 10381 cgttgtacta catctctgat caaagaacag gagagttccc aggccagtac ggaagaatgt 10441 gagaaaaata agcaggacac aattacaact aaaaaatata tctaagcatt tgcaaaggcg 10501 acaataaatt attgacgctt aacctttcca gtttataaga ctggaatata atttcaaacc 10561 acacattagt acttatgttg cacaatgaga aaagaaatta gtttcaaatt tacctcagcg 10621 tttgtgtatc gggcaaaaat cgttttgccc gattccgtat tggtatactt ttgcttcagt 10681 tgcatatctt aaaactaaat gtaatttatt aactaatcaa gaaaaacatc tttggctgag 10741 ctcggtggct catgcctgta atcccaacac tttgagaagc tgaggtggga ggagtgcttg 10801 aggccaggag ttcaagacca gcctgggcaa catagggaga cccccatctt tacgaagaaa 10861 aaaaaaaagg ggaaaagaaa atcttttaaa tctttggatt tgatcactac aagtattatt 10921 ttacaatcaa caaaatggtc atccaaactc aaacttgaga aaatatcttg ctttcaaatt 10981 gacacta // LOCUS HSU43747 1505 bp mRNA PRI 27-APR-1996 DEFINITION Human frataxin (FRDA) mRNA, complete cds. ACCESSION U43747 NID g1237438 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 429) AUTHORS Campuzano,V., Montermini,L., Molto',M.D., Pianese,L., Cossee',M., Cavalcanti,F., Monros,E., Rodius,F., Duclos,F., Monticelli,A., Zara,F., Canizares,J., Koutnikova,H., Bidichandani,S., Gellera,C., Brice,A., Trouillas,P., De Michele,G., Filla,A., De Frutos,R., Palau,F., Patel,P.I., Di Donato,S., Mandel,J.-L., Cocozza,S., Koenig,M. and Pandolfo,M. TITLE Friedreich's ataxia: autosomal recessive disease caused by an intronic GAA triplet repeat expansion JOURNAL Science 271 (5254), 1423-1427 (1996) MEDLINE 96173952 REFERENCE 2 (bases 1 to 1505) AUTHORS Koenig,M. and Pandolfo,M. TITLE Direct Submission JOURNAL Submitted (21-DEC-1995) Massimo Pandolfo, Neurology-NB424, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..1505 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="9" /map="9q13" 5'UTR 305..525 /gene="FRDA" gene 305..1007 /gene="FRDA" exon 305..690 /gene="FRDA" /number=1 CDS 526..1158 /note="Friedreich's ataxia protein" /codon_start=1 /product="frataxin" /db_xref="PID:g1237439" /translation="MWTLGRRAVAGLLASPSPAQAQTLTRVPRPAELAPLCGRRGLRT DIDATCTPRRASSNQRGLNQIWNVKKQSVYLMNLRKSGTLGHPGSLDETTYERLAEET LDSLAEFFEDLADKPYTFEDYDVSFGSGVLTVKLGGDLGTYVINKQTPNKQIWLSSPS SGPKRYDWTGKNWVFSHDGVSLHELLAAELTKALKTKLDLSWLAYSGKDA" exon 691..788 /gene="FRDA" /number=2 exon 789..909 /gene="FRDA" /number=3 exon 910..1007 /gene="FRDA" /number=4 exon 1008..1505 /number=5 3'UTR 1159..1505 polyA_signal 1468..1472 BASE COUNT 373 a 400 c 358 g 370 t 4 others ORIGIN 1 tttacagggc ataactcatt ttatccttac cacaatccta tgaagtagga acttttataa 61 aacgcatttt atatncaagg gcacagagag gntaattaac ttgccctctg gtcacacagc 121 taggaagtgg gcagagtaca gatttacact aggcatccgt ctcctgnccc cacatancca 181 gctgctgtaa acccataccg gcggccaagc agcctcaatt tgtgcatgca cccacttccc 241 agcaagacag cagctcccaa gttcctcctg tttagaattt tagaagcggc gggccaccag 301 gctgcagtct cccttgggtc aggggtcctg gttgcactcc gtgctttgca caaagcaggc 361 tctccatttt tgttaaatgc acgaatagtg ctaagctggg aagttcttcc tgaggtctaa 421 cctctagctg ctcccccaca gaagagtgcc tgcggccagt ggccaccagg ggtcgccgca 481 gcacccagcg ctggagggcg gagcgggcgg cagacccgga gcagcatgtg gactctcggg 541 cgccgcgcag tagccggcct cctggcgtca cccagcccgg cccaggccca gaccctcacc 601 cgggtcccgc ggccggcaga gttggcccca ctctgcggcc gccgtggcct gcgcaccgac 661 atcgatgcga cctgcacgcc ccgccgcgca agttcgaacc aacgtggcct caaccagatt 721 tggaatgtca aaaagcagag tgtctatttg atgaatttga ggaaatctgg aactttgggc 781 cacccaggct ctctagatga gaccacctat gaaagactag cagaggaaac gctggactct 841 ttagcagagt tttttgaaga ccttgcagac aagccataca cgtttgagga ctatgatgtc 901 tcctttggga gtggtgtctt aactgtcaaa ctgggtggag atctaggaac ctatgtgatc 961 aacaagcaga cgccaaacaa gcaaatctgg ctatcttctc catccagtgg acctaagcgt 1021 tatgactgga ctgggaaaaa ctgggtgttc tcccacgacg gcgtgtccct ccatgagctg 1081 ctggccgcag agctcactaa agccttaaaa accaaactgg acttgtcttg gttggcctat 1141 tccggaaaag atgcttgatg cccagccccg ttttaaggac attaaaagct atcaggccaa 1201 gaccccagct tcattatgca gctgaggtgt gttttttgtt gttgttgttg tttatttttt 1261 ttattcctgc ttttgaggac acttgggcta tgtgtcacag ctctgtacaa acaatgtgtt 1321 gcctcctacc ttgcccccaa gttctgattt ttaatttcta tggaagattt tttggattgt 1381 cggatttcct ccctcacatg atacccctta tcttttataa tgtcttatgc ctatacctga 1441 atataacaac ctttaaaaaa gcaaaataat aagaaggaaa aattccagga gggaaaaaaa 1501 aaaaa // LOCUS HSU43843 1421 bp mRNA PRI 12-SEP-1996 DEFINITION Human h-neuro-d4 protein mRNA, complete cds. ACCESSION U43843 NID g1532120 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1421) AUTHORS Chestkov,A.V., Baka,I.D., Kost,M.V., Georgiev,G.P. and Buchman,V.L. TITLE The d4 gene family in the human genome JOURNAL Genomics 36 (1), 174-177 (1996) MEDLINE 96411662 REFERENCE 2 (bases 1 to 1421) AUTHORS Chestov,A.V., Baka,I.D., Kost,M.V., Georgiev,G.P. and Buchman,V.L. TITLE Direct Submission JOURNAL Submitted (22-DEC-1995) Vladimir L. Buchman, Biol. Med. Sciences, University of St.Andrews, Bute Medical Buildings, St.Andrews, Fife, KY16 9AJ, Scotland FEATURES Location/Qualifiers source 1..1421 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="h36, h34, h22, h32, RACE-h1" /clone_lib="Stratagene library (catalog No. 935206)" /chromosome="19" /sex="f" /tissue_type="brainstem" /dev_stage="postnatal (2 years)" CDS 7..1068 /codon_start=1 /product="h-neuro-d4 protein" /db_xref="PID:g1532121" /translation="MATVIPSPLSLGEDFYREAIEHCRSYNARLCAERSLRLPFLDSQ TGVAQNNCYIWMEKTHRGPGLAPGQIYTYPARCWRKKRRLNILEDPRLRPCEYKIDCE APLKKEGGLPEGPVLEALLCAETGEKKIELKEEETIMDCQKQQLLEFPHDLEVEDLED DIPRRKNRAKGKAYGIGGLRKRQDTASLEDRDKPYVCDKFYKELAWVPEAQRKHTAKK APDGTVIPNGYCDFCLGGSKKTGCPEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRW QCIECKSCSLCGTSENDGASWAGLTPQDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWS CHLCLRHLKEKASAYITLT" variation 886..915 /note="these nucleotides and corresponding encoded amino acids are present only in rare variant of differential splicing, clone h34 only" /replace="" BASE COUNT 309 a 430 c 440 g 242 t ORIGIN 1 agcaagatgg ccacggtcat ccccagcccc ctgagcctag gcgaggactt ctaccgcgag 61 gccatcgagc actgccgcag ttacaacgcg cgcctgtgcg ccgagcgcag cctgcgactg 121 cccttcctcg actcgcagac cggcgtggcc cagaacaact gctacatctg gatggagaag 181 acccaccgcg ggccgggttt ggccccggga cagatttaca cgtaccccgc ccgctgttgg 241 aggaagaaac ggagactcaa catcctggag gaccccagac tcaggccctg cgagtacaag 301 atcgactgtg aagcacccct gaagaaggag ggtggcctcc cggaagggcc ggtcctcgag 361 gctctactgt gtgcagagac gggggagaag aagattgagc tgaaggagga ggagaccatt 421 atggactgtc agaaacagca gttgctggag tttccgcatg acctcgaggt ggaagacttg 481 gaggatgaca ttcccaggag gaagaacagg gccaaaggaa aggcatatgg catcgggggt 541 ctccggaaac gccaggacac cgcttccctg gaggaccgag acaagccgta tgtctgtgat 601 aagttttaca aagaattggc ctgggtccct gaggcacaaa ggaaacacac agccaagaag 661 gcgcccgacg gcactgtcat ccccaacggc tactgtgact tctgcctggg gggctccaag 721 aagacggggt gtcccgagga cctcatctcc tgtgcggact gtgggcgatc aggacacccc 781 tcgtgtttac aattcacggt gaacatgacg gcagccgtgc ggacctaccg ctggcagtgc 841 atcgagtgca aatcctgcag cctgtgcgga acctccgaga acgacggtgc cagctgggcg 901 ggtctcaccc cccaggacca gctgctgttt tgtgatgact gcgatcgggg ttaccacatg 961 tactgcctga gtccccccat ggcggagccc ccggaaggga gctggagctg tcacctctgt 1021 ctccggcacc tgaaggaaaa ggcttctgct tacatcaccc tcacctaggc cggctcggct 1081 cgccgcgact ctggggtggt gctcgcctac ctgcctctcc gagctcctca attctccccc 1141 accctgaaca tcccgcaggg ggagggggag agggggaagc cgagaggggg ctgggccacc 1201 ccctcccctc tgtgcaagtg gaatgtctgc cctgtgggtg ggtgggcccg gccagggcct 1261 ctccctccct ccctccctct ctgtcccttg gcaaatggac accaggggct tctcccctca 1321 aagccatacg cctctgggcg ggcatggggg gtggtgggtg ccagccaggg gcatggacag 1381 agcctttttc taaagaaaaa gacaaaaagt taaaaaaaaa a // LOCUS HSU43885 2467 bp mRNA PRI 22-FEB-1996 DEFINITION Human Grb2-associated binder-1 mRNA, complete cds. ACCESSION U43885 NID g1199617 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2467) AUTHORS Holgado-Madruga,M., Emlet,D.R., Moscatello,D.K., Godwin,A.K. and Wong,A.J. TITLE A Grb2-associated docking protein in EGF- and insulin-receptor signalling JOURNAL Nature 379 (6565), 560-564 (1996) MEDLINE 96170040 REFERENCE 2 (bases 1 to 2467) AUTHORS Holgado-Madruga,M., Emlet,D.R., Moscatello,D.K., Godwin,A.K. and Wong,A.J. TITLE Direct Submission JOURNAL Submitted (24-DEC-1995) Albert J. Wong, Jefferson Cancer Institute, Thomas Jefferson University, 233 S. 10th St., Philadelphia, PA 19107, USA FEATURES Location/Qualifiers source 1..2467 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 122..2206 /note="Grb2-associated binder-1; docking protein related to IRS-1; substrate of the EGF and insulin receptors" /codon_start=1 /product="Gab1" /db_xref="PID:g1199618" /translation="MSGGEVVCSGWLRKSPPEKKLKRYAWKRRWFVLRSGRLTGDPDV LEYYKNDHAKKPIRIIDLNLCQQVDAGLTFNKKEFENSYIFDINTIDRIFYLVADSEE EMNKWVRCICDICGFNPTEEDPVKPPGSSLQAPADLPLAINTAPPSTQADSSSATLPP PYQLINVPPHLETLGIQEDPQDYLLLINCQSKKPEPTRTHADSGKSTSSETDSNDNVP SHKNPASSQSKHGMNGFFQQQMIYDSPPSRAPSASVDSSLYNLPRSYSHDVLPKVSPS STEADGELYVFNTPSGTSSVETQMRHVSISYDIPPTPGNTYQIPRTFPEGTLGQTSKL DTIPDIPPPRPPKPHPAHDRSPVETCSIPRTASDTDSSYCIPTAGMSPSRSNTISTVD LNKLRKDASSQDCYDIPRAFPSDRSSSLEGFHNHFKVKNVLTVGSVSSEELDENYVPM NPNSPPRQHSSSFTEPIQEANYVPMTPGTFDFSSFGMQVPPPAHMGFRSSPKTPPRRP VPVADCEPPPVDRNLKPDRKVKPAPLEIKPLPEWEELQAPVRSPITRSFARDSSRFPM SPRPDSVHSTTSSSDSHDSEENYVPMNPNLSSEDPNLFGSNSLDGGSSPMIKPKGDKQ VEYLDLDLDSGKSTPPRKQKSSGSGSSVADERVDYVVVDQQKTLALKSTREAWTDGRQ STESETPAKSVK" BASE COUNT 738 a 606 c 517 g 606 t ORIGIN 1 acctctggtg gtggctggct actcggatac gaattcggca cgagggcagg cgtcggctag 61 tgtcgggagt cgcgcccgcc gcccctcagc tgcccggccc ggagcccgag acgcgcgcac 121 catgagcggt ggtgaagtgg tctgctccgg atggctccgc aagtcccccc cggagaaaaa 181 gttgaagcgt tatgcatgga agaggagatg gttcgtgtta cgcagtggcc gtttaactgg 241 agatccagat gttttggaat attacaaaaa tgatcatgcc aagaagccta ttcgtattat 301 tgatttaaat ttatgtcaac aagtagatgc tggattgaca tttaacaaaa aagagtttga 361 aaacagctac atttttgata tcaacactat tgaccggatt ttctacttgg tagcagacag 421 cgaggaggag atgaataagt gggttcgttg tatttgtgac atctgtgggt ttaatccaac 481 agaagaagat cctgtgaagc cacctggcag ctctttacaa gcaccagctg atttaccttt 541 agctataaat acagcaccac catccaccca ggcagattca tcctctgcta ctctacctcc 601 tccatatcag ctaatcaatg ttccaccaca cctggaaact cttggcattc aggaggatcc 661 tcaagactac ctgttgctca tcaactgtca aagcaagaag cccgaaccca ccagaacgca 721 tgctgattct ggaaaatcca cctcttctga aacagactcc aatgataacg tcccttctca 781 taaaaatcct gcttcctccc agagcaaaca tggaatgaat ggcttttttc agcagcaaat 841 gatatacgac tctccacctt cacgtgcccc atctgcttca gttgactcca gcctttataa 901 cctgcccagg agttattccc atgatgtttt accaaaggtg tctccatcaa gtactgaagc 961 agatggagaa ctctatgttt ttaatacccc atctgggaca tcgagtgtag agactcaaat 1021 gaggcatgta tctattagtt atgacattcc tccaacacct ggtaatactt atcagattcc 1081 acgaacattt ccagaaggaa ccttgggaca gacatcaaag ctagacacta ttccagatat 1141 tcctccacct cggccaccga aaccacatcc agctcatgac cgatctcctg tggaaacgtg 1201 tagtatccca cgcaccgcct cagacactga cagtagttac tgtatcccta cagcagggat 1261 gtcgccttca cgtagtaata ccatttccac tgtggattta aacaaattgc gaaaagatgc 1321 tagttctcaa gactgctatg atattccacg agcatttcca agtgatagat ctagttcact 1381 tgaaggcttc cataaccact ttaaagtcaa aaatgtgttg acagtgggaa gtgtttcaag 1441 tgaagaactg gatgaaaatt acgtcccaat gaatcccaat tcaccaccac gacaacattc 1501 cagcagtttt acagaaccaa ttcaggaagc aaattatgtg ccaatgactc caggaacatt 1561 tgatttttcc tcatttggaa tgcaagttcc tcctcctgct catatgggct tcaggtccag 1621 cccaaaaacc cctcccagaa ggccagttcc tgttgcagac tgtgaaccac cccccgtgga 1681 taggaacctc aagccagaca gaaaagtcaa gccagcgcct ttagaaataa aacctttgcc 1741 agaatgggaa gaattacaag ccccagttag atctcccatc actaggagtt ttgctcgaga 1801 ctcttccagg tttcccatgt ccccccgacc agattcagtg catagcacaa cttcaagcag 1861 tgactcacac gacagtgaag agaattatgt tcccatgaac ccaaacctgt ccagtgaaga 1921 cccaaatctc tttggcagta acagtcttga tggaggaagc agccctatga tcaagcccaa 1981 aggagacaaa caggtggaat acttagatct cgacttagat tctgggaaat ccacaccacc 2041 acgtaagcaa aagagcagtg gctcaggcag cagtgtagca gatgagagag tggattatgt 2101 tgttgttgac caacagaaga ccttggctct aaagagtacc cgggaagcct ggacagatgg 2161 gagacagtcc acagaatcag aaacgccagc gaagagtgtg aaatgaaaat attgccttgc 2221 catttctgaa caaaagaaaa ctgaattgta aagataaatc ccttttgaag aatgacttga 2281 cacttccact ctaggtagat cctcaaatga gtagagttga agtcaaagga cctttctgac 2341 ataatcaagc aatttagact taagtggtgc tttgtggtat ctgaacaatt cataacatgt 2401 aaataatgtg ggaaaatagt attgtttagc tcccagagaa acatttgttc cacagttaac 2461 acactcg // LOCUS HSU43899 2795 bp mRNA PRI 16-OCT-1996 DEFINITION Human signal transducing adaptor molecule STAM mRNA, complete cds. ACCESSION U43899 NID g1556458 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2795) AUTHORS Takeshita,T., Arita,T., Asao,H., Tanaka,N., Higuchi,M., Kuroda,H., Kaneko,K., Munakata,H., Endo,Y., Fujita,T. and Sugamura,K. TITLE Cloning of a novel signal-transducing adaptor molecule containing an SH3 domain and ITAM JOURNAL Biochem. Biophys. Res. Commun. 225 (3), 1035-1039 (1996) MEDLINE 96374438 REFERENCE 2 (bases 1 to 2795) AUTHORS Takeshita,T., Arita,T. and Sugamura,K. TITLE Direct Submission JOURNAL Submitted (26-DEC-1995) Toshikazu Takeshita, Microbiology, Tohoku Univ. School of Medicine, 2-1 Seiryo-machi Aoba-ku, Sendai, 980-77, Japan COMMENT STAM is tryosine-phosphorylated by various cytokines such as IL-2, IL-3, IL-4, IL-7, GM-CSF, EGF and PDGF, and contains an SH3 (Src homology 3) domain and the ITAM (immunoreceptor tyrosine-based activation motif). FEATURES Location/Qualifiers source 1..2795 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 43..1665 /function="signal transducing adaptor molecule" /codon_start=1 /product="STAM" /db_xref="PID:g1620217" /translation="MPLFATNPFDQDVEKATSEMNTAEDWGLILDICDKVGQSRTGPK DCLRSIMRRVNHKDPHVAMQALTLLGACVSNCGKIFHLEVCSRDFASEVSNVLNKGHP KVCEKLKALMVEWTDEFKNDPQLSLISAMIKNLKEQGVTFPAIGSQAAEQAKASPALV AKDPGTVANKKEEEDLAKAIELSLKEQRQQSTTLSTLYPSTSSLLTNHQHEGRKVRAI YDFEAAEDNELTFKAGEIITVLDDSDPNWWKGETHQGIGLFPSNFVTADLTAEPEMIK TEKKTVQFSDDVQVETIEPEPEPAFIDEDKMDQLLQMLQSTDPSDDQPDLPELLHLEA MCHQMGPLIDEKLEDIDRKHSELSELNVKVMEALSLYTKLMNEDPMYSMYAKLQNQPY YMQSSGVSGSQVYAGPPPSGAYLVAGNAQMSHLQSYSLPPEQLSSLSQAVVPPSANPA LPSQQTQAAYPNTMVSSVQGNTYPSQAPVYSPPPAATAAAATADVTLYQNAGPNMPQV PNYNLTSSTLPQPGGSQQPPQPQQPYSQKALL" BASE COUNT 849 a 575 c 554 g 817 t ORIGIN 1 gtcgagaggg agtccccggg gacacctcgg cacgcagcgg agatgcctct ttttgccacc 61 aatcccttcg atcaggatgt tgagaaagca accagcgaga tgaatactgc tgaggactgg 121 ggcctcattt tggatatctg tgataaagtt ggtcagtctc gcactggacc taaggattgt 181 cttcggtcta ttatgagaag agtgaaccac aaagatcctc acgttgctat gcaggctttg 241 actcttctag gagcatgtgt atcaaactgt ggcaaaattt ttcatttaga agtatgttca 301 agagattttg ctagtgaagt aagcaacgta ttaaataagg gtcatcctaa agtatgtgaa 361 aaattaaagg ctcttatggt tgaatggaca gatgaattta agaatgatcc acagcttagt 421 ctaatatcag caatgattaa gaaccttaag gaacaaggag ttacgttccc agctattggc 481 tctcaggctg cagaacaagc aaaagcaagc ccagctcttg tagccaagga tcctggtact 541 gtggctaaca aaaaagaaga agaagattta gcaaaagcca ttgagttgtc tctcaaggaa 601 caaaggcagc agtcaaccac cctttccact ttgtatccaa gcacatccag tctcttaact 661 aaccaccaac atgaaggccg aaaagttcgt gctatatatg actttgaagc tgctgaagac 721 aatgaactta cttttaaagc tggagaaatt attacagttc ttgatgacag tgatcctaac 781 tggtggaaag gtgaaaccca tcaaggcata gggttatttc cttctaattt tgtgactgca 841 gatctcactg ctgaaccaga aatgattaaa acagagaaga agacggtaca atttagtgat 901 gatgttcagg tagagacaat agaaccagag ccggaaccag cctttattga tgaagataaa 961 atggaccagt tgctacagat gctgcaaagt acagacccca gtgatgatca gccagaccta 1021 ccagagctgc ttcatcttga agcaatgtgt caccagatgg gacctctcat tgatgaaaag 1081 ctggaagata ttgatagaaa acattcagaa ctctcagaac ttaatgtgaa agtgatggag 1141 gccctttcct tatataccaa gttaatgaac gaagatccga tgtattccat gtatgcaaag 1201 ttacagaatc agccatatta tatgcagtca tctggtgttt ctggttctca ggtgtatgca 1261 gggcctcctc caagtggtgc ctacctggtt gcagggaacg cgcagatgag ccacctccag 1321 agctacagtc ttcccccgga gcagctgtct tctctcagcc aggcagtggt cccaccatcc 1381 gcaaacccag cccttcctag tcagcagact caggccgctt acccaaatac aatggtcagt 1441 tccgttcaag gaaacacata tcccagccag gcgccagtat atagtcctcc tcctgccgct 1501 actgctgctg ctgcaactgc cgatgtcact ctgtaccaga atgcaggacc taatatgccc 1561 caggtgccaa actataactt aacatcatca actctgcctc agcccggagg cagccaacag 1621 ccacctcagc cacagcaacc atattctcag aaggctctgc tataggaccc ggtgttcctc 1681 ttggtggcag atacctgcta aatgccactg acaatgttat gagattcatt actatcttaa 1741 gatgtgttta tcctcagctt ataggaatct ctccaggtca acaggttcaa atattcaaga 1801 aggtagaact ctcctcaatt tacactgact ttttagaggt tcttcccccc ccgcccctgc 1861 agaggaatga aactacttac aacatttaat tcctttcata atatgaaaga attgatacaa 1921 ggctatttgt ctcgtaaacc tggtctgcag aaagtcaaac ttacaaaaac tgttgtgaca 1981 aatgttatgt acatatattg atatgtaact gcattagtgg ccattttgaa tcacagtggt 2041 gatcgtgtga atatatttaa cactgtgtta aattaattta cgttgctatt ttattttaat 2101 cataaacaac taccatgttt cttaatgttt tgtgtaaatt taaggtaatt atactatcct 2161 tttaaacttc aagaaaacaa aattgttagc gtatttacat gaaggcgcat tatgttgtcg 2221 tgtgtttcag tttcacatta aactgaacct tttactaatt gtgagctaaa gagatatata 2281 tatatatgtg tgtgtatata tatatatcta catgtctttc tgtagcctct gcatactact 2341 ggctgtcatc acaccagcgt acagtagcta aatttttggt gcaattatta gcaaatgata 2401 atgttccctt ttgaactttt acattttggc atgacatttc agagtattgt gggaccatga 2461 gacaaaatta agtacgatca cattctttat ttctcatttt aaagaaatga tgttggttta 2521 ccttttccta gttgaagata gtaattaggt ttctaagctg tatactgtgt ttattggtgg 2581 cagtgacacc caaagataga ggcaatggat agaaattttt aaactggaaa gaaaacctga 2641 attacactac attttcgaag tctcttgtaa ttatttggga tatcaacaaa atttgattcg 2701 tctgtctaat cccttgctag tattttaaat atgtctttaa cacattgtat cctttaattc 2761 ttcattaaaa tggaaataag tagatgtttc aaagt // LOCUS HSU44060 2924 bp mRNA PRI 28-AUG-1996 DEFINITION Human homeodomain protein (Prox 1) mRNA, complete cds. ACCESSION U44060 NID g1511629 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2924) AUTHORS Zinovieva,R.D., Duncan,M.K., Johnson,T.R., Torres,R., Polymeropoulos,M.H. and Tomarev,S.I. TITLE Structure and chromosomal localization of the human homeobox gene Prox 1 JOURNAL Genomics 35 (3), 517-522 (1996) MEDLINE 97001153 REFERENCE 2 (bases 1 to 2924) AUTHORS Tomarev,S.I, Zinovieva,R.D., Duncan,M.K, Johnson,T., Torres,R. and Polymeropoulos,M. TITLE Direct Submission JOURNAL Submitted (29-DEC-1995) Stanislav I. Tomarev, LMDB, NEI/NIH, Bldg. 6, Room 203, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..2924 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1q32.2-q32.3" gene 607..2817 /gene="Prox 1" CDS 607..2817 /gene="Prox 1" /note="homeobox gene" /codon_start=1 /product="homeodomain protein" /db_xref="PID:g1511630" /translation="MPDHDSTALLSRQTKRRRVDIGVKRTVGTASAFFAKARATFFSA MNPQGSEQDVEYSVVQHADGEKSNVLRKLLKRANSYEDAMMPFPGATIISQLLKNNMN KNGGTEPSFQASGLSSTGSEVHQEDICSNSSRDSPPECLSPFGRPTMSQFDMDRLCDE HLRAKRARVENIIRGMSHSPSVALRGNENEREMAPQSVSPRESYRENKRKQKLPQQQQ QSFQQLVSARKEQKREERRQLKQQLEDMQKQLLHVQEKFYQIYDSTDSENDEDGNLSE DSMRSEILDARAQDSVGRSDNEMCELDPGQFIDRARALIREQEMAENKPKREGNNKER DHGPNSLQPEGKHLAETLKQELNTAMSQVVDTVVKVFSAKPSRQVPQVFPPLQIPQAR FAVNGENHNFHTANQRLQCFGDVIIPNPLDTFGNVQMASSTDQTEALPLVVRKNSSDQ SASGLVGGHHQPLHQSPLSATTGFTTSTFRHPFPLPLMAYPFQSPLGAPSGSFSGKDR ASPESLDLTRDTTSLRTKMSSHHLSHHPCSPAHPPSTAEGLSLSLIKSECGDLQDMSE ISPYSGSAMQEGLSPNHLKKAKLMFFYTRYPSSNMLKTYFSDVKFNRCITSQLIKWFS NFREFYYIQMEKYARQAINDGVTSTEELSITRDCELYRALNMHYNKANDFEVPERFLE VAQITLREFFNAIIAGKDVDPSWKKAIYKVICKLDSEVPEFFKSPNCLQELLHE" BASE COUNT 740 a 833 c 678 g 673 t ORIGIN 1 aagtaaatct tgttgtggag cggagccctc agctgagggt gcgctctgaa ataatacacc 61 attgcagccg gggaaagcag agcgcgcaaa agagctctcg ccgggtccgc ctgctccctc 121 tccgcttcgc tcctcttctc ttctttaccc ttctcctctc tcctcctctg ctgctctctc 181 ctctcctccg ctcttctctc tcctcctctc ctgctctctc ctcttccctt agctcctctt 241 cttttcttct cctcttcttc cctctcctcg cctctcccct gctcctcttc tctcgtctcc 301 cctcccctcc cgcctctctc tcccctctcc ctctcccact cgccccgctc gctcgctcgt 361 cgtcgcacag actcaccgtc ccttgtccaa ttatcatatt catcacccgc aagatatcac 421 cgtgtgtgca ctcgcgtgtt ttcctctctc tgccggggga aaaaaaagag agagagaggg 481 atagagagag agagagagag agagagagag aggctcggtc ccactgctcc ctgcaccgcg 541 gtcccgggat tcttgagctg tgcccagctg acgagctttt gaagatggca caataaccgt 601 ccagtgatgc ctgaccatga cagcacagcc ctcttaagcc ggcaaaccaa gaggagaaga 661 gttgacattg gagtgaaaag gacggtaggg acagcatctg cattttttgc taaggcaaga 721 gcaacgtttt ttagtgccat gaatccccaa ggttctgagc aggatgttga gtattcagtg 781 gtgcagcatg cagatgggga aaagtcaaat gtactacgca agctgctgaa gagggcgaac 841 tcgtatgaag atgccatgat gccttttcca ggagcaacca taatttccca gctgttgaaa 901 aataacatga acaaaaatgg tggcacggag cccagtttcc aagccagcgg tctctctagt 961 acaggctccg aagtacatca ggaggatata tgcagcaact cttcaagaga cagcccccca 1021 gagtgtcttt ccccttttgg caggcctact atgagccagt ttgatatgga tcgcttatgt 1081 gatgagcacc tgagagcaaa gcgggcccgg gttgagaata taattcgggg tatgagccat 1141 tcccccagtg tggcattaag gggcaatgaa aatgaaagag agatggcccc gcagtctgtg 1201 agtccccgag aaagttacag agaaaacaaa cgcaagcaaa agcttcccca gcagcagcaa 1261 cagagtttcc agcagctggt ttcagcccga aaagaacaga agcgagagga gcgccgacag 1321 ctgaaacagc agctggagga catgcagaaa cagctgctcc acgtgcagga aaagttctac 1381 caaatctatg acagcactga ttcggaaaat gatgaagatg gtaacctgtc tgaagacagc 1441 atgcgctcgg agatcctgga tgccagggcc caggactctg tcggaaggtc agataatgag 1501 atgtgcgagc tagacccagg acagtttatt gaccgagctc gagccctgat cagagagcag 1561 gaaatggctg aaaacaagcc gaagcgagaa ggcaacaaca aagaaagaga ccatgggcca 1621 aactccttac aaccggaagg caaacatttg gctgagacct tgaaacagga actgaacact 1681 gccatgtcgc aagttgtgga cactgtggtc aaagtctttt cggccaagcc ctcccgccag 1741 gttcctcagg tcttcccacc tctccagatc ccccaggcca gatttgcagt caatggggaa 1801 aaccacaatt tccacaccgc caaccagcgc ctgcagtgct ttggcgacgt catcattccg 1861 aaccccctgg acacctttgg caatgtgcag atggccagtt ccactgacca gacagaagca 1921 ctgcccctgg ttgtccgcaa aaactcctct gaccagtctg cctccggcct ggtgggcggc 1981 caccaccagc ccctgcacca gtcgcctctc tctgccacca cgggcttcac cacgtccacc 2041 ttccgccacc ccttccccct tcccttgatg gcctatccat ttcagagccc attaggtgct 2101 ccctccggct ccttctctgg aaaagacaga gcctctcctg aatccttaga cttaactagg 2161 gataccacga gtctgaggac caagatgtca tctcaccacc tgagccacca cccttgttca 2221 ccagcacacc cgcccagcac cgccgaaggg ctctccttgt cgctcataaa gtccgagtgc 2281 ggcgatcttc aagatatgtc tgaaatatca ccttattcgg gaagtgcaat gcaggaagga 2341 ttgtcaccca atcacttgaa aaaagcaaag ctcatgtttt tttatacccg ttatcccagc 2401 tccaatatgc tgaagaccta cttctccgac gtaaagttca acagatgcat tacctctcag 2461 ctcatcaagt ggtttagcaa tttccgtgag ttttactaca ttcagatgga gaagtacgca 2521 cgtcaagcca tcaacgatgg ggtcaccagt actgaagagc tgtctataac cagagactgt 2581 gagctgtaca gggctctgaa catgcactac aataaagcaa atgactttga ggttccagag 2641 agattcctgg aagtggctca gatcacatta cgggagtttt tcaatgccat tatcgcaggc 2701 aaagatgttg atccttcctg gaagaaggcc atatacaagg tcatctgcaa gctggatagt 2761 gaagtccctg agtttttcaa atccccgaac tgcctacaag agctgcttca tgagtagaaa 2821 tttcaacaac tctttttgaa tgtatgaaga gtagcagtcc tctttggatg tccaagttat 2881 atgtgtctag attttgattt catatatatg tgtatgggag gcgg // LOCUS HSU44103 606 bp mRNA PRI 10-APR-1997 DEFINITION Human small GTP binding protein Rab9 mRNA, complete cds. ACCESSION U44103 NID g1174146 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 606) AUTHORS Davies,J.P., Cotter,P.D. and Ioannou,Y.A. TITLE Cloning and mapping of human Rab7 and Rab9 cDNA sequences and identification of a Rab9 pseudogene JOURNAL Genomics 41 (1), 131-134 (1997) MEDLINE 97271569 REFERENCE 2 (bases 1 to 606) AUTHORS Ioannou,Y.A. and Davies,J.P. TITLE Direct Submission JOURNAL Submitted (29-DEC-1995) Yiannis A. Ioannou, Human Genetics, Mount Sinai School of Medicine, One Gustave L. Levy Place, Box 1497, New York, NY 10128, USA FEATURES Location/Qualifiers source 1..606 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="U937 cells" CDS 1..606 /codon_start=1 /product="small GTP binding protein Rab9" /db_xref="PID:g1174147" /translation="MAGKSSLFKVILLGDGGVGKSSLMNRYVTNKFDTQLFHTIGVEF LNKDLEVDGHFVTMQIWDTAGQERFRSLRTPFYRGSDCCLLTFSVDDSQSFQNLSNWK KEFIYYADVKEPESFPFVILGNKIDISERQVSTEEAQAWCRDNGDYPYFETSAKDATN VAAAFEEAVRRVLATEDRSDHLIQTDTVNLHRKPKPSSSCC" BASE COUNT 182 a 112 c 147 g 165 t ORIGIN 1 atggcaggaa aatcttcact ttttaaagta attctccttg gagatggtgg agttgggaag 61 agttcactta tgaacagata tgtaactaat aagtttgata cccagctctt ccatacaata 121 ggtgtggaat ttttaaataa agatttggaa gtggatggac attttgttac catgcagatt 181 tgggacacgg caggtcagga gcgattccga agcctgagga caccatttta cagaggttct 241 gactgctgcc tgcttacttt tagtgtcgat gattcacaaa gcttccagaa cttaagtaac 301 tggaagaaag aattcatata ttatgcagat gtgaaagagc ctgagagctt tccttttgtg 361 attctgggta acaagattga cataagcgaa cggcaggtgt ctacagaaga agcccaagct 421 tggtgcaggg acaacggcga ctatccttat tttgaaacaa gtgcaaaaga tgccacaaat 481 gtggcagcag cctttgagga agcggttcga agagttcttg ctaccgagga taggtcagat 541 catttgattc agacagacac agtcaatctt caccgaaagc ccaagcctag ctcatcttgc 601 tgttga // LOCUS HSU44104 624 bp mRNA PRI 02-FEB-1996 DEFINITION Human small GTP binding protein Rab7 mRNA, complete cds. ACCESSION U44104 NID g1174148 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 624) AUTHORS Ioannou,Y.A. and Davies,J.P. TITLE Nucleotide Sequence of Human Rab7 and Rab9 Proteins and Human Rab9 Expressed Pseudogene JOURNAL Unpublished REFERENCE 2 (bases 1 to 624) AUTHORS Ioannou,Y.A. and Davies,J.P. TITLE Direct Submission JOURNAL Submitted (29-DEC-1995) Yiannis A. Ioannou, Human Genetics, Mount Sinai School of Medicine, One Gustave L. Levy Place, Box 1497, New York, NY 10128, USA FEATURES Location/Qualifiers source 1..624 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="U937 cells" CDS 1..624 /codon_start=1 /product="small GTP binding protein Rab7" /db_xref="PID:g1174149" /translation="MTSRKKVLLKVIILGDSGVGKTSLMNQYVNKKFSNQYKATIGAD FLTKEVMVDDRLVTMQIWDTAGQERFQSLGVAFYRGADCCVLVFDVTAPNTFKTLDSW RDEFLVQASPRDPENFPFVVLGNKVDLENRQVATKRAQAWCYSKNNIPYFETSAKEAI NVEQAFQTIARNALKQETEVELYNEFPEPIKLDKNDRAKASAESCSC" BASE COUNT 183 a 143 c 170 g 128 t ORIGIN 1 atgacctcta ggaagaaagt gttgctgaag gttatcatcc tgggagattc tggagtcggg 61 aagacatcac tcatgaacca gtatgtgaat aagaaattca gcaatcagta caaagccaca 121 ataggagctg actttctgac caaggaggtg atggtggatg acaggctggt cacaatgcag 181 atatgggaca cagcaggaca ggaacggttc cagtctctcg gtgtggcctt ctacagaggt 241 gcagactgct gcgttctggt atttgatgtg actgccccca acacattcaa aaccctagat 301 agctggagag atgagtttct cgtccaggcc agtccccgag atcctgaaaa cttcccattt 361 gttgtgttgg gaaacaaggt tgacctcgaa aacagacaag tggccacaaa gcgggcacag 421 gcctggtgct acagcaaaaa caacattccc tactttgaga ccagtgccaa ggaggccatc 481 aacgtggagc aggcgttcca gacgattgca cggaatgcac ttaagcagga aacggaggtg 541 gagctgtaca acgaatttcc tgaacctatc aaactggaca agaatgaccg ggccaaggcc 601 tcggcagaaa gctgcagttg ctga // LOCUS HSU44128 3131 bp mRNA PRI 31-JAN-1996 DEFINITION Human thiazide-sensitive Na-Cl cotransporter (hTSC) mRNA, complete cds. ACCESSION U44128 NID g1172160 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3131) AUTHORS Simon,D.B., Nelson-Williams,C., Bia,M.J., Ellison,D., Karet,F.E., Morey,Molina, A., Vaara,I., Iwata,F., Cushner,H.M., Koolen,M., Gainza,F.J., Gitelman,H.J. and Lifton,R.P. TITLE Gitelman's variant of Bartter's syndrome, inherited hypokalaemic alkalosis, is caused by mutations in the thiazide-sensitive Na-Cl cotransporter JOURNAL Nature Genet. 12 (1), 24-30 (1996) MEDLINE 96122035 REFERENCE 2 (bases 1 to 3131) AUTHORS Simon,D.B., Nelson-Williams,C., Ellison,D. and Lifton,R.P. TITLE Direct Submission JOURNAL Submitted (02-JAN-1996) David B. Simon, Internal Medicine, Yale University, 295 Congress Avenue, New Haven, CT 06510, USA FEATURES Location/Qualifiers source 1..3131 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" /map="16q13" gene 7..3099 /gene="hTSC" exon 7..288 /gene="hTSC" /number=1 CDS 7..3099 /gene="hTSC" /codon_start=1 /product="thiazide-sensitive Na-Cl" /db_xref="PID:g1172161" /translation="MAELPTTETPGDATLCSGRFTISTLLSSDEPSPPAAYDSSHPSH LTHSSTFCMRTFGYNTIDVVPTYEHYANSTQPGEPRKVRPTLADLHSFLKQEGRHLHA LAFDSRPSHEMTDGLVEGEAGTSSEKNPEEPVRFGWVKGVMIRCMLNIWGVILYLRLP WITAQAGIVLTWIIILLSVTVTSITGLSISAISTNGKVKSGGTYFLISRSLGPELGGS IGLIFAFANAVGVAMHTVGFAETVRDLLQEYGAPIVDPINDIRIIGVVSVTVLLAISL AGMEWESKAQVLFFLVIMVSFANYLVGTLIPPSEDKASKGFFSYRADIFVQNLVPDWR GPDGTFFGMFSIFFPSATGILAGANISGDLKDPAIAIPKGTLMAIFWTTISYLAISAT IGSCVVRDASGVLNDTVTPGWGACEGLACSYGWNFTECTQQHSCHYGLINYYQTMSMV SGFAPLITAGIFGATLSSALACLVSAAKVFQCLCEDQLYPLIGFFGKGYGKNKEPVRG YLLAYAIAVAFIIIAELNTIAPIISNFFLCSYALINFSCFHASITNSPGWRPSFQYYN KWAALFGAIISVVIMFLLTWWAALIAIGVVLFLLLYVIYKKPEVNWGSSVQAGSYNLA LSYSVGLNEVEDHIKNYRPQCLVLTGPPNFRPALVDFVGTFTRNLSLMICGHVLIGPH KQRMPELQLIANGHTKWLNKRKIKAFYSDVIAEDLRRGVQILMQAAGLGRMKPNILVV GFKKNWQSAHPATVEDYIGILHDAFEFNYGVCVMRMREGLNVSKMMQAHINPVFDPAE DGKEASARGARPSVSGALDPKALVKEEQATTIFQSEQGKKTIDIYWLFDDGGLTLLIP YLLGRKRRWSKCKIRVFVGGQINRMDQERKAIISLLSKFRLGFHEVHILPDINQNPRA EHTKRFEDMIAPFRLNDGFKDEATVNEMRRDCPWKISDEEITKNRVKSLRQVRLNEIV LDYSRDAALIVITLPIGRKGKCPSSLYMAWLETLSQDLRPPVILIRGNQENVLTFYCQ " exon 289..435 /gene="hTSC" /number=2 exon 436..511 /gene="hTSC" /number=3 exon 512..607 /gene="hTSC" /number=4 exon 608..747 /gene="hTSC" /number=5 exon 748..858 /gene="hTSC" /number=6 exon 859..970 /gene="hTSC" /number=7 exon 971..1101 /gene="hTSC" /number=8 exon 1102..1186 /gene="hTSC" /number=9 exon 1187..1341 /gene="hTSC" /number=10 exon 1342..1449 /gene="hTSC" /number=11 exon 1450..1573 /gene="hTSC" /number=12 exon 1574..1675 /gene="hTSC" /number=13 exon 1676..1831 /gene="hTSC" /number=14 exon 1832..1931 /gene="hTSC" /number=15 exon 1932..2043 /gene="hTSC" /number=16 exon 2044..2184 /gene="hTSC" /number=17 exon 2185..2291 /gene="hTSC" /number=18 exon 2292..2374 /gene="hTSC" /number=19 exon 2375..2452 /gene="hTSC" /number=20 exon 2453..2554 /gene="hTSC" /number=21 exon 2555..2666 /gene="hTSC" /number=22 exon 2667..2753 /gene="hTSC" /number=23 exon 2754..2889 /gene="hTSC" /number=24 exon 2890..2952 /gene="hTSC" /number=25 exon 2953..3099 /gene="hTSC" /number=26 BASE COUNT 636 a 987 c 850 g 658 t ORIGIN 1 gcgacaatgg cagaactgcc cacaacagag acgcctgggg acgccacttt gtgcagcggg 61 cgcttcacca tcagcacact gctgagcagt gatgagccct ctccaccagc tgcctatgac 121 agcagccacc ccagccacct gacccacagc agcaccttct gcatgcgcac ctttggctac 181 aacacgatcg atgtggtgcc cacatatgag cactatgcca acagcaccca gcctggtgag 241 ccccggaagg tccggcccac actggctgac ctgcactcct tcctcaagca ggaaggcaga 301 cacctgcatg ccctggcctt tgacagccgg cccagccacg agatgactga tgggctggtg 361 gagggcgagg caggcaccag cagcgagaag aaccccgagg agccagtgcg cttcggctgg 421 gtcaaggggg tgatgattcg ttgcatgctc aacatttggg gcgtgatcct ctacctgcgg 481 ctgccctgga ttacggccca ggcaggcatc gtcctgacct ggatcatcat cctgctgtcg 541 gtcacggtga cctccatcac aggcctctcc atctcagcca tctccaccaa tggcaaggtc 601 aagtcaggtg gcacctactt cctcatctcc cggagtctgg gcccagagct tgggggctcc 661 atcggcctca ttttcgcttt cgccaatgcc gtgggtgtgg ccatgcacac ggtgggcttt 721 gcagagaccg tgcgggacct gctccaggag tatggggcac ccatcgtgga ccccattaac 781 gacatccgca tcattggcgt ggtctcggtc actgtgctgc tggccatctc cctggctggc 841 atggagtggg agtccaaggc ccaggtgctg ttcttccttg tcatcatggt ctcctttgcc 901 aactatttag tggggacgct gatcccccca tctgaggaca aggcctccaa gggcttcttc 961 agctaccggg cggacatttt tgtccagaac ttggtgcctg actggcgggg tccagatggc 1021 accttcttcg gaatgttctc catcttcttc ccctcggcca caggcatcct ggcaggggcc 1081 aacatatctg gtgacctcaa ggaccctgct atagccatcc ccaaggggac cctcatggcc 1141 attttctgga cgaccatttc ctacctggcc atctcagcca ccattggctc ctgcgtggtg 1201 cgtgatgcct ctggggtcct gaatgacaca gtgacccctg gctggggtgc ctgcgagggg 1261 ctggcctgca gctatggctg gaacttcacc gagtgcaccc agcagcacag ctgccactac 1321 ggcctcatca actattacca gaccatgagc atggtgtcag gcttcgcgcc cctgatcacg 1381 gctggcatct tcggggccac cctctcctct gccctggcct gccttgtctc tgctgccaaa 1441 gtcttccagt gcctttgcga ggaccagctg tacccactga tcggcttctt cggcaaaggc 1501 tatggcaaga acaaggagcc cgtgcgtggc tacctgctgg cctacgccat cgctgtggcc 1561 ttcatcatca tcgctgagct caacaccata gcccccatca tttccaactt cttcctctgc 1621 tcctatgccc tcatcaactt cagctgcttc cacgcctcca tcaccaactc gcctgggtgg 1681 agaccttcat tccaatacta caacaagtgg gcggcgctgt ttggggctat catctccgtg 1741 gtcatcatgt tcctcctcac ctggtgggcg gccctcatcg ccattggcgt ggtgctcttc 1801 ctcctgctct atgtcatcta caagaagcca gaggtaaatt ggggctcctc ggtacaggct 1861 ggctcctaca acctggccct cagctactcg gtgggcctca atgaggtgga agaccacatc 1921 aagaactacc gcccccagtg cctggtgctc acggggcccc ccaacttccg cccggccctg 1981 gtggactttg tgggcacctt cacccggaac ctcagcctga tgatctgtgg ccacgtgctc 2041 atcggacccc acaagcagag gatgcctgag ctccagctca tcgccaacgg gcacaccaag 2101 tggctgaaca agaggaagat caaggccttc tactcggatg tcattgccga ggacctccgc 2161 agaggcgtcc agatcctcat gcaggccgca ggtctcggga gaatgaagcc caacattctg 2221 gtggttgggt tcaagaagaa ctggcagtcg gctcacccgg ccacagtgga agactacatt 2281 ggcatcctcc atgatgcctt tgagttcaac tatggcgtgt gtgtcatgag gatgcgggag 2341 ggactcaacg tgtccaagat gatgcaggcg cacattaacc ccgtgtttga cccagcggag 2401 gacgggaagg aagccagcgc cagaggtgcc aggccatcag tctctggcgc tttggacccc 2461 aaggccctgg tgaaggagga gcaggccacc accatcttcc agtcggagca gggcaagaag 2521 accatagaca tctactggct ctttgacgat ggaggcctca ccctcctcat tccctatctc 2581 cttggccgca agaggaggtg gagcaaatgc aagatccgtg tgttcgtagg cggccagatt 2641 aacaggatgg accaggagag aaaggcgatc atttctctgc tgagcaagtt ccgactggga 2701 ttccatgaag tccacatcct ccctgacatc aaccagaacc ctcgggctga gcacaccaag 2761 aggtttgagg acatgattgc acccttccgt ctgaatgatg gcttcaagga tgaggccact 2821 gtcaacgaga tgcggcggga ctgcccctgg aagatctcag atgaggagat tacgaagaac 2881 agagtcaagt cccttcggca ggtgaggctg aatgagattg tgctggatta ctcccgagac 2941 gctgctctca tcgtcatcac tttgcccata gggaggaagg ggaagtgccc cagctcgctg 3001 tacatggcct ggctggagac cctgtcccag gacctcagac ctccagtcat cctgatccga 3061 ggaaaccagg aaaacgtgct caccttttac tgccagtaac tccaggcttt gacatccctg 3121 tccacagctc t // LOCUS HSU44378 2680 bp mRNA PRI 01-MAR-1996 DEFINITION Human homozygous deletion target in pancreatic carcinoma (DPC4) mRNA, complete cds. ACCESSION U44378 NID g1163233 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2680) AUTHORS Hahn,S.A., Hoque,A.T.M.S. and Kern,S.E. TITLE Homozygous deletion map at 18q21.1 in pancreatic cancer JOURNAL Cancer Res. (1996) In press REFERENCE 2 (bases 1 to 2680) AUTHORS Hahn,S.A., Schutte,M., Shamsul Hoque,A.T.M., Moskaluk,C.A., da Costa,L.T., Rozenblum,E., Weinstein,C.L., Fischer,A., Yeo,C.J., Hruban,R.H. and Kern,S.E. TITLE DPC4, a candidate tumor suppressor gene at human chromosome 18q21.1 JOURNAL Science 271 (5247), 350-353 (1996) MEDLINE 96144684 REFERENCE 3 (bases 1 to 2680) AUTHORS Hahn,S.A., Schutte,M. and Kern,S.E. TITLE Direct Submission JOURNAL Submitted (03-JAN-1996) Scott E. Kern, Oncology, Johns Hopkins, 628 Ross Bldg, 720 Rutland Ave., Baltimore, MD 21205-2196, USA FEATURES Location/Qualifiers source 1..2680 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="18" /map="18q21.1" gene 1..2680 /gene="DPC4" 5'UTR 1..128 /gene="DPC4" /note="exon structure of untranslated region was not determined" exon 1..377 /gene="DPC4" /number=1 CDS 129..1787 /gene="DPC4" /standard_name="homozygous deletion target in pancreatic carcinoma" /note="similar to Drosophila melanogaster Mothers against dpp (Mad) gene, GenBank Accession Number U10328" /codon_start=1 /product="Dpc4" /db_xref="PID:g1163234" /translation="MDNMSITNTPTSNDACLSIVHSLMCHRQGGESETFAKRAIESLV KKLKEKKDELDSLITAITTNGAHPSKCVTIQRTLDGRLQVAGRKGFPHVIYARLWRWP DLHKNELKHVKYCQYAFDLKCDSVCVNPYHYERVVSPGIDLSGLTLQSNAPSSMMVKD EYVHDFEGQPSLSTEGHSIQTIQHPPSNRASTETYSTPALLAPSESNATSTANFPNIP VASTSQPASILGGSHSEGLLQIASGPQPGQQQNGFTGQPATYHHNSTTTWTGSRTAPY TPNLPHHQNGHLQHHPPMPPHPGHYWPVHNELAFQPPISNHPAPEYWCSIAYFEMDVQ VGETFKVPSSCPIVTVDGYVDPSGGDRFCLGQLSNVHRTEAIERARLHIGKGVQLECK GEGDVWVRCLSDHAVFVQSYYLDREAGRAPGDAVHKIYPSAYIKVFDLRQCHRQMQQQ AATAQAAAAAQAAAVAGNIPGPGSVGGIAPAISLSAAAGIGVDDLRRLCILRMSFVKG WGPDYPRQSIKETPCWIEIHLHRALQLLDEVLHTMPIADPQPLD" exon 378..552 /gene="DPC4" /number=2 exon 553..582 /gene="DPC4" /number=3 exon 583..796 /gene="DPC4" /number=4 exon 797..915 /gene="DPC4" /number=5 exon 916..1032 /gene="DPC4" /number=6 exon 1033..1083 /gene="DPC4" /number=7 exon 1084..1267 /gene="DPC4" /number=8 exon 1268..1436 /gene="DPC4" /number=9 exon 1437..1575 /gene="DPC4" /number=10 exon 1576..>1787 /gene="DPC4" /number=11 3'UTR 1785..2680 /gene="DPC4" /note="exon structure of untranslated region was not determined" BASE COUNT 775 a 548 c 567 g 790 t ORIGIN 1 ggttatcctg aatacatgtc taacaatttt ccttgcaacg ttagctgttg tttttcactg 61 tttccaaagg atcaaaattg cttcagaaat tggagacata tttgatttaa aaggaaaaac 121 ttgaacaaat ggacaatatg tctattacga atacaccaac aagtaatgat gcctgtctga 181 gcattgtgca tagtttgatg tgccatagac aaggtggaga gagtgaaaca tttgcaaaaa 241 gagcaattga aagtttggta aagaagctga aggagaaaaa agatgaattg gattctttaa 301 taacagctat aactacaaat ggagctcatc ctagtaaatg tgttaccata cagagaacat 361 tggatgggag gcttcaggtg gctggtcgga aaggatttcc tcatgtgatc tatgcccgtc 421 tctggaggtg gcctgatctt cacaaaaatg aactaaaaca tgttaaatat tgtcagtatg 481 cgtttgactt aaaatgtgat agtgtctgtg tgaatccata tcactacgaa cgagttgtat 541 cacctggaat tgatctctca ggattaacac tgcagagtaa tgctccatca agtatgatgg 601 tgaaggatga atatgtgcat gactttgagg gacagccatc gttgtccact gaaggacatt 661 caattcaaac catccagcat ccaccaagta atcgtgcatc gacagagaca tacagcaccc 721 cagctctgtt agccccatct gagtctaatg ctaccagcac tgccaacttt cccaacattc 781 ctgtggcttc cacaagtcag cctgccagta tactgggggg cagccatagt gaaggactgt 841 tgcagatagc atcagggcct cagccaggac agcagcagaa tggatttact ggtcagccag 901 ctacttacca tcataacagc actaccacct ggactggaag taggactgca ccatacacac 961 ctaatttgcc tcaccaccaa aacggccatc ttcagcacca cccgcctatg ccgccccatc 1021 ccggacatta ctggcctgtt cacaatgagc ttgcattcca gcctcccatt tccaatcatc 1081 ctgctcctga gtattggtgt tccattgctt actttgaaat ggatgttcag gtaggagaga 1141 catttaaggt tccttcaagc tgccctattg ttactgttga tggatacgtg gacccttctg 1201 gaggagatcg cttttgtttg ggtcaactct ccaatgtcca caggacagaa gccattgaga 1261 gagcaaggtt gcacataggc aaaggtgtgc agttggaatg taaaggtgaa ggtgatgttt 1321 gggtcaggtg ccttagtgac cacgcggtct ttgtacagag ttactactta gacagagaag 1381 ctgggcgtgc acctggagat gctgttcata agatctaccc aagtgcatat ataaaggtct 1441 ttgatttgcg tcagtgtcat cgacagatgc agcagcaggc ggctactgca caagctgcag 1501 cagctgccca ggcagcagcc gtggcaggaa acatccctgg cccaggatca gtaggtggaa 1561 tagctccagc tatcagtctg tcagctgctg ctggaattgg tgttgatgac cttcgtcgct 1621 tatgcatact caggatgagt tttgtgaaag gctggggacc ggattaccca agacagagca 1681 tcaaagaaac accttgctgg attgaaattc acttacaccg ggccctccag ctcctagacg 1741 aagtacttca taccatgccg attgcagacc cacaaccttt agactgaggt cttttaccgt 1801 tggggccctt aaccttatca ggatggtgga ctacaaaata caatcctgtt tataatctga 1861 agatatattt cacttttctt ctgctttatc ttttcataaa gggttgaaaa tgtgtttgct 1921 gccttgctcc tagcagacag aaactggatt aaaacaattt ttttttcctc ttcagaactt 1981 gtcaggcatg gctcagagct tgaagattag gagaaacaca ttcttattaa ttcttcacct 2041 gttatgtatg aaggaatcat tccagtgcta gaaaatttag ccctttaaaa cgtcttagag 2101 ccttttatct gcagaacatc gatatgtata tcattctaca gaataatcca gtattgctga 2161 ttttaaaggc agagaagttc tcaaagttaa ttcacctatg ttattttgtg tacaagttgt 2221 tattgttgaa catacttcaa aaataatgtg ccatgtgggt gagttaattt taccaagagt 2281 aactttactc tgtgtttaaa aatgaagtta ataatgtatt gtaatctttc atccaaaata 2341 ttttttgcaa gttatattag tgaagatggt ttcaattcag attgtcttgc aacttcagtt 2401 ttatttttgc caaggcaaaa aactcttaat ctgtgtgtat attgagaatc ccttaaaatt 2461 accagacaaa aaaatttaaa attacgtttg ttattcctag tggatgactg ttgatgaagt 2521 atacttttcc cctgttaaac agtagttgta ttcttctgta tttctaggca caaggttggt 2581 tgctaagaag cctataagag gaatttcttt tccttcattc atagggaaag gttttgtatt 2641 ttttaaaaca ctaaaagcag cgtcactcta cctaatgtct // LOCUS HSU44427 1325 bp mRNA PRI 16-JAN-1997 DEFINITION Human D53 (hD53) mRNA, complete cds. ACCESSION U44427 NID g1469919 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1325) AUTHORS Byrne,J.A., Tomasetto,C., Garnier,J.M., Rouyer,N., Mattei,M.G., Bellocq,J.P., Rio,M.C. and Basset,P. TITLE A screening method to identify genes commonly overexpressed in carcinomas and the identification of a novel complementary DNA sequence JOURNAL Cancer Res. 55 (13), 2896-2903 (1995) MEDLINE 95316866 REFERENCE 2 (bases 1 to 1325) AUTHORS Byrne,J.A., Mattei,M.G. and Basset,P. TITLE Definition of the tumor protein D52 (TPD52) gene family through cloning of D52 homologues in human (hD53) and mouse (mD52) JOURNAL Genomics 35 (3), 523-532 (1996) MEDLINE 97001154 REFERENCE 3 (bases 1 to 1325) AUTHORS Byrne,J.A., Mattei,M.-G. and Basset,P. TITLE Direct Submission JOURNAL Submitted (03-JAN-1996) Jennifer Anne Byrne, Institut de Genetique et de Biologie Moleculaire et Cellulaire INSERM/CNRS/ULP/College de France, BP 163, ILLKIRCH CEDEX, 67404, France FEATURES Location/Qualifiers source 1..1325 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6q22-q23" /tissue_type="breast carcinoma" gene 1..1325 /gene="hD53" 5'UTR 1..180 /gene="hD53" CDS 181..795 /gene="hD53" /note="D52 homologue" /codon_start=1 /product="D53" /db_xref="PID:g1469920" /translation="MEAQAQGLLETEPLQGTDEDAVASADFSSMLSEEEKEELKAELV QLEDEITTLRQVLSAKERHLVEIKQKLGMNLMNELKQNFSKSWHDMQTTTAYKKTHET LSHAGQKATAAFSNVGTAISKKFGDMSYSIRHSISMPAMRNSPTFKSFEERVETTVTS LKTKVGGTNPNGGSFEEVLSSTAHASAQSLAGGSRRTKEEELQC" misc_difference 567..666 /gene="hD53" /note="this region is deleted in EST clone 83289, GenBank Accession Number T68402 but is present in EST clone 116783, GenBank Accession Number T89899" 3'UTR 796..1325 /gene="hD53" variation replace(865,"t") /gene="hD53" /note="T/G polymorphism" polyA_signal 1308..1313 /gene="hD53" polyA_site 1325 /gene="hD53" BASE COUNT 369 a 325 c 317 g 314 t ORIGIN 1 cagaagcggc tagtggcggc tgcctgcgtc cccaaccccc tccgcgcagc gctcgcgaca 61 cgcgtgccag gagtgggagc gagcggcggg gccagctgcg ttctgagcct gggcgcagct 121 gccatctgct ctgggaagca ccagggtgtc cccgccgccc tcagctcgaa gtcagccacc 181 atggaggcgc aggcacaagg tttgttggag actgaaccgt tgcaaggaac agacgaagat 241 gcagtagcca gtgctgactt ctctagcatg ctctctgagg aggaaaagga agagttaaaa 301 gcagagttag ttcagctaga agacgaaatt acaacactac gacaagtttt gtcagcgaaa 361 gaaaggcatc tagttgagat aaaacaaaaa ctcggcatga acctgatgaa tgaattaaaa 421 cagaacttca gcaaaagctg gcatgacatg cagactacca ctgcctacaa gaaaacacat 481 gaaaccctga gtcacgcagg gcaaaaggca actgcagctt tcagcaacgt tggaacggcc 541 atcagcaaga agttcggaga catgagttac tccattcgcc attccataag tatgcctgct 601 atgaggaatt ctcctacttt caaatcattt gaggagaggg ttgagacaac tgtcacaagc 661 ctcaagacga aagtaggcgg tacgaaccct aatggaggca gttttgagga ggtcctcagc 721 tccacggccc atgccagtgc ccagagcttg gcaggaggct cccggcggac caaggaggag 781 gagctgcagt gctaagtcca gccagcgtgc agctgcatcc agaaaccggc cactacccag 841 cccatctctg cctgtgctta tccagataag aagaccaaaa tcccgctggg aaaaacccag 901 gccttgacat tgttattcaa atggcccctc cagaaagttt aatgatttcc atttgtattt 961 gtgttgatga tggaccactt gaccatcaca tttcagtatt catagatgac tgtcacattt 1021 taaaatgttc ccacttgagc aggtacacaa ctggtcataa ttcctgtctg tgtaattcga 1081 tgtatatttt tccaaacatg tagctattgt ttgctttgat ttttgcttgg cctcctttat 1141 gatgtgcatg tccttgaagg ctgaatgaac agtccctttc agttcagcag atcaacagga 1201 tggagctctt catgactgtc tccagcaata ggatgattta ctataaattt catccaacta 1261 cttgtgatct ctctcaccta catcaattat gtatgttaat ttcagcaatt aaaagaattg 1321 atttt // LOCUS HSU44755 1536 bp mRNA PRI 02-FEB-1996 DEFINITION Human PSE-binding factor PTF delta subunit mRNA, complete cds. ACCESSION U44755 NID g1174204 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1536) AUTHORS Yoon,J.B. and Roeder,R.G. TITLE Cloning of two proximal sequence element-binding transcription factor subunits (gamma and delta) that are required for transcription of small nuclear RNA genes by RNA polymerases II and III and interact with the TATA-binding protein JOURNAL Mol. Cell. Biol. 16 (1), 1-9 (1996) MEDLINE 96104548 REFERENCE 2 (bases 1 to 1536) AUTHORS Yoon,J.-B. and Roeder,R.G. TITLE Direct Submission JOURNAL Submitted (05-JAN-1996) Jong-Bok Yoon, Lab. of Biochemistry and Molecular Biology, Rockefeller University, 1230 York Avenue, New York, NY 10021, USA FEATURES Location/Qualifiers source 1..1536 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 43..1047 /codon_start=1 /function="transcription factor" /product="PSE-binding factor PTF delta subunit" /db_xref="PID:g1174205" /translation="MKPPPRRRAAPARYLGEVTGPATWSAREKRQLVRLLQARQGQPE PDATELARELRGRSEAEIRVFLQQLKGRVAREAIQKVHPGGLQGPRRREAQPPAPIEV WTDLAEKITGPLEEAVAVAFSQVLTIAATEPVTLLHSKPPKPTQARGKPLLLSAPGGQ EDPAPEIPSSAPAAPSSAPRTPDPAPEKPSESSAGPSTEEDFAVDFEKIYKYLSSVSR SGRSPELSAAESAVVLDLLMSLPEELPLLPCTALVEHMTETYLRLTAPQPIPAGGSLG PAAEGDGAGSKAPEETPPATEKAEHSELKSPWQAAGICPLNPFLVPLELLGRAATPAR " BASE COUNT 275 a 527 c 458 g 276 t ORIGIN 1 ccggagaagc gaccttacag cgcctgcctc tttctgagcg gcatgaagcc acctcccagg 61 cggcgagcgg ccccggcgcg ctatctgggc gaggtgaccg gtcccgcgac ctggagcgct 121 cgcgagaagc ggcagctagt gcgactcctg caggcgcggc agggccagcc ggagccggac 181 gccaccgagc tggcccggga gctgcggggc cggagcgagg ctgagatccg ggtcttcctc 241 cagcagctca agggccgcgt agcccgggag gccattcaga aagtgcatcc gggtggcctt 301 cagggaccaa ggcgccggga ggcacagccc ccagccccca tagaggtctg gacggatctg 361 gctgagaaga taacagggcc actggaagaa gccgtggcag tggctttctc gcaggtgctc 421 accatcgcgg ccacggaacc ggtcaccctc ctgcactcca agccccccaa gcccacgcag 481 gcccgtggaa agcctttgct cctgagcgcc cctggaggac aggaagaccc cgcccctgaa 541 atacctagct ctgcccctgc tgcacctagc tccgcaccca ggactcctga ccctgcccct 601 gagaaacctt ctgagtcgtc ggctggtccc tccactgaag aagactttgc tgtggacttt 661 gagaagatct acaagtactt gtcctctgtc tcccgaagtg gccgcagccc cgagctctca 721 gcagctgagt ccgctgtggt cctcgacctg ctcatgtcac ttccagagga gctgccactc 781 ctgccctgca cagccctggt tgagcatatg acggagacgt acctacgcct gacagccccc 841 cagcccattc ccgctggagg gagcctgggg cctgcagcag aaggggatgg ggctggctcc 901 aaggcaccag aggagacccc cccagccacc gagaaggccg agcacagcga actgaaatcg 961 ccttggcaag cagctgggat ctgtcccctg aacccgttcc tggtgcccct ggagcttctg 1021 ggtcgggcag ccacccctgc caggtgaggg gcatggcggg caggaggcca caccaggccc 1081 cccgccctgc ccctcggttc tgctctgctg gccctggctc tttctgagga tcccgtcatg 1141 ggggaaggtc cttgagatga tgctcagctg tggggcgggc ctctaagatg ccccatactt 1201 tgggggtctc agaaatggaa cccccgttgt acaggggttg ggtgggggtt gcaggactcc 1261 actcacaagc ctcctgatgt caaggacagg cggacagggc tggcctcccc cagtccccaa 1321 gccccactgt gccttgttgt ctgctggggg gccatagctg cactgcccac cgtaaaggcc 1381 ctcgcacatt ttcccccttc ctgtacacct cggggccagc atcctcacct tcttcaactg 1441 accagtcgtg gttactccct gctgccaggt ccttcccctt cccgggggta ttctgtgacc 1501 atgaataaag ttatcattct ctttctcttt caaaaa // LOCUS HSU44798 1067 bp mRNA PRI 02-FEB-1996 DEFINITION Human U1-snRNP binding protein homolog mRNA, complete cds. ACCESSION U44798 NID g1174216 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1067) AUTHORS Adams,D.S., Li,Q., Szabo,T., Tan,X., Pero,S. and Czop,J.K. TITLE Cloning and Characterization of a Family of cDNAs from Human Histiocyte Macrophage Cells Encoding a Glycine/Argine-Rich Basic Protein Related to the Highly Conserved 70 kD U1-snRNP Protein JOURNAL Unpublished REFERENCE 2 (bases 1 to 1067) AUTHORS Adams,D. S. TITLE Direct Submission JOURNAL Submitted (05-JAN-1996) David S. Adams, Biology/Biotechnology, Worcester Polytechnic Institute, 100 Institute Road, Worcester, MA 01609, USA FEATURES Location/Qualifiers source 1..1067 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="46" /cell_line="U937" /cell_type="histiocyte (immature macrophage)" 5'UTR 1..213 RBS 187..194 CDS 214..954 /codon_start=1 /product="U1-snRNP binding protein homolog" /db_xref="PID:g1174217" /translation="MNDWMPIAKEYDPLKAGSIDGTDEDPHDRAVWRAMLARYVPNKG VIGDPLLTLFVARLNLQTKEDKLKEVFSRYGDIRRLRLVRDLVTGFSKGYAFIEYKEE RAVIKAYRDADGLVIDQHEIFVDYELERTLKGWIPRRLGGGLGGKKESGQLRFGGRDR PFRKPINLPVVKNDLYREGKRERRERSRSRERHWDSRTRDRDHDRGREKRWQEREPTR VWPDNDWERERDFRDDRIKGREKKERGK" RBS 218..225 3'UTR 955..1062 polyA_signal 1043..1048 polyA_site 1062 BASE COUNT 298 a 226 c 332 g 211 t ORIGIN 1 ctgacatcag gagtttgagg ccggcttgga acatggtgaa atcctgtctg tactagaaat 61 gcaaaaatta gctgggcgtg gtggtgtgtg tctgtgatcc cagctgctcg gcctcccaag 121 gtgctgggat tacaggcgtg agccaccgcg tctggcctca gccaaggttt ttaagtaaca 181 tatttcagca ttggctctac agcgttgcag aacatgaacg attggatgcc catcgccaag 241 gagtatgatc cactcaaagc gggcagcatt gatggcaccg atgaagaccc acacgaccgc 301 gcggtctgga gggcaatgct ggcacgatat gtccccaaca aaggtgtcat aggagatccc 361 ctcctcaccc tgtttgtggc cagactaaac ttgcagacca aggaggacaa attaaaggaa 421 gtcttttccc gctatggtga catccggcgg cttcggctgg tcagggactt ggtcacaggt 481 ttttcaaagg gctacgcctt catcgaatac aaggaggagc gtgccgtgat caaagcttac 541 cgagatgctg atggcctggt tattgaccag catgagatat ttgtggacta cgagctggaa 601 aggactctca aagggtggat ccctcggcga cttggaggcg gtcttggggg aaaaaaggag 661 tctgggcaac tgagatttgg gggacgggac cggccttttc gaaaacctat taacttgcca 721 gttgttaaaa acgacctcta tagagaggga aaacgggaaa ggcgggagcg atctcgatcc 781 cgagaaagac actgggactc gaggacaagg gatcgagacc atgacagggg ccgggagaag 841 agatggcaag aaagagagcc gaccagggtg tggcccgaca atgactggga gagagagagg 901 gacttcagag atgacaggat caaggggagg gagaagaagg aaagaggcaa gtagaggccc 961 aacagcagaa ccccaaagtg aagttacagt ggaaatgagt ggagggggat tgtctttcaa 1021 cgcagcgtga gtctaatggt tgaataaaac ttactgatga tcaaaaa // LOCUS HSU44836 363 bp mRNA PRI 12-JUL-1996 DEFINITION Human inducible cAMP early repressor (CREM) mRNA, complete cds. ACCESSION U44836 NID g1177861 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 363) AUTHORS Bodor,J., Spetz,A.L., Strominger,J.L. and Habener,J.F. TITLE cAMP inducibility of transcriptional repressor ICER in developing and mature human T lymphocytes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (8), 3536-3541 (1996) MEDLINE 96195006 REFERENCE 2 (bases 1 to 363) AUTHORS Bodor,J., Spetz,A.-L., Strominger,J.L. and Habener,J.F. TITLE Direct Submission JOURNAL Submitted (05-JAN-1996) Josef Bodor, Molecular Endocrin, HHMI/MGH, Wellman 306, 50 Blossom St. MGH, Boston, MA 02114, USA FEATURES Location/Qualifiers source 1..363 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Jurkat leukemic T cell line" gene 1..363 /gene="CREM" CDS 1..363 /gene="CREM" /note="Allele: ICER; transcriptional repressor" /codon_start=1 /product="inducible cAMP early repressor" /db_xref="PID:g1177862" /translation="MAVTGDETDEETELAPSHMAAATGDMPTYRIRAPTAALPQGVVM AASPGSLHSPQQLAEEATRKRELRLMKNREAAKECRRRKKEYVKCLESRVAVLEVQNK KLIEELETLKDICSPKTD" BASE COUNT 115 a 81 c 98 g 69 t ORIGIN 1 atggctgtaa ctggagatga aactgatgag gaaactgaac ttgccccaag tcacatggct 61 gctgccactg gtgacatgcc aacttaccgg atccgagctc ctactgctgc tttgccacag 121 ggagtggtga tggctgcatc gcccggaagt ttgcacagtc cccagcagct ggcagaagaa 181 gcaacacgca aacgagagct gaggctaatg aaaaacaggg aagctgccaa agaatgtcga 241 cgtcgaaaga aagaatatgt aaaatgtctg gagagccgag ttgcagtgct ggaagtccag 301 aacaagaagc ttatagagga acttgaaacc ttgaaagaca tttgctctcc caaaacagat 361 tag // LOCUS HSU44839 3167 bp mRNA PRI 25-APR-1996 DEFINITION Human putative ubiquitin C-terminal hydrolase (UHX1) mRNA, complete cds. ACCESSION U44839 NID g1276911 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3167) AUTHORS Swanson,D.A., Freund,C.L., Ploder,L., McInnes,R.R. and Valle,D. TITLE A ubiquitin C-terminal hydrolase gene on the proximal short arm of the X chromosome: implications for X-linked retinal disorders JOURNAL Hum. Mol. Genet. 5 (4), 533-538 (1996) MEDLINE 96254985 REFERENCE 2 (bases 1 to 3167) AUTHORS Swanson,D.A. and Valle,D. TITLE Direct Submission JOURNAL Submitted (06-JAN-1996) David Valle, Genetics, Johns Hopkins University S.O.M., 725 N. Wolfe St. PCTB 802, Baltimore, MD 21218, USA FEATURES Location/Qualifiers source 1..3167 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /map="Xp11.2-21.2" /tissue_type="retina" gene 680..2752 /gene="UHX1" CDS 680..2752 /gene="UHX1" /note="putative ubiquitin C-terminal hydrolase" /codon_start=1 /product="UHX1 protein" /db_xref="PID:g1276912" /translation="METRKKDGTWPSAQLHVMNNNMSEEDEDFKGQPGICGLTNLGNT CFMNSALQCLSNVPQLTEYFLNNCYLEELNFRNPLGMKGEIAEAYADLVKQAWSGHHR SIVPHVFKNKVGHFASQFLGYQQHDSQELLSFLLDGLHEDLNRVKKKEYVELCDAAGR PDQEVAQEAWQNHKRRNDSVIVDTFHGLFKSTLVCPDCGNVSVTFDPFCYLSVPLPIS HKRVLEVFFIPMDPRRKPEQHRLVVPKKGKISDLCVALSKHTGISPERMMVADVFSHR FYKLYQLEEPLSSILDRDDIFVYEVSGRIEAIEGSREDIVVPVYLRERTPARDYNNSY YGLMLFGHPLLVSVPRDRFTWEGLYNVLMYRLSRYVTKPNSDDEDDGDEKEDDEEDKD DVPGPSTGGSLRDPEPEQAGPSSGVTNRCPFLLDNCLGTSQWPPRRRRKQLFTLQTVN SNGTSDRTTSPEEVHAQPYIAIDWEPEMKKRYYDEVEAEGYVKHDCVGYVMKKAPVRL QECIELFTTVETLEKENPWYCPSCKQHQLATKKLDLWMLPEILIIHLKRFSYTKFSRE KLDTLVEFPIRDLDFSEFVIQPQNESNPELYKYDLIAVSNHYGGMRDGHYTTFACNKD SGQWHYFDDNSVSPVNENQIESKAAYVLFYQRQDVARRLLSPAGSSGAPASPACSSPP SSEFMDVN" BASE COUNT 711 a 906 c 873 g 677 t ORIGIN 1 gaattccgct tctacagctg ctgctgcggc ggctatggcg gcggcacggg cggtggactg 61 aggatagaga gccacagcac gaggagctgc caggcctgga cagccagtgc ggccagatag 121 aaaacggcga gagtgggcga gaacgtccac tgcgggccgg cgaaagctgg ttccttgtgg 181 agaagcactg gtataagcag tgggaggcat actgcaggga ggggaccagg actccagcac 241 cttccctggc tgcatcaaca atccacactc tttcaagatg agataaactg gccctcaagg 301 aggactggtg gaaggcgagg attatgtgct gctcccagca cgtgcttggc attacctggt 361 cagctggtat ggtctagagc atggccagcc acccattgaa cgcaaggtca tagagctgcc 421 caacatccag aaggtcgaag tgtacccagt agaactgctg cttgtccggc acaatgattt 481 gggcaaatct cacactgttc agttcagcca taccgattct attggcctag tattgcgcac 541 agctcgggag cggtttctgg tggagcccca ggaagacact cggctttggg ccaagaactc 601 agaaggctct ttggataggt tgtatgacac acacatcacg gttctcgatg cggcccttga 661 gactgggcag ttgatcatca tggagacccg caagaaagat ggcacttggc ccagcgcaca 721 gctgcatgtc atgaacaaca acatgtcgga agaggatgag gacttcaagg gtcagccagg 781 catctgtggc ctcaccaatc tgggcaacac gtgcttcatg aactcggccc tgcagtgcct 841 cagcaatgtg ccacagctca ccgagtactt cctcaacaac tgctacctgg aggagctcaa 901 cttccgcaac ccactgggca tgaagggtga gatcgcagag gcctatgcag acctggtgaa 961 gcaggcgtgg tctggccacc accgctccat tgtgccacat gtgttcaaga acaaggttgg 1021 ccattttgca tcccaatttc tgggctacca gcagcatgac tctcaggagc tgctgtcatt 1081 cctcctggac gggctgcatg aggaccttaa tcgggtgaag aagaaggagt atgtggagct 1141 gtgcgatgct gctgggcgac cggatcagga ggtggcacag gaggcatggc aaaaccacaa 1201 acggcggaac gattctgtga tcgtggacac tttccacggc ctcttcaagt ccacgctggt 1261 gtgccccgat tgtggcaatg tatctgtgac cttcgacccc ttctgctacc tcagtgttcc 1321 actgcctatc agccacaaga gggtcttgga ggtcttcttt atccccatgg atccgcgccg 1381 caagccagag cagcaccggc tcgtggtccc caagaaaggc aagatctcgg atctatgtgt 1441 ggctctgtcc aaacacacgg gcatctcgcc agagaggatg atggtggctg atgtcttcag 1501 tcaccgcttc tataagctct atcagctaga ggagcctctg agcagcatct tggaccgtga 1561 tgatatcttc gtctatgagg tgtcaggtcg cattgaggcc attgagggct caagagagga 1621 catcgtggtt cctgtctacc tgcgggagcg cacccctgcc cgtgactaca acaactccta 1681 ctacggcctg atgctttttg gacaccccct cctggtatca gtgccccggg accgcttcac 1741 ctgggagggc ctgtataacg tcctgatgta ccggctctca cgctacgtga ccaaacccaa 1801 ctcagatgat gaggacgatg gggatgagaa agaagatgac gaggaggata aagatgacgt 1861 ccctgggccc tcaactgggg gcagcctccg agaccctgag ccagagcagg ctgggcccag 1921 ctctggagtc acgaacaggt gcccgttcct cctggacaat tgccttggca catctcagtg 1981 gcccccaagg cgacgacgca agcagctgtt caccctgcag acggtgaact ccaatgggac 2041 cagcgaccgc acaacctccc ctgaagaagt ccatgcccag ccgtacattg ctatcgactg 2101 ggagccagag atgaagaagc gttactatga cgaggtagag gctgagggct acgtgaagca 2161 tgactgcgtc gggtacgtga tgaagaaggc tcccgtgcgg ctgcaggagt gcattgagct 2221 cttcaccact gtggagaccc tggagaagga aaacccctgg tactgccctt cctgcaagca 2281 gcaccagctg gcaaccaaga agctggacct gtggatgctg ccggagattc tcatcatcca 2341 cctgaaacgc ttttcctaca ccaagttctc ccgagagaag ctggacaccc tcgtggagtt 2401 tcctatccgg gacctggact tctctgagtt tgtcatccag ccacagaatg agtcgaatcc 2461 ggagctgtac aaatatgacc tcatcgcggt ttccaaccat tatgggggca tgcgtgatgg 2521 acactacaca acatttgcct gcaacaagga cagcggccag tggcactact ttgatgacaa 2581 cagcgtctcc cctgtcaatg agaatcagat cgagtccaag gcagcctatg tcctcttcta 2641 ccaacgccag gacgtggcgc gacgcctgct gtccccggcc ggctcatctg gcgccccagc 2701 ctcccctgcc tgcagctccc cacccagctc tgagttcatg gatgttaatt gagagccctg 2761 ggtcctgcca cagaaaaaaa aaaaaaaaag ccctctctgc aatctcgctt ctcgtgtccg 2821 ccccgcttct cttattcgtg ttaggtgccc ccgccaggca ttgcaggctt agtcgtggct 2881 actgttctcc tgtgccgctg catcgctctc tcccgggaaa gaacaggtcg tgtctcctcc 2941 tagcagtgcg cgccccgcct gtgtttgccc ttccagcagt gaccctccct tctagtcttt 3001 atttatggtc gtgcccttcc ctctcctcag cccagagtgt tctgcgtggg tggtgatggg 3061 ggttcacctg aacacagagt gtattttctt attgaggccc tgtaccttct gctgtgtgtg 3121 tgtatatata aagcaccagt ctgctcccca aaaaaaaaac ggaattc // LOCUS HSU45285 2655 bp mRNA PRI 25-APR-1996 DEFINITION Human specific 116-kDa vacuolar proton pump subunit (OC-116KDa) mRNA, complete cds. ACCESSION U45285 NID g1245045 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2655) AUTHORS Li,Y.P., Chen,W. and Stashenko,P. TITLE Molecular cloning and characterization of a putative novel human osteoclast-specific 116-kDa vacuolar proton pump subunit JOURNAL Biochem. Biophys. Res. Commun. 218 (3), 813-821 (1996) MEDLINE 96158968 REFERENCE 2 (bases 1 to 2655) AUTHORS Li,Y.-P., Chen,W. and Stashenko,P. TITLE Direct Submission JOURNAL Submitted (09-JAN-1996) Yi-Ping Li, Cytokine Biology, Forsyth Dental Center, 140 Fenway, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..2655 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="osteoclastoma tumor" gene 58..2547 /gene="OC-116KDa" CDS 58..2547 /gene="OC-116KDa" /note="novel membrane bound protein with at least six transmembrane domains" /codon_start=1 /product="specific 116-kDa vacuolar proton pump subunit" /db_xref="PID:g1245046" /translation="MGSMFRSEEVALVQLFLPTAAAYTCVSRLGELGLVEFRDLNASV SAFQRRFVVDVWRCEELEKTFTFLQEEVRRAGLVLPPPKGRLPAPPPRDLLRIQEETE RLAQELRDVRGNQQALRAQLHQLQLHAAVLRQGHEPQLAAAHTDGASERTPLLQAPGG PHQDLRVNFVAGAVEPHKAPALERLLWRACRGFLIASFRELEQPLEHPVTGEPATWMT FLISYWGEQIGQKIRKITDCFHCHVFPFLQQEEARLGALQQLQQQSQELQEVLGETER FLSQVLGRVLQLLPPGQVQVHKMKAVYLALNQCSVSTTHKCLIAEAWCSVRDLPALQE ALRDSSMEEGVSAVAHRIPCRDMPPTLIRTNRFTASFQGIVDRYGVGRYQEVNPAPYT IITFPFLFAVMFGDVGHGLLMFLFALAMVLAENRPAVKAAQNEIWQTFFRGRYLLLLM GLFSIYTGFIYNECFSRATSIFPSGWSVAAMANQSGWSDAFLAQHTMLTLDPNVTGVF LGPYPFGIDPIWSLAANHLSFLNSFKMKMSVILGVVHMAFGVVLGVFNHVHFGQRHRL LLETLPELTFLLGLFGYLVFLVIYKWLCVWAARAASPSILIHFINMFLFSHSPSNRLL YPRQEVVQATLVVLALAMVPILLLGTPLHLLHRHRRRLRRRPADRQEENKAGLLDLPD ASVNGWSSDEEKAGGLDDEEEAELVPSEVLMHQAIHTIEFCLGCVSNTASYLRLWALS LAHAQLSEVLWAMVMRIGLGLGREVGVAAVVLVPIFAAFAVMTVAILLVMEGLSAFLH ALRLHWVEFQNKFYSGTGYKLSPFTFAATDD" BASE COUNT 460 a 885 c 824 g 486 t ORIGIN 1 cggcgtgcgc ggacgggcag ccagcagcgg aggcgcggcg cagcacaccc ggggaccatg 61 ggctccatgt tccggagcga ggaggtggcc ctggtccagc tctttctgcc cacagcggct 121 gcctacacct gcgtgagtcg gctgggcgag ctgggcctcg tggagttcag agacctcaac 181 gcctcggtga gcgccttcca gagacgcttt gtggttgatg tttggcgctg tgaggagctg 241 gagaagacct tcaccttcct gcaggaggag gtgcggcggg ctgggctggt cctgcccccg 301 ccaaagggga ggctgccggc acccccaccc cgggacctgc tgcgcatcca ggaggagacg 361 gagcgcctgg cccaggagct gcgggatgtg cggggcaacc agcaggccct gcgggcccag 421 ctgcaccagc tgcagctcca cgccgccgtg ctacgccagg gccatgaacc tcagctggca 481 gccgcccaca cagatggggc ctcagagagg acgcccctgc tccaggcccc cggggggccg 541 caccaggacc tgagggtcaa ctttgtggca ggtgccgtgg agccccacaa ggcccctgcc 601 ctagagcgcc tgctctggag ggcctgccgc ggcttcctca ttgccagctt cagggagctg 661 gagcagccgc tggagcaccc cgtgacgggc gagccagcca cgtggatgac cttcctcatc 721 tcctactggg gtgagcagat cggacagaag atccgcaaga tcacggactg cttccactgc 781 cacgtcttcc cgtttctgca gcaggaggag gcccgcctcg gggccctgca gcagctgcaa 841 cagcagagcc aggagctgca ggaggtcctc ggggagacag agcggttcct gagccaggtg 901 ctaggccggg tgctgcagct gctgccgcca gggcaggtgc aggtccacaa gatgaaggcc 961 gtgtacctgg ccctgaacca gtgcagcgtg agcaccacgc acaagtgcct cattgccgag 1021 gcctggtgct ctgtgcgaga cctgcccgcc ctgcaggagg ccctgcggga cagctcgatg 1081 gaggagggag tgagtgccgt ggctcaccgc atcccctgcc gggacatgcc ccccacactc 1141 atccgcacca accgcttcac ggccagcttc cagggcatcg tggatcgcta cggcgtgggc 1201 cgctaccagg aggtcaaccc cgctccctac accatcatca ccttcccctt cctgtttgct 1261 gtgatgttcg gggatgtggg ccacgggctg ctcatgttcc tcttcgccct ggccatggtc 1321 cttgcggaga accgaccggc tgtgaaagcc gcgcagaacg agatctggca gactttcttc 1381 aggggccgct acctgctcct gcttatgggc ctgttctcca tctacaccgg cttcatctac 1441 aacgagtgct tcagtcgcgc caccagcatc ttcccctcgg gctggagtgt ggccgccatg 1501 gccaaccagt ctggctggag tgatgcattc ctggcccagc acacgatgct taccctggat 1561 cccaacgtca ccggtgtctt cctgggaccc tacccctttg gcatcgatcc tatttggagc 1621 ctggctgcca accacttgag cttcctcaac tccttcaaga tgaagatgtc cgtcatcctg 1681 ggcgtcgtgc acatggcctt tggggtggtc ctcggagtct tcaaccacgt gcactttggc 1741 cagaggcacc ggctgctgct ggagacgctg ccggagctca ccttcctgct gggactcttc 1801 ggttacctcg tgttcctagt catctacaag tggctgtgtg tctgggctgc cagggccgcc 1861 tcgcccagca tcctcatcca cttcatcaac atgttcctct tctcccacag ccccagcaac 1921 aggctgctct acccccggca ggaggtggtc caggccacgc tggtggtcct ggccttggcc 1981 atggtgccca tcctgctgct tggcacaccc ctgcacctgc tgcaccgcca ccgccgccgc 2041 ctgcggagga ggcccgctga ccgacaggag gaaaacaagg ccgggttgct ggacctgcct 2101 gacgcatctg tgaatggctg gagctccgat gaggaaaagg cagggggcct ggatgatgaa 2161 gaggaggccg agctcgtccc ctccgaggtg ctcatgcacc aggccatcca caccatcgag 2221 ttctgcctgg gctgcgtctc caacaccgcc tcctacctgc gcctgtgggc cctgagcctg 2281 gcccacgccc agctgtccga ggttctgtgg gccatggtga tgcgcatagg cctgggcctg 2341 ggccgggagg tgggcgtggc ggctgtggtg ctggtcccca tctttgccgc ctttgccgtg 2401 atgaccgtgg ctatcctgct ggtgatggag ggactctcag ccttcctgca cgccctgcgg 2461 ctgcactggg tggaattcca gaacaagttc tactcaggca cgggctacaa gctgagtccc 2521 ttcaccttcg ctgccacaga tgactagggc ccactgcagg tcctgccaga cctccttcct 2581 gacctctgag gcaggagagg aataaagacg gtccgccctg gcaaaaaaaa aaaaaaaaaa 2641 aaaaaaaaaa aaaaa // LOCUS HSU45328 1203 bp mRNA PRI 02-FEB-1996 DEFINITION Human ubiquitin-conjugating enzyme (UBE2I) mRNA, complete cds. ACCESSION U45328 NID g1172223 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1203) AUTHORS Tachibana,M., Iwata,N., Watanabe,A., Nobukuni,Y., Ploplis,B. and Kajigaya,S. TITLE Assignment of the gene for a ubiquitin-conjugating enzyme (UBE2I) to human chromosome band 16p13.3 by in situ hybridization JOURNAL Unpublished REFERENCE 2 (bases 1 to 1203) AUTHORS Iwata,N., Nobukuni,Y., Takeda,K., Ploplis,B., Kajigaya,S. and Tachibana,M. TITLE Interaction of a transcription factor, MITF, and a ubiquitin conjugating enzyme as detected by yeast two hybrid system JOURNAL Unpublished REFERENCE 3 (bases 1 to 1203) AUTHORS Tachibana,M. TITLE Direct Submission JOURNAL Submitted (09-JAN-1996) Masayoshi Tachibana, Laboratory of Molecular Genetics, National Institute on Deafness and Other Communication Disorders, 5 Reseach Court, Rockville, MD 20850, USA FEATURES Location/Qualifiers source 1..1203 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" /map="16p13.3" /cell_line="HeLa" 5'UTR 1..123 /gene="UBE2I" gene 1..1203 /gene="UBE2I" CDS 124..600 /gene="UBE2I" /note="similar to S. pombe ubiquitin-conjugating enzyme: SwissProt Accession Number P40984 and S. cerevisiae ubiquitin-conjugating enzyme encoded by EMBL Accession Number X82538" /codon_start=1 /product="ubiquitin-conjugating enzyme" /db_xref="PID:g1172224" /translation="MSGIALSRLAQERKAWRKDHPFGFVAVPTKNPDGTMNLMNWECA IPGKKGTPWEGGLFKLRMLFKDDYPSSPPKCKFEPPLFHPNVYPSGTVCLSILEEDKD WRPAITIKQILLGIQELLNEPNIQDPAQAEAYTIYCQNRVEYEKRVRAQAKKFAPS" 3'UTR 601..1203 /gene="UBE2I" BASE COUNT 296 a 310 c 305 g 292 t ORIGIN 1 gcgcggagcg ggctccggag ggaagtcccg agacaaaggg aagcgccgcc gccgccgccc 61 cgctcggtcc tccacctgtc cgctacgctc gccggggctg cggccgcccg agggactttg 121 aacatgtcgg ggatcgccct cagcagactc gcccaggaga ggaaagcatg gaggaaagac 181 cacccatttg gtttcgtggc tgtcccaaca aaaaatcccg atggcacgat gaacctcatg 241 aactgggagt gcgccattcc aggaaagaaa gggactccgt gggaaggagg cttgtttaaa 301 ctacggatgc ttttcaaaga tgattatcca tcttcgccac caaaatgtaa attcgaacca 361 ccattatttc acccgaatgt gtacccttcg gggacagtgt gcctgtccat cttagaggag 421 gacaaggact ggaggccagc catcacaatc aaacagatcc tattaggaat acaggaactt 481 ctaaatgaac caaatatcca agacccagct caagcagagg cctacacgat ttactgccaa 541 aacagagtgg agtacgagaa aagggtccga gcacaagcca agaagtttgc gccctcataa 601 gcagcgacct tgtggcatcg tcagaaggaa gggattggtt tggcaagaac ttgtttacaa 661 catttttgca aatctaaagt tgctccatac aatgactagt cacctggggg ggttgggcgg 721 gcgccatctt ccattgccgc cgcgggtgtg cggtctcgat tcgctgaatt gcccgtttcc 781 atacagggtc tcttccttcg gtcttttgta tttttgattg ttatgtaaaa ctcgctttta 841 ttttaatatt gatgtcagta tttcaactgc tgtaaaatta taaactttta tacttgggta 901 agtcccccag gggcgagttc ctcgctctgg gatgcaggca tgcttctcac cgtgcagagc 961 tgcacttggc ctcagctggc tgtatggaaa tgcaccctcc ctcctgccgc tcctctctag 1021 aaccttctag aacctgggct gtgctgcttt tgagcctcag accccagggc agcatctcgg 1081 ttctgcgcca cttcctttgt gtttatatgg cgttttgtct gtgttgctgt ttagagtaaa 1141 taaactgttt atataaaaaa aaaaaaaaaa aactcgaggg ggggcccggt acccaattcg 1201 ccc // LOCUS HSU45878 2916 bp mRNA PRI 16-FEB-1996 DEFINITION Human inhibitor of apoptosis protein 1 mRNA, complete cds. ACCESSION U45878 NID g1184315 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2916) AUTHORS Liston,P., Roy,N., Tamai,K., Lefebvre,C., Baird,S., Cherton-Horvat,G., Farahani,R., McLean,M., Ikeda,J., MacKenzie,A. and Korneluk,R.G. TITLE Suppression of apoptosis in mammalian cells by NAIP and a related family of IAP genes JOURNAL Nature 379 (6563), 349-353 (1996) MEDLINE 96149249 REFERENCE 2 (bases 1 to 2916) AUTHORS Baird,S.D. TITLE Direct Submission JOURNAL Submitted (16-JAN-1996) Stephen D. Baird, Children's Hospital of Eastern Ontario, Genetics, 401 Smyth Rd., Ottawa, Ontario, K1H 8L1, Canada FEATURES Location/Qualifiers source 1..2916 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda UniZap human liver (Stratagene)" /tissue_type="liver" CDS 449..2263 /note="HIAP-1" /codon_start=1 /function="inhibition of apoptosis" /evidence=experimental /product="inhibitor of apoptosis protein 1" /db_xref="PID:g1184316" /translation="MNIVENSIFLSNLMKSANTFELKYDLSCELYRMSTYSTFPAGVP VSERSLARAGFYYTGVNDKVKCFCCGLMLDNWKRGDSPTEKHKKLYPSCRFVQSLNSV NNLEATSQPTFPSSVTHSTHSLLPGTENSGYFRGSYSNSPSNPVNSRANQEFSALMRS SYPCPMNNENARLLTFQTWPLTFLSPTDLARAGFYYIGPGDRVACFACGGKLSNWEPK DNAMSEHLRHFPKCPFIENQLQDTSRYTVSNLSMQTHAARFKTFFNWPSSVLVNPEQL ASAGFYYVGNSDDVKCFCCDGGLRCWESGDDPWVQHAKWFPRCEYLIRIKGQEFIRQV QASYPHLLEQLLSTSDSPGDENAESSIIHLEPGEDHSEDAIMMNTPVINAAVEMGFSR SLVKQTVQRKILATGENYRLVNDLVLDLLNAEDEIREEERERATEEKESNDLLLIRKN RMALFQHLTCVIPILDSLLTAGIINEQEHDVIKQKTQTSLQARELIDTILVKGNIAAT VFRNSLQEAEAVLYEHLFVQQDIKYIPTEDVSDLPVEEQLRRLPEERTCKVCMDKEVS IVFIPCGHLVVCKDCAPSLRKCPICRSTIKGTVRTFLS" misc_feature 533..733 /note="encodes BIR1 (Baculovirus IAP Repeat)" misc_feature 953..1153 /note="encodes BIR2" misc_feature 1211..1411 /note="encodes BIR3" misc_feature 2117..2221 /note="encodes Ring Zinc Finger" BASE COUNT 956 a 532 c 555 g 862 t 11 others ORIGIN 1 agcagagctt tcccnccatg nnagaagctt catgagtcac acattacatc tttgggttga 61 ttgaatgcca ctgaaacatt tctagtagcc tggagnagtt gacctacctg tggagatgcc 121 tgccattaaa tggcatcctg atggcttaat acacatcact cttctgtgna gggttttaat 181 tttcaacaca gcttactctg tagcatcatg tttacattgt atgtataaag attatacnaa 241 ggtgcaattg tgtatttctt ccttaaaatg tatcagtata ggatttagaa tctccatgtt 301 gaaactctaa atgcatagaa ataaaaataa taaaaaattt ttcattttgc cttttcagcc 361 tagtattaaa actgataaaa gcaaagccat gcacaaaact acctccctag agaaaggcta 421 gtcccttttc ttccccattc atttcattat gaacatagta gaaaacagca tattcttatc 481 aaatttgatg aaaagcgcca acacgtttga actgaaatac gacttgtcat gtgaactgta 541 ccgaatgtct acgtattcca cttttcctgc tggggttcct gtctcagaaa ggagtcttgc 601 tcgtgctggt ttctattaca ctggtgtgaa tgacaaggtc aaatgcttct gttgtggcct 661 gatgctggat aactggaaaa gaggagacag tcctactgaa aagcataaaa agttgtatcc 721 tagctgcaga ttcgttcaga gtctaaattc cgttaacaac ttggaagcta cctctcagcc 781 tacttttcct tcttcagtaa cacattccac acactcatta cttccgggta cagaaaacag 841 tggatatttc cgtggctctt attcaaactc tccatcaaat cctgtaaact ccagagcaaa 901 tcaagaattt tctgccttga tgagaagttc ctacccctgt ccaatgaata acgaaaatgc 961 cagattactt acttttcaga catggccatt gacttttctg tcgccaacag atctggcacg 1021 agcaggcttt tactacatag gacctggaga cagagtggct tgctttgcct gtggtggaaa 1081 attgagcaat tgggaaccga aggataatgc tatgtcagaa cacctgagac attttcccaa 1141 atgcccattt atagaaaatc agcttcaaga cacttcaaga tacacagttt ctaatctgag 1201 catgcagaca catgcagccc gctttaaaac attctttaac tggccctcta gtgttctagt 1261 taatcctgag cagcttgcaa gtgcgggttt ttattatgtg ggtaacagtg atgatgtcaa 1321 atgcttttgc tgtgatggtg gactcaggtg ttgggaatct ggagatgatc catgggttca 1381 acatgccaag tggtttccaa ggtgtgagta cttgataaga attaaaggac aggagttcat 1441 ccgtcaagtt caagccagtt accctcatct acttgaacag ctgctatcca catcagacag 1501 cccaggagat gaaaatgcag agtcatcaat tatccatttg gaacctggag aagaccattc 1561 agaagatgca atcatgatga atactcctgt gattaatgct gccgtggaaa tgggctttag 1621 tagaagcctg gtaaaacaga cagttcagag aaaaatccta gcaactggag agaattatag 1681 actagtcaat gatcttgtgt tagacttact caatgcagaa gatgaaataa gggaagagga 1741 gagagaaaga gcaactgagg aaaaagaatc aaatgattta ttattaatcc ggaagaatag 1801 aatggcactt tttcaacatt tgacttgtgt aattccaatc ctggatagtc tactaactgc 1861 cggaattatt aatgaacaag aacatgatgt tattaaacag aagacacaga cgtctttaca 1921 agcaagagaa ctgattgata cgattttagt aaaaggaaat attgcagcca ctgtattcag 1981 aaactctctg caagaagctg aagctgtgtt atatgagcat ttatttgtgc aacaggacat 2041 aaaatatatt cccacagaag atgtttcaga tctaccagtg gaagaacaat tgcggagact 2101 accagaagaa agaacatgta aagtgtgtat ggacaaagaa gtgtccatag tgtttattcc 2161 ttgtggtcat ctagtagtat gcaaagattg tgctccttct ttaagaaagt gtcctatttg 2221 taggagtaca atcaagggta cagttcgtac atttctttca tgaagaagaa ccaaaacatc 2281 gtctaaactt tagaattaat ttattaaatg tattataact ttaactttta tcctaatttg 2341 gtttccttaa aatttttatt tatttacaac tcaaaaaaca ttgttttgtg taacatattt 2401 atatatgtat ctaaaccata tgaacatata ttttttagaa actaagagaa tgataggctt 2461 ttgttcttat gaacgaaaaa gaggtagcac tacaaacaca atattcaatc caaatttcag 2521 cattattgaa attgtaagtg aagtaaaact taagatattt gagttaacct ttaagaattt 2581 taaatatttt ggcattgtac taataccggg aacatgaagc caggtgtggt ggtatgtacc 2641 tgtagtccca ggctgaggca agagaattac ttgagcccag gagtttgaat ccatcctggg 2701 cagcatactg agaccctgcc tttaaaaacn aacagnacca aanccaaaca ccagggacac 2761 atttctctgt cttttttgat cagtgtccta tacatcgaag gtgtgcatat atgttgaatc 2821 acattttagg gacatggtgt ttttataaag aattctgtga gnaaaaattt aataaagcaa 2881 ccnaaattac tcttaaaaaa aaaaaaaaaa aaaaaa // LOCUS HSU45880 2540 bp mRNA PRI 16-FEB-1996 DEFINITION Human X-linked inhibitor of apotosis protein XIAP mRNA, complete cds. ACCESSION U45880 NID g1184319 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2540) AUTHORS Liston,P., Roy,N., Tamai,K., Lefebvre,C., Baird,S., Cherton-Horvat,G., Farahani,R., McLean,M., Ikeda,J., MacKenzie,A. and Korneluk,R.G. TITLE Suppression of apoptosis in mammalian cells by NAIP and a related family of IAP genes JOURNAL Nature 379 (6563), 349-353 (1996) MEDLINE 96149249 REFERENCE 2 (bases 1 to 2540) AUTHORS Baird,S.D. TITLE Direct Submission JOURNAL Submitted (16-JAN-1996) Stephen D. Baird, Children's Hospital of Eastern Ontario, Genetics, 401 Smyth Rd., Ottawa, Ontario, K1H 8L1, Canada FEATURES Location/Qualifiers source 1..2540 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Stratagene lambdaZap-II human fetal brain" /map="Xq24-25" /tissue_type="brain" /dev_stage="fetal" CDS 34..1527 /note="XIAP" /codon_start=1 /function="inhibition of apoptosis" /evidence=experimental /product="X-linked inhibitor of apotosis protein" /db_xref="PID:g1184320" /translation="MTFNSFEGSKTCVPADINKEEEFVEEFNRLKTFANFPSGSPVSA STLARAGFLYTGEGDTVRCFSCHAAVDRWQYGDSAVGRHRKVSPNCRFINGFYLENSA TQSTNSGIQNGQYKVENYLGSRDHFALDRPSETHADYLLRTGQVVDISDTIYPRNPAM YCEEARLKSFQNWPDYAHLTPRELASAGLYYTGIGDQVQCFCCGGKLKNWEPCDRAWS EHRRHFPNCFFVLGRNLNIRSESDAVSSDRNFPNSTNLPRNPSMADYEARIFTFGTWI YSVNKEQLARAGFYALGEGDKVKCFHCGGGLTDWKPSEDPWEQHAKWYPGCKYLLEQK GQEYINNIHLTHSLEECLVRTTEKTPSLTRRIDDTIFQNPMVQEAIRMGFSFKDIKKI MEEKIQISGSNYKSLEVLVADLVNAQKDSMQDESSQTSLQKEISTEEQLRRLQEEKLC KICMDRNIAIVFVPCGHLVTCKQCAEAVDKCPMCYTVITFKQKIFMS" misc_feature 108..309 /note="encodes BIR1 (Baculovirus IAP Repeat)" misc_feature 520..723 /note="encodes BIR2" misc_feature 826..1020 /note="encodes BIR3" misc_feature 1381..1485 /note="encodes Ring Zinc Finger" BASE COUNT 781 a 415 c 571 g 773 t ORIGIN 1 gaaaaggtgg acaagtccta ttttcaagag aagatgactt ttaacagttt tgaaggatct 61 aaaacttgtg tacctgcaga catcaataag gaagaagaat ttgtagaaga gtttaataga 121 ttaaaaactt ttgctaattt tccaagtggt agtcctgttt cagcatcaac actggcacga 181 gcagggtttc tttatactgg tgaaggagat accgtgcggt gctttagttg tcatgcagct 241 gtagatagat ggcaatatgg agactcagca gttggaagac acaggaaagt atccccaaat 301 tgcagattta tcaacggctt ttatcttgaa aatagtgcca cgcagtctac aaattctggt 361 atccagaatg gtcagtacaa agttgaaaac tatctgggaa gcagagatca ttttgcctta 421 gacaggccat ctgagacaca tgcagactat cttttgagaa ctgggcaggt tgtagatata 481 tcagacacca tatacccgag gaaccctgcc atgtattgtg aagaagctag attaaagtcc 541 tttcagaact ggccagacta tgctcaccta accccaagag agttagcaag tgctggactc 601 tactacacag gtattggtga ccaagtgcag tgcttttgtt gtggtggaaa actgaaaaat 661 tgggaacctt gtgatcgtgc ctggtcagaa cacaggcgac actttcctaa ttgcttcttt 721 gttttgggcc ggaatcttaa tattcgaagt gaatctgatg ctgtgagttc tgataggaat 781 ttcccaaatt caacaaatct tccaagaaat ccatccatgg cagattatga agcacggatc 841 tttacttttg ggacatggat atactcagtt aacaaggagc agcttgcaag agctggattt 901 tatgctttag gtgaaggtga taaagtaaag tgctttcact gtggaggagg gctaactgat 961 tggaagccca gtgaagaccc ttgggaacaa catgctaaat ggtatccagg gtgcaaatat 1021 ctgttagaac agaagggaca agaatatata aacaatattc atttaactca ttcacttgag 1081 gagtgtctgg taagaactac tgagaaaaca ccatcactaa ctagaagaat tgatgatacc 1141 atcttccaaa atcctatggt acaagaagct atacgaatgg ggttcagttt caaggacatt 1201 aagaaaataa tggaggaaaa aattcagata tctgggagca actataaatc acttgaggtt 1261 ctggttgcag atctagtgaa tgctcagaaa gacagtatgc aagatgagtc aagtcagact 1321 tcattacaga aagagattag tactgaagag cagctaaggc gcctgcaaga ggagaagctt 1381 tgcaaaatct gtatggatag aaatattgct atcgtttttg ttccttgtgg acatctagtc 1441 acttgtaaac aatgtgctga agcagttgac aagtgtccca tgtgctacac agtcattact 1501 ttcaagcaaa aaatttttat gtcttaatct aactctatag taggcatgtt atgttgttct 1561 tattaccctg attgaatgtg tgatgtgaac tgactttaag taatcaggat tgaattccat 1621 tagcatttgc taccaagtag gaaaaaaaat gtacatggca gtgttttagt tggcaatata 1681 atctttgaat ttcttgattt ttcagggtat tagctgtatt atccattttt tttactgtta 1741 tttaattgaa accatagact aagaataaga agcatcatac tataactgaa cacaatgtgt 1801 attcatagta tactgattta atttctaagt gtaagtgaat taatcatctg gattttttat 1861 tcttttcaga taggcttaac aaatggagct ttctgtatat aaatgtggag attagagtta 1921 atctccccaa tcacataatt tgttttgtgt gaaaaaggaa taaattgttc catgctggtg 1981 gaaagataga gattgttttt agaggttggt tgttgtgttt taggattctg tccattttct 2041 tgtaaaggga taaacacgga cgtgtgcgaa atatgtttgt aaagtgattt gccattgttg 2101 aaagcgtatt taatgataga atactatcga gccaacatgt actgacatgg aaagatgtca 2161 gagatatgtt aagtgtaaaa tgcaagtggc gggacactat gtatagtctg agccagatca 2221 aagtatgtat gttgttaata tgcatagaac gagagatttg gaaagatata caccaaactg 2281 ttaaatgtgg tttctcttcg gggagggggg gattggggga ggggccccag aggggtttta 2341 gaggggcctt ttcactttcg acttttttca ttttgttctg ttcggatttt ttataagtat 2401 gtagaccccg aagggtttta tgggaactaa catcagtaac ctaacccccg tgactatcct 2461 gtgctcttcc tagggagctg tgttgtttcc cacccaccac ccttccctct gaacaaatgc 2521 ctgagtgctg gggcactttg // LOCUS HSU45976 2343 bp mRNA PRI 11-SEP-1996 DEFINITION Human clathrin assembly protein lymphoid myeloid leukemia (CALM) mRNA, complete cds. ACCESSION U45976 NID g1373145 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2343) AUTHORS Dreyling,M.H., Martinez-Climent,J.A., Zheng,M., Mao,J., Rowley,J.D. and Bohlander,S.K. TITLE The t(10;11)(p13;q14) in the U937 cell line results in the fusion of the AF10 gene and CALM, encoding a new member of the AP-3 clathrin assembly protein family JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (10), 4804-4809 (1996) MEDLINE 96209813 REFERENCE 2 (bases 1 to 2343) AUTHORS Bohlander,S.K., Dreyling,M.H. and Rowley,J.D. TITLE Direct Submission JOURNAL Submitted (11-JAN-1996) Stefan K. Bohlander, Medicine, University of Chicago, 5841 S. Maryland, Chicago, IL 60637, USA FEATURES Location/Qualifiers source 1..2343 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11q14" gene 148..2106 /gene="CALM" CDS 148..2106 /gene="CALM" /standard_name="clathrin assembly protein lymphoid myeloid leukemia" /codon_start=1 /product="CALM" /db_xref="PID:g1373146" /translation="MSGQSLTDRITAAQHSVTGSAVSKTVCKATTHEIMGPKKKHLDY LIQCTNEMNVNIPQLADSLFERTTNSSWVVVFKSLITTHHLMVYGNERFIQYLASRNT LFNLSNFLDKSGLQGYDMSTFIRRYSRYLNEKAVSYRQVAFDFTKVKRGADGVMRTMN TEKLLKTVPIIQNQMDALLDFNVNSNELTNGVINAAFMLLFKDAIRLFAAYHEGIINL LEKYFDMKKNQCKEGLDIYKKFLTRMTRISEFLKVAEQVGIDRGDIPDLSQAPSSLLD ALEQHLASLEGKKIKDSTAASRATTLSNAVSSLASTGLSLTKVDEREKQAALEEEQAR LKALKEQRLKELAKKPHTSLTTAASPVSTSAGGIMTAPAIDIFSTPSSSNSTSKLPND LLDLQQPTFHPSVHPMSTASQVASTWGDPFSATVDAVDDAIPSLNPFLTKSSGDVHLS ISSDVSTFTTRTPTHEMFVGFTPSPVAQPHPSAGLNVDFESVFGNKSTNVIVDSGGFD ELGGLLKPTVASQNQNLPVAKLPPSKLVSDDLDSSLANLVGNLGIGNGTTKNDVNWSQ PGEKKLTGGSNCEPKVAPTTAWNAATMAPPVMAYPATTPTGMIGYGIPPQMGSVPVMT QPTLIYSQPVMRPPNPFGPVSGAQIQFM" exon 1405..1554 /gene="CALM" /note="alternatively spliced exon" exon 1658..1678 /gene="CALM" /note="alternatively spliced exon" BASE COUNT 695 a 511 c 523 g 610 t 4 others ORIGIN 1 gcgcggcccc gaaccgccgc caggccggca cgggggaagg agccggtggg ggtagggggt 61 gcggtggggg gtggggaccc tccggctctt gggggtccca gtccccgccg gctgctgagc 121 gggtggggtg gtggaggagc tgcagagatg tccggccaga gcctgacgga ccgaatcact 181 gccgcccagc acagtgtcac cggctctgcc gtatccaaga cagtatgcaa ggccacgacc 241 cacgagatca tggggcccaa gaaaaagcac ctggactact taattcagtg cacaaatgag 301 atgaatgtga acatcccaca gttggcagac agtttatttg aaagaactac taatagtagt 361 tgggtggtgg tcttcaaatc tctcattaca actcatcatt tgatggtgta tggaaatgag 421 cgttttattc agtatttggc ttcaagaaac acgttgttta acttaagcaa ttttttggat 481 aaaagtggat tgcaaggata tgacatgtct acatttatta ggcggtatag tagatattta 541 aatgagaaag cagtttcata cagacaagtt gcatttgatt tcacaaaagt gaagagaggg 601 gctgatggag ttatgagaac aatgaacaca gaaaaactcc taaaaactgt accaattatt 661 cagaatcaaa tggatgcact tcttgatttt aatgttaata gcaatgaact tacaaatggg 721 gtaataaatg ctgccttcat gctcctgttc aaagatgcca ttagactgtt tgcagcatac 781 catgaaggaa ttattaattt gttggaaaaa tattttgata tgaaaaagaa ccaatgcaaa 841 gaaggtcttg acatctataa gaagttccta actaggatga caagaatctc agagttcctc 901 aaagttgcag agcaagttgg aattgacaga ggtgatatac cagacctttc acaggcccct 961 agcagtcttc ttgatgcttt ggaacaacat ttagcttcct tggaaggaaa gaaaatcaaa 1021 gattctacag ctgcaagcag ggcaactaca ctttccaatg cagtgtcttc cctggcaagc 1081 actggtctat ctctgaccaa agtggatgaa agggaaaagc aggcagcatt agaggaagaa 1141 caggcacgtt tgaaagcttt aaaggaacag cgcctaaaag aacttgcaaa gaaacctcat 1201 acctctttaa caactgcagc ctctcctgta tccacctcag caggagggat aatgactgca 1261 ccagccattg acatattttc tacccctagt tcttctaaca gcacatcaaa gctgcccaat 1321 gatctgcttg atttgcagca gccaactttt cacccatctg tacatcctat gtcaactgct 1381 tctcaggtag caagtacatg gggagatcct ttctctgcta ctgtagatgc tgttgatgat 1441 gccattccaa gcttaaatcc tttcctcaca aaaagtagtg gtgatgttca cctttccatt 1501 tcttcagatg tatctacttt tactactagg acacctactc atgaaatgtt tgttggattc 1561 actccttctc cagttgcaca gccacaccct tcagctggcc ttaatgttga ctttgaatct 1621 gtgtttggaa ataaatctac aaatgttatt gtagattctg ggggctttga tgaactaggt 1681 ggacttctca aaccaacagt ggcctctcag aaccagaacc ttcctgttgc caaactccca 1741 cctagcaagt tagtatctga tgacttggat tcatctttag ccaaccttgt gggcaatctt 1801 ggcatcggaa atggaaccac taagaatgat gtaaattgga gtcaaccagg tgaaaagaag 1861 ttaactgggg gatctaactg cgaaccaaag gttgcaccaa caaccgcttg gaatgctgca 1921 acaatggcac cccctgtaat ggcctatcct gctactacac caacaggcat gataggatat 1981 ggaattcctc cacaaatggg aagtgttcct gtaatgacgc aaccaacctt aatatacagc 2041 cagcctgtca tgagacctcc aaaccccttt ggccctgtat caggagcaca gatacagttt 2101 atgtaacttg atggaagaaa atggaattac tccaaaaaga caagtgctca agcagcaaaa 2161 tccttacttc cagcaaaatc caaactgctg tctcttaaat ctcttaaact ctcttcttcc 2221 attaggatgc tacaagtanc tcagtgaagg cccatgaagg gaattgggga ctagtttata 2281 gggngaacgt attcattaca gtttataaag gccaggattg gnttggattt taggattang 2341 ttc // LOCUS HSU45982 2577 bp DNA PRI 02-APR-1996 DEFINITION Human G protein-coupled receptor GPR-9-6 gene, complete cds. ACCESSION U45982 NID g1245054 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2577) AUTHORS Lautens,L.L., Tiffany,H.L., Gao,J.-L., Modi,W., Murphy,P.M. and Bonner,T.I. TITLE Cloning, Tissue Distribution and Chromosomal Localization of two potential G-Protein-Linked Chemokine Receptors JOURNAL Unpublished REFERENCE 2 (bases 1 to 2577) AUTHORS Bonner,T.I. TITLE Direct Submission JOURNAL Submitted (16-JAN-1996) Tom I. Bonner, Lab of Cell Biology, NIMH, Bldg 36, Rm 3A-17, MSC 4090, Bethesda, MD 20892-4090, USA FEATURES Location/Qualifiers source 1..2577 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="3" /map="3p21.3-22" CDS 58..1131 /note="G protein-coupled receptor" /codon_start=1 /product="GPR-9-6" /db_xref="PID:g1245055" /translation="MADDYGSESTSSMEDYVNFNFTDFYCEKNNVRQFASHFLPPLYW LVFIVGALGNSLVILVYWYCTRVKTMTDMFLLNLAIADLLFLVTLPFWAIAAADQWKF QTFMCKVVNSMYKMNFYSCVLLIMCISVDRYIAIAQAMRAHTWREKRLLYSKMVCFTI WVLAAALCIPEILYSQIKEESGIAICTMVYPSDESTKLKSAVLTLKVILGFFLPFVVM ACCYTIIIHTLIQAKKSSKHKALKVTITVLTVFVLSQFPYNCILLVQTIDAYAMFISN CAVSTNIDICFQVTQTIAFFHSCLNPVLYVFVGERFRRDLVKTLKNLGCISQAQWVSF TRREGSLKLSSMLLETTSGALSL" BASE COUNT 628 a 613 c 574 g 762 t ORIGIN 1 aatattttcc ttgacctaat gccatcttgt gtccccttgc agagccctat tcctaacatg 61 gctgatgact atggctctga atccacatct tccatggaag actacgttaa cttcaacttc 121 actgacttct actgtgagaa aaacaatgtc aggcagtttg cgagccattt cctcccaccc 181 ttgtactggc tcgtgttcat cgtgggtgcc ttgggcaaca gtcttgttat ccttgtctac 241 tggtactgca caagagtgaa gaccatgacc gacatgttcc ttttgaattt ggcaattgct 301 gacctcctct ttcttgtcac tcttcccttc tgggccattg ctgctgctga ccagtggaag 361 ttccagacct tcatgtgcaa ggtggtcaac agcatgtaca agatgaactt ctacagctgt 421 gtgttgctga tcatgtgcat cagcgtggac aggtacattg ccattgccca ggccatgaga 481 gcacatactt ggagggagaa aaggcttttg tacagcaaaa tggtttgctt taccatctgg 541 gtattggcag ctgctctctg catcccagaa atcttataca gccaaatcaa ggaggaatcc 601 ggcattgcta tctgcaccat ggtttaccct agcgatgaga gcaccaaact gaagtcagct 661 gtcttgaccc tgaaggtcat tctggggttc ttccttccct tcgtggtcat ggcttgctgc 721 tataccatca tcattcacac cctgatacaa gccaagaagt cttccaagca caaagcccta 781 aaagtgacca tcactgtcct gaccgtcttt gtcttgtctc agtttcccta caactgcatt 841 ttgttggtgc agaccattga cgcctatgcc atgttcatct ccaactgtgc cgtttccacc 901 aacattgaca tctgcttcca ggtcacccag accatcgcct tcttccacag ttgcctgaac 961 cctgttctct atgtttttgt gggtgagaga ttccgccggg atctcgtgaa aaccctgaag 1021 aacttgggtt gcatcagcca ggcccagtgg gtttcattta caaggagaga gggaagcttg 1081 aagctgtcgt ctatgttgct ggagacaacc tcaggagcac tctccctctg aggggtcttc 1141 tctgaggtgc atggttcttt tggaagaaat gagaaataca tgaaacagtt tccccactga 1201 tgggaccaga gagagtgaaa gagaaaagaa aactcagaaa gggatgaatc tgaactatat 1261 gattacttgt agtcagaatt tgccaaagca aatatttcaa aatcaactga ctagtgcagg 1321 aggctgttga ttggctcttg actgtgatgc ccgcaattct caaaggagga ctaaggaccg 1381 gcactgtgga gcaccctggc tttgccactc gccggagcat caatgccgct gcctctggag 1441 gagcccttgg attttctcca tgcactgtga acttctgtgg cttcagttct catgctgcct 1501 cttccaaaag gggacacaga agcactggct gctgctacag accgcaaaag cagaaagttt 1561 cgtgaaaatg tccatctttg ggaaattttc taccctgctc ttgagcctga taacccatgc 1621 caggtcttat agattcctga tctagaacct ttccaggcaa tctcagacct aatttccttc 1681 tgttctcctt gttctgttct gggccagtga aggtccttgt tctgattttg aaacgatctg 1741 caggtcttgc cagtgaaccc ctggacaact gaccacaccc acaaggcatc caaagtctgt 1801 tggcttccaa tccatttctg tgtcctgctg gaggttttaa cctagacaag gattccgctt 1861 attccttggt atggtgacag tgtctctcca tggcctgagc agggagatta taacagctgg 1921 gttcgcagga gccagccttg gccctgttgt aggcttgttc tgttgagtgg cacttgcttt 1981 gggtccaccg tctgtctgct ccctagaaaa tgggctggtt cttttggccc tcttctttct 2041 gaggcccact ttattctgag gaatacagtg agcagatatg ggcagcagcc aggtagggca 2101 aaggggtgaa gcgcaggcct tgctggaagg ctatttactt ccatgcttct ccttttctta 2161 ctctatagtg gcaacatttt aaaagctttt aacttagaga ttaggctgaa aaaaataagt 2221 aatggaattc acctttgcat cttttgtgtc tttcttatca tgatttggca aaatgcatca 2281 cctttgaaaa tatttcacat attggaaaag tgctttttaa tgtgtatatg aagcattaat 2341 tacttgtcac tttctttacc ctgtctcaat attttaagtg tgtgcaatta aagatcaaat 2401 agatacatta agagtgtgaa ggctggtctg aaggtagtga gctatctcaa tcggattgtt 2461 cacactcagt tacagattga actccttgtt ctacttccct gcttctctct actgcaattg 2521 actagtcttt aaaaaaaagt gtgaagagta agcaataggg ataaggaaat aagatct // LOCUS HSU46023 4599 bp mRNA PRI 20-JUN-1996 DEFINITION Human Xq28 mRNA, complete cds. ACCESSION U46023 NID g1378037 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4599) AUTHORS Laporte,J., Hu,L.J., Kretz,C., Mandel,J.L., Kioschis,P., Coy,J.F., Klauck,S.M., Poustka,A. and Dahl,N. TITLE A gene mutated in X-linked myotubular myopathy defines a new putative tyrosine phosphatase family conserved in yeast JOURNAL Nature Genet. 13 (2), 175-182 (1996) MEDLINE 96225444 REFERENCE 2 (bases 1 to 4599) AUTHORS Laporte,J., Kioschis,P., Hu,L.J., Poutska,A., Dahl,N. and Mandel,J.L. TITLE Direct Submission JOURNAL Submitted (12-JAN-1996) Jocelyn Laporte, Human Genetics, I.G.B.M.C., 1, rue Laurent Fries - BP 163, Illkirch, 67404, France FEATURES Location/Qualifiers source 1..4599 /organism="Homo sapiens" /note="F18, XAP80" /db_xref="taxon:9606" /chromosome="X" /map="Xq28;between DXS304 and DXS455" CDS 284..2389 /note="orf" /codon_start=1 /db_xref="PID:g1378038" /translation="MADGGYPNKIKRPCLEDVTLAMGPGAHPSTACAELQVPPLTINP SPAAMGVAGQSLLLENNPMNGNIMGSPFVVPQTTEVGLKGPTVPYYEKINSVPAVDQE LQELLEELTKIQDPSPNELDLEKILGTKPEEPLVLDHPQATLSTTPKPSVQMSHLESL ASSKEFASSCSQVTGMSLQIPSSSTGISYSIPSTSKQIVSPSSSMAQSKSQVQAMLPV ALPPLPVPQWHHAHQLKALAASKQGSATKQQGPTPSWSGLPPPGLSPPSRPVPSLQPP PLPLPPPPPPFSPQSLMVSCMSSNTLSGSTLRGSPNALLSSMTSSSNAALGPAMPYAP EKLPSPALTQQPQFGPQSSILANLMSSTIKTPQGHLMSALPASNPGPSPPYRPEKLSS PGLPQQSFTPQCSLIRSLTPTSNLLSQQQQQQQQQQQANVIFKPISSNSSKTLSMIMQ QGMASSSPGATEPFTFGNTKPLSHFVSEPGPQKMPSMPTTSRQPSLLHYLQQPTPTQA SSATASSTATATLQLQQQQQQQQQQPDHSSFLLQQMMQQPQRFQRSVASDSMPALPRQ VCCHLFAWTSAASSVKPQHQHGNSFTSRQDPQPGDVSPSNITHVDKACKLGEARHPQV SLGRQPPSCQALGSESFLPGSSFAHELARVTSSYSTSEAAPWGSWDPKAWRQVPAPLL PSCDATARGTEIRSYGNDP" BASE COUNT 1093 a 1366 c 1074 g 1066 t ORIGIN 1 ccctgtgtct aggtcgtttg ggaaacgcct tggagagtca agaataaatt tgcaggtcaa 61 acaatggatg actggaaaag tcggcttgta atcaagagca tgcttcccca tttcgccatg 121 gtgggaaatc gtcaggagcc cagaaagctc caggaatcgg gaaagaagcc ctcgtggatg 181 gaggaagaag atttatcttt tctctacaag agcagcccag gaagaaagca tcaggggaac 241 tgttaacagg agacaagaag aagaccactt ccagtttcca gacatggctg atgggggcta 301 ccctaataaa attaagaggc cttgccttga agatgtcacc cttgcaatgg gcccaggtgc 361 tcatcctagt actgcttgtg cagaactgca ggtccctcca ttgacaataa atcctagccc 421 tgcggctatg ggagtggctg gccagtcatt actgctggag aataacccta tgaatggcaa 481 catcatgggc tcaccatttg tagtaccaca gactacagaa gtgggactga aagggcccac 541 tgttccttac tatgagaaaa tcaacagcgt gccggctgta gaccaggagc ttcaagagct 601 gctagaggag ctcaccaaaa ttcaagaccc ttctccaaat gagctagatc ttgagaagat 661 actggggacg aagccagaag agccactggt tttagatcat ccccaggcaa ccctaagcac 721 aactcccaag ccttcggttc agatgtcaca cttggagagc ctggcttcca gcaaggagtt 781 tgcttctagt tgcagccaag ttactggcat gtcacttcag atcccatcct cctccacagg 841 gatcagctat tcgattcctt ccaccagtaa gcagatagtg tcaccgagtt cttcaatggc 901 acagtccaag agccaggtcc aggccatgct ccctgtcgct ctgcccccct taccagtgcc 961 tcagtggcat cacgcccacc agctgaaggc gttggcagcc agcaagcagg ggtctgctac 1021 aaagcagcaa gggcccaccc ccagttggtc tggtctgcct cctccaggac tctctccacc 1081 ttcccgccca gtgccatcac tacagccacc accgctgcca ctgccaccac caccaccccc 1141 attcagcccc cagagcctca tggtgtcctg catgtcgtcc aataccttgt cgggtagcac 1201 tctccgaggc tctcccaatg ccttactgtc aagcatgacg tccagcagca atgctgccct 1261 ggggcccgcc atgccctatg ctcctgagaa gctccccagc cctgctctca ctcaacagcc 1321 gcagttcggc cctcagagct ccattcttgc caacctcatg tcctctacca tcaaaacccc 1381 tcaaggacac ctgatgtctg ctctgcctgc cagcaaccct gggccgtccc caccctatcg 1441 cccagagaag ctctctagcc caggcttgcc acagcagtcc ttcaccccac agtgttccct 1501 gatccgaagc ctcactccca ccagtaatct tctaagccag caacagcagc agcagcagca 1561 gcagcagcaa gcaaatgtga tctttaagcc cataagcagc aactcatcca aaaccctgag 1621 catgatcatg cagcagggga tggcaagctc cagcccagga gccacggagc catttacttt 1681 tggcaacacc aagcccttgt cccattttgt ttctgagccg ggtccccaga agatgccctc 1741 catgcctacc acctctaggc agccttccct gctccactac ctgcagcagc cgacaccaac 1801 gcaggcctcc tcagccactg cctcctccac ggccactgcc accttgcagc tgcagcagca 1861 gcagcagcaa cagcagcagc agcctgacca ttcttcattc cttctgcagc agatgatgca 1921 gcaaccccag cgttttcagc gatcagtggc ctcagattcc atgcctgctc tgcccagaca 1981 ggtctgttgc catctgtttg catggacttc tgcagctagc tcggtgaagc cccagcatca 2041 acacgggaac tctttcacta gcaggcaaga tcctcagcct ggagacgtgt caccgtctaa 2101 cattactcat gtagacaaag cctgcaagct tggggaagcc aggcaccccc aggtcagcct 2161 cgggcgacag cccccgtcct gccaggccct ggggagtgag tccttcctgc ccggcagctc 2221 ctttgctcat gagctggccc gagtcacctc ctcgtacagc acctcagagg cagcgccctg 2281 gggcagctgg gatccgaagg cctggaggca ggtgcccgct ccactactgc ctagctgcga 2341 cgccacagca agaggaacag agatcaggtc ttatggcaat gacccctgaa cggcagaatg 2401 catatatctc ccaacagatg agtccatttg aagccgtcca agaacaagtc acctccaagt 2461 gtagccggat caaggcaagc cccccatcta gcaagcactt gatgccaccc agaactgggc 2521 ttcttcagaa caatctgagt ccaggaatga tcccactcac caggcaccag agctgcgagg 2581 gcatgggagt gatctcacca actctgggga agcggcaagg aattttcacc tccagccccc 2641 agtgtcccat cctctcacac tcaggccaga ctcccctggg cagacttgac tctgtctgcc 2701 agcatatgca gagccccaag gccaccccac cagaagtgcc cctgcctggg ttctgtccca 2761 gctccctggg cacccagtcc ttgagtcccc accagctcag acggcctagt gtgccaagaa 2821 tgcccactgc gttcaacaat gctgcatggg tcacagcggc agcagctgtg accacagcag 2881 tttcggggaa aacacccctc agccaagtgg ataatagcgt tcagcagcac tcaccttctg 2941 gccaggcctg ccttcagagg ccatctgatt gggaggcaca agtgcccgct gcgatgggaa 3001 cacaagtgcc cctggccaac aaccccagct tcagcctgct gggcagccag agcctcaggc 3061 agagcccggt acagggcccg gtgcctgtag caaacaccac caagttcctc cagcagggta 3121 tggccagctt tagtcccctg agccccatac agggcatcga gccaccaagc tatgtggctg 3181 ctgctgccac cgctgctgct gcttctgccg ttgctgccag ccagttccca ggtccgttcg 3241 acagaacgga tattccccct gagctgccac ctgccgactt tttgcgccag ccccaacccc 3301 cactaaatga tctgatttcg tcacctgact gcaatgaggt agatttcatt gaagctctct 3361 tgaaaggctc ctgtgtgagc ccagatgaag actgggtgtg caacttgagg ctgatcgacg 3421 acattttgga acagcatgct gctgctcaaa atgccacagc ccagaattct gggcaagtca 3481 cccaggatgc tggggcactt taaatctgag caggatgccc atagaaaccc ccatggtgac 3541 atcactctag gaagtggtgt cgatccatac ccgcagttgt ctcccgttac aatttgagtg 3601 gtgttgtcag cccatgctta tccctctctc tacctgtgac aaaatggaaa gctggtgatt 3661 tttcaagcta cgtgtacata tttgaaaatt ttgtaaatgg ttttcctaaa cattaatgac 3721 agaagtattt atacttcatt ttgtgacttt gtaaataaag cgacggcttt tgtttcagta 3781 gagttgtgtt tactatgcat tgttttgtgt ttattataca atgttacaaa tatgcagacc 3841 gtgttgtttg ctccagtgat accttgttaa gctaggtggc tgagtcgctt atggttttaa 3901 tgcaatgagc aatgtggata tgaccaagag ttgttgtgca agttgacaaa tgccaaatag 3961 aaaaccactt ggccatttat ttctatgttc actaaaaatc ctattgcctt gtgtgattct 4021 taatctcttt tgcgaacctt tcagtctccg ctagctcttt cctaatgagc tttacagcag 4081 aagctgtttt atcgttaagt gccccacaga gacactttac caggaggctg ggagagttct 4141 ccagatttgg gagaggcgca gagacagtgt gtgagccgag ccctgtctca gcaatccacc 4201 tggaggagct agagtatcct cctcccttta ccattcagac cgagagaaaa agcccagctt 4261 gtgtgcaccc tcgtggggtt aaggcgagct gttcctggtt taaagccttt cagtatttgt 4321 tttgatgtaa ggctctgtgg tttggggggg aacatctgta aacattatta gttgatttgg 4381 ggtttgtctt tgatggtttc tatctgcaat tatcgtcatg tatatttaag tgtctgttat 4441 agaaaaccca cacccactgt cctgtaaact tttctcagtg tccagacttt ctgtaatcac 4501 attttaattg ccacctcgta tttcacctct acatttgaaa tctggcgtct gtttcaagcc 4561 agtgtgtttt ttcttcgttc tgtaataaac agccaggag // LOCUS HSU46024 3411 bp mRNA PRI 27-AUG-1997 DEFINITION Homo sapiens myotubularin (MTM1) mRNA, complete cds. ACCESSION U46024 NID g1378039 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3411) AUTHORS Laporte,J., Hu,L.J., Kretz,C., Mandel,J.L., Kioschis,P., Coy,J.F., Klauck,S.M., Poustka,A. and Dahl,N. TITLE A gene mutated in X-linked myotubular myopathy defines a new putative tyrosine phosphatase family conserved in yeast JOURNAL Nature Genet. 13 (2), 175-182 (1996) MEDLINE 96225444 REFERENCE 2 (bases 1 to 3411) AUTHORS Laporte,J., Kioschis,P., Hu,L.J., Kretz,C., Coy,J., Klauck,S., Poutska,A., Dahl,N. and Mandel,J.L. TITLE Direct Submission JOURNAL Submitted (12-JAN-1996) Jocelyn Laporte, Human Genetics, I.G.B.M.C., 1, rue Laurent Fries - BP 163, Illkirch, 67404, France FEATURES Location/Qualifiers source 1..3411 /organism="Homo sapiens" /note="names: G45, XAP84" /db_xref="taxon:9606" /chromosome="X" /map="Xq28, between DXS7423 and DXS1684" gene 1..3411 /gene="MTM1" CDS 55..1866 /gene="MTM1" /note="X-linked myotubular myopathy gene; protein tyrosine phosphatase homolog" /codon_start=1 /product="myotubularin" /db_xref="PID:g2344875" /translation="MASASTSKYNSHSLENESIKRTSRDGVNRDLTEAVPRLPGETLI TDKEVIYICPFNGPIKGRVYITNYRLYLRSLETDSSLILDVPLGVISRIEKMGGATSR GENSYGLDITCKDMRNLRFALKQEGHSRRDMFEILTRYAFPLAHSLPLFAFLNEEKFN VDGWTVYNPVEEYRRQGLPNHHWRITFINKCYELCDTYPALLVVPYRASDDDLRRVAT FRSRNRIPVLSWIHPENKTVIVRCSQPLVGMSGKRNKDDEKYLDVIRETNKQISKLTI YDARPSVNAVANKATGGGYESDDAYHNAELFFLDIHNIHVMRESLKKVKDIVYPNVEE SHWLSSLESTHWLEHIKLVLTGAIQVADKVSSGKSSVLVHCSDGWDRTAQLTSLAMLM LDSFYRSIEGFEILVQKEWISFGHKFASRIGHGDKNHTDADRSPIFLQFIDCVWQMSK QFPTAFEFNEQFLIIILDHLYSCRFGTFLFNCESARERQKVTERTVSLWSLINSNKEK FKNPFYTKEINRVLYPVASMRHLELWVNYYIRWNPRIKQQQPNPVEQRYMELLALRDE YIKRLEELQLANSAKLSDPPTSPSSPSQMMPHVQTHF" BASE COUNT 1057 a 609 c 701 g 1044 t ORIGIN 1 gcagccgagc agcctggcaa cggcggtggc gcccggagcc cgagagtttc caggatggct 61 tctgcatcaa cttctaaata taattcacac tccttggaga atgagtctat taagaggacg 121 tctcgagatg gagtcaatcg agatctcact gaggctgttc ctcgacttcc aggagaaaca 181 ctaatcactg acaaagaagt tatttacata tgtcctttca atggccccat taagggaaga 241 gtttacatca caaattatcg tctttattta agaagtttgg aaacggattc ttctctaata 301 cttgatgttc ctctgggtgt gatctcgaga attgaaaaaa tgggaggcgc gacaagtaga 361 ggagaaaatt cctatggtct agatattact tgtaaagaca tgagaaacct gaggttcgct 421 ttgaaacagg aaggccacag cagaagagat atgtttgaga tcctcacgag atacgcgttt 481 cccctggctc acagtctgcc attatttgca tttttaaatg aagaaaagtt taacgtggat 541 ggatggacag tttacaatcc agtggaagaa tacaggaggc agggcttgcc caatcaccat 601 tggagaataa cttttattaa taagtgctat gagctctgtg acacttaccc tgctcttttg 661 gtggttccgt atcgtgcctc agatgatgac ctccggagag ttgcaacttt taggtcccga 721 aatcgaattc cagtgctgtc atggattcat ccagaaaata agacggtcat tgtgcgttgc 781 agtcagcctc ttgtcggtat gagtgggaaa cgaaataaag atgatgagaa atatctcgat 841 gttatcaggg agactaataa acaaatttct aaactcacca tttatgatgc aagacccagc 901 gtaaatgcag tggccaacaa ggcaacagga ggaggatatg aaagtgatga tgcatatcat 961 aacgccgaac ttttcttctt agacattcat aatattcatg ttatgcggga atctttaaaa 1021 aaagtgaagg acattgttta tcctaatgta gaagaatctc attggttgtc cagtttggag 1081 tctactcatt ggttagaaca tatcaagctc gttttgacag gagccattca agtagcagac 1141 aaagtttctt cagggaagag ttcagtgctt gtgcattgca gtgacggatg ggacaggact 1201 gctcagctga catccttggc catgctgatg ttggatagct tctataggag cattgaaggg 1261 ttcgaaatac tggtacaaaa agaatggata agttttggac ataaatttgc atctcgaata 1321 ggtcatggtg ataaaaacca caccgatgct gaccgttctc ctatttttct ccagtttatt 1381 gattgtgtgt ggcaaatgtc aaaacagttc cctacagctt ttgaattcaa tgaacaattt 1441 ttgattataa ttttggatca tctgtatagt tgccgatttg gtactttctt attcaactgt 1501 gaatctgctc gagaaagaca gaaggttaca gaaaggactg tttctttatg gtcactgata 1561 aacagtaata aagaaaaatt caaaaacccc ttctatacta aagaaatcaa tcgagtttta 1621 tatccagttg ccagtatgcg tcacttggaa ctctgggtga attactacat tagatggaac 1681 cccaggatca agcaacaaca gccgaatcca gtggagcagc gttacatgga gctcttagcc 1741 ttacgcgacg aatacataaa gcggcttgag gaactgcagc tcgccaactc tgccaagctt 1801 tctgatcccc caacttcacc ttccagtcct tcgcaaatga tgccccatgt gcaaactcac 1861 ttctgagggg ggaccctggc accgcattag agctcgaaat aaaggcgata gctgactttc 1921 atttggggca tttgtaaaaa gtagattaaa atatttgcct ccatgtagaa cttgaactaa 1981 cataatctta aactcttgaa tatgtgcctt ctagaataca tattacaaga aaactacagg 2041 gtccacacgg caatcagaag aaaggagctg agatgaggtt ttggaaaacc ctgacacctt 2101 taaaaagcag tttttgaaag acaaaattta gatttaattt acgtcttgag aaatactata 2161 tatacaatat atatgggggg ggcttaattg aaacaacatt attttaaaat caaaggggat 2221 atatgtttgt ggaatggatt ttcctgaagc tgcttaacag ttgctttgga ttctctaaga 2281 tgaatccaaa tgtgaaagat gcatgttact gccaaaacca aattgagctc agcttcctag 2341 gcattaccca aaagcaaggt gtttaagtaa ttgccagctt ttataccatc atgagtggtg 2401 acttaaggag aaatagctgt atagatgagt ttttcattat ttggaaattt aggggtagaa 2461 aatgttttcc cctaattttc cagagaagcc tatttttata tttttaaaaa actgacaggg 2521 cccagttaaa tatgatttgc attttttaaa tttgccagtt ttattttcta aattctttca 2581 tgagcttgcc taaaattcgg aatggttttc gggttgtggc aaaccccaaa gagagcactg 2641 tccaaggatg tcgggagcat cctgctgctt aggggaatgt tttcgcaaat gttgctctag 2701 tcagtccagc tcatctgcca aaatgtaggg ctaccgtctt ggatgcatga gctattgcta 2761 gagcatcatc cttagaaatc agtgccccag atgtacatgt gttgagcgta ttcttgaagt 2821 attgtgttta tgcatttcaa tttcaatggt gttggcttcc cctccccacc ccacgcgtgc 2881 ataaaaactg gttctacaaa tttttacttg aagtaccagg ccgtttgctt tttcaggttg 2941 ttttgtttta tagtattaag tgaaatttta aatgcacagt tctatttgct atctgaacta 3001 attcatttat taagtatatt tgtaaaagct aaggctcgag ttaaaacaat gaagtgtttt 3061 acaatgattt gtaaaggact atttataact aatatggttt tgttttcaat gaattaagaa 3121 agattaaata tatctttgta aattatttta tgtcatagtt taattggtct cccaagtaag 3181 acatctcaaa tacagtagta taatgtatga attttgtaag tataagaaat tttattagac 3241 attctcttac tttttgtaaa tgctgtaaat atttcataaa ttaacaaagt gtcactccat 3301 aaaaagaaag ctaatactaa tagcctaaaa gattttgtga aatttcatga aaacttttta 3361 atggcaataa tgactaaaga cctgctgtaa taaatgtatt aactgaaacc t // LOCUS HSU46025 2898 bp DNA PRI 11-DEC-1996 DEFINITION Human translation initiation factor eIF-3 p110 subunit gene, complete cds. ACCESSION U46025 NID g1718196 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2898) AUTHORS Asano,K., Kinzy,T.G., Merrick,W.C. and Hershey,J.W.B. TITLE Conservation and diversity of eukaryotic translation initiation factor eIF3 JOURNAL J. Biol. Chem. (1997) In press REFERENCE 2 (bases 1 to 2898) AUTHORS Asano,K. and Hershey,J.W.B. TITLE Direct Submission JOURNAL Submitted (12-JAN-1996) Katsura Asano, Biological Chemistry, University of California at Davis, School of Medicine, Building MS1A, Davis, CA 95616, USA FEATURES Location/Qualifiers source 1..2898 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 50..2791 /codon_start=1 /product="translation intiation factor eIF-3 p110 subunit" /db_xref="PID:g1718197" /translation="MSRFFTTGSDSESESSLSGEELVTKPVGGNYGKQPLLLSEDEED TKRVVRSAKDKRFEELTNLIRTIRNAMKIRDVTKCLEEFELLGKAYGKAKSIVDKEGV PRFYIRILADLEDYLNELWEDKEGKKKMNKNNAKALSTLRQKIRKYNRDFESHITSYK QNPEQSADEDAEKNEEDSEGSSDEDEDEDGVSAATFLKKKSEAPSGESRKFLKKMDDE DEDSEDSEDDEDWDTGSTSSDSDSEEEEGKQTALASRFLKKAPTTDEDKKAAEKKRED KAKKKHDRKSKRLDEEEEDNEGGEWERVRGGVPLVKEKPKMFAKGTEITHAVVIKKLN EILQARGKKGTDRAAQIELLQLLVQIAAENNLGEGVIVKIKFNIIASLYDYNPNLATY MKPEMWGKCLDCINELMDILFANPNIFVGENILEESENLHNADQPLRVRGCILTLVER MDEEFTKIMQNTDPHSQEYVEHLKDEAQVCAIIERVQRYLEEKGTTEEVCRIYLLRIL HTYYKFDYKAHQRQLTPPEGSSKSEQDQAENEGEDSAVLMERLCKYIYAKDRTDRIRT CAILCHIYHHALHSRWYQARDLMLMSHLQDNIQHADPPVQILYNRTMVQLGICAFRQG LTKDAHNALLDIQSSGRAKELLGQGLLLRSLQERNQEQEKVERRRQVPFHLHINLELL ECVYLVSAMLLEIPYMAAHESDARRRMISKQFHHQLRVGERQPLLGPPESMREHVVAA SKAMKMGDWKTCHSFIINEKMNGKVWDLFPEADKVRTMLVRKIQEESLRTYLFTYSSV YDSISMETLSDMFELDLPTVHSIISKMIINEELMASLDQPTQTVVMHRTEPTAQQNLA LQLAEKLGSLVENNERVFDHKQGTYGGYFRDQKDGYRKNEGYMRRGGYRQQQSQTAY" polyA_site 2898 /note="47 A nucleotides" BASE COUNT 772 a 762 c 819 g 545 t ORIGIN 1 tgactcgcgg gctcagctgg tccggccgta gcacctccgc gccgtcgcca tgtcgcggtt 61 tttcaccacc ggttcggaca gcgagtccga gtcgtccttg tccggggagg agctcgtcac 121 caaacctgtc ggaggcaact atggcaaaca gccattgttg ctgagcgagg atgaagaaga 181 taccaagaga gttgtccgca gtgccaagga caagaggttt gaggagctga ccaaccttat 241 ccggaccatc cgtaatgcca tgaagattcg tgatgtcacc aagtgcctgg aagagtttga 301 gctcctggga aaagcatatg ggaaggccaa aagcattgtg gacaaagaag gtgtcccccg 361 gttctatatc cgcatcctgg ctgacctaga ggactatctt aatgagcttt gggaagataa 421 ggaagggaag aagaagatga acaagaacaa tgccaaggct ctgagcacct tgcgtcagaa 481 gatccgaaaa tacaaccgtg atttcgagtc ccatatcaca agctacaagc agaaccccga 541 gcagtctgcg gatgaagatg ctgagaaaaa tgaggaggat tcagaaggct cttcagatga 601 ggatgaggat gaggacggag tcagtgctgc aactttcttg aagaagaaat cagaagctcc 661 ttctggggag agtcgcaagt tcctcaaaaa gatggatgat gaagatgagg actcagaaga 721 ttccgaagat gatgaagact gggacacagg ttccacatct tccgattccg actcagagga 781 ggaagaaggg aaacaaaccg cgctggcctc aagatttctt aaaaaggcac ccaccacaga 841 tgaggacaag aaggcagccg agaagaaacg ggaggacaaa gctaagaaga agcacgacag 901 gaaatccaag cgcctggatg aggaggagga ggacaatgaa ggcggggagt gggaaagggt 961 ccggggcgga gtgccgttgg ttaaggagaa gccaaaaatg tttgccaagg gaactgagat 1021 cacccatgct gttgttatca agaaactgaa tgagatccta caggcacgag gcaagaaggg 1081 aactgatcgt gctgcccaga ttgagctgct gcaactgctg gttcagattg cagcggaaaa 1141 caacctggga gagggcgtca ttgtcaagat caagttcaat atcatcgcct ctctctatga 1201 ctacaacccc aacctggcaa cctacatgaa gccagagatg tgggggaagt gcctggactg 1261 catcaatgag ctgatggata tcctgtttgc aaatcccaac atttttgttg gagagaatat 1321 tctggaagag agtgagaacc tgcacaacgc tgaccagcca ctgcgtgtcc gtggctgcat 1381 cctaactctg gtggaacgaa tggatgaaga atttaccaaa ataatgcaaa atactgaccc 1441 tcactcccaa gagtacgtgg agcacttgaa ggatgaggcc caggtgtgtg ccatcatcga 1501 gcgtgtgcag cgctacctgg aggagaaggg cactaccgag gaggtctgcc gcatctacct 1561 gctgcgcatc ctgcacacct actacaagtt tgattacaag gcccatcagc gacagctgac 1621 cccgcctgag ggctcctcaa agtctgagca agaccaggca gaaaatgagg gcgaggactc 1681 ggctgtgttg atggagagac tgtgcaagta catctacgcc aaggaccgca cagaccggat 1741 ccgcacatgt gccatcctct gccacatcta ccaccatgct ctgcactcgc gctggtacca 1801 ggcccgcgac ctcatgctca tgagccactt gcaggacaac attcagcatg cagacccgcc 1861 agtgcagatc ctttacaacc gcaccatggt gcagctgggc atctgtgcct tccgccaagg 1921 cctgaccaag gacgcacaca acgccctgct ggacatccag tcgagtggcc gagccaagga 1981 gcttctgggc cagggcctgc tgctgcgcag cctgcaggag cgcaaccagg agcaggagaa 2041 ggtggagcgg cgccgtcagg tccccttcca cctgcacatc aacctggagc tgctggagtg 2101 tgtctacctg gtgtctgcca tgctcctgga gatcccctac atggccgccc atgagagcga 2161 tgcccgccga cgcatgatca gcaagcagtt ccaccaccag ctgcgcgtgg gcgagcgaca 2221 gcccctgctg ggtccccctg agtccatgcg ggaacatgtg gtcgctgcct ccaaggccat 2281 gaagatgggt gactggaaga cctgtcacag ttttatcatc aatgagaaga tgaatgggaa 2341 agtgtgggac cttttccccg aggctgacaa agtccgcacc atgctggtta ggaagatcca 2401 ggaagagtca ctgaggacct acctcttcac ctacagcagt gtctatgact ccatcagcat 2461 ggagacgctg tcagacatgt ttgagctgga tctgcccact gtgcactcca tcatcagcaa 2521 aatgatcatt aatgaggagc tgatggcctc cctggaccag ccaacacaga cagtggtgat 2581 gcaccgcact gagcccactg cccagcagaa cctggctctg cagctggccg agaagctggg 2641 cagcctggtg gagaacaacg aacgggtgtt tgaccacaag cagggcacct acgggggcta 2701 cttccgagac cagaaggacg gctaccgcaa aaacgagggc tacatgcgcc gcggtggcta 2761 ccgccagcag cagtctcaga cggcctactg agctctccac tctgtttccc gcctgggcca 2821 tccaaccttg aagtcctaaa ccacacctca gtcactaaag gtctgtttaa agttgttctg 2881 gttgattgct tgttgcca // LOCUS HSU46191 1118 bp mRNA PRI 06-DEC-1996 DEFINITION Human renal cell carcinoma antigen RAGE-1 mRNA, complete putative cds. ACCESSION U46191 NID g1517896 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1118) AUTHORS Gaugler,B., Brouwenstijn,N., Vantomme,V., Szikora,J.P., Van der Spek,C.W., Patard,J.J., Boon,T., Schrier,P. and Van den Eynde,B.J. TITLE A new gene coding for an antigen recognized by autologous cytolytic T lymphocytes on a human renal carcinoma JOURNAL Immunogenetics 44 (5), 323-330 (1996) MEDLINE 96376527 REFERENCE 2 (bases 1 to 1118) AUTHORS Van den Eynde,B.J., Gaugler,B., Schrier,P. and Brouwenstijn,N. TITLE Direct Submission JOURNAL Submitted (18-JAN-1996) Benoit J. Van den Eynde, Ludwig Inst. for Cancer Research, Avenue Hippocrate, 74 - UCL7459, Brussels, B-1200, Belgium FEATURES Location/Qualifiers source 1..1118 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 204..326 /note="RAGE-1 ORF2; one of 3 possible coding regions; a peptide derived from this protein is displayed at the cell surface where it is recognized by CTL" /codon_start=1 /db_xref="PID:g1517897" /translation="MSSAHPLRRSSPSSNRIRNTSTNNQFVPTMPLPPARNGGL" CDS 313..399 /note="RAGE-1 ORF3; one of 3 possible coding regions" /codon_start=1 /db_xref="PID:g1517898" /translation="MVAYDPDERIAAHQALQHPYFQEQRNSP" CDS 444..665 /note="RAGE-1 ORF5; one of 3 possible coding regions" /codon_start=1 /db_xref="PID:g1517899" /translation="MELPKLKLSGVVRLSSYSSPTLQSVLGSGTNGRVPVLRPLKCIP ASKKTDPQKDLKPAPQQCRLPTIVRKGGR" BASE COUNT 283 a 301 c 297 g 237 t ORIGIN 1 ggagttttga gtccacagat aaaatgtgtc tccttcgtct ctactagaga ggaaaaagaa 61 ctggaattgg aagaacaggg agactgaagg gtagcaagag aggctggaga agagagtgaa 121 aagaccgctt acctgatttg aaattgtctg cagcccctct ttcctggagt aaatgaactg 181 gaccaaatct caaaaaatcc acgatgtcat cggcacaccc gctcagaaga tcctcaccaa 241 gttcaaacag gatcaggaat acctctacta acaaccaatt tgtccccaca atgcctctcc 301 ctcctgcacg caatggtggc ctatgatccc gatgagagaa tcgccgccca ccaggccctg 361 cagcacccct acttccaaga acagagaaac agtccctaaa gcaagaggag gaccgtccca 421 agagacgagg accggcctat gtcatggaac tgcccaaact aaagctttcg ggagtggtca 481 gactgtcgtc ttactccagc cccacgctgc agtccgtgct tggatctgga acaaatggaa 541 gagtgccggt gctgagaccc ttgaagtgca tccctgcgag caagaagaca gatccgcaga 601 aggaccttaa gcctgccccg cagcagtgtc gcctgcccac catagtgcgg aaaggcggaa 661 gataactgag cagcaccgtc gtctcgactt cggaggcaac accaagcccg accgggccag 721 gcctgggtga tctgctgctg agacgccacg gagggctggg gatgcgcctg cgtccgtttc 781 gcgctggccg gggctctggg tgctgccctg cgccctgccg cacccgcggc ccgcgcagct 841 gcctaggatg ttctgggcta atatacttgt aaaaccaccg cattctaggg ttttctttca 901 ttttcgttaa gaatttgggg caggaaatac tttgtaactt tgtatatgaa tcaaaacaaa 961 cgagcaggca tttctgtgat gtgttgggcg tggttggaag gtgggttctg cgtgtccctt 1021 cccagcgctg ctggtcagtc gtggagcgcc atcatgtctt accagtgacg ctgctgacac 1081 ccctgacttt tattaaagaa taagctgtcg ttaaaaaa // LOCUS HSU46192 1168 bp mRNA PRI 06-DEC-1996 DEFINITION Human renal cell carcinoma antigen RAGE-2 mRNA, complete putative cds. ACCESSION U46192 NID g1517900 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1168) AUTHORS Gaugler,B., Brouwenstijn,N., Vantomme,V., Szikora,J.P., Van der Spek,C.W., Patard,J.J., Boon,T., Schrier,P. and Van den Eynde,B.J. TITLE A new gene coding for an antigen recognized by autologous cytolytic T lymphocytes on a human renal carcinoma JOURNAL Immunogenetics 44 (5), 323-330 (1996) MEDLINE 96376527 REFERENCE 2 (bases 1 to 1168) AUTHORS Van den Eynde,B.J., Gaugler,B., Schrier,P. and Brouwenstijn,N. TITLE Direct Submission JOURNAL Submitted (18-JAN-1996) Benoit J. Van den Eynde, Ludwig Inst. for Cancer Research, Avenue Hippocrate, 74 - UCL7459, Brussels, B-1200, Belgium FEATURES Location/Qualifiers source 1..1168 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 217..276 /note="RAGE-2 ORF2; one of 3 possible coding regions" /codon_start=1 /db_xref="PID:g1517901" /translation="MSSAHPLRRSSPSSNSREL" CDS 273..449 /note="RAGE-2 ORF3; one of 3 possible coding regions" /codon_start=1 /db_xref="PID:g1517902" /translation="MNFDFPFKKGSGIPLLTTNLSPQCLSLLHAMVAYDPDERIAAHQ ALQHPYFQEQRNSP" CDS 494..715 /note="RAGE-2 ORF5; one of 3 possible coding regions" /codon_start=1 /db_xref="PID:g1517903" /translation="MELPKLKLSGVVRLSSYSSPTLQSVLGSGTNGRVPVLRPLKCIP ASKKTDPQKDLKPAPQQCRLPTIVRKGGR" BASE COUNT 294 a 308 c 309 g 257 t ORIGIN 1 ctgtgactgg tgctggagtt ttgagtccac agataaaatg tgtctccttc gtctctacta 61 gagaggaaaa agaactggaa ttggaagaac agggagactg aagggtagca agagaggctg 121 gagaagagag tgaaaagacc gcttacctga tttgaaattg tctgcagccc ctctttcctg 181 gagtaaatga actggaccaa atctcaaaaa tccacgatgt catcggcaca cccgctcaga 241 agatcctcac caagttcaaa cagtcgagag ctatgaattt tgattttcct tttaaaaagg 301 gatcaggaat acctctacta acaaccaatt tgtccccaca atgcctctcc ctcctgcacg 361 caatggtggc ctatgatccc gatgagagaa tcgccgccca ccaggccctg cagcacccct 421 acttccaaga acagagaaac agtccctaaa gcaagaggag gaccgtccca agagacgagg 481 accggcctat gtcatggaac tgcccaaact aaagctttcg ggagtggtca gactgtcgtc 541 ttactccagc cccacgctgc agtccgtgct tggatctgga acaaatggaa gagtgccggt 601 gctgagaccc ttgaagtgca tccctgcgag caagaagaca gatccgcaga aggaccttaa 661 gcctgccccg cagcagtgtc gcctgcccac catagtgcgg aaaggcggaa gataactgag 721 cagcaccgtc gtctcgactt cggaggcaac accaagcccg accgggccag gcctgggtga 781 tctgctgctg agacgccacg gagggctggg gatgcgcctg cgtccgtttc gcgctggccg 841 gggctctggg tgctgccctg cgccctgccg cacccgcggc ccgcgcagct gcctaggatg 901 ttctgggcta atatacttgt aaaaccaccg cattctaggg ttttctttca ttttcgttaa 961 gaatttgggg caggaaatac tttgtaactt tgtatatgaa tcaaaacaaa cgagcaggca 1021 tttctgtgat gtgttgggcg tggttggaag gtgggttctg cgtgtccctt cccagcgctg 1081 ctggtcagtc gtggagcgcc atcatgtctt accagtgacg ctgctgacac ccctgacttt 1141 tattaaagaa taagctgtcg ttaaaaaa // LOCUS HSU46570 1407 bp mRNA PRI 28-NOV-1996 DEFINITION Human tetratricopeptide repeat protein (tpr1) mRNA, complete cds. ACCESSION U46570 NID g1688073 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1407) AUTHORS Murthy,A.E., Bernards,A., Church,D., Wasmuth,J. and Gusella,J.F. TITLE Identification and characterization of two novel tetratricopeptide repeat-containing genes JOURNAL DNA Cell Biol. 15 (9), 727-735 (1996) MEDLINE 96433003 REFERENCE 2 (bases 1 to 1407) AUTHORS Murthy,A.E., Bernards,A., Church,D., Wasmuth,J. and Gusella,J.F. TITLE Direct Submission JOURNAL Submitted (19-JAN-1996) Anita E. Murthy, Molecular Neurogenetics Unit, Massachusetts General Hospital, 149 13th St., Charlestown, MA 02129, USA FEATURES Location/Qualifiers source 1..1407 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="5" /map="5q32-33.2" /dev_stage="fetus" /tissue_type="liver; brain" /cell_line="HeLa" gene 51..929 /gene="tpr1" CDS 51..929 /gene="tpr1" /codon_start=1 /product="tetratricopeptide repeat protein" /db_xref="PID:g1688074" /translation="MGEKSENCGVPEDLLNGLKVTDTQEAECAGPPVPDPKNQHSQSK LLRDDEAHLQEDQGEEECFHDCSASFEEEPGADKVENKSNEDVNSSELDEEYLIELEK NMSDEEKQKRREESTRLKEEGNEQFKKGDYIEAESSYSRALEMCPSCFQKERSILFSN RAAARMKQDKKEMAINDCSKAIQLNPSYIRAILRRAELYEKTDKLDEALEDYKSILEK DPSIHQAREACMRLPKQIEERNERLKEEMLGKLKDLGNLVLRPFGLSTENFQIKQDSS TGSYSINFVQNPNNNR" BASE COUNT 448 a 289 c 334 g 336 t ORIGIN 1 gaccggagaa gctgtgaggt tctttagcgt cacctccctc actgggcagc atgggggaga 61 agtcagagaa ctgtggggtt ccagaggatc tgttaaatgg tttgaaggtt acagatactc 121 aggaagccga gtgtgctggc cctccagttc ctgatcccaa aaatcagcat tcccagagta 181 agctgctcag ggatgatgag gcccatctcc aggaggacca gggagaagag gagtgttttc 241 atgactgcag tgcctcattt gaggaggagc caggagcgga caaggttgag aacaaatcta 301 atgaagatgt gaattcctct gaactagatg aagaatacct aatagaactg gaaaaaaaca 361 tgtcggatga agagaaacag aaaagaagag aagagagcac tagactaaag gaggagggaa 421 atgaacagtt taagaaagga gattatatag aagctgaaag ttcttatagt cgagccctcg 481 aaatgtgccc atcctgcttc caaaaggaga ggtcgattct attttcaaat agagctgcag 541 caaggatgaa acaggacaag aaagaaatgg ccatcaatga ctgcagcaaa gcaattcaat 601 taaaccccag ctatatcagg gcaatattga ggagagcaga gttgtatgag aagacggaca 661 agctagatga agccctggaa gactataaat ctatattaga aaaagatcca tcaatacatc 721 aagcaagaga agcttgtatg agattaccta agcaaattga agaacgtaat gaaagactaa 781 aagaagagat gttaggtaaa ttaaaagatc ttgggaactt ggttctccga ccttttgggc 841 tctccacgga aaatttccag atcaaacagg attcctctac cggctcgtac tccatcaatt 901 tcgttcaaaa tccaaataat aacagataac aaagataaca aaagctttac aagctgactt 961 ggaattgtgt gctgcttgct gttagctagg ggaaaggccc tgccaatgtt taacttttaa 1021 aagcatctta tctaaaagaa aggctatcca gtagagccca gtgctccctt gtccctcttt 1081 tatgatcagg gtgaaatgta cttcctgatg taatgaacct aatttgattt ccattttaag 1141 gtggtgtctg tgcagctggt gtccccgatt ctggctgtcc tatgtccagg aagaagccca 1201 tttgttgagg ctgaccttcc tgatcataca cacacacagc ccagcaaaag cctctcctga 1261 accaaacaaa cctgttggtt gggagactgc ccagacatga ttgatgacgg gttcccgcct 1321 gctgtcccct ccctgatcac acagctaacg aggctgcctc cagcatttcc tgatttcctc 1381 tgtggtaata aaagctttct gtgctta // LOCUS HSU46571 1756 bp mRNA PRI 28-NOV-1996 DEFINITION Human tetratricopeptide repeat protein (tpr2) mRNA, complete cds. ACCESSION U46571 NID g1688075 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1756) AUTHORS Murthy,A.E., Bernards,A., Church,D., Wasmuth,J. and Gusella,J.F. TITLE Identification and characterization of two novel tetratricopeptide repeat-containing genes JOURNAL DNA Cell Biol. 15 (9), 727-735 (1996) MEDLINE 96433003 REFERENCE 2 (bases 1 to 1756) AUTHORS Murthy,A.E., Bernards,A., Church,D., Wasmuth,J. and Gusella,J.F. TITLE Direct Submission JOURNAL Submitted (19-JAN-1996) Anita E. Murthy, Molecular Neurogenetics Unit, Massachusetts General Hospital, 149 13th St., Charlestown, MA 02129, USA FEATURES Location/Qualifiers source 1..1756 /organism="Homo sapiens" /note="composite cDNA sequence derived from HeLa, human heart and human fetal retina libraries" /db_xref="taxon:9606" /chromosome="17" /map="17q11.2-23" gene 27..1481 /gene="tpr2" CDS 27..1481 /gene="tpr2" /note="related to p58" /codon_start=1 /product="tetratricopeptide repeat protein" /db_xref="PID:g1688076" /translation="MAATEPELLDDQEAKREAETFKEQGNAYYAKKDYNEAYNYYTKA IDMCPKNASYYGNRAATLMMLGRFREALGDAQQSVRLDDSFVRGHLREGKCHLSLGNA MAACRSFQRALELDHKNAQAQQEFKNANAVMEYEKIAETDFEKRDFRKVVFCMDRALE FAPACHRFKILKAECLAMLGRYPEAQSVASDILRMDSTNADALYVRGLCLYYEDCIEK AVQFFVQALRMAPDHEKACIACRNAKALKAKKEDGNKAFKEGNYKLAYELYTEALGID PNNIKTNAKLYCNRGTVNSKLRKLDDAIEDCTNAVKLDDTYIKAYLRRAQCYMDTEQY EEAVRDYEKVYQTEKTKEHKQLLKNAQLELKKSKRKDYYKILGVDKNASEDEIKKAYR KRALMHHPDRHSGASAEVQKEEEKKFKEVGEAFTILSDPKKKTRYDSGQDLDEEGMNM GDFDPNNIFKAFFGGPGGFSFEASGPGNFFFQFG" BASE COUNT 561 a 342 c 456 g 397 t ORIGIN 1 cggctgccgc ggagtgcgat gtggtaatgg cggcgaccga gccggagctg ctcgacgacc 61 aagaggcgaa gagggaagca gagactttca aggaacaagg aaatgcatac tatgccaaga 121 aagattacaa tgaagcttat aattattata caaaagccat agatatgtgt cctaaaaatg 181 ctagctatta tggtaatcga gcagccacct tgatgatgct tggaaggttc cgggaagctc 241 ttggagatgc acaacagtca gtgaggttgg atgacagttt tgtccgggga catctacgag 301 agggcaagtg ccacctctct ctggggaatg ccatggcagc atgtcgcagc ttccagagag 361 ccctagaact ggatcataaa aatgctcagg cacaacaaga gttcaagaat gctaatgcag 421 tcatggaata tgagaaaata gcagaaacag attttgagaa gcgagatttt cggaaggttg 481 ttttctgcat ggaccgtgcc ctagaatttg cccctgcctg ccatcgcttc aaaatcctca 541 aggcagaatg tttagcaatg ctgggtcgtt atccggaagc acagtctgtg gctagtgaca 601 ttctacgaat ggattccacc aatgcagatg ctctgtatgt acgaggtctt tgcctttatt 661 acgaagattg tattgagaag gcagttcagt ttttcgtaca ggctctcagg atggctcctg 721 accacgagaa ggcctgcatt gcctgcagaa atgccaaagc actcaaagca aagaaagaag 781 atgggaataa agcatttaag gaaggaaatt acaaactagc atatgaactg tacacagaag 841 ccctggggat agaccccaac aatataaaaa caaatgctaa actctactgt aatcggggta 901 cggttaattc caagcttagg aaactagatg atgcaataga agactgcaca aatgcagtga 961 agcttgatga cacttacata aaagcctact tgagaagagc tcagtgttac atggacacag 1021 aacagtatga agaagcagta cgagactatg aaaaagtata ccagacagag aaaacaaaag 1081 aacacaaaca gctcctaaaa aatgcgcagc tggaactgaa gaagagtaag aggaaagatt 1141 actacaagat tctaggagtg gacaagaatg cctctgagga cgagatcaag aaagcttatc 1201 ggaaacgggc cttgatgcac catccagatc ggcatagtgg agccagtgct gaggttcaga 1261 aggaggagga gaagaagttc aaggaagttg gagaggcctt tactatcctc tctgatccca 1321 agaaaaagac tcgctatgac agtggacagg acctagatga ggagggcatg aatatgggtg 1381 attttgatcc aaacaatatc ttcaaggcat tctttggcgg tcctggcggc ttcagctttg 1441 aagcatctgg tccagggaat ttcttttttc aatttggcta atgaagggca accacccaga 1501 acccagaaaa tgcagattca ctcagtttaa tcttgaatgt ggaaacagtt cacctcctcc 1561 cttcatcacg tctccgtgtg cttagagcag tttcgttttc tcagttggat gccctgtgtc 1621 tctgtgagtg gggtggagca aagggaacca atgccgaaga ccgagggcag gggagggagg 1681 cgggggtgga cagggaggca gcttgtgaat ttttgtttta ctgtttaact ttattaaaaa 1741 agaaaaaaaa aaaaaa // LOCUS HSU46837 740 bp mRNA PRI 27-APR-1996 DEFINITION Human RNA polymerase II holoenzyme component SRB7 (SRB7) mRNA, complete cds. ACCESSION U46837 NID g1197662 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 740) AUTHORS Chao,D.M., Gadbois,E.L., Murray,P.J., Anderson,S.F., Sonu,M.S., Parvin,J.D. and Young,R.A. TITLE A mammalian SRB protein associated with an RNA polymerase II holoenzyme JOURNAL Nature 380 (6569), 82-85 (1996) MEDLINE 96175648 REFERENCE 2 (bases 1 to 740) AUTHORS Chao,D.M. TITLE Direct Submission JOURNAL Submitted (23-JAN-1996) David M. Chao, Whitehead Institute, 9 Cambridge Center, Cambridge, MA 02142, USA FEATURES Location/Qualifiers source 1..740 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="peripheral blood lymphocyte" gene 10..444 /gene="SRB7" CDS 10..444 /gene="SRB7" /note="RNA polymerase II holoenzyme component" /codon_start=1 /product="SRB7" /db_xref="PID:g1197663" /translation="MADRLTQLQDAVNSLADQFCNAIGVLQQCGPPASFNNIQTAINK DQPANPTEEYAQLFAALIARTAKDIDVLIDSLPSEESTAALQAASLYKLEEENHEAAT CLEDVVYRGDMLLEKIQSALADIAQSQLKTRSGTHSQSLPDS" BASE COUNT 249 a 144 c 141 g 206 t ORIGIN 1 ggtaggaaca tggcggatcg gctcacgcag cttcaggacg ctgtgaattc gcttgcagat 61 cagttttgta atgccattgg agtattgcag caatgtggtc ctcctgcctc tttcaataat 121 attcagacag caattaacaa agaccagcca gctaacccta cagaagagta tgcccagctt 181 tttgcagcac tgattgcacg aacagcaaaa gacattgatg ttttgataga ttccttaccc 241 agtgaagaat ctacagctgc tttacaggct gctagcttgt ataagctaga agaagaaaac 301 catgaagctg ctacatgtct ggaggatgtt gtttatcgag gagacatgct tctggagaag 361 atacaaagcg cacttgctga tattgcacag tcacagctga agacaagaag tggtacccat 421 agccagtctc ttccagactc atagcatcag tggataccat gtggctgaga aaagaactgt 481 ttgagtgcca ttaagaattc tgcatcagac ttagatacaa gccttaccaa caattacaga 541 aacattaaac aatatgacac attacctttt tagctatttt taatagtctt ctattttcac 601 tcttgataag cttataaaat catgattgaa tcagctttaa agcatcatac catcattttt 661 taactgagtg aaattattaa ggcatgtaat acattaatga acataatata aggaaacata 721 tgtaaaattc aaaaaaaaaa // LOCUS HSU46838 2913 bp mRNA PRI 21-OCT-1996 DEFINITION Human p105MCM mRNA, complete cds. ACCESSION U46838 NID g1197635 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2913) AUTHORS Holthoff,H.P., Hameister,H. and Knippers,R. TITLE A novel human Mcm protein: homology to the yeast replication protein Mis5 and chromosomal location JOURNAL Genomics 37 (1), 131-134 (1996) MEDLINE 97079669 REFERENCE 2 (bases 1 to 2913) AUTHORS Holthoff,H.P. and Knippers,R. TITLE Direct Submission JOURNAL Submitted (23-JAN-1996) Hans-Peter Holthoff, Biology, University of Konstanz, Universitaetsstr. 10, Konstanz 78434, Germany FEATURES Location/Qualifiers source 1..2913 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 50..2515 /note="hmis15; MCM family member" /codon_start=1 /product="p105MCM" /db_xref="PID:g1197636" /translation="MDLAAAAEPGAGSQHLEVRDEVAEKCQKLFLDFLEEFQSSDGEI KYLQLAEELIRPERNTLVVSFVDLEQFNQQLSTTIQEEFYRVYPYLCRALKTFVKDRK EIPLAKDFYVAFQDLPTRHKIRELTSSRIGLLTRISGQVVRTHPVHPELVSGTFLCLD CQTVIRDVEQQFKYTQPNICRNPVCANRRRFLLDTNKSRFVDFQKVRIQETQAELPRG SIPRSLEVILRAEAVESAQAGDKCDFTGTLIVVPDVSKLSTPGARAETNSRVSGVDGY ETEGIRGLRALGVRDLSYRLVFLACCVAPTNPRFGGKELRDEEQTAESIKNQMTVKEW EKVFEMSQDKNLYHNLCTSLFPTIHGNDEVKRGVLLMLFGGVSKDNRRRDLSSGDINV CIVGDPSTAKSQFLKHVEEFSPRAVYTSGKASSAAGLTAAVVRDEESHEFVIEAGALM LADNGVCCIDEFDKMDVRDQVAIHEAMEQQTISITKAGVKATLNTRTSILAAANPISG HYDRSKSLKQNINLSAPIMSRFDLFFILVDECNEVTDYAIARRIVDLHSRIEESIDRV YSLDDIRRYLLFARQFKPKISKESEDFIVEQYKHLRQRDGSGVTKSSWRITVRQLESM IRLSEAMARMHCCDEVQPKHVKEAFRLLNKSIIRVETPDVNLDQEEEIQMEVDEGAGG INGHADSPAPVNGINGYNEDINQESAPKASLRLGFSEYCRISNLIVLHLRKVEEEEDE SALKRSELVNWYLKEIESEIDSEEELINKKRIIEKVIHRLTHYDHVLIEPTQAGLKGS TEGSESYEEDPYLVVNPNYLLED" BASE COUNT 834 a 603 c 726 g 750 t ORIGIN 1 cttgtggcgg tcgagcgtgg cgtaggcgaa tcctcggcac taagcaaata tggacctcgc 61 ggcggcagcg gagccgggcg ccggcagcca gcacctggag gtccgcgacg aggtggccga 121 gaagtgccag aaactgttcc tggacttctt ggaggagttt cagagcagcg atggagaaat 181 taaatacttg caattagcag aggaactgat tcgtcctgag agaaacacat tggttgtgag 241 ttttgtggac ctggaacaat ttaaccagca actttccacc accattcaag aggagttcta 301 tagagtttac ccttacctgt gtcgggcctt gaaaacattc gtcaaagacc gtaaagagat 361 ccctcttgcc aaggattttt atgttgcatt ccaagacctg cctaccagac acaagattcg 421 agagctcacc tcatccagaa ttggtttgct cactcgcatc agtgggcagg tggtgcggac 481 tcacccagtt cacccagagc ttgtgagcgg aacttttctg tgcttggact gtcagacagt 541 gatcagggat gtagaacagc agttcaaata cacacagcca aacatctgcc gaaatccagt 601 ttgtgccaac aggaggagat tcttactgga tacaaataaa tcaagatttg ttgattttca 661 aaaggttcgt attcaagaga cccaagctga gcttcctcga gggagtatcc cccgcagttt 721 agaagtaatt ttaagggctg aagctgtgga atcagctcaa gctggtgaca agtgtgactt 781 tacagggaca ctgattgttg tgcctgacgt ctccaagctt agcacaccag gagcacgtgc 841 agaaactaat tcccgtgtca gtggtgttga tggatatgag acagaaggca ttcgaggact 901 ccgggccctt ggtgttaggg acctttctta taggctggtc tttcttgcct gctgtgttgc 961 gccaaccaac ccaaggtttg gggggaaaga gctcagagat gaggaacaga cagctgagag 1021 cattaagaac caaatgactg tgaaagaatg ggagaaagtg tttgagatga gtcaagataa 1081 aaatctatac cacaatcttt gtaccagcct gttccctact atacatggca atgatgaagt 1141 aaaacggggt gtcctgctga tgctctttgg tggcgtttcc aaagacaaca ggagaaggga 1201 cctctcttca ggggacataa atgtttgcat tgttggtgac ccaagtacag ctaagagcca 1261 atttctcaag cacgtggagg agttcagccc cagagctgtc tacaccagtg gtaaagcgtc 1321 cagtgctgct ggcttaacag cagctgttgt gagagatgaa gaatctcatg agtttgtcat 1381 tgaggctgga gctttgatgt tggctgataa tggtgtgtgt tgtattgatg aatttgataa 1441 gatggacgtg cgggatcaag ttgctattca tgaagctatg gaacagcaga ccatatccat 1501 cactaaagca ggagtgaagg ctactctgaa cacccggacg tccattttgg cagcagcaaa 1561 cccaatcagt ggacactatg acagatcaaa atcattgaaa cagaatataa atttgtcagc 1621 tcccatcatg tcccgattcg atctcttctt tatccttgtg gatgaatgta atgaggttac 1681 agattatgcc attgccaggc gcatagtaga tttgcattca agaattgagg aatcaattga 1741 tcgtgtctat tccctcgatg atatcagaag atatcttctc tttgcaagac agtttaaacc 1801 caagatttcc aaagagtcag aggacttcat tgtggagcaa tataaacatc tccgccagag 1861 agatggttct ggagtgacca agtcttcatg gaggattaca gtgcgacagc ttgagagcat 1921 gattcgtctc tctgaagcta tggctcggat gcactgctgt gatgaggtcc aacctaaaca 1981 tgtgaaggaa gctttccggt tactgaataa atcaatcatc cgtgtggaaa cacctgatgt 2041 caatctagat caagaggaag agatccagat ggaggtagat gagggtgctg gtggcatcaa 2101 tggtcatgct gacagccctg ctcctgtgaa cgggatcaat ggctacaatg aagacataaa 2161 tcaagagtct gctcccaaag cctccttaag gctgggcttc tctgagtact gccgaatctc 2221 taaccttatt gtgcttcacc tcagaaaggt ggaagaagaa gaggacgagt cagcattaaa 2281 gaggagcgag cttgttaact ggtacttgaa ggaaatcgaa tcagagatag actctgaaga 2341 agaacttata aataaaaaaa gaatcataga gaaagttatt catcgactca cacactatga 2401 tcatgttcta attgagccca cccaggctgg attgaaaggc tccacagagg gaagtgagag 2461 ctatgaagaa gatccctact tggtagttaa ccctaactac ttgctcgaag attgagatag 2521 tgaaagtaac tgaccagagc tgaggaactg tggcacagca cctcgtggcc tggagcctgg 2581 ctggagctct gctagggaca gaagtgtttc tggaagtgat gcttccagga tttgttttca 2641 gaaacaagaa ttgagttgat ggtcctatgt gtcacattca tcacaggttt cataccaaca 2701 caggcttcag cacttccttt ggtgtgtttc ctgtcccagt gaagttggaa ccaaataatg 2761 tgtagtctct ataaccaata cctttgtttt catgtgtaag aaaaggccca ttacttttaa 2821 ggtatgtgct gtcctattga gcaaataact ttttttcaat tgccagctac tgcttttatt 2881 catcaaaata aaataacttg ttctgaaaaa aaa // LOCUS HSU46922 1095 bp mRNA PRI 08-MAY-1996 DEFINITION Human FHIT mRNA, complete cds. ACCESSION U46922 NID g1203835 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1095) AUTHORS Kastury,K., Baffa,R., Druck,T., Ohta,M., Cotticelli,M.G., Inoue,H., Negrini,M., Rugge,M., Huang,D., Croce,C.M., Palazzo,J. and Huebner,K. TITLE Potential Gastrointestinal Tumor Suppressor Locus at the 3p14.2 FRA3B Site Identified by Homozygous Deletions in Tumor Cell Lines JOURNAL Cancer Res. (1996) In press REFERENCE 2 (bases 1 to 1095) AUTHORS Ohta,M., Hiroshi,I., Cotticelli,M.G., Kastury,K., Baffa,R., Palazzo,J., Siprashvili,Z., Mori,M., McCue,P., Druck,T., Croce,C.M. and Huebner,K. TITLE The FHIT gene, spanning the chromosome 3p14.2 fragile site and renal carcinoma-associated t(3;8) breakpoint, is abnormal in digestive tract cancers JOURNAL Cell 84 (4), 587-597 (1996) MEDLINE 96178471 REFERENCE 3 (bases 1 to 1095) AUTHORS Ohta,M., Hiroshi,I., Cotticelli,M.G., Kastury,K., Baffa,R., Palazzo,J., Siprashvili,Z., Mori,M., McCue,P., Druck,T., Croce,C.M. and Huebner,K. TITLE Direct Submission JOURNAL Submitted (19-JAN-1996) Teresa Druck, JCI, Thomas Jefferson University, 233 S. 10th St., Philadelphia, PA 19107, USA FEATURES Location/Qualifiers source 1..1095 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="3" /map="3p14.2" gene 363..806 /gene="FHIT" CDS 363..806 /gene="FHIT" /note="member of the histidine triad (HIT) gene family; similar to the S. pombe diadenosine 5',5'''-P1,P4-tetraphosphate asymmetrical hydrolase" /codon_start=1 /db_xref="PID:g1203836" /translation="MSFRFGQHLIKPSVVFLKTELSFALVNRKPVVPGHVLVCPLRPV ERFHDLRPDEVADLFQTTQRVGTVVEKHFHGTSLTFSMQDGPEAGQTVKHVHVHVLPR KAGDFHRNDSIYEELQKHDKEDFPASWRSEEEMAAEAAALRVYFQ" BASE COUNT 274 a 313 c 252 g 256 t ORIGIN 1 tccccgctct gctctgtccg gtcacaggac tttttgccct ctgttcccgg gtccctcagg 61 cggccaccca gtgggcacac tcccaggcgg cgctccggcc ccgcgctccc tccctctgcc 121 tttcattccc agctgtcaac atcctggaag ctttgaagct caggaaagaa gagaaatcca 181 ctgagaacag tctgtaaagg tccgtagtgc tatctacatc cagacggtgg aagggagaga 241 aagagaaaga aggtatccta ggaatacctg cctgcttaga ccctctataa aagctctgtg 301 catcctgcca ctgaggactc cgaagaggta gcagtcttct gaaagacttc aactgtgagg 361 acatgtcgtt cagatttggc caacatctca tcaagccctc tgtagtgttt ctcaaaacag 421 aactgtcctt cgctcttgtg aataggaaac ctgtggtacc aggacatgtc cttgtgtgcc 481 cgctgcggcc agtggagcgc ttccatgacc tgcgtcctga tgaagtggcc gatttgtttc 541 agacgaccca gagagtcggg acagtggtgg aaaaacattt ccatgggacc tctctcacct 601 tttccatgca ggatggcccc gaagccggac agactgtgaa gcacgttcac gtccatgttc 661 ttcccaggaa ggctggagac tttcacagga atgacagcat ctatgaggag ctccagaaac 721 atgacaagga ggactttcct gcctcttgga gatcagagga ggaaatggca gcagaagccg 781 cagctctgcg ggtctacttt cagtgacaca gatgtttttc agatcctgaa ttccagcaaa 841 agagctattg ccaaccagtt tgaagaccgc ccccccgcct ctccccaaga ggaactgaat 901 cagcatgaaa atgcagtttc ttcatctcac catcctgtat tcttcaacca gtgatccccc 961 acctcggtca ctccaactcc cttaaaatac ctagacctaa acggctcaga caggcagatt 1021 tgaggtttcc ccctgtctcc ttattcggca gccttatgat taaacttcct tctctgctgc 1081 aaaaaaaaaa aaaaa // LOCUS HSU47050 3417 bp mRNA PRI 04-AUG-1997 DEFINITION Human putative calcium influx channel (htrp3) mRNA, complete cds. ACCESSION U47050 NID g2295902 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3417) AUTHORS Zhu,X., Jiang,M., Peyton,M., Boulay,G., Hurst,R., Stefani,E. and Birnbaumer,L. TITLE trp, a novel mammalian gene family essential for agonist-activated capacitative Ca2+ entry JOURNAL Cell 85 (5), 661-671 (1996) MEDLINE 96234226 REFERENCE 2 (bases 1 to 3417) AUTHORS Zhu,X., Peyton,M. and Birnbaumer,L. TITLE Direct Submission JOURNAL Submitted (24-JAN-1996) Xi Zhu, Anesthesiology, UCLA School of Medicine, BH-612, CHS, Los Angeles, CA 90095-1778, USA FEATURES Location/Qualifiers source 1..3417 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..3417 /gene="htrp3" CDS 163..2709 /gene="htrp3" /note="Htrp-3; putative calcium influx channel; stimulated by agonist to some G protein coupled receptors; similar to Drosophila transient receptor potential" /codon_start=1 /product="calcium influx channel" /db_xref="PID:g2295903" /translation="MEGSPSLRRMTVMREKGRRQAVRGPAFMFNDRGTSLTAEEERFL DAAEYGNIPVVRKMLEESKTLNVNCVDYMGQNALQLAVGNEHLEVTELLLKKENLARI GDALLLAISKGYVRIVEAILNHPGFAASKRLTLSPCEQELQDDDFYAYDEDGTRFSPD ITPIILAAHCQKYEVVHMLLMKGARIERPHDYFCKCGDCMEKQRHDSFSHSRSRINAY KGLASPAYLSLSSEDPVLTALELSNELAKLANIEKEFKNDYRKLSMQCKDFVVGVLDL CRDSEEVEAILNGDLESAEPLEVHRHKASLSRVKLAIKYEVKKFVAHPNCQQQLLTIW YENLSGLREQTIAIKCLVVLVVALGLPFLAIGYWIAPCSRLGKILRSPFMKFVAHAAS FIIFLGLLVFNASDRFEGITTLPNITVTDYPKQIFRVKTTQFTWTEMLIMVWVLGMMW SECKELWLEGPREYILQLWNVLDFGMLSIFIAAFTARFLAFLQATKAQQYVDSYVQES DLSEVTLPPEIQYFTYARDKWLPSDPQIISEGLYAIAVVLSFSRIAYILPANESFGPL QISLGRTVKDIFKFMVLFIMVFFAFMIGMFILYSYYLGAKVNAAFTTVEESFKTLFWS IFGLSEVTSVVLKYDHKFIENIGYVLYGIYNVTMVVVLLNMLIAMINSSYQEIEDDSD VEWKFARSKLWLSYFDDGKTLPPPFSLVPSPKSFVYFIMRIVNFPKCRRRRLQKDIEM GMGNSKSRLNLFTQSNSRVFESHSFNSILNQPTRYQQIMKRLIKRYVLKAQVDKENDE VNEGELKEIKQDISSLRYELLEDKSQATEELAILIHKLSEKLNPSMLRCE" BASE COUNT 965 a 744 c 769 g 939 t ORIGIN 1 agagagtgct cttggaatat tgttgaggca gattgcatga actgaaagct ccttctaatt 61 aacctggagc caagtgaacc tgaatactgg atatctcatg ttctaacacg ggataaattc 121 aagttagaaa aagacaaaat attgaaatgc ttctctaggt ccatggaggg aagcccatcc 181 ctgagacgca tgacagtgat gcgggagaag ggccggcgcc aggctgtcag gggcccggcc 241 ttcatgttca atgaccgcgg caccagcctc accgccgagg aggagcgctt cctcgacgcc 301 gccgagtacg gcaacatccc agtggtgcgc aagatgctgg aggagtccaa gacgctgaac 361 gtcaactgcg tggactacat gggccagaac gcgctgcagc tggctgtggg caacgagcac 421 ctggaggtga ccgagctgct gctcaagaag gagaacctgg cgcgcattgg cgacgccctg 481 ctgctcgcca tcagcaaggg ctacgtgcgc attgtagagg ccatcctcaa ccaccctggc 541 ttcgcggcca gcaagcgtct cactctgagc ccctgtgagc aggagctgca ggacgacgac 601 ttctacgctt acgatgagga cggcacgcgc ttctcgccgg acatcacccc catcatcctg 661 gcggcgcact gccagaaata cgaagtggtg cacatgctgc tgatgaaggg tgccaggatc 721 gagcggccgc acgactattt ctgcaagtgc ggggactgca tggagaagca gaggcacgac 781 tccttcagcc actcacgctc gaggatcaat gcctacaagg ggctggccag cccggcttac 841 ctctcattgt ccagcgagga cccggtgctt acggccctag agctcagcaa cgagctggcc 901 aagctggcca acatagagaa ggagttcaag aatgactatc ggaagctctc catgcaatgc 961 aaagactttg tagtgggtgt gctggatctc tgccgagact cagaagaggt agaagccatt 1021 ctgaatggag atctggaatc agcagagcct ctggaggtac acaggcacaa agcttcatta 1081 agtcgtgtca aacttgccat taagtatgaa gtcaaaaagt ttgtggctca tcccaactgc 1141 cagcagcagc tcttgacgat ctggtatgag aacctctcag gcctaaggga gcagaccata 1201 gctatcaagt gtctcgttgt gctggtcgtg gccctgggcc ttccattcct ggccattggc 1261 tactggatcg caccttgcag caggctgggg aaaattctgc gaagcccttt tatgaagttt 1321 gtagcacatg cagcttcttt catcatcttc ctgggtctgc ttgtgttcaa tgcctcagac 1381 aggttcgaag gcatcaccac gctgcccaat atcacagtta ctgactatcc caaacagatc 1441 ttcagggtga aaaccaccca gtttacatgg actgaaatgc taattatggt ctgggttctt 1501 ggaatgatgt ggtctgaatg taaagagctc tggctggaag gacctaggga atacattttg 1561 cagttgtgga atgtgcttga ctttgggatg ctgtccatct tcattgctgc tttcacagcc 1621 agattcctag ctttccttca ggcaacgaag gcacaacagt atgtggacag ttacgtccaa 1681 gagagtgacc tcagtgaagt gacactccca ccagagatac agtatttcac ttatgctaga 1741 gataaatggc tcccttctga ccctcagatt atatctgaag gcctttatgc catagctgtt 1801 gtgctcagct tctctcggat tgcgtacatc ctccctgcaa atgagagctt tggccccctg 1861 cagatctctc ttggaaggac tgtaaaggac atattcaagt tcatggtcct ctttattatg 1921 gtgttttttg cctttatgat tggcatgttc atactttatt cttactacct tggggctaaa 1981 gttaatgctg cttttaccac tgtagaagaa agtttcaaga ctttattttg gtcaatattt 2041 gggttgtctg aagtgacttc cgttgtgctc aaatatgatc acaaattcat agaaaatatt 2101 ggatacgttc tttatggaat atacaatgta actatggtgg tcgttttact caacatgcta 2161 attgctatga ttaatagctc atatcaagaa attgaggatg acagtgatgt agaatggaag 2221 tttgctcgtt caaaactttg gttatcctat tttgatgatg gaaaaacatt acctccacct 2281 ttcagtctag ttcctagtcc aaaatcattt gtttatttca tcatgcgaat tgttaacttt 2341 cccaaatgca gaaggagaag acttcagaag gatatagaaa tgggaatggg taactcaaag 2401 tccaggttaa acctcttcac tcagtctaac tcaagagttt ttgaatcaca cagttttaac 2461 agcattctca atcagccaac acgttatcag cagataatga aaagacttat aaagcggtat 2521 gtcttgaaag cacaagtaga caaagaaaat gatgaagtta atgaaggtga attaaaagaa 2581 atcaagcaag atatctccag ccttcgttat gaacttttgg aagacaagag ccaagcaact 2641 gaggaattag ccattctaat tcataaactt agtgagaaac tgaatcccag catgctgaga 2701 tgtgaatgat gcagcaacct ggatttggct ttgactatag cacaaatgtg ggcaataata 2761 tttctaagta tgaaatactt gaaaaactat gatgtaaatt tttagtatta actaccttta 2821 tcatgtgaac ctttaaaagt tagctcttaa tggtttattg tttatcacat gaaaatgcat 2881 tttatttgtc tgctttgaca ttacagtggc ataccattgt gttgaaaagc ccaatattac 2941 tatattattg aaacttttat tcattttaga gtaaactcca catctttgca ctacctgttt 3001 gcctccaaga gactatcagt tccttgggga cagggaccat gtcttattca tctttgtgtc 3061 tccagcatct agtacagtgc ctggtatata gtaggtgctc aataaatgtt gaaaccaact 3121 gaactgccaa caaaataaaa ataaaaagtc ttcactatgt agcatacctt cccttgtcca 3181 agttctgaag aggttttttt tttttttttt ttaatagaaa ctgaagacat tttacaacca 3241 gctatgactt ggtaagacat tcttagaatt ttaggtgtca ctgataatcc tagaaccact 3301 gagccccaag tgaagaattt aacaacaaaa tgggttaatg aaaaatataa ttacattgta 3361 tatttaagtt tcatagaatt atttaaaaca acacattaaa gatttttcta aaatatg // LOCUS HSU47077 13506 bp mRNA PRI 08-JAN-1997 DEFINITION Human DNA-dependent protein kinase catalytic subunit (DNA-PKcs) mRNA, complete cds. ACCESSION U47077 NID g1765937 KEYWORDS DNAPKcs; serine/threonine protein kinase; double-strand break repair; V(D)J recombination; mouse SCID product. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 13506) AUTHORS Hartley,K.O., Gell,D., Smith,G.C., Zhang,H., Divecha,N., Connelly,M.A., Admon,A., Lees-Miller,S.P., Anderson,C.W. and Jackson,S.P. TITLE DNA-dependent protein kinase catalytic subunit: a relative of phosphatidylinositol 3-kinase and the ataxia telangiectasia gene product JOURNAL Cell 82 (5), 849-856 (1995) MEDLINE 95401275 REFERENCE 2 (bases 1 to 13506) AUTHORS Connelly,M.A., Zhang,H., Kieleczawa,J. and Anderson,C.W. TITLE Alternate splice-site utilization in the gene for the catalytic subunit of the DNA-activated protein kinase, DNA-PKcs JOURNAL Gene 175 (1-2), 271-273 (1996) MEDLINE 97074683 REFERENCE 3 (bases 1 to 13506) AUTHORS Anderson,C.W. TITLE Direct Submission JOURNAL Submitted (25-JAN-1996) Biology, Brookhaven National Laboratory, 50 Bell Avenue, Upton, NY 11973-5000, USA FEATURES Location/Qualifiers source 1..13506 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="8" /map="8q11" /cell_line="Daudi and CCRF-CEM" /clone_lib="Clontech HL-1117a and HL-10638" gene 58..12441 /gene="DNA-PKcs" CDS 58..12441 /gene="DNA-PKcs" /codon_start=1 /product="DNA-dependent protein kinase catalytic subunit" /db_xref="PID:g1765938" /translation="MAGSGAGVRCSLLRLQETLSAADRCGAALAGHQLIRGLGQECVL SSSPAVLALQTSLVFSRDFGLLVFVRKSLNSIEFRECREEILKFLCIFLEKMGQKIAP YSVEIKNTCTSVYTKDRAAKCKIPALDLLIKLLQTFRSSRLMDEFKIGELFSKFYGEL ALKKKIPDTVLEKVYELLGLLGEVHPSEMINNAENLFRAFLGELKTQMTSAVREPKLP VLAGCLKGLSSLLCNFTKSMEEDPQTSREIFNFVLKAIRPQIDLKRYAVPSAGLRLFA LHASQFSTCLLDNYVSLFEVLLKWCAHTNVELKKAALSALESFLKQVSNMVAKNAEMH KNKLQYFMEQFYGIIRNVDSNNKELSIAIRGYGLFAGPCKVINAKDVDFMYVELIQRC KQMFLTQTDTGDYRVYQMPSFLQSVASVLLYLDTVPEVYTPVLEHLVVMQIDSFPQYS PKMQLVCCRAIVKVFLALAAKGPVLRNCISTVVHQGLIRICSKPVVLPKGPESESEDH RASGEVRTGKWKVPTYKDYVDLFRHLLSSDQMMDSILADEAFFSVNSSSESLNHLLYD EFVKSVLKIVEKLDLTLEIQTVGEQENGDEAPGVWMIPTSDPAANLHPAKPKDFSAFI NLVEFCREILPEKQAEFFEPWVYSFSYELILQSTRLPLISGFYKLLSITVRNAKKIKY FEGVSPKSLKHSPEDPEKYSCFALFVKFGKEVAVKMKQYKDELLASCLTFLLSLPHNI IELDVRAYVPALQMAFKLGLSYTPLAEVGLNALEEWSIYIDRHVMQPYYKDILPCLDG YLKTSALSDETKNNWEVSALSRAAQKGFNKVVLKHLKKTKNLSSNEAISLEEIRIRVV QMLGSLGGQINKNLLTVTSSDEMMKSYVAWDREKRLSFAVPFREMKPVIFLDVFLPRV TELALTASDRQTKVAACELLHSMVMFMLGKATQMPEGGQGAPPMYQLYKRTFPVLLRL ACDVDQVTRQLYEPLVMQLIHWFTNNKKFESQDTVSLLEAILDGIVDPVDSTLRDFCG RCIREFLKWSIKQITPQQQEKSPVNTKSLFKRLYSLALHPNAFKRLGASLAFNNIYRE FREEESLVEQFVFEALVIYMESLALAHADEKSLGTIQQCCDAIDHLCRIIEKKHVSLN KAKKRRLPRGFPPSASLCLLDLVKWLLAHCGRPQTECRHKSIELFYKFVPLLPGNRSP NLWLKDVLKEEGVSFLINTFEGGGCGQPSGILAQPTLLYLRGPFSLQATLCWLDLLLA ALECYNTFIGERTVGALQVLGTEAQSSLLKAVAFFLESIAMHDIIAAEKCFGTGAAGN RTSPQEGERYNYSKCTVVVRIMEFTTTLLNTSPEGWKLLKKDLCNTHLMRVLVQTLCE PASIGFNIGDVQVMAHLPDVCVNLMKALKMSPYKDILETHLREKITAQSIEELCAVNL YGPDAQVDRSRLAAVVSACKQLHRAGLLHNILPSQSTDLHHSVGTELLSLVYKGIAPG DERQCLPSLDLSCKQLASGLLELAFAFGGLCERLVSLLLNPAVLSTASLGSSQGSVIH FSHGEYFYSLFSETINTELLKNLDLAVLELMQSSVDNTKMVSAVLNGMLDQSFRERAN QKHQGLKLATTILQHWKKCDSWWAKDSPLETKMAVLALLAKILQIDSSVSFNTSHGSF PEVFTTYISLLADTKLDLHLKGQAVTLLPFFTSLTGGSLEELRRVLEQLIVAHFPMQS REFPPGTPRFNNYVDCMKKFLDALELSQSPMLLELMTEVLCREQQHVMEELFQSSFRR IARRGSCVTQVGLLESVYEMFRKDDPRLSFTRQSFVDRSLLTLLWHCSLDALREFFST IVVDAIDVLKSRFTKLNESTFDTQITKKMGYYKILDVMYSRLPKDDVHAKESKINQVF HGSCITEGNELTKTLIKLCYDAFTENMAGENQLLERRRLYHCAAYNCAISVICCVFNE LKFYQGFLFSEKPEKNLLIFENLIDLKRRYNFPVEVEVPMERKKKYIEIRKEAREAAN GDSDGPSYMSSLSYLADSTLSEEMSQFDFSTGVQSYSYSSQDPRPATGRFRRREQRDP TVHDDVLELEMDELNRHECMAPLTALVKHMHRSLGPPQGEEDSVPRDLPSWMKFLHGK LGNPIVPLNIRLFLAKLVINTEEVFRPYAKHWLSPLLQLAASENNGGEGIHYMVVEIV ATILSWTGLATPTGVPKDEVLANRLLNFLMKHVFHPKRAVFRHNLEIIKTLVECWKDC LSIPYRLIFEKFSGKDPNSKDNSVGIQLLGIVMANDLPPYDPQCGIQSSEYFQALVNN MSFVRYKEVYAAAAEVLGLILRYVMERKNILEESLCELVAKQLKQHQNTMEDKFIVCL NKVTKSFPPLADRFMNAVFFLLPKFHGVLKTLCLEVVLCRVEGMTELYFQLKSKDFVQ VMRHRDERQKVCLDIIYKMMPKLKPVELRELLNPVVEFVSHPSTTCREQMYNILMWIH DNYRDPESETDNDSQEIFKLAKDVLIQGLIDENPGLQLIIRNFWSHETRLPSNTLDRL LALNSLYSPKIEVHFLSLATNFLLEMTSMSPDYPNPMFEHPLSECEFQEYTIDSDWRF RSTVLTPMFVETQASQGTLQTRTQEGSLSARWPVAGQIRATQQQHDFTLTQTADGRSS FDWLTGSSTDPLVDHTSPSSDSLLFAHKRSERLQRAPLKSVGPDFGKKRLGLPGDEVD NKVKGAAGRTDLLRLRRRFMRDQEKLSLMYARKGVAEQKREKEIKSELKMKQDAQVVL YRSYRHGDLPDIQIKHSSLITPLQAVAQRDPIIAKQLFSSLFSGILKEMDKFKTLSEK NNITQKLLQDFNRFLNTTFSFFPPFVSCIQDISCQHAALLSLDPAAVSAGCLASLQQP VGIRLLEEALLRLLPAELPAKRVRGKARLPPDVLRWVELAKLYRSIGEYDVLRGIFTS EIGTKQITQSALLAEARSDYSEAAKQYDEALNKQDWVDGEPTEAEKDFWELASLDCYN HLAEWKSLEYCSTASIDSENPPDLNKIWSEPFYQETYLPYMIRSKLKLLLQGEADQSL LTFIDKAMHGELQKAILELHYSQELSLLYLLQDDVDRAKYYIQNGIQSFMQNYSSIDV LLHQSRLTKLQSVQALTEIQEFISFISKQGNLSSQVPLKRLLNTWTNRYPDAKMDPMN IWDDIITNRCFFLSKIEEKLTPLPEDNSMNVDQDGDPSDRMEVQEQEEDISSLIRSCK FSMKMKMIDSARKQNNFSLAMKLLKELHKESKTRDDWLVSWVQSYCRLSHCRSRSQGC SEQVLTVLKTVSLLDENNVSSYLSKNILAFRDQNILLGTTYRIIANALSSEPACLAEI EEDKARRILELSGSSSEDSEKVIAGLYQRAFQHLSEAVQAAEEEAQPPSWSCGPAAGV IDAYMTLADFCDQQLRKEEENASVTDSAELQAYPALVVEKMLKALKLNSNEARLKFPR LLQIIERYPEETLSLMTKEISSVPCWQFISWISHMVALLDKDQAVAVQHSVEEITDNY PQAIVYPFIISSESYSFKDTSTGHKNKEFVARIKSKLDQGGVIQDFINALDQLSNPEL LFKDWSNDVRAELAKTPVNKKNIEKMYERMYAALGDPKAPGLGAFRRKFIQTFGKEFD KHFGKGGSKLLRMKLSDFNDITNMLLLKMNKDSKPPGNLKECSPWMSDFKVEFLRNEL EIPGQYDGRGKPLPEYHVRIAGFDERVTVMASLRRPKRIIIRGHDEREHPFLVKGGED LRQDQRVEQLFQVMNGILAQDSACSQRALQLRTYSVVPMTSRLGLIEWLENTVTLKDL LLNTMSQEEKAAYLSDPRAPPCEYKDWLTKMSGKHDVGAYMLMYKGANRTETVTSFRK RESKVPADLLKRAFVRMSTSPEAFLALRSHFASSHALICISHWILGIGDRHLNNFMVA METGGVIGIDFGHAFGSATQFLPVPELMPFRLTRQFINLMLPMKETGLMYSIMVHALR AFRSDPGLLTNTMDVFVKEPSFDWKNFEQKMLKKGGSWIQEINVAEKNWYPRQKICYA KRKLAGANPAVITCDELLLGHEKAPAFRDYVAVARGSKDHNIRAQEPESGLSEETQVK CLMDQATDPNILGRTWEGWEPWM" exon 11451..11544 /gene="DNA-PKcs" /note="exon occasionally removed by improper splicing; alternatively spliced transcript without this exon deposited in GenBank Accession Number U34994" BASE COUNT 3823 a 2889 c 3223 g 3571 t ORIGIN 1 ggggcatttc cgggtccggg ccgagcgggc gcacgcgcgg gagcgggact cggcggcatg 61 gcgggctccg gagccggtgt gcgttgctcc ctgctgcggc tgcaggagac cttgtccgct 121 gcggaccgct gcggtgctgc cctggccggt catcaactga tccgcggcct ggggcaggaa 181 tgcgtcctga gcagcagccc cgcggtgctg gcattacaga catctttagt tttttccaga 241 gatttcggtt tgcttgtatt tgtccggaag tcactcaaca gtattgaatt tcgtgaatgt 301 agagaagaaa tcctaaagtt tttatgtatt ttcttagaaa aaatgggcca gaagatcgca 361 ccttactctg ttgaaattaa gaacacttgt accagtgttt atacaaaaga tagagctgct 421 aaatgtaaaa ttccagccct ggaccttctt attaagttac ttcagacttt tagaagttct 481 agactcatgg atgaatttaa aattggagaa ttatttagta aattctatgg agaacttgca 541 ttgaaaaaaa aaataccaga tacagtttta gaaaaagtat atgagctcct aggattattg 601 ggtgaagttc atcctagtga gatgataaat aatgcagaaa acctgttccg cgcttttctg 661 ggtgaactta agacccagat gacatcagca gtaagagagc ccaaactacc tgttctggca 721 ggatgtctga aggggttgtc ctcacttctg tgcaacttca ctaagtccat ggaagaagat 781 ccccagactt caagggagat ttttaatttt gtactaaagg caattcgtcc tcagattgat 841 ctgaagagat atgctgtgcc ctcagctggc ttgcgcctat ttgccctgca tgcatctcag 901 tttagcacct gccttctgga caactacgtg tctctatttg aagtcttgtt aaagtggtgt 961 gcccacacaa atgtagaatt gaaaaaagct gcactttcag ccctggaatc ctttctgaaa 1021 caggtttcta atatggtggc gaaaaatgca gaaatgcata aaaataaact gcagtacttt 1081 atggagcagt tttatggaat catcagaaat gtggattcga acaacaagga gttatctatt 1141 gctatccgtg gatatggact ttttgcagga ccgtgcaagg ttataaacgc aaaagatgtt 1201 gacttcatgt acgttgagct cattcagcgc tgcaagcaga tgttcctcac ccagacagac 1261 actggtgact accgtgttta tcagatgcca agcttcctcc agtctgttgc aagcgtcttg 1321 ctgtaccttg acacagttcc tgaggtgtat actccagttc tggagcacct cgtggtgatg 1381 cagatagaca gtttcccaca gtacagtcca aaaatgcagc tggtgtgttg cagagccata 1441 gtgaaggtgt tcctagcttt ggcagcaaaa gggccagttc tcaggaattg cattagtact 1501 gtggtgcatc agggtttaat cagaatatgt tctaaaccag tggtccttcc aaagggccct 1561 gagtctgaat ctgaagacca ccgtgcttca ggggaagtca gaactggcaa atggaaggtg 1621 cccacataca aagactacgt ggatctcttc agacatctcc tgagctctga ccagatgatg 1681 gattctattt tagcagatga agcatttttc tctgtgaatt cctccagtga aagtctgaat 1741 catttacttt atgatgaatt tgtaaaatcc gttttgaaga ttgttgagaa attggatctt 1801 acacttgaaa tacagactgt tggggaacaa gagaatggag atgaggcgcc tggtgtttgg 1861 atgatcccaa cttcagatcc agcggctaac ttgcatccag ctaaacctaa agatttttcg 1921 gctttcatta acctggtgga attttgcaga gagattctcc ctgagaaaca agcagaattt 1981 tttgaaccat gggtgtactc attttcatat gaattaattt tgcaatctac aaggttgccc 2041 ctcatcagtg gtttctacaa attgctttct attacagtaa gaaatgccaa gaaaataaaa 2101 tatttcgagg gagttagtcc aaagagtctg aaacactctc ctgaagaccc agaaaagtat 2161 tcttgctttg ctttatttgt gaaatttggc aaagaggtgg cagttaaaat gaagcagtac 2221 aaagatgaac ttttggcctc ttgtttgacc tttcttctgt ccttgccaca caacatcatt 2281 gaactcgatg ttagagccta cgttcctgca ctgcagatgg ctttcaaact gggcctgagc 2341 tataccccct tggcagaagt aggcctgaat gctctagaag aatggtcaat ttatattgac 2401 agacatgtaa tgcagcctta ttacaaagac attctcccct gcctggatgg atacctgaag 2461 acttcagcct tgtcagatga gaccaagaat aactgggaag tgtcagctct ttctcgggct 2521 gcccagaaag gatttaataa agtggtgtta aagcatctga agaagacaaa gaacctttca 2581 tcaaacgaag caatatcctt agaagaaata agaattagag tagtacaaat gcttggatct 2641 ctaggaggac aaataaacaa aaatcttctg acagtcacgt cctcagatga gatgatgaag 2701 agctatgtgg cctgggacag agagaagcgg ctgagctttg cagtgccctt tagagagatg 2761 aaacctgtca ttttcctgga tgtgttcctg cctcgagtca cagaattagc gctcacagcc 2821 agtgacagac aaactaaagt tgcagcctgt gaacttttac atagcatggt tatgtttatg 2881 ttgggcaaag ccacgcagat gccagaaggg ggacagggag ccccacccat gtaccagctc 2941 tataagcgga cgtttcctgt gctgcttcga cttgcgtgtg atgttgatca ggtgacaagg 3001 caactgtatg agccactagt tatgcagctg attcactggt tcactaacaa caagaaattt 3061 gaaagtcagg atactgtttc cttactagaa gctatattgg atggaattgt ggaccctgtt 3121 gacagtactt taagagattt ttgtggtcgg tgtattcgag aattccttaa atggtccatt 3181 aagcaaataa caccacagca gcaggagaag agtccagtaa acaccaaatc gcttttcaag 3241 cgactttata gccttgcgct tcaccccaat gctttcaaga ggctgggagc atcacttgcc 3301 tttaataata tctacaggga attcagggaa gaagagtctc tggtggaaca gtttgtgttt 3361 gaagccttgg tgatatacat ggagagtctg gccttagcac atgcagatga gaagtcctta 3421 ggtacaattc aacagtgttg tgatgccatt gatcacctat gccgcatcat tgaaaagaag 3481 catgtttctt taaataaagc aaagaaacga cgtttgccgc gaggatttcc accttccgca 3541 tcattgtgtt tattggatct ggtcaagtgg cttttagctc attgtgggag gccccagaca 3601 gaatgtcgac acaaatccat tgaactcttt tataaattcg ttcctttatt gccaggcaac 3661 agatccccta atttgtggct gaaagatgtt ctcaaggaag aaggtgtctc ttttctcatc 3721 aacacctttg aggggggtgg ctgtggccag ccctcgggca tcctggccca gcccaccctc 3781 ttgtaccttc gggggccatt cagcctgcag gccacgctat gctggctgga cctgctcctg 3841 gccgcgttgg agtgctacaa cacgttcatt ggcgagagaa ctgtaggagc gctccaggtc 3901 ctaggtactg aagcccagtc ttcacttttg aaagcagtgg ctttcttctt agaaagcatt 3961 gccatgcatg acattatagc agcagaaaag tgctttggca ctggggcagc aggtaacaga 4021 acaagcccac aagagggaga aaggtacaac tacagcaaat gcaccgttgt ggtccggatt 4081 atggagttta ccacgactct gctaaacacc tccccggaag gatggaagct cctgaagaag 4141 gacttgtgta atacacacct gatgagagtc ctggtgcaga cgctgtgtga gcccgcaagc 4201 ataggtttca acatcggaga cgtccaggtt atggctcatc ttcctgatgt ttgtgtgaat 4261 ctgatgaaag ctctaaagat gtccccatac aaagatatcc tagagaccca tctgagagag 4321 aaaataacag cacagagcat tgaggagctt tgtgccgtca acttgtatgg ccctgacgcg 4381 caagtggaca ggagcaggct ggctgctgtt gtgtctgcct gtaaacagct tcacagagct 4441 gggcttctgc ataatatatt accgtctcag tccacagatt tgcatcattc tgttggcaca 4501 gaacttcttt ccctggttta taaaggcatt gcccctggag atgagagaca gtgtctgcct 4561 tctctagacc tcagttgtaa gcagctggcc agcggacttc tggagttagc ctttgctttt 4621 ggaggactgt gtgagcgcct tgtgagtctt ctcctgaacc cagcggtgct gtccacggcg 4681 tccttgggca gctcacaggg cagcgtcatc cacttctccc atggggagta tttctatagc 4741 ttgttctcag aaacgatcaa cacggaatta ttgaaaaatc tggatcttgc tgtattggag 4801 ctcatgcagt cttcagtgga taataccaaa atggtgagtg ccgttttgaa cggcatgtta 4861 gaccagagct tcagggagcg agcaaaccag aaacaccaag gactgaaact tgcgactaca 4921 attctgcaac actggaagaa gtgtgattca tggtgggcca aagattcccc tctcgaaact 4981 aaaatggcag tgctggcctt actggcaaaa attttacaga ttgattcatc tgtatctttt 5041 aatacaagtc atggttcatt ccctgaagtc tttacaacat atattagtct acttgctgac 5101 acaaagctgg atctacattt aaagggccaa gctgtcactc ttcttccatt cttcaccagc 5161 ctcactggag gcagtctgga ggaacttaga cgtgttctgg agcagctcat cgttgctcac 5221 ttccccatgc agtccaggga atttcctcca ggaactccgc ggttcaataa ttatgtggac 5281 tgcatgaaaa agtttctaga tgcattggaa ttatctcaaa gccctatgtt gttggaattg 5341 atgacagaag ttctttgtcg ggaacagcag catgtcatgg aagaattatt tcaatccagt 5401 ttcaggagga ttgccagaag gggttcatgt gtcacacaag taggccttct ggaaagcgtg 5461 tatgaaatgt tcaggaagga tgacccccgc ctaagtttca cacgccagtc ctttgtggac 5521 cgctccctcc tcactctgct gtggcactgt agcctggatg ctttgagaga attcttcagc 5581 acaattgtgg tggatgccat tgatgtgttg aagtccaggt ttacaaagct aaatgaatct 5641 acctttgata ctcaaatcac caagaagatg ggctactata agattctaga cgtgatgtat 5701 tctcgccttc ccaaagatga tgttcatgct aaggaatcaa aaattaatca agttttccat 5761 ggctcgtgta ttacagaagg aaatgaactt acaaagacat tgattaaatt gtgctacgat 5821 gcatttacag agaacatggc aggagagaat cagctgctgg agaggagaag actttaccat 5881 tgtgcagcat acaactgcgc catatctgtc atctgctgtg tcttcaatga gttaaaattt 5941 taccaaggtt ttctgtttag tgaaaaacca gaaaagaact tgcttatttt tgaaaatctg 6001 atcgacctga agcgccgcta taattttcct gtagaagttg aggttcctat ggaaagaaag 6061 aaaaagtaca ttgaaattag gaaagaagcc agagaagcag caaatgggga ttcagatggt 6121 ccttcctata tgtcttccct gtcatatttg gcagacagta ccctgagtga ggaaatgagt 6181 caatttgatt tctcaaccgg agttcagagc tattcataca gctcccaaga ccctagacct 6241 gccactggtc gttttcggag acgggagcag cgggacccca cggtgcatga tgatgtgctg 6301 gagctggaga tggacgagct caatcggcat gagtgcatgg cgcccctgac ggccctggtc 6361 aagcacatgc acagaagcct gggcccgcct caaggagaag aggattcagt gccaagagat 6421 cttccttctt ggatgaaatt cctccatggc aaactgggaa atccaatagt accattaaat 6481 atccgtctct tcttagccaa gcttgttatt aatacagaag aggtctttcg cccttacgcg 6541 aagcactggc ttagcccctt gctgcagctg gctgcttctg aaaacaatgg aggagaagga 6601 attcactaca tggtggttga gatagtggcc actattcttt catggacagg cttggccact 6661 ccaacagggg tccctaaaga tgaagtgtta gcaaatcgat tgcttaattt cctaatgaaa 6721 catgtctttc atccaaaaag agctgtgttt agacacaacc ttgaaattat aaagaccctt 6781 gtcgagtgct ggaaggattg tttatccatc ccttataggt taatatttga aaagttttcc 6841 ggtaaagatc ctaattctaa agacaactca gtagggattc aattgctagg catcgtgatg 6901 gccaatgacc tgcctcccta tgacccacag tgtggcatcc agagtagcga atacttccag 6961 gctttggtga ataatatgtc ctttgtaaga tataaagaag tgtatgccgc tgcagcagaa 7021 gttctaggac ttatacttcg atatgttatg gagagaaaaa acatactgga ggagtctctg 7081 tgtgaactgg ttgcgaaaca attgaagcaa catcagaata ctatggagga caagtttatt 7141 gtgtgcttga acaaagtgac caagagcttc cctcctcttg cagacaggtt catgaatgct 7201 gtgttctttc tgctgccaaa atttcatgga gtgttgaaaa cactctgtct ggaggtggta 7261 ctttgtcgtg tggagggaat gacagagctg tacttccagt taaagagcaa ggacttcgtt 7321 caagtcatga gacatagaga tgaaagacaa aaagtatgtt tggacataat ttataagatg 7381 atgccaaagt taaaaccagt agaactccga gaacttctga accccgttgt ggaattcgtt 7441 tcccatcctt ctacaacatg tagggaacaa atgtataata ttctcatgtg gattcatgat 7501 aattacagag atccagaaag tgagacagat aatgactccc aggaaatatt taagttggca 7561 aaagatgtgc tgattcaagg attgatcgat gagaaccctg gacttcaatt aattattcga 7621 aatttctgga gccatgaaac taggttacct tcaaatacct tggaccggtt gctggcacta 7681 aattccttat attctcctaa gatagaagtg cactttttaa gtttagcaac aaattttctg 7741 ctcgaaatga ccagcatgag cccagattat ccaaacccca tgttcgagca tcctctgtca 7801 gaatgcgaat ttcaggaata taccattgat tctgattggc gtttccgaag tactgttctc 7861 actccgatgt ttgtggagac ccaggcctcc cagggcactc tccagacccg tacccaggaa 7921 gggtccctct cagctcgctg gccagtggca gggcagataa gggccaccca gcagcagcat 7981 gacttcacac tgacacagac tgcagatgga agaagctcat ttgattggct gaccgggagc 8041 agcactgacc cgctggtcga ccacaccagt ccctcatctg actccttgct gtttgcccac 8101 aagaggagtg aaaggttaca gagagcaccc ttgaagtcag tggggcctga ttttgggaaa 8161 aaaaggctgg gccttccagg ggacgaggtg gataacaaag tgaaaggtgc ggccggccgg 8221 acggacctac tacgactgcg cagacggttt atgagggacc aggagaagct cagtttgatg 8281 tatgccagaa aaggcgttgc tgagcaaaaa cgagagaagg aaatcaagag tgagttaaaa 8341 atgaagcagg atgcccaggt cgttctgtac agaagctacc ggcacggaga ccttcctgac 8401 attcagatca agcacagcag cctcatcacc ccgttacagg ccgtggccca gagggaccca 8461 ataattgcaa aacagctctt tagcagcttg ttttctggaa ttttgaaaga gatggataaa 8521 tttaagacac tgtctgaaaa aaacaacatc actcaaaagt tgcttcaaga cttcaatcgt 8581 tttcttaata ccaccttctc tttctttcca ccctttgtct cttgtattca ggacattagc 8641 tgtcagcacg cagccctgct gagcctcgac ccagcggctg ttagcgctgg ttgcctggcc 8701 agcctacagc agcccgtggg catccgcctg ctagaggagg ctctgctccg cctgctgcct 8761 gctgagctgc ctgccaagcg agtccgtggg aaggcccgcc tccctcctga tgtcctcaga 8821 tgggtggagc ttgctaagct gtatagatca attggagaat acgacgtcct ccgtgggatt 8881 tttaccagtg agataggaac aaagcaaatc actcagagtg cattattagc agaagccaga 8941 agtgattatt ctgaagctgc taagcagtat gatgaggctc tcaataaaca agactgggta 9001 gatggtgagc ccacagaagc cgagaaggat ttttgggaac ttgcatccct tgactgttac 9061 aaccaccttg ctgagtggaa atcacttgaa tactgttcta cagccagtat agacagtgag 9121 aaccccccag acctaaataa aatctggagt gaaccatttt atcaggaaac atatctacct 9181 tacatgatcc gcagcaagct gaagctgctg ctccagggag aggctgacca gtccctgctg 9241 acatttattg acaaagctat gcacggggag ctccagaagg cgattctaga gcttcattac 9301 agtcaagagc tgagtctgct ttacctcctg caagatgatg ttgacagagc caaatattac 9361 attcaaaatg gcattcagag ttttatgcag aattattcta gtattgatgt cctcttacac 9421 caaagtagac tcaccaaatt gcagtctgta caggctttaa cagaaattca ggagttcatc 9481 agctttataa gcaaacaagg caatttatca tctcaagttc cccttaagag acttctgaac 9541 acctggacaa acagatatcc agatgctaaa atggacccaa tgaacatctg ggatgacatc 9601 atcacaaatc gatgtttctt tctcagcaaa atagaggaga agcttacccc tcttccagaa 9661 gataatagta tgaatgtgga tcaagatgga gaccccagtg acaggatgga agtgcaagag 9721 caggaagaag atatcagctc cctgatcagg agttgcaagt tttccatgaa aatgaagatg 9781 atagacagtg cccggaagca gaacaatttc tcacttgcta tgaaactact gaaggagctg 9841 cataaagagt caaaaaccag agacgattgg ctggtgagct gggtgcagag ctactgccgc 9901 ctgagccact gccggagccg gtcccagggc tgctctgagc aggtgctcac tgtgctgaaa 9961 acagtctctt tgttggatga gaacaacgtg tcaagctact taagcaaaaa tattctggct 10021 ttccgtgacc agaacattct cttgggtaca acttacagga tcatagcgaa tgctctcagc 10081 agtgagccag cctgccttgc tgaaatcgag gaggacaagg ctagaagaat cttagagctt 10141 tctggatcca gttcagagga ttcagagaag gtgatcgcgg gtctgtacca gagagcattc 10201 cagcacctct ctgaggctgt gcaggcggct gaggaggagg cccagcctcc ctcctggagc 10261 tgtgggcctg cagctggggt gattgatgct tacatgacgc tggcagattt ctgtgaccaa 10321 cagctgcgca aggaggaaga gaatgcatca gttactgatt ctgcagaact gcaggcgtat 10381 ccagcacttg tggtggagaa aatgttgaaa gctttaaaat taaattccaa tgaagccaga 10441 ttgaagtttc ctagattact tcagattata gaacggtatc cagaggagac tttgagcctc 10501 atgacaaaag agatctcttc cgttccctgc tggcagttca tcagctggat cagccacatg 10561 gtggccttac tggacaaaga ccaagccgtt gctgttcagc actctgtgga agaaatcact 10621 gataactacc cgcaggctat tgtttatccc ttcatcataa gcagcgaaag ctattccttc 10681 aaggatactt ctactggtca taagaataag gagtttgtgg caaggattaa aagtaagttg 10741 gatcaaggag gagtgattca agattttatt aatgccttag atcagctctc taatcctgaa 10801 ctgctcttta aggattggag caatgatgta agagctgaac tagcaaaaac ccctgtaaat 10861 aaaaaaaaca ttgaaaaaat gtatgaaaga atgtatgcag ccttgggtga cccaaaggct 10921 ccaggcctgg gggcctttag aaggaagttt attcagactt ttggaaaaga atttgataaa 10981 cattttggga aaggaggttc taaactactg agaatgaagc tcagtgactt caacgacatt 11041 accaacatgc tacttttaaa aatgaacaaa gactcaaagc cccctgggaa tctgaaagaa 11101 tgttcaccct ggatgagcga cttcaaagtg gagttcctga gaaatgagct ggagattccc 11161 ggtcagtatg acggtagggg aaagccattg ccagagtacc acgtgcgaat cgccgggttt 11221 gatgagcggg tgacagtcat ggcgtctctg cgaaggccca agcgcatcat catccgtggc 11281 catgacgaga gggaacaccc tttcctggtg aagggtggcg aggacctgcg gcaggaccag 11341 cgcgtggagc agctcttcca ggtcatgaat gggatcctgg cccaagactc cgcctgcagc 11401 cagagggccc tgcagctgag gacctatagc gttgtgccca tgacctccag gttaggatta 11461 attgagtggc ttgaaaatac tgttaccttg aaggaccttc ttttgaacac catgtcccaa 11521 gaggagaagg cggcttacct gagtgatccc agggcaccgc cgtgtgaata taaagattgg 11581 ctgacaaaaa tgtcaggaaa acatgatgtt ggagcttaca tgctaatgta taagggcgct 11641 aatcgtactg aaacagtcac gtcttttaga aaacgagaaa gtaaagtgcc tgctgatctc 11701 ttaaagcggg ccttcgtgag gatgagtaca agccctgagg ctttcctggc gctccgctcc 11761 cacttcgcca gctctcacgc tctgatatgc atcagccact ggatcctcgg gattggagac 11821 agacatctga acaactttat ggtggccatg gagactggcg gcgtgatcgg gatcgacttt 11881 gggcatgcgt ttggatccgc tacacagttt ctgccagtcc ctgagttgat gccttttcgg 11941 ctaactcgcc agtttatcaa tctgatgtta ccaatgaaag aaacgggcct tatgtacagc 12001 atcatggtac acgcactccg ggccttccgc tcagaccctg gcctgctcac caacaccatg 12061 gatgtgtttg tcaaggagcc ctcctttgat tggaaaaatt ttgaacagaa aatgctgaaa 12121 aaaggagggt catggattca agaaataaat gttgctgaaa aaaattggta cccccgacag 12181 aaaatatgtt acgctaagag aaagttagca ggtgccaatc cagcagtcat tacttgtgat 12241 gagctactcc tgggtcatga gaaggcccct gccttcagag actatgtggc tgtggcacga 12301 ggaagcaaag atcacaacat tcgtgcccaa gaaccagaga gtgggctttc agaagagact 12361 caagtgaagt gcctgatgga ccaggcaaca gaccccaaca tccttggcag aacctgggaa 12421 ggatgggagc cctggatgtg aggtctgtgg gagtctgcag atagaaagca ttacattgtt 12481 taaagaatct actatacttt ggttggcagc attccatgag ctgattttcc tgaaacacta 12541 aagagaaatg tcttttgtgc tacagtttcg tagcatgagt ttaaatcaag attatgatga 12601 gtaaatgtgt atgggttaaa tcaaagataa ggttatagta acatcaaaga ttaggtgagg 12661 tttatagaaa gatagatatc caggcttacc aaagtattaa gtcaagaata taatatgtga 12721 tcagctttca aagcatttac aagtgctgca agttagtgaa acagctgtct ccgtaaatgg 12781 aggaaatgtg gggaagcctt ggaatgccct tctggttctg gcacattgga aagcacactc 12841 agaaggcttc atcaccaaga ttttgggaga gtaaagctaa gtatagttga tgtaacattg 12901 tagaagcagc ataggaacaa taagaacaat aggtaaagct ataattatgg cttatattta 12961 gaaatgactg catttgatat tttaggatat ttttctaggt tttttccttt cattttattc 13021 tcttctagtt ttgacatttt atgatagatt tgctctctag aaggaaacgt ctttatttag 13081 gagggcaaaa attttggtca tagcattcac ttttgctatt ccaatctaca actggaagat 13141 acataaaagt gctttgcatt gaatttggga taacttcaaa aatcccatgg ttgttgttag 13201 ggatagtact aagcatttca gttccaggag aataaaagaa attcctattt gaaatgaatt 13261 cctcatttgg aggaaaaaaa gcatgcattc tagcacaaca agatgaaatt atggaataca 13321 aaagtggctc cttcccatgt gcagtccctg tccccccccg ccagtcctcc acacccaaac 13381 tgtttctgat tggcttttag ctttttgttg tttttttttt tccttctaac acttgtattt 13441 ggaggctctt ctgtgatttt gagaagtata ctcttgagtg tttaataaag tttttttcca 13501 aaagta // LOCUS HSU47105 1215 bp mRNA PRI 03-JUL-1996 DEFINITION Human H105e3 mRNA, complete cds. ACCESSION U47105 NID g1401079 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1215) AUTHORS Levin,M.L., Chatterjee,A., Pragliola,A., Worley,K.C., Wehnert,M., Zhuchenko,O., Smith,R.F., Lee,C.C. and Herman,G.E. TITLE A comparative transcription map of the murine bare patches (Bpa) and striated (Str) critical regions and human Xq28 JOURNAL Genome Res. 6 (6), 465-477 (1996) MEDLINE 96425694 REFERENCE 2 (bases 1 to 1215) AUTHORS Levin,M.L. and Herman,G.E. TITLE Direct Submission JOURNAL Submitted (24-JAN-1996) Michael L. Levin, Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..1215 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /map="Xq28" /tissue_type="heart" gene 499..1026 /gene="H105e3" CDS 499..1026 /gene="H105e3" /codon_start=1 /db_xref="PID:g1401080" /translation="MKFVIGNGKNLVDFTFVENVVHGHILAAEQLSRDSTLGGKAFHI TNDEPIPFWTFLSRILTGLNYEAPKYHIPYWVAYYLALLLSLLVMVISPVIQLQPTFT PMRVALAGTFHYYSCERAKKAMGYQPLVTMDDAMERTVQNFPATAEGQVRDTGGWALS TRCSASHSFPCGLMK" BASE COUNT 308 a 323 c 297 g 287 t ORIGIN 1 tggcaagagg atatgctgtc aatgtatttg atatccagca agggtttgat aatccccagg 61 tgcggttctt tctgggtgac ctctgcagcc gacaggatct gtacccagct ctgaaaggtg 121 taaacacagt tttccactgt gcgtcacccc caccatccag taacaacaag gagctctttt 181 atagagtgaa ttacattggc accaagaatg tcattgaaac ttgcaaagag gctggggttc 241 agaaactcat tttaaccagc agtgccagtg tcatctttga gggcgtcgat atcaagaatg 301 gaactgaaga ccttccctat gccatgaaac ccattgacta ctacacagag actaagatct 361 tacaggagag ggcagttctg ggcgccaacg atcctgagaa gaatttctta accacagcca 421 tccgccctca tggcattttc ggcccaaggg acccgcagtt ggtaacccat cctcatcgag 481 gcagccagga acggcaagat gaagttcgtg attggaaatg ggaagaactt ggtggacttc 541 acctttgtgg agaacgtggt ccatggacac atcctggcgg cagagcagct ctcccgagac 601 tcgacactgg gtgggaaggc atttcacatc accaatgatg agcccatccc tttctggaca 661 ttcctgtctc gcatcctgac aggcctcaat tatgaggccc ccaagtacca catcccctac 721 tgggtggcct actacctggc cctcctgcta tccctgctgg tgatggtgat cagtcctgtc 781 atccagctgc agcccacctt cacacccatg cgggtcgcac tggctggcac attccactac 841 tacagctgcg agagagccaa aaaggccatg ggctaccagc cactagtgac catggatgat 901 gctatggaga ggaccgtgca gaactttccg gccactgcgg agggtcaagt gagggacact 961 ggaggctggg ctctctcgac acgttgctca gccagtcact ccttcccctg tggattgatg 1021 aaataacatc ctttgaatga gtttgctctg agcctgtgac tccttctgct aggcagagag 1081 cgcaccctac tctttccgtg acgatgaggg cggcaaaaac agacatttct tccttcatgg 1141 aactggattt ggatttcttg aagcaggcag cttcatatta taccgatttg ttctctgtca 1201 aaaaaaaaaa aaaaa // LOCUS HSU47621 2347 bp mRNA PRI 23-OCT-1997 DEFINITION Homo sapiens nucleolar autoantigen No55 mRNA, complete cds. ACCESSION U47621 NID g1491808 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2347) AUTHORS Ochs,R.L., Stein,T.W. Jr., Chan,E.K., Ruutu,M. and Tan,E.M. TITLE cDNA cloning and characterization of a novel nucleolar protein JOURNAL Mol. Biol. Cell 7 (7), 1015-1024 (1996) MEDLINE 97015880 REFERENCE 2 (bases 1 to 2347) AUTHORS Chan,E.K.L. TITLE Direct Submission JOURNAL Submitted (29-JAN-1996) Edward K.L. Chan, Molecular and Experimental Medicine, The Scripps Research Institute, 10550 N. Torrey Pines Road, La Jolla, CA 92037, USA FEATURES Location/Qualifiers source 1..2347 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T24" CDS 12..1325 /codon_start=1 /product="nucleolar autoantigen No55" /db_xref="PID:g1491809" /translation="MARVAWGLLWLLLGSAGAQYEKYSFRGFPPEDLMPLAAAYGHAL EQYEGESWRESARYLEAALRLHRLLRDSEAFCHANCSGPAPAAKPDPDGGRADEWACE LRLFGRVLERAACLRRCKRTLPAFQVPYPPRQLLRDFQSRLPYQYLHYALFKANRLEK AVAAAYTFLQRNPKHELTAKYLNYYQGMLDVADESLTDLEAQPYEAVFLRAVKLYNSG DFRSSTEDMERALSEYLAVFARCLAGCEGAHEQVDFKDFYPAIADLFAESLQCKVDCE ANLTPNVGGYFVDKFVATMYHYLQFAYYKLNDVRQAARSAASYMLFDPKDSVMQQNLV YYRFHRARWGLEEEDFQPREEAMLYHNQTAELRELLEFTHMYLQSDDEMELEETEPPL EPEDALSDAEFEGEGDYEEGMYADWWQEPDAKGDEAEAEPEPELA" repeat_region 1431..1720 /note="Alu subtype Sx" repeat_region 1730..1995 /note="Alu subtype Sx" BASE COUNT 517 a 689 c 733 g 408 t ORIGIN 1 gcgcggcggg catggctcgg gtggcgtggg ggctgctgtg gttgctgctg ggcagcgccg 61 gggcgcagta cgagaagtac agcttccggg gcttcccgcc cgaggacctg atgccgctgg 121 ccgcggcgta cgggcacgct ctggagcagt acgagggaga gagctggcgc gagagcgcgc 181 gctacctgga ggcggcgctg cggctgcacc ggctcctgcg cgacagcgag gccttctgcc 241 acgccaactg cagcggcccc gcgcccgcgg ccaagcccga tcccgacggc ggccgcgcag 301 acgagtgggc ctgcgagctg cggctcttcg gccgcgtcct ggagcgagcc gcctgcctgc 361 ggcgctgcaa gcggacgctg cccgccttcc aggtgcccta cccgccgcgg cagctgctgc 421 gtgacttcca gagccgcctg ccctaccagt acctgcacta cgcgctgttc aaggctaacc 481 ggctggagaa ggcggtggcg gcggcctaca ccttcctcca gaggaacccg aagcacgagc 541 tgaccgccaa gtatctcaac tactatcagg ggatgctgga cgtcgccgac gagtccctca 601 cggacctaga ggcccagccc tacgaggccg tgttcctccg ggctgtgaag ctctacaaca 661 gcggggattt ccgcagcagc acggaggaca tggagcgggc cttgtcagag tacctggcag 721 tctttgcccg gtgcctggcc ggctgtgaag gggcccatga gcaggtggac ttcaaggact 781 tctacccggc catagcagat ctctttgcag agtccctgca gtgcaaggtg gactgtgagg 841 ccaatttgac ccccaatgtg ggtggctact tcgtggacaa gttcgtggcc accatgtacc 901 actacctgca gtttgcctac tataagttga atgatgtgcg ccaggctgcc cgcagcgccg 961 ccagctacat gctcttcgac cccaaggaca gcgtcatgca gcagaacctg gtgtattacc 1021 ggttccaccg ggctcgctgg ggcctggaag aggaggactt ccagccccgg gaggaggcca 1081 tgctctacca caaccagacc gccgagctgc gggagctgct ggagttcacc cacatgtacc 1141 tgcagtcaga tgatgagatg gagctggagg agacagaacc gcccctggag cctgaggatg 1201 ccctatctga cgccgagttt gagggggagg gtgactacga ggagggcatg tatgctgact 1261 ggtggcagga gccggatgcc aagggtgacg aggccgaggc tgagccagag cctgaactcg 1321 catgagaagg ggacacccca caccgctcaa gcttgggaag cctggtgccg atggccccac 1381 cctcaccagc ctgggcagca gcaagaacta tttattaaaa acttaagatg ggccaggtgc 1441 ggtggctcac acctgtaatc ccagcatttt gggaggccaa ggtgggtgga tcacttgagg 1501 ccaggagttc aagaccagcc tggccaacat gatgagacct ccgtctctac taaaatacat 1561 aaattagccg ggtgtggtgg caggcgcctg aaatcccagc tactcaagag gctgaggcag 1621 gagaatcgct tgaacctggg aggcaaaggt tgcggtgaac tgagattgcg ccaccgcact 1681 ccagcctggg cgacagagcg agactccatc tttaaaaaaa aacaagacgg gccggcacgg 1741 tggctcacgc ctgtaatccc agcactgaga ggccgatcac ttgaggtcag gagttcaaga 1801 cctgcctggc caacatggtg aaaccccatc tctactaaaa aatacaaaaa ttagccaggc 1861 atggtggcac acacctgtaa tcgtagctga ggcaggagaa tcgcctgaac ccaggaggcg 1921 gagcttgcag tgagccgaga tcgtgccact gcactccagc ctgggcgaca gagtgagact 1981 ccatctcaaa aaaaaaaaaa aaaaacttaa gatggacaca gctgactgga cccccatcct 2041 gcctcaccca tgggtgctgc accccagacc catcctgcca cttctatgtc tctggaccac 2101 aggatggtgg tggcattgca ggttggcaag tgggctgatg gggtccgccc tcctcactgc 2161 tgagctcctc acctggacag tctcctggac aaggagtttc cagctgctgg ctggagtctc 2221 aggccaaatt gcagagggtc ctccagggtc ctgaagagca ctggactaag agtctagtgg 2281 ttccagggcc ctgaccagta ggtgctcaat aaatgtttgt tgttgaatga aaaaaaaaaa 2341 aaaaaaa // LOCUS HSU47654 8409 bp DNA PRI 20-MAR-1996 DEFINITION Human pyruvate kinase PK-R gene, partial cds, and pyruvate kinase PK-L gene, complete cds. ACCESSION U47654 NID g1230588 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8409) AUTHORS Lenzner,C., Nuernberg,P., Jacobasch,G. and Thiele,H-J. TITLE Complete genomic sequence of the human pyruvate kinase L/R gene includes four intragenic polymorphisms defining different haplotype backgrounds on normal and mutant PK genes JOURNAL Unpublished REFERENCE 2 (bases 1 to 8409) AUTHORS Lenzner,C., Nuernberg,P., Jacobasch,G. and Thiele,H-J. TITLE Direct Submission JOURNAL Submitted (30-JAN-1996) Peter Nuernberg, Institute for Medical Genetics, Charite Medical School of the Humboldt-University Berlin, Schumannstr. 20/21, Berlin 10098, Germany FEATURES Location/Qualifiers source 1..8409 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1q21" gene 1..7678 /gene="PK-R" CDS join(<1..139,1145..1327,2508..2599,2696..2827,2961..3147, 3510..3780,3877..4027,4672..4824,4919..5085,6325..6506, 7572..7678) /gene="PK-R" /codon_start=1 /product="pyruvate kinase PK-R isoenzyme" /db_xref="PID:g1230589" /translation="HSMVPQPQAHTESMSIQENISSLQLRSWVSKSQRDLAKSILIGA PGGPAGYLRRASVAQLTQELGTAFFQQQQLPAAMADTFLEHLCLLDIDSEPVAARSTS IIATIGPASRSVERLKEMIKAGMNIARLNFSHGSHEYHAESIANVREAVESFAGSPLS YRPVAIALDTKGPEIRTGILQGGPESEVELVKGSQVLVTVDPAFRTRGNANTVWVDYP NIVRVVPVGGRIYIDDGLISLVVQKIGPEGLVTQVENGGVLGSRKGVNLPGAQVDLPG LSEQDVRDLRFGVEHGVDIVFASFVRKASDVAAVRAALGPEGHGIKIISKIENHEGVK RFDEILEVSDGIMVARGDLGIEIPAEKVFLAQKMMIGRCNLAGKPVVCATQMLESMIT KPRPTRAETSDVANAVLDGADCIMLSGETAKGNFPVEAVKMQHAIAREAEAAVYHRQL FEELRRAAPLSRDPTEVTAIGAVEAAFKCCAAAIIVLTTTGRSAQLLSRYRPRAAVIA VTRSAQAARQVHLCRGVFPLLYREPPEAIWADDVDRRVQFGIESGKLRGFLRVGDLVI VVTGWRPGSGYTNIMRVLSIS" gene 478..7678 /gene="PK-L" CDS join(478..553,1145..1327,2508..2599,2696..2827,2961..3147, 3510..3780,3877..4027,4672..4824,4919..5085,6325..6506, 7572..7678) /gene="PK-L" /codon_start=1 /product="pyruvate kinase PK-L isoenzyme" /db_xref="PID:g1230590" /translation="MPLTTQQCGADPQRGRPREVCSGMEGPAGYLRRASVAQLTQELG TAFFQQQQLPAAMADTFLEHLCLLDIDSEPVAARSTSIIATIGPASRSVERLKEMIKA GMNIARLNFSHGSHEYHAESIANVREAVESFAGSPLSYRPVAIALDTKGPEIRTGILQ GGPESEVELVKGSQVLVTVDPAFRTRGNANTVWVDYPNIVRVVPVGGRIYIDDGLISL VVQKIGPEGLVTQVENGGVLGSRKGVNLPGAQVDLPGLSEQDVRDLRFGVEHGVDIVF ASFVRKASDVAAVRAALGPEGHGIKIISKIENHEGVKRFDEILEVSDGIMVARGDLGI EIPAEKVFLAQKMMIGRCNLAGKPVVCATQMLESMITKPRPTRAETSDVANAVLDGAD CIMLSGETAKGNFPVEAVKMQHAIAREAEAAVYHRQLFEELRRAAPLSRDPTEVTAIG AVEAAFKCCAAAIIVLTTTGRSAQLLSRYRPRAAVIAVTRSAQAARQVHLCRGVFPLL YREPPEAIWADDVDRRVQFGIESGKLRGFLRVGDLVIVVTGWRPGSGYTNIMRVLSIS " BASE COUNT 1824 a 2278 c 2276 g 2031 t ORIGIN 1 cattccatgg tcccgcagcc ccaggcccac actgaaagca tgtcgatcca ggagaacata 61 tcatccctgc agcttcggtc atgggtctct aagtcccaaa gagacttagc aaagtccatc 121 ctgattgggg ctccaggagg taagaagggg agacagaagc catggaacat aggaggaaaa 181 tgagggtgaa aactaggagc caggctggag ggcataaatg atccacatca gccactggct 241 aggtgggttt tggagaggaa cgtacgttct tcagagcctc ccgtgtgtta aattatggac 301 cctggcctgg gtcttttcca ggccctatag gcaggccaga gcacagcatg taagccacgg 361 ggcactcccg tggttcctgg actctggccc ctggcataca gggcttccaa tggaacagga 421 gacagtggtg acactttaac cagtctgcag aactgatccc cagcccagct gggcctcatg 481 cctctgacaa cccaacagtg tggagcagac ccacagagag ggagacccag agaggtgtgc 541 agtggcatgg aaggtgcggc tggaatcggg ggctcctctg aactgggatg ggtcaagcta 601 cagggacctc tgtgtctgta gcagctttga gaagcctggg agactcagag gatggggtgg 661 ggaagccagc cagtagccag gggttggaag ggagaaaaca gagacctttg gagcaggaac 721 tgggtgattc tgggtctgca tagggtgagg ctgctgggga ctagacatca aggggttgag 781 ggtcatcagt agctgcaggt cggggggtcc tgggttgtgc ggggtcagta taactgagtg 841 gtcaagaact tgggctagag ttagtcagac ctagatttga atcctagctg atccatactt 901 agaagctgta taatattgga ccatttattt cctgtttctc agcctctgtt tctttatcca 961 taaaattgga taattataga acatttaaca gtacctgctt tataggatcc ttgtgacata 1021 ataaaataat atatgcatga tgcccagctc ataagaaggg aaggaacaga gggtatgctg 1081 agagacgaag gcatggggag gaagggcagg tgacatgcag tccctgagcc cccttctacc 1141 acagggccag cggggtatct gcggcgggcc agtgtggccc aactgaccca ggagctgggc 1201 actgccttct tccagcagca gcagctgcca gctgctatgg cagacacctt cctggaacac 1261 ctctgcctac tggacattga ctccgagccc gtggctgctc gcagtaccag catcattgcc 1321 accatcggta agcactccca tccccctgca gccacacagg gcctattggt atttcttgag 1381 gtgcttcttc atcttttgtc tcctttgaga cttctccatg tttgacacag tcattcattc 1441 aacaaaaatt tgttgagcat atagtagaca agattttggg ccctgggagt agatcagtga 1501 aaaaaacaga caaaaatccc tacccttggg gagctgacag tctagctgag tatgacaata 1561 aatagtaagc acaataaatt atttaaaata agtaaattat ttattccgtt agaaagtgag 1621 gccgggcatg gtggctcatg cctgtaatcg cagcatgttg ggaggcccag gtgggcagat 1681 cacttgaggt caggagttcg agactagcct gaccaacatg gagaaacccc gtctctacta 1741 aaaatacaaa attagccggg catggtggtg cgtgcctgca atcccagcta ctcaggaggc 1801 tgaagcagga taatcacttg aactggggag gcagaggttg cagtgagctg agatggtgcc 1861 actgcactcc agcctgggcg acaagagtga aactcccccc gtctcaaaaa aaaagaaaga 1921 aaaagaaaag gaaaacatgg gtgtcacaga ggatccatca gaatgggtct taatttatgt 1981 ccttgagcag gtgagaagga ggagacctag tgcacaagtg gaaagacaga tagggaccga 2041 agaagtagcc aacctgctct cggctagcag ctgccccata gcagtgcagg catgggatag 2101 aactcagctc tcctcagtcc atgaggcctc ctagctctaa aagcccgcac ccaaacgccc 2161 tcacctggct cccagcccct gccctacacc ccataccctg gggtagccgg gcaagcagca 2221 cttacatacc catgcccata cagtgcccat acatgcccat acagtgacct caggcctggc 2281 ggagggcact cccctccgat ttccacactg ctgcctccca aggggatgga tgttggcttg 2341 agagggaagg ggagtctgtg atctgtgggc aggggttgca tcagggaata aagatcaggt 2401 aacaggagcc ctgtggcgtg aggcgttctg agaatggtaa tgggttgggt ttggttgcct 2461 ctcatgttct gggggaacgt tgtctgaacg tgaatctctg gttctagggc cagcatctcg 2521 ctccgtggag cgcctcaagg agatgatcaa ggccgggatg aacattgcgc gactcaactt 2581 ctcccacggc tcccacgagg tgcgggacgg gccgccgggc agtgggtggg gcaggaggat 2641 gcctcgaggt cctggccacc ttcccctgaa accctcgctc cgctccctcc cccagtacca 2701 tgctgagtcc atcgccaacg tccgggaggc ggtggagagc tttgcaggtt ccccactcag 2761 ctaccggccc gtggccatcg ccctggacac caagggaccg gagatccgca ctgggatcct 2821 gcagggggtg agcagtgggg ctgggactcg ctgggccagg gccggaaagc ggcgcctgta 2881 gggttgggcc caggcgtggg caggggcggg tcccggactc cggggctcag aactcacatc 2941 tcctctggct ccctccttag ggtccagagt cggaagtgga gctggtgaag ggctcccagg 3001 tgctggtgac tgtggacccc gcgttccgga cgcgggggaa cgcgaacacc gtgtgggtgg 3061 actaccccaa tattgtccgg gtcgtgccgg tggggggccg catctacatt gacgacgggc 3121 tcatctccct agtggtccag aaaatcggtg cggacgcgcc tcccgccctg accacatccg 3181 tgcgctgggc acattccctt ctccttggct cccccatcag ccctcagacc gatcacacct 3241 tcccctggcc acgcgttttt tgccagcccg tcccaggagt ccccagcgtg tagactctgc 3301 gctcaccctg ggtttgggct ggactatggg tgggtcgttt cttccacgga cgtccatctg 3361 tgcctcttcc gcgaagccca gaacaagggc ggagagatga ggaggacatg gttcctgacc 3421 tcttgcgggt ccaagcccca gtgtcctctc tgctgcaact gtgccccgtc ctcacccctg 3481 accgcagctg gctctttcca tgtccgcagg cccagaggga ctggtgaccc aagtggagaa 3541 cggcggcgtc ctgggcagcc ggaagggcgt gaacttgcca ggggcccagg tggacttgcc 3601 cgggctgtcc gagcaggacg tccgagacct gcgcttcggg gtggagcatg gggtggacat 3661 cgtctttgcc tcctttgtgc ggaaagccag cgacgtggct gccgtcaggg ctgctctggg 3721 tccggaagga cacggcatca agatcatcag caaaattgag aaccacgaag gcgtgaagag 3781 gtgaggcttg ggctctgttc cccttcggcc ctgtcgctat tccccatcac ctttcttctc 3841 ctgcctgcct ctgccttgat tctcccaacc tctcaggttt gatgaaatcc tggaggtgag 3901 cgacggcatc atggtggcac ggggggacct aggcatcgag atcccagcag agaaggtttt 3961 cctggctcag aagatgatga ttgggcgctg caacttggcg ggcaagcctg ttgtctgtgc 4021 cacacaggtc tggagtgagg ccttgaggtt cggcactctg tgggttttag ggacacctgt 4081 gggtgaatac ccacactgta ggggtttatt ttgttttgtt ttgttttttg agacggagtc 4141 tcactctgtc atccaggctg gagtgcagtg gcgcaatctc cgctcactgc aacctccgcc 4201 tcttgggttc aaacaattct cctgcctcag cctcccaagt agtggggatt acaggtgacc 4261 gccgccatgc caggctactt tttgtatttt cagtagagac ggggtttcac catgttggcc 4321 agactggtct cgaactcctg acctcaggtg atctactcgc ctcggcctcc caaagtgctg 4381 ggattacagg tgtgagccac tgcacccagc ccacactgta ggtttatagc acatttggat 4441 gaaaagtgtt tgatcctcaa acgacaaagt taaatatact ttgaccccta ttttcagggg 4501 ttgtgaccaa acgacaaagt taaatatact ttgactccta ttttcagggg ttgtgactgt 4561 gaccctggat tttgggacac tctgagagtg tgggtgtcag agaagtagct tgggcagggt 4621 ccccagtcac agtgtgagtc ctacaacttt gacatccacg ctgtccccca gatgctggag 4681 agcatgatta ccaagccccg gccaacgagg gcagagacaa gcgatgtcgc caatgctgtg 4741 ctggatgggg ctgactgcat catgctgtca ggggagactg ccaagggcaa cttccctgtg 4801 gaagcggtga agatgcagca tgcggtagga gctcagaatg aaaagcaaat gggccaggga 4861 accaaatccc ttccataccc cagtgcccct tcccagacta acattctggc acctgcagat 4921 tgcccgggag gcagaggccg cagtgtacca ccggcagctg tttgaggagc tacgtcgggc 4981 agcgccacta agccgtgatc ccactgaggt caccgccatt ggtgctgtgg aggctgcctt 5041 caagtgctgt gctgctgcca tcattgtgct gaccacaact ggccggtgag ggggatattg 5101 ggaatgtcca gatggagctt tgggtcaggg gtgggctggg acgggcccca ggcttgggtt 5161 tagtctggtc accaggggtg aagagtgtcc acctgagaca agaggagagg cagcaatgac 5221 agctggaggc caggagagac agaatgccag tgagcttctg ggggctggaa ggggacagcg 5281 gcatcactgg gcacattggc ttcaaggcca tttgggcttc tggggctcag aggcaagtcc 5341 attcggccca cagagcctac caatactgag gtattacaga agggtccagt aggtctgagt 5401 ttaagtcttt actcagaaat gtagctctat tagcctgctg tctttcctca tgaacaggac 5461 aaggtaatga tttgttcttc atggggttgg caggattaac aggagatatt aagtactaga 5521 tagtacttaa tatctatata gtaaattaag cacataatgg actttctcaa aagggtttta 5581 tactcttggg tttttgtttg tttgttttgt tttgtttttc ttttgagaac ggcagctcgc 5641 tctgttgctc aggctggagt aaaatggtgc aatctcagct caccacaacc ttcacctccc 5701 aagttcaagt gattctcctg cctcagcctt ccgagtagct gggattacag gcgcatgcca 5761 caacgccctg catatttttt gtatttttag tagagacagg gtttcatcat gttagccagg 5821 ctggtctcaa actcctgacc tcaggtgatc caccctcctt ggcctcccaa agtgctggga 5881 ttataggcat gagccactgt gcctggccag cttttatatt tcttaaagag atttctcctt 5941 attatctcat ttgatatatc tgtattatct catatctttc ttgtgtggta gggagggcag 6001 ggattctttc tttttttttt cttaatgcag agatggggtc tcactatgtt gcctaggctg 6061 gtctccagct cctgggctca aacaattctc ccatctcggc ctcccaaagt gctgggatta 6121 caggcatgag ccaccgctcc cggcctggga ggcaggcatt cttacactca ttttacaggt 6181 gagaacacca aggcccagag aagtatgatg acttacccag ggtcacacag cttgttagtg 6241 acacctggaa ctggaacaaa gattctcctt tcctcgttca ccactttctt gctgttctgg 6301 gctgaccttc tctgcctcct ccagctcagc ccagcttctg tctcggtacc gacctcgggc 6361 agcagtcatt gctgtcaccc gctctgccca ggctgcccgc caggtccact tatgccgagg 6421 agtcttcccc ttgctttacc gtgaacctcc agaagccatc tgggcagatg atgtagatcg 6481 ccgggtgcaa tttggcattg aaagtggtga gctacctaga ccttccctgc cactcctacc 6541 atttgtatca ggagcccccc aacccagctt cccataccca ctcaaagggc cttgctctct 6601 cctgtggtcc caggttttcc ctgtgagagt cactaagact gagatatcag tctggcatat 6661 cacaaatacc tcttccatca gcatagccac acaggcagac gggcgtgcta cttagcaaca 6721 cactctgccc tagcccacag atactcctgc accttttttt tttttgagag agagtcttgc 6781 tctgtcgcca ggctggagtg caggggtacg atctcggctc actgcaatct ctgcctcctg 6841 ggttcaagtg attctcccac ctcagcctcc tgagtagctg ggactacagg catgcgccac 6901 cacgcctgga taatttttgt atttttagta gagacagggt ttcaccatgt tggccaggat 6961 ggtctcaatg tcttgaccac gtgatccgcc cgcctcggac tcccagagtg ctgggattac 7021 aggcgtcagc caccgtgctg gcatctctta cacctttaat atactgcagt ggtcacattc 7081 cctgcatgcc acccgtggga catacttcac tgcctcattc cctacaggat ggccttgatg 7141 tggtgaaagg tggtggctgg ttctcgttac agggctggcc tggtccctca atgactaatt 7201 ttctttcttt tcttctttta ttattattat tattattatt attattatta ttattattat 7261 tttgagatgg agtcttgctc tgtccccagg ctggagtgca gtggtgtgat ctcggctcac 7321 tgcaacctct acctcctgag ttcaagcgat tctcctgcct cagcctccct agtagctggg 7381 attacaggca tgcgccacca cacccagcta atttttgtat tttcagtaga gacaagtttc 7441 accatgttgg ccaggctggt ctcaaactcc tgacctcagg tgatcctcct gccttggctt 7501 cccaaagtga tgggattaca ggtgtgagcc accacacctg tccaatgatt tgttttcttt 7561 ccctccccca ggaaagctcc gtggcttcct ccgtgttgga gacctggtga ttgtggtgac 7621 aggctggcga cctggctccg gctacaccaa catcatgcgg gtgctaagca tatcctgaga 7681 cgcccctccc tcctctggcc cagcctaccc ttgtacccca tcccttcctc cccagtctac 7741 gttctccagc ccacacccct ccaaagcccc acctttaagt cctctcttct ctattcctga 7801 ccctccctac ctgaggccta tctgagacta taactgtcat ctagcccctt cgaggttgcc 7861 ccttccccat ctccatttca cacaggtcct gaaagtctgt gtccaattat gcactggcca 7921 cccaacagca ccaattgtac attccctgca tccaatctgc tcagcaggcc ctaagatgcc 7981 ttgagtcttt aatcccaggt ttggctggtt aattccataa ccccaggcat cccatccctt 8041 ggggtggggg agaggggaga cagggcaatc ttgtccacag tctcccattc tcatatgtag 8101 ccctcatgat aatctgggca tctcgtgcca gggcaggcta ccccttcatg gtgactaaca 8161 gttacatgaa agtccacgct tttgggaaaa ctgggtggga tggatgctgg ggagaagtga 8221 gggctgggca gctgattttg tcactgtctt cacaactcgt gctgggcttg tagaccactg 8281 tcctggctgc tctcatgcct gcctgatacc ctgcttggtc aaatcccggc tgcttccttc 8341 tgcacccaga aattccttcc cactcatgtt gttcccacac acaaaccaag agccaaaaat 8401 gagtgtgtc // LOCUS HSU47674 2262 bp mRNA PRI 20-FEB-1996 DEFINITION Human putative 32kDa heart protein PHP32 mRNA, complete cds. ACCESSION U47674 NID g1197767 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2262) AUTHORS Churchill,J.R., Wieland,S.J., Hoffman,S., Gallin,E.K. and Murphy,P.M. TITLE New subfamily of prenylated proteins predicted by a novel human cDNA JOURNAL Unpublished REFERENCE 2 (bases 1 to 2262) AUTHORS Wieland,S.J., Hoffman,S., Churchill,J.R., Gallin,E.K. and Murphy,P.M. TITLE Direct Submission JOURNAL Submitted (30-JAN-1996) Steven J. Wieland, Neurobiology and Anatomy, MCP and Hahnemann University, MailStop 408, Broad and Vine, Philadelphia, PA 19102, USA FEATURES Location/Qualifiers source 1..2262 /organism="Homo sapiens" /note="a broadly expressed mRNA with highest expression in heart" /db_xref="taxon:9606" /cell_type="primary monocyte" CDS 366..1208 /note="protein is similar to human hypothetical protein (clone cPj-LTR), PIR Accession Number JH0791" /codon_start=1 /product="PHP32" /db_xref="PID:g1197768" /translation="MKGIAAVTDIPLGEIISFNIFYELFTICTSIVAEDKKGHLIHGR NMDFGVFLGWNINNDTWVITEQLKPLTVNLDFQRNNKTVFKASSFAGYVGMLTGFKPG LFSLTLNERFSINGGYLGILEWILGKKDAMWIGFLTRTVLENSTSYEEAKNLLTKTKI LAPAYFILGGNQSGEGCVITRDRKESLDVYELDAKQGRWYVVQTNYDRWKHPFFLDDR RTPAKMCLNRTSQENISFETMYDVLSTKPVLNKLTVYTTLIDVTKGQFETYLRDCPDP CIGW" BASE COUNT 672 a 446 c 475 g 669 t ORIGIN 1 gggatgttgg ctgctagagc gatgccgggc cggagttgcg tcgccttagt cctcctggct 61 gccgcgtcag ctgtgccgtc gcgcacgacg cgccgccgtg gacagaggac tgcagaaaat 121 caacctatcc tcttcaggac caacgtacag aggtgcagtt ccatggtaca ccataaatct 181 tgacttacca ccctacaaaa gatggcatga attgatgctt gacaaggcac caatgctaaa 241 ggttatagtg aattctctga agaatatgat aaatacattc gttgccaagt ggaaaagtta 301 ttgcaggtgg tggatgaaaa ttgcctggcc tacttggcaa ctttcctggc ccttttgaag 361 aggaaatgaa gggtattgcc gctgttactg atataccttt aggagagatt atttcattca 421 atatttttta tgaattattt accatttgta cttcaatagt agcagaagac aaaaaaggtc 481 atctaataca tgggagaaac atggattttg gagtatttct tgggtggaac ataaataatg 541 atacctgggt cataactgag caactaaaac ctttaacagt gaatttggat ttccaaagaa 601 acaacaaaac tgtcttcaag gcttcaagct ttgctggcta tgtgggcatg ttaacaggat 661 tcaaaccagg actgttcagt cttacactga atgaacgttt cagtataaat ggtggttatc 721 tgggtattct agaatggatt ctgggaaaga aagatgccat gtggataggg ttcctcacta 781 gaacagttct ggaaaatagc acaagttatg aagaagccaa gaatttattg accaagacca 841 agatattggc cccagcctac tttatcctgg gaggcaacca gtctggggaa ggttgtgtga 901 ttacacgaga cagaaaggaa tcattggatg tatatgaact cgatgctaag cagggtagat 961 ggtatgtggt acaaacaaat tatgaccgtt ggaaacatcc cttcttcctt gatgatcgca 1021 gaacgcctgc aaagatgtgt ctgaaccgca ccagccaaga gaatatctca tttgaaacca 1081 tgtatgatgt cctgtcaaca aaacctgtcc tcaacaagct gaccgtatac acaaccttga 1141 tagatgttac caaaggtcaa ttcgaaactt acctgcggga ctgccctgac ccttgtatag 1201 gttggtgagc acacgtctgg cctacagaat gcggcctctg agacatgaag acaccatctc 1261 catgtgaccg aacactgcag ctgtctgacc ttccaaagac taagactcgc ggcaggttct 1321 ctttgagtca atagcttgtc ttcgtccatc tgttgacaaa tgacagactt tttttttttc 1381 cccctatcag ttgatttttc ttatttatag ataacttctt taggggaagt aaaacagtca 1441 tctagaattc actgagtttt gtttcacttt gacatttggg gatctggtgg gcagtcgaac 1501 catggtgaac tccacctccg tgaataaatg gagattcagc gtgggtgttg aatccagcac 1561 gtctgtgtga gtaacgggac agtaaacact ccacattctt cagtttttca cttctaccta 1621 catatttgta tgtttttctg tataacagcc ttttccttct ggttctaact gctgttaaaa 1681 ttaatatatc attatctttg ctgttattga cagcgatata attttattac atatgattag 1741 agggatgaga cagacattca cctgtatatt tcttttaatg ggcacaaaat tggtgccttt 1801 gcctctaaat agcacttttt cggggtcaag aagtaatcag atgcaaagca atcgtttata 1861 caataattga agcgcacctt tcaataccac tccagtacct aaggaagtgc tactaaactg 1921 catccacgtc tgtatagtaa taacagtcaa gctggaatcg aggaccaatt aattccaatg 1981 gcacagagta gcattcatgt aataaacagg tttttagttt gttcttcaga ttgataggga 2041 gttttaaaga aattttagta gttactaaaa ttatgttact gtatttttca gaaatccaac 2101 tgcttatgaa aagtactaat agaacttgtt aacctttcta accttcacga ttaactgtga 2161 aatgtacgtc atttgtgcaa gaccgtttgt ccacttcatt ttgtataatc acagttgtgt 2221 tcctgacact caataaacag tcattggaaa gagaaaaaaa aa // LOCUS HSU47678 2106 bp mRNA PRI 17-MAY-1996 DEFINITION Human 80 kDa estrogen receptor mRNA, containing exon 6 and 7 duplication, complete cds. ACCESSION U47678 NID g1197854 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2106) AUTHORS Pink,J.J., Wu,S.Q., Wolf,D.M., Bilimoria,M.M. and Jordan,V.C. TITLE A novel 80 kDa human estrogen receptor containing a duplication of exons 6 and 7 JOURNAL Nucleic Acids Res. 24 (5), 962-969 (1996) MEDLINE 96174665 REFERENCE 2 (bases 1 to 2106) AUTHORS Jordan,C.V. TITLE Direct Submission JOURNAL Submitted (30-JAN-1996) Craig V. Jordan, Robert H. Lurie Cancer Center, Northwestern University Medical Center, 303 E. Chicago Ave., Olson 8258, Chicago, IL 60611, USA FEATURES Location/Qualifiers source 1..2106 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6q25" /cell_line="MCF-7" /tissue_type="breast" exon 1..450 /number=1 CDS 1..2106 /note="80 kDa estrogen receptor with an exon 6 and 7 repeat; It is able to bind estrogen" /codon_start=1 /product="80 kDa estrogen receptor" /db_xref="PID:g1197855" /translation="MTMTLHTKASGMALLHQIQGNELEPLNRPQLKIPLERPLGEVYL DSSKPAVYNYPEGAAYEFNAAAAANAQVYGQTGLPYGPGSEAAAFGSNGLGGFPPLNS VSPSPLMLLHPPPQLSPFLQPHGQQVPYYLENEPSGYTVREAGPPAFYRPNSDNRRQG GRERLASTNDKGSMAMESAKETRYCAVCNDYASGYHYGVWSCEGCKAFFKRSIQGHND YMCPATNQCTIDKNRRKSCQACRLRKCYEVGMMKGGIRKDRRGGRMLKHKRQRDDGEG RGEVGSAGDMRAANLWPSPLMIKRSKKNSLALSLTADQMVSALLDAEPPILYSEYDPT RPFSEASMMGLLTNLADRELVHMINWAKRVPGFVDLTLHDQVHLLECAWLEILMIGLV WRSMEHPVKLLFAPNLLLDRNQGKCVEGMVEIFDMLLATSSRFRMMNLQGEEFVCLKS IILLNSGVYTFLSSTLKSLEEKDHIHRVLDKITDTLIHLMAKAGLTLQQQHQRLAQLL LILSHIRHMRNQGKCVEGMVEIFDMLLATSSRFRMMNLQGEEFVCLKSIILLNSGVYT FLSSTLKSLEEKDHIHRVLDKITDTLIHLMAKAGLTLQQQHQRLAQLLLILSHIRHMS NKGMEHLYSMKCKNVVPLYDLLLEMLDAHRLHAPTSRGGASVEETDQSHLATAGSTSS HSLQKYYITGEAEGFPATV" exon 451..642 /number=2 exon 643..759 /number=3 exon 760..1095 /number=4 exon 1096..1233 /number=5 exon 1234..1368 /number=6 exon 1369..1551 /number=7 exon 1551..1685 /note="exon 6'" exon 1686..1868 /note="exon 7'" exon 1869..2106 /number=8 BASE COUNT 496 a 602 c 587 g 421 t ORIGIN 1 atgaccatga ccctccacac caaagcatct gggatggccc tactgcatca gatccaaggg 61 aacgagctgg agcccctgaa ccgtccgcag ctcaagatcc ccctggagcg gcccctgggc 121 gaggtgtacc tggacagcag caagcccgcc gtgtacaact accccgaggg cgccgcctac 181 gagttcaacg ccgcggccgc cgccaacgcg caggtctacg gtcagaccgg cctcccctac 241 ggccccgggt ctgaggctgc ggcgttcggc tccaacggcc tggggggttt ccccccactc 301 aacagcgtgt ctccgagccc gctgatgcta ctgcacccgc cgccgcagct gtcgcctttc 361 ctgcagcccc acggccagca ggtgccctac tacctggaga acgagcccag cggctacacg 421 gtgcgcgagg ccggcccgcc ggcattctac aggccaaatt cagataatcg acgccagggt 481 ggcagagaaa gattggccag taccaatgac aagggaagta tggctatgga atctgccaag 541 gagactcgct actgtgcagt gtgcaatgac tatgcttcag gctaccatta tggagtctgg 601 tcctgtgagg gctgcaaggc cttcttcaag agaagtattc aaggacataa cgactatatg 661 tgtccagcca ccaaccagtg caccattgat aaaaacagga ggaagagctg ccaggcctgc 721 cggctccgca aatgctacga agtgggaatg atgaaaggtg ggatacgaaa agaccgaaga 781 ggagggagaa tgttgaaaca caagcgccag agagatgatg gggagggcag gggtgaagtg 841 gggtctgctg gagacatgag agctgccaac ctttggccaa gcccgctcat gatcaaacgc 901 tctaagaaga acagcctggc cttgtccctg acggccgacc agatggtcag tgccttgttg 961 gatgctgagc cccccatact ctattccgag tatgatccta ccagaccctt cagtgaagct 1021 tcgatgatgg gcttactgac caacctggca gacagggagc tggttcacat gatcaactgg 1081 gcgaagaggg tgccaggctt tgtggatttg accctccatg atcaggtcca ccttctagaa 1141 tgtgcctggc tagagatcct gatgattggt ctcgtctggc gctccatgga gcacccagtg 1201 aagctactgt ttgctcctaa cttgctcttg gacaggaacc agggaaaatg tgtagagggc 1261 atggtggaga tcttcgacat gctgctggct acatcatctc ggttccgcat gatgaatctg 1321 cagggagagg agtttgtgtg cctcaaatct attattttgc ttaattctgg agtgtacaca 1381 tttctgtcca gcaccctgaa gtctctggaa gagaaggacc atatccaccg agtcctggac 1441 aagatcacag acactttgat ccacctgatg gccaaggcag gcctgaccct gcagcagcag 1501 caccagcggc tggcccagct cctcctcatc ctctcccaca tcaggcacat gaggaaccag 1561 ggaaaatgtg tagagggcat ggtggagatc ttcgacatgc tgctggctac atcatctcgg 1621 ttccgcatga tgaatctgca gggagaggag tttgtgtgcc tcaaatctat tattttgctt 1681 aattctggag tgtacacatt tctgtccagc accctgaagt ctctggaaga gaaggaccat 1741 atccaccgag tcctggacaa gatcacagac actttgatcc acctgatggc caaggcaggc 1801 ctgaccctgc agcagcagca ccagcggctg gcccagctcc tcctcatcct ctcccacatc 1861 aggcacatga gtaacaaagg catggagcat ctgtacagca tgaagtgcaa gaacgtggtg 1921 cccctctatg acctgctgct ggagatgctg gacgcccacc gcctacatgc gcccactagc 1981 cgtggagggg catccgtgga ggagacggac caaagccact tggccactgc gggctctact 2041 tcatcgcatt ccttgcaaaa gtattacatc acgggggagg cagagggttt ccctgccaca 2101 gtctga // LOCUS HSU47686 2782 bp mRNA PRI 24-MAY-1996 DEFINITION Human signal transducer and activator of transcription Stat5B mRNA, complete cds. ACCESSION U47686 NID g1330323 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2782) AUTHORS Lin,J.X., Mietz,J., Modi,W.S., John,S. and Leonard,W.J. TITLE Cloning of human Stat5B. Reconstitution of interleukin-2-induced Stat5A and Stat5B DNA binding activity in COS-7 cells JOURNAL J. Biol. Chem. 271 (18), 10738-10744 (1996) MEDLINE 96210005 REFERENCE 2 (bases 1 to 2782) AUTHORS Lin,J.-X., Mietz,J., Modi,W.S., John,S. and Leonard,W.J. TITLE Direct Submission JOURNAL Submitted (31-JAN-1996) Jian-Xin Lin, Lab of Molecular Immunology, NHLBI, NIH, 9000 Rockville Pike, Bldg. 10, Rm. 7N244, Bethesda, MD 20892-1674, USA FEATURES Location/Qualifiers source 1..2782 /organism="Homo sapiens" /chromosome="17" /map="17q11.2" CDS 147..2510 /note="STAT protein; is activated by IL-2, IL-7, IL-15, growth hormone, IL-3, GM-CSF, thrombopoietin, prolactin, and erythropoietin; tyrosine 699 phosphorylation is required for activation and dimerization of Stat5B" /codon_start=1 /product="signal transducer and activator of transcription Stat5B" /db_xref="PID:g1330324" /translation="MAVWIQAQQLQGEALHQMQALYGQHFPIEVRHYLSQWIESQAWD SVDLDNPQENIKATQLLEGLVQELQKKAEHQVGEDGFLLKIKLGHYATQLQNTYDRCP MELVRCIRHILYNEQRLVREANNGSSPAGSLADAMSQKHLQINQTFEELRLVTQDTEN ELKKLQQTQEYFIIQYQESLRIQAQFGPLAQLSPQERLSRETALQQKQVSLEAWLQRE AQTLQQYRVELPEKHQKTLQLLRKQQTIILDDELIQWKRRQQLAGNGGPPEGSLDVLQ SWCEKLAEIIWQNRQQIRRAEHLCQQLPIPGPVEEMLAEVNATITDIISALVTSTFII EKQPPQVLKTQTKFAATVRLLVGGKLNVHMNPPQVKATIISEQQAKSLLKNENTRNDY SGEILNNCCVMEYHQATGTLSAHFRNMSLKRIKRSDRRGAESVTEEKFTILFESQFSV GGNELVFQVKTLSLPVVVIVHGSQDNNATATVLWDNAFAEPGRVPFAVPDKVLWPQLC EALNMKFKAEVQSNRGLTKENLVFLAQKLFNNSSSHLEDYSGLSVSWSQFNRENLPGR NYTFWQWFDGVMEVLKKHLKPHWNDGAILGFVNKQQAHDLLINKPDGTFLLRFSDSEI GGITIAWKFDSQERMFWNLMPFTTRDFSIRSLADRLGDLNYLIYVFPDRPKDEVYSKY YTPVPCESATAKAVDGYVKPQIKQVVPEFVNASADAGGGSATYMDQAPSPAVCPQAHY NMYPQNPDSVLDTDGDFDLEDTMDVARRVEELLGRPMDSQWIPHAQS" BASE COUNT 669 a 739 c 801 g 573 t ORIGIN 1 ggagccgtca ccccgggcgg ggacccagcg caggcaactc cgcgcggcgc ccggccgagg 61 gagggagcga gcgggcgggc gggcaagcca gacagctggg ccggagcagc cgccggcgcc 121 cgaggggccg agcgagattg taaaccatgg ctgtgtggat acaagctcag cagctccaag 181 gagaagccct tcatcagatg caagcgttat atggccagca ttttcccatt gaggtgcggc 241 attatttatc ccagtggatt gaaagccaag catgggactc agtagatctt gataatccac 301 aggagaacat taaggccacc cagctcctgg agggcctggt gcaggagctg cagaagaagg 361 cagagcacca ggtgggggaa gatgggtttt tactgaagat caagctgggg cactatgcca 421 cacagctcca gaacacgtat gaccgctgcc ccatggagct ggtccgctgc atccgccata 481 tattgtacaa tgaacagagg ttggtccgag aagccaacaa tggtagctct ccagctggaa 541 gccttgctga tgccatgtcc cagaaacacc tccagatcaa ccagacgttt gaggagctgc 601 gactggtcac gcaggacaca gagaatgagt taaaaaagct gcagcagact caggagtact 661 tcatcatcca gtaccaggag agcctgagga tccaagctca gtttggcccg ctggcccagc 721 tgagccccca ggagcgtctg agccgggaga cggccctcca gcagaagcag gtgtctctgg 781 aggcctggtt gcagcgtgag gcacagacac tgcagcagta ccgcgtggag ctgcccgaga 841 agcaccagaa gaccctgcag ctgctgcgga agcagcagac catcatcctg gatgacgagc 901 tgatccagtg gaagcggcgg cagcagctgg ccgggaacgg cgggcccccc gagggcagcc 961 tggacgtgct acagtcctgg tgtgagaagt tggcggagat catctggcag aaccggcagc 1021 agatccgcag ggctgagcac ctctgccagc agctgcccat ccccggccca gtggaggaga 1081 tgctggccga ggtcaacgcc accatcacgg acattatctc agccctggtg accagcacgt 1141 tcatcattga gaagcagcct cctcaggtcc tgaagaccca gaccaagttt gcagccactg 1201 tgcgcctgct ggtgggcggg aagctgaacg tgcacatgaa ccccccccag gtgaaggcca 1261 ccatcatcag tgagcagcag gccaagtctc tgctcaagaa cgagaacacc cgcaatgatt 1321 acagtggcga gatcttgaac aactgctgcg tcatggagta ccaccaagcc acaggcaccc 1381 ttagtgccca cttcaggaat atgtccctga aacgaattaa gaggtcagac cgtcgtgggg 1441 cagagtcggt gacagaagaa aaatttacaa tcctgtttga atcccagttc agtgttggtg 1501 gaaatgagct ggtttttcaa gtcaagaccc tgtccctgcc agtggtggtg atcgttcatg 1561 gcagccagga caacaatgcg acggccactg ttctctggga caatgctttt gcagagcctg 1621 gcagggtgcc atttgccgtg cctgacaaag tgctgtggcc acagctgtgt gaggcgctca 1681 acatgaaatt caaggccgaa gtgcagagca accggggcct gaccaaggag aacctcgtgt 1741 tcctggcgca gaaactgttc aacaacagca gcagccacct ggaggactac agtggcctgt 1801 ctgtgtcctg gtcccagttc aacagggaga atttaccagg acggaattac actttctggc 1861 aatggtttga cggtgtgatg gaagtgttaa aaaaacatct caagcctcat tggaatgatg 1921 gggccatttt ggggtttgta aacaagcaac aggcccatga cctactgatt aacaagccag 1981 atgggacctt cctcctgaga ttcagtgact cagaaattgg cggcatcacc attgcttgga 2041 agtttgattc tcaggaaaga atgttttgga atctgatgcc ttttaccacc agagacttct 2101 ccatcaggtc cctagccgac cgcttgggag acttgaatta ccttatctac gtgtttcctg 2161 atcggccaaa agatgaagta tactccaaat actacacacc agttccctgc gagtctgcta 2221 ctgctaaagc tgttgatgga tacgtgaagc cacagatcaa gcaagtggtc cctgagtttg 2281 tgaacgcatc tgcagatgcc gggggcggca gcgccacgta catggaccag gccccctccc 2341 cagctgtgtg tccccaggct cactataaca tgtacccaca gaaccctgac tcagtccttg 2401 acaccgatgg ggacttcgat ctggaggaca caatggacgt agcgcggcgt gtggaggagc 2461 tcctgggccg gccaatggac agtcagtgga tcccgcacgc acaatcgtga ccccgcgacc 2521 tctccatctt cagcttcttc atcttcacca gaggaatcac tcttgtggat gttttaattc 2581 catgaatcgc ttctcttttg aaacaatact cataatgtga agtgttaata ctagttgtga 2641 ccttagtgtt tctgtgcatg gtggcaccag cgaagggagt gcgagtatgt gtttgtgtgt 2701 gtgtgtgtgt gtgtgtgtgt gtgcgttggt gcacgttatg gtgtttctcc ctctcactgt 2761 ctgagagttt agttgtagca ga // LOCUS HSU47742 7869 bp mRNA PRI 01-SEP-1996 DEFINITION Human monocytic leukaemia zinc finger protein (MOZ) mRNA, complete cds. ACCESSION U47742 NID g1517913 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7869) AUTHORS Borrow,J., Stanton,V.P., Andresen,J.M., Becher,R., Behm,F.G., Chaganti,R.S.K., Civin,C.I., Disteche,C., Dube,I., Frischauf,A.M., Horsman,D., Mitelman,F., Volinia,S., Watmore,A.E. and Housman,D.E. TITLE The translocation t(8;16)(p11;p13) of acute myeloid leukaemia fuses a putative acetyltransferase to the CREB-binding protein JOURNAL Nature Genet. 14 (1), 33-41 (1996) MEDLINE 96376968 REFERENCE 2 (bases 1 to 7869) AUTHORS Borrow,J., Stanton,V.P., Andresen,J.M., Becher,R., Behm,F.G., Chaganti,R.S.K., Civin,C.I., Disteche,C., Dube,I., Frischauf,A.M., Horsman,D., Mitelman,F., Volinia,S., Watmore,A.E. and Housman,D.E. TITLE Direct Submission JOURNAL Submitted (30-JAN-1996) Julian Borrow, Center for Cancer Research, E17-540, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA FEATURES Location/Qualifiers source 1..7869 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="8" /map="8p11" /cell_line="U937" gene 394..6408 /gene="MOZ" CDS 394..6408 /gene="MOZ" /codon_start=1 /product="monocytic leukaemia zinc finger protein" /db_xref="PID:g1517914" /translation="MVKLANPLYTEWILEAIKKVKKQKQRPSEERICNAVSSSHGLDR KTVLEQLELSVKDGTILKVSNKGLNSYKDPDNPGRIALPKPRNHGKLDNKQNVDWNKL IKRAVEGLAESGGSTLKSIERFLKGQKDVSALFGGSAASGFHQQLRLAIKRAIGHGRL LKDGPLYRLNTKATNVDGKESCESLSCLPPVSLLPHEKDKPVAEPIPICSFCLGTKEQ NREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSSCRDQGKNAD NMLFCDSCDRGFHMECCDPPLTRMPKGMWICQICRPRKKGRKLLQKKAAQIKRRYTNP IGRPKNRLKKQNTVSKGPFSKVRTGPGRGRKRKITLSSQSASSSSEEGYLERIDGLDF CRDSNVSLRFNKKTKGLIDGLTKFFTPSPDGRKARGEVVDYSEQYRIRKRGNRKSSTS DWPTDNQDGWDGKQENEERLFGSQEIMTEKDMELFRDIQEQALQKVGVTGPPDPQVRC PSVIEFGKYEIHTWYSSPYPQEYSRLPKLYLCEFCLKYMKSRTILQQHMKKCGWFHPP ANEIYRKNNISVFEVDGNVSTIYCQNLCLLAKLFLDHKTLYYDVEPFLFYVLTQNDVK GCHLVGYFSKEKHCQQKYNVSCIMILPQYQRKGYGRFLIDFSYLLSKREGQAGSPEKP LSDLGRLSYMAYWKSVILECLYHQNDKQISIKKLSKLTGICPQDITSTLHHLRMLDFR SDQFVIIRREKLIQDHMAKLQLNLRPVDVDPECLRWTPVIVSNSVVSEEEEEEAEEGE NEEPQCQERELEISVGKSVSHENKEQDSYSVESEKKPEVMAPVSSTRLSKQVLPHDSL PANSQPSRRGRWGRKNRKTQERFGDKDSKLLLEETSSAPQEQYGECGEKSEATQEQYT ESEEQLVASEEQPSQDGKPDLPKRRLSEGVEPWRGQLKKSPEALKCRLTEGSERLPRR YSEGDRAVLRGFSESSEEEEEPESPRSSSPPILTKPTLKRKKPFLHRRRRVRKRKHHN SSVVTETISETTEVLDEPFEDSDSERPMPRLEPTFEIDEEEEEEDENELFPREYFRRL SSQDVLRCQSSSKRKSKDEEEDEESDDADDTPILKPVSLLRKRDVKNSPLEPDTSTPL KKKKGWPKGKSRKPIHWKKRPGRKPGFKLSREIMPVSTQACVIEPIVSIPKAGRKPKI QESEETVEPKEDMPLPEERKEEEEMQAEAEEAEEGEEEDAASSEVPAASPADSSNSPE TETKEPEVEEEEEKPRVSEEQRQSEEEQQELEEPEPEEEEDAAAETAQNDDHDADDED DGHLESTKKKELEEQPTREDVKEEPGVQESFLDANMQKSREKIKDKEETELDSEEEQP SHDTSVVSEQMAGSEDDHEEDSHTKEELIELKEEEEIPHSELDLETVQAVQSLTQEES SEHEGAYQDCEETLAACQTLQSYTQADEDPQMSMVEDCHASEHNSPISSVQSHPSQSV RSVSSPNVPALESGYTQISPEQGSLSAPSMQNMETSPMMDVPSVSDHSQQVVDSGFSD LGSIESTTENYENPSSYDSTMGGSICGNSSSQSSCSYGGLSSSSSLTQSSCVVTQQMA SMGSSCSMMQQSSVQPAANCSIKSPQSCVVERPPSNQQQQPPPPPPQQPQPPPPQPQP APQPPPPQQQPQQQPQPQPQQPPPPPPPQQQPPLSQCSMNNSFTPAPMIMEIPESGST GNISIYERIPGDFGAGSYSQPSATFSLAKLQQLTNTIMDPHAMPYSHSPAVTSYATSV SLSNTGLAQLAPSHPLAGTPQAQATMTPPPNLASTTMNLTSPLLQCNMSATNIGIPHT QRLQGQMPVKGHISIRSKSAPLPSAAAHQQQLYGRSPSAVAMQAGPRALAVQRGMNMG VNLMPTPAYNVNSMNMNTLNAMNSYRMTQPMMNSSYHSNPAYMNQTAQYPMQMQMGMM GSQAYTQQPMQPNPHGNMMYTGPSHHSYMNAAGVPKQSLNGPYMRR" misc_feature 1018..1323 /gene="MOZ" /note="encodes C4HC3 zinc finger domains; known as LAP, PHD and TTC domains" misc_feature 1957..2502 /gene="MOZ" /note="encodes MYST domain; homologous to C2HC zinc finger acetyltransferase" misc_feature 2755..4827 /gene="MOZ" /note="encodes acidic domain" misc_feature 5033..5034 /gene="MOZ" /note="breakpoint position in MOZ-CBP t(8;16) AML fusion message" misc_feature 5332..5496 /gene="MOZ" /note="encodes proline/glutamine domain" misc_feature 5512..6405 /gene="MOZ" /note="encodes methionine-rich domain" BASE COUNT 2323 a 1833 c 1857 g 1856 t ORIGIN 1 ggcacgaggt ttggggcatc tccgcggtcc ggcccggggc cccgggatct cggctgtcct 61 tcctcccggc ataagatgca catttttctg ctctggagcc gggaatgaaa tattcttgag 121 ttcttacaac tttatgacga gacccatgtg tggtgctatt gagaaattca ttgggaagtt 181 ggaagacatt tcaaacaaca ggttgttttg gtttctatag tacaattggg gtggcattct 241 gttttgtgaa aggaggaagg acttaggcca gaaaactcat atgctatggt taactggttc 301 ccagcctccg agaatcttgt tttccatggt gtaaaactta ctcagcatca ggataaggga 361 taacgactct atggatatac agaatccttc accatggtaa aactcgcaaa cccgctttat 421 actgagtgga ttttggaggc catcaaaaaa gtgaaaaagc agaaacagcg tccttcagaa 481 gaaaggatat gcaatgctgt gtcttcatcc catggcttgg atcgtaaaac tgttttagaa 541 caattggagt tgagtgttaa agatggaaca attttaaaag tctcaaataa aggactcaat 601 tcctataaag atcctgataa tcctgggcga atagcacttc ctaagcctcg gaaccatgga 661 aaattggata ataaacaaaa tgtggattgg aataaactga taaagcgggc agttgagggc 721 ttggcagagt ctggtggctc aactttgaaa agcattgaac gttttttgaa aggtcagaag 781 gatgtgtctg cattattcgg aggcagtgct gcctctggct ttcaccagca gttacgattg 841 gctatcaaac gtgccattgg ccacggcaga ctccttaaag atggacctct ttatcggctc 901 aacactaaag caaccaacgt ggatgggaaa gagagttgtg agtctctttc ctgtttacct 961 ccagtgtccc ttcttccaca tgaaaaggat aagccggttg ctgaaccaat ccccatctgt 1021 agtttctgtc ttggtacaaa agaacaaaac cgagaaaaga agccagagga actcatctcc 1081 tgtgccgact gtggcaacag tggccatcca tcctgtttaa agttttcccc tgaactaacg 1141 gttcgagtga aggccttacg gtggcagtgc atcgagtgta aaacatgcag ctcctgtcga 1201 gatcaaggca aaaatgcgga taacatgctc ttttgtgatt catgtgaccg aggttttcac 1261 atggagtgtt gtgatccgcc actcacccgt atgccaaaag gcatgtggat atgtcaaata 1321 tgtcgaccta ggaaaaaagg acgaaaactt ctacaaaaga aggcagcaca gataaaacgg 1381 cgctatacta atccaatagg acgtccaaaa aacaggttaa agaaacaaaa cacggtatca 1441 aaaggtccct tcagcaaagt tcgaactggc cctggaaggg gtaggaaacg aaaaatcact 1501 ctttccagcc aatcagcatc atcatcatca gaagaaggat atttagagcg gatagatggc 1561 ttggacttct gcagagatag caatgtctcc ttgaggttca acaagaaaac caaagggctc 1621 attgatggcc ttaccaaatt ttttacccct tcccctgatg ggcggaaagc tcggggggaa 1681 gtggtggact actctgagca atatcgaatc agaaagaggg gcaacaggaa atcaagcact 1741 tcagattggc ccacagacaa tcaggatggc tgggatggca aacaagaaaa tgaggagcga 1801 ctttttggga gccaggaaat catgactgag aaagatatgg aattatttcg tgatatccaa 1861 gaacaagcac tgcagaaagt tggagtgact ggtccccctg atccacaagt ccgctgtccc 1921 tctgtcattg agtttgggaa gtatgaaatt cacacctggt actcctcccc atatcctcaa 1981 gaatactcaa ggctgcccaa attgtatctt tgtgaatttt gtctaaaata tatgaaaagt 2041 agaactattc tgcagcagca catgaagaaa tgtggttggt tccatcctcc tgccaatgag 2101 atttacagaa agaataatat ttctgtcttt gaggttgatg ggaatgtgag taccatttat 2161 tgtcaaaacc tgtgtctttt ggcaaagttg tttcttgacc acaaaaccct ctattacgat 2221 gtggagccat ttctttttta tgtactaaca cagaatgatg tcaagggctg ccaccttgtt 2281 ggctactttt ctaaggaaaa gcactgccaa cagaagtaca atgtttcctg tataatgatt 2341 cttcctcaat accagcgtaa gggctatggc aggtttctca tcgatttcag ttatttgtta 2401 tcaaagcgtg aaggccaagc agggtctcca gagaaaccgt tatctgatct gggtcgtctt 2461 tcctacatgg catattggaa aagtgtaata ttggagtgcc tttatcacca aaatgacaag 2521 cagatcagca ttaagaagtt aagcaagttg actggaatct gccctcaaga catcacttcc 2581 acactccacc acctacgaat gctggacttc cgtagtgacc aatttgtgat tatccgccgg 2641 gaaaaactta tccaggatca catggcaaag cttcagctga atttgcgacc tgtagatgta 2701 gatccagaat gtttgcgctg gactccagtc atagtgtcca actctgtggt ctcagaggag 2761 gaagaagagg aggctgagga aggagaaaac gaagagccac agtgccagga aagagaatta 2821 gagatcagtg tgggaaagtc tgtgtctcat gagaacaaag aacaagattc ttattcagta 2881 gaaagtgaaa agaaaccaga agttatggct ccagtcagtt ctacacgttt gagcaaacaa 2941 gtccttcctc atgatagtct tcctgcaaat agccagccat ctcggagggg ccgctggggg 3001 aggaagaaca gaaaaaccca ggaacgtttt ggtgataaag attctaaact gctcttggaa 3061 gagacgtctt cagctcctca ggaacaatat ggagaatgtg gggagaaatc agaagccacc 3121 caggaacaat acactgaaag tgaagaacag ctggtggctt ctgaggagca gccaagccag 3181 gacgggaaac ctgaccttcc caagagaaga ctcagtgagg gggttgagcc ctggcgagga 3241 cagctcaaga aaagccctga ggctctgaag tgcagattaa cagaaggaag tgagaggctg 3301 ccccgtcgct acagtgaggg tgacagggct gtcctcaggg gcttcagtga gagcagcgag 3361 gaggaggagg agccggaaag ccctcggtca agctcgccac caattctcac aaagcccacg 3421 ctgaagcgaa agaaaccatt tctccaccga aggaggagag tccgaaagcg caaacaccac 3481 aatagcagtg tagtcacaga aactatttct gagaccactg aagtgttaga tgaacctttt 3541 gaagattctg actccgagag gccaatgcca agattagaac ccacatttga gatcgatgaa 3601 gaagaggagg aagaggatga aaatgaactt ttccctagag aatacttccg tcgtttgtct 3661 tcgcaggatg tactcaggtg tcagtcctct tctaagagga agtctaaaga tgaagaagaa 3721 gatgaagagt cagatgatgc tgatgacact cctatcttaa agccagtatc tcttttgcga 3781 aaacgtgatg tgaagaattc tcctcttgag ccagatacat ccacaccttt gaaaaagaaa 3841 aagggatggc ccaaaggcaa gagccgcaaa ccaatccact ggaagaaaag acctggtcga 3901 aaaccaggat ttaagttgag tcgggaaatc atgccagttt ctactcaagc atgcgtcatt 3961 gagcccatcg tttccattcc taaagctgga cgtaaaccca agatccagga gagtgaagaa 4021 actgttgagc caaaagaaga catgccccta cccgaggaga ggaaggagga ggaggagatg 4081 caagcagagg cagaagaggc tgaagagggt gaggaagagg atgcagccag cagtgaagtc 4141 ccagcagcct ctccagcaga cagcagcaat agtcctgaga ccgaaaccaa ggagcctgag 4201 gtggaggagg aagaagagaa gccccgtgtc tcagaggagc agaggcagtc agaggaggag 4261 cagcaggaat tagaggagcc agagccagag gaggaggaag atgcagctgc agagactgcc 4321 cagaatgacg accacgacgc tgatgatgag gatgatggcc acctggagtc cacaaagaaa 4381 aaggagctag aggaacagcc cacgagggaa gatgtcaagg aggagcctgg tgttcaagag 4441 tcttttttag atgctaatat gcagaagagt agggaaaaga taaaggataa agaggaaacc 4501 gagctggatt ccgaagagga gcagccttcc catgacacgt ccgtggtgtc agagcagatg 4561 gctgggtctg aggacgacca cgaagaagac tcccacacta aggaagagtt aatcgaatta 4621 aaagaggagg aagagattcc tcatagtgag ctggatctgg aaactgtaca ggcagtgcag 4681 tctttgactc aagaagaaag cagtgagcat gagggcgcct accaggactg tgaggaaact 4741 cttgcggcgt gtcagaccct gcagagttac acccaggctg acgaggaccc tcagatgtcc 4801 atggttgaag actgtcatgc gtcagaacat aatagcccta tctcctccgt tcagtctcac 4861 cccagccagt cagtccgttc ggtcagcagt cccaacgtgc ctgcccttga gagtggctac 4921 acccagatca gcccagaaca aggatccctg tccgcaccct ctatgcagaa catggagacc 4981 agccccatga tggatgtgcc ttccgtatca gaccactctc agcaggtggt ggacagcggc 5041 ttcagtgacc tgggcagcat tgagagcacc actgaaaact atgagaaccc aagcagttac 5101 gactccacga tgggcggcag catctgtggg aacagctctt cccagagcag ctgctcctac 5161 ggtgggctgt cgtcctccag cagcctcacc cagagcagct gtgtggtcac tcagcagatg 5221 gccagcatgg gcagcagctg cagcatgatg cagcagagca gcgtccagcc tgctgccaac 5281 tgcagcatca agtcacctca gagctgcgtg gtggagaggc ctcccagtaa ccagcagcag 5341 cagccgccac caccgcctcc acagcagcca cagccgccgc cgccacaacc acaaccagca 5401 ccacagcctc caccacccca gcagcagccg caacagcagc cgcagcctca gccccagcag 5461 cctccacccc caccccctcc ccagcagcag cccccgctgt cacagtgtag tatgaataac 5521 agtttcaccc cagctcctat gatcatggag ataccagaat ctggaagcac tgggaacata 5581 agtatctatg agaggattcc aggggatttt ggtgccggca gctactctca accatcagcc 5641 accttcagcc tagccaagct gcagcagctg accaacacca ttatggaccc tcatgccatg 5701 ccttatagcc attctcctgc tgtgacttcc tatgcaacca gtgtttctct gtccaataca 5761 ggactggctc agctggctcc atctcatccc ttagctggga ctcctcaagc acaagccacc 5821 atgacgccac ccccaaactt ggcatccact accatgaacc tcacatctcc tctgcttcag 5881 tgcaacatgt ctgccaccaa cattggcatt cctcacacgc agagattgca agggcaaatg 5941 ccagtgaagg ggcacatttc catccgctcc aagtctgcgc cactgccctc tgcggctgct 6001 caccagcagc agctgtatgg ccgtagccca tcggcagttg ccatgcaggc tggccctcgc 6061 gcactggctg ttcagcgtgg catgaacatg ggggttaatc tgatgcctac tcccgcctat 6121 aatgtcaatt ccatgaatat gaacaccttg aatgccatga acagctatcg aatgacacag 6181 cccatgatga acagcagtta ccatagtaac cctgcctaca tgaaccagac agcacagtat 6241 cctatgcaga tgcagatggg aatgatgggg agccaggcct atacccagca gcctatgcag 6301 cctaaccctc atgggaacat gatgtacaca ggcccctccc atcacagcta catgaacgct 6361 gctggcgtgc ccaagcagtc actcaacgga ccttacatga gaagatgagc aagatgaact 6421 tgcaatcaaa aacttaaata tatataaata aaggaacctt ttatactgac aaaccagaga 6481 aaaatggacc tttttccagt taaaatattg ctgtagattt agaggaattt ttctttggtt 6541 tattttattt tttagaaaac ctgatcttct cttttttttg ggttcatttt gttgtgggtt 6601 ttggttttct tcacaatctt gaacatttta cagtagaact catctaaaaa tggatttggg 6661 gatggggaaa catgcacaaa atcttttcat aattaaaaag agccttactt tctttacata 6721 ccacatggac agaatttgtg taaaagtgaa ttatctttat tttaaaatgt atgtttcccc 6781 tcactgtttg cagctcccaa tgttgtcatt tttaaatgtt atatacatct caagggttaa 6841 ccagaccctt tcctccaaac ccaacctttc atttcctact tcattccagc aggaggcact 6901 taggggagac tcggatgggg acatggagaa caacccaagc tccttaaact tattattatt 6961 gttaatatta ttattattat tattaataaa gtgaggcagg aaaatgcttc tccttttaaa 7021 atcccctcca ctcctcacac acacacacct cttgaaaccc ttccccaaga atgtttcttt 7081 atagacggac ttcattgaaa tctttgttgt tcttgaatca agtgtaatat aatttttttc 7141 ttctttttta aaatattccc actcagcact cagagacaca aaaatactgt aagtctcaat 7201 taacagcaga atctcagaga aaagctgttt gcaatccaaa tccagccttt ggaggaatag 7261 agatggtcaa ttaacaatca aaaagaggag attaacctct tgttttttta ccacctggtg 7321 aatcagccat aacgcacaca cacgccaccc agcctcttgt ttctagtatg tactttgaaa 7381 tgctaactga gggtcttgat gcttgagcct ttgactgata aaactcaaat agcagtcccc 7441 agtgatttgc ctcttaggtt ctttcttaaa ttgttggtgg atgactgtac attttagtga 7501 tttgaaaaat aactgacaaa ccattgaaac agtttatttt atgttggaag agatggcgca 7561 gatgtgtgtc agaagggaga tcacggtgtg agtttcgtag ctatttaagt gatacatacc 7621 tctagttttt gtatgtcttt tgagatcctg agttcatccc ctgtgaatca gagtgcacaa 7681 gcacctctcc tgtgagtggc taatgagaag agggacagac cgaccaccag cacagtaggg 7741 cagatctgga cagcagaatg ttataacgca agttcatgtg ttgctcccaa ctccattctc 7801 ttttctctcg tgcaaccagt ttgcccattc tcttcctatt acttgctcca gggataggta 7861 aaaaaaaaa // LOCUS HSU47924 222930 bp DNA PRI 02-APR-1997 DEFINITION Human chromosome 12p13 sequence, complete sequence. ACCESSION U47924 M86525 U72506 NID g1633547 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 222930) AUTHORS Ansari-Lari,M.A., Muzny,D.M., Lu,J., Lu,F., Lilley,C.E., Spanos,S., Malley,T. and Gibbs,R.A. TITLE A gene-rich cluster between the CD4 and triosephosphate isomerase genes at human chromosome 12p13 JOURNAL Genome Res. 6 (4), 314-326 (1996) MEDLINE 96303695 REFERENCE 2 (bases 1 to 222930) AUTHORS Ansari-Lari,M.A., Shen,Y., Muzny,D.M., Lee,W. and Gibbs,R.A. TITLE Large-scale sequencing in human chromosome 12p13: experimental and computational gene structure determination JOURNAL Genome Res. 7 (3), 268-280 (1997) MEDLINE 97228904 REFERENCE 3 (bases 1 to 222930) AUTHORS Ansari-Lari,M.A., Muzny,D.M., Lu,J., Lu,F., Lilley,C.E., Spanos,S., Malley,T. and Gibbs,R.A. TITLE Direct Submission JOURNAL Submitted (31-JAN-1996) M. Ali Ansari-Lari, Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA REFERENCE 4 (bases 1 to 222930) AUTHORS Ansari-Lari,M.A., Shen,Y., Muzny,D.M., Lee,W. and Gibbs,R.A. TITLE Direct Submission JOURNAL Submitted (24-OCT-1996) Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA REFERENCE 5 (bases 1 to 222930) AUTHORS Ansari-Lari,M.A., Shen,Y., Muzny,D.M., Lee,W. and Gibbs,R.A. TITLE Direct Submission JOURNAL Submitted (29-JAN-1997) Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA REFERENCE 6 (bases 1 to 222930) AUTHORS Ansari-Lari,M.A., Shen,Y., Muzny,D.M., Lee,W. and Gibbs,R.A. TITLE Direct Submission JOURNAL Submitted (02-APR-1997) Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA REMARK Large-Scale Sequencing in Human Chromosome 12p13: Experimental and Computational Gene Structure Determination. FEATURES Location/Qualifiers source 1..222930 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="12" /map="12p13" repeat_region 7..132 /rpt_family="Alu" repeat_region complement(16..119) /rpt_family="SVA" repeat_region complement(398..522) /rpt_family="Alu" mRNA join(1524..1650,12084..12199,12319..12483,26154..26312, 26771..27004,28068..28415,29142..29342,30433..30554, 30859..30926,31311..32821) /gene="CD4" /product="CD4" gene 1524..32821 /gene="CD4" repeat_region complement(3242..3520) /rpt_family="Alu" repeat_region 3302..3500 /rpt_family="SVA" repeat_region 3587..3871 /rpt_family="Alu" repeat_region complement(3613..3816) /rpt_family="SVA" repeat_region complement(5695..5927) /rpt_family="Alu" repeat_region complement(6082..6657) /rpt_family="Alu" repeat_region 6134..6638 /rpt_family="SVA" repeat_region complement(6803..7082) /rpt_family="Alu" repeat_region 6831..7065 /rpt_family="SVA" repeat_region complement(9029..9155) /rpt_family="MER5" repeat_region complement(9415..10690) /rpt_family="LTR5" repeat_region complement(10192..10460) /rpt_family="Alu" repeat_region 10246..10457 /rpt_family="SVA" repeat_region complement(10489..10690) /rpt_family="SVA" repeat_region complement(11545..11784) /rpt_family="Alu" repeat_region 11562..11805 /rpt_family="SVA" CDS join(12151..12199,12319..12483,26154..26312,26771..27004, 28068..28415,29142..29342,30433..30554,30859..30926, 31311..31341) /gene="CD4" /function="T-cell coreceptor; involved in antigen recognition; participant in signal transduction pathway" /note="major receptor for HIV-1; member of immunoglobulin supergene family; T cell surface glycoprotein T4" /codon_start=1 /product="surface antigen CD4" /db_xref="PID:g1732407" /translation="MNRGVPFRHLLLVLQLALLPAATQGKKVVLGKKGDTVELTCTAS QKKSIQFHWKNSNQIKILGNQGSFLTKGPSKLNDRADSRRSLWDQGNFPLIIKNLKIE DSDTYICEVEDQKEEVQLLVFGLTANSDTHLLQGQSLTLTLESPPGSSPSVQCRSPRG KNIQGGKTLSVSQLELQDSGTWTCTVLQNQKKVEFKIDIVVLAFQKASSIVYKKEGEQ VEFSFPLAFTVEKLTGSGELWWQAERASSSKSWITFDLKNKEVSVKRVTQDPKLQMGK KLPLHLTLPQALPQYAGSGNLTLALEAKTGKLHQEVNLVVMRATQLQKNLTCEVWGPT SPKLMLSLKLENKEAKVSKREKAVWVLNPEAGMWQCLLSDSGQVLLESNIKVLPTWST PVQPMALIVLGGVAGLLLFIGLGIFFCVRCRHRRRQAERMSQIKRLLSEKKTCQCPHR FQKTCSPI" repeat_region 12828..12918 /rpt_family="Alu" repeat_region complement(12928..13196) /rpt_family="Alu" repeat_region 13259..13536 /rpt_family="Alu" repeat_region complement(13584..14165) /rpt_family="Alu" repeat_region complement(14348..14608) /rpt_family="Alu" repeat_region complement(14759..15036) /rpt_family="Alu" repeat_region complement(15174..15450) /rpt_family="Alu" repeat_region 15227..15434 /rpt_family="SVA" repeat_region 15515..16126 /rpt_family="Alu" repeat_region complement(15541..15744) /rpt_family="SVA" repeat_region complement(15868..16070) /rpt_family="SVA" repeat_region 16251..16440 /rpt_family="L1" repeat_region 16613..17204 /rpt_family="Alu" repeat_region 17276..17583 /rpt_family="Alu" repeat_region complement(17319..17521) /rpt_family="SVA" repeat_region 17828..18496 /rpt_family="Alu" repeat_region complement(18330..18371) /rpt_family="Alu" repeat_region 18578..19641 /rpt_family="L1MB7" repeat_region 18578..19811 /rpt_family="L1MD1" repeat_region 18578..19811 /rpt_family="L1MD2" repeat_region 18578..18835 /rpt_family="L1PA15" repeat_region 18578..18835 /rpt_family="L1PA11" repeat_region 18578..19471 /rpt_family="L1ME3a" repeat_region 18578..19538 /rpt_family="L1ME2" repeat_region 18578..19543 /rpt_family="L1MC2" repeat_region 18578..18871 /rpt_family="L1MA2" repeat_region 18588..18888 /rpt_family="L1MA9" repeat_region 18588..18888 /rpt_family="L1MA5" repeat_region 18601..18868 /rpt_family="L1PB1" repeat_region 18601..19588 /rpt_family="L1MB3" repeat_region 18601..18868 /rpt_family="L1PB3" repeat_region 18601..19588 /rpt_family="L1MA10" repeat_region 18601..18835 /rpt_family="L1PA2" repeat_region 18672..18835 /rpt_family="L1" repeat_region 18675..18836 /rpt_family="L1PA7" repeat_region 18898..19185 /rpt_family="Alu" repeat_region complement(18924..19170) /rpt_family="SVA" repeat_region 19317..19462 /rpt_family="L1MA5" repeat_region 19317..19565 /rpt_family="L1MA9" repeat_region 19718..19796 /rpt_family="MER42a" repeat_region 19821..20099 /rpt_family="Alu" repeat_region complement(19851..20416) /rpt_family="SVA" repeat_region 20196..20470 /rpt_family="Alu" repeat_region 21223..21515 /rpt_family="Alu" repeat_region complement(21912..22193) /rpt_family="Alu" repeat_region 21966..22175 /rpt_family="SVA" repeat_region complement(23515..24060) /rpt_family="Alu" repeat_region 23574..23880 /rpt_family="SVA" repeat_region 24141..24366 /rpt_family="Alu" repeat_region 24420..24699 /rpt_family="Alu" repeat_region complement(24439..24670) /rpt_family="SVA" STS 24576..24908 /gene="CD4" /db_xref="dbSTS:Z52673" repeat_region complement(24847..24922) /rpt_family="Alu" repeat_region complement(24926..25090) /rpt_family="Alu" repeat_region complement(25095..25341) /rpt_family="Alu" repeat_region 27546..27710 /rpt_family="MIR" repeat_region complement(28932..28984) /rpt_family="MIR" polyA_signal 32790..32795 /gene="CD4" polyA_site 32821 /gene="CD4" mRNA join(33809..33912,35480..35775,37495..37684,38207..38364, 38664..39430) /gene="A" /product="A-1" /evidence=experimental mRNA join(33809..33912,35480..36777,37495..37684,38207..38364, 38664..39430) /gene="A" /product="A-2" gene 33809..39430 /gene="A" CDS join(35761..35775,37495..37684,38207..38364,38664..39215) /gene="A" /note="alternatively spliced product" /codon_start=1 /product="A-1" /db_xref="PID:g1732408" /translation="MLSTGVVSFFSLKSDSAPPWMVLAVLWCSMAQTLLLPSFIWSCE RYRADVRTVWEQCVAIMSEEDGDDDGGCDDYAEGRVCKVRFDANGATGPGSRDPAQVK LLPGRHMLFPPLERVHYLQVPLSRRLSHDETNIFSTPREPGSFLHKWSSSDDIRVLPA QSRALGGPPEYLGQRHRLEDEEDEEEAEGGGLASLRQFLESGVLGSGGGPPRGPGFFR EEITTFIDETPLPSPTASPGHSPRRPRPLGLSPRRLSLGSPESRAVGLPLGLSAGRRC SLTGGEESARAWGGSWGPGNPIFPQLTL" CDS join(35911..36777,37495..37684,38207..38364,38664..39215) /gene="A" /note="alternatively spliced product; A-2 form has seven putative transmembrane domains" /codon_start=1 /product="A-2" /db_xref="PID:g1732409" /translation="MARGGAGAEEASLRSNALSWLACGLLALLANAWIILSISAKQQK HKPLELLLCFLAGTHILMAAVPLTTFAVVQLRRQASSDYDWNESICKVFVSTYYTLAL ATCFTVASLSYHRMWMVRWPVNYRLSNAKKQALHAVMGIWMVSFILSTLPSIGWHNNG ERYYARGCQFIVSKIGLGFGVCFSLLLLGGIVMGLVCVAITFYQTLWARPRRARQARR VGGGGGTKAGGPGALGTRPAFEVPAIVVEDARGKRRSSLDGSESAKTSLQVTNLVSAI VFLYDSLTGVPILVVSFFSLKSDSAPPWMVLAVLWCSMAQTLLLPSFIWSCERYRADV RTVWEQCVAIMSEEDGDDDGGCDDYAEGRVCKVRFDANGATGPGSRDPAQVKLLPGRH MLFPPLERVHYLQVPLSRRLSHDETNIFSTPREPGSFLHKWSSSDDIRVLPAQSRALG GPPEYLGQRHRLEDEEDEEEAEGGGLASLRQFLESGVLGSGGGPPRGPGFFREEITTF IDETPLPSPTASPGHSPRRPRPLGLSPRRLSLGSPESRAVGLPLGLSAGRRCSLTGGE ESARAWGGSWGPGNPIFPQLTL" polyA_signal 39405..39410 /gene="A" polyA_site 39430 /gene="A" mRNA join(41838..42021,42363..42564,42649..42780,43226..43362, 43835..43924,45602..45654,45752..45819,45935..46059, 48995..49096,49433..49583,49742..49859,49968..50043, 51010..51150,51307..51856) /product="B" gene 41926..51856 /gene="B" CDS join(41926..42021,42363..42564,42649..42780,43226..43362, 43835..43924,45602..45654,45752..45819,45935..46059, 48995..49096,49433..49583,49742..49859,49968..50043, 51010..51150,51307..51471) /gene="B" /codon_start=1 /product="B" /db_xref="PID:g1200503" /translation="MHLQMREDMAKYRRMSGVRPQSFRDLETPPHWAAYDTGLELLGR QEAGLALPRLEEALQGSLAQMESCRADCEGPEEQQGAEEEEDGAASQGGLYEAIAGHW IQVLQCRQRCVGETATRPGRSFPVPDFLPNQLRRLHEAHAQVGNLSQAIENVLSVLLF YPEDEAAKRALNQYQAQLGEPRPGLGPREDIQRFILRSLGEKRQLYYAMEHLGTSFKD PDPWTPAALIPEALREKLREDQEKRPWDHEPVKPKPLTYWKDVLLLEGVTLTQDSRQL NGSERAVLDGLLTPAECGVLLQLAKDAAGAGARSGYRGRRSPHTPHERFEGLTVLKAA QLARAGTVGSQGAKLLLEVSERVRTLTQAYFSPERPLHLSFTHLVCRSAIEGEQEQRM DLSHPVHADNCVLDPDTGECWREPPAYTYRDYSGLLYLNDDFQGGDLFFTEPNALTVT ARVRPRCGRLVAFSSGVENPHGVWAVTRGRRCALALWHTWAPEHREQEWIEAKELLQE SQEEEEEEEEEMPSKDPSPEPPSRRHQRVQDKTGRAPRVREEL" repeat_region complement(44371..44571) /rpt_family="MER20" repeat_region complement(44603..44913) /rpt_family="Alu" repeat_region 44636..44886 /rpt_family="SVA" repeat_region 44940..45191 /rpt_family="Alu" repeat_region complement(44944..45183) /rpt_family="SVA" repeat_region 45238..45403 /rpt_family="MIR" repeat_region 46674..46956 /rpt_family="Alu" repeat_region complement(46690..46887) /rpt_family="SVA" repeat_region 47159..47297 /rpt_family="MIR" repeat_region 47321..47572 /rpt_family="Alu" repeat_region complement(47337..47566) /rpt_family="SVA" repeat_region 47964..48236 /rpt_family="Alu" repeat_region complement(47980..48210) /rpt_family="SVA" polyA_signal 51834..51839 /gene="B" polyA_site 51856 /gene="B" mRNA join(52221..52430,52879..53043,53268..53354,53596..53634, 54980..55086,55184..55247,55373..55535,55642..55708, 55787..55988,57596..57812,58802..59402) /gene="GNB3" /product="GNB3" gene 52221..59402 /gene="GNB3" CDS join(53298..53354,53596..53634,54980..55086,55184..55247, 55373..55535,55642..55708,55787..55988,57596..57812, 58802..58908) /gene="GNB3" /function="participant in signal transduction pathways" /note="one of the subunits of heterotrimeric G-proteins" /codon_start=1 /product="G-protein beta-3 chain" /db_xref="PID:g1732410" /translation="MGEMEQLRQEAEQLKKQIADARKACADVTLAELVSGLEVVGRVQ MRTRRTLRGHLAKIYAMHWATDSKLLVSASQDGKLIVWDSYTTNKVHAIPLRSSWVMT CAYAPSGNFVACGGLDNMCSIYNLKSREGNVKVSRELSAHTGYLSCCRFLDDNNIVTS SGDTTCALWDIETGQQKTVFVGHTGDCMSLAVSPDFNLFISGACDASAKLWDVREGTC RQTFTGHESDINAICFFPNGEAICTGSDDASCRLFDLRADQELICFSHESIICGITSV AFSLSGRLLFAGYDDFNCNVWDSMKSERVGILSGHDNRVSCLGVTADGMAVATGSWDS FLKIWN" repeat_region complement(54192..54265) /rpt_family="MIR" repeat_region 54277..54407 /rpt_family="Alu" repeat_region complement(56122..56405) /rpt_family="Alu" repeat_region 56136..56552 /rpt_family="SVA" repeat_region complement(56441..56725) /rpt_family="Alu" repeat_region complement(56731..57162) /rpt_family="MER42c" polyA_signal 59382..59387 /gene="GNB3" polyA_site 59402 /gene="GNB3" repeat_region 59944..60339 /rpt_family="Alu" repeat_region complement(60070..60309) /rpt_family="SVA" mRNA complement(join(60838..61208,61333..61439,61575..61868, 62477..62606,62843..63019,63232..63399)) /gene="C8" /note="Based on available overlapping partial EST sequences and prediction by FGENEH gene prediction program" /product="C8" /evidence=not_experimental gene complement(60838..63399) /gene="C8" /evidence=not_experimental CDS complement(join(61053..61208,61333..61439,61575..61868, 62477..62606,62843..62962)) /gene="C8" /codon_start=1 /evidence=not_experimental /product="C8" /db_xref="PID:g1633564" /translation="MGSAKSVPVTPARPPPHNKHLARVADPRSPSAGILRTPIQVESS PQPGLPAGEQLEGLKHAQDSDPRSPTLGIARTPMKTSSGDPPSPLVKQLSEVFETEDS KSNLPPEPVLPPEAPLSSELDLPLGTQLSVEEQMPPWNQTEFPSKQVFSKEEARQPTE TPVASQSSDKPSRDPETPRSSGSMRNRWKPNSSKVLGRSPLTILQDDNSPGTLTLRQG KRPSPLSENVSELKEGAILGTGRLLKTGGRAWEQGQDHDKENQHFPLVES" repeat_region complement(62021..62319) /rpt_family="Alu" repeat_region 62200..62330 /rpt_family="SVA" mRNA join(64138..64300,67411..67536,67765..67831,68027..68160, 68315..68460,68717..68901,69639..69733,70434..70627, 71480..71551,72161..72248,72376..72501,72963..73116, 73453..73627,74480..74568,75196..75318,75840..75983, 76060..76205,76732..76885,77174..77258,77994..78642) /gene="ISOT" /product="ISOT-1" mRNA join(64138..64300,67411..67536,67765..67831,68027..68160, 68315..68460,68717..68901,69639..69733,70434..70627, 71480..71551,72161..72248,72376..72501,72963..73116, 73453..73627,74480..74568,75196..75387,75840..75983, 76060..76205,76732..76885,77174..77258,77994..78642) /gene="ISOT" /product="ISOT-2" gene 64138..78642 /gene="ISOT" CDS join(64190..64300,67411..67536,67765..67831,68027..68160, 68315..68460,68717..68901,69639..69733,70434..70627, 71480..71551,72161..72248,72376..72501,72963..73116, 73453..73627,74480..74568,75196..75387,75840..75983, 76060..76205,76732..76885,77174..77258,77994..78087) /gene="ISOT" /function="ubiquitin carboxyl terminal hydrolase" /note="alternatively spliced product; Cys, His, His part of the active site" /codon_start=1 /product="isopeptidase T" /db_xref="PID:g1732412" /translation="MAELSEEALLSVLPTIRVPKAGDRVHKDECAFSFDTPESEGGLY ICMNTFLGFGKQYVERHFNKTGQRVYLHLRRTRRPKEEDPATGTGDPPRKKPTRLAIG VEGGFDLSEEKFELDEDVKIVILPDYLEIARDGLGGLPDIVRDRVTSAVEALLSADSA SRKQEVQAWDGEVRQVSKHAFSLKQLDNPARIPPCGWKCSKCDMRENLWLNLTDGSIL CGRRYFDGSGGNNHAVEHYRETGYPLAVKLGTITPDGADVYSYDEDDMVLDPSLAEHL SHFGIDMLKMQKTDKTMTELEIDMNQRIGEWELIQESGVPLKPLFGPGYTGIRNLGNS CYLNSVVQVLFSIPDFQRKYVDKLEKIFQNAPTDPTQDFSTQVAKLGHGLLSGEYSKP VPESGDGERVPEQKEVQDGIAPRMFKALIGKGHPEFSTNRQQDAQEFFLHLINMVERN CRSSENPNEVFRFLVEEKIKCLATEKVKYTQRVDYIMQLPVPMDAALNKEELLEYEEK KRQAEEEKMALPELVRAQVPFSSCLEAYGAPEQVDDFWSTALQAKSVAVKTTRFASFP DYLVIQIKKFTFGLDWVPKKLDVSIEMPEELDISQLRGTGLQPGEEELPDIAPPLVTP DEPKGSLGFYGNEDEDSFCSPHFSSPTSPMLDESVIIQLVEMGFPMDACRKAVYYTGN SGAEAAMNWVMSHMDDPDFANPLILPGSSGPGSTSAAADPPPEDCVTTIVSMGFSRDQ ALKALRATNNSLERAVDWIFSHIDDLDAEAAMDISEGRSAADSISESVPVGPKVRDGP GKYQLFAFISHMGTSTMCGHYVCHIKKEGRWVIYNDQKVCASEKPPKDLGYIYFYQRV AS" CDS join(64190..64300,67411..67536,67765..67831,68027..68160, 68315..68460,68717..68901,69639..69733,70434..70627, 71480..71551,72161..72248,72376..72501,72963..73116, 73453..73627,74480..74568,75196..75318,75840..75983, 76060..76205,76732..76885,77174..77258,77994..78087) /gene="ISOT" /function="ubiquitin carboxyl terminal hydrolase" /note="alternatively spliced product; Cys, His, His part of the active site" /codon_start=1 /product="isopeptidase T" /db_xref="PID:g1732411" /translation="MAELSEEALLSVLPTIRVPKAGDRVHKDECAFSFDTPESEGGLY ICMNTFLGFGKQYVERHFNKTGQRVYLHLRRTRRPKEEDPATGTGDPPRKKPTRLAIG VEGGFDLSEEKFELDEDVKIVILPDYLEIARDGLGGLPDIVRDRVTSAVEALLSADSA SRKQEVQAWDGEVRQVSKHAFSLKQLDNPARIPPCGWKCSKCDMRENLWLNLTDGSIL CGRRYFDGSGGNNHAVEHYRETGYPLAVKLGTITPDGADVYSYDEDDMVLDPSLAEHL SHFGIDMLKMQKTDKTMTELEIDMNQRIGEWELIQESGVPLKPLFGPGYTGIRNLGNS CYLNSVVQVLFSIPDFQRKYVDKLEKIFQNAPTDPTQDFSTQVAKLGHGLLSGEYSKP VPESGDGERVPEQKEVQDGIAPRMFKALIGKGHPEFSTNRQQDAQEFFLHLINMVERN CRSSENPNEVFRFLVEEKIKCLATEKVKYTQRVDYIMQLPVPMDAALNKEELLEYEEK KRQAEEEKMALPELVRAQVPFSSCLEAYGAPEQVDDFWSTALQAKSVAVKTTRFASFP DYLVIQIKKFTFGLDWVPKKLDVSIEMPEELDISQLRGTGLQPGEEELPDIAPPLVTP DEPKAPMLDESVIIQLVEMGFPMDACRKAVYYTGNSGAEAAMNWVMSHMDDPDFANPL ILPGSSGPGSTSAAADPPPEDCVTTIVSMGFSRDQALKALRATNNSLERAVDWIFSHI DDLDAEAAMDISEGRSAADSISESVPVGPKVRDGPGKYQLFAFISHMGTSTMCGHYVC HIKKEGRWVIYNDQKVCASEKPPKDLGYIYFYQRVAS" repeat_region 66505..66753 /rpt_family="Alu" repeat_region complement(66693..66745) /rpt_family="SVA" repeat_region 69022..69307 /rpt_family="Alu" repeat_region complement(69037..69253) /rpt_family="SVA" repeat_region 70771..70875 /rpt_family="Alu" repeat_region complement(71676..71955) /rpt_family="Alu" repeat_region 71683..71938 /rpt_family="SVA" repeat_region complement(74168..74245) /rpt_family="Alu" repeat_region 76418..76707 /rpt_family="Alu" repeat_region complement(76444..76697) /rpt_family="SVA" repeat_region complement(77444..77552) /rpt_family="Alu" polyA_signal 78622..78627 /gene="ISOT" polyA_site 78642 /gene="ISOT" mRNA join(79204..79691,80874..80997,81109..81193,81268..81400, 81698..81783,82059..82146,82275..83113) /gene="TPI" /product="TPI" gene 79204..83113 /gene="TPI" CDS join(79577..79691,80874..80997,81109..81193,81268..81400, 81698..81783,82059..82146,82275..82393) /gene="TPI" /function="interconversion of dihydroacetone phosphate and glyceraldehyde 3-phosphate" /codon_start=1 /product="triosephosphate isomerase" /db_xref="PID:g1200507" /translation="MAPSRKFFVGGNWKMNGRKQSLGELIGTLNAAKVPADTEVVCAP PTAYIDFARQKLDPKIAVAAQNCYKVTNGAFTGEISPGMIKDCGATWVVLGHSERRHV FGESDELIGQKVAHALAEGLGVIACIGEKLDEREAGITEKVVFEQTKVIADNVKDWSK VVLAYEPVWAIGTGKTATPQQAQEVHEKLRGWLKSNVSDAVAQSTRIIYGGSVTGATC KELASQPDVDGFLVGGASLKPEFVDIINAKQ" STS 82394..82838 /gene="TPI" /db_xref="dbSTS:G07200" mRNA complement(join(83202..83329,84248..84996)) /gene="C9" /note="Prediction is based on available partial EST sequences, Grail-2 and FGENEH gene prediction programs" /product="C9" /evidence=not_experimental gene complement(83202..84996) /gene="C9" /evidence=not_experimental CDS complement(join(83202..83329,84248..84911)) /gene="C9" /note="similar to C. elegans sequence encoded by GenBank Accession Number U13875" /codon_start=1 /evidence=not_experimental /product="C9" /db_xref="PID:g1732423" /translation="MGQTALAGGSSSTPTPQALYPDLSCPEGLEELLSAPPPDLGAQR RHGWNPKDCSENIEVKEGGLYFERRPVAQSTDGARGKRGYSRGLHAWEISWPLEQRGT HAVVGVATALAPLQTDHYAALLGSNSESWGWDIGRGKLYHQSKGPGAPQYPAGTQGEQ LEVPERLLVVLDMEEGTLGYAIGGTYLGPAFRGLKGRTLYPAVSAVWGQCQVRIRYLG ERRAEPHSLLHLSRLCVRHNLGDTRLGQVSALPLPPAMKRYLLYQ" misc_feature 84088..84381 /gene="C9" /note="CpG island; similar to sequence with GenBank Accession Number Z65668" /note="Region: Z65668)" misc_feature 84795..85150 /note="similar to EST sequence with GenBank Accession number AA014095" /note="Region: AA014095" misc_feature 85306..85562 /note="CpG island; similar to sequences with GenBank Accession Numbers Z61395 and Z64769" /note="Region: Z61395 and Z64769" repeat_region complement(86125..86662) /rpt_family="Alu" repeat_region 86213..86560 /rpt_family="SVA" repeat_region 86698..86794 /rpt_family="MIR" repeat_region 86930..86980 /rpt_family="MIR" repeat_region 87339..87599 /rpt_family="SVA" repeat_region complement(87346..87624) /rpt_family="Alu" repeat_region complement(87982..88290) /rpt_family="Alu" repeat_region 88377..88662 /rpt_family="Alu" repeat_region 88829..89107 /rpt_family="Alu" repeat_region 89709..90452 /rpt_family="Alu" repeat_region complement(89857..90396) /rpt_family="SVA" repeat_region 90522..90769 /rpt_family="Alu" repeat_region 90762..91236 /rpt_family="L1PB3" repeat_region 90762..91236 /rpt_family="L1PB1" repeat_region 90987..91237 /rpt_family="L1PA15" repeat_region 90987..91155 /rpt_family="L1PA7" repeat_region 90987..91151 /rpt_family="L1PA11" repeat_region 90989..91249 /rpt_family="L1" repeat_region 91368..91642 /rpt_family="Alu" repeat_region complement(91383..91585) /rpt_family="SVA" repeat_region 91835..92127 /rpt_family="Alu" repeat_region 92273..92569 /rpt_family="Alu" repeat_region complement(92299..92510) /rpt_family="SVA" repeat_region complement(92604..92677) /rpt_family="Alu" repeat_region complement(92711..93274) /rpt_family="Alu" repeat_region 93454..94019 /rpt_family="Alu" repeat_region complement(93675..93967) /rpt_family="SVA" repeat_region complement(94129..94176) /rpt_family="L1ME2" repeat_region 94209..94306 /rpt_family="Alu" repeat_region complement(94378..94476) /rpt_family="L1ME3a" repeat_region complement(94378..94476) /rpt_family="L1MD2" repeat_region complement(94378..94476) /rpt_family="L1MD1" repeat_region complement(94378..94476) /rpt_family="L1MB7" repeat_region complement(94378..94476) /rpt_family="L1MC2" repeat_region complement(94378..94476) /rpt_family="L1ME2" repeat_region complement(94379..94475) /rpt_family="L1PA15" repeat_region complement(94379..94475) /rpt_family="L1PA11" repeat_region complement(94477..94673) /rpt_family="Alu" repeat_region complement(94674..94755) /rpt_family="Alu" repeat_region complement(95090..95364) /rpt_family="Alu" repeat_region 95130..95338 /rpt_family="SVA" mRNA 96025..96610 /gene="RPL13-2" /pseudo /note="sequence of the transcript obtained from an RT-PCR product; even though an ORF exists, this region could be a pseudogene." gene 96025..96610 /gene="RPL13-2" /pseudo CDS 96073..96507 /gene="RPL13-2" /note="similar to 60S ribosomal protein L13 (breast basic conserved protein), Swiss-Prot Accession Number P26373" /codon_start=1 /pseudo /product="RPL13-2" /db_xref="PID:g1732413" mRNA 96742..97748 /pseudo /note="destrin 2; sequence of the transcript was obtained from an RT-PCR product; even though an ORF exists, this region could be a pseudogene." CDS 96960..97394 /note="similar to destrin (actin depolymerizing factor, ADF), Swiss-Prot Accession Number P18282" /codon_start=1 /pseudo /product="destrin 2" /db_xref="PID:g1732414" repeat_region complement(97372..97645) /rpt_family="Alu" repeat_region 97426..97631 /rpt_family="SVA" repeat_region complement(97873..98831) /rpt_family="LTR5" repeat_region complement(98508..98830) /rpt_family="SVA" repeat_region 98905..99191 /rpt_family="Alu" repeat_region complement(99458..99574) /rpt_family="Alu" repeat_region 99702..100000 /rpt_family="Alu" repeat_region complement(99722..99944) /rpt_family="SVA" repeat_region complement(100129..100383) /rpt_family="Alu" repeat_region complement(100811..101082) /rpt_family="Alu" repeat_region complement(101359..101635) /rpt_family="Alu" repeat_region 101420..101618 /rpt_family="SVA" repeat_region complement(101942..102436) /rpt_family="Alu" repeat_region 102180..102410 /rpt_family="SVA" misc_feature 102324..102515 /note="similar to EST sequence with GenBank Accession Number W02302" /note="Region: W02302" repeat_region 103846..103912 /rpt_family="Alu" repeat_region complement(104086..104365) /rpt_family="Alu" repeat_region 104144..104344 /rpt_family="SVA" repeat_region 104590..104720 /rpt_family="Alu" repeat_region 104836..105099 /rpt_family="Alu" repeat_region 105225..105473 /rpt_family="Alu" repeat_region complement(105240..105891) /rpt_family="SVA" STS 105450..105741 /db_xref="dbSTS:L30092" repeat_region 105664..105945 /rpt_family="Alu" repeat_region 106332..106691 /rpt_family="THE1b" repeat_region 106332..106691 /rpt_family="MSTa" repeat_region 106566..106652 /rpt_family="MSTc" repeat_region 107001..107263 /rpt_family="Alu" repeat_region complement(107249..107549) /rpt_family="SVA" repeat_region 107313..108251 /rpt_family="Alu" repeat_region complement(107808..108172) /rpt_family="SVA" repeat_region 108873..109230 /rpt_family="MLT1f" repeat_region 109047..109230 /rpt_family="MLT1e" repeat_region 109274..109605 /rpt_family="Alu" repeat_region 109999..110282 /rpt_family="Alu" repeat_region complement(110025..110232) /rpt_family="SVA" repeat_region 110291..110590 /rpt_family="Alu" repeat_region complement(110326..110529) /rpt_family="SVA" repeat_region complement(110653..110821) /rpt_family="Alu" repeat_region complement(110843..111255) /rpt_family="Alu" repeat_region 110901..111107 /rpt_family="SVA" repeat_region 111348..111629 /rpt_family="Alu" repeat_region 111889..111935 /rpt_family="Alu" misc_feature 112013..112324 /note="similar to EST sequence with GenBank Accession Number W27464" /note="Region: W27464" repeat_region complement(112542..112772) /rpt_family="Alu" repeat_region 112620..112783 /rpt_family="SVA" repeat_region 113154..113491 /rpt_family="Alu" repeat_region complement(113229..113437) /rpt_family="SVA" repeat_region complement(113698..113975) /rpt_family="Alu" repeat_region 113761..114108 /rpt_family="SVA" repeat_region complement(113992..114438) /rpt_family="Alu" repeat_region 114218..114416 /rpt_family="SVA" repeat_region 115022..115280 /rpt_family="Alu" repeat_region complement(115651..115757) /rpt_family="MIR" gene 116848..126238 /gene="B7" mRNA join(<116851..116913,117595..117769,118419..>118641) /gene="B7" /note="B9; sequence was obtained from RT-PCR product and represents alternatively spliced exons of the gene B7; full transcript containing B9 exons has not been determined" mRNA join(<116851..116913,118419..>118641) /gene="B7" /note="B8; sequence obtained from RT-PCR product and represents alternatively spliced exons in B7 gene; full transcript containing the B8 sequence has not been determined" mRNA join(116851..116913,117595..117769,117855..117964, 118419..118672,119325..119455,121900..122036, 125901..126238) /gene="B7" /note="sequence corresponding to B7 transcript was obtained by sequencing an RT-PCR product and a cDNA clone (IMAGE consortium ID # 302632; GenBank Accession Number W37069)" /product="B7" /evidence=experimental CDS join(117644..117769,117855..117964,118419..118672, 119325..119455,121900..122036,125901..126081) /gene="B7" /note="leucine rich protein" /codon_start=1 /product="B7" /db_xref="PID:g1732415" /translation="MSDEDDLEDSEPDQDDSEKEEDEKETEEGEDYRKEGEEFPEEWL PTPLTEDMMKEGLSLLCKTGNGLAHAYVKLEVKERDLTDIYLLRSYIHLRYVDISENH LTDLSPLNYLTHLLWLKADGNRLRSAQMNELPYLQIASFAYNQITDTEGISHPRLETL NLKGNSIHMVTGLDPEKLISLHTVELRGNQLESTLGINLPKLKNLYLAQNMLKKVEGL EDLSNLTTLHLRDNQIDTLSGFSREMKSLQYLNLRRSKTLAFRPDQTPRGSHHMYDRE QRMPVFAPKLEIHHNLRPRICSVPVLWAVWGAEWGA" repeat_region 119788..120254 /rpt_family="MER21B" repeat_region 119817..120215 /rpt_family="MER21" repeat_region 120325..120841 /rpt_family="Alu" repeat_region complement(120351..120813) /rpt_family="SVA" repeat_region 121095..121214 /rpt_family="HSATI" repeat_region complement(121137..121236) /rpt_family="HSATI" repeat_region 121158..121267 /rpt_family="HSATI" repeat_region complement(121167..121270) /rpt_family="HSATI" repeat_region 121174..121301 /rpt_family="HSATI" repeat_region complement(121226..121339) /rpt_family="HSATI" repeat_region 121250..121363 /rpt_family="HSATI" repeat_region complement(121311..121416) /rpt_family="HSATI" repeat_region 121328..121519 /rpt_family="HSATI" repeat_region complement(121424..121519) /rpt_family="HSATI" repeat_region 122332..122395 /rpt_family="Alu" repeat_region 122447..122912 /rpt_family="Alu" repeat_region complement(122645..122695) /rpt_family="SVA" repeat_region 123746..124021 /rpt_family="Alu" repeat_region complement(124207..124377) /rpt_family="Alu" repeat_region complement(125585..125861) /rpt_family="Alu" repeat_region 125616..125846 /rpt_family="SVA" polyA_signal 126220..126225 /gene="B7" polyA_site 126225 /gene="B7" mRNA join(126608..126669,127831..127927,128427..128522, 128681..128739,129048..129117,129591..129724, 129950..130172,131576..131773,133590..133791, 134065..134173,134353..134411,134740..135704) /gene="HSENO2" /product="neuron specific gamma-enolase" gene 126608..135704 /gene="HSENO2" CDS join(127843..127927,128427..128522,128681..128739, 129048..129117,129591..129724,129950..130172, 131576..131773,133590..133791,134065..134173, 134353..134411,134740..134809) /gene="HSENO2" /note="similar to sequences encoded by GenBank Accession Numbers X51956 and M22349" /codon_start=1 /product="neuron specific gamma-enolase" /db_xref="PID:g1732416" /translation="MSIEKIWAREILDSRGNPTVEVDLYTAKGLFRAAVPSGASTGIY EALELRDGDKQRYLGKGVLKAVDHINSTIAPALISSGLSVVEQEKLDNLMLELDGTEN KSKFGANAILGVSLAVCKAGAAERELPLYRHIAQLAGNSDLILPVPAFNVINGGSHAG NKLAMQEFMILPVGAESFRDAMRLGAEVYHTLKGVIKDKYGKDATNVGDEGGFAPNIL ENSEALELVKEAIDKAGYTEKIVIGMDVAASEFYRDGKYDLDFKSPTDPSRYITGDQL GALYQDFVRDYPVVSIEDPFDQDDWAAWSKFTANVGIQIVGDDLTVTNPKRIERAVEE KACNCLLLKVNQIGSVTEAIQACKLAQENGWGVMVSHRSGETEDTFIADLVVGLCTGQ IKTGAPCRSERLAKYNQLMRIEEELGDEARFAGHNFRNPSVL" repeat_region complement(130361..130638) /rpt_family="Alu" repeat_region 130431..130622 /rpt_family="SVA" repeat_region 130710..130999 /rpt_family="Alu" repeat_region complement(130736..130990) /rpt_family="SVA" repeat_region complement(132418..132858) /rpt_family="Alu" repeat_region 132470..132682 /rpt_family="SVA" repeat_region complement(132876..133076) /rpt_family="Alu" repeat_region 132893..133061 /rpt_family="SVA" repeat_region 133127..133409 /rpt_family="Alu" repeat_region complement(133148..133349) /rpt_family="SVA" STS 134794..135704 /gene="HSENO2" /db_xref="dbSTS:G10506" STS 134810..135209 /gene="HSENO2" /db_xref="dbSTS:G11165" polyA_signal 135686..135691 /gene="HSENO2" polyA_site 135691 /gene="HSENO2" repeat_region complement(138494..138765) /rpt_family="Alu" repeat_region 138524..138736 /rpt_family="SVA" misc_feature 139348..139597 /note="similar to EST sequence with GenBank Accession Number M78236" /note="Region: M78236" mRNA join(140328..140393,145849..146037,146185..146322, 146474..146587,147556..149570,149854..150076, 150490..151186,152889..153032,153383..153563, 153756..154329) /gene="HUMDRPLA1" /product="DRPLA" gene 140328..154329 /gene="HUMDRPLA1" repeat_region 142534..142811 /rpt_family="Alu" repeat_region complement(142549..142756) /rpt_family="SVA" repeat_region 143441..144051 /rpt_family="Alu" repeat_region complement(143467..143990) /rpt_family="SVA" repeat_region 144985..145157 /rpt_family="MIR" repeat_region complement(145236..145488) /rpt_family="Alu" repeat_region 145266..145468 /rpt_family="SVA" CDS join(146011..146037,146185..146322,146474..146587, 147556..149570,149854..150076,150490..151186, 152889..153032,153383..153563,153756..153789) /gene="HUMDRPLA1" /note="human dentatorubral and pallidoluysian atrophy gene; sequence similar to other DRPLA sequences encoded by GenBank Accession Numbers D31840 and U23851" /codon_start=1 /product="DRPLA" /db_xref="PID:g1732417" /translation="MKTRQNKDSMSMRSGRKKEAPGPREELRSRGRASPGGVSTSSSD GKAEKSRQTAKKARVEEASTPKVNKQGRSEEISESESEETNAPKKTKTEQELPRPQSP SDLDSLDGRSLNDDGSSDPRDIDQDNRSTSPSIYSPGSVENDSDSSSGLSQGPARPYH PPPLFPPSPQPPDSTPRQPEASFEPHPSVTPTGYHAPMEPPTSRMFQAPPGAPPPHPQ LYPGGTGGVLSGPPMGPKGGGAASSVGGPNGGKQHPPPTTPISVSSSGASGAPPTKPP TTPVGGGNLPSAPPPANFPHVTPNLPPPPALRPLNNASASPPGLGAQPLPGHLPSPHA MGQGMGGLPPGPEKGPTLAPSPHSLPPASSSAPAPPMRFPYSSSSSSSAAASSSSSSS SSSASPFPASQALPSYPHSFPPPTSLSVSNQPPKYTQPSLPSQAVWSQGPPPPPPYGR LLANSNAHPGPFPPSTGAQSTAHPPVSTHHHHHQQQQQQQQQQQQQQQQQQQHHGNSG PPPPGAFPHPLEGGSSHHAHPYAMSPSLGSLRPYPPGPAHLPPPHSQVSYSQAGPNGP PVSSSSNSSSSTSQGSYPCSHPSPSQGPQGAPYPFPPVPTVTTSSATLSTVIATVASS PAGYKTASPPGPPPYGKRAPSPGAYKTATPPGYKPGSPPSFRTGTPPGYRGTSPPAGP GTFKPGSPTVGPGPLPPAGPSGLPSLPPPPAAPASGPPLSATQIKQEPAEEYETPESP VPPARSPSPPPKVVDVPSHASQSARFNKHLDRGFNSCARSDLYFVPLEGSKLAKKRAD LVEKVRREAEQRAREEKEREREREREKEREREKERELERSVKLAQEGRAPVECPSLGP VPHRPPFEPGSAVATVPPYLGPDTPALRTLSEYARPHVMSPGNRNHPFYVPLGAVDPG LLGYNVPALYSSDPAAREREREARERDLRDRLKPGFEVKPSELEPLHGVPGPGLDPFP RHGGLALQPGPPGLHPFPFHPSLGPLERERLALAAGPALRPDMSYAERLAAERQHAER VAALGNDPLARLQMLNVTPHHHQHSHIHSHLHLHQQDAIHAASASVHPLIDPLASGSH LTRIPYPAGTLPNPLLPHPLHENEVLRHQLFAAPYRDLPASLSAPMSAAHQLQAMHAQ SAELQRLALEQQQWLHAHHPLHSVPLPAQEDYYSHLKKESDKPL" repeat_region complement(146915..147196) /rpt_family="Alu" repeat_region 146976..147170 /rpt_family="SVA" repeat_region complement(151522..151803) /rpt_family="Alu" repeat_region 151581..151772 /rpt_family="SVA" repeat_region complement(152005..152290) /rpt_family="Alu" repeat_region 152011..152536 /rpt_family="SVA" repeat_region complement(152321..152576) /rpt_family="Alu" polyA_site 154329 /gene="HUMDRPLA1" repeat_region 154650..154847 /rpt_family="Alu" repeat_region 155046..155317 /rpt_family="Alu" repeat_region complement(155059..155311) /rpt_family="SVA" misc_feature 155619..155627 /gene="U7 snRNA" /note="distal sequence element (DSE)" /note="Region: distal sequence element (DSE)" gene 155619..155887 /gene="U7 snRNA" misc_feature 155764..155782 /gene="U7 snRNA" /note="proximal sequence element (PSE)" /note="Region: proximal sequence element (PSE)" snRNA <155824..>155887 /gene="U7 snRNA" /note="similar to other U7 snRNA sequences, GenBank Accession Numbers M17910 and X54165; full span of the gene is not known; forms the RNA portion of small nuclear ribonucleoprotein (snRNP); involved in post-transcriptional 3'-end processing of histone mRNA" /product="U7 snRNA" mRNA join(156073..156182,156485..156661,157780..158011) /gene="C10" /note="Evidence: Available partial EST sequences; FGENEH gene prediction program" /product="C10" /evidence=not_experimental gene 156073..158011 /gene="C10" CDS join(156131..156182,156485..156661,157780..157931) /gene="C10" /codon_start=1 /evidence=not_experimental /product="C10" /db_xref="PID:g1633566" /translation="MASASTQPAALSAEQAKVVLAEVIQAFSAPENAVRMDEARDNAC NDMGKMLQFVLPVATQIQQEVIKAYGFSCDGEGVLKFARLVKSYEAQDPEIASLSGKL KALFLPPMTLPPHGPAAGGSVAAS" polyA_signal 157991..157997 /gene="C10" mRNA join(158586..158748,163992..164186,166814..167003, 167160..167276,167381..167494,167669..167765, 168151..168230,168428..168577,169663..169794, 169928..170082,171936..172003,172097..172248, 172353..172444,172815..172954,173135..173322) /gene="HSPTP1CG" /note="alternatively spliced product; expressed in non-hematopoietic cells" /product="protein tyrosine phosphatase 1C" gene 158586..173322 /note="For reference please see GenBank Accession Number U15528-U15537; M77273; X82818; X82817" /gene="PTP1C" CDS join(158735..158748,163992..164186,166814..167003, 167160..167276,167381..167494,167669..167765, 168151..168230,168428..168577,169663..169794, 169928..170082,171936..172003,172097..172248, 172353..172444,172815..172929) /gene="HSPTP1CG" /note="alternatively spliced product; expressed in non-hematopoietic cells" /codon_start=1 /product="protein tyrosine phosphatase 1C" /db_xref="PID:g1732418" /translation="MLSRGVGDQVTHIRIQNSGDFYDLYGGEKFATLTELVEYYTQQQ GVLQDRDGTIIHLKYPLNCSDPTSERWYHGHMSGGQAETLLQAKGEPWTFLVRESLSQ PGDFVLSVLSDQPKAGPGSPLRVTHIKVMCEGGRYTVGGLETFDSLTDLVEHFKKTGI EEASGAFVYLRQPYYATRVNAADIENRVLELNKKQESEDTAKAGFWEEFESLQKQEVK NLHQRLEGQRPENKGKNRYKNILPFDHSRVILQGRDSNIPGSDYINANYIKNQLLGPD ENAKTYIASQGCLEATVNDFWQMAWQENSRVIVMTTREVEKGRNKCVPYWPEVGMQRA YGPYSVTNCGEHDTTEYKLRTLQVSPLDNGDLIREIWHYQYLSWPDHGVPSEPGGVLS FLDQINQRQESLPHAGPIIVHCSAGIGRTGTIIVIDMLMENISTKGLDCDIDIQKTIQ MVRAQRSGMVQTEAQYKFIYVAIAQFIETTKKKLEVLQSQKGQESEYGNITYPPAMKN AHAKASRTSSKHKEDVYENLHTKNKREEKVKKQRSADKEKSKGSLKRK" repeat_region complement(159191..159305) /rpt_family="MIR2" repeat_region 159397..159685 /rpt_family="Alu" repeat_region complement(159424..159486) /rpt_family="SVA" repeat_region 160023..160451 /rpt_family="Alu" repeat_region complement(160040..160144) /rpt_family="SVA" repeat_region 160677..160962 /rpt_family="Alu" repeat_region complement(160694..160949) /rpt_family="SVA" repeat_region 161510..161628 /rpt_family="MIR" repeat_region complement(161825..162088) /rpt_family="Alu" repeat_region 161842..162062 /rpt_family="SVA" mRNA join(163365..163529,163618..163740,163992..164186, 166814..167003,167160..167276,167381..167494, 167669..167765,168151..168230,168428..168577, 169663..169794,169928..170082,171936..172003, 172097..172248,172353..172444,172815..172954, 173135..173322) /gene="HSPTP1CG" /note="alernatively spliced product; expressed in hematopoietic cells" /product="protein tyrsoine phosphatase 1C" CDS join(163522..163529,163618..163740,163992..164186, 166814..167003,167160..167276,167381..167494, 167669..167765,168151..168230,168428..168577, 169663..169794,169928..170082,171936..172003, 172097..172248,172353..172444,172815..172929) /gene="PTP1C" /note="alternatively spliced product; expressed in hematopoietic cells" /codon_start=1 /product="protein tyrosine phosphatase 1C" /db_xref="PID:g1732419" /translation="MVRWFHRDLSGLDAETLLKGRGVHGSFLARPSRKNQGDFSLSVR VGDQVTHIRIQNSGDFYDLYGGEKFATLTELVEYYTQQQGVLQDRDGTIIHLKYPLNC SDPTSERWYHGHMSGGQAETLLQAKGEPWTFLVRESLSQPGDFVLSVLSDQPKAGPGS PLRVTHIKVMCEGGRYTVGGLETFDSLTDLVEHFKKTGIEEASGAFVYLRQPYYATRV NAADIENRVLELNKKQESEDTAKAGFWEEFESLQKQEVKNLHQRLEGQRPENKGKNRY KNILPFDHSRVILQGRDSNIPGSDYINANYIKNQLLGPDENAKTYIASQGCLEATVND FWQMAWQENSRVIVMTTREVEKGRNKCVPYWPEVGMQRAYGPYSVTNCGEHDTTEYKL RTLQVSPLDNGDLIREIWHYQYLSWPDHGVPSEPGGVLSFLDQINQRQESLPHAGPII VHCSAGIGRTGTIIVIDMLMENISTKGLDCDIDIQKTIQMVRAQRSGMVQTEAQYKFI YVAIAQFIETTKKKLEVLQSQKGQESEYGNITYPPAMKNAHAKASRTSSKHKEDVYEN LHTKNKREEKVKKQRSADKEKSKGSLKRK" repeat_region 166458..166542 /rpt_family="Alu" repeat_region complement(166479..166535) /rpt_family="SVA" repeat_region 166602..166714 /rpt_family="MIR" repeat_region 170811..170890 /rpt_family="MIR" polyA_signal 173303..173308 /gene="HSPTP1CG" polyA_site 173308 /gene="PTP1C" repeat_region complement(173667..173771) /rpt_family="MIR" mRNA complement(join(177363..177721,177920..177925, 178433..178509,179180..179257,179685..179788, 179901..180030,180420..180604,181507..181586, 182205..182289,182426..182561)) /gene="hBAP" /note="alternatively spliced product; sequence of the BAP transcript was obtained from cDNA clone (IMAGE consortium ID # 176864; GenBank Accession Number H45225); first two and a half exons were not part of this cDNA clone; those exons were predicted by sequence homology to the known mouse BAP sequence" /product="B-cell receptor associated protein" gene complement(177363..182561) /gene="hBAP" STS 177363..177566 /gene="hBAP" /db_xref="dbSTS:G07372" polyA_site complement(177381) /gene="hBAP" polyA_signal complement(177381..177386) /gene="hBAP" CDS complement(join(177694..177721,177920..177925, 178433..178509,179180..179257,179685..179788, 179901..180030,180420..180604,181507..181586, 182205..182289,182426..182552)) /gene="hBAP" /note="mouse BAP-37 homolog; related to prohibitin; splice junctions at 179257 and 179685 follow the noncanonical AT-AC rule" /codon_start=1 /product="B-cell receptor associated protein" /db_xref="PID:g1922935" /translation="MAQNLKDLAGRLPAGPRGMGTALKLLLGAGAVAYGVRESVFTVE GGHRAIFFNRIGGVQQDTILAEGLHFRIPWFQYPIIYDIRARPRKISSPTGSKDLQMV NISLRVLSRPNAQELPSMYQRLGLDYEERVLPSIVNEVLKSVVAKFNASQLITQRAQV SLLIRRELTERAKDFSLILDDVAITELSFSREYTAAVEAKQVAQQEAQRAQFLVEKAK QEQRQKIVQAEGEAEAAKMLGEALSKNPGYIKLRKIRAAQNISKTIATSQNRIYLTAD NLVLNLQDESFTRGSDSLIKGKK" repeat_region complement(178767..179056) /rpt_family="Alu" repeat_region 178773..178822 /rpt_family="SVA" repeat_region 178824..179017 /rpt_family="SVA" repeat_region 181627..181915 /rpt_family="Alu" repeat_region complement(181653..181902) /rpt_family="SVA" mRNA complement(join(182426..182561,182205..182289, 181507..181586,180420..180604,179901..180030, 179685..179788,179180..179257,178429..178509, 177363..177925)) /gene="hBAP" /note="alternatively spliced product; sequence of the transcript was obtained from a cDNA clone (IMAGE consortium ID# 135679; GenBank Accession Number R31335); likely represents an alternative splicing at the 3'-end of hBAP" /product="B-cell receptor associated protein" mRNA join(182946..183099,186347..186448,186560..186701, 187098..187156,187237..187386,187704..187982) /gene="C2f" /note="sequence of the transcript corresponding to the C2f gene was obtained by sequencing the cDNA clone (IMAGE consortium ID # 139446; GenBank Accession Number R64505)" gene 182946..187982 /gene="C2f" repeat_region complement(183292..183583) /rpt_family="Alu" repeat_region 183301..183557 /rpt_family="SVA" repeat_region complement(184087..184219) /rpt_family="Alu" repeat_region complement(184859..185137) /rpt_family="Alu" repeat_region complement(185200..185464) /rpt_family="Alu" repeat_region 185245..185823 /rpt_family="SVA" repeat_region complement(185574..185849) /rpt_family="Alu" repeat_region complement(186170..186291) /rpt_family="MER5" CDS join(186569..186701,187098..187156,187237..187386, 187704..187817) /gene="C2f" /note="similar to S. cerevisiae hypothetical protein L9470.5, encoded by GenBank Accession Number S51431, and to S. pombe hypothetical 34.9 kD protein, encoded by GenBank Accession Number Z68198" /codon_start=1 /product="C2f" /db_xref="PID:g1732421" /translation="MLMDSPLNRAGLLQVYIHTQKNVLIEVNPQTRIPRTFDRFCGLM VQLLHKLSVRAADGPQKLLKVIKNPVSDHFPVGCMKVGTSFSIPVVSDVRELVPSSDP IVFVVGAFAHGKVSVEYTEKMVSISNYPLSAALTCAKLTTAFEEVWGVI" STS 187889..187964 /gene="C2f" /db_xref="dbSTS:G27812" polyA_site 187982 /gene="C2f" polyA_site complement(188332) /gene="C3f" mRNA complement(join(188332..188899,189142..189270, 189375..189533,189606..189753,190349..190515, 190611..190697,191479..191587,193012..193190, 193602..193639,193818..193911,194683..194789, 195439..195509)) /gene="C3f" /note="sequence of the transcript was obtained from RT-PCR products and from the cDNA clone (IMAGE consortium ID# 188144; GenBank Accession Number H45806 and H45837)" /product="C3f" gene complement(188332..195509) /gene="C3f" repeat_region 188451..188738 /rpt_family="Alu" repeat_region complement(188477..188696) /rpt_family="SVA" CDS complement(join(189154..189270,189375..189533, 189606..189753,190349..190515,190611..190697, 191479..191587,193012..193190,193602..193639, 193818..193911,194683..194730)) /gene="C3f" /note="similar to S. cerevisiae ORF YOR175c, encoded by GenBank Accession Number Z75083" /codon_start=1 /product="C3f" /db_xref="PID:g1732422" /translation="MGRTITAVLTTFCFQMAYLLAGYYYTATGNYDIKWTMPHCVLTL KLIGLAVDYFDGGKDQNSLSSEQQKYAIRGVPSLLEVAGFSYFYGAFLVGPQFSMNHY MKLVQGELIDIPGKIPNSIIPALKRLSLGLFYLVGYTLLSPHITEDYLLTEDYDNHPF WFRCMYMLIWGKFVLYKYVTCWLVTEGVCILTGLGFNGFEEKGKAKWDACANMKVWLF ETNPRFTGTIASFNINTNAWVARYIFKRLKFLGNKELSQGLSLLFLALWHGLHSGYLV CFQMEFLIVIVERQAARLIQESPTLSKLAAITVLQPFYYLVQQTIHWLFMGYSMTAFC LFTWDKWLKVYKSIYFLGHIFFLSLLFILPYIHKAMVPRKEKLKKME" mRNA complement(join(190194..190515,190611..>191278)) /gene="C3f" /note="sequence of the transcript was obtained from a cDNA clone (IMAGE consortium ID# 128030; GenBank Accession Number R09366); only represents the 3' end of the transcript; likely represents an alternative form of gene C3f" /product="C7f" mRNA 190529..191944 /gene="40871" /note="sequence of the transcript was obtained from the cDNA clone (IMAGE consortium ID# 40871; GenBank Accession Number R56251 and R56334); no identifiable ORF; transcript is within C3f gene but in the opposite transcriptional orientation as C3f" /product="40871" gene 190529..191944 /gene="40871" polyA_signal 191924..191929 /gene="40871" polyA_site 191929 /gene="40871" repeat_region complement(192153..192339) /rpt_family="Alu" repeat_region 192261..192636 /rpt_family="SVA" repeat_region complement(192371..192637) /rpt_family="Alu" repeat_region complement(194127..194202) /rpt_family="Alu" repeat_region complement(194205..194334) /rpt_family="Alu" repeat_region 194212..194319 /rpt_family="SVA" repeat_region complement(195000..195068) /rpt_family="Alu" repeat_region 195877..196004 /rpt_family="Alu" repeat_region complement(195894..195983) /rpt_family="SVA" misc_feature 196121..197116 /note="sequence was obtained by sequencing the cDNA clone (IMAGE consortium ID # 111467; GenBank Accession Number T83233); no identifiable ORF" /note="Region: 111467" repeat_region complement(196606..196876) /rpt_family="Alu" repeat_region 197121..198057 /rpt_family="Alu" repeat_region complement(197265..198025) /rpt_family="SVA" repeat_region complement(198065..198140) /rpt_family="MER28" repeat_region 199783..200068 /rpt_family="Alu" repeat_region complement(199809..200014) /rpt_family="SVA" repeat_region complement(200262..200535) /rpt_family="Alu" repeat_region 200312..200509 /rpt_family="SVA" repeat_region 200837..201451 /rpt_family="Alu" repeat_region complement(201073..201802) /rpt_family="SVA" repeat_region 201589..201862 /rpt_family="Alu" misc_feature 201874..203019 /note="sequence of this transcript was obtained by sequencing the cDNA clone (IMAGE consortium ID #210531; GenBank Accession Number H65919); no identifiable ORF" /note="Region: 210531" repeat_region 202876..202997 /rpt_family="Alu" repeat_region 203062..203270 /rpt_family="Alu" repeat_region complement(203705..203912) /rpt_family="MER2" repeat_region complement(204021..204090) /rpt_family="L1MB7" repeat_region 204097..204375 /rpt_family="Alu" repeat_region complement(204122..204319) /rpt_family="SVA" repeat_region complement(204390..205127) /rpt_family="L1MB7" repeat_region complement(204432..204677) /rpt_family="L1MA9" repeat_region complement(204434..205127) /rpt_family="L1ME2" repeat_region complement(204471..204692) /rpt_family="L1MA10" repeat_region complement(204476..204692) /rpt_family="L1MB3" repeat_region complement(204500..205127) /rpt_family="L1MD2" repeat_region complement(204500..205127) /rpt_family="L1MD1" repeat_region complement(204573..204677) /rpt_family="L1MA2" repeat_region complement(204594..204677) /rpt_family="L1MA5" repeat_region complement(204603..204674) /rpt_family="L1PB1" repeat_region complement(204603..204674) /rpt_family="L1PB3" repeat_region complement(204758..204838) /rpt_family="Alu" repeat_region complement(204873..204937) /rpt_family="L1MB3" repeat_region complement(204873..204937) /rpt_family="L1MA10" repeat_region complement(204951..205053) /rpt_family="Alu" repeat_region complement(205150..205436) /rpt_family="Alu" repeat_region 205215..205465 /rpt_family="SVA" repeat_region complement(205450..205672) /rpt_family="Alu" repeat_region 206141..206419 /rpt_family="Alu" repeat_region complement(206158..206359) /rpt_family="SVA" repeat_region complement(208149..208270) /rpt_family="Alu" STS 208274..208657 /db_xref="dbSTS:Z52714" repeat_region complement(208391..208947) /rpt_family="Alu" repeat_region 208589..209009 /rpt_family="SVA" repeat_region complement(208968..209232) /rpt_family="Alu" repeat_region complement(210868..211157) /rpt_family="Alu" repeat_region 210927..211131 /rpt_family="SVA" repeat_region 212880..213179 /rpt_family="Alu" repeat_region complement(212906..213117) /rpt_family="SVA" gene complement(213527..218627) /gene="C6f" repeat_region 213792..214087 /rpt_family="Alu" repeat_region complement(213818..214026) /rpt_family="SVA" polyA_site complement(213821) /gene="C6f" repeat_region complement(214522..214663) /rpt_family="MIR" repeat_region complement(215106..215959) /rpt_family="L1MB3" repeat_region complement(215107..215959) /rpt_family="L1MB7" repeat_region complement(215270..215959) /rpt_family="L1MA10" repeat_region complement(215270..215959) /rpt_family="L1ME2" repeat_region complement(215272..215942) /rpt_family="L1MA9" repeat_region complement(215321..215959) /rpt_family="L1MC2" repeat_region complement(215323..215959) /rpt_family="L1MD2" repeat_region complement(215326..215959) /rpt_family="L1MD1" repeat_region complement(215332..215959) /rpt_family="L1ME3a" repeat_region complement(215360..215942) /rpt_family="L1MA5" repeat_region complement(215419..215905) /rpt_family="L1PB1" repeat_region complement(215419..215905) /rpt_family="L1PB3" repeat_region complement(215428..215909) /rpt_family="L1" repeat_region complement(215428..215909) /rpt_family="L1PA7" repeat_region complement(215428..215905) /rpt_family="L1MA2" repeat_region complement(215428..215871) /rpt_family="L1PA15" repeat_region complement(215428..215871) /rpt_family="L1PA11" repeat_region complement(215428..215824) /rpt_family="L1PA2" repeat_region complement(216393..216657) /rpt_family="Alu" repeat_region 216445..216643 /rpt_family="SVA" repeat_region complement(217204..217496) /rpt_family="Alu" repeat_region 217259..217919 /rpt_family="SVA" repeat_region complement(217656..217968) /rpt_family="Alu" mRNA complement(join(218280..218627,213527..>213821)) /gene="C6f" /note="sequence of the transcript was obtained from a cDNA clone (IMAGE consortium ID # 113390; GenBank Accession Numbers T78452 and T78519); represents the 3' end of putative C6f gene without any ORF; splice junction at 213821 does not obey the consensus (GT-AG rule)" /product="C6f" STS 218298..218650 /db_xref="dbSTS:G17671" BASE COUNT 53286 a 56778 c 57221 g 55643 t 2 others ORIGIN 1 tgtagcccca gctacttggg aggctgaggc aggagaattg cttgaacccg ggaggcagag 61 gttgcagtga gccaagatca cgccactgca ctctagcctg ggcaacagag tgagactccc 121 tctcaaaaaa aaaaaaaaaa aaaaaaaata tatatataga atgcctttaa tgagcagtaa 181 atctaatttt attaaatctc aacccttggg tacggtgtgt catgaaatgg gaagtagcac 241 acagtactat atgctacaga tgaagtacaa tgctgtcaaa taggggtact tgtgttaatt 301 gttggagtcg caagctgaac tagcgttttc ttttcttttc ctttcttttc ttttcttttc 361 ttttcttttc ttttcttctt ttcaagacag gttctcactc tgtcactcag gctagagtgc 421 agtggtgcaa tcacggttca ctgcagcctc aacttcctgg gctcaagcga tcctcccacc 481 tcggcctcct aaaatgctgg gattataggc atgagccacc actcccagcc ccactttttt 541 cagactggaa aacgcacact cacatgtgca tctttaaatg atcacttggg ctgtggtatg 601 gagaatggcg accagtgagg aggcaggagc tgttgtccga gcaagggatg atattggcat 661 cttggattgg catggtggca gtagtggtag tgcagagtga cttgggtaga ttttggagcc 721 atttagaagg taacatccac aggaactggt aaataaatac gtgggagaag ttgggtgaag 781 ggggtgtcaa agattacacc caatttattt tgcttgggca agttggtgga tggtgagccc 841 ctcactgagt gagaagcctg gagaagcagg tttggagggt ggtagtatgc aggtggtatg 901 catagttggg gatgtgtgtt gagtttgcta tgtccggtga gcttcccagt ggagatgtcc 961 aatgggcaga cggatactca catagagagt tcatggtaga ttcgggctag aggaaagcac 1021 ctgaggcctg gccagagacg cctagaggaa cagagcctgg ttaacagtca ctcctggtgt 1081 ctcagatatt ctctgctcag cccacgccct ctcttccaca ctgggccacc tataaagcct 1141 ccacagatac ccctggggca cccactggac acatgccctc agggccccag agcaaggagc 1201 tgtttgtggg cttaccactg ctgttcccat atgcccccaa ctgcctccca cttctttccc 1261 cacagcctgg tcagacatgg cactaccact aatggaatct ttcttgccat ctttttcttg 1321 ccgcttaaca gtggcagtga cactttgact cctgatttaa gcctgattct gcttaacttt 1381 ttcccttgac tttggcattt tcactttgac atgttccctg agagcctggg gggtggggaa 1441 cccagctcca gctggtgacg tttggggccg gcccaggcct agggtgtgga ggagccttgc 1501 catcgggctt cctgtctctc ttcatttaag cacgactctg cagaaggaac aaagcaccct 1561 ccccactggg ctcctggttg cagagctcca agtcctcaca cagatacgcc tgtttgagaa 1621 gcagcgggca agaaagacgc aagcccagag gtaaggtggt cagactcggc ttccttcccc 1681 ggagctgaga gggaggggaa cgtggggcag atgcacagga atgtgctctg cccagttgtc 1741 tgcccacagc tctggccacc ttctcttgca tttctcttgg aactggtcat gagcagcgat 1801 ttcccactgg aactgtgagc ttccagaggt cagagactgt gctagactcc tctctgcagc 1861 cccagcgtgc acagctcagt gtccagagca atgggtgctc cttagaggag tagtgaccta 1921 aatagcaaga tcagagaggg agtgaagact ggagactatc ccaggctggg aaaggcgtgg 1981 aaggcaacta gtcgtgggca gtggagggga gaagactgga agaggggaaa agaggagaaa 2041 aagagtgaag aaggggaggg aaaaaattag aaagaataaa taaatataag gtgggaggaa 2101 acctataaaa aagaaatgat gaggaaaaac gataaaaaca agaaaaagag caaaagaagt 2161 ggaatctagt ctagagaaaa ttcttgcaaa accaattttc ctaacgggac aatctccaaa 2221 aattgacacc aaatatcata cttgcagcat ttcataagtc gcatgatcta tgtaatagtc 2281 tcttttaact accttttgtg tttctgtgtg tatttttaaa tttttgtcta cttctttgct 2341 cttttaagac ttttgaacaa tgtctaaagg cacttctact ctagttagct ttaaaatgat 2401 tcatgaaata ggaaagctaa aattctcaga aagtgttgaa actgagcttt tgcgtatgaa 2461 ttgccctaaa agttgcagat acatatcttt agataaatat gtcatttaag aaacggattc 2521 gaaaagaatt ggtggtaggg gtctcatgag gccgggagag ttacagaagg atgtgcagcc 2581 cctcctgggc ctggcagggt gtgagggaga gtgaggactc actgtccctc ctgaagggaa 2641 gccctgtgcc atctcagcct ttccgccctc agacctttcc agcccctgag acctcatggc 2701 cttgaagccg tgctatcccc agtgtcctct gctttcccct cataggcttc ttctgggaga 2761 gaggtttcct ggggtacgtt ctgatccctc aaaacggaaa ggccctgttc tcaataattc 2821 aaagatttca ctctgagtgg gatagtgctt cctgaatgcc ctgctcttgt ggtggacatt 2881 tttatcgggg caaggctaag agcagggcct gatgggggaa gtcactgcta cttcacattt 2941 tgaccaataa ttccttgtgc tgtatcagat gctgtaggct gatacaaaat ggccgccgcc 3001 cttaaagtca gatgaaagag cccctgagga cagcgttaga gacactcggg agatgatttc 3061 cctctttcaa tgtgggagga cttacatagg agaggtctat atctagataa aaactcctcc 3121 caccactggt gctagacaga gcttgggggc agccctaggc tctgaccctg gccgtaatgg 3181 cggggtggtg ctgagggcaa ttggctagac caattgtctt gcacgtttta ttttttatta 3241 ttatttttga gacggagtct cgctctcttg cccaggctgg agtgcaatga cgtgatctcc 3301 actcactgca gcctccacct cctgggttca agcgattctc ctgcctcagc ctcttgagta 3361 gctgggatta caggagccca acaccacgcc cggctacttt ttgtattttt agtagagacc 3421 aggtttcact atgtcgggca ggctggcctc aaactcctgg cctcaaatga tccgcccgcc 3481 ttggcctccc aaagtgctgg cattacagac gtgagccacc atactgggcc agtcttgcac 3541 attttagaca ctcaataaat gtttgttgaa tgaaatacct gtgatgggcc gggcgtggtg 3601 gcccacacca gtaatcccag ccctttgaga ggccgaggca ggaggatggc ttgaacctgg 3661 gagtttgaga ccagcctggg caacatggtg aaacccccat ctctacaaac cccacaaaag 3721 ttagctgggc atggtagtgt gtgcctgtgg ttccagctac ttgggaagct gaggtgggag 3781 gattgcttga gcctgggaga cggaggctgc agtgaggcct gactgtgcca ctgcactcca 3841 gcctgggcga gagtgaggcc ctgtctcaaa ataactttga tgaaggtggg gaatcagagg 3901 tagatgggca ggggtctggg tttgcaaggc tctgagtttg agctctgttc tgtacttttg 3961 aggggatgag ggaaggaggg tgggcacggt tcccccgatg tgggtgtctg aggcgaagaa 4021 gaggatggcg gaggttgcag ccaccaacca caagagttcc ttagaggggt cacagtctct 4081 aggaagttta taggaagcta gtcagcagta gagagggtga acgcggtggg gcacatcccg 4141 cggctgggct tgagtgggct gcttgggggt tatggggaga agataaaagt gcctgtggga 4201 ccacagactc tcgctgtggt ggagctgggc cctcttaccc tcccaagcct cgcccctcat 4261 cccatccctg ggggccaggg gtgagggcgg caggaacctc aaggctctga gaaagtgcgt 4321 ggtgtgtgtt gccattttgg tctcttctct ttctcagtct ctctttgcct cactttggat 4381 ctatgctctg tgcatctgtc ttgcttctca gaatttcttc ttttcctctt tttttgtact 4441 accctgcgct ttgtgtgtga ttgtggattg tgtgtgcata gctgttgctt taacaagctg 4501 ctccaggtct ctctctccca catctttccc tcccctcctc ccttgaggct ctgtgcattc 4561 tggggaactc tgtccatttc cctttgtcca tgtgtccctc ccaccctgca gccggctccc 4621 tcacatccac cctgggctgc aggcatgctc ggcaggctcc ccacagatca aagcttgtcc 4681 agggtctgca ttgctgccaa aggccaggag gactgtgtac agaccggaag gagctagagc 4741 ttagtggcag cctgagaggg gaagctgaaa aaggagaaga ggcaaggggc attccagggg 4801 agcccgggag agccagcacg gcctcctggt atatgaggca aagaggaaga cagacacaga 4861 cacagggagc tgcaggctgg gggcataagc tgggggctgg gaagcataga tacagaaatg 4921 cacagatgtg agctgagaag caaggaggga gagagagaga cagaaagaga gagagagacg 4981 tgccagggct tgagggacca gagagccctc ccagcctctc tcggagtgct ggtatacagg 5041 atgctaccgt actagggtaa gacacctctg gggacgctga gtatgggaat caaaggccag 5101 atctctgggg tggcagcgga agcccaaagc accaaagcaa gcatgctgga aacccacagc 5161 ctcctccact tagcagagcc ttggggtgag atgaggcaga acagggagct ggaggcaggg 5221 aggtggctgt ctgcacatac ctcaggacca tggagctggg ggagtcaaaa cagccaccat 5281 atggggaagg gtcaagaatg cctctagtct tccccaggca tcttatcagg gtaagctgaa 5341 tttggacccc agagaaggga tatcgtttat ggagacttcc cctctttcat cccctgctca 5401 ccaaggaccc agtcagtctg ggatggggga cagtgggaac cactctttgg tatgaggcta 5461 ctcctattct actcttctca ctgcagcctt cccctagatg cctcccatct gcatgctaac 5521 ctgtaaacac ttcaccatca ctggtggcta gtctcccttc ctcctttttc agaaacctcc 5581 ttcctgacct cttttcttcc tacttccccg gtttaaacaa aattgtatct atctatcatc 5641 tatctatcta tctatctatc tatctatcta tctatctatc tatctatctt tttttttttt 5701 ttgagacaaa atctcactct gtcgcccagg ctgtagtgca gtggcactac aagtaatccc 5761 aagtagctga gattacaggc gcccgccacc atccccagct aatttttttg tatttttagt 5821 agacacgggg gtttcatgat gttggccagg ctggtctcaa actcctggcc tcaagtgatc 5881 tgcctgcctc ggcctcccaa ggtgctggat tataggtatg agccaccaca cacgacaccc 5941 ggttctatct atctatatct atctatctat ctatctatct atctatctat ctatctatct 6001 atcacctatc tatctaatct atctatatct gtctatctat ctttatgtat ctatcttatc 6061 tattgatcta tctatctttt tttttttttg agacagagtc actctgtcac ccaggctgga 6121 gtgcagtggc acgatctcgg ctcactgcaa cctccgcctc ccgggttcaa gcgattctcc 6181 tacctcagcc tcctcagtag ctgggactac ccaccaccac tcctggctaa tttttgtatt 6241 ttcagtagag atagggtttc actatgttgg ccaggctggt ctccaactcc tgacctaaag 6301 tgatccaccc accttggttt cccaaagtgc tgggattaca ggcgtgagcc accgtgcctg 6361 gacatatatc tatctttttt ttttttgaga tggagtctcg ctctgttgcc caggctggag 6421 tgcagtggcg tgatttcggc tcactgcaac ctccgcctcc cgggttcaag tgattctcct 6481 gcctcagcct cccaagtagc tgagattaca gacgtgcgtc accatgccca gctaattttt 6541 gtatttttag tagagatggg atttcactat gttggccagg ctggtctcgt actcccgacc 6601 tcaggtgatc cacttgcctt ggcctcccaa agtgctggaa ttacaggtgt gagccactgc 6661 atccggcctt atatatctat cttgtctgtc tgactgtcta atctaattca tctattttat 6721 ctgtttatct tatctatcat ctatttatct aatctatctg tctgtatgtc tgtttttttt 6781 tttttttttt tttttttttt tttttttttt gagatagagt cttgctctgt tgccgaggct 6841 ggagtgcggt ggcgcgatct cagctcactg ctgaacctcc gcctcctggg ttctaagcga 6901 ttctcctgcc tcaatctttg gagtagctgg gattacaggc ccgtaccact gtgcccggct 6961 aattttgtat ttttagtaga gaagggtttc accatgttgg tcaggcttgt attgaactcc 7021 tgacctcagg tgatctaccc gcctaagcct cccaaagtgc tgggagtaca ggtgtgagcc 7081 actgtgtctg tccctaaatg tctgtctcta tctatctatc tatctatcta tctatctatc 7141 tatctatcta tctaatctat ctttctgtct aacctaatct attttatcta tcttattcat 7201 catctatcta atctgtctgt atgtttatct aatctattta cctaatctat caatctatca 7261 tctaatctat ctaatctgtc tatctaatct attttatcta tctatctatc tgtccatcca 7321 tctatctacc tacctatctc aagcacctac cacgtattaa gccctggcta cctcctcttc 7381 caggcagatg gagtaactgg aggcagctaa caaagatgga gtcacttttc ttatcttctc 7441 ctaaaccacc gtaagaggac caagccccca caccttctga gtgccccatt cctctccaca 7501 gattgtgtct tagtgcccag caggaaacac agtccacctc ccatggttca agagattgta 7561 gaaagggggt tattcacata ggttaaggga atcaatcaat ttgaagcaca gacactatta 7621 acagcaggaa gagtcctgaa gaagtgaaaa tggtgtttct ggaacccaga gagtgcttgc 7681 actctggata aggggccacc ccacagaagc tgtggagggg cagggctgca ggtgaggatg 7741 aacacacagc tattgacaga aaatatgccc agggcaggga tagagtagga aaaatatccc 7801 agcttctttc ccccaccctt ccatctcatc tctgaaaggc acttcccact ggccagcccc 7861 gactggtgct ggagggcaag agagcctatg agccatgtgt ggctgtcagc cccttggtgg 7921 agagccacag acaggatgga gagtggctgg cagggccccg tggggatgaa cagcttggat 7981 tggggcgact gggcttcatc caggctgggc tggatgtgtg catacacttc agtgacccgt 8041 tttagaaaca gaattaatat ggtgaataga gaaagaagaa atcagtgact ttcgctcctc 8101 catacaattt aatttggctt aagttagcca aagccatacc aagtcctctc tctatgtctc 8161 agctgctgcc aggcttgtgg tggccacaca gctggctaga ctgtcatctc tgtcctcaag 8221 gggctcaagc tagaggagga gagttgagaa accaaatcac tacacacaaa gtagaaggtg 8281 gaacacaccc aggagcatgt caacggggtg ctgtgggact tcagagtagg cagatcgtca 8341 ccaagcttca acggcaaaga tgccactggg ggaaagaagg accaagcttg gaagagagag 8401 taagtctgga ggcaagatct tgtctcacca gcaggggcca ggtccatggt gaccccttcc 8461 ccaggcagtc acctctctga gcccacttta tatcctaggc ctggattcaa agacacttga 8521 gccctgctcc agccttcctt tgaggtgcta tcttggtgcc cttcctataa tcactgctcc 8581 agtcccatgt catctggtcc ccagttacca catcaagctt cccgaagctc cacacagacc 8641 atgccacatc tttaccaaaa aatcagcagt gggtcccctc acctccagga caaagctcca 8701 gctcttcgac ctgcctgtca atatttgcaa tcactgcctg cacaaattag ctgggtgttg 8761 tcatgaaagg atcacttgag cccaggagtt ccaggctgca atgacctatg attgaaccac 8821 tgcactctgg cctgggtgac agagtggatc taaactaaaa ataaaaagat ttacagtcaa 8881 gcctcaaagg cttttcccat accttcttcc accatcacct ccctgagccc tctctttcct 8941 ccgaagcctc ctcgcacatc cctaccacct ttgcacacct cagaatgggg acacctctcc 9001 cctttcctct ccatctaact tatggttttc aaacttgagc gtgatcagtt acctggagat 9061 ttgtgaaaac ccagatgact agacccaccc ccagtttctg attcagcagg tctggggtgg 9121 ggccgaggat ctgcatttct aacaagttcc caggtcatgt tgccactgct actgatccag 9181 gactttggga atcgctcctc taatctacag ctgtccattc cccatggtcc attcagagcc 9241 tctctgccct gcccccacca cccccagtct cgcctgtctg ccaagcgcac aggaaactct 9301 ccttcatcca aaccctggac caacgccttc tgcttggccc actcagaggc cttgtagggt 9361 tggtctgata ttggacagag aaatggccct ctgctctttc tcccctgacc tctctgaagg 9421 gggcctgccc ctccacacct gtgggtattt ctcgcaaggt ggagacaaga gactgagaaa 9481 agaaataaga cacagagaaa gtatagagga ataaaagtgg gcccagggga ccggcgctca 9541 gcaagtgagg acctgcaccg gtgctggtct ctgagttccc tcagtattta ttgatcacta 9601 tctttactat ctccgcgagg ggaatgtggt ggggctatag ggtgaaggtg aggagagggt 9661 cagcagaaaa acacgtgagc aaagactctg tgtcataaat aagtttaagg aaaggtgctg 9721 tgcctggatg tgctagattt atgtttaact ttacacaaac atctcagtgt agtaaagagt 9781 aacagagcag tattgccgcc atgatgtctc gcctccagac ataaggcagt tttctcctct 9841 ctcaaaatag aatgtatgat cggttttaca ccgggtcatt ccattcccag ggacgagcag 9901 gagacagatg ccttcctctt atctcaaccg aatagaggcc ttcctccttc actaatcctc 9961 ctcagcacag accctttacg ggtgtcgggc tggggggctg taaggtcttt cccttcccat 10021 gaggccatat ctcaggctgt ctcagtgggg ggaaacctgg acaataccta ggctttctcg 10081 ggcaggggtt cctgcggcct tccacagtgt attgtgtctc tggttaatag agaacggaga 10141 atggtgatga ctttcaccaa gcacactgcc tgcaagaact tttctttttt tttttttttg 10201 agacagagtc ttgctctgtc gcccaggctg gagtgcagtg gcgcgatctc ggctcactgc 10261 cacctctgcc tcccgggttc acgccattct cctgcctcag cctcccgagt agctgggact 10321 acaggcgccc gccaccacgc ccggctaatt tttttgtatt tttttagtag agatggggtt 10381 tcaccgtgtt caccaggatg gtctcgatct cctgacctca tgatccgccc gccttggcct 10441 cccaaaatgc tgggattaca cgtgtgaggc aagaactttt taaaagtgca tcttgcgcag 10501 ccctagatcc attaaacctt gattcaatac aggacatgtt tttgtgagca cagggttggg 10561 acaaaagtta cagattaaca gcatctcaaa gcagaacaat ttttcttagt acagatcaaa 10621 atggagtttc ttatgtcttc ctttttctac atagacacag taacaatctg atctctcttt 10681 cttttcccca tacctctcac gctgtatcag gccccaattc ttgggaacgt caccttagaa 10741 ctgtcccaca catttctaca gccacttggc tcaggccctt tgctgaccag gatggttgca 10801 gttctgcctt tggtgcctcg cctcctccag ttctttcact cagcagctgc aggggtccac 10861 gtggcaaatc taataatctt cttctctata gaaaatcctc tgctggctct ctagtgccca 10921 ggatccagtc ccagcatctc agcacggcct tcaagcattt ccacgtcctg gcctggctcc 10981 atggtctccc cgccaatttg ccaccttctc catgcatcct tttctgatcc cctcctcact 11041 catcccagca aagaaccccc tcctggcctg agcatagcat ttcgtggtgt gtatctcaga 11101 gcatccagtt aggggtgtgc aagtttactt tgttactggc tgatgttgtg aagtcccaag 11161 ttgttggtgc cgcaaacaaa aaattggaca tgacacacac aaatagcaaa gcagcaaaag 11221 tttattaagc acagtacgat ccactatgga tcaaggatga cctgcgaatg gtatcagcat 11281 cactttgcta tatttcatgg ccttttctat gtgttttttt tctctttttc ctcaagctgc 11341 ctaagcttta gccagcatgt gccttttggt tgacaggtgg gttgcttagt ttcttggcct 11401 ctgtgtgttt acgtgtcatt tccttcccat agttttaagt acatgcatga tatgcactct 11461 gtaggcatga accttaagta gctaattact atacggggtc attttgagga tatcttttct 11521 ctgtagtaca tgtgcatctt tttttgcagt ggtgcaatct tggctcactg caacctcctc 11581 ctccctggtt caagtgattc tcctgcctca gcttcctaag cacctgagac tacaggtgca 11641 tgccaccacg cccggctaat ttttgtattt ttagtagaga tggggtttca ccatgttggc 11701 caggctgatc tcgaactcct gaccgcaagt gatccaccca cctcggcctc ccaaagcact 11761 gggattacag gcatgagcca ccgcacccag cctagtatat gcccatctct taggagctgc 11821 tcctaactgg tttggtttgg atctagccag ccatggggct ccttattcac ttatttatct 11881 tctgtttttg ctcacctgcc tctttctctt gcttctgctc ctactcattc cttccttaat 11941 ccaacctcca attccctctg ctattctcct gcctcaagtt cactaggctg gctgcaaggg 12001 tcctgaggga gaggttgtgt atcgcccctg tatactccag gtccagtaaa tgtttgctga 12061 ctaatgattg gcatttccct caggccctgc catttctgtg ggctcaggtc cctactggct 12121 caggcccctg cctccctcgg caaggccaca atgaaccggg gagtcccttt taggcacttg 12181 cttctggtgc tgcaactggg taagttctca gacctggggt ctcaatgcag atgacgtggg 12241 aggaaaggca aaggtggagg atggggtaga gggggacagc ggcgacattg agacctgact 12301 cctttctttt ccacttagcg ctcctcccag cagccactca gggaaagaaa gtggtgctgg 12361 gcaaaaaagg ggatacagtg gaactgacct gtacagcttc ccagaagaag agcatacaat 12421 tccactggaa aaactccaac cagataaaga ttctgggaaa tcagggctcc ttcttaacta 12481 aaggtagggt tgcctggctc cccatccagg gaggaaaaca cactatggag tgaaagcctt 12541 tggtgtctga gatctggtct tagttaaact ctgggatccc agagggcttg ggttgacaga 12601 aactcagtgg cattcttatc cagagtttct ctacaccaac tgctggtggc ccagggaaag 12661 gtggtatgtg aatttcaata ttttaatatt taatattcat gaacttattt tagtgagttt 12721 tagaacaatc actatcactt aaaacccgtg atttcttgag tattgttgct acagacctat 12781 gtagataata ctttgcacag tgactcatat gtataatcct agcactgtgg gaggctgagg 12841 ccggaggatt gcttgagtcc aggagttcaa gaccagcctg aacaacatag tgagactctg 12901 tctctatgaa aaaaaatata tatatatttt ttttggagac aaggtctagt tctatcaccc 12961 aggctccagt gcagtggtgt gatctcggct cactgcaatc tccacctccc aggctcaagt 13021 catcatccca cctcagcctc ccaagtagct gggactacag gcatgcacca ccatgccagg 13081 ctaatttttg tattttttat agagacaggg tttcaccatg ttggccaggc tggtctcgaa 13141 ctcatgagct caagtgatcc actcaccttg gcctctcaga gtgctggaat tacaggtgtg 13201 tgtcactatg cctagccaaa aaaaattttt ttaattaaaa aaaaaaaggc cggctgtagt 13261 ggctcacacc tgtaatccag aactttggga gtttgaggtg ggcagatcac cggnggtcag 13321 gagttcaaga ccagtctggc caacatggtg aaacccggtc tctactaaaa atacaaaaat 13381 tagccaggtg tgggggtgca gtcctgtact tccagctact caggaggctg aggcaggaga 13441 ctcgcttgaa cctgggaggc aaaggctgca gtgagctgag attgcaccac tgcactccag 13501 cctgggtgac agagcaagac ttcatctcaa aaaaaaaaaa aaagctgcan atttattatt 13561 attattatta gtttatttat ttattttttt gagacagagt ctcgttctgt cgcccaggct 13621 ggagtgcggt ggcgtgatct tggctcattg caacctccac ctcccgggtt caagtgattc 13681 tcctgcctca gcctcccgag tagctgggac tacaggcgta tgccaccatg cctggctaat 13741 tttttgtact tttagtagag acagagtttc acggtgttag ccaggctggt cttgatctcc 13801 tgacctcgtg atttaccctc cttggcctcc caaagtgctg ggattacagg cgtgagtcac 13861 tgtgcccggc ccagaatcat ttttttctac tttttttttt ttgaggcaaa ctctcgatct 13921 gttgcccagg ctggagtgca gtgggcatga tcttggctca ctgcaagctc tgcctcccag 13981 gttcaagcaa ttctcctgcc tcagcctcct gagtagctgg gactacaggc gtgtgccacc 14041 atgcccggct aatttgcgta tttttagtag agaccggttt tcatcatatt ggccaggctg 14101 gtcttgaact cctgacctca agtgattctc ccaccttagc ctcccaaagt gctgggatta 14161 caggcatgag ctactgcact tggccttttc tcctggtttt aaaactatta tatgctcatt 14221 acaaaatatt tggtcaatga agaaaagaat atggaagaaa atcaaatgca tgcatacttc 14281 tatcactcag agatatcctc tgctaacatt ttgattgatt ttcttccaat cttttttttt 14341 ttttttcttt ttgagacagg gtctcactct gctgcccagg ctggagtaca gtggcatgac 14401 cacaacacat cacagcctca agtgatcttc ccacttcagc cttcccagta gctgggacta 14461 caggtgcacg ccaccatgtt cacctaattt tttacttttt gtagagatga gacttcacca 14521 tgttgctcag gctggtcttg aattcctagg ctcaagtgat cttcccgctt tggcctccca 14581 aagtgctggg attataggta tgagccactg catgtggcct attttcttcc actgttgttc 14641 ggcgtggaga atattatata cataattacg taaatgatat catactgtat ataccttttt 14701 tcctactcct tccttaagtt atatcataat gagactacca attattagac tttttttctt 14761 ttttttgaga cggagtctcg gtctgtcacc taggctggag tgcaatggcg cgatctcagc 14821 tcgctgcaac ctctgcctcc caggttcaag caattctgcc tcagcctccc gagtagctgg 14881 gactacagac acgtgccacc atgcccagct aactttttta tttttttatt agagacaggg 14941 ttccaccatg ctagcaggat ggtctcaatc tctcgacttc gtgatcagcc cggcttggcc 15001 tcccaaagtg ctgggagtac aggtgtgagc caccgcactc ggcctagact aactatttaa 15061 agtaatctgg caatgtttaa cgaatacaaa actctaaaac ccttggacct aataatagct 15121 attttggaaa gtctacttga cagaaataaa attgtgaata ttcttttttg ttgttttttt 15181 gagacagagt ctcatttgga cgcctaggct ggagtgcagt ggcatgatct cggctaactg 15241 caacctccac ctcctgggtt caagtgattc tcctgcctca gcctcctgag cagctgggat 15301 tacaggtgtg caccaccatg tctggctaat ttttgcattt ttagtagatg gggtttcacc 15361 atgttgacca gggtggtctg gaacttctac cctcaagtga tctacccacc ttggcctccc 15421 aaagtgctgg gattacaggt gtgagccacc acgcctgacc agtgaacact taataatatc 15481 tatggaaagg tgttattata agaattgctt gtggggccgg gcgtggtggc tcacgcctgt 15541 aatcccagca ctttgggagg ctgtggcagg cggatcacga ggtcaggaga tcaagatcat 15601 cctggctaac acggtgaaac cccgtctcta ctaaaaatac caaaaaatta gccaggcgtg 15661 gtggcgggca cttgtaatcc cagctatcca ggaggctgag gcaggagaat tgcgtgaacc 15721 caggaggcgg aggtcgcagt gagctgagac cgtgccattg cactccagcc tgagtgacag 15781 agtgagactc catcacaaaa aataaataaa taaataaata aaatataaat aagtaaataa 15841 aggtcaggag tggtggctca cgcctgtaat cccagcactt tgggaggccg aggtggacag 15901 atcatgaggt catgagatca agaccatcct ggctaacaca gtgaaaccct gcctctacta 15961 aaaatacaaa aagtcatcca ggtgtggtgg cacacaccta tagtcccagc tacttgggag 16021 gctgaggcag gagaatcact tgaacccagg aggcagaggt tgcagtgagc tgagatcgcg 16081 ccactacact ccagcctagg cgacagagca agactctgtc tcaaaataaa taaataaata 16141 aataaataaa taaataaata aataaaataa aaagcacaca cacacacaca cacacacaca 16201 cacaatgcaa aagacccacc ctactacaac taacattata tttaatggtg aaaaactgaa 16261 ttctttctcc ctaagtgcag gaataagaca aagatgtctg ctcttactac tcttattcaa 16321 cataatactg caatcccttg ccagtgcaat aaggcaagaa aaatgaaata aaaggaaaac 16381 tgatcagaaa gaaagaaata aaactgttcc tatttgtgga tgacatgatt acatagaaaa 16441 tctcaaagaa tctgtaagaa acttcttaga attaataaat gaattcatca aggttgcaga 16501 atataagata aacataaaaa atctattgta tttctatata ttagcaagga acatgtgtac 16561 acagaaatta aaactacaat accatttata attgctcaaa aaggccaggc atggtggctc 16621 acacctgtaa ttcctgcact ttgggaggcc aaggtgggaa gattgcttaa gcccaggagt 16681 tcaagaccag cccgggcaac atagtgagac cttgtctcta caaaaagtaa aaaattagct 16741 gagcatggcc gggtgcagtg gctcactcct gtaaccccaa cactttggga ggctgaggcg 16801 ggcggatcat gaggtcagga gatcgagacc atcctggcta acacggtgaa accctgtctc 16861 tactaaaaac acaaaaaatt agctggatgt ggtggcaggc gcctgtagac ccagctactc 16921 gggaagctga ggcaggagaa tggcgtgaac ctgggaggcg gagcttgcag tgagctgaga 16981 ttgtgccact gcactccagc ctgggtgaca cagtgagact acgtctcaaa aaaaaaaaaa 17041 aaaaaattag ctgagcatta tggtgtatgc ctgtagtccc agctactggg gaggctgagg 17101 tgggaggatt gcttgagccc taggagggca aggctgcagt gagccatgat cacaccactg 17161 ctttccagcc tcggtaggag agcaagaccc tatctcaaaa aaaaaaaaaa aaaaaaaaga 17221 aaagaaaaga aaagaaaaga aaaagaaaga gagaaagaaa tacttaggtg taaatctaaa 17281 aaacatgcgt agggccaggt gcagtggctc atgcctgtaa tcccagcact ttgggaagtt 17341 gaggctggcg gatcacttga agtcgggagt ttgagaccag cctggccaac atggtgaaac 17401 cccgtctcta ctaaaaatgc aaaaattagg caggtgttgt ggcgcatgcc tgatcccagc 17461 tactttggag gctgaggcag gagaattgct tcaacccggg aggcagaggt tgcagtgagc 17521 caagactgtt ccactgcact ccagcctggg caacagagta agagtctgtc tcccgaaaaa 17581 aaaaaaaaga aaaaagaaag cattgaattg tatgctaaaa actacacgat gctgattaaa 17641 gaagtcaaag aagatctaaa tatatggaga gacatgctgt actcatggat tgatggattg 17701 gaagactcaa cataagacag atatcaattt tccccaaatt aatatacaag tttaatccaa 17761 ttcctataaa aataccagca agattttttg tagatataaa caagttggcc aggtgtagtg 17821 gcttacacct gtaatcctag cactttggga ggctgaggtg ggaagatcgc ttgagcccag 17881 gtgttcacga ctgcagtgag ctatgattgt gtcactgcat tccagctggc actccagcct 17941 aagtgacaaa gggagaccct gtctcaaaaa caaaaacaaa accaaaataa ttttgctctg 18001 caaaatccct attaagaaga agaaaagagg ctgggcacag tggctcaccg ctgtaatccc 18061 agcacgttgg gaggctgagg caggctgatc acttcagccc agaagtttga gatcagcctg 18121 ggcaacatga ggaaaccccg tctctaccaa aaaaaaaaaa aggtacatac acacacacac 18181 acacacacac acacatacac aagtatatac acatatatat acacatacag gtgaatagat 18241 gtatatacat ctatttattg tgaatataca tctatacaca cacgtgtgtg tacacatata 18301 tttaaaattt atttttattt atttatttat ttttgagaca gagtcttgct ctgtcaccca 18361 ggctgggtgc acctgtattc ccaacgacac aggaggctga ggtgggagaa tcactgagcc 18421 agggaggcag aggttgcagt gagccaagat gttgcctggt tgcctgggca acagagcgag 18481 accctatatc aaaaaagaag aataataaga aaagacagtt tacagaatat aagaaaatat 18541 attcacaatc cacatactta gcaaaggact ggtatctaga atatgataaa caactctcaa 18601 aactcaaaac caaaaaaatg aacaattcaa ttagaaaaca ggccgaaaag gacatacagt 18661 tggcaaataa gcacatgaaa agttgttcaa catcattaat cattagggat atgtacatta 18721 aaaccacaat aggctatcac taaacctatc agaatggcta aatacaaaat tggaacacca 18781 ccaaatgctg atgaggatgt ggagaaactg ggtcattctt ccaatattgg tgggaggcta 18841 aaatggcaaa gccactctgg aaaacagttt gatagtttct tataaaacaa aacatgcggc 18901 cgggcgcggt agctcacgcc tgtaatccca gcactttggg aggccgaggc gggtggatca 18961 cgaggtcagg agatcgagac catcctggct aacacggtga aaccctgtct ctactaaaaa 19021 tacaaaaaat tagccgggcg tggtggcggg cgcctgtagt cccagctact cgggaggctg 19081 aggcaggaga atggtgtgaa cccgggaggc ggagcttgca gtgagccgag atcgcgccat 19141 tgcactccaa cctgggagac ggagggagac tccgtctcaa aaaaacaaaa acaaacaaac 19201 aaaaaacatg caacaatcca gcaatattgc acccctaggc atttatccta gagcaatgaa 19261 gacttatgcc cacacaaaaa gctgcacaca aatgttcata gcagctttat tcatggtagc 19321 caacaattag aaacaatcta gatgtccttc aactggtgaa tgattacatc cataccacga 19381 aatacttttc agcaataaaa aggatgaatc atagtacaca ccacaacctg gatgaatctc 19441 cagggaatta tgctgagtga aaaaaagcca atctcaaaag gtaatatact gtattaatcc 19501 atttatataa cattcttaaa ataactaatt atagaaatgg agaacagatg agtgattgcc 19561 aggggttaag gggctcaggg atggggaggg gaaggggtat ggctacaaaa agcaacaacc 19621 ttatggcgcc ggaaatgttc tgtattctga ttgtgtcaat gtgagcatac tggttgagat 19681 atagtgctac agttttgcaa gttattacca tcagagtaaa ctggatagag ggcacatagg 19741 atttctctgt attacttctt acaactgcaa gtgaatctac aattatctca aaataataag 19801 tttagtttaa tgctaggcgt ggtggctcac atctgtaatc tcagctcttt gggaggctga 19861 gacgggtgga tggcttgagt ccaggagttc gagaccagcc tggccaacat ggcaaaaccg 19921 gtctctacta aaaatacaaa aattagctgg gcgtggtggc aagtgcctgt agtcccagct 19981 actcgggagg ctgaggcagg agaattgctt gaacccggga ggtggaggtt gcagtgagcc 20041 gagatcacgc cactacactg tagcttgggc gacagagtga ggctctttct caaaaaaaaa 20101 aaaaaaaaaa aaaaagcagg caggcagggc caggaaagcg tataattttt gtagttcaaa 20161 tgactaacct aaaaagtgaa gattggccag gcgcagtggc tcacgcctgt aatcccagca 20221 ctttgggagg ccaaggcggg tggatcacga ggtcaggaga ttgagccact ctggctaaca 20281 cagtgaaacc ccgtctctac taaaatacaa aaaattagct gggcgtggtg gcacccgcct 20341 gtagttgcag ctacttggga ggctgaggca ggagaatcac ttgaacccag gaggcgaagt 20401 tgcagcgagc cgagatcaca ctactgcact ccagcctggg tgacaaagtg agattctgtc 20461 tcaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaagtgaag tttacctttt tttttaaatt 20521 tttcttcttt tccttcccta ctttgtgaga taattttctt ctttttaaaa agccaagagc 20581 ttacttctgt aagtaaagat tatcttaaga caacttagaa atgtatatta ttagtatttt 20641 ctatttcatt gtaagttatt tgtaaatatt ggttttggtg ctaacctaga attccatcaa 20701 attaattgtc ccctaatata tggccattat cattttgtct aacattgtat cctattaaca 20761 atgctgtaag tattattttt gtagctaaat tatggtttgc attttaaaat tattgtttta 20821 aggataaagt tccagaaatg aaattaagga tatgaacttt ttgagcacat cttgtcagca 20881 ctgagtagta ttatttaaaa cttttggggg gggcaatttt ataattgaaa aatatatcat 20941 tgttttaatt tgcatttctt tcactgccta tgagattaaa acaatgcact actttccaaa 21001 aattcttaag tcttttgtgt tgatgctttt gttctgtttc tatggatctc atcttccttc 21061 agaacagctc cccttcccaa cttcctgatt tctaacaata acagtatcac cctccttgtt 21121 ctcccaattt ctgaaacaca gagtcatgtt tttttctctg cttcaatccc tggtttccta 21181 tcgtcatcaa ttatgacctt tccttgcttt gaaagtgttt tgggccgggc atgatgtctg 21241 ccacctattg taatcctagc actttgggag gctgaggcgg ctggatgact tgacctgagg 21301 atttcgagac cagcctgggc aacagggcga aacctcgtct ctacaaaaaa tacaaaagtt 21361 agtcgggagt ggtggcacat gcttgtagtc ccagttactt ggggggctga ggtggcagga 21421 tctcttgagc ccacgaggta gatgttgcag tgagccgtga ttgcgccact gcaccccagc 21481 ctaggtgaca gagtgagacc ctgtctcaaa aaaaaaaaaa tgttctagtt tcttcctctt 21541 ctttgttccc atgggaatgc caccatcacc agccaaggct cacatacctc ccacctggat 21601 tacagtgagc ttccaggtaa tttggtctgc tactagtctc gcctacttgg atttcccttc 21661 cccctgctgc agcattgcct tccaaagcca tgctttgcac atgccacatc ctagcccatt 21721 agactaagcc tagaagcctc tgcaggacgt tcaccctctc agcgccactg ctcagtttcc 21781 cagtggaaac ctctgcaccc aggaggtttc cccacagctt gcctgtgctg cctctctgga 21841 gcttttctcc cttcctgtaa tgtccttgct gctccccgtc tctagtccat tgcctatacc 21901 tctttttttt tttttttttg agatggagtc tctctctctc atccaggctg gagtgcagtg 21961 gcgcgatctc ggctcactgc aacctttgtc tcctgggttc aagggattct cctgcctcag 22021 cctcccgagt aactgggatt acaggcgtgc accaccattc ctggctaatt tttgtatttt 22081 tagtaaagac tgggtttcac catgttggcc aggctggtct tgaactcctg ccctcaggtg 22141 atccacctgc ctcggcctcc cagagtgctg ggattacagg cgtgagccac cgcacctgcc 22201 acaggcccat acctctttta agtcttcatt caataccagt tgtcccatga atttgtccca 22261 gactcactca tatgcttaga cctttcatat tatcttgcca tagctttttc aaagtatggg 22321 acagcatgga caagcaggcc atggttttct tttgaagaga agcaaggagg cagagttatt 22381 ttaggaggag ggttatacat ttcattttga accaattgcg tttggggtga tggcaggata 22441 ttaacataaa cttatttctt ggaccattgg aaatgtgtgc ctagaactga ggagagaggt 22501 cagggctggc agtaacaact tggccacaat ctgcagagct gactggggat gaggtggaat 22561 ttagaatgtc tgtagaaacg gggaagagaa ccaaagacag agtctgggac aacacctaaa 22621 tgtagatgtc agagcaagag ttcaagacga agaaaaacga atcatactta gaaatggagg 22681 ggaggaacaa aagaggcgga gcaaagtggg gcagaaccag agtaggccac gcttttaaga 22741 agtttggtaa aggaactgtg aaaggaatgt agttgaattt cagggtaagc tggggaatta 22801 aagcagtgtg tagatccagg gcaaacagca agtagggcag gaaccactga aggaacaaat 22861 aaagggggag gttgggtcca ggttgtcttg agtagggaag tttttttaaa aagtgtgaaa 22921 ctgaaggtgt ggggtggatt gggtgcctgc cgtgctctga ggaagcttgg ggcaactgtg 22981 tgctgaggct gtgaggttgt ctggaagggg ctcctggaca gtaagagctg agcagtgggg 23041 aagaggactg tgtggtctgg aagaggagag aaaggagagt gagtgactga actggtatcc 23101 aggctcccac accaaggcag aaagagggag aggacctggg catctcaggg aggcagaggc 23161 agtaccaagc agggtgagag gctttagtct tagccacctt tgccccattc ctccaaatat 23221 acattctaag taaaaacaaa acaaaacaga actgtttgct atgtaaattt agcttctaaa 23281 gccctgttct acagagattt tggagcttcc actgcaccca gaaaatgcac agctaaagag 23341 aaaacttccc ttggtgatgg ttattagatt ttacaagaag aggccaaagg agacacatac 23401 ttatgccaga agaactttcc agagatagca ttgcatagcg aaatagcctg aattattttt 23461 attttttaaa acattttttc ttttcttttt tcttttcttt ttcttttttt tttttttttt 23521 ttgagacaga gtctcactct gtcacccagg ctggagtgca gtggcgtgat cttggctcac 23581 tgcaatctcc acctcccggg ttcaagccat tctcctccct cagcctccca agtagctggg 23641 attacaggca tgcgtcacta tgctctggct aatttttttt ttcttttttt ttttggtatt 23701 tttagtagag atggggtttc accatgttgg ccaggctggt cttgaactcc tgacctcaag 23761 tgatccaccg ccttggcctc ccaaagtgct gggatttcag gcgtgagcca ccgcacccgg 23821 ccaaaaattt cttttcttta agatgaggcc tcactctgtt gcccaggctg gagtgcagtg 23881 ttacaatcat agctcactgt aactttgaac tcctgggctc aagtgatcct cctgcttcag 23941 cctctcaagt agctgggatt acaggcatgt gccaccacac ccagctaatt ttttttaaaa 24001 taattttttt tagagacgag ggtctcgatt ggctgcctag gttggtccca gactcctgac 24061 gggctgcatt ttaatcctag ctccaccact tacgggagtc aaaattcaaa agatagaaaa 24121 gggcatatag gctgggtgca gtggctcaca cctgcaatcc cagcaatttg ggaggctgag 24181 gtgggcgggt tgcttgaggt caggagttcg agatcagcct gggcaacatg gcaaaacttg 24241 tatctactaa aaatacaaaa attagccaga tgtggtggtg tacacctgta atcccagcta 24301 ctccgaaggc tgaggcaaga gaatcccttg aactcaggag gcagaggtta caatgagcag 24361 agatcgaaca ctcgactcca taaaaacaaa caaacaaaaa aagaaagcag gctgggtgtg 24421 gtggctcacg cctgtaaccc cagcacttcg ggaggccaag gcgagcggat cacctgaggt 24481 tgggcattcg agaccagcct gaccaacaag gagaaaccct gtctctactg aaaatacaaa 24541 attagccggg cttggttgcg catgcctgta atctcagcta ctcgggaggc agaggcaaga 24601 taattgcttg aacccgggag gcggaggttg cggtgagcca agatcatgcc attgcactcc 24661 aacctgggca acaatagcga aactccatct caaaaaaaaa aaagcaaagg gcatatagtg 24721 aaaagctttc ttcctacaca tgagtattca cttcctcttc ctagaggcaa ccaaggttat 24781 ttttgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt tttgggacag tctcactctc 24841 tcaccaaggc tggaatgcag tggtgcgatc tcactgcaaa ctctgcctcc cagtctcaag 24901 cgatcttgtg cctcagcctc ccagtttttt tttcttttaa atggggtctc attctgtcgc 24961 ccagggtgga gtgcagtggc atgatcatag ctcactgcag cctcgacctc ctgggtcagg 25021 ttatcctccc acctcagcct ccggcatagc tggggctact ggcatgcacc accacactca 25081 gttaattttt tttctttttt gagacagagt ctcactctgt cacctagact ggagtgcagt 25141 ggtgccatct catttgtttc actgcaacct ttgacttctg ggctcaagtg attctcccac 25201 ctcagcctcc caaggcggct aattaaaaaa aatttttttt tttttttttt tagagatggg 25261 gtttcgccat gttgcccagg ctgatctcga actcctgggc acaaacaatc tttccacctc 25321 gatctttcaa agagctggga tgagagattt ccaccatgcc tggcctcatt ttctttttta 25381 attttttttt agacattata gctcttttta atggcctcat tttcttatgt ttaattcgag 25441 aattattctt ttcatataca aagaatatat tttctccacc tttaaaaaca aatagtagac 25501 tgtttaacat ctcgctttat tcagttagtg atgtttctta gatacgggtc caaattagta 25561 cacaaagcac ttcctcattc ctctcttacg gctgcatagc agtccactga atgggtgagc 25621 tatgatctat ttaacctatt ctttattgat ggacatttgg ttttgtatat acatttgtaa 25681 ttctgtatag attacaaatc accatccaaa gaaattgtac tggtttattc tcctacaatg 25741 tgtgagagtt gggtaattac ttaatctcaa tatgtgagag tttaggcagt tacctaatct 25801 ctctgagtct cagtttctct atctgcaaaa taaacaaaac agtgttgaca gtatctattt 25861 ctcggaatta ttgtggagat tactgagatg atgcctgtaa agtatttggc atgtaggagt 25921 tggtgctctc caaataagga tatgatttta tttgtatttg tgagctactg tcccagccag 25981 gtaaatggat atgatgagac ctccttgcca gaccgggttt ctctgattag aacgaggagc 26041 agatgttgca ggaaattagc aactgatatc agaagagccg tgggcattct cttgccagag 26101 gtgccctgtc tccagggcgc ctcagtcccc ccccatatgt cttctgctcc caggtccatc 26161 caagctgaat gatcgcgctg actcaagaag aagcctttgg gaccaaggaa actttcccct 26221 gatcatcaag aatcttaaga tagaagactc agatacttac atctgtgaag tggaggacca 26281 gaaggaggag gtgcaattgc tagtgttcgg atgtgagtgg ggcaggtggg gatgaggata 26341 cctcctgcct ggttcccttc cccactactc ccacccctgc accaaatcca gcctgagctg 26401 gtgataccgc agcagcccca agaggaccag gctgtcaaac tggcctccaa atgtcttaaa 26461 acccttcttg atcaggtgag ggatgctggt gggcggagga gggaagaggc cttgggaaaa 26521 ggaaagaaaa gggaaggagg caagggaagg agggagagag actggggaag agaggatgag 26581 gggagaggag gaaagaagag agagaggagg ggagagggaa accctatctt ggctgggggt 26641 gcgcagctgg gtgctgggag gaaggagatg ttgggacggc gataatggag agatgttgtt 26701 ggtttcctgt tgtctgccct tctccttggg gatggtatgt gtgtgacaca gctggccttt 26761 ccctccacag tgactgccaa ctctgacacc cacctgcttc aggggcagag cctgaccctg 26821 accttggaga gcccccctgg tagtagcccc tcagtgcaat gtaggagtcc aaggggtaaa 26881 aacatacagg gggggaagac cctctccgtg tctcagctgg agctccagga tagtggcacc 26941 tggacatgca ctgtcttgca gaaccagaag aaggtggagt tcaaaataga catcgtggtg 27001 ctaggtaagg gaagcccctc ttcgcgcagt ctcctccctg ccccaggggc tgacagcccc 27061 tccctctgct ctgactgccc tgtttctggt tctggtgctg ggaggtcagg agtggagaag 27121 actaggtccc ctagagctga ggcctgtctt gaaggactca ctggggccct catcctcagg 27181 gggctgattg gcagccaccc ctcagtgtgg tggacatgga gaaaggaaag gctggggaag 27241 gtaaggatgc tagaggcccg agtctccttt ggaggcccca aaggaggaat gtcagggagc 27301 ttactttctt tgttgcctca gctccacacc cctaccaagt tggcaaatcc acttactcag 27361 ggacactaac accagtaagc caaccctgat gatgttctat gttgtacctc tggacctcta 27421 agccaggcca ctgtggggag accaaggtcc taccccagat cctgtcccct gggtgcttat 27481 gtgacttaag gtagacataa ggtagtgtgc cagtttagtg catgtacgct gattgaaatc 27541 ctggttctgc cacaaccatg tgaccttggg tgagttacta aacctctctg caccttggtt 27601 tcagcctctg tgaaatgggg atgatgttaa ctgccatagt gactacctcg tattaagttg 27661 aggactgata tacgtaaggc actgaaaatg gtgcctggca cagagtaagc cctagttaag 27721 tgttcgctgt tattttgtga agggtgatga atacgcctct aaggagtgga ggccaaatgg 27781 cttctgtggt ccaggaatcc taaggacagc aaggatcccc tgtggctggg ctgctctgtg 27841 atggcttccg ggaggaggga ggtggcctgc tgtaggaaaa tgctgggtgg aagaagggag 27901 agaaggctgg agaggtagga aggaactgaa gtatctgaag tgacaaggtg ggtgtctgga 27961 ctcgtcgggt ccccttccat ctccctgctg cctccacatg ccaaccccac tcgtgcaccc 28021 tcatcttcct atctcctcac ccagggtctc tcccttccca cctccagctt tccagaaggc 28081 ctccagcata gtctataaga aagaggggga acaggtggag ttctccttcc cactcgcctt 28141 tacagttgaa aagctgacgg gcagtggcga gctgtggtgg caggcggaga gggcttcctc 28201 ctccaagtct tggatcacct ttgacctgaa gaacaaggaa gtgtctgtaa aacgggttac 28261 ccaggaccct aagctccaga tgggcaagaa gctcccgctc cacctcaccc tgccccaggc 28321 cttgcctcag tatgctggct ctggaaacct caccctggcc cttgaagcga aaacaggaaa 28381 gttgcatcag gaagtgaacc tggtggtgat gagaggtgag gggccaggcc agggaggggt 28441 gggcagggga aggagttgga ggggcctggc ccagggctcc ctctgaggca agccaggccc 28501 caagagggga tgcctaggcc ctggtcacct ggatgaagtg agggagggcc ctctgggttt 28561 ggggctggtt ttgaactgag acatccatga gccagcctgg ggctggcttc actgaagatc 28621 cccaaagcac ttgggctaag aaccagggtt ccagttcttc ctataaccaa ccctctgtga 28681 ccctggctaa gccccctccc actgcaggcc tgcttcctga cctgtctaat aaggataatg 28741 aaatctgctc tctgtgtgat agtaatgatg ataatgtcaa gctccaaatt gagtttctgg 28801 cttccttatc tccttatcat cataacgact ctgcaaatag taatggctaa cacttgatgc 28861 tcagcacgtg tcgggcccca tccaaacact ttacatgtat gcctgccttt agtactatct 28921 gggtgcaggt tgttaagtca cttgcccaga gacacacagc tgctaagcag tggagctggg 28981 attcaaatcc aataccactg gaccccaaac gctgtttttc ctcgtaggac tgcatgaaaa 29041 cctgctctaa aaggctaaaa gaaggtcacc catatagagt atcattttaa ttgttctact 29101 gaatctcagg cctttgatct cagcctctcg ttcctctgca gccactcagc tccagaaaaa 29161 tttgacctgt gaggtgtggg gacccacctc ccctaagctg atgctgagtt tgaaactgga 29221 gaacaaggag gcaaaggtct cgaagcggga gaaggcggtg tgggtgctga accctgaggc 29281 ggggatgtgg cagtgtctgc tgagtgactc gggacaggtc ctgctggaat ccaacatcaa 29341 gggtaaggac ccaggttcca aggcctctgc ctcctgggct gcgggacctt cctgtggttg 29401 gcagagacca cccagagtcc cggcctccca atgcctgagt tggggggtta tgggtatggt 29461 gtcctctgtg gtcccagggt gctttggagg ccccaaaagg aggcatagaa gtgatgaagt 29521 gaggtgggtt gtggtcctgg gctctaggct gccagttgta attctacaag ttcccactcc 29581 ctcaggtcta cagcctaggg atggggtcca cctgaccgag agcccccctc cctggctcct 29641 ggcttttctc caaatccaga cgcacttaca cacacactca cacacataca ctctcacaca 29701 catgcacaca ctcccataca ctctcacaca cgcacacaca tcactcacac actcgcacac 29761 actcatacac acactacaca cacatccaca ctcacaccca cactctcacc ccatgtactc 29821 acacccatgc actcacacac tgtcacacac tcatacacac acatgtactc acacatgcat 29881 gcacacacac acttacacat tcacgcacat actctcaccc acacactgtc gtacacgtac 29941 actcacacac ttatacactc acatccgcac acgcattatt cacacatgtg cacacacgcg 30001 cgcgcacatg catgcagtca cgcacacaca ttcatacacg cacacaggca cacattcaca 30061 cacatgcaca cacgcacaca cattcacaca tggactcaca cgcgcacacg cgcgcacaca 30121 cacacattca caccattcac acacgcacac acatgcacac actcacacat gcacacacac 30181 atgcactcac acacacagcc caggagccag accacagctt ctctctctcc agagtgccct 30241 ggatatggat atgcccaaac ataaaaccga ttccccagca ctggcggcct ttgagagccc 30301 ccaggcaccc ctcccctctc ccccaacccc agggtcaaac cagagactgg ccaggaggga 30361 ttgcagggca gtcctcagtc ccctggcccg tggaggaggg cggtgcattg agcacatttc 30421 tctcccttgc agttctgccc acatggtcca ccccggtgca gccaatggcc ctgattgtgc 30481 tggggggcgt cgccggcctc ctgcttttca ttgggctagg catcttcttc tgtgtcaggt 30541 gccggcaccg aagggtgagt aaccccacac ctggtcccca caaggccctc aaacccctga 30601 gtcctctacc aggagatcct gtatatggga actgattttg gcccagctcc ctctgcccac 30661 tcgtaagttc ccttgctgcc ctgtcccaga tcccactcaa gggagagaca ggaaggagca 30721 gagagttaat tccaggatag atggcctggg ccatgtaact gcttctcctg tcgcagcttc 30781 ccccactccc cccaccaagg ggcacctccc ttctggaggc ctgggaccct cgtgactccc 30841 tttcttgtcc ctggacagcg ccaagcagag cggatgtctc agatcaagag actcctcagt 30901 gagaagaaga cctgccagtg tcctcagtaa ggatctggga ggaggggttg agagagggga 30961 aagggggagg gggagggagt tagagaggag ggggaggaag gggagcaaag gggggcagga 31021 agggaggatg gagaggagga aggagttgag gaggaagagc tgggaggggt ggaggtgagg 31081 agatgggggc taaaggggtg tggtggagag gatagagggg tgggaaaaga tggccaggag 31141 ctagaaggag gcagaagtgg gaggatggag ctgaaggagc agcaggccag gaaaggccct 31201 gctggaaagc cactggagct gtgctgcgct ggaaaggcca ttggaggtgc tagaacgcaa 31261 aggggttgca gtggggacag acctgctccc cttcttcttt gttcctgcag ccggtttcag 31321 aagacatgta gccccatttg aggcacgagg ccaggcagat cccacttgca gcctccccag 31381 gtgtctgccc cgcgtttcct gcctgcggac cagatgaatg tagcagatcc ccagcctctg 31441 gcctcctgtt cgcctcctct acaatttgcc attgtttctc ctgggttagg ccccggcttc 31501 actggttgag tgttgctctc tagtttccag aggcttaatc acaccgtcct ccacgccatt 31561 tccttttcct tcaagcctag cccttctctc attatttctc tctgaccctc tccccactgc 31621 tcatttggat cccaggggag tgttcagggc cagccctggc tggcatggag ggtgaggctg 31681 ggtgtctgga agcatggagc atgggactgt tcttttacaa gacaggaccc tgggaccaca 31741 gagggcagga acttgcacaa aatcacacag ccaagccagt caaggatgga tgcagatcca 31801 gaggtttctg gcagccagta cctcctgccc catgctgccc gcttctcacc ctatgtgggt 31861 gggaccacag actcacatcc tgaccttgca caaacagccc ctctggacac agccccatgt 31921 acacggcctc aagggatgtc tcacatcctc tgtctatttg agacttagaa aaatcctaca 31981 aggctggcag tgacagaact aagatgatca tctccagttt atagaccaga accagagctc 32041 agagaggcta gatgattgat taccaagtgc cggactagca agtgctggag tcgggactaa 32101 cccaggtccc ttgtcccaag ttccactgct gcctcttgaa tgcagggaca aatgccacac 32161 ggctctcacc agtggctagt ggtgggtact caatgtgtac ttttgggttc acagaagcac 32221 agcacccatg ggaagggtcc atctcagaga atttacgagc agggatgaag gcctccctgt 32281 ctaaaatccc tccttcatcc cccgctggtg gcagaatctg ttaccagagg acaaagcctt 32341 tggctcttct aatcagagcg caagctggga gcacaggcac tgcaggagag aatgcccagt 32401 gaccagtcac tgaccctgtg cagaacctcc tggaagcgag ctttgctggg agagggggta 32461 gctagcctga gagggaaccc tctaagggac ctcaaaggtg attgtgccag gctctgcgcc 32521 tgccccacac cctcccttac cctcctccag accattcagg acacagggaa atcagggtta 32581 caaatcttct tgatccactt ctctcaggat cccctctctt cctacccttc ctcaccactt 32641 ccctcagtcc caactccttt tccctatttc cttctcctcc tgtctttaaa gcctgcctct 32701 tccaggaaga cccccctatt gctgctgggg ctccccattt gcttactttg catttgtgcc 32761 cactctccac ccctgctccc ctgagctgaa ataaaaatac aataaactta ctataaagat 32821 gctcttcgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgag agagagagag 32881 agagagagac cccttctctg gggaccagga atcttctggt ctttggggtc ggacccagag 32941 gagaggttgc tgggcgtgcc ccaggaagga gatgaagttt ccctggagag ggaaaaattc 33001 tgtggaggct ggctacccac ctccacaacc cacagggctc tgtgtgggcc tgtgtgtgtg 33061 tgaggctgag actctgtaat tacatgtgtg tgatataccg tgggggagag actcagggtt 33121 gtgtgagctc tcctggtgta tgtgtgtggc agagaaaaag gactatgaga aactgtaaca 33181 tgtgtgtgca aattgtatcc ccaacatatg tgtttgtgtt ccaggaggag gaggcagagt 33241 gtgtgtgtct gtgtgcatgc atgcgcccac atccttttag cttgtgcaca accaccccca 33301 actccacaag cagccccagt gtgcaccaca agccccctca ctcccaaacc caaggcccag 33361 ccctcacaca gatacacaca gtgacacacg cttcctcctc cccgaacagc tagctcctca 33421 gtccccaaca acaccagctg cctcttggag cttgtgttcg tgttgaaaac gggaaggggt 33481 gggggcagga catgaaccat ttcaccaatg aagccggttc ccagccatcc tcctcaccgc 33541 cctccctctc agcgccttcc ttgcccctcc tccccgagtc accaccgtcc agttccccac 33601 cctgcactcg tttcccgccc ccccccgccc cccacaaagc tgtcctgcct tccctctggc 33661 ctgggctcac tggcaggacg accccgggag agagggtcct ggagagtgag aggggcagag 33721 accgagtgag ccgtgggctg agcctgggag gccgggactg agggggagga gggaggcccg 33781 ggggaaggga gggagggaga gagggaagga gggaggaggg cgggcgagct ggagccggca 33841 ggcagcggga gcccgagaga gccgcgtcgg gagtgcggtc tccatggcag tgctgggcgc 33901 agccggagag aggtaagacg acccctgcct gccccgtccc ctgtgttgcc tccctttccc 33961 cctcaagctg cctcctctcc cctaggccgg cccctcctcc ccagccacct gctgtttctg 34021 ccctggagaa cctggagggg gacaggaggg aggagccgcc tgaccttacc ccttcacccc 34081 aaagggcgag ggaccccgtc ggccagggaa cctcccagat cccccttccc accttcctct 34141 gatctctccc tgtccttcac ctccctctct ctgtccttca catctttcct gcctccttcc 34201 ctctctgact ccctatctct ttcatcctga gcccaccagc tgggcctggg tggaggtggg 34261 tgggaagggg tccagggctg taggagagga cactgggggc tgctcttggc caagtccctc 34321 cacctccctc tggctggcca ttgccatggc aactagggtg tactgggctc cggagggaag 34381 ccaagggtgg gggtggacca ggacctcttg gggaaagcaa gggtctgagc tgcttccatg 34441 ggaaacagac cctttcactg ctgtgcccca tttttacccc acttacccta aagccattat 34501 gtgcttcctt acagccccac tctgttcagg gtttcacccg attcccttac cccccttccc 34561 agggctcact gaagctcagc atcacgcact aaaactggga ccatccttcc tcccatgaga 34621 gggaaactga ggctctgggc cagggaggaa gagtcaaggt ctccccacgg ggagtcagca 34681 ccttattgaa catttggctt ttcttctcag cctcctattc cctgctccct ccctccctcc 34741 ctcctccagt ccctcctccc tccaaagcct cctctccagg gggtttgagc ttttcctgct 34801 attttcagat actaccactt tgccccctct ttctcttctg ctcaaactct cctccttctc 34861 cctagttccc cctctttcca tttctggatg cttctgatgt cagcggtttc cccgccttca 34921 ggttccctag cacccccttc ggtcccagac tcctttctcc cattttccca gatatgggta 34981 gctggagggt gatcacccag gtttggggaa gtaggggcta agcaccaaga cccctgcatc 35041 caggagaccg gcaggtaggg gaggaaggct gtgatgccat atcccagttc actgtgaacc 35101 ctcaagagga aaccagtctt gtgtctccca tcagtagttc tgaagagaca tttagggact 35161 tgaagatagg ggtggagtag tgctgagcag gggatgttct aagaaggaac ctgggagttc 35221 cagcgggagg aggaagggct aaaggagaca gggaggtggg aataccttct tccctagagg 35281 tctggaagag tgcggctctc atctgtgtgg ggtgtttagc tgtcagagtt aaggcaaagg 35341 aatgttctga atgaccctca gggccttagt aatcagtcat gcctatagtt tcaagaaagg 35401 gggcacagga ggctgattgg agaccagccc aggccctacc tttgccctct ccctcccaca 35461 tctcatcttt cccttgcagc ctgtccaggg ggctgagccc cacccccaaa tccctggggc 35521 atccagaaga ttcctgactg gtcaagaacc agaggcaaaa gagacctgga agtcccagca 35581 tggggaccag aaccccccag ccagcctcat agttgggaaa gtagccagct tgcctgcccc 35641 atcaattgca gggatgctta aggaaggccc cgcccagtat gaaagctgag gattgcctct 35701 gctgaccctc agtctcctcc cctgccctct acatctgccc tcagctgggt ccatcatgca 35761 atgctgagca ctggggtgag cctgggggca gcctgcctgc tgacagggcg aggattgtgg 35821 ggatcatggg agtgtttgtg agtggggctc ctgggtgaga cctagccccc acccccacag 35881 agctcaaggg ggtggggggc tgaggatagg atggctcggg gcggggcggg ggcagaggag 35941 gcctccctgc gctccaacgc attgtcctgg ctggcctgtg ggctcctggc gctgctggcc 36001 aatgcctgga tcatcctcag catctcggcc aagcagcaga agcacaagcc actggagctg 36061 ctgctctgct tcctagcggg cacacacata ctcatggcag ctgtgcccct caccaccttt 36121 gccgtggtgc agctgcgtcg tcaggcttcc tccgactatg actggaacga gagtatctgc 36181 aaggtcttcg tgtccaccta ctacaccctg gcgctggcca cctgcttcac ggtcgcctcc 36241 ctctcctacc atcgcatgtg gatggtgcgc tggcccgtca actaccgcct cagcaacgcc 36301 aagaagcagg cactgcatgc cgtcatgggc atctggatgg tcagcttcat cctctccaca 36361 ctgccctcca ttggctggca caacaacggc gagcgctact atgcccgcgg ctgccagttc 36421 atagtctcca agatcggcct cggctttggc gtttgcttca gcctcttgct acttggggga 36481 attgtcatgg gtctggtctg tgtggccatc accttctacc agacactgtg ggcccggccc 36541 cggagggctc ggcaggcccg gagagtgggg ggtggtgggg ggaccaaagc gggtgggcca 36601 ggggccttgg gtacccggcc agcttttgag gtaccagcca ttgtggtgga ggatgcccga 36661 gggaagcggc ggtcctcgct ggatggctcg gagtctgcca agacatccct gcaggtcacc 36721 aacttggtca gcgccatcgt ctttctctat gactcactca caggggtgcc catcttggtg 36781 agatcggggt cctccccacc tgttctcccc aatatccagg ccccagcatc atcacctgac 36841 ctccaacttc atcagccata cagcagctcc gtcatctctc ctccagttcc atcatccatc 36901 ttcctctcct ctgtccatcc cccactccat ttgtggcccc catcttgatc atctgtcttt 36961 tagttcccac taccctgttt accccagctc cagcatctgc ttcagttccc actcctcagc 37021 tccatcaccg tccttcagtt ccacgacctg tcctccagct ccgtcatcca tcctgcagct 37081 ccacggttca gccgcagacc tctcatccag ccgctgggct ccactctcca ttccctcctt 37141 ccctcgggct gccacatggt ctctgagcac tcttggctgg aaatcccaca gggctcaccc 37201 caccacccca tctagtcact gttcctgggt tcctgtctgt ctctacggtt cctggtccac 37261 ctctgttcct ctccctcctg accgcagggc tttgtctcca gcactctccc cgaccttcct 37321 ctgagtcagt cccttccctc tgttcactcc atctccacct ccgcttcgcc tctgttggag 37381 cacagggatc aggcattact tctgacgtct tgcccaccag ctgcactagg atatggggtt 37441 caggtgggct cctcagctct ggccctgacc cagcccgctc ccacccttcc ccaggtggtg 37501 agcttcttct ccctcaagtc ggactcggcg cccccctgga tggtgctggc tgtgctgtgg 37561 tgctccatgg cacagacgct gctgctgccc tccttcatct ggtcctgcga gcgctaccgc 37621 gccgacgtgc gcacagtgtg ggagcaatgc gtggccatca tgtctgagga ggatggagat 37681 gacggtcaga ggaggggctg ggctttggct ttcctggaca tctgtctatt cccttttctg 37741 ccgctccttc tcagccagag gatgggctgc cctggagtgc cccctactgg acagaatctg 37801 caccaccctg ggctcccctt tccctaagca ggcaccctgc tgtccactca ctgcttcccg 37861 cagcactcag ggctcccttc actgggccta ggctcccctc cagaaaccca cactgcccag 37921 ctctggcact atggggaacc tgagcttccc gagagaggcc ccccagctat agggacggct 37981 cttgggggag tgcaggcaat tgtttcaata acttcatcta tatgtatgaa gcactccact 38041 caggatttag cactgaacgc agcaaatgct agccactctc actgtcctct tggaacatac 38101 atgctaatgg gaactaaagg ggtctctggg agcttataaa ggcagtggtg ggaggccgcg 38161 gtaggcattc cctacctgta accactctgc ccattttctc tcctagatgg gggctgtgac 38221 gactatgcag agggccgagt ttgcaaagtt cgctttgatg ctaacggagc cacaggacca 38281 gggagccggg accccgccca ggtgaagctg ctgcctggaa ggcacatgct cttccctcct 38341 cttgagagag tccactactt acaggtatgg aactggggga tacatctcct acccttggtg 38401 ccgctcatca cttttctcca gtgtctgtta ggcaccccaa tcccctcttc cctagccctg 38461 gctttcaggg caccatagag ctgggtgaaa gaaagctata gcaagagagg catcccttca 38521 gacctcgtgg ggccaggttt gcccaccgga gcaaacgttt tgaaggttaa gaaggtggca 38581 ccatcttccg gagtccccac ttggttcctg cctctgcctc tgtccctgca gcaccttcag 38641 gtcttacccg ctcctcttcc caggtccccc tatcccggcg tctgtcccat gatgagacaa 38701 acatcttctc tacccctcgg gaaccaggct ccttcctgca caagtggtca tcctctgatg 38761 acatccgggt cctcccagcc cagagccggg ccctcggggg tcctcctgag tacctgggac 38821 aaagacacag gttggaggac gaggaggacg aggaagaggc tgaaggtggg gggctggcca 38881 gccttcgcca attcttggag agtggggttc tggggtcagg tgggggaccc ccacggggtc 38941 ctggcttctt ccgggaggag atcaccacct tcatcgatga gacacctctg ccttctccga 39001 ctgcctcacc agggcactct cctcgtcggc cccggccact gggcctctca ccccgccgac 39061 tctcccttgg gtcccctgag agcagagccg ttggacttcc tttgggacta agcgcaggga 39121 gacgctgctc cctgacgggg ggtgaagaaa gtgcaagggc ttggggagga tcctggggcc 39181 caggcaaccc catctttccc cagctgaccc tgtgagccca agcaggcctg ctgaactcag 39241 aggagaaagc ctgagtgagt aacacctcat tctggccgag agtagggcag ctgcctccag 39301 actctgggga gacgggcgct agatttgggg ctcagaaggc cctgctctct cccatccaag 39361 tgaccagatg ccctactcag cttccatcac ccctagcaat atgtattaaa gtctgaagtg 39421 ttgccatgga aacctcagtg tccagtgtac tgcggtggaa aggggcttgg agggtggagt 39481 ggctcacagg ggacacgaca gagcaggggc agcagacaga ggcacagaag gacacatgga 39541 gaaatgatca catgctccac caggcagcct cgcacaggga aggctgggag gagcatatcc 39601 tgctgggggc ggggcttgag gacacacatc ttcgttggtt ggggtatgaa ggatctctgt 39661 tcagatttgt tgaggtggtt ttagagcgtg gctccagccc cagtgcccaa ctctccttct 39721 aacctcaaat ccatgcaggg gctcacttta tgggcaggaa aggttttggg gctgatgcag 39781 tgtcccgggc ctggacacct cctccttgct gcattttaaa gctctttctc caaaggagtg 39841 caggggtgga tgagtgtgtg tgtgggcggg ggtgggagtg caggggtgga tgagtgtgtg 39901 tgtgggggag tgcaggggtg gatgagtgtg tggggggggt gggagtgcag gggtggatga 39961 gtgtgggggg gtgggagtgc aggggtggat gagtgtgggg gggtgggggg gggcgagggg 40021 gggatcggcg tccacactcc tttttctgga cccagaagcg tgtgtgtgga gggaacacag 40081 ccccagtagt tacccccaga cctcaggggc tgaagtcacc gctggggctg aagtcatcct 40141 tcatggatgc ccaggcctgc tccagttccc gccttttgcc ctcctcctcc cagtttctcc 40201 ctcttattcc acactagacc agctttgtcc actcagcact cttctctctc catcttttca 40261 catgggcctc tcccaccatc gggtctcagc cccatcttct ctctctctct ctctctcggc 40321 ctccctcatt atttctccct ctccctccgc ctctccatct ctccttcctc cccctccctc 40381 cccttccccc ctttcctctc cctccctctc ggctcccgca tttcccctcg gctgccggcg 40441 gctccgacat catgctccgg ctcctccggc cgctgctgct actgctgctg ctgcctcccc 40501 cggggtcccc tgagcccccc ggcctgaccc agctgtcccc gggggcgccc ccgcaggccc 40561 ccgacttgct ctacgctgac gggctgcgcg cctacgcggc cggggcttgg gcgccggccg 40621 tggcgctgct gcgggaggcg ctgcggagcc aggcggcgct gggccgggtg cggctggatt 40681 gcggggcgag ctgcgcggcc gatccgggcg ccgcgctccc cgccgtgctt ctcggggccc 40741 cggagcccga ctccgggccg ggacccacgc aggggtcctg ggagcgacag cttctccgtg 40801 cagcgctccg ccgcgcagac tgcctgaccc agtgcgcagc acggaggctg ggccccgggg 40861 gcgcggcgcg cttcgcgtgg ggagcgcgct ccgggacgcc ttccgccgtc gggagcccta 40921 caactacctg cagagggcct attaccaggt ggggagcggg ccgggcagct ccgagggtcc 40981 cagccctcac cacgacgctg tcctactttg cgttgcccag gagtaggcgg aatgctgttg 41041 cccgggcccc gcacggggct ggggagcgta ccgcgggcct ctgcgtagga gctggctaag 41101 cgaccgcgag gacctcttga gatcgcagag gagcagccga gggggagtgc gagcagaatg 41161 ggaatggggc ggggaggtcg tgaggtggga cgggagcaga tcgaagatgg agggacaggg 41221 ccgcctcttc ctaggaatga agcggaggcg gtgggggcga aaccggctct cggacgggaa 41281 gtgtgcgtgg gtgtgtgtgt gtgtcggggg tggtggtgag tgtgaacctt cgcttggggc 41341 aggaggtagc ttcggaaagg aaggagcaga cggaggcaag gttggggtcc tccgaggcca 41401 gacctctgcg ggcggggagg ggacagcacg ccgcagccgc cggtaccgca gcagttgcct 41461 acgtggctgg ggaagcgggg cgccggttgt actcacctca gctcagggtc ctagagacct 41521 gcgggttttg ctggtcgctg aggtctcccc cacttcccca cctcacttaa gccatcactt 41581 ccacctggtc tcccaaattg aggtcctgaa gtcctgagac ccatgtccca cccaactccg 41641 acgtctttag atcccctttc cctcggtgcc agccttctga gagtcccaac gttctggcct 41701 ctaggggatc tgcagttcgg gcggtgggcg gttctgattg gccagtcttc catgaggctc 41761 tggggcaccc agagtgtgtg tctggggtag ggtggggagg ctggccaggg ggcagaggtc 41821 tgccccccgt cccagggctc tgatgccctc ctcccttcgc ctcctcagtt gaagaagctg 41881 gatctggcag ctgcggcagc acacaccttc tttgtagcaa accccatgca cctgcagatg 41941 cgggaggaca tggctaagta cagacgaatg tcgggagttc ggccccagag cttccgggac 42001 ctggagacgc ccccacactg ggtgagaatc ccagcaccac ccctgtcttg ccaccagcct 42061 tttcctggct ggatacacag gcctcactaa actgtgcgtt gtgctgcttg agcatacaga 42121 tggggtaatg atcctcagcg accctggctg ggagtccttc tctaggactt cgtttccttc 42181 ctggatgatc actgaaatct gatgttctaa cattatgatt ctgaggccca cccttattct 42241 tctccacggt gaagagggac atggagtcct tgccccaaat gagaccaaca cccctctcct 42301 ctctacctcc cagtttggca acccctggat cttaggtgac tgactgctcc cttccccaac 42361 aggcagccta tgacactggc ctggagctac tggggcgcca ggaggcagga ctggcactgc 42421 ccaggctaga ggaggctctt caggggagcc tggcccagat ggagagctgc cgtgctgact 42481 gtgaggggcc tgaggagcag cagggggctg aagaagagga ggatggggct gcgagccagg 42541 ggggcctcta tgaggccatt gcaggtaagg gtcccgtgtg tgagggggtg ggtcggtgcc 42601 cagggagggg catgaacaag ctgaatggct tatgggtttc ctctgcagga cactggattc 42661 aggtcctgca gtgccggcaa cgctgtgtgg gggaaacagc cacacgccct ggtcgcagct 42721 tccctgtccc agacttcctt cccaaccagc tgaggcggct acatgaggcc catgctcagg 42781 gtcagttggg gaagggtgga aacggggagt gaagatttgc ctctgctgct atcctgagct 42841 ccatctctct atccctccca tctgtctctg gatttgtagt ttcactgtca gggctgcatg 42901 ggaaccatca cttcacaagg actctaattt gccctccttt ggcgcctgtg acaagctcag 42961 gagatgggct tccttctgcc ttgctgcttc tcaccttcct ttattttccc ccctcttgct 43021 cttctttgaa ctctccagct aaggtatgtt tgcaccagtg tttgaaagaa ccggcagctg 43081 aacttgtctg ccagtgggaa gggggctctt ggagttagct gtctggcctc tggagaccac 43141 cttctccagc actgcctctg ccccaaggat caatgtgctc taagtattca tcccccaacc 43201 cctgaccttg tcgctccctc tccagtgggc aatctgtccc aggctataga aaatgtcctg 43261 agtgtcctgc tcttctaccc ggaggatgag gctgccaaga gggctctgaa ccagtaccag 43321 gcccagctgg gagagccgag acctggcctc ggacccagag aggtaatccc ctctccacgc 43381 tcacctggga ggtagcccca aatcaaacaa atagacctga gaagtaacct ggacccccac 43441 cccccgctgg cctcttaccc gagcactcta gtcacccaaa ggactccaag tccagcatct 43501 gacttccgtc tccactccct gctccccacg gcagcctgga gttccgattc cagccctcag 43561 ccccgtctct cttgccttta cttcactcct gcctgccacc agcccccaaa ctcatgcata 43621 cacacaccca cacactcaca cacactccca ccccctacac cctttccagt ttaagtccag 43681 atgctctgaa gctgctgagg gggaacgttg aacagaggaa ccggggggca tggtgccagg 43741 ttcaaggagc atggagcacc caggcagtgc cctagagccg gcgcttccct gccttcctcc 43801 agctacccct cactgctcat catctcaccc tcaggacatc cagcgcttca tcctccgatc 43861 cctgggggag aagaggcagc tctactatgc catggagcac ctggggacca gcttcaagga 43921 tcctgtgagt cactccactt ctgcccatct cacccctccg cactcccagg agggatggca 43981 ctttggtttg caacagagtt ttccattttt ccaccatgag gcttggagaa gagtctgcca 44041 tcccagtttt acagggacgg agacagtgtg cctgtagtta gggcggcttt ggggtcacgc 44101 aggcctaaat gtcagtcctc ctctgcccca aacatctctg aacttcaact tgcctctgcc 44161 tcaatttcct tacctgtaag ttggaataat cacatctgtt tcacagggct gttgaaacca 44221 catcatttga taacaaaata tagaaagctc tgttactgct gcttttattt ctcttcttgt 44281 agttatgaga ggaaactgcg gggcggaggt tatctatggc taggactagg gcatcaagca 44341 ggcgttctca gtcttcctgg cgctactgat attttggatc tgacattttg gataattctt 44401 tgttggggca gagggcaggg cctgtcctat gccttggagg gtgttcagca gcttccctgg 44461 tctctaccca gtagagatca atagcacatt gacagcccct gagtcatggc aatcaatgtt 44521 tccacacttt gcctaatgac ccctgggggc agaagtgtcc tgcttgagaa cgactggcta 44581 aatgtgtgaa ctctgaagct agtttttttt atttttattt ttatttttgg agacagagtc 44641 tcactctgtc acccaggctg gagggcaatg gtgcgatctt ggctcactgc aacctccacc 44701 tcccaggttc aagcgattct tctgcctcag cctcctgagt agctgggatt acaggtgccc 44761 gccatcttgc ctggctaatt tttgtatttt tagtagagat ggggtttcac catgttggcc 44821 aggctggtct tgaactcctg acttcaggtg atccacccgc ctcggacccc caaagatgct 44881 gggattacag gcgtgagcca ccggaacccg gcctgcagct agtttaaaac ctaacgctgc 44941 tttgggaggc cgaggcaggt ggatcaccag atcaggagtt caagaccagc ctggccaaca 45001 tagtgaaacc ctgtctctac taaaaataca aaaaattagc tgggcgtggt ggcgggcgcc 45061 tataatccca gctacctggg atgctgaggc aggagaatcg cttgaaccca ggaggcggag 45121 gttgcagtga gctgagatcg cgccatagca ctccagcctg ggcgacagtg tgagactccg 45181 tctcaaaaaa acaaaaaaca aaaaacaaca acaaaaaaca actaaccctg tcatttgtag 45241 ctgtgtgatc ttgagcatgt tactttactt ctctgtgtct cagggctctc aattacaaaa 45301 tgagcacttt aatattctgt ttcatgagac tgttgtgaac tttaaaggat aaattccatt 45361 tccatgaata aagctcttag aatagcccct agcagatagt aagtgctata tgagttatag 45421 ttgtacttgt tgttacagtt tgatccactg cccagggctc ctgacattga gtcccgggct 45481 tttcctacct ccgtcttcct ctctggaaac agaagaaact taatgagata tttcagctct 45541 aatgtcaggg attccagggg ctgaccttgg tgggtgatga tgcttttctg gactgtcgca 45601 ggacccctgg acccctgcag ctctcatccc tgaggcactt agagaaaagc tcaggtagga 45661 tatgccccat ggtgggaggg gcttggcccg agctgggagg ctgggtctgg actgtcctgg 45721 gcacccactg agcgcatctc tcacccctga gagaggatca agagaagagg ccttgggacc 45781 atgagcccgt gaagccaaag cccttgacct actggaaggg tgagttcctg ggagggagaa 45841 ggcaggagcc tggagggtct gggggctgcc agctactgct ctgcctgccc ttaggggatg 45901 ctcagccccc tctgctctgt cttttccctg gcagatgtcc ttctcctgga gggtgtgacc 45961 ttgacccagg attccaggca gctgaatggg tcggagcggg cggtgttgga tgggctgctc 46021 accccagccg agtgtggggt gctgctgcag ctggctaagg taggaagacc tgcaagctca 46081 tcagctcgtt caagactctt ggatacaatc acctgttccc ttgctcttgg cctgccccct 46141 tcattctgcc tgtcctgatt atccaaccgg gactgtgctt accactgcct ctcagctgtg 46201 gataagttcc gtctcgtctt aatttagctg acataggtgg atgttcttta caaatgaatt 46261 ttgctagaaa ttgataggaa gaacaaaaca aaaaacatga aggcataaaa atggatgagt 46321 gtgcaggtaa ggattgcttg gttagccaag agtttagtta ctggcttggc tgatggctca 46381 gcagacacgc aggcctttct ttccctccta gtttaggttg aatgactcat actcaccagg 46441 ttgaggctac cattttatga agcttcaacc cttatcttag gttagtgctc attcagtaat 46501 atactcactg agtagctgct atacactaga aactgctagg ttctagggag aggaagatga 46561 ttatagactc tgccctttgg gagctcaagt gtagaggtgg aaacagacaa ggaaatacat 46621 aatataagca ggtggtaagt ttggtaaaag aagtatgttt ttgggctggg tatggtggct 46681 catgcctgta atcccagcaa tttgggaggc cgaggtgggt ggatcacctg aggctgggag 46741 ttcgagacca gcctgaccaa catggagaaa ccctgtctct aataaaaata caaaaaagtt 46801 agccaggcgt ggtggcacat gcctgtaatc ccaggtactc gggaggctga ggcaggagaa 46861 gcgcctgaac ccgggaggcg gaggttgtgg tgagctgatc tcgtgccatt gcactccagc 46921 gtgggcaaca agagcgaaac tccatctcaa aaaaaaaaaa aaaaagtatg tcaaatagaa 46981 gcactgaaat gcagttatag aaggcttgct ggagaggtgg tatttgagct cagtcttaag 47041 gatgagttag tcaggcgatt gggggtggaa tcagtctaag agggagaaga gcatggtgca 47101 ttctgggaac tgcagtcaag cacaggggtg agagcagggc cagctccata gtcattacct 47161 gtgtgatctt gggcaattta ttttaacatc tctgagccta atttgctcat ctataaaatg 47221 aggataataa tagtgttttt gtcatagtgt cgttgtgagg attaaatgag ttagcacatg 47281 ataaagctct tagaacacag aatcatcact ggctgggtgt ggtggctcat gcctgtaatc 47341 ccagcacttt aggaggtcaa ggcaggtgga tcacttgagg tcaggagttc aagaccagcc 47401 tggccaacat ggtgaaaccc ggtctttact aaaaatacaa aaaaattagt gggcattgtg 47461 gcaggtgcca gtaatcccaa ctactgggga ggctgaggca ggagaatcgc ttggacccag 47521 gaggcggagg tcgcagtgag ccgagatcgc accattgcag tccagcctgg gcaacaagat 47581 caaaactccg cctcgaaaaa taaatttttt taaataatta attaattaat taattaatta 47641 aaaaagaaca tagaatcatc agggtgtatg ttaccaagag acagtatagg gaaagaggga 47701 gagagcccac acatgaaggc cgagggacag tgagtgtctt ctgtggcctc ccaaggagtt 47761 tgggctttat ctgtgtatgg tagacagctg ttgcaggatt ttaggacggt tgacttttgc 47821 attcctgtag tacctggaat caggtgcagg gtctggagtc acctgtagga gtggtagagc 47881 cctctagtgg cagtctgtag aatgcagtgg gagaaagacc cggaaaaggg gagtaattga 47941 cgaatgccac cgttggctgg cgaggtggct cacacctgta atcccagcac tttgggaggc 48001 tgaggcaggc gaatggcttg agcccaggag ttccaggcca gcctgagcaa cgtggtaaaa 48061 ccccgtctct acaaaaatac aaaaaattag ctgggcctat gcgcctatat gttcccacct 48121 acttgggagg ctgaggtggg aggacctctg gagcctggga ggtggagatt gcagtgagct 48181 atgatggcgc cactgcactc cagcctgggc gatagagcaa gaccccgtct caaaaacaaa 48241 aaatgaaaaa aaaaaaaaac cccaggttag acggcttggg tgactaggtg tctctcagaa 48301 gtgagatttg tatgcaggcg taactagata gtggtgctta aacaaattca aaaagtagac 48361 aaaaaacggg tgtcagaaaa gcgaataatt ccagtgtaga catccagata ttaggagacc 48421 cccaggtagt gagagtggtc gcggaggtgg ctgggtagtt gtgactcagt atcaaggagt 48481 gggggagcca cggagtccac agtttctccg agagcatgtc aggtgggaag aggtcagaga 48541 gggatagccc gctgatgctg tctccctacc gccagtctgg cctaactgtg gaacttcctc 48601 ttagggtcct gtttccagag ggactcctct attcttggct catcggggat taatgatcaa 48661 gcccttcctg cctagttccc agctctgccc cacttttcca gctgtcccta ccctctcaga 48721 ggccccctta ctgctgtagg aagctctccc gagttctctc cacagtcccc ttagtaagcg 48781 ggattccagc ccgtaccaga gagggacaat cggacagccg tggcggtggg cactctctgg 48841 gaaagagcag cagctcagga agacgagcca caacagaact gtgggagcat ctagcttctt 48901 ctggagagcg ggagggccgg agggagggtc ttgcagccct gcagacagtg aggggctggg 48961 ggagtgactc cacggtaaac tccttcctat ttaggatgca gctggggctg gagccaggtc 49021 tggctatcgt ggtcgccgct cccctcacac cccccatgaa cgcttcgagg ggctcacggt 49081 gcttaaggct gcgcaggtga gcacaggagg cacccgggcc cgtctgatgc ccagacctgg 49141 aggagaggct gtgcaggaag ccgggcccca gggcctgggc ttttgtgaag gagctttctt 49201 gggaccgaga ctacctgata ggacccagcg tggagaaaag ggatgttctg agagctgggg 49261 ccggggttgt tgtcatggag agagaaggag cagtttgggc tgatggacag agctggatgc 49321 tgacagactc caaacccatg ggtggagagc ggcagagttg ggcaggggct agggccgaga 49381 ccaccctccc cttgccttcc tccttttctg ccctgccttt cactgcctgc agctggcccg 49441 ggctgggaca gtgggcagtc agggtgctaa gctgcttctg gaggtgagcg agcgggtgcg 49501 gaccttgacc caggcctact tctccccgga acggcccctg catctgtcct tcacccacct 49561 ggtgtgccgc agcgccatag aaggtacgac agggaccccc cactgctctt ctccaacctc 49621 aggccctgcc cccaggacac tgcccccaag agccccgggg tggacgtgca cggcctgagc 49681 ccacagggtg gcagatgggc acaggggcac agaggtaccc ccagacccct ttgtgtccta 49741 ggagagcaag agcagcgcat ggacctgagt cacccagtgc acgcagacaa ctgcgtcctg 49801 gaccctgaca cgggagagtg ctggcgggag cccccagcct acacctatcg ggactacagg 49861 tgggcagccc ctctagtggt caggcaggtg ggcagacaaa ggtcatccca ctgccggcct 49921 gggcctgggt tggggtctca ctgcctcttg cttttttccc tccccagcgg actcctctac 49981 ctcaacgatg acttccaggg tggggacctg ttcttcacgg agcccaacgc cctcactgtc 50041 acggtgcgtg gagtgggggg tgtgacggtg tgacaaaggg cccagctgtg gggtcaggat 50101 tcaaaacaga aggctccaga ggcaaatgca gggaaatggc cagggctttt aaccctttca 50161 cttccccgcc tccgcagccc tcccgctgct tcctccgtag caaggtctct aagtggccga 50221 atcaacccta ccatcagtat ttgtctcgcc tcctaaagaa gggtgaatga agcacagctt 50281 gcaggtgcct ccaggactcc atggccctat tctagggatt tggggaactt gaaatagtcc 50341 ttcctctttg cagcgaggtg agcacagtgg ctggcgcaca gataccaggg ccgccttgac 50401 tggggctcgg attctgaacc tttctgaggt ttagtttcct agactggggt gataataata 50461 gtacgtattt taaagggtca ttgtaggaag taatttaggt aatgtaccta aaagaatgag 50521 gcagcgcctg gtaatgataa gcaattaatg aatgttaggt gggaaatttc cggacaccag 50581 gggctaggat tgggtaggag gagaggggca gtgattgttc atctgggtgt ggggtgtgaa 50641 aatggttcca atatctgtat ccccttctgc atagagtctg gtactgcatg cacagcccca 50701 gattgcttca agccctggat tcaaagggct gagtgggaaa aatcagggta ccaggggtca 50761 ggaaccaaca ggagaccatc cgtggagacc cctttttcct ttggttccca actcctcttg 50821 catttttgct ctgcaagttg aagctgattt ctcatttgcc ccaagctagt ccctgaatta 50881 aggagtagga agtggtagcg agagagctga tcctctctgt gcccctgtcc ccatcctgct 50941 ctagggagtg agggccaagt tctctgagag agccaatccc tggagctgaa cctgccctca 51001 tccctccagg ctcgggtgcg tcctcgctgt gggcgccttg tggccttcag ctccggtgtc 51061 gagaatcccc atggggtgtg ggccgtgact cggggacggc gctgtgccct ggcactgtgg 51121 cacacgtggg cacctgagca cagggagcag gtaaggagcg gggtaggaag ggatgtggtt 51181 ctcctggtgc gtgaagggtg ggcaaggagc ccccgagaag gctcacagtc ggtgagggtc 51241 aggggctgag ctgacccagg gaaggtgggt gcagggaatg ggccacactc tcctccccaa 51301 ccccaggagt ggatagaagc caaagaactg ctgcaggagt cacaggagga ggaggaagag 51361 gaagaggaag aaatgcccag caaagaccct tccccagagc cccctagccg caggcaccag 51421 agggtccaag acaagactgg aagggcacct cgggttcggg aggagctgtg agtggctgag 51481 ccagctcctt gaggatgtgg ccacttgact tgtggaaggc catcttgatg ccaggacaca 51541 caggaagccc ctgtgtgaca tcaggagcag aacagcaagc tctctgtccc tgcaccccca 51601 ccatcttggg gacctacaag ggcctggact cagaggacag tgcacaggct agcctggagc 51661 tcaccaggcc tggggagctg ggacggggcc ccgctgccgg acctgcagcc ctggacagat 51721 ggggaacact gtgcctccct gaacagaaat ggcaggggag gaggctgatg ctttaaatga 51781 agaggatggt ggggttggga ggtataaccc tgctcctctc tcccagtctg tgcaataaag 51841 gtcgtgaaga tctctcagcc aggggccagt caagtgtatc acagattctg tgatggctgg 51901 aggtgggggt ggctctggct ggcctgggag gagacaggtg tctttgggcc tcccagaggc 51961 ggggcgcctg gagcctgtga tttgtcaggg aaatctcagc cacctgctcc cctcatgcaa 52021 atgacccaag cccctggcag ctgctcctta ggcctccaag tgtaagccct gtcttccctc 52081 tctgagcttt gctgagactt agggccgcct gctttctctc tgacccctgc tttcacagcc 52141 tcgagagggc aaggaggggg ccagacccca gcctgggagc agcgtgcggg ccttccctcc 52201 acttgaaatc cgttgcccgc ccacaatagg ggcagacctg tccatccttc tctgtgggtc 52261 ccctgtacct ttctccccca acaggatcag acccagaggc agctggttgg ggtttgtcga 52321 gaagaaggat tatccagatc agtcctttct aatctcagct cctgcctgta ccctcccata 52381 ctcaccaaac cctcttcccc accaccctga gctgaggagc acagtttgag gtactgacaa 52441 tggggccatc tatcaccctt atatcccacc tcctggcctg gttgctaagt ggccctgact 52501 gccaagatca tcaccatgga tgggggccgg gaccaggggg gccactggag ctgtagttgt 52561 ctgctccttt gcacccctcc caaaccccaa cccttaatcc tcacagcttt gctctaatcc 52621 tgtggggcct gatgctctta tctctgcctg cagagactga ccagggaagc accccttcac 52681 cccctcctct ctcagggctt gtggtggggg ctgcccctct ctatggcccc acactcctgc 52741 ccccaaccct cctcccggct gggcccaagt ctctctggaa tccgtacacc tccctccctc 52801 cccccacccc ttccccccac ccctgccagc tgatttcatt tgcctgacgt tgtggttggc 52861 tctacccttc cctgtcaggc ccccccaacc ccccgccggt cggggccagg ccaggccagg 52921 ccagctcctc tggcagcaga gcctgggcag gtgacgggcg ggcgcgggcg tcgcagctga 52981 gggagtaagg aggctcccag gaaccggagc tggaaacccg gccgaggtcc agccagagcc 53041 caagtaagag ggagctggcc tgggctctgc tctcctgggc tggggcgcag gcagggctca 53101 ggcaggctgg ggctggacgc aaaggacctc aaacttggag ggcctagctg gggagagggt 53161 gcagccaccc agtcgtttcc ctgtgtcttg gagtctggtg tgggcacgaa gggcagaagc 53221 acagcctggt ggggggttcc tcaacaccga ccccatgttc ctggcaggag ccagagtgac 53281 ccctcgacct gtcagccatg ggggagatgg agcaactgcg tcaggaagcg gagcagctca 53341 agaagcagat tgcagtaact ccagagccct acccctgggg ccccagaaaa cagctgggga 53401 catgaggagc gtggcccagg gggagggggc tgggtttgct cttgagtgac tagaagtgac 53461 cagggacttt ctaaacttgg ttcgcatatt tgagaccccc aagccccaga gccccaagac 53521 caacctgccc tgaaggccca tgcacctgcc acccccacct cgcccttgct cccctgatgt 53581 ctgctttccc tgcaggatgc caggaaagcc tgtgctgacg ttactctggc agaggtaaga 53641 ccccctgtcc cccggaaggc agggcatggg gggaggggaa gggagctccc cgacccgcaa 53701 tagcgcgttg ctccgagcat tagtacagca cgtccttgga gtgaggaaac ccattaatta 53761 attcattcga ccaacatttt agcgggacct gctctgagcc aggcaccata ttggatgcga 53821 aagatgcgga cacggttcct attctcatcc gtagaacaag agtgaccaca gccccctccc 53881 agggccttga agatgaagaa gcgatagtac tttgtaaatt ataaagccct tgtagactct 53941 agctgctagt tttctatgct gtattttgtc aattctaaga tgcacatttc ttcacatttt 54001 gacatctctg aaatcaaggt gcattcagca atggatcatt gtttcaatta gaattgacag 54061 catcttttct ttcttagtga tgcatcttag attggattaa atatgttaat agtgaacact 54121 ttttatgagc cagatactgg gctaagctgc ataaatacct gatttcattg aattttcatg 54181 atatctctgt aaggtaggtg ctatgataag cctcatttta caccttgagt aaacagactc 54241 agggaggtga agcaacttgc ccaagataac aaagcaggcc gggcgcagtg gttcacgcct 54301 gtaatcccag ccctttggga gactgacgtg aaaggatcac ttgagctcag gagtttgaga 54361 ccagcctggg caacatagtg agatctcttc ttaatataaa aaaaattgct tttaactttt 54421 taaaaaaaga tagcaatgca agtgatggca aatcaagtct gttcactctg gacccccagc 54481 ctaagtctaa aatgtggtta gagctcaagg ggcatagtac cacagatttc aggaggtaca 54541 gttcagggtc atggtgatgc cccaggcacc cagttaaaaa gctagccgtg ctgcagcggt 54601 gggtgcaagc aggctcctct cggcttggga gagccagctc tgtgcatctc ttcccaaatc 54661 tgtttccaat gacctcacat tggcagctgg aaactggtca aagtgcaagt attcgtatcg 54721 tggaaatcac cttcccacct tcctcaccaa cactggagag gtgcttgtta aacacttaca 54781 agcaacacac ctctggccat gcatgcatct ctttcccaca tcagtctaag acactccagg 54841 gcagagactt atttctctaa ggctgggggc ctgacctaga gcctggcaca gagcgggaac 54901 ttgtatagat cagtggcagt gacagacatc tgggcacgta acctgctcac cctgatattc 54961 agtgcccctc tctctgcagc tggtgtctgg cctagaggtg gtgggacgag tccagatgcg 55021 gacgcggcgg acgttaaggg gacacctggc caagatttac gccatgcact gggccactga 55081 ttctaagtga ggcttggggg ggaaccgaga atgggagggt gagagcggga gtgaggaagg 55141 cgggaagggg aggcttgcat gatatgggat gccctctccc caggctgctg gtaagtgcct 55201 cgcaagatgg gaagctgatc gtgtgggaca gctacaccac caacaaggta ccagccctgc 55261 ctccctgagc ctccaccact gcatccttcc taagggcgcc atgcctaccc tcctgtgccc 55321 agctgggagc ttggctccgg tcccatctct gctcaaacca ccctccctgc aggtgcacgc 55381 catcccactg cgctcctcct gggtcatgac ctgtgcctat gccccatcag ggaactttgt 55441 ggcatgtggg gggctggaca acatgtgttc catctacaac ctcaaatccc gtgagggcaa 55501 tgtcaaggtc agccgggagc tttctgctca cacaggtgag ggagagaccc tctcctcccc 55561 tcctgagggg ttcagggaac cctgggcttc cagtgggctg tggctctgca gccagggcac 55621 tgtccttcta accgcctcca ggttatctct cctgctgccg cttcctggat gacaacaata 55681 ttgtgaccag ctcgggggac accacgtggt gaggctgaac attgctggtg ctggggcttg 55741 ggagtgggcc cggcctttct ctaacagtct ccctccattt tggcagtgcc ttgtgggaca 55801 ttgagactgg gcagcagaag actgtatttg tgggacacac gggtgactgc atgagcctgg 55861 ctgtgtctcc tgacttcaat ctcttcattt cgggggcctg tgatgccagt gccaagctct 55921 gggatgtgcg agaggggacc tgccgtcaga ctttcactgg ccacgagtcg gacatcaacg 55981 ccatctgtgt gagtgcaccc cccaccccag cttcactcca actccttccc cgacactccc 56041 cacaacacat acaatacaca tcctctgccc ctcccatagc ttccctagcc ctttcttact 56101 gtattttttt tttttttttt tttttttttg agacagagtc tcactttgtc gcccaggctg 56161 gagtgcagtg gtgcaatctt ggctcactgc aacctctgcc tcctgggttg gagcaattct 56221 cctgcttcag cctcctgagt agctgggatt acaggtgtgc gccaccatgc ccggctaatt 56281 tttgtatttt tagtagagat ggggttttgc catgttggcc aggctggtct tgaactcttg 56341 acctgaggtg atcctcccac ctcagcttcc cagagtgcag tgctgggatt acagatgtga 56401 gccactgtgc ccagtcttcc ctagcccttt ctttctttcc tttttttgga gacagagtct 56461 cacactgttg cccagtctgg atgcagtggt gctatcatag ctcaccgcag tctcaacctc 56521 cagggttcaa ccaatcctcc caccacagcc tcctgagtag ctgggactac agatgcgtgt 56581 gaccacgctt ggctaatgtt ttgatttttg tgtagagatg ggttcccact atgttgccca 56641 ggctgagctg gaacttctga gctcaagcga tcctcctgcc tcagtctccc aaaatattgg 56701 gattacaggc atgagccacc acgcctagcc cattttatca cgtttgcttt atcatttttt 56761 ctctgtagag aaatagccat agatacacac acacatacac acttagatgt agatatacat 56821 tttctttctg tgtcttttga aagtaggttg cagacatcat gtttctttac cccttaagta 56881 cttcccgaaa caaagccctt ctgtagcata atcactgtat gattcttaag atcaggaaac 56941 tcaacattga cacaatacta cagtctacag tctatattca aatttcacca atagcctcaa 57001 aaatgtccat aatagctcat gtttttttct gtccaggatc caatccagaa acgtgcatta 57061 cctttcctcg ccatatcttt ttcgtcacca cctttaaact ggaactgttt tcccaccttt 57121 gtcttttatg tcattgacgt ttttgaagag tacaaaccag ttttctgtag actctccctc 57181 cgtttgagct tatctgacat ttgctcgccg tgagatccag gccttgcatt tgtactggac 57241 cctgttctta cacaccctga tccagcccac ttgtgtagtc tgggagtctg ggacaacctc 57301 cgtccgccct tctagccggg tcactgcagg caagccttgg tgctcttgcc tgcgacgtgg 57361 aaatgatgcc tgcctgcagc gctgtatagt gcagagcggg cgaggggcat agggaagtca 57421 ctggcacgtg gtatgtgttg gcagggctgc ttctcacccc aaaccaaggg agggacaggc 57481 agggaggctg agagcagcgg cttgccctgg agctgtcagg tgggaggcag agggcgggag 57541 aggctgtggg ctgcccaggt ctgatccctg acccacttgc cacccgtgcc ctcagttctt 57601 ccccaatgga gaggccatct gcacgggctc ggatgacgct tcctgccgct tgtttgacct 57661 gcgggcagac caggagctga tctgcttctc ccacgagagc atcatctgcg gcatcacgtc 57721 cgtggccttc tccctcagtg gccgcctact attcgctggc tacgacgact tcaactgcaa 57781 tgtctgggac tccatgaagt ctgagcgtgt gggtaagggc cagccctggc tgctgcttcc 57841 tcagctggaa ggaccctccc cagccctccc tccccattct gtacccccca tcagctccca 57901 tttcggactc tcttactgct gtcccttgtc actgggtgac tccacccctg gaatccagta 57961 ccccttggtt cccaactagg actgttttcc ctcagtgttg ctctaagcag cctctctcca 58021 ctgcccaatg ccatgactgc tccctgccct aggagatctg tggaccatga ctgtccagtc 58081 agttctgggt tcctggcatt tcaggggcac ccactgagag gcaagacagc ctcagggaaa 58141 catggaatca aggcagaatc aaggagatct ggagtggccc gagggccctg actgcagcct 58201 agggctcatc taagactagc ctgaagttgg tcagatagga tggcttcttc tctatcaaga 58261 ggccaggtgc tgatgaaaat aacactggcc aggggcagaa tccaaatcct aattctgttg 58321 ctcactttct gagtgtcctt gcacaagtca ctcctccatg ggcctcagtt tcttcatctg 58381 gaagcaaaga aataggatga acggttgcct tctcttttcc agcacctgtc aaggtgccct 58441 cgggtttttt catagtacac agatgcttcc agcctttctc tgtctgatcc cagccctccc 58501 ccaaccaaag ctcttgacag agaaccccct gccctacagc taatcctctt tcagtctctt 58561 agccttcttc aggggttgag cccagtctac cgagtctagg atccctgcat gcatgtgctc 58621 aagcacacat gcacacacac acgtacacac acccacacat gcatacacat gcacacatat 58681 ccccccacac acccacatac acacacacac ccacacaccc acacatacac ttacacgcat 58741 gcacacactg ctttccaaat cagcttggag agacaggctg actcctttcc ctcttcctca 58801 ggcatcctct ctggccacga taacagggtg agctgcctgg gagtcacagc tgacgggatg 58861 gctgtggcca caggttcctg ggacagcttc ctcaaaatct ggaactgagg aggctggaga 58921 aagggaagtg gaaggcagtg aacacactca gcagccccct gcccgacccc atctcattca 58981 ggtgttctct tctatattcc gggtgccatt cccactaagc tttctccttt gagggcagtg 59041 gggagcatgg gactgtgcct ttgggaggca gcatcaggga cacaggggca aagaactgcc 59101 ccatctcctc ccatggcctt ccctccccac agtcctcaca gcctctccct taatgagcaa 59161 ggacaacctg cccctcccca gccctttgca ggcccagcag acttgagtct gaggccccag 59221 gccctaggat tcctccccca gagccactac ctttgtccag gcctgggtgg tatagggcgt 59281 ttggccctgt gactatggct ctggcaccac tagggtcctg gccctcttct tattcatgct 59341 ttctcctttt tctacctttt tttctctcct aagacacctg caataaagtg tagcaccctg 59401 gtacatctgt gatgtttgcc ttctactctc ttctgttcca aaaagaccca ggtcccattt 59461 aagggcagta atgtgttaca ggtgctgtga taaaggctgg gtactggata gcttgtgggc 59521 ttatgggagg aggcctgaga tgggtcaggg ggagaaggta ttcagcaggt ggctggggga 59581 ctgtgtgcag cagttcgcta tggcctgcct gtggtgccca tgtgtttgta cgggagggtt 59641 agcttgagaa ggaatcagat tataaaaggt cttgaatgtc aagccagaga gtccagactt 59701 tttcctaagg gcaatgagaa gccattgagg agttctgagc agagtagtaa catgatcagt 59761 tatgcttctc agaaagattg ctccagatct ttcagaaaga agcagtgtag gttgatgaaa 59821 gggagggact ggcactgggg aaacaaggta ggaggcaggt ggcaagaaat gcaagagcga 59881 caggacagag gggttaggga gagggattag gaagagatta aagatgtgga atcagctgga 59941 acagtggctc atgcctgtaa tcgcagagcc ttggaaggct gaggcagaag gttctcttga 60001 ggtctggagt ttgagacaag cctgagcaac atagtaagac cctgtatcta caaaaaaaat 60061 taaaaattac ctgggcaggc tgggcatggt ggctcatgcc tgtaatccca gcacttcagg 60121 aggctgaggc aggcagatca cttgagccta ggagtttgaa accagcctgg gcaatatggc 60181 aagacctcat ctctataaaa aaaatacaaa aattagccag gcgtggtggc tcatgcctat 60241 aatcccagct actggagagg ctgagacagg agaatcgctt gagctcagga agttgaggct 60301 gcagtgagct gtgatcgtgc cactgcactc cagcctgggt gacaagagca agaccctgcc 60361 tctaaaaaac aagcaaacaa acaaaatgtg gaatcaccag gatttagtga ccaaatgcta 60421 ggatatagga cattgggagg gaaggattcg agataatggc caaggtttgg gtttggtggc 60481 tgggtagata atgctgtcat tcacaaggat agggaagagg gaatgaggaa tagcttttgg 60541 gggaagatca taaattcagt tttagacaaa ctgactttga agtacttata aagcagctgg 60601 ataaaggtca gggttagaga gagagacctg aaagttacca gaatataggg aagtgatagc 60661 taaagcccta gaagtaaaca cccagggaga gcagggccaa ggacagagcc atagggacac 60721 catggttaaa gggagatgtg caaagaaggc tcagagcatc agtgggctca ggccagggaa 60781 cgcaagctct ctaggccgcc attctacaca cagaaaactc ggtgcaaggt ttacatatac 60841 ctttttaaag taacatttaa aattacttcc tttaatataa aagaaacaca caagacacaa 60901 agaaagccca ataaagctgt gagtcccagt ttggtagctg aggaggagtc taagaaaaca 60961 agccattcct cagtatccct gggaaagaag gggtgagagg acacagatat caccaggccc 61021 tgggtgactg cattgctggg gccatgcagg gcctagctct ccaccaaggg aaagtgctga 61081 ttttccttgt catggtcctg gccttgctcc catgctcgtc ctccagtttt cagaagtcgt 61141 ccagttccaa gaatggctcc ttcctttagt tcactaacat tttcacttag gggtgaaggc 61201 cgcttaccct gaaaacggca gtgtatgagt tggggggagt tgctgttcag aagagggaag 61261 atctgggtaa aagggtctcc caccctctac gttggatatc ctagtaacag cttgcacttt 61321 ctcttccttt acctgtcgta gtgtcagggt gccaggggag ttgtcatcct gcaggatggt 61381 gaggggggat ctccctagta ccttgctgct gtttggtttc catctattgc gcatagaacc 61441 tggggtgggt aaggcgttaa agcaaggacc caaaactagg acttacagtt tattcctccc 61501 acactcaccc atggacctta tcccaaaaca ttatcctagt ttatcttccc tgcatcttgc 61561 cttagattct gtacctgaag atctgggagt ctcagggtcc cttgagggct tgtcggagct 61621 ctggctggcc acaggggttt ctgtgggctg tcttgcttcc tccttggaaa acacctgttt 61681 ggaggggaac tcagtctggt tccaaggtgg catctgttcc tcaacagata actgggtacc 61741 cagaggcaag tccaattcag aagataaagg tgcctctggg ggcagaacag gctctggggg 61801 aagatttgat ttagagtctt cagtttcaaa tacttcactc agctgtttca ccagtgggct 61861 tggggggtct agaggaagaa agtgaagtga acagtgacct tctgatacac aacattcagc 61921 aaggagcaaa acatacagaa acccaggact catgcccagg agggtgagca agtgggaaga 61981 gagaaatcaa ggaaatcaag gtgaaaggga gcaatttttt ttttttttga gatggggctt 62041 gctctgtccc ccaggctgga gtgcagtggc gtgatcacag ttcactgcag ccttgacctt 62101 gtgggctcaa gcaatcctcc cacttcagcc tcctgagtag ctgggatcac aggcccatgc 62161 caccgtgctt ttgtgtgtgt gtgtgtgtgt tatgtgtgtg tattttttct agagatgggg 62221 ttttgccatg ttgcctaggc tggtcttgaa ctcctgggct caagcgatct gaccaccccg 62281 gccttccaac gtgctgggat tacaggcatg agccaccgca cccaggctgg gaacaaaata 62341 ttgagataag agtgagatgg gtccgaagca ggaggataga ggccccttta aggaaccctg 62401 agagaatggc ttcatggacc ctacccaaga ataatagatg acagcttcat cagcattccc 62461 tgggcccaac gcttacctcc actgctggtc ttcataggtg tccgtgcaat accaagagta 62521 ggagagcggg gatctgagtc ctgggcatgt ttaagaccct ccagttgctc ccctgctggt 62581 aggcctggct gtggagagct ctccacctgt caagaccata ggctagaaca agtctgggcc 62641 ctgactcttc tctcctctcc tccccatatt ccaggaagga attgccaagg ccctcagata 62701 tccagcctac cccacacaga ttgagattgc agaaaataag gctaggataa gggtggggtc 62761 atgacggctc tctcaaaatc aggttaggaa gagacccaag aattcagaca ttctctgcct 62821 ttcccacgct ggcccagagt acctggatgg gagtgcgcag gatgccagca ctaggtgaac 62881 gggggtccgc cactcgagcc agatgcttgt tgtgcggcgg aggccgcgct ggtgtgactg 62941 ggacgctctt ggctgagccc atctcaacca ggagttgcag ggtgggggca agggccagcc 63001 cgggacgagg agggaatgcc tgtgagaagt gactcaggtc tcagccctta tctcttccac 63061 gtcggacctt caaccctaaa ccaccctgtc ttcccagttt ccagtgaaga aacttcctgg 63121 cgagggacca gaggcctctg cggagttaat gactgctgca gctgacaaaa tggctcgttc 63181 tgggcctgct gagccctcta acttatttat taaattattc aaatcactta ccgggcccca 63241 attccacctc tggatgcaca acagctcgtg gctcaactcc cgaagttacc agtttcaaac 63301 ttcccgccac gccccctgcc ctgaccctat tggctgagcc gacgtagttc ccataggccc 63361 tccacgtaag acccgcctcg agccaaagga agctgttggt ctgggagaac gtctatcatg 63421 gtgactgtag gtgtcaatgg actgatgcta ggcttgccca ggggtgacgc ctccttgata 63481 agggtacttc tgttcagtat cggcctgcga gccgtccgtg gcaggctgag gagtgccagt 63541 cttcgctttc gagacgtcta gtctcaagaa cgaattatta aacaaaatag aatcatgagt 63601 gattacacta ttatgcgcac atccagctgg agttgtgatg gttttcctga ggcgagcgat 63661 agtagcttta ttggttgaca gctgtatcag tcactcgaca caaccccctc aagctccgcc 63721 tcactggata ccccctttac aaataccaag acaactgtcc ctcgactttc cgaccggaca 63781 ggtccccagg aaactttctg gaattgagag gcctaaaaca accgtcagtt cctgctcttc 63841 aatagaaggc acgcgaatgg actctaaggg tcccggagcc ggagcagctg cggagccgca 63901 gcagctgcaa ccccgaagcc cacctagctg ctctacgtgc gctcccgccc tggaggagtg 63961 ggattggttc tttcaaccat caattgcctt aggagcgctt ctcattggcg tcagtcaagc 64021 gcggggcagg ggcggagcac cggcaggcag gctccgtcca actgccattc tcgcgcgtcg 64081 tctccgcggc gcatgcccct aacgagtggc gctcattgga cgggaggggc aggggagggg 64141 actgggaacg gtgggagccg ccgtgtgtgg agaagctgct gccggtgtca tggcggagct 64201 gagtgaggag gcgctgctgt cagtattacc gacgatccgg gtccctaagg ctggagaccg 64261 ggtccacaaa gacgagtgcg ccttctcctt cgacacgccg gtaagcccat tccccacgcc 64321 cgcaacgagc acgacttcct tccatcgccc tggtcattcc gctggggcct gcaaggcttg 64381 ggctaccgcc tccctgcgat gcaccatggg acttgtagtc tcccacgctc cactctgccg 64441 ttgcctttca ttaactgcgg ctgtcgtgtg actaccgtcc ccggaagccg ccgcgctcac 64501 ctctccggct ccccgcttcg tggagcactg tgggcattgt agtcactcac tgccttcgtt 64561 gccgttctaa gctaggggcg cagcccgacc gtcggactac atcccccaag tggacccttg 64621 agaaaacctg gcagtcggcc ttgtctgccg cgcacggctc tccgggtttt gtagtctcgt 64681 gtaggagtga tggggacgac tccccactgt tctcggccgg ggttggagtt ggggggtggg 64741 gaccgcagtt cctaggcggt taatagtctg ggagtggact ctcaagaagt ccgattctga 64801 gaaaggcaga gagcctccac cccgtggtta aaacagacag ggggcgggga ggggaggaaa 64861 gaggagagga aagctctagt cgtttctcct ttgaagtgag caatggccgc agcccccacc 64921 cttccccccg ccccccagcc tatcacagac agctgagata acccctttta aagtattcct 64981 agagaggtgg tttctttctc ggaggatgct aaggcctctt tcccatcccg ctcccactct 65041 tggctagtta cacaaagcca cctcccacac catgcttcac gttttgcagg ctccacctgg 65101 tacctccctt cctgcagttc tgccttccct atgtcaccct caagcggagg ccctttgggg 65161 cctgaactga gaaggcagca gcttggttgg gatccgcccc cctcaccacc cctcccctca 65221 agctgctgtg ccctgcccag tgtattctgc cagaaccagc tcatggcctg aaaactgcaa 65281 gtcagagtgg aaagccttca ttactgcatc accagtactg agtagaacag catgatgtga 65341 ggcccagttc aggatcatca ctgtgatttg tggtttgctt ggtttgctag gctttgatct 65401 gggcctcaga caatgtgctg cttctctgaa gtctggcttt ggcagaggta aaagagcact 65461 gcgcggagca gagctccggg gttcccaatc cttggtgaaa tcagcatctt taatttccaa 65521 cagaggagtt taatcacttc agagctccca cattcttccc aggagggaaa ggtttttctt 65581 aaattaccag caggtggtag aagtcagtgg gtccctcttt acacttttat tcactcctgt 65641 ctctctcctt agtaactgct ttgaataatc caaaaagggt ttttgtaatc ttctgaaact 65701 ttcccttttt caccccaagt caagagattt tcagtgtgcc ctagttcttt gctaaccaac 65761 ttccagcata tttggttacc ttcaaattgg gctcatggta gaaagagctt agttgacctc 65821 cccggtgaga aatgctctat ttgttcatga acttgcagta cacactgttc ctaggaagtg 65881 gtacagagca atctatggga ccaagaacta ccttagacag tgcactattt ttcatatcac 65941 agcagagaaa tgttcaaggc cactgagact tttattcctg gaagaggatg tctgttgttg 66001 ctcaaagcct tggatctgag ggaagaccct tattgtctcc aatttcctcc tccaccttct 66061 ccatcactct ttacaaaggc actggaaaac ttggtcagct ttgggggaag cccatcctcc 66121 cttctcagct gtgtttctgg gtggacaccc gtctgtccct gttttcacgg gttatgtgtg 66181 ttctagctat ctcagcaagt gttgcgagat tgttgtcaga acgggatggc tttgagattg 66241 gtgatttgtt gggacttcag cagggatctt attggtcgga ggggaggcag aggagagagg 66301 gtggtggcta tggaatgctg gacttcatga ccctgccaga ctgggtgcat gtttagctgt 66361 gagtccctaa caagtcattg ttcccctgga agccacactt cagcaggaga ggagggagaa 66421 gactcacagt gtggaccagg gttcgactac ctggtagaat gtggtaaaag aaaggtcttt 66481 ttcgctggat gtttagtctc accgctgtaa tcccagcact ttgggaggcc aaggtggaag 66541 gattgtttga ggccacacaa gttaaagacc agcctaagca acatagcgag accccacctc 66601 taaaaaaaaa aaaaaaaaaa atctaggcat ggtggcatgt gcctgtagtc ccagctcctt 66661 tggagactga ggcagaagga tcacttgagc cggggaggtt gaggctgcag tgagctatga 66721 tcgtgccact gcactccagc ctgggtgaca gagtgcagtg tctctaaaat aaaatttttt 66781 ttttttttaa aggaaaggta attttgcttg ctggcacact cgggtcagat atactgttca 66841 agtgctggaa tctctgctcc tctggtctgg aaggttttct gaatcatgga gatgagatag 66901 ggtttgtctt aggtgtcccc tggtttgggt tcagtgtctc acatgaaagt ctttcctctc 66961 ccatgaaggg accctttttc cctttgaatg taatctccag cactggaagt ccacagtatt 67021 tgatccccat ttactttctg gatctgttag ccccaggata gaggtgaagg aagtcctctg 67081 tgattgatta aagagctcca aagcccaaga aggagcaggt ggaaagataa atggtgactg 67141 gtgttgctgt tgaatggcga cgatgagctg gtgttcctca gccccctgat gtccttgttg 67201 gttgttatca aagtcatggg aggaaatggg gagaatctta gctatgtccc aggtcctaaa 67261 aggctaaata acttgccaag ggccacacag tgttagagaa cagaacattt cttctggaac 67321 ccagcagttt ctgactccag gttttggttt cctcacctga ccaggcttca taacatcctc 67381 gtgttttcac ccttacctct tgtcccacag gagtctgagg ggggcctcta catctgtatg 67441 aacacgtttc tgggctttgg gaaacagtat gtggagagac atttcaataa gaccggccag 67501 cgagtctact tgcacctccg gcggacccgg cgcccggtag gagcagggct ggggcaaggc 67561 ctgggtacat tgtctgttcc attctgacct cctattggac tcagtttctt tttttcacct 67621 acttttgtgt cattaaagct tggcacagat gttttttagc tttattccgt tctgagatga 67681 agcactacct ccttccgtcc ctcctcccga cttgttcctt cgctcgtgct cattgctgat 67741 ccagcccttc ctgcttcttt acagaaagag gaggaccctg ctacaggcac tggagaccca 67801 ccccggaaga agcccacgcg gctggctatt ggtgagcacc gctgcagtcc tattcttctc 67861 cctgagctgg tctttctggc tctcagcagc accaggaaag ccccaaagag tgggcctgat 67921 tgatggccct agaactctgg atggggcaag tgaggtgctg gctcggagga gggacattga 67981 cctgttgtca ttgctctact ctccctcttc ttccctatcc ttccaggtgt tgaaggcgga 68041 tttgacctta gcgaggagaa gtttgaatta gacgaggatg tgaagattgt cattttgcca 68101 gattacctgg agattgcccg ggatggactg gggggactgc ctgacattgt cagagatcgg 68161 gtatgactgc cccctatgct acccaagatt ctagagcaag atgggccagg gtagtggtgt 68221 cttaggcaag cactgacaaa gctgaaggcc aaggggaaga ggggcatgtg gggcaggggg 68281 ttgggtattg cctctgaccc tctgcttccc ccaggtgacc agtgcagtgg aggccctact 68341 gtcggccgac tcagcctccc gcaagcagga ggtgcaggca tgggatgggg aagtacggca 68401 ggtgtctaag catgccttca gcctcaagca gttggacaac cctgctcgaa tccctccctg 68461 gtgaggcctg gcccctctgc ctcgggcacc acccccagag caaggacaag gagcccactt 68521 ttctggggga tctggtggga gagagggtag ggagcaggac aggaagggaa gcttggaaat 68581 gaacacgaca tggggatggc cagaggagaa gagagagaga ccttggattg gcggggggcc 68641 tgcagagccc tctctctctg ccactccctc aaatccccga cccacatttc tgctgattct 68701 cttctctgtg ggttagtggc tggaagtgct ccaagtgtga catgagagag aacctgtggc 68761 tcaacctgac tgatggctcc atcctctgtg ggcgacgcta cttcgatggc agtgggggca 68821 acaaccacgc tgtggagcac taccgagaga caggctaccc gttagctgtc aagctgggca 68881 ccatcacccc tgatggagct ggtacagcct cccctcctcc agccactctc atgcttaaat 68941 atatttcatt tgttagtaat ttcgtgtgac atgtataata catgaatgca ttatcttgat 69001 aagaaaggaa agtagtcagg tgcggtggct catgtttgca atcccaacac tttgggaggc 69061 caaggcagga gaattgcttg agcctaggag ttcgagacca gcctgggcaa catagtgaga 69121 cctcatctct ccaaaaaaga aaaaaaaaga aaaattggct gggtgtggta gctcctgcct 69181 gtagtctcac ctactcagga ggctgaggca ggaggattgt ttgagcaccc gggaggttga 69241 ggctgcagtg agctatgatt gcaccactgg actccagcct gggctacaga gaccctgtct 69301 caaaaaagga aaatactgca gatgagacta aatttttgat tactccccac tcaaatcctc 69361 atccctttcc taccaagcct gtcttacctt atcctataca ggtagaatca gaatcacaag 69421 taattccaac cctcacccgg tttcatctgt cttccctgac cgtcagcccc agaagggcca 69481 tgaccgcact catattactc atattcaagg gaaccaagag ggtggacctt actggctctc 69541 cttctgtccc ctcaagtccc ttctgtgcaa gggatgggat ggtggcctgg ctagtcctga 69601 gccacttccc ctgattctct tcctgcctcc tgctctagac gtgtactcat atgatgagga 69661 tgacatggtc ctggacccca gcctggctga gcacctgtcc cacttcggca tcgacatgct 69721 gaagatgcag aaggtgagac ccccttcaac ttcagattct tctacttcct gcccctgtga 69781 gggcccttcc tctttgcctg aggctgaggt agggttttgg agaaatcttc tagttaaccc 69841 catccccaag atacacaggc ttcttttaaa tatctaaaat catttattca gtgaatagta 69901 ctgagtacct actatgtgct agaggctgtg tgaataagac agatgtggtt tttgcttaca 69961 agagttctac tctagaagaa aaggcagaca ttagtcacat tcccagataa ataagtgaat 70021 agttgcaaac tggtgggcac tgtgaaagag aggcggaggg tactacagta attgacagca 70081 gggtggcctg gcctagttta gggatcaggg acatttccct gaggaagcga tgcttaactg 70141 acaatgtgaa ggaggttagg ggtaacttaa ggatgggaaa aacatttcag gtagaaggaa 70201 attttattcc tcattcaagc agaggccctg atgactaacg ggcctctggg acaattatgt 70261 gtggccttcc agaacttgaa ctggacgata ctcaacatac ctcccatgtt tgagcctgta 70321 attccacaaa ttctaggcca cagttgagta tcatcctaga ctcagtgaga gatcgaggta 70381 aaaactacag ggttgagttt ctcactcagt ctgaagtgcc ccttctcaca cagacagaca 70441 agacgatgac tgagttggag atagacatga accagcggat tggtgaatgg gagctgatcc 70501 aggagtcagg tgtgccactc aagcccctgt ttgggcctgg ctacacaggc atccggaacc 70561 tgggtaacag ctgctacctc aactctgtgg tccaggtgct cttcagcatc cctgacttcc 70621 agaggaagtg agtagtgccc tctccttccc caggccccct cctggtcagc accctctggg 70681 catactcctc cttcagcttc cctcagcacc tctgtgtttg attctagtct tagagtagtt 70741 cctatcggct gggtgtgttg gcccacaccc gtaatcccag tacttcggga ggccaagatg 70801 ggagaattgc ttgagcccag gaggttgaga ccagcctggg caacatagcg agaccctgtc 70861 ttctcttaaa aaaaaaaaag aataattcct gtcacactct ggctgacatg ctgtgagggg 70921 agcaagttag ggagctgcac caccagcacc ccaacacaca aaccccactg atagtgcaga 70981 gtgccaagtc ccagggctct gtggccttct gacagactca agggttagca ggtggtcttt 71041 attttagggc agtgacatca gatgagcacc taacatggac atgtagcact tggggaggag 71101 gggggcttgg cacactaaga ataaaggcag gttctcagca gaaatgagtt tactttcgaa 71161 tttggggaac aattagaatg tgaatgctca gaacatgtca acaggttctg tccaatagaa 71221 cgtgaactgc acttatccta atcccaggct accttccccc tccccggctc ctctgccgcc 71281 gagcgtgctt gcacattgga catagctgag aaggggagaa caggcaggtt gctggagaga 71341 agcgactcca aggttcaccc acagcaactc cccactcttg aggggagccc gttcacagag 71401 ttagtaaatg tttgcttcct tcccttttcc tcggcgctca ggccagagcc cctccaactg 71461 tccttccctt gacttttagg tatgtggata agctggagaa gatcttccag aatgccccga 71521 cggaccctac ccaggatttc agcacccagg tgtatgtaac caggtcctat gtaggaaagc 71581 tgttgacagt catggccgta taccttcctc ctccctgacc gcctgggggg cacagcactt 71641 taacccctgg cctttttttg ttttgttttg ttttgttttt tgagacggag tcttgttctg 71701 ttgccaggct ggagtgcagt ggcgcgatct cggctcacta caacctctgc ctcctgggtt 71761 caagtgattc tcctgcctca gtctcctgag tagctaggac tacaggcaca tgccaccacg 71821 cccagctaaa ttttttttgt atttttagta gagacggggt ttcatcatgt tggccaggat 71881 ggtctcgatc tcctgacctc gtggtctgcc cacctcggcc tcccaaagtg ctgggattac 71941 aggcgtgagc caccgtgcac ggccaccact ggcctattag gagcattcgg gtgactgtct 72001 tctggacgtg ttgtctgtct tgagctgggt tcctgtagaa tctaaggttt ttcacgttgt 72061 tgagagttag ctagggagag ccacgagcag ggggttgagc tggggacaca cagggtcgtt 72121 agttgactga agtgaaatgt tttcttggat attaatgcag ggccaagctg ggccatggcc 72181 ttctctccgg ggagtattcc aagccagtac cggagtcggg cgatggggag cgggtgccag 72241 aacagaaggt gcgtctagga ccctgtccct ttcaggccct gggattgtgg ggaagctgag 72301 gtctgggaga tgtctaaaga aggcccctgg atggccactg agccccagct gagtccctgc 72361 cctgactctt cccaggaagt tcaagatggc attgcccctc ggatgttcaa ggccctcatc 72421 ggcaagggcc accctgaatt ctccaccaac cggcagcagg atgcccagga gttcttcctt 72481 caccttatca acatggtgga ggtaagggct ggcaagatgg cacaccccca tcttcctgca 72541 atttactcgc tctccttcct gcccatttct ccctctatca gccccaaccc agtcccatcc 72601 ctgaacccca acagtctgtg tccctgtgaa cagtgcttgc acctgaggag gacagtgaag 72661 gtgatgatcc ctacccttga tgggcttgaa gcccaattca gattatgccc aaagacaatg 72721 ataagagaat acttgctgct tataaagaaa aatacaagcg tgttctaaca cagtccctca 72781 gtgcagggat ggggccagaa atggggaggg gttggtgaat tagggaaggt ttctgctctg 72841 ctcttgtgtc cctgagttcc gagtggtagt ctgccttctc tccctgactc tcccatgaac 72901 ctttcaggcc ccattctgtt ccctggctgc ccagacctcc ctaccctgcc tctttcccat 72961 agaggaattg ccggagctct gaaaatccta atgaagtgtt ccgcttcttg gtggaggaaa 73021 agatcaagtg cctggccaca gagaaggtga agtacaccca gcgagttgac tacatcatgc 73081 agctgcctgt gcccatggat gcagccctta acaaaggtag gctgctccat cagcaaggcc 73141 gtggcacggt gggaggctaa ggtctaggag gaatgcttgg gcacctcatg ggagcacagc 73201 ccagggaatg ccctgcttca ccagccaagg ttccagtctg ggccaccagg agtaggtaga 73261 ttaacctcgt ccagtggaca ctcagcttgt tagcagagct gcatggaaac aggcgagata 73321 tattggacat gtgtcagggg tgctttggaa gggtagagga actgaaatac ggacacagag 73381 ccagtaggga gaggctaagg aggcaaagaa gcagcaccct ccgaaggatc caccaaccca 73441 ttctgtcacc agaggagctt ctggagtacg aggagaagaa gcggcaagcc gaagaggaga 73501 agatggcact gccagaactg gttcgggccc aggtgccctt cagctcttgc ctggaggcct 73561 acggggcccc tgagcaggtc gatgacttct ggagcacggc cctgcaggcc aagtcagtag 73621 ctgtcaagta agtcctctgg tcggggcctg aggctgtggg tctatggcag agctatccca 73681 gaggatactt ctgtctcttc cctgttcttt catccctttg tggttggaac ctcagggttg 73741 aatcagttgg gctggtaatc tggccctgat ggaaccaagg ggccctggta ggggggacac 73801 gggtactgac tctatctgtc accttgtcct gccccaacca ctccctcttc tgtggcacag 73861 ggtctctttg tagtgtagcc tctcttcatt gcccctcagt catggctgag cacaggcact 73921 taggatcagc tgtgggcaga atttggattt ccagcaaata gccacacttt gagcctagcc 73981 attggcacct ggcatgggtg tgagacctga aaaagggctt tggtctcggg gagtcagcta 74041 cccagccctt tgctgctcct gccagtgaaa cagctcttcc tctctgtgtg ggtttttttt 74101 ttttttctat gtgtgggtct tgctatgttt ttttccctac gtgtgggtct tgctatgttg 74161 cccaggttgg tctcttaact cctgggctca agcagtcgtc ctgccttggc ctcccaaagt 74221 gctggaatta cagatatgag ccaccatgcc tgggcttttt tctttttctt ttttttgttg 74281 acagccttcc ttctggcctc tactgctcct cgttggccga ggacaggcat cagacacctg 74341 cccaggggaa gagaagaata tggaggatct tctgggttat tgggacttct acccatgagg 74401 ggctgaaccc caggtgggag tttcaggaga ggaaaaccct ggggattgtc tgggtaccag 74461 cacagtctct cttccctagg accacacgat ttgcctcatt ccctgactac ctggtcatcc 74521 agatcaagaa gttcaccttc ggcttagact gggtgcccaa gaaactgggt atggctgctg 74581 gaatgaggga ggttacacag aaaatgctag aaaaagaagg ggctttaaca catataaact 74641 agtgtttctc aaagtttcct ctgaaagagg aaaaaaaatc accttcaatg tttcaaaatt 74701 tatctagttt gaaagcacta actaaaaata ttaaaagact tctttgattg ttataggtat 74761 gatttacaac caaaagtgat ttttgtagtt ttttaaaata attggtaaaa tgttttcatt 74821 ttgataatct tgaaacaatt aattctcgtg agtcttacag gtccctgact gtcatttgag 74881 aactcgtgat gtaccttcat tttctagata aggaagttga gactcacagg aatataaaat 74941 cattctgtct cacaagttct aggccccctg ccctgtccat agattattta tggtagatct 75001 aggaccagca tcctggcctc ccaactccca ggcttagtat tatttgtctt atactgggac 75061 ccctaagatg ggtgccgtgc ttttagcagc tcctctttac agagcagttc tgacataggg 75121 ggcaggggat tgaggttccc gaatcacttt ccaggtaggc ctccgggagc tgctgaggtg 75181 acccttttcc cacagatgtg tccatcgaga tgccagagga gctcgacatc tcccagttga 75241 ggggcacagg gctgcagccc ggagaggagg agctgccaga cattgcccca cccctggtca 75301 ctccggatga gcccaaaggt agccttggtt tctatggcaa cgaagacgaa gactccttct 75361 gctcccctca cttctcctct ccgacatgtt agtgactctt cttcctgcct gtctctctcc 75421 cgtgctgatg ggggcctctc tgccttgcat cccctgcccc tgtcctttgt gtttctcctg 75481 tcctcccttt caaatttcct ctgccctctt tgattgacat ggggcctccc cagcgcactc 75541 ctagccacct tctgggtgtg gatggcagca gtgccagcta gtggcctctc tgcccttgtt 75601 acctccctgg tttgggtagg gtgggggtat catgctccag aaacagggcc cattctgtga 75661 gaatgggcag ggacccacat tacaattgtg tggtcatttt ggttttggga taaggtgcca 75721 atccatggga gaaaaatgca tggaatgggt gattggaaga gggctgggtc ctgggggagg 75781 gaggaggagc aaacagtggc ccaatcagtc ggtccgtgta cccacaattc ccattacagc 75841 gcccatgctg gatgaatcag tcatcatcca gctggtggag atgggattcc ctatggacgc 75901 ctgccgcaaa gctgtctact acacgggcaa cagcggggct gaggccgcca tgaactgggt 75961 catgtcacac atggatgatc caggtaggct ggggtgggga gcagggtggg gcagggcctc 76021 catcctcccc caaacacatc aaccccttca catccacaga ttttgcaaac cccctcatcc 76081 tgcctggctc tagtgggccg ggctccacaa gcgcagcagc cgacccccct cctgaggact 76141 gtgtgaccac cattgtctcc atgggcttct cccgggacca ggccttgaaa gcgctgcggg 76201 ccacggtatg ggctgcccca gctaaggaca tggggccagt ggggaagaag ggggtgggaa 76261 tgaggggcca tccttcttga gcaagaccaa agacaacagg tgtggtctgg ccgaggttgg 76321 gcaccactct tttgtggctg cgatgaagga gctgaagcct gcgttcactg ctgggttttt 76381 tgccccagaa cttgttaaaa atgtaatgct tctgtttggc cgggcgcggt ggctcacgcc 76441 tgtaatccca gcactttggg agggtgaggt gggcggatca cgcggtcagc agatcaagac 76501 catcctggct aacacggtga aaccccgtct ctacgaaaaa tacaaaaaat taaccaagcg 76561 tggtggtggg cgtctgtagt cccagctact tgggaggctg aggcaggaga aaggcgtgaa 76621 cccgggaggc ggagcttgca gtgagccgag atcgcgccac tgcactccag cctgggcgac 76681 agagcaagac tccgtctcaa gaaaaaaagt aatgcttcct tcctctccaa gaacaatagt 76741 ttagaacggg ctgtggactg gatcttcagt cacattgacg acctggatgc tgaagctgcc 76801 atggacatct cagagggccg ctcagctgcc gactccatct ctgagtctgt gccagtggga 76861 cctaaagtcc gggatggtcc tggaagtgag tatccccagg aagcaggaca ggcctggtgg 76921 aatctggtca gtctactaca ccagatccct cattcaggca gctctgccct cctcaggagt 76981 caggggcttc tttccacctc aacgtggcat ctgagggtgg ggttctctga catggagaca 77041 ggctctggcc cagtacctgc ctcacattcc ctgaaactcc tgtcctgagt ctgagctgtt 77101 gtctggatct ttgggccatc actcaagcat tcctcttctg tttccttctc tgaccccctc 77161 tctcctttcc tagagtatca gctctttgcc ttcattagtc acatgggcac ctctaccatg 77221 tgtggtcact acgtctgcca catcaagaaa gaaggcaggt gagtgctggc cacatgcgtt 77281 ttgaatggag aatggtggtg ggaatgagga gggccatttg gaagtccttg ggatagctat 77341 gggagaaggt gaagggactg cctgatgggg acataaagtg ccagagagtg acagaaggtc 77401 ttgcagcagt gattggccca agggttggat ctggccacca gctttttttt taaatagaga 77461 tgggatctca ctatgttgcc caggctggtc ttgaactcct gagctcaagt gatccttctg 77521 cctcggcctc ccaaagtgct aggattacag gcatgagcta ccgtgcctgg ccaggccccc 77581 tgcctttgta aataaatttt cactggaacc tggacacact tgtttatgtg ttgtttgtgc 77641 ctgttttcac gctgcggcag gaaagttgag tcgttgtgtc agagaccaga gagagagcct 77701 gcagaacctc aaatactatc tggcccttgc cagaaaaagt ttaccaaccc cctgcctccc 77761 tggaatgggt ggagggtggt tgtaaaggta ctggaggatc tgaagacata atagggtccg 77821 tgacccttgt gaggttgtga agctccctta aggcacatgg tggctgggct gtggatttgg 77881 ggtatgggca gagagtgtgg agagcacttc caggggccat gtctgagaga ctacatgatg 77941 ccactttgaa tgcccagttt gttcatcctt ttctgttttc cccacttccc cagatgggtg 78001 atctacaatg accagaaagt gtgtgcctcc gagaagccgc ccaaggacct gggctacatc 78061 tacttctacc agagagtggc cagctaagag cctgcctcac cccttaccaa tgagggcagg 78121 ggaagaccac ctggcatgag ggagaggggc tgagggatgg acttcagccc ctctgctctg 78181 tacccttttt ccttttgtcc ccggcagcag ggaagaagct ggaggccgtg ggagaatggc 78241 tgggcagagc agaggggcag cgatagactc tggggatgga gcaggacggg gacgggaggg 78301 gccggccacc tgtctgtaag gagactttgt tgcttcccct gcccccggaa tccacagtgc 78361 tctgcttctc tgtgtcgccc cgcccagccc cctggtgtgg agggaggggt ctcgtttgtg 78421 cgcgtgggtg tagctttgtg catcctctcc cagtggagcg atcacctgtg cctcccctcc 78481 ccctttgttt gcccctgtgt ggttggtcaa ggagggatgt gagggaaata gggacccccc 78541 gacttgccct cctgcctcag tctttccccc accctgtctc ttccttgtcc ttctctggaa 78601 aatgccaaaa tacacgatgt gaataaaagt acaacggcta aattgtgtcc tgtttgatac 78661 cttgggggag aggcttacct tcctggggtt agcaggaggg cgcttaagaa aactcctaac 78721 tctggccgcc tccctgccaa agtcaagtct ccacttttca ctggttctag agctctagga 78781 aaattggggt tgggtgggga ggtggagtag agtgactaaa tgccgacaca aagccaagga 78841 aagatggagt gaagaaccct tccctctctt tattcacaca ggagtggagg atttcccaaa 78901 tgtccctaac tggctagctg gcttcaggct gggactcagt ccctgcagtt cctgccaggc 78961 cttgccagcc ggggcgaggg ttgggatgat cctggcggcc tatgcctgtg tgggctgccc 79021 ctcccgctgt gaaccctgca tttgtcccgc aagttttcac tcaggtagac tccctgggta 79081 caagggtgcc tgctcagcag tcgggcatga gctgctccga tgggcgaagg aggttgtcta 79141 tcccacagtt ggagaggggc cctctctgcc ccagtgggcg atctgggcta cggccaagtt 79201 gccaccagct agttccgctt gaaaaccact tctggccccg tgggggactc aagtcgccaa 79261 gcgagggttc ccctgagcgc cggagctcac aggtctcgcc ttgtcccgaa agccccgcaa 79321 tcgaggcgga ggcgaccgag cccccgactc tcctagaacg ttgccacaag aagggggaac 79381 gtcggaacag tgcatcatcg ggcggcggcc ggggcggcgg caggagggcg ggcggggggc 79441 agggctccgg gggactgggc gggccatggc ggaggacggc gaggaggcgg agttccactt 79501 cgcggcgctc tatataagtg ggcagtggcc gcgactgcgc gcagacactg accttcagcg 79561 cctcggctcc agcgccatgg cgccctccag gaagttcttc gttgggggaa actggaagat 79621 gaacgggcgg aagcagagtc tgggggagct catcggcact ctgaacgcgg ccaaggtgcc 79681 ggccgacacc ggtaagccct cgccgaggag gggtctggcc gggccggggc cggggccggg 79741 gcaggagtgg cagcgccctc tcccgaggcc ccgaggcccc gaggccggta tccgcgcgga 79801 cctgatgcag ggctgtggga cgagggccgc tggggtccgg gcaggggcct cgcagccgca 79861 gccccgtcgg tgcgtcgagg gggcagggcg gagcacatga tgccccttgg actatggggc 79921 aggtaaggac gttttgggtc tcctggagga aggtggcccc ggggcgcgca ctggggctgt 79981 gcccgccagg cgacggggtt aggagcggag cccgaggctc tgcgggagac cgggggaggc 80041 tgggccgcgt gggcttcccg ctccctgcgc cctggcctcc cgcgccgtgc gccgccgcac 80101 gtagccccag actcctcccc ttcctcgccg gcgtccgcgt ccccgcgccg agctgctcgg 80161 gctccctgag ccccagatct gaccccttcc cttcggcaac ctgaacgact cccgccttcc 80221 acggaaggga ccgagcccgt gccaaacagg ctgagcgatt tgggagtgag gagccatcct 80281 accgctttcc ccaacctgga aacagcaaag cgcaaggcct ctgagtcagt taggtctctg 80341 ccacccacgg gcaaaggatg ctctcctcca tcctccttcc tccctccacc gaaatcggag 80401 agccgcgggc ctgatccaaa gaggcatccc cttctcgttc attccccaga ggcctcaata 80461 caaaccccag gagttggccc ctctcctttt gctacaaatc cttgccttgc aaaggggagg 80521 tgaggatggg ctattttaga agggaagcag ggttgctccc tggagaatgc tgagtctgtg 80581 aggtgcctat gccgagaata gctcgaggaa attggagccc cagctgttaa aagagcagag 80641 ggcagggtga gggccgtggc ctctcagggg tatctggaag gctcttcgag ttgagtgcag 80701 acccagcctg ggctggaaaa tggacaaagg tcatcttgct ggggtgaaaa gggggagagc 80761 agaaccaaga agaagagggt gagggctggg gggctccagg gcactggtta ggaattgtgg 80821 ggaatgaagg ctttctttag tctcatcccc ctgtggtacc atcttgtcct cagaggtggt 80881 ttgtgctccc cctactgcct atatcgactt cgcccggcag aagctagatc ccaagattgc 80941 tgtggctgcg cagaactgct acaaagtgac taatggggct tttactgggg agatcaggtg 81001 agatcgaggt ggagaggggt gtgtgggacc cttccctcac tttcctcgtt gaggggaaag 81061 ccacagggtg ggctccctgc tgaaccttgg cttcatctct tcctttagcc ctggcatgat 81121 caaagactgc ggagccacgt gggtggtcct ggggcactca gagagaaggc atgtctttgg 81181 ggagtcagat gaggttagta gccaagagag aagataaggg atgtcttttt ccaagaagga 81241 tgtctcacca agtctgtttc tcaacagctg attgggcaga aagtggccca tgctctggca 81301 gagggactcg gagtaatcgc ctgcattggg gagaagctag atgaaaggga agctggcatc 81361 actgagaagg ttgttttcga gcagacaaag gtcatcgcag gtatctctgg agaaagggac 81421 ctttgagcct atccagggcc acagagactc agagggtagg gtcaggccct ggagcctgtc 81481 ttggtcccca tgctgatcca gaaaaggaaa aaggggaggg ggagtgacaa tctttgcttg 81541 gggcctatga cttctccagc cccaaggtag atgccacctg gaaatccccc aatgtccact 81601 agggggcagt aggccaccgt tcttcgtact ccggagaacc tggctggaga gctctttctt 81661 gttcaccctt ccctccatct gtatctctgc cctgcagata acgtgaagga ctggagcaag 81721 gtcgtcctgg cctatgagcc tgtgtgggcc attggtactg gcaagactgc aacaccccaa 81781 caggtaaccg ggcccaggag ccctgccctc atcccagcct gcctcaatag gtttggacag 81841 acacagccca catggggcaa ccccttattt caaagacaca gagaccttga acccagagac 81901 agtgacttgt ccaagggcat ccagtccagg gcctggcttg gatcagagcc ctggtactct 81961 gactcagtca gaaaccacac taagtgtcca ctggtgccag tgatttttcc tcttagagag 82021 gcagaaaagg tcttacttag gccagcttct tgttctaggc ccaggaagta cacgagaagc 82081 tccgaggatg gctgaagtcc aacgtctctg atgcggtggc tcagagcacc cgtatcattt 82141 atggaggtga gtggctttgg ttcccggctg aggtggagtg ggctgaggac tagactgagc 82201 cctcggacat ggaggtgggg atggggcaga ctcatcccat tcttgaccaa gcccttgttc 82261 tgctcccttc ccaggctctg tgactggggc aacctgcaag gagctggcca gccagcctga 82321 tgtggatggc ttccttgtgg gtggtgcttc cctcaagccc gaattcgtgg acatcatcaa 82381 tgccaaacaa tgagccccat ccatcttccc tacccttcct gccaagccag ggactaagca 82441 gcccagaagc ccagtaactg ccctttccct gcatatgctt ctgatggtgt catctgctcc 82501 ttcctgtggc ctcatccaaa ctgtatcttc ctttactgtt tatatcttca ccctgtaatg 82561 gttgggacca ggccaatccc ttctccactt actataatgg ttggaactaa acgtcaccaa 82621 ggtggcttct ccttggctga gagatggaag gcgtggtggg atttgctcct gggttcccta 82681 ggccctagtg agggcagaag agaaaccatc ctctcccttc ttacaccgtg aggccaagat 82741 cccctcagaa ggcaggagtg ctgccctctc ccatggtgcc cgtgcctctg tgctgtgtat 82801 gtgaaccacc catgtgaggg aataaacctg gcactaggtc ttgtggtttg tctgccttca 82861 ctggacttgc ccagataatc ttcctttttg aggcagctat ataaatgatc atttgtgcaa 82921 gaaaaaaaaa aaaacaagaa caggtttcta taacaacatc tcttactatt tttacttgaa 82981 aaaatgtttt gcgtagcaga ctgtcatagc cttgaacgcc ggctcccttt cttcctccct 83041 ccaagtggct ctggggctgt tgatttccgc agagcttggg ttggggtagg ggctcagcct 83101 caccagcttt cagcagctgg tctaggccag cagtgcctcc ccacctcccc aaggggaggg 83161 gtggtggcaa gacctcagca cagtctgtgg tatcacaggg ctcactggta gagcaggtag 83221 cgcttcatgg cagggggcaa gggcagggca gacacctggc cgagccgggt atcccccagg 83281 ttgtggcgca cacacaggcg gctcaggtgc agaagggagt gtggctccgc tgggagagag 83341 aaggagggga atgtaagtat gggtgcagcc accagccaga tgtcctcaaa ctacggggtc 83401 ctactcagat gcctttctgc tttcctgctt cgagtgtgcc cacctggctg aaaggggaat 83461 ttgagatacc cggaagttct gcctcccaga taagatttca cacatcccta gtcagagctg 83521 ggggtgaaga gctggctaag gccctctaaa caacaggcca aggtggctct gacagtggtg 83581 gagctggccc aggctttgac tccagaggct tgggagctgg ggctgaggtg aggagggatg 83641 gccctccact ctacagccca cacaactgca gagagcagct ccaagccctg gacccagtca 83701 gttcctgggg aggctcctcc cctgctgccc caccctaagg ccctgcctcc tccactgctc 83761 tcctccctgg tgcccagggc cccagtgtct ccatccagag gtgtggctga ggaaggaagt 83821 aggtatgtgg cacagagaca ggttagagcc cagggaatcc ggtatacagc ctgggtacct 83881 ggctctgccc atccttcttt tggacctgta catcaaaccc agtacctaac gctttgcacc 83941 tcttgctcag gggttgatat ctcctgaatt ctctcatccc tgcagttcct ccctccattc 84001 tgaaggttca ccatttactg gccaagagaa cttgagaaag ttgctttctc aagcctgttt 84061 cacctctatt gctgttgggt taaattaaat aacagctgtg aagaccccta gcctttccca 84121 gcactgaaca gaggttgagg caggctggat gaaggtccat cccctctgct cttgtcagaa 84181 gagtttccat cccaaaccac tgccaccagg gacagaaagt tctccccacg tctgccccag 84241 gcctcacctc tcctttcgcc caggtagcgg atgcggacct ggcactggcc ccagacagcg 84301 cttactgccg gatagagggt cctgcccttc agtccgcgga atgctggccc caggtaggtg 84361 cccccaatag cgtagcccag agttccctcc tccatgtcca gaaccaccag cagtctctct 84421 ggcacctcca gctgctcacc ctgagttccc gctggatact ggggggctcc gggccccttg 84481 ctctgatggt acagcttccc ccgcccgatg tcccagcccc acgactcgct gttgctgccc 84541 agcagcgccg cgtagtggtc agtctgcagc ggggcgaggg ccgtggccac gcccaccacg 84601 gcatgcgtgc ccctctgctc taggggccag ctgatctccc aggcgtgcag gccccttgaa 84661 tagcccctct taccccgggc cccatcagtg ctctgggcca cgggccgccg ctcaaagtac 84721 aaccctcctt ccttgacctc gatgttctct gaacagtctt tggggttcca accgtggcgc 84781 cgctgggccc ccaggtcagg agggggtgca gacagcagct cttccaagcc ctcgggacag 84841 gagaggtcag ggtacagggc ctgtggcgtg ggggtgctgc tgctgccccc tgccagagct 84901 gtctggccca tggaggtgag gagctagagg actccccgaa agaggcttgc tggggcgtct 84961 ttctcttctg aaagttgagc tccagtttgg attgacctgg aagggagctg ggagaaaagg 85021 ggcgtgaaga gcagaatcct ggcgggggct cgtcacccag gtaccccatc ctttcaccca 85081 agtttgccaa cgggcccgct cctccctcct cgctgcccca aagctcctgg gcccttcatc 85141 tgccctcgcc ccatctgatc caggtcgcca gagactctcg cgagttcgcc ccatttgggc 85201 ctcagcatgg accatccccc ttactcgccc ggagccctcg ccgctgcccc cgaacctcgg 85261 ttgacttccc agccctggag gtcgccgggc ctctgaagga gccgggcgga aagaagccga 85321 ggcgcgcagg gcgcagaccg ccggccgctt ccgcccccag ctcctccctt gggccgagcc 85381 cgtcccgcgg ccccgcccct ccagggccct ctggggctta accctctcat cgccgcccac 85441 ggccgtgcag ctccgaggcc acgcccctgt ggggccccgc cccctcctgg gcagttaccg 85501 ctctataccg gcagcagccg cggagagcat gccccgcccc cgtggagccc accccgggcg 85561 gttaacctcg ggtctcagtc ccgggctgtg accctccccg aggccccgcc cccacggcga 85621 aggcccgggg cagttaaccc ttctcttgct gcggcagagt ccgcacccgg gcaggcccat 85681 ctcagaatta acgctttgat ggcatcaccg cgtcgggaat ccctggggat ggtgttctcc 85741 accgtcaaga cctttgagcc gcctgagcga ctaactcctg cgcccctgag ggtaaggagc 85801 atggggtcaa ggaggggctt cattgtgccg aatcccattt ctacttccct gtgattctgg 85861 gtcactgttt ctgggctggt ccctttctgc cccgggttgt gagtcagcct gtgccacctc 85921 tgaggctgtg aaatctgctt caggctctag gctgtacgca tcatagttct ccctgctggt 85981 ccagtcttca tactggatgc caatccatta ctgagccgtt gatctcattt ggtagagtgt 86041 cctgattaag acttcgagtt tgagtgaagc tatgccacta gttaactgcg cgatcttggg 86101 gaagttgccc gatttctctt tttttttttt ttgagacagg gtcttgctgt caccaaggct 86161 agagtgcagt ggcgccatca cagcccactg cagccttgac ctcctgggct caagtgatcc 86221 tcctacctca gcctcccaag tagctgggac tacaggcatg tgctaccaca cccagctaat 86281 ttttaaaaaa atatttttat ttttattttt atttttttcg agacagagtc ttgctcttgt 86341 tgcccaggct ggagtgcagc ggcacgatct cggctctctg taacctccat ctcccaggtt 86401 caggggattc tcctgcctca gcctcctgag tagctgggat tacaggcacc tgccaccacg 86461 cctagctatt ttttgtattt ttagtagaca cagggttttg ccatgttggc caggctggtc 86521 tagaactcct gaccttgtga tctgcctgcc ttagcctccc aaagtgctga gattacaggt 86581 gtgagccacc acgtccggcc aaaaaaaaat tttttttaga gacagggtct cccaaagtgc 86641 tgggatttca ggcatgagcc actacgccca gccccagatt tctgtagttt tctgttatgt 86701 aaaataggag caatgatggt agttacctca cagggttgtt gtgcagttga aataatccag 86761 gtaaagcatg tagcacagag ctgggtacat agtagtcact caccaagcat agctgtcaat 86821 gtgcttaatg agtgccgaca ctataaggag gcaatataat ataatggcaa acatgccagc 86881 tgtgggacca gactccccgg gcatgaatac ctgaaggcca gtgtgtgtat gaccttgggc 86941 aaattactga acctccctgt gcctgtttct tcctctgtaa gacagagatc acattagtat 87001 ctattgcatg gggttggtag tgagaatcca atgaggttat agatgtaaca catttggaag 87061 agtttctggt tcgcatttaa tgatttgtgc taagcactgg agattaagca gtgaatcctt 87121 aaggaggtta caaagacttt tataaaataa tgtatcatgt agtaatagta aataaaatta 87181 tggagatata aagaaagtac tataggagta aaaagaatag gactcctatc cggggttcaa 87241 gcagttagga aaggagcatc tccaaaagag gttcacacct gggctgaatc ttttcatgtc 87301 ctagaagagg ttcacacctt cacactgggc tgaatctttc tctttttttt tttgagacgg 87361 agtctaagct ctgatgccta ggctggagtg cagtggcgtg atctcagctc acctctgcct 87421 cctggggtca agtgattctc ctgcctcagc ctcccaagta gctgggacta caagcgtgca 87481 ccaccatgcc tggctaattt ttgtattttt agtagagaca gggtttcact atgttgccca 87541 ggctggcctc gaactcctga cctcgtgatc tgcccgcctc ggcctcccaa agtgctgggg 87601 ttaccggcat gagccaccgc gcccagcctg ggctgaatct aaaaggatga ccaggaattg 87661 gcctgagacg agccacttct ctgcagtatg gaggttgggt tacggttgct gcatgtagaa 87721 gatattccag acagaagagt ccgcatgaaa aaaagcaaga agcagcaggt gggtttgggg 87781 aagtacgata tcttcaaggt tgctacagcc agaaaggaag catgagaagg gccagataag 87841 tagacagcaa gtcacagagg ccttctaggc tgagttaagg aatgcaaatg ttttcccata 87901 ggacaggcca gtgatgggga gccactgtag gatagaagaa gagaggaaca tggtcagata 87961 ggcatttgat ttatttatat attttgagcc agggtctcac tttgtcaccc aggctggagt 88021 ggagtagcgt gatcacgact catggcagcc acaacctcct gagctccgct gatcctccta 88081 cctcagcctc ctgagtagct gggactacag gtgcataccg ccatgccgag ctaatttatt 88141 tttatttttt taatttacaa attattattt cttaattttt aaaatagaga cagagtctca 88201 gtctgttgcc caggctggtc tcgaactcct gagtcaagcg atcctcctgc ctcagcatcc 88261 caaagtactg ggattacagg cgtgagccac tgtgcctggc cagatctgca ttttagaaaa 88321 accaccctgg cagcaacata gaggagcttc aagggaaata gaactggaag ttggctgggc 88381 gcgatggctc acacctgtaa tcccagcact ttggaaggct gagatgggcg aatcacctga 88441 ggtttggagt tcgagaccag cctggccaac atggagaaac cctgtctcta ctaaaaatac 88501 aaaattagcc gggcatggtg gagcatgcct gtaatcccag ctactcagga ggttgaggca 88561 ggagaattgc ttgaacctgg gaggtggagg ttgcggtgag ccgagatcac gccattgcac 88621 tccagcctgg gcaacaagag tgaaactctg tctcaaaaaa aaaaaaaaaa aaggactgga 88681 agcaaagaca ccatttccag cagcccaggt gacaagtgtc aacagcctga aataggacag 88741 tagaaataag gatgaaaata attcaagaaa cactgatgaa acggcaagat ttggtgactg 88801 attagatatg gagttgaggc tgggcacagt ggctcacgcc tgtaatccca gcactttggg 88861 aggccgaggc aggcggatca cctgaggttg ggagtttgag accagcctgg ccaacatggt 88921 gaaaccctgt ctctactaaa aatacaaaaa ttagctgtgt gtggtggtgg gcacctgtaa 88981 tcccagctac ttcggagggt gaagcaggag aattgcttga accagggagg tggaggctgc 89041 agtgagccga gatggcgcta ccgcactcca gcctgggcga cagagcaaga ctccatctca 89101 aaaaaaaaaa aaaaaaaaaa agagaaaggg aagtttctag cctggaaaaa ttgattggaa 89161 ggtggtgcca ccaatcaaga tggagatgac agaaagagga gtaattgagg gcaggggcag 89221 gagggtggga gagggagatg gtgagttcag tttaggatat gtttttgagg tgtctgagcc 89281 acaagcaaat ggaaccatcc agaacacaca agaaaatggc aggtctggag gcagaatgag 89341 aggcctgggc tggagacagg gatttgggag gtgtgtcagt ctttagtcag gagttgaact 89401 atgagaatga gtgagtgaaa tatgggcgat ggggacaaag agagtgtgga gcaacagaag 89461 ggtcaaggag aatagaacaa ggtagaaaat tcagttttga ttacttacaa gattgttttg 89521 gtcagaggtc actggcaacc ttggtgagag acagcacatg tggggactgg taggtggaac 89581 tccaggctga aggggctgga ctgtcacaag ctttacattt tgaaggaata gaaaatgagg 89641 agaagtaaat tacaattatg tcacatagtt aaagttaccc ttggaaatcc cttagtaggc 89701 tgggtgcagt ggctcattcc tgtaatccca ccactttgag aggctgaagc aggagcttga 89761 gcccaggagt ttgagaccag cctggaagac tagtgagacc ccatctctac caaataataa 89821 taataaaata ggccggacgc ggtggctgac gcctgtaatc ccagcacttt gggaggccga 89881 ggcgtgtgga tcacctgagg tcaggagttc aagaccagcc tggccaacat ggtgaaatcc 89941 cgtctctact aaatatacaa aaaaaaaaaa attagccagg cgtggtggtg tgcacctgta 90001 atccctcagc aggagaattg tttgaaccca ggaggcggag gttgcagtga gtcaggatgg 90061 cgccactgca ctccaacatg ggcgacagag tgatactcca tctcagaaaa caaaacaaaa 90121 aaataaaata ggctgggcat ggtggctcat gcctgtaatc tcagcacttt gggaggccga 90181 ggcaggtgga tcacttgagg tcaggagttc gagaccagcc tggccaacat ggtgaaaccc 90241 cgtctctact aaaaatacaa aaattagctg ggcgtggtgg catgtgccta taatcccagc 90301 tacgcaggag gctgaggaag gagaatcgtg cctgtaatcc cagctactca ggaggctgag 90361 gcagaagaac ccgcaaggca gaggttgcag tgagccaaga tcctgccact gcactccagc 90421 ctggcgacag agcaagactc cgtctcaaaa aataaagtaa aataaaataa aaataattga 90481 ggaccacaag aactaggaaa aaaagaaaaa agaaaaaaga aaggccgagg caggcagatc 90541 acctgaggtc gggagtttga ggccagcctg accaacatgg agaaacctcg tctctactaa 90601 aaatacaaaa atcagctagg catggtggca catgcctgta atcccagcta ctcgggaggc 90661 tgaggcagga gaatcgcttg aacctgggag gcagaggttg tggtgagctg agattgagcc 90721 attgcactcc agcctggaca acaagagcga aactccatct caaaaaaaaa aaaaaaaaga 90781 aaaagaaaat ccctcagtta ctgcatctgt taggtgagca tatattcgtt tgtgtaaaaa 90841 tcactatgtg gctgtatata tatatatata tatatatata tatatatata tatatatata 90901 tatatatata aaatgatgta caaaatattt tataatgttt taaagaggaa gaagaaagca 90961 gcacactaag aagtaactca ggaatggaaa accaaacatc atatgttctc ataagtggga 91021 gctaagccat gagaatgcag aggcataaga atgacacaat ggaccttggg tactcagggg 91081 gaaagggtgg gaagaggtgg agggataaaa gactataaat agcgttcagt gtatactgct 91141 tgggtgatgg gtgcaccaaa atctcacaaa tcaccacgaa agaacttact catataacca 91201 aacaccacct gttccccaat aacctatgaa aataaaataa taataaaaag aagaagaaag 91261 catactaaaa taaagatatg aacctgaaac ttgtcatcca cccaacctgg atatgcttgc 91321 cagtatgtga catcaataat aacatcatga ctgtaagacc aggcacagtg gctcacacct 91381 gtaatcccag cactttggga ggccaaggtg ggccgatttg agatcagaag ttgaagacca 91441 gcctgggcag catgatgaaa ccccgcctct acaaaaaata caaaaaatta gccaggcttt 91501 ggtggcttgc gcctgccatc ccagctactc aagaggctga ggcaagagaa tcccttgagt 91561 ctagaattgg aggttgcggt gagccaagat caggccactg cactccagcc tgggtgacag 91621 agcgagaccc cgtctcaaaa aataaataac atcatgactg tgggtggcag cctatgatga 91681 atagtagtga aggacccatg tgataaccaa ggcctgggga cagagaacag tgcctgcccc 91741 atggctagct ctagctcagc ttgagaattt aatcacgcct gtgctgctga tgctgctgct 91801 gctgctgcta ctggttgttt aaaacatggt tgttggccgg gcacagtggc tcacgcctgt 91861 aatcccacca ctttgggagg ctgaggcagg cagatcatct gagataagga gttcgagacc 91921 agcctggcct acatggtgaa accccgtctc tactaaaaaa tacataaaaa ttagccaggc 91981 ctactggcac gtgcctgtag tcccagctac tcaggaagcc gaggcagaag aaccacttga 92041 acccaggagg cggaggttgc agtgagccga gattgcgcca ctgcactcca gtctgggcga 92101 cagagggaga ctttgtctca aaaaaaaaaa aaaaaggttg ttttaaacaa ttcatcagtt 92161 gatgccactg aatctggagc tggtcaaaca tttgttgcta aaaggtaaaa aatttcgtaa 92221 tgctacaagt aatttaaatt ttttacttaa aaattaaaaa aaaaaaaaaa ttggccgggc 92281 atggtggctc acacctgtaa tcccagcact ttgggaggcc gaggcgggtg gatcacaagg 92341 tcaagagatc gagaccatcc tggctttagt agagacagtg aaaccccgtc tctactaaaa 92401 atacaaaaaa aattagccag gcgtggtggt gggcgcctgt agtcccagct actcgggagg 92461 ctgaggcaag agaatggcgt gaaccctgga ggcagagctt gcagtgagcc aagatcacgc 92521 cactgcactc cagcctgggt gacagagcga cattccatct caaaaaaaaa aaaaaaatta 92581 tagatgaggt cttgctattt tgcccaggct ggactcaaat tcctgggttc aagtaatccc 92641 cccaacctca gcctcccaag tagctgagac tacaggcatg tgccacaatg cctttttact 92701 tattttatta ttttgttgag acagggtctc agtgtactcc aggctggagt gcagtggcac 92761 gaccatgact cactgcagcc tcaccccagc ctcctgagta gctaggacta caggcacatg 92821 ccaccaggcc tggctaatat ttaacttttc gtagagacag ggactctctc tatgttgccc 92881 aggttgatct caaactcctg tactcaagca attctcctgc ctcgtcctcc caaagtgctg 92941 ggattacagg catgagccac catgcctggc tgtttttaaa tttttatata tatatatttt 93001 gagacaaggt ctggctctgt tgcccaggct ggagtgcagt gcacgatctc ggctcactgc 93061 aacctctgcc ttctgggttc aagcaacttt cccgcctcag cctctggagt agctgggaca 93121 acaggcgccc cgccagcatg cctggataat ttttttgtat gcttttataa agacagggtt 93181 tcaccatgtt gcccaggttg gtctcgaact cctgggctca agcaatccac ccatcttggt 93241 ctctcaaaat attgtgatta caggggtgag ccactgcgat cggtcctttt aaaaaaaaag 93301 tcccctccca ccactgccgt ttttactttt tgattgagat gtagcatata taaagcacac 93361 taatcttaaa ttgcccagtg aacttttaca tacatataca cttgggtgac aaccaccaaa 93421 aagatcaaga taaagaatcc agctggctgg gcacggtggc tcacacctat aatcccagta 93481 ctttgggagg ctaaggcagg ggatcacttg agctcaagag ttctgagatg agcctgggca 93541 acatagggag acctcgtctc tactaaaaat tcaaaaaagt tagctgggtg tggtggtgca 93601 cgcctctagt cccagttact ctggtgggct gaggtcggag gactgcttga ggctgcagtg 93661 agccctgacc atgcccactg cactccagcc tgagtgacag agctggatgc tgtcttaaaa 93721 aaacaaaaac aaggccgggc gcggtggctc acgcctgtaa tcccagcact ttgggaggcc 93781 gaggcgggtg gatcacgagg tcaggagatc gagaccatcc tggctaacac ggtgaaaccc 93841 cgtctctact aaaaatacaa aaaattagcc gggcgtggtg gcgggcgcct gtagtcccag 93901 ctactctgga ggctgaggca ggagaatggc gtgaacccgg gaggcggagc ttgcagtgag 93961 ccgagatcgc gccattgcac tccagcctgg gtgacagagc aagactccat ctcaaaaaac 94021 aaacaaacaa acaaacaaac aaaaacacaa aaaaagaatt cagcatctgt cttgctcttt 94081 ccctgatgat acattcccaa gggtagctcc tattagaccc cctatcagta tagattagtg 94141 ttgtctgttc ttcaatttca tataaatgga atcataatgt caaatgcagt gtcacttgcc 94201 tctagtcaca gctacttgga aggctgagat gtgaagacta cttgagctca gaagttccac 94261 tgcactccag cctgggttac agagggaaac cttgtctcta aaaaaaagaa aaaagaaaga 94321 aagaaaaaga aaacatagta tgtattcttt gtgtctttcc agcattgtat ctgtaagtca 94381 tccatgttct tgcatgaaat agctcataat tttcattgtc atatagtatt ccatcatcta 94441 aatataccac aatatatcta ttgtactgtt gatggatttt ttttttaaga cagtcttgct 94501 ctctcgccca gacaggagtg cagtcgtgcg atctcgactc actgcaacct ccacctccca 94561 ggttcaagca attctcgtgc ttcaacctcc taagtagctg ggacttcagg catgtactac 94621 aacgcctgga taatttttgt atttttagta gagatggggt ttcaccatgt tggccaggct 94681 ggtctcaaac tcctgacctc aggtgatcca cctgccccag cctcccaaag tgctgggatt 94741 acagatgtga gccactgtac tgggcctgtt aatggatatt taaattgttt ccagttttcc 94801 accattataa agttgccatg tacattactg tacaagtctg ttaacaaaag accatgaggc 94861 ctgcagaaga aaaacaaagg agagctttat tttttggaaa aaaaaaaaaa attccaaatt 94921 ggggaatgct gctggtttac acactgttca ctggaataaa gtctcttccc tctgaattcc 94981 tttacagcct gtgatcccag tattttggga agctgagatg ggaagatcag ttgaggccag 95041 gagttgggga ctagcctagg tagtatatcg agacctcctt agccaataat tttttttttc 95101 tgttgcccag gctggagtgc aatggcacta tctcggctca ctgcaacctc tgcctcccgg 95161 gttcaagtga ttctcctgcc tcagcctccc gagtagctgg gattacaggc atgcgccacc 95221 atgcccagct aattttgtat ttttagtaca gacagggttt ctccatgttg gtcagactgg 95281 tctcgaactc ccgatctcag gtgattcacc cgccttggcc tcccaaagtg ctgggattat 95341 aggcatgagc cactgggccc ggccaataaa aaaatttttt aattagttgg gtgatggtgc 95401 acacctgtag tcctagtact caggatgcca aagtgggagg attgcttgag tccaggagtt 95461 caaggctgca ggggctgcga atgtgccact gcactacaaa ctgggcaaca gagtaagacc 95521 ctgtctttta aaacaaacaa actaataaac aaaacaaaac caaattcctt ttcagagaac 95581 ttttttcaca agtctttggt ggagctaggc atgcattttt cttgggtgca aaactatgtg 95641 cagaattgct gggtaataag ggattacaac atatagattt aaccacatgc aattttctga 95701 tttctttcca tagggacatt ttattcagaa attaaatcat tcagagttcc agcactgtcg 95761 ggggtcatcg ggctctgtcc accgccatag gcctagcatt gtcaccaacg gagccataga 95821 gggacctcgg cccaagctgg ggcctcttca ttgtcgaaga aggcctcttg ccagaccagg 95881 atctgtgggc gcggccaggc tggaggacta gggcggtggc agtggccagg tgagtgcaca 95941 tggctgtggg tggtccctca attgctccct cctctctttc gctaccccct gggctggagg 96001 gaggcagggc acccagccgg aatggcatga tcctgaagcc caacttccac aaggactggc 96061 agtggtgtgg ccatgtggct cagccagccc atggggcaga cctgcagaag caaggtggcc 96121 agcaaaagca ggctgcatgg cccactgata tgggcccatc cacccatagt aaggtgcccc 96181 atgctgaggc atcactacaa agcacaagct ggcaggggcc tcagcttgga ggagttaagg 96241 gtggctggca tttacaagaa ggtggcccag accattggca tctctgagga tgcaaggagg 96301 aggaaccagt ccacccaggc cctgcaggcc aaggtgcaga ggctgaagga ggaccgctcc 96361 tcactcatcc tcttccccag gaagcccttg gcccccaaga agggagacag ttctgctgaa 96421 gaactcgaac tggatactca gctgacagga ccagaaatgc ccatcggcaa tgtctacaag 96481 gagaaagcca gagtcatcgc tgactaggag gaaaacttcc aggccttgtt agtctctgta 96541 tggcccatgc aaatgcccag ctcttcggca tataggcaca aagagccaag gaagctgtag 96601 aagaggacgc ttaaaaaaaa aaaaaacggg taggctcgag ccgctgcatc agcttcgcgc 96661 tgggtctctc ggtcccgcag ccatgaggag gacgctgccc agactcacta cccgccctct 96721 cagtcccgca gccacgagga ggacgctgcc cagactcact acccgccggc tccctccccc 96781 gcgtccctgt ggtggtgggc gaagatgacc tgttgtgcat cagatgcctt tgagacatac 96841 aactggagat gtgagatagg aaattgaatg taaagactgg aagctcggca gagcagaact 96901 ctgaacaacg tgcctcagga gtgcaagttg ctgatgaagt gtgtcgcatt ttttatgaca 96961 tgagagttcg taaatgctcc acaccagaag aaatcaagaa aagaacaaag gctgtcattt 97021 ttttgtctca gtgcagacaa aaagtgcatc atgtagaagg caaagagatc ttggttggag 97081 atgttggtgt aactataagt gagcctttca agcattttgt gggaatgctt cctgaaaaag 97141 attgttgcta tgctttgtat gatgcaagct ttgaaacaaa agaatccaga agaattgatg 97201 ttttcttgtg ggcatcagaa ctagcacctt tgaaaagtaa aatgatctat acaagctcca 97261 aggatgcaat caaaaagaaa tttcaaggca taaaacatga atggcaaaca aacggaccag 97321 aagatctcaa tcgggcttgt actgctgaaa agttaggtgg atcctttttt tttttttttg 97381 agatggagtc ttgatctgtc gcccaggctg gagtgcagtg acaggatctc ggctcactgc 97441 aacctctgcc tcccgggttc aagtgatttt cctgcccagc ctcctgagta gctgggatta 97501 tagcgtgcgc caccacacct ggctaatttt tgtattctta gtagagtcag ggtttcacca 97561 tattggtcag gctggtctcg aactcctgac ctcgtgattt gccctccttg gcctcccaaa 97621 gtgctgggat tacaggcgtg agccagcatg cctggccagt ggatccttaa ttgtagcctt 97681 tgaaggatgc cctgtgtaga tcatcattca gtgccacaaa ttgaaagctt ccacgtttaa 97741 tgttatcctc ttgctatata aataaagcaa atatatttag gccagggtct cactgagggg 97801 agctgtcttg tcatctttta gagtaaacta aatactctat aaacatatgc aaacagccct 97861 aaatatatct gtaggggtgg gttgcccctc cacacctgtg ggtgtttctc gtaaggtggg 97921 acgagagact tgggaaagaa aaagacacaa agtatagaga aagaaataag gggacccggg 97981 gaagcagcgt tcagcatatg gaggatcctg ccagcctctg agttccctta gtatttatta 98041 atcatttgtg ggtgtttctc gaagaggggg atgtgtcagg gtcacaagac aattgtgggg 98101 agagggtcag cagacaaaca cgtgaacaaa ggtctttgca tcatagacaa tgtaaaggat 98161 taagtgctgt gcttttagat atgcatacac ataaacatct caatgcttta caaagcagta 98221 ttgctgcccg caggtcccac ctccagccct aaggcggttt ttccctatct cagtagatgg 98281 agcatacaat cgggttttat accaagacat tccattgccc agggacaggc aggtgacaga 98341 tgccttcctc ttgtctcaac tgcaagaggc attccttcct cttttactaa tcctcctcag 98401 cacagaacct ttacgggtgt cgggatgggg gatggtcagg tctttccctt cccacgaggc 98461 catatttcag actatcatat ggggagaaac cttggacaat acctggcttt cctaggcaga 98521 ggtccctgcg gacttccgca gtgtttgtgt ccctgggtac ttgagattag ggagtggtga 98581 tgactcttaa ggagcatgct gccttcaagc atctgtttaa caaagcacat cttgcaccgc 98641 ccttaatcca tttaactctg agttgacaca gcacatgttt cagagagcac ggggttgggg 98701 gtaaggttat agattaacag aatctcaagg cagaagaatt tttcttagta cagaacaaaa 98761 tggagtctcc tatgtctact tctttctaca cagacacagt aacaatctga tctcttttgc 98821 ttttccccac atatatctaa agtctaaagt tgttttttta aaaaaaaaaa agaaaagaaa 98881 agaaaagaaa aaagaaaata gccaggcgcg gtgctcacgc ctgtagttct agcactttgg 98941 gaggcgaaag tgggtggatt gtttgagctc aggagtccaa gaccagcctg ggcatcacgg 99001 tgaaacccca tctctacttt aaatatacaa aaaattagcc aggcatggcg gcgtgtgcct 99061 gtagtcctgg ctacgaggga ggctgaggca ggagaattgc ttgaccctgg gaggcagagg 99121 ttgcagtgag ccgagattgt gccactgaac tccagcctgg gccacagagc aagactctgt 99181 ctcaaaaaaa aaaaagaaaa gaaaagaaaa gaaaaaaaat aaagccctgt tgaggacttg 99241 taatatatca ggaataatgc aaaaaaaaaa ttattcagac attttgcagc atgaaacact 99301 tcagtaaatt tatgaagctt tagtttggtc ttttcctttt tctcctgagg gtgattttta 99361 tcagttcctc aagatttttt attagtgttt ctctgtggct ctcaaaaact tctttatttt 99421 tatgattatt tccttatatt ttagagactg ggtcttactc tgtcacacag gctggagtgc 99481 agtggcgaag tagctcactg tcaccttgag ccatgatcaa gggatcctcg gcttcagtcc 99541 ctcaaagtgc tgggattact ggcatgagcc accatacctg gccaggccaa gaagttcttt 99601 actttcttcc tcaatgatgt ataaacaaag ccatcacgat ccacttgcta ggcgacttgc 99661 acaatgagtt tcacttcttt gtcaatagaa acccttggcc aggcgcggtg ctcacgcctg 99721 taatcccagc actttaggag gccgaggtgg gtgtatcagt tgaggtcagg agttcaagac 99781 cagccaggcc aacatggtga aataatacta ataatgtaaa ataatatctc tactaataat 99841 acaaaaatta gccaggcatg gtggcacatg cctgtaatcc cagctacttg ggaaggtgag 99901 gcaggagaat tgcttgaaca tgggaggtgg aggttgcagt gagctgagat cacaccactg 99961 cattcccgtc tgggcaacag agcaagactc tgtctcaaaa cacaaaaaac aaaaaaaaaa 100021 aagagaaaga aacccttaaa attctgaaca cattccacat gcattgactg tcaggcttcg 100081 ttgcttccat tacctgtttg tttgtttgtt tgtttattta tttatttatt gagacagggt 100141 cttgctgtat cacccaggct ggagtgcagt agcatgagca tggctcactg cagcctcaac 100201 tttcccaggc acaggtgatc ctcctgcctg cacctcctga gtagctggga ctacaggtgc 100261 acaccaccac gttcagctaa ttttctgcag agatgaggtc ttgccatgtt gcccaggctg 100321 gtcttgaact cttggactca agagatctgc agcctcccaa agtgctggga ttacaggcgt 100381 gagacaccac acctggcctc ccattgcctg tttaaataga aatgatgcaa ctagcaatat 100441 ggaaggactt ccagatcttt accacattaa gattagggtc agcatccatg gctctgggag 100501 tcaagggaga atctcagcct aatgtatgtg gctttgaaac agcaaataat gcttagatca 100561 gcagagtgaa ggatggagtc atgttgggac aggaaacctt cattcagcgc ctttatgagc 100621 acattcataa agtgtgtggt acatcagtgt ggtattgttg tgtcctgcag cacaaagcca 100681 gattttttct ctttgaggtc ctgcttgact tctagaatga agatatttga aactgaatca 100741 gggaaagtac tacagtcacc caggactttg catttgattg ctagagcaca ggcaggtttt 100801 tgtttgtttg ttttttttga cagtctcgct ctgcagccgc ggctggagta aagtggctca 100861 atctctgctc actgcaaact ctccctccca ggttcaagca attctgtctc agcctcccga 100921 gtagctggga ttacaggggt gtgccaccat gcctggctaa tttttgtatt tttaatagag 100981 atggggtttc actgtgttgg ccaggctgtc ttgaactcct gacctcaagt gctctgccca 101041 tctcagcctc ccaaaggctg ggattacagg tgtgagccac cgtaccgccg ttttttgttt 101101 gtttgttttg tttttttttt taaagggcac caggatttgt agaatgacag ccctgtagaa 101161 ctacactgcc cctgcacttg ggggcaaggg caaagggtgg aggggaatag cattcagaag 101221 atcatgagtt gctttcaaac cagggccttg tgtttctagt ttggaaatat taaatgcagt 101281 tgggcatctt tccccagaag atagcaacca ctaacctcat ggtattctgg aagatggcct 101341 tttcaccagt aatttttctt ttttttgaga tggagtcttg cgctgttgtc caggctgcag 101401 tgcagtggca cgatcttgac tcactgcaac ctttgcctcc tgggttcaag caattcttct 101461 gcctcagcct ccgagtagcc gggattatag tgcccatcac catgcccggc taatttttgt 101521 attattagta gagacggcat ttcaccatgt tggccaggct ggtctcgaac tcctgacctc 101581 atgatccacc tgccttggcc tcccaaagtg ctgggattac aggcgtgagc caccgtgcct 101641 ggcctatgcc aataaacctt tctaatagct ctgagtgggt tgtgctgtct gttcctctac 101701 agacacatct caccaatagt ttgtgtgttt ttgaggttta aatgggcttc taaaatcatt 101761 atgccatcct tggctggttt tgaaatcttt ccatccaaag gctgacttct ctagtaggag 101821 gtttgaacta tgacttgaga tgaagagttt ttttcttctc tttttttgcc acgtcttgcc 101881 attgtaaggc acatatttct ttttggtttt ctttcttttt aatatataga ggcagggtct 101941 tgctctgttg tccaggctgg agtgcagtgg tgcaatcata tctcactgca gcctacaact 102001 cctggcctca agaggattag cttctggtct tgccctccca aagggctggg attataggcc 102061 tgagccactt gcacctggcc cagacacata tttctggttt ttgtctttcg gtaatgttta 102121 gtggcctttc tattttcttt tttctttttt tttgagacgg agtttcgctc ttgttgccca 102181 agctggagtg caatggtgca atctcggctc actgcaacct ccgcctcctg ggttcaagtg 102241 atactcttgc ctcagcctcc tgagtagctg ggattacagg cgtccgtcac cacgcccggc 102301 taatttttgc atttttagta gagacgggtt ttcgctgttt tagccaggat ggtctcgaat 102361 tcctgacctc aggtggacca cccacctcgg cctcccaaag tgctgggatt ccaggcgtga 102421 gccaccgcgc ccggcctgcc attttcatta accacatatc ttcgttgtca ctttagcagt 102481 tgtaggagaa tctgctgacc tcgcttcaca aattttttga atcttgattg aatgtacaga 102541 ttttttgagg ccgtattttt tttttttcca ctttggggat agacttgtat gctagcatgg 102601 aagggaattg tttcttcggt tgagtgccta aagtgcttga tggcctttca gatggatcat 102661 gtcatacaga aaggaagtat cttaaaacac tacaatctag tttaagacat aaaacagtag 102721 aatcgctgtc agcgtcacaa gatacaaacg aagggctcag aactgggacc aagagaacat 102781 agtctcccca tcttacactc gctcggaatg ggggctggcc ccactctgct ctatttaatc 102841 tgagcactgt tgaatgaatc tgcgatgaaa ctccacattt cctttaatcc tcacaacaaa 102901 gataaaccaa tttttagatt cgtaaactga ggcttagaga agttaaacgt attgtccgag 102961 actacagaac tggaaaatgg caaagtcaag atccaactcc gaagagcctc cccctgccca 103021 ctagcgggga ggaaatggca aatgacacag caaaaaatgt ttaaaaccag gattctcggt 103081 cgccgtcccg cccagcgcca acgctccgtc ccgcctcgca tcaacgcctc gctcctcccc 103141 gcccagcgcc aatgccccgc cccgcgcagc cgcccaggtg ccagttgcgt aaccacggca 103201 aagctcgcgg cagggggcgg ggcagacgct gactgtcgcc gggcggcggt cacgtgagcg 103261 ggagcgcagc aggccggcgg tccgagtgcc tcgcgcgcgc cccacagcgg ccgcacgcgt 103321 ctcgggctct cctcccagta gttcactcct tagtggctac agaaactgga gcccgccatc 103381 gggctttccc ctgaactgca gcttccgtcc ccgccaatct cttctccggc cccagcgctg 103441 actttcatgc gtggagcatt ctggcccgga acagagatcc cagggcggaa actagggccc 103501 gaggctgaaa gtccgttctt ggcggcgagg gccgccgcta aactggctgc tgagggcgtc 103561 tctagctcgg gttccagaag agagtggaat ggactgagtg cttgtgagaa acttcctaat 103621 tgaggcttac ctactaaagt tctttacgtt ttaaagggtt ccattgcatc agccctcaaa 103681 atgtttcctt ccccagccct ctttcttcta tcatggtggt tatgtctagg gactctccac 103741 tgggagacaa tcattttgtt ctccccgagg agggaggtca gaagtgggag acacccaggt 103801 tgtgttagaa actatggtag ctggggcgat ggctcatgcc tgtggtccca gctactcagg 103861 aggctagggc aggaggattg cttgagccct ggaggtcgag gctgcagtga gctatgatca 103921 cgcctgtgaa tactctggga ggatgggtga catagcaaga ccctttcaaa aagaggagaa 103981 agaaactgca gcgtaaagta gcacacctgt tacaaaggta aatgctcttt tgtctatcct 104041 tgctattttt aaaaatactc aatttttcct agttttcctt ttttgttttt tgagacagtc 104101 tcattcggac gcccaggctg gagtgcagtg gcttgaactt ggttcactgc aacttccact 104161 tcccaggttc aagcgattct gttgcctcag cctcccgagt agctgggctt acaggcatgc 104221 gccaccacgc ccagctcatt ttgtattttt agtagagacg ggatttcacc atgttggtca 104281 ggttggtctc gaactcctga cctcaggtga tccgcccgcc tccgcctccc aaagcgctgg 104341 gattaaatgc gtgagccacc gcgcctggcc ctagttttcc ttttgacatc cttccgagcc 104401 tctgtttctt gctagttaat cctaacaggg tatcatggag gcttgtcact tgttcacgac 104461 ttagcagtga ttctcaacgt ggagggaggt aaggggagtt tatctgtaag ggcaaatcca 104521 aattgaggga gggtgctctt gattactttt tttaaaaaag cacatttagg ccaggcatag 104581 tggctcactc ctgtaatacc agctaccctg gaggttgagg tgggaggatt gcttgagcct 104641 gagaggtcca ggctgcagtg agctgtgatc tcgctattac agttgagcct gggtgacaga 104701 gtgagaccct gtctcaaaaa gaaaaaaagt atctattgaa catttactgt gtgctagaca 104761 ctgtgccagg cagtagggtg cagagtgaaa ctgaagaact caagatctat tggaggccaa 104821 gtgcaagttg ctcatgcctg taatcctagc actttgggag gccaaggcag tcggatccct 104881 tgaggacagg agattgagac cagcctgggc aacataggga gacccccacc gctacaaaaa 104941 aactagccag gcgtgttggt gcatgcctgt agtcctagct acttgggagg ctcaggcagg 105001 aggatcactt gagtctagga agtcaaggct gcagtgagcc atgattgtgc cactgcactc 105061 cagcctgagt aacagagtga gaccctgttg caaaaaaaaa aaaaaaaaga aaaagaaaac 105121 aagaaaaagt tctgtttggc agttctgcaa ggatgaccca aagattttga tttgagcatc 105181 cgaaagtgat ataggtaaca ctaaagaaaa gcaggctggg cgcagtggct catgcctgta 105241 atcccagcac tttgggaggc tgaggtgggc agatcacctg aggtcaggag ttcaagacca 105301 gcctggccaa catagtgaaa cccggtctct actaaaaata caaaaattag ctgggtatgg 105361 tggcggtcac ctgtaatctc agctacttgg gaggctgagg cacgggaatc acttgaactg 105421 ggaggcagag gttgcagtga gccaagatca tgccactgca ctctagcctg ggcatcaaag 105481 tgagactaca tctcaataat aataatttga atgtgaaaaa tgggacacac acacatgcac 105541 acacaaacaa acatacatct ggttttgcaa actctatggc tagtagatat acaaagaaag 105601 ccttttgagt atcagtgatt tgcaaggagc agtgtatcaa gatactttgc attggccagg 105661 catggtggct cacgcctgta atcccagcac tttgggaggc caaagcaggc agaccacctg 105721 aggccagggg ttcaagacca gcctgaccaa catggtgaaa ccctgtctct acaaaaaata 105781 caaaaaatta gctgggtgtg gtggtgaacg cctgtaatcc cagctacttg gaaggttgag 105841 gcagaaaaat cgcttgaacc caggaggcag aggtttgcag taagccgaga ttgctccact 105901 gtactctagc ctaggtgaca gagcaagact ccctctcaaa aaaaaaaaaa aacaaaaaac 105961 acaaaaaaca aaaaacaaaa aaactttcta ctgagtcata aaaattctga ggaaatggta 106021 ttaaagggca tttcatatgg acatttatta tctcacataa aaataaatca ggaggcttct 106081 ggaaagtgga tggttccttg ctcttatgag acagcagtaa gaaaggactg tcagtggtcc 106141 cacttgagtc tctgatagct atgtgcagcc atcacatgac actgggaaga ctcagttaca 106201 ggagaagtca tcactgtgga ttgcagtcct ggaagacatg gtagagctga aggatcagcc 106261 aactttgaga ctcagctcta gctgcctgtc tccagtggcc taatggacat ctctatgagg 106321 atgtttgata gggtttggct ctgtgtccct acccaaattt gatctcaaat tgtagtccct 106381 gtaatcctca catccaggga gggacctggt gggaggtgac tggatcttgg gggcggtttc 106441 ccccatgctg ttgttgtgat agtgagtgag ttctcatgag agctgatggt tttctacgtg 106501 tttgacagtt cctccttcac actctctcgc ctgctaccat gtaagacatg gctgcttctc 106561 cttttgccat gattgcaaag ttcctgaggc ctccccagcc atgcagaact gtgactcaat 106621 taaacctctt ttctttataa attacccagt cttgggcagt tctttatagg gtatgaaaac 106681 agagtaatac agtgttccat taccactaac tagttggata tattcatctt ctccatgtcc 106741 tatgtccatt cagttgctga gtcttgttga gtcttcctct ccatccctac ctccttactc 106801 catacactta tcatcttcag cttcctaaac tttcaagatt cttctacctt cactgaatca 106861 tatgctagat atataccata tatatgcatt tgctatctat tttaagaatt taaagtaaca 106921 ttttagaatt tattgagttt ttctacttct ttctatccaa tagtctattc tattattaga 106981 aacaataagg tgggctgggt gcggtggctc atgcctgtag tcccagcact tcaggaggct 107041 gaagcagaca gatatcttga gcccaggaac tcaagaccag cctgggcaac atggcaaaac 107101 ctcgtctcta caaaacacac acagacacac acacacacaa attacccagg catggtggtg 107161 tgtgcttgta gtcccagctg cttgggaggc tgaggtagga ggattggttg agcttgggtt 107221 gttgaggctg cagtgagctg tgatcacacc actgcactcc agcttgagtg acagaccaag 107281 accctgtttc taaaggaaaa gaaggcctgg gtcgggcgca gtggctcacg gctataatcc 107341 cagcactttg gaaggctgag gcgggcgaat cacctgaggt caggagttcg agaccagcct 107401 ggccaacatg gtgaaacccc gtttctacta aaaatacaaa aattagctgg gtatggtggg 107461 agcacctgta atcccagcta ctcgggaggc tgaggcagga gaatcgcttg attctccgct 107521 gggaggcgga ggttgcagtg agccgagatc gcgccattgc actccagcct gggggacaaa 107581 agcgagactt cgtctcaaaa aaaaaaaaaa aaaaaaaaga aggccgggca tggtggttca 107641 tgcctgtaat cccagcattt tggaaggctg aggcaggagg atctcttgag cccaggaatt 107701 ggggagtagc ccaaggaaca tagtgagacc ctatctctac aaaacaattt taaaaattaa 107761 ttatccaggc atggtggcat gcccggtaac taggtgattc tagttacttg ggaggccaag 107821 gcattacttg agcccaggag tttgaggctg caatgagcca tgttcccaaa ctgcactcca 107881 gtttgagtga caaagcaaga ccctgtctca ggccaggggt ggtggctaac gcctgtaatc 107941 ccaacatttt aagaggctga ggcaggtgga ttgcttgagc ccaggagttt gagattagcc 108001 tggccaacat ggcaaaaccc caaatctact actactacta ctactactac tactactact 108061 actactacta ctaaaaaact ggccaaggtg gtgtgcgcct gtaatcccaa ctactcagga 108121 ggctgaggca cgagaatcgc ttgacgggag cagaggttgc agtgagccga gactgcacca 108181 ctgcactcca gcctgggtga cacagtgaga ctctgtctta aaaaacaaaa aagaccctgt 108241 ctcaaaaaaa aaaagaaata ataaggtggc aggatatatg atatatagag aaaaatcaat 108301 actaaaatca atagttatga cagtatgaca ttggagcagc agtagacata gagatcaatg 108361 gaagataata gggtccagga ataaaaccca tatatatgat aaagatggca tttcaagtca 108421 gtagagaaga atgggattaa tcaataaatg atgtagaaaa aaattggctc cccttttggc 108481 tcccctacct cacactatat accaacctat tctagatgta ataaagaccg aaaaatattt 108541 aaataccatc aaaatagtca agagaatatt ttcataacct taggggtgaa aaaggctttt 108601 ctgaaccaga cccacagttc agaagcctta tgcaagagac tggcagttgg gactacacac 108661 aaatgtcaaa tttctgtaag gtgacactct ataagcaaag tgaaatgaca aagtaggaaa 108721 atattttcaa aatgtataag agacaagaag ttaatgtgtc tgtgtggtgg tttaaaaata 108781 tgtccatgaa ttctgtgata ctcttccttt taagaggtga gccttattcc cctccacttg 108841 agtgtgggct ggacttagtg actggcgtct aaagaatagc ataaagtgga agtgatggtg 108901 tgtgactttg gagactcagt taaaggcaga gtggctttca tcttggtctc tttctctctt 108961 ggatcacttg ctctggggga agccagctgc catgttatga gggcactcag gccctgtgga 109021 gaagtccaga tggtaaggaa ctgtcttgcc aacaaccatg tgagtaagcc atcttggaag 109081 caaatccttc agccccagcc aagccttcag ataacggcag ctccagccag ccgtttgact 109141 ggaacccacc tgggccactc ttggattcct gagaaactga gataatacat tttgctgttt 109201 taagctgcta cgttttgaga tgatttgtta tacagctata atcagttaac agttgcactt 109261 ggctggctgg gcatggtggc tcatgcctgt aaccccagca ctttgggagg ccaaggcagg 109321 agaatcgctt gagcccagga gttccagacc agtctctgca acatagtgag actccatctt 109381 tacaaaaaaa aagttaaaaa ttagctgggc atggtggggc gcacctgcag tcccagctac 109441 ttaggaggct gaagtgggag ggtggcacgt gcttgtaatt ccagctactc gggaggctaa 109501 ggcactagaa tcgcttgaac ccaggaagca gaggttgcag tgagccgaga tcttgccatt 109561 tcactccagc ctgggtgaca gagcaagact ccatctcaaa aaaaaaaaaa aaaatcaccc 109621 tgttaagtat aagctctgac aaatgcatag tcctatcata atcaagacat agaacaatcg 109681 tagaatcact ctaacagttc cctcaagatc aagatcaaga agtcaacccc ttcttgatcc 109741 caaggaactc attgacctgt ttttctgtca aaagggtcaa atttaatcac taaaatgatc 109801 tacagtacta tggaaaaatt tgtaagcata gaaaacacac aaaccaactc tttttaaaaa 109861 gtaagatatt ttaaaagttt accataccaa cagaatgaag atagcttcac attttactac 109921 agatgtgcag atattcatca aagtccacca ttgtcatata tatggctcaa cagtaatatt 109981 taagaatata ctggccaggg ccgggagtgg tggctcacgc ctgtaatccc agcactttgg 110041 gaggctgagg cgggcggatc atgaggtcag gagattgaga ccatcctggc caacatggtg 110101 aaatcccatc tctactaaaa atagaaaaat tagctgggtg tggtggtgcg tgcctgtaat 110161 cccagctact caggaggcgg aggcaggaga atcacttgaa ccagggagtc ggaggttgtg 110221 gtgagccgag atcactccac tgcactccag cctggtgaca gagcgagact ctgtctcaaa 110281 aagaaaaaga aaaaatactg gccaggcacg gtggctcatg cctgtaatcc cagcactttg 110341 ggaggctgag gcgggtggat cacctgaggt caggagttcg agaccagcct ggccaacatg 110401 gtgaaactcc atctctacta aaaatacaaa aatcaaccag gtgtggcgtc acacacctct 110461 aatcccaact actcgggagg ctgaggcagg agaatcattt gaacccagga ggcggaggtt 110521 gcagtgagct gagattgtgc cattgcactc cagcctgggt gacaagagag aaactctgtc 110581 tcaaaaaaaa aaaaaaaaaa tactgtaaaa taacaatctt tagaagagtt agaataatgt 110641 tatttttgtt tgtttttttc gagacagggt cttgctctgt catccaggct ggagtgcagt 110701 gacgtaatct tggctcactg caacccctgt ctctcaggtt caagagattc ttgtacatta 110761 gccacccaag tagctgggat tacaggtgtg tgccaccatg ccagctaatt tatttgtatt 110821 tatttattta tttatttatt tattgagaca gagtcttact ctgtcaccca ggctggagtg 110881 cagtggcgtg atctcagctt actgcaatct ctgcctcctg ggttcaagcg attatcctgc 110941 ctcagcctcc cagtagctgg aattacaggc atgtgccacc accacacccg gctatttttt 111001 ttttttgggg gggggggtag atatggggtt ttgccatgtt ggccaggctg gtctcgaact 111061 cctgacctcg tgatctgccc gcctcagcct ccccaagtgc tgggattact gtgtgagtca 111121 ctgtgcctgg ccaatttttg tattttagta gagaaggggc ttcaccattt tggccaggct 111181 ggtctcaaac tcctaaactc agatgatcag cccgcctcag ccttacaaag tgctgggatt 111241 acaggcgtga gccactgtac ccagccctca aaagctattt tttgacatat taagattggc 111301 aaagtacgta ccatttgacc cagaaattct accttctaag aatgtacgcc gggcacggtg 111361 gctcctgcct gtctgtaatc ccagcacttt gggaagctga ggcgggcaga tggcttcagt 111421 ccaggagttc aagactagcc tgggcaatat agagaaatac ccgtgtctac aaaaattgca 111481 aaaaattagc taggcatggt gacatgcgcc tgtagtccca gctacctggg aggctgagtt 111541 aggagaatca cctgagctta ggatgttggg gctgcagtga gccatgatga ttgcaccact 111601 gcactccagc ctgggaaaga aaaaaaaaag tttcccacat agtcatatta ctcaatgcgt 111661 agtccatgaa tcaagagcat cagcatcact tgagaggtct acatttcaat gagatcctca 111721 agcaatttct atacacgtta acatttgagg aatactgcta cagagtactc atgcagaaga 111781 gcaaagatat atctatgggg atttaaacag cgttatttgt aatatggaca atttttaaaa 111841 aaaagttaat atccatcaat agggaacagt taaaagtcta gccaggcttg gtggcacaca 111901 gctgtagtcc cagctacttg agatgctgag gcaggtcgat ggcttgagcc caggagtttg 111961 aggctgtagt gcacaatgat tgtgggcagg aaacaggacc tcgactttgg accagattca 112021 agactggctg aaacagggaa gaggcacaga aagcacttct ccgtaagaca cgcccaccag 112081 cacgacagtt taccattgcc atggcgacac ccagaagtta ctgccttttt ccataacaac 112141 ctggaagtta ccaccccttt tctggaaatt tctgaataac ccactcttta atttgcatgt 112201 aattaaaagt caatataaat gtgactgcag aagtttccct gagttgctgc tctccacaca 112261 ctgcctagag ggtggccttg ctcggtagga gcagtcatgg ggctgtaaaa ctgcatcctc 112321 aataaagctg tatcttctac cacatgctca ggaaagaacc atgagcctta ttgggctaat 112381 cgccaatttg ggtgttcacc tgccctgcat cagttgtgcc tgtgaaaaac tactgcactc 112441 cagctgtgac cggttgaggg taattgtcca ggttcttggt gttttgcaca aagaattcga 112501 caaaacacac aaacaaagca aggcaatgaa agcagagatt ttttttgttt gtttttttaa 112561 ttaccgtgcc cgcgagacgg agtcttgctg tgtcgcccag gctggagtgc agtggcacga 112621 tctcggctcc tgggttcaag caattctcct gcctcagcct cctgagtagc tgggattaca 112681 ggcttgcacc accacgcccc gctaattttt gtattcttag tagagatggg gtttcaccat 112741 gttggccagg ctgatctcca actcttgacc tcttgatcca ccaaaagcag agatttattt 112801 taaaggaaag tacagtccac agagtgggag caagctccag caagtggctc aagagcattg 112861 gtgacagaat tttctggggt ttaaatacaa tctagaggtt tcccattggt tcaccctatg 112921 caaatgaagt agtggccgac tgccagtcta attgtgggag gggaccaatc aaaatgaaga 112981 gagtaggcct gctatcagtc tgactggttg tgagagggga ccaatcagac gtactttcat 113041 ttttcaaatg ccatacagag aaaggagggt ttgaaaaggg agtactctct gatattcagt 113101 cagcatgatt cggccttaga ttccctgcct ccggacccta ttctcctgcc tctcagcctg 113161 gacaatgtag tgagactctg tctctagaaa aacacaagaa agttccgggc gcggtggctc 113221 acgcctgtaa tcccagcact ttgggaggcc gaggagggca gatcacgagg tcaggagatc 113281 gagacgatcc tggctaacac agtgaaaccc cgtctctact aaaaatacaa aaaattagcc 113341 aggtgtggtg gtgggcgcct gtagtcccag ctactctgga ggctgaggca ggagaatggc 113401 gtgaacccgg gaggtagagc ttgcagtgag ccgagatcac gccactgcac tccagcctgg 113461 gagagagagc cagactccat ctcaaaaaaa aaaaaaacaa caccaaaaaa aaaaaaaaaa 113521 aaaggaaaaa cgcaagaaag aaaggaaagt ggtagtatat ccacataatg gaagactaga 113581 caactatgag aaaaagaggt agatttatgc ttagtgacat gaaagaagcc caagacatat 113641 tagaattgaa taattagaag aaaaaaaccc taaactaatc taaaaatctt tttttctttt 113701 tttttgagac cgagtctcac tctgttgccc aggctggagt gcagtggtgc gatgtcagca 113761 cactgcaatc tccgcctccc aggttcaagc gattctcctg cctccgcctc cggagtagct 113821 gggataacaa acgcccgcca ccacgcccgg cgaatttttg tattttcagt agagacaggg 113881 tttcgccatg ttggccaggc tggtgttaaa ctcctgacct caggtgatcc gcccaccttg 113941 gcctcccaga gtgctgagat gataggtgtg agccaaagcg cctggctttt tttttttttt 114001 agacagagtc tcgatctgtt gcccaggctg gagtgcagta gcgtgatctc ggctcactgc 114061 aacctctgcc tcccgggttc aagtgattct cgtgtctcag cctcccaagt agctgaaact 114121 acaggcacaa gccacaacgc ctggctaatt ttttttcttt ttttgagacg gagtttcact 114181 cttgttgccc aggctggagt gcagtggcgc gatctcagct cactgcaacc taggcctcct 114241 gtattaaagc gattctgctg tctcagcctc ccgagtagct gggattacag gtgcgagcca 114301 ccaccccagc taattttttg tatgtagaga cggggtttca ccatattggc cagactggtc 114361 tcaaactcct gacctcaggt gatccacccg cctaggcctc ccaaaatgct gggattacag 114421 gcgtgagcca ccacgcccag cctaaatatt acttttataa ttgaaagaga ctctatctag 114481 accagccgat gggggaaaaa tatcaaggat gttgtggagt atctagctct cattacacag 114541 tgtaacttca gtccttgcca actaaacacg aaataaaacc ttcaatagct ccccacagct 114601 ttcaggataa agtccagatc cttcagtgtg atccaagtct gtttactgct tctaccatat 114661 tccaaccccc gactgcacct actgttccag ccagcagaac cacctgcttt ctttgcacag 114721 gcattgagct ctttcttctc caagcattac cacaggtgat tcctgcctct gaaatctctg 114781 ccccgaatat atgatatcat gtatatgcat atactataca atgatcccta gcacttagta 114841 aaagctctat aaatgttggg tatgattatt ataacatgtc tcattatatt gaattacttg 114901 tgtacttaat cttgtcactg aactgtacac ctcttgaaga taaaagataa ggacttctgt 114961 gctttactgt cccggagcac agggacagat taagtgctta acaaaagcat ggcagggcgc 115021 agtggctcat gcctataatc ccagcacttt ggaaggccga ggcgggcgga ttgcttgagc 115081 ccaggagttt gagactagcc tgggcaacat agcaaaacct tgtctctaca aaaaatacaa 115141 aaaaattaac ttggcatggt ggcgcacacc tgtagtccca ggtacccagg aggctgagac 115201 ggaaggactg cttgagccca ggaggtggac gttgcagtga gctatgattg ctccaccgca 115261 ctccatcctg ggcgacagag ggaaactcta tttcaaaaca aaaataaaaa caaaataaca 115321 aaagtgtatg ttcaaggaca ccttcctaac ccttacatca caacttttct tcccatctaa 115381 tgtgtcttcc tctttcttct tggccaagga aactcatatc catttctttt gactacatca 115441 aaccacatgg atttttccct tctccacact ccaacaatac aataactttc atttgtatag 115501 cacatctcac tcttcagagt gttcatattt attatttctt tttcctcaca gcaaccccag 115561 taccaaggac acaggccagg ggtggggagg agcaggaaaa tgtttactaa gggtcaacta 115621 agtgcaaggc gatttaccca cattatctct tttaatccct taacaatcct tacaggtgga 115681 cgttttaata atccccattt cacgactgtg gaaattagac ctcagtaaga tttcataacc 115741 tgctgaagga cacacaggta ttaagtagta gagatgggtt ttgaagctgg tgtctctgac 115801 tttaaagtcc ctactggttc catgacacca agtaaacaga tttgcctatt aaccacattc 115861 ctccaagctg ccctgggcag gaggtaatat aatacttcct gatttcttgc ccaccacatc 115921 ttccaccaga ccactttgcc gttatataaa gtctgtgggc acaatttaac aatgaatcac 115981 atgccatctt acctggttac ctcattgttt catttacatt agattccaca acaagattgg 116041 taacatcccc ctttccccag gcaggaattg tgtcatctta acctccctga gcctcagtat 116101 cctcttcttt caaatgtcta tcttacagga ctgttgcaga gattaaatat cagttgtaaa 116161 gatgtttgac ataatgtctc gcacaaactc tccagaaata tttattgcat ctcaacatct 116221 ttttcatctc cttcaacatg tcagacaggc ctgagtgcct gggtttggct atgccagttc 116281 ctgttgatta ctgatataac aacgttggat caggagtatt ttagtgggca gatgttgaat 116341 tttctgcctt ccacccacta aaaggtttgc atcaaaccag tccaggataa ctcatactgg 116401 atccctgtca gttgatatag ttaagctaat tctacagata acaaagagtc ttgggtacgt 116461 ttagctagca aatgcagatc tgccaagact ccaacccagt tcttaaggta tctgggtctg 116521 gactccttcc gctccaccaa gccaccttcc aaaggtctac ctgaaaataa ctaatgagca 116581 gaaactatag ggggaagggg aaaagacccc ttatcttggt gtttcttgct tttcctcgca 116641 gccccgccct ttttgcttcg aaggacgagt ttttcctgaa ttcacaatac ttttgcattg 116701 aagatgtccg gtccctttaa ggtcttgggt gacgtaatga ctagccccgc cccctctctt 116761 ctcgcttctc agccctgccc acttcttttt gtcctcagcc aattagcggc ccaacgcccc 116821 ttccgcgcct catccgggtt cccgggacgg gaaagtggcg tggtaaccag gcaactactg 116881 atcaatcccc tccccggctg atttgcgcat caggtagtcg ccgctctact ctcaacggtg 116941 gcgagctgca gttgccaagt gccccgtccc cgttgccata acagcagtac acaacccctc 117001 cttcccctcg cccgcttgct aaacagtcct tccctctcgg gaccaagggc ctctccagaa 117061 ctgcttctga ttgagcagaa acagggtctg agcccctttg ctaaggcgaa ttgagtggga 117121 gggttggaga ctgcgcttta ggctggtgta ccagagggcc caaggaaatt ggcccagtgg 117181 ggagggaaaa gtagaagaaa tgaatggagc agggattacg gatggcgcaa tgggagctga 117241 gaggagagga ggaggaattg gtagggaaga ttggagagca tggcttaagg gggaggtggg 117301 aagaatgggg gagagggaag cccaggaaat agggacaccc ggctgctgga ggagagctgg 117361 agtcactgtg aaggacagga aaggcctggg ggtgtgggtg acatcctgaa gggagggggt 117421 cctcagtgga aagtgtgaag tgggagttga gaatcaaaat gagatgttca cagggataag 117481 gaaagaagcc tagagttggt gggtcagaag tcatgggggt gatgggagtg gcacgtgtca 117541 atgacgatga aggagctgaa ggctggcaga agctgatgag accttctttt tcaggaggag 117601 gactgagctt atctgactcc agagctttca ggagggaaga aagatgtcag atgaagatga 117661 tctagaagac tctgagccag accaggatga ttctgagaaa gaagaggacg agaaggagac 117721 agaggagggg gaggactaca gaaaagaggg ggaagagttc cctgaggaag tgagagcctg 117781 gacaaggagg ggtcctaggg ctgaaggagg gggttgggca ggggagagac accttactcc 117841 acttctacct gcagtggctg cccacccccc tcacggagga catgatgaag gaagggcttt 117901 ctctgctctg taagacaggc aatgggctgg ctcatgctta tgtcaagctg gaggttaaag 117961 agaggtgcgt tttgggggga ccagatgagg actgtgagac tgtagggccc ctatctcctt 118021 tcccactgtc attcctgggt gttctggttg agagtttgac tcttctcatg cctagtggaa 118081 agggaaggac agaatttact tttcaggagc tctgagactt ctcctatcaa agctgacctg 118141 gggaattgga gattgagagt atatgggggg attcctgctg gattccctta tttgtctttc 118201 agagtatggg ttctctgtcc ttcacagaca caggtctctc tgcccactgt taagaaccca 118261 cctaccttct cttacccttc ttctccccta ccatctgttg cccattcttc tccccattat 118321 aagccatctc catattgcct tctgctttag caactctgcc agtcctcact gggtgcttaa 118381 ccctaaacct ggtttcttct ccctcttcca tctcctaggg acctgacaga catctacttg 118441 ctgcgctcct acatccatct gcgctatgtg gatatttctg agaaccacct gacagacctg 118501 tctccactca actacctcac ccacctgctc tggctcaagg ctgatggcaa tcggctgcga 118561 agtgcccaga tgaatgaact gccctacctg cagattgcta gttttgctta taaccagatt 118621 actgacactg aaggcatctc tcatcctcgt cttgaaaccc tgaatctcaa aggtgggtct 118681 ttaggatggg ctacacaaga ttctttcctt cctagcctga ggccaacaga agtgggcagt 118741 atggtccttg gctagggcac atggcagtct gcattcattc attcattcag atacgcaata 118801 tgattcatta tgacagttct acatgctaac catataggat ccaataagac aaggaacgtc 118861 cctattctca tatagcttat attctaattg ggagagacaa acaagtaaca acaaacacag 118921 taagtacatt catgtagcat gttaaaagga gacaagtgcc ttggaaaaaa tagatcagag 118981 taagaggaat gggggacaca ggtggcaatg tcaaatagat tggttaggac aaccatcact 119041 gagaagatga catttgagca aagaattgaa gaaggagagg gaattagaca tgtcagctgc 119101 ttcattcact cttcaaaggc ttaggtggat atgtccatca ggaccttgga ctacctacca 119161 caccacttgc ctcagggctc cagaggtatt aagtctggag ctggccttcc tcagtatcta 119221 ccctgctttg cgaatggtag gatgatttag aggctcccca aattcaggct ctaatgctga 119281 gcatttggag tggccctttg agctcttgaa accctcctcc ccagggaaca gcatccacat 119341 ggtgacaggt ctggaccccg agaagttgat cagcctgcac acagtggagc ttcgggggaa 119401 ccagctggaa agcaccctgg gaatcaatct tcctaagctg aagaacctct acctggtagc 119461 tcactgggtc agagggtggt gcagggaaga gggcactgtc ctgggggtca ggatgcctgc 119521 tttctagtag gctcagctac taacttcatc attatgataa taactggtat tattatcaag 119581 accatagttg gttccagatg ctgtggagat acacagccac ctagcagggc tgataatata 119641 taatgtgtgg ataatacaga gataatttac cttgatacag gtagtagcaa aatggaccat 119701 aattgaggta caaacatcag tctgtggaaa tgttttagga aaaccctcct ctcagactgg 119761 gtttttcttc agctcttaca ctaccaccac aacaatcatc aacatacgac ttccgggacc 119821 aaaggtgtgg gggtttctcc ccatacacca agtagtagac accagctggg tgatctctaa 119881 ttcgattcac tgtctacctg gggatggcat caaatcccac agattgaggg ctcggttccc 119941 aagagtgacc cctacaccgc ccacatacac cagttgcaag tccaagcctc cagaacttct 120001 gacacaccag cttcaacatg aagctggttc ccatgacctc ttctctggat ttaattaatt 120061 ttctagagtg gtttttatta gtttattgca agggcaaagg atacagatga agagatgctt 120121 agggcgaagt atggggatat gggtgcagtg ctccatgccc ttcccagatg catcaccctc 120181 caaaagcttt cacgtgttca gctacccaga agctctctgg acccagtcct cttggctttt 120241 catggaagct tcatgatctt agtattcctt cccccaggaa actggggagg ctgtcgctgg 120301 ggagggtctt aagaaccaca atcaggccgg gcgcggtggc tcatgtctgt aatcccagca 120361 ctttgggagg ctgaggcggg cggatcacct gaggtccgga gttcgaaacc agcctggcca 120421 acatggcgaa actctgtctc tactaaaaat acaaaaatta gcccagtgtg ctgacatgcg 120481 cctgtaatcc cagctactag ggaggctgag gcagcagaat ggcttgaacc cagaaggcag 120541 agtttgcagt gagctgagat cacgccactg caatccagcc tgggagacag agcaagactc 120601 catctcaaaa aaaaaaaaaa aaaaaaaaaa ccaaagaaaa acacggtgaa accccgtctc 120661 tactaaaaat acaaaaaaaa aaaattagct gcgtgtggtg gcaggcacct gtagtcccag 120721 ctactcggga ggctgaggca ggagaatggc gtgaacccag gaggcggggc ttgcagtgag 120781 ccgagattgc gccactgcac tccagcctgg gcaacagagc tagactctgt ttcaaaaaaa 120841 aaaaaagaaa gaatcacagt cagaaagagg gagtgggggg aaagattagc gctttctctt 120901 ggagcaggtg aaaagacggc tggagaagtt cctgagacct gcccctgagg cccagcacac 120961 ccaactttat aacaaaagac tgcaagaagg gctattatgg gggttataag ctagggactg 121021 tggatgaaaa ccgatatata tattatatat tatataatta tatattatat ataaatatat 121081 attatatatt atataattat atattatata taatatataa ttacatataa tatataatat 121141 ataattatat attatatatt atatattata tataaaatat ataatatata attatatatt 121201 atatataata tataattata tataatatat ataattagat ataatatata attatatatt 121261 atatataatt atatataata tattatatat aattatatat aatacataaa tatatataat 121321 agtatatatt atatataata tatataatat ataaatatat ataatagtat attatatata 121381 atatatttta tataatatat aaatattata tataatagta tattatatat aatatataaa 121441 tattatatat aatagtatat aatatataat atataatata tattttatat attatataat 121501 atattatata ttatatatat aataactcca caggaagcca ctgaaatatt tagtctttta 121561 gactcttaag cttcgttatt ctgggatggt gctgatgcca atcctcagtg gtggcctggg 121621 ggctcctggg ccgggtaggg gtgggttgaa gcatctcctc catcccaagt ggctttgatg 121681 gggcgaatgg tagagggtta ctgctgacct ttctctggga tggcagcttc tcctgaactt 121741 ctttactttc tatactgatc tctactcacc cccactcctt tcttccactc ctccctctct 121801 acctcttgga tttcctcttt tgctttatct cattctgact tctctttctt ccctatccct 121861 gctccctctc aatcaatcca ctgtgccctg ggggtctagg cccaaaacat gctgaagaag 121921 gtggaaggct tggaggatct gagcaatctc accaccttgc atcttcgaga caaccagatt 121981 gacaccctga gtggcttctc cagagaaatg aaatcattgc agtacctcaa cctgaggtat 122041 gcaccctctc caagccccac cttgccccta cccctgacca ggtgcagctt ttgagtctgt 122101 gttcttccct ggcccaatcc cccagtccta gcccacagct ggattctcaa ggaaagcccg 122161 gctttccttc ctttcttctg cactgagttg caacactaaa tcccttatag cttgtgaccc 122221 ctgaattttt attcattcaa tccacaatta ttgagaatgt gctatgtgcc aggccttgtg 122281 ctgtgttttg gggacattta aagcaatcag aggctgggca tggtggttca tgcctgtact 122341 cccagcactt taggaggaag gatcacttga ggccaggagt tcaggaccag cctgggcaac 122401 ctagtgagac catgtttcta cagaagtgaa aataatggcc cggcatggtg gctcacacct 122461 gtagttctag cactttggga agccgaagtg ggcagatcgt tggagcctag gagttcaaga 122521 ccagtctggg caacatggca aaatcccatc tctaccaaaa atacaaaaat tagccaggca 122581 tggtgacagg cacctgtaat cccagctact caggagactt aggcgggagg atcgctggag 122641 cctcgaggtg gagtttgcag tgagctgaga taaagccgct gcactccagc ctgggtgaca 122701 gagccagacc ctgcctcaaa aacaaaacaa aaataaaata ttaaataaaa attaaaatat 122761 taaattaaaa ataaaaatta actgagcctg gtggtgcaca cctgtggttt cagctactca 122821 ggaggctgag gttggaggat cacttgagcc caggagttca aggttgcagt gagccatgat 122881 tgtgccactg cactccagcc tgggtgacag agtgagctgt ctctaaaaga aagaaaggaa 122941 tcagaggtga ttcctgttcc taaggggctc acagtctggc tccaactctt ttttcctcac 123001 cctatcctat ataagtttcc actgcccttc tccatttcct cggtcccaat tcaccttccc 123061 atttcctgcc tctcttctcc tgagccccac ctgtaaggga tgaccttttt actgattaaa 123121 aaaacacaag atgtgctcag atttgaggag agaaggacat ttgtaagaaa atcttttggt 123181 agaaaggcac tgaagatcag caactcttgt cggatggagg actttatgtg tgtttatttc 123241 agcaaacatg taaggaaact gcctctttcc ttagtcaaca agaccagatt cttgagctgg 123301 taatggaaag agcatctcca agtagggaga ggtggttgca tctggaagcc ccaactctca 123361 tcctgacccc ctgggcattc tcttattcat gaactctgcc acacggatct ggtggggtgt 123421 gtatctgtgc catcttcctg ttaatgatct agacaggcat gatggaccct tcctgagtgg 123481 ctattaaatg gggatgtttt gagagtaagt cgggtgtgtg tgtggagggg ggtgagggtg 123541 tctcaatgat tggcaggtgc ctctggggcc agagatgcta agcatccaac aatgtatggc 123601 agagtccagt acaaagttcc tgtagccctc ctggtgagaa acactgcatg aggctcagca 123661 cgaccatgag atcaggctga aacacttaaa tagatttcat aaaagcttaa agccagaaca 123721 tgaaaataaa taaaattcca ggcatggtgg ctcacgcctg taatcccagc actttggggg 123781 ggccaaggca agaggactgc ttgaggccag gcgttcaaga caaacctggg caacatagtg 123841 agacccagtc tctacaaaaa atttaaacat tagccaggtg tggtggcaca tgcctggagg 123901 tacttaggaa gctaaggtga gaggatcact tgagcccagg aattcaaggc tgcagtgagc 123961 tatgatcatg ccactgcacc cccaaccggg gtgacatagt gagaccctgt ctcaataaaa 124021 ataaaataaa aacagagtat catataatga ggcacattag gagatgaagt ttattcttca 124081 ggttgggagg ctccaaccca tccatagcct taggttctct ttgtggagag cacgaaccga 124141 gcatccacca ccctaaactc cagcttagat cccagttcct tgtccactga cctaggtcct 124201 acagggtgga gtgcagtggc tcgatctcgg ctcactgtaa tctcaaactc ctggggtcaa 124261 gcaatccctc ctcctcagct aagtagcaag gactacaggt gtatgccatc acgcccagct 124321 aattttttta tttttagctt ttgtacagat ggggtctcgc tatgttgccc aggctgggtc 124381 ctaccttcta ccctccattt aggtgtgatc acagaaaatg tctctccctg tgaggaaaat 124441 ttgccctctc ctccccagct acaagacagg gttgtggcct agctttgctg caccaaaaga 124501 acacatgaat gtattacaga gtagaatatg gcacttaggg attccttgaa gggcagaagc 124561 aggaccagag cggggtcaca cagcacctct gcagtaaatc tggcaaggct tggcaggcca 124621 gtcctcacag tgtttctagg gacactggaa ccccgagagt cccccagagg ccccattcct 124681 agcaactggc aaccaaagcc cattattcct gcatgaaccc ctcctgacca cactggcagg 124741 ggcaacatgg tggccaacct gggggagctg gccaagcttc gagacctgcc caagctgcga 124801 gcgttggtgc tgcttgataa cccatgcacg gacgaaacca gctaccgcca ggaggccctg 124861 gtgcagatgc cataccttga acgcctggac aaggaattct atgaggagga ggaacgggct 124921 gaggctgatg tgattcgaca gaggctgaag gaagaaaagg agcaggagcc tgagccccag 124981 cgtgacctgg aacccgaaca gtcattgatc tagcagcagt tctagcctct aaagatagta 125041 aggaagcctg cagggaggca gtgggaggag gccaagggct gggcaggtag gggaagaggc 125101 aagaggggaa gctgctgcag aaggaggtgg gagaggaaag catcagacaa gcaggaccct 125161 taaagagagg agggttagga gtcagggaga ggaaaaggga cccaaggggc ctgggaccag 125221 ctgagaaaga cttaggaggc cagaagagta agtgaaaaga attggggtgg caggcagagg 125281 agttggtggg gggtggggca gccatacctg acacagagtg aagtcggcta ggaaaggaca 125341 ggtgtgggtg catggtaggg gctgcagggg aaagttggtg gtgtatgcag ctggacctag 125401 gagagaagca ggagaggaag atccagcaca aaaaatctga agctaaaaac aggacacaga 125461 gatgggggaa gaaaagaggg cagagtgagg caaaaagaga ctgaagagat gagggtggcc 125521 gccaggcact ttagataggg gagaggcttt atttacctct gtttgttttt tttttttttt 125581 tttttttttt tttgcgaggt agtcttgctt agtctccagg ctggagtgca gtggcacaat 125641 ctcagctcac tgcaacttcc acctcctggg ttcaagcaat tctcctgcct cagcctcccg 125701 agtagctggg actacaggcg catgcaaccg cgcctggcta atttttgtat ttttagtaga 125761 aacggggttt caccacgtta gccaggatgg tctggatctc ctgacctcgt gatctgcccg 125821 cctccgcctt ccaaagtgct gggattacag gggtgagcca cagcgcctgg tccctattta 125881 cttctgtctt ctacctccag gagatcaaag acgctggcct tcagacctga tcagactccc 125941 aggggcagcc accacatgta tgacagagaa cagaggatgc ctgtttttgc cccaaagctg 126001 gaaattcatc acaacctgag gcccaggatc tgctctgtgc cggtcctctg ggcagtgtgg 126061 ggtgcagaat ggggtgccta ggcctgagcg ttgcctggag cctaggccgg gggccgccct 126121 cgggcaggcg tgggtgagag ccaagaccgc gtgggccgcg gggtgctggt aggagtggtt 126181 ggagagactt gcgaaggcgg ctggggtgtt cggatttcca ataaagaaac agagtgatgc 126241 tcctgtgtct gaccgggttt gtgagacatt gaggctgtct tgggcttcac tggcagtgtg 126301 ggccttcgta cccgggctac aggggtgcgg ctctgcctgt tactgtcgag tgggtcgggc 126361 cgtgggtatg agcgcttgtg tgcgctgggg ccaggtcgtg ggtgccccca cccttccccc 126421 atcctcctcc cttccccact ccaccctcgt cggtccccca cccgcgctcg tacgtgcgcc 126481 tccgccggca gctcctgact catcgggggc tccgggtcac atgcgcccgc gcggccctat 126541 aggcgcctcc tccgcccgcc gcccgggagc cgcagccgcc gccgccactg ccactcccgc 126601 tctctcagcg ccgccgtcgc caccgccacc gccaccgcca ctaccaccgt ctgagtctgc 126661 agtcccgagg tgaagccccc gccaggccca gagcccctgt ggccccctcc cttcgccgcg 126721 gcgcccctgc ctcctttacc cggtgccgct gcgcacctct ccgcatctct ggcccggtgc 126781 agctgcgcac ctgctccgcc gcccgcgccc agggcgcctt ccctccctgg ccttccccgc 126841 ccgctgcctt cgctttctgg ccctctgcgc gcctatctct aactgcgcct ctccaccctt 126901 gcctgcctct ctcccggtgc tcgccctcat ctacggttct atttgtttct aagttgggag 126961 cctctccggg cggtgcattt ttatcctaga gcgtcccttt tggtttgcat ttggggaaat 127021 gtcttctctc accgttcctc acctccccca gacttccctt ggactcccct ctctcctgct 127081 cttccccacg gcgcccctct ccgttcgcgc ttcctcccct ctgctgcacg ggggagagat 127141 ggaagaagtg gggatctgtc gaggcgcaga ggggaggaca ggcccagctc gcctccactc 127201 cccaccggct cttatcctct tcacttcccg ctgcaacccc cagggactgc agggcctttc 127261 tcaggaccct cttccccgac acctgtattg catgcgcctt tcgcgggaag aggagggata 127321 cacgtttggg agagagtggg accttgggga aggggcagtg ttcaggcagc tggggtgtta 127381 agttggggag gtatgggggc tctggaagag aggcccaggc agcccatcct cttcttggct 127441 cccaggagaa atgcaagcct ggcagccatt ccgctctgag gagatctggg ggaaccccac 127501 cggctggcca agctgcaaag aggcgcggga accactgtcg tgcccctccc ctcctcactg 127561 cagtgttctc ccatctcatc atttgatctt accccgcccc ccactgcagt gacctcccct 127621 tcctcactgc attgaccttc tcctcttccc aggggggagg gggaatccct tctacagccc 127681 tagactgcag ccagcgcctc cctacccacc ccccacccac tgcagtaacc tctttcccat 127741 ccctcccagc ctcccgcccc gcccattgat ctcaggctcc acccctctaa gcctcttatc 127801 tttctccttc cttcccttcc acccgaggag atcccagcca tcatgtccat agagaagatc 127861 tgggcccggg agatcctgga ctcccgcggg aaccccacag tggaggtgga tctctatact 127921 gccaaaggta atgggtgtgg catgggcctt cctacagccc tagcttttcc acggccaggc 127981 tgggttcggc caggggttcg gaggcctttt ttgataccca ggggtatggg gtgctgggcc 128041 aggctcacaa gcctgggttg tggcggttgg atcctccttg tccggggagc cagggtaggt 128101 gggtctgtgt tcgagtttag cgtgtgaggc atgtcctgcc tccgtgtgtc tgcctgtgca 128161 cttgcatgtg tgcagacgtg tgctgcaagc aattttcttt ctgcgggctc ccacttgtgc 128221 atgtggggcc tcagatgggt ggatgaggag gccacttctt gtgcatctgc ctcagtgtgt 128281 gtgtggggtg ggggtggggg gatgttaggg agcagtgtgg ggagagagct agatgtggta 128341 ggcaggcagt ttagagccag aaggctgaag gactccctgg cccctgtgtc ctcttcactc 128401 cctctcattc catgttcctc ctttaggtct tttccgggct gcagtgccca gtggagcctc 128461 tacgggcatc tatgaggccc tggagctgag ggatggagac aaacagcgtt acttaggcaa 128521 aggtgaggtc ccttctcttt tccagactct cccccacctc agccttatgc ccctacctca 128581 caccagtccc cagtcctcct ctagcatggc ttcccctcct cccattgatc ccttccggcc 128641 cctcctggcc cgacccagtc cagcctcttc ctttccccag gtgtcctgaa ggcagtggac 128701 cacatcaact ccaccatcgc gccagccctc atcagctcag tgaggcctgc tctttgctgg 128761 ggatagcagg gccagagttc tggaaggaat cccggagcag ggcaggagga agggaagaaa 128821 gaaggcccac tcttaggaat catggttaca agggggaagg gtggggaaca gcttccttaa 128881 tgcaccctgc tcccatggga gttcaggtcc cctaatccag gtaggcccct gtcacaggga 128941 cctggttgga ccctggccaa atgtgagctt gggtgtgaat gaggggaccc tctgccttag 129001 ggctcagcct ccagcctggc cctgggtgat ggaggctctg ccctcagggt ctctctgtgg 129061 tggagcaaga gaaactggac aacctgatgc tggagttgga tgggactgag aacaaatgtg 129121 agccggggcc gggagaaagt ggggaagcgt cagggtgggg aggcgtggag cagatagaga 129181 gctgaagggc cagtgctgta gtggcttcct caggaatgac tgtcaggggc attctcctct 129241 caaagccaga gcaaggggag atgagtttag ctgcagaggg aaggaccgac agtaggcaga 129301 aggaagacct tctttgcagc atacagagga gggggatggc ctgagagagt ctgtggtctc 129361 agggatattt agaaagaggt gtggctctct gccgtttcca atctcctcct ccccacccat 129421 tcctctccct gctgttttca agagagcatg gatgaggtgt tgctggggca ggggtgggga 129481 gggaggaggg ggtctccttt actggctcct tttggagact acagatggag gcaggagcta 129541 gaaaggagaa ggggacattg tgctcagcac cttcctctat aatctcctag ccaagtttgg 129601 ggccaatgcc atcctgggtg tgtctctggc cgtgtgtaag gcaggggcag ctgagcggga 129661 actgcccctg tatcgccaca ttgctcagct ggccgggaac tcagacctca tcctgcctgt 129721 gccggtgagc aataagccag cctgcggctc tcccaggggc gggtggggga gggagcatgc 129781 aactcatgag gaatgatggg aggaaagtga attgagggag gtaaagagga aggatgggga 129841 cgtgagactt agtccggaaa gctgggggaa gtttgggatc ttgggttaac actcctgggg 129901 cgggcaggga ggggctcttt gacccttctg tctttctgtg gctccccagg ccttcaacgt 129961 gatcaatggt ggctctcatg ctggcaacaa gctggccatg caggagttca tgatcctccc 130021 agtgggagct gagagctttc gggatgccat gcgactaggt gcagaggtct accatacact 130081 caagggagtc atcaaggaca aatacggcaa ggatgccacc aatgtggggg atgaaggtgg 130141 ctttgccccc aatatcctgg agaacagtga aggtgaggcc aggagcccca ctcccagctc 130201 taagtcttac cctattgtgg gacatcagaa agggtgacac agttcaccaa gtcctgagta 130261 ggcgtggagg gtcctaggac tctgcaaact ccaaaaggta ccagttctta gagtggattg 130321 cagagagcct gccaaattca catgcagacc tagggggaca gtattttttt ttttttttga 130381 gacggagtct tgctctgtca cccaggctgg agtgcagtgg ctcgatctca gctcactgca 130441 agctccgcct cccgggttca cgccattctc ctgcctcagc ctcccgagta gctgggacta 130501 caggcgcccg ccaccacgcc cggctaattt tttgtatttt ttagtagaga cagggtttca 130561 ctgtggtctc gaactcctga ccttgtgatc ggcccgcctc ggcctcccaa agtgctggga 130621 ttacaagcgt gagccaccac gcctggccag gggacagtct tttacctgcc tagccagatg 130681 tgttagcatc tgttaagttg ccactggaag gccgggcgcg gtggctcaca cctgtaatcc 130741 cagcactttg ggaggctgag gcgggtggat cacctgaggt caggagtttg agaccagcct 130801 ggccaacaag gtgaaacccc gtctctacta aaaatacaaa aattagccgg gcatggtggc 130861 gtgtgcctgt aatcacagat actagcgggg ctgaggcagg aggatcgctt gtacccggga 130921 ggcggaggtt gtggtgagcc gagatcatgc cactgcactc cagcctgggc aacagagcga 130981 gactccgtct caaaaaaaaa aaaaaaaaaa aaagttgtca ctggagccct tgggaatact 131041 gggagatggt ctggatgact gttggattca tcccatccat tcgaaatctg tcctgtcccc 131101 gtcccaacct cctaggcctc tagaatccct aatttttctg tgccttgggg gaaactgtat 131161 agggaatgga aagaatatag gtagtggtta agagttaagg ttctggggcc aaataacctg 131221 gatttatttg aaccttgact tgtggagtta ctgctggtga actttcttac ttctctctgg 131281 aaataataac agaatctagc tcatggtagt gtgaggctta aatgaaatat atataaaatg 131341 cttagatgac atgataccaa tgaaagtata gtaagtatta ttaagagaat tccattcctc 131401 tgtgttccta gaagatggcc tctcctccca gacctgggga taaccccaac agcatccccg 131461 ccacacttcc ctcaggaaca gccctccacc tctgccctga atgtcttttc tttccctcct 131521 cctccttgcc catccctcct gcttgtacta taatctcact gtattctgtc cccagccttg 131581 gagctggtga aggaagccat cgacaaggct ggctacacgg aaaagatcgt tattggcatg 131641 gatgttgctg cctcagagtt ttatcgtgat ggcaaatatg acttggactt caagtctccc 131701 actgatcctt cccgatacat cactggggac cagctggggg cactctacca ggactttgtc 131761 agggactatc ctggtgagag gaagtggtgt gagggggagg tctgggggca ggcagggacg 131821 tgtcccagca actctggacc ttatggggtg ctgactcagg caccaggtgg gggtgtccta 131881 agaagaacct gagaaccagg gagagggtgc aggagccacc tgcaaagact gggctttgta 131941 tgtagtgtaa aaatgcaggt acccgtgacc aatctgttct gtctcagatc ctgattaaag 132001 tcatgggttc tgaaaactac tgggtcatgg ggaaggctct ggaaggaacc agggatgata 132061 tgagtgtgac tggatctagc agagaactag aacatttcag tatctttgat tgatgaaatt 132121 gtggatgctg aatggagctg ggactgatgt gtgagtagaa agaaggctga gggggactga 132181 attagcttct gcaagtctgc ccagggcctt tatctcagga tggaggcagt tggtcgccat 132241 cttcctgaca cagagcgaaa gaaaatcaac ataattgcag aattaaggat ttggattagc 132301 aatgaggaag ggcttgctgg cagtgagaac aggaaaatgc aaagctaggt tgtgaaccct 132361 tgttactgga agattttttt tggggggggg gtggggttgt tttttttgtt ttgttttttt 132421 tttttgagac agagtttgac tcttcttgcc caggctggag tgcaatggcg ccatctcggc 132481 tcaccacaac ctccacctcc cggattcaag cgattcttct gcctcagcct cccgaatagc 132541 tgggattaca ggcatgtgcc accatgcctg gctaattttg tatttttagt agagacaggg 132601 gtttctccat gttggtcagg ctggtctcca actcctgacc tcaggtaatc tatccacctt 132661 ggtctcccaa agtgctggga ttacaggcgt gagccaccac gcctggctgg tttgctttta 132721 attaactttt tttttttttt tttttttttg tagagacagg gtttagccat gttgctcatg 132781 gctggtctca aactactggg ctcaagtgat ctgcatgcct cggcctccca aagtgctggg 132841 attacaggca tgagccactg tgcccagcct atggaagtgg cgtgagatct ctgctcactg 132901 cacgattctc ttagcctccc aagtagctgg gattacaggc gtgcaccacc atgcctggct 132961 aatttttgta tttttagtag agatggggtt ttactatgtt agccagggaa ctcctatcct 133021 caagtgatcc gttcacctca gtatcccaaa gtgctgggat tacaggcatg agccactgtg 133081 cctggcctct ccatgtaagg ttttatgaaa taagaatcag gagccaggcg tggtggctca 133141 tgcctgtaat cccattactt tgggaagccg aggcaggagg actgcttgaa tccaggagtt 133201 cgagactggc ctggcaatac agtgagacct catctctaca aaaattttaa aaattagctg 133261 agtgtggtgc cacacaccta aagtccctgc tactcaggag gctgaggtgg caggatcact 133321 tgattgggga ggtggaggtt gcagtgagct gagattgtgc cactgcactc cagcctggat 133381 gacagaatga ggctctgtca aaaaaaaaaa aaaagttaag aatcagttag ggcaggctta 133441 cactgggggg atttgtctta gcaaggatga gcaggtgtag ttaaccaagg gcctgtccat 133501 ttcagggaat aaaggggcat gttcctgcct gatgtagaga cccagggaag atgaacacct 133561 cccccctccc caccatgctc tctctgcagt ggtctccatt gaggacccat ttgaccagga 133621 tgattgggct gcctggtcca agttcacagc caatgtaggg atccagattg tgggtgatga 133681 cctgacagtg accaacccaa aacgtattga gcgggcagtg gaagaaaagg cctgcaactg 133741 tctgctgctc aaggtcaacc agatcggctc ggtcactgaa gccatccaag cgtgagtgac 133801 ttctggccct ctcctgtgtg gtcctcgttt ctataagact ccttttgcaa gtgctccagc 133861 ctaattctac ccaggggtgc caaagagagc ggggaacctg gaatcatcct cacagttctc 133921 tcacctctgc ccctccaccc ctgattctct gctcccctcc cagatagctt tcccctagat 133981 gtttcctgac atagaccaag gttggggctg ggaagagagt gcccagtgtg agagctggag 134041 aatcagtgct gtgtgtggat acaggtgcaa gctggcccag gagaatggct ggggggtcat 134101 ggtgagtcat cgctcaggag agactgagga cacattcatt gctgacctgg tggtggggct 134161 gtgcacaggc caggtgagtg aggcagcctg gtgagtgaag agaactctct gtgggattgg 134221 tatttctagc tcacccacct ggtctctcct tccaggtgtt tgagggtgtc aggggagttt 134281 caggagagca gaagtttcct ttcaggggtg agagggcagt cactgagctg caaatccttt 134341 gaaatgtttc agatcaagac tggtgccccg tgccgttctg aacgtctggc taaatacaac 134401 cagctcatga ggtgagggtc cctggggtgg gagcccctgg cccagatggc taaaggcccc 134461 atttgcctgc cagaccatct gtagcaccaa gggcctggat aacagtccat ttcctggata 134521 acagtccaac agataatatt ggtttttgct tcctgggttt attgatggcc tgattgacaa 134581 atcccagaga tcacatggga aagccaggga atgctaagcc ttggggcagg acacaaaagc 134641 aggtggtgtg ggggtggttg gagtctgggg gacccctaga gagagaagca ggatcctcct 134701 gcatccctga ccacttcctt tgtggttcat ctctctcaga attgaggaag agctggggga 134761 tgaagctcgc tttgccggac ataacttccg taatcccagt gtgctgtgat tcctctgctt 134821 gcctggagac gtggaacctc tgtctcatcc tcctggaacc ttgctgtcct gatctgtgat 134881 agttcacccc ctgagatccc ctgagcccca gggtgcccag aacttccctg attgacctgc 134941 tccgctgctc cttggcttac ctgacctctt gctgtctctg ctcgccctcc tttctgtgcc 135001 ctactcattg gggttccgca ctttccactt cttcctttct ctttctctct tccctcagaa 135061 actagaaatg tgaatgagga ttattataaa agggggtccg tggaagaatg atcagcatct 135121 gtgatgggag cgtcagggtt ggtgtgctga ggtgttagag agggaccatg tgtcacttgt 135181 gctttgctct tgtcccacgt gtcttccact ttgcatatga gccgtgaact gtgcatagtg 135241 ctgggatgga ggggagtgtt gggcatgtga tcacgcctgg ctaataaggc tttagtgtat 135301 ttatttattt atttatttta tttgtttttc attcatccca ttaatcattt ccccataact 135361 caatggccta aaactggcct gacttggggg aacgatgtgt ctgtatttca tgtggctgta 135421 gatcccaaga tgactggggt gggaggtctt gctagaatgg gaagggtcat agaaagggcc 135481 ttgacatcag ttcctttgtg tgtactcact gaagcctgcg ttggtccaga gcggaggctg 135541 tgtgcctggg ggagttttcc tctatacatc tctccccaac cctaggttcc ctgttcttcc 135601 tccagctgca ccagagcaac ctctcactcc ccatgccacg ttccacagtt gccaccacct 135661 ctgtggcatt gaaatgagca cctccattaa agtctgaatc agtgcactgt tgtgtctaag 135721 gagtcttact ctagtcccta tgaggggaga gaagatggag cacctggaag ctggtgaaac 135781 tggatagcag agctgggggg gcacaaaaag aggaagacaa actgaacaaa tatggccgag 135841 atgatggcac tgcctacccc attctggcta ggtggggtgc atgtggcccc tgctttctta 135901 gcagaaggct tggctcccag acgcaggtga attaaggggt tcaagagccc ctaaaagcat 135961 aaaatatttt gtgtgtgtgt gtgtgtgcac gcgcattttg ggggaaaggg ggtctaaggt 136021 gttttcatat ccaaagggct tgtggactgg agcagctcct gtactgggcc tctgccaaca 136081 aaaccctggc tggttctcga atggaacagg acttcatggc catcacccac tgcaagatgg 136141 ggaaatggga aggaagaatg gttccggggg tagtatacgg aaggacctaa ggaaacagag 136201 tcctcaataa actgaagatt caggaacaaa agtgcttaac agaaccctgg ctgggtcaga 136261 ctaacagtag gtttccaata tgtggctaga gacgtactac tccttagcta taactcgtta 136321 actcttatgg ggcctcagcg ttgagatgct cattctaggt acccccgagg gggacaaatc 136381 ccttcctagt ggctcggatg caaggttgcg tcccacagat agggtttgca catgggagat 136441 ctaaccacta gagggagtca gagttttgca ggacgccata ctggacgcca agtgggagga 136501 acttcaaggc tgtcccctgc gggcctcccg ctctgcttct gcgaaggtct ggtcctggct 136561 ccctctgcca tcctgcccag ctccaaggct aaaataggtt ctagttctgt atgggtaggg 136621 cccatttctt ttcttgccct ccttcccggg tccctgacac ctcagccatt ttcagttcaa 136681 agagcctcat gccccctttc catttctgtg ccttttagtt tcagtgtggg catctcatct 136741 ttcccttttc tcctggcgac cctttatgcc tcccttgggg tttcctgtac tttcctctgt 136801 gcctctcagt tttctctggg gtccctgata cccatcttac cttagtaact cccagactga 136861 ccaggactat cctagtttga gcctagttcc tagcctctgt cctttctttc atccctcctc 136921 accaggtcgc cttctctgac agtctggccc cccaaccccc aaagtatcgg ggttggattg 136981 ggttacagct agctctgcct gggcctccca ggcccttctc tccactttcc ctttgtgccg 137041 ttgcctggag accaatgggt tgtggggggc tgagccagag agggagctgg ggtgagggat 137101 ggctggggct gtgtgtgtgc gtgtgcgtgt gtgtgtgtgt gtgtgtgtgt gagagagaga 137161 gagagagatg tcccctgcag ctggactata agcctggccc tcaggtaggg gaggaatggg 137221 agagggtaga caatggtttc ttttgtgact ctcaccatcc cctatcccca catatgccta 137281 aatttttctt ttagtatcca ttgtagccca ttccttagtg attaaaaaac aagggagaag 137341 agctcagcct ggaagggatt tggaaagggc ttgagccaag gcccagtggg agactgaact 137401 gagtggtaca gaaagagagg aaatgccatg gggagagagg gtgagaggga aagggggagc 137461 cctggaacaa agtcaggggt ggggaggagc agcagctacc ctagagcaag aggaagggag 137521 tcggggtgca cagggagaca gggagctggc ctggtctcag ggaggccctg ggaggaaggt 137581 gtgcttggag tggtaaagtg agtgccaacc aattcaaggg acttcagagt gatgactcat 137641 tcccaggcca aggaagtgtt cctgttcctg cgcccagttc cccagtgctc tggaatcttg 137701 tttagagctt ttgagtattc ccagaacttt gttgaatctg gcccaaggaa ttgtgagaca 137761 aaaatctact gagctgctcc agagctaacc tccttccttc cctgaaggca ggtgggatct 137821 tgccaattca gggcccaccc ttagcctcca gcaagactgg ccccacaagt acccgacccc 137881 caccccaaat ctggagctct actgagggct cctaatccac actccagctc cagctcttcc 137941 ggcttgccag cctgcctccc tcccattctc tggccttccc aatttctgct gccaaaaatc 138001 accctgtgtc ctttgtcctg ggagatgttg ccgcttggtg gagggccagg agggagccca 138061 ggagaaggaa ctggtggtgg gctggcggag gtgctggctt cacggccggg agcagactga 138121 ctgccagagg ctacctggag agcaggggag ggagaggaag ggttgggagg agcctgcagt 138181 ctgtggtggg gactgtctcc accttagaac aagtgggatt gagcaacaag gatacccagg 138241 gatttgggtc cacattaagc ccagtcttca cccgactgct tccctgtgac ccaggggatc 138301 tttagtgatg actttcaaca gatttgagaa ggaaaatgga gatgccacta gatggaaggc 138361 ctgaggatgg tgtctcactc ctccaagctg tatcccttac ctatctaccc atgagcttgc 138421 cagcaacctc cccccgctcc cacatctgtc atttccttta actgccctcg tttcccatct 138481 ccattttttc tttttttttt tgagacggcg tctcgctctg tctcccaggc tggagtgcag 138541 tggcgcgatc tctgcttgct gaaacctttg cctcccaggt tcaaggattc tcctgcctca 138601 gcctctgagt agctgggatt acaggcatgc gccaccacac ccggctaatt tttgtatttt 138661 tagtagagac ggggtttcgc catgttggcc aggctggtct tgaactcctg acaagtgacc 138721 cacccgactc agcctccgaa agtgctgaga ttacaggcgt gagcccccgc accggcccca 138781 tctccatttt cttttcccat ccatgatctt tcctcttcca tactgccgcc agccataggc 138841 cttgttcagc ttacatttcc ttccaatctt cttatccccc tcttccccat tctgcccgcc 138901 caagtaaacc ctcctcggct ctactgggag cccccacccc caggatccag ccagcccggt 138961 gtctcccacc cttctttccc atccagagcc ctcctttctc acccagaacc tattttccct 139021 agaattggca cctgggccca cctcgttcat cccttcttaa ggttcctcac gcgttccctc 139081 ctctctctcc tcactgacgt ttcctctgga cttctctctc tccagcattc cttcatcccc 139141 ccacccccaa ttcttatatt tatgactctt ttgtcaacac taatcacccc caaaagcttt 139201 actttttcca gaagccccat ctttctatat atgtcatctt ttcaattttt ctctctcctt 139261 atgaccccaa acttccatcc cattctctct gataatagcc cccaaacccg tctggctcca 139321 tctctgccac agatccctca ccccaagctt ctctcacttc tcaccaggca cccacaaagc 139381 ccccaggcag ctccatcttt ccaatccaat cccattatcc caatctctac cccaggatcc 139441 cccaaactcc tcccacttca cctctgccac agacccgctc gcccccaaac ttcagcctcc 139501 cctcatctgc cctcaccacc cacagcccct cctacctagc cctctcccgc gccgggcccg 139561 cgggctcccc acattccgcc acggctagcc ctccccgcgc cccggccccc tcggcctcgg 139621 cctcggccaa ctcgcgctgc gcctccccct ctccggctcc ccctctctcc cccctcgcgg 139681 cttgcctcgc gctccctccc tcgcgtctcc ctcctggccc ctctctcctc cgttcctccg 139741 cgctccctcc ttctccctct tcctcccttc tcccatcccc ctctcccagg ctccctccct 139801 tcgccgtccg ctttcctgtg cgagtcgccg gacgcaccgc ccagcccgtc cgcagccccg 139861 cgccgggtcg gggcctcggg ccgggccatt tcggagggac cgcggcttcg gggaggggct 139921 aggagggggg agagggagcc gactgccggg gaaggggcgg gggccgcggg cgaggcggcc 139981 gaggggcccg ggatcgcgcg ggtgcgggct gcgctaggcg ggcgcgggcg gcggcgggtc 140041 ggaaccgcgc cgagggccgg gcgggccgcg gggccgggcg gcgcggcggg ggcgggcggc 140101 gcggcccggg gcattccggg cggccagggg gaggcggcga gcccggggag gtggggagca 140161 gagcggggat cggggtttgc tccgggggcc ggcgggcgat tggggccagg cggggaaaag 140221 gggggatggg ggccgccctc cgggggggtc ggggccgccg ccgccgtcgt cgcggcggcg 140281 actgaggccg agaagaggag aggggggcgg gggagctgcc gccgccgccc cccagaggcg 140341 ccggagcccg gaatcccgct cggagccagc cagccgtccc gagctaccag caggtaaggt 140401 ctgcggccgc ctgggccccg ggcccgcggg gttctgtccc gcccgggctc cggcctgcgg 140461 ctccccaccc ccacccctcc cagttccccc tcctgccgcc gccgcctcca tttgttattt 140521 tcccaatccc gtccccccac tgcgggctct gcctggcctg agggccgggg gctcaggggg 140581 tggggcccgc cgaggagagt cgggggccag ggtttcggga ggatcgggaa ggtggggaag 140641 gagggagggt ctccggagga ggcctgtggg gccagagggt atccgggacc cgcagactca 140701 aaagaggagg aacatcgagg ccggggttgg cggaggaaaa cgggggatgc tgtgagtggg 140761 gttccgggcg aaggcagaat ggataggaga gatggcagcg aggtggggca tgaagcctcc 140821 aggaaaaagc tccaaaggac cagaacacgg gttccggagc ctggggttgg gggctttagg 140881 aatagacggg aagtgggaaa cgggaaaggg gtgtggtagg tggaaagagg gattgtggct 140941 gatgaacctg ggggagacgg aggcttgggc aaagagaggt ccatttttga gaagagccgg 141001 aggttttcca gcgtggaaga gggaaggcca tgtctttgtg acggttttca ttcccctact 141061 gcaggtcacc tctgaccctg ttttccctat cccagacata gttatttctt ggagagggca 141121 ctggggccca gtagtgtggg ttatcccaaa actagtgagg aggaaaacag gaacacagaa 141181 gggggagttg ggaagaaagg ggttctggga ggatggaaat ggagaattga gcaaccttac 141241 aggcactcca ggcaagttgc gacctccacc ctccaccctc caatttccgg agcgtccgct 141301 gcccctggag ctgtgatcta tatagggagg ccttagttac tgctttgagc tcaaagatct 141361 ccctttgtgc atgggcttat tttgactagg gtaggtacct tcaagctccc ttacctgagg 141421 tgctatagga gggtaggagt tgctttggca actggggtag ggggtagctg ccagatggac 141481 acctgggatg gtctcttgag gggcaccttt tgccactgtg ttgctatggt agtgatccag 141541 tgggatgtca gattggttgg gggggcttcg aatgtttatt ggacctcagc ctgtggctgt 141601 ccttgggact ctgggctggg ttttggaggt tgtttctgga tctgttggtg gttgggagtt 141661 tccccacggt gtgcaccctt tcatgttgaa gtctgttgga acaggtagat acgttccttg 141721 cagatcatag gtgtttctcc aagtctctcc acctgggcat gatgggagga tgaaaacgct 141781 tctgttcagg ttacaccatc ttttggcagt tggttctgac ctctacctcc cttattatgt 141841 tattttttgt agagttgtct ttttgcttct ttgcactttc tttcattttc taaaacttga 141901 attcctcttt gagcctccct tcttgagtaa cccttttcct ctttccctca actaaccttt 141961 ggccttcttc cctgaccttg tagaaatggc tgggctccca ggcccatccc aggtgagggt 142021 ggcccaggac agagggctgt gaaacagctt catgcagggg aaatactata gaacaggagc 142081 caggagacct gtgtctagac ctggctgtgc ctctgtgacc ttgggcaagt cacttctctt 142141 tgagcctggg cttcattgtt aagatgaagg ggctatagat gagatgatct ctgaggcctc 142201 tctgaaggtc ccggattctc agaaggctta ggccagtgca tcgttatctg ctggtctggg 142261 ctgcattgtt atctaggagt cactttggac tggtttgtta gtctggcagg tccagaatga 142321 tgttgggggc tgggccaatg atcctgaaat ttagcctgcg agcccctccc agagagcttg 142381 tgactccact gtattccttc tgacatccca tatttctctc ttaatttctt ttcctgttct 142441 tacttagggg ctctttttct agttctcact ttcctcccaa cccagtctgg tcttaatcat 142501 tttcttttta aagtaaggta atggcagtgc gcagtggctc acgcctgtaa tcccagcact 142561 ttgggaggct gaggcgggcg gatcacgagg tcaggcgatt gagaccatcc tggctaacac 142621 ggtgaaaccc cgtctctagt aaaaatacaa aaaaattagc cgggcgtggt ggcgggcgcc 142681 tgtagtccca gctactcagg aggctgaggc aggagaatgg cgtgaacccg ggaggtggag 142741 cttgcagtga gccgagccga gatcgcggca cggcactcca gcctgggtga cagagtgaga 142801 ctccgtctca ataaataaat aaataaataa ataaataaaa taaagcaagg taatgaaggt 142861 gaatgtgctt agtatgtggc cagatacaga gtaggtgctc tgtaatatta gttacagtga 142921 ttgcctgcta ggagtgtagg ctggtgctaa aacatgaccc aggtctagaa agacacacaa 142981 tccaccccta actcctttcc tcgtctgcca ctccttatcc ccaggattac ttgttctttt 143041 atgactgctt ttctccttca aagcttctca ttgctagttt ttatcagatt tcaggtgatt 143101 aaaaaagaag ggcatactat gattgctgtg ttcctcaggt acattcctgt gtttctctga 143161 ccttttggat cccctaacct aactgacttg gtgaacaggt tctgtgatgg ggaagggaga 143221 tggacaccct tctctaggag ggctggcaaa aatacaccaa tcacaaatag tcatctttgt 143281 tgtgcttttt gtttaagatt gttttcctct tttgctatgg tgggtgttgt tcttcggatt 143341 tttttttttt tttggttgat agtattttaa aaccagtcta cataacaact aaaaaatgtg 143401 ggaaaatagc tgtagttgaa aaaaaaaaaa aaaaaaaaaa ggccgggcat ggtggctcac 143461 gcctgtaatc ccaacacttt aggaggctga ggcaggtgga tcatctgagg tcaggagttc 143521 gagaccagcc tggtcaacat ggtgaaactc cgtctctact aaaaatacaa aaattagcca 143581 ggtgtggtgg tgtgcgcttg taatcccagc tacttgggag gctgaggcag gagaatcgct 143641 tgaacctggg aggcggaggt ggcagtgagc tgagatcatg ccatagcacc tccagcctgg 143701 ggaacaagag caaagatcca tctcaaaaaa aaaaaaaagg aaaaaaaaag aaaactagtt 143761 ggccgggcac agtggctcac gcctgtaatc ccagcacttg ggaaggccga ggcaggcaga 143821 tcacctgagg tcagtagttc gagaccaccc tgactaacat ggtgaaaccc tgtctctact 143881 aaaaatacaa aaaattagcc gggtgtggtg gtgcatgcct gtaatcccag ctattcagga 143941 ggctgaggca ggagaatcgc ttgaacctgg gaggcagagg ttgcagtgag tgagatcacg 144001 ccactgcact ccagcctggg caacaagagt gaaactctgt ctcaaaaaaa aaaaaaaaaa 144061 aaaaaaaaaa gacaagtcta ctgggcagag catcagttga aagttcagaa gataggctct 144121 gtcccagcaa ttttgtgatt ctaagtaaat tgtgaagttt cctggaaagg ttagactcac 144181 cccttactgc ctgcatcagt atggttccgt ggaggtgagt gagaattact gtgtttcttg 144241 ggaaaaaata cactacagaa tatggacatg ccgtttacct agcaggcctc caactcagaa 144301 tagaatacat gggtaagctt agtctgttcc cactggagct agcagcactt aattcaggaa 144361 gaaagaccta ctgatgttag tgggtttaaa ggtcaatcac aagaagagcg cacacacatc 144421 agtggaaagg aagaaaacag taagaaaata ccatttttgc tctcaaaaga tcttacaatc 144481 agtaatttgg agggacttga cacttagcaa gaaatacaaa caagtgcaaa tgaatacagc 144541 tggctatagg gatttggata aagatagata cctgctgata gacagatacg gatggattgg 144601 ctgtgagaaa ctgtgtaacc cagtctacac caagtaggag gataggctta aaaactggta 144661 catctgtgtt gtttgctcac ggatattttg agtctttaat cattcatgtg aaaaccctga 144721 cttttcctct agccttaggg agctcaagta ctagagaatg ggtcctttat ggggctctca 144781 aaaatgaagt tctacaattt ggggctacct aggtttagtc ttgggattgg tagggctgtg 144841 ttcctttgta ttatagaatg ggacattctt cttctaccta ctctgatgtg agaaaagatt 144901 cttttcatat ctctgtgact gatctaggcc ttctgagaat tctccccagc aacttctgag 144961 gagaatacag caaagtatag aagaaagagc atgggctttg aagtcaccga catgggtttg 145021 attctggctc tgtcactggt tcctcactgt gtgaggttgg gctagtcaca atcacaatct 145081 ttgagcctca gttccctcac ctatataatg agggtaataa tgcttaccat tcagggttgg 145141 ttgtaaatat taaatgaaat catgtatgtg aagggcttga tataaagaag gtattcaata 145201 aattcatact tcaatgtcat ttcttttttc tttttttttt tttgagacag agtcttactc 145261 tgtcacccag gctggagtgc agtggcatga tctcggctca ctgcaacctc tgcctcccag 145321 gttcaagtga tttgcttgcc tcagcctcct gagtagctgg gattacaggc acgtgccacc 145381 acgcctggct gggtttcacc atgttggcta ggctggtctt gaactcctga ccttgtggtc 145441 catccaccta ggcctcccaa agtgctggaa ttatagacat gagccaccat gccccgctcc 145501 atttctttat gttcctttgt aaggctccgt ggaatgtgtg tgttcacaaa tatcctttag 145561 ggatcatgaa gggtaaactg tgtgatctcc tggattgggg cttgtaggta catccgaatt 145621 ttctgccctc tacagagatg aaagagatta tacttcgagc agaatctgcc tcccttttag 145681 atagctaagg tagcccatgg ggcatgccag caatgtcttg tggtatctgt acctcttggc 145741 ccaaaggctt agggttgggc cgcttgctca gttctctgac aagcagaata gtattccttt 145801 gtgactctgt tctccaagtt ggagactctg atggtttcct tctaacaggt ttcattgaaa 145861 acagatcctg caaaagttcc aggtgcccac actggaaact tggagatcct gcttcccaga 145921 ccacagctgt ggggaacttg gggtggagca gagaagtttc tgtattcagc tgcccaggca 145981 gaggagaatg gggtctccac agcctgaaga atgaagacac gacagaataa agactcggtg 146041 agttaaaatg agagacatga aagatgaggg gcggggcagg caagctagga ggaagggtct 146101 agagaagaag aacaaataat gtgcaccata aagttaggtg caatgtaaag aacagtgtta 146161 cctacctcct tcctcctcct gtagatgtca atgaggagtg gacggaagaa agaggcccct 146221 gggccccggg aagaactgag atcgaggggc cgggcctccc ctggaggggt cagcacgtcc 146281 agcagtgatg gcaaagctga gaagtccagg cagacagcca aggtattctg tcctcaggtc 146341 ctcccacagg atgcccaagg cactggggct gagggtgtgt gtgtgttgtg ggggaacttc 146401 ctgtttggca gagggtaacg gtggagctcg ggaggtaggg aaaagacagg aattttctct 146461 ttctctctaa cagaaggccc gagtagagga agcctccacc ccaaaggtca acaagcaggg 146521 tcggagtgag gagatctcag agagtgaaag tgaggagacc aatgcaccaa aaaagaccaa 146581 aactgaggtg ggaaaccctt gtcgccatcc tgacccatct tgtgaccatt cttttctcag 146641 acttgcttat gctcactatt cttagctgga tctctcctgg gacataagag aaaggccaga 146701 tcatagtgct tatgagagca gttctgtcta taatatgcca gagagattct tagagctttg 146761 acagaccacc agatgaccag gcagggccaa aggggaccag aagagttggg ggattctagt 146821 ctctgggtga gaagtcttag tcggaggaac aaataaatct taaggaaaat gcagaagtgg 146881 tctttctttt ttattgtttt tttttgtttg tttgtttttg agacagtttc gctctgtcac 146941 ccaggctgga gtacagtggc acaatctcag ctcattgcaa cctccgcctc ctgggttcaa 147001 acaattgtgc ctcagcctcc cgagtggctg ggattatagg catgagccac catgcccggc 147061 taatttttgt atttttggta gagacagtgt ttcaccatgt tggccaggct ggtttggaac 147121 tcctggcatc aagtgatccg cccgcttcag cttcccaaag tgctgggatt acaggtgtaa 147181 gcctctgtgc ccggccagaa gtggtataaa aaccaagggc ttgggggatg gaggatggtt 147241 aaagtggtgg ttatgagatg gtggaagaca gggagttgga gtcagctgtg ggtagagaca 147301 aggtgcttga gatggatcct tgaggaagga atagggtttt acaggcggga aacaagactg 147361 gctgctaggg agcagccaag aatgtgagga aagtgagaaa tccccagatg gtaagaggtg 147421 ggcttgagca agagtactca ccacatcaca gtacaacagt gtgctgtgag gggaaatgat 147481 tttatgcaag agagccccaa atctcaggga agcaggaaag gaaaagagaa aaaatgagtc 147541 ttcccttttc tacagcagga actccctcgg ccacagtctc cctccgatct ggatagcttg 147601 gacgggcgga gccttaatga tgatggcagc agcgacccta gggatatcga ccaggacaac 147661 cgaagcacgt cccccagtat ctacagccct ggaagtgtgg agaatgactc tgactcatct 147721 tctggcctgt cccagggccc agcccgcccc taccacccac ctccactctt tcctccttcc 147781 cctcaaccgc cagacagcac ccctcgacag ccagaggcta gctttgaacc ccatccttct 147841 gtgacaccca ctggatatca tgctcccatg gagcccccca catctcgaat gttccaggct 147901 cctcctgggg cccctccccc tcacccacag ctctatcctg ggggcactgg tggagttttg 147961 tctggacccc caatgggtcc caagggggga ggggctgcct catcagtggg gggccctaat 148021 gggggtaagc agcacccccc acccactact cccatttcag tatcaagctc tggggctagt 148081 ggtgctcccc caacaaagcc gcctaccact ccagtgggtg gtgggaacct accttctgct 148141 ccaccaccag ccaacttccc ccatgtgaca ccgaacctgc ctcccccacc tgccctgaga 148201 cccctcaaca atgcatcagc ctctccccct ggcctggggg cccaaccact acctggtcat 148261 ctgccctctc cccacgccat gggacagggt atgggtggac ttcctcctgg cccagagaag 148321 ggcccaactc tggctccttc accccactct ctgcctcctg cttcctcttc tgctccagcg 148381 ccccccatga ggtttcctta ttcatcctct agtagtagct ctgcagcagc ctcctcttcc 148441 agttcttcct cctcttcctc tgcctccccc ttcccagctt cccaggcatt gcccagctac 148501 ccccactctt tccctccccc aacaagcctc tctgtctcca atcagccccc caagtatact 148561 cagccttctc tcccatccca ggctgtgtgg agccagggtc ccccaccacc tcctccctat 148621 ggccgcctct tagccaacag caatgcccat ccaggcccct tccctccctc tactggggcc 148681 cagtccaccg cccacccacc agtctcaaca catcaccatc accaccagca acagcaacag 148741 cagcagcagc agcagcagca gcagcagcag cagcagcagc agcatcacgg aaactctggg 148801 ccccctcctc ctggagcatt tccccaccca ctggagggcg gtagctccca ccacgcacac 148861 ccttacgcca tgtctccctc cctggggtct ctgaggccct acccaccagg gccagcacac 148921 ctgcccccac ctcacagcca ggtgtcctac agccaagcag gccccaatgg ccctccagtc 148981 tcttcctctt ccaactcttc ctcttccact tctcaagggt cctacccatg ttcacacccc 149041 tccccttccc agggccctca aggggcgccc taccctttcc caccggtgcc tacggtcacc 149101 acctcttcgg ctaccctttc cacggtcatt gccaccgtgg cttcctcgcc agcaggctac 149161 aaaacggcct ccccacctgg gcccccaccg tacggaaaga gagccccgtc cccgggggcc 149221 tacaagacag ccaccccacc cggatacaaa cccgggtcgc ctccctcctt ccgaacgggg 149281 accccaccgg gctatcgagg aacctcgcca cctgcaggcc cagggacctt caagccgggc 149341 tcgcccaccg tgggacctgg gcccctgcca cctgcggggc cctcaggcct gccatcgctg 149401 ccaccaccac ctgcggcccc tgcctcaggg ccgcccctga gcgccacgca gatcaaacag 149461 gagccggctg aggagtatga gacccccgag agcccggtgc ccccagcccg cagcccctcg 149521 ccccctccca aggtggtaga tgtacccagc catgccagtc agtctgccag gtgagcggcc 149581 aggtggggcg gaggtgggcc tggaaagggg acgacgacaa ggcggcgacg agagagggag 149641 tagcagggag gggccttgcg ctggtgtagt gttttagaaa agcacgcccc tctcctccgt 149701 ccaggcctag tggccagtga ggcccgcagc agctcacagc ctgcaggggt ggttttgagg 149761 cgggggctac aagcactcgc cggggccgcg gcgctgcggg ctccatcggg cagctcgcac 149821 cgcctgagcg cccgctgctt ccacgcccgg caggttcaac aaacacctgg atcgcggctt 149881 caactcgtgc gcgcgcagcg acctgtactt cgtgccactg gagggctcca agctggccaa 149941 gaagcgggcc gacctggtgg agaaggtgcg gcgcgaggcc gagcagcgcg cgcgcgaaga 150001 aaaggagcgc gagcgcgagc gggaacgcga gaaagagcgc gagcgcgaga aggagcgcga 150061 gcttgaacgc agcgtggtga gtgcgtcact gcctgcgcca ccgccttctt tccctctttc 150121 cttccttccc tctgcgctgc gctgcgctac gctacgctgc ggggctgtgg ctgggtgggc 150181 gggcgagtca cgtcgccacc tgtcggaggg gaggtaccac tgcagcccag gaaatggagc 150241 ccaaaaggtt ttcagcgaga gccatctgca ttcctgggtt gggaaaaggc atgctcagat 150301 gggactacct gtggatccca agaagggcaa atattcggga gcgggggccg cagttaagtt 150361 ccaggtgggc agagtttcaa tgagttgagg cattttggca ttcggctgtc gaaacaaatg 150421 ggcagcttaa aaccagccac ctcttttcat aactgccgct ttgactccac ttttccctgt 150481 atcccacaga agttggctca ggagggccgt gctccggtgg aatgcccatc tctgggccca 150541 gtgccccatc gccctccatt tgaaccgggc agtgcggtgg ctacagtgcc cccctacctg 150601 ggtcctgaca ctccagcctt gcgcactctc agtgaatatg cccggcctca tgtcatgtct 150661 cctggcaatc gcaaccatcc attctacgtg cccctggggg cagtggaccc ggggctcctg 150721 ggttacaatg tcccggccct gtacagcagt gatccagctg cccgggagag ggaacgggaa 150781 gcccgtgaac gagacctccg tgaccgcctc aagcctggct ttgaggtgaa gcctagtgag 150841 ctggaacccc tacatggggt ccctgggccg ggcttggatc cctttccccg acatgggggc 150901 ctggctctgc agcctggccc acctggcctg caccctttcc cctttcatcc gagcctgggg 150961 cccctggagc gagaacgtct agcgctggca gctgggccag ccctgcggcc tgacatgtcc 151021 tatgctgagc ggctggcagc tgagaggcag cacgcagaaa gggtggcggc cctgggcaat 151081 gacccactgg cccggctgca gatgctcaat gtgactcccc atcaccacca gcactcccac 151141 atccactcgc acctgcacct gcaccagcaa gatgctatcc atgcaggtga gacccctcct 151201 tccttgccct ggccctttgg ggccaccttc cccctatcat gactgggctc tttcattcct 151261 atggccaaga ctttcctccc ctggcctggc atgaaccttt cctgagattg ctgccctgaa 151321 ctttgcctcc ctgacgttct ctcagccttg tcttctcatg gccttccctg gccacaaagc 151381 ctgtatgccc cacagcccag ttctctccat agtctagtat ccttcccatg acagaggccc 151441 acacgggagc ttttgaagga cacttttcta ttctcacagc cccattctcc caagcccttc 151501 acaaacctgt cttcttattt atttttttga gacatggtct ggctctgcca cccaggctgg 151561 agtgtagtgg cgcaatcaca gctcactgca gcctcgactt cccggtctca agtgatcctc 151621 ctacctcagc ctctggagta gctgggatta caggcgtgtg tcaccatacc tggctacttt 151681 ttgtattttt ggtagagacg gggtttcgcc atgttggcca tggctggtct caaattcctg 151741 agctcaaagc gatccaccca cctcggcctc ccaaagtcct gagattacag gcgtgagcca 151801 ccgtgcccaa cccacaaacc tatcttctga tttctctgga ttcaactcat tttttgccat 151861 tttctcctct tctccctctg cgaggtcttt catgtttttc cccatctcta gatctgtaac 151921 tttctagaaa gggccctgta gctttgggtt tgggtgtcat aggcgataat gagctttact 151981 tactatcctt tcaccatttt ttggtttttg agacggagtc tcgccctgtc accaagctgg 152041 agtgcagtgg cgcgatctcg gcttactgca acctccgcct ctggggttca agtgattctt 152101 ctgcctcagc aggcactgcc tgcccgagta gctgggactg caggcgtgtg ccaccaagta 152161 cagctaattt ttgtattttt agtagacacg gggtttcacc atgttggcca ggatagtctt 152221 gatctcttga ccttgtgatc cacccgcctt ggcctcccaa agtgctggaa ttacaagcgt 152281 gagccaccgc acccggcttc ctttttttct tttgcccaga ctggagtgca gtggcgcgat 152341 ctcggctcac tgcagcttcc acctcccagg ttcaagcgat tctcctgcct cagcctcccg 152401 aggagctggg actataggcg tgtgccaccc tgcctggcta atttttttgt attttttgta 152461 gaaacagggt ttcaccatct tggccaggct ggtcttgaac tcctcacctc aggtgagctg 152521 cctgcctcag cctcccaaag tgctgagatt ataggcttga gccactgtgc ccggcctctt 152581 cttgtcttat tttctaaact tctatcatct tccaccttct tcacgcagtc tacctctctc 152641 tgtccttccc ctcagaccct tctgatgact gtaatctcct gtcctgtgtc atccatatgt 152701 ccctcctcca gtagggctca gcctgtccac ccctgccagg cctctagttc ttccactctg 152761 cctttgttcc actttggacc tccttgtgtt tgtctgactc agttcccagg cttggatccc 152821 ttcacccaaa tgcatggttt gccccaaacc tgccttctgg tgacaccttc ctcttctctg 152881 ccctccagcc tctgcctcgg tgcaccctct cattgacccc ctggcctcag ggtctcacct 152941 tacccggatc ccctacccag ctggaactct ccctaacccc ctgcttcctc accctctgca 153001 cgagaacgaa gttcttcgtc accagctctt tggtaaggat ggaagttggg gtaggcagct 153061 ccaatgagaa aagggcagaa aggaggtatt tgggtggggg gatgggccta gttgggcttg 153121 gggagggatg aggaggtgcc tagaggagct gggcatggga ataggagagc tggagctctg 153181 cccaagagaa gcacgagttt tagtgtcagc ctaagaggtt cgaatcccaa ttccacccta 153241 ccactggcaa cttacagaac tgggtgtgtt acctcacttt ttagttcatt acctttgtat 153301 gtaaaggagc tggctatccc ctggtccaga gcaggtactt gttatccgtt ggtcatatgc 153361 cccttgccct tcctgctcac agctgcccct taccgggacc tgccggcctc cctttctgcc 153421 ccgatgtcag cagctcatca gctgcaggcc atgcacgcac agtcagctga gctgcagcgc 153481 ttggcgctgg aacagcagca gtggctgcat gcccatcacc cgctgcacag tgtgccgctg 153541 cctgcccagg aggactacta caggtaccct agggtgcccc agcccagggg acatgggctc 153601 agcgagcctg ggaggagctg tgggcatggt acggctgggc accgtgctcc tgggggaggg 153661 aacccctcct ctcccaaccc cttcggtaag agggggcaag gtcagagttg gtctcaagtc 153721 tcttacctct ctgctatgca ctgccccttc cctagtcacc tgaagaagga aagcgacaag 153781 ccactgtaga acctgcgatc aagagagcac catggctcct acattggacc ttggagcacc 153841 cccaccctcc ccccaccgtg cccttggcct gccacccaga gccaagaggg tgctgctcag 153901 ttgcagggcc tccgcagctg gacagagagt gggggaggga gggacagaca gaaggccaag 153961 gcccgatgtg gtgtgcagag gtggggaggt ggcgaggatg gggacagaaa gcgcacagaa 154021 tcttggacca ggtctctctt ccttgtcccc cctgcttttc tcctccccca tgcccaaccc 154081 ctgtggccgc cgcccctccc ctgccccgtt ggtgtgatta tttcatctgt tagatgtggc 154141 tgttttgcgt agcatcgtgt gccacccctg cccctccccg atccctgtgt gcgcgccccc 154201 tctgcaatgt atgccccttg ccccttcccc acactaataa tttatatata taaatatcta 154261 tatgacgctc ttaaaaaaac atcccaacca aaaccaacca aacaaaaaca tcctcacaac 154321 tccccaggaa catggctgtg actattcttt gcgggatatg ggggtgaggc tgggtctcag 154381 tgcagcggtg cttagagaga actgcagcca gggcctggca gggaagcggg aagagaccag 154441 accttcttaa ggaacggggt gtgggggctg ggaagaattg gagaggggat ccctgaagag 154501 cccagagcct tgactcagct ggaggcggta ccatgggtgc tggtgccccc tcctgcccca 154561 aggctgggca ggaccgctgc gcgccagtgg aggcctcggt ctgggctctg tggcttatgc 154621 ctgtgagacg cgggtggatt gcttgagccc aggagttgga aaccagcttg ggcaacatag 154681 agaaacccca tcttaagaca aaatcagctg ggggttgtgg tgcacttgtg gtcccagcta 154741 ctcaggaggc tgaggtggga ggatcacttg ggggtcgagg ctacagtgag tcatgatcgc 154801 accactgcac tctagcctgg gtgacagagc aaggccctat ctcaaaacaa acaaacaaaa 154861 gacaccccaa ccaaccaaac actagaggcc tgggacaggc ccacctctga cgtggtcagt 154921 tctaggaatc tattcagacg ggtctcactg ctaggagcca ccagagtcct ccggaacaag 154981 ggcttcagag tcctgaaacc tttccgtgga atactttaag aaacggttat gaggccaggc 155041 gcgacggctc acgctggtaa tcccaggact ttgggaggcc gaggcgggcg gatcacgagg 155101 tcaggagttc cagaccagct tggccaacat ggcaaaaccc tgtctctact aaaaacaaaa 155161 cattagctgg gcatggtggc gcgcgcctgt cgtcccagcg acttgggagg ttgaggcagg 155221 agaatcgctt gaacccggga ggtggaggtt atggtgagcc aagatccccc ccagctgcac 155281 tccagcttgg cgacagagca agactccgtc tcaaaaacaa aaaacaaagc cggttatgaa 155341 gcgggggtgg ggtgggctag ttttaatagg tccaggcgat tagtacctgg catccttaac 155401 cacctacagt ttgagaaggg agtgggtgat aaaagcctgg aagggaaggg aaatcgggcc 155461 gggcattagg ctccatcgct catcaataga caaggccttt aggaaactgc gacaacggct 155521 tttgctctgg gcctttactg ccgaatccag gtctccgggc ttaacaacaa cgaaggggct 155581 gtgactggct gctttctcaa ccaatcagca ccgaactcat ttgcatgggc tgagaacaaa 155641 tgttcgcgaa ctctagaaat gaatgactta agtaagttcc ttagaatatt atttttccta 155701 ctgaaagtta ccacatgcgt cgttgtttat acagtaatag gaacaagaaa aaagtcacct 155761 aagctcaccc tcatcaattg tggagttcct ttatatccca tcttctctcc aaacacatac 155821 gcagcagtgt tacagctctt ttagaatttg tctagtaggc tttctggctt tttaccggaa 155881 agcccctctt atgatgtttg ttgccaatga tagattgttt tcactgtgca aaaattatgg 155941 gtagttttgg tggtcttgat gcagttgtaa gcttggggta tgaaggtttg ggccacgcct 156001 gggcgcttcc ggctgcgccg gatgctgttt cctttccgct cccaggggcg ttgggaacgg 156061 ttgtaggacg tggctcttta ttcgtgagtt ttccatttac ctccgctgaa cctagagctt 156121 cagacgccct atggcgtccg cctcgaccca accggcggcc ttgagcgctg agcaagcaaa 156181 gggtgagaat cgtcctagtc aaggcatagg ctgctggcct ggggtagtca aggcatgggc 156241 tgctggtctg gggaggatgc gggcggagga tgtggggcga caaacctggc tacgtccgcc 156301 gggaaaatgg ggtaggggac gccgggtacc gtccttctaa gtggggcgct tgcccccaag 156361 acttgggatc ttattgggtt acctacagcc tcaatccact aattccttgc gctctccgct 156421 gggcccgctg tctgttctcc gacgcctacc cgggacgcct ccctgggatg cttctggcgc 156481 gcagtggtcc tcgcggaggt gatccaggcg ttctccgccc cggagaatgc agtgcgcatg 156541 gacgaggctc gggataacgc ctgcaacgac atgggtaaga tgctgcaatt cgtgctgccc 156601 gtggccacgc agatccagca ggaggttatc aaagcctatg gcttcagctg cgacggggaa 156661 ggtgggtcag acgcgggaag gcgggtcaga cgcgggaagg cgggagttgg gtcgggagag 156721 ggcgccggat ctgtgggccc atgagcggtt gtcccctttc tcgccacacg gcggcagcca 156781 caatctgact aacattcttg gcactcagag cccagggtct accctgagct tgtgcgtccg 156841 agttgcgttt tgtacagtag aggttctgta tcccttaccc gaaatggttg ggaccggaag 156901 tgtttcgggt tttttcagat ttcggaatat ttacatatac atggatatct tgggggatgg 156961 gacctaaatc tgaacacgaa attgttttgt ttcatataca cgcaccttat gcacatagcc 157021 tgagggtaat tttatataat agtctgaata attttgtgca tgagacaaag ttttgactgc 157081 gttttgacta cagtctcccc caccccccac tatgaagtca ggtgtggaat tttccatttt 157141 cggcctcaca ctggcactca taatttcaga ttttggattt tctaattaag gatggtcaac 157201 ctgtactgtt aatcacatta ttattccatt ttgtgatgag gaagcaaact ctaatggtgc 157261 agactctcga gtagcagagc tggacttaaa tgctgttgtg ccacctttta gtgacatgtc 157321 tgcatgacag cagtggctgc agtctctcca gggtgtttca gctaattttt gtttgggggg 157381 ggttggtggt taaatttttg gtgagatcct ggtgttcccc aaaggtgttg actgattcag 157441 ctctgctgcc tcatagcctt cctgagttcc agctacagac ctggggtctc gtgttcattc 157501 ttggggttcc agtgactaaa ctgctaaaat ggttcacctg agagcaagaa gcacactgcc 157561 ttgggacatt ttcattcact gcatcccatc tagttgtgcc cagtgtagac caaatgtcat 157621 catccaaacc acacgggaca gaggcctgca tgcgtcctgt ttggcaaaca gctgcccagc 157681 cagtggggga gcagttcatg cttagactac caccccctcc aggtgtctta ggcacgctgg 157741 tccttaggag aagggttgac cttccactcc ctcttgcagg tgtccttaag tttgctcgct 157801 tggtcaagtc ctacgaagcc caggatcctg agatcgccag cctgtcaggc aagctgaagg 157861 cgctgtttct gccgcccatg accctgccac cccatgggcc tgctgctggt ggcagcgtgg 157921 ccgcctcctg agagttggcc ctcccttgtg ccactgccag gggaggaaag gccttgatgt 157981 tccagacaat aataaatgcg cctgtgactt agccttggtg tcagtctctt gcggacctga 158041 caacccccat ctctccttcc ctgattccct ctgcctttcc aggccccatc cccctgaaca 158101 gctcctccct atggtcctgg ctgggcctaa ccctgcccca gggcctaacc ctacctgagg 158161 ctcctcccct tcccccgggg caggttgaga ggctggagtg ggtccctcag cgccctgggt 158221 gggtgggcct gcacaggggg tacctccttc tctgaggaac tgggctgtta gggattttcc 158281 ttaggccctt tggtttccgc ctacggagag gtttccccca ttggttgctc ttcctcagcc 158341 agggttactt cctggtctgt tcccctaccc aataccccgc cgctctgtca gcttgagctc 158401 caggtggagc tccaggtggc tcctcctctc ccgggggaag gcggccctgg accagcaggc 158461 gggcctgctg tactcccgct ttggggctgc agggaagctg gccgctgtgg gcggtctcgg 158521 gccagccccg ccccacctgt ccttttcctg gagactatta gtccagggtt tgtccctgca 158581 gtgccattgg cctggcaggc aggatcgagg aggaagtggc tgattactga gcggttcttc 158641 ctcacctggc ttgggccact gtgcacagct gtgccgctgg ctcagccccg ccccctgcgg 158701 ccctctgccg tggcttcccc ctccctacag agagatgctg tcccgtgggt aagtcccggg 158761 caccatcggg gtcccagtct cctgttagtt ttggagggag ggagggcttt gttgatgctc 158821 actccgacgt gtgtgaacgt gagtgcgatc tgccgctgcc ctgcgcctgt ttccggtccc 158881 tatgaacttc cccttcccgc aaggtgtgag gacccccggc tcactcatgc tcctctgccc 158941 cctctttaac attttcccct ggacaagtgt gtatctgttc tctccattgc atttctactt 159001 ccagcctctg ggctcctgct tctgcctcct gcttaggacc tgtccccctg ggtagctcac 159061 aacacctcaa acatagcagt cagaggccac ccgcgaaggc cctcccacgt ccagccaact 159121 tctccgcact tcccaacatc agactttggt cccatcttct ttgtttcctt tcacttccct 159181 ttcccctgca tcattcattc aacaggtacg tgttgagcat ctattatgca ccaggtgctg 159241 tttaagatgc tggtaatact ggagtgaaca agacagacat ggtctctgct ctcacggagc 159301 ttacattcca gtgggaggtt acagaccgaa caaataaccc aataaattgg atcattgcag 159361 attctcagaa gtattacgca gaaaatagac agccttggcc gggtgtagtg gttcacacct 159421 gtgatcccag cactgtggga ggctgaggcg agaggattgc ttgagcccag gagtttgaga 159481 ccagcctggc caatatagtg agaccctgtc tctacaaaaa ataagaaatt agctgggtgt 159541 ggtggcacac gtcctgtggt tccagctatg gagaggctaa ggtgagaggc ttgcttgagc 159601 ctgggaggtc aaggctgcag tcagcgatga ttgcaccact gcacaccagc ctgggcgaca 159661 gagtgagacc ttgtctcaaa aaaaaaaaaa aaaaaagaaa atgaaccagc ttcatatgct 159721 agcaagtgac tgggtgtgca ggtgacatta ctagctggag ggatcaggga ggccttcccg 159781 aggaggtgac atttgagctg agacccggat gaggaggaag aggagctggc catgtgacgt 159841 agtgatcaag agtcaagcat ctctgggcag aggagatggt gagcacaaag ccctaatgtg 159901 ggaacaaaca aaaaaaggac agtgtgcccg tggcagagga ccctagtgga gcggaggcag 159961 ggccacagca ggttagacca tgttggagct aggatgttga aagtgaaaac ctgacgagat 160021 gaggtggcgc acgtctgtga tcccagcact ttgggaggcc gaagggggaa gattgcttga 160081 gctcaggagt ttaaaaccag cctgggcaac atagagagac cccatctcta ttaaaaaaaa 160141 atactgggta tgatggccca agcatgtggt agtcctagca gtttgggagg ctgaggtggg 160201 aggatcactt gagcccaaga gttcaagacc accctgggca acatagggag agacctcatc 160261 tctactacga ctacgactac tactactact aataaatagc tggatgtagt ggcatgcacc 160321 tgtggtctca gttacttgga aggctgaggc aggaggatca cctgagccaa ggaggtcgac 160381 gctgcagtga gttggattgt gacactgcac ttcagcctgg gtgataaagc aagattctgt 160441 gtcaaaaaaa aaaaaaaaag agagggaagg aaagaaggaa gggaaggaag aaagaaaaag 160501 agaaagaagg aaaaaaagga aagagcgaga aagaagaaag aaaaggaagg aaggaaagaa 160561 agaaaagaaa ggaaagaaaa agaaaaagtg acacccagtc gaaagaagaa aggaaagaaa 160621 aagaaaaagt gacaaccggt cgaaagaaaa aagaaaaagt gacaaccggc tgggcatggt 160681 ggctcaagcc tgtaatccca gcactttggg aggccgaggc aggtggatca cgaggtcagg 160741 agttcaagac cagcctggcc aacatggtga aaccctgtct caactaaaga tacaaaaaaa 160801 aaattaggct ggcacagtgg tgcgcacctg tgagtcccag ctactaggga ggctgaggca 160861 ggagaattgc ttgaacccag gaggcggagg ttgcagtgag ccgagattgc gtcactgcac 160921 tccagcctga gtgcagcggg agagactcca tctcaaaaaa aaaaaaaaaa gaaaagaaaa 160981 agtgacaacc tgcttacaga gtactggcga gtttgtgggt gggtggctcc ctagccctgc 161041 tgattcttgc ttctcacact catgtctgcc cctgccccag tgcacatctt gtcactgtcg 161101 gccccaccga tggggttcct actgagtctt ctggtccctg atcccgtctg tggtcatttt 161161 cctgccaggt agcttggcca ggcctcccct ggtgcagatt tcatccttgg tttctcagcc 161221 tggccttgaa tgaccctcta cagcagggtc cccacctctc agaacaactt tgctccagcc 161281 acatggcttg ctcacggcca ggcactgccc atgtggactc tgtgcgtgcc acctctttgc 161341 cctgacccat gttgcctctg ggggagcact tcttcctcca ccttccatca tgggctgtgg 161401 cagtgcccat cccatctgcc cccgacgctg tctgctgcag tatggttgtt gggggaaagg 161461 gcaccaggct ccggcgtctg acagccgtgt tttacccacc ttcctactca ctagcttgtg 161521 accttgggca attacttaac atctctgagt cttagtttct gtttctaaaa ttgggtgaat 161581 aacacctact aagtagggtt ggcctgagga ttaatagtat aatgtaaaag ctggcagcac 161641 tgaaaccctg ccacttacca gcttttcaca tcagtatttg ggaaatattg ttaagctcat 161701 ttgtcaggcg gggattctga ggctcagagc agttccagaa ctttctacag attattttgc 161761 cttgtttgcg cttccagact gcctatcttc ttgtatcacc attgatcttg atctgtatgg 161821 tttttaattt tttttttttt gagacggagt ttcactctgt tgcccaggct ggagtgcggt 161881 ggcatgatct cggctcactg caacctccac ctcctgagaa gctgggatta caggcatgtg 161941 ccaccacacc cggctaatct ttgtattttt agtagagttg gggtttcact atgttggcca 162001 ggctggtctc gatctcctga cctcgtgatc cgccagcctt ggcctcccaa agtgctggga 162061 ttacaggcgt gagccactgc gcccggccaa cttcacgttt atacacaccc atgcaaacag 162121 catccagata gagacaaaga gccttccctg taccctaaaa gtttcccaga aattgttccc 162181 agttagcata tttattttta taaaggtaat gcatgcccat catataacat tcaaaaaggt 162241 atgtagagaa ccaagtgtct cccccagccc tgtcctccag ccacccagtt tccctcccta 162301 ggggaagcca ccaatatgtg tttcttatgt atcccctgtt gagctgcttt tcctcgtttt 162361 ggtttggcgg tgttgatgtt tgtatttgga attacaggta ggcagcatca tataccttag 162421 tgtttagggc ctctaagatc aaccagccct gagaaaatca gccatggtga ggaccttgtc 162481 ccccagcccc ccaggagata ggccccctgg tgggagtgct ggggcagggc agaggcctag 162541 ggacaagaat tagaaaggac ccatgttgac agggctgctc agggtcatgt tgtccatccc 162601 tctgccacag tggcatggac aaactgcata tgttggttag aggagggcac ccttctctct 162661 tgcaagcatt ggcaaggtct taactattag tctcctgctc ccatggcagc ccctttggac 162721 aaggaggctc ttaatctctg ttctttgaag ccctgagggc tggtgtatag gagttcaaag 162781 cactggcttt ggaaccggac tgtctgggtt tgaatcctgg cactgcagct gactcactga 162841 tggactcagg caatgcctta aactccctga gcctcaggtt ccttgtctgt aaaatgataa 162901 agatagcccc tgtttcatag ggctgtggtg agaaaccaat cagacaaggc atgtgaacgc 162961 cattatagca cagcgcccgg catccagcag gactcactcg atgacagttg tcaccgccat 163021 cattgttatt agcgtgggcc agggagggct gcgtaaaagc agctggtgga ggagggagag 163081 atgccgtggg accgtctggg ttcgcatgcg tgaagtatta tctgggcctg gagtgtgcaa 163141 ggcacacatg tgtccttact gcatgtgttg tcacatatgt gcaatgccat gctcctgagc 163201 ctttgattgc agacgtgtgg gaagtgggcc ccgtccccac ccccagtgcc accctgctct 163261 gcttctcttc ccttgctgtg ctctaaaacg agaagtacaa gtgagttccc ccaaggggtc 163321 ggccgcgcct cttcctgtcc ccgccctgcc ggctgcccca ggccagtgga gtggcagccc 163381 cagaactggg accaccgggg gtggtgaggc ggcccggcac tgggagctgc atctgaggct 163441 tagtccctga gctctctgcc tgcccagact agctgcacct cctcattccc tgcgccccct 163501 tcctctccgg aagcccccag gatggtgagg taagggcctg ccacccacgg tagacaggag 163561 gcaagggtgc ctggtgccca cgggacccct cctcactgcc ctgcctgggc cgcccaggtg 163621 gtttcaccga gacctcagtg ggctggatgc agagaccctg ctcaagggcc gaggtgtcca 163681 cggtagcttc ctggctcggc ccagtcgcaa gaaccagggt gacttctcgc tctccgtcag 163741 gtaggtgggc cccccgcaac cccgggcatt ttggccactc tcttgtgcca tccaggccct 163801 gaaccactca ttcctggttc cccgtggcag tgctgactcc ccgtctgttc ccttgccccc 163861 aacccccaca ctccccatcc ctgtctgtgc ccacccatgc ccatgtgtgc ccccacccag 163921 gacctcagcc gatccctgcc ctcctgcctc tactcctgca ccgactggcc tcaccgcctg 163981 gtgccctgca gggtggggga tcaggtgacc catattcgga tccagaactc aggggatttc 164041 tatgacctgt atggagggga gaagtttgcg actctgacag agctggtgga gtactacact 164101 cagcagcagg gtgtcctgca ggaccgcgac ggcaccatca tccacctcaa gtacccgctg 164161 aactgctccg atcccactag tgagaggtga gggctccgca cccccgccat tcccaagcag 164221 ggatgagccg gctcccaccc tgaacagcca gggaggcagg gagactggca gccggcgctg 164281 cctaccctcc atcccctccc ctccctgcac cagctggggc tctcaatgtc cctcctccct 164341 gctgtcctgg gacctggtgt ctcagagcct aacctaccac cctttccacc taaccccgag 164401 gaagccacag aaagctgcct cgccctactc cgggagccct ggccgctgca acccaggtcc 164461 cactggagac agggaggcca ctgctggtgg ccagcatgtc gtgcaggcca gctctgttgt 164521 tagaaagctc ttcttcctct ggaatcgagc ctgccttcct ccgtctgccc ctcaccccag 164581 cacatgttag gacagtgagg agctgacact ggggtgaaga tggggatgaa tgcttgccaa 164641 gacacttgat gccttgtccc agccgccccg tggggatggg tctgtcctgt ggggtcaaat 164701 aggtctccgg cccaaacaga gatcattgag agcacgatgt gaagtgttca cctgtgtaaa 164761 gtgtctcacg ctgtcccggg cacagagtaa tactccaggc atttccttcc tgtggcctcc 164821 ccgactcctc ctgtggtctc ccaaaggcat gggctggggg ctgggggctc tgaatgctcc 164881 tcatgacacc atggctcctt tcagcagccg catctcaatg ccagatcccc ttagagtaaa 164941 gggcagcgga ataacgctag ggggttttca catgcacccc tgggccaagc cgacttgccc 165001 ttgccgtgga tccctgcatt catggatcgg ttattgaaat gatcgggaac cttgctcctg 165061 ccagcttgca gcctctctga gattcgggcc tccaaactgc atcaatattt ttggtcaagg 165121 cactgattga aacttagagc tggattcggt cacggtgcag ccctgtggcc cacctgggag 165181 gcctcctttc ctggatcggc ctccttcaag gccttccctc tctctgtgag cctcacatgg 165241 ctggctccgt gtctgccccc tgcccttcct cttccccacc gcaacactca gggggctttt 165301 ggcaccgaga ccctctaaag ctcatgtcct ctctttctcc ttgcctccag ccaggagagg 165361 aggacgggct gaccagtgcc tggaggtgga agagaggagc agggccccag gaggcccctg 165421 cagaggaggc tgaggcctgg gttcaaggag aagagagaag agagagaagg aagggagggc 165481 agtgccgggg cgggaggtta agaccaggga agccgcactg gaggcccttt tgggtgaccc 165541 gtcccaggag ccagtgtcac ccctgagcct gggagtgtgt gagaggctct ttctcccagg 165601 ttctgctgtg tcctctgcct tgtctgtgcg cctcctcctc tgcgagaatt tgcatctgtc 165661 cctcggtggc tctgcgcttc ctgtggtcag cctgacattt gcatggagac ttcctcatcc 165721 tggggcctga gggaaggggc tcagccccct ccccgctacc tggggtccta gcctgtcccc 165781 aggcggtggg ctgaagtagc ccagtggggt taggaggctc tgggggtctc tcggctggag 165841 tcacctccgg gcaggggtga gatgggttgg gacagactgg tcctcccctc cttcccccca 165901 tccctgcggt tggaaaattt gcccgccctc ccctcgtccc tgggctgagg aaacctcaca 165961 acctcacttc tcactctctc cccagaagga gttttgtgtt ttttccatca cgtggtttcc 166021 tgtggggctg ggctttgtgg ggctacagtt tcctcctggg aaaggggtgt gcttcgggga 166081 aagggcttag ttctgctttc tgccctgaca gccccttcaa atccgtttga accctgggct 166141 ccccttcagt gacatcatcc agggcacccc agaaccccct acaccactct ttccccagtg 166201 gggttgtctt ccccgcctcc ctggcggagc gcaccccatc cgccttcctt gtgacttgag 166261 tctgtgtgtc catctcccac cactccctgt ggtgtggcct cggtctgcgt ttctctttgc 166321 ctctggtctc tgctggggca cagtcccatc cttcacggag attcatcctt agcttctctc 166381 ctccaaatat tttgaatatt gccagccttt ctgcctttca gaggtgggct ctgggttcga 166441 agcccggtta gaactctgga ggctaggatg gcttgaacct gggaggtcga ggctgcagag 166501 agctgtaacc gcgccactgc actccagcct gggcaacaga gctctggaag cttgccctag 166561 agtcagtcaa gggccctagg ccagtgagta acagctcagc gtcagtttcc tcatctataa 166621 aatgggggta atatcatacc tagctctcag catgtttgtg agagacctaa atgaggtggt 166681 ggatttggaa gcatgtagcg cagtgcctgg cacacagtag gtgcttgatt tccggcccct 166741 ctctgtgaat gtctctgctc agcgccttcc cctgtggcct gggtcttacc ttccctgacg 166801 ctgccttctc taggtggtac catggccaca tgtctggcgg gcaggcagag acgctgctgc 166861 aggccaaggg cgagccctgg acgtttcttg tgcgtgagag cctcagccag cctggagact 166921 tcgtgctttc tgtgctcagt gaccagccca aggctggccc aggctccccg ctcagggtca 166981 cccacatcaa ggtcatgtgc gaggtaaggc agccaggcgg cgggggagcc tctgctgagg 167041 ctcctgtctg tgaccacagt gtgggtggca gggagggtct gcctgggctt gaattcaagg 167101 ctggggaccc agggagggag actcaagtcc tgtgaatggc ctaatttggc tccccccagg 167161 gtggacgcta cacagtgggt ggtttggaga ccttcgacag cctcacggac ctggtggagc 167221 atttcaagaa gacggggatt gaggaggcct caggcgcctt tgtctacctg cggcaggtca 167281 ggggtgggcc cagctgcctc cccacttccc ctgagctgtc ccccagatgt gagcttctgg 167341 gatctctgag ttgctgactt ctcgctcttc cccaccccag ccgtactatg ccacgagggt 167401 gaatgcggct gacattgaga accgagtgtt ggaactgaac aagaagcagg agtccgagga 167461 tacagccaag gctggcttct gggaggagtt tgaggtgcat ggtggggacc ggcagggctg 167521 gggcagctga ggtggtggca gcggcctggg gccccaggcg gacaccttcc cctccttgcc 167581 cacctctgct cctgacccac cccacgtgag ctcccccgat ggatgccctc tttgggagct 167641 gatgctcatt tccccaccca catctcagag tttgcagaag caggaggtga agaacttgca 167701 ccagcgtctg gaagggcagc ggccagagaa caagggcaag aaccgctaca agaacattct 167761 cccctgtgag cacccaggct gccccattca cccaggatac cgcccctgcc ccagctgcct 167821 cccctcatct cacaggtctc caccctccac gccaggaggg gccatctccc cacacccccc 167881 acagagcctc ccccttctcc aaaaggcctc tactcctccc agaagtgcct ccccaccacc 167941 agcaggcagg ttgccccctg ctcccaacct ccttgtgaac tccctcactc cctccataca 168001 gatgatcccc cacccctgct gcccacagtc ccccgcaagc ctcatggctt ctgagaccag 168061 aatggcctgt tagctcagga gggtctgacc caggtgtggt gagtccctgg ctaacccaga 168121 ccatctcgcc tcctctccgc ccactcccag ttgaccacag ccgagtgatc ctgcagggac 168181 gggacagtaa catccccggg tccgactaca tcaatgccaa ctacatcaag gtcagcagtg 168241 tgggccacgt gggaggagag gctgggccct gggaattccc tgtctggtgg ggggacccta 168301 gatccagaga cagctgggca aagccgaagc tggcttcttg catgggtgag ggtggcagtg 168361 gttcagggcc tgtgctgggc caaggggctc actgtcttgg ggtgcgtctc tccacgcttg 168421 cgtccagaac cagctgctag gccctgatga gaacgctaag acctacatcg ccagccaggg 168481 ttgtctggag gccacggtca atgacttctg gcagatggcg tggcaggaga acagccgtgt 168541 catcgtcatg accacccgag aggtggagaa aggccgggta gggcgccccc ccttccccgc 168601 atccgccccc gtgcttgtgg tcatgccatt aagtcgaaga gcagtcagat gccagggcag 168661 aaagggatct caggggtgag ggtccggccc ttgttgggaa actgagggct agtgacaaag 168721 tctcgactac acaacgtgac ccccagatcc ctgcatgcat ccctgggctc ttctgagctc 168781 cagacccagg ttccaggctg tcctccttcc tcctacccct gccccacctg tctgcatcca 168841 ggcccctcct gtcctccctg ccccatagat ctctctggag tctgcccctt accctgcagg 168901 ctccccctac acagcaccct ctgtgctgcc attgaagtga tcccatccgt gacacaaact 168961 gggtcaagtt ccttcctttc tgaaatctct tccatggctc ctggtcacct ttgggataaa 169021 gtcgcactct aaggcctggc attcaaggtc tggtggcttc cctctgaccc gcacgcttct 169081 cttgaaggct caccgccccc agcagcccca gctctttcag gttcccagcc tttctttgca 169141 caagctcatt ttctgctagg aaatgactct ctccacacta tctctgcctg gcagatgcct 169201 cgtttttgaa gacacagccg gagcgctgcc tcctctgtga atccaggtct tgtttcctcc 169261 aggacctaga gggagaatta cgtctttccc agccacgctc ctcagcgcgg tgtctccccc 169321 ggtcacctgt ctctgtgagc tcctcgaggc acaggggcac agactgggtg ttatttgtgt 169381 ctgtgaagct gtgtggtttg cacagcttcg gggacaatgc ctgccctggc aacgtttgtt 169441 gaatgacaaa cggatgtacc ggtgaagtgg ctggccaggc ctcaccacct gttggtggtt 169501 gatctgagac gagagcccag gtctcctgcc tctctgccag cccatccgtc catccaacaa 169561 atgtttgggc cggtgccagg cactcagaac atagagcagg acctgggatg ggccacagtg 169621 ccctgctctg tgcctcatcc ccacccgacc ctccctttcc agaacaaatg cgtcccatac 169681 tggcccgagg tgggcatgca gcgtgcttat gggccctact ctgtgaccaa ctgcggggag 169741 catgacacaa ccgaatacaa actccgtacc ttacaggtct ccccgctgga caatgtgagt 169801 ggcccccacg ccctgcccca ttccgggagt ccctccctgg acttgttctc ctctctggtc 169861 gggtagggtg agatggatga ggtgttccga gagaggaggg ggcactgacc ctatgtcctc 169921 ggcttaggga gacctgattc gggagatctg gcattaccag tacctgagct ggcccgacca 169981 tggggtcccc agtgagcctg ggggtgtcct cagcttcctg gaccagatca accagcggca 170041 ggaaagtctg cctcacgcag ggcccatcat cgtgcactgc aggtgaggat gataatcctg 170101 atggtagtag tgacagctga gaagtaaata ctgctaagtg ccatgagctg ttataagcaa 170161 tataaacgtt agctcgcaca ttgagtgccc tccgctcacc cccggcttct cctgggtccc 170221 ctcatggctc cagaaccctg ggtggatcgt ggctggaacc agccccactt tggccctctg 170281 cctgtgggta tcttcctcag agccctctcc ggatgtacca tctcgcccaa ccctgccaaa 170341 tacagaggag gagcccggga cccagttgct ggccaggccc aagctagtca gggcaaggcc 170401 gggcaggcac ccacagtagg cctgtgtccc ggctgctccg ctttctctcg aggtcccatt 170461 ctgttggttt cttctcccag gaacatctat gaggcatgtg ctccccattc ctcctctttt 170521 tccatcggta gccgcagggc ttcggcttct tcctgactct gccctctctc ccagcttccc 170581 caggcagtgc cccatcctgg cccccagggc tgtgtgggga tgggtgatgc ttctttgggg 170641 ctgcacataa ctcctctgtc tatctacccg catgtttgtg atcaggagac ctctggtaag 170701 gtgcagaggt gggggctgca aggaggagca ggggttccac aggtgagccc actgagctgg 170761 cctggcctgg gtggatgaga ggcagtgggt gcagggcccc tccgcttacc agctgtgtgg 170821 tcttggacaa attacttaac ttttctaacc ctcagcttcc tcatctgtaa aatcaggatc 170881 tcagggttgt cgtgagaact caatgagacc ctatcgttgt ggctggaatt ccgtcagccc 170941 tcaaaaactg ggcgctgtta ctagtttagt aactcacatc aggcagagaa taggggaatg 171001 ggaacctgcc ttgccccggt cccttcccac tccctccgtg gaccccaggc ctgcgacggc 171061 ctctggcttc ctcctcttcc cccagcagct gtttgtcctg ggacagggca agtcggctga 171121 atctagaggt gcccccgatg ggctgtccgg ggacgcggct ctgtcctgtg ctctctcagg 171181 gacaggccca tccccgagag ctaccctcct gctcacccgc cacacacaca ttcacacact 171241 tcttgaaagc cccatggcct ttatttagac gttacaggaa ggaagtgggt gtggggggtt 171301 atttttgaca atctgggttt gaaattagac agcgcgactc agggcatcag cttgctgggc 171361 tcagctgagg gtgggcctgg ggtctccctg aggtctgttt gcccagggct gggaaaggag 171421 agaaacttcc tactgcactg ctcccctgag tcccctgacc ctgtgccccc gcaccctgct 171481 gtctcagggc tatcctttcc ctgacgtcag ggtttgaagg aaaagggaag tgaagccatg 171541 ctgagagacg ctccataact ccttcaggga gaggcgggga gggctcaggg tacctgggag 171601 ccggcaggac agtggtggga tttgggggtc ccaggtcttc cggggtgggg gcagccactc 171661 actaggagtg aggagtcggc gcgaggagtg gaggagggaa ggatggtggc agctggggag 171721 ccagcgtcag caccgcagag cccgaggtgg agcgtgtcca tgcagagctg ggcaaacctc 171781 catcatcact tgcccggtga ccctgggcac attccctccc atcactggag gctcaggctg 171841 ctcctgtggt gcctggggct ggagctgagc gctgggtacc ccccttcccg gggagggctt 171901 gactggcctc tgatggcacc cccgtctttc cccagcgccg gcatcggccg cacaggcacc 171961 atcattgtca tcgacatgct catggagaac atctccacca agggtgaggg gcacctgggg 172021 gtttgggggt ggggggtgag cagcccctcg gtgtccgcct atgcctggac ctgaggtttg 172081 actgcccccc acccaggcct ggactgtgac attgacatcc agaagaccat ccagatggtg 172141 cgggcgcagc gctcgggcat ggtgcagacg gaggcgcagt acaagttcat ctacgtggcc 172201 atcgcccagt tcattgaaac cactaagaag aagctggagg tcctgcaggt gcgtgcagag 172261 cagggcctgg gggggggggg ggctgcagtg caggatgggt gccacctggc cctgctggga 172321 ccaccacctt cccactgtcc ctctgcccac agtcgcagaa gggccaggag tcggagtacg 172381 ggaacatcac ctatccccca gccatgaaga atgcccatgc caaggcctcc cgcacctcgt 172441 ccaagtgagt ggccctgact gccactgccc ggcatccacc cctttgtcct gcccagcccg 172501 atcctcactt tctggagagg acaagtgttg cagctggggg gacctggctt caagttcagg 172561 cttggttctc accccttctg ttcataagca tttcctgagt gcccacacgt gtgggcctct 172621 gctaggtacc agcagcgcac tcgtgtatga gatgtagcct ctgtcctcta ggagcttgga 172681 gtctagtgca gggaccgtgg ctgcgtcacc tgtgagacgg ggtggccaga ggggactgcc 172741 agtgccgggt ccccctgtgc tgtctcctga cctgcaccaa ctgcctgtac ttgcccccct 172801 gcacccggct gcagacacaa ggaggatgtg tatgagaacc tgcacactaa gaacaagagg 172861 gaggagaaag tgaagaagca gcggtcagca gacaaggaga agagcaaggg ttccctcaag 172921 aggaagtgag cggtgctgtc ctcaggtggc catggtacag ctcttctgcc tgggtgtcct 172981 ccctgccctg ccctgtgtcc ttggctccac tgccttccct gggtggatgg ggtggccgca 173041 gcctcattct gtgcttccca gctgccccag accctcttgt tccacctcca ggttccagct 173101 accctctcac tccctcactc ccttctcttg gcagcctcag ccctgaccct gtggaagcat 173161 ttcgcgatgg acagactcac aacctgaacc taggagtgcc ccattctttt gtaatttaaa 173221 tggctgcatc ccccccacct ctccctgacc ctgtatatag cccagccagg ccccaggcag 173281 ggccaaccct tctcctcttg taaataaagc cctgggatca ctgtgtgtcg cctctgagcc 173341 ctttgcttgc ccagtgagtg ggcggccaga gggcagggca ggatgggtaa ctgtgtgtgc 173401 ctccgtgcgt gcctcgcgtg aaagctccgc cttccgtcag acggacgtgg gtcgggactc 173461 cgcctcgcac gtgggagggt gaccgtgggt gaagctcccc agtctccttc tttaaaatgg 173521 agggcgatca taacagggtg gttgtgaaaa gcaccgagat gacggctgac gataagacgg 173581 gcacagtgac tcatcacacg cttgccatgt gcccaggcac taaaagacta cacacgttag 173641 ttcagtctag gcacttctgt cattctcatt ttaccgtggc ggaaactgag ggacagaaaa 173701 actaagtaac ttggtcactt gcccaaggtc acagggctat ggaacagtga ggctgggatt 173761 cgaacccagg ctgtctgacc ccagagccca cactccttac cctggagttg cagctggggc 173821 caccctcagg ggggccctga tcacactccc ctgatgctga gttccagatc tgaactaaga 173881 agagtagtta acagccggaa gcgcagacct gaggccagcc cggctgcgtc ccctctggcg 173941 ggaacaggga caggctcctc agagcacccg ggcacgccca gctcctcccc tcatccaggc 174001 cgctgctgcc cttatcctct tgggcagagt ttgaagagct ggctgacgtg aagagtgctt 174061 tgttttttgt cccctcttcc ttcccccatg tcaggagtgg ggtttcttct ttatttgaaa 174121 cactggtgtc ctggggagta aagccggtgg gagtcatccc tcaggaagtg ctggcgccca 174181 ctcctggaaa ggctgagaca gcacaggtcc caaagcccag aggctgggcg tgcattactc 174241 agcaaatcct tacagagccc ccggcgtcac aggcattcac agtcccccga cctcctggaa 174301 cttaggaggc tggtcaggga gacagattca caaaccgatc acaagcatca aataattgca 174361 ggcgggtatt aagaaggaag caaacaaagc ctgggagaga gaacagcgag ggattgagca 174421 acccaatagc tccctggggc tgtggccctc cagggcacgg gtggggcaat cacctgtcgg 174481 gcccaggtcc cctctcccag caacgccctt tctatacaag ccgcagctgc acaaaggcaa 174541 gtcccacctc ctctaaccgc ccttgggcta cctgcccttg gggtgggact tggactccac 174601 tgagggctgt gctgtgaggt gggtccggga gcagctcggt ccggagagtg gagcgcgatc 174661 gttccctctg caggcctcaa tcgaggggca aggctgatgc tccgggccag ctggggtctc 174721 tgggtagcgg tgggtaactt cacactagcg acaccttgct gggacccgcc tgcccttcac 174781 aggcctggcg gggccttctc cctccccttt ccctcagggg atcccagcac aggctgggca 174841 ctgcggggcg gcacagccca actcctgccc agctgacccc tcgctgacct cagggatctc 174901 tctggcctgc agctccgctg tgggcagggt ctgaggccac agaggaatgg gctagtcctg 174961 ggggcagcat ctgctgtggg gaggggaccc aaggacagcc cccgcttttt gtacctctgg 175021 agacaggggt gagactaggc aggttggaga aaagaggccc ctgggagagg gtgggaggcc 175081 tagaggagtg gccaagcctt agaggaggtg cccgtggctg gcgctgggag ggaaggggtt 175141 aaggcagtgg gggggcagcc tatggcagga ggacacacct gtgcgcaggg tggcaggcgg 175201 ggcccaggta aggagcctgc gctggctgcc cggcaggcgg agaaggaagg aggaagagcg 175261 gaggccaggg cgggctctag gccgtggaat ctggggcctt aaagcccctt cgtctcccca 175321 gcacccactc tctgggggca ggtgggcccg gtgacaggta aaggccacca ggggagaggt 175381 cctgggctga gcttgggact gcagaggggg gatgagggtg ggtaaatcgg tgtgtgtcgc 175441 gggtcgggaa aggctgccgg gggtagggga aggtggctca gaggcggcgg gccgacggtc 175501 gaggggcttc ggagggcctg cttggactgc aacctgggcc tcgtgatcag cgacccaggg 175561 tgtggctggt ggcgggcagc agggctcacc aggaagtgtc cccagggact cgggtggtgg 175621 ggggatggga gccagggatc tgcagctttt ccgcagggat cctgggcctg aagctgcctg 175681 acccaaggtg ggcgggctgg gcgggggccc tcgtcttacc cagcagtgtt tgggtgcggt 175741 tgggagtctc taatactgcc gggtaatgat ggaggcccct gtccctgtgt cagcaacatc 175801 catcgcctca ggtccccagc ccttagctgg ctgcagcccc ctccccactt cccacgcacc 175861 ccggaagccc ctcgtcttga gctgagagcg ttgcacaagg ggtggttctt gttggctggc 175921 tgccactaag ggacacaatg ggccccagcc cctcctccca cccagtgcga tttgtcacct 175981 ggtggatcca gaacccacag tcgaccttga gcttggggtt ggctcgcccc ctctcaagag 176041 acctcacctg gcctgtggcc agggtcccct gtagcaactg gtgagcgcgc accgtagttc 176101 tctgtcggcc ggccctgggt ccatcttcca gtacagtgtt ggatggtcta attgtgaagc 176161 tcctaacact gtctggtaaa gatggctccc gggtgggttc tctcggcagt aaccttcagg 176221 gagccctgaa gaccatggag gactactgac caacaacctc tgaccttcac ccctctggat 176281 gggggacgaa tcactaggca aaggggaaca atgggaagga gacaaaatgg ctgcctttac 176341 agctgcagca agatgtggaa acactggttc cctaggcacc tccattgtct tcccgccttg 176401 ggagcatgaa ataaataaag tgtgaagggg aacatgagga agactgatcc aagtcacttg 176461 acctgtcttt tgtctagttg tttatctcat cttagcctcc cagcaactca ggattggggc 176521 tgcgacgtct gctttgtctt tcatgatact caaccccaag cagagaaaaa tatcttaaaa 176581 atcgaagcag attaacccag tccctctgtt cagatccacc tgctgcccta tctgcatttg 176641 gcagttcctg gacatacttt gcacctatgt tttgtctgag ttccacatct tccttgtctc 176701 cccgcaggga tgggaaatgg gcagtagtgg gagattcagg tataaacggt ggctctgttt 176761 ctctggtgcc tgagattcgg gaaggcctgt cttgggaaga taagtgtcag cttatctgtg 176821 gactgaacag acaggcacct gttctgttgg ccccggaact gcgggcatga aaagcccacc 176881 tccccagggg aggcctgtta gctagcagct cctcacgcct ggtgactgag aaatggagtc 176941 tggggggcct acagcaggcc acgcgatcca ctgggcttct gggcacagaa gtgcctcaac 177001 aggcacaggc aggctcccca cagccacaag gaaccctgct gccaaggaag cagctcctgt 177061 tggggtcatc agatttttaa tctggctcag caaagccatg gctgtccagc cttaaaaaaa 177121 tgggtgatgt gggagaaatg gagggtgtgt gaagagcaca gctgggcctc gagaactcag 177181 ggccagcctt cccagcttgg gtcctgtttc ggagcccagg ccttgcttcc ccttagccgc 177241 cccagccaga ttcttttgcg ccccctgctg gcacacacaa agacagaccc aggagcgcac 177301 atgtgaacgt gcgcatgctt gctgccagtt acccagttca tgaacacgtg aagccttgac 177361 ttcaggttaa gtaataaaaa tttattgaga attcctgggt tggtgtttat ctcctcccag 177421 ccttgaggga gggaacaaca ctgtaggaaa tcactgagaa atcacgcact gtccccaaca 177481 gccccagtta acacagggag gaggaaagta attccccaga aaaggggcta gtcttcagtc 177541 ttccttaatc caagaggggt tcagggaacc ggtgtggggg accatcgcat gatactgggg 177601 cggggtaggg ctgtgctgga cccctggctg gctcctcaaa aactggagaa gcagatccac 177661 ttcctctggg ggtggagttc ttggtgacta ggctcatttc ttacccttga tgaggctgtc 177721 actgtaggaa aaaaaagata gataatgaca ttattagggg acataaatgt gagaggcagg 177781 acactctagg ccattccctc tacgaccctc ctaccctgat tgagggtttg tcttcgggga 177841 ggtgggaaag ggggtagggt aggaggcggg tactggagaa ggtggcctgc aggaccccac 177901 agaagcaaca acagcttacc ttcccctgtg gtgccagatc gccagatgaa caagaaacag 177961 agaagagaaa tgcacatgtt aattgacagc ttcaggcccc actcagcttt gaaccctcct 178021 ttgctcccaa gagaaaagat aaacagggtt gacagccagg aatctcaggc tcatgaaagg 178081 aggaggcatg ttctcatggc cactgctatt tctcatctcc tttcctaaca tccctccatt 178141 caccagagga gtttgagggc tcttgagtaa gaaaactgag tatcatcttt catcactttt 178201 tggttagatg aaaactttat aattaaagtg ctttttatgt gaataatcct atttgatctc 178261 ataaaatcaa tcctatgaga tgcaagatag aaactgtcaa ccaccccctt ttctaatttt 178321 ggttcaatgt ctctgagcca tatgaatagc tgtagcaggc tccagagccc aaacaaccag 178381 actcaggtcc cacgttcttg gtgatacccc acagtgtggc cacatctctc acctggtgaa 178441 actttcatcc tgtaggttca gcacaaggtt gtcagctgtg agatagatac gattctgtga 178501 tgtggcgatc tacagggcag gaaataagac agatgcttgc attaagcaaa gacctcaatt 178561 ccgacccctg caattcagca gctacttaac atggaacaca tgcagattag ggcaggggtg 178621 cagcaatgag taacacatgg tttctgctcg cacttaggga tccagagcgc tgagcagcag 178681 ctatgattag aacaaaaggt cagggaaccc agaggaatca ttcttgacag gaggagtatc 178741 ggagaagcca attttttatt tatttatttt tgagacggag tcttgctctg ttgctcaggc 178801 tggagtgcag tggcatgatc tcagctcact gcaacttccg cctcctgggt tcaagtgatt 178861 ctcctgcctc agcctcctgc gtagctggga atatagctaa acgccaccac agcgggctca 178921 ttatcttttg tatttttagt agagatgggg tttcaccatg ttggccaggc tggtctcgaa 178981 ctcctgacct caggtgatcc acccaccttg gcctcccaaa gtgcggggat tacatgcgtg 179041 agccaccgcg ccggccagag aagccaatct gagtagaaac cggaacaagc aaggttcgaa 179101 ttccctcacc tcaatgtgcc ttaactgaaa gcactttctc aaggcagccc catcagagac 179161 gctgggctga cacactcacc gtcttggaga tattctgggc tgctcgaatc ttgcgaagtt 179221 tgatgtagcc agggttcttg ctcagtgctt ctccaaggtg aggagggtta aggtacacgc 179281 aagggcaggt ctcaatccct ggccctgtcc ttaccacact cccttgctcc agtcctcctc 179341 tgggctgtca gatccaaggt tgcgctcagg tggcttgtgc ccagccctac cgtggacccc 179401 acctgtgggc ctccctgcag gctgctgcca ggaactaggg gcagcaccag aaatgaaggc 179461 aaggccacca atgctattga tctggcctta cagtggggag tcatggctca ggtactacca 179521 ctaagatttc agatctcatc tgtagtcccc acccccaaca aggagccaag ggccagagag 179581 cacggagtcg cattcgcctt agtctcatca gcctgcccat gaaggagaat ggggaactca 179641 ggtgccctag gggctgggct gagatctctc cagcagaagg atatcatctt ggcagcctcg 179701 gcctcaccct cggcctgcac aattttctgc cgctgttcct gctttgcttt ttctaccaag 179761 aattgggccc gctgggcctc ctgctgggct gtggtgggag agagtcaggg agaccctgtc 179821 ctgggtcagg agccccaccc atggagtctt tcctcctcct gcatctcaga agccctcacc 179881 ccacggctct tgcgactcac ccacttgttt ggcttctaca gcagctgtgt actctcggct 179941 aaagctcagc tctgtgatgg ccacatcatc caggatgagg ctgaagtcct tggccctctc 180001 tgtcagctcc cggcggatca acagggatac ctgagggcag gggtgaagag gggaagggag 180061 gggtggtttg aggggactgg ggagctgaaa ggaaggttgc gacccctaac ccttcactct 180121 caatacaaca tgcactgcct tagcttcctg tcaggcaaac ctgtaacata agcctctctg 180181 ctcgaagagg tgcaggaaaa gccaagactt ggccattttc ctctgtgctt cctaacaccg 180241 agtgctatcg agttctaatg ctggctctcc ttatttcaca gtggcaatta gcacagcttt 180301 taggtggaaa tggcactagg gtgacaatgc ttttctaata agctgccttt cctaattccc 180361 aatactctgg gcctaggaag gaaaggctga caccacgcag atggtggtgg gagtcagacc 180421 tgggcccgct gggtgatcag ctgtgaggca ttgaacttgg ccaccacact cttgagcacc 180481 tcgttgacaa tggacggcaa cactcgttcc tcgtagtcca gccctaggcg ctggtacatg 180541 ctaggaagct cctgagcatt gggtcgagac aacactcgca gggagatatt caccatctgt 180601 aggtctgaga ttgaggtcag cagtggctgg tcaaggccaa acaccctttc ccaagcattt 180661 tctccttcat gttcctccct gtatgccctg tgcaatggtg tacagcctgg cactaccctg 180721 gggggaggag agttaatcag gctgacaagg aagacaagac gcagcacagc tgaaatgcac 180781 cctcacccac tgtgaagcct tgaactgcaa accccgtcac cagcagagct gactactctt 180841 tccccctctg tactctgtgc aacctctaac gtgggcgctt tcgtgctgta ccgtaatcat 180901 gtctcctgta aggacttggg ggacaagaat tgtgtttaat gaatctttta ttcccagggc 180961 ccagtctagt aaatgaataa gcgacctgca ctaggtatag cacaaagaag tacattacag 181021 gttacaggtg agctacctga cccagggtgt atatgtgtgc gtgggtgggg aggaggaaga 181081 ggtctgggga actcaaaggc acggctttta tgtatggtct agaaggagag aacaggtgaa 181141 ctagtcaagc ttagggacaa acttctccag aacagagtgt actagtggat gtaactgtag 181201 atatagtaag tcaagcagtt agaaaaaaaa ggctttaaaa caaagctttg atctaggcag 181261 tgaacaagac gagggacaaa caatactctt attgaataca cgtcagggag gagacagtaa 181321 aaagcacgca aacacagtgt gttcagaatt tgctgctttg accccaggag gcaggtattt 181381 ttgttacagc tcttgcagat gtggaaagag gcctaaaggc ctgacactac catattcccc 181441 tggggtttct tgccagctac cttgatcatc ccacctgcca tgtgattacc aagtgctcag 181501 acctaccttt ggagcctgta ggggaggaga tttttcgagg tctggcccga atgtcataga 181561 taatggggta ctggaaccaa gggatcctgg agaggacagg gataggtatt aagaggccac 181621 agtttcggcc gggcgcggtg gctcacgcct gtaatcccag cactttggga ggccgaggcg 181681 ggcggatcat ttgaggtcag gagttcgaga ccagcttgat ctacatagtg aaaccccgtc 181741 tctactaaaa tacaaaaatt agccgggcgt ggtggcgggc gcctgtaatc ccagctactc 181801 gtgaggctga ggcaggagaa tcgcttgaaa tcaggaggcg gaggttgcag tgagccgaga 181861 tcgcgccact gcactccagc ctgggcgaca gagcaagact ccctctccaa aaaaaaaaaa 181921 agaaaaggcc gcagtttcac tagccttgcc cggtttcccc tcaccacgcg ttttttggcc 181981 tgctcctctc cacgaccaca gacaaggaga gattctcttg tccctctgga aaacaacagt 182041 ttgtatgctg ctggaggttc tcgcagcacc cactatccta ggggcaggga tgaggagggt 182101 gggaaaagag cagcgttgaa tcctgttgca cgtccgacta tagccactgc tgggtcggcg 182161 tcaagggtga aaggtcaggg tcagcaggct ctgcccgcca ttacctgaag tgaaggccct 182221 cggccaggat agtgtcctgc tgcactccac cgatccgatt gaagaagatg gctctgtgcc 182281 cgccttccac tgtggggaga tgggtggtga tcaggccagg ccgctgctca gaggaaatgc 182341 taggcccgtg gaggggcgcg gggacagggc aaggggtttg ggggagggac tggaagcgtc 182401 cggcgagcag gcggaggttg ctcaccggtg aacacagatt cgcgcacacc gtaggccacg 182461 gcgccggccc ccagcaacag cttcagggcc gtgcccatgc cccggggccc ggcgggcagc 182521 cgtcccgcca agtccttcaa gttctgggcc atgtctgatc ttgaggccgg cggcactgga 182581 ggtcagaagg gggtgccggc ccgcctctac cccgctccgg cttaggtact gcacccttca 182641 cacgagggtt cgggcccgta aggctggcga aagaaagggc agcggaagtg cgctcccttt 182701 gaaaccctcc cccttagccc actacggacc cgaacttcgc gcacaggaat cgcgcatacg 182761 gaagtcccgc ccctttctgg aaggctgccc tcccagggag ggcagcgcaa gacagcaagt 182821 catctccatt tcctggccca ctttcaaaat ggcagccgga aggaaatttg tgattagaag 182881 ccgcgctgtt cttatttaag agcgttagcg caacttccgg tattgttgca agatggccgc 182941 gcccagtgat ggattcaagc ctcgtgaacg aagcggtggg gagcaggcac aggactggga 183001 tgctctgcca cccaagcggc cccgactagg ggcaggaaac aagatcggag gccgtagttt 183061 attgtggtgc tggaaggggc cagtctggag acagtcaagg tagtttggga caggaagtgg 183121 agaagtagta aatcgatagg ttgggactcc gtggaatgag ggtaaggggc ccagagtgga 183181 tgtagaaagc agagaggggt gaaagatgct tttgaaggaa ggtggcttgg ttggctttgc 183241 gttgatttga catcctggga tggtagtact catttttctt tctttttttt tttttttttg 183301 agacggagtc tcgctctgtc gcccaggctg gagtgcagtt gcgcgacctc ggctcactgc 183361 aacttctgcc tcccgccttc aaacagttct cctgactcag cctctggagt agctgggact 183421 acaggcaggt gccaccacgc ccggctaatt tttttgtatt tttagttcag atggggtttc 183481 accatgttgg ccacgctggt ctcgaactcc tgacctcaag tgatccgccc gcctcggcct 183541 cccaaagtgt tgggattaga ggcgtgagcc actgtgcccg gccggtagta ctcattttct 183601 tttgctcttt ttgaatgata ttctagccct cacctccttg cttccaattg gtttaccagg 183661 attctgtggt atagtagtct aagcagagga aagtttcgtt ccttgcgtca ttccacatcc 183721 caagacaagt tactgggcag atgagaaacg tagttatgta gcctagtctg cccacacttt 183781 ttgtaagggc ttcgtgtttc aattcattag tatccatagt cacctctctc taatatccac 183841 ctatgataca ctgtccagac ctggttatta tttaaaactt ttacatctgc atttttatct 183901 atcattcatc tctttcccca catgtaatag aaccagcagt tctctatctt aaagccttgg 183961 gcagtgttct tcctcctcct ttctcttacc cgttagaact aattgaatag gcccagaaga 184021 aatcgcattg gtttagaagt caggccagga ttttaatctt cgttctaaat acactttttt 184081 tttttttttt ttttgagatg gagtctcagt ctctaccagg ctggagtgca gtggcacgat 184141 cttggctcac tgcagcctcc gcctcccggg ttcaatcgat tctcctgcct cagcctccag 184201 agtagctggg attacaggcc tgcgcctgta atctattaaa agaatagaaa acatgattat 184261 atcctactaa tgggttgaaa ctgtattatt cattcaagaa ggtttttttc ttctataact 184321 aagggtgtct catggactta gttcttggtc atttgttctt tgtgctctct gtgacattac 184381 ttcaactatt caatttcaaa atctacatcc cttttttcgc agactttttg gagccatata 184441 tctcaagaat gttgctagac atatacattc cagtgataca taaaaactta accttccaaa 184501 acttgtattt gtatataaca gtttgttttt agacttttta ctgaccaccc taatgctcct 184561 tgggactcca aattgcaact tggaattatt tcttttagct gctacagatg tagtccactt 184621 ctttaacatc aaacttctga tgtcttttcc agtgtacaga gagttgttag gatagtgtct 184681 gtcagtcatt cccatcctgc cctgcttact ccagaattat ttttggcttt gtgcttgata 184741 cattaggatt ctgtggttta caaagcagct tcatatataa tcactgccct ttagtgtctc 184801 agctcccaat tttcctcaaa atttcctttc ttcgtttcca ctttttcttt tttgtttctt 184861 ttttgagata gggtcttgct ctgttgcctg ggcaacagag tgcagtggtg tgatggttca 184921 ctgcagcctc tacttccctg gctcaagcag tcctcccacc tcatcctcct gagtaactgg 184981 gactgccagc aaatggcact gcgcctggct aatttttttt tttttttttg tgagacaggg 185041 tcccaccgtg ttgcccaggc cggtctcaaa ttcctgggct caagtgatcc ttccacctcg 185101 gcctcctaaa gtgttgggat cacaggcata agccaccaca cctggctact ttgtcttgat 185161 tccatctgta cctttgctca tgccagtctt tcttttttct tttttttgag acagggtctt 185221 gctggagtgc agtggtacag tcttggctca ctgcaatctc tgtctcctgg gctcaagcca 185281 tcctcccacc ttagcctccc aagtagctag gactacaggc atgtgccacc atgcccagct 185341 aatttttgta tttttagtag agatggggtt tcgccatgtt gcccaggctg gtcttgaact 185401 cctgacctta agtgatttgc ctgccttggc ctcccagagt gctgggatta cagccgtgag 185461 ccactgcatc tgcccccatg cccgtcttaa aactgggaat aacccctctt ccttttactt 185521 ctgagagttt tctttgatta aactgcctcc tgcatcatta gttttcacat ttcttttttt 185581 tgagatggag tctcgctctg ttgcccgggc tggagtgcag tggcgctgca agctccgcct 185641 cctgggttca tgccattctc ctgcctctgc ctcccaagta gctgggacta caggcgcctg 185701 ccaccatgcc cggctaattt tttatatttt tagtagagat ggggtttcac cgtgttagcc 185761 acgatggtct caatctcctg accttgtgat ctgcccgcct cggcctccca aagtgctggg 185821 attacaggcg tgagccacca cgcccggcct tcacatttct tgttcagatt tgcagctctt 185881 cacttagtgc atgtttggtg tacgcaaact gagggtggtg actcgatatt ttgcacagta 185941 cctacttact gctcctgtaa taaacacagc attcagcctt gctaactact aactcctgcc 186001 tagttccagg atgtctcatt ggccttgcct aacagccaca ggttttttaa ttaaatccag 186061 tgtattagta gataatgtga agtcacaggt tgtgcccttc ctcctgtttc cctctcagac 186121 cattcactgg ggagtgcaaa taaggctgca cagtaatccc ccgaagggct tgctgggctc 186181 cacccccagt gttcctggtt tagtaggtct agggtgggcc tgagaatttg cctaacaagt 186241 ttccaggtgc agctgctgct ggtagtttgg ggaccacact tgaagaacca cggggctagg 186301 taacagaagc ttatgctgtt ctctcgtcat gttccctgtt cttcaggtag ggaagacata 186361 tgagctactc aactgtgaca agcacaagtc tatattgttg aagaatggac gggaccctgg 186421 ggaagcgcgg ccagatatca cccaccaggt aactccaggg acagtgctca caaccctttg 186481 agcctctgta tggaagggtt ggcagctgag tgctgcctct cttcagcctt aaccatgtct 186541 cggtttctgc tttgctcaga gtttgctgat gctgatggat agtcccctga accgagctgg 186601 cttgctacag gtttatatcc atacacagaa gaatgttctg attgaagtga atccccagac 186661 ccgaattccc agaacctttg accgcttttg tggcctcatg ggtaagaagc cttagaacaa 186721 agttagaatg aacttgtcag tagggaagaa gggaggaaga ggaaaaggga gaactaaatg 186781 tggattttta agcgagaaaa tgggagaaca acatgattaa taccaggaca aggactgtta 186841 ttatttttct atgtttgtgg aaactccact cctgttcttg cagtagcttc ctggctgagt 186901 gaaagaggga gtctgaaccc atcactgtac agctagccat atgcttggca actgtttgtt 186961 cctaacattt caggagtcca gtctagatat aaagcacaca gggaactcat cttatccatg 187021 gggttttcct tgttcgatga ctggacagaa ggactgtggt ctccatctag ctctgaactc 187081 ttttttcccc cttctagttc aacttttaca caagctcagt gttcgagcag ctgatggccc 187141 ccagaagctt ttgaaggtga ggtattgaaa cctgttagtt gaaggctggt tctgggaatg 187201 tttctggggc tgacttttct ctctttttta ctttaggtaa ttaagaatcc agtatcagat 187261 cactttccag ttggatgtat gaaagttggc acttcttttt ccatcccggt tgtcagtgat 187321 gtgcgtgagc tggtgcccag cagtgatcct attgtttttg tggtaggggc ctttgcccat 187381 ggcaaggtaa ggtctgggct caaccctgaa attcttggta gagctgaact tagtatagaa 187441 ttcccagagc agtaggcatt ttaacaatgc ttacaatgag ctagaagaca catgacagtt 187501 ccacaccctg ccccagggca catcctttga gggctgctgc cataattgga agtcacagtt 187561 aggaccttct tcatcctttg tagggatttg atattcaaca gcacagctga aatactagct 187621 cagccatagt tttcctgccc taaagaaggg ctgaaacagc tactgagtga cagagttggc 187681 tgacaaaact gttcttttct taggtcagtg tggagtatac agagaagatg gtgtccatca 187741 gtaactaccc cctttctgct gccctcacct gtgcaaaact taccacagcc tttgaggaag 187801 tatggggggt catttgacag tagtagaacc tgttctgaaa ccagaaactg ttgatgtcac 187861 atcctttgac cctggtctga gctgactgct ggaagatgat ctttctgcac tgagactgtg 187921 gagtttgggg aagccaaggc tgtacatttg ctatttgttt atcctatgaa tactgttctt 187981 gcaaacctgg ttgttttggg gttcctaaag tatccagtgg tgtaaaactg tttgttcccc 188041 gggacttcag ggacagatag gaggttacag agtttgcagt ttggttccat gctttgaagg 188101 caggctttag ctcccagatt cccatgtgct aaaggagaga accctgatga tggagaagaa 188161 ctgtgaaaga gagcagtcag gaatgctagt ggtgaaaaac tgaacaaaca gaagtgattt 188221 tatctaatac agttccaagg tagaaaaagt ggagcaggca gggccttgca cccctctcca 188281 cccccccatg gggggggtgg tggtagcggc acatacacaa tcatagtaaa ttggcagaag 188341 aaaaacacaa tagattcctg gctagatggg gagagataag gcaatgtgca tgggggaatc 188401 agaggggaga tgtgagcccc tctgctcctc ccacaagagt ttcccctttg ggccgggcac 188461 ggtggctcac gcctgtaatc ccagcacttt gggaggcgga ggcgggtgga tcacttgcgg 188521 tcaggagttc gagaccagcc tggccaacgt ggtgaaatcc cgtttctact gaaaatacaa 188581 aaattagctg ggcatggtgg cgtgcctgta ttcccagcta cttgggaggc tgaggcagga 188641 aaatcacttg aacccaggag gtggaggttg cggtgagctg agatcccgcc actgcattcc 188701 agcgtgggtg acaaagcaag acgccttctc caaaaaaaag tttccccttt ggccccaaat 188761 gaagacttgg ctggcagcag aggcacagct ggaagcatcg atcttccacc tccctggctt 188821 ttccattctc tgctctgggg caaaggagtg ctgtgaaaag ggagacgagt agtttctgca 188881 ccagtcccgc acaggccacc tgcaagacaa gaggagtttg gaaggctggt tagttactcc 188941 tgtaattcct ggtctatagc ccttccagat gtttcctagc atgcctcaat aagtcacagt 189001 agtcattgcc catactgtgt tccttagtag ccaggctaat ccttggaatt caccccagat 189061 ttctaatact attgtttttt tccagtctgt tgctctattc tgtaacctgg tggtagtttt 189121 agtttagctg tattaactta ccagggaaat ggattattcc atcttcttta acttctcttt 189181 ccttggcacc attgctttgt gaatataagg caatatgaat agtaggctca ggaagaagat 189241 gtggccaagg aaatagatgg atttatacac ctgtggagag agaggccact aaggtagaca 189301 ggcctggagt gtcctttgca acctttgagg ttgcagtgag tccctcccag tctcacaagc 189361 aggccttcac ttgccttaag ccatttgtcc cacgtgaaga ggcagaaggc agtcatggag 189421 taacccatga agagccagtg gatggtctgt tgcaccaaat agtagaaggg ctggaggaca 189481 gtaatggcgg ccagcttgct cagggtgggg ctctcttgaa tgagcctggc agcctgggga 189541 gggaggagga gctcatcagc atcttgtccc ttatattccc cttcaccccc accctggagg 189601 cctacctgtc tttccacaat aacaatgagg aattccatct ggaagcagac caggtatcct 189661 gagtgcaggc cgtgccagag ggccaggaat agcaacgaga gaccctgaga gagttcttta 189721 tttccaagga acttgagtcg tttgaagatg tagctggaga aaagggtggg tggggcgacc 189781 ctcaaactga ctggtccttg catcccgcca cctgcctctg ggtcctcacc ctgaggattg 189841 gattggagtg ctggtgggtt cccacgtgta gcccccagag ggtacaggag gcagtgctga 189901 ctgattactt ttagagatgg aaagcagacc caaggcggag ctggagaccg tgtgcgcacg 189961 agccacttgg ttcagcagca gtgactgagg ctgatgctga gatcagtggt gaaccagaca 190021 ctctactcaa gctgcccaca ctctagtggt ggggacaaac aagtaaagct gttgataaac 190081 agggcagtgg aggacataag tccattggca gagctggagt aacacccagg tcttaacgca 190141 cttttatatc ttgccttttt ttcaaatatg acagtaatgg tttttttggg aggggggtat 190201 aggtgggggt caaagtcagt gtgagcgaca gggggttctg cccagatggg aaagacgcat 190261 aggggtgaca tggtacaccc ctgccctcca atctgggaag acagtggaag gaaggaacca 190321 gggtccaggc tccccaccag cagctcaccg ggccacccag gcgttggtgt tgatgttgaa 190381 tgaggcaatg gtgccagtga agcgggggtt tgtttcaaag agccacacct tcatgttggc 190441 acaggcatcc cactttgcct tgcccttttc ttcaaagcca ttgaagccca ggcccgtcaa 190501 aatgcatact ccttcctgag agggaatagc tcagttaggg ctcttgccac tccccatact 190561 ggcccccatg gcttgtctaa ataggacctt gtttcaactt ttctacttac tgtgaccagc 190621 caacaggtga catatttgta cagcacaaac ttgccccaga tcagcatgta catgcagcgg 190681 aaccagaagg ggtggttctt gagggaagaa agcacagtgc attagggata tcacatgact 190741 aggcagtttc tctcagcact cttccttttc acacttgtgg ctggctactt catacctgcc 190801 tgagtcctgc tgccagatgc cctcaatagt ctggcctgat tgccttcaca ataggcagag 190861 aggaataagc agagggcctg gagaatattc attcgccttt cccttggaga gcctcaaggg 190921 cagcactgta tttaacttct ctactgtttg ccttcgttgg caaagtgttt ggaatgagca 190981 tttggaatgt taagtacaga ggggcccata ttggatttta atttaaggag agaaacctgc 191041 ccagaattac tgaactgttt tcaagccttt cagctgggca ggagcaaaag ccagcacttc 191101 cccccttccc ttggtttctg aattccctag aagtgcccaa atgtatcagt caagagaaga 191161 aaataggatg gagaatcaga agctgctgtg ctctgagggg tcacgtggat gtgataaggc 191221 aagctaggag cggctcctag agaaggcaac gggtgctaaa tgtgcacctg gcacagccct 191281 gtgcccgcga aggttgttca gtgctggctg atgaacatgt cccagcacgg ctggcattga 191341 caactcacag atctagagca aaaccaacat gcacttgtag atgatatttc ctacttctct 191401 gctatctgta gggatccttg cctgtccatt ttctagctct ggtgcagtca tgctgctgct 191461 gctttagtag acactcacgt catagtcttc agtgaggaga tagtcttctg tgatgtgggg 191521 gctgagcagt gtgtagccca ctaggtagaa aaggcccaga ctcaggcgct tgagagcagg 191581 aatgatgctg gaaaggaacg agtgaagttc ctggtcacag agagcagaga ctattccctt 191641 caccctctga ccttcagtta tggagaggag tgtttagggg tgtggttgtc tacctggaat 191701 gggatgagaa gctctgctgc ccaaagtgtt tcctgtttca gggaaagatt tctatggctg 191761 gctatgagga atggggactg caaacctctt aagagttgta ggaaagtcag ggcagcagac 191821 agtggaccag tcttgtgttt acccctcagg gatgctactg atctcaaggt cttgccaggt 191881 ctcacaatct ccattgggac tgaaaaccca gtggaggtaa gataataaaa ataaccactt 191941 tgaatctgct ccttcatttc ttggttgaat tgatgctgaa acacaggatg gacaagttct 192001 cagtgaagga attcttctgg caattaaact tttttttttt tttttttttt accattttaa 192061 tttttatttt ctagagtcag ggccttgctc tgccactccg gctagagtac agaggcatag 192121 tcattgctcg ctgtggcctt gaactcctgg gctcaagcga tccccttgcc ttggccacct 192181 gagtagctgg gactataggc atgtaccacc ttgcctggat aatttttttt tgtagagatg 192241 gggtctcact atgttgccca ggctggtctt gaactcctgg cctccagtga tcctcctgcc 192301 ttggcctccc agagcgttgg cattacaggc atgagccact gtgccctcct gcatttccta 192361 ctgataaata tttttgagac aggattttgc tatgttgccc agaatggagt gcagtggtgt 192421 gatcaaagct cactgcagcc ttgacccacc tcccctggac tcaagcaatt ctcccacttc 192481 agcctcctat gtagcttgga ctacagatgc atgccactat gtctgataat ttttgtattt 192541 ttttgtagag acagagtctc tctatgttgc caggctggtc tagaactcac gggctcaagt 192601 gatcctccca cctcggcctc ccaaagtgct gggattatag aggtgaaacc actgtgccca 192661 gcctctatct acctacctac ctatcaacct gcctacctac ctatcaatca taaatatatt 192721 tcatatatac atgaaaatag gctttaaagg cagaaatgtg actggttcag gcaaaatcct 192781 gtggcataaa tgtggatttt tatgtttgta ctagtgttag aatggataac tggaagaacc 192841 ctaaactaaa aagggcccac tcctggcaca gagtgcctgt tacaacagtc cagggcctgc 192901 atgactcagg tctctatgcc agggtcatgc tggagaatgc agctttcaga agagtcactt 192961 cagagtgagt gaaataccta cacaaacatc tggaccaaga ggggcaatta cctgtttggt 193021 atctttcctg gtatgtcaat cagctctccc tgcaccagct tcatgtagtg attcattgag 193081 aactggggcc ctaccaagaa ggccccatag aagtaggaga aaccagcaac ttccagcagg 193141 gaaggaacac cacgtatggc atatttctgt tgctcagagg acaaggaatt ctatgccaag 193201 aagagaatgc atggttcagg atagccttga atctccccac aagagccact ttgagtgttc 193261 cccacctgtg tgccccactg actggggttc ttcaaatagc cttctctttg ggggatgaca 193321 aatagttgct tcttgtggaa atgcgtatgt gtgtgcatag ctggctgttg ctgctttagc 193381 aaatgccttg ctggcattga tctctggact ttgtgcaaga gctggatcct gggcaaatta 193441 gatttgttga tccttgctca gtgctatctg aaagggagat tgcacaggct ggggaatgag 193501 ggagagctcc gctctggaat ttgggttcag tcttttgtta acaacttttt tcttcccttt 193561 cattaagtct tgaacctctc tgccgatgaa tgggtactta cctgatcttt ccctccgtca 193621 aagtagtcaa cagccaaacc tgagcagaga gagaacggat gggtagggtg gtgggagagg 193681 acactgagga tgtggcctgt agactgagtg cagccagaag tgtatggcgg ggggcggagg 193741 gggggtgccg aggaagtttt tagtgagatg ggggagggac ctacatgtaa ggaaggcagg 193801 cagtgaccat cactcaccaa tcagcttcaa agtcagaaca caatgtggca ttgtccactt 193861 gatatcgtag ttgccggtgg cagtgtaata gtatccagcc agaaggtagg cctaggagag 193921 gcagaagtat ttattctagc atcactacta tttcttctcc ttgctctgaa attcaatgtt 193981 ctctttcctt tttcctttct cctcatccca tcattaaaag ccttaataaa ttcctacaaa 194041 tggaggtgct gaaaacctgt aatgggaaga gcgttgctca aggcttcttg gcaaaatgag 194101 ggctaaactg agatgagaat ttagatgctg ggaccacagg tgcatgccac cccacccggc 194161 taatttttaa ttttttttgt ttcaccatgt tgcccaggct ggagggctaa tttttgtatt 194221 ttttttgtag agatggggtt tcaccatgtt gcccgggtta gtctcaaact cctgggctca 194281 agcaatctga ctgccttggc ctcccaaggt gctgggatta caggtgtgag ccactttgcc 194341 tggcctaggc ttaggttctt taactcattc taagttgctt ttctgtcttg ccttgaagtg 194401 actctgctcc tggatagtgg gttaaaccaa agagcctaat ggcacagtat gcaaagggtt 194461 agctggtggc ccttccttga ggcaggcaaa gagttaccat tacaatctgt ggatggaaca 194521 cttgggcctg tgataagcca cctacctgaa gtcatactgc taagtgctgg gagaggggtt 194581 agaaattagg tctgctaatt tctataccac agtcccctgc ctacccctgt cccctgaacc 194641 gctgtcagct gtagcctgag ctgctaaggg aaagacgttt accatctgga agcaaaaggt 194701 agtgaggacg gcagtgatgg tgcggcccat tagtcgaagg atgaggaact gaagcacaat 194761 acacagcagg gagtggtaga gctggtttcc tggatgcaag aagagagaat ttagcctggt 194821 ttttacactc ccaccgtcct gagacttcca atcacaacgt aatatataaa gaaaaaatgt 194881 tggcatcagg atcttttttt tcagaataac ctcatgtttt ggaggagagg atggaaatta 194941 tttattaaaa taaaaaagat tattagggtg ttattttatt tcttttcttt ttttgtttgt 195001 ttttttagag atgcgacttg ctctgttgtc cagacttgag tgcagtagct caatcatagc 195061 tcactgcagc cttgaattcc gggctaccac acccagcttg ttactgtatt tttgaatgtc 195121 tgaagtgaag aatgaatcta agtggggact gcttggctcg gtcattcagt tacatccaca 195181 gcacagagaa actgaggatt ctttgttgat gaggtatggc aagggtaacc accctcaaaa 195241 tgtttttatc tgacagaaga atgtacataa aagaaaaaaa ggaaaagtta agctgtgttc 195301 ctcttcatac aaccctcttt gcaagtggga gcattataac ttccctacct attagattct 195361 ctcaaggaaa ttttgttcaa gtctgttcct tccaggttcc cactaattgc ggtgtcactg 195421 ctaatttagt ttactcacca aagttaaaat aagcaattga gaggcctgta aaggtatgga 195481 agaggtggat gaggtaggtc tccttgtaga aaaggtaatg ccgataaaac aaagcaaagg 195541 ggtaacctag atgggggaaa agataagaag agtgttattt gtgcctggtg ccatcccagt 195601 ttggtttgga agtttatctg gcatgaaacg cagcccagag ggagagagaa aaaaaaaaca 195661 acatacatta tatggatggc aacagagaag gcaataacgg tatccccaag aaccaaaagt 195721 tttttgtaaa tacaaatttt gaactaaaga tattaatatt tgattgaggc aatataaagc 195781 tgggtcctaa gactaggttt catttatagc ttatgaacta ttgccagaca ttttctctta 195841 cttgaatttt aaaaaatgat acaaggaagc taggcatggt ggctcccatg tataatccca 195901 gcactctggg aggctgaggc aaggggattg cttgagccga ggagttcgac accagcctga 195961 gcaacatagc gaaaccccgt ctctattaaa aaacaagata caaagatttg agcgtaccca 196021 tctatgctcc agaaacactc atttacactt tgtgatctat cttgctagca ctgtattatc 196081 agattatgag gaaaaatata aattaatatc aggctaaata ctggaattgc tgactgcata 196141 tatagcagtt acaagttatg tggagatact cctcacagtc tgtaatctgg gcatcaacca 196201 agttataaaa tccatttaag tatactaaaa aaatgtcttc taaagccatg attcagagta 196261 tagtccaaag gccatgagtg aaccacagag gatcttctga tgggtcatga actgattata 196321 catggccaag ggttgcttat tgaattaaaa acggtcaata aaatttggta ttcctaaact 196381 aaaattagca cattccatgg ctttactgca gactcacctt cagtgttcta tctagaggtc 196441 tggcggccat gcctggcaac atccccactg cactagtgca tatgtggagg atggggatct 196501 ctcaactgct ttttggaaag acattctgcc tattctttca ttgattattc tcttgagttg 196561 gtcatggttt atactttctg caattcttgc ttttattttt atttattttg agacagggtc 196621 tcttgctctg tcacccaggc tggagtgcag tggcacgatc atcgctcact gcagccttga 196681 cctcctgggc tcaagtgatc ctccaacttc agcctcttga gtagctggga ccacaggtgc 196741 ttgccaccat gcctggctat tttgtaattt ttgtggacat gagttctcac tatgttgccc 196801 aggctggcct tgacctcctg ggctcaagga atcttcctgc cttggcctcc caaagtgttg 196861 ggattacagg cgtgagtcac tgtgcctgat gaattgctgc gattcttgct tttaaattat 196921 gtaactgcag tttttatcat gggcaataca gtttactgtg agtttgctta actctaaaac 196981 gcttagaata gtatcataca gaaataaatt gctcaattat ttgttaaata agtaaatgca 197041 cacaatcatg ttatatgttg gtttctgtcc ttctagtcat ttttccccca tacattaaaa 197101 aaaaaaaaaa aaaaaaggct gggcgcggtg gctcatgcct gtaatcccag cactatggga 197161 ggctgagacg ggcggatcat gaggacagga gatcgagact atcctggcta acacggtgaa 197221 ccccgtctct actaaaaata caaaaaaaaa aattagccgg gtgtggtggc gggcgcctat 197281 agtcccagct actagggagg ctgaggcagg agaatggcat gaacacggga ggcggagctt 197341 gcagtgagct gagatggcac cactgcactc cagcctgggc gacagagcga gattccgtct 197401 caaaaaacca aaataaaaca aaacaaaaaa ctgtactggc tggtgcagtg gctcacgcct 197461 gtaaaccaag gcactttggg aggctgaggt gggtggatca cttgagatcc ggagtttgag 197521 accatactgg ccaacatggt gaaaccccat ctctaccaaa aatataaaaa attagctggg 197581 tgtggtggcg ggtgcctgta atcccagcta ctcaggaggc tgaggcagga gaatcacttg 197641 aacctgggag gcagaggttg cagtgagcag agattgcagt gagcagagag agccactgca 197701 ctccagcctg gtgacagagt gagactctgt ctcaaaaaaa aaaaaaaaaa aaaaaaaagt 197761 tacacctgtc ggccgggtgc ggcagctcac acctgtaatc ccctactttg ggaggcttag 197821 gcgggtgggt cgcctgagat caggagtttg agacgagcct ggccaacatg gtgaaacccc 197881 atctctacta aaaatacaaa aattagttag gcgtggtgca ggcacctgta atcccaccta 197941 cttgggaagc tgaggcagga gaattgcttg aacccaggag gcggaggtgg cagtgagctg 198001 agatcacgcc attgcactcc agcttgggca acgagcaaga ttctgtctca aaaaaaaaag 198061 aaaaaaaagt tttatgagga tttttgactg tacaaggggt gggatcccat aggacctgcg 198121 ctgttcaagg ctcaactgta attaattcta cagatatctt atggattact ggctcatgta 198181 ttcattcacc aagtgtttag taaatgcctg ctatatgcca ggtattctgt ttatgaagca 198241 cgtaattggg taggaggtgg catgtgatgg ttcgttttct agctggctga gatgatggga 198301 tataggatca ctagctctag ggaggtggag acatctatga ccctttcagg tagatcaatg 198361 agcaaggaaa taggatcaag tctcatatat ttcacatggg cagaaatttg gcagagacag 198421 aacaccaggt tagcagcaaa tattgctaag aactagagcc aggaactaga aagtatatag 198481 actaaatgtc aatgccttca atcttttgct ttatttttac ttttgtttta gtaacggggt 198541 ataaataaaa aaatataaaa cagtagataa ttatctagag cactcataaa taagttctag 198601 gtagctaagt ttctctcttt aagcatgaaa acccttaacc atttggaatg ctcgaaatta 198661 aaaaacacca cacatattac tctgcctctt taagttgaat ctaatttaac attttctagg 198721 tgtctggatc gtatctattc cagagtaaag tcatgatggc tttatgacgt tctgaggtat 198781 gtgaaattgt ctgcctgact ctaacaaggc ctacactgtc ctgagttctg agttcttgtg 198841 tccacttctt atagcctgtc tgtccctttc cttgctactc tgggatcaac agtcacctct 198901 tgaacttttg gggtgcttgg caatagttcc tctccctact cctaatttac ggcaggcctt 198961 agaaaccata accttatttt aaaggtgtaa aaaaaaaaag atttaagaca aaagcaaggg 199021 gcttgggtgc tttccttatg gacttaggcc tggtaacatc tgttctggcc acttagaggc 199081 cttgtgtgct atttcttgtt ttcaggtgcg ttttgcagga ggggacgttg ttgagttcca 199141 aacaggtgag gtattgcaca ctagcaaaca catgagaaga aggcggagga attgggagaa 199201 aaataaaaag aatgcagcag gccaggttag caggaacgtt aagacggtga cggagaacag 199261 caaagcctgg aagcaagccg ccgtggagaa ggaagaactg tgctgaggtg agttgctgtg 199321 acaacccagg ctgattttga gtatgtaaac accaaacctt gttcttggct gccgctcagc 199381 tcagcgggct ttggagcctg gctgcccagc caccacttca gggatgtgct gtttttaggg 199441 agggtgtgac cctacaagat gtttctgagc cttaatgctt ttttgtggga gccaatgctt 199501 aatatggtgg ctagagttac ctgaagaatc tataaaaaat gaccgaagcc ccttctgctc 199561 accctcccac tcatcagagt tggcttccgt gggtctgagt gggaaggact tccactttta 199621 acagcatgag acacggttct gacagcccca ctaacatccg aatgcaggcc gcagtgctca 199681 gtcctgagga taaaattctc agcttggaga ttggggttga tgccttacct ttattagcac 199741 cagatgggtt tgtaacaacc cagaggtttg taagaacttg ttggccgggc gccgtggctc 199801 acgcctgtaa tcccagcact ttgggaggcc gaggcaggcg gatcacctga ggtcaggagt 199861 ttgagactag cctcaacatg gagaaaccct gtctctacta aaaaaaatac aaaattagct 199921 gggcgtggta gtgcatgcct gtaatcccag ctactcggga ggctgaggca gaattgcttg 199981 aacctgggag gtggaggttg tggtgagccg agatcacgcc attgcactcc agcctggcaa 200041 caagagcgaa actccatctc aaaaaaaaaa aaagaacttg tttggcagca ctgtaactgt 200101 tccttctttt tgattgtttg tttaaagcag ggacttcaga aatttattag caggcgaagg 200161 atgatgacct ttagtacact ccaaacctga ggatcttcta ctagaatggg acctttataa 200221 tccctaatgc tagggacatt caaaatgcgt gttttttttt tttttttttg agacagagtc 200281 tctgtcgccc aggctggagt gcagtggtgc aatctcggct cactgcaagc tccgcctccc 200341 gggttcacgc cattctcctg cctcagcctc tccgagtagc tgggactaca ggcgcccgcc 200401 atcacgccca gctaattttt tgtattttta gtagagacag ggtttcaccg tggtctcgat 200461 ctcctgacct cgtgatccgc ccgcctcggc ctcccaaagt gctgggatta caagcgtgag 200521 tcaccgtgcc cggccaatgc tgtggtcttt caagcagctg ctgggataca tttaatttgt 200581 acaagccctc ttcaggggtt gtagtcaagc acagggagtg gatagaactg tattattcag 200641 tctctggact tcactcagtt ccaagtgctg tttgtgtcag ggaccagatc tatcacaacg 200701 tgcattgtta gcgggaacgt tttcttattt cctgtacaat agttgtgaga ttacttatta 200761 atcctaaagt tgtgaggttt catctgaaga aacagaaatg gactgtttcc attaagtctg 200821 gtaaatttgg ctgggagtgg tggctcacac ctgtaatccc agtgctttgg gaggctgaga 200881 cgggagaatc actggaaccc aggagtttga gaccagcctg ggcaacatag caagactctg 200941 tctctacaaa aaataaaaaa aatagttggg tggggtgggt ggtgcgtgcc tgtagtccca 201001 gctacttgag aggctgaggc aggaggatca cctgagtctg gggagacaga agctacaatg 201061 agctacgatg atgccactgc actccagcct gggcaacaaa gttgtttttt ttgagaccct 201121 gtctcaaaaa taaataaata aataaataaa aataaaaaaa atataagcgg ggcacggtgg 201181 ctcatgcctg taatcccagc acttcgggag gctgaggcgg gcggatcacg aggtcaggag 201241 atcgagacca tcctggttaa catggtgaaa ccctgtctct actaaaaata caaaaaatta 201301 gccgggtgtg gtggcgggca cctgtagccc cagctactcg ggaggctgag gcaggagaat 201361 ggtgtgaacc tgggaggcag agcttgcagt gagccgagat tgtgccactg cactccagcc 201421 tgggcgacac agtgagactc cgcctcaaaa agaaaattaa aataaaataa ataaataaat 201481 aaataaataa ataaataaat aaattgagca ccccaaacaa cttttgttaa tgtgggaact 201541 atatatcaat gtctactgtg ttagaaaata agactaaaaa ctgggtgtgg tggctcactc 201601 ctgtaatccc aatgctttgg gaggccgagg tgggtggatc acttgaggtc aggagtttga 201661 gaccatcctg gccaacatgg tgaaacccta ctaaaaatac aaaaaccagc cagacatggt 201721 ggcaggcgcc tgtgatccca gctacttggg aggctgaggc aggagaatcg cttgaaccca 201781 ggaggtgggg gttgtagtga gctgagatca cgccactgtg ctccagacta ggcaacagag 201841 cgagactcca aatcaaaaaa aaaaaaaaaa ggaagaaaat aaaactaaaa aaaaagtaaa 201901 agatatgatt aattcagtaa aaatattaac agtaaaacta ctacacatta tattaatata 201961 aataacatac tttaaatgta aaccactata tttcccaaaa caatcaaaga aataaagcca 202021 tctaatgaga agaatgccac tgttttacat tttcataaat ttatggcctc tgtctgactt 202081 aatagaagat gactggattc tcaaatctgc ttctgcattt aatctgttgt aatatgttag 202141 tttggttaaa atatacaaag aaaatctggc tttacacaga cagagaaagt atctgcattt 202201 tcagataatt ttggatattc tttagcgcta catcaaaact tgacaagtag tagtttctta 202261 acagttagga acttttgtta taataaaacc cattggcttc tcttgcactt tgaatggata 202321 ttttaccatg catgactttg taacatcaca catagatcac tgcaaaatac tggttccctg 202381 ttgttaggca gatcttctaa atactgacat atttcattag aatacatatc aaaaaatcat 202441 attccttaaa tttcaccatt gatttcagca gaaaagtctg tatatattga aaagtggtca 202501 agctcatggg agtggataca agttttccaa aattctaatt tttacttgaa agtgtaaatt 202561 ttatcactgg ctacaaatat tgtcatttgt ttaccttgaa gtgacaggct cacttcatac 202621 agtttcaaga gactgtctgc cagatgccta agtctaaaac catagtttgt ctatcattct 202681 ttcaagtaaa aatggtggtc ggtgaaaaaa gaagctgcta aatcaatatg caacttgaat 202741 aatcgcccaa atgcttttcc ttgtgacaac caccgtactc taccagtgtg cagcagaggc 202801 ggtttatgca tatttcccat ttcttcaaac aagtttaaaa agatgtactc aggatcaggc 202861 atggtggccc acacctccca gcactctggg aggccaagac gggtagatca ctttaggtca 202921 ggagttccag accagcctgg ccaacatggc gaaaccccgt ctctactaaa aatacaaaaa 202981 ttagctgggt gtggtggtgc ataccagtta aaaaaaaaag atttattcaa ggactgacat 203041 ttgatacaat taacaatatt tagcctgggc gacagagtga aaccccatct ctaaaaacaa 203101 aacaaaacaa aacaaaaaca aaaattagcc aggtgtggtg gtgcacacct gtagcctcag 203161 ctactcagga ggctgaggta gcatcacctg agcccaggaa gttgaggcgg cagtgagctg 203221 tgatgccccc accgcactct agcctgggta agaccctgcc tcaaaaaaaa aaaacaaaaa 203281 aaatttacca cttcatcaac gattcttaag tgaaactggc tctgttttga ttgtgagtgc 203341 atggcagtaa agaatgcagt gaccactggt acagtttggt gtcaccgtcc tgatttgtgc 203401 taaggcgcca gcagttttac caccattttg caacatcagt gcaagtgtca acatagggaa 203461 aagacaaata catcttagta gtatcatgaa aataattttt accccaaaga gattccttaa 203521 agaagtctca ggaactttta ggagtagtct atggactgca tgctgatatg gaatgatttg 203581 ttttaatctt tcatcttcta atacccaaat ccactagtct ttcctttctc cctggtttcc 203641 ttgactgccc tgctgtgttt tcaatctttt tgaaacttct ctttcactgg tttcattggt 203701 tttgaataca gttatctctt ggtgtccatg aaggattggt tctagtactc gcagcagaca 203761 ccacagatgc tcaagtccct tatataatat ggtgtagtat tgcatgtaac ctatgcacat 203821 cttcacatac actttaaatc atctctggat tactcataat acctaatata atgtaaatgc 203881 aatgcaaata gctgctatat agcatattgt tttttattta tatttttatt attgtattat 203941 tatttagctt agaatccatg aatgtggaac ccacaaatat ggagggctga ctatacacag 204001 tttcctggat ttttttctac tgagatataa tttaccataa aattcaccct tcaaagtgta 204061 aaatgtaatg ttttttagta tattcaaaag gttgtcgccg ggcatggtgg cttatgcctg 204121 taatcccagc actttgggag ccggaggagg gcagatcacg aggtcaggag atcaagacca 204181 tcctggccaa catggtgaaa ccctgtctct actaacaaaa aattagctgg gcgtggtggt 204241 gcctgcctgt agtcccagct actcaggagg ctgagggagg agaattgctt gaacccagga 204301 ggcagagatt gcagtgagct gagattgcgc cactgcactc cagcctggca acagagcgag 204361 actccatctc aaaaagaaaa aaaaaaaagg ttgtgcagcc atccccacta tctatctaat 204421 ttcagaatat cttcatcacc tcaagtagaa accccataca tgttggcagt catttcccat 204481 tctctcttaa ctcccagagc ctggcaacca ttcatctact ttatgtctct atagattggc 204541 ctattctagg tgtttcatat aaatggcgtc aggcaatgtg tagacctttg tgtctggctt 204601 atttcactta gcatgttttc aaggttcatc catgttgtag catgtatcag tacttcattc 204661 ctgtttatgg ctgaataata tcccgttgta tggatactct gcattctttt ttttaacatt 204721 taaaaatttt ttatagacaa ggtctcacta tgttgctcag gctggtcttg aattcctgag 204781 ctcaaatgat ctgtccacct cagcttccca aagtgctagg attgcaggca tgagccacta 204841 cgcccagcct ggatactctg catataatct acccattcat cagctgacaa acaactgggt 204901 tgtttccact ttaggaacat tatgaagaat gctgctacaa acattcatgc attttgtaga 204961 gacagggtct cactatgttg cctaggccgg tcttgaactc ctggcctcaa atgatcctcc 205021 tgctttggcc tcccagagtg ctgcaattac aggtatgagc ccatgtacaa gtttttgtgt 205081 gaacatatgt ttttatgttt tcaattctct tgggtatata cctaggaatg gatctcctgg 205141 atctttttct tttttttccc tttgggacag agtctctctg tgttgtccag gctgaagtgc 205201 agtggcatga tcttggctca cggcaacctc cgcttcccag gttcaagtga ttctcctgtc 205261 tcagcctccc gagtagctgg gattacaggt gtgcaccacc atgcccagct aatttttgta 205321 ttttcagtag agatggggtt tctccatgtt ggccaggcta gtctcgaact cctgacctca 205381 agtgatcgac ctgccttggc gtcccaaagt gctgggatta caggcgtgag ccaccgggct 205441 gttgcccacg ctggagtgca gtggcatgct cacagctcac tgcagcctca actttctagg 205501 ttcaagtgat cctcccacct gagcctccct agcagctggg actacaggtg tgcacaggac 205561 cggctaattt cttgtagagt tggagtttca ccatgttgct caggctggtc tcgaactcct 205621 gagctcaagc aatctgcccg ccttggcctc ccaaagtgtt gggattacag gcatcagcca 205681 ctgtgcttgg ccaatgttca ctcttacttt ctgtgcttac ttctggcatg ggttggctga 205741 ctacaatttg aatagaccca tctttccgtt tctttcaccg aaggcatttt ttctggctcc 205801 tagatgttgc tgacccccag gatccagtct tggccctcca ctattcttgc tacactgctt 205861 ctttggaaac tcatctactc tcaagtaact taatataatc tttatgccaa agacagatcc 205921 acatctttag ttctaacttc tttttcagtt ccacatttcc tacttccttg ctggacagtt 205981 ttaatttgat atttcattac catctcaaac tctgtataac taaggcataa atttctccct 206041 aaatcagcct tctctgggct tttttttttt ttttaaatta acagtgttac cattctccag 206101 gtcacacaac attcaaaagt gtgttttcct gccagggcat ggtggctcca cgtctgtaat 206161 cccagcactt tgggaggcca aggcaggcgg atcaccaggt caagagttca agaccagcct 206221 gaccaacatg gtgaaacccc atctctacta aaaatacaaa aattagccgg gcgtggtggt 206281 gcatgcctgt aatcccagct actcaggagg ctgaggcagg agaatcgctt gaacccagga 206341 ggcggaggtt gcagtgagct gagatcgtct cactgtactc cagcctgggc aacagagcca 206401 gaatccgttt caaaaaaaaa aaaaaagtgt tttccttctt catattggtt ctctgttatc 206461 tagtcatgaa atcctgttga ttcctccttt ccagtctctc tcaccatcac tgtcaccatg 206521 accatcacca ctcccttttt cctctgctga tttgcagtta aaacccttat ggatctcacc 206581 cttaaatcct tgcaatggcc tcctacctga tctctgcctg caccacgtct tcccagcagt 206641 cctataccgg ggtaatcttc cttcagctcc tgctcagaaa ccatcaatag ctcccagctg 206701 actactcaac agagtcagca atcctgtctg ttctaccttc aaaataaatc caaaatctga 206761 ccacttctct ccgcctctac tgcttcccct ggtctgagtt gccactatct ctggattatt 206821 attattaata tatcattatt atattcaatg tattatcatt atatacttgc ctcctgactg 206881 atctcaccct gcctttgcct cccttcagtc tagccttaat gaagcatcta gagggttcta 206941 ttcaacttaa gtcagcaggt cactcctctg ctcaaagccc tctcaaggcc tctattctca 207001 ctcagatcaa aaggctgatt gccagcaccc gcaggttctg tccctctgcc cgggccccac 207061 tgtcctctga ctcatctctc aatctggcct ctacccttct gctccagccc caaagctttc 207121 ccttcctgga atgttaagca ggtccagcct tggtgccttc acatggtaag ttccttgcct 207181 ggaaggctct ttgcacagat aagctcaaat ccttcctaca cctcaggtcc tttgtaaaat 207241 gtcaccataa gtatcagtga ggacttccct atcttatcta gaagtgtata caacactacc 207301 cctccccacc ctgtaaccct cccccatcac acacttcttg ttcttctttc ctgctttttc 207361 tatttttctt ctcagtactt attacctttt gacataccat atatcttact tttcagtctt 207421 tcaaagcagg gattatcatc tactttagtc cctaccgtac cccagtgcat agtacagttc 207481 ctggtacaca aaatttctca aaaagtatta gctgaatggc cgaacaatga gtgaacaagt 207541 gctctgtact ctaggcagtc aatacaatat ttattaagca ctcactatgt ggttagcatt 207601 gcattaggca ttgggggaat atggtagaat tttaagaggt gctttctatt cttaaggagc 207661 aaaatcaaac aatgatgttc tatgctaagg gctaactgta tggaatagcc aatgttgtag 207721 aggttaaaga gaaatcaatc tcgaccacag tggtcgggag aagtagttct tgaacagaga 207781 agagatgacg gcattccaag ggaggataat gtagtaaaga cacagcggta ggaaggagca 207841 tggggcattc actccaagag actgggctaa tttcaaggga ggaaaaatat tgtaacaaaa 207901 tggatgggaa agtgaaaaac caggcagagt ctgaaattga taaatgggaa aaaaaaaatc 207961 aattgactag gtgtaggaga aggaaaagga caaattaaag atgtttatgc cactgatata 208021 atggtcttca ggtcttcaaa cctttttgct ggcatagctc ctagaagaat tttgaaaaac 208081 tgtatatcct cctttcacat tttaaagttg ccatctaaac cttatgtttt tatttcttta 208141 tttttttatt ttcttgagac agtacttcac tctgttgccc aggctggagt gcagcggcac 208201 aatcacagct cactgcagtt tcaacttcct gggttcaagt gatcctcctg cctgagcctc 208261 ccaagtagct aggactatag gtatgtgccg ccatgccccc aaaacaacag caaattatgt 208321 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtttca agacacagtc 208381 tcactctgtt gcccaggctg gagtgcagtg gtgcaatctt ggcttactgc aacctctgcc 208441 tcccaggttc aagcgattct cctgcctcag cctcccaggt agatgggatt acaactgtgt 208501 gtcaccatgc atggctgatt ttcttttctt tttttttttt tgagatggag tttcgctctt 208561 gttgcccagg ctggagtgca atggcgtgat ctcggctcac cgtaacctct gcctcctggg 208621 ttcaagtgat tctcctgcct cagcctccca agtagctggg attacaggca ggcgccacca 208681 cacggggcta attttgtatt tttagtagag acaggatttc accatgttgg tcaggctggt 208741 ctcgaactcc tgacctcagg tgatcctcct gccttagctt cccaaagtgc tgggattaca 208801 ggcgagagcc actgcacctg gccgcatggc tgatttttgt atttttagta gagtagggtt 208861 tcaccatgtt ggccaggctg gtcttaaact cctgacctca agtgatctgc ctgcctcagc 208921 ctcctaaagt gctgggcata agccaccatc cctagtcagt gtgtgtgttt tgaggcaggg 208981 tcttgctctg tcgcaaaggc tggagtgcaa tggcacagtc atggctcact gcagccttga 209041 tctcctgggc tcaagtgatc ttccgacctc agacccctga gtagctgaga ccacaggaat 209101 gtactaccac attcagataa ttttaaaatt ttttgtagag atggcatctt gctatattgc 209161 ccaggctggt cttgaactgc tgggctcaag caatcctcct gcctcagcct cacaaagtat 209221 tggcattaca ggtgtgagct actatgtttg gccgtgtgtg tgctttctaa tgcctattga 209281 aggtgtgttt taagtgctta tggcagatgt ctgccatgca atctaaatgg caaaccaatc 209341 agctggatca gtcttactta caacacacct ggcctgtccc tcaacttttc tattctttca 209401 ccaaaatgga agttccttat gttagagaac ttacagagaa aaggaagaca aaggaaaata 209461 ggaaggagag ttaagctctg cctgacactg tctacttaag tgatgggtaa ctgattaggt 209521 tcagtgctta gaacttcata taatgagata caaaatctca ttactgaccc actctgccag 209581 ctagaagagc acttgtagat catggacatt taggtgggga tatagccacc tttgatcctt 209641 tctgttcact agattaagcc ttgttgctag tacaggcagg agccaacata ccttccacac 209701 tccagcaatc ctgtacacct tgggctgagt catatgctga acagttattc atgaatacag 209761 attcttactt agaactgaag aattacctgc tccaaaccct tcattaggta gataagagac 209821 taaggtccaa aggggataac tggcatgccc aaggttaaca gagcagagct caaattagaa 209881 tccacatcac ctgatttcca ttctactatt cttgccacat gccttggcac ttggcagtga 209941 cctgggagtg aatcactaat aaccagctgt gtgctctttt aaagctacag ctctattatt 210001 ctttgatgtc cagataacaa tctatgctcc agcacctgtt tatagacata agcacaccca 210061 tttttatgtt tgcttatcct cctattgaaa aatattttaa aagttcattt taatgctaaa 210121 tttaggtgta ctttctttgt taatctaatt cttgagcact ctgcaccctt cagcagttca 210181 tttctgaaag ttacctccac ctaaatagct caagcattgt gacagctgtg atacaggact 210241 catggaaact gggacaagtg atcaaaaatg tctgagggag gaacaataga ggggacattt 210301 taaaggtaag ctattggata gttgttacct gtgagaaaga aagaaaaagg gttaggttaa 210361 gaaatgttag ttttttactc tctcaaccca gccaaacaac ctaaacttga actgtcccaa 210421 aggcctacca aaattcttta atttcttcac ttaagctgag tgctcttaat aagcaagact 210481 tctgaggtgt agtgctagtg attaatgtta tacacacccg ttggctgact ggtgaaacct 210541 gctgcaataa ctaaaaatta ttctataaaa atgtatagac agagagaatg ttaaagctaa 210601 aagaggccgt tgactcattt ctttgtttta gtagaacagg tgcaacagat tgaggcactt 210661 taagacatct aaatggcttt actagcaaag ataaaaatca cacttgcctg tactcaggga 210721 attttatcac attctaattc tttgtaaatc agtgaatccc ctggtgcaac ttctagccag 210781 attatctggg ctttggaatc agaattatct agtttaaaat ttagctctgc agcttgggca 210841 aattacttac attctttttt tttttttttt tttttgagac agagtctcgc tctgttcctc 210901 aggctggagt gcagtagtgc agtcttggct cactgaaacc tctgcctctc aggttcaagc 210961 aattctcctg cctcagcctc ctgggtagct gggactacag gtgcccgcca ccacgcctgg 211021 ctaatttttt tgtattttta gtggagacgg ggtttcaccg tgttagccag gatagtctcg 211081 atctcctgac ctcgtgatct gcccgcctca gcctcccaaa gtgctgggat tataggtgtg 211141 agccactgcg cccggccgaa aattgagtat tcttttaatg gattttaaga tataaaatct 211201 caggctcaga aatttgagcc acctaggggt gagaaatttt ggcctttgcc cgcttcgaac 211261 tgacttaact aaattaaatc tatagaaagt aaaaaataaa ggtctgtgaa tgtacccagt 211321 actctcagga agaagcatat ttgccatgaa gctaaaaaag cttcagtttc actccccttc 211381 caaggctctg gggggtgggg tgggggcggg agctagcaac gtgctcacat ggtcatatat 211441 ttttgtaaaa cttatagaag atatttttgt attctttttc ttaaagaaga tatccaagat 211501 tgtataagct tcagatccca gaaatcctag atttaaaaaa aataatcaac caccacccca 211561 acaaaattag gcacatgcct gaaaattctg ctgctactgc cagaattctg cgggtagggg 211621 cttcatccaa ggtgttccta ataataggca gagtccccag attagttccg gtgaggcttt 211681 tgggaggggt tatttggcgc ggtgttggtg ttggtaggat cctggctgtg aggctgaggc 211741 aggcctgagg agggctgggg ctgacaggta ggcagagttg gctgggatgt gaggtgcaag 211801 gggtggtact tcggaaacaa actttgaaga ggagaagggg cagcaagact gtgagtctgg 211861 actttgtgat acactgtcac ccctagttta gtgtgcctgg ttagggagaa tcaagcagca 211921 tggaacccct cttccttttt cactaacatt tttctgtata gtggttgaaa cccagtgtta 211981 agagactgca tgacattgga aaagaagggg aggagaaaaa gagtccatat gaggcaagaa 212041 cagaacgcag ccagaattag ggcaggcagg gaaaggaagg gagtgagatg tgggagtcgg 212101 gggggagttg aggtaacctg cattttatat gctagagcta cgtattttag atgcaggggc 212161 aactgtttcg gtccaggagt ggaaaggact gtagaagcaa tctagcacag ctgtttctca 212221 cttgggaaaa ctgaagccct cagaggcaga gggaggaatg gggagcggcc aggccaggac 212281 aggactcagc ccccagttcc tcttctcagg cgtccgattc cttctgatct ttctctcctg 212341 ccctctctgt tactgcttcc tcctgttctt acccttatat acacttgaag ttttatcccc 212401 attttttatg cttatgggat tgtacacttt ctggttctct ctcaagtcca accagtatgt 212461 ggtaacctgt ctcttcccac ttcatttgtg gcactggttt gcagtggaca aaaggtccgt 212521 gctcctcttc taacctaatc tggactgggt tgcccaaagg ttgccctgcc acactgccaa 212581 gtgcctaatt agctgttttc tctccaaccc ctccaaacac ttatcatgag taatttctct 212641 tgtctttaga gttgccaaat ctaatctctg taaatacaaa tgtggtgaga cttcttctca 212701 ggagtttcag caaatgaaac aataaactct tttttaccct gctaagattc taaagataac 212761 catgagaata ctcctaatta tccttataaa tttgaataag tgtggttgtt gggttctctc 212821 acccttttta tatcccttca aaagaaaata caagtttgaa ttctataaaa tatttttctg 212881 gccgggcgca gtggctaacg cctgtaatcc cagcactttg ggaggctgag gtgggtggat 212941 cacaaggtca ggagttcaag accagcctgg ccaacatggt gaaaccccat ctctactaaa 213001 aatacaaaag ttagctgggc gtggtggcgg gcgcctgtaa tcccagctgc tcaggaggct 213061 gaggcaggag aatcacttga aactggaagg cagaggttgc agtgagctga gatggcacca 213121 ctgcactcca gcctgggcga aagagtgaaa ctccatctca aaaaaagaaa aaaaaaattt 213181 ctaatattta tggaatgcac tctcttatat accaacatac atttgtagca ttttttttcc 213241 ttcagagtat gaagaaatca gtaaaaatgg gcaaagcaca aatgacaact aatggatttt 213301 gttaagtcct atgtgtcact ttaagaactt gaaactggct caaataaaaa ccattttatt 213361 tggttattaa aatgagactt aaaaacttaa aacaagattt taagtgatct taaaatgtct 213421 tatcctacag aagattttag aggcaatgta gtacaaaaag ctcttccttt tctctgaaga 213481 atttattgag tgctttggga gtagggtgag tttttttttt ttttttaaca tatgtttgat 213541 tcaattaagc aaaaatattg tttcatgtct cactgaactg aatacattct actatccatt 213601 ggttttcaat gttcacttat ttattttatt ttagtagaga ctgggtctca ctgtgttgtc 213661 caggttgggt tttcaaattt tatgtctaac acagtgtgct aaatacaact gagtttcata 213721 agccattttt cccacttctg tacatggcac tggtctcaac ctgtatcatg cataagaatt 213781 acctaggatg cggccgggcg cggtggctca cgcctgtaat cccagcactt tgggaggccg 213841 aggcgggtgg atcatgaggt caggagatcg agaccatcct ggctaacaag gtgaaacccc 213901 gtctctatta aaaatacaaa aaattagccg ggcgcggtgg cgggcgcctg tagtcccagc 213961 tacttgggag gctgaggcag gagaatggcg tgaacccggg aagcggagct tgcagtgagc 214021 cgagattgca ccactgcagt ccgcagtccg gcctgggcga cagagcgaga ctccgtctca 214081 aaaaaaaaaa aaaaaaaaaa agaattacct aggatgctta ttaaaaatgc agattcccaa 214141 ttcctgaagg gaggtcatgc cttggcatct tacttagcga atcagtgcta tacagaaagt 214201 atgagagact ggttcatatc aaagaatatt gtgaaactct acactatatt tgtgaagata 214261 tagaaaaccc gttttatggc taaaaatcat cattttacta ctactaagct tcattattag 214321 tagatggagt ttttttaaag ggtcattttt cacaaagaaa aatttgtttt gtcttttaac 214381 actttaggac tgttacttca tagtttatcc ggggcagaga acaacagcgg gttactctca 214441 aacctgctct ggagtttcag actagcagct gacatttttg gggacctttc tttaagcttg 214501 acattgtaaa cactttgtat gtattaaccc acttaatcct caaaacaacc ttgatgagta 214561 ggtatcctta tcttacggaa gaggaagctg aagcacagac agggttagtg acttacctaa 214621 ggtcacagtg ctggaaggta gcccagctgg aatttgaacc cagacagcat gtgggttcag 214681 agtgttttca tttttaacta ctctgctata ttaccctctc tagttttcag tattcatatt 214741 agcacctcag ccttccccag atgactacaa atcccataca tatgtgtata cacacatacc 214801 cagggtcatt catgaaatct gaaacttcgg gtgttcaatt tgactttata gaaccttcta 214861 cctctcctgg ctgatagtat ctctttctca tttcacaaat taaacttcct ataacactac 214921 cttaaacatg ccacccctgc tcaaaatact ctggtggctt cctacgatct ataagatgaa 214981 agctaacttc cttagcttgc cattcaaggc ccttcacagt ctttccagtc ttactgttta 215041 ttctatccct acttgagcac aaagcatttt tttctttgat catgcatagt ccttctttct 215101 tcattttttt aagtgtacaa ttcagtggat tttagtatat tcagtgttat gcaaccatct 215161 ccactatcta atgccagaac attttcatct ctgctatcta atgccagaac attagaaacc 215221 ctgtaccgca tagtggccat tcccaattca cccttcctct tatcccctag caaccaccaa 215281 tctattttct gtctttacga attagcttat tctgggtatt tcatataaat ggaatcatac 215341 aatatgtggc catttgtgtc tggcttgttt catgtcacaa tttttcaagg ttcatctgtg 215401 tcgcagcatg tgatcaggat ttcattgttt tcttatggct gaataacatt ccattgtgtg 215461 gatataacat tttgtgtatt cattcaattt gtcccaattg gtggacaact gggttgtttc 215521 cacttttggc tattataata aagctatgaa catttgtgta caagtttttg tgtgaatcta 215581 tgctttcagt tcccttgggt gtaaccctag gaatggaatt gctgggtcat atggtaactg 215641 tgaacttttt gaggaactga caaactattt tccacaagtg gctgcaccat tttacattcc 215701 caccagcaat gcttgagggt tccaatttct ccgtatcttc accaacatct gtttgtgtgt 215761 ttttgattat tgccatccta gtaggtgtga aatgatatat cattgtggtt ttgatttgca 215821 tttccctaat gactaaagat attaaacagc ttttcacgag cttattggcc atgtatatgt 215881 ctcttttaga gaactgtcta ttcatatcca ttttgtaaac tgggttatct ttttattgtt 215941 gaattctaag agttctttag acatctgaat actagactta tgatttacaa atattttctc 216001 ctattctgtg ggatagcttt ttactttctt gatagtatcc tttggtgcac aaaagctttt 216061 aattttgatg aagtccaatt tatcactgat cataccagct ggaaatctac ttttttctat 216121 gtcagcatct caatattact tgatgatctg gggagaattt ttgaaaatat aaaaataata 216181 tacataaaat tttgatataa aacaagaggg ccaagacttg aagaaatctg gagcttctgt 216241 tgagacgatg acctggggtc ccaatgatta ctgttaggat cttaatgtac ttagtggaaa 216301 ggttgagagg cgctggccta cggcaggcac tcctacagta agactgaacc taaactgaca 216361 ttggtcagtt tacttgggaa gttgaccata tgttttttgt ttgttttctc ccaggctata 216421 gtacagtggc gtgatcttgg ctccctgcaa cctctgtctt ccaggttcaa gcagttctac 216481 tgcctcagcc tcctgagtag ctgggattac aggtgtgtgc taccacaccc agttagtttt 216541 tgtattttta gtagagatgg ggtttcacca tgttgcccag ggtggtctcc aactcctgac 216601 ctcaaatgat ccgcccacct tggcctccca aagtgctggg attacaggcg tgagccatgg 216661 tgcctggccg atggtataca ttttaaaaat aaatgtatgc tatctaggag ttcccatgtt 216721 ttgctattat ttcctattct ttcagagcac caatagtcaa tgaagaatta aaagtgaagt 216781 cagattatcc tttagctaaa gaatggggat ttctagtagt acggttcaat atttttggtc 216841 agggccccat atgtttatac tccaacttta atcaatcttt tccacttctg actgagggat 216901 atgctgtagg aactttatgt gtactatttc agttggcctc tgctggctta taaacagtag 216961 ttgatattta gggacagtaa tccagagtag tgtccaagtc catctgagtt tagaattgct 217021 accacagcag aaattggaaa cctcgagagt tataagccat ttaccacatc tacatggatg 217081 ccacttctcc ttaaagcata tacaaatata tagagatcgg aactttaacc ttctgggatc 217141 taatatcttc ctaagatttg catacagaat gtacacatgg atcttctatt atagcaaatt 217201 cttttttttt tgagacggag tttttgctct tgttgcccag gtggagtgca atggcgcgat 217261 ctcggctcac tgcaacctct gcctcccggg ttcaagtgat tctcctgcct cagcctccca 217321 agttgctggg attacaagcg cctgccacca tgcccggcta attttttgga tttttttttt 217381 tagtagagac agggtttcgc catgttggcc aggctggtct caaactcctg acctcaggtg 217441 atccacccac cttggctttc caaagtgctg ggattacagg tgtaagccac cgcgcctggc 217501 cagcaaattc ttaattgcct gtaaaattac tatctcacta tgtattattt tcggaagcag 217561 atatgaccgt gtacagtttt cttataaaag tgccatagca acatgaatac tcagtactga 217621 caagcatctt taattgacaa aagctttttt tttttttttt tttgagacgg agtctcactc 217681 tgtcccccag gctggagtgc aatggtacga tctcggctca ctgcaacctc cccctcccag 217741 gttcaagcga ttctcctgcc tcagcctccc gagtagctgg gattacaggc gtgcaccacc 217801 gtgcccggct aatttttgta tttttagtag agacagggtt tcaccatgtt ggtcagacca 217861 gtctcaaatt cctgacctca ggtgatccac ccacctcagc ctcccaaagt gctgggatta 217921 caggcatgag ccactgtgcc cggccatgac aaaagctttt tatttttaat ttaatcttag 217981 cctatggtta aagtgaagca tttaaatagc ctacttgatg acttcatgtg agccaagaac 218041 aaaacctggg tctgataatc ttctacctac tctctgacaa actgtcccta caattataat 218101 gcaatgacat ttctatgagc agcctctatt actaacagtt attatacact tggttcttat 218161 acagaatact tcattcaaga tgtccagagc ttcatctagc tcactatctt caaagcttct 218221 gcttgcaaag ggaggaactt acaccatcag tccattttaa gccaaaagga aattcttacc 218281 cagaaattaa ttacttatcc tttttttagt gtgttactct ccacaccccc ttcctgtttt 218341 gaaagcaggg agaaataatg atgtatacag tatcttctga actatacaaa ttacagataa 218401 gctttaaagg tctctaggaa gctactactg tttcaccata agaactccaa gtgggcccta 218461 aatctcctgg tccttgactt cagccactgc taggaaaatg tcaatacaca cacactccta 218521 tcttaaaaac tctgtgcttg agaagaagga tggtttgctt tgatattaga ctagctttcc 218581 ctggcagctg caactaggga acctaatttt tgttaaagct acaagtcaca atttattact 218641 tatattaaga aacgtaagga gatcaatcag gagaatgcta tattgcttaa ggtggttgat 218701 gcaacacaaa aaagatggag ccattagtcc aataacctta ttaaaaaaat tcctcctctg 218761 ggtgtggtgg ctcacgcctg taatcccagc tactccggag gctgaggcag gagaattgct 218821 ggaacctcgg aggtggaggt tgcagtgagc tgagattgta cctctgcact ccagcctggg 218881 caacaaagtg agactccatc tcaaaaaaaa aacaaaaaac acaactcctc tgtatctaac 218941 cttcttatca aaagtgtgaa acagatggtg tatgtgatga tcatgataat gaacataata 219001 acaatgtgaa cataataaca atgtgaacat aataacaaca acatctcttt attcctcaga 219061 taacatccag agcaggaact ctggagtcag gcagaactga atgtgacttt aagcaagtta 219121 ctcaattttt gtaggcccaa ctttcctatc tgtcaggcct acaaatggga aaaactaaac 219181 cataaggaat ctaagagttt tgtttattcc aaagagtaca ggatacaatg acctttcttt 219241 ttttttctgt cacccaggct ggagtgccat ggtgcaatct cggctcacta caacctctgc 219301 ctcctgggtt ccagcgatta tcctgtctca gcttcccaag tagctgggac tacaggcatg 219361 tgccaccatg cccagctaat ttttatcttt ttaatagaaa tgggtttcac catgttggcc 219421 aggctagtct ccaactcctg acctcaggtg atctgcctgc ctcggcctcc caaagtgctg 219481 ggattacagg cgtgagccac cgcgcctggc caatgacatt ccaaaaaggt actatcagga 219541 catcaagtac tacaaaatag gcaccggaag aaatatgcta aattagtaat cctcaaactc 219601 tagagtaata agtataacct ggcttgctag aacagattac tgggcactat cctcagagtt 219661 tctcaatcag tagttttgga gtagaccaaa gaatctgcat ttctaacagg ctcccataag 219721 atacagatgc tgttggtcac aagaccatac tttgtgtctc cttggaaagg gagaaaataa 219781 atatttattg acagggaagg gaagaacaat tatttaagat gatcttgggt caagcatggt 219841 taagtgttta catagtttat cttatttagt ccttacaaca tcccaatgag caacagttct 219901 caaacttttc caggccaaag gagaatgaac acatcacaaa agagtatgcc taagcagtct 219961 tctagaaata gaaaatttct ttgatgactg tattacttta aaaattcata taaattttgt 220021 attctgtttc aaaatactgc tgctagttat gaatgaaata taaatctaaa aatatttatc 220081 agataaattg cttaactctt cattgcaatt ctttagtaaa attggcctgt gggaatataa 220141 agaaatgggt ggttgaactc tggtaatgag cttatggtga aacagctagt acatatttat 220201 ttgatccagc atttgaacct aggccagaag atctctaaag acaatgcaca tgttactacc 220261 ttactctgtc attgacctcc acctacataa aaggcctatc atgtttttct ctactgctct 220321 tttacgatct tctattgttc tggttaaaga ggactacttg ctgttcctca gatttttctc 220381 aaaatttttt tttctttttc cttagattca gggacagaca gggtcttact cttcctgatc 220441 tccagattac tctttgtagc tggaaagctc ctccgctttc atctctactt tactagaatc 220501 ttaccatccc ctttaaggcc cagctaaaat gtcactttct ttcagaagac caaataccat 220561 tattgattga ttgattgact gactgagaca aggcctcact cctgttgccc aggctggtgc 220621 agtgatgtga tcatggctca ctgcagcctt gacttctggg gctctggtgc atcctctcac 220681 ttcagtcccc tgagtagctg ggactacaga cacatgctac catgcccagc taatgcaaat 220741 atcattttta aaaggcgact gaactggacg cctcatatga gctcccatgg ctgcccagac 220801 atgctttcat gtcagtgatt atataatttt ttttgtaaat tagcttatgc aaataatctt 220861 gtggacccta acttatacat gcttctgcaa agaaacatgt ttaacgataa agttatactg 220921 gaattcaaaa catgatgttt tatggaatgt aagacattgg ggtatagata aaaagtggtt 220981 ggaaaaaata tatatattta tttttagaga tgaggtctcc ttctgtcatc taggttggaa 221041 tgcagtggca tcatcatagt tcactgcagt ctcaaattcc tgggctcaaa tgatcctccc 221101 accttggtct cctgaatagc tgggactaca ggtgcatgcc atcatgcctg gctaattaaa 221161 aacaaaattt atttatttat ttttgagaca gaatcttgct cttttgccca tgctggagtg 221221 tagaggtatg atcttggctc aatgcagcct caacgtcctg agttgagcgg aggaccccgg 221281 gctccagtga tccttccacc tcagcctccc aagcagcagg gactacagac atatgccacc 221341 cagcccagtt aattttgttc actttttgta gagatgaggt atcactatgt tgcccaggct 221401 ggtcttgaac tcccggactc aagtgatcct cttgctttgg cctcccaaag tgctggggtt 221461 acaggcgtaa gcaccgtgcc tgacctggaa aattgtattt aaaagaaatt cttggccagg 221521 ggcaatggct tatgcctgta atctcagcac tttgggaggc tgaggctggt ggattgcttg 221581 agctcaggag tttgagacca ccctgggcaa cacggtaaaa ccctgtttct acaaaaaata 221641 cagaaaaaat tagccaggca tggtggctca cgcctgtagt tccagctact caggaggctg 221701 aggctagaga attgcttgat ctgggaagca gacgttgcag tgagctgaga ttgcaccact 221761 gcactccagc ctgaatgaca gagtaacacc atgtctcaaa aaaaaaaaaa aaaaaaaaga 221821 aagaaagaaa aagaaattcc caaacgggaa ggtttagatt atgatgaaat ttgtggctat 221881 gaactagaat aaagtctaaa catgtagaag taatatacct agacttttaa aattggaagg 221941 gtccttgaaa gtcatttagg acagtctccc tattttatag cagtagaatt tttggaccct 222001 gctttgtatt atggtcagaa gtctctttgt ttcattaatg ctgcttaatt ttcatctagc 222061 ttacctttgg tctctttata ttgtgattcc agtgctacct gaagaaagct ttctgcactg 222121 aataatggat ttttttctga agggccagac ctgactttgg tccaaaggga actaaactgc 222181 aggacctcct ttgattttaa aggtcgctaa ctttagtgat gctaattctg aacctctgag 222241 ggatgaattg ttacccccag ctgcaatttt tgatcagtct aacagccaag ccaccatcat 222301 ggccaagact agggaacact cacagtaagc agggtggcca tctgctccat ttggccagct 222361 ctagctttgg acactgagag gtggctcata tactgattgc atcttggttt aaccactgag 222421 agtgaaaaga gaaaaggaca ccaagggcat ttatcttttc cagccccaca tgctcctccc 222481 aatctcagcg gtaaagagca ggaactctgg agtcaggcag aactcagtgt gacttactca 222541 atctttgtag ggctgacttt cctatctgta ccaggagatc ttaacagcac tataattacc 222601 tcacgggtaa ctggcaggtt taggtgggga actgcctgtt gaccatttag gacaatgtct 222661 ggtacacaga aaatccttaa taaatgccag ctattactat tatcaagggg tagcaacatt 222721 aacataccca atcaacgtga ataagatcct cttctctgta taaccttttc catggcattt 222781 cattccctcc ctgagtcagt tatgatacca atactgtctc ctaaaactct gtctttagcc 222841 cagcttcttt cctgagctca agaccaggct tcacaactcc caagatacct ggaagtgccc 222901 ttggaaccta aagtgaaccc caaaccagat // LOCUS HSU48224 1465 bp mRNA PRI 24-MAR-1996 DEFINITION Human beaded filament protein CP49 (LIFL-L) mRNA, complete cds. ACCESSION U48224 NID g1236056 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1465) AUTHORS Hess,J.F., Casselman,J.T. and FitzGerald,P.G. TITLE Gene structure and cDNA sequence identify the beaded filament protein CP49 as a highly divergent type I intermediate filament protein JOURNAL J. Biol. Chem. 271 (12), 6729-6735 (1996) MEDLINE 96215094 REFERENCE 2 (bases 1 to 1465) AUTHORS Hess,J.F., Casselman,J.T. and Fitzgerald,P.G. TITLE Direct Submission JOURNAL Submitted (01-FEB-1996) John F. Hess, Cell Biology & Human Anatomy, University of California, School of Medicine, Davis, CA 95616, USA FEATURES Location/Qualifiers source 1..1465 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="3" /map="3q21-26" /tissue_type="lens" gene 1..1248 /gene="LIFL-L" CDS 1..1248 /gene="LIFL-L" /note="a very divergent type I intermediate filament protein expressed only in the lens" /codon_start=1 /product="beaded filament protein CP49" /db_xref="PID:g1236057" /translation="MSERRVVVDLPTSASSSMPLQRRRASFRGPRSSSSLESPPASRT NAMSGLVRAPGVYVGTAPSGCIGGLGARVTRRALGISSVFLQGLRSSGLATVPAPGLE RDHGAVEDLGGCLVEYMAKVHALEQVSQELETQLRMHLESKATRSGNWGALRASWASS CQQVGEAVLENARLMLQTETIQAGADDFKERYENEQPFRKAAEEEINSLYKVIDEANL TKMDLESQIESLKEELGSLSRNYEEDVKLLHKQLAGCELEQMDAPIGTGLDDILETIR IQWERDVEKNRVEAGALLQAKQQAEVAHMSQTQEEKLAAALRVELHNTSCQVQSLQAE TESLRALKRGLENTLHDAKHWHDMELQNLGAVVGRLEAELREIRAEAEQQQQERAHLL ARKCQLQKDVASYHALLDREESG" BASE COUNT 356 a 378 c 450 g 280 t 1 others ORIGIN 1 atgagtgaga ggcgggtggt agtggacttg cccaccagtg ccagctccag catgcccctc 61 cagaggcgca gggcgtcctt cagggggcca cggtcatcat cctccctgga gagcccccca 121 gcctccagga ccaatgccat gagtggcctt gtccgagcac ccggggtcta tgtaggaaca 181 gcacccagtg ggtgcatagg tggcttgggt gcccgtgtga cccgccgggc cctcggcatc 241 agcagtgtct tccttcaggg cctgcggagc tcaggcctgg ccaccgtgcc ggctccaggt 301 ttggagaggg accatggtgc tgttgaggac ctagggggct gcctggtgga atatatggcc 361 aaagtgcacg cccttgagca agtcagtcag gagctggaaa cacaactgcg gatgcacctg 421 gagagcaaag ccacacgctc gggaaactgg ggtgcsctac gggcttcctg ggccagcagc 481 tgccagcagg tgggtgaggc agtcttggaa aatgcccggc tcatgctgca gacagaaact 541 atccaggccg gagcagatga ctttaaagag agatatgaaa atgagcagcc atttcgaaag 601 gcagcagaag aggaaattaa ctctctgtat aaagtcattg atgaggctaa tttgactaaa 661 atggacctgg agagtcaaat agaaagtctg aaagaagaac ttggctctct atcaagaaac 721 tatgaagagg atgtgaagct gctgcacaaa cagttggcag ggtgtgagct ggaacaaatg 781 gatgctccca ttggcactgg tctggacgac atccttgaga cgatcagaat tcagtgggag 841 agagatgttg aaaagaaccg ggtggaggca ggagccctgc tccaagctaa gcaacaggcg 901 gaggtggccc acatgtccca gacccaggag gagaagctgg cagctgccct cagggtggag 961 ttacacaaca cttcgtgcca agtccagagc ctccaggctg agacagaatc cttacgtgcc 1021 ctgaaacgag gcctggagaa caccttgcac gatgccaagc actggcatga catggagctc 1081 cagaacctgg gcgctgtggt cggccggctg gaggcggagc tcagggaaat ccgagcggag 1141 gcggagcagc agcaacagga gcgcgcgcat ctgctggccc gcaagtgcca gctgcagaag 1201 gacgtggcgt cctaccacgc cctgctggac agggaggaga gcggctgatg gagaaacttc 1261 ctctttttca tgaagaaaac acccttcctc aacagctgac ccaagaagtt gcttgaggag 1321 ctttctcctg agctccagtc cctgctggat tccctggtta attcagcttg agctgaaaag 1381 cttcctggaa gtggagagat ccttctgctt taatctgagt agtctgtagc ttgagcaatc 1441 tccttgtcct cttccaataa tgctt // LOCUS HSU48263 1198 bp mRNA PRI 23-AUG-1996 DEFINITION Human pre-pro-orphanin FQ (OFQ) mRNA, complete cds. ACCESSION U48263 NID g1185009 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1198) AUTHORS Nothacker,H.P., Reinscheid,R.K., Mansour,A., Henningsen,R.A., Ardati,A., Monsma,F.J. Jr., Watson,S.J. and Civelli,O. TITLE Primary structure and tissue distribution of the orphanin FQ precursor JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (16), 8677-8682 (1996) MEDLINE 96323283 REFERENCE 2 (bases 1 to 1198) AUTHORS Nothacker,H.-P. and Henningsen,R.A. TITLE Direct Submission JOURNAL Submitted (02-FEB-1996) Hans-Peter Nothacker, PRPN 69/202, Hoffmann-La Roche AG, Grenzacherstr 124, Basel, 4070, Switzerland FEATURES Location/Qualifiers source 1..1198 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" 5'UTR 1..211 gene 212..742 /gene="OFQ" CDS 212..742 /gene="OFQ" /codon_start=1 /product="pre-pro-orphanin FQ" /db_xref="PID:g1185010" /translation="MKVLLCDLLLLSLFSSVFSSCQRDCLTCQEKLHPALDSFDLEVC ILECEEKVFPSPLWTPCTKVMARSSWQLSPAAPEHVAAALYQPRASEMQHLRRMPRVR SLFQEQEEPEPGMEEAGEMEQKQLQKRFGGFTGARKSARKLANQKRFSEFMRQYLVLS MQSSQRRRTLHQNGNV" sig_peptide 212..268 /gene="OFQ" mat_peptide 599..649 /gene="OFQ" /product="orphanin FQ" 3'UTR 743..1198 polyA_site 1198 /note="29 A nucleotides" BASE COUNT 244 a 335 c 345 g 273 t 1 others ORIGIN 1 gccaggaagg cttgcaggtt ctgctgtttg gttgctgaag ggggtcagtg tgtgtatgtg 61 tcatggaggt gggcagggaa ggggagggct gtgcgtgggg gagatgagga tatatatgct 121 ggtgtggctg agaaagcgga accgagcctc gcatccatcg gagggagccg gggactgaca 181 gctctcagca cctgcttcct gctcctgcac catgaaagtc ctgctttgtg acctgctgct 241 gctcagtctc ttctccagtg tgttcagcag ttgtcagagg gactgtctca catgccagga 301 gaagctccac ccagccctgg acagcttcga cctggaggtg tgcatcctcg agtgcgaaga 361 gaaggtcttc cccagccccc tctggactcc atgcaccaag gtcatggcca ggagctcttg 421 gcagctcagc cctgccgccc cagagcatgt ggcggctgct ctctaccagc cgagagcttc 481 ggagatgcag catctgcggc gaatgccccg agtccggagc ttgttccagg agcaggaaga 541 gcccgagcct ggcatggagg aggctggtga gatggagcag aagcagctgc agaagagatt 601 tgggggcttc accggggccc ggaagtcggc caggaagttg gccaatcaga agcggttcag 661 tgagtttatg aggcaatact tggtcctgag catgcagtcc agccagcgcc ggcgcaccct 721 gcaccagaat ggtaatgtgt agccggaagg ggcgctcctc ccagctgtac cggccactgc 781 aacccatgag cgtccaggtg atcccccaaa cagcatgtgc tcagmcccag acctgccgcc 841 tgggaatcag gattccttct tccccaaggc actgagcgcc tgcagatccc gcaggcttcg 901 tttgcctcca gaaccttccc gtctgattgt tcctccccag ccccctggca tgtttcacca 961 caaccctgtt gctacatcag agtgtatttt tgtaattcct ctagctacca tttcaatagc 1021 cccatctctc ctgctcaccc gcctcttgcc ccttctaggg gcaggtgaaa ggaataggaa 1081 attgaacctg gggttttgac ttgccactgc cataacttgt ttgtaaaaga gctgttcttt 1141 ttgactgatt gttttaaaca acgatttctc cattaaactt ctactgagca aatggtta // LOCUS HSU48296 2200 bp mRNA PRI 01-JAN-1997 DEFINITION Human protein tyrosine phosphatase PTPCAAX1 (hPTPCAAX1) mRNA, complete cds. ACCESSION U48296 NID g1777754 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2200) AUTHORS Crowell,P.L., Crowell,D.N., Randall,S.K. and Cates,C.A. TITLE Direct Submission JOURNAL Submitted (01-FEB-1996) Pamela L. Crowell, Biology, IUPUI, 723 West Michigan St., SL310, Indianapolis, IN 46202-5132, USA FEATURES Location/Qualifiers source 1..2200 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="breast carcinoma cell line ZR-75-1 (ATCC CRL 1500)" gene 393..914 /gene="hPTPCAAX1" CDS 393..914 /gene="hPTPCAAX1" /codon_start=1 /product="protein tyrosine phosphatase PTPCAAX1" /db_xref="PID:g1777755" /translation="MARMNRPAPVEVTYKNMRFLITHNPTNATLNKFIEELKKYGVTT IVRVCEATYDTTLVEKEGIHVLDWPFDDGAPPSNQIVDDWLSLVKIKFREEPGCCIAV HCVAGLGRAPVLVALALIEGGMKYEDAVQFIRQKRRGAFNSKQLLYLEKYRPKMRLRF KDSNGHRNNCCIQ" BASE COUNT 664 a 379 c 415 g 742 t ORIGIN 1 cgggattact gccaggcaca gcacgacctc tatgcagaca agtgaactgt agaaactgat 61 tactgctcca ccaagaagcc cccataagag tggttatcct ggacacagaa gtgttgaatt 121 gaaatccaca gagcatttta caagagttct gacctggatg gggtaaacct cagtgcactt 181 cttttctgtt ggcctcagta ttactggatt gaagaattgc tgcttcttgt taggaggttc 241 atttcactta tcattactta caacttcata ctcaaagcac tgagaatttc aagtggagta 301 tattgaagta gacttcagtt tctttgcatc atttctgtat tcaatttttt taattatttc 361 ataaccctat tgagtgtttt taactaaata acatggctcg aatgaaccgc ccagctcctg 421 tggaagtcac atacaagaac atgagatttc ttattacaca caatccaacc aatgcgacct 481 taaacaaatt tatagaggaa cttaagaagt atggagttac cacaatagta agagtatgtg 541 aagcaactta tgacactact cttgtggaga aagaaggtat ccatgttctt gattggcctt 601 ttgatgatgg tgcaccacca tccaaccaga ttgttgatga ctggttaagt cttgtgaaaa 661 ttaagtttcg tgaagaacct ggttgttgta ttgctgttca ttgcgttgca ggccttggga 721 gagctccagt acttgttgcc ctagcattaa ttgaaggtgg aatgaaatac gaagatgcag 781 tacaattcat aagacaaaag cggcgtggag cttttaacag caagcaactt ctgtatttgg 841 agaagtatcg tcctaaaatg cggctgcgtt tcaaagattc caacggtcat agaaacaact 901 gttgcattca ataaaattgg ggtgcctaat gctactggaa gtggaacttg agatagggcc 961 taatttgtta tacatattag ccaacatgtt ggcttagtaa gtctaatgaa gcttccatag 1021 gagtattgaa aggcagtttt accaggcctc aagctagaca gatttggcaa cctctgtatt 1081 tgggttacag tcaacctatt tggatacttg gcaaaagatt cttgctgtca gcatataaaa 1141 tgtgcttgtc atttgtatca attgaccttt ccccaaatca tgcagtattg agttatgact 1201 tgttaaatct attcccatgc cagaatctta tcaatacata agaaatttag gaagattagg 1261 tgccaaaata cccagcacaa tacttgtata tttttagtac catacagaag taaaatccca 1321 ggaactatga acactagacc ttatgtggtt tattccttca atcatttcaa acattgaaag 1381 tagggcctac atggttattt gcctgctcac tttatgttta catctcccac attcatacca 1441 atatacgtca ggtttgctta accattgatt tttttttttt ttaccaagtc ttacagtgat 1501 tattttacgt gtttccatgt atctcacttt gtgctgtatt aaaaaaacct ccattttgaa 1561 aatctacgtt gtacagaagc acatgtcttt aatgtcttca gacaaaaaag ccttacatta 1621 atttaatgtt tgcactctga ggtgcaactt aacagggagg gcctgagaaa agaatgggag 1681 ggggctatta attatttttt agcaaaatgt tgcctttgtc ttgtgcaaac atgtagaata 1741 tgctctttaa tctagtaaaa tattttttta aaaggtagag atgctttgtt attgtaatca 1801 taaacttcct gaaattcttg taattttttc ccatacttat cagaagtgtg tttaccaact 1861 tatttttgtt tgaaagtgtg attttttttt tccttcccaa cctctcttgc aaaaaaagaa 1921 atgggtttct gctaatgaat tgagcagaga tctaatattt tatatgcctt ttgagctgtg 1981 taagttaata tttgatactt gacaatttgt tttattatgt aattgataaa atggtgatgt 2041 gtattaatgt tagttcaacc atatatttat actgtctggg gatgtgtggt tatagttctg 2101 tgggagaaat aattttgtca gtgttcacca gcttgtaaaa acttagtgcg agagctgaaa 2161 catctaaata aataatgaca tgcatttatc atcattgaaa // LOCUS HSU48361 1735 bp mRNA PRI 16-JUL-1996 DEFINITION Human NGFI-A binding protein 2 (NAB2) mRNA, complete cds. ACCESSION U48361 NID g1206026 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1735) AUTHORS Russo,M.W., Sevetson,B.R. and Milbrandt,J. TITLE Identification of NAB1, a repressor of NGFI-A- and Krox20-mediated transcription JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (15), 6873-6877 (1995) MEDLINE 95350172 REFERENCE 2 (bases 1 to 1735) AUTHORS Svaren,J., Sevetson,B.R., Apel,E.D., Zimonjic,D.B., Popescu,N.C. and Milbrandt,J. TITLE NAB2, a corepressor of NGFI-A (Egr-1) and Krox20, is induced by proliferative and differentiative stimuli JOURNAL Mol. Cell. Biol. 16 (7), 3545-3553 (1996) MEDLINE 96251303 REFERENCE 3 (bases 1 to 1735) AUTHORS Svaren,J. and Milbrandt,J. TITLE Direct Submission JOURNAL Submitted (03-FEB-1996) Jeffrey Milbrandt, Pathology, Washington University School of Medicine, 660 S. Euclid Ave., St. Louis, MO 63110, USA FEATURES Location/Qualifiers source 1..1735 /organism="Homo sapiens" /note="sequence found in human Drop9 EST" /db_xref="taxon:9606" /chromosome="12" /map="12q13.3-14.1" /tissue_type="placenta" gene 158..1735 /gene="NAB2" CDS 158..1735 /gene="NAB2" /note="similar to R. norvegicus NAB1 protein encoded by GenBank Accession Number U17253" /codon_start=1 /function="transcriptional co-repressor of NGFI-A and Krox20" /product="NGFI-A binding protein 2" /db_xref="PID:g1206027" /translation="MHRAPSPTAEQPPGGGDSARRTLQPRLKPSARAMALPRTLGELQ LYRVLQRANLLSYYETFIQQGGDDVQQLCEAGEEEFLEIMALVGMATKPLHVRRLQKA LREWATNPGLFSQPVPAVPVSSIPLFKISETAGTRKGSMSNGHGSPGEKAGSARSFSP KSPLELGEKLSPLPGGPGAGDPRIWPGRSTPESDVGAGGEEEAGSPPFSPPAGGGVPE GTGAGGLAAGGTGGGPDRLEPEMVRMVVESVERIFRSFPRGDAGEVTSLLKLNKKLAR SVGHIFEMDDNDSQKEEEIRKYSIIYGRFDSKRREGKQLSLHELTINEAAAQFCMRDN TLLLRRVELFSLSRQVARESTYLSSLKGSRLHPEELGGPPLKKLKQEVGEQSHPEIQQ PPPGPESYVPPYRPSLEEDSASLSGESLDGHLQAVGSCPRLTPPPADLPLALPAHGLW SRHILQQTLMDEGLRLARLVSHDRVGRLSPCVPAKPPLAEFEEGLLDRCPAPGPHPAL VEGRRSSVKVEAEASRQ" BASE COUNT 357 a 516 c 594 g 268 t ORIGIN 1 gacagaggcg cggaggctcg gagagagaag acgtggaggg agggacagag cctggacagc 61 ggtggacacg gcatcgtgcg cggggaagag ggcagcacgc agcaggcgcc gagcgccggg 121 caccgggaag ggcagcccgg gtgatctccg gccgtccatg cacagagcgc cttcccccac 181 agccgagcag ccgccgggcg gaggggacag cgcccgccgg accctgcagc ccagactcaa 241 gcccagtgcc cgagccatgg cactgcctcg gacgctgggg gagctgcagc tgtaccgggt 301 cctgcagcgc gccaacctcc tttcctacta tgagaccttc atccagcagg gaggggacga 361 cgtgcagcag ctgtgtgagg cgggtgagga ggagtttctg gagatcatgg cacttgtggg 421 catggccacc aagcccctcc atgtccggcg cctgcagaag gcactgagag agtgggccac 481 caatccaggg ctcttcagtc aaccagtgcc tgctgttccc gtctccagca tcccgctctt 541 caagatctct gagactgcgg gtacccggaa agggagcatg agcaatgggc atggcagccc 601 aggggaaaag gcaggcagtg cccgcagttt tagccccaag agcccccttg aacttggaga 661 gaagctatca ccactgcctg ggggacctgg ggcaggggac ccccggatct ggccaggccg 721 gagcactcca gagtcggacg ttggggcagg aggagaagag gaggctggct cgcccccctt 781 ctccccccct gcagggggag gagtccctga ggggactggg gctggggggc tggcagcagg 841 tgggactggg ggtggtccag accgactgga gccagagatg gtacgcatgg tggtggaaag 901 tgtggagagg atcttccgga gcttcccaag gggggatgct ggggaggtca catccctgct 961 aaagctgaat aagaagctgg cacggagcgt tgggcacatc tttgagatgg atgataatga 1021 cagccagaag gaagaggaga tccgcaaata cagcatcatc tatggccgtt tcgactctaa 1081 gcggcgggag ggcaagcagc tcagcctgca cgagctcacc atcaacgagg ctgctgccca 1141 gttctgcatg agggacaaca cgctcttatt acggagagtg gagctcttct ctttgtcccg 1201 ccaagtagcc cgagagagca cctacttgtc ctccttgaag ggctccaggc ttcaccctga 1261 agaactggga ggccctccac tgaagaagct gaaacaagag gttggagaac agagtcaccc 1321 tgaaatccag cagcctcccc caggccctga gtcctatgta cccccatacc gccccagcct 1381 ggaggaggac agcgccagcc tgtctgggga gagtctggat ggacatttgc aggctgtggg 1441 gtcatgtcca aggctgacgc cgccccctgc tgacctgcct ctggcattgc cagcccatgg 1501 gctatggagc cgacacatcc tgcagcagac actgatggac gaggggctgc ggctcgcccg 1561 cctcgtctcc cacgaccgcg tgggccgcct cagcccctgt gtgcctgcga agccacctct 1621 cgcagagttc gaggaagggc tgctggacag atgtcctgcc ccaggacccc atcccgcgct 1681 ggtggagggt cgcaggagca gcgtgaaagt ggaggctgag gccagccggc agtga // LOCUS HSU48408 1347 bp mRNA PRI 31-JAN-1997 DEFINITION Human kidney water channel (hKID) mRNA, complete cds. ACCESSION U48408 NID g1293545 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1347) AUTHORS Ma,T., Yang,B., Kuo,W.L. and Verkman,A.S. TITLE cDNA cloning and gene structure of a novel water channel expressed exclusively in human kidney: evidence for a gene cluster of aquaporins at chromosome locus 12q13 JOURNAL Genomics 35 (3), 543-550 (1996) MEDLINE 97001157 REFERENCE 2 (bases 1 to 1347) AUTHORS Verkman,A.S., Ma,T. and Yang,B. TITLE Direct Submission JOURNAL Submitted (05-FEB-1996) Alan S. Verkman, CVRI, UCSF, 3rd. & Parnassus, San Francisco, CA 94143-0521, USA FEATURES Location/Qualifiers source 1..1347 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="12" /map="12q13" gene 343..1191 /gene="hKID" CDS 343..1191 /gene="hKID" /codon_start=1 /product="water channel" /db_xref="PID:g1293546" /translation="MDAEVPGGRGWASMLACRLWKAISRALFAEFLATGLYVFFGVGS VMRWPTALPSVLQIAITFNLVTAMAVQVTWKTSGAHANPAVTLAFLVGSHISLPRAVA YVAAQLVGATVGAALLYGVMPGDIRETLGINVVRNSVSTGQAVAVELLLTLQLVLCVF ASTDSRQTSGSPATMIGISWALGHLIGILFTGCSMNPARSFGPAIIIGKFTVHWVFWV GPLMGALLASLIYNFVLFPDTKTLAQRLAILTGTVEVGTGARAGAEPLKKESQPGSGA VEMESV" BASE COUNT 262 a 407 c 427 g 251 t ORIGIN 1 tcatgtagga ggttccagct cagtgcccag agacagcccc cacatcccac cccatcaggt 61 cagctcaggc tggagcaggt atatgagcaa cacccctcat tccaccccca tacacatgca 121 taagcccatc cgcctgcgcc ctgccagtct gaccagcaca gtcctgggga caggagcgtg 181 gtggaggagc tgcaggtggg ggccagagaa gcctccacag agagcaaaag cccagggtca 241 gccagaagac aggacaccag aagaagacag gaagaatcaa gaagaccaga ggaacagaga 301 agaggcccca gagcaaggca agaacggcca aggcaccagg acatggatgc tgaggtgcca 361 gggggacgtg gctgggccag catgttggcg tgcaggcttt ggaaagccat cagcagggcg 421 ctgtttgcag agttcctggc cacggggctg tatgtgttct ttggcgtggg ctcagtcatg 481 cgctggccca cagcacttcc ctccgtgcta cagattgcca tcaccttcaa cctggtcacc 541 gccatggctg tgcaggtcac ctggaagacc agcggggccc acgccaaccc cgccgtgacg 601 ctggccttcc tcgtaggctc ccacatctct ctgccccgtg ctgtggccta tgtggctgcc 661 cagctggtgg gggccacggt gggggctgct ctgctttatg gggtcatgcc gggagacatc 721 cgagagaccc ttgggatcaa cgtggtccgg aacagtgtct caactggcca ggcggtggca 781 gtggagctgc ttctgaccct gcagctggtg ctctgtgtct tcgcttccac cgacagccgt 841 cagacatcag gctccccggc caccatgatt gggatctctt gggcactggg ccacctcatt 901 gggatcctct tcactggctg ctccatgaat ccagcccgct ccttcggccc tgccatcatc 961 attgggaagt tcacagtcca ctgggtcttc tgggtggggc ccctgatggg agccctcctg 1021 gcctcactga tctacaactt cgtcctgttc cccgacacca agaccctggc gcagcggctg 1081 gctatcctca caggcaccgt agaggtgggg acaggggcac gggcaggggc ggagcccctg 1141 aagaaggaat cccagccggg ttcgggagcc gtggagatgg agagtgtgtg aaacagccta 1201 cgcctggccg cgccttgggt tcctgccttg cagacctgcc tggaggttct ccctggggtg 1261 gcgggagggg gaggttactt ttgtctgacc aggtgggctg gaggggacaa gcctatacct 1321 ggcattacgt tctaggtaga atctggg // LOCUS HSU48707 662 bp mRNA PRI 14-JUN-1996 DEFINITION Human protein phosphatase-1 inhibitor mRNA, complete cds. ACCESSION U48707 NID g1374797 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 662) AUTHORS Endo,S., Zhou,X., Connor,J., Wang,B. and Shenolikar,S. TITLE Multiple structural elements define the specificity of recombinant human inhibitor-1 as a protein phosphatase-1 inhibitor JOURNAL Biochemistry 35 (16), 5220-5228 (1996) MEDLINE 96196629 REFERENCE 2 (bases 1 to 662) AUTHORS Shenolikar,S. TITLE Direct Submission JOURNAL Submitted (07-FEB-1996) Shirish Shenolikar, Duke University Medical Center, Pharmacology, C151 LSRC South LaSalle Street Extension, Durham, NC 27708, USA FEATURES Location/Qualifiers source 1..662 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /dev_stage="adult" /chromosome="12" CDS 9..524 /codon_start=1 /product="protein phosphatase-1 inhibitor" /db_xref="PID:g1374798" /translation="MEQDNSPRKIQFTVPLLEPHLDPEAAEQIRRRRPTPATLVLTSD QSSPEIDEDRIPNPHLKSTLAMSPRQRKKMTRITPTMKELQMMVEHHLGQQQQGEEPE GAAESTGTQESRPPGIPDTEVESRLGTSGTAKKTAECIPKTHERGSKEPSTKEPSTHI PPLDSKGANSV" BASE COUNT 198 a 184 c 171 g 109 t ORIGIN 1 ccccagccat ggagcaagac aacagccccc gaaagatcca gttcacggtc ccgctgctgg 61 agccgcacct tgaccccgag gcggcggagc agattcggag gcgccgcccc acccctgcca 121 ccctcgtgct gaccagtgac cagtcatccc cagagataga tgaagaccgg atccccaacc 181 cacatctcaa gtccactttg gcaatgtcgc cacggcaacg gaagaagatg acaaggatca 241 cacccacaat gaaagagctc cagatgatgg ttgaacatca cctggggcaa cagcagcaag 301 gagaggaacc tgagggggcc gctgagagca caggaaccca ggagtcccgc ccacctggga 361 tcccagacac agaagtggag tcaaggctgg gcacctctgg gacagcaaaa aaaactgcag 421 aatgcatccc taaaactcac gagagaggca gtaaggaacc cagcacaaaa gaaccctcaa 481 cccatatacc accactggat tccaagggag ccaactcggt ctgagagagg aggaggtatc 541 ttgggatcaa gactgcagtt tgggaatgca tggacaccgg atttgtttct tattccttca 601 cttttgggga aaatctcttg tttttaaaaa gtgataaatt tggtgttagg tcaaaaaaaa 661 aa // LOCUS HSU48736 1718 bp mRNA PRI 02-JUL-1996 DEFINITION Human serine/threonine-protein kinase PRP4h (PRP4h) mRNA, complete cds. ACCESSION U48736 NID g1399461 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1718) AUTHORS Luetzelberger,M., Schnell,B., Klingenhoff,A., Peter,C., Gross,T., Shenoy,S. and Kaeufer,N.F. TITLE Prp4, a kinase from the fission yeast, involved in pre-mRNA splicing and its mammalian homolog JOURNAL Unpublished REFERENCE 2 (bases 1 to 1718) AUTHORS Luetzelberger,M. TITLE Direct Submission JOURNAL Submitted (07-FEB-1996) Martin Luetzelberger, Genetics, Technical University Braunschweig, Spielmannstrasse 7, Braunschweig, 38106, Germany FEATURES Location/Qualifiers source 1..1718 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa S3" gene 114..1604 /gene="PRP4h" CDS 114..1604 /gene="PRP4h" /note="similar to S. pombe serine/threonine-protein kinase PRP4: SwissProt Accession Number Q07538" /codon_start=1 /product="serine/threonine-protein kinase PRP4h" /db_xref="PID:g1399462" /translation="MKVEQESSSDDNLEDFDVEEEDEEALIEQRRIQRQAIVQKYKYL AEDSNMSVPSEPSSPQSSTRTRSPSPDDILERVAADVKEYERENVDTFEASVKAKHNL MTVEQNNGSSQKKLLAPDMFTESDDMFAAYFDSARLRAAGIGKDFKENPNLRDNWTDA EGYYRVNIGEVLDKRYNVYGYTGQGVFSNVVRARDNARANQEVAVKIIRNNELMQKTG LKELEFLKKLNDADPDDKFHCLRLFRHFYHKQHLCLVFEPLSMNLREVLKKYGKDVGL HIKAVRSYSQQLFLALKLLKRCNILHADIKPDNILVNESKTILKLCDFGSASHVADND ITPYLVSRFYRAPEIIIGKSYDYGIDMWSVGCTLYELYTGKILFPGKTNNHMLKLAMD LKGKMPNKMIRKGVFKDQHFDQNLNFMYIEVDKVTEREKVTVMSTINPTKDLLADLIG CQRLPEDQRKKVHQLKDLLDQILMLDPAKRISINQALQHAFIQEKI" BASE COUNT 584 a 305 c 383 g 446 t ORIGIN 1 ggtcggagga gcagatcacg cttgcgaagg cggtctcgat cacgcggtgg tcgtagacga 61 aggagcagaa gcaaagtaag gaagataaat ttaaaggaag tctttctgaa ggaatgaaag 121 ttgagcagga atcttcgtct gatgataacc ttgaagactt tgatgtagag gaagaagatg 181 aagaagccct aatagaacag agaagaatcc aaaggcaggc aattgttcag aaatataaat 241 accttgctga agatagcaac atgtctgtgc catctgaacc aagcagcccc cagagcagta 301 cgagaacacg atcaccatct ccagatgaca ttctggagcg agtagctgct gatgttaaag 361 agtatgaacg ggaaaatgtt gatacatttg aggcctcagt gaaagccaag cataatctaa 421 tgacagttga acagaataat ggttcatctc agaagaagtt gttggcacct gatatgttta 481 cagaatctga tgatatgttt gctgcgtatt ttgatagtgc tcgtcttcgg gccgctggca 541 ttggaaaaga tttcaaagag aatcccaacc tcagagataa ctggaccgat gcagaaggct 601 attatcgtgt gaacataggt gaagtcctag ataaacgtta caatgtgtat ggctacactg 661 ggcaaggtgt attcagtaat gttgtacgag ccagagataa tgcaagagcc aaccaagaag 721 tggctgtaaa gatcatcaga aacaatgagc tcatgcaaaa gactggttta aaagaattag 781 agttcttgaa aaagcttaat gatgctgatc ctgatgacaa atttcattgt ctgagactct 841 tcaggcactt ctatcacaag cagcatcttt gtctggtatt cgagcctctc agcatgaact 901 tacgagaggt gttaaaaaaa tatggtaaag atgttggtct tcatattaaa gctgtaagat 961 cctatagtca gcagttgttc ctggcattga aactccttaa aagatgcaat atcctacatg 1021 cagatatcaa gccagacaat atcctggtta atgaatccaa aactatttta aagctttgcg 1081 attttgggtc ggcttcacat gttgcggata atgacataac accttatctt gtcagtagat 1141 tttatcgtgc tcctgaaatc attataggta aaagctatga ctatggtata gatatgtggt 1201 ctgtaggttg caccttatac gaactctata ctggaaaaat tttattccct ggcaaaacca 1261 ataaccatat gctgaagctt gcaatggatc tcaaaggaaa gatgccaaat aagatgattc 1321 gaaaaggtgt gttcaaagat cagcattttg atcaaaatct caacttcatg tacatagaag 1381 ttgataaagt aacagagagg gagaaagtta ctgttatgag caccattaat ccaactaagg 1441 acctgttggc tgacttgatt gggtgccaga gacttcctga agaccaacgt aagaaagtac 1501 accagctaaa ggacttgttg gaccagattc tgatgttgga cccagctaaa cgaattagca 1561 tcaaccaggc cctacagcac gccttcatcc aggaaaaaat ttaaacaaga tgaagaaact 1621 ccaagggttt gagtaaatac aaagatgaag aaatttcaca gcagtttcat taatgtatat 1681 aaacttataa atatttctcc agcaaatttg aggaagca // LOCUS HSU48861 2455 bp mRNA PRI 14-MAR-1996 DEFINITION Human beta 4 nicotinic acetylcholine receptor subunit mRNA, complete cds. ACCESSION U48861 NID g1224053 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2455) AUTHORS Gerzanich,V., Kuryatov,A., Anand,R. and Lindstrom,J. TITLE Orphan alpha 6 nicotinic AChR subunit forms a functional heteromeric receptor JOURNAL Unpublished (1996) REFERENCE 2 (bases 1 to 2455) AUTHORS Kuryatov,A. and Anand,R. TITLE Direct Submission JOURNAL Submitted (09-FEB-1996) Alexander Kuryatov, Department of Neuroscience, University of Pennsylvania, Stemmler Hall 235, 36th & Hamilton Walk, Philadelphia, PA 19104, USA FEATURES Location/Qualifiers source 1..2455 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 121..1617 /codon_start=1 /product="beta 4 nicotinic acetylcholine receptor subunit" /db_xref="PID:g1224054" /translation="MRRAPSLVLFFLVALCGRGNCRVANAEEKLMDDLLNKTRYNNLI RPATSSSQLISIKLQLSLAQLISVNEREQIMTTNVWLKQEWTDYRLTWNSSRYEGVNI LRIPAKRIWLPDIVLYNNADGTYEVSVYTNLIVRSNGSVLWLPPAIYKSACKIEVKYF PFDQQNCTLKFRSWTYDHTEIDMVLMTPTASMDDFTPSGEWDIVALPGRRTVNPQDPS YVDVTYDFIIKRKPLFYTINLIIPCVLTTLLAILVFYLPSDCGEKMTLCISVLLALTF FLLLISKIVPPTSLDVPLIGKYLMFTMVLVTFSIVTSVCVLNVHHRSPSTHTMAPWVK RCFLHKLPTFLFMKRPGPDSSPARAFPPSKSCVTKPEATATSTSPSNFYGNSMYFVNP ASAASKSPAGSTPVAIPRDFWLRSSGRFRQDVQEALEGVSFIAQHMKNDDEDQSVVED WKYVAMVVDRLFLWVFMFVCVLGTVGLFLPPLFQTHAASEGPYAAQRD" BASE COUNT 480 a 818 c 600 g 557 t ORIGIN 1 ggcacgagcc gccagcaaac ctcgggggcc aggaccggcg ctcactcgac cgcgcggctc 61 acgggtgccc tgtgacccca cagcggacgt cgcggcggct gccacccggc cccgccggcc 121 atgaggcgcg cgccttccct ggtccttttc ttcctggtcg ccctttgcgg gcgcgggaac 181 tgccgcgtgg ccaatgcgga ggaaaagctg atggacgacc ttctgaacaa aacccgttac 241 aataacctga tccgcccagc caccagctcc tcacagctca tctccatcaa gctgcagctc 301 tccctggccc agcttatcag cgtgaatgag cgagagcaga tcatgaccac caatgtctgg 361 ctgaaacagg aatggactga ttaccgcctg acctggaaca gctcccgcta cgagggtgtg 421 aacatcctga ggatccctgc aaagcgcatc tggttgcctg acatcgtgct ttacaacaac 481 gccgacggga cctatgaggt gtctgtctac accaacttga tagtccggtc caacggcagc 541 gtcctgtggc tgccccctgc catctacaag agcgcctgca agattgaggt gaagtacttt 601 cccttcgacc agcagaactg caccctcaag ttccgctcct ggacctatga ccacacggag 661 atagacatgg tcctcatgac gcccacagcc agcatggatg actttactcc cagtggtgag 721 tgggacatag tggccctccc agggagaagg acagtgaacc cacaagaccc cagctacgtg 781 gacgtgactt acgacttcat catcaagcgc aagcctctgt tctacaccat caacctcatc 841 atcccctgcg tgctcaccac cttgctggcc atcctcgtct tctacctgcc atccgactgc 901 ggcgagaaga tgacactgtg catctcagtg ctgctggcac tgacattctt cctgctgctc 961 atctccaaga tcgtgccacc cacctccctc gatgtgcctc tcatcggcaa gtacctcatg 1021 ttcaccatgg tgctggtcac cttctccatc gtcaccagcg tctgtgtgct caatgtgcac 1081 caccgctcgc ccagcaccca caccatggca ccctgggtca agcgctgctt cctgcacaag 1141 ctgcctacct tcctcttcat gaagcgccct ggccccgaca gcagcccggc cagagccttc 1201 ccgcccagca agtcatgcgt gaccaagccc gaggccaccg ccacctccac cagcccctcc 1261 aacttctatg ggaactccat gtactttgtg aaccccgcct ctgcagcttc caagtctcca 1321 gccggctcta ccccggtggc tatccccagg gatttctggc tgcggtcctc tgggaggttc 1381 cgacaggatg tgcaggaggc attagaaggt gtcagcttca tcgcccagca catgaagaat 1441 gacgatgaag accagagtgt cgttgaggac tggaagtacg tggctatggt ggtggaccgg 1501 ctgttcctgt gggtgttcat gtttgtgtgc gtcctgggca ctgtggggct cttcctaccg 1561 cccctcttcc agacccatgc agcttctgag gggccctacg ctgcccagcg tgactgaggg 1621 ccccctgggt tgtggggtga gaggatgtga gtggccgggt gggcactttg ctgcttcttt 1681 ctgggttgtg gccgatgagg cctaagtaaa tatgtgagca ttggccatca accccatcaa 1741 accagcccaa gccgtggaac aggcaaggat gggggcctgg gctgtcctct ctgaatgcct 1801 tggagggatc ccaggaagcc ccagtaggag ggagcttcag acagttcaat tctggcctgt 1861 cttccttccc tgcaccgggc aatggggata aagatgactt cgtagcagca cctactatgc 1921 ttcaggcatg gtgccggcct gcctctccat caccatctct ctccacttcc ccttgtccag 1981 ttcctcacac acttctagat tcttcccagc tcagaggctt ggcatttgcc atatcatcat 2041 cttttttttt ttcttttaaa cggagtcttg ctatatcgcc caggctcaag tgattctcct 2101 gcctgaacct cccaagtagc tgggattaca gaaagccgcc accgtgccca gctaattttt 2161 gtatttttgt tggccaggct gatttcgaac tcctgacctc agatgaccca cctgcctcgg 2221 cctcccaaag tgctgggatt acaggtgtga gccactatgc ctggccttcc ctatcttcta 2281 cctgaacctt cttcctcttc ccccagggct tcccaagctc tttccacagc cacatcagtc 2341 agggcatggc tccaatccat tctttgctcc aatgtcacct cctctgagag gccttcccta 2401 accacccaat cattctaaag cagcctccct acagtcacgt taccctgcct ccttc // LOCUS HSU49070 994 bp mRNA PRI 25-MAY-1996 DEFINITION Human peptidyl-prolyl isomerase and essential mitotic regulator (PIN1) mRNA, complete cds. ACCESSION U49070 NID g1332709 KEYWORDS peptidyl-prolyl isomerase; cell cycle; mitotic regulator; NIMA; Ess1; NIMA-interacting protein 1 (Pin1). SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 994) AUTHORS Lu,K.P., Hanes,S.D. and Hunter,T. TITLE A human peptidyl-prolyl isomerase essential for regulation of mitosis JOURNAL Nature 380 (6574), 544-547 (1996) MEDLINE 96195064 REFERENCE 2 (bases 1 to 994) AUTHORS Lu,K.P. and Hunter,T. TITLE Direct Submission JOURNAL Submitted (12-FEB-1996) Kun Ping Lu, Molecular Biology and Virology Laboratory, Salk Institute, 10010 North Torrey Pines Rd., La Jolla, CA 92037, USA FEATURES Location/Qualifiers source 1..994 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" gene 25..516 /gene="PIN1" CDS 25..516 /gene="PIN1" /note="NIMA-interacting protein 1, essential mitotic regulator, essential peptidyl-prolyl isomerase" /codon_start=1 /product="Pin1" /db_xref="PID:g1332710" /translation="MADEEKLPPGWEKRMSRSSGRVYYFNHITNASQWERPSGNSSSG GKNGQGEPARVRCSHLLVKHSQSRRPSSWRQEKITRTKEEALELINGYIQKIKSGEED FESLASQFSDCSSAKARGDLGAFSRGQMQKPFEDASFALRTGEMSGPVFTDSGIHIIL RTE" BASE COUNT 202 a 309 c 324 g 159 t ORIGIN 1 tgctggccag cacctcgagg gaagatggcg gacgaggaga agctgccgcc cggctgggag 61 aagcgcatga gccgcagctc aggccgagtg tactacttca accacatcac taacgccagc 121 cagtgggagc ggcccagcgg caacagcagc agtggtggca aaaacgggca gggggagcct 181 gccagggtcc gctgctcgca cctgctggtg aagcacagcc agtcacggcg gccctcgtcc 241 tggcggcagg agaagatcac ccggaccaag gaggaggccc tggagctgat caacggctac 301 atccagaaga tcaagtcggg agaggaggac tttgagtctc tggcctcaca gttcagcgac 361 tgcagctcag ccaaggccag gggagacctg ggtgccttca gcagaggtca gatgcagaag 421 ccatttgaag acgcctcgtt tgcgctgcgg acgggggaga tgagcgggcc cgtgttcacg 481 gattccggca tccacatcat cctccgcact gagtgagggt ggggagccca ggcctggcct 541 cggggcaggg cagggcggct aggccggcca gctccccctt gcccgccagc cagtggccga 601 accccccact ccctgccacc gtcacacagt atttattgtt cccacaatgg ctgggagggg 661 gcccttccag attgggggcc ctggggtccc cactccctgt ccatccccag ttggggctgc 721 gaccgccaga ttctccctta aggaattgac ttcagcaggg gtgggaggct cccagaccca 781 gggcagtgtg gtgggagggg tgttccaaag agaaggcctg gtcagcagag ccgccccgtg 841 tccccccagg tgctggaggc agactcgagg gccgaattgt ttctagttag gccacgctcc 901 tctgttcagt cgcaaaggtg aacactcatg cggcagccat gggccctctg agcaactgtg 961 cagacccttt cacccccaat taaacccaga acca // LOCUS HSU49082 2431 bp mRNA PRI 13-FEB-1997 DEFINITION Human transporter protein (g17) mRNA, complete cds. ACCESSION U49082 NID g1840044 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2431) AUTHORS Latif,F., Lerman,M., Minna,J., Duh,F.M., Koonin,E. and Bader,S. TITLE Direct Submission JOURNAL Submitted (12-FEB-1996) Farida Latif, Laboratory of Immunobiology, NCI-FCRDC, Room 12-71, Bldg. 560, Frederick, MD 21702, USA FEATURES Location/Qualifiers source 1..2431 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="3" /map="3p21.3" gene 125..1639 /gene="g17" CDS 125..1639 /gene="g17" /note="member of a new family of transporter protein genes; resides in the lung cancer TSG region" /codon_start=1 /product="transporter protein" /db_xref="PID:g1840045" /translation="MEAPLQTEMVELVPNGKHSEGLLPVITPMAGNQRVEDPARSCME GKSFLQKSPSKEPHFTDFEGKTSFGMSVFNLSNAIMGSGILGLAYAMANTGIILFLFL LTAVALLSSYSIHLLLKSSGVVGIRAYEQLGYRAFGTPGKLAAALAITLQNIGAMSSY LYIIKSELPLVIQTFLNLEEKTSDWYMNGNYLVILVSVTIILPLALMRQLGYLGYSSG FSLSCMVFFLIAVIYKKFHVPCPLPPNFNNTTGNFSHVEIVKEKVQLQVEPEASAFCT PSYFTLNSQTAYTIPIMAFAFVCHPEVLPIYTELKDPSKKKMQHISNLSIAVMYIMYF LAALFGYLTFYNGVESELLHTYSKVDPFDVLILCVRVAVLTAVTLTVPIVLFPVRRAI QQMLFPNQEFSWLRHVLIAVGLLTCINLLVIFAPNILGIFGVIGATSAPFLIFIFPAI FYFRIMPTEKEPARSTPKILALCFAMLGFLLMTMSLSFIIIDWASGTSRHGGNH" BASE COUNT 477 a 805 c 622 g 527 t ORIGIN 1 ggcacgaggc cgggagcaga gcgaaccgca ccggcccgag cggagcgccg cacgttccca 61 accgcgaggc cagacatctg actgttggtg tgagaccagt gctcctggtg gtgtgccctg 121 agccatggag gcgcctttgc agacagagat ggtggagctg gtgcccaatg gcaaacactc 181 agaggggctg ctcccggtca tcacccccat ggcaggcaac cagagggtcg aggaccctgc 241 acggagctgt atggagggca agagcttcct acagaaaagt cccagcaagg agccacactt 301 cactgacttc gaggggaaga catcattcgg gatgtcagtg ttcaacctca gcaatgccat 361 catgggcagc ggcatcctgg gactcgccta tgccatggcc aatacgggca ttatcctttt 421 cctgttcctg ttgacagctg tcgccttgct ctccagctac tccatccacc tgctactcaa 481 gtcctcaggg gtcgtgggca tccgtgccta tgagcagctg ggctaccgtg cctttgggac 541 cccaggaaag ctggcagcag ccctggccat cacgctccag aacatcggag ccatgtccag 601 ctacctgtac atcatcaagt ctgagctgcc acttgtcata cagaccttcc tgaacctgga 661 ggagaaaacc tcggactggt acatgaacgg gaactacctg gtaatccttg tctctgtcac 721 catcattctg cccctggcac tgatgcggca gcttggctac ctgggctact ccagcggctt 781 ctctcttagc tgcatggtgt tcttcctaat tgcagtcatc tacaaaaagt tccacgtgcc 841 ctgcccactg ccccccaact tcaacaacac cacaggcaac ttcagccacg tggagatcgt 901 gaaggagaag gtgcagctgc aggtcgagcc tgaggcttca gccttctgca ctcccagcta 961 cttcacgctc aactcacaga cagcatacac catccccatc atggccttcg ccttcgtctg 1021 ccaccccgag gtgctgccca tctatactga gctcaaggac ccctccaaga agaagatgca 1081 gcacatctcc aacctgtcca tcgctgtcat gtacatcatg tacttcctgg ctgccctctt 1141 cggctacctc accttctaca acggggtgga gtcggagctg ctgcacacct acagcaaggt 1201 ggacccgttt gacgtcctga tcctgtgtgt gcgcgtggcc gtgctgacag cagtcacgct 1261 cacagtgccc atcgttctgt tcccggtgcg ccgcgccatc cagcagatgc tgtttccaaa 1321 ccaggagttc agctggctgc ggcatgtgct tattgccgtt ggcctgctca cttgtatcaa 1381 cctgctggtc atctttgccc ccaacatcct gggcatcttt ggggtcatcg gtgccacatc 1441 tgccccattc ctcatcttca tcttccctgc catcttctac ttccgaatca tgcccacgga 1501 gaaggagcct gcaagatcca cccccaaaat cctggccctg tgttttgcta tgcttggctt 1561 cttgctgatg accatgagct tgagcttcat catcattgac tgggcctcag ggaccagccg 1621 gcatggagga aaccactagg gtgaccctca tcctgttctg tctactcacc ctagcagccc 1681 tgcccagact cttcagcccc tgctcccatc cagtggccag tcgggggagg agaaagacgc 1741 gattaacact gtggcattca gccaggcccc atgtcctctc tgtggaaggt ttttgttcaa 1801 gagccaggac caaggccctt gggccactac cctgctaggc tctggagctg tagaggcttc 1861 ctgaactggg agcagggtag ggctgtcgcc ttagatcccg cccaagcccc tcattccctc 1921 cttgcacaga tgcatacact ggggcccagc agctgcctcc tgaggtgaca cagcctgtag 1981 gaacatacac agctgggatc agcctgcagc catcccccga ccctgctgct aggccacggt 2041 ctgcgccctg gggcctcatc tcccccagcc cacttgtttt cccccctttt attccctagg 2101 cccttttcag actcctgggc ccttggatac tcttctccca tctcccttca caggatgaca 2161 ccctcccata ccccatagct ggggccagca ggttctgctg agggtggggc tggtgtaggg 2221 acccccaaga gacccctgtc ctgtcccttc accagtcctg gggaggctgg gactccccct 2281 gccacaagcc tgggccacag ctcacattcc actgctggga gaagaaacag gccgaggccc 2341 agagtggcct gcccccggga gccaaagacc ccagtggcca cactgggata gggtggggag 2401 gctggcagcc ctcttttata aatctcgtgc c // LOCUS HSU49089 3100 bp mRNA PRI 24-JUN-1997 DEFINITION Human neuroendocrine-dlg (NE-dlg) mRNA, complete cds. ACCESSION U49089 NID g1515354 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3100) AUTHORS Makino,K., Kuwahara,H., Masuko,N., Nishiyama,Y., Morisaki,T., Sasaki,J., Nakao,M., Kuwano,A., Nakata,M., Ushio,Y. and Saya,H. TITLE Cloning and characterization of NE-dlg: a novel human homolog of the Drosophila discs large (dlg) tumor suppressor protein interacts with the APC protein JOURNAL Oncogene 14 (20), 2425-2433 (1997) MEDLINE 97332623 REFERENCE 2 (bases 1 to 3100) AUTHORS Makino,K., Kuwahara,H., Nakao,M. and Saya,H. TITLE Direct Submission JOURNAL Submitted (13-FEB-1996) Hideyuki Saya, Tumor Genetics and Biology, Kumamoto University School of Medicine, 2-2-1 Honjo, Kumamoto 860, Japan FEATURES Location/Qualifiers source 1..3100 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="brain" /map="Xq13" gene 234..2687 /gene="NE-dlg" CDS 234..2687 /gene="NE-dlg" /note="similar to human homolog of Drosophila discs large protein, encoded by Genbank Accession Number U13896" /codon_start=1 /product="neuroendocrine-dlg" /db_xref="PID:g1515355" /translation="MHKHQHCCKCPECYEVTRLAALRRLEPPGYGDWQVPDPYGPGGG NGASAGYGGYSSQTLPSQAGATPTPRTKAKLIPTGRDVGPVPLKPVPGKSTPKLNGSG PSWWPECTCTNRDWYEQVNGSDGMFKYEEIVLERGNSGLGFSIAGGIDNPHVPDDPGI FITKIIPGGAAAMDGRLGVNDCVLRVNEVEVSEVVHSRAVEALKEAGPVVRLVVRRRQ PPPETIMEVNLLKGPKGLGFSIAGGIGNQHIPGDNSIYITKIIEGGAAQKDGRLQIGD RLLAVNNTNLQDVRHEEAVASLKNTSDMVYLKVAKPGSLHLNDMYAPPDYASTFTALA DNHISHNSSLGYLGAVESKVSYPAPPQVPPTRYSPIPRHMLAEEDFTREPRKIILHKG STGLGFNIVGGEDGEGIFVSFILAGGPADLSGELRRGDRILSVNGVNLRNATHEQAAA ALKRAGQSVTIVAQYRPEEYSRFESKIHDLREQMMNSSMSSGSGSLRTSEKRSLYVRA LFDYDRTRDSCLPSQGLSFSYGDILHVINASDDEWWQARLVTPHGESEQIGVIPSKKR VEKKERARLKTVKFHARTGMIESNRDFPGLSDDYYGAKNLKGQEDAILSYEPVTRQEI HYARPVIILGPMKDRVNDDLISEFPHKFGSCVPHTTRPRRDNEVDGQDYHFVVSREQM EKDIQDNKFIEAGQFNDNLYGTSIQSVRAVAERGKHCILDVSGNAIKRLQQAQLYPIA IFIKPKSIEALMEMNRRQTYEQANKIYDKAMKLEQEFGEYFTAIVQGDSLEEIYNKIK QIIEDQSGHYIWVPSPEKL" repeat_unit 621..884 /gene="NE-dlg" /note="dlg homology repeat 1; DHR1" repeat_unit 909..1166 /gene="NE-dlg" /note="dlg homology repeat 2; DHR2" repeat_unit 1368..1628 /gene="NE-dlg" /note="dlg homology repeat 3; DHR3" misc_feature 1740..1937 /gene="NE-dlg" /note="encodes SH3 Domain" misc_feature 2115..2642 /gene="NE-dlg" /note="encodes guanylate kinase-related domain" BASE COUNT 753 a 799 c 902 g 646 t ORIGIN 1 tgctggaggg ggagccgtgg gtgcggggca ccgtgggggc cgaggccccg ggacgcccgc 61 ccggccccag gccccgctca gcccgggcgc ccccacgggt gcccccccct tcttggtccg 121 agcagtgtga gtgtgccagg gagcccggcg gcggcggcgg cggtggtggc ggcggtggcg 181 gcggcgtgga atccggcgtg ggctgggggg tccgagccgc ggggggcagt gccatgcaca 241 agcaccagca ctgctgtaag tgccctgagt gctatgaggt gacccgcctg gccgccctgc 301 ggcgcctcga gcctccgggg tacggcgact ggcaagtccc cgacccttac gggccaggtg 361 ggggcaacgg cgccagcgcg ggttatgggg gctacagctc gcagaccttg ccctcgcagg 421 cgggggccac ccccacccct cgcaccaagg ccaagctcat ccccaccggc cgggatgtgg 481 ggccggtgcc tcttaagcca gtcccgggca agagcacccc caaactcaac ggcagcggcc 541 ccagctggtg gccagagtgc acctgtacca accgggactg gtatgagcag gtgaatggca 601 gtgatggcat gttcaaatat gaggaaatcg tacttgagag gggcaactct ggcctgggct 661 tcagtatcgc aggtggcatc gacaatcccc atgtccctga tgaccctggc atttttatta 721 ccaagattat ccctggtgga gcagctgcca tggatgggag gctgggggtg aatgactgtg 781 tgctgcgggt gaatgaggtg gaagtgtcgg aggtggtaca cagccgggcg gtggaggcgc 841 tgaaagaggc aggccctgtg gtgcgattgg tggtgcggag gcgacagcct ccacccgaga 901 ccatcatgga ggtcaacctg ctcaaagggc ccaaaggcct gggtttcagc attgctgggg 961 gtattggcaa ccagcacatc ccaggagaca acagcatcta catcaccaag atcattgagg 1021 ggggtgctgc tcagaaggat ggacgcctac agattgggga ccggctgctg gcggtgaaca 1081 acaccaatct gcaggatgtg aggcacgagg aagctgtggc ctcactgaag aacacatctg 1141 atatggtgta tttgaaggtg gccaagccag gcagcctcca cctcaacgac atgtacgctc 1201 cccctgacta cgccagcact tttactgcct tggctgacaa ccacataagc cataattcca 1261 gcctgggtta tctcggggct gtggagagca aggtcagcta ccctgctcct cctcaggttc 1321 cccccacccg ctactctcct attcccaggc acatgctggc tgaggaggac ttcaccagag 1381 agcctcgcaa gatcatcctg cacaaaggct ccacaggcct gggcttcaac atcgtaggag 1441 gagaggatgg agaaggcatt tttgtctcct tcatcctggc aggaggccca gctgacctga 1501 gtggggagct gcgcagggga gaccggatct tatcggtgaa tggagtgaat ctgaggaatg 1561 caactcatga gcaggctgca gctgctctga aacgggccgg ccagtcagtc accattgtgg 1621 cccagtacag acctgaagaa tacagtcgct ttgaatcgaa gatacatgac ttacgagaac 1681 aaatgatgaa cagcagcatg agctctgggt ctgggtccct ccgaacaagt gaaaagaggt 1741 ccttgtatgt cagggccctg tttgattatg atcggactcg ggacagctgc ctgccaagcc 1801 aggggctcag cttctcttat ggtgacattc tgcatgtcat taatgcctct gatgatgagt 1861 ggtggcaggc aaggctggtg accccacacg gagaaagtga gcagatcggt gtgatcccca 1921 gtaagaagag ggtggaaaag aaagaaagag ctcgattgaa aactgtgaag ttccatgcca 1981 ggacggggat gattgagtct aacagggact tcccggggtt aagtgacgat tattatggag 2041 caaagaacct gaaaggacaa gaggatgcta ttttgtcata tgagccagtg acacggcaag 2101 aaattcacta tgcaaggcct gtgatcatcc tgggcccaat gaaggaccga gtcaatgatg 2161 acctgatctc cgaatttcca cataaatttg gatcctgtgt gccacatact acccggcctc 2221 gacgtgataa tgaggtggat ggacaagact accactttgt ggtgtcccga gaacaaatgg 2281 agaaagatat tcaggacaac aagttcatcg aggcgggcca atttaatgat aacctctatg 2341 ggaccagcat ccagtcagtg cgggcagttg cagagagggg caagcactgc atcttagatg 2401 tttccggcaa tgctatcaag agactgcagc aagcacaact ttaccccatt gccattttca 2461 tcaagcccaa gtccattgaa gcccttatgg aaatgaaccg aaggcagaca tatgaacaag 2521 caaataagat ctatgacaaa gccatgaaac tggagcagga atttggagag tactttacag 2581 ccattgtaca gggtgactca ctggaagaga tttataacaa aatcaaacaa atcattgagg 2641 accagtctgg gcactacatt tgggtcccat cccctgaaaa actctgaaga atcccctcca 2701 accattctct tgtgaacaga agaaatcaag tccctcttcc ctcctccctc ttcattcctg 2761 tccccatggg gagaacaaat gctactgttc ttgtcccctt ttttagatat gtcaaaaaaa 2821 attaagtttt ctagtcctgt tctttttttt ttttaagttt ttgtttgttt cagtttattt 2881 tttgggatga tgccatctca ttcatcatgt gactgtgccc attcctgcat ggacctttcc 2941 caagcgctag cataggtgca aaatccatca gagccattgt tttcataaaa accaagcaga 3001 agtgaagaga aaagaggagg actgatggaa agacagactc tggacagctg cacggcttgt 3061 gaagtgagct aaatgcacca catgatgaga tgctcctggg // LOCUS HSU49184 2379 bp mRNA PRI 25-APR-1996 DEFINITION Human occludin mRNA, complete cds. ACCESSION U49184 NID g1276978 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2379) AUTHORS Ando-Akatsuka,Y., Saitou,M., Hirase,T., Kishi,M., Sakakibara,A., Itoh,M., Yonemura,S., Furuse,M. and Tsukita,S. TITLE Interspecies diversity of the occludin sequence: cDNA cloning of human, mouse, dog, and rat-kangaroo homologues JOURNAL J. Cell Biol. 133 (1), 43-47 (1996) MEDLINE 96181088 REFERENCE 2 (bases 1 to 2379) AUTHORS Ando-Akatsuka,Y., Saitou,M., Hirase,T., Kishi,M., Sakakibara,A., Itoh,M., Yonemura,S., Furuse,M. and Tsukita,S. TITLE Direct Submission JOURNAL Submitted (01-FEB-1996) Yuhko Ando-Akatsuka, Cell Biology, Kyoto University Faculty of Medicine, Konoe-Yoshida, Sakyo-ku, Kyoto, 606-01, Japan FEATURES Location/Qualifiers source 1..2379 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 168..1736 /note="integral membrane protein localized at tight junctions" /codon_start=1 /product="occludin" /db_xref="PID:g1276979" /translation="MSSRPLESPPPYRPDEFKPNHYAPSNDIYGGEMHVRPMLSQPAY SFYPEDEILHFYKWTSPPGVIRILSMLIIVMCIAIFACVASTLAWDRGYGTSLLGGSV GYPYGGSGFGSYGSGYGYGYGYGYGYGGYTDPRAAKGFMLAMAAFCFIAALVIFVTSV IRSEMSRTRRYYLSVIIVSAILGIMVFIATIVYIMGVNPTAQSSGSLYGSQIYALCNQ FYTPAATGLYVDQYLYHYCVVDPQEAIAIVLGFMIIVAFALIIFFAVKTRRKMDRYDK SNILWDKEHIYDEQPPNVEEWVKNVSAGTQDVPSPPSDYVERVDSPMAYSSNGKVNDK RFYPESSYKSTPVPEVVQELPLTSPVDDFRQPRYSSGGNFETPSKRAPAKGRAGRSKR TEQDHYETDYTTGGESCDELEEDWIREYPPITSDQQRQLYKRNFDTGLQEYKSLQSEL DEINKELSRLDKELDDYREESEEYMAAADEYNRLKQVKGSADYKSKKNHCKQLKSKLS HIKKMVGDYDRQKT" BASE COUNT 686 a 494 c 526 g 673 t ORIGIN 1 ctcccgcgtc cacctctccc tccctgcttc ctctggcgga ggcggcagga accgagagcc 61 aggtccagag cgccgaggag ccggtctagg acgcagcaga ttggtttatc ttggaagcta 121 aagggcattg ctcatcctga agatcagctg accattgaca atcagccatg tcatccaggc 181 ctcttgaaag tccacctcct tacaggcctg atgaattcaa accgaatcat tatgcaccaa 241 gcaatgacat atatggtgga gagatgcatg ttcgaccaat gctctctcag ccagcctact 301 ctttttaccc agaagatgaa attcttcact tctacaaatg gacctctcct ccaggagtga 361 ttcggatcct gtctatgctc attattgtga tgtgcattgc catctttgcc tgtgtggcct 421 ccacgcttgc ctgggacaga ggctatggaa cttccctttt aggaggtagt gtaggctacc 481 cttatggagg aagtggcttt ggtagctacg gaagtggcta tggctatggc tatggttatg 541 gctatggcta cggaggctat acagacccaa gagcagcaaa gggcttcatg ttggccatgg 601 ctgccttttg tttcattgcc gcgttggtga tctttgttac cagtgttata agatctgaaa 661 tgtccagaac aagaagatac tacttaagtg tgataatagt gagtgctatc ctgggcatca 721 tggtgtttat tgccacaatt gtctatataa tgggagtgaa cccaactgct cagtcttctg 781 gatctctata tggttcacaa atatatgccc tctgcaacca attttataca cctgcagcta 841 ctggactcta cgtggatcag tatttgtatc actactgtgt tgtggatccc caggaggcca 901 ttgccattgt actggggttc atgattattg tggcttttgc tttaataatt ttctttgctg 961 tgaaaactcg aagaaagatg gacaggtatg acaagtccaa tattttgtgg gacaaggaac 1021 acatttatga tgagcagccc cccaatgtcg aggagtgggt taaaaatgtg tctgcaggca 1081 cacaggacgt gccttcaccc ccatctgact atgtggaaag agttgacagt cccatggcat 1141 actcttccaa tggcaaagtg aatgacaagc ggttttatcc agagtcttcc tataaatcca 1201 cgccggttcc tgaagtggtt caggagcttc cattaacttc gcctgtggat gacttcaggc 1261 agcctcgtta cagcagcggt ggtaactttg agacaccttc aaaaagagca cctgcaaagg 1321 gaagagcagg aaggtcaaag agaacagagc aagatcacta tgagacagac tacacaactg 1381 gcggcgagtc ctgtgatgag ctggaggagg actggatcag ggaatatcca cctatcactt 1441 cagatcaaca aagacaactg tacaagagga attttgacac tggcctacag gaatacaaga 1501 gcttacaatc agaacttgat gagatcaata aagaactctc ccgtttggat aaagaattgg 1561 atgactatag agaagaaagt gaagagtaca tggctgctgc tgatgaatac aatagactga 1621 agcaagtgaa gggatctgca gattacaaaa gtaagaagaa tcattgcaag cagttaaaga 1681 gcaaattgtc acacatcaag aagatggttg gagactatga tagacagaaa acatagaagg 1741 ctgatgccaa gttgtttgag aaattaagta tctgacatct ctgcaatctt ctcagaaggc 1801 aaatgacttt ggaccataac cccggaagcc aaacctctgt gagcatcaca aagttttggt 1861 tgctttaaca tcatcagtat tgaagcattt tataaatcgc ttttgataat caactgggct 1921 gaacactcca attaaggatt ttatgcttta aacattggtt cttgtattaa gaatgaaata 1981 ctgtttgagg tttttaagcc ttaaaggaag gttctggtgt gaactaaact ttcacacccc 2041 agacgatgtc ttcataccta catgtatttg tttgcatagg tgatctcatt taatcctctc 2101 aaccaccttt cagataactg ttatttataa tcactttttt ccacataagg aaactgggtt 2161 cctgcaatga agtctctgaa gtgaaactgc ttgtttccta gcacacactt ttggttaagt 2221 ctgttttatg acttcattaa taataaattc cctggccttt catattttag ctactatata 2281 tgtgatgatc taccagcctc cctatttttt ttctgttata taaatggtta aaagaggttt 2341 ttcttaaata ataaagatca tgtaaaagta aaaaaaaaa // LOCUS HSU49245 1138 bp mRNA PRI 07-MAR-1996 DEFINITION Human geranylgeranyl transferase type II beta-subunit mRNA, complete cds. ACCESSION U49245 NID g1216503 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1138) AUTHORS Chang,H.-Y., Wu,S.-R. and Peng,H.-L. TITLE Human geranylgeranyl transferase type II beta-subunit cDNA sequence JOURNAL Unpublished REFERENCE 2 (bases 1 to 1138) AUTHORS Peng,H.-L. TITLE Direct Submission JOURNAL Submitted (15-FEB-1996) Hwei-Ling Peng, Microbiology and Immunology, Chang Gung College of Medicine and Technology, 259 Wen-Hwa 1 Road, Kwei-San, Tao-Yuan, 33333, Taiwan FEATURES Location/Qualifiers source 1..1138 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Human fetal brain MATCHMAKER cDNA library" /tissue_type="brain" /dev_stage="19-22 weeks fetus" CDS 1..996 /codon_start=1 /product="geranylgeranyl transferase type II beta-subunit" /db_xref="PID:g1216504" /translation="MGTPQKDVIIKSDAPDTLLLEKHADYIASYGSKKDDYEYCMSEY LRMSGIYWGLTVMDLMGQLHRMNREEILAFIKSCQHECGGISASIGHDPHLLYTLSAV QILTLYDSINVIDVNKVVEYVKGLQKEDGSFAGDIWGEIDTRFSFCAVATFALLGKLD AINVEKAIEFVLSCMNFDGGFGCRPGSESHAGQIYCCTGFLAITSQLHQVNSDLLGWW LCERQLPSGGLNGRPEKLPDVCYSWWVLASLKIIGRLHWIDREKLRNFILACQDEETG GFADRPGDMVDPFHTLFGIAGLSLLGEEQIKPVNPVFCMPEEVLQRVNVQPELVS" BASE COUNT 323 a 169 c 270 g 376 t ORIGIN 1 atgggcactc cacagaagga tgttattatc aagtcagatg caccggacac tttgttattg 61 gagaaacatg cagattatat cgcatcctat ggctcaaaga aagatgatta tgaatactgt 121 atgtctgagt atttgagaat gagtggcatc tattggggtc tgacagtaat ggatctcatg 181 ggacaacttc atcgcatgaa tagagaagag attctggcat ttattaagtc ttgccaacat 241 gaatgtggtg gaataagtgc tagtatcgga catgatcctc atcttttata cactcttagt 301 gctgtccaga ttcttacgct gtatgacagt attaatgtta ttgacgtaaa taaagttgtg 361 gaatatgtta aaggtctaca gaaagaagat ggttcttttg ctggagatat ttggggagaa 421 attgacacaa gattctcttt ttgtgcggtg gcaactttcg ctttgttggg gaagcttgat 481 gctattaatg tggaaaaggc aatcgaattt gttttatcct gtatgaactt tgacggtgga 541 tttggttgca gaccaggttc tgaatcccat gctgggcaga tctattgttg cacaggattt 601 ctggctatta caagtcagtt gcatcaagta aattctgatt tacttggctg gtggctttgt 661 gaacgacaat taccctcagg cgggctcaat ggaaggccgg agaagttacc agatgtatgc 721 tactcatggt gggtcctggc ttccctaaag ataattggaa gacttcattg gattgataga 781 gagaaactgc gtaatttcat tttagcatgt caagatgaag aaacgggggg atttgcagac 841 aggccaggag atatggtgga tccttttcat accttatttg gaattgctgg attgtcactt 901 ttgggagaag aacagattaa acctgttaat cctgtctttt gcatgcctga agaagtgctt 961 cagagagtga atgttcagcc tgagctagtg agctagattc attgaattga aagttgcata 1021 gtatagtttt gccattttaa catttctgta tttgaagtgc ttatcgaatc taaaagtgac 1081 tactgttaat attttgtata ttgtgttaaa ttaattttaa taaattatat aattatat // LOCUS HSU49250 2894 bp mRNA PRI 12-MAR-1996 DEFINITION Human putative cerebral cortex transcriptional regulator T-Brain-1 (Tbr-1) mRNA, complete cds. ACCESSION U49250 S78865 NID g1222542 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2894) AUTHORS Bulfone,A., Smiga,S.M., Shimamura,K., Peterson,A., Puelles,L. and Rubenstein,J.L. TITLE T-brain-1: a homolog of Brachyury whose expression defines molecularly distinct domains within the cerebral cortex JOURNAL Neuron 15 (1), 63-78 (1995) MEDLINE 95344783 REFERENCE 2 (bases 1 to 2894) AUTHORS Smiga,S.M. TITLE Direct Submission JOURNAL Submitted (15-FEB-1996) Susan M. Smiga, Psychiatry, UCSF, 401 Parnassus Avenue, SF, CA 94143, USA FEATURES Location/Qualifiers source 1..2894 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="17 week fetal brain cDNA library (Stratagene)" /tissue_type="brain" /dev_stage="17 week fetus" gene 25..2073 /gene="Tbr-1" CDS 25..2073 /gene="Tbr-1" /note="putative transcriptional regulator; T-box brain gene; Tes-56" /codon_start=1 /product="T-Brain-1" /db_xref="PID:g1222543" /translation="MQLEHCLSPSIMLSKKFLNVSSSYPHSGGSELVLHDHPIISTTD NLERSSPLKKITRGMTNQSDTDNFPDSKDSPGDVQRSKLSPVLDGVSELRHSFDGSAA DRYLLSQSSQPQSAATAPSAMFPYPGQHGPAHPAFSIGSPSRYMAHHPVITNGAYNSL LSNSSPQGYPTAGYPYPQQYGHSYQGAPFYQFSSTQPGLVPGKAQVYLCNRPLWLKFH RHQTEMIITKQGRRMFPFLSFNISGLDPTAHYNIFVDVILADPNHWRFQGGKWVPCGK ADTNVQGNRVYMHPDSPNTGAHWMRQEISFGKLKLTNNKGASNNNGQMVVLQSLHKYQ PRLHVVEVNEDGTEDTSQPGRVQTFTFPETQFIAVTAYQNTDITQLKIDHNPFAKGFR DNYDTIYTGCDMDRLTPSPNDSPRSQIVPGARYAMAGSFLQDQFVSNYAKARFHPGAG AGPGPGTDRSVPHTNGLLSPQQAEDPGAPSPQRWFVTPANNRLDFAASAYDTATDFAG NAATLLSYAAAGVKALPLQAAGCTGRPLGYYADPSGWGARSPPQYCGTKSGSVLPCWP NSAAAAARMAGANPYLGEEAEGLAAERSPLPPGAAEDAKPKDLSDSSWIETPSSIKSI DSSDSGIYEQAKRRRISPADTPVSESSSPLKSEVLAQRDCEKNCAKDISGYYGFYSHS " BASE COUNT 605 a 946 c 708 g 635 t ORIGIN 1 ggcgagtgtt caggttctag agctatgcag ctggagcact gcctttctcc ttctatcatg 61 ctctccaaga aatttctcaa tgtgagcagc agctacccac attcaggcgg atccgagctt 121 gtcttgcacg atcatcccat tatctcgacc actgacaacc tggagagaag ttcacctttg 181 aaaaaaatta ccagggggat gacgaatcag tcagatacag acaattttcc tgactccaag 241 gactcaccag gggacgtcca gagaagtaaa ctctctcctg tcttggacgg ggtctctgag 301 cttcgtcaca gtttcgatgg ctctgctgca gatcgctacc tcctctctca gtccagccag 361 ccacagtctg cggccactgc tcccagtgcc atgttcccgt accccggcca gcacggaccg 421 gcgcaccccg ccttctccat cggcagccct agccgctaca tggcccacca cccggtcatc 481 accaacggag cctacaacag cctcctgtcc aactcctcgc cgcagggata ccccacggcc 541 ggctacccct acccacagca gtacggccac tcctaccaag gagctccgtt ctaccagttc 601 tcctccaccc agccggggct ggtgcccggc aaagcacagg tgtacctgtg caacaggccc 661 ctttggctga aatttcaccg gcaccaaacg gagatgatca tcaccaaaca gggaaggcgc 721 atgtttcctt ttttaagttt taacatttct ggtctcgatc ccacggctca ttacaatatt 781 tttgtggatg tgattttggc ggatcccaat cactggaggt ttcaaggagg caaatgggtt 841 ccttgcggca aagcggacac caatgtgcaa ggaaatcggg tctatatgca tccggattcc 901 cccaacactg gggctcactg gatgcgccaa gaaatctctt ttggaaaatt aaaacttacg 961 aacaacaaag gagcttcaaa taacaatggg cagatggtgg ttttacagtc cttgcacaag 1021 taccagcccc gcctgcatgt ggtggaagtg aacgaggacg gcacggagga cactagccag 1081 cccggccgcg tgcagacgtt cactttccct gagactcagt tcatcgccgt caccgcctac 1141 cagaacacgg atattacaca actgaaaata gatcacaacc cttttgcaaa aggatttcgg 1201 gataattatg acacgatcta caccggctgt gacatggacc gcctgacccc ctcgcccaac 1261 gactcgccgc gctcgcagat cgtgcccggg gcccgctacg ccatggccgg ctctttcctg 1321 caggaccagt tcgtgagcaa ctacgccaag gcccgcttcc acccgggcgc gggcgcgggc 1381 cccgggccgg gtacggaccg cagcgtgccg cacaccaacg ggctgctgtc gccgcagcag 1441 gccgaggacc cgggcgcgcc ctcgccgcaa cgctggtttg tgacgccggc caacaaccgg 1501 ctggacttcg cggcctcggc ctatgacacg gccacggact tcgcgggcaa cgcggccacg 1561 ctgctctctt acgcggcggc gggcgtgaag gcgctgccgc tgcaggctgc aggctgcact 1621 ggccgcccgc tcggctacta cgccgacccg tcgggctggg gcgcccgcag tcccccgcag 1681 tactgcggca ccaagtcggg ctcggtgctg ccctgctggc ccaacagcgc cgcggccgcc 1741 gcgcgcatgg ccggcgccaa tccctacctg ggcgaggagg ccgagggcct ggccgccgag 1801 cgctcgccgc tgccgcccgg cgccgccgag gacgccaagc ccaaggacct gtccgattcc 1861 agctggatcg agacgccctc ctcgatcaag tccatcgact ccagcgactc ggggatttac 1921 gagcaggcca agcggaggcg gatctcgccg gccgacacgc ccgtgtccga gagttcgtcc 1981 ccgctcaaga gcgaggtgct ggcccagcgg gactgcgaga agaactgcgc caaggacatt 2041 agcggctact atggcttcta ctcgcacagc taggccgccc ctacccgccc ggccccgccg 2101 cggcccggac ccccagccag cccctcacag ctcttcccca gctccgcctc cccacactcc 2161 tccttgcgca cccactcatt ttatttgacc ctcgatggcc gtctgcagcg aataagtgca 2221 ggtctccgag cgtgatttta accttttttg cacagcagtc tctgcaatta gctcaccgac 2281 cttcaacttt gctgtaaacc ttttggtttt gctacttact cttcttctgt ggagttatcc 2341 tcctacattc ccctccccct cgtcttctct tacctcctac ttctctttct tgtaatgaaa 2401 ctcttcacct ttaggagacc tgggcagtct gtcaggcagc agcgattcct ccgccaagtc 2461 tcggccctcc acattaacca taggatgttg actctagaac ctggacccac ccagcgcgtc 2521 ctttcttatc cccgagtgga tggatggatg gatggatgga tggatgttaa taatttagtg 2581 gacaagcctg tgaaatgatt gtacatagtg taattatgta acgaatggca tgttttattc 2641 tcgtcaaggc acaaaaccag ttcatgctta acctttttcc tttcctttct ttgcttttct 2701 ttctctcctc tcatactttc tcttctctct cttttaattt tcttgtgaga taatattcta 2761 agaggctcta gaaacatgaa atactcagta gtgatgggtt tcccacttct cctcaatccg 2821 ttgcatgaaa taattactat gtgccctaat gcacacaaat agctaaggag aatccaccca 2881 aacaccttta aagg // LOCUS HSU49260 1812 bp mRNA PRI 17-APR-1996 DEFINITION Human mevalonate pyrophosphate decarboxylase (MPD) mRNA, complete cds. ACCESSION U49260 NID g1235681 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1812) AUTHORS Toth,M.J. and Huwyler,L. TITLE Molecular cloning and expression of the cDNAs encoding human and yeast mevalonate pyrophosphate decarboxylase JOURNAL J. Biol. Chem. 271 (14), 7895-7898 (1996) MEDLINE 96215173 REFERENCE 2 (bases 1 to 1812) AUTHORS Toth,M.J. TITLE Direct Submission JOURNAL Submitted (15-FEB-1996) Matthew J. Toth, Research-CV, Ciba-Geigy, 556 Morris Ave, Summit, NJ 07901, USA FEATURES Location/Qualifiers source 1..1812 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" gene 8..1210 /gene="MPD" CDS 8..1210 /gene="MPD" /EC_number="4.1.1.33" /codon_start=1 /function="cholesterol biosynthesis pathway enzyme" /product="mevalonate pyrophosphate decarboxylase" /db_xref="PID:g1235682" /translation="MASEKPLAAVTCTAPVNIAVIKYWGKRDEELVLPINSSLSVTLH QDQLKTTTTAVISKDFTEDRIWLNGREEDVGQPRLQACLREIRCLARKRRNSRDGDPL PSSLSCKVHVASVNNFPTAAGLASSAAGYACLAYTLARVYGVESDLSEVARRGSGSAC RSLYGGFVEWQMGEQADGKDSIARQVAPESHWPELRVLILVVSAEKKLTGSTVGMRAS VETSPLLRFRAESVVPARMAEMARCIRERDFPSFAQLTMKDSNQFHATCLDTFPPISY LNAISWRIIHLVHRFNAHHGDTKVAYTFDAGPNAVIFTLDDTVAEFVAAVWHGFPPGS NGDTFLKGLQVRPAPLSAELQAALAMEPTPGGVKYIIVTQVGPGPQILDDPCAHLLGP DGLPKPAA" BASE COUNT 327 a 563 c 610 g 312 t ORIGIN 1 tgggaccatg gcctcggaga agccgctggc ggcagtcact tgtacagcgc cggtcaacat 61 cgcggtcatc aagtactggg gcaagcgcga tgaagagctg gttctgccca tcaactcctc 121 cctgagcgtc actctgcacc aggaccagtt aaaaaccacc acaacagccg tcatcagcaa 181 ggacttcacc gaggaccgga tttggctgaa tggccgggag gaggatgtgg ggcagccgag 241 gctgcaggcc tgcctgcggg agatccgctg cctggcccgg aagcggagga actcacggga 301 tggggacccg ctgccctcca gcctcagctg caaggtgcac gtggcatcgg tgaacaactt 361 ccccacggct gcgggcctgg cctcctcagc ggcgggctat gcctgcctag cctacaccct 421 ggcccgtgtc tacggcgtgg agagtgacct ctcagaagtg gctcgccggg gctcaggcag 481 cgcctgccgg agcctgtatg ggggctttgt ggagtggcag atgggagagc aggccgacgg 541 gaaggacagc atcgctcggc aagtggcccc cgagtcacac tggcctgaac tccgcgtgct 601 catccttgtg gtgagcgctg agaagaagct gacaggcagt accgtgggca tgcgggccag 661 tgtggagacc agccccctgc ttcggttccg ggccgagtcc gtggtgcccg cgcgcatggc 721 ggagatggcc cgctgcatcc gggagcgaga cttccccagc ttcgcccagc tgaccatgaa 781 ggacagcaac cagttccacg ccacctgcct cgacaccttc ccgcccatct cttacctcaa 841 tgccatctcc tggcgcatca tccacctggt gcaccgcttc aacgcccacc acggggacac 901 caaggtggcg tacacctttg acgcgggccc caatgccgtg atcttcaccc tggacgacac 961 tgtggctgag tttgtggctg ctgtgtggca cggctttccc ccaggctcga atggagacac 1021 gtttctgaag gggctgcagg tgaggccggc ccctctctca gctgagcttc aggctgcgct 1081 ggccatggag ccgacccccg gtggggtcaa atacatcatt gtcactcagg tggggccagg 1141 gcctcaaatc ctggatgacc cctgcgccca cctcctgggt cctgacggcc tgccgaagcc 1201 agctgcctga ctgcctcagc agggaccgca tgccgcttgg agaaggggtg gcctcgccgg 1261 agctagggag cggatgtggt gggctggccg gactcctggg acatgtgggt ggtggcttga 1321 ccccgggccc atgggcagct tgctgtgggg cagtgcaggg agtcctgcgg ccgcccaggt 1381 gtcaggagag gtccccgccg agtgcttcag ctgccctaag ctgcaccagc gctttgccaa 1441 gatgggatgg ggagggggta tgagaactgg cagagcctcg gtgcagcagg gctgaagggc 1501 tttctcaccc cagctctggc tatgcccagt tctctgagaa aggagctcag tggggaggtg 1561 gtccctccag cggaccaggg aaggggtcac tgtgctggga gcagcctcct tgggcctcag 1621 gaaaccacca agtgcctcgg atggtggctg cccacggcgc ttctgctgag accctgcccc 1681 cggcccaggt gtctcggagg gtggctgccc acggcctggg tgtggctgga atggtggcag 1741 gagtgggcac cagtgcggcc ccggtggcca tggggaataa accagcattg ctgccaaaaa 1801 aaaaaaaaaa aa // LOCUS HSU49283 1597 bp mRNA PRI 02-JAN-1998 DEFINITION Human NAD+-specific isocitrate dehydrogenase beta subunit precursor, mRNA, nuclear gene encoding mitochondrial protein, complete cds. ACCESSION U49283 NID g2737885 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1597) AUTHORS Ko,H., Kim,Y., Park,H., Oh,I., Yeo,S., Hong,S., Chae,B., Kim,S., Son,B., Lee,Y., Yeo,H. and Huh,T.-L. TITLE Mitochondrial NAD+-specific isocitrate dehydrogenase JOURNAL Unpublished REFERENCE 2 (bases 1 to 1597) AUTHORS Huh,T.-L. TITLE Direct Submission JOURNAL Submitted (10-FEB-1996) Tae-Lin Huh, Kyungpook National University, College of Natural Sciences, Genetic Engineering, 1370 Sankyuk-Dong, Pook-Ku, Taegu, 702-701, Korea FEATURES Location/Qualifiers source 1..1597 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hidhb" /sex="male" /tissue_type="heart" /dev_stage="adult" 5'UTR 1..79 transit_peptide 80..181 CDS 80..1237 /codon_start=1 /product="NAD+-specific isocitrate dehydrogenase beta precursor" /db_xref="PID:g2737886" /translation="MAVLSGVRWLTRALVSAGNPGAWRGLSTSAAAHAASRSQAEDVR VEGSFPVTMLPGDGVGPELMHAVKEVFKAAAVPVEFQEHHLSEVQNMASEEKLEQVLS SMKENKVAIIGKIHTPMEYKGELASYDMRLRRKLDLFANVVHVKSLPGYMTRHNNLDL VIIREQTEGEYSSLEHESARGVIECLKIVTRAKSQRIAKFAFDYATKKGRGKVTAVHK ANIMKLGDGLFLQCCEEVAELYPKIKFETMIIDNCCMQLVQNPYQFDVLVMPNLYGNI IDNLAAGLVGGAGVVPGESYSAEYAVFETGARHPFAQAVGRNIANPTAMLLSASNMLR HLNLEYHSSMIADAVKKVIKVGKVRTRDMGGYSTTTDFIKSVIGHLQTKGS" mat_peptide 182..1234 /function="decarboxylation of isocitrate to alpha-ketoglutarate" /product="NAD+-specific isocitrate dehydrogenase beta subunit" 3'UTR 1235..1597 BASE COUNT 370 a 388 c 471 g 368 t ORIGIN 1 tgatacatcg ttgctcattt tgcagaacat aacagttgca cattgcaaga gtcaacttgc 61 ttcgggtctt ttcatgctca tggcggtatt gagcggagtc cgctggctga cccgagcgct 121 ggtctccgcc gggaaccctg gggcatggag aggtctgagt acctcggccg cggcgcacgc 181 tgcatcgcgg agccaggccg aggacgtgag ggtggagggc tcctttcccg tgaccatgct 241 tccgggagac ggtgtggggc ctgagctgat gcacgccgtc aaggaggtgt tcaaggctgc 301 cgctgtccca gtggagttcc aggagcacca cctgagtgag gtgcagaata tggcatctga 361 ggagaagctg gagcaggtgc tgagttccat gaaggagaac aaagtggcca tcattggaaa 421 gattcatacc ccgatggagt ataaggggga gctagcctcc tatgatatgc ggctgaggcg 481 taagttggac ttatttgcca acgtagtcca tgtgaagtca cttcctgggt atatgactcg 541 gcacaacaat ctagacctgg tgatcattcg agagcagaca gaaggggagt acagctctct 601 ggaacatgag agtgcaaggg gtgtgattga gtgtttgaag attgtcacac gagccaagtc 661 tcagcggatt gcaaagttcg cctttgacta tgccaccaag aaggggcggg gcaaggtcac 721 tgctgtccac aaggccaaca tcatgaaact tggggatggg ttgttcctgc agtgctgtga 781 ggaagttgct gaactgtacc ccaaaatcaa atttgagaca atgatcatag acaactgctg 841 catgcagctg gtgcagaatc cttaccagtt tgatgtgctt gtgatgccca atctctatgg 901 gaacattatt gacaatctgg ctgctggcct ggttggggga gctggtgtgg tccctggtga 961 gagctatagt gcagaatacg cagtctttga gacgggtgcc cggcacccat ttgcccaggc 1021 agtgggcagg aatatagcca atcccacggc catgctgctg tcggcttcca acatgctgcg 1081 gcatcttaat cttgagtatc actccagcat gatcgcagat gcggtgaaga aggtgatcaa 1141 agttggcaag gtgcggactc gagacatggg cggctacagc accacaaccg acttcatcaa 1201 gtctgtcatc ggtcacctgc agactaaagg gagctagagc cctttatttc ttccaacctt 1261 gcaaggacca cactccccat acccttcagt gcagtgtacc agggaagaga ccttgtgcct 1321 ctaagcagtg gaccatggtc accccttgct gggtagagcc taggttgtcc ttgggccggc 1381 ttccttaggg gacagactgt tgggtggtga tggggattgt taggatggag cccaggccac 1441 atggatgatg atgattctcc cccacaggtt cgaacctctg acatgggtgg ctatgctact 1501 tgccatgact tcactgaggc tgtcattgct gccttgcccc acccataggc cctgtccata 1561 cccatgtaag gtgttcaata aagaacatga accaaaa // LOCUS HSU49356 6858 bp mRNA PRI 28-FEB-1996 DEFINITION Human DNA polymerase epsilon catalytic subunit mRNA, complete cds. ACCESSION U49356 NID g1206034 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6858) AUTHORS Asahara,H., Goldsmith,J.S., Lee,E. and Linn,S. TITLE Direct Submission JOURNAL Submitted (15-FEB-1996) Stuart M. Linn, Department of Molecular and Cell Biology, University of California, Berkeley, 401 Barker Hall, Berkeley, CA 94720-3202, USA FEATURES Location/Qualifiers source 1..6858 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="fibroblast CRL 1262 cDNA in lambda ZAP and HeLa S3 cDNA in lambda gt10" /cell_type="fetal skin fibroblast and cervical carcinoma cells" CDS 1..6858 /codon_start=1 /product="DNA polymerase epsilon catalytic subunit" /db_xref="PID:g1206035" /translation="MSLRSGGRRRADPGADGEASRDDGATSSVSALKRLERSQWTDKM DLRFGFERLKEPGEKTGWLINMHPTEILDEDKRLGSAVDYYFIQDDGSRFKVALPYKP YFYIATRKGCEREVSSFLSKKFQGKIAKVETVPKEDLDLPNHLVGLKRNYIRLSFHTV EDLVKVRKEISPAVKKNREQDHASDAYTALLSSVLQRGGVITDEEETSKKIADQLDNI VDMREYDVPYHIRLSIDLKIHVAHWYNVRYRGNAFPVEITRRDDLVERPDPVVLAFDI ETTKLPLKFPDAETDQIMMISYMIDGQGYLITNREIVSEDIEDFEFTPKPEYEGPFCV FNEPDEAHLIQRWFEHVQETKPTIMVTYNGDFFDWPFVEARAAVHGLSMQQEIGFQKD SQGEYKAPQCIHMDCLRWVKRDSYLPVGSHNLKAAASKLGYDPVELDPEDMCRMATEQ PQTLATYSVSDAVATYYLYMKYVHPFIFALCTIIPMEPDEVLRKGSGTLCEALLMVQA FHANIIFPNKQEQEFNKLTDDGHVLDSETYVGGHVEALESGVFRSDIPCRFRMNPAAF DFLLQRVEKTLRHALEEEEKVPVEQVTNFEEVCDEIKSKLASLKDVPSRIECPLIYHL DVGAMYPNIILTNRLQPSAMVDEATCAACDFNKPGANCQRKMAWQWRGEFMPASRSEY HRIQHQLESEKFPPLFPEGPARAFHELSREEQAKYEKRRLADYCRKAYKKIHITKVEE RLTTICQRENSFYVDTVRAFRDRRYEFKGLHKVWKKKLSAAVEVGDAAEVKRCKNMEV LYDSLQLAHKCILNSFYGYVMRKGARWYSMEMAGIVCFTGANIITQARELIEQIGRPL ELDTDGIWCVLPNSFPENFVFKTTNVKKPKVTISYPGAMLNIMVKEGFTNDQYQELAE PSSLTYVTRSENSIFFEVDGPYLAMILPASKEEGKKLKKRYAVFNEDGSLAELKGFEV KRRGELQLIKIFQSSVFEAFLKGSTLEEVYGSVAKVADYWLDVLYSKAANMPDSELFE LISENRSMSRKLEDYGEQKSTSISTAKRLAEFLGDQMVKDAGLSCRYIISRKPEGSPV TERAIPLAIFQAEPTVRKHFLRKWLKSSSLQDFDIRAILDWDYYIERLGSAIQKIITI PAALQQVKNPVPRVKHPDWLHKKLLEKNDVYKQKKISELFTLEGRRQVTMAEASEDSP RPSAPDMEDFGLVKLPHPAAPVTVKRKRVLWESQEESQDLTPTVPWQEILGQPPALGT SQEEWLVWLRFHKKKWQLQARQRLARRKRQRLESADGVLRPGAIRDGPATGLGSFLRR TARSILDLPWQIVQISETSQAGLFRLWALVGSDLHCIRLSIPRVFYVNQRVAKAEEGA SYRKVNRVLPRSNMVYNLYEYSVPEDMYQEHINEINAELSAPDIEGVYETQVPLLFRA LVHLGCVCVVNKQLVRHLSGWEAETFALEHLEMRSLAQFSYLEPGSIRHIYLYHHAQA HKALFGIFIPSQRRASVFVLDTVRSNQMPSLGALYSAEHGLLLEKVGPELLPPPKHTF EVRAETDLKTICRAIQRFLLAYKEERRGPTLIAVQSSWELKRLASEIPVLEEFPLVPI CVADKINYGVLDWQRHGARRMIRHYLNLDTCLSQAFEMSRYFHIPIGNLPEDISTFGS DLFFARHLQRHNHLLWLSPTARPDLGGKEADDNCLVMEFDDQATVEINSSGCYSTVCV ELDLQNLAVNTILQSHHVNDMEGADSMGISFDVIQQASLEDMITGGQAASAPASYDET ALCSNTFRILKSMVVGWVKEITQYHNIYADNQVMHFYRWLRSPSSLLHDPALHRTLHN MMKKLFLQLIAEFKRLGSSVIYANFNRIILCTKKRRVEDAIAYVEYITSSIHSKETFH SLTISFSRCWEFLLWMDPSNYGGIKGKVSSRIHCGLQDSQKAGGAEDEQENEDDEEER DGEEEEEAEESNVEDLLENNWNILQFLPQAASCQNYFLMIVSAYIVAVYHCMKDGLRR SAPGSTPVRRRGASQLSQEAEGAVGALPGMITFSQDYVANELTQSFFTITQKIQKKVT GSRNSTELSEMFPVLPGSHLLLNNPALEFIKYVCKVLSLDTNITNQVNKLNRDLLRLV DVGEFSEEAQFRDPCRSYVLPEVICRSCNFCRDLDLCKDSSFSEDGAVLPQWLCSNCQ APYDSSAIEMTLVEVLQKKLMAFTLQDLVCLKCRGVKETSMPVYCSCAGDFALTIHTQ VFMEQIGIFRNIAQHYGMSYLLETLEWLLQKNPQLGH" BASE COUNT 1591 a 1917 c 1928 g 1422 t ORIGIN 1 atgtctctga ggagcggcgg gcggcggcgc gcggacccag gcgcggatgg cgaggccagc 61 agggatgatg gcgccacttc ctcagtttcg gcactcaagc gcctggaacg gagtcagtgg 121 acggataaga tggatttgcg gtttggtttt gagcggctga aggagcctgg tgagaagaca 181 ggctggctca ttaacatgca tcctaccgag attttagatg aagataagcg cttaggcagt 241 gcagtggatt actactttat tcaagatgac ggaagcagat ttaaggtggc tttgccctat 301 aaaccgtatt tctacattgc gaccagaaag ggttgtgagc gagaagtttc atcttttctc 361 tccaagaagt ttcagggcaa aattgcaaaa gtggagactg tccccaaaga ggatctggac 421 ttgccaaatc acttggtggg tttgaagcga aattacatca ggctgtcctt ccacactgtg 481 gaggatcttg tcaaagtgag gaaggagatc tcccctgccg tgaagaagaa cagggagcag 541 gatcacgcca gcgacgcgta cacagctctg ctttccagtg ttctgcagag gggcggtgtc 601 attactgatg aagaggaaac ctctaagaag atagctgacc agttggacaa cattgtggac 661 atgcgcgagt acgatgttcc ctaccacatc cgcctctcca ttgacctgaa gatccacgtg 721 gctcattggt acaatgtcag ataccgagga aatgcttttc cggtagaaat cacccgccga 781 gatgaccttg ttgaacgacc tgaccctgtg gttttggcat ttgacattga gacgaccaaa 841 ctgcccctca agtttcctga tgctgagaca gaccagatta tgatgatttc ctacatgatc 901 gatggccagg gctacctcat caccaacagg gagattgttt cagaagatat tgaagatttt 961 gagttcaccc ccaagccaga atatgaaggc cccttttgtg tcttcaatga acccgatgag 1021 gctcatctga tccaaaggtg gtttgaacac gtccaggaga ccaaacccac catcatggtc 1081 acctacaacg gggacttttt tgactggcca tttgtggagg cccgggcagc agtccacggt 1141 ctgagcatgc agcaggagat aggcttccag aaggacagcc agggggagta caaggcgccc 1201 cagtgcatcc acatggactg cctcaggtgg gtgaagaggg acagttacct tcctgtgggc 1261 agtcataatc tcaaggcggc cgcaagcaag ctaggctatg atcccgtgga gctagacccg 1321 gaggacatgt gccggatggc cacggagcag ccccagactc tggccacgta ttctgtgtca 1381 gatgctgtcg ccacttacta cctgtacatg aagtacgtcc acccattcat ctttgctctg 1441 tgcaccatta ttcccatgga gcccgacgag gtgctgcgga agggctctgg cactctgtgt 1501 gaggccttgc tgatggtgca ggccttccac gccaacatca tcttccccaa caagcaagag 1561 caggagttca ataagctgac ggacgacgga cacgtgctgg actctgagac ctacgtcggg 1621 ggccacgtgg aggccctcga gtctggggtt ttccgcagcg atatcccttg ccggtttagg 1681 atgaatcctg ccgcctttga cttcctgctg cagcgggttg agaagacctt gcgccacgcc 1741 cttgaggaag aggagaaagt gcctgtggag caagtcacca actttgaaga ggtgtgtgat 1801 gagattaaga gcaagcttgc ctccctgaag gacgttccca gccgcatcga gtgtccactc 1861 atctaccacc tggacgtggg ggccatgtac cccaacatca tcctgaccaa ccgcctgcag 1921 ccctctgcca tggtggacga agccacctgt gctgcctgtg acttcaataa gcctggagca 1981 aactgccagc ggaagatggc ctggcagtgg aggggcgagt tcatgccagc cagtcgcagc 2041 gaataccatc ggatccagca ccagctggag tcagagaagt tccccccctt gttcccagag 2101 gggccagctc gggcctttca tgaactgtcc cgcgaggaac aggcgaaata cgagaagaga 2161 aggctggcgg attactgccg gaaagcctac aagaagatcc acatcaccaa ggtggaagag 2221 cgtctcacca ccatctgcca gcgggaaaac tccttctacg tggacaccgt gcgtgccttc 2281 cgggacaggc gttacgagtt caaagggctc cacaaggtgt ggaaaaagaa gctctcggcg 2341 gccgtggagg tgggcgacgc ggctgaggtg aagcgctgca agaacatgga ggtgctgtat 2401 gactcgctgc agctggccca caagtgcatc ctgaactcct tctatggcta tgtcatgcgc 2461 aagggggctc gctggtactc catggagatg gctggcatcg tctgcttcac aggggccaac 2521 atcatcaccc aggcacggga gctgatcgag cagattggga ggcccttaga gctggacaca 2581 gatggtatat ggtgcgtcct gcccaacagc ttcccagaaa attttgtctt caagacgacc 2641 aatgtgaaga agcccaaagt gaccatctcc tacccaggcg ccatgttgaa catcatggtc 2701 aaggaaggct tcaccaatga ccagtaccag gagctggctg agccgtcctc actcacctac 2761 gtcacccgct cagagaacag catctttttt gaggttgatg ggccctacct tgccatgatt 2821 cttccagcct ccaaggaaga aggcaagaaa ttgaagaaga ggtatgctgt gttcaatgaa 2881 gacggttctc tggctgagct caagggcttt gaggtcaaac gccgcgggga actgcagctg 2941 attaagatct tccaatcctc ggtgtttgag gccttcctca agggcagcac gctggaagag 3001 gtgtatggct ctgtagccaa ggtggctgac tactggctgg acgtgctgta cagcaaggca 3061 gccaacatgc ctgactctga gctattcgag ctcatctctg agaaccgttc catgtctcgg 3121 aaactggaag attacgggga gcagaagtct acatccatca gcacagcaaa gcgcctggcc 3181 gagttcctgg gagaccagat ggtcaaggat gcagggctga gttgccgcta catcatctcc 3241 cgcaagcccg agggctcccc tgtcacggag agggccatcc cacttgccat tttccaagca 3301 gagcccacgg tgaggaagca ctttctccgg aaatggctca agagctcttc ccttcaagac 3361 tttgatattc gagcaattct ggattgggac tactacattg agcggctggg aagcgccatc 3421 cagaagatca tcaccatccc tgcggccctg cagcaggtaa agaacccagt gccacgtgtc 3481 aaacaccccg actggctgca caaaaaactg ctggagaaga atgatgtcta caagcagaag 3541 aagatcagtg agctcttcac cctggagggc aggagacagg tcacgatggc cgaggcctca 3601 gaagacagtc cgaggccaag tgctcctgac atggaggact tcggcctcgt aaagctgcct 3661 cacccagcag cccctgtcac tgtgaagagg aagcgagttc tttgggagag ccaggaggag 3721 tcccaggacc tcacgccgac tgtgccctgg caggaaatct tggggcagcc tcccgccctg 3781 ggaaccagcc aggaggaatg gcttgtctgg ctccggttcc acaagaagaa gtggcagctg 3841 caggcccggc agcgcctcgc ccgcaggaag aggcagcgtc tggagtcggc agatggtgtg 3901 ctcaggcccg gggccatccg ggatggtcct gccacggggc tggggagctt cttgcgaaga 3961 actgcccgca gcatcctgga ccttccgtgg cagattgtgc agatcagcga gaccagccag 4021 gccggcctgt tcaggctgtg ggcgctcgtt ggcagtgact tgcactgcat caggctgagc 4081 atcccccgtg tgttctacgt gaaccagcga gtcgctaaag cggaggaggg tgcttcgtat 4141 cgcaaggtaa atcgggtcct tcctcgctcc aacatggtct acaatctcta tgagtattca 4201 gtgccagagg acatgtacca ggaacacatc aacgagatca acgctgagct gtcagcgcca 4261 gacatcgagg gcgtatatga gactcaggtt ccgttactgt tccgggccct ggtgcacctg 4321 ggctgtgtgt gtgtggtcaa taaacagctg gtgaggcacc tttcaggctg ggaagcagag 4381 acctttgctc ttgagcacct ggagatgcgc tctctggccc agttcagcta cctggaacca 4441 gggagtatcc gccatatcta cctgtaccac cacgcacagg cccacaaagc gctcttcggg 4501 atcttcatcc cctcacagcg cagggcatcc gtctttgtgc tggacactgt gcgcagcaac 4561 cagatgccca gccttggcgc cctgtactca gcagagcacg gcctcctcct ggagaaggtg 4621 ggccctgagc tcctgccacc ccccaaacac accttcgaag ttcgggcaga aactgacctg 4681 aagaccatct gcagagccat ccagcgattc ctgctcgcct acaaggagga gcgccggggg 4741 cccacactca tcgctgttca gtccagctgg gagctgaaga ggctggccag tgaaattcct 4801 gtcttggagg aattcccact ggtgcctatc tgtgtggctg acaagatcaa ctatggggtc 4861 ctggactggc agcgccatgg agcccggcgc atgatccgtc actacctcaa cctggacacc 4921 tgcctgtcgc aggccttcga gatgagcagg tactttcaca ttcccattgg gaacctacca 4981 gaggacatct ccacattcgg ctccgacctc ttctttgccc gccacctcca gcgtcacaac 5041 cacctgctct ggctgtcccc tacagcccgc cctgacctgg gtggaaagga ggctgatgac 5101 aactgtcttg tcatggagtt cgatgaccaa gccactgttg agatcaacag ttcaggctgt 5161 tactccacag tgtgtgtgga gctggacctt cagaacctgg ccgtcaacac cattctccag 5221 tctcaccatg tcaacgacat ggagggggcc gacagcatgg ggatcagctt cgacgtgatc 5281 cagcaggcct ccctggagga catgatcacg ggtggtcagg ctgccagtgc cccggccagc 5341 tacgatgaga cagccctgtg ctctaacacc ttcaggatcc tgaagagcat ggtcgtgggc 5401 tgggtgaagg agatcaccca gtaccacaac atctatgcag acaaccaggt gatgcacttc 5461 taccgctggc ttcggtcgcc atcctctctg cttcatgacc ctgccctgca ccgcacactc 5521 cacaacatga tgaagaagct cttcctgcag ctcatcgctg agttcaagcg cctggggtca 5581 tcagtcatct acgccaactt caaccgcatc atcctctgta caaagaagcg ccgtgtggaa 5641 gatgccatcg cttacgtgga gtacatcacc agcagcatcc attcaaagga gaccttccat 5701 tctctgacaa tttctttctc tcgatgctgg gaatttcttc tctggatgga tccatctaac 5761 tatggcggaa tcaaaggaaa agtttcatct cgtattcact gtggactgca agactcccag 5821 aaagcagggg gagcagagga tgagcaggaa aatgaggacg atgaggagga aagagatggg 5881 gaggaggagg aagaggcgga ggaatccaac gtggaggatt tactggaaaa caactggaac 5941 attttgcagt ttttgccaca ggcagcctcc tgccagaact acttcctcat gattgtttca 6001 gcgtacatcg tggccgtgta ccactgcatg aaggacgggc tgaggcgcag tgctccaggg 6061 agcacccccg tgaggaggag gggggccagc cagctctccc aggaggccga gggggcggtc 6121 ggagcccttc ccggaatgat caccttctct caggattatg tcgcaaatga gctcactcag 6181 agcttcttca ccatcactca gaagattcag aagaaagtca caggctctcg gaactccact 6241 gagctctcgg agatgtttcc tgtcctcccc ggttcccact tgctgctcaa taaccctgcc 6301 ctggagttca tcaaatacgt gtgcaaggtg ctgtccctgg acaccaacat cacaaaccag 6361 gtgaataagc tgaaccgaga cctgcttcgc ctggtggatg tcggcgagtt ctccgaggag 6421 gcccagttcc gagacccctg ccgctcctac gtgcttcctg aggtcatctg ccgcagctgt 6481 aacttctgcc gcgacctgga cctgtgtaaa gactcttcct tctcagagga tggggcggtc 6541 ctgcctcagt ggctctgctc caactgtcag gcgccctacg actcctctgc catcgagatg 6601 acgctggtgg aagttctaca gaagaagctg atggccttca ccctgcagga cctggtctgc 6661 ctgaagtgcc gcggggtgaa ggagaccagc atgcctgtgt actgcagctg cgcgggagac 6721 ttcgccctca ccatccacac ccaggtcttc atggaacaga tcggaatatt ccggaacatt 6781 gcccagcact acggcatgtc gtacctcctg gagaccctgg agtggctgct gcagaagaac 6841 ccacagctgg gccattag // LOCUS HSU49379 2562 bp mRNA PRI 30-MAY-1996 DEFINITION Human diacylglycerol kinase epsilon DGK mRNA, complete cds. ACCESSION U49379 NID g1289444 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2562) AUTHORS Tang,W., Bunting,M., Zimmerman,G.A., McIntyre,T.M. and Prescott,S.M. TITLE Molecular cloning of a novel human diacylglycerol kinase highly selective for arachidonate-containing substrates JOURNAL J. Biol. Chem. 271 (17), 10237-10241 (1996) MEDLINE 96215320 REFERENCE 2 (bases 1 to 2562) AUTHORS Prescott,S.M. TITLE Direct Submission JOURNAL Submitted (16-FEB-1996) Stephen M. Prescott, Program in Human Molecular Biology and Genetics, University of Utah, Salt Lake City, UT 84112, USA FEATURES Location/Qualifiers source 1..2562 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="endothelial" /tissue_type="umbilical vein" CDS 88..1791 /EC_number="2.7.1.107" /note="lipid kinase" /codon_start=1 /product="diacylglycerol kinase epsilon DGK" /db_xref="PID:g1289445" /translation="MEAERRPAPGSPSEGLFADGHLILWTLCSVLLPVFITFWCSLQR SRRQLHRRDIFRKSKHGWRDTDLFSQPTYCCVCAQHILQGAFCDCCGLRVDEGCLRKA DKRFQCKEIMLKNDTKVLDAMPHHWIRGNVPLCSYCMVCKQQCGCQPKLCDYRCIWCQ KTVHDECMKNSLKNEKCDFGEFKNLIIPPSYLTSINQMRKDKKTDYEVLASKLGKQWT PLIILANSRSGTNMGEGLLGEFRILLNPVQVFDVTKTPPIKALQLCTLLPYYSARVLV CGGDGTVGWVLDAVDDMKIKGQEKYIPQVAVLPLGTGNDLSNTLGWGTGYAGEIPVAQ VLRNVMEADGIKLDRWKVQVTNKGYYNLRKPKEFTMNNYFSVGPDALMALNFHAHREK APSLFSSRILNKAVYLFYGTKDCLVQECKDLNKKVELELDGERVALPSLEGIIVLNIG YWGGGCRLWEGMGDETYPLARHDDGLLEVVGVYGSFHCAQIQVKLANPFRIGQAHTVR LILKCSMMPMQVDGEPWAQGPCTVTITHKTHAMMLYFSGEQTDDDISSTSDQEDIKAT E" BASE COUNT 737 a 525 c 621 g 679 t ORIGIN 1 gcgtcgttct cctcctgcgc gaggcggcca aggcctgctg gtccggagcc gcgcctccac 61 ccgcgcgagg tatcgtcctt ggagaagatg gaagcggaga ggcggccggc gccgggctcg 121 ccctccgagg gcctgtttgc ggacgggcac ctgatcttgt ggacgctgtg ctcggtcctg 181 ctgccggtgt tcatcacctt ctggtgtagc ctccagcggt cgcgccggca gctgcaccgc 241 agggacatct tccgcaagag caagcacggg tggcgcgaca cggacctgtt cagccagccc 301 acctactgct gcgtgtgcgc gcagcacatt ctgcagggcg ccttctgcga ctgctgcggg 361 ctccgcgtgg acgagggctg cctcaggaag gccgacaagc gcttccagtg caaggagatt 421 atgctcaaga atgacaccaa ggtcctggac gccatgcccc accactggat ccggggcaac 481 gtgcccctgt gcagttactg tatggtttgc aagcagcagt gtggctgtca acccaagctt 541 tgcgattaca ggtgcatttg gtgccagaaa acagtacatg atgagtgcat gaaaaatagt 601 ttaaagaatg aaaaatgtga ttttggagaa ttcaaaaacc taatcattcc accaagttat 661 ttaacatcca ttaatcagat gcgtaaagac aaaaaaacag attatgaagt gctagcctct 721 aagcttggaa agcagtggac cccattaata atcctggcca actctcgtag tggaactaat 781 atgggagaag gactgttggg agaatttagg atcttgttga atccagtcca ggtttttgat 841 gtaactaaaa ctcctcctat caaagcccta caactctgta ctcttctccc atattattca 901 gctcgagtac ttgtttgtgg aggggatggg actgtagggt gggtcctgga tgcagttgat 961 gacatgaaga ttaagggaca agaaaagtac attccacaag ttgcagtttt gcctctggga 1021 acaggcaacg atctatccaa tacattgggt tggggtacag gttatgctgg agaaattcca 1081 gttgcgcagg ttttgcgaaa tgtaatggaa gcagatggaa ttaaactaga tcgatggaaa 1141 gttcaagtaa caaataaagg atactacaac ttaagaaaac ccaaggaatt cacaatgaac 1201 aactattttt ctgttggacc tgatgctctc atggctctca attttcatgc tcatcgtgag 1261 aaggcaccat ctctgttttc tagcagaatt cttaataagg cggtttactt attctatgga 1321 accaaagatt gtttagtgca agaatgtaaa gatttgaata aaaaagttga gctagaactg 1381 gatggtgagc gagtagcact gcccagcttg gaaggtatta tagttctgaa catcggatac 1441 tggggcggtg gctgcagact atgggaaggg atgggggacg agacttaccc tctagccagg 1501 catgacgatg gtctgctgga agtcgttgga gtatatgggt ctttccactg tgctcagatt 1561 caagtaaaac tggctaatcc ttttcgaata ggacaggcac atacagtgag gctgattttg 1621 aagtgctcca tgatgccaat gcaggtggat ggggagcctt gggcccaagg gccctgcact 1681 gtcaccataa ctcacaagac acatgcaatg atgttatatt tctctggaga acaaacagat 1741 gatgacatct ctagtacttc ggatcaagaa gatataaagg cgactgaata gatggatgag 1801 ggagtgaaaa ctttgcatag aatcctcacg caagtagata catgttcatc caaaagtatt 1861 aatagaaatt ctctatcagc tattcagtct taatttcact agtagtataa tgggtataca 1921 tttttgtaaa tagcatcccc aaaccagcca gccttcagtt atttacaaat gtttgtcctt 1981 ttttcagcaa aatacttcaa atgaatagta ttaacttaca aaaagtcacg aaaaacttac 2041 atgagagtga aaatttgtta tgactgtttt gagagtggga ctcactctga agtatgtgct 2101 gtctcatgtc ttatttttga accatgcata tgatggacac acaatggatg gacacattat 2161 atctccaaca aggtgtgggt ggaaagatca aattaacctg cttttttgaa aggaaatgat 2221 tactgtcaaa ccagcatggt taattgtgag catcctctgc agcatgcccc ttaagatttt 2281 ctacaaccca aaccaagtgt atgtattgat ttctaggaac ccccaaaagg agaatagtaa 2341 aaaaagatca tacttaaaat ttgtattaca atttttattt taggaactta ttcagacacg 2401 taaatgttgt ttaattctgt aggtaaccat ttgagctgca attcaggatc ttttttataa 2461 caccagtgta gccaaaagag aaacagataa gtgaattggt aagaaataag attcagagca 2521 cttgggattg taagttatag gttctgagct gaactgttta tc // LOCUS HSU49392 639 bp mRNA PRI 17-MAR-1996 DEFINITION Human allograft inflammatory factor-1 (AIF-1) mRNA, complete cds. ACCESSION U49392 NID g1229021 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 639) AUTHORS Utans,U., Arceci,R.J., Yamashita,Y. and Russell,M.E. TITLE Cloning and characterization of allograft inflammatory factor-1: a novel macrophage factor identified in rat cardiac allografts with chronic rejection JOURNAL J. Clin. Invest. 95 (6), 2954-2962 (1995) MEDLINE 95286865 REFERENCE 2 (bases 1 to 639) AUTHORS Autieri,M.V. and Belkowski,S.M. TITLE cDNA cloning of human AIF-1: tissue distribution and expression in activated smooth muscle cells JOURNAL Unpublished REFERENCE 3 (bases 1 to 639) AUTHORS Autieri,M.V. TITLE Direct Submission JOURNAL Submitted (19-FEB-1996) Michael V. Autieri, Molecular Biology, Deborah Research Institute, 20 Pine Mill Road, Browns Mills, NJ 08015, USA COMMENT Autieri,M.V. Lab. Invest. 72, 656-661, 1995. FEATURES Location/Qualifiers source 1..639 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="peripherial blood lymphocyte" gene 86..517 /gene="AIF-1" CDS 86..517 /gene="AIF-1" /codon_start=1 /product="allograft inflammatory factor-1" /db_xref="PID:g1229022" /translation="MSQTRDLQGGKAFGLLKAQQEERLDEINKQFLHDPKYSSDEDLP SKLEGFKEKYMEFDLNGNGDIDIMSLKRMLEKLGVPKTHLELKKLIGEVSSGSGETFS YPDFLRMMLGKRSAILKMILMYEEKARERKTNTPPSQESPI" BASE COUNT 217 a 134 c 166 g 122 t ORIGIN 1 ggacggaggg cacgagagaa ggagacgctg cagaaagagg cctccagctt ggtctgtctc 61 ccacctctac cagatctgct gagctatgag ccaaaccagg gatttacagg gaggaaaagc 121 tttcggactg ctgaaggccc agcaggaaga gaggctggat gagatcaaca agcaattcct 181 acacgatccc aaatatagca gtgatgagga tctgccctcc aaactggaag gcttcaaaga 241 gaaatacatg gagtttgacc ttaatggaaa tggcgatatt gatatcatgt ccttgaaacg 301 aatgctggag aaacttggag tccccaagac tcacctagag ctaaagaaat taattggaga 361 ggtgtccagt ggctccgggg agacgttcag ctaccctgac tttctcagga tgatgctggg 421 caagagatct gccatcctaa aaatgatcct gatgtatgag gaaaaagcga gagaaaggaa 481 aaccaacacg ccccccagcc aagaaagccc tatctgagat gcctgatttg agggaaaagg 541 gatgatggga ttgaaggggt tctaataccc agatatggaa acagaagaca aaatcgtaag 601 ccagagtcaa caaattaaat aaatttaccc caaaaaaaa // LOCUS HSU49395 1986 bp mRNA PRI 21-SEP-1996 DEFINITION Human ionotropic ATP receptor P2X5a mRNA, complete cds. ACCESSION U49395 NID g1552521 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1986) AUTHORS Tokuyama,Y., Mereu,L., Chen,X., Rouard,M. and Bell,G.I. TITLE Cloning of human P2X purinoceptor new subtype(P2X5a) JOURNAL Unpublished REFERENCE 2 (bases 1 to 1986) AUTHORS Tokuyama,Y., Mereu,L., Chen,X., Rouard,M. and Bell,G.I. TITLE Direct Submission JOURNAL Submitted (19-FEB-1996) Graeme I Bell, Biochemistry, Molecular Biology, Howard Hughes Medical Institute, 5841 S. Maryland Ave. MC 1028, Chicago, IL 60637, USA FEATURES Location/Qualifiers source 1..1986 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 53..1318 /note="ionotropic ATP receptor" /codon_start=1 /product="P2X5a" /db_xref="PID:g1552522" /translation="MGQAGCKGLCLSLFDYKTEKYVIAKNKKVGLLYRLLQASILAYL VVWVFLIKKGYQDVDTSLQSAVITKVKGVAFTNTSDLGQRIWDVADYVIPAQEKNVFF VVTNLIVTPNQRQNVCAENEGIPDGACSKDSDCHAGEAVTAGNGVKTGRCLRRENLAR GTCEIFAWCPLETSSRPEEPFLKEAEDFTIFIKNHIRFPKFNFSNNVMDVKDRSFLKS CHFGPKNHYCPIFRLGSVIRWAGSDFQDIALEGGVIGINIEWNCDLDKAASECHPHYS FSRLDNKLSKSVSSGYNFRFARYYRDAAGVEFRTLMKAYGIRFDVMVNGKGASFCDLV LIYLIKKREFYRDKKYQEVRGLEDSSQEAEDEASGLGLSEQLTSGPGLLGMPEQQELQ EPPEANVGSSSQKGNGSVCPQLLEPHRST" BASE COUNT 477 a 549 c 537 g 423 t ORIGIN 1 catgattccc aagcttggca cgagggtccg caagcccggc tgagagcgcg ccatggggca 61 ggcgggctgc aaggggctct gcctgtcgct gttcgactac aagaccgaga agtatgtcat 121 cgccaagaac aagaaggtgg gcctgctgta ccggctgctg caggcctcca tcctggcgta 181 cctggtcgta tgggtgttcc tgataaagaa gggttaccaa gacgtcgaca cctccctgca 241 gagtgctgtc atcaccaaag tcaagggcgt ggccttcacc aacacctcgg atcttgggca 301 gcggatctgg gatgtcgccg actacgtcat tccagcccag gaaaagaacg tcttttttgt 361 ggtcaccaac ctgattgtga cccccaacca gcggcagaac gtctgtgctg agaatgaagg 421 cattcctgat ggcgcgtgct ccaaggacag cgactgccac gctggggaag cggttacagc 481 tggaaacgga gtgaagaccg gccgctgcct gcggagagag aacttggcca ggggcacctg 541 tgagatcttt gcctggtgcc cgttggagac aagctccagg ccggaggagc cattcctgaa 601 ggaggccgaa gacttcacca ttttcataaa gaaccacatc cgtttcccca aattcaactt 661 ctccaacaat gtgatggacg tcaaggacag atctttcctg aaatcatgcc actttggccc 721 caagaaccac tactgcccca tcttccgact gggctccgtg atccgctggg ccgggagcga 781 cttccaggat atagccctgg agggtggcgt gataggaatt aatattgaat ggaactgtga 841 tcttgataaa gctgcctctg agtgccaccc tcactattct tttagccgtc tggacaataa 901 actttcaaag tctgtctcct ccgggtacaa cttcagattt gccagatatt accgagacgc 961 agccggggtg gagttccgca ccctgatgaa agcctacggg atccgctttg acgtgatggt 1021 gaacggcaag ggtgcctcct tctgcgacct ggtactcatc tacctcatca aaaagagaga 1081 gttttaccgt gacaagaagt accaggaagt gaggggccta gaagacagtt cccaggaggc 1141 cgaggacgag gcatcggggc tggggctatc tgagcagctc acatctgggc cagggctgct 1201 ggggatgccg gagcagcagg agctgcagga gccacccgag gcgaacgttg gaagcagcag 1261 tcagaagggg aacggatctg tgtgcccaca gctcctggag ccccacagga gcacgtgaat 1321 tgcctctgct tacgttcagg ccctgtccta aacccagccg tctagcaccc agtgatccca 1381 tgcctttggg aatcccagga tgctgcccaa cgggaaattt gtacattggg tgctatgaat 1441 gccacatcac agggaccagc catcacagag caaagtgacc tccacgtctg atgctggggt 1501 catcaggacg gacccatcat ggctatcttt ttgccccacc ccctgccgtc agttcttcct 1561 ttctccgtgg ctggcttccc gcactaggga acgggttgta aatggggaac atgacttcct 1621 tccggagtcc ttgagcacct cagctaagga ccgcagtgcc ctgtagagtt cctagattac 1681 ctcactggga atagcattgt gcgtgtccgg aaaagggctc catttggttc cagcccactc 1741 ccctctgcaa gtgccgcagc ttccctcagg catactctcc agtggatcca agtactctct 1801 ctcctaaaga caccaccttc ctgccagctg tttgccctta ggccagtaca cagaattgaa 1861 agtgggggag gtggcagacg ctttctggga cctgcccaag atatgtattc tctgacactc 1921 ttatttggtc ataaaacaat aaatggtgtc aatttcaaaa aaaaaaaaaa aaaaaaaaaa 1981 aaaaaa // LOCUS HSU49436 1826 bp mRNA PRI 12-JUL-1996 DEFINITION Human translation initiation factor 5 (eIF5) mRNA, complete cds. ACCESSION U49436 NID g1229139 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1826) AUTHORS Si,K., Das,K. and Maitra,U. TITLE Characterization of multiple mRNAs that encode mammalian translation initiation factor 5 (eIF-5) JOURNAL J. Biol. Chem. 271 (28), 16934-16938 (1996) MEDLINE 96279275 REFERENCE 2 (bases 1 to 1826) AUTHORS Kausik,S. and Kallol,D. TITLE Direct Submission JOURNAL Submitted (20-FEB-1996) Si Kausik, DMB, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY 10461, USA FEATURES Location/Qualifiers source 1..1826 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Hela cell" gene 320..1615 /gene="eIF5" CDS 320..1615 /gene="eIF5" /codon_start=1 /product="translation initiation factor 5" /db_xref="PID:g1229140" /translation="MSVNVNRSVSDQFYRYKMPRLIAKVEGKGNGIKTVIVNMVDVAK ALNRPPTYPTKYFGCELGAQTQFDVKNDRYIVNGSHEANKLQDMLDGFIKKFVLCPEC ENPETDLHVNPKKQTIGNSCKACGYRGMLDTHHKLCTFILKNPPENSDSGTGKKEKEK KNRKGKDKENGSVSSSETPPPPPPPNEINPPPHTMEEEEDDDSGEDTTEEAQRRRMDE ISDHAKVLTLSDDLERTIEERVNILFDFVKKKKEEGVIDSSDKEIVAEAERLDVKAMG PLVLTEVLFNEKIREQIKKYRRHFLRFCHNNKKAQRYLLHGLECVVAMHQAQLISKIP HILKEMYDADLLEEEVIISWSEKASKKYVSKELAKEIRVKAEPFIKWLKEAEEESSGG EEEDEDENIEVVYSKAASVPKVETVKSDNKDDDIDIDAI" BASE COUNT 624 a 348 c 399 g 455 t ORIGIN 1 tacctgaccc cagccatttc ccttctagaa attgttctac agacatatgt aataacatat 61 acaaaaggtt attctttcag cagtgtttgt cagagcgaga acattccaga gagctgttgc 121 gcagccattg gtacctgtat tggggaaaca tagcatacaa tcaagaagct tacagcctca 181 gtggcgaaaa ttttttcatg tcagagaccg agaactcttg cagtcgttta tgtcatccct 241 tcttctccag acagaagata ccaaaaagtt gcaatcaaag atctcttcat cttattgata 301 aagccactaa taagccaaaa tgtctgtcaa tgtcaaccgc agcgtgtcag accagttcta 361 tcgctacaag atgccccgtc tgattgccaa ggttgagggc aaaggcaatg gaatcaagac 421 agttatagtc aacatggttg acgttgcaaa ggcgcttaat cggcctccaa cgtatcccac 481 caaatatttt ggttgtgagc tgggagcaca gacccagttt gatgttaaga atgaccgtta 541 cattgtcaat ggatctcatg aggcgaataa gctgcaagac atgttggatg gattcattaa 601 aaaatttgtt ctctgtcctg aatgtgagaa tcctgaaaca gatttgcatg tcaatccaaa 661 gaagcaaaca ataggtaatt cttgtaaagc ctgtggctat cgaggcatgc ttgacacaca 721 tcataaactc tgcacattca ttctcaaaaa cccacctgag aatagtgaca gtggtacagg 781 aaagaaagaa aaagaaaaga aaaacagaaa gggcaaagac aaggaaaatg gctccgtatc 841 cagcagtgag acaccaccac caccaccacc accaaatgaa attaatcctc ctccacatac 901 aatggaagaa gaggaggatg atgactcggg agaagataca actgaggaag ctcaaaggcg 961 tcgaatggat gaaatcagtg accatgcaaa agttctgaca ctcagtgatg atttggaaag 1021 aacaattgag gagagggtca atatcctctt tgattttgtt aagaaaaaga aagaagaggg 1081 tgttattgat tcatctgaca aagaaatcgt tgctgaagca gaaagactgg atgtaaaagc 1141 catgggccct cttgttctaa ctgaagttct ttttaatgag aagattagag aacagattaa 1201 gaaatacagg cgccatttcc tacgattttg tcacaacaac aaaaaagccc aacggtacct 1261 tcttcatggt ttggagtgtg tggtagcaat gcatcaagct cagcttatct ccaagattcc 1321 acatatcttg aaggagatgt acgatgcaga ccttttagaa gaagaggtca tcatcagctg 1381 gtcggaaaag gcctctaaga aatatgtctc caaagaactt gccaaagaga ttcgtgtcaa 1441 agcagaacca tttataaaat ggttgaagga ggcagaggaa gaatcttctg gtggcgaaga 1501 agaagatgaa gatgagaaca ttgaggtggt gtattcgaag gctgccagtg taccgaaagt 1561 tgagactgta aagtcagaca acaaggatga cgacatcgat attgatgcca tttaaaggga 1621 tggatgcaac ctagcttaac agtataatgc tgcaaatttt cctccattat cagccagaag 1681 tgcaacatgt atgtgcaaaa gctaaaatgg cttaacatca tgctacactt tacactaaaa 1741 atctattact gtgagtgtga aaaactagtg gtggacacat ttggatcaca tttatacagt 1801 tataaaaata aaggtttgat tttggt // LOCUS HSU49835 1418 bp mRNA PRI 25-JUL-1996 DEFINITION Human YKL-39 precursor mRNA, complete cds. ACCESSION U49835 NID g1457940 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1418) AUTHORS Hu,B., Trinh,K., Figueira,W.F. and Price,P.A. TITLE Isolation and sequence of a novel human chondrocyte protein related to mammalian members of the chitinase protein family JOURNAL J. Biol. Chem. 271 (32), 19415-19420 (1996) MEDLINE 96325055 REFERENCE 2 (bases 1 to 1418) AUTHORS Price,P.A. TITLE Direct Submission JOURNAL Submitted (23-FEB-1996) Paul A. Price, Biology, 0322, University of California at San Diego, 9500 Gilman Drive, La Jolla, CA 92093-0322, USA FEATURES Location/Qualifiers source 1..1418 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="articular cartilage" CDS 36..1193 /note="similar to human glycoprotein encoded by Genbank Accession Number M80927, human chitotriosidase precursor encoded by Genbank Accession Number U29615, human oviductal glycoprotein encoded by Genbank Accession Number U09550, and to mouse secretory protein YM-1 precursor encoded by Genbank Accession Number S27879" /codon_start=1 /product="YKL-39 precursor" /db_xref="PID:g1457941" /translation="MDQKSLWAGVVVLLLLQGGSAYKLVCYFTNWSQDRQEPGKFTPE NIDPFLCSHLIYSFASIENNKVIIKDKSEVMLYQTINSLKTKNPKLKILLSIGGYLFG SKGFHPMVDSSTSRLEFINSIILFLRNHNFDGLDVSWIYPDQKENTHFTVLIHELAEA FQKDFTKSTKERLLLTAGVSAGRQMIDNSYQVEKLAKDLDFINLLSFDFHGSWEKPLI TGHNSPLSKGWQDRGPSSYYNVEYAVGYWIHKGMPSEKVVMGIPTYGHSFTLASAETT VGAPASGPGAAGPITESSGFLAYYEICQFLKGAKITRLQDQQVPYAVKGNQWVGYDDV KSMETKVQFLKNLNLGGAMIWSIDMDDFTGKSCNQGPYPLVQAVKRSLGSL" sig_peptide 36..98 mat_peptide 99..1190 /product="YKL-39" polyA_site 1418 /note="18 A nucleotides" BASE COUNT 370 a 357 c 337 g 354 t ORIGIN 1 agaagaagct ggccaaggat atgggagcaa ccaccatgga ccagaagtct ctctgggcag 61 gtgtagtggt cttgctgctt ctccagggag gatctgccta caaactggtt tgctacttta 121 ccaactggtc ccaggaccgg caggaaccag gaaaattcac ccctgagaat attgacccct 181 tcctatgctc tcatctcatc tattcattcg ccagcatcga aaacaacaag gttatcatca 241 aggacaagag tgaagtgatg ctctaccaga ccatcaacag tctcaaaacc aagaatccca 301 aactgaaaat tctcttgtcc attggagggt acctgtttgg ttccaaaggg ttccacccta 361 tggtggattc ttctacatca cgcttggaat tcattaactc cataatcctg tttctgagga 421 accataactt tgatggactg gatgtaagct ggatctaccc agatcagaaa gaaaacactc 481 atttcactgt gctgattcat gagttagcag aagcctttca gaaggacttc acaaaatcca 541 ccaaggaaag gcttctcttg actgcgggcg tatctgcagg gaggcaaatg attgataaca 601 gctatcaagt tgagaaactg gcaaaagatc tggatttcat caacctcctg tcctttgact 661 tccatgggtc ttgggaaaag ccccttatca ctggccacaa cagccctctg agcaaggggt 721 ggcaggacag agggccaagc tcctactaca atgtggaata tgctgtgggg tactggatac 781 ataagggaat gccatcagag aaggtggtca tgggcatccc cacatatggg cactccttca 841 cactggcctc tgcagaaacc accgtggggg cccctgcctc tggccctgga gctgctggac 901 ccatcacaga gtcttcaggc ttcctggcct attatgagat ctgccagttc ctgaaaggag 961 ccaagatcac gcgcctccag gatcagcagg ttccctacgc agtcaagggg aaccagtggg 1021 tgggctatga tgatgtgaag agtatggaga ccaaggttca gttcttaaag aatttaaacc 1081 tgggaggagc catgatctgg tctattgaca tggatgactt cactggcaaa tcctgcaacc 1141 agggccctta ccctcttgtc caagcagtca agagaagcct tggctccttg tgaaggatta 1201 acttacagag aagcaggcaa gatgaccttg ctgcctgggg cctgctctct cccaggaatt 1261 ctcatgtggg attccccttg ccaggctggc ctttggatct ctcttccaag cctttcctga 1321 cttcctctta gatcatagat tggacctggt tttgttttcc tgcagctgtt gacttgttgc 1381 cctgaagtac aataaaaaaa attcattttg ctccagta // LOCUS HSU49844 8210 bp mRNA PRI 23-MAR-1996 DEFINITION Human FRAP-related protein (FRP1) mRNA, complete cds. ACCESSION U49844 NID g1235901 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8210) AUTHORS Cimprich,K.A., Shin,T.B., Keith,C.T. and Schreiber,S.L. TITLE cDNA cloning and gene mapping of a candidate human cell cycle checkpoint protein JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (7), 2850-2855 (1996) MEDLINE 96181495 REFERENCE 2 (bases 1 to 8210) AUTHORS Cimprich,K.A., Shin,T.B., Keith,C.T. and Schreiber,S.L. TITLE Direct Submission JOURNAL Submitted (22-FEB-1996) Karlene A. Cimprich, Chemistry, Harvard University, 12 Oxford Street, Cambridge, MA 02138, USA FEATURES Location/Qualifiers source 1..8210 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="Jurkat T-cell" /chromosome="3" /map="3q22-q24" gene 106..8040 /gene="FRP1" CDS 106..8040 /gene="FRP1" /note="similar to FRAP, Mec1p, Tor1p, Tor2p, and ATM" /codon_start=1 /product="FRAP-related protein" /db_xref="PID:g1235902" /translation="MGEHGLELASMIPALRELGSATPEEYNTVVQKPRQILCQFIDRI LTDVNVVAVELVKKTDSQPTSVMLLDFIQHIMKSSPLMFVNVSGSHEAKGSCIEFSNW IITRLLRIAATPSCHLLHKKICEVICSLLFLFKSKSPAIFGVLTKELLQLFEDLVYLH RRNVMGHAVEWPVVMSRFLSQLDEHMGYLQSAPLQLMSMQNLEFIEVTLLMVLTRIIA IVFFRRQELLLWQIGCVLLEYGSPKIKSLAISFLTELFQLGGLPAQPASTFFSSFLEL LKHLVEMDTDQLKLYEEPLSKLIKTLFPFEAEAYRNIEPVYLNMLLEKLCVMFEDGVL MRLKSDLLKAALCHLLQYFLKFVPAGYESALQVRKVYVRNICKALLDVLGIEVDAEYL LGPLYAALKMESMEIIEEIQCQTQQENLSSNSDGISPKRRRLSSSLNPSKRAPKQTEE IKHVDMNQKSILWSALKQKAESLQISLEYSGLKNPVIEMLEGIAVVLQLTALCTVHCS HQNMNCRTFKDCQHKSKKKPSVVITWMSLDFYTKVLKSCRSLLESVQKLDLEATIDKV VKIYDALIYMQVNSSFEDHILEDLCGMLSLPWIYSHSDDGCLKLTTFAANLLTLSCRI SDSYSPQAQSRCVFLLTLFPRRIFLEWRTAVYNWALQSSHEVIRASCVSGFFILLQQQ NSCNRVPKILIDKVKDDSDIVKKEFASILGQLVCTLHGMFYLTSSLTEPFSEHGHVDL FCRNLKATSQHECSSSQLKASVCKPFLFLLKKKIPSPVKLAFIDNLHHLCKHLDFRED ETDVKAVLGTLLNLMEDPDKDVRVAFSGNIKHILESLDSEDGFIKELFVLRMKEAYTH AQISRNNELKDTLILTTGDIGRAAKGDLVPFALLHLLHCLLSKSASVSGAAYTEIRAL VAAKSVKLQSFFSQYKKPICQFLVESLHSSQMTALPNTPCQNADVRKQDVAHQREMAL NTLSEIANVFDFPDLNRFLTRTLQVLLPDLAAKASPAASALIRTLGKQLNVNRREILI NNFKYIFSHLVCSCSKDELERALHYLKNETEIELGSLLRQDFQGLHNELLLRIGEHYQ QVFNGLSILASFASSDDPYQGPRDIISPELMADYLQPKLLGILAFFNMQLLSSSVGIE DKKMALNSLMSLMKLMGPKHVSSVRVKMMTTLRTGLRFKDDFPELCCRAWDCFVRCLD HACLGSLLSHVIVALLPLIHIQPKETAAIFHYLIIENRDAVQDFLHEIYFLPDHPELK KIKAVLQEYRKETSESTDLQTTLQLSMKAIQHENVDVRIHALTSLKETLYKNQEKLIK YATDSETVEPIISQLVTVLLKGCQDANSQARLLCGECLGELGAIDPGRLDFSTTETQG KDFTFVTGVEDSSFAYGLLMELTRAYLAYADNSRAQDSAAYAIQELLSIYDCREMETN GPGHQLWRRFPEHVREILEPHLNTRYKSSQKSTDWSGVKKPIYLSKLGSNFAEWSASW AGYLITKVRHDLASKIFTCCSIMMKHDFKVTIYLLPHILVYVLLGCNQEDQQEVYAEI MAVLKHDDQHTINTQDIASDLCQLSTQTVFSMLDHLTQWARHKFQALKAEKCPHSKSN RNKVDSMVSTVDYEDYQSVTRFLDLIPQDTLAVASFRSKAYTRAVMHFESFITEKKQN IQEHLGFLQKLYAAMHEPDGVAGVSAIRKAEPSLKEQILEHESLGLLRDATACYDRAI QLEPDQIIHYHGVVKSMLGLGQLSTVITQVNGVHANRSEWTDELNTYRVEAAWKLSQW DLVENYLAADGKSTTWSVRLGQLLLSAKKRDITAFYDSLKLVRAEQIVPLSAASFERG SYQRGYEYIVRLHMLCELEHSIKPLFQHSPGDSSQEDSLNWVARLEMTQNSYRAKEPI LALRRALLSLNKRPDYNEMVGECWLQSARVARKAGHHQTAYNALLNAGESRLAELYVE RAKWLWSKGDVHQALIVLQKGVELCFPENETPPEGKNMLIHGRAMLLVGRFMEETANF ESNAIMKKYKDVTACLPEWEDGHFYLAKYYDKLMPMVTDNKMEKQGDLIRYIVLHFGR SLQYGNQFIYQSMPRMLTLWLDYGTKAYEWEKAGRSDRVQMRNDLGKINKVITEHTNY LAPYQFLTAFSQLISRICHSHDEVFVVLMEIIAKVFLAYPQQAMWMMTAVSKSSYPMR VNRCKEILNKAIHMKKSLEKFVGDATRLTDKLLELCNKPVDGSSSTLSMSTHFKMLKK LVEEATFSEILIPLQSVMIPTLPSILGTHANHASHEPFPGHWAYIAGFDDMVEILASL QKPKKISLKGSDGKFYIMMCKPKDDLRKDCRLMEFNSLINKCLRKDAESRRRELHIRT YAVIPLNDECGIIEWVNNTAGLRPILTKLYKEKGVYMTGKELRQCMLPKSAALSEKLK VFREFLLPRHPPIFHEWFLRTFPDPTSWYSSRSAYCRSTAVMSMVGYILGLGDRHGEN ILFDSLTGECVHVDFNCLFNKGETFEVPEIVPFRLTHNMVNGMGPMGTEGLFRRACEV TMRLMRDQREPLMSVLKTFLHDPLVEWSKPVKGHSKAPLNETGEVVNEKAKTHVLDIE QRLQGVIKTRNRVTGLPLSIEGHVHYLIQEATDENLLCQMYLGWTPYM" BASE COUNT 2511 a 1555 c 1738 g 2406 t ORIGIN 1 gcctccacac ggctccgtcg ggcgccgcgc tcttccggca gcggtagctt tggagacgcc 61 gggaacccgc gttggcgtgg ttgactagtg cctcgcagcc tcagcatggg ggaacatggc 121 ctggagctgg cttccatgat ccccgccctg cgggagctgg gcagtgccac accagaggaa 181 tataatacag ttgtacagaa gccaagacaa attctgtgtc aattcattga ccggatactt 241 acagatgtaa atgttgttgc tgtagaactt gtaaagaaaa ctgactctca gccaacctcc 301 gtgatgttgc ttgatttcat ccagcatatc atgaaatcct ccccacttat gtttgtaaat 361 gtgagtggaa gccatgaggc caaaggcagt tgtattgaat tcagtaattg gatcataacg 421 agacttctgc ggattgcagc aactccctcc tgtcatttgt tacacaagaa aatctgtgaa 481 gtcatctgtt cattattatt tctttttaaa agcaagagtc ctgctatttt tggggtactc 541 acaaaagaat tattacaact ttttgaagac ttggtttacc tccatagaag aaatgtgatg 601 ggtcatgctg tggaatggcc agtggtcatg agccgatttt taagtcaatt agatgaacac 661 atgggatatt tacaatcagc tcctttgcag ttgatgagta tgcaaaattt agaatttatt 721 gaagtcactt tattaatggt tcttactcgt attattgcaa ttgtgttttt tagaaggcaa 781 gaactcttac tttggcagat aggttgtgtt ctgctagagt atggtagtcc aaaaattaaa 841 tccctagcaa ttagcttttt aacagaactt tttcagcttg gaggactacc agcacaacca 901 gctagcactt ttttcagctc atttttggaa ttattaaaac accttgtaga aatggatact 961 gaccaattga aactctatga agagccatta tcaaagctga taaagacact atttcccttt 1021 gaagcagaag cttatagaaa tattgaacct gtctatttaa atatgctgct ggaaaaactc 1081 tgtgtcatgt ttgaagacgg tgtgctcatg cggcttaagt ctgatttgct aaaagcagct 1141 ttgtgccatt tactgcagta tttccttaaa tttgtgccag ctgggtatga atctgcttta 1201 caagtcagga aggtctatgt gagaaatatt tgtaaagctc ttttggatgt gcttggaatt 1261 gaggtagatg cagagtactt gttgggccca ctttatgcag ctttgaaaat ggaaagtatg 1321 gaaatcattg aggagattca atgccaaact caacaggaaa acctcagcag taatagtgat 1381 ggaatatcac ccaaaaggcg tcgtctcagc tcgtctctaa acccttctaa aagagcacca 1441 aaacagactg aggaaattaa acatgtggac atgaaccaaa agagcatatt atggagtgca 1501 ctgaaacaga aagctgaatc ccttcagatt tcccttgaat acagtggcct aaagaatcct 1561 gttattgaga tgttagaagg aattgctgtt gtcttacaac tgactgctct gtgtactgtt 1621 cattgttctc atcaaaacat gaactgccgt actttcaagg actgtcaaca taaatccaag 1681 aagaaacctt ctgtagtgat aacttggatg tcattggatt tttacacaaa agtgcttaag 1741 agctgtagaa gtttgttaga atctgttcag aaactggacc tggaggcaac cattgataag 1801 gtggtgaaaa tttatgatgc tttgatttat atgcaagtaa acagttcatt tgaagatcat 1861 atcctggaag atttatgtgg tatgctctca cttccatgga tttattccca ttctgatgat 1921 ggctgtttaa agttgaccac atttgccgct aatcttctaa cattaagctg taggatttca 1981 gatagctatt caccacaggc acaatcacga tgtgtgtttc ttctgactct gtttccaaga 2041 agaatattcc ttgagtggag aacagcagtt tacaactggg ccctgcagag ctcccatgaa 2101 gtaatccggg ctagttgtgt tagtggattt tttatcttat tgcagcagca gaattcttgt 2161 aacagagttc ccaagattct tatagataaa gtcaaagatg attctgacat tgtcaagaaa 2221 gaatttgctt ctatacttgg tcaacttgtc tgtactcttc acggcatgtt ttatctgaca 2281 agttctttaa cagaaccttt ctctgaacac ggacatgtgg acctcttctg taggaacttg 2341 aaagccactt ctcaacatga atgttcatct tctcaactaa aagcttctgt ctgcaagcca 2401 ttccttttcc tactgaaaaa aaaaatacct agtccagtaa aacttgcttt catagataat 2461 ctacatcatc tttgtaagca tcttgatttt agagaagatg aaacagatgt aaaagcagtt 2521 cttggaactt tattaaattt aatggaagat ccagacaaag atgttagagt ggcttttagt 2581 ggaaatatca agcacatatt ggaatccttg gactctgaag atggatttat aaaggagctt 2641 tttgtcttaa gaatgaagga agcatataca catgcccaaa tatcaagaaa taatgagctg 2701 aaggatacct tgattcttac aacaggggat attggaaggg ccgcaaaagg agatttggta 2761 ccatttgcac tcttacactt attgcattgt ttgttatcca agtcagcatc tgtctctgga 2821 gcagcataca cagaaattag agctctggtt gcagctaaaa gtgttaaact gcaaagtttt 2881 ttcagccagt ataagaaacc catctgtcag tttttggtag aatcccttca ctctagtcag 2941 atgacagcac ttccgaatac tccatgccag aatgctgacg tgcgaaaaca agatgtggct 3001 caccagagag aaatggcttt aaatacgttg tctgaaattg ccaacgtttt cgactttcct 3061 gatcttaatc gttttcttac taggacatta caagttctac tacctgatct tgctgccaaa 3121 gcaagccctg cagcttctgc tctcattcga actttaggaa aacaattaaa tgtcaatcgt 3181 agagagattt taataaacaa cttcaaatat attttttctc atttggtctg ttcttgttcc 3241 aaagatgaat tagaacgtgc ccttcattat ctgaagaatg aaacagaaat tgaactgggg 3301 agcctgttga gacaagattt ccaaggattg cataatgaat tattgctgcg tattggagaa 3361 cactatcaac aggtttttaa tggtttgtca atacttgcct catttgcatc cagtgatgat 3421 ccatatcagg gcccgagaga tatcatatca cctgaactga tggctgatta tttacaaccc 3481 aaattgttgg gcattttggc tttttttaac atgcagttac tgagctctag tgttggcatt 3541 gaagataaga aaatggcctt gaacagtttg atgtctttga tgaagttaat gggacccaaa 3601 catgtcagtt ctgtgagggt gaagatgatg accacactga gaactggcct tcgattcaag 3661 gatgattttc ctgaattgtg ttgcagagct tgggactgct ttgttcgctg cctggatcat 3721 gcttgtctgg gctcccttct cagtcatgta atagtagctt tgttacctct tatacacatc 3781 cagcctaaag aaactgcagc tatcttccac tacctcataa ttgaaaacag ggatgctgtg 3841 caagattttc ttcatgaaat atatttttta cctgatcatc cagaattaaa aaagataaaa 3901 gccgttctcc aggaatacag aaaggagacc tctgagagca ctgatcttca gacaactctt 3961 cagctctcta tgaaggccat tcaacatgaa aatgtcgatg ttcgtattca tgctcttaca 4021 agcttgaagg aaaccttgta taaaaatcag gaaaaactga taaagtatgc aacagacagt 4081 gaaacagtag aacctattat ctcacagttg gtgacagtgc ttttgaaagg ttgccaagat 4141 gcaaactctc aagctcggtt gctctgtggg gaatgtttag gggaattggg ggcgatagat 4201 ccaggtcgat tagatttctc aacaactgaa actcaaggaa aagattttac atttgtgact 4261 ggagtagaag attcaagctt tgcctatgga ttattgatgg agctaacaag agcttacctt 4321 gcgtacgctg ataatagccg agctcaagat tcagctgcct atgccattca ggagttgctt 4381 tctatttatg actgtagaga gatggagacc aacggcccag gtcaccaatt gtggaggaga 4441 tttcctgagc atgttcggga aatactagaa cctcatctaa ataccagata caagagttct 4501 cagaagtcaa ccgattggtc tggagtaaag aagccaattt acttaagtaa attgggtagt 4561 aactttgcag aatggtcagc atcttgggca ggttatctta ttacaaaggt tcgacatgat 4621 cttgccagta aaattttcac ctgctgtagc attatgatga agcatgattt caaagtgacc 4681 atctatcttc ttccacatat tctggtgtat gtcttactgg gttgtaatca agaagatcag 4741 caggaggttt atgcagaaat tatggcagtt ctaaagcatg acgatcagca taccataaat 4801 acccaagaca ttgcatctga tctgtgtcaa ctcagtacac agactgtgtt ctccatgctt 4861 gaccatctca cacagtgggc aaggcacaaa tttcaggcac tgaaagctga gaaatgtcca 4921 cacagcaaat caaacagaaa taaggtagac tcaatggtat ctactgtgga ttatgaagac 4981 tatcagagtg taacccgttt tctagacctc ataccccagg atactctggc agtagcttcc 5041 tttcgctcca aagcatacac acgagctgta atgcactttg aatcatttat tacagaaaag 5101 aagcaaaata ttcaggaaca tcttggattt ttacagaaat tgtatgctgc tatgcatgaa 5161 cctgatggag tggccggagt cagtgcaatt agaaaggcag aaccatctct aaaagaacag 5221 atccttgaac atgaaagcct tggcttgctg agggatgcca ctgcttgtta tgacagggct 5281 attcagctag aaccagacca gatcattcat tatcatggtg tagtaaagtc catgttaggt 5341 cttggtcagc tgtctactgt tatcactcag gtgaatggag tgcatgctaa caggtccgag 5401 tggacagatg aattaaacac gtacagagtg gaagcagctt ggaaattgtc acagtgggat 5461 ttggtggaaa actatttggc agcagatgga aaatctacaa catggagtgt cagactggga 5521 cagctattat tatcagccaa aaaaagagat atcacagctt tttatgactc actgaaacta 5581 gtgagagcag aacaaattgt acctctttca gctgcaagct ttgaaagagg ctcctaccaa 5641 cgaggatatg aatatattgt gagattgcac atgttatgtg agttggagca tagcatcaaa 5701 ccacttttcc agcattctcc aggtgacagt tctcaagaag attctctaaa ctgggtagct 5761 cgactagaaa tgacccagaa ttcctacaga gccaaggagc ctatcctggc tctccggagg 5821 gctttactaa gcctcaacaa aagaccagat tacaatgaaa tggttggaga atgctggctg 5881 cagagtgcca gggtagctag aaaggctggt caccaccaga cagcctacaa tgctctcctt 5941 aatgcagggg aatcacgact cgctgaactg tacgtggaaa gggcaaagtg gctctggtcc 6001 aagggtgatg ttcaccaggc actaattgtt cttcaaaaag gtgttgaatt atgttttcct 6061 gaaaatgaaa ccccacctga gggtaagaac atgttaatcc atggtcgagc tatgctacta 6121 gtgggccgat ttatggaaga aacagctaac tttgaaagca atgcaattat gaaaaaatat 6181 aaggatgtga ccgcgtgcct gccagaatgg gaggatgggc atttttacct tgccaagtac 6241 tatgacaaat tgatgcccat ggtcacagac aacaaaatgg aaaagcaagg tgatctcatc 6301 cggtatatag ttcttcattt tggcagatct ctacaatatg gaaatcagtt catatatcag 6361 tcaatgccac gaatgttaac tctatggctt gattatggta caaaggcata tgaatgggaa 6421 aaagctggcc gctccgatcg tgtacaaatg aggaatgatt tgggtaaaat aaacaaggtt 6481 atcacagagc atacaaacta tttagctcca tatcaatttt tgactgcttt ttcacaattg 6541 atctctcgaa tttgtcattc tcacgatgaa gtttttgttg tcttgatgga aataatagcc 6601 aaagtatttc tagcctatcc tcaacaagca atgtggatga tgacagctgt gtcaaagtca 6661 tcttatccca tgcgtgtgaa cagatgcaag gaaatcctca ataaagctat tcatatgaaa 6721 aaatccttag agaagtttgt tggagatgca actcgcctaa cagataagct tctagaattg 6781 tgcaataaac cggttgatgg aagtagttcc acattaagca tgagcactca ttttaaaatg 6841 cttaaaaagc tggtagaaga agcaacattt agtgaaatcc tcattcctct acaatcagtc 6901 atgataccta cacttccatc aattctgggt acccatgcta accatgctag ccatgaacca 6961 tttcctggac attgggccta tattgcaggg tttgatgata tggtggaaat tcttgcttct 7021 cttcagaaac caaagaagat ttctttaaaa ggctcagatg gaaagttcta catcatgatg 7081 tgtaagccaa aagatgacct gagaaaggat tgtagactaa tggaattcaa ttccttgatt 7141 aataagtgct taagaaaaga tgcagagtct cgtagaagag aacttcatat tcgaacatat 7201 gcagttattc cactaaatga tgaatgtggg attattgaat gggtgaacaa cactgctggt 7261 ttgagaccta ttctgaccaa actatataaa gaaaagggag tgtatatgac aggaaaagaa 7321 cttcgccagt gtatgctacc aaagtcagca gctttatctg aaaaactcaa agtattccga 7381 gaatttctcc tgcccaggca tcctcctatt tttcatgagt ggtttctgag aacattccct 7441 gatcctacat catggtacag tagtagatca gcttactgcc gttccactgc agtaatgtca 7501 atggttggtt atattctggg gcttggagac cgtcatggtg aaaatattct ctttgattct 7561 ttgactggtg aatgcgtaca tgtagatttc aattgtcttt tcaataaggg agaaaccttt 7621 gaagttccag aaattgtgcc atttcgcctg actcataata tggttaatgg aatgggtcct 7681 atgggaacag agggtctttt tcgaagagca tgtgaagtta caatgaggct gatgcgtgat 7741 cagcgagagc ctttaatgag tgtcttaaag acttttctac atgatcctct tgtggaatgg 7801 agtaaaccag tgaaagggca ttccaaagcg ccactgaatg aaactggaga agttgtcaat 7861 gaaaaggcca agacccatgt tcttgacatt gagcagcgac tacaaggtgt aatcaagact 7921 cgaaatagag tgacaggact gccgttatct attgaaggac atgtgcatta ccttatacag 7981 gaagctactg atgaaaactt actatgccag atgtatcttg gttggactcc atatatgtga 8041 aatgaaatta tgtaaaagaa tatgttaata atctaaaagt aatgcatttg gtatgaatct 8101 gtggttgtat ctgttcaatt ctaaagtaca acataaattt acgttctcag caactgttat 8161 ttctctctga tcattaatta tatgtaaaat aatatacatt cactcgtgcc // LOCUS HSU49857 879 bp mRNA PRI 28-MAR-1996 DEFINITION Human transcriptional activator mRNA, complete cds. ACCESSION U49857 NID g1236938 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 879) AUTHORS Lin,S.L. TITLE CROC-4 encodes a brain-specific protein which causes transcriptional activation of the human c-fos proto-oncogene promoter JOURNAL Unpublished REFERENCE 2 (bases 1 to 879) AUTHORS Lin,S.L. TITLE Direct Submission JOURNAL Submitted (23-FEB-1996) Stanley L. Lin, Tumor Biology, Schering-Plough Research Institute, 2015 Galloping Hill Road, Kenilworth, NJ 07033, USA FEATURES Location/Qualifiers source 1..879 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="CROC-4" /tissue_type="brain" /dev_stage="adult" 5'UTR 1..117 CDS 118..588 /codon_start=1 /function="transcriptional activation of the c-fos proto-oncogene promoter" /evidence=experimental /product="transcriptional acitvator" /db_xref="PID:g1236939" /translation="MFLTEDLITFNLRNFLLFQLWESSFSPGAGGFCTTLPPSFLRVD DRATSSTTDSSRAPSSPRPPGSTSHCGISTRCTERCLCVLPLRTSQVPDVMAPQHDQE KFHDLAYSCLGKSFSMSNQDLYGYSTSSLALGLAWLSWETKKKNVLHLVGLDSL" 3'UTR 589..879 polyA_signal 869..874 BASE COUNT 200 a 249 c 203 g 227 t ORIGIN 1 ctcctcacag aagcctggag ctgggcatcc aagaagaagc agcctcattt gttttctggt 61 gtcatcgtag gtggccacct atggcttttg ggaatgtaaa aagggcagct ctctggcatg 121 ttcctgactg aggatctcat aacatttaac ttgaggaact tcctcctttt ccagctttgg 181 gagtcaagct tctcacctgg ggcgggtggg ttctgcacca ccctcccacc ctccttcctc 241 cgtgtggacg atagagccac atccagcacc acggacagct cccgggcgcc ttcatctcct 301 cgtcctccag gcagcacaag ccattgtgga atctccacca ggtgtacaga acggtgcctc 361 tgcgtcctgc cactcaggac ctctcaagtc cccgatgtga tggctcctca gcatgatcag 421 gagaaattcc atgatcttgc ttattcctgt cttgggaagt ccttctccat gtctaaccaa 481 gatctatatg gctatagcac cagctctttg gctcttggct tggcatggct aagttgggag 541 accaaaaaga agaatgtact tcatctggtt gggctggatt ccctctgata agccttccca 601 gttgactgaa agatgaggct aggctctagc aagttgaagt caaaccagct ccttcaagaa 661 gctttgagca gaatgaagtg gggaggaccc agcttccagc ccaggaagcc cactgtacct 721 ggagccatct gggataagac tttgacccat gactcccata tccacagcct gtccatccta 781 gcccatccca gtttatcctg tatcatttga gctgggattc ccacatcctc tgagttggaa 841 gtcccatctc aagtcttcaa taaagactct tgaatattg // LOCUS HSU49928 3096 bp mRNA PRI 03-JUL-1996 DEFINITION Human TAK1 binding protein 1 (TAB1) mRNA, complete cds. ACCESSION U49928 NID g1401125 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3096) AUTHORS Shibuya,H., Yamaguchi,K., Shirakabe,K., Tonegawa,A., Gotoh,Y., Ueno,N., Irie,K., Nishida,E. and Matsumoto,K. TITLE TAB1: an activator of the TAK1 MAPKKK in TGF-beta signal transduction JOURNAL Science 272 (5265), 1179-1182 (1996) MEDLINE 96216294 REFERENCE 2 (bases 1 to 3096) AUTHORS Shibuya,H. TITLE Direct Submission JOURNAL Submitted (25-FEB-1996) Hiroshi Shibuya, Faculty of Pharmaceutical Sciences, Hokkaido University, Nishi 6-chome, Kita 12, Kita-ku, Sapporo, Hokkaido 060, Japan FEATURES Location/Qualifiers source 1..3096 /organism="Homo sapiens" /db_xref="taxon:9606" gene 21..1535 /gene="TAB1" CDS 21..1535 /gene="TAB1" /note="activator for TAK1" /codon_start=1 /product="TAK1 binding protein" /db_xref="PID:g1401126" /translation="MAAQRRSLLQSEQQPSWTDDLPLCHLSGVGSASNRSYSADGKGT ESHPPEDSWLKFRSENNCFLYGVFNGYDGNRVTNFVAQRLSAELLLGQLNAEHAEADV RRVLLQAFDVVERSFLESIDDALAEKASLQSQLPEGVPQHQLPPQYQKILERLKTLER EISGGAMAVVAVLLNNKLYVANVGTNRALLCKSTVDGLQVTQLNVDHTTENEDELFRL SQLGLDAGKIKQVGIICGQESTRRIGDYKVKYGYTDIDLLSAAKSKPIIAEPEIHGAQ PLDGVTGFLVLMSEGLYKALEAAHGPGQANQEIAAMIDTEFAKQTSLDAVAQAVVDRV KRIHSDTFASGGERARFCPRHEDMTLLVRNFGYPLGEMSQPTPSPAPAAGGRVYPVSV PYSSAQSTSKTSVTLSLVMPSQGQMVNGAHSASTLDEATPTLTNQSPTLTLQSTNTHT QSSSSSSDGGLFRSRPAHSLPPGEDGRVEPYVDFAEFYRLWSVDHGEQSVVTAP" BASE COUNT 642 a 936 c 952 g 566 t ORIGIN 1 gcccgcaggg ttcctccaag atggcggcgc agaggaggag cttgctgcag agtgagcagc 61 agccaagctg gacagatgac ctgcctctct gccacctctc tggggttggc tcagcctcca 121 accgcagcta ctctgctgat ggcaagggca ctgagagcca cccgccagag gacagctggc 181 tcaagttcag gagtgagaac aactgcttcc tgtatggggt cttcaacggc tatgatggca 241 accgagtgac caacttcgtg gcccagcggc tgtccgcaga gctcctgctg ggccagctga 301 atgccgagca cgccgaggcc gatgtgcggc gtgtgctgct gcaggccttc gatgtggtgg 361 agaggagctt cctggagtcc attgacgacg ccttggctga gaaggcaagc ctccagtcgc 421 aattgccaga gggagtccct cagcaccagc tgcctcctca gtatcagaag atccttgaga 481 gactcaagac gttagagagg gaaatttcgg gaggggccat ggccgttgtg gcggtccttc 541 tcaacaacaa gctctacgtc gccaatgtcg gtacaaaccg tgcactttta tgcaaatcga 601 cagtggatgg gttgcaggtg acacagctga acgtggacca caccacagag aacgaggatg 661 agctcttccg tctttcgcag ctgggcttgg atgctggaaa gatcaagcag gtggggatca 721 tctgtgggca ggagagcacc cggcggatcg gggattacaa ggttaaatat ggctacacgg 781 acattgacct tctcagcgct gccaagtcca aaccaatcat cgcagagcca gaaatccatg 841 gggcacagcc gctggatggg gtgacgggct tcttggtgct gatgtcggag gggttgtaca 901 aggccctaga ggcagcccat gggcctgggc aggccaacca ggagattgct gcgatgattg 961 acactgagtt tgccaagcag acctccctgg acgcagtggc ccaggccgtc gtggaccggg 1021 tgaagcgcat ccacagcgac accttcgcca gtggtgggga gcgtgccagg ttctgccccc 1081 ggcacgagga catgaccctg ctagtgagga actttggcta cccgctgggc gaaatgagcc 1141 agcccacacc gagcccagcc ccagctgcag gaggacgagt gtaccctgtg tctgtgccat 1201 actccagcgc ccagagcacc agcaagacca gcgtgaccct ctcccttgtc atgccctccc 1261 agggccagat ggtcaacggg gctcacagtg cttccaccct ggacgaagcc acccccaccc 1321 tcaccaacca aagcccgacc ttaaccctgc agtccaccaa cacgcacacg cagagcagca 1381 gctccagctc tgacggaggc ctcttccgct cccggcccgc ccactcgctc ccgcctggcg 1441 aggacggtcg tgttgagccc tatgtggact ttgctgagtt ttaccgcctc tggagcgtgg 1501 accatggcga gcagagcgtg gtgacagcac cgtagggcag ccggaggaat gcagcccaag 1561 cagggcctgg catggggcag gacagggtcc agccttttcc taacatctgc ctgtgccaca 1621 acggccagca ggtgccccat cctctgccca cagcagactc tgtcccatgg ctctccgggc 1681 agtagagtgt gtgagtgcag actggacctg tggttcatac cttgtcacca cccgggaagc 1741 tgaaggccac ttcctcccag atggcctcag ccaggaccat cgccctttct cagagcagag 1801 ggccaggtag ggaaaccgca gtgggcctgc aagccgcccg agcctcccca gcagcctcct 1861 acagagcagg aagaggcgcc ctgtgaaccc tgtagtgttg caggcccagc agaccctgct 1921 gtcccaagcc cacccctcct cccaccatca cctccctcac ctcgggacag tagccctcca 1981 cttctccagc ctctcagccc tgtgctcctg tatccagagt ggaacccagg ctggtgtccg 2041 tatctgtccc tgggccccac ccctggacct gccttggttg tgtcatctgt tgtaaacatt 2101 ccaggaggac caggggagca tctggggcct gggatggcca cagaaggggc aggccaggtg 2161 gaaaggagcc agggggaagt ggtctaagag acctggaact gccagaggat ggcggcctgg 2221 gcttccccag agccaggcgt gcgggagagg tgaggactgg ccccggtggg ctgaggcagg 2281 ggccgctgtc gtcaggcctg agccagggtg agctggtgcc tgccttgctt cttccttctg 2341 gtgctgtgaa gaccataggc tggcaggcag ctgagatgaa ctgtctttac cactgatgag 2401 gggcctctgc cggctgaggg tagcaagcag gggttgtgag tcaggctggg ggacttgttt 2461 gaaagaaaga ggagttggaa tgtggttccc aggagggaag aggttccttt gagacacagt 2521 aaccctggga ggcataggag aagggtcggg ccagcccagc ccagggcctg agttagacta 2581 tttcccacat gttctctgcc ttcagtgggg agggggtgcc accagggctg tcggccagga 2641 ttgccactcc tgtttcagag gaagcaggcc gagagacttg caccttggac aagccacaca 2701 atcagtgggg cagccagagc tcagacctga gccattgtgt cagtatccag gaccccccgg 2761 attctccacg ccctccccat ctcccagtct ccctgccccc catgccccag accggcccac 2821 cagggactag ccgctgtcgc acagcctctg gggtgcttgg tctctgcaaa gtcaaaggcc 2881 tgacagctct gtggcctggg aatccatttt cctgcgggag agcagggcct ggtgtggaac 2941 cagggagctg tgggaagcca cagcagaaat ggaagaaaaa caggtctcag cccagggtcc 3001 tcgctcactc cctcactccc cactttgaag ccatctctgt tctgcaggtg agaggattta 3061 aagtcagtca caaaggcttg ggaacaaaag gaattc // LOCUS HSU49957 5656 bp mRNA PRI 18-OCT-1996 DEFINITION Human LIM protein (LPP) mRNA, partial cds. ACCESSION U49957 NID g1537016 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5656) AUTHORS Petit,M.M.R., Mols,R., Schoenmakers,E.F., Mandahl,N. and Van de Ven,W.J. TITLE LPP, the preferred fusion partner gene of HMGIC in lipomas, is a novel member of the LIM protein gene family JOURNAL Genomics 36 (1), 118-129 (1996) MEDLINE 96411654 REFERENCE 2 (bases 1 to 5656) AUTHORS Petit,M. and Mols,R. TITLE Direct Submission JOURNAL Submitted (23-FEB-1996) Center fo Human Genetics, Catholic University of Leuven, Herestraat 49, Leuven B-3000, Belgium FEATURES Location/Qualifiers source 1..5656 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="small intestine" /chromosome="3" /map="3q27-q28" gene 247..2085 /gene="LPP" CDS 247..2085 /gene="LPP" /note="lipoma preferred partner; LIM protein gene" /codon_start=1 /product="LIM protein" /db_xref="PID:g1537017" /translation="MSHPSWLPPKSTGEPLGHVPARMETTHSFGNPSISVSTQQPPKK FAPVVAPKPKYNPYKQPGGEGDFLPPPPPPLDDSSALPSISGNFPPPPPLDEEAFKVQ GNPGGKTLEERRSSLDAEIDSLTSILADLECSSPYKPRPPQSSTGSTASPPVSTPVTG HKRMVIPNQPPLTATKKSTLKPQPAPQAGPIPVAPIGTLKPQPQPVPASYTTASTSSR PTFNVQVKSAQPSPHYMAAPSSGQIYGSGPQGYNTQPVPVSGQCPPPSTRGGMDYAYI PPPGLQPEPGYGYAPNQGRYYEGYYAAGPGYGGRNDSDPTYGQQGHPNTWKREPGYTP PGAGNQNPPGMYPVTGPKKTYITDPVSAPCAPPLQPKGGHSGQLGPSSVAPSFRPEDE LEHLTKKMLYDMENPPADEYFGRCARCGENVVGEGTGCTAMDQVFHVDCFTCIICNNK LRGQPFYAVEKKAYCEPCYINTLEQCNVCSKPIMERILRATGKAYHPHCFTCVMCHRS LDGIPFTVDAGGLIHCIEDFHKKFAPRCSVCKEPIMPAPGQEETVRIVALDRDFHVHC YRCEDCGGLLSEGDNQGCYPLDGHILCKTCNSARIRVLTAKASTDL" BASE COUNT 1646 a 1239 c 1068 g 1702 t 1 others ORIGIN 1 gtcactttta tttgggggtg tggacagctg ctttcccagg ggagtacttc ttacagtggg 61 atttcaagac aagatcggcc tgaagaaaaa ttatatttgt atatttttta aaaagtggga 121 actttgaggc tcagagacag agcagaagac agaacctggt cttctgattc cctgtgttct 181 gcttttttca ttgttccact ggacgctcat cagagggaag atctttttcc tcaattgatt 241 ccaacaatgt ctcacccatc ttggctgcca cccaaaagca ctggtgagcc cctcggccat 301 gtgcctgcac ggatggagac cacccattcc tttgggaacc ccagcatttc agtgtctaca 361 caacagccac ccaaaaagtt tgccccggta gttgctccaa aacctaagta caacccatac 421 aaacaacctg gaggtgaggg tgattttctt ccacccccac ctccacctct agatgattcc 481 agtgcccttc catctatctc tggaaacttt cctcctccac cacctcttga tgaagaggct 541 ttcaaagtac aggggaatcc cggaggcaag acacttgagg agaggcgctc cagcctggac 601 gctgagattg actccttgac cagcatcttg gctgaccttg agtgcagctc cccctataag 661 cctcggcctc cacagagctc cactggttca acagcctctc ctccagtttc gaccccagtc 721 acaggacaca agagaatggt catcccgaac caaccccctc taacagcaac caagaagtct 781 acattgaaac cacagcctgc accccaggct ggacccatcc ctgtggctcc aatcggaaca 841 ctcaaacccc agcctcagcc agtcccagcc tcctacacca cggcctccac ttcttcaagg 901 cctaccttta atgtgcaggt gaagtcagcc cagcccagcc ctcattatat ggctgcccct 961 tcatcaggac aaatttatgg ctcagggccc cagggctata acactcagcc agttcctgtc 1021 tctgggcagt gtccacctcc ttcaacacgg ggaggcatgg attatgccta cattccacca 1081 ccaggacttc agccggagcc tgggtatggg tatgccccca accagggacg ctattatgaa 1141 ggctactatg cagcagggcc aggctatggg ggcagaaatg actctgaccc tacctatggt 1201 caacaaggtc acccaaatac ctggaaacgg gaaccagggt acactcctcc tggagcaggg 1261 aaccagaacc ctcctgggat gtatccagtc actggtccca agaagaccta tatcacagat 1321 cctgtttcag ccccctgtgc gccaccattg cagccaaagg gtggccattc agggcaactg 1381 gggccttcgt cagttgcccc ttcattccgc ccagaggatg agcttgagca cctgaccaaa 1441 aagatgctgt atgacatgga aaatccacct gctgacgaat actttggccg ctgtgctcgc 1501 tgtggagaaa acgtagttgg ggaaggtaca ggatgcactg ccatggatca ggtcttccac 1561 gtggattgtt ttacctgcat catctgcaac aacaagctcc gagggcagcc attctatgct 1621 gtggaaaaga aagcatactg cgagccctgc tacattaata ctctggagca gtgcaatgtg 1681 tgttccaagc ccatcatgga gcggattctc cgagccaccg ggaaggccta tcatcctcac 1741 tgtttcacct gcgtgatgtg ccaccgcagc ctggatggga tcccattcac tgtggatgct 1801 ggcgggctca ttcactgcat tgaggacttc cacaagaaat ttgccccgcg gtgttctgtg 1861 tgcaaggagc ctattatgcc agccccgggc caggaggaga ctgtccgtat tgtggctttg 1921 gatcgagatt tccatgttca ctgctaccga tgcgaggatt gcggtggtct cctgtctgaa 1981 ggagataacc aaggctgcta ccccttggat gggcacatcc tctgcaagac ctgcaactct 2041 gcccgcatca gggtgttgac cgccaaggcg agcactgacc tttagattca gtcacctgtt 2101 cagccggcac tgagaagaac gaacacaaga aaaagataag aaatactaga gtaaaggcca 2161 tcaaactacg cgatagtctc tgttcttcat ctgctattaa ccttgcctta gaaacacata 2221 aattatgaga ttttttttta aaagttgtta ccaaatacac atttcacatt gaatcatgta 2281 ggatcttgat gggcctttgt tcccaaggac ttccacattt ttgcacagat tatgctccat 2341 cccttcactt ctgcattcct gtaactttta atccctatgt ttgtctcact tttcatctgg 2401 ttgaatggct tttcttagtg tggtatttgc tgtcacatag ttttttcctg ggtgagtctg 2461 ccaactcaca ggtgctttta ggcttgaaat ctccatccta tcatttccgt tttgcctgtg 2521 actgtaaaga gtagccattc ttttcccatg tattgaagag gatattcttc tcttgcttta 2581 tactactcac gtccttgggg agggaaatgc acaatttttt tttgttaggc tgtaaagaat 2641 ttaagctgta aattacataa gttagaacaa gcccaaattt aatttgcaac catcagaatt 2701 cagaatctat agtgaccagt gatcaaggct aattggaaaa gagttatcgg cccatagcta 2761 ataagtagtg acagacaacc aagcttcaat atttttctaa agaaattaca ggtgggatat 2821 gctagaaaag gcattttggg gttatgttta aaaaaacatt attgtcccac aatattacct 2881 taagattttt cttttccgca ctacctgaac attgtaatac agacaaactt gatttcttct 2941 agaagataac attttcaata ctgtcccact tctcatctta aaaatattgt catgtttatt 3001 ctaatatcca acgcaactat caaaattgcc tttttctcta gaggatgaag gctgtgaaaa 3061 aaccgttcaa attctcttct ttttcttttt tattaccagg tccattttgc ctgacaattg 3121 caaatcagag catacaaaat aaaactgtgc agttttgttt ggtttacttt caaaagagta 3181 gaaagcttga aaagattctg aaaccacagt ttcattattc tcataatcct tctgcaactg 3241 aaattacata ttgcaggaga cattttcata tcatcaatgt gacatttaca ccacactttc 3301 aaagacaatc actgaaacaa aaattgtctt tatgagctaa aaatatgcag aatctctgcc 3361 tagaatcttt attcaaactt ttattagcca gtgaaacact tgcttgccaa ctgccaagcc 3421 atacttatta agttcgaaca tgtttcactt aaggagagac acctagctta gtcatggcaa 3481 gttgccattt tgtaaactaa ggattttgga ctgagatttc ttaaatcttt cttcaaatct 3541 cccacaagta tatactttta aattatggag tattttaagt ctacaaaaag gtataaataa 3601 taatataatg aattcctata tacctaatac ccagtttaag acaccaaata taacaagtat 3661 aattacatcc tccaatgtac cgtttcctta ttccacagat atctttttca ttattgtgaa 3721 gtgatgttca gatttctagt ttttttttct agtttttaat tttaacatca gaactgaaat 3781 aaaaaattat ggatacgtgt tttgaattgc aaactattcc tcaggaattc caattaaatt 3841 tattttactt gaataggaat gatcataaaa gtgattcttt ttttgtgact agaaattctt 3901 aagccgatgg tcactatagc tcatccttaa tgtatggctc atttgctttt gtcactaaac 3961 ggttttgtgt tagaaccacc aaaattatag cttttaagag cttcctttga ccactgtctt 4021 tttcttaccc tacttctctt atctttgatc gtatatttct cataatgtga aatatgatga 4081 gattcactta ggggcagcat gttagttttg ggaggcaatg tcaactgtgt ctctgaattc 4141 ctgtcttcca aattgaagcc agaccatgct gatgacctca agtagcactg actatttgac 4201 aatagggctg ataatgtaat cggcttgaat tttgacttag taacttttta tgtaatactt 4261 tcggagaaat tctctttagg acaaagcaga gagtccaatt tattgaggga tagattgtat 4321 ctcttaaagt ggtcatatta ttattaattt tgcaaaaaga agaagaattt attgaatatt 4381 tatactataa aatgcagtaa cattctactt gcttattaca tttaaaccct tgtgtaaccg 4441 aatgtttaca tgtcaagagg aaaagctgta agaaaatgct gtcacagacc catccctgtg 4501 gcatcgtcag caggatactg cttgcttncc cctttattat gcattcttat aacataacct 4561 cagagtatct ttaccaaagg tttttaaagt aaatctcttt ttaagataga cttattcttt 4621 tataataatg aatgtgcact catacatatg tagaaacatt tagatttaga gttttttttt 4681 tctttcacaa ggtataataa ggaaaggatc ctacaatatt ttattctagg gtttcttaaa 4741 tatatgattt atattactgt cttttctatt aatcatttga tttaaacaat gccaagtcac 4801 tcttttttag ttgcatgaaa ttttgcctgc aacagagaga aaaagattgt attactttaa 4861 tgattataat catactgtct gctgatataa tatcataatt gttgtggttt aaatacataa 4921 atagtagaaa aatcagagtc tataacagaa agtttgtaaa aatatactga ttttgaaaag 4981 tatcaggaat ataaatattt ggtaattctg tgtaacaaga gagacatgga aaaggaaaaa 5041 aatccactat atcatgggct ctgcagaaca tatcaaaata ggatttctta aatttttcaa 5101 cccccagacc atactgactc aacatggagt ctcactggcc aagaaaattg cctttaaaac 5161 tcgaaaataa ggccaggtgt ggtgactcac acttgtaatc ccagcaattt gggaggctga 5221 gacaggagga ttgcctgagg ccaggagttt gacaccagcc tgggcaacat agtgagacct 5281 tgtctctaca aaaaataaaa taaaataaaa aaccaaatta gccagacatg gtggcatgcg 5341 cttgtttaaa aaaacaaata aataaataat cgactcaaaa ataagtcatg tgtcccagca 5401 taaggcatca ggttgttaga tgctggcatc tctgcagctc aaagatgtgg gttctttttc 5461 ttgtcattaa cacattgtta tttctgtagg accaacttct ctgatcaaaa ttacttttct 5521 gggtatgtgc tgattaaggg ggtggactta tcaacactat aattgttccc tatgaaagat 5581 tccacagaga tgtttatggt gaggtttaaa ggatatgaaa ctctacattt aaaacaagta 5641 ttttatattt gggcac // LOCUS HSU49973 2418 bp DNA PRI 28-JUN-1997 DEFINITION Human Tigger1 transposable element, complete consensus sequence. ACCESSION U49973 NID g2226003 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2418) AUTHORS Smit,A.F. and Riggs,A.D. TITLE Tiggers and DNA transposon fossils in the human genome JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (4), 1443-1448 (1996) MEDLINE 96202298 REFERENCE 2 (bases 1 to 2418) AUTHORS Robertson,H.M. TITLE Members of the pogo superfamily of DNA-mediated transposons in the human genome JOURNAL Mol. Gen. Genet. 252 (6), 761-766 (1996) MEDLINE 97074895 REFERENCE 3 (bases 1 to 2418) AUTHORS Robertson,H.M. TITLE Direct Submission JOURNAL Submitted (15-FEB-1996) Hugh M. Robertson, Entomology, University of Illinois at Urbana-Champaign, 505 S. Goodwin, Urbana, IL 61801, USA FEATURES Location/Qualifiers source 1..2418 /organism="Homo sapiens" /transposon="Tigger1" /note="consensus sequence based on 50 full-length genomic sequences" /db_xref="taxon:9606" repeat_region 1..13 /rpt_type=inverted CDS 425..1789 /note="ORF1; MER37; putative transposase similar to pogo element" /citation=[1] /codon_start=1 /db_xref="PID:g2226004" /translation="MASKCSSERKSRTSLTLNQKLEMIKLSEEGMSKAEIGRKLGLLR QTVSQVVNAKEKFLKEIKSATPVNTRMIRKRNSLIADMEKVLVVWIEDQTSHNIPLSQ SLIQSKALTLFNSMKAERGEEAAEEKLEASRGWFMRFKERSRLHNIKVQGEAASADGE AAASYPEDLAKIIDEGGYTKQQIFNVDETAFYWKKMPSRTFIAREEKSMPGFKASKDR LTLLLGANAAGDFKLKPMLIYHSENPRALKNYAKSTLPVLYKWNNKAWMTAHLFTAWF TEYFKPTVETYCSEKKISFKILLLIDNAPGHPRALMEMYKEINVVFMPANTTSILQPM DQGVISTFKSYYLRNTFRKAIAAIDSDSSDGSGQSKLKTFWKGFTILDAIKNIRDSWE EVKISTLTGVWKKLIPTLMDDFEGFKTSVEEVTADVVEIARELELEVEPEDVTELLQS HDKT" CDS 1811..2206 /note="ORF2: function unknown" /codon_start=1 /db_xref="PID:g2226005" /translation="MDEQRKWFLEMESTPGEDAVNIVEMTTKDLEYYINLVDKAAAGF ERIDSNFERSSTVGKMLSNSIACYREIFRERKSQSMRQTSLLSYFKKLPQPPQPSATT TLISQQPSTSRQDPPPAKRLRLAEGSDDR" polyA_signal 2218..2223 repeat_region 2405..2418 /rpt_type=inverted BASE COUNT 746 a 477 c 529 g 666 t ORIGIN 1 caggcatacc tcgttttatt gcgcttcgct ttattgcgct tcgcagatat tgcgtttttt 61 acaaattgaa ggtttgtggc aaccctgcgt cgagcaagtc tatcggcgcc atttttccaa 121 cagcatgtgc tcacttcgtg tctctgtgtc acattttggt aattctcgca atatttcaaa 181 ctttttcatt attattatat ctgttatggt gatctgtgat cagtgatctt tgatgttact 241 attgtaattg ttttggggcg ccacgaaccg cgcccatata agacggcgaa cttaatcgat 301 aaatgttgtg tgtgttctga ctgctccacc gaccggccgt tcccccgtct ctctccctct 361 cctcgggcct ccctattccc tgagacacaa caatattgaa attaggccaa ttaataaccc 421 tacaatggcc tctaagtgtt caagtgaaag gaagagtcgc acgtctctca ctttaaatca 481 aaagctagaa atgattaagc ttagtgagga aggcatgtcg aaagccgaga taggccgaaa 541 gctaggcctc ttgcgccaaa cagttagcca agttgtgaat gcaaaggaaa agttcttgaa 601 ggaaattaaa agtgctactc cagtgaacac acgaatgata agaaagcgaa acagccttat 661 tgctgatatg gagaaagttt tagtggtctg gatagaagat caaaccagcc acaacattcc 721 cttaagccaa agcctaatcc agagcaaggc cctaactctc ttcaattcta tgaaggctga 781 gagaggtgag gaagctgcag aagaaaagtt ggaagctagc agaggttggt tcatgaggtt 841 taaggaaaga agccgtctcc ataacataaa agtgcaaggt gaagcagcaa gtgctgatgg 901 agaagctgca gcaagttatc cagaagatct agctaagata attgatgaag gtggctacac 961 taaacaacag attttcaatg tagacgaaac agccttctat tggaagaaga tgccatctag 1021 gactttcata gctagagagg agaagtcaat gcctggcttc aaagcttcaa aggacaggct 1081 gactctcttg ttaggggcta atgcagctgg tgactttaag ttgaagccaa tgctcattta 1141 ccattccgaa aatcctaggg cccttaagaa ttatgctaaa tctactctgc ctgtgctcta 1201 taaatggaac aacaaagcct ggatgacagc acatctgttt acagcatggt ttactgaata 1261 ttttaagccc actgttgaga cctactgctc agaaaaaaag atttctttca aaatattact 1321 gctcattgac aatgcacctg gtcacccaag agctctgatg gagatgtaca aggagattaa 1381 tgttgttttc atgcctgcta acacaacatc cattctgcag cccatggatc aaggagtaat 1441 ttcgactttc aagtcttatt atttaagaaa tacatttcgt aaggctatag ctgccataga 1501 tagtgattcc tctgatggat ctgggcaaag taaattgaaa accttctgga aaggattcac 1561 cattctagat gccattaaga acattcgtga ttcatgggag gaggtcaaaa tatcaacatt 1621 aacaggagtt tggaagaagt tgattccaac cctcatggat gactttgagg ggttcaagac 1681 ttcagtggag gaagtaactg cagatgtggt ggaaatagca agagaactag aattagaagt 1741 ggagcctgaa gatgtgactg aattgctgca atctcatgat aaaacttgaa cggatgagga 1801 gttgcttctt atggatgagc aaagaaagtg gtttcttgag atggaatcta ctcctggtga 1861 agatgctgtg aacattgttg aaatgacaac aaaggattta gaatattaca taaacttagt 1921 tgataaagca gcggcagggt ttgagaggat tgactccaat tttgaaagaa gttctactgt 1981 gggtaaaatg ctatcaaaca gcatcgcatg ctacagagaa atctttcgtg aaaggaagag 2041 tcaatcgatg cggcaaactt cattgttgtc ttattttaag aaattgccac agccacccca 2101 accttcagca accaccaccc tgatcagtca gcagccatca acatcgaggc aagaccctcc 2161 accagcaaaa agattacgac tcgctgaagg ctcagatgat cgttagcatt ttttagcaat 2221 aaagtatttt taaattaagg tatgtacatt gttttttaga cataatgcta ttgcacactt 2281 aatagactac agtatagtgt aaacataact tttatatgca ctgggaaacc aaaaaattcg 2341 tgtgactcgc tttattgcga tatttgcttt attgcggtgg tctggaaccg aacccgcaat 2401 atctccgagg tatgcctg // LOCUS HSU49974 1301 bp DNA PRI 26-JAN-1998 DEFINITION Human mariner2 transposable element, complete consensus sequence. ACCESSION U49974 NID g1698454 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1301) AUTHORS Robertson,H.M., Zumpano,K.L., Lohe,A.R. and Hartl,D.L. TITLE Reconstructing the ancient mariners of humans JOURNAL Nature Genet. 12 (4), 360-361 (1996) MEDLINE 96224393 REFERENCE 2 (bases 1 to 1301) AUTHORS Robertson,H.M. and Martos,R. TITLE Molecular evolution of the second ancient human mariner transposon, Hsmar2, illustrates patterns of neutral evolution in the human genome lineage JOURNAL Gene 205, 219-228 (1997) REFERENCE 3 (bases 1 to 1301) AUTHORS Robertson,H.M. TITLE Direct Submission JOURNAL Submitted (22-FEB-1996) Hugh M. Robertson, Entomology, University of Illinois at Urbana-Champaign, 505 S. Goodwin, Urbana, IL 61801, USA FEATURES Location/Qualifiers source 1..1301 /organism="Homo sapiens" /transposon="Hsmar2" /note="consensus sequence based on 20 unique long genomic copy sequences and 18 unique cDNAs of variable length; consensus with recognized 25 CpG hypermutable base pairs" /db_xref="taxon:9606" repeat_region 1..31 /rpt_type=inverted CDS 183..1238 /codon_start=1 /product="mariner transposase" /db_xref="PID:g1698455" /translation="MNSAKIEARTNIKFMVKLGWKNGEITDALRKVYGDNAPKKSAVY KWITRFKKGRDDVEDEARSGRPSTSICEEKINLVRALIEEDRRLTAETIANTTDISIG SAYTILTEKLKLSKLSTRWVPKPLRPDQLQTRAELSMEILNKWDQDPEAFLRRIVTGD ETWLYQYDPEDKAQSKQWLPRGGSGPVKAKADWSRAKVMATVFWDAQGILLVDFLEGQ RTITSAYYESVLRKLAKALAEKRPGKLHQRVLLHHDNAPAHSSHQTRAILREFRWEII RHPPYSPDLAPSDFFLFPNLKKSLKGTHFSSVNNVKKTALTWLNSQDPQFFRDGLNGW YHRLQKCLELDGAYVEK" repeat_region 1270..1301 /rpt_type=inverted BASE COUNT 428 a 250 c 272 g 351 t ORIGIN 1 cgaggggtct tcaaaaagtt catggaaaat gcgtattatg aaaaaactat gcatggattt 61 caaaaatttt ttgcaccaaa ataaactcgt actaacttgt tataacatgt ctgaacagga 121 tctagtttga ggcactaaga aggataagac atcagtttga aaagagcccc tatcagagca 181 acatgaattc tgctaaaatt gaagcaagaa caaacatcaa atttatggtg aagcttgggt 241 ggaagaatgg tgaaatcact gatgctttac gaaaagttta tggggacaat gccccaaaga 301 aatcagcagt ttacaaatgg ataactcgtt ttaagaaggg acgagacgat gttgaagatg 361 aagcccgcag cggcagacca tccacatcaa tttgtgagga aaaaattaat cttgttcgtg 421 ccctaattga agaggaccga cgattaacag cagaaacaat agccaacacc acggacatct 481 caattggttc agcttacaca attctgactg aaaaattaaa gttgagcaaa ctttccactc 541 gatgggtgcc aaaaccgttg cgcccagatc agctgcagac aagagcagag ctttcaatgg 601 aaattttaaa caagtgggat caagatcctg aagcatttct tcgaagaatt gtaacaggag 661 atgaaacgtg gctttaccag tacgatcctg aagacaaagc acaatcaaag caatggctac 721 caagaggtgg aagtggtcca gtcaaagcaa aagcggactg gtcaagagca aaggtcatgg 781 caacagtttt ttgggatgct caaggcattt tgcttgttga ctttctggag ggccaaagaa 841 cgataacatc tgcttattat gagagtgttt tgagaaagtt agccaaagct ttagcagaaa 901 aacgcccggg aaagcttcac cagagagtcc ttctccacca cgacaatgct cctgctcatt 961 cctctcatca aacaagggca attttgcgag agtttcgatg ggaaatcatt aggcatccac 1021 cttacagtcc tgatttggct ccttctgact tctttttgtt tcctaatctt aaaaaatctt 1081 taaagggcac ccatttttct tcagttaata atgtaaaaaa gactgcattg acatggttaa 1141 attcccagga ccctcagttc tttagggatg gactaaatgg ctggtatcat cgcttacaaa 1201 agtgtcttga acttgatgga gcttatgttg agaaataaag tttatatttt taatttttat 1261 cttttaattc cattttccac gaactttttg aagtcccctc g // LOCUS HSU50040 4147 bp mRNA PRI 25-APR-1996 DEFINITION Human signaling inositol polyphosphate 5 phosphatase SIP-110 mRNA, complete cds. ACCESSION U50040 NID g1245336 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4147) AUTHORS Kavanaugh,W.M., Pot,D.A., Chin,S.M., Deuter-Reinhard,M., Jefferson,A.B., Norris,F.A., Masiarz,F.R., Cousens,L.S., Majerus,P.W. and Williams,L.T. TITLE Multiple forms of an inositol polyphosphate 5-phosphatase form signaling complexes with Shc and Grb2 JOURNAL Curr. Biol. 6 (4), 438-445 (1996) MEDLINE 96298867 REFERENCE 2 (bases 1 to 4147) AUTHORS Pot,D.A. TITLE Direct Submission JOURNAL Submitted (26-FEB-1996) David A. Pot, Technologies, Chiron Corporation, 4560 Horton St., Emeryville, CA 94608-2916, USA FEATURES Location/Qualifiers source 1..4147 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="placenta" CDS 17..2947 /note="110 kDa protein" /codon_start=1 /product="signaling inositol polyphosphate 5 phosphatase SIP-110" /db_xref="PID:g1245337" /translation="MFTLSPAPREVIRTLPSLESLQRLFDQQLSPGLRPRPQVPGEAN PINMVSKLSQLTSLLSSIEDKVKALLHEGPESPHRPSLIPPVTFEVKAESLGIPQKMQ LKVDVESGKLIIKKSKDGSEDKFYSHKKILQLIKSQKFLNKLVILVETEKEKILRKEY VFADSKKREGFCQLLQQMKNKHSEQPEPDMITIFIGTWNMGNAPPPKKITSWFLSKGQ GKTRDDSADYIPHDIYVIGTQEDPLSEKEWLEILKHSLQEITSVTFKTVAIHTLWNIR IVVLAKPEHENRISHICTDNVKTGIANTLGNKGAVGVSFMFNGTSLGFVNSHLTSGSE KKLRRNQNYMNILRFLALGDKKLSPFNITHRFTHLFWFGDLNYRVDLPTWEAETIIQK IKQQQYADLLSHDQLLTERREQKVFLHFEEEEITFAPTYRFERLTRDKYAYTKQKATG MKYNLPSWCDRVLWKSYPLVHVVCQSYGSTSDIMTSDHSPVFATFEAGVTSQFVSKNG PGTVDSQGQIEFLRCYATLKTKSQTKFYLEFHSSCLESFVKSQEGENEEGSEGELVVK FGETLPKLKPIISDPEYLLDQHILISIKSSDSDESYGEGCIALRLEATETQLPIYTPL THHGELTGHFQGEIKLQTSQGKTREKLYDFVKTERDESSGPKTLKSLTSHDPMKQWEV TSRAPPCSGSSITEIINPNYMGVGPFGPPMPLHVKQTLSPDQQPTAWSYDQPPKDSPL GPCRGESPPTPPGQPPISPKKFLPSTANRGLPPRTQESRPSDLGKNAGDTLPQEDLPL TKPEMFENPLYGSLSSFPKPAPRKDQESPKMPRKEPPPCPEPGILSPSIVLTKAQEAD RGEGPGKQVPAPRLRSFTCSSSAEGRAAGGDKSQGKPKTPVSSQAPVPAKRPIKPSRS EINQQTPPTPTPRPPLPVKSPAVLHLQHSKGRDYRDNTELPHHGKHRPEEGPPGPLGR TAMQ" BASE COUNT 1010 a 1212 c 1103 g 822 t ORIGIN 1 cgcccactaa tccttgatgt tcaccttgtc ccctgccccc agagaagtca tccggaccct 61 cccatccctg gagtctctgc agaggttatt tgaccagcag ctctccccgg gcctccgtcc 121 acgtcctcag gttcctggtg aggccaatcc catcaacatg gtgtccaagc tcagccaact 181 gacaagcctg ttgtcatcca ttgaagacaa ggtcaaggcc ttgctgcacg agggtcctga 241 gtctccgcac cggccctccc ttatccctcc agtcaccttt gaggtgaagg cagagtctct 301 ggggattcct cagaaaatgc agctcaaagt cgacgttgag tctgggaaac tgatcattaa 361 gaagtccaag gatggttctg aggacaagtt ctacagccac aagaaaatcc tgcagctcat 421 taagtcacag aaatttctga ataagttggt gatcttggtg gaaacagaga aggagaagat 481 cctgcggaag gaatatgttt ttgctgactc caaaaagaga gaaggcttct gccagctcct 541 gcagcagatg aagaacaagc actcagagca gccggagccc gacatgatca ccatcttcat 601 cggcacctgg aacatgggta acgccccccc tcccaagaag atcacgtcct ggtttctctc 661 caaggggcag ggaaagacgc gggacgactc tgcggactac atcccccatg acatttacgt 721 gatcggcacc caagaggacc ccctgagtga gaaggagtgg ctggagatcc tcaaacactc 781 cctgcaagaa atcaccagtg tgacttttaa aacagtcgcc atccacacgc tctggaacat 841 ccgcatcgtg gtgctggcca agcctgagca cgagaaccgg atcagccaca tctgtactga 901 caacgtgaag acaggcattg caaacacact ggggaacaag ggagccgtgg gggtgtcgtt 961 catgttcaat ggaacctcct tagggttcgt caacagccac ttgacttcag gaagtgaaaa 1021 gaaactcagg cgaaaccaaa actatatgaa cattctccgg ttcctggccc tgggcgacaa 1081 gaagctgagt ccctttaaca tcactcaccg cttcacgcac ctcttctggt ttggggatct 1141 taactaccgt gtggatctgc ctacctggga ggcagaaacc atcatccaga aaatcaagca 1201 gcagcagtac gcagacctcc tgtcccacga ccagctgctc acagagagga gggagcagaa 1261 ggtcttccta cacttcgagg aggaagaaat cacgtttgcc ccaacctacc gttttgagag 1321 actgactcgg gacaaatacg cctacaccaa gcagaaagcg acagggatga agtacaactt 1381 gccttcctgg tgtgaccgag tcctctggaa gtcttatccc ctggtgcacg tggtgtgtca 1441 gtcttatggc agtaccagcg acatcatgac gagtgaccac agccctgtct ttgccacatt 1501 tgaggcagga gtcacttccc agtttgtctc caagaacggt cccgggactg ttgacagcca 1561 aggacagatt gagtttctca ggtgctatgc cacattgaag accaagtccc agaccaaatt 1621 ctacctggag ttccactcga gctgcttgga gagttttgtc aagagtcagg aaggagaaaa 1681 tgaagaagga agtgaggggg agctggtggt gaagtttggt gagactcttc caaagctgaa 1741 gcccattatc tctgaccctg agtacctgct agaccagcac atcctcatca gcatcaagtc 1801 ctctgacagc gacgaatcct atggcgaggg ctgcattgcc cttcggttag aggccacaga 1861 aacgcagctg cccatctaca cgcctctcac ccaccatggg gagttgacag gccacttcca 1921 gggggagatc aagctgcaga cctctcaggg caagacgagg gagaagctct atgactttgt 1981 gaagacggag cgtgatgaat ccagtgggcc aaagaccctg aagagcctca ccagccacga 2041 ccccatgaag cagtgggaag tcactagcag ggcccctccg tgcagtggct ccagcatcac 2101 tgaaatcatc aaccccaact acatgggagt ggggcccttt gggccaccaa tgcccctgca 2161 cgtgaagcag accttgtccc ctgaccagca gcccacagcc tggagctacg accagccgcc 2221 caaggactcc ccgctggggc cctgcagggg agaaagtcct ccgacacctc ccggccagcc 2281 gcccatatca cccaagaagt ttttaccctc aacagcaaac cggggtctcc ctcccaggac 2341 acaggagtca aggcccagtg acctggggaa gaacgcaggg gacacgctgc ctcaggagga 2401 cctgccgctg acgaagcccg agatgtttga gaaccccctg tatgggtccc tgagttcctt 2461 ccctaagcct gctcccagga aggaccagga atcccccaaa atgccgcgga aggaaccccc 2521 gccctgcccg gaacccggca tcttgtcgcc cagcatcgtg ctcaccaaag cccaggaggc 2581 tgatcgcggc gaggggcccg gcaagcaggt gcccgcgccc cggctgcgct ccttcacgtg 2641 ctcatcctct gccgagggca gggcggccgg cggggacaag agccaaggga agcccaagac 2701 cccggtcagc tcccaggccc cggtgccggc caagaggccc atcaagcctt ccagatcgga 2761 aatcaaccag cagaccccgc ccaccccgac gccgcggccg ccgctgccag tcaagagccc 2821 ggcggtgctg cacctccagc actccaaggg ccgcgactac cgcgacaaca ccgagctccc 2881 gcatcacggc aagcaccggc cggaggaggg gccaccaggg cctctaggca ggactgccat 2941 gcagtgaagc cctcagtgag ctgccactga gtcgggagcc cagaggaacg gcgtgaagcc 3001 actggaccct ctcccgggac ctcctgctgg ctcctcctgc ccagcttcct atgcaaggct 3061 ttgtgttttc aggaaagggc ctagcttctg tgtggcccac agagttcact gcctgtgaga 3121 cttagcacca agtgctgagg ctggaagaaa aacgcacacc agacgggcaa caaacagtct 3181 gggtccccag ctcgctcttg gtacttggga ccccagtgcc tcgttgaggg cgccattctg 3241 aagaaaggaa ctgcagcgcc gatttgaggg tggagatata gataataata atattaataa 3301 taataatggc cacatggatc gaacactcat gatgtgccaa atgctgtgct aagtgcttta 3361 cgaacattcg tcatatcagg atgacctcga gagctgaggc tctagcacct aaaaccacgt 3421 gcccaaaccc accagtttaa aacggtgtgt gttcggaggg gtgaaagcat taagaagccc 3481 agtgccctcc tggagtgaga caagggctcg gccttaagga gctgaagagt ctgggtagct 3541 tgtttagggt acaagaagcc tgttctgtcc agcttcagtg acacaagctg ctttagctaa 3601 agtcccgcgg gttccggcat ggctaggctg agagcaggga tctacctggc ttctcagttc 3661 tttggttgga aggagcagga aatcagctcc tattctccag tggagagatc tggcctcagc 3721 ttgggctaga gatgccaagg cctgtgccag gttccctgtg ccctcctcga ggtgggcagc 3781 catcaccagc cacagttaag ccaagccccc caacatgtat tccatcgtgc tggtagaaga 3841 gtctttgctg ttgctcccga aagccgtgct ctccagcctg gctgccaggg agggtgggcc 3901 tcttggttcc aggctcttga aatagtgcag ccttttcttc ctatctctgt ggctttcagc 3961 tctgcttcct tggttattag gagaatagat gggtgatgtc tttccttatg ttgctttttc 4021 aacatagcag aattaatgta gggagctaaa tccagtggtg tgtgtgaatg cagaagggaa 4081 tgcaccccac attcccatga tggaagtctg cgtaaccaat aaattgtgcc tttctcactc 4141 aaaaccc // LOCUS HSU50062 2617 bp DNA PRI 25-APR-1996 DEFINITION Human RIP protein kinase gene, complete cds. ACCESSION U50062 NID g1236942 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2617) AUTHORS Hsu,H., Huang,J., Shu,H.B., Baichwal,V. and Goeddel,D.V. TITLE TNF-dependent recruitment of the protein kinase RIP to the TNF receptor-1 signaling complex JOURNAL Immunity 4 (4), 387-396 (1996) MEDLINE 96200892 REFERENCE 2 (bases 1 to 2617) AUTHORS Huang,J., Hsu,H., Baichwal,V.R. and Goeddel,D.V. TITLE Direct Submission JOURNAL Submitted (26-FEB-1996) Vijay R. Baichwal, Biology, Tularik Inc., 270 East Grand Avenue, South San Francisco, CA 94080, USA FEATURES Location/Qualifiers source 1..2617 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="umbilical vein endothelium" CDS 1..2016 /note="Ser/Thr protein kinase; protein has death domain sequence at the carboxyl terminus" /codon_start=1 /product="RIP protein kinase" /db_xref="PID:g1236943" /translation="MQPDMSLNVIKMKSSDFLESAELDSGGFGKVSLCFHRTQGLMIM KTVYKGPNCIEHNEALLEEAKMMNRLRHSRVVKLLGVIIEEGKYSLVMEYMEKGNLMH VLKAEMSTPLSVKGRIIWEIIEGMCYLHGKGVIHKDLKPENILVDNDFHIKIADLGLA SFKMWSKLNNEEHNELREVDGTAKKNGGTLYYMAPEHLNDVNAKPTEKSDVYSFAVVL WAIFANKEPYENAICEQQLIMCIKSGNRPDVDDITEYCPREIISLMKLCWEANPEARP TFPGIEEKFRPFYLSQLEESVEEDVKSLKKEYSNENAVVKRMQSLQLDCVAVPSSRSN SATEQPGSLHSSQGLGMGPVEESWFAPSLEHPQEENEPSLQSKLQDEANYHLYGSRMD RQTKQQPRQNVAYNREEERRRRVSHDPFAQQRPYENFQNTEGKGTVYSSAASHGNAVH QPSGLTSQPQVLYQNNGLYSSHGFGTRPLDPGTAGPRVWYRPIPSHMPSLHNIPVPET NYLGNTPTMPFSSLPPTDESIKYTIYNSTGIQIGAYNYMEIGGTSSSLLDSTNTNFKE EPAAKYQAIFDNTTSLTDKHLDPIRENLGKHWKNCARKLGFTQSQIDEIDHDYERDGL KEKVYQMLQKWVMREGIKGATVGKLAQALHQCSRIDLLSSLIYVSQN" BASE COUNT 794 a 586 c 660 g 573 t 4 others ORIGIN 1 atgcaaccag acatgtcctt gaatgtcatt aagatgaaat ccagtgactt cctggagagt 61 gcagaactgg acagcggagg ctttgggaag gtgtctctgt gtttccacag aacccaggga 121 ctcatgatca tgaaaacagt gtacaagggg cccaactgca ttgagcacaa cgaggccctc 181 ttggaggagg cgaagatgat gaacagactg agacacagcc gggtggtgaa gctcctgggc 241 gtcatcatag aggaagggaa gtactccctg gtgatggagt acatggagaa gggcaacctg 301 atgcacgtgc tgaaagccga gatgagtact ccgctttctg taaaaggaag gataatttgg 361 gaaatcattg aaggaatgtg ctacttacat ggaaaaggcg tgatacacaa ggacctgaag 421 cctgaaaata tccttgttga taatgacttc cacattaaga tcgcagacct cggccttgcc 481 tcctttaaga tgtggagcaa actgaataat gaagagcaca atgagctgag ggaagtggac 541 ggcaccgcta agaagaatgg cggcaccctc tactacatgg cgcccgagca cctgaatgac 601 gtcaacgcaa agcccacaga gaagtcggat gtgtacagct ttgctgtagt actctgggcg 661 atatttgcaa ataaggagcc atatgaaaat gctatctgtg agcagcagtt gataatgtgc 721 ataaaatctg ggaacaggcc agatgtggat gacatcactg agtactgccc aagagaaatt 781 atcagtctca tgaagctctg ctgggaagcg aatccggaag ctcggccgac atttcctggc 841 attgaagaaa aatttaggcc tttttattta agtcaattag aagaaagtgt agaagaggac 901 gtgaagagtt taaagaaaga gtattcaaac gaaaatgcag ttgtgaagag aatgcagtct 961 cttcaacttg attgtgtggc agtaccttca agccggtcaa attcagccac agaacagcct 1021 ggttcactgc acagttccca gggacttggg atgggtcctg tggaggagtc ctggtttgct 1081 ccttccctgg agcacccaca agaagagaat gagcccagcc tgcagagtaa actccaagac 1141 gaagccaact accatcttta tggcagccgc atggacaggc agacgaaaca gcagcccaga 1201 cagaatgtgg cttacaacag agaggaggaa aggagacgca gggtctccca tgaccctttt 1261 gcacagcaaa gaccttacga gaattttcag aatacagagg gaaaaggcac tgtttattcc 1321 agtgcagcca gtcatggtaa tgcagtgcac cagccctcag ggctcaccag ccaacctcaa 1381 gtactgtatc agaacaatgg attatatagc tcacatggct ttggaacaag accactggat 1441 ccaggaacag caggtcccag agtttggtac aggccaattc caagtcatat gcctagtctg 1501 cataatatcc cagtgcctga gaccaactat ctaggaaata cacccaccat gccattcagc 1561 tccttgccac caacagatga atctataaaa tataccatat acaatagtac tggcattcag 1621 attggagcct acaattatat ggagattggt gggacgagtt catcactact agacagcaca 1681 aatacgaact tcaaagaaga gccagctgct aagtaccaag ctatctttga taataccact 1741 agtctgacgg ataaacacct ggacccaatc agggaaaatc tgggaaagca ctggaaaaac 1801 tgtgcccgta aactgggctt cacacagtct cagattgatg aaattgacca tgactatgag 1861 cgagatggac tgaaagaaaa ggtttaccag atgctccaaa agtgggtgat gagggaaggc 1921 ataaagggag ccacggtggg gaagctggcc caggcgctcc accagtgttc caggatcgac 1981 cttctgagca gcttgattta cgtcagccag aactaaccct ggatgggcta cggcagctga 2041 agtggacgcc tcacttagcg gataacccca gaaagttggc tgcctcagag cattcagaat 2101 tctgtcctca ctgatagggg ttctgtgtct gcagaaattt ngtttcctgt acttcatagc 2161 tggagaatgg ggaaagaaat ctgcagcaaa ggggtctcac tctgttgcca ggctggtctc 2221 aaacttctgg actcaagtga tcctcccgcc tcggccttcc aaagtgctgg gatatcaggc 2281 actgagccac tgcgcccagt caacaatccg ntctgaggaa agcgtaagca ggaagacctc 2341 ttaatggcat agcaccaata aaaaaatgac tcctagttgt gtttggaaag ggagagaaga 2401 gatgtctgag gaaggtcatg ttctttcagc ttatggcatt tcctagagtt tngttgaagc 2461 aagaagaaaa actcagagaa tataaaatca actttnaaaa ttgtgtgctc tcttcttcac 2521 gtaggctcct gttaaaaaca aagtgcagtc agattctaag ccctgttcag agacttcgcg 2581 gatcacagct gcagctcacc gccacatcac aggatcc // LOCUS HSU50078 15171 bp mRNA PRI 29-SEP-1997 DEFINITION Human guanine nucleotide exchange factor p532 mRNA, complete cds. ACCESSION U50078 NID g1477564 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 15171) AUTHORS Rosa,J.L., Casaroli-Marano,R.P., Buckler,A.J., Vilaro,S. and Barbacid,M. TITLE p619, a giant protein related to the chromosome condensation regulator RCC1, stimulates guanine nucleotide exchange on ARF1 and Rab proteins JOURNAL EMBO J. 15 (16), 4262-4273 (1996) MEDLINE 97015127 REFERENCE 2 (bases 1 to 15171) AUTHORS Rosa,J.L. and Barbacid,M. TITLE A giant protein that stimulates guanine nucleotide exchange on ARF1 and Rab proteins forms a cytosolic ternary complex with clathrin and Hsp70 JOURNAL Oncogene 15 (1), 1-6 (1997) MEDLINE 97377001 REFERENCE 3 (bases 1 to 15171) AUTHORS Rosa,J.L. TITLE Direct Submission JOURNAL Submitted (27-FEB-1996) Jose L. Rosa, Unitat Bioquimica, Campus de Bellvitge, Universitat de Barcelona, Feixa Llarga s/n (Pavello Central), Hospitalet, Barcelona 08907, Spain FEATURES Location/Qualifiers source 1..15171 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 97..14682 /codon_start=1 /product="p532" /db_xref="PID:g1477565" /translation="MATMIPPVKLKWLEHLNSSWITEDSESIATREGVAVLYSKLVSN KEVVPLPQQVLCLKGPQLPDFERESLSSDEQDHYLDALLSSQLALAKMVCSDSPFAGA LRKRLLVLQRVFYALSNKYHDKGKVKQQQHSPESSSGSADVHSVSERPRSSTDALIEM GVRTGLSLLFALLRQSWMMPVSGPGLSLCNDVIHTAIEVVSSLPPLSLANESKIPPMG LDCLSQVTTFLKGVTIPNSGADTLGRRLASELLLGLAAQRGSLRYLLEWIEMALGASA VVHTMEKGKLLSSQEGMISFDCFMTILMQMRRSLGSSADRSQWREPTRTSDGLCSLYE AALCLFEEVCRMASDYSRTCASPDSIQTGDAPIVSETCEVYVWGSNSSHQLVEGTQEK ILQPKLAPSFSDAQTIEAGQYCTFVISTDGSVRACGKGSYGRLGLGDSNNQSTLKKLT FEPHRSIKKVSSSKGSDGHTLAFTTEGEVFSWGDGDYGKLGHGNSSTQKYPKLIQGPL QGKVVVCVSAGYRHSAAVTEDGELYTWGEGDFGRLGHGDSNSRNIPTLVKDISNVGEV SCGSSHTIALSKDGRTVWSFGGGDNGKLGHGDTNRVYKPKVIEALQGMFIRKVCAGSQ SSLALTSTGQVYAWGCGACLGCGSSEATALRPKLIEELAATRIVDVSIGDSHCLALSH DNEVYAWGNNSMGQCGQGNSTGPITKPKKVSGLDGIAIQQISAGTSHSLAWTALPRDR QVVAWHRPYCVDLEESTFSHLRSFLERYCDKINSEIPPLPFPSSREHHSFLKLCLKLL SNHLALALAGGVATSILGRQAGPLRNLLFRLMDSTVPDEIQEVVIETLSVGATMLLPP LRERMELLHSLLPQGPDRWESLSKGQRMQLDIILTSLQDHTHVASLLGYSSPSDAADL SSVCTGYGNLSDQPYGTQSCHPDTHLAEILMKTLLRNLGFYTDQAFGELEKNSDKFLL GTSSSENSQPAHLHELLCSLQKQLLAFCHINNISENSSSVALLHKHLQLLLPHATDIY SRSANLLKESPWNGSVGEKLRDVIYVSAAGSMLCQIVNSLLLLPVSVARPLLSYLLDL LPPLDCLNRLLPAADLLEDQELQWPLHGGPELIDPAGLPLPQPAQSWVWLVDLERTIA LLIGRCLGGMLQGSPVSPEEQDTAYWMKTPLFSDGVEMDTPQLDKCMSCLLEVALSGN EEQKPFDYKLRPEIAVYVDLALGCSKEPARSLWISMQDYAVSKDWDSATLSNESLLDT VSRFVLAALLKHTNLLSQACGESRYQPGKHLSEVYRCVYKVRSRLLACKNLELIQTRS SSRDRWISENQDSADVDPQEHSFTRTIDEEAEMEEQAERDREEGHPEPEDEEEEREHE VMTAGKIFQCFLSAREVARSRDRDRMNSGAGSGARADDPPPQSQQERRVSTDLPEGQD VYTAACNSVIHRCALLILGVSPVIDELQKRREEGQLQQPSTSASEGGGLMTRSESLTA ESRLVHTSPNYRLIKSRSESDLSQPESDEEGYALSGRQNVDLDLAASHRKRGPMHSQL ESLSDSWARLKHSRDWLCNSSYSFESDFDLTKSLGVHTLIENVVSFVSGDVGNAPGFK EPEESMSTSPQASIIAMEQQQLRAELRLEALHQILVLLSGMEEKGSISLAGSRLSSGF QSSTLLTSVRLQFLAGCFGLGTVGHTGAKGESGRLHHYQDGIRAAKRNIQIEIQVAVH KIYQQLSATLERALQANKHHIEAQQRLLLVTVFALSVHYQPVDVSLAISTGLLNVLSQ LCGTDTMLGQPLQLLPKTGVSQLSTALKVASTRLLQILAITTGTYADKLSPKVVQSLL DLLCSQLKNLLSQTGVLHMASFGEGEQEDGEEEEKKVDSSGETEKKDFRAALRKQHAA ELHLGDFLVFLRRVVSSKAIQSKMASPKWTEVLLNIASQKCSSGIPLVGNLRTRLLAL HVLEAVLPACESGVEDDQMAQIVERLFSLLSDCMWETPIAQAKHAIQIKEKEQEIKLQ KQGELEEEDENLPIQEVSFDPEKAQCCLVENGQILTHGSGGKGYGLASTGVTSGCYQW KFYIVKENRGNEGTCVGVSRWPVHDFNHRTTSDMWLYRAYSGNLYHNGEQTLTLSSFT QGDFITCVLDMEARTISFGKNGEEPKLAFEDVDAAELYPCVMFYSSNPGEKVKICDMQ MRGTPRDLLPGDPICSPVAAVLAEATIQLVRILHRTDRWTYCINKKMMERLHKIKICI KESGQKLKKSRSVQSREENEMREEKESKEEEKGKHTRHGLADLSELQLRTLCIEVWPV LAVIGGVDAGLRVGGRCVHKQTGRHATLLGVVKEGSTSAKVQWDEAEITISFPTFWSP SDTPLYNLEPCEPLPFDVARFRGLTASVLLDLTYLTGVHEDMGKQSTKRHEKKHRHES EEKGDVEQKPESESALDMRTGLTSDDVKSQSTTSSKSENEIASFSLDPTLPSVESQHQ ITEGKRKNHEHMSKNHDVAQSEIRAVQLSYLYLGAMKSLSALLGCSKYAELLLIPKVL AENGHNSDCASSPVVHEDVEMRAALQFLMRHMVKRAVMRSPIKRALGLADLERAQAMI YKLVVHGLLEDQFGGKIKQEIDQQAEESDPAQQAQTPVTTSPSASSTTSFMSSSLEDT TTATTPVTDTETVPASESPGVMPLSLLRQMFSSYPTTTVLPTRRAQTPPISSLPTSPS DEVGRRQSLTSPDSQSARPANRTALSDPSSRLSTSPPPPAIAVPLLEMGFSLRQIAKA MEATGARGEADAQNITVLAMWMIEHPGHEDEEEPQSGSTADSRPGAAVLGSGGKSNDP CYLQSPGDIPSADAAEMEEGFSESPDNLDHTENAASGSGPSARGRSAVTRRHKFDLAA RTLLARAAGLYRSVQAHRNQSRREGISLQQDPGALYDFNLDEELEIDLDDEAMEAMFG QDLTSDNDILGMWIPEVLDWPTWHVCESEDREEVVVCELCECSVVSFNQHMKRNHPGC GRSANRQGYRSNGSYVDGWFGGECGSGNPYYLLCGTCREKYLAMKTKSKSTSSERYKG QAPDLIGKQDSVYEEDWDMLDVDEDEKLTGEEEFELLAGPLGLNDRRIVPEPVQFPDS DPLGASVAMVTATNSMEETLMQIGCHGSVEKSSSGRITLGEQAAALANPHDRVVALRR VTAAAQVLLARTMVMRALSLLSVSGSSCSLAAGLESLGLTDIRTLVRLMCLAAAGRAG LSTSPSAMASTSERSRGGHSKANKPISCLAYLSTAVGCLASNAPSAAKLLVQLCTQNL ISAATGVNLTTVDDSIQRKFLPSFLRGIAEENKLVTSPNFVVTQALVALLADKGAKLR PNYDKSEVEKKGPLELANALAACCLSSRLSSQHRQWAAQQLVRTLAAHDRDNQTTLQT LADMGGDLRKCSFIKLEAHQNRVMTCVWCNKKGLLATSGNDGTIRVWNVTKKQYSLQQ TCVFNRLEGDAEESLGSPSDPSFSPVSWSISGKYLAGALEKMVNIWQVNGGKGLVDIQ PHWVSALAWPEEGPATAWSGESPELLLVGRMDGSLGLIEVVDVSTMHRRELEHCYRKD VSVTCIAWFSEDRPFAVGYFDGKLLLGTKEPLEKGGIVLIDAHKDTLISMKWDPTGHI LMTCAKEDSVKLWGSISGCWCCLHSLCHPSIVNGIAWCRLPGKGSKLQLLMATGCQSG LVCVWRIPQDTTQTNVTSAEGWWDQESNCQDGYRKSSGAKCVYQLRGHITPVRTVAFS SDGLALVSGGLGGLMNIWSLRDGSVLQTVVIGSGAIQTTVWIPEVGVAACSNRSKDVL VVNCTAEWAAANHVLATCRTALKQQGVLGLNMAPCMRAFLERLPMMLQEQYAYEKPHV VCGDQLVHSPYMQCLASLAVGLHLDQLLCNPPVPPHHQNCLPDPASWNPNEWAWLECF STTIKAAEALTNGAQFPESFTVPDLEPVPEDELVFLMDNSKWINGMDEQIMSWATSRP EDWHLGGKCDVYLWGAGRHGQLAEAGRNVMVPAAAPSFSQAQQVICGQNCTFVIQANG TVLACGEGSYGRLGQGNSDDLHVLTVISALQGFVVTQLVTSCGSDGHSMALTESGEVF SWGDGDYGKLGHGNSDRQRRPRQIEALQGEEVVQMSCGFKHSAVVTSDGKLFTFGNGD YGRLGLGNTSNKKLPERVTALEGYQIGQVACGLNHTLAVSADGSMVWAFGDGDYGKLG LGNSTAKSSPQKIDVLCGIGIKKVACGTQFSVALTKDGHVYTFGQDRLIGLPEGRARN HNRPQQIPVLAGVIIEDVAVGAEHTLALASNGDVYAWGSNSEGQLGLGHTNHVREPTL VTGLQGKNVRQISAGRCHSAAWTAPPVPPRAPGVSVPLQLGLPDTVPPQYGALREVSI HTVRARLRLLYHFSDLMYSSWRLLNLSPNNQNSTSHYNAGTWGIVQGQLRPLLAPRVY TLPMVRSIGKTMVQGKNYGPQITVKRISTRGRKCKPIFVQIARQVVKLNASDLRLPSR AWKVKLVGEGADDAGGVFDDTITEMCQELETGIVDLLIPSPNATAEVGYNRDRFLFNP SACLDEHLMQFKFLGILMGVAIRTKKPLDLHLAPLVWKQLCCVPLTLEDLEEVDLLYV QTLNSILHIEDSGITEESFHEMIPLDSFVGQSADGKMVPIIPGGNSIPLTFSNRKEYV ERAIEYRLHEMDRQVAAVREGMSWIVPVPLLSLLTAKQLEQMVCGMPEISVEVLKKVV RYREVDEQHQLVQWFWHTLEEFSNEERVLFMRFVSGRSRLPANTADISQRFQIMKVDR PYDSLPTSQTCFFQLRLPPYSSQLVMAERLRYAINNCRSIDMDNYMLSRNVDNAEGSD TDY" BASE COUNT 4132 a 3278 c 3849 g 3912 t ORIGIN 1 gaattccgcc tctgcggagc cgggctcggg tcgccggagc cgcgccccac cccgccagct 61 ccagagccac gactaatggc tgaaggataa atcaacatgg caactatgat tccaccagtg 121 aagctgaaat ggcttgaaca cttgaacagc tcctggatta cagaggacag tgaatctatt 181 gctacaagag agggagttgc tgttctgtat tctaaactgg ttagcaataa ggaagtagta 241 cctttgcccc aacaagtttt atgcctcaaa ggaccacagt tgccagactt tgaacgtgag 301 tctctttcaa gtgatgagca ggaccactat ttggatgccc ttcttagcag ccagctagca 361 ttggcaaaga tggtatgttc agattcccca tttgccgggg cacttagaaa acgactgctt 421 gtactccagc gtgtctttta tgcactttct aataaatacc atgacaaagg caaggtgaag 481 cagcagcagc attctccgga gagcagttct ggttcagcag atgtccattc tgttagtgaa 541 cgcccccggt caagcactga tgcacttata gaaatgggtg ttcgaactgg tctaagttta 601 ttatttgcgc ttctaagaca aagttggatg atgcctgtgt caggacctgg tctcagtctt 661 tgcaacgatg tcattcatac tgcaattgaa gttgtgagct ctttgccacc attatcatta 721 gcaaatgaaa gcaagattcc tcctatgggc ttggactgct tatcgcaagt aacaacattt 781 cttaaaggag tcactattcc taattctggg gcagacactt taggtcgtag attagcttct 841 gagttgctgc ttggtttggc agctcaacga ggctcattgc gatatcttct tgaatggata 901 gaaatggctt tgggggcttc ggcagttgta cacaccatgg agaaaggcaa actactctca 961 agccaggaag gaatgatcag ctttgactgc tttatgacca tattaatgca gatgaggcgt 1021 tctttgggtt catctgctga tcggagtcag tggagagaac caaccagaac atcggatggc 1081 ttgtgctccc tttacgaggc agcattatgt ctctttgaag aggtttgcag aatggcttct 1141 gattattcga gaacatgtgc tagcccagat agcattcaga ctggtgatgc tcccattgtc 1201 tccgaaacct gtgaggttta tgtttggggg agcaatagca gccatcagtt ggtagaaggt 1261 acacaggaga aaatactgca acccaaactg gctcctagtt tctctgatgc acagaccatt 1321 gaagctggac agtactgcac ttttgtcatt tctacggatg gctcagttag agcttgcggg 1381 aaaggcagct atgggagact gggccttgga gactccaata atcagtcaac tttaaaaaag 1441 ttaacattcg agcctcacag atccattaaa aaggtttcat cttctaaagg atctgatggt 1501 cacactttag cctttacgac agaaggagaa gtcttcagtt ggggagatgg tgattatggg 1561 aaactggggc atggaaatag ttcaacacag aaatatccca agcttattca gggacctcta 1621 caaggaaagg tagttgtttg tgtgtcagct ggatacagac atagtgctgc tgtcacagag 1681 gatggggaat tatacacatg gggtgaagga gactttggaa gattaggtca tggtgacagc 1741 aatagtcgta acattccaac attagtaaaa gacatcagca atgtaggaga ggtttcttgt 1801 ggcagttcac atactattgc tctgtctaaa gatgggagaa ctgtatggtc ttttggagga 1861 ggagacaatg gtaaacttgg tcatggtgat accaacagag tgtataaacc taaagttatt 1921 gaagctttac aaggaatgtt cattcgcaaa gtttgtgctg ggagccagtc ttcacttgct 1981 ttgacatcaa cagggcaggt ctatgcttgg ggctgtggag cttgtctagg ttgtggttct 2041 tcagaagcta ctgctttgag acccaagctt attgaagaac tggctgccac aagaatagtt 2101 gatgtttcta ttggagacag tcattgtttg gctctttctc atgataatga agtttatgcc 2161 tggggcaata actcaatggg gcaatgtggt cagggaaatt ccacaggtcc tattactaaa 2221 ccaaagaaag tgagtggctt agatggcata gctattcagc agatttcggc tggaacatca 2281 catagtctgg catggactgc tcttcctagg gacagacaag ttgttgcatg gcaccgacct 2341 tattgtgtag atcttgaaga gagtaccttc tcacacctgc gttcttttct tgagagatac 2401 tgtgataaaa taaacagtga gattccccca ctccctttcc cttcatcaag agaacaccac 2461 agttttctca agctgtgcct gaagctactt tcaaatcacc ttgctcttgc acttgcggga 2521 ggggtagcta ccagcattct cgggaggcag gcaggtccac ttcgaaattt gctcttcaga 2581 ctgatggact caactgtccc agatgaaatc caagaggtgg taattgaaac tttatcagtg 2641 ggagcaacca tgctgttacc tccattacga gaacggatgg aattacttca ttctctttta 2701 cctcaaggac ctgatagatg ggaaagctta tctaaaggac agagaatgca actggatatc 2761 atcctgacaa gtttgcaaga tcatacccac gtagcctccc tacttggcta tagttcaccc 2821 tctgatgctg ctgacctatc ttctgtgtgt actggctacg gaaatctgtc agatcaacct 2881 tacggcactc agagctgcca tccagatacc cacctggctg aaattttgat gaagaccctc 2941 ttaagaaatt taggatttta tacagatcaa gcatttggag agctagaaaa gaatagtgat 3001 aaatttctac ttggaacatc atcatcagaa aacagtcagc ctgctcatct tcatgaactg 3061 ctatgttcac tacagaaaca gctgctggca ttttgccata tcaataacat tagtgagaac 3121 tcaagcagtg tggcattgct tcataaacat cttcagcttt tgttgcctca tgccacagat 3181 atttattcac gttctgcaaa tttgctcaaa gaaagtcctt ggaatggcag tgttggagaa 3241 aaattaagag atgtgatata cgtctcagct gctggcagta tgctctgcca gattgttaac 3301 tccctgctgt tactccctgt gtcagtggct cggcctttat tgagttacct cctcgacttg 3361 ttgccacctc ttgattgcct taatagactc ctgccagctg ctgatctttt agaagaccag 3421 gagttacagt ggcctcttca tggagggcca gaactaattg atcctgctgg tctgccatta 3481 cctcagccag ctcagtcctg ggtatggctt gtggatctag aaagaacaat tgctctcctt 3541 attgggcggt gtcttggtgg catgcttcag ggctcccctg tgtctccaga ggaacaggac 3601 actgcatatt ggatgaaaac gccactgttc agtgacggtg tagaaatgga cactcctcaa 3661 ttggataaat gtatgagttg cctgttagaa gtagcacttt ctggaaatga agaacagaag 3721 ccttttgatt ataaattgcg gcctgaaatt gctgtctatg tagacttggc attgggttgt 3781 tctaaagagc ctgcccgaag cctttggatc agcatgcagg actatgctgt tagtaaagat 3841 tgggacagtg caactttaag taatgagtca ctcttggaca ctgtgtctag atttgttctt 3901 gcagctcttc tgaaacacac aaatttactt agtcaagcat gtggagaaag ccgatatcaa 3961 cctggtaaac acttatcaga agtgtaccgt tgtgtataca aagttcgaag tcgtttactt 4021 gcttgcaaga accttgaact tattcaaaca aggtcatcat cacgggacag atggatatca 4081 gaaaaccagg actctgcaga tgttgatcct caggagcatt catttactcg aactattgat 4141 gaagaagctg aaatggaaga acaggctgag agagaccggg aagaggggca tccggagcca 4201 gaggatgaag aggaggaacg ggaacatgaa gtgatgacag ctggcaaaat ctttcagtgt 4261 ttcctctcag cccgtgaagt agctcgtagc cgagaccgag atagaatgaa cagtggggca 4321 gggtctgggg ctcgagctga tgatccacct cctcagtctc agcaagagcg aagggtcagc 4381 acagaccttc ctgagggtca ggatgtgtac actgctgcat gcaactccgt gatccatcgg 4441 tgtgccctgt taatattagg agtaagtcct gtgatagatg agcttcagaa gcgaagagaa 4501 gaaggacagt tgcagcaacc ttcaacaagt gcctctgaag ggggtggact tatgaccagg 4561 agtgaaagtc ttactgcaga gagccggcta gtccacacaa gcccaaatta tagactgatc 4621 aaatcgagga gtgaatctga tttgtctcag cctgaatcag atgaagaggg ttacgcactg 4681 agtggcagac aaaatgttga tttggatttg gcagcatctc acagaaagag aggtcctatg 4741 cacagtcaat tggaatccct gagtgactct tgggctcgcc tgaaacatag cagagactgg 4801 ttatgcaact cctcctattc ctttgagtca gattttgatc ttaccaagtc tttgggagtt 4861 cacactttga ttgaaaatgt tgtaagcttt gtgagtggag atgtggggaa tgccccaggt 4921 tttaaagagc cagaggaaag tatgtctaca agtccccagg cctccatcat tgcaatggaa 4981 cagcagcagt taagggcaga acttcgttta gaggcacttc atcagatcct cgttctattg 5041 tctgggatgg aagaaaaagg tagcatctca ctggcaggaa gcagattgag ttcaggcttc 5101 cagtcctcca cactactcac gtctgtgagg ctgcagttcc tagcagggtg ttttggttta 5161 ggcactgttg gacacacagg agccaaggga gagagtggcc gattgcatca ctatcaggat 5221 gggatcagag cagctaagag aaatattcag attgaaatcc aggtagctgt gcataaaatt 5281 tatcaacagt tgtctgctac cctggaaaga gccctgcaag caaacaagca tcacattgaa 5341 gcccagcaac gtctgcttct ggttacagtt tttgccctaa gtgttcatta tcaaccagta 5401 gatgtttctt tggcaatttc cactggtctg ctaaacgtat tgtcacagtt gtgtggtaca 5461 gacaccatgc taggacagcc cctgcagttg ttgccaaaga cgggtgtttc ccagcttagc 5521 acagctttga aagtggccag tacaaggttg ctccagattc tagccatcac tactgggacc 5581 tatgctgata aactgagtcc caaagtagtt caatccttgt tggatctact ctgtagtcag 5641 ttgaagaatt tattgtccca aactggtgta ctacatatgg cctctttcgg agaaggggag 5701 caagaagacg gtgaagaaga agaaaaaaaa gttgactcca gtggagaaac tgagaagaaa 5761 gatttcagag ctgctcttag gaaacaacat gcagccgaac tccatctagg ggatttttta 5821 gtttttcttc gcagagttgt atcttcaaaa gcaattcaat caaaaatggc ttccccaaag 5881 tggaccgaag tgcttctaaa tatagcatct cagaaatgtt cttcaggtat ccctctggtt 5941 ggtaacttaa gaacaaggct ccttgcactt catgtccttg aagctgtgct gccagcttgt 6001 gaatctggtg tagaagatga tcaaatggcc cagattgttg agcgcttatt ttcccttctc 6061 tctgattgta tgtgggagac acccattgct caggccaaac atgctattca gataaaggaa 6121 aaagaacaag aaataaaact acagaagcag ggcgagttgg aagaagaaga tgagaatctt 6181 cctatccaag aagtatcctt tgacccggag aaagctcagt gttgcctagt ggagaatgga 6241 cagattttaa ctcacggcag tggagggaaa ggatatggat tggcatctac aggagtaact 6301 tctgggtgct atcagtggaa gttttatatt gtgaaggaaa acagaggtaa tgaaggcacg 6361 tgtgttggag tttctcgctg gccagtacat gactttaatc accgcactac ctcggatatg 6421 tggctctata gggcctacag tggtaacctc tatcacaatg gagaacagac tctcacattg 6481 tccagcttta ctcaaggaga tttcattacc tgtgtgttag acatggaagc caggaccatt 6541 tcttttggga aaaatggaga ggaacccaaa ttagcttttg aagatgtgga tgcagcagag 6601 ttgtacccat gtgtgatgtt ctatagtagc aatccagggg aaaaggtgaa aatttgtgat 6661 atgcagatgc gtggcacacc ccgagactta cttccaggag accctatttg tagtccagta 6721 gcagcagtgc tggctgaggc cactattcag ctcgtccgta tccttcaccg aacagaccgt 6781 tggacttact gcattaacaa aaaaatgatg gaaaggcttc acaaaattaa gatatgtatt 6841 aaagagtcag gtcagaagct aaagaaaagc cgctcggttc agagccgaga ggaaaatgaa 6901 atgagagagg agaaggagag caaagaggaa gagaaaggta aacatactag gcatggcctc 6961 gctgacctct cagagctgca gctgaggact ctttgcatag aggtgtggcc cgtgctggct 7021 gtgataggag gagttgatgc tggtcttaga gttggaggtc ggtgtgttca caagcaaact 7081 gggcgccatg ccacgctgct gggagtggtc aaagagggca gcacgtctgc caaggtccaa 7141 tgggatgaag cagaaattac tatcagcttc ccaacttttt ggtcgcctag tgatactcca 7201 ttgtataatc tggaaccctg tgaaccattg ccgtttgatg tggcgcgatt ccgaggcctg 7261 acggcttctg tgctgctgga cctaacatat ctcactggcg ttcatgaaga catgggcaaa 7321 cagagcacca aacgacatga aaagaaacac cgacatgaat ccgaggagaa aggggatgtt 7381 gagcagaaac ctgagagtga atccgcttta gatatgcgaa caggcctaac atctgatgac 7441 gtcaaaagtc agagtaccac aagctccaaa tcagaaaatg aaatcgcttc attttcttta 7501 gatccaacac tgccaagtgt ggaatcccaa catcaaataa cagaagggaa aagaaaaaat 7561 catgaacaca tgtccaaaaa ccatgatgta gcccagtcag aaatcagagc agtccagctg 7621 tcctatcttt acctcggtgc tatgaagtca cttagtgccc ttcttggctg tagtaaatat 7681 gctgagctgt tgctgatacc aaaagttctg gctgaaaatg gccacaactc agactgtgca 7741 agttctccag ttgttcatga agacgtggag atgcgagcag ccctgcagtt cttgatgcga 7801 cacatggtga agcgagcagt catgcggtca cccataaaga gagcattggg attagctgat 7861 ctggaacgag cgcaagccat gatctataaa ttagtggttc atgggctttt ggaagaccag 7921 tttgggggca aaattaagca agagattgat caacaagctg aagaaagtga ccctgcccag 7981 caggcacaga caccagttac tactagccca tcagcctcaa gcacgacctc ctttatgagc 8041 agctctctgg aggacaccac aactgccacc actccagtca ctgacacaga aacagtgcct 8101 gcatccgagt ccccgggagt gatgcctctt agtcttctca ggcaaatgtt ctctagttac 8161 ccaactacca ctgtacttcc cacacgtcgg gcacagactc ctccaatatc ttcgttacca 8221 acctctcctt ctgatgaagt aggaaggagg caaagtttaa cttctcctga ttcccagtca 8281 gcaaggccag ctaaccgcac agccttgtca gacccaagca gtagactttc aacttctcct 8341 cctcctccag caattgcagt tcccttgctg gaaatggggt tctctcttcg gcagattgcc 8401 aaagccatgg aagctacagg tgctagggga gaggctgatg cccagaatat cactgtcctt 8461 gccatgtgga tgatagagca ccctgggcat gaggatgaag aggagcccca gtcgggcagc 8521 acagcagact ctaggcctgg agcagccgtt ctaggcagtg gcgggaagtc aaatgatccc 8581 tgttatttgc agtcacctgg agacatacca tcagctgatg ctgctgaaat ggaggaaggt 8641 tttagtgaaa gccctgataa tttggatcat acagagaatg cagcttctgg aagtggacca 8701 tcagctagag gtcgctcagc ggtaacaaga agacacaagt ttgacttagc tgctcgcaca 8761 ctgctagcaa gagcagcggg attataccgc tctgtgcagg cccacaggaa tcaaagtcgg 8821 agagaaggaa tatctttgca gcaagaccca ggggcgttgt atgactttaa tttagatgag 8881 gaattggaaa ttgatcttga tgatgaggcg atggaagcta tgtttggaca agacctgacc 8941 agtgacaatg atattctggg aatgtggatc ccagaggtac tggattggcc tacctggcat 9001 gtttgtgagt ctgaagacag ggaagaagtg gtggtgtgtg aactgtgtga atgcagcgtc 9061 gtcagcttca atcagcacat gaagagaaac catccaggct gtgggcgcag tgcaaaccgc 9121 cagggctatc gcagcaatgg ttcctatgtg gatggctggt ttggcggtga atgtgggagt 9181 ggaaatccat actacctgtt atgtggcacc tgcagggaga agtacttagc catgaagacc 9241 aaatctaagt caacaagttc tgaaaggtac aagggacaag ctccagatct aattggcaag 9301 caagacagtg tgtatgaaga agactgggac atgttggatg ttgatgaaga tgaaaagcta 9361 actggtgaag aagaatttga attacttgct ggaccgcttg gtttaaatga ccggcgcatt 9421 gtaccagaac cagttcagtt ccctgacagc gatccactgg gagcatcagt agcaatggtc 9481 acagccacca acagtatgga agagactctg atgcaaatag gttgccatgg ctccgtagaa 9541 aagagctcct ctgggagaat aacgttagga gagcaggcag ctgccctagc aaaccctcat 9601 gaccgtgtgg tggctttaag gagagtgact gctgctgctc aggttcttct ggccagaacc 9661 atggtcatga gagcgctgtc tcttctctca gtcagtggtt ccagttgtag cctggctgct 9721 ggtcttgagt ctctggggct aacagatatc cgaacgctag ttcgattaat gtgcttggca 9781 gcagcaggga gagctggcct ctccaccagc ccttctgcca tggctagcac ctcagaacga 9841 tcacgaggtg ggcatagcaa ggctaacaag cctatctctt gcctggccta tttgagcaca 9901 gcagtgggat gtctggcatc aaatgctcct agtgctgcca aactgcttgt acagttgtgt 9961 acacagaact tgatttctgc tgcaacaggt gtaaatctaa ccacagttga tgactcaatt 10021 cagcgaaagt ttctacccag ctttctccga ggaattgctg aagagaacaa gcttgtgacc 10081 tccccaaact ttgttgtaac acaggccctt gtggcattgc tagcagacaa aggggccaaa 10141 ctaagaccta actatgataa gtcagaagtt gaaaagaaag gccctctgga gttggctaat 10201 gccctggcag cctgctgcct ctcctccagg ctgtcctcac agcatcggca atgggcagct 10261 cagcaactcg tgcgcactct tgctgcacac gaccgtgaca accaaactac tctgcagaca 10321 cttgctgata tgggaggaga tcttagaaaa tgctccttta tcaaattgga ggctcatcag 10381 aacagagtaa tgacatgtgt ttggtgtaat aaaaaaggtc ttttggctac aagtggcaat 10441 gatggcacca tccgcgtatg gaatgttacc aagaagcaat attcactgca acagacctgt 10501 gtgttcaaca gattggaagg ggatgctgag gaaagcctgg gatcacccag tgatccaagt 10561 ttctcaccag tttcctggag tatcagtggc aaatatctag caggcgcttt ggaaaagatg 10621 gtgaatatct ggcaagttaa tggaggaaaa ggattagtag atattcagcc tcattgggta 10681 tctgccctgg cttggccaga agagggtccg gctacagcct ggtcaggaga gtctccagaa 10741 ttgttgttgg tgggacggat ggatggatct ctgggactga ttgaagttgt tgatgtgtcc 10801 accatgcacc gtcgagaatt ggagcattgc tatcgaaagg atgtgtctgt tacttgcatt 10861 gcatggttca gtgaagacag accatttgca gtgggatatt ttgatggaaa actgttactg 10921 ggaacaaagg aaccacttga gaaaggaggc attgttctaa ttgatgcaca taaggatact 10981 cttattagca tgaagtggga ccctacaggt catattctta tgacatgtgc caaagaagac 11041 agtgtgaaac tctggggctc tatttcggga tgctggtgct gtctacattc actctgccat 11101 ccatctattg taaatggcat tgcttggtgc cgccttccag ggaaaggatc caagttgcag 11161 ttactgatgg ctactggctg tcagagtggc ttagtatgtg tttggcgcat tcctcaagat 11221 actacacaga ccaatgtgac tagtgcagaa ggatggtggg accaggaatc aaattgccag 11281 gatggatata ggaaatcatc aggagccaag tgtgtttatc agctgcgggg acacatcact 11341 cctgttcgga ctgttgcctt tagttctgat gggttggccc tggtgtctgg tggactaggt 11401 gggctcatga acatttggtc tttaagggat ggctctgtct tgcaaactgt tgtgataggc 11461 tctggagcta ttcagaccac agtatggatt ccagaagttg gagtagctgc ttgctcaaat 11521 agatcaaagg atgttttggt cgtgaattgt acagcagaat gggcagctgc caatcatgtt 11581 ttggcaacct gtaggacagc attgaaacag cagggtgttc tgggattgaa catggctccc 11641 tgcatgagag catttttgga gcggctcccc atgatgcttc aggagcagta tgcctatgaa 11701 aagcctcatg tggtttgtgg tgaccaactt gttcatagcc cctatatgca atgcttggct 11761 tcccttgctg tgggacttca tctggatcag ctgttgtgta accctccagt gccaccacac 11821 caccagaact gtctgcctga ccctgcatcc tggaatccaa atgaatgggc ctggttagaa 11881 tgtttctcaa ccactataaa agctgccgaa gccctgacca atggagccca gtttccagaa 11941 tcttttaccg ttccagatct agaacctgtt ccagaggatg aacttgtatt tctaatggat 12001 aacagtaaat ggattaacgg catggatgaa caaattatgt cttgggcaac ttccagacct 12061 gaggactggc acctgggagg taaatgtgat gtctacttat ggggtgctgg taggcatgga 12121 cagctggcag aagctggaag aaatgtaatg gtacctgcag cagctccctc attctcacag 12181 gcccaacagg tcatttgtgg tcagaattgt acctttgtca tccaggccaa tggcacagtg 12241 ttggcttgtg gggaaggaag ttatggcaga ttaggacaag gaaattcaga tgaccttcat 12301 gtgctgacag ttatttcagc cttacaaggc tttgtggtga cccagctggt gacttcctgt 12361 ggttctgatg ggcactctat ggccctaact gaaagtggtg aggtctttag ctggggagat 12421 ggtgactatg gtaaacttgg ccatgggaac agcgacaggc agcggcggcc caggcagatc 12481 gaggccttac aaggagaaga agtggtgcag atgtcttgtg gcttcaagca ctcagcagtg 12541 gtcacttcag atggcaaact gttcaccttt gggaatggtg actatggtcg tctgggtctt 12601 ggaaatacct ctaacaaaaa acttccagag agagtgactg cactggaggg atatcagatt 12661 ggacaggtgg cctgtggatt aaaccacact ttggcagtgt cagcagatgg ttccatggtg 12721 tgggcttttg gagatggaga ctatggaaaa ctaggcttag gaaattccac tgcaaaatct 12781 tcacctcaga aaattgacgt cctttgtgga attggaataa aaaaggttgc ttgtggaact 12841 cagttttctg ttgctttgac caaagatggt catgtgtata cctttggtca agatcgcctg 12901 ataggcttgc cagaggggcg tgctcgcaat cacaatcgac cgcaacaaat ccctgtcctg 12961 gctggagtaa tcattgaaga tgtggcagtt ggagctgaac acacacttgc tttggcatca 13021 aatggagatg tgtatgcctg ggggagcaat tcagaagggc agctcggctt aggccatacc 13081 aaccatgttc gagaaccaac cctggtaaca ggtctgcaag ggaaaaatgt tcggcagatc 13141 tcggctggcc gctgccacag tgctgcatgg acagcaccac ctgtcccacc aagagcacca 13201 ggtgtgtcag tacctctgca gctgggcctg cctgacacag tgccccccca gtatggggcg 13261 ctgagagaag tcagcattca cacggtgcgg gccaggctcc ggctgctcta ccacttctct 13321 gacctcatgt actcatcctg gagactgctg aaccttagcc ccaacaacca gaacagcaca 13381 tcccattata atgctggaac ttggggcatt gtacagggac aacttcggcc tttgttagcc 13441 ccaagagtct acactctgcc aatggtgcgc tccataggaa aaaccatggt tcaaggcaaa 13501 aactatggac ctcagataac tgtaaagagg atatcaacca gaggacggaa gtgtaagcct 13561 atttttgtcc aaatagcgag acaagtagtt aagctgaatg cttcagacct ccgcctgcct 13621 tcccgagcgt ggaaggttaa gctggttgga gaaggggctg atgatgctgg aggagtgttt 13681 gatgacacca tcacagagat gtgccaggaa cttgaaactg gtattgttga ccttcttata 13741 ccctctccca atgccaccgc agaagtgggt tacaataggg acaggttcct ttttaaccct 13801 tctgcctgcc tcgatgaaca cttaatgcag tttaagtttt taggaatttt aatgggggtt 13861 gccattcgca caaagaagcc tctggacctc cacttggccc ctctggtgtg gaagcagctg 13921 tgctgtgtcc cactcaccct agaggacctg gaggaggtgg atctgctcta cgtgcagact 13981 ctcaacagca ttcttcacat tgaagacagt gggattaccg aggagagttt ccatgagatg 14041 attcctcttg attcttttgt tggccagagt gctgatggca aaatggttcc tataatccct 14101 ggtggaaata gtatcccact cacattttcc aacaggaagg aatatgtgga gagggccatt 14161 gaatatcgac ttcatgagat ggacagacag gtggctgcag tccgagaagg gatgtcctgg 14221 attgttcctg tgccgctgct gtccctcctc acagcaaaac aactggagca gatggtgtgt 14281 gggatgcccg agatctctgt ggaagtcttg aagaaagtgg tgcggtaccg tgaggtggat 14341 gagcagcatc agctggtgca gtggttctgg cacacgctgg aagagttctc caatgaggag 14401 cgggtgcttt tcatgaggtt tgtgtcagga agatctcgac taccagccaa cactgctgac 14461 atttctcaga gatttcaaat catgaaggtt gataggcctt acgacagtct gcctacctca 14521 cagacctgct tcttccagct gaggctgccc ccgtactcca gccagctggt catggccgag 14581 cgcctgcgct atgccatcaa caactgccgc tcaatcgaca tggacaacta catgctctcg 14641 agaaacgtgg acaacgccga gggctccgac actgactact gaccgtgcgg gtgctctcac 14701 cctcccttct ctccctcaat aatgctcact tctgatttga tgttgatata cttttatggt 14761 aactacatag atgttataag aacataaacc aacattataa acaatggcca catttagtta 14821 ctctaaatgt aacaaagaaa ttagatgttt ttatttttct gtgattgtac aaaaacaaca 14881 aaaacgaagt gctctcagtc aggtttttcc ctccatattt ttggtcactt ttgataagtt 14941 tgcatgaaac cattttggtg catttttagt tgggaatggt acatttttgt aaatccaccc 15001 agtgaacatg aaattgtaca ttgtgtataa ttgttcatta gaaaggacag ttttacatga 15061 atattcatat atttattttg ttttaatttg aattgcctgt tcagggttcc ttatgcagag 15121 aaataaagca gattcaggaa ttggaaaaaa aaaaaaaaaa aaaaggaatt c // LOCUS HSU50079 1611 bp mRNA PRI 16-MAY-1996 DEFINITION Human histone deacetylase HD1 mRNA, complete cds. ACCESSION U50079 NID g1277083 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1611) AUTHORS Taunton,J., Hassig,C.A. and Schreiber,S.L. TITLE A mammalian histone deacetylase related to the yeast transcriptional regulator Rpd3p JOURNAL Science 272 (5260), 408-411 (1996) MEDLINE 96185499 REFERENCE 2 (bases 1 to 1611) AUTHORS Taunton,J., Hassig,C.A. and Schreiber,S.L. TITLE Direct Submission JOURNAL Submitted (27-FEB-1996) Jack Taunton, Chemistry, Harvard University, 12 Oxford, Cambridge, MA 02138, usa FEATURES Location/Qualifiers source 1..1611 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="Jurkat T cell" CDS 111..1559 /note="similar to S. cerevisiae RPD3, a global transcriptional regulator; trapoxin receptor" /codon_start=1 /product="histone deacetylase HD1" /db_xref="PID:g1277084" /translation="MAQTQGTRRKVCYYYDGDVGNYYYGQGHPMKPHRIRMTHNLLLN YGLYRKMEIYRPHKANAEEMTKYHSDDYIKFLRSIRPDNMSEYSKQMQRFNVGEDCPV FDGLFEFCQLSTGGSVASAVKLNKQQTDIAVNWAGGLHHAKKSEASGFCYVNDIVLAI LELLKYHQRVLYIDIDIHHGDGVEEAFYTTDRVMTVSFHKYGEYFPGTGDLRDIGAGK GKYYAVNYPLRDGIDDESYEAIFKPVMSKVMEMFQPSAVVLQCGSDSLSGDRLGCFNL TIKGHAKCVEFVKSFNLPMLMLGGGGYTIRNVARCWTYETAVALDTEIPNELPYNDYF EYFGPDFKLHISPSNMTNQNTNEYLEKIKQRLFENLRMLPHAPGVQMQAIPEDAIPEE SGDEDEDDPDKRISICSSDKRIACEEEFSDSEEEGEGGRKNSSNFKKAKRVKTEDEKE KDPEEKKEVTEEEKTKEEKPEAKGVKEEVKLA" BASE COUNT 428 a 385 c 440 g 358 t ORIGIN 1 atgtctgggg tctctgcccg ctggtgctgc tgtctcccac tcggtcatcc tgagaacaca 61 gcctgagcgt ctctgtcact cggggtagac cacgcgggga ggcgagcaag atggcgcaga 121 cgcagggcac ccggaggaaa gtctgttact actacgacgg ggatgttgga aattactatt 181 atggacaagg ccacccaatg aagcctcacc gaatccgcat gactcataat ttgctgctca 241 actatggtct ctaccgaaaa atggaaatct atcgccctca caaagccaat gctgaggaga 301 tgaccaagta ccacagcgat gactacatta aattcttgcg ctccatccgt ccagataaca 361 tgtcggagta cagcaagcag atgcagagat tcaacgttgg tgaggactgt ccagtattcg 421 atggcctgtt tgagttctgt cagttgtcta ctggtggttc tgtggcaagt gctgtgaaac 481 ttaataagca gcagacggac atcgctgtga attgggctgg gggcctgcac catgcaaaga 541 agtccgaggc atctggcttc tgttacgtca atgatatcgt cttggccatc ctggaactgc 601 taaagtatca ccagagggtg ctgtacattg acattgatat tcaccatggt gacggcgtgg 661 aagaggcctt ctacaccacg gaccgggtca tgactgtgtc ctttcataag tatggagagt 721 acttcccagg aactggggac ctacgggata tcggggctgg caaaggcaag tattatgctg 781 ttaactaccc gctccgagac gggattgatg acgagtccta tgaggccatt ttcaagccgg 841 tcatgtccaa agtaatggag atgttccagc ctagtgcggt ggtcttacag tgtggctcag 901 actccctatc tggggatcgg ttaggttgct tcaatctaac tatcaaagga cacgccaagt 961 gtgtggaatt tgtcaagagc tttaacctgc ctatgctgat gctgggaggc ggtggttaca 1021 ccattcgtaa cgttgcccgg tgctggacat atgagacagc tgtggccctg gatacggaga 1081 tccctaatga gcttccatac aatgactact ttgaatactt tggaccagat ttcaagctcc 1141 acatcagtcc ttccaatatg actaaccaga acacgaatga gtacctggag aagatcaaac 1201 agcgactgtt tgagaacctt agaatgctgc cgcacgcacc tggggtccaa atgcaggcga 1261 ttcctgagga cgccatccct gaggagagtg gcgatgagga cgaagacgac cctgacaagc 1321 gcatctcgat ctgctcctct gacaaacgaa ttgcctgtga ggaagagttc tccgattctg 1381 aagaggaggg agaggggggc cgcaagaact cttccaactt caaaaaagcc aagagagtca 1441 aaacagagga tgaaaaagag aaagacccag aggagaagaa agaagtcacc gaagaggaga 1501 aaaccaagga ggagaagcca gaagccaaag gggtcaagga ggaggtcaag ttggcctgaa 1561 tggacctctc cagctctggc ttcctgctga gtccctcacg tttctttccc c // LOCUS HSU50196 1809 bp mRNA PRI 25-APR-1996 DEFINITION Human adenosine kinase mRNA, complete cds. ACCESSION U50196 NID g1224124 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1809) AUTHORS Spychala,J., Datta,N.S., Takabayashi,K., Datta,M., Fox,I.H., Gribbin,T. and Mitchell,B.S. TITLE Cloning of human adenosine kinase cDNA: sequence similarity to microbial ribokinases and fructokinases JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (3), 1232-1237 (1996) MEDLINE 96165550 REFERENCE 2 (bases 1 to 1809) AUTHORS Spychala,J., Datta,S.N., Takabayashi,K., Fox,I.H., Gribbin,T. and Mitchell,B.S. TITLE Direct Submission JOURNAL Submitted (28-FEB-1996) Jozef Spychala, Pharmacology, University of North Carolina, 1106 FLOB, Chapel Hill, NC 27599-7365, USA FEATURES Location/Qualifiers source 1..1809 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 184..1221 /EC_number="2.7.1.20" /codon_start=1 /product="adenosine kinase" /db_xref="PID:g1224125" /translation="MTSVRENILFGMGNPLLDISAVVDKDFLDKYSLKPNDQILAEDK HKELFDELVKKFKVEYHAGGSTQNSIKVAQWMIQQPHKAATFFGCIGIDKFGEILKRK AAEAHVDAHYYEQNEQPTGTCAACITGDNRSLIANLAAANCYKKEKHLDLEKNWMLVE KARVCYIAGFFLHVSPESVLKVAHHASENNRIFTLNLSAPFISQFYKESLMKVMPYVD ILFGNETEAATFAREQGFETKDIKEIAKKTQALPKMNSKRQRIVIFTQGRDDTIMATE SEVTAFAVLDQDQKEIIDTNGAGDAFVGGFLSQLVSDKPLTECIRAGHYAASIIIRRT GCTFPEKPDFH" BASE COUNT 571 a 346 c 378 g 514 t ORIGIN 1 cgccttccct ccaatcagca ccggggccgg ctagccaggg gccggccgcg cggggtgtgt 61 gaggacgcgc tcccagtcgc tgagtgcctg agccgggaag cagttgctgt ggtacctgcg 121 ctgcccgagc ggacgtagag catcggacgc gggcgccgtg gcgttgggca ggaggcgaag 181 ccaatgacgt cagtcagaga aaatattctc tttggaatgg gaaatcctct gcttgacatc 241 tctgctgtag tggacaaaga tttccttgat aagtattctc tgaaaccaaa tgaccaaatc 301 ttggctgaag acaaacacaa ggaactgttt gatgaacttg tgaaaaaatt caaagtcgaa 361 tatcatgctg gtggctctac ccagaattca attaaagtgg ctcagtggat gattcaacag 421 ccacacaaag cagcaacatt ttttggatgc attgggatag ataaatttgg ggagatcctg 481 aagagaaaag ctgctgaagc ccatgtggat gctcattact acgagcagaa tgagcagcca 541 acaggaactt gtgctgcatg catcactggt gacaacaggt ccctcatagc taatcttgct 601 gctgccaatt gttataaaaa ggaaaaacat cttgatctgg agaaaaactg gatgttggta 661 gaaaaagcaa gagtttgtta tatagcaggc ttttttcttc acgtttcccc agagtcagta 721 ttaaaggtgg ctcaccatgc ttctgaaaac aacaggattt tcactttgaa tctatctgca 781 ccgtttatta gccagttcta caaggaatca ttgatgaaag ttatgcctta tgttgatata 841 ctttttggaa atgagacaga agctgccact tttgctagag agcaaggctt tgagactaaa 901 gacattaaag agatagccaa aaagacacaa gccctgccaa agatgaactc aaagaggcag 961 cgaatcgtga tcttcaccca agggagagat gacactataa tggctacaga aagtgaagtc 1021 actgcttttg ctgtcttgga tcaagaccag aaagaaatta ttgataccaa tggagctgga 1081 gatgcatttg ttggaggttt tctgtctcaa ctggtctctg acaagcctct gactgaatgt 1141 atccgtgctg gccactatgc agcaagcatc ataattagac ggactggctg cacctttcct 1201 gagaagccag acttccactg atggaagagc tgaaaacaca agcccaggag tgcagacact 1261 gccctaattg cttcctgaca attcccatat taataaagaa gaaaattatc tgccattttt 1321 tcctactata ataatgctga atcttaattt agagggtaca agggtatggt aatgcttgta 1381 gaatctttat tatctcaaca atctaaaaaa tgatgtttat ttccatagtt tgatagtgcc 1441 acttaaatgc caattaaaca agaatataac atttcaatag aaatttttat ttcattttca 1501 attactttgt aaattcgtgt gtatttagta cactgatttg ttttttacat ttctgctttg 1561 aatgcagatg caatttaata taatagattt tttaatgaat taatcttaac atagtaatct 1621 ttagcttttt atacaaatat atttaattta ggagtatatg tgtgtctata cacacacata 1681 cataaatata ccacatatac acctgatagt caaataaggt acagaaattt tatcttgtca 1741 attatgccaa ataatctctt taatgtgcac tcaacatgta ataaactttg gataattaaa 1801 aaaaaaaaa // LOCUS HSU50352 1539 bp mRNA PRI 06-APR-1996 DEFINITION Human sodium channel 1 (BNC1) mRNA, complete cds. ACCESSION U50352 NID g1256016 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1539) AUTHORS Price,M.P., Snyder,P.M. and Welsh,M.J. TITLE Cloning and expression of a novel human brain Na+ channel JOURNAL J. Biol. Chem. 271 (14), 7879-7882 (1996) MEDLINE 96215169 REFERENCE 2 (bases 1 to 1539) AUTHORS Price,M.P., Snyder,P.M. and Welsh,M.J. TITLE Direct Submission JOURNAL Submitted (29-FEB-1996) Margaret P. Price, Internal Medicine, University of Iowa, 500 EMRB, Iowa City, IA 52242, USA FEATURES Location/Qualifiers source 1..1539 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" gene 1..1539 /gene="BNC1" CDS 1..1539 /gene="BNC1" /note="amiloride sensitive sodium channel" /codon_start=1 /product="sodium channel 1" /db_xref="PID:g1256017" /translation="MDLKESPSEGSLQPSSIQIFANTSTLHGIRHIFVYGPLTIRRVL WAVAFVGSLGLLLVESSERVSYYFSYQHVTKVDEVVAQSLVFPAVTLCNLNGFRFSRL TTNDLYHAGELLALLDVNLQIPDPHLADPSVLEALRQKANFKHYKPKQFSMLEFLHRV GHDLKDMMLYCKFKGQECGHQDFTTVFTKYGKCYMFNSGEDGKPLLTTVKGGTGNGLE IMLDIQQDEYLPIWGETEETTFEAGVKVQIHSQSEPPFIQELGFGVAPGFQTFVATQE QRLTYLPPPWGECRSSEMGLDFFPVYSITACRIDCETRYIVENCNCRMVHMPGDAPFC TPEQHKECAEPALGLLAEKDSNYCLCRTPCNLTRYNKELSMVKIPSKTSAKYLEKKFN KSEKYISENILVLDIFFEALNYETIEQKKAYEVAALLGDIGGQMGLFIGASILTILEL FDYIYELIKEKLLDLLGKEEDEGSHDENVSTCDTMPNHSETISHAVNVPLQTTLGTLE EIAC" BASE COUNT 379 a 406 c 409 g 345 t ORIGIN 1 atggacctca aggaaagccc cagtgagggc agcctgcaac cttctagcat ccagatcttt 61 gccaacacct ccaccctcca tggcatccgc cacatcttcg tgtatgggcc gctgaccatc 121 cggcgtgtgc tgtgggcagt ggccttcgtg ggctctctgg gcctgctgct ggtggagagc 181 tctgagaggg tgtcctacta cttctcctac cagcatgtca ctaaggtgga cgaagtggtg 241 gctcaaagcc tggtcttccc agctgtgacc ctctgtaacc tcaatggctt ccggttctcc 301 aggctcacca ccaacgacct gtaccatgct ggggagctgc tggccctgct ggatgtcaac 361 ctgcagatcc cggaccccca tctggctgac ccctccgtgc tggaggccct gcggcagaag 421 gccaacttca agcactacaa acccaagcag ttcagcatgc tggagttcct gcaccgtgtg 481 ggccatgacc tgaaggatat gatgctctac tgcaagttca aagggcagga gtgcggccac 541 caagacttca ccacagtgtt tacaaaatat gggaagtgtt acatgtttaa ctcaggcgag 601 gatggcaaac ctctgctcac cacggtcaag ggggggacag gcaacgggct ggagatcatg 661 ctggacattc agcaggatga gtacctgccc atctggggag agacagagga aacgacattt 721 gaagcaggag tgaaagttca gatccacagt cagtctgagc cacctttcat ccaagagctg 781 ggctttgggg tggctccagg gttccagacc tttgtggcca cacaggagca gaggctcaca 841 tacctgcccc caccgtgggg tgagtgccga tcctcagaga tgggcctcga cttttttcct 901 gtttacagca tcaccgcctg taggattgac tgtgagaccc gctacattgt ggaaaactgc 961 aactgccgca tggttcacat gccaggggat gccccttttt gtacccctga gcagcacaag 1021 gagtgtgcag agcctgccct aggtctgttg gcggaaaagg acagcaatta ctgtctctgc 1081 aggacaccct gcaacctaac ccgctacaac aaagagctct ccatggtgaa gatccccagc 1141 aagacatcag ccaagtacct tgagaagaaa tttaacaaat cagaaaaata tatctcagag 1201 aacatccttg ttctggatat attttttgaa gctctcaatt atgagacaat tgaacagaag 1261 aaggcgtatg aagttgctgc cttacttggt gatattggtg gtcagatggg attgttcatt 1321 ggtgctagta tccttacaat actagagctc tttgattata tttatgagct gatcaaagag 1381 aagctattag acctgcttgg caaagaggag gacgaaggga gccacgatga gaatgtgagt 1441 acttgtgaca caatgccaaa ccactctgaa accatcagtc acgctgtgaa cgtgcccctg 1501 cagacgaccc tggggacctt ggaggagatt gcctgctga // LOCUS HSU50410 2269 bp mRNA PRI 02-APR-1996 DEFINITION Human heparan sulphate proteoglycan (OCI5) mRNA, complete cds. ACCESSION U50410 NID g1245416 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2269) AUTHORS Hamid,J., Shen,T., Sonoda,G., Li,M., Filmus,J., Testa,J.R. and Buick,R.N. TITLE Molecular cloning and mapping of a human cDNA for the glypican-related transcript OCI5 JOURNAL Unpublished REFERENCE 2 (bases 1 to 2269) AUTHORS Hamid,J., Shen,T., Sonoda,G., Li,M., Filmus,J., Testa,J.R. and Buick,R.N. TITLE Direct Submission JOURNAL Submitted (03-MAR-1996) Ronald N. Buick, Medical Biophysics, Ontario Cancer Institute and University of Toronto, 610 University Ave, Totonto, Ontario, M5G 2M9, Canada FEATURES Location/Qualifiers source 1..2269 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="CaCo-2" /chromosome="X" /map="Xq26-27" gene 145..1887 /gene="OCI5" CDS 145..1887 /gene="OCI5" /note="OCI-5" /codon_start=1 /product="heparan sulphate proteoglycan" /db_xref="PID:g1245417" /translation="MAGTVRTACLVVAMLLSLDFPGQAQPPPPPPDATCHQVRSFFQR LQPGLKWVPETPVPGSDLQVCLPKGPTCCSRKMEEKYQLTARLNMEQLLQSASMELKF LIIQNAAVFQEAFEIVVRHAKNYTNAMFKNNYPSLTPQAFEFVGEFFTDVSLYILGSD INVDDMVNELFDSLFPVIYTQLMNPGLPDSALDINECLRGARRDLKVFGNFPKLIMTQ VSKSLQVTRIFLQALNLGIEVINTTDHLKFSKDCGRMLTRMWYCSYCQGLMMVKPCGG YCNVVMQGCMAGVVEIDKYWREYILSLEELVNGMYRIYDMENVLLGLFSTIHDSIQYV QKNAGKLTTTIGKLCAHSQQRQYRSAYYPEDLFIDKKVLKVAHVEHEETLSSRRRELI QKLKSFISFYSALPGYICSHSPVAENDTLCWNGQELVERYSQKAARNGMKNQFNLHEL KMKGPEPVVSQIIDKLKHINQLLRTMSMPKGRVLDKNLDEEGFESGDCGDDEDECIGG SGDGMIKVKNQLRFLAELAYDLDVDDAPGNSQQATPKDNEISTFHNLGNVHSPLKLLT SMAISVVCFFFLVH" BASE COUNT 601 a 541 c 529 g 598 t ORIGIN 1 aggtagctgc gaggaaactt ttgcagcggc tgggtagcag cacgtctctt gctcctcagg 61 gccactgcca ggcttgccga gtcctgggac tgctctcgct ccggctgcca ctctcccgcg 121 ctctcctagc tccctgcgaa cgagatggcc gggaccgtgc gcaccgcgtg cttggtggtg 181 gcgatgctgc tcagcttgga cttcccggga caggcgcagc ccccgccgcc gccgccggac 241 gccacctgtc accaagtccg ctccttcttc cagagactgc agcccggact caagtgggtg 301 ccagaaactc ccgtgccagg atcagatttg caagtatgtc tccctaaggg cccaacatgc 361 tgctcaagaa agatggaaga aaaataccaa ctaacagctc gattgaacat ggaacagctg 421 cttcagtctg caagtatgga gctcaagttc ttaattattc agaatgctgc ggttttccaa 481 gaggcctttg aaattgttgt tcgccatgcc aagaactaca ccaatgccat gttcaagaac 541 aactacccaa gcctgactcc acaagctttt gagtttgtgg gtgaattttt cacagatgtg 601 tctctctaca tcttgggttc tgacatcaat gtagatgaca tggtcaatga attgtttgac 661 agcctgtttc cagtcatcta tacccagcta atgaacccag gcctgcctga ttcagccttg 721 gacatcaatg agtgcctccg aggagcaaga cgtgacctga aagtatttgg gaatttcccc 781 aagcttatta tgacccaggt ttccaagtca ctgcaagtca ctaggatctt ccttcaggct 841 ctgaatcttg gaattgaagt gatcaacaca actgatcacc tgaagttcag taaggactgt 901 ggccgaatgc tcaccagaat gtggtactgc tcttactgcc agggactgat gatggttaaa 961 ccctgtggcg gttactgcaa tgtggtcatg caaggctgta tggcaggtgt ggtggagatt 1021 gacaagtact ggagagaata cattctgtcc cttgaagaac ttgtgaatgg catgtacaga 1081 atctatgaca tggagaacgt actgcttggt ctcttttcaa caatccatga ttctatccag 1141 tatgtccaga agaatgcagg aaagctgacc accactattg gcaagttatg tgcccattct 1201 caacaacgcc aatatagatc tgcttattat cctgaagatc tctttattga caagaaagta 1261 ttaaaagttg ctcatgtaga acatgaagaa accttatcca gccgaagaag ggaactaatt 1321 cagaagttga agtctttcat cagcttctat agtgctttgc ctggctacat ctgcagccat 1381 agccctgtgg cggaaaacga caccctttgc tggaatggac aagaactcgt ggagagatac 1441 agccaaaagg cagcaaggaa tggaatgaaa aaccagttca atctccatga gctgaaaatg 1501 aagggccctg agccagtggt cagtcaaatt attgacaaac tgaagcacat taaccagctc 1561 ctgagaacca tgtctatgcc caaaggtaga gttctggata aaaacctgga tgaggaaggg 1621 tttgaaagtg gagactgcgg tgatgatgaa gatgagtgca ttggagggtc tggtgatgga 1681 atgataaaag tgaagaatca gctccgcttc cttgcagaac tggcctatga tctggatgtg 1741 gatgatgcgc ctggaaacag tcagcaggca actccgaagg acaacgagat aagcaccttt 1801 cacaacctcg ggaacgttca ttccccgctg aagcttctca ccagcatggc catctcggtg 1861 gtgtgcttct tcttcctggt gcactgactg cctggtgccc agcacatgtg ctgccctaca 1921 gcaccctgtg gtcttcctcg ataaagggaa ccactttctt atttttttct attttttttt 1981 ttttgttata cctggtatac ctcctccagc catgaagtag aggactaacc atgtgttatg 2041 ttttcgaaaa tcaaatggta tcttttggag gaagatacat tttagtggta gcatatagat 2101 tgtccttttg caaagaaaga aaaaaaacca tcaagttgtg ccaaattatt ctcctatgtt 2161 tggctgctag aacatggtta ccatgtcttt ctctctcact ccctcccttt ctatcgttct 2221 ctctttgcat ggatttcttt aaaaaaaata aattgctcaa ataaaaaca // LOCUS HSU50532 2115 bp mRNA PRI 27-NOV-1996 DEFINITION Human BRCA2 region, mRNA sequence CG005. ACCESSION U50532 NID g1531603 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2115) AUTHORS Couch,F.J., Rommens,J.M., Neuhausen,S.L., Belanger,C., Dumont,M., Kenneth,A., Bell,R., Berry,S., Bogden,R., Cannon-Albright,L., Farid,L., Frye,C., Hattier,T., Janecki,T., Jiang,P., Kehrer,R., Leblanc,J.-F., McArthur-Morrison,J., McSweeney,D., Miki,Y., Peng,Y., Samson,C., Schroeder,M., Snyder,S.C., Stringfellow,M., Stroup,C., Swedlund,B., Swensen,J., Teng,D., Thakur,S., Tran,T., Tranchant,M., Welver-Feldhaus,J., Wong,A.K.C., Shizuya,H., Labrie,F., Skolnick,M.H., Goldgar,D.E., Kamb,A., Weber,B.L., Tavtigian,S.V. and Simard,J. TITLE Generation of an integrated transcription map of the BRCA2 region on chromosome 13q12-q13 JOURNAL Genomics 36 (1), 86-99 (1996) MEDLINE 96411650 REFERENCE 2 (bases 1 to 2115) AUTHORS Simard,J. TITLE Direct Submission JOURNAL Submitted (04-MAR-1996) Jacques Simard, Laboratory of Molecular Endocrinology, CHUL Research Center, 2705, Boulevard Laurier, Quebec City, Quebec G1V 4G2, Canada FEATURES Location/Qualifiers source 1..2115 /organism="Homo sapiens" /note="CG005; DSEG numbers: D13S1701 and D13S1695" /db_xref="taxon:9606" /chromosome="13" /map="13q12-q13" CDS 169..1920 /codon_start=1 /product="unknown" /db_xref="PID:g1531604" /translation="MSYGEIEGKFLGPREEVTSEPRCKKLKSTTESYVFHNHSNADFH RIQEKTGNDWVPVTIIDVRGHSYLQENKIKTTDLHRPLHDEMPGNRPDVIESIDSQVL QEARPPLVSADDEIYSTSKAFIGPIYKPPEKKKRNEGRNEAHVLNGINDRGGQKEKQK FNSEKSEIDNELFQFYKEIEELEKEKDGFENSCKESEPSQEQFVPFYEGHNNGLLKPD EEKKDLSNKAMPSHCDYQQNLGNEPDKYPCNGQVIPTFCDTSFTSFRPEWQSVYPFIV PYGPPLPSLNYHLNIQRFSGPPNPPSNIFQAQDDSQIQNGYYVNNCHVNWNCMTFDQN NEYTDCSENRSSVHPSGNGCSMQDRYVSNGFCEVRERCWKDHCMDKHNGTDRFVNQQF QEEKLNKLQKLLILLRGLPGSGKTTLSRILLGQNRDGIVFSTDDYFHHQDGYRYNVNQ LGDAHDWNQNRAKQAIDQGRSPVIIDNTNIQAWEMKPYVEVAIGKGYRVEFHEPETWW KFDPEELEKRNKHGVSRKKIAQMLDRYEYQMSISIVMNSVEPSHKSTQRPPPPQGRQR WGGSLGSHNRVCVTNNH" BASE COUNT 722 a 349 c 461 g 583 t ORIGIN 1 gcggggaggt gaggtttgtt accgcgattc tgagaggtgg gcttttagtc cctccagacc 61 tcggctttag tgctgtctcc gcttttcttt caccttcaca gaggttcgtg tcttcctaaa 121 agaaggtttt attgggaggt aaaggtcaat gcgtaggggt agagtaagat gtcttatggt 181 gaaattgaag gtaaattctt gggacctaga gaagaagtaa cgagtgagcc acgctgtaaa 241 aaattgaagt caaccacaga gtcgtatgtt tttcacaatc atagtaatgc tgattttcac 301 agaatccaag agaaaactgg aaatgattgg gtccctgtga ccatcattga tgtcagagga 361 catagttatt tgcaggagaa caaaatcaaa actacagatt tgcatagacc tttgcatgat 421 gagatgcctg gtaatagacc agatgttatt gaatccattg attcacaggt tttacaggaa 481 gcacgtcctc cattagtatc cgcagacgat gagatatata gcacaagtaa agcatttata 541 ggacccattt acaaaccccc tgagaaaaag aaacgtaatg aagggaggaa tgaggcacat 601 gttctaaatg gtataaatga cagaggagga caaaaagaga aacagaaatt taactctgaa 661 aaatcagaga ttgacaatga attattccag ttttacaaag aaattgaaga gcttgaaaag 721 gaaaaagatg gttttgagaa cagttgtaaa gaatctgaac cttctcagga acaatttgtt 781 ccattttatg agggtcataa taatggtctc ttaaaacctg atgaagaaaa gaaagatctt 841 agtaataaag ctatgccatc acattgtgat tatcagcaga acttggggaa tgagccagac 901 aaatatccct gtaatggaca agtaatacct acattttgtg acacttcatt tacttctttc 961 aggcctgaat ggcagtcagt atatcctttt atagtgccct atggtccccc tcttcccagt 1021 ttgaactatc atttaaacat tcagagattc agtggtccac caaatccacc atcaaatatt 1081 ttccaagccc aagatgactc tcagatacaa aatggatatt atgtaaataa ttgtcatgtt 1141 aactggaatt gcatgacttt tgatcagaac aatgaatata ctgactgtag tgagaatagg 1201 agtagtgttc atccctctgg aaatggctgc agtatgcaag atcgatatgt gagtaatggt 1261 ttctgtgaag tcagagaaag atgctggaaa gatcattgta tggacaagca taatggaaca 1321 gacaggtttg tgaaccagca gtttcaagag gaaaagttaa ataaattgca gaagttactt 1381 attcttttaa gaggtctgcc tggttctggg aaaacaacat tgtctcgaat tctgcttggt 1441 cagaatcgtg atggcattgt gttcagcact gatgactatt ttcaccatca agatgggtac 1501 aggtataatg ttaatcaact tggtgatgcc catgactgga accagaacag agcaaaacaa 1561 gctatcgatc agggaagatc tccagttata atagataaca ctaatataca agcttgggaa 1621 atgaagccat atgtggaagt ggccatagga aaaggataca gagtagagtt tcatgaacct 1681 gaaacttggt ggaaatttga tcctgaagaa ttagaaaaga ggaataaaca tggtgtgtct 1741 cgaaagaaga ttgctcagat gttggatcgt tatgaatatc aaatgtccat ttctattgta 1801 atgaattcag tggaaccatc acacaaaagc acacaaagac ctcctcctcc acaggggaga 1861 cagaggtggg gaggctctct tggctcacat aatcgtgtct gtgtcacaaa taatcattaa 1921 attagctatt ttcagctaac acatttgttg ttgcacttga aaaagagtta gtgagcctgt 1981 cttggagttt aagtagtttc aaataaaaaa aggctacagt gcctcacaaa ggatgttccc 2041 agcaagttgt ttaaattccc agcaagttgt taaagtgtaa ataaaaatat atgaaattgt 2101 aaaaaaaaaa aaaaa // LOCUS HSU50733 1721 bp mRNA PRI 04-APR-1996 DEFINITION Human dynamitin mRNA, complete cds. ACCESSION U50733 NID g1255187 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1721) AUTHORS Echeverri,C.J., Paschal,B.M., Vaughan,K.T. and Vallee,R.B. TITLE Molecular characterization of the 50-kD subunit of dynactin reveals function for the complex in chromosome alignment and spindle organization during mitosis JOURNAL J. Cell Biol. 132 (4), 617-633 (1996) MEDLINE 96178072 REFERENCE 2 (bases 1 to 1721) AUTHORS Echeverri,C.J. TITLE Direct Submission JOURNAL Submitted (05-MAR-1996) Christophe J. Echeverri, Cell Biology, Worcester Foundation for Biomedical Research, 222 Maple Avenue, Shrewsbury, MA 01545, USA FEATURES Location/Qualifiers source 1..1721 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 79..1299 /note="similar to GenBank EST Accession Number T08494; p50 subunit of dynactin complex" /codon_start=1 /product="dynamitin" /db_xref="PID:g1255188" /translation="MADPKYADLPGIARNEPDVYETSDLPEDDQAEFDAFAQELEELT STSVEHIIVNPNAAYDKFKDKRVGTKGLDFSDRIGKTKRTGYESGEYEMLGEGLGVKE TPQQKYQRLLHEVQELTTEVEKIKTTVKESATEEKLTPVLLAKQLAALKQQLVASHLE KLLGPDAAINLTDPDGALAKRLLLQLEATKNSKGGSGGKTTGTPPDSSLVTYELHSRP EQDKFSQAAKVAELEKRLTELETAVRCDQDAQNPLSAGLQGACLMETVELLQAKVSAL DLAVLDQVEARLQSVLGKVNEIAKHKASVEDADTQSKVHQLYETIQRWSPIASTLPEL VQRLVTIKQLHEQAMQFGQLLTHLDTTQQMIANSLKDNTTLLTQVQTTMRENLATVEG NFASIDERMKKLGK" BASE COUNT 473 a 445 c 453 g 350 t ORIGIN 1 aacccagcct ctcccctacc cgaacaccgg ccccggctcc accgaggccc gggtccccca 61 gcccgtctcg ccgccgccat ggcggaccct aaatacgccg accttcccgg cattgccagg 121 aatgagccag atgtttatga aactagcgac ctacctgagg atgatcaagc ggagttcgat 181 gcgtttgcac aagagctgga ggagctgaca agcacaagtg tggaacacat cattgtcaat 241 cctaatgctg cctatgacaa gttcaaggac aagagagtgg ggacaaaggg acttgatttc 301 tcagatcgta ttggaaaaac caagaggaca ggatatgaat ctggagaata tgagatgctt 361 ggagagggtc tgggagtgaa ggagacaccc cagcaaaagt accagcgcct actgcatgag 421 gtccaagagc tgacaactga agttgaaaaa atcaagacga cagtgaagga gtcagccaca 481 gaggagaagc tgacccctgt gttgctggct aaacagctgg cagccctgaa gcagcagctg 541 gttgcttccc acctggagaa gctgctggga ccagatgctg caatcaacct taccgacccc 601 gatggcgccc tggctaagcg cctactactg cagctggaag caacaaagaa cagcaaaggg 661 ggatcagggg gaaaaaccac tgggaccccc ccagatagca gccttgtcac ttatgaacta 721 cattctcggc ctgagcagga caagttctct caagctgcca aagtcgcaga acttgaaaag 781 cgcctgacag agctggagac agctgtacgt tgtgatcagg atgctcagaa tcccctttct 841 gcaggtctac agggagcctg tctcatggag actgtagagc tgttgcaagc aaaggtgagc 901 gccctagacc ttgcagtttt ggatcaagtg gaggctcggc tacagagtgt cctgggaaag 961 gtgaacgaga ttgccaagca taaagcctct gtagaagatg cagatacaca aagcaaggtg 1021 caccagctat atgaaactat acagcgctgg agccccattg cctccaccct ccctgagctg 1081 gtgcagagac ttgtcaccat caagcagctg cacgagcaag ccatgcagtt tggtcagctc 1141 ctgacacact tggataccac ccagcagatg attgctaatt ccttgaagga caataccacc 1201 ctcttgaccc aggtgcagac aaccatgcgt gaaaacctgg ccacagttga ggggaacttt 1261 gccagcattg atgaacggat gaagaagctg ggaaagtgag cacatttggg agctggagaa 1321 caggggttat ccctacccct gtgaactctg ttaacagctt acatagggtt tcccctttac 1381 tataactcta gcatccccat cccatttgac actgggggca agggttcttc ttgcatgtgg 1441 ggtttatacc cctcccctga tgaatacaga gtggtagcta ggggttggtt atcatcagaa 1501 ggtggtctcc cctcaggcct gggggataag gacgtgggcc cagccacatg ccaactcatg 1561 tccaatactg ctttgcctgg tgtggggaag gattgggtct tgtcccccaa cacagcttct 1621 gtggctgact gtaatactgt acaactgttt ctgaccatta aatgctgttg tactctgaaa 1681 aaaaaaaaaa aaaaaaaaaa aaattcctgc ggccgcaagc t // LOCUS HSU50822 1676 bp DNA PRI 02-APR-1996 DEFINITION Human neurogenic helix-loop-helix protein NEUROD (neurod) gene, complete cds. ACCESSION U50822 NID g1245454 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1676) AUTHORS Tamimi,R., Steingrimsson,E., Copeland,N.G., Dyer-Montgomery,K., Lee,J.E., Hernandez,R., Jenkins,N.A. and Tapscott,S.J. TITLE The neurod gene maps to human chromosome 2q32 and mouse chromosome 2 JOURNAL Genomics (1996) In press REFERENCE 2 (bases 1 to 1676) AUTHORS Tapscott,S.J., Tamimi,R., Lee,J.E. and Hernandez,R. TITLE Direct Submission JOURNAL Submitted (04-MAR-1996) Stephen J. Tapscott, Clinical Research, Fred Hutchinson Cancer Research Center, 1124 Columbia Street, Seattle, WA 98104, USA FEATURES Location/Qualifiers source 1..1676 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="2" /map="2q32" mRNA 160..1676 /gene="neurod" gene 160..1676 /gene="neurod" CDS 173..1243 /gene="neurod" /note="neurogenic basic helix-loop-helix protein" /codon_start=1 /product="NEUROD" /db_xref="PID:g1245455" /translation="MTKSYSESGLMGEPQPQGPPSWTDECLSSQDEEHEADKKEDDLE AMNAEEDSLRNGGEEEDEDEDLEEEEEEEEEDDDQKPKRRGPKKKKMTKARLERFKLR RMKANARERNRMHGLNAALDNLRKVVPCYSKTQKLSKIETLRLAKNYIWALSEISRSG KSPDLVSFVQTLCKGLSQPTTNLVAGCLQLNPRTFLPEQNQDMPPHLPTASASFPVHP YSYQSPGLPSPPYGTMDSSHVFHVKPPPHAYSAALEPFFESPLTDCTSPSFDGPLSPP LSINGNFSFKHEPSAEFEKNYAFTMHYPAATLAGAQSHGSIFSGTAAPRCEIPIDNIM SFDSHSHHERVMSAQLNAIFHD" BASE COUNT 452 a 435 c 387 g 402 t ORIGIN 1 acatcgatta actttttctc agaggcattc attttgtaat gggcaggtac ttttcgcaag 61 catttgtaca ggtttaggga gtggaagctg aaggcgatct ttcttttgat atagcgtttt 121 tctgcttttc tttctgtttg cctctccctt gttgaatgta ggaaatcgaa acatgaccaa 181 atcgtacagc gagagtgggc tgatgggcga gcctcagccc caaggtcctc caagctggac 241 agacgagtgt ctcagttctc aggacgagga gcacgaggca gacaagaagg aggacgacct 301 cgaagccatg aacgcagagg aggactcact gaggaacggg ggagaggagg aggacgaaga 361 tgaggacctg gaagaggagg aagaagagga agaggaggat gacgatcaaa agcccaagag 421 acgcggcccc aaaaagaaga agatgactaa ggctcgcctg gagcgtttta aattgagacg 481 catgaaggct aacgcccggg agcggaaccg catgcacgga ctgaacgcgg cgctagacaa 541 cctgcgcaag gtggtgcctt gctattctaa gacgcagaag ctgtccaaaa tcgagactct 601 gcgcttggcc aagaactaca tctgggctct gtcggagatc tcgcgctcag gcaaaagccc 661 agacctggtc tccttcgttc agacgctttg caagggctta tcccaaccca ccaccaacct 721 ggttgcgggc tgcctgcaac tcaatcctcg gacttttctg cctgagcaga accaggacat 781 gcccccgcac ctgccgacgg ccagcgcttc cttccctgta cacccctact cctaccagtc 841 gcctgggctg cccagtccgc cttacggtac catggacagc tcccatgtct tccacgttaa 901 gcctccgccg cacgcctaca gcgcagcgct ggagcccttc tttgaaagcc ctctgactga 961 ttgcaccagc ccttcctttg atggacccct cagcccgccg ctcagcatca atggcaactt 1021 ctctttcaaa cacgaaccgt ccgccgagtt tgagaaaaat tatgccttta ccatgcacta 1081 tcctgcagcg acactggcag gggcccaaag ccacggatca atcttctcag gcaccgctgc 1141 ccctcgctgc gagatcccca tagacaatat tatgtccttc gatagccatt cacatcatga 1201 gcgagtcatg agtgcccagc tcaatgccat atttcatgat tagaggcacg ccagtttcac 1261 catttccggg aaacgaaccc actgtgctta cagtgactgt cgtgtttaca aaaggcagcc 1321 ctttggtact actgctgcaa agtgcaaata ctccaagctt caagtgatat atgtatttat 1381 tgtcattact gcctttggaa gaaacagggg atcaaagttc ctgttcacct tatgtattat 1441 tttctataga ctcttctatt ttaaaaaata aaaaaataca gtaaagttta aaaaatacac 1501 cacgaatttg gtgtggctgt attcagatcg tattaattat ctgatcggga taacaaaatc 1561 acaagcaata attaggatct atgcaatttt taaactagta atgggccaat taaaatatat 1621 ataaatatat atttcaacca gcattttact acttgttacc tcccatgctg aattat // LOCUS HSU50928 5057 bp mRNA PRI 12-JUN-1996 DEFINITION Human autosomal dominant polycystic kidney disease type II (PKD2) mRNA, complete cds. ACCESSION U50928 NID g1373168 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5057) AUTHORS Mochizuki,T., Wu,G., Hayashi,T., Xenophontos,S.L., Veldhuisen,B., Saris,J.J., Reynolds,D.M., Cai,Y., Gabow,P.A., Pierides,A., Kimberling,W.J., Breuning,M.H., Deltas,C.C., Peters,D.J.M. and Somlo,S. TITLE PKD2, a gene for polycystic kidney disease that encodes an integral membrane protein JOURNAL Science 272 (5266), 1339-1342 (1996) MEDLINE 96243133 REFERENCE 2 (bases 1 to 5057) AUTHORS Somlo,S., Mochizuki,T., Wu,G.Q., Hayashi,T., Xenophontos,S.L., Veldhuisen,B., Saris,J.J., Reynolds,D.M., Cai,Y., Gabow,P.A., Pierides,A., Kimberling,W.J., Breuning,M.H., Deltas,C.C. and Peters,D.J.M. TITLE Direct Submission JOURNAL Submitted (07-MAR-1996) Stefan Somlo, Departments of Medicine and Molecular Genetics, The Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY 10461-1602, USA FEATURES Location/Qualifiers source 1..5057 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="4" /map="4q21-23" gene 67..2973 /gene="PKD2" CDS 67..2973 /gene="PKD2" /note="autosomal dominant polycystic kidney disease type II" /codon_start=1 /db_xref="PID:g1373169" /translation="MVNSSRVQPQQPGDAKRPPAPRAPDPGRLMAGCAAVGASLAAPG GLCEQRGLEIEMQRIRQAAARDPPAGAAASPSPPLSSCSRQAWSRDNPGFEAEEEEEE VEGEEGGMVVEMDVEWRPGSRRSAASSAVSSVGARSRGLGGYHGAGHPSGRRRRREDQ GPPCPSPVGGGDPLHRHLPLEGQPPRVAWAERLVRGLRGLWGTRLMEESSTNREKYLK SVLRELVTYLLFLIVLCILTYGMMSSNVYYYTRMMSQLFLDTPVSKTEKTNFKTLSSM EDFWKFTEGSLLDGLYWKMQPSNQTEADNRSFIFYENLLLGVPRIRQLRVRNGSCSIP QDLRDEIKECYDVYSVSSEDRAPFGPRNGTAWIYTSEKDLNGSSHWGIIATYSGAGYY LDLSRTREETAAQVASLKKNVWLDRGTRATFIDFSVYNANINLFCVVRLLVEFPATGG VIPSWQFQPLKLIRYVTTFDFFLAACEIIFCFFIFYYVVEEILEIRIHKLHYFRSFWN CLDVVIVVLSVVAIGINIYRTSNVEVLLQFLEDQNTFPNFEHLAYWQIQFNNIAAVTV FFVWIKLFKFINFNRTMSQLSTTMSRCAKDLFGFAIMFFIIFLAYAQLAYLVFGTQVD DFSTFQECIFTQFRIILGDINFAEIEEANRVLGPIYFTTFVFFMFFILLNMFLAIIND TYSEVKSDLAQQKAEMELSDLIRKGYHKALVKLKLKKNTVDDISESLRQGGGKLNFDE LRQDLKGKGHTDAEIEAIFTKYDQDGDQELTEHEHQQMRDDLEKEREDLDLDHSSLPR PMSSRSFPRSLDDSEEDDDEDSGHSSRRRGSISSGVSYEEFQVLVRRVDRMEHSIGSI VSKIDAVIVKLEIMERAKLKRREVLGRLLDGVAEDERLGRDSEIHREQMERLVREELE RWESDDAASQISHGLGTPVGLNGQPRPRSSRPSSSQSTEGMEGAGGNGSSNVHV" BASE COUNT 1384 a 1071 c 1190 g 1403 t 9 others ORIGIN 1 ggctcctgag gcgcacagcg ccgagcgcgg cgccgcgcac ccgcgcgccg gacgccagtg 61 accgcgatgg tgaactccag tcgcgtgcag cctcagcagc ccggggacgc caagcggccg 121 cccgcgcccc gcgcgccgga cccgggccgg ctgatggctg gctgcgcggc cgtgggcgcc 181 agcctcgccg ccccgggcgg cctctgcgag cagcggggcc tggagatcga gatgcagcgc 241 atccggcagg cggccgcgcg ggaccccccg gccggagccg cggcctcccc ttctcctccg 301 ctctcgtcgt gctcccggca ggcgtggagc cgcgataacc ccggcttcga ggccgaggag 361 gaggaggagg aggtggaagg ggaagaaggc ggaatggtgg tggagatgga cgtagagtgg 421 cgcccgggca gccggaggtc ggccgcctcc tcggccgtga gctccgtggg cgcgcggagc 481 cgggggcttg ggggctacca cggcgcgggc cacccgagcg ggaggcggcg ccggcgagag 541 gaccagggcc cgccgtgccc cagcccagtc ggcggcgggg acccgctgca tcgccacctc 601 cccctggaag ggcagccgcc ccgagtggcc tgggcggaga ggctggttcg cgggctgcga 661 ggtctctggg gaacaagact catggaggaa agcagcacta accgagagaa ataccttaaa 721 agtgttttac gggaactggt cacatacctc ctttttctca tagtcttgtg catcttgacc 781 tacggcatga tgagctccaa tgtgtactac tacacccgga tgatgtcaca gctcttccta 841 gacacccccg tgtccaaaac ggagaaaact aactttaaaa ctctgtcttc catggaagac 901 ttctggaagt tcacagaagg ctccttattg gatgggctgt actggaagat gcagcccagc 961 aaccagactg aagctgacaa ccgaagtttc atcttctatg agaacctgct gttaggggtt 1021 ccacgaatac ggcaactccg agtcagaaat ggatcctgct ctatccccca ggacttgaga 1081 gatgaaatta aagagtgcta tgatgtctac tctgtcagta gtgaagatag ggctcccttt 1141 gggccccgaa atggaaccgc ttggatctac acaagtgaaa aagacttgaa tggtagtagc 1201 cactggggaa tcattgcaac ttatagtgga gctggctatt atctggattt gtcaagaaca 1261 agagaggaaa cagctgcaca agttgctagc ctcaagaaaa atgtctggct ggaccgagga 1321 accagggcaa cttttattga cttctcagtg tacaacgcca acattaacct gttctgtgtg 1381 gtcaggttat tggttgaatt cccagcaaca ggtggtgtga ttccatcttg gcaatttcag 1441 cctttaaagc tgatccgata tgtcacaact tttgatttct tcctggcagc ctgtgagatt 1501 atcttttgtt tctttatctt ttactatgtg gtggaagaga tattggaaat tcgcattcac 1561 aaactacact atttcaggag tttctggaat tgtctggatg ttgtgatcgt tgtgctgtca 1621 gtggtagcta taggaattaa catatacaga acatcaaatg tggaggtgct actacagttt 1681 ctggaagatc aaaatacttt ccccaacttt gagcatctgg catattggca gatacagttc 1741 aacaatatag ctgctgtcac agtatttttt gtctggatta agctcttcaa attcatcaat 1801 tttaacagga ccatgagcca gctctcgaca accatgtctc gatgtgccaa agacctgttt 1861 ggctttgcta ttatgttctt cattattttc ctagcgtatg ctcagttggc ataccttgtc 1921 tttggcactc aggtcgatga cttcagtact ttccaagagt gtatcttcac tcaattccgt 1981 atcattttgg gcgatatcaa ctttgcagag attgaggaag ctaatcgagt tttgggacca 2041 atttatttca ctacatttgt gttctttatg ttcttcattc ttttgaatat gtttttggct 2101 atcatcaatg atacttactc tgaagtgaaa tctgacttgg cacagcagaa agctgaaatg 2161 gaactctcag atcttatcag aaagggctac cataaagctt tggtcaaact aaaactgaaa 2221 aaaaataccg tggatgacat ttcagagagt ctgcggcaag gaggaggcaa gttaaacttt 2281 gacgaacttc gacaagatct caaagggaag ggccatactg atgcagagat tgaggcaata 2341 ttcacaaagt acgaccaaga tggagaccaa gaactgaccg aacatgaaca tcagcagatg 2401 agagacgact tggagaaaga gagggaggac ctggatttgg atcacagttc tttaccacgt 2461 cccatgagca gccgaagttt ccctcgaagc ctggatgact ctgaggagga tgacgatgaa 2521 gatagcggac atagctccag aaggagggga agcatttcta gtggcgtttc ttacgaagag 2581 tttcaagtcc tggtgagacg agtggaccgg atggagcatt ccatcggcag catagtgtcc 2641 aagattgacg ccgtgatcgt gaagctagag attatggagc gagccaaact gaagaggagg 2701 gaggtgctgg gaaggctgtt ggatggggtg gccgaggatg aaaggctggg tcgtgacagt 2761 gaaatccata gggaacagat ggaacggcta gtacgtgaag agttggaacg ctgggaatcc 2821 gatgatgcag cttcccagat cagtcatggt ttaggcacgc cagtgggact aaatggtcaa 2881 cctcgcccca gaagctcccg cccatcttcc tcccaatcta cagaaggcat ggaaggtgca 2941 ggtggaaatg ggagttctaa tgtccacgta tgatatgtgt gtttcagtat gtgtgtttct 3001 aataagtgag gaagtggctg tcctgaattg ctgtaacaag cacactattt atatgccctg 3061 accaccatag gatgctagtc tttgtgaccg attgctaatc ttctgcactt taatttattt 3121 tatataaact ttacccatgg ttcaaagatt tttttttctt tttctcatat aagaaatcta 3181 ggtgtaaata ttgagtacag aaaaaaaatc ttcatgatgt gtattgagcg gtacgcccag 3241 ttgccaccat gactgagtct tctcagttga caatgaagta gccttttaaa gctagaaaac 3301 tgtcaaaggg cttctgagtt tcatttccag tcacaaaaat cagtattgtt atttttttcc 3361 aagagtgtga aggaaaatgg ggcaattcct ttccactctg gcatagttca tgagcttaat 3421 acatagcttt cttttaagaa aggagccttt tttttcaact agcttcctgg ggtaaacttt 3481 tctaaaagat aaaatgggaa ggaactccaa actatgatag aatctgtgtg aatggttaag 3541 atgaatgtta aatactatgc ttttttgtaa gttgatcgta tctgatgtct gtgggactaa 3601 ctgtatcact taatttttac cttattttgg ctctaatttg aataagctga gtaaaaccac 3661 caaagatcag ttataggata aaatggcatc tctaaccata acacaggaga attggaagga 3721 gccctaagtt gtcactcagt ttaatttctt ttaatggtta gtttagccta aagatttatc 3781 tgcatattct ttttcccatg tggctctact catttgcaac tgaatttaat gttataactc 3841 atctagtgag accaacttac taaattttta gtatgcactg aaagttttta tccaacaatt 3901 atgttcattt taagcaaaat tttaagaaag ttttgaaatt cataaagcat ttggttttaa 3961 actattttaa gaatatagta ctcggtcagg tatgnnncac gcctgtaatc ccagcacttt 4021 gggaggccga aacaggcgaa tcacttgagc ccaggagttc aagaccaaca tgggcaatgt 4081 ggcgaaactc catctctaca aaaaatgcaa aaataaaaaa tatagtactc aagtattctt 4141 gatcctgtgt ttcaaaacta gaatttgtaa tgcaaatgga gctcagtcta ataaaaaaga 4201 ggttttggta ttaaaagttc atacattaga cagtatcagc caaaatttga gttagcaaca 4261 ctgttttctt tacgagaggg tctcacccaa atttatgggg agaaatctat ttctcaaaaa 4321 aaaaaaatct tcttttacag aaatgttgag taaggtgaca ttttgagcgc taataagcaa 4381 aagagcatgc agtgctgttg aataaccctc acttggagaa ccaagagaat cctgtcgttt 4441 aatgctatat tttaatttca caagttgttc atttaactgg tagaatgtca gtccaatctc 4501 caatgagaac atgagcaaat agacctttcc aggttgaaag tgaaacatac tgggtttctg 4561 taagtttttc ctcatggctt catctctatc tttactttct cttgaatatg ctacacaaag 4621 ttctttatta ctacatacta aagtttgcat tccagggata ttgactgtac atatttatgt 4681 atatgtacca tgttgttaca tgtaaacaaa cttcaatttg aagtgcagct attatgtggt 4741 atccatgtgt atcgaccatg tgccatatat caattatggt cactagaaag tctctttatg 4801 atacttttta ttgtactgtt tttcatttca cttgcaaaat tttgcagaat tcctcctttc 4861 tacccataaa ttacatataa tttttcttct ttagtcatgg agaacncccc cccatcatct 4921 canccctatt anctttccca tgtgtactgg tattattaaa aagacattta catacgcaag 4981 tttttcactg acaancaaga atgttattaa tgtgtaatac tgagcacntt tacttcttaa 5041 taaaaacttg atatant // LOCUS HSU50929 2436 bp mRNA PRI 06-SEP-1996 DEFINITION Human betaine:homocysteine methyltransferase mRNA, complete cds. ACCESSION U50929 NID g1522682 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2436) AUTHORS Garrow,T.A. TITLE Purification, kinetic properties, and cDNA cloning of mammalian betaine-homocysteine methyltransferase JOURNAL J. Biol. Chem. 271 (37), 22831-22838 (1996) MEDLINE 96394355 REFERENCE 2 (bases 1 to 2436) AUTHORS Garrow,T.A. TITLE Direct Submission JOURNAL Submitted (07-MAR-1996) Timothy A. Garrow, Food Science and Human Nutrition, University of Illinois, Urbana-Champaign, 905 South Goodwin Avenue, Urbana, IL 61801, USA FEATURES Location/Qualifiers source 1..2436 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" CDS 27..1247 /EC_number="2.1.1.5" /note="betaine-dependent methylation of homocysteine; methyltransferase" /codon_start=1 /product="betaine:homocysteine methyltransferase" /db_xref="PID:g1522683" /translation="MPPVGGKKAKKGILERLNAGEIVIGDGGFVFALEKRGYVKAGPW TPEAAVEHPEAVRQLHREFLRAGSNVMQTFTFYASEDKLENRGNYVLEKISGQEVNEA ACDIARQVADEGDALVAGGVSQTPSYLSCKSETEVKKVFLQQLEVFMKKNVDFLIAEY FEHVEEAVWAVETLIASGKPVAATMCIGPEGDLHGVPPGECAVRLVKAGASIIGVNCH FDPTISLKTVKLMKEGLEAAQLKAHLMSQPLAYHTPDCNKQGFIDLPEFPFGLEPRVA TRWDIQKYAREAYNLGVRYIGGCCGFEPYHIRAIAEELAPERGFLPPASEKHGSWGSG LDMHTKPWVRARARKEYWENLRIASGRPYNPSMSKPDGWGVTKGTAELMQQKEATTEQ QLKELFEKQKFKSQ" BASE COUNT 772 a 498 c 568 g 598 t ORIGIN 1 cgaccacctg tctggacacc acaaagatgc cacccgttgg gggcaaaaag gccaagaagg 61 gcatcctaga acgtttaaat gctggagaga ttgtgattgg agatggaggg tttgtctttg 121 cactggagaa gaggggctac gtaaaggcag gaccctggac tcctgaagct gctgtggagc 181 acccagaagc agttcgccag cttcatcgag agttcctcag agctggctca aacgtcatgc 241 agaccttcac cttctatgcg agtgaagaca agctggagaa caggggcaac tatgtcttag 301 agaagatatc tgggcaggaa gtcaatgaag ctgcttgcga catcgcccga caagtggctg 361 atgaaggaga tgctttggta gcaggaggag tgagtcagac accttcatac cttagctgca 421 agagtgaaac tgaagtcaaa aaagtatttc tgcaacagtt agaggtcttt atgaagaaga 481 acgtggactt cttgattgca gagtattttg aacacgttga agaagctgtg tgggcagttg 541 aaaccttgat agcatccggt aaacctgtgg cagcaaccat gtgcattggc ccagaaggag 601 atttgcatgg cgtgcccccc ggcgagtgtg cagtgcgcct ggtgaaagca ggagcatcca 661 tcattggtgt gaactgccac tttgacccca ccattagttt aaaaacagtg aagctcatga 721 aggagggctt ggaggctgcc caactgaaag ctcacctgat gagccagccc ttggcttacc 781 acactcctga ctgcaacaag cagggattca tcgatctccc agaattccca tttggactgg 841 aacccagagt tgccaccaga tgggatattc aaaaatacgc cagagaggcc tacaacctgg 901 gggtcaggta cattggcggg tgctgtggat ttgagcccta ccacatcagg gcaattgcag 961 aggagctggc cccagaaagg ggctttttgc caccagcttc agaaaaacat ggcagctggg 1021 gaagtggttt ggacatgcac accaaaccct gggttagagc aagggccagg aaggaatact 1081 gggagaatct tcggatagcc tcaggccggc catacaaccc ttcaatgtca aagccagatg 1141 gctggggagt gaccaaagga acagccgagc tgatgcagca gaaagaagcc acaactgagc 1201 agcagctgaa agagctcttt gaaaaacaaa aattcaaatc acagtagcct cgatagaagc 1261 tatttttgat gaatttctag gtgtttgggt cacagttcct acaaatacgg aaaagggggt 1321 taaaaagcag tgctttcatg aatgccatcc tacacatatt attgctatta cctgaacaaa 1381 atagaattac aaatagcact tgataatttt aaagtatgtt ttagaaattt tcttaggagc 1441 aaaataagta caaagtaaat cttgaacagg ttcactaagc acccaccctg tgaaaagtat 1501 tatggaaatc actgcagcac aggaaaagta attcagatgt taatgccact tgaagaagtt 1561 ggtaggctag caaagaggat gagacatgaa ctgtcataaa ggactcagca accagccagg 1621 gacagataaa gcgctatgga aaggggcttc caagttcttt tgaacatgac ccttagtaac 1681 aaacacaatt tatataatga cccagcaaaa cacatcacat cttactgtcg aaattaaatg 1741 tgtgatccat cctagtattt tctgttccat tccttttcat tctatttcat ttataaaaca 1801 tgctagttga gacttttcaa atggattttt atgacccact actgggtttg gatccacagt 1861 ttgaaaaata ttgctacaag acacttaagg agaccatcct gtttaagttt attcttataa 1921 gtaggtcagt catatgagac ctgatcaata aatatccaat acccagagtc ctgctctcag 1981 agttcttctg tttcgtgacc cacttttcta ccagtaaaag acatagacca atggggagga 2041 ggggaggaga gatggatatt tcagccctct ccatcctagt caacactgga tccacctagt 2101 gcctctgggc cataaggctg agcagagtga gcttgtatta gttggtagct tttaaaaaat 2161 ataataaaaa aaaagtagag attctccaaa ctctagcctg gtttcctaga ttgagaacta 2221 tgatattttt ctctgataat ttaatatcta ctctcctaca aaagctcaag cctgaagata 2281 caagactatt agaagaaaca tgactaccct cagtgtatta gaaaagaggt catgcagctt 2341 tctaaacatt attgaattgt ttgagctgtt ttgaaattgt aattcttttc agctattaaa 2401 aagaagagca atgagaaaaa aaaaaaaaaa aaaaaa // LOCUS HSU50939 1823 bp mRNA PRI 16-MAY-1996 DEFINITION Human amyloid precursor protein-binding protein 1 mRNA, complete cds. ACCESSION U50939 NID g1314559 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1823) AUTHORS Chow,N., Korenberg,J.R., Chen,X.N. and Neve,R.L. TITLE APP-BP1, a novel protein that binds to the carboxyl-terminal region of the amyloid precursor protein JOURNAL J. Biol. Chem. 271 (19), 11339-11346 (1996) MEDLINE 96212203 REFERENCE 2 (bases 1 to 1823) AUTHORS Chow,N. TITLE Direct Submission JOURNAL Submitted (08-MAR-1996) Nienwen Chow, Neurobiology & Anatomy, Univ. Rochester, 601 Elmwood Ave., Rochester, NY 14642, USA FEATURES Location/Qualifiers source 1..1823 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" /map="16q22" CDS 74..1678 /function="binds to the carboxyl-terminal region of the amyloid precursor protein" /note="APP-BP1" /codon_start=1 /evidence=experimental /product="amyloid precursor protein-binding protein 1" /db_xref="PID:g1314560" /translation="MAQLGKLLKEQKYDRQLRLWGDHGQEALESAHVCLINATATGTE ILKNLVLPGIGSFTIIDGNQVSGEDAGNNFFLQRSSIGKNRAEAAMEFLQELNSDVSG SFVEESPENLLDNDPSFFCRFTVVVATQLPESTSLRLADVLWNSQIPLLICRTYGLVG YMRIIIKEHPVIESHPDNALEDLRLDKPFPELREHFQSYDLDHMEKKDHSHTPWIVII AKYLAQWYSETNGRIPKTYKEKEDFRDLIRQGILKNENGAPEDEENFEEAIKNVNTAL NTTQIPSSIEDIFNDDRCINITKQTPSFWILARALKEFVAKEGQGNLPVRGTIPDMIA DSGKYIKLQNVYREKAKKDAAAVGNHVAKLLQSIGQAPESISEKELKLLCSNSAFLRV VRCRSLAEEYGLDTINKDEIISSMDNPDNEIVLYLMLRAVDRFHKQQGRYPGVSNYQV EEDIGKLKSCLTGFLQEYGLSVMVKDDYVHEFCRYGAAEPHTIAAFLGGAAAQEVIKI ITKQFVIFNNTYIYSGMSQTSATFQL" BASE COUNT 587 a 320 c 404 g 512 t ORIGIN 1 gaattccgcg cttgtggagc tggtggcggc gctccgcagg ggctcggctg ttttccgcgc 61 ggcaggcgcg gccatggcgc agctgggaaa gctgctcaag gagcagaagt acgaccggca 121 gctgaggttg tggggtgatc atgggcaaga ggctttagaa tctgctcatg tttgcctaat 181 aaatgcaaca gccacaggaa ctgaaattct taaaaacttg gtactaccag gtattggttc 241 gtttacaatt attgatggaa atcaggtcag cggagaagat gctggaaaca atttcttcct 301 tcaaagaagc agtatcggca agaaccgagc tgaagctgcc atggaattct tacaagaatt 361 aaatagcgat gtctctggaa gttttgtgga agagagtcca gaaaaccttc tagacaatga 421 tccctcattt ttctgtaggt ttactgttgt agttgcaact cagcttcctg aaagcacttc 481 actacgctta gcagatgtcc tctggaattc ccagattcct cttttgatct gtaggacata 541 tggactagtt ggttatatga ggatcattat aaaagaacat ccagtaatag aatctcatcc 601 agataatgca ttagaggatc tacgactaga taagccattt cctgaactga gagaacattt 661 tcagtcctat gatttggatc atatggaaaa aaaggaccac agtcatactc catggattgt 721 gatcatagct aaatatttag cacagtggta tagtgaaaca aatggacgaa tacctaaaac 781 gtataaagaa aaagaggact tcagagattt gattagacaa ggaattctaa aaaatgaaaa 841 tggggctcca gaagatgaag agaattttga agaagctatt aaaaatgtga acacagcact 901 aaatacaact cagatcccaa gcagtattga agatatattt aatgatgatc gctgcataaa 961 tatcaccaaa cagactccat cattttggat tttagctcgt gccttaaagg aatttgtggc 1021 caaagagggt caaggaaatt tacctgttcg aggcacaatt cctgatatga ttgcagattc 1081 aggcaaatat ataaaactgc aaaacgttta ccgtgaaaaa gcaaagaaag atgctgccgc 1141 tgtgggtaat catgttgcca aattgctgca gtccattggc caggcaccag agtccatttc 1201 agagaaagaa ttaaaattac tctgcagcaa ttctgcattt cttcgagtgg taagatgtcg 1261 atccttagct gaagaatatg gtttggatac aattaacaag gatgaaatta tttctagcat 1321 ggacaatcca gataatgaaa tagtgttgta cttaatgtta cgggctgttg atagatttca 1381 taaacaacag ggtagatatc caggagtatc taactatcaa gttgaagaag atataggaaa 1441 gttgaagtct tgtctcactg gcttccttca ggaatatggt ttatctgtaa tggtgaaaga 1501 tgattatgtc cacgaatttt gccgatatgg agctgctgag ccacatacca ttgctgcatt 1561 cttgggggga gctgctgctc aagaggtcat caaaataatc accaaacaat ttgtaatttt 1621 taataatact tacatttaca gtggcatgtc acaaacttca gcaactttcc agttgtagag 1681 taagcaagca ccttaagtag tgtgttaatg attgaaactg taattgcctt cgggttgtgc 1741 tttagtctgt aaaattctaa aggagagctg ctaaattgtt ttcttaataa acatttttct 1801 catttgtaaa aaaaaaggaa ttc // LOCUS HSU50950 2152 bp mRNA PRI 08-APR-1996 DEFINITION Human infant brain unknown product mRNA, complete cds. ACCESSION U50950 NID g1256384 KEYWORDS PRI. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2152) AUTHORS Mihalek,R. and Homanics,G.E. TITLE Direct Submission JOURNAL Submitted (08-MAR-1996) Robert Mihalek, Anesthesiology, University of Pittsburgh, 3500 Terrace St., Pittsburgh, PA 15213, USA FEATURES Location/Qualifiers source 1..2152 /organism="Homo sapiens" /note="complete sequence, including poly A tail, of EST 50150 from the IMAGE Consortium" /db_xref="taxon:9606" /tissue_type="brain" /dev_stage="infant" /clone_lib="Bento Soares, Columbia University" CDS 134..1297 /codon_start=1 /product="unknown" /db_xref="PID:g1256385" /translation="MDCCVDIKSKEEESVHVTQRKTHYSMDSLSSWYMTVTQKTDPKM LSKKRTTSSQWQSPPVVVILKDMESFATKVLQDFIIISSQHLHEFPLILIFGIATSPI IIHRLLPHAVSSLLCIELFQSLSCKEHLTTVLDKLLLTTQFPFKINEKVLQVLTNIFL YHDFSVQNFIKGLQLSLLEHFYSQPLSVLCCNLPEAKRRINFLSNNQCENIRRLPSFR RYVEKQASEKQVALLTNERYLKEETQLLLENLHVYHMNYFLVLRCLHKFTSSLPKYPL GRQIRELYCTCLEKNIWDSEEYASVLQLLRMLAKDELMTILEKCFKVFKSYCENHLGS TAKRIEEFLAQFQSLDETKEEEDASGSQPKGLQKTDLYHLQKSLLEMKEFRRS" BASE COUNT 720 a 432 c 407 g 593 t ORIGIN 1 ggcacgaggg atttgacatt cggaagtcta acagaggccc ttcagaataa tgtcacacca 61 tatgtagtct cattgcaagc taaagattgt ccagatatga aacatttttt tgcaaaagtt 121 gatctcacag ttgatggact gctgtgtaga tataaaatcc aaagaggagg aaagtgttca 181 cgtcacccaa agaaagacac attattcaat ggattcactt tccagttggt atatgactgt 241 cacacagaag acggacccaa aaatgctaag caaaaaaagg actacttcta gccaatggca 301 gtctcctcct gttgtcgtta tcttgaagga tatggaaagc tttgccacaa aagtactaca 361 agacttcata attatcagca gtcaacatct ccatgaattt ccactaatac tcatttttgg 421 aatagccaca tctcctatta tcatccaccg attgcttcct catgcagtat catctctatt 481 gtgcatagaa ctgttccaat ctttgtcttg taaggagcac ctgactacgg tactcgataa 541 gctacttctt acaactcagt ttccctttaa aataaatgaa aaagtattac aggttctgac 601 caacatcttt ttgtatcatg atttctcagt tcaaaacttt ataaaaggac ttcagctttc 661 tctattagag catttctatt cccagccctt aagtgtcctg tgctgtaatc ttccagaagc 721 caaaagaaga ataaattttt tatcaaataa tcaatgtgaa aacatccgac gtctaccatc 781 ttttaggagg tacgtggaaa agcaagcttc agaaaagcaa gttgcgctct tgaccaatga 841 gagatatttg aaggaggaaa cacaattatt actagaaaac ctgcatgttt atcatatgaa 901 ttacttcctg gttttgagat gtcttcataa gttcacctct tctcttccca agtatccact 961 aggtcgacag atcagagagt tgtactgtac atgtttagaa aagaacatat gggattcaga 1021 ggagtatgca tcagtcttgc agctgctgag gatgttggca aaggatgaac tgatgaccat 1081 acttgagaaa tgtttcaagg tttttaagtc ttattgtgaa aaccaccttg gcagcacagc 1141 taagagaata gaggagttcc tggcccagtt tcagagcctc gatgaaacca aagaagaaga 1201 agatgcttct gggtcacagc caaaggggct tcagaagaca gacctctatc atcttcagaa 1261 gtccttattg gaaatgaagg agtttagaag aagttagaag caaaccaaat ttgaagtact 1321 cagagaaaat gttgtgaact tcattgactg tctagtgaga gaataccttc tgcctcctga 1381 gacacagcct ctccatgagg tggtgtactt cagtgctgcc catgcccttc gtgagcattt 1441 aaatgctgct ccgcgaattg ccctccatac tgcactcaac aatccttact attatctcaa 1501 gaatgaagca ctgaaaagcg aagaaggctg cattccgaat atcgccccag acatctgcat 1561 agcatacaaa ctgcacctag agtgtagcag gctcatcaac ctcgtggact ggtcagaggc 1621 ttttgcaaca gttgtgacag ctgctgaaaa aatggatgca aattctgcaa cctcagaaga 1681 aatgaatgaa attatccatg ctcggtttat tagagctgtt tctgaactag aacttttagg 1741 atttataaaa cctaccaaac agaagactga ccatgtggca agactaacat ggggaggctg 1801 ctagaaagca aataagcaaa gccagaacta tcacatttag cttaagagaa aaaggtgacc 1861 agtcatattt acatatatta gaggagcctg ttttgttgag aagataaatg tgtaaccccc 1921 attgatgttt aaccagaaaa gtacattgct aaccccaaac aggcatgtat caaaacacct 1981 gtggagtact ttagactcca acaaataata atgtaactaa aactgctcac acattttact 2041 gtactttcca aagtcattac taaattgtga gtaaatcatt cttgaactta gagtatgtaa 2101 atgtaataaa ttccgttatc caggagtata aaaaaaaaaa aaaaaaaaaa aa // LOCUS HSU50P6 943 bp mRNA PRI 29-JAN-1998 DEFINITION Homo sapiens cDNA homologous to Yeast SCO1 & SCO2 genes and C.elegans C01F1.2 gene. ACCESSION AL021683 NID g2827694 KEYWORDS C01F1.2 gene; SCO1 gene; SCO2 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 943) AUTHORS Smink,L.J. and Burton,J. TITLE Direct Submission JOURNAL Submitted (21-JAN-1998) E-mail contact: humquery@sanger.ac.uk Clone requests:clonerequest@sanger.ac.uk COMMENT This sequence was generated from a cDNA clone isolated using sequence from the BAC clone CIT987SK-384D8 sequenced by The Institute for Genomic Research (U62317). All matches to EMBL sequences shown 90% or more. Further information can be found at http://www.sanger.ac.uk/HGP/Chr22/. FEATURES Location/Qualifiers source 1..943 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="22" /tissue_type="monocyte" /map="q13" /clone="U50P6" exon 1..74 /number=1 mRNA 1..931 /product="hypothetical protein" misc_feature join(13..64,75..212,345..449,551..572) /note="match: 5' EST AA001318 clone 427850" misc_feature 50..193 /note="match: 3' EST AA490655 clone 785115" exon 75..931 /number=2 CDS 86..886 /note="unnamed protein product" /codon_start=1 /db_xref="PID:e1248832" /db_xref="PID:g2827695" /translation="MLLLTRSPTAWHRLSQLKPPVFPGTLGGQALHLRSWLLSRQGPA ETGGQGQPQGPGLRTRLLITGLFGAGLGGAWLALRAEKERLQQQKRTEALRQAAVGQG DFHLLDHRGRARCKADFRGQWVLMYFGFTHCPDICPDELEKLVQVVRQLEAEPGLPPV QPVFITVDPERDDVEAMARYVQDFHPRLLGLTGSTKQVAQASHSYRVYYNAGPKDEDQ DYIVDHSIAIYLLNPDGLFTDYYGRSRSAEQISDSVRRHMAAFRSVLS" misc_feature complement(join(314..433,224..306)) /note="match: 5' EST AA490654 clone 785115" misc_feature join(326..592,586..602,609..632,631..654,650..710, 708..752) /note="match: 5' EST AA374910 clone 427850" misc_feature complement(389..937) /note="match: EST AA613650 clone IMAGE:1103079" misc_feature 408..747 /note="match: 5' EST H27782 clone 162419" misc_feature complement(470..943) /note="match: EST AA595365 clone IMAGE:1102772" misc_feature complement(473..939) /note="match: EST AA580265 clone IMAGE:1083638" misc_feature complement(611..929) /note="match: 5' EST T07218 clone HFBEF54" misc_feature 730..899 /note="match: 5' EST T30682 clone 785115" misc_feature join(832..859,860..927) /note="match: 3' EST C01279 clone 785115" misc_feature complement(join(842..932,788..842)) /note="match: 3' EST AA001905 clone 427850" misc_feature complement(join(863..937,618..868)) /note="match: 5' EST AA568231 clone IMAGE:1061232" polyA_signal 910..915 /note="probable poly-A signal" polyA_site 930 BASE COUNT 181 a 293 c 296 g 173 t ORIGIN 1 gcagagccca gggagctgga ggtcggcgct tcctctcgtg cttggtccac tgacgcgcgg 61 ccccgccgcg aggagcatca gatccatgct gctgctgact cggagcccca cagcttggca 121 caggctctct cagctcaagc ctccggtctt ccctgggacc ctgggaggcc aggccctgca 181 tctgaggtcc tggcttttgt caaggcaggg ccctgcagag acaggtgggc agggccagcc 241 ccagggccct gggcttcgaa cccggctgct gatcacaggc ctgttcgggg ctggactcgg 301 tggggcctgg ctggccctga gggctgagaa ggagaggctg cagcagcaaa agcgaacaga 361 agccctgcgc caggcagctg tgggccaggg cgacttccac ctgctggatc acagaggccg 421 ggctcgctgc aaggctgact tccggggcca gtgggtgctg atgtactttg gcttcactca 481 ctgccctgac atctgcccag acgagctgga gaagctggtg caggtggtgc ggcagctgga 541 agcagagcct ggtttgcctc cagtgcagcc tgtcttcatc actgtggacc ccgagcggga 601 cgacgttgaa gccatggccc gctacgtcca ggacttccac ccaagactgt tgggtctgac 661 cggctccacc aaacaggttg cccaggctag tcacagttac cgcgtgtact acaatgccgg 721 ccccaaggat gaggaccagg actacatcgt ggaccactcc attgccatct acctgctcaa 781 ccctgacggc ctcttcacgg attactacgg ccggagcaga tcggctgagc agatctcaga 841 cagtgtgcgg cggcacatgg cggctttccg cagtgtcctg tcttgagcca ctgcagtctg 901 ggccccatca ttaaacgggc tgcgtttaaa aaaaaaaaaa aaa // LOCUS HSU51007 1330 bp mRNA PRI 08-APR-1996 DEFINITION Human 26S protease subunit S5a mRNA, complete cds. ACCESSION U51007 NID g1256400 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1330) AUTHORS Ferrell,K., Deveraux,Q., van Nocker,S. and Rechsteiner,M. TITLE Molecular cloning and expression of a multiubiquitin chain binding subunit of the human 26S protease JOURNAL FEBS Lett. 381 (1-2), 143-148 (1996) MEDLINE 96193932 REFERENCE 2 (bases 1 to 1330) AUTHORS Deveraux,Q. TITLE Direct Submission JOURNAL Submitted (08-MAR-1996) Quinn Devereaux, Biochemistry, University of Utah, 50 North Medical Drive, SLC, UT 84112, USA FEATURES Location/Qualifiers source 1..1330 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 145..1278 /note="Subunit of the 26S protease that binds and presumably selects ubiquitin-conjugates for destruction; Subunit 5a of the 26S protease regulatory complex" /codon_start=1 /product="26S protease subunit S5a" /db_xref="PID:g1256401" /translation="MVLESTMVCVDNSEYMRNGDFLPTRLQAQQDAVNIVCHSKTRSN PENNVGLITLANDCEVLTTLTPDTGRILSKLHTVQPKGKITFCTGIRVAHLALKHRQG KNHKMRIIAFVGSPVEDNEKDLVKLAKRLKKEKVNVDIINFGEEEVNTEKLTAFVNTL NGKDGTGSHLVTVPPGPSLADALISSPILAGEGGAMLGLGASDFEFGVDPSADPELAL ALRVSMEEQRQRQEEEARRAAAASAAEAGIATTGTEDSDDALLKMTISQQEFGRTGLP DLSSMTEEEQIAYAMQMSLQGAEFGQAESADIDASSAMDTSEPAKEEDDYDVMQDPEF LQSVLENLPGVDPNNEAIRNAMGSLASQATKDGKKDKKEEDKK" BASE COUNT 353 a 308 c 394 g 275 t ORIGIN 1 aattcccaaa tgacctttta tttcatacag agatacaaag gcaactatgt gcagcaacaa 61 tctgatgggc agtccaaact cttgggagga agtaaattca tggtaaatgt catgatggcg 121 gtcgggaggg aggaaggtgg caagatggtg ttggaaagca ctatggtgtg tgtggacaac 181 agtgagtata tgcggaatgg agacttctta cccaccaggc tgcaggccca gcaggatgct 241 gtcaacatag tttgtcattc aaagacccgc agcaaccctg agaacaacgt gggccttatc 301 acactggcta atgactgtga agtgctgacc acactcaccc cagacactgg ccgtatcctg 361 tccaagctac atactgtcca acccaagggc aagatcacct tctgcacggg catccgcgtg 421 gcccatctgg ctctgaagca ccgacaaggc aagaatcaca agatgcgcat cattgccttt 481 gtgggaagcc cagtggagga caatgagaag gatctggtga aactggctaa acgcctcaag 541 aaggagaaag taaatgttga cattatcaat tttggggaag aggaggtgaa cacagaaaag 601 ctgacagcct ttgtaaacac gttgaatggc aaagatggaa ccggttctca tctggtgaca 661 gtgcctcctg ggcccagttt ggctgatgct ctcatcagtt ctccgatttt ggctggtgaa 721 ggtggtgcca tgctgggtct tggtgccagt gactttgaat ttggagtaga tcccagtgct 781 gatcctgagc tggccttggc ccttcgtgta tctatggaag agcagcggca gcggcaggag 841 gaggaggccc ggcgggcagc tgcagcttct gctgctgagg ccgggattgc tacgactggg 901 actgaagact cagacgatgc cctgctgaag atgaccatca gccagcaaga gtttggccgc 961 actgggcttc ctgacctaag cagtatgact gaggaagagc agattgctta tgccatgcag 1021 atgtccctgc agggagcaga gtttggccag gcggaatcag cagacattga tgccagctca 1081 gctatggaca catccgagcc agccaaggag gaggatgatt acgacgtgat gcaggacccc 1141 gagttccttc agagtgtcct agagaacctc ccaggtgtgg atcccaacaa tgaagccatt 1201 cgaaatgcta tgggctccct ggcctcccag gccaccaagg acggcaagaa ggacaagaag 1261 gaggaagaca agaagtgaga ctggagggaa agggtagctg agtctgctta gggactgcat 1321 gggggaattc // LOCUS HSU51095 1699 bp mRNA PRI 01-JAN-1997 DEFINITION Human homeobox protein Cdx1 mRNA, complete cds. ACCESSION U51095 NID g1777771 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1699) AUTHORS Mallo,G.V., Rechreche,H., Frigerio,J.M., Rocha,D., Zweibaum,A., Lacasa,M., Jordan,B.R., Dusetti,N.J., Dagorn,J.C. and Iovanna,J.L. TITLE Molecular cloning, sequencing and expression of the mRNAs encoding human Cdx1 and Cdx2 homeobox. Down-regulation of Cdx1 and Cdx2 mRNA expression during colorectal carcinogenesis JOURNAL Unpublished REFERENCE 2 (bases 1 to 1699) AUTHORS Iovanna,J.L. TITLE Direct Submission JOURNAL Submitted (11-MAR-1996) Juan L. Iovanna, Department of Molecular Biology, INSERM U 315, 46, Boulevard de la Gaye, Marseille, 13009, FRANCE FEATURES Location/Qualifiers source 1..1699 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="6G2" /sex="male" /tissue_type="colocarcinoma" 5'UTR 1..81 CDS 82..879 /codon_start=1 /product="homeobox protein Cdx1" /db_xref="PID:g1777772" /translation="MYVGYVLDKDSPVYPGPARPASLGLGPANYGPPAPPPAPPQYPD FSSYSHVEPAPAPPTAWGAPFPAPKDDWAAAYGPGPAAPAASPASLAFGPPPDFSPVP APPGPGPGLLAQPLGGPGTPSSPGAQRPTPYEWMRRSVAAGGGGGSGKTRTKDKYRVV YTDHQRLELEKEFHYSRYITIRRKSELAANLGLTERQVKIWFQNRRAKERKVNKKKQQ QQQPPQPPMAHDITATPAGPSLGGLCPSNTSLLATSSPMPVKEEFLP" 3'UTR 880..1699 polyA_signal 1675 BASE COUNT 325 a 568 c 522 g 284 t ORIGIN 1 aggtgagcgg ttgctcgtcg tcggggcggc cggcagcggc ggctccaggg cccagcatgc 61 gcgggggacc ccgcggccac catgtatgtg ggctatgtgc tggacaagga ttcgcccgtg 121 taccccggcc cagccaggcc agccagcctc ggcctgggcc cggcaaacta cggccccccg 181 gccccgcccc cggcgccccc gcagtacccc gacttctcca gctactctca cgtggagccg 241 gcccccgcgc ccccgacggc ctggggggcg cccttccctg cgcccaagga cgactgggcc 301 gccgcctacg gcccgggccc cgcggcccct gccgccagcc cagcttcgct ggcattcggg 361 ccccctccag actttagccc ggtgccggcg ccccctgggc ccggcccggg cctcctggcg 421 cagcccctcg ggggcccggg cacaccgtcc tcgcccggag cgcagaggcc gacgccctac 481 gagtggatgc ggcgcagcgt ggcggccgga ggcggcggtg gcagcggtaa gactcggacc 541 aaggacaagt accgcgtggt ctacaccgac caccaacgcc tggagctgga gaaggagttt 601 cattacagcc gttacatcac aatccggcgg aaatcagagc tggctgccaa tctggggctc 661 actgaacggc aggtgaagat ctggttccaa aaccggcggg caaaggagcg caaagtgaac 721 aagaagaaac agcagcagca acagccccca cagccgccga tggcccacga catcacggcc 781 accccagccg ggccatccct ggggggcctg tgtcccagca acaccagcct cctggccacc 841 tcctctccaa tgcctgtgaa agaggagttt ctgccatagc cccatgccca gcctgtgcgc 901 cgggggacct ggggactcgg gtgctgggag tgtggctcct gtgggcccag gaggtctggt 961 ccgagtctca gccctgacct tctgggacat ggtggacagt cacctatcca ccctctgcat 1021 ccccttggcc cattgtgtgc agtaagcctg ttggataaag accttccagc tcctgtgttc 1081 tagacctctg ggggataagg gagtccaggg tggatgatct caatctcccg tgggcatctc 1141 aagccccaaa tggttggggg aggggcctag acaaggctcc aggccccacc tcctcctcca 1201 tacgttcaga ggtgcagctg gaggcctgtg tggggaccac actgatcctg gagaaaaggg 1261 atggagctga aaaagatgga atgcttgcag agcatgacct gaggagggag gaacgtggtc 1321 aactcacacc tgcctcttct gcagcctcac ctctacctgc ccccatcata agggcactga 1381 gcccttccca ggctggatac taagcacaaa gcccatagca ctgggctctg atggctgctc 1441 cactgggtta cagaatcaca gccctcatga tcattctcag tgagggctct ggattgagag 1501 ggaggccctg ggaggagaga agggggcaga gtcttcccta ccaggtttct acacccccgc 1561 caggctgccc atcagggccc agggagcccc cagaggactt tattcggacc aagcagagct 1621 cacagctgga caggtgttgt atatagagtg gaatctcttg gatgcagctt caagaataaa 1681 tttttcttct cttttcaaa // LOCUS HSU51127 2187 bp mRNA PRI 04-APR-1996 DEFINITION Human interferon regulatory factor 5 (Humirf5) mRNA, complete cds. ACCESSION U51127 NID g1255254 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2187) AUTHORS Grossman,A., Mittrucker,H.W., Lantonio,L. and Mak,T.W. TITLE Generation of mutant mice deficient in IRF5 JOURNAL Unpublished REFERENCE 2 (bases 1 to 2187) AUTHORS Grossman,A., Mittrucker,H.W., Lantonio,L. and Mak,T.W. TITLE Direct Submission JOURNAL Submitted (12-MAR-1996) Alex Grossman, Amgen Institute, 620 University Avenue, Toronto, Ontario M5G 2C1, Canada FEATURES Location/Qualifiers source 1..2187 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lymphocyte" 5'UTR 1..103 gene 104..2187 /gene="Humirf5" misc_feature 104..451 /gene="Humirf5" /note="encodes tryptophan pentad repeat" /function="DNA binding domain" CDS 104..1618 /gene="Humirf5" /function="transcription factor" /note="IRF5" /codon_start=1 /product="interferon regulatory factor 5" /db_xref="PID:g1255255" /translation="MNQSIPVAPTPPRRVRLKPWLVAQVNSCQYPGLQWVNGEKKLFC IPWRHATRHGPSQDGDNTIFKAWAKETGKYTEGVDEADPAKWKANLRCALNKSRDFRL IYDGPRDMPPQPYKIYEVCSNGPAPTDSQPPEDYSFGAGEEEEEEEELQRMLPSLSLT DAVQSGPHMTPYSLLKEDVKWPPTLQPPTLQPPVVLGPPAPDPSPLAPPPGNPAGFRE LLSEVLEPGPLPASLPPAGEQLLPDLLISPHMLPLTDLEIKFQYRGRPPRALTISNPH GCRLFYSQLEATQEQVELFGPISLEQVRFPSPEDIPSDKQRFYTNQLLDVLDRGLILQ LQGQDLYAIRLCQCKVFWSGPCASAHDSCPNPIQREVKTKLFSLEHFLNELILFQKGQ TNTPPPFEIFFCFGEEWPDRKPREKKLITVQVVPVAARLLLEMFSGELSWSADSIRLQ ISNPDLKDRMVEQFKELHHIWQSQQRLQPVAQAPPGAGLGVGQGPWPMHPAGMQ" misc_feature 452..807 /gene="Humirf5" /note="encodes PEST domain" misc_feature 527..550 /gene="Humirf5" /note="encodes glutamate stretch" misc_feature 808..1618 /gene="Humirf5" /note="encodes carboxy terminal protein interaction domain" 3'UTR 1619..2187 /gene="Humirf5" polyA_signal 2170..2175 /gene="Humirf5" BASE COUNT 452 a 705 c 607 g 423 t ORIGIN 1 gcggcgggag gcgcagcctg ggcagagctc agcttggtcc cgccgcccgg ccggtgctcc 61 ctggcgcagc cacgcaggcg caccgcagac agacccctct gccatgaacc agtccatccc 121 agtggctccc accccacccc gccgcgtgcg gctgaagccc tggctggtgg cccaggtgaa 181 cagctgccag tacccagggc ttcaatgggt caacggggaa aagaaattat tctgcatccc 241 ctggaggcat gccacaaggc atggtcccag ccaggacgga gataacacca tcttcaaggc 301 ctgggccaag gagacaggga aatacaccga aggcgtggat gaagccgatc cggccaagtg 361 gaaggccaac ctgcgctgtg cccttaacaa gagccgggac ttccgcctca tctacgacgg 421 gccccgggac atgccacctc agccctacaa gatctacgag gtctgctcca atggccctgc 481 tcccacagac tcccagcccc ctgaggatta ctcttttggt gcaggagagg aggaggaaga 541 agaggaagag ctgcagagga tgttgccaag cctgagcctc acagatgcag tgcagtctgg 601 cccccacatg acaccctatt ctttactcaa agaggatgtc aagtggccgc ccactctgca 661 gccgcccact ctgcagccgc ccgtggtgct gggtccccct gctccagacc ccagccccct 721 ggctcctccc cctggcaacc ctgctggctt cagggagctt ctctctgagg tcctggagcc 781 tgggcccctg cctgccagcc tgccccctgc aggcgaacag ctcctgccag acctgctgat 841 cagcccccac atgctgcctc tgaccgacct ggagatcaag tttcagtacc gggggcggcc 901 accccgggcc ctcaccatca gcaaccccca tggctgccgg ctcttctaca gccagctgga 961 ggccacccag gagcaggtgg aactcttcgg ccccataagc ctggagcaag tgcgcttccc 1021 cagccctgag gacatcccca gtgacaagca gcgcttctac acgaaccagc tgctggatgt 1081 cctggaccgc gggctcatcc tccagctaca gggccaggac ctttatgcca tccgcctgtg 1141 tcagtgcaag gtgttctgga gcgggccttg tgcctcagcc catgactcat gccccaaccc 1201 catccagcgg gaggtcaaga ccaagctttt cagcctggag cattttctca atgagctcat 1261 cctgttccaa aagggccaga ccaacacccc accacccttc gagatcttct tctgctttgg 1321 ggaagaatgg cctgaccgca aaccccgaga gaagaagctc attactgtac aggtggtgcc 1381 tgtagcagct cgactgctgc tggagatgtt ctcaggggag ctatcttggt cagctgatag 1441 tatccggcta cagatctcaa acccagacct caaagaccgc atggtggagc aattcaagga 1501 gctccatcac atctggcagt cccagcagcg gttgcagcct gtggcccagg cccctcctgg 1561 agcaggcctt ggtgttggcc aggggccctg gcctatgcac ccagctggca tgcaataaca 1621 aggctgcaga cggtgactgg ccctggcttc ctgggtggcg gtgcggactg atgtggagat 1681 gtgacagccc cgatgagcac ctggctggct gcagggtcct acctctgggt ttcctggaag 1741 tggatttggg ccaagaagga gagggagaaa ggcccgagcc cctgccttcc cgggcctttc 1801 tctcctgggc tgtctctggt ctggtcagcc tggctctcgg gaaattcagc catgagcagg 1861 gaaagaactc tcccaaccct ggggcctagc tgtataggag gaattgccta agggtggccc 1921 actcttgtga ttgccccatt tcctctggca acaaaagcca gagtgttgtg ggccaagtcc 1981 ccccacaggg cctctgcagg gcatggccct gatttccctg gtttgagact cacttcctca 2041 tctccctgtc ctctgagata atatgagtga gcacttaggt atcatatcag atgctcaagg 2101 ctggcagcta cccccttctt gagagtccaa gaacctggag cagaaataat ttttatgtat 2161 ttttggatta ataaatgtta aaaacag // LOCUS HSU51166 3410 bp mRNA PRI 18-JUN-1996 DEFINITION Human G/T mismatch-specific thymine DNA glycosylase mRNA, complete cds. ACCESSION U51166 NID g1378106 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3410) AUTHORS Neddermann,P. and Jiricny,J. TITLE Efficient removal of uracil from G.U mispairs by the mismatch-specific thymine DNA glycosylase from HeLa cells JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91 (5), 1642-1646 (1994) MEDLINE 94173886 REFERENCE 2 (bases 1 to 3410) AUTHORS Neddermann,P. and Jiricny,J. TITLE The purification of a mismatch-specific thymine-DNA glycosylase from HeLa cells JOURNAL J. Biol. Chem. 268 (28), 21218-21224 (1993) MEDLINE 94012674 REFERENCE 3 (bases 1 to 3410) AUTHORS Neddermann,P., Gallinari,P., Lettieri,T., Schmid,D., Truong,O., Hsuan,J.J., Wiebauer,K. and Jiricny,J. TITLE Cloning and expression of human G/T mismatch-specific thymine-DNA glycosylase JOURNAL J. Biol. Chem. 271 (22), 12767-12774 (1996) MEDLINE 96278662 REFERENCE 4 (bases 1 to 3410) AUTHORS Jiricny,J. TITLE Direct Submission JOURNAL Submitted (12-MAR-1996) Josef Jiricny, Biochemistry, IRBM, Via Pontina Km 30.600, Pomezia, Rome 00040, Italy FEATURES Location/Qualifiers source 1..3410 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Lambda ZAP II and Uni ZAP HeLa cDNA library (Stratagene)" CDS 400..1632 /codon_start=1 /product="G/T mismatch-specific thymine DNA glycosylase" /db_xref="PID:g1378107" /translation="MEAENAGSYSLQQAQAFYTFPFQQLMAEAPNMAVVNEQQMPEEV PAPAPAQEPVQEAPKGRKRKPRTTEPKQPVEPKKPVESKKSGKSAKPKEKQEKITDTF KVKRKVDRFNGVSEAELLTKTLPDILTFNLDIVIIGINPGLMAAYKGHHYPGPGNHFW KCLFMSGLSEVQLNHMDDHTLPGKYGIGFTNMVERTTPGSKDLSSKEFREGGRILVQK LQKYQPRIAVFNGKCIYEIFSKEVFGVKVKNLEFGLQPHKIPDTETLCYVMPSSSARC AQFPRAQDKVHYYIKLKDLRDQLKGIERNMDVQEVQYTFDLQLAQEDAKKMAVKEEKY DPGYEAAYGGAYGENPCSSEPCGFSSNGLIESVELRGESAFSGIPNGQWMTQSFTDQI PSFSNHCGTQEQEEESHA" BASE COUNT 1014 a 615 c 755 g 1026 t ORIGIN 1 gcaccaggcg cccagtggag ccgtttggga gaattgcctg cgccacgcag cggggccgga 61 caggcggtaa ggatctgatt aggctttcga acttgagttt gactgatgtc ttctgtgtgg 121 tgtccgctaa atcccacagc atataggatc agtcgcattg gttataaggt ttgcttctgg 181 ctgggtgcgg tggctcatgc ctgtaatcca acattgggag gccaaggcag gcggaccacc 241 tgaagtcggg agcttgagtc cagccactgt ctgggtactg ccagccatcg ggcccaggtc 301 tctggggttg tcttaccgca gtgagtacca cgcggtacta cagagaccgg ctgcccgtgt 361 gcccggcagg tggagccgcc gcatcagcgg cctcggggaa tggaagcgga gaacgcgggc 421 agctattccc ttcagcaagc tcaagctttt tatacgtttc catttcaaca actgatggct 481 gaagctccta atatggcagt tgtgaatgaa cagcaaatgc cagaagaagt tccagcccca 541 gctcctgctc aggaaccagt gcaagaggct ccaaaaggaa gaaaaagaaa acccagaaca 601 acagaaccaa aacaaccagt ggaacccaaa aaacctgttg agtcaaaaaa atctggcaag 661 tctgcaaaac caaaagaaaa acaagaaaaa attacagaca catttaaagt aaaaagaaaa 721 gtagaccgtt ttaatggtgt ttcagaagct gaacttctga ccaagactct ccccgatatt 781 ttgaccttca atctggacat tgtcattatt ggcataaacc cgggactaat ggctgcttac 841 aaagggcatc attaccctgg acctggaaac catttttgga agtgtttgtt tatgtcaggg 901 ctcagtgagg tccagctgaa ccatatggat gatcacactc taccagggaa gtatggtatt 961 ggatttacca acatggtgga aaggaccacg cccggcagca aagatctctc cagtaaagaa 1021 tttcgtgaag gaggacgtat tctagtacag aaattacaga aatatcagcc acgaatagca 1081 gtgtttaatg gaaaatgtat ttatgaaatt tttagtaaag aagtttttgg agtaaaggtt 1141 aagaacttgg aatttgggct tcagccccat aagattccag acacagaaac tctctgctat 1201 gttatgccat catccagtgc aagatgtgct cagtttcctc gagcccaaga caaagttcat 1261 tactacataa aactgaagga cttaagagat cagttgaaag gcattgaacg aaatatggac 1321 gttcaagagg tgcaatatac atttgaccta cagcttgccc aagaggatgc aaagaagatg 1381 gctgttaagg aagaaaaata tgatccaggt tatgaggcag catatggtgg tgcttacgga 1441 gaaaatccat gcagcagtga accttgtggc ttctcttcaa atgggctaat tgagagcgtg 1501 gagttaagag gagaatcagc tttcagtggc attcctaatg ggcagtggat gacccagtca 1561 tttacagacc aaattccttc ctttagtaat cactgtggaa cacaagaaca ggaagaagaa 1621 agccatgctt aagaatggtg cttctcagct ctgcttaaat gctgcagttt taatgcagtt 1681 gtcaacaagt agaacctcag tttgctaact gaagtgtttt attagtattt tactctagtg 1741 gtgtaattgt aatgtagaac agttgtgtgg tagtgtgaac cgtatgaacc taagtagttt 1801 ggaagaaaaa gtagggtttt tgtatactag cttttgtatt tgaattaatt atcattccag 1861 ctttttatat actatatttc atttatgaag aaattgattt tcttttggga gtcactttta 1921 atctgtaatt ttaaaataca agtctgaata tttatagttg attcttaact gtgcataaac 1981 ctagatatac cattatccct tttataccta agaagggcat gctaataatt accactgtca 2041 aagaggcaaa ggtgttgatt tttgtatata agttaagcct cagtggagtc tcatttgtta 2101 gtttttagtg gtaactaagg gtaaactcag ggttccctga gctatatgca cactcagacc 2161 tctttgcttt accagtggtg tttgtgagtt gctcagtagt aaaaactggc ccttacctga 2221 cagagccctg gctttgacct gctcagccct gtgtgttaat cctctagtag ccaattaact 2281 actctggggt ggcaggttcc agagaatcga gtagaccttt tgccactcat ctgtgtttta 2341 cttgagacat gtaaatatga tagggaagga actgaatttc tccattcata tttataacca 2401 ttctagtttt atcttccttg gctttaagag tgtgccatgg aaagtgataa gaaatgaact 2461 tctaggctaa gcaaaaagat gctggagata tttgatactc tcatttaaac tggtgcttta 2521 tgtacatgag atgtactaaa ataagtaata tagaattttt cttgctaggt aaatccagta 2581 agccaataat tttaaagatt ctttatctgc atcattgctg tttgttacta taaattaaat 2641 gaacctcatg gaaaggttga ggtgtatacc tttgtgattt tctaatgagt tttccatggt 2701 gctacaaata atccagacta ccaggtctgg tagatattaa agctgggtac taagaaatgt 2761 tatttgcatc ctctcagtta ctcctgaata ttctgatttc atacgtaccc agggagcatg 2821 ctgttttgtc aatcaatata aaatatttat gaggtctccc ccacccccag gaggttatat 2881 gattgctctt ctctttataa taagagaaac aaattcttat tgtgaatctt aacatgcttt 2941 ttagctgtgg ctatgatgga ttttattttt tcctaggtca agctgtgtaa aagtcattta 3001 tgttatttaa atgatgtact gtactgctgt ttacatggac gttttgtgcg ggtgctttga 3061 agtgccttgc atcagggatt aggagcaatt aaattatttt ttcacgggac tgtgtaaagc 3121 atgtaactag gtattgcttt ggtatataac tattgtagct ttacaagaga ttgttttatt 3181 tgaatgggga aaataccctt taaattatga cggacatcca ctagagatgg gtttgaggat 3241 tttccaagcg tgtaataatg atgtttttcc taacatgaca gatgagtagt aaatgttgat 3301 atatcctata catgacagtg tgagactttt tcattaaata atattgaaag attttaaaat 3361 tcatttgaaa gtctgatggc ttttacaata aaagatatta agaattgtta // LOCUS HSU51205 922 bp mRNA PRI 13-DEC-1996 DEFINITION Human COP9 homolog (HCOP9) mRNA, complete cds. ACCESSION U51205 NID g1730283 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 922) AUTHORS Chamovitz,D.A. and Deng,X.W. TITLE The novel components of the Arabidopsis light signaling pathway may define a group of general developmental regulators shared by both animal and plant kingdoms JOURNAL Cell 82 (3), 353-354 (1995) MEDLINE 95360978 REFERENCE 2 (bases 1 to 922) AUTHORS Chamovitz,D.A. and Deng,X.-W. TITLE The COP9 complex, a link between photomorphogenesis and general development regulation JOURNAL Plant Cell Environ. (1997) In press REFERENCE 3 (bases 1 to 922) AUTHORS Chamovitz,D.A. and Deng,X.-W. TITLE Direct Submission JOURNAL Submitted (12-MAR-1996) Daniel A. Chamovitz, Biology, Yale University, P.O.Box 208104, New Haven, CT 06520-8104, USA FEATURES Location/Qualifiers source 1..922 /organism="Homo sapiens" /db_xref="taxon:9606" gene 50..679 /gene="HCOP9" CDS 50..679 /gene="HCOP9" /note="similar to Arabidopsis COP9" /codon_start=1 /product="HCOP9" /db_xref="PID:g1730284" /translation="MPVAVMAESAFSFKKLLDQCENQELEAPGGIATPPVYGQLLALY LLHNDMNNARYLWKRIPPAIKSANSELGGIWSVGQRIWQRDFPGIYTTINAHQWSETV QPIMEALRDATRRRAFALVSQAYTSIIADDFAAFVGLPVEEAVKGILEQGWQADSTTR MVLPRKPVAGALDVSFNKFIPLSEPAPVPPIPNEQQLARLTDYVAFLEN" BASE COUNT 248 a 193 c 218 g 261 t 2 others ORIGIN 1 tctggggttt ggctgtccgg acggtgcagc ggcgaggccg gccgcgaaga tgccagtggc 61 ggtgatggcg gaaagcgcct ttagtttcaa aaagttgctg gatcagtgcg agaaccagga 121 gctcgaggcc cctggaggaa ttgctacacc cccagtgtat ggtcagcttc tagctttata 181 tttgctccat aatgacatga ataatgcaag atatctttgg aaaagaatac cacctgctat 241 aaaatctgca aattctgaac ttgggggaat ttggtcagta ggacaaagaa tctggcagag 301 agatttccct gggatctata caaccatcaa cgctcaccag tggtctgaga cggtccagcc 361 aattatggaa gcacttagag atgcaacaag gagacgcgcc tttgccctgg tctctcaagc 421 gtatacttca atcatcgccg atgattttgc agcctttgtt ggacttcctg tagaagaggc 481 tgtgaaaggc atattagaac aaggatggca agctgattcc accacaagaa tggttctgcc 541 cagaaagcca gttgcagggg ccctggatgt ttcctttaac aagtttattc ccttatcaga 601 gcctgctcca gttcccccaa tacccaatga acagcagtta gccagactga cggattatgt 661 ggctttcctt gaaaactgat ttatcactct gagttcaaga ttcatcttca gaatcctgta 721 tactggacaa acgtaggaaa tgttaaagtt tgttattttc caatttattg ggatgggctt 781 taagcacctn caggcattcc tttactatgg tggataaaaa tacatattag gattttaagg 841 tnatactatt attacatttt gttcccttaa acgtttatgg ctgaataggt tgttggaaac 901 agttctccat tttgtaggta tt // LOCUS HSU51224 3341 bp DNA PRI 02-MAY-1996 DEFINITION Human U2AFBPL gene, complete cds. ACCESSION U51224 NID g1293652 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3341) AUTHORS Pearsall,R.S., Shibata,H., Brozowska,A., Yoshino,K., Okuda,K., Plass,C., Chapman,V., deJong,P., Hayashizaki,Y. and Held,W.A. TITLE Absence of imprinting for U2AFBPL, a human homologue of the imprinted mouse gene U2afbp-rs JOURNAL Unpublished REFERENCE 2 (bases 1 to 3341) AUTHORS Pearsall,R.S. TITLE Direct Submission JOURNAL Submitted (12-MAR-1996) R. Scott Pearsall, Molecular and Cellular Biology, Roswell Park Cancer Institute, Elm and Carlton Streets, Buffalo, NY 14263, USA FEATURES Location/Qualifiers source 1..3341 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="5" /map="5q22-31" repeat_region 184..452 /note="Alu element" repeat_region 878..1147 /note="Alu element" mRNA 1303..>2853 gene 1414..2853 /gene="U2AFBPL" CDS 1414..2853 /gene="U2AFBPL" /note="similar to mouse u2afbp-rs; similar to U2AF 35 kDa splicing factor" /codon_start=1 /product="U2AFBPL" /db_xref="PID:g1293653" /translation="MAALEKMTFPKKMTFPEKPSHKKYRAALKKEKRKKRRQELARLR DSGLSQEEEEDTFIEEQQLEEEKLLERERERLHEEWLLREQKAQEEFRIKKEKEEAAK KWLEEQERKLKEQWKEQQRKEREEEEQKQQEKKEKEEAVQKMLDQAENDLENSTTWQN PEPPVDFRVMEKDRANCPFYSKTGACRFGDRCSRKHNFPTSSPTLLIKSMFTTFGMEQ CRRDDYDPDASLEYSEEETYQQFLDFYEDVLPEFKNVGKVIQFKVSCNLEPHLRGNVY VQYQSEEECQAALSLFNGRWYAGRQLQCEFCPVTRWKMAICGLFEIQQCPRGKHCNFL HVFRNPNNEFWEANRDIYLSSDQTGSSFGKNSERREKMGHHDHYYSRQRGRRNPSPDH TYKRNGESERKKSSHRGKKSHKRTSKSRERHNSPSRGRNRHRSWDQGRRSQSRRSHRS RSQSSSRCRSRGRRKSGNRDRTVQSPQSK" polyA_signal 2849..2854 misc_feature complement(2902..3161) /note="Alu element" BASE COUNT 1122 a 685 c 769 g 765 t ORIGIN 1 ctgcagtgag ctgtgatacc atcactgcaa tccaggcttg gtgagacagc aagaccctgt 61 ctcaaaaaga caaaaaaaag ttgatatata aaacaaccta gaaattagta agaatacaga 121 ataaagaaca caaaatttaa gtcttttttt ttttttaaat aaaaaagggt ttcactttgt 181 cacccaggca ggagtgcagt ggcacaaata caaaacacta tagcctcaac ttcctgggct 241 caagtgattc tcccgcctca gccccccagg tagctgaaga ccacaggcat gcaggcacgc 301 accatcacgt ctggctaatt tttgtacttt ttgtagagat ggggtttcgc catgttggcc 361 gggctggtct cgaactcctg acctcaagtg atccacccat ctcagcctcc taaagtgctg 421 ggattacagg tgtgagccac tgcacctggc ctaaaatctc actctaaaga tatatacaga 481 tctctcaacc caacaaagag ggattcaaag agttaatata gtatatagag agtaacagga 541 tggactcaaa gaaactaact tccaatcctg gttctgccat ttactagcta agcaaacctt 601 atacaagtta cttaatcact taagtctggt ttttcctcta taaaataggt attaattgaa 661 aattttaaaa tttaccccta taattttgta agaaaataaa attactaagc atagcaagga 721 ctttctctga cccaagaata ctgtgttttc taacaacatc tatgaaacat tactaacagg 781 agaacatgtt agctttcagt aggggaaagc aaatcctaga accaaaaata tttaagcaaa 841 tatttatttt tattttttat ttttgagaca gagtctcact ctgtcaccca ggctggagtg 901 tagtggcaca gtcttggctc actgcaacct ctgcctcctg ggttcaagca gttctcctgc 961 ctcagcctcc caagtagctg ggattacagg cacgtgctac caagcctggc taattttttt 1021 atttttagta gagatgagtt tttgccatgt tggccaggct ggtctcaaac tcctggcctc 1081 aggtgatccg cccatttcgg cctcctaaag tgctgggatt acaggagtga gccaccgcac 1141 ctggccataa gtaaatattt tagaactctc cttttagtac atgaaatgaa attcaaattt 1201 atgatatact taattacaaa aaaagtctaa ctgcaatata aaggagaaac aaaatgaaag 1261 taatttgtaa tatatataag tatgcaaaca aaacaatact agaaaacata aaatattcat 1321 aagtagaatc actgacagtg gcagctatga acaaaatctt ccatatattc ccataggtta 1381 agaatgtctg aggggtcagg cggtgctggc aagatggctg cacttgagaa gatgacgttt 1441 cccaagaaga tgacatttcc agagaaacca agccacaaaa agtacagggc cgccctgaag 1501 aaggagaaac gaaagaaacg tcggcaggaa cttgctcgac tgagagactc aggactctca 1561 caggaggagg aagaggacac ttttattgaa gaacaacaac tagaagaaga gaagctattg 1621 gaaagagaga gggaaagatt acatgaggag tggttgctga gggagcagaa ggcacaagaa 1681 gaattcagaa taaagaagga aaaggaagag gcggctaaaa aatggctaga agaacaagag 1741 agaaagttaa aggaacaatg gaaagaacag cagaggaaag agagagaaga ggaggagcag 1801 aaacaacagg agaagaaaga aaaagaggaa gctgtgcaga agatgctgga tcaggctgaa 1861 aatgatttag aaaatagtac cacatggcaa aacccagaac cacccgtgga tttcagagta 1921 atggagaagg atcgagctaa ttgtcccttc tacagtaaaa caggagcttg cagatttgga 1981 gacagatgtt cacgtaaaca taatttccca acatctagtc ctacccttct tattaagagc 2041 atgtttacaa cgtttggaat ggagcagtgc aggagggatg actatgaccc tgacgcaagc 2101 ctggagtaca gcgaggaaga aacctaccaa cagttcctag atttctatga ggatgtgttg 2161 cccgagttca agaacgtggg gaaagtgatt cagttcaagg tcagctgcaa tttggaacct 2221 cacctgaggg gcaatgtata tgttcagtac cagtcggaag aagaatgcca agcagccctt 2281 tctctgttta acggacgatg gtatgcagga cgacagctgc aatgtgaatt ctgcccagtg 2341 acccggtgga aaatggcgat ttgtggttta tttgaaatac aacaatgtcc aagaggaaaa 2401 cactgcaact ttcttcatgt gttcagaaat cccaacaatg aattctggga agctaataga 2461 gacatctact tgtcttcaga tcagactggc tcctcctttg gcaagaactc cgagaggagg 2521 gagaagatgg gccaccacga ccactactac agcaggcagc ggggaaggag aaaccctagt 2581 ccagaccaca cctacaaaag aaatggggaa tccgagagaa aaaagagtag tcataggggg 2641 aagaaatctc acaaacgcac atcaaagagt cgggagaggc acaattcacc aagcagagga 2701 agaaataggc accgcagctg ggaccagggc cgccggagcc agagccgcag gagccaccgc 2761 agccggagcc aaagttcctc taggtgccga agtcgtggga ggaggaagtc gggtaataga 2821 gacagaactg ttcagagtcc ccaatccaaa taaactagtt ttgttcttaa aaaaaaaaaa 2881 aaaaaaaaaa agaatggacc aggccaggta tagtggctca cacctgtaat cccagcactt 2941 taggaggctg aggtgggtgg atcacttgag gtcaggagtt tgaggccagc ctggccaaca 3001 tggcaaaacc ccatttctac taaaaataca aaaattagcc aggtgtggtg gcaggcgtct 3061 gtaatccaag ctacttggga agctgaggca ggagaatcgc ttgaatctgg aaggcggagg 3121 ttgcagtgag tcgacatcat gccacttcac tccagcctgg gtgacagagc gagactgtgt 3181 ctcaaaaaat agaatttttg gactagtgga actacacagc ctgctttcca tatcagacca 3241 ttcaacccaa tgggattcac tgccataatc tccatgttat gattttaagc caggcctctc 3301 ttcacctttc ctattcctta catttaaaga ctccaaagct t // LOCUS HSU51269 4031 bp mRNA PRI 11-APR-1997 DEFINITION Human armadillo repeat protein mRNA, complete cds. ACCESSION U51269 NID g1932726 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4031) AUTHORS Sirotkin,H., O'Donnell,H., DasGupta,R., St. Halford,S., Jore,B., Puech,A., Parimoo,S., Morrow,B., Skoultchi,A., Weissman,S., Scambler,P. and Kucherlapati,R. TITLE Identification of a new human catenin gene family member (ARVCF) from the region deleted in velo-cardio-facial syndrome JOURNAL Genomics 41 (1), 75-83 (1997) MEDLINE 97271559 REFERENCE 2 (bases 1 to 4031) AUTHORS Sirotkin,H. TITLE Direct Submission JOURNAL Submitted (13-MAR-1996) Howard Sirotkin, Molecular Genetics, Aecom, 1300 Morris Park Ave., Bronx, NY 10461, USA FEATURES Location/Qualifiers source 1..4031 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="22" /map="22q11" CDS 272..3160 /note="this gene is deleted in velo-cardiofacial syndrome" /codon_start=1 /product="armadillo repeat protein" /db_xref="PID:g1932727" /translation="MEDCNVHSAASILASVKEQEARFERLTRALEQERRHVALQLERA QQPGMVSGGMGSGQPLPMAWQQLVLQEQSPGSQASLATMPEAPDVLEETVTVEEDPGT PTSHVSIVTSEDGTTRRTETKVTKTVKTVTTRTVRQVPVGPDGLPLLDGGPPLGPFAD GALDRHFLLRGGGPVATLSRAYLSSGGGFPEGPEPRDSPSYGSLSRGLGMRPPRAGPL GPGPGDGCFTLPGHREAFPVGPEPGPPGGRSLPERFQAEPYGLEDDTRSLAADDEGGP ELEPDYGTATRRRPECGRGLHTRAYEDTADDGGELADERPAFPMVTAPLAQPERGSMG SLDRLVRRSPSVDSARKEPRWRDPELPEVLAMLRHPVDPVKANAAAYLQHLCFENEGV KRRVRQLRGLPLLVALLDHPRAEVRRRACGALRNLSYGRDTDNKAAIRDCGGVPALVR LLRAARDNEVRELVTGTLWNLSSYEPLKMVIIDHGLQTLTHEVIVPHSGWEREPNEDS KPRDAEWTTVFKNTSGCLRNVSSDGAEARRRLRECEGLVDALLHALQSAVGRKDTDNK SVENCVCIMRNLSYHVHKEVPGADRYQEAEPGPLGSAVGSQRRRRDDASCFGGKKAKE EWFHQGKKDGEMDRNFDTLDLPKRTEAAKGFELLYQPEVVRLYLSLLTESRNFNTLEA AAGALQNLSAGNWMWATYIRATVRKERGLPVLVELLQSETDKVVRAVAIALRNLSLDR RNKDLIGSYAMAELVRNVRNAQAPPRPGACLEEDTVVAVLNTIHEIVSDSLDNARSLL QARGVPALVALVASSQSVREAKAASHVLQTVWSYKELRGTLQKDGWTKARFQSAAATA KGPKGALSPGGFDDSTLPLVDKSLEGEKTGSRDVIPMDALGPDGYSTVDRRERRPRGA SSAGEASEKEPLKLDPSRKAPPPGPSRPAVRLVDAVGDAKPQPVDSWV" BASE COUNT 747 a 1245 c 1401 g 638 t ORIGIN 1 cggtctcggg ttgcgcgctg ggcctggagg gagggggcgg cccccgcacc ggtccgagtt 61 gcggccgcgt ggactgcgac ccgcgccgcg ccgcaccgcg ccgcgccctg ggaacgccgc 121 tccccgcgcg ccaacggacc cggggaagcc cttctggggt ccgaggccgc gctgcggggc 181 cgcccacgct gcgctccagg agccagagcc tgcagggtgg gtgtcgggac atgctgatct 241 tccctcaaga cagctggcgg gcgctctggt catggaggac tgcaatgtgc actcggccgc 301 cagcatcctg gcctcggtga aggagcagga ggcccgcttc gagaggctga cacgggcact 361 ggagcaggag cggcgccatg ttgccctaca gctggagcgt gcccagcagc ctggcatggt 421 cagtggtggc atgggcagtg ggcagcccct gccaatggcc tggcaacagc tggtcctcca 481 ggagcagagc ccaggcagcc aggcctcact ggccacgatg ccggaggcac ctgatgtgct 541 ggaggagacc gtgacggtgg aggaggaccc cggcacaccc acttcccatg tgtctattgt 601 cacatccgaa gatggcacaa cccggcgcac cgagaccaag gtcaccaaga ctgtcaagac 661 ggtgaccact cggacagtac gccaggtgcc cgtgggccca gatggactcc ccctgctgga 721 tggcggcccc ccactaggcc cttttgcaga tggtgccctg gaccggcatt tcctgctgcg 781 tggtggtggc ccagtggcca cactctctcg agcctacctc agcagtgggg gtggctttcc 841 cgaaggcccc gagccccggg acagccccag ctatggcagc ctgtcccgag ggctgggcat 901 gcggccccca cgtgctggcc cccttggccc aggccctggt gatggctgct tcacactgcc 961 tggccaccgg gaagccttcc cggtgggtcc tgagcctggg ccaccaggtg gccgctccct 1021 gcccgagcgc ttccaggcag agccgtatgg cttggaggat gacacgcgca gcctggccgc 1081 tgatgacgag ggtggccctg agctggagcc tgactatggc acggccacaa ggaggaggcc 1141 tgagtgtggg cggggccttc ataccagggc ctacgaggac acagcagatg atggcggcga 1201 gctggcggac gagcggcctg cgttcccaat ggtgacggcg cccctggccc agcctgaacg 1261 gggcagcatg ggcagcctgg accggctggt gcggcgctcg ccctcagtgg atagcgcccg 1321 caaggagccg cgctggcggg accctgagct gcctgaggtg ctggccatgc tgcggcaccc 1381 cgtggacccc gtgaaggcca atgcggccgc ctacctgcag catctgtgct ttgagaacga 1441 gggtgtcaag cggcgtgtac ggcagttgcg ggggctgccg ctgcttgtgg cactgctgga 1501 ccacccgcgg gctgaggtgc ggcgccgggc ctgtggggca ctgcgcaacc tctcctatgg 1561 ccgcgacact gacaacaagg ccgccatccg ggactgcggt ggtgtgcctg ccctggtgcg 1621 cctgctgagg gctgcccggg acaacgaggt ccgtgagctt gtcactggca ccctgtggaa 1681 cctgtcatcc tatgagcccc tgaagatggt catcattgac catggcctgc agacgctgac 1741 ccacgaggtg atcgtgcccc actcaggatg ggagcgtgag cccaacgagg actccaagcc 1801 acgggacgcc gagtggacaa ctgtcttcaa gaacacgtcg ggctgcctga ggaatgtgag 1861 ctccgatggt gctgaggccc ggcggcgact ccgggagtgt gaagggctgg tggacgcgct 1921 cctgcatgcc ctgcagtcgg ctgtgggccg gaaggacact gacaacaagt cggtggagaa 1981 ctgcgtgtgc atcatgcgga acctgtccta ccacgtgcac aaggaggtgc ccggggccga 2041 caggtaccag gaggccgagc ccgggcccct gggcagtgct gtaggctccc agcgccggag 2101 gcgggatgat gccagctgct ttggaggcaa gaaggccaaa gaggagtggt tccaccaagg 2161 aaagaaggat ggtgagatgg accggaactt tgacacgcta gacctgccca agcgaactga 2221 ggccgccaaa ggctttgagc tgctgtacca gcccgaggtg gtacgtctct acctctccct 2281 cctcacggag agccggaact tcaacaccct ggaggctgcc gccggcgctc tgcagaacct 2341 cagtgccggc aactggatgt gggccacgta catccgcgcc acagtgcgca aagagcgcgg 2401 gctgccggtg cttgtggaac tgctgcagtc tgagaccgac aaggtggtgc gcgccgtcgc 2461 catcgctctg cgcaacctct cgctggaccg gcgcaacaaa gacctcatcg ggagctacgc 2521 catggctgag cttgtgcgga atgtgcgcaa tgcacaggct ccgccgcgac cgggggcctg 2581 cctggaggaa gacaccgtgg tggcggtgct caacaccatc cacgaaatcg tgtccgacag 2641 cctggataac gcgcgctcgc tcctgcaggc acgcggggtg ccagcgttgg tggctctcgt 2701 ggcctccagc caatcggtac gcgaagcgaa ggcggcgtca cacgtgctgc agacagtgtg 2761 gagctacaag gagctgcgtg gtaccttgca gaaagatggt tggaccaagg cgcgcttcca 2821 gtcagctgct gctactgcca aggggcctaa gggagcactg agtcctgggg gcttcgatga 2881 cagcacgctg ccactggtgg acaagagcct tgagggcgag aaaactggca gccgggatgt 2941 gatccccatg gatgcgctgg gcccagacgg atactccacg gtggaccgga gggagcggag 3001 gccacggggc gccagctctg caggagaggc ctctgagaag gaacccttga aactcgaccc 3061 cagcaggaag gcccctcccc ccgggcccag caggcccgcg gtcaggctgg tggacgccgt 3121 aggggacgct aagcctcagc ccgttgattc ctgggtctag cttgcatgcc tgggcccagg 3181 cccagcctct tgttcttagg gcttggatcg tggaagaagg gccaccctga gcagatcgtg 3241 ccgtggagct tggagccccc tggcggcagg ggttggcagc gcctcgcccc agccaagcct 3301 gactttgggg acaccctcct gcccccccac cccactcccc ccatcctacc taactcccta 3361 ggcctgagtt agctgtttac tgacatggtg ctgtgtgtga gcgagtgaca gaaggcccga 3421 ggaggccgac tgggctgctg gggcagggcc cagggaggtg gcagggaagg gtgcctggct 3481 gggctttggg gctggtggga taggagggtg cccctgggaa cacagtgccc cagtggtgcg 3541 ggccaggtct ggcgccaggg gcaggcagct aggcagggag ggggctggga aagaacttgg 3601 acttctaaag gcggagggtc tccctgggca ccctgcaggc aggcagacca cggtgtgggg 3661 ggtgccccac aaggcccagc cctgggcaag gaagctttgg aagcagcaag gtgggggtca 3721 ggctacaggt gccccacctc cccagagccc tgcagtcacg tgaggcccct gtgcagtaga 3781 gagcctttct ccagcccatg ctggctgggg ccggggctgc cgcatctgcc tggagcctgg 3841 gcagcacctc cgggactgac cttcggcagt ggctggggga ctgctttcag tgcctgctgt 3901 tcgtgactca gaaacagaag aaaaacactg agaaaaagca ttaaaaataa gccaaaataa 3961 gactattggt aaacacctaa atttttgtaa gaaaatttaa aaattaaagc agcaaaggaa 4021 aaaaaaaaaa a // LOCUS HSU51333 3062 bp mRNA PRI 15-OCT-1996 DEFINITION Human hexokinase III (HK3) mRNA, complete cds. ACCESSION U51333 NID g1255787 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3062) AUTHORS Furuta,H., Nishi,S., Le Beau,M.M., Fernald,A.A., Yano,H. and Bell,G.I. TITLE Sequence of human hexokinase III cDNA and assignment of the human hexokinase III gene (HK3) to chromosome band 5q35.2 by fluorescence in situ hybridization JOURNAL Genomics 36 (1), 206-209 (1996) MEDLINE 96411670 REFERENCE 2 (bases 1 to 3062) AUTHORS Furuta,H., Nishi,S., LeBeau,M.M., Fernald,A.A., Yano,H. and Bell,G.I. TITLE Direct Submission JOURNAL Submitted (14-MAR-1996) Hiroto Furuta, Department of Biochemistry and Molecular Biology and Medicine, Howard Hughes Medical Institute, The University of Chicago, 5841 S. Maryland Avenue, N237A, Chicago, IL 60637, USA FEATURES Location/Qualifiers source 1..3062 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="5" /map="5q35.2" gene 75..2846 /gene="HK3" CDS 75..2846 /gene="HK3" /EC_number="2.7.1.1" /note="ATP:D-hexose 6-phosphotransferase" /codon_start=1 /product="hexokinase III" /db_xref="PID:g1255788" /translation="MDSIGSSGLRQGEETLSCSEEGLPGPSDSSELVQECLQQFKVTR AQLQQIQASLLGSMEQALRGQASPAPAVRMLPTYVGSTPHGTEQGDFVVLELGATGAS LRVLWVTLTGIEGHRVEPRSQEFVIPQEVMLGAGQQLFDFAAHCLSEFLDAQPVNKQG LQLGFSFSFPCHQTGLDRSTLISWTKGFRCSGVEGQDVVQLLRDAIRRQGAYNIDVVA VVNDTVGTMMGCEPGVRPCEVGLVVDTGTNACYMEEARHVAVLDEDRGRVCVSVEWGS LSDDGALGPVLTTFDHTLDHESLNPGAQRFEKMIGGLYLGELVRLVLAHLARCGVLFG GCTSPALLSQGSILLEHVAEMEDPSTGAARVHAILQDLGLSPGASDVELVQHVCAAVC TRAAQLCAAALAAVLSCLQHSREQQTLQVAVATGGRVCERHPRFCSVLQGTVMLLAPE CDVSLIPSVDGGGRGVAMVTAVAARLAAHRRLLEETLAPFRLNHDQLAAVQAQMRKAM AKGLRGEASSLRMLPTFVRATPDGSERGDFLALDLGGTNFRVLLVRVTTGVQITSEIY SIPETVAQGSGQQLFDHIVDCIVDFQQKQGLSGQSLPLGFTFSFPCRQLGLDQGILLN WTKGFKASDCEGQDVVSLLREAITRRQAVELNVVAIVNDTVGTMMSCGYEDPRCEIGL IVGTGTNACYMEELRNVAGVPGDSGRMCINMEWGAFGDDGSLAMLSTRFDASVDQASI NPGKQRFEKMISGMYLGEIVRHILLHLTSLGVLFRGQQIQRLQTRDIFKTKFLSEIES DSLALRQVRAILEDLGLPLTSDDALMVLEVCQAVSQRAAQLCGAGVAAVVEKIRGNRG LEELAVSVGVDGTLYKLHPRFSSLVAATVRELAPRCVVTFLQSEDGSGKGAALVTAVA CRLAQLTRV" BASE COUNT 547 a 870 c 1009 g 636 t ORIGIN 1 gacaagagct cagacctgag gagagtgact agcttctctg tgtcccaggt ggccaccttc 61 cactgtggaa gctcatggac tccattgggt cttcagggtt gcggcagggg gaagaaaccc 121 tgagttgctc tgaggagggc ttgcccgggc cctcagacag ctcagagctg gtgcaggagt 181 gcctgcagca gttcaaggtg acaagggcac agctacagca gatccaagcc agcctcttgg 241 gttccatgga gcaggcgctg aggggacagg ccagccctgc ccctgcggtc cggatgctgc 301 ctacatacgt ggggtccacc ccacatggca ctgagcaagg agacttcgtg gtgctggagc 361 tgggggccac aggggcctca ctgcgtgttt tgtgggtgac tctaactggc attgaggggc 421 atagggtgga gcccagaagc caggagtttg tgatccccca agaggtgatg ctgggtgctg 481 gccagcagct ctttgacttt gctgcccact gcctgtctga gttcctggat gcgcagcctg 541 tgaacaaaca gggtctgcag cttggcttca gcttctcttt cccttgtcac cagacgggct 601 tggacaggag caccctcatt tcctggacca aaggttttag gtgcagtggt gtggaaggcc 661 aggatgtggt ccagctgctg agagatgcca ttcggaggca gggggcctac aacatcgacg 721 tggttgctgt ggtgaacgac acagtgggca ccatgatggg ctgtgagccg ggggtcaggc 781 cgtgtgaggt tgggctagtt gtagacacgg gcaccaacgc gtgttacatg gaggaggcac 841 ggcatgtggc agtgctggac gaagaccggg gccgcgtctg cgtcagcgtc gagtggggct 901 ccttaagcga tgatggggcg ctgggaccag tgctgaccac cttcgaccat accctggacc 961 atgagtccct gaatcctggt gctcagaggt ttgagaagat gatcggaggc ctgtacctgg 1021 gtgagctggt gcggctggtg ctggctcact tggcccggtg tggggtcctc tttggtggct 1081 gcacctcccc tgccctgctg agccaaggca gcatcctcct ggaacacgtg gctgagatgg 1141 aggacccctc tactggggca gcccgtgtcc atgctatcct gcaggacttg ggcctgagcc 1201 ctggggcttc ggatgttgag cttgtgcagc acgtctgtgc ggccgtgtgc acgcgggctg 1261 cccagctctg tgctgccgcc ctggccgctg ttctctcctg cctccagcac agccgggagc 1321 aacaaacact ccaggttgct gtggccaccg gaggccgagt gtgtgagcgg caccccaggt 1381 tctgcagcgt cctgcagggg acagtgatgc tcctggcccc ggaatgcgat gtctccttaa 1441 tcccctctgt ggatggtggt ggccggggag tggcgatggt gactgctgtg gctgcccgtc 1501 tggctgccca ccggcgcctg ctggaggaga ccctggcccc attccggttg aaccatgatc 1561 aactggctgc ggttcaggca cagatgcgga aggccatggc caaggggctc cgaggggagg 1621 cctcctccct tcgcatgctg cccactttcg tccgggccac ccctgacggc agcgagcgag 1681 gggatttcct ggccctggac ctcgggggca cgaacttccg tgtcctcctg gtacgtgtga 1741 ccacaggcgt gcagatcacc agcgagatct actccattcc cgagactgtg gcccagggtt 1801 ctgggcagca gctctttgac cacatcgtgg actgcatcgt ggacttccag cagaagcagg 1861 gcctgagcgg gcagagcctc ccactgggtt ttaccttctc cttcccatgt aggcagcttg 1921 gcctagacca gggcatcctc ctgaactgga ccaagggttt caaggcatca gactgcgagg 1981 gccaagatgt cgtgagtctg ttgcgggaag ccatcactcg cagacaggca gtggagctga 2041 atgtggttgc cattgtcaat gacacggtgg ggaccatgat gtcctgtggc tatgaggacc 2101 cccgttgcga gataggcctc attgtcggaa ccggcaccaa tgcctgctac atggaggagc 2161 tccggaatgt ggcgggcgtg cctggggact caggccgcat gtgcatcaac atggagtggg 2221 gcgcctttgg ggacgatggc tctctggcca tgctcagcac ccgctttgat gcaagtgtgg 2281 accaggcgtc catcaacccc ggcaagcaga ggtttgaaaa gatgatcagc ggcatgtacc 2341 tgggggagat cgtccgccac atccttttac atttaaccag ccttggcgtt ctcttccggg 2401 gccagcagat ccagcgcctt cagaccaggg acatcttcaa gaccaagttc ctctctgaga 2461 tcgaaagtga cagcctggcc ctgcggcagg tccgagccat cctagaggat ctggggctac 2521 ccctgacctc agatgacgcc ctgatggtgc tagaggtgtg ccaggctgtg tcccagaggg 2581 ctgcccagct ctgtggggcg ggtgtagctg ccgtggtgga gaagatccgg gggaaccggg 2641 gcctggaaga gctggcagtg tctgtggggg tggatggaac gctctacaag ctgcacccgc 2701 gcttctccag cctggtggcg gccacagtgc gggagctggc ccctcgctgt gtggtcacgt 2761 tcctgcagtc agaggatggg tccggcaaag gtgcggccct ggtcaccgct gttgcctgcc 2821 gccttgcgca gttgactcgt gtctgaggaa acctccaggc tgaggaggtc tccgccgcag 2881 ccttgctgga gccgggtcgg ggtctgcctg tttcccagcc aggcccagcc acccaggact 2941 cctgggacat cccatgtgtg acccctctgc ggccatttgg ccttgctccc tggctttccc 3001 tgagagaagt agcactcagg ttagcaatat atatatataa tttatttaca aaaaaaaaaa 3061 aa // LOCUS HSU51336 3049 bp mRNA PRI 17-MAY-1996 DEFINITION Human inositol 1,3,4-trisphosphate 5/6-kinase mRNA, complete cds. ACCESSION U51336 NID g1322037 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3049) AUTHORS Wilson,M.P. and Majerus,P.W. TITLE Isolation of inositol 1,3,4-trisphosphate 5/6-kinase, cDNA cloning and expression of the recombinant enzyme JOURNAL J. Biol. Chem. 271 (20), 11904-11910 (1996) MEDLINE 96216112 REFERENCE 2 (bases 1 to 3049) AUTHORS Wilson,M. TITLE Direct Submission JOURNAL Submitted (13-MAR-1996) Monita Wilson, Medicine, Washington University School of Medicine, 660 South Euclid, St. Louis, MO, 63110, USA FEATURES Location/Qualifiers source 1..3049 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="clone 15" /tissue_type="brain" /dev_stage="fetal" CDS 119..1363 /codon_start=1 /product="inositol 1,3,4-trisphosphate 5/6-kinase" /db_xref="PID:g1322038" /translation="MQTFLKGKRVGYWLSEKKIKKLNFQAFAELCRKRGMEVVQLNLS RPIEEQGPLDVIIHKLTDVILEADQNDSQSLELVHRFQEYIDAHPETIVLDPLPAIRT LLDRSKSYELIRKIEAYMEDDRICSPPFMELTSLCGDDTMRLLEKNGLTFPFICKTRV AHGTNSHEMAIVFNQEGLNAIQPPCVVQNFINHNAVLYKVFVVGESYTVVQRPSLKNF SAGTSDRESIFFNSHNVSKPESSSVLTELDKIEGVFERPSDEVIRELSRALRQALGVS LFGIDIIINNQTGQHAVIDINAFPGYEGVSEFFTDLLNHIATVLQGQSTAMAATGDVA LLRHSKLLAEPAGGLVGERTCNASPGCCGSMMGQDAPWKAEADAGGTAKLPHQRLGCN AGVSPSFQQHCVASLATKASSQ" 3'UTR 1364..3049 BASE COUNT 616 a 938 c 895 g 600 t ORIGIN 1 cccgcgggca ggggcggcga gtgcgcgggc cgccgccctt ctcggcgggc agcgcgcgag 61 gaccaggccg aggaggaagt ggcggcggcg gcggcgggct ccccgcccga ggaggaagat 121 gcagaccttt ctgaaaggga agagagttgg ctactggctg agcgagaaga aaatcaagaa 181 gctgaatttc caggctttcg ccgagctgtg caggaagcga gggatggagg ttgtgcagct 241 gaaccttagc cggccgatcg aggagcaggg ccccctggac gtcatcatcc acaagctgac 301 tgacgtcatc cttgaagccg accagaatga tagccagtcc ctggagctgg tgcacaggtt 361 ccaggagtac atcgatgccc accctgagac catcgtcctg gacccgctcc ctgccatcag 421 aaccctgctt gaccgctcca agtcctatga gctcatccgg aagattgagg cctacatgga 481 agacgacagg atctgctcgc cacccttcat ggagctcacg agcctgtgcg gggatgacac 541 catgcggctg ctggagaaga acggcttgac tttcccattc atttgcaaaa ccagagtggc 601 tcatggcacc aactctcacg agatggctat cgtgttcaac caggagggcc tgaacgccat 661 ccagccaccc tgcgtggtcc agaatttcat caaccacaac gccgtcctgt acaaggtgtt 721 cgtggttggc gagtcctaca ccgtggtcca gaggccctca ctcaagaact tctccgcagg 781 cacatcagac cgtgagtcca tcttcttcaa cagccacaac gtgtcaaagc cggagtcgtc 841 atcggtcctg acggagctgg acaagatcga gggcgtgttc gagcggccga gcgacgaggt 901 catccgggag ctctcccggg ccctgcggca ggcactgggc gtgtcactct tcggcatcga 961 catcatcatc aacaaccaga cagggcagca cgccgtcatt gacatcaatg ccttcccagg 1021 ctacgagggc gtgagcgagt tcttcacaga cctcctgaac cacatcgcca ctgtcctgca 1081 gggccagagc acagccatgg cagccacagg ggacgtggcc ctgctgaggc acagcaagct 1141 tctggccgag ccggcgggcg gcctggtggg cgagcggaca tgcaacgcca gccccggctg 1201 ctgcggcagc atgatgggcc aggacgcgcc ctggaaagct gaggccgacg cgggcggcac 1261 cgccaagctg ccgcaccaga gactcggctg caacgccggc gtgtctccca gcttccagca 1321 gcattgtgtg gcctccctgg ccaccaaggc ctcctcccag tagccacgga gccgggaccc 1381 agagggcagc gcaggcgcag gagcacaccc gctgggccag cagctcccaa cggcgatgct 1441 actactaaga atccccagtg atctgattct tctgtttttt aatttttaac ctgattttct 1501 gatgtcatga tctaaatgag gggtagaaga gagtaccagg tggtccaccg ttggggagcg 1561 gggccgtccg cctgctctct actgtgcaga cctcctaact gagtttacac acgcttgtgt 1621 tgcaacacta ggtctggatg ggaggtgagg ggggtgcgta tactgccatg ccagtgtctg 1681 tgcacatccc tgtctgttgt ctccatggcc actgtggact gggacccttg aagcctgccc 1741 atgtgggtgt gggaggctga tcagtgcgtg tgagagtggc ttcccttctg cctgactccc 1801 cactccctga cctgcccctt ccttgttttt cctcctactg gtctccacca aggctttgtt 1861 agcccccacc ctgcctggtg tgcagctaac ccctccctcc ccacagccag aggaggccac 1921 agacccctca gggagttccg cgctggggtc tgggctgtgc tccctcacta aagggaagga 1981 aaggaagctg ggcgtcctcc gggcccccca acacacgtcc catttagccc tgcacagcgg 2041 tctccttccc ctaagccagc actgctgctc cctggagccg ggaaggaggc tgcctggctg 2101 gaggccgagc cgatgggcct gtgctgagga tttgtgctgt gatttgggca aatcattcca 2161 ggtctttggg cctccacccc ctcgtctcta gtggacattt gagatcagag agcaccacag 2221 ggctggcttt gtgccctaac ccctgggatg cagcctgcct ttccataaag tcacctaggt 2281 gaggataggc gcgggagcct cggcatgaca ccatggagat cggggccctc ttcccagtgg 2341 gttcactcct tttcacacct gctgggtccc tcctcgccca gcaggcctgg tccacctctc 2401 attgcaagcc cgcaagcact gagccgagta aggtgcttag tgtgagccac ccgcccccca 2461 tagcttctgc acacctcaga ctcaccccat caccttggca gcaaagcact gctctgccgt 2521 ctgacccctg atccaggcag cagccccctc cgcagagaaa agggttgggg agaagcctct 2581 gcagtcctgg aagatgtggg gtgctgggtg agaggcatca gcccccacaa gtatgttttt 2641 gtgtcttaag atagcagttt actttgaaaa agtgaaaaag gcttccgggc tgtcctctgc 2701 ccagtgagat ggaggacgct agagaaagtg ctgagtgtcc cgagagaggc ccccgagcca 2761 gtgcatggag gtcttcggcc tggctcagct gggctgcagg atgcccactt tgaggaggga 2821 ggcacagggc ttgggcgagg ggcagaggcc atcagaactg cccggctttt ttggaaactg 2881 aggacccaac aactaaccac gtttacacga cttgagtttt gaaccccgat taatgtctgt 2941 acgtcacctt tcctagttct gaccctgagc cctggggaac aggaaagcgt ggctggcctc 3001 ttgcactgct ttgtctccaa aataaactac tgaaatcaaa ccgcatttc // LOCUS HSU51432 2122 bp mRNA PRI 28-MAR-1996 DEFINITION Human nuclear protein Skip mRNA, complete cds. ACCESSION U51432 NID g1236985 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2122) AUTHORS Dahl,R., Wani,B. and Hayman,M.J. TITLE The Ski oncoprotein interacts with Skip, the human homolog of drosophilla Bx42 JOURNAL Unpublished REFERENCE 2 (bases 1 to 2122) AUTHORS Dahl,R., Wani,B. and Hayman,M.J. TITLE Direct Submission JOURNAL Submitted (14-MAR-1996) Microbiology, SUNY Stony Brook, Life Sciences, Stony Brook, NY 11794, USA FEATURES Location/Qualifiers source 1..2122 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="14" CDS 22..1632 /note="Nuclear protein; similar to the Drosophila puff specific protein Bx42" /codon_start=1 /product="Skip" /db_xref="PID:g1236986" /translation="MALTSFLPAPTQLSQDQLEAEEKARSQRSRQTSLVSSRREPPPY GYRKGWIPRLLEDFGDGGAFPEIHVAQYPLDMGRKKKMSNALAIQVDSEGKIKYDAIA RQGQSKDKVIYSKYTDLVPKEVMNADDPDLQRPDEEAIKEITEKTRVALEKSVSQKVA AAMPVRAADKLAPAQYIRYTPSQQGVAFNSGAKQRVIRMVEMQKDPMEPPRFKINKKI PRGPPSPPAPVMHSPSRKMTVKEQQEWKIPPCISNWKNAKGYTIPLDKRLAADGRGLQ TVHINENFAKLAEALYIADRKAREAVEMRAQVERKMAQKEKEKHEEKLREMAQKARER RAGIKTHVEKEDGEARERDEIRHDRRKERQHDRNLSRAAPDKRSKLQRNENRDISEVI ALGVPNPRTSNEVQYDQRLFNQSKGMDSGFAGGEDEIYNVYDQAWRGGKDMAQSIYRP SKNLDKDMYGDDLEARIKTNRFVPDKEFSGSDRRQRGREGPVQFEEDPFGLDKFLEEA KQHGGSKRPSDSSRPKEHEHEGKKRRKE" BASE COUNT 697 a 415 c 525 g 485 t ORIGIN 1 aggaagaaga agcggtagaa gatggcgctc accagctttt tacctgcacc tactcagcta 61 tctcaggacc agcttgaggc tgaagaaaag gcaagatccc agagatcacg gcagacctca 121 ctggtctcct cccgaagaga acctcccccg tacggatacc ggaaaggctg gatacctcgg 181 ttattagagg attttggaga tggaggtgct tttccagaga tccatgtggc ccagtatcca 241 ctggatatgg gacgaaagaa aaaaatgtcg aatgcgctgg ccattcaggt ggattctgaa 301 ggaaaaatta aatatgatgc aattgctcga caaggacagt caaaagacaa ggtcatttat 361 agcaaataca ctgacctggt tccaaaggag gttatgaatg cagatgatcc agacctgcaa 421 aggcccgatg aagaagctat taaagagata acagaaaaga caagagtagc cttagaaaaa 481 tctgtatcac agaaggtcgc cgcagccatg ccagttcgag cagctgacaa attggctcct 541 gctcagtata tccgatacac accatctcag caaggagtgg cattcaactc tggagctaaa 601 cagagggtta ttcggatggt agaaatgcag aaagatccaa tggagcctcc aaggttcaag 661 attaataaga aaattccccg gggaccacct tctcctcctg cgcctgtcat gcattctcct 721 agccgaaaga tgactgtaaa ggaacaacaa gagtggaaga ttcctccttg tatttctaac 781 tggaaaaatg caaagggtta tacaattcca ttagacaaac gtctggctgc tgatggaaga 841 ggactacaga cagtacacat aaatgaaaat ttcgccaaat tggcagaagc cctctacatt 901 gctgatcgga aggctcgtga agctgtggaa atgcgtgccc aagtagagag aaaaatggct 961 cagaaagaaa aggaaaaaca tgaagagaaa cttagagaaa tggcccagaa agccagggaa 1021 agaagagctg ggatcaaaac tcatgtggaa aaagaggatg gggaggcacg tgagagggat 1081 gaaatccggc atgacaggcg aaaagagaga cagcatgacc ggaatctttc cagggcagct 1141 cctgataaga ggtcgaaact tcagagaaat gaaaatcggg atatcagtga agttattgct 1201 ctcggtgttc ctaatcctcg gacttccaat gaagttcagt atgaccaaag gctcttcaac 1261 caatccaagg gtatggacag tggatttgca ggtggagaag atgaaattta taatgtttat 1321 gatcaagcct ggagaggtgg taaagatatg gcccagagta tttataggcc cagtaaaaat 1381 ctggacaagg acatgtatgg tgatgaccta gaagccagaa taaagaccaa cagatttgtt 1441 cccgacaagg agttttctgg ttcagaccgt agacagagag gccgagaagg accagttcag 1501 tttgaggaag atccttttgg tttggacaag tttttggaag aagccaaaca gcatggtggc 1561 tctaaaagac cctcagatag cagccgcccc aaggaacacg agcatgaagg caagaagagg 1621 aggaaggaat aggcacaggt ctctccaaag tgaatgaact cttacccata accctaatga 1681 tgcaagtcat atgggggaac actttgtaaa tggtcaggat aaaaaccaaa tctgggtgcc 1741 agatcccagc actacttttt attactggag aaatgggggg gatagaaaat tctactttga 1801 attatttagt tttttttaaa gagtgggttg tgtttgtgct tctcccacct ttcagcattt 1861 atagaacatg ctgccccaca tacaaagtca agaccactta cttttatgtg acactagtag 1921 tttggggtta atgttttgtg taagaacagc tgcatatgag taaagttacc ccaaccacag 1981 tgaggaggaa gatgttcaca tactggaact gtcctgccaa ataaattttg cccctattgt 2041 gctctgtttt aatttggagt gggcaaagta acctcttgct tggtgcaact atttgtttca 2101 aataaaaaca tttagacaaa aa // LOCUS HSU51477 3490 bp mRNA PRI 16-MAY-1996 DEFINITION Human diacylglycerol kinase zeta mRNA, complete cds. ACCESSION U51477 NID g1293078 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3490) AUTHORS Bunting,M., Tang,W., Zimmerman,G.A., McIntyre,T.M. and Prescott,S.M. TITLE Molecular cloning and characterization of a novel human diacylglycerol kinase zeta JOURNAL J. Biol. Chem. 271 (17), 10230-10236 (1996) MEDLINE 96215319 REFERENCE 2 (bases 1 to 3490) AUTHORS Prescott,S.M. TITLE Direct Submission JOURNAL Submitted (14-MAR-1996) S.M. Prescott, Program in Human Molecular Biology and Genetics, University of Utah, Bldg. 533 Rm. 4220, Salt Lake City, UT 84112, USA FEATURES Location/Qualifiers source 1..3490 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="umbilical vein endothelial cell" CDS 89..2875 /note="DGK" /codon_start=1 /product="diacylglycerol kinase zeta" /db_xref="PID:g1293079" /translation="MEPRDGSPEARSSDSESASASSSGSERDAGPEPDKAPRRLNKRR FPGLRLFGHRKAITKSGLQHLAPPPPTPGAPCSESERQIRSTVDWSESATYGEHIWFE TNVSGDFCYVGEQYCVARMLKSVSRRKCAACKIVVHTPCIEQLEKINFRCKPSFRESG SRNVREPTFVRHHWVHRRRQDGKCRHCGKGFQQKFTFHSKEIVAISCSWCKQAYHSKV SCFMLQQIEEPCSLGVHAAVVIPPTWILRARRPQNTLKASKKKKRASFKRKSSKKGPE EGRWRPFIIRPTPSPLMKPLLVFVNPKSGGNQGAKIIQSFLWYLNPRQVFDLSQGGPK EALEMYRKVHNLRILACGGDGTVGWILSTLDQLRLKPPPPVAILPLGTGNDLARTLNW GGGYTDEPVSKILSHVEEGNVVQLDRWDLHAEPNPEAGPEDRDEGATDRLPLDVFNNY FSLGFDAHVTLEFHESREANPEKFNSRFRNKMFYAGTAFSDFLMGSSKDLAKHIRVVC DGMDLTPKIQDLKPQCVVFLNIPRYCAGTMPWGHPGEHHDFEPQRHDDGYLEVIGFTM TSLAALQVGGHGERLTQCREVVLTTSKAIPVQVDGEPCKLAASRIRIALRNQATMVQK AKRRSAAPLHSDQQPVPEQLRIQVSRVSMHDYEALHYDKEQLKEASVPLGTVVVPGDS DLELCRAHIERLQQEPDGAGAKSPTCQKLSPKWCFLDATTASRFYRIDRAQEHLNYVT EIAQDEIYILDPELLGASARPDLPTPTSPLPTSPCSPTPRSLQGDAAPPQGEELIEAA KRNDFCKLQELHRAGGDLMHRDEQSRTLLHHAVSTGSKDVVRYLLDHAPPEILDAVEE NGETCLHQAAALGQRTICHYIVEAGASLMKTDQQGDTPRQRAEKAQDTELAAYLENRQ HYQMIQREDQETAV" BASE COUNT 683 a 1154 c 1094 g 559 t ORIGIN 1 gcggcgcgga gcgggcgtgc tgagccccgg ccgccggccc ggcatgggcg tctcccgcgg 61 gccctccgcc ggccggggct agggccggat ggagccgcgg gacggtagcc ccgaggcccg 121 gagcagcgac tccgagtcgg cttccgcctc gtccagcggc tccgagcgcg acgccggtcc 181 cgagccggac aaggcgccgc ggcgactcaa caagcggcgc ttcccggggc tgcggctctt 241 cgggcacagg aaagccatca ccaagtcggg cctccagcac ctggcccccc ctccgcccac 301 ccctggggcc ccgtgcagcg agtcagagcg gcagatccgg agtacagtgg actggagcga 361 gtcagcgaca tatggggagc acatctggtt cgagaccaac gtgtccgggg acttctgcta 421 cgttggggag cagtactgtg tagccaggat gctgaagtca gtgtctcgaa gaaagtgcgc 481 agcctgcaag attgtggtgc acacgccctg catcgagcag ctggagaaga taaatttccg 541 ctgtaagccg tccttccgtg aatcaggctc caggaatgtc cgcgagccaa cctttgtacg 601 gcaccactgg gtacacagac gacgccagga cggcaagtgt cggcactgtg ggaagggatt 661 ccagcagaag ttcaccttcc acagcaagga gattgtggcc atcagctgct cgtggtgcaa 721 gcaggcatac cacagcaagg tgtcctgctt catgctgcag cagatcgagg agccgtgctc 781 gctgggggtc cacgcagccg tggtcatccc gcccacctgg atcctccgcg cccggaggcc 841 ccagaatact ctgaaagcaa gcaagaagaa gaagagggca tccttcaaga ggaagtccag 901 caagaaaggg cctgaggagg gccgctggag acccttcatc atcaggccca ccccctcccc 961 gctcatgaag cccctgctgg tgtttgtgaa ccccaagagt gggggcaacc agggtgcaaa 1021 gatcatccag tctttcctct ggtatctcaa tccccgacaa gtcttcgacc tgagccaggg 1081 agggcccaag gaggcgctgg agatgtaccg caaagtgcac aacctgcgga tcctggcgtg 1141 cgggggcgac ggcacggtgg gctggatcct ctccaccctg gaccagctac gcctgaagcc 1201 gccaccccct gttgccatcc tgcccctggg tactggcaac gacttggccc gaaccctcaa 1261 ctggggtggg ggctacacag atgagcctgt gtccaagatc ctctcccacg tggaggaggg 1321 gaacgtggta cagctggacc gctgggacct ccacgctgag cccaaccccg aggcagggcc 1381 tgaggaccga gatgaaggcg ccaccgaccg gttgcccctg gatgtcttca acaactactt 1441 cagcctgggc tttgacgccc acgtcaccct ggagttccac gagtctcgag aggccaaccc 1501 agagaaattc aacagccgct ttcggaataa gatgttctac gccgggacag ctttctctga 1561 cttcctgatg ggcagctcca aggacctggc caagcacatc cgagtggtgt gtgatggaat 1621 ggacttgact cccaagatcc aggacctgaa accccagtgt gttgttttcc tgaacatccc 1681 caggtactgt gcgggcacca tgccctgggg ccaccctggg gagcaccacg actttgagcc 1741 ccagcggcat gacgacggct acctcgaggt cattggcttc accatgacgt cgttggccgc 1801 gctgcaggtg ggcggacacg gcgagcggct gacgcagtgt cgcgaggtgg tgctcaccac 1861 atccaaggcc atcccggtgc aggtggatgg cgagccctgc aagcttgcag cctcacgcat 1921 ccgcatcgcc ctgcgcaacc aggccaccat ggtgcagaag gccaagcggc ggagcgccgc 1981 ccccctgcac agcgaccagc agccggtgcc agagcagttg cgcatccagg tgagtcgcgt 2041 cagcatgcac gactatgagg ccctgcacta cgacaaggag cagctcaagg aggcctctgt 2101 gccgctgggc actgtggtgg tcccaggaga cagtgaccta gagctctgcc gtgcccacat 2161 tgagagactc cagcaggagc ccgatggtgc tggagccaag tccccgacat gccagaaact 2221 gtcccccaag tggtgcttcc tggacgccac cactgccagc cgcttctaca ggatcgaccg 2281 agcccaggag cacctcaact atgtgactga gatcgcacag gatgagattt atatcctgga 2341 ccctgagctg ctgggggcat cggcccggcc tgacctccca acccccactt cccctctccc 2401 cacctcaccc tgctcaccca cgccccggtc actgcaaggg gatgctgcac cccctcaagg 2461 tgaagagctg attgaggctg ccaagaggaa cgacttctgt aagctccagg agctgcaccg 2521 agctgggggc gacctcatgc accgagacga gcagagtcgc acgctcctgc accacgcagt 2581 cagcactggc agcaaggatg tggtccgcta cctgctggac cacgcccccc cagagatcct 2641 tgatgcggtg gaggaaaacg gggagacctg tttgcaccaa gcagcggccc tgggccagcg 2701 caccatctgc cactacatcg tggaggccgg ggcctcgctc atgaagacag accagcaggg 2761 cgacactccc cggcagcggg ctgagaaggc tcaggacacc gagctggccg cctacctgga 2821 gaaccggcag cactaccaga tgatccagcg ggaggaccag gagacggctg tgtagcgggc 2881 cgcccacggg cagcaggagg gacaatgcgg ccaggggacg agcgccttcc ttgcccacct 2941 cactgccaca ttccagtggg acggccacgg ggggacctag gccccaggga aagagcccca 3001 tgccgccccc taaggagccg cccagaccta gggctggact caggagctgg gggggcctca 3061 cctgttcccc tgaggacccc gccggacccg gaggctcaca gggaacaaga cacggctggg 3121 ttggatatgc ctttgccggg gttctggggc agggcgctcc ctggccgcag cagatgccct 3181 cccaggagtg gaggggctgg agagggggag gccttcggga agaggcttcc tgggccccct 3241 ggtcttcggc cgggtcccca gcccccgctc ctgccccacc ccacctcctc cgggcttcct 3301 cccggaaact cagcgcctgc tgcacttgcc tgccctgcct tgcttggcac ccgctccggc 3361 gaccctcccc gctcccctgt catttcatcg cggactgtgc ggcctggggg tggggggcgg 3421 gactctcacg gtgacatgtt tacagctggg tgtgactcag taaagtggat ttttttttct 3481 ttaaaaaaaa // LOCUS HSU51587 4860 bp mRNA PRI 23-OCT-1997 DEFINITION Homo sapiens Golgi complex autoantigen golgin-97 mRNA, complete cds. ACCESSION U51587 NID g1669823 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4860) AUTHORS Griffith,K.J., Chan,E.K., Lung,C.C., Hamel,J.C., Guo,X., Miyachi,K. and Fritzler,M.J. TITLE Molecular cloning of a novel 97-kd Golgi complex autoantigen associated with Sjogren's syndrome JOURNAL Arthritis Rheum. 40 (9), 1693-1702 (1997) MEDLINE 97464204 REFERENCE 2 (bases 1 to 4860) AUTHORS Chan,E.K.L. TITLE Direct Submission JOURNAL Submitted (15-MAR-1996) Molecular and Experimental Medicine, The Scripps Research Intitute, 10550 N. Torrey Pines Road, La Jolla, CA 92037, USA FEATURES Location/Qualifiers source 1..4860 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 335..2638 /codon_start=1 /product="Golgi complex autoantigen golgin-97" /db_xref="PID:g1669824" /translation="MFAKLKKKIAEETAVAQRPGGATRIPRSVSKESVASMGADSGDD FASDGSSSREDLSSQLLRRNEQIRKLEARLSDYAEQVRNLQKIKEKLEIALEKHQDSS MRKFQEQNETFQANRAKMAEGLALALARKDQEWSEKMDQLEKEKNILTAQLQEMKNQS MNLFQRRDEMDELEGFQQQELSKIKHMLLKKEESLGKMEQELEARTRELSRTQEELMN SNQMSSDLSQKLEELQRHYSTLEEQRDHVIASKTGAESKITALEQKEQELQALIQQLS IDLQKVTAETQEKEDVITHLQEKVASLEKRLEQNLSGEEHVQELLKEKTLAEQNLEDT RQQLLAARSSQAKAINTLETRVRELEQTLQASEEQLQQSKGIVAAQETQIQELAAANQ ESSHVQQQALALEQQFLERTQALEAQIVALERTRAADQTTAEQGMRQLEQENAALKEC RNEYERSLQNHQFELKKLKEEWSQREIVSVAMAQALEEVRKQREEFQQQAANLTAIID EKEQNLREKTEVLLQKEQEILQLERGHNSALLQIHQLQAELEALRTLKAEEAAVVAEQ EDLLRLRGPLQAEALSVNESHVTSRAMQDPVFQLPTAGRTPNGEVGAMDLTQLQKEKQ DLEQQLLEKNKTIKQMQQRMLELRKTLQKELKIRPDNELFEVREKPGPEMANMAPSVT NNTDLTDAREINFEYLKHVVLKFMSCRESEAFHLIKAVSVLLNFSQEEENMLKETLEY KMSWFGSKPAPKGSIRPSISNPRIPWS" BASE COUNT 1448 a 1130 c 1185 g 1097 t ORIGIN 1 gcactgggag acgtgcggtt ccgggtcgct gctcggctcg ccggctgggc ggtgggggtt 61 ggtgacgcgg gccggcgctc acgagaggcc cgggaggcgg ggctttgctg gcttcccaga 121 gagaggcagg acagaatgct ttgacctcca agctgtttta aatctagtag ataagccaga 181 tcctgtgttg ccataagccc ttggcccaca tttaagtggg aatgcagcta gcttggatgt 241 ctgaaacttt gtaggcgcct ctgtctgaat cctgaacaca ggcaccagga ctactgagag 301 ctcgtcatct gtgcaggata gccacacagc aaacatgttt gcaaaactga agaagaaaat 361 tgcagaagag actgctgttg ctcagaggcc aggaggtgct actaggatcc cacggtctgt 421 gagcaaggaa tcagttgcct caatgggagc tgactcagga gatgactttg cttccgatgg 481 aagcagctcc agagaagatc tttcatccca gcttctgaga aggaatgaac agatacggaa 541 gttagaggcc agactttctg actatgctga acaggtccga aacttgcaga agataaaaga 601 gaagcttgaa attgcattag aaaaacacca ggattcttcc atgcggaaat ttcaagagca 661 gaatgagaca ttccaagcca acagagccaa aatggcagaa ggactggctt tggcattagc 721 cagaaaggac caggaatggt cagaaaagat ggatcagctt gaaaaggaga aaaatattct 781 gacagcccag ttacaggaaa tgaagaacca gagtatgaat cttttccaaa ggagagatga 841 aatggatgaa ttagaggggt tccagcagca ggaactaagt aaaataaagc acatgctttt 901 aaaaaaagaa gaaagtctag ggaaaatgga acaagaattg gaggcacgaa ccagagaact 961 tagtcgtacc caggaggagt tgatgaactc caatcagatg tcatcagact taagccagaa 1021 gctagaagaa ttgcagagac actactcaac gctggaagag cagagagatc atgtgatagc 1081 ttcaaaaaca ggtgcagaaa gtaagatcac agccctggaa caaaaggaac aagagctcca 1141 agcactcatt cagcagcttt ccattgattt gcaaaaggtc actgctgaaa ctcaagagaa 1201 agaagacgtt atcacacatt tgcaagagaa ggttgcatcc ttggagaaga gactagaaca 1261 gaacttatca ggagaagaac acgtgcaaga actcctgaaa gagaaaacac ttgctgagca 1321 gaatttggag gataccagac aacagctctt ggcagccaga agcagccagg ctaaggccat 1381 taacaccctg gagactcggg tgagagaact ggagcagacc ttgcaggcct ctgaggagca 1441 gctccaacag agcaagggca ttgtggctgc ccaggaaact cagatacagg agctcgctgc 1501 cgccaaccag gagagcagcc atgtgcagca gcaggccctt gctctggagc agcagttctt 1561 ggagcgcacc caggcgctag aagcccagat agtggccctg gagagaacgc gggcagctga 1621 ccagaccacc gcagagcaag ggatgagaca actggagcaa gaaaatgcag cccttaaaga 1681 atgcaggaat gaatatgaac gttctttaca aaatcaccaa tttgaactaa agaagctgaa 1741 ggaagaatgg agccaaagag aaattgtgag cgtggccatg gctcaagccc tggaggaggt 1801 gcggaagcaa agggaagagt tccagcaaca ggcagctaac ctgacagcca taatagacga 1861 gaaggaacag aatctgcggg aaaaaaccga agtgcttctc cagaaagagc aggagattct 1921 ccagctggag cgaggtcaca actctgccct gctgcagata caccagctgc aggccgagct 1981 ggaggccctg aggaccctca aggcggagga ggctgcagtg gtcgcggagc aggaggacct 2041 gctgaggctg cggggcccat tgcaggccga agcactctca gtcaatgagt cgcacgtgac 2101 ctcgagggcc atgcaggacc ctgtgttcca gcttccaact gcaggaagaa caccaaatgg 2161 tgaggttggg gccatggatc tcacacagct acagaaggag aaacaggact tggagcagca 2221 acttctggag aaaaataaga ccataaagca gatgcagcag cggatgctgg agctccggaa 2281 gactctgcag aaggagctga aaatcagacc cgataatgag ctcttcgaag tccgggagaa 2341 acctggacct gagatggcaa acatggcgcc ttccgtcacg aataacactg acctgacaga 2401 tgcccgcgag atcaactttg agtaccttaa acatgtggtt ttaaaattca tgtcttgtcg 2461 cgaatccgag gcttttcatc ttataaaagc tgtgtcagtg ttgctgaact tttcccaaga 2521 ggaggagaac atgctcaagg aaactctgga atataagatg tcatggtttg ggtccaaacc 2581 agctcccaag ggcagcatcc ggccgtctat ctcaaaccct cggataccat ggtcctagag 2641 gggactaccc aaggatggag ctccgtgggt tgacactttt tctgtgaaaa gaacactgac 2701 acaccagtct gggtgggttt ttaatcactg taactgcagt attttgtaca agtgtctaaa 2761 cattgtttac aagactaagg cccacttccc tgcaggctga cctgaacctc agggggtagc 2821 tgatcctgtc attctggtca ccaaacagga gggtcctggc actacccaga tttccacagt 2881 gctgctaata tcccagctcc agccagcacc ccatctgcac ctgaatcctc taacttcacg 2941 gtagcactta cagctgaagc catcagcatc tggcaggcac acctgagtca ccatgtagcg 3001 ctgctactgg aggtagagac ggccctttga gatggtgccc agcaggccaa acccacctgc 3061 ctctgccagg aacagccaac tccatgggaa ctctatggga gtggctttta aaaattcaga 3121 tgagttagaa gctttttatc ccttcctctc aagaaaatat tctttcaccc tgtctctcaa 3181 accacctaga actttagagg atccatcttt aagggccggt gtggatgaat gagaaaatgc 3241 acctttctga cagtatctcc actttactta agaaaactag caaatatatg aaaagaccct 3301 tagtaccaaa tacactcaat tgccttttta atgaatgtac ttgtcttgga taggttgctg 3361 gtaaaccatt ttaaactatt ttttatagct gaagttcttc actactataa acgtcttctg 3421 tactataaaa tccatttaac tggtgttctt aaaatcagag cgtccagagg aaattcttcc 3481 taaaactagg attcctgttc ctttgtcttc tcactcgcac tctggcactg ctccctctga 3541 agtgcagtgg gatctcgtgt gctttgtctt gattctgtgc tgcgctgccg ctgggcgatg 3601 cagaccacct gtcttctact gaaggacagt ccgctgtctc cagtgggggc agcagctgtc 3661 ccccagcctc gatggagaca ttggggcagt ctgccttgtc tgtggagctg ctctctctcc 3721 ctcatcccac cccaaatact taaaatgaca ctacacccag acggcgccca gctggctgca 3781 gcacttgtag catgcacatg actctggtag taaccaacaa aaacttgttt tatggattcc 3841 tcgtttactg aggaaaggga acatgctggt tctggaaaag ccacaatatt gaatctaaaa 3901 ggaaaccgtt tattgtttga tgaaagtttc actggttaaa taaaaaacta aattaataac 3961 tggagcctct aaatttatta tccattatcc agtgatggaa agttgtattt ctcaatcatg 4021 cttagggcca aaataggtat ataaaatgtg tcacagaaaa acacgcattt gcaacgttaa 4081 cctaacgaaa tttccatgaa gaaccaagtc agggcagcat ctccttagtc ccagctcagg 4141 ctctctgcct tccagaggcc gcttctccag tgactaacct cctcctctgg ctcctccttg 4201 cagacagtta tcccttgttt agaacacgaa tttccattta cctggtggga acacgaaaca 4261 ggagtctctt ctgttctgca agtttgatgg gtaagaggta gccttttttc aaagtaggat 4321 ttcctttttc aactgttcca ggaaagaatc tctaagactg ggtagctcac agccagccaa 4381 aggcagctac atttccacag aagcccgtcc gctgcctccg tggcttctcc agccattgaa 4441 ctggtcccac acgcacccca ggccccactc ctcggcagtt tcaggtgtag ctgtggggcc 4501 cgttcctagg tctgtactca ctttagggag gcttcactga ctaggctttc ctcctgcatg 4561 ttgaatttcc ttcagcttta agaggaagag tggaataaat attctaagtg atttaatgca 4621 ctttgacttg tataaaactt tctgtgttag cgacggtatc tatagccctt tatacgagcg 4681 atggatcttg agctctcctt ccatgttgta aatagggatt gtattcttga aaactgctgt 4741 agcaaattca tctgtggtgc aatacacttt ttgattaaag ctcttcaatc cagatgcaaa 4801 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa // LOCUS HSU51626 2730 bp mRNA PRI 12-APR-1997 DEFINITION Human MOP2 mRNA, complete cds. ACCESSION U51626 NID g1695800 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2730) AUTHORS Hogenesch,J.B., Chan,W.K., Jackiw,V.H., Brown,R.C., Gu,Y.Z., Pray-Grant,M., Perdew,G.H. and Bradfield,C.A. TITLE Characterization of a subset of the basic-helix-loop-helix-PAS superfamily that interacts with components of the dioxin signaling pathway JOURNAL J. Biol. Chem. 272 (13), 8581-8593 (1997) MEDLINE 97236817 REFERENCE 2 (bases 1 to 2730) AUTHORS Hogenesch,J.B. TITLE Direct Submission JOURNAL Submitted (18-MAR-1996) John B. Hogenesch, MPBC, Northwestern University, 745 N. Fairbanks Ct./Ward 8-296, Chicago, IL 60611, USA FEATURES Location/Qualifiers source 1..2730 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 33..2645 /note="HIF2-alpha; PAS protein 2" /codon_start=1 /product="MOP2" /db_xref="PID:g1695801" /translation="MTADKEKKRSSSERRKEKSRDAARCRRSKETEVFYELAHELPLP HSVSSHLDKASIMRLAISFLRTHKLLSSVCSENESEAEADQQMDNLYLKALEGFIAVV TQDGDMIFLSENISKFMGLTQVELTGHSIFDFTHPCDHEEIRENLSLKNGSGFGKKSK DMSTERDFFMRMKCTVTNRGRTVNLKSATWKVLHCTGQVKVYNNCPPHNSLCGYKEPL LSCLIIMCEPIQHPSHMDIPLDSKTFLSRHSMDMKFTYCDDRITELIGYHPEELLGRS AYEFYHALDSENMTKSHQNLCTKGQVVSGQYRMLAKHGGYVWLETQGTVIYNPRNLQP QCIMCVNYVLSEIEKNDVVFSMDQTESLFKPHLMAMNSIFDSSGKGAVSEKSNFLFTK LKEEPEELAQLAPTPGDAIISLDFGNQNFEESSAYGKAILPPSQPWATELRSHSTQSE AGSLPAFTVPQAAAPGSTTPSATSSSSSCSTPNSPEDYYTSLDNDLKIEVIEKLFAMD TEAKDQCSTQTDFNELDLETLAPYIPMDGEGFQLSPICPEERLLAENPQSTPQHCFSA MTNIFQPLAPVAPHSPFLLDKFQQQLESKKTEPERRPMSSIFFDAGSKASLPPCCGQA STPLSSMGGRSNTQWPPDPPLHFGPTKWAVGDQRTEFLGAAPLGPPVSPPHVSTFKTR SAKGFGARGPNVLSPAMVALSNKLKLKRQLEYEKQAFQDPSGGDPPGGSTSHLMWKRM KNLRGGSCPLMPDKPLSANVPNDKLTQNSMRGLGHPLRHLPLPQPPSAISPGENSKSR FPPQCYATQYQDYSLSSAHKVSGMASRLLGPSFESYLLPELTRYDREVKVPVLGSSTL LQGGDLLRALDQAT" BASE COUNT 659 a 839 c 727 g 505 t ORIGIN 1 gcgtctgaac gtctcaaagg gccacagcga caatgacagc tgacaaggag aagaaaagga 61 gtagctcgga gaggaggaag gagaagtccc gggatgctgc gcggtgccgg cggagcaagg 121 agacggaggt gttctatgag ctggcccatg agctgcctct gccccacagt gtgagctccc 181 atctggacaa ggcctccatc atgcgactgg caatcagctt cctgcgaaca cacaagctcc 241 tctcctcagt ttgctctgaa aacgagtccg aagccgaagc tgaccagcag atggacaact 301 tgtacctgaa agccttggag ggtttcattg ccgtggtgac ccaagatggc gacatgatct 361 ttctgtcaga aaacatcagc aagttcatgg gacttacaca ggtggagcta acaggacata 421 gtatctttga cttcactcat ccctgcgacc atgaggagat tcgtgagaac ctgagtctca 481 aaaatggctc tggttttggg aaaaaaagca aagacatgtc cacagagcgg gacttcttca 541 tgaggatgaa gtgcacggtc accaacagag gccgtactgt caacctcaag tcagccacct 601 ggaaggtctt gcactgcacg ggccaggtga aagtctacaa caactgccct cctcacaata 661 gtctgtgtgg ctacaaggag cccctgctgt cctgcctcat catcatgtgt gaaccaatcc 721 agcacccatc ccacatggac atccccctgg atagcaagac cttcctgagc cgccacagca 781 tggacatgaa gttcacctac tgtgatgaca gaatcacaga actgattggt taccaccctg 841 aggagctgct tggccgctca gcctatgaat tctaccatgc gctagactcc gagaacatga 901 ccaagagtca ccagaacttg tgcaccaagg gtcaggtagt aagtggccag taccggatgc 961 tcgcaaagca tgggggctac gtgtggctgg agacccaggg gacggtcatc tacaaccctc 1021 gcaacctgca gccccagtgc atcatgtgtg tcaactacgt cctgagtgag attgagaaga 1081 atgacgtggt gttctccatg gaccagactg aatccctgtt caagccccac ctgatggcca 1141 tgaacagcat ctttgatagc agtggcaagg gggctgtgtc tgagaagagt aacttcctat 1201 tcaccaagct aaaggaggag cccgaggagc tggcccagct ggctcccacc ccaggagacg 1261 ccatcatctc tctggatttc gggaatcaga acttcgagga gtcctcagcc tatggcaagg 1321 ccatcctgcc cccgagccag ccatgggcca cggagttgag gagccacagc acccagagcg 1381 aggctgggag cctgcctgcc ttcaccgtgc cccaggcagc tgccccgggc agcaccaccc 1441 ccagtgccac cagcagcagc agcagctgct ccacgcccaa tagccctgaa gactattaca 1501 catctttgga taacgacctg aagattgaag tgattgagaa gctcttcgcc atggacacag 1561 aggccaagga ccaatgcagt acccagacgg atttcaatga gctggacttg gagacactgg 1621 caccctatat ccccatggac ggggaaggct tccagctaag ccccatctgc cccgaggagc 1681 ggctcttggc ggagaaccca cagtccaccc cccagcactg cttcagtgcc atgacaaaca 1741 tcttccagcc actggcccct gtagccccgc acagtccctt cctcctggac aagtttcagc 1801 agcagctgga gagcaagaag acagagcccg agcgccggcc catgtcctcc atcttctttg 1861 atgccggaag caaagcatcc ctgccaccgt gctgtggcca ggccagcacc cctctctctt 1921 ccatgggggg cagatccaac acccagtggc ccccagatcc accattacat tttgggccca 1981 caaagtgggc cgtcggggat cagcgcacag agttcttggg agcagcgccg ttggggcccc 2041 ctgtctctcc accccatgtc tccaccttca aaacaaggtc tgcaaagggt tttggggctc 2101 gaggcccaaa cgtgctgagt ccggccatgg tagccctctc caacaagctg aagctgaagc 2161 gacagctgga gtatgaaaag caagccttcc aggacccgag cgggggggac ccacctggtg 2221 gcagcacctc acatttgatg tggaaacgga tgaagaacct caggggtggg agctgccctt 2281 tgatgccgga caagccactg agcgcaaatg tacccaatga taagctcacc caaaactcca 2341 tgaggggcct gggccatccc ctgagacatc tgccgctgcc acagcctcca tctgccatca 2401 gtcccgggga gaacagcaag agcaggttcc ccccacagtg ctacgccacc cagtaccagg 2461 actacagcct gtcgtcagcc cacaaggtgt caggcatggc aagccggctg ctcgggccct 2521 catttgagtc ctacctgctg cccgaactga ccagatatga ccgtgaggtg aaagtgcccg 2581 tgctgggaag ctccacgctc ctgcaaggag gggacctcct cagagccctg gaccaggcca 2641 cctgagccag gcttctacct gggcagcacc tctgccgacg ccgtcccacc agcttcactc 2701 tctccgtctg tctttgcaac taggtatttg // LOCUS HSU51678 843 bp mRNA PRI 30-JAN-1998 DEFINITION Homo sapiens small acidic protein mRNA, complete cds. ACCESSION U51678 NID g1915966 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 843) AUTHORS Gong,T.W., Hegeman,A.D., Shin,J.J., Lindberg,K.H., Barald,K.F. and Lomax,M.I. TITLE Novel genes expressed in the chick otocyst during development: identification using differential display of RNA JOURNAL Int. J. Dev. Neurosci. 15 (4-5), 585-594 (1997) MEDLINE 97408516 REFERENCE 2 (bases 1 to 843) AUTHORS Lomax,M.I. and Gong,T.-W.L. TITLE Direct Submission JOURNAL Submitted (18-MAR-1996) Otolaryngology-Kresge Hearing Research Institute, University of Michigan, 9301 MSRBIII, 1150 W. Medical Center Dr., Ann Arbor, MI 48109-0648, USA FEATURES Location/Qualifiers source 1..843 /organism="Homo sapiens" /note="the EST sequence deposited in GenBank Accession Number Z45801 overlaps this sequence on the 5' end" /db_xref="taxon:9606" /clone="KH251; cloneID 145052 from IMAGE Consortium" /clone_lib="Soares placenta Nb2HP" /tissue_type="placenta" 5'UTR <1..35 CDS 36..587 /function="unknown" /codon_start=1 /product="small acidic protein" /db_xref="PID:g1915967" /translation="MSAARESHPHGVKRSASPDDDLGSSNWEAADLGNEERKQKFLRL MGAGKKEHTGRLVIGDHKSTSHFRTGEEDKKINEELESQYQQSMDSKLSGRYRRHCGL GFSEVEDHDGEGDVAGDDDDDDDDSPDPESPDDSESDSESEKEESAEELQAAEHPDEV EDPKNKKDAKSNYKMMFVKSSGS" 3'UTR 588..829 polyA_signal 803..808 BASE COUNT 294 a 141 c 198 g 210 t ORIGIN 1 agggttcggc atttttcgtc gggatccccg caaggatgag tgctgccaga gagtctcacc 61 cgcatggggt gaagcgttca gcctccccag acgacgatct gggatctagc aattgggagg 121 cagcagactt gggtaatgaa gagagaaaac aaaagttctt gagacttatg ggtgcaggaa 181 agaaagaaca tactggtcgt cttgttatag gagatcacaa atcaacatct cacttccgaa 241 ccggggaaga agacaagaaa attaatgaag aactggagtc tcaatatcag caaagtatgg 301 acagtaaatt atcaggaaga tatcggcgac attgtggact tggcttcagt gaggtagaag 361 accatgatgg agaaggtgat gtggctggag atgatgatga tgacgatgat gattcacctg 421 atcctgaaag tccagatgat tctgaaagcg attcagagtc agagaaagaa gaatctgctg 481 aagaactcca agctgctgag caccctgatg aagtggagga tcccaaaaac aaaaaagatg 541 caaaaagcaa ttataaaatg atgtttgtta aatccagtgg ttcataactc ccaaacgctt 601 agtctttgta ttaaaagtaa gccttattgt tacaatgcac agtggaggac tgcttataga 661 gcacagacct ttgtattata atttttaaaa aggccctttt aaataattac aaagagtgtt 721 tgctttcaaa tgccatgggt tacactttta tgggcatgac tataaccatt tttgtaaaga 781 gtaagagttg tataaaataa gaaataaata cagtactcaa cttcctttca aaaaaaaaaa 841 aaa // LOCUS HSU51869 2647 bp mRNA PRI 06-JAN-1998 DEFINITION Human proto-oncogene Bcd orf1 and orf2 mRNA, complete cds. ACCESSION U51869 NID g2745959 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2647) AUTHORS El Rouby,S. and Newcomb,E.W. TITLE Identification of Bcd, a novel proto-oncogene expressed in B-cells JOURNAL Oncogene 13 (12), 2623-2630 (1996) MEDLINE 97152553 REFERENCE 2 (bases 1 to 2647) AUTHORS EL-Rouby,S. and Newcomb,E.W. TITLE Direct Submission JOURNAL Submitted (19-MAR-1996) Soumaya EL-Rouby, Pathology, NYU Medical Center, MSB 533, 550 First Avenue, New York, NY 10016, USA FEATURES Location/Qualifiers source 1..2647 /organism="Homo sapiens" /note="expression specific to CD19+ B-cells" /db_xref="taxon:9606" /chromosome="10" /cell_type="peripheral blood lymphocytes" CDS 353..592 /codon_start=1 /product="Bcd orf1" /db_xref="PID:g2745960" /translation="MDVLPMCSIFQELQIVHETGYFSALPSLEEYWQQTCLELERYLQ SEPCYVSASEIKFDSQEGSVDQNHSWLGRKRRNPN" CDS 677..1324 /codon_start=1 /product="Bcd orf2" /db_xref="PID:g2745961" /translation="MSAANPLTAPRNFLPRPSLPPTPLAKFWVSSGKLSSSVTSTPPS SPELSREPSQLWGCVPGELPSAREGAQRTSGKPGDKGNGDASPDGRRRVHRCHFNGCR KVYTKSSHLKAHQRTHTGQCPRGPRRAVAGGRHCIAPALEGAGPGCTGCGQGRLEILT SSDVFHQSYRSFWSSDENPKDWTFDWINVLTIFSINKQETRSFSPRTPKPPPPTP" BASE COUNT 662 a 671 c 701 g 613 t ORIGIN 1 cgggtgttgt aagatctggg gagaggggaa gtactggctc cttctaatca gcaacactgt 61 gtgggcatac aatggaggaa tccagtaatg gaaactatag gcctgagtaa tttagaacag 121 aatttcacaa ttatatacag catataggta gggaaggaca tggagtatat aattgtaaat 181 attgtgtggg ctccgcgcgc tgcgggctgc ggcagggtcc ggccggatgt ctctgcagag 241 cctggagttt gcatgaaact ttcacctgcg ctccggggag actttcggtt ccggctccca 301 ccgcgcgcct cgccgccctc gcgaccgcgg gctccgtcca acccggcccg acatggacgt 361 gctccccatg tgcagcatct tccaggagct ccagatcgtg cacgagaccg gctacttctc 421 ggcgctgccg tctctggagg agtactggca acagacctgc ctagagctgg aacgttacct 481 ccagagcgag ccctgctatg tttcagcctc agaaatcaaa tttgacagcc aggaaggatc 541 tgtggaccaa aatcattctt ggctcgggag aaaaaggagg aatccgaact gaagatatct 601 tccagtcctc cagaggacac tctcatcaag gcccgagctt ttggttacaa cttagagacc 661 aacagcctga actcagatgt cagcagcgaa tcctctgaca gctccgagga actttctccc 721 acggccaagt ttacctccga ccccattggc gaagttttgg gtcagctcgg gaaaattgag 781 ctcctctgtc acctccacgc ctccatcttc tccggaactg agcagggaac cttctcaact 841 gtggggttgc gtgcccgggg agctgccttc ggccagggaa ggtgcgcagc ggacttcggg 901 gaagccaggt gacaagggaa atggcgatgc ctcccccgac ggcaggagga gggtgcaccg 961 gtgccacttt aacggctgca ggaaagttta caccaaaagc tcccacttga aagcacacca 1021 gcggacgcac acaggtcagt gcccacgcgg gccccggagg gcggtcgctg gtgggcgcca 1081 ctgcattgca ccagccctgg agggagccgg gcctggctgc acaggatgtg gtcagggcag 1141 actagaaatc ctcacttcct cagatgtttt ccaccaaagc tataggtctt tttggagttc 1201 agatgagaat cctaaagact ggacttttga ttggataaat gtgctaacaa ttttcagtat 1261 aaacaaacaa gagactagat ccttttctcc acggacaccc aaacctccac cccccacccc 1321 ttagtagtgc tggggatccg aggccactgc ccttcaccag tgcactcgca cgaggctacc 1381 tcgagcgggc ctggggtttc ctaaatgaaa ctcaagggtc aggacagagg gttgctgggc 1441 agcgtggagt gtgtgggttt gatgctgacg gcccgaggcc cgagtgggac cggcctgctc 1501 tgtaagcagc agcattgatc agcgagtgtt tcctgagaac ttctccgtgt ctcatgcagc 1561 ctttgtttct gataccgctt gaaacagttt cttaatgaaa tgccatacct aggtgaaagt 1621 gccatttaaa aataccttga catgttctag gataattggt gaggaatcac agaacattta 1681 gaactgggaa gggtcttagt gatcacgtga tgcaggctct tctcttatca gtaggagagc 1741 aaattgctga gagtcagtcc cagacaggct tggtgacagc tgagattgag atccgggtgg 1801 cccaatatcc aggcccaggc ctgtctcaac ataccctgag attggcttga caactttgtt 1861 ttctcaggta gcacttgtag taaattcata tttatgattt gaccaaggaa tgaagtgaac 1921 ccagttgttc aattgccatt tagagaatga ttccggggcc ctgtactggg gctttccaga 1981 agctcgtaac ttcagctttg tagaaaggta gaacgtccct gaggaaactg gacaggcaca 2041 ttccataggg aagtgaggat ggaacagaag tgtgtttggg agaaacagtt gccatgaaga 2101 aagcaatagc tctgcctttg ccggggctgt gggtccggca ggctgacacc tcatcccgca 2161 agcattttgc tggtctgagt cgtggtcgtt cttccacgtt aactttgatg acagcaccat 2221 gggcttggct gaagctgtgt cccttggaca gcagtgggag gcctgagact gggtcaggag 2281 agagctgctg ttgtctctct gaggctgcca gttgttgtgt gttaccgatg ccagaagcca 2341 ctgggtccct cctcacatgg cagaagaaat ggaaaggaaa gaaccttctc actctcccca 2401 gcctttttat aagggctcta atcccacctg tgagggctct gtcctcatca cttaatcacc 2461 tcctaaaggc cccacttctt actactatca cattgaccat taagtttcaa catacaactt 2521 ttcatggaca cattcagacc atacgtagca cctgctgaag aacatcttgg ttccttccaa 2581 gttttggcaa gtatgaataa agctgctata aacatgaaaa aaaaaaaaaa aaaaaaaaaa 2641 aaaaaaa // LOCUS HSU51903 5767 bp mRNA PRI 05-DEC-1996 DEFINITION Human RasGAP-related protein (IQGAP2) mRNA, complete cds. ACCESSION U51903 NID g1262925 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5767) AUTHORS Brill,S., Li,S., Lyman,C.W., Church,D.M., Wasmuth,J.J., Weissbach,L., Bernards,A. and Snijders,A.J. TITLE The Ras GTPase-activating-protein-related human protein IQGAP2 harbors a potential actin binding domain and interacts with calmodulin and Rho family GTPases JOURNAL Mol. Cell. Biol. 16 (9), 4869-4878 (1996) MEDLINE 96347557 REFERENCE 2 (bases 1 to 5767) AUTHORS Bernards,A., Snijders,A.J., Brill,S. and Li,S. TITLE Direct Submission JOURNAL Submitted (20-MAR-1996) Andre Bernards, Cancer Center, Massachusetts General Hospital, Building 149, 13th Street, Charlestown, MA 02129, USA FEATURES Location/Qualifiers source 1..5767 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="5" /map="5q" /tissue_type="liver" gene 223..4950 /gene="IQGAP2" CDS 223..4950 /gene="IQGAP2" /note="IQGAP2; Cdc42-, Rac1-, and calmodulin-binding protein" /codon_start=1 /product="RasGAP-related protein" /db_xref="PID:g1262926" /translation="MPHEELPSLQRPRYGSIVDDERLSAEEMDERRRQNIAYEYLCHL EEAKRWMEVCLVEELPPTTELEEGLRNGVYLAKLAKFFAPKMVSEKKIYDVEQTRYKK SGLHFRHTDNTVQWLRAMESIGLPKIFYPETTDVYDRKNIPRMIYCIHALSLYLFKLG IAPQIQDLLGKVDFTEEEISNMRKELEKYGIQMPSFSKIGGILANELSVDEAALHAAV IAINEAVEKGIAEQTVVTLRNPNAVLTLVDDNLAPEYQKELWDAKKKKEENARLKNSC ISEEERDAYEELLTQAEIQGNINKVNRQAAVDHINAVIPEGDPENTLLALKKPEAQLP AVYPFAAAMYQNELFNLQKQNTMNYLAHEELLIAVEMLSAVALLNQALESNDLVSVQN QLRSPAIGLNNLDKAYVERYANTLLSVKLEVLSQGQDNLSWNEIQNCIDMVNAQIQEE NDRVVAVGYINEAIDEGNPLRTLETLLLPTANISDVDPAHAQHYQDVLYHAKSQKLGD SESVSKVLWLDEIQQAVDEANVDEDRAKQWVTLVVDVNQCLEGKKSSDILSVLKSSTS NANDIIPECADKYYDALVKAKELKSERVSSDGSWLKLNLHKKYDYYYNTDSKESSWVT PESCFYKESWLTGKEIEDIIEEVTVGYIRENIWSASEELLLRFQATSSGPILREEFEA RKSFLHEQEENVVKIQAFWKGYKQRKEYMHRRQTFIDNTDSVVKIQSWFRMATARKSY LSRLQYFRDHNNEIVKIQSLLRANKARDDYKTLVGSENPPLTVIRKFVYLLDQSDLDF QEELEVARLREEVVTKIRANQQLEKDLNLMDIKIGLLVKNRITLEDVISHSKKLNKKK GGEMEILNNTDNQGIKSLSKERRKTLETYQQLFYLLQTNPLYLAKLIFQMPQNKSTKF MDTVIFTLYNYASNQREEYLLLKLFKTALEEEIKSKVDQVQDIVTGNPTVIKMVVSFN RGARGQNTLRQLLAPVVKEIIDDKSLIINTNPVEVYKAWVNQLETQTGEASKLPYDVT TEQALTYPEVKNKLEASIENLRRVTDKVLNSIISSLDLLPYGLRYIAKVLKNSIHEKF PDATEDELLKIVGNLLYYRYMNPAIVAPDGFDIIDMTAGGQINSDQRRNLGSVAKVLQ HAASNKLFEGENEHLSSMNNYLSETYQEFRKYFKEACNVPEPEEKFNMDKYTDLVTVS KPVIYISIEEIISTHSLLLEHQDAIAPEKNDLLSELLGSLGEVPTVESFLGEGAVDPN DPNKANTLSQLSKTEISLVLTSKYDIEDGEAIDSRSLMIKTKKLIIDVIRNQPGNTLT EILETPATAQQEVDHATDMVSRAMIDSRTPEEMKHSQSMIEDAQLPLEQKKRKIQRNL RTLEQTGHVSSENKYQDILNEIAKDIRNQRIYRKLRKAELAKLQQTLNALNKKAAFYE EQINYYDTYIKTCLDNLKRKNTRRSIKLDGKGEPKGAKRAKPVKYTAAKLHEKGVLLD IDDLQTNQFKNVTFDIIATEDVGIFDVRSKFLGVEMEKVQLNIQDLLQMQYEGVAVMK MFDKVKVNVNLLIYLLNKKFYGK" BASE COUNT 1938 a 1089 c 1298 g 1442 t ORIGIN 1 gagggaggag agttcacttt tacttcagtg tcagcgcgcg gcggccgtgg ctggctctgg 61 cgagagagca ccgagggagt gggtcgcaga tcttcgggcg gctaggggaa atcggcgaga 121 ggcgggatcc gagcgcgccg gcggggcgca gagcccgcga gcctggccag cgagggtagc 181 cgcggggggc gcgccccggg cgggcccccg gagacgcgca ggatgccaca cgaagagctg 241 ccgtcgctgc agagaccccg ctatggctct attgtggacg atgaaaggct ctctgcagag 301 gagatggatg agaggaggcg gcagaacatt gcttatgaat atctgtgcca cttagaggaa 361 gccaaaaggt ggatggaagt ttgcttagtt gaagaattgc caccaaccac tgaattggaa 421 gaagggctcc ggaatggagt ttaccttgca aagttagcca agttctttgc cccgaaaatg 481 gtatcagaga aaaagatcta tgatgtggaa caaacacgtt ataagaagtc tggccttcat 541 tttcgacaca cagataatac cgtccagtgg ttaagagcga tggagtctat tggtctaccc 601 aagatatttt atccagaaac aacagatgtc tatgatcgga aaaacatacc aagaatgata 661 tattgcattc acgcactgag tttgtatctg ttcaaactag gaatagcacc ccagatccag 721 gatttgttgg gcaaagtaga cttcacagag gaggaaatca gtaatatgag aaaagaactt 781 gagaaatatg gaatacagat gccatctttc agcaaaatag gtggtattct ggccaatgaa 841 ctgtccgtgg atgaagctgc attacatgct gcagttatag ccattaatga agcagttgaa 901 aaaggaatag cagagcaaac cgttgtaaca ctaagaaacc caaatgcggt tttaacttta 961 gtggatgaca accttgcacc agaatatcag aaagaactct gggatgccaa aaagaaaaaa 1021 gaggaaaatg caagactgaa gaatagctgt atttcagaag aagaaagaga tgcttatgaa 1081 gaactgctga cacaagcaga aatccaaggc aatattaata aagtcaacag gcaggctgca 1141 gtggaccata tcaatgctgt cattccggaa ggtgaccccg agaatacgct gcttgcactg 1201 aagaaaccag aggcccagct gcctgctgtt tatccctttg ctgctgccat gtatcagaac 1261 gaacttttca acctccagaa acagaacacc atgaactact tggcccacga ggagcttttg 1321 attgctgtgg aaatgttgtc tgctgttgct ttactaaacc aggccttgga aagcaacgat 1381 cttgtgtctg tgcagaatca actcagaagc cccgcaatag gcttaaacaa tctggacaag 1441 gcatatgtgg aacgttatgc aaacacacta ctctctgtta aactagaagt tttatcccaa 1501 gggcaagata acttaagctg gaatgaaatt cagaattgta ttgatatggt taatgctcaa 1561 attcaagaag aaaatgaccg agttgtagct gtagggtaca tcaatgaagc tattgatgaa 1621 gggaatcctt tgaggacttt agaaactttg ctcctaccta ctgcgaatat tagtgatgtg 1681 gacccagccc atgcccagca ctaccaggat gttttatacc atgctaaatc acagaaactc 1741 ggagactctg agagtgtttc caaagtgctt tggctggatg agatacagca agccgtcgat 1801 gaggccaacg tggacgagga cagagcaaaa caatgggtta ctctggtggt tgatgttaat 1861 cagtgtttgg aaggaaaaaa atcaagtgat attttgtctg tattgaagtc ttccacttct 1921 aatgcaaatg acataatccc ggagtgtgct gacaaatact atgatgccct tgtgaaggca 1981 aaagagctca aatctgaaag agtgtctagt gacggttcat ggctcaaact caacctgcac 2041 aaaaaatatg actactatta caacactgat tcaaaagaga gttcctgggt cacacctgaa 2101 tcatgcttct ataaagaatc atggctcaca ggaaaagaaa tcgaggacat tattgaggaa 2161 gtcacagtag gttacattcg tgagaatata tggtctgctt cagaagagtt gcttcttcgc 2221 tttcaagcca caagctcagg acccatcctt agggaagagt ttgaagctag aaaatcattt 2281 ttgcatgaac aagaagagaa tgtggtcaaa atacaggctt tttggaaagg atataaacaa 2341 cggaaggagt atatgcacag gcggcaaacg ttcattgata atactgattc tgttgtgaag 2401 attcagtcct ggttccgaat ggcaactgca agaaagagct atctttcaag actacagtat 2461 ttcagagatc ataataatga aattgtgaaa atacagtcac tgttgagagc gaacaaagct 2521 agagatgact acaaaacatt ggttggctct gaaaacccac cattaacagt aattcgcaaa 2581 tttgtatacc tgctggacca aagtgatttg gatttccagg aggaactaga ggttgcacga 2641 ttaagggaag aagtagtgac caagatcagg gccaatcaac agctggaaaa agacctgaac 2701 ctgatggaca tcaagattgg actgctggtg aagaacagga tcacactaga ggatgtaatt 2761 tcacacagta aaaagctgaa caagaaaaaa ggaggagaaa tggaaatact gaataacacc 2821 gacaaccaag gaataaaaag tttgagtaag gagaggagaa aaacactaga aacatatcag 2881 cagctgtttt accttttaca gaccaaccct ttatacttgg ctaagctgat tttccagatg 2941 ccacagaaca agtccactaa atttatggat actgttattt tcacactata taattatgcc 3001 tctaatcagc gagaagaata tctacttctc aagcttttta aaactgctct ggaggaagaa 3061 ataaaatcaa aagtggacca ggtacaggac atagttactg gtaaccctac agtcatcaag 3121 atggtcgtca gcttcaatag aggtgcccgg ggacagaaca ccctgcgcca actcctggct 3181 ccagtggtaa aagagatcat cgacgacaag tcgctgatta tcaacacaaa ccctgtagag 3241 gtgtacaagg cttgggtgaa ccaactagaa acacagactg gagaggccag caagttgcct 3301 tatgatgtga ccacagaaca agctctaaca tacccagaag tgaaaaataa actggaggct 3361 tccattgaga acctgagaag ggtcaccgac aaagtcctga attctatcat ttcttccctt 3421 gatctactgc cttatggatt gaggtatata gccaaagtac tgaagaattc gatccatgag 3481 aaattccccg atgcaacaga agatgagcta ttaaagattg ttggaaacct cctgtactat 3541 cggtacatga atccagccat tgtagctcca gatggctttg atatcatcga catgacagct 3601 ggaggtcaga taaattctga ccaaaggaga aacttaggat cagtggccaa ggttcttcag 3661 cacgcagcct ccaacaagct gtttgaagga gaaaatgagc atctctcatc tatgaacaat 3721 tatttatcag agacgtatca ggaattcagg aaatatttca aagaagcatg taatgtccct 3781 gagccagaag agaagtttaa tatggacaaa tacacagacc tggtgacagt cagcaaacca 3841 gtcatttata tttcaattga agaaatcatc agcacacact cactcctgtt ggaacaccag 3901 gatgcaattg cccctgagaa aaatgactta ctgagtgaat tgctggggtc gctgggagag 3961 gtgccaaccg tggaatcttt tcttggggaa ggagcagttg accccaatga ccctaacaag 4021 gcaaatacac taagtcagct ttcaaagacc gagatttctc ttgtcttgac aagcaaatat 4081 gacatagagg acggtgaagc tatagatagc cgaagcctca tgataaagac caagaagctg 4141 ataattgatg tgatccggaa ccagccaggg aacacattga cagaaatctt agagacacca 4201 gcaactgcgc aacaggaggt agaccatgcc acggacatgg tgagccgtgc aatgatagat 4261 tccaggactc cagaagaaat gaagcatagc caatctatga ttgaagatgc acagctgcct 4321 cttgagcaga agaagaggaa aatccagagg aatcttcgga cgttggaaca gactggacac 4381 gtgtcatccg aaaataaata ccaagacatt ctcaatgaga ttgccaagga tattcgaaat 4441 caaagaatct atcgtaagct tcgaaaagct gaattggcaa aacttcagca gaccctgaat 4501 gcacttaaca agaaggcagc attttatgaa gagcaaatca attattatga cacctacata 4561 aagacttgtt tagacaactt aaaaagaaaa aatactcgga gatcaattaa actagatgga 4621 aaaggagaac ccaaaggggc gaagagagcg aagccagtga agtacactgc agcaaagctg 4681 catgagaaag gtgtcctgct agatatagat gatcttcaaa caaaccagtt taagaatgtt 4741 acatttgata tcatagctac tgaagatgta ggcattttcg atgtaagatc aaaattcctt 4801 ggtgttgaga tggaaaaggt gcaactcaat attcaggatt tacttcagat gcaatatgaa 4861 ggagtagctg taatgaaaat gtttgataag gttaaagtga atgtaaacct tctcatatac 4921 ctgctgaaca agaagttcta tggaaagtga agtgcctaca gaaatttctt ggattctgta 4981 tcatctggat taggaaatga atttgtttaa tatttttgtt tttaaacatg attgaaatca 5041 ctgcttataa atgtgtgatt ttttttaaat gaccaaaact gttctgaaga atgtacccag 5101 gtgccttttt gctaatttga tactataata gaatgagaca taaaatgaat taatggaaac 5161 atatccacac tgtactgtga tataggtact ctgatttaaa actttggaca tcctgtgatc 5221 tgttttaaag ttggggggtg ggaaatttag ctgactaggg acaaacatgt aaacctattt 5281 tcctatgaaa aaagttttaa atgtcccact tgaataacgt aattcttcat agttttttta 5341 atctatggat aaatggaaac ctaattattt gtaatgaatt atttagacag ttctaagccc 5401 tgtcttctgg gagttatcaa ttttaaagag aacttttgtg caattcaaat gaagttttta 5461 taagtaattg aaaatgacaa cacaataaca ctttctgtat aaaagtatat attttatgtg 5521 atttattcct actaaatgaa agtgcactac tgcctcatgt aaagactctt gcacgcagag 5581 cctttaagtg actaaggaac aacatagata gtgagcatag tccccacctc cacccctcac 5641 aatttatttg aatacttcaa ttgtgcctct caattttttg taatgctaaa aaatcagtat 5701 ctagatggtt tttaaatgta ttctctggaa attgttttat gtaaaataaa tgttacttaa 5761 ttccatt // LOCUS HSU51920 2017 bp mRNA PRI 13-FEB-1997 DEFINITION Human signal recognition particle (SRP54) mRNA, complete cds. ACCESSION U51920 NID g1256819 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2017) AUTHORS Gowda,K., Chittenden,K. and Zwieb,C. TITLE Binding site of the M-domain of human protein SRP54 determined by systematic site-directed mutagenesis of signal recognition particle RNA JOURNAL Nucleic Acids Res. 25 (2), 388-394 (1997) MEDLINE 97169426 REFERENCE 2 (bases 1 to 2017) AUTHORS Zwieb,C. TITLE Direct Submission JOURNAL Submitted (20-MAR-1996) Christian Zwieb, Molecular Biology, University of Texas Health Center at Tyler, Highway 271 N, Tyler, TX 75710, USA FEATURES Location/Qualifiers source 1..2017 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="HepG2" gene 72..1586 /gene="SRP54" CDS 72..1586 /gene="SRP54" /codon_start=1 /product="signal recognition particle" /db_xref="PID:g1256820" /translation="MVLADLGRKITSALRSLSNATIINEEVLNAMLKEVCTALLEADV NIKLVKQLRENVKSAIDLEEMASGLNKRKMIQHAVFKELVKLVDPGVKAWTPTKGKQN VIMFVGLQGSGKTTTCSKLAYYYQRKGWKTCLICADTFRAGAFDQLKQNATKARIPFY GSYTEMDPVIIASEGVEKFKNENFEIIIVDTSGRHKQEDSLFEEMLQVANAIQPDNIV YVMDASIGQACEAQAKAFKDKVDVASVIVTKLDGHAKGGGALSAVAATKSPIIFIGTG EHIDDFEPFKTQPFISKLLGMGDIEGLIDKVNELKLDDNEALIEKLKHGQFTLRDMYE QFQNIMKMGPFSQILGMIPGFGTDFMSKGNEQESMARLKKLMTIMDSMNDQELDSTDG AKVFSKQPGRIQRVARGSGVSTRDVQELLTQYTKFAQMVKKMGGIKGLFKGGDMSKNV SQSQMAKLNQQMAKMMDPRVLHHMGGMAGLQSMMRQFQQGAAGNMKGMMGFNNM" BASE COUNT 668 a 328 c 441 g 580 t ORIGIN 1 cggcccctca gggcacggct ttagcggtgt cttttgcgag ttcttcgtaa gtacatctta 61 aagctgtcaa gatggttcta gcagaccttg gaagaaaaat aacatcagca ttacgctcgt 121 tgagcaatgc caccattatc aatgaagagg tattgaatgc tatgctaaaa gaagtctgta 181 ccgctttgtt ggaagcagat gttaatatta aactagtgaa gcaactaaga gaaaatgtta 241 agtctgctat tgatcttgaa gagatggcat ctggtcttaa caaaagaaaa atgattcagc 301 atgctgtatt taaagaactt gtgaagcttg tagaccctgg agttaaggca tggacaccca 361 ctaaaggaaa acaaaatgtg attatgtttg ttggattgca agggagtggt aaaacaacaa 421 catgttcaaa gctagcatat tattaccaga ggaaaggttg gaagacctgt ttaatatgtg 481 cagacacatt cagagcaggg gcttttgacc aactaaaaca gaatgctacc aaagcaagaa 541 ttccatttta tggaagctat acagaaatgg atcctgtcat cattgcttct gaaggagtag 601 agaaatttaa aaatgaaaat tttgaaatta ttattgttga tacaagtggc cgccacaaac 661 aagaagactc tttgtttgaa gaaatgcttc aagttgctaa tgctatacaa cctgataaca 721 ttgtttatgt gatggatgcc tccattgggc aggcttgtga agcccaggct aaggctttta 781 aagataaagt agatgtagcc tcagtaatag tgacaaaact tgatggccat gcaaaaggag 841 gtggtgcact cagtgcagtc gctgccacaa aaagtccgat tattttcatt ggtacagggg 901 aacatataga tgactttgaa cctttcaaaa cacagccttt tattagcaaa cttcttggta 961 tgggcgacat tgaaggactg atagataaag tcaacgagtt gaagttggat gacaatgaag 1021 cacttataga gaagttgaaa catggtcagt ttacgttgcg agacatgtat gagcaatttc 1081 aaaatatcat gaaaatgggc cccttcagtc agatcttggg gatgatccct ggttttggga 1141 cagattttat gagcaaagga aatgaacagg agtcaatggc aaggctaaag aaattaatga 1201 caataatgga tagtatgaat gatcaagaac tagacagtac ggatggtgcc aaagttttta 1261 gtaaacaacc aggaagaatc caaagagtag caagaggatc gggtgtatca acaagagatg 1321 ttcaagaact tttgacacaa tataccaagt ttgcacagat ggtaaaaaag atgggaggta 1381 tcaaaggact tttcaaaggt ggcgacatgt ctaagaatgt gagccagtca cagatggcaa 1441 aattgaacca acaaatggcc aaaatgatgg atcctagggt tcttcatcac atgggtggta 1501 tggcaggact tcagtcaatg atgaggcagt ttcaacaggg tgctgctggc aacatgaaag 1561 gcatgatggg attcaataat atgtaaagaa aatgtcctta atataaactg actcagttga 1621 atacctaatt tgctgagacc tcagcgtttc ccttcttttt gcgaattggg gggaaagtgt 1681 atttttcttg cttatcatgc actctttcct tttcttctcg cccgcttttc ccctcctttt 1741 ctttttcctt ccttctttcc tccctttaat ataagggaga aatacatggt ttttgtggaa 1801 atcattatat gtttgcttta gattttgttc tgttttcacc atcataacac ttaagttaaa 1861 tcatgatgta aaattttagt acttaaaggt ttttaattat ctcgaaggcc aagcattgca 1921 tttgtaaaca gtcctggtca gtagttaaat aatgtttcaa ttaaagtgct gtaaaataaa 1981 cttcaaagtg gttataagtt aaaaaaaaaa aaaaagg // LOCUS HSU51990 1450 bp mRNA PRI 29-JAN-1997 DEFINITION Human hPrp18 mRNA, complete cds. ACCESSION U51990 NID g1805248 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1450) AUTHORS Horowitz,D.S. and Krainer,A.R. TITLE A human protein required for the second step of pre-mRNA splicing is functionally related to a yeast splicing factor JOURNAL Genes Dev. 11 (1), 139-151 (1997) MEDLINE 97152474 REFERENCE 2 (bases 1 to 1450) AUTHORS Horowitz,D.S. and Krainer,A.R. TITLE Direct Submission JOURNAL Submitted (21-MAR-1996) David S. Horowitz, Cold Spring Harbor Laboratory, 1 Bungtown Rd., Cold Spring Harbor, NY 11724, USA FEATURES Location/Qualifiers source 1..1450 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="HeLa cells" CDS 73..1101 /function="required for the second step of mRNA splicing" /note="pre-mRNA splicing factor; similar to S. cerevisiae Prp18 protein encoded by GenBank Accession Number L03536, to C. elegans EST sequence in dbEST T01256, and to rice protein encoded by GenBank Accession Number D15798" /codon_start=1 /product="hPrp18" /db_xref="PID:g1805249" /translation="MDILKSEILRKRQLVEDRNLLVENKKYFKRSELAKKEEEAYFER CGYKIQPKEEDQKPLTSSNPVLELELAEEKLPMTLSRQEVIRRLRERGEPIRLFGETD YDAFQRLRKIEILTPEVNKGLRNDLKAALDKIDQQYLNEIVGGQEPGEEDTQNDLKVH EENTTIEELEALGESLGKGDDHKDMDIITKFLKFLLGVWAKELNAREDYVKRSVQGKL NSATQKQTESYLRPLFRKLRKRNLPADIKESITDIIKFMLQREYVKANDAYLQMAIGN APWPIGVTMVGIHARTGREKIFSKHVAHVLNDETQRKYIQGLKRLMTICQKHFPTDPS KCVEYNAL" BASE COUNT 483 a 248 c 352 g 367 t ORIGIN 1 cggccgccgg cccagtgagg ctgggttcga ggagctggag cgggaaactg gagcttaaat 61 tctggcggcg agatggacat tctgaaatca gagatccttc ggaagcggca gctggtggag 121 gacaggaacc tgctggtgga aaataaaaaa tatttcaagc gtagtgagct cgccaaaaaa 181 gaagaggaag catattttga aagatgtggc tacaagatac agccaaaaga ggaggaccag 241 aaaccattaa cttcatcgaa tccagtgtta gaacttgaac tggcagagga aaaattacct 301 atgacgcttt ctaggcaaga ggtcatcaga agattgagag aaagaggaga accaatcaga 361 ctatttggag agactgatta tgatgctttt caacgtttaa ggaaaataga gatcctcaca 421 ccagaagtta acaagggatt gaggaatgat ttgaaagcag ccttggataa gattgatcag 481 cagtacctca atgaaatcgt cggcggtcag gagcctggag aggaagacac acagaatgat 541 ctgaaagttc atgaggaaaa caccacaatt gaagagttag aggcgcttgg agagtcctta 601 gggaaaggcg atgatcataa agacatggac atcatcacca aattcctgaa gtttcttctt 661 ggcgtttggg ctaaagaatt gaatgccaga gaagattatg tgaaacgcag tgtgcagggt 721 aaactgaaca gtgcgaccca gaaacagacc gagtcctacc taagaccact ttttagaaag 781 ctacggaaaa ggaatcttcc tgctgatatt aaagaatcaa taacggatat tattaaattc 841 atgttgcaga gagaatacgt gaaggcaaat gatgcttatc ttcagatggc cattggaaat 901 gcgccttggc ccatcggtgt cactatggtt ggtatccatg ccagaactgg cagagaaaag 961 attttttcca agcatgttgc acatgtttta aatgacgaaa ctcagcggaa atatattcag 1021 ggattgaaga ggttaatgac catttgccag aaacactttc ctacagaccc atccaaatgt 1081 gtggagtaca atgcactgtg agatctgtgt atggtgtgtt aataacaata agaaacttag 1141 ggaagcaggc tgtggacttc tggaattacc aacaggaatg aggaaagaag aaaactggag 1201 tttccagtct ctgagttcta cctgatgtaa ctcttgattg gttttaagaa ctttgttggc 1261 cttcatttca tatctgactg caagctgatt tttctttctt gctttcattt taattagtcc 1321 aaaattaagt tttaaagatt tttcctcaca atttaaatcc atagacaaca gaagggggtt 1381 taaaatgacc tttttttcag ttgacccgaa agttgtggtt agatgattaa aaagaaacat 1441 ttgaaaaaaa // LOCUS HSU52111 153460 bp DNA PRI 11-SEP-1997 DEFINITION Homo sapiens Xq28 genomic DNA in the region of the ALD locus containing the genes for creatine transporter (SLC6A8), CDM, adrenoleukodystrophy (ALD), Na+-isocitrate dehydrogenase gamma subunit (IDH), and translocon-associated protein delta (TRAP) genes, complete cds, plexin related protein (PLEXR) and serine kinase (SK) genes, partial cds, Xq28lu1 gene and cytochrome C (CCp) pseudogene. ACCESSION U52111 NID g1302649 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 153460) AUTHORS Brenner,V., Nyakatura,G., Rosenthal,A. and Platzer,M. TITLE Genomic organization of two novel genes on human Xq28: compact head to head arrangement of IDH gamma and TRAP delta is conserved in rat and mouse JOURNAL Genomics 44 (1), 8-14 (1997) MEDLINE 97432815 REFERENCE 2 (bases 1 to 153460) AUTHORS Platzer,M., Bauer,D. and Drescher,B. TITLE Direct Submission JOURNAL Submitted (22-MAR-1996) Matthias Platzer, Institute of Molecular Biotechnology, Department of Genome Analysis, Beutenbergstr. 11, Jena 07745 Germany FEATURES Location/Qualifiers source 1..153460 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Xq28" /chromosome="X" mRNA join(1042..1876,3415..3546,4344..4593,5015..5147, 6081..6215,6303..6406,6502..6626,6945..7057,7170..7307, 7384..7486,7573..7673,7759..7929,8114..9625) /gene="SLC6A8" exon 1042..1876 /gene="SLC6A8" /number=1 5'UTR 1042..1614 /gene="SLC6A8" gene 1042..9625 /gene="SLC6A8" repeat_region 1122..1607 /rpt_family="TAR1" CDS join(1615..1876,3415..3546,4344..4593,5015..5147, 6081..6215,6303..6406,6502..6626,6945..7057,7170..7307, 7384..7486,7573..7673,7759..7929,8114..8254) /gene="SLC6A8" /note="CRTR" /codon_start=1 /product="creatine transporter" /db_xref="PID:g1302650" /translation="MAKKSAENGIYSVSGDEKKGPLIAPGPDGAPAKGDGPVGLGTPG GRLAVPPRETWTRQMDFIMSCVGFAVGLGNVWRFPYLCYKNGGGVFLIPYVLIALVGG IPIFFLEISLGQFMKAGSINVWNICPLFKGLGYASMVIVFYCNTYYIMVLAWGFYYLV KSFTTTLPWATCGHTWNTPDCVEIFRHEDCANASLANLTCDQLADRRSPVIEFWENKV LRLSGGLEVPGALNWEVTLCLLACWVLVYFCVWKGVKSTGKIVYFTATFPYVVLVVLL VRGVLLPGALDGIIYYLKPDWSKLGSPQVWIDAGTQIFFSYAIGLGALTALGSYNRFN NNCYKDAIILALINSGTSFFAGFVVFSILGFMAAEQGVHISKVAESGPGLAFIAYPRA VTLMPVAPLWAALFFFMLLLLGLDSQFVGVEGFITGLLDLLPASYYFRFQREISVALC CALCFVIDLSMVTDGGMYVFQLFDYYSASGTTLLWQAFWECVVVAWVYGADRFMDDIA CMIGYRPCPWMKWCWSFFTPLVCMGIFIFNVVYYEPLVYNNTYVYPWWGEAMGWAFAL SSMLCVPLHLLGCLLRAKGTMAERWQHLTQPIWGLHHLEYRAQDADVRGLTTLTPVSE SSKVVVVESVM" exon 3415..3546 /gene="SLC6A8" /number=2 exon 4344..4593 /gene="SLC6A8" /number=3 exon 5015..5147 /gene="SLC6A8" /number=4 exon 6081..6215 /gene="SLC6A8" /number=5 exon 6303..6406 /gene="SLC6A8" /number=6 exon 6502..6626 /gene="SLC6A8" /number=7 exon 6945..7057 /gene="SLC6A8" /number=8 exon 7170..7307 /gene="SLC6A8" /number=9 exon 7384..7486 /gene="SLC6A8" /number=10 exon 7573..7673 /gene="SLC6A8" /number=11 exon 7759..7929 /gene="SLC6A8" /number=12 exon 8114..9625 /gene="SLC6A8" /number=13 3'UTR 8255..9625 /gene="SLC6A8" repeat_region 10650..11006 /rpt_family="L1MB7" mRNA complement(join(13537..14013,15045..15145,15973..16096, 16997..17132,28581..28728,33911..34011,36192..36327, 37373..37464)) /gene="CDM" exon complement(13537..14013) /gene="CDM" /number=8 gene complement(13537..37464) /gene="CDM" 3'UTR complement(13537..13974) /gene="CDM" CDS complement(join(13975..14013,15045..15145,15973..16096, 16997..17132,28581..28728,33911..34011,36192..36283)) /gene="CDM" /note="DXS1375E" /codon_start=1 /product="CDM protein" /db_xref="PID:g1508820" /translation="MSLQWTAVATFLYAEVFVVLLLCIPFISPKRWQKIFKSRLVELL VSYGNTFFVVLIVILVLLVIDAVREIRKYDDVTEKVNLQNNPGAMEHFHMKLFRAQRN LYIAGFSLLLSFLLRRLVTLISQQATLLASNEAFKKQAESASEAAKKYMEENDQLKKG AAVDGGKLDVGNAEVKLEEENRSLKADLQKLKDELASTKQKLEKAENQVLAMRKQSEG LTKEYDRLLEEHAKLQAAVDGPMDKKEE" exon complement(15045..15145) /gene="CDM" /number=7 exon complement(15973..16096) /gene="CDM" /number=6 exon complement(16997..17132) /gene="CDM" /number=5 repeat_region complement(17561..17736) /rpt_family="MIR" repeat_region 24665..24951 /rpt_family="Alu-Sz" repeat_region 25179..25350 /rpt_family="Alu-Jo" repeat_region 25351..25449 /rpt_family="Alu-J" repeat_region 25963..26253 /rpt_family="Alu-Ya" repeat_region complement(26937..27215) /rpt_family="Alu-Sx" exon complement(28581..28728) /gene="CDM" /number=4 repeat_region 28915..28949 /rpt_family="Alu" repeat_region 28950..29213 /rpt_family="Alu-Sq" repeat_region 29498..29700 /rpt_family="Alu-Jo" repeat_region 29701..29804 /rpt_family="Alu-J" repeat_region complement(30498..30753) /rpt_family="Alu-Jb" repeat_region 31073..31364 /rpt_family="Alu-Sz" repeat_region complement(33413..33702) /rpt_family="Alu-Sq" exon complement(33911..34011) /gene="CDM" /number=3 repeat_region 34205..34482 /rpt_family="Alu-Sz" repeat_region complement(34545..34615) /rpt_family="Alu-J" repeat_region 34645..34945 /rpt_family="Alu-Sx" exon complement(36192..36327) /gene="CDM" /number=2 exon complement(37373..37464) /gene="CDM" /number=1 5'UTR complement(join(37373..37464,36284..36327)) /gene="CDM" mRNA join(37920..39205,42271..42451,49150..49292,49383..49551, 50195..50289,53130..53275,53612..53757,56025..56109, 56259..56384,56527..56899) /gene="ALD" gene 37920..56899 /gene="ALD" exon 37920..39205 /gene="ALD" /number=1 5'UTR 37920..38305 /gene="ALD" CDS join(38306..39205,42271..42451,49150..49292,49383..49551, 50195..50289,53130..53275,53612..53757,56025..56109, 56259..56384,56527..56773) /gene="ALD" /codon_start=1 /product="adrenoleukodystrophy protein" /db_xref="PID:g1302652" /translation="MPVLSRPRPWRGNTLKRTAVLLALAAYGAHKVYPLVRQCLAPAR GLQAPAGEPTQEASGVAAAKAGMNRVFLQRLLWLLRLLFPRVLCRETGLLALHSAALV SRTFLSVYVARLDGRLARCIVRKDPRAFGWQLLQWLLIALPATFVNSAIRYLEGQLAL SFRSRLVAHAYRLYFSQQTYYRVSNMDGRLRNPDQSLTEDVVAFAASVAHLYSNLTKP LLDVAVTSYTLLRAARSRGAGTAWPSAIAGLVVFLTANVLRAFSPKFGELVAEEARRK GELRYMHSRVVANSEEIAFYGGHEVELALLQRSYQDLASQINLILLERLWYVMLEQFL MKYVWSASGLLMVAVPIITATGYSESDAEAVKKAALEKKEEELVSERTEAFTIARNLL TAAADAIERIMSSYKEVTELAGYTARVHEMFQVFEDVQRCHFKRPRELEDAQAGSGTI GRSGVRVEGPLKIRGQVVDVEQGIICENIPIVTPSGEVVVASLNIRVEEGMHLLITGP NGCGKSSLFRILGGLWPTYGGVLYKPPPQRMFYIPQRPYMSVGSLRDQVIYPDSVEDM QRKGYSEQDLEAILDVVHLHHILQREGGWEAMCDWKDVLSGGEKQRIGMARMFYHRPK YALLDECTSAVSIDVEGKIFQAAKDAGIALLSITHRPSLWKYHTHLLQFDGEGGWKFE KLDSAARLSLTEEKQRLEQQLAGIPKMQRRLQELCQILGEAVAPAHVPAPSPQGPGGL QGAST" repeat_region complement(41598..41880) /rpt_family="Alu-Jo" exon 42271..42451 /gene="ALD" /number=2 repeat_region complement(42978..43260) /rpt_family="Alu-Jb" repeat_region 43350..43638 /rpt_family="Alu-Ya" repeat_region complement(44088..44220) /rpt_family="Alu-Jo" repeat_region complement(45925..46208) /rpt_family="Alu-Sz" repeat_region 46399..46478 /rpt_family="MIR2" repeat_region 47331..47399 /rpt_family="Alu" repeat_region 47400..47624 /rpt_family="Alu-Sxzg" repeat_region complement(47857..48001) /rpt_family="L1MB7" repeat_region complement(48003..48336) /rpt_family="L1MA10" repeat_region complement(48339..48692) /rpt_family="L1MD1" repeat_region complement(48730..48918) /rpt_family="L1" exon 49150..49292 /gene="ALD" /number=3 exon 49383..49551 /gene="ALD" /number=4 exon 50195..50289 /gene="ALD" /number=5 repeat_region complement(51328..51481) /rpt_family="MLT1C" repeat_region complement(51485..51791) /rpt_family="MLT1C" repeat_region 52111..52412 /rpt_family="Alu-Jb" repeat_region 52425..52695 /note="possibly Alu-FLA" /rpt_family="Alu" exon 53130..53275 /gene="ALD" /number=6 exon 53612..53757 /gene="ALD" /number=7 repeat_region complement(55504..55591) /rpt_family="MLT1B" repeat_region complement(55621..55858) /rpt_family="MLT1C" exon 56025..56109 /gene="ALD" /number=8 exon 56259..56384 /gene="ALD" /number=9 exon 56527..56899 /gene="ALD" /number=10 3'UTR 56774..56899 /gene="ALD" repeat_region 58481..58684 /rpt_family="L1MB7" repeat_region complement(58738..58794) /rpt_family="Alu-J" repeat_region complement(58803..59031) /rpt_family="Alu-Jo" repeat_region 59034..59147 /rpt_family="L1MB7" repeat_region complement(59313..59443) /rpt_family="Alu-FLA" repeat_region 60439..61186 /rpt_family="Alu-Sx" repeat_region 61447..61499 /rpt_family="Alu" repeat_region 61500..61584 /rpt_family="Alu-J" repeat_region 61746..62036 /rpt_family="Alu-Sx" repeat_region 62331..62436 /rpt_family="MLT1B" repeat_region complement(64154..64199) /rpt_family="Alu" repeat_region complement(64200..64443) /rpt_family="Alu-Spqxzg" repeat_region 64444..64820 /rpt_family="L1ME3A" repeat_region 64860..64969 /rpt_family="L1MB3" repeat_region 65019..65335 /rpt_family="MER1A" repeat_region 65747..65949 /rpt_family="Alu-Spqxzg" repeat_region 66302..66570 /rpt_family="Alu-Ya" repeat_region 66591..66742 /rpt_family="L1" repeat_region 66750..67209 /rpt_family="L1" repeat_region 67210..67482 /rpt_family="Alu-Jo" repeat_region 67500..67629 /rpt_family="Alu-Jo" repeat_region 67635..68293 /rpt_family="L1" repeat_region 67866..68155 /rpt_family="Alu-Spqxzg" repeat_region complement(68323..68548) /rpt_family="Alu-J" repeat_region complement(68550..68608) /rpt_family="Alu-Jb" repeat_region complement(68667..68939) /rpt_family="Alu-Ya" repeat_region 68997..69139 /rpt_family="L1" repeat_region 69155..69839 /rpt_family="L1" repeat_region 69843..69932 /rpt_family="L1" repeat_region 69987..70113 /rpt_family="L1MA2" repeat_region 70114..70400 /rpt_family="Alu-Sx" repeat_region 70403..70545 /rpt_family="L1MA9" repeat_region 70552..70697 /rpt_family="L1MA2" repeat_region 70699..71077 /rpt_family="L1MA9" repeat_region 71445..71733 /rpt_family="Alu-Sx" repeat_region 72093..72298 /rpt_family="Alu-Spqxzg" repeat_region 72303..72505 /rpt_family="Alu-J" repeat_region 72514..72557 /rpt_family="L1MB3" repeat_region 72674..72962 /rpt_family="Alu-Sx" repeat_region 73084..73196 /rpt_family="L1" repeat_region 73294..73515 /rpt_family="L1" repeat_region complement(73547..73838) /rpt_family="Alu-Sp" repeat_region 73918..74700 /rpt_family="Alu-Jo" repeat_region 73950..74195 /rpt_family="Alu-Sg" repeat_region complement(75450..75728) /rpt_family="Alu-Jo" repeat_region 76070..76198 /rpt_family="MIR" repeat_region 76910..76996 /rpt_family="MIR" gene 79726..92379 /gene="PLEXR" exon 79726..80952 /gene="PLEXR" /note="based on gene prediction" /number=1 /evidence=not_experimental exon 81288..81467 /gene="PLEXR" /note="based on gene prediction" /number=2 /evidence=not_experimental exon 81987..82115 /gene="PLEXR" /note="based on gene prediction" /number=3 /evidence=not_experimental exon 82865..82978 /gene="PLEXR" /note="based on gene prediction" /number=4 /evidence=not_experimental exon 83131..83287 /gene="PLEXR" /note="based on gene prediction" /number=5 /evidence=not_experimental exon 83377..83485 /gene="PLEXR" /note="based on gene prediction" /number=6 /evidence=not_experimental exon 83567..83685 /gene="PLEXR" /note="based on gene prediction" /number=7 /evidence=not_experimental exon 83843..84120 /gene="PLEXR" /note="based on gene prediction" /number=8 /evidence=not_experimental exon 84349..84442 /gene="PLEXR" /note="based on gene prediction" /number=9 /evidence=not_experimental exon 84525..84694 /gene="PLEXR" /note="based on gene prediction" /number=10 /evidence=not_experimental exon 84903..85054 /gene="PLEXR" /note="based on gene prediction" /number=11 /evidence=not_experimental exon 85948..86055 /gene="PLEXR" /note="based on gene prediction" /number=12 /evidence=not_experimental exon 86269..86445 /gene="PLEXR" /note="based on gene prediction" /number=13 /evidence=not_experimental exon 86557..86752 /gene="PLEXR" /note="based on gene prediction" /number=14 /evidence=not_experimental exon 86898..87137 /gene="PLEXR" /note="based on gene prediction" /number=15 /evidence=not_experimental exon 87225..87383 /gene="PLEXR" /note="based on gene prediction" /number=16 /evidence=not_experimental exon 87459..87610 /gene="PLEXR" /note="based on gene prediction" /number=17 /evidence=not_experimental exon 87745..87845 /gene="PLEXR" /note="based on gene prediction" /number=18 /evidence=not_experimental exon 87919..88214 /gene="PLEXR" /note="based on gene prediction" /number=19 /evidence=not_experimental exon 88264..88442 /gene="PLEXR" /note="partially confirmed by cDNA" /number=20 mRNA join(<88420..88442,88633..88699,88926..89253,89395..89498, 89926..90019,90247..90419,90567..90727,90987..91134, 91300..91364,91435..91510,91639..91713,91974..92379) /gene="PLEXR" CDS join(<88420..88442,88633..88699,88926..89253,89395..89498, 89926..90019,90247..90419,90567..90727,90987..91134, 91300..91364,91435..91510,91639..91713,91974..92078) /gene="PLEXR" /codon_start=1 /product="plexin related protein" /db_xref="PID:g1508821" /translation="NPKLMLRRTETMVEKLLTNWLSICLYAFLREVAGEPLYMLFRAI QYQVDKGPVDAVTGKAKRTLNDSRLLREDVEFQPLTLMVLVGPGAGGAAGSSEMQRVP ARVLDTDTITQVKEKVLDQVYKGTPFSQRPSVHALDLEWRSGLAGHLTLSDEDLTSVT QNHWKRLNTLQHYKVPDGATVGLVPQLHRGSTISQSLAQRCPLGENIPTLEDGEEGGV CLWHLVKATEEPEGAKVRCSSLREREPARAKAIPEIYLTRLLSMKGTLQKFVDDTFQA ILSVNRPIPIAVKYLFDLLDELAEKHGIEDPGTLHIWKTNSLLLRFWVNALKNPQLIF DVRVSDNVDAILAVIAQTFIDSCTTSEHKVGRDSPVNKLLYAREIPRYKQMVERYYAD IRQSSPASYQEMNSALAELSGNYTSAPHCLEALQELYNHIHRYYDQIISALEEDPVGQ KLQLACRLQQVAALVENKVTDL" exon 88633..88699 /gene="PLEXR" /number=21 exon 88926..89253 /gene="PLEXR" /number=22 exon 89395..89498 /gene="PLEXR" /number=23 exon 89926..90019 /gene="PLEXR" /number=24 exon 90247..90419 /gene="PLEXR" /number=25 exon 90567..90727 /gene="PLEXR" /number=26 exon 90987..91134 /gene="PLEXR" /number=27 exon 91300..91364 /gene="PLEXR" /number=28 exon 91435..91510 /gene="PLEXR" /number=29 exon 91639..91713 /gene="PLEXR" /number=30 exon 91974..92379 /gene="PLEXR" /number=31 3'UTR 92079..92379 /gene="PLEXR" repeat_region complement(92377..92668) /rpt_family="Alu-Sx" exon 94126..94184 /gene="SK" /note="based on gene prediction" /number=1 /evidence=not_experimental gene 94126..98490 /gene="SK" exon 94255..94385 /gene="SK" /note="based on gene prediction" /number=2 /evidence=not_experimental exon 94544..94652 /gene="SK" /note="based on gene prediction" /number=3 /evidence=not_experimental exon 94782..94869 /gene="SK" /note="based on gene prediction" /number=4 /evidence=not_experimental exon 95160..95247 /gene="SK" /note="based on gene prediction" /number=5 /evidence=not_experimental exon 95811..95917 /gene="SK" /note="based on gene prediction" /number=6 /evidence=not_experimental exon 95992..96157 /gene="SK" /note="based on gene prediction" /number=7 /evidence=not_experimental exon 97121..97252 /gene="SK" /note="based on gene prediction" /number=8 /evidence=not_experimental exon 97333..97433 /gene="SK" /note="based on gene prediction" /number=9 /evidence=not_experimental mRNA join(<97392..97433,97789..97896,97973..98042,98120..98212, 98375..>98490) /gene="SK" /note="partial" CDS join(<97392..97433,97789..97896,97973..98042,98120..98212, 98375..>98490) /gene="SK" /codon_start=1 /product="serine kinase" /db_xref="PID:g1302654" /translation="IKIKIADLGNACWVHKHFTEDIQTRQYRAVEVLIGAEYGPPADI WSTACMAFELATGDYLFEPHSGEDYSRDEDHIAHIVELLGDIPPAFALSGRYSREFFN RRGELRHIHNLKHWGLYEVLMEKYEWPLEQATQFSAFLLPM" exon 97789..97896 /gene="SK" /number=10 exon 97973..98042 /gene="SK" /number=11 exon 98120..98212 /gene="SK" /number=12 exon 98375..98559 /note="partially confirmed by cDNA" /number=13 3'UTR complement(98808..98910) /gene="IDH" exon complement(98808..99012) /gene="IDH" /number=13 mRNA complement(join(98808..99012,99251..99311,99394..99488, 99840..99986,100090..100192,100503..100636,100862..100994, 101114..101174,102751..102863,103234..103331, 103603..103614,103849..103890,107285..107444)) /gene="IDH" gene complement(98808..107444) /gene="IDH" CDS complement(join(98911..99012,99251..99311,99394..99488, 99840..99986,100090..100192,100503..100636,100862..100994, 101114..101174,102751..102863,103234..103331, 103603..103614,103849..103890,107285..107365)) /gene="IDH" /codon_start=1 /product="NAD+-isocitrate dehydrogenase gamma subunit" /db_xref="PID:g1302655" /translation="MALKVATVAGSAAKAVLGPALLCRPWEVLGAHEVPSRNIFSEQT IPPSAKYGGRHTVTMIPGDGIGPELMLHVKSVFRHACVPVDFEEVHVSSNADEEDIRN AIMAIRRNRVALKGNIETNHNLPPSHKSRNNILRTSLDLYANVIHCKSLPGVVTRHKD IDILIVRENTEGEYSSLEHESVAGVVESLKIITKAKSLRIAEYAFKLAQESGRKKVTA VHKANIMKLGDGLFLQCCREVAARYPQITFENMIVDNTTMQLVSRPQQFDVMVMPNLY GNIVNNVCAGLVGGPGLVAGANYGHVYAVFETATRNTGKSIANKNIANPTATLLASCM MLDHLKLHSYATSIRKAVLASMDNENMHTPDIGGQGTTSEAIQDVIRHIRVINGRAVE A" exon complement(99251..99311) /gene="IDH" /number=12 exon complement(99394..99488) /gene="IDH" /number=11 repeat_region 99556..99772 /rpt_family="L1MB7" exon complement(99840..99986) /gene="IDH" /number=10 exon complement(100090..100192) /gene="IDH" /number=9 exon complement(100503..100636) /gene="IDH" /number=8 exon complement(100862..100994) /gene="IDH" /number=7 exon complement(101114..101174) /gene="IDH" /number=6 repeat_region 102303..102590 /rpt_family="Alu-Sg" exon complement(102751..102863) /gene="IDH" /number=5 exon complement(103234..103331) /gene="IDH" /number=4 exon complement(103603..103614) /gene="IDH" /number=3 exon complement(103849..103890) /gene="IDH" /number=2 exon complement(107285..107444) /gene="IDH" /number=1 5'UTR complement(107366..107444) /gene="IDH" 5'UTR 107677..107726 /gene="TRAP" exon 107677..107793 /gene="TRAP" /number=1 mRNA join(107677..107793,109473..109591,110497..110571, 110764..110853,111110..111175,111368..111537) /gene="TRAP" gene 107677..111537 /gene="TRAP" CDS join(107727..107793,109473..109591,110497..110571, 110764..110853,111110..111175,111368..111472) /gene="TRAP" /codon_start=1 /product="translocon-associated protein delta" /db_xref="PID:g1302656" /translation="MAAMASLGALALLLLSSLSRCSAEACLEPQITPSYYTTSDAVIS TETVFIVEISLTCKNRVQNMALYADVGGKQFPVTRGQDVGRYQVSWSLDHKSAHAGTY EVRFFDEESYSLLRKAQRNNEDISIIPPLFTVSVDHRGTWNGPWVSTEVLAAAIGLVI YYLAFSAKSHIQA" exon 109473..109591 /gene="TRAP" /number=2 repeat_region complement(109931..110087) /rpt_family="Alu-Sxzg" exon 110497..110571 /gene="TRAP" /number=3 exon 110764..110853 /gene="TRAP" /number=4 exon 111110..111175 /gene="TRAP" /number=5 exon 111368..111537 /gene="TRAP" /number=6 3'UTR 111473..111537 /gene="TRAP" repeat_region 112034..112300 /rpt_family="Alu-Jb" repeat_region complement(112744..113035) /rpt_family="Alu-Sq" repeat_region complement(114172..114462) /rpt_family="Alu-Sz" 3'UTR complement(115207..116391) /gene="Xq28lu1" exon complement(115207..117939) /gene="Xq28lu1" /note="partially confirmed by cDNA" /number=7 gene complement(115207..121686) /gene="Xq28lu1" exon complement(118136..118246) /gene="Xq28lu1" /note="based on gene prediction" /number=6 /evidence=not_experimental exon complement(118544..118645) /gene="Xq28lu1" /note="based on gene prediction" /number=5 /evidence=not_experimental exon complement(119110..119172) /gene="Xq28lu1" /note="based on gene prediction" /number=4 /evidence=not_experimental exon complement(119781..119879) /gene="Xq28lu1" /note="based on gene prediction" /number=3 /evidence=not_experimental exon complement(120318..120408) /gene="Xq28lu1" /note="based on gene prediction" /number=2 /evidence=not_experimental exon complement(121381..121626) /gene="Xq28lu1" /note="based on gene prediction" /number=1 /evidence=not_experimental 5'UTR complement(121452..121686) /gene="Xq28lu1" repeat_region 122284..122340 /rpt_family="Alu-S" repeat_region 122342..122573 /rpt_family="Alu-Ya" repeat_region complement(123992..124137) /rpt_family="Alu-Sc" repeat_region complement(124140..124415) /rpt_family="Alu-Sz" repeat_region 126245..126332 /rpt_family="L1PA7" repeat_region complement(127024..127125) /rpt_family="Alu-Jb" repeat_region 127143..127171 /rpt_family="Alu" repeat_region 127172..127450 /rpt_family="Alu-Sx" repeat_region 130000..130211 /rpt_family="L1PA15'" repeat_region 130321..130676 /rpt_family="L1MB7" repeat_region 130943..131232 /rpt_family="Alu-Sz" repeat_region 131250..131425 /rpt_family="Alu-J" repeat_region 133046..133332 /rpt_family="Alu-Sq" repeat_region 133988..134743 /rpt_family="L1" repeat_region 134262..134552 /rpt_family="Alu-Sz" repeat_region complement(134747..134789) /rpt_family="Alu" repeat_region complement(134790..135039) /rpt_family="Alu-Spqxzg" repeat_region 135449..135980 /rpt_family="L1MB3" repeat_region 135981..136198 /rpt_family="L1MB7" repeat_region 141353..141508 /rpt_family="Alu-J" repeat_region complement(142107..142747) /rpt_family="MER2" repeat_region complement(142336..142624) /rpt_family="Alu-Ya" repeat_region complement(143173..143478) /rpt_family="MLT1A" repeat_region complement(143614..143904) /rpt_family="Alu-Sx" repeat_region 144093..144365 /rpt_family="Alu-Sx" repeat_region 144390..144665 /rpt_family="Alu-Sg" repeat_region 144860..145071 /rpt_family="Alu-J" repeat_region complement(145103..145225) /rpt_family="L1MB3" repeat_region complement(145253..145560) /rpt_family="L1MA10" repeat_region complement(145590..145875) /rpt_family="Alu-Jo" repeat_region 145944..146194 /rpt_family="Alu-Sq" repeat_region complement(146324..146401) /rpt_family="L1MD2" repeat_region complement(146413..147087) /rpt_family="Alu-J" repeat_region complement(147090..147157) /rpt_family="Alu-J" repeat_region complement(147459..147662) /rpt_family="Alu-J" repeat_region complement(148061..148230) /rpt_family="Alu-J" repeat_region 148614..148899 /rpt_family="Alu-Sx" repeat_region complement(148986..149075) /rpt_family="L1" repeat_region 149122..149388 /rpt_family="Alu-Ya" exon 149516..150040 /gene="CCp" /pseudo /number=1 mRNA 149516..150040 /gene="CCp" gene 149516..150040 /gene="CCp" CDS 149578..149895 /gene="CCp" /note="cytochrome c pseudogene" /codon_start=1 /pseudo repeat_region complement(150707..151001) /rpt_family="Alu-Sz" repeat_region 151163..151447 /rpt_family="Alu-Sx" repeat_region complement(151471..151839) /rpt_family="L1MB7" repeat_region 151840..151884 /rpt_family="MER42C" repeat_region 152320..152604 /rpt_family="Alu-Jo" repeat_region 152708..153079 /rpt_family="L1" BASE COUNT 34849 a 44143 c 42700 g 31768 t ORIGIN 1 gatcagggag cttgaagctg aggggggcac actttacctc ccaggccagg acaatgacca 61 cttccttccc caccccaccc ccaggctact cttagcccta gaaaattcta aacaagctgc 121 tcagctggcg gcggagaggc agcccaacaa gctggctctt gctagggagg cctggggggt 181 cctggggaga ggaacacggg gtgggtgggg ggcgggcagc caggacctca ggcctgaggc 241 ctttggggaa gggtctgtgc acctgccagg caccaggggg cagccttgcc ttgttcccgc 301 tccagtcccc tcaagtccga agcccctacc cactctcacg ccaggcaggg gtgggggccg 361 ccggggtcat ttacccgggc cccttctctg ccttgatgac aaagtcgagg cttgctcatc 421 agccaggcag gctcccctct gcccactgtg gagacacaga ggcctgtcac ctgaagagct 481 ggtcccggcc tccagcttcc agggtagccg ggaagctgta gcccccagtg ggcagcggtg 541 gagagagctc aaggaaggag ggagcaccgg gaggagacgg ctgcagcctg ccaggagcgg 601 ggagaaaggg agagaagggg aggcggaggg ctgagggggc ccgggggacg tcttcccagg 661 gctgggaggg gccggccggg aagcctgggc tgcactagga gccggcgacc ctggggcgag 721 gggcggcccg gagccctgcg ggaggagctg gcggccgccc caggtagcaa ccatcctgcc 781 tcccgctgga gcggcgtctc ctccccggga ggagggcagg gaggaggtgg gcggagtgtg 841 acgaggaggg cgggagggag ggatgcggga gggggagggg gaggggggcc ggccggccgt 901 gggggtgggg cgatagtgac atcaccccgg agtcggtttt taagcggcgg ccggccgggg 961 acggggaaga gagggatagt cggagcgagg tggcgagtcg ctgagcccgc cgcggccccg 1021 agagcggctg cagccgccgc cgccgggaag gagagggcga ggcgcgcccg agccgccgcc 1081 gccgccgcca ccgccgccgc cgccaccacc gccaccggag tcgcgggcca gccgggcagc 1141 ctccgcgggc cccggccggg gcggggggcg cgggccacag gcccctgctc cggccgccgc 1201 ttgcagaccg cgggcgccga tgtcgcccgc gccccgctag gctgagcctc gggtcgggcg 1261 aggagccgcc gcagccgccg ccgcccgagc cgcgggcagg agcctcggga gccgccgccg 1321 ccgccgccgc cgcccggccg ggccccgccg ccgcccgcgc gcccccgggc ccccgacaca 1381 catgagattc ttcaggctca ctttcaagtg cttcgtggac tgcttctgac tgcgccgccc 1441 gcgccccgca ccccgccgcc cgcccgccgc cccgtccccc ggcccggccg ccccccggcc 1501 cccggccggc ccgcgccctc ggggccctcc ccggtgccgc cggtgccccc cgcctgaccg 1561 ccgccccccg tgaggcgccg cgaccccggc ccggccgtgc ggcccgccga ggccatggcg 1621 aagaagagcg ccgagaacgg catctatagc gtgtccggcg acgagaagaa gggccccctc 1681 atcgcgcccg ggcccgacgg ggccccggcc aagggcgacg gccccgtggg cctggggaca 1741 cccggcggcc gcctggccgt gccgccgcgc gagacctgga cgcgccagat ggacttcatc 1801 atgtcgtgcg tgggcttcgc cgtgggcttg ggcaacgtgt ggcgcttccc ctacctgtgc 1861 tacaagaacg gcggaggtga gttcccccgc ccgccgcggc ctcctccccc agcaggccgc 1921 cggcccccgc ccgacccccg gagccgccgc ggaggggtga agtccgggca acgggtggcc 1981 cccgggcacg cgggggtcgg ggccgcccct cgtccgccgc tgccgctcgg tggccgggcc 2041 gggcgcctcc acccccctcg cagtcatgtg cctggcatgg tggggggagg gggccggcga 2101 tgcccgcgag gctgcccccc agactcccgg gctgggagga gcgattggcc gccgaggtgg 2161 gaaagcaggc ctgcgccttg gggtctccgc gaggtaagga gccctggctg cccccacggg 2221 tcgggcacac aagcggcaca ttgtgtgggc cccccacgtg tgcacacaca cgaacacaca 2281 cacacacaat gggccactct gtccctcccc ctgccctccc ctcccctcgc ggccctcccg 2341 cccctcccct ctggcccggg cctggaacac tgggtgcccg agccaggctt gggaagcctg 2401 cggcctggcc cgcctggcgc cgccactgga cacactgcat gcacgtccca tgcccgcccg 2461 cccgcccgcc cgcccgggcc cagcttagca acagcgatgg gcacgcgtgt gtcctgtgac 2521 tacaaaacag cactggggtt gctggaagcc gaagtgaccc ggtgatgggt gggaaacaga 2581 ggtccagagc aaaggccttt gcccaaggtc aggagaagga tgctgggacc tggagtcagg 2641 caagttgcag ccaagctcag cctctgagta gtggagcgag cccagccagg gcaagggtag 2701 gaggcccaga gaggagaagg gggtagtggc acccagctct ccctgccctt ctgccacccc 2761 caccccagcc tgctggcctc aggagatagg cctgtgtcac gccctgccta tctcctgcag 2821 agcctgactc cctggccttg ctaaggccgg cctggcccct cttccgcacc tgtatccctc 2881 tgtccttgca catcgccatc ccaccagcag gggactgtga cccacccacc ctctgcctta 2941 gacctcacac ttgcaggcaa gcgtccaagg gcaggacagt cgcgctccct gcctttggat 3001 gagcccccca ggcctgatca cccagccttg gcacacatgc acacatgcac gtgccctcac 3061 tgtgctgcct gaaacaggga attgcagcac tagggacagc ccgcgtgtct gagcgtgtgt 3121 gtcctccatg gccatcgccc caagtgaccg tgggggtgga agccctgggg gcctagggcc 3181 cctctgccac ccagggaata gggctccaat ggctcagggg ctactgtagc ccctcttcaa 3241 cacactcaac ccaccccctc aagactccac ctggggcctg agtcagtggc cacccctaca 3301 ctgactcacc cagtcggaag ttgtgatggg gcctttggag tctgggctgg cccgctgggc 3361 ctgggcagcc tggctggggg ccaccctgag tccacgctgt gcctccaccc ccaggtgtgt 3421 tccttattcc ctacgtcctg atcgccctgg ttggaggaat ccccattttc ttcttagaga 3481 tctcgctggg ccagttcatg aaggccggca gcatcaatgt ctggaacatc tgtcccctgt 3541 tcaaaggtga gcagcccttg gccagcctca gggactgccc ccttctccca gctggctccc 3601 acttgagaaa tcttttcctg tcgtgagcac caggcctggg gccacgtgat ggcgtcccag 3661 tctcgagggg ggagcctgga ggagatgttc aggccgcaca gcgaacttgg ggaagcgggg 3721 actagagggg gcataggcag ctccacaagg caaggacagg ccaggcatag ccgggctggg 3781 gacgggacct gcccagcagc acccttggct ctctaggtag gtcctactgt tactatcccc 3841 aaggacgctg gggcacagac aggtggagcg acgtactgag gttgcccact gcaggggcga 3901 ctgtctccaa cactacctca ggcgactaga aacccccccc ccccaccacc accatcaaca 3961 ccagctgctg aggactggag gctactgggt ggccaggcag aggcttggac ctcctggaac 4021 cgccatggtg gcagtgggac ccacagaagg ggccaggtgt atgaggctgg agactccaca 4081 gcacttggtc agatggggac aggaggagag gggctcgctc tgccttgggt ctagggggcg 4141 gctggaggag aggagacagg ctggggagtc agcgcagtgt tggggctcac acaaggggga 4201 gcccagggga gtcaggagca ccacaaacaa ggctccagga ggacagatgg tgggagcacg 4261 gccagcctgg gtggggacat aaaggggtgg cagggggagg tggccaggga agaatctaca 4321 tggcaaggac ttcccggccc caggcctggg ctacgcctcc atggtgatcg tcttctactg 4381 caacacctac tacatcatgg tgctggcctg gggcttctat tacctggtca agtcctttac 4441 caccacgctg ccctgggcca catgtggcca cacctggaac actcccgact gcgtggagat 4501 cttccgccat gaagactgtg ccaatgccag cctggccaac ctcacctgtg accagcttgc 4561 tgaccgccgg tcccctgtca tcgagttctg ggagtgagtc cggcacctct gggccaagcc 4621 catcccatcc cccaggtctc cctcatgttg cccggctcca ggggagtggc cctgaggggg 4681 caccagggtg ttgcctggca gtccatcctg gaccctgcct gcccttgcct gtcctcggag 4741 agtcctgggg ccagcctcgc tcctgggttc ggcagccgat cactgtcctg gtcactcccc 4801 cctgatgggg gagctggggc tgcatgtgag gtgggatggg agtggcctcc caatggccag 4861 gggatcgtgg gctccaggcc cagcccaatt ggacaagagg gacccgctga accctgggct 4921 gtgggagaga agggagccac aactcctggg ggtggaccct gtggctccat cctctgctgg 4981 cacaggcctc atgggacctc cctccctccc ctaggaacaa agtcttgagg ctgtctgggg 5041 gactggaggt gccaggggcc ctcaactggg aggtgaccct ttgtctgctg gcctgctggg 5101 tgctggtcta cttctgtgtc tggaaggggg tcaaatccac gggaaaggta ccactagagg 5161 catgcagcgg ggagggtggc tcagccctgg gagccggatg tctgtgccag gcacacctgt 5221 ggcaacggga ggtgaccaga cagagtctag ccctaaggaa gggggaggta ctgaaagcca 5281 agcaatgctc cccaccctgc aaatccaggg cccagcagcc tttgctcctg gggatagagg 5341 ccctggcagg cactgtccct tccctgtgcc catcaccccc actggtgccc tcctgccagt 5401 ctctgactct tgtgacagtc tggtggacct ggtctggcca tctgttacct atcttgcctt 5461 ggggacccag agcagagtct ggccacatcc cttgggggct cctggtcagg ctggggagtc 5521 acctgaacaa agaagacagt gtctagagct gtgggacatg gccagctccc tgggggacaa 5581 ggtccccaga gcagcatgtg ggaagagggg gcagacagtg tggcagctgc atctcgcctg 5641 cctctgcctg gcccagttcc actctccacc tgctcaaccc ccacctctct ccagaagagg 5701 agggggaccc gacccggatc caatatcccg ctccctgcct gggcctccca cacctgcact 5761 gcccacacac tcatacagct ctcactcccc acgtgctcca cgcctcctgt ccccactgag 5821 gagagctccc agaggctcgc ctgctcccca ccgacacgcg tccctgcaga caaacgaggc 5881 gcccagggag cttccccact gcacttggcc agggctgccg gggcgcagcc ttgcccctag 5941 cttcctctgg cgggagccat ggctcggagg acaatgggga cctctgaaca tacctgcccg 6001 caagggggac cggaggcgct gggagtgggg gtgtgaggga ggtggtgcca cagcctccgc 6061 tgagcagcct ggccccccag atcgtgtact tcactgctac attcccctac gtggtcctgg 6121 tcgtgctgct ggtgcgtgga gtgctgctgc ctggcgccct ggatggcatc atttactatc 6181 tcaagcctga ctggtcaaag ctggggtccc ctcaggtgag gtggaggtgg agaggctgca 6241 gcagggcgct gcgggggagc cctgcaggcc cctcatgcct gcgctctccg gcccttctct 6301 aggtgtggat agatgcgggg acccagattt tcttttctta cgccattggc ctgggggccc 6361 tcacagccct gggcagctac aaccgcttca acaacaactg ctacaagtaa gcaccgccgc 6421 cctgccaccc gtgccctgtc ctgccctgcc ccgccctgcc cagcagccta acccatccac 6481 tctggcccct ccacccctca gggacgccat catcctggct ctcatcaaca gtgggaccag 6541 cttctttgct ggcttcgtgg tcttctccat cctgggcttc atggctgcag agcagggcgt 6601 gcacatctcc aaggtggcag agtcaggtag ggccctaccc ccagccccgc ctccagagca 6661 gcgagtgcta cccagatgca tgatgtacag gaacatgcaa tagaaatgct gaaaagtgac 6721 gaggattcaa acggaacttg tcagattgtg ggcctgtggg ggcaggtcct gggatttgtc 6781 aatgttgaca gagaaaggac ctcccagccc ctgccgcacg acccagggtt gacagcgcct 6841 ctgaggcagg cgtgggcatg ggcgcgagtg ttgcaggcag ggctcagggt gcgcacaggg 6901 caggacatcg gctacaaggt ctagagcctg cacctttccc acagggccgg gcctggcctt 6961 catcgcctac ccgcgggctg tcacgctgat gccagtggcc ccactctggg ctgccctgtt 7021 cttcttcatg ctgttgctgc ttggtctcga cagccaggtt tgcatggggc tctgggacag 7081 ggagccagga ggggggcgga gggagggctg caggcaagga aaggggtgga gggcggtgcg 7141 gggctcggcc tgagctgccc tggccacagt ttgtaggtgt ggagggcttc atcaccggcc 7201 tcctcgacct cctcccggcc tcctactact tccgtttcca aagggagatc tctgtggccc 7261 tctgttgtgc cctctgcttt gtcatcgatc tctccatggt gactgatgtg agtggggtgg 7321 ggggtctgcc tgtgacctct ggtggccgtc tgccatcctc cctgactggg ctctgtcccc 7381 cagggcggga tgtacgtctt ccagctgttt gactactact cggccagcgg caccaccctg 7441 ctctggcagg ccttttggga gtgcgtggtg gtggcctggg tgtacggtag gtcatggctg 7501 agggctgggc tgggggatgg tggcggggaa ggcaggtctc cagcttggcc ctcccgcctc 7561 acctcgccgc aggagctgac cgcttcatgg acgacattgc ctgtatgatc gggtaccgac 7621 cttgcccctg gatgaaatgg tgctggtcct tcttcacccc gctggtctgc atggtaaggg 7681 ctgggggagg tggggcaggg cggggggcga ggcagggcgg ggtaggggcc ccattaaccg 7741 cagcattctg gtccgtaggg catcttcatc ttcaacgttg tgtactacga gccgctggtc 7801 tacaacaaca cctacgtgta cccgtggtgg ggtgaggcca tgggctgggc cttcgccctg 7861 tcctccatgc tgtgcgtgcc gctgcacctc ctgggctgcc tcctcagggc caagggcacc 7921 atggctgagg taaggctccc gcccggcccg ccctcccctc ccctgctgtg aacattcaac 7981 ccagcctgct tcctagccag ggagtggccc cgactagggt ggcaggcagt gggaaccgga 8041 gagaggcaga ggaagtcacc gtggggacga gcaggtgacc ctgggggctt cagcatgtcc 8101 tcctctcctg cagcgctggc agcacctgac ccagcccatc tggggcctcc accacttgga 8161 gtaccgagct caggacgcag atgtcagggg cctgaccacc ctgaccccag tgtccgagag 8221 cagcaaggtc gtcgtggtgg agagtgtcat gtgacaactc agctcacatc accagctcac 8281 ctctggtagc catagcagcc cctgcttcag ccccaccgca cccctccagg gggcctgcct 8341 ttccctgaca cttttggggt ctgcctgggg gaggagggga gaaagcacca tgagtgctca 8401 ctaaaacaac tttttccatt tttaataaaa cgccaaaaat atcacaaccc accaaaaata 8461 gatgcctctc cccctccagc cctagccgag ctggtcctag gccccgccta gtgccccacc 8521 cccacccaca gtgctgcact cctcctgccc ctgccacgcc caccccctgc ccacctctcc 8581 aggctctgct ctgcagcaca cccgtgggtg acccctcacc ccagaagcag cagtggcagc 8641 ttgggaaatg tgaggaaggg aaggagggag agacgggagg gaggagagag aggagaaggg 8701 aggcagggga ggggcagcag aaccaaggca aatatttcag ctgggctata cccctctccc 8761 catccctgtt atagaagctt agagagccag ccagcaatgg aaccttctgg ttcctgcgcc 8821 aatcgccacc agtatcaatt gtgtgagctt gggtgcgagt gcacgcgtgc gtgagtacgg 8881 agagtatata tagatctcta tctcttagca aaggtgaatg ccagatgtaa atggcgcctc 8941 tgggcaaagg aggcttgtat tttgcacatt ttataaaaac ttgagagaat gagatttctg 9001 cttgtatatt tctaaaaaga ggaaggagcc caaaccatcc tctccttacc actcccatcc 9061 ctgtgagccc taccttaccc ctctgcccct agccaaggag tgtgaattta tagatctaac 9121 tttcataggc aaaacaaaag cttcgagctg ttgcgtgtgt gagtctgttg tgtggatgtg 9181 cgtgtgtggt ccccagcccc agactggatt ggaaaagtgc atggtggggg cctcggggct 9241 gtccccacgc tgtccctttg ccacaagtct gtggggcaag aggctgcaat attccgtcct 9301 gggtgtctgg gctgctaacc tggcctgctc aggcttccca ccctgtgcgg ggcacacccc 9361 caggaaggga ccctggacac ggctcccacg tccaggctta aggtggatgc acttcccgca 9421 cctccagtct tctgtgtagc agctttaacc cacgtttgtc tgtcacgtcc agtcccgaga 9481 cggctgagtg accccaagaa aggcttcccc gacacccaga cagaggctgc agggctgggg 9541 ctgggtgagg gtggcgggcc tgcggggaca ttctactgtg ctaaaaagcc actgcagaca 9601 tagcaataaa aacatgtcat tttccaaagc aggctcctgc ttccgcctct gctgctctaa 9661 ggaaggggtc ggggtacagg aggcaggggg aacctcctcc agctggagct gctgccgtga 9721 gcaaggctct gctctggagg cctctgcggc cggcaccctt ctggggactg ggaagggggc 9781 agggaaggca gcagcccagg ggaaggcctt gtccccctgg agccgaggca gttggggaga 9841 gcaggacgag agtgagctgg agagcagcca cacccgcggg gaagggtggc gtaaagccat 9901 gggtgctgaa attttcaaaa tgttacccca agaatttgtc actgaacagg tgccttgtgt 9961 cacttgggcc aggctggtag cagcagaggg gataactctg catcagggat caattttgaa 10021 ggtggagcca ataggggttg tgcatgacca ggatgcaggg ctcaaagagg agttaaggac 10081 aacagatttg gcctgagcaa gaggaaagat ggagctgcca ggtcctgcaa tggggaggca 10141 aggagagaat ggtctggagt cagccttggg tgtgtcatgc aggaagtgtc atccaagtgg 10201 agatgtctag ttggcaggtg gacacaggag ttccagaaag tactggagat ggaactgtgc 10261 aagttcttac cacatagaga tgacactgaa agccctgagc ctgagtgagc tcacagggac 10321 gccgcaagcc ccggaacaca atgagagggg cagagcgaag acgtggcagt gataggggag 10381 gacgcctgag agttcctggt ggggtcctgc aacctgagcc agtgaggacc cctcacaggt 10441 cagggaggag cagtggctgg ctccatctgt ccagtgctgc tgctggtgaa ggacagtgac 10501 ctgcaaatgc tcactgagtc tggcaagggt cacgggggcc tggcgagggt ggcttgcatg 10561 agcgggtgcg tgtgaaaggc tgggtggtgt gcgactgaga aaaggagtgg cggcagcgca 10621 gtgtcatctg cagacgaagg gagagacaac aacgtagttc acccagacaa ggaaatatga 10681 gccagcctgg aaagggaagg cattccaaca cacgacacaa catggctgac cctggagggc 10741 atttctgtga aatgagccat cataaaggga tacttgctat agggttctgc tcctgtgaga 10801 gagacagggc cttacatgag aggagggaga tccacagaga cagagggcaa gggtgggtgc 10861 caggggctgg ggacagggtg gggagtgttg agtggggaca gagtgtcagt ttgagaaaat 10921 aaattctaga ggtggatgga agtggtggct gcgcaacact gtgactgcac ttaatgccac 10981 tgaattgcac atttaacgat ggtgaaaatg gctcattaca tatacactga tgacactata 11041 tatatgtatg atatatatgc gttttaccat gagaagaggt ggagaggaat tggagacact 11101 gagtacagac aggtccttca acgggcggga ccccgtgcac aagatgagca tgtggcaccc 11161 caccctcaaa gggctgggca ccatggcagg gcacagcagg caatgcagtg ggcggctcag 11221 gcaagcacag agagcatcag agattggagc ctgtgaaggg ggagcaggtg acccctcaga 11281 gcaaagtgac agcttgggct gctccctttg cgtcctgccc aggactgcta tcgtgctatg 11341 ggagaacccc cagaggccct gctcctcagc aggcagcacc ccctatggag gggctttacc 11401 cctaaacttc tggagccagg ggagggacct ggcttggaat acggccaacc aagagcctgg 11461 gtgagaaata cacggaccag acagggagca gagaaaggag tggcagtgca gtcccaccct 11521 agctcagcca ggggctctgg agcctgtcct gcagtccctg gcccccatct cttcagcaac 11581 cgctgtttcc agttttcttt ttctccctga gaagcctgtc ctctcaccat gcctgcgcct 11641 tcaagaaccc cgcctgctgg cagctcccac atctccggcc tggccctcct tagctgcaaa 11701 ggtgcttccc aacatcagca agacctctcc ccagggtgcc ccaggccctc acacagcccc 11761 tgtccccaac cgactccaac tgtcctgcag cccacagtca ccctcaggac ccctgagctc 11821 aggccaactg ctttatacac tgtcagccaa gtctctgcct ggatgacaat caccctctgc 11881 taattgttct ccgcacctcc aggccaaatg ccctccaagc cacctcatgc accacgatga 11941 cactaaacac acagaaaaaa gacattgaaa aaaggaaact tcacagaggc ctgtcactta 12001 aagagggtcc tgaaatagag acaccatttc ctcaggactt agctcctgca tcaggggtta 12061 ggacacagag atcaacaagc agcaggcttt gccctcaagc agctcgcagt ctagtggaag 12121 atgggtaaga aaacagatca ggacgcccac gggtgcagat gccctggaac agaagctgat 12181 ccaggaaggc gcgagctgca ggccgccctc cagtctaggc tgggcaagca cctcaatttt 12241 catctctaag agcctgtgcc cacaccccct gccccgttgt tgttccatca ctccactaga 12301 aagggcgctc cagaagctgg cctcgtgcag ctttctgtct gctgctggcc taggcagaac 12361 agcggaagaa gccatcaggg ctggtgaggg aagcacccgt ttggacttta gcctttcaaa 12421 gctcagagaa gggtgagctc agggaggtcc aaggtagctg agagcacttc ctggaaaagt 12481 gggatcagcc ttcggccttg gcacagcaac cagagggtat cgcccacgtg tcccctactc 12541 cctcagacac cacctctcag accgcctgga aagggacaga actcgtcatg aggcggctgt 12601 gctctgagca caagggaagg gcgacaggat gctagagaag ggaaccactg gcctgggccc 12661 ggacagggca ggcagaagcg agcatgcaca gcaggccgtc agctaccctg ccagcatcaa 12721 catccttcag gggtcccccc agttccagga gacacacctc taacctgctc ccctgaccct 12781 tccgcccagt cctcatgcag acaccaggca tggcagaggc cctgcagggt ggaagcactg 12841 tgctgcgggc gggggctgcc ttcctcatgt gctactggag agtagcacag tgcaggggcc 12901 tgggcactgg tgccaggcag gaagccccgg tactggcctg gcttgctgtg ggcctggaag 12961 acacagctct gagggagcca cgggagggac accctggagc cagcacagcg ctctggtggc 13021 aggcacacac ccagcacgtt ctcagggcca agggccccag cccattccca gcccctttct 13081 gcctagctct gccctgggcc agctccaggt cactgccaag gacaagtctc ctctcccagc 13141 tggcattagt cagaggtcat cctgcaaacc ttcggggggg ggggcaggga gtgactagtg 13201 gcgttctgcc acgttctgtc tgtcccaaat gtgacgaaca ggaacccaga gaaggcaagc 13261 gagtcctcta cccggaagcc ccgccggttt actgagcctc ccaagctgcc cacacccagg 13321 gaggcagaca ggacacacac tcggcgggtg gccctgaagc gaggcctggc ccagcccggg 13381 gagcaggagg acagagaggg caaggccttc gagaacaggt gtgagcctgg ccttcagtgg 13441 gggaaacagg ttgaagggct gtggccgctt gggggctcca ggcaggagag aaagcagagc 13501 cctccccaca gctgcagtca cacaccgcac cacgtacaca ccatgacaac ttttattgcc 13561 ctcaagagaa actccagtcc acctgctcca cccaccctcc tgcgggacca aagaaaacac 13621 ccagagggca aaacaaaaag gggctcaaac caacaggaag tcagccccac cgcaagccgg 13681 actacaacta actcgtgctc tccacgctca ggcgtggaag ccaaggctgt gccaggcctg 13741 gccaggccaa gcaggatgac agcaaacgca ttctgaacgt gtagcaatca ggtcccctgt 13801 aatgtgcttg gagagtgtgg acaagggccg agatgacgag ctatgagctg tggaagggaa 13861 tgggggaagc agaagggcac aaacagaagt actggaggga gaggccgggc tctcaggaag 13921 cagcaggcac gtgccaggtg gaagccagct gcaggcaggg gaggaaggag gcccttactc 13981 ttccttcttg tccatgggac catctactgc agcctggaaa gggacagaaa tcccacagca 14041 gtaggttggc cgggtccact cctcccctgc cacctccagc cccatgcccc agaggtccac 14101 ctcggttccc ctctctccta acaacagcta ttcaagtgaa caaggggccc cctccccagc 14161 tgcacccaaa ggcctgccag ggtgggagcg tcagccctgg cccacgctct agggaaagcc 14221 ctggacctaa cgccagccag ggaggactgc caggacctca ctgggggctg agtcctggct 14281 gcagggaaca gcaaggcatc cagtcccctt caagacctga tcagaccctt cccaactctg 14341 cacacctttg acaggtgccc tcgaagccca tctgccaagc ctgccccata cagagggcat 14401 gggtgccccc tttgaggctg gacccttcct ccccacctgc tgtggtgccc aaacttgggc 14461 caccaagcac tgaggccagc tgtccaaagt taggagtatt tatgtggccc tcactcccaa 14521 cgtcaagacc gcctgggctt ccagatgcgg cctggtgcac ccaagctagt ctgaggactc 14581 agatcaggcc tagggcagca ggtgatggcc acaactagcg cctgctaggg aaggtgcctt 14641 tttgacacct tgtgccctca cttgcccagg gatctttgcc ctacgtcact ccccagcacc 14701 ctaggaaaga aggccagcag tgggtcccag agtttcacct gcttctttgt tcttgaccag 14761 gccccaaacc atggctgcgc ctgagcacga aggtaggaag gctcagagcc tagtgagcca 14821 gtgccactcc tgagggccgc cttggcaagt gcctacatct gctgccaggc cacccccctc 14881 ctgcccggtg aagggtccca ctcagtaggg cagaggtggc cagggggagt ggtggagagg 14941 gcagccagcc cctgggcccc tggaaggttc cctccgcacc cgcaggggct gcctcatcct 15001 gctctgctct cctgccctgg gcgcagcaat acgggagggc tgacctgcag ctttgcgtgc 15061 tcctccagca agcggtcgta ctccttggtg aggccctcag actgcttccg catggccaga 15121 acctggtttt cagctttctc tagttcttga aatgatgtaa atgaccaaga aaacagaaac 15181 gaaaagacag gaattagggg gaaaaaaccc gactgctaca gacaccagaa actggcccaa 15241 atctatctca aacgaggtta tacaggaggc tacttctcaa aataaagccc ctctgctttt 15301 gcaggccccc aaagtagagg gaaagggctg acaaaaaagc tcaagataaa gcaaaagaaa 15361 cacagaggcc atcccccagt ccctttaatg gagaggaact ctagtggctc tcggcaaggg 15421 taacctccag ggaggctgag agtgggagac agggagcaag atcccagcct gcaagcgaga 15481 cccaatgaca accacgcctt gcacacagca gcagcaggcg aggcctgtgg tattggggga 15541 aaacgcccca gacttaagtc tatgcgtggg agaccaaaga caggcaggcc gcttgggagc 15601 cgcccactcc cctcctgaac gccactccca cactcccctc attctcagcc cccaggcatg 15661 ctggggctac cgtgccacac tctggacggg aaagccccag catgcactgc tctagtgcag 15721 ggcaatcgag gcccaccaac tgcagcctgg ttcctcctga gccccattca aaccacttag 15781 cctcactggc ctgccggcta agcatggctg cattggggtt ggaggcgcag ggtgctattg 15841 gtctgttttc agccagccct cgagcgtgcg tgcaaggctt gttactaata ctttggcaca 15901 aaatgggcag cagcgggcag aggaggctcc tctggacttc cctgcgggga aggacacgag 15961 gtcgagcctc actttgctta gtgctggcca gctcgtcctt tagcttctgc aggtcagcct 16021 tcaggctcct gttctcttcc tccaacttca cctcagcatt cccgacatcc aacttgcctc 16081 cgtcaacagc agctccctgg gaaaagtgcc aaaggccagg gttactcagg agggagggag 16141 ggagaggttc cagccccatc ctccccaccg agctgcggtt cctcaagctg ccctggccac 16201 acgccccttc ggaaatgtca acgcggaacc gagccaccac ttgctcccag ctcctaggca 16261 aaggccaggg cgtggttgcc cgccgaggga aagagaagcg ccagcggggc cacctgctgc 16321 agctcgccgg gaacgccttg cctgccctgg cccctggccc ctggcccctg gccctgcctc 16381 cttcccaagc agcagggctc agcagctcca tggtgctcac caacccctcc acagatggcg 16441 gtgcctcgtg ctccctacat ggtgccgctc actgcagtta ggagccccca gtcggcctgg 16501 ccagctctat cccacctctg catccacatc cctccgagct tgccttgcag ctcacctcct 16561 gacgggacgt ctaagactgg ccaactaccc tgcccccacc tcctctccag cactgaggga 16621 tgccacagac cccgagttcc agagggggtg cggcaatctt gcagggaaca agggcctagc 16681 tgagggcctt cggatcacag cagagggcct ggctcactga ggggccattt ttctcaggga 16741 agggtctaac tggaagcagt ggatggaaac gagagcagca acaccctcct cctcacccgg 16801 accctcacac acagacgcct ccagcaggca tactctcccc actgaggact tcccctctgc 16861 gcctccaccc aactctggct tttcaggcac atttcccagc gtgacaggct agcagtggcc 16921 actgaggccc tgaagaatgt ggctcccaca gtgtaacacc aggacgcccc atggtgggtc 16981 gggaagctgg gctcaccttc ttgagctggt cattctcctc catgtacttc ttggccgcct 17041 cactagcact ctccgcctgc tttttaaagg cttcattgga ggccagcagc gtggcctgct 17101 gcgaaatgag agtcaccagg cgtctaagca ggctgcaatt atttacaaaa agaagggaga 17161 agtgagaaaa agagcatgaa gggctggcag gagcacctcc tggttgctcc cactccacac 17221 ctagctccag cctggacctg ccctcttgcc aaggcagccg agtgagaagc cgccaacctg 17281 gtgctggcag cgtgagggaa aaggtggggc ccaggagccg tcctctgccg ctgtgcccaa 17341 cggccaccct cagctctcag aggggctgga agcaggagcc tggggggctg ggaagagcct 17401 cgctacagca tgaggtccca gaacgcggca ctttccgggt cggggcctag acgtgccaga 17461 caagccacag caccaccttc ctccctgcga ggctgggctt gccttggtaa ggtaacgaga 17521 gaagctaatc aatccaagca cttccaacat gccaggccgc atcctcacat ctacctgatg 17581 aggaagttac tatcactgcc ccagcttata gaagaggaaa ctgaagttca gcagcgtaaa 17641 tcaatgtacc caaggccaaa aaccagaaat ggacatggct ggaattccaa attatgtctg 17701 cctgactcca gaacctgagc tcggaaccac tctgctctct aaactaacag ggaacagctc 17761 ccaggtccca acgtaagata gaactctctt ctctggccag cctccttccc aacccatcat 17821 gcaggctgcg ctggaacaca tccgttatgt aacagcaccc cgaatgaggt cttcttgggc 17881 tggagggtgt agaggaatca ggacacaggc gcaggctgcc tctctgaagc agccaggaga 17941 gacaggcaaa caggtggcag ctggaggcag atgctagtcc ccaaacagag attggaatgg 18001 ccacttcatt tcccttggtt cacccttgcc ccgagatgtt agctggcagg aagagaggag 18061 ggaaggactc gttcaaacag tcaaaacaag gcaggggttc ctttctcaca cacctcagaa 18121 ggcaagggtc acacagggcc tgggggaagg aagagacaaa tctgcttagt ccagagtgct 18181 tcaacaacag cttactcaga agagtcgaag tggcctcctg ccccagccag gccttcacac 18241 ttcacagcct ctgctcatgg ccaggggcag cccggaaggg ctggagaaag taaagagcag 18301 acaaggtgag ctacctccct ggcccaagcc atggctctcc agggcctcgg cagagcccct 18361 ttccagatgt actcaggaca gaaagtaccc acccgggcca ggagacaccc ctgaggttcc 18421 tggtttgggg agaggctccc aggggcccct ggcagcacca ggagagccag gccgttgatt 18481 cctggcagag aaggagagtt tccagtgaca tgtgctttct aaaattagcg gcccaggacc 18541 tcgtggccta gggctcaggt ttccctgcct cagcccccag ctgcccacca gcctgccccg 18601 cactgggcta cagcctgaag gtggaggaag ctactgagcg ccctaggagc cagagagaaa 18661 caatgcatct gactcacatc ggcatggcca gaagtcaatg gagaggccta gaaagaaagg 18721 caagtctgac taagacccag gcccccggca aggagctgcc cagccccaga gcggatccca 18781 gtgatgtaga aagaggaaga ggaccgctcc tcccagctgg aattgagggg tgggggtcat 18841 gccacctggt ggtagagaga ggaccaagca agactgaagg ctatactccc cgccaccagg 18901 ccaggcaagc ggctgctggt gagtgcccat ggctgtcacc ccagtaccca gggagatagc 18961 taacacaaat gcttccgcgg cagtgcagca gaggcccagc tcttttcgga ccgtcccagg 19021 cccttcccgg ctattgagaa ccagggcttc caagataggc cagggcatac acaaagtcca 19081 gcgcaagatc cacgctgtgt gtgtccgaaa gcctggccct gctcagcccc agcccaggcc 19141 ttcagttccc agccttgaga cagtctgggg ctcccctctg ccaggccccg gttccccttc 19201 ctcttgccaa ccctcacagg cgctccccac ccccacagca ccccgggcat actcctccca 19261 ctgcaccccc agcccgatag ttctttttca caccttctag gtcctctctc ttcctgctgg 19321 atgacccggg atcattctcc ccccaggaac ctcaccttca actgcctcct tcctggagtc 19381 accctgccca agcccctggt cttttccctc ccatatattc ctcaacctag gctggccaag 19441 gcctgccctt ccaagccagc agcagggcca ccagtggcct cctaaccgcc caggccggag 19501 gtcaccctga actccttgct ctgctgctaa gttaccctcc tgaggtcccc tcgcaacacc 19561 ctcctcccac tgttattctg ctccctctgg ggtctgcact cttcagctga caccctatac 19621 cttcctccca gccactctta tccccgaaag ggttttctct gtggcccaga ctcataccta 19681 acctcctgct aaacattggc tcctggatgt ccccagagac attctagact cagcttgtcc 19741 aaaacgggcc ttcccttgtc ctgcctgacc tgaccacctc gtgtagcccc tgctgtagtg 19801 gtgggcaggc aaaccgcctt ggactcagcc ctcttggccc ccagcccaag ccacaaccca 19861 gcactttcca tgtcaactgc aaacatgccc accatcatcc ccactgctgg cgcctcctcc 19921 ctatctgctg ccatacggct ttcccatcca cctcccagag caaggcaaat ccgaccatgt 19981 cagccctctg cttaagccac ctgctgccag cacgcatgcc ctcagtgagc tctcctcctt 20041 cactaaccac gtggccccct gctctagtaa cacctaaccc ctcaccattc ctggaacacg 20101 cctggctctg tgtggcagtt ctccaggccg gaatgtcctc tcgacccagc tcaatcctca 20161 cctcccccca gaaacccttt tggatctccc accccatcag agggacgcct tctgggggct 20221 cctgcagcag ccccccaggc acccgcatgt aactacctca ttctctgttc tctgcgtggc 20281 tgccatccgt ttatatggct gccctaccag gctatgaagg tctttaggct gggcactgtg 20341 ccttcatctc tgcactccca tacctggcac actgaaaagg ggtcttccgc ccactccagc 20401 aagtatagct aaaaaaaaaa agggggggag ggcgcggggc tgggcttcca gatgactgga 20461 tcccactccc aggagaggaa atgctccctg acaggtgagg ggacagattt gaggctgcac 20521 gtaaggctgg acagaatctc cctgggccta gactgcacct gtgttcacct gggagcctgg 20581 caccaagagg ggcagaggca gacacagagc tgctcagtct agcaacagag gagacagaag 20641 acaggagtgg gaaggcgccg tctcagaccc gttctgatgg gcaagccagg ctcatggctg 20701 cagggggaaa aaacattcac tgccgcgacc tgaaggcaca acccagagct ccagcctctg 20761 catcctcaca ccctcaaccc ccacccaggg cccaagcaat gcagaccagg tcctctctga 20821 tcactggcat ttttcagcct gggagccagc cttctagaac attttcccgc tccctcacac 20881 tgggtcactc aggcacgtta acgtgcgctt gtctgttccc ttgtagcttc ccaggccccc 20941 aggacagggc acgaacatgg cctttagctt ctgcctctgc tggatctccc aagtagtctt 21001 acccgaatca ctgttcttag ctattcattt ccagaaaaca ggaaagaacc taagagccaa 21061 aggcaactcc tacagataca gggtggtcac caatagaatg gcctggggtc caaaaaaagg 21121 ccagtgaacg aaacttaaca gaatccagat gtggccttgg aagacacatg gcagccccaa 21181 tgcctcaatc tgactgggct ttcttgatag aatgttgttg gacactgagc agggctatcg 21241 tgcttttata aaaggttgag taaaccagag aaggcaggag aaacagaacc tctccacaga 21301 ctagagaaac agggccaacc atatcaaatg gagagagcca tggctcataa gcacttttca 21361 gcagccctgt cttcccccat gagcaagggg aagaggacac gggcttaata ggaaatggag 21421 aaggagcaag tcccgaccaa aagattccat gctgtggcca cccccggccc gccctgctga 21481 cgggtttcag gcgagtcaag tcattcaacc cccagcccct gcatacacat ggtgttcaca 21541 taagctcact cctcagcccc cagccggcag aaagccggtg tcccagcgcc acctgctgac 21601 tttccaggcc taccgcaggg tggccagtgg actctgggtg aacacgcccc agctgtggaa 21661 gaaaaaaaat gaggcagcgc ccaggcaagg aagcaagtca ggtgacgcct caggaaggct 21721 tcagtgaaga agaatgacta acaccagggc ttccactgcc ctcagcgact cttacccacc 21781 agtctggaat caggaaaaca ggttacaact gggagagtca cctagagcag acccgagaag 21841 gctgccccaa agggctgccc caagtccatt ttggtacagc tgcgtggcct tccctgtagc 21901 ctcccagcac acagacgctg gagaagacgg gaagaggagg gctagagctg ggggaaatgg 21961 aggccgtttc aaatgagaac atgacttgtg gcagctccag cccacgaccc agatggagct 22021 cacccatcct gaggacagtg cactaagcgc agggcaaagg ggcaggtgtg ggtctggcct 22081 gtcctccctt cttcttgaga acaagtgaca cagaccagct gggtttctgg ggttttgctg 22141 tgtatctttt ttaaaaccag ctatctgagg ggtttggggt aagctggagg gtagagagca 22201 accgactgag gtaagacaac ttaggcaaag gtagtctgtg attagatgac tcaacctaaa 22261 aaagaagaaa aagcagctca gcagagaagc acgggcagct ccatctgggc taatggcagc 22321 gatgggattc taccctggag gggtaaagag gaaacaaaag atgcctgtgg atcaagttca 22381 ggtcagcaaa aattcagggg gcttccacac aaacaggggc cttcctgcga ctggctgcta 22441 accagcactt tgggcctaac cttgaccgtc atttaagctg agtaaggcag agaaggcagt 22501 gcaggtcctc tgaacacaca aaccccagcc cagagggagc tgccgtcccc aacacactcc 22561 aagactcaag agggcctctc gctagctgtg cccccgaagt gcaaggttgg caggaaggga 22621 acaggagcga ctgccggagt cttccacaag tggaaaccag tggctcatcc agtgtggtcc 22681 cctggaggtg gccccgatgg acccgccttc acaaactgtc atagctccta agacctgaaa 22741 agctgggctt cttggctaaa aagcccaaca agttcaaccc aggcacgcac ctaaagctgt 22801 cgccgtcagc ccgggacagc ccattcagtc accaaatgtt tcagcgccct tcatatgtgc 22861 caggcccttg gcactgagct gaacagtctg aaggggaaga gcccaggttt tccacgatgg 22921 gcaaccctgc caagtgccac acctcagagc tgcgtgtgca ggctgccctg ggacccgagg 22981 acagcgctat gggtcagccg ggaacatggt gtgggcccct cggaacaggc tccacaggga 23041 agcctcggag attcacgaag aggaggtgcc ggctgggccg gcagctggag ggggtgttcc 23101 gcacagaggt ccccaaaatg ctcagagaat cgagttgggg gagagcatgt gttacgtgag 23161 gctctcccat gagacccaca tgactgcttc atgacagggg gaggccgaag cagagactgt 23221 gggggagccg cgtcctggag gatccatgtg atagcgagcc actggaagtg gggtgcacag 23281 gccaaagggg ggaaggcagg tggcagggag cccgcttggt ctatacggga tggtggtggc 23341 cccaccacag cagtggtccc aagggttgtg agagagaggc ttaggaggtg acatctacag 23401 gctgtttcat gggtggagtc cagctctgca ggctgaagac ttctggaggt tggctacttg 23461 acaccgtgaa agcgcctcac cctgctgggc cacacactga gaaatggcca cgatggttgg 23521 gcagtcacat gggacaagaa gaaagggcag agcagcccca ggcttctggg tcaagtgaca 23581 ggactgagac agtagtggca gaggcaggac aaaagctcag aaggctttgg ctgggaagct 23641 gggactctcc cactgctatc ccaggcagca gcagcagact atggggggcc aagggtacag 23701 acttgcttct aggtgtgatg tttcctttca ggccaggccc cctttcccaa ttacaaaggc 23761 tactcgggag ctctcaggct aacctcctat gtgttctgag cccagtcccg ctgaaaacta 23821 gtgccaagca ccaggccttc tccagaatgt gctcccctcc ttggccacta acctgctcac 23881 atcctccttc ttgatcttgc ttccctcttc cttctgctcc ccgatcttct atcgctctgc 23941 tggaggctgg aatccatcct gccagcacat tccctttgcc ctggcctcaa tgcctctgaa 24001 gccagcaacc caagctcgac tgcccggaag caccctatcc tgctcatctg ccaggcctcc 24061 cctgctcaac cctgctctcc ctgtcccctc ctttccttgc tgcccccagg cctggccaga 24121 agtcccactc tgcaaccagc cctcacacct agcacgatag tgttactcca tgggcagcca 24181 gagctccctt tccagcaggg ggctgcgtcc tcgcattccg caagtccaca gcagaaccaa 24241 gatcatctca gactcccaga gactggaaaa gcctgctgat tcaactccac ctgggcctct 24301 cagctctgtc ccctccaccc cacttctact accactgtac cactgccccc gttcaggttc 24361 ccagcaagtc tcactgacaa cctccaactt ggtctcccca cttcaggctc tcctgctcca 24421 ctccaaccca tacacccttg caaaatgtta atccacacag gtgactgcat gccagcagta 24481 ctggaatacc cactaggcag gctctctacc acgcagaaaa gttgcatacg aagtctggaa 24541 cccttaactc ctaaccatct aacctgctcg ggccatgagt acctgctcgc gccatgagta 24601 cctgctcgcg ttcaagaact gagcctctca gtgggacata aagaacatgg aaagaaagag 24661 aggtgggtgt ggtggctcat gcctgtattc ccagcacttt gggaggccga ggtgggcaga 24721 tcacatgagg ccaggagtca gagacgacca gcctggccaa cagggcaaaa ccgtctctac 24781 taaaaataca aaaattagcc aggcgtggtg gcatgtgcct gtagtcccag ctacttggga 24841 ggctgaggca tgagaactgc ttgaacccag gaggaggagg ctgcagtaag ccaagactgt 24901 gccactgcac tccggcctca gcgacacaga gagactctgt ctcaaaaaaa aaaaaaaaag 24961 aaagaaagaa agaaaaagaa aaagaaaaaa atcaacagca acaaaaaaga aagagacaga 25021 taataggagt ggcatgggtg ctccaagagg atcaggaggc ccaaagaaag ctgactagct 25081 gaggccactg tttatgacat cagaaacaga gctgcaggct cgacatccac caatgaggaa 25141 ttgggtagac actcaagaac actcaagaac gctggagagg ccaggcacag tggctcatgc 25201 ctgtaatcct agcactttgg gaggatgagg tgggaggatt tcttgagccc agcagtttga 25261 gatcagcttg ggcaacagag caagactctg tctctacaaa aaattaaaaa attagcagca 25321 cgtggtggca catgcctata gtcccagcta ctcgggaggc tgaggcaaaa gggcggggct 25381 gcagtgagcc atgatcacac caatgcactc cagcctgggt gatggagcga gaccttgtct 25441 caaaaaaaat aaataaataa ataaataaat aaataaatat gctggaaaca ggtcagttgt 25501 cccagaaaaa cattcatgat aaactgagta gaacactcaa gtcaccaagg ggcatttaaa 25561 gcatgtggtg cttaaaagcc ccgtggttaa ctttttttaa acacgggaat gtttttaaaa 25621 agcatgtgga ggctgggcgt ggtggctcag ctgccgcact ctcccgtgtt cccatccagt 25681 agcctgatcc aaaaaagcca tgaggttggt cttgcgtgac ttcttagaaa aggaaacggt 25741 gatcccagag atcagtgtgg attcaccagt tgcccataag cgatctagtt aatcatttct 25801 ggaattttgc cagaaatata tactccttgc tagtctaaga gttaaatcta agatggtggc 25861 tgtgacccta gaggaccttg gctttctgtg tgatgctctt gtccagcctt atggccacac 25921 agcccatttg cagcttgcag gaaacactga aaaaacaaag caggccaggc gcggtggctc 25981 acgcctgtga tcccagcact ttgggaggct gaggcaggca gatcacctga gctcaggagt 26041 ttgagaccag cctggccaac atggtgaaac cccgtctcta ctaaaaatac aaaaaattag 26101 cagggcatgg tggcgggcgc ctgtagtccc agctacttgg gaggctgaga caggagaatg 26161 gcgtgaaccc gggagacaga gcttgcagtg agccgagatc gagccactgc actcctgcct 26221 gggcgacaga gtgagactct gtctcaaaaa aaaagagtga gccccctaag gcttctatgt 26281 gtatgtgtgg atagagccaa tccacatggc agccactcat gtgggttagg accacaccta 26341 actacactgt tctgccagca ccttccccac acttcggcat ctggtcgtta agaacagact 26401 cagactggca agcatcatgc taaaaatcca tctacactcc atctatagaa ggctgacaga 26461 acttctccag gaattcacag cagtcaccaa tgatcccgct cttcttttcc aaaagcatac 26521 aagctattct ttctagagca tggctggaga tcaaagtcaa ccctccccag gtcaattcct 26581 catctacctt ctcctccctt tggaaattgg gatgtttgcc aggctgcagt ctgccagtat 26641 ctctcccact ctgcaaatcc tcaagcatcg gtggcaatgc cttctgcaga tgatctcaac 26701 agcttctgga cataatctca tcaagtacat cctttaggga tgccactctt gcccacctgt 26761 aatttccatt ttctcatcct agtgtttcct ttgcccttga ccacctaccc tccatctcct 26821 tccttacaca gacatataaa agggactgcc ttctcctcac ttctccatgt ctttcatgat 26881 tcagagctcc cggcaggttt tcaccttccc gatactattc tttttttttt tttttttttt 26941 ttttgagatg tcacccaggc tggagtgcag tggtgtgatc tcggctcact gcaacctctg 27001 cctcccgggt tcaagcaatt ctcctgcctc agcctcccga gtagctggca ctacaggcac 27061 ctaccaccaa gcccagctaa tttttttgta ttgttagtag agactgggtg tcaccatgtt 27121 ggccaggctg gtctcgaact cctgacctca ggtgatccac ctgcctcggc ctcccaaagt 27181 gctgggatta caggcctaag tcaccgcacc cagcccaccc ctcccccata ccattcttaa 27241 agctctagat gcttttctgg tagatggcct agtttcctct ctttccctct caatgtggtc 27301 ttttttcctc attttgctga ctgtcacact cagaattctg ggtttgagac tcagtctttc 27361 agatctgctt ccctttcagc ctcagaccac aagataatac ttgtttggaa cttcctgaaa 27421 aatttagggt atgtgtctga ctcctcccag ccttcctgac tttcctaagt ttgaagacag 27481 caagcttgta gatcaaatct gtgatcaaac ccattatctt gaaaaaaatg tgtttgcctt 27541 ttctagctcc acccctcttt ccaacttggt cgcagagagt accagatcat ctaaacaaca 27601 gattttaaga caagtagtca tcgtagcgcc tagtaaagca ggacacacca ggtgactaga 27661 gagcaagaat ctcctaggca tggagattct tgagtctcgg ggcacaaaac caagtgggga 27721 ataactgtcc atgagcctga gaatcacttg gtgctatggt ctgagtgccc ttcaaacttc 27781 atgtgttgga atttaatcct cactgccgta gcattaagag gtggggtctt ttgagaagtg 27841 attaagtgat gagagctcca ccctcaaagc aagcgccttt ccaatgcctt catacatggt 27901 ctgagctccc atccacctcc cagccaggcc ctgctgatca gaacggctat gtgaagcagg 27961 aggcagcaaa cagggcccca ggctcaaata ggcacttcgt agtggtctag ttttgcccga 28021 ctagttaccc ttagccttga ttaaggtact tagttttacc aaaaaaatca tcagaaatac 28081 tctggctgcc atggaatgta acatgtcctc attacgagtt tcacgtgggg aaggccctga 28141 ggtgaggaga ggcccagcct cttcgtgcca cttttacctg ctgtccctag gtcaacaccc 28201 cggacacaaa gagtccccca ttcagtcgct cccttgtgag ctggactctg aaggtcctct 28261 cccagaggag ggcaaggcct taccgttaca tctcactctc catgcaaaca gaccgtgaga 28321 tagtcatctg tttgcctgag agtatgtggt gtgtgagggt cttctgatat ttcaggcagc 28381 cctctcctac tctccacgct gcctctggag gtcaggagaa aactatgtgg cttccctaac 28441 acagacaggg ctttgggatc caggccacag actgaggagg aatcaaagtc acagggggga 28501 gactgagccc actgagctcg gtgtccatgg ctagcccatg gctcctttca cagcacctgc 28561 cccctcaacc ccccactcac aaggacagca gcaaggaaaa gccagcaatg tagagattcc 28621 tctgggcacg gaaaagcttc atgtggaagt gctccatggc cccgggattg ttctggaggt 28681 tcaccttttc cgtcacatca tcatacttcc gaatttcgcg cacggcatct gtggtggagc 28741 agagaggaga cacgggcatt tagcggacac tagggcaaga taaggccata ccaggcagac 28801 aggcggcatg agacaggtca gactcacttc cctcaaatcc gcctccccag ttcttcccac 28861 ttctatgcac atacacccct catgggatgc tgttatcaga agctcccacc ctcaggccag 28921 gtgcagtggc tcacgcctat aatctcagca ctttgggagg cagaggcggg cagatcacct 28981 gaggtcagga gttcgagaca agcctggcca acatggcaaa accccatctc tactaaaaat 29041 acaaaaaaaa aaaatttagc ccggcgtggt ggtgggcgcc tgtaatccca gctactcggg 29101 aggctgaggc aaaagaatca tttgaacccg ggaggcggag gttgcagtga gccgaaattg 29161 agccactgca ctccagaccg ggggacaaga gcgaaactcc atctcaaaaa aaaaaaaaag 29221 gaagctccca ccctcagacc cactccttcc tccagctttt caatccctgg ccccattcta 29281 cacaggcccc aggcttgctc tgcagttcac gatgctaacc aagcctaaag agtttctgga 29341 aatgggggtg gcttcctttt cccatggtac cctcaccgcg tcttgttctg ttcatctcaa 29401 gttggcccat agccccccaa cccccacaat gataagttga acctggagat ggaaatgcct 29461 gctccgggga cctaaggtgc ttaagagtca agacccaggc cagatgtggt ggctcacgcc 29521 tgtaatccca acactttggg aggccaaggc aggaagactg cttgaggcca acgagttcaa 29581 gaccagcctg ggcaacacag ggagactctc tctcaacaaa aaaaaaaaaa aaaaaaaaaa 29641 aaattttaaa tagctgggca tggtgctgca cacctgtagt cccagctact cgggaggctg 29701 agacaggaag atcgcttgag cccaagaggt caaggctgct gtgagttacg atcacaccac 29761 tgcattccag cctgggcgac agagcgagac tctgtctcga aaaagccaaa acaaaacaag 29821 caaacaaaca aaaaaatagg agtcaagacc tgcaggatac cagctaggag tcaagggttt 29881 catttcaaag ctttacaatg ggttttatta tctataatgt tttattctga ttacgaaagg 29941 aattcacact catcgaaagt tcaaacacac ccaaaatgta gaaacggaaa ggatcccata 30001 acataatcac atctagcagt atatacaata aaaatgtggt cacatctaaa gtcattttta 30061 gtcacattac actgttacct gcttcgtaat agacatccat ctctgaaatt ctctttaaac 30121 acattttttg ggtccatctc tcccaaagta gactacaaac ttcataagag cagaaatctt 30181 ttatactata ttcccagggc ctgggatagc tctgggtgca gagtgggcac tgaaataatt 30241 gctgagtcag tgagcagcct ctgctcctga aagagccgct gttagacata tagagctcca 30301 gttaaaaata tatttatatt tgtctctaat gtagaaattg agccacgcca aacctactgc 30361 ttttccccat gcatcctgag acatcgtttt aagtcaatac actaagatgt acccaaagaa 30421 tgctcttaat agactcccat atgagaggag caaggtatca ctgtatgggt atcgaggttc 30481 tttttgtatg gtgggttaca gggtctcact ctgtcaccca ggcttgagtg cagtggcacg 30541 atcccagctc actgcaacct ccacctctca gcctccccag taactgggac cacaagtgtg 30601 tgctaccaca ctcagttaat tttttctttt tctatttttg gtagagacgg ggtttcgcca 30661 tgttgcccag gctagtttca aactcctgag cgcaagcaac tgcctcagcc tcacaaagtg 30721 tggggattac aggcgtgagt cagggcactg gcctcatggg gttcttaatt tccggtttta 30781 ggctgcttag tttttatgtt gatcttgcat tataaaaatc tgcccatccc tcagcaccac 30841 gaagcgagta ctctcaaacg ctgttagcag gggtaaaaac agcacacctg tataaaatgc 30901 aaaatgttca taccctcggg ttcaacaatt cacctctgaa attatcccac agaagtattt 30961 gcataaattt gcaatgattc gaagatgctt ataaagaatg gattacggca caaccacatg 31021 agagaatact acatgatctt taagaagaat gagattgcta taaaaagatg ttggccggac 31081 acagtggctc atgcctgtaa tttcagcact ttgggaggcc gaggtgggcg gatcacttga 31141 ggccaggagt tcaagaccag cctggccaac acggtgaaac ctcgtctcta ctaaaaatac 31201 aaaaaattag ccgggcatgg tggcgggcgc ctgtaacctg cagctactcg gaagctgagg 31261 caggggaaat cgcttgaacc tgggaggcgg aggttgcagt gagccaagat ggcgccactg 31321 cactgcagtc tgggcaacaa agtgagattc catctcaaaa aaaaaaaaaa aaaaaaaaga 31381 tgtctaacgt gcagcaggta gaaaaagcac aatccaaagt attggctagg ctctgctatg 31441 gacagcagtc ttatctcaga gctgtgctac cagctcaggc gtggaatgct cctcctgcca 31501 tgtcccagca ggaatgtgga aagaaacata aatactttgt gtgtatgatc ccaccgttgt 31561 agacatttta ttaaagatgt cagcatacaa aacccctctg gaaatgcaca acaaaatact 31621 catgatggtt acctgtggga aatgggatta tggggagttt aactttctaa attcaacttt 31681 tgaattttta agaataatta cccattattt ttgtaatcag aaaaacaaga taaacatgtt 31741 tccttacact gtcacctgca tccaggaacc ccaccatcct aaggaccacc ttggccaatg 31801 accaatagtg gggatggggg actgagcaat actaacccaa gtcccacagt agaaaccagc 31861 cctccctgtc tcgggcctca gcccaggtgg gaagatcctg caccctcagc aactccatcc 31921 tccaggagga tctgctcaac cctccccacc cccaagcaag gccacagctc tagctagtta 31981 tgccccttac ccttcactag tacctgtgac taacacttct gtttgaagaa ggctctgcag 32041 atctttctag attatgcttc ttaacatggg agtaggaagc agccagtcat catggccata 32101 tgtcccccta gaacttccaa gcagttcccc gatcccaaaa aagaatgaag gctttgtcaa 32161 ctactggatt ccgcaaagga tctctccagc ccactcagct gctctaggaa agaaacatac 32221 tccttctggt ttggggaagg ggcaagaccc tgagggaatc atcctcagca tggtccctcc 32281 cctccccagt tattaccccc accccagacc caaaagccaa aacacacatg ctcagttcaa 32341 aaactcagtg gcctggggat gacacatata ttacctacct actcctggct tggaggctgc 32401 tcagggcgag aaccctggtt agaaagagct actacctctt ggggccttgc caagtatttc 32461 aagaacactg aggtgctcag gcagctggca cctaccctat gtggttggaa gctgtcaggt 32521 aacacttggc ctgttcagag gcagtccagc ctgaacatat cctatcaggc tctgagacat 32581 caggaaggag aaatgcctaa tgctgaagct ggtgggtgag gctttgcaca ttcccctgaa 32641 ccacagcccc ccacccgctc agtgtttctg ctgggactcg gtgggaggag tgttccacgc 32701 ctaggcaggc accacagctc tgagacaaga gtggtgtcca cagcagagcc aagccaatac 32761 tttggagccc aaacaggggg acatatcgat cacttaccac agcaaaacaa caaacatcta 32821 ccagtcagac aagagcttta ttgattcttg tctccagatg gtcaccctac cacctcattg 32881 caaaatgtct gcctttcatc ttaccaccac ctggcctggc cttatcccag agcctgcttg 32941 cccttggacc agtttatctt tcaagcacct gctagtgtac acctcaatgt cagagtttac 33001 agaactacag agctggaacg acctgggaaa acctcaagtc tagcccagat gcagagaggt 33061 ctaggatctc tccccaagat atgctgccac acatctctgt gttctcctcc tactactaca 33121 gggaatggca ggacttcact gtgtgcttgg ttgctcagtc ccccattaga ggccctgctc 33181 caggcaattc tgctgtttaa gtgactggtg agcagcacta ctcagaccaa ggtcacaggc 33241 cagttagctt ctcttggctt cagttctaga gtgacctgag gagcagcctc agaaacctta 33301 ggcgtccctc cttccaaggt cttgaaaaaa agcaatgtaa ggtggccgtc ataagctgca 33361 tacaaactgc ttggtataag cccacgccca ctgctagagg ggcctctttt tttttttttt 33421 gagatggagt ctccctcttg ttgcccaggc tggagtgcaa tggtgcgatt taggctcact 33481 gcaacctctg cctcgtgggt tcaagcaatt ctcctgcctc agtctcccaa gtagctggga 33541 ttacaggtgc ccgccaccac gcccagctaa ttttgtattt ttcgtagaga tggggtttca 33601 ccacgttggc caggctgatc tcgaactcct catctcaggt gatccaccga cctcagcctc 33661 ccaaagtgct gggattacag gcatgagcca ccatgtacgg cctagagggg cctcaaagtg 33721 aagaaccgac tagcggtcag cagcatgggc aaagggagcc tcttccctcc ctcaagagaa 33781 agacacagca tttcattggt ctgtctccta gcagccaaaa ctggatgcta cacatcaaaa 33841 gtggcaaagg gttttgcagc agagaccagg gtctaggtca ggtagctgcc ctcagccata 33901 gctcactcac cgatgaccaa cagcacaagg atgacaatga gaaccacaaa gaaggtgttg 33961 ccataggaca ctaacaactc caccagccgg gacttgaaaa tcttctgcca tctgtgaaga 34021 agacaaaaag gacaggagtt gaagagaaag cacacacacg agctctaggg ccctgagcaa 34081 aatggaaatc cagctttgta ctcttctcca atggccgaat agcccatgca aagtcagcct 34141 cagtggcttg caattctcta atttgacatc cattcaagac tattggaaaa aaggccaggt 34201 gcaatggctc acacctgtaa tctcagcact ttgggagacc aaggtgggtg gatcacttga 34261 ggtcaggagt gcaagaccaa actggccaac atggcgaaac cctgtctcta ctaaaagtac 34321 gaaaattagc caggcgtggt ggtgggcacc tgtaatccca gctactcagg agggtgaggc 34381 aggagaatca cttgaacctg ggaggcggaa gttgcagtga gccgagatcg cgccactgca 34441 ctccagcctg ggtgacagag cgagaccttg tctcaaaaaa aaaaaaaaaa aaagatgctg 34501 gtccacgtat agatcaaaat gtctggaaaa ctaatctttt tttttttttt ttgagacaga 34561 gtcttgctct gttgcccagg ctggagtgca gaggtgtaag ccaccgcgcc cagcctggaa 34621 aactaatctt aaacgtgggc agttggctgg gcgcactggc tcacgtctgt aaacccagca 34681 ctttgggagg ctgagatggg cagatcacct gaggtcagaa gttcaagacc ggcctggcca 34741 acatggtgaa ctcccatctc tactaaaaat atatatataa aaaaaaatta gctgggcatg 34801 ttggcaggca cctgtagtcc cagctactcg tgaggctgag gcaggagaat tgcttgaacc 34861 cgggaggtgg aggctgctgt cagccgagca ccacaccact gcactccagc ctgggtgaca 34921 gagcaagact ctgtctcaaa ataaaaacaa aaacaaacaa aaaaaccaca acaacaaaaa 34981 gaaaacatac tggtagtctc tgaaaaggga aacttacttt tcattgtact cccctttgta 35041 atattcaaat gtttgcccca tacacgttat ttctttaaac tgtctttcca catgataaac 35101 cttggtttgt ctatatcccc cgaccctgtt atgatgagat ggaccccaga cctcacatca 35161 tttccatcca caaatgcttt gggatgcagc tctaagaaga gacttcagaa gtgtttacat 35221 attgttcaaa tcaacaagaa gccctcatca tcagggctag agttgaatat gccaatcact 35281 gaaggcaatg atatttgttt gaatccggac ccaacaaggt accgatttgg cattcggtag 35341 ataaacgtct taaatccagc ttattctgga agctccccct tccctctccc tctttcattt 35401 gcttttccag tttcctttgt gtagaaatca gattgtctgt cctctaaagc ttcccagatt 35461 aaagctcccc ggggccctct atctctgcac gtcttgtcaa ctgtacgaag gtgtggtttt 35521 ctacaagaat cctcctcaga cagcactgca tactttctat ggtatcacat caaagataat 35581 ttttcacgaa aaaacggctt ttaaaaatac ctgtggagat ttggccttat ttgtaatatc 35641 gtaatttaaa tagggatgtg ttcaggtatt atacataatt aaaattaatc tccaattcac 35701 ttttaacagg tcctctgttc attaaaaggg aatatggctt catcctccac aggctgagta 35761 atgttcatgc agccactgcg caataccagt gagactagtt aggaggacaa gagtcaccaa 35821 atcagcaaca acgtgaccca gtggtgcaag tcttccgaat cccacgctac catcccagca 35881 ccccgtggac caagagtggg tgggagggca tcatttccag aactgtaaca tagatcttcc 35941 tgcctctcgg gggcattctg aagaccagaa ggcaggggac acaggaaaag gaactgagca 36001 agtcatgacg aagcagaacc ctgggaaggg ttggcaataa caaacacaac ccctgccacc 36061 cccaccccag acaggttcta cctgttccat cgggtcccaa ttccagggtc caccagctgc 36121 acacaccagt cccagggcta gggcacaggc accctcctgc ctaactcgcc tgcttgctcc 36181 ataggccata ccttttagga gaaatgaagg gaatgcagag aagcaacaca acaaagacct 36241 ccgcatagag gaaggtggca actgcagtcc actgcagact catcctgttg ctagaaggtt 36301 tcccacagga agatgtgagc ttgtttcctg gcagggcaca aaaggtacgg gacttgtaag 36361 cgctgggcag ccaagaacat ccagttcagt taccctgcga ggctggggcc gccactctcc 36421 accctgaccc cctgcacttt gcttcaaagt ctctgatgcg ctggcggaag cagggctcca 36481 ctcggcaggg cctcttggcc agcagcgttc acaggcccca caaacttccg ttcgctggtc 36541 aattacctgc cccctccagc acgtgtgtca acacgtccag agcggcctct cccgacgatc 36601 cctgccccag gaagcccgag atttccgacg cccgtttaac tgaaaggcgt tcttcgggaa 36661 gagcagtgcc agggcaccaa gaggaggacg cctcggcacc catcgggcgc tcccttcccc 36721 gccaggcaga actcagccgc agaggcgggc gctctgcggg cccaaatccc cgctaccagg 36781 caggcccaag gccgcaccta gtccaagcgc tgccgacgcc cccgcctccc accgtcccca 36841 cggcgcccgc ggagagaaac cggcacctcc ctcgaggatc cagcggcctt ccgcccgggg 36901 ccgcttcgcc tccggtgggc tggaggcccg caagagcgac tcctagaggg caggattcgg 36961 gaccaagcgc aaaggcaggt ctcgaccaag cacctcaagg ccccatacgg agaaagttct 37021 agacgcagta tcctcagaag ccaggggtcc ttacagtagc cctcgcgggc cccagcgccc 37081 acccagagcg aggggcctcc gacttggccc cggcctggca caccgtcccg gaggcccatc 37141 ccggccgctc ctccaggtgg ggcttcaccg ccccccgccc cgcccccgag accagcttct 37201 agaggcgccg cccggtttcc cctcgcccct gcctctcaca cgcaggtagg ctgcgggccc 37261 cgagattccc cggcccccgg gcctccccgc gccgctcgcc tctctccctc gtcgatgggc 37321 cggggagcct ccgcggtccc ggagcccagc ccggcgcgcg gagcccgctc accgagtttc 37381 ccacagtcaa cgtgcaggcc ccgccgcagc aacagaactc tcccacagca gccccggccc 37441 cgcccctcat accgcggccg gaaaccggaa gcgcccgccg ggcaccgccc accagccctc 37501 gcgaggcccc ggaggctccg cccacctccg cttcccaccc cgccccggag cggagggccg 37561 gcgctccgag cgggagagga agaggcgcct cgggctccgg gcgagcaggg cggggtggag 37621 cgagcacgcg ggcggggcgg ggcggggctt tgtcgggccg gcgagggccg cttctctagt 37681 ccgcgcggcc gtcccacgtc tctgtggtgc gggaggggcc ccgccgaggg gcgagaacgg 37741 gaggtggggg tgtgggcggg ccccgccgag gggcgagaac agggtggggc tcccgcgccc 37801 ggactccgcc cctcgcccct cctccgcctc ctccccttcc cccgactcgc ccctggggaa 37861 gagtgggtgg ggattctggg ccggtggagg agtcactgtc gcttcagcca ggctgcggag 37921 cggacggacg cgcctggtgc cccggggagg ggcgccaccg ggggaggagg aggaggagaa 37981 ggtggagagg aagagacgcc ccctctgccc gagacctctc aaggccctga cctcaggggc 38041 cagggcactg acaggacagg agagccaagt tcctccactt gggctgcccg aagaggccgc 38101 gaccctggag ggccctgagc ccaccgcacc aggggcccca gcaccacccc gggggcctaa 38161 agcgacagtc tcaggggcca tcgcaaggtt tccagttgcc tagacaacag gcccagggtc 38221 agagcaacaa tccttccagc cacctgcctc aactgctgcc ccaggcacca gccccagtcc 38281 ctacgcggca gccagcccag gtgacatgcc ggtgctctcc aggccccggc cctggcgggg 38341 gaacacgctg aagcgcacgg ccgtgctcct ggccctcgcg gcctatggag cccacaaagt 38401 ctaccccttg gtgcgccagt gcctggcccc ggccaggggt cttcaggcgc ccgccgggga 38461 gcccacgcag gaggcctccg gggtcgcggc ggccaaagct ggcatgaacc gggtattcct 38521 gcagcggctc ctgtggctcc tgcggctgct gttcccccgg gtcctgtgcc gggagacggg 38581 gctgctggcc ctgcactcgg ccgccttggt gagccgcacc ttcctgtcgg tgtatgtggc 38641 ccgcctggac ggaaggctgg cccgctgcat cgtccgcaag gacccgcggg cttttggctg 38701 gcagctgctg cagtggctcc tcatcgccct ccctgctacc ttcgtcaaca gtgccatccg 38761 ttacctggag ggccaactgg ccctgtcgtt ccgcagccgt ctggtggccc acgcctaccg 38821 cctctacttc tcccagcaga cctactaccg ggtcagcaac atggacgggc ggcttcgcaa 38881 ccctgaccag tctctgacgg aggacgtggt ggcctttgcg gcctctgtgg cccacctcta 38941 ctccaacctg accaagccac tcctggacgt ggctgtgact tcctacaccc tgcttcgggc 39001 ggcccgctcc cgtggagccg gcacagcctg gccctcggcc atcgccggcc tcgtggtgtt 39061 cctcacggcc aacgtgctgc gggccttctc gcccaagttc ggggagctgg tggcagagga 39121 ggcgcggcgg aagggggagc tgcgctacat gcactcgcgt gtggtggcca actcggagga 39181 gatcgccttc tatgggggcc atgaggtggg gcaggttggg gtgccgggca cggagggaag 39241 cgtgtggcag ggaggcccgg gggcaggcag ccgtgagcgg tggggacagt ctggggcggg 39301 ccggggctga tgccaaaggt gtgggcaggc catgggagag ccgggctggg gtgggcaggg 39361 cctttgggca gccgtggact caggcgcggc agtggagagg caggaaggct gggtggggac 39421 tgtcctgtgc tggttgcctg cacgctcgag gccacttctg cttcctctcc tcctcaagga 39481 ggttgtcctg gccttagagc tgcgatccta gcggtttgag cctcgagagc tcctgcccgc 39541 cccactcctg cagccagcca ggaggagacg cctgccattc atgagcgggg accgagggac 39601 gcagcctgct gcctagcccc tcctgggccc ttgggccctt tgaaggccgg cgtccagcag 39661 agctggctgg ccaggcaggc cggcattatg ggcatgactc agcccaagcg gaggttaatg 39721 agcagcgccc agcacagcag tgggacttgg ggtcaaggcc acccccgcct gagcccacag 39781 gcctgcctgg ctaccaactg gctcagctgc cttcccgggc ccccagacca gagatgccgg 39841 gcccaggccg ccactgcggc gggacacact tcctgctcct ggcgtggctg tccttccaac 39901 cctgtctgtc tcctggctcc ctggtgggcc ggggccggcg ggccagacag gcgctgggaa 39961 aggttactcc aggtcaatgc tccctttatc tcgcctcagc ccctcccctt cctcgtgcct 40021 ggggttgcag ctgccttggg ccctgctctc tgcgactcag tttcggttcc ccagtgcttc 40081 ttgggaggag gggcacaatg gcatccatcc cccgaaggcc tgtgtgtgct ccctgggtca 40141 ggtggcctct tgccctggga ccttgtctca ctggctgtgc accagcagag aagcaggctt 40201 gtccccgaat tccacggaga ggggcatccc gggtgtgggc cagactgcag attcagagaa 40261 aaggcccctg gacttcagcc accaccctgg cttccctctc ctcttctccc gcatgctggg 40321 ctgcagggcc ttggcaagca gctgcagcct tgggcgaggc gcttggcaca ttccccgcag 40381 ctacattgtc agccttggct ggcacccctg ccagctccca gcacgagtct ggattgccag 40441 ggtgcttgct tcaggaatgg gagatcgggc ttgcagggag ctcagctgtg caggccgacc 40501 tgggtggcgg gggcagaaga gagatcactg gttctttgaa ggccttcgtc cgggctagct 40561 tcaggaagta gagagattac tggttctttg aagggctagc ttcaggaagt agcacgttgg 40621 ccaagagggt ttgttggcca gggcaggagg gcccggtgtg cattcacggc ctgtctgcat 40681 aggcctcggc tggaaagctg tgtggggtga gaggaccctc gggtagcatg tggcccatgg 40741 cagtcccact ggctgatgtc cccgtggaca ctggcctagg ctcagatcag ggcagaagca 40801 gctgactggc tggagggcac atagcagagt attctgtcct ttctgggatt gccgcgagga 40861 ggcttattga aacgcacatg cacacgatct cttgtttgag aacaaagtaa agctctcttg 40921 ataagtctaa gcatcattta caagaatgtg gttcttctgg cagctctgcc agtgtggtcc 40981 aaattcccac actggttgcc accatcaggc tgtatgggct tgggcaggtc actttgcctc 41041 cctgggcttc agtgttctgt gtacagtggg cacgtgaata cacatgcagt gggatgtggg 41101 gagaatcaaa tgcagagacg tgaaagcact tggcaaacag tggcaggcta ggcagctgtc 41161 atcagacggc tgtggggagg caaggctggg gtgcgtgccc ctggactgag cccagaaatt 41221 cacaacccct tggctacctt gctgggagca cttcccaaca cctcccatct cccccgcctg 41281 tctgactgct gctgccacct cttccctgcg gctgtccctt tccccacctg ctggtgtcct 41341 gacatagtgg tggccagtgc aggaaggggg acagaaggac aggggaggcc tacccactgc 41401 tgagggagca aggtccaggc tcagcagtgg ggacatccac tggggcgagt cctgtagggc 41461 ccagctcagg agcgcatcct cctggtttca gtgggaaaat gccatgcaaa tttctcacca 41521 aaaggaagtg tgggaaagtt gaggggaaag agggcgtaaa taggcccaga ctgttgaacc 41581 gagtttttca gaaccaagag acagggtttc actgtgctgc ccaggctggg gtgcagtggc 41641 gccatcactg cagcttcaaa ctccagggct cccgcgatcc tcccacctca gcctcccgag 41701 tagctgggac tacaggtgtg tgccaccaca cctggctaat tttttttttt aattattttt 41761 ggtagagaca gcatctcgcc gtgttggcca ggctggtctc caactcctgg gctcaggcaa 41821 tcctcccacc tcagcctctc aaagtgctgg gattgcaggt gtgagctact gcactcagcc 41881 caagagcact gccttttgct gtcctgcgct gctctttgct tagtttaggg ggggaaatta 41941 gagctgatgg atgatctctc cgagccagga ggagggggct ggcagggagc ccaaagaaat 42001 gggctcagca gaggacagaa acaaggtgac tagagaggga gtggagaggg gacgggagcc 42061 gcactgtgac atcagccagt cccttatacc cccttacacc ttgagtttga gacctggccc 42121 cacccaatcg taacctctgg ctctcggcct tctgatggcc accatggcac agcgtgtgtg 42181 agtggcactg ggagaccctg accatcgccc ccacgggagc tgcccctgtg catggccagg 42241 aagcctctct gtgtctgtca ccccccgcag gtggagctgg ccctgctaca gcgctcctac 42301 caggacctgg cctcgcagat caacctcatc cttctggaac gcctgtggta tgttatgctg 42361 gagcagttcc tcatgaagta tgtgtggagc gcctcgggcc tgctcatggt ggctgtcccc 42421 atcatcactg ccactggcta ctcagagtca ggtgagaccc agggctccaa gaggatccag 42481 gccaggggcc tgtcccccat accgctgggt gctgagctca cgagggccca actcagccag 42541 cccgccgccc acttctgctg ccggggccac cgaggccctg ctgccagcct tgatgctttc 42601 agaggttgag ctcgccttgc ccctccttgt tgccttttgc cctgcgcgca cctcacgccc 42661 tttgttacca ctcagacaag cccaggactg caagtcagga acactacatg tcacttctcc 42721 aggcacagat accctcaccc acctgtccct gtcctcaacc caactgcgac ttagaggtgg 42781 gagaatgtgc ggagtacctg gagtgagccc cttcatcact ccagccttgg tttcctcgcc 42841 ctgaaacagc tcttgcaccc taaggtgttt tagaggagtg ggaagcctgt tctacctgtt 42901 attttagtga taagattaaa tgtttaggtt tctgcattcc tgttgttttt gtttgtttgt 42961 tttattttct ttttggcttt tgaaacagga gtctgactgt cgcccaggct ggagtgcagt 43021 ggcgcgacct tggctcactg cagcctcaac ttcccaggct caagtggtcc tcccacttca 43081 gtctccccag tcgctgggac tacaggcaca agccgccgta cctggctaat tttaatattt 43141 ttggtagaga cagggtttcg ccatgttgcc cagactggtc tcgaactcct ggtcaggtga 43201 tcctcccgcc tcagcctccc aaagtgctgg gattacaggt gtgagctgcc gcgcccggcc 43261 tgcattcctg ttttgttcct ctgccggatt gcaacttctg catttgattt ttagtccatt 43321 agtggtcacc ctttaaaagg tcaacacgca gccgggcgcg ggggctcccg cctggcatcc 43381 cagcactttg ggagggtgag gcgggcagat cacgaggtca ggaggtcgag accatcctgg 43441 ctaacagggt gaaaccccat ctctactaaa aatacaaaaa tttagccggg cgcggtggcg 43501 ggcgcctata gtcccagcta ctcgggacgc tgaggcagaa gaatggcgtg aacccgggag 43561 gcggagctgg cagtgagccc agatagcgcc actgcactcc agcctgggcg aaagaacgag 43621 acttcgtctc aaataaaaat aaaaataaaa aataaaagat caacacgcat acttgttagg 43681 ttaatcaata gctccgcctt ctcctggacc ttagaacgca gaagcagccc cgtggggcat 43741 ggcatcatag cccagccaag agttggattt gtacctagtt tttcctcttt gctatcgcct 43801 tatccacacg tgttgaaact cactcgcatg tttgctgagt tctgtgctca ccatttcgtt 43861 cctacatctc aatttctctt ctatgttctg ttatcttcct ccagatagac atgtttttag 43921 ttttttgttt gtttgttttt taactcacaa ctatgctcag gacaaacagt catcagtgct 43981 tctagttgtg gttggttttt aaaaaaggtt ctttcagcaa gggttttgtc atgggaaatt 44041 cttgattatt ttattgtaat tctttgtgtt gtttagtttt ttgtttgttt gtttgttttt 44101 tgtagaggca ggttctcgct gtgttgccca ggctagtgtg gaactcctgg gctcaagcgg 44161 tcctcccgca ttagcctctc aaagtgctgg gattacagac gtgagccacc acacccaggc 44221 tgttgtcatt cttgattatt tttgttgaaa atgtttctat ttcaccctta ttcccaagct 44281 aaagttgagt tgcgcaatgg agtctaggtg gactgttctt ttctctcagc cctttcacaa 44341 tgtctcattg tctccctgct tcttctgtag ctgctgacaa atctgtagct gctaacaaat 44401 tgtcctccct gagtaccctg tgtttctttt tttctgggta ctttgatcac tttctttctt 44461 tttggcattc tgcagtttca ctatgacatg tttaggtgtc agtttttgtt gattctgctt 44521 gagacttgtt atatgtcctt tgaatatttg agtttttcaa tcattctgga agattctcag 44581 ctattagctt ttcagatatt gcctgactct cattcttttt tcctgttctt ctagaactgg 44641 cagccagtgc tggacttctg actctcttcg ttgtgtcttt cggtttctgt ttactgtttc 44701 ccatctcaca ctgtctgtgc tgaattctgg ggatgccctg aggccttcct gtttgctcat 44761 tttccagttc accagttttc tctttggcca tgtctaattg gaattttgtt cacatttcag 44821 tggctgtttt gtaagttcta tttcattcta tttcaaatag gaatttgaaa atggacattt 44881 cctttttcaa gtagaaatgt caggctgttt ctgtgtctta tttgctcatc tttttgactg 44941 ctccttttat actgtgtatc aggttagtca tacttacccc accgtctcta tctgatggtt 45001 tggttagctt cagttcttgg gagtatcatt cgcctgtctt ctgtctgctg gtcttcactc 45061 atggtggatc atttccttgt gggttttgca acagaacttc tatggggatt ggtgtggcct 45121 caatggaggg tgtaagtcca aatgtccttc tgttttggcc aagcactcca gggtaagcag 45181 ttggaggcca ttagttagtg cggaggggat ggttcctgaa ttgagaggtg gtttcatttg 45241 aattcatttg aactccaggc acattgttac aaattttcag aggaggcttt ttgttcattg 45301 ggcatagagc ccaggctaag caaggaaagc ttcctggtct ttgccctgcc tgcctctggg 45361 gctatcaaag gatcacctgg ctgtctgctt ctctttgctc ccagaagttt cccttgcttt 45421 ccgtcaagct tggctctgct cgaaatagtg tcgtagttta cccagtttgg tgggaagagg 45481 gttttcgatt tataaacaga agtctgattc tctccctgtt cctgtttcca cctcggggct 45541 tccatgccct caactgtctg tgtcatcctc tgggtgactc accaagagga aggatctaat 45601 tcggggactg cacatcaagg gcagccttga tgctggcgta ggtgcccagc ccagtcagcg 45661 cacctagccc cgggcaccga gacagcccgc gagacagcac ctgcagccgc ttcgctccat 45721 ggctgccatt ggtcacatcg ggctgctccc tgccctgaag gcttaggact tttctgagcc 45781 tctgctgtgc caggctttgt gggctcctgg ccctgctctg gtggtcctca gagccagcag 45841 cctttgaagg ctgtgaagca ttccctttcc cctggcttga taggctctgt cctgtggccg 45901 tctgtttttt tttggagggg gagcttttga gatggagtct cactctgtct cccaggctgg 45961 agtgcagtgg cacgatcttg gctcactgca acctccatct cccgggttca agtgattctc 46021 ctgcctcagc ctcccaagta gctgggacta caggtacacg ccaccatgcc tggctaattt 46081 ttgtgttttt agtagagatg gggtttcgcc atgttggtca ggctggtctt gaactcctga 46141 cctcaagtga tccacctgcc tcggcctccc aaagtgctgg gattataggt gtgagccact 46201 gtgcctggtc ctgtggcctt cttatttccc caaatgccag gcctgctctg tcttggggcc 46261 tgtgccagga gcactctttc ctctgctgcc ccatggcgca gcccccctag gcaaggcccg 46321 gcctgcccac cgccaccttt cctgtctcct tccctgcccc actgctctcc caagcactcg 46381 acaccccgtc agtggcgctt ttctctctcc ctgccatgga ctgcaagctc catggggtca 46441 gggatttttg tctgttttgt ccctgctgtg tcccctgcat cgggtggcta gtgagggctc 46501 caggtgcttc aaggaggtgg agcgcactgg gtgggaaggc aggctcttac cggttggaag 46561 atttagataa gctttgctgg gctggtgtgc aaggcaggtg ggctcaggag atgggcaggc 46621 ctgtggctcc cactccagcc ctgaggcctt gctctgtcca gctctggggc cctggcctta 46681 ttcacccaag gctccaagag aacctgcaga aggcagctgg ttctacgcca ggatgccctg 46741 ggcacaaaga cttgcagacc cttccctttg tccacagcga ctgtccagct ctcagaacac 46801 ctggggagga gggctgtgtc cttgagagca ccgtgggaag gaaggtccaa ggccttcgac 46861 ctttaggatt cacatccccg ccccgccagc ccaatctagt tccctggttt cccgggccag 46921 gcccctttcc accctgcatg gccctgagcg gatactgcgt tccgtgtctt cccccagccc 46981 ccagccaatg atccctgagg cttccccctc aggatcacac ccacccctgg atacagtcct 47041 cgggtccttc acaggacatt cccaccactt cagccacacc ccagcaccct cagaggcggc 47101 cttcgccctt gctccccacc tctgctcctg tggggaatct aaggatcaag aaactgagag 47161 tcaaggccat tgatgagggt caggggtgct gccacggggc ctagagtgtg acagaaaagc 47221 aaattaagac aggagcagct cctgggagaa gcagacacca aacaatacgg ctgctggccc 47281 agaggtcaaa aaccatggcc tagagggggc ggtcaggaag tgggacatta ggccgggcgt 47341 ggtggcccat gcctgcaatc ccagcatctt tggaggccaa ggcaagtgga tcacctgagt 47401 tcaggagttt gagaccagcc tggccaacat ggtgagactt cgtctctact aaaaatacaa 47461 aaaaaattcg ctgggcttgg tggcgggtgc ctgtaatccc agctactcgg gaggctgagg 47521 cacaagaatc ccttgaacct ggggaggcag aggttgcagt gagccaagat cacaccactg 47581 cactccagcc tgggcaacag agcaagactc catctcaaaa aaaagaaaag aaaaaaaaaa 47641 gaaagttggc gtttcaaagc caaacagctc tgggcgttgg atgacaaagt tcagatgtgc 47701 cccaggaggg gcgacatcag tggtgcagga cagagggccg gtggaaggag ctggctgtac 47761 gtagaaacaa aaccagatgc ctactggtgc atttaaaaat caacgttatt gagacgtaat 47821 taacatacta tacaattctt ccttttccat gtacagttta aattcattca cagagttgtg 47881 cagccatcat cactaactcc agaatgtttt atttttattt tatcccccaa agaaacccca 47941 gacccatgaa cagtcactcc tcattccctc tctccagccc ctggcaccca ctcatctgct 48001 tcctgtctct gtggatttgc ctatctggac atgtcctaga aatggaatca tgtgctctgt 48061 ggcattttgt gactagcttc cttcactgag catcatggtt tcaagattcg tccatgtcat 48121 aagatgaatc agtccttcat tccttttcat ggctgaataa tattccattg tgtgcataga 48181 ccacaatttc tttatccatt catcccttga tggacatttt gggtttcttc atgttttggc 48241 tattgtgaat aacactgctg tgaacatcca tggacaagtc tctatgtgtg cagatatttt 48301 cgtttctcct gggtgtgtag ctaggagtag aattgccagg tcacatggta actggacgtt 48361 tcactttttg aggagctgcg agactgttct ccacagtggc tgccccattt taccttcccg 48421 ccagcagtgt tggagggttc caccttttca tcgtggctag cactggttat catctccttt 48481 gtattctagc cacctagtgg gtgtgaggca gtatctcttg gtggttttga tttgcatttc 48541 cctgatgact aatgacgctg agcctctttt gatgtgttga gtggccattt gtatgtcttc 48601 tttggagaaa tgtctgttca cgtccttcgc ccatgtgtga tcgggttatc tctgtcgctg 48661 agttgtaaaa gctctttgta tattctggat actgcaccct catcagatgt gtggttcacc 48721 agtccagagt tttctcccag tctgtaggtt gtctttttca ctgtctcgat attgtccttt 48781 ggtgcacaaa agtgttgagt tgaatgaggt tcagttgatc tgtttttctc ttgttgctca 48841 tgcttttggt gtcctagtga aggaactatt gccatatcca aggtcgtgaa gttttatccc 48901 attttcttct gagagtttca tactttgggc cacactacac atcagcctgt gatgtgctct 48961 gggttggttt gtctgtatgg tgtgaggggt ccctctcctg cacatagaga gaaagagaga 49021 gagagctggt tgccccggca ccatttgcag aagagcctcg cctttctctc cagcggctca 49081 tttttgactt tccgctgtct ctgccctgcc cctccccgcc ccgccaccca cccctctggg 49141 gctttgcaga tgcagaggcc gtgaagaagg cagccttgga aaagaaggag gaggagctgg 49201 tgagcgagcg cacagaagcc ttcactattg cccgcaacct cctgacagcg gctgcagatg 49261 ccattgagcg gatcatgtcg tcgtacaagg aggtacccct ggcccagccc cacccttgcc 49321 atccttgcca tgcttctctc cctgcaactg gcaggggctg agccagggtc accctccctc 49381 aggtgacgga gctggctggc tacacagccc gggtgcacga gatgttccag gtatttgaag 49441 atgttcagcg ctgtcacttc aagaggccca gggagctaga ggacgctcag gcggggtctg 49501 ggaccatagg ccggtctggt gtccgtgtgg agggccccct gaagatccga ggtaaggctg 49561 tcccctccct atgagtgacc ccgcccctgc tgctgctgca ggtgctgacc tgctgcccca 49621 gctcctccta ttcccgctcc ctcactcagg gacctccatg tgcttctggc ccatcccagt 49681 ccacccagga cgggagggct gccgggcagg gtctttgagg acttcggcct ggtcgagctg 49741 ggcccctgga gggtttcctg cagagaggtg ctggtccgcc cgccttcctt cccagacagt 49801 agctgccggc caccgtactg actcgccctt tgagggcctc agcctggatt attcattcaa 49861 aacaaggggg atgtggtccc ctcacccatg caggacagca agagaaagtt ccagtcagtg 49921 tgccagctgc tggctgccac gggaggcagg tgctgcagaa gggagtggcg gcccagggca 49981 ctgtattaga cactggggga agagttcagc ttgttggaag acctggctgt gttccctagg 50041 gaccctggac cacaggctgc tggtcaggaa ccagctggca tgctgccagg gatgggaatg 50101 agggcgtgca gccaggggca cgcagactcc ccagaatgca gaggggtcgc caccactccc 50161 tctccacccc agccccgctg tgctgtctct gcaggccagg tggtggatgt ggaacagggg 50221 atcatctgcg agaacatccc catcgtcacg ccctcaggag aggtggtggt ggccagcctc 50281 aacatcaggg taggtccagc ggggagggcg ccagccacgc acatatgcaa gcctcagccc 50341 ttggcttccc gcctgtctgt gctggcaaca gccattgtcc ctagatgtac gtggcaggtg 50401 ggccaaggtc aaggtgagag accaacgtgt ctctgactgt tcatcctggg gcaacagagg 50461 cagggctcat aaaagagact agtgatacca ggattgacca aggttcaccc cggcgttcct 50521 ggccctatca tctgatgcca actccccaca ctcctaagaa agccaagacc cgggtggggg 50581 ggctctggtt caaaccttgc cgctcgacct ccctcggaag gccacagcaa ggaatccaca 50641 gatcacctgt gtccccagcc aggggtttgg acagggctgg ggagtaatgg agtgaggggg 50701 agactggggt ggagggacag gtagagaagt gacaaggaat cactcattca ttcactcatt 50761 taaccaatgt gccctgaact ctgagccggg caccagaaac ccgaggtaaa tcaggagacc 50821 tgcactcagg gagtcttcac tgtggagggg cactaaagtg ttacaaaggg tctccaggta 50881 gacagctgtt caagggacag tgggggtcac aagaagagtg gtcagagtcc ctgggggtgg 50941 tgggggtggg atgaagcctt gcccaggagt tgctgtgagc gagtgggcag gcagctgagg 51001 gtagaggagt gaggggccgt gggcctgagg ggcaggtcac gcaggaggaa gcagagagga 51061 ggggcatgcc agggaggagg gggccggcac aggtggttac ccctcaccgc tcgcagcggc 51121 ccctcctagg atgtcggggg agctgatcac cagtgagtcc aaggaaggtg gtttccaggc 51181 tggccccggg cagcacaagc aggcaggggc agcgggcaag ctcatggggc ccctgcgcgc 51241 agggccacat atgctcaggg agccgggtat gcgagatggg gcaaggccca ggccccaccc 51301 ttcaggaggg gacagtcagg tggcttcatt agcatcctgt ggctgcggtc acaaagcgtt 51361 acaaactttg agtggctttc ccagcagaga tggcctctct cccggctcgg ggaatagcag 51421 tccgagagga aggcgcaggc agggcgggct tctgccaagg accgagaagg tgcctccgct 51481 cggggcctct gtcccagctt ctgctctgct gcccatctgc gggcttccct ggcttctgcc 51541 acggcaggtc ggcctcagcc tctgtctcca cacggcgctc tccctctggg tgtgtccgtg 51601 tctccgtctc ccctttctgt caggacacag gtcacactgc attagggccc acccctctgc 51661 agaatgacct catgcagacc taactcatca cgtccgcaat gaccctgttt ccaaataagc 51721 tcacactccg aggtagtggg gattagggtt cccacatagg aatttcagag gacagagttc 51781 cacccatgac actgcctgag gtaagctaaa gaccacggcc tcaagtcttc ccaggagccc 51841 cgtgtagcat tgttgttgtt accgtgaact tcactgactc caggcccctg gcctcctccc 51901 tgcacacagc ccgcctccag cctggccggc attttcccaa agtaggcatt tcctagctcc 51961 agcgaggacc atggagtcag tgaattgagg agcctgaggt ccatgatgca gagcccaggg 52021 gccactgtgg catctctggg ccactctggc acctggggag gcagtggggt ctgtactgtc 52081 agtctagaga cataaagaaa gtgctttttg ggccgggcgt ggtcgctcat gcctgtcatc 52141 ccagcacttt gggaggccga ggtgggcaga tcgcttaagc ccaggagttc aagaccagcc 52201 tgggcaacat ggcaaacccc gtctctacag aaatttttaa aaatacacaa ataagccaag 52261 tgtggtggcg gtgcctgtag ttctagccac ttgaaaaaaa aggctaaagt gagagggtct 52321 cttgagccca ggaggttgag cctgcagtga gccatgatcc caccactgca ctccagcctg 52381 ggcaacagag caaggccccg tctcaaaaag aaaagaaaga aactgccttt tgtccccagt 52441 gactcaggag gccaaggtga gagagtcgct tgaggccaag agtttgagac cagcctgagt 52501 aacatagcaa gaccctgtct ctaaaacaac atttaaaaat tagccaggca tggtggcgtg 52561 catctgtagg cccagctact caggaggctg aggtgggagg atcacttgag cccaggagtt 52621 ggagactgcg gtgagctgtg atcataccgc tgcactccag cctgggcaac agagtgaggt 52681 ctcgtctctt gaaaacaaag tgcctttcag ggcagttcct taaagggggc tgacagttga 52741 ccctgcactt ggattcctgg tgaagtggga gtcggatggg actgaggacg gcgctggctg 52801 tgttggaaca cacctactca ttcagctgtg gcagaatagg cccttcctct tgtgctggca 52861 ccatgttctc caggcgtgtc agggcctgag gactgggccg gggcttgtcc attcctgtgt 52921 cctgggccag gcatttagcg agagccaaat ttagctaggg ctgtggacgc tggaccccat 52981 cccccaggcc ctgctgtccc ttatcaagag atcaagaatg gcctgcgtgc tggcctcggg 53041 cattgggagc ctctcaaggc tggtcaggag gccatagggt acgggaaggg gcctgcgctc 53101 tctggcgtca gcggctgttg cccctgcagg tggaggaagg catgcatctg ctcatcacag 53161 gccccaatgg ctgcggcaag agctccctgt tccggatcct gggtgggctc tggcccacgt 53221 acggtggtgt gctctacaag cccccacccc agcgcatgtt ctacatcccg cagaggtaag 53281 gaagcccgtg cgcctctcct ccacctcttc ctgcctgtgc gctcacacat ggcttcctgc 53341 agaggcccag gaagtggtga agagtcagca cctcaggaga ggacactgag gcactgtccc 53401 cagagccaga gacgggctgt ggttcctgct ccctccaaac ccgcccgatc cactgccctg 53461 ttttggatct gtgtggggtg tgtgcacggg cggcgatgtg agcgtgtgga tgcgtgtgag 53521 cgtggcatgt ggacactgcc tgggaggcgc agagtatctt gggggaggca gagccggccc 53581 ttccctccgt ggacacccag ctttcccaca ggccctacat gtctgtgggc tccctgcgtg 53641 accaggtgat ctacccggac tcagtggagg acatgcaaag gaagggctac tcggagcagg 53701 acctggaagc catcctggac gtcgtgcacc tgcaccacat cctgcagcgg gagggaggta 53761 ggaggcctgg ggctggcagc caccctttgt cccaccctgg cctctccctt ggcctccagg 53821 gagtgaagat tacctcaaca tccagagtct aaagtgccag gtgccacggg gcggggcaga 53881 ggctgctacc agggaggacc aacaccacac agatggcccc aggtgctcta gggaaggggg 53941 cacctagcag ggatgtgcac ctcactgggg gacccaggat accctctccc agagaaaaga 54001 ggtctgagct gagccctgca gaatgctgag tggttacccc gtccggaagc caggggcagc 54061 agggcggagt gcgttccgaa ggcttggtgg tgcgagaggc tggctcacag agggccctcg 54121 ggaccaggcg ggagcctagg ctttccctga gcaggatcag acgctcttgg aaggaccatg 54181 gggtggtggg caggggcagc ctgggagggg caggcacatg tgtgcagtga tggctactgt 54241 caagaggttt gtgcagacgc ttggaggggg ctggggccag cagagtcagg tggattcaga 54301 gatgagttca ctgaaaagga ggccagactg agctgttgtc ttgtcctggg cttatcaagg 54361 aatactgctt gtccacagtg tctgtcgggc cggaagagcg gaggaggaga gggggctgca 54421 gctacaggga cacagtagat ggagtgttca gttctgtctt tgaattctga gcctctgggt 54481 tctgcttcca gcctgcactg ctgggtgcga gatggccctg ggcaaggacc tcgccttgct 54541 ggggctcccc ttcacggttc aagggcacgg gcaccaagcc ctccctcggt ggcaacatga 54601 gaagaagtgg ctcctgcagg aaatggccgg ggtgttgtca cctgcctgtg gaggaagcgg 54661 ggacacaggt ggcaatggca gtggagcagc ccctggcccg gccctgcctc ttgctcctgc 54721 tgccctcagc ctgggagcac gtggcccctc ccgcctctgt ggcagcctga atgcccaggg 54781 cctgtggccg gccagcatga gccattagga tggagttgag ctgcgaggaa cagaacgggc 54841 ctccccgcaa tagtggctaa gatcatctgt gagtttatcc tactgagctg ttaggtccca 54901 agagagccag gccacggttg ccagggctgg ccctgctctg tgaaggcccc agggctcgag 54961 gattttctac caggtcactc tgctgtgttt ggcctccgtt cccaaagtca cctcatgatc 55021 caggagggct gctgcagccc tcacatcatg tcccaggctg taggatggag gaagtagaag 55081 ggaaggggca aaaggtatgt gccttctttt aaggaaggtt ccagaagccg ccatattgaa 55141 tacttacagt tatatctcat tggccacaac ttagtctcat gctcacacct catcacaagg 55201 ccacctggga agcgtaatct ctactctggg tggccatatg ccctgttgcc acttctagcc 55261 ctgggccgct ggggaaggca gcatgggcga gaagacagga agggccgctt ctgccgcagc 55321 gccccgacct aatggagcag ccggctcacc tgctcgttca agcagcccac tcgagccttg 55381 ccaaagtgct gacacggggc agtgacagga ggcccaaccc ctgtgggtga caagcccccg 55441 gtctggggag agcactcagg ccgctctgga gctctgtgcc aaggaactgt atgggtgccc 55501 tggggctgcc ataaaccgca gggatggatt gtctcctaga tccagcagtc cgagatccag 55561 gtgccagcag ggtgggctcc ttccgggtgc catgacagaa ggatgtgttc caggcctctg 55621 tcctcggctc gcagatggtc cacttctccc tgtatatctt cacctcatgt tcccctgtgc 55681 atgtcttctg cccacacacc ccctttttat gaggacacag tcatattgaa ttagggtcca 55741 ctctgatgac ctcatcttag tgtgatcacc tctgcgaagg ccctgtctcc aaataaggtc 55801 acactgaagt gttggggctt ggactccacc gtatctcttc tgggggaagg cacgattcca 55861 gtccccactc ctccatgatt aatgcctgtc agacagacaa ggacgcagag gcacaggggc 55921 cctgtcgtca cagctagctc attcccgcag ctcccccagc tccccggctg gcccccgggt 55981 ctgggtgctg gtggaactga gccaagacca ttgcccccgc ctaggttggg aggctatgtg 56041 tgactggaag gacgtcctgt cgggtggcga gaagcagaga atcggcatgg cccgcatgtt 56101 ctaccacagg tgagcactcc gggccggcag gctccctggg gtcccctgga aggggaagta 56161 gcagctgtgg ggaggcctgg gctcagtgga gcctgagccg ggctggggtg ttgggccctg 56221 gagggtgcac agactctcct ctcggcccgg acccccaggc ccaagtacgc cctcctggat 56281 gaatgcacca gcgccgtgag catcgacgtg gaaggcaaga tcttccaggc ggccaaggac 56341 gcgggcattg ccctgctctc catcacccac cggccctccc tgtggtaggt gccctgtctc 56401 cctgcctggg gtcggtggga gtggctgcct gaggggagga ggtggcctgg cgggcccggc 56461 agcagcaggc ggctgtcatc agcagccccc gtgccgtgcc cctgaccctg tccctctcct 56521 ggccaggaaa taccacacac acttgctaca gttcgatggg gagggcggct ggaagttcga 56581 gaagctggac tcagctgccc gcctgagcct gacggaggag aagcagcggc tggagcagca 56641 gctggcgggc attcccaaga tgcagcggcg cctccaggag ctctgccaga tcctgggcga 56701 ggccgtggcc ccagcgcatg tgccggcacc tagcccgcaa ggccctggtg gcctccaggg 56761 tgcctccacc tgacacaacc gtccccggcc cctgccccgc ccccaagctc ggatcacatg 56821 aaggagacag cagcacccac ccatgcacgc accccgcccc tgcatgcctg gcccctcctc 56881 ctagaaaacc cttcccgccc tcgggaaagt agatgtggag ggtggcgccc tgcgtaaccc 56941 tcgccctgtc cctcccactc cctgggggcg ctgttccaca gtgactgggc cctgtccagg 57001 gcagtgagtc ctctactttg ctccgtggag gaagctgggg tacaaggggc ccagtgctgg 57061 ccacacagca gcgcagccga gccccaggag cccgtcaggc cacagcccct ggcactgcag 57121 gtggcctccc tccagagact cgagtcccca tgattccctc ctcgtcagtc tctcaaagac 57181 cccatggtcc atcccctgag ggtggtcagc caaggctccc gttccgtggg atgccataaa 57241 agccgcccag tgggacccac agtcacacag agcgcctcac ctgcatcctc tcccccacaa 57301 gagccccaaa gatcccacgg gagaggggag agggacgcac agcactgcct gccaagcgag 57361 aatgcaggcc ccgccccctc ggcccctcac cacctctttc tacagcctaa tttattggat 57421 tccctattcg tagccatctc cgtggccaat gtgactaccg tgccagcagc gggggcggcc 57481 cagcctctga gtcccgtggg gccccggctc ccaccggtgc caaacccagc ccctgcggcc 57541 gtcaccccgc cagcctacac tgccagccgc caccggggca cacgggcctc tgcttgccag 57601 ccaggagtgc ggacaccatg ttcccagctc agtgccaaag aggggtcacc agggggagct 57661 gtctgcggag ccagcgcctg cccgagagag accccaccgc caccgtgtgc ctttcccggg 57721 ccctcagccc tcgggccggg caccaccccc agtcccccca gtaaaagcct ccactggcaa 57781 atgcagtcct tcctccctgc ctcagagcct ggtggtgtct gctgtgggtc tcgaggagag 57841 atggaggaga gggagtgggt tgcctgtggg ggaaagagtg agtttgggaa aggagtgggc 57901 ctgaccccca agcccctccg agggggaaag tcaccagaag acatggtcca gcatgccctc 57961 cgccgagcct cacgccaatg ctcttaggat tcctgtgacg gtggcggggc ggaacctgca 58021 acaacattgc acagaaatac tggctgagcc caaataggac taggggaggg gatcatgctg 58081 gtccctgtgg gaggagcacg aaggcaagag aagggatgtc taagctgcca cacagggtgc 58141 tgctggccct tctagggaga ggcggccact tgtgcagggg cctgggggga actgggagca 58201 cagcgcaggg tgttcgtgct gcatgcaggg gaagggaggg caggggaagg gagggctgcg 58261 gccggcgggc cttggaggcc acactacaga gacaggactt agcccagagg ccaccgagga 58321 gctttcagca acagggaagc agtgtcgagt actgcaggcc acgtggctgc atgtgagggt 58381 ggctggtggg aatagggtgc ggcagcccat ctggcctcag aggcacgaga actgagaaca 58441 gctgtgcggc cataccttta tgcatggatg gccacagcct cccaaaggtg gggcagcctg 58501 agtgttcatc aacagacaaa tggacaaaca gcctgtccat aaggcacagt gccattctgc 58561 cataacacga cagatagacc tcaaagagtt cgtgctgggt gaaagaagcc agacacaaat 58621 gtccagaata ggctcatcgg gacagaaagc agacaagtgg gtgtcagggg ctggggcagg 58681 ggaaggaaaa tgtggcgggg ggagtccttt ttaaaaaatt ttgtatttat tttttatttt 58741 ttaatgagac agacagggtc tcaccctgtc acccaggctg gagtgcagtg gcgcagtcat 58801 aactcactgc agccttgatc tcccgggctc aagcaatccc gccccagcct cctgagtagc 58861 tggaaccaca ggcgtgtgcc accataccct gctaattttg tgattttttt tttttggaga 58921 caggatctca ctatgttgcc caggctggtc tcaaactgct gaactcaagc gatcgtcctg 58981 cctcagcctc ccacagtgct ggattacagg catgagccac cacacccagc ctcgggtttc 59041 tttttatttc gaagaaaatg ttctcgaact atagagcata ctaaatgcca ctgaattatg 59101 cactttaaag ggattgattg tatattttgt gaatatcgcc tcaaaaacag atgattgatg 59161 gataaattga tacatagata tatagatata tagacatgat tgatatagat gattgattga 59221 tagatgatgg atgattcata ggtgataagt gatagataaa atacatgata gatacatgga 59281 tagacagatg aatagagaga gatgatagat ggttttaaaa gtttttttag agacaagatc 59341 tcactatctt gcccagtctg gactcgatct gctagcatca agcagccctc ctacctcagc 59401 ctcctgagtt actgggacta caggcacatg ctactgtgcc tggtgataga taaatgattg 59461 aaagatagac atgatagagg cataaatgat agatagatgg atagacatga taaaaggaag 59521 atacatggga tagatcaatg attgattata taagtaaatg atataaattg atagattatt 59581 gattatagat taataggcgg atagttgatt gatagatgat tgatcgattg attgattgat 59641 tgatggagag agacagagaa gcaagcacag ccattgcagc cacccagaca agacatgctg 59701 aggcctgaag ttccagaagg ttcgagcagt tggaagaact caacaggcat gggggcagct 59761 tcttcaggga gtggaggggg cagcaaggta ccactgggtt ctggcttgga aggttaggtg 59821 agcgacagca cccttggtgg acagaggcag ctccaaagga ggggcaggcc tggggagcag 59881 gtgcagcccg aggggatggt gtgtaggcag ttggttcaga gcttggggct cctcagtggg 59941 acgtgggtca gcagggaggc cagtggtcat tgaagtttgg atggagacag cctggctgag 60001 gggaggggca tgcttggcat ctcatttagg ggacaagagg tagactgttt acctgcattt 60061 tgagattagg atttattcct gatcccagga ggtggccgat tcggagggct gagagttgtt 60121 cctccatttc ttttgacgat tgtgtaagtc gcccatgctt gatcataaac cccgtttgtt 60181 tatttctgac acatagctgg aatggctcta attactacag atagaaggag acacatctgg 60241 caaagaccat ccaaaaggaa gctagtggag agaagctcat attgcacaaa taggctccag 60301 ggcaaaatca ttattaggat taaaagtggt tgcagcatac tgataagtat tcattccaaa 60361 gcattcactg gcgggggagg ggtgggggaa aaagaataaa tacataaata agttaattat 60421 tttaaaagaa gtattagcgg ccaggcgcgg tggctcacgc ctgtaatccc agcattttgg 60481 gaggccaagg cgggcagatc acctgaggtc aggagttcga gaccagcctg aacaacatgg 60541 tgaaacccca tctctactgc agtacaaaat tagccaggca tggtggctca tgcctgtagt 60601 cccagctact tgggaggctg agacagaact gcttgaaccc gggacgcgaa ggttgcagtg 60661 tgccaagatc acgccattgc actccagcct gggcgacagg aaaacaaaaa acaaaaaaaa 60721 gaagcattag ccattctgat cttgtgtgca cctgcataat gatagagccc caaatgacta 60781 caaaacaaaa aagtgtcaag aaaaggaaaa atgaataaat gagcacattc tccacgcagg 60841 aaattatacc acttctcact ggaactgctg gtttaagcag actcaatgag taagaatata 60901 gaaaaattgg gccaggcata gtgtctcatg cttgtaatcc caacactttg ggaggcgaag 60961 gcaggcggat tacttgaggt cgggagtttg agaccagctt ggccaacgtg ctgaaactaa 61021 aataaaaaat acaaaaatga gccagatgtg gtggctcatg cctgtaatcc cagctactcg 61081 ggtggctaag gcaggagaat cacttgaact tgaggtttca gtgagctgag attgtgcccc 61141 tgcactccag cctgggcaac agagtgagac tctgtcaaaa aataaaaata aaataaaaaa 61201 atatggaaaa gttgaacaaa cttgatttag tggacaccca aaaactacag accacacatt 61261 gttttcaagt tcaccttgga catttactaa caccatgtcc taggctgcaa agcaagactc 61321 aacaaatagc aaaagaactt gcatcacacc agccatgttc ttgatgcaac agaataaagg 61381 cataaattgg caaccaaact aaaattaagg gctcccctat gtttggaaat ttaaagatac 61441 actactggcc aggcacggtg gctcacacct ataatcccag aacttcggga ggccaaggca 61501 ggaggatccc tggagcccag gagttcaaga ccagcctgag caacatagtg agacccccca 61561 tctctataaa actaaattaa attatttttt aaattaaaaa aataaaaaat aatgcactgg 61621 tccgagaaga attagaatga aaatctaaaa gcatttagaa ccgaacaatg aaaactatgt 61681 acaaaagctt aagccatgta gcccaagcag tactatgagg aaattaaaaa agaaaaaaaa 61741 agtgtggcca ggcgcggtgg ctcacgcctg taatcccagc acctcgggag gccaaggtga 61801 gcggatcacc tgaggtcagg agttagagac catcctggcc aacatgacga aaccccatct 61861 ctactaaaaa tacaaaaatt agctgggcgt ggtggcgggc acctgtaatc ccagccacct 61921 gggaggctga ggcaggagaa ttgcttgaaa ctggaaggcg ggggtcgcat tgagccaaga 61981 ttgtaccgct gcactccagc ctgggcgaga agagcaaaac tccatctcaa aaaaaaaaag 62041 tgtaaaaaat aaaaataata ttatgaagta cagagggatc tccctgcagg caccactgag 62101 agctgaaaca tcggggcacc tgggggcgaa agacatgagt gggaacaact tcagctaaaa 62161 ggaatgcaaa gggattgcag atgtaaaagg gagagttcac agcagagagt gagaggagcg 62221 cctgccagga acatcacaga agctggaaaa caagtggggg agtgataact gattcaaggg 62281 atcagcgtga acttggaaaa aagtggtggg aggcaccaag ggcacgtgcc cacagaggaa 62341 aggccacatg aggccaccac ccagatgaga ggccttggaa gaaaccaacc ctgctggcac 62401 cgtgatcctg ggcttccagc ctgcagggct gtcagaccat tacggtgttg gaccgacact 62461 gtaaagaaag aagcagtgat agcacacatg gttgtctcgc tccagttcta aaagcgggaa 62521 ggatgctggg cgcggtggtc tccagaaggc ccctccatgt ttctctgtgg cacccgcagc 62581 acctggatgt cagcactggg agaaaccgcc tccacattca tctgtaaaat ccgagcatca 62641 gtgagctcaa ctctcccgct ttctcagctc tctgtttcca ttaggtttgg ttgatctggg 62701 ccagggccag catcagaggc aaacccaaag ctgttctctg tgggctgacc tcttctgcct 62761 tctcttacgt gtctgccatg tccactagcc tgtgagcttc aggagacaaa ggagcatgtt 62821 tcttcttcct gatacccccg aaacttgcac agtacatgct atcaaatgca gattcaatga 62881 gggctctttg cattacaatt ttgaaaaaga atcatgcact atgatttagc gtctcttccc 62941 atcaaaaccc aggcccatag agcaattgcc tttgcctgtg atgcacacct cctacctgtt 63001 ctccccccaa cccggcatct ctgtcttgca gaccaaacac aaaactcatt gccatccctc 63061 ttcaacctgg cttcctttct gacttccctt caggttcccc aggcagaacc atggggatca 63121 tctgaccctc tgtctccctc atccttccct aacccaccag cccgtgtcct actgactcag 63181 tccccaaagt ccctctcgct ccccactgga tcctcagtgt ctggtagtac agtgcctgac 63241 gtgggaggta cacagcgacc actaggtgaa tacaagaatg atgtgattga gggtgttttc 63301 ggtctcatat gctgcccatg ctcggtgatg cagtaaatca tcatctgttt ggaacggtct 63361 aagtctccag ccagaaagga gggagaaggc accatgtgac cctgcatctc ccttgccagt 63421 gctctgattt gacgttctag aagagtagtc tttatgccgg ggcacatgct cccctgagga 63481 ttatgaagcc tgtccaggag gggctcagca tggatggtcc caaggagcca atgtcccgcc 63541 tcagcttgca caggggctct aacctaacct gcctttccca gaaacctgac cccagttgac 63601 aagagcagga tgttgcacca tgatacgtcg tggggtcaaa ggctaccagg caccaggaaa 63661 agaagccatt tgaaataccg cctcctttca ccaacctgac atctccacct ctgcatttca 63721 ggaaaagaaa tacatccaaa agaattttcc cttcacattt aataattaca aaaattgtgt 63781 tattttcctc aattacatgc taggaaaact tgtaggaatt ttgccatctc gaataaatgt 63841 tttaacactt aggggaaaaa aagtagtggg aggcagccag tggcaaagtt gggtttgcac 63901 aagaatcgga ggcaccagat aactcagaaa caatgtgtgt gtatgtgttg gggagtgcag 63961 aacaagatga tttattggaa gtctctataa gaagctgctg tccttgcact ggctatgcag 64021 attaaaaaaa aagaagaaga aaacaaagaa gatgctagat gcccatatct ctgccccact 64081 ttgtacagaa gaccagagat agaacagaag gtctgtggag atgcaaattt aagccacagt 64141 aaaataccct tttttttttt tgagacggag tctcactcta tcacccaggc tagagtgcaa 64201 tggcacaatc tcagctcact gcaacctctg ccccccaggt tcaagcaatt ctcctgcttc 64261 agcttcccaa gtagctggaa ttacaggtgc acgccaccac acctggctaa tttttgtatt 64321 tttagtagag acagggtttt atcacgttgg tcaggctggt ctcaaactcc tgacctcagg 64381 tgacccaccc acgttggcct cccaaagtgc tgagattaca ggcatgagcc cccgcacccg 64441 gcccacagtg aagtatcttt ttgcatccac tgattggcag aaatgtaaag tctgacaata 64501 ccaagtgtta gcagtactac agagcagcag gaactcacac ttccccgggg caggcaaact 64561 gggacaacca ctttggaaaa cagtgagcat tattcagtat agttgaagat gtgcacattt 64621 agaagctaac aatttcactc ctagacacca gattgcatgt acaaaatcgc tcacagcatc 64681 attgtttata atgcttaaaa taatgtttaa atgtttttta ataggcaaaa gccagaagca 64741 acccaagtgt ccattagcag aagaacagat aaaagatcta tgttagagtc atacaatgaa 64801 acagtataca gcaatagaaa tgtcccagag tcagaccata gaagtctaaa aaaataaata 64861 aatgaataaa ctacggttat gtgcatcaac ttggatgtct gtcacaaatg tcatgtggag 64921 tgaaaacagc aggttataga aaaatatatc cagtatgatt ccacttatag ggaagttaaa 64981 aaaaaataat gctgcaggat ttgaacaaca ctctagacca ggggtcccca accccaggcc 65041 actgaccagt actagtctgt ggcctgttag gaaccgggct gcagagcagg aggtgactgg 65101 tgggcaagtg agcattcccg cctgagctcc atctcctggc agatcagcag cggcatcaga 65161 ttctcatagg agcacgaacc ctatcatgaa ctgcgaatgt gtgggatcga ggttgtgcct 65221 gatgatctga ggtggaacag tttcatcctg aaaccatacc ccccaaccct ccatccatgg 65281 aaaaattatc ttccacgaaa ctggtccttg gtgccaaaaa gattggggac cactgctcta 65341 gagcaaatgg acctagcaga catacacaga acattccatc caccagcagc agaattcgct 65401 ttcttctcag ctgcacatgg ggtagtctcc aggatagacc atatgtcagg ccacaaaaca 65461 agtcttaaca aattcaagaa gatcaaatca cattaagcat ctttcccaac cgcaaacaag 65521 agatcactaa gagaaagcaa actggaaaac tcacagattt gtggaaatac acaacaccct 65581 cctgaacaac caatgggtca aagaagaaat caaaagggaa acaaaaaaaa aaatctcttg 65641 agacaaatga acatggaaac acaacataca aaatgcagtg tgcagcaaag caattccgag 65701 agggaagttt agagccttaa atgcctccat taagaaataa gaaagaggct gggcatggtg 65761 gctcacgtct ataatcccag cacattggga ggccaaggtt ggtgggtcac ttgaggccag 65821 aagttcaaga ccagcctgag caacatggcg aaaacccgtc tctactaaaa atacaaaaat 65881 tagtgggcgt ggtggcacat gcctgtagtc ccagctactt gggaggctga ggcacgagag 65941 tcactggaaa aaaaaaaaaa agaaaaaaga agtaagaaag atctcaaata aacaacctaa 66001 ctttacacac caaggaacta gaaaaggaag aacaaaataa gcccaaagtc agcagaaaga 66061 aggaaagaac aaagatcaaa gcagaaatac atgaaataga gactagaaaa ataacagaaa 66121 agatgaatga aactaagagt tgggattttt ttaagataaa atcaacaaac ctctagctag 66181 accaactaag aagagataag actcaaataa atcaatcaga aatgagacat tacaactgat 66241 accacagaaa tgcaaaggat tgtaagaaac tactatgaat aggccgggca cggtggctca 66301 agcctgtaat cccagcactt tgggaggccg aggcgggtgg atcatgaagt caggagatca 66361 agaccagcct ggctaacacg gtgaaacccc gtccccgcta aaaatacaaa aaattagccg 66421 ggcgtggtgg cgggcgcctg tagtcccagc tactcgggag gctgaggcag gagaatggca 66481 tgaacccggg aggcagagct tgcagtgagc tgagatcgtg ccactgcact ccagcctggg 66541 caacagagtg agactccgtc tcaaaaaaaa aaaaaaaaga aaaaaaaaaa agaaactact 66601 atgaataatt atactccaac aaattgggta tcctagaaaa aatggataca ttcctaggca 66661 catccaacct accaagactg acttaagaag aaatagaaaa tctgaacaga ccaataatga 66721 gtaaggagat tggacaagta atagagtctc tcatcaaaga aaagctcagg cctgatggct 66781 tcattggtaa attctaccaa acagttaaat aactaatacc tatccttctc aaactcttcc 66841 aaaaaactaa agaggaggga acgcttccaa actcatttaa taaggtcaga aaattccctg 66901 atatcaaagc tggacaagaa cactaaaaga aaaaaagcta caggccaata tccctgatga 66961 acacaggtgc aaaaatcttc aacaaaatac tagcaaaccg aattcaatag cacattaaaa 67021 ggctcattca ccatgagcaa gtagaattta tccctgggat acaagggtga ttgaatatat 67081 gcaaaccaat aaatgtgatg caccacgtta acagaatgga gaataaaatc atttctcaat 67141 tgatgcagaa aaagcatttg acaaaattta acattctttc atgatgtaaa aaaaaaaaaa 67201 ttagctcttg gccaggtgca gtggctcatg cctgtaatcc cagcactttg ggatgccgag 67261 gcaggaggat ctcttgagcc caggagttca gaccagcctg ggcaacatag caagacacct 67321 ctacaaaaaa aaattttttt aaattaacca atcatggtgg tgtgtgcctg tagtcccagc 67381 tactctggag gcttgagccc tggagttcga ggctactgtg agccaggatc ataccactgc 67441 actccagtct gggtgacaga gcaagaccct gtctcaaaca aacaaacaaa tacaaattag 67501 ccaggtacat tggtgcgtgc ctgtagttcc agcactttgg gaggccaagg caggaggatc 67561 acttgggccc aggagtttaa gaccagcctg ggcaacatag gccctgtctc taccaaaaaa 67621 acaaaattaa aaagaaactc tcaacaaatt aggtatagaa ggaatatacc tcaacatcat 67681 aaaggccata tatgacaagc ccacagctaa catcatactg aatggtgaaa agctgaaagc 67741 tcttccttta agatcaggag caagacaagg atgcccactc tcaccacttc tactcagcat 67801 agtcctggaa ggcctagcca cagcaatgag aaaagaaaaa aaaaaacatt taaaaaccca 67861 aattaggcca ggctcggtgg ctcacacata taatcccagc actttcggag gccgaggtag 67921 gcagatcacc tgaggtcagg agttcgagac cagcctggct aacatggtga aaccccatct 67981 ctactaaaaa gacagtaatt agccaggtgt ggtggtgcat gcctgtaatc ccagctactt 68041 gggaggctga ggcaggagaa tcacttgaac ttgagaggct gaggttgcag tgagccgaga 68101 tcacactact gcactccagc ctggacaaca gagtgacact tcgtaaaaaa aaaaaaaaaa 68161 aaaagagaga gagagagatt aaattgtctc tgtttgcaga tcacacaatc ctgtatgcag 68221 aaaaccctaa agactccacc gaaaaattgt tagaattcag tcaaggtaca ggatacaaaa 68281 tcagcataca aaacaatagc ttttattttt tattttattt tttttttttt gagacagaat 68341 cttgctctat caccccacct ggagtacagt ggctcaatca cagttcactg cagccttgac 68401 ctcctgggct tctcccacct cagcttcctg agtagctgag actacaaatg tgcaccatca 68461 tgcctgacta attagtatta atattattgc tattattgag acagggtctt gtttagttgc 68521 ctgggctggt cttgaatttc tgggctcaag tgatcctcct gccttggcct ctcaaagtgc 68581 tggaattaca ggtgtgagcc atcatgcctc acctagtagc atttctatat gctatatgct 68641 aacaatgcac tttctttttt tttttttttt ttttgagaca gagtctctgt tgcccaggct 68701 ggagtgcagt ggcactatct cggctcactg caagctccgc ctcccgggtt cacgccattc 68761 tcctgtctca gcctctccga gtagctggga ctacaggcgc ccgccaccat gcccagctaa 68821 ttttttgtat ttttagtaga gacagggttt caccgtggtc tcgattcctg accttgtgat 68881 ccgcccacct cggcctccca aagtgctggg attacaagcg tgagccactg cgcccagcca 68941 atgcactttc taaaaaagaa atcatcaaag aagatttatt ttgttatcca aaaaggaaat 69001 caagccaaca atctcattta taatagctat aaaaaataaa atacttagga ataagtttaa 69061 ccaaggaggt gaaagaccta tacaccgaaa actgtaaacc acagatgaaa gaaactgaag 69121 aaaacacaaa taaatggaaa gataccctgt gttcatggat tgggatagtt aatattgtta 69181 aaatgtctat actacccaaa gtgatctaca cattcaatgt aatccctatc aatattccca 69241 tgagattttt cacagaaata gagaaagcaa ttctaaaatg tatgttaaac cacaaaagac 69301 cccaaatagc caaagcaatc ttgaacaaaa agaacaaaac tggaagactc atgctatctg 69361 atctcaaaat atgttacaaa gctatagtaa ccaaaacagc atggttctgg cataaaaaca 69421 gacacataca ccaacgtaac agaatagtga acccagaaat aaatctatgc atttacagtt 69481 gattgatctt cgacaaaggt gccaataata catgatagac aaaagacagt cttgtcaata 69541 aatggtgttg ggaaaattgg atattcactt gcagtagaat gaaattagac ccttatctca 69601 cacaatgcac aaagatcaac tcaaaacgga ctgaagattc aaatgtgaga cctgaaatta 69661 gaaaactact agaagaaact tacgtgaaaa gctgcttggc actggtctgg gcaataagtt 69721 tttcaatatg acaccaaaag cacaggcaac aaaacaaaaa cagacaaatg agtttgcatc 69781 aaactaaaaa gcttctgcac agcagaggaa gcaatcaaca gagtgaagga gacagcctat 69841 ggaatggaag aacatagttg taaaccatcc atctgataca aggttaatac acagaacaca 69901 caaggaactc atacaactca atagcaagca aatgtatttt ctaacccagt taaaataacc 69961 cagttacacc tactatgtac tcctaaaaat taaaaattga aaattgaaaa aataaataac 70021 ccaatttaaa tgggcaagga cctgaacagg catttctcaa aagaagacat acaaacggcc 70081 aataggtcta ttaaaaaaaa atgctcaaca tcagccgggt gcagtgctca cgcctgtaat 70141 cctagcactt tgggaagccg aggtgagtgg atcacctgag gtcaagagtt caggacctgc 70201 ctggccaaca tggtgaaacc ctgtctctac taaaaataca aaacttagcc aggtgtggtg 70261 gcatacgcct gtaatcccag ctactccaga ggctgaggca ggagaatcac ttgaatccgg 70321 ggggcagagg ttgcagtaag ccaagatcgc gccacttcac tccagcctgg gcgaaagaac 70381 aaaactccat cttaaaaaaa gaaaaaaatg ctcgacatca ccaatcatca gggaaatgca 70441 actcaaaacc acaatgagac accacctcac tcaaattaga atgatcgttt tcaaaaagac 70501 aaaagaaaac aaactttggc gagggtgtgg agaaaaggga acattttgga acagtatgga 70561 aaatggtatg catggtggtt cctcaaaata tttaaagtga gaatgaccat acaatccagc 70621 aatcccattt ctgggtatgt atctaaagga aatgaaatcg gtatgtcgag gcaatatcca 70681 cactcgcatg tccattgaag cactactcac aatcgccaag atatggaatc aaacgaaatg 70741 cccaccaaca gatgaatgga taaagaaaat gtggtacaga tacacaatgg atcactactc 70801 agcctttaaa aagaaggcaa tcctgccatt tgcaacatgg aggaacctgg aggacattac 70861 attaagtgaa gtaagccggg cacagaaagg caaatactgc atgacgctgc tcacgtggaa 70921 agtaaaaatg tcagcctcat ggaaacagag cagaacagtg gccgccagga ctggggtagg 70981 ggaaatggga aaatgctggt caaagggtac caagtttcag ttatgcagga tggagacgtt 71041 ccagagatct gatgtacagc atggtgattc tagttaacac atttaaaaaa ataaaaccaa 71101 aaaatttgtt gcaaaagtaa ataatttatt gtttaaggct aattcatatt tgggacaagt 71161 ataaagaaaa ccaaggaaag gctaggacta actgcgaagg gaaaaagaag ggtccatgat 71221 ggctgtcagt ggggatgatt catctaagca gcatatgctc ttccgtgtgt attagattga 71281 acaagatgaa actgcaattt ttgtaggtta acaataaatg aacattaagc aacgtcatct 71341 ggttttgacc caagacattt attaataata aatcaatgat aaaataaaac caaaacactt 71401 tatgctaata aacttgaaaa cttaaacaaa atgggcatgt tccagccagg cgcggtggct 71461 cacgcttgta atcccagcac tttgggagac cgacgcaggt ggatcacctg aggtcaggag 71521 ttcaagacca gcctggccaa cacagtgaaa ctccgtctct gctaaaaata caaaaatcag 71581 ctgggcgtgg tagcgggcac ctgtaatccc agctactcga gaggctgagg caggaggatt 71641 gcttgaaccc aggaggcaaa ggttgcagtg agccaagatc atgccactgc actccagccc 71701 gggcgacaga gctagattct gtctaaaaaa aaaaaaaaaa aagcatgttc ctagaaaaat 71761 attgtttatc aaaaattgac tcaaaataga atgattcatc ctaaattcag atagaatctc 71821 aaggggctcc aaataaccaa aacaattctg aaaaaggaga acaaagttcg aggactcaca 71881 cttcctgatt tacaaaataa aacatattac agtgtggtcc tggcataaag acagagacag 71941 aatataatcc agagcccaga aagaaactct caagtatatg gtcagatgat ttttgacaag 72001 ggtatcaagg ctacaaattg ggaaaagggc agtcttttca acaaatggta taggggaaac 72061 ttgatatcca catgcaaaag aataaagttg gaggctgggc atggtggctc acacctgtaa 72121 tcccagtgct ttgggaagcc aaggtaggag gattgattga gtctgggaat gggagaccag 72181 cctgggcaaa gtaacaagat cccatctcta ccaaaaacat taaaaatttg aggctgaagc 72241 acaaaaattg cttgaacccg ggaggcagag gttgcagtga gctgagatca cgccactgta 72301 ctccagcctg ggtgacaaag tgagatctgt ctcaaaaaaa aaaattaaat taaaaattag 72361 ctgggcatgg tggtgaatgc ctatagtccc agctacccag gaggctgagg caggagaatt 72421 gcttgagacc aggagtttga ggttacaatg attatgccac tgcaccccag cctgggcaac 72481 agagcaagac ctgtctcaaa aaaaaaaaaa aaaaagatgg caaattttgt gttatgtatt 72541 ttttcccaca ataaaaaaat tactcaagaa gaaacagaaa acctaaatgg tcccataatt 72601 attacagaag ttggattcat tgtttaaaat ctataggggg acaggcagga agctacaaaa 72661 acaaggaagg ccaggccaga tgcggtggct catgcctgta atcccagcac tttgtgaggc 72721 cgaggcaggc agatcacctg aggtcaggag ttcaagacca gactggccaa catggcgaaa 72781 caccgtctct actaaaaata caaaacttag ctggacgtgg tggtgcatgc ctgcagtccc 72841 agctactcgg gaggctgagg caggagaatc gcttgcaccc aggaggcaga ggttgcagta 72901 agccaagact gtaccactgc actccagcct gggcaacaga gtgagactct gtctcaaaac 72961 aacaacaaaa caaaacaaaa caaaacaaaa aaaacaaaga agagaaaaga aaagaaacaa 73021 aacaaaaaac agaaaataaa aaacaaggaa ggcctagatg gttttacaga tgatctccag 73081 caaacatgga agtaactgat catctccatc ttagactaac tgttccagtg aacagctgaa 73141 aaaggaaata ctgcctcaac tcactttacg atgtcttgat accaaaacca gacgaggata 73201 ataatacgag gctaaaaaaa aaaaaaaaaa ccacccaatc cagaattgta tatgaaaaaa 73261 atacatcacg caaagttgag tttagtccag aaaatacaag agtggtttaa catttaaaaa 73321 tgaattaatg taatttgcca cattaacaga ttagaggagg aaaaccctgt gatcatatta 73381 atgcattcag aaaaagcatt ttataaaaat tcaacatata gcacactagg aatgggaggg 73441 agctttctaa atctgataaa catatctacc gaatgctccc tgcaaacatc atacttaata 73501 atggaacact gagaatgcct cctatgcttt tttttttttt tttttttttt ttttgagaca 73561 gagtttcgct tttgttgccc aggctggagt gcaatggctc gatcttggct caccgcaacc 73621 tcggcctcct gggttcaagc gagtctcctg cctcagcctc ccaagtagct gggattacaa 73681 gcatgggcca ccatgctcgg ctaatttttt tgtattttta gtagagacag agtttctcca 73741 tgttggtcag gctggtctcg aactcctgac ctcaggtgat ccacccacct cggcctccca 73801 aagtgctggg attacaggcg tgagccacca tgcccagcgc ctcctatgct ttcattcaaa 73861 ggaagtaaaa gcaagagaaa gcaagtccag gggaagtaaa gcaaggaaac gaaatcagcc 73921 gggcacagtg gctcacgcct gtaatcctag cactttggga ggccaaggca ggtggatcag 73981 gagttccaga tcagcctggc caacatggtg aaaccctgtg tctggtaaag atacaaaaat 74041 tagtcaggcg tggtggtggg tgcctgtagt cccagcttct tgggaggctg aggcaggaga 74101 actgcttgaa ccagggaggc ggaggttgca gtgagccaag attgtgccac tatactccag 74161 cctgggtgac agagccagac tctgcctcaa aaaaagaaat caaagctaaa aagactggaa 74221 tagaagaaac aaaatggtca ttacttgcag atgataggat tgtgtacgca gaaaatcaaa 74281 aagaatccac aaagagaacc caaacataat tattagactt aatatgagag tttacaaggt 74341 tacttgatac atagaaatcc attgtttttc tatgtaccag caacacacag tctgaatggg 74401 aattcaaaag aaggtagcca ggcgcagtgg ctcacaggct cacgggccag cactttggga 74461 agccaaggcg ggagggccgc ttgagtccag gagtttgaga ccagcctggg caacatagcg 74521 agaccctgtc tctacaaaaa aattacaaaa ttagccaggc actgtggcgc ctgcctgtag 74581 tcccaggtgt ttgggaggct aaggtgggag gatcacttga gcccaggagg cttcagtgag 74641 ccatgattgc gccaatgcac cccagcctgc acgacagagc gagaccccgt ctcaaaaaaa 74701 gaaatcagat tgataaatgc tatgatatta aaattaagac tttctgttta tcacatgata 74761 catcagcaaa ctgagaagac aagtaacagg taggccgaag acatacaccc cagggattgg 74821 tgtggaaaat acagaacacc tgcaaagtgt aagccaagac atgcacgggc atttgacaga 74881 agaaatgaaa agtgctgcaa tatggccggt ggtcaaggaa tgccgattca aaacccagca 74941 gggcaccatt tcacacccac tgtatgcgca gaaactctca agtctgctct ctggtgtacg 75001 tgaagacgtg gtgcagcagt ggcagtgggg gtgtccgctg ggatgccagc atggagagct 75061 acgtcttcag cccagcaaac gtgaagccca cgtcccactg acccaacgag tccccttcca 75121 gcattgagaa gcctggcaca cagcaaggag acgacaccag atcagttgac gctgccttcc 75181 catcccttgg tgaggggacc cggccattgc aggataaatc cagcacgact cccctgacat 75241 aagatgttcc ggttccccaa accagcctag atcctttctg gttacaccag cttcagacca 75301 ggggctgcca tagccacgcc tcttatctgc tcagaaagcg tagaaggtgc atatgcaggc 75361 catatacaca catatttatg aaaatggtgc tgcctgtgga tggatttatt ttcttttttc 75421 tttctttctt ttcttttttt tttaaagaca gagtcttgca ctgtcaccca ggctagagtg 75481 caatggcacc atcacggctc actgcaggca ccaactccca ggctcaggcg atcctcccac 75541 ttcagcctcc caagtagctg ggactacagg ctcgcaccac catgcctggc taatttttaa 75601 atttttttat agagatgggg gtctcactat gttgcccatg ctggtctaga actcctgggc 75661 ccaagtgatc ctgctgcctt aggctcccaa agtgctggga ttacaggcgg gagccacgac 75721 gcccagcctg atttattttc ttctgtggaa aggaaggaga aacccattcc ccatgccaga 75781 ttgcaggcgt ctgtctcatc cggggccagc aaggacatgt gtgtgtgtgt tggggtaggc 75841 atggggggaa ggacagggag agggagagag gccagctggc tcccccaagc ctcccctcct 75901 ggagcttcta aatccctccc tggctccttc aaggctgcct ggccttcctg ttggggagag 75961 acccctcgcc ggctgctcac tcgctttttg gtggtccttc ccacgggcct gcctggtctg 76021 cctccacatc ctcagcggct gcaggtgaga agctgtggat ctggggccca gtggtccaga 76081 gcgtggcccg tgccagaatg cccaggttca aatcccatct ctgtggaatg cactctgaag 76141 aaacgacgtc aggtcatggc cctgggcaag ttacccacct ctgtgcccca tttctctcca 76201 tgcaggttgg aaagaatgac acagctgctc agggaagcac ccataccccg agtctgctgg 76261 gcctttgagc tccagggcca acacctggcc ctataccctg gccgccccca gggctccagc 76321 cacctgagcc aagcttctca cccccaggcc tccccagccc catcaccatt tcctcctccc 76381 tcatgggccc ctcagtctta ccgtcctccc acttccgcca gcccctctgt ggatggccct 76441 gccacagggg gaacagtgcc ccagcagaaa agccccaggg tcgcctggat tctcctgcct 76501 tcgcctcctg ccctgggtcc aacgcacagt cttcagaatg ctgcctcccc agagcgcctc 76561 tagaacgcgc ccgcctctct ccatgcccat gaccaccact ggaccaggcc tccccacatt 76621 ctctgcccca gtgccaccag cttgcagttg cccatggcga gacaagttca agttctcaag 76681 cctggccaca aagaccccac ctgatttgct gacctccgcc acaccctggg agacggccct 76741 tgaccaacct tgtgttccag tcggtgggaa acactggttt ttcctcaacg ccacgtgggc 76801 atgtcacgtc caggcctggc ctggggtagg cctcagctcc atctcctgcc caacgcctgc 76861 caaggttcat ggttcaaggt catgtagcct gacggttaag tgaaggggcc ctgtgattga 76921 ctcagccttg ccacatacta gctgtgtgac tttattcaag tttcctaacc tcctgtgcct 76981 cagtttccca acctgtgttt gtttctggat ggtgccagtg ccaagcccaa gcctggcatc 77041 cagtaggtgc ttcgtgccag gttatggaat gcccacccaa taggagggcg ctgccgtctc 77101 tcacctgcca ggtgcccacg gtcagtactc catcatgccc gctgcggcca gcagagggca 77161 gcaacatcct gccctgtgcc attgggcaaa ggggcgggcg gagagaggct gggctgcctg 77221 gccgggcttg ttgggatgag tcggactggg cacgattgca ttgtggggag ggaggctggg 77281 gctctgtgag acgtgctcct ggcaccgcca gctgctactt ggccctcgcc ggtggcccac 77341 caggtaggca agagctgggg caggcttggg cttgggcttg gggtgcaggg gctaccagcc 77401 tcagggcaag gcatcagggt ctcagggcca actggctggg gtcaggcaag agcgctcggg 77461 tgtcggccca gcctgacgac ggcactaggt gtctgggagc acggtgtccc ccaagtggta 77521 gccctggggt taggaggtgg ggaaccatgt ttcttccccg aaatgggaaa gccaacgcca 77581 gcctcgggca gggtacaggg tagagaggct gcccgcctgg ctccccaccc ccacagccac 77641 ccaactggcc cagccaggtg acagcagccc actgccacct tgccctggag cctactgctg 77701 ccgtgctgtg ggggaggcag agaccccatc cccagtgcac ctgctcctgt gccactggcc 77761 agggtggcct gtgagggtga cagggggtct agctgctagg tgcccctccc ttccccatcc 77821 cctgcatgga tggcatctag gccagcagga cggtgcttca gagggtgggg tggacagccc 77881 cggcctgtgg accctctgcc tcagcccatc tcaagcagcc tcctcccgtg gtcctgctcg 77941 ctgccaccat gctgtacctg gcaccacttc accagtcccc aggggcaggg ctgggctggg 78001 ggtgtggagg aagatctttt ggacaaaggc aggcattgag ggccaggact ggcacttcac 78061 ccaggcccca aggctaccag ccggcatttg caggggctcc aaggtccacc ccgtgccctt 78121 cacagggacc cggggagcca gggaggggca tgcttagtga cacaggggtt gcagggccag 78181 agggggaggg ctggctggag gaaaccccgg ccctttgtcc ccggatcccg ggggtggctc 78241 cctgagggtg ggaagtggca aggagccccc tgtccagaac agcccagtgg gggtgacagc 78301 actagcgagt ggggccctgg cctggcctct ggacagaggg aacattgttg tggtggaggc 78361 agggctgcct tggcctggcc actattgaca aagctgccag ctgtgaggga gcccggtcgc 78421 taggccctgg gcgccacccc caccgccgcc cctccttccc caggccagct cgaatggggc 78481 tgagcctcga ctgtctccct cccactcagg acaatgcccc cccgcagcca tctcatgccc 78541 atcgccactg ccctggggca gctgaactga gcgtatgtgc cacgccgccc aggagacccc 78601 tctgctgcac cacttcatgg taagtgccca ggcccggtgt ccccagaaat gttcctggga 78661 ggggcagggt agggcaggat ggctgtggcc aggcccagga catacaggct gcagcctccc 78721 tcatggcaga cactgcccca gatagcccgg ggtccgcagg aagtgtccgt gctcttcctg 78781 cctgtgttcc tgagataagc ggctcctggg ccttcctcca ggcgctgctc tgccctgagt 78841 cgcagcggcc ccactcctgg tcggcatgtg accacacacc cagggcagca tctctctgtg 78901 gagggcagga ggggccagag aaggggctgc tcccacctct caccctcggg ggctcacccc 78961 ggggtcccac caggcacacc caggaaccat ctccagtctg gcacccctcc ctgccccttc 79021 cctgacccct cccccagcca gctctcctgg tctgctcttc tcctgggaca ctcctacact 79081 gaggccccct gtgccaggcc ctgagagacc caaactgctg gggcccatgg ggcaggagca 79141 ggcctccccg cacccctgaa gcctgtgttg tctcttgaca cccctggctt ctcccagcac 79201 ccccgctgcc ttgccccagt gcttcctcgc cctggagctg ggcgccagca tggagctcac 79261 ccctgcctct tcgctgactt gctccttgct cagcccgcgg ctgcctggct ctttccccca 79321 gctgcggagg gttcctcctt gcagccggcc ctggctgccc aaggcaggtt ttctctccct 79381 gtctggtccc ttggggcaca cggtggcctc actcctctca cctgacttag atctctcttg 79441 tctgggtggt gggcgcaggg gcgcgctgtg acacagcttc cgccagcctt ccagctcccg 79501 cggcctccct ccctcccctg ccctctctcc atccctgccc cctctgtgac tcacagtccc 79561 tctgttagcg ctcttgcggc tttccaggac ctgcccccct tcaacaatct acatgcatat 79621 aaagtaggag atgaaagaat tctgtccttg tgtgcttgtg cgtgcgtgca tgcacccctc 79681 cctgcctctt gctccagctc ttggacactg tctttgtccc tccttatgtc cccctctctc 79741 tgtgccccct ctgtctcctt gtctccgctc actctctctc attctccatc tctctgcctc 79801 cctccctccc tggccctgag caccctggcc tctgtccacc ccaccttcca ccttcctctg 79861 gtcctctatc tgccttgcgc tcccactgcc ctgacctgtg ccctctcaca ggcccccgtg 79921 atggctcgct ggcctccctt cggcctctgc ctcctcctgc tgctgctgtc cccaccgcca 79981 ctgcccttga caggggccca tcgcttctcc gcacctaata ccactctcaa ccacttggca 80041 ctggcacctg gccgaggcac actctatgtc ggcgcagtga accgcctctt ccagctcagc 80101 cccgagctgc agctcgaggc cgtggctgtc actggccctg taatcgacag ccctgactgc 80161 gtgcccttcc gtgacccagc cgagtgccca caggcccagc tcactgacaa tgccaaccag 80221 ctgctgctgg tgagcagccg cgcccaggag ctggtggcct gcgggcaggt gcggcagggc 80281 gtgtgtgaga cacggcgcct tggggatgtg gccgaggtgc tgtaccaggc tgaggaccct 80341 ggtgacgggc agtttgtggc tgccaatacc ccgggagtgg caacggtggg gctggtggtg 80401 cccttgcccg gccgggacct cctgcttgtg gccagaggcc tggcgggcaa gctgtcggca 80461 ggggtgccac ccctggccat ccgccagctg gccgggtctc agcccttctc cagcgagggc 80521 ctgggccgcc tggtggtggg cgacttctcc gactacaaca acagctacgt cggggccttt 80581 gccgacgccc gctccgccta cttcgtgttc cgccgccgcg gggcccgggc ccaggctgag 80641 taccgctcct acgtggcccg cgtctgcctg ggggacacca acctgtactc ctacgtggag 80701 gtccccctcg cctgccaggg ccagggcctc atccaggccg ccttccttgc cccgggcacc 80761 ttgctagggg tgtttgccgc gggcccaagg ggcacccagg cggcgctctg tgccttcccc 80821 atggtggagc tgggtgccag catggagcag gcccggagac tctgctacac ggcgggcggc 80881 cggggcccca gcggcgcaga ggaagccacc gtggagtacg gcgtcacgtc gcgctgcgtc 80941 accctgcccc ttgtgagtgg catgcccttc catcccccct ctctgtggat ggctggcatc 81001 tttcagcact ggacaggggt gcctgcccat gggaagtgcc agccaacctc tcagccaggg 81061 gtcagctgag tctccaggtc tcctgcctgc tctccaccca tttctttcca cctttctccc 81121 tctggatgct atgagcttgc caggtgccct ggcccttcgg ccctgattgc tgggccttgg 81181 ggtcatccca gggtgcctgg ctctcctccc gctggcagcc tgggtacccc ccctgactgc 81241 ctctctgctc tcttccgggg gctgattctc cacttcctgc cacgtaggat tcccccgagt 81301 cgtacccctg tggcgacgag cacaccccca gccccattgc tggccgccag cccctggagg 81361 tccagcctct gctgaagctc gggcagccgg tcagcgccgt ggcagctctc caggcagatg 81421 ggcacatgat agccttcctg ggggacaccc agggccagct gtacaaggtg agggcccggc 81481 cttgctgtcc ggctgggtgt gcccctggcc acggaggctc aggaaagccc ccagcaactt 81541 ccacctctta gcccgcatcc cacggttggg tcaggggact ttgccacaca gtgactgctg 81601 gctgagcact gccccctgca gcccggcctt caccagctgg tggtggctca cggtgaaggt 81661 ggctggggtg gctgggtgag gtgggcaggg tgtggtgcct cctggtgacc ttcctgtctc 81721 cccatagctg ccctcccacc tggcagaact gtccctggta cacggggcct cacccagggt 81781 gccgctgctg ccccggtgtc atggttccct ggagagtgtc ctaggcctga gggtgctgag 81841 tgggggaagt gatgtgtcct ccctcgatgg cccggcagga aggggaactg gctctaaaaa 81901 ggagccccca ctaggaggag ggcgtgtccc tggaacaggc aacctttggc catctggccc 81961 agcctcgtcc ttgtcccccc actcaggtct ttctccacgg ctcccagggc caggtttacc 82021 actcccagca agtggggcct ccaggctcag ccatcagccc agacctgctg ctggacagca 82081 gtggcagtca cctctatgtc ctgactgccc accaggtgag ggccatcctg gggctgaagg 82141 ggccagcaca cgcggcccaa gtctcgtgcc cattgcctct ctgctctgct gcccctccag 82201 gtggaccgga tacctgtggc agcctgcccc cagttccctg actgtgccag ctgcctccag 82261 gcccaggacc cgctgtgtgg ctggtgtgtc ctccagggca ggtgagcacg gggcctgtgc 82321 ctaggctagg gccaacgggt ggtgtgtgcc acgaggctgc ccctggaagt agccctaggc 82381 gtgccgggag gccaaccctg cctgttggag agactcaggg ccaccacgga ggtcacccat 82441 cctgccccat ctcactgagc cgtcaatctg gttggcaccc cccttttccg acaccacccg 82501 gcctgtggac tcctcatccc agcagcccca gccagggtcc agaaccccac acagctgtcc 82561 tccagccggt gctccacacg gcagcccctg gaatccttta aggcagccct gagtcactgt 82621 gcgtccctca gcccagccgc acctgcctct cagggctgga gcccacgcct tggccatgag 82681 gccctgcctg cttggcccct ttctctctgg acacagtccc cttcacgcct cctgctcatt 82741 gcaccccact gcactccagc tgggccggcc tccccaggcc cctgttgccg gtcatccttg 82801 gagtctagga cccccacggt gacctgatgg acccctgcct ggcaggtgta cccggaaggg 82861 ccagtgcggg cgggcaggcc agctgaacca gtggctgtgg agttatgagg aggacagcca 82921 ctgcctgcac atccagagcc tgctgccggg ccaccacccc cgccaggagc agggccaggt 82981 aagccgccca ccaccactgg gccctctggg cagcatgacc actgcctgga gcatgaaccc 83041 cgcctgcctc ccgtcctcct tcctctccca tggcctctgc tgccccctct ctctggctac 83101 atctcccggt ttctccccgc tgcatcccag gtcactttgt ctgtcccccg gctgcccatc 83161 ctggatgcag atgaatactt ccattgtgcg ttcggggact atgacagctt ggctcatgtg 83221 gaagggcccc acgtggcctg tgtcacccct ccccaagacc aggtgccact taaccctcca 83281 ggcacaggtg agtggcccat ggggtagggg gctgggatgg aggctggagg ggtaatgagc 83341 cacctgacct gtcctgcccc cactgtcccc tcccagacca cgtcactgtg cccctggccc 83401 tgatgttcga ggacgtgact gtggctgcca ccaacttctc cttttatgac tgcagtgccg 83461 tccaggcctt ggaggcggct gccccgtgag tccctgggcc tgcctcctgg ggtaggggtg 83521 gcgaccccag agggcactca gttgagcagc caccctgccc ctctaggtgt cgcgcttgcg 83581 tgggcagcat ctggcggtgt cactggtgcc cgcagagtag ccactgcgtg tacggagagc 83641 actgcccaga gggcgagagg accatctaca gcgcccagga ggtgggtggg cccgaacttc 83701 gggcagagac agggctgtcc cttctccacc cttcccagac ccgcccagca gtaggccctt 83761 tgagttctga agggctgagg gctcttgttt tcccaggtgg acatccaggt gcgtggccca 83821 ggggcttgcc cacaggtcga aggcctggca ggtccccacc tggtgcctgt gggctgggag 83881 agccatttgg ccctacgcgt gcggaacctt caacatttcc gagtgagcca tcaggaggga 83941 aggggacagg gcagtgaggt ctgccaacag cgtgtccaca ctcctgacag cgtccttccc 84001 cagggcctgc ctgcctcctt ccactgctgg ctggagctgc ctggagaact tcggggactg 84061 ccggccaccc tggaggagac agcaggggat tcaggcctca tccactgcca ggcccaccag 84121 gtgagtggct gccttccaaa ccctcttgcc cccaagcttc cgtagaccct caggggtctg 84181 ccatctttgt ggagagcctg cttggggccc ataacctctg tctgtgggcc ccagtcttgg 84241 gggagaggca gggaacaatc acattcattc tggggggtgg cccagaggag acaagagtca 84301 gaggtgccct gagtgggcac ccagcctgac cccacactct gcccacagtt ttatccctcc 84361 atgtcccagc gggagctccc agtgcccatc tacgtcaccc agggtgaagc ccagaggctg 84421 gacaacaccc atgctcttta tggtgagcct gagggcagcc aggcaggcgg ggcagggtgg 84481 gtggcagaca ggaggcgctc agcacactgc ctgaccctcc ctagtgatcc tgtacgactg 84541 cgccatgggc cacccggact gcagccactg ccaagcggcc aacaggagcc tgggctgcct 84601 gtggtgtgct gacggccagc ctgcctgtcg ctatgggccc ttgtgcccgc cgggggctgt 84661 ggagctgctg tgtcctgcgc ccagcattga tgcagtgagt ctcctgccgg gcccccacag 84721 cccagtggcc cacttctcct gccccgcact gtcctgtcct ccgtgatcag accagccctg 84781 ccccaggccc ccaaacccca gcagctcggc ctggctgggc tggttggctg gccgggcacc 84841 cagcactgca gagtggagcg tgggtgcggg ggaccccatc tgccatcatt tgcctgctgc 84901 aggtcgagcc cctgaccggt ccccctgagg gaggcttggc cctcaccatc ctgggctcca 84961 acctgggccg ggccttcgcc gatgtgcagt acgccgtgag cgtggccagc cggccctgca 85021 accctgagcc ctctctctac cgcacgtcgg cccggtgagg cacttggagg gtgaagatgg 85081 gtggggaggc cctcagagat ggccacccag agataaggaa ggcctgtggc caagaaacct 85141 ggggcactcg ggtcagggga ggggtgggct gtcccctccg tccctgagcc ctgagcagct 85201 cccaggcctc ggctttccag gattgtgtgt gtgacatctc ctgcccccaa tggcaccact 85261 gggcccgtcc gggtggccat taagagccag ccaccaggca tctcaagcca gcacttcacc 85321 taccaggtca gtggcctccc aggtgtgccc tacgcagggg agggtaggag ggagggccag 85381 gaaggacagg cgcctagtaa ggatgtcaag ctgggtctgc ggggagcagg aagcaccgct 85441 gcatgtctag gagggaagcc atggcatgtg agtggaggtg ggcagccagg ccagtctgag 85501 gcacgaagct ctcaggatct gttgccctgg gcacttggtg gcaccgtttc ctgggtggga 85561 agcctgggac aggagcagtg agggcagggg tgggttctgc ccaaagaggc aactggattt 85621 gagagtttag agctccaggt ctgggggctg taagctgtgg tggggaggag gctgtcccaa 85681 gggggacaga gtggcagtga ggatgaaaga gccccacttc tactccccat gcggcagggg 85741 tgggtgggag gaagctggcc agtgtgcagt ggcctcggga aggagggcgt ggtccacttt 85801 ggccccagga ggtgaagtcc agagatgacc tgcgcttcag aaccggaggg ccttgggcat 85861 ggcccagggg ggtgaaaggg ccaggctggg ctgccgtgcc tacccagcct gcgctgttcg 85921 ccaggaccct gtcctgctga gcctgagtcc tcgctggggc ccccaggcag ggggcaccca 85981 gctcaccatc cgaggtcagc acctccagac aggtggcaac accagtgcct tcgtgggtgg 86041 ccaaccctgt cccatgtgag tcccggcctg gctgccgtcg ggtggtgggc actcccatac 86101 gcctcagtgg gggtggtaca tttagagaag tggttgctgt ggccaccccc agtggtccct 86161 gcccttgtca aggcaggcaa agcagggctt ctcctacggg ggttgggcca gggctggggt 86221 agaggggtgg tagagccgag atgagagacc ccactaccca tcctgcagcc tggagccagt 86281 gtgtccggag gccatcgtgt gccgtaccag gccccaggct gccccaggag aagcagcggt 86341 ccttgtggtc tttggccatg cccagcgcac actgctcgcc agccccttcc gctacaccgc 86401 caacccccag cttgtagcgg cggagcccag tgccagcttc cgggggtgag ggtcagcccg 86461 caggccagcc tgtgcaccac gcaggaggga aggtgggatg cgggctgggc tatgttgccc 86521 accgcccgcc cacacccgac tgccatcctg gtacaggggt gggcgactga tccgtgtcag 86581 gggcaccggc ctagacgtgg tgcagcggcc cctactgtct gtgtggctgg aggctgacgc 86641 agaggtgcag gcttccaggg cccagcccca ggacccacag ccaaggagga gctgtggagc 86701 ccctgctgcg gacccccagg cttgtatcca gctcggtggg gggctgctgc aggtgagccc 86761 ctcaccagca ggcgacaagg ctgtccccga ccatgcccat gcccagtggg gaggaggagg 86821 gtaggccgcc atgcccctag cagcgcacag cagagcccag ctcactccac ctgtggtcgg 86881 ccctgaatgc cccacagtgc tccaccgtct gctccgtcaa ctcgtccagc ctcctcctgt 86941 gccggagccc tgctgtacca gacagagccc acccgcagcg ggtcttcttc accctagaca 87001 acgtgcaagt ggacttcgcc agtgccagtg ggggccaggg cttcctgtac cagcccaacc 87061 cccgcctggc acccctcagc cgcgaggggc ctgcccgccc ctaccgcctc aagccaggcc 87121 atgtcctgga tgtggaggtg agggccacct tcaaccctgc cccgccacgg tgctcaggcc 87181 gcctctgtgg gggccagcag cttaggctcc catgtgtgtc ccagggcgag ggcctcaacc 87241 tgggcatcag caaggaggag gtgcgcgtgc acatcggccg cggcgagtgc ctggtgaaga 87301 cgctcacgcg cacccacctg tactgcgagc cgcctgcgca cgccccgcag cctgccaatg 87361 gctccggcct gccacagttc gtggtgagtc cgtgccctgg gtgggcaggt ggaccgggtg 87421 ccgccagcat gcactcagga accctgcccg ccccccaggt gcagatgggc aatgtgcagc 87481 tggccctggg ccctgtgcag tacgaggctg aacccccgct gtctgccttt cccgtggagg 87541 cccaggcagg cgtgggcatg ggtgctgcag tgctgattgc cgccgtgctc ctcctcaccc 87601 tcatgtacag gtgagacccg cccaccccca gcacacttcc ctcctcgcca ttggcaaggg 87661 cgccctggca ggcggctggc ctggccccgg ccctggctga tgctggcccc ggccctggct 87721 gatgaagctg gccccgggct gcaggcacaa gagcaagcag gccctgcggg actaccagaa 87781 ggtgctagtg cagctggaga gcctggagac cggcgtggga gaccagtgcc gcaaggagtt 87841 cacaggtggg tgcggaggcg gggggacagg gtacccacga ggcctgaact cacacctcgg 87901 cgcctcctgg ccccgcagac ctcatgacgg agatgaccga cctcagcagc gacctggagg 87961 gcagcgggat ccccttcctg gactaccgca cctacgccga gcgcgccttc ttccctggcc 88021 atggcggttg cccgctgcag cccaagcctg aggggccagg ggaggacggc cactgtgcca 88081 ctgtgcgcca gggcctcacg cagctctcca acctgctcaa cagcaagctc ttcctcctca 88141 cggtgagggc cgtgtggcgg gagtgcccag tgggcaagga ggtggggctg gggaactact 88201 ggcctgagac aaaggtgggg gaggagtggg gcttcccagg atgagcctcc gacccctgct 88261 cagctcatcc acaccctgga ggagcagccc agcttttccc agagggatcg ctgccatgtg 88321 gcttcgctgc tgtcgctagc gctacacggc aagctggagt acctgacgga catcatgagg 88381 accctgctgg gtgacctggc ggcccattac gtgcacagga accccaagct catgctacgc 88441 aggttggcct tgacctggac ccggtggcgg gggtgagggc ttgccacaga cctgccccca 88501 tcgcaccctg ggcgggctct gtccaggggc gggtggggcc aggaagcccc agcaggtggg 88561 gggaaggaaa gtgagatgtc ctcagcaggg acacaggcac aaaggcctga ccctgtggcc 88621 ggctccccgc aggacagaga ccatggtgga gaaactgctc accaactggc tgtccatctg 88681 cctgtacgcc ttcctgaggg tgaggggcac tgtcccgcct gctcccagcc ctgagtggca 88741 gactcctccc ctcttgccat gagggggcta ctgcctgcgc cctacgtgac gaaggcccgg 88801 caaggggggc tttggaaagg gaaggacttt gaaccctctc cggggctgga gacgaggcag 88861 aggcgagaag ggctatcatc aatcctccct cgcttggctg atgccggctc atcttggcag 88921 tgcaggaggt ggctggtgaa ccactgtaca tgctcttccg ggccatccag taccaggtgg 88981 acaaaggccc cgtggacgcc gtgacaggca aggccaaacg gaccctgaat gatagccgct 89041 tgctgcggga ggacgtggag ttccagcccc tgacgctgat ggtgctggtg gggcccgggg 89101 ctggcggggc cgcaggcagc agcgagatgc agcgcgtgcc agcccgggtg ctcgacacgg 89161 acaccatcac ccaggtcaag gagaaggtgt tggaccaagt ctacaagggc acccccttct 89221 cccagaggcc ctcagtgcat gccctagacc ttggtgagag agccagccct gcccacccac 89281 cccagggacc cttccctacc cctccggcac ctggagcccc tcaactgtgt cttactatga 89341 gtgtggggcg tggagagcag gctgtagact atctgcttcc ctgactccct ccagagtggc 89401 gctcaggcct ggctggtcac ctgaccctat cggacgaaga cttgacctcc gtgacccaaa 89461 accactggaa gagactcaac accttgcaac actacaaggt gtgagcaggg acggggcgag 89521 gcagggcggg gctggggcgg ggcagggcga ggcagggcag ggcggggctg gggcggggcg 89581 aggcagggcg gggctggggc ggggcgaggc agggcggggc tggggcgagg cggggcaggg 89641 cggggctggg gtggggaggg gcgaggcaga gcagggcagg gctggggcgg ggcgaggcac 89701 ggcagggcga ggatggggcg gggcgaggca gggcggggct ggggcgggga tggggggcgg 89761 ggcggggcga ggcaaggcag ggaagggcag ggatggggtg gggcagggtg aggcgaggca 89821 gggcagggcg ggtctggggc ggggaagggc gaggcagggc aggatgtggg cggtgtaggc 89881 gcctggctag gggtcagcac agcctccgct ccccatctct gccaggtccc agatggagca 89941 acagtggggc tcgtccctca gctgcaccgt ggcagcacca tctcccagag cctggcccag 90001 aggtgcccct tgggagagag tgagtccctc ggccctgacc tggggccact ggagtccagg 90061 ctgggcaggc agaactcctg gccatgcatc tgcctcaact tcatccccca ccccaccgag 90121 atcttctgcc catctccttc ctttcttcca gtccccacac tgccccacct tgtcggggga 90181 gtggactggg catcccaggc caggggcagg gggtatagcc ctgaagccag gctcctgtgc 90241 cctcagacat acccacgctg gaggatggcg aggagggggg ggtgtgcctc tggcacctgg 90301 tgaaagccac cgaggagcca gaaggggcca aggtgcggtg cagcagcctg cgggagcgcg 90361 agccagcaag ggccaaggcc attccggaaa tctacctcac ccgtctgctg tccatgaagg 90421 ttggtgcggc ctgggtggct gggcctgaga ggaggctcag ccagggaccc cgaccgagcc 90481 agggtgtggg aggggcaggg gcagcctcag ccgtggatgg cccccacacc ctgccctcca 90541 cacagccctt atcccctgcc tcgcagggca cgctgcagaa gtttgtggac gacaccttcc 90601 aggccattct cagcgtgaac cggcccatcc ccatcgccgt caagtacctg tttgaccttc 90661 tggatgagct agcagagaag cacggcatcg aggacccagg gaccctgcac atctggaaga 90721 ccaacaggtg cctttcctgc tgccccaccc ctgctgtgca tatggtccac tgagtcccag 90781 agagaccagg acattcccag ggtggatgcg cccacctggg gtttctggaa cttacaggaa 90841 gatctagggc ccaggtcacc taggccacca ggccagcttc cagcggcccc tggctccgag 90901 tgtgttgcca gtaggctgga gtacatgggg cggagatcac gatggcaggc cagggcctca 90961 cgcccacgcc tgccctgcgc ccccagtctg ctgctgcggt tctgggtgaa tgccttgaag 91021 aacccacagc tcatctttga tgtacgggtg tcggacaatg tggacgccat ccttgctgtc 91081 atcgcccaga ccttcattga ctcctgtacc acctcggagc ataaagtggg ccgggtgaga 91141 gcagtgccag cagcagcagc tggcaggggc ttgaggagga aaggcttatg ggggaagcct 91201 agagggctgt gcacagagct ctgggtgggc agtggcagca tcatgggggc accttcacct 91261 ccgagctcat gcctagcgcc tcccctccct ccggagcagg attccccagt gaacaaactg 91321 ctctacgccc gggagatccc acgctacaag cagatggtgg agaggtgggt gtcagaggca 91381 tcggggctgc ggggaagggg gctgccccac ccctaacgaa gtctgctcct ccaggtacta 91441 tgcggacatt cgccagagct ctccggcgag ctaccaggag atgaactctg ctttggctga 91501 gctctccggg gtgaggcatg gcccgggggg tgcgcctgtc cacacgtggg tggaaagact 91561 agcagagcag aggggggaga cttggggctt gaggaccagg ctgggacctc actgcccccc 91621 tccacgtggt gcccccagaa ctacacttct gctccccact gtctggaggc tctgcaagaa 91681 ctctacaacc acatccacag gtactatgat caggtgaggc ccagggcact ccggagggga 91741 ggcacagtgg agcgggaggc ccgtggaccc tcccggggga gcaggggtgc cagcccatgc 91801 tggcggggcc caggctgggg aagggactcg gctttcattc tgattcccca gggagacgcc 91861 aggcagcccc tgctggatcc ccaggctcct gcggtgatgg aagcagggtg ggtggccctg 91921 ggccagcagg cagaggggca ggctcagaca ggcaccctcc tctgcccggg cagattatca 91981 gtgccctgga ggaggaccct gtgggccaga agctgcagct ggcctgccgc ctgcagcagg 92041 tcgccgccct ggtggaaaac aaagtgactg acctgtgagc tctggctcag acagcagcaa 92101 gccggatcca ccaacaccgc agcgccttat gaccccggaa ccgagccagc cactgagggg 92161 agctggcaga gcctgggggc acagggtgca aagccaggca ctgtgcccag cagtgggctc 92221 cctgcctgcc acctcccctg ccagcccacc caccttcccc ccacctgaga ttgtttctaa 92281 tttataagga tccccctcct tccccctctc cccattgtat ttatttgcct gctggaaaat 92341 cacatccgga aataaaatag aaatatgtct ttttatttta ttttgagacg gagtctcgct 92401 ctctctccca ggctggagtg cagtggcgca atctcggctc actgcaacct ctttttcccg 92461 gagttcaagc aattctcctg cctcagcctc ccgagtagct gggactacag gcgtgtgcca 92521 ccacacccga ctaattttgg tatttttagt aaagacaggg tttccccatg tcggacaggc 92581 tggtctcaaa actcctgact tcaggtgatc tgcccgcctc agcctcccaa agtgctggga 92641 tcacaggcat gagccaccgc gcctggccag aaatgtcttt tgacaaaggg cccaatcccc 92701 agcccctgct gttccctgca ggagctgact cagtgctctc ctgactgaga gtccctcagc 92761 cccatgccta taacctgcta cccacagaca aagagaggcc atcaggtacc cgggcttcac 92821 tttccctccc actctctgac ctcgctggcc cctccagcaa tgcctggccc acccctacca 92881 gttcacggac cctggtacag ggacactgca ccaatatctt gggacaaaca ccctctctgc 92941 cgagccccct ccctttcatt gggactcctc tgtcaggccc cagctggcct cccggctcca 93001 ccctcgcctc tccattccct tgtccttgaa gtggccagag tgatcttttg ctgagcagaa 93061 gctcagtgac caccccccaa ccccatatca cccaagctcc acattaagcc tacagacacc 93121 aggcctctag tcctcgccct cgaaccatgc ggcagccgtg gccagcacgt gaagggctgg 93181 aacgcgtctt ggccgcaggc cttgccccac tgtgccctgt gctgccctgg atgcttctgg 93241 ggacccaatt tctcctggtc tctcatgatg cagcccctgg gtcacaggcc cctcagtcac 93301 caccgcactc atggggaagt ggctatgggc ttgcttgtcc ccagcagtga gcagcttcct 93361 gctgcctccc tcatgtctgg ctatggagca gcccacgaca ggcaccttgc aaatggtgga 93421 gaggaagagc aggcgctggg agccagagcg ccaggcagcg tcgcaggagc aggttctggg 93481 ctggcaagct cccaggcctc ccctcagcag gcctgttgcc tctggtaggg gaaatgaggg 93541 gtccccagtg acgccacatt cagagggcac tggactccca gttgtcctag gggctgatga 93601 aacatagaag aagcaggaag tggctgggag cagccaggga cagggttccc atgggaatga 93661 gaagggcacc tccaggagga tgcccacagg ggcacaggtg gtgggggctt ggcctaagga 93721 ccaggttgcc aagtggagca catccccaca gtccgcaccg gagaacggga gcccgacctg 93781 gggtacgggc tgcttcaccc ccgcgtggac ctccccctca gctggggtgc ggaggctcag 93841 cctgggggcc cagttgcccc tggtccccaa gcagcagccc cctctgcctc cagggtgggc 93901 ctggagcagg aggggggcct ctgcccgggg gccgggctct ctggggccgc cgtcgcgccg 93961 ccccctcccc gcaccgcata ccttgcccac agccgagcag ctgggaggct atttataaag 94021 gcgggtgaga tcagcggccg gccaaggcta taaattcgca ggccgcggcc gggccccaca 94081 ggagcagccg cccggggcac cggagctgcg ggctgcgtgg ccgggatgag cgccagcacg 94141 ggcggtggtg gggacagcgg cggcagcggc ggcagtagca gcaggtaggg ctcggctggg 94201 gcacccggag cccctggcgt ctctcatgcc cactgccact caccccaccc gcagctcaca 94261 ggcctcctgc gggcccgagt cctcgggctc cgaactagcc ctggccacac cggtgcctca 94321 gatgctgcag ggccttctgg gctccgacga cgaggaacag gaagacccca aagactactg 94381 caagggtgag acttggcctt ggggacatgc ggcctcacgg ccactgcagg gacccagggc 94441 agtcctgggc ccacatgggc cagatggtca atggggccga ggtgttcggg ggcccagggg 94501 agtgagaacc ccctccaccc caatctacat ctcccctggg caggcggcta ccaccctgtg 94561 aagatcggcg acgtgttcaa tgggcggtac cacgtggtgc gcaaactggg ctggggccac 94621 ttctccaccg tctggctctg ctgggacatc cagtgagtgc ctccttcgcc tccggggccc 94681 agcactggct gggatcctgt ccctgggctt gcttggggcc accctgatcc ccgccgtggg 94741 tcctgccagg gccacagcct acaagggtct cggtattgca ggcgcaagcg ctttgtggcc 94801 ctcaaagtgg tgaagagtgc ggggcattac acggagacag ctgtggatga gatcaagctc 94861 ctgaaatgtg tgaggcacct ccctacccca ctcccagctc ccctggagct gcctggggcc 94921 tggcaatgcg ggtgcaaggc ctgccggggc tctgtggggc agggcggggc tccctgaggg 94981 gcagcctcca gctggctgtg cccaaggggg aggatctgga ggaacaggcg agggacagga 95041 ggggttggcg gcctttcttc cagcagggcc cagctggagc aggagaaggg tacactgaag 95101 ggagctgtgg gcttcagggc agggtggaac catctgtggc cccttggctt ttgctccagg 95161 tccgggacag cgaccccagt gaccccaaaa gagagaccat tgtccagctc attgatgact 95221 tcaggatctc aggagtcaat ggagtccgta tcctttgcag gaagagcaaa gcagtgtggc 95281 agccaagggc cggcaaatgg gggggccctc gctgctagag cctgtctgca gacccgcaca 95341 atgggcttgc atccctccca gtaacgggag cctctggcac gaccccgccc ccaggtaact 95401 gtgtttacgt ggaggcagta acaagctagc gttgattgtc catggacgtt gctattatcc 95461 ccaaacggca ggtgctgtct ctgtggcact gccttccctt tgggggactg ccaagaggcc 95521 ctatggcata actagcatgg gagtgggctg gcccgagagg cctctgtgcc tgctcctcca 95581 gcgccacctc taggatgacc acaaaatgca ccatcccaac tggcatacct gttgagactg 95641 acagggacac tatcgatgat gacaccagca taatggtgac caggactgcc caggggaggc 95701 cctgcctccc cacctccaca cggcccaagc cttgctgctc ccctagctga gggtgggtgg 95761 ggcctgtgcc gcagctggtg tccactggcc gcccttcact cccagcccag atgtgtgcat 95821 ggtgctggag gtgctgggcc accagctcct caaatggatc atcaagtcca actaccaggg 95881 cctgcccgtg ccctgcgtga agagcatcgt gaggcaggtg agtgccaccc actgggctgc 95941 ccagcctggc ctgggcggga gttggaggag gtcaggtgcg actctctgca ggtgctgcac 96001 ggcctggact acctccacac caagtgcaag atcatccaca cggacatcaa gcccgagaac 96061 atcttgctgt gtgtggggga cgcttacatc aggcgcctgg ctgccgaggc cacggagtgg 96121 caacaggcag gggcgccgcc cccctcccgc tccataggta ccaagggccc acatggggct 96181 gggtcggggc ctctgggcct gaacccctgc ctgagtctct gttcctccct gtcccccccc 96241 accgctcccc acctgcactc ccagtcagca ctgcccccca ggaggtcttg gtaagttggg 96301 gggcccctct ctcccatgcc tcctctccca tctgagccaa ctggaggcca tctctggagc 96361 cacagtggct ccacccccca ccttcacgca ctcccacggt ggtaatcccg aaaggctggg 96421 tggctgggct gacggtaatt cccggggggg gtcaagtgcc ccaaactgct cttggtgaaa 96481 ggatgctgtc ttccccgaat ggccacttcc gcctgcctta gcttgggctg agaggggaca 96541 gagagcaccc tgaggcgggc cggccaggtc ttcccactcc taatggagct gtggggagtg 96601 gggccacagg cggggaggca gggagagtag tgagtagctg gtgccaaggg gcgctggcgc 96661 cacattctgg tgtccatggg agccctgggg cccggagagg cctcttccct ggcggctgtg 96721 cagggaaacc tccacttcat gctgactggg gcgggcgaca ggaaccctgg ggtgaccctg 96781 gctctgacag cagaccggta agctgtccaa aaacaagagg aagaagatga ggcgcaaacg 96841 gaaacagcag aagcggctgc tggaggagcg gctgcgggac ctgcagaggc tggaggccat 96901 ggaggctgcc acccaggctg agggtgaggg gccacagagg gtgatgggcc gtggagcgca 96961 gcaaaggctg caagacatct gctcagcagc tgcctccacc ccgtctcccc agactctggc 97021 ttgagactag acgggggcag cggctccaca tcctcttcag gctgtcaccc cgggggcgcc 97081 agagcaggtc cctccccagc ctcttcctcc cccgccccag ggggcggccg tagcctcagc 97141 gcgggctcac agacctcagg cttctccggc tccctcttct ctcctgcctc ctgctccatc 97201 ctctccggct cgtccaatca gcgagagacc gggggcctcc tgtcgcctag cagtaagttg 97261 ggtggcaagt ggtgggcagg cagggctggc agtagtcgga ccacttcagt ctccctgctc 97321 tgccttcccc agcaccattc ggtgcctcga acctcctggt gaaccccctg gagccccaaa 97381 atgcagataa gatcaagatc aagatcgcag acctgggcaa cgcctgctgg gtggtatgag 97441 caagtgtggg agagcagagt ggggggccct gctccaaggg tggaggcaca gggccgctct 97501 tggggagccc taccccagtc tgcagtgcac gtgaaccgtc ggctgggtgg gcactggtcc 97561 tgcccagtca acagcactgg ggccatggcc aagggcaggg gccactagga agggatcagc 97621 ctcagcctca catcactggg cctgtccctc ttggaggacc tggggacccc gaggctcaca 97681 gcaaacccca ctgagctcct cgggtaggcg gatcggggtg gggcaggagt ccgtgggggc 97741 aggacagcct tggccccagc ccgtccccag ggctcccctt gcttccagca caagcacttc 97801 acggaagaca tccagactcg gcagtaccgg gccgtcgagg tgctgatcgg cgccgaatac 97861 ggccccccgg cagacatctg gagcacagcc tgcatggtac gcccgcccgg gctgccctgt 97921 gcccagggcc agcagcccac cagccagcag cctcacctcc tcccccttcc aggccttcga 97981 gctggccact ggtgactacc tgttcgagcc gcattctgga gaagactaca gtcgtgatga 98041 gggtaagggg tgagggctct gggctcagcc tcccggcctc ccggcctgcc tgcccccaac 98101 ctcctctttc tgcccacaga ccacatcgct cacatagtgg agcttctggg ggacatcccc 98161 ccagccttcg ccctctcagg ccgctattcc cgggagttct tcaaccggag aggtgagggc 98221 ccgggcagcc tcaggccatg tggctgcagg gagggtggga cggggacctt ggattctgca 98281 acagagggaa cactgggtcc caggagccag ggcctaagca gaaggcaggt ccagagacag 98341 ggacagagcc tgacgcccgc tggcctgccc gcaggagagc tgcggcacat ccacaatctc 98401 aagcactggg gcctgtacga ggtactcatg gaaaagtacg agtggcccct agagcaggcc 98461 acacagttca gcgcctttct gctgcccatg atggagtaca tccccgaaaa gcgggccagt 98521 gccgctgact gcctccagca cccctggctc aacccctagg cccggctgtg gctccacctc 98581 cagctctccg tgccttaagg gaaaagcggg acagctccca ccaccctgct gggcgccagt 98641 tctccacaac cacagggcag agagacgctg gagccaggcc cggctctcag agcgtgttct 98701 gcctgagacc cccgtgaggg ctctcggaga aagtgtgtgt attcctttct taataaagtg 98761 tggactgaac atcggtgcct ggagtgaggg aggcccacgc caagtctagg gagaaggtgc 98821 tttattctgg gatctgcgta ccaggctggc tggggtgctg gagtgggaag gggaatccaa 98881 ggagcaaacc aagaaggtcc tagggccagc ctaggcctcc acggcccggc cgttgatgac 98941 gcggatgtgg cggatgacgt cctggatggc ttcagatgtt gtgccctggc ccccgatgtc 99001 cggagtgtgc atctgtagga cacaggcagg ctcggcacac accaggccct gccacccccc 99061 gctgtctcct gtgacgtgca ggactgggat tgccacagga gggaagtggc accagctaag 99121 ggccagaaga tggggccaga tccagtacca tcctcccctg ggggctgtgg tcagtgcatg 99181 aggcgggcta caggaggctg cagggggaga ctgggcgggg cagcagggta gggtgcgagg 99241 ggaacctcac attctcattg tccatggatg ccaggacagc cttacggatg gaggtggcat 99301 aggagtgcag cctagaggat gggacagcca gccttcagtc cctggggccc ggagggctgg 99361 ccaggggctt gcggggcact gggagccact cacttgaggt ggtccagcat catgcagctg 99421 gccagcaggg tggccgtggg gttggcgatg ttcttattgg cgatactctt gccggtgttc 99481 ctcgtagcct ggggaggcaa gacgaaggag agtgggtgga gggcagaagg atgccgggca 99541 gtgactctgc tcctgtgaca cgtccagaag aagccactcc acagggacaa aagcccacca 99601 gcgggtgcca ggggttgggg gaaggtgtgg ggacagatag cttatgggga caggctttcc 99661 tcgtggggtg gtgaaaatgt tctggaatga cctggtggtg acggcagtac aaccctggat 99721 atccaaaaaa ctactaaatc atgcattttg aacgggctaa ccggatggga tgagtgtgcc 99781 tcactaagtc tgtgaacagg agaaaggcgg caagccgcgg cactgccacc ggcactcact 99841 gtttcaaaca ccgcgtacac atggccatag ttggccccag ccacaaggcc tgggcccccg 99901 accagtcccg cgcagacatt gttgacgatg ttgccataga gattgggcat caccatgaca 99961 tcaaactgct ggggccggga caccagctgc gggacaagga gggggctggc tgaggagcag 100021 attcggaagc atgggtgaga gaagcccagg gcactcaggg cagggggaca gcactgggca 100081 ggctaatacc tgcatggtgg tgttatccac aatcatgttc tcgaaggtga tctgagggta 100141 gcgggctgcc acctccctgc agcactggag gaaaagccca tcgcccagtt tcctggtggg 100201 gggttaggaa taggacacca gcttggccat cgcaacagtc agccagaggg acggggcgtg 100261 cagagggccc caggaggctg gggtctgaat tctaacctgt ccctcaccag caaaatggac 100321 agaatcccac cccacctccc ctaagtgtgg gttctggcaa ggcctcgagg gcaacagggc 100381 actcgcctag ggtggcacct cggtgggtag ctgggctttt ggaccacaaa tccctcagag 100441 caggcccaag caggccactg cagctgcttc tggactgctc gcagagaggt caggggacat 100501 acatgatgtt ggccttgtgc acggccgtca ctttcttgcg cccgctctcc tgcgccagct 100561 tgaaggcata ctcggcaatg cgcagggact tggccttggt gatgatcttc aggctctcca 100621 ccactcccgc cacactctga ggagggcatg gggagaagag acccatgtgc tactgaagag 100681 cagcactggc caaacaagct ggcgcgaccg ggccaccgtg ggaagcaacc ctgtctgcct 100741 atttctggct tctccctcgg gcacagcccc tgccctctaa gggtacacct gtccctggtt 100801 ccttaagctc tccccttaat cttgacgctg ggggggctca tctcgggccc cccatgcaca 100861 cctcatgctc caggctgctg tactcgccct ctgtgttctc ccggacaatg aggatgtcta 100921 tgtccttgtg ccgggtcacc acgcctggaa ggctcttaca gtggatgacg ttggcataga 100981 ggtccaggct ggtgctggga ggggacggag aaagaggctg ctaggcctga caggtggctg 101041 actggagcca gactccactg gctcacccca ccttagcccc accaagcccc cagcccgcac 101101 acccggatct caccgaagga tgttgtttcg agatttgtgc gacggtggca ggttatggtt 101161 ggtttcgatg ttgcctacaa aacacaacac aggcttagtg gcactgccca tccccgcccc 101221 acctcagcag ggcagctgaa ggctgccggg gtcagaagac ttggggagca aaaatgtttc 101281 ctgaacacta gaatgctaga atgcaacttc cagcctaagc cccacgaggc tacacgctca 101341 gggatctcga agggcacctg gtgcagcagg tgctccatga ataacccctg aacgacggaa 101401 tgaacaagtc aataaatgaa taaaggctgg caaccagcca ggcagccatg ccattctcca 101461 tccaggctct ggagcaacag aaagtgacca ctctttacta acaggttggc cggccaggct 101521 ggagctagca aggtggcctt ggcggtgtta ctgccaagcc cccgcatcaa ggcaccacac 101581 tcagaggcag gcgggggctg cctggagaag gccacggatt ggccggaccc taccctacga 101641 cccaggacac ctggaggaag ggaagcagag gccagtggcc gctggagaca gcagtcagag 101701 ggcgccaagc agaggccgcc tgctggtgag cggtacccca cccagactgc tccactcaca 101761 gcgcgggggt cccctgtggc tggcacaaag gcccctctac ttgactgcca gggcctgggg 101821 ctccccctgg gttcatcttt gctcacacgc tcctcacaca gtgcaagtaa tgggctgagt 101881 gtgcaaaatg agcaatggaa gtagtttaag gacctcaagt gaagccccaa gaatccctgc 101941 ctgggctgtg ccatccacct ccaggctggc tgcagccgag gagccaggca gagggtcgga 102001 actgcatctc tgtgatacct ggtttgtgaa gaaagatgct gagggtcaaa gctgacccca 102061 gcagagagga ctgtctcagg agggaaggag gggtgggagg gaacccatca gggagcccgt 102121 cactggacag gctcacacac agctgctctc actcgaatcc cagcatcgca gcgcaaacag 102181 gcccgaggag gcgtccaccc agtctccaag cacgagggtc ctctctagcc agacaagccc 102241 tgaaatcaga gccgtggggc cccacaagag tcagctcaga gggaggtaaa agaaatcaca 102301 ggggctgggc acgacggctc acgcctgtaa tcccaacact ttgagaggct gaggcgggca 102361 gatcacgagg tcaggagttc aagaccagcc tggccaacat ggtgaaaccc actctctact 102421 gaaaatacaa aaatcagctg ggcgtggtgg cgggcgcctg taatctcagc tactcgggag 102481 gctgaggcag gagaatcgct tgaaactgga aggcagaggt tgcactgagc cgagatcaca 102541 ccgctgcact ctagcctggg cgacagagcg agaccccatc tcagaaagaa cgcaagaaag 102601 aaagcaagaa agcaaaaaag caagcaagca agaaagcagg aaagcaagca agaaagcaag 102661 caagcgagca agtaagcaag aaacgaagga agaaagcaag aaagaaatca cacctagggc 102721 cccatcctgg cccctggccc tggagctcac ccttcagggc cacgcggttc cggcggatgg 102781 ccatgatggc attgcgaatg tcctcttcat cagcattgga actcacgtgc acctcttcaa 102841 agtccactgg tacacatgcg tgcctgaggc acggcagggt cagggaggct ggcccagacc 102901 cccccgcacc tcctccagga agcctgccca ggctacctct ctcctgttgc tacctgccct 102961 gacctggctc tgactgagaa gggactgggc ccaccttcac tgactgatgg ggacacgctt 103021 gaactccaca atgcacactg tggcaggact ggtgccacct tccccgagcc caggccagtc 103081 tggcgggatc acaggccatg ggacgcaagg cctgggctta caggtcgtct tgccatggcc 103141 tcaccgagtg cttctgggcg caatgcccct cttggcacaa agtcctcagg gccccctggc 103201 tgaggacaag gcggggaccc aggagctcca tacctgaaga cggacttgac atgcagcatg 103261 agctctggcc cgatgccatc ccctgggatc atggtcaccg tgtgccgccc gccatactta 103321 gcggacggag gctgtgggag gcagagggtg aaggtgggcc ttcggggatc ccacatgccc 103381 ccagctcggc ccccatgggc tcaagccaat tcccttcttc ccactggcgc tgaccccaca 103441 ggctgccccg tcacaaggtg ccaacttggg gcagccccgt cccagacagg gctgctcatc 103501 cagggctcct gggtcagcag cagcccaccc ccacctggaa tttagttaac cccaaagcaa 103561 ctaagagccc cccaaaccta acaggcagag aagaatactc acaattgttt gttcctagaa 103621 aggatgagaa aggaagaaga cacaatcagc caaggttcca agagcaaggc cgtgaccagg 103681 aaagtgggac cttttcccag ggggatgctg gctagtacaa ctcccagaag aggggtgggt 103741 agcccctgga gaccacgtct cagaggacag gacacaggcc gtgcacacac gcgcacaatt 103801 gcggccacat ggagaaagac acatgcgtgg gcacgggcag gtacttactg aaaagatgtt 103861 cctcgagggg acctcgtggg cgcctagaac ctggcagaga ccaaatgcca gcggttggtt 103921 tgatgtctaa cccagaaagc caagttggaa ggaaaaacgg accgtcccca ctgtgcccag 103981 gcaggtcccc tgtctgtgca gcatggccca ctgccagggg cacaatagat aattagctcc 104041 aaagcctcag tccccagggc cggcccgctg gagcacctgg aaagtagaag gcgctccagc 104101 ctcctgcctc cttccgctct tcagaaaatt tcttaaaagg cgtgtccctg ccctccctcc 104161 tcttttttgc ttctgattat ttttttggcc agttattccc gctcagcgat gtttgtaaca 104221 tgcaaaaact ggcaaactaa ccctggtaca gcagggaaat cattacaaca aaccacagat 104281 ccatgacaac agtcacgtta tcacttcgcc gatcactttt tcttattaca aaagcagcat 104341 gtggtcgcta cagaaaaagc agaataggga gataagaagg tagcacccag cctccggcct 104401 ccctgagaca agcactgctg gccccctggg gaatggccct tcggcattag aaatcgaaaa 104461 caaatgtgta gggacatcca gccatttctg gaaggttgat aacagaaacc caactctagg 104521 aaggccttcg gaaaggactg ctggatgccg aatgtcccca acgccctgac tggaaatgcc 104581 ggacagcttc agagccaaag ctgcctgagc caaatgtgcc aggtgccaag tgacctgctg 104641 tcctggaaga agccccacct agaacacacc tgagacggtg gcagagccaa ggtcttccct 104701 gtcacttggg gatgccctgc cacttgacca ccaggcagca gcaaagtgtt ccagccaatg 104761 gcagaagcat catagccatt ggccaaagag ccatgccttc cagctagaag tccagaaagg 104821 ctcgggcgga gtctgacctc tccctgtctg tgtgtccctg gccggcactc atcagtccaa 104881 gccaggggct aggagagggt gcccacaagg accagagccc cgaggatagc tcagctgatc 104941 tgtgtggcac ctggatctgc aagcacaggc acggtcggga cttttctgtc ttctctctag 105001 cgcttatcaa cgggtcggga cttttctgtc ttctctctag cgcttatcaa catctgaaat 105061 catcttattt gggggctctt gtaagcacgc agaaagcggc ttcgatggac cttcctccag 105121 ggagctttcc ctgatgccct aaggctactg agctgggacc ctaaggccct acttaccccc 105181 ttccctaggt tatcactgca ttgctccctc ctcaacccct gagcccgaga ggaggggcca 105241 agtcagtgtg gtccaatact gtcaccctag gacccagcac agtacctggc ccagagctgg 105301 cacataagga acatttcctg agtgaacaag cacatcagaa aatgaaaaaa gatctggaga 105361 ctagggaggc ctcggtccaa atccaccatc cactcttctt gagatggctt catggcctca 105421 tctgaaattc acagaacgga ggtacctact ttccgtgact gttggaaaga ttaggggaca 105481 ggaggtgtgt gagctgctca cagtagctag tgccacacag cacgcctgcc ccttcacccc 105541 actggcccct caggtcagag ttcaaggctg gtgtccctgg gaggtcacgg gtggacccct 105601 gcagctccca gcagtagttc ctcctgggct tcagctgtgg cagcaagcca atctcctgag 105661 agcatggacc tccctcacag atggcaacat caccaccagg gctgagctga acctctctgc 105721 cacccggtcc ctgtgccagg tgttcctttg ggctttggca aaagacccag agagcgctcc 105781 agatggactt ccctgacctt actgtacaga tgcaggcacc ttccttcagg aggtcgcaaa 105841 ggctctgcct tgtgccctga cttccctgtg aagccaccaa tcaaaggagc caagtgtgag 105901 gcaggatgca atatgtacac gcctcccgct ctcccatcac cctttcatgg ttccatggaa 105961 agaaagaaca tccacacccc acatcttatg cctgctgagg cttctgtctt tggcatattt 106021 ctgttggacc atgtgtcaat ggtggctgca tgttttccta tggcgtccaa atcagtggga 106081 ttcaagacct ggggttcctc tgtaagggtt ctcacaccca ggacggtcag tctcagggca 106141 aggcactgcc cagctaccca ccgttcactg tcccctactc ccaccccaac gcacacacag 106201 gcacacacac aggggaggtc agggaggagg cacacatgtc ttcaccaaga gcctacagga 106261 gcccagttgc tggagccatc agagcctctg gaaagccact atatactgcc gtcctttcag 106321 cagcaaggtg ggagaagtca gcatcgtcaa gagaatttca ttaggcacaa gcagtctccc 106381 ccagaggtca gaaggagaaa acatggcaag tagagatgtc cctggggagt ggccgcaaag 106441 tcacggagtt ctggaaggtc agagttgaca ggatcctttg agaacatgaa gtctcctccc 106501 ctctgggttc caaaggggaa actagaaccc agagagagga ggaaatggcc taaggccacg 106561 cagtggctca gtgtgggagc atggcctaga ggtcaaacct cttacccatc agggccaggt 106621 gctggctggc agctgcttca gatggtttcc agcccaccta ggacctaaat taacctcttc 106681 ttaaataagt gtaaggaaag aagactcgga cctgaatacg caagaccgca cacggagaag 106741 agcagaagca aaacccccgc agcctcagaa accctagctc cttcccattc caggcgcctc 106801 gaggggaagg catctggggc tctaagactg ggggactatg ggggatgact ggcaaagtcc 106861 ggttctttgc caccgggata ggtggcaaag cggtccacga gagaggaaac cgtggagaaa 106921 tttgagtccc ggaaccggat tcccgaagcg gtttgagaaa ggcgggccgg gatacgtgag 106981 tgactgcctc agaagggctc cttcaaggac actaaggagg tgacagccgc cgtggcccac 107041 tctcccggcc cgtccggagg gccccccgcc acccgatgca gaccccctgg gggagccggt 107101 ccagggccaa cttcggccaa tcccctcagt gacagcggag gcggccaatc aaccccggcg 107161 cgaagccctt tccccgcccc tggtggggcc cctagccaat cggactccag actgcttcgg 107221 gtgcggctac cccaccgctc ccctgcgacc gctgccgcgg tcccgtggct ctttccctgc 107281 tcacctccca gggacggcag agaagggctg gcccgagcac cgccttcgcg gcgctgccgg 107341 cgacggtcgc taccttcagc gccatgacgg aaagtgagag cctccgcacg tcccgacacg 107401 cagataccgc tctcgcgaga gttcgacggg gtgcgaagtt tcggggacag gcgcggaccc 107461 ggtactgcgc acgcgcgcgg tcgcaccgat tcacgccccc ttccggcgcc tagagcaccg 107521 ctgccgccat gttgaggggg ggaccgcgac cagctgggcc cctggctcag ggaggggcca 107581 cgtcagtgct gccagagacg tcacaatgcc ggcccagccg ttcggtgcgc gattggctgc 107641 cgctgccact tacgcgtcgc tcttcctcgt ttgcccctcg tgttcatggg agctcgtttt 107701 cttttcctct aggcagagaa gaggcgatgg cggcgatggc atctctcggc gccctggcgc 107761 tgctcctgct gtccagcctc tcccgctgct caggtagcgg cccagccggg gcttctttct 107821 tgcgagcctc tgacccacgg caggtggtgt tgggggcagg agggatgcgg gggtccggcc 107881 tttcagggtc cgtcgctgcg ggccgggcct tcccccgaga ggcaggcggc cttgggacgg 107941 ggggaccaaa gcacctcgcg cacgtttacc ggcagccagg cttttccttg tctgcccata 108001 cgaagaggag tccccgagcc tccatggtgc gcccggcccc caagctgcag cacggaacgt 108061 gacacatgtc ccggctcccg ctccctgcgg ggtcggaccg gagcgaagct ctgtagcgct 108121 gagcgatcgc ggccaaaaac ggggctggag gagccacata agctaggaag ggacctcagt 108181 gaggagaggg agcaagagcg gggcgagtta gatggggagc caaaggtgcg cagggccttg 108241 agtgccaggc cgaagactgg cttttccgga gggccctgcc ttacccttac cagagttttc 108301 ggcagagccg accaggggtc ccgtcgattt gcattcgtga ggggctgagg tctgaactcc 108361 agagtgtgtt taggaggtgg ctgcagcagc cagccccggg cccgggggcc aggactggac 108421 aggggcggct ttctccacat tcccttttct gccgtccttc aggccagtag ccttgctgcc 108481 tgcaacgttt tggcagccta actggcctcc ctttccgcac actcctcatc tgcccctcag 108541 gaattccctg aactatagtc agggacgtat ccagaggaca agcctggcca taaaaccctt 108601 cgggggctcc cctgcgccac aggaagcagg cccaacttag gcaggacaca gaagggctgg 108661 cgggatctag cccctgagaa ctgcctccac gctgttccta gggatctgcc tcctgtggtc 108721 cagacccaga gccttctctg acgtcctctg ctgaagccat cgttctgcca gggcaccaca 108781 tttggacagc gggtggctga gaacattccc actttgggga gcctttgttt cacaccctct 108841 gatgtaggcg gcagccttcc tttcctttgg gcctctggtt atggagaagg ctaaggcaag 108901 gtcctttctc tcacagctaa caagttgtgc ttctggaacc agagtctctc ccgtttactt 108961 cttcaggaaa cgctggggcc agtcttcacc ttctgtcaac tggccggggg cagagaccct 109021 ttctctcatc ttgaaagccc cagtgcattg cctgcctgca gcccccaccc ccgatgtctc 109081 tgctgaggat gccttcagta actcgtccag gccgtttgtg gctctgaatt gaggctggcg 109141 tggcccccgg gcctccgtgt cataggtcca agtgttggca tgtattgtcc agtgaatcca 109201 gcccccacct gtccccagtt cgctcttcct gcacacctcc agaaagccgg ccttgcccct 109261 ccccagcctt ccttcacagc catcccgcct acgttgccat tgcatttgtg accgagcact 109321 tggatctgtc tcgcatatcc cacctggaag gggcaggaag agcgcatggg tccccagctg 109381 atccgtgcta tgaggcaggg ccgggccatc ctgccctcag accgggatac tcgagctggc 109441 cttacccagg catctctccc tcttccccgc agccgaggcc tgcctggagc cccagatcac 109501 cccttcctac tacaccactt ctgacgctgt catttccact gagaccgtct tcattgtgga 109561 gatctccctg acatgcaaga acagggtcca ggtgagacag tggggtttca gacaggaggg 109621 cgggtggggg gtgctcctca ctgctagttg atgggggacc tgtgtcgata gagggagaat 109681 caagattcca actcttgggg tgccggagag atcagggcac ggtgatgcca gatcctagcc 109741 agtgttgaca ggtcaccttc ctcacctgct ttgtgtgctg tgcctacacg aggtaaccct 109801 gggctcacca cccgctgttt cctgaatgag tatctggacc gggagaaagg gccaggagtc 109861 agcccacccc ggttgccatt ggccagtttg tctgtgtggg tgttttgttt ttgtttttgt 109921 tttttgtttg tttttttgag atagggtctc actctgttgc ctcagctgga gtgcattggc 109981 acgatcttgg ctcactgcag cctccacctc ccgggttcaa gcgattctcc tctcagtctc 110041 ctaagtagct gggatgacag gtgctcgcca ccatgcccag ctaagttccc ataagtccaa 110101 agaaggaaat ggggcctttt ttgggtatgg ccctcttagg gtaaagcacc ccttgggcag 110161 cccactgggc acccctgacc ccagcacctc ccttgtagac tcaggaaatc actcagccct 110221 tttgatcatc ccgcccctgc tcacagtcaa cagggttcct atgcgtccag ttaggcccgg 110281 ccatggggat ctggccttgt gcccccgtag ggaagaccaa tgcagagggc cagtcacggg 110341 attggtgagt gttacctggt acctcctgcc agggacactg cagcccccaa ctgggcctag 110401 cctgcccacc tgcaggccgt gtgagcagcg cacagggctc ctctgcccac acccagaggg 110461 ggcagaaggt gaccctgcct ttgttccctc acccagaaca tggctctcta tgctgacgtc 110521 ggtggaaaac aattccctgt cactcgaggc caggatgtgg ggcgttatca ggtgaggggc 110581 caatggttcc cttgctaggg ggctccctgc tcccgggtgt gacctgaagc cccaggggtg 110641 gccggtcaac cagggccagg ggccgtgggc tctggctgcc ggagtgctgc agtgtcggca 110701 ctggtggtca gggtggcccc tccgtgtcca ctctgcccac actctgctca acacccaacc 110761 caggtgtcct ggagcctgga ccacaagagc gcccacgcag gcacctatga ggttagattc 110821 ttcgacgagg agtcctacag cctcctcagg aaggtgagga ctcctgtagc ccactgtgct 110881 cccctgtccc tggggagcag gatgggctgg gttgggaggt gctggcagca agtcctgagc 110941 tgggtggcct ttctgtgatc ctgtcccttc ctcagtgtct cttgcccatt tctctccttt 111001 ccttttctgg ggcttgggcc ggtgttccta cctgtctttc ccctcccctc cccaccccca 111061 cacgccaggc acccctgacc ccagcacctc ccttgcacct cccttgcagg ctcagaggaa 111121 taacgaggac atttccatca tcccgcctct gtttacagtc agcgtggacc atcgggtgag 111181 tggcctggtc cctcctcctt tttggggttg ttgggctgag tgaaggttat cctctccaca 111241 gccccagctc tgctgctggg ccgtgattgg ccagcatgtc ttggttcccc tggcggaagg 111301 tgaccagggc tggctggtct gctcacctgt actcccctga gctggcttgt gatctccttt 111361 ttttcagggc acttggaacg ggccctgggt gtccactgag gtgctggctg cggcgatcgg 111421 ccttgtgatc tactacttgg ccttcagtgc gaagagccac atccaggcct gagggcggca 111481 ccccagccct gcccttgctt ccttcaataa acatcacagg acctgggact gcacaggacc 111541 tggggctgct ggcttgcgtt attgtgcctt cccccgactg ggagagctgg gggcccagcg 111601 tcctcttgtc tgcctggcca gcagaggcac caggcaggaa aggggtgggc tttggtctag 111661 aactcccgtc cctcctccca atgaagcccc ccgtctggtc cccacaaaac cctgtatcca 111721 ccttcccagt gactctctcc tggttctggt gagggcagtg ggcttggcca cctcctcccc 111781 aggtggcccc acaggctcca tggggaaaca gccagcctgg tcttcatcac agctcctgtg 111841 ttggaagccc cgggcccatt ccgcacgcag aggggtttcc ctgcactgct ttcgggcaca 111901 gcaagtgccc ccaccctgcc catgcggcgt gcagctgtct gggcttggcc cctccatact 111961 ccacaccctg accatgccac ctggcttccc gtgtgctgtg ccttcatggg atattgagaa 112021 aatacacacc catggccagg tgtggtggca cacacctgta atccccacgc tttgggaggc 112081 cgaggttaga ggatcacctg aagtcaggag tctgagacca acctgggcaa catggtgaga 112141 ccctgtctct acagaaaaaa tagaaaaatt agctgggtgt ggtggcatgc gcctgtagtt 112201 cagctacttg acctggagag gtcaaggata cagtgagtcg tgatcacagc agtgtgctcc 112261 agcccgggtg acagagtgtg agaccttgtc tcaagaaaaa aacaaaaccc cacctgtgag 112321 gggtacatca cttcctccta gagacacctc tcccaagagc acccgcgccc tcaggacacc 112381 aggcaggcat cccggcctgg caccacgcct ggagttcatc ctgcacctct cccatcagcc 112441 tgattttcag cttggcacat gctggctcct gaccagctaa tggtattcct tagtggcact 112501 gagggcccat gaggcagtga gggtcttgga gtccaaaggg actgggtgga aatcccaggt 112561 gagcccttcc cagccatggg acccaagtcg gtttgctcag ctgcgaggca gtgacagtgc 112621 ctactccaag gtgtcaggga gtccactggg accagccacc aaagtcctga catctgggtc 112681 acccattggc tgttactgga gagaaatgga attctccaga gaggctgcct ttttcttgtt 112741 tgtttgtttt tgagacagag tttcgctctt gttgcccagg ctggagtgca atggcgcaat 112801 ctcagctcac tgcaacctcc acctcccagg ttcaaacaat tctcctgcct cagcctcccg 112861 agtagctggg attacaggca cctgccacca cacccggcta atgttttgta ttgttggtag 112921 agacggggtt tcactatgtt ggccaggctg gtctcgaact cctgacctca ggtgatccat 112981 ccatctcggc ctcccaaagt tctgggatta taggcatgag ccaccgcgct cagccaaggc 113041 tgaaaaggag ggaaggccca aaacttgcct gggaggtagg ggctcctcag tacttcagct 113101 cggtggctag agagagcagc actgaggggc cgggggtgtc aggagcaggt gaggactgcc 113161 tgtgacctgg gtctgtcaga gtctcgggga gggggtcatc tatataagca gggagccctc 113221 ttcccacccc caggccttgg gaggggggtt gggaggggcc ggctgccaag aataggtggg 113281 cttctaggaa gagctgggcg ctcaccccag gacagtgcag caggagcgtt ggggctgagg 113341 ggccacacca gctttgccag cagatgcagt ttctagattc cagaggagac tcagcagcct 113401 gtgcagggcg ggtggctggg gcaggccggg tgcagcaggc accggggcga ctgtcgggcg 113461 cccggttgct ctttttagtt ctggctccac cacgaagcct gatggccgtg ggacccagac 113521 gtgaagttgc tattccaggg catcagtagc caagtgctgc cagccttcct ggcgccacct 113581 ctccctgagc cagccaaggg actccaggct ccaggcccat atgagcccac atcccgtgaa 113641 cactggaggg caccccacca ttggaagaag gggggtgcct ttaggaccat ccaagtgagc 113701 cccccagcag ctagagggag gccaagggga cagtttgacc tccatctcca gtcaactccg 113761 gccaattccc tgtggctggg cccttggagc cttgttcccc caagggacag ggttagggag 113821 ctggtcaagg gttggatgtg ccaagccctc atgggggtga ggccctgagg ggtgggggaa 113881 gccattatct ccagcctctg ccctctgctc cccttcctgc catccagcca gcaagcctct 113941 tcccctcttc cctgctggtg gggtgtgata tgccaggctt tccctcccag agctgctgtg 114001 cggcagagag gcagctggca cccttgtgat ccagccttcc ctgcagcggg actccaggct 114061 tctgggaaca ggcactggga ctttttcccc ccacagcctt cttagcgtct aatccgtcac 114121 tgacttgctg cccagatctc ggcacttaag agaagtttaa acgaaaaaaa attttttttg 114181 agacgggggt ctcactctgt cgccgaggct ggagggcagt ggcatgatct tgactcactg 114241 caacctccac ctcccgggtt caagtgattc tcctgcctca gcctcctgag tagctgggat 114301 cacaggcacc tgccaccacg cccggctaat ttttgtattt ttagtagaga cggggtttct 114361 ccatgttggc caggctgctc tcgaactcct gacctcatgt gatgcaccca cctcggcctc 114421 ccaaagtgct gggattacag gcgtgagcca ccgcacctgg cctatattgt cttctttctg 114481 gagaatttga atgcctcctt gaccacttcc gctccttcat ggcagcactg cccactccag 114541 ggagtgccac tcccatgaga ggctgtgccc atgttttctg cggagttgtc tttgaggccc 114601 tgaccctgcc cagcattgag catggctcct aggacttgat agagcccgaa ggtttgctga 114661 gagctcccag gatggaaggg ccctggcaac gacgcggtta ttcatctatc tggccaagac 114721 atccactagg tgccttctgc aaactgagca ccttgctagg cactgtggga tcaccaaatg 114781 gggagggcag cttccgcctt ccagagtgga cggcatcagg ccctgctggg agggatgggg 114841 ctggaaccca cttctctgtg gcatgtgcca gcagcgggcc ccacctacgg tttggcccat 114901 ggagatgacc gtggcccttg cctggaggct agtctgaagt caggcttctg ccagccttct 114961 gcatctggtc accaccctgt gactgtgaca cctctgaaat gactcaggcg actctggtat 115021 atctcctgga gtgccacaca attgtccatg ggaggtcaca gtcactgctc ctgacatcca 115081 agacaaaccg agggctgcag cgtggctttc agttgaccct aaccctgccc ccctcttcct 115141 tctcccaccc caaactgaca actctttccc cagcccaata agaatgaaca cagggaggcc 115201 acgattcaac caagtacaaa ggcagcttgc tttattcttg agatgcaggg ggggaagggg 115261 tggtgcagtc tgtctgcctc caatctgggg ctctccaagc ctaaggggca ccccacaggc 115321 agcgtagttc acacgtgcaa atcctgagga ggttgggggc ctctccttcc cagtcagccc 115381 accgcctctc actggggtcc aggttgcgga tcgccaccaa gtcctcccca ggccgcatgg 115441 cccgaggtct tccactgctg ctcatggggc ttggtttcta gagccctcac catctaagca 115501 ccttcctcag ttcactcaga agagaagccc ctgctccaga tggtgtgggg cgcagaacgg 115561 gctctgtgac ttcttggtgg acaccctgca ggacccctcc gttcctgcag tcttgctctc 115621 tctgagtcac tggggaaaca ggtgcccccc agaggctccc cgttcccaca cctcagagtg 115681 gaagccacca gaacagctgc ttcccattcc tcccaccttt ccctcccctt ccctcccatc 115741 catccccagc cctccctccc actcttcctc cccaactccc ttcccagcca tcagtggaga 115801 ggggagttcc acttctgcca ggctgcctga aggagctgcc agcccccagg gcaggtccac 115861 aaaacaaaac aactattggc aagcacagaa agggagaaaa accacaaaaa tcaggggtgc 115921 ctttgtctac aaataactgc agtgcgcagc ggcggggaga ggccgcgctc agcagcgaag 115981 ccggacactc tcttgtgctc ctttcagctc agcttccctg tctgcaagta cacgggtcgg 116041 gtggagagag cactgggtgt gtgtgtgcgt gcacgtgcaa gcacacgcaa gagcatgggg 116101 gatggggaat gcacacgcac agctgaaggg gtgcccatca caggtgtgcc cccagtgctg 116161 gccactggca tggtcacttt acttgggcag aaggaagaaa agcgtccctt ctcctcaggg 116221 gcttctctct gctaacaaag ccctgtgcgc acacccagac gaggagatgt gtgcgtgcgc 116281 acagcgagca cgcgcgcgcg cacacacaca cacacacaat ctctatagga gagtgagggc 116341 cggggcccca gggggttccc tgggcctggg ccgtgtaccc gcccgggcag ctcacacggt 116401 ggtgactgag agaagagggt tgtagacccg cttgccatcg gcggagcgcg cgccgtgggc 116461 cagcatctcc tggatggtga tccagttgtc caggatcttc ttgttccgct tcttcatggt 116521 tttgcggtgg ctcagggcaa tgatgttgag ctcgggcttg ctgtcgccat tctgctgctc 116581 ccgcaggcac tccagccggc tctgcatcat gaactcgcgc cgcttccgct gctcacgggc 116641 ccggatcagg tgctgcttcc gctcctcctt gctccagtag cggcccatct tcatctcgct 116701 caccgcgtcg tcgtcggtcg tcataccgct gcgctcctcc cggatcttca gggcacgggc 116761 tttcagcagc cgatctcgca cgggccgctt ggccacgtag cgggttccgt cgctgcgcac 116821 cttcactttc cactccatgc gcggtgcttc agtggccgcg gccgccaccc cgcccacccg 116881 agggccaccg gccaagctca gggggccgtg gcccagctcc tccaggcctc gcgtcggggc 116941 cagctgcacg cagctgtggt agtgctcgcc ctcctggccc tggccgcggt ggcgccgcga 117001 gaggtaaggg ctgctttcag ggcccacacg ctccagggtc aaccccgtct tggggttgcg 117061 gcggccgcgc tcctccgcgt gctgcctccg gccggcctca ggatcccggg agagggaccg 117121 gaacttggcg gggctccccg gtggaggagc tgccttggcg ggggtggcaa cagcggggcc 117181 gggaggggtc cggttcaagt tggagttgcc ggccatggcc cgccgcaggg ggctctcggg 117241 caggggctcc acaagcagcg gggtgctgcg gcagctctcc ccagtgttgt aggcgctggt 117301 gctgtccttg tccgacttct cgggcagctc ggagatgtcg gacagctcgt gcttcttggg 117361 ctcgctggcc gccaggtcgt agaggctctc ctcctccagc agccaggcct tcatgcagcg 117421 ctcacgcagc tgctgcatct tctgcgcccg cagtatgttg cggcacttga attccaggtg 117481 ccttagctcc tcctccagca tggccatctc gtggcccagg ctctcgttgc ggttgacgtc 117541 cagggcgctg ttgcctccgg aggcccgggg aaagaggagg cccagctggt tgccgttctc 117601 caggtggcac ttgatctcca ggagctcacg gtagcgctca tactcctcat ccgtgaggcc 117661 cgggacgtcg ccccctccca gccccgcccc ctcggccagc agagagtcca tgctgaaatg 117721 gaagtcccgg ctctgcaggg cgtccccttg caggccaaac ttgcgcaggc ttcccggggt 117781 gttggtggag ctcgggggtt cgtcccccag caggtcgtgc tcagagctct cttcgttccg 117841 ggtgctctcg tcagtccggc ccaccccgct gtccagctcc tggctgttgc tcaggcctgg 117901 gccggcatcg ggagccccct tctcctcttc gtttccgggc tagagcagaa aggtaagtac 117961 cgctcagtta ggtcccacgg acagggatgt cacgggatgt cacaggagga tcggaagggg 118021 cttcaggcct ggttcatttc cctgccaagc attgggagac cctgcacagc ctcagcgcct 118081 tccacccagt tctctcctcc tcgccctccc caccagcgcc cttttccttc ctcacctgct 118141 gggcaggggg tgatttcagt ttacgagcac gcagctcccc ctcattctca gagccaaagt 118201 catccaggaa gtcatcccgg tcgctgtcct tccacctttt cgccagctgt ggagacaggc 118261 cctggtgtcc ccacagcttc aaggacctcc ccatataccc ttcttgtctg ctctggaccc 118321 cgcacttggc ttgtttcggt gaagctcagc aacctggggg cagcgtccca gccccctcag 118381 ccttcagagg ccaggagggg aaacgggccc tgagagaccc ctgcccaccc gccctccatc 118441 aactccaggt cgttggcaca tctgtgccct agtcacggcc agcagcttcc aagacagaaa 118501 gcctgatgtc gtggccccgg gtcgccctgc caggcataca cacctgactc tcaggtcggg 118561 ccaccagcag ggagatgttg gtgttctctt cctggctcag gatggccacc gcctcttccc 118621 ggttctggac gtctacaccg ttaatctgag gcaggcagga tacaatcaac aagcatgcta 118681 gggctggccc tggggcggac cctgggcggg tgcagggagc ccaagcactc tgggctcggc 118741 tgggctaggc tggggcttga gctctgggag gatgtaacta agtggaaggt accagggaga 118801 gggcggcctg actcttgcaa gagcaaatgt gtgctcagga gagcatggag tggggcagag 118861 ggagaggggc caggaggttt cagagcagca ggggccctgg atgccagcag atcccatagg 118921 cacacgggag ccactggggt gagtgggcag gagagagatg agactcagcc cgggtttagg 118981 acacggcagg ctgctgggcc tcagggttag ctcttagtgg gagagagcta gggactggct 119041 aaagagaggt ggctgggccg acgggcacag cggggccttg ggccaggcct gcagcctgca 119101 agtgctcacc tggatgatgc ggtctccctc acggatccgg ccgtctttgg ctgcaatgct 119161 gttgggattt acctgcagga cacacgcatc ttggggactt ggtgcctaac atgccctttc 119221 ggtagaaggg ggcaggctca cctcaggggc ccccaaagca agctgaaggc tctctggctc 119281 cccttccagc tctgagcccc aggaagcaga ggcagaggct gtggagtagt cacctctccc 119341 cgcccctgcc ctgcactcct aggctcttct gcacggaagc ctgtgctttt ggtcagcccc 119401 gggcattggt gggaagtagg acattgtcca tctggaaggg tggctttagt gggatgctac 119461 tagtagggag gggagcgagg ggccatcccc ctcctgggga tacacaagac atctgcccaa 119521 gatgggttcg ggagaaggct agacagcaac cctgttgcta gttctcaggc ccatctgctg 119581 tcccacccag gcacaggcct ctttctgcag atgactacct cactggacag tggctgaagc 119641 cgtcgaccat catcgcctca ccacgcagca ccctagctgt gatgccctgt ttctggtagc 119701 tgggacctta agggcagcct ctgagccaca ggcaggccat gtggatgggc ctcggggctg 119761 gggcataccc gtctgcatac ctctccgaca taaatgccca ggtcctcctc gtcgtccgtg 119821 cggtagcaaa ccatcaggcc cagcttgtcc cggtggctgc ttttatacag ctccacctcc 119881 tgccacgagg ggtccgggag gaagcagagg tagaaagccc cagaaatgcg tgagtgaggg 119941 agcaggcagc acagcccagc ccctcagccc gcaacccctc agcccgcagc ccctcagctt 120001 cctctggagc cccctgagct cagtttccag gccccgcttt gagggtctct cttaccatcc 120061 cacgtggcgc acgtactcag tggtctccct cccaggaaat gcacgaggca cacacaggca 120121 caggacacag aagacagacg tgcatgggac gtggactcac cgacatggac ggaacacgta 120181 acctcacccg caccctccct gactcagcca gatgcgtgcc aggagggcca cggggttggg 120241 cctgggcagg aaggcacaca ggcgtcgggg cggctcccgc agctggctgc gggccctggc 120301 ctggccaccc ggctcacctc atactccagc tcatccaagc ggtctgcctc ctgcgggccg 120361 ccctccataa actccgccgg gtcataatac tcatggctga ttggggggct gcccaggaga 120421 aggtgggggt cagaccagct ggccatcctc tcacacctca gccccagctc ccaggcaggc 120481 cgatcccagc gtgcctggca cggaggcctg ctctgcttgt gctctcaagg gtgggtggcc 120541 cagttgcctc ctgcagagcc ccttcctggg caacatgcag ccatgcaaag accgggggca 120601 cttctcgcac ctgagtgtgc agcgtccacg cagaggacct gcgcatgcct ctagcgttcc 120661 cgtcccccag accagaactc ctccaaccac gccagctggc agggaccaac ttctccaatc 120721 tccgggtgcc cacccgcccg cccagcactg gccccactca tgcaacgccc ccaggagtta 120781 tccaggccct ggaggacttc tgcagagcag aggggggccc accctacctc caggggctgc 120841 cctcctggcc ccaaaactaa ggttggctgc tcatggccct gtggcctacc tccccgcagc 120901 ccgggctaag attagcagaa tgaggcagcg ctcacgcggt catgagccac caggcctgga 120961 tgaaaggccc ttcttccggt cgggccaaag aaagaagctg aagagatgga ggaaattcag 121021 agcaggtaga ggaatgagac agacaaggga tgcaagaaga ttcaaggagt gatgcttgtg 121081 aggcggcaga ggagagagga tggaggagga ggagctgagg ccctcaaccc tggcacttcc 121141 ggcagcgacg tcccggacag gagggtcctc cctcggccct gggggtagat acaacggggc 121201 gcctgctgga gcccggccca aggctagagg cctggaggtt ttggagccct tgggtaccac 121261 ggtcagttct ggaggccgag gggcctgctt cctgcccgag agggtggcgt gcggatggcc 121321 gctccaggtg gcctgtcctc tctggaggca gggcccgggg ccctcctgag ggcgactcac 121381 cccagcgcca tgatatgctc gaaggtgatg tcggtctgag tgccactgtc caccagctgc 121441 aggtcgtgac aggagctgtc cccccggagg cgggggctgc gtctcagcac ctggatcacc 121501 aggggctcct tggaggagcg cagggcctgc agagtttgct cctgagacag cttggagagc 121561 tccttcccgt tcacctgcgg cagagacacg cctccgtaag aaccctcaaa ggcttgctcc 121621 tcccctctga ggtcaaggca gaggccctag ctggcaggac ctgggctgac aaccagcacc 121681 ccaagccact caagcaagga atctgctgtc tgcagccctg gctacctttg gtgcccctgt 121741 tcccaggcaa gccccactgc tcagccttca tcccactgaa ggccagctgg ggatcaccaa 121801 gtcacaccaa ggactggggg acccagcctc tgtgcactgg gtctggaggc tgtgataccg 121861 ccctacaacg tcctccagac actgcttgag ctctcctggc ctctccccat ccggacccac 121921 ttctttctag ggcggcactg aggccagagc tagcatgaag atcccattga gttccttggc 121981 aatgactgga ccataacagc caacactcag tgagcatctg tgttgtgctc attcagcatc 122041 aagcaagagg atgaccttga ctcttcagat gagccgagtc tctcaacaaa tgtctactca 122101 ctagggaggg ccagggtcag gttgggggac aagtaggaag atgcctaaag atacagtgtt 122161 gtgtcctcac agtgacaact gacttgccca agagaaaagg ccttgatgta caagaagccc 122221 cctggaactg caaagtagcc taaaagaggt gccaagaaag tggtccttat aaagccacca 122281 ctaggccggg tgcggtggct cacgcctgta atcccagcac tttgggaggc ctaggcgggc 122341 agatcacgag gtcaggagat cgagaccatc ctggctaaac acagtgaaac cccatctcta 122401 ctaaaaatac aaaaaattag ccgggcgtgg tggtgggcgc ctgtagtccc agctcctcgg 122461 gaggctgagg caggaaaatg gcatgaaccc aggaggcgga gcttgcagtg agccgagatc 122521 gcaccactgt actccagctt gggcgacaga gcgagactcc gtctcaaaga aaaaaaaatg 122581 ccactactgt tttttgggca cgtgccatgt gctacattct gggaaagacc cctgatgtgg 122641 gtctccacaa aatcctaacc cccaacaacc ttgagggaga tgccattatt gcagatgagg 122701 aaaccatgtc agcaagctca gcaaggttgt ggcactgagt caaggtcatg agccaggaca 122761 ggctgcccgg tggccagaag gaggggtcag ggaggaggag gccatcgcgg gccagggccc 122821 aaggcggaag taggacctaa gcagctctgc gggatgctgt aggggtgaag gaaggacaat 122881 gaagggtagc agaaggaagc ccattacgca cagcgcaggt ggtggggcaa gcagtgccca 122941 caaggtcgaa ggctgcttcc gcagcaccct gggacagtat gctgcgacct tcaagtcctc 123001 cttggggaag ggaggcaagt agagaaggga cgaatgacta aggaccaaga aggactgaac 123061 agactccacc agaattcagg gcacgtgcca tggggccagc aagggtcagc aggaagcact 123121 tctaggagga ggcagctcga ggcaggcgtg cagggtctcc tcatgggctc agcggcttat 123181 ggaagggctg ttctgggcct ggggaggcta cacactagca ttatgcccaa atggctatgc 123241 cccgcaggtg agcgtgggga cagaccttac ccacagctaa gggagcacac ggctggccat 123301 gcacgcctcg gtgggaggcg gggaaggctt ctggagcagg ggacacctgg cctccatcag 123361 gcaaagggaa cagggccaag aaaagaccca gggaggctgg tttgcagttg gaagcaaagc 123421 ccccagtcag cacagagagg gcagggcgag gagacccaga gtccgcctag ggatggagca 123481 ggctgggagc ttgtgccaga tgagagggca gaagggccct gggcatgggc gcaggtggct 123541 gcgcctgcac ggagccctct gcttttccga gttgtgcgaa tattagcaag tttagaagaa 123601 ctctctgtcc tgcattctca cggcggcacc tcccaccccg ggtttctcac agagatgtca 123661 attctgacgg caaaatcttg tcttagtgcc cgcgatacag tagaaccacc gctgtacgag 123721 tcttttcggg aagacagcta tctgaggaga gcccagcttg tcctcggaac ccctgtggcc 123781 tcagcagcgg ctcctgcatt actttttttc tatttgacta tgtccttgta gataatcatt 123841 gaagttttct atgtctccct tctccccaaa attttcatat tctgaaaaga gctttttgtc 123901 aaaatcagtt ttgtccgatt ataaaagaaa cacagttcat ttttttgttt tttgggggct 123961 tttttttgtt gagatggagt cttgctctgt tgcccaggct ggggtgcagt ggcacgatct 124021 tggctcactg caacctccac ctcctgggct caagcaattc tcctgcctca gcctcccaag 124081 tagctgggac tacaggtgtg tgccaccaca cccagctgtt gttttttttt ttttagacag 124141 agtcttgctc tgtcatccag gctgaagtgc agtggcataa tcttggctca ctgcaacctc 124201 tgcctcctgg gttcaagtga ttctcctgcc tcagcctccc caatagctag gattacaggc 124261 acccaccacc acgcctggat aatttttgta tttttagtag agatggggtt tcactatgtt 124321 ggccaggatg gtctcaaact cctgacctca agtaatcctc ctgcctcagc ctcccaaagt 124381 gctgggatta taggcatgag ccaccgcacc tggccatgcg gcctgtttcc ttccagtctg 124441 ttttctatgt gtgccatttt gcagctgcag gccccttaag gtccatccac accaacacac 124501 cagtggacca tcatgttcat catcacatca gtaagggtgg ctttgatttt ttattataaa 124561 gaatgtggca aagacacatt ctggggtcca ttttatacat ttttcatact taaatctttg 124621 tgtaaccagt ggttgtcaag gtggagaaaa tttcttgagg tggaatttct ggatcaaata 124681 ctataaagga aacagctttc agccaagaag gtatttgttg gaagaccata agaatgtcag 124741 tcaggacagg gcatgctgtg cttggtggag aggctacagt tagaaaggca ggatggaaga 124801 gggtgggcgg cagcagagtg tgggttcagg aaggctgtag acaaagggac atgatggaat 124861 ggcctcgaca ctctcagcag cagtggctag agccaactga agctgggaaa tttggaagcc 124921 agttgtcaaa cacagcgatt attggtattt taaccgtaga aatgtacagg taactgaatg 124981 atattaacaa caaaggcaat aaatcctcgg aacacatcac tgtctaatta tcttactaca 125041 cttttctgtc atctgtgctt ttggggttat tcacatctgg tatgggagat ggaaatacta 125101 tgtgtgcatt cccccaactc caagttcagt gacatcgcag tggtggcttg aaatcggcta 125161 tagtggcagc atttacacca gggaaatcgg caaatgctcc aaatcagggc tttttgtctg 125221 ggaacattta ccaccttacc actgcccttg aggagaagcc aacacacaag cagaagggct 125281 cggagctgtc ctcagggttt cttcctgaac tcccggggca ggacagcaag caggcagtgg 125341 gatgctcagg tgacagagat gacagagaac gcacgaatgc ctttctaccc ggaatcatta 125401 ccgagcagca gtgttgctat ggaaactgag gagcgcttgc tgggcgggct ggcggaggct 125461 gggatgtcaa gagggggatg ggcgtgggca cagcctcagc acctcggatc agcagcaacc 125521 agagtgatcg cctgtgacag ccaggcaggt atttgggtgg aagccacggg ctcctgaagc 125581 cttctcgctc tctctctctc tcctcctcat cggtcatgct ggacccccaa ggaagaaatg 125641 agctcccctc tggggaaggg ctaagggcta ccaacaaggt gggaaaatat ggggccatga 125701 gcaaagaggt atagggccag gcacatggta aggagccaca ctgtggccct tctgatgaca 125761 gcagctcttc cagtcctgct cagagccctg cctccccgcc ccacccccaa ccatcctgag 125821 gtagcacact ggtaaagcca cccctgtaat gccccaaccc tcggattgcc tactcgacca 125881 ctccagtctc taaaaggcac aaacactcaa catggccaac cccaggcttg cttcccacca 125941 gactctccct cctgccccac ctgcaaacgt ttccacaccc tgggagctcg ccccaccaag 126001 acagaagacc tagtccaagc cctggccaag caattccaat tggatcaaag acctacaggg 126061 acaaagcaaa acaagagaac aatatcttta tggtgtggga tagagacgaa attctcaata 126121 tttcaaatgg ggaagcaata atggaagaga tgtaaaactt ctagacgaca aaagacaccc 126181 taaacacagt aaaaaggcgg gctcagagga gagccttaca atgcatggac ctcataatgg 126241 gttttccaga atatgcaaag aattcctaca aaacaacaag aaaacacaga cacaaacaac 126301 tcaagaagaa catgggcaaa ggatatcaac aggtgctcag ggcaaagggg acccactagc 126361 aagcacttgg agccatgaga tgagaggaga tgcccttcag aaccaccacc aggaggacca 126421 ggaggatgtg ggcatgaagg ccacagtggc caaacaccag ggacagccca aaggtctgcc 126481 agcagtcgaa caaacaattc ctggtattcc gcatgtgcag ttaatggagg cgcacataat 126541 ggaggctcat tcgtagtatg tacttggtgg aaaacacggc agagaggcga gcagatgaag 126601 tgcgtagggg atgggtgggc acaggcggct tcaatctgct tttaatacct tcttcaagga 126661 aaaacgctca ggccaacagg acccacatca aggctgggtg gcagtgctgg tgctaagtga 126721 tgcgagatga tgccccatac ttctccagat gctacagcaa gtccaggagg cctcaagata 126781 cccaggctgg agctcagcct tttatgtgtg attgtctaat tacctccctt tcattagaca 126841 acaaacccca agagggcttc accctgagtt ctacttccat cccaactcct aacagtttgc 126901 tgaagataaa actttaatat gtttcttggc tgaattaaaa ttttatatat ttctgtaatt 126961 tgttgaatat atattaagga cctgtaattt atctgatcta ccttttaaaa aataataata 127021 ataagagatg gggtttcacc atgttgccca ggctagtctc gaactcctgg gctcaagcca 127081 tctgcttagt ccccccaaaa gtgttgagat tacaggcatg acccagcccc tgagtctgat 127141 gtacccggac tctactaaaa atacaaaaat tgcccagcac ggaggctcac gcctgtaatc 127201 ccagcacttt gggaggccaa ggcaggcaga tcacctgagg tcaggagttc gagaccagcc 127261 tggccaacac agtgaaaccc cgtctcaact aaaaatacaa aacttagccg ggcatggtgg 127321 tgcacgcctg taatctcagc tacttgagag gctgaggcag gagaatcact tgaacctggg 127381 aggtggaggc tgcagtgagc tgagattgca ccactgcact ccagcctggg caacagagtc 127441 agactccatc caccccccac ccccccccca aaaaaaagtc tccttcatca agtgcaaact 127501 tctctccttt ggaacaatcc agaagaaatc cactttgctt ctctagtatg gggagagtgg 127561 ggattgttga ctgagctcat tatcggaaaa ctctcactat tgaatgtttc cttcattgag 127621 agcaacctcc ctcctttgga acaatccaga agaaacccac tttatcttgc aacacttcac 127681 gtacctgaag cccactcaaa tgccccttgt cccccctcgc ccaggttccc agcgactctc 127741 atcttctcca gtgctcccgt gccatcctga ccacgtgctg ggcatcatgc ttcaccttag 127801 cgaggtcacc ccgtggcagt caaccctgtt gactgggtag tgagctgtga gccaagcaaa 127861 ccctcaaagc ctgttcacaa agacccctcc tccatcacgt gctagcacaa tagattttgt 127921 acaaattttc gcatttcctt tcaaataatc gaagtgctca gagcgccaca ctgaggaggc 127981 ggaggagcac aaagatgaaa atacaagtcc ctggcactcc gctgccccgg ggagtgcagc 128041 agcctgtgag cacaccacgg ctcagccagg ggcttgacgt gccccatggc gactcttcag 128101 cacacagatc tacttccttc ctgatttgct gccaagatgg gcccaagagg gctgactcta 128161 ggactgcagc tccaatctca accacagttg acatgaacgt ccttaatttc caccatggtc 128221 cctgctctga acaggggact caagccttag cagggcccct ctgcctgaat gtggcccaag 128281 tggccctcac atcctgcttg ggacagccca gttccccttt gcacactgag agcactcact 128341 gccccaggtc ccctgcaagc ccagttcctc ctcccagccg accccatcac cctttctaga 128401 cacacttagt ctgaatctta ccccagagtg taaagtggtc aaatttctgg agaagaattt 128461 gtcaataact ataaaaaccc ttgaaaacgt tcatggtagc ctttgatcca gtcaatgtgc 128521 tctaagtaat ttagtctaaa acagtaataa taataatgga tattacatga atttaattac 128581 aaggacgttt acaatttaat tacaaggctg tgggctccag gtcggggagg ggacctggcg 128641 gcctggaagg ggacctgggg aaggaggaat gaagcacagg ccccacccct ccagaggagg 128701 tggccccagg ccttccccag ggaaccctgg gaagctggtt ctttctggaa ggctgcagaa 128761 gggcctggcc atgggtcctg gaggccactt aggctaggcg ggcctcaggc caggaggact 128821 gagccatgcg ccctccccac agcgcagccc ctctgcacac accccacagc tggtccctct 128881 aagacttgcg gaaagggctc tgagtggccg agctgtggcc cctgatctcc caacccagca 128941 tctgagcaga agaggacagg agctggggca ctgatcgccc caggcccggt cggcagggtg 129001 ggggccaggg ccggcaggat cccgggttgc gctgccttcc gagggagccg gtgcccaccc 129061 agcggcctgc agcccaaggc agggtgccct gagaaagcca ccgccacccc cagcccctcg 129121 gtgtgagcaa gacccccagc tggtgggtcc ccacccacca gtggggctcc caggcctgtc 129181 gggggctcgg tggtgggcct ggatgctagg ggggcaacgt gcggcaggga ccaagctctc 129241 ccgcccacag atgagtggga gggggtcgtg gtgggcatgg gaagaccttt ccaacaatca 129301 ggtgtgttgg ggggtgagca aggcagggga ggggccctgg caggtggggt gccagggacc 129361 ccagggctgg ggctaggcct gtgggtgggg cgtgtgctgg ggattctgtg ggccgaggac 129421 atgtgactcc cgggcaggct ggagatgctg gggggtgggc ccgggaagca cgggagatga 129481 gggaagaggg gggaatcctg ccccacctgc cctggacagg gtgccccata ggaaaggagt 129541 gccaaggagc tgaagggaga gacggtggat ccagaacaca ccccccatcc cgttccgctc 129601 cccaaggggc ttggttggaa ggtgggaggg gacagagaca aagctgcctg ggtgggtaga 129661 gtggtcagga cagcctctct gggtcggtga cacttgagct gagacctaaa tgatgacaag 129721 aaaccagcca tgcttccttc cttgtagcga agggagctcc caggagaaca gactgtaagg 129781 gccagaccag caggcaacgg cagtggtcca gaccacagat gatggtggca tgaccccagg 129841 ggtggcagta gaggtggtgg gatgtggtca gagtcagctt gggttttaga ggtaggctgc 129901 caggacctgc tgttgggctg ggggcaaggg acagtgaggg cttgaagaag acatcgaata 129961 agcgcaggaa aagacgcgtc agcaccggcc agtacgcaca gaggaaacac acacccaaac 130021 cacagtgaga taccgccgca caccacgagg aggctggaat cagtgaaagg aagcacgagt 130081 attggagagg acgcggggac cctggagcct cacgccttgc tgctgggcat gtaaaatagt 130141 gcagccaggg aagcagtctg gtggctcctc acaaggttag gtgtagctta tgtacatgac 130201 ccagcaatcc ccaaagaact gagaacaggc gtgcaaagaa aacttacact caaatgctca 130261 cagcagagta tccatagttg agaacaggca tgcaaagaaa acttatactc aaatgctcac 130321 agcagagtat ccatagtaac caagactcgg aggcaacccg tgtccaccag tggatgaatg 130381 gatgagcgag cacaacatgg tacacccatc ccgacataaa aagaatgaac gaccggcaca 130441 cacgactgca tgagcacagt ctagccaagg caaagaggca gacagaaagc agacgaatgg 130501 ttgccaggac ctggaggaag gggggcctgg ggagtcactg ctgagggata tgaggtccct 130561 tttggggcga tgaacctgtt ctggaactag acagtggcca cggctgcaca acattgagaa 130621 tagactaaaa gtcactagtg gtaaatgtta tgttatgcgt atcttaccac aagaaaacat 130681 aaaattgcca cacaggacca caaggaccag gggtatcctc acagcagtga ggaagaccga 130741 ggaaggaagg actcagggat taggcccagg gcaggaaatg gaaggttttt atctctagct 130801 tgaacttccc gaggcatgtg gacatctcca gtgccccact ccaaatgtct ggcccaagcc 130861 caagtcagct aactgggtgg catgtgacca ttgctgttca attaaactag gaatttttct 130921 ttatttaaaa tcataagatg acggccaggc acggtggctc aagcctgtag tcccagcact 130981 ttgggaggcc gagacagacg gatcacttga ggtcaggagt tcgagaccag cttggccaac 131041 atggcgaaac cctatctcta cgaaaaatac aaaaattagc agggcgtggt ggcaggcacc 131101 tgtaatccca actacttggg aggctgaggc aggagaacca ctggaaccca ggaggtggag 131161 gttgcagtga gcccagatca caccactgta ctccagcctg ggcaacagag caagactccg 131221 tctcaaacaa aaaaagaatc atgagatgaa gctgggcgtg gtgtctcgct cctgtagccc 131281 caagtggtgg aggaggtagg gggcaggcgg tagggctgag gcaagaggat cacctgagca 131341 ttgggaggtc aaggctgcag tgagccgtga ttgtgccact ccactccagc ctgagcaacg 131401 gactgagacc ctgtctcaaa aataataaaa atttaaaaat aaataaataa aaatttaaaa 131461 agcaaaagca tgagatggag gagctcacct agggagtgaa tgtagtcaag aacaggagag 131521 gcccaaggag agtcctgggg caccctgaca ctgagccggt tggggagaga gactagaaag 131581 tcccgggagg gaagccgagg agctgctcaa tgcactgggt cggagcccgg gaggggaggg 131641 cgaaggtggg gcctctggat ttgctaaaag gtgggagggg gccgggtggg gaagctcgac 131701 ctgcatcgac acagccttcc ggagcttttt gcagagaagg agcaggttgg gttttcagga 131761 acatcagagg ggcggctgaa taccgatgga agagcaattc gcaaggccag ggaaagtgct 131821 ggcaccgctt tccccgcatg agggggacat gcgggcaagg gaggggggac acagggacag 131881 gggagggggc aacaccgggc aggcgaggca ggcatgacgc gaagtcggcc ccacgggcag 131941 tcgctgtggg gcggcgatct cccgctgggc aagggactgg cgggccttag gggacgcggg 132001 aagcggggcg ccctaggcgg agagggcgga gccccaggcg cgctcgcccc gccccctgag 132061 cccggagccg cagacaaaga gcgcgccgcg acggcggcgg ggtggctgcg gctgcgggag 132121 cagggcggac cacactcgtc ccggcgccgc cacgctggtg gctgtattgg caggactcct 132181 gcccgcccgc cccgccaacc ccgtcggagg ctgagctctg tttccaggct ctgccccagg 132241 caccctcctg cacccagcgg tggcaggcgc acagccgggt ggggcgcgac gccctcttgc 132301 tgatcaggaa tctctgggaa gccgagaaga gaggccggag aggctaagaa aacagttgtg 132361 gcctgggaga ctcgcaatcc caagtgcttg tcacacggcg cctgcgaccc ctttcactct 132421 cactggcaac gcgcgggatg ggaggtgaag ccacatcact tccaggggcc cccgtggctg 132481 ccacggaggt gggggggctg ccaccttcgg actcccctta ccccactgct gtagagtagg 132541 aaacaagctg gggaaggaca gagacaggaa gttatcctct gcccggggcc gagctctgac 132601 ctgcctgctg gctctcagct ctgccgtctg gagtatccag gacaccttgg aagcagttgt 132661 gactgtagct tccaggccaa gtcaggcatc gggggatctc ccaacgagta gagcatcaga 132721 ggagaaagga gaagccgcag gctcactaca ttcacccact cagtgaccca gcaaagtccc 132781 cattcatgtt tccccagctc gggattttga aaattgtccg cccacaagca tgtgggaaca 132841 tctgcagatt ccccacacct gctctcgggg tttctacaca cacatgtgca gaagcatttg 132901 aaagcaagat gccatcagtc ccattctccg tggaaccaca acacccttag cacggaattc 132961 cgcactcata caatgggcca gcctcacatc tccccagctg tcccgctaac acccttatag 133021 ctttttttaa aaatctagga tttgaggctg ggcacaatgg ctcatgcctg taatcccagc 133081 actttgggag gctgaggcgg gcagatcaca aggtcaggag ttcgagacca gcatggccaa 133141 tatggtgaaa ccccgtctct actaaaaata caaaaattag ccaggcatgg tggcgcatgc 133201 ctgtaatccc agctactcgg gaggctgagg caggagaatc gcttgaaccc tggaggcgga 133261 ggttttggtg agcccagatc gtaccattgc actccagcct gggcaacaag agcgaaactc 133321 tgtctcaaac aacaacaaca acaacaacaa aaggggagtc tgggggaggg gcaaggccct 133381 gtctgcagtg tctccctgct gaatgccacg cacccaggat ggctgccagc accatggggg 133441 ctcacatgca ggtgctttag gcaagaatgc ctgggagatc accagaaggg ctggccagct 133501 gtcctttggc ctccctcgat gtggacagag gggagatgac aggcggtcct gacccactcc 133561 ccctgtaaag cctgagtagg atgggcggga gaccagtcct aagtaccagg ggtttatccc 133621 ctgaccatct ccggcgtgct cggatgctag caagcacatc atggtgtcag agagggggtg 133681 ccaaggccaa gacaagttca gcagagtgtg gaccacgatg agatatatga caagggggcc 133741 tggtggcagt ggagggacat gggctgcaga aaaggctccg gagctgggga catgctgttt 133801 cctgccttcc attctgggca accagcctgc ttccttttcc taactcaaca cgcagtaatc 133861 ctccattttc aggggaggtc ccagattttc caaactagga ggacatgatt ggaatatagg 133921 aatggggctg aggttccgac atgacacacg aaagtgcctg tgacaaccca catccgtgga 133981 ataaaggaac catgtggtca tctcactagc cacagaagca gcactgcaca atccaatgcc 134041 tctcatgata aaaacacact aggaatagaa tgggaacttc cttaccctga taaaggacag 134101 ctatgaaaaa gccacagcta actttatacc ccctagtgaa agactggaag cttctttacc 134161 tttaagatga ggaacaaggg aaagacattc acaggcactt ctattcaaca ttgtactaga 134221 gactccagcc agagcaaatt aacaagaaaa agaaatcgaa gggccaggaa cggtggttta 134281 caccggtaat cccagcactt tgggaggccg aggcgggcag gattgcttga ggccaggagt 134341 tcaagactag cctggccaac atggcaaaag ctcgtctctg ctaaaaatac aaaaattagc 134401 tgggtgtggt agtgtgtgcc tgtaatccta gctacatggg aagctgaggc aggagaatca 134461 cttgaacccg ggaggcggag gctgcagtga gccgagattg caccactgca ctccagtctg 134521 ggtgacagag tgagactgtg tctcaaaaaa aaaaaaaaaa agaaagaaag aaaagaaaag 134581 aaatcaaagc catccagatt ggaaaggaag aagaaaaacg atttctattt gccaatgaca 134641 tgatcctaat atgtagataa tcctgaggaa tctactaaac aaattatact taataaacaa 134701 ttttagtaag gtcgcaagat ataggatcaa tgtataaaaa tcaatttttt ttttgagaca 134761 gagtcctgcc ctgtcaccca ggctggagtg cagcggtgca atctcagctc actgcaacct 134821 ctgcctctgg ggttcatgcg attctcttgc ctcagtctcc tgagtagctg ggattacagg 134881 cacacatcac catgccccgt taattttttt tgtattttta gtagagacgg ggttccacta 134941 tgttggccag gctggtcttg aactcctaag cacaggtgag ccacccgcct tggcctccca 135001 aagtgctggg attacaggtg tgagccaaca cgctcagcca atcaattgta tttctataca 135061 gtagcaatga gaaggctgaa agtaaaatta agaaaacgat tccatttatg atagcatcat 135121 aaagaaaata ctcatgaata aacttaaaca aaagtgcaaa acatcatgta aagaaattaa 135181 agacctaaat aaatggaaaa cgcaccccat gctcatgaac aatactaccc aaattgatcc 135241 acaaatttga tgcaatctct atcaaaatgc caactggctt ctttccagaa attaacaacc 135301 tgatcctgaa attaatatgg aaacttgagg gacttagaat agacaaaaca gtcttgaaaa 135361 agacgtacaa agttggagga ctctcatttc cttacttcaa aacttactac aaagctgcag 135421 taatgaaagc agtgtggcgc tggcttaaga acagacattt ttctaaagaa gatccacaaa 135481 tggccaacaa gcacaagaaa aggtgctcat catttagcta acaggaagtg caaatcaaaa 135541 tcacaatgag ataccacttc acacccagca ggacggccat aacaaaaacc cagaaaataa 135601 caactgttgg caaagatata gagaaattgg aacccttata tgttgctggt gggaatgtaa 135661 actggttcca gccactgtgg aaaacagttc aatggctaca ggcacacact ccaaagaatt 135721 aaaagcaagg actcaaacag atacaagtag accagtgttc acagcagcgc tattcacaac 135781 agcccaaggg tggaagcacc ccaaatgctc atcaactgac gaatggataa atatgttgta 135841 tatcacaaaa tgaaatatta tttggcaata aaaagaaatt gactattgat acatggatga 135901 accttgaaaa cattatgatt aatgaaagcc agacacaaaa ggtcagtttc tatgaaatgt 135961 ttagaataaa caaatccata gagacaggaa gtagattggt gggtgccagg ggctaggaag 136021 tttgagggag aaatggggat gtgattgcta aggagtgcag ggtttctttt ggggtgatga 136081 aaacgttctg gaattactgg tgtgggctgc acaaccttgt gagtatacta aaaaccactg 136141 acttgtacac tttgaaaggg taagttgtgc agtaggagaa tgatagctca atacagctgt 136201 aattttggaa gtgttgtgag tcattggcag aaaatggctt ggaacacaag gctaggctag 136261 agacttgggg gttcctggcc tggggatccc aggcctctgt ggattgtaga tggaaggcgc 136321 tgcagatgca catgtgcttc tttctgaaaa tcaggcccat gaagtgcctc accctgacaa 136381 ggtgcggacg aggttactca ggggggacag ggagaagtgc cccctaaaga tgggggagca 136441 cccgtgttta aggaggccgg gaggcaggca cggggtccca gaagctgacc tgtcttttgg 136501 aacaaaagct ccttctcctc tgaagaggct gggacctggt gatgcgtcca cccccagaag 136561 cgcagtcccc tgggacagct catggcttgc actgctttcc gtatctactt ttttcagtcc 136621 tcttgttttc ccttagttgt gaaggatggt tctcctcgta aaatcccggc tttgattggg 136681 ccaaggagga atgtagaaag tgaaagtttg cagccaaact cccacggttt caaagggagc 136741 ccagcaggaa agcgcacaga gggcgcatgc ggcagtggcc ctcctcctgg cgggtctcct 136801 tggggtgggc aagtcccttc catgaaggag agcttagggc gcagagcagg gagggcccac 136861 ccaagttcac ctccctgaag gcaggcccag cccgacctgc cgaggccttc ccggagggct 136921 gtggggagcc ccaagggcta ccggtgtcgt cccatggctc atttcaggcc tggggtgcag 136981 ctcgggtcgc tctcccctat ccacacgtcc ccacgaggtt ggccttaaat gaagtgcgac 137041 accaatcctt ctccaggaaa gtcaaaacgg ccccggagga atcgcttgga gaaatcgctt 137101 gccgtttgat ggaatgactc ccgaaaagga agcaggagga tgggatccag agtccccgtc 137161 aaccaaccat cccagccctg gaatccaaca cggattccac aggacgccct ctcggtcgca 137221 ttcctatccc gctagggggc tggcaatggc cacacaaggt ttccaaacag cgggaacaag 137281 agacgaggag tggcagccgg gaactgagag ggaagaaggc ggaggacagc gaacgggaga 137341 agccgagggc cccgcgcacc cccggagaag ggccggggag agaggaccag gagctccgcc 137401 agggtggggc aagatttgcg tccccgccct gcccctcggc tcccgcacca ggtggccaga 137461 ggagctcggc tcacagcgtg aagtcagtcc ccggattcct gggaattctc ccccgtgccc 137521 agcccctcct tcccccgccc ccgcccccgc cccggagttc tcgccgatcc ctgcacgggt 137581 tcccccacgt tctcccaagc caacacccct cctgcaaagc acttgattcc ttcccagtcc 137641 agaccaataa acccgggagt tgaatccagg ccactaaacg catacctgtc tgagccctgt 137701 ctgctcctgg gtgccttgac cttgacctcc ctccacttag ccctaggagg tctcagccca 137761 cagcatccgg gacatttctc tagacagtct gcagggtgca cacagaaagt acacaaatgc 137821 ggggccgggg gagcctccgc gcacacagag acggggcaca ggaacgcact cacgtagtca 137881 cgccacgcac aggcaagaag ccccgttccg ggccgcgggg ctgggctcca ggtccgagct 137941 ccgggccccg cgtctccttc cctccccgcg gggccctgct ggtccacgcc ccgcaaagcc 138001 cggccaggca ctgccacccg cagagagccc aacccgctga tcccggcggc gcccactcac 138061 ccgagacgcc taaggcccgg gagcccatcg ttcgggcagg gccaggcggg agtgggtgtc 138121 ggggtgggca cccagccccc ggacggactt ggaccgcgcc cggggacgcc cacgcactcc 138181 cacgctccag gcccgccggg atctagattc ggcgcccccc gcacggcggg agcctggctc 138241 cctcccccgc cctgggtcct gcggggagag cgtgtgcggg gaactgcgct cgttgctatg 138301 caacctccat cccgcgcccg ccgcaccgcg cctcccggga cggaagcgcg ccccacccgg 138361 gccaacccca acttcccacc ccacccgtgg ctggttccca cggagcgcca acacccaaca 138421 gccccaggat gggagggtgc ggacggctcc ctaccttgtc ctcctcgagc cctcccgctc 138481 tcccacgccc gccgctccgg gcctccccac tggcccgctc ctccccgcgg agggccgcgc 138541 ccccgccgca gcgccggcgg ccctacctgc agcatcactt tgtactgctc ctccggtttc 138601 tggaccacgc acatattaca tcccatgttg ctggccggcg agaggaggcg gagggcggga 138661 acgggaagac gcctttcact gacccgggcg cgggacctcg ggtcccgggc cggggccagg 138721 ggccataccc tggcgcgggg ggcgggccgc ctagcggggg aggggggcac gcccccgagg 138781 tgggggcagc ggctcgcccc tcaggttaac tcttcgctgg ccgaggccag gcgcgggcat 138841 gctccctcgc acccggccag gagaaaaagg gcagcggggg atgggcgggc gcgccgggcg 138901 ttgccaggct acggcccccc agggagcgcg ggggcgcgcc gggccgccag gggccgtcag 138961 gggccgtggg ccgcgcgcct ccccgcccgc ggggcccgcc gctgccgctg ccgcctccgg 139021 ctcctgccca tggccgccag ccccgggagc ccggccctcg cgctcggcgc cgcggcccgg 139081 cccgcgacag gggggcagcg gcgcggcggc ggcccccgcc cgcggccagg ccagcgcccg 139141 cgctcccgcc tcgcccggcc gcgctctcgg agctcccggc gcccgctctt cgtccagcag 139201 ccgctggccg tctccagctc ccgcccggga cgacgggcac gcgagcgcgc ccacggcggc 139261 gggggagggg cgcgggcacg gggcgcacgc gcgccgggag gggcggggag gggagcgggg 139321 ccggctgcgg gagggagcgg cgggggcgct ggagccggcg cgggaagccg ggggcgggcg 139381 ctcgacggaa gcgggcgggg gtcctaggca ccccacgaca cccggcgtcc ggtcctgagg 139441 ggcgggcacc aggcgcgttg tctgcgcgat gccgctctgg gagagagggg tcctgccagg 139501 aggagcagtc tcctgggtcc cccagagaga gtccgcgatt tcagcccctc tccccggtct 139561 tggacgcagc agccctcacg cctctgctcc ttcctcgggc ttcgccttaa atgactgcat 139621 ccctccttca actcagccat tccctgctgt ctttgaaacg gacacctttg tttaaaacaa 139681 gaaagaaaga gagaaaggga ggaaagaaaa gaaagagaga aagaaaaaag gatgagcgaa 139741 gcgtggggag ggggtgggtc gtggggagag tggcctcgga ggccttggga cccgggcctg 139801 cacacagcac tggccagcct gggcccgccg cctacctgtt cctcagcccc agccctcacc 139861 cggtccccgc tttcacagcc cttccatctc ttcctcttct ctcccaccac ctccccgtct 139921 tcttggacaa gcctggaggg agaaggcctt tgcctggaaa gcagcagcgg gtggaagtct 139981 gcgttcccag aggctgcccc ggctgcagga gatcacttgg agaaattgct acatgcttca 140041 tttccaggtt tgggcaagtg agggaacccc cagccacacg gggctgttga atcggagccc 140101 ttcctttcct ccctggctcc tcgttagaag agaaatcaag aatggcgggc ctgcccagaa 140161 tgtggagaag agaggcgagt ttcttatcca gggcttctcc cagaaagcag aacttgagac 140221 aagggctggt gtgcaggttg caacctgcca cccaatggct ggctagtctt gaggaatggc 140281 gtcatattga gggctcagca tcggtctctg cagctggtgg atggaagaac attcactagc 140341 ggcagtgtct agatgagccg tggtaagagg aagcccttgc cgttgggctt atacagagcc 140401 tccagccctc ccaccatggc cactggcatc gtgggtccat cgcatcctgg ggagaagctg 140461 gctgacctcc accagatggt cccctctgtt cgctagcttc tggcgcgccc cctccacact 140521 gcatgctccc cagcggcatg ggcgcgaaga cgagcacggg gccccacact ctccctgagc 140581 actcccgtag gtacctcttc gccaaacctg tctcttcttc caatcttcct ccacccgggt 140641 ccctgaccag tcagcccagc cattttccac tgcccggggg tcatggacac cttcatctca 140701 ggccatttct ccctctacaa agtagacaac ctagtgtctt gctcccctag aggatgtccc 140761 ctcactgctg tctctcaggg ccaccctgca aagggcctca gtgtgacagc aggtgttttt 140821 cctagtggtg acatgagatg accagtcgag gacctgggtc tgagtcagtc atggggacac 140881 cccatcacca cctcccatga ggccataggt gtgagctgaa ggggcatacc agtgtgaaag 140941 agccgggtga ccgggggtct ggactgcctg ggcctcaatg tgccatctcc acctcaatat 141001 tcagttcaca atgggtggct tgggtcacac aggccctgga tgccccatga tcaggctgta 141061 agtctctcct aggtcccagt agcatggcca cagctatgct tttttcagtg gtgagtgagt 141121 agctctgcca ccgaaggtgt ggccctgcca cagcctctat ggtccctgct gggatccttc 141181 cattgaggct tcctgagggt tccgcacagc atcctggcct accatggaca gatccctcca 141241 gcaccgttag gttggcgggc tctcgctgct ggtggcagag cagcttgccc cgcagcccag 141301 tcctcctgca gggtcctttc ttgctctgga tcccactcaa aacaggtctc agttagccgg 141361 gcgtggtggc acgcacctgt agtcccagct actcaggagg ctgaggcggg aggatggctg 141421 gccccaggaa ttcaatgtta caaggagctg tggttgcacc actgcactcc agccggggcc 141481 acagagtgag actcagtctc taaaacaaca aaacaaaaaa agaaccggtc tcgcctgtgt 141541 gtccgaaacc ccgctccatt agacattcca ccagtttggt gatctaccca gcatcatgcc 141601 agataaatcc attttcttgc taaagttaga gtcagcttct ccctgtaatc agaaaaccct 141661 aacaaatata cagccctggg agagcgtccg ggaggaagga catgagagac agctgatcat 141721 ggggcagtta ctgcgcttga cagtgcagct tgaggacagt gcagtgacat cccagcacag 141781 cagtcaggct ggagcgggcg attgcgctca agatagagac agggtgaggg cggcaccacc 141841 atgctccccg gggcactcca gcattcttta acctgaagat gggtacccag gtgagtctgg 141901 cccagattcc tatctggatg ggtcctatct tactctgtgc tgatgaggta ggtccctgct 141961 ctccctccac gtcccagagc ttggacccac tgaggatgcc atgacacccc acagaagggc 142021 tccactctga cccaccatag agagctgcaa aaaggccatg gtgctcttag aagcaggaca 142081 cacagaagat cccacccttc tgactacagt tgtccttcgg tgtccacggg gcatgggttc 142141 caggcttccc cctactgggg ataccaaaat ccatggacac ccaaatccct gatctaaaat 142201 gacatactat ttgcatagga cctttgctca tcctctctaa tgctttaaat cttctctaga 142261 ttacttgtat taccggatgc aatgtcagtg ttacataaac agttatagtg taatgatttt 142321 tttttttttt tttttttttt tttgagatgg agtctcactc tgtcgcccag gctggagtgc 142381 agtggcgcga tctcggctca ctgcaagctc cgcctcccgg gttcacacca ttctcccacc 142441 tcagcctccc tagtagctgg gactacaggt gcctgtcacc atgcccggct aattttttgt 142501 atttttagta gagacagggt ttcaccgtgt tagccaggat ggtctcgatc tcctgacctc 142561 gtgatccgcc caactcggcc tcccaaagtg ctgggattac aggcatgagc ccctgtgccc 142621 ggcctgattt gttttttaat ttgtattatc tttattgttg tattgctatt ttttattttt 142681 caagtgtttt ccatccatgg ccagttgaat cctcagatgt gaaacccacg gatatgaagg 142741 ccaactgtat tttttccact ggatgcgtgc aatccccagt ccccagccca gccgtgtcag 142801 atgtgtgagt ggggacccat gtttcccagg agtctctaac gaccttgccc actggctttt 142861 ccagaaggga ctgtcaggga ggctgaagcc agaccacgac ccagaaactc tgtccagctc 142921 ccttctcctg ggagccttgg cctggagtaa ccgggttgcc ctttccttac ttggccatga 142981 gctgagtttg ctgctttgtt ttctctctct ccatgggtcg tgctgttggc cttgtatcta 143041 aagaagtcat caccaaaccc agtgtcacct gatgttccca tgttatcctc taggattcga 143101 tagttttgtg ttttacattt gagtccatga tctgttttga gataatgttt gtgaaggatg 143161 gaaggtctgt ctcagatcat ttttctcaca gttctggagg ctgggaagcc caagatcaca 143221 gtgccaactg gttccatgcc tggtgaaggt ctgttccatc aatgatatct tctcatgatg 143281 tccttacgtg ttggaaaagg ggagtaaagc aagaccactg catgaagcct cttttatcag 143341 ggcactaatc ctgttcatga gagtagagcc ctcatgaccg aatcacctct caaaggcccc 143401 accctgcgat accatcatct tgataattag gtttcagcat atgaattttg gaaggccaca 143461 taccttcaag ccacagcagt ctgtgcctag gttcattttt ccatgtggat gtccagttgt 143521 tccagcacca tttgctggga aggctaccgt ttctccattg aattgccttt gctctttggt 143581 ctatttctgc tacacttttt ttttctttct ttctttcttt tgagacagag tctcgctgtg 143641 tcacccaggc tggagtgcag tggcacaatc ttggctcatt gcaacctccg cctcccgggt 143701 tcaagagatt ctcctgcctc agcctcccta gctgctggga ttacaggcac ctgccaccac 143761 acccagctaa tttttgtatt tttagtagag acggcgtttc accatgttgg ccagactggt 143821 cttgaactcc tgacctcagg tgatccacct acctcaacct ctcaaagtgc tgggattaca 143881 ggtgtgagcc acctgcacgt ggcctactac gccatttttt cagatgggca gggccccgga 143941 gggcttctca tgttctcata tgaacgtggt attttccttc tgtctttcgt tgttgttgct 144001 tcaagttgac agagccagat tatcattatc catcaacact tgtccaccca aggccctctt 144061 tcgttttatt ttctgtttag aaaataagct aaggccgtgc gcggtggcca acactttggg 144121 aggccgaggt gggcggatca cctaaggtca gaagttcgag accagcctgg tcaacatggt 144181 gaaagccagt ctctaacaaa aatacaaaaa aaaaatagcc agatgtggtg gtgcatgcct 144241 gtgatcccag ctactcagaa ggctgaggca ggacaatcac ttgaacctgg gaggtggagg 144301 ttgcagtgag ccgagattgt gccactgcac tccagcctgt gtggcagaga ctccatctca 144361 aaaaagtaaa tcaataaggc tgggcgaggt ggctcaagtc tgtaatccca gcactttggg 144421 aggccgaggc gggcagatca caaggtcagg agttcaagac cagcctggcc aacatggtga 144481 aaccccgtct ccattaaaaa ttcaaaaaaa ttagccgggc gcagtggcag gcgcctgtaa 144541 tcccagctac tcaggaggct gaggcaggag aattgcttga acccgggagg cagaagttgc 144601 agtgagctga gatcgcgcca ctgcactgca gcctggggga cagagcaaga atccatctca 144661 gaaaataaat aaaataaaat aaatacaaaa taagctaatg aatgtctgta agtccactgg 144721 ccagcccagg aatgggcccc gtaccaatgc cctactgaaa tacacccagg tgctctgctg 144781 tctccccaac accacctccc cactccaaga taaccagtat cctgcatttt gcatttatta 144841 ttctcctgct tttgaaaatt ttgatagcag ccagggcaac atagtgacac cacgtcttta 144901 caagaaatta gaaaatcagc caggcatggt ggtgcgttac tgtggtccca gctactcagg 144961 aggctgagat gggaggacca cttaagccca gaaggtcaag gctgcagtga gctatgatca 145021 caccactgta ctccagcccg ggtgacaaag cgagaccctg tctggaaaaa atatatcaaa 145081 aaaataaaaa ataataaaat aaaattgtga tagaacacat atataacaca acacttgcca 145141 ttgtaaccat ttttcagtat gactcagtgg tacaactgac ttgtgttgta taagtagagc 145201 attcacaatg ttcggccatc accactgttt cccaaacact gccatcgccc caaacagaaa 145261 ctctgcaccc gttgagcaat agctccccag ttcctgctcc cccagcccct ggtaacctgt 145321 agccactttc tgtctctgtg aatttgccca ttctagatat tttatgtaag tggaatcacc 145381 atatttgtcc ttttgtgact ggcttatgtg acttagcatg ttgttttcag gttgatgtat 145441 cttgtaacat acatcagaac tacgttcctt tttatggcgc aatagtaatc tgttgtgcgg 145501 gtgcaccaca ttttctttat ccattcattt gtcgatgggt gcttgggttg cttcccactt 145561 aggttttttt tcagcaggga gtagagtctg gctctgttgc tcagaatgga gtatagtggt 145621 gccatctcag ctcactgcac cttccgcctt ctgggctcaa gtgatcctcc ctctgatcct 145681 cccacctcag cctcccaagt ggctgggact acaggcacat gccaccacac tcagctaata 145741 atttacaatt ttttttgtag agacagggtc tcactatatt gcccagactg gtctcaaact 145801 cctgggctca aacgatccct ccccctcagc ctctcaaagt gctgggctta ccagtgtgag 145861 ccaccacccc cggcctagtt tttttttttt tttttaatat agctgtatca tatatgctta 145921 aaattatatg ttctttagtt gagtttggga ggctgaggcg ggcagatcat ctgaggtcag 145981 gagtttgaga ccagcctggc caacatggtg aaatcccatc tctactaaaa atacaaaatt 146041 agacgggtgt agtggcatgc gcctgtaatc ccagctactc gggagactga gacaggagaa 146101 tcgcttgaac cgaggaggca gaggttgcag tgagctgaga ttgtgccatt gcacgccagc 146161 ctgggcaaca agagcgaaac tccatctcaa aaaataaaaa taaataaaca gcattgctgt 146221 aaacatttgt gtacctgtct cctagtgcaa gggggagaag ttgctcttgt atgtatgcct 146281 gggagttact gtaagatcag ccagcatgca actttacata agccctaatt gttttccaaa 146341 gtgctcctat caatttacac ttccactaac aatgtgtaca agatcctttt tgatctacat 146401 ctttctcatc tatttttttt tagacagtct tgctctgtag cccaggctgg agtgcagtgg 146461 cgtgatctcg gctcgctgca agtccgcctc ccgggttcaa gcgtacatct ccttcaactc 146521 ttgctatttg ggggattcct tcgtttttgc cagactgatg tgtgcgaaac agtaccccac 146581 cataatcttg catttccctg attaccagca aggctgctct tttcttccat ggtttctggg 146641 ccacatgtgt tttctcttct gttttgcctg gagttgtctt ttgcccgttt ttatattttc 146701 ttcttccttt tgcttattga tttgtaggaa ttttttacat attctagaca ctaatccttt 146761 gccagctatg tgtattcttt catttgaagc ttgccttttt actttcttta aggagtcttt 146821 cataaaaaga agtttctaat tctagtgtaa ttaaatccat cttatctctt cttttctttt 146881 cttttttttt tttttttgac agggtttcat tgcccaggct ggagtgcagt ggcacaaaca 146941 tggctcgaac tcctgggcta aagcgatcat cccacctcag ctgcccatgt agctgagact 147001 acaggcaccc accacaccca gataattttt taattttttg tacagatgag gtcacatcat 147061 gttgtccaga ctggtctcaa actcctgggc tcaagcgatc ttcccacctc cacttcccaa 147121 agtgctggga taacaggcat gagccactgt gcccggctac cttatttttt tcttaaggat 147181 taatgctttt tatatctggt ttaagaaatc cttttctaat ccaaggtcag aaagatactt 147241 tttacatttt cttttaagaa tcttaaagtt tcctttttga catttaaatc cttaatccat 147301 ctggaaatta tttttgtgta agatgtggag ggcctaattt catttttccc cattataaat 147361 cacctctatt tccttctcct tttgttaaat aatccctctt tcctcgctaa tctgccttgc 147421 caccttgtca tattatatac gtatttttct ctcttttttt ttttttgaga caggttctca 147481 ctctgtcatc caggttagag tacagtggca cgatcacagc tcactgcaac ctcaatctcc 147541 caggctcaag tgattatccc acctcagcct cccgagtagc tgggaccaca gatgcatgcc 147601 acatctggct tacttattta tttatttatt ttttgcagag acatgttgcc caggctgttg 147661 tcatatatta aaggtccatc tttctggggt ctctttaggg ctctctattt tgtcccactg 147721 gctactttgt ctaccctcga actaaggtca ggctttgtta attatgatag ttttataata 147781 agccttgtta tttgccaggc aagccctcct cgtgcatctt cagggtggtc tgagctagtc 147841 ttgacctttg ctctaccata tacattttag aatgaacttt agtttctcaa aacatcctgt 147901 ggggattttg attagagtgg cattgaatct gttcactcaa tttggggaga tttgacacta 147961 tcttaacaat caccattgaa tatctctcca tttatttagg tcttttaaag tgtgtttaga 148021 attttctgta catctttctt ttttcttttt atgattatta ttattttaga gatagggtct 148081 ccctctgttg tccaggctgg agtgcagtgg catgatccta gctcactgca gcctcaaact 148141 cctgggctca agtgatcctc ctgcatcagc ctcccaaaat actgggatta cagccatgag 148201 ccaccatgcc ccgcccacat cttttatttt acttacttct gagtttttta aattttttgt 148261 tgctattata aatgacgtct ttttacagag cgcattttct aaatactatg ggtggtgcat 148321 agacatacac ttggcttgtg tatattcatc attatgtcca gtcagtttgc taaagtctct 148381 tttctttcta ataatttgtc tttataccct cttcggaatt tttttttttg tttttttcag 148441 aagatcactt catgtgaaat attgacagat gtatttccac cgttctaatc tttatgcctt 148501 gcatttcttt tcttgctata ctaccccggc tagagcctcc ggaacaaagc ccagaagagg 148561 tggcacaagt ggacaggcat gctttgttac tgattgttaa gaaaatgctt ctaggccagg 148621 cgcggtggct ctcacctgta atcccagcac tttgggaggc cgaggtgggc ggatcacctg 148681 aggtcaggag tttgagacca gcctagccaa cgtggtaaaa ccccgtctct actaaaaata 148741 caaaaattag ccgggcatgg tggcaggtgg ctgtaatccc agctacttgg gaggctgagg 148801 caggagaatt gcttgaacct gggaggcaga ggttgcagtg agccgagatc gcgccactgc 148861 actccagcct gggcgacaga gactgcatct caaaataaaa aatatatata aaaaaaagaa 148921 agaaaatatt tctaacattt ctctattaat acaattatgt gtgtatttta tcaagtctag 148981 gaagttccct tttattccca gcttattaag agtttttgtt atgaataagt gttgaatttt 149041 agcaaatgct ttttctgcac ttgacgagat gaccagcgtc tttctccttt atcctgctcc 149101 tgggttaaaa gacacttatg agccgggcgc ggtggctcac gcctgtcatc ccagcacttt 149161 gggaggccaa ggcgggcgga tcacgaggtc aggagatcga gactatcctg gctaacacgg 149221 tgaaacccca tctctactaa aaatacaaaa attagccagg cgtgatggcg ggtgcctgtg 149281 gtcccagcta ctcgggaggc tgaggcagga ggcagagctt gcagtgagct gagatcgcgc 149341 cactgcactc cagcctgggt gacagagcaa gactctgtta aaaaaaaaaa aagacattta 149401 tggacattct aatattaaat caccttgctt tactgacatt cgctgaactt ggtcattatt 149461 ggctattttc ctttatacac atttggattt ggtttagaaa aatttttgtg agagtgggga 149521 cgtccagctt cggagcggga atcttcgctg cgccagcgac taaaaggaga attaaatatg 149581 ggtgatgttg agaaaggcaa gaagattttt attatgaagt gttcccagtg ccacaccgtt 149641 gaaaagggag gcaagcacaa gactgggcca aatctccaag gcctcttcgg gcggaagaca 149701 ggtcaggccg ctggatactc ttacacggtt gccagtaaga acaaaggcat cacctgggga 149761 gaggatacac tgatggagta tttggagaat cccaagaagc acatacctga aacaaaaatg 149821 atctttgtcg gcattaagaa gaaggaagaa agggcagact cgatagcttt tctcaaaaaa 149881 gctactaatg agtaataatt ggccactgcc ttgtttatta caaaacagaa atgtctcatg 149941 acttttgtat gcataccatc ctttaataga tctcatacac cagaattcag atcatgaatg 150001 actgacagaa tattttgttg agcagtcctg atttaaaact aagactggtt tgtggttaaa 150061 tgaatacgtt cagttcttga attttaatag taactccaat tcagtaaatg ctatcactgt 150121 ttaccccttc taaagatatg attagacttc gttagtaatg ttcaactttt cacaaagatg 150181 gtgagtgccg tcttaaaact tactggagat tggttttata tttagattta tatgactggt 150241 tatgtgaata tatgtaaata ctggggaaat tccttcactg tcttagaacc aagcaagatt 150301 cacctgtgtt ttgtgttcat ttgcctctta aaggcaacgg ttgaaggtaa ataagggagc 150361 aatgtctata gttttggcct taactatgcc aatctaatta gaattccctg cattaaaaaa 150421 aaaaaagaaa aatttttgtt taggactttt gtctctgtat tccagaccaa aatggaccta 150481 caatcttcct atcttgttct gtttttgcct agttaatatt aaggttttgt tgggtaatgg 150541 tccttccttt tcaattctct ggcagagtct gtagagttgc tagaacttgt ctgtaaaacc 150601 atgcgggccg gctgttttat ttgtgggaag atttttaaac cgctgcttca atctgtttta 150661 tacttttagg gttactaggg ctcgcttgct tgcttttttt tttttttttt ttttgagacg 150721 gagtctcact ctgtcaccta ggctagagtg cagtggcgtg gcatgatctc ggctcactgc 150781 aacctccacc tcccaggttc aagtgattct cctgtctcag cctcccgagt aggtgggatt 150841 acaggcaccc accaccacgc ccggctaatt tttgtatttt tagtagagac ggggtttcac 150901 catgttggcc aggctggtct caaactcctg acctcaagtg atccacccgc ctcggcctcc 150961 caaagtgctg ggattacagg tgtgagccac catgcccggc ctttttttgt ctgttttagt 151021 aaatttagta tatttctact ttgtttcagg tttattgtca tcagtagttg gtggtttatt 151081 tttagtcctt aaactgtctg tatacatttt ctgtattttc atataattca caattttaaa 151141 ggcaaaatta aaaaaaacaa taggccgggc atggtggctc acgcctgtaa tcccagcact 151201 ttgggaggcc gaggcgggcg gatcacctga ggtcaggagt tcgagaccag cctggccaac 151261 atagtgaaac cccgtctcta ctaaaaatac aaaaattagc tgggcgtggt ggtgcgcatc 151321 tgtggtccca gctactcagg aggctgaggc aggagaatcc cttgaaccgg gaggcggagg 151381 ctgcagtgag ccgagattac accactgcac tccagccttg gtgacagaga ctctgtctca 151441 aaaaaaaaaa aaaaaaaaaa aaaaaaggta atcacattac ccactttaaa gcatacatac 151501 aattcggtgg ttgttagtat actcacaatg ttgtgccacc atcactaata attccagaac 151561 attttcatca ccccaaaaag aaacccctta cttgttagca atcaatctcc gttcccccct 151621 cctctagccc ctggcaacca ttcatctcct tgctgtctct atggatttgc ctattctgat 151681 cttttatatg aatggaatca tacgctatgt ggccttttgt gtgtagcttc tttcactcag 151741 cacaatgctt ccaaggctca tctgtgttgt ggcatggatc agtattttat ttctttttat 151801 gcctgaatag tatatcaaat gatgagacca aatcattgta ctgttcttgc aacttttctg 151861 taggttttta cattttcaaa gtaagaagtt gaaaagaaaa aacttatctt aagcaatcat 151921 tggattaaag aggagagcat acagaagcga caagcttcga gagcatcgtg aaaagaagaa 151981 cacgtcttac cccaacctct acacacaact caagctttcc acgaggggca gttagaatag 152041 tcaagatatg catgattaag gaggatagat caaaagtcaa cgacaaccag aaaaagaatg 152101 agagaagcca aaggggaaac aaaggaaggg ggaaaaagag aataaagatt aaaaacataa 152161 aatagtaaaa attgtaaata aatacaaaag ctgattcttt aacaacagaa aaaaattgat 152221 aatttcctaa caaaatataa ctttctgaaa acccaacaag aactagaaaa tttgagtcta 152281 caaattacca ttttaaagta gagaaggaag ccaaactcag tagctcacgc ctgtaatccc 152341 aacagtttgg gaggccaagg tgggaggatc acttgagtct aggagtttga ggccagcctg 152401 agcaatatag tgagactccc atctctacaa aaacaaaaat taaaaaatta gctggacgtg 152461 gtggcacaca cctgtagtcc tagctgctca ggagggtgag gtgagaggat cgcttgagcc 152521 caggaatttg aggctgcact gagctacagt catgccactg tactccagcc ctggcaatgg 152581 agggagaccc tgtgtcaaat aaaataaaat agagagggga ttctccataa aaacagtacc 152641 aaccctaagt agtttcaaag ccaagtccta tataatcttt caagacaaaa taattctgat 152701 atcatttaaa ctattccagt ccacacaaaa acatttcata aagccaacaa tacctaaacc 152761 taggttgatg cccaaacctc ctaaacagag tacaaaaaga aacctgtgaa caactgtcac 152821 ttctgattac aaatgcaaaa attataaata aaagataagc acatggattc cagcgatata 152881 gcaaaagaat aagacattat gtctgagcaa gacataatat tctattatta tgaatagaat 152941 agaatttcgg gtctactcca gaaatacaga ggtgattgaa catcaggaaa tctttccaca 153001 gaattcatta ctcagcaaat taaaggagga aactcacagg atcactagtt gctgactagg 153061 catttgatgc atttcgacag gcgttccttg aaaaagaaag aacatctctt ctaaaaggat 153121 actgggtaga aacacgtaaa ggtggtagtg agagacagca tttccactct tctgcatagc 153181 aacacaaaag caacggggtg catggaaacc ccctgccaac tccattgttg gccaaaggag 153241 gagtcagaca gcctcctagc aagcaaagca gaaatctttc ccaagcccca ggtcctgaag 153301 atattctcct gttgtcttct acaagcttta ttgttttgcc ttttgttctt agatctacaa 153361 catacctgga attgattttt ttgtgtataa tgtcaggcag tggtcaaaat ttgttttttc 153421 tttaaaaaaa aaaaacaaaa agacttccag gctgaagatc // LOCUS HSU52155 2211 bp mRNA PRI 02-SEP-1996 DEFINITION Human ATP-dependent inwardly rectifying potassium channel Kir4.1 mRNA, complete cds. ACCESSION U52155 NID g1518529 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2211) AUTHORS Schoots,O. and Van Tol,H.H.M. TITLE Cloning of cDNAs for four inwardly rectifying potassium channels from human JOURNAL Unpublished REFERENCE 2 (bases 1 to 2211) AUTHORS Schoots,O. TITLE Direct Submission JOURNAL Submitted (25-MAR-1996) Oscar Schoots, Molecular Neurobiology, Clarke Institute of Psychiatry, 250 College Street, Toronto, ON M5T 1R8, Canada FEATURES Location/Qualifiers source 1..2211 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Clontech #HL1128a" /tissue_type="cerebellum" CDS 227..1366 /codon_start=1 /product="ATP-dependent inwardly rectifying potassium channel Kir4.1" /db_xref="PID:g1518530" /translation="MTSVAKVYYSQTTQTESRPLMGPGIRRRRVLTKDGRSNVRMEHI ADKRFPYLKDLWTTFIDMQWRYKLLLFSATFAGTWFLFGVVWYLVAVAHGDLLELDPP ANHTPCVVQVHTLTGAFLFSLESQTTIGYGFRYISEECPLAIVLLIAQLVLTTILEIF ITGTFQAKIARPKKRAETIRFSQHAVVASHNGKPCLMIRVANMRKSLLIGCQVTGKLL QTHQTKEGENIRLNQVNVTFQVDTASDSPFLILPLTFYHVVDETSPLKDLPLCSGEGD FELVLILSGTVESTSATCQVRTSYLPEEILWGYEFTPAISLSASGKYIADFSLFDQVV KVASPSGLRDSTVRYGDPEKLKLEESLREQAEKEGSALSVRISNV" BASE COUNT 482 a 678 c 528 g 523 t ORIGIN 1 ggcgctgcgg agggaggggg cggcccggcc cggcccagct ctgcccccgg ccggcccgac 61 cccggccccg gcccccggac aagcccttat ctgatcccag ctccgggttt aagagtcctg 121 gcccggcccg tcgcacagct ctgctcctaa ctcctgcccg ccccgtccgt ccatctgtcc 181 cgctgccccg cggcccatcc aagggccact ccacctcgga cccaagatga cgtcagttgc 241 caaggtgtat tacagtcaga ccactcagac agaaagccgg cccctaatgg gcccagggat 301 acgacggcgg agagtcctga caaaagatgg tcgcagcaac gtgagaatgg agcacattgc 361 cgacaagcgc ttcccctacc tcaaggacct gtggacaacc ttcattgaca tgcagtggcg 421 ctacaagctt ctgctcttct ctgcgacctt tgcaggcaca tggttcctct ttggcgtggt 481 gtggtatctg gtagctgtgg cacatgggga cctgctggag ctggaccccc cggccaacca 541 caccccctgt gtggtacagg tgcacacact cactggagcc ttcctcttct cccttgaatc 601 ccaaaccacc attggctatg gcttccgcta catcagtgag gaatgtccac tggccattgt 661 gcttcttatt gcccagctgg tgctcaccac catcctggaa atcttcatca caggtacctt 721 ccaggcgaag attgcccggc ccaagaagcg ggctgagacc attcgtttca gccagcatgc 781 agttgtggcc tcccacaatg gcaagccctg cctcatgatc cgagttgcca atatgcgcaa 841 aagcctcctc attggctgcc aggtgacagg aaaactgctt cagacccacc aaaccaagga 901 aggggagaac atccggctca accaggtcaa tgtgactttc caagtagaca cagcctctga 961 cagccccttc cttattctac cccttacctt ctatcatgtg gtagatgaga ccagtccctt 1021 gaaagatctc cctctttgca gtggtgaggg tgactttgag ctggtgctga tcctaagtgg 1081 gacagtggag tccaccagtg ccacctgtca ggtgcgcact tcctacctgc cagaggagat 1141 cctttggggc tacgagttca cacctgccat ctcactgtca gccagtggta aatacatagc 1201 tgactttagc ctttttgacc aagttgtgaa agtggcctct cctagtggcc tccgtgacag 1261 cactgtacgc tacggagacc ctgaaaagct caagttggag gagtcattaa gggagcaagc 1321 tgagaaggag ggcagtgccc ttagtgtgcg catcagcaat gtctgatgac ctgttcccac 1381 tcccccattc ctctggtctc ttttcctctc ttccaatgcc ctggtaagga atactacccg 1441 ggtttactgg agatcccccg aagcacccat cctccactcc ctcttcttta acccagtggc 1501 ctgttggtag cttaggccaa ctggagtcca ggttcgcctc ccactgtccc ctttccactt 1561 ccccagcttc tgccccaata cacatacctc ccttaagcca ggatggggga aagagtggga 1621 ttaggctgaa gtggcttaga aggcctcagc catgcttgga tactcacatt aggaggacca 1681 tgtggttgga aggatagact gccccctacc tcccaccacc accatgaagt ttggtgactt 1741 gaggctggag ctccctctgt tacctttcca tctgacggat tcccaaaggc aagactctct 1801 ctgatggtca ctttgtggtc tgtgctttca gaaatacagg aatctgatat caacatatcc 1861 tagggtttct accaatctct gttgaaagaa gccagggttt gccactgtga agcttgattt 1921 ctgctggtga cttctgacca taagctagaa ccatggtcgc cactgttttc cctctgtagt 1981 ttctcaagtg aacactctca ggatacccag ttccctcata gcctctgttc tcagagaatt 2041 ggagttggcc caagaaacat aaacatataa ccacccatat ctatcctgga ttctgaactc 2101 ttcaatttgg agtgactaac acaagttgtt atctaaacct ttaaacctat cttccaggca 2161 gcccagagaa gatctgtttc cctgtgtcct gtgaatggaa ggacccgagc c // LOCUS HSU52219 1939 bp mRNA PRI 28-JUL-1996 DEFINITION Human melatonin-related receptor mRNA, complete cds. ACCESSION U52219 NID g1326154 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1939) AUTHORS Reppert,S.M., Weaver,D.R., Ebisawa,T., Mahle,C.D. and Kolakowski,L.F. Jr. TITLE Cloning of a melatonin-related receptor from human pituitary JOURNAL FEBS Lett. 386 (2-3), 219-224 (1996) MEDLINE 96228068 REFERENCE 2 (bases 1 to 1939) AUTHORS Reppert,S.M. TITLE Direct Submission JOURNAL Submitted (25-MAR-1996) Steven M. Reppert, Pediatrics, Massachusetts General Hospital, Fruit Street, Boston, MA 02114, USA FEATURES Location/Qualifiers source 1..1939 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="H9" /tissue_type="pituitary" /dev_stage="adult" 5'UTR 1..69 CDS 70..1911 /codon_start=1 /product="melatonin-related receptor" /db_xref="PID:g1326155" /translation="MGPTLAVPTPYGCIGCKLPQPEYPPALIIFMFCAMVITIVVDLI GNSMVILAVTKNKKLRNSGNIFVVSLSVADMLVAIYPYPLMLHAMSIGGWDLSQLQCQ MVGFITGLSVVGSIFNIVAIAINRYCYICHSLQYERIFSVRNTCIYLVITWIMTVLAV LPNMYIGTIEYDPRTYTCIFNYLNNPVFTVTIVCIHFVLPLLIVGFCYVRIWTKVLAA RDPAGQNPDNQLAEVRNFLTMFVIFLLFAVCWCPINVLTVLVAVSPKEMAGKIPNWLY LAAYFIAYFNSCLNAVIYGLLNENFRREYWTIFHAMRHPIIFFPGLISDIREMQEART LARARAHARDQAREQDRAHACPAVEETPMNVRNVPLPGDAAAGHPDRASGHPKPHSRS SSAYRKSASTHHKSVFSHSKAASGHLKPVSGHSKPASGHPKSATVYPKPASVHFKGDS VHFKGDSVHFKPDSVHFKPASSNPKPITGHHVSAGSHSKSAFSAATSHPKPIKPATSH AEPTTADYPKPATTSHPKPAAADNPELSASHCPEIPAIAHPVSDDSDLPESASSPAAG PTKPAASQLESDTIADLPDPTVVTTSTNDYHDVVVVDVEDDPDEMAV" 3'UTR 1912..1939 BASE COUNT 384 a 643 c 428 g 484 t ORIGIN 1 tgtttgctgt ctggacctgg ctgctgatcc tgagcctgct gggagatctt aacgatcccc 61 aggagcaaca tggggcccac cctagcggtt cccaccccct atggctgtat tggctgtaag 121 ctaccccagc cagaataccc accggctcta atcatcttta tgttctgcgc gatggttatc 181 accatcgttg tagacctaat cggcaactcc atggtcattt tggctgtgac gaagaacaag 241 aagctccgga attctggcaa catcttcgtg gtcagtctct ctgtggccga tatgctggtg 301 gccatctacc catacccttt gatgctgcat gccatgtcca ttgggggctg ggatctgagc 361 cagttacagt gccagatggt cgggttcatc acagggctga gtgtggtcgg ctccatcttc 421 aacatcgtgg caatcgctat caaccgttac tgctacatct gccacagcct ccagtacgaa 481 cggatcttca gtgtgcgcaa tacctgcatc tacctggtca tcacctggat catgaccgtc 541 ctggctgtcc tgcccaacat gtacattggc accatcgagt acgatcctcg cacctacacc 601 tgcatcttca actatctgaa caaccctgtc ttcactgtta ccatcgtctg catccacttc 661 gtcctccctc tcctcatcgt gggtttctgc tacgtgagga tctggaccaa agtgctggcg 721 gcccgtgacc ctgcagggca gaatcctgac aaccaacttg ctgaggttcg caattttcta 781 accatgtttg tgatcttcct cctctttgca gtgtgctggt gccctatcaa cgtgctcact 841 gtcttggtgg ctgtcagtcc gaaggagatg gcaggcaaga tccccaactg gctttatctt 901 gcagcctact tcatagccta cttcaacagc tgcctcaacg ctgtgatcta cgggctcctc 961 aatgagaatt tccgaagaga atactggacc atcttccatg ctatgcggca ccctatcata 1021 ttcttccctg gcctcatcag tgatattcgt gagatgcagg aggcccgtac cctggcccgc 1081 gcccgtgccc atgctcgcga ccaagctcgt gaacaagacc gtgcccatgc ctgtcctgct 1141 gtggaggaaa ccccgatgaa tgtccggaat gttccattac ctggtgatgc tgcagctggc 1201 caccccgacc gtgcctctgg ccaccctaag ccccattcca gatcctcctc tgcctatcgc 1261 aaatctgcct ctacccacca caagtctgtc tttagccact ccaaggctgc ctctggtcac 1321 ctcaagcctg tctctggcca ctccaagcct gcctctggtc accccaagtc tgccactgtc 1381 taccctaagc ctgcctctgt ccatttcaag ggtgactctg tccatttcaa gggtgactct 1441 gtccatttca agcctgactc tgttcatttc aagcctgctt ccagcaaccc caagcccatc 1501 actggccacc atgtctctgc tggcagccac tccaagtctg ccttcagtgc tgccaccagc 1561 caccctaaac ccatcaagcc agctaccagc catgctgagc ccaccactgc tgactatccc 1621 aagcctgcca ctaccagcca ccctaagccc gctgctgctg acaaccctga gctctctgcc 1681 tcccattgcc ccgagatccc tgccattgcc caccctgtgt ctgacgacag tgacctccct 1741 gagtcggcct ctagccctgc cgctgggccc accaagcctg ctgccagcca gctggagtct 1801 gacaccatcg ctgaccttcc tgaccctact gtagtcacta ccagtaccaa tgattaccat 1861 gatgtcgtgg ttgttgatgt tgaagatgat cctgatgaaa tggctgtgtg aaaaatgctc 1921 tcgtaggtgg ccaggcagt // LOCUS HSU52370 2572 bp mRNA PRI 27-FEB-1997 DEFINITION Human fertilin beta mRNA, complete cds. ACCESSION U52370 NID g1850326 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2572) AUTHORS Vidaeus,C.M., von Kapp-Herr,C., Golden,W.L., Eddy,R.L., Shows,T.B. and Herr,J.C. TITLE Human fertilin beta: identification, characterization, and chromosomal mapping of an ADAM gene family member JOURNAL Mol. Reprod. Dev. 46 (3), 363-369 (1997) MEDLINE 97193554 REFERENCE 2 (bases 1 to 2572) AUTHORS Vidaeus,C.M. TITLE Direct Submission JOURNAL Submitted (26-MAR-1996) Cecilia M. Vidaeus, University of Virginia, Cell Biology, 1300 Jefferson Park Avenue, Charlottesville, VA 22908, USA FEATURES Location/Qualifiers source 1..2572 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /tissue_type="testis" 5'UTR <1..7 CDS 8..2215 /note="PH30" /codon_start=1 /product="fertilin beta" /db_xref="PID:g1850327" /translation="MWRVLFLLSGLGGLRMDSNFDSLPVQITVPEKIRSIIKEGIESQ ASYKIVIEGKPYTVNLMQKNFLPHNFRVYSYSGTGIMKPLDQDFQNFCHYQGYIEGYP KSVVMVSTCTGLRGVLQFENVSYGIEPLESSVGFEHVIYQVKHKKADVSLYNEKDIES RDLSFKLQSVEPQQDFAKYIEMHVIVEKQLYNHMGSDTTVVAQKVFQLIGLTNAIFVS FNITIILSSLELWIDENKIATTGEANELLHTFLRWKTSYLVLRPHDVAFLLVYREKSN YVGATFQGKMCHANYAGGVVLHPRTISLESLAVILAQLLSLSMGTTYDDINKCQCSGA VCIMNPEAIHFSGVKIFSNCSFEDFAHFISKQKSQCLHNQPRLDPFFKQQAVCGNAKL EAGEECDCGTEQDCALIGETCCDIATCRFKAGSNCAEGPCCENCLFMSKERMCRPSFE ECDLPEYCNGSSASCPENHYVQTGHPCGLNQWICIDGVCMSGDKQCTDTFGKEVEFGP SECYSHLNSKTDVSGNCGISDSGYTQCEADNLQCGKLICKYVGKFLLQIPRATIIYAN ISGHLCIAVEFASDHADSQKMWIKDGTSCGSNKVCRNQRCVSSSYLGYDCTTDKCNDR GVCNNKKHCHCSASYLPPDCSVQSDLWPGGSIDSGNFPPVAIPARLPERRYIENIYHS KPMRWPFFLFIPFFIIFCVLIAIMVKVNFQRKKWRTEDYSSDEQPESESEPKG" 3'UTR 2216..2572 polyA_signal 2350..2355 BASE COUNT 814 a 422 c 544 g 792 t ORIGIN 1 gcgagccatg tggcgcgtct tgtttctgct cagcgggctc ggcgggctgc gcatggacag 61 taattttgat agtttacctg tgcaaattac agttccggag aaaatacggt caataataaa 121 ggaaggaatt gaatcgcagg catcctacaa aattgtaatt gaagggaaac catatactgt 181 gaatttaatg caaaaaaact ttttacccca taattttaga gtttacagtt atagtggcac 241 aggaattatg aaaccacttg accaagattt tcagaatttc tgccactacc aagggtatat 301 tgaaggttat ccaaaatctg tggtgatggt tagcacatgt actggactca ggggcgtact 361 acagtttgaa aatgttagtt atggaataga acccctggag tcttcagttg gctttgaaca 421 tgtaatttac caagtaaaac ataagaaagc agatgtttcc ttatataatg agaaggatat 481 tgaatcaaga gatctgtcct ttaaattaca aagcgtagag ccacagcaag attttgcaaa 541 gtatatagaa atgcatgtta tagttgaaaa acaattgtat aatcatatgg ggtctgatac 601 aactgttgtc gctcaaaaag ttttccagtt gattggattg acgaatgcta tttttgtttc 661 atttaatatt acaattattc tgtcttcatt ggagctttgg atagatgaaa ataaaattgc 721 aaccactgga gaagctaatg agttattaca cacattttta agatggaaaa catcttatct 781 tgttttacgt cctcatgatg tggcattttt acttgtttac agagaaaagt caaattatgt 841 tggtgcaacc tttcaaggga agatgtgtca tgcaaactat gcaggaggtg ttgttctgca 901 ccccagaacc ataagtctgg aatcacttgc agttatttta gctcaattat tgagccttag 961 tatggggact acttatgatg acattaacaa atgccagtgc tcaggagctg tctgcattat 1021 gaatccagaa gcaattcatt tcagtggtgt gaagatcttt agtaactgca gcttcgaaga 1081 ctttgcacat tttatttcaa agcagaagtc ccagtgtctt cacaatcagc ctcgcttaga 1141 tccttttttc aaacagcaag cagtgtgtgg taatgcaaag ctggaagcag gagaggagtg 1201 tgactgtggg actgaacagg attgtgccct tattggagaa acatgctgtg atattgccac 1261 atgtagattt aaagccggtt caaactgtgc tgaaggacca tgctgcgaaa actgtctatt 1321 tatgtcaaaa gaaagaatgt gtaggccttc ctttgaagaa tgcgacctcc ctgaatattg 1381 caatggatca tctgcatcat gcccagaaaa ccactatgtt cagactgggc atccgtgtgg 1441 actgaatcaa tggatctgta tagatggagt ttgtatgagt ggggataaac aatgtacaga 1501 cacatttggc aaagaagtag agtttggccc ttcagaatgt tattctcacc ttaattcaaa 1561 gactgatgta tctggaaact gtggtataag tgattcagga tacacacagt gtgaagctga 1621 caatctgcag tgcggaaaat taatatgtaa atatgtaggt aaatttttat tacaaattcc 1681 aagagccact attatttatg ccaacataag tggacatctc tgcattgctg tggaatttgc 1741 cagtgatcat gcagacagcc aaaagatgtg gataaaagat ggaacttctt gtggttcaaa 1801 taaggtttgc aggaatcaaa gatgtgtgag ttcttcatac ttgggttatg attgtactac 1861 tgacaaatgc aatgatagag gtgtatgcaa taacaaaaag cactgtcact gtagtgcttc 1921 atatttacct ccagattgct cagttcaatc agatctatgg cctggtggga gtattgacag 1981 tggcaatttt ccacctgtag ctataccagc cagactccct gaaaggcgct acattgagaa 2041 catttaccat tccaaaccaa tgagatggcc atttttctta ttcattcctt tctttattat 2101 tttctgtgta ctgattgcta taatggtgaa agttaatttc caaaggaaaa aatggagaac 2161 tgaggactat tcaagcgatg agcaacctga aagtgagagt gaacctaaag ggtagtctgg 2221 acaacagaga tgccatgata tcacttcttc tagactaatt atctgtgatg gatggacaca 2281 aaaaaatgga aagaaaagaa tgtacattac ctggtttcct gggattcaaa cctgcatatt 2341 gtgattttaa tttgaccaga aaatatgata tatatgtata atttcacaga taatttactt 2401 atttaaaaat gcatgataat gagttttaca ttacaaattt ctgttttttt aaagttatct 2461 tacgctattt ctgttggtta gtagacacta attctgtcag taggggcatg gtataaggaa 2521 atatcataat gtaatgaggt ggtactatga ttaaaagcca ctgttacatt tc // LOCUS HSU52426 4040 bp mRNA PRI 18-JUL-1997 DEFINITION Homo sapiens GOK (STIM1) mRNA, complete cds. ACCESSION U52426 NID g2264345 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4040) AUTHORS Parker,N.J., Begley,C.G., Smith,P.J. and Fox,R.M. TITLE Molecular cloning of a novel human gene (D11S4896E) at chromosomal region 11p15.5 JOURNAL Genomics 37 (2), 253-256 (1996) MEDLINE 97079692 REFERENCE 2 (bases 1 to 4040) AUTHORS Parker,N.J., Begley,C.G., Smith,P.J. and Fox,R.M. TITLE Direct Submission JOURNAL Submitted (25-MAR-1996) Nigel J. Parker, Haematology/Oncology, Royal Chldren's Hospital, Flemington Rd., Parkville, VIC 3055, Australia FEATURES Location/Qualifiers source 1..4040 /organism="Homo sapiens" /note="located immediately telomeric of PRM1 gene; DSEG number: D11S4896E" /db_xref="taxon:9606" /chromosome="11" /map="11p15.5" gene 1..4040 /gene="STIM1" CDS 566..2623 /gene="STIM1" /codon_start=1 /product="GOK" /db_xref="PID:g2264346" /translation="MDVCVRLALWLLWGLLLHQGQSLSHSHSEKATGTSSGANSEEST AAEFCRIDKPLCHSEDEKLSFEAVRNIHKLMDDDANGDVDVEESDEFLREDLNYHDPT VKHSTFHGEDKLISVEDLWKAWKSSEVYNWTVDEVVQWLITYVELPQYEETFRKLQLS GHAMPRLAVTNTTMTGTVLKMTDRSHRQKLQLKALDTVLFGPPLLTRHNHLKDFMLVV SIVIGVGGCWFAYIQNRYSKEHMKKMMKDLEGLHRAEQSLHDLQERLHKAQEEHRTVE VEKVHLEKKLRDEINLAKQEAQRLKELREGTENERSRQKYAEEELEQVREALRKAEKE LESHSSWYAPEALQKWLQLTHEVEVQYYNIKKQNAEKQLLVAKEGAEKIKKKRNTLFG TFHVAHSSSLDDVDHKILTAKQALSEVTAALRERLHRWQQIEILCGFQIVNNPGIHSL VAALNIDPSWMGSTRPNPAHFIMTDDVDDMDEEIVSPLSMQSPSLQSSVRQRLTEPQH GLGSQRDLTHSDSESSLHMSDRQRVAPKPPQMSRAADEALNAMTSNGRHRLIEGVHPG SLVEKLPDSPALAKKALLALNHGLDKAHSLMELSPSAPPGGSPHLDSSRSHSPSSPDP DTPSPVGDSRALQASRNTRIPHLAGKKAVAEEDNGSIGEETDSSPGRKKFPLKIFKKP LKK" variation 1645 /gene="STIM1" /replace="g" BASE COUNT 893 a 1149 c 1114 g 884 t ORIGIN 1 tcgacctgga cctgggcacc gccagccgcc tgggcacggg actgggcggg ggcgctgacc 61 tcggcctagg aggcccagga tcccggagac gcccgcgccc tcaggaccct gcgggtcgca 121 cgccctcccc agcttctgct gctcgccgct cttcggcagg gcgaggtcag gtgccccctt 181 ctcgcctctc ttctcttctc ttcctcctcc acttctgtgc ccgcggagac tccggccgcc 241 ccttccgcag gggtgtagta atctgcggag ctgacagcag ccccgcagcc accctgcccg 301 aagtctccgg aagcggcacg agctcaggcc gccgcagccc cggcgaccca ctgttggacc 361 tgaggagcca gccctcctcc cgcacccaaa cttggagcac ttgacctttg gctgttggag 421 ggggcaggct cgcgggtggc tggacagctg cggacgcgcg agggcatctt gcctggagac 481 cgtcggctgc actcccgggc tcctggcttt gctctgggat cccgaggtgt ccacatcaga 541 cgcatgttga ctgagaccta gagtcatgga tgtatgcgtc cgtcttgccc tgtggctcct 601 ctgggggctc ctcctgcacc agggccagag cctcagccat agtcacagtg agaaggcgac 661 aggaaccagc tcgggggcca actctgagga gtccactgca gcagagtttt gccgaattga 721 caagcccctg tgtcacagtg aggatgagaa actcagcttc gaggcagtcc gtaacatcca 781 caaactgatg gacgatgatg ccaatggtga tgtggatgtg gaagaaagtg atgagttcct 841 gagggaagac ctcaattacc atgacccaac agtgaaacac agcaccttcc atggtgagga 901 taagctcatc agcgtggagg acctgtggaa ggcatggaag tcatcagaag tatacaattg 961 gaccgtggat gaggtggtac agtggctgat cacatatgtg gagctgcctc agtatgagga 1021 gaccttccgg aagctgcagc tcagtggcca tgccatgcca aggctggctg tcaccaacac 1081 caccatgaca gggactgtgc tgaagatgac agaccggagt catcggcaga agctgcagct 1141 gaaggctctg gatacagtgc tctttgggcc tcctctcttg actcgccata atcacctcaa 1201 ggacttcatg ctggtggtgt ctatcgttat tggtgtgggc ggctgctggt ttgcctatat 1261 ccagaaccgt tactccaagg agcacatgaa gaagatgatg aaggacttgg aggggttaca 1321 ccgagctgag cagagtctgc atgaccttca ggaaaggctg cacaaggccc aggaggagca 1381 ccgcacagtg gaggtggaga aggtccatct ggaaaagaag ctgcgcgatg agatcaacct 1441 tgctaagcag gaagcccagc ggctgaagga gctgcgggag ggtactgaga atgagcggag 1501 ccgccaaaaa tatgctgagg aggagttgga gcaggttcgg gaggccttga ggaaagcaga 1561 gaaggagcta gaatctcaca gctcatggta tgctccagag gcccttcaga agtggctgca 1621 gctgacacat gaggtggagg tgcaatatta caacatcaag aagcaaaatg ctgagaagca 1681 gctgctggtg gccaaggagg gggctgagaa gataaaaaag aagagaaaca cactctttgg 1741 caccttccac gtggcccaca gctcttccct ggatgatgta gatcataaaa ttctaacagc 1801 taagcaagca ctgagcgagg tgacagcagc attgcgggag cgcctgcacc gctggcaaca 1861 gatcgagatc ctctgtggct tccagattgt caacaaccct ggcatccact cactggtggc 1921 tgccctcaac atagacccca gctggatggg cagtacacgc cccaaccctg ctcacttcat 1981 catgactgac gacgtggatg acatggatga ggagattgtg tctcccttgt ccatgcagtc 2041 ccctagcctg cagagcagtg ttcggcagcg cctgacggag ccacagcatg gcctgggatc 2101 tcagagggat ttgacccatt ccgattcgga gtcctccctc cacatgagtg accgccagcg 2161 tgtggccccc aaacctcctc agatgagccg tgctgcagac gaggctctca atgccatgac 2221 ttccaatgga cgccaccggc tgatcgaggg ggtccaccca gggtctctgg tggagaaact 2281 gcctgacagc cctgccctgg ccaagaaggc attactggcg ctgaaccatg ggctggacaa 2341 ggcccacagc ctgatggagc tgagcccctc agccccacct ggtggctctc cacatttgga 2401 ttcttcccgt tctcacagcc ccagctcccc agacccagac acaccatctc cagttgggga 2461 cagccgagcc ctgcaagcca gccgaaacac acgcattccc cacctggctg gcaagaaggc 2521 tgtggctgag gaggataatg gctctattgg cgaggaaaca gactccagcc caggccggaa 2581 gaagtttcct ctcaaaatct ttaagaagcc tcttaagaag taggcaggat ggggtggcag 2641 taaagggaca gcttgtcctt ccctgggtgt tctgtctctc cttccctccc ttccttcaag 2701 ataactggcc ccaagagtgg ggcatgggaa gggctggtcc aggggtctgg gcactgtaca 2761 tacctgcccc ctcatccttg ggtccttcat tattatttat taactgacca ccatggcctg 2821 cctgccctgc ctccgtccca accatgggct gctgctgtca ctccctctcc acttcagtgc 2881 atgtcttagt tgctgttccc tcagctccca gctccacctc tggggttcag cttctgtctc 2941 tgctgtccca gttttgaggt ttggtttctt gtttctgtct cttgctttca ggctcctccc 3001 tcccaccact ccccaacttc ccctagcagt tgcagggaag ataggacgag tagcttctga 3061 catgtgtgcc tcagatctgt tccacccact cacagtggtt ctgtttgctc cagactgggg 3121 ctagggctaa tctttgaagt ttgttctttg gtattgatgt gggtcagaag gaggcctcat 3181 cctaatctca ctcaggcctc cagggatcca tgggggagtg aaaccaattc tcagagaaca 3241 acccaccaga gacttttaaa gagaggccag gcttgggaat gggttgggag aggcatctgt 3301 tcattggagc atgagtggat gccagaactg taggttataa ggcagtcact ttttctctct 3361 actcccaccc acacctgcct ccctcttacc cctgctcccc acactgcagg aggatttgtc 3421 tctaagaggt gctgccccaa agctccccaa gcatcaatac tcctagggct caggacaagt 3481 ggctcccctg gccaggagag ccacagccat gatacaggcc tcttatggag ccctggagtt 3541 gttgggcaag gatgctgtca ttttttgaac caaaagacaa acaggttaaa aggaaaaaaa 3601 gtaatctgaa tttcccaagt gcctacgctg catattcccc ttgttagatc ccattttcat 3661 gttactttgt agccttggcc cagaggctca aaaaggacac aaccagtttg gggaaggggt 3721 ggctaaggaa gatggtatag gtgaaggcgg ctgtgtgacc actttccccc acccttccca 3781 ccctctagac aactctctcc cttacctgtt tttgctatgg ctgtaaaggt atttttcctc 3841 tgccccactc cctgccatac ctttatcctg ggatcctatt ttgggcctgg ggtgggtata 3901 cctggggctg gtcttaggag ggtgctaggc tgcagactgc cttgtactcc ctggacaccc 3961 tcaaatgggg gtttctgtgt tatttcataa aattctttga agtccaataa agcatgtagg 4021 agattttaac cacaaaaaaa // LOCUS HSU52518 874 bp mRNA PRI 18-JUN-1996 DEFINITION Human Grb2-related adaptor protein (Grap) mRNA, complete cds. ACCESSION U52518 NID g1354384 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 874) AUTHORS Feng,G.S., Ouyang,Y.B., Hu,D.P., Shi,Z.Q., Gentz,R. and Ni,J. TITLE Grap is a novel SH3-SH2-SH3 adaptor protein that couples tyrosine kinases to the Ras pathway JOURNAL J. Biol. Chem. 271 (21), 12129-12132 (1996) MEDLINE 96218119 REFERENCE 2 (bases 1 to 874) AUTHORS Ni,J. TITLE Direct Submission JOURNAL Submitted (26-MAR-1996) Jian Ni, Protein Expression and Purification, Human Genome Sciences, Inc., 9410 Key West Avenue, Rockville, MD 20850-3338, USA FEATURES Location/Qualifiers source 1..874 /organism="Homo sapiens" /db_xref="taxon:9606" gene 34..687 /gene="Grap" CDS 34..687 /gene="Grap" /note="Grb2 homolog; Conceptual translation supplied by author" /codon_start=1 /product="Grb2-related adaptor protein" /db_xref="PID:g1354385" /translation="MESVALYSFQATESDELAFNKGDTLKILNMEDDQNWYKAELRGV EGFIPKNYIRVKPHPWYSGRISRQLAEEILMKRNHLGAFLIRESESSPGEFSVSVNYG DQVQHFKVLREASGKYFLWEEKFNSLNELVDFYRTTTIAKKRQIFLRDEEPLLKSPGA CFAQAQFDFSAQDPSQLSFRRGDIIEVLERPDPHWWRGRSCGRVGFFPRSYVQPVHL" BASE COUNT 175 a 268 c 270 g 161 t ORIGIN 1 ctgagcccag ctgctggagc cccgagcagc ggcatggagt ccgtggccct gtacagcttt 61 caggctacag agagcgacga gctggccttc aacaagggag acacactcaa gatcctgaac 121 atggaggatg accagaactg gtacaaggcc gagctccggg gtgtcgaggg atttattccc 181 aagaactaca tccgcgtcaa gccccatccg tggtactcgg gcaggatttc ccggcagctg 241 gccgaagaga ttctgatgaa gcggaaccat ctgggagcct tcctgatccg ggagagtgag 301 agctccccag gggagttctc tgtgtctgtg aactatggag accaggtgca gcacttcaag 361 gtgctgcgtg aggcctcggg gaagtacttc ctgtgggagg agaagttcaa ctccctcaac 421 gagctggtcg acttctaccg caccaccacc atcgccaaga agcggcagat cttcctgcgc 481 gacgaggagc ccttgctcaa gtcacctggg gcctgctttg cccaggccca gtttgacttc 541 tcagcccagg acccctcgca gctcagcttc cgccgtggcg acatcattga ggtcctggag 601 cgcccagacc cccactggtg gcggggccgg tcctgcgggc gcgttggctt cttcccacgg 661 agttacgtgc agcccgtgca cctgtgagca gcccggcggc gatcttgcca acggggcttt 721 ttacaggaac tgaggtccag agaggacatg gacaccccca gctctgtcag agtcacacgg 781 ggctcagtgg acggccttgg actgaacgta ggctcctaac tgcctcccgg ccggtctgca 841 caaactggga tgggccaggt cccccagcaa gggt // LOCUS HSU52521 1234 bp mRNA PRI 25-APR-1996 DEFINITION Human arfaptin 1, putative target protein of ADP-ribosylation factor, mRNA, complete cds. ACCESSION U52521 NID g1279760 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1234) AUTHORS Kanoh,H. and Exton,J.H. TITLE Direct Submission JOURNAL Submitted (26-MAR-1996) H. Kanoh, Howard Hughes Medical Institute, Vanderbilt, 831 Light Hall, Nashville, TN 37232, USA FEATURES Location/Qualifiers source 1..1234 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="leukemia cell line HL60" CDS 131..1156 /codon_start=1 /product="arfaptin 1" /db_xref="PID:g1279761" /translation="MAQESPKNSAAEIPVTSNGEVDDSREHSFNRDLKHSLPSGLGLS ETQITSHGFDNTKEGVIEAGAFQGGQRTQTKSGPVILADEIKNPAMEKLELVRKWSLN TYKCTRQIISEKLGRGSRTVDLELEAQIDILRDNKKKYENILKLAQTLSTQLFQMVHT QRQLGDAFADLSLKSLELHEEFGYNADTQKLLAKNGETLLGAINFFIASVNTLVNKTI EDTLMTVKQYESARIEYDAYRTDLEELNLGPRDANTLPKIEQSQHLFQAHKEKYDKMR NDVSVKLKFLEENKVKVLHNQLVLFHNAIAAYFAGNQKQLEQTLKQFHIKLKTPGVDA PSWLEEQ" BASE COUNT 403 a 241 c 272 g 318 t ORIGIN 1 cgctgctctt ggttctggtt ctggaggctg ggttgagagg tcgccggtcc gactgtcctc 61 ggcggttggt cagtgtgaat ttgtgacagc tgcagttgct ccccgccccc gagcagccga 121 ggagtctacc atggctcaag aatctcccaa aaattcagca gcagaaattc cagtgactag 181 taatggagaa gttgatgact ctcgtgaaca tagctttaat agggatttga agcattcatt 241 accatctgga cttggtctct cagaaaccca aattacatct catggctttg acaataccaa 301 agagggtgtt attgaagcag gagcatttca aggtggccag agaacacaga caaaaagtgg 361 accagttatt ctagcagatg aaattaaaaa tcctgcaatg gaaaagttag aacttgttag 421 aaaatggagt ctaaacacct ataagtgtac tcgacagatt atctctgaga agctaggccg 481 tggctcaaga actgtggacc ttgaacttga agctcagatt gatatattaa gggataacaa 541 gaaaaaatat gaaaatattt taaaactggc tcaaacattg tcgacccagc ttttccagat 601 ggtacatacc caaaggcaac ttggagatgc atttgctgac ctgagtttga agtcactaga 661 acttcatgaa gaatttggct ataatgccga tacccagaaa ctgctggcta aaaatggaga 721 gactcttctt ggggccatta attttttcat tgctagtgtg aacactttgg tgaataaaac 781 cattgaagat acattaatga ctgtgaaaca gtatgaaagt gccaggattg aatatgatgc 841 atatcgcact gatttggaag aactgaatct tggaccacgt gacgcaaaca ctctgccaaa 901 gattgagcag tcacagcatc tcttccaagc acataaggaa aaatatgata aaatgcgcaa 961 tgatgtttct gtcaaattga aatttctaga agaaaataag gttaaagtat tgcacaatca 1021 gctggtcctt ttccacaatg ccattgccgc ttactttgct gggaatcaga agcagcttga 1081 acagacactt aaacagttcc atatcaaatt gaaaacccct ggagtggatg ccccatcttg 1141 gcttgaagaa cagtaaaatc acagcggaaa ataaaaagaa agtcgcgttg ttatatttct 1201 aaaccaacct aacaagaatt aagcagagtt gggc // LOCUS HSU52522 1654 bp mRNA PRI 25-APR-1996 DEFINITION Human arfaptin 2, putative target protein of ADP-ribosylation factor, mRNA, complete cds. ACCESSION U52522 NID g1279762 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1654) AUTHORS Kanoh,H. and Exton,J.H. TITLE Direct Submission JOURNAL Submitted (26-MAR-1996) H. Kanoh, Howard Hughes Medical Institute, Vanderbilt, 831 Light Hall, Nashville, TN 37232, USA FEATURES Location/Qualifiers source 1..1654 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="leukemia cell line HL60" CDS 68..1093 /codon_start=1 /product="arfaptin 2" /db_xref="PID:g1279763" /translation="MTDGILGKAATMEIPIHGNGEARQLPEDDGLEQDLQQVMVSGPN LNETSIVSGGYGGSGDGLIPTGSGRHPSHSTTPSGPGDEVARGIAGEKFDIVKKWGIN TYKCTKQLLSERFGRGSRTVDLELELQIELLRETKRKYESVLQLGRALTAHLYSLLQT QHALGDAFADLSQKSPELQEEFGYNAETQKLLCKNGETLLGAVNFFVSSINTLVTKTM EDTLMTVKQYEAARLEYDAYRTDLEELSLGPRDAGTRGRLESAQATFQAHRDKYEKLR GDVAIKLKFLEENKIKVMHKQLLLFHNAVSAYFAGNQKQLEQTLQQFNIKLRPPGAEK PSWLEEQ" BASE COUNT 387 a 456 c 476 g 335 t ORIGIN 1 tggagcccga ggtccccgcg cggcccgggc ctggcgccct gaggggaaga gcggcccggc 61 ccgagccatg acggacggga tcctagggaa ggcagccaca atggagatcc ctatccacgg 121 gaacggcgaa gccaggcagc ttcctgaaga tgatgggctg gagcaggacc tccagcaggt 181 gatggtgtca ggacccaacc tcaatgaaac cagcattgtg tctggtggct atgggggctc 241 tggtgatgga ctcatcccca cagggtctgg ccgccatcca tctcacagca ccactccttc 301 tggccctgga gatgaggtgg ctcggggcat tgctggagaa aagtttgaca tcgtcaagaa 361 atggggcatc aacacctata agtgcacaaa gcaactgtta tcagaacgat ttggtcgagg 421 ctcacggact gtggacctgg agctagagct gcagattgag ttgctgcgtg agacgaagcg 481 caagtatgag agtgtcctgc agctgggccg ggcactgaca gcccacctct acagcctgct 541 gcagacccag catgcactgg gtgatgcctt tgctgacctc agccagaagt ccccagagct 601 tcaggaggaa tttggctaca atgcagagac acagaaacta ctatgcaaga atggggaaac 661 gctgctagga gccgtgaact tctttgtctc tagcatcaac acattggtca ccaagaccat 721 ggaagacacg ctcatgactg tgaaacagta tgaggctgcc aggctggaat atgatgccta 781 ccgaacagac ttagaggagc tgagtctagg cccccgggat gcagggacac gtggtcgact 841 tgagagtgcc caggccactt tccaggccca tcgggacaag tatgagaagc tgcggggaga 901 tgtggccatc aagctcaagt tcctggaaga aaacaagatc aaggtgatgc acaagcagct 961 gctgctcttc cacaatgctg tgtccgccta ctttgctggg aaccagaaac agctggagca 1021 gaccctgcag cagttcaaca tcaagctgcg gcctccagga gctgagaaac cctcctggct 1081 agaggagcag tgagctgctc ccagcccaac ttggctatca agaaagacat tgggaagggc 1141 agccccaggg tgtgggagat tggacatggt acatcctttg tcacttgccc tctggcttgg 1201 gctccttttt ctggctgggg cctgacacca gttttgccca cattgctatg gtgggaagag 1261 tgcctggagg cccagaagtt gctgccctgt ctatcttcct ggccacaggg cttcattccc 1321 agatcttttc cttccacttc acagccaacg gctatgacaa aaccactccc tggccaatgg 1381 catcactctt caggctgggg tgtgctccct gaccaatgac agagcctgaa aatgccctgt 1441 cagccaatgg cagctcttct cggactcccc tgggccaatg atgttgcgtc taataccctt 1501 tgtctctcct ctatgcgtgc ccattgcaga gaaggggact gggaccaaag gggtggggat 1561 aatggggagc cccattgctg gccttgcatc tgaataggcc taccctcacc cacccaccca 1621 gtttaattgt gcttagagcc caagaagatt ggga // LOCUS HSU52840 8056 bp mRNA PRI 16-JAN-1998 DEFINITION Homo sapiens semaphorin F homolog mRNA, complete cds. ACCESSION U52840 NID g2772583 KEYWORDS . SOURCE Homo sapiens. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8056) AUTHORS Simmons,A.D., Overhauser,J. and Lovett,M. TITLE Isolation of cDNAs from the Cri-du-chat critical region by direct screening of a chromosome 5-specific cDNA library JOURNAL Genome Res. 7 (2), 118-127 (1997) MEDLINE 97202103 REFERENCE 2 (bases 1 to 8056) AUTHORS Simmons,A.D., Puschel,A.W., McPherson,J.D., Overhauser,J. and Lovett,M. TITLE Molecular cloning and mapping of human semaphorin F from the Cri-du-chat candidate interval JOURNAL Biochem. Biophys. Res. Commun. (1998) In press REFERENCE 3 (bases 1 to 8056) AUTHORS Lovett,M., Simmons,A.D. and Overhauser,J. TITLE Direct Submission JOURNAL Submitted (27-MAR-1996) Biochemistry and the McDermott Center, University of Texas Southwestern Medical Center, 5323 Harry Hines Boulevard, Dallas, TX 75235-8591, USA REFERENCE 4 (bases 1 to 8056) AUTHORS Simmons,A. TITLE Direct Submission JOURNAL Submitted (14-JAN-1998) Otorhinolaryngology and the McDermott Center, University of Texas Southwestern Medical Center, 5323 Harry Hines Boulevard, Dallas, TX 75235-8591, USA REMARK Nucleotide and amino acid sequence updated by submitter FEATURES Location/Qualifiers source 1..8056 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="5" /map="5p15.2" /clone="CSA1" CDS 638..3862 /note="Cri-du-chat region" /codon_start=1 /product="semaphorin F homolog" /db_xref="PID:g2772584" /translation="MKGTCVIAWLFSSLGLWRLAHPEAQGTTQCQRTEHPVISYKEIG PWLREFRAKNAADFSQLTFDPGQKELVVGARNYLFRLQLEDLSLIQAVEWECDEATKK ACYSKGKSKEECQNYIRVLLVGGDRLFTCGTNAFTPVCTNRSLSNLAEIHDQISGMAR CPYSPQHNSTALLTAGGELYAATAMDFPGRDPAIYRSLGILPPLRTAQYNSKWLNEPN FVSSYDIGNFTYFFFRENAVEHDCGKTVFSRAARVCKNDIGGRFLLEDTWTTFMKARL NCSRPGEVPFYYNELQSTFFLPELDLIYGIFTTNVNSIAASAVCVFNLSAIAQAFSGP FKYQENSRSAWLPYPNPNPHFQCGTVDQGLYVNLTERNLQDAQKFILVHEVVQPVTTV PSFMEDNSRFSHVAVDVVQGREALVHIIYLATDYGTIKKVRVPLNQTSSSCLLEEIEL FPERRREPIRSLQILHSQSVLFVGLREHVVKIPLKRCQFYRTRSTCIGAQDPYCGWDV VMKKCTSLEESLSMTQWEQSISACPTRNLTVDGHFGVWSPWTPCTHTDGSAVGSCLCR TRSCDSPAPQCGGWQCEGPGMEIANCSRNGGWTPWTSWSPCSTTCGIGFQVRQRSCSN PTPRHGGRVCVGQNREERYCNEHLLCPPHMFWTGWGPWERCTAQCGGGIQARRRICEN GPDCAGCNVEYQSCNTNPCPELKKTTPWTPWTPVNISDNGDHYEQRFRYTCKARLADP NLLEVGRQRIEMRYCSSDGTSGCSTDGLSGDFLRAGRYSAHTVNGAWSAWTSWSQCSR DCSRGIRNRKRVCNNPEPKYGGMPCLGPSLEYQECNTLPCPVDGVWSCWSPWTKCSAT CGGGHYMRTRSCSNPAPAYGGDICLGLHTEEALCNTQPCPESWSEWSDWSECEASGVQ VRARQCILLFPMGSQCSGNTTESRPCVFDSNFIPEVSVARSSSVEEKRCGEFNMFHMI AVGLSSSILGCLLTLLVYTYCQRYQQQSHDATVIHPVSPAPLNTSITNHINKLDKYDS VEAIKAFNKNNLILEERNKYFNPHLTGKTYSNAYFTDLNNYDEY" misc_feature 2767..2820 /note="encodes type I thrombospondin repeat; similar to Swiss-Prot Accession Number P07996" misc_feature 2824..2877 /note="encodes type I thrombospondin repeat; similar to Swiss-Prot Accession Number P07996" repeat_region 7766..8055 /note="Alu" BASE COUNT 1957 a 2055 c 1970 g 2074 t ORIGIN 1 ctgctctccc tgagcccgct cccgagcgct gctttcccgc cgcgggtggg cttcgcagcc 61 tcaggccagc cgcggccctt ggcccgctgc agccccggcc ctccaccttc cccgtgcagg 121 ggcggcccgg ccagtgtcgc tcatcccggg acgctccctt ctcccaccca ggactgcccc 181 gcggagctgg cttggacacc caactttgcc acctcgaggg tcgtctctgc tgggcgcgaa 241 cctgcccacc caccggttgg ccgcgcgctc ggggaccgtg ctcgtggccc ccaagccggt 301 gcccccattc tggaactcag cgagtagggg gcggctctgg ggaagtggca gggggcggct 361 gcagctgctg cctccacttc cctagccagg tgctgaagag gatcctcgga gccgctctgg 421 cccccaggcg ctggatgact ggcaccagcg ctcctcgcac ctgtgttggt gtgtgagact 481 tgggctggag tgcccacgtg gctgtggagt cagtgtgatt catgattgag gaaacgcgtc 541 ctccatcctc tctctccttg gcactttcca cacatgagga gaagaagagc ttctgtttag 601 aagacacgtg cccagagtca gaggcccctt gcccaccatg aagggaacct gtgttatagc 661 atggctgttc tcaagcctgg ggctgtggag actcgcccac ccagaggccc agggtacgac 721 tcagtgccag agaaccgagc atccagtcat ctcctataaa gaaattggcc cctggttacg 781 ggagttcaga gcgaagaatg ctgcggattt ctcgcagtta acatttgacc caggacagaa 841 agaacttgtt gtaggagcaa gaaactacct cttcaggtta cagcttgagg atctgtctct 901 tatccaggct gtggaatggg agtgtgatga agctaccaaa aaggcctgtt acagcaaagg 961 caaatcaaag gaggaatgtc agaactacat ccgggtgctt ctggtgggtg gcgaccggtt 1021 attcacctgt gggaccaatg cattcacgcc tgtctgcacc aaccgctcgt tgagcaacct 1081 ggctgagatc catgatcaga tcagtggcat ggcccgctgt ccctacagtc cccagcacaa 1141 ttccacagcg ctcctcacag ctggtgggga gctctatgct gctacagcca tggattttcc 1201 aggacgtgat cctgccattt accgaagcct aggcatttta cctcctctcc gcacggcgca 1261 gtacaactcc aaatggctca atgagccaaa ctttgtgtca tcttatgaca tcggaaattt 1321 tacctacttc tttttccgag aaaatgcagt agagcatgac tgtgggaaaa cagtgttctc 1381 cagagctgcc cgggtgtgca agaacgatat tggtgggcgc ttcctgctgg aagacacctg 1441 gaccacattc atgaaggctc gcctgaactg ctcccgtcct ggggaagtcc ccttttacta 1501 caacgaattg cagagtactt tcttcctgcc tgagctggat ttgatctatg gcatctttac 1561 caccaatgtg aacagcattg cggcctcagc tgtgtgcgtc ttcaacctga gcgccatcgc 1621 gcaggccttc tctgggccct tcaagtacca agaaaactcg cgctcggcct ggctaccgta 1681 tcccaaccca aacccccact tccagtgtgg caccgtggac cagggcctgt acgtgaacct 1741 gaccgagaga aatctgcagg atgctcagaa gttcattctg gtgcatgagg tggtacagcc 1801 agtgaccaca gtgccctcct tcatggagga caatagccgc ttttcccacg tggcagtcga 1861 cgtggtgcag ggcagagaag cgctcgtcca catcatctat ttggccacag attacggaac 1921 cattaagaaa gtgcgggtac ccctgaatca gacctcaagc agctgtttgc tggaagagat 1981 tgagctcttc cctgagaggc ggagggagcc catcaggagc ctgcagatcc tgcacagcca 2041 gagtgtcctg ttcgtggggc tgcgggagca cgtggtcaag atccccctga agaggtgcca 2101 gttctaccgc acacgcagca cctgcattgg ggcccaggac ccttactgtg gctgggatgt 2161 ggtaatgaag aaatgcacaa gcctggagga gagcctgagc atgacgcagt gggaacagag 2221 catctctgcg tgtccgacca ggaatctcac cgtggatggg cactttggtg tgtggtctcc 2281 gtggacgcct tgcacgcaca cagatggcag cgccgtggga tcctgcctct gtcgaacccg 2341 ctcctgcgac agcccggccc cgcagtgtgg tggctggcag tgcgagggcc ctggcatgga 2401 gatcgccaac tgttccagga acggaggctg gactccctgg acctcgtggt ctccctgcag 2461 cactacctgt gggatcggct tccaggtgcg gcagcgctcc tgcagcaacc ccactcccag 2521 gcacgggggc cgggtgtgcg tgggacagaa ccgcgaggaa agatactgca atgaacattt 2581 gctatgtccc ccacacatgt tctggacagg ctggggtcct tgggaacggt gcacagccca 2641 atgcgggggt ggcattcaag ctcgccgcag gatctgtgag aatgggcctg actgtgcagg 2701 ctgcaatgtg gagtaccagt cttgcaacac caacccgtgt cctgagctga agaagaccac 2761 gccctggaca ccctggacac ctgtcaacat ctctgacaac ggcgaccact atgagcaacg 2821 attccgatac acatgcaaag cccgcctggc tgatccgaat ttgctggaag tgggaagaca 2881 gagaatcgaa atgcggtact gttctagcga cggcaccagt ggctgctcca cagatgggct 2941 ttctggggat ttcctgcgtg ctgggagata ctctgcccac acggtcaacg gggcttggtc 3001 agcctggacg tcgtggtcac agtgcagccg tgactgcagc aggggcattc ggaaccggaa 3061 gcgtgtttgc aacaaccccg aacccaagta tgggggaatg ccttgccttg gcccatctct 3121 ggaataccag gaatgcaaca ctttgccctg cccagtggat ggcgtgtggt cttgctggtc 3181 cccctggaca aaatgttcag caacatgcgg cggtggacac tatatgagga cccgctcttg 3241 ctccaatcca gccccggcct atggagggga catctgcctg gggctgcaca cagaagaggc 3301 actctgcaac acgcagccct gcccagagag ctggtcggag tggtcggact ggtctgagtg 3361 tgaagcctct ggcgtccaag tccgcgcccg ccagtgcatc ctcctgttcc ccatgggcag 3421 ccagtgctcc gggaacacca cggagagccg gccgtgtgtg tttgactcta atttcatccc 3481 agaagtatct gtggcaagat ccagtagcgt agaagagaaa aggtgtggag agttcaacat 3541 gttccacatg atcgccgtgg ggctgagcag ctccatcctc ggctgcctcc tcaccctgct 3601 cgtctatact tactgccagc ggtaccagca gcaatcccac gatgcgactg tcatccaccc 3661 cgtctcacct gcccccctta ataccagcat aaccaaccac atcaacaaac tggacaagta 3721 cgactcggtg gaggccatca aggcatttaa caaaaacaac ttgatcctag aggaaagaaa 3781 caaatacttc aacccacatc tcactgggaa gacctattct aatgcctact ttacagatct 3841 caataattat gatgagtact aacagctttc atgttttggg cttcttgtaa atccccagtt 3901 cctcaaggcc tgtgccccat gactgcccat gtttctgagg cttcagagtc gaagtttgga 3961 tacatttcaa gtgcatttca agccaccaga gtgtcccatt ggtgccaaaa atacacgtct 4021 ttaaaagcaa caaaaattga aataagacat cgtgaaaatc ttgaccattg ttgattgagc 4081 cagggtggtg aagtttttaa ttggtgttca tcctattttt ctgacaagtc cattggtttg 4141 ttttttgagc attattttat aaatgtgcca cccacattgg aaggagtctt tctttagaac 4201 tttggagtgt aaatcttcat gatgttgtaa attcaagaaa ataggcactt tctctgaaag 4261 acctgctcct tccacaagaa gtgcataggt ccataatatt tcataaaatg aagaaaaaga 4321 atgtggccaa acaattattc accatggatt gcccaacttt ccaaatctgg ataaagctgt 4381 gggattcttg gaagcagctt gagtgttttc atcttgcctg ggaagcccag gaattccacc 4441 tggtccacac cggcagaagt tacagtagac tgtgaggcac cccaccttgc tcctgatgca 4501 gtttctgtgc cattgctggt gctggtggag gcagcggagc agaggctcag gcacaatgaa 4561 gcgtggatgt gttctgcagg ttgctgcaaa actcacctta ttctgacttt tggatttcat 4621 ggcattccag gaagctcctt gccatgctgt tggcctggaa gtccacctgt ctggtccata 4681 gtgacgtcct gaagagccag tctgtaaaat aaccaaccac ttacttagcg tttggataga 4741 ctccatgcct tctctctccc tgcaaagaaa aatcttgaac atttatgatg tcaattagtg 4801 aaagtattga aaatactaaa ttataactaa aagcaacttt ttatgttatt gaaaatattg 4861 aaagaactga tattaatgat aatcatttta tttttcatct ctctgatata cccaatgtgt 4921 ggcaatcagc ccctaccatg agcattaatg ccatgtaaag ctggctttct ggagtctgcc 4981 aggtccacac gaagtgctac ccccagtttc tgctgttact tgctgcctcc aggccagggg 5041 agcaggagat gctcagctct ggtgcctttt cttttatttc agtgctgcct tcccacccac 5101 ccagtccatt caccctcacc tctccctgca tggaggcaac agcatttcca agatgtacct 5161 tgggaagtca ggatagctgg aagtaagggt ttttggggat ccctgtggtc tcttcactga 5221 tcatcgataa gcatttaaga gtgtgctaac accattcaca gaggtccctg gaaaaaaata 5281 tataattttc tcacaaacaa ccaatacata acctatggca gttggttgaa taattattga 5341 aaaaaagaac acgactgaaa aagttcatct gatgcctctg agcatgtgat aaagtcctgg 5401 gtagacatgg aaacgtgttc ttacaaatga accttcggat ctatgaaaga aaatatagta 5461 gggggactaa ggaacataga attttattag catatgtgat tttaccctta tctttgtttc 5521 taatttaaag aaacaatttc agaaagtgtt agaagaattc tatgtttaat aatgaatatt 5581 gttgagttca aaatattctc atatacccag atttacagag atgacacagt attgaaatgg 5641 caggtgggct ctgtaagtta tttggggatt aatgaaatcc agggtcttgg caggggggac 5701 tcaatgtggc cctctcactc atccctcata tgtcaagagt ccttcctgtc ttatatgcac 5761 atgctccaag gtccatactc agtaggggga aatctagccg gatggttccc caattgttcc 5821 tcggctggat tttcatcatc agaaatttcc aactcatctc atcagttttc ttgtaagaaa 5881 aaaacatttt tcactttgta gcacaatttt agttattttt gagtcttctt ccctaatagt 5941 ttgtaagctt aaagggacaa ttttaatttt cccctggtta aagagggatt ctaatatttc 6001 caagaatgtt ttccactgga attgatggga aatggtttct aaatggcatc gaagctggtc 6061 tttaatgatc ttttgcaatg acttgggaaa aacaccctcc ctcactacag gatcttcatg 6121 ttgttaataa tataaacaaa cctttgaaat tagcattgca aaagaatccc atttttgttt 6181 cgtaccacac tgttcaaagg aaaattgttt atctctcttc ctttctctct ctctcttatt 6241 tgacagaata gcaagtgttt agataatttc agatatattc ctaagtattt taccagctgt 6301 ggaataagtt gtgtttgtcc atcactgtgt agctaccgtt cacatgttct ggctctgtac 6361 tccacgctta tcgcgtgaac cctcacatgt tggactttca cagtgtgatc tctcacctgt 6421 cttggaaact ccttcataaa gccgctcttc cttaggcctg ggctgtttgg aagtcctggt 6481 gaaactttgc ggttcacatt ttaaattcct gaatgacctt ttcattctct ctttctttat 6541 ctttgttttt gtccttcatt ccctcctttt gttcttcctt cctgcctttc acgtcaccta 6601 cttatccgac tatttcctcc atttgcttct tttctatttg ccatcacctg tgacccgagt 6661 ggaacaaaaa ccaacacata gttttaagct acacctttct gcacctgatg taaaatgaca 6721 ttaatccact atttaacttc taaaattatt tctgaataca gttgcagata ggctggtttt 6781 catgaggaag ctcgcttggc tttagtctta catttaaatt ctttgaaagt ggctcccagt 6841 gctattcagg gtcctttctt gaggccagac tcatcaccat ctactattga ttttacagtg 6901 cactgacagt ttacaggaag gagaagaaca gaatttctgc acactacaca acatgtgggc 6961 ttcctgtagc tccagggaca tgtagctttg gtgaaactgg ggttttgtaa cctctgaatg 7021 atatggactt agtgaattct aaggaactca gggggcacgt ggtcaggctc caccgcacag 7081 aagagccaca gtctccagac tcatggcgtt ccctccagaa ctccccattc ccctctgagc 7141 atatttctat gctgctgctt cttgtgactt acatgaggca tctcagcttc ttcatgtttg 7201 tagggatggt tccctaagcc tgttcaagta ggctctcact agagttacca ttcatatttg 7261 agaagaaaag aacatttatt aaatgtatgt gtggggctga ggttctaaag gaattgaaaa 7321 gagacgataa acattgaatg agggtgaggg catccctgct ggggagaaac ctctgtcccc 7381 taggaagagt ccgttcatgt catgtggttt gggatgaacc aggggtctgc ccccatcggg 7441 tcacaggtga tgccaaatag aaaagaagca aatggaggaa atagtcggat aagtaggtga 7501 cgtgaaaggc aggacctggt caccccagca agtgctatgg acagttcccg gaaacggttg 7561 cccacttcac aggtccatgg gtctgaccct tggactctgc caggatcaac tgcccagagt 7621 gccagagttt tagccaaagg tgtacttact tccttattta tctccaaaag gatggaaact 7681 gtgggagtca aagcctattt tgctgagtgt tcccactgga tcctctgtag aattagcagg 7741 tcatgctgtc aaaatcatgg acaaaggctg ggtgcagtgg ctcatgccta taatcccagc 7801 actttgggag gccaaggtgg gcggatcacc tgagctcagg agtttaagac cagcctgggc 7861 aacatgggga aactccatct ctacaaaata tacaaaatat tagccagcca tcgtggtgcg 7921 tgcctgtggt cccagtttct tgggaggctg aggcgggagt atcatttgag ccaggaggtt 7981 gaggttgcag tgagctgaga tcacatcact gcactccatc ctgggtgaca gagagagacc 8041 ctgtctcaaa aaaaaa // LOCUS HSU52965 2666 bp DNA PRI 19-JUL-1996 DEFINITION Human putative transcriptional regulator ENX-1 mRNA, complete cds. ACCESSION U52965 NID g1279912 KEYWORDS vertebrate polycomb-group gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2666) AUTHORS Hobert,O., Jallal,B. and Ullrich,A. TITLE Interaction of Vav with ENX-1, a putative transcriptional regulator of homeobox gene expression JOURNAL Mol. Cell. Biol. 16 (6), 3066-3073 (1996) MEDLINE 96220494 REFERENCE 2 (bases 1 to 2666) AUTHORS Hobert,O. TITLE Direct Submission JOURNAL Submitted (28-MAR-1996) Department of Molecular Biology, Massachusetts General Hospital, Wellman 8, Boston, MA 02114, USA FEATURES Location/Qualifiers source 1..2666 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T-cell" /cell_line="ZR-75" CDS 538..2379 /note="putative transcriptional regulator of chromatin activity; human homolog of the Drosophila Polycomb group gene Enhancer of zeste; contains CXC domain and SET domain; interacts with the Vav protooncogene product" /codon_start=1 /product="ENX-1" /db_xref="PID:g1279913" /translation="MGDEVLDQDGTFIEELIKNYDGKVHGDRECGFINDEIFVELVNA LGQYNDDDDDDDGDDPEEREEKQKDLEDHRDDKESRPPRKFPSDKIFEAISSMFPDKG TAEELKEKYKELTEQQLPGALPPECTPNIDGPNAKSVQREQSLHSFHTLFCRRCFKYD CFLHPFHATPNTYKRKNTETALDNKPCGPQCYQHLEGAKEFAAALTAERIKTPPKRPG GRRRGRLPNNSSRPSTPTINVLESKDTDSDREAGTETGGENNDKEEEEKKDETSSSSE ANSRCQTPIKMKPNIEPPENVEWSGAEASMFRVLIGTYYDNFCAIARLIGTKTCRQVY EFRVKESSIIAPAPAEDVDTPPRKKKRKHRLWAAHCRKIQLKKDGSSNHVYNYQPCDH PRQPCDSSCPCVIAQNFCEKFCQCSSECQNRFPGCRCKAQCNTKQCPCYLAVRECDPD LCLTCGAADHWDSKNVSCKNCSIQRGSKKHLLLAPSDVAGWGIFIKDPVQKNEFISEY CGEIISQDEADRRGKVYDKYMCSFLFNLNNDFVVDATRKGNKIRFANHSVNPNCYAKV MMVNGDHRIGIFAKRAIQTGEELFFDYRYSQADALKYVGIEREMEIP" BASE COUNT 877 a 515 c 584 g 690 t ORIGIN 1 ctagcctata gtaaatacat atatgtatgt gtaggtatat ataattattt tctaacctac 61 agaactgtga gaacctgcaa aatagttaag tgaactgtta ctaatcagag aagaactatg 121 gtgaatgaga gaggaactaa aagatgaaga caatttagtc atcgttcagt atgcatgggg 181 gattggttcc aagacccctt accaaatctg cagatgctca attatcttat ataaagtgat 241 gtagtatttc aaataaccta cacacatcct cttgtatatt ttaaatcatc tctcgattac 301 ttataatacc taacacaatg cctacacgtc atttgcatgg attcaacata gtacttggtg 361 tgtggcaaat tgaagttttg ctttttgaaa ctctatggaa ttttttctga atacttttga 421 tccatgattg gctgaatcca tggacgtgaa actcagggat aaggagggcc acctgcactc 481 tcccagtgtt ctaaaaagaa gatggatcat aaagatcata aaccactgtt ttttaaaatg 541 ggagatgaag ttttagatca ggatggtact ttcattgaag aactaataaa aaattatgat 601 gggaaagtac acggggatag agaatgtggg tttataaatg atgaaatttt tgtggagttg 661 gtgaatgccc ttggtcaata taatgatgat gacgatgatg atgatggaga cgatcctgaa 721 gaaagagaag aaaagcagaa agatctggag gatcaccgag atgataaaga aagccgccca 781 cctcggaaat ttccttctga taaaattttt gaagccattt cctcaatgtt tccagataag 841 ggcacagcag aagaactaaa ggaaaaatat aaagaactca ccgaacagca gctcccaggc 901 gcacttcctc ctgaatgtac ccccaacata gatggaccaa atgctaaatc tgttcagaga 961 gagcaaagct tacactcctt tcatacgctt ttctgtaggc gatgttttaa atatgactgc 1021 ttcctacatc cttttcatgc aacacccaac acttataagc ggaagaacac agaaacagct 1081 ctagacaaca aaccttgtgg accacagtgt taccagcatt tggagggagc aaaggagttt 1141 gctgctgctc tcaccgctga gcggataaag accccaccaa aacgtccagg aggccgcaga 1201 agaggacggc ttcccaataa cagtagcagg cccagcaccc ccaccattaa tgtgctggaa 1261 tcaaaggata cagacagtga tagggaagca gggactgaaa cggggggaga gaacaatgat 1321 aaagaagaag aagagaagaa agatgaaact tcgagctcct ctgaagcaaa ttctcggtgt 1381 caaacaccaa taaagatgaa gccaaatatt gaacctcctg agaatgtgga gtggagtggt 1441 gctgaagcct caatgtttag agtcctcatt ggcacttact atgacaattt ctgtgccatt 1501 gctaggttaa ttgggaccaa aacatgtaga caggtgtatg agtttagagt caaagaatct 1561 agcatcatag ctccagctcc cgctgaggat gtggatactc ctccaaggaa aaagaagagg 1621 aaacaccggt tgtgggctgc acactgcaga aagatacagc tgaaaaagga cggctcctct 1681 aaccatgttt acaactatca accctgtgat catccacggc agccttgtga cagttcgtgc 1741 ccttgtgtga tagcacaaaa tttttgtgaa aagttttgtc aatgtagttc agagtgtcaa 1801 aaccgctttc cgggatgccg ctgcaaagca cagtgcaaca ccaagcagtg cccgtgctac 1861 ctggctgtcc gagagtgtga ccctgacctc tgtcttactt gtggagccgc tgaccattgg 1921 gacagtaaaa atgtgtcctg caagaactgc agtattcagc ggggctccaa aaagcatcta 1981 ttgctggcac catctgacgt ggcaggctgg gggattttta tcaaagatcc tgtgcagaaa 2041 aatgaattca tctcagaata ctgtggagag attatttctc aagatgaagc tgacagaaga 2101 gggaaagtgt atgataaata catgtgcagc tttctgttca acttgaacaa tgattttgtg 2161 gtggatgcaa cccgcaaggg taacaaaatt cgttttgcaa atcattcggt aaatccaaac 2221 tgctatgcaa aagttatgat ggttaacggt gatcacagga taggtatttt tgccaagaga 2281 gccatccaga ctggcgaaga gctgtttttt gattacagat acagccaggc tgatgccctg 2341 aagtatgtcg gcatcgaaag agaaatggaa atcccttgac atctgctacc tcctccccct 2401 cctctgaaac agctgcctta gcttcaggaa cctcgagtac tgtgggcaat ttagaaaaag 2461 aacatgcagt ttgaaattct gaatttgcaa agtactgtaa gaataattta tagtaatgag 2521 tttaaaaatc aactttttat tgccttctca ccagctgcaa agtgttttgt accagtgaat 2581 ttttgcaata atgcagtatg gtacattttt caactttgaa taaagatact tgaacttgaa 2641 aaaaaaaaaa aaaaaaaaaa aaaaaa // LOCUS HSU53174 2102 bp mRNA PRI 08-JAN-1997 DEFINITION Human cell cycle checkpoint control protein mRNA, complete cds. ACCESSION U53174 NID g1765955 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2102) AUTHORS Lieberman,H.B., Hopkins,K.M., Nass,M., Demetrick,D. and Davey,S. TITLE A human homolog of the Schizosaccharomyces pombe rad9+ checkpoint control gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (24), 13890-13895 (1996) MEDLINE 97098491 REFERENCE 2 (bases 1 to 2102) AUTHORS Lieberman,H.B. TITLE Direct Submission JOURNAL Submitted (29-MAR-1996) Howard B. Lieberman, Center for Radiological Research, Columbia University, 630 W 168th Street, New York, NY 10032, USA FEATURES Location/Qualifiers source 1..2102 /organism="Homo sapiens" /db_xref="taxon:9606" 5'UTR 1..76 CDS 77..1252 /note="involved in resistance to hydroxyurea and gamma radiation" /codon_start=1 /product="cell cycle checkpoint control protein" /db_xref="PID:g1765956" /translation="MKCLVTGGNVKVLGKAVHSLSRIGDELYLEPLEDGLSLRTVNSS RSAYACFLFAPLFFQQYQAATPGQDLLRCKILMKSFLSVFRSLAMLEKTVEKCCISLN GRSSRLVVQLHCKFGVRKTHNLSFQDCESLQAVFDPASCPHMLRAPARVLGEAVLPFS PALAEVTLGIGRGRRVILRSYHEEEADSTAKAMVTEMCLGEEDFQQLQAQEGVAITFC LKEFRGLLSFAESANLNLSIHFDAPGRPAIFTIKDSLLDGHFVLATLSDTDSHSQDLG SPERHQPVPQLQAHSTPHPDDFANDDIDSYMIAMETTIGNEGSRVLPSISLSPGPQPP KSPGPHSEEEDEAEPSTVPGTPPPKKFRSLFFGSILAPVRSPQGPSPVLAEDSEGEG" 3'UTR 1253..2102 BASE COUNT 401 a 675 c 595 g 431 t ORIGIN 1 gcgcgggaag ggaccccgga cccggaggtc gcggagagct gggcagtgtt ggccgctggc 61 ggagcgctgg ggcagcatga agtgcctggt cacgggcggc aacgtgaagg tgctcggcaa 121 ggccgtccac tccctgtccc gcatcgggga cgagctctac ctggaaccct tggaggacgg 181 gctctccctc cggacggtga actcctcccg ctctgcctat gcctgctttc tctttgcccc 241 gctcttcttc cagcaatacc aggcagccac ccctggtcag gacctgctgc gctgtaagat 301 cctgatgaag tctttcctgt ctgtcttccg ctcactggcg atgctggaga agacggtgga 361 aaaatgctgc atctccctga atggccggag cagccgcctg gtggtccagc tgcattgcaa 421 gttcggggtg cggaagactc acaacctgtc cttccaggac tgtgagtccc tgcaggccgt 481 cttcgaccca gcctcgtgcc cccacatgct ccgcgcccca gcacgggttc tgggggaggc 541 tgttctgccc ttctctcctg cactggctga agtgacgctg ggcattggcc gtggccgcag 601 ggtcatcctg cgcagctacc acgaggagga ggcagacagc actgccaaag ccatggtgac 661 tgagatgtgc cttggagagg aggatttcca gcagctgcag gcccaggaag gggtggccat 721 cactttctgc ctcaaggaat tccgggggct cctgagcttt gcagagtcag caaacttgaa 781 tcttagcatt cattttgatg ctccaggcag gcccgccatc ttcaccatca aggactcttt 841 gctggacggc cactttgtct tggccacact ctcagacacc gactcgcact cccaggacct 901 gggctcccca gagcgtcacc agccagtgcc tcagctccag gctcacagca caccccaccc 961 ggacgacttt gccaatgacg acattgactc ttacatgatc gccatggaaa ccactatagg 1021 caatgagggc tcgcgggtgc tgccctccat ttccctttca cctggccccc agccccccaa 1081 gagccccggt ccccactccg aggaggaaga tgaggctgag cccagtacag tgcctgggac 1141 tcccccaccc aagaagttcc gctcactgtt cttcggctcc atcctggccc ctgtacgctc 1201 cccccagggc cccagccctg tgctggcgga agacagtgag ggtgaaggct gaaccaagaa 1261 cctgaagcct gtacccagag gccttggact agacgaagcc ccagccagtg gcagaactgg 1321 gtctctcagc cctggggatc agaaaggtgg gcttgctgga gctgagctgt ttcactgcct 1381 ctcgcaggcc ccagctggct gtcactgtaa agctgtccca cagcggtcgg gcctgggccg 1441 ttatctcccc acaaccccca gccaatcagg actttccaga cttggccctg aactactgac 1501 gttcctacct cttatttctc attgagcctc aggctatact ccagctggcc aaggctggaa 1561 acctgtctcc ctcaggctca ccttcctaag gaaaatgtca tagtaggtgc tgctggcccc 1621 tggtgatcca gcttctctgc caatcatgac ctgttccttc ctgaagtcct gggcatgcat 1681 ctgggacccc cgtggagctg acaagttttc cttgctttcc tgatactctt tggcgctgac 1741 ttggaattct aagagccttg gacccgagtg tgtggctagg gttgccctgg ctggggcccg 1801 gtgccgagac tcccaagcgg ctctgtgcag aagagctgcc aggcagtgtc ttagatgtga 1861 gacggaggcc atggcgagaa tccagctttg acctttattc aagagaccag atgggttgcc 1921 ccaggatccg gctgccagcc ctgaggccaa gcacggctgg agacccacga cctggcctgc 1981 cgttgccctg agctgcagcc tcggccccag gatcctgctc acagtcaccg caggtgcagg 2041 caggaagcag ccctggggga ctggacgctg ctattgattc attaaaaaaa gaaaagaaaa 2101 at // LOCUS HSU53209 1563 bp mRNA PRI 28-AUG-1996 DEFINITION Human transformer-2 alpha (htra-2 alpha) mRNA, complete cds. ACCESSION U53209 NID g1256836 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1563) AUTHORS Dauwalder,B., Amaya-Manzanares,F. and Mattox,W. TITLE A human homologue of the Drosophila sex determination factor transformer-2 has conserved splicing regulatory functions JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (17), 9004-9009 (1996) MEDLINE 96392356 REFERENCE 2 (bases 1 to 1563) AUTHORS Mattox,W. TITLE Direct Submission JOURNAL Submitted (01-APR-1996) William Mattox, Molecular Genetics, University of Texas, MD Anderson Cancer Center, 1515 Holcombe Blvd., Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..1563 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Hela" gene 158..1006 /gene="htra-2 alpha" CDS 158..1006 /gene="htra-2 alpha" /note="pre-mRNA processing factor" /codon_start=1 /product="transformer-2 alpha" /db_xref="PID:g1256837" /translation="MSDVEENNFEGRESRSQSKSPTGTPARVKSESRSGSRSPSRVSK HSESHSRSRSKSRSRSRRHSHRRYTRSRSHSHSHRRRSRSRSYTPEYRRRRSRSHSPM SNRRRHTGSRANPDPNTCLGVFGLSLYTTERDLREVFSRYGPLSGVNVVYDQRTGRSR GFAFVYFERIDDSKEAMERANGMELDGRRIRVDYSITKRAHTPTPGIYMGRPTHSGGG GGGGGGGGGGGGGRRRDSYYDRGYDRGYDRYEDYDYRYRRRSPSPYYSRYRSRSRSRS YSPRRY" BASE COUNT 421 a 298 c 369 g 475 t ORIGIN 1 gctaggcctc gaggcctagc cgagtgggag tggagtggag cggctgtggt tgccgactct 61 ttcctcttcc ccacggtcca gtcagcgggt taattaggcc atcggccctc gagccgagac 121 ttgtctctta tttagttctg gggagcgcct cgtcgacatg agtgatgtgg aggaaaacaa 181 cttcgagggc agagagtctc gctctcagtc aaaatctcca acgggaactc ctgctcgtgt 241 aaaatcggag agcaggtcag gatctcgtag tccatcaagg gtttccaaac actctgaatc 301 ccattctcga tcaagatcaa aatccaggtc gaggtcaagg agacattctc atagacgtta 361 cactcgatcc agatcccact ctcactctca taggagacga tctcgaagta gatcatatac 421 accagaatac cggcggcgaa ggagccgaag ccattctcca atgtctaacc ggagaagaca 481 tactggcagc agggcaaatc cagatcccaa cacttgcctt ggagtgtttg gcctcagttt 541 gtacacaaca gagagggatc ttcgtgaagt attttctcga tatggaccat tgagtggtgt 601 caatgtggtt tatgatcagc gaactgggcg atctcgagga tttgcttttg tgtattttga 661 gagaatagat gactcaaagg aggctatgga aagggcaaat ggaatggagc tggatggtag 721 aagaattcgg gtggattatt ctataaccaa gagagcgcac acaccaacac caggcatcta 781 catgggcaga ccaactcata gtggtggggg tggtggagga ggcggcggcg gtggaggtgg 841 aggtggtggc agacgtcgag attcttacta tgatagagga tatgatcgtg ggtatgacag 901 atatgaagac tatgattacc gatacagaag acgatcacct tctccttatt atagtcgata 961 tagatcacga tcaagatctc gttcctacag cccaagacgc tattgataac ggaatggttg 1021 caattaagga catttttttt cctctttttt tttttttttt ttttaattct gagatttccc 1081 caagctgtgg attcttccta ctccttaaga aaaaaacttt ggtttattta gcatctacac 1141 ttttgtcagt tgtgttgctg ttttccaccc attttattat actcttaaaa gatgtaattg 1201 ttgtcatttt gaacagttaa acatcttgag tataaaaaga accccaatgt tatgttatgc 1261 tttgtaaatt tttttttttg cttttaccta gataaacttc tagctaatca aataaggaaa 1321 gaaactgtct ttttaaagct tcttttgtgt tagatactgt attagagatc tgcatttatc 1381 atgagttcct tttttttttt ttaactttat ttttgggaaa gtaacacatg aagtagttca 1441 gtcatgtcag gtttgtctgg ggtggaatgg aacagtcagg tagttgaaag ttttttttta 1501 gagatgaaaa gcttgtgaac tcctgtaaaa catgctgtat ttgaaataca tctgttaaaa 1561 ctt // LOCUS HSU53347 2885 bp mRNA PRI 03-AUG-1996 DEFINITION Human neutral amino acid transporter B mRNA, complete cds. ACCESSION U53347 NID g1478280 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2885) AUTHORS Kekuda,R., Prasad,P.D., Fei,Y.J., Torres-Zamorano,V., Sinha,S., Yang-Feng,T.L., Leibach,F.H. and Ganapathy,V. TITLE Cloning of the sodium-dependent, broad-scope, neutral amino acid transporter Bo from a human placental choriocarcinoma cell line JOURNAL J. Biol. Chem. 271 (31), 18657-18661 (1996) MEDLINE 96324943 REFERENCE 2 (bases 1 to 2885) AUTHORS Ganapathy,V., Kekuda,R., Prasad,P.D., Fei,Y.J., Zamorano,V.T., Gibson,L., Yang-Feng,T.L. and Leibach,F.H. TITLE Direct Submission JOURNAL Submitted (01-APR-1996) Vadivel Ganapathy, Biochemistry and Molecular Biology, Medical College of Georgia, 1120, 15th Street, Augusta, GA 30912, USA FEATURES Location/Qualifiers source 1..2885 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="choriocarcinoma cell line, JAR" /chromosome="19" /map="19q13.3" CDS 620..2245 /codon_start=1 /product="neutral amino acid transporter B" /db_xref="PID:g1478281" /translation="MVADPPRDSKGLAAAEPPPTGAWQLASIEDQGAAAGGYCGSRDL VRRCLRANLLVLLTVVAVVAGVALGLGVSGAGGALALGPGALEAFVFPGELLLRLLRM IILPLVVCSLIGGAASLDPGALGRLGAWALLFFLVTTLLASALGVGLALALQPGAASA AINASVGAAGSAENAPSKEVLDSFLDLARNIFPSNLVSAAFRSYSTTYEERNITGTRV KVPVGQEVEGMNILGLVVFAIVFGVALRKLGPEGELLIRFFNSFNEATMVLVSWIMWY APVGIMFLVAGKIVEMEDVGLLFARLGKYILCCLLGHAIHGLLVLPLIYFLFTRKNPY RFLWGIVTPLATAFGTSSSSATLPLMMKCVEENNGVAKHISRFILPIGATVNMDGAAL FQCVAAVFIAQLSQQSLDFVKIITILVTATASSVGAAGIPAGGVLTLAIILEAVNLPV DHISLILAVDWLVDRSCTVLNVEGDALGAGLLQNYVDRTESRSTEPELIQVKSELPLD PLPVPTEEGNPLLKHYRGPAGDATVASEKESVM" BASE COUNT 530 a 902 c 854 g 599 t ORIGIN 1 cggcacgccc gggaggcttt ctctggctgg taaccgctac tcccggacac cagaccaccg 61 ccttccgtac acaggggccc gcatcccacc ctcccggacc taagagcctg ggtcccctgt 121 ttccggagtc cgcttcccgg cccccagatt ctggcatccc agccctcagt gtccaagacc 181 caggcagccc gggtccccgc ctcccggatc caggcgtccg ggatctgcgc caccagaacc 241 tagcctcctg cagacctccg ccatctgggg gcactcaacc tcctggagcc aagggcccca 301 cgtcccaccc agagaaactc tcgtattccc agctcctagg gccaaggaac ccgggcgctc 361 cgaactccca gctttcggac atctggcaca cggggcagag cagagaagcc tcagcgccca 421 gcctggggaa tttaaacact ccagcttcca agagccaagg aacttcagtg ctgtgaactc 481 acaactctaa ggagccctcc aaagttccag tctccaggtg ctgttactca actcagtcct 541 aggaacgtcg ggtcctggga aggagcccaa gcgctcccag ccagcttcca ggcgctaaga 601 aaccccggtg cttcccatca tggtggccga tcctcctcga gactccaagg ggctcgcagc 661 ggcggagcca ccgccaacgg gggcctggca gctggcctcc atcgaggacc aaggcgcggc 721 agcaggcggc tactgcggtt cccgggacct ggtgcgccgc tgccttcgag ccaacctgct 781 tgtgctgctg acagtggtgg ccgtggtggc cggcgtggcg ctgggactgg gggtgtcggg 841 ggccgggggt gcgctggcgt tgggcccggg agcgcttgag gccttcgtct tcccgggcga 901 gctgctgctg cgtctgctgc ggatgatcat cttgccgctg gtggtgtgca gcttgatcgg 961 cggcgccgcc agcctggacc ccggcgcgct cggccgtctg ggcgcctggg cgctgctctt 1021 tttcctggtc accacgctgc tggcgtcggc gctcggagtg ggcttggcgc tggctctgca 1081 gccgggcgcc gcctccgccg ccatcaacgc ctccgtggga gccgcgggca gtgccgaaaa 1141 tgcccccagc aaggaggtgc tcgattcgtt cctggatctt gcgagaaata tcttcccttc 1201 caacctggtg tcagcagcct ttcgctcata ctctaccacc tatgaagaga ggaatatcac 1261 cggaaccagg gtgaaggtgc ccgtggggca ggaggtggag gggatgaaca tcctgggctt 1321 ggtagtgttt gccatcgtct ttggtgtggc gctgcggaag ctggggcctg aaggggagct 1381 gcttatccgc ttcttcaact ccttcaatga ggccaccatg gttctggtct cctggatcat 1441 gtggtacgcc cctgtgggca tcatgttcct ggtggctggc aagatcgtgg agatggagga 1501 tgtgggttta ctctttgccc gccttggcaa gtacattctg tgctgcctgc tgggtcacgc 1561 catccatggg ctcctggtac tgcccctcat ctacttcctc ttcacccgca aaaaccccta 1621 ccgcttcctg tggggcatcg tgacgccgct ggccactgcc tttgggacct cttccagttc 1681 cgccacgctg ccgctgatga tgaagtgcgt ggaggagaat aatggcgtgg ccaagcacat 1741 cagccgtttc atcctgccca tcggcgccac cgtcaacatg gacggtgccg cgctcttcca 1801 gtgcgtggcc gcagtgttca ttgcacagct cagccagcag tccttggact tcgtaaagat 1861 catcaccatc ctggtcacgg ccacagcgtc cagcgtgggg gcagcgggca tccctgctgg 1921 aggtgtcctc actctggcca tcatcctcga agcagtcaac ctcccggtcg accatatctc 1981 cttgatcctg gctgtggact ggctagtcga ccggtcctgt accgtcctca atgtagaagg 2041 tgacgctctg ggggcaggac tcctccaaaa ttatgtggac cgtacggagt cgagaagcac 2101 agagcctgag ttgatacaag tgaagagtga gctgcccctg gatccgctgc cagtccccac 2161 tgaggaagga aaccccctcc tcaaacacta tcgggggccc gcaggggatg ccacggtcgc 2221 ctctgagaag gaatcagtca tgtaaacccc gggagggacc ttccctgccc tgctgggggt 2281 gctctttgga cactggatta tgaggaatgg ataaatggat gagctagggc tctgggggtc 2341 tgcctgcaca ctctggggag ccaggggccc cagcaccctc caggacagga gatctgggat 2401 gcctggctgc tggagtacat gtgttcacaa gggttactcc tcaaaacccc cagttctcac 2461 tcatgtcccc aactcaaggc tagaaaacag caagatggag aaataatgtt ctgctgcgtc 2521 cccaccgtga cctgcctggc ctcccctgtc tcagggagca ggtcacaggt caccatgggg 2581 aattctagcc cccactgggg ggatgttaca acaccatgct ggttattttg gcggctgtag 2641 ttgtgggggg atgtgtgtgt gcacgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 2701 tctgtgacct cctgtcccca tggtacgtcc caccctgtcc ccagatcccc tattccctcc 2761 acaataacag aaacactccc agggactctg gggagaggct gaggacaaat acctgctgtc 2821 actccagagg acattttttt tagcaataaa attgagtgtc aactattaaa aaaaaaaaaa 2881 aaaaa // LOCUS HSU53445 3025 bp mRNA PRI 07-MAY-1996 DEFINITION Human ovarian cancer downregulated myosin heavy chain homolog (Doc1) mRNA, complete cds. ACCESSION U53445 NID g1297318 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3025) AUTHORS Mok,S.C., Wong,K.K., Chan,R.K., Lau,C.C., Tsao,S.W., Knapp,R.C. and Berkowitz,R.S. TITLE Molecular cloning of differentially expressed genes in human epithelial ovarian cancer JOURNAL Gynecol Oncol 52 (2), 247-252 (1994) MEDLINE 94148289 REFERENCE 2 (bases 1 to 3025) AUTHORS Wong,K.K. and Mok,S.C. TITLE Cloning and sequencing of full length Doc1 and Doc2 mRNAs JOURNAL Unpublished REFERENCE 3 (bases 1 to 3025) AUTHORS Wong,K.K. and Mok,S.C. TITLE Direct Submission JOURNAL Submitted (02-APR-1996) Kwong Kwok Wong, Molecular Biosciences, Pacific Northwest National Laboratory, Battelle Blvd., Richland, WA 99352, USA FEATURES Location/Qualifiers source 1..3025 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="mesothelial MESO306" /chromosome="3" gene 135..2393 /gene="Doc1" CDS 135..2393 /gene="Doc1" /note="myosin heavy chain homolog" /codon_start=1 /product="DOC1" /db_xref="PID:g1297319" /translation="MVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEE EQKATRLEKELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDELEETNR SLRKAEEELQDIKEKISKGEYGNAGIMAEVEELIKMEEQCRDLNKRLERETLQSKDFK LEVEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKERMTTKQLSQELESLKVRIKEL EAIESRLEKTEFTLKEDLTKLKTLTVMFVDERKTMSEKLKKTEDKLQAASSQLQVEQN KVTTVTEKLIEETKRALKSKTDVEEKMYSVTKERDDLKNKLKAEEEKGNDLLSRVNML KNRLQSLEAIEKDFLKNKLNQDSGKSTTALHQENNKIKELSQEVERLKLKLKDMKAIE DDLMKTEDEYETLERRYANERDKAQFLSKELEHVKMELAKYKLAEKTETSHEQWLFKR LQEEEAKSGHLSREVDALKEKIHEYMATEDLICHLQGDHSVCKKKLNQQENRNRDLGR EIENLTKELERYRHFSKSLRPSLNGRRISDPQVFSKEVQTEAVDNEPPDYKSLIPLER AVINGQLYEESENQDEDPNDEGSVLSFKCSQSTPCPVNRKLWIPWMKSKEGHLQNGKM QTKPNANFVQPGDLVLSHTPGQPLHIKVTPDHVQNTATLEITSPTTESPHSYTSTAVI PNCGTPKQRITILQNASITPVKSKTSTEDLMNLEQGMSPITMATFARAQTPESCGSLT PERTMSLFRFWL" BASE COUNT 1147 a 619 c 629 g 630 t ORIGIN 1 gcacgagcag gcagttcaga ttaaagaagc taattgatca agaaatcaag tctcaggagg 61 agaaggagca agaaaaggag aaaagggtca ccaccctgaa agaggagctg accaagctga 121 agtcttttgc tttgatggtg gtggatgaac agcaaaggct gacggcacag ctcacccttc 181 aaagacagaa aatccaagag ctgaccacaa atgcaaagga aacacatacc aaactagccc 241 ttgctgaagc cagagttcag gaggaagagc agaaggcaac cagactagag aaggaactgc 301 aaacgcagac cacaaagttt caccaagacc aagacacaat tatggcgaag ctcaccaatg 361 aggacagtca aaatcgccag cttcaacaaa agctggcagc actcagccgg cagattgatg 421 agttagaaga gacaaacagg tctttacgaa aagcagaaga ggagctgcaa gatataaaag 481 aaaaaatcag taagggagaa tatggaaacg ctggtatcat ggctgaagtg gaagagctca 541 taaaaatgga ggagcagtgc agagatctca ataagaggct tgaaagggag acgttacaga 601 gtaaagactt taaactagag gttgaaaaac tcagtaaaag aattatggct ctggaaaagt 661 tagaagacgc tttcaacaaa agcaaacaag aatgctactc tctgaaatgc aatttagaaa 721 aagaaaggat gaccacaaag cagttgtctc aagaactgga gagtttaaaa gtaaggatca 781 aagagctaga agccattgaa agtcggctag aaaagacaga attcactcta aaagaggatt 841 taactaaact gaaaacatta actgtgatgt ttgtagatga acggaaaaca atgagtgaaa 901 aattaaagaa aactgaagat aaattacaag ctgcttcttc tcagcttcaa gtggagcaaa 961 ataaagtaac aacagttact gagaagttaa ttgaggaaac taaaagggcg ctcaagtcca 1021 aaaccgatgt agaagaaaag atgtacagcg taaccaagga gagagatgat ttaaaaaaca 1081 aattgaaagc ggaagaagag aaaggaaatg atctcctgtc aagagttaat atgttgaaaa 1141 ataggcttca atcattggaa gcaattgaga aagatttcct aaaaaacaaa ttaaatcaag 1201 actctgggaa atccacaaca gcattacacc aagaaaacaa taagattaag gagctctctc 1261 aagaagtgga aagactgaaa ctgaagctaa aggacatgaa agccattgag gatgacctca 1321 tgaaaacaga agatgaatat gagactctag aacgaaggta tgctaatgaa cgagacaaag 1381 ctcaattttt atctaaagag ctagaacatg ttaaaatgga acttgctaag tacaagttag 1441 cagaaaagac agagaccagc catgaacaat ggcttttcaa aaggcttcaa gaagaagaag 1501 ctaagtcagg gcacctctca agagaagtgg atgcattaaa agagaaaatt catgaataca 1561 tggcaactga agacctaata tgtcacctcc agggagatca ctcagtctgc aaaaaaaaac 1621 taaatcaaca agaaaacagg aacagagatt taggaagaga gattgaaaac ctcactaagg 1681 agttagagag gtaccggcat ttcagtaaga gcctcaggcc tagtctcaat ggaagaagaa 1741 tttccgatcc tcaagtattt tctaaagaag ttcagacaga agcagtagac aatgaaccac 1801 ctgattacaa gagcctcatt cctctggaac gtgcagtcat caatggtcag ttatatgagg 1861 agagtgagaa tcaagacgag gaccctaatg atgagggatc tgtgctgtcc ttcaaatgca 1921 gccagtctac tccatgtcct gttaacagaa agctatggat tccctggatg aaatccaagg 1981 agggccatct tcagaatgga aaaatgcaaa ctaaacccaa tgccaacttt gtgcaacctg 2041 gagatctagt cctaagccac acacctgggc agccacttca tataaaggtt actccagacc 2101 atgtacaaaa cacagccact cttgaaatca caagtccaac cacagagagt cctcactctt 2161 acacgagtac tgcagtgata ccgaactgtg gcacgccaaa gcaaaggata accatcctcc 2221 aaaacgcctc cataacacca gtaaagtcca aaacctctac cgaagacctc atgaatttag 2281 aacaaggcat gtccccaatt accatggcaa cctttgccag agcacagacc ccagagtctt 2341 gtggttctct aactccagaa aggacaatgt ccctattcag gttttggctg tgactggttc 2401 agctagctct cctgagcagg gacgctcccc agaaccaaca gaaatcagtg ccaagcatgc 2461 gatattcaga gtctccccag accggcagtc atcatggcag tttcagcgtt caaacagcaa 2521 tagctcaagt gtgataacta ctgaggataa taaaatccac attcacttag gaagtcctta 2581 catgcaagct gtagccagcc cttcagcacc actgcaggat aaccgaactc aaggcttaat 2641 taacggggca ctaaacaaaa caaccaataa agtcaccagc agtattacta tcacaccaac 2701 agccacacct cttcctcgac aatcacaaat tacagtaagt aatatatata actgaccacg 2761 ctcaccctca tccagtccat actgatattt ttgcaaggaa ctcaatcctt ttttaatcat 2821 ccctccatat cccccaagac tgactgaact cgtactttgg gaaggtttgt gcatgaacta 2881 tacaagagta tctgaaacta actgttgcct gcatagtcat atcgagtgtg cacttactgt 2941 atatcttttc atttacatac ttgtatggaa aatatttagt ctgcacttgt ataaatacat 3001 ctttatgtat ttgaaaaaaa aaaaa // LOCUS HSU53446 3268 bp mRNA PRI 07-MAY-1996 DEFINITION Human mitogen-responsive phosphoprotein DOC-2 mRNA, complete cds. ACCESSION U53446 NID g1297329 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3268) AUTHORS Mok,S.C., Wong,K.K., Chan,R.K., Lau,C.C., Tsao,S.W., Knapp,R.C. and Berkowitz,R.S. TITLE Molecular cloning of differentially expressed genes in human epithelial ovarian cancer JOURNAL Gynecol Oncol 52 (2), 247-252 (1994) MEDLINE 94148289 REFERENCE 2 (bases 1 to 3268) AUTHORS Wong,K.K. and Mok,S.C. TITLE Cloning and sequencing of full length DOC-1 and DOC-2 mRNAs JOURNAL Unpublished REFERENCE 3 (bases 1 to 3268) AUTHORS Wong,K.K. and Mok,S.C. TITLE Direct Submission JOURNAL Submitted (03-APR-1996) Kwong-Kwok Wong, Molecular Biosciences Department, P7-56, Pacific Northwest National Laboratory, Battlelle Blvd., Richland, WA 99352, USA FEATURES Location/Qualifiers source 1..3268 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="primary normal ovarian epithelial" /chromosome="5" CDS 145..2457 /note="mitogen-responsive phosphoprotein" /codon_start=1 /product="DOC-2" /db_xref="PID:g1297330" /translation="MSNEVETSATNGQPDQQAAPKAPSKKEKKKGPEKTDEYLLARFK GDGVKYKAKLIGIDDVPDARGDKMSQDSMMKLKGMAARGRSQGQHKQRIWVNISLSGI KIIDEKTGVIEHEHPVNKISFIARDVTDNRAFGYVCGGEGQHQFFTIKTGQQAEPLVV DLKDLFQVIYNVKKKEEEKKKIEEASKAVENGSEALRILDDQTNKLKSGVDQMDLFGD MSTPPDLNSPTESKDILLVDLNSEIDTNQNSLRENPFLTNGITSCSLPRPTPQASFLP ENAFSANLNFFPTPNPDPFRDDPFTQPDQSTPSSFDSLKSPDQKKENSSSSSTPLSNG PLNGDVDYFGQQFDQISNRTGKQEAQAGPWPFSSSQTQPAVRTQNGVSEREQNGFSVK SSPNPFVGSPPKGLSIQNGVKQDLESSVQSSPHDSIAIIPPPQSTKPGRGRRTAKSSA NDLLASDIFAPPVSEPSGQASPTGQPTALQPNPLDLFKTSAPAPVGPLVGLGGVTVTL PQAGPWNTASLVFNQSPSMAPGAMMGGQPSGFSQPVIFGTSPAVSGWNQPSPFAASTP PPVPVVWGPSASVAPNAWSTTSPLGNPFQSNIFPAPAVSTQPPSMHSSLLVTPPQPPP RAGPPKDISSDAFTALDPLGDKEIKDVKEMFKDFQLRQPPAVPARKGEQTSSGTLSAF ASYFNSKVGIPQENADHDDFDANQLLNKINEPPKPAPRQVSLPVTKSTDNAFENPFFK DSFGSSQASVASSQPVSSEMYRDPFGNPFA" BASE COUNT 914 a 826 c 664 g 864 t ORIGIN 1 gccggggaag tcatgctcgc ttcacggagg caatagctag ccggtgtctg tgggaggtta 61 tgtttatttg agacttctcc atcgggatcg cctggtgtca ccaagtgtcc actggtactg 121 aggtttgctg cctgccttct tgccatgtct aacgaagtag aaacaagtgc aaccaatggt 181 cagcccgacc aacaggccgc accaaaagca ccctcaaaga aggaaaaaaa gaaaggccct 241 gaaaagacag atgaatatct cctagcaagg ttcaaaggcg atggcgtaaa atataaggcc 301 aagctgattg gcattgatga tgtgccagat gcaagagggg ataaaatgag ccaagactct 361 atgatgaaac taaagggaat ggcggcacgt ggtcggtctc agggacaaca caaacaaagg 421 atctgggtca acatttccct ttctgggata aaaataattg atgagaaaac tggggtaata 481 gagcatgaac atccagtaaa taagatttct ttcattgccc gtgatgtgac agacaaccgg 541 gcatttggtt acgtgtgtgg aggagaaggc cagcatcagt tttttaccat aaaaaccggg 601 caacaggctg aaccattagt tgttgatctt aaagaccttt ttcaagttat ctataatgta 661 aagaaaaagg aagaagaaaa gaaaaagata gaggaagcca gcaaagcagt tgagaatggg 721 agtgaggccc taaggattct agatgaccaa actaacaaac tgaaatcggg tgttgaccag 781 atggatttgt ttggggacat gtctacacct cctgacctaa atagtccaac agaaagcaaa 841 gatatcctgt tagtggatct aaactctgaa atcgacacca atcagaattc tttaagagaa 901 aatccattct taacaaacgg catcacctcc tgttctcttc ctcgaccaac gcctcaggca 961 tccttcttgc ctgaaaatgc cttttctgcc aatctcaact tctttcccac ccctaatcct 1021 gatcctttcc gtgacgatcc tttcacacag ccagaccaat cgacaccttc ttcgtttgat 1081 tctctcaaat ctccagatca gaagaaagag aattcgagta gctcgtctac tccgctgagt 1141 aatgggcccc tgaatggtga tgttgactac tttggtcagc aatttgacca gatctctaac 1201 cggactggca aacaggaagc tcaggcaggc ccatggccct tttcaagttc gcaaacccag 1261 ccagcagtga gaactcaaaa tggggtatct gaaagagaac agaacggctt ctctgtcaaa 1321 tcctccccga acccttttgt gggaagccct cccaaaggac tgtccataca gaatggcgta 1381 aagcaggact tggaaagctc tgtccagtcc tcaccacatg actccatagc cattatccca 1441 cctccacaaa gtaccaaacc aggaagaggc agaaggactg ctaagtcttc agccaatgac 1501 ttgcttgcat cagacatctt tgctcctccc gtctcagaac cttcaggcca ggcgtcaccc 1561 acaggacaac ctacagccct gcagcccaac cctctggatc tcttcaaaac aagtgctcct 1621 gccccagtgg ggcccctggt gggtctaggt ggtgtaactg tcacactccc tcaggcagga 1681 ccatggaaca cagcatcttt ggtcttcaat cagtcccctt caatggctcc gggagccatg 1741 atgggtggtc aaccttcagg ttttagtcag cccgtcattt ttggtacaag tccagctgtt 1801 tcaggttgga accagccttc accctttgca gcctcaactc cccctccagt gcctgttgtc 1861 tggggccctt ctgcatctgt ggcacccaat gcttggtcaa caacaagccc tttggggaat 1921 ccttttcaga gcaatatttt tccagctcct gctgtgtcca ctcagccccc atccatgcac 1981 tcctctctcc tggtcactcc tcctcagcca cctcccagag ctggccctcc caaggacatc 2041 tccagtgatg ccttcactgc cttagaccca cttggggata aagagatcaa ggatgtgaaa 2101 gaaatgttta aggatttcca actgcggcag ccacctgctg tgcccgcgcg gaagggagag 2161 cagacttctt ctgggacttt gagtgccttt gccagttatt tcaacagcaa ggttggcatt 2221 cctcaggaga atgcagacca tgatgacttt gatgctaatc aactattgaa caagatcaat 2281 gaaccaccaa agccagctcc cagacaagtt tccctgccag ttaccaaatc tactgacaat 2341 gcatttgaga accctttctt taaagattct tttggttcat cacaagcctc tgtggcttct 2401 tctcaacctg tatcttctga gatgtatagg gatccatttg gaaatccttt tgcctaaatt 2461 ctgaacttgg tctgcagacc atccagagga ataaaaaggt tggccttagt agtcaaaaac 2521 aaagctgata gccagacacg ttctgatttc tgcccttgtt ccagctttga cgtattatct 2581 gttgccttat ttctcattgc ctcttctact tgtaaaatgc ttttcacttt ctgtctaggt 2641 taaagctaaa ctgaatctat ggctttaaat aaattaagat cctaaactct ctagcttaag 2701 tgtaaatgaa gtacagtagt ttccctactg aaccctacct cttgtgtccc tggaaccttc 2761 tagaacacct gccttctacc ctctggttgg gagatgcagc caccacatcc cttcatatca 2821 tactgttttg aataaatttt caaatcctta ttgttcagag ttgtttgggg gttctgtttc 2881 agagcataaa acctaaaggt tatagtagaa caaggcacct tcttaaaaga aatcttgctt 2941 cagaccatca gttacagaga atttcctaaa gtaaaattga agcaactaca acttctcctt 3001 agacactttg gaatctaacc acttaaggac ctttttaaag agatagcttc tcttctttct 3061 gaagatcaat ttctcccaag gccaagattg tccttttctc ccatttcttg ctagctattg 3121 caaatgaggg aagaacatta ttcatctctc ctcccctttt ttttctgatt cttttttcag 3181 tcagttttgc tcctgggttc aagtagtatt accacccttt cacaagcaac agactctcac 3241 agggcaaaaa aaaaaaaaaa aaaaaaaa // LOCUS HSU53468 1475 bp mRNA PRI 12-JUN-1996 DEFINITION Human NADH:ubiquinone oxidoreductase subunit B13 (B13) mRNA, complete cds. ACCESSION U53468 NID g1373172 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1475) AUTHORS Pata,I., Tensing,K. and Metspalu,A. TITLE A cDNA sequence of the human homologue to bovine NADH:ubiquinone oxidoreductase subunit B13 JOURNAL Unpublished REFERENCE 2 (bases 1 to 1475) AUTHORS Pata,I. TITLE Direct Submission JOURNAL Submitted (03-APR-1996) Illar Pata, Tartu University, Institute of Molecular and Cell Biology, 23 Riia St, Tartu, Estonia, Ee2400 FEATURES Location/Qualifiers source 1..1475 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="b13.6.2" /sex="male" /cell_line="U937" 5'UTR 1..23 gene 24..374 /gene="B13" CDS 24..374 /gene="B13" /EC_number="1.6.5.3" /function="mitochondrial respiratory chain complex I subunit" /note="similar to bovine NADH:ubiquinone oxidoreductase subunit B13" /codon_start=1 /evidence=experimental /product="NADH:ubiquinone oxidoreductase subunit B13" /db_xref="PID:g1373173" /translation="MAGVLKKTTGLVGLAVCNTPHERLRILYTKILDVLEEIPKNAAY RKYTEQITNEKLAMVKAEPDVKKLEDQLQGGQLEEVILQAEHELNLARKMREWKLWEP LVEEPPADQWKWPI" 3'UTR 375..1463 polyA_site 1464 BASE COUNT 519 a 189 c 288 g 479 t ORIGIN 1 gtcaccgagt cgttggcgct gtcatggcgg gtgtgctgaa gaagaccact ggccttgtgg 61 gattggctgt gtgcaatact cctcacgaga ggctaagaat attgtacaca aagattcttg 121 atgttcttga ggaaatccct aaaaatgcag catatagaaa gtatacagaa cagattacaa 181 atgagaagct ggctatggtt aaagcggaac cagatgttaa aaaattagaa gaccaacttc 241 aaggcggtca attagaagag gtgattcttc aggctgaaca tgaactaaat ctggcaagaa 301 aaatgaggga atggaaacta tgggagccat tagtggaaga gcctcctgcc gatcagtgga 361 aatggccaat ataattatta agtgactttg gtgtgttcat gggaaactga tgtaattaaa 421 tattctgtta tattaagagc gtgttcttat tactgacatt ttgtaatcaa gaaaagtgat 481 atagaaaata tgtaggagac tgttaaaatt ggtgattatg gtaatatggt catgtgaatc 541 aatttttgat ttataaagta ctcacacaag ttgtttcaaa gatgatattt ctgtgaacag 601 agaggccatg ggaagatttg aaaattatta aagaaaaatt cctacagatt ttcaatgcag 661 agaccataat caaaaagtaa actttcttta gtagtatgtt caatacatca tttaattttt 721 taagttatcc tgaagaagga aaggtcctta attattatag tctaaacaaa tttatagatt 781 actgtttgaa gtaaataata cgagtgaata ttttcaaatg tgataaaata gcacaagtgg 841 ctggtgataa aatttgaaat tatggttaac ctcagctgtg atcttatgta tgtaaagtga 901 aatttaaata gataattata ggttgattac aaaatccata gtgtcatttt attttagtca 961 ttattgaatt ataccattta ctctgttttc ttatagtctt aattttatta tattttgttg 1021 ttactgtatt atatttgaaa accttcaaat tagaatacat tgtacagtta aagaaattga 1081 cttggtactt aaaagaaaga tttcccattg catacaggtt attggagaaa ttttcctttt 1141 gttgcatttg tggaagttag ttttctggcc cgtggccttt aattttctta atcaacctaa 1201 ttacatcagg atagaggtag agtttctgta aaagaagaga cattaagagt tcctgaaatt 1261 tatatctggc ataccgatag gcttatattc aaaacatctt agtcatacga ccataaatta 1321 aaagtggagt cactaaatag tttgcagtac gtttctaata taagtgtagg tgggtatcaa 1381 aacaagacaa atgctgttca gggaaagaag ttggcaagct taaggttaaa caaaaataaa 1441 attacatgtg ttttcgcctt cctaaaaaaa aaaaa // LOCUS HSU53476 1469 bp mRNA PRI 20-MAY-1997 DEFINITION Human proto-oncogene Wnt7a mRNA, complete cds. ACCESSION U53476 NID g2105099 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1469) AUTHORS Bui,T.D., Lako,M., Lejeune,S., Curtis,A.R., Strachan,T., Lindsay,S. and Harris,A.L. TITLE Isolation of a full-length human WNT7A gene implicated in limb development and cell transformation, and mapping to chromosome 3p25 JOURNAL Gene 189 (1), 25-29 (1997) MEDLINE 97305141 REFERENCE 2 (bases 1 to 1469) AUTHORS Bui,T.D., Lejeune,S. and Harris,A.L. TITLE Direct Submission JOURNAL Submitted (03-APR-1996) T.D. Bui, Imperial Cancer Research Fund, Institute of Molecular Medicine, John Radcliffe Hospital, Headington, Oxford OX3 9DU, UK FEATURES Location/Qualifiers source 1..1469 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /dev_stage="17-week embryo" /clone="ICRFp507A0938" /clone_lib="human fetal brain cDNA library in pSPORT 1 vector, generated by oligo dT priming (from ICRF library number 507), cloning sites: NotI and SalI" gene 25..1074 /gene="Wnt7a" CDS 25..1074 /gene="Wnt7a" /function="signal patterning" /note="proto-oncogene; signal transducer" /codon_start=1 /product="Wnt7a protein" /db_xref="PID:g2105100" /translation="MNRKALRCLGHLFLSLGMVCLRIGGFSSVVALGATIICNKIPGL APRQRAICQSRPDAIIVIGEGSQMGLDECQFQFRNGRWNCSALGERTVFGKELKVGSR DGAFTYAIIAAGVAHAITAACTHGNLSDCGCDKEKQGQYHRDEGWKWGGCSADIRYGI GFAKVFVDAREIKQNARTLMNLHNNEAGRKILEENMKLECKCHGVSGSCTTKTCWTTL PQFRELGYVLKDKYNEAVHVEPVRASRNKRPTFLKIKKPLSYRKPMDTDLVYIEKSPN YCEEDPVTGSVGTQGRACNKTAPQASGCDLMCCGRGYNTHQYARVWQCNCKFHWCCYV KCNTCSERTEMYTCK" BASE COUNT 347 a 435 c 416 g 271 t ORIGIN 1 cacgcgtccg ggccaatcgg gactatgaac cggaaagcgc tgcgctgcct gggccacctc 61 tttctcagcc tgggcatggt ctgcctccgg atcggtggct tctcctcagt ggtagctctg 121 ggcgcaacga tcatctgtaa caagatccca ggcctggctc ccagacagcg ggcgatctgc 181 cagagccggc ccgacgccat catcgtcata ggagaaggct cacaaatggg cctggacgag 241 tgtcagtttc agttccgcaa tggccgctgg aactgctctg cactgggaga gcgcaccgtc 301 ttcgggaagg agctcaaagt ggggagccgg gacggtgcgt tcacctacgc catcattgcc 361 gccggcgtgg cccacgccat cacagctgcc tgtacccatg gcaacctgag cgactgtggc 421 tgcgacaaag agaagcaagg ccagtaccac cgggacgagg gctggaagtg gggtggctgc 481 tctgccgaca tccgctacgg catcggcttc gccaaggtct ttgtggatgc ccgggagatc 541 aagcagaatg cccggactct catgaacttg cacaacaacg aggcaggccg aaagatcctg 601 gaggagaaca tgaagctgga atgtaagtgc cacggcgtgt caggctcgtg caccaccaag 661 acgtgctgga ccacactgcc acagtttcgg gagctgggct acgtgctcaa ggacaagtac 721 aacgaggccg ttcacgtgga gcctgtgcgt gccagccgca acaagcggcc caccttcctg 781 aagatcaaga agccactgtc gtaccgcaag cccatggaca cggacctggt gtacatcgag 841 aagtcgccca actactgcga ggaggacccg gtgaccggca gtgtgggcac ccagggccgc 901 gcctgcaaca agacggctcc ccaggccagc ggctgtgacc tcatgtgctg tgggcgtggc 961 tacaacaccc accagtacgc ccgcgtgtgg cagtgcaact gtaagttcca ctggtgctgc 1021 tatgtcaagt gcaacacgtg cagcgagcgc acggagatgt acacgtgcaa gtgagccccg 1081 tgtgcacacc accctcccgc tgcaagtcag attgctggga ggactggacc gtttccaagc 1141 tgcgggctcc ctggcaggat gctgagcttg tcttttctgc tgaggaaggt acttttcctg 1201 ggtttcctgc aggcatccgt gggggaaaaa aaatctctca gaaccctcaa ctattctgtt 1261 ccacacccaa tgctgctcca ccctccccca gacacagccc aagtccctcc gcggctggag 1321 cgaagccttc tgcagcagga actctggacc cctgggcctc atcacagcaa tatttaacaa 1381 tttattctga taaaaataat attaatttat ttaattaaaa agaattcttc cacctcaaaa 1441 aaaaaaaaaa aaaaaaaaaa aaagggggg // LOCUS HSU53786 6457 bp mRNA PRI 07-SEP-1996 DEFINITION Human envoplakin (EVPL) mRNA, complete cds. ACCESSION U53786 NID g1524400 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6457) AUTHORS Ruhrberg,C., Hajibagheri,M.A., Simon,M., Dooley,T.P. and Watt,F.M. TITLE Envoplakin, a novel precursor of the cornified envelope that has homology to desmoplakin JOURNAL J. Cell Biol. 134 (3), 715-729 (1996) MEDLINE 96326676 REFERENCE 2 (bases 1 to 6457) AUTHORS Ruhrberg,C. and Watt,F.M. TITLE Direct Submission JOURNAL Submitted (06-APR-1996) Christiana Ruhrberg, Keratinocyte Laboratory, ICRF, 44 Lincoln's Inn Fields, London WC2A 3PX, UK FEATURES Location/Qualifiers source 1..6457 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="primary keratinocyte" 5'UTR 1..98 gene 99..6200 /gene="EVPL" CDS 99..6200 /gene="EVPL" /note="cornified envelope precursor" /codon_start=1 /product="envoplakin" /db_xref="PID:g1524401" /translation="MFKGLSKGSQGKGSPKGSPAKGSPKGSPSRHSRAATQELALLIS RMQANADQVERDILETQKRLQQDRLNSEQSQALQHQQETGRSLKEAEVLLKDLFLDVD KARRLKHPQAEEIEKDIKQLHERVTQECAEYRALYEKMVLPPDVGPRVDWARVLEQKQ KQVCAGQYGPGMAELEQQIAEHNILQKEIDAYGQQLRSLVGPDAATIRSQYRDLLKAA SWRGQSLGSLYTHLQGCTRQLSALAEQQRRILQQDWSDLMADPAGVRREYEHFKQHEL LSQEQSVNQLEDDGERMVELRHPAVGPIQAHQEALKMEWQNFLNLCICQETQLQHVED YRRFQEEADSVSQTLAKLNSNLDAKYSPAPGGPPGAPTELLQQLEAEEKRLAVTERAT GDLQRRSRDVAPLPQRRNPPQQPLHVDSICDWDSGEVQLLQGERYKLVDNTEPHAWVV QGPGGETKRRPAACFCIPAPDPDAVARASRLASELQALKQKLATVQSRLKASAVESLR PSQQAPSGSDLANPQAQKLLTQMTRLDGDLGQIERQVLAWARAPLSRPTPLEDLEGRI HSHEGTAQRLQSLGTEKETAQKECEAFLSTRPVGPAALQLPVALNSVKNKFSDVQVLC SLYGEKAKAALDLERQIQDADRVIRGFEATLVQEAPIPAEPGALQERVSELQRQRREL LEQQTCVLRLHRALKASEHACAALQNNFQEFCQDLPRQQRQVRALTDRYHAVGDQLDL REKVVQDAALTYQQFKNCKDNLSSWLEHLPRSQVRPSDGPSQIAYKLQAQKRLTQEIQ SRERDRATASHLSQALQAALQDYELQADTYRCSLEPTLAVSAPKRPRVAPLQESIQAQ EKNLAKAYTEVAAAQQQLLQQLEFARKMLEKKELSEDIRRTHDAKQGSESPAQAGRES EALKAQLEEERKRVARVQHELEAQRSQLLQLRTQRPLERLEEKEVVEFYRDPQLEGSL SRVKAQVEEEGKRRAGLQADLEVAAQKVVQLESKRKTMQPHLLTKEVTQVERDPGLDS QAAQLRIQIQQLRGEDAVISARLEGLKKELLALEKREVDVKEKVVVKEVVKVEKNLEM VKAAQALRLQMEEDAARRKQAEEAVAKLQARIEDLERAISSVEPKVIVKEVKKVEQDP GLLQESSRLRSLLEEERTKNATLARELSDLHSKYSVVEKQRPKVQLQERVHEIFQVDP ETEQEITRLKAKLQEMAGKRSGVEKEVEKLLPDLEVLRAQKPTVEYKEVTQEVVRHER SPEVLREIDRLKAQLNELVNSHGRSQEQLIRLQGERDEWRRDGAKVETKTVSKEVVRH EKDPVLEKEAEWLRQEVREAAQKRRAAEDAVYELQSKRLLLERRKPEEKVVVQEVVVT QKDPKLREEHSRLSGSLDEEVGRRRQLELEVQQLRAGVEEQEGLLSFQEDRSKKLAVE RELRQLTLRIQELEKRPPTVAGEDHHGGSGQAGEGPGPGEVHGSPAWDLDQEKTQVTE LNRECKNLQVQIDVLQKAKSQEKTIYKEVIRVQKDRVLEDERARVWEMLNRERTARQA REEEARRLRERIDRAETLGRTWSREESELQRARDQADQECGRLQQELRALERQKQQQT LQLQEESKLLSQKTESERQKAAQRGQELSRLEAAILREKDQIYEKERTLRDLHAKVSR EELSQETQTRETNLSTKISILEPETGKDMSPYEAYKRGIIDRGQYLQLQELECDWEEV TTSGPCGEESVLLDRKSGKQYSIEAALRCRRISKEEYHLYKDGHLPISEFALLVAGET KPSSSLSIGSIISKSPLASPAPQSTSFFSPSFSLGLGDDSFPIAGIYDTTTDNKCSIK TAVAKNMLDPITGQKLLEAQAATGGIVDLLSRERYSVHKAMERGLIENTSTQRLLNAQ KAFTGIEDPVTKKRLSVGEAVQKGWMPRESVLPHLQVQHLTGGLIDPKRTGRIPIQQA LLSGMISEELAQLLQDESSYEKDLTDPISKERLSYKEAMGRCRKDPLSGLLLLPAALE GYRCYRSASPTVPRSLR" 3'UTR 6201..6457 polyA_signal 6433..6438 /note="polyA signal" polyA_site 6457 BASE COUNT 1426 a 1956 c 2234 g 841 t ORIGIN 1 gctgaccagc cagtgaggac gcccgctgcc tcccacctgc cctcctgccg tctttcgcca 61 gccaagccca gcctgagcca gcacttgcct ttacgaccat gttcaagggg ctgagcaaag 121 gctcccaggg gaaggggtcc cccaagggct cccccgccaa ggggtccccc aaaggctccc 181 ccagcaggca cagccgggct gccacccagg agctggccct tctcatctcc cgcatgcaag 241 ccaacgccga ccaggtggag cgggacatcc tggagacgca gaagaggctg cagcaggacc 301 ggctgaacag tgagcagagc caggccctgc agcaccagca ggagacgggc cgcagcctga 361 aggaggctga ggtgctgctc aaggacctct tcctggacgt ggacaaggcc cggcggctca 421 agcacccgca ggctgaggag attgagaagg acatcaagca gctgcacgag cgggtgaccc 481 aggagtgtgc ggagtaccgt gccctgtacg agaagatggt gctgcccccc gacgtgggac 541 ccagggtcga ctgggcacgc gtgctggagc agaaacagaa gcaggtctgc gcaggccagt 601 acgggccggg catggcggag ctggagcaac agatcgccga gcacaacatc ctgcagaagg 661 agatcgacgc ctatgggcag cagctgcgga gcctcgtggg gccggatgca gccaccatcc 721 ggagccaata ccgagaccta ctgaaggcgg cgtcgtggcg cgggcagagc ctgggcagcc 781 tgtacacgca cctccagggc tgcacgcggc agctgagcgc cctggctgag cagcagcgcc 841 gcatcctgca gcaggactgg agcgacctca tggccgaccc tgcgggcgtg cggcgggagt 901 acgagcactt caagcagcac gagctgctga gccaggagca gagcgtgaac cagctggagg 961 acgacggcga gcgcatggtg gagctgcggc accccgcggt ggggcccatc caggcccacc 1021 aggaggccct gaagatggag tggcagaact tcctgaacct gtgtatctgc caggagaccc 1081 agctgcagca cgtggaggac taccgccggt tccaggaaga ggccgactca gtcagccaga 1141 ccctggcgaa gctcaactcc aacttggatg ccaagtacag ccctgcacct gggggccccc 1201 ctggcgcccc cacagagctg ctgcaacagc tggaggcaga ggaaaaacgg ctggccgtca 1261 ccgagagggc cactggggac ctgcagcggc gaagccggga tgtggcccct ctgccacagc 1321 gaagaaaccc ccctcagcag cccctgcacg tggacagcat ctgcgactgg gactcaggag 1381 aagtgcagct gctgcagggt gagcggtata agctggtaga taacactgaa ccgcacgcct 1441 gggtcgtgca gggccctggc ggggagacca agcgtcgtcc cgccgcctgc ttctgcatcc 1501 cagcaccaga ccctgatgct gtggccaggg cctcccggct ggcctcagag ctgcaggccc 1561 tgaagcagaa attggccaca gtccagagcc gcctgaaggc cagtgctgtg gagtctcttc 1621 ggcccagcca gcaggctcca tctggctcag acctggccaa cccacaggcc cagaagctcc 1681 tgacacagat gacccggctg gatggagacc tgggacagat agagaggcag gtgctggcct 1741 gggcgcgggc cccgctgagc cgccccacac ccttggagga cttggagggc cgcatccaca 1801 gccatgaggg cacagcccag cgcctgcaga gcctgggaac ggagaaggag acagcccaga 1861 aggagtgcga ggcgtttctg tccacgcggc ccgtgggccc cgctgccctg cagctgcccg 1921 tagccctcaa cagcgtgaag aacaagttca gtgacgtgca ggttctgtgc agcctctacg 1981 gggagaaagc caaggctgcc ctggatctgg agcggcagat ccaggatgcg gacagggtca 2041 tccgaggctt cgaggccacc ctggtgcagg aggcccccat ccctgctgaa ccgggggctc 2101 tgcaggagag ggtcagcgag ctgcagcgcc agcggaggga gctgctggaa cagcagacct 2161 gcgtgctgcg gctacaccgc gcgctgaagg cctcggagca cgcatgcgct gccctgcaga 2221 acaacttcca ggagttctgc caagacctgc ctcgccagca gcgccaggtg cgagccctca 2281 ccgaccgcta ccacgccgta ggggaccagc tggacctgcg ggagaaggtg gtgcaggatg 2341 ccgccctcac ctaccagcag ttcaagaact gcaaggataa cctgagctcc tggctggagc 2401 acctgccccg cagccaggtg cggcccagcg acggccccag ccagatcgcc tacaagctgc 2461 aggcgcagaa gaggctgacg caggagatcc agagccgaga gcgggacagg gccacagcat 2521 cccacctctc ccaggccctg caggcagcgc tccaggacta tgagctccag gcagacacct 2581 accgctgctc tttggagccc accctggcag tgtcagcccc caagagaccc cgagtggctc 2641 ccctgcaaga gagcatccaa gcccaggaga agaaccttgc aaaggcctat actgaggttg 2701 cagcagcaca gcagcagctg ctccagcagc tggagtttgc tagaaaaatg ctggagaaga 2761 aggagctcag tgaggacatc cgaaggaccc atgatgcaaa gcagggctcc gagagccctg 2821 cccaagcagg gagagagtca gaggccctga aggcccagct ggaagaggag aggaagcggg 2881 tggcccgggt gcagcatgag ctggaggcgc agaggagcca actgctgcag ctgaggaccc 2941 agcggccctt ggagaggctg gaggagaagg aagtggtaga gttctaccgg gacccccagc 3001 tggagggcag cctgtccagg gtgaaggccc aggtggagga ggagggcaag cggcgggctg 3061 gcctgcaggc agacctggaa gtggcagccc agaaggtcgt gcagctggaa agcaagagga 3121 agaccatgca gcctcatctg ctgaccaagg aggtcaccca ggtggagagg gaccccggcc 3181 tggacagcca ggcggcccag ctcaggatcc agatccagca gctccgcggg gaggatgccg 3241 tcatctcggc ccggctggaa gggctgaaga aggagctact ggcccttgag aagagggagg 3301 tggacgtgaa ggagaaggtc gtggtgaaag aggtagtcaa ggtggagaag aatctggaaa 3361 tggtcaaggc agcccaggct ctgaggctgc agatggagga ggatgctgcg cggaggaagc 3421 aggcggagga ggctgtggcc aagctacagg ctcgcatcga agacctggag cgggctatca 3481 gctcggtgga gcccaaggtc atcgtgaagg aggtgaagaa ggtggagcag gacccagggc 3541 tcctccagga gtcctccagg ctgaggagcc tcctcgagga ggagaggacc aagaacgcga 3601 cgctggccag ggagctgagc gacctgcaca gcaagtacag cgtggtggag aagcagaggc 3661 ccaaagtgca gctccaggag cgcgtccacg agatcttcca ggtggatccg gagacagagc 3721 aggagatcac tcggctcaag gccaagctgc aggagatggc gggcaagagg agcggtgtgg 3781 agaaggaggt ggagaagctg ctgcccgacc tggaggtcct gcgggcccag aagcccacgg 3841 tggagtacaa ggaggtgacc caggaggtgg tgaggcatga gaggagcccc gaggtgctgc 3901 gtgagattga ccgcctgaag gctcagctca acgagctcgt caacagccac gggcgctccc 3961 aggagcagct catccgcctg cagggtgagc gcgacgagtg gaggcgcgac ggggccaagg 4021 tggagaccaa gacggtgagc aaggaggtgg tgcgccacga gaaggacccg gtgctggaga 4081 aagaagcaga gtggctccgc caggaggtgc gggaggcggc ccagaagagg cgggccgcgg 4141 aggacgcggt gtacgagctg cagagcaagc gcctgctgct ggagaggagg aagcccgagg 4201 agaaggtggt ggtgcaggag gtggtggtca cccagaagga cccgaagctg cgcgaggagc 4261 acagccggct gagcgggagc ctggatgagg aggtgggccg gcggcgccag ctagagcttg 4321 aggtgcagca gctgcgggcc ggcgtggagg agcaggaggg cctgctcagc ttccaggagg 4381 accgcagcaa gaagctggcc gtggagaggg agctgcggca gctgaccttg aggatccagg 4441 agctcgagaa gcggcctccc acggttgcag gagaagatca tcatggagga agtggtcaag 4501 ctggagaagg acccggacct ggagaagtcc acggaagccc tgcgtgggac ctggaccagg 4561 agaagaccca ggtaaccgag ctgaatcggg agtgcaagaa cctgcaggtc cagattgacg 4621 tcctccagaa agccaaatcg caggagaaga ccatctacaa ggaagtgatc cgggtgcaga 4681 aggaccgcgt cctggaagat gagcgggccc gcgtgtggga gatgctcaac agggagcgca 4741 cggcccggca ggcccgggag gaggaggcac ggcgcctgcg ggagcgcatt gaccgggccg 4801 agacgctggg gagaacctgg tcccgggagg agtccgagct gcagagggcc cgggaccagg 4861 ccgaccagga gtgtgggcgg ctgcagcagg agctgcgggc tctggagagg cagaagcagc 4921 agcagacact gcagctgcag gaggagtcga agctgctcag ccagaagacg gagagcgagc 4981 gacagaaggc ggcccagcgg ggccaggagc tctcgcggct ggaggcggcc atcctccgcg 5041 agaaggacca gatctacgag aaggagcgga cgctccggga cctccacgcc aaggtgagcc 5101 gggaggagct cagccaggag acccagacgc gagagaccaa cctttccacc aagatctcca 5161 tcctggaacc cgagacgggg aaggacatgt ccccatacga ggcctacaag aggggcatca 5221 tcgacagggg ccagtacttg cagctgcagg agctcgagtg tgactgggag gaggtcacca 5281 cctcggggcc ctgtggggag gagtctgtgc tcctggaccg caagagcggg aagcagtact 5341 ccatcgaggc cgccctccgc tgccggcgca tctctaagga ggagtaccat ctgtacaagg 5401 acggccacct gcccatctcc gagtttgcgc tgcttgtagc tggggagacc aagccaagct 5461 cctcactctc catcggctct atcatctcca agtccccgct cgcctccccg gccccccaga 5521 gcaccagttt cttctctccc agcttctctc tcgggctcgg tgatgacagc ttccctatcg 5581 ccgggatcta tgacacaacc acagacaaca agtgcagcat caagacggcc gtggccaaga 5641 acatgctgga ccccatcact gggcagaagc tactggaggc ccaggcggcc acagggggca 5701 tcgtggacct gctcagccgt gagcgctact ctgtgcacaa ggcgatggag aggggcctga 5761 tcgagaacac ctccacacag aggctgctta acgcccagaa ggccttcacc ggcatcgagg 5821 accccgtcac caagaagagg ctctcggtgg gcgaggccgt ccagaagggc tggatgcccc 5881 gggagagcgt gctcccacac ctgcaggtgc agcacctgac cggggggctc atcgacccca 5941 agaggacagg ccgcatcccc atccagcagg ccctcctctc cgggatgatc agtgaagagc 6001 tggcccagct cctgcaggac gagtccagct acgagaagga tttgacagac cccatctcca 6061 aggaacggct gagctacaag gaggccatgg gccgctgccg caaagacccc ctgagcggcc 6121 tgctgctcct gccagcggca ctggaggggt accgctgcta ccgctccgcc tcccccaccg 6181 tcccgcgctc ccttcgctga cacgggccaa ggagccagtg gggaagtgcg tgtgttgggc 6241 caggtaggat acgtacacct cttgcctcag agcagcctca tcccaggcag tgggtcttcc 6301 ctctgtccaa ccactgtttt attattttac taacgaggtg atgggctccc tcccctaacc 6361 ttggagcctg atccatcccc agaccaggac agcagccact cagttcttcc tccacctcca 6421 cccagtgatc ccaataaacg aattctgtct ccccgtg // LOCUS HSU53830 1890 bp mRNA PRI 09-OCT-1997 DEFINITION Homo sapiens interferon regulatory factor 7A mRNA, complete cds. ACCESSION U53830 NID g2098580 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1890) AUTHORS Zhang,L. and Pagano,J.S. TITLE IRF-7, a new interferon regulatory factor associated with Epstein-Barr virus latency JOURNAL Mol. Cell. Biol. 17 (10), 5748-5757 (1997) MEDLINE 97459673 REFERENCE 2 (bases 1 to 1890) AUTHORS Zhang,L. and Pagano,J.S. TITLE Direct Submission JOURNAL Submitted (08-APR-1996) Luwen Zhang, Cancer Research, University of North Carolina, 110 Lineberger Cancer Center, Chapel Hill, NC 27599, USA FEATURES Location/Qualifiers source 1..1890 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="B cell" CDS 293..1804 /note="IRF-7A; negative regulator of interferon pathway; family of interferon regulatory factors" /codon_start=1 /product="interferon regulatory factor 7A" /db_xref="PID:g2098581" /translation="MALAPERAAPRVLFGEWLLGEISSGCYEGLQWLDEARTCFRVPW KHFARKDLSEADARIFKAWAVARGRWPPSSRGGGPPPEAETAERAGWKTNFRCALRST RRFVMLRDNSGDPADPHKVYALSRELCWREGPGTDQTEAEAPAAVPPPQGGPPGPFLA HTHAGLQAPGPLPAPAGDKGDLLLQAVQQSCLADHLLTASWGADPVPTKAPGEGQEGL PLTGACAGGPGLPAGELYGWAVETTPSPGPQPAALTTGEAAAPESPHQAEPYLSPSPS ACTAVQEPSPGALDVTIMYKGRTVLQKVVGHPSCTFLYGPPDPAVRATDPQQVAFPSP AELPDQKQLRYTEELLRHVAPGLHLELRGPQLWARRMGKCKVYWEVGGPPGSASPSTP ACLLPRNCDTPIFDFRVFFQELVEFRARQRRGSPRYTIYLGFGQDLSAGRPKEKSLVL VKLEPWLCRVHLEGTQREGVSSLDSSSLSLCLSSANSLYDDIECFLMELEQPA" CDS 892..1494 /note="ORF" /codon_start=1 /evidence=not_experimental /product="putative collagen homolog protein-a" /db_xref="PID:g2145010" /translation="MGGRSSPNQGSWRGTRRASPDWGLCWRPRAPCWGAVRVGSRDDP QPRAPARGTNDRRGRGPRVPAPGRAVPVTLPKRLHRGARAQPRGAGRDHHVQGPHGAA EGGGTPELHVPIRPPRPSCPGHRPPAGSIPQPCRAPGPEAAALHGGTAAARGPWVAPG ASGATAVGPAHGQVQGVLGGGRTPRLRQPLHPSLPAASEL" BASE COUNT 350 a 663 c 596 g 281 t ORIGIN 1 ggcacccagg gtccggcctg cgccttcccg ccaggcctgg acactggttc aacacctgtg 61 acttcatgtg tgcgcgccgg ccacacctgc agtcacacct gtagccccct ctgccaagag 121 atccataccg aggcagcgtc ggtggctaca agccctcagt ccacacctgt ggacacctgt 181 gacacctggc cacacgacct gtggccgcgg cctggcgtct gctgcgacag gagcccttac 241 ctcccctgtt ataacacctg accgccacct aactgcccct gcagaaggag caatggcctt 301 ggctcctgag agggcagccc cacgcgtgct gttcggagag tggctccttg gagagatcag 361 cagcggctgc tatgaggggc tgcagtggct ggacgaggcc cgcacctgtt tccgcgtgcc 421 ctggaagcac ttcgcgcgca aggacctgag cgaggccgac gcgcgcatct tcaaggcctg 481 ggctgtggcc cgcggcaggt ggccgcctag cagcagggga ggtggcccgc cccccgaggc 541 tgagactgcg gagcgcgccg gctggaaaac caacttccgc tgcgcactgc gcagcacgcg 601 tcgcttcgtg atgctgcggg ataactcggg ggacccggcc gacccgcaca aggtgtacgc 661 gctcagccgg gagctgtgct ggcgagaagg cccaggcacg gaccagactg aggcagaggc 721 ccccgcagct gtcccaccac cacagggtgg gcccccaggg ccattcttgg cacacacaca 781 tgctggactc caagccccag gccccctccc tgccccagct ggtgacaagg gggacctcct 841 gctccaggca gtgcaacaga gctgcctggc agaccatctg ctgacagcgt catggggggc 901 agatccagtc ccaaccaagg ctcctggaga gggacaagaa gggcttcccc tgactggggc 961 ctgtgctgga ggcccagggc tccctgctgg ggagctgtac gggtgggcag tagagacgac 1021 ccccagcccc gggccccagc ccgcggcact aacgacaggc gaggccgcgg ccccagagtc 1081 cccgcaccag gcagagccgt acctgtcacc ctccccaagc gcctgcaccg cggtgcaaga 1141 gcccagccca ggggcgctgg acgtgaccat catgtacaag ggccgcacgg tgctgcagaa 1201 ggtggtggga cacccgagct gcacgttcct atacggcccc ccagacccag ctgtccgggc 1261 cacagacccc cagcaggtag cattccccag ccctgccgag ctcccggacc agaagcagct 1321 gcgctacacg gaggaactgc tgcggcacgt ggcccctggg ttgcacctgg agcttcgggg 1381 gccacagctg tgggcccggc gcatgggcaa gtgcaaggtg tactgggagg tgggcggacc 1441 cccaggctcc gccagcccct ccaccccagc ctgcctgctg cctcggaact gtgacacccc 1501 catcttcgac ttcagagtct tcttccaaga gctggtggaa ttccgggcac ggcagcgccg 1561 tggctcccca cgctatacca tctacctggg cttcgggcag gacctgtcag ctgggaggcc 1621 caaggagaag agcctggtcc tggtgaagct ggaaccctgg ctgtgccgag tgcacctaga 1681 gggcacgcag cgtgagggtg tgtcttccct ggatagcagc agcctcagcc tctgcctgtc 1741 cagcgccaac agcctctatg acgacatcga gtgcttcctt atggagctgg agcagcccgc 1801 ctagaaccca gtctaatgag aactccagaa agctggagca gcccacctag agctggccgc 1861 ggccgcccag tctaataaaa agaactccag // LOCUS HSU54558 1881 bp mRNA PRI 03-SEP-1997 DEFINITION Human translation initiation factor eIF3 p66 subunit mRNA, complete cds. ACCESSION U54558 NID g2351377 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1881) AUTHORS Asano,K., Vornlocher,H.-P., Richter-Cook,N.J., Merrick,W.C., Hinnebusch,A.G. and Hershey,J.W.B. TITLE Structure of cDNAs encoding human eIF3 subunits: Possible roles in RNA binding and macromolecular assembly JOURNAL Journal of Biological Chemistry (1997) In press REFERENCE 2 (bases 1 to 1881) AUTHORS Asano,K. and Hershey,J.W.B. TITLE Direct Submission JOURNAL Submitted (09-APR-1996) K. Asano, Biological Chemistry, University of California, Davis, School of Medicine, Building MS1A, Davis, CA 95616, USA FEATURES Location/Qualifiers source 1..1881 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="uterus" CDS 85..1731 /note="translation initiation factor eIF3 p66 subunit" /codon_start=1 /product="eIF3-p66" /db_xref="PID:g2351378" /translation="MAKFMTPVIQDNPSGWGPCAVPEQFRDMPYQPFSKGDRLGKVAD WTGATYQDKRYTNKYSSQFGGGSQYAYFHEEDESSFQLVDTARTQKTAYQRNRMRFAQ RNLRRDKDRRNMLQFNLQILPKSAKQKERERIRLQKKFQKQFGVRQKWDQKSQKPRDS SVEVRSDWEVKEEMDFPQLMKMRYLEVSEPQDIECCGALEYYDKAFDRITTRSEKPLR SIKRIFHTVTTTDDPVIRKLAKTQGNVFATDAILATLMSCTRSVYSWDIVVQRVGSKL FFDKRDNSDFDLLTVSETANEPPQDEGNSFNSPRNLAMEATYINHNFSQQCLRMGKER YNFPNPNPFVEDDMDKNEIASVAYRYRRWKLGDDIDLIVRCEHDGVMTGANGEVSFIN IKTLNEWDSRHCNGVDWRQKLDSQRGAVIATELKNNSYKLARWTCCALLAGSEYLKLG YVSRYHVKDSSRHVILGTQQFKPNEFASQINLSVENAWGILRCVIDICMKLEEGKYLI LKDPNKQVIRVYSLPDGTFSSDEDEEEEEEEEEEEEEEET" BASE COUNT 528 a 452 c 497 g 404 t ORIGIN 1 gaattcggca cgagctaacg cggtccccgg cacgcaccat ctgttgccat cccggccggc 61 cgaggccatt gcagattttg gaagatggca aagttcatga cacccgtgat ccaggacaac 121 ccctcaggct ggggtccctg tgcggttccc gagcagtttc gggatatgcc ctaccagccg 181 ttcagcaaag gagatcggct aggaaaggtt gcagactgga caggagccac ataccaagat 241 aagaggtaca caaataagta ctcctctcag tttggtggtg gaagtcaata tgcttatttc 301 catgaggagg atgaaagtag cttccagctg gtggatacag cgcgcacaca gaagacggcc 361 taccagcgga atcgaatgag atttgcccag aggaacctcc gcagagacaa agatcgtcgg 421 aacatgttgc agttcaacct gcagatcctg cctaagagtg ccaaacagaa agagagagaa 481 cgcattcgac tgcagaaaaa gttccagaaa caatttgggg ttaggcagaa atgggatcag 541 aaatcacaga aaccccgaga ctcttcagtt gaagttcgta gtgattggga agtgaaagag 601 gaaatggatt ttcctcagtt gatgaagatg cgctacttgg aagtatcaga gccacaggac 661 attgagtgtt gtggggccct agaatactac gacaaagcct ttgaccgcat caccacgagg 721 agtgagaagc cactgcggag catcaagcgc atcttccaca ctgtcaccac cacagacgac 781 cctgtcatcc gcaagctggc aaaaactcag gggaatgtgt ttgccactga tgccatcctg 841 gccacgctga tgagctgtac ccgctcagtg tattcctggg atattgtcgt ccagagagtt 901 gggtccaaac tcttctttga caagagagac aactctgact ttgacctcct gacagtgagt 961 gagactgcca atgagccccc tcaagatgaa ggtaattcct tcaattcacc ccgcaacctg 1021 gccatggagg caacctacat caaccacaat ttctcccagc agtgcttgag aatggggaag 1081 gaaagataca acttccccaa cccaaacccg tttgtggagg acgacatgga taagaatgaa 1141 atcgcctctg ttgcgtaccg ttaccgcagg tggaagcttg gagatgatat tgaccttatt 1201 gtccgttgtg agcacgatgg cgtcatgact ggagccaacg gggaagtgtc cttcatcaac 1261 atcaagacac tcaatgagtg ggattccagg cactgtaatg gcgttgactg gcgtcagaag 1321 ctggactctc agcgaggggc tgtcattgcc acggagctga agaacaacag ctacaagttg 1381 gcccggtgga cctgctgtgc tttgctggct ggatctgagt acctcaagct tggttatgtg 1441 tctcggtacc acgtgaaaga ctcctcacgc cacgtcatcc taggcaccca gcagttcaag 1501 cctaatgagt ttgccagcca gatcaacctg agcgtggaga atgcctgggg cattttacgc 1561 tgcgtcattg acatctgcat gaagctggag gagggcaaat acctcatcct caaggacccc 1621 aacaagcagg tcatccgtgt ctacagcctc cctgatggca ccttcagctc tgatgaagat 1681 gaggaggaag aggaggagga agaagaggaa gaagaagagg aagaaactta aaccagtgat 1741 gtggagctgg agtttgtcct tccaccgaga ctacgagggc ctttgatgct tagtggaatg 1801 tgtgtctaac ttgctctctg acatttagca gatgaaataa aatatatatc tgtttagtct 1861 tttaaaaaaa aaaaaaaaaa a // LOCUS HSU54559 1280 bp mRNA PRI 03-SEP-1997 DEFINITION Human translation initiation factor eIF3 p40 subunit mRNA, complete cds. ACCESSION U54559 NID g2351379 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1280) AUTHORS Asano,K., Vornlocher,H.-P., Richter-Cook,N.J., Merrick,W.C., Hinnebusch,A.G. and Hershey,J.W.B. TITLE Structure of cDNAs encoding human eIF3 subunits: Possible roles in RNA binding and macromolecular assembly JOURNAL Journal of Biological Chemistry (1997) In press REFERENCE 2 (bases 1 to 1280) AUTHORS Asano,K. and Hershey,J.W.B. TITLE Direct Submission JOURNAL Submitted (09-APR-1996) K. Asano, Biological Chemistry, University of California, Davis, School of Medicine, Building MS1A, Davis, CA 95616, USA FEATURES Location/Qualifiers source 1..1280 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" CDS 6..1064 /note="translation initiation factor eIF3 p40 subunit" /codon_start=1 /product="eIF3-p40" /db_xref="PID:g2351380" /translation="MASRKEGTGSTATSSSSTAGAAGKGKGKGGSGDSAVKQVQIDGL VVLKIIKHYQEEGQGTEVVQGVLLGLVVEDRLEITNCFPFPQHTEDDADFDEVQYQME MMRSLRHVNIDHLHVGWYQSTYYGSFVTRALLDSQFSYQHAIEESVVLIYDPIKTAQG SLSLKAYRLTPKLMEVCKEKDFSPEALKKANITFEYMFEEVPIVIKNSHLINVLMWEL EKKSAVADKHELLSLASSNHLGKNLQLLMDRVDEMSQDIVKYNTYMRNTSKQQQQKHQ YQQRRQQENMQRQSRGEPPLPEEDLSKLFKPPQPPARMDSLLIAGQINTYCQNIKEFT AQNLGKLFMAQALQEYNN" BASE COUNT 446 a 273 c 274 g 287 t ORIGIN 1 gaaagatggc gtcccgcaag gaaggtaccg gctctactgc cacctcttcc agctccaccg 61 ccggcgcagc agggaaaggc aaaggcaaag gcggctcggg agattcagcc gtgaagcaag 121 tgcagataga tggccttgtg gtattaaaga taatcaaaca ttatcaagaa gaaggacaag 181 gaactgaagt tgttcaagga gtgcttttgg gtctggttgt agaagatcgg cttgaaatta 241 ccaactgctt tcctttccct cagcacacag aggatgatgc tgactttgat gaagtccaat 301 atcagatgga aatgatgcgg agccttcgcc atgtaaacat tgatcatctt cacgtgggct 361 ggtatcagtc cacatactat ggctcattcg ttacccgggc actcctggac tctcagttta 421 gttaccagca tgccattgaa gaatctgtcg ttctcattta tgatcccata aaaactgccc 481 aaggatctct ctcactaaag gcatacagac tgactcctaa actgatggaa gtttgtaaag 541 aaaaggattt ttcccctgaa gcattgaaaa aagcaaatat cacctttgag tacatgtttg 601 aagaagtgcc gattgtaatt aaaaattcac atctgatcaa tgtcctaatg tgggaacttg 661 aaaagaagtc agctgttgca gataaacatg aattgctcag ccttgccagc agcaatcatt 721 tggggaagaa tctacagttg ctgatggaca gagtggatga aatgagccaa gatatagtta 781 aatacaacac atacatgagg aatactagta aacaacagca gcagaaacat cagtatcagc 841 agcgtcgcca gcaggagaat atgcagcgcc agagccgagg agaacccccg ctccctgagg 901 aggacctgtc caaactcttc aaaccaccac agccgcctgc caggatggac tcgctgctca 961 ttgcaggcca gataaacact tactgccaga acatcaagga gttcactgcc caaaacttag 1021 gcaagctctt catggcccag gctcttcaag aatacaacaa ctaagaaaag gaagtttcca 1081 gaaaagaagt taacatgaac tcttgaagtc acaccagggc aactcttgga agaaatatat 1141 ttgcatattg aaaagcacag aggatttctt tagtgtcatt gccgattttg gctataacag 1201 tgtctttcta gccataataa aataaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1261 aaaaaaaaaa aaaaaaaaaa // LOCUS HSU54562 1485 bp mRNA PRI 19-SEP-1997 DEFINITION Human translation initiation factor eIF3 p48 subunit (Int-6) mRNA, complete cds. ACCESSION U54562 NID g2351381 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1485) AUTHORS Asano,K., Merrick,W.C. and Hershey,J.W. TITLE The translation initiation factor eIF3-p48 subunit is encoded by int-6, a site of frequent integration by the mouse mammary tumor virus genome JOURNAL J. Biol. Chem. 272 (38), 23477-23480 (1997) MEDLINE 97442403 REFERENCE 2 (bases 1 to 1485) AUTHORS Asano,K. and Hershey,J.W.B. TITLE Direct Submission JOURNAL Submitted (09-APR-1996) K. Asano, Biological Chemistry, University of California, School of Medicine, Building MS1A, Davis, CA 95616, USA FEATURES Location/Qualifiers source 1..1485 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" gene 1..1485 /gene="Int-6" CDS 7..1344 /gene="Int-6" /note="translation initiation factor eIF3 p48 subunit, Int-6 oncoprotein" /codon_start=1 /product="eIF3-p48" /db_xref="PID:g2351382" /translation="MAEYDLTTRIAHFLDRHLVFPLLEFLSVKEIYNEKELLQGKLDL LSDTNMVDFAMDVYKNLYSDDIPHALREKRTTVVAQLKQLQAETEPIVKMFEDPETTR QMQSTRDGRMLFDYLADKHGFRQEYLDTLYRYAKFQYECGNYSGAAEYLYFFRVLVPA TDRNALSSLWGKLASEILMQNWDAAMEDLTRLKETIDNNSVSSPLQSLQQRTWLIHWS LFVFFNHPKGRDNIIDLFLYQPQYLNAIQTMCPHILRYLTTAVITNKDVRKRRQVLKD LVKVIQQESYTYKDPITEFVECLYVNFDFDGAQKKLRECESVLVNDFFLVACLEDFIE NARLFIFETFCRIHQCISINMLADKLNMTPEEAERWIVNLIRNARLDAKIDSKLGHVV MGNNAVSPYQQVIEKTKSLSFRSQMLAMNIEKKLNQNSRSEAPNWATQDSGFY" BASE COUNT 490 a 262 c 304 g 429 t ORIGIN 1 ggcaagatgg cggagtacga cttgactact cgcatcgcgc actttttgga tcggcatcta 61 gtctttccgc ttcttgaatt tctctctgta aaggagatat ataatgaaaa ggaattatta 121 caaggtaaat tggaccttct tagtgatacc aacatggtag actttgctat ggatgtatac 181 aaaaaccttt attctgatga tattcctcat gctttgagag agaaaagaac cacagtggtt 241 gcacaactga aacagcttca ggcagaaaca gaaccaattg tgaagatgtt tgaagatcca 301 gaaactacaa ggcaaatgca gtcaaccagg gatggtagga tgctctttga ctacctggcg 361 gacaagcatg gttttaggca ggaatattta gatacactct acagatatgc aaaattccag 421 tacgaatgtg ggaattactc aggagcagca gaatatcttt atttttttag agtgctggtt 481 ccagcaacag atagaaatgc tttaagttca ctctggggaa agctggcctc tgaaatctta 541 atgcagaatt gggatgcagc catggaagac cttacacggt taaaagagac catagataat 601 aattctgtga gttctccact tcagtctctt cagcagagaa catggctcat tcactggtct 661 ctgtttgttt tcttcaatca ccccaaaggt cgcgataata ttattgacct cttcctttat 721 cagccacaat atcttaatgc aattcagaca atgtgtccac acattcttcg ctatttgact 781 acagcagtca taacaaacaa ggatgttcga aaacgtcggc aggttctaaa agatctagtt 841 aaagttattc aacaggagtc ttacacatat aaagacccaa ttacagaatt tgttgaatgt 901 ttatatgtta actttgactt tgatggggct cagaaaaagc tgagggaatg tgaatcagtg 961 cttgtgaatg acttcttctt ggtggcttgt cttgaggatt tcattgaaaa tgcccgtctc 1021 ttcatatttg agactttctg tcgcatccac cagtgtatca gcattaacat gttggcagat 1081 aaattgaaca tgactccaga agaagctgaa aggtggattg taaatttgat tagaaatgca 1141 agactggatg ccaagattga ttctaaatta ggtcatgtgg ttatgggtaa caatgcagtc 1201 tcaccctatc agcaagtgat tgaaaagacc aaaagccttt cctttagaag ccagatgttg 1261 gccatgaata ttgagaagaa acttaatcag aatagcaggt cagaggctcc taactgggca 1321 actcaagatt ctggcttcta ctgaagaacc ataaagaaaa gatgaaaaaa aaaactatca 1381 aagaaagatg aaataataaa actattatat aaagggtgac ttacattttg gaaacaacat 1441 attacgtata aattttgaag aattggaata aaattgattc atttt // LOCUS HSU54617 1798 bp mRNA PRI 06-SEP-1996 DEFINITION Human pyruvate dehydrogenase kinase isoform 4 mRNA, complete cds. ACCESSION U54617 NID g1399196 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1798) AUTHORS Rowles,J., Scherer,S.W., Xi,T., Majer,M., Nickle,D.C., Rommens,J.M., Popov,K.M., Harris,R.A., Riebow,N.L., Xia,J., Tsui,L., Bogardus,C. and Prochazka,M. TITLE Cloning and characterization of PDK4 on 7q21.3 encoding a fourth pyruvate dehydrogenase kinase isoenzyme in human JOURNAL J. Biol. Chem. 271 (37), 22376-22382 (1996) MEDLINE 96394293 REFERENCE 2 (bases 1 to 1798) AUTHORS Prochazka,M. TITLE Direct Submission JOURNAL Submitted (10-APR-1996) Michal Prochazka, CDNS/PECRB, NIDDK/NIH, 4212 N. 16th Street, Phoenix, AZ 85016, USA FEATURES Location/Qualifiers source 1..1798 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" /map="7q21.3" /tissue_type="frontal cortex" misc_feature 1..50 /note="determined by 5' RACE" CDS 223..1458 /codon_start=1 /product="pyruvate dehydrogenase kinase isoform 4" /db_xref="PID:g1399197" /translation="MKAARFVLRSAGSLNGAGLVPREVEHFSRYSPSPLSMKQLLDFG SENACERTSFAFLRQELPVRLANILKEIDILPTQLVNTSSVQLVKSWYIQSLMDLVEF HEKSPDDQKALSDFVDTLIKVRNRHHNVVPTMAQGIIEYKDACTVDPVTNQNLQYFLD RFYMNRISTRMLMNQHILIFSDSQTGNPSHIGSIDPNCDVVAVVQDAFECSRMLCDQY YLSSPELKLTQVNGKFPDQPIHIVYVPSHLHHMLFELFKNAMRATVEHQENQPSLTPI EVIVVLGKEDLTIKISDRGGGVPLRIIDRLFSYTYSTAPTPVMDNSRNAPLAGFGYGL PISRLYAKYFQGDLNLYSLSGYGTDAIIYLKALSSESIEKLPVFNKSAFKHYQMSSEA DDWCIPSREPKNLAKEVAM" BASE COUNT 500 a 440 c 393 g 465 t ORIGIN 1 agacttgaac ttgaatctcg aaccactgca tctccgactc tgcccagact cttcactccg 61 cggcaccctc aaaccccagc ccaggccggg gcgcacaagc cagccagcgc acctgcagtc 121 ctcgcccgga cgcgccgcgc cccctcggaa ccaggctctg ctccgagcag ccttcgcccc 181 tcaagccagc cacagtcccc gccaggccgg gtgggcgtca agatgaaggc ggcccgcttc 241 gtgctgcgca gcgctggctc gctcaacggc gccggcctgg tgccccgaga ggtggagcat 301 ttctcgcgct acagcccgtc cccgctgtcc atgaagcagc tactggactt tggttcagaa 361 aatgcatgtg aaagaacttc ttttgcattt ttgcgacaag aattgcctgt gagactcgcc 421 aacattctga aggaaattga tatcctcccg acccaattag taaatacctc ttcagtgcaa 481 ttggttaaaa gctggtatat acagagcctg atggatttgg tggaattcca tgagaaaagc 541 ccagatgacc agaaagcatt atcagacttt gtagatacac tcatcaaagt tcgaaataga 601 caccataatg tagtccctac aatggcacaa ggaatcatag agtataaaga tgcctgtaca 661 gttgacccag tcaccaatca aaatcttcaa tatttcttgg atcgatttta catgaaccgt 721 atttctactc ggatgctgat gaaccagcac attcttatat ttagtgactc acagacagga 781 aacccaagcc acattggaag cattgatcct aactgtgatg tggtagcagt ggtccaagat 841 gcctttgagt gttcaaggat gctctgtgat cagtattatt tatcatctcc agaattaaag 901 cttacacaag tgaatggaaa atttccagac caaccaattc acatcgtgta tgttccttct 961 cacctccatc atatgctctt tgaactattt aagaatgcaa tgcgggcaac agttgaacac 1021 caggaaaatc agccttccct tacaccaata gaggttattg ttgtcttggg aaaagaagac 1081 cttaccatta agatttcaga cagaggaggt ggtgttcccc tgagaattat tgaccgcctc 1141 tttagttata catactccac tgcaccaacg cctgtgatgg ataattcccg gaatgctcct 1201 ttggctggtt ttggttacgg cttgccaatt tctcgtctgt atgcaaagta ctttcaagga 1261 gatctgaatc tctactcttt atcaggatat ggaacagatg ctatcatcta cttaaaggct 1321 ttgtcttctg agtctataga aaaacttcca gtttttaaca agtcagcctt caaacattat 1381 cagatgagct ctgaggctga tgactggtgt atcccaagca gggaaccaaa gaacctggca 1441 aaagaagtgg ccatgtgaag agggacactc aggacacttt acgggatcaa agtgggtcta 1501 caccagtgct gcttcctgaa tgtttgtgtg tgaacccttg tttcctccaa aacaaacgac 1561 agcaacgaaa actccttaat cagaacactg atccaatgag gaatggagct tgtttctgtg 1621 acccaggaga acttagtgca agactacagg agttaacaga tggccagctc cttatttttt 1681 aatgtagaat aactcctgag tttatatcaa atcctgaaga aataagcctc agttttccat 1741 ctgtttttga taagaataag aaagggagtg agtgtgaaga tggtggttag cagtttcg // LOCUS HSU54644 2040 bp mRNA PRI 03-MAY-1997 DEFINITION Human tub homolog mRNA, complete cds. ACCESSION U54644 NID g1305496 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2040) AUTHORS Kleyn,P.W., Fan,W., Kovats,S.G., Lee,J.J., Pulido,J.C., Wu,Y., Berkemeier,L.R., Misumi,D.J., Holmgren,L., Charlat,O., Woolf,E.A., Tayber,O., Brody,T., Shu,P., Hawkins,F., Kennedy,B., Baldini,L., Ebeling,C., Alperin,G.D., Deeds,J., Lakey,N.D., Culpepper,J., Chen,H., Glucksmann-Kuis,M.A., Carlson,G.A., Duyk,G.D. and Moore,K.J. TITLE Identification and characterization of the mouse obesity gene tubby: a member of a novel gene family JOURNAL Cell 85 (2), 281-290 (1996) MEDLINE 96200779 REFERENCE 2 (bases 1 to 2040) AUTHORS Woolf,B. TITLE Direct Submission JOURNAL Submitted (10-APR-1996) Betty Woolf, Millennium Pharmaceuticals, 640 Memorial Drive, Cambridge, MA 02139, USA FEATURES Location/Qualifiers source 1..2040 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 153..1673 /codon_start=1 /product="tub homolog" /db_xref="PID:g1305497" /translation="MTSKPHSDWIPYSVLDDEGRNLRQQKLDRQRALLEQKQKKKRQE PLMVQANADGRPRSRRARQSEEQAPLVESYLSSSGSTSYQVQEADSLASVQLGATRPT APASAKRTKAAATAGGQGGAARKEKKGKHKGTSGPAALAEDKSEAQGPVQILTVGQSD HAQDAGETAAGGGERPSGQDLRATMQRKGISSSMSFDEDEEDEEENSSSSSQLNSNTR PSSATSRKSVREAASAPSPTAPEQPVDVEVQDLEEFALRPAPQGITIKCRITRDKKGM DRGMYPTYFLHLDREDGKKVFLLAGRKRKKSKTSNYLISVDPTDLSRGGDSYIGKLRS NLMGTKFTVYDNGVNPQKASSSTLESGTLRQELAAVCYETNVLGFKGPRKMSVIVPGM NMVHERVSIRPRNEHETLLARWQNKNTESIIELQNKTPVWNDDTQSYVLNFHGRVTQA SVKNFQIIHGNDPDYIVMQFGRVAEDVFTMDYNYPLCALQAFAIALSSFDSKLACE" BASE COUNT 480 a 584 c 630 g 344 t 2 others ORIGIN 1 tggcgtgcag caggggcctc ggcggggccc agcccnccgg tcccggggag gatacgtccc 61 gggggcggcc cgggagctga gcaggccccc cgcgccggcc cctccgggcc ccggcctcca 121 gagccgcagc caccgccccg cccccgagag acatgacttc caagccgcat tccgactgga 181 ttccctacag tgtcttagat gatgagggca gaaacctgag gcagcagaag cttgatcggc 241 agcgggccct gctggagcag aagcagaaga agaagcgcca ggagcccctg atggtgcagg 301 ccaatgcaga tgggcggccc cggagccggc gggcccggca gtcagaggaa caagcccccc 361 tggtggagtc ctacctcagc agcagtggca gcaccagcta ccaagttcaa gaggccgact 421 cactcgccag tgtgcagctg ggagccacgc gcccaacagc accagcttca gccaagagaa 481 ccaaggcggc agctacagca gggggccagg gtggcgccgc taggaaggag aagaagggaa 541 agcacaaagg caccagcggg ccagcagcac tggcagaaga caagtctgag gcccaaggcc 601 cagtgcagat tctgactgtg ggccagtcag accacgccca ggacgcaggg gagacggcag 661 ctggtggggg cgaacggccc agcgggcagg atctccgtgc cacgatgcag aggaagggca 721 tctccagcag catgagcttt gacgaggatg aggaggatga ggaggagaat agctccagct 781 cctcccagct aaatagtaac acccgcccca gctctgctac tagcaggaag tccgtcaggg 841 aggcagcctc agcccctagc ccaacagctc cagagcaacc agtggacgtt gaggtccagg 901 atcttgagga gtttgcactg aggccggccc cccagggtat caccatcaaa tgccgcatca 961 ctcgggacaa gaaagggatg gaccggggca tgtaccccac ctactttctg cacctggacc 1021 gtgaggatgg gaagaaggtg ttcctcctgg cgggaaggaa gagaaagaag agtaaaactt 1081 ccaattacct catctctgtg gacccaacag acttgtctcg aggaggggac agctatatcg 1141 ggaaactgcg gtccaacttg atgggcacca agttcactgt ttatgacaat ggagtcaacc 1201 ctcagaaggc ctcatcctcc actttggaaa gtggaacctt acgtcaggag ctggcagctg 1261 tgtgctacga gacaaacgtc ttaggcttca aggggcctcg gaagatgagc gtgattgtcc 1321 caggcatgaa catggttcat gagagagtct ctatccgccc ccgcaacgag catgagacac 1381 tgctagcacg ctggcagaat aagaacacgg agagtatcat cgagctgcaa aacaagacac 1441 ctgtctggaa tgatgacaca cagtcctatg tactcaactt ccatgggcgc gtcacacagg 1501 cctccgtgaa gaacttccag atcatccatg gcaatgaccc ggactacatc gtgatgcagt 1561 ttggccgggt agcagaggat gtgttcacca tggattacaa ctacccgctg tgtgcactgc 1621 aggcctttgc cattgccctg tccagcttcg acagcaagct ggcgtgcgag tagaggcctc 1681 ttcgtgccct ttggggttgc ccagcctgga gcggagcttg cctgcctgcc tgtggagaca 1741 gccctgccta tcctctgtat ataggccttc cgccagatga agctttggcc ctcagtgggc 1801 tccctggccc agccagccag gaactggctc ctttggctct gctactgagg caggggagta 1861 gtggagagcg ggtgggtggg tgttgaaggg attgagaatt aattctttcc atgccacgag 1921 gatcaacaca cactcccacc cttgggtagt aagtggttgt tgtnagtcgg tactttacca 1981 aagcttgagc aacctcttcc aagcttggga aagggccgca aaaaggcatt aggaggggag // LOCUS HSU54804 3003 bp mRNA PRI 14-SEP-1996 DEFINITION Human Has2 mRNA, complete cds. ACCESSION U54804 NID g1543067 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3003) AUTHORS Watanabe,K. and Yamaguchi,Y. TITLE Molecular identification of a putative human hyaluronan synthase JOURNAL J. Biol. Chem. 271 (38), 22945-22948 (1996) MEDLINE 96394371 REFERENCE 2 (bases 1 to 3003) AUTHORS Watanabe,K. and Yamaguchi,Y. TITLE Direct Submission JOURNAL Submitted (11-APR-1996) Yu Yamaguchi, The Burnham Institute, 10901 N. Torrey Pines Road, La Jolla, CA 92037, USA FEATURES Location/Qualifiers source 1..3003 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 536..2194 /note="putative hyaluronan synthase." /codon_start=1 /product="Has2" /db_xref="PID:g1543068" /translation="MHCERFLCILRIIGTTLFGVSLLLGITAAYIVGYQFIQTDNYYF SFGLYGAFLASHLIIQSLFAFLEHRKMKKSLETPIKLNKTVALCIAAYQEDPDYLRKC LQSVKRLTYPGIKVVMVIDGNSEDDLYMMDIFSEVMGRDKSATYIWKNNFHEKGPGET DESHKESSQHVTQLVLSNKSICIMQKWGGKREVMYTAFRALGRSVDYVQVCDSDTMLD PASSVEMVKVLEEDPMVGGVGGDVQILNKYDSWISFLSSVRYWMAFNIERACQSYFGC VQCISGPLGMYRNSLLHEFVEDWYNQEFMGNQCSFGDDRHLTNRVLSLGYATKYTARS KCLTETPIEYLRWLNQQTRWSKSYFREWLYNAMWFHKHHLWMTYEAIITGFFPFFLIA TVIQLFYRGKIWNILLFLLTVQLVGLIKSSFASCLRGNIVMVFMSLYSVLYMSSLLPA KMFAIATINKAGWGTSGRKTIVVNFIGLIPVSVWFTILLGGVIFTIYKESKRPFSESK QTVLIVGTLLYACYWVMLLTLYVVLINKCGRRKKGQQYDMVLDV" BASE COUNT 901 a 568 c 595 g 939 t ORIGIN 1 cgaagtcaag acgtctggaa agaattaccc agtcctggct tcgagcagcc cattgaacca 61 gagacttgaa acagccccag ccaaagactt ttctcccaat tctgcgcttc ctgggttctg 121 ctgagtcttc cacaggcttt tttttttttt tttttttttt aagacgaaaa agagattttc 181 tgttatcggg ggcagaaaga ctgaagcaca aaaaaaaaaa aaaagaaaag aaaagaaaag 241 aaaaaagaaa agttaattta tttttaaagc ataatttttt taagaattag actgaagtgc 301 aacggaaaca taaagagaat attagtgaaa ttatttttta aagtggggaa gaatcaaaca 361 tttaagactc ccctatcctt tttaaatgtt gtttttaaat ttcttatttt ttttggccgg 421 tcgtctcaaa ttcatctgat ctcttattac ctcaattttg gaaactgccc gccaccgacc 481 ctccgggacc acacagacag gctgaggacg actttatgac caagagctga acaagatgca 541 ttgtgagagg tttctatgta tcctgagaat aattggaacc acactctttg gagtctctct 601 cctccttgga atcacagctg cttatattgt tggctaccag tttatccaaa cggataatta 661 ctatttctct tttggactgt atggtgcctt tttggcatca cacctcatca tccaaagcct 721 gtttgccttt ttggagcacc gaaaaatgaa aaaatcccta gaaaccccca taaagttgaa 781 caaaacagtt gccctttgca tcgctgccta tcaagaagat ccagactact taaggaaatg 841 tttgcaatct gtgaaaaggc taacctaccc tgggattaaa gttgtcatgg tcatagatgg 901 gaactcagaa gatgaccttt acatgatgga catcttcagt gaagtcatgg gcagagacaa 961 atcagccact tatatctgga agaacaactt ccacgaaaag ggtcccggtg agacagatga 1021 gtcacataaa gaaagctcgc aacacgtaac gcaattggtc ttgtccaaca aaagtatctg 1081 catcatgcaa aaatggggtg gaaaaagaga agtcatgtac acagccttca gagcactggg 1141 acgaagtgtg gattatgtac aggtttgtga ttcagacact atgcttgacc cagcctcatc 1201 tgtggagatg gtaaaagttt tagaagaaga tcccatggtt ggaggtgttg ggggagatgt 1261 ccagatttta aacaagtacg attcctggat ctcattcctc agcagtgtaa gatattggat 1321 ggcttttaat atagaaaggg cctgtcagtc ttattttggg tgtgttcagt gcattagtgg 1381 acctctggga atgtacagaa actccttgtt gcatgagttt gtggaagatt ggtacaatca 1441 agaatttatg ggcaaccaat gtagctttgg tgatgacagg catctcacga accgggtgct 1501 gagcctgggc tatgcaacaa aatacacagc tcgatctaag tgccttactg aaacacctat 1561 agagtatctc agatggctaa accagcagac ccgttggagc aagtcctact tccgagaatg 1621 gctgtacaat gcaatgtggt ttcacaaaca tcacttgtgg atgacctacg aagcgattat 1681 cactggattc tttcctttct ttctcattgc cacagtaatc cagctcttct accggggtaa 1741 aatttggaac attctcctct tcttgttaac tgtccagcta gtaggtctca taaaatcatc 1801 ttttgccagc tgccttagag gaaatatcgt catggtcttc atgtctctct actcagtgtt 1861 atacatgtcg agtttacttc ccgccaagat gtttgcaatt gcaacaataa acaaagctgg 1921 gtggggcaca tcaggaagga aaaccattgt tgttaatttc ataggactca ttccagtatc 1981 agtttggttt acaatcctcc tgggtggtgt gattttcacc atttataagg agtctaaaag 2041 gccattttca gaatccaaac agacagttct aattgttgga acgttgctct atgcatgcta 2101 ttgggtcatg cttttgacgc tgtatgtagt tctcatcaat aagtgtggca ggcggaagaa 2161 gggacaacaa tatgacatgg tgcttgatgt atgatcttcc atgttttgac gtttgcagtc 2221 acacacaaca ccttagttcc tctaggggct gtacagtatt gtggcatcag ataatgccac 2281 caaaggagac atatcactgc tgctgggact tgaacaaaga catttatatg ggtttatttt 2341 cattctgcca aagtaaaaca atacatcaac aagaagaaac tcagatttaa cctgttattt 2401 ctatgaaaat gggatgaatt ctttgtttat gcactttttc cttactgtgc atccgcctga 2461 aagtgttttg gcctatatac ctcactagcc atgctttatg tgggttatca tggaagaaaa 2521 ggattttgga aactcaagga aaagttcttt caacctatac aacctaactt atggactgtt 2581 tgatagatga taattttttt tttttaggaa ggattttctt tttaacttta ccaaatgaaa 2641 tgccaaagga agttttaaag gccgtggctg tgctgtattt gatataattg tactgtgttt 2701 ttaaattgtg tatgccaatc ttaaagacaa attttgcata ttctctattt tacttttctg 2761 ccaaaataaa cctgttcttc cttttttaaa ataaaataag ttcttaaaaa atttatactt 2821 aaaaaatcct gcccaaaatg tgaagcttgg ttgactgatg ttcatgatag aaagaataaa 2881 atgtttctct ctctctacct tttaaaattg aatagtttat ttctgtgaaa gaagtattta 2941 aactttcaat attttaactt tttgttttta tttcttttag aaaaggccaa tatacctatc 3001 gcg // LOCUS HSU54826 1658 bp mRNA PRI 25-MAY-1996 DEFINITION Human mad-related protein MADR1 mRNA, complete cds. ACCESSION U54826 NID g1332713 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1658) AUTHORS Hoodless,P.A., Haerry,T., Abdollah,S., Stapleton,M., O'Connor,M.B., Attisano,L. and Wrana,J.L. TITLE MADR1, a MAD-related protein that functions in BMP2 signaling pathways JOURNAL Cell 85 (4), 489-500 (1996) MEDLINE 96222292 REFERENCE 2 (bases 1 to 1658) AUTHORS Attisano,L., Hoodless,P.A. and Wrana,J.L. TITLE Direct Submission JOURNAL Submitted (12-APR-1996) Liliana Attisano, Div. of Gastroenterology and Program in Developmental Biology, Hospital for Sick Children, 555 University Avenue, Toronto, Ontario M5G 1X8, Canada FEATURES Location/Qualifiers source 1..1658 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 241..1638 /codon_start=1 /product="mad-related protein MADR1" /db_xref="PID:g1332714" /translation="MNVTSLFSFTSPAVKRLLGWKQGDEEEKWAEKAVDALVKKLKKK KGAMEELEKALSCPGQPSNCVTIPRSLDGRLQVSHRKGLPHVIYCRVWRWPDLQSHHE LKPLECCEFPFGSKQKEVCINPYHYKRVESPVLPPVLVPRHSEYNPQHSLLAQFRNLG QNEPHMPLNATFPDSFQQPNSHPFPHSPNSSYPNSPGSSSSTYPHSPTSSDPGSPFQM PADTPPPAYLPPEDPMTQDGSQPMDTNMMAPPLPSEINRGDVQAVAYEEPKHWCSIVY YELNNRVGEAFHASSTSVLVDGFTDPSNNKNRFCLGLLSNVNRNSTIENTRRHIGKGV HLYYVGGEVYAECLSDSSIFVQSRNCNYHHGFHPTTVCKIPSGCSLKIFNNQEFAQLL AQSVNHGFETVYELTKMCTIRMSFVKGWGAEYHRQDVTSTPCWIEIHLHGPLQWLDKV LTQMGSPHNPISSVS" BASE COUNT 433 a 422 c 363 g 440 t ORIGIN 1 cactgcatgt gtattcgtga gttcgcggtt gaacaactgt tcctttactc tgctccctgt 61 ctttgtgctg actgggttac ttttttaaac actaggaatg gtaatttcta ctcttctgga 121 cttcaaacta agaagttaaa gagacttctc tgtaaataaa caaatctttt ctgctgtcct 181 tttgcatttg gagacagctt tatttcacca tatccaagga gtataactag tgctgtcatt 241 atgaatgtga caagtttatt ttcctttaca agtccagctg tgaagagact tcttgggtgg 301 aaacagggcg atgaagaaga aaaatgggca gagaaagctg ttgatgcttt ggtgaaaaaa 361 ctgaagaaaa agaaaggtgc catggaggaa ctggaaaagg ccttgagctg cccagggcaa 421 ccgagtaact gtgtcaccat tccccgctct ctggatggca ggctgcaagt ctcccaccgg 481 aagggactgc ctcatgtcat ttactgccgt gtgtggcgct ggcccgatct tcagagccac 541 catgaactaa aaccactgga atgctgtgag tttccttttg gttccaagca gaaggaggtc 601 tgcatcaatc cctaccacta taagagagta gaaagccctg tacttcctcc tgtgctggtt 661 ccaagacaca gcgaatataa tcctcagcac agcctcttag ctcagttccg taacttagga 721 caaaatgagc ctcacatgcc actcaacgcc acttttccag attctttcca gcaacccaac 781 agccacccgt ttcctcactc tcccaatagc agttacccaa actctcctgg gagcagcagc 841 agcacctacc ctcactctcc caccagctca gacccaggaa gccctttcca gatgccagct 901 gatacgcccc cacctgctta cctgcctcct gaagacccca tgacccagga tggctctcag 961 ccgatggaca caaacatgat ggcgcctccc ctgccctcag aaatcaacag aggagatgtt 1021 caggcggttg cttatgagga accaaaacac tggtgctcta ttgtctacta tgagctcaac 1081 aatcgtgtgg gtgaagcgtt ccatgcctcc tccacaagtg tgttggtgga tggtttcact 1141 gatccttcca acaataagaa ccgtttctgc cttgggctgc tctccaatgt taaccggaat 1201 tccactattg aaaacaccag gcggcatatt ggaaaaggag ttcatcttta ttatgttgga 1261 ggggaggtgt atgccgaatg ccttagtgac agtagcatct ttgtgcaaag tcggaactgc 1321 aactaccatc atggatttca tcctactact gtttgcaaga tccctagtgg gtgtagtctg 1381 aaaattttta acaaccaaga atttgctcag ttattggcac agtctgtgaa ccatggattt 1441 gagacagtct atgagcttac aaaaatgtgt actatacgta tgagctttgt gaagggctgg 1501 ggagcagaat accaccgcca ggatgttact agcaccccct gctggattga gatacatctg 1561 cacggccccc tccagtggct ggataaagtt cttactcaaa tgggttcacc tcataatcct 1621 atttcatctg tatcttaaat ggccccaggc atctgcct // LOCUS HSU54996 2883 bp mRNA PRI 05-DEC-1997 DEFINITION Human protein ZW10 homolog (HZW10) mRNA, complete cds. ACCESSION U54996 NID g2661163 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2883) AUTHORS Starr,D.A., Williams,B.C., Li,Z., Etemad-Moghadam,B., Dawe,R.K. and Goldberg,M.L. TITLE Conservation of the centromere/kinetochore protein ZW10 JOURNAL J. Cell Biol. 138 (6), 1289-1301 (1997) MEDLINE 97444363 REFERENCE 2 (bases 1 to 2883) AUTHORS Starr,D.A., Williams,B.C., Li,Z. and Goldberg,M.L. TITLE Direct Submission JOURNAL Submitted (13-APR-1996) D.A. Starr, Genetics and Development, Cornell University, 427 Biotechnology Building, Ithaca, NY 14853, USA FEATURES Location/Qualifiers source 1..2883 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" gene 1..2883 /gene="HZW10" CDS 24..2363 /gene="HZW10" /note="Centromere binding protein at prophase, metaphase, and early anaphase. Also binds kinetochore microtubules at metaphase; homolog of Drosophila melanogaster gene l(1)zw10, protein ZW10, PIR Accession Number A43275" /codon_start=1 /product="HZW10" /db_xref="PID:g2661164" /translation="MASFVTEVLAHSGRLEKEDLGTRISRLTRRVEEIKGEVCNMISK KYSEFLPSMQSAQGLITQVDKLSEDIDLLKSRIESEVRRDLHVSTGEFTDLKQQLERD SVVLSLLKQLQEFSTAIEEYNCALTEKKYVTGAQRLEEAQKCLKLLKSRKCFDLKILK SLSMELTIQKQNILYHLGEEWQKLIVWKFPPSKDTSSLESYLQTELHLYTEQSHKEEK TPMPPISSVLLAFSVLGELHSKLKSFGQMLLKYILRPLASCPSLHAVIESQPNIVIIR FESIMTNLEYPSPSEVFTKIRLVLEVLQKQLLDLPLDTDLENEKTSTVPLAEMLGDMI WEDLSECLIKNCLVYSIPTNSSKLQQYEEIIQSTEEFENALKEMRFLKGDTTDLLKYA RNINSHFANKKCQDVIVAARNLMTSEIHNTVKIIPDSKINVPELPTPDEDNKLEVQKV SNTQYHEVMNLEPENTLDQHSFSLPTCRISESVKKLMELAYQTLLEATTSSDQCAVQL FYSVRNIFHLFHDVVPTYHKENLQKLPQLAAIHHNNCMYIAHHLLTLGHQFRLRLAPI LCDGTATFVDLVPGFRRLGTECFLAQMRAQKGELLERLSSARNFSNMDDEENYSAASK AVRQVLHQLKRLGIVWQDVLPVNIYCKAMGTLLNTAISEVIGKITALEDISTEDGDRL YSLCKTVMDEGPQVFAPLSEESKNKKYQEEVPVYVPKWMPFKELMMMLQASLQEIGDR WADGKGPLAAAFSSSEVKALIRALFQNTERRAAALAKIK" BASE COUNT 886 a 578 c 639 g 780 t ORIGIN 1 gcacgagggt tcccgtcttg gccatggcct cgttcgtgac agaagttttg gcacactccg 61 ggaggctgga aaaggaggat ctggggaccc ggatcagccg cctgacccgg cgggtggagg 121 agatcaaggg tgaggtgtgc aatatgatta gcaagaagta cagtgaattc ctgcctagca 181 tgcagagcgc gcagggcctg attacccagg tggataagct atctgaagac attgacctgc 241 tgaaatccag gatagagagt gaggtccgcc gggatcttca cgtatcaacc ggtgaattta 301 cagacttaaa gcagcagttg gaaagagact cagttgtcct aagtttgctt aaacagttgc 361 aggagttttc cactgctatt gaagaatata attgtgcatt aacagagaag aagtatgtca 421 ctggtgctca gcgtctggaa gaggcacaga aatgcttgaa gttattaaaa tccagaaaat 481 gctttgattt aaaaatattg aaatctctca gcatggagct cacaatacag aaacagaaca 541 tactttatca ccttggagaa gagtggcaga agctgattgt atggaagttc ccaccatcaa 601 aagataccag cagtttggaa tcttacctac aaactgaact tcatttatac actgaacaat 661 cgcacaaaga ggagaagacc cctatgccac ccatcagttc tgtcctcttg gcattttctg 721 ttcttggaga actacacagc aagcttaaat catttggtca gatgctgctg aagtatatcc 781 ttaggccgct ggcatcttgc ccatcccttc atgctgtgat agaaagccag cctaacatag 841 ttattattcg ttttgaatct ataatgacta acttggaata tccatcacca tctgaagttt 901 ttacaaagat cagactggta ctagaagtgc tccagaaaca gcttctagat ttgccacttg 961 acactgacct ggaaaatgaa aaaacatcta ctgtcccatt ggctgagatg cttggagaca 1021 tgatctggga ggacttgtct gagtgcctca tcaaaaactg tttggtttat tcgattccaa 1081 caaatagcag caaattacag caatatgaag agatcataca gtccactgaa gaatttgaaa 1141 atgccctaaa ggaaatgaga tttttaaaag gagatactac agatttgctg aaatacgctc 1201 gtaacatcaa ttctcatttt gcaaacaaaa agtgccagga tgtgattgtg gcagccagaa 1261 atctaatgac ctcagaaatt cataacactg tgaagattat tcctgattct aagataaatg 1321 tgccagagtt acccactcct gatgaggata acaaactgga agtacagaaa gtatccaata 1381 ctcagtacca cgaagtgatg aatttagagc ctgaaaatac attggaccaa cattcctttt 1441 ccttgcccac atgccgtatc agtgagtctg tgaagaaatt aatggaactc gcctatcaga 1501 ctttactaga ggcaacaacc agtagtgatc aatgtgctgt tcaacttttc tactcagtga 1561 ggaatatctt ccatttgttc catgatgttg taccaacata tcacaaggag aaccttcaaa 1621 aacttcccca gttggctgct attcatcaca acaactgtat gtacattgct caccacttgc 1681 tgaccctcgg gcatcagttc agattgcgtc ttgcccccat tctttgtgat ggcactgcta 1741 cttttgtgga tcttgtacct ggcttcagga gacttgggac agaatgcttt ttggcccaaa 1801 tgcgggcaca gaaaggtgaa cttctggaaa gattatcaag tgctaggaac ttttcaaata 1861 tggacgatga agagaattat tctgcagcaa gtaaagcagt ccggcaggta ctgcaccaac 1921 taaagagact tggaattgtg tggcaggatg tcctgccagt gaatatatat tgcaaggcta 1981 tggggacttt actcaataca gcaatttctg aggtcattgg caaaattact gccctagagg 2041 acatatctac tgaagatggt gataggttat attccttatg caaaacagtg atggatgaag 2101 gaccccaagt atttgcacct ttatctgaag aaagcaagaa caagaaatat caagaagagg 2161 ttccagtcta tgtgccaaaa tggatgccat tcaaggaatt gatgatgatg ctacaagcca 2221 gcttgcaaga aattggggat cggtgggcag atggaaaagg acccctggca gctgcgttct 2281 cttccagtga agtaaaagct ttaattcgtg ccttgtttca gaacacagaa agaagagcag 2341 ctgcccttgc taaaattaaa tagctccatc ttcttaagaa agctatgtct tgaatatgtg 2401 gattcttccc ttggcataat tactccctta aagacttctt tgaatcgccc attggttttg 2461 gtgaaccagt acatcttgga agtttgactt tacagaagaa cgtctacctc ctggcctgta 2521 cgaggctttg tttaagaact gtttattaag ataaattgtc aagtaaagca cctcaattca 2581 ttgactttct agccatcttc ctttgattag ctaacaaact gtcaggcagc attatttcat 2641 gctgcttcca gagcctctgg gagctatata cattgtaaat gcaggcccta gctttggaac 2701 gaggaattgg gagattccag gagtcagggt agagaatttc tgagcaaatc ggagatattt 2761 taggggtgtg gaggagggga agggaggaat gggccaccat atttggctta caggaattaa 2821 ggagacttcc tgtaatattt ctttccaata aatattgctt tttacaaaaa aaaaaaaaaa 2881 aaa // LOCUS HSU54999 2336 bp mRNA PRI 14-JAN-1997 DEFINITION Human LGN protein mRNA, complete cds. ACCESSION U54999 NID g1408181 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2336) AUTHORS Mochizuki,N., Cho,G., Wen,B. and Insel,P.A. TITLE Identification and cDNA cloning of a novel human mosaic protein, LGN, based on interaction with G alpha i2 JOURNAL Gene 181 (1-2), 39-43 (1996) MEDLINE 97128765 REFERENCE 2 (bases 1 to 2336) AUTHORS Mochizuki,N., Hibi,M., Kanai,Y. and Insel,P.A. TITLE Direct Submission JOURNAL Submitted (12-APR-1996) Pharmacology, UCSD, 9500 Gilman Drive, La Jolla, CA 92093-0636, USA FEATURES Location/Qualifiers source 1..2336 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 174..2207 /codon_start=1 /product="LGN protein" /db_xref="PID:g1408182" /translation="MREDHSFHVRYRMEASCLELALEGERLCKSGDCRAGVSFFEAAV QVGTEDLKTLSAIYSQLGNAYFYLHDYAKALEYHHHDLTLARTIGDQLGEAKASGNLG NTLKVLGNFDEAIVCCQRHLDISRELNDKVGEARALYNLGNVYHAKGKSFGCPGPQDV GEFPEEVRDALQAAVDFYEENLSLVTALGDRAAQGRAFGNLGNTHYLLGNFRDAVIAH EQRLLIAKEFGDKAAERRAYSNLGNAYIFLGEFETASEYYKKTLLLARQLKDRAVEAQ SCYSLGNTYTLLQDYEKAIDYHLKHLAIAQELNDRIGEGRACWSLGNAYTALGNHDQA MHFAEKHLEISREVGDKSGELTARLNLSDLQMVLGLSYSTNNSIMSENTEIDSSLNGV LPKLGRRHSMENMELMKLTPEKVQNWNSEILAKQKPLIAKPSAKLLFVNRLKGKKYKT NSSTKVLQDASNSIDHRIPNSQRKISADTIGDEGFFDLLSRFQSNRMDDQRCCLQEKN CHTASTTTSSTPPKMMLKTSSVPVVSPNTDEFLDLLASSQSRRLDDQRASFSNLPGLR LTQNSQSVLSHLMTNDNKEADEDFFDILVKCQGSRLDDQRCAPPPATTKGPTVPDEDF FSLILRSQGKRMDEQRVLLQRDQNRDTDFGLKDFLQNNALLEFKNSGKKSADH" BASE COUNT 769 a 437 c 503 g 627 t ORIGIN 1 ggcacgagga agaatcagga gcttaggatg tattaacacc aactcattaa tatactaacc 61 ggacaatgtt ctacaaacaa ttctacattg taaaggactg gattggcaca aaataaaata 121 attttatttt attcagctta taatatgact cgatggagga aaatttgata agcatgagag 181 aagaccattc ttttcatgtt cgttacagaa tggaagcttc ttgcctagag ctggccttgg 241 aaggggaacg tctatgtaaa tcaggagact gccgcgctgg cgtgtcattc tttgaagctg 301 cagttcaagt tggaactgaa gacctaaaaa cacttagcgc tatttacagc cagttgggca 361 atgcttattt ctatttgcat gattatgcca aagcattaga atatcaccat catgatttaa 421 cccttgcaag gactattgga gaccagctgg gggaagcgaa agctagtggt aatctgggaa 481 acaccttaaa agttcttggg aattttgacg aagccatagt ttgttgtcag cgacacctag 541 atatttccag agagcttaat gacaaggtgg gagaagcaag agcactttac aatcttggga 601 atgtgtatca tgccaaaggg aaaagttttg gttgccctgg tccccaggat gtaggagaat 661 ttccagaaga agtgagagat gctctgcagg cagccgtgga tttttatgag gaaaacctat 721 cattagtgac tgctttgggt gaccgagcgg cacaaggacg tgcctttgga aatcttggaa 781 acacacatta cctccttggc aacttcaggg atgcagttat agctcatgag cagcgtctcc 841 ttattgcaaa agaatttgga gataaagcag ctgaaagaag agcatatagc aaccttggaa 901 atgcatatat atttcttggt gaatttgaaa ctgcctcgga atactacaag aagacactac 961 tgttggcccg acagcttaaa gaccgagctg tagaagcaca gtcttgttac agtcttggaa 1021 atacatatac tttacttcaa gactatgaaa aggccattga ttatcatctg aagcacttag 1081 caattgctca agagctgaat gatagaattg gtgaaggaag agcatgttgg agcttaggaa 1141 atgcatacac agcactagga aatcatgatc aagcaatgca ttttgctgaa aagcacttgg 1201 aaatttcaag agaggttggg gataaaagtg gtgaactaac agcacgactt aatctctcag 1261 accttcaaat ggttcttggt ctgagctaca gcacaaataa ctccataatg tctgaaaata 1321 ctgaaattga tagcagtttg aatggtgtac tccccaagtt gggacgccgg catagtatgg 1381 aaaatatgga acttatgaag ttaacaccag aaaaggtaca gaactggaac agtgaaattc 1441 ttgctaagca aaaacctctt attgccaaac cttctgcaaa gctactcttt gtcaacagac 1501 tgaaggggaa aaaatacaaa acgaattcct ccactaaagt tctccaagat gccagtaatt 1561 ctattgacca ccgaattcca aattctcaga ggaaaatcag tgcagatact attggagatg 1621 aagggttctt tgacttatta agccgatttc aaagcaatag gatggatgat cagagatgtt 1681 gcttacaaga aaagaactgc catacagctt caacaacaac ttcttccact ccccctaaaa 1741 tgatgctaaa aacatcatct gttcctgtgg tatcccccaa cacggatgag tttttagatc 1801 ttcttgccag ctcacagagt cgccgtctgg atgaccagag ggctagtttc agtaatttgc 1861 cagggcttcg tctaacacaa aacagccagt cggtacttag ccacctgatg actaatgaca 1921 acaaagaggc tgatgaagat ttctttgaca tccttgtaaa atgtcaagga tccagattag 1981 atgatcaaag atgtgctcca ccacctgcta ccacaaaggg tccgacagta ccagatgaag 2041 actttttcag ccttatttta cggtcccagg gaaagagaat ggatgaacag agagttcttt 2101 tacaaagaga tcaaaacaga gacactgact ttgggctaaa ggactttttg caaaataatg 2161 ctttgttgga gtttaaaaat tcagggaaaa aatcggcaga ccattagtta ctatggattt 2221 attttttttc ctttcaaaca cggtaaggaa acaatctatt acttttttcc ttaaaaggag 2281 aatttatagc actgtaatac agcttaaaat atttttagaa tgatgtaaat agttaa // LOCUS HSU55206 1280 bp mRNA PRI 22-OCT-1996 DEFINITION Human gamma-glutamyl hydrolase (hGH) mRNA, complete cds. ACCESSION U55206 NID g1621542 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1280) AUTHORS Yao,R., Nimec,Z., Ryan,T.J. and Galivan,J. TITLE Identification, cloning, and sequencing of a cDNA coding for rat gamma-glutamyl hydrolase JOURNAL J. Biol. Chem. 271 (15), 8525-8528 (1996) MEDLINE 96224049 REFERENCE 2 (bases 1 to 1280) AUTHORS Yao,R., Schneider,E., Ryan,T.J. and Galivan,J. TITLE Human gamma-glutamyl hydrolase: cloning and characterization of the enzyme expressed in vitro JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (19), 10134-10138 (1996) MEDLINE 96413608 REFERENCE 3 (bases 1 to 1280) AUTHORS Galivan,J. TITLE Direct Submission JOURNAL Submitted (16-APR-1996) John Galivan, Molecular Medicine, Wadsworth Center, Empire State Plaza, Albany, NY 12201-0509, USA FEATURES Location/Qualifiers source 1..1280 /organism="Homo sapiens" /note="derived using human placenta and brain ESTs" /db_xref="taxon:9606" gene 60..1016 /gene="hGH" CDS 60..1016 /gene="hGH" /EC_number="3.4.22.12" /codon_start=1 /product="human gamma-glutamyl hydrolase" /db_xref="PID:g1621543" /translation="MASPGCLLCVLGLLLCGAASLELSRPHGDTAKKPIIGILMQKCR NKVMKNYGRYYIAASYVKYLESAGARVVPVRLDLTEKDYEILFKSINGILFPGGSVDL RRSDYAKVAKIFYNLSIQSFDDGDYFPVWGTCLGFEELSLLISGECLLTATDTVDVAM PLNFTGGQLHSRMFQNFPTELLLSLAVEPLTANFHKWSLSVKNFTMNEKLKKFFNVLT TNTDGKIEFISTMEGYKYPVYGVQWHPEKAPYEWKNLDGISHAPNAVNPAFYLAEFFV NEARKKNHHFKSESEEEKALIYQFSPIYTGNISSFQQCYIFD" BASE COUNT 390 a 237 c 278 g 375 t ORIGIN 1 tgccgcagcc cccgcccgcc cgcagagctt ttgaaaggcg gcgggaggcg gcgagcgcca 61 tggccagtcc gggctgcctg ctgtgcgtgc tgggcctgct actctgcggg gcggcgagcc 121 tcgagctgtc tagaccccac ggcgacaccg ccaagaagcc catcatcgga atattaatgc 181 aaaaatgccg taataaagtc atgaaaaact atggaagata ctatattgct gcgtcctatg 241 taaagtactt ggagtctgca ggtgcgagag ttgtaccagt aaggctggat cttacagaga 301 aagactatga aatacttttc aaatctatta atggaatcct tttccctgga ggaagtgttg 361 acctcagacg ctcagattat gctaaagtgg ccaaaatatt ttataacttg tccatacaga 421 gttttgatga tggagactat tttcctgtgt ggggcacatg ccttggattt gaagagcttt 481 cactgctgat tagtggagag tgcttattaa ctgccacaga tactgttgac gtggcaatgc 541 cgctgaactt cactggaggt caattgcaca gcagaatgtt ccagaatttt cctactgagt 601 tgttgctgtc attagcagta gaacctctga ctgccaattt ccataagtgg agcctctccg 661 tgaagaattt tacaatgaat gaaaagttaa agaagttttt caatgtctta actacaaata 721 cagatggcaa gattgagttt atttcaacaa tggaaggata taagtatcca gtatatggtg 781 tccagtggca tccagagaaa gcaccttatg agtggaagaa tttggatggc atttcccatg 841 cacctaatgc tgtgaacccc gcattttatt tagcagagtt ttttgttaat gaagctcgga 901 aaaagaacca tcattttaaa tctgaatctg aagaggagaa agcattgatt tatcagttca 961 gtccaattta tactggaaat atttcttcat ttcagcaatg ttacatattt gattgaaagt 1021 cttcaatttg ttaacagagc aaatttgaat aattccatga ttaagctgtt agaataactt 1081 gctactcatg gcaagattag gaagtcacag attcttttct ataatgtgcc tggctctgat 1141 tcttcattat gtatgtgact atttatataa cattagataa ttaaatagtg agacataaat 1201 agagtgcttt ttcatggaaa agccttctta tatctgaaga ttgaaaaata aatttactga 1261 aatacaaaaa aaaaaaaaaa // LOCUS HSU55312 1903 bp DNA PRI 20-MAY-1996 DEFINITION Human G protein-coupled receptor GPR-NGA gene, complete cds. ACCESSION U55312 NID g1323695 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1903) AUTHORS Bonner,T.I and Matsuda,L.A. TITLE A G protein-coupled receptor expressed in NG108-15 and AtT-20 cells JOURNAL Unpublished REFERENCE 2 (bases 1 to 1903) AUTHORS Bonner,T.I and Matsuda,L.A. TITLE Direct Submission JOURNAL Submitted (17-APR-1996) T.I. Bonner, Lab of Cell Biology, NIMH, Bldg 36, Rm 3A-17, MSC 4090, Bethesda, MD 20892-4090, USA FEATURES Location/Qualifiers source 1..1903 /organism="Homo sapiens" /note="PCR product" /db_xref="taxon:9606" /cell_line="DMR1" /clone="hNGA" intron <1..382 /note="based on comparison with human brain cDNA, GenBank Accession Number H07970" mRNA <383..1791 CDS 405..1652 /note="G protein-coupled receptor of the rhodopsin family, ligand unknown" /codon_start=1 /product="GPR-NGA" /db_xref="PID:g1323696" /translation="MVFAHRMDNSKPHLIIPTLLVPLQNRSCTETATPLPSQYLMELS EEHSWMSNQTDLHYVLKPGEVATASIFFGILWLFSIFGNSLVCLVIHRSRRTQSTTNY FVVSMACADLLISVASTPFVLLQFTTGRWTLGSATCKVVRYFQYLTPGVQIYVLLSIC IDRFYTIVYPLSFKVSREKAKKMIAASWIFDAGFVTPVLFFYGSNWDSHCNYFLPSSW EGTAYTVIHFLVGFVIPSVLIILFYQKVIKYIWRIGTDGRTVRRTMNIVPRTKVKTIK MFLILNLLFLLSWLPFHVAQLWHPHEQDYKKSSLVFTAITWISFSSSASKPTLYSIYN ANFRRGMKETFCMSSMKCYRSNAYTITTSSRMAKKNYVGISEIPSMAKTITKDSIYDS FDREAKEKKLAWPINSNPPNTFV" polyA_site 1792 /note="based on comparison with human brain cDNA, GenBank Accession Number H07878" BASE COUNT 540 a 422 c 373 g 568 t ORIGIN 1 gaattccagc aaatcttcag ttggtggtaa cacccttacc atgagccaga tatgagatcc 61 ctaatattct gtgatccctg atgagtgaag ggaacaagga tatgtgtagg aggggagctc 121 ggtatgacta agggtcaaga gaaggtgagg ccagagagag cctgagctga gatctgctga 181 aacagctcct aaaatgaaaa caaggttggg gccagaattt tttctggata gtagttatgt 241 tttcctgcca acgctcaagt cctacacaaa gacaaatgac aatcaatgta aatgtcaaat 301 aagatcgtta gcctgagtaa tcataaccaa tctgtatgac acctttttaa caggaggcct 361 cattcttctt ttccccaacc agaattaaga gaaaaaaagt gaatatggtt tttgctcaca 421 gaatggataa cagcaagcca catttgatta ttcctacact tctggtgccc ctccaaaacc 481 gcagctgcac tgaaacagcc acacctctgc caagccaata cctgatggaa ttaagtgagg 541 agcacagttg gatgagcaac caaacagacc ttcactatgt gctgaaaccc ggggaagtgg 601 ccacagccag catcttcttt gggattctgt ggttgttttc tatcttcggc aattccctgg 661 tttgtttggt catccatagg agtaggagga ctcagtctac caccaactac tttgtggtct 721 ccatggcatg tgctgacctt ctcatcagcg ttgccagcac gcctttcgtc ctgctccagt 781 tcaccactgg aaggtggacg ctgggtagtg caacgtgcaa ggttgtgcga tattttcaat 841 atctcactcc aggtgtccag atctacgttc tcctctccat ctgcatagac cggttctaca 901 ccatcgtcta tcctctgagc ttcaaggtgt ccagagaaaa agccaagaaa atgattgcgg 961 catcgtggat ctttgatgca ggctttgtga cccctgtgct ctttttctat ggctccaact 1021 gggacagtca ttgtaactat ttcctcccct cctcttggga aggcactgcc tacactgtca 1081 tccacttctt ggtgggcttt gtgattccat ctgtcctcat aattttattt taccaaaagg 1141 tcataaaata tatttggaga ataggcacag atggccgaac ggtgaggagg acaatgaaca 1201 ttgtccctcg gacaaaagtg aaaactatca agatgttcct cattttaaat ctgttgtttt 1261 tgctctcctg gctgcctttt catgtagctc agctatggca cccccatgaa caagactata 1321 agaaaagttc ccttgttttc acagctatca catggatatc ctttagttct tcagcctcta 1381 aacctactct gtattcaatt tataatgcca attttcggag agggatgaaa gagacttttt 1441 gcatgtcctc tatgaaatgt taccgaagca atgcctatac tatcacaaca agttcaagga 1501 tggccaaaaa aaactacgtt ggcatttcag aaatcccttc catggccaaa actattacca 1561 aagactcgat ctatgactca tttgacagag aagccaagga aaaaaagctt gcttggccca 1621 ttaactcaaa tccaccaaat acttttgtct aagttctcat tctttcaatt gttatgcacc 1681 agagattaaa aagctttaac tataaaaaca gaagctattt acatatttgt tttcactcaa 1741 ctttccaagg gaaatgtttt attttgtaaa atgcattcat ttgtttactg tagtttttgt 1801 gggttttatt ttacttgctt tttatgtttt aggaaaagcg ttcactttga actttagcca 1861 acagtccttt tactattaat atattagtta catgcataaa aaa // LOCUS HSU55766 1141 bp mRNA PRI 23-MAY-1996 DEFINITION Human Rev interacting protein Rip-1 mRNA, complete cds. ACCESSION U55766 NID g1326183 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1141) AUTHORS D'Sa-Eipper,C., Venkatesh,L.K. and Chinnadurai,G. TITLE HIV Rev interacting protein-1 JOURNAL Unpublished REFERENCE 2 (bases 1 to 1141) AUTHORS D'Sa-Eipper,C. TITLE Direct Submission JOURNAL Submitted (18-APR-1996) Cleta D'Sa-Eipper, Inst. Mol. Virology, St Louis University, 3681 Park Avenue, St Louis, MO 63110, USA FEATURES Location/Qualifiers source 1..1141 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="B cell" CDS 13..879 /function="interacts with HIV Rev protein" /codon_start=1 /product="Rev interacting protein Rip-1" /db_xref="PID:g1326184" /translation="MASPSLERPEKGAGKSEFRNQKPKPENQDESELLTVPDGWKEPA FSKEDNPRGLLEESSFATLFPKYREAYLKECWPLVQKALNEHHVNATLDLIEGSMTVC TTKKTFDPYIIIRARDLIKLLARSVSFEQAVRILQDDVACDIIKIGSLVRNKERFVKR RQRLIGPKGSTLKALELLTNCYIMVQGNTVSAIGPFSGLKEVRKVVLDTMKNIHPIYN IKSLMIKRELAKDSELRSQSWERFLPQFKHKNVNKRKEPKKKTVKKDIRHSHHHNQKV RSIKNWLVVNTF" BASE COUNT 430 a 194 c 255 g 262 t ORIGIN 1 cgcagcttgc aaatggcgtc tccctcgctg gagcggccag aaaaaggcgc tggaaaaagt 61 gaatttcgta accagaagcc gaagccggag aaccaagatg aatcagaact ccttacggtt 121 cctgatggtt ggaaggaacc agctttttcc aaagaggaca atcccagagg acttttggag 181 gagagcagtt tcgcaacttt gttcccaaaa tacagggaag cttacttgaa agagtgttgg 241 ccattggtgc agaaagcctt aaatgaacat catgttaatg caaccctgga cctgatcgaa 301 ggcagcatga ctgtttgtac tacaaagaag acttttgatc catatatcat cattagggcc 361 agagatctga taaaactgtt agcaaggagt gtttcatttg aacaggcagt acgaattctt 421 caggatgatg ttgcatgtga catcattaaa ataggttctt tagtaaggaa taaagagaga 481 tttgtaaaac gaagacaacg gcttattggt cccaaaggat ctacattgaa ggcattggaa 541 ctcttaacta attgttacat tatggttcag ggaaacacag tttcagccat tggacctttt 601 agtggcttaa aagaggttag aaaagtagtc cttgatacta tgaagaatat tcatccaatt 661 tataacatta aaagcttaat gattaagaga gagttggcaa aagattctga attacgatca 721 caaagttggg agagattttt gccacagttc aaacacaaaa atgtgaataa acgcaaggaa 781 ccaaagaaaa aaactgttaa gaaagatata cgccattccc accaccacaa ccagaaagtc 841 agatcgataa agaattggct agtggtgaat actttttgaa ggcaaatcag aagaagcggc 901 agaaaatgaa gcaataaagg ctaaacaagc agaagccatc agtaagagac aagaggaaag 961 aaacaaagca tttattccac ctaaggaaaa accaattgtg aaacctaagg aagcttctac 1021 tgaaactaaa attgatgtgg ccagcatcaa ggaaaaggtt aagaaagcaa agaataagaa 1081 actgggagct cttacagctg aagaaattgc acttaagatg gaggcagatg aaaaaaaaaa 1141 a // LOCUS HSU55853 2506 bp mRNA PRI 01-JUN-1997 DEFINITION Homo sapiens 130 kD Golgi-localized phosphoprotein (GPP130) mRNA, complete cds. ACCESSION U55853 NID g2145094 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2506) AUTHORS Linstedt,A.D., Mehta,A., Suhan,J., Reggio,H. and Hauri,H.P. TITLE Sequence and overexpression of GPP130/GIMPc: evidence for saturable pH-sensitive targeting of a type II early Golgi membrane protein JOURNAL Mol. Biol. Cell 8 (1997) In press REFERENCE 2 (bases 1 to 2506) AUTHORS Linstedt,A.D. TITLE Direct Submission JOURNAL Submitted (19-APR-1996) Biol. Sci., Carnegie Mellon, 4400 5th Ave., Pittsburgh, PA 15213, USA FEATURES Location/Qualifiers source 1..2506 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="HeLa" /tissue_type="small intestine" gene 14..2104 /gene="GPP130" CDS 14..2104 /gene="GPP130" /note="GPP130; type II Golgi membrane protein" /codon_start=1 /product="130 kD Golgi-localized phosphoprotein" /db_xref="PID:g2145095" /translation="MGNGMCSRKQKRIFQTLLLLTVVFGFLYGAMLYYELQTQLRKAE AVALKYQQHQESLSAQLQVVYEHRSRLEKSLQKERLEHKKAKEDFLVYKLEAQETLNK GRQDSNSRYSALNVQHQMLKSQHEELKKQHSDLEEEHRKQGEDFSRTFNDHKQKYLQL QQEKEQELSKLKETVYNLREENRQLRKAHQDIHTQLQDVKQQHKNLLSEHEQLVVTLE DHKSALAAAQTQVAEYKQLKDTLNRIPSLRKPDPAEQQNVTQVAHSPQGYNTAREKPT REVQEVSRNNDVWQNHEAVPGRAEDTKLYAPTHKEAEFQAPPEPIQQEVERREPEEHQ VEEEHRKALEEEEMEQVGQAEHLEEEHDPSPEEQDREWKEQHEQREAANLLEGHARAE VYPSAKPMIKFQSPYEEQLEQQRLAVQQVEEAQQLREHQEALHQQRLQGHLLRQQEQQ QQQVAREMALQRQAELEEGRPQHQEQLRQQAHYDAMDNDIVQGAEDQGIQGEEGAYER DNQHQDEAEGDPGNRHEPREQGPREADPESEADRAAVEDINPADDPNNQGEDEFEEAE QVREENLPDENEEQKQSNQKQENTEVEEHLVMAGNPDQQEDNVDEQYQEEAEEEVQED LTEEKKRELEHNAEETYGENDENTDDKNNDGEEQEVRDDNRPKGREEHYEEEEEEEED GAAVAEKSHRRAEM" BASE COUNT 900 a 502 c 647 g 457 t ORIGIN 1 tccaggcggg actatgggaa acgggatgtg ctcccgaaag cagaagcgga ttttccagac 61 gctgctgctg ctgaccgtcg tgttcggctt tctctacggc gcgatgctct actacgagct 121 gcagacgcag ctgcggaaag ccgaggcggt ggcgctcaag taccagcagc accaggagtc 181 cctctccgcc cagttacaag ttgtatatga acacagatca agattagaga aatccttgca 241 aaaagaaaga cttgaacata aaaaagcaaa ggaagatttt cttgtttata agttagaagc 301 acaagaaaca ttaaataaag gaaggcaaga ttccaatagc agatacagtg cactgaatgt 361 ccaacatcag atgttgaaaa gccaacacga ggagctaaag aaacagcaca gtgacttgga 421 agaggaacat cgcaaacaag gggaagactt cagtagaaca tttaatgacc ataagcaaaa 481 atacttgcag ctccagcaag aaaaagaaca agaactttct aagctaaaag agactgtata 541 caatttgaga gaagagaata gacaactaag gaaagcacac caagacatac atacacagct 601 tcaagatgtc aagcaacagc ataagaattt actctccgag catgaacaac ttgtagtgac 661 tttggaagac cacaagagtg cactagctgc tgcacagact caagttgcag aatataaaca 721 actgaaagat actctgaata ggattccaag ccttcgaaaa cctgatccag cagaacagca 781 aaatgtgacc caggtggcac attctccaca aggttacaac acagcaaggg agaagccaac 841 ccgagaggtg caggaggtgt ctcgaaataa tgatgtgtgg cagaaccatg aagcagttcc 901 tggaagagca gaagacacaa aactctatgc tcccacccat aaggaggcag aatttcaggc 961 tcccccagag ccaatccaac aagaagtgga acgcagagaa cctgaggagc atcaggtgga 1021 agaggagcac agaaaggccc tggaggagga agaaatggag caggtcgggc aagcagaaca 1081 tcttgaggag gaacacgatc catcaccaga ggagcaggat cgggagtgga aagagcagca 1141 tgagcaacga gaagcagcca acctcctgga agggcacgcg cgtgctgagg tgtacccttc 1201 agccaagcca atgatcaaat tccaatcacc ctatgaggaa cagttggaac agcagagact 1261 ggcagtgcag caggtggagg aggcccagca gctgcgggaa caccaggaag ctttgcacca 1321 gcagaggctg caggggcact tactacggca gcaggaacag cagcagcagc aggtggcaag 1381 agagatggcc ctgcagaggc aggctgagct tgaggagggc cggccgcagc accaggagca 1441 gctccggcag caagctcatt atgatgctat ggataatgat atcgttcagg gagcagagga 1501 ccagggaatc caaggagagg aaggagccta tgaaagagac aaccagcacc aagatgaagc 1561 agaaggagat ccaggtaata gacatgagcc tcgtgaacaa ggaccccgag aagccgaccc 1621 agaatctgag gcagataggg cagctgtaga agatataaac ccagcagatg accctaataa 1681 tcaaggtgag gatgaatttg aagaagccga gcaagtgaga gaagaaaatt tgccagatga 1741 aaatgaagag caaaaacaaa gtaatcaaaa gcaagagaat acagaagtgg aggaacattt 1801 ggtgatggca ggaaatccag accagcagga ggacaatgtt gatgaacagt accaggaaga 1861 ggcagaagag gaggttcagg aagatttgac tgaagagaaa aaaagggaac tggagcataa 1921 tgctgaagag acctatggtg aaaatgatga aaatactgat gataaaaata atgatggaga 1981 agagcaagaa gttcgagatg acaaccgccc caaaggccga gaggaacact acgaggagga 2041 agaagaggag gaagaagacg gggctgcagt tgctgagaaa tcacatcgaa gagctgaaat 2101 gtagcagcac ccaatttcta gacaacgctc agccaacgga ttcttttcaa gctgctcaaa 2161 cataaatctg cctactgaac tctaggatat ttaattacaa aaattaagaa cttagacttt 2221 tttaaaactt tgtattagaa atgcgcatac atttatatga atatattttg ataacatagg 2281 tctagagctt cttttatatt caagctaaac atgaaaaaga agaaaaacaa taaagtaaac 2341 ctgagccccc acgtcccaat ttttttaata gattatgtga tgttggaaag ctcattgatt 2401 tgtatatgtt tcagtgtgtt acctttctgg cttccagttc ccaggtgttc tttgtttgcc 2461 tttgataaaa tacaggattt aagaacagaa agtagctgca aaatgc // LOCUS HSU56079 1418 bp mRNA PRI 01-AUG-1996 DEFINITION Human Y5 receptor mRNA, complete cds. ACCESSION U56079 NID g1438903 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1200) AUTHORS Gerald,C., Walker,M.W., Criscione,L., Gustafson,E.L., Batzl-Hartmann,C., Smith,K.E., Vaysse,P., Durkin,M.M., Laz,T.M., Linemeyer,D.L., Schaffhauser,A.O., Whitebread,S., Hofbauer,K.G., Taber,R.I., Branchek,T.A. and Weinshank,R.L. TITLE A receptor subtype involved in neuropeptide-Y-induced food intake JOURNAL Nature 382 (6587), 168-171 (1996) MEDLINE 96317589 REFERENCE 2 (bases 1 to 1418) AUTHORS Gerald,C.A. TITLE Direct Submission JOURNAL Submitted (22-APR-1996) Christophe A. Gerald, Synaptic Pharmaceutical Corporation, Molecular Biology, 215 College Road, Paramus, NJ 07652, USA FEATURES Location/Qualifiers source 1..1418 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="humanY5" /sex="male" /tissue_type="hippocampus" /dev_stage="adult" 5'UTR 1..25 CDS 26..1393 /codon_start=1 /product="Y5 receptor" /db_xref="PID:g1438904" /translation="MSFYSKQDYNMDLELDEYYNKTLATENNTAATRNSDFPVWDDYK SSVDDLQYFLIGLYTFVSLLGFMGNLLILMALMKKRNQKTTVNFLIGNLAFSDILVVL FCSPFTLTSVLLDQWMFGKVMCHIMPFLQCVSVLVSTLILISIAIVRYHMIKHPISNN LTANHGYFLIATVWTLGFAICSPLPVFHSLVELQETFGSALLSSRYLCVESWPSDSYR IAFTISLLLVQYILPLVCLTVSHTSVCRSISCGLSNKENRLEENEMINLTLHPSKKSG PQVKLSGSHKWSYSFIKKHRRRYSKKTACVLPAPERPSQENHSRILPENFGSVRSQLS SSSKFIPGVPTCFEIKPEENSDVHELRVKRSVTRIKKRSRSVFYRLTILILVFAVSWM PLHLFHVVTDFNDNLISNRHFKLVYCICHLLGMMSCCLNPILYGFLNNGIKADLVSLI HCLHM" 3'UTR 1394..1418 BASE COUNT 405 a 268 c 266 g 479 t ORIGIN 1 gtaatgtttt tttggttgct gacaaatgtc tttttattcc aagcaggact ataatatgga 61 tttagagctc gacgagtatt ataacaagac acttgccaca gagaataata ctgctgccac 121 tcggaattct gatttcccag tctgggatga ctataaaagc agtgtagatg acttacagta 181 ttttctgatt gggctctata catttgtaag tcttcttggc tttatgggga atctacttat 241 tttaatggct ctcatgaaaa agcgtaatca gaagactacg gtaaacttcc tcataggcaa 301 tctggccttt tctgatatct tggttgtgct gttttgctca cctttcacac tgacgtctgt 361 cttgctggat cagtggatgt ttggcaaagt catgtgccat attatgcctt ttcttcaatg 421 tgtgtcagtt ttggtttcaa ctttaatttt aatatcaatt gccattgtca ggtatcatat 481 gataaaacat cccatatcta ataatttaac agcaaaccat ggctactttc tgatagctac 541 tgtctggaca ctaggttttg ccatctgttc tccccttcca gtgtttcaca gtcttgtgga 601 acttcaagaa acatttggtt cagcattgct gagcagcagg tatttatgtg ttgagtcatg 661 gccatctgat tcatacagaa ttgcctttac tatctcttta ttgctagttc agtatattct 721 gcccttagtt tgtcttactg taagtcatac aagtgtctgc agaagtataa gctgtggatt 781 gtccaacaaa gaaaacagac ttgaagaaaa tgagatgatc aacttaactc ttcatccatc 841 caaaaagagt gggcctcagg tgaaactctc tggcagccat aaatggagtt attcattcat 901 caaaaaacac agaagaagat atagcaagaa gacagcatgt gtgttacctg ctccagaaag 961 accttctcaa gagaaccact ccagaatact tccagaaaac tttggctctg taagaagtca 1021 gctctcttca tccagtaagt tcataccagg ggtccccact tgctttgaga taaaacctga 1081 agaaaattca gatgttcatg aattgagagt aaaacgttct gttacaagaa taaaaaagag 1141 atctcgaagt gttttctaca gactgaccat actgatatta gtatttgctg ttagttggat 1201 gccactacac cttttccatg tggtaactga ttttaatgac aatcttattt caaataggca 1261 tttcaagttg gtgtattgca tttgtcattt gttgggcatg atgtcctgtt gtcttaatcc 1321 aattctatat gggtttctta ataatgggat taaagctgat ttagtgtccc ttatacactg 1381 tcttcatatg taataattct cactgtttac caaggaaa // LOCUS HSU56102 2603 bp mRNA PRI 04-JUL-1996 DEFINITION Human adhesion molecule DNAM-1 mRNA, complete cds. ACCESSION U56102 NID g1401184 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2603) AUTHORS Shibuya,A., Campbell,D., Hannum,C., Yssel,H., Franz-Bacon,K., McClanahan,T., Kitamura,T., Nicholl,J., Sutherland,G.R., Lanier,L.L. and Phillips,J.H. TITLE DNAM-1, a novel adhesion molecule involved in the cytolytic function of T lymphocytes JOURNAL Immunity 4 (6), 573-581 (1996) MEDLINE 96256836 REFERENCE 2 (bases 1 to 2603) AUTHORS Shibuya,A., Phillips,J.H. and Lanier,L.L. TITLE Direct Submission JOURNAL Submitted (22-APR-1996) Lewis L. Lanier, Human Immunology, DNAX Research Institute, 901 California Avenue, Palo Alto, CA 94304, USA FEATURES Location/Qualifiers source 1..2603 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="18" /map="18q22.3" /cell_type="T cell" CDS 209..1219 /function="adhesion molecule" /note="stary" /codon_start=1 /product="DNAM-1" /db_xref="PID:g1401185" /translation="MDYPTLLLALLHVYRALCEEVLWHTSVPFAENMSLECVYPSMGI LTQVEWFKIGTQQDSIAIFSPTHGMVIRKPYAERVYFLNSTMASNNMTLFFRNASEDD VGYYSCSLYTYPQGTWQKVIQVVQSDSFEAAVPSNSHIVSEPGKNVTLTCQPQMTWPV QAVRWEKIQPRQIDLLTYCNLVHGRNFTSKFPRQIVSNCSHGRWSVIVIPDVTVSDSG LYRCYLQASAGENETFVMRLTVAEGKTDNQYTLFVAGGTVLLLLFVISITTIIVIFLN RRRRRERRDLFTESWDTQKAPNNYRSPISTGQPTNQSMDDTREDIYVNYPTFSRRPKT RV" BASE COUNT 813 a 512 c 562 g 716 t ORIGIN 1 ccacgcgtcc gcgctcccct cagctcctgc agtgctaatt aagggaggga gcagcgggga 61 gcttgcagtg accaagaggg tgttgaggct aggaggccac gataaacagg atacgataaa 121 agtccttaac caagacgcag atgggaagaa gcgttagagc gagcagcact cacatctcaa 181 gaaccagcct ttcaaacagt ttccagagat ggattatcct actttacttt tggctcttct 241 tcatgtatac agagctctat gtgaagaggt gctttggcat acatcagttc cctttgccga 301 gaacatgtct ctagaatgtg tgtatccatc aatgggcatc ttaacacagg tggagtggtt 361 caagatcggg acccagcagg attccatagc cattttcagc cctactcatg gcatggtcat 421 aaggaagccc tatgctgaga gggtttactt tttgaattca acgatggctt ccaataacat 481 gactcttttc tttcggaatg cctctgaaga tgatgttggc tactattcct gctctcttta 541 cacttaccca cagggaactt ggcagaaggt gatacaggtg gttcagtcag atagttttga 601 ggcagctgtg ccatcaaata gccacattgt ttcggaacct ggaaagaatg tcacactcac 661 ttgtcagcct cagatgacgt ggcctgtgca ggcagtgagg tgggaaaaga tccagccccg 721 tcagatcgac ctcttaactt actgcaactt ggtccatggc agaaatttca cctccaagtt 781 cccaagacaa atagtgagca actgcagcca cggaaggtgg agcgtcatcg tcatccccga 841 tgtcacagtc tcagactcgg ggctttaccg ctgctacttg caggccagcg caggagaaaa 901 cgaaaccttc gtgatgagat tgactgtagc cgagggtaaa accgataacc aatataccct 961 ctttgtggct ggagggacag ttttattgtt gttgtttgtt atctcaatta ccaccatcat 1021 tgtcattttc cttaacagaa ggagaaggag agagagaaga gatctattta cagagtcctg 1081 ggatacacag aaggcaccca ataactatag aagtcccatc tctaccggtc aacctaccaa 1141 tcaatccatg gatgatacaa gagaggatat ttatgtcaac tatccaacct tctctcgcag 1201 accaaagact agagtttaag cttattcttg acatgagtgc attagtaatg actcttatgt 1261 actcatgcat ggatctttat gcaatttttt tccactaccc aaggtctacc ttagatacta 1321 gttgtctgaa ttgagttact ttgataggaa aaatacttca ttacctaaaa tcatttttca 1381 tagaactgtt tcagaaaacc tgactctaac tggtttatat acaaaagaaa acttactgta 1441 tcatataaca gaatgatcca ggggagatta agctttgggc aagggctatt taccagggct 1501 taaatgttgt gtctagaatt aagtatgggc ataaactggc ttctgaatcc ctttccagag 1561 tgttggatcc atttccctgg tcttggcctc actctcatgc aggctttcct cttgtgttgg 1621 caagatggct gccaactctt ggcaattcat acatccttgt ttctgtctgg tagagagttt 1681 gcttctcaaa tggagcaaac aaatttgatt attttttcat tgttaaatag gcaacatgac 1741 cagaaaggat ggaatggctt aagtaaacta agggttcact tctagagctg agaagcaggg 1801 tcaaagcaca atactgggca attcagagca tggttagaag aggaaagggg agtctcaaag 1861 ctggagagtt taccaacaaa tattgactgc agtgattaac caagacattt ttgttaacta 1921 aaaagtgaaa tatgggatgg attctagaaa tggggtatct ctgtccatac ttctagaatc 1981 cactctatca gcatagtcca gaagaatacc tggcagtaga agaaatgaat attcaagagg 2041 aagataaatg cgagagggca atcctttact attctcatat ttatttatct ctcattctgt 2101 atagaattct tgccgccatc ccaggtctag ccttaggagc aaatgtagta gatagtcgaa 2161 taataaataa cttaatgttt tggacatatt ttgtctactt ttgagaatta tttttaatat 2221 gtaaattctc tcaaaagggt caggcaccta gttattattt tttaatgatt atgtgaaagt 2281 tgaatataat ataccactaa aagtgacagt tgaaagtggt ggcataggat ggtagggtag 2341 aaatttggga gggaaaaaag aaattgggag ggtacaggca acaggagaaa ggaatcaaac 2401 cacagaaaaa tacaaaggga aacttctgct tcactattca gacaaagaca gccctaatga 2461 catcaccaac agtcaaagca attagagacc atacctaata ttgtttaaat tctagatgta 2521 ggctaacaat gaaaagtatt tgccaaactg aataaaactg tcatggttac cttgaaaaaa 2581 aaaaaaaaaa aaaaaaaaaa aaa // LOCUS HSU56244 2414 bp mRNA PRI 09-MAY-1996 DEFINITION Human HIG-1 mRNA, complete cds. ACCESSION U56244 NID g1305696 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2414) AUTHORS Rupec,R.A. TITLE Hypoxia-inducible gene JOURNAL Unpublished (1996) REFERENCE 2 (bases 1 to 2414) AUTHORS Rupec,R.A. TITLE Direct Submission JOURNAL Submitted (23-APR-1996) Rudolf A. Rupec, Biochemistry, University of Freiburg, Hermann-Herder-Str. 7, Freiburg i. Br. D-79104, Germany FEATURES Location/Qualifiers source 1..2414 /organism="Homo sapiens" /note="isolated by subtractive hybridization from hypoxia-induced HeLa cells" /db_xref="taxon:9606" /cell_line="HeLa" 5'UTR 1..651 gene 652..1632 /gene="HIG-1" CDS 652..1632 /gene="HIG-1" /note="hypoxia-inducible gene" /codon_start=1 /db_xref="PID:g1305697" /translation="MFDDLDMKESGNKAWSGAQFVLERTSVLVFLPGLGEINYMHELL TSLVHKRLQVYPLHSSVALEEQNNVFLSPVPGYRKIILSTNIAESSVTVPDVKYVIDF CLTRTLVCDEDTNYQSLRLSWASKTSCNQRKGRAGRVSRGYCYRLVHKDFWDNSIPDH VVPEMLRCPLGSTILKVKLLDMGEPRALLATALSPPGLSDIERTILLLKEVGALAVSG QREDENPHDGELTFLGRVLAQLPVNQQLGKLIVLGHVFGCLDECLIIAAALSLKNFFA MPFRQHLDGYRNKVNFSGSSKSDCIALVEAFKTWKACIQTGELGYPKDVT" 3'UTR 1633..2414 BASE COUNT 685 a 441 c 580 g 708 t ORIGIN 1 ggcacgaggt aattccgtgg tgattatcca tggggccacg ggaagcggta aaagcactca 61 gctcccgcag tatatcttgg accactacgt tcagcgctcc gcctactgca gcattgtggt 121 cacccagccc cggaagatag gggcaagcag catcgccagg tggatcagta aagagcgtgc 181 ctggaccctg ggaggtgtgg tgggctacca ggtagggcta gagaaaatag caacagagga 241 caccaggcta atttatatga caactggagt cctgcttcag aaaatagtta gtgccaagag 301 tttgatggaa ttcacacata tcatcattga tgaagtacac gaacgaacag aagaaatgga 361 tttcctgcta ttggtagtcc gcaaactctt aagaacaaat tcacgttttg tgaaggtggt 421 cctgatgtcg gctaccatca gctgtaaaga gtttgcagac tactttgctg ttcctgttca 481 aaacaagatg aatcctgcat atattgttga agtggaagca agccccattc agttgaagag 541 tattatctta atgatttgga gcacattcat catagcaagc tctctcctca tctcctggag 601 gaaccggtga taactaagga tatatatgaa gttgctgtct ctctcattca gatgtttgat 661 gacttggata tgaaggagag tgggaacaag gcttggtcgg gggcccagtt tgtgttggag 721 cgaaccagtg tgttggtgtt tttgccaggt ctgggtgaaa taaattatat gcatgaactt 781 ctcacaagcc tggttcataa aaggttgcag gtctatccac tccattcaag tgtggcttta 841 gaagaacaga ataatgtctt tttaagtcca gtccctgggt acagaaagat tattctgtcc 901 accaatattg cagagagttc tgtcacagtt ccagatgtca aatatgttat agatttttgt 961 ttgactagaa ctttggtctg tgatgaagat acaaattatc agagtctgcg attgagttgg 1021 gcctctaaaa ccagctgtaa tcagagaaaa ggccgtgctg gacgagtgtc tagagggtac 1081 tgttaccggc tggtacacaa ggatttctgg gacaactcca tccctgatca tgttgttcct 1141 gagatgttgc gttgtccatt aggaagcacg atcttgaaag tgaaattact tgacatgggt 1201 gagccgagag ctctgctggc cactgccctt tccccgcctg ggctgagtga cattgagcgc 1261 accatccttc tactaaagga ggttggagca cttgcagtga gtgggcagag agaagatgaa 1321 aacccccatg atggtgaatt gaccttctta ggaagagttt tagcccaact tcctgtaaat 1381 cagcaacttg gtaaactcat agtccttgga catgtatttg gatgtctaga tgaatgtctt 1441 attatagcgg cagctctttc tttgaagaat ttttttgcaa tgcctttccg gcagcatctc 1501 gatggatata ggaacaaagt gaatttctct ggcagtagca agagtgactg tattgcactt 1561 gttgaggcat ttaaaacatg gaaggcttgc atacagacag gggagctggg gtacccgaag 1621 gatgtaactt aattggggta cggttaaatt acattcaaat caagagaatt agagaggtgg 1681 ctgaattata tgaagaattg aagactagaa tctcacagtt caacatgcat gttgattctc 1741 ggcgacctgt catggaccaa gagtatatat ataagcagcg attcatccta caggttgtat 1801 tggcaggtgc tttctatcca aattacttta cttttggaca gccggatgag gagatggcgg 1861 tgagggagct ggctggcaag gaccccaaga caactgtcgt gttgaaacac attcctccct 1921 atggatttct ttactataaa caactacagt ctgtgcccac tgcatcctaa aggccttttc 1981 tttcttcttt tctctttggg tgatagtcag agagtggtgt ttttgttcag gtgggaagga 2041 ttggaaactc tagtcttttc tagaaacaga aaatcactgt attaaatatt ttggaaagat 2101 tgttctgaaa gaagtctgtt tggataaaga gctgtatttt gctttaaatt tattaaggta 2161 aatataagta gttaatctta gatgtaaggt tccagaatgt gcttacatat tctgttctgt 2221 tacagtgatt taaaccagta gtataggaaa aaacttaaaa aacaaaaaaa ccatgtagta 2281 ttttctgatt tttttttcca tgagggaaaa tatctaattt ttataagact aagttgagta 2341 tacttcttgg gttcacattt tggaaatcag agattacaga ttacatggcc atagcttatc 2401 tgtgttaaaa caat // LOCUS HSU56387 2766 bp mRNA PRI 20-AUG-1996 DEFINITION Human PC6A protease (hPC6) mRNA, complete cds. ACCESSION U56387 NID g1498312 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2766) AUTHORS Miranda,L., Wolf,J., Pichuantes,S., Duke,R. and Franzusoff,A. TITLE Isolation of the human PC6 gene encoding the putative host protease for HIV-1 gp160 processing in CD4+ T lymphocytes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (15), 7695-7700 (1996) MEDLINE 96353880 REFERENCE 2 (bases 1 to 2766) AUTHORS Franzusoff,A., Miranda,L., Wolf,J., Pichuantes,S. and Duke,R. TITLE Direct Submission JOURNAL Submitted (23-APR-1996) Alex Franzusoff, Cellular and Structural Biology, University of Colorado Health Sciences Center, Box B-111, 4200 East Ninth Avenue, Denver, CO 80262, USA FEATURES Location/Qualifiers source 1..2766 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="CEM human T cell" gene 13..2760 /gene="hPC6" CDS 13..2760 /gene="hPC6" /note="KEX2-like protease" /codon_start=1 /product="PC6A protease" /db_xref="PID:g1498313" /translation="MDWDWGNRCSRPGRRDLLCVLALLAGCLLPVCRTRVYTNHWAVK IAGGFAEADRIASKYGFINVGQIGALKDYYHFYHSRTIKRSVLSSRGTHSFISMEPKV EWIQQQVVKKRTKRDYDLSHAQSTYFNDPKWPSMWYMHCSDNTHPCQSDMNIEGAWKR GYTGKNIVVTILDDGIERTHPDLMQNYDALASCDVNGNDLDPMPRYDASNENKHGTRC AGEVAAAANNSHCTVGIAFNAKIGGVRMLDGDVTDMVEAKSVSFNPQHVHIYSASWGP DDDGKTVDGPAPLTRQAFENGVRMGRRGLGSVFVWASGNGGRSKDHCSCDGYTNSIYT ISISSTAESGKKPWYLEECSSTLATTYSSGESYDKKIITTDLRQRCTDNHTGTSASAP MAAGIIALALEANPFLTWRDVQHVIVRTSRAGHLNANDWKTNAAGFKVSHLYGFGLMD AEAMVMEAEKWTTVPRQHVCVESTDRQIKTIRPNSAVRSIYKASGCSDNPNRHVNYLE HVVVRITITHPRRGDLAIYLTSPSGTRSQLLANRLFDHSMEGFKNWEFMTIHCWGERA AGDWVLEVYDTPSQLRNFKTPGKLKEWSLVLYGTSVRPYSPTNEFPKVERFRYSRVED PTDDYGTEDYAGPCDPECSEVGCDGPGPDHCNDCLHYYYKLKNNTRICVSSCPPGHYH ADKKRCRKCAPNCESCFGSHGDQCMSCKYGYFLNEETNSCVTHCPDGSYQDTKKNLCR KCSENCKTCTEFHNCTECRDGLSLQGSRCSVSCEDGRYFNGQDCQPCHRFCATCAGAG ADGCINCTEGYFMEDGRCVQSCSISYYFDHSSENGYKSCKKCDISCLTCNGPGFKNCT SCPSGYLLDLGMCQMGAICKDATEESWAEGGFCMLVKKNNLCQRKVLQQLCCKTCTFQ G" BASE COUNT 736 a 702 c 747 g 581 t ORIGIN 1 agcgtcggga ccatggattg ggattggggg aaccgctgca gccgcccggg acggcgggac 61 ctgctgtgcg tgctggcact gctcgccggc tgtctgctcc cggtatgccg gacgcgcgtc 121 tacaccaacc actgggcagt gaagatcgcc ggcggcttcg cggaggcaga tcgcatagcc 181 agcaagtacg gattcatcaa cgtaggacag atcggtgcac tgaaggacta ctatcacttc 241 taccatagta ggaccattaa aaggtctgtt ctctcgagca gaggaaccca cagtttcatt 301 tcaatggaac caaaggtgga gtggatccaa cagcaagtgg tgaaaaaaag aaccaagagg 361 gattatgacc tcagccatgc ccagtcaacc tacttcaatg atcccaagtg gccaagtatg 421 tggtacatgc actgtagcga caatacacat ccctgccagt ctgacatgaa tatcgaagga 481 gcctggaaga gaggctacac gggaaagaac attgtggtca ctatcctgga tgacggaatt 541 gagagaaccc atccagatct gatgcaaaac tacgatgctc tggcaagttg cgacgtgaat 601 gggaatgact tggacccaat gcctcgttat gatgcaagca acgagaacaa gcatgggact 661 cgctgtgctg gagaagtggc agccgctgca aacaattcgc actgcacagt cggaattgct 721 ttcaacgcca agatcggagg agtgcgaatg ctggacggag atgtcacgga catggttgaa 781 gcaaaatcag ttagcttcaa cccccagcac gtgcacattt acagcgccag ctggggcccg 841 gatgatgatg gcaagactgt ggacggacca gcccccctca cccggcaagc ctttgaaaac 901 ggcgttagaa tggggcggag aggcctcggc tctgtgtttg tttgggcatc tggaaatggt 961 ggaaggagca aagaccactg ctcctgtgat ggctacacca acagcatcta caccatctcc 1021 atcagcagca ctgcagaaag cggaaagaaa ccttggtacc tggaagagtg ttcatccacg 1081 ctggccacaa cctacagcag cggggagtcc tacgataaga aaatcatcac tacagatctg 1141 aggcagcgtt gcacggacaa ccacactggg acgtcagcct cagcccccat ggctgcaggc 1201 atcattgcgc tggccctgga agccaatccg tttctgacct ggagagacgt acagcatgtt 1261 attgtcagga cttcccgtgc gggacatttg aacgctaatg actggaaaac caatgctgct 1321 ggttttaagg tgagccatct ttatggattt ggactgatgg acgcagaagc catggtgatg 1381 gaggcagaga agtggaccac cgttccccgg cagcacgtgt gtgtggagag cacagaccga 1441 caaatcaaga caatccgccc taacagtgca gtgcgctcca tctacaaagc ttcaggctgc 1501 tcggataacc ccaaccgcca tgtcaactac ctggagcacg tcgttgtgcg catcaccatc 1561 acccacccca ggagaggaga cctggccatc tacctgacct cgccctctgg aactaggtct 1621 cagcttttgg ccaacaggct atttgatcac tccatggaag gattcaaaaa ctgggagttc 1681 atgaccattc attgctgggg agaaagagct gctggtgact gggtccttga agtttatgat 1741 actccctctc agctaaggaa ctttaagact ccaggtaaat tgaaagaatg gtctttggtc 1801 ctctacggca cctccgtgcg gccatattca ccaaccaatg aatttccgaa agtggaacgg 1861 ttccgctata gccgagttga agaccccaca gacgactatg gcacagagga ttatgcaggt 1921 ccctgcgacc ctgagtgcag tgaggttggc tgtgacgggc caggaccaga ccactgcaat 1981 gactgtttgc actactacta caagctgaaa aacaatacca ggatctgtgt ctccagctgc 2041 ccccctggcc actaccacgc cgacaagaag cgctgcagga agtgtgcccc caactgtgag 2101 tcctgctttg ggagccatgg tgaccaatgc atgtcctgca aatatggata ctttctgaat 2161 gaagaaacca acagctgtgt tactcactgc cctgatgggt catatcagga taccaagaaa 2221 aatctttgcc ggaaatgcag tgaaaactgc aagacatgta ctgaattcca taactgtaca 2281 gaatgtaggg atgggttaag cctgcaggga tcccggtgct ctgtctcctg tgaagatgga 2341 cggtatttca acggccagga ctgccagccc tgccaccgct tctgcgccac ttgtgctggg 2401 gcaggagctg atgggtgcat taactgcaca gagggctact tcatggagga tgggagatgc 2461 gtgcagagct gtagtatcag ctattacttt gaccactctt cagagaatgg atacaaatcc 2521 tgcaaaaaat gtgatatcag ttgtttgacg tgcaatggcc caggattcaa gaactgtaca 2581 agctgcccta gtgggtatct cttagactta ggaatgtgtc aaatgggagc catttgcaag 2641 gatgcaacgg aagagtcctg ggcggaagga ggcttctgta tgcttgtgaa aaagaacaat 2701 ctgtgccaac ggaaggttct tcaacaactt tgctgcaaaa catgtacatt ccaaggctga 2761 gcagcc // LOCUS HSU56390 1481 bp mRNA PRI 16-AUG-1996 DEFINITION Human cysteine protease ICE-LAP6 mRNA, complete cds. ACCESSION U56390 NID g1336026 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1481) AUTHORS Duan,H., Orth,K., Chinnaiyan,A.M., Poirier,G.G., Froelich,C.J., He,W.W. and Dixit,V.M. TITLE ICE-LAP6, a novel member of the ICE/Ced-3 gene family, is activated by the cytotoxic T cell protease granzyme B JOURNAL J. Biol. Chem. 271 (28), 16720-16724 (1996) MEDLINE 96279246 REFERENCE 2 (bases 1 to 1481) AUTHORS Duan,H., Orth,K., Chinnaiyan,A.M., Poirier,G.G., Froelich,C.J., He,W.W. and Dixit,V.M. TITLE Direct Submission JOURNAL Submitted (23-APR-1996) H. Duan, Pathology, University of Michigan, 1301 Catherine St., Ann Arbor, MI 48109, USA FEATURES Location/Qualifiers source 1..1481 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 4..1254 /note="cysteine protease" /codon_start=1 /product="ICE-LAP6" /db_xref="PID:g1336027" /translation="MDEADRRLLRRCRLRLVEELQVDQLWDVLLSRELFRPHMIEDIQ RAGSGSRRDQARQLIIDLETRGSQALPLFISCLEDTGQDMLASFLRTNRQAGKLSKPT LENLTPVVLRPEIRKPEVLRPETPRPVDIGSGGFGDVGALESLRGNADLAYILSMEPC GHCLIINNVNFCRESGLRTRTGSNIDCEKLRRRFSSLHFMVEVKGDLTAKKMVLALLE LARQDHGALDCCVVVILSHGCQASHLQFPGAVYGTDGCPVSVEKIVNIFNGTSCPSLG GKPKLFFIQACGGEQKDHGFEVASTSPEDESPGSNPEPDATPFQEGLRTFDQLDAISS LPTPSDIFVSYSTFPGFVSWRDPKSGSWYVETLDDIFEQWAHSEDLQSLLLRVANAVS VKGIYKQMPGCFNFLRKKLFFKTS" BASE COUNT 308 a 407 c 431 g 335 t ORIGIN 1 gccatggacg aagcggatcg gcggctcctg cggcggtgcc ggctgcggct ggtggaagag 61 ctgcaggtgg accagctctg ggacgtcctg ctgagccgcg agctgttcag gccccatatg 121 atcgaggaca tccagcgggc aggctctgga tctcggcggg atcaggccag gcagctgatc 181 atagatctgg agactcgagg gagtcaggct cttcctttgt tcatctcctg cttagaggac 241 acaggccagg acatgctggc ttcgtttctg cgaactaaca ggcaagcagg aaagttgtcg 301 aagccaaccc tagaaaacct taccccagtg gtgctcagac cagagattcg caaaccagag 361 gttctcagac cggaaacacc cagaccagtg gacattggtt ctggaggatt cggtgatgtc 421 ggtgctcttg agagtttgag gggaaatgca gatttggctt acatcctgag catggagccc 481 tgtggccact gcctcattat caacaatgtg aacttctgcc gtgagtccgg gctccgcacc 541 cgcactggct ccaacatcga ctgtgagaag ttgcggcgtc gcttctcctc gctgcatttc 601 atggtggagg tgaagggcga cctgactgcc aagaaaatgg tgctggcttt gctggagctg 661 gcgcggcagg accacggtgc tctggactgc tgcgtggtgg tcattctctc tcacggctgt 721 caggccagcc acctgcagtt cccaggggct gtctacggca cagatggatg ccctgtgtcg 781 gtcgagaaga ttgtgaacat cttcaatggg accagctgcc ccagcctggg agggaagccc 841 aagctctttt tcatccaggc ctgtggtggg gagcagaaag accatgggtt tgaggtggcc 901 tccacttccc ctgaagacga gtcccctggc agtaaccccg agccagatgc caccccgttc 961 caggaaggtt tgaggacctt cgaccagctg gacgccatat ctagtttgcc cacacccagt 1021 gacatctttg tgtcctactc tactttccca ggttttgttt cctggaggga ccccaagagt 1081 ggctcctggt acgttgagac cctggacgac atctttgagc agtgggctca ctctgaagac 1141 ctgcagtccc tcctgcttag ggtcgctaat gctgtttcgg tgaaagggat ttataaacag 1201 atgcctggtt gctttaattt cctccggaaa aaacttttct ttaaaacatc ataaggccag 1261 ggcccctcac cctgccttat cttgcacccc caaagctttc ctgccccagg cctgaaagag 1321 gctgaggcct ggactttcct gcaactcaag gactttgcag ccggcacagg gtctgctctt 1381 tctctgccag tgacagacag gctcttagca gcttccagat tgacgacaag tgctgaacag 1441 tggaggaaga gggacagatg aatgccgtgg attgcacgtg g // LOCUS HSU56420 945 bp DNA PRI 30-MAY-1996 DEFINITION Human olfactory receptor (OLF1) gene, complete cds. ACCESSION U56420 NID g1336040 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 945) AUTHORS Issel-Tarver,L. and Rine,J. TITLE Evolution of Mammalian Olfactory Receptor Genes JOURNAL Unpublished REFERENCE 2 (bases 1 to 945) AUTHORS Issel-Tarver,L. and Rine,J. TITLE Direct Submission JOURNAL Submitted (24-APR-1996) L. Issel-Tarver, MCB, UC Berkeley, 401 Barker Hall, Berkeley, CA 94720, USA FEATURES Location/Qualifiers source 1..945 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11q11" gene 1..945 /gene="OLF1" CDS 1..945 /gene="OLF1" /note="olfactory receptor" /codon_start=1 /product="HsOLF1" /db_xref="PID:g1336041" /translation="MEFTDRNYTLVTEFILLGFPTRPELQIVLFLMFLTLYAIILIGN IGLMLLIRIDPHLQTPMYFFLSNLSFVDLCYFSDIVPKMLVNFLSENKSISYYGCALQ FYFFCTFADTESFILAAMAYDRYVAICNPLLYTVVMSRGICMRLIVLSYLGGNMSSLV HTSFAFILKYCDKNVINHFFCDLPPLLKLSCTDTTINEWLLSTYGSSVEIICFIIIII SYFFILLSVLKIRSFSGRKKTFSTCASHLTSVTIYQGTLLFIYSRPSYLYSPNTDKII SVFYTIFIPVLNPLIYSLRNKDVKDAAEKVLRSKVDSS" BASE COUNT 232 a 221 c 162 g 330 t ORIGIN 1 atggaattta cagatagaaa ctacacgttg gtcactgagt ttattctatt aggttttcca 61 actcgccctg aactgcagat tgtcctgttc ctcatgtttc tgacattgta tgctataatt 121 ctgataggga acattggatt gatgctgttg atcaggattg atcctcacct tcaaaccccc 181 atgtattttt tccttagcaa cctatcattt gtagaccttt gctatttctc agacattgtt 241 cccaaaatgc tggtcaattt cctctcggag aacaaatcta tttcctatta tgggtgtgcc 301 ctgcagtttt attttttctg tacttttgca gatacagaat ccttcatcct ggccgccatg 361 gcctatgatc gctatgtcgc catctgtaac cctttattgt acacagttgt gatgtctagg 421 ggcatctgta tgcggttgat tgtcttgtca taccttggag gcaacatgag ttccctggtt 481 cacacatcct ttgcctttat tctgaaatat tgtgacaaaa atgttattaa tcattttttc 541 tgtgacctcc ctcccctgct taaactatcc tgcactgaca caacaattaa tgagtggctc 601 ctctccacat acggcagctc agtggaaatc atttgtttta tcatcatcat catctcctac 661 tttttcattc ttctctcagt cttaaagatc cgctctttca gtgggaggaa gaagaccttt 721 tctacatgcg cctctcacct gacttcagtg acgatctacc aagggactct cctctttatt 781 tactcacggc ccagctacct gtattctcca aacactgata aaattatctc agtgttctac 841 accattttca ttccagtgct gaatccgttg atttatagtt tgagaaataa agatgtaaag 901 gatgcagctg agaaagttct aagatcaaag gtagattctt catga // LOCUS HSU56637 2385 bp mRNA PRI 04-FEB-1998 DEFINITION Human capping protein alpha subunit isoform 1 mRNA, complete cds. ACCESSION U56637 NID g1336098 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2385) AUTHORS Hart,M.C., Korshunova,Y.O. and Cooper,J.A. TITLE Vertebrates have conserved capping protein alpha isoforms with specific expression patterns JOURNAL Cell Motil. Cytoskeleton 38 (2), 120-132 (1997) MEDLINE 97470757 REFERENCE 2 (bases 1 to 2385) AUTHORS Hart,M.C., Korshunova,Y.O. and Cooper,J.A. TITLE Direct Submission JOURNAL Submitted (25-APR-1996) Marilyn C. Hart, Cell Biology & Physiology, Washington University, Box 8228, 660 S. Euclid Ave., St. Louis, MO 63110, USA FEATURES Location/Qualifiers source 1..2385 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1..861 /function="binds barbed ends of actin filaments" /codon_start=1 /product="capping protein alpha subunit isoform 1" /db_xref="PID:g1336099" /translation="MADFDDRVSDEEKVRIAAKFITHAPPGEFNEVFNDVRLLLNNDN LLREGAAHAFAQYNMDQFTPVKIEGYEDQVLITEHGDLGNSRFLDPRNKISFKFDHLR KEASDPQPEEADGGLKSWRESCDSALRAYVKDHYSNGFCTVYAKTIDGQQTIIACIES HQFQPKNFWNGRWRSEWKFTITPPTAQVVGVLKIQVHYYEDGNVQLVSHKDVQDSLTV SNEAQTAKEFIKIIENAENEYQTAISENYQTMSDTTFKALRRQLPVTRTKIDWNKILS YKIGKEMQNA" source 1..33 /organism="Homo sapiens" /note="derived by RACE from RNA" /tissue_type="testis" source 34..2385 /organism="Homo sapiens" /note="see also EST sequence, GenBank Accession Number R58525" /tissue_type="heart" /dev_stage="fetal" /clone="G4021" source 812..2385 /organism="Homo sapiens" /tissue_type="placenta" /clone="146582" polyA_signal 2268..2273 BASE COUNT 724 a 448 c 447 g 766 t ORIGIN 1 atggccgact tcgatgatcg tgtgtcggat gaggagaagg tacgcatagc tgctaaattc 61 atcactcatg cacccccagg ggaatttaat gaagtattca atgacgttcg gctactactt 121 aataatgaca atctcctcag ggaaggggca gcacatgcat ttgcccagta taacatggat 181 cagttcacgc ctgtgaagat agaaggatat gaagatcagg tcttaattac agagcacggt 241 gacctgggta atagcagatt tttagatcca agaaacaaaa tttcctttaa atttgaccac 301 ttacggaaag aagcaagtga cccccagcca gaagaagcag atggaggtct gaagtcttgg 361 agagaatcct gtgacagtgc tttaagagcc tatgtgaaag accattattc caacggcttc 421 tgtactgttt atgctaaaac tatcgatggg caacagacta ttattgcatg tattgaaagc 481 caccagtttc agcctaaaaa cttctggaat ggtcgttgga gatcagagtg gaagttcacc 541 atcacaccac ctacagccca ggtggttggc gtgcttaaga ttcaggttca ctattatgaa 601 gatggcaatg ttcagttggt tagtcataaa gatgtacagg attcactaac tgtttcgaat 661 gaagcccaaa ctgccaagga gtttattaaa atcatagaga atgcagaaaa tgagtatcag 721 acagcaatta gtgaaaacta tcaaacaatg tcagatacca cattcaaggc cttgcgccgc 781 cagcttccag ttacccgcac caaaatcgac tggaacaaga tactcagcta caagattggc 841 aaagaaatgc agaatgctta aaggctgaat gtaggattct tcagtatgtg gaaagacaag 901 gattcaacgt gtggtcatat gataaataag tgatttataa acaagagtga tattttgcta 961 gggctttcaa agttaaccgg ttttctagcc tcatggaata ctgttgaacc tatagcgttg 1021 tcttgattct tttgtgttct ctgccttgta attttctgtt actgctatat ctacgtgtaa 1081 atcttttttt cttttttttt tttttttttt ttcttttttg gttaattctg ccacatttaa 1141 tgttggtgag agagtgatct atcctaatga catttactgt ttaaaaaagt ttcctagcca 1201 tgaagccctg ctactgattt agacaaggta ttatggtcat tactttgtac ccctatcctt 1261 ccaagcactt ctggtacttc agtcgttttt actgatccac caacacctaa agaggctatg 1321 ctacagtctc tagctaaatg gaagacacat tcatccttct ccctctgact gctttgatca 1381 tcatttattg catcgtcata tcatatttat cgcatctcat aactaacttt ctaaagtttg 1441 gattgggact tttcaggtcc tttttggagg gcaaaggaag ttccagcttc tctggggaac 1501 ttgtttttaa atccaaagac ttgaaccaca ttccctgcac atgaacatgt ttgcttttat 1561 cccttctctc attggctcct tcccatctta gtaccattgt agttatacat ctgcattttt 1621 tagaagcatt ttacccattt atttttttaa acattcaaga actgctgacg tactgtggat 1681 gtagagtata aaacttgaaa aatgcagatg ttgaaggaat aataggtatc ttgtgcttta 1741 atactttatg gcaggattgt actataagca aatgaattaa acagctatgt aaatcataaa 1801 gaaaaactaa aaatgaacca aagtgaaagg ataacttcca ggcagtatct ttctattgta 1861 acctgttatt taaggaaata ctagtgattt cttctaaata ggatgtaaac ttctttcaaa 1921 ttactcttcc tcagtctgcc tgccaagaac tcaagtgtaa ctgtgataaa ataacctttc 1981 ccaggtatat tcggcaggta tgtgtgtaat ctcagaatac acaggtgaca tagatatgat 2041 atgacaactg gtaatggtgg attcatttac attgtttaca cttctatgac caggccttaa 2101 gggaaggtca gttttttaaa aaaccaagta gtgtcttcct acctatctcc agatacatgt 2161 caaaaagaaa aggtgtttgt gctccgtttt gtttctgctc agtaatatag tcaagcaagt 2221 ttgttccagg tgacccattg agctgtgtat gcatttttgt ttatttcaat aaaatatatt 2281 tgtattattt gtccttcata ctatccatcc ataccacact atcttctgta tcaggtagtc 2341 taatagaaat atacctgttt tgttctaaaa aaaaaaaaaa aaaaa // LOCUS HSU56814 1023 bp mRNA PRI 24-JUL-1997 DEFINITION Human DNase1-Like III protein (DNAS1L3) mRNA, complete cds. ACCESSION U56814 NID g1399718 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1023) AUTHORS Rodriguez,A.M., Rodin,D., Nomura,H., Morton,C.C., Weremowicz,S. and Schneider,M.C. TITLE Identification, localization, and expression of two novel human genes similar to deoxyribonuclease I JOURNAL Genomics 42 (3), 507-513 (1997) MEDLINE 97349121 REFERENCE 2 (bases 1 to 1023) AUTHORS Schneider,M.C. and Rodriguez,A. TITLE Direct Submission JOURNAL Submitted (25-APR-1996) M.C. Schneider, Renal Division, Brigham and Women's Hospital, 75 Francis Street, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..1023 /organism="Homo sapiens" /note="corresponds to EST clones 82269 (GenBank Accession Number T68985, T69063), 82738 (GenBank Accession Number T73558, T73653), 78422 (GenBank Accession Number T61400, T61368)" /db_xref="taxon:9606" /clone_lib="Stratagene liver (#937224)" gene 1..1023 /gene="DNAS1L3" CDS 25..942 /gene="DNAS1L3" /codon_start=1 /product="DNase1-Like III protein" /db_xref="PID:g1399719" /translation="MSRELAPLLLLLLSIHSALAMRICSFNVRSFGESKQEDKNAMDV IVKVIKRCDIILVMEIKDSNNRICPILMEKLNRNSRRGITYNYVISSRLGRNTYKEQY AFLYKEKLVSVKRSYHYHDYQDGDADVFSREPFVVWFQSPHTAVKDFVIIPLHTTPET SVKEIDELVEVYTDVKHRWKAENFIFMGDFNAGCSYVPKKAWKNIRLRTDPRFVWLIG DQEDTTVKKSTNCAYDRIVLRGQEIVSSVVPKSNSVFDFQKAYKLTEEEALDVSDHFP VEFKLQSSRAFTNSKKSVTLRKKTKSKRS" BASE COUNT 312 a 244 c 240 g 227 t ORIGIN 1 tcttgaagcc agagcagcgc caggatgtca cgggagctgg ccccactgct gcttctcctc 61 ctctccatcc acagcgccct ggccatgagg atctgctcct tcaacgtcag gtcctttggg 121 gaaagcaagc aggaagacaa gaatgccatg gatgtcattg tgaaggtcat caaacgctgt 181 gacatcatac tcgtgatgga aatcaaggac agcaacaaca ggatctgccc catactgatg 241 gagaagctga acagaaattc aaggagaggc ataacgtaca actatgtgat tagctctcgg 301 cttggaagaa acacatataa agaacaatat gcctttctct acaaggaaaa gctggtgtct 361 gtgaagagga gttatcacta ccatgactat caggatggag acgcagatgt gttttccagg 421 gagccctttg tggtctggtt ccaatctccc cacactgctg tcaaagactt cgtgattatc 481 cccctgcaca ccaccccaga gacatccgtt aaggagatcg atgagttggt tgaggtctac 541 acggacgtga aacaccgctg gaaggcggag aatttcattt tcatgggtga cttcaatgcc 601 ggctgcagct acgtccccaa gaaggcctgg aagaacatcc gcttgaggac tgaccccagg 661 tttgtttggc tgatcgggga ccaagaggac accacggtga agaagagcac caactgtgca 721 tatgacagga ttgtgcttag aggacaagaa atcgtcagtt ctgttgttcc caagtcaaac 781 agtgtttttg acttccagaa agcttacaag ctgactgaag aggaggccct ggatgtcagc 841 gaccactttc cagttgaatt taaactacag tcttcaaggg ccttcaccaa cagcaaaaaa 901 tctgtcactc taaggaagaa aacaaagagc aaacgctcct agacccaagg gtctcatctt 961 attaaccatt tcttgcctct aaataaaatg tctctaacag aaaaaaaaaa aaaaaaaaaa 1021 aaa // LOCUS HSU56976 2265 bp mRNA PRI 22-OCT-1996 DEFINITION Human calmodulin dependent phosphodiesterase PDE1B1 mRNA, complete cds. ACCESSION U56976 NID g1621591 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2265) AUTHORS Jiang,X., Li,J., Paskind,M. and Epstein,P.M. TITLE Inhibition of calmodulin-dependent phosphodiesterase induces apoptosis in human leukemic cells JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (20), 11236-11241 (1996) MEDLINE 97008163 REFERENCE 2 (bases 1 to 2265) AUTHORS Epstein,P.M. TITLE Direct Submission JOURNAL Submitted (29-APR-1996) Paul M. Epstein, Pharmacology, University of Connecticut Health Center, 263 Farmington Avenue, Farmington, CT 06030, USA FEATURES Location/Qualifiers source 1..2265 /organism="Homo sapiens" /note="derived from a patient with acute lymphocytic leukemia" /db_xref="taxon:9606" /cell_line="RPMI 8392 B lymphoblastoid" CDS 35..1645 /note="Method: conceptual translation supplied by author" /codon_start=1 /product="calmodulin dependent phosphodiesterase PDE1B1" /db_xref="PID:g1621592" /translation="MELSPRSPPEMLEESDCPSPLELKSAPSKKMWIKLRSLLRYMVK QLENGEINIEELKKNLEYTASLLEAVYIDETRQILDTEDELQELRSDAVPSEVRDWLA STFTQQARAKGRRAEEKPKFRSIVHAVQAGIFVERMFRRTYTSVGPTYSTAVLNCLKN LDLWCFDVFSLNQAADDHALRTIVFELLTRHNLISRFKIPTVFLMSFLDALETGYGKY KNPYHNQIHAADVTQTVHCFLLRTGMVHCLSEIELLAIIFAAAIHDYEHTGTTNSFHI QTKSECAIVYNDRSVLENHHISSVFRLMQDDEMNIFINLTKDEFVELRALVIEMVLAT DMSCHFQQVKTMKTALQQLERIDKPKALSLLLHAADISHPTKQWLVHSRWTKALMEEF FRQGDKEAELGLPFSPLCDRTSTLVAQSQIGFIDFIVEPTFSVLTDVAEKSVQPLADE DSKSKNQPSFQWRQPSLDVEVGDPNPDVVSFRSTWVKRIQENKQKWKERAASGITNQM SIDELSPCEEEAPPSPAEDEHNQNGNLD" BASE COUNT 506 a 629 c 628 g 499 t 3 others ORIGIN 1 gctrgtccmy gccagccgca gaccgtggct gagcatggag ctgtcccccc gcagtcctcc 61 ggagatgctg gaggagtcgg attgcccgtc acccctggag ctgaagtcag cccccagcaa 121 gaagatgtgg attaagcttc ggtctctgct gcgctacatg gtgaagcagt tggagaatgg 181 ggagataaac attgaggagc tgaagaaaaa tctggagtac acagcttctc tgctggaagc 241 cgtctacata gatgagacac ggcaaatctt ggacacggag gacgagctgc aggagctgcg 301 gtcagatgcc gtgccttcgg aggtgcggga ctggctggcc tccaccttca cccagcaggc 361 ccgggccaaa ggccgccgag cagaggagaa gcccaagttc cgaagcattg tgcacgctgt 421 gcaggctggg atcttcgtgg aacggatgtt ccggagaaca tacacctctg tgggccccac 481 ttactctact gcggttctca actgtctcaa gaacctggat ctctggtgct ttgatgtctt 541 ttccttgaac caggcagcag atgaccatgc cctgaggacc attgtttttg agttgctgac 601 tcggcataac ctcatcagcc gcttcaagat tcccactgtg tttttgatga gtttcctgga 661 tgccttggag acaggctatg ggaagtacaa gaatccttac cacaaccaga tccacgcagc 721 cgatgttacc cagacagtcc attgcttctt gctccgcaca gggatggtgc actgcctgtc 781 ggagattgag ctcctggcca tcatctttgc tgcagctatc catgattatg agcacacggg 841 cactaccaac agcttccaca tccagaccaa gtcagaatgt gccatcgtgt acaatgatcg 901 ttcagtgctg gagaatcacc acatcagctc tgttttccga ttgatgcagg atgatgagat 961 gaacattttc atcaacctca ccaaggatga gtttgtagaa ctccgagccc tggtcattga 1021 gatggtgttg gccacagaca tgtcctgcca tttccagcaa gtgaagacca tgaagacagc 1081 cttgcaacag ctggagagga ttgacaagcc caaggccctg tctctactgc tccatgctgc 1141 tgacatcagc cacccaacca agcagtggtt ggtccacagc cgttggacca aggccctcat 1201 ggaggaattc ttccgtcagg gtgacaagga ggcagagttg ggcctgccct tttctccact 1261 ctgtgaccgc acttccactc tagtggcaca gtctcagata gggttcatcg acttcattgt 1321 ggagcccaca ttctctgtgc tgactgacgt ggcagagaag agtgttcagc ccctggcgga 1381 tgaggactcc aagtctaaaa accagcccag ctttcagtgg cgccagccct ctctggatgt 1441 ggaagtggga gaccccaacc ctgatgtggt cagctttcgt tccacctggg tcaagcgcat 1501 tcaggagaac aagcagaaat ggaaggaacg ggcagcaagt ggcatcacca accagatgtc 1561 cattgacgag ctgtccccct gtgaagaaga ggccccccca tcccctgccg aagatgaaca 1621 caaccagaat gggaatctgg attagccctg gggctggccc aggtcttcat tgagtccaaa 1681 gtgtttgatg tcatcagcac catccatcag gactggctcc cccatctgct ccaagggagc 1741 gtggtcgtgg aagaaacaac ccacctgaag gccaaatgcc agagatttgg ggttggggaa 1801 agggcccctc cccacctgac acccactggg gtgcacttta atgttccggc agcaagactg 1861 gggaacttca ggctcccagt ggtcactgtg cccatccctc agcctctgga ttctcttcat 1921 ggccaggtgg ctgccaggga gcggggagct tcctggaggc ttcccagggc cttggggaag 1981 ggtcagagat gccagccccc tgggacctcc cccatccttt ttgcctccaa gtttctaagc 2041 aatacatttt gggggttccc tcagcccccc accccagatc ttagctggca ggtctgggtg 2101 ccccttttcc tcccctggga agggctggaa taggatagaa agctgggggt tttcagagcc 2161 ctatgtgtgg ggaggggagt ggattccttc agggcatggt acctttctag gatctgggaa 2221 tggggtggag aggacatcct cttcacccca gaattgcggg aattc // LOCUS HSU56978 638 bp mRNA PRI 15-JUL-1996 DEFINITION Human fibroblast growth factor 8 (FGF-8) mRNA, complete cds. ACCESSION U56978 NID g1418263 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 638) AUTHORS Payson,R.A., Wu,J., Liu,Y. and Chiu,I.-M. TITLE The Human FGF-8 Gene Localizes on Chromosome 10q24 and is Subjected to Induction by Androgen in Breast Cancer Cells JOURNAL Oncogene (1996) In press REFERENCE 2 (bases 1 to 638) AUTHORS Chiu,I.-M. TITLE Direct Submission JOURNAL Submitted (29-APR-1996) Internal Medicine, The Ohio-State University, S-2052 Davis Center, 480 W. 9th Ave., Columbus, OH 43210, USA FEATURES Location/Qualifiers source 1..638 /organism="Homo sapiens" /note="the second of two FGF-8 cDNA clones; this has a deletion of the second exon, 37 bp in size" /db_xref="taxon:9606" /chromosome="10" /map="10q24" /clone="FGF-8a" gene 10..45 /gene="FGF-8" CDS 10..45 /gene="FGF-8" /note="reading frame shift; the polypeptide encoded by this CDS is not likely to have functional significance." /codon_start=1 /product="fibroblast growth factor FGF-8a" /db_xref="PID:g1418264" /translation="MGSPRSALSCL" BASE COUNT 147 a 213 c 186 g 92 t ORIGIN 1 gggatccata tgggcagccc ccgctccgcg ctgagctgcc tgtaactgtt cagtcctcac 61 ctaattttac acagcatgtg agggagcaga gcctggtgac ggatcagctc agccgccgcc 121 tcatccggac ctaccaactc tacagccgca ccagcgggaa gcacgtgcag gtcctggcca 181 acaagcgcat caacgccatg gcagaggacg gcgacccctt cgcaaagctc atcgtggaga 241 cggacacctt tggaagcaga gttcgagtcc gaggagccga gacgggcctc tacatctgca 301 tgaacaagaa ggggaagctg atcgccaaga gcaacggcaa aggcaaggac tgcgtcttca 361 cggagattgt gctggagaac aactacacag cgctgcagaa tgccaagtac gagggctggt 421 acatggcctt cacccgcaag ggccggcccc gcaagggctc caagacgcgg cagcaccagc 481 gtgaggtcca cttcatgaag cggctgcccc ggggccacca caccaccgag cagagcctgc 541 gcttcgagtt cctcaactac ccgcccttca cgcgcagcct gcgcggcagc cagaggactt 601 gggccccgga accccgatag gcgctcgccc agatctcc // LOCUS HSU56998 2169 bp mRNA PRI 13-AUG-1996 DEFINITION Human putative serine/threonine protein kinase PRK (prk) mRNA, complete cds. ACCESSION U56998 NID g1488262 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2169) AUTHORS Li,B., Ouyang,B., Pan,H., Reissmann,P.T., Slamon,D.J., Arceci,R., Lu,L. and Dai,W. TITLE Prk, a cytokine-inducible human protein serine/threonine kinase whose expression appears to be down-regulated in lung carcinomas JOURNAL J. Biol. Chem. 271 (32), 19402-19408 (1996) MEDLINE 96325053 REFERENCE 2 (bases 1 to 2169) AUTHORS Dai,W. TITLE Direct Submission JOURNAL Submitted (29-APR-1996) Wei Dai, Department of Internal Medicine, K-Pavilion, University of Cincinnati, 231 Bethesda Avenue, ML-508, Cincinnati, OH 45267, USA FEATURES Location/Qualifiers source 1..2169 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="megakaryoblastic leukemia" 5'UTR 1..36 gene 37..1860 /gene="prk" CDS 37..1860 /gene="prk" /codon_start=1 /product="putative serine/threonine protein kinase PRK" /db_xref="PID:g1488263" /translation="MLAGLPTSDPGRLITDPRSGRTYLKGRLLGKGGFARCYEATDTE TGSAYAVKVIPQSRVAKPHQREKILNEIELHRDLQHRHIVRFSHHFEDADNIYIFLEL CSRKSLAHIWKARHTLLEPEVRYYLRQILSGLKYLHQRGILHRDLKLGNFFITENMEL KVGDFGLAARLEPPEQRKKTICGTPNYVAPEVLLRQGHGPEADVWSLGCVMYTLLCGS PPFETADLKETYRCIKQVHYTLPASLSLPARQLLAAILRASPRDRPSIDQILRHDFFT KGYTPDRLPISSCVTVPDLTPPNPARSLFAKVTKSLFGRKKKSKNHAQERDEVSGLVS GLMRTSVGHQDARPEAPAASGPAPVSLVETAPEDSSPRGTLASSGDGFEEGLTVATVV ESALCALRNCIAFMPPAEQNPAPLAQPEPLVWVSKWVDYSNKFGFGYQLSSRRVAVLF NDGTHMALSANRKTVHYNPTSTKHFSFSVGAVPRALQPQLGILRYFASYMEQHLMKGG DLPSVEEVEVPAPPLLLQWVKTDQALLMLFSDGTVQVNFYGDHTKLILSGWEPLLVTF VARNRSACTYLASHLRQLGCSPDLRQRLRYALRLLRDRSPA" 3'UTR 1861..2169 polyA_site 2169 /note="15 A nucleotides" BASE COUNT 426 a 680 c 603 g 460 t ORIGIN 1 ccgcctccga gtgccttgcg cggacctgag ctggagatgc tggccgggct accgacgtca 61 gaccccgggc gcctcatcac ggacccgcgc agcggccgca cctacctcaa aggccgcttg 121 ttgggcaagg ggggcttcgc ccgctgctac gaggccactg acacagagac tggcagcgcc 181 tacgctgtca aagtcatccc gcagagccgc gtcgccaagc cgcatcagcg cgagaagatc 241 ctaaatgaga ttgagctgca ccgagacctg cagcaccgcc acatcgtgcg tttttcgcac 301 cactttgagg acgctgacaa catctacatt ttcttggagc tctgcagccg aaagtccctg 361 gcccacatct ggaaggcccg gcacaccctg ttggagccag aagtgcgcta ctacctgcgg 421 cagatccttt ctggcctcaa gtacttgcac cagcgcggca tcttgcaccg ggacctcaag 481 ttgggaaatt ttttcatcac tgagaacatg gaactgaagg tgggggattt tgggctggca 541 gcccggttgg agcctccgga gcagaggaag aagaccatct gtggcacccc caactatgtg 601 gctccagaag tgctgctgag acagggccac ggccctgaag cggatgtatg gtcactgggc 661 tgtgtcatgt acacgctgct ctgcgggagc cctccctttg agacggctga cctgaaggag 721 acgtaccgct gcatcaagca ggttcactac acgctgcctg ccagcctctc actgcctgcc 781 cggcagctcc tggccgccat ccttcgggcc tcaccccgag accgcccctc tattgaccag 841 atcctgcgcc atgacttctt taccaagggc tacacccccg atcgactccc tatcagcagc 901 tgcgtgacag tcccagacct gacacccccc aacccagcta ggagtctgtt tgccaaagtt 961 accaagagcc tctttggcag aaagaagaag agtaagaatc atgcccagga gagggatgag 1021 gtctccggtt tggtgagcgg cctcatgcgc acatccgttg gccatcagga tgccaggcca 1081 gaggctccag cagcttctgg cccagcccct gtcagcctgg tagagacagc acctgaagac 1141 agctcacccc gtgggacact ggcaagcagt ggagatggat ttgaagaagg tctgactgtg 1201 gccacagtag tggagtcagc cctttgtgct ctgagaaatt gtatagcttt catgccccca 1261 gcggaacaga acccggcccc cctggcccag ccagagcctc tggtgtgggt cagcaagtgg 1321 gttgactact ccaataagtt cggctttggg tatcaactgt ccagccgccg tgtggctgtg 1381 ctcttcaacg atggcacaca tatggccctg tcggccaaca gaaagactgt gcactacaat 1441 cccaccagca caaagcactt ctccttctcc gtgggtgctg tgccccgggc cctgcagcct 1501 cagctgggta tcctgcggta cttcgcctcc tacatggagc agcacctcat gaagggtgga 1561 gatctgccca gtgtggaaga ggtagaggta cctgctccgc ccttgctgct gcagtgggtc 1621 aagacggatc aggctctcct catgctgttt agtgatggca ctgtccaggt gaacttctac 1681 ggggaccaca ccaagctgat tctcagtggc tgggagcccc tccttgtgac ttttgtggcc 1741 cgaaatcgta gtgcttgtac ttacctcgct tcccaccttc ggcagctggg ctgctctcca 1801 gacctgcggc agcgactccg ctatgctctg cgcctgctcc gggaccgcag cccagcttag 1861 gacccaagcc ctgaaggcct gaggcctgtg cctgtcaggc tctggccctt gcctttgtgg 1921 ccttccccct tcctttggtg cctcactggg ggctttgggc cgaatccccc agggaatcag 1981 ggaccagctt tactggagtt gggggcggct tgtcttcgct ggctcctacc ccatctccaa 2041 gataagcctg agccttagct cccagctagg gggcgttatt tatggaccac ttttatttat 2101 tgtcagacac ttatttattg ggatgtgagc cccagggggc ctcctcctag gataataaac 2161 aattttgca // LOCUS HSU56K21 2233 bp mRNA PRI 03-FEB-1998 DEFINITION Homo sapiens partial cDNA homologous to M.musculus JIP-1 gene. ACCESSION AL021708 NID g2832712 KEYWORDS JIP-1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2233) AUTHORS Smink,L.J. and Burton,J. TITLE Direct Submission JOURNAL Submitted (20-JAN-1998) E-mail contact: humquery@sanger.ac.uk Clone requests:clonerequest@sanger.ac.uk COMMENT This sequence was generated from a cDNA clone isolated using sequence from the BAC clone CIT987SK-384D8 sequenced by The Institute for Genomic Research(U62317). Exon numbers stated derived by comparison to M.musculus JIP-1 gene sequence : AF003115. Further information can be found at http://www.sanger.ac.uk/HGP/Chr22/ All matches to EMBL sequences shown 90% or more. FEATURES Location/Qualifiers source 1..2233 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="22" /tissue_type="monocyte" /map="q13" /clone="U56K21" exon <1..903 /number=5 mRNA <1..>2233 /product="hypothetical protein" CDS 209..1612 /note="unnamed protein product" /codon_start=1 /db_xref="PID:e1249595" /db_xref="PID:g2832713" /translation="MISEGSSPIRCPGQCLSPAPRPPGEPVSPAGGAAQDSQDPEAAA GPGGVELVDMETLCAPPPPAPAAPRPGPAQPGPCLFLSNPTRDTITPLWAAPGRAARP GRACSAACSEEEDEEDDEEEEDAEDSAGSPGGRGTGPSAPRDASLVYDAVKYTLVVDE HTQLELVSLRRCAGLGHDSEEDSGGEASEEEAGAALLGGGQVSGDTSPDSPDLTFSKK FLNVFVNSTSRSSSTESFGLFSCLVNGEEREQTHRAVFRFIPRHPDELELDVDDPVLV EAEEDDFWFRGFNMRTGERGVFPAFYAHAVPGPAKDLLGSKRSPCWVERFDVQFLGSV EVPCHQGNGILCAAMQKIATARKLTVHLRPPASCDLEISLRGVKLSLSGGGPEFQRCS HFFQMKNISFCGCHPRNSCYFGFITKHPLLSRFACHVFVSQESMRPVAQSVGRAFLEY YQEHLAYACPTEDIYLE" misc_feature 542..574 /note="match: 5' EST W41873 clone 353471" misc_feature join(816..833,830..1167,1166..1220) /note="match: 5' EST T32933 clone 183939" misc_feature 900..1146 /note="match: 5' EST Z43824 clone c-1lg04" exon 904..979 /number=6 misc_feature join(930..948,959..986,986..1091,1094..1166,1164..1281) /note="match: 5' EST T78842 clone 23959" exon 980..1152 /number=7 misc_feature 1004..1316 /note="match: 5' EST AA324906 clone 192685" exon 1153..1262 /number=8 exon 1263..1370 /number=9 misc_feature join(1268..1508,1503..1553) /note="match: 5' EST AA323678 clone 281387" misc_feature join(1311..1549,1546..1587,1582..1638) /note="match: 5' EST H38480 clone 192685" misc_feature 1321..1508 /note="match: 5' EST R87578 clone 166137" misc_feature join(1322..1549,1546..1660,1658..1727) /note="match: 5' EST H30298 clone 183939" misc_feature join(1352..1469,1465..1525,1526..1549,1546..1570, 1581..1618,1618..1646,1680..1727) /note="match: 5' EST AA505001 clone 839760" misc_feature join(1363..1389,1388..1525,1546..1587,1582..1667) /note="match: 5' EST N53101 clone 281387" misc_feature join(1365..1587,1582..1627) /note="match: 5' EST AA323104 clone 281387" misc_feature join(1365..1607,1606..1634) /note="match: 5' EST AA323084 clone 281387" misc_feature join(1365..1587,1582..1666) /note="match: 5' EST AA322904 clone 839760" misc_feature 1368..1442 /note="match: 3' EST AA666239 clone 859098" exon 1371..1441 /number=10 misc_feature join(1399..1574,1573..1680) /note="match: 5' EST AA323870 clone 281387" exon 1442..1540 /number=11 misc_feature complement(join(1478..1589,1386..1478)) /note="match: 3' EST AA351451 clone IMAGE:964182" exon 1541..>2233 /number=12 misc_feature 1626..1842 /note="match: 3' EST AA324271 clone IMAGE:964182" misc_feature 1743..2063 /note="match: 5' EST R76018 clone 159021" misc_feature join(1743..1969,1986..2060) /note="match: 5' EST R83813 clone 186661" misc_feature join(1747..2095,2093..2116) /note="match: 5' EST R22305 clone 130843" misc_feature complement(2017..2223) /note="match: 3' EST AA507924 clone IMAGE:964182" misc_feature 2051..2102 /note="match: 5' EST AA033504 clone 466371" misc_feature complement(join(2109..2178,1974..2009)) /note="match: 3' EST R88296 clone 166137" misc_feature complement(join(2109..2233,1986..2009)) /note="match: 3' EST H38446 clone 192685" misc_feature complement(2109..2178) /note="match: 3' EST R22306 clone 130843; match: 3' EST Z39890 clone c-1lg04" misc_feature complement(join(2109..2178,1992..2018)) /note="match: 3' EST H41254 clone 192535" misc_feature complement(2124..2178) /note="match: 3' EST T16799 clone c-1lg04" BASE COUNT 339 a 762 c 723 g 409 t ORIGIN 1 ccgctcctcg cacctcacca actccatcga ggaggcctcg tcgcccgcct cggagccgga 61 gcccccgcgc gaacccccgc gccgccccgc cttcctgccc gtgggccccg acgacaccaa 121 cagcgagtac gagtcggggt cggagtcgga gccggacctc agcgaggacg cggactcgcc 181 ctggctgctc agcaacctgg tgagccgcat gatctccgag ggctcctcgc ccatccgctg 241 ccccggccag tgcctgtctc ctgcgccgcg cccgcccggg gagcccgtgt cgccggccgg 301 cggggccgcc caggactccc aggaccccga ggcggccgcg gggcccggcg gcgtggagct 361 ggtggacatg gagacgctgt gcgcgccgcc gccgcccgcg cccgccgcgc ctcgacccgg 421 ccccgcgcag cccgggcctt gcctattcct cagcaacccc acgcgtgaca ccatcacgcc 481 gctgtgggcc gcgcccggcc gcgccgcccg cccgggacga gcctgctccg ccgcctgctc 541 cgaggaggag gacgaagagg acgacgagga agaggaggat gccgaggaca gtgcggggtc 601 ccccgggggc aggggcacgg gcccctcggc gccgcgggac gcgtcgctgg tgtacgacgc 661 ggtcaagtac acgctggtgg tggatgaaca cacgcagctg gagctggtga gcctgcggcg 721 ctgtgctggg ctgggccacg acagcgaaga ggacagcggc ggggaggcca gcgaggagga 781 ggcgggcgcg gcgctgctag gcggcggtca ggtctcgggg gacacctcgc cggacagccc 841 tgacctcact ttctccaaga agttcctcaa tgtcttcgtc aacagcacat ctcggtcctc 901 cagcaccgag tcctttggcc ttttctcctg tctggtcaac ggcgaggagc gagagcagac 961 tcaccgggct gtgttcaggt tcatcccgcg gcatccagac gagctggagc tggatgtgga 1021 tgaccctgtg ttggtggagg ccgaggagga cgacttctgg ttccgtggct tcaacatgcg 1081 cacgggggag cgcggtgtgt ttcctgcctt ctacgcccat gcggtgcccg gccctgccaa 1141 ggacctgctg gggagtaagc ggagcccctg ctgggtggag cgctttgacg tgcagttcct 1201 gggctccgtg gaggtgccct gccaccaggg caacggcatc ctgtgtgcag ccatgcagaa 1261 gattgccact gcccggaaac tgaccgtcca cctgcgccct cctgcctcct gtgacctcga 1321 gatctctctt cggggggtca agctgagtct gagcggagga gggcctgagt tccagcgctg 1381 cagccatttc ttccagatga agaacatctc cttctgcggc tgccatcccc gcaacagctg 1441 ctatttcggc ttcatcacca aacaccccct gctgagccgc ttcgcctgcc acgtctttgt 1501 ctcccaggag tccatgaggc cggtggcgca gagtgtgggc cgcgccttcc tggagtacta 1561 ccaagagcac ctggcgtacg cctgccccac ggaggacatc tacctggagt agcctgccca 1621 ccgctctgtc tcctggccgt ctgctccagg cgcagctggg gctgaggatg tctcaagagg 1681 ccaccatggc tttggcaagg actggattgg ggggacatgg gaccttacgc ttgtgggggt 1741 ctgcgggctg ggaactctcg tcctcggtcc ccagggcgca gctgttgggg ctgcggggga 1801 gtggagcccc cgtgcccctg cttttcctca gatccgttct ttctctgtgt tgtcctcctc 1861 cttcccttcc cagtctccct tttctctctc ctggtgtctc tgcccctctc tgtgcctgca 1921 aaccttatcc tctattcttc actttggggt cagaacttcg ggggtgtgga ggaggtgcgg 1981 ctctagggac aggtaatgtc ggcttccaag tgatgccctc ctgccactcc cctgcgattt 2041 ataagggaat tttaattttt atagtgaatc tcaagggatc atcccatcct tgaccacagg 2101 gacaagacgg gccccctcgc cccagcccca ccccacatgg agctcagggg gaagcaggga 2161 ggggttccca aaagagccct gcggaggcta ggagtggttc ttgatgctca cctgaagccc 2221 ctagacgctg cta // LOCUS HSU57029 1429 bp mRNA PRI 07-JUL-1996 DEFINITION Human T-cell leukemia virus enhancer factor (HTLF) mRNA, complete cds. ACCESSION U57029 M94653 NID g1405357 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1429) AUTHORS Li,C., Lusis,A.J., Sparkes,R., Tran,S.M. and Gaynor,R. TITLE Characterization and chromosomal mapping of the gene encoding the cellular DNA binding protein HTLF JOURNAL Genomics 13 (3), 658-664 (1992) MEDLINE 92347862 REFERENCE 2 (bases 1 to 1429) AUTHORS Li,C., Nirula,A. and Gaynor,R.B. TITLE Direct Submission JOURNAL Submitted (26-APR-1996) C. Li, Internal Medicine, UT Southwestern Medical Center, 5323 Harry Hines Blvd., Dallas, TX 75235-8594, USA FEATURES Location/Qualifiers source 1..1429 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Jurkat lambda gt11 cDNA expression library" /chromosome="2" /map="2p16-22" gene 282..1307 /gene="HTLF" CDS 282..1307 /gene="HTLF" /note="T-cell leukemia virus factor; DNA-binding protein; transcription factor; contains a winged helix DNA-binding domain" /codon_start=1 /product="HTLF" /db_xref="PID:g1405358" /translation="MMCDPLDQLATRTQKKKSATSKPPYSFSLLIYMAIEHSPNKCLP VKEIYSWILDHFPYFATAPTGWKNSVRHNLSLNKCFQKVERSHGKVNGKGSLWCVDPE YKPNLIQALKKQPFSSASSQNGSLSPHYLSSVIKQNQVRNLKESDIDAAAAMMLLNTS IEQGILECEKPLPLKTALQKKRSYGNAFHHPSAVRLQESDSLATSIDPKEDHNYSASS MAAQRCASRSSVSSLSSVDEVYEFIPKNSHVGSDGSEGFHSEEDTDVDYEDDPLGDSG YASQPCAKISEKGQSGKKMRKQTCQEIDEELKEAAGSLLHLAGIRTCLGSLISTAKTQ NQKQRKK" misc_feature 318..620 /gene="HTLF" /note="encodes forkhead DNA binding domain" BASE COUNT 461 a 285 c 306 g 377 t ORIGIN 1 gaattcgcgg ccgcgtccag taattggaat gactccagat aagagagctg aaaccccagg 61 agctgaaaag attgcaggat taagccagat ttacaaaatg ggaagcttgc ctgaagctgt 121 tgatgctgcc aggccgaagg ccactctagt ggacagtgag tcagcagatg atgaactcac 181 aaacttgaac tggcttcatg aaagcactaa tcttctaaca aacttcagcc tgggaagtga 241 gggtcttcca attgttagtc cattgtatga catagaggga gatgatgtgc gatcctttgg 301 accagcttgc taccagaacc cagaaaaaaa aatcagcgac ttcaaagccc ccatactcct 361 ttagtcttct catttatatg gccattgagc actctccaaa taaatgtttg cctgtcaaag 421 aaatttatag ctggattctg gaccattttc catattttgc tactgcacca acaggctgga 481 agaattctgt tcgacataat ctgtccctga ataaatgttt tcagaaagtg gaaagaagcc 541 atggcaaggt taatggaaaa ggttccttat ggtgtgttga tccggaatat aaacccaatc 601 ttatccaggc actgaagaag caaccttttt cttcagcatc ttcacaaaat ggttctttat 661 cacctcacta tttaagctct gtaatcaagc agaaccaggt gcgaaacctc aaagaatctg 721 atattgatgc tgctgctgca atgatgcttt taaatacttc tatagaacaa ggaattttag 781 aatgtgagaa gcctcttcct cttaaaacag cattgcaaaa aaagaggagt tacggcaatg 841 catttcatca tcccagtgct gtacgattac aagagagtga ttctttagcc accagcattg 901 atccaaaaga agatcacaat tacagtgcaa gtagcatggc agcacagcgt tgtgcatcca 961 ggtctagcgt gtcttccctg tcttctgtgg atgaggtata tgaatttatc ccaaagaata 1021 gtcacgtggg aagtgatggc agtgaaggat ttcacagtga agaagataca gacgttgatt 1081 atgaagatga tcctcttgga gacagtggct atgcatcaca gccttgtgca aaaatctctg 1141 aaaaagggca gtcaggcaaa aagatgcgaa aacagacatg tcaagaaatt gatgaggagc 1201 tcaaagaggc agctggatct ctgctccacc ttgctggaat tcgtacatgt ttaggttccc 1261 taataagtac tgcaaagaca caaaatcaaa agcaacggaa aaaatagaaa tacttaaagt 1321 gtggcaatac tctttcactt aattctttac aagggatatc aaagccataa tggacttcat 1381 tagttttagg gtagggaagg gatactaatt acttatttct ttcaaaaca // LOCUS HSU57052 1026 bp mRNA PRI 05-SEP-1996 DEFINITION Human Hoxb-13 mRNA, complete cds. ACCESSION U57052 NID g1519039 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1026) AUTHORS Zeltser,L., Desplan,C. and Heintz,N. TITLE Hoxb-13: a new Hox gene in a distant region of the HOXB cluster maintains colinearity JOURNAL Development 122 (8), 2475-2484 (1996) MEDLINE 96312926 REFERENCE 2 (bases 1 to 1026) AUTHORS Zeltser,L.M. TITLE Direct Submission JOURNAL Submitted (29-APR-1996) Lori M. Zeltser, Heintz Lab, Rockefeller University, 1230 York Avenue, New York, NY 10021, USA FEATURES Location/Qualifiers source 1..1026 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="HeLa" CDS 55..909 /codon_start=1 /product="Hoxb-13" /db_xref="PID:g1519040" /translation="MEPGNYATLDGAKDIEGLLGAGGGRNLVAHSPLTSHPAAPTLMP AVNYAPLDLPGSAEPPKQCHPCPGVPQGTSPAPVPYGYFGGGYYSCRVSRSSLKPCAQ AATLAAYPAETPTAGEEYPSRPTEFAFYPGYPGTYHAMASYLDVSVVQTLGAPGEPRH DSLLPVDSYQSWALAGGWNSQMCCQGEQNPPGPFWKAAFADSSGQHPPDASAFRRGRK KRIPYSKGQLRELEREYAANKFITKDKRRKISAATSLSERQITIWFQNRRVKEKKVLA KVKNSATP" misc_feature 699..879 /note="homeobox" BASE COUNT 203 a 333 c 312 g 178 t ORIGIN 1 cgggtgcccc ctagattccc cgcccccgca cctcatgagc cgaccctcgg ctccatggag 61 cccggcaatt atgccacctt ggatggagcc aaggatatcg aaggcttgct gggagcggga 121 ggggggcgga atctggtcgc ccactcccct ctgaccagcc acccagcggc gcctacgctg 181 atgcctgctg tcaactatgc ccccttggat ctgccaggct cggcggagcc gccaaagcaa 241 tgccacccat gccctggggt gccccagggg acgtccccag ctcccgtgcc ttatggttac 301 tttggaggcg ggtactactc ctgccgagtg tcccggagct cgctgaaacc ctgtgcccag 361 gcagccaccc tggccgcgta ccccgcggag actcccacgg ccggggaaga gtaccccagc 421 cgccccactg agtttgcctt ctatccggga tatccgggaa cctaccacgc tatggccagt 481 tacctggacg tgtctgtggt gcagactctg ggtgctcctg gagaaccgcg acatgactcc 541 ctgttgcctg tggacagtta ccagtcttgg gctctcgctg gtggctggaa cagccagatg 601 tgttgccagg gagaacagaa cccaccaggt cccttttgga aggcagcatt tgcagactcc 661 agcgggcagc accctcctga cgcctccgcc tttcgtcgcg gccgcaagaa acgcattccg 721 tacagcaagg ggcagttgcg ggagctggag cgggagtatg cggctaacaa gttcatcacc 781 aaggacaaga ggcgcaagat ctcggcagcc accagcctct cggagcgcca gattaccatc 841 tggtttcaga accgccgggt caaagagaag aaggttctcg ccaaggtgaa gaacagcgct 901 accccttaag agatctcctt gcctgggtgg gaggagcgaa agtgggggtg tcctggggag 961 accaggaacc tgccaagccc aggctggggc caaggactct gctgagaggc ccctagagac 1021 aacacc // LOCUS HSU57057 2430 bp RNA PRI 21-FEB-1997 DEFINITION Human WD protein IR10 mRNA, complete cds. ACCESSION U57057 NID g1654312 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2430) AUTHORS Zaphiropoulos,P.G. and Toftgard,R. TITLE cDNA cloning of a novel WD repeat protein mapping to the 9q22.3 chromosomal region JOURNAL DNA Cell Biol. 15 (12), 1049-1056 (1996) MEDLINE 97138092 REFERENCE 2 (bases 1 to 2430) AUTHORS Zaphiropoulos,P. TITLE Direct Submission JOURNAL Submitted (30-APR-1996) Center for Nutrition and Toxicology, Karolinska Institute, Huddinge, 141 57, Sweden FEATURES Location/Qualifiers source 1..2430 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="9" /map="9q22.3" /tissue_type="epidermis" /dev_stage="adult" CDS 267..1844 /codon_start=1 /product="WD protein IR10" /db_xref="PID:g1654313" /translation="MSWHPQYRSSKFRHVFGKPASKENCYDSVPITRSVHDNHFCAVN PHFIAVVTECAGGGGFLVIPLHQTGKLDPHYPKVCGHRGNVLDVKWNPFDDFEIASCS EDATIKIWSIPKQLLTRNLTAYRKELVGHARRVGLVEWHPTAANILFSAGYDYKVMIW NLDTKESVITSPMSTISCHQDVILSMSFNTNGSLLATTCKDRKIRVIDPRAGTVLQEA SYKGHRASKVLFLGNLKKLMSTGTSRWNNRQVALWDQDNLSVPLMEEDLDGSSGVLFP FYDADTSMLYVVGKGDGNIRYYEVSADKPHLSYLTEYRSYNPQKGIGVMPKRGLDVSS CEIFRFYKLITTKSLIEPISMIVPRGSESYQEDIYPPTGGAQPSLTAQEWLSGMNRDP ILVSLRPGSELLRPHPLPAERPIFNSMAPASPRLLNQTEKLAAEDGWRSSSLLEEKMP RWAAEHRLEEKKTWLTNGFDVFECPPPKTENELLQMFYRQQEEIRRLRELLTQREVQA KQLELEIKNLRMGSEQL" BASE COUNT 593 a 681 c 648 g 508 t ORIGIN 1 cacaggggca ctgacatctt tgctggactc ttttaatcct cataaaccca ggctgctggc 61 actcgaaatt tgagaagggc aatcagtctg gcctcttgcc tttctggcct gcaggcccag 121 cctgaacaaa caatttaggt ggagctcctg gaggctggaa gcagagagga ggtacctcac 181 catcctctcc tgccagctgt tgcagactct gcctggggaa ctgtttaagg gtgagcagag 241 gcatttatct ccatgttaac aagcagatgt catggcaccc ccagtaccgg agctccaagt 301 tccgtcatgt ctttggcaaa ccagccagca aggagaactg ctacgactcc gtgcctatca 361 cccgcagcgt tcacgacaac cacttctgtg ccgtgaaccc ccacttcatt gcagttgtga 421 ctgagtgtgc tggtggaggg ggcttcctcg tcatccccct gcaccagaca gggaagttgg 481 acccccacta cccaaaagtc tgcgggcaca gaggcaacgt tttggatgtc aagtggaacc 541 cttttgatga ttttgagatc gcctcctgtt ctgaagatgc cacaattaag atctggagca 601 tccccaagca gctgctgacc aggaacctca cggcctacag gaaggaactc gtgggccacg 661 cgcgcagagt aggcctggtg gagtggcacc ccacggccgc caacatcctc ttcagtgctg 721 gctatgacta caaggtgatg atctggaacc tggatacaaa ggagtctgtc atcacaagcc 781 ccatgagtac gattagctgt caccaagatg tgatcctctc catgtccttc aacaccaacg 841 ggagcctgtt ggccaccacc tgcaaagacc gcaagattcg ggttattgac ccccgagcag 901 ggaccgtcct ccaggaggcc agctacaaag ggcaccgggc cagcaaagtg ctgtttctgg 961 ggaacctgaa gaagctgatg tccacaggca catcccgatg gaacaaccgg caggtggcct 1021 tgtgggacca ggataacctc tctgtgcctc tgatggagga ggacctggac ggctcctcgg 1081 gcgtgctgtt tcccttctat gacgcggaca ccagcatgct ctacgtggtg gggaagggag 1141 atggcaacat ccgctactac gaggtgagcg ccgacaagcc tcacctgagc tacctgactg 1201 agtaccgctc ctataaccca cagaagggga tcggtgtcat gccaaagaga ggactcgacg 1261 tgtcctcctg cgagatcttc cgcttctaca agctgatcac aaccaaaagc ctcatcgagc 1321 ccatctccat gattgtgccc cgggggtcag aatcctacca agaggacata taccctccaa 1381 caggaggggc ccagccctcc ctgacggccc aggagtggct cagcgggatg aatcgagacc 1441 caatcctggt gtcccttagg cctggctctg agctgctgag accccaccca ctgcctgcag 1501 agagacctat cttcaattcc atggccccag cctcaccccg gctcttgaat cagacagaaa 1561 agctggctgc agaagatggc tggaggtctt cctccctgtt ggaggagaag atgccaaggt 1621 gggcagcaga acacaggctg gaggagaaga aaacctggct gacaaatggc tttgacgttt 1681 tcgaatgccc cccaccaaag acagagaatg agttgctgca gatgttctac cggcaacagg 1741 aggagatccg aaggctccgg gagctgttga cccagcgaga ggtccaggcc aaacagttgg 1801 aactggagat caaaaacttg cggatgggct cagagcagct ctgagcagag acctctgccc 1861 tcctcaccct cagggacacc actcggctcc atggggaggt ttagaaccaa accacaagtc 1921 ccctcaagga caaccactat ttctatattt tttaccagaa aaccaaactc tccatcgctg 1981 aaagagattc cagtgggaca tggtgccgtt tttctgtttg ccttcttgca acaacagttt 2041 ctgaattgac tttgttttca gatgatgcct tctgatgaat tctgttatta agggcccatg 2101 atgagctgta acttctcaag aggaaagaac acagtagaaa actagagctg gaaggatcta 2161 ggttgacctg tctgtgattt tcaaccctgt ggtccagaga atagggaaga ctggccagac 2221 gcactggctc atgcctgtaa tcccagcact ctgggaggct gaggtgggcg gatttcttga 2281 ggtcgggagt ttgaaaccag tctcaccagg cgcagtggct cacgcctgta attccagcat 2341 gttgggaggc cgaggctggt ggatcacgag gtcagcagtt cgagaccagc ctgaccaaca 2401 tggtgaaacc ccatctctac taaaaatccc // LOCUS HSU57091 899 bp mRNA PRI 24-OCT-1996 DEFINITION Human small GTP-binding protein rab22b mRNA, complete cds. ACCESSION U57091 NID g1457953 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 899) AUTHORS Chen,D., Guo,J., Miki,T., Tachibana,M. and Gahl,W.A. TITLE Molecular cloning of two novel rab genes from human melanocytes JOURNAL Gene 174 (1), 129-134 (1996) MEDLINE 97017138 REFERENCE 2 (bases 1 to 899) AUTHORS Chen,D. and Gahl,W.A. TITLE Direct Submission JOURNAL Submitted (01-MAY-1996) Heritable Disorders Branch, NICHD/NIH, Building 10/Room 9S242, 10 Center Drive, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..899 /organism="Homo sapiens" /note="human melanocyte" /db_xref="taxon:9606" /chromosome="18" CDS 46..630 /codon_start=1 /product="Rab22b" /db_xref="PID:g1457954" /translation="MAIRELKVCLLGDTGVGKSSIVCRFVQDHFDHNISPTIGASFMT KTVPCGNELHKFLIWDTAGQERFHSLAPMYYRGSAAAVIVYDITKQDSFYTLKKWVKE LKEHGPENIVMAIAGNKCDLSDIREVPLKDAKEYAESIGAIVVETSAKNAINIEELFQ GISRQIPPLDPHENGNNGTIKVEKPTMQSSRRCC" BASE COUNT 252 a 216 c 229 g 202 t ORIGIN 1 cgaggatgct gctgagcccc ggcactgcct ggctgcgagc acatgatggc gatacgggag 61 ctcaaagtgt gccttctcgg ggacactggg gttgggaaat caagcatcgt gtgtcgattt 121 gtccaggatc actttgacca caacatcagc cctactattg gggcatcttt tatgaccaaa 181 actgtgcctt gtggaaatga acttcacaag ttcctcatct gggacactgc tggtcaggaa 241 cggtttcatt cattggctcc catgtactat cgaggctcag ctgcagctgt tatcgtgtat 301 gatattacca agcaggattc attttatacc ttgaagaaat gggtcaagga gctgaaagaa 361 catggtccag aaaacattgt aatggccatc gctggaaaca agtgcgacct ctcagatatt 421 agggaggttc ccctgaagga tgctaaggaa tacgctgaat ccataggtgc catcgtggtt 481 gagacaagtg caaaaaatgc tattaatatc gaagagctct ttcaaggaat cagccgccag 541 atcccaccct tggaccccca tgaaaatgga aacaatggaa caatcaaagt tgagaagcca 601 accatgcaat ccagccgccg gtgctgttga cccaagggcc gtggtccacg tacttgaaga 661 agccagagcc cacatcctgt gcactgctga aggaccctac gctcggtggc ctggcacctc 721 actttgagaa gagtgagcac actggctttg catcctggaa gacctgcagg ggcgggcagg 781 aaatgtacct gaaaaggatt ttagaaaacc ctggaaaacc caccacacca ccaccacaaa 841 atggccttta gtgtatgaaa tgcacatgga ggggatgtag ttgcattttt gctaaaaaa // LOCUS HSU57092 828 bp mRNA PRI 24-OCT-1996 DEFINITION Human small GTP-binding protein rab30. ACCESSION U57092 NID g1457955 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 828) AUTHORS Chen,D., Guo,J., Miki,T., Tachibana,M. and Gahl,W.A. TITLE Molecular cloning of two novel rab genes from human melanocytes JOURNAL Gene 174 (1), 129-134 (1996) MEDLINE 97017138 REFERENCE 2 (bases 1 to 828) AUTHORS Chen,D. and Gahl,W.A. TITLE Direct Submission JOURNAL Submitted (01-MAY-1996) Heritable Disorders Branch, NICHD/NIH, Building 10/Room 9S242, 10 Center Drive, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..828 /organism="Homo sapiens" /note="human melanoma" /db_xref="taxon:9606" /chromosome="11" CDS 19..630 /codon_start=1 /product="Rab30" /db_xref="PID:g1457956" /translation="MSMEDYDFLFKIVLIGNAGVGKTCLVRRFTQGLFPPGQGATIGV GFMIKTVEINGEKVKLQIWDTAGQERFRSITQSYYRSANALILTYDITCEESFRCLPE WLREIEQYASNKVITVLVGNKIDLAERREVSQQRAEEFSEAQDMYYLETSAKESDNVE KLFLDLACRLISEARQNTLVNNVSSPLPGEGKSISYLTCCNFN" BASE COUNT 234 a 181 c 211 g 202 t ORIGIN 1 aaaattgaag ctgtgtaaat gagtatggaa gattatgatt tcctgttcaa aattgtttta 61 attggcaacg ctggtgtggg gaagacgtgc ctcgtccgaa gattcactca gggtcttttc 121 cccccaggtc aaggagccac aattggagtt ggttttatga ttaagacagt ggagattaat 181 ggtgaaaaag taaagctaca gatctgggac acagcaggtc aagagagatt tcggtccatt 241 acccagagtt actaccgaag cgccaatgcc ttgatcctca cctatgacat tacctgtgag 301 gaatccttcc gttgccttcc tgagtggctg cgggagatag aacaatatgc cagcaacaag 361 gtcatcactg tgttagtggg caacaagatt gacctggctg aaaggagaga ggtttcccag 421 cagcgagctg aagaattctc agaagctcag gacatgtatt atctggagac ctcagccaag 481 gaatctgata atgtggagaa actcttcctt gacttagcat gccgactcat cagtgaagcc 541 agacagaaca cacttgtgaa caatgtatcc tcacccttac ctggagaagg gaaaagcatc 601 agctatttga cttgttgtaa tttcaactaa aggctgaggc acggagaaga aaaggaatca 661 gcaactgccc tgatgcggca atgagatgct ggggagatct ggcgatgact gtggctcccg 721 ctctctgtcc ttctgactcc tgtgggctcc tgagcttaca aagcatggca ggccaagggc 781 tcgaccacag gccagcatta gcagaacata taatggtttc accctttt // LOCUS HSU57093 1042 bp mRNA PRI 10-APR-1997 DEFINITION Human small GTP-binding protein rab27b mRNA, complete cds. ACCESSION U57093 NID g1931574 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1042) AUTHORS Chen,D., Guo,J., Miki,T., Tachibana,M. and Gahl,W.A. TITLE Molecular cloning and characterization of rab27a and rab27b, novel human rab proteins shared by melanocytes and platelets JOURNAL Biochem. Mol. Med. 60 (1), 27-37 (1997) MEDLINE 97219695 REFERENCE 2 (bases 1 to 1042) AUTHORS Chen,D. and Gahl,W.A. TITLE Direct Submission JOURNAL Submitted (01-MAY-1996) Heritable Disorders Branch, NICHD/NIH, Building 10/Room 9S242, 10 Center Drive, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..1042 /organism="Homo sapiens" /note="Human melanoma" /db_xref="taxon:9606" CDS 93..749 /codon_start=1 /product="Rab27b" /db_xref="PID:g1931575" /translation="MTDGDYDYLIKLLALGDSGVGKTTFLYRYTDNKFNPKFITTVGI DFREKRVVYNAQGPNGSSGKAFKVHLQLWDTAGQERFRSLTTAFFRDAMGFLLMFDLT SQQSFLNVRNWMSQLQANAYCENPDIVLIGNKADLPDQREVNERQARELADKYGIPYF ETSAATGQNVEKAVETLLDLIMKRMEQCVEKTQIPDTVNGGNSGNWDGEKPPEKKCIC " BASE COUNT 354 a 208 c 223 g 257 t ORIGIN 1 ggcttgggaa ggggaaggaa acttctctga aatctgaaca cctgctctcc cggcaaggaa 61 acttcgaagg ctgaccgacc aagaccatca ctatgaccga tggagactat gattatctga 121 tcaaactcct ggccctcggg gattcagggg tggggaagac aacatttctt tatagataca 181 cagataataa attcaatccc aaattcatca ctacagtagg aatagacttt cgggaaaaac 241 gtgtggttta taatgcacaa ggaccgaatg gatcttcagg gaaagcattt aaagtgcatc 301 ttcagctttg ggacactgcg ggacaagagc ggttccggag tctcaccact gcatttttca 361 gagacgccat gggcttctta ttaatgtttg acctcaccag tcaacagagc ttcttaaatg 421 tcagaaactg gatgagccaa ctgcaagcaa atgcttattg tgaaaatcca gatatagtat 481 taattggcaa caaggcagac ctaccagatc agagggaagt caatgaacgg caagctcggg 541 aactggctga caaatatggc ataccatatt ttgaaacaag tgcagcaact ggacagaatg 601 tggagaaagc tgtagaaacc cttttggact taatcatgaa gcgaatggaa cagtgtgtgg 661 agaagacaca aatccctgat actgtcaatg gtggaaattc tggaaactgg gatggggaaa 721 agccaccaga gaagaaatgt atctgctaga ctctacatag aaactgaaca tcaagaaccc 781 caccaaaata ttacttttaa aacaatgaca aaccacacaa ttgttgttga gtaaaccacg 841 cacaatggca tgtctttctt tttctgccag aaaatctatt ttaagaaacc agaatagtca 901 acagtgttca aaagaattga ctagttatcc ctgaggccct ttcaaacatg atcaaagatt 961 tcccaatgtg atctcatcat catggatact caatttgttt tttcttatag agaaaatgag 1021 tatatagaca tatacagaga at // LOCUS HSU57099 1271 bp mRNA PRI 25-JUL-1996 DEFINITION Human APEG-1 mRNA, complete cds. ACCESSION U57099 NID g1457947 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1271) AUTHORS Hsieh,C.M., Yoshizumi,M., Endege,W.O., Kho,C.J., Jain,M.K., Kashiki,S., de los Santos,R., Lee,W.S., Perrella,M.A. and Lee,M.E. TITLE APEG-1, a novel gene preferentially expressed in aortic smooth muscle cells, is down-regulated by vascular injury JOURNAL J. Biol. Chem. 271 (29), 17354-17359 (1996) MEDLINE 96291890 REFERENCE 2 (bases 1 to 1271) AUTHORS Hsieh,C.-M., Yoshizumi,M., Endege,W.O., Jain,M.K., Kashiki,S., Hong,A.M., de los Santos,R., Lee,W.-S., Perrella,M.A. and Lee,M.-E. TITLE Direct Submission JOURNAL Submitted (30-APR-1996) Chung-Ming Hsieh, Cardiovascular Biology Laboratory, Harvard School of Public Health, 665 Huntington Avenue, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..1271 /organism="Homo sapiens" /db_xref="taxon:9606" gene 126..467 /gene="APEG-1" CDS 126..467 /gene="APEG-1" /codon_start=1 /product="APEG-1 protein" /db_xref="PID:g1457948" /translation="MKPSPSQNRRSSDTGSKAPPTFKVSLMDQSVREGQDVIMSIRVQ GEPKPVVSWLRNRQPVRPDQRRFAEEAEGGLCRLRILAAERGDAGFYTCKAVNEYGAR QCEARLEVRGE" BASE COUNT 259 a 384 c 391 g 237 t ORIGIN 1 cactgaccgt gagacccggt gggtctacat ccccttttag cagccccatc acctccgacg 61 aggaatacct gagcccccca gaggagttcc cagagcctgg ggagacctgg ccgcgaaccc 121 ccaccatgaa gcccagtccc agccagaacc gccgttcttc tgacactggc tccaaggcac 181 cccccacctt caaggtctca cttatggacc agtcagtaag agaaggccaa gatgtcatca 241 tgagcatccg cgtgcagggg gagcccaagc ctgtggtctc ctggctgaga aaccgccagc 301 ccgtgcgccc agaccagcgg cgctttgcgg aggaggctga gggtgggctg tgccggctgc 361 ggatcctggc tgcagagcgt ggcgatgctg gtttctacac ttgcaaagcg gtcaatgagt 421 atggtgctcg gcagtgcgag gcccgcttgg aggtccgagg cgagtgagct cagggggcca 481 cctgcgctcc ccccgctacc ctccgagccg cgcccctgtc tcaggcacct ctcggacctc 541 gctgtgtttc actgcctcct gcccacagac ccagctgccg gcccggaccc gtcccagcct 601 cccctcccca ccccatgcag cccccagggg gatagcccat gggcccctgt ggaccctccc 661 tccccaagtg gacacatggc tgtgcagcca ggaggcccac agatggactg agtgctggga 721 aggggcggct gcgaggggta tcaacccccc gagtctctcc ctgaagggga gcaccgggcg 781 agtgcatgtg ctactgctgc tacaggcctg tctatctgtt tgtctgtctg tgtgtctgtg 841 acagtcaggg aaggatgcct cggagctgag gtggggtgag acagagtggg agagattacg 901 gcatggcatg gaggggccca aggagcaggg gctgttgaca aaggccttac caggaagggt 961 taggacactg accattctag aaatgggatt cgaatggcac aacactttct atttcacaaa 1021 agaccaaaag ccagaggccc caggctctgt gctgatgaac agcctggctg agccctggcc 1081 ctggcaggtt tagggcccat ttggggcccc ctccttctct gtcagggctg gggtgctctg 1141 tctgggaatg agggagttaa ccaagtttgg tgcaggagca ggggcagggg gccactgtag 1201 tgagcgtgga gaaatttgga aacacctatt tcttaactca aataaagtcc agtttgtacc 1261 taaaaaaaaa a // LOCUS HSU57316 2093 bp DNA PRI 16-AUG-1996 DEFINITION Human GCN5 (hGCN5) gene, complete cds. ACCESSION U57316 NID g1491934 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2093) AUTHORS Berry,R., Stevens,T.J., Walter,N.A., Wilcox,A.S., Rubano,T., Hopkins,J.A., Weber,J., Goold,R., Soares,M.B. and Sikela,J.M. TITLE Gene-based sequence-tagged-sites (STSs) as the basis for a human gene map JOURNAL Nature Genet. 10 (4), 415-423 (1995) MEDLINE 95400322 REFERENCE 2 (bases 1 to 2093) AUTHORS Yang,X.J., Ogryzko,V.V., Nishikawa,J., Howard,B.H. and Nakatani,Y. TITLE A p300/CBP-associated factor that competes with the adenoviral oncoprotein E1A JOURNAL Nature 382 (6589), 319-324 (1996) MEDLINE 96300317 REFERENCE 3 (bases 1 to 2093) AUTHORS Walter,N. and Sikela,J.M. TITLE Randomly sequenced clone NIB2000-R JOURNAL Unpublished (1996) REFERENCE 4 (bases 1 to 2093) AUTHORS Nakatani,Y. TITLE Direct Submission JOURNAL Submitted (30-APR-1996) NICHD, NIH, Bldg. 6, Rm. 416, 6 Center Dr., MSC 2753, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..2093 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /dev_stage="fetal" /note="hGCN5 was isolated by use of NIB2000-R as the probe" gene 352..1782 /gene="hGCN5" CDS 352..1782 /gene="hGCN5" /note="similar to yeast GCN5 and human p300/CBP-associated factor (P/CAF)" /codon_start=1 /product="GCN5" /db_xref="PID:g1491935" /translation="MLEEEIYGANSPIWESGFTMPPSEGTQLVPRPASVSAAVVPSTP IFSPSMGGGSNSSLSLDSAGAEPMPGEKRTLPENLTLEDAKRLRVMGDIPMELVNEVM LTITDPAAMLGPETSLLSANAARDETARLEERRGIIEFHVIGNSLTPKANRRVLLWLV GLQNVFSHQLPRMPKEYIARLVFDPKHKTLALIKDGRVIGGICFRMFPTQGFTEIVFC AVTSNEQVKGYGTHLMNHLKEYHIKHNILYFLTYADEYAIGYFKKQGFSKDIKVPKSR YLGYIKDYEGATLMECELNPRIPYTELSHIIKKQKEIIKKLIERKQAQIRKVYPGLSC FKEGVRQIPVESVPGIRETGWKPLGKEKGKELKDPDQLYTTLKNLLAQIKSHPSAWPF MEPVKKSEAPDYYEVIRFPIDLKTMTERLRSRYYVTRKLFVADLQRVIANCREYNPPD SEYCRCASALEKFFYFKLKEGGLIDK" BASE COUNT 435 a 653 c 586 g 419 t ORIGIN 1 gaattccggc gaaaccactc atgtctttgg gcgaagcctt ctccggtcca ttttcaccgt 61 tacccgccgg cagctgctgg aaaagttccg agtggagaag gacaaattgg tgcccgagaa 121 gaggaccctc atcctcactc acttccccaa gtaaggctcc ttctggccta ccaggatttg 181 gccccaagtt cacatcctcc ctgttgtccc cttttttcca ggaaggcttc ctggattggt 241 ccctcctctc cctccatggg ccttttggga tctgggcgtc tacctggcag acttgcccat 301 ggcccagaag caacttgcta gtactagtct ggggatggca gattcctgtc catgctggag 361 gaggagatct atggggcaaa ctctccaatc tgggagtcag gcttcaccat gccaccctca 421 gaggggacac agctggttcc ccggccagct tcagtcagtg cagcggttgt tcccagcacc 481 cccatcttca gccccagcat gggtgggggc agcaacagct ccctgagtct ggattctgca 541 ggggccgagc ctatgccagg cgagaagagg acgctcccag agaacctgac cctggaggat 601 gccaagcggc tccgtgtgat gggtgacatc cccatggagc tggtcaatga ggtcatgctg 661 accatcactg accctgctgc catgctgggg cctgagacga gcctgctttc ggccaatgcg 721 gcccgggatg agacagcccg cctggaggag cgccgcggca tcatcgagtt ccatgtcatc 781 ggcaactcac tgacgcccaa ggccaaccgg cgggtgttgc tgtggctcgt ggggctgcag 841 aatgtctttt cccaccagct gccgcgcatg cctaaggagt atatcgcccg cctcgtcttt 901 gacccgaagc acaagactct ggccttgatc aaggatgggc gggtcatcgg tggcatctgc 961 ttccgcatgt ttcccaccca gggcttcacg gagattgtct tctgtgctgt cacctcgaat 1021 gagcaggtca agggttatgg gacccacctg atgaaccacc tgaaggagta tcacatcaag 1081 cacaacattc tctacttcct cacctacgcc gacgagtacg ccatcggcta cttcaaaaag 1141 cagggtttct ccaaggacat caaggtgccc aagagccgct acctgggcta catcaaggac 1201 tacgagggag cgacgctgat ggagtgtgag ctgaatcccc gcatccccta cacggagctg 1261 tcccacatca tcaagaagca gaaagagatc atcaagaagc tgattgagcg caaacaggcc 1321 cagatccgca aggtctaccc ggggctcagc tgcttcaagg agggcgtgag gcagatccct 1381 gtggagagcg ttcctggcat tcgagagaca ggctggaagc cattggggaa ggagaagggg 1441 aaggagctga aggaccccga ccagctctac acaaccctca aaaacctgct ggcccaaatc 1501 aagtctcacc ccagtgcctg gcccttcatg gagcctgtga agaagtcgga ggcccctgac 1561 tactacgagg tcatccgctt ccccattgac ctgaagacca tgactgagcg gctgcgaagc 1621 cgctactacg tgacccggaa gctctttgtg gccgacctgc agcgggtcat cgccaactgt 1681 cgcgagtaca accccccgga cagcgagtac tgccgctgtg ccagcgccct ggagaagttc 1741 ttctacttca agctcaagga gggaggcctc attgacaagt aggcccatct ttgggccgca 1801 gccctgacct ggaatgtctc cacctcggat tctgatctga tccttagggg gtgccctggc 1861 cccacggacc cgactcagct tgagacactc cagccaaggg tcctccggac ccgatcctgc 1921 agctctttct ggaccttcag gcacccccaa gcgtgcagct ctgtcccagc cttcactgtg 1981 tgtgagaggt ctcctgggtt ggggcccagc ccctctagag tagctggtgg ccagggatga 2041 accttgccca gccgtggtgg cccccaggcc tggtccccaa gagcccggaa ttc // LOCUS HSU57317 3014 bp mRNA PRI 05-DEC-1996 DEFINITION Human p300/CBP-associated factor (P/CAF) mRNA, complete cds. ACCESSION U57317 NID g1491936 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3014) AUTHORS Yang,X.J., Ogryzko,V.V., Nishikawa,J., Howard,B.H. and Nakatani,Y. TITLE A p300/CBP-associated factor that competes with the adenoviral oncoprotein E1A JOURNAL Nature 382 (6589), 319-324 (1996) MEDLINE 96300317 REFERENCE 2 (bases 1 to 3014) AUTHORS Nakatani,Y. TITLE Direct Submission JOURNAL Submitted (30-APR-1996) NICHD, NIH, Bldg. 6, Rm 416, 6 Center Dr., MSC 2753, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..3014 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /dev_stage="fetal" gene 459..2957 /gene="P/CAF" CDS 459..2957 /gene="P/CAF" /note="similar to hGCN5 and yeast GCN5; histone acetyltransferase" /codon_start=1 /product="p300/CBP-associated factor" /db_xref="PID:g1491937" /translation="MSEAGGAGPGGCGAGAGAGAGPGALPPQPAALPPAPPQGSPCAA AAGGSGACGPATAVAAAGTAEGPGGGGSARIAVKKAQLRSAPRAKKLEKLGVYSACKA EESCKCNGWKNPNPSPTPPRADLQQIIVSLTESCRSCSHALAAHVSHLENVSEEEMNR LLGIVLDVEYLFTCVHKEEDADTKQVYFYLFKLLRKSILQRGKPVVEGSLEKKPPFEK PSIEQGVNNFVQYKFSHLPAKERQTIVELAKMFLNRINYWHLEAPSQRRLRSPNDDIS GYKENYTRWLCYCNVPQFCDSLPRYETTQVFGRTLLRSVFTVMRRQLLEQARQEKDKL PLEKRTLILTHFPKFLSMLEEEVYSQNSPIWDQDFLSASSRTSQLGIQTVINPPPVAG TISYNSTSSSLEQPNAGSSSPACKASSGLEANPGEKRKMTDSHVLEEAKKPRVMGDIP MELINEVMSTITDPAAMLGPETNFLSAHSARDEAARLEERRGVIEFHVVGNSLNQKPN KKILMWLVGLQNVFSHQLPRMPKEYITRLVFDPKHKTLALIKDGRVIGGICFRMFPSQ GFTEIVFCAVTSNEQVKGYGTHLMNHLKEYHIKHDILNFLTYADEYAIGYFKKQGFSK EIKIPKTKYVGYIKDYEGATLMGCELNPRIPYTEFSVIIKKQKEIIKKLIERKQAQIR KVYPGLSCFKDGVRQIPIESIPGIRETGWKPSGKEKSKEPRDPDQLYSTLKSILQQVK SHQSAWPFMEPVKRTEAPGYYEVIRSPMDLKTMSERLKNRYYVSKKLFMADLQRVFTN CKEYNAPESEYYKCANILEKFFFSKIKEAGLIDK" BASE COUNT 824 a 770 c 810 g 610 t ORIGIN 1 ggggccgcgt cgacgcggaa aagaggccgt ggggggcctc ccagcgctgg cagacaccgt 61 gaggctggca gccgccggca cgcacaccta gtccgcagtc ccgaggaaca tgtccgcagc 121 cagggcgcgg agcagagtcc cgggcaggag aaccaaggga gggcgtgtgc tgtggcggcg 181 gcggcagcgg cagcggagcc gctagtcccc tccctcctgg gggagcagct gccgccgctg 241 ccgccgccgc caccaccatc agcgcgcggg gcccggccag agcgagccgg gcgagcggcg 301 cgctaggggg agggcggggg cggggagggg ggtgggcgaa gggggcggga gggcgtgggg 361 ggagggtctc gctctcccga ctaccagagc ccgagggaga ccctggcggc ggcggcggcg 421 cctgacactc ggcgcctcct gccgtgctcc ggggcggcat gtccgaggct ggcggggccg 481 ggccgggcgg ctgcggggca ggagccgggg caggggccgg gcccggggcg ctgcccccgc 541 agcctgcggc gcttccgccc gcgcccccgc agggctcccc ctgcgccgct gccgccgggg 601 gctcgggcgc ctgcggtccg gcgacggcag tggctgcagc gggcacggcc gaaggaccgg 661 gaggcggtgg ctcggcccga atcgccgtga agaaagcgca actacgctcc gctccgcggg 721 ccaagaaact ggagaaactc ggagtgtact ccgcctgcaa ggccgaggag tcttgtaaat 781 gtaatggctg gaaaaaccct aacccctcac ccactccccc cagagccgac ctgcagcaaa 841 taattgtcag tctaacagaa tcctgtcgga gttgtagcca tgccctagct gctcatgttt 901 cccacctgga gaatgtgtca gaggaagaaa tgaacagact cctgggaata gtattggatg 961 tggaatatct ctttacctgt gtccacaagg aagaagatgc agataccaaa caagtttatt 1021 tctatctatt taagctcttg agaaagtcta ttttacaaag aggaaaacct gtggttgaag 1081 gctctttgga aaagaaaccc ccatttgaaa aacctagcat tgaacagggt gtgaataact 1141 ttgtgcagta caaatttagt cacctgccag caaaagaaag gcaaacaata gttgagttgg 1201 caaaaatgtt cctaaaccgc atcaactatt ggcatctgga ggcaccatct caacgaagac 1261 tgcgatctcc caatgatgat atttctggat acaaagagaa ctacacaagg tggctgtgtt 1321 actgcaacgt gccacagttc tgcgacagtc tacctcggta cgaaaccaca caggtgtttg 1381 ggagaacatt gcttcgctcg gtcttcactg ttatgaggcg acaactcctg gaacaagcaa 1441 gacaggaaaa agataaactg cctcttgaaa aacgaactct aatcctcact catttcccaa 1501 aatttctgtc catgctagaa gaagaagtat atagtcaaaa ctctcccatc tgggatcagg 1561 attttctctc agcctcttcc agaaccagcc agctaggcat ccaaacagtt atcaatccac 1621 ctcctgtggc tgggacaatt tcatacaatt caacctcatc ttcccttgag cagccaaacg 1681 cagggagcag cagtcctgcc tgcaaagcct cttctggact tgaggcaaac ccaggagaaa 1741 agaggaaaat gactgattct catgttctgg aggaggccaa gaaaccccga gttatggggg 1801 atattccgat ggaattaatc aacgaggtta tgtctaccat cacggaccct gcagcaatgc 1861 ttggaccaga gaccaatttt ctgtcagcac actcggccag ggatgaggcg gcaaggttgg 1921 aagagcgcag gggtgtaatt gaatttcacg tggttggcaa ttccctcaac cagaaaccaa 1981 acaagaagat cctgatgtgg ctggttggcc tacagaacgt tttctcccac cagctgcccc 2041 gaatgccaaa agaatacatc acacggctcg tctttgaccc gaaacacaaa acccttgctt 2101 taattaaaga tggccgtgtt attggtggta tctgtttccg tatgttccca tctcaaggat 2161 tcacagagat tgtcttctgt gctgtaacct caaatgagca agtcaagggc tatggaacac 2221 acctgatgaa tcatttgaaa gaatatcaca taaagcatga catcctgaac ttcctcacat 2281 atgcagatga atatgcaatt ggatacttta agaaacaggg tttctccaaa gaaattaaaa 2341 tacctaaaac caaatatgtt ggctatatca aggattatga aggagccact ttaatgggat 2401 gtgagctaaa tccacggatc ccgtacacag aattttctgt catcattaaa aagcagaagg 2461 agataattaa aaaactgatt gaaagaaaac aggcacaaat tcgaaaagtt taccctggac 2521 tttcatgttt taaagatgga gttcgacaga ttcctataga aagcattcct ggaattagag 2581 agacaggctg gaaaccgagt ggaaaagaga aaagtaaaga gcccagagac cctgaccagc 2641 tttacagcac gctcaagagc atcctccagc aggtgaagag ccatcaaagc gcttggccct 2701 tcatggaacc tgtgaagaga acagaagctc caggatatta tgaagttata aggtccccca 2761 tggatctcaa aaccatgagt gaacgcctca agaataggta ctacgtgtct aagaaattat 2821 tcatggcaga cttacagcga gtctttacca attgcaaaga gtacaacgcc cctgagagtg 2881 aatactacaa atgtgccaat atcctggaga aattcttctt cagtaaaatt aaggaagctg 2941 gattaattga caagtgattt tttttccccc tctgcttctt agaaactcac caagcagtgt 3001 gcctaaagca aggt // LOCUS HSU57342 1502 bp mRNA PRI 28-SEP-1996 DEFINITION Human myelodysplasia/myeloid leukemia factor 2 (MLF2) mRNA, complete cds. ACCESSION U57342 NID g1399744 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1502) AUTHORS Kuefer,M.U., Look,A.T., Williams,D.C., Valentine,V., Naeve,C.W., Behm,F.G., Mullersman,J.E., Yoneda-Kato,N., Montgomery,K., Kucherlapati,R. and Morris,S.W. TITLE cDNA cloning, tissue distribution, and chromosomal localization of myelodysplasia/myeloid leukemia factor 2 (MLF2) JOURNAL Genomics 35 (2), 392-396 (1996) MEDLINE 96299794 REFERENCE 2 (bases 1 to 1502) AUTHORS Kuefer,M.U. TITLE Direct Submission JOURNAL Submitted (01-MAY-1996) Martin U. Kuefer, Experimental Oncology, St. Jude Children's Research Hospital, 332 N. Lauderdale, Memphis, TN 38101, USA FEATURES Location/Qualifiers source 1..1502 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="12" /map="12p13" /clone="765b9" gene 180..926 /gene="MLF2" CDS 180..926 /gene="MLF2" /codon_start=1 /product="myelodysplasia/myeloid leukemia factor 2" /db_xref="PID:g1399745" /translation="MFRFMRDVEPEDPMFLMDPFAIHRQHMSRMLSGGFGYSPFLSIT DGNMPGTRPASRRMQQAGAVSPFGMLGMSGGFMDMFGMMNDMIGNMEHMTAGGNCQTF SSSTVISYSNTGDGAPKVYQETSEMRSAPGGIRETRRTVRDSDSGLEQMSIGHHIRDR AHILQRSRNHRTGDQEERQDYINLDESEAAAFDDEWRRETSRFRQQRPLEFRRLESSG AGGRRAEGPPRLAIQGPEDSPSRQSRRYDW" BASE COUNT 307 a 449 c 408 g 338 t ORIGIN 1 ctctaaaggg cagctgtggg aggaggcggc gtggaaggcc gaggagctca agcccggacc 61 aatccccacg ttccgggccg cgaccctgac cctgcagcgt accgggaagc gaaaccggcc 121 ggatgggccg ctgagcccga atcgggcact gtgtggagcc ccctggagct gagatcagga 181 tgttccgctt catgagggac gtggagcctg aggatcccat gttcctgatg gatccctttg 241 ctattcaccg tcagcatatg agccgtatgt tgtcaggtgg ctttggatat agccccttcc 301 tcagcatcac agatggcaac atgccaggga ccaggcctgc cagccgccgg atgcagcagg 361 ctggagctgt ctcccccttt gggatgctgg gaatgtcggg tggtttcatg gacatgtttg 421 ggatgatgaa tgacatgatt ggaaacatgg aacacatgac agctggaggc aattgccaga 481 ccttctcatc ttccactgtc atctcctact ccaatacggg tgatggtgcc cccaaggtct 541 accaagagac atcagagatg cgctcggcac caggcgggat ccgggagaca cggaggactg 601 ttcgggattc agacagtgga ctggagcaga tgtccattgg gcatcacatc cgggacaggg 661 ctcacatcct ccagcgctcc cgaaaccatc gcacggggga ccaggaggag cggcaggact 721 atatcaacct ggatgagagt gaggccgcag cgtttgatga cgagtggcgg cgggagacct 781 cccgattccg gcagcagcgt cccctggagt ttcggcggct tgagtcctca ggggctgggg 841 gacgaagggc ggaggggcct ccccgcctgg ccatccaggg acctgaggac tccccttccc 901 gacagtcccg ccgctatgac tggtgagggc cccgggccct cagcctctct tgtacaggct 961 gagaggctga gaaatcatcc cctgaataac tttttcctct cgattcccat ccccaattta 1021 atattaaatt aacaggcaag ccggccccca cctctccctg ggggtctcag ggagaacctt 1081 tcacggcacc ctttccctac cttttccttc tttaatctcc tggtttacca ttgatgactt 1141 cgcctctgca tctactgact tgatttttca ttctgccact ccatcttcaa accccctcac 1201 ctttcccatc ctactcctgc catgcattga agggtcaatg cattttgggg tgagctctgg 1261 gtttaggggc cccctccatc cctcagctac cctggatctt tgcccacctc ttcctcagag 1321 cccccactga ggggccgtag ccctatctag ggctgtggaa ggagcagact ggttcctaac 1381 tctctccctc ctcctgccca cacacatcaa aagaatcttc cctacaccct tctctgcctt 1441 tattttttga tttgtgcaac ttgtaactag gtgtttatgg aataaaggag aatggaaaaa 1501 ag // LOCUS HSU57592 4068 bp mRNA PRI 16-NOV-1996 DEFINITION Human jumonji putative protein (jumonji) mRNA, complete cds. ACCESSION U57592 NID g1669845 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4068) AUTHORS Berge-Lefranc,J.L., Jay,P., Massacrier,A., Cau,P., Mattei,M.G., Bauer,S., Marsollier,C., Berta,P. and Fontes,M. TITLE Characterization of the human jumonji gene JOURNAL Hum. Mol. Genet. 5 (10), 1637-1641 (1996) MEDLINE 97049972 REFERENCE 2 (bases 1 to 4068) AUTHORS Jay,P. TITLE Direct Submission JOURNAL Submitted (05-MAY-1996) Philippe Jay, Centre de Recherches de Biologie Macromolculaire, CNRS/INSERM, Campus CNRS, 1919 Route de Mende, Montpellier, 34033, France FEATURES Location/Qualifiers source 1..4068 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="Y4" /chromosome="6" /map="6p24-p23" /dev_stage="fetus" gene 245..4045 /gene="jumonji" CDS 245..4045 /gene="jumonji" /codon_start=1 /product="jumonji putative protein" /db_xref="PID:g1669846" /translation="MSKERPKRNIIQKKYDDSDGIPWSEERVVRKVLYLSLKEFKNSQ KRQHAEGIAGSLKTVNGLLGNDQSKGLGPASEQSENEKDDASQVSSTSNDVSSSDFEE GPSRKRPRLQAQRKFAQSQPNSPSTTPVKIVEPLLPPPATQISDLSKRKPKTEDFLTF LCLRGSPALPNSMVYFGSSQDEEEVEEEDDETEDVKTATNNASSSCQSTPRKGKTHKH VHNGHVFNGSSRSTREKEPVQKHKSKEATPAKEKHSDHRADSRREQASANHPAAAPST GSSAKGLAATHHHPPLHRSAQDLRKQVSKVNGVTRMSSLGAGVTSAKKMREVRPSPSK TVKYTATVTKGAVTYTKAKRELVKDTKPNHHKPSSAVNHTISGKTESSNAKTRKQVLS LGGASKSTGPAVNGLKVSGRLNPKSCTKEVGGRQLREGLQLREGLRNSKRRLEEAHQA EKPQSPPKKMKGAAGPAEGPGKKAPAERGLLNGHVKKEVPERSLERNRPKRATAGKST PGRQAHGKADSASCENRSTSQPESVHKPQDSGKAEKGGGKAGWAAMDEIPVLRPSAKE FHDPLIYIESVRAQVEKFGMCRVIPPPDWRPECKLNDEMRFVTQIQHIHKLGRRWGPN VQRLACIKKHLKSQGITMDELPLIGGCELDLACFFRLINEMGGMQQVTELKKWNKLSD MLRIPKTAQERLAKLQEAYCQYILSYDSLSPEEHRRLEKEVLMEKEILEKRKGPLEGH TENDHHKFHPLPRLEPKNGLIHGVAPRNGFRSKLKEVGQAQLKTGRRRLFAQEKEVVK EEEEDKGVLNDFHKCIYKGRSVSLTTFYRTARNIMSMCFSKEPAPAEIEQEYWRLVEE KDCHVAVHCGKVDTNTHGSGFPVGKSEPFSRHGWNLTVLPNNTGSILRHLGAVPGVTI PWLNIGMVFSTSCWSRDQNHLPYIDYLHTGADCIWYCIPAEEENKLEDVVHTLLQANG TPGLQMLESNVMISPEVLCKEGIKVHRTVQQSGQFVVCFPGSFVSKVCCGYSVSETVH FATTQWTSMGFETAKEMKRRHIAKPFSMEKLLYQIAQAEAKKENGPTLSTISALLDEL RDTELRQRRQLFEAGLHSSARYGSHDGSSTVADGKKKPRKWLQLETSERRCQICQHLC YLSMVVQENENVVFCLECALRHVEKQKSCRGLKLMYRYDEEQIISLVNQICGKVSGKN GSIENCLHKPTPKRGPRKRATVDVPPSRAVSLQFIQKCFELHHEDAQRPWSIYIYFFV IIIF" BASE COUNT 1087 a 1076 c 1129 g 776 t ORIGIN 1 gttttactaa agtgaatttt tttttgtttg cttcgttcgt ctttggctct ttttttttcc 61 ttcccaattt cggatttatt tcaaggcgaa tctggctttg ggggaagagg aagaaaagtc 121 ggattacaag atcaaccacc accaacaaca ataaaaacca ccaggatatt tttttgcaaa 181 tttctgacgg ctttaaattc atgaagcaat tgtccccttt tgcaatcagc atttggatct 241 cagaatgagc aaggaaagac ccaagaggaa tatcattcag aagaaatacg atgacagtga 301 tgggattccg tggtcagaag aacgggtggt acgtaaagtc ctttatttgt ccctgaagga 361 attcaagaat tcccagaaga ggcagcatgc ggaaggcatt gctgggagcc tgaaaactgt 421 gaatgggctc cttggtaatg accagtctaa gggattagga ccagcatcag aacagtcaga 481 gaatgaaaag gacgatgcat cccaagtgtc ctccactagc aacgatgtta gttcttcaga 541 ttttgaagaa gggccgtcga ggaaaaggcc caggctgcaa gcacaaagga agtttgctca 601 gtctcagccg aatagtccca gcacaactcc agtaaagata gtggagccat tgctaccccc 661 tccagctact cagatatcag acctctctaa aaggaagcct aagacagaag attttcttac 721 ctttctctgc cttcgaggtt ctcctgcgct gcccaacagc atggtgtatt ttggaagctc 781 tcaggatgag gaggaagtcg aggaggaaga tgatgagaca gaagacgtca aaacagccac 841 caacaatgct tcatcttcat gccagtcgac ccccaggaaa ggaaaaaccc acaaacatgt 901 tcacaacggg catgttttca atggttccag caggtcaaca cgggagaagg aacctgttca 961 aaaacacaaa agcaaagagg ccactcccgc aaaggagaag cacagcgatc accgggctga 1021 cagccgccgg gagcaggctt cagctaacca ccccgcagcg gccccctcca cgggttcctc 1081 ggccaagggg cttgctgcca cccatcacca cccccctctg catcggtcgg ctcaggactt 1141 acggaaacag gtttctaagg taaacggagt cactcgaatg tcatctctgg gtgcaggtgt 1201 aaccagtgcc aaaaagatgc gcgaggtcag accttcacca tccaaaactg tgaagtacac 1261 tgccacggtg acgaaggggg ctgtcacata caccaaagcc aagagagaac tggtcaagga 1321 caccaaaccc aatcaccaca agcccagttc cgctgtcaac cacacaatct cagggaaaac 1381 tgaaagtagc aatgcaaaaa cccgcaaaca ggtgctatcc ctcggggggg cgtccaagtc 1441 cactgggccc gccgtcaatg gcctcaaggt cagtggcagg ttgaacccaa agtcatgcac 1501 taaggaggtg ggggggcggc agctgcggga gggcctgcag ctgcgggagg ggctgcggaa 1561 ctccaagagg agactggaag aggcacacca ggcggagaag ccgcagtcgc cccccaagaa 1621 gatgaaaggg gcggctggcc ccgccgaagg ccctggcaag aaggccccgg ccgagagagg 1681 tctgctgaac ggacacgtga agaaggaagt gccggagcgc agtctggaga ggaatcggcc 1741 gaagcgggcc acggccggga agagcacgcc aggcagacaa gcacatggca aggcggacag 1801 cgcctcctgt gaaaatcgtt ctacctcgca accggagtcc gtgcacaagc cgcaggactc 1861 gggcaaggcc gagaagggcg gcggcaaggc cgggtgggcg gccatggacg agatccccgt 1921 cctcaggccc tccgccaagg agttccacga tccgctcatc tacatcgagt cggtccgcgc 1981 tcaggtggag aagttcggga tgtgcagggt gatcccccct ccggactggc ggcccgagtg 2041 caagctcaac gatgagatgc ggtttgtcac gcagattcag cacatccaca agctgggccg 2101 gcgctggggc cccaacgtgc agcggctggc ctgcatcaag aagcacctca aatctcaggg 2161 catcaccatg gacgagctcc cgctcatagg gggctgtgag ctcgacctgg cctgcttttt 2221 ccggctgatt aatgagatgg gcggcatgca acaagtgact gaactcaaaa aatggaacaa 2281 actatcagac atgctgcgca tccccaaaac tgcccaggaa cggctggcca agctgcagga 2341 agcctactgc cagtacatac tttcgtatga ctccctgtcc ccagaggagc accggcggct 2401 ggagaaggag gtgctgatgg agaaggagat cctggagaag cgcaaggggc cgctggaagg 2461 ccacacagag aacgaccacc acaagttcca ccctctgccc cgcttagagc ccaagaatgg 2521 gctcatccac ggcgtggccc ccaggaacgg cttccgcagc aagctcaagg aggtgggcca 2581 ggcccagttg aagactggcc ggcggcgact cttcgctcag gaaaaagaag tggtcaagga 2641 agaggaggag gacaaaggcg tcctcaatga cttccacaag tgcatctata agggaaggtc 2701 tgtttctcta acaacttttt atcgaacagc gaggaatatc atgagcatgt gtttcagcaa 2761 ggagcctgcc ccagccgaaa tcgagcaaga gtactggagg ctagtggaag agaaggactg 2821 ccacgtggca gtgcactgcg gcaaggtgga caccaacact cacggcagtg gattcccagt 2881 aggaaaatca gaaccctttt cgaggcatgg atggaacctc accgtcctcc ccaataacac 2941 agggtccatc ctgcgtcacc tcggtgctgt gcctggagtg actattccct ggctaaatat 3001 tggcatggtc ttttctacct catgctggtc tcgagaccaa aatcaccttc catacattga 3061 ctacttacac actggtgctg actgcatttg gtattgcatt cctgctgagg aggagaacaa 3121 gctggaagat gtggtccaca ccctgctgca agccaatggc accccagggc tgcagatgct 3181 ggaaagcaac gtcatgatct ccccggaggt gctgtgcaaa gaggggatca aggtgcacag 3241 gaccgtgcag cagagtggcc agtttgtcgt ctgcttcccg ggatcctttg tgtccaaagt 3301 gtgctgtggg tacagcgtgt ctgaaaccgt gcactttgct accacccagt ggacaagtat 3361 gggctttgag accgccaagg aaatgaagcg tcgccatata gctaagccat tctccatgga 3421 gaagttactc taccagattg cacaagcaga agcaaaaaaa gaaaacggtc ccactctcag 3481 taccatctca gccctcctgg atgagctcag ggatacagag ctacggcagc gcaggcagct 3541 gttcgaggct ggcctccact cctccgcacg ctatggcagc cacgatggca gcagcacggt 3601 ggcggacggg aagaaaaagc ctcgaaagtg gctgcagttg gagacgtcag agaggaggtg 3661 tcagatctgc cagcacctgt gctacctgtc catggtggta caagagaacg aaaacgtcgt 3721 gttctgtctg gagtgtgctc tgcgccacgt ggagaaacag aagtcctgcc gagggctgaa 3781 gttgatgtac cgctacgatg aggaacagat tatcagtctg gtcaatcaga tctgcggcaa 3841 agtgtctggt aaaaacggca gcattgagaa ctgtctccat aaacccacac caaaaagagg 3901 tccccgcaag agagcgacag tggacgtgcc cccctcccgt gctgtcagcc tccagttcat 3961 ccaaaagtgc ttcgagctac atcatgaaga tgcccaacgc ccgtggtcga tttatatata 4021 tttttttgta attattatat tctagtttgg agtacttgct gtaggatc // LOCUS HSU57629 2784 bp mRNA PRI 16-MAY-1996 DEFINITION Human retinitis pigmentosa GTPase regulator (RPGR) mRNA, complete cds. ACCESSION U57629 NID g1314870 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2784) AUTHORS Meindl,A., Dry,K., Herrmann,K., Manson,F., Ciccodicola,A., Edgar,A., Carvalho,M.R.S., Achatz,H., Hellebrand,H., Lennon,A., Migliaccio,C., Porter,K., Zrenner,E., Bird,A., Jay,M., Lorenz,B., Wittwer,B., D'Urso,M., Meitinger,T. and Wright,A. TITLE A gene (RPGR) with homology to the RCC1 guanine nucleotide exchange factor is mutated in X-linked retinitis pigmentosa (RP3) JOURNAL Nature Genet. 13 (1), 35-42 (1996) MEDLINE 96241570 REFERENCE 2 (bases 1 to 2784) AUTHORS Herrmann,K. TITLE Direct Submission JOURNAL Submitted (06-MAY-1996) Klaus Herrmann, Paediatrische Genetik, LMU Muenchen, Goethestrasse 29, Muenchen D-80336, Germany FEATURES Location/Qualifiers source 1..2784 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /map="Xp21.1" gene 1..2507 /gene="RPGR" exon 1..87 /gene="RPGR" /number=1 CDS 60..2507 /gene="RPGR" /note="similar to RCC1 guanine nucleotide exchange factor, SwissProt Accession Number P18754" /codon_start=1 /product="retinitis pigmentosa GTPase regulator" /db_xref="PID:g1314871" /translation="MREPEELMPDSGAVFTFGKSKFAENNPGKFWFKNDVPVHLSCGD EHSAVVTGNNKLYMFGSNNWGQLGLGSKSAISKPTCVKALKPEKVKLAACGRNHTLVS TEGGNVYATGGNNEGQLGLGDTEERNTFHVISFFTSEHKIKQLSAGSNTSAALTEDGR LFMWGDNSEGQIGLKNVSNVCVPQQVTIGKPVSWISCGYYHSAFVTTDGELYVFGEPE NGKLGLPNQLLGNHRTPQLVSEIPEKVIQVACGGEHTVVLTENAVYTFGLGQFGQLGL GTFLFETSEPKVIENIRDQTISYISCGENHTALITDIGLMYTFGDGRHGKLGLGLENF TNHFIPTLCSNFLRFIVKLVACGGCHMVVFAAPHRGVAKEIEFDEINDTCLSVATFLP YSSLTSGNVLQRTLSARMRRRERERSPDSFSMRRTLPPIEGTLGLSACFLPNSVFPRC SERNLQESVLSEQDLMQPEEPDYLLDEMTKEAEIDNSSTVESLGETTDILNMTHIMSL NSNEKSLKLSPVQKQKKQQTIGELTQDTALTENDDSDEYEEMSEMKEGKACKQHVSQG IFMTQPATTIEAFSDEEVEIPEEKEGAEDSKGNGIEEQEVEANEENVKVHGGRKEKTE ILSDDLTDKAEDHEFSKTEELKLEDVDEEINAENVESKKKTVGDDESVPTGYHSKTEG AERTNDDSSAETIEKKEKANLEERAICEYNENPKGYMLDDADSSSLEILENSETTPSK DMKKTKKIFLFKRVPSINQKIVKNNNEPLPEIKSIGDQIILKSDNKDADQNHMSQNHQ NIPPTNTERRSKSCTIL" exon 88..213 /gene="RPGR" /number=2 misc_feature 96..118 /gene="RPGR" /note="encodes GTP phosphate binding motif 'A'" misc_feature 147..164 /gene="RPGR" /note="encodes GTP phosphate binding motif 'B'" misc_feature 174..1154 /gene="RPGR" /note="encodes region that is similar to RCC1" exon 214..306 /gene="RPGR" /number=3 exon 307..369 /gene="RPGR" /number=4 exon 370..528 /gene="RPGR" /number=5 exon 529..678 /gene="RPGR" /number=6 exon 679..837 /gene="RPGR" /number=6 exon 838..993 /gene="RPGR" /number=8 exon 994..1118 /gene="RPGR" /number=9 exon 1119..1304 /gene="RPGR" /number=10 exon 1305..1473 /gene="RPGR" /number=11 exon 1474..1565 /gene="RPGR" /number=12 exon 1566..1631 /gene="RPGR" /number=13 exon 1632..1812 /gene="RPGR" /number=14 exon 1813..1964 /gene="RPGR" /number=15 exon 1965..2150 /gene="RPGR" /number=16 exon 2151..2208 /gene="RPGR" /number=17 exon 2209..2300 /gene="RPGR" /number=18 exon 2301..2504 /gene="RPGR" /number=19 misc_feature 2493..2501 /gene="RPGR" /note="encodes isoprenylation site" polyA_signal 2755..2760 BASE COUNT 952 a 481 c 621 g 730 t ORIGIN 1 accaaaccgt cctctacagc ctcctggccc cggcgcaggc tgcccgtact gcccgtggca 61 tgagggagcc ggaagagctg atgcccgatt cgggtgctgt gtttacattt gggaaaagta 121 aatttgctga aaataatccc ggtaaattct ggtttaaaaa tgatgtccct gtacatcttt 181 catgtggaga tgaacattct gctgttgtta ccggaaataa taaactttac atgtttggca 241 gtaacaactg gggtcagtta ggattaggat caaagtcagc catcagcaag ccaacatgtg 301 tcaaagctct aaaacctgaa aaagtgaaat tagctgcctg tggaaggaac cacaccctgg 361 tgtcaacaga aggaggcaat gtatatgcaa ctggtggaaa taatgaagga cagttggggc 421 ttggtgacac cgaagaaaga aacacttttc atgtaattag cttttttaca tccgagcata 481 agattaagca gctgtctgct ggatctaata cttcagctgc cctaactgag gatggaagac 541 tttttatgtg gggtgacaat tccgaagggc aaattggttt aaaaaatgta agtaatgtct 601 gtgtccctca gcaagtgacc attgggaaac ctgtctcctg gatctcttgt ggatattacc 661 attcagcttt tgtaacaaca gatggtgagc tatatgtgtt tggagaacct gagaatggga 721 agttaggtct tcccaatcag ctcctgggca atcacagaac accccagctg gtgtctgaaa 781 ttccggagaa ggtgatccaa gtagcctgtg gtggagagca tactgtggtt ctcacggaga 841 atgctgtgta tacctttggg ctgggacaat ttggtcagct gggtcttggc acttttcttt 901 ttgaaacttc agaacccaaa gtcattgaga atattaggga tcaaacaata agttatattt 961 cttgtggaga aaatcacaca gctttgataa cagatatcgg ccttatgtat acttttggag 1021 atggtcgcca cggaaaatta ggacttggac tggagaattt taccaatcac ttcattccta 1081 ctttgtgctc taattttttg aggtttatag ttaaattggt tgcttgtggt ggatgtcaca 1141 tggtagtttt tgctgctcct catcgtggtg tggcaaaaga aattgaattc gatgaaataa 1201 atgatacttg cttatctgtg gcgacttttc tgccgtatag cagtttaacc tcaggaaatg 1261 tactgcagag gactctatca gcacgtatgc ggcgaagaga gagggagagg tctccagatt 1321 ctttttcaat gaggagaaca ctacctccaa tagaagggac tcttggcctt tctgcttgtt 1381 ttctccccaa ttcagtcttt ccacgatgtt ctgagagaaa cctccaagag agtgtcttat 1441 ctgaacagga cctcatgcag ccagaggaac cagattattt gctagatgaa atgaccaaag 1501 aagcagagat agataattct tcaactgtag aaagccttgg agaaactact gatatcttaa 1561 acatgacaca catcatgagc ctgaattcca atgaaaagtc attaaaatta tcaccagttc 1621 agaaacaaaa gaaacaacaa acaattgggg aactgacgca ggatacagct cttactgaaa 1681 acgatgatag tgatgaatat gaagaaatgt cagaaatgaa agaagggaaa gcatgtaaac 1741 aacatgtgtc acaagggatt ttcatgacgc agccagctac gactatcgaa gcattttcag 1801 atgaggaagt agagatccca gaggagaagg aaggagcaga ggattcaaaa ggaaatggaa 1861 tagaggagca agaggtagaa gcaaatgagg aaaatgtgaa ggtgcatgga ggaagaaagg 1921 agaaaacaga gatcctatca gatgacctta cagacaaagc agaggatcat gaattttcta 1981 aaactgagga actaaaacta gaagatgtgg atgaggaaat taatgctgaa aatgtggaaa 2041 gcaagaagaa aactgtggga gatgatgaaa gtgttcctac aggttatcac agtaaaacag 2101 aaggagcaga aagaaccaat gatgatagct cagctgaaac tattgaaaag aaagaaaaag 2161 ccaacctaga ggaacgggcc atttgtgagt acaatgaaaa cccaaaagga tacatgcttg 2221 atgatgcaga tagcagttca ttagaaatcc tagaaaacag tgaaacaaca ccaagcaaag 2281 acatgaaaaa aacaaagaag atttttctgt tcaaaagagt cccctcaata aatcaaaaga 2341 ttgtcaagaa taacaatgag ccgctcccag agataaaatc cataggagac cagatcattt 2401 taaaaagtga taataaagat gccgaccaga accacatgag tcagaatcat cagaatatcc 2461 caccaacaaa tacagagaga agatcaaaat cctgtacaat actataaata tatatttatg 2521 ttttcacagt caccaagtgt attgtaatgt atacttgaaa aatgttataa cttatgaagt 2581 aaagtttctg atagtagtct ttaaaagata taagacttaa tatgttttat tcagcttcta 2641 taagtgtgac cagttttgat atttatttat gctaatattt ttaacaagtc atttcaaaat 2701 atgtgtatct caaattctcc ctaaagtgtt gtggccttaa ctgttcagta ttgcaataaa 2761 aaatatattt ttatatgtgg aaaa // LOCUS HSU57721 1637 bp mRNA PRI 23-AUG-1996 DEFINITION Human L-kynurenine hydrolase mRNA, complete cds. ACCESSION U57721 NID g1323714 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1637) AUTHORS Alberati-Giani,D., Buchli,R., Malherbe,P., Broger,C., Lang,G., Kohler,C., Lahm,H.W. and Cesura,A.M. TITLE Isolation and expression of a cDNA clone encoding human kynureninase JOURNAL Eur. J. Biochem. 239 (2), 460-468 (1996) MEDLINE 96314506 REFERENCE 2 (bases 1 to 1637) AUTHORS Malherbe,P. TITLE Direct Submission JOURNAL Submitted (08-MAY-1996) Pari Malherbe, Pharma Division, Perclinical Research CNS, F. Hoffmann-La Roche, Bldg. 69/333, Basel CH-4070, Switzerland FEATURES Location/Qualifiers source 1..1637 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="hepatoma" /cell_line="Hep G2" CDS 107..1504 /EC_number="3.7.1.3" /note="kynureninase" /codon_start=1 /product="L-kynurenine hydrolase" /db_xref="PID:g1323715" /translation="MEPSSLELPADTVQRIAAELKCHPTDERVALHLDEEDKLRHFRE CFYIPKIQDLPPVDLSLVNKDENAIYFLGNSLGLQPKMVKTYLEEELDKWAKIAAYGH EVGKRPWITGDESIVGLMKDIVGANEKEIALMNALTVNLHLLMLSFFKPTPKRYKILL EAKAFPSDHYAIESQLQLHGLNIEESMRMIKPREGEETLRIEDILEVIEKEGDSIAVI LFSGVHFYTGQHFNIPAITKAGQAKGCYVGFDLAHAVGNVELYLHDWGVDFACWCSYK YLNAGAGGIAGAFIHEKHAHTIKPALVGWFGHELSTRFKMDNKLQLIPGVCGFRISNP PILLVCSLHASLEIFKQATMKALRKKSVLLTGYLEYLIKHNYGKDKAATKKPVVNIIT PSHVEERGCQLTITFSVPNKDVFQELEKRGVVCDKRNPNGIRVAPVPLYNSFHDVYKF TNLLTSILDSAETKN" BASE COUNT 511 a 306 c 355 g 465 t ORIGIN 1 aagaactggc ctgtacattt tcaaggaatt cttgagaggt tcttggagag attctgggag 61 ccaaacactc cattgggatc ctagctgttt tagagaacaa cttgtaatgg agccttcatc 121 tcttgagctg ccggctgaca cagtgcagcg cattgcggct gaactcaaat gccacccaac 181 ggatgagagg gtggctctcc acctagatga ggaagataag ctgaggcact tcagggagtg 241 cttttatatt cccaaaatac aggatctgcc tccagttgat ttatcattag tgaataaaga 301 tgaaaatgcc atctatttct tgggaaattc tcttggcctt caaccaaaaa tggttaaaac 361 atatcttgaa gaagaactag ataagtgggc caaaatagca gcctatggtc atgaagtggg 421 gaagcgtcct tggattacag gagatgagag tattgtaggc cttatgaagg acattgtagg 481 agccaatgag aaagaaatag ccctaatgaa tgctttgact gtaaatttac atcttctaat 541 gttatcattt tttaagccta cgccaaaacg atataaaatt cttctagaag ccaaagcctt 601 cccttctgat cattatgcta ttgagtcaca actacaactt cacggactta acattgaaga 661 aagtatgcgg atgataaagc caagagaggg ggaagaaacc ttaagaatag aggatatcct 721 tgaagtaatt gagaaggaag gagactcaat tgcagtgatc ctgttcagtg gggtgcattt 781 ttacactgga cagcacttta atattcctgc catcacaaaa gctggacaag cgaagggttg 841 ttatgttggc tttgatctag cacatgcagt tggaaatgtt gaactctact tacatgactg 901 gggagttgat tttgcctgct ggtgttccta caagtattta aatgcaggag caggaggaat 961 tgctggtgcc ttcattcatg aaaagcatgc ccatacgatt aaacctgcat tagtgggatg 1021 gtttggccat gaactcagca ccagatttaa gatggataac aaactgcagt taatccctgg 1081 ggtctgtgga ttccgaattt caaatcctcc cattttgttg gtctgttcct tgcatgctag 1141 tttagagatc tttaagcaag cgacaatgaa ggcattgcgg aaaaaatctg ttttgctaac 1201 tggctatctg gaatacctga tcaagcataa ctatggcaaa gataaagcag caaccaagaa 1261 accagttgtg aacataatta ctccgtctca tgtagaggag cgggggtgcc agctaacaat 1321 aacattttct gttccaaaca aagatgtttt ccaagaacta gaaaaaagag gagtggtttg 1381 tgacaagcgg aatccaaatg gcattcgagt ggctccagtt cctctctata attctttcca 1441 tgatgtttat aaatttacca atctgctcac ttctatactt gactctgcag aaacaaaaaa 1501 ttagcagtgt tttctagaac aacttaagca aattatactg aaagctgctg tggttatttc 1561 agtattattc gatttttaat tattgaaagt atgtcaccat tgaccacatg taactaacaa 1621 taaataatat accttac // LOCUS HSU57796 3923 bp mRNA PRI 12-JUN-1996 DEFINITION Human zinc finger protein (LD5-1) mRNA, complete cds. ACCESSION U57796 NID g1373393 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3923) AUTHORS Beutler,E., Gelbart,T., West,C., Kuhl,W. and Lee,P.L. TITLE A strategy for cloning the hereditary hemochromatosis gene JOURNAL Blood Cells Mol. Dis. 21, 206-216 (1995) REFERENCE 2 (bases 1 to 3923) AUTHORS Lee,P.L. TITLE Direct Submission JOURNAL Submitted (08-MAY-1996) Pauline L. Lee, Molecular and Experimental Medicine, Scripps Research Institute, 10550 North Torrey Pines Rd, La Jolla, CA 92037, USA FEATURES Location/Qualifiers source 1..3923 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6p21.3" /tissue_type="ovary" gene 185..1921 /gene="LD5-1" CDS 185..1921 /gene="LD5-1" /note="contains nine zinc finger motifs" /codon_start=1 /product="zinc finger protein" /db_xref="PID:g1373394" /translation="MAEESRKPSAPSPPDQTPEEDLVIVKVEEDHGWAQESSLHESNP LGQEVFRLRFRQLRYQETLGPREALIQLRALCHQWLRPDLNTKEQILELLVLEQFLTI LPEELQTLVKDHQLENGEEVVTLLEDLERQIDILGRPVSARVHGHRVLWEEVVHSASA PEPPNTQLQSEATQHKSPVPQESQERAMSTSQSPTRSQKGSSGDQEMTATLLTAGFQT LEKIEDMAVSLIREEWLLDPSQKDLCRDNRPENFRNMFSLGGETRSENRELASKQVIS TGIQPHGETAAKCNGDVIRGLEHEEARDLLGRLERQRGNPTQERRHKCDECGKSFAQS SGLVRHWRIHTGEKPYQCNVCGKAFSYRSALLSHQDIHNKVKRYHCKECGKAFSQNTG LILHQRIHTGEKPYQCNQCGKAFSQSAGLILHQRIHSGERPYECNECGKAFSHSSHLI GHQRIHTGEKPYECDECGKTFRRSSHLIGHQRSHTGEKPYKCNECGRAFSQKSGLIEH QRIHTGERPYKCKECGKAFNGNTGLIQHLRIHTGEKPYQCNECGKAFIQRSSLIRHQR IHSGEKSESISV" BASE COUNT 1132 a 869 c 826 g 1094 t 2 others ORIGIN 1 gctttcatcg cctttactcc ccgaccttcc ttcgagtctg tttatccgtt gcagcctccc 61 ttccccacga cggggcgcct ctgcaactca caaagtaccc ttagaaagag gccctcagaa 121 gagtcttctc ttaagaagat aaagaaggta gtggaaacga acttcctgag cttttcaggc 181 tctaatggct gaagaatcaa gaaagccttc agccccatcc ccaccagacc agactcctga 241 agaggatctt gtaatcgtca aggtagagga ggatcatggt tgggcccagg aatctagtct 301 gcatgaaagt aaccctcttg gccaagaagt gttccgcctg cgcttcaggc agttacgcta 361 ccaggagaca ctaggacccc gagaagctct gatccaacta cgggcccttt gccatcagtg 421 gctgaggcca gatttgaaca ccaaggaaca gatcctggag ctgctggtgc tggagcagtt 481 cttgaccatc ctacctgagg agctccagac actggttaag gatcatcagt tagagaacgg 541 agaggaggtg gtgaccctat tagaggattt ggaaaggcag attgatatac taggacgacc 601 agtctcagct cgcgtacatg gacatagggt actctgggag gaggtagtac attcagcatc 661 tgcaccagag cctccaaata ctcagctcca atctgaggca acccaacata aatctccagt 721 gccccaagag tcacaagaga gagccatgtc tacttcccag agtcctactc gttcccagaa 781 aggaagttct ggagaccagg aaatgacagc tacacttctc acagcagggt tccagacttt 841 ggagaagatt gaagacatgg ctgtgtccct tattcgagag gagtggcttc ttgatccatc 901 acagaaggat ctgtgtagag ataacaggcc agaaaatttc agaaacatgt tctccctggg 961 tggtgagacc aggagtgaga acagggaatt agcttcaaaa caggtaatat ctactggaat 1021 ccagccacat ggagagacag ctgccaaatg caacggggat gttatcaggg gtcttgagca 1081 tgaagaagcc cgagaccttc tgggcagatt agagaggcag cggggaaatc ccacacaaga 1141 gagacgacat aaatgtgatg aatgtgggaa aagctttgct cagagctcag gccttgttcg 1201 ccactggaga atccacactg gggagaaacc ctatcagtgt aatgtgtgtg gtaaagcctt 1261 cagttacagg tcagcccttc tttcacatca ggatatccac aacaaagtaa aacgctatca 1321 ctgtaaggag tgtggcaaag ccttcagtca gaacacaggc ctgattctgc accagagaat 1381 ccacactggg gagaagccat atcagtgcaa tcagtgtggg aaggctttca gtcagagtgc 1441 gggccttatt ctgcaccaga gaatccacag tggagagaga ccctatgaat gtaatgagtg 1501 tgggaaagct ttcagtcata gctcacacct cattggacat cagagaatcc acactgggga 1561 gaagccctat gagtgtgatg agtgtgggaa aaccttcagg cggagctcac atcttattgg 1621 tcatcagagg agccacactg gggagaaacc ctacaaatgc aatgagtgtg ggagggcctt 1681 cagtcagaag tcaggcctta ttgaacatca gagaatccac actggagaaa gaccctataa 1741 atgtaaagaa tgtgggaaag ctttcaatgg gaacactggt ctcattcaac acctgagaat 1801 tcacacaggg gagaagccct accaatgtaa tgagtgtggg aaagccttta ttcagaggtc 1861 aagtctcatt cgacatcaga gaatccacag tggtgaaaaa tctgaatcca taagcgttta 1921 ggaacaacat cagttagagt ttgagcatta ttcagcatta gggaaaccac acactggtga 1981 gaggtctttc agtgtactaa aaggcagaaa ggtcatcaca actttagtgt cagaatctat 2041 agtggtggca aagttaggat agctctttag taatttgggc ccagtgcctt ggtgaaggtt 2101 gattaccttg actgtctttg aaagtatctg atggagctcc tgatgaccta atgtatcctt 2161 tagaaattta aaatagcatt agagcaagtt gcttgtcatg gcttgaagag cttaactttg 2221 tcttttgggt gagtagctat aggtttgaga gaggctggct actcctagtt cctgtgcttc 2281 ttcccatctc ctgtgatccc cctctcccat ttattccctt gaaggtgccc ttgtattcct 2341 aatctgatct aagaagctgc tgtagagtca gaagtagctt ctgtgtgaaa cttatatttg 2401 tatgtagctt tctaggaaaa ttttacagac acttaaagta tttatgctta agtgtgcatt 2461 tttttattct aatattcctt ctagtcagtg gaaatttgta ttcccataca aactcagaat 2521 gctatatttt taacagcact ccagcatttt tcttcatgta tcaccttcac tgactccatt 2581 gagtcattga gcatctgtac atatccttca ccacccccag tcccagcacc tgtgtcagta 2641 aagaaagtgg acacattagt aatggttatg ggagtaatac agagaaagag gtgagtggag 2701 actcagccca gctgctcttg agcttggata gttatacatt ttggtgactt ctgactctcc 2761 ttccacctcc ccactgtaca ctgtggtaca gatagttggc actatcctag ttatggcatc 2821 aggtaggaga aggagacgaa gatgctttag gaagataaaa accttacaaa gaatgtatac 2881 tcatcttttt tggcccatta caatactgct ttctcaggtt cttaacagcc tgtgcaaact 2941 ccctgactag atggtctgtc actcctatca ttaaccttgc acccatctct ttagatggcc 3001 tcagcacctc actcgttatt ctccatctgt ttcttgccat tagtttatat gactttaata 3061 tccacagtgg taatgtaata ccttacctca caggttttca ttcttttgaa ctctggtgca 3121 ctccatctgt aatccatgtc tataatctac caacaaagat gtactcttaa actcagtatc 3181 accaggcact atatttcatc tttgaggctt ttgttatacc aaggaatact atataaagta 3241 atgagactgg tttaaagtaa catgccagaa cttacgcatc caatgtgggt tgtcacttgt 3301 tattcccaag tactattatt ccatactact atattcttcc aaatatttgg aacactctta 3361 gaattcccat ttaataaacc atatgagaaa accctctatt actcttacac tttagtttta 3421 ccatttgagt agccacttat tcaccagagt tggaattgag aaattttggg ctttccaaaa 3481 ttaaccttta aacttaaaat tgattacatt gagcctattc agaaaaaaaa ttatataggt 3541 tctgatgaaa gcttcataat ttcaccctgt gactatcctg aaaagaaata ataatcattt 3601 tatgattgat gggccaaaaa aagcaatcaa agtcttacgt catagaggac caccttttgg 3661 atcataaatt ctttccctat aacattcctt ctntgcctct ctatggccca ctttgaccac 3721 taacaacatc agaattatnt acatcaattt cattgagcca ttggtgaatg aagacaccca 3781 tcacttgtca tgttggtttc caacagtcct ttcgattatc ctttattttc ttatcctccc 3841 ttctcttgtt tcttttattt ataagttacc attcctagtt tctttgtact tgaaatttta 3901 agaaaccaga aacaaataaa ttt // LOCUS HSU57911 2272 bp mRNA PRI 07-JUL-1996 DEFINITION Human fetal brain (239FB) mRNA, from the WAGR region, complete cds. ACCESSION U57911 NID g1405359 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2272) AUTHORS Schwartz,F., Neve,R., Eisenman,R., Gessler,M. and Bruns,G. TITLE A WAGR region gene between PAX-6 and FSHB expressed in fetal brain JOURNAL Hum. Genet. 94 (6), 658-664 (1994) MEDLINE 95080775 REFERENCE 2 (bases 1 to 2272) AUTHORS Schwartz,F., Eisenman,R., Knoll,J., Gessler,M. and Bruns,G. TITLE cDNA sequence, genomic organization, and evolutionary conservation of a novel gene from the WAGR region JOURNAL Genomics 29 (2), 526-532 (1995) MEDLINE 96115606 REFERENCE 3 (bases 1 to 2272) AUTHORS Schwartz,F. and Bruns,G.A.P. TITLE Direct Submission JOURNAL Submitted (09-MAY-1996) F. Schwartz, Center for Human Genetics, Boston University School of Medicine, 80 East Concord Street, Boston, MA 02118, USA FEATURES Location/Qualifiers source 1..2272 /organism="Homo sapiens" /note="maps to 11p13/p14 boundary region; associated with mental retardation component of the WAGR syndrome; expressed in fetal brain" /db_xref="taxon:9606" /chromosome="11" /map="11p13" gene 121..1005 /gene="239FB" CDS 121..1005 /gene="239FB" /codon_start=1 /db_xref="PID:g1405360" /translation="MAHGIPSQGKVTITVDEYSSNPTQAFTHYNINQSRFQPPHVHMV DPIPYDTPKPAGHTRFVCISDTHSRTDGIQMPYGDILLHTGDFTELGLPSEVKKFNDW LGNLPYEYKIVIAGNHELTFDKEFMADLVKQDYYRFPSVSKLKPEDFDNVQSLLTNSI YLQDSEVTVKGFRIYGAPWTPWFNGWGFNLPRGQSLLDKWNLIPEGIDILMTHGPPLG FRDWVPKELQRVGCVELLNTVQRRVRPKLHVFGGIHEGYGIMTDGYTTYINASTCTVS FQPTNPPIIFDLPNPQGS" BASE COUNT 681 a 438 c 452 g 701 t ORIGIN 1 aatgcacagc ggtattgatg agtagatcct tggattcaga ggttggctga aacgcaccat 61 gcctgcttcc atcttttgct ctgtaaagtt gtgaattgct catgcctata gggaggaagg 121 atggcacatg ggattccttc tcaaggcaaa gttaccataa cggtggatga gtacagctca 181 aaccccaccc aggcattcac gcactacaac atcaaccaga gcagattcca gcctccacat 241 gtacatatgg tcgaccccat cccatatgac actccaaaac cagcgggcca cacgcggttt 301 gtctgcatct cagacacaca ctccagaaca gatggtatcc agatgcctta tggggacatc 361 cttctccaca caggcgattt caccgagctg ggactgccct cagaggttaa gaagtttaat 421 gactggttag gaaacctgcc atatgaatat aaaatagtga ttgctgggaa tcatgaactg 481 acatttgata aggaattcat ggcagacctt gttaaacagg actactaccg tttcccctct 541 gtgtccaaat tgaaaccaga ggactttgac aatgttcagt ccctcctgac aaacagtatt 601 tacttacaag attcggaggt aacagtgaag ggattcagga tatacggtgc accttggacc 661 ccgtggttta atggatgggg ctttaaccta cccagaggtc agtctctgct ggacaagtgg 721 aacctcatcc ctgagggcat tgacatactc atgacacatg gacctcctct aggttttcga 781 gactgggttc caaaggagct tcaaagagtg ggctgtgtgg agctgttaaa cacggttcag 841 aggcgagtcc ggcccaagct ccatgtgttt ggtggaatcc atgaaggtta tggcatcatg 901 accgacggtt acacaacgta catcaatgcc tcgacgtgta cagtcagctt tcaaccgacc 961 aaccctccaa ttatatttga ccttccaaac ccacagggtt cctgaagctc taaatgccct 1021 attggaatgt gagggaaggt ctataaactg ccatttttct aattataaac ttacattctc 1081 ttacttattt acaaaccctg tgagttcttt ttgtaaattg ttggaacaca aatgatgcta 1141 gaggttgtgc ttcttatttt attttatttt aaatggggca tccatttgaa atcagaggaa 1201 cattgtgaat ttgtaaaatg acttctgttt tctcaaaggc catgccattg taaattgtta 1261 gtgttcgcca aaggacagcc aagctttctt ttaaaaagtg ataaaagtct tattttaata 1321 tgctttaagc tgaaagaaaa aaaaataaga aacaggcagt gttttaaaaa ccaacacaga 1381 tttgcacaac tgtttaagag tattgtttga aatattttaa ttttcaatgt tttgttgttg 1441 ttgttttctt ggtaatgctt cttttttgca gatgtggtcc caatttatag caatcttctc 1501 aacagaagta ggcatggaaa agacttcttt tcatactctc actataaaga aagctgcatt 1561 gagaagaaaa tggctgtcat ttaaaggatg gtttaactag tgagattcct attgtggtta 1621 tacaaggtct cattgtttgt ttgtttcttt taaattattt cagctttaaa aatacagaaa 1681 tggaatctgt caagagcagg tatttcatac ggttaaaaaa atgaacatgc agactccttt 1741 tcaatatggg tttatatata taagtatttt ttgtgtatta tgactacgtt aggagtttaa 1801 tattgtcaag gacagtacaa ctgcaaaggg atgctgtata gcagcacatc agaagtcgga 1861 aggaactgac acattctctc agagctcaag gtcttaaaga gcttgagtta aatctaggta 1921 cagttacagg catgtataga cttaaatgga tgcaatggaa gctaactaaa ataaggctta 1981 gttgtccttt ctatttaaat accccaagtt gtcttcttac ttcctctccc ctctcccatt 2041 ttgcactgtg tgtcgatgca atcttcgcta gcacaaaata ttgtcgctaa tagtcatttc 2101 tgttttccca ttgtaaatgc tgttgagctt tattctattt tatgttactt tgttaatgaa 2161 atttaggaaa gcagttgttt ctttaaattt attgtgatat tctatatcta gcggccttta 2221 tatgcaaata aaattgcaag atttttaaaa aaaaaaaaaa aaaaaaaaaa aa // LOCUS HSU58048 2474 bp mRNA PRI 24-OCT-1996 DEFINITION Human metallopeptidase PRSM1 mRNA, complete cds. ACCESSION U58048 NID g1354930 KEYWORDS zincin, metalloproteinase, gluzincin, procollagen type III N-proteinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2474) AUTHORS Scott,I.C., Halila,R., Jenkins,J.M., Mehan,S., Apostolou,S., Winqvist,R., Callen,D.F., Prockop,D.J., Peltonen,L. and Kadler,K.E. TITLE Molecular cloning, expression and chromosomal localization of a human gene encoding a 33 kDa putative metallopeptidase (PRSM1) JOURNAL Gene 174 (1), 135-143 (1996) MEDLINE 97017139 REFERENCE 2 (bases 1 to 2474) AUTHORS Scott,I.C., Halila,R., Jenkins,J.M., Mehan,S., Apostolou,S., Winqvist,R., Callen,D.F., Prockop,D.J., Peltonen,L. and Kadler,K.E. TITLE Direct Submission JOURNAL Submitted (13-MAY-1996) Karl E. Kadler, Wellcome Trust Centre for Cell-Matrix Research, University of Manchester, School of Biological Sciences, Oxford Road, Manchester, UK M13 9PT, England FEATURES Location/Qualifiers source 1..2474 /organism="Homo sapiens" /note="Identified by immune screening of a human placental cDNA library." /db_xref="taxon:9606" /chromosome="16" /map="16q24.3" CDS 41..997 /note="protease, metallo, 33 kDa, number 1; immunolocalized to various human tissues and cell lines including placenta, aorta, skin fibroblasts, fetal liver, kidney and bone cells; Contains a HEXXH motif found in the zincin superfamily of metallopeptidases. putative metallopeptidase" /codon_start=1 /product="PRSM1" /db_xref="PID:g1354931" /translation="MATAAGSRAPPLWRDRPPSSREEHPVRPEVGRRPRKSPPGAAWS VRSPPGPDTARSWRPFSLSECCSCHCGHGRYPVPVEVHGEAAGEAGQEGGEGLQGGAG QSEEGPSAEKCRVCPCVCRERHPQEERRCELASDGVPRRRSGLQGGHSCDYEGGDQEY GPGDQSPGQGPEHHGPAEGLLSDGQVRAAGAEPGRPYIGDGGLHELGHHPDHAAGAGG QPHHADRRGEWPGGAGPAQPAARGRLCRGRELCAQPGGPAVTEVGRLEELAVPRRCAP PLPRDVLEGSCPLPTASCLCADPAGLRPAATLRLSPARPAWP" BASE COUNT 433 a 760 c 763 g 518 t ORIGIN 1 agcgactcac aaaaggtcct cggccacggc gtgcgtcacc atggcgaccg ccgccggcag 61 ccgcgccccg cccctctggc gggaccggcc accatcctct cgcgaggagc atcccgtgcg 121 accggaagtg gggcggcgac cccggaagtc cccgccgggt gcagcttggt cggttcgatc 181 gccgccggga cctgacaccg cccggagttg gcgtcccttc tccctctccg agtgctgctc 241 ctgtcattgt ggccatggac gataccctgt tccagttgaa gttcacggcg aagcagctgg 301 agaagctggc caagaaggcg gagaaggact ccaaggcgga gcaggccaaa gtgaagaagg 361 cccttctgca gaaaaatgta gagtgtgccc gtgtgtatgc cgagaacgcc atccgcaaga 421 agaacgaagg tgtgaactgg cttcggatgg cgtcccgcgt agacgcagtg gcctccaagg 481 tggacacagc tgtgactatg aagggggtga ccaagaatat ggcccaggtg accaaagccc 541 tggacaaggc cctgagcacc atggacctgc agaaggtctc ctcagtgatg gacaggttcg 601 agcagcaggt gcagaacctg gacgtccata catcggtgat ggaggactcc atgagctcgg 661 ccaccaccct gaccacgccg caggagcagg tggacagcct catcatgcag atcgccgagg 721 agaatggcct ggaggtgctg gaccagctca gccagctgcc cgagggcgcc tctgccgtgg 781 gcgagagctc tgtgcgcagc caggaggacc agctgtcacg gaggttggcc gccttgagga 841 actagccgtg ccccgccggt gtgcaccgcc tctgccccgt gatgtgctgg aaggctcctg 901 tcctctcccc accgcgtctt gcctttgtgc tgaccccgcg gggctgcggc cggcagccac 961 tctgcgtctc tcacctgcca ggcctgcgtg gccttagggt tgttcctgtt cttttaggtt 1021 gggcggtggg tctgtgtcct ggtgttgagt ttctgcaaat ttctgggggt gatttctgtg 1081 actctgggcc cacagcgggg aggccaagag gggccctgtg gactttcacc cagcactgtg 1141 ggggccttca gactctgggg cagcagacat gctgcttccc atcagccaga gggggtcagg 1201 gctgccctgt tgccaaacaa ctccctgagg cctctccgca ccacctcagc gggcaggagg 1261 tcccaccatg tggacagaca tagcccaagg aggcaccaca ggtctatgtg tgctggggga 1321 tgtcaggtgc cacccaacgc tgtcctggtg gtatttacaa tgacatcctc ctcctccatc 1381 actccagggg tggtgtctcg gccgccccta ccagctggct gagccccctg gcctcctgcg 1441 ctccctcact tccctcagtt cccaaagctg cccagtccat ggggacagaa ccgtcactca 1501 gatccacatt caagtgtgcc caccctgcag tcttcatcct cactcagctg ctgcctctgg 1561 aggtgccttt ggccacatgt gctgtgctgt ttgtctcctc gacagggagc ctgtccacca 1621 gcaggctgcg gtcccagcgg gtgcgtctgc agctcctccc cttgggcagg ctggttctcc 1681 cggaggacct ttccttgggg ccctgcttca tgacgatgct gcctgtgtca ccctctacca 1741 tctgtaaaca actgggtgcc ttccccgacc acaccccaat gccttcccag cttggaagcc 1801 aaggcagctg atgaagggag ctcaggagag ccgtcttcag ctgggctggg gttggggctg 1861 ctgtgaggaa aacctgccat tgtggccctg gagagtcacc agcagctctt gggaaggact 1921 tgctgggagg ctgagagagg ctttgggcac agcctgctgt cttttccatt tcctaaagtt 1981 tacttcattg tcttgaggct tccaggtttt gtttttgttt ttgccaaagt agaaaaggca 2041 ggtggtgggc ggctggcagg gagtgcgggt ccccgcccct cttcagtcct gccctcccct 2101 cctcagtcct gcccaccccg tgcagcccat gctgaggctg cagtggtgtc gtgggtgtta 2161 cgtgcaggaa cgtggagacc ctgacgtggg ctcactgcat ttggttttct tttcagaact 2221 tgggagcccc cagggagggg ctagtgttgg taggtcctag acgtggttcc ctccagcctc 2281 cccaaaatca accctggtgt tgagagaacg tccttctgtc catcgtgggt aacagccttg 2341 gggagggtgc agagctctgc agagccatgg gccaggtggg gctgcctcag tcctgtcccc 2401 ttgggcactg aggagagggg cccattcacc tttctcctag aatgctgttg taaataaaca 2461 aatggatccc tgga // LOCUS HSU58087 2511 bp mRNA PRI 19-JUN-1996 DEFINITION Human Hs-cul-1 mRNA, complete cds. ACCESSION U58087 NID g1381141 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2511) AUTHORS Kipreos,E.T., Lander,L.E., Wing,J.P., He,W.W. and Hedgecock,E.M. TITLE cul-1 is required for cell cycle exit in C. elegans and identifies a novel gene family JOURNAL Cell 85 (6), 829-839 (1996) MEDLINE 96279828 REFERENCE 2 (bases 1 to 2511) AUTHORS Kipreos,E.T., Lander,L.E., Wing,J.P., He,W.W. and Hedgecock,E.M. TITLE Direct Submission JOURNAL Submitted (13-MAY-1996) Edward T. Kipreos, Department of Cellular Biology, University of Georgia, 353 Biological Sciences Building, Athens, GA 30605, USA FEATURES Location/Qualifiers source 1..2511 /organism="Homo sapiens" /db_xref="taxon:9606" gene 125..2383 /gene="Hs-cul-1" CDS 125..2383 /gene="Hs-cul-1" /note="Cullin gene family member" /codon_start=1 /product="Hs-CUL-1" /db_xref="PID:g1381142" /translation="MSSTRSQNPHGLKQIGLDQIWDDLRAGIQQVYTRQSMAKSRYME LYTHVYNYCTSVHQFVGLELYKRLKEFLKNYLTNLLKDGEDLMDESVLKFYTQQWEDY RFSSKVLNGICAYLNRHWVRRECDEGRKGIYEIYSLALVTWRDCLFRPLNKQVTNAVL KLIEKERNGETINTRLISGVVQSYVELGLNEDDAFAKGPTLTVYKESFESQFLADTER FYTRESTEFLQQNPVTEYMKKAEARLLEEQRRVQVYLHESTQDELARKCEQVLIEKHL EIFHTEFQNLLDADKNEDLGRMYNLVSRIQDGLGELKKLLETHIHNQGLAAIEKCGEA ALNDPKMYVQTVLDVHKKYNALVMSAFNNDAGFVAALDKACGRFINNNAVTKMAQSSS KSPELLARYCDSLLKKSSKNPEEAELEDTLNQVMVVFKYIEDKDVFQKFYAKMLAKRL VHQNSASDDAEASMISKLKQACGFEYTSKLQRMFQDIGVSKDLNEQFKKHLTNSEPLD LDFSIQVLSSGSWPFQQSCTFALPSELERSYQRFTAFYASRHSGRKLTWLYQLSKGEL VTNCFKNRYTLQASTFQMAILLQYNTEDAYTVQQLTDSTQIKMDILAQVLQILLKSKL LVLEDENANVDEVELKPDTLIKLYLGYKNKKLRVNINVPMKTEQKQEQETTHKNIEED RKLLIQAAIVRIMKMRKVLKHQQLLGEVLTQLSSRFKPRVPVIKKCIDILIEKEYLER VDGEKDTYSYLA" BASE COUNT 804 a 501 c 576 g 630 t ORIGIN 1 gaagatcctt tctgagctgc tgtgaataaa tttggaatgg tactgtatat ttccatctaa 61 tggagaacta gctgtacttt gaataaggat tgctgcactg gacgacttta gaacatccct 121 cacaatgtcg tcaacccgga gccagaaccc ccacggcctg aagcagattg gcctggacca 181 gatctgggac gacctcagag ccggcatcca gcaggtgtac acacggcaga gcatggccaa 241 gtccagatat atggagctct acactcatgt ttataactac tgtactagtg ttcaccagtt 301 tgttggcctg gaattatata aacgacttaa ggaatttttg aagaattact tgacaaatct 361 tcttaaggat ggagaagatt tgatggatga gagtgtactg aaattctaca ctcaacaatg 421 ggaagattat cgattttcaa gcaaagtgct gaatggaatt tgtgcctacc tcaatagaca 481 ttgggttcgc cgtgaatgtg acgaaggacg aaaaggaata tatgaaatct attcgcttgc 541 attggtgact tggagagact gtctgttcag gccactgaat aaacaggtaa caaatgctgt 601 tttaaagctg attgaaaagg aaaggaatgg tgaaaccatc aatacaagat tgattagtgg 661 agttgtacag tcttacgtgg aattggggct gaatgaagat gatgcatttg caaagggccc 721 tacgttaaca gtgtataaag aatcctttga atctcaattt ttggctgaca cagagagatt 781 ttataccaga gagagtactg aattcttgca gcagaaccca gttactgaat atatgaaaaa 841 ggcagaggct cgtctgcttg aggaacaacg aagagttcag gtttaccttc atgaaagcac 901 acaagatgaa ttagcaagga aatgtgaaca agtcctcatt gaaaaacact tggaaatttt 961 ccacacagaa tttcagaatt tattggatgc tgacaaaaat gaagatttgg gacgcatgta 1021 taatcttgta tctagaatcc aggatggcct aggagaattg aaaaaactgt tggagacaca 1081 cattcataat cagggtcttg cagccattga aaagtgtgga gaagctgctt taaatgaccc 1141 caaaatgtat gtacagacag tgcttgatgt tcataaaaaa tacaatgccc tggtaatgtc 1201 tgcattcaac aatgacgctg gctttgtggc tgctcttgat aaggcttgtg gtcgcttcat 1261 aaacaacaac gcggttacca agatggccca atcatccagt aaatcccctg agttgctggc 1321 tcgatactgt gactccttgt tgaagaaaag ttccaagaac ccagaggagg cagaactaga 1381 agacacactc aatcaagtga tggttgtctt caagtacata gaagacaaag acgtatttca 1441 gaagttctat gcgaagatgc tcgccaagag gctcgtccac cagaacagtg caagtgacga 1501 tgccgaagcc agcatgatct ccaagttaaa gcaagcttgc gggttcgagt acacctctaa 1561 acttcagcgc atgtttcaag acattggcgt gagcaaagat ctgaacgagc aattcaaaaa 1621 gcacttgaca aactcagaac ccctagactt ggatttcagc attcaagtgc tgagctccgg 1681 gtcctggccc ttccagcagt cttgtacatt tgccttgccg tcagagttgg aacgtagtta 1741 tcagcgattc acagctttct acgccagccg ccacagtggc cgaaaattga cgtggttata 1801 tcagttgtct aaaggagaat tggtaactaa ctgcttcaaa aacagatata ctttgcaggc 1861 gtcgacattc cagatggcta tcctgcttca gtacaacacg gaagatgcct acactgtgca 1921 gcagctgacc gacagcactc aaattaaaat ggacattttg gcgcaagttt tacagatttt 1981 attaaagtcg aagctattgg tcttggaaga tgaaaatgca aatgttgatg aggtggaatt 2041 gaagccagat accttaataa aattatatct tggttataaa aataagaaat taagggttaa 2101 catcaatgtg ccaatgaaaa ccgaacagaa gcaggaacaa gaaaccacac acaaaaacat 2161 cgaggaagac cgcaaactac tgattcaggc ggccatcgtg agaatcatga agatgaggaa 2221 ggttctgaaa caccagcagt tacttggcga ggtcctcact cagctgtcct ccaggttcaa 2281 acctcgagtc cctgtgatca agaaatgcat tgacattcta attgagaaag aatatttgga 2341 gcgagtggat ggtgaaaagg acacctacag ttacttggct taacccttct ggaagggtct 2401 gactgtgtga cccgcagcaa atagttcatg ttggaaagaa tgaaaacaac ttcaagttca 2461 taggcagcca gcctgccgcc attggacctc ccttttaaaa actgaggacc a // LOCUS HSU58130 3362 bp mRNA PRI 05-SEP-1996 DEFINITION Human bumetanide-sensitive Na-K-2Cl cotransporter (NKCC2) mRNA, complete cds. ACCESSION U58130 NID g1373424 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3362) AUTHORS Simon,D.B., Karet,F.E., Hamdan,J.M., DiPietro,A., Sanjad,S.A. and Lifton,R.P. TITLE Bartter's syndrome, hypokalaemic alkalosis with hypercalciuria, is caused by mutations in the Na-K-2Cl cotransporter NKCC2 JOURNAL Nature Genet. 13 (2), 183-188 (1996) MEDLINE 96225445 REFERENCE 2 (bases 1 to 3362) AUTHORS Simon,D.B. TITLE Direct Submission JOURNAL Submitted (10-MAY-1996) David B. Simon, Genetics, Yale University School of Medicine, BCMM #136, 295 Congress Avenue, New Haven, CT 06536, USA FEATURES Location/Qualifiers source 1..3362 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="15" /map="15q15-21" gene 20..3319 /gene="NKCC2" CDS 20..3319 /gene="NKCC2" /note="Na-K-2Cl cotransporter" /codon_start=1 /product="bumetanide-sensitive Na-K-2Cl cotransporter" /db_xref="PID:g1373425" /translation="MSLNNSSNVFLDSVPSNTNRFQVSVINENHESSAAADDNTDPPH YEETSFGDEAQKRLRISFRPGNQECYDNFLHSGETAKTDASFHAYDSHTNTYYLQTFG HNTMDAVPKIEYYRNTGSISGPKVNRPSLLEIHEQLAKNVAVTPSSADRVANGDGIPG DEQAENKEDDQAGVVKFGWVKGVLVRCMLNIWGVMLFIRLSWIVGEAGIGLGVIIIGL STIVTTITGMSTSAIATNGVVRGGGAYYLISRSLGPEFGGSIGLIFAFANAVAVAMYV VGFAETVVDLLKESDSMMVDPTNDIRIIGSITVVILLGISVAGMEWEAKAQVILLVIL LIAIANFFIGTVIPSNNEKKSRGFFNYQASIFAENFGPRFTKGEGFFSVFAIFFPAAT GILAGANISGDLEDPQDAIPRGTMLAIFITTVAYLGVAICVGACVVRDATGNMNDTII SGMNCNGSAACGLGYDFSRCRHEPCQYGLMNNFQVMSMVSGFGPLITAGIFSATLSSA LASLVSAPKVFQALCKDNIYKALQFFAKGYGKNNEPLRGYILTFLIAMAFILIAELNT IAPIISNFFLASYALINFSCFHASYAKSPGWRPAYGIYNMWVSLFGAVLCCAVMFVIN WWAAVITYVIEFFLYVYVTCKKPDVNWGSSTQALSYVSALDNALELTTVEDHVKNFRP QCIVLTGGPMTRPALLDITHAFTKNSGLCICCEVFVGPRKLCVKEMNSGMAKKQAWLI KNKIKAFYAAVAADCFRDGVRSLLQASGLGRMKPNTLVIGYKKNWRKAPLTEIENYVG IIHDAFDFEIGVVIVRISQGFDISQVLQVQEELERLEQERLALEATIKDNECEEESGG IRGLFKKAGKLNITKTTPKKDGSINTSQSMHVGEFNQKLVEASTQFKKKQEKGTIDVW WLFDDGGLTLLIPYILTLRKKWKDCKLRIYVGGKINRIEEEKIAMASLLSKFRIKFAD IHIIGDINIRPNKESWKVFEEMIEPYRLHESCKDLTTAEKLKRETPWKITDAELEAVK EKSYRQVRLNELLQEHSRAANLIVLSLPVARKGSISDLLYMAWLEILTKNLPPVLLVR GNHKNVLTFYS" BASE COUNT 978 a 706 c 785 g 893 t ORIGIN 1 aaaaaatcaa ttttggaaga tgtcactgaa caactcttcc aatgtatttc tggattcagt 61 gcccagtaat accaatcgct ttcaagttag tgtcataaat gagaaccatg agagcagtgc 121 agctgcagat gacaatactg acccaccaca ttatgaagaa acctcttttg gggatgaagc 181 tcagaaaaga ctcagaatca gctttaggcc tgggaatcag gagtgctatg acaatttcct 241 ccacagtgga gaaactgcta aaacagatgc cagttttcac gcttatgatt ctcacacaaa 301 cacatactat ctacaaactt ttggccacaa caccatggat gccgttccca agatagagta 361 ctatcgtaac accggcagca tcagtgggcc caaggtcaac cgacccagcc tgcttgagat 421 tcacgagcaa ctcgcaaaga atgtggcagt caccccaagt tcagctgaca gagttgctaa 481 cggtgatggg atacctggag atgaacaagc tgaaaataag gaagatgatc aagctggtgt 541 tgtgaagttt ggatgggtga aaggtgtgct ggtaagatgc atgctgaaca tctggggagt 601 catgctcttc attcgcctct cctggattgt tggagaagct ggaattggtc ttggagttat 661 catcattggc ctatccacca tagtaacgac aatcacaggt atgtccacgt ctgctattgc 721 cacgaacgga gttgttagag gaggtggggc ctactatctt atttccagaa gtttagggcc 781 cgagttcggt gggtcaatag gcctgatctt tgcttttgct aatgcagtgg ctgttgctat 841 gtatgtggtg ggattcgctg aaactgtagt agatctactt aaggagagtg attcgatgat 901 ggtggatcca accaatgaca tccggattat aggctccatc acagtggtga ttcttctagg 961 aatttcagta gctggaatgg aatgggaggc aaaggcccaa gtcattcttc tggtcattct 1021 tctaattgct attgcaaact tcttcattgg aactgtcatt ccatccaaca atgagaaaaa 1081 gtccagaggt ttctttaatt accaagcatc aatatttgca gaaaactttg ggccacgctt 1141 cacaaagggt gaaggcttct tctctgtctt tgccattttt ttcccagcag ctactgggat 1201 tcttgctggt gccaatatct caggagattt ggaggatccc caagatgcca tccccagagg 1261 aaccatgctg gccattttca tcaccactgt tgcctactta ggggttgcaa tttgtgtagg 1321 ggcctgtgtg gtccgagatg ccaccgggaa catgaatgac accatcattt ctgggatgaa 1381 ctgcaatggt tcagcagcat gtgggttggg ctatgacttc tcaagatgtc gacatgaacc 1441 atgtcagtac gggctgatga acaatttcca ggtcatgagc atggtatcag ggttcggccc 1501 cctcatcact gcgggaatct tttctgcaac actctcctcc gccctggcct cccttgtcag 1561 cgcacccaaa gtgttccagg ctctgtgcaa ggacaacatc tacaaagccc tgcagttttt 1621 tgcaaaggga tatgggaaaa acaatgaacc cctgagagga tatattctca cttttcttat 1681 agccatggca tttattctta ttgcggaact gaacaccatt gctcccatca tctccaactt 1741 tttcctggcc tcatatgcac ttattaattt ctcctgcttc catgcctctt atgccaaatc 1801 tccaggatgg agacctgcgt atggaattta caacatgtgg gtatctcttt ttggagctgt 1861 tttgtgctgt gcagtcatgt ttgtcatcaa ctggtgggca gctgtcatca cctatgtcat 1921 tgaattcttc ctttacgtct atgtgacttg taagaagcca gatgtgaact ggggctcctc 1981 cacacaggct ctttcctacg tgagtgcttt agacaatgct ctggaattaa ccacagtgga 2041 agaccacgta aaaaacttca ggccccagtg cattgtctta acagggggac ccatgacaag 2101 acctgctctc ctggacataa ctcacgcctt taccaagaac agtggccttt gcatctgctg 2161 tgaagtcttt gtgggaccgc gcaaactgtg tgttaaggag atgaacagtg gcatggcgaa 2221 aaaacaggcc tggcttataa agaacaaaat caaggctttt tatgctgcag tggcggcaga 2281 ctgtttcagg gatggtgtcc gaagtcttct tcaggcctca ggcttaggaa gaatgaaacc 2341 aaacactctg gtgattggat ataagaaaaa ctggaggaaa gctcccttga cagagattga 2401 gaactacgtg ggaatcatac atgatgcatt tgattttgag attggcgtgg ttatagtcag 2461 aatcagccaa ggatttgaca tctctcaggt tcttcaggtg caagaggaat tagagagatt 2521 agaacaggag agactagcat tggaagcgac tatcaaagat aatgagtgtg aagaggaaag 2581 tggaggcatc cgaggcttgt ttaaaaaagc tggcaagttg aacattacta agacaacgcc 2641 taaaaaagat ggcagcatta acacaagcca gtcgatgcat gtgggagagt tcaaccagaa 2701 actggtggaa gccagcactc aatttaaaaa gaaacaagaa aaaggcacaa ttgatgtttg 2761 gtggttgttt gatgatggag ggttaacact tcttatcccc tatatcttaa ctctcagaaa 2821 aaaatggaaa gactgtaaat taagaatcta tgtgggaggg aagatcaacc gcattgaaga 2881 agaaaaaatt gcaatggctt cccttctgag caaatttagg ataaaatttg cagacatcca 2941 tatcatcggt gacatcaaca ttaggccaaa caaagagagc tggaaagtct ttgaagagat 3001 gattgaacca tatcgtctcc atgaaagctg caaagattta acaactgctg agaaattaaa 3061 aagagaaact ccgtggaaaa ttacagatgc agaactggaa gcagtcaagg aaaagagtta 3121 ccgccaagtt cgactgaatg aactcttaca ggagcactcc agagctgcta atctcattgt 3181 cctgagcctt cccgtggcaa gaaagggatc catatcggat ttgttgtata tggcttggtt 3241 ggaaatcctc acaaagaacc tcccacctgt cttactagtt agaggaaatc acaaaaatgt 3301 cttgacattt tactcttaaa acatgaaaga ttggaataca ttttaactta atgtaatgca 3361 ta // LOCUS HSU58334 4534 bp mRNA PRI 02-JUL-1996 DEFINITION Human Bcl2, p53 binding protein Bbp/53BP2 (BBP/53BP2) mRNA, complete cds. ACCESSION U58334 NID g1399804 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4534) AUTHORS Naumovski,L. and Cleary,M.L. TITLE The p53-binding protein 53BP2 also interacts with Bc12 and impedes cell cycle progression at G2/M JOURNAL Mol. Cell. Biol. 16 (7), 3884-3892 (1996) MEDLINE 96251339 REFERENCE 2 (bases 1 to 4534) AUTHORS Naumovski,L. and Cleary,M.L. TITLE Direct Submission JOURNAL Submitted (14-MAY-1996) Pediatrics, Stanford Medical Center, 300 Pasteur Drive, Stanford, CA 94305, USA FEATURES Location/Qualifiers source 1..4534 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HAL-01; B-cell progenitor cell line" gene 757..3774 /gene="BBP/53BP2" CDS 757..3774 /gene="BBP/53BP2" /note="Bcl2, p53 binding protein" /codon_start=1 /product="Bbp/53BP2" /db_xref="PID:g1399805" /translation="MDLTLAELQEMASRQQQQIEAQQQLLATKEQRLKFLKQQDQRQQ QQVAEQEKLKRLKEIAENQEAKLKKVRALKGHVEQKRLSNGKLVEEIEQMNNLFQQKQ RELVLAVSKVEELTRQLEMLKNGRIDSHHDNQSAVAELDRLYKELQLRNKLNQEQNAK LQQQRECLNKRNSEVAVMDKRVNELRDRLWKKKAALQQKENLPVSSDGNLPQQAASAP SRVAAVGPYIQSSTMPRMPSRPELLVKPALPDGSLVIQASEGPMKIQTLPNMRSGAAS QTKGSKIHPVGPDWSPSNADLFPSQGSASVPQSTGNALDQVDDGEVPLREKEKKVRPF SMFDAVDQSNAPPSFGTLRKNQSSEDILRDAQVANKNVAKVPPPVPTKPKQINLPYFG QTNQPPSDIKPDGSSQQLSTVVPSMGTKPKPAGQQPRVLLSPSIPSVGQDQTLSPGSK QESPPAAAVRPFTPQPSKDTLLPPFRKPQTVAASSIYSMYTQQQAPGKNFQQAVQSAL TKTHTRGPHFSSVYGKPVIAAAQNQQQHPENIYSNSQGKPGSPEPETEPVSSVQENHE NERIPRPLSPTKLLPFLSNPYRNQSDADLEALRKKLSNAPRPLKKRSSITEPEGPNGP NIQKLLYQRTTIAAMETISVPSYPSKSASVTASSESPVEIQNPYLHVEPEKEVVSLVP ESLSPEDVGNASTENSDMPAPSPGLDYEPEGVPDNSPNLQNNPEEPNPEAPHVLDVYL EEYPPYPPPPYPSGEPEGPGEDSVSMRPPEITGQVSLPPGKRTNLRKTGSERIAHGMR VKFNPLALLLDSSLEGEFDLVQRIIYEVDDPSLPNDEGITALHNAVCAGHTEIVKFLV QFGVNVNAADSDGWTPLHCAASCNNVQVCKFLVESGAAVFAMTYSDMQTAADKCEEME EGYTQCSQFLYGVQEKMGIMNKGVIYALWDYEPQNDDELPMKEGDCMTIIHREDEDEI EWWWARLNDKEGYVPRNLLGLYPRIKPRQRSLA" BASE COUNT 1298 a 1073 c 1107 g 1056 t ORIGIN 1 gtcacgagcg tcgaagagac aaagccgcgt cagggggccc ggccggggcg ggggagcccg 61 gggcttgttg gtgccccagc ccgcgcggag ggcccttcgg acccgcgcgc cgccgctgcc 121 gccgccgccg cctcgcaaca ggtccgggcg gcctcgctct ccgctcccct cccccgcatc 181 cgcgaccctc cggggcacct cagctcggcc ggggccgcag tctggccacc cgcttccatg 241 cggttcgggt ccaagatgat gccgatgttt cttaccgtgt atctcagtaa caatgagcag 301 cacttcacag aagttccagt tactccagaa acaatatgca gagacgtggt ggatctgtgc 361 aaagaacccg gcgagagtga ttgccatttg gctgaagtgt ggtgtggctc tgtagagata 421 gagtttcatc atgttggcca ggatggtctc gatctcctga ccttgtgatc cgcctgcctc 481 ggcctcccaa agtgctggat tacaggtgtg agccaccacg atcagcctct agtgtttaaa 541 aaagaacgtc cagttgcgga taatgagcga atgtttgatg ttcttcaacg atttggaagt 601 cagaggaacg aagttcgctt cttccttcgt catgaacgcc cccctggcag ggacattgtg 661 agtggaccaa gatctcagga tccaagttta aaaagaaatg gtgtaaaagt tcctggtgaa 721 tatcgaagaa aggagaacgg tgttaatagt cctaggatgg atctgactct tgctgaactt 781 caggaaatgg catctcgcca gcagcaacag attgaagccc agcaacaatt gctggcaact 841 aaggaacagc gcttaaagtt tttgaaacaa caagatcagc gacaacagca acaagttgct 901 gagcaggaga aacttaaaag gctaaaagaa atagctgaga atcaggaagc taagctaaaa 961 aaagtgagag cacttaaagg ccacgtggaa cagaagagac taagcaatgg gaaacttgtg 1021 gaggaaattg aacagatgaa taatttgttc cagcaaaaac agagggagct cgtcctggct 1081 gtgtcaaaag tagaagaact gaccaggcag ctagagatgc tcaagaacgg caggatcgac 1141 agccaccatg acaatcagtc tgcagtggct gagcttgatc gcctctataa ggagctgcag 1201 ctaagaaaca aattgaatca agagcagaat gccaagctac aacaacagag ggagtgtttg 1261 aataagcgta attcagaagt ggcagtcatg gataagcgtg ttaatgagct gagggaccgg 1321 ctgtggaaga agaaggcagc tctacagcaa aaagaaaatc taccagtttc atctgatgga 1381 aatcttcccc agcaagccgc gtcagcccca agccgtgtgg ctgcagtagg tccctatatc 1441 cagtcatcta ctatgcctcg gatgccctca aggcctgaat tgctggtgaa gccagccctg 1501 ccggatggtt ccttggtcat tcaggcttca gaggggccga tgaaaataca gacactgccc 1561 aacatgagat ctggggctgc ttcacaaact aaaggctcta aaatccatcc agttggccct 1621 gattggagtc cttcaaatgc agatcttttc ccaagccaag gctctgcttc tgtacctcaa 1681 agcactggga atgctctgga tcaagttgat gatggagagg ttccgctgag ggagaaagag 1741 aagaaagtgc gtccgttctc aatgtttgat gcagtagacc agtccaatgc cccaccttcc 1801 tttggtactc tgaggaagaa ccagagcagt gaagatatct tgcgggatgc tcaggttgca 1861 aataaaaatg tggctaaagt accacctcct gttcctacaa aaccaaaaca gattaatttg 1921 ccttattttg gacaaactaa tcagccacct tcagacatta agccagacgg aagttctcag 1981 cagttgtcaa cagttgttcc gtccatggga actaaaccaa aaccagcagg gcagcagccg 2041 agagtgctgc tatctcccag cataccttcg gttggccaag accagaccct ttctccaggt 2101 tctaagcaag aaagtccacc tgctgctgcc gtccggccct ttactcccca gccttccaaa 2161 gacaccttac ttccaccctt cagaaaaccc cagaccgtgg cagcaagttc aatatattcc 2221 atgtatacgc aacagcaggc gccaggaaaa aacttccagc aggctgtgca gagcgcgttg 2281 accaagactc ataccagagg gccacacttt tcaagtgtat atggtaagcc tgtaattgct 2341 gctgcccaga atcaacagca gcacccagag aacatttatt ccaatagcca gggcaagcct 2401 ggcagtccag aacctgaaac agagcctgtt tcttcagttc aggagaacca tgaaaacgaa 2461 agaattcctc ggccactcag cccaactaaa ttactgcctt tcttatctaa tccttaccga 2521 aaccagagtg atgctgacct agaagcctta cgaaagaaac tgtctaacgc accaaggcct 2581 ctaaagaaac gtagttctat tacagagcca gagggtccta atgggccaaa tattcagaag 2641 cttttatatc agaggaccac catagcggcc atggagacca tctctgtccc atcataccca 2701 tccaagtcag cttctgtgac tgccagctca gaaagcccag tagaaatcca gaatccatat 2761 ttacatgtgg agcccgaaaa ggaggtggtc tctctggttc ctgaatcatt gtccccagag 2821 gatgtgggga atgccagtac agagaacagt gacatgccag ctccttctcc aggccttgat 2881 tatgagcctg agggagtccc agacaacagc ccaaatctcc agaataaccc agaagaacca 2941 aatccagagg ctccacatgt gcttgatgtg tacctggagg agtaccctcc atacccaccc 3001 ccaccatacc catctgggga gcctgaaggg cccggagaag actcggtgag catgcgcccg 3061 cctgaaatca ccgggcaggt ctctctgcct cctggtaaaa ggacaaactt gcgtaaaact 3121 ggctcagagc gtatcgctca tggaatgagg gtgaaattca acccccttgc tttactgcta 3181 gattcgtctt tggagggaga atttgacctt gtacagagaa ttatttatga ggttgatgac 3241 ccaagcctgc ccaatgatga aggcatcacg gctcttcaca atgctgtgtg tgcaggccac 3301 acagaaatcg ttaagttcct ggtacagttt ggtgtaaatg taaatgctgc tgatagtgat 3361 ggatggactc cattacattg tgctgcctca tgtaacaacg tccaagtgtg taagtttttg 3421 gtggagtcag gagccgctgt gtttgccatg acctacagtg acatgcagac tgctgcagat 3481 aagtgcgagg aaatggagga aggctacact cagtgctccc aatttcttta tggagttcag 3541 gagaagatgg gcataatgaa taaaggagtc atttatgcgc tttgggatta tgaacctcag 3601 aatgatgatg agctgcccat gaaagaagga gactgcatga caatcatcca cagggaagac 3661 gaagatgaaa tcgaatggtg gtgggcgcgc cttaatgata aggagggata tgttccacgt 3721 aacttgctgg gactgtaccc aagaattaaa ccaagacaaa ggagcttggc ctgaaacttc 3781 cacacagaat tttagtcaat gaagaattaa tctctgttaa gaagaagtaa tacgattatt 3841 tttggcaaaa atttcacaag acttatttta atgacaatgt agcttgaaag cgatgaagaa 3901 tgtctctaga agagaatgaa ggattgaaga attcaccatt agaggacatt tagcgtgatg 3961 aaataaagca tctacgtcag caggccatac tgtgttgggg caaaggtgtc ccgtgtagca 4021 ctcagataag tatacagcga caatcctgtt ttctacaaga atcctgtcta gtaaatagga 4081 tcatttattg ggcagttggg aaatcagctc tctgtcctgt tgagtgtttt cagcagctgc 4141 tcctaaacca gtcctcctgc cagaaaggac cagtgccgtc acatcgctgt ctctgattgt 4201 ccccggcacc agcaggcctt ggggctcact gaaggctcga aggcactgca caccttgtat 4261 attgtcagtg aagaacgtta gttggttgtc agtgaacaat aactttatta tatgagtttt 4321 tgtagcatct taagaattat acatatgttt gaaatattga aactaagcta cagtaccagt 4381 aattagatgt agaatcttgt ttgtaggctg aattttaatc tgtatttatt gtcttttgta 4441 tctcagaaat tagaaacttg ctacagactt acccgtaata tttgtcaaga tcatagctga 4501 ctttaaaaac agttgtaata aactttttga tgct // LOCUS HSU58516 1934 bp mRNA PRI 19-JUN-1996 DEFINITION Human breast epithelial antigen BA46 mRNA, complete cds. ACCESSION U58516 NID g1381161 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1934) AUTHORS Couto,J.R., Taylor,M.R., Godwin,S.G., Ceriani,R.L. and Peterson,J.A. TITLE Cloning and sequence analysis of human breast epithelial antigen BA46 reveals an RGD cell adhesion sequence presented on an epidermal growth factor-like domain JOURNAL DNA Cell Biol. 15 (4), 281-286 (1996) MEDLINE 96213908 REFERENCE 2 (bases 1 to 1934) AUTHORS Couto,J.R., Taylor,M.R., Godwin,S.G., Ceriani,R.L. and Peterson,J.A. TITLE Direct Submission JOURNAL Submitted (16-MAY-1996) J.R. Couto, Cancer Research Fund of Contra Costa, 2055 North Broadway, Walnut Creek, CA 94596, USA FEATURES Location/Qualifiers source 1..1934 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 61..1224 /note="breast epithelial antigen" /codon_start=1 /product="BA46" /db_xref="PID:g1381162" /translation="MPRPRLLAALCGALLCAPSLLVALDICSKNPCHNGGLCEEISQE VRGDVFPSYTCTCLKGYAGNHCETKCVEPLGMENGNIANSQIAASSVRVTFLGLQHWV PELARLNRAGMVNAWTPSSNDDNPWIQVNLLRRMWVTGVVTQGASRLASHEYLKAFKV AYSLNGHEFDFIHDVNKKHKEFVGNWNKNAVHVNLFETPVEAQYVRLYPTSCHTACTL RFELLGCELNGCANPLGLKNNSIPDKQITASSSYKTWGLHLFSWNPSYARLDKQGNFN AWVAGSYGNDQWLQVDLGSSKEVTGIITQGARNFGSVQFVASYKVAYSNDSANWTEYQ DPRTGSSKIFPGNWDNHSHKKNLFETPILARYVRILPVAWHNRIALRLELLGC" sig_peptide 61..129 /note="putative signal peptide:" misc_feature 130..264 /note="encodes EGF-like domain" misc_feature 174..182 /note="encodes RGD cell adhesion sequence" misc_feature 265..744 /note="encodes region similar to C1 domain of blood coagulation factors V and VIII" misc_feature 745..1221 /note="encodes region similar to C2 domain of blood coagulation factors V and VIII" BASE COUNT 379 a 613 c 547 g 395 t ORIGIN 1 agaaccccgc ggggtctgag cagcccagcg tgcccattcc agcgcccgcg tccccgcagc 61 atgccgcgcc cccgcctgct ggccgcgctg tgcggcgcgc tgctctgcgc ccccagcctc 121 ctcgtcgccc tggatatctg ttccaaaaac ccctgccaca acggtggttt atgcgaggag 181 atttcccaag aagtgcgagg agatgtcttc ccctcgtaca cctgcacgtg ccttaagggc 241 tacgcgggca accactgtga gacgaaatgt gtcgagccac tgggcatgga gaatgggaac 301 attgccaact cacagatcgc cgcctcatct gtgcgtgtga ccttcttggg tttgcagcat 361 tgggtcccgg agctggcccg cctgaaccgc gcaggcatgg tcaatgcctg gacacccagc 421 agcaatgacg ataacccctg gatccaggtg aacctgctgc ggaggatgtg ggtaacaggt 481 gtggtgacgc agggtgccag ccgcttggcc agtcatgagt acctgaaggc cttcaaggtg 541 gcctacagcc ttaatggaca cgaattcgat ttcatccatg atgttaataa aaaacacaag 601 gagtttgtgg gtaactggaa caaaaacgcg gtgcatgtca acctgtttga gacccctgtg 661 gaggctcagt acgtgagatt gtaccccacg agctgccaca cggcctgcac tctgcgcttt 721 gagctactgg gctgtgagct gaacggatgc gccaatcccc tgggcctgaa gaataacagc 781 atccctgaca agcagatcac ggcctccagc agctacaaga cctggggctt gcatctcttc 841 agctggaacc cctcctatgc acggctggac aagcagggca acttcaacgc ctgggttgcg 901 gggagctacg gtaacgatca gtggctgcag gtggacctgg gctcctcgaa ggaggtgaca 961 ggcatcatca cccagggggc ccgtaacttt ggctctgtcc agtttgtggc atcctacaag 1021 gttgcctaca gtaatgacag tgcgaactgg actgagtacc aggaccccag gactggcagc 1081 agtaagatct tccctggcaa ctgggacaac cactcccaca agaagaactt gtttgagacg 1141 cccatcctgg ctcgctatgt gcgcatcctg cctgtagcct ggcacaaccg catcgccctg 1201 cgcctggagc tgctgggctg ttagtggcca cctgccaccc ccaggtcttc ctgctttcca 1261 tgggcccgct gcctcttggc ttctcagccc ctttaaatca ccatagggct ggggactggg 1321 gaaggggagg gtgttcagag gcagcaccac cacacagtca cccctccctc cctctttccc 1381 accctccacc tctcacgggc cctgccccag cccctaagcc ccgtccccta acccccagtc 1441 ctcactgtcc tgttttctta ggcactgagg gatctgagta ggtctgggat ggacaggaaa 1501 gggcaaagta gggcgtgtgg tttccctgcc cctgtccgga ccgccgatcc caggtgcgtg 1561 tgtctctgtc tctcctagcc cctctctcac acatcacatt cccatggtgg cctcaagaaa 1621 ggcccggaag ccccaggctg gagataacag cctcttgccc gtcggccctg cgtcggccct 1681 ggggtaccat gtgccacaac tgctgtggcc ccctgtcccc aagacacttc cccttgtctc 1741 cctggttgcc tctcttgccc cttgtcctga agcccagcga cacagaaggg ggtggggcgg 1801 gtctatgggg agaaagggag cgaggtcaga ggagccggca tgggttggca gggtgggcgt 1861 ttggggccct catgctggct tttcacccca gaggacacag gcagcttcca aaatatattt 1921 atcttcttca cggg // LOCUS HSU58522 2149 bp mRNA PRI 11-AUG-1996 DEFINITION Human huntingtin interacting protein (HIP2) mRNA, complete cds. ACCESSION U58522 NID g1381163 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2149) AUTHORS Kalchman,M.A., Graham,R.K., Xia,G., Koide,H.B., Hodgson,J.G., Graham,K.C., Goldberg,Y.P., Gietz,R.D., Pickart,C.M. and Hayden,M.R. TITLE Huntingtin is ubiquitinated and interacts with a specific ubiquitin-conjugating enzyme JOURNAL J. Biol. Chem. 271 (32), 19385-19394 (1996) MEDLINE 96325051 REFERENCE 2 (bases 1 to 2149) AUTHORS Kalchman,M.A., Goldberg,Y.P., Geitz,R.D. and Hayden,M.R. TITLE Direct Submission JOURNAL Submitted (16-MAY-1996) M.A. Kalchman, Medical Genetics, University of British Columbia, #416-2125 East Mall - NCE Bldg., Vancouver, BC V6T 1Z4, Canada FEATURES Location/Qualifiers source 1..2149 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="4" /map="4p14" gene 4..606 /gene="HIP2" CDS 4..606 /gene="HIP2" /note="E2-25K ubiquitin conjugating enzyme" /codon_start=1 /product="huntingtin interacting protein" /db_xref="PID:g1381164" /translation="MANIAVQRIKREFKEVLKSEETSKNQIKVDLVDENFTELRGEIA GPPDTPYEGGRYQLEIKIPETYPFNPPKVRFITKIWHPNISSVTGAICLDILKDQWAA AMTLRTVLLSLQALLAAAEPDDPQDAVVANQYKQNPEMFKQTARLWAHVYAGAPVSSP EYTKKIENLCAMGFDRNAVIVALSSKSWDVETATELLLSN" BASE COUNT 667 a 374 c 416 g 691 t 1 others ORIGIN 1 gacatggcca acatcgcggt gcagcgaatc aagcgggagt tcaaggaggt gctgaagagc 61 gaggagacga gcaaaaatca aattaaagta gatcttgtag atgagaattt tacagaatta 121 agaggagaaa tagcaggacc tccagacaca ccatatgaag gaggaagata ccaactagag 181 ataaaaatac cagaaacata cccatttaat ccccctaagg tccggtttat cactaaaata 241 tggcatccta atattagttc cgtcacaggg gctatttgtt tggatatcct gaaagatcaa 301 tgggcagctg caatgactct ccgcacggta ttattgtcat tgcaagcact attggcagct 361 gcagagccag atgatccaca ggatgctgta gtagcaaatc agtacaaaca aaatcccgaa 421 atgttcaaac agacagctcg actttgggca catgtgtatg ctggagcacc agtttctagt 481 ccagaataca ccaaaaaaat agaaaaccta tgtgctatgg gctttgatag gaatgcagta 541 atagtggcct tgtcttcaaa atcatgggat gtagagactg caacagaatt gcttctgagt 601 aactgaggca tagagagctg ctgatatagt caagcttgcc tcttcttgag gagcaccaac 661 atctgttatt tttaggattc tgcatagatt tcttttaatc tggcattctc gcctaatgat 721 gttatctagg caccattgga gactgaaaaa aaaaaatccc tgctctgtaa ataaagctaa 781 ttaaacgtct gtgtaaattt aaaaagggga aatactttaa ttttttttct taatagtgta 841 aaaattccct gagctaagct aaaaccatgg aagaaacatg ctactttagt gtttagcagt 901 gtaccaagac tagcaagagt ttgcttcagg atttggttga ataattaaga taatatttgg 961 agtgtgtcag ggccattcaa attgttggtg ttgcatcaca gctaccttaa ctgtttttaa 1021 catggatcct ctgtgcctgt gaatttactt gcatgcttgt acttgacttc ttaggatggg 1081 tagctgaaaa gaccaccatt ttaagcattt gagaattctt aaatatgaaa tttattcaga 1141 attgaagatg gtgacctatt cagagccttt ttgtccttgt caacagactg ggacagtgtc 1201 tgattccccc ttcacccccc cccacccccg ccttggcaca cacagctaat attctaatgg 1261 taaatttctc tgtatcaggt ggggaaatgt gctgaaggac agtatgtatc ccttgcttca 1321 tttttaggtc gtaggtttgg aatgtcttgt cccagttctt caaacactct taaatttttc 1381 ttaagtaatg taaaaatgga actgccaatt ttatttctct tgcaaaaata gtaaatactt 1441 gatgttacat tattcccagg tttaatgaaa gaacccaact tagtttttca gtgaatttga 1501 cacctatttt ttagtgatga aatttttctt tgagaactgg caaggatgca gtcagctgtt 1561 tgcagttttt agcctgattt tggggtctat agagattgct ttattggata cttcaagtca 1621 ttcttgcttg cacttcccct attgacacat gaaagctgtg ttggtgtttt attgtacata 1681 cttcagatgc acataggaat agaagtgtgt tataaatcta gctttcttta tgatgtttct 1741 gataatacga gaattgaaaa ctttaccttc tcttgtacat agtcagacta tttgtattaa 1801 atttacattt cattctaagt tccaaaagtt tgaaaattat tagttttgca agatcacaca 1861 ctaatgtaac cattttatga aggttgaagt ggatttatgc aggcagttct atatatagaa 1921 atncaattct ttttaaattt ttaggaccaa tacaaaataa cacaaatgta atggaatcag 1981 actgaattaa agtaaggctg tatattgaaa gtcatattat aaaaggtttg ctttctttaa 2041 gtgttattta tcttaaatta taatcgttaa atgtttggaa gataattttt gaatcataac 2101 gtcagcataa cttcatttga cttctcaata atcttgtcga cgcggccgc // LOCUS HSU58681 1535 bp DNA PRI 04-DEC-1996 DEFINITION Human neurogenic basic-helix-loop-helix protein (NeuroD2) gene, complete cds. ACCESSION U58681 NID g1477748 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1535) AUTHORS McCormick,M.B., Tamimi,R.M., Snider,L., Asakura,A., Bergstrom,D. and Tapscott,S.J. TITLE NeuroD2 and neuroD3: distinct expression patterns and transcriptional activation potentials within the neuroD gene family JOURNAL Mol. Cell. Biol. 16 (10), 5792-5800 (1996) MEDLINE 96413331 REFERENCE 2 (bases 1 to 1535) AUTHORS Tapscott,S.J., Tamimi,R.T. and McCormick,B.M. TITLE Direct Submission JOURNAL Submitted (17-MAY-1996) Clinical Research, Fred Hutchinson Cancer Research Center, 1124 Columbia Street, Seattle, WA 19104, USA FEATURES Location/Qualifiers source 1..1535 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17" /map="17q12" gene 55..1200 /gene="NeuroD2" CDS 55..1200 /gene="NeuroD2" /note="neurogenic basic-helix-loop-helix (bHLH) protein" /codon_start=1 /db_xref="PID:g1477749" /translation="MLTRLFSEPGLLSDVPKFASWGDGEDDEPRSDKGDAPPPPPPAP GPGAPGPARAAKPVPLRGEEGTEATLAEVKEEGELGGEEEEEEEEEEGLDEAEGERPK KRGPKKRKMTKARLERSKLRRQKANARERNRMHDLNAALDNLRKVVPCYSKTQKLSKI ETLRLAKNYIWALSEILRSGKRPDLVSYVQTLCKGLSQPTTNLVAGCLQLNSRNFLTE QGADGAGRFHGSGGPFAMHPYPYPCSRLAGAQCQAAGGLGGGAAHALRTHGYCAAYET LYAAAGGGGASPDYNSSEYEGPLSPPLCLNGNFSLKQDSSPDHEKSYHYSMHYSALPG SRHGHGLVFGSSAVRGGVHSENLLSYDMHLHHDRGPMYEELNAFFHN" BASE COUNT 250 a 559 c 476 g 244 t 6 others ORIGIN 1 cccctcactt tgtgctgtct gtctcccctt cccgcccggg gnccctcagg caccatgctg 61 acccgcctgt tcagcgagcc cggccttctc tcggacgtgc ccaagttcgc cagctggggc 121 gacggcgaag acgacgagcc gaggagcgac aagggcgacg cgccgccacc gccaccgcct 181 gcgcccgggc caggggctcc ggggccagcc cgggcggcca agccagtccc tctccgtgga 241 gaagagggga cggaggccac gttggccgag gtcaaggagg aaggcgagct ggggggagag 301 gaggaggagg aagaggagga ggaagaagga ctggacgagg cggagggcga gcggcccaag 361 aagcgcgggc ccaagaagcg caagatgacc aaggcgcgct tggagcgctc caagcttcgg 421 cggcagaagg cgaacgcgcg ggagcgcaac cgcatgcacg acctgaacgc agccctggac 481 aacctgcgca aggtggtgcc ctgctactcc aagacgcaga agctgtccaa gatcgagacg 541 ctgcgcctag ccaagaacta tatctgggcg ctctcggaga tcctgcgctc cggcaagcgg 601 ccagacctag tgtcctacgt gcagactctg tgcaagggtc tgtcgcagcc caccaccaat 661 ctggtggccg gctgtctgca gctcaactct cgcaacttcc tcacggagca aggcgccgac 721 ggtgccggcc gcttccacgg ctcgggcggc ccgttcgcca tgcaccccta cccgtacccg 781 tgctcgcgcc tggcgggcgc acagtgccag gcggccggcg gcctgggcgg cggcgcggcg 841 cacgccctgc ggacccacgg ctactgcgcc gcctacgaga cgctgtatgc ggcggcaggc 901 ggtggcggcg cgagcccgga ctacaacagc tccgagtacg agggcccgct cagccccccg 961 ctctgtctca atggcaactt ctcactcaag caggactcct cgcccgacca cgagaaaagc 1021 taccactact ctatgcacta ctcggcgctg cccggttcgc gccacggcca cgggctagtc 1081 ttcggctcgt cggctgtgcg cgggggcgtc cactcggaga atctcttgtc ttacgatatg 1141 caccttcacc acgaccgggg ccccatgtac gaggagctca atgcgttttt tcataactga 1201 gacttcgcgc cgnctccctn ctttttcttt tgcctttgcc cgcccccctg tccccagccc 1261 ccagagcgca gggacacccc catnctaccc cggcnccggc ggagcgggcc accggtctgc 1321 cgctctcctg gggcagcgca gtctgttacn tgtgggtggc tgtcccaggg gcctcgcttc 1381 ccccagggac tcgccttctc tctccaaggg gttccctcct cctctctccc aaggagtgct 1441 tctccaggga cctctctccg ggggctccct ggaggcaccc ctcccccatt cccaatatct 1501 tcgctgaggt ttcctcctcc ccctcctccc tgcag // LOCUS HSU58682 361 bp mRNA PRI 02-SEP-1996 DEFINITION Human ribosomal protein S28 mRNA, complete cds. ACCESSION U58682 NID g1518636 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Chan,Y.L., Olvera,J. and Wool,I.G. TITLE The primary structure of rat ribosomal protein S28 JOURNAL Biochem. Biophys. Res. Commun. 179 (1), 314-318 (1991) MEDLINE 91354268 REFERENCE 2 (bases 1 to 361) AUTHORS Kim,J.M. and Bae,Y.S. TITLE Direct Submission JOURNAL Submitted (19-MAY-1996) J. M. Kim, Biochemistry, Kyungpook National University, 1370 Sankyuk-Dong, Taegu 702-701, South Korea FEATURES Location/Qualifiers source 1..361 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 13..222 /note="40S ribosome" /codon_start=1 /product="ribosomal protein S28" /db_xref="PID:g1518637" /translation="MDTSRVQPIKLARVTKVLGRTGSQGQCTQVRVEFMDDTSRSIIR NVKGPVREGDVLTLLESEREARRLR" BASE COUNT 70 a 102 c 106 g 83 t ORIGIN 1 cgcgccgcca tcatggacac cagccgtgtg cagcctatca agctggccag ggtcaccaag 61 gtcctgggca ggaccggttc tcagggacag tgcacgcagg tgcgcgtgga attcatggac 121 gacacgagcc gatccatcat ccgcaatgta aaaggccccg tgcgcgaggg cgacgtgctc 181 acccttttgg agtcagagcg agaagcccgg aggttgcgct gagcttggct gctcgctccc 241 tcttggatgt cgggttcgac cacttggccg atgggaatgg tctgtcacaa tctgctcctt 301 ttttttgtcc gccacacgta actgagatgc tcctttaaat aaagcgtttg tgtttcaagt 361 t // LOCUS HSU58766 1340 bp mRNA PRI 01-NOV-1996 DEFINITION Human FX protein mRNA, complete cds. ACCESSION U58766 NID g1381178 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1340) AUTHORS Tonetti,M., Sturla,L., Bisso,A., Benatti,U. and De Flora,A. TITLE Synthesis of GDP-L-fucose by the human FX protein JOURNAL J. Biol. Chem. 271 (44), 27274-27279 (1996) MEDLINE 97066899 REFERENCE 2 (bases 1 to 1340) AUTHORS Tonetti,M., Sturla,L., Benatti,U. and De Flora,A. TITLE Direct Submission JOURNAL Submitted (20-MAY-1996) M. Tonetti, Institute of Biochemistry, University of Genova, Viale Benedetto XV,1, Genova 16132, Italy FEATURES Location/Qualifiers source 1..1340 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" CDS 75..1040 /note="GDP-4-keto-6-deoxy-D-mannose epimerase-reductase" /codon_start=1 /product="FX" /db_xref="PID:g1381179" /translation="MGEPQGSMRILVTGGSGLVGKAIQKVVADGAGLPGEDWVFVSSK DADLTDTAQTRALFEKVQPTHVIHLAAMVGGLFRNIKYNLDFWRKNVHMNDNVLHSAF EVGARKVVSCLSTCIFPDKTTYPIDETMIHNGPPHNSNFGYSYAKRMIDVQNRAYFQQ YGCTFTAVIPTNVFGPHDNFNIEDGHVLPGLIHKVHLAKSSGSALTVWGTGNPRRQFI YSLDLAQLFIWVLREYNEVEPIILSVGEEDEVSIKEAAEAVVEAMDFHGEVTFDTTKS DGQFKKTASNSKLRTYLPDFRFTPFKQAVKETCAWFTDNYEQARK" BASE COUNT 311 a 400 c 376 g 253 t ORIGIN 1 ctagaattca gcggccgctg aattctagct agaattcagc ggccgctgaa ttctagaacc 61 caggtgcaac tgacatgggt gaaccccagg gatccatgcg gattctagtg acagggggct 121 ctgggctggt aggcaaagcc atccagaagg tggtagcaga tggagctgga cttcctggag 181 aggactgggt gtttgtctcc tctaaagacg ccgatctcac ggatacagca cagacccgcg 241 ccctgtttga gaaggtccaa cccacacacg tcatccatct tgctgcaatg gtggggggcc 301 tgttccggaa tatcaaatac aatttggact tctggaggaa aaacgtgcac atgaacgaca 361 acgtcctgca ctcggccttt gaggtggggg cccgcaaggt ggtgtcctgc ctgtccacct 421 gtatcttccc tgacaagacg acctacccga tagatgagac catgatccac aatgggcctc 481 cccacaacag caattttggg tactcgtatg ccaagaggat gatcgacgtg cagaacaggg 541 cctacttcca gcagtacggc tgcaccttca ccgctgtcat ccccaccaac gttttcgggc 601 cccacgacaa cttcaacatc gaggatggcc acgtgctgcc tggcctcatc cacaaggtgc 661 acctggccaa gagcagcggc tcggccctga cggtgtgggg tacagggaat ccgcggaggc 721 agttcatata ctcgctggac ctggcccagc tctttatctg ggtcctgcgg gagtacaatg 781 aagtggagcc catcatcctc tccgtgggcg aggaagatga ggtctccatc aaggaggcag 841 ccgaggcggt ggtggaggcc atggacttcc atggggaagt cacctttgat acaaccaagt 901 cggatgggca gtttaagaag acagccagta acagcaagct gaggacctac ctgcccgact 961 tccggttcac acccttcaag caggcggtga aggagacctg tgcttggttc actgacaact 1021 acgagcaggc ccggaagtga agctggaaga caggatcagg tgccagcgga ccatcggctg 1081 gcagagccca gcggccacca cccgtcaacc ctgccaggag ctgagggcac cacccagcaa 1141 cctgggcctg cattccatcc gctctgcagc cccaagcatc tttccagtgg ggcccccatt 1201 cacgttggtc ctcagggaaa ccagggtccg gggcaggccc ggcgctttgc tccccacacc 1261 agccccctgc gcgtgtccac tctgatcctg catcccactc cctgggagcc aataaagtgc 1321 attttcacag aaaaaaaaaa // LOCUS HSU58856 1620 bp mRNA PRI 24-JUN-1997 DEFINITION Human chromosome 17 unknown product mRNA, complete cds. ACCESSION U58856 NID g2209262 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1620) AUTHORS Tajima,Y., Tashiro,K. and Camerini,D. TITLE Cloning of human chromosome 17-specific cDNAs using representational difference analysis and human-mouse hybrid cells JOURNAL Genomics 42 (2), 353-355 (1997) MEDLINE 97336065 REFERENCE 2 (bases 1 to 1620) AUTHORS Tajima,Y. and Camerini,D. TITLE Direct Submission JOURNAL Submitted (20-MAY-1996) Y. Tajima, Microbiology, University of Virginia, Health Science Center, Charlottesville, VA 22906, USA FEATURES Location/Qualifiers source 1..1620 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17" CDS 121..1326 /codon_start=1 /product="unknown" /db_xref="PID:g2209263" /translation="MAAVPRDWQTPRGFPSGSTAILSTWSCCWATRRRDSAARERVGP SCLSWMRWRMCLSGSTCRAMRPESGRLAGHELQPQRRHSGLAGQHSCELLQLGAPGLG PQHAEPQQLLLDSDNSGLWRPGACTNITMGVVCKLPRAEQTLLPISASRKPAALVVVL MAVLLLLALLTAALILYRRRQNIERGAFEGARYSRSSSSPTEATEKNILVSDMEMNEQ QEYNHARGQGQGGKIWGTGALGQSGPPPAACPFGLWKGALGSPCSQPELGIPWAGGVP PSHKGWAETQLSAPWRFPFWGGLRSCHLVLCPHRNHMLDGKGNETSFSPEPPAQACFI RAPGPLLCRARGASPVPSGRSVVSLFPPGSLSSVPLTLLPLAPSLPPLPSEPGPGDWG ALLFLMRVS" BASE COUNT 282 a 506 c 518 g 314 t ORIGIN 1 ctctaaagac ctacctagat gtggacgggg cctggcgcac caccagctgt gacaccaagc 61 tgcagggggc tgtgtgtggg gttagcagtg ggccccctcc tccccgaaga ataagctacc 121 atggcagctg tccccaggga ctggcagact ccgcgtggat tcccttccgg gagcactgct 181 attctttcca catggagctg ctgctgggcc acaaggaggc gcgacagcgc tgccagagag 241 cgggtggggc cgtcctgtct atcctggatg agatggagaa tgtgtttgtc tgggagcacc 301 tgcagagcta tgaggccaga gtcggggcgc ctggctgggc atgaacttca accccaaagg 361 aggcactctg gtctggcagg acaacacagc tgtgaactac tccaactggg ggcccccggg 421 cttgggcccc agcatgctga gccacaacag ctgctactgg attcagacaa cagcgggcta 481 tggcgccccg gcgcttgcac caacatcacc atgggtgtcg tctgcaagct tcctcgtgct 541 gaacagacac ttctccccat cagcgcttcc agaaaaccag cggccctggt ggtggtgctg 601 atggcggtgc tgctgctcct ggccttgctg accgcagccc tcatccttta ccggaggcgc 661 cagaacatcg agcgcggggc ctttgagggt gcccgctaca gccgcagcag ctccagcccc 721 accgaggcca ctgaaaaaaa catcctggtg tcagacatgg aaatgaatga gcaacaagaa 781 tacaaccacg cgcgtgggca gggccagggc gggaagatct ggggaactgg ggccctgggt 841 cagtctggcc ccccaccagc tgcctgtcca tttggcctat ggaagggtgc ccttgggagt 901 ccctgttccc aaccggaact gggcataccc tgggctggtg gggtgccacc ctcccacaag 961 ggctgggctg agacccagct gagtgcaccg tggcgtttcc ctttctgggg gggcctgagg 1021 tcttgtcacc tggtcctgtg cccccaccgg aaccatatgt tagatgggaa ggggaacgag 1081 acctctttct ccccagagcc cccggcccag gcctgtttca tccgcgcccc aggacccctt 1141 ctttgcagag cccgaggagc ctcccctgtc ccctcgggca gatctgttgt gtctctcttc 1201 ccacctggca gcctcagctc tgtgcccctc accctgctcc ctctcgcccc ttctctccca 1261 ccccttcctt ctgagccggg ccctggggat tggggagccc tcttgttcct gatgagggtc 1321 agctgagggg gctgagcatc catcactcct gtgcctgctg gggtggctgt ggggcgtggc 1381 aggagggcct aggtgggttg ggcctgagaa ccagggcacg ggtgtggtgt ctgctgggct 1441 ggagataaga ctggggagag acaccccaac ctcccagggt gggagctggg ccgggctggg 1501 atgtcatctc ctgccgggcg ggggagggct ctgcccctgg aagagtcccc tgtggggacc 1561 aaataaagtt ccctaacatc tccagctcct ggctctggtt tggagcaagg ggaagggttg // LOCUS HSU58917 3120 bp mRNA PRI 31-JAN-1998 DEFINITION Homo sapiens IL-17 receptor mRNA, complete cds. ACCESSION U58917 NID g2826475 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3120) AUTHORS Yao,Z., Spriggs,M.K., Derry,J.M.J., Strockbine,L., Park,L.S., VandenBos,T., Zappone,J., Painter,S.L. and Armitage,R.J. TITLE Molecular characterization of the human interleukin (Il)-17 receptor JOURNAL Cytokine 9 (11), 794-800 (1997) MEDLINE 98035683 REFERENCE 2 (bases 1 to 3120) AUTHORS Spriggs,M.K. TITLE Direct Submission JOURNAL Submitted (21-MAY-1996) Molecular Biology, Immunex Corporation, 51 University St., Seattle, WA 98101, USA FEATURES Location/Qualifiers source 1..3120 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T-cells" CDS 33..2633 /note="receptor for interleukin 17 family" /codon_start=1 /product="IL-17 receptor" /db_xref="PID:g2826476" /translation="MGAARSPPSAVPGPLLGLLLLLLGVLAPGGASLRLLDHRALVCS QPGLNCTVKNSTCLDDSWIHPRNLTPSSPKDLQIQLHFAHTQQGDLFPVAHIEWTLQT DASILYLEGAELSVLQLNTNERLCVRFEFLSKLRHHHRRWRFTFSHFVVDPDQEYEVT VHHLPKPIPDGDPNHQSKNFLVPDCEHARMKVTTPCMSSGSLWDPNITVETLEAHQLR VSFTLWNESTHYQILLTSFPHMENHSCFEHMHHIPAPRPEEFHQRSNVTLTLRNLKGC CRHQVQIQPFFSSCLNDCLRHSATVSCPEMPDTPEPIPDYMPLWVYWFITGISILLVG SVILLIVCMTWRLAGPGSEKYSDDTKYTDGLPAADLIPPPLKPRKVWIIYSADHPLYV DVVLKFAQFLLTACGTEVALDLLEEQAISEAGVMTWVGRQKQEMVESNSKIIVLCSRG TRAKWQALLGRGAPVRLRCDHGKPVGDLFTAAMNMILPDFKRPACFGTYVVCYFSEVS CDGDVPDLFGAAPRYPLMDRFEEVYFRIQDLEMFQPGRMHRVGELSGDNYLRSPGGRQ LRAALDRFRDWQVRCPDWFECENLYSADDQDAPSLDEEVFEEPLLPPGTGIVKRAPLV REPGSQACLAIDPLVGEEGGAAVAKLEPHLQPRGQPAPQPLHTLVLAAEEGALVAAVE PGPLADGAAVRLALAGEGEACPLLGSPGAGRNSVLFLPVDPEDSPLGSSTPMASPDLL PEDVREHLEGLMLSLFEQSLSCQAQGGCSRPAMVLTDPHTPYEEEQRQSVQSDQGYIS RSSPQPPEGLTEMEEEEEEEQDPGKPALPLSPEDLESLRSLQRQLLFRQLQKNSGWDT MGSESEGPSA" BASE COUNT 610 a 1002 c 938 g 570 t ORIGIN 1 ggggccgagc cctccgcgac gccacccggg ccatgggggc cgcacgcagc ccgccgtccg 61 ctgtcccggg gcccctgctg gggctgctcc tgctgctcct gggcgtgctg gccccgggtg 121 gcgcctccct gcgactcctg gaccaccggg cgctggtctg ctcccagccg gggctaaact 181 gcacggtcaa gaatagtacc tgcctggatg acagctggat tcaccctcga aacctgaccc 241 cctcctcccc aaaggacctg cagatccagc tgcactttgc ccacacccaa caaggagacc 301 tgttccccgt ggctcacatc gaatggacac tgcagacaga cgccagcatc ctgtacctcg 361 agggtgcaga gttatctgtc ctgcagctga acaccaatga acgtttgtgc gtcaggtttg 421 agtttctgtc caaactgagg catcaccaca ggcggtggcg ttttaccttc agccactttg 481 tggttgaccc tgaccaggaa tatgaggtga ccgttcacca cctgcccaag cccatccctg 541 atggggaccc aaaccaccag tccaagaatt tccttgtgcc tgactgtgag cacgccagga 601 tgaaggtaac cacgccatgc atgagctcag gcagcctgtg ggaccccaac atcaccgtgg 661 agaccctgga ggcccaccag ctgcgtgtga gcttcaccct gtggaacgaa tctacccatt 721 accagatcct gctgaccagt tttccgcaca tggagaacca cagttgcttt gagcacatgc 781 accacatacc tgcgcccaga ccagaagagt tccaccagcg atccaacgtc acactcactc 841 tacgcaacct taaagggtgc tgtcgccacc aagtgcagat ccagcccttc ttcagcagct 901 gcctcaatga ctgcctcaga cactccgcga ctgtttcctg cccagaaatg ccagacactc 961 cagaaccaat tccggactac atgcccctgt gggtgtactg gttcatcacg ggcatctcca 1021 tcctgctggt gggctccgtc atcctgctca tcgtctgcat gacctggagg ctagctgggc 1081 ctggaagtga aaaatacagt gatgacacca aatacaccga tggcctgcct gcggctgacc 1141 tgatcccccc accgctgaag cccaggaagg tctggatcat ctactcagcc gaccaccccc 1201 tctacgtgga cgtggtcctg aaattcgccc agttcctgct caccgcctgc ggcacggaag 1261 tggccctgga cctgctggaa gagcaggcca tctcggaggc aggagtcatg acctgggtgg 1321 gccgtcagaa gcaggagatg gtggagagca actctaagat catcgtcctg tgctcccgcg 1381 gcacgcgcgc caagtggcag gcgctcctgg gccggggggc gcctgtgcgg ctgcgctgcg 1441 accacggaaa gcccgtgggg gacctgttca ctgcagccat gaacatgatc ctcccggact 1501 tcaagaggcc agcctgcttc ggcacctacg tagtctgcta cttcagcgag gtcagctgtg 1561 acggcgacgt ccccgacctg ttcggcgcgg cgccgcggta cccgctcatg gacaggttcg 1621 aggaggtgta cttccgcatc caggacctgg agatgttcca gccgggccgc atgcaccgcg 1681 taggggagct gtcgggggac aactacctgc ggagcccggg cggcaggcag ctccgcgccg 1741 ccctggacag gttccgggac tggcaggtcc gctgtcccga ctggttcgaa tgtgagaacc 1801 tctactcagc agatgaccag gatgccccgt ccctggacga agaggtgttt gaggagccac 1861 tgctgcctcc gggaaccggc atcgtgaagc gggcgcccct ggtgcgcgag cctggctccc 1921 aggcctgcct ggccatagac ccgctggtcg gggaggaagg aggagcagca gtggcaaagc 1981 tggaacctca cctgcagccc cggggtcagc cagcgccgca gcccctccac accctggtgc 2041 tcgccgcaga ggagggggcc ctggtggccg cggtggagcc tgggcccctg gctgacggtg 2101 ccgcagtccg gctggcactg gcgggggagg gcgaggcctg cccgctgctg ggcagcccgg 2161 gcgctgggcg aaatagcgtc ctcttcctcc ccgtggaccc cgaggactcg ccccttggca 2221 gcagcacccc catggcgtct cctgacctcc ttccagagga cgtgagggag cacctcgaag 2281 gcttgatgct ctcgctcttc gagcagagtc tgagctgcca ggcccagggg ggctgcagta 2341 gacccgccat ggtcctcaca gacccacaca cgccctacga ggaggagcag cggcagtcag 2401 tgcagtctga ccagggctac atctccagga gctccccgca gccccccgag ggactcacgg 2461 aaatggagga agaggaggaa gaggagcagg acccagggaa gccggccctg ccactctctc 2521 ccgaggacct ggagagcctg aggagcctcc agcggcagct gcttttccgc cagctgcaga 2581 agaactcggg ctgggacacg atggggtcag agtcagaggg gcccagtgca tgagggcggc 2641 tccccaggga ccgcccagat cccagctttg agagaggagt gtgtgtgcac gtattcatct 2701 gtgtgtacat gtctgcatgt gtatatgttc gtgtgtgaaa tgtaggcttt aaaatgtaaa 2761 tgtctggatt ttaatcccag gcatccctcc taacttttct ttgtgcagcg gtctggttat 2821 cgtctatccc caggggaatc cacacagccc gctcccagga gctaatggta gagcgtcctt 2881 gaggctccat tattcgttca ttcagcattt attgtgcacc tactatgtgg cgggcatttg 2941 ggataccaag ataaattgca tgcggcatgg ccccagccat gaaggaactt aaccgctagt 3001 gccgaggaca cgttaaacga acaggatggg ccgggcacgg tggctcacgc ctgtaatccc 3061 agcacactgg gaggccgagg caggtggatc actctgaggt caggagtttg agccagcctg // LOCUS HSU58970 1867 bp mRNA PRI 02-JUL-1996 DEFINITION Human putative outer mitochondrial membrane 34 kDa translocase hTOM34 mRNA, complete cds. ACCESSION U58970 NID g1399812 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1867) AUTHORS Nuttall,S.D., Hanson,B.J. and Hoogenraad,N.J. TITLE hTOM34 JOURNAL Unpublished REFERENCE 2 (bases 1 to 1867) AUTHORS Nuttall,S.D., Hanson,B.J. and Hoogenraad,N.J. TITLE Direct Submission JOURNAL Submitted (22-MAY-1996) Biochemistry, La Trobe University, Bundoora, Victoria 3083, Australia FEATURES Location/Qualifiers source 1..1867 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 32..961 /note="putative outer mitochondrial membrane translocase" /codon_start=1 /product="hTOM34p" /db_xref="PID:g1399813" /translation="MAPKFPDSVEELRAAGNESFRNGQYAEASALYGRALRVLQAQGS SDPEEESVLYSNRAACHWKNGNCRDCIKDCTSALALVPFSIKPLLRRASAYEALEKYP MAYVDYKTVLQIDDNVTSAVEGINRMTRALMDSLGPEWRLKLPSFPLVPVSAQKRWNF LPSENHKEMAKSKSKETTATKNRVPSAGDVEKARVLKEEGNELVKKGNHKKAIEKYSE SLLCSNLESATYSNRALCYLVLKQYTEAVKDCTEALKLDGKNVKAFYRRAQAHKALKD YKSSFADISNLLQIEPRNGPAQKLRQEVKQNLH" BASE COUNT 473 a 502 c 476 g 416 t ORIGIN 1 gaagctccca actcgccggc ctggccacgg gatggccccc aaattcccag actctgtgga 61 ggagctccgc gccgccggca atgagagttt ccgcaacggc cagtacgccg aggcctccgc 121 gctctacggc cgcgcgctgc gggtgctgca ggcgcaaggt tcttcagacc cagaagaaga 181 aagtgttctc tactccaacc gagcagcatg tcactggaag aatggaaact gcagagactg 241 catcaaagat tgcacttcag cactggcctt ggttcccttc agcattaagc ccctgctgcg 301 gcgagcatct gcttatgagg ctctggagaa gtaccctatg gcctatgttg actataagac 361 tgtgctgcag attgatgata atgtgacgtc agccgtagaa ggcatcaaca gaatgaccag 421 agctctcatg gactcgcttg ggcctgagtg gcgcctgaag ctgccctcat tccccttggt 481 gcctgtgtca gctcagaaga ggtggaattt cttgccttcg gagaaccaca aagagatggc 541 taaaagcaaa tccaaagaaa ccacagctac aaagaacaga gtgccttctg ctggggatgt 601 ggagaaagcc agagttctga aggaagaagg caatgagctt gtaaagaagg gaaaccataa 661 gaaagctatt gagaagtaca gtgaaagcct cttgtgtagt aacctggaat ctgccacgta 721 cagcaacaga gcactctgct atttggtcct gaagcagtac acagaagcag tgaaggactg 781 cacagaagcc ctcaagctgg atggaaagaa cgtgaaggca ttctacagac gggctcaagc 841 ccacaaagca ctcaaggact ataaatccag ctttgcagac atcagcaacc tcctacagat 901 tgagcctagg aatggtcctg cacagaagtt gcggcaggaa gtgaagcaga acctacacta 961 aaaacccaac agggcaactg gaacccctgc ctgaccttac ccagagaagc catgggccac 1021 ctgctctgtg cccgctcctg aaacccagca tgccccaagt gagctctgaa gccccctcct 1081 caatcccttg atggcctccc accctgtaag aggctttgct tgttcaaatt aaactcagtg 1141 tagtcaaaca cagacatggt tgttgcacca gaaaggtccc cactagagct aagcgtgaag 1201 ctgaagctct gtccctattc ccccagccca gctagctgat cacaccaaca gatcctcatc 1261 agcaaagcat ttggctttgt cctgcccaag tgggctgcag actgagtgct gcccttgtag 1321 cttccccaga ccccaactca ctgcagttca tctgaacaac ctgagctcct gggccggggt 1381 ggaaggaggg ggataaacct aaggccctga tccaaagcag cctgttgagc tggttctcca 1441 gggctgcagt ctctccaggt gtacagctgt ccctgccctg tcctgtcctt gcacagtctc 1501 ctatgtctga gccccagtgc cttctgttcg ggccctcctt tggtgggaaa ggcagagccc 1561 tgacccttga atggttgtcc ttgactctgt gctgctgcct tctgcagaga ggcacctaag 1621 ctgtttaaag agcccagtga ttgtggctgc tcctcctaga ggtgggaggg ggcaagaggc 1681 ctccttggtc agtgtccatg ctttctgggc agggacttgg ttttttgttc caacagtggc 1741 cttctccggg cttcatagtt ctttgtaata tgttgaagtt aatttgaatt gactgatttt 1801 gttgaactgt gtgtttaagc tgttgcatta aaaagctttc ttctacatca aaaaaaaaaa 1861 aaaaaaa // LOCUS HSU58996 1928 bp mRNA PRI 25-JUN-1996 DEFINITION Human testis calpastatin mRNA, complete cds. ACCESSION U58996 NID g1388176 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1928) AUTHORS O'Hern,P.A., Liang,Z.G., Wang,G.Y., Yavetz,B., Kim,E. and Goldberg.,E. TITLE A Novel Testis-specific Isoform of Calpastatin Detected with Serum from an Infertile Patient JOURNAL Unpublished REFERENCE 2 (bases 1 to 1928) AUTHORS O'Hern,P.A., Liang,Z.G., Wang,G.Y., Yavetz,B., Kim,E. and Goldberg.,E. TITLE Direct Submission JOURNAL Submitted (22-MAY-1996) Department of Biochemistry, Molecular Biology and Cell Biology, Northwestern University, 2153 Sheridan Road, Evanston, IL 60208, USA FEATURES Location/Qualifiers source 1..1928 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="testis" CDS 76..1368 /function="inhibitor of calpain (calcium-dependent neutral protease)" /note="t-CAST; testis specific isoform of calpastatin" /codon_start=1 /product="testis calpastatin" /db_xref="PID:g1388177" /translation="MGQFLSSTFLEGSPATVSTISFVTVNAEEQEKQFVSSRTKQKAK EEKLEKCGEDDETIPSEYRLKPATDKDGKPLLPEPEEKPKPRSESELIDELSEDFDRS ECKEKPSKPTEKTEESKAAAPAPVSEAVSRTSMCSIQSAPPEPATLKGTVPDDAVEAL ADSLGKKEADPEDGKPVMDKVKEKAKEEDREKLGEKEETIPPDYRLEEVKDKDGKPLL PKESKEQLPPMSEDFLLDALSEDFSGPQNASSLKFEDAKLAAAISEVVSQTPASTTQA GAPPRDTSSDKDLDDALDKLSDSLGQRQPDPDENKPMEDKVKEKAKAEHRDKLGERDD TIPPEYRHLLDDNGQDKPVKPPTKKSEDSKKPADDQDPIDALSGDLDSCPSTTETSQN TAKDKCKKAASSSKAPKNGGKAKDSAKTTEETSKPKDD" misc_feature 76..198 /note="encodes amino acids unique to testis isoform" polyA_signal 1828..1833 polyA_signal 1874..1879 BASE COUNT 662 a 385 c 425 g 456 t ORIGIN 1 cttgatatcg aattcggggg gagtctccct gacttccagc aacaatcctt gagtctggga 61 ctgccctggc ctaagatggg ccagtttcta tcttcgactt tcttggaggg ctcaccggcc 121 acagtgtcga cgataagctt tgtgacggtg aacgcagagg agcaagagaa gcagttcgta 181 tcttccagga ccaagcaaaa agctaaagaa gaaaaactag agaagtgtgg tgaggatgat 241 gaaacaatcc catctgagta cagattaaaa ccagccacgg ataaagatgg aaaaccacta 301 ttgccagagc ctgaagaaaa acccaagcct cggagtgaat cagaactcat tgatgaactt 361 tcagaagatt ttgaccggtc tgaatgtaaa gagaaaccat ctaagccaac tgaaaagaca 421 gaagaatcta aggccgctgc tccagctcct gtgtcggagg ctgtgtctcg gacctccatg 481 tgtagtatac agtcagcacc ccctgagccg gctaccttga agggcacagt gccagatgat 541 gctgtagaag ccttggctga tagcctgggg aaaaaggaag cagatccaga agatggaaaa 601 cctgtgatgg ataaagtcaa ggagaaggcc aaagaagaag accgtgaaaa gcttggtgaa 661 aaagaagaaa caattcctcc tgattataga ttagaagagg tcaaggataa agatggaaag 721 ccactcctgc caaaagagtc taaggaacag cttccaccca tgagtgaaga cttccttctg 781 gatgctttgt ctgaggactt ctctggtcca caaaatgctt catctcttaa atttgaagat 841 gctaaacttg ctgctgccat ctctgaagtg gtttcccaaa ccccagcttc aacgacccaa 901 gctggagccc caccccgtga tacctcgagt gacaaagacc tcgatgatgc cttggataaa 961 ctctctgaca gtctaggaca aaggcagcct gacccagatg agaacaaacc aatggaagat 1021 aaagtaaagg aaaaagctaa agctgaacat agagacaagc ttggagagag agatgacact 1081 atcccacctg aatacagaca tctcctggat gataatggac aggacaaacc agtgaagcca 1141 cctacaaaga aatcagagga ttcaaagaaa cctgcagatg accaagaccc cattgatgct 1201 ctctcaggag atctggacag ctgtccctcc actacagaaa cctcacagaa cacagcaaag 1261 gataagtgca agaaggctgc ttccagctcc aaagcaccta agaatggagg taaagcgaag 1321 gattcagcaa agacaacaga ggaaacttcc aagccaaaag atgactaaag aaatacaagt 1381 taaggtatct ggtatctgca tttaaaatct tcagctggtg gattgtgact tttgaagaac 1441 aaaaggcttt ggcaacagaa aacaattgtt ctgggtgatt tctagaatgt tttttgttga 1501 gtctctgaac atcctaaata tttgtttgtt attcttttcc agaaagaaaa tgaatttgac 1561 tggttcacct gtgtactgag tattgataaa cttcgaattt tttaaatttc cttcaaggga 1621 gagaaagctt atattggttt gttattcttt tccagaaaga aaatgaattt gactgggttc 1681 actgtgttac tgagtattga taaactttga atttttgcaa ttgccttcaa tttttagagg 1741 aaaagcttta tatttgtgtt attacttctt catcttacag tcatcacaga acacactgag 1801 acttgaatca agtcagcaac agagcaaaat aaaggttaga taagtccttg tgtagcaaat 1861 ttcgagcata agaaataaaa tctaattaat tcttagggta aaaaaaaaaa aaaaaaaaaa 1921 aaaaaaaa // LOCUS HSU59111 1515 bp mRNA PRI 25-JAN-1997 DEFINITION Human dermatan sulfate proteoglycan 3 (DSPG3) mRNA, complete cds. ACCESSION U59111 NID g1794208 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1515) AUTHORS Deere,M., Johnson,J., Garza,S., Harrison,W.R., Yoon,S.J., Elder,F.F.B., Kucherlapati,R., Hook,M. and Hecht,J.T. TITLE Characterization of human DSPG3, a small dermatan sulfate proteoglycan JOURNAL Genomics 38 (3), 399-404 (1996) MEDLINE 97131519 REFERENCE 2 (bases 1 to 1515) AUTHORS Deere,M. TITLE Direct Submission JOURNAL Submitted (22-MAY-1996) Pediatrics, UTHSC-H, P.O. Box 20708, Houston, TX 77225-0708, USA FEATURES Location/Qualifiers source 1..1515 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="chondrocyte" /chromosome="12" /map="12q21" gene 47..1015 /gene="DSPG3" CDS 47..1015 /gene="DSPG3" /note="similar to chick PG-Lb" /codon_start=1 /product="dermatan sulfate proteoglycan 3" /db_xref="PID:g1794209" /translation="MKTLAGLVLGLVILDAAVTAPTLESINYDSETYDATLEDLDNLY NYENIPVDKVEIEIATVMPSGNRELLTPPPQPEKAQEEEEEEESTPRLIDGSSPQEPE FTGVLGPHTNEDFPTCLWCTCISTTVYCDDHELDAIPPLPKNTAYFYSRFNRIKKINK NDFASLSDLKRIDLTSNLISEIDEDAFRKLPQLRELVLRDNKIRQLPELPTTSTFIDI SNNRLGRKGIKQEAFKDMYDLHHLYLTDNNLDHIPLPLPENLRALHLQNNNILEMHED TFCNGKNLTYIRKALEDIRLDGNPINLSKTPQAYMCLPRLPVGSLV" BASE COUNT 531 a 300 c 259 g 425 t ORIGIN 1 tttttttttt ttttcatcag gtcagagcca aaggaaagct tgaaaaatga agacattagc 61 aggacttgtt ctgggacttg tcatcttgga tgctgctgtg actgccccaa ctctagagtc 121 catcaactat gactcagaaa cctatgatgc caccttagaa gacctggata atttgtacaa 181 ctatgaaaac atacctgttg ataaagttga gattgaaata gccacagtaa tgccttcagg 241 gaacagagag ctcctcactc cacccccaca gcctgagaag gcccaggaag aggaagagga 301 ggaggaatct actcccaggc tgattgatgg ctcttctccc caggagcctg aattcacagg 361 ggttctgggg ccacacacaa atgaagactt tccaacctgt ctttggtgta cttgtataag 421 taccaccgtg tactgtgatg accatgaact tgatgctatt cctccgctgc caaagaacac 481 cgcttatttc tattcccgct ttaacagaat taaaaagatc aacaaaaatg actttgcaag 541 cctaagtgat ttaaaaagga ttgatctgac atcaaattta atatctgaga ttgatgaaga 601 tgcattccga aaactgcctc aacttcgaga gcttgtcctg cgtgacaaca aaataaggca 661 gctcccagaa ttgccaacca cttcgacatt tattgatatt agcaacaata gacttggaag 721 gaaagggata aagcaagaag catttaaaga catgtatgat ctccatcatc tgtacctcac 781 tgataacaac ttggaccaca tccctctgcc actcccagaa aatctacgag cccttcacct 841 ccagaataac aacattctgg aaatgcacga agatacgttc tgcaatggta aaaatttgac 901 ttatattcgt aaggcactag aggacattcg attggatgga aaccctatta atctcagcaa 961 aactccacaa gcatacatgt gtctacctcg tctgcctgtt gggagccttg tctaatttca 1021 gataatggtt agcattacga tggctactat aaataaacca ttcttactgc tctcttccaa 1081 aacaaaactc agcatgatac tttgagattg tgttctgaga gatgatatga ctacataaaa 1141 tacaattaaa aatgttataa tataatgaaa atgtagtaat ttaagaaaac accagatgag 1201 ttaggaataa acctataaca tttacaaaaa gagcaaaact aagtgataga aaatatttca 1261 cacatgttct tatagatcat gtatcacttg caagttttag gagttcatat cctatatcat 1321 ttcaaattaa gtacataata aagtaaaatt ttgaaatgaa cactttaggt atttttgcca 1381 agatttagat gtttttaatt aaacttttct cttccttttt ttttcactaa ggcatgttta 1441 ttcccctaat ccattaaaga gcatgaaaaa aagaataaat gtatttgaaa aaaaaaaaaa 1501 aaaaaaaaaa aaaaa // LOCUS HSU59151 1900 bp mRNA PRI 02-JAN-1998 DEFINITION Human Cbf5p homolog (CBF5) mRNA, complete cds. ACCESSION U59151 NID g2737893 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1900) AUTHORS Jiang,W., Clifford,J. and Koltin,Y. TITLE A highly conserved nucleolar protein from human interacts with a HMG-like protein JOURNAL Unpublished REFERENCE 2 (bases 1 to 1900) AUTHORS Jiang,W., Clifford,J. and Koltin,Y. TITLE Direct Submission JOURNAL Submitted (23-MAY-1996) Weidong Jiang, Mol. Biol., ChemGenics, One Kendall Square, Bldg. 300, Cambridge, MA 02139, USA FEATURES Location/Qualifiers source 1..1900 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" gene 81..1625 /gene="CBF5" CDS 81..1625 /gene="CBF5" /note="nucleolar protein; similar to yeast Cbf5p" /codon_start=1 /product="Cbf5p homolog" /db_xref="PID:g2737894" /translation="MADAEVIILPKKHKKKKERKSLPEEDVAEIQHAEEFFIKPESKV AKLDTSQWPLLLKNFDKLNVRTTHYTPLACGSNPLKREIGDYIRTGFINLDKPSNPSS HEVVAWIRRILRVEKTGHSGTLDPKVTGCLIVCIERATRLVKSQQSAGKEYVGIVRLH NAIEGGTQLSRALETLTGALFQRPPLIAAVKRQLRVRTIYESKMIEYDPERRLGIFWV SCEAGTYIRTLCVHLGLLLGVGGQMQELRRVRSGVMSEKDHMVTMHDVLDAQWLYDNH KDESYLRRVVYPLEKLLTSHKRLVMKDSAVNAICYGAKIMLPGVLRYEDGIEVNQEIV VITTKGEAICMAIALMTTAVISTCDHGIVAKIKRVIMERDTYPRKWGLGPKASQKKLM IKQGLLDKHGKPTDSTPATWKQEYVDYSESAKKEVVAEVVKAPQVVAEAAKTAKRKRE SESESDETPPAAPQLIKKEKKKSKKDKKAKAGLESGAEPGDGDSDTTKKKKKKKKAKE VELVSE" BASE COUNT 547 a 371 c 549 g 433 t ORIGIN 1 cctgacggga ccaaggcggc gggagtctgc ggtcgttccc tcggctgtgg accgggcggc 61 acgacgcggt gcagggtaac atggcggatg cggaagtaat tattttgcca aagaaacata 121 agaagaaaaa ggagcggaag tcattgccag aagaagatgt agccgaaata caacacgctg 181 aagaattttt tatcaaacct gaatccaaag ttgctaagtt ggacacgtct cagtggcccc 241 ttttgctaaa gaattttgat aagctgaatg taaggacaac acactataca cctcttgcat 301 gtggttcaaa tcctctgaag agagagattg gggactatat caggacaggt ttcattaatc 361 ttgacaagcc ctctaacccc tcttcccatg aggtggtagc ctggattcga cggatacttc 421 gggtggagaa gacagggcac agtggtactc tggatcccaa ggtgactggt tgtttaatcg 481 tgtgcataga acgagccact cgcttggtga agtcacaaca gagtgcaggc aaagagtatg 541 tggggattgt ccggctgcac aatgctattg aaggggggac ccagctttct agggccctag 601 aaactctgac aggtgcctta ttccagcgac ccccacttat tgctgcagta aagaggcagc 661 tccgagtgag gaccatctac gagagcaaaa tgattgaata cgatcctgaa agaagattag 721 gaatcttttg ggtgagttgt gaggctggca cctacattcg gacattatgt gtgcaccttg 781 gtttgttatt gggagttggt ggtcagatgc aggagcttcg gagggttcgt tctggagtca 841 tgagtgaaaa ggaccacatg gtgacaatgc atgatgtgct tgatgctcag tggctgtatg 901 ataaccacaa ggatgagagt tacctgcggc gagttgttta ccctttggaa aagctgttga 961 catctcataa acggctggtt atgaaagaca gtgcagtaaa tgccatctgc tatggggcca 1021 agattatgct tccaggtgtt cttcgatatg aggacggcat tgaggtcaat caggagattg 1081 tggttatcac caccaaagga gaagcaatct gcatggctat tgcattaatg accacagcgg 1141 tcatctctac ctgcgaccat ggtatagtag ccaagatcaa gagagtgatc atggagagag 1201 acacttaccc tcggaagtgg ggtttaggtc caaaggcaag tcagaagaag ctgatgatca 1261 agcagggcct tctggacaag catgggaagc ccacagacag cacacctgcc acctggaagc 1321 aggagtatgt tgactacagt gagtctgcca aaaaagaggt ggttgctgaa gtggtaaaag 1381 ccccgcaggt agttgccgaa gcagcaaaaa ctgcgaagcg gaagcgagag agtgagagtg 1441 aaagtgacga gactcctcca gcagctcctc agttgatcaa gaaggaaaag aagaagagta 1501 agaaggacaa gaaggccaaa gctggtctgg agagcggggc cgagcctgga gatggggaca 1561 gtgataccac caagaagaag aagaagaaga agaaagcaaa agaggtagaa ttggtttctg 1621 agtagtgaag gccacttgaa gctggaggag aaactaaagc cttattgaga aaacatgtta 1681 tagatccttt tgttgctgag agagtggaac ataggtccta gacagggtga agagttctgg 1741 cacattttag ctgctacttt gagacctcgg tgatgttacc tggtgtggtc atcccatctt 1801 gtcctgtttt aaggatatgg gtggtgaaag atgaaagagg cagagtttat cccaatgact 1861 tctctgtttg agttgggaag cctcaccttc agacccagta // LOCUS HSU59185 2529 bp mRNA PRI 04-OCT-1997 DEFINITION Human putative monocarboxylate transporter (MCT) mRNA, complete cds. ACCESSION U59185 NID g2463627 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2529) AUTHORS Price,N.T., Jackson,V.N. and Halestrap,A.P. TITLE Cloning of a monocarboxylate transporter from human placenta JOURNAL Unpublished REFERENCE 2 (bases 1 to 2529) AUTHORS Price,N.T., Jackson,V.N. and Halestrap,A.P. TITLE Direct Submission JOURNAL Submitted (23-MAY-1996) Biochemistry, University of Bristol, Medical School, University Walk, Bristol BS8 1TD, U.K. FEATURES Location/Qualifiers source 1..2529 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /dev_stage="34 week" gene 183..1646 /gene="MCT" CDS 183..1646 /gene="MCT" /note="putative monocarboxylate transporter" /codon_start=1 /db_xref="PID:g2463628" /translation="MLKREGKVQPYTKTLDGGWGWMIVIHFFLVNVFVMGMTKTFAIF FVVFQEEFEGTSEQIGWIGSIMSSLRFCAGPLVAIICDILGEKTTSILGAFVVTGGYL ISSWATSIPFLCVTMGLLPGLGSAFLYQVAAVVTTKYFKKRLALSTAIARSGMGLTFL LAPFTKFLIDLYDWTGALILFGAIALNLVPSSMLLRPIHIKSENNSGIKDKGSSLSAH GPEAHATETHCHETEESTIKDSTTQKAGLPSKNLTVSQNQSEEFYNGPNRNRLLLKSD EESDKVISWSCKQLFDISLFRNPFFYIFTWSFLLSQLAYFIPTFHLVARAKTLGIDIM DASYLVSVAGILETVSQIISGWVADQNWIKKYHYHKSYLILCGITNLLAPLATTFPLL MTYTICFAIFAGGYLALILPVLVDLCRNSTVNRFLGLASFFAGMAVLSGPPIAGWLYD YTQTYNGSFYFSGICYLLSSVSFFFVPLAERWKNSLT" repeat_region 1768..2066 /rpt_family="Alu" BASE COUNT 691 a 521 c 510 g 807 t ORIGIN 1 cttggctctt acaatgctca cttgttttca caatgcagca aaatgaaatg ccttagaaaa 61 agagtaacat tccagaaaac ggtgtaattt atttttcttc cttaattgcc ccatctgtgg 121 aggatttctt tgctgaacac cacatcaaag ggatcttctg catttaaaat agaagaggca 181 tcatgctgaa gagggagggg aaggtccaac cttacactaa aaccctggat ggaggatggg 241 gatggatgat tgtgattcat tttttcctgg tgaatgtgtt tgtgatgggg atgaccaaga 301 cttttgcaat tttctttgtg gtctttcaag aagagtttga aggcacctca gagcaaattg 361 gttggattgg atccatcatg tcatctcttc gtttttgtgc aggtcccctg gttgctatta 421 tttgtgacat acttggagag aaaactacct ccattcttgg ggctttcgtt gttactggtg 481 gatatctgat cagcagctgg gccacaagta ttccttttct ttgtgtgact atgggacttc 541 tacccggttt gggttctgct ttcttatacc aagtggctgc tgtggtaact accaaatact 601 tcaaaaaacg attggctctt tctacagcta ttgcccgttc tgggatggga ctgacttttc 661 ttttggcacc ctttacaaaa ttcctgatag atctgtatga ctggacagga gcccttatat 721 tatttggagc tatcgcattg aatttggtgc cttctagtat gctcttaaga cccatccata 781 tcaaaagtga gaacaattct ggtattaaag ataaaggcag cagtttgtct gcacatggtc 841 cagaggcaca tgcaacagaa acacactgcc atgagacaga agagtctacc atcaaggaca 901 gtactacgca gaaggctgga ctacctagca aaaatttaac agtctcacaa aatcaaagtg 961 aagagttcta caatgggcct aacaggaaca gactgttatt aaagagtgat gaagaaagtg 1021 ataaggttat ttcgtggagc tgcaaacaac tgtttgacat ttctctcttt agaaatcctt 1081 tcttctacat atttacttgg tcttttctcc tcagtcagtt agcatacttc atccctacct 1141 ttcacctggt agccagagcc aaaacactgg ggattgacat catggatgcc tcttaccttg 1201 tttctgtagc aggtatcctt gagacggtca gtcagattat ttctggatgg gttgctgatc 1261 aaaactggat taagaagtat cattaccaca agtcttacct catcctctgc ggcatcacta 1321 acctgcttgc tcctttagcc accacatttc cactacttat gacctacacc atctgctttg 1381 ccatctttgc tggtggttac ctggcattga tactgcctgt actggttgat ctgtgtagga 1441 attctacagt aaacaggttt ttgggacttg ccagtttctt tgctgggatg gctgtccttt 1501 ctggaccacc tatagcaggc tggttatatg attataccca gacatacaat ggctctttct 1561 acttctctgg catatgctat ctcctctctt cagtttcctt tttttttgta ccattggccg 1621 aaagatggaa aaacagtctg acctgaaaga aagaagactg caatcaagtg agagctaaac 1681 aaaagaaaac ctaaactaat gtcattggaa acaaaagctt gaaagaaaca catcgcatct 1741 acatttgtaa catgagaagg aaaacaattt tttttttttt ttttttgaga cggagtctcg 1801 ctctttcgcc caggctggag tgcagtggcg caatctcggc tcactgtaat ctccgcctcc 1861 tgggttcaag ggattctcct gcctcagcct cccaagtagc tgggactaca ggcacacgcc 1921 accacaccca gctaattttt tgtattttta gtagaggcgg ggtttcacca tgttagccag 1981 gatggtctcc atctcctgac ctcgtgatcc gcccgccttg tcctccaaag tgctgggatt 2041 acaggcatga gccactgggc gcggccagat aagtttttaa ggttccttct tgctttagca 2101 ttctgagaaa tgtctaattg gtagtaagac aagagtaata gcaacctgta ttgttagtat 2161 ttaaccaaat aggctaaaat tttaatcagg taccttatgt attaaataga aatcggaatg 2221 taccataata aatccaaact ctcaattacg ccatggtaat tcagtcacta aaatatgtaa 2281 agatagaaaa ttttttaatt taaagaagtg tgaaacatag ccattgattg atcagaattc 2341 tggaatctga atattaaaac cttacttagt gactggaatg gtatatgctc cctccaaaag 2401 tttatctttg tttattgatt aaaggtaatc cttactttct ttgtattact taggttctca 2461 attaaaggta atccttactt tctttgtatt acttaggttc ttaaatttct atgataaaca 2521 tgtattgct // LOCUS HSU59228 854 bp mRNA PRI 07-SEP-1996 DEFINITION Human ectodermal dysplasia protein (EDA) mRNA, complete cds. ACCESSION U59228 NID g1524408 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 854) AUTHORS Kere,J., Srivastava,A.K., Montonen,O., Zonana,J., Thomas,N., Ferguson,B., Munoz,F., Morgan,D., Clarke,A., Baybayan,P., Chen,E.Y., Ezer,S., Saarialho-Kere,U., de la Chapelle,A. and Schlessinger,D. TITLE X-linked anhidrotic (hypohidrotic) ectodermal dysplasia is caused by mutation in a novel transmembrane protein JOURNAL Nature Genet. 13 (4), 409-416 (1996) MEDLINE 96331280 REFERENCE 2 (bases 1 to 854) AUTHORS Srivastava,A.K., Kere,J. and Schlessinger,D. TITLE Direct Submission JOURNAL Submitted (23-MAY-1996) J.C. Self Research Institute of Human Genetics, Greenwood Genetic Center, 1 Gregor Mendel Circle, Greenwood, SC 29646, USA FEATURES Location/Qualifiers source 1..854 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /map="Xq13.1" /tissue_type="sweat gland" /clone="27G4" 5'UTR 1..242 gene 243..650 /gene="EDA" CDS 243..650 /gene="EDA" /codon_start=1 /product="ectodermal dysplasia protein" /db_xref="PID:g1524409" /translation="MGYPEVERRELLPAAAPRERGSQGCGCGGAPARAGEGNSCLLFL GFFGLSLALHLLTLCCYLELRSELRRERGAESRLGGSGTPGTSGTLSSLGGLDPDSPI TSHLGQPSPKQQPLEPGEAALHSDSQDGHQGHQ" 3'UTR 651..854 BASE COUNT 164 a 287 c 242 g 161 t ORIGIN 1 attccctcgg cgggccgagc ctcccctctc tcccgcccct cctcctccct ttcccacccc 61 tcggagtaga gctgcacatg cggctgctcc ctgctccgtc ccgcccagcc actgtcgcgc 121 aggaacgggt ccctgcagcc cccagccgat ggcaggacag tagccgcctg tcagaggtcg 181 tgaacggctg aggcagacgc agcggctccc gggcctcaag agagtggatg tctccggagg 241 ccatgggcta cccggaggtg gagcgcaggg aactcctgcc tgcagcagcg ccgcgggagc 301 gagggagcca gggctgcggg tgtggcgggg cccctgcccg ggcgggcgaa gggaacagct 361 gcctgctctt cctgggtttc tttggcctct cgctggccct ccacctgctg acgttgtgct 421 gctacctaga gttgcgctcg gagttgcggc gggaacgtgg agccgagtcc cgccttggcg 481 gctcgggcac ccctggcacc tctggcaccc taagcagcct cggtggcctc gaccctgaca 541 gccccatcac cagtcacctt gggcagccgt cacctaagca gcagccattg gaaccgggag 601 aagccgcact ccactctgac tcccaggacg ggcaccaggg acaccaatga gttgtgtctt 661 ccctctgtcc actctcagca ccttcactct gaagatctgt taaaagcaca cgagtcgtct 721 cagtccctca gtgggagctg tttcacctgg cgtcatctag tcagccatct tcaataataa 781 ctgttaaatg aacatttata tccactgaaa ccactaagtg aaataaagat gtgtttaggc 841 aaaaaaaaaa aaaa // LOCUS HSU59269 2088 bp mRNA PRI 25-SEP-1996 DEFINITION Human hyaluronan synthase mRNA, complete cds. ACCESSION U59269 NID g1556464 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2088) AUTHORS Shyjan,A.M., Heldin,P., Butcher,E.C., Yoshino,T. and Briskin,M.J. TITLE Functional cloning of the cDNA for a human hyaluronan synthase JOURNAL J. Biol. Chem. 271 (38), 23395-23399 (1996) MEDLINE 96394438 REFERENCE 2 (bases 1 to 2088) AUTHORS Briskin,M.J. and Shyjan,A.M. TITLE Direct Submission JOURNAL Submitted (24-MAY-1996) LeukoSite Inc., 215 First Street, Cambridge, MA 02142, USA FEATURES Location/Qualifiers source 1..2088 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 36..1772 /codon_start=1 /product="hyaluronan synthase" /db_xref="PID:g1556465" /translation="MRQQDAPKPTPAARRCSGLARRVLTIAFALLILGLMTWAYAAGV PLASDRYGLLAFGLYGAFLSAHLVAQSLFAYLEHRRVAAAARGPLDAATARSVALTIS AYQEDPAYLRQCLASARALLYPRARLRVLMVVDGNRAEDLYMVDMFREVFADEDPATY VWDGNYHQPWEPAAAGAVGAGAYREVEAEDPGRLAVEALVRTRRCVCVAQRWGGKREV MYTAFKALGDSVDYVQVCDSDTRLDPMALLELVRVLDEDPRVGAVGGDVRILNPLDSW VSFLSSLRYWVAFNVERACQSYFHCVSCISGPLGLYRNNLLQQFLEAWYNQKFLGTHC TFGDDRHLTNRMLSMGYATKYTSRSRCYSETPSSFLRWLSQQTRWSKSYFREWLYNAL WWHRHHAWMTYEAVVSGLFPFFVAATVLRLFYAGRPWALLWVLLCVQGVALAKAAFAA WLRGCLRMVLLSLYAPLYMCGLLPAKFLALVTMNQSGWGTSGRRKLAANYVPLLPLAL WALLLLGGLVRSVAHEARADWSGPSRAAEAYHLAAGAGAYVGYWVAMLTLYWVGVRRL CRRRTGGYRVQV" BASE COUNT 302 a 652 c 711 g 423 t ORIGIN 1 cggagagaag agagagcccg gccagaccca ctgcgatgag acagcaggac gcgcccaagc 61 ccactcctgc agcccgccgc tgctccggcc tggcccggag ggtgctgacc atcgccttcg 121 ccctgctcat cctgggcctc atgacctggg cctacgccgc cggggtgccg ctggcctccg 181 atcgctacgg cctcctggcc ttcggcctct acggggcctt cctttcagcg cacctggtgg 241 cgcagagcct cttcgcgtac ctggagcacc ggcgggtggc ggcggcggcg cgggggccgc 301 tggatgcagc caccgcgcgc agtgtggcgc tgaccatctc cgcctaccag gaggaccccg 361 cgtacctgcg ccagtgcctg gcgtccgccc gcgccctgct gtacccgcgc gcgcggctgc 421 gcgtcctcat ggtggtggat ggcaaccgcg ccgaggacct ctacatggtc gacatgttcc 481 gcgaggtctt cgctgacgag gaccccgcca cgtacgtgtg ggacggcaac taccaccagc 541 cctgggaacc cgcggcggcg ggcgcggtgg gcgccggagc ctatcgggag gtggaggcgg 601 aggatcctgg gcggctggca gtggaggcgc tggtgaggac tcgcaggtgc gtgtgcgtgg 661 cgcagcgctg gggcggcaag cgcgaggtca tgtacacagc cttcaaggcg ctcggagatt 721 cggtggacta cgtgcaggtc tgtgactcgg acacaaggtt ggaccccatg gcactgctgg 781 agctcgtgcg ggtactggac gaggaccccc gggtaggggc tgttggtggg gacgtgcgga 841 tccttaaccc tctggactcc tgggtcagct tcctaagcag cctgcgatac tgggtagcct 901 tcaatgtgga gcgggcttgt cagagctact tccactgtgt atcctgcatc agcggtcctc 961 taggcctata taggaataac ctcttgcagc agtttcttga ggcctggtac aaccagaagt 1021 tcctgggtac ccactgtact tttggggatg accggcacct caccaaccgc atgctcagca 1081 tgggttatgc taccaagtac acctccaggt cccgctgcta ctcagagacg ccctcgtcct 1141 tcctgcggtg gctgagccag cagacacgct ggtccaagtc gtacttccgt gagtggctgt 1201 acaacgcgct ctggtggcac cggcaccatg cgtggatgac ctacgaggcg gtggtctccg 1261 gcctgttccc cttcttcgtg gcggccactg tgctgcgtct gttctacgcg ggccgccctt 1321 gggcgctgct gtgggtgctg ctgtgcgtgc agggcgtggc actggccaag gcggccttcg 1381 cggcctggct gcggggctgc ctgcgcatgg tgcttctctc gctctacgcg cccctctaca 1441 tgtgtggcct cctgcctgcc aagttcctgg cgctagtcac catgaaccag agtggctggg 1501 gcacctcggg ccggcggaag ctggccgcta actacgtccc tctgctgccc ctggcgctct 1561 gggcgctgct gctgcttggg ggcctggtcc gcagcgtagc acacgaggcc agggccgact 1621 ggagcggccc ttcccgcgca gccgaggcct accacttggc cgcgggggcc ggcgcctacg 1681 tgggctactg ggtggccatg ttgacgctgt actgggtggg cgtgcggagg ctttgccggc 1741 ggcggaccgg gggctaccgc gtccaggtgt gagtccagcc acgcggatgc cgcctcaagg 1801 gtcttcaggg gaggccagag gagagctgct gggccccgag ccacgaactt gctgggtggt 1861 tctctgggcc tcagtttccc tcctctgcca aacgaggggg tcagcccaag attcttcagt 1921 ctggactata ttgggactgg gacttctggg tctccaggga gggtatttat tggtcaggat 1981 gtgggatttg aggagtggag gggaaagggt cctgctttct cctcgttctt atttaatctc 2041 catttctact gtgtgatcag gatgtaataa agaattttat ttattttc // LOCUS HSU59299 1719 bp mRNA PRI 03-FEB-1998 DEFINITION Homo sapiens putative monocarboxylate transporter MCT mRNA, complete cds. ACCESSION U59299 NID g2463629 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1719) AUTHORS Price,N.T., Jackson,V.N. and Halestrap,A.P. TITLE Cloning and sequencing of four new mammalian monocarboxylate transporter (MCT) homologues confirms the existence of a transporter family with an ancient past JOURNAL Biochem. J. 329 (2), 321-328 (1998) REFERENCE 2 (bases 1 to 1719) AUTHORS Jackson,V.N., Price,N.T. and Halestrap,A.P. TITLE Direct Submission JOURNAL Submitted (24-MAY-1996) Biochemistry, University of Bristol, Medical School, University Walk, Bristol BS8 1TD, UK FEATURES Location/Qualifiers source 1..1719 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta (34 week)" CDS 61..1578 /note="putative monocarboxylate transporter" /codon_start=1 /product="MCT" /db_xref="PID:g2463630" /translation="MPQALERADGSWAWVVLLATMVTQGLTLGFPTCIGIFFTELQWE FQASNSETSWFPSILTAVLHMAGPLCSILVGRFGCRVTVMLGGVLASLGMVASSFSHN LSQLYFTAGFITGLGMCFSFQSSITVLGFYFVRRRVLANALASMGVSLGITLWPLLSR YLLENLGWRGTFLVFGGIFLHCCICGAIIRPVATSVAPETKECPPPPPETPALGCLAA CGRTIQRHLAFDILRHNTGYCVYILGVMWSVLGFPLPQVFLVPYAMWHSVDEQQAALL ISIIGFSNIFLRPLAGLMAGRPAFASHRKYLFSLALLLNGLTNLVCAASGDFWVLVGY CLAYSVSMSGIGALIFQVLMDIVPMDQFPRALGLFTVLDGLAFLISPPLAGLLLDATN NFSYVFYMSSFFLISAALFMGGSFYALQKKEQGKQAVAADALERDLFLEAKDGPGKQR SPEIMCQSSRQPRPAGVNKHLWGCPASSRTSHEWLLWPKAVLQAKQTALGWNSPT" BASE COUNT 316 a 543 c 477 g 383 t ORIGIN 1 ccgaattcgg gggcagcagc cacattggca gtgaggccgt ggcagcgtca gcagcagagg 61 atgccccagg ccctggagcg tgcagatggc agctgggcct gggtggtgct gctggccacc 121 atggtgaccc agggcctcac cctgggcttc cccacgtgta tcggcatctt cttcactgaa 181 ctgcaatggg agttccaggc cagcaacagc gagacctctt ggttcccctc catcctcacg 241 gctgtgctcc acatggcagg gcccctgtgc agcatcctgg tgggacgctt cggctgccga 301 gtgaccgtga tgctgggggg cgtgctggcc agcctgggca tggtggccag ctccttctct 361 cacaacctca gccagctcta cttcacagca ggattcatca caggcctggg catgtgcttc 421 agcttccagt caagcatcac ggtgctgggc ttctactttg tccgccggcg ggtgctggcc 481 aacgcgctgg cctcgatggg cgtctccctg ggcatcaccc tctggccgct gctctcccgt 541 taccttctgg agaacctggg ctggaggggt accttccttg tcttcggcgg gatctttctc 601 cactgctgca tctgcggggc catcataagg cctgtggcca ccagtgtggc ccctgagacc 661 aaagaatgtc ccccgccacc tcccgagaca cctgcacttg gctgcctggc tgcatgcggc 721 cggaccatcc agcgccacct ggccttcgac atcctgcggc acaacacagg ctactgcgtg 781 tacatactgg gtgtgatgtg gtccgtcctg ggcttcccac tgccacaagt cttcctggtg 841 ccatatgcca tgtggcacag cgtggacgag cagcaggcag ccctcctcat ctccatcatc 901 ggcttcagca acatcttcct gaggccccta gccgggctga tggcaggacg gccggccttt 961 gctagccacc gcaagtacct gttcagcctg gcactcctgc tcaatgggct cactaacctg 1021 gtgtgtgcgg catcaggtga cttctgggtg ctcgtgggct actgcctggc gtacagcgtg 1081 tccatgagtg gcatcggcgc cctcatcttc caggttctca tggacatcgt ccccatggat 1141 cagttcccca gagccctggg actcttcact gtcctggacg gccttgcttt cctcatctcc 1201 ccaccactgg ccgggttgct cctggacgcc accaacaact ttagctatgt tttctacatg 1261 tccagcttct tcctcatctc agctgccctc ttcatgggtg gcagcttcta cgccctgcag 1321 aagaaggagc aaggcaagca ggctgtcgcg gcggatgccc tggagcggga tcttttcttg 1381 gaagccaaag acggtcctgg gaagcaacgg tcccctgaga tcatgtgcca gtcttcccgc 1441 cagccacgtc cagctggcgt caataagcat ctttggggat gtcctgcctc ctccaggacc 1501 agccatgagt ggctcttatg gccaaaggcg gtactgcagg ccaagcaaac ggctctgggc 1561 tggaatagcc ctacctgagt gccctgtttg actccgccac tatctgccat gtgagttggg 1621 caaattgttg accacctctg agccttgaaa aagtaggagg ttactttgtt agagcaaaat 1681 aataaaattt aattttaaaa aagaaaaaaa aaaaaaaaa // LOCUS HSU59305 2785 bp mRNA PRI 01-DEC-1996 DEFINITION Human ser-thr protein kinase PK428 mRNA, complete cds. ACCESSION U59305 NID g1695872 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2785) AUTHORS Zhao,Y., Kidd,V. and Kraft,A.S. TITLE Cloning of a novel member of the myotonic dystrophy family of protein kinases JOURNAL Unpublished REMARK Submitted for publication REFERENCE 2 (bases 1 to 2785) AUTHORS Zhao,Y., Kidd,V. and Kraft,A.S. TITLE Direct Submission JOURNAL Submitted (27-MAY-1996) Hematology/Oncology, University of Alabama at Birmingham, WTI 564, Birmingham, AL 35294, USA FEATURES Location/Qualifiers source 1..2785 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" CDS 1289..2779 /note="protein kinase related to the myotonic dystrophy protein kinase family" /codon_start=1 /product="ser-thr protein kinase PK428" /db_xref="PID:g1695873" /translation="MSGEVRLRQLEQFILDGPAQTNGQYFSVETLLDILICLYDECNN SPLRREKNILEYLEWAKPFTSKVKQMRLHREDFEILKVIGRGAFGEVAVVKLKNADKV FAMKILNKWEMLKRAETACFREERDVLVNGDNKWITTLHYAFQDDNNLYLVMDYYVGG DLLTLLSKFEDRLPEDMARFYLAEMVIAIDSVHQLHYVHRDIKPDNILMDMNGHIRLA DFGSCLKLMEDGTVQSSVAVGTPDYISPEILQAMEDGKGRYGPECDWWSLGVCMYEML YGETPFYAESLVETYGKIMNHKERFQFPAQVTDVSENAKDLIRRLICSREHRLGQNGI EDFKKHPFFSGIDWDNIRNCEAPYIPEVSSPTDTSNFDVDDDCLKNSETMPPPTHTAF SGHHLPFVGFTYTSSCVLSDRSCLRVTAGPTSLDLDVNVQRTLDNNLATEAYERRIKR LEQEKLELSRKLQESTQTVQALQYSTVDGPLTASKDLEIKNLKEEI" BASE COUNT 775 a 564 c 708 g 738 t ORIGIN 1 tccgcaagcc tccgcctctg tgcgcgggac tggaggagcc tcgctgagcc cagggcgcga 61 cggcgagagg agagggaagg cgggggaggg ctggaaggga gaggaaggga gtggttggga 121 gccgggctgc cgcagcctct agtctcctca gccgcggaag cacccctcct cctgcgccgc 181 ggccgcctcc ctcctcgctg tggaaagatg cccttagccc aggggtgtga agaaggggga 241 gaagtagctg ccagagccgc ccgcgccgcc gccgcagctg ctgctcgttg tgtcctttga 301 attcagagaa gcaccccccc gcctggggcg tcgggagcct ccgggcggcg gccgcggtgc 361 ggtgtcgggg agaccgggct ctctgcccgc gcggcgcggc gcggctcggc ccacgagcga 421 ccaccgacat ggagtgggct cgggcggcca agtagccgct tctccggagc cggtgccagt 481 gccgcccgca gcccgccttc cacccccggc cgcgccgccg gtcaggccct agggtgaagc 541 cgggaggaaa atgaagagtt ttcacccgga atccgttgaa aataggactg actgcaaagc 601 cttaaagaaa gaaggacctc gggaggagaa acgaaaagcc gcctccgggc aagacttggc 661 gtgctccgag ccgaggggct gcttcaggga cctcgccccc tccctttccc gctggagaaa 721 ttgccgctga tgcattatcc aagtggtggt tgggaggatt tgcagcaaca tttttggttt 781 tccctccccc ttctatgcat tctgtttttt tcctcccttt tctgtttttc ttcttcccgg 841 gaagtgaatt gctgatgcaa atcggacttt attcattaat gatgcaaccg gattcgtttc 901 aggattacgt tgcacgagtt gaattttgaa tgaaggagaa gagttttttt ttttttttaa 961 agaagtgttg actctctagt tcgttgtact tttaattatt attttattta aatatacgac 1021 ttaattgtat tcttttaaaa atgcattaag tatatatttt atggtaattt accctcaaaa 1081 tatatgtata tgggtgaaat tgaagacgcc ttcagttaag tgaggttact ggtgtgttgg 1141 atgtttaatt cagcaccagc attgcatgac agttgtttga ataacaagtg gtttattttt 1201 aaaaccatac cttttaaaat ttaggttcag ataatagtaa aagtcatcat aataatttaa 1261 aggaaaacca gcagaaatcg aagcaaacat gtctggagaa gtgcgtttga ggcagttgga 1321 gcagtttatt ttggacgggc ccgctcagac caatgggcag tacttcagtg tggagacatt 1381 actggatata ctcatctgcc tttatgatga atgcaataat tctccattga gaagagagaa 1441 gaacattctc gaatacctag aatgggctaa accatttact tctaaagtga aacaaatgcg 1501 attacataga gaagactttg aaatattaaa ggtgattggt cgaggagctt ttggggaggt 1561 tgctgtagta aaactaaaaa atgcagataa agtgtttgcc atgaaaatat tgaataaatg 1621 ggaaatgctg aaaagagctg agacagcatg ttttcgtgaa gaaagggatg tattagtgaa 1681 tggagacaat aaatggatta caaccttgca ctatgctttc caggatgaca ataacttata 1741 cctggttatg gattattatg ttggtgggga tttgcttact ctactcagca aatttgaaga 1801 tagattgcct gaagatatgg ctagatttta cttggctgag atggtgatag caattgactc 1861 agttcatcag ctacattatg tacacagaga cattaaacct gacaatatac tgatggatat 1921 gaatggacat attcggttag cagattttgg ttcttgtctg aagctgatgg aagatggaac 1981 ggttcagtcc tcagtggctg taggaactcc agattatatc tctcctgaaa tccttcaagc 2041 catggaagat ggaaaaggga gatatggacc tgaatgtgac tggtggtctt tgggggtctg 2101 tatgtatgaa atgctttacg gagaaacacc attttatgca gaatcgctgg tggagacata 2161 cggaaaaatc atgaaccaca aagagaggtt tcagtttcca gcccaagtga ctgatgtgtc 2221 tgaaaatgct aaggatctta ttcgaaggct catttgtagc agagaacatc gacttggtca 2281 aaatggaata gaagacttta agaaacaccc atttttcagt ggaattgatt gggataatat 2341 tcggaactgt gaagcacctt atattccaga agttagtagc ccaacagata catcgaattt 2401 tgatgtagat gatgattgtt taaaaaattc tgaaacgatg cccccaccaa cacatactgc 2461 attttctggc caccatctgc catttgttgg ttttacatat actagtagct gtgtactttc 2521 tgatcggagc tgtttaagag ttacggctgg tcccacctca ctggatcttg atgttaatgt 2581 tcagaggact ctagacaaca acttagcaac tgaagcttat gaaagaagaa ttaagcgcct 2641 tgagcaagaa aaacttgaac tcagtagaaa acttcaagag tcaacacaga ctgtccaagc 2701 tctgcagtat tcaactgttg atggtccact aacagcaagc aaagatttag aaataaaaaa 2761 cttaaaagaa gaaatttgaa aaaaa // LOCUS HSU59325 2880 bp mRNA PRI 27-JUN-1996 DEFINITION Human cadherin-14 mRNA, complete cds. ACCESSION U59325 NID g1389852 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2880) AUTHORS Shibata,T., Shimoyama,Y., Gotoh,M. and Hirohashi,S. TITLE Molecular cloning of human cadherin-14 JOURNAL Unpublished REFERENCE 2 (bases 1 to 2880) AUTHORS Shibata,T., Shimoyama,Y., Gotoh,M. and Hirohashi,S. TITLE Direct Submission JOURNAL Submitted (26-MAY-1996) Pathology Division, National Cancer Center, Tsukiji 5-1-1, Chuo-ku, Tokyo 104, Japan FEATURES Location/Qualifiers source 1..2880 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /dev_stage="adult" CDS 314..2686 /note="Ca2+ dependent cell adhesion molecule" /codon_start=1 /product="cadherin-14" /db_xref="PID:g1389853" /translation="MKITSTSCICPVLVCLCFVQRCYGTAHHSSIKVMRNQTKHIEGE TEVHHRPKRGWVWNQFFVLEEHMGPDPQYVGKLHSNSDKGDGSVKYILTGEGAGTIFI IDDTTGDIHSTKSLDREQKTHYVLHAQAIDRRTNKPLEPESEFIIKVQDINDNAPKFT DGPYIVTVPEMSDMGTSVLQVTATDADDPTYGNSARVVYSILQGQPYFSVDPKTGVIR TALHNMDREAREHYSVVIQAKDMAGQVGGLSGSTTVNITLTDVNDNPPRFPQKHYQLY VPESAQVGSAVGKIKANDADTGSNADMTYSIINGDGMGIFSISTDKETREGILSLKKP LNYEKKKSYTLNIEGANTHLDFRFSHLGPFKDATMLKIIVGDVDEPPLFSMPSYLMEV YENAKIGTVVGTVLAQDPDSTNSLVRYFINYNVEDDRFFNIDANTGTIRTTKVLDREE TPWYNITVTASEIDNPDLLSHVTVGIRVLDVNDNPPELAREYDIIVCENSKPGQVIHT ISATDKDDFANGPRFNFFLDERLPVNPNFTLKDNEDNTASILTRRRRFSRTVQDVYYL PIMISDGGIPSLSSSSTLTIRVCACERDGRVRTCHAEAFLSSAGLSTGALIAILLCVL ILLAIVVLFITLRRSKKEPLIISEEDVRENVVTYDDEGGGEEDTEAFDITALRNPSAA EELKYRRDIRPEVKLTPRHQTSSTLESIDVQEFIKQRLAEADLDPSVPPYDSLQTYAY EGQRSEAGSISSLDSATTQSDQDYHYLGDWGPEFKKLAELYGEIESERTT" BASE COUNT 877 a 624 c 649 g 730 t ORIGIN 1 tggggagaga tgatggtgtc tgcagcagag tgacagcagt ggagattgag agaaaaggtt 61 gaaaatggtt gcttgtgttc aactactgtg aagagttgga atcagacaac tacggctttg 121 attttccaag tggttcagaa tgtaaatatg atctgacctt tccctgatgg acagttaaat 181 catggacacg ctaaacaatt tactttggac tgtccttcct aaagatgctc tggcttctct 241 cttgaatgct cactaagctt tcgtagctga caaggaaagg aaggaaaact gtgaactgga 301 aagttatctt acaatgaaaa ttactagcac atcttgcatc tgtccagtcc tagtgtgtct 361 ctgttttgtg cagaggtgtt atggaactgc tcaccacagc tccatcaagg tgatgagaaa 421 ccaaaccaaa cacattgaag gtgaaaccga agtccatcat cgtcccaaaa ggggatgggt 481 atggaatcag ttctttgttt tagaagaaca tatgggacca gatcctcagt atgttggaaa 541 gctgcactcc aattctgaca aaggtgatgg atctgtcaag tacatcctta ctggagaggg 601 tgctgggact atatttatca ttgacgatac cacgggtgat atccactcaa caaaaagcct 661 agacagagag cagaagaccc actatgtgct tcatgctcaa gctattgata gacgtacaaa 721 caaacctctt gagcctgaat ccgagttcat catcaaagtg caagacatca atgacaacgc 781 tccaaaattc acagatggac catacattgt tactgtgcct gaaatgtcag atatgggtac 841 ctctgttcta caggtgacag ctactgatgc agatgaccct acctatggaa acagcgctcg 901 ggtggtttac agcattctcc agggacaacc ctacttctcc gtcgacccta aaacaggagt 961 tattagaacg gccttacata acatggacag agaagccaga gaacattact ccgtagtcat 1021 tcaagccaaa gacatggctg ggcaagttgg agggctttca ggatctacaa cagtcaacat 1081 caccttaacc gatgtcaatg acaacccacc acgctttcct caaaaacact atcagctata 1141 tgttcctgag tcagctcaag ttggttcagc tgttgggaaa atcaaggcaa atgatgctga 1201 cactggctca aatgctgaca tgacctactc catcataaat ggtgatggca tgggaatatt 1261 ctcaatctcc actgacaaag agaccagaga aggaatcctt tctttaaaga agccactgaa 1321 ctatgagaaa aagaagtcat ataccctcaa catagaagga gcaaatacac atcttgattt 1381 tcgcttttct cacttgggtc cttttaaaga tgctactatg ctgaagatca ttgttgggga 1441 tgtagatgaa ccaccactat tttccatgcc ttcctacctc atggaagtct acgaaaatgc 1501 caagattggg accgtcgttg gtacagtttt ggcacaagat cctgacagta ctaacagctt 1561 agtaagatac ttcatcaact acaatgttga agacgacaga tttttcaaca ttgatgccaa 1621 tactgggacc attaggacta caaaggttct cgacagagaa gaaactccat ggtacaacat 1681 cacagtcact gcttcagaaa ttgataatcc tgatttgctg agccatgtca cagtgggtat 1741 tagagttctg gatgtcaatg acaatccacc cgaacttgcc agggaatatg atattattgt 1801 atgtgaaaat tctaagcctg gccaggttat tcataccatc agtgccactg ataaagatga 1861 ttttgccaat ggaccaaggt ttaacttctt tcttgatgaa cgcctgcctg taaatccaaa 1921 cttcactctg aaggacaatg aagataacac agccagcatt ctgacaaggc ggaggagatt 1981 tagtcgaact gttcaggatg tgtattatct gcccattatg atctctgatg gtggaatccc 2041 ctctctcagc agcagcagca ccctcaccat cagggtttgt gcatgcgaga gagatgggcg 2101 tgtgcggacc tgccatgcag aagccttcct gtcctcggct ggtttgagta caggagcctt 2161 aatcgctatt cttctctgtg ttctcattct cctggcaatt gtggtacttt ttatcaccct 2221 gaggcgcagc aaaaaagagc ccttgatcat ttcagaagag gatgtacggg agaacgtggt 2281 cacctatgat gatgaaggag gcggagagga agacacagag gcctttgaca tcacagcctt 2341 gaggaatcct tctgctgctg aggagctcaa gtaccggagg gatatcagac ctgaagtgaa 2401 gctcactccc agacaccaga catcatccac cctggaaagc atagatgttc aggaatttat 2461 taagcaaaga ctggcagaag cagacctaga ccctagcgtt cccccttatg actctcttca 2521 gacttatgcc tatgagggtc agagatcaga agctgggtct atcagctcgc tggattcagc 2581 aacgacacaa tcagaccagg attatcacta ccttggagac tggggacccg agtttaaaaa 2641 gttagctgaa ctctatggag aaatagaatc tgaaagaaca acttaggggg tcagttcttg 2701 caaccttgtg gaatttgctt cctgagtaag tggatataca acctcagcaa tgacaaggaa 2761 gaagtgtgga aacagtactt ggaactgagg aagctggaca cagctcctgt agaaacaagt 2821 gcccttttca tgatcgaaac tgggttattt aattggaaga aagtaaaaaa aaaaaaaaaa // LOCUS HSU59435 1697 bp mRNA PRI 19-DEC-1997 DEFINITION Human cell cycle protein p38-2G4 homolog (hG4-1) mRNA, complete cds. ACCESSION U59435 NID g2697004 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1697) AUTHORS Lamartine,J., Seri,M., Cinti,R., Heitzmann,F., Creaven,M., Radomski,N., Jost,E., Lenoir,G.M., Romeo,G. and Sylla,B.S. TITLE Molecular cloning and mapping of a human cDNA (PA2G4) that encodes a protein highly homologous to the mouse cell cycle protein p38-2G4 JOURNAL Cytogenet. Cell Genet. 78 (1), 31-35 (1997) MEDLINE 98005911 REFERENCE 2 (bases 1 to 1697) AUTHORS Sylla,B.S., Lamartine,J., Seri,M., Cinti,R., Heitzmann,F., Craeven,M., Radomsky,N., Jost,E., Haber,D., Lenoir,G. and Romeo,G. TITLE Direct Submission JOURNAL Submitted (29-MAY-1996) GCS, IARC, 150 cours A. Thomas, Lyon 69372, France FEATURES Location/Qualifiers source 1..1697 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="12" /map="12q13" gene 1..1697 /gene="hG4-1" CDS 98..1282 /gene="hG4-1" /codon_start=1 /product="cell cycle protein p38-2G4 homolog" /db_xref="PID:g2697005" /translation="MSGEDEQQEQTIAEDLVVTKYKMGGDIANRVLRSLVEASSSGVS VLSLCEKGDAMIMEETGKIFKKEKEMKKGIAFPTSISVNNCVCHFSPLKSDQDYILKE GDLVKIDLGVHVDGFIANVAHTFVVDVAQGTQVTGRKADVIKAAHLCAEAALRLVKPG NQNTQVTEAWNKVAHSFNCTPIEGMLSHQLKQHVIDGEKTIIQNPTDQQKKDHEKAEF EVHEVYAVDVLVSSGEGKAKDAGQRTTIYKRDPSKQYGLKMKTSRAFFSEVERRFDAM PFTLRAFEDEKKARMGVVECAKHELLQPFNVLYEKEGEFVAQFKFTVLLMPNGPMRIT SGPFEPDLYKSEMEVQDAELKALLQSSASRKTQKKKKKKASKTAENPTSGETLEENEA GD" BASE COUNT 492 a 403 c 429 g 373 t ORIGIN 1 ggatcgaggg gactctgacc acagcctgtg gctgggaagg gagacagagg cggcggcggc 61 tcaggggaaa cgaggctgca gtggtggtag taggaagatg tcgggcgagg acgagcaaca 121 ggagcaaact atcgctgagg acctggtcgt gaccaagtat aagatggggg gcgacatcgc 181 caacagggta cttcggtcct tggtggaagc atctagctca ggtgtgtcgg tactcagcct 241 gtgtgagaaa ggtgatgcca tgattatgga agaaacaggg aaaatcttca agaaagaaaa 301 ggaaatgaag aaaggtattg cttttcccac cagcatttcg gtaaataact gtgtatgtca 361 cttctcccct ttgaagagcg accaggatta tattctcaag gaaggtgact tggtaaaaat 421 tgaccttggg gtccatgtgg atggcttcat cgctaatgta gctcacactt ttgtggttga 481 tgtagctcag gggacccaag taacagggag gaaagcagat gttattaagg cagctcacct 541 ttgtgctgaa gctgccctac gcctggtcaa acctggaaat cagaacacac aagtgacaga 601 agcctggaac aaagttgccc actcatttaa ctgcacgcca atagaaggta tgctgtcaca 661 ccagttgaag cagcatgtca tcgatggaga aaaaaccatt atccagaatc ccacagacca 721 gcagaagaag gaccatgaaa aagctgaatt tgaggtacat gaagtatatg ctgtggatgt 781 tctcgtcagc tcaggagagg gcaaggccaa ggatgcagga cagagaacca ctatttacaa 841 acgagacccc tctaaacagt atggactgaa aatgaaaact tcacgtgcct tcttcagtga 901 ggtggaaagg cgttttgatg ccatgccgtt tactttaaga gcatttgaag atgagaagaa 961 ggctcggatg ggtgtggtgg agtgcgccaa acatgaactg ctgcaaccat ttaatgttct 1021 ctatgagaag gagggtgaat ttgttgccca gtttaaattt acagttctgc tcatgcccaa 1081 tggccccatg cggataacca gtggtccctt cgagcctgac ctctacaagt ctgagatgga 1141 ggtccaggat gcagagctaa aggccctcct ccagagttct gcaagtcgaa aaacccagaa 1201 aaagaaaaaa aagaaggcct ccaagactgc agagaatccc accagtgggg aaacattaga 1261 agaaaatgaa gctggggact gaggtgcgtc ccatctcccc agcttgctgc tcctgcctca 1321 tccccttccc accaaacccc agactctgtg aagtgcagtt cttctccacc taggaccgcc 1381 agcagagcgg ggggatctcc ctgcccccac cccagttccc caacccactc ccttccaaca 1441 acaaccagct ccaactgact ctggtcttgg gaggtgaggc ttcccaacca cggaagacta 1501 ctttaaacga aaaaaagaaa ttgaataata aaatcaggag tcaaaattca tcgtcttcaa 1561 ggcccctctt tctagccttt tctactactc tctgcttggt caaggtttgt gccccactac 1621 agaacagggc taaattagcc accaccactg aaaactcagc cgaatttttt tataccactc 1681 tgacgtcagc atttttt // LOCUS HSU59863 2088 bp mRNA PRI 10-SEP-1996 DEFINITION Human TRAF-interacting protein I-TRAF mRNA, complete cds. ACCESSION U59863 NID g1518017 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2088) AUTHORS Rothe,M., Xiong,J., Shu,H.B., Williamson,K., Goddard,A. and Goeddel,D.V. TITLE I-TRAF is a novel TRAF-interacting protein that regulates TRAF-mediated signal transduction JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (16), 8241-8246 (1996) MEDLINE 96323205 REFERENCE 2 (bases 1 to 2088) AUTHORS Rothe,M., Xiong,J., Shu,H.-B., Williamson,K., Goddard,A. and Goeddel,R. TITLE Direct Submission JOURNAL Submitted (03-JUN-1996) Tularik, Inc., Two Corporate Drive, South San Francisco, CA 94080, USA FEATURES Location/Qualifiers source 1..2088 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 159..1436 /codon_start=1 /product="I-TRAF" /db_xref="PID:g1518018" /translation="MDKNIGEQLNKAYEAFRQACMDRDSAVKELQQKTENYEQRIREQ QEQLSLQQTIIDKLKSQLLLVNSTQDNNYGCVPLLEDSDTRKNTLTLAQPQDKVISGI AREKLPKVRRQEVSSPRKETSARSLGSPLLHERGNIEKTSWDLKEEFHKICMLAKAQK DHLSKLNIPDTATETQCSVPIQCTDKTDKQEALFTPQAKDDINRGAPSITSVTPRGLC RDEEDTSLESLSKFNVKFPPMDNDSTFLHSTPERPGILSPATSEAVCQEKFNMEFRDN PGNFVKTEETLFEIQGIDPIASAIQNLKTTDKTKPSNLVNTCIRTTLDRAACLPPGDH NALYVNSFPLLDPSDAPFPSLDSPGKAIRGPQQPIWKPFPNQDSDSVVLSGTDSELHI PRVCEFCQAVFPPSITSRGDFLRHLNSHFNGET" BASE COUNT 719 a 388 c 378 g 603 t ORIGIN 1 gtttgagcag cattgttaga gcctgtggaa aacactttac aactgtgtaa ctgtcttcat 61 ctttacagag gaatagtcta caaaggaaga cttgtaacct ggagaagaga cctgtcattt 121 actccatcct ttatagtgat gctacaggac gaagaggaat ggataaaaac attggcgagc 181 aactcaataa agcgtatgaa gccttccggc aggcatgcat ggatagagat tctgcagtaa 241 aagaattaca gcaaaagact gagaactatg agcagagaat acgtgaacaa caggaacagc 301 tgtcactcca acagactatt attgacaagc taaaatctca gttacttctt gtgaattcca 361 ctcaagataa caattatggc tgtgtccctc tgcttgaaga cagtgacaca agaaagaata 421 ctttgactct tgctcagcca caagataaag tgatttcagg aatagcaaga gaaaaactac 481 caaaggtaag aagacaagag gtttcttctc ctagaaaaga aacttcagca aggagtcttg 541 gcagtccttt gctccatgaa aggggtaata tagagaagac ttcctgggat ctgaaagaag 601 aatttcataa aatatgcatg ctagcaaaag cacagaaaga ccacttaagc aaacttaata 661 taccagacac tgcaactgaa acacagtgct ctgtgcctat acagtgtacg gataaaacag 721 ataaacaaga agcgctgttt acgcctcagg ctaaagatga tataaataga ggtgcaccat 781 ccatcacatc tgtcacacca agaggactgt gcagagatga ggaagacacc tctttggaat 841 cactttctaa attcaatgtc aagtttccac ctatggacaa tgactcaact ttcttacata 901 gcactccaga gagacccggc atccttagtc ctgccacgtc tgaggcagtg tgccaagaga 961 aatttaatat ggagttcaga gacaacccag ggaactttgt taaaacagaa gaaactttat 1021 ttgaaattca gggaattgac cccatagctt cagctataca aaaccttaaa acaactgaca 1081 aaacaaagcc ctcaaatctc gtaaacactt gtatcaggac aactctggat agagctgcgt 1141 gtttgccacc tggagaccat aatgcattat atgtaaatag cttcccactt ctggacccat 1201 ctgatgcacc ttttccctca ctcgattccc cgggaaaagc aatccgagga ccacagcagc 1261 ccatttggaa gccctttcct aatcaagaca gtgactcggt ggtactaagt ggcacagact 1321 cagaactgca tatacctcga gtatgtgaat tctgtcaagc agttttccca ccatccatta 1381 catccagggg ggatttcctt cggcatctta attcacactt caatggagag acttaagaca 1441 catttgaaaa cagacatatc aagttctatg tgatgatttt gggtttttaa tactataaat 1501 acttgattgt aaactaaatt caagatcatt tataggaaaa tctagtttca cagctatttg 1561 aatttttttc tggatttact atataactct tattttttaa aagatcattc tgttctttca 1621 aggagaaata agcctaaaag aagaaaaaca aaaaaaattc tgtataaaac tgtaatcctt 1681 tgtattcatg tttacagtgc tattactata attcaaaatt atgtatgtga cttagagtta 1741 tataatcata atttatgttt atttcaaata tctaagttta ttgcttggat ttctagtgag 1801 agctgttgaa tttggtgatg tcaaatgttt ctagggtttt ttagtttgtt tttattgaga 1861 aaattgatta tttatgctat aggtgatatt ctctttgaat aaacctataa taggaaatag 1921 cagaccacat aaacatcttt gtaaatatca aacctaatac atttcttgtc cagtgataaa 1981 acaactggta gaattattta aacactttag atttttaaat aataaacatg gctttaattt 2041 ttactgtgtg tatagctaca tgatgaaatt aattaaatat taagaggt // LOCUS HSU59914 1280 bp mRNA PRI 01-NOV-1996 DEFINITION Human chromosome 15 Mad homolog Smad6 mRNA, complete cds. ACCESSION U59914 NID g1654326 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1280) AUTHORS Riggins,G.J., Thiagalingam,S., Rozenblum,E., Weinstein,C.L., Kern,S.E., Hamilton,S.R., Willson,J.K., Markowitz,S.D., Kinzler,K.W. and Vogelstein,B. TITLE Mad-related genes in the human JOURNAL Nature Genet. 13 (3), 347-349 (1996) MEDLINE 96259564 REFERENCE 2 (bases 1 to 1280) AUTHORS Riggins,G.J., Thiagalingam,S., Kinzler,K.W. and Vogelstein,B.V. TITLE Direct Submission JOURNAL Submitted (04-JUN-1996) Oncology Center, Rm. 109, Johns Hopkins, 424 N. Bond St., Baltimore, MD 21231, USA FEATURES Location/Qualifiers source 1..1280 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="15" CDS 113..820 /note="Mad homolog" /codon_start=1 /product="Smad6" /db_xref="PID:g1654327" /translation="MSRMGKPIETQKSPPPPYSRLSPRDEYKPLDLSDSTLSYTETEA TNSLITAPGEFSDASMSPDATKPSHWCSVAYWEHRTRVGRLYAVYDQAVSIFYDLPQG SGFCLGQLNLEQRSESVRRTRSKIGFGILLSKEPDGVWAYNRGEHPIFVNSPTLDAPG GRALVVRKVPPGYSIKVFDFERSGLQHAPEPDAADGPYDPNSVRISFAKGWGPCYSRQ FITSCPCWLEILLNNPR" BASE COUNT 320 a 378 c 311 g 271 t ORIGIN 1 aaaagaacga atccagcacc aaaacgtgct acaacatgga tgaacttcga tgactttgtg 61 ccacatgaaa gaagaagcca gccacaaaag gccatatatt gtatgaaatg aaatgtccag 121 aatgggcaaa cccatagaga cacaaaaatc tccgccacct ccctactctc ggctgtctcc 181 tcgcgacgag tacaagccac tggatctgtc cgattccaca ttgtcttaca ctgaaacgga 241 ggctaccaac tccctcatca ctgctccggg tgaattctca gacgccagca tgtctccgga 301 cgccaccaag ccgagccact ggtgcagcgt ggcgtactgg gagcaccgga cgcgcgtggg 361 ccgcctctat gcggtgtacg accaggccgt cagcatcttc tacgacctac ctcagggcag 421 cggcttctgc ctgggccagc tcaacctgga gcagcgcagc gagtcggtgc ggcgaacgcg 481 cagcaagatc ggcttcggca tcctgctcag caaggagccc gacggcgtgt gggcctacaa 541 ccgcggcgag caccccatct tcgtcaactc cccgacgctg gacgcgcccg gcggccgcgc 601 cctggtcgtg cgcaaggtgc cccccggcta ctccatcaag gtgttcgact tcgagcgctc 661 gggcctgcag cacgcgcccg agcccgacgc cgccgacggc ccctacgacc ccaacagcgt 721 ccgcatcagc ttcgccaagg gctgggggcc ctgctactcc cggcagttca tcacctcctg 781 cccctgctgg ctggagatcc tcctcaacaa ccccagatag tggcggcccc ggcgggaggg 841 gcgggtggga ggccgcggcc accgccacct gccggcctcg agaggggccg atgcccagag 901 acacagcccc cacggacaaa accccccaga tatcatctac ctagatttaa tataaagttt 961 tatatattat atggaaatat atattatact tgtaattatg gagtcatttt tacaatgtaa 1021 ttatttatgt atggtgcaat gtgtgtatat ggacaaaaca agaaagacgc actttggctt 1081 ataattcttt caatacagat atattttctt tctcttcctc cttcctcttc cttacttttt 1141 atatatatat ataaagaaaa tgatacagca gagctaggtg gaaaagcctg ggtttggtgt 1201 atggtttttg agatattaat gcccagacaa aaagctaata ccagtcactc gataataaag 1261 tattcgcatt ataaaaaaga // LOCUS HSU59919 2648 bp mRNA PRI 01-NOV-1996 DEFINITION Human Smg GDS-associated protein SMAP mRNA, complete cds. ACCESSION U59919 NID g1633620 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2648) AUTHORS Shimizu,K., Kawabe,H., Minami,S., Honda,T., Takaishi,K., Shirataki,H. and Takai,Y. TITLE SMAP, an Smg GDS-associating protein having arm repeats and phosphorylated by Src tyrosine kinase JOURNAL J. Biol. Chem. 271 (43), 27013-27017 (1996) MEDLINE 97059159 REFERENCE 2 (bases 1 to 2648) AUTHORS Shimizu,K. and Takai,Y. TITLE Direct Submission JOURNAL Submitted (03-JUN-1996) Molecular Biology and Biochemistry, Osaka University Medical School, 2-2 Yamada-oka, Suita, Osaka 565, Japan FEATURES Location/Qualifiers source 1..2648 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" 5'UTR 1..188 CDS 189..2567 /note="Smg GDS-associated protein having arm repeats and phosphorylated by Src tyrosine kinase" /codon_start=1 /product="SMAP" /db_xref="PID:g1654321" /translation="MQGEDARYLKRKVKGGNIDVHPSEKALIVHYEVEATILGEMGDP MLGERKECQKIIRLKSLNANTDITSLARKVVEECKLIHPSKLNEVELLLYYLQNRRDS LSGKEKKEKSSKPKDPPPFEGMEIDEVANINDMDEYIELLYEDIPDKVRGSALILQLA RNPDNLEELLLNETALGALARVLREDWKQSVELATNIIYIFFCFSSFSQFHGLITHYK IGALCMNIIDHELKRHELWQEELSKKKKAVDEDPENQTLRKDYEKTFKKYQGLVVKQE QLLRVALYLLLNLAEDTRTELKMRNKNIVHMLVKALDRDNFELLILVVSFLKKLSIFM ENKNDMVEMDIVEKLVKMIPCEHEDLLNITLRLLLNLSFDTGLRNKMVQVGLLPKLTA LLGNDNYKQIAMCVLYHISMDDRFKSMFAYTDCIPQLMKMLFECSDERIDLELISFCI NLAANKRNVQLICEGNGLKMLMKRALKFKDPLLMKMIRNISQHDGPTKNLFIDYVGDL AAQISNDEEEEFVIECLGTLANLTIPDLDWELVLKEYKLVPYLKDKLKPGAAEDDLVL EVVIMIGTVSMDDSCAALLAKSGIIPALIELLNAQQEDDEFVCQIIYVFYQMVFHQAT RDVIIKETQAPAYLIDLMHDKNNEIRKVCDNTLDIIAEYDEEWAKKIQSEKFRWHNSQ WLEMVESRQMDESEQYLYGDDRIEPYIHEGDILERPDLFYNSDGLIASEGAISPDFFN DYHLQNGDVVGQHSFPGSLGMDGFGQPVGILGRPATAYGFRPDEPYYYGYGS" 3'UTR 2568..2648 BASE COUNT 827 a 495 c 592 g 734 t ORIGIN 1 gtgacagaag ctgtgggagg agctggaggc ttcaccgtgg taatcacagc gccgctgctg 61 ccccgccttg caggtctcag gactgtcatc gcctctgggt gtgagggtac tttggccacc 121 gtccccggaa ataaccgcgc ctgcctctca agatacccca tcctctccac gccgctgccg 181 ctgccgccat gcaaggggag gacgccagat acctcaaaag gaaagttaaa ggagggaata 241 tagatgtaca tccatcagaa aaagcactca ttgttcacta tgaagtggaa gctaccattc 301 ttggagaaat gggggacccc atgttgggag aacgaaaaga atgtcaaaaa atcattcgac 361 ttaagagtct caatgccaac acagatataa cttccctggc aaggaaggtg gttgaagaat 421 gtaaactcat tcatccttca aaactaaatg aggtagaact gctgttgtac tatctacaga 481 accgccgtga ttcattgtca ggaaaagaga aaaaagaaaa atcaagcaag cctaaagatc 541 cacctccttt tgaaggaatg gagattgatg aagttgctaa cattaatgac atggatgaat 601 atattgagtt attatatgaa gatattcctg acaaagttcg gggttctgct ttgatcctgc 661 agcttgctcg aaatcctgat aacttggaag aactactatt gaatgaaact gcccttggtg 721 cattagcaag ggtcctgaga gaagactgga agcaaagtgt cgagttagct acaaacataa 781 tttacatctt tttttgtttc tccagctttt ctcaatttca tggacttatt actcactata 841 aaattggagc tctgtgtatg aatattattg atcatgagtt aaaaagacat gagctttggc 901 aagaagaact ctcaaagaag aagaaagctg ttgatgaaga ccctgaaaac caaaccttga 961 gaaaggatta tgaaaaaacc tttaaaaagt accaggggct tgtggtaaaa caggaacagc 1021 tattacgagt tgctctttat ttgcttctga atcttgctga ggatactcgt accgaactga 1081 aaatgaggaa caagaacata gttcacatgt tggtgaaagc ccttgatcgg gacaattttg 1141 agctgctaat tttagttgtg tcattcttga agaaactcag catttttatg gagaataaaa 1201 atgatatggt ggaaatggat attgttgaaa aactggtgaa aatgatacct tgtgagcatg 1261 aagacctgct gaatatcacc ctccgacttt tactaaacct atcctttgac acaggactga 1321 ggaataagat ggtacaagtt ggactgcttc ccaagctcac tgcactccta ggcaatgaca 1381 actacaaaca aatagcaatg tgtgttcttt accacataag catggatgac cgctttaaat 1441 caatgtttgc atacactgac tgtataccac agttaatgaa gatgctgttt gaatgttcag 1501 atgaacgaat tgacttggaa ctcatttctt tctgcattaa tcttgctgct aacaaaagaa 1561 atgtacagct tatctgtgaa ggaaatgggc tgaagatgct catgaagagg gctctgaagt 1621 ttaaggatcc attgctgatg aaaatgatta gaaacatttc tcagcatgat ggaccaacta 1681 aaaatctgtt tattgattat gttggggacc ttgcagccca gatctctaat gatgaagaag 1741 aggagttcgt gattgaatgt ttgggaactc ttgcaaactt gaccattcca gacttagact 1801 gggaattggt tcttaaagaa tataagttgg ttccatacct caaggataaa ctaaaaccag 1861 gtgctgcaga agatgatctt gttttagaag tggttataat gattggaact gtatccatgg 1921 atgactcttg tgctgcattg ctagccaaat ctggcataat ccctgcactc attgaattgc 1981 taaatgctca acaagaagat gatgaatttg tgtgtcagat aatttatgtc ttctaccaga 2041 tggttttcca ccaagccaca agagacgtca taatcaagga aacacaggct ccagcatatc 2101 tcatagacct aatgcatgat aagaataatg aaatccgaaa ggtctgtgat aatacattag 2161 atattatagc ggaatatgat gaagaatggg ctaagaaaat tcagagtgaa aagtttcgct 2221 ggcataactc tcagtggctg gagatggtag agagtcgtca gatggatgag agtgagcagt 2281 acttgtatgg tgatgatcga attgagccat acattcatga aggagatatt ctcgaaagac 2341 ctgacctttt ctacaactca gatggattaa ttgcctctga aggagccata agtcccgatt 2401 tcttcaatga ttaccacctt caaaatggag atgttgttgg gcagcattca tttcctggca 2461 gccttggaat ggatggcttt ggccaaccag ttggcattct tggacgccct gccacagcat 2521 atggattccg ccctgatgaa ccttactact atggctatgg atcttgataa agtatctgtt 2581 tccatgtgta atctcagctt agaagaaatc tgtgtgggtt gggttaattt tggatctttg 2641 cctaataa // LOCUS HSU60060 1654 bp mRNA PRI 13-MAY-1997 DEFINITION Human FEZ1 mRNA, complete cds. ACCESSION U60060 NID g1927201 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1654) AUTHORS Bloom,L. and Horvitz,H.R. TITLE The Caenorhabditis elegans gene unc-76 and its human homologs define a new gene family involved in axonal outgrowth and fasciculation JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (7), 3414-3419 (1997) MEDLINE 97250552 REFERENCE 2 (bases 1 to 1654) AUTHORS Bloom,L. and Horvitz,H.R. TITLE Direct Submission JOURNAL Submitted (05-JUN-1996) Center for Cancer Research, Massachusetts Institute of Technology, 77 Massachusetts Ave., Cambridge, MA 02139, USA FEATURES Location/Qualifiers source 1..1654 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="42963" gene 100..1278 /gene="FEZ1" CDS 100..1278 /gene="FEZ1" /note="similar to C. elegans UNC-76" /codon_start=1 /product="FEZ1" /db_xref="PID:g1927202" /translation="MEAPLVSLDEEFEDLRPSCSEDPEEKPQCFYGSSPHHLEDPSLS ELENFSSEIISFKSMEDLVNEFDEKLNVCFRNYNAKTENLAPVKNQLQIQEEEETLQD EEVWDALTDNYIPSLSEDWRDPNIEALNGNCSDTEIHEKEEEEFNEKSENDSGINEEP LLTADQVIEEIEEMMQNSPDPEEEEEVLEEEDGGETSSQADSVLLQEMQALTQTFNNN WSYEGLRHMSGSELTELLDQVEGAIRDFSEELVQQLARRDELEFEKEVKNSFITVLIE VQNKQKEQRELMKKRRKEKGLSLQSSRIEKGNQMPLKRFSMEGISNILQSGIRQTFGS SGTDKQYLNTVIPYEKKASPPSVEDLQMLTNILFAMKEDNEKVPTLLTDYILKVLCPT " BASE COUNT 472 a 400 c 438 g 344 t ORIGIN 1 acgagggtcc ggctgagccc cgggatccgc ctccctccgc caggacccgc acagataaac 61 tcatcctgaa agtcgctgtt gttctcctgc tgagcaagaa tggaggcccc actggtgagt 121 ctggatgaag agtttgagga ccttcgaccc tcctgctcgg aggacccgga ggagaagccc 181 cagtgtttct atggttcatc tccccaccat ctcgaggacc cctccctctc cgagcttgag 241 aatttttctt ccgaaataat cagcttcaag tccatggagg acctcgtaaa tgaatttgat 301 gagaagctca atgtctgctt tcggaactac aacgccaaga ccgagaacct agctcccgtg 361 aagaaccagt tacagatcca agaggaggag gagacccttc aggacgagga ggtttgggat 421 gctctgacag acaattacat cccttcactc tcagaagact ggagggatcc aaacatcgag 481 gctctgaatg gcaactgctc tgacactgag atccatgaga aagaagagga agagttcaat 541 gagaagagtg aaaatgattc cggtatcaac gaggagcctc tgctcacagc agatcaggta 601 attgaggaga ttgaggaaat gatgcagaac tccccagacc ctgaggaaga agaggaggtt 661 ctggaagaag aggatggagg agaaacttcc tcccaggcag actcggtcct cctgcaggag 721 atgcaggcat tgacacagac cttcaacaac aactggtcct atgaagggct gaggcacatg 781 tctgggtctg agctgaccga gctgctggac caggtggagg gtgccatccg tgacttctcg 841 gaggagctgg tgcagcagct ggcccgccgg gacgagctgg agtttgagaa ggaagtgaag 901 aactccttta tcacggtgct tattgaggtt cagaacaagc agaaggagca gcgagaactg 961 atgaaaaaga ggcggaaaga gaaagggctg agcctgcaga gcagccggat agagaaggga 1021 aaccagatgc ctctcaagcg cttcagcatg gaaggcatct ccaacattct gcagagtggc 1081 atccgccaga cctttggctc ctcaggaact gacaaacagt atctgaacac agtcattcct 1141 tacgagaaga aagcctctcc tccctcagtg gaagacctgc agatgctgac aaacattctc 1201 tttgccatga aggaggataa tgagaaggtg cctactttgc taacggacta cattttaaaa 1261 gtgctctgcc ctacctaacc ttgccctttg gagcagcctc gctgcaggag gtcactgagc 1321 aagagtcatt ccatcacagg gactgcatga gaccatgtaa cctccgacat gtatttaaac 1381 gtgtatagct taacctggat taaacacgag caagcgcgcg gggtcctttg ccgttggctt 1441 ctagtgctag taatcattgg atgcatgatg gggcagggcc ggtgatggtg cctccccctt 1501 gctggtgtca ggagagggga aggcagccgc tttcaccgct cattatgtag tctggctaca 1561 gccctccaaa acagcttata ctcttaagac taattttgaa ataaaacctt catttaatta 1621 aaaaaaaaaa aaaaaaaaaa aaaaaaaaat taaa // LOCUS HSU60205 1751 bp mRNA PRI 13-JUL-1996 DEFINITION Human methyl sterol oxidase (ERG25) mRNA, complete cds. ACCESSION U60205 NID g1408205 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1751) AUTHORS Li,L. and Kaplan,J. TITLE Characterization of yeast methyl sterol oxidase (ERG25) and identification of a human homologue JOURNAL J. Biol. Chem. 271 (28), 16927-16933 (1996) MEDLINE 96279274 REFERENCE 2 (bases 1 to 1751) AUTHORS Kaplan,J. and Li,L. TITLE Direct Submission JOURNAL Submitted (06-JUN-1996) Department of Pathology, University of Utah, 50 North Medical Drive, Salt Lake City, UT 84132, USA FEATURES Location/Qualifiers source 1..1751 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="intestine" /chromosome="4" /map="4q32-34" gene 27..908 /gene="ERG25" CDS 27..908 /gene="ERG25" /note="enzyme involved in the cholesterol biosynthetic pathway; iron is an essential cofactor needed for enzyme activity; human homologue of Saccharomyces cerevisiae C-4 methyl sterol oxidase" /codon_start=1 /product="methyl sterol oxidase" /db_xref="PID:g1408206" /translation="MATNESVSIFSSASLAVEYVDSLLPENPLQEPFKNAWNYMLNNY TKFQIATWGSLIVHEALYFLFCLPGFLFQFIPYMKKYKIQKDKPETWENQWKCFKVLL FNHFCIQLPLICGTYYFTEYFNIPYDWERMPRWYFLLARCFGCAVIEDTWHYFLHRLL HHKRIYKYIHKVHHEFQAPFGMEAEYAHPLETLILGTGFFIGIVLLCDHVILLWAWVT IRLLETIDVHSGYDIPLNPLNLIPFYAGSRHHDFHHMNFIGNYASTFTWWDRIFGTDS QYNAYNEKRKKFEKKTE" BASE COUNT 517 a 336 c 317 g 581 t ORIGIN 1 gcgagatgac tgcagagatt tgaaaaatgg caacaaatga aagtgtcagc atctttagtt 61 cagcatcctt ggctgtggaa tatgtagatt cacttttacc tgagaatcct ctgcaagaac 121 catttaaaaa tgcttggaac tatatgttga ataattatac aaagttccag attgcaacat 181 ggggatccct tatagttcat gaagcccttt atttcttatt ctgtttacct ggatttttat 241 ttcaatttat accttatatg aaaaaataca aaattcaaaa ggataagcca gagacatggg 301 aaaaccaatg gaagtgtttc aaagttcttc tctttaatca cttctgtatc cagctgcctt 361 tgatttgtgg aacctattat tttacagagt atttcaatat tccttatgat tgggaaagaa 421 tgccaagatg gtattttctt ttggcaagat gctttggttg tgcagtcatt gaagatactt 481 ggcactattt tctgcataga ctcttacacc acaaaagaat atacaagtat attcataaag 541 ttcatcatga gtttcaggct ccatttggaa tggaagctga atatgcacat cctttggaga 601 ctctaattct tggaactgga tttttcattg gaatcgtgct tttgtgtgat catgtaattc 661 ttctttgggc atgggtgacc attcgtttat tagaaactat tgatgtccat agtggttatg 721 atattcctct caacccttta aatctgatcc ctttctatgc tggttctcgg catcatgatt 781 tccaccacat gaacttcatt ggaaactatg cttcaacatt tacatggtgg gatcgaattt 841 ttggaacaga ctctcagtat aatgcctata atgaaaagag gaagaagttt gagaaaaaga 901 ctgaataaat atctcacgta aaccttcctg aaagataaac gttttcctga attcagaaac 961 tagtagctaa cattgcttct ggagagcaga aataagcatg tcttctggct actaagtgat 1021 aaaaagaaca ttaacaacct ttaattacct tcctagtggg aactttttct actttaccta 1081 caagttctat atatgtagaa atgaataaat atatatttaa gtacagtttt catgaggaag 1141 ttttaaaaga ccatgttcct aagcttccaa gaaggttttg gatactagaa gtattaatct 1201 atggcttttc tcccagtaaa accataggcc tgaagttcac attgggtctt taaatctttt 1261 agatatatac tggtcatttc agaaaattct tcatagtggt attggcctta tatttaactt 1321 tttttttatt ttttttttga gacaaagcca cactctgtct ccttgtctgg agtgtggtgg 1381 cacagtctca gctcactgca acctctgcct cccagttcaa gcaattcttc tgcctcagcc 1441 tcccaagtag ctgggattac aggcacccgc caccacgccc agctaatttt tgtatttttg 1501 tagagatggg gtttctcgat gttggccagg ctggtctcaa acttctgacc tcaagtgatc 1561 tgcccacctt ggcctcccaa agtgctggga ttacaggtgt aagccactgc gcccggcctt 1621 tttaacttta aacatgtttt agaattcacc taaagatcaa aatatcatgg attgaacctc 1681 atcaattgat agcagtgagt gactgaagct tccaaatcaa gaaaagccgg caccaagaac 1741 ttccattcta a // LOCUS HSU60276 1216 bp mRNA PRI 11-OCT-1996 DEFINITION Human hASNA-I mRNA, complete cds. ACCESSION U60276 NID g1616740 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1216) AUTHORS Kurdi-Haidar,B., Aebi,S., Heath,D., Enns,R.E., Naredi,P., Hom,D.K. and Howell,S.B. TITLE Isolation of the ATP-binding human homolog of the arsA component of the bacterial arsenite transporter JOURNAL Genomics 36 (3), 486-491 (1996) MEDLINE 97038691 REFERENCE 2 (bases 1 to 1216) AUTHORS Kurdi-Haidar,B. and Howell,S.B. TITLE Direct Submission JOURNAL Submitted (09-JUN-1996) UCSD Cancer Center, UCSD, 9500 Gilman Drive, La Jolla, CA 92093-0812, USA FEATURES Location/Qualifiers source 1..1216 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1..999 /note="bacterial ArsA homolog" /codon_start=1 /product="hASNA-I" /db_xref="PID:g1616741" /translation="MLLDVEPLEPTLSNIIEQRSLKWIFVGGKGGVGKTTCSCSLAVQ LSKGRESVLIISTDPAHNISDAFDQKFSKVPTKVKGYDNLFAMEIDPSLGVADVPDEF FEEDNMLSMGKKMMQEAMSAFPGIDEAMSYAEVMRLVKGMNFSVVVFDTAPTGHTLRL LNFPTIVERGLGRLMQIKNQISPFISQMCNMLGLGDMNADQLASKLEETLPVIRSVSE QFKDPEQTTFICVCIAEFLSLYETERLIQELAKCKIDTHNIIVNQLVFPDPEKPCKMC EARHKIQAKYLDQMEDLYEDFHIVKLPLLPHEVRGADKVNTFSALLLEPYKPPSAQ" BASE COUNT 276 a 367 c 337 g 236 t ORIGIN 1 atgctcctcg atgtggagcc gctggagcct acacttagca acatcatcga gcagcgcagc 61 ctgaagtgga tcttcgtcgg gggcaagggt ggtgtgggca agaccacctg cagctgcagc 121 ctggcagtcc agctctccaa ggggcgtgag agtgttctga tcatctccac agacccagca 181 cacaacatct cagatgcttt tgaccagaag ttctcaaagg tgcctaccaa ggtcaaaggc 241 tatgacaacc tctttgctat ggagattgac cccagcctgg gcgtggcgga cgtgcctgac 301 gagttcttcg aggaggacaa catgctgagc atgggcaaga agatgatgca ggaggccatg 361 agcgcatttc ccggcatcga tgaggccatg agctatgccg aggtcatgag gctggtgaag 421 ggcatgaact tctcggtggt ggtatttgac acggcaccca cgggccacac cctgaggctg 481 ctcaacttcc ccaccatcgt ggagcggggc ctgggccggc ttatgcagat caagaaccag 541 atcagccctt tcatctcaca gatgtgcaac atgctgggcc tgggggacat gaacgcagac 601 cagctggcct ccaagctgga ggagacgctg cccgtcatcc gctcagtcag cgaacagttc 661 aaggaccctg agcagacaac tttcatctgc gtatgcattg ctgagttcct gtccctgtat 721 gagacagaga ggctgatcca ggagctggcc aagtgcaaga ttgacacaca caatataatt 781 gtcaaccagc tcgtcttccc cgaccccgag aagccctgca agatgtgtga ggcccgtcac 841 aagatccagg ccaagtatct ggaccagatg gaggacctgt atgaagactt ccacatcgtg 901 aagctgccgc tgttacccca tgaggtgcgg ggggcagaca aggtcaacac cttctcggcc 961 ctcctcctgg agccctacaa gccccccagt gcccagtagc acagctgcca gccccaaccg 1021 ctgccatttc acactcaccc tccaccctcc ccaccccctc ggggcagagt ttgcacaaag 1081 tcccccccat aatacagggg gagccacttg ggcaggaggc agggaggggt ccattccccc 1141 tggtggggct ggtggggagc tgtagttgcc ccctacctct cccacctctt gctcttcaat 1201 aaatgatctt aaactg // LOCUS HSU60519 3536 bp mRNA PRI 20-AUG-1996 DEFINITION Human apoptotic cysteine protease Mch4 (Mch4) mRNA, complete cds. ACCESSION U60519 NID g1498323 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2660; 1 to 1001) AUTHORS Fernandes-Alnemri,T., Litwack,G. and Alnemri,E.S. TITLE CPP32, a novel human apoptotic protein with homology to C. elegans cell death protein CED-3 and mammalian interleukin-1-beta converting enzyme JOURNAL J. Biol. Chem. 269, 308761-308764 (1994) REFERENCE 2 (bases 1 to 2309) AUTHORS Fernandes-Alnemri,T.F., Takahashi,A., Armstrong,R., Krebs,J., Fritz,L., Tomaselli,K.J., Wang,L., Yu,Z., Croce,C.M., Salvesen,G., Earnshaw,W.C., Litwack,G. and Alnemri,E.S. TITLE Mch3, a novel human apoptotic cysteine protease highly related to CPP32 JOURNAL Cancer Res. 55 (24), 6045-6052 (1995) MEDLINE 96105019 REFERENCE 3 (bases 1 to 3536) AUTHORS Fernandes-Alnemri,T.F., Armstrong,R.C., Krebs,J., Srinivasula,S.M., Wang,L., Bullrich,F., Fritz,L.C., Trapani,J.A., Tomaselli,K.J., Litwack,G. and Alnemri,E.S. TITLE In vitro activation of CPP32 and Mch3 by Mch4, a novel human apoptotic cysteine protease containing two FADD-like domains JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (15), 7464-7469 (1996) MEDLINE 96353838 REFERENCE 4 (bases 1 to 3536) AUTHORS Alnemri,E.S. TITLE Direct Submission JOURNAL Submitted (11-JUN-1996) Pharmacology, Thomas Jefferson University, Jefferson Cancer Institute, 233, S. Tenth Street, Philadelphia, PA 19107, USA FEATURES Location/Qualifiers source 1..3536 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="t96-18B.1" /cell_line="Jurkat" /cell_type="T-lymphocyte" gene 148..1587 /gene="Mch4" CDS 148..1587 /gene="Mch4" /function="activates CPP32 and Mch3" /note="a new member of the aspartate-specific cysteine protease family; the prodomain of this protease contains FADD-like death effector domains" /codon_start=1 /evidence=experimental /product="apoptotic cysteine protease Mch4" /db_xref="PID:g1498324" /translation="MKSQGQHWYSSSDKNCKVSFREKLLIIDSNLGVQDVENLKFLCI GLVPNKKLEKSSSASDVFEHLLAEDLLSEEDPFFLAELLYIIRQKKLLQHLNCTKEEV ERLLPTRQRVSLFRNLLYELSEGIDSENLKDMIFLLKDSLPKTEMTSLSFLAFLEKQG KIDEDNLTCLEDLCKTVVPKLLRNIEKYKREKAIQIVTPPVDKEAESYQGEEELVSQT DVKTFLEALPRAAVYRMNRNHRGLCVIVNNHSFTSLKDRQGTHKDAEILSHVFQWLGF TVHIHNNVTKVEMEMVLQKQKCNPAHADGDCFVFCILTHGRFGAVYSSDEALIPIREI MSHFTALQCPRLAEKPKLFFIQACQGEEIQPSVSIEADALNPEQAPTSLQDSIPAEAD FLLGLATVPGYVSFRHVEEGSWYIQSLCNHLKKLVPRHEDILSILTAVNDDVSRRVDK QGTKKQMPQPAFTLRKKLVFPVPLDALSI" polyA_site 3536 BASE COUNT 944 a 820 c 818 g 954 t ORIGIN 1 tgaagtctct tcccaagcaa atgggagctt ctttggacct tggagcacac agaggattct 61 actttcttta aaactttgtt ttcaggcaat ttccctgaga accgtttact tccagaagat 121 tggtggagct tgatctgaag gctggccatg aaatctcaag gtcaacattg gtattccagt 181 tcagataaaa actgtaaagt gagctttcgt gagaagcttc tgattattga ttcaaacctg 241 ggggtccaag atgtggagaa cctcaagttt ctctgcatag gattggtccc caacaagaag 301 ctggagaagt ccagctcagc ctcagatgtt tttgaacatc tcttggcaga ggatctgctg 361 agtgaggaag accctttctt cctggcagaa ctcctctata tcatacggca gaagaagctg 421 ctgcagcacc tcaactgtac caaagaggaa gtggagcgac tgctgcccac ccgacaaagg 481 gtttctctgt ttagaaacct gctctacgaa ctgtcagaag gcattgactc agagaactta 541 aaggacatga tcttccttct gaaagactcg cttcccaaaa ctgaaatgac ctccctaagt 601 ttcctggcat ttctagagaa acaaggtaaa atagatgaag ataatctgac atgcctggag 661 gacctctgca aaacagttgt acctaaactt ttgagaaaca tagagaaata caaaagagag 721 aaagctatcc agatagtgac acctcctgta gacaaggaag ccgagtcgta tcaaggagag 781 gaagaactag tttcccaaac agatgttaag acattcttgg aagccttacc gagggcagct 841 gtgtacagga tgaatcggaa ccacagaggc ctctgtgtca ttgtcaacaa ccacagcttt 901 acctccctga aggacagaca aggaacccat aaagatgctg agatcctgag tcatgtgttc 961 cagtggcttg ggttcacagt gcatatacac aataatgtga cgaaagtgga aatggagatg 1021 gtcctgcaga agcagaagtg caatccagcc catgccgacg gggactgctt cgtgttctgt 1081 attctgaccc atgggagatt tggagctgtc tactcttcgg atgaggccct cattcccatt 1141 cgggagatca tgtctcactt cacagccctg cagtgcccta gactggctga aaaacctaaa 1201 ctctttttca tccaggcctg ccaaggtgaa gagatacagc cttccgtatc catcgaagca 1261 gatgctctga accctgagca ggcacccact tccctgcagg acagtattcc tgccgaggct 1321 gacttcctac ttggtctggc cactgtccca ggctatgtat cctttcggca tgtggaggaa 1381 ggcagctggt atattcagtc tctgtgtaat catctgaaga aattggtccc aagacatgaa 1441 gacatcttat ccatcctcac tgctgtcaac gatgatgtga gtcgaagagt ggacaaacag 1501 ggaacaaaga aacagatgcc ccagcctgct ttcacactaa ggaaaaaact agtattccct 1561 gtgcccctgg atgcactttc aatatagcag agagtttttg ttggttctta gacctcaaac 1621 gaatcattgg gtataacctc cagcctcctg cccagcacag gaatcggtgg tctccacctg 1681 tcattctaga aacaggaaac accgtgtttt ctgacacagt caattctgat tttctttttc 1741 ttttgcaagt ctaaatgtta gaaaactttc tttttttgga gatagtctca ttctgtcacc 1801 cagactggag tgcagggggg caatcacggc tcactgtagt ctcgacctcc caggctcaag 1861 ctgtcctccc acctcagcct cccaagtagc tgagactaca ggtgtgtgtc catgcacagc 1921 taacttttta ttttttttgt ggagatgggg tttcactatg ttgcctaagc tggtctcaaa 1981 ctcctgggct caagcgatcc tcccacctca gcttctcaaa gttctgggac tacaggcatg 2041 aaatactgtg cctggcctgg ggaccaggtg cattttaagg ttccttggtg ttcaaaaacc 2101 acgttcttag cctagattga gcttagattg cctctctaga caactacccc ttagttataa 2161 ttctgtgtcc cctctgcatg cccttaaaca ttggacagtg aggtcacagt ccacccaccc 2221 tctctctgat ctcccccttc ctaagacttc tcttttgcac atctagtgag gtgaaaattt 2281 ggtctatgcc aggcccattt cctgcttttg tgtaaggaag gtgctcacat aggaagtttt 2341 tatttggtta gagacaggtt tccctgtagg aagatgatgg ctcatttaca ctcagctgct 2401 ctgcaagcag aaactttaca acctgatgtc atattccatt ttggactggg tgcggtgact 2461 catgcctgta atcccagtac tctgggaagc caaggcaggc agatcacttg aggtcaggag 2521 ttcgagacca gcctggccaa tacggcaaaa cctcatcatt actaaaaaca caaaaattag 2581 ccaggtgtgg cggcgagcac ctgtaatccc agctactcgg gaggctgaga caggagaatc 2641 tcttgaatcc aggaggcaga ggctgtggtg agccaagatg acacaactgc actccagctt 2701 gggcaacagg gcgagacctt gtttaaaaaa aaaattcaat attggggttg gaacatttca 2761 gttgccattg acagaacacc caattcaaat tgactgaagc aaagaaggga atttattgcc 2821 tctttcacat tgaaacccag gagtggataa cactggcttc aggcaaagct tgaatcagga 2881 ctcaatctac aggccagcac ctttctcttg gccggatgtc ctcagggctg gcagatgcag 2941 tagactgcag tggacagtcc ccaccttgtt actgctacta cactttgctc ctctggccca 3001 aggcatgagg agagaggctg tgtcagaaac tgaagctgtt ctcaggatca ctgggctctt 3061 cttggcagag gggatgtctg gcttgcctga agggagtggc tctgtaagga cgccttgatg 3121 ctttcttcat taagattttg agcattttta cgtacttgag cttttttttt tttttttttc 3181 aatttctaga ggaacttttt ctctgttaat tcctggaact gtattttgaa tccttaaagg 3241 tgagccctca tagggagatc caaagtcctg tggttaacgc cttcatttat agatgaggca 3301 gctgaggcct ggggatgtga acaacctgct cacagtcctc atttactgga tttgacttca 3361 gccaggtgaa ctggaatgcc ttggggcgtg gaagggcatt aggagtgttt catttgatat 3421 gtgaatgctc ataaaaaaat gtcaaggaat gaagaacaac aactctcagt ggtgcctgca 3481 tttataatta tttatgtgaa agtcaaattc atgtacagta aatttgttat aagaat // LOCUS HSU60644 2131 bp mRNA PRI 08-OCT-1996 DEFINITION Human HU-K4 mRNA, complete cds. ACCESSION U60644 NID g1575346 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2131) AUTHORS Upton,C., Cao,J. and Koop,B. TITLE Direct Submission JOURNAL Submitted (12-JUN-1996) Biochemistry & Microbiology, University of Victoria, PO Box 3055, Victoria, BC V8W 3P6, Canada FEATURES Location/Qualifiers source 1..2131 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="female" /dev_stage="adult" /tissue_type="breast" /clone="I.M.A.G.E Consortium Clone ID 159455" CDS 488..1801 /note="similar to Vaccinia virus HindIII K4L ORF, and to Vaccinia virus p37 (HindIII F13L ORF)" /codon_start=1 /product="HU-K4" /db_xref="PID:g1575347" /translation="MTQLFLWEYGDLHLFGPNQRPAPCYDPCEAVLVESIPEGLDFPN ASTGNPSTSQAWLGLLAGAHSSLDIASFYWTLTNNDTHTQEPSAQQGEEVLRQLQTLA PKGVNVRIAVSKPSGPQPQADLQALLQSGAQVRMVDMQKLTHGVLHTKFWVVDQTHFY LGSANMDWRSLTQVKELGVVMYNCSCLARDLTKIFEAYWFLGQAGSSIPSTWPRFYDT RYNQETPMEICLNGTPALAYLASAPPPLCPSGRTPDLKALLNVVDNARSFIYVAVMNY LPTLEFSHPHRFWPAIDDGLRRATYERGVKVRLLISCWGHSEPSMRAFLLSLAALRDN HTHSDIQVKLFVVPADEAQARIPYARVNHNKYMVTERATYIGTSNWSGNYFTETAGTS LLVTQNGRGGLRSQLEAIFLRDWDSPYIHDLDTSADSVGNACRLL" BASE COUNT 426 a 710 c 598 g 397 t ORIGIN 1 ctctttataa tttagtttcc atagaagtta tatgtgcatt taaaaaaatt caatgctgga 61 gcgaccgtgt ctggggagcc gagccccgct tctcgctgcg gtgagcccgg actggggcac 121 gcactgcgca gactccccgc tgcagtgggc ggagtcccac aggccccgcc cctcctccca 181 ccctcgttca gcctgtccag acagaagctg gggcccagcg gaggtagcag cagacgcctg 241 agagcgaggc cgaggccctc agggtttgga gaccctgaca cacccacctt ctcacctggg 301 ctctgcgtat cccccagcct tgagggaaga tgaagcctaa actgatgtac caggagctga 361 aggtgcctgc agaggagccc gccaatgagc tgcccatgaa tgagattgag gcgtggaagg 421 ctgcggaaaa gaaagcccgc tgggtcctgc tggtcctcat tctggcggtt gtgggcttcg 481 gagcctgatg actcagctgt ttctatggga atacggcgac ttgcatctct ttgggcccaa 541 ccagcgccca gccccctgct atgacccttg cgaagcagtg ctggtggaaa gcattcctga 601 gggcctggac ttccccaatg cctccacggg gaacccttcc accagccagg cctggctggg 661 cctgctcgcc ggtgcgcaca gcagcctgga catcgcctcc ttctactgga ccctcaccaa 721 caatgacacc cacacgcagg agccctctgc ccagcagggt gaggaggtcc tccggcagct 781 gcagaccctg gcaccaaagg gcgtgaacgt ccgcatcgct gtgagcaagc ccagcgggcc 841 ccagccacag gcggacctgc aggctctgct gcagagcggt gcccaggtcc gcatggtgga 901 catgcagaag ctgacccatg gcgtcctgca taccaagttc tgggtggtgg accagaccca 961 cttctacctg ggcagtgcca acatggactg gcgttcactg acccaggtca aggagctggg 1021 cgtggtcatg tacaactgca gctgcctggc tcgagacctg accaagatct ttgaggccta 1081 ctggttcctg ggccaggcag gcagctccat cccatcaact tggccccggt tctatgacac 1141 ccgctacaac caagagacac caatggagat ctgcctcaat ggaacccctg ctctggccta 1201 cctggcgagt gcgcccccac ccctgtgtcc aagtggccgc actccagacc tgaaggctct 1261 actcaacgtg gtggacaatg cccggagttt catctacgtc gctgtcatga actacctgcc 1321 cactctggag ttctcccacc ctcacaggtt ctggcctgcc attgacgatg ggctgcggcg 1381 ggccacctac gagcgtggcg tcaaggtgcg cctgctcatc agctgctggg gacactcgga 1441 gccatccatg cgggccttcc tgctctctct ggctgccctg cgtgacaacc atacccactc 1501 tgacatccag gtgaaactct ttgtggtccc cgcggatgag gcccaggctc gaatcccata 1561 tgcccgtgtc aaccacaaca agtacatggt gactgaacgc gccacctaca tcggaacctc 1621 caactggtct ggcaactact tcacggagac ggcgggcacc tcgctgctgg tgacgcagaa 1681 tgggaggggc ggcctgcgga gccagctgga ggccattttc ctgagggact gggactcccc 1741 ttacattcat gaccttgaca cctcagctga cagcgtgggc aacgcctgcc gcctgctctg 1801 aggcccgatc cagtgggcag gccaaggcct gctgggcccc cgcggaccca ggtgctctgg 1861 gtcacggtcc ctgtccccgc acccccgctt ctgtctgccc cattgtggct cctcaggctc 1921 tctcccctgc tctcccacct ctacctccac ccccaccggc ctgacgctgt ggccccggga 1981 cccagcagag ctgggggagg gatcagcccc caaagaaatg ggggtgcatg ctggcctgcc 2041 ccctggccca cccccacttt ccagggcaaa aagggcccag ggttataata agtaaataac 2101 ttgtctgtaa aaaaaaaaaa aaaaaaaaaa a // LOCUS HSU60665 2192 bp mRNA PRI 27-JUN-1996 DEFINITION Human testis specific basic protein (TSBP), complete cds. ACCESSION U60665 NID g1390020 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2192) AUTHORS O'Hern,P.A., Yavetz,H., Moy,T., Yavetz,B., Liang,Z.G., Wang,G.Y. and Goldberg,E. TITLE Direct Submission JOURNAL Submitted (12-JUN-1996) Biochemistry, Molecular Biology, and Cell Biology, Northwestern University, 2153 N. Campus Drive, Evanston, IL 60208, USA FEATURES Location/Qualifiers source 1..2192 /organism="Homo sapiens" /note="testis lambda gt11 cDNA library clone" /db_xref="taxon:9606" gene 237..1943 /gene="TSBP" CDS 237..1943 /gene="TSBP" /codon_start=1 /product="testis specific basic protein" /db_xref="PID:g1390021" /translation="MTVLEITLAVILTLLGLAILAILLTRWARRKQSEMYISRYSSEQ SARLLDYEDGRGSRHAYQHKVTLHMITERDPKRDYTPSTNSLALSRSSIALPQGSMSS IKCLQTTEEPPSRTAGAMMQFTALFPELQDLSSSLKKPLCKLQDLLYNIWIQCQIASH TITGHLQHPRSPMAPIIISQRTASQLAAPIRIPQVHTMDSSGKITLTPVVILTGYMDE ELRKKSCSKIQILKCGGTARSQIAEKKTRKQLKNDIIFTNSVESLKSAHIKEPEREGK GTDLEKDKIGMEVKVDSDAGIPKRQETQLKISEDEYTTRTGSPNKEKCVRCTKRTGVQ VKKSESGVPKGQEAQVTKSGLVVLKGQEAQVEKSEMGVPRRQESQVKKSQSGVSKGQE AQVKKRESVVLKGQEAQVEKSELKVPKGQEGQVEKTEADVPKEQEVQEKKSEAGVLKG PESQVKNTEVSVPETLESQVKKSESGVLKGQEAQEKKESFEDKGNNDKEKERDAEKDP NKKEKGDKNTKGDKGKDKVKGKRESEINGEKSKGSKRRRQIQEGSTTKKWKSKDKFFK GP" BASE COUNT 831 a 429 c 484 g 448 t ORIGIN 1 agctcagctg ggagcgcaga ggctcacgcc tgtaatccca tcatttgctt aggtctgatc 61 aatctgctcc acacaatttc tcagtgatcc tctgcatctc tgcctacaag ggcctccctg 121 acacccaagt tcatattgct cagaaacagt gaacttgagt ttttcgtttt accttgatct 181 ctctctgaca aagaaatcca gatgatgcaa cacctgatga agacaataca tggaaaatga 241 cagtcttgga aataactttg gctgtcatcc tgactctact gggacttgcc atcctggcta 301 ttttgttaac aagatgggca cgacgtaagc aaagtgaaat gtatatctcc agatacagtt 361 cagaacaaag tgctagactt ctggactatg aggatggtag aggatcccga catgcatatc 421 aacacaaagt gacacttcat atgataaccg agagagatcc aaaaagagat tacacaccat 481 caaccaactc tctagcactg tctcgatcaa gtattgcttt acctcaagga tccatgagta 541 gtataaaatg tttacaaaca actgaagaac ctccttccag aactgcagga gccatgatgc 601 aattcacagc cctattcccg gagctacagg acctatcaag ctctctcaaa aaaccattgt 661 gcaaactcca ggacctattg tacaatatct ggatccaatg tcagatcgca tctcacacaa 721 tcactggtca ccttcagcac ccgcggtcac ccatggcacc cataataatt tcacagagaa 781 ccgcaagtca gctggcagca cctataagaa tacctcaagt tcacactatg gacagttctg 841 gaaaaatcac actgactcct gtggttatat taacaggtta catggacgaa gaacttcgaa 901 aaaaatcttg ttccaaaatc cagattctaa aatgtggagg cactgcaagg tctcagatag 961 ccgagaagaa aacaaggaag caactaaaga atgacatcat atttacgaat tctgtagaat 1021 ccttgaaatc agcacacata aaggagccag aaagagaagg aaaaggcact gatttagaga 1081 aagacaaaat aggaatggag gtcaaggtag acagtgacgc tggaatacca aaaagacagg 1141 aaacccaact aaaaatcagt gaagatgagt ataccacaag gacagggagc ccaaataaag 1201 aaaagtgtgt cagatgtacc aagaggacag gagtccaagt aaagaagagt gagtcaggtg 1261 tcccaaaagg acaagaagcc caagtaacga agagtgggtt ggttgtactg aaaggacagg 1321 aagcccaggt agagaagagt gagatgggtg tgccaagaag acaggaatcc caagtaaaga 1381 agagtcagtc tggtgtctca aagggacagg aagcccaggt aaagaagagg gagtcagttg 1441 tactgaaagg acaggaagcc caggtagaga agagtgagtt gaaggtacca aaaggacaag 1501 aaggccaagt agagaagact gaggcagatg tgccaaagga acaagaggtc caagaaaaga 1561 agagtgaggc aggtgtactg aaaggaccag aatcccaagt aaagaacact gaggtgagtg 1621 taccagaaac actggaatcc caagtaaaga agagtgagtc aggtgtacta aaaggacagg 1681 aagcccaaga aaagaaggag agttttgagg ataaaggaaa taatgataaa gaaaaggaga 1741 gagatgcaga gaaagatcca aataaaaaag aaaaaggtga caaaaacaca aaaggtgaca 1801 aaggaaagga caaagttaaa ggaaagagag aatcagaaat caatggtgaa aaatcaaaag 1861 gctcgaaaag gcgaaggcaa atacaggaag gaagtacaac aaaaaagtgg aagagtaagg 1921 ataaattttt taaaggccca taagacaagt gattattatg attcccatac tccagataca 1981 aaccatatcc cagccattgc ctaaacagat tacaattata aaatcccttt catcttcata 2041 tcacagtttc tgctcttcag aagtttcacc ctttttaatc tctcagccac aaacctcagt 2101 tccaatattg ttataagtta agacgtatat gattccgtca agaaagactg gatactttct 2161 gaagtaaaac attttaatta aagaaaaaaa aa // LOCUS HSU60666 2446 bp mRNA PRI 27-JUN-1996 DEFINITION Human testis specific leucine rich repeat protein (TSLRP), complete cds. ACCESSION U60666 NID g1390022 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2446) AUTHORS O'Hern,P.A., Yavetz,H., Moy,T., Yavetz,B., Liang,Z.G., Wang,G.Y. and Goldberg,E. TITLE Direct Submission JOURNAL Submitted (12-JUN-1996) Biochemistry, Molecular Biology, and Cell Biology, Northwestern University, 2153 N. Campus Drive, Evanston, IL 60208, USA FEATURES Location/Qualifiers source 1..2446 /organism="Homo sapiens" /note="testis lambda gt11 cDNA library clone" /db_xref="taxon:9606" gene 843..2234 /gene="TSLRP" CDS 843..2234 /gene="TSLRP" /codon_start=1 /product="testis specific leucine rich repeat protein" /db_xref="PID:g1390023" /translation="MGWITEDLIRRNAEHNDCVIFSLEELSLHQQEIERLEHIDKWCR DLKILYLQNNLIGKIENVSKLKKLEYLNLALNNIEKIENLEGCEELAKLDLTVNFIGE LSSIKNLQHNIHLKELFLMGNPCASFDHYREFVVATLPQLKWLDGKEIEPSERIKALQ DYSVIEPQIREQEKDHCLKRAKLKEEAQRKHQEEDKNEDKRSNAGFDGRWYTDINATL SSLESKDHLQAPDIEEHNTKKLDDDLEFWNKPCLFTPESRLETLRHMEKQRKKQEKLS EKKKKVKPPRTLITEDGKALNVNEPKIDFSLKDNEKQIILDLAVYRYMDTSLIDVDVQ PTYVRVMIKGKPFQLVLPAEVKPDSSSAKRSQTTGHLVICMPKVGEVITGGQRAFKSM KTTSDRSREQTNTRSKHMEKLEVDPSKHSFPDVTNIVQEKKHTPRRRPEPKIIPSEED PTFEDNPEVPPLI" BASE COUNT 882 a 455 c 554 g 555 t ORIGIN 1 gaattccggg cccccctcaa aagaagtgaa gacattagaa tgatgagaac agtccattgt 61 aggatagacc caaatttaat agcttaaaga attgaggaag ttggatcctt ttcaaagcgt 121 acaacaagaa tccaaactat atcagaagac agagagaaga gagcaataaa atgccagatc 181 ggttactcta aattaaccca aactgtgaac agcacacaga gaaaaatcaa gccctttccc 241 taagaactat gaagatacag cacaaattta caggtatgag atcaagaaag ctaaggcagg 301 atgggagatt gcagcttcaa gagacatcaa aggtaacatt ctttaattat acaagaaaaa 361 ggaggagggc cgaggataat gttgcctcta ttaagaaagc gggagggaaa gctgttagca 421 gatggctatg aaatggcaag agttgctgtt tctcatgctt tactctagga gggaaggaag 481 gaaggaaaca gggagggaaa ggacggaggg aaagaaatca caagcaacta ctaaaaaagt 541 gccatttggg tgggtgccaa attagtaaag ataaataatt acatggaatc tttaagcaac 601 tggattcatg atcctgaact gagttgcaga tggggcctcc catcaacatg gatcagacgc 661 atacttggat gctaagttgg gaagatgtta ccaaaatatg aattgcctag aggctgttga 721 agacaagata catcactgaa atttggaaaa aggcaaacaa acccgggagc aggggtgaag 781 ggtgaaaagc gtcattcgag gtccgggtcc ggcttgcggg gtcagcgaac tggagaggcg 841 ccatgggctg gatcacagaa gatcttatta gacggaatgc tgaacacaac gactgtgtca 901 ttttttccct ggaggaactc tcgttgcatc agcaagaaat agaaagacta gaacacattg 961 ataaatggtg ccgggattta aaaattctct atcttcaaaa taatcttatt gggaaaattg 1021 aaaatgttag caaactcaag aaacttgaat atttgaattt agctttaaac aacattgaaa 1081 aaatagaaaa cttggaagga tgtgaagagc tggcaaaact tgacctgact gtgaatttca 1141 ttggagagct gagcagcatt aaaaacttgc agcacaatat ccatctgaag gagctctttc 1201 tcatggggaa cccatgtgct tcctttgacc actataggga gttcgtggta gcaactcttc 1261 cacaattaaa gtggttggat ggtaaagaaa tagagccttc agaaaggatt aaggcattgc 1321 aggactattc agtaattgaa ccacaaatca gagagcagga aaaagatcac tgtcttaaac 1381 gagccaaact caaggaagag gctcagagga aacaccaaga agaggataaa aatgaagaca 1441 agagaagtaa cgcaggcttt gatggacgtt ggtacacaga catcaatgct actctttcct 1501 ctttagagag caaagaccac ctacaggcac cagacataga ggaacacaac acaaagaaat 1561 tagacgatga cttggaattc tggaataagc cctgtttgtt tactcctgaa tcaagattgg 1621 aaactcttag acacatggaa aaacaacgga agaaacagga aaaattaagt gaaaaaaaga 1681 agaaagtgaa accacccagg actttgatca ctgaagatgg gaaagcccta aatgtgaatg 1741 agcccaaaat tgacttctct ttgaaagata acgaaaagca gatcatcctg gaccttgctg 1801 tctataggta tatggatacc tctttaatcg atgttgatgt gcaaccaact tacgtgcgag 1861 taatgatcaa aggaaagcca tttcagcttg tccttcctgc agaagtgaaa cccgatagta 1921 gttctgctaa aagatctcag acaacgggtc atttggtcat ctgcatgccc aaggtaggag 1981 aagtaatcac aggtggtcag cgagcattca aatctatgaa aactacctcg gacaggagca 2041 gagaacaaac aaatacaaga agcaagcaca tggagaaact agaagtagac cctagcaagc 2101 actcattccc tgatgtgact aacatagttc aagagaaaaa acacacaccc agaagacgac 2161 ctgaacccaa aattatacca agtgaggaag acccaacctt tgaagacaac cctgaagtgc 2221 ctccgctgat ttgaaacatc tggctgcgtt gccattggct gagacccacc aggtccagtt 2281 ttggttggtg tagagaccat atgcatatta ttcctggata aacacagagt tatttgtcaa 2341 tatcgctgct ccagtgtttt aactcttact ttgcatagta atatccttaa gatagcttaa 2401 aattaaaatg tatgtcttaa atgctataac tcatcatccc gaattc // LOCUS HSU60800 4157 bp mRNA PRI 08-NOV-1996 DEFINITION Human semaphorin (CD100) mRNA, complete cds. ACCESSION U60800 NID g1663566 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4157) AUTHORS Hall,K.T., Boumsell,L., Schultze,J.L., Boussiotis,V.A., Dorfman,D.M., Cardoso,A.A., Bensussan,A., Nadler,L.M. and Freeman,G.J. TITLE Human CD100, a novel leukocyte semaphorin that promotes B-cell aggregation and differentiation JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (21), 11780-11785 (1996) MEDLINE 97030273 REFERENCE 2 (bases 1 to 4157) AUTHORS Hall,K.T., Boumsell,L., Schultze,J., Boussiotis,V., Dorfman,D., Cardoso,A., Bensussan,A., Nadler,L.M. and Freeman,G. TITLE Direct Submission JOURNAL Submitted (13-JUN-1996) Hematologic Malignancies, Dana-Farber Cancer Inst, D736 44 Binney Street, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..4157 /organism="Homo sapiens" /db_xref="taxon:9606" gene 88..2676 /gene="CD100" CDS 88..2676 /gene="CD100" /codon_start=1 /product="semaphorin" /db_xref="PID:g1663567" /translation="MRMCTPIRGLLMALAVMFGTAMAFAPIPRITWEHREVHLVQFHE PDIYNYSALLLSEDKDTLYIGAREAVFAVNALNISEKQHEVYWKVSEDKKAKCAEKGK SKQTECLNYIRVLQPLSATSLYVCGTNAFQPACDHLNLTSFKFLGKNEDGKGRCPFDP AHSYTSVMVDGELYSGTSYNFLGSEPIISRNSSHSPLRTEYAIPWLNEPSFVFADVIR KSPDSPDGEDDRVYFFFTEVSVEYEFVFRVLIPRIARVCKGDQGGLRTLQKKWTSFLK ARLICSRPDSGLVFNVLRDVFVLRSPGLKVPVFYALFTPQLNNVGLSAVCAYNLSTAE EVFSHGKYMQSTTVEQSHTKWVRYNGPVPKPRPGACIDSEARAANYTSSLNLPDKTLQ FVKDHPLMDDSVTPIDNRPRLIKKDVNYTQIVVDRTQALDGTVYDVMFVSTDRGALHK AISLEHAVHIIEETQLFQDFEPVQTLLLSSKKGNRFVYAGSNSGVVQAPLAFCGKHGT CEDCVLARDPYCAWSPPTATCVALHQTESPSRGLIQEMSGDASVCPDKSKGSYRQHFF KHGGTAELKCSQKSNLARVFWKFQNGVLKAESPKYGLMGRKNLLIFNLSEGDSGVYQC LSEERVKNKTVFQVVAKHVLEVKVVPKPVVAPTLSVVQTEGSRIATKVLVASTQGSSP PTPAVQATSSGAITLPPKPAPTGTSCEPKIVINTVPQLHSEKTMYLKSSDNRLLMSLF LFFFVLFLCLFFYNCYKGYLPRQCLKFRSALLIGKKKPKSDFCDREQSLKETLVEPGS FSQQNGEHPKPALDTGYETEQDTITSKVPTDREDSQRIDDLSARDKPFDVKCELKFAD SDADGD" BASE COUNT 960 a 1094 c 1058 g 1045 t ORIGIN 1 ctgagccgca tctgcaatag cacacttgcc cggccacctg ctgccgtgag cctttgctgc 61 tgaagcccct ggggtcgcct ctacctgatg aggatgtgca cccccattag ggggctgctc 121 atggcccttg cagtgatgtt tgggacagcg atggcatttg cacccatacc ccggatcacc 181 tgggagcaca gagaggtgca cctggtgcag tttcatgagc cagacatcta caactactca 241 gccttgctgc tgagcgagga caaggacacc ttgtacatag gtgcccggga ggcggtcttc 301 gctgtgaacg cactcaacat ctccgagaag cagcatgagg tgtattggaa ggtctcagaa 361 gacaaaaaag caaaatgtgc agaaaagggg aaatcaaaac agacagagtg cctcaactac 421 atccgggtgc tgcagccact cagcgccact tccctttacg tgtgtgggac caacgcattc 481 cagccggcct gtgaccacct gaacttaaca tcctttaagt ttctggggaa aaatgaagat 541 ggcaaaggaa gatgtccctt tgacccagca cacagctaca catccgtcat ggttgatgga 601 gaactttatt cggggacgtc gtataatttt ttgggaagtg aacccatcat ctcccgaaat 661 tcttcccaca gtcctctgag gacagaatat gcaatccctt ggctgaacga gcctagtttc 721 gtgtttgctg acgtgatccg aaaaagccca gacagccccg acggcgagga tgacagggtc 781 tacttcttct tcacggaggt gtctgtggag tatgagtttg tgttcagggt gctgatccca 841 cggatagcaa gagtgtgcaa gggggaccag ggcggcctga ggaccttgca gaagaaatgg 901 acctccttcc tgaaagcccg actcatctgc tcccggccag acagcggctt ggtcttcaat 961 gtgctgcggg atgtcttcgt gctcaggtcc ccgggcctga aggtgcctgt gttctatgca 1021 ctcttcaccc cacagctgaa caacgtgggg ctgtcggcag tgtgcgccta caacctgtcc 1081 acagccgagg aggtcttctc ccacgggaag tacatgcaga gcaccacagt ggagcagtcc 1141 cacaccaagt gggtgcgcta taatggcccg gtacccaagc cgcggcctgg agcgtgcatc 1201 gacagcgagg cacgggccgc caactacacc agctccttga atttgccaga caagacgctg 1261 cagttcgtta aagaccaccc tttgatggat gactcggtaa ccccaataga caacaggccc 1321 aggttaatca agaaagatgt gaactacacc cagatcgtgg tggaccggac ccaggccctg 1381 gatgggactg tctatgatgt catgtttgtc agcacagacc ggggagctct gcacaaagcc 1441 atcagcctcg agcacgctgt tcacatcatc gaggagaccc agctcttcca ggactttgag 1501 ccagtccaga ccctgctgct gtcttcaaag aagggcaaca ggtttgtcta tgctggctct 1561 aactcgggcg tggtccaggc cccgctggcc ttctgtggga agcacggcac ctgcgaggac 1621 tgtgtgctgg cgcgggaccc ctactgcgcc tggagcccgc ccacagcgac ctgcgtggct 1681 ctgcaccaga ccgagagccc cagcaggggt ttgattcagg agatgagcgg cgatgcttct 1741 gtgtgcccgg ataaaagtaa aggaagttac cggcagcatt ttttcaagca cggtggcaca 1801 gcggaactga aatgctccca aaaatccaac ctggcccggg tcttttggaa gttccagaat 1861 ggcgtgttga aggccgagag ccccaagtac ggtcttatgg gcagaaaaaa cttgctcatc 1921 ttcaacttgt cagaaggaga cagtggggtg taccagtgcc tgtcagagga gagggttaag 1981 aacaaaacgg tcttccaagt ggtcgccaag cacgtcctgg aagtgaaggt ggttccaaag 2041 cccgtagtgg cccccacctt gtcagttgtt cagacagaag gtagtaggat tgccaccaaa 2101 gtgttggtgg catccaccca agggtcttct cccccaaccc cagccgtgca ggccacctcc 2161 tccggggcca tcacccttcc tcccaagcct gcgcccaccg gcacatcctg cgaaccaaag 2221 atcgtcatca acacggtccc ccagctccac tcggagaaaa ccatgtatct taagtccagc 2281 gacaaccgcc tcctcatgtc cctcttcctc ttcttctttg ttctcttcct ctgcctcttt 2341 ttctacaact gctataaggg atacctgccc agacagtgct tgaaattccg ctcggcccta 2401 ctaattggga agaagaagcc caagtcagat ttctgtgacc gtgagcagag cctgaaggag 2461 acgttagtag agccagggag cttctcccag cagaatgggg agcaccccaa gccagccctg 2521 gacaccggct atgagaccga gcaagacacc atcaccagca aagtccccac ggatagggag 2581 gactcacaga ggatcgacga cctttctgcc agggacaagc cctttgacgt caagtgtgag 2641 ctgaagttcg ctgactcaga cgcagatgga gactgaggcc ggctgtgcat ccccgctggt 2701 gcctcggctg cgacgtgtcc aggcgtggag agttttgtgt ttctcctgtt cagtatccga 2761 gtctcgtgca gtgctgcgta ggttagcccg catcgtgcag acaacctcag tcctcttgtc 2821 tattttctct tgggttgagc ctgtgacttg gtttctcttt gtccttttgg aaaaatgaca 2881 agcattgcat cccagtcttg tgttccgaag tcagtcggag tacttgaaga aggcccacgg 2941 gcggcacgga gttcctgagc cctttctgta gtgggggaaa ggtggctgga cctctgttgg 3001 ctgagaagag catcccttca gcttcccctc cccgtagcag ccactaaaag attatttaat 3061 tccagattgg aaatgacatt ttagtttatc agattggtaa cttatcgcct gttgtccaga 3121 ttggcacgaa ccttttcttc cacttaatta tttttttagg attttgcttt gattgtgttt 3181 atgtcatggg tcattttttt ttagttacag aagcagttgt gttaatattt agaagaagat 3241 gtatatcttc cagattttgt tatatatttg gcataaaata cggcttacgt tgcttaagat 3301 tctcagggat aaacttcctt ttgctaaatg cattctttct gcttttagaa atgtagacat 3361 aaacactccc cggagcccac tcaccttttt tctttttctt tttttttttt taactttatt 3421 ccttgaggga agcattgttt ttggagagat tttctttctg tacttcgttt tacttttctt 3481 tttttttaac ttttactctc tcgaagaaga ggaccttccc acatccacga ggtgggtttt 3541 gagcaaggga aggtagcctg gatgagctga gtggagccag gctggcccag agctgagatg 3601 ggagtgcggt acaatctgga gcccacagct gtcggtcaga acctcctgtg agacagatgg 3661 aaccttcaca agggcgcctt tggttctctg aacatctcct ttctcttctt gcttcaattg 3721 cttacccact gcctgcccag actttctatc cagcctcact gagctgccca ctactggaag 3781 ggaactgggc ctcggtggcc ggggccgcga gctgtgacca cagcaccctc aagcatacgg 3841 cgctgttcct gccactgtcc tgaagatgtg aatgggtggt acgatttcaa cactggttaa 3901 tttcacactc catctccccg ctttgtaaat acccatcggg aagagacttt ttttccatgg 3961 tgaagagcaa taaactctgg atgtttgtgc gcgtgtgtgg acagtcttat cttccagcat 4021 gataggattt gaccattttg gtgtaaacat ttgtgtttta taagatttac cttgttttta 4081 tttttctact ttgaattgta tacatttgga aagtacccaa ataaatgaga agcttctatc 4141 cttaaaaaaa aaaaaaa // LOCUS HSU60805 4171 bp mRNA PRI 25-JAN-1997 DEFINITION Human oncostatin-M specific receptor beta subunit (OSMRB) mRNA, complete cds. ACCESSION U60805 NID g1794210 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4171) AUTHORS Mosley,B., De Imus,C., Friend,D., Boiani,N., Thoma,B., Park,L.S. and Cosman,D. TITLE Dual oncostatin M (OSM) receptors. Cloning and characterization of an alternative signaling subunit conferring OSM-specific receptor activation JOURNAL J. Biol. Chem. 271 (51), 32635-32643 (1996) MEDLINE 97115791 REFERENCE 2 (bases 1 to 4171) AUTHORS Mosley,B. and Cosman,D. TITLE Direct Submission JOURNAL Submitted (13-JUN-1996) Bruce Mosley, Molecular Biology, Immunex Corporation, 51 University St., Seattle, WA 98101, USA FEATURES Location/Qualifiers source 1..4171 /organism="Homo sapiens" /db_xref="taxon:9606" gene 368..3307 /gene="OSMRB" CDS 368..3307 /gene="OSMRB" /codon_start=1 /product="oncostatin-M specific receptor beta subunit" /db_xref="PID:g1794211" /translation="MALFAVFQTTFFLTLLSLRTYQSEVLAERLPLTPVSLKVSTNST RQSLHLQWTVHNLPYHQELKMVFQIQISRIETSNVIWVGNYSTTVKWNQVLHWSWESE LPLECATHFVRIKSLVDDAKFPEPNFWSNWSSWEEVSVQDSTGQDILFVFPKDKLVEE GTNVTICYVSRNIQNNVSCYLEGKQIHGEQLDPHVTAFNLNSVPFIRNKGTNIYCEAS QGNVSEGMKGIVLFVSKVLEEPKDFSCETEDFKTLHCTWDPGTDTALGWSKQPSQSYT LFESFSGEKKLCTHKNWCNWQITQDSQETYNFTLIAENYLRKRSVNILFNLTHRVYLM NPFSVNFENVNATNAIMTWKVHSIRNNFTYLCQIELHGEGKMMQYNVSIKVNGEYFLS ELEPATEYMARVRCADASHFWKWSEWSGQNFTTLEAAPSEAPDVWRIVSLEPGNHTVT LFWKPLSKLHANGKILFYNVVVENLDKPSSSELHSIPAPANSTKLILDRCSYQICVIA NNSVGASPASVIVISADPENKEVEEERIAGTEGGFSLSWKPQPGDVIGYVVDWCDHTQ DVLGDFQWKNVGPNTTSTVISTDAFRPGVRYDFRIYGLSTKRIACLLEKKTGYSQELA PSDNPHVLVDTLTSHSFTLSWKDYSTESQPGFIQGYHVYLKSKARQCHPRFEKAVLSD GSECCKYKIDNPEEKALIVDNLKPESFYEFFITPFTSAGEGPSATFTKVTTPDEHSSM LIHILLPMVFCVLLIMVMCYLKSQWIKETCYPDIPDPYKSSILSLIKFKENPHLIIMN VSDCIPDAIEVVSKPEGTKIQFLGTRKSLTETELTKPNYLYLLPTEKNHSGPGPCICF ENLTYNQAASDSGSCGHVPVSPKAPSMLGLMTSPENVLKALEKNYMNSLGEIPAGETS LNYVSQLASPMFGDKDSLPTNPVEAPHCSEYKMQMAVSLRLALPPPTENSSLSSITLL DPGEHYC" BASE COUNT 1169 a 1004 c 911 g 1087 t ORIGIN 1 gggccgcctc tgcacgtccg ccccggagcc cgcacccgcg ccccacgcgc cgccgaggac 61 tcggcccggc tcgtggagcc cttcgcccgc ggcgtgagta cccccgaccc gcccgtcccc 121 gctctgctcg cgccctgccg ctgcgccgcc ctcggtggct tttccgacgg gcgagccccg 181 tgctgtgcgg gaaagaatcc gacaacttcg cagcccatcc cggctggacg cgaccgggag 241 tgcagcagcc cgttcccctc ctcggtgccg cctctgccca gcgtttgctt ggctgggcta 301 ccacctgcgc tcggacggcg ctcggagggt cctcgccccc ggcctgccta cctgaaaacc 361 agaactgatg gctctatttg cagtctttca gacaacattc ttcttaacat tgctgtcctt 421 gaggacttac cagagtgaag tcttggctga acgtttacca ttgactcctg tatcacttaa 481 agtttccacc aattctacgc gtcagagttt gcacttacaa tggactgtcc acaaccttcc 541 ttatcatcag gaattgaaaa tggtatttca gatccagatc agtaggattg aaacatccaa 601 tgtcatctgg gtggggaatt acagcaccac tgtgaagtgg aaccaggttc tgcattggag 661 ctgggaatct gagctccctt tggaatgtgc cacacacttt gtaagaataa agagtttggt 721 ggacgatgcc aagttccctg agccaaattt ctggagcaac tggagttcct gggaggaagt 781 cagtgtacaa gattctactg gacaggatat attgttcgtt ttccctaaag ataagctggt 841 ggaagaaggc accaatgtta ccatttgtta cgtttctagg aacattcaaa ataatgtatc 901 ctgttatttg gaagggaaac agattcatgg agaacaactt gatccacatg taactgcatt 961 caacttgaat agtgtgcctt tcattaggaa taaagggaca aatatctatt gtgaggcaag 1021 tcaaggaaat gtcagtgaag gcatgaaagg catcgttctt tttgtctcaa aagtacttga 1081 ggagcccaag gacttttctt gtgaaaccga ggacttcaag actttgcact gtacttggga 1141 tcctgggacg gacactgcct tggggtggtc taaacaacct tcccaaagct acactttatt 1201 tgaatcattt tctggggaaa agaaactttg tacacacaaa aactggtgta attggcaaat 1261 aactcaagac tcacaagaaa cctataactt cacactcata gctgaaaatt acttaaggaa 1321 gagaagtgtc aatatccttt ttaacctgac tcatcgagtt tatttaatga atccttttag 1381 tgtcaacttt gaaaatgtaa atgccacaaa tgccatcatg acctggaagg tgcactccat 1441 aaggaataat ttcacatatt tgtgtcagat tgaactccat ggtgaaggaa aaatgatgca 1501 atacaatgtt tccatcaagg tgaacggtga gtacttctta agtgaactgg aacctgccac 1561 agagtacatg gcgcgagtac ggtgtgctga tgccagccac ttctggaaat ggagtgaatg 1621 gagtggtcag aacttcacca cacttgaagc tgctccctca gaggcccctg atgtctggag 1681 aattgtgagc ttggagccag gaaatcatac tgtgacctta ttctggaagc cattatcaaa 1741 actgcatgcc aatggaaaga tcctgttcta taatgtagtt gtagaaaacc tagacaaacc 1801 atccagttca gagctccatt ccattccagc accagccaac agcacaaaac taatccttga 1861 caggtgttcc taccaaatct gcgtcatagc caacaacagt gtgggtgctt ctcctgcttc 1921 tgtaatagtc atctctgcag accccgaaaa caaagaggtt gaggaagaaa gaattgcagg 1981 cacagagggt ggattctctc tgtcttggaa accccaacct ggagatgtta taggctatgt 2041 tgtggactgg tgtgaccata cccaggatgt gctcggtgat ttccagtgga agaatgtagg 2101 tcccaatacc acaagcacag tcattagcac agatgctttt aggccaggag ttcgatatga 2161 cttcagaatt tatgggttat ctacaaaaag gattgcttgt ttattagaga aaaaaacagg 2221 atactctcag gaacttgctc cttcagacaa ccctcacgtg ctggtggata cattgacatc 2281 ccactccttc actctgagtt ggaaagatta ctctactgaa tctcaacctg gttttataca 2341 agggtaccat gtctatctga aatccaaggc gaggcagtgc cacccacgat ttgaaaaggc 2401 agttctttca gatggttcag aatgttgcaa atacaaaatt gacaacccgg aagaaaaggc 2461 attgattgtg gacaacctaa agccagaatc cttctatgag tttttcatca ctccattcac 2521 tagtgctggt gaaggcccca gtgctacgtt cacgaaggtc acgactccgg atgaacactc 2581 ctcgatgctg attcatatcc tactgcccat ggttttctgc gtcttgctca tcatggtcat 2641 gtgctacttg aaaagtcagt ggatcaagga gacctgttat cctgacatcc ctgaccctta 2701 caagagcagc atcctgtcat taataaaatt caaggagaac cctcacctaa taataatgaa 2761 tgtcagtgac tgtatcccag atgctattga agttgtaagc aagccagaag ggacaaagat 2821 acagttccta ggcactagga agtcactcac agaaaccgag ttgactaagc ctaactacct 2881 ttatctcctt ccaacagaaa agaatcactc tggccctggc ccctgcatct gttttgagaa 2941 cttgacctat aaccaggcag cttctgactc tggctcttgt ggccatgttc cagtatcccc 3001 aaaagcccca agtatgctgg gactaatgac ctcacctgaa aatgtactaa aggcactaga 3061 aaaaaactac atgaactccc tgggagaaat cccagctgga gaaacaagtt tgaattatgt 3121 gtcccagttg gcttcaccca tgtttggaga caaggacagt ctcccaacaa acccagtaga 3181 ggcaccacac tgttcagagt ataaaatgca aatggcagtc tccctgcgtc ttgccttgcc 3241 tcccccgacc gagaatagca gcctctcctc aattaccctt ttagatccag gtgaacacta 3301 ctgctaacca gcatgccgat ttcatacctt atgctacaca gacattaaga agagcagagc 3361 tggcaccctg tcatcaccag tggccttggt ccttaatccc agtacaattt gcaggtctgg 3421 tttatataag accactacag tctggctagg ttaaaggcca gaggctatgg aacttaacac 3481 tccccattgg agcaagcttg ccctagagac ggcaggatca tgggagcatg cttaccttct 3541 gctgtttgtt ccaggctcac ctttagaaca ggagacttga gcttgaccta aggatatgca 3601 ttaaccactc tacagactcc cactcagtac tgtacagggt ggctgtggtc ctagaagttc 3661 agtttttact gaggaaatat ttccattaac agcaattatt atattgaagg ctttaataaa 3721 ggccacagga gacattacta tagcatagat tgtcaaatgt aaatttactg agcgtgtttt 3781 ataaaaaact cacaggtgtt tgaggccaaa acagatttta gacttacctt gaacggataa 3841 gaatctatag ttcactgaca cagtaaaatt aactctgtgg gtgggggcgg ggggcatagc 3901 tctaatctaa tatataaaat gtgtgatgaa tcaacaagat ttccacaatt cttctgtcaa 3961 gcttactaca gtgaaagaat gggattggca agtaacttct gacttactgt cagttgtact 4021 tctgctccat agacatcagt attctgccat catttttgat gactacctca gaacataaaa 4081 aggaacgtat atcacataat tccagtcaca gtttttggtt cctcttttct ttcaagaact 4141 atatataaat gacctgtttt cacgcggccg c // LOCUS HSU60808 2051 bp mRNA PRI 02-APR-1997 DEFINITION Human CDP-diacylglycerol synthase (CDS) mRNA, complete cds. ACCESSION U60808 NID g1915971 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2051) AUTHORS Weeks,R., Dowhan,W., Shen,H., Balantac,N., Meengs,B., Nudelman,E. and Leung,D.W. TITLE Isolation and expression of an isoform of human CDP-diacylglycerol synthase cDNA JOURNAL DNA Cell Biol. 16 (3), 281-289 (1997) MEDLINE 97238623 REFERENCE 2 (bases 1 to 2051) AUTHORS Leung,D.W. TITLE Direct Submission JOURNAL Submitted (13-JUN-1996) Mol. Biol., Cell Therapeutics, Inc., 201 Elliott Ave., W., Suite 400, Seattle, WA 98119, USA FEATURES Location/Qualifiers source 1..2051 /organism="Homo sapiens" /db_xref="taxon:9606" gene 150..1535 /gene="CDS" CDS 150..1535 /gene="CDS" /note="CTP:phosphatidate cytidylytransferase" /codon_start=1 /product="CDP-diacylglycerol synthase" /db_xref="PID:g1915972" /translation="MLELRHRGSCPGPREAVSPPHREGEAAGGDHETESTSDKETDID DRYGDLDSRTDSDIPEIPPSSDRTPEILKKALSGLSSRWKNWWIRGILTLTMISLFFL IIYMGSFMLMLLVLGIQVKCFHEIITIGYRVYHSYDLPWFRTLSWYFLLCVNYFFYGE TVADYFATFVQREEQLQFLIRYHRFISFALYLAGFCMFVLSLVKKHYRLQFYMFAWTH VTLLITVTQSHLVIQNLFEGMIWFLVPISSVICNDITAYLFGFFFGRTPLIKLSPKKT WEGFIGGFFSTVVFGFIAAYVLSKYQYFVCPVEYRSDVNSFVTECEPSELFQLQTYSL PPFLKAVLRQERVSLYPFQIHSIALSTFASLIGPFGGFFASGFKRAFKIKDFANTIPG HGGIMDRFDCQYLMATFVHVYITSFIRGPNPSKVLQQLLVLQPEQQLNIYKTLKTHLI EKGILQPTLKV" BASE COUNT 566 a 413 c 446 g 626 t ORIGIN 1 tctatggtgg ggccgcgtta gtggctgcgg ctccgcggga ctccagggcg cggctgcgag 61 gtggcggggc gccccgcctg cagaaccctg cttgcagctc aggtttcggg gtgcttgagg 121 aggccgccac ggcagcgcgg gagcggaaga tgttggagct gaggcaccgg ggaagctgcc 181 ccggccccag ggaagcggtg tcgccgccac accgcgaggg agaggcggcc ggcggcgacc 241 acgaaaccga gagcaccagc gacaaagaaa cagatattga tgacagatat ggagatttgg 301 attccagaac agattctgat attccggaaa ttccaccatc ctcagataga acccctgaga 361 ttctcaaaaa agctctatct ggtttatctt caaggtggaa aaactggtgg atacgtggaa 421 ttctcactct aactatgatc tcgttgtttt tcctgatcat ctatatggga tccttcatgc 481 tgatgcttct tgttctgggc atccaagtga aatgcttcca tgaaattatc actataggtt 541 atagagtcta tcattcttat gatctaccat ggtttagaac actaagttgg tactttctat 601 tgtgtgtaaa ctactttttc tatggagaga ctgtagctga ttattttgct acatttgttc 661 aaagagaaga acaacttcag ttcctcattc gctaccatag atttatatca tttgccctct 721 atctggcagg tttctgcatg tttgtactga gtttggtgaa gaaacattat cgtctgcagt 781 tttatatgtt cgcatggact catgtcactt tactgataac tgtcactcag tcacaccttg 841 tcatccaaaa tctgtttgaa ggcatgatat ggttccttgt tccaatatca agtgttatct 901 gcaatgacat aactgcttac ctttttggat ttttttttgg gagaactcca ttaattaagt 961 tgtctcctaa aaagacttgg gaaggattca ttggtggttt cttttccaca gttgtgtttg 1021 gattcattgc tgcctatgtg ttatccaaat accagtactt tgtctgccca gtggaatacc 1081 gaagtgatgt aaactccttc gtgacagaat gtgagccctc agaacttttc cagcttcaga 1141 cttactcact tccacccttt ctaaaggcag tcttgagaca ggaaagagtg agcttgtacc 1201 ctttccagat ccacagcatt gcactgtcaa cctttgcatc tttaattggc ccatttggag 1261 gcttctttgc tagtggattc aaaagagcct tcaaaatcaa ggattttgca aataccattc 1321 ctggacatgg tgggataatg gacagatttg attgtcagta tttgatggca acttttgtac 1381 atgtgtacat cacaagtttt ataaggggcc caaatcccag caaagtgcta cagcagttgt 1441 tggtgcttca acctgaacag cagttaaata tatataaaac cctgaagact catctcattg 1501 agaaaggaat cctacaaccc accttgaagg tataactgga tccagagagg gaaggactga 1561 caagaaggaa ttattcagaa aaacactgac agatgtttta taaattgtac agaaaaatag 1621 ttaaaaatgc aataggttga agttttggag atatgtttct ctctgaaatt actgtgaata 1681 tttaacaaac acttacttga tctatgttat gaaataagta gcaaattgcc agcaaaatgt 1741 cttgtacctt ttctaaagtg tattttctga tgtgaacttc cttcccctta cttgctaggt 1801 ttcataattt aaaagactgg tatttaaaag agtcaaacac tataaaatga gtaagttgac 1861 gatgttttaa gattgcacct ggcagtgtgc ctttttgcac aaatatttac ttttgcactt 1921 ggagctgctt ttaattttag caaaatgttt tatgcaaggc acaataggaa gtcagttctc 1981 ctgcacttcc tcctcatgta gtctggagta ctttctaaag ggcttagttg gatttaaaaa 2041 aaaaaaaaaa a // LOCUS HSU61148 1572 bp DNA PRI 25-JAN-1997 DEFINITION Human atonal homolog 1 (Hath1) gene, complete cds. ACCESSION U61148 NID g1575354 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1572) AUTHORS Ben-Arie,N., McCall,A.E., Berkman,S., Eichele,G., Bellen,H.J. and Zoghbi,H.Y. TITLE Evolutionary conservation of sequence and expression of the bHLH protein Atonal suggests a conserved role in neurogenesis JOURNAL Hum. Mol. Genet. 5 (9), 1207-1216 (1996) MEDLINE 97026280 REFERENCE 2 (bases 1 to 1572) AUTHORS Ben-Arie,N. TITLE Direct Submission JOURNAL Submitted (18-JUN-1996) Pediatrics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..1572 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="4" /map="4q22" gene 363..1427 /gene="Hath1" CDS 363..1427 /gene="Hath1" /note="atonal homolog 1" /codon_start=1 /product="HATH1" /db_xref="PID:g1575355" /translation="MSRLLHAEEWAEVKELGDHHRQPQPHHLPQPPPPPQPPATLQAR EHPVYPPELSLLDSTDPRAWLAPTLQGICTARAAQYLLHSPELGASEAAAPRDEVDGR GELVRRSSGGASSSKSPGPVKVREQLCKLKGGVVVDELGCSRQRAPSSKQVNGVQKQR RLAANARERRRMHGLNHAFDQLRNVIPSFNNDKKLSKYETLQMAQIYINALSELLQTP SGGEQPPPPPASCKSDHHHLRTAASYEGGAGNATAAGAQQASGGSQRPTPPGSCRTRF SAPASAGGYSVQLDALHFSTFEDSALTAMMAQKNLSPSLPGSILQPVQEENSKTSPRS HRSDGEFSPHSHYSDSDEAS" misc_feature 834..1004 /gene="Hath1" /note="encodes basic helix-loop-helix" BASE COUNT 334 a 475 c 497 g 262 t 4 others ORIGIN 1 gtcctctgca cacaagaact tttctcgggg tgtaaaaact ctttgattgg ctgctcgcac 61 gcgcctgccc gcgccctcca ttggctgaga agacacgcga ccggcgcgag gagggggttg 121 ggagaggagc ggggggagac tgagtggcgc gtgccgcttt ttaaaggggc gcagcgcctt 181 cagcaaccgg agaagcatag ttgcacgcga cctggtgtgt gatctccgag tgggtggggg 241 agggtcgagg agggaaaaaa aaataagacg ttgcagaaga gacccggaaa gggccttttt 301 tttggttgag ctggtgtccc agtgctgcct ccgatcctga gcgtccgagc ctttgcagtg 361 caatgtcccg cctgctgcat gcagaagagt gggctgaagt gaaggagttg ggagaccacc 421 atcgccagcc ccagccgcat catctcccgc aaccgccgcc gccgccgcag ccacctgcaa 481 ctttgcaggc gagagagcat cccgtctacc cgcctgagct gtccctcctg gacagcaccg 541 acccacgcgc ctggctggct cccactttgc agggcatctg cacggcacgc gccgcccagt 601 atttgctaca ttccccggag ctgggtgcct cagaggccgc tgcgccccgg gacgaggtgg 661 acggccgggg ggagctggta aggaggagca gcggcggtgc cagcagcagc aagagccccg 721 ggccggtgaa agtgcgggaa cagctgtgca agctgaaagg cggggtggtg gtagacgagc 781 tgggctgcag ccgccaacgg gccccttcca gcaaacaggt gaatggggtg cagaagcaga 841 gacggctagc agccaacgcc agggagcggc gcaggatgca tgggctgaac cacgccttcg 901 accagctgcg caatgttatc ccgtcgttca acaacgacaa gaagctgtcc aaatatgaga 961 ccctgcagat ggcccaaatc tacatcaacg ccttgtccga gctgctacaa acgcccagcg 1021 gaggggaaca gccaccgccg cctccagcct cctgcaaaag cgaccaccac caccttcgca 1081 ccgcggcctc ctatgaaggg ggcgcgggca acgcgaccgc agctggggct cagcaggctt 1141 ccggagggag ccagcggccg accccgcccg ggagttgccg gactcgcttc tcagccccag 1201 cttctgcggg agggtactcg gtgcagctgg acgctctgca cttctcgact ttcgaggaca 1261 gcgccctgac agcgatgatg gcgcaaaaga atttgtctcc ttctctcccc gggagcatct 1321 tgcagccagt gcaggaggaa aacagcaaaa cttcgcctcg gtcccacaga agcgacgggg 1381 aattttcccc ccattcccat tacagtgact cggatgaggc aagttaggaa ggtgacagaa 1441 gcctgaaaac tgagacagaa acaaaactgc cctttcccag tgcgcgggaa gccccgnggt 1501 taangatccc cgcacccttt aatttnggct ctgcgatggt cgttgtttag caacgacttg 1561 gctncagatg gt // LOCUS HSU61166 3241 bp mRNA PRI 23-JUL-1996 DEFINITION Human SH3 domain-containing protein SH3P17 mRNA, complete cds. ACCESSION U61166 NID g1438932 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3241) AUTHORS Sparks,A.B., Hoffman,N.G., McConnell,S.J., Fowlkes,D.M. and Kay,B.K. TITLE Cloning of ligand targets: Systematic isolation of SH3 domain-containing proteins JOURNAL Nat. Biotechnol. 14, 741-744 (1996) REFERENCE 2 (bases 1 to 3241) AUTHORS Pirozzi,G., McConnell,S.J., Uveges,A. and Fowlkes,D.M. TITLE Direct Submission JOURNAL Submitted (18-JUN-1996) CYTOGEN Corp., 307 College Road East, Princeton, NJ 08540, USA FEATURES Location/Qualifiers source 1..3241 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="bone marrow" CDS 37..1599 /codon_start=1 /product="SH3 domain-containing protein SH3P17" /db_xref="PID:g1438933" /translation="MEAERLKQKEQERKIIELEKQKEEAQRRAQERDKQWLEHVQQED EHQRPRKLHEEEKLKREESVKKKDGEEKGKQEAQDKLGRLFHQHQEPAKPAVQAPWST AEKGPLTISAQENVKVVYYRALYPFESRSHDEITIQPGDIVMVDESQTGEPGWLGGEL KGKTGWFPANYAEKIPENEVPAPVKPVTDSTSAPAPKLALRETPAPLAVTSSEPSTTP NNWADFSSTWPTSTNEKPETDNWDAWAAQPSLTVPSAGQLRQRSAFTPATATGSSPSP VLGQGEKVEGLQAQALYPWRAKKDNHLNFNKNDVITVLEQQDMWWFGEVQGQKGWFPK SYVKLISGPIRKSTSMDSGSSESPASLKRVASPAAKPVVSGEEIAQVIASYTATGPEQ LTLAPGQLILIRKKNPGGWWEGELQARGKKRQIGWFPANYVKLLSPGTSKITPTEPPK STALAAVCQVIGMYDYTAQNDDELAFNKGQIINVLNKEDPDWWKGEVNGQVGLFPSNY VKLTTDMDPSQQ" BASE COUNT 994 a 756 c 702 g 789 t ORIGIN 1 gaattcgcgg ccgcgtcgac ccagaagcaa aagtccatgg aggctgaacg actgaaacag 61 aaagaacaag aacgaaagat catagaatta gaaaaacaaa aagaagaagc ccaaagacga 121 gctcaggaaa gggacaagca gtggctggag catgtgcagc aggaggacga gcatcagaga 181 ccaagaaaac tccacgaaga ggaaaaactg aaaagggagg agagtgtcaa aaagaaggat 241 ggcgaggaaa aaggcaaaca ggaagcacaa gacaagctgg gtcggctttt ccatcaacac 301 caagaaccag ctaagccagc tgtccaggca ccctggtcca ctgcagaaaa aggtccactt 361 accatttctg cacaggaaaa tgtaaaagtg gtgtattacc gggcactgta cccctttgaa 421 tccagaagcc atgatgaaat cactatccag ccaggagaca tagtcatggt ggatgaaagc 481 caaactggag aacccggctg gcttggagga gaattaaaag gaaagacagg gtggttccct 541 gcaaactatg cagagaaaat cccagaaaat gaggttcccg ctccagtgaa accagtgact 601 gattcaacat ctgcccctgc ccccaaactg gccttgcgtg agacccccgc ccctttggca 661 gtaacctctt cagagccctc cacgacccct aataactggg ccgacttcag ctccacgtgg 721 cccaccagca cgaatgagaa accagaaacg gataactggg atgcatgggc agcccagccc 781 tctctcaccg ttccaagtgc cggccagtta aggcagaggt ccgcctttac tccagccacg 841 gccactggct cctccccgtc tcctgtgcta ggccagggtg aaaaggtgga ggggctacaa 901 gctcaagccc tatatccttg gagagccaaa aaagacaacc acttaaattt taacaaaaat 961 gatgtcatca ccgtcctgga acagcaagac atgtggtggt ttggagaagt tcaaggtcag 1021 aagggttggt tccccaagtc ttacgtgaaa ctcatttcag ggcccataag gaagtctaca 1081 agcatggatt ctggttcttc agagagtcct gctagtctaa agcgagtagc ctctccagca 1141 gccaagccgg tcgtttcggg agaagaaatt gcccaggtta ttgcctcata caccgccacc 1201 ggccccgagc agctcactct cgcccctggt cagctgattt tgatccgaaa aaagaaccca 1261 ggtggatggt gggaaggaga gctgcaagca cgtgggaaaa agcgccagat aggctggttc 1321 ccagctaatt atgtaaagct tctaagccct gggacgagca aaatcactcc aacagagcca 1381 cctaagtcaa cagcattagc ggcagtgtgc caggtgattg ggatgtacga ctacaccgcg 1441 cagaatgacg atgagctggc cttcaacaag ggccagatca tcaacgtcct caacaaggag 1501 gaccctgact ggtggaaagg agaagtcaat ggacaagtgg ggctcttccc atccaattat 1561 gtgaagctga ccacagacat ggacccaagc cagcaatgaa tcatatgttg tccatccccc 1621 cctcaggctt gaaagtcctc aaagagaccc actatcccat atcactgccc agagggatga 1681 tgggagatgc agccttgatc atgtgacttc cagcatgatc acctactgcc ttctgagtta 1741 gaagaactca ctgcaaaaca gtttacccca tttaacctta gttgcatgtt aaccccaagt 1801 ttgaattaat acctggcaaa aatagaacca aaatttccat aaaacccacg gggtagtggg 1861 tccctttgtg tggctttccc taattaactc caaattgaat ttccccccac cttggccaca 1921 gggtgctttc aaatattttt aaaattaatt ttaaaaaaaa aatttattga tttaaccttt 1981 taaataacca aaataattaa ttaactcctt gtctatttgg ggtttggcaa aagaaccccc 2041 tattcaaagg aatgtcgcct gttcgctata aaaaaaatgg ttccaaaatt ttccataaaa 2101 ccgtgaaaac tgaatgttct tcttcatttt gctcccgtgt taccaaccta aattgctgca 2161 cactttgggg ggcttcggtt tttttttttt ttccccccct tcaacccctt caaaattctc 2221 cccaaaagga aagttttcct ctcccccccc cgcctatatt aaacccacac ttaaactgaa 2281 cccccccacc ccccccctct aaaaaaaacc aaatggtttt ttggttcgac catctaaagc 2341 atctctgctc ataacaaatt cctttttttt ccagggcaaa gctattacct tgtaggatgc 2401 tctaatcata ttggcattta attttatttt gcaacagtga ccttgtagcc acatgagaaa 2461 gcactctgtg tttttgttcg gtctcagatt tatctggttg agttggtgtt ttgtttgggg 2521 tttttaattt tgcgtgtttg catagcataa aatcagtaga caacaccact gaggtcgtta 2581 cgatcaacga tatccacagt ctctttttag tctctgttac atgaagtttt attccagtta 2641 cttttcatgg aatgacctat tttgaacaag taattttctt gacaagaaag aatgtataga 2701 agtctccctg aaataaattt cccaaagtta aaaatttttt taaataggac tgtgggaatt 2761 tttaaagatt aattatgaaa atggagctca gggtccgttt gggggtaaga aaagctgtag 2821 gggaaagccc tgtttgtttt ttaaacacta ggtggaaggt ttcaataaaa aaagcctgct 2881 gctcacagca cagaaaatgg ggcaggggga gcctcaagca caatttagct gtcctcctaa 2941 agaatttgta atgctcaatc cccttgggtt tctcccggcg ctgtcgggag gctgtgctgg 3001 tggttgtgta gaggtccttt tcctttcaaa tggtgcagag agagaggacc tttcctcctt 3061 gttcagttgc aattcagtat tttcacggat atgaatgtaa aatatataaa tatataaacc 3121 tgaggattta acaaatgtaa aacaaccttt tgaattagtt ccgagtatag ataattaaat 3181 ttttaaaaca aaagtaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaagtcg acgcggccgc 3241 g // LOCUS HSU61232 1931 bp mRNA PRI 26-OCT-1996 DEFINITION Human tubulin-folding cofactor E mRNA, complete cds. ACCESSION U61232 NID g1465771 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1931) AUTHORS Tian,G., Huang,Y., Rommelaere,H., Vandekerckhove,J., Ampe,C. and Cowan,N.J. TITLE Pathway leading to correctly folded beta-tubulin JOURNAL Cell 86 (2), 287-296 (1996) MEDLINE 96319731 REFERENCE 2 (bases 1 to 1931) AUTHORS Cowan,N.J. TITLE Direct Submission JOURNAL Submitted (19-JUN-1996) Biochemistry, NYU Medical Center, 550 First Avenue, New York, NY 10016, USA COMMENT Cofactor E is involved in the folding of beta tubulin intermediates after their release from cytosolic chaperonin. It is a homolog of the Pac2p from S. cerevisae. FEATURES Location/Qualifiers source 1..1931 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 130..1713 /note="tubulin-folding protein" /codon_start=1 /product="cofactor E" /db_xref="PID:g1465772" /translation="MSDTLTADVIGRRVEVNGEHATVRFAGVVPPVAGPWLGVEWDNP ERGKHDGSHEGTVYFKCRHPTGGSFIRPNKVNFGTDFLTAIKNRYVLEDGPEEDRKEQ IVTIGNKPVETIGFDSIMKQQSQLSKLQEVSLRNCAVSCAGEKGGVAEACPNIRKVDL SKNLLSSWDEVIHIADQLRHLEVLNVSENKLKFPSGSVLTGTLSVLKVLVLNQTGITW AEVLRCVAGCPGLEELYLESNNIFISERPTDVLQTVKLLDLSSNQLIDENQLYLIAHL PRLEQLILSDTGISSLHFPDAGIGCKTSMFPSLKYLVVNDNQISQWSFFNELEKLPSL RALSCLRNPLTKEDKEAETARLLIIASIGQLKTLNKCEILPEERRRAELDYRKAFGNE WKQAGGHKDPEKNRLSEEFLTAHPRYQFLCLKYGAPEDWELKTQQPLMLKNQLLTLKI KYPHQLDQKVLEKQLPGSMTIQKVKGLLSRLLKVPVSDLLLSYESPKKPGREIELEND LKSLQFYSVENGDCLLVRW" BASE COUNT 599 a 393 c 457 g 482 t ORIGIN 1 ccatcctaat acgactcact atagggctcg agcggccgcc cgggcaggtg ttggctggag 61 gggctgctgc tgggaacacc tggagtctcc gcgggcagat ctcatatttt ggattctgga 121 tatattataa tgagtgacac tttgacagcg gatgtcattg gtcgaagagt tgaagttaat 181 ggagaacatg caacagtacg ttttgctggt gttgtccctc ccgtggcagg accctggtta 241 ggagtagaat gggacaatcc cgagagagga aagcatgatg ggagccacga agggactgtg 301 tattttaaat gcaggcaccc gacaggagga tcctttattc gtccgaacaa ggtaaatttt 361 ggaacagact ttcttactgc aattaagaac cgctatgtgt tagaagatgg accagaggaa 421 gatagaaaag agcaaattgt tacaattgga aataaacctg tggagactat cggttttgac 481 tctattatga aacagcaaag tcagctgagc aagttgcaag aagtttctct gaggaactgt 541 gcagtaagtt gtgctggtga aaaaggagga gttgctgaag catgtcctaa tatcagaaag 601 gtagatttgt caaaaaacct gttgtcatca tgggatgaag tgatacacat tgctgatcag 661 ctcagacacc tggaagtcct taatgtcagt gaaaataaac taaaatttcc ctccggttca 721 gtattaactg gaacgctttc tgtactgaag gttttagtcc tcaatcaaac aggaataacg 781 tgggctgagg tgctgcggtg tgtcgcgggg tgcccaggcc tggaggaact ctaccttgag 841 tctaacaaca ttttcatttc cgaaaggcca acagatgttc tccagacagt caagttatta 901 gatctttcct ctaatcaatt aattgatgaa aatcagctgt atctgatagc ccacctgccc 961 aggttagaac aattaatcct ctctgacact ggaatttctt ctctacattt tccggatgct 1021 ggaattgggt gcaaaacgtc catgttccca tccttgaagt acctggtagt aaacgacaat 1081 cagatatcac aatggtcgtt tttcaatgag ctagagaagt taccaagtct acgggctttg 1141 tcctgcctaa gaaaccccct gaccaaagag gacaaagaag cagagacggc gcgactactc 1201 attatcgcca gcattggcca gctgaagacg ctgaacaaat gtgagattct ccccgaggag 1261 aggcggagag ctgagcttga ctaccgaaaa gcttttggaa atgagtggaa acaggctggt 1321 ggacataagg atccggaaaa aaacagactc agcgaagaat tcctcacagc ccatcccaga 1381 taccagttcc tctgcctgaa atatggtgca cctgaagatt gggaactcaa aacacagcaa 1441 ccacttatgc tgaaaaacca gctactaaca ctgaagataa aataccctca tcaacttgat 1501 cagaaagtcc tggagaaaca actgccgggc tccatgacaa ttcaaaaggt gaagggattg 1561 ctgtcacgtc ttctcaaagt tcctgtgtca gaccttctgt tgtcctatga aagtcccaaa 1621 aagccgggca gagaaatcga gctggaaaat gacctaaagt cattacagtt ttattctgtg 1681 gaaaatggag attgtctatt agtgcgatgg tgacaaccaa ctaataaaat ttaaagacca 1741 cactgcttat cgtgtctggg gttcaccgga aataaatgat tcactggaac aattctactg 1801 tcaaaacaaa gggggtttac aacttgtcct aagtataaca agggatgtat ttttagttgg 1861 gaagtgacca tttctaggct tatacataat agcaataata aaggctttga acctacagaa 1921 aaaaaaaaaa a // LOCUS HSU61234 1575 bp mRNA PRI 26-OCT-1996 DEFINITION Human tubulin-folding cofactor C mRNA, complete cds. ACCESSION U61234 NID g1465773 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1575) AUTHORS Tian,G., Huang,Y., Rommelaere,H., Vandekerckhove,J., Ampe,C. and Cowan,N.J. TITLE Pathway leading to correctly folded beta-tubulin JOURNAL Cell 86 (2), 287-296 (1996) MEDLINE 96319731 REFERENCE 2 (bases 1 to 1575) AUTHORS Cowan,N.J. TITLE Direct Submission JOURNAL Submitted (19-JUN-1996) Biochemistry, NYU Medical Center, 550 First Avenue, New York, NY 10016, USA FEATURES Location/Qualifiers source 1..1575 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 24..1064 /note="tubulin-folding protein; involved in the final step of the beta tubulin folding pathway" /codon_start=1 /product="cofactor C" /db_xref="PID:g1465774" /translation="MESVSCSAAAVRTGDMESQRDLSLVPERLQRREQERQLEVERRK QKRQNQEVEKENSHFFVATFARERAAVEELLERAESVERLEEAASRLQGLQKLINDSV FFLAAYDLRQGQEALARLQAALAERRRGLQPKKRFAFKTRGKDAASSTKVDAAPGIPP AVESIQDSPLPKKAEGDLGPSWVCGFSNLESQVLEKRASELHQRDVLLTELSNCTVRL YGNPNTLRLTKAHSCKLLCGPVSTSVFLEDCSDCVLAVACQQLRIHSTKDTRIFLQVT SRAIVEDCSGIQFAPYTWSYPEIDKDFESSGLDRSKNNWNDVDDFNWLARDMASPNWS ILPEEERNIQWD" BASE COUNT 354 a 416 c 450 g 355 t ORIGIN 1 gagagaggaa gcttgaagcc aatatggagt ccgtcagttg ctccgctgct gctgtcagga 61 ccggagacat ggagtcccag cgggacctga gcctggtgcc tgagcggctt cagagacgcg 121 aacaagaacg gcagctggaa gttgaaaggc ggaaacaaaa gcggcagaac caggaggtag 181 agaaggagaa cagccacttt ttcgtcgcca cctttgctcg ggagcgagcg gccgtggaag 241 agcttctgga gcgcgcggag tcggtcgagc ggctggagga ggcggcctct cggctccagg 301 ggctgcagaa actaatcaac gactcagttt ttttcctagc cgcttacgac ctgcggcagg 361 gacaagaggc gctggcgcgg ctgcaggcgg ccttggccga gcggcgccgg gggctgcagc 421 ccaagaagcg tttcgctttc aagacccggg gaaaggatgc tgcttcgtct accaaagtag 481 acgcggctcc tggcatcccc ccggcagttg aaagcataca ggactccccg ctgcccaaga 541 aggcggaagg agacctcggc cccagctggg tctgcggttt ctccaacctg gagtcccaag 601 tcttggagaa gagagccagc gagttgcacc agcgcgacgt tcttttgacc gaactgagca 661 actgcacggt cagactgtat ggaaatccca acaccctgcg gctaaccaag gcccacagct 721 gcaagctgct ctgcggtccg gtgtctacct ctgttttcct ggaggactgc agtgactgcg 781 tgctggcagt ggcctgccaa cagctccgca tacacagtac gaaagacacc cgcatcttcc 841 tgcaggtgac cagcagggcc atcgtggagg actgcagtgg gatccagttc gccccttaca 901 cctggagcta cccggagatc gacaaggact tcgagagctc tggtttagat aggagcaaaa 961 ataactggaa cgatgttgac gattttaact ggctggcccg ggatatggcc tccccaaact 1021 ggagtattct tcctgaagag gagcgaaata tccagtggga ctaagcagtt gtcactctgt 1081 tcttcactcc taccaaatac tttccacgtt ggactttccc ccttattggg tctcgaagtt 1141 tacttattgt cacactgtgt atgttttcag cattttaagg ctagagattg taatgggctc 1201 ctacttgtaa tttccattaa attcgtaaca ggtataacac taaagcattt ttgctatttt 1261 cgtcatgcct ttgagactga gtcttactcc gtcccccagc gtggtggcgc gctgggatta 1321 caggcgcgcg ccaccacgcg aactcgtatt tttagtagag acggggtttc gccatgttgt 1381 ccgggctgct ctcgaactcc tgacctcagg tgatccaccc gcttcagctt cccaaagtgc 1441 tggcattaca ggcgtgagcc accacgccag ggctttattt atttattttt accacaatag 1501 tttgaagcag taagggggaa ggagggtgat tatattgctt tgtaatggtt tgtgatactt 1561 gaaacatcac ggtgc // LOCUS HSU61262 5297 bp mRNA PRI 22-OCT-1996 DEFINITION Human neogenin mRNA, complete cds. ACCESSION U61262 NID g1621606 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5297) AUTHORS Meyerhardt,J.A., Look,A.T., Bigner,S.H. and Fearon,E.R. TITLE Identification and characterization of neogenin, a DCC-related gene JOURNAL Oncogene (1996) In press REFERENCE 2 (bases 1 to 5297) AUTHORS Meyerhardt,J.A., Look,A.T., Bigner,S.H. and Fearon,E.R. TITLE Direct Submission JOURNAL Submitted (18-JUN-1996) Internal Medicine, University of Michigan, 1150 W. Medical Center Drive, Ann Arbor, MI 48109, USA FEATURES Location/Qualifiers source 1..5297 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="15" /map="15q22" CDS 137..4522 /codon_start=1 /product="neogenin" /db_xref="PID:g1621607" /translation="MAAERGARRLLSTPSFWLYCLLLLGRRAPGAAAARSGSAPQSPG ASIRTFTPFYFLVEPVDTLSVRGSSVILNCSAYSEPSPKIEWKKDGTFLNLVSDDRRQ LLPDGSLFISNVVHSKHNKPDEGYYQCVATVESLGTIISRTAKLIVAGLPRFTSQPEP SSVYAGNGAILNCEVNADLVPFVRWEQNRQPLLLDDRVIKLPSGMLVISNATEGDGGL YRCVVESGGPPKYSDEVELKVLPDPEVISDLVFLKQPSPLVRVIGQDVVLPCVASGLP TPTIKWMKNEEALDTESSERLVLLAGGSLEISDVTEDDAGTYFCIADNGNETIEAQAE LTVQAQPEFLKQPTNIYAHESMDIVFECEVTGKPTPTVKWVKNGDMVIPSDYFKIVKE HNLQVLGLVKSDEGFYQCIAENDVGNAQAGAQLIILEHAPATTGPLPSAPRDVVASLV STRFIKLTWRTPASDPHGDNLTYSVFYTKEGIARERVENTSHPGEMQVTIQNLMPATV YIFRVMAQNKHGSGESSAPLRVETQPEVQLPGPAPNLRAYAASPTSITVTWETPVSGN GEIQNYKLYYMEKGTDKEQDVDVSSHSYTINGLKKYTEYSFRVVAYNKHGPGVSTPDV AVRTLSDVPSAAPQNLSLEVRNSKSIMIHWQPPAPATQNGQITGYKIRYRKASRKSDV TETLVSGTQLSQLIEGLDRGTEYNFRVAALTINGTGPATDWLSAETFESDLDETRVPE VPSSLHVRPLVTSIVVSWTPPENQNIVVRGYAIGYGIGSPHAQTIKVDYKQRYYTIEN LDPSSHYVITLKAFNNVGEGIPLYESAVTRPHTDTSEVDLFVINAPYTPVPDPTPMMP PVGVQASILSHDTIRITWADNSLPKHQKITDSRYYTVRWKTNIPANTKYKNANATTLS YLVTGLKPNTLYEFSVMVTKGRRSSTWSMTAHGTTFELVPTSPPKDVTVVSKEGKPKT IIVNWQPPSEANGKITGYIIYYSTDVNAEIHDWVIEPVVGNRLTHQIQELTLDTPYYF KIQARNSKGMGPMSEAVQFRTPKADSSDKMPNDQASGSGGKGSRLPDLGSDYKPPMSG SNSPHGSPTSPLDSNMLLVIIVSVGVITIVVVVIIAVFCTRRTTSHQKKKRAACKSVN GSHKYKGNSKDVKPPDLWIHHERLELKPIDKSPDPNPIMTDTPIPRNSQDITPVDNSM DSNIHQRRNSYRGHESEDSMSTLAGRRGMRPKMMMPFDSQPPQPVISAHPIHSLDNPH HHFHSSSLASPARSHLYHPGSPWPIGTSMSLSDRANSTESVRNTPSTDTMPASSSQTC CTDHQDPEGATSSSYLASSQEEDSGQSLPTAHVRPSHPLKSFAVPAIPPPGPPTYDPA LPSTPLLSQQALNHHIHSVKTASIGTLGRSRPPMPVVVPSAPEVQETTRMLEDSESSY EPDELTKEMAHLEGLMKDLNAITTA" BASE COUNT 1441 a 1302 c 1245 g 1309 t ORIGIN 1 gggccgggcc gggctgggct ggagcagcgg cgcccgggag ccgagcttgc agcgagggac 61 cggctgaggc gcgcgggagg gaaggaggca agggctccgc ggcgctgtcg cgctgccgct 121 cactctcggg gaagagatgg cggcggagcg gggagcccgg cgactcctca gcaccccctc 181 cttctggctc tactgcctgc tgctgctcgg gcgccgggcg ccgggcgccg cggcggccag 241 gagcggctcc gcgccgcagt ccccaggagc cagcattcga acgttcactc cattttattt 301 tctggtggag ccggtggata cactctcagt tagaggctct tctgttatat taaactgttc 361 agcatattct gagccttctc caaaaattga atggaaaaaa gatggaactt ttttaaactt 421 agtatcagat gatcgacgcc agcttctccc ggatggatct ttatttatca gcaatgtggt 481 gcattccaaa cacaataaac ctgatgaagg ttattatcag tgtgtggcca ctgttgagag 541 tcttggaact attatcagta gaacagcgaa gctcatagta gcaggtcttc caagatttac 601 cagccaacca gaaccttcct cagtttatgc tgggaacgga gcaattctga attgtgaagt 661 taatgcagat ttggtcccat ttgtgaggtg ggaacagaac agacaacccc ttcttctgga 721 tgatagagtt atcaaacttc caagtggaat gctggttatc agcaatgcaa ctgaaggaga 781 tggcgggctt tatcgctgcg tagtggaaag tggtgggcca ccaaagtata gtgatgaagt 841 tgaattgaag gttcttccag atcctgaggt gatatcagac ttggtatttt tgaaacagcc 901 ttctccctta gtcagagtca ttggtcagga tgtagtgttg ccatgtgttg cttcaggact 961 tcctactcca accattaaat ggatgaaaaa tgaggaggca cttgacacag aaagctctga 1021 aagattggta ttgctggcag gtggtagcct ggagatcagt gatgttactg aggatgatgc 1081 tgggacttat ttttgtatag ctgataatgg aaatgagaca attgaagctc aagcagagct 1141 tacagtgcaa gctcaacctg aattcctgaa gcagcctact aatatatatg ctcacgaatc 1201 tatggatatt gtatttgaat gtgaagtgac tggaaaacca actccaactg tgaagtgggt 1261 caaaaatggg gatatggtta tcccaagtga ttattttaag attgtaaagg aacataatct 1321 tcaagttttg ggtctggtga aatcagatga agggttctat cagtgcattg ctgaaaatga 1381 tgttggaaat gcacaagctg gagcccaact gataatcctt gaacatgcac cagccacaac 1441 gggaccactg ccttcagctc ctcgggatgt cgtggcctcc ctggtctcta cccgcttcat 1501 caaattgacg tggcggacac ctgcatcaga tcctcacgga gacaacctta cctactctgt 1561 gttctacacc aaggaaggga ttgctaggga acgtgttgag aataccagtc acccaggaga 1621 gatgcaagta accattcaaa acctaatgcc agcgaccgtg tacatcttta gagttatggc 1681 tcaaaataag catggctcag gagagagttc agctccactg cgagtagaaa cacaacctga 1741 ggttcagctc cctggcccag cacctaacct tcgtgcatat gcagcttcgc ctacctccat 1801 cactgttacg tgggaaacac cagtgtctgg caatggggaa attcagaatt ataagttgta 1861 ctacatggaa aaggggactg ataaagaaca ggatgttgat gtttcaagtc actcttacac 1921 cattaatggg ttgaaaaaat atacagagta tagtttccga gtggtggcct acaataaaca 1981 tggtcctgga gtttccacac cagatgttgc tgttcgaaca ttgtcagatg ttcccagtgc 2041 tgctcctcag aatctgtcct tggaagtgag aaattcaaag agtattatga ttcactggca 2101 gccacctgct ccagccacac aaaatgggca gattactggc tacaagattc gctaccgaaa 2161 ggcctcccga aagagtgatg tcactgagac cttggtaagc gggacacagc tgtctcagct 2221 gattgaaggt cttgatcggg ggactgagta taatttccga gtggctgctc taacaatcaa 2281 tggtacaggc ccggcaactg actggctgtc tgctgaaact tttgaaagtg acctagatga 2341 aactcgtgtt cctgaagtgc ctagctctct tcacgtacgc ccgctcgtta ctagcatcgt 2401 agtgagctgg actcctccag agaatcagaa cattgtggtc agaggttacg ccattggtta 2461 tggcattggc agccctcatg cccagaccat caaagtggac tataaacagc gctattacac 2521 cattgaaaat ctggatccca gctctcacta tgtgattacc ctgaaagcat ttaataacgt 2581 gggtgaaggc atccccctgt atgagagtgc tgtgaccagg cctcacacag acacttctga 2641 agttgattta tttgttatta atgctccata cactccagtg ccagatccca ctcccatgat 2701 gccaccagtg ggagttcagg cttccattct gagtcatgac accatcagga ttacgtgggc 2761 agacaactcg ctgcccaagc accagaagat tacagactcc cgatactaca ccgtccgatg 2821 gaaaaccaac atcccagcaa acaccaagta caagaatgca aatgcaacca ctttgagtta 2881 tttggtgact ggtttaaagc cgaatacact ctatgaattc tctgtgatgg tgaccaaagg 2941 tcgaagatca agtacatgga gtatgacagc ccatgggacc acctttgaat tagttccgac 3001 ttctccaccc aaggatgtga ctgttgtgag taaagagggg aaacctaaga ccataattgt 3061 gaattggcag cctccctctg aagccaatgg caaaattaca ggttacatca tatattacag 3121 tacagatgtg aatgcagaga tacatgactg ggttattgag cctgttgtgg gaaacagact 3181 gactcaccag atacaagagt taactcttga cacaccatac tacttcaaaa tccaggcacg 3241 gaactcaaag ggcatgggac ccatgtctga agctgtccaa ttcagaacac ctaaagcgga 3301 ctcctctgat aaaatgccta atgatcaagc ctcagggtct ggagggaaag gaagccggct 3361 gccagaccta ggatccgact acaaacctcc aatgagcggc agtaacagcc ctcatgggag 3421 ccccacctct cctctggaca gtaatatgct gctggtcata attgtttctg ttggcgtcat 3481 caccatcgtg gtggttgtga ttatcgctgt cttttgtacc cgtcgtacca cctctcacca 3541 gaaaaagaaa cgagctgcct gcaaatcagt gaatggctct cataagtaca aagggaattc 3601 caaagatgtg aaacctccag atctctggat ccatcatgag agactggagc tgaaacccat 3661 tgataagtct ccagacccaa accccatcat gactgatact ccaattcctc gcaactctca 3721 agatatcaca ccagttgaca actccatgga cagcaatatc catcaaaggc gaaattcata 3781 cagagggcat gagtcagagg acagcatgtc tacactggct ggaaggcgag gaatgagacc 3841 aaaaatgatg atgccctttg actcccagcc accccagcct gtgattagtg cccatcccat 3901 ccattccctc gataaccctc accatcattt ccactccagc agcctcgctt ctccagctcg 3961 cagtcatctc taccacccgg gcagcccatg gcccattggc acatccatgt ccctttcaga 4021 cagggccaat tccacagaat ccgttcgaaa tacccccagc actgacacca tgccagcctc 4081 ttcgtctcaa acatgctgca ctgatcacca ggaccctgaa ggtgctacca gctcctctta 4141 cttggccagc tcccaagagg aagattcagg ccagagtctt cccactgccc atgttcgccc 4201 ttcccaccca ttgaagagct tcgccgtgcc agcaatcccg cctccaggac ctcccaccta 4261 tgatcctgca ttgccaagca caccattact gtcccagcaa gctctgaacc atcacattca 4321 ctcagtgaag acagcctcca tcgggactct aggaaggagc cggcctccta tgccagtggt 4381 tgttcccagt gcccctgaag tgcaggagac cacaaggatg ttggaagact ccgagagtag 4441 ctatgaacca gatgagctga ccaaagagat ggcccacctg gaaggactaa tgaaggacct 4501 aaacgctatc acaacagcat gacgaccttc accaggacct gacttcaaac ctgagtctgg 4561 aagtcttgga acttaaccct tgaaaacaag gaattgtaca gagtacgaga ggacagcact 4621 tgagaacaca gaatgagcca gcagactggc cagcgcctct gtgtagggct ggctccaggc 4681 atggccacct gccttcccct ggtcagcctg gaagaagcct gtgtcgaggc agcttccctt 4741 tgcctgctga tattctgcag gactgggcac catgggccaa aattttgtgt ccagggaaga 4801 ggcgagaagt gcaacctgca tttcactttg tggtcaggcc gtgtctttgt gctgtgactg 4861 catcaccttt atggagtgta gacattggca tttatgtaca attttatttg tgtcttattt 4921 tattttacct tcaaaaacaa aaacgccatc caaaaccaag gaagtccttg gtgttctcca 4981 caagtggttg acatttgact gcttgttcca attatgtatg gaaagtcttt gacagtgtgg 5041 gtcgttcctg gggttggctt gttttttggt ttcattttta ttttttaatt ctgagtcatt 5101 gcatcctcta ccagctgtta atccatcact ctgaggggga ggaaatgttg cattgctgtt 5161 tgtaagcttt ttttattatt tttttattat aattattaaa ggcctgactc tttcctctca 5221 tcactgtgag attacagatc tatttgaatt gaatgaaatg taacattgaa aaaaaaaaaa 5281 aaaaaaaaaa aaaaaaa // LOCUS HSU61267 1972 bp mRNA PRI 15-JUL-1996 DEFINITION Human putative splice factor transformer2-beta mRNA, complete cds. ACCESSION U61267 NID g1418285 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1972) AUTHORS Beil,B. and Stamm,S. TITLE Molecular cloning of htra2-beta, a human homologue of tra-2 JOURNAL Unpublished REFERENCE 2 (bases 1 to 1972) AUTHORS Beil,B., Cap,C. and Stamm,S. TITLE Direct Submission JOURNAL Submitted (19-JUN-1996) Max-Planck-Institute for Psychiatry, Am Klopferspitz 18a, Planegg 82152, Germany FEATURES Location/Qualifiers source 1..1972 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 122..988 /note="putative splice factor; transformer2-beta; SR-protein, RNA binding protein" /codon_start=1 /product="htra2-beta" /db_xref="PID:g1418286" /translation="MSDSGEQNYGERESRSASRSGSAHGSGKSARHTPARSRSKEDSR RSRSKSRSRSESRSRSRRSSRRHYTRSRSRSRSHRRSRSRSYSRDYRRRHSHSHSPMS TRRRHVGNRANPDPNCCLGVFGLSLYTTERDLREVFSKYGPIADVSIVYDQQSRRSRG FAFVYFENVDDAKEAKERANGMELDGRRIRVDFSITKRPHTPTPGIYMGRPTYGSSRR RDYYDRGYDRGYDDRDYYSRSYRGGGGGGGGWRAAQDRDQIYRRRSPSPYYSRGGYRS RSRSRSYSPRRY" BASE COUNT 578 a 387 c 451 g 556 t ORIGIN 1 gaattcggca cgagggcgac cggcgcgtcg tgcggggctg cggcggagcc tccttaagga 61 aggtgcaaga ggttggcagc ttcgattgaa gcacatcgac cggcgacagc agccaggagt 121 catgagcgac agcggcgagc agaactacgg cgagcgggaa tcccgttctg cttccagaag 181 tggaagtgct cacggatcgg ggaaatctgc aaggcatacc cctgcaaggt ctcgctccaa 241 ggaagattcc aggcgttcca gatcaaagtc caggtcccga tctgaatcta ggtctagatc 301 cagaagaagc tcccgaaggc attatacccg gtcacggtct cgctcccgct cccatagacg 361 atcacgtagc aggtcttaca gtcgagatta tcgtagacgg cacagccaca gccattctcc 421 catgtctact cgcaggcgtc atgttgggaa tcgggcaaat cctgatccta actgttgtct 481 tggagtattt gggctgagct tgtacaccac agaaagagat ctaagagaag tgttctctaa 541 atatggtccc attgccgatg tgtctattgt atatgaccag cagtctaggc gttcaagagg 601 atttgccttt gtatattttg aaaatgtaga tgatgccaag gaagctaaag aacgtgccaa 661 tggaatggag cttgatgggc gtaggatcag agttgatttc tctataacaa aaagaccaca 721 tacgccaaca ccaggaattt acatggggag acctacctat ggcagctctc gccgtcggga 781 ttactatgac agaggatatg atcggggcta tgatgatcgg gactactata gcagatcata 841 cagaggagga ggtggaggag gaggaggatg gagagctgcc caagacaggg atcagattta 901 tagaaggcgg tcaccttctc cttactatag tcgtggagga tacagatcac gttccagatc 961 tcgatcatac tcacctcgtc gctattaaag catgaagact ttctgaaacc tgccctagag 1021 ctgggatatt gtttgtgggc aatatttttt attgtctctt gtttaaaaag tgaacagtgc 1081 ctagtgaagt taggtgactt ttacaccttt tacgatgact acttttggtg gagttgaaat 1141 gctgttttca ttctgcattt gtgtagtttg gtgctttgtt ccaagttaag tgttttcaga 1201 aaagtatgtt ttgcatgtat ttttttacag tctaaatttt gactgctgag aagtttctat 1261 tgtacaaaac ttcatttaaa aggtttttct actgaatcca gggtattctg aagatcgaag 1321 cctgtgtaaa atgctaccaa atggcaaaaa gcaacaataa acagtttgat ttttactttt 1381 ctttctaaca tatcaatgct tagcagaact attcagattg tcagtagtaa atttaaagac 1441 aaatgcccgt tttcctccag tccatgaaac ataccatact tatatacctg caactaagtg 1501 tttaaaatta tgctctgtaa ctctgtactg ctagtattag aactaaaaat cttaaaatac 1561 agccagtgct taatgcttat atcaatgtgg atttgtcggc ttttatgtaa tctgtaatat 1621 gtatagcagg aaatacgaag agttacacag tgtatgcctt aaaaggctgt ttcttaaagg 1681 tgttacaagg ggataatggt atttcaacta gttatcagca agtgacaata cattccacca 1741 caaatacact cttgttcttc tagcttttag actatatgaa aaaaccgggt gcttcaaagt 1801 acatgataag ggaacactat acctgtcatg gatgaactga agactttgcc tgttcatttt 1861 ttaaatatta ttttcaggtc ctttgcttac caaaggaggc ccaatttcac tcaaatgttt 1921 tgagaactgt gtttaaataa acgcaaatga aaagaaaaaa aaaaaaaaaa aa // LOCUS HSU61374 1800 bp mRNA PRI 15-JAN-1997 DEFINITION Human novel protein with short consensus repeats of six cysteines mRNA, complete cds. ACCESSION U61374 NID g1778409 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1800) AUTHORS Nangaku,M., Bomsztyk,K., Johnson,R.J. and Couser,W.G. TITLE Direct Submission JOURNAL Submitted (19-JUN-1996) Division of Nephrology, University of Washington, 1959 NE Pacific Ave, Seattle, WA 98195, USA COMMENT The short consensus repeat (SCR) is known to exist in complement components, complement regulatory proteins, selectins, and some other genes. Generally SCR has a characteristic framework of four conserved half-cysteine residues (at position 2, 31, 45, and 58). This clone has six cysteines per repeat instead of the four. FEATURES Location/Qualifiers source 1..1800 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HUVEC" CDS 42..1427 /note="a novel protein with short consensus repeats of six cysteines; the protein has three short consensus repeats (SCR), which is a motif of approximately 60 amino acids" /codon_start=1 /db_xref="PID:g1778410" /translation="MGSPAHRPALLLLLPPLLLLLLRVPPSRSFPGSGDSPLEDDEVG YSHPRYKDTPWCSPIKVKYGDVYCRAPQGGYYKTALGTRCDIRCQKGYELHGSSLLIC QSNKRWSDKVICKQKRCPTLAMPANGGFKCVDGAYFNSRCEYYCSPGYTLKGERTVTC MDNKAWSGASLLCGYGPPRIKCPSVKERIAEPNKLTVRVWETPEGRDTADGILTDVIL KGLPPGSNFPEGDHKIQYTVYDRAENKGTCKFRVKVRVKRCGKLNAPENGYMKCSSDG DNYGATCEFSCIGGYELQGSPARVCQSNLAWSGTEPTCAAMNVNVGVRTAAALLDQFY EKRRLLIVSTPTARNLLYRLQLGMLQQAQCGLDLRHITVVELVGVFPTLIGRIGAKIM PPALALQLRLLLRIPLYSFSMVLVDKHGMDKERYVSLVMPVALFNLIDTFPLRKEEMV LQAEMSQTCNT" BASE COUNT 471 a 454 c 433 g 442 t ORIGIN 1 ggaaggcgcg cctgccgagg cgagctaagc gcccgctcgc catggggagc cccgcacatc 61 ggcccgcgct gctgctgctg ctgccgcctc tgctgctgct gctgctgcgc gtcccgccca 121 gccgcagctt cccaggatcg ggagactcac cactagaaga cgatgaagtc gggtattcac 181 accctagata taaagatacc ccgtggtgct cccccatcaa ggtgaagtat ggggatgtgt 241 actgcagggc ccctcaagga ggatactaca aaacagccct gggaaccagg tgcgacattc 301 gctgccagaa gggctacgag ctgcatggct cttccctact gatctgccag tcaaacaaac 361 gatggtctga caaggtcatc tgcaaacaaa agcgatgtcc tacccttgcc atgccagcaa 421 atggagggtt taagtgtgta gatggtgcct actttaactc ccggtgtgag tattattgtt 481 caccaggata cacgttgaaa ggggagcgga ccgtcacatg tatggacaac aaggcctgga 541 gcggcgccag cctcctgtgt ggatatggac ctcctagaat caagtgccca agtgtgaagg 601 aacgcattgc agaacccaac aaactgacag tccgtgtctg ggagacaccc gaaggaagag 661 acacagcaga tggaattctt actgatgtca ttctaaaagg cctcccccca ggctccaact 721 ttccagaagg agaccacaag atccagtaca cagtctatga cagagctgag aataagggca 781 cttgcaaatt tcgagttaaa gtaagagtca aacgctgtgg caaactcaat gccccagaga 841 atggttacat gaagtgctcc agcgacggtg ataattatgg agccacctgt gagttctcct 901 gcatcggcgg ctatgagctc cagggtagcc ctgcccgagt atgtcaatcc aacctggctt 961 ggtctggcac ggagcccacc tgtgcagcca tgaacgtcaa tgtgggtgtc agaacggcag 1021 ctgcacttct ggatcagttt tatgagaaaa ggagactcct cattgtgtcc acacccacag 1081 cccgaaacct cctttaccgg ctccagctag gaatgctgca gcaagcacag tgtggccttg 1141 atcttcgaca catcaccgtg gtggagctgg tgggtgtgtt cccgactctc attggcagga 1201 taggagcaaa gattatgcct ccagccctag cgctgcagct caggctgttg ctgcgaatcc 1261 cactctactc cttcagtatg gtgctagtgg ataagcatgg catggacaaa gagcgctatg 1321 tctccctggt gatgcctgtg gccctgttca acctgattga cacttttccc ttgagaaaag 1381 aagagatggt cctacaagcc gaaatgagcc agacctgtaa cacctgacat gatggttcct 1441 ctcttggcaa ttcctcttca ttgtctacat agtgacatgc acacgggaaa gccttaaaaa 1501 tatccttgat gtacagattt tatttgtaat ttaaaagtct attttattat gagctttctt 1561 gcacttaaaa attagcatgc tgctttttgt acttggaagt gtttcaaaaa attatatgac 1621 catatttact ctttctaact ttctttactc catcatggct ggttgatttt gtagagaaat 1681 tagaacccat aaccatacac aggctatcaa catgttattc aatgtgacac ctaactcttt 1741 tctattttgt tttttaagta agacttttat taataaaaca aaatgttttg gaaaaaaaaa // LOCUS HSU61538 800 bp mRNA PRI 05-DEC-1996 DEFINITION Human calcium-binding protein chp mRNA, complete cds. ACCESSION U61538 NID g1706966 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 800) AUTHORS Lin,X. and Barber,D.L. TITLE A calcineurin homologous protein inhibits GTPase-stimulated Na-H exchange JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (22), 12631-12636 (1996) MEDLINE 97057295 REFERENCE 2 (bases 1 to 800) AUTHORS Lin,X. and Barber,D.L. TITLE Direct Submission JOURNAL Submitted (20-JUN-1996) Stomatology, University of California at San Francisco, HSW604, San Francisco, CA 94143-0521, USA FEATURES Location/Qualifiers source 1..800 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="B-cell" CDS 182..769 /codon_start=1 /product="calcium-binding protein chp" /db_xref="PID:g1706967" /translation="MGSRASTLLRDEELEEIKKETGFSHSQITRLYSRFTSLDKGENG TLSREDFQRIPELAINPLGDRIINAFFPEGEDQVNFRGFMRTLAHFRPIEDNEKSKDV NGPEPLNSRSNKLHFAFRLYDLDKDEKISRDELLQVLRMMVGVNISDEQLGSIADRTI QEADQDGDSAISFTEFVKVLEKVDVEQKMSIRFLH" BASE COUNT 198 a 207 c 202 g 193 t ORIGIN 1 gaattccggg caaagctctt tcaccagatg tagactgtag ccctgctgcc ttccctccag 61 cgagtctgcc agcatgcttc ttcatccttt ttatatgttc tttgcttcct tccctccctc 121 cttgcctcct gtcgccgtct cttctggcgc cgctgctccc ggaggagctc ccggcacggc 181 gatgggttct cgggcctcca cgttactgcg ggacgaagag ctcgaggaga tcaagaagga 241 gaccggcttt tcccacagtc aaatcactcg cctctacagc cggttcacca gcctggacaa 301 aggagagaat gggactctca gccgggaaga tttccagagg attccagaac ttgccatcaa 361 cccactgggg gaccggatca tcaatgcctt ctttccagag ggagaggacc aggtaaactt 421 ccgtggattc atgcgaactt tggctcattt ccgccccatt gaggataatg aaaagagcaa 481 agatgtgaat ggacccgaac cactcaacag ccgaagcaac aaactgcact ttgcttttcg 541 actatatgat ttggataaag atgaaaagat ctcccgtgat gagctgttac aggtgctacg 601 catgatggtc ggagtaaata tctcagatga gcagctgggc agcatcgcag acaggaccat 661 tcaggaggct gatcaggatg gggacagtgc catatctttc acagaatttg ttaaggtttt 721 ggagaaggtg gatgtagaac agaaaatgag catccgattt cttcactaaa ggagaccaaa 781 ctgttcttgc ggtctagtat // LOCUS HSU61837 725 bp mRNA PRI 09-DEC-1997 DEFINITION Homo sapiens putative cyclin G1 interacting protein mRNA, complete cds. ACCESSION U61837 NID g2668504 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 725) AUTHORS Xu,F., Hall,F.L., Starnes,V. and Wu,L. TITLE Direct Submission JOURNAL Submitted (24-JUN-1996) Department of Molecular Pharmacology and Toxicology, University of Southern California, 4650 Sunset Blvd., Los Angeles, CA 90027, USA REFERENCE 2 (bases 1 to 725) AUTHORS Xu,F. TITLE Direct Submission JOURNAL Submitted (09-DEC-1997) Department of Molecular Pharmacology and Toxicology, University of Southern California, 1985 Zonal Avenue, Los Angeles, CA 90033, USA FEATURES Location/Qualifiers source 1..725 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 31..495 /codon_start=1 /product="putative cyclin G1 interacting protein" /db_xref="PID:g2668505" /translation="MVEKKTSVRSQDPGQRRVLDRAARQRRINRQLEALENDNFQDDP HAGLPQLGKRLPQFDDDADTGKKKKKTRGDHFKLRFRKNFQALLEEQNLSVAEGPNYL TACAGPPSRPQRPFCAVCGFPSPYTCVSCGARYCTVRCLGTHQETRCLKWTV" BASE COUNT 157 a 214 c 215 g 139 t ORIGIN 1 gcagtttatt ccgacagttg tgttgtgcca atggtggaga agaaaacttc ggttcgctcc 61 caggaccccg ggcagcggcg ggtgctggac cgggctgccc ggcagcgtcg catcaaccgg 121 cagctggagg ccctggagaa tgacaacttc caggatgacc cccacgcggg actccctcag 181 ctcggcaaga gactgcctca gtttgatgac gatgcggaca ctggaaagaa aaagaagaaa 241 acccgaggtg atcattttaa acttcgcttc cgaaaaaact ttcaggccct gttggaggag 301 cagaacttga gtgtggccga gggccctaac tacctgacgg cctgtgcggg acccccatcg 361 cggccccagc gccccttctg tgctgtctgt ggcttcccat ccccctacac ctgtgtcagc 421 tgcggtgccc ggtactgcac tgtgcgctgt ctggggaccc accaggagac caggtgtctg 481 aagtggactg tgtgagcctg ggcattccca gagaggaagg gccgctgtgc actgcccggc 541 cttcagaaag acagaatttc atcacccaat gcagggggag ctcttcctgg accaagggag 601 gagccgctca ttcacccaac aaaactgtgt cttatctgcc aggaaagacc agcctcactc 661 ctgggaactg tctggcaggt aggctgggcc ccccagtgct gttagaataa aaagcctcgt 721 gccgg // LOCUS HSU61849 5071 bp mRNA PRI 09-OCT-1996 DEFINITION Human neuronal pentraxin 1 (NPTX1) mRNA, complete cds. ACCESSION U61849 NID g1438953 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5071) AUTHORS Omeis,I.A., Hsu,Y.C. and Perin,M.S. TITLE Mouse and human neuronal pentraxin 1 (NPTX1): conservation, genomic structure, and chromosomal localization JOURNAL Genomics 36 (3), 543-545 (1996) MEDLINE 97038700 REFERENCE 2 (bases 1 to 5071) AUTHORS Perin,M.S., Omeis,I.A. and Hsu,Y-C. TITLE Direct Submission JOURNAL Submitted (23-JUN-1996) Neuroscience, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..5071 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17" /map="17q25.1-17q25.2" gene 139..1431 /gene="NPTX1" CDS 139..1431 /gene="NPTX1" /note="similar to rat neuronal pentraxin 1, GenBank Accession Number U18772" /codon_start=1 /product="neuronal pentraxin 1" /db_xref="PID:g1438954" /translation="MPAGRARTCALLALCLLGPQDFGPTRFICTSVPVDADMCAASVA AGGAEELRSSNVLQLRETVLQQKETILSQKETIRELTAKLGRCESQSTLDPGAGEARA GGGRKQPGSGKNTMGDLSRTPAAETLSQLGQTLQSLKTRLENLEQYSRLNSSSQTNSL KDLLQSKIDELERQVLSRVNTLEEGKGGPKNDTEERVKIETALTSLHQRISELEKGQK DNRPGDKFQLTFPLRTNYMYAKVKKSLPEMYAFTVCMWLKSSATPGVGTPFSYAVPGQ ANELVLIEWGNNPMEILINDKVAKLPFVINDGKWHHICVTWTTRDGVWEAYQDGTQGG SGENLAPYHPIKPQGVLVLGQEQDTLGGGFDATQAFVGELAHFNIWDRKLTPGEVYNL ATCSTKALSGNVIAWAESHIEIYGGATKWTFEACRQIN" BASE COUNT 1055 a 1472 c 1416 g 1128 t ORIGIN 1 ggcctgatag cgcggcgtgt ggaccggccg aagagcgcgc ccagagcggc gccgtcgcga 61 gccacagccc gagccggtcc cagccgagcc gagccccagc cgagccgagc cggcccgagc 121 gctccggtgc ccgcagccat gccggccggc cgcgcgcgca cctgtgcgct gctcgccctc 181 tgcctcctgg ggccccagga tttcgggccg acgcgcttca tctgcacttc ggtgccggtg 241 gacgccgaca tgtgtgccgc gtccgtggcc gcgggcggcg cggaggagct ccggagcagc 301 aatgtgctgc agctccggga gaccgtgctg cagcagaagg agaccatcct gagccagaag 361 gagaccatcc gcgagctgac cgccaagctg ggccgctgcg agagccagag cacgctggac 421 cccggagccg gcgaggcccg ggcgggcggc ggccgcaagc agccgggctc gggcaagaac 481 accatgggcg acctgtcccg gacaccggcc gccgagacgc tcagccaact cgggcaaact 541 ttgcaatcgc tcaaaacccg cctggagaac ctcgagcagt acagccgcct caattcctcc 601 agccagacca acagcctcaa ggatctgctg cagagcaaga tcgatgagct ggagaggcag 661 gtgctgtccc gggtgaacac cctggaggag ggcaaggggg ggcccaagaa cgacaccgag 721 gagagggtca agatcgagac cgccctgacc tccctgcacc agcggatcag cgagctcgag 781 aaaggtcaga aagacaaccg ccctggagac aagttccagc tcacattccc actgcggacc 841 aactatatgt atgccaaggt gaagaagagc ctgccagaga tgtacgcctt cactgtctgc 901 atgtggctca agtccagcgc cacgccaggt gtgggcacgc ccttctccta cgctgtgccc 961 ggccaggcca acgagctggt cctcattgag tggggcaaca accccatgga gatcctcatc 1021 aatgacaagg tggccaagct gccttttgtc atcaatgatg gcaagtggca ccacatctgt 1081 gtcacctgga ccacccggga cggggtctgg gaggcctacc aggatggcac gcagggtggc 1141 agtggcgaga acttggcgcc ctatcacccc atcaagcccc agggcgtgct ggtgctgggc 1201 caggagcagg acactctggg tggtgggttt gatgccaccc aggcatttgt gggtgagctg 1261 gcccacttca acatctggga ccgcaagctg acccccgggg aggtgtacaa cctggccacc 1321 tgcagcacca aggctctgtc cggcaatgtc atcgcctggg ctgaatccca catcgagatc 1381 tacggcgggg ccaccaagtg gaccttcgag gcctgtcgcc agatcaactg agcacggcag 1441 gccaggctga gccgcccgcc ctcgccccct gcttgtgcgg cgatgatctg tttgtgcgtc 1501 tcttctctcc cttttcccca ggaatgaacc gaggccgtcg ccctgcacac gcacacgcac 1561 acagcctggt tttgtcctca tgcacacgaa gcagcccctg ctcccatctg tccctgagga 1621 agccccactt ctctgtagga gcccggactc tctcagcatg ccccattcac agctgaagtg 1681 ggtgctgcaa cgtcttgaac aaggcagaag ttggtgagag gatctgtgtg tgcgtgtcta 1741 catgtgtgtg tctacgtgtg tgcgtgcgtg gctgggggag gccttttctt tgaggacgta 1801 cctcatttcc ttctttcttc tggctttgga aaaatctcat gatgaaaatt catatttgcc 1861 aactttgtta gctgcgtgcg tgctttgggg ttggtgcaac ctcagtacac gcatttgtct 1921 ttgtttgcaa acctttctca gagcgacata tctttatatt gatgtaataa atgtctttta 1981 gtggtttgtc aaaggccggg ggcgggggct ctctacagag aatttttatt ttgtaataga 2041 agtgaactgt ctctgaaggt gaaggcaggc cgtcctggga tggtaccctg tgctctcccg 2101 tggaggagag gggatggctg aggacactgg cccttaccca gggcgaacag catccatccc 2161 tgctgtttgc atcttgagag cagcatgggg cctgggaggt cggcctgtgt gcccagctca 2221 gctagctctg ccccaggacg gccctgccct cgaccttccc acctcctcag atcctgcaag 2281 gctgggtctg cccctccctt ctcacctctg gagctgtgct gcactgcttc ccagagaggg 2341 ccctgagaga ggagcgtgcc acccaccagg gaagccgggc cccagcaccc ctctcctttg 2401 gcctcccgga gtgcagacca gaggggacct tttaaggaaa gaagccgtgt ttcgatgaag 2461 acctggccac atggggcact gggacttcaa cccagcccat cggtgggaag gtcctttttg 2521 ggggactttg acagccatat ccctcccagc acaccaggcg ccaggtgagc tggttcagac 2581 ccctccaggg tactccagag acctcacgtg tggagccagg cctggccagg caggggcctg 2641 aaacccactc ctccatctca tgggctcacg gcctacacga gcccacaagc tgccactggc 2701 cggcgacact gacacctgag cagtgtccag aacctttttg cctttttttg ttccccgtga 2761 aaagcaacat ggacatttcc ttctagtcct tccaaggagg ggagagaagt gtatgtgcat 2821 ttgtgtgtgt gtgtgtgtgt tgtgtgtgtg tgtgcgctaa gtgagaaaga gagcaggctc 2881 gggaggccct gcccagggta ggaggagctt cctgctttgc accatctggt ggtcgcaccc 2941 tggagggcac cccgactctg tctccaggag tctcatcagc aaaccgctga caagtctttc 3001 tagaaattct actgcactgc ctggctcagc tgcacgctgc agacatttct gcaggaggag 3061 caggtgtttc tgtcttctgt tccttctagg gccacctgtc cccttaaaca caggtccacg 3121 ttgtgtcaag aacctagtgc atctgtgtgt gtctgtcagt gtctctgtgt cagtgttctc 3181 gtgggtgtct gcacggtacc ggcccgcctt ctgcaatcat cactcccgca gagggggtgc 3241 agatcaggcg ccgtgctgcg gttgttgttc aacagtgctt tttcttagat agcgtcttcc 3301 tcagcgcccg tcggttgtgg catccttgat ctcagggatc ttctccgttt gcatgtcctc 3361 ggagtggcgt gttccttctc cctgggtccg acatgtgttc ccgcacctgc atggactgcc 3421 ccggttctgt gttgtgtgcc gagtgccgcc cagtgttctg tgaccacccg tgtagctact 3481 gaaaatggct gggtaagcaa gtcaagggtg ttggaggagg tcaagagaga gctcagtttc 3541 cctctccccc tccccaaaca caccaagaag catttttaac gtgtaggttg agaacaagcc 3601 taaaggattc ccacagctgg gagccagcaa gagagcttgg agtcgcctct ctagaccaga 3661 tctagcccca ccctcactcc agccatctcg gagcccttgt gtaggcaacg ccggtgcgcg 3721 gctgtgtggg gtgctccctg ccagcacctc cggccagccc cgcccctgcc gatctactgg 3781 accgcagacc accttctgcc cccgtggcca ggtgggagct gtccgttcag gaccatgagc 3841 catcctctgc cctgactagc gaggggcaga gcacacccca gtgcttacgc ctccacccct 3901 gcagcctcct ggcccgctca ccttcctcac ccctcctctg acccacccat ggtgccaggg 3961 ccgaagctga cctttagctc cctcctgccc ttgctagggt ctgagccaag cccctcgact 4021 cctcactgtg ttgacacttg gcactttgct ggccccgaga aggtcgatga cacagccgca 4081 aatctaatcc acgtagttcc catttactcc ttaatctgat tgatgttccc tcttgcactg 4141 aataatacat gcctctctca ggtaagccat tttataaaac aagaagataa aaagcactgt 4201 tgaggcagtg tttgcttttg ccgagctggt gtccgacagc tccctgggtg tccggggtgg 4261 gagagctgtt gacagaagct ctccgggcct cagggcttag atcccacttg agtcgtaagc 4321 cttcttgctt ttgataacac agtattattt ctcttactgt agaagaaaaa gtttattacc 4381 aaacaagagt atttttatga aagaaaagga caaacctata aattaactca acctatatct 4441 cccttgaaaa tactttcagg ctccaccaaa acgtagaact gaaagcatgt attttggaag 4501 aaagagatac attttgtatg ctttcttttc cttttgtaga ttcccagttt attttctaag 4561 actgcaaaga tcactttgtc accagccctg ggacctgaga ccaagggggt gtcttgtggg 4621 cagtgagggg gtgaggagag gctggcatga ggttcagtca ttccagtgag ctccaaagag 4681 gggccacctg ttctcaaaag catgttgggg accaggaggt aaaactggcc atttatggtg 4741 aacctgtgtc ttggagctga cttactaagt ggaatgagcc gaggatttga atatcagttc 4801 taaccttgat agaagaacct tgggttacat gtggttcaca ttaagaggat agaatccttt 4861 ggaatcttat ggcaaccaaa tgtggcttga cgaagtcgtg gtttcatctc ttaaacacag 4921 tgtgtaaatt tattcaacta acgatgggaa atgtattact tctgtacaca gtggactgaa 4981 gtgcaatttg ttgaaaggga acaagtcatt gaagagaaaa aaaaaaagcc caatacttag 5041 agtcccaatt ttgtctcatt tgccaaaaaa a // LOCUS HSU61981 4374 bp mRNA PRI 15-AUG-1996 DEFINITION Human putative mismatch repair/binding protein hMSH3 (hMSH3) mRNA, complete cds. ACCESSION U61981 NID g1490520 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4374) AUTHORS Acharya,S., Wilson,T., Gradia,S., Kane,M., Guerrette,S., Marsischky,G., Kolodner,R. and Fishel,R. TITLE Human hMSH2, hMSH3 and hMSH6 form specific mispair-binding protein complexes that account for the allele distribution in HNPCC JOURNAL Unpublished REFERENCE 2 (bases 1 to 4374) AUTHORS Guerrette,S., Lescoe,M.K., Alder,H. and Fishel,R.A. TITLE Direct Submission JOURNAL Submitted (24-JUN-1996) Microbiology, Kimmel Cancer Institute, Thomas Jefferson University, 233 South 10th Street, Philadelphia, PA 19107, USA FEATURES Location/Qualifiers source 1..4374 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="HeLa" /chromosome="5" /map="5q11.2-q13.3" gene 17..3403 /gene="hMSH3" CDS 17..3403 /gene="hMSH3" /function="putative mismatch repair/binding protein" /note="HUMDUG; human divergent upstream protein gene DUG, GenBank Accession Number J04810; E. Coli Muts homolog" /codon_start=1 /product="hMSH3" /db_xref="PID:g1490521" /translation="MSRRKPASGGLAASSSAPARQAVLSRFFQSTGSLKSTSSSTGAA DQVDPGAAAAAAPPAPAFPPQLPPHVATEIDRRKKRPLENDGPVKKKVKKVQQKEGGS DLGMSGNSEPKKCLRTRNVSKSLEKLKEFCCDSALPQSRVQTESLQERFAVLPKCTDF DDISLLHAKNAVSSEDSKRQINQKDTTLFDLSQFGSSNTSHENLQKTASKSANKRSKS IYTPLELQYIEMKQQHKDAVLCVECGYKYRFFGEDAEIAARELNIYCHLDHNFMTASI PTHRLFVHVRRLVAKGYKVGVVKQTETAALKAIGDNRSSLFSRKLTALYTKSTLIGED VNPLIKLDDAVNVDEIMTDTSTSYLLCISENKENVRDKKKGNIFIGIVGVQPATGEVV FDSFQDSASRSELETRMSSLQPVELLLPSALSEQTEALIHRATSVSVQDDRIRVERMD NIYFEYSHAFQAVTEFYAKDTVDIKGSQIISGIVNLEKPVICSLAAIIKYLKEFNLEK MLSKPENFKQLSSKMEFMTINGTTLRNLEILQNQTDMKTKGSLLWVLDHTKTSFGRRK LKKWVTQPLLKLREINARLDAVSEVLHSESSVFGQIENHLRKLPDIERGLCSIYHKKC STQEFFLIVKTLYHLKSEFQAIIPAVNSHIQSDLLRTVILEIPELLSPVEHYLKILNE QAAKVGDKTELFKDLSDFPLIKKRKDEIQGVIDEIRMHLQEIRKILKNPSAQYVTVSG QEFMIEIKNSAVSCIPTDWVKVGSTKAVSRFHSPFIVENYRHLNQLREQLVLDCSAEW LDFLEKFSEHYHSLCKAVHHLATVDCIFSLAKVAKQGDYCRPTVQEERKIVIKNGRHP VIDVLLGEQDQYVPNNTDLSEDSERVMIITGPNMGGKSSYIKQVALITIMAQIGSYVP AEEATIGIVDGIFTRMGAADNIYKGRSTFMEELTDTAEIIRKATSQSLVILDELGRGT STHDGIAIAYATLEYFIRDVKSLTLFVTHYPPVCELEKNYSHQVGNYHMGFLVSEDES KLDPGAAEQVPDFVTFLYQITRGIAARSYGLNVAKLADVPGEILKKAAHKSKELEGLI NTKRKRLKYFAKLWTMHNAQDLQKWTEEFNMEETQTSLLH" BASE COUNT 1463 a 846 c 937 g 1128 t ORIGIN 1 gggcacgagc cctgccatgt ctcgccggaa gcctgcgtcg ggcggcctcg ctgcctccag 61 ctcagcccct gcgaggcaag cggttttgag ccgattcttc cagtctacgg gaagcctgaa 121 atccacctcc tcctccacag gtgcagccga ccaggtggac cctggcgctg cagcggccgc 181 agcgccccca gcgcccgcct tcccgcccca gctgccgccg cacgtagcta cagaaattga 241 cagaagaaag aagagaccat tggaaaatga tgggcctgtt aaaaagaaag taaagaaagt 301 ccaacaaaag gaaggaggaa gtgatctggg aatgtctggc aactctgagc caaagaaatg 361 tctgaggacc aggaatgttt caaagtctct ggaaaaattg aaagaattct gctgcgattc 421 tgcccttcct caaagtagag tccagacaga atctctgcag gagagatttg cagttctgcc 481 aaaatgtact gattttgatg atatcagtct tctacacgca aagaatgcag tttcttctga 541 agattcgaaa cgtcaaatta atcaaaagga cacaacactt tttgatctca gtcagtttgg 601 atcatcaaat acaagtcatg aaaatttaca gaaaactgct tccaaatcag ctaacaaacg 661 gtccaaaagc atctatacgc cgctagaatt acaatacata gaaatgaagc agcagcacaa 721 agatgcagtt ttgtgtgtgg aatgtggata taagtataga ttctttgggg aagatgcaga 781 gattgcagcc cgagagctca atatttattg ccatttagat cacaacttta tgacagcaag 841 tatacctact cacagactgt ttgttcatgt acgccgcctg gtggcaaaag gatataaggt 901 gggagttgtg aagcaaactg aaactgcagc attaaaggcc attggagaca acagaagttc 961 actcttttcc cggaaattga ctgcccttta tacaaaatct acacttattg gagaagatgt 1021 gaatccccta atcaagctgg atgatgctgt aaatgttgat gagataatga ctgatacttc 1081 taccagctat cttctgtgca tctctgaaaa taaggaaaat gttagggaca aaaaaaaggg 1141 caacattttt attggcattg tgggagtgca gcctgccaca ggcgaggttg tgtttgatag 1201 tttccaggac tctgcttctc gttcagagct agaaacccgg atgtcaagcc tgcagccagt 1261 agagctgctg cttccttcgg ccttgtccga gcaaacagag gcgctcatcc acagagccac 1321 atctgttagt gtgcaggatg acagaattcg agtcgaaagg atggataaca tttattttga 1381 atacagccat gctttccagg cagttacaga gttttatgca aaagatacag ttgacatcaa 1441 aggttctcaa attatttctg gcattgttaa cttagagaag cctgtgattt gctctttggc 1501 tgccatcata aaatacctca aagaattcaa cttggaaaag atgctctcca aacctgagaa 1561 ttttaaacag ctatcaagta aaatggaatt tatgacaatt aatggaacaa cattaaggaa 1621 tctggaaatc ctacagaatc agactgatat gaaaaccaaa ggaagtttgc tgtgggtttt 1681 agaccacact aaaacttcat ttgggagacg gaagttaaag aagtgggtga cccagccact 1741 ccttaaatta agggaaataa atgcccggct tgatgctgta tcggaagttc tccattcaga 1801 atctagtgtg tttggtcaga tagaaaatca tctacgtaaa ttgcccgaca tagagagggg 1861 actctgtagc atttatcaca aaaaatgttc tacccaagag ttcttcttga ttgtcaaaac 1921 tttatatcac ctaaagtcag aatttcaagc aataatacct gctgttaatt cccacattca 1981 gtcagacttg ctccggaccg ttattttaga aattcctgaa ctcctcagtc cagtggagca 2041 ttacttaaag atactcaatg aacaagctgc caaagttggg gataaaactg aattatttaa 2101 agacctttct gacttccctt taataaaaaa gaggaaggat gaaattcaag gtgttattga 2161 cgagatccga atgcatttgc aagaaatacg aaaaatacta aaaaatcctt ctgcacaata 2221 tgtgacagta tcaggacagg agtttatgat agaaataaag aactctgctg tatcttgtat 2281 accaactgat tgggtaaagg ttggaagcac aaaagctgtg agccgctttc actctccttt 2341 tattgtagaa aattacagac atctgaatca gctccgggag cagctagtcc ttgactgcag 2401 tgctgaatgg cttgattttc tagagaaatt cagtgaacat tatcactcct tgtgtaaagc 2461 agtgcatcac ctagcaactg ttgactgcat tttctccctg gccaaggtcg ctaagcaagg 2521 agattactgc agaccaactg tacaagaaga aagaaaaatt gtaataaaaa atggaaggca 2581 ccctgtgatt gatgtgttgc tgggagaaca ggatcaatat gtcccaaata atacagattt 2641 atcagaggac tcagagagag taatgataat taccggacca aacatgggtg gaaagagctc 2701 ctacataaaa caagttgcat tgattaccat catggctcag attggctcct atgttcctgc 2761 agaagaagcg acaattggga ttgtggatgg cattttcaca aggatgggtg ctgcagacaa 2821 tatatataaa ggacggagta catttatgga agaactgact gacacagcag aaataatcag 2881 aaaagcaaca tcacagtcct tggttatctt ggatgaacta ggaagaggga cgagcactca 2941 tgatggaatt gccattgcct atgctacact tgagtatttc atcagagatg tgaaatcctt 3001 aaccctgttt gtcacccatt atccgccagt ttgtgaacta gaaaaaaatt actcacacca 3061 ggtggggaat taccacatgg gattcttggt cagtgaggat gaaagcaaac tggatccagg 3121 cgcagcagaa caagtccctg attttgtcac cttcctttac caaataacta gaggaattgc 3181 agcaaggagt tatggattaa atgtggctaa actagcagat gttcctggag aaattttgaa 3241 gaaagcagct cacaagtcaa aagagctgga aggattaata aatacgaaaa gaaagagact 3301 caagtatttt gcaaagttat ggacgatgca taatgcacaa gacctgcaga agtggacaga 3361 ggagttcaac atggaagaaa cacagacttc tcttcttcat taaaatgaag actacatttg 3421 tgaacaaaaa atggagaatt aaaaatacca actgtacaaa ataactctcc agtaacagcc 3481 tatctttgtg tgacatgtga gcataaaatt atgaccatgg tatattccta ttggaaacag 3541 agaggttttt ctgaagacag tctttttcaa gtttctgtct tcctaacttt tctacgtata 3601 aacactcttg aatagacttc cactttgtaa ttagaaaatt ttatggacag taagtccagt 3661 aaagccttaa gtggcagaat ataattccca agcttttgga gggtgatata aaaatttact 3721 tgatattttt atttgtttca gttcagataa ttggcaactg ggtgaatctg gcaggaatct 3781 atccattgaa ctaaaataat tttattatgc aaccagttta tccaccaaga acataagaat 3841 tttttataag tagaaagaat tggccaggca tggtggctca tgcctgtaat cccagcactt 3901 tgggaggcca aggtaggcag atcacctgag gtcaggagtt caagaccagc ctggccaaca 3961 tggcaaaacc ccatctttac taaaaatata aagtacatct ctactaaaaa tacgaaaaaa 4021 ttagctgggc atggtggcgc acacctgtag tcccagctac tccggaggct gaggcaggag 4081 aatctcttga acctgggagg cggaggttgc aatgagccga gatcacgtca ctgcactcca 4141 gcttgggcaa cagagcaaga ctccatctca aaaaagaaaa aagaaaagaa atagaattat 4201 caagctttta aaaactagag cacagaagga ataaggtcat gaaatttaaa aggttaaata 4261 ttgtcatagg attaagcagt ttaaagattg ttggatgaaa ttatttgtca ttcattcaag 4321 taataaatat ttaatgaata cttgctataa aaaaaaaaaa aaaaaaaaaa aaaa // LOCUS HSU62293 65608 bp DNA PRI 26-OCT-1996 DEFINITION Human LIM-kinase1 and alternatively spliced LIM-kinase1 (LIMK1) gene, complete cds. ACCESSION U62293 NID g1432163 KEYWORDS Williams syndrome. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 65608) AUTHORS Frangiskakis,J.M., Ewart,A.K., Morris,C.A., Mervis,C.B., Bertrand,J., Robinson,B.F., Klein,B.P., Ensing,G.J., Everett,L.A., Green,E.D., Proschel,C., Gutowski,N.J., Noble,M., Atkinson,D.L., Odelberg,S.J. and Keating,M.T. TITLE LIM-kinase1 hemizygosity implicated in impaired visuospatial constructive cognition JOURNAL Cell 86 (1), 59-69 (1996) MEDLINE 96291399 REFERENCE 2 (bases 1 to 65608) AUTHORS Frangiskakis,J.M., Odelberg,S.J., Atkinson,D.L. and Keating,M.T. TITLE Direct Submission JOURNAL Submitted (25-JUN-1996) Human Genetics, Univ. of Utah, 10 N 2030 E, Bldg 533, Suite 2100, Salt Lake City, UT 84112, USA FEATURES Location/Qualifiers source 1..65608 /organism="Homo sapiens" /note="Stratagene catalog No. 936205; chromosome 7-specific flow-sorted cosmid library from Lawrence Livermore National Laboratories" /db_xref="taxon:9606" /chromosome="7" /map="7q11.23" /tissue_type="placenta; hippocampus" mRNA join(<1..101,1789..1885,12663..12801,13121..13230, 15072..15278,21920..22025,22122..22288,23055..23238, 23915..24001,24949..25080,27692..27751,27977..28042, 31846..32002,36610..36665,36919..37076,37166..38524) /gene="LIMK1" /note="Williams syndrome region; Kiz-1; protein kinase with two LIM domains" /product="LIM-kinase1" gene 1..38534 /gene="LIMK1" CDS join(47..101,1789..1885,12663..12801,13121..13230, 15072..15278,21920..22025,22122..22288,23055..23238, 23915..24001,24949..25080,27692..27751,27977..28042, 31846..32002,36610..36665,36919..37076,37166..37328) /gene="LIMK1" /note="Williams syndrome region; Kiz-1; protein kinase with two LIM domains" /codon_start=1 /product="LIM-kinase1" /db_xref="PID:g1432164" /translation="MRLTLLCCTWREERMGEEGSELPVCASCGQRIYDGQYLQALNAD WHADCFRCCDCSASLSHQYYEKDGQLFCKKDYWARYGESCHGCSEQITKGLVMVAGEL KYHPECFICLTCGTFIGDGDTYTLVEHSKLYCGHCYYQTVVTPVIEQILPDSPGSHLP HTVTLVSIPASSHGKRGLSVSIDPPHGPPGCGTEHSHTVRVQGVDPGCMSPDVKNSIH VGDRILEINGTPIRNVPLDEIDLLIQETSRLLQLTLEHDPHDTLGHGLGPETSPLSSP AYTPSGEAGSSARQKPVLRSCSIDRSPGAGSLGSPASQRKDLGRSESLRVVCRPHRIF RPSDLIHGEVLGKGCFGQAIKVTHRETGEVMVMKELIRFDEETQRTFLKEVKVMRCLE HPNVLKFIGVLYKDKRLNFITEYIKGGTLRGIIKSMDSQYPWSQRVSFAKDIASGMAY LHSMNIIHRDLNSHNCLVRENKNVVVADFGLARLMVDEKTQPEGLRSLKKPDRKKRYT VVGNPYWMAPEMINGRSYDEKVDVFSFGIVLCEIIGRVNADPDYLPRTMDFGLNVRGF LDRYCPPNCPPSFYPITVRCCDLDPEKRPSFVKLEHWLETLRMHLAGHLPLGPQLEQL DRGFWETYRRGESGLPAHPEVPD" CDS join(89..101,1789..1885,12663..12801,13121..13230, 15072..15278,21920..22025,22122..22288,23055..23238, 23915..24001,24949..25080,27692..27751,27977..28042, 31846..32002,36610..36665,36919..37076,37166..37328) /gene="LIMK1" /note="Williams syndrome region; Kiz-1; protein kinase with two LIM domains; alternative translation initiation site" /codon_start=1 /product="alternatively spliced LIM-kinase1" /db_xref="PID:g1432165" /translation="MGEEGSELPVCASCGQRIYDGQYLQALNADWHADCFRCCDCSAS LSHQYYEKDGQLFCKKDYWARYGESCHGCSEQITKGLVMVAGELKYHPECFICLTCGT FIGDGDTYTLVEHSKLYCGHCYYQTVVTPVIEQILPDSPGSHLPHTVTLVSIPASSHG KRGLSVSIDPPHGPPGCGTEHSHTVRVQGVDPGCMSPDVKNSIHVGDRILEINGTPIR NVPLDEIDLLIQETSRLLQLTLEHDPHDTLGHGLGPETSPLSSPAYTPSGEAGSSARQ KPVLRSCSIDRSPGAGSLGSPASQRKDLGRSESLRVVCRPHRIFRPSDLIHGEVLGKG CFGQAIKVTHRETGEVMVMKELIRFDEETQRTFLKEVKVMRCLEHPNVLKFIGVLYKD KRLNFITEYIKGGTLRGIIKSMDSQYPWSQRVSFAKDIASGMAYLHSMNIIHRDLNSH NCLVRENKNVVVADFGLARLMVDEKTQPEGLRSLKKPDRKKRYTVVGNPYWMAPEMIN GRSYDEKVDVFSFGIVLCEIIGRVNADPDYLPRTMDFGLNVRGFLDRYCPPNCPPSFY PITVRCCDLDPEKRPSFVKLEHWLETLRMHLAGHLPLGPQLEQLDRGFWETYRRGESG LPAHPEVPD" polyA_site 38524..38529 /gene="LIMK1" polyA_site 38529..38534 /gene="LIMK1" misc_feature 62161 /note="K2049 deletion breakpoint" BASE COUNT 15987 a 16979 c 16761 g 15878 t 3 others ORIGIN 1 ccgcccccag ccccagcccc gccgggcccc gccccccgtc gagtgcatga ggttgacgct 61 actttgttgc acctggaggg aagaacgtat gggagaggaa ggtgcgcggg ccgcggggtg 121 tggggcgagg gcctggaggg ggtgcccggg cagcgtgggg cacgggaggg ggccgggtct 181 gccaggaggc cgcgccctgc ctcctccggg atgagctcgt ccttacgaag cccgcaggcc 241 cctccctgtc cccctcccgc ccgggatccc cctccccggc ccccggcgag ctgccctcct 301 gcgggtctgg gggcccctgg accctttttc ctcctcccac gtccccccgc gaaggactcc 361 cagacactgc ccaccccgcg tcggcctcca tccgcgtgct ctgtccacca cccgggcctc 421 gctggggcca ccctttatcc agtctcggaa gaaagagcgg ctggggacac agccccgggt 481 cccagtggcc gcctgcccgg ctctgtgacc ttgagccagg cgctgacttc ctggtcctca 541 gtttcccctt ctgtacattt ggaaactggg tagttgcccc cccggtgtcg gtgattgggg 601 gccagatggg tagagcggag ataggcgtcc aggaagccgg aggccgtgta ctgcgggagc 661 ctcatccact ctccctgtcc gtgccccaaa cccggtgcct gccctcagtc ttggctggga 721 gcatgactca tcctaacctc ctctttagcc ccttctccct cactggggcc caaggcgcag 781 tactgcactg cagttagggt tcaaggactc ccccagccta ggacagggtc tgggggcccc 841 tccttggatc tccttcgctg acctgtcact tagatccacc tggccccaag gcagggcctg 901 actccacacc tccccctgcc accaactctt cccaggccca tgaaaacctg attggggtag 961 gggcccacct tcctgtagcc cctgcctacc taaggtacct gcgtcttcac agagggtcag 1021 gctgttgtgg ccttgggacc tagctatgtg actgggcaag ccatgccatc tctggggctc 1081 agtctcccct tctgtacagt ggagaggggc aggtctgggg cattttccag ggcccaccag 1141 ctccaagggt gccaggcccc aaggatgact aagcatcgtg tggctggcta gaggaggtgc 1201 caggcctccc tgggacaggt gtctgggagt acccacgtct gcagcccctt ccccttgcca 1261 agccagggca ttcattgcca aggatctgtt agggccggca cctccaggct tcctgccctt 1321 gacctcccag ctggcttcag cccaggatgc actaatccag ccctgtccag tccctgcctt 1381 tgaagggccc tcttagtact tcttcctggg caggagaggg aagaaaggag gctgtgatag 1441 gaatgtcacc cactgcctta tccctaaagc cactgcttcc tttctcctca tttaccttgc 1501 cagatccaat gctatagcgg gaggatggac ctgatcctcc tcctaagctg atacataggg 1561 aaacagggcc agagaagctt ggcaacctag tcagtatctc agcaagactc aggccagcgc 1621 cctttcttct cctatttggc acagcgactg ccctgcctgg gcgctgcaca tgtgcagtgt 1681 gcgaggattg gtgcaggtgt aggtatatgt ggggtgggca gggcaagctg ggcctgcacc 1741 agatcacact tcctgagaat gcttcccaac tcccttccca ccctgcagga agcgagttgc 1801 ccgtgtgtgc aagctgcggc cagaggatct atgatggcca gtacctccag gccctgaacg 1861 cggactggca cgcagactgc ttcaggtagg gtggggtgcc cagggcctgt gttgccctaa 1921 acaaggcctg ccagagagga caggctggtc aaggaatggg ggaggccggg atatgcctcc 1981 tggtgccgtc ccctattgtg acttcgtggc cttaatttac catttatgac atgaggtgtt 2041 ttgactagaa aatccctaca ggccttcctg ttgtcatttt atttatctat ttttttttct 2101 ttttgagacg gagtctcgct ctgtcaccca ggctggagta cagtggtgcg atcttggctc 2161 attgcaacct cttcctcctg ggctcacgca gttctcctgt gtcagcctct ggagtagctg 2221 ggattacagg cgtgcaccac cacgcccagc taatttttgt atttttagta gagacgggtt 2281 ttgccatgtt agccaggctg gtctgaaact tctgacctca agtgatcttc ccacctcagc 2341 ctcccaaagt gctgggatga cagacataaa ccaccgctcc tggcctcatt ttattttctt 2401 ttatgtattt ttcttttttc gaaatggtct tgctctgttg cccaggctgg agtgcagtgg 2461 tgccatctcg gctcattgca acctccatct cccgggctaa agtgatcctc ctacttcagc 2521 ctcccgagta gctgggatta taggtataca ccacaatgct cagctaattt tttaaatttt 2581 gtgtaaagac agggtctcac tattgagacc caggctggtc ttgaacttgt gacctcaagc 2641 aatcctcctg ccttggcctc cgaaagtgct aggcttacag gcgtgagcta acgccttggc 2701 ctctgttgtc atcctagatc tctgagatct aaatcttaga gaggatggga gagacctcca 2761 attgagccag tgcctgcaat tcagccccct gctggcaccc agacaggggg aagagttgga 2821 aggaatgtcc ctcctgcctt ctgggtgttc atgctcttgc agggagggaa gacaaaccag 2881 gccttaaggg aaaccaggcc accctcagtg tcttcccagg ctgcttgcga acatgcataa 2941 cccagtcaca ccagccccag tgtccagaca cacacccaca ggtaggaaga aagtagggtc 3001 agggttgtgg cggaggataa agagtacatg aggacctgaa ggtcacccag taggaccatc 3061 ctgagaagcc aggagcaggg gtctacctgc cttgagccag agcagggcca gagcaggggt 3121 ctcaaaggat gtgagatttc ctgggtagaa aagtagagtg gaggtggggc gtggtggctc 3181 acacctataa tcccatcact ttttggggct gaggtgggca gatcacttga gttcaggagt 3241 tcgagacaag cctgggcaat atggcaacac cctgtctcca ctgaaaatac aaaaaattag 3301 ccgggcgtgg tgcgcatgcc tgtagtccca gctactcaag aggctgaggt ggcagggtta 3361 cttgagcctg ggaggtggag gctgcagtga gctatgatcg caccactgca ctctagcctg 3421 ggcaatagag cgagacccag tctcaatttt taaaaaagaa agaaagaaaa acaaatggtg 3481 tgggagagaa ttacaggcat agtcaccaaa cagcaaggtt caggggagaa aactccataa 3541 aagggtagaa ggtgaagctt ctgggatgcc cagcaggggt caagacatcc accactagga 3601 ctttatttta ggcttctgcc ttggtttatt ttttggtttt tggttttttt gagacagtct 3661 tgttgtgtcg cccaggctgg ggagcagtgg cgcgatccct cctcactgca acctccgcct 3721 cccaggttca agcgattctc ctgcttcagc ctcccaagta gctgggatta caggtgtgca 3781 ccaccacgcc cggctaagtt ttgtattttc agtagagata gggttttgcc atgttggcca 3841 ggctggtctc gaactcctga cctcaagtga tctgcccgcc tcagcctccc aaaatgctgg 3901 aattacaggc atgagccact gcacctggcc tcggtttgtt tttttgtttc ttcttttctt 3961 tttttttaca cagggtcttg ctgtgtcacc caggctggcg tgcagtggtg agatcatagc 4021 ccactgtagc ctccagctcc aactggttca agcgatcctt ctgactcagc ctcccaaagt 4081 gctgggatta caagcataag ccaccatgcc cagcctgttt tttctttttt aggaataacg 4141 tctaacgttt tctaacattc agtaagggac aacccctgtt ctaagtactt tgcatagtta 4201 gatattagtg ctgtctttgt tttgccagag agaaaattgg gacacagaga ggttaattct 4261 cttgatgaaa gtcacacagc cagtgagtga aatgaacaca ctcagtgtgg ctgaaaggag 4321 acagacagca tgccctggga ttctgcatca ggtgctcaga aagaggcctt cggggggcaa 4381 gagggctctc aacaggcaga ggaaaccatc tgcacagcgg tgggatggtg cggactgctg 4441 agggaacagg aacagttccc ttggaaggaa cagaataagc tgagggatcc aacaagaaac 4501 aaagttgaga ccgattcgtg aagggccttg aatgccaaga taaggagttt cagaagtcag 4561 gatgggggtg gtggctcatc cccgtaatcc cagcactttg ggaggccgag gcaggcagat 4621 cacttgaccc caggagcttg agaccagcct ggccaacgtg gtgaaacccc cgtctctact 4681 aaatattcaa aaattagcca ggcatggtgg cacatgactg taatccaagc tactcgggag 4741 gctaaggaag gagaatcact tgaacctggg aggcggaggc tgcagtgagc tgagatcacg 4801 tcactgcact ccagcctggg agacagagcg agactccatc tcaaaaaaaa aaaaaaaaaa 4861 aaatagaagg gagtcggcag aaagccaggg aggggctggg gtgacatgct gttgaagaat 4921 gccatcccag tgggccggtg gtggtatcta ggcagggaag ggactgtccc agtaactcaa 4981 gggtctgagc tcataggacc tgacctggga cagtgactga ggatggagag aatttcaggc 5041 agaagggaca gtttttggtg agtatttgtc atattggcta ccatgcattg agcactcttc 5101 atgctaattt gttaaatctt catcataact ctatgaggga ctgtatgtgc ccagtttgca 5161 tgggagaaac agagattcca tgcaatcaag tgcctcgctg aaggttgtaa catctagagc 5221 tgggactaaa accttctcac tccacatcgc cacagagtag gaaaggcagg ggctggcggt 5281 ggcacatgcc tataatccca gccctttggg aggcggaggc aggtggatct cttgagccca 5341 ggagtttgag accagtgtgg gcaacatagt gaaaccttgt ctctacaaaa aaattagctg 5401 agcatggtgg tggtgcctgt agtcccagct actcaagggc gctgacatgg gagggttgct 5461 tgagcctggg aggtggaggt tgcagtgagc tatgatcaca ccactgcaag ccagcctggg 5521 tgacagagtg aaatcccatc tcaaaaaaaa gaaagaaagg aagaaagaaa aaggcagggg 5581 cttcggggag ggcatgggca ctggcgaatg gcagggtgga acctgaagcc atctggtttt 5641 ctaacctggg cactggggag ttggtggttt gttgactctg atggaattgg gggtcatgtt 5701 ggggaggaga catgctcatc tgtgttgagc tggaggggac atgggctatc catggtggct 5761 gtgtcctgcc cagagctagc catgggagcc tgagtccagt tggaggtagg aaagtcagaa 5821 aaaacggccg cctcggagct ggccctgaga tggtgagtgg gatttgtgat agggccaaga 5881 cgaatgaagg gaagaacttt ggggacccct gtgtctgcgg tgagggggga gatggagcct 5941 tgggtgatgg agagagggtc aggagtagag ccacagaagc cacaggaggg aagccgtgtt 6001 acaggatggg tgtacctggc tttggagtgg cctgtcccaa atcactcacc aggagagggg 6061 tgagtccccg ggtcagggca gtaaagagga ggcatgtttg tgctgtccct ggtgtagtga 6121 aactcaagaa ggaagccagg tgcagtggct cacgcctgta atcccagcac tttgggaggc 6181 caaggcaggc agatcacctg aggtcgggag tttgagacca gtctggccaa catggtgaaa 6241 ccccatctct actaaaaata caaaaattag ccgggcctgt tggtgggcgc ctgtaatccc 6301 agctactcag gaggctgagg cagaagaatc gcttgaaccc gggaggcagt gattgcagtg 6361 agtcaagaat cgcgccactg cactctagcc tgggtgacag agcaagactc catctcgaaa 6421 aaaaaaaagt ctcaatatgg ggaaagatcc actagaagta agagccatgg cttctacctc 6481 gtggcttgtg ggtgtgatac tcccaacagt ccccaaagct ggtggtcctc accgcgtgac 6541 agtgagcaga gcagctcaga gggggtcact gctcacctgg gtgcatggct gaccacagcc 6601 aggctggctc tcagtgggat gcccaaggtg ctagactctg cttagtctcc ctcgggccct 6661 gggcttgagg cattgggccc ggcccagacc tcatttcatg cactgagacc tttgttccag 6721 ggcccctcac ccctctgaag gtgttcgggc aggggcaatg tgataaggcc atgaggggtc 6781 tgcagcctcc agccccactg gggaggtggc cagtgatttc caccttcctg gcccctctgc 6841 atgcccctcc cagtggaact tcctagggtc cctgagtcag tcacttgcaa ataattatgg 6901 cgtgcccact ctgcattagg cccctctcac aacaacccag taagggggtg ctatttattt 6961 attaaagcga tttttttttt gagtctcgct ctgtcgccca ggctggagtg cggtggcgca 7021 atctcggctt actgcaagct ctgcctcccg ggttcacacc attctcctgc ctcagcctcc 7081 caagtagctg ggactacagg cgcccaccac cacacccggc taattttttt ttgtttgttt 7141 gtatttttag tacagacgag gtttcgctgt gtgagccagg atggtctcga tctcctgacc 7201 tcgtgatccg cccacctcag cctcccaaag tgctgggatt acaggcgtga gccaccgtgc 7261 ccggcaatat taaagcgatt ttaaggccaa ggctggtaac tcacgcctgt aatcccagca 7321 ctttgggagg ctgaggcagg aggactgctt gaggccagga gtttgagatc aacctaggca 7381 acatagtgag actccatctc tacaaaaaaa ttagccaggc gtggtggtgc gtacctgtag 7441 tcccagctac tcaggaggct gagatgggag gatcatttga acccaggatg tcgaagctgc 7501 agtgagctgt gatcacgcca ctgcactctg gcctgggcaa cagagcgaga cactgtctca 7561 aatttttaaa aagcgatttt acaaatgagg tgcagagttc agtcacttgc caaaagtctc 7621 acagcgcgtg aggagtagaa tcaggactcg aaccgaggca gcctggcttc agagcctaca 7681 gtgtaaccac agcttagtcc cacacctccc agaccaacag ggtccctgcc ttctagtggg 7741 caagacactc agtgaacaaa tgtagtgtca ggtattgggg gacagcactc tcaggaagtg 7801 atgtttaagg gacagaattg aagggagcag tgtttagagg atgtcggggg tagggccggt 7861 gcatgtgcaa aggccttggg gtgggaatgt gcttggcaca actgaggacc acaaagccag 7921 cgtgcgggag tgcagtcagt ggccaggggt gcatagagcc ttgtgggccc cgtggaaggt 7981 gccgttggct gtacactttt tttttttttt tttttttttt tttttgagac agagtctcgc 8041 ttttgttgcc caggctggag tgcagtggcg tgatctcagc tcactgcaac ctccgcctcc 8101 cgggttcaag cgattctcct gcctcagctt cctggtagct gggactacag gcgcccacca 8161 ccacacctgg ctaatttttg tgtttttaat agagacgggg tttcaccatg ttagccaggc 8221 tggcctcaaa ctcctgacct caagcgatct gtctacctca gcctcccaaa gtgctggggt 8281 tacaggcatg agccactgcg cacaggcagc tgtgcatctt tgaatgtcat aacctgagca 8341 tctgagagct gctcctgtcc cctggcccct gctcttgagg aagtcccacg ctgataggac 8401 agacagggtc ataagtgctg tgatgggggc ctgcaggctg ctggagggct cagccgggac 8461 cagatgctgc ccctctttgt agagtgggac aattgctgca ggcccatggg acctctggta 8521 ttagccctga gggttgtcac tccggggcct gcccctttct gtgttctgac ctcccagccc 8581 cttgcaggcc ccgcctcccg gaaggttatg accaggcttg gactggtcca ggcttccctt 8641 tggctcacat actgcctctg cgaggtcccc tccaggaagc ctcctgtgca caacccccag 8701 ggctgccgca tccctggtag catctccttg gcagctgggt gggctggccc tgggcaagga 8761 gggctgagca tgctgctggc ctgtggggtt ggagcagcgg cgggatgcaa cctccctttc 8821 ttcaggggac ctttttggcg aagacaaact gtccatagga agtcgacctc tgttcccttg 8881 ggggcagcag tggaagaggc agctgctttt gagcttgtcc ctgtccccag agaagcctga 8941 ggccttcagt gccgttgcca gggccgaggc tgaggagcct acagcgtgtg ttcaggactg 9001 agggccaggg acgggcacag gctccctgcc tggggtccaa gcctagatcg ctcgctcccc 9061 acccgcacca aagcccaggc aaagggtgct tcagccactt cctgttgcag gctcagacca 9121 agtcccctgg cacccacgcg gctgcagctc ctcctgtgcg ctgcagccac gctggcccca 9181 ccctctgcag cctccaatcc tgagcccctg agggaggatg gggaagcagc tggtctggcc 9241 acccctgccc tcccttagac ctccagagcc cccagtgtag ccacagagga tgctgttggc 9301 ttcagcccca agaagacgcc gcttcctcca gagggctaag taagtgggaa tccccctccc 9361 tacttgtcct gggctccagg cagggcccct ggtgtaaggc ctgggctgga agccgaccca 9421 cctaggtcca ggctctgggg cagaactgaa actccttggt tactgtcggc tgcagctggg 9481 agcaggccac tccaaagctg tgggtccttc caggacagtc tccccatgag gccggtcctc 9541 cacctgctgt ttcttcacac ctggtggcca gggatgtggc cctgggtaga acgatgattc 9601 tccactcctg tcattatgga agccaccgct gtctcccagc ccagccagcc acctgggctg 9661 cagagcaccc ctttcatgcc ctccgggtgc ctcccccttc tcctgcccca gcctggcttt 9721 gtcctaccct gctctcaggg aggggtaccc tggagtgggg ccagggcatg gctctccccc 9781 gagggagttc ctctctggct gtccccaggg cagctctgca cagcctcagt acctggcgca 9841 cctcccttga catccttctt agggacagtc aggcactctg tgtggggcac tcaagagagc 9901 caggcccgtc agcctctagc tcctgccaga atgcaggcct gaggggtgag gggcggggca 9961 ggggcagggg cagggacagg aactccggct tgctctccat ccgcaaaggt tcactgaggc 10021 cccgagcccc agccactgag ccaccaagtc agcctgggcc aggcctgggt gccctgtctg 10081 caatggaggc agagacgggg tctcggggca gttctgagga tgctgggtgc acagcggggg 10141 cctcgccggc aggaatcact tatgctctct cctgggccaa gctttgtgga tgcccagcct 10201 ggggccgcgg ggagctggca ggtcagtggc agacactggt gggcagacct agtgtctggt 10261 agaacaggca tcaaggaagt ggtgaccgga gggaagccaa gtgcactcaa accctcgggt 10321 gagtcatcac cgccgggtct ttcacagctg ctgaaagtga gcaacagtga tgaaggtttg 10381 tgagtttctg cgtgagcgag tgaatggacc agtagcagtt tccaggttgt ggaagagcgt 10441 tccctccccg ggatggggac acttggttac agcaattcct aatcccccac ccacccaccg 10501 cccactgcag aggtatgcgg gggccctgct tcctgcaggc aggagtgagg ggcactcctg 10561 tgatgtggca cccctgtgac cgaggtcatg tgtgatcggt gtaagggcag gaagcgagtc 10621 attggtctgc accaggcgtg ggggcttctg cgagggcagg acccaaagtc ggcctggcct 10681 cccggctgca gcactccttt ccctttcgaa ttaggttaga gccctgggac gggaggtgcc 10741 ctgtagacca cccccctcac caacttccgt cctccgcccc acccccgcgg tgatccggtg 10801 aactgccggc cccctgctgt gcaccgagtg gggcagtgac cctgacgtgg cgtctcctgc 10861 cgcccctgcc accgccacca cctccggtgg cccagcctcc gcattcccca cccccatgga 10921 ggaatgcacc aggcctccct tcctggatgc acccctcacc cacatgcttc caaaccctgg 10981 cattttctgc tcccccttta ctcccacccc ttcccctagg ctcccagaca aaggggaact 11041 ggctggatcc tcttaaaggg acagtgtccc accagcttac tgctgaactc ccctcctcaa 11101 ccccagttcc ctagttacag ttaattagca ttagcagaca gcccatgagt gatacccatg 11161 caggccccag gctgtggaga gtttcctggg taggaaacag cccttaaggt ccctcatctc 11221 atccaggtcc cagtctttcc tacctgcctc tctcctagat tgtggccctt tggagcctgg 11281 ttcttctgtc cctgtgtgac cgacacatag cacccaaaca gtggcagagc gggacggacc 11341 ccctagcctg ttctctgtgt gggtctgtac cctgacccag acatgccccc ccacagcagg 11401 acccaggggg gcacatgtgt gcctgcgggt tcactggggc acccgcattt ggtttatttt 11461 attttttaga gagagggtct tgctgtgtca cccagctgga gtgcagtggt gtaatcatag 11521 cacactgcag ccttcaactc ctgggctcaa gcgatcctcc ctccccagcc tccctagtag 11581 ctgggagtac aggacccact gtatcctggc taatttttta ataatttttt aagagatggg 11641 gtcttactgt gttgcccagg ctggcctcaa acctctggcc tcaagtgaac ctcccacctt 11701 cgcctcctga agtgctgaga ttacagcatg agccaccatg cccatcccag actgacattt 11761 ctatatttgt tcatcctggc tgggcagggc tgctggtccc cacccaccgg gatgcttggc 11821 tgggaaaaag ccgggaatgt aggtctaacc ctggcctgtg ttgtggcacc tacagcctgg 11881 cattcctccc catctgccct tcaaggcccc accaaccagg cctccttggt agcctctagt 11941 gaggaaacag gcgaaccgtg gctttgatga ccctgcacac ctggggattc tcctctattt 12001 ttctttttct tttttttttt tttggagaca gagtctcact ctgtcgccag gctggagtgc 12061 agtggcacaa ttttggctca ctgcaacctc tgcctcccag gttcaagcga ttcttctgcc 12121 tcagcctccc gagtagctgg gattacaggt gcccaccacc atgcctggct agtttttgta 12181 tttttagtgg agactgggtt ttgccatgtt ggccaggctg gtctcagact cctgacccca 12241 agtgatctgc ccacctcggc ctcccaaagt gctgggatta caggtgtgag ccaccgcttt 12301 gggaggccga ggtgggcgga tcacgaggtc aagagctcaa gaccatcctg gccaagatgg 12361 tgaaacccca tctctactaa aaatacaaaa aattagctgg gcatggtggt gtgtgcctgt 12421 agtcccagct actcaggagg ctgaggcagg aggatcactt gaacctggaa ggcagaggtt 12481 gcagtgagcc gagatcgagc cactgcactg cagcctggcg acagagcaag actccgtctc 12541 aaaaaacaaa caaaaagaaa acttgttcta attcttacaa aggtgcctgt agccgaggca 12601 gcggcccagg tgaggtggag gagggcggga gtggacgtct cagcccggcc cctctcctgc 12661 aggtgttgtg actgcagtgc ctccctgtcg caccagtact atgagaagga tgggcagctc 12721 ttctgcaaga aggactactg ggcccgctat ggcgagtcct gccatgggtg ctctgagcaa 12781 atcaccaagg gactggttat ggtgagcgcc ccctgccttg cacactcacc tggggtgggg 12841 gtatccaagc agaccccatg ctccaggtct ctctcccatc attgtctctc ctggtctcct 12901 ttttgctggt ctttggagct gctttctgag cctgactgtc tgtctgtatc cctcagcgcc 12961 cccatctatg gagccagctc tgtccaggag ctcagcagct ggccagccgg gtccctgcag 13021 ttgttttttt ggtgacaccc ttggaagagg cctaggggag gatctgtggg ggttgttggg 13081 tctgctgagc tgggctgttc cctcctcacc cccgcaccag gtggctgggg agctgaagta 13141 ccaccccgag tgtttcatct gcctcacgtg tgggaccttt atcggtgacg gggacaccta 13201 cacgctggtg gagcactcca agctgtactg gtgagtgcct tggcccctcc ctgagcctag 13261 gaggcccacc tgtgtcacag atctgcaagg gtgctgactc tcccacaccc gggcctcctg 13321 ccctttccca tggggtgagg tttgttgggg caaatgttca tatctccttt cccatcccgg 13381 catggaaaca agtgagaaat aacacacaga agtcagtgtg aaaaagcctc agacggccag 13441 gcatgctggc tcacgcctgt aaacccagca ctttgggatt ccgaggtggg tggatccctt 13501 gaggctagga gttcaagacc agcctggcca acatggtgga accccatctc tattaaaaat 13561 acaaaaatta accaggtgtg gtggcgggtg cctgtaatcc cagctactca ggaggctgag 13621 gcaggagact ctcttgaacc tgggaggtgg aagttgcagt gagccaagat tgcaccactg 13681 ccctccagcc taggcaacag agcaagactc tgtctcaaaa cagaaaacct cagacgtcag 13741 ctttcttact ggccatgact gcagcatggt gctggcacaa accaccagag gtggggtgga 13801 tgccacaagt taaggacacc atccccagca taactgctcc ctctttagac accagccaca 13861 agttcagggg tccccaaccc actcacactt ctgaccgact ggctacaaat tcagggactc 13921 ccaagaccct gccaagtttg atcgtttgct aacagactca cagaactcag gaaatcctcc 13981 atttttatcc cagttttatt atgaaggaca cagctcaggt ccgaccaaat gaagaagcat 14041 ctcccctccc tcccctagca catcaatgtg atcaccaacc aggaagcttc actgagcttc 14101 agcagccaga gtttttattg ggatttcatt acatcgtcat gactgattga gtcattggcc 14161 gtatgatcaa gcttagtctc tagcccccgt tcttggaggt caggctggat gaaagctgca 14221 accctcttca aatcacatga tgtatctttg cggggctgag tcatctcatt agtatcaact 14281 caggaatagt ctgaggggct catgaataac aaagataccc cattccaagg acttagagtc 14341 tccctcccag gaatcaggac aaaacccaga cagattcttt cttatacaac actgatcaag 14401 ctggattaga ggacaacgtg gcttgatccc agatgggctt ttaatgactt cctcctgaac 14461 tggatttatc ctcaggcctt gtcctggccg ccttacagga tcacagcgag tagacagacc 14521 cgaatgactc agagggacga gggctggctg ggcacgcaca gttcctgctc ccagttccat 14581 aggaagagtg aaagaaaaga aagctggcca ggtgcagtgg ctcaccccta taatcccagc 14641 actttgggag gccaaggcag gcagatcacc tgaggtctgg agtttgaggc cagcctggcc 14701 aacatggtga aaccgtctct actaaaaata agaaattagc caggcatggt ggtgcttgcc 14761 cgtaatccca gctactcagg aggctgaggc aggagaatcg cttgaaccca ggaggcggag 14821 gttacagtga gccaagatca caccactgca cttttggaca attgctagct ttccttttct 14881 tttgagacag agtcttgctt tgtcacccag gctggggtgc agtgttgtaa tcaacagagt 14941 gagactccat ctcaaaaaaa aaaaaaaaaa ggaagggatt gggggaagag cctggggctg 15001 ggggctgcag agatgctgaa attgatgacg cccttgacac tcttttcttc ccaccccggc 15061 ggctcttgca gcgggcactg ctactaccag actgtggtga cccccgtcat cgagcagatc 15121 ctgcctgact cccctggctc ccacctgccc cacaccgtca ccctggtgtc catcccagcc 15181 tcatctcatg gcaagcgtgg actttcagtc tccattgacc ccccgcacgg cccaccgggc 15241 tgtggcaccg agcactcaca caccgtccgc gtccaggggt gagtggccgg cctgccgagg 15301 ctgccgtcgg tgtggctatg gctgttgatg tgggtggcag agtctggcac tgggggccct 15361 gaaaatgaat gggcgagtgt ttgggtacag atggggccca gttctgacaa cctggtttgc 15421 cagatttctg gcccagtcat tcctctgaat accattacaa atgccagata caataaaaag 15481 acattttcaa ccgggcatgg tggcccacac ctgtaatctc agcacttcgg gaggccgaag 15541 tgggtggatc acctgaggtc aggagttcga gaccagcctg ggcaatgtgg tgaaaccccg 15601 tctctactaa aaatacaaac gtagccaggc atggtagtgt gtgcctatag tgccagctgc 15661 ttgggaggct gaggcaggag aatcacttga acccaggagg tggaggtttc agtgagcccc 15721 gactgccatt gcactccagg ctgggcaaca agagtgtaac tctgtatcaa aaaaataaaa 15781 ataaaaaaaa cacactcaaa aaataaaaag acattttctt tagtccatgt ctgatccaac 15841 aagaaagagg aggaaccaag tcaagaatga gtgaagaagc tgggcgcagt aactcacacc 15901 tgtaatctca gcactttggg aggccaaagt gagaggatca cttaaggcca gaagtttgag 15961 accagcttgg gcaacatagc gagacctgca tgtctacaaa aaaaaaaaaa aaaattaaaa 16021 attagccagg catggtgaaa tcactgaaca cataaaggct gggcatggtt gctcacactt 16081 ataatcgaaa cactttggga ggctgagatg ggaggatcac ttgaggccag gagttcgaaa 16141 ccagcctggg aaacattgta gtcacagcta cttgggaggc tgaggcagaa ggatctcttg 16201 agcccaggaa gtggctacag tgagctataa ttgcacgact gcactctagg ctgggcaatg 16261 gagcaaaacc ctgtctcaaa aaaatggggc agggctgata aagattagat tactgtgtga 16321 ctttgagcag ctgctttctc tctaggcttt gggggtctgt ttgaacaatg agggagttgg 16381 ataccttgga gctttctaag atttctgtgg cgcctttatt gacaccttga gaagtagcat 16441 gcagtgtttc tacttttggg caattggtca cttctttttt tttgagacag tctcactctg 16501 tcgcccagtc tggggtgcag tggtgtgata ccagctcact gcaacctcca cccacaaggt 16561 tcaagcaatt cttgcacctc agccccctga gtagctggga ctacaggtga ccacatgtgg 16621 ctaatttttg tatttttagt aaagacaggg tttcaccatg ttggccaggc tcgtttcaaa 16681 ctcctgggct caagtgatcc tcccttctcg gcctcccaaa gtgccgggat tacaggtgtg 16741 agccaccgtg cccggcccaa gtgctagctt tctctctctc tttttttttt tttcgagacg 16801 gagtctcgct ctgtcgccca ggctggagtg cagtggtgtg gtctcggctc actgcaagcc 16861 ccgcctcctg ggttcacgcc attctcctgc ctcagcctcc cgagtagctg ggactacagg 16921 cacctgccac catgcccggc taattttttt tttatattta gtagagacag ggtttcacca 16981 tattaggcag gatggtctcg atctcctgac ctcgtgatcc gcccgtctcg gcctcccaaa 17041 gtgctgcgat tacgggcatg agccaccacg cccggcccta ccaagtgcta gctttcattt 17101 gacgcagtga atgtttcttg tacacctggc aggtgcctgg cactgcatag gcactgttga 17161 gatgtgaagg tggccctggg gacagaaaat tatactgggc ttgactgtgt gtctccatcc 17221 cttgacatca gccaagccag cagctgcttt acatacatga tgagcagaca gctgcttgaa 17281 agagatgagg aaactcccag accaacggct cttaccagag ggccaaggga ggtccccaca 17341 gagtcagagg ctgcagctgg tccctgaaat ccaggcagaa ttttagaaat gaagacagtc 17401 agctgggtgc agcggctcat gcctgttatc tcagccactt cggagggctg aggtgagagg 17461 attgcttgag cccaggaggt ggaggctgca gcaagctatg atgacaccat gcattccagc 17521 ttgggcgaca gagcgagacc ctatctctaa aataaaaatg aagaagacag ttaatgacgt 17581 ctcctccctg tctgcctcac tgggtaagca ttcgcccagc caacatctgg aacatcccag 17641 ttctgcaaag agccacaccc ttcccagaaa gagcccaact tgccaaagat ttacttattt 17701 gttttaaact ggttttagtt gaccgctttt cattttgtgt atagcagcgt tttaaggaag 17761 gtctaattta tccaggccac ctgctgcttt agcaaaccaa gggagaggat gtgagattct 17821 aaggaattta catatgtatg tcatatatat atatatatat agacacacaa tttttttttg 17881 agacagggtc ttgctctgtc atacaggctg gagtgcagtg gccaatcata gctcactata 17941 gcctcagatg cctgtgctca agcaatccac tcacctcggc ctcctgagta gtgagactac 18001 aggcacacac caccacaccc agctaatttt ttaatttttt gtagagactg agtcttgctg 18061 tgtcgcccag gctagtcttg aactcctggg ctcaagcaat cctcccacat tggcttccca 18121 aagtgctagg attacaagcg tgagccacta tgcctggctt atttttaagg ttatatgcat 18181 gcaaagcctg tatcaatgaa aatattttct ttggtttttt tcaacttttc atcttcgcat 18241 tttgcagatt tatagaaaat ttgctaaaat aataagtcca ttgaatacat acacaccctt 18301 caccaaggtt caccaattcg taactgccat atttgggagt tatatgtgtg tctctctata 18361 tatacatata tggatacaga tacatataca tgtttagtga cttgtttata tttgtacata 18421 catgtacatg ttgttattta ttgatcgttt gggagtaagt tgcagggatc attgactccc 18481 ccacaattat gctagatatt ctcaaaagaa ggaccttctc tttttttttt tttttttttt 18541 ttttttggag acagggtatc actgtcattg aggctggagt gcagtgatgc gatcacagct 18601 cactgcagcc tcaacctccc aggctcaagt gatcctccca cctctgcctc ccaagtagct 18661 gggactacag gcacgggcca ccacgcctgg ctaggcattc tgttatgtaa ttatccaatt 18721 gtatcttata gttcagtgat cacattttgg aaatgtaaca ttgataccat tatctaatac 18781 acagaccata ttcaaatttt gcctattgtc tctatactga actactgaac tgtcctttat 18841 agcaatctcc ccctcatcca cagtccagtc catgatcaac attgcattta atcgtcatgt 18901 gtcatcagta tctttttttt tttttttttt gagacggaat tttgctcttg ttgcccaggt 18961 tggagcgcaa tggcgcaatc ttggcttatt gcaacctccg cctttgggct taagtgattc 19021 tcctgcctca gcctcctaag tagctgagat tacaggcgtg caccattatg catgcctaat 19081 ttttgtattt ttattagaga cggggtttta ccatgttgcc ctggctggtc ttgaactcct 19141 gacctcaaat gatccaccca cctcagcctc ccaaaatgct gggtttacag gcatgagcca 19201 ctgcgtctgg ccatttcctc agcctttcat tgcccttcat gatcttgaca tttttgaagt 19261 gtacaggcca gtcattaaag taaaatgttt ttcctttttt tttttttttt ttttaaaaag 19321 agacagggtc tcactgtgtt gcccaggctg gtctcagact cctaggctca agtgatcctc 19381 ccgcctcagc ttcccaaagt gctgggatta caggcgtgag ccatcgtacc tgccctcgca 19441 tttgggtttg actgatgttt cctcttaggg agacaggctc tgcaggtttg gcctgatact 19501 gcataagtga tcctctgtcc ttccgagtgg atcttgccag gagacatatg atgtcagtgt 19561 gcccttggct gaggatgttc actttgatta cttgtttttt ctgtactgta aggatttttt 19621 tccctttgtc atcaataaac catttgtgag atttgagtct gtaaatatcc tgttcccaaa 19681 aacccttccc caaatgattt gagcatctat tgatgattct tgcctgtagc gattattact 19741 agggtggcta ccaaatgctg aatttctaac tctgttcttc cttctgcatt tgttactgta 19801 aggaagagct tctcccccat acgagaatag tctttttgtt tgcttggttg tttttttgag 19861 atagggtctc actctgttgc ccaggctgga gtgcagtgac atgatcatag ctcactgcag 19921 cctcgacctc atgggctcaa gcgatcctcc tgcctcagcc tctcgagtag ctgggactac 19981 aggcagcacc accatgcctg gctaattttt tattttttgt aatggtgagg tctcactatt 20041 ttgctcaggc tggtctcgaa ctcctgacct caagtgatct tcccacctca gcctcccaaa 20101 tagctgggat tacaggagtg tgccaccatg ctcagctaat tttctgtaaa aaatgtcata 20161 gagatggggt cttgctatgc tgcccaggct ggtctcaaac ccctagtctc aagcaatcct 20221 cccaccttgg cctcccaaag tgctgggatt ccaggcatga gccaccacac ctggccctgt 20281 ttttcttaaa gttctcagtc tcctctctgc cttaccccca tccccttttc catctccagg 20341 acctagggca gagacaaagt gagcattccc taaaaagctt ttatgaggca aaatgaaaac 20401 cagctcacgc ctataatccc agcactttgg gaggccaagg tgggtggatt acctgaggtc 20461 aggagttcaa gaccagcctg accaacatag agaaacccca tctgtactaa aaatacaaaa 20521 ttagccaggc atggtggcac atgcctgtaa tcccagctac tcaggagcct gaggcaagag 20581 aatcacttga acctgggagg cggaagttgc aatgagccga gatcactcca ttgcactcca 20641 gcctgggcaa caagagcaaa actctgtctc aaaaaaaaaa aagaaaagaa aagaaaacca 20701 ggtccctaac accgaagagt taaaagaaat aagtaaattt ggcaaattgg tctttttgtg 20761 agttagctta taggcaactg atcgagggtc tctttcccgt cttcaccctg caattgtggc 20821 tcagggcaag ctgccagctc cctcctgcca atgcaggagc aatagagctt ggcctcctct 20881 tgcagggcga gtttgggagt cagatatgaa gccactaatc cgggaccttt ttgggaccca 20941 aggcactcat ctgccccaag cataccaggc aggccaggtg caatgactca tgtctgtaat 21001 cctagcactt tgtttttgcg acggagtctc gctctgtcca cccaggctgg agtgcagtgg 21061 cagaatcttg actcactgca acctccacct cccaggttca agcaattcct gcctcagcct 21121 cccaagtagc taggactaca ggcgcccact gccacgctcg gctaattttt gtattttcag 21181 tagagacggc gtttcaccat gttggccagg ctggtctcaa actcctgact tcaagtaatc 21241 catccacctt ggcctcccca actgttggga ttacaggtgt gagccactgc gcccggccag 21301 tcctagccct ttgggaggct aaggcgggcg gattgcatga gctcaggagt tcgagaccag 21361 cctgggaaat gtggtgtaac cccgtctcta ctaaaaatac aaaaaaaatt agctgggtgt 21421 ggtggtgtgc acctgtaatc ccagctactc aggaggctga ggtacgagaa tcgcttgaac 21481 tcaggaggca gaggctgcag tgagctgaga ttgtgccatt gcactccagc ctgggtaaca 21541 gagtgagatt ctgtctccaa aaaaaaaaaa aaaaaaaatt cgagaccaaa catacctggg 21601 atttggaagg atagatctgt tcccccaggg tggagacaat ggtccattga atgggaacag 21661 ctgagcatct tgtgtgggtg gccagtgcct acaagcgtgc cacctttctc cagctcacac 21721 ctgtggcaga catcagtaat tgattacaga attcctcccc tgaaaccaga actcggtgtt 21781 ctggccatct gctacttccc agtcacacga agtagaatcc tccacctgct caccctggat 21841 ctggtgccct tcgccttggt ttcctgttgg ggctctgagg gacaggtggg cactggcctg 21901 acccctgcct tacccacaga gtggatccgg gctgcatgag cccagatgtg aagaattcca 21961 tccacgtcgg agaccggatc ttggaaatca atggcacgcc catccgaaat gtgcccctgg 22021 acgaggtacg gtcctgagtc tgtggggcag gacgggaggt agtgccttca tgcctagccc 22081 cctccccact ccacccccat tcacatgcct gctgtcccca gattgacctg ctgattcagg 22141 aaaccagccg cctgctccag ctgaccctcg agcatgaccc tcacgataca ctgggccacg 22201 ggctggggcc tgagaccagc cccctgagct ctccggctta tactcccagc ggggaggcgg 22261 gcagctctgc ccggcagaaa cctgtcttgt aagtcagcct gctcctcggt tcagctgggt 22321 gctttcactc ctgctggggc tcaggggctg tgggacctag gtcggggagc cagccctgca 22381 caaatgcagc ccaggcttga gccagggagg tggaggctgc agtaagctgt catcacacca 22441 ctgctctcca gcttgggtga caaaacaaga cccactctca aaaaaaaaga ggaaacacac 22501 attttttaaa aagccgggga cggggccagg cgtggtggct catgcctgta atcccagcac 22561 tttgggaggc cgaggcaggt ggatcacctg aggtcaggag ttcaagacca gcctggccaa 22621 catgggaaac ctcatcttta ctgaaaatac aaaaattagc cgggcttggt ggcaggtgcc 22681 tgtagtccca gctactcagg aggctgaggc agatgaatca cttgaaccca ggagatggag 22741 gttgcagtga gccaaggtca cgccactata ctccagcctg ggcaacagtg tgagactctg 22801 tctcaaaaaa aaagaggatg acagagcagg atctgagggg ttgaggggag ctgggggctg 22861 ccactagagc caggataggc cgagacactg ggatgggcag cctttggact gtcccaggcg 22921 ggccctccca aagcaggggg tgattgcata gactggcatg gacaggggca tgcaggcagg 22981 aggaggaagg ggcagggcct tggccgggtg ctacctgtcc cccggtggca cttggcacca 23041 tgtgtgcccc ccaggaggag ctgcagcatc gacaggtctc cgggcgctgg ctcactgggc 23101 tccccggcct cccagcgcaa ggacctgggt cgctctgagt ccctccgcgt agtctgccgg 23161 ccacaccgca tcttccggcc gtcggacctc atccacgggg aggtgctggg caagggctgc 23221 ttcggccagg ctatcaaggt acagagcatg ccagggtctc aggggacagt ctgggtggga 23281 cccctccatc ctccttcctt cccagtctat ggaaacacag tggaaggggt atctggcttc 23341 cagactccct ggccagtgcc ctctcctccc ttggcctcct ggagctaatt aggaacaggg 23401 gacctcctac aggtagactg agaccttatg tgcgggaggt cattgaaagg tggctcctag 23461 ccaggcacag tagtttatcc ctgtaatccc agcaccatga gaggctaagg ctgtaggatc 23521 gcttgagccc aggaattcaa gaccagcctt gacatcatct ctacaaaaaa tttaaaaatt 23581 aattgggtat agtggtgcat gcctgtggtc ccagctactt gggaggctta ggcaggagga 23641 ttgtgagcca ggagttcaag gctgcagtga gctatgatca tgccacagca ctccagcctg 23701 ggcaatagag caagacccca tctcaaaaaa aaaaaaaaaa gacaagggat taatacatcc 23761 catccacttg ggtatttggg aacatcccat gcacagccta gagtatgaag ccatctgcac 23821 atctccctgg cagtcctggg gtggagatgg ggcttcctag aaggcgggct tacagcagag 23881 cttctgtctt cacacctctg tgtcccacac gcaggtgaca caccgtgaga caggtgaggt 23941 gatggtgatg aaggagctga tccggttcga cgaggagacc cagaggacgt tcctcaagga 24001 ggtcagtgag cggaatgccc tcttccctcc agagggactt ccaggtgctc acccctgccc 24061 catcaacaca ggtcggaaaa gggctctggg aaccattgaa agaagagcga gcaggccagg 24121 catagtggct cacgcctgta atcccaacac tttgggaggt taaggagaga ggatactttg 24181 agaccaacct gggcaacata gcaagacccc gtctctacaa aaaaatttta aattaaccga 24241 gcttggcaat gtgcacctgt catcccagct actcgggggg ctgaggtggg aggctcgctt 24301 gagcccagga gttggaggct gcaatgagcc atgatcgcac cactgcactc cagcctgggg 24361 aacaaggcaa gaccctgtgt ccaaaaaaaa taaaagtaac tgcattggtc gggcatagtg 24421 gctcacgcct gtaatcccag cactttggga ggctgagccg ggcggatcac ctgaggtcag 24481 gagttcgaga ctaccctggc caacatggca aaaccccgtc tctactaaaa atacaaaaat 24541 tagcccagca tgatggtggt gagtgcctgt catccaggct actcaggagg ctgaggcagg 24601 agaatttctt gaactcagga ggcggaggtt gcagtgagcc aagatcgtgc cgctgccctc 24661 cagcctgggc gacagagtga gactccttct caaaaaaaaa aaaaagaaaa gaaaaaagaa 24721 agtaactgca ggcaggggac tgggaaaaag agcatcgctg ggggtggggg cagctcaagc 24781 agagggcaca ggacgccaga gggtgtggca gaggcaggag aggggagctg ggggttccgt 24841 atctttgaga ccgcctacag cccctggtgg gatggaaaag ggagaagcag acccaagcac 24901 agctgggacc acacagagcc cgggcccagc ctgtttgtgc cccgccaggt gaaggtcatg 24961 cgatgcctgg aacaccccaa cgtgctcaag ttcatcgggg tgctctacaa ggacaagagg 25021 ctcaacttca tcactgagta catcaagggc ggcacgctcc ggggcatcat caagagcatg 25081 gtgagtcctg ggcagagcca gccacccccg ctgtgcggcc ccgggcaaag cagctccctc 25141 tgtgagcctc agtctcatct cttcaatggg gggaagccac aggggtctca aaggccctct 25201 gaaccctgat tcctaatcaa aaaggggagc gactgactcc atctaaagct aggaaaggcc 25261 aggtacaatg gtgcacacct gttattctgg cactttggga gcccaaggca agaggatcac 25321 tcgaggccag gaattcaagg ctgcagtgag ctgtgatctc accactgcac tccagcctgg 25381 accacacagc aagaccctat ctcaaaaact aaaataaaat tcagagcttt ccttaaggat 25441 ttgaataaaa ttacaaatcc atctttagaa ataaagtgct caggccaggt gcagtggctc 25501 atgcctataa tctcagcact ttcagaggct gaggccagca gatcacctga ggtcaggagt 25561 ccaagaccag cctggccaac atggtgaaac cccgtctcta ctaaaaatac aaaaattagc 25621 tgggcctggt ggcaggcacc tgtaatccca gcactttggg agactgaggt tggcagatca 25681 cctgaggtca ggagttcgag accatcctgg taacccgtct ctactaaaaa tacaaaaaat 25741 tagccgggca aggtggcagg tgcctgtagt cccagctact cgggagactg aagcaggaga 25801 atggcgttga acccaggggg cagagcctgc agtgagccaa gatcgcacca ctgcgctcta 25861 gcctgggtga cagcgagatt ccgtctcaaa aaaaaagcac ttggaggaag cctcacagag 25921 tcctgtgctg gaccacaccc tggggatcca gtcctggcct ccagccccat ttctgtacca 25981 ccctgagacc atgggatctt cctcaggttg gattaccttg tatccaaggt gtggacccta 26041 tgggctcctg ctaggtgtaa cttgacacaa cgggttccgt tgtcaggtgc aatttagaaa 26101 ctctgggcta ggccaagcgc agtggctcac acctgaattc ccaaactttg gaaggccgag 26161 gcaggagggt cactagaggt caggaggtca agaccagctt ggacaacata atgagatccc 26221 aatcccatct ctacaaaaaa aattaaaaaa ttagccaaat gtggtgacac atgcctgtgg 26281 ttccagctcc acaggaggct gaggcagaag gatcacttga gcacaggagg tcgaggctgc 26341 actccagcct gggtgataga gtgagaccct gtctcaataa aaaataaaga tctccaaggg 26401 gatgaggttt gagaatgagg cgtctccccc aaatgatttg agcccaaagc cccgttctcc 26461 tggcatggct cagtgctgcc actgcgcagg tgaccttgct gggcccttct acctcttacc 26521 tgtctgtgaa agtaggttct aattttttaa aaacctagaa agatgagttt tttgtttttg 26581 tttttgtttt tcccgagatg gagttttgct cttactgtcc agcctgaagt gcaatggcgt 26641 gatctcggct cactgcaacc tccacctccc aggttcaatc gattctgcct cagcctcccg 26701 agtagctggg attacaggag cccaccacca cacccggcta atttttgcgt ttttagtaga 26761 gacagggttt caccatgttg gtcaggctgg tctcaaactc ctgacctcgt gatccaacca 26821 ctctgacctc ccaaagtgtt gggattacag gcgtgagcca ccacacctga cagaaagatg 26881 agattttata gaaaataaat atagcttgtt ttctcagagg aggcagattg ggagctatag 26941 aggaatatcc ctgcttagag tttgaaatca gttctgttag gaaataatgt ttgtaggggc 27001 cgggtgcggt ggctcacgcc tgtaatgcca gcactttggg aggctgaggc aggtggatca 27061 cttgaggtta ggagtttgag aacagcctgg ccaacatggt gaaaccctgt ctctactaaa 27121 actacaaaaa ttagctgggt ttggtggtgg acacctgtaa tcccagctac ttgggaggct 27181 gaggcgagag aattgcttga ggccgggtgc agtggctcat gcctgtaatc ccaacactgg 27241 gaggccaagg tgggcagatc acctgaggta aggagttcaa gaccagcctg accaacatgg 27301 tgaaaccccg tctctactaa aaatacaaaa aattagctgg gtgtggtggc gcatgcccat 27361 agtcccagct actcaggagg ctgagacaca agaatcactt gagccccgga ggcgaaggtt 27421 gtagggagct gagatggtac cactgcactc caccctgggt gacagagtga gactccatct 27481 aaagaaaaaa aaaaaaggaa ataatgtctg tgagctgtgt tgactcatac tccttagaag 27541 cagacagttg tgggtgcccg aagaaatcgg ggtgttgggg agcccaggga ccctctagga 27601 cgcttgcctc ttcctgcctc tgtctcatgc aaccatccct gccatcgggg cccccaccgg 27661 ccccaccctg gccattcttt ctccatccca ggacagccag tacccatgga gccagagagt 27721 gagctttgcc aaggacatcg catcagggat ggtgagtgag ccgggtgctc tagctccatt 27781 cataatccca ccaggaattt gcaaacagaa cccacaaaga agctttgaaa gagggcagag 27841 ggggtcgatg ggagagtggg aagaatcgtc ccgactggcc tgattggggt gggagcagag 27901 ggagttcctg gggagccagg atgggctggg gtccctctgc acagctgccc cctgactccc 27961 gtgtccccgt ccctaggcct acctccactc catgaacatc atccaccgag acctcaactc 28021 ccacaactgc ctggtccgcg aggtgagtac cagggcccca cgtggctggg tgtcaggaga 28081 cagcaggagc ccatccaacc ccagcctcag ggccttccca gaactggagg cccctccatg 28141 ttgcctccat gacttcaatt tgaggtgggg gtggggggca gcagcccgtg gggaagagcg 28201 cagggtcagg aggcagacag acctgggttt gagtcctgtc tctgccactg actcatggtg 28261 gaccatcaga gtcccaggct ggtaggaggg tctcataaat caatgaagga gaaagtgaca 28321 tgtaagctac aaaggaccag gaccgtggtc ttcatagagc acagcccatg gcagagtggc 28381 catgggctac accagacagc accagcatct gggggccaca gagtgggggc ataggcgtat 28441 gggctggagt ggtcagggca ggcttcctga aagaggaggc ttggccagac acagtggctc 28501 acacctgtaa tcccagcact ttgggaggcc gaggcaggcg gatcacgagg tcaggagatc 28561 gagaccgtcc tggctaacat gggcactgtg gctcacacct acaatcccaa cactttggga 28621 ggccgaggtg ggtggatcac ttgaagccag gagttcaaga ccagcctggc caacatggct 28681 aacacggtga aaccccatct ctactaaaaa tataaaaaat tagccgggcg tggtggcagg 28741 tgcctgtagt cccaactact tgggaggctg aagcaggaga atggtgtgaa cccgggaggc 28801 ggaacttgca gtgagccaag atcgcgccac cgcactccag cctgggtgac agagcgagac 28861 tccatctcaa aaaaaaaaag aggaggcttt aggtggatat ttaagcaggg gacgggcagg 28921 caaagagccc agtgtctaag gattgtcaag ggaggagagc ccggttctcc accaaaagca 28981 caggagcgag taaccatgcc catctggaga ggtggtgtat tcgtgtcctg gggctgccat 29041 catgaagtac tgtgaaccag atggctcaaa acaacagaaa tgtgctgggc acagtggctc 29101 acacctaaaa tcccagcaat ttgggaggcc aaggcaggtg gattgcttga gctcaggagt 29161 ttgagaccag cctgggcaac attacgaaag cccatctctg ccaaaaatac aaaacggaat 29221 agccagccgt ggtggcataa gcctatggtc ccaactacct gggaggctga ggtgggagga 29281 tcacttgagc ctgggaggta gaggttgcag tgagccaaga ttgtgctact ctactccagc 29341 ctgggagaca gagccagacc ctgtctcaaa aaaacaaaac aaaacaaggc caggcactgt 29401 ggctcacgcc tgtaatccca gcactttggg aggccgaagt gggtggatca cttgaagcca 29461 ggagttcaag accagcctgg ccaacatggc aaaaccctgt ttctactaaa aattcaaaaa 29521 ttagcaggca tggtggcgca tgcctgtaat cccagctact cgggaggctg aggcaggaga 29581 attgcttgaa cccaggaggc agaggttgta gtgagctgag attatgccac tgcactccag 29641 cctgggtgat agagtcagac accgtctcaa aaaaaaaaaa gcatcacatg gcaagagggg 29701 ctgacaagag acccccaaac tgaccattat acagacccac tcttgtgata actaacctgg 29761 tccctcaata acccattaat ctgttaattc atacagagcc ctcatgaccc aatcacctct 29821 tacaggccct gcctcttaat accgttagag tcaggccagg catggtgaca tgggcctgta 29881 gtcccagcta gttggaaggc taggtgggag gatcccttga gtccaggagg taaatgttac 29941 agtgagctct gattgtgtca ctgcactcca gcctgggcaa cagagcgagc ccctgttttt 30001 aaaacagcaa caagccaggc acagtggctc acgcctgtaa tcccaacact ttgggagact 30061 gaggcaggca gatcacttga ggtcaggagt tcaagaccag cctcaccaac acagtgagac 30121 ccctctctac taaaaataca aaaattagct gggcgtagtg gtgggtgcct gtagtctcag 30181 ctactcatga gactgaggca gaattgcttg aacccgggag gtggaggttg ctgtgagccg 30241 agatcacgtc actgcactcc agcaacagag tgggactcca tctcaaaaaa aataaaaaat 30301 aacagagatc tgtgttggct tacacctgta atcccagcac tttgggagtc caaggtgggc 30361 agattgcttg agcccaggag tttgagacca gccaggcaac atggcaaaaa aataaaaaaa 30421 tttgtctcta caaaaaaatt aaaaaattag ctggcatggt ggtgagtatc tatagtacca 30481 gctactcagg aggtggaggt gggaggatcg cttgagcctg ggaagttgag gctgcaatga 30541 gctgtgttcg tgccactgca ctccagcctg ggccacggga gggagactct gcctcaaaaa 30601 aaaaaaaaaa aaatcaaacc cgaaaagcaa aaaacataga cctcacctgc ttattgggaa 30661 tattcaagat aaaattaggc caggcacggt ggctcacgcc tgtaatccca gcactttggg 30721 aggccgacgt gggcggatca cgaggtcagg agatcgagac catcctggct aacacggtga 30781 aaccccgtct ctactaaaaa tacaaaaaat tagctgggca tggtggcagg cgcctgtagt 30841 cccagctact tgggaggctg aggcaggaga atggcgtgaa cctgggaggc agagcttgca 30901 gtgagctgag atcgtgccac tgcacttcaa cctgggcaat agagcaagac tccaactcaa 30961 aaaaaaaaaa aaaaagataa aattgggcca ggtatggtgg cttactcctg taatcccagc 31021 actttgaaag gctgaggcag gtggaccact tgaggccaga agttgaagac cagtctgggc 31081 aacatagcaa gaccctatct caatcagtca atcaacctaa ataaatagta aatctggtgg 31141 catgccaagc acaggacctg ggtctataat caaaattcct gtcttgatgg gcacagtggc 31201 tcacacctgt aatcccagca ctttggtagg ccacagtggg tggatcacct gagatcagga 31261 gttcgaaacc tgcctagcca agtatggtga aacccgtctt tactaaaaat acaaaaatta 31321 gccaggcatg gtggcaggcg cctgtaatcc cagctactcg ggagggtgag gcaggagaat 31381 cgcttgaacc tgggaggcgg aggttgcagt gagccgagat catgccactg cgctccagcc 31441 tgggtgacag agcaagactc cgtctgaaaa aaaaaacaaa agaattcctg tcttctctcc 31501 gaaacaaagc agcatcagtg cccccgcagg tgggagggag cgcttgcagg agggagcagt 31561 gggtccgcca cgacggtctg gggagcaggt ggggaggggg cagagggtgc agcgtgtggt 31621 gggagggagg aagccacact gctatcttca ggtgcttccc gcagctccat ttgcaaagag 31681 cggatgggtt tggggaagga aggggtcccc accctgtgcc aatacagcgt atcagaggta 31741 tgttctctgg gctgtctacg ggttggcttg gggtcctggg gaggggcagg ccaagcgggc 31801 agtactagga tcgggtccca gcatgacccg gcttcacctt cccagaacaa gaatgtggtg 31861 gtggctgact tcgggctggc gcgtctcatg gtggacgaga agactcagcc tgagggcctg 31921 cggagcctca agaagccaga ccgcaagaag cgctacaccg tggtgggcaa cccctactgg 31981 atggcacctg agatgatcaa cggtgagtgg ttcagccctg cccatcatgg ccctcacggg 32041 aagccatggg ggagcccagg agagctgtaa cctcccaagc ccctggcccc tcccagcctc 32101 cttggctctt cagttaccct gtgggtcctg ttgctcctat aacacactta gtggcagcca 32161 ggcacggtgg ctcacgcctg taatcccagc actttgggag gctgaggtga gtggatcacc 32221 tgaggtcagt agttggagac cagcctagcc aacatggtga aacccccatt ctttactaaa 32281 aatacaaaaa ttagctgggc atggtggcgg gtgcctgtaa tcccagctac tagggaagct 32341 gaggcaggag aatcgcttga acctgggagg cagaggttgc agtgagccga gatcgcgcca 32401 ttgcactcca gcctgggtga cgagcgaaac tccatctcaa aaaataaata aatagaagac 32461 acttagtggc ttaaataaat gatcatacag ttctggagtc tgaagtccag cgtcagcctc 32521 accgggctga aatcaaggcg ccggtagggt gagctccttc tgcaggctcc ggggcacctg 32581 tttcctgacc ttttctggct cgtggaggct tcctcattcc tcctgttgct gccccctcct 32641 ctgtcttcag ggctggctgc aaagcatctt ctcttctctg atctctgcat ccatccccgc 32701 atctctttcc ctggctctaa ccttcctcct tttttttttt ttttttaaag agggtctcgc 32761 tctgttactc aggctggagt gcagtggtgc caccatagct cactgcagcc tcaaccttct 32821 gggctcaaac tgtcatccca ccccagcctc ctgaatagct gggaccacag gcatgcaaca 32881 ccacacccag ctaatttttt tattttttat tttttatttt tttttgagac agagtctcgc 32941 tgtgtctccc aggctagagt gcagtggcgt gatctcagct cactgcaagc tccgcctcct 33001 gggttcacgc cattctcctg cctcagcctc ccgagtagct gggactacag gcgcccgcca 33061 acacgcctgg ctaatttttt gtatttttag tagaaacggg gtttcaccgt gttagccaag 33121 atggtgtcga tctcctgacc tcgtgatccg cccgtctcgg cctcccaaag tgctgggatt 33181 acaggcgtga gccaccgcgc ctggccaatt ttttaaattt ttaatagaga cgggggtatc 33241 actatgttgc ccaggctggt ctcaaactcc tggcttcagg cgatcctcct gccttgacct 33301 ttcaaagtgc tgggattcca ggcatgagcc accatggccc tccatccttc tgatagggac 33361 ccttacggtg acattgggcc cacctggata atccaaaagc agccctccat ctcaagaccc 33421 tcaacttaat cccatctgca gagtccgatg gaaggtggga cgtatacaag tcccagggat 33481 caggacgcag tcatctttgg ggatcatagt tctgcctccc acagggtctg cttccctcag 33541 tccatttctt tgctgtcaat ggtcctatat atgcccagat tataggttat aaagtccttc 33601 tacaagcagg tgacacatga acacaggttc agggcaggca gaccccagcc atcacctcat 33661 catagttaac ctagttaaat tagcctggca tgtggcgtgg tgcctaatgc ctgtggtccc 33721 agctactcag gaagccaaag cgggagattt acttgagcca aggagatcaa ggctgcagtg 33781 agctatgatc ataccactgc cttctagcct gggcaacgga gtgagaccct gtctcaagaa 33841 aacaaaaaat aggccaggca cagtggctca cacctgtaat tccagcactt tgggaggctg 33901 aagcaggcgg attgcttgag gccaggagtt cgagaccagc ctggccaaca tggtgaaacg 33961 ctgtctctac tgaaaataca aaaattaccc gggtgtggtg gcacagctac tagggaggct 34021 gaggcaggag aatcacttga acccaggagc agaggttaca ttgggccaag attgcaccac 34081 tgcactccag cctgggcaac agaggaagac tgtgtctcaa aaagaaaaaa aaaaaaacct 34141 tcctgtaatc ccagcacttt gggaggctga ggtgggcgga tcacgaggtc aagagattga 34201 gaccatcctg gtcaacatga tgaaacccca tctctactaa aaatacaaaa aaattagctg 34261 ggcgtggttg cacgcgtctg tagtcccagc tacccgggag gctggggcag gagaatgatg 34321 tgaacccagg aggcggagct tgcagtgagc cgagatcgca ccactgtact ccagcctgac 34381 gacagagtgg gactctgtgt caaacacaca cacacacaca cacacacaca cacacacaca 34441 cacacacaca cacagagtta acatagcccg caaagaagac tataaaacag tcttagtggc 34501 cgggcgcagt ggttcacgct tgtaatccca gcactttggg aggccgaggc aggtggatca 34561 tgaggtcagg agtttgagac cagcctggcc aacacagtga aaccccatct ctactaaaaa 34621 tacaaaaatt agctggacat ggtttcgggc gcccgtaatc ccagctactc aggaggctga 34681 ggcaggagag ttgcttgaac ccaggaggca gaggcaggag agttgcttga acccaggagg 34741 cagaggttgc agtgggcgac agagcaagac tctgtctcaa aaaacaaaaa agtcttagtg 34801 tttcctatgt ttagggatta gtgtgaggat taaaggttgt aaactcattt ccacctagtt 34861 ggcattcagt aaatgagaat tgacatttag tactaattgt ttcgggtatt ttgttttttg 34921 ttttttgttt tttgtttttt ctgagaccga gtcttgctct gtcatccagg ctagaatgca 34981 tggtgcgatc tcggctcact gcaactccgc ctcccgggtt cacaccattc tcctgcctca 35041 gcctcccacg tagctgggac tacaggcgcc cgccaccacg cctggctaat tttttgtatt 35101 tttagtagag acggggtttc accatgatct cgatctcctg acctcgtgat ccacccgcct 35161 cagcctccca aagtgctggg attacaggtg tgagccaccg tgcccggcca gttttttgtt 35221 tttgagatgg agtcttgcat tgtcacccag gctggagtac agtggcgtga tctcggctca 35281 ctgcaacctc cacctcctgg gttcaagtga ttctcctgcc tcagtttccc tagtagctgg 35341 gattacaggc acctgccacc atgcctggct aatttttcta tttttagtag agatggggtt 35401 tcaccatgtt ggccaggctg atcttgaact cctgacctca ggtgatccac ccgcctcggc 35461 ctcccaaagt gctgggatta caggtgtgaa ccactgtgcc cggccatgta ccgattattt 35521 ttaacatcat taagtagctg gtatcattcc cattttacaa taaggaaact gaggctcaga 35581 gagtctgtgt cagtttcctg aggttgctgt aataaattgt tagaaacttg attatttaaa 35641 acagcagaaa atggtcaggc acagtggctc acacctgtaa tcccagcact ttgggaggcc 35701 gaggcgggca gatcactgga ggtcaggagt tcgagaccag cctggccaac atggtgaaac 35761 accatctcta ctaaaagtac aaaaattagc tgggcatggt ggcaggcgcc tgtaatccca 35821 gctactcggg aaattgaggc aggagaatcg cttgaaccca ggaggcagag gttgcagtga 35881 gccacaatcg taccactgca ctcttgcctg gacaacaaag caagactcca tctcaagata 35941 aaataaacag cagaaattta ttccctctta gttttggaag ccagaaggtt gaaatccaac 36001 agggctgcgc tccctccagg gcgatctagg ggagaatgca ttccttgcct cttccacctt 36061 ctggttgttt tgcattcctg ggcttgtggc cgcatcactc cagtctccac ccctgtcttc 36121 acagggccac ctcctcctct tctgctgtgt cttctctgtg tctctctcaa gagggcattt 36181 gcagtggcat ttggggccca cccagatcat ccagcatcat ctcatctcca gatccttaac 36241 ttaatcccat ctgcaaaaga ccctttttct gacccagtaa cattcacaga ttccagagac 36301 ctgacatggt tcccttttgg gaccagcaca gagttcatga cttgtgcaaa gtcacgcagc 36361 tgatcggtgc ctcgaactcc ttgtccaggg ctctgcccct tgctcctcag agctcccaaa 36421 ggcttgctca gacctggtgg ggttggggga aagagcctaa gcctgggttc ccatagaggt 36481 tgccggcatc tgcctcctgg gcctggacct cccggccggg gcatcctccc agctggcctg 36541 gtcccctgcc ttttggcatc cctggcaccc ccatgtgttc atctgctgac agtcggtctc 36601 tttatccagg ccgcagctat gatgagaagg tggatgtgtt ctcctttggg atcgtcctgt 36661 gcgaggtagg tccagggttg ggtagcagcg gtgttgaggc ctgggctcct ccccactcac 36721 ccaggctgca ggctcagcat ctgcaggggc ctcatgccag gaagcctgcc cacagcaagg 36781 catgggctgg cccccatggg gtactgcagt caggctgcag ccaggcccag tgccacctgc 36841 cctcaaacca cctggatggc acccagatgc ccaggctgag ggccccctgg agtaactgcc 36901 gggccttgta ctggacagat catcgggcgg gtgaacgcag accctgacta cctgccccgc 36961 accatggact ttggcctcaa cgtgcgagga ttcctggacc gctactgccc cccaaactgc 37021 cccccgagct tctaccccat caccgtgcgc tgttgcgatc tggaccccga gaagaggtga 37081 gtggggtggg gccctggcct gggagacggt ggggccgatt cccgggacag ccagacccac 37141 cgttccccac ccacctgtca cccaggccat cctttgtgaa gctggaacac tggctggaga 37201 ccctccgcat gcacctggcc ggccacctgc cactgggccc acagctggag cagctggaca 37261 gaggtttctg ggagacctac cggcgcggcg agagcggact gcctgcccac cctgaggtcc 37321 ccgactgagc cagggccact cagctgcccc tgtccccacc tctggagaat ccacccccac 37381 cagattcctc cgcgggaggt ggccctcagc tgggacagtg gggacccagg cttctcctca 37441 gagccaggcc ctgacttgcc ttctcccacc ccgtggaccg cttcccctgc cttctctctg 37501 ccgtggccca gagccggccc agctgcacac acacaccatg ctctcgccct gctgtaacct 37561 ctgtcttggc agggctgtcc cctcttgctt ctccttgcat gagctggagg gcctgtgtga 37621 gttacgcccc tttccacacg ccgctgcccc agcaaccctg ttcacgctcc acctgtctgg 37681 tccatagctc cctggaggct gggccaggag gcagcctccg aaccatgccc catataacgc 37741 ttgggtgcgt gggagggcgc acatcagggc agaggccaag ttccaggtgt ctgtgttccc 37801 aggaaccaaa tggggagtct ggggcccgtt ttccccccag ggggtgtcta ggtagcaaca 37861 ggtatcgagg actctccaaa cccccaaagc agagagaggg ctgatcccat ggggcggagg 37921 tccccagtgg ctgagcaaac agccccttct ctcgctttgg gtcttttttt tgtttctttc 37981 ttaaagccac tttagtgaga agcaggtacc aagcctcagg gtgaaggggg tcccttgagg 38041 gagcgtggag ctgcggtgcc ctggccggcg atggggagga gccggctccg gcagtgagag 38101 gataggcaca gtggaccggg caggtgtcca ccagcagctc agcccctgca gtcatctcag 38161 agccccttcc cgggcctctc ccccaaggct ccctgcccct cctcatgccc ctctgtcctc 38221 tgcgtttttt ctgtgtaatc tattttttaa gaagagtttg tattattttt tcatacggct 38281 gcagcagcag ctgccagggg cttgggattt tatttttgtg gcgggcgggg gtgggagggc 38341 cattttgtca ctttgcctca gttgagcatc taggaagtat taaaactgtg aagctttctc 38401 agtgcacttt gaacctggaa aacaatccca acaggcccgt gggaccatga cttagggagg 38461 tgggacccac ccacccccat ccaggaaccg tgacgtccaa ggaaccaaac ccagacgcag 38521 aacaataaaa taaattccgt actccccacc caggtcctgc gtggcgatgt gtgtctgggg 38581 ccctggggaa atagtcaagg taagaggagt tagtcttccc tgaccagaag acaaggatga 38641 gtgtggtggc tcatgcctgt gatcccagca ctctgggagg ctgagacagg acgatccctt 38701 aagcccagga gttcaagacc agtctggaca acatagtgag atcctgtctc tacaaaaatt 38761 tttttttaat tagttgggca gaggccaggt gtggtggctc atgcctgtaa tcccagcact 38821 ttgggaggca gaggcgggtg gatcacctga agttaggagt tcaagaccag tctggccaac 38881 atggtgaaaa ctcgtctcta ctaaaaatac aaaaattagc cgggcgtggt ggcacatgcc 38941 tgtagtccta gctacttggg agactgaggc aggagaatcg cttgaacccg aaaggcagag 39001 gttgcagtga gccgaggtgg tgccattcca ctccagcctg ggaaagagcg agactttgtc 39061 tccaaaaaaa aaaaaaaaaa aattggcagg ccaggcacag tggctcacac ctgtaatccc 39121 agccctctgg gaggccgagg caggaggatc tcctgaggtc aggagtttga gaacagcctg 39181 actgacatag tgaaacccca tctctactaa caatacaaaa ttagccaggt gtgatggcac 39241 atgcctgaaa tcccagctac ttggggggtt gaggcaggag aattgcttga acccaggagg 39301 cagaggttgc agtgagccga gatcgcacca ttgcacccca gcctgggcaa caagagcgaa 39361 actccatctc aaaaaaaaaa aaaaaaatta gttgggcatg gtggcatgca cctatagtcc 39421 cagctactca ggaggctgag gtgggaggat cctttgagcc caagagatca aggctgcagt 39481 gagccatgtt tgcaccactg cactccagcc tgggcaacaa aacaagactc tgtctcaaaa 39541 aaaaaaaaaa aaaaaaaaaa aggcagggat ggagggggga agagaacaca gcccagtttt 39601 aggtggagct gaggtggtgg cccagccagg acaagtgaag agtcttcaga ggctgggttt 39661 ggagggccgt gcatattccg gaggtactgc tttcatactt aaatgttttc ttgtaaaact 39721 cacacctgta atcccagcac tttgggaggc caaggtgggc ggatcatctg aggtcggggg 39781 ttcaagacca acctgaccaa catggagaaa ccccgtctac taaaaataca aaaaattagc 39841 caggtgtggt gacacatgcc tgtaatccca gctactcggg aggctgaggt aggagaattg 39901 cttgaacctg ggaggcggaa gttgtggtga gctgagatcg tgccattaca cttcagcctg 39961 ggcaacaaga gcaaaactcc atctcaaaca aaactaaact aaactaaact aaagggttct 40021 atcaagaaga tgggctgcac gtgatggctc acacctagac tcccagcgct tcaggaggcc 40081 gaggtggaag gatcacttga ggccaggagt tcaagatctg cctgggcaac atagcaagac 40141 cctgttttta cccaaaaaat aaaaaaatta cccagatgct gtggtgtgtg cctgtagtac 40201 cagctactga gaggctgagg caggaggacc gcttgagcct gggaggtcaa ggctgcagtg 40261 agctgtgatc gtgccactgc actccagcct gggtgacaca gcaagacctt gtctcaaaaa 40321 taaataaaac attttaaaaa cacactaggt attgcaaata cagggcattt aatttggttt 40381 tttgtttctg ttttgttgtt gttttgagac aggtctcact ctgtcaccca ggctggacag 40441 cagtggcaca gtcatggctc actgcagcct caacatccca gggttgagta atcctcccac 40501 ctcagcttct caggtagctg actatagata cacgccacta caccaagtta atttaaagaa 40561 aaaaaatgtg agaggccagg cgcagtggct cacgcctgta atcctgacac tttgggaggc 40621 cgaggcaggc ggatcacctg aggtcaggag ttcaagacca gcctggccaa catggtgaaa 40681 ccccatctct actaaaaata caaaaattag ccaggtgtgg tggcaggcac ctgtaatccc 40741 agctactcgg gaggctgtga cagaagaatc atttgaacct gggaggcgga ggttgcagtt 40801 agccgagatc acgccattgc actccagcct gggtgacaag agtgaaactg cctctcaaaa 40861 aaaaagttta gaggcaaggt ctcactttct tctctaggct ggcctcaaac tcctgggctc 40921 aagcagtctc ctgggcctcc caaagtgctg ggattacagg catgagactc catgctcagc 40981 cacatttaat acgagaattt ttttgttttg tttttttggt tttttttttt gagatggagt 41041 ctcgcactgt cacccaggct agagctcagt ggcacgatct ccgctcactg taagctctgc 41101 cttccgggtt cacaccattc tcctgcctca gcctcccgag tagctgggac tacaggcgcc 41161 cgccaccatg cccggctaat ttttttctat ttttagtaga gacggggttt caccatgtga 41221 accaggatag tctcgatctc ctgacctcat gatccaccca tctcggcttc ccaaagtgct 41281 gggattacag gcgtgagcca ctacacccag ccaatacaag gaaattttta catggctgtt 41341 gaaagacaga ggaaaggcca aaagtggaca cttaggtaac ccagagatga ttgcaggaga 41401 gagctaccac cctcggtggg gggattgaag gggagaggtg atcacttgag ttatctaatg 41461 ttgcataggg aagtcacctc tcaacttggt tgcttaaagt aacagggatc actcattgct 41521 catgatttct ggtttttttt tttttttttt gagacggagt ctcgctctgt cgcccaggct 41581 ggagtgcagt ggcacaatct tggctcactg caagccattc tcctgcctca gcctcccaag 41641 tagctaggac tacaggcgcc cgccaccaca cctggctaat tttttgtatt tttagtagat 41701 acagggtttc accgtgttag ccaggatggt ctcgaactcc tgacctcatg atccgcccac 41761 cttggcctcc caaagtgttg agattacagg cgtgagccac cgcgcccagc ttgatttctg 41821 tttgtcaaga atttgggagt cattttggtg gggaatttgt atgtgggggt ctctcctggg 41881 gctgcagtcc tttgagggtg taactggggc tgaagttccc ttccaagaac cctcatatgt 41941 ggctcactca catggcgggc aatttggtgc tagcagttga ttctacagag aaaaacgggc 42001 ttgagccaat gtgctacaag ccaatactat gacaccaggc ttttggtttt ttgtttttat 42061 gatttatgta tgtatttttt ttttttttga gacagaatct cattctatca ccctggctgc 42121 agtgcagtgg cacaatctcg gctcactgca agctccacct cccaggttaa agggattctc 42181 gtgcctcagc ctccctagta gctgggacta caggcgtgca ccaccatgcc tggctaattt 42241 ttgtaccttt agtagagaca gggtttcact atgttggcca gactggtctc aaactcccga 42301 cctcaagtga tccacctgcc tcagcctctc aaagtgctgg gattacaggt gcaggcaacc 42361 atgactggcc gttttttttg tttttaaagt tggggtctca ctatgttgtc ccggctggtc 42421 ttgaactcca aggctcaagt gatcctcctg cctcgacctc ccaaagtgct aggcttacag 42481 tcatgagcca ccatgcccag ctgacaccag gcttttcaga aaagaatagc tttattgcaa 42541 gtcaaccagt aaggagacag aagtctagct caaatctgtc cccctgtgct ggctttaagg 42601 cggtaatttt attaggaaag gtttaggggg tggattctga tattaggtga ttggcggaag 42661 caaaggggag gcctggaaag tgctcaggca tgcgcagttc cctcttcatg ttatctcatg 42721 gggggcatgt gcaaattccg ggggtggtta gtatgtaaca tgcactggaa attcgggctg 42781 tgacatcagc aagcttgttc tgtgcaaact gcagttggcc atattggtcc caatctattt 42841 cagccagcgt gttaatccca ccagcagatg aatttcagca tttctgcaag tcgtttcttt 42901 ttttatctgc catcctgcaa actggaaaat ttctgctagt cactggtttc tttaactctt 42961 tggggcacgg tttcactggt aggaggcctc agtttatccc atgggcctct ccatagggct 43021 acttcagagt ccccacagca gcctccagaa tgaatatccc aagaaagaaa agaaaagtgc 43081 cactaggggc cgggtgtggt ggctcacgcc tgtaatccca gcactttgga agtctgaggc 43141 aggaggatcc cttgagccca gaagttcaag ccagcctggg caatgtaggg agacgccatc 43201 tctactaaaa aaaaaaaaaa aaaagaagaa gaatttaggc cgggcgtggt ggctcacgcc 43261 tgtaatccca gcactttggg aggctgaggc aggcggatca cgaggtcagg agtttgagac 43321 cagcctggcc aagatggtga aaccctgtct ctactaaaaa tacaaaaatt agccaggcac 43381 ggtggcgggc gcctgtaatc ccagctactc aggaggctga ggcaggagaa ttgcttcaac 43441 ctgggaggcg gaggttgcag tgagccaaga tcgtgccact gtactccagc ctgggtgaca 43501 aagcaagact ccatctcaaa aaaaaaaaaa aaaaaaaaag aaagaaatta gctgggtatg 43561 gtggcacaca cctgtggtcc cagctatttg ggaggccaag gcaggaggat tggttgagcc 43621 cagaaggtca aggctacaat gagccagatt gtaccattgc actccagcct gggcaacaga 43681 gtaagacgcc atctcaaaaa aagaaaagag gccaggtgca gtggatcaca cctgtaatcc 43741 caacattgtg ggaggccaag acaggatccc ttgaggccag gagtttgaga ccagcctggc 43801 caacttggca aaaccctgtc tttaccaaaa aatacaaaaa taagctgggc gtggtggccc 43861 actcctgtaa tcccacctac ttgggaggct gaggcgggag aatcacttga acctgggagg 43921 cagaggttac agtgagccga gactgcgcta ttgcactcca gcctgagcga cagagcgaga 43981 ctccgtctca aaaaaaaaga aaaaaattac cacaagcgca gctctgggtg cattgcttat 44041 gaattaactc ctgctttgca aggagcagct ctggttcaat aaaagattgc tgtgtaacac 44101 caccagctta cccttgaatt ctttgagtga aaccaaaaac cctcccaggc taatccacaa 44161 tttgggggct tagctatatg cctgtatcgg tactaattgt cttcattatt gtagctttgt 44221 tgtaactttt gaagttgaga aatgtgagcc ttccaacttt gtttttcttt ttctagactg 44281 ttttggctat ttgaagtccc ttgaatttcc acaagaattt ttttttttta agtgccaaga 44341 tctcagctca ctgcaacctc tgcctcccag gttcaagcaa ttctcccaac ttagcctccc 44401 aagtagctgg gactagaggc atgcaccacc atgctaattt ttgtgttttt agtagagatg 44461 gggtttcacc atgttgtcca ggctggtctc aaactccttg cctcaagtga tccacccacc 44521 ntcaggctcc caaagtgctg ggattataga tgtgagccac catgcccagc ctccacatga 44581 atttttagga tgagcttgtc aatttctgaa aacaagccag ctggggattt gtttgtttag 44641 acacaagatg tcattctgtc acccagactg gagtgcagtg gcacaactcc tagctcactg 44701 cagcctggaa cccctaggct caagtgatcc tctcatctca gcctcctgag taccagggaa 44761 tacagacaca tgccaccatg ccctgctaat tttttaattt ttgtagcgac atggtctcaa 44821 actcctgccc aaccaggctg atctcttttt ttttttgaga tggactctca ctctgtcgcc 44881 caggctggag tgcagtggcg caacctcgcc tcactgcaac ctctgcctcc tgggttcaag 44941 cgattctcct ccctcagcct cccgagtagc tggtgggcat gggcgcctgc caccatgccc 45001 ggctaatttt tcatattttt agtagagatg gggtttcacc atgttggcca ggctggtctc 45061 gaactcctgg cctcaagtga tcctcctgcc tcagcctccc acagcactgg aattacaggc 45121 atgagtcact gttcccggtc cagctgagga ttttgacagg gattggttta tgtctatatg 45181 tgaactgggg agtattggaa tattgacatc gtaataatat taagtctctc aggccaggca 45241 tggtggctca cacctgtaat cccagcactt tgggagctcg aggcaggtgg atcaattgag 45301 gtcaggagtt caagaccagc ctggccaaca tggcgaaacc ccgtctctgc taaaaataca 45361 aaaattagcc aggtgtggtg gtgtgtgcct gtagttccag ctacttggga ggccgaggca 45421 agaggatcac ttgaacctgg taggcagagg tggcagtgag cctagattgc accactgcac 45481 tccagcctgg gtgaaagagc aaggctctgt ctcaaaaaaa aaaaaaaaaa aaggaagaag 45541 gaggaggagg agggggagaa ggagaagggg aaggaaggag gaaggaggaa gaagaagaaa 45601 tacctgaaac tgggtaattt tttttttgag aaaggatctt gctctgtttc ccaggctgga 45661 gtgcagtggc acaatcttgg ctcactgcaa caaccacctc ctgggttcaa gcgattctca 45721 tgcctcagcc tcctgagtag ctggaattga gatgtgcaca ccacgcccag ctaattttta 45781 tatttttagt agagacgcgg tttcatcatg ttggccaggc tggtctcaaa cccctgacct 45841 caggtgatca acccacctca gcctcccaag tgccgcaatt acaggcgtgt gagccactgc 45901 gcccggcttc aaaagtacca tttaatggct gacaattact tgccctgaaa tgtgaaacaa 45961 aattcattta ctacattgtt tttaagatag cacctgacct tcagtaatcg gaaataatga 46021 tttcctataa ataaaaacca ctgcagtgct tttagtgatt agtgtacata gagtttttcc 46081 cctggctgtg acatcatatt attaaaagca ttaagcacct ggaattcatg ctgtagttga 46141 tttataagtt acataatgta caaagctcct tttataagaa tgttttgtgg tcacaattac 46201 ttcaaaaccc aattacattc aaataatcta atagctcatg ctttggcaat tatagaagtg 46261 tgattttgac acatagaaat tttatgaggt tagcaaataa aaaacgctat aaaagaggtg 46321 aacaatggtt cctctgttta aatttagagt gcagcaatat ttaggtaata tttttcagtt 46381 aatataatca gcctagaata tagcattgta aatcatacag tgttttagaa atacggatct 46441 aaagaaggta ataccttttc caaattataa aattttggca aatcaataca gtactttgta 46501 atacaataaa actatgtttt tgttggagtc atatatgact ttaatcataa tttccactgc 46561 aaaagcacca cctaaatact aaatcaatta tgaaggcttt tcatgacagt ttataacaga 46621 gtcagttgtt ttacacaaat taatatggct tttaaaaaat tatataattt cttggccggg 46681 cacactggct catgactgta atcccagcac tttgtggggc tgagaccagc aaattgctga 46741 gctcaggagt ttgagaccag catggacaac atggcaagac cctgtctcta aaaataaaaa 46801 tgttttaaaa gctgcagagt taacacagta gagaaatcat gtgcatataa aatatgctac 46861 gtttccttct gggattggtc caaaactgct cacaaaaaac ttcaaaactc tactttaaga 46921 agttccaggc cgggcacggt ggctcacgcc tgtaatccca gcactttggg aggccgaggc 46981 aggcgaatca caaggtcagg agttcgagac cagcctggcc aacatggtga aaccccgtct 47041 ctactaaaaa tacaataaaa attagctcag catagaggcg tgcgcctgta atcccaggta 47101 ctcgggaggc tgaggcagga gagtcacttg aacctgggag gcggaggttg cagtaagcca 47161 agatcgcgct actgcactcc agcccaggcg acagagcgag actctgtctc aggaaaaaaa 47221 aaaaaaaaag aagctccaat accaaattaa agtcgttttt caagtattgg taaatcttcc 47281 ataaacaggg caacacttaa tgatcaatag atcattcgac tagggcttat gctggtggat 47341 ctcttttgtt taaagctcca aactcagctg ggcttggtgc ttcacgcctg taatcccagc 47401 actttaggag gccaaggcag gtggatcacc tgaggtcaga agttcgagac cagcctggcc 47461 aacatagtga aacccccgtc tgtactaaaa atacaaaaat tagacaggcg tggtggcaca 47521 gaaaaaaaaa gtcaattatc ctatttgggg atttaaatta tactattttt tatttttttg 47581 agacagagtt tcactctgtc acccagtctg gagtgcagtg gtacaatctt agctcactgc 47641 aacctccacc tcctgagttc aagcgattct cctgcctcag cctcccgagt agctaggatt 47701 acaggcacca gccaccacct ggctaatttt tgtatttttt gtagagacgg ggtttcacca 47761 tgttggccag gctggtctca aactcctggc ctcaagtgat ctgcctgctt cggcctccca 47821 aagtactggg attacaggag tgagccacca caccacctcg accagccttt tcctctataa 47881 atttaaaaaa aaaaaaaggc caggtgcgga ggttcatgcc cgtaatccca gcactttggg 47941 acggatcact gtaattccag ctactcagga gcctgaggca ggaggatcac ttgaacccag 48001 gagtcggagg ttgcagtgaa ccaagattgc tccactgcac tccagcctgg gcaacagagc 48061 aagactccag ctcaaaaaca aagaaaaaag aaaaaggcca ggtaaggtga cttacatctg 48121 taatcccagt actttgggaa gctgaggcag gaggattgct tgagcccagg agttcaaggc 48181 tacagtaagc tagtaagcta tgattgcacc actgtgctgc agcctgggtg acagagccag 48241 accctgtctc atgaaaaaaa aaaaaaaaaa aaaaagaaaa gaaaagaaag gaagaaaagt 48301 gccaaattgt ttctcaaagc agttctagtg atttatggtc tcacttgcag tatatcagat 48361 tcttcgttgt ccagatcttt ttaatttttt acagactaac aggtacaata cagtatctta 48421 ctgtggtact aatttgagtt tccctgattt cctctatagt tgagcatctt tacgtgttta 48481 gtggccactc atgtttcttc agatcttctg cctgccttcc tccctccctt cctcccttcc 48541 tccctccctt cctcccttcc tcccttcctc cttcccgccc tcccttcctt tttttttttt 48601 tttttttttt ttttgagacg gagtcttgct ctgtcgccca ggctggagtg cagtggcggg 48661 acctcagctc actacaagct ccacctccca agttaaatcg attatccggc ctcagcctcc 48721 tgagtagctg ggactacagg cgcccgccac cacgcccagc taattttttg tattttcagt 48781 agagacaggg tttcaccgtg ttagccagga tggtctcgat ctcctgacct catgatccgc 48841 ccacctcggc ctcccaaagt gctgggatta caggcgtgag cgtgagccac cgcgcccggc 48901 cccttccttc ttttttttta aaaagagaga cgggtgctcc ctttggcagc agatatacta 48961 aaaaagagag acgggaaggc caggcacagt ggctcacacc tgtaatccca gcactttgag 49021 aggccgaggc tggtggatca cctgaggtca gaagttcgag accagcctgg ccaacatggt 49081 gaaaccccat ctctactaaa aatacaaaat tagacgggtg tggtagtgca tgcctgtaat 49141 cccagctact caggaggctg aggcaggaga atcaatgaac ccgggaggcg aaagttgcag 49201 agagatgaga ttgtgccatt gcattccagc ctgggcaaca agagcgaaac tacgtctcaa 49261 aaaaaaaaat gcataagttt tgtgaacaaa tatttcataa ttttctctac tgaggtctta 49321 gacttttttt ttttacattt tacagaatac ttcatatctt ctttgtctct cccctttttt 49381 tttgcaatca ccttgaaaac attaagattc agatggtcct ctaattttcc tgtctcctgt 49441 tatcctttgt ggtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtttgagac 49501 agagtctcac tctgctggac aggctgcagt agagtgatgg catctcggct cgctgcaacc 49561 tccgcctcct gggctcaagt gattctcctg cttcagcctc ccgagtagct gggattatcg 49621 gcatgtgcca ccacccctag ctaatttttg tatttttagt agagacgggg tttcaccatg 49681 ttggccaggc tggtttcaaa ctcttgacct caagtgatct gcccacctca gcctcccaaa 49741 ctgctgggat tacagacgtg agccactgcg cccagcctgt tatcctttgt ttttggaagg 49801 aagcatttga aaaagagtga ctctatcttg aataggggct gggtaagatg aggctgagac 49861 ctgctgggct gcattcccag taggtgagac attcttattc acaggatgag acagaaggtt 49921 ggcaggactg gtatcacaag atacgggtca caaagaccct gctgataaaa caggatgctg 49981 acagggcaca gtggctcact cctgtaatcc cagcattctg ggaggctgag gcgggcaaat 50041 cacttgatgc caggagatca agaccagcct ggccaacatg gtgaaaccct gtctctacca 50101 aaaatacaaa aattacccag acatggtggc aggcacctgt actcccagct actcaggagg 50161 ctgaggcaag agaattgctt gaactcggga ggcagaagtt gcagtgagcc aggatcgcac 50221 cactgcactc cagacggggc aacagatcga gactccatcc caaaaaaaaa aaaaaaaaag 50281 aaaacaaaaa caggacgcag taaagaagcc agccccaaaa cccaccaacg gtgatgaaac 50341 tgacctctgg tcatcctcac tgctcattat acactaatta taatacatta ccatgctaaa 50401 agacactccc accaggacta tgacagttta caagtgccac ggcaacaccc ggaagttacc 50461 ctatatggtc taaaagaagg aagaaccctc agttctggga aatccctgcc ctttcctgga 50521 aaactcatga ataacccata cttcgtttag catagaatga agaaataact gtaagtatac 50581 tcagtcaagc agcccatgcc actgctctgc ctatggagga gtcattcttt attcctttcc 50641 tattcttttt ttttttttct ttttcgagac agagtcccac tctgttgccc aggctggagt 50701 gcagtggcac gatcttgact cactgcaacc tctgcctccc aggttcaagc aattctcctg 50761 cctcagcctt ccgagtagct ggaattacag gtatgcacca ccacacccag ctaatttttg 50821 tatttttaat agagatggag tttcaccagg ttggccaggc tggtctcgac ctcctgacct 50881 caggtgatcc acttgcctca gcctcccaaa gtgctggaat tacagacgtg agccactgcg 50941 cccggctatt cctttatttt cctgataagc ttgctttcag gtcgggtgtg atggttcaca 51001 tgtgtaatcc cagcactttg ggaggcctaa gtggcaggac tgcttgagcc cagaaattca 51061 agaccaacca gcgccacata gtgagtgaga ccatatttct attaaaaaaa aaacgaaaac 51121 aaaaaaaact tggccaacat gacgaaaacc tgcctctact aaaaaaatac aaaaattagc 51181 caggaatggt aacacatgcc tgtaatccca gctactcagg aggctgaggc aggacagtca 51241 cttgaacctg ggaggcagag cttgcaatga gctgagatca agccactgca ctcgagcctg 51301 ggtgacagag cgagactctg tctaaaaaaa aaatacaaaa taaaaaaaag aacttattta 51361 tgtaaccaaa taccacctgt tcacctgttc cccaaaaacc tgttgaaaca aaaataaata 51421 aataaatata aagaaataat ttttatttat ttattttatt atattttgag acgaagtttc 51481 actcttgtcg cccaggctgg agtgcaatgg cgtggtctca gctcactgca acctctgcct 51541 cctgggttca agcgattctc ctgcctcagc ctcccgagta gctgggacta caggcacctg 51601 ccaccacgcc tggttaattt tgtattttag tagagacagg gtttcaccat gttggtcagg 51661 ctggtctcca gctcctgacc tcaggtgatc cacccgcctt ggcctcccaa agtgctggaa 51721 ttacaggtgt gagccaccac acccagcctt taattttatt ttctatagag aggagtccca 51781 taatattacc caagctggtc tcaaactctt ggcctcaata aatcctccca cctcagcctc 51841 ctgagtagct aggactacag gagtgcacca ccatgcccag ctaatgtttt tatgttttgt 51901 agagatgagg gtctcattat gttgcccagg ctcgtcttga actcctgggc tcaagtgatc 51961 catcctcctg cctcagcttc ccaaagtgct gggattacag gtgtgagcaa acatgcccag 52021 cctaatatta ttaatacatc gtagctgtcc atatttatag ggtgcatgtg aaattttgtt 52081 acgtgcatag aagtgcgatt gtaggaacca aggaaaaaac ttctgcttca ccttctcaag 52141 gtttgctgat aaatcagctc acaaaaggca gattaattgg aaaaaggggg atacaaattg 52201 cattcacacg tatctgggga gaaccacacc acagcgtgat tacccaccac cccaaaggca 52261 ttcagacgct tatataccat cttctttttt ttttttaagt agagactggg ttttcgccat 52321 gttgccaggc tggtcttgaa ctcctgcact caagtgatct tcccctcttg gcctcccaaa 52381 gtggcgctgg gattaccgcc atgagccact gtgcctggca ctatatacat atatatagat 52441 atgtatacat atctatatct atagatatct atatatctat agatatctat atatctatat 52501 ctatatgtat acatatctat atatatagac atgtgtatat atatctatag atatctatat 52561 ctatagatat agatatacta tcttgcagat acagaaagaa taggggtttg gatcctggta 52621 aaacaggtta tggcaggggg aagaaagagg aattctattg aggggacata aaagattact 52681 gggggctagg cagagtggct catgcctgta atcctagcac tttgggaggc caaggtgggc 52741 agatcacttg aggtcaggag ttcgagacca gcctggccaa catggcgaaa ccctgtctct 52801 actcaaaaca caaaaattag ccagtcatgg tggcacatag ctgaaatccc agctactcag 52861 gaggctgagg caggagaatc acttgaaccc aggaggagga agttgcagtg agctgagatg 52921 gcatcactgc actccagcct gggtgacaga gtgagactcc atctcaataa aaataaaaat 52981 aaaaataaag cattgctggg gagaatgaat ggatttagga acagagatta acttgtacat 53041 aattctcttt ggaatttcaa tgagcctgag ggagacatta tcttgcggaa gagtctgttc 53101 aggtgtggtt ccattcttga ttttatagaa aggagaagaa aaaaaaacaa ttgttttcct 53161 tgttgagggg ggatgtctgg atcttaggca gagaaagtaa cttcaacttc atcctgtgct 53221 gtgggagaaa agacggtctt ttagacacag tttatcgtta ctgctgcttt tcctgtgttt 53281 ggcctatacc ttcctgcctc tttgaatgat gggtagacca gagtttgtga gtcaatttgt 53341 attagctgtg tgatctggag caagctactg ttgtcagagg agtttgaacc acagtgattc 53401 catcttgaat agggggtggg taaaatgagg ctgagacctg ctgatattga caggaggcag 53461 ccaattgcct aggccaatag gggcgggtcc gcggtgaaac cccacctcca acccgaagac 53521 ggtttaaagc ctgaaactga aggtacaagt ttaaacctta gaccggattg agagcttacc 53581 ttcctgtttg tcgcgctttc ctctgattga tccccaccct tcgcctattt tacatatacc 53641 caccctttcc taattggttt tctactcttt cttttttttt ttgacagagt ctcgctctgt 53701 cacccaggct ggagtgcagt ggtgcaatct cggctcactg caatctccac cccccgggtt 53761 catgtcattc tcctgcctca gcctccccag tagctgggac tacaggcgcc tgctaccacg 53821 cccggctaat tttttgtatt ttttagtaga gatgaggttt caccgtgtta gccaggatgg 53881 cctcaatctc ctgaccttgt gatctgcccg ccttggcctc ccaaagtgct ggcattacag 53941 gcatgagcca ccgtgcccgg cggttttcta ctctttcatg accacctttg agtagtgtct 54001 ttgctttaac tcacctcatt agcataaact ccagtgtgat caaaaggact cattataaat 54061 aacaaaagac attcctccaa ctcctggact taagggatcc ctcaagcaag cctcagcctc 54121 ctgaatagct gggactactc ctttttgcat actcacaagc caatcagcac acactcccca 54181 ccctgtgcct ataaaggctc cagactcagt cagcagggga aaagacgacc tgacttcggg 54241 gaaggcaacc tgcacttccc atcccctctc cagctcccct ctccactgag agtcgctttc 54301 attgctcaat aaaattctcc accttcatca tccttcaatc gtccgtgtaa cttcattctt 54361 cctggatgct ggacaagagc ttgggaccca gtgagtgagg atacccagaa aggctgtcac 54421 actgggcctt tgccctcgcc tgtgaagggc agctgtcccc atgtgatgag gcaaggggcc 54481 agctgatctg ctgacatgtc accatctgtg gacagcagaa ctaaaggagc actgtaataa 54541 caccctctct gcagcttcgg ggacacgggc accctcacct aggtgctgct gctttcccct 54601 caaggtgacg tgcctgctct ggccatgggc cctgcataca gcttgctcct gtgttggtgc 54661 ctggagaagc cagctggcca gatcccacac ttagtcactt gtgtgctccc tcctgcaagg 54721 ggttgagcac agggggctga gtagatgggg catcccttcc atgagtccag cgaaggtgcc 54781 tagaaaaacc ctgcatcacc actgagctac tttcccagga ggtgaggcat tcccagtcac 54841 aggatgacac aggagggtgg cacaagacat aggtgacaaa aaaccttgct gataaaacag 54901 gttgcagcaa agaagccggc caaaacccac caaaaccaag gtggcgatga aagtgacctc 54961 tggtaggctg ggtgcggtgg ctcaacgcct ataatcccag cactttggga ggcccaggcg 55021 ggcggatcac ctgaggccag gagtttgaga ccagcctgac caacatggag aaactccgtc 55081 tctactaaaa atacaaaaaa ttagctgggc gtggtggcac atgcctgtaa tcccagctac 55141 tcaggaggct gaggcaggag aattgcttga acccgggagg cggaggttgc agtgagccaa 55201 gatcgtgcca ttgcattcta gcctggatga caagagtgaa actccatctc aaaaaataga 55261 aagaaagtga cctctggtcg tcctcactgc tcattatgtg ctaattataa tatattagca 55321 tgctaaagac actcccatca gtgccatgac agtttagaaa tgccgtggca acatcaggaa 55381 gttaccctat attgtctaaa aaggggagga accggccggg cgcagtggct catgcctgta 55441 atcccagcac tttgggaggc caaggcaggt ggattgcaag gtcaagagtt caagaccagc 55501 ctggccaaga tgtgaaaccc tgtctctact aaaaatacaa aaattagctg ggcatggtgg 55561 cgggcgcctg taatcccagc tactccagag gctgaggcag gagaattgct tggacccagg 55621 aggcagaggt agcagtgagc tgagattgca ccattgcact ccagcctggg tggcagagca 55681 agactctgtc tcaaaaaaaa gtggggagga accctcagtt ccaggaattg cccgtgcctt 55741 tcccagaaaa ttcatgaata atccaccctt gttgggcatg taatcaagag ataactataa 55801 aaaatatcca gccagcaacc ttaggggatg ctctgcctat ggagtagaca ttctttgttc 55861 ctttactttc tttttttttt tttttttttg tgagatggag tctcactttg tcatccaggc 55921 tggagtgcag tggtgcaatc ttggctcact gcaacctcta cctccccagc tcaagcgatt 55981 ctcctgcctc agcctcccaa gtagctggga ttacaggcgt atgtcaccac gcccagctag 56041 tttttgtatt ttttagtgga gacagggttt caccatgttg gctagtctgg tcttgaactc 56101 ccaatctcaa atgatccgcc caccttggcc tatcaaagtg ctgggattac aggtgtgagc 56161 cactgtgccc agcctattcc tttactttct taatatactt gcttccactt tactccatgg 56221 actcgcctgg aattgtttct tgcgtgagat tcaagaactc tctcttggct gggtgtggtg 56281 gctcacgcct gtaatcccag cactttggga ggccgaggca ggtggatcat gaggtcagga 56341 gtttgagacc agcctgacca acatggggaa accctgtctc tactaaaaat acaagaaaat 56401 tagccgggcg tggtggcacg tgcctgtaat cccagctact caggaggctg aggcaggaga 56461 atcacttgaa cccgggaggc agagggcgcc actgcagtcc agcttgggca atagagtgag 56521 accctgtctc aaaaaaaaaa aaaaaaaaaa aaagattaaa aaagaaccct ctcttggggt 56581 cttgattggg actcctttcc agtaacagtg tgaaagaaaa ataaaatcac cagaccccaa 56641 actcactatg tcaaagggca aaaagctaag cttaggaact gagtcataca ggaaactgca 56701 ttttcttttg ttcctaacca gatagctgca agattgaatg ccacgtatct ccacaggtgg 56761 cttccctcac cctgaccatg taaattcagc ttaccttcac aggtacagga caaataaaaa 56821 aatagaaatc tggccaggca tgggggctca cacctgtaat tccaacactt tgggaggctg 56881 gggtgagaga attgctggag ctcaggggtt ggagatcacc ctgggcaacc cagtgagagg 56941 ctgtctctac ggaaaagatt ttaaattagc ctggtgtggt agtgcacacc tgtagtacca 57001 gctactcagg aggctgcatt gggagtattg cttaagctca ggaggtcgag gctctagtga 57061 ggtgtgatcg caccgctgca ctccaacctg agcaacagaa taagaccctg tctcaaaaaa 57121 aaaaaaaaaa aaaaaaaatc atggccgggc gtggtggctc acggctgtaa tcccaacact 57181 ttgagaggcc aagggatcac ctgaggtcac gagttcgtga ccagcctgac caacatggtg 57241 aaaccccgtc tctactatag acaaaaaatt agacaggcat ggtggcacat gcctgtaatt 57301 ccagctactt gggaagctga ggcaagagaa tcacttgagc tgaggcggca gaggttgcag 57361 tgagccaaga ttgcaccatt gcattccagc ctgggccaca agagtgaaac tctgtctcaa 57421 aaaaataaca ataatttttt tttttttttg aggtggagtc ttgccctgtc acccaggctg 57481 gaatgcagtg gcacgacctt ggctcactgc aagctccgcc tcccgggttc acgccattct 57541 cctgccccag cctcccgagt agctgggact acaggcgcct gccaccacgc ccggctaaat 57601 gttttgtatt tttagtagag acagggtttc accatgttag ccaggatggt ctcaatctcc 57661 tgatctcatc atccgtccgc ctaggcctcc caaagtgctg ggattacagg tgtgagccac 57721 cgcgtccggc caatattttt ctttttttta aatcatactt ccaggtccng gtgcggtggc 57781 tcacacctgt aatcccagcn ctttaggagg ctgaggtagg cagatcacaa ggtcaggagt 57841 tcgagaccag cctggctaac atggtgaaac cctgtctgta ctaaaaacta caaaaattag 57901 ctgggcgtgg tggcacacac ctgtaatgct agctactcag gaggctgagg caggagaatt 57961 gcttgagccc gggaggcgga ggttgcagtg agctgagatc acactactgc actcctgcct 58021 gggggacaaa gtgagactct gtctcagaaa aaaataataa taataaatca tacttacccc 58081 caccctaaga caaaagcata attgacttct tcctctactc tgtgtttact ttatcttgtg 58141 taaaatacag atatatttag cacaagatga attcataata gactgttcct ttttccctcc 58201 tttcacatgt gttaaaagaa aaacttcagc caaattaaat ttaagggagt ttaattgagc 58261 aatgaacaat ttgtgaatcg ggcagccccc agaatcacag ccgattcaga cagactccag 58321 tgcagccatg tgatggaaga agatttatag acaaagggaa atgacataca gaagtcagtg 58381 aggtacaaaa acaactggat tggctacagg tcggcatttg ccttatttga atatggctca 58441 aacagttggc tacatctgac tggccaaaac tcagtgattg gcacagggtg tgggctatgg 58501 ccgagttata cctccgcttg ttacagttca caatgtacag aaaaaccttt aggccaaatt 58561 gaaatatgta aagaagcagc tttaggctaa acttgattaa cgtatgtaag atgtggattc 58621 agtgatcatg aatgaaagcc tcacagaaag tgaccactta tttcactacc ttccctagtg 58681 tttttgttgt tgttgtttgt tttgttttgt tttgtttttt gagatggagt ctcactatat 58741 catccaggct ggagtgcagt gaagcgatct tggctcactg caagctccgc ctcccgggtt 58801 cacgccattc tcctgcctca gcctcctgag tagctgggac tacaggcgtc cgtgaccacg 58861 cccggctaat ttttttgtat ttttagtaca gacggggttt cactccgtgt tagccaggat 58921 ggtctcgatc tcctgacctc gtgatctgcc cacctcggcc tcccaatgtg ctgggattac 58981 aggcgtgagc caccgcaccc ggccaccttc cctccttttt catttctttc ctccttcccc 59041 tcctgcccac tctttctcct ttaaatattg aagtcctcaa aactctctgg aaaagccatg 59101 ggtcacagat ttttctttgg cttgggtctc tttttcctgg gcatgtcctc aaccttagca 59161 aaataaacct ctaaattcat tgagtcccct cctctcccct cccctcctct tcccttccct 59221 tcccttcccc tttctttgag acagggtctc actctgtcat ccaggccagg gtacagtggt 59281 gcaaatgata gggacaagag gcagggaaat tctgggcaga agagggtggg tccccagaga 59341 gggcattgcc ctcaagctga aaaacctgga actgcagccc aaagtgagaa ctgacatccc 59401 tgttttttgt tttttggttt tttttgagat ggagtctccc cttctgtcac ccaggctgga 59461 gtacaatggt gcgattttgg ctcactgcaa cctccacctc ccgggttcaa gtgattctcc 59521 tgcctcagcc tcccgagtaa tccgagccgg gattacaggc acacaccacc acacccggct 59581 aatttttgta tttttattag agaaggggtt tcactatctt ggccaggctg gtgttgaact 59641 cctgatttcg tgatccaccc tccttgcctc ccaaagtgct gggattacag gcatgagcca 59701 ccgtgcccag ccaacatcgc tgctttcctg cttgaatgtt gccttttcca aaaccaccct 59761 tgacctgccc tgcccccaat cctgtgccca taaaaacccc aggcccagct agcagagaga 59821 ggagaagcag ctggacgtca aagaccatgg ttgaacattg gagagaagtg gcttgacttc 59881 agagggacag tttgctggag tagctttgga ggagtatggc cagggacagc tggacttcag 59941 agaaagatta ccttcctgct ctgtcccctt ttcagctccc cttcccgctt agagccactt 60001 tcatcagcaa taaagtctcc tgcatttacc atcttcaatt catttgtgtg acctaattcc 60061 tcctggacac tgaaaaagaa cttgggtgcc acgagtgtgg atgcaaaagg ctgtcacacc 60121 gatcctccac taagctgtta acacttaagc cattcacaga cagcagagct aaaagagtac 60181 tctaacactg cctctggggc ttcaatagtc tccggcaccc tccgctagac actatcatgg 60241 ggctggtatg gagatggctc ttgctggcgc ctaaaaactc tcgccccgtc tcctgcacct 60301 gctcacctgt gctccctctc ctgtgagggg tggagtagtg agtgagtgga gttcacccct 60361 accagcacca aagcagctgg ctagttctta ggcaacatcc tgcttcacaa tcacagctca 60421 ctgcaacctc ccacctccca ggctcaagtg ttcctcctgc ctcagcctcc caaagtgctg 60481 ggattgcagg catgagccac catgcccagc cagtcatttt ctttggttta cactacttta 60541 cctccctgag ccttattttc cccaaatgag aggtagaaac tcctctgttg ggaggattaa 60601 atgagatatg tctcaaattt ttgttgaaaa ctggacattt tattttatct tattttactt 60661 atttttgaga caaggtctca ctcactctgt cactcaggct agagtgcagt ggtgcaatct 60721 tggctccctg aaagcttaac ctcctgggct caagtgatcc tcctgtctca gcctcctgag 60781 cagctgggac tataggctcc agccaccaca cttggctaat ttatttttat ttttattttt 60841 tgtagagaca gagtctcact atgttgccta ggatggtctg gaactcctgg gctcaagtgg 60901 ttctcctgac tcggccccac aaagtgctgg cattacaggt gtgagccatg gcacccagca 60961 aaaactggac attttaaatc atgtattgta attctaaatt ctgatgtcct ggtggtagct 61021 gttgtagatt ttgacattgt tgttgtttgc tggttgtctg tttggttgtt taataacttg 61081 aagccactaa aggaagcctc tgttttgttt tgtgattctt gcttttattt tcaagactgg 61141 cttcctaggg gtccatctct gaatcagcat tgcttagtgc ccagccactg tttggtcaga 61201 aggtttccgt aaacaccttg acacactaag ccttccttgg tcaagaggac ctgtgagggg 61261 ggttgggaca caggttaaat tatttcctca agggcgttga catttctttc ttttttcttt 61321 tttttttgag atggagtctg tctctatcac tcaggctgga gtgcagtagc atgatcttgg 61381 ctcactgcaa cctctacctc ccaggttcaa gcgattctcc tgcctcagcc tcccgagtag 61441 ctgggattac aggcgcccgc caccacaccc aactaatttt tgtattttag tagagatggg 61501 gtttcaccac catgttggcc aggctggtct ggaaccctcg acttcaagtg atccacctgc 61561 ctcagcctcc cagagtttgg gattacaggt gtgagccacc acacctggcc tctttttttc 61621 ttttcttttc tttttttttt tttttgagat ggagtttcgc tcttgttgcc caggctggag 61681 ggcaatggca tgatctcggc tcactgcaac ctctggctcc cgggtacgag caattctcct 61741 gcctcagcct cccaagtagc tgggactata gacatgcgcc acacgcctaa ttgtttgtat 61801 ttttagtaga gatggggttt caccatgttg accaggcagg tctcgaactc ctgacctcag 61861 gagatctgct cacctcagcc tcccacaggt atgagccacc atgctcagct ttattttgtt 61921 ttattttatt ttattttatt ttattttatt ttatttgaga cagagtctcg ctctgtcgcc 61981 caggctggag tccagtggag ctatctcggc tcactgcaac ctctgcctct caggttcaag 62041 caattctcat gtctcagtct ctcaagtagc tgggattaca ggtgtgtgcc accacgccca 62101 gataattttt ttattattag ttttagtaga gtcggggttt tgccatgttg cccagcctgg 62161 tcttgtactc ctgacctcaa gatatccacc cgcctcggcc tcccaaagtg ctgggattat 62221 aggcatgagc caccataccc ggcctctttt tttaattttt atggatatgt ggtaggtata 62281 tgtatttatg aggtacatga gatattttga tacaggcata caatgcatca taatcacatc 62341 agagtaaatg gggtatccat catctcaaac atttatcatt tctttgttac aaacattcca 62401 attatgctct tctagttatt tttaattgca taataaatta ttgttgactg cccaggcaca 62461 gtggctcacg cctgtaatcc cagcactttg ggaggccgag gcaggtggat tgcctgaagt 62521 caggagttca agaccagcct gaacaacatg gagaaatccc gtctctacta aaaatacaaa 62581 attagccagg tgcagtggcg catgcctgta atcccagcta cttgggagga tgaggtagga 62641 gaatctcttg aacccaggag acagaggttg cggtgagccg agatcgcacc attgcattcc 62701 agcctgggcg acattttgta tgacattgct taaccataaa ctcttcattt gcttttgttt 62761 ttcttttctt tttttttgag acggagtctc gctctgttgc ccacgggttc caccgtgtta 62821 gccaggatgg tctcgatctc ttgaccttgt gatccgccag cctcggcctc ccaaagtgct 62881 gggattacag gtatgagcca ccacccacgg cctgtttttc attttattgt ctgagaatcc 62941 cttgcagcct gggggcatag attcggggaa ttctcccact cctcactttc ttttcttcct 63001 taggaatatc ttggccaggt gcagtggctt acacctgaaa tcccagaact ttggcaagct 63061 aaggcaggag gaatgcttga ggtcaggagt ttgagacccg cctggggaac aaagtgagat 63121 cctatctcta tttaaaaaat aagaataatg gccagtcttg ggggatcact cctgtaatcc 63181 cagaactttg gaaggcagag gtgggaggat cacttgaacc cacaaggttg aggctgcagt 63241 gagacgagat tgttctgcca cactccagcc tgggtggcag agtgagaccc tgtctcaaaa 63301 caacaacaac aattaaaaaa aaaaaaaaaa gaatatcttt atttctgact tgggggcttg 63361 caggtggctg aactatttct gtggaatgat ctggaaaccc acacatatgt gaagccaggt 63421 cagggctttg aattctttga attatcaggc tgaggcaggc aagtttgtca ctcctcaagg 63481 tagatgaact catgatctcc agtctaccct ttcacagact gtgtggcttt tcaaggatca 63541 catttcaaag ggatctcagg cacaatttcc atttgaactg ggtccagata caatttccat 63601 ttgaactgga cctcaatgta gtagtctctc attgtttgaa gtatcactcg gagttctttg 63661 tctcacaacc atgaaaatta aggagcatgg gcaccaagga tgaggctgga gtgaaagttt 63721 aataagctaa agaagaaagc tctctgccgt ggagaggggg tctgaaagag gccattatta 63781 tttatttatt tatttgagac agagtttcac tcttgttgcc caggctggag tgcaatggca 63841 tgatctcggc tcaccacaac ctccacctcc cgggttcaag tgattctcct gcctcagcct 63901 cctgagtagc tgggattata ggcatgcacc accacaccca gttaattttg tattttcagt 63961 agagacgggg tttctccatg ttggtcaggc tagtgtcgaa ctctcctcag gagatccacc 64021 cacctcggcc tcccaaagta ctgggattac aggcatgagc caccgtgccc agccaaaaga 64081 ggccattttt acagttgaat gcaaaagctt ttataagaaa ccaatgaggg ctgggcattt 64141 catttacata aggtgtgaat ttctcctatc tccaccccat ccttctaatg cgcatggggg 64201 cccttagctt aatttactcc atattgcttt aatttttttt taaattagcc atattttgca 64261 aaaaaaaaaa aaaaagtgca tacatcctat aatgtcctat tttatctagt aactctagcc 64321 tagggcctca tctcctgacc tgacacgggc attaaagcaa gctcctggcc actgaccctc 64381 agtgaccatt cagagcagag acgtgatcaa ttcattgcct atcatctgtg gcgtttagtt 64441 tcctctttgt ttctggattc ctaggatttc cctttctttc atgggagctc aactgggcat 64501 tgaaaataat tttttttaat tgtattaaac atttcaaaga gtttcaatag gaaggttttc 64561 tggttctccc tgcctggcaa atcagaaaca tatggagagg tttttcagta catgtttcat 64621 agcccttctt tctctgccaa aattctgata tagccccctg gagaacaaca aaatctggat 64681 ggagtttggg ccagaattgg ggtggggtat agattggctc ctatgtgctt ggaaaataac 64741 tcacaaccca ctttcccagt gttgattcaa ttctttgtgt cttagacatt ttttctcatt 64801 ttgttttgtt tgagacaggg tctcgctctg tcacccaggc tggagtacag tggcacaatc 64861 ttagctcact gtagtcttgg cacccccggg ctcaagccat cctcctgcct cagcctccca 64921 catagctggg actacagatg cgcaccacca tgcccggcta agtctttttt tttttttttt 64981 ttttttttga gacggagtct cgctctgtca cccaggctgg agtgcagtgg cgtgatctcg 65041 gctcactgca agctccgcct cccaggttca cgccattctc ctgcctcagc ctccagagta 65101 gctgctggga ctacaggtgc ccactaccac acccgactaa ttttttgtat ttttagtaga 65161 gatggggttt caccatgttg gccaggatgg tctcgatctc ttgacctcgt gatccacccg 65221 cctcggcctc ccaaagtgct gggattacag gcgtgagcca ccacgcccgg ccaatttttt 65281 gtatttttag tacagacagg gtttcaccat gttagccagg ttggtcttga tctcccgacc 65341 ttgtgatccg cccgtcttgg cctcccaaag tgctgggatt acaggtgtga gccagcacgc 65401 ccggccctgg ctaagtctta gacttttgtt tccccaacgt ctaacacagt ttcatggccc 65461 atagaagata ctgagtgcat gaatgaggaa tgcacgaatg actcttggca gacacttcgt 65521 ggtcagcata aaagagggag aaagctggct gggcaaagtg gctcacacct gcaatcccag 65581 cactttggga ggccgaggcc agtggatc // LOCUS HSU62392 1574 bp mRNA PRI 09-AUG-1997 DEFINITION Homo sapiens zinc finger protein mRNA, complete cds. ACCESSION U62392 NID g2315851 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1574) AUTHORS Lee,P.L., Gelbart,T., West,C., Adams,M., Blackstone,R. and Beutler,E. TITLE Three genes encoding zinc finger proteins on human chromosome 6p21.3: members of a new subclass of the Kruppel gene family containing the conserved SCAN box domain JOURNAL Genomics 43 (2), 191-201 (1997) MEDLINE 97386587 REFERENCE 2 (bases 1 to 1574) AUTHORS Beutler,E., Lee,P.L., Gelbart,T. and West,C. TITLE Direct Submission JOURNAL Submitted (26-JUN-1996) Molecular and Experimental Medicine, Scripps Research Institute, 10550 North Torrey Pines Road, SBR3, La Jolla, CA 92037, USA FEATURES Location/Qualifiers source 1..1574 /organism="Homo sapiens" /note="PRD51 was originally identified by selective hybridization of human microsatellites D6S306, D6S1001, D6S105, D6S464 and D6S1260" /db_xref="taxon:9606" /chromosome="6" /map="6p21.3" /tissue_type="duodenum" /clone="PRD51" CDS 102..1286 /note="five zinc finger motifs; candidate for the hemochromatosis gene which has been shown to be in linkage disequilibrium with D6S105 and D6S1260" /codon_start=1 /product="zinc finger protein" /db_xref="PID:g2315852" /translation="MNTNSKEVLSLGVQVPEAWEELLTMKVEAKSHLQWQESRLKRSN PLAREIFRRHFRQLCYQETPGPREALTRLQELCYQWLRPHVSTKEQILDLLVLEQFLS ILPKELQGWVREHCPESGEEAVILLEDLERELDEPQHEMVAHRHRQEVLCKEMVPLAE QTPLTLQSQPKEPQLTCDSAQKCHSIGETDEVTKTEDRELVLRKDCPKIVEPHGKMFN EQTWEVSQQDPSHGEVGEHKDRIERQWGNLLGEGQHKCDECGKSFTQSSGLIRHQRIH TGERPYECNECGKAFSRSSGLFNHRGIHNIQKRYHCKECGKVFSQSAGLIQHQRIHKG EKPYQCSQCSKSYSRRSFLIEHQRSHTGERPHQCIECGKSFNRHCNLIRHQKIHTVAE LV" BASE COUNT 449 a 341 c 404 g 380 t ORIGIN 1 ctctccctcc ttgcgcgttc cgggtctcgc aagcgcctcc aaggtttgtc ttgaagcata 61 gctccagctg gagggtacct tttaagctgt tcaaggtcaa gatgaataca aactcaaagg 121 aggttttatc cctgggtgtt caagttcccg aggcatggga agaacttctg acaatgaaag 181 tggaagcaaa aagtcacctt caatggcagg aatccagact gaaacgcagt aatccactgg 241 caagggaaat cttccgaagg cactttcgac agctgtgcta ccaagagacc cctggaccaa 301 gggaggctct tactcgactc caggaacttt gctaccagtg gttgaggcca catgtgagca 361 caaaggagca gattttggat ctgctggtgc tggagcagtt tctatccatt ctgcccaagg 421 agctccaggg ctgggtgagg gaacactgtc cagagagtgg agaagaggct gtgattttgc 481 tggaggatct ggagagagag ctcgatgaac cacaacatga gatggtggcc cacagacaca 541 gacaagaagt cctctgtaaa gagatggtgc ctctagcaga gcagacacca ctgacccttc 601 agtcccagcc taaggagcca cagctcacat gtgactctgc tcagaagtgc cattctattg 661 gagagacaga tgaagtaacc aagactgagg acagagagtt ggtgctaagg aaagactgtc 721 ctaagatagt ggaaccacat gggaaaatgt ttaatgagca gacctgggag gtatcacagc 781 aggatccctc acatggagaa gttggtgaac ataaggatag gatagagagg cagtggggaa 841 acctcttagg agaggggcaa cacaaatgtg atgaatgtgg gaagagcttt actcagagct 901 caggtctcat tcgacatcaa agaattcata ctggagaaag accttatgaa tgtaatgaat 961 gtgggaaagc cttcagtcga agttctggtc tttttaatca ccgaggaatc cacaatatac 1021 agaaacggta ccactgcaag gagtgtggga aggtcttcag tcagagtgcg ggtcttatcc 1081 agcatcagag aatccacaaa ggagaaaagc cgtatcagtg cagccagtgc agtaagagct 1141 acagtcggcg ttcatttctc attgaacatc agagaagcca cacaggggag cgacctcacc 1201 agtgcattga atgtgggaaa agctttaatc gacactgcaa cctcattcgc catcagaaga 1261 tccacacagt ggctgagctg gtctagggct tggctatgag caagttttcc agatcaccac 1321 ccaagttgtg tggggcaggt tgagactaga aaatgcctct ttcttccttt ctccatgaaa 1381 tgtgtttgaa acaaatcctg acttaaggcc cagggacttc cttaaaggaa agttgggtgt 1441 ttgaagctac tgttttctct tttgttcact ttacctcttt cttactctta ctagctgtgt 1501 ccctcttatt tataatttat ttattttttt gagatggctg ctaaaccctt ctaataatat 1561 aataaatggc actg // LOCUS HSU62431 2664 bp mRNA PRI 11-JAN-1997 DEFINITION Human nicotinic acetylcholine receptor alpha2 subunit precursor, mRNA, complete cds. ACCESSION U62431 NID g1458109 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2664) AUTHORS Elliott,K.J., Ellis,S.B., Berckhan,K.J., Urrutia,A., Chavez-Noriega,L.E., Johnson,E.C., Velicelebi,G. and Harpold,M.M. TITLE Comparative structure of human neuronal alpha 2-alpha 7 and beta 2-beta 4 nicotinic acetylcholine receptor subunits and functional expression of the alpha 2, alpha 3, alpha 4, alpha 7, beta 2, and beta 4 subunits JOURNAL J. Mol. Neurosci. 7 (3), 217-228 (1996) MEDLINE 97062879 REFERENCE 2 (bases 1 to 2664) AUTHORS Elliott,K.J. TITLE Direct Submission JOURNAL Submitted (28-JUN-1996) Kathryn J. Elliott, SIBIA Neurosciences, Inc., 505 Coast Blvd. So., La Jolla, CA 92037, USA FEATURES Location/Qualifiers source 1..2664 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="Halpha2.13" /clone_lib="SIBIA lambda gt11 library (M. Williams)" /tissue_type="thalamus" 5'UTR 1..554 sig_peptide 555..632 CDS 555..2144 /codon_start=1 /product="nicotinic acetylcholine receptor alpha2 subunit precursor" /db_xref="PID:g1458110" /translation="MGPSCPVFLSFTKLSLWWLLLTPAGGEEAKRPPPRAPGDPLSSP SPTALPQGGSHTETEDRLFKHLFRGYNRWARPVPNTSDVVIVRFGLSIAQLIDVDEKN QMMTTNVWLKQEWSDYKLRWNPADFGNITSLRVPSEMIWIPDIVLYNNADGEFAVTHM TKAHLFSTGTVHWVPPAIYKSSCSIDVTFFPFDQQNCKMKFGSWTYDKAKIDLEQMEQ TVDLKDYWESGEWAIVNATGTYNSKKYDCCAEIYPDVTYAFVIRRLPLFYTINLIIPC LLISCLTVLVFYLPSDCGEKITLCISVLLSLTVFLLLITEIIPSTSLVIPLIGEYLLF TMIFVTLSIVITVFVLNVHHRSPSTHTMPHWVRGALLGCVPRWLLMNRPPPPVELCHP LRLKLSPSYHWLESNVDAEEREVVVEEEDRWACAGHVAPSVGTLCSHGHLHSGASGPK AEALLQEGELLLSPHMQKALEGVHYIADHLRSEDADSSVKEDWKYVAMVIDRIFLWLF IIVCFLGTIGLFLPPFLAGMI" mat_peptide 633..2141 /product="nicotinic acetylcholine receptor alpha2 subunit" 3'UTR 2145..2664 BASE COUNT 518 a 815 c 743 g 588 t ORIGIN 1 gagagaacag cgtgagcctg tgtgcttgtg tgctgagccc tcatcccctc ctggggccag 61 gcttgggttt cacctgcaga atcgcttgtg ctgggctgcc tgggctgtcc tcagtggcac 121 ctgcatgaag ccgttctggc tgccagagct ggacagcccc aggaaaaccc acctctctgc 181 agagcttgcc cagctgtccc cgggaagcca aatgcctctc atgtaagtct tctgctcgac 241 ggggtgtctc ctaaaccctc actcttcagc ctctgtttga ccatgaaatg aagtgactga 301 gctctattct gtacctgcca ctctatttct ggggtgactt ttgtcagctg cccagaatct 361 ccaagccagg ctggttctct gcatcctttc aatgacctgt tttcttctgt aaccacaggt 421 tcggtggtga gaggaagcct cgcagaatcc agcagaatcc tcacagaatc cagcagcagc 481 tctgctgggg acatggtcca tggtgcaacc cacagcaaag ccctgacctg acctcctgat 541 gctcaggaga agccatgggc ccctcctgtc ctgtgttcct gtccttcaca aagctcagcc 601 tgtggtggct ccttctgacc ccagcaggtg gagaggaagc taagcgccca cctcccaggg 661 ctcctggaga cccactctcc tctcccagtc ccacggcatt gccgcaggga ggctcgcata 721 ccgagactga ggaccggctc ttcaaacacc tcttccgggg ctacaaccgc tgggcgcgcc 781 cggtgcccaa cacttcagac gtggtgattg tgcgctttgg actgtccatc gctcagctca 841 tcgatgtgga tgagaagaac caaatgatga ccaccaacgt ctggctaaaa caggagtgga 901 gcgactacaa actgcgctgg aaccccgctg attttggcaa catcacatct ctcagggtcc 961 cttctgagat gatctggatc cccgacattg ttctctacaa caatgcagat ggggagtttg 1021 cagtgaccca catgaccaag gcccacctct tctccacggg cactgtgcac tgggtgcccc 1081 cggccatcta caagagctcc tgcagcatcg acgtcacctt cttccccttc gaccagcaga 1141 actgcaagat gaagtttggc tcctggactt atgacaaggc caagatcgac ctggagcaga 1201 tggagcagac tgtggacctg aaggactact gggagagcgg cgagtgggcc atcgtcaatg 1261 ccacgggcac ctacaacagc aagaagtacg actgctgcgc cgagatctac cccgacgtca 1321 cctacgcctt cgtcatccgg cggctgccgc tcttctacac catcaacctc atcatcccct 1381 gcctgctcat ctcctgcctc actgtgctgg tcttctacct gccctccgac tgcggcgaga 1441 agatcacgct gtgcatttcg gtgctgctgt cactcaccgt cttcctgctg ctcatcactg 1501 agatcatccc gtccacctcg ctggtcatcc cgctcatcgg cgagtacctg ctgttcacca 1561 tgatcttcgt caccctgtcc atcgtcatca ccgtcttcgt gctcaatgtg caccaccgct 1621 cccccagcac ccacaccatg ccccactggg tgcggggggc ccttctgggc tgtgtgcccc 1681 ggtggcttct gatgaaccgg cccccaccac ccgtggagct ctgccacccc ctacgcctga 1741 agctcagccc ctcttatcac tggctggaga gcaacgtgga tgccgaggag agggaggtgg 1801 tggtggagga ggaggacaga tgggcatgtg caggtcatgt ggccccctct gtgggcaccc 1861 tctgcagcca cggccacctg cactctgggg cctcaggtcc caaggctgag gctctgctgc 1921 aggagggtga gctgctgcta tcaccccaca tgcagaaggc actggaaggt gtgcactaca 1981 ttgccgacca cctgcggtct gaggatgctg actcttcggt gaaggaggac tggaagtatg 2041 ttgccatggt catcgacagg atcttcctct ggctgtttat catcgtctgc ttcctgggga 2101 ccatcggcct ctttctgcct ccgttcctag ctggaatgat ctgactgcac ctccctcgag 2161 ctggctccca gggcaaaggg gagggttctt ggatgtggaa gggctttgaa caatgtttag 2221 atttggagat gagcccaaag tgccagggag aacagccagg tgaggtggga ggttggagag 2281 ccaggtgagg tctctctaag tcaggctggg gttgaagttt ggagtctgtc cgagtttgca 2341 gggtgctgag ctgtatggtc cagcagggga gtaataaggg ctcttccgga aggggaggaa 2401 gcgggaggca ggcctgcacc tgatgtggag gtacaggcag atcttcccta ccggggaggg 2461 atggatggtt ggatacaggt ggctgggcta ttccatccat ctggaagcac atttgagcct 2521 ccaggcttct ccttgacgtc attcctctcc ttccttgctg caaaatggct ctgcaccagc 2581 cggcccccag gaggtctggc agagctgaga gccatggcct gcaggggctc catatgtccc 2641 tacgcgtgca gcaggcaaac aaga // LOCUS HSU62435 1743 bp mRNA PRI 11-JAN-1997 DEFINITION Human nicotinic acetylcholine receptor alpha6 subunit precursor, mRNA, complete cds. ACCESSION U62435 NID g1458117 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1743) AUTHORS Elliott,K.J., Ellis,S.B., Berckhan,K.J., Urrutia,A., Chavez-Noriega,L.E., Johnson,E.C., Velicelebi,G. and Harpold,M.M. TITLE Comparative structure of human neuronal alpha 2-alpha 7 and beta 2-beta 4 nicotinic acetylcholine receptor subunits and functional expression of the alpha 2, alpha 3, alpha 4, alpha 7, beta 2, and beta 4 subunits JOURNAL J. Mol. Neurosci. 7 (3), 217-228 (1996) MEDLINE 97062879 REFERENCE 2 (bases 1 to 1743) AUTHORS Elliott,K.J. TITLE Direct Submission JOURNAL Submitted (28-JUN-1996) Kathryn J. Elliott, SIBIA Neurosciences, Inc., 505 Coast Blvd. So., La Jolla, CA 92037, USA FEATURES Location/Qualifiers source 1..1743 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="KEalpha6.3" /clone_lib="Clontech #HL1179a, lambda gt10" /tissue_type="Substantia nigra" 5'UTR 1..142 sig_peptide 143..232 CDS 143..1627 /codon_start=1 /product="nicotinic acetylcholine receptor alpha6 subunit precursor" /db_xref="PID:g1458118" /translation="MLTSKGQGFLHGGLCLWLCVFTPFFKGCVGCATEERLFHKLFSH YNQFIRPVENVSDPVTVHFEVAITQLANVDEVNQIMETNLWLRHIWNDYKLRWDPMEY DGIETLRVPADKIWKPDIVLYNNAVGDFQVEGKTKALLKYNGMITWTPPAIFKSSCPM DITFFPFDHQNCSLKFGSWTYDKAEIDLLIIGSKVDMNDFWENSEWEIIDASGYKHDI KYNCCEEIYTDITYSFYIRRLPMFYTINLIIPCLFISFLTVLVFYLPSDCGEKVTLCI SVLLSLTVFLLVITETIPSTSLVVPLVGEYLLFTMIFVTLSIVVTVFVLNIHYRTPTT HTMPRWVKTVFLKLLPQVLLMRWPLDKTRGTGSDAVPRGLARRPAKGKLASHGEPRHL KECFHCHKSNELATSKRRLSHQPLQWVVENSEHSPEVEDVINSVQFIAENMKSHNETK EVEDDWKYVAMVVDRVFLWVFIIVCVFGTAGLFLQPLLGNTGKS" mat_peptide 233..1624 /product="nicotinic acetylcholine receptor alpha6 subunit" 3'UTR 1628..1743 BASE COUNT 469 a 383 c 396 g 495 t ORIGIN 1 cgggttttga tttctgagaa gacacacacg gattgcagtg ggcttctgat gatgtcaagg 61 ttggatgcat gtggctgact gatagctctt tgttttccac aatcctttgc ctaggaaaaa 121 ggaatccaag tgtgttttaa ccatgctgac cagcaagggg cagggattcc ttcatggggg 181 cttgtgtctc tggctgtgtg tgttcacacc tttctttaaa ggctgtgtgg gctgtgcaac 241 tgaggagagg ctcttccaca aactgttttc tcattacaac cagttcatca ggcctgtgga 301 aaacgtttcc gaccctgtca cggtacactt tgaagtggcc atcacccagc tggccaacgt 361 ggatgaagta aaccagatca tggaaaccaa tttgtggctg cgtcacatct ggaatgatta 421 taaattgcgc tgggatccaa tggaatatga tggcattgag actcttcgcg ttcctgcaga 481 taagatttgg aagcccgaca ttgttctcta taacaatgct gttggtgact tccaagtaga 541 aggcaaaaca aaagctcttc ttaaatacaa tggcatgata acctggactc caccagctat 601 ttttaagagt tcctgcccta tggatatcac ctttttccct tttgatcatc aaaactgttc 661 cctaaaattt ggttcctgga cgtatgacaa agctgaaatt gatcttctaa tcattggatc 721 aaaagtggat atgaatgatt tttgggaaaa cagtgaatgg gaaatcattg atgcctctgg 781 ctacaaacat gacatcaaat acaactgttg tgaagagata tacacagata taacctattc 841 tttctacatt agaagattgc cgatgtttta cacgattaat ctgatcatcc cttgtctctt 901 tatttcattt ctaaccgtgt tggtctttta ccttccttcg gactgtggtg aaaaagtgac 961 gctttgtatt tcagtcctgc tttctctgac tgtgtttttg ctggtcatca cagaaaccat 1021 cccatccaca tctctggtgg tcccactggt gggtgagtac ctgctgttca ccatgatctt 1081 tgtcacactg tccatcgtgg tgactgtgtt tgtgttgaac atacactacc gcaccccaac 1141 cacgcacaca atgcccaggt gggtgaagac agttttcctg aagctgctgc cccaggtcct 1201 gctgatgagg tggcctctgg acaagacaag gggcacaggc tctgatgcag tgcccagagg 1261 ccttgccagg aggcctgcca aaggcaagct tgcaagccat ggggaaccca gacatcttaa 1321 agaatgcttc cattgtcaca aatcaaatga gcttgccaca agcaagagaa gattaagtca 1381 tcagccatta cagtgggtgg tggaaaattc ggagcactcg cctgaagttg aagatgtgat 1441 taacagtgtt cagttcatag cagaaaacat gaagagccac aatgaaacca aggaggtaga 1501 agatgactgg aaatacgtgg ccatggtggt ggacagagta tttctttggg tatttataat 1561 tgtctgtgta tttggaactg cagggctatt tctacagcca ctacttggga acacaggaaa 1621 atcttaaaat gtattttctt ttatgttcag aaatttacag acaccatatt tgttctgcat 1681 tccctgccac aaggaaagga aagcaaaggc ttcccaccca agtcccccat ctgctaaaac 1741 ccg // LOCUS HSU62583 2995 bp mRNA PRI 05-FEB-1997 DEFINITION Human Prt1 homolog mRNA, complete cds. ACCESSION U62583 NID g1778050 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2995) AUTHORS Methot,N., Rom,E., Olsen,H. and Sonenberg,N. TITLE The human homologue of the yeast Prt1 protein is an integral part of the eukaryotic initiation factor 3 complex and interacts with p170 JOURNAL J. Biol. Chem. 272 (2), 1110-1116 (1997) MEDLINE 97150874 REFERENCE 2 (bases 1 to 2995) AUTHORS Methot,N., Rom,E., Olsen,H. and Sonenberg,N. TITLE Direct Submission JOURNAL Submitted (28-JUN-1996) Biochemistry, McGill University, 3655 Drummond St. Rm 807, Montreal, QUE H3G 1Y6, Canada FEATURES Location/Qualifiers source 1..2995 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="cDNA library provided by Dr. Morag Park" CDS 54..2675 /function="interacts with the large subunit of eIF3, p170" /note="similar to yeast Prt1; contains an RNA recognition motif; eukaryotic initiation factor 3 subunit" /codon_start=1 /product="Prt1 homolog" /db_xref="PID:g1778051" /translation="MQDAENVAVPEAAEERAEPGQQQPAAEPPPAEGLLRPAGPGAPE AAGTEASSEEVGIAEAGPEPEVRTEPAAEAEAASGPSESPSPPAAEELPGSHAEPPVP AQGEAPGEQARDAGSDSRAQAVSEDAGGNEGRAAEAEPRALENGDADEPSFSDPEDFV DDVSEEELLGDVLKDRPQEADGIDSVIVVDNVPQVGPDRLEKLKNVIHKIFSKFGKIT NDFYPEEDGKTKGYIFLEYASPAHAVDAVKNADGYKLDKQHTFRVNLFTDFDKYMTIS DEWDIPEKQPFKDLGNLRYWLEEAECRDQYSVIFESGDRTSIFWNDVKDPVSIEERAR WTETYVRWSPKGTYLATFHQRGIALWGGEKFKQIQRFSHQGVQLIDFSPCERYLVTFS PLMDTQDDPQAIIIWDILTGHKKRGFHCESSAHWPIFKWSHDGKFFARMTLDTLSIYE TPSMGLLDKKSLKISGIKDFSWSPGGNIIAFWVPEDKDIPARVTLMQLPTRQEIRVRN LFNVVDCKLHWQKNGDYLCVKVDRTPKGTQGVVTNFEIFRMREKQVPVDVVEMKETII AFAWEPNGSKFAVLHGEAPRISVSFYHVKNNGKIELIKMFDKQQANTIFWSPQGQFVV LAGLRSMNGALAFVDTSDCTVMNIAEHYMASDVEWDPTGRYVVTSVSWWSHKVDNAYW LWTFQGRLLQKNNKDRFCQLLWRPRPPTLLSQEQIKQIKKDLKKYSKIFEQKDRLSQS KASKELVERRRTMMEDFRKYRKMAQELYMEQKNERLELRGGVDTDELDSNVDDWEEET IEFFVTEEIIPLGIRSDLEHCAQPCVLWSRGRPAGSRVTPASSLCSLALDCDCAWILP LRHIFVPFSPWCLQWGI" BASE COUNT 717 a 770 c 910 g 598 t ORIGIN 1 tgctaccgaa ggccggcggc cgcggagccc tgcgagtagg cagcgttggg cccatgcagg 61 acgcggagaa cgtggcggtg cccgaggcgg ccgaggagcg cgccgagccc ggccagcagc 121 agccggccgc cgagccgccg ccagccgagg ggctgctgcg gcccgcgggg cccggcgctc 181 cggaggccgc ggggaccgag gcctccagtg aggaggtggg gatcgcggag gccgggccgg 241 agcccgaggt gaggaccgag ccggcggccg aggcagaggc ggcctccggc ccgtccgagt 301 cgccctcgcc gccggccgcc gaggagctgc ccgggtcgca tgctgagccc cctgtcccgg 361 cacagggcga ggccccagga gagcaggctc gggacgcagg ctccgacagc cgggcccagg 421 cggtgtccga ggacgcggga ggaaacgagg gcagagcggc cgaggccgaa ccccgggcgc 481 tggagaacgg cgacgcggac gagccctcct tcagcgaccc cgaggacttc gtggacgacg 541 tgagcgagga agaattactg ggagatgtac tcaaagatcg gccccaggaa gcagatggaa 601 tcgattcggt gattgtagtg gacaatgtcc ctcaggtggg acccgaccga cttgagaaac 661 tcaaaaatgt catccacaag atcttttcca agtttgggaa aatcacaaat gatttttatc 721 ctgaagagga tgggaagaca aaagggtata ttttcctgga gtacgcgtcc cctgcccacg 781 ctgtggatgc tgtgaagaac gccgacggct acaagcttga caagcagcac acattccggg 841 tcaacctctt tacggatttt gacaagtata tgacgatcag tgacgagtgg gatattccag 901 agaaacagcc tttcaaagac ctggggaact tacgttactg gcttgaagag gcagaatgca 961 gagatcagta cagtgtgatt tttgagagtg gagaccgcac ttccatattc tggaatgacg 1021 taaaagaccc tgtctcaatt gaagaaagag cgagatggac agagacgtat gtgcgttggt 1081 ctcctaaggg cacctacctg gctacctttc atcaaagagg cattgctcta tgggggggag 1141 agaaattcaa gcaaattcag agattcagcc accaaggggt tcagcttatt gacttctcac 1201 cttgtgaaag gtacctggtg acctttagcc ccctgatgga cacgcaggat gaccctcagg 1261 ccataatcat ctgggacatc cttacggggc acaagaagag gggttttcac tgtgagagct 1321 cagcccattg gcctattttt aagtggagcc atgatggcaa attctttgcc agaatgaccc 1381 tggatacgct tagcatctat gaaactcctt ctatgggtct tttggacaag aagagtttga 1441 agatctctgg gataaaagac ttttcttggt ctcctggtgg taacataatc gccttctggg 1501 tgcctgaaga caaagatatt ccagccaggg taaccctgat gcagctccct accaggcaag 1561 agatccgagt gaggaacctg ttcaatgtgg tggactgcaa gctccattgg cagaagaacg 1621 gagactactt gtgtgtgaaa gtagatagga ctccgaaagg cacccagggt gttgtcacaa 1681 attttgaaat tttccgaatg agggagaaac aggtacctgt ggatgtggtc gagatgaaag 1741 aaaccatcat agcctttgcc tgggaaccaa atggaagtaa gtttgctgtg ctgcacggag 1801 aggctccgcg gatatctgtg tctttctacc acgtcaaaaa caacgggaag attgaactca 1861 tcaagatgtt cgacaagcag caggcgaaca ccatcttctg gagcccccaa ggacagttcg 1921 tggtgttggc gggcctgagg agtatgaacg gtgccttagc gtttgtggac acttcggact 1981 gcacggtcat gaacatcgca gagcactaca tggcttccga cgtcgaatgg gatcctactg 2041 ggcgctacgt cgtcacctct gtgtcctggt ggagccataa ggtggacaac gcgtactggc 2101 tgtggacttt ccagggacgc ctcctgcaga agaacaacaa ggaccgcttc tgccagctgc 2161 tgtggcggcc ccggcctccc acactcctga gccaggaaca gatcaagcaa attaaaaagg 2221 atctgaagaa atactctaag atctttgaac agaaggatcg tttgagtcag tccaaagcct 2281 caaaggaatt ggtggagaga aggcgcacca tgatggaaga tttccggaag taccggaaaa 2341 tggcccagga gctctatatg gagcagaaaa acgagcgcct ggagttgcga ggaggggtgg 2401 acactgacga gctggacagc aacgtggacg actgggaaga ggagaccatt gagttcttcg 2461 tcactgaaga aatcattccc ctcggaatca ggagtgacct ggagcactgt gcgcagccgt 2521 gtgtgctgtg gagccgaggc cgtcctgcag gaagccgcgt gactcccgcc tcctccctgt 2581 gctctctggc tctggactgt gactgcgcct ggattctgcc attgcgacac atttttgtgc 2641 ctttcagccc ctggtgtctg cagtggggga tttaaggcac ccgcttccac ttctttcttg 2701 tttggagttt tctgttggaa ccgccggcgt tggctccgaa gacttagcga cgcactggcg 2761 gcaccttctc ctgcgcccag tgatgtttcc acggtgcctg tacacagccg agcagcattt 2821 ccgttgaagg acttgcatcc ccattgcggg cagtgctgga cgtgtcccgg agacccaccg 2881 gagggcgccg catgccttgt acccccaccg tgcaggttgt ggccggtttt ctccgcaggt 2941 tgaacatgga aataaaagca aacttgtatg aaaaaaaaaa aaaaaaaaac tcgag // LOCUS HSU62647 1183 bp mRNA PRI 25-JUL-1997 DEFINITION Human DNase 1 homolog (DNAS1L2) mRNA, complete cds. ACCESSION U62647 NID g1518783 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1183) AUTHORS Germino,G.G., Weinstat-Saslow,D., Himmelbauer,H., Gillespie,G.A., Somlo,S., Wirth,B., Barton,N., Harris,K.L., Frischauf,A.M. and Reeders,S.T. TITLE The gene for autosomal dominant polycystic kidney disease lies in a 750-kb CpG-rich region JOURNAL Genomics 13 (1), 144-151 (1992) MEDLINE 92250035 REFERENCE 2 (bases 1 to 1183) AUTHORS Rodriguez,A.M., Rodin,D., Nomura,H., Morton,C.C., Weremowicz,S. and Schneider,M.C. TITLE Identification, localization, and expression of two novel human genes similar to deoxyribonuclease I JOURNAL Genomics 42 (3), 507-513 (1997) MEDLINE 97349121 REFERENCE 3 (bases 1 to 1183) AUTHORS Rodriguez,A., Rodin,D., Nomura,H. and Schneider,M.C. TITLE Direct Submission JOURNAL Submitted (28-JUN-1996) Renal Division, Brigham and Women's Hospital, 75 Francis Street, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..1183 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" /map="16p13.3" /clone="H42990; y011c03.s1; 177604" /clone_lib="Soares adult brain, N2b5HB55Y" gene 1..1183 /gene="DNAS1L2" CDS 93..992 /gene="DNAS1L2" /note="DNase1-Like II; probable deoxyribonuclease" /codon_start=1 /product="DNase 1 homolog" /db_xref="PID:g1518784" /translation="MGGPRALLAALWALEAAGTAALRIGAFNIQSFGDSKVSDPACGS IIAKILAGYDLALVQEVRDPDLSAVSALMEQINSVSEHEYSFVSSQPLGRDQYKEMYL FVYRKDAVSVVDTYLYPDPEDVFSREPFVVKFSAPGTGERAPPLPSRRALTPPPLPAA AQNLVLIPLHAAPHQAVAEIDALYDVYLDVIDKWGTDDMLFLGDFNADCSYVRAQDWA AIRLRSSEVFKWLIPDSADTTVGNSDCAYDRIVACGARLRRSLKPQSATVHDFQEEFG LDQTQALAISDHFPVEVTLKFHR" BASE COUNT 199 a 402 c 383 g 199 t ORIGIN 1 gcggccgcgg agggaagggg tgggtcggtg ggtctgacag cgggtctgcg taggcggcag 61 cgtctgtccc tcccagcctc tcgctccgcg ccatgggcgg gccccgggct ctgctggccg 121 cactctgggc gctggaagcc gccgggaccg ccgcgcttcg catcggagcc ttcaacattc 181 agagcttcgg tgacagcaaa gtgtcggacc ccgcttgcgg cagcatcatc gcgaagatcc 241 tggctggcta tgacctcgcg ctggtgcagg aggtgcgaga cccagacctc agcgccgtgt 301 ccgcgctcat ggagcagatc aacagcgtgt ccgagcacga gtacagcttt gtgagcagcc 361 agcccctggg ccgggaccag tacaaggaga tgtacctgtt cgtgtacagg aaagacgcgg 421 tgtcggtcgt ggacacctac ctgtacccag accccgagga cgtcttcagc cgcgagccct 481 tcgtggtcaa gttctcggcc cccggcaccg gtgagcgggc cccgcccctc ccctcccgcc 541 gagctctgac gcccccaccc cttcccgcag cagcacagaa cctggtgctg atcccgctgc 601 acgcggcgcc gcatcaagcc gtggcggaga tcgacgcgct ctacgacgtg tacctggacg 661 tgatcgacaa gtggggcacc gacgacatgc tgttcctggg cgacttcaac gccgactgca 721 gctatgtgcg ggcgcaggac tgggccgcca tccgtctgag gagcagtgag gtcttcaagt 781 ggctcatccc tgacagcgcc gacaccacgg tgggcaactc agactgcgcc tacgaccgca 841 ttgtggcctg tggcgcccgc ctgcgccgga gcctgaagcc ccagtcggcc accgtgcacg 901 acttccagga ggaattcggc ctggaccaga ctcaggctct tgccatcagc gaccactttc 961 cagtggaggt gaccctcaag ttccaccgat gactcgaggc ctgactgggg catgccacct 1021 gcagaccctg gctctgagga atggcccaac agtggcccct tcagggtggc agccaccctt 1081 cagtgaggcc ccaaggcaga gtcggctggg cgtggaccag gggcatggac acgtgatgtg 1141 ctgctctgta cctccgttcc ccatctgtgg gacgggctgg atc // LOCUS HSU62739 1509 bp mRNA PRI 28-APR-1997 DEFINITION Human branched-chain amino acid aminotransferase (ECA40) mRNA, complete cds. ACCESSION U62739 NID g2052345 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1509) AUTHORS Eden,A., Simchen,G. and Benvenisty,N. TITLE Two yeast homologs of ECA39, a target for c-Myc regulation, code for cytosolic and mitochondrial branched-chain amino acid aminotransferases JOURNAL JBC (1996) In press REFERENCE 2 (bases 1 to 1509) AUTHORS Eden,A., Simchen,G. and Benvenisty,N. TITLE Direct Submission JOURNAL Submitted (02-JUL-1996) Genetics, Hebrew University, Givat Ram, Jerusalem 91904, Israel FEATURES Location/Qualifiers source 1..1509 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="infant" /tissue_type="brain" /clone="ATCC 353794" gene 97..1155 /gene="ECA40" CDS 97..1155 /gene="ECA40" /note="BCAT; putative branched-chain amino acid aminotransferase, based on homology to yeast BAT1 and BAT2 genes; similar to human ECA39, encoded by Genbank Accession Number U21551" /codon_start=1 /product="branched-chain amino acid aminotransferase" /db_xref="PID:g2052346" /translation="MTQKPHKKPGPGEPLVFGKTFTDHMLMVEWNDKGWGQPRIQPFQ NLTLHPASSSLHYSLQLFEGMKAFKGKDQQVRLFRPWLNMDRMLRSAMRLCLPSFDKL ELLECIRRLIELDKDWVPDAAGTSLYVRPVLIGNEPSLGVSQPRRALLFVILCPVGAY FPGGSVNPVSLLAEPTFIRAWVGGVGNYKLGGNYGPTVLVQQEALKRGCEQVFWLYGP DHQLTEVGTMNIFVYWTHEDGVLELVTPPLNGVILPGVVRQSLLDMAQTWGEFRVVER TITMKQLLRPLEEARVREVFGSGTACQVCPVHGILYKDRNFHIPTMENGPELIFRFQK ELKEIQYGIRAHEWMFPV" BASE COUNT 296 a 459 c 432 g 318 t 4 others ORIGIN 1 tacncaagct tggcacgaag gctgtccctt ggcttctgtg tggtcccaaa aaaatatgcc 61 tcctccagtt tcaaggctgc agacctgcag ctggaaatga cacagaagcc tcataagaag 121 cctggccccg gcgagcccct ggtgtttggg aaaacattta ccgaccacat gctgatggtg 181 gaatggaatg acaagggctg gggccagccc cgaatccagc ccttccagaa cctcacgctg 241 cacccagcct cctccagcct ccactactcc ctgcagctgt ttgagggcat gaaggcgttc 301 aaaggcaaag accagcaggt gcgcctcttc cgcccctggc tcaacatgga ccggatgctg 361 cgctcagcca tgcgcctgtg cctgccgagt ttcgacaagc tggagttgct ggagtgcatc 421 cgccggctca tcgaattgga caaggactgg gtccccgatg ccgccggcac cagcctctat 481 gtgcggcctg tgctcattgg gaacgagccc tcgctgggtg tcagccagcc caggcgcgcg 541 ctcctgttcg tcattctctg cccagtgggt gcctacttcc ctggaggctc cgtgaacccg 601 gtctccctcc tggccgaacc aaccttcatc cgggcctggg ttggcggggt cggcaactac 661 aagttaggtg ggaattatgg gcccaccgtg ttagtgcaac aggaggcact caagcggggc 721 tgtgaacagg tcttctggct gtatgggccc gaccaccagc tcaccgaggt gggaaccatg 781 aacatctttg tctactggac ccacgaagat ggggtgctgg agctggtgac gcccccgctg 841 aatggtgtta tcctgcctgg agtggtcaga cagagtctac tggacatggc tcagacctgg 901 ggtgagttcc gggtggtgga gcgcacgatc accatgaagc agttgttgcg gcccttggag 961 gaggcccgcg tgcgggaagt ctttggctcg ggcaccgctt gccaggtctg cccagtgcac 1021 ggaatcctgt acaaagacag gaacttccat attcccacca tggaaaatgg gcctgagctg 1081 atcttccgct tccagaagga gctgaaggag atccagtacg gaatcagagc ccacgagtgg 1141 atgttcccgg tgtgaagctg caggctgtgc tccagatcca ccgacccgta gcatntcgta 1201 acgccagcac tcgcntcctt accaatgact cacctgaagt gcaatacgaa ataaaaggcc 1261 agcgggcggc gtctgggtct ctggcgcccc catgtggttg cgacactccc aaagccgtaa 1321 gggccgaccc aggcatcttg gcccccagcc cntcgtcgcg ggttcaggtc cgcccattac 1381 tcccttgtcg tgcggtcaag gatacacctt ggccccgatt ccggatctct ccgttctcag 1441 gccagacccc tggtgctgcc gttgattttt ttttctctgt ctttgctgca attttgaaat 1501 aaaatgcca // LOCUS HSU62740 3175 bp mRNA PRI 01-SEP-1996 DEFINITION Human hereditary multiple exostoses gene 2 (EXT2) mRNA, complete cds. ACCESSION U62740 NID g1518041 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3175) AUTHORS Stickens,D.J., Clines,G., Burbee,D.G., Ramos,P., Thomas,S., Hogue,D., Hecht,J.T., Lovett,M. and Evans,G.A. TITLE The EXT2 multiple exostoses gene defines a family of putative tumour suppressor genes JOURNAL Nature Genet. (1996) In press REFERENCE 2 (bases 1 to 3175) AUTHORS Stickens,D.J., Clines,G., Burbee,D.G., Ramos,P., Thomas,S., Hogue,D., Hecht,J.T., Lovett,M. and Evans,G.A. TITLE Direct Submission JOURNAL Submitted (01-JUL-1996) Mcdermott Center, UT Southwestern Medical Center, 6000 Harry Hines Blvd, Dallas, TX 75235, USA FEATURES Location/Qualifiers source 1..3175 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /chromosome="11" /map="11p11" gene 335..2491 /gene="EXT2" CDS 335..2491 /gene="EXT2" /note="Description: Hereditary multiple exostoses gene 2" /codon_start=1 /product="EXT2" /db_xref="PID:g1518042" /translation="MCASVKYNIRGPALIPRMKTKHRIYYITLFSIVLLGLIATGMFQ FWPHSIESSNDWNVEKRSIRDVPVVRLPADSPIPERGDLSCRMHTCFDVYRCGFNPKN KIKVYIYALKKYVDDFGVSVSNTISREYNELLMAISDSDYYTDDINRACLFVPSIDVL NQNTLRIKETAQAMAQLSRWDRGTNHLLFNMLPGGPPDYNTALDVPRDRALLAGGGFS TWTYRQGYDVSIPVYSPLSAEVDLPEKGPGPRQYFLLSSQVGLHPEYREDLEALQVKH GESVLVLDKCTNLSEGVLSVRKRCHKHQVFDYPQVLQEATFCVVLRGARLGQAVLSDV LQAGCVPVVIADSYILPFSEVLDWKRASVVVPEEKMSDVYSILQSIPQRQIEEMQRQA RWFWEAYFQSIKAIALATLQIINDRIYPYAAISYEEWNDPPAVKWGSVSNPLFLPLIP PQSQGFTAIVLTYDRVESLFRVITEVSKVPSLSKLLVVWNNQNKNPPEDSLWPKIRVP LKVVRTAENKLSNRFFPYDEIETEAVLAIDDDIIMLTSDELQFGYEVWREFPDRLVGY PGRLHLWDHEMNKWKYESEWTNEVSMVLTGAAFYHKYFNYLYTYKMPGDIKNWVDAHM NCEDIAMNFLVANVTGKAVIKVTPRKKFKCPECTAIDGLSLDQTHMVERSECINKFAS VFGTMPLKVVEHRADPVLYKDDFPEKLKSFPNIGSL" BASE COUNT 780 a 792 c 817 g 786 t ORIGIN 1 ctgtctgagc atttcactgc ggagcctgag cgcgcctgcc tgggaaaaca ctgcagcggt 61 gctcggactc ctcctgtcca gcaggaggcg cggcccggca gctcccgcat gcgcagtgcg 121 ctcggtgtca gacggcccgg atcccggtta ccggcccctc gctcgctgct cgccagccca 181 gactcggccc tggcagtggc ggctggcgat tcggaccgat ccgacctggg cggaggtggc 241 ccgcgccccg cggcatgagc cggtgaccaa gctcggggcc gagcgggagg cagccgtggc 301 cgaggagtgt gaggaagagg ctgtctgtgt cattatgtgt gcgtcggtca agtataatat 361 ccggggtcct gccctcatcc caagaatgaa gaccaagcac cgaatctact atatcaccct 421 cttctccatt gtcctcctgg gcctcattgc cactggcatg tttcagtttt ggccccattc 481 tatcgagtcc tcaaatgact ggaatgtaga gaagcgcagc atccgtgatg tgccggttgt 541 taggctgcca gccgacagtc ccatcccaga gcggggggat ctcagttgca gaatgcacac 601 gtgttttgat gtctatcgct gtggcttcaa cccaaagaac aaaatcaagg tgtatatcta 661 tgctctgaaa aagtacgtgg atgactttgg cgtctctgtc agcaacacca tctcccggga 721 gtataatgaa ctgctcatgg ccatctcaga cagtgactac tacactgatg acatcaaccg 781 ggcctgtctg tttgttccct ccatcgatgt gcttaaccag aacacactgc gcatcaagga 841 gacagcacaa gcgatggccc agctctctag gtgggatcga ggtacgaatc acctgttgtt 901 caacatgttg cctggaggtc ccccagatta taacacagcc ctggatgtcc ccagagacag 961 ggccctgttg gctggtggcg gcttttctac gtggacttac cggcaaggct acgatgtcag 1021 cattcctgtc tatagtccac tgtcagctga ggtggatctt ccagagaaag gaccaggtcc 1081 acggcaatac ttcctcctgt catctcaggt gggtctccat cctgagtaca gagaggacct 1141 agaagccctc caggtcaaac atggagagtc agtgttagta ctcgataaat gcaccaacct 1201 ctcagagggt gtcctttctg tccgtaagcg ctgccacaag caccaggtct tcgattaccc 1261 acaggtgcta caggaggcta ctttctgtgt ggttcttcgt ggagctcggc tgggccaggc 1321 agtattgagc gatgtgttac aagctggctg tgtcccggtt gtcattgcag actcctatat 1381 tttgcctttc tctgaagttc ttgactggaa gagagcatct gtggttgtac cagaagaaaa 1441 gatgtcagat gtgtacagta ttttgcagag catcccccaa agacagattg aagaaatgca 1501 gagacaggcc cggtggttct gggaagcgta cttccagtca attaaagcca ttgccctggc 1561 caccctgcag attatcaatg accggatcta tccatatgct gccatctcct atgaagaatg 1621 gaatgaccct cctgctgtga agtggggcag cgtgagcaat ccactcttcc tcccgctgat 1681 cccaccacag tctcaagggt tcaccgccat agtcctcacc tacgaccgag tagagagcct 1741 cttccgggtc atcactgaag tgtccaaggt gcccagtcta tccaaactac ttgtcgtctg 1801 gaataatcag aataaaaacc ctccagaaga ttctctctgg cccaaaatcc gggttccatt 1861 aaaagttgtg aggactgctg aaaacaagtt aagtaaccgt ttcttccctt atgatgaaat 1921 cgagacagaa gctgttctgg ccattgatga tgatatcatt atgctgacct ctgacgagct 1981 gcaatttggt tatgaggtct ggcgggaatt tcctgaccgg ttggtgggtt acccgggtcg 2041 tctgcatctc tgggaccatg agatgaataa gtggaagtat gagtctgagt ggacgaatga 2101 agtgtccatg gtgctcactg gggcagcttt ttatcacaag tattttaatt acctgtatac 2161 ctacaaaatg cctggggata tcaagaactg ggtagatgct catatgaact gtgaagatat 2221 tgccatgaac ttcctggtgg ccaacgtcac gggaaaagca gttatcaagg taaccccacg 2281 aaagaaattc aagtgtcctg agtgcacagc catagatggg ctttcactag accaaacaca 2341 catggtggag aggtcagagt gcatcaacaa gtttgcttca gtcttcggga ccatgcctct 2401 caaggtggtg gaacaccgag ctgaccctgt cctgtacaaa gatgactttc ctgagaagct 2461 gaagagcttc cccaacattg gcagcttatg aaacgtgtca ttggtggagg tctgaatgtg 2521 aggctgggac agagggagag aacaaggcct cccagcactc tgatgtcaga gtagtaggtt 2581 aagggtggaa ggttgaccta cttggatctt ggcatgcacc cacctaaccc actttctcaa 2641 gaacaagaac ctagaatgaa tatccaagca cctcgagcta tgcaacctct gttcttgtat 2701 ttcttatgat ctctgatggg ttcttctcga aaatgccaag tggaagactt tgtggcatgc 2761 tccagattta aatccagctg aggctccctt tgttttcagt tccatgtaac aatctggaag 2821 gaaacttcac ggacaggaag actgctggag aagagaagcg tgttagccca tttgaggtct 2881 ggggaatcat gtaaagggta cccagacctc acttttagtt atttacatca atgagttctt 2941 tcagggaacc aaacccagaa ttcggtgcaa aagccaaaca tcttggtggg atttgataaa 3001 tgccttggga cctggagtgc tgggcttgtg cacaggaaga gcaccagccg ctgagtcagg 3061 atcctgtcag ttccatgagc tattcctctt tggtttggct ttttgatatg attaaaatta 3121 ttttttattc cttttaaaaa aaaaaaaaaa aaaaaaaatt cgtcgtgctt aaaca // LOCUS HSU62768 3642 bp mRNA PRI 15-AUG-1997 DEFINITION Human oxytocinase splice variant 1 mRNA, complete cds. ACCESSION U62768 NID g2209275 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3642) AUTHORS Laustsen,P.G., Rasmussen,T.E., Petersen,K., Pedraza-Diaz,S., Moestrup,S.K., Gliemann,J., Sottrup-Jensen,L. and Kristensen,T. TITLE The complete amino acid sequence of human placental oxytocinase JOURNAL Biochim. Biophys. Acta 1352 (1), 1-7 (1997) MEDLINE 97320624 REFERENCE 2 (bases 1 to 3642) AUTHORS Laustsen,P.G., Rasmussen,T.E., Petersen,K., Moestrup,S., Gliemann,J., Sottrup-Jensen,L. and Kristensen,T. TITLE Direct Submission JOURNAL Submitted (02-JUL-1996) Department of Molecular and Structural Biology, Aarhus University, Langelandsgade 140, Aarhus 8000 C, Denmark FEATURES Location/Qualifiers source 1..3642 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" CDS 62..3139 /EC_number="3.4.11.3" /function="inactivates oxytocin and vasopressin by hydrolysis of the peptide bond between half Cys and Tyr" /note="cysteine aminopeptidase; leucine aminopeptidase; vasopressinase" /codon_start=1 /product="oxytocinase splice variant 1" /db_xref="PID:g2209276" /translation="MEPFTNDRLQLPRNMIENSMFEEEPDVVDLAKEPCLHPLEPDEV EYEPRGSRLLVRGLGEHEMEEDEEDYESSAKLLGMSFMNRSSGLRNSATGYRQSPDGA CSVPSARTMVVCAFVIVVAVSVIMVIYLLPRCTFTKEGCHKKNQSIGLIQPFATNGKL FPWAQIRLPTAVVPLRYELSLHPNLTSMTFRGSVTISVQALQVTWNIILHSTGHNISR VTFMSAVSSQEKQAEILEYAYHGQIAIVAPEALLAGHNYTLKIEYSANISSSYYGFYG FSYTDESNEKKYFAATQFEPLAARSAFPCFDEPAFKATFIIKIIRDEQYTALSNMPKK SSVVLDDGLVQDEFSESVKMSTYLVAFIVGEMKNLSQDVNGTLVSIYAVPENIGQVHY ALETTVKLLEFFQNYFEIQYPLKKLDLVAIPDFEAGAMENWGLLTFREETLLYDSNTS SMADRKLVTKIIAHELAHQWFGNLVTMKWWNDLWLNEGFATFMEYFSLEKIFKELSSY EDFLDARFKTMKKDSLNSSHPISSSVQSSEQIEEMFDSLSYFKGSSLLLMLKTYLSED VFQHAVVLYLHNHSYASIQSDDLWDSFNEVTNQTLDVKRMMKTWTLQKGFPLVTVQKK GKELFIQQERFFLNMKPEIQPSDTSYLWHIPLSYVTEGRNYSKYQSVSLLDKKSGVIN LTEEVLWVKVNINMNGYYIVHYADDDWEALIHQLKINPYVLSDKDRANLINNIFELAG LGKVPLKRAFDLINYLGNENHTAPITEALFQTDLIYNLLEKLGYMDLASRLVTRVFKL LQNQIQQQTWTDEGTPSMRELRSALLEFACTHNLGNCSTTAMKLFDDWMASNGTQSLP TDVMTTVFKVGAKTDKGWSFLLGKYISIGSEAEKNKILEALASSEDVRKLYWLMKSSL NGDNFRTQKLSFIIRTVGRHFPGHLLAWDFVKENWNKLVQKFPLGSYTIQNIVAGSTY LFSTKTHLSEVQAFFENQSEATFRLRCVQEALEVIQLNIQWMEKNLKSLTWWL" BASE COUNT 1076 a 733 c 799 g 1034 t ORIGIN 1 ctcagctctc ggagtaggaa gctcgggcgc tccggctgta aggagccgcg gcagggggaa 61 aatggagccc ttcaccaatg atcggcttca gctccccagg aatatgattg aaaacagcat 121 gtttgaggaa gaaccagatg tggtggattt agccaaagag ccttgtttac atcctctaga 181 gcctgatgag gtggaatatg agccccgggg ttcccgactg ctggtgcggg gtcttggtga 241 gcatgagatg gaggaggatg aagaggatta tgagtcatca gcaaagctgc tgggcatgtc 301 cttcatgaat agaagctcag gccttcggaa cagtgcaact ggttacaggc agagcccaga 361 tggggcttgt tcagtaccct ctgcaaggac catggtggtc tgtgcttttg tcatcgtggt 421 tgctgtttct gtaatcatgg tgatttactt actgcccaga tgtaccttta ccaaagaagg 481 ctgccataaa aaaaaccagt caattggact aattcagcca tttgcaacaa atgggaaatt 541 gtttccatgg gcacagatca ggcttcccac tgccgttgtg ccactacgct atgaactcag 601 cctacacccg aacctaacct cgatgacatt caggggttct gtgacaattt cagttcaggc 661 tcttcaggtc acatggaata tcattcttca tagcacaggt cataatattt caagagtgac 721 ctttatgtca gcagtttcaa gccaagaaaa acaagctgag atcctggaat atgcatatca 781 tggacagatc gccattgttg cccccgaagc ccttctagca gggcacaatt atacgttgaa 841 gatagagtac tcggcaaata tatctagttc ttattatggg ttttatggct tctcctacac 901 agatgaaagt aatgagaaaa agtactttgc agcaactcag tttgaacccc tggcagcaag 961 atctgctttt ccttgttttg atgaaccagc atttaaagcc acttttatca tcaagatcat 1021 aagggatgag caatacaccg ctttatcaaa tatgcctaag aagtcatcag tcgttctaga 1081 tgatggactt gttcaggatg agttttctga gagtgtgaag atgagcactt acttggttgc 1141 tttcattgtg ggagagatga agaacctgag tcaggacgta aatggaaccc tggtttctat 1201 atatgctgta ccagaaaata ttggtcaagt tcattatgcc ttggaaacaa ctgtgaagct 1261 tcttgagttt tttcaaaact actttgaaat tcagtaccca cttaagaaat tggatttggt 1321 ggctattcct gactttgaag caggagcaat ggaaaattgg ggtttgctca ccttccgaga 1381 ggagacactt ctgtatgaca gtaacacttc ttcaatggcg gatagaaagc tggtgactaa 1441 aatcattgct catgagctgg cccaccagtg gtttggcaat ctggtaacaa tgaagtggtg 1501 gaatgaccta tggctaaatg aaggttttgc cactttcatg gagtatttct ctttggaaaa 1561 aatattcaaa gagctttcta gttatgaaga tttcttagat gctcgattta aaaccatgaa 1621 gaaagattcc ttaaattcat ctcatccaat atcatcatct gttcagtctt cagaacaaat 1681 tgaagaaatg tttgattctc tttcctattt taagggatct tctctcttgt tgatgttgaa 1741 aacttacctt agtgaagatg tgtttcaaca tgctgttgtc ctttacctgc ataatcacag 1801 ctatgcatct attcaaagtg atgatctgtg ggatagtttt aatgaggtca caaaccaaac 1861 actagatgta aagagaatga tgaaaacctg gaccctgcag aaaggatttc ctttagtgac 1921 tgttcaaaag aaaggaaagg aactttttat acaacaagag agattctttt taaatatgaa 1981 gcctgaaatt cagccttcag atacaagcta cctgtggcat attccactat cctatgtcac 2041 tgaaggaaga aattattcaa aatatcaatc ggtatcatta ctggataaga aatcaggtgt 2101 catcaatctt acagaagaag tgctgtgggt caaagtgaat ataaacatga atggttatta 2161 tattgtacac tatgcagatg atgattggga agcactaatc catcagttga aaataaatcc 2221 ttatgttctg agtgacaaag accgagccaa ccttatcaac aacatctttg aacttgcagg 2281 cctaggcaag gtacctctca agagggcctt tgatttgatt aattatcttg gaaatgagaa 2341 ccatactgca cccatcaccg aagccctgtt tcagacagac ctcatctata acctccttga 2401 aaaactggga tacatggatc tggcctcaag actggtgact agggtattta aattacttca 2461 aaaccaaatt caacaacaaa cttggactga tgagggcact ccatctatgc gagagcttcg 2521 gtcagccctg ctagagtttg cttgcaccca caacctgggg aactgctcta ctactgccat 2581 gaaactgttt gatgactgga tggcatccaa tggaactcaa agcctaccta ctgatgtcat 2641 gacaactgtg ttcaaagttg gagcaaaaac tgacaaaggc tggtcattcc ttttgggcaa 2701 atacatttct ataggctctg aagcagagaa gaacaaaata ctagaagcac ttgccagctc 2761 agaggatgtg cggaagcttt actggttaat gaaaagtagc ctgaatggag ataacttccg 2821 aacacagaag ctgtctttta tcattagaac agtgggtcga cattttcctg gacacttact 2881 ggcatgggat tttgtcaaag agaactggaa taagcttgta cagaagttcc ctctggggtc 2941 ctataccata caaaatattg ttgctggatc aacttacctg ttttcaacaa agacacattt 3001 atctgaggtt caggcattct ttgaaaatca gtcagaggca accttccggc ttcgttgtgt 3061 ccaggaggct ttggaagtca ttcagttgaa tatccagtgg atggagaaga acctcaaaag 3121 tctcacatgg tggctgtagc atgcacaacc gcacctcatt ttgttgccca ttcagagagc 3181 ttgtaagctt gggctctgcc gcttttgcaa aagccaaggt aaagccagga tcgctgccaa 3241 gttgtttgca ctctttggag ttctagttag ctcagggcct gactgtattt ttcatccatc 3301 ttttctgaag tgtctttggg cagtatgtag ttatttatta caaaattata ttcacgtaaa 3361 tgccaaccat ctacaaaaac aatgagtaat ttttctactt tgaagataca cagatgggga 3421 caaaaaccct gttttggaat tctgttctat tcctcagtat ccagaaagtt actgacacag 3481 taaaacaagg aaagttctac cctaagagcc gccatcactt caggccgctg gtttgtcagc 3541 catctgctgc ttcttattga tagatggcat tggaatgtgg tacaaagtta gctctgaaga 3601 atatggtaac gaagacaata aagcatgcac tgtaagaact ga // LOCUS HSU62800 577 bp mRNA PRI 24-AUG-1996 DEFINITION Human cystatin M (CST6) mRNA, complete cds. ACCESSION U62800 NID g1488690 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 577) AUTHORS Sotiropoulou,G., Anisowicz,A. and Sager,R. TITLE Identification, Cloning and Characterization of cystatin M, a Novel Cysteine Proteinase Inhibitor, Down-Regulated in Breast Cancer JOURNAL J. Biol. Chem. (1996) In press REFERENCE 2 (bases 1 to 577) AUTHORS Sotiropoulou,G., Anisowicz,A. and Sager,R. TITLE Direct Submission JOURNAL Submitted (02-JUL-1996) Cancer Genetics, Dana-Farber Cancer Institute/Harvard Medical School, 44 Binney, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..577 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11q11-q13" /tissue_type="mammary epithelial" /cell_line="21PT" gene 24..473 /gene="CST6" CDS 24..473 /gene="CST6" /function="cysteine proteinase inhibitor" /note="similar to human cystatin family 2 members" /codon_start=1 /product="cystatin M" /db_xref="PID:g1488691" /translation="MARSNLPLALGLALVAFCLLALPRDARARPQERMVGELRDLSPD DPQVQKAAQAAVASYNMGSNSIYYFRDTHIIKAQSQLVAGIKYFLTMEMGSTDCRKTR VTGDHVDLTTCPLAAGAQQEKLRCDFEVLVVPWQNSSQLLKHNCVQM" BASE COUNT 110 a 181 c 187 g 99 t ORIGIN 1 gagctccgac ggcactgacg gccatggcgc gttcgaacct cccgctggcg ctgggcctgg 61 ccctggtcgc attctgcctc ctggcgctgc cacgcgatgc ccgggcccgg ccgcaggagc 121 gcatggtcgg agaactccgg gacctgtcgc ccgacgaccc gcaggtgcag aaggcggcgc 181 aggcggccgt ggccagctac aacatgggca gcaacagcat ctactacttc cgagacacgc 241 acatcatcaa ggcgcagagc cagctggtgg ccggcatcaa gtacttcctg acgatggaga 301 tggggagcac agactgccgc aagaccaggg tcactggaga ccacgtcgac ctcaccactt 361 gccccctggc agcaggggcg cagcaggaga agctgcgctg tgactttgag gtccttgtgg 421 ttccctggca gaactcctct cagctcctaa agcacaactg tgtgcagatg tgataagtcc 481 ccgagggcga aggccattgg gtttggggcc atggtggagg gcacttcagg tccgtgggcc 541 gtatctgtca caataaatgg ccagtgctgc ttcttgc // LOCUS HSU62961 3337 bp mRNA PRI 05-SEP-1996 DEFINITION Human succinyl CoA:3-oxoacid CoA transferase precursor (OXCT) mRNA, complete cds. ACCESSION U62961 NID g1519051 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3337) AUTHORS Kassovska-Bratinova,S.G., Fukao,T., Song,X., Duncan,A., Chen,H., Robert,M., Chartrand,C., Vobecky,S., Perez-Cerda,C., Ugarte,M., Kondo,N. and Mitchell,G.A. TITLE Succinyl CoA:3-oxoacid CoA transferase (SCOT): human cDNA cloning, human chromosomal mapping to 5p13, and mutation detection in a SCOT-deficient patient JOURNAL Am. J. Hum. Genet. 59 59, 519-528 (1996) REFERENCE 2 (bases 1 to 3337) AUTHORS Mitchell,G.A. TITLE Direct Submission JOURNAL Submitted (04-JUL-1996) Genetique Medicale, Hopital Ste Justine, 3175, Cote Ste Catherine, Montreal, Quebec H3T 1C5, Canada FEATURES Location/Qualifiers source 1..3337 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="SCOT-B, SCOT-C, SCOT-D, SCOT-F, SCOT-G" /chromosome="5" /map="proximal 5p13" /tissue_type="heart" 5'UTR 1..98 gene 99..1661 /gene="OXCT" sig_peptide 99..215 /gene="OXCT" CDS 99..1661 /gene="OXCT" /EC_number="2.8.3.5" /function="key enzyme of ketolysis catalyses the transfer of CoA from succinyl-CoA to 3-oxoacids such as acetoacetic acid" /codon_start=1 /product="succinyl CoA:3-oxoacid CoA transferase precursor" /db_xref="PID:g1519052" /translation="MAALKLLSSGLRLCASARGSGATWYKGCVCSFSTSAHRHTKFYT DPVEAVKDIPDGATVLVGGFGLCGIPENLIDALLKTGVKGLTAVSNNAGVDNFGLGLL LRSKQIKRMVSSYVGENAEFERQYLSGELEVELTPQGTLAERIRAGGAGVPAFYTPTG YGTLVQEGGSPIKYNKDGSVAIASKPREVREFNGQHFILEEAITGDFALVKAWKADRA GNVIFRKSARNFNLPMCKAAETTVVEVEEIVDIGAFAPEDIHIPQIYVHRLIKGEKYE KRIERLSIRKEGDGEAKSAKPGDDVRERIIKRAALEFEDGMYANLGIGIPLLASNFIS PNITVHLQSENGVLGLGPYPRQHEADADLINAGKETVTILPGASFFSSDESFAMIRGG HVDLTMLGAMQVSKYGDLANWMIPGKMVKGMGGAMDLVSSAKTKVVVTMEHSAKGNAH KIMEKCTLPLTGKQCVNRIITEKAVFDVDKKKGLTLIELWEGLTVDDVQKSTGCDFAV SPKLMPMQQIAN" mat_peptide 216..1658 /gene="OXCT" /evidence=experimental /product="succinyl CoA:3-oxoacid CoA transferase" misc_feature 1128..1130 /gene="OXCT" /note="encodes Glu-344 within the enzyme's active site" 3'UTR 1662..3320 polyA_signal 3300..3305 polyA_site 3321 BASE COUNT 994 a 620 c 770 g 953 t ORIGIN 1 gtcgagcctc tagcccgccc gggtttcctt cgcagtcgcg caccgacgct caaacgcgcg 61 ctccaacccg cagcctcctc ctgcctcacc gcccgaagat ggcggctctc aaactcctct 121 cctccgggct tcggctctgc gcctctgccc gcggatctgg ggcaacctgg tacaagggat 181 gtgtttgttc cttttccacc agtgctcatc gccataccaa gttttataca gatccagtag 241 aagctgtaaa agacatccct gatggtgcca cggttttggt tggtggtttt gggctatgtg 301 gaattccaga gaatcttata gatgctttac tgaaaactgg agtaaaagga ctaactgcag 361 tcagcaacaa tgcaggggtt gacaattttg gtttggggct tttgcttcgg tcgaagcaga 421 taaaacgcat ggtctcttca tatgtgggag aaaatgcaga atttgaacga cagtacttat 481 ctggtgaatt agaagtggag ctgacaccac agggcacact tgcagagagg atccgtgcag 541 gcggggctgg agttcctgca ttttacaccc caacagggta tgggaccctg gtacaagaag 601 gaggatcgcc catcaaatac aacaaagatg gcagtgttgc cattgccagt aagccaagag 661 aggtgaggga gttcaatggt cagcacttta ttttggagga agcaattaca ggggattttg 721 ctttggtgaa agcctggaag gcggaccgag caggaaacgt gattttcagg aaaagtgcaa 781 ggaatttcaa cttgccaatg tgcaaagctg cagaaaccac agtggtagag gttgaagaaa 841 ttgtggatat tggagcattt gctccagaag acatccatat tcctcagatt tatgtacatc 901 gccttataaa gggagaaaaa tatgagaaaa gaattgagcg tttatcaatc cggaaagagg 961 gagatgggga agccaaatct gctaaacctg gagatgacgt aagggaacga atcatcaaga 1021 gggccgctct tgagtttgag gatggcatgt atgctaattt gggcatagga atccctctcc 1081 tggccagcaa ttttatcagc ccaaatataa ctgttcatct tcaaagtgaa aatggagttc 1141 tgggtttggg tccatatcca cgacaacatg aagctgatgc agatctcatc aatgcaggca 1201 aggaaacagt tactattctt ccaggagcct cttttttctc cagcgatgaa tcatttgcaa 1261 tgattagagg tggacacgtc gatctgacaa tgctaggagc gatgcaggtt tccaaatatg 1321 gtgacctggc taactggatg atacctggga agatggtgaa aggaatggga ggtgctatgg 1381 atttagtgtc cagtgcgaaa accaaagtgg tggtcaccat ggagcattct gcaaagggaa 1441 atgcacataa aatcatggag aaatgtacat taccattgac tggaaagcaa tgtgtcaacc 1501 gcattattac tgaaaaggct gtgtttgatg tggacaagaa gaaagggttg actctgattg 1561 agctctggga aggcctgaca gtggatgacg tacaaaagag tactgggtgt gattttgcag 1621 tttcaccaaa actcatgcca atgcagcaga tcgcaaattg aaatatggat atttgtacca 1681 ggctgcgtgt ttttcatttt aaacacacaa gatttaattg aaaggacatc aataatcata 1741 attgtgtatt taacaggtgg ttttttatta gttttcttgt gtttcagact ttatgcagcc 1801 atataaactg ttctctaggc atgctgtgac attttaataa aaagcaaaag gagcatttat 1861 aattatctca tttgttaagg ctgagaaggt tgtttttata ataggtaatt atattgaatg 1921 cattttcact gaatatggta tgtatgctaa attatatgaa cctttcccca agaagggccc 1981 tagaaattga tgtggctttc ctcttaaata ttaattatta gtcctgaaag aaagataaca 2041 tatgtgattt ttgtggttag gagagttgct gtcatgattg ttttttcttc agcctcctct 2101 gacttttctt ttggggcttc agattttatg attacatctt gtccccctag aacatccccc 2161 ttcctcccat actgctttta aacagatgcc caagaaggca agcaggaatg cctcttgtgg 2221 gggagggcag ggagaaataa ctagttcaaa ccaactatct atctatgctt tgcaaagact 2281 aaggcgtatt ataggaagag ggctagaaac ctaactgatt cttctcagtt ttctcatttt 2341 aaaacagccc agtattcctt tgtatcctca agggtccttg agaatacttc tgttattgaa 2401 accctgtggg ctacttgtac tgtacctcct ctcaagccaa gaagggctgt gggataattt 2461 accatgaatc cttagtagca atgacagcag agttaaaaaa taaaaggtgt tttactttca 2521 ggctcttgtt ttggttcaga ggagatttta aatattgaat gacacttcta cagaacaacg 2581 gtttttcttc tgccaaggct acttccttta acgaagtgcc tttaattcag ccttatccaa 2641 ctagggaaaa taatgttgga caagtctagg atttgaagag tcagtgaact tttagtgtca 2701 gggaataaac atggtgggta gattaggttt gaaaaaaact tccttagagg tatttattct 2761 caatacctga caggggccca tgggaatgac ttcagaagca tcccggataa tagatgggta 2821 aaaagtctag gcaccctgaa gaacaggtga gacagctggc ctctggacag aggtaggcat 2881 agtacagtac gatatatcat tcctctggtc ctaaatatac aaacttattc atgtttttag 2941 gtgatgatgg tcattgaaac tcacttcttt tcaggtgtag ctacaattgt gtaatgtaca 3001 atattagaga aaggacaggc tttttatgag taacacacac catatataaa acagcctttc 3061 tggctgacca catggttaaa tgcatacctt cccagtactg gggggaaaat gacccttctt 3121 agaatgtgca agttccatag agtaatatat tgatatgatt ttgaaaagaa ttgttgatag 3181 ttacatcttc aaacttatca ttccagtatg catctttaag ataatgtgat tctaagtaga 3241 tgactttata ttcttgatta aagagtgcta tacatgttaa gaaatgcatt aaggaataca 3301 ataaatattc taaactgatg aaaaaaaaaa aaaaaaa // LOCUS HSU62966 2790 bp mRNA PRI 08-MAY-1997 DEFINITION Human Na+/nucleoside cotransporter (hCNT1a) mRNA, complete cds. ACCESSION U62966 NID g2072781 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2790) AUTHORS Ritzel,M.W., Yao,S.Y., Huang,M.Y., Elliott,J.F., Cass,C.E. and Young,J.D. TITLE Molecular cloning and functional expression of cDNAs encoding a human Na+-nucleoside cotransporter (hCNT1) JOURNAL Am. J. Physiol. 272 (2), C707-C714 (1997) MEDLINE 97215943 REFERENCE 2 (bases 1 to 2790) AUTHORS Ritzel,M.W.L., Yao,S.Y.M., Huang,M.-Y., Elliott,J.F., Cass,C.E. and Young,J.D. TITLE Direct Submission JOURNAL Submitted (04-JUL-1996) Physiology, University of Alberta, 7-25 Medical Sciences Building, U. of Alberta, Edmonton, AB T6G 2H7, Canada FEATURES Location/Qualifiers source 1..2790 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" gene 185..2137 /gene="hCNT1a" CDS 185..2137 /gene="hCNT1a" /codon_start=1 /product="Na+/nucleoside cotransporter" /db_xref="PID:g2072782" /translation="MENDPSRRRESISLTPVAKGLENMGADFLESLEGGQLPRSDLSP AEIRSSWSEAAPKPFSRWRNLQPALRARSFCREHMQLFRWIGTGLLCTGLSAFLLVAC LLDFQRALALFVLTCVVLTFLGHRLLKRLLGPKLRRFLVKPQGHPRLLLWFKRGLALA AFLGLVLWLSLDTSQRPEQLVSFAGICVFVALLFACSKHHCAVSWRAVSWGLGLQFVL GLLVIRTEPGFIAFEWLGEQIRIFLSYTKAGSSFVFGEALVKDVFAFQVLPIIVFFSC VISVLYHVGLMQWVILKIAWLMQVTMGTTATETLSVAGNIFVSQTEAPLLIRPYLADM TLSEVHVVMTGGYATIAGSLLGAYISFGIDATSLIAASVMAAPCALALSKLVYPEVEE SKFRREEGVKLTYGDAQNLIEAASTGAAISVKVVANIAANLIAFLAVLDFINAALSWL GDMVDIQGLSFQLICSYILRPVAFLMGVAWEDCPVVAELLGIKLFLNEFVAYQDLSKY KQRRLAGAEEWVGDRKQWISVRAEVLTTFALCGFANFSSIGIMLGGLTSMVPQRKSDF SQIVLRALFTGACVSLVNACMAGILYMPRGAEVDCMSLLNTTLSSSSFEIYQCCREAF QSVNPEFSPEALDNCCRFYNHTICAQ" BASE COUNT 557 a 810 c 752 g 671 t ORIGIN 1 ctggctgtgc tgttcatctc ctagatgaat gggatggtct acattcatcc atttggattt 61 ggccaaagac accaacaccc ctttctccct ctacataagc tgcactgcat ggttgctgct 121 ggatgtgttg tgttcctggc ttccctctgg atgctgacag aaacaaggct ggaaggtctg 181 ggacatggag aacgacccct cgagacgaag agagtccatc tctctcacac ctgtggccaa 241 gggtctggag aacatggggg ctgatttctt ggaaagcctg gagggaggcc agctccctag 301 gagtgacttg agccccgcag agatcaggag cagctggagc gaggcggcgc cgaagccctt 361 ctccagatgg aggaacctgc agccagccct gagagccaga agcttctgca gggagcacat 421 gcagctgttt cgatggatcg gcacaggcct gctctgcact gggctctctg ccttcctgct 481 ggtggcctgc ctcctggatt tccagagggc cctggctctg tttgtcctca cctgtgtggt 541 cctcaccttc ctgggccacc gcctgctgaa acggcttctg gggccaaagc tgaggaggtt 601 tcttgtcaag cctcagggcc atccccgcct gctgctctgg tttaagaggg gtctagctct 661 tgctgctttc ctgggcctgg tcctgtggct gtctctggac acctcccagc ggcctgagca 721 actggtgtcc ttcgcaggaa tctgcgtgtt cgtcgctctc ctctttgcct gctcaaagca 781 tcattgcgca gtgtcctgga gggccgtgtc ttggggactt ggactgcagt ttgtacttgg 841 actcctcgtc atcagaacag aaccaggatt cattgcgttc gagtggctgg gcgagcagat 901 ccggatcttc ctgagctaca cgaaggctgg ctccagcttc gtgtttgggg aggcgctggt 961 caaggatgtc tttgcctttc aggttctgcc catcattgtc tttttcagct gtgtcatatc 1021 cgttctctac cacgtgggcc tcatgcagtg ggtgatcctg aagattgcct ggctgatgca 1081 agtcaccatg ggcaccacag ccactgagac cctgagtgtg gctggaaaca tctttgtgag 1141 ccagaccgag gctccattac tgatccggcc ctacttggca gacatgacac tctctgaagt 1201 ccacgttgtc atgaccggag gttacgccac cattgctggc agcctgctgg gtgcctacat 1261 ctcctttggg atcgatgcca cctcgttgat tgcagcctct gtgatggctg ccccttgtgc 1321 cttggccctc tccaaactgg tctacccgga ggtggaggag tccaagttta ggagggagga 1381 aggagtgaaa ctgacctatg gagatgctca gaacctcata gaagcagcca gcactggggc 1441 cgccatctcc gtgaaggtgg tcgccaacat cgctgccaac ctgattgcgt tcctggctgt 1501 gctggacttt atcaatgctg ccctctcctg gctgggagac atggtggaca tccaggggct 1561 cagcttccag ctcatctgct cctacatcct gcggcctgta gccttcttga tgggtgtggc 1621 gtgggaggac tgcccagtgg tagctgagct gctggggatc aagctgtttc tgaacgagtt 1681 tgtggcctat caagacctct ccaagtacaa gcaacgccgc ctggcagggg ccgaggagtg 1741 ggtcggcgac aggaagcagt ggatctccgt cagagctgaa gtcctcacga cgtttgccct 1801 ctgtggattt gccaatttca gctccattgg gatcatgctg ggaggcttga cctccatggt 1861 cccccaacgg aagagcgact tctcccagat agtgctccgg gcgctcttca cgggagcctg 1921 tgtgtccctg gtgaacgcct gtatggcagg gatcctctac atgcccaggg gggctgaagt 1981 tgactgcatg tccctcttga acacgaccct cagcagcagc agctttgaga tttaccagtg 2041 ctgccgtgag gccttccaga gcgtcaatcc agagttcagc ccagaggccc tggacaactg 2101 ctgtcggttt tacaaccaca cgatctgtgc acagtgagga cagaacatgc ttgtgcttct 2161 gcgcttctga gggctgttct cccccgggaa ccatctgtcc ccaccttccc tttcccagag 2221 ccctcttcag ggaagccaca ggacttagac ccagctcaat cccacaattg ggaaggggtc 2281 atggagtgag tgtgcagaga gtgagtgagg acataaggaa ggacatgtcc cactccatcc 2341 cccttcctgc tcccccattt cctaactccc ccagtgtgaa ttctcagggt cacttctgcc 2401 tcctcccgtt tcccctccac atccaaacag caccctggtc ctctctatcc cccctctcct 2461 ggggtccctc acatgcccct tcccttctgt tgtgggctgc acaccaaagc ctcctcccct 2521 ccccacttcc taggcactag gatctctctg tggcttcccc tgctgggtgg tgtcacctct 2581 ttctctgctt tcagagaaac ccttcccgcc tttcctcaga gtgcttccca aactgaggtc 2641 ccatggcaca ctgtcctggg aggcgttcag agggttccat gatggactag gtttggaacc 2701 actgggttaa ataaacttag agagggctgt ttaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2761 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa // LOCUS HSU63289 2113 bp mRNA PRI 12-DEC-1996 DEFINITION Human RNA-binding protein CUG-BP/hNab50 (NAB50) mRNA, complete cds. ACCESSION U63289 NID g1518801 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2113) AUTHORS Timchenko,L.T., Miller,J.W., Timchenko,N.A., DeVore,D.R., Datar,K.V., Lin,L., Roberts,R., Caskey,C.T. and Swanson,M.S. TITLE Identification of a (CUG)n triplet repeat RNA-binding protein and its expression in myotonic dystrophy JOURNAL Nucleic Acids Res. 24 (22), 4407-4414 (1996) MEDLINE 97105883 REFERENCE 2 (bases 1 to 2113) AUTHORS Timchenko,L.T., Miller,J.W., Timchenko,N.A., DeVore,D.R., Datar,K.V., Lin,L., Roberts,R., Caskey,C.T. and Swanson,M.S. TITLE Direct Submission JOURNAL Submitted (08-JUL-1996) Molecular Genetics and Microbiology, University of Florida, 1600 SW Archer Road, Gainesville, FL 32610-0266, USA FEATURES Location/Qualifiers source 1..2113 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="osteosarcoma cell line" gene 138..1586 /gene="NAB50" CDS 138..1586 /gene="NAB50" /function="binds to (CUG)n triplet repeats" /note="RNA-binding protein" /codon_start=1 /product="CUG-BP/hNab50" /db_xref="PID:g1518802" /translation="MNGTLDHPDQPDLDAIKMFVGQVPRTWSEKDLRELFEQYGAVYE INVLRDRSQNPPQSKGCCFVTFYTRKAALEAQNALHNMKVLPGMHHPIQMKPADSEKN NAVEDRKLFIGMISKKCTENDIRVMFSSFGQIEECRILRGPDGLSRGCAFVTFTTRAM AQTAIKAMHQAQTMEGCSSPMVVKFADTQKDKEQKRMAQQLQQQMQQISAASVWGNLA GLNTLGPQYLALLQQTASSGNLNTLSSLHPMGGLNAMQLQNLAALAAAASAAQNTPSG TNALTTSSSPLSVLTSSGSSPSSSSSNSVNPIASLGALQTLAGATAGLNVGSLAGMAA LNGGLGSSGLSNGTGSTMEALTQAYSGIQQYAAAALPTLYNQNLLTQQSIGAAGSQKE GPEGANLFIYHLPQEFGDQDLLQMFMPFGNVVSAKVFIDKQTNLSKCFGFVSYDNPVS AQAAIQSMNGFQIGMKRLKVQLKRSKNDSKPY" BASE COUNT 523 a 518 c 523 g 549 t ORIGIN 1 gggcagcggc agcggcggcg ggacgcggag gctcccccgg gattcggcct cagcagcgag 61 gcggcggcgg cggctgcgga ggcgcaggca gcaactgagg cagcggcagg ctcaggtgca 121 gccgctgctc aaagaaaatg aacggcaccc tggaccaccc agaccaacca gatcttgatg 181 ctatcaagat gtttgtgggc caggttccaa ggacctggtc tgaaaaggac ttgcgggaac 241 tcttcgaaca gtatggtgct gtgtatgaaa tcaacgtcct aagggatagg agccaaaacc 301 cgcctcagag caaagggtgc tgttttgtta cattttacac ccgtaaagct gcattagaag 361 ctcagaatgc tcttcacaac atgaaagtcc tcccagggat gcatcaccct atacagatga 421 aacctgctga cagtgagaag aacaatgcag tggaagacag gaagctgttt attggtatga 481 tttccaagaa gtgcactgaa aatgacatcc gagtcatgtt ctcttcgttt ggacagattg 541 aagaatgccg gatattgcgg ggacctgatg gcctgagccg aggttgtgca tttgtgactt 601 ttacaacaag agccatggca cagacggcta tcaaggcaat gcaccaagca cagaccatgg 661 agggttgctc atcacccatg gtggtaaaat ttgctgatac acagaaggac aaagaacaga 721 agagaatggc ccagcagctc cagcagcaga tgcagcaaat cagcgcagca tctgtgtggg 781 gaaaccttgc tggtctaaat actcttggac cccagtattt agcactcctt cagcagactg 841 cctcctctgg gaacctcaac accctgagca gcctccaccc aatgggaggg ttgaatgcaa 901 tgcagttaca gaatttggct gcactagctg ctgcagctag tgcagctcag aacacaccaa 961 gtggtaccaa tgctctcact acatccagca gtcccctcag cgtgctcact agttcagggt 1021 cctcacctag ctctagcagc agtaattctg tcaaccccat agcctcactt ggagccctgc 1081 agacattagc tggagcaacg gctggcctca atgttggctc tttggcagga atggctgctt 1141 taaatggtgg cctgggcagc agtggccttt ccaatggcac cgggagcacc atggaggccc 1201 tcactcaggc ctactcgggt atccagcaat atgctgctgc tgcgctcccc actctgtaca 1261 accagaatct tctgacacag cagagtattg gtgctgctgg aagccagaag gaaggtccag 1321 agggagccaa cctgttcatc taccacctgc cccaggagtt tggtgatcag gacctgctgc 1381 agatgtttat gccctttggg aatgtcgtgt ctgccaaggt tttcatagac aagcagacaa 1441 acctgagcaa gtgttttggt tttgtaagtt acgacaatcc tgtttcggcc caagctgcca 1501 tccagtccat gaacggcttt cagattggca tgaagcggct taaagtgcag ctcaaacgtt 1561 cgaagaatga cagcaagccc tactgagcgt gctcccctct gagactggag tgagagggtc 1621 ttctgattcc tgccgtttgt tcatcgttgt gcctaaagca tgtcgatgtg gcgtcaagta 1681 catcgtccaa atccctgtct cttcagcttc tctgatgctt gaactctcac ctttgacctt 1741 gtgttgacct ttgatgctga tgtgtatttt tattatgttt gtttctttct tcgttttttt 1801 ttcttttttt ctttcctttt tttttccttt tgtgctgcca aattggtttt gctagaacga 1861 ctgctgaagg ggaaatattt aaacttgcat ttgaatataa aaaaaatcta tttttctaga 1921 acttcataag ataaccactt gattttgtga ttccaattct ttgtaattgt cttcagagca 1981 gccctactag cacataccgc gtggtgtttg tatttctgtg aacacacagc cagtccgttt 2041 ctaggctttg tttctctgtg tgcttagttt taaagacaac tttgaagtaa acaatgaaat 2101 aaaagatgtc act // LOCUS HSU63295 1884 bp mRNA PRI 08-OCT-1996 DEFINITION Human seven in absentia homolog mRNA, complete cds. ACCESSION U63295 NID g1508828 KEYWORDS tumor suppression; apoptosis; development. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1884) AUTHORS Nemani,M., Linares-Cruz,G., Bruzzoni-Giovanelli,H., Roperch,J.P., Tuynder,M., Bougueleret,L., Cherif,D., Medhioub,M., Pasturaud,P., Alvaro,V., Der Sarkissan,H., Cazes,L., Le Paslier,D., Le Gall,I., Israeli,D., Dausset,J., Sigaux,F., Chumakov,I., Oren,M., Calvo,F., Amson,R.B., Cohen,D. and Telerman,A. TITLE Activation of the human homologue of the Drosophila sina gene in apoptosis and tumor suppression JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (17), 9039-9042 (1996) MEDLINE 96392362 REFERENCE 2 (bases 1 to 1884) AUTHORS Nemani,M., Linares-Cruz,G., Bruzzoni-Giovanelli,H., Roperch,J.P., Tuynder,M., Bougueleret,L., Cherif,D., Medhioub,M., Pasturaud,P., Alvaro,V., Der Sarkissan,H., Cazes,L., Le Paslier,D., Le Gall,I., Israeli,D., Dausset,J., Sigaux,F., Chumakov,I., Oren,M., Calvo,F., Amson,R.B., Cohen,D. and Telerman,A. TITLE Direct Submission JOURNAL Submitted (09-JUL-1996) Cancer Research Program, Fondation Jean Dausset-CEPH, 27, Rue Juliette Dodu, Paris 75010, France FEATURES Location/Qualifiers source 1..1884 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="intestinal epithelium" /cell_type="U937; K562" /chromosome="16" /map="16q12-13" CDS 1..849 /codon_start=1 /product="seven in absentia homolog" /db_xref="PID:g1508829" /translation="MSRQTATALPTGTSKCPPSQRVPALTGTTASNNDLASLFECPVC FDYVLPPILQCQSGHLVCSNCRPKLTCCPTCRGPLGSIRNLAMEKVANSVLFPCKYAS SGCEITLPHTEKADHEELCEFRPYSCPCPGASCKWQGSLDAVMPHLMHQHKSITTLQG EDIVFLATDINLPGAVDWVMMQSCFGFHFMLVLEKQEKYDGHQQFFAIVQLIGTRKQA ENFAYRLELNGHRRRLTWEATPRSIHEGIATAIMNSDCLVFDPALHSFLQTNGNLGIN VTISMC" BASE COUNT 524 a 370 c 372 g 618 t ORIGIN 1 atgagccgtc agactgctac agcattacct accggtacct cgaagtgtcc accatcccag 61 agggtgcctg ccctgactgg cacaactgca tccaacaatg acttggcgag tctttttgag 121 tgtccagtct gctttgacta tgtgttaccg cccattcttc aatgtcagag tggccatctt 181 gtttgtagca actgtcgccc aaagctcaca tgttgtccaa cttgccgggg ccctttggga 241 tccattcgca acttggctat ggagaaagtg gctaattcag tacttttccc ctgtaaatat 301 gcgtcttctg gatgtgaaat aactctgcca cacacagaaa aagcagacca tgaagagctc 361 tgtgagttta ggccttattc ctgtccgtgc cctggtgctt cctgtaaatg gcaaggctct 421 ctggatgctg taatgcccca tctgatgcat cagcataagt ccattacaac cctacaggga 481 gaggatatag tttttcttgc tacagacatt aatcttcctg gtgctgttga ctgggtgatg 541 atgcagtcct gttttggctt tcacttcatg ttagtcttag agaaacagga aaaatacgat 601 ggtcaccagc agttcttcgc aatcgtacag ctgataggaa cacgcaagca agctgaaaat 661 tttgcttacc gacttgagct aaatggtcat aggcgacgat tgacttggga agcgactcct 721 cgatctattc atgaaggaat tgcaacagcc attatgaata gcgactgtct agtctttgac 781 ccagcattgc acagcttttt gcagacaaat ggcaatttag gcatcaatgt aactatttcc 841 atgtgttgaa atggcaatca aacattttct ggccagtgtt taaaacttca gtttcacaga 901 aaataaggca cccatctgtc tgccaaccta aaactctttc ggtaggtaga agctcgacat 961 gaaggccaat aaaaagaaag actgctaaat acaggaaaca gttccatgta gtaacactaa 1021 tatatttaaa aataagtcaa cagtaaacca ctgaaaaaat atatgtatat acacccaaga 1081 tgggcatctt ttgtattaag aaaggaagca ttgtaaaata attctgagtt ttgtgtttgt 1141 tgtagattga ttgtattgtt gaaaaagttt gtttttgcgt gggagtgtgt gcctgcgtgg 1201 gtgtgtgcgt gtttgggttt ttttccttta actgacaagc catcttgagt ggtcatgggc 1261 cactgctttt ccctttgtga gtcaatacat agtgctgctg taagccgttt ttgtgtgtat 1321 ttgctaattt ttattaattt tagtttttca ttaaataaat ttgacttttc tgtaattcag 1381 gtttttcctt tttttgtacc attttaaagt tagtatcttt tgatatggca tatttgttta 1441 tggtaaaaaa tttataacgg gttcaatatt ttcttttccc ccattaatca agtccattgg 1501 aaatatttta aaaccagcct attttggtga acccatgagt tcccagaaag taaaggtgac 1561 acccggaaaa ataatccaaa agcctattta aagccaccta taaggtgccc ccctttcctg 1621 tcttcctaca gatgagtcac acctttgagc cttaaccttt gaaaggttag agaataaatt 1681 gatttttata aatactgcaa atccaggctt ttgtttcctt tttccagata tccttggaca 1741 aatcacatat tttaaaattt gttcttgtat ttattggttt tgcagaagaa ggcatcgtca 1801 tgcacagtat ttgtaattaa aagcaaattc atttgtttaa aaaggcagtt tgcaaaaaat 1861 gtttttggtc ttttataatt ctca // LOCUS HSU63329 1869 bp DNA PRI 28-JUL-1996 DEFINITION Human mutY homolog (hMYH) gene, complete cds. ACCESSION U63329 NID g1458227 KEYWORDS mutY; micA. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1869) AUTHORS Slupska,M.M., Baikalov,C., Luther,W.M., Chiang,J.H., Wei,Y.F. and Miller,J.H. TITLE Cloning and sequencing a human homolog (hMYH) of the Escherichia coli mutY gene whose function is required for the repair of oxidative DNA damage JOURNAL J. Bacteriol. 178 (13), 3885-3892 (1996) MEDLINE 96272264 REFERENCE 2 (bases 1 to 1869) AUTHORS Slupska,M.M., Baikalov,C., Luther,W.M., Chiang,J-H., Wei,Y-F. and Miller,J.H. TITLE Direct Submission JOURNAL Submitted (09-JUL-1996) Microbiology and Molecular Genetics, UCLA, 405 Hilgard Ave, Los Angeles, CA 90025, USA FEATURES Location/Qualifiers source 1..1869 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="between 1p32.1 and 1p34.3" 5'UTR 1..182 gene 183..1790 /gene="hMYH" CDS 183..1790 /gene="hMYH" /codon_start=1 /product="mutY homolog" /db_xref="PID:g1458228" /translation="MTPLVSRLSRLWAIMRKPRAAVGSGHRKQAASQEGRQKHAKNNS QAKPSACDGLARQPEEVVLQASVSSYHLFRDVAEVTAFRGSLLSWYDQEKRDLPWRRR AEDEMDLDRRAYAVWVSEVMLQQTQVATVINYYTGWMQKWPTLQDLASASLEEVNQLW AGLGYYSRGRRLQEGARKVVEELGGHMPRTAETLQQLLPGVGRYTAGAIASIAFGQAT GVVDGNVARVLCRVRAIGADPSSTLVSQQLWGLAQQLVDPARPGDFNQAAMELGATVC TPQRPLCSQCPVESLCRARQRVEQEQLLASGSLSGSPDVEECAPNTGQCHLCLPPSEP WDQTLGVVNFPRKASRKPPREESSATCVLEQPGALGAQILLVQRPNSGLLAGLWEFPS VTWEPSEQLQRKALLQELQRWAGPLPATHLRHLGEVVHTFSHIKLTYQVYGLALEGQT PVTTVPPGARWLTQEEFHTAAVSTAMKKVFRVYQGQQPGTCMGSKRSQVSSPCSRKKP RMGQQVLDNFFRSHISTDAHSLNSAAQ" allele replace(377,"t") /gene="hMYH" allele replace(740,"t") /gene="hMYH" allele replace(1154,"c") /gene="hMYH" polyA_site 1853 BASE COUNT 401 a 550 c 570 g 348 t ORIGIN 1 tctcctcgtg gctagttcag gcggaaggag cagtcctctg aagcttgagg agcctctaga 61 actatgagcc cgaggccttc ccctctccca gagcgcagag gctttgaagg ctacctctgg 121 gaagccgctc accgtcggaa gctgcgggag ctgaaactgc gccatcgtca ctgtcggcgg 181 ccatgacacc gctcgtctcc cgcctgagtc gtctgtgggc catcatgagg aagccacgag 241 cagccgtggg aagtggtcac aggaagcagg cagccagcca ggaagggagg cagaagcatg 301 ctaagaacaa cagtcaggcc aagccttctg cctgtgatgg cctggccagg cagccggaag 361 aggtggtatt gcaggcctct gtctcctcat accatctatt cagagacgta gctgaagtca 421 cagccttccg agggagcctg ctaagctggt acgaccaaga gaaacgggac ctaccatgga 481 gaagacgggc agaagatgag atggacctgg acaggcgggc atatgctgtg tgggtctcag 541 aggtcatgct gcagcagacc caggttgcca ctgtgatcaa ctactatacc ggatggatgc 601 agaagtggcc tacactgcag gacctggcca gtgcttccct ggaggaggtg aatcaactct 661 gggctggcct gggctactat tctcgtggcc ggcggctgca ggagggagct cggaaggtgg 721 tagaggagct agggggccac atgccacgta cagcagagac cctgcagcag ctcctgcctg 781 gcgtggggcg ctacacagct ggggccattg cctctatcgc ctttggccag gcaaccggtg 841 tggtggatgg caacgtagca cgggtgctgt gccgtgtccg agccattggt gctgatccca 901 gcagcaccct tgtttcccag cagctctggg gtctagccca gcagctggtg gacccagccc 961 ggccaggaga tttcaaccaa gcagccatgg agctaggggc cacagtgtgt accccacagc 1021 gcccactgtg cagccagtgc cctgtggaga gcctgtgccg ggcacgccag agagtggagc 1081 aggaacagct cttagcctca gggagcctgt cgggcagtcc tgacgtggag gagtgtgctc 1141 ccaacactgg acagtgccac ctgtgcctgc ctccctcgga gccctgggac cagaccctgg 1201 gagtggtcaa cttccccaga aaggccagcc gcaagccccc cagggaggag agctctgcca 1261 cctgtgttct ggaacagcct ggggcccttg gggcccaaat tctgctggtg cagaggccca 1321 actcaggtct gctggcagga ctgtgggagt tcccgtccgt gacctgggag ccctcagagc 1381 agcttcagcg caaggccctg ctgcaggaac tacagcgttg ggctgggccc ctcccagcca 1441 cgcacctccg gcaccttggg gaggttgtcc acaccttctc tcacatcaag ctgacatatc 1501 aagtatatgg gctggccttg gaagggcaga ccccagtgac caccgtacca ccaggtgctc 1561 gctggctgac gcaggaggaa tttcacaccg cagctgtttc caccgccatg aaaaaggttt 1621 tccgtgtgta tcagggccaa cagccaggga cctgtatggg ttccaaaagg tcccaggtgt 1681 cctctccgtg cagtcggaaa aagccccgca tgggccagca agtcctggat aatttctttc 1741 ggtctcacat ctccactgat gcacacagcc tcaacagtgc agcccagtga cacctctgaa 1801 agcccccatt ccctgagaat cctgttgtta gtaaagtgct tatttttgta gttaaaaaaa 1861 aaaaaaaaa // LOCUS HSU63336 2261 bp mRNA PRI 13-JAN-1997 DEFINITION Human MHC Class I region proline rich protein mRNA, complete cds. ACCESSION U63336 NID g1773023 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2261) AUTHORS Wei,H. and Weissman,S.M. TITLE Human Proline Rich Sequence (CAT 56) from the MHC Class I Region JOURNAL Unpublished REFERENCE 2 (bases 1 to 2261) AUTHORS Wei,H. and Weissman,S.M. TITLE Direct Submission JOURNAL Submitted (10-JUL-1996) Genetics, Yale University School of Medicine, 295 Congress Ave, New Haven, CT 06510, USA FEATURES Location/Qualifiers source 1..2261 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6p21.3" CDS 265..1026 /codon_start=1 /product="MHC Class I region proline rich protein" /db_xref="PID:g1773024" /translation="MPGRTEGGSWLRNAAAADVLRNLPEVEWREPQHCCPPTPRKRKQ NPRVPLPHYPPNPAAAIAADTMPKRKKQNHHQPPTQQQPPLPEREETGDEEDGSPIGP PSLLGPPPMANGKPGDPKSALHRGPPGSRGPLIPPLLSLPPPPWGRGPIRRGLGPRSS PYGRGWWGVNAEPPFPGPGHGGPTRGSFHKEQRNPRRLKSWSLIKNTCPPKDDPQVME DKSDRPVCRHFAKKGHCRYEDLCAFYHPGVNGPPL" BASE COUNT 488 a 633 c 549 g 591 t ORIGIN 1 aacccaagcg ggacaaggac ttttgggggg aggtcaaagg gcacgaagtt gtgcctacag 61 ctgttaccat agtaaccgag gaccggatgt ggcgatcttg gcggtgcgac agtcctcttc 121 tcaggccctc tggcccgaga gcctgttgac tctgtgacac actctgagga gctggttgtg 181 gtgttttcca gcgagggaag aaaagagtaa ttttttcaaa gcatttatag aaacgcagca 241 aagggaaggt gtgaggttgc cgccatgcct ggcagaacgg agggaggcag ttggctccgg 301 aatgcggccg ccgcagatgt tctccgcaac cttccggaag tggaatggcg ggagcctcag 361 cattgctgcc caccgacccc ccggaagcgg aaacagaatc cccgcgtgcc ccttcctcac 421 taccctccaa atcccgctgc agccattgcc gcagacacga tgccgaaacg aaagaagcag 481 aatcatcacc agccaccgac acagcagcag cccccgctgc ccgagcggga agagactgga 541 gatgaggagg atgggagtcc catcggacca cccagccttc tgggccctcc ccccatggcc 601 aatggaaaac ctggcgaccc taagtcagct cttcacagag gtcctccagg atcaagggga 661 ccactgattc caccactgct gagtctccca cctcctcctt ggggtagagg cccaattcgg 721 agagggcttg gccccaggtc tagcccatat ggtcgtggtt ggtggggagt caatgcagaa 781 cctccttttc cggggccagg ccatgggggt cccaccaggg gaagctttca caaggaacag 841 agaaaccctc gaaggctcaa aagctggtct cttatcaaga atacctgccc gcccaaggat 901 gacccccagg ttatggaaga caaatccgac cgccctgtct gccgacattt tgccaaaaag 961 ggccactgtc gatatgagga cctctgtgcc ttctaccatc caggcgtcaa tggacctcct 1021 ctgtgagact gtgccttccc atccaggctg gaaggagctc tctgtgacct agcggccatt 1081 tatttctctg tagccctatg atggctactg tgaggctctt ctaacaccct cagtcagtga 1141 cacacccatc ccatccacca cttcccccgt gtggggtcca gagtggtgtt gcatcactgg 1201 tgcgcggcat acgcgctttc ttctgatcca gcctgtagag actcgccttt gggacccatc 1261 tttgcttcct ttcagttgcc tcctggatct tctttcccgt catcaaatga ctgctgaaca 1321 ggaaacctct ttggtgctgt ttcttgtgca tctgtccacc tgttccccag tattgccctc 1381 aattcctgag agccctggag cggtttccta ccattccctt cttttagctg cttgttttaa 1441 gtccttttta tgtgacattc cctaccccca atgttgtcag ctgcttgtga aactcagcca 1501 ggttgtctaa cctggggtca agtttgggtg actggtgcag agttacttcc taaaaggcca 1561 ctctccctgc ctttggattt catagtttct ctgtcagtag catgatcccc accgctatgg 1621 tctatctatg atcaccgtgc tttgtgaaac tgtgcatccc cttgtagcct ttctcagtgt 1681 ccgtggcatt tttgtgactt cccagcacta gaataagttt tcctgccaaa atgagtgagg 1741 cgcttggtgc cctctggact ttcccacttc ccaacatggg agaattgtga actttccatc 1801 agactgcctc cctggccctc cccattcttc tcctgttggt tattctgagt ctgacacaga 1861 cccatgacat gtcttataaa gcctccaatg gctttatcct acctagatcc cttccagccc 1921 attttaatta gactatgtca ttgtgaggcc accagtccat tcatttgaat tctgtgaatc 1981 tccaccttgc ctatctttgg gtagaagctg gacagtactg ttgccctctt ccaatcctct 2041 tcccctacat ccctggcact ggttgttttc tgtgaaaaca gcagtgaaca ggttcagttt 2101 tgaactggcc ctgaggaaat gggtcaggag ttgtattggc aagagggagg ggtgagagct 2161 gttggagaac tgagaatgag gttttttttt tttttttctt tttaactttt tttatattag 2221 taataaatgc agtggaaacc agcattttat ttaaaaaaaa a // LOCUS HSU63421 4851 bp mRNA PRI 27-NOV-1996 DEFINITION Human sulfonylurea receptor (SUR1) mRNA, complete cds. ACCESSION U63421 NID g1480870 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4851) AUTHORS Thomas,P.M., Wohllk,N., Huang,E., Kuhnle,U., Rabl,W., Gagel,R.F. and Cote,G.J. TITLE Inactivation of the first nucleotide-binding fold of the sulfonylurea receptor, and familial persistent hyperinsulinemic hypoglycemia of infancy JOURNAL Am. J. Hum. Genet. 59 (3), 510-518 (1996) MEDLINE 96354544 REFERENCE 2 (bases 1 to 4851) AUTHORS Thomas,P.T., Wohllk,N., Huang,E., Gagel,R.F. and Cote,G.J. TITLE Direct Submission JOURNAL Submitted (10-JUL-1996) Endocrinology, M.D. Anderson Cancer Center, 1515 Holcombe Blvd - Box 15, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..4851 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11p15.1" /tissue_type="brain" /cell_type="SNB19 glioblastoma cell" gene 1..4746 /gene="SUR1" CDS 1..4746 /gene="SUR1" /note="K+ ion channel membrane protein" /codon_start=1 /product="sulfonylurea receptor" /db_xref="PID:g1480871" /translation="MPLAFCGSENHSAAYRVDQGVLNNGCFVDALNVVPHVFLLFITF PILFIGWGSQSSKVHIHHSTWLHFPGHNLRWILTFMLLFVLVCEIAEGILSDGVTESH HLHLYMPAGMAFMAAVTSVVYYHNIETSNFPKLLIALLVYWTLAFITKTIKFVKLLDH AIGFSQLRFCLTGLLVILYGMLLLVEVNVIRVRRYIFFKTPREVKPPEDLQDLGVRFL QPFVNLPSKGTYWWMNAFIKTAHKKPIDLRAIGKLPIVMRALTNYQRLCEAFDAQVRK DIQGTQGARAIWQALSHAFGRRLVLSSTFRILADLLGFAGPLCIFGIVDHLGKENDVF QPKTQFLGVYFVSSQEFLANAYVLAVLLFLALLLQRTFLQASYYVAIETGINLRGAIQ TKIYNKIMHLSTSNLSMGEMTAGQICNLVAIDTNQLMWFFFLCPNLWAMPVQIIVGVI LLYYILGVSALIGAAVIILLAPVQYFVATKLSQAQRSTLEYSNERLKQTNEMLRGIKL LKLYAWENIFRTRVETTRRKEMTSLRAFAIYTSISIFMNTAIPIAAVLITFVGHVSFF KEADFSPSVAFASLSLFHILVTPLFLLSSVVRSTVKALVSVQKLSEFLSSAEIREEQC APHEPTPQGPASKYQAVPLRVVNRKRPAREDCRGLTGPLQSLVPSADGDADNCCVQIM GGYFTWTPDGIPTLSNITIRIPRGQLTMIVGQVGCGKSSLLLAALGEMQKVSGAVFWS SLPDSEIGEDPSPERETATDLDIRKRGPVAYASQKPWLLNATVEENIIFESPFNKQRY KMVIEACSLQPDIDILPHGDQTQIGERGINLSGGQRQRISVARALYQHANVVFLDDPF SALDIHLSDHLMQAGILELLRDDKRTVVLVTHKLQYLPHADWIIAMKDGTIQREGTLK DFQRSECQLFEHWKTLMNRQDQELEKETVTERKATEPPQGLSRAMSSRDGLLQDEEEE EEEAAESEEDDNLSSMLHQRAEIPWRACAKYLSSAGILLLSLLVFSQLLKHMVLVAID YWLAKWTDSALTLTPAARNCSLSQECTLDQTVYAMVFTVLCSLGIVLCLVTSVTVEWT GLKVAKRLHRSLLNRIILAPMRFFETTPLGSILNRFSSDCNTIDQHIPSTLECLSRST LLCVSALAVISYVTPVFLVALLPLAVVCYFIQKYFRVASRDLQQLDDTTQLPLLSHFA ETVEGLTTIRAFRYEARFQQKLLEYTDSNNIASLFLTAANRWLEVRMEYIGACVVLIA AVTSISNSLHRELSAGLVGLGLTYALMVSNYLNWMVRNLADMELQLGAVKRIHGLLKT EAESYEGLLAPSLIPKNWPDQGKIQIQNLSVRYDSSLKPVLKHVNALISPGQKIGICG RTGSGKSSFSLAFFRMVDTFEGHIIIDGIDIRKLPLHTLPSRLSIILQDPVLFSGTIR FNLDPERKCSDSTLWEALEIAQLKLVVKALPGGLDAIITEGGENFSQGQRQLFCLARA FVRKTSIFIMDEATASIDMATENILQKVVMTAFADRTVVTIAHRVHTILSADLVIVLK RGAILEFDKPEKLLSRKDSVFASFVRADK" BASE COUNT 1033 a 1507 c 1277 g 1034 t ORIGIN 1 atgcccctgg ccttctgcgg cagcgagaac cactcggccg cctaccgggt ggaccagggg 61 gtcctcaaca acggctgctt tgtggacgcg ctcaacgtgg tgccgcacgt cttcctactc 121 ttcatcacct tccccatcct cttcattgga tggggaagtc agagctccaa ggtgcacatc 181 caccacagca catggcttca tttccccggg cacaacctgc ggtggatcct gaccttcatg 241 ctgctcttcg tcctggtgtg tgagattgca gagggcatcc tgtctgatgg ggtgaccgaa 301 tcccaccatc tgcacctgta catgccagcc gggatggcgt tcatggctgc tgtcacctcc 361 gtggtctact atcacaacat cgagacttcc aacttcccca agctgctaat tgccctgctg 421 gtgtattgga ccctggcctt catcaccaag accatcaagt ttgtcaagct cttggaccac 481 gccatcggct tctcgcagct acgcttctgc ctcacagggc tgctggtgat cctctatggg 541 atgctgctcc tcgtggaggt caatgtcatc agggtgagga gatacatctt cttcaagaca 601 ccgagggagg tgaagcctcc cgaggacctg caagacctgg gggtacgctt cctgcagccc 661 ttcgtgaatc tgccgtccaa aggcacctac tggtggatga acgccttcat caagactgcc 721 cacaagaagc ccatcgactt gcgagccatc gggaagctgc ccatcgttat gagggccctc 781 accaactacc aacggctctg cgaggccttt gacgcccagg tgcggaagga cattcagggc 841 actcaaggtg cccgggccat ctggcaggca ctcagccatg ccttcgggag gcgcctggtc 901 ctcagcagca ctttccgcat cttggccgac ctgctgggct tcgccgggcc actgtgcatc 961 tttgggatcg tggaccacct tgggaaggag aacgacgtct tccagcccaa gacacaattt 1021 ctcggggttt actttgtctc atcccaagag ttccttgcca atgcctacgt cttagctgtg 1081 cttctgttcc ttgccctcct actgcaaagg acatttctgc aagcatccta ctatgtggcc 1141 attgaaactg gaattaactt gagaggagca atacagacca agatttacaa taaaattatg 1201 cacctgtcca cctccaacct gtccatggga gaaatgactg ctggacagat ctgtaatctg 1261 gttgccatcg acaccaatca gctcatgtgg tttttcttct tgtgcccaaa cctctgggct 1321 atgccagtac agatcattgt gggtgtgatt ctcctctact acatactcgg agtcagtgcc 1381 ttaattggag cagctgtcat cattctactg gctcctgtcc agtacttcgt ggccaccaag 1441 ctgtctcagg cccagcggag cacactggag tattccaatg agcggctgaa gcagaccaac 1501 gagatgctcc gcggcatcaa gctgctgaag ctgtacgcct gggagaacat cttccgcacg 1561 cgggtggaga cgacccgcag gaaggagatg accagcctca gggcctttgc catctatacc 1621 tccatctcca ttttcatgaa cacggccatc cccattgcag ctgtcctcat aactttcgtg 1681 ggccatgtca gcttcttcaa agaggccgac ttctcgccct ccgtggcctt tgcctccctc 1741 tccctcttcc atatcttggt cacaccgctg ttcctgctgt ccagtgtggt ccgatctacc 1801 gtcaaagctc tagtgagcgt gcaaaagcta agcgagttcc tgtccagtgc agagatccgt 1861 gaggagcagt gtgcccccca tgagcccaca cctcagggcc cagccagcaa gtaccaggcg 1921 gtgcccctca gggttgtgaa ccgcaagcgt ccagcccggg aggattgtcg gggcctcacc 1981 ggcccactgc agagcctggt ccccagtgca gatggcgatg ctgacaactg ctgtgtccag 2041 atcatgggag gctacttcac gtggacccca gatggaatcc ccacactgtc caacatcacc 2101 attcgtatcc cccgaggcca gctgactatg atcgtggggc aggtgggctg cggcaagtcc 2161 tcgctccttc tagccgcact gggggagatg cagaaggtct caggggctgt cttctggagc 2221 agccttcctg acagcgagat aggagaggac cccagcccag agcgggagac agcgaccgac 2281 ttggatatca ggaagagagg ccccgtggcc tatgcttcgc agaaaccatg gctgctaaat 2341 gccactgtgg aggagaacat catctttgag agtcccttca acaaacaacg gtacaagatg 2401 gtcattgaag cctgctctct gcagccagac atcgacatcc tgccccatgg agaccagacc 2461 cagattgggg aacggggcat caacctgtct ggtggtcaac gccagcgaat cagtgtggcc 2521 cgagccctct accagcacgc caacgttgtc ttcttggatg accccttctc agctctggat 2581 atccatctga gtgaccactt aatgcaggcc ggcatccttg agctgctccg ggacgacaag 2641 aggacagtgg tcttagtgac ccacaagcta cagtacctgc cccatgcaga ctggatcatt 2701 gccatgaagg atggcaccat ccagagggag ggtaccctca aggacttcca gaggtctgaa 2761 tgccagctct ttgagcactg gaagaccctc atgaaccgac aggaccaaga gctggagaag 2821 gagactgtca cagagagaaa agccacagag ccaccccagg gcctatctcg tgccatgtcc 2881 tcgagggatg gccttctgca ggatgaggaa gaggaggaag aggaggcagc tgagagcgag 2941 gaggatgaca acctgtcgtc catgctgcac cagcgtgctg agatcccatg gcgagcctgc 3001 gccaagtacc tgtcctccgc cggcatcctg ctcctgtcgt tgctggtctt ctcacagctg 3061 ctcaagcaca tggtcctggt ggccatcgac tactggctgg ccaagtggac cgacagcgcc 3121 ctgaccctga cccctgcagc caggaactgc tccctcagcc aggagtgcac cctcgaccag 3181 actgtctatg ccatggtgtt cacggtgctc tgcagcctgg gcattgtgct gtgcctcgtc 3241 acgtctgtca ctgtggagtg gacagggctg aaggtggcca agagactgca ccgcagcctg 3301 ctaaaccgga tcatcctagc ccccatgagg ttttttgaga ccacgcccct tgggagcatc 3361 ctgaacagat tttcatctga ctgtaacacc atcgaccagc acatcccatc cacgctggag 3421 tgcctgagcc gctccaccct gctctgtgtc tcagccctgg ccgtcatctc ctatgtcaca 3481 cctgtgttcc tcgtggccct cttgcccctc gcagtcgtgt gctacttcat ccagaagtac 3541 ttccgggtgg cgtccaggga cctgcagcag ctggatgaca ccacccagct tccacttctc 3601 tcacactttg ccgaaaccgt agaaggactc accaccatcc gggccttcag gtatgaggcc 3661 cggttccagc agaagcttct cgaatacaca gactccaaca acattgcttc cctcttcctc 3721 acagctgcca acagatggct ggaagtccga atggagtaca tcggtgcatg tgtggtgctc 3781 atcgcagcgg tgacctccat ctccaactcc ctgcacaggg agctctctgc tggcctggtg 3841 ggcctgggcc ttacctacgc cctaatggtc tccaactacc tcaactggat ggtgaggaac 3901 ctggcagaca tggagctcca gctgggggct gtgaagcgca tccatgggct cctgaaaacc 3961 gaggcagaga gctacgaggg gctcctggca ccatcgctga tcccaaagaa ctggccagac 4021 caagggaaga tccagatcca gaacctgagc gtgcgctacg acagctccct gaagccggtg 4081 ctgaagcacg tcaatgccct catctcccct ggacagaaga tcgggatctg cggccgcacc 4141 ggcagtggga agtcctcctt ctctcttgcc ttcttccgca tggtggacac gttcgaaggg 4201 cacatcatca ttgatggcat tgacatccgc aaactgccgc tgcacaccct gccgtcacgc 4261 ctctccatca tcctgcagga ccccgtcctc ttcagcggca ccatccgatt taacctggac 4321 cctgagagga agtgctcaga tagcacactg tgggaggccc tggaaatcgc ccagctgaag 4381 ctggtggtga aggcactgcc aggaggcctc gatgccatca tcacagaagg cggggagaat 4441 ttcagccagg gacagaggca gctgttctgc ctggcccggg ccttcgtgag gaagaccagc 4501 atcttcatca tggacgaggc cacggcttcc attgacatgg ccacggaaaa catcctccaa 4561 aaggtggtga tgacagcctt cgcagaccgc actgtggtca ccatcgcgca tcgagtgcac 4621 accatcctga gtgcagacct ggtgatcgtc ctgaagcggg gtgccatcct tgagttcgat 4681 aagccagaga agctgctcag ccggaaggac agcgtcttcg cctccttcgt ccgtgcagac 4741 aagtgacctg ccagagccca agtgccatcc cacattcgga ccctgcccat acccctgcct 4801 gggttttcta actgtaaatc acttgtaaat aaatagattt gattatttcc t // LOCUS HSU63717 901 bp mRNA PRI 20-AUG-1996 DEFINITION Human osteoclast stimulating factor mRNA, complete cds. ACCESSION U63717 NID g1498487 KEYWORDS Osteoclast; Bone resorption. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 901) AUTHORS Reddy,S.V., Devlin,R. and Roodman,G.D. TITLE Cloning and characterization of a novel autocrine osteoclast (OCL) stimulating factor (OSF) JOURNAL J. Bone Miner. Res. 10, S325-S325 (1995) REMARK Abstract REFERENCE 2 (bases 1 to 901) AUTHORS Reddy,S.V., Devlin,R., Leach,R.J. and Roodman,G.D. TITLE Characterization of a novel osteoclast stimulating factor (OSF) JOURNAL Unpublished REFERENCE 3 (bases 1 to 901) AUTHORS Reddy,S.V. and Roodman,G.D. TITLE Direct Submission JOURNAL Submitted (13-JUL-1996) Medicine/Hematology, UTHSCSA, 7703 Floyd Curl Drive, San Antonio, TX 78284, USA FEATURES Location/Qualifiers source 1..901 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="osteoclast-like cells formed in long-term human bone marrow cultures" /chromosome="12" /map="12q24.1-24.2" CDS 150..794 /note="OSF; contains SH3 domain and ankyrin repeat" /codon_start=1 /product="osteoclast stimulating factor" /db_xref="PID:g1498488" /translation="MSKPPPKPVKPGQVKVFRALYTFEPRTPDELYFEEGDIIYITDM SDTNWWKGTSKGRTGLIPSNYVAEQAESIDNPLHEAAKRGNLSWLRECLDNRVGVNGL DKAGSTALYWACHGGHKDIVEMLFTQPNIELNQQNKLGDTAFDAAAWKGYADIVQLLL AKGARTDLRNIEKKLAFDMATNAACASLLKKKQGTDAVRTLSNAEDYLDDEDSD" BASE COUNT 270 a 187 c 236 g 208 t ORIGIN 1 ctcttcccgc agccaagggt gggcgccggt cctaggaggc gacggttgta agccagacaa 61 aaagaactgg ggtgcccgga gtgccaggtg gcgggcaagc ggtgggcttt tcggcggggt 121 ctttaggatt tgcagctcca ggaagcgaga tgtcgaagcc gccacccaaa ccagtcaaac 181 cagggcaagt taaagtcttc agagccctgt atacgtttga acccagaact ccagatgaat 241 tatactttga ggaaggtgat attatctaca ttactgacat gagcgatacc aattggtgga 301 aaggcacctc caaaggcagg actggactaa ttccaagcaa ctatgtggct gagcaggcag 361 aatccattga caatccattg catgaagcag caaaaagagg caacttgagc tggttgagag 421 agtgtttgga caacagagtg ggtgttaatg gcttagacaa agctggaagc actgccttat 481 actgggcttg ccacgggggc cacaaagata tagtggaaat gctatttact caaccaaata 541 ttgaactgaa ccagcagaac aagttgggag atacagcttt cgatgctgct gcctggaagg 601 gttatgcaga tatcgtccag ttgcttctgg caaaaggtgc tagaacagac ttaagaaaca 661 ttgagaagaa gctggccttc gacatggcta ccaatgctgc ctgtgcatct ctcctgaaaa 721 agaaacaggg aacagatgca gttcgaacat taagcaatgc cgaggactat ctcgatgatg 781 aagactcaga ttaattcctt tctggagctt tgagatctaa aacttctgtt gcttttgcca 841 ttccaaaact ttgtctttgc cagaaaagtg ttggtaacta taaagaaaat atatatgaaa 901 a // LOCUS HSU63743 2740 bp mRNA PRI 01-DEC-1996 DEFINITION Human mitotic centromere-associated kinesin mRNA, complete cds. ACCESSION U63743 NID g1695881 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2740) AUTHORS Kim,I.G., Jun,D.Y. and Kim,Y.H. TITLE Expression of Human Centromere-Associated Kinesin Gene in T Lymphocytes JOURNAL Unpublished REFERENCE 2 (bases 1 to 2740) AUTHORS Kim,I.G., Jun,D.Y. and Kim,Y.H. TITLE Direct Submission JOURNAL Submitted (12-JUL-1996) Microbiology, Kyungpook National University, Taegu 702-701, Korea FEATURES Location/Qualifiers source 1..2740 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T lymphocyte" /cell_line="Jurkat" /clone="E6-1" CDS 55..2232 /note="HsMCAK" /codon_start=1 /product="mitotic centromere-associated kinesin" /db_xref="PID:g1695882" /translation="MAMDSSLQARLFPGLAIKIQRSNGLIHSANVRTVNLEKSCVSVE WAEGGATKGKEIDFDDVAAINPELLQLLPLHPKDNLPLQENVTIQKQKRRSVNSKIPA PKESLRSRSTRMSTVSELRITAQENDMEVELPAAANSRKQFSVPPAPTRPSCPAVAEI PLRMVSEEMEEQVHSIRGSSSANPVNSVRRKSCLVKEVEKMKNKREEKKAQNSEMRMK RAQEYDSSFPNWEFARMIKEFRATLECHPLTMTDPIEEHRICVCVRKRPLNKQELAKK EIDVISIPSKCLLLVHEPKLKVDLTKYLENQAFCFDFAFDETASNEVVYRFTARPLVQ TIFEGGKATCFAYGQTGSGKTHTMGGDLSGKAQNASKGIYAMASRDVFLLKNQPCYRK LGLEVYVTFFEIYNGKLFDLLNKKAKLRVLEDGKQQVQVVGLQEHLVNSADDVIKMLD MGSACRTSGQTFANSNSSRSHACFQIILRAKGRMHGKFSLVDLAGNERGADTSSADRQ TRMEGAEINKSLLALKECIRALGQNKAHTPFRESKLTQVLRDSFIGENSRTCMIATIS PGISSCEYTLNTLRYADRVKELSPHSGPSGEQLIQMETEEMEACSNGALIPGNLSKEE EELSSQMSSFNEAMTQIRELEEKAMEELKEIIQQGPDWLELSEMTEQPDYDLETFVNK AESALAQQAKHFSALRDVIKALRLAMQLEEQASRQISSKKRPQ" BASE COUNT 725 a 665 c 724 g 626 t ORIGIN 1 gcgaaattga ggtttcttgg tattgcgcgt ttctcttcct tgctgactct ccgaatggcc 61 atggactcgt cgcttcaggc ccgcctgttt cccggtctcg ctatcaagat ccaacgcagt 121 aatggtttaa ttcacagtgc caatgtaagg actgtgaact tggagaaatc ctgtgtttca 181 gtggaatggg cagaaggagg tgccacaaag ggcaaagaga ttgattttga tgatgtggct 241 gcaataaacc cagaactctt acagcttctt cccttacatc cgaaggacaa tctgcccttg 301 caggaaaatg taacaatcca gaaacaaaaa cggagatccg tcaactccaa aattcctgct 361 ccaaaagaaa gtcttcgaag ccgctccact cgcatgtcca ctgtctcaga gcttcgcatc 421 acggctcagg agaatgacat ggaggtggag ctgcctgcag ctgcaaactc ccgcaagcag 481 ttttcagttc ctcctgcccc cactaggcct tcctgccctg cagtggctga aataccattg 541 aggatggtca gcgaggagat ggaagagcaa gtccattcca tccgtggcag ctcttctgca 601 aaccctgtga actcagttcg gaggaaatca tgtcttgtga aggaagtgga aaaaatgaag 661 aacaagcgag aagagaagaa ggcccagaac tctgaaatga gaatgaagag agctcaggag 721 tatgacagta gttttccaaa ctgggaattt gcccgaatga ttaaagaatt tcgggctact 781 ttggaatgtc atccacttac tatgactgat cctatcgaag agcacagaat atgtgtctgt 841 gttaggaaac gcccactgaa taagcaagaa ttggccaaga aagaaattga tgtgatttcc 901 attcctagca agtgtctcct cttggtacat gaacccaagt tgaaagtgga cttaacaaag 961 tatctggaga accaagcatt ctgctttgac tttgcatttg atgaaacagc ttcgaatgaa 1021 gttgtctaca ggttcacagc aaggccactg gtacagacaa tctttgaagg tggaaaagca 1081 acttgttttg catatggcca gacaggaagt ggcaagacac atactatggg cggagacctc 1141 tctgggaaag cccagaatgc atccaaaggg atctatgcca tggcctcccg ggacgtcttc 1201 ctcctgaaga atcaaccctg ctaccggaag ttgggcctgg aagtctatgt gacattcttc 1261 gagatctaca atgggaagct gtttgacctg ctcaacaaga aggccaagct gcgcgtgctg 1321 gaggacggca agcaacaggt gcaagtggtg gggctgcagg agcatctggt taactctgct 1381 gatgatgtca tcaagatgct cgacatgggc agcgcctgca gaacctctgg gcagacattt 1441 gccaactcca attcctcccg ctcccacgcg tgcttccaaa ttattcttcg agctaaaggg 1501 agaatgcatg gcaagttctc tttggtagat ctggcaggga atgagcgagg cgcagacact 1561 tccagtgctg accggcagac ccgcatggag ggcgcagaaa tcaacaagag tctcttagcc 1621 ctgaaggagt gcatcagggc cctgggacag aacaaggctc acaccccgtt ccgtgagagc 1681 aagctgacac aggtgctgag ggactccttc attggggaga actctaggac ttgcatgatt 1741 gccacgatct caccaggcat aagctcctgt gaatatactt taaacaccct gagatatgca 1801 gacagggtca aggagctgag cccccacagt gggcccagtg gagagcagtt gattcaaatg 1861 gaaacagaag agatggaagc ctgctctaac ggggcgctga ttccaggcaa tttatccaag 1921 gaagaggagg aactgtcttc ccagatgtcc agctttaacg aagccatgac tcagatcagg 1981 gagctggagg agaaggctat ggaagagctc aaggagatca tacagcaagg accagactgg 2041 cttgagctct ctgagatgac cgagcagcca gactatgacc tggagacctt tgtgaacaaa 2101 gcggaatctg ctctggccca gcaagccaag catttctcag ccctgcgaga tgtcatcaag 2161 gccttacgcc tggccatgca gctggaagag caggctagca gacaaataag cagcaagaaa 2221 cggccccagt gacgactgca aataaaaatc tgtttggttt gacacccagc ctcttccctg 2281 gccctcccca gagaactttg ggtacctggt gggtctaggc agggtctgag ctgggacagg 2341 ttctggtaaa tgccaagtat gggggcatct gggcccaggg cagctgggga gggggtcaga 2401 gtgacatggg acactccttt tctgttcctc agttgtcgcc ctcacgagag gaaggagctc 2461 ttagttaccc ttttgtgttg cccttctttc catcaagggg aatgttctca gcatagagct 2521 ttctccgcag catcctgcct gcgtggactg gctgctaatg gagagctccc tggggttgtc 2581 ctggctctgg ggagagagac ggagccttta gtacagctat ctgctggctc taaaccttct 2641 acgcctttgg gccgagcact gaatgtcttg tactttaaaa aaatgtttct gagacctctt 2701 tctactttac tgtctcccta gagtcctaga ggatccctac // LOCUS HSU63825 879 bp mRNA PRI 13-AUG-1996 DEFINITION Human hepatitis delta antigen interacting protein A (dipA) mRNA, complete cds. ACCESSION U63825 NID g1488313 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 879) AUTHORS Brazas,R.M. and Ganem,D. TITLE A cellular homolog of hepatitis delta antigen: implications for viral replication and evolution JOURNAL Science (1996) In press REFERENCE 2 (bases 1 to 879) AUTHORS Brazas,R.M. and Ganem,D. TITLE Direct Submission JOURNAL Submitted (15-JUL-1996) Microbiology, University of California, San Francisco, 513 Parnassus Avenue, San Francisco, CA 94143-0414, USA FEATURES Location/Qualifiers source 1..879 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HL-60" gene 29..637 /gene="dipA" CDS 29..637 /gene="dipA" /note="isolated in a two hybrid screen to identify cellular proteins that interact with hepatitis delta antigen; similar to hepatitis delta antigen, and has two regions predicted to form coiled-coil protein interaction domains" /codon_start=1 /product="hepatitis delta antigen interacting protein A" /db_xref="PID:g1488314" /translation="MEAEAGGLEELTDEEMAALGKEELVRRLRREEATRLAALVQRGR LMQEVNRQLQGHLGEIRELKQLNRRLQAENRELRDLCCFLDSERQRGRRAARQWQLFG TQASRAVREDLGGCWQKLAELEGRQEELLRENLALKELCLALGEEWGPRGGPSGAGGS GAGPAPELALPPCGPRDLGDGSSSTGSVGSPDQLPLACSPDD" misc_feature 164..289 /gene="dipA" /note="encodes coiled-coil interaction domain" /evidence=not_experimental misc_feature 386..469 /gene="dipA" /note="encodes coiled-coil interaction domain" /evidence=not_experimental polyA_signal 859..864 /evidence=not_experimental BASE COUNT 146 a 276 c 344 g 113 t ORIGIN 1 gggcgatgct ccagaggcct gaccagccat ggaggccgag gcaggcggcc tggaggagct 61 gacggacgag gagatggcgg cgctaggcaa ggaagagcta gtgcggcgcc tgcggcggga 121 ggaggcgacg cgcctggcgg cactggtgca gcgcggccgc ctcatgcagg aggtgaatcg 181 gcagctgcag ggccacctgg gcgagatccg cgagctcaag cagctcaacc ggcgtctgca 241 ggcagagaac cgtgagctgc gcgacctctg ctgcttcctg gactcggagc gccagcgcgg 301 gcggcgcgcc gcacgccagt ggcagctctt cgggacccaa gcatcccggg ccgtgcgcga 361 ggacctgggc ggctgttggc agaagctggc cgagctggag ggccgccagg aggagctgct 421 gcgggagaac ctagcgctta aggagctctg cctggcgctg ggcgaagaat ggggcccccg 481 cggcggcccc agcggcgccg ggggatcagg agccgggcca gcacccgagc ttgccttgcc 541 cccgtgcggg ccccgcgacc taggcgatgg aagctccagc actggcagcg tgggcagtcc 601 ggatcagttg cccctggcct gttcccccga tgattgaagg cactgcttcc tccacgccga 661 cgcccgcccg gattgctccc cgagccccgg gaccgctgtg gacctcggga cctggacgcc 721 gtcctggctg cgcaggaggg gccgctggca tggactaaga aatcctgaca ccaagaaggg 781 cccctcgctc ttgctggcag ggcagcaggg ggactgaagg ctggagcgga gggacttgct 841 gggggttgga ttgggggtaa taaacccgga cggaagcgg // LOCUS HSU63842 1268 bp DNA PRI 04-DEC-1996 DEFINITION Human neurogenic basic-helix-loop-helix protein (neuroD3) gene, complete cds. ACCESSION U63842 NID g1654337 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1268) AUTHORS McCormick,M.B., Tamimi,R.M., Snider,L., Asakura,A., Bergstrom,D. and Tapscott,S.J. TITLE NeuroD2 and neuroD3: distinct expression patterns and transcriptional activation potentials within the neuroD gene family JOURNAL Mol. Cell. Biol. 16 (10), 5792-5800 (1996) MEDLINE 96413331 REFERENCE 2 (bases 1 to 1268) AUTHORS Tapscott,S.J., Tamimi,R., Bergstrom,D. and McCormick,M.B. TITLE Direct Submission JOURNAL Submitted (15-JUL-1996) Clinical Research, Fred Hutchinson Cancer Research Center, 1124 Columbia Street, Seattle, WA 98104, USA FEATURES Location/Qualifiers source 1..1268 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="5" /map="5q22-35" gene 55..768 /gene="neuroD3" CDS 55..768 /gene="neuroD3" /note="bHLH protein related to neuroD; neurogenic basic-helix-loop-helix protein" /codon_start=1 /db_xref="PID:g1654338" /translation="MPARLETCISDLDCASSSGSDLSGFLTDEEDCARLQQAASASGP PAPARRSAPNISRASEVPGAQDDEQERRRRRGRTRVRSEALLHSLRRSRRVKANDRER NRMHNLNAALDALRSVLPSFPDDTKLTKIETLRFAYNYIWALAETLRLADQGLPGGGA RERLLPPQCVPCLPGPPSPASDAESWGSGAAAASPLSDPSSPAASEDFTYRPGDPVFS FPSLPKDLLHTTPCFIPYH" BASE COUNT 246 a 455 c 343 g 224 t ORIGIN 1 ctgcagcgct ctgagccgct ttctatctgt ccgtcggtcc tgcacagcgc aacgatgcca 61 gcccgccttg agacctgcat ctccgacctc gactgcgcca gcagcagcgg cagtgaccta 121 tccggcttcc tcaccgacga ggaagactgt gccagactcc aacaggcagc ctccgcttcg 181 gggccgcccg cgccggcccg caggagcgcg cccaatatct cccgggcgtc tgaggttcca 241 ggggcacagg acgacgagca ggagaggcgg cggcgccgcg gccggacgcg ggtccgctcc 301 gaggcgctgc tgcactcgct gcgcaggagc cggcgcgtca aggccaacga tcgcgagcgc 361 aaccgcatgc acaacttgaa cgcggccctg gacgcactgc gcagcgtgct gccctcgttc 421 cccgacgaca ccaagctcac caaaatcgag acgctgcgct tcgcctacaa ctacatctgg 481 gctctggccg agacactgcg cctggcggat caagggctgc ccggaggcgg tgcccgggag 541 cgcctcctgc cgccgcagtg cgtcccctgc ctgcccggtc ccccaagccc cgccagcgac 601 gcggagtcct ggggctcagg tgccgccgcc gcctccccgc tctctgaccc cagtagccca 661 gccgcctccg aagacttcac ctaccgcccc ggcgaccctg ttttctcctt cccaagcctg 721 cccaaagact tgctccacac aacgccctgt ttcattcctt accactaggc cctttgtaga 781 cactgttact ttccccctcc cctagtcagc aggcaataga ttgggcccag ctgccgcctc 841 gggacccctc tccaggcgga gggaggaagc gggagcttta aagcagtcgg ggatacctga 901 gccgcttgtt aggtcgccgc accctcgcgg cggatgtctc ttggtctgtt tctccggccc 961 tcagcccagc gcccctcctg cccgccccta gacggccttt ccttttgcac tttctgaact 1021 ccacaaaacc tcctttgtga ctggctcaga actgacccca gccaccactt cagtgtgatt 1081 tagaaaaggg acagatcagc ccctgaagac gaggtgaaaa gtcaatttta caatttgtag 1141 aactctaatg aagaaaaacg agcatgaaaa ttcggtttga gccggctgac aatacaatga 1201 aaaggcttaa aaagcagaga caaggagtgg gcttcatgca ttatggatcc cgacccccac 1261 cactgcag // LOCUS HSU64198 4040 bp mRNA PRI 26-NOV-1996 DEFINITION Human Il-12 receptor beta2 mRNA, complete cds. ACCESSION U64198 NID g1685027 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4040) AUTHORS Presky,D.H., Yang,H., Minetti,L.J., Chua,A.O., Nabavi,N., Wou,C.-Y., Gately,M.K. and Gubler,U. TITLE a functional Il-12 receptor complex is composed of two beta type cytokine receptor subunits JOURNAL Unpublished REFERENCE 2 (bases 1 to 4040) AUTHORS Presky,D.H., Yang,H., Minetti,L.J., Chua,A.O., Nabavi,N., Wou,C.-Y., Gately,M.K. and Gubler,U. TITLE Direct Submission JOURNAL Submitted (17-JUL-1996) Inflammation/Autoimmune Diseases, Hoffmann la Roche Inc, 340 Kingsland Street, Nutley, NJ 07110, USA FEATURES Location/Qualifiers source 1..4040 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 641..3229 /codon_start=1 /product="Il-12 receptor beta2" /db_xref="PID:g1685028" /translation="MAHTFRGCSLAFMFIITWLLIKAKIDACKRGDVTVKPSHVILLG STVNITCSLKPRQGCFHYSRRNKLILYKFDRRINFHHGHSLNSQVTGLPLGTTLFVCK LACINSDEIQICGAEIFVGVAPEQPQNLSCIQKGEQGTVACTWERGRDTHLYTEYTLQ LSGPKNLTWQKQCKDIYCDYLDFGINLTPESPESNFTAKVTAVNSLGSSSSLPSTFTF LDIVRPLPPWDIRIKFQKASVSRCTLYWRDEGLVLLNRLRYRPSNSRLWNMVNVTKAK GRHDLLDLKPFTEYEFQISSKLHLYKGSWSDWSESLRAQTPEEEPTGMLDVWYMKRHI DYSRQQISLFWKNLSVSEARGKILHYQVTLQELTGGKAMTQNITGHTSWTTVIPRTGN WAVAVSAANSKGSSLPTRINIMNLCEAGLLAPRQVSANSEGMDNILVTWQPPRKDPSA VQEYVVEWRELHPGGDTQVPLNWLRSRPYNVSALISENIKSYICYEIRVYALSGDQGG CSSILGNSKHKAPLSGPHINAITEEKGSILISWNSIPVQEQMGCLLHYRIYWKERDSN SQPQLCEIPYRVSQNSHPINSLQPRVTYVLWMTALTAAGESSHGNEREFCLQGKANWM AFVAPSICIAIIMVGIFSTHYFQQKVFVLLAALRPQWCSREIPDPANSTCAKKYPIAE EKTQLPLDRLLIDWPTPEDPEPLVISEVLHQVTPVFRHPPCSNWPQREKGIQGHQASE KDMMHSASSPPPPRALQAESRQLVDLYKVLESRGSDPKPENPACPWTVLPAGDLPTHD GYLPSNIDDLPSHEAPLADSLEELEPQHISLSVFPSSSLHPLTFSCGDKLTLDQLKMR CDSLML" BASE COUNT 1073 a 1054 c 976 g 937 t ORIGIN 1 tgcagagaac agagaaagga catctgcgag gaaagttccc tgatggctgt caacaaagtg 61 ccacgtctct atggctgtgt acgctgagca cacgatttta tcgcgcctat catatcttgg 121 tgcataaacg cacctcacct cggtcaaccc ttgctccgtc ttatgagaca ggctttatta 181 tccgcatttt atatgagggg aatctgacgg tggagagaga attatcttgc tcaaggcgac 241 acagcagagc ccacaggtgg cagaatccca cccgagcccg cttcgacccg cggggtggaa 301 accacgggcg cccgcccggc tgcgcttcca gagctgaact gagaagcgag tcctctccgc 361 cctgcggcca ccgcccagcc ccgacccccg ccccggcccg atcctcactc gccgccagct 421 ccccgcgccc accccggagt tggtggcgca gaggcgggag gcggaggcgg gagggcgggc 481 gctggcaccg ggaacgcccg agcgccggca gagagcgcgg agagcgcgac acgtgcggcc 541 cagagcaccg gggccacccg gtccccgcag gcccgggacc gcgcccgctg gcaggcgaca 601 cgtggaagaa tacggagttc tataccagag ttgattgttg atggcacata cttttagagg 661 atgctcattg gcatttatgt ttataatcac gtggctgttg attaaagcaa aaatagatgc 721 gtgcaagaga ggcgatgtga ctgtgaagcc ttcccatgta attttacttg gatccactgt 781 caatattaca tgctctttga agcccagaca aggctgcttt cactattcca gacgtaacaa 841 gttaatcctg tacaagtttg acagaagaat caattttcac catggccact ccctcaattc 901 tcaagtcaca ggtcttcccc ttggtacaac cttgtttgtc tgcaaactgg cctgtatcaa 961 tagtgatgaa attcaaatat gtggagcaga gatcttcgtt ggtgttgctc cagaacagcc 1021 tcaaaattta tcctgcatac agaagggaga acaggggact gtggcctgca cctgggaaag 1081 aggacgagac acccacttat acactgagta tactctacag ctaagtggac caaaaaattt 1141 aacctggcag aagcaatgta aagacattta ttgtgactat ttggactttg gaatcaacct 1201 cacccctgaa tcacctgaat ccaatttcac agccaaggtt actgctgtca atagtcttgg 1261 aagctcctct tcacttccat ccacattcac attcttggac atagtgaggc ctcttcctcc 1321 gtgggacatt agaatcaaat ttcaaaaggc ttccgtgagc agatgtaccc tttattggag 1381 agatgaggga ctggtactgc ttaatcgact cagatatcgg cccagtaaca gcaggctctg 1441 gaatatggtt aatgttacaa aggccaaagg aagacatgat ttgctggatc tgaaaccatt 1501 tacagaatat gaatttcaga tttcctctaa gctacatctt tataagggaa gttggagtga 1561 ttggagtgaa tcattgagag cacaaacacc agaagaagag cctactggga tgttagatgt 1621 ctggtacatg aaacggcaca ttgactacag tagacaacag atttctcttt tctggaagaa 1681 tctgagtgtc tcagaggcaa gaggaaaaat tctccactat caggtgacct tgcaggagct 1741 gacaggaggg aaagccatga cacagaacat cacaggacac acctcctgga ccacagtcat 1801 tcctagaacc ggaaattggg ctgtggctgt gtctgcagca aattcaaaag gcagttctct 1861 gcccactcgt attaacataa tgaacctgtg tgaggcaggg ttgctggctc ctcgccaggt 1921 ctctgcaaac tcagagggca tggacaacat tctggtgact tggcagcctc ccaggaaaga 1981 tccctctgct gttcaggagt acgtggtgga atggagagag ctccatccag ggggtgacac 2041 acaggtccct ctaaactggc tacggagtcg accctacaat gtgtctgctc tgatttcaga 2101 gaacataaaa tcctacatct gttatgaaat ccgtgtgtat gcactctcag gggatcaagg 2161 aggatgcagc tccatcctgg gtaactctaa gcacaaagca ccactgagtg gcccccacat 2221 taatgccatc acagaggaaa aggggagcat tttaatttca tggaacagca ttccagtcca 2281 ggagcaaatg ggctgcctcc tccattatag gatatactgg aaggaacggg actccaactc 2341 ccagcctcag ctctgtgaaa ttccctacag agtctcccaa aattcacatc caataaacag 2401 cctgcagccc cgagtgacat atgtcctgtg gatgacagct ctgacagctg ctggtgaaag 2461 ttcccacgga aatgagaggg aattttgtct gcaaggtaaa gccaattgga tggcgtttgt 2521 ggcaccaagc atttgcattg ctatcatcat ggtgggcatt ttctcaacgc attacttcca 2581 gcaaaaggtg tttgttctcc tagcagccct cagacctcag tggtgtagca gagaaattcc 2641 agatccagca aatagcactt gcgctaagaa atatcccatt gcagaggaga agacacagct 2701 gcccttggac aggctcctga tagactggcc cacgcctgaa gatcctgaac cgctggtcat 2761 cagtgaagtc cttcatcaag tgaccccagt tttcagacat cccccctgct ccaactggcc 2821 acaaagggaa aaaggaatcc aaggtcatca ggcctctgag aaagacatga tgcacagtgc 2881 ctcaagccca ccacctccaa gagctctcca agctgagagc agacaactgg tggatctgta 2941 caaggtgctg gagagcaggg gctccgaccc aaagccagaa aacccagcct gtccctggac 3001 ggtgctccca gcaggtgacc ttcccaccca tgatggctac ttaccctcca acatagatga 3061 cctcccctca catgaggcac ctctcgctga ctctctggaa gaactggagc ctcagcacat 3121 ctccctttct gttttcccct caagttctct tcacccactc accttctcct gtggtgataa 3181 gctgactctg gatcagttaa agatgaggtg tgactccctc atgctctgag tggtgaggct 3241 tcaagcctta aagtcagtgt gccctcaacc agcacagcct gccccaattc ccccagcccc 3301 tgctccagca gctgtcatct ctgggtgcca ccatcggtct ggctgcagct agaggacagg 3361 caagccagct ctgggggagt cttaggaact gggagttggt cttcactcag atgcctcatc 3421 ttgcctttcc cagggcctta aaattacatc cttcactgtg tggacctaga gactccaact 3481 tgaattccta gtaactttct tggtatgctg gccagaaagg gaaatgagga ggagagtaga 3541 aaccacagct cttagtagta atggcataca gtctagagga ccattcatgc aatgactatt 3601 tctaaagcac ctgctacaca gcaggctgta cacagcagat cagtactgtt caacagaact 3661 tcctgagatg atggaaatgt tctacctctg cactcactgt ccagtacatt agacactagg 3721 cacattggct gttaatcact tggaatgtgt ttagcttgac tgaggaatta aattttgatt 3781 gtaaatttaa atcgccacac atggctagtg gctactgtat tggagtgcac agctctagat 3841 ggctcctaga ttattgagag cctccaaaac aaatcaacct agttctatag atgaagacat 3901 aaaagacact ggtaaacacc aatgtaaaag ggcccccaag gtggtcatga ctggtctcat 3961 ttgcagaagt ctaagaatgt acctttttct ggccgggcgt ggtagctcat gcctgtaatc 4021 ccagcacttt gggaggctga // LOCUS HSU64315 2881 bp mRNA PRI 07-SEP-1996 DEFINITION Human DNA repair endonuclease subunit (XPF) mRNA, complete cds. ACCESSION U64315 NID g1524410 KEYWORDS ERCC4; ERCC11; nucleotide excision repair; xeroderma pigmentosum. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2881) AUTHORS Sijbers,A.M., de Laat,W.L., Ariza,R.R., Biggerstaff,M., Wei,Y., Moggs,J.G., Carter,K.C., Shell,B.K., Evans,E., de Jong,M.C., Rademakers,S., de Rooij,J., Jaspers,N.G., Hoeijmakers,J.H. and Wood,R.D. TITLE Xeroderma pigmentosum group F caused by a defect in a structure-specific DNA repair endonuclease JOURNAL Cell (1996) In press REFERENCE 2 (bases 1 to 2881) AUTHORS Wood,R.D. TITLE Direct Submission JOURNAL Submitted (18-JUL-1996) Biochemistry of Inherited Syndromes Lab., Imperial Cancer Research Fund, Clare Hall Laboratories, Blanche Lane, South Mimms, Herts., EN6 3LD, United Kingdom FEATURES Location/Qualifiers source 1..2881 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" /map="16p13.1-13.2" gene 16..2733 /gene="XPF" CDS 16..2733 /gene="XPF" /codon_start=1 /product="DNA repair endonuclease subunit" /db_xref="PID:g1524411" /translation="MAPLLEYERQLVLELLDTDGLVVCARGLGADRLLYHFLQLHCHP ACLVLVLNTQPAEEEYFINQLKIEGVEHLPRRVTNEITSNSRYEVYTQGGVIFATSRI LVVDFLTDRIPSDLITGILVYRAHRIIESCQEAFILRLFRQKNKRGFIKAFTDNAVAF DTGFCHVERVMRNLFVRKLYLWPRFHVAVNSFLEQHKPEVVEIHVSMTPTMLAIQTAI LDILNACLKELKCHNPSLEVEDLSLENAIGKPFDKTIRHYLDPLWHQLGAKTKSLVQD LKILRTLLQYLSQYDCVTFLNLLESLRATEKAFGQNSGWLFLDSSTSMFINARARVYH LPDAKMSKKEKISEKMEIKEGEETKKELVLESNPKWEALTEVLKEIEAENKESEALGG PGQVLICASDDRTCSQLRDYITLGAEAFLLRLYRKTFEKDSKAEEVWMKFRKEDSSKR IRKSHKRPKDPQNKERASTKERTLKKKKRKLTLTQMVGKPEELEEEGDVEEGYRREIS SSPESCPEEIKHEEFDVNLSSDAAFGILKEPLTIIHPLLGCSDPYALTRVLHEVEPRY VVLYDAELTFVRQLEIYRASRPGKPLRVYFLIYGGSTEEQRYLTALRKEKEAFEKLIR EKASMVVPEEREGRDETNLDLVRGTASADVSTDTRKAGGQEQNGTQQSIVVDMREFRS ELPSLIHRRDIDIEPVTLEVGDYILTPEMCVERKSISDLIGSLNNGRLYSQCISMSRY YKRPVLLIEFDPSKPFSLTSRGALFQEISSNDISSKLTLLTLHFPRLRILWCPSPHAT AELFEELKQSKPQPDAATALAITADSETLPESEKYNPGPQDFLLKMPGVNAKNCRSLM HHVKNIAELAALSQDELTSILGNAANAKQLYDFIHTSFAEVVSKGKGKK" variation 2090 /gene="XPF" /replace="G" mutation 2281..2290 /gene="XPF" /replace="TTCTCA" mutation 2377 /gene="XPF" /replace="T" variation 2487 /gene="XPF" /replace="T" BASE COUNT 861 a 627 c 667 g 726 t ORIGIN 1 gctcgacgga ttgccatggc gccgctgctg gagtacgagc gacagctggt gctggaactg 61 ctcgacactg acgggctagt agtgtgcgcc cgcgggctcg gcgcggaccg gctcctctac 121 cactttctcc agctgcactg ccacccagcc tgcctggtgc tggtgctcaa cacgcagccg 181 gccgaggagg agtattttat caatcagctg aagatagaag gagttgaaca cctccctcgc 241 cgtgtaacaa atgaaatcac aagcaacagt cgctatgaag tttacacaca aggtggtgtt 301 atatttgcga caagtaggat acttgtggtt gacttcttga ctgatagaat accttcagat 361 ttaattactg gcatcttggt gtatagagcc cacagaataa tcgagtcttg tcaagaagca 421 ttcatcttgc gcctctttcg ccagaaaaac aaacgtggtt ttattaaagc tttcacagac 481 aatgctgttg cctttgatac tggtttttgt catgtggaaa gagtgatgag aaatcttttt 541 gtgaggaaac tgtatctgtg gccaaggttc catgtagcag taaactcatt tttagaacag 601 cacaaacctg aagttgtaga aatccatgtt tctatgacac ctaccatgct tgctatacag 661 actgctatac tggacatttt aaatgcatgt ctaaaggaac taaaatgcca taacccatcg 721 cttgaagtgg aagatttatc tttagaaaat gctattggaa aaccttttga caagacaatc 781 cgccattatc tggatccttt gtggcaccag cttggagcca agactaaatc cttagttcag 841 gatttgaaga tattacgaac tttgctgcag tatctctctc agtatgattg tgtcacattt 901 cttaatcttc tggaatctct gagagcaacg gaaaaagctt ttggtcagaa ttcaggttgg 961 ctgtttcttg actccagcac ctcgatgttt ataaatgctc gagcaagggt ttatcatctt 1021 ccagatgcca aaatgagtaa aaaagaaaaa atatctgaaa aaatggaaat taaagaaggg 1081 gaagaaacaa aaaaggaact ggtcctagaa agcaacccaa agtgggaggc actgactgaa 1141 gtattaaaag aaattgaggc agaaaataag gagagtgaag ctcttggtgg tccaggtcaa 1201 gtactgattt gtgcaagtga tgaccgaaca tgttcccagc tgagagacta tatcactctt 1261 ggagcggagg ccttcttatt gaggctctac aggaaaacct ttgagaagga tagcaaagct 1321 gaagaagtct ggatgaaatt taggaaggaa gacagttcaa agagaattag gaaatctcac 1381 aaaagaccta aagaccccca aaacaaagaa cgggcttcta ccaaagaaag aaccctcaaa 1441 aagaaaaaac ggaagttgac cttaactcaa atggtaggaa aacctgaaga actggaagag 1501 gaaggagatg tcgaggaagg atatcgtcga gaaataagca gtagcccaga aagctgcccg 1561 gaagaaatta agcatgaaga atttgatgta aatttgtcat cggatgctgc tttcggaatc 1621 ctgaaagaac ccctcactat catccatccg cttctgggtt gcagcgaccc ctatgctctg 1681 acaagggtac tacatgaagt ggagccaaga tacgtggttc tttatgacgc agagctaacc 1741 tttgttcggc agcttgaaat ttacagggcg agtaggcctg ggaaacctct gagggtttac 1801 tttcttatat acggaggttc aactgaggaa caacgctatc tcactgcttt gcggaaagaa 1861 aaggaagctt ttgaaaaact cataagggaa aaagcaagca tggttgtccc tgaagaaaga 1921 gaaggcagag atgaaacaaa cttagaccta gtaagaggca cagcatctgc agatgtttcc 1981 actgacactc ggaaagccgg tggccaggaa cagaatggta cacagcaaag catagttgtg 2041 gatatgcgtg aatttcgaag tgagcttcca tctctgatcc atcgtcggga cattgacatt 2101 gaacccgtga ctttagaggt tggagattac atcctcactc cagaaatgtg cgtggagcgc 2161 aagagtatca gtgatttaat cggctcttta aataacggcc gcctctacag ccagtgcatc 2221 tccatgtccc gctactacaa gcgtcccgtg cttctgattg agtttgaccc tagcaagcct 2281 ttctctctca cttcccgagg tgccttgttt caggagatct ccagcaatga cattagttcc 2341 aaactcactc ttcttacact tcacttcccc agactacgga ttctctggtg cccctctcct 2401 catgcaacgg cggagttgtt tgaggagctg aaacaaagca agccacagcc tgatgcggcg 2461 acagcactgg ccattacagc agattccgaa acccttcccg agtcagagaa gtataatcct 2521 ggtccccaag acttcttgtt aaaaatgcca ggggtgaatg ccaaaaactg ccgctccttg 2581 atgcaccacg ttaagaacat cgcagaatta gcagccctgt cacaagacga gctcacgagt 2641 attctgggga atgctgcaaa tgccaaacag ctttatgatt tcattcacac ctcttttgca 2701 gaagtcgtat caaaaggaaa agggaaaaag tgaacagtga tggctgtttt cttatcccat 2761 gcctgtactt ttcagcggct ccttgccaga catcataggt cattattaat tattggtttg 2821 ctatttcatt cttttccaat gctcttaatg attgtacggt ggaccagagt tcagagagcc 2881 c // LOCUS HSU64444 1156 bp mRNA PRI 01-NOV-1996 DEFINITION Human ubiquitin fusion-degradation protein (UFD1L) mRNA, complete cds. ACCESSION U64444 NID g1654345 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1156) AUTHORS Pizzuti,A., Novelli,G., Ratti,A., Amati,F., Mari,A., Calabrese,G., Nicolis,S., Silani,V., Marino,B., Scarlato,G., Ottolenghi,S. and Dallapiccola,B. TITLE UFD1L, a developmentally expressed ubiquitination gene, is deleted in CATCH 22 syndrome JOURNAL Unpublished REFERENCE 2 (bases 1 to 1156) AUTHORS Novelli,G. TITLE Direct Submission JOURNAL Submitted (19-JUL-1996) Novelli G., Universita' Roma Tor Vergata, Sanita' Pubblica e Biologia Cellulare, Via di Tor Vergata 135, Roma, Italy, 00133 FEATURES Location/Qualifiers source 1..1156 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="UFD1-L" /tissue_type="brain" /dev_stage="fetal" /map="22q11" gene 79..1110 /gene="UFD1-L" CDS 79..1110 /gene="UFD1-L" /standard_name="UFD1p" /function="ubiquitin-dependent proteolytic protein" /note="ubiquitin like protein" /codon_start=1 /product="ubiquitin fusion-degradation protein" /db_xref="PID:g1654346" /translation="MFSFNMFDHPIPRVFQNRFSTQYRCFSVSMLAWPNDRSDVEKGG KIIMPPSALDQLSRLNITYPMLFKLTNKNSDRMTHCGVLEFVADEGICYLPHWMMQNL LLEEDGLVQLETVNLQVATYSKSKFCYLPHWMMQNLLLEEGGLVQVESVNLQVATYSK FQPQSADFLDITNPKAVLENALRNFACLTTGDVIAINYNEKIYELRVMETKPDKAVSI HECDMNVDFDAPLGYKEPERQVQHEESTEGEADHSGYAGELGFRAFSGSGNRLDGKKK GVEPSPSPIKPGDIKRGIPNYEFKLGKITFIRNSRPLVKKVEEDEAGGRFVAFSGEGQ SLRKKGRKP" BASE COUNT 318 a 284 c 292 g 262 t ORIGIN 1 ggcacgagga agagcggtcg gcggggtttc ttcgttgcat tgcctgagag gagcggagtc 61 tgccaggtgg tgtccatcat gttctctttc aacatgttcg accaccctat tcccagggtc 121 ttccaaaacc gcttctccac acagtaccgc tgcttctctg tgtccatgct agcatggcct 181 aatgacaggt cagatgtgga gaaaggaggg aagataatta tgccaccctc ggccctggac 241 caactcagcc gacttaacat tacctatccc atgctgttca aactgaccaa taagaattcg 301 gaccgcatga cgcattgtgg cgtgctggaa tttgtggctg atgaaggcat ctgctacctc 361 ccacactgga tgatgcagaa cttactcttg gaagaagacg gcctggtcca gttggagacc 421 gtcaaccttc aagtggccac ctactccaag agtaagttct gctacctccc acactggatg 481 atgcagaact tactcttgga agaaggcggc ctggtccagg tggagagcgt caaccttcaa 541 gtggccacct actccaaatt ccaacctcag agcgctgact tcctggacat caccaacccc 601 aaagccgtat tagaaaacgc acttaggaac tttgcctgtc tgaccaccgg ggatgtgatt 661 gccattaact acaatgaaaa gatctacgaa ctgcgtgtga tggagaccaa acccgacaag 721 gcagtgtcca ttcatgagtg tgacatgaac gtggactttg atgctcccct gggctacaaa 781 gaacccgaaa gacaagtcca gcatgaggag tcgacagaag gtgaagccga ccacagtggc 841 tatgctggag agcttggctt ccgcgctttc tctggatctg gcaatagact ggatggaaag 901 aagaaagggg tagagcccag cccctcccca atcaagcctg gagatattaa aagaggaatt 961 cccaattatg aatttaaact tggtaagata actttcatca gaaattcacg tccccttgtc 1021 aaaaaggttg aagaggatga agctggaggc agattcgtcg ctttctctgg agaaggacag 1081 tcattgcgta aaaagggaag aaagccctaa gtgaggactg ttggctgatt ggaaaataat 1141 aaaagtttca tttgca // LOCUS HSU64520 693 bp mRNA PRI 08-AUG-1996 DEFINITION Human synaptobrevin-3 mRNA, complete cds. ACCESSION U64520 NID g1480967 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 693) AUTHORS Bernstein,A.M. and Whiteheart,S.W. TITLE Synaptobrevin-3 JOURNAL Unpublished REFERENCE 2 (bases 1 to 693) AUTHORS Bernstein,A.M. and Whiteheart,S.W. TITLE Direct Submission JOURNAL Submitted (19-JUL-1996) Biochemistry, University of Kentucky Medical Center, 800 Rose Street, Lexington, KY 40536-0084, USA FEATURES Location/Qualifiers source 1..693 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="megakaryocytic leukemia" /cell_line="CMK" CDS 25..327 /function="membrane fusion" /note="SNARE protein" /codon_start=1 /product="synaptobrevin-3" /db_xref="PID:g1480968" /translation="MSTGPTAATGSNRRLQQTQNQVDEVVDIMRVNVDKVLERDQKLS ELDDRADALQAGASQFETSAAKLKRKYWWKNCKMWAIGITVLVIFIIIIIVWVVSS" BASE COUNT 193 a 150 c 154 g 196 t ORIGIN 1 ctctaaagcg ccgcagctgc caaaatgtct acaggtccaa ctgctgccac tggcagtaat 61 cgaagacttc agcagacaca aaatcaagta gatgaggtgg tggacataat gcgagttaac 121 gtggacaagg ttctggaaag agaccagaag ctctctgagt tagacgaccg tgcagacgca 181 ctgcaggcag gcgcttctca atttgaaacg agcgcagcca agttgaagag gaaatattgg 241 tggaagaatt gcaagatgtg ggcaatcggg attactgttc tggttatctt catcatcatc 301 atcatcgtgt gggttgtctc ttcatgaaga accagcggaa ctcaaaactg ctgttcaaga 361 aacctcttca agacttttga cttagaacct gctatattat caagcttacc tactgttatc 421 tctaaaattt tttttgtgtt aatgtaaagt tgaatttcta ggaaacgtgc ctttgttttt 481 taatatgcac tccaaattag aaggccggcc ccgtccacat tttgcacagt gcctttacag 541 atttacgtat gggctgatga agaggccttc ttaagttcca gagtgctata atctagatgt 601 aatgttgtca ctaattaatt gccattactc ccctttagag cacaatggat ctcgagggat 661 cttccatacc taccagttct gcgcctgcag gtc // LOCUS HSU64820 1899 bp mRNA PRI 18-JUL-1997 DEFINITION Homo sapiens josephin MJD1 mRNA, complete cds. ACCESSION U64820 NID g2262194 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1899) AUTHORS Goto,J., Watanabe,M., Ichikawa,Y., Yee,S.-B., Ihara,N., Endo,K., Igarashi,S., Takiyama,Y., Gaspar,C., Maciel,P., Tsuji,S., Rouleau,G.A. and Kanazawa,I. TITLE Machado-Joseph disease gene products carrying different carboxyl termini JOURNAL Neurosci. Res. (1997) In press REFERENCE 2 (bases 1 to 1899) AUTHORS Goto,J. TITLE Direct Submission JOURNAL Submitted (23-JUL-1996) Neurology, Institute for Brain Research, Faculty of Medicine, University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113, Japan FEATURES Location/Qualifiers source 1..1899 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="14" /map="14q32.1" CDS 58..1143 /note="Machado-Joseph disease gene product" /codon_start=1 /product="josephin MJD1" /db_xref="PID:g2262195" /translation="MESIFHEKQEGSLCAQHCLNNLLQGEYFSPVELSSIAHQLDEEE RMRMAEGGVTSEDYRTFLQQPSGNMDDSGFFSIQVISNALKVWGLELILFNSPEYQRL RIDPINERSFICNYKEHWFTVRKLGKQWFNLNSLLTGPELISDTYLALFLAQLQQEGY SIFVVKGDLPDCEADQLLQMIRVQQMHRPKLIGEELAQLKEQRVHKTDLERVLEANDG SGMLDEDEEDLQRALALSRQEIDMEDEEADLRRTIQLSMQGSSRNISQDMTQTSGTNL TSEELRKRREAYFEKQQQKQQQQQQQQQQGDLSGQSSHPCERPATSSGALGSDLGDAM SEEDMLQAAVTMSLETVRNDLKTEGKK" BASE COUNT 600 a 330 c 419 g 550 t ORIGIN 1 ggggcggact ggagggggtg gttcggcgtg ggggccgttg gctccagaca aataaacatg 61 gagtccatct tccacgagaa acaagaaggc tcactttgtg ctcaacattg cctgaataac 121 ttattgcaag gagaatattt tagccctgtg gaattatcct caattgcaca tcagctggat 181 gaggaggaga ggatgagaat ggcagaagga ggagttacta gtgaagatta tcgcacgttt 241 ttacagcagc cttctggaaa tatggatgac agtggttttt tctctattca ggttataagc 301 aatgccttga aagtttgggg tttagaacta atcctgttca acagtccaga gtatcagagg 361 ctcaggatcg atcctataaa tgaaagatca tttatatgca attataagga acactggttt 421 acagttagaa aattaggaaa acagtggttt aacttgaatt ctctcttgac gggtccagaa 481 ttaatatcag atacatatct tgcacttttc ttggctcaat tacaacagga aggttattct 541 atatttgtcg ttaagggtga tctgccagat tgcgaagctg accaactcct gcagatgatt 601 agggtccaac agatgcatcg accaaaactt attggagaag aattagcaca actaaaagag 661 caaagagtcc ataaaacaga cctggaacga gtgttagaag caaatgatgg ctcaggaatg 721 ttagacgaag atgaggagga tttgcagagg gctctggcac taagtcgcca agaaattgac 781 atggaagatg aggaagcaga tctccgcagg actattcagc taagtatgca aggtagttcc 841 agaaacatat ctcaagatat gacacagaca tcaggtacaa atcttacttc agaagagctt 901 cggaagagac gagaagccta ctttgaaaaa cagcagcaaa agcagcaaca gcagcagcag 961 cagcagcagc agggggacct atcaggacag agttcacatc catgtgaaag gccagccacc 1021 agttcaggag cacttgggag tgatctaggt gatgctatga gtgaagaaga catgcttcag 1081 gcagctgtga ccatgtcttt agaaactgtc agaaatgatt tgaaaacaga aggaaaaaaa 1141 taataccttt aaaaaataat ttagatattc atactttcca acattatcct gtgtgattac 1201 agcatagggt ccactttggt aatgtgtcaa agagatgagg aaataagact tttagcggtt 1261 tgcaaacaaa atgatgggaa agtggaacaa tgcgtcggtt gtaggactaa ataatgatct 1321 tccaaatatt agccaaagag gcattcagca attaaagaca tttaaaatag ttttctaaat 1381 gtttcttttt cttttttgag tgtgcaatat gtaacatgtc taaagttagg gcatttttct 1441 tggatctttt tgcagactag ctaattagct ctcgcctcag gctttttcca tatagtttgt 1501 tttctttttc tgtcttgtag gtaagttggc tcacatcatg taatagtggc tttcatttct 1561 tattaaccaa attaaccttt caggaaagta tctctacttt cctgatgttg ataatagtaa 1621 tggttctaga aggatgaaca gttctccctt caactgtata ccgtgtgctc cagtgttttc 1681 ttgtgttgtt ttctctgatc acaacttttc tgctacctgg ttttcattat tttcccacaa 1741 ttcttttgaa agatggtaat cttttctgag gtttagcgtt ttaagcccta cgatgggatc 1801 attatttcat gactggtgcg ttcctaaact ctgaaatcag ccttgcacaa gtacttgaga 1861 ataaatgagc attttttaaa aaaaaaaaaa aaaaaaaaa // LOCUS HSU64998 453 bp DNA PRI 05-NOV-1997 DEFINITION Human ribonuclease k6 precursor gene, complete cds. ACCESSION U64998 NID g2585987 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 453) AUTHORS Rosenberg,H.F. and Dyer,K.D. TITLE Molecular cloning and characterization of a novel human ribonuclease (RNase k6): increasing diversity in the enlarging ribonuclease gene family JOURNAL Nucleic Acids Res. 24 (18), 3507-3513 (1996) MEDLINE 96433147 REFERENCE 2 (bases 1 to 453) AUTHORS Rosenberg,H.F. TITLE Direct Submission JOURNAL Submitted (24-JUL-1996) Laboratory of Host Defenses/NIAID/NIH, 10 Center Dr., MSC 1886, Bethesda, MD 20892-1886, USA FEATURES Location/Qualifiers source 1..453 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="14" CDS 1..453 /note="RNase k6" /codon_start=1 /product="ribonuclease k6 precursor" /db_xref="PID:g2585988" /translation="MVLCFPLLLLLLVLWGPVCPLHAWPKRLTKAHWFEIQHIQPSPL QCNRAMSGINNYTQHCKHQNTFLHDSFQNVAAVCDLLSIVCKNRRHNCHQSSKPVNMT DCRLTSGKYPQCRYSAAAQYKFFIVACDPPQKSDPPYKLVPVHLDSIL" mat_peptide 70..450 /note="RNase k6" /product="ribonuclease k6" BASE COUNT 110 a 127 c 90 g 126 t ORIGIN 1 atggtgctat gctttcctct tcttttactg ctgctggttc tatggggacc agtgtgtcca 61 cttcatgctt ggcctaagcg tctcaccaag gctcactggt ttgaaattca gcatatacag 121 ccaagtcctc tccaatgcaa cagggcaatg agtggcatca acaattatac ccagcactgt 181 aagcatcaaa atacctttct gcatgactct ttccagaacg tggctgctgt ctgtgatttg 241 ctcagcattg tctgcaaaaa tcgtcggcac aactgccacc agagctcaaa gcctgtcaac 301 atgactgact gcagactcac ttcaggaaag tatccccagt gccgctatag tgctgctgcc 361 cagtacaaat tcttcattgt tgcctgtgac ccccctcaga agagcgatcc cccctacaag 421 ttggttcctg tacacttaga tagtattctc taa // LOCUS HSU65002 7313 bp mRNA PRI 14-FEB-1997 DEFINITION Human zinc finger protein PLAG1 mRNA, complete cds. ACCESSION U65002 NID g1839159 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7313) AUTHORS Kas,K., Voz,M.L., Roijer,E., Astrom,A.K., Meyen,E., Stenman,G. and Van de Ven,W.J. TITLE Promoter swapping between the genes for a novel zinc finger protein and beta-catenin in pleiomorphic adenomas with t(3;8)(p21;q12) translocations JOURNAL Nature Genet. 15 (2), 170-174 (1997) MEDLINE 97172974 REMARK Erratum:[[published erratum appears in Nat Genet 1997 Apr;15(4):411]] REFERENCE 2 (bases 1 to 7313) AUTHORS Kas,K., Voz,M.L., Roijer,E., Meyen,E., Stenman,G. and Van de Ven,W.J.M. TITLE Direct Submission JOURNAL Submitted (24-JUL-1996) Lab for Molecular Oncology, Center of Human Genetics - K.U.Leuven, Herestraat 49, Leuven 3000, Belgium FEATURES Location/Qualifiers source 1..7313 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="8" /map="8q12" exon 1..159 /number=1 exon 160..264 /number=2 exon 265..362 /number=3 exon 363..722 /number=4 CDS 481..1983 /codon_start=1 /product="zinc finger protein PLAG1" /db_xref="PID:g1839160" /translation="MATVIPGDLSEVRDTQKVPSGKRKRGETKPRKNFPCQLCDKAFN SVEKLKVHSYSHTGERPYKCIQQDCTKAFVSKYKLQRHMATHSPEKTHKCNYCEKMFH RKDHLKNHLHTHDPNKETFKCEECGKNYNTKLGFKRHLALHAATSGDLTCKVCLQTFE STGVLLEHLKSHAGKSSGGVKEKKHQCEHCDRRFYTRKDVRRHMVVHTGRKDFLCQYC AQRFGRKDHLTRHMKKSHNQELLKVKTEPVDFLDPFTCNVSVPIKDELLPVMSLPSSE LLSKPFTNTLQLNLYNTPFQSMQSSGSAHQMITTLPLGMTCPIDMDTVHPSHHLSFKY PFSSTSYAISIPEKEQPLKGEIESYLMELQGGVPSSSQDSQASSSSKLGLDPQIGSLD DGAGDLSLSKSSISISDPLNTPALDFSQLFNFIPLNGPPYNPLSVGSLGMSYSQEEAH SSVSQLPTQTQDLQDPANTIGLGSLHSLSAAFTSSLSTSTTLPRFHQAFQ" exon 723..7313 /number=5 BASE COUNT 2287 a 1327 c 1357 g 2342 t ORIGIN 1 ggcagcgcat acactacaat ggctgctgga aagaggcgta aggaaacaat ttccaggccc 61 gccgcgtcca gcccgaaata tgagaaaaaa attattagaa attccgcggg cggtgtagag 121 gcggcggacg ggccggaggg aggatgttaa agccccgcgg ttgcctcttg gtgctgcctt 181 ggccgtattt ggcacccaga atgcttcatt ctgtgacggt ctattaataa ggttgccttg 241 ctagagtttg gagcagggcc tcagattggc caaaatggga aggattggat tccactctct 301 tccacgaaga gtcaatggga ctggctaaga tcaaagtctg aggctttttc catcagtaat 361 cagtcccttt ttgctttctt ttacgaccac atgaaacttg agaagccacc taaagctata 421 tcatttagtg gagttgggca gttcccaagt gtccaacaag aaggcctggt ttaggctgcg 481 atggccactg tcattcctgg tgatttgtca gaagtaagag atacccagaa agtcccttca 541 gggaaacgta agcgtggtga aaccaaacca agaaaaaact ttccttgcca actgtgtgac 601 aaggccttta acagtgttga gaaattaaag gttcactcct actctcacac aggagagagg 661 ccctacaagt gcatacaaca agactgcacc aaggcctttg tttctaagta caaattacaa 721 aggcacatgg ctactcattc tcctgagaaa acccacaagt gtaattattg tgagaaaatg 781 tttcaccgga aagatcatct gaagaatcac ctccatacac acgaccctaa caaagagacg 841 tttaagtgcg aagaatgtgg caagaactac aataccaagc ttggatttaa acgtcacttg 901 gccttgcatg ccgcaacaag tggtgacctc acctgtaagg tatgtttgca aacttttgaa 961 agcacgggag tgcttctgga gcaccttaaa tctcatgcag gcaagtcgtc tggtggggtt 1021 aaagaaaaaa agcaccagtg cgaacattgt gatcgccggt tctacacccg aaaggatgtc 1081 cggagacaca tggtggtgca cactggaaga aaggacttcc tctgtcagta ttgtgcacag 1141 agatttgggc gaaaggatca cctgactcga catatgaaga agagtcacaa tcaagagctt 1201 ctgaaggtca aaacagaacc agtggatttc cttgacccat ttacctgcaa tgtgtctgtg 1261 cctataaaag acgagctcct tccggtgatg tccttacctt ccagtgaact gttatcaaag 1321 ccattcacaa acactttgca gttaaacctc tacaacactc catttcagtc catgcagagc 1381 tcgggatctg cccaccaaat gatcacaact ttacctttgg gaatgacatg cccaatagat 1441 atggacactg ttcatccctc tcaccacctt tctttcaaat atccgttcag ttctacctca 1501 tatgcaattt ctattcctga aaaagaacag ccattaaagg gggaaattga gagttacctg 1561 atggagttac aaggtggcgt gccctcttca tcccaagatt ctcaagcatc gtcatcatct 1621 aagctagggt tggatcctca gattgggtcc ctagatgatg gtgcaggaga cctctcccta 1681 tccaaaagct ctatctccat cagtgacccc ctaaacacac cagcattgga tttttctcag 1741 ttgtttaatt tcataccttt aaatggtcct ccctataatc ctctatcagt ggggagcctt 1801 ggaatgagct attcccagga agaagcacat tcttctgttt cccagctccc cacacaaaca 1861 caggatcttc aggatcctgc aaacactata gggcttgggt ctctgcactc actgtcagca 1921 gctttcacca gcagtttaag cacaagtacc accctcccac gtttccatca agcttttcag 1981 taggattctg ggacatggat tcattacaga aatgtatgtg tagctgtgcc ctagatgacc 2041 atttttattt tagtgcctac tttaaaacag tataaaaatt tctgcttttg tataatacaa 2101 attttcatta agccagtata aaatagaaac tagcttttaa actgagcttt ggaaccattt 2161 gtgttcagtt aagtttacct gggtattttg tcctgattca ctgccaattg tcacatttta 2221 agactttttt tttttccata taggaaagcc attattagta gtaaactttt acaaatccca 2281 ttttcaaatt acttttagat cttaaaattt tcatttttgt ctaataacag tggctctacc 2341 ttttgacatc tggctcatta aaaaatttag caatagaatg taaattgtat aaaaagtttg 2401 tgaataactc aagggtttaa attttcttac tagcttctaa atggattaat aatcaagtgc 2461 ttcaaatgaa ttaagagtcc agtttcggaa gataataaat gtttgttaga tacaccataa 2521 tttcagatca gtatattctg aagactctct gttgtctggc taaaatattt gccatcttta 2581 ttatgagcct ttaaggaaaa caaaccctaa acacaaagca tcagtattta tagcaaaaag 2641 agactctgtt aggtgacatg gcatttcgtg tcacttaata gttggcccta aattagtaca 2701 caggatattt tgtcgtgttt catccttctt aacatgctat cttttcattt aataatagta 2761 atagtgtatg gcattggggt cttcagagtc gatatatagg tagatctctt tagtcttttc 2821 cacctttcac atccaagggg tgggtcaagt gcagccagca atttattttc attgttggcc 2881 cacggttagt ccataatcta gagccattgt ggaactgcag ccatgaggtg tgtttatccc 2941 acagtggatt gactcagcct ctgtgggtga cagacttcta agcaggaaga tagacgtgaa 3001 gcacatggtt acatttggga acttgtgtag ggatcatggc ccctgtagcc agggttaaaa 3061 actggacttt ttagaagtaa agtaaaagca tagcgcttat atcatttctt gctgaatttg 3121 atatgttttt ctttccctta agaatcaaaa gcagaaaaca aaaacaacag tcctactccg 3181 atgttatctt tctgattcaa tgtgaatcca tctttccttg caatattttg gatggagaat 3241 ttgaagttaa atgcattaga aaactacctg atgaactacc acaaagtttt aagtgactag 3301 aaatatatac agtaaaatcc cactttcatg catctctggg aaatgatagg agtattgcaa 3361 ataagttgag tttgtagagg gtaacaaagt aaagtaaaac aaacctatct tggttaacat 3421 gaaaataaca attgagaata tattatattc actgaataat tataggcttt tcctcacatt 3481 agacaaccaa cataatcttc ttaaaggtct aattaatata tttttctaag ggtcagttgg 3541 gacattaacc taagaaacat atctattaag cacttgttaa caccttattt taggaccctt 3601 tccgttgggg atgggggcaa gggtgggagg tttttagaag agtatatatc tctttaaaaa 3661 aaaacagaaa gaaaaatatt tctgagcact cattagccct atatggaaac ttctttcctt 3721 tttgtagggc cagttatcac tgcagattgc aatgtttacc aagaatttct aaaaatgagt 3781 gcagattact gaatataata cattatttaa aatatttggg agtagtataa tttgttgaga 3841 aatgtaaatt gtaataatgt aaatgggggg cttcaatata tatatataat acacacacac 3901 acacacatgc acacataccg cacttcatag aatcaaagtt gctctctgaa ggagctttgg 3961 ctcctgatat tttatcatgc tcctatattt ttttaatcct tggagcagta gtttttatac 4021 ttatgtattt aaattttatt atgaaaaatt acatttatta aaaaagtgtg ttccaaaggc 4081 attaaaatta tatatgttaa taaggaagta catttttaaa tttttcaaac tgctcctagc 4141 ttttgattag gagaatattt tttctgaaag taggcttttc gctctgcttc attactgctt 4201 cctttagttt ctatgaaaca gattgcttac ctaaatcttt agttgaatga ttagtgttca 4261 atattgcttt aatcaccata taaaaggaaa aaaattggtg acagagcaca aatagaaaac 4321 ctatttttaa atagaaatca caaatagcaa gtgtggaagc actactttat tctgtttaaa 4381 atgtacttaa gaagtcatca aattagtgaa ctgagacatt ggccttagta ggctgtattc 4441 actgctaatt taaaaaaggg agtaccagga tttattaagt aaagcatttt ggaaatgggg 4501 aatagcgcca tatatgtatg tatgtgtatg tgtgtgtgtg gtgtgtgtat atatacacac 4561 acacatacat acttaaatct tgccctgcat gaaattcaaa tacatggagg cacatcttca 4621 gggcaccagt gttaaaattt tggagtctta attttcatgt gtacacctct ttgcctgttc 4681 ccacccccag acttgaaata acacttcaga gtaagaggga attcagctaa tttgttttta 4741 aaattgactg tagtggtcac taaacccttt ttgagagaat ttctattaaa gatgaggcag 4801 actcgcttat ttgaattgca caatgttcta acaaggatgt aacacagaat tggctttttt 4861 ttccctagaa aaagattgtt tgtttctatg tcaactagat atgattaaaa ataagtattg 4921 ccaatgctgt tttcattctc tagtggccag aatcattatc cttgaaattt ctggtagtgc 4981 cttagcttgg ttaaaaaaaa aaaaaaaaaa aaaaaaaaag ggattaacat taaataaaag 5041 tagtttagaa tttgggcctc agacaagata ttgaacctca ttcagtttca cttccacatg 5101 tatgtacaag ttaggtcacc aaacacggaa gttgagtgtg gaaggatctt ggcactgtaa 5161 gcaatgctat ccattgatgt atacaagtac ctttatagtt atcgatcact gttaaaactt 5221 tcattttaaa atcctattac caagttcagt tttttaaaac ttcaattgtc ctggctgatt 5281 atgcatcact ctgtgtgcaa cttttttatt tcatttagtg tttctttcaa gctgtgtatt 5341 tttgcctatt tgttgcttgt gctttatttt tcttagtcat ttgtggaata tagtgatata 5401 ttgtgttaat ttggacagta gcggttttta aaaaccatat actgactgaa acatgagcca 5461 gagccgattg ctttattaag ctaataatga atgttaaaga gtacatattt tcaggatcgt 5521 tcatctagtg agcaatacac atattatagg ccaatatttt tttaaaaaat agagcttggt 5581 caacctctat actacacata ttacaagata tagcactttc aaaatgaatc taaaccttta 5641 cagaaacttt cttataggtt atgcctttta ttttaagact tattataatt caagtgccat 5701 tagatgatat atatgtaggc ctttgatata taatgctttg tgtacaaaaa tggtagatgg 5761 tattttaaac aggtacattt ttacagtgtt ttcttatcaa tttgctatat tgcacagaat 5821 cagtgtgtgt cttttcataa ggttttacaa tggtttattt ttttacaagg tttacgtgtc 5881 tcaaagcaca ctgtcttccc agtacgtaag ttaaaaaata ccagttcacc caagttgctt 5941 ctagcctact gagatccatg tgacattgga ggagatcttt taaatgttta gtattcgtca 6001 ttagcaatgg ctggctgtta gttctggtaa atgtgtgcct aagttgaatt tgtcttgttt 6061 ttctcacact gtgtcagcag ccatgtctac aacacagata agtctgttgt gatcacatag 6121 atctacataa gttgtgcagt tttgtgctaa aaacccatag ggagctcctt tgggatcata 6181 gaaaagaaga tcatgcaacc agcattggtg aaggcacact cagattgcac ttagggcctt 6241 tctatgatgt tgtcaaccct ctgaggatgg aaggcagtgt cttttgatgt tatctagcct 6301 agaaatgaca cagaactatt gctaatgtat aaaacacttc attatataag cttcagtggt 6361 acagatgaac cagaatgaat gtttatcttc tcagaaacac tccttcaata ttatattgga 6421 tcatgctgct aatgtaactt gggctacaac tcttcatggt gctacaaact tctctgtctc 6481 attcagtcgt atttttttat ccatagaaaa aggactacat taggtgtaaa agtgtacaat 6541 atatttttat actgtgactt aatttgtcat taacaaactt ttacaccacc acaatgtatt 6601 catgtgcact tgcaaaagga gatctcggac atgcaaatgt taccagaaca aacccagctt 6661 ttgtccacaa ggtgactgta actcagaatg gaaagtgggc tttataatag ggtgtggagt 6721 gaagaacatg ctgtatgtta ctaacagccc tttgaattta acaaaaactg ggaatccatt 6781 aggaaacgga ttgcatcata cctgaacata agctggactg ctgaaattgt atttttagct 6841 aatgaaaaag tgtttggact agtactctaa aaatgttcta atgataaagt tttgagtcaa 6901 aatagaaaag aaaaaaatct gcattccagg ccgaattttg tatattttta ttgcatttaa 6961 aattgctatt ctgtaatatt gggaaatcaa gtggcttatc atgtatatcg tgtacttaaa 7021 atgtattcac aaactactgt tgtatttgta taaaatatag acaaagatca tattttttgt 7081 gtgtgtataa gctctgtaaa atagcaatca cattatgaag ctgcagtgat actacatttt 7141 aaacattcac atccaaagaa gcagactatt tattgtccat ataccagatt taaaatatta 7201 atttgctgct aattaaataa tagtactgca gcttcttgtg gcctacagtg ttatgtttgc 7261 tgtaagaata agatatgtga attccacaaa atatatgaat aaaatctcgt gcc // LOCUS HSU65011 2148 bp mRNA PRI 22-MAR-1997 DEFINITION Human preferentially expressed antigen of melanoma (PRAME) mRNA, complete cds. ACCESSION U65011 NID g1903383 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2148) AUTHORS Ikeda,H., Lethe,B., Lehmann,F., Van Baren,N., Baurain,J.-F., De Smet,C., Chambost,H., Vitale,M., Moretta,A., Boon,T. and Coulie,P.G. TITLE Characterization of an antigen that is recognized on a melanoma showing partial HLA loss by CTL expressing an NK inhibitory receptor JOURNAL Immunity 6 (2), 199-208 (1997) MEDLINE 97199265 REFERENCE 2 (bases 1 to 2148) AUTHORS Ikeda,H., Lethe,B., Baurain,J.-F. and Coulie,P.G. TITLE Direct Submission JOURNAL Submitted (24-JUL-1996) Ludwig Institute for Cancer Research, 74 avenue Hippocrate UCL 7459, Brussels B-1200, Belgium FEATURES Location/Qualifiers source 1..2148 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="melanoma cell" /cell_line="LB33-MEL.A" /haplotype="A24 A28 B13 B44 Cw6 Cw7" gene 236..1765 /gene="PRAME" CDS 236..1765 /gene="PRAME" /note="encodes tumor antigen recognized by cytolytic T lymphocytes" /codon_start=1 /product="preferentially expressed antigen of melanoma" /db_xref="PID:g1903384" /translation="MERRRLWGSIQSRYISMSVWTSPRRLVELAGQSLLKDEALAIAA LELLPRELFPPLFMAAFDGRHSQTLKAMVQAWPFTCLPLGVLMKGQHLHLETFKAVLD GLDVLLAQEVRPRRWKLQVLDLRKNSHQDFWTVWSGNRASLYSFPEPEAAQPMTKKRK VDGLSTEAEQPFIPVEVLVDLFLKEGACDELFSYLIEKVKRKKNVLRLCCKKLKIFAM PMQDIKMILKMVQLDSIEDLEVTCTWKLPTLAKFSPYLGQMINLRRLLLSHIHASSYI SPEKEEQYIAQFTSQFLSLQCLQALYVDSLFFLRGRLDQLLRHVMNPLETLSITNCRL SEGDVMHLSQSPSVSQLSVLSLSGVMLTDVSPEPLQALLERASATLQDLVFDECGITD DQLLALLPSLSHCSQLTTLSFYGNSISISALQSLLQHLIGLSNLTHVLYPVPLESYED IHGTLHLERLAYLHARLRELLCELGRPSMVWLSANPCPHCGDRTFYDPEPILCPCFMP N" BASE COUNT 534 a 548 c 558 g 508 t ORIGIN 1 gcttcagggt acagctcccc cgcagccaga agccgggcct gcagcccctc agcaccgctc 61 cgggacaccc cacccgcttc ccaggcgtga cctgtcaaca gcaacttcgc ggtgtggtga 121 actctctgag gaaaaaccat tttgattatt actctcagac gtgcgtggca acaagtgact 181 gagacctaga aatccaagcg ttggaggtcc tgaggccagc ctaagtcgct tcaaaatgga 241 acgaaggcgt ttgtggggtt ccattcagag ccgatacatc agcatgagtg tgtggacaag 301 cccacggaga cttgtggagc tggcagggca gagcctgctg aaggatgagg ccctggccat 361 tgccgccctg gagttgctgc ccagggagct cttcccgcca ctcttcatgg cagcctttga 421 cgggagacac agccagaccc tgaaggcaat ggtgcaggcc tggcccttca cctgcctccc 481 tctgggagtg ctgatgaagg gacaacatct tcacctggag accttcaaag ctgtgcttga 541 tggacttgat gtgctccttg cccaggaggt tcgccccagg aggtggaaac ttcaagtgct 601 ggatttacgg aagaactctc atcaggactt ctggactgta tggtctggaa acagggccag 661 tctgtactca tttccagagc cagaagcagc tcagcccatg acaaagaagc gaaaagtaga 721 tggtttgagc acagaggcag agcagccctt cattccagta gaggtgctcg tagacctgtt 781 cctcaaggaa ggtgcctgtg atgaattgtt ctcctacctc attgagaaag tgaagcgaaa 841 gaaaaatgta ctacgcctgt gctgtaagaa gctgaagatt tttgcaatgc ccatgcagga 901 tatcaagatg atcctgaaaa tggtgcagct ggactctatt gaagatttgg aagtgacttg 961 tacctggaag ctacccacct tggcgaaatt ttctccttac ctgggccaga tgattaatct 1021 gcgtagactc ctcctctccc acatccatgc atcttcctac atttccccgg agaaggaaga 1081 gcagtatatc gcccagttca cctctcagtt cctcagtctg cagtgcctgc aggctctcta 1141 tgtggactct ttatttttcc ttagaggccg cctggatcag ttgctcaggc acgtgatgaa 1201 ccccttggaa accctctcaa taactaactg ccggctttcg gaaggggatg tgatgcatct 1261 gtcccagagt cccagcgtca gtcagctaag tgtcctgagt ctaagtgggg tcatgctgac 1321 cgatgtaagt cccgagcccc tccaagctct gctggagaga gcctctgcca ccctccagga 1381 cctggtcttt gatgagtgtg ggatcacgga tgatcagctc cttgccctcc tgccttccct 1441 gagccactgc tcccagctta caaccttaag cttctacggg aattccatct ccatatctgc 1501 cttgcagagt ctcctgcagc acctcatcgg gctgagcaat ctgacccacg tgctgtatcc 1561 tgtccccctg gagagttatg aggacatcca tggtaccctc cacctggaga ggcttgccta 1621 tctgcatgcc aggctcaggg agttgctgtg tgagttgggg cggcccagca tggtctggct 1681 tagtgccaac ccctgtcctc actgtgggga cagaaccttc tatgacccgg agcccatcct 1741 gtgcccctgt ttcatgccta actagctggg tgcacatatc aaatgcttca ttctgcatac 1801 ttggacacta aagccaggat gtgcatgcat cttgaagcaa caaagcagcc acagtttcag 1861 acaaatgttc agtgtgagtg aggaaaacat gttcagtgag gaaaaaacat tcagacaaat 1921 gttcagtgag gaaaaaaagg ggaagttggg gataggcaga tgttgacttg aggagttaat 1981 gtgatctttg gggagataca tcttatagag ttagaaatag aatctgaatt tctaaaggga 2041 gattctggct tgggaagtac atgtaggagt taatccctgt gtagactgtt gtaaagaaac 2101 tgttgaaaat aaagagaagc aatgtgaagc aaaaaaaaaa aaaaaaaa // LOCUS HSU65090 5801 bp mRNA PRI 03-OCT-1997 DEFINITION Human carboxypeptidase D mRNA, complete cds. ACCESSION U65090 NID g2462776 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5801) AUTHORS Tan,F., Rehli,M., Krause,S.W. and Skidgel,R.A. TITLE Sequence of human carboxypeptidase D reveals it to be a member of the regulatory carboxypeptidase family with three tandem active site domains JOURNAL Biochem. J. 327 (Pt 1), 81-87 (1997) MEDLINE 97454446 REFERENCE 2 (bases 1 to 5801) AUTHORS McGwire,G.B., Tan,F., Michel,B., Rehli,M. and Skidgel,R.A. TITLE Identification of a membrane-bound carboxypeptidase as the mammalian homolog of duck gp180, a hepatitis B virus-binding protein JOURNAL Life Sci. 60 (10), 715-724 (1997) MEDLINE 97205589 REFERENCE 3 (bases 1 to 5801) AUTHORS Tan,F., Rehli,M. and Skidgel,R.A. TITLE Direct Submission JOURNAL Submitted (24-JUL-1996) Pharmacology, Univ. of Illinois College of Medicine at Chicago, 835 S. Wolcott (M/C 868), Chicago, IL 60612, USA FEATURES Location/Qualifiers source 1..5801 /organism="Homo sapiens" /note="6 clones were used to determine the sequence" /db_xref="taxon:9606" source 1..664 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="CPD-5'" /tissue_type="placenta" sig_peptide 36..125 /evidence=not_experimental CDS 36..4169 /function="specifically cleaves C-terminal Arg or Lys residues from peptides at an acidic pH optimum of 6.0 - 6.5" /note="duck gp180 homolog" /codon_start=1 /product="carboxypeptidase D" /db_xref="PID:g2462777" /translation="MASGRDERPHCVGRLLLLMCLLLLGSSARAAHIKKAEATTTTTS AGARGRGQFDRYYHEEELESALREAAAAGLPGLARLFSIGRSVEGRPLWVLRLTAGLG SLIPEGDAGPDAAGPDAAGPLLPGRPQVKLVGNMHGDETVSRQVLIYLARELAALPPG DPRLVRLLNTTDVYLLPSLNPDGFERAREGDCGFGDGGPSGASGRDNSRGRDLNRSFP DQFSTGEPPALDEVPEVRALIEWIRRNKFVLSGNLHGGSVVASYPFDDSPEHKATGIY SKTSDDEVFKYLAKAYASNHPIMKTGEPHCPGDEDETFKDGITNGAHWYDVEGGMQDY NYVWANCFEITLELSCCKYPPASQLRQEWENNRESLITLIEKVHIGVKGFVKDSITGS GLENATISVAGINHNITTGRFGDFYRLLVPGTYNLTVVLTGYMPLTVTNVVVKEGPAT EVDFSLRPTVTSVIPDTTEAVSTASTVAIPNILSGTSSSCQPIQPKDFHHHHFPDMEI FLRRFANEYPNITRLYSLGKSVESRELYVMEISDNPGVHEPGEPEFKYIGNMHGNEVV GRELLLNLIEYLCKNFGTDPEVTDLVHNTRIHLMPSMNPDGYEKSQEGDSISVIGRNN SNNFDLNRNFPDQFVQITDPTQPETIAVMSWMKSYPFVLSANLHGGSLVVNYPFDDDE QGLATYSKSPDDAVFQQIALSYSKENSQMFQGRPCKNMYPNEYFPHGITNGASWYNVP GGMQDWNYLQTNCFEVTIELGCVKYPLEKELPNFWEQNRRSLIQFMKQVHQGVRGFVL DATDGRGILNATISVAEINHPVTTYKTGDYWRLLVPGTYKITASARGYNPVTKNVTVK SEGAIQVNFTLVRSSTDSNNESKKGKGASSSTNDASDPTTKEFETLIKDLSAENGLES LMLRSSSNLALALYRYHSYKDLSEFLRGLVMNYPHITNLTNLGQSTEYRHIWSLEISN KPNVSEPEEPKIRFVAGIHGNAPVGTELLLALAEFLCLNYKKNPAVTQLVDRTRIVIV PSLNPDGRERAQEKDCTSKIGQTNARGKDLDTDFTNNASQPETKAIIENLIQKQNFSL SVALDGGSMLVTYPYDKPVQTVENKETLKHLASLYANNHPSMHMGQPSCPNKSDENIP GGVMRGAEWHSHLGSMKDYSVTYGHCPEITVYTSCCYFPSAARLPSLWADNKRSLLSM LVEVHKGVHGFVKDKTGKPISKAVIVLNEGIKVQTKEGGYFHVLLAPGVHNIIAIADG YQQQHSQVFVHHDAASSVVIVFDTDNRIFGLPRELVVTVSGATMSALILTACIIWCIC SIKSNRHKDGFHRLRQHHDEYEDEIRMMSTGSKKSLLSHEFQDETDTEEETLYSSKH" misc_feature 126..1415 /note="encodes carboxypeptidase domain 1" misc_feature 540..542 /note="encodes potential glycosylation site" source 664..1124 /note="EST deposited under GenBank Accession Number H04765; this clone provided by the IMAGE Consortium, LLNL" /organism="Homo sapiens" /db_xref="taxon:9606" /clone="152294" source 664..1487 /note="EST deposited under GenBank Accession Number R68612; this clone provided by the IMAGE Consortium, LLNL" /organism="Homo sapiens" /db_xref="taxon:9606" /clone="138429" source 1074..3567 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="PCR25" /tissue_type="liver" misc_feature 1221..1223 /note="encodes potential glycosylation site" misc_feature 1254..1256 /note="encodes potential glycosylation site" source 1263..2570 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="cpdmak-01, mean size of 1.3 kb" /cell_type="macrophage, differentiated in vitro" /clone_lib="MAC library; M. Rehli et al. Biochem. Biophys. Res. Commun. 217, 661-667 (1995)" misc_feature 1311..1313 /note="encodes potential glycosylation site" misc_feature 1416..1517 /note="encodes putative connecting peptide" misc_feature 1518..2636 /note="encodes carboxypeptidase domain 2" misc_feature 1590..1592 /note="encodes potential glycosylation site" misc_feature 1902..1904 /note="encodes potential glycosylation site" misc_feature 2457..2459 /note="encodes potential glycosylation site" misc_feature 2589..2591 /note="encodes potential glycosylation site" misc_feature 2625..2627 /note="encodes potential glycosylation site" misc_feature 2637..2723 /note="encodes putative connecting peptide" misc_feature 2661..2663 /note="encodes potential glycosylation site" misc_feature 2724..3923 /note="encodes carboxypeptidase domain 3" misc_feature 2889..2891 /note="encodes potential glycosylation site" misc_feature 2958..2960 /note="encodes potential glycosylation site" misc_feature 3234..3236 /note="encodes potential glycosylation site" misc_feature 3288..3290 /note="encodes potential glycosylation site" misc_feature 3450..3452 /note="encodes potential glycosylation site" source 3522..5801 /note="EST deposited under GenBank Accession Number R52214; this clone provided by the IMAGE Consortium, LLNL" /organism="Homo sapiens" /db_xref="taxon:9606" /clone="40521" misc_feature 3924..4001 /note="transmembrane-region site" misc_feature 4002..4166 /note="putative cytoplasmic domain" 3'UTR 4170..5801 polyA_signal 5749..5754 polyA_site 5774..5801 BASE COUNT 1679 a 1247 c 1305 g 1570 t ORIGIN 1 gaattcgcgg ccgcgtcgac cggcgctgct ggaagatggc gagcggccgg gacgagcggc 61 cgcattgcgt agggcggctc ctgttgctca tgtgcctgct gctgctgggg agctcggccc 121 gggcggctca catcaagaag gcggaggcga ctaccacaac tacgagcgcg ggcgcgaggg 181 gccgagggca gttcgaccgc tactaccacg aagaggagtt ggagtcggcg ctgagggagg 241 cggcggccgc gggcctcccc ggcctggccc gcctctttag catcggccgc tcggtggaag 301 gccggccgct gtgggtgctt cgcctcaccg ccggcctggg gtcgctaatc cctgagggcg 361 acgcggggcc tgacgctgcc gggcccgacg ctgcggggcc gctgctgccc ggccggcccc 421 aggtgaagct ggtgggcaac atgcatggcg acgagaccgt gtcgcgccag gtgttgatct 481 acttggcccg cgagctggcg gcgctaccgc ccggggaccc gcgcctggtc cgcctgctca 541 acaccaccga cgtgtacctg ctgcccagcc tcaaccccga tggcttcgag cgtgcccgcg 601 agggcgactg tggcttcggc gacggcggcc cgtccggggc cagcggccgc gacaatagtc 661 gcggccgcga cctcaaccga agctttcccg accagtttag caccggcgaa ccccccgccc 721 tggacgaggt gcccgaggtg cgcgccctca tcgagtggat ccgcaggaac aagtttgtgc 781 tttctggaaa tctgcatggt ggctcagtgg tagcaagcta tccttttgat gattctccag 841 aacataaggc cactggaatc tatagcaaaa cctcagatga tgaagtattt aaatacttgg 901 caaaagctta tgcttcaaac caccccataa tgaaaactgg tgagcctcat tgtccaggag 961 atgaagacga gactttcaaa gatggaatca caaacggcgc acattggtat gatgtggaag 1021 gtggtatgca agattacaat tatgtgtggg ccaactgttt tgagatcaca ttagaactgt 1081 cttgttgcaa gtacccacct gcttcacagc ttcgacagga atgggagaac aatcgtgagt 1141 ctttgatcac attgattgaa aaggttcaca ttggagtgaa aggatttgtt aaagattcca 1201 taacaggatc tgggttagag aatgcaacca tctcagtggc tggtattaat cataatatca 1261 caacaggcag atttggtgat ttctaccgat tacttgttcc tggaacttac aaccttacag 1321 tagttttaac tgggtatatg ccattgactg ttactaatgt agtggtgaaa gaaggaccag 1381 ccacagaggt ggatttttct cttaggccaa ctgtaacttc agtaatccct gacacgacag 1441 aggctgtatc aactgctagc acagttgcta tacctaatat tctttctgga acatcatcct 1501 cctgccagcc aattcagcca aaggactttc accaccacca tttccctgat atggaaatct 1561 tcttgagaag gtttgccaat gaatatccta acattacccg gctttattcc ttgggaaaat 1621 cagtagagtc aagagaactt tatgtgatgg agatatctga taatccgggt gtccatgaac 1681 caggtgaacc agaatttaag tacattggaa atatgcatgg aaatgaagtg gttggaagag 1741 aactgctgtt gaacctcata gaataccttt gtaagaactt tggaacagac cctgaagtca 1801 cagatttggt tcataacact agaattcacc ttatgccatc catgaatcct gatgggtatg 1861 aaaagtccca ggaaggagat tcaataagtg taattggcag aaacaacagc aacaactttg 1921 acctgaaccg aaatttccca gaccagtttg ttcagatcac agatcctacg caaccagaaa 1981 ctattgctgt aatgagctgg atgaagtcct atccatttgt actttcagca aacctgcatg 2041 gaggttcttt ggtggttaac tacccttttg atgatgatga acaaggactt gccacatata 2101 gtaaatcacc agatgatgct gtgttccaac aaatagcact ttcttattcc aaggaaaatt 2161 cccagatgtt tcaaggtaga ccttgcaaga atatgtatcc taatgaatat tttcctcatg 2221 gaataacaaa tggagctagt tggtataatg tgccaggagg aatgcaggac tggaactatt 2281 tacaaacaaa ttgctttgaa gtgactattg aactaggttg tgtgaaatat ccacttgaga 2341 aagagctgcc aaacttttgg gaacagaatc gaagatcact aatccagttt atgaaacagg 2401 ttcatcaggg cgtcagagga tttgttctag atgccacaga tggcaggggt atattaaatg 2461 ccaccattag tgttgctgag attaatcacc cagtgactac ttacaaaact ggagattact 2521 ggcgtctctt ggttccagga acttataaaa tcacagcatc tgctcgaggg tataatccag 2581 ttaccaagaa tgtgactgtc aagagtgaag gcgctattca ggtcaacttc acacttgttc 2641 gatcctcaac agattcaaac aatgaatcaa agaaaggaaa aggggctagc agcagcacca 2701 atgatgccag tgatccaact actaaagagt ttgaaacttt aattaaagac ctttcagcgg 2761 agaatggttt ggaaagcctc atgttacgct cctcctcaaa tctggctctg gctctttatc 2821 gataccattc ctacaaagac ttatcagagt ttctgagagg acttgtaatg aactatccac 2881 atattacaaa tcttaccaat ttgggacaga gcactgaata tcgtcacatt tggtcccttg 2941 aaatctccaa taagcccaat gtatctgagc ctgaagaacc aaagattcgt tttgttgctg 3001 gtatccatgg aaatgcgcca gttggaactg aactgctttt ggctctggca gaatttctct 3061 gcctgaacta caaaaagaac ccagctgtta cccaattggt tgacaggact aggattgtga 3121 ttgtcccttc tctaaatcca gatgggcgag agagagctca agagaaagac tgtacttcaa 3181 aaataggaca aacaaatgct cgtggcaaag atttggatac agacttcaca aataatgcct 3241 cccaacctga gaccaaagcc atcattgaaa atttgattca aaaacagaac tttagtcttt 3301 ctgttgcctt agatggtggt tccatgctgg tcacatatcc atatgacaag ccagtacaga 3361 cagtggaaaa taaagagact ctgaagcatt tggcatctct ttatgcaaat aatcatccat 3421 ccatgcacat gggtcagccc agttgcccaa ataaatcaga tgagaatatt ccaggaggag 3481 taatgcgtgg agcagaatgg catagtcacc tgggcagcat gaaggattat agtgtcacct 3541 atggccattg tccggaaatc acagtataca caagctgctg ttactttcct agtgctgcac 3601 gactcccttc cttgtgggca gacaataaga gatctcttct tagtatgtta gtggaggttc 3661 acaagggagt tcatggattt gttaaagata agactggaaa gccaatctct aaagcagtca 3721 ttgtacttaa tgaaggaata aaggtacaaa caaaagaggg aggttatttc catgtactct 3781 tagcgccagg tgtccataac attattgcca tcgctgatgg gtaccagcaa caacattcac 3841 aggtctttgt gcatcatgat gcagctagtt ctgtggtgat agtctttgac acagataacc 3901 ggatatttgg tttgccaagg gagcttgtgg taactgtatc aggtgctact atgtcggcat 3961 tgatcctaac agcttgcatt atttggtgca tctgctcaat caagtctaat agacacaagg 4021 atggctttca tcggctcagg cagcatcatg atgagtatga agatgaaatt cgcatgatgt 4081 ctaccggctc caagaagtcc ctcctaagcc atgagttcca ggatgaaaca gacactgaag 4141 aggaaacatt atattctagc aaacattgaa aaacacattt tgcatatctc ccagcataag 4201 taccaagcaa aattacagtt cctcttggga gaacactgca ttaagaagag agactctctt 4261 gcttcttcaa agagctttgg gaaattaaat tgctaaattt gtattctctg tgaatttcac 4321 tggcagtttt gaacttccct tccttaaagt actctaaacc tttaaaaaaa aatctgattt 4381 atgcagcaga gatgggacag ccactttttc tttttaattt aagatgagct atttggagct 4441 tatgtaataa tggcataaag ccaactagag gatgttgtat tttgcacatc agatgtttac 4501 tagtggcttt agtatttttc tttgttttaa atggccaaaa gaatccagaa acattaaggc 4561 agggacagca gtcagaatcg acataaagct ttaaaaactc aaggtttttt caacctactg 4621 aggagtactt ttctctagtt gttaaatagc tggagttttt cttattcagg tttaatggag 4681 gttgaattga tttttaaaca catataacag taggaaatga ataaatgggc ttctgcattt 4741 ggctttctac ctgttccaag gctagatcgg aactggtaga ctacgctgta agcaggattt 4801 cactacctct cttaaggttt agcaaacttc taaatagccc attttaaggg agaacttact 4861 aactttattg tgaaaggtct aaatgcccac ttgaatgaag ctgagagaga gatctagcaa 4921 aagctaaaac tcatgttgtc tatctttgaa cttggtaaaa acccacaggt gctgctgctt 4981 atatctgtga agcactagct tattctagga atgcctgatt ctttaatatt gcctaaatcg 5041 gaaccttttt ctatgttgca cacatggttt tcagatgacc cagccatcta caagatctga 5101 attctactga aaatatctag aaatgtggaa gagacctact tgcacattct taacctgtat 5161 ttgaacacaa aatatctata cttcatgctc cagcccaagc ctataccctg taatagcata 5221 ctattattga aatcgcttga ccggtcttgt tcacataggc ctctgggagt gatttggttc 5281 tttgccctaa tgtttcattt gacggtctct ttttgatcaa ccaatttttc taaaagttca 5341 gtcgaaagct tttaagtata gcttcctccc ttgaaaaaaa atgtaaacta tgactgctga 5401 gtgataaaac actgtggtgt gaaagtgtca tcttcactgc caatcaggca aagaccggaa 5461 agatttgcat tttattatgt ctgtcttatc atgcaatgga aatgatgctt tttgtaagta 5521 tgcatcttac caatgatgta acggtttaat acctttgaat gttttaataa ccaagttgct 5581 gctgaactta tactaaatca ggggaccaaa aaacttgctc ttatcttctc aaattgtatt 5641 ctatatccat taatgtatca gttatcccaa agccttcagg tggaggggtt taccaccttc 5701 ctaggtcgtt caaccaggtt ttgtgaggaa tgcattcaaa gtggctttat aaaagaagat 5761 tttctttagc aagaaaaaaa aaaaaaaaaa aaaaaaaaaa a // LOCUS HSU65092 853 bp mRNA PRI 28-FEB-1997 DEFINITION Human melanocyte-specific gene 1 (msg1) mRNA, complete cds. ACCESSION U65092 NID g1853996 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 853) AUTHORS Shioda,T., Fenner,M.H. and Isselbacher,K.J. TITLE msg1, a novel melanocyte-specific gene, encodes a nuclear protein and is associated with pigmentation JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (22), 12298-12303 (1996) MEDLINE 97057236 REFERENCE 2 (bases 1 to 853) AUTHORS Shioda,T., Fenner,M.H. and Isselbacher,K.J. TITLE Direct Submission JOURNAL Submitted (25-JUL-1996) MGH Cancer Center, Massachusetts General Hospital, Bldg. 149, 13th St., Charlestown, MA 02129, USA FEATURES Location/Qualifiers source 1..853 /organism="Homo sapiens" /db_xref="taxon:9606" gene 196..777 /gene="msg1" CDS 196..777 /gene="msg1" /codon_start=1 /product="melanocyte-specific gene 1" /db_xref="PID:g1853997" /translation="MPTTSRPALDVKGGTSPAKEDANQEMSSVAYSNLAVKDRKAVAI LHYPGVASNGTKASGAPTSSSGSPIGSPTTTPPTKPPSFNLHPAPHLLASMQLQKLNS QYQGMAAATPGQPGEAGPLQNWDFGAQAGGAESLSPSAGAQSPAIIDSDPVDEEVLMS LVVELGLDRANELPELWLGQNEFDFTADFPSSC" BASE COUNT 198 a 256 c 234 g 165 t ORIGIN 1 agtggcaccg ctgactgccg agaggaagct cgcctctgcc cggctgccct cttgtagtcc 61 gccggcgagg ggcagttctc ggtgaggagg aagagagcag cggacggcac agcacccgcg 121 cgggccctcc cacaacagct ccagctggca gcatcacttc ccgccaattt atccaacttc 181 tgccaaggct ctgaaatgcc aacaacgtcg aggcctgcac ttgatgtcaa gggtggcacc 241 tcacctgcga aggaggatgc caaccaagag atgagctccg tggcctactc caaccttgcg 301 gtgaaagatc gcaaagcagt ggccattctg cactaccctg gggtagcctc aaatggaacc 361 aaggccagtg gggctcccac tagttcctcg ggatctccaa taggctctcc tacaaccacc 421 cctcccacta aacccccatc cttcaacctg caccccgccc ctcacttgct ggctagtatg 481 cagctgcaga aacttaatag ccagtatcag gggatggctg ctgccactcc aggccaaccc 541 ggggaggcag gacccctgca aaactgggac tttggggccc aggcgggagg ggcagaatca 601 ctctctcctt ctgctggtgc ccagagccct gctatcatcg attcggaccc agtggatgag 661 gaagtgctga tgtcgctggt ggtggaactg gggttggacc gagccaatga gcttccggag 721 ctgtggctgg ggcagaatga gtttgacttc actgcggact ttccatctag ctgctaatgc 781 caagtgtccc taaagatgga ggaataaagc caccaattct gttgtaaata aaaataaagt 841 tacttacaaa gag // LOCUS HSU65378 2640 bp mRNA PRI 02-DEC-1997 DEFINITION Human dentin matrix protein 1 mRNA, complete cds. ACCESSION U65378 NID g2654424 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2640) AUTHORS MacDougall,M., Juan,X., Simmons,D. and Feng,J. TITLE Direct Submission JOURNAL Submitted (25-JUL-1996) Pediatric Dentistry, UTHSCSA, 7703 Floyd Curl Dr., San Antonio, TX 78284-7888, USA FEATURES Location/Qualifiers source 1..2640 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="4" /map="4q21" CDS 100..1593 /codon_start=1 /product="dentin matrix protein 1" /db_xref="PID:g2654425" /translation="MKISILLMFLWGLSCALPVTRYQNNESEDSEEWKGHLAQAPTPP LANEDPSDCTQSEEGLGSDDHQYIYRLAGGFSRSTGKGGDDKDDDEDDSGDDTFGDDD SGPGPKDRQEGGNSRLGSDEDSDDTIQASEESAPQGQDSAQDTTSESRELDNEDRVDS KPEGGDSTQESESEEHWVGGGSDGESSHGDGSELDDEGMQSDDPESIRSERGNSRMNS AGMKSKESGENSEQANTQDSGGSQLLEHPSRKIFRKSRISEEDDRSELDDNNTMEEVK SDSTENSNSRDTGLSQPRRDSKGDSQEDSKENLSQEESQNVDGPSSESSQEANLSSQE NSSESQEEVVSESRGDNPDPTTSYVEDQEDSDSSEEDSSHTLSHSKSESREEQADSES SESLNFSEESPESPEDENSSSQEGLQSHSSSAESQSEESHSEEDDSDSQDSSRSKEDS NSTESKSSSEEDGQLKNIEIESRKLTVDAYHNKPIGDQDDNDCQDGY" BASE COUNT 893 a 510 c 666 g 571 t ORIGIN 1 tgggcataga tttcctcttt gagaacatca acctgatttt tgagactttt tgaaaaaatt 61 ctttgtgaac tacggagggt agaggtatca cacccaacta tgaagatcag catcctgctc 121 atgttccttt ggggattatc ctgtgctctc ccagtaacca ggtatcaaaa taatgaatct 181 gaggattctg aagaatggaa gggtcatttg gctcaggcac caacaccacc cttggcaaat 241 gaagacccca gtgactgcac tcagtcagag gagggcctgg gctctgatga tcatcaatac 301 atttataggc tagctggtgg cttctccagg agcacaggaa aaggaggaga tgataaagat 361 gacgatgaag atgacagtgg agatgacacc tttggtgacg atgacagtgg cccagggccc 421 aaagacagac aagaaggagg aaactccaga ctgggaagtg atgaggactc tgatgacacc 481 atacaagcca gtgaagagag tgccccacaa gggcaagaca gtgcccaaga taccaccagt 541 gagagcaggg aacttgacaa tgaggaccgg gtggacagca agcctgaggg aggtgactcc 601 actcaagaga gtgagagtga agagcactgg gtgggaggtg gcagtgatgg ggagagcagc 661 catggagacg gctccgagtt ggacgatgag ggaatgcaga gtgatgaccc agagagcatc 721 aggagtgaaa ggggaaactc cagaatgaac agtgcaggca tgaaatcaaa agaatctgga 781 gaaaacagtg agcaagcaaa cactcaagat tcaggtggca gccaattgct ggagcatccc 841 agtaggaaaa tttttaggaa gtctcgcatc tcagaggaag atgacagaag cgagcttgat 901 gacaacaaca caatggaaga agtcaagagt gactctacag aaaacagcaa ctccagagac 961 actggcctca gccaacccag gagagacagc aagggtgact ctcaagaaga cagcaaggag 1021 aatctgtccc aggaagagag ccaaaacgta gatggtccca gcagtgagtc cagccaagag 1081 gccaacctgt catctcaaga gaacagcagt gagtctcagg aagaggtggt gagtgagtcc 1141 aggggagata accccgaccc cacaactagt tatgtagaag accaggaaga cagtgactcc 1201 agcgaggagg acagctcgca cacactctcc cactcaaaaa gtgaatccag agaggagcaa 1261 gcagacagcg aatccagtga gagcctcaac ttctcagagg aaagcccgga gtcccctgag 1321 gatgagaaca gctccagcca ggagggcctc cagtctcaca gcagctcagc agagagtcag 1381 agcgaggaaa gccattctga ggaagacgac agtgactctc aagacagcag cagatccaaa 1441 gaagatagca actccacgga gagcaaatca agcagtgagg aagatggcca gttgaaaaac 1501 attgagatag agagccggaa attaacagtt gatgcctatc acaacaaacc cattggggac 1561 caagatgaca atgactgcca agacggctat tagcatcagc tgtcctaaga agcagttgtc 1621 acataaaaga gtcttaggga cttgaaaatg tatcatgata actataattt attgatgttt 1681 tgatcaaaag aataaccaga tgccatattt ttcctgaaag gaattgctgg acattacact 1741 tgtttttagg gtgtcatcat ttcacagagg tttaaatact gtggagtgac accagaacac 1801 agccaaagag gctagaagca agaaaggatc tgcatgataa ctttgcagct gagatagttc 1861 ctaattcatc aacgtaacaa acaaagctat tgggtgtcca tgatatacca ggcactatgc 1921 taggtgttga gaatgtaaag caggttaaga cttgtttctt gctcctgagg agcgtataga 1981 aggacccaca aagctacaga gttaaagagg gatatgtata agagaaacaa gaattgttac 2041 aacccagtat ggtgagtgcc aagacagaga gctatgaaca cgatatctat ctgggaacac 2101 tgagaagggt gaccaatgcg ttggagaacg agggagggct tcatagcaga agacattgga 2161 gttggaaggt tgcataggtt tcctcatagg agaccgggga attctattct tgggataaat 2221 aatagtaaat tctataggat tcaatactta gtttgcaaaa gcactttgaa attatggcac 2281 agtgagttgt tttggataag aaaatctttt tctcccaatt aatttaccaa ttcatcattt 2341 tttttttttg tagaactccc ataaacatca ccttactaat cactggtgat gataagaagg 2401 attgggtcag gaagagtggg agaaagaaat tcctctttac gtagatactt tttagcttta 2461 ttttttctaa aatcagtttg cgtgcaatgc tagaaaaaaa ctgttctctg agtcctttac 2521 agagcaaaat tctgtatgta agattcaatt gatttttgac aaataccatt tgaaatatta 2581 cctcaacata aaatacttgt tttgtaataa agattataat acccagaaaa aaaaaaaaaa // LOCUS HSU65402 2061 bp DNA PRI 03-JUL-1997 DEFINITION Human seven transmembrane G-coupled receptor (GPR31) gene, complete cds. ACCESSION U65402 NID g2065522 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2061) AUTHORS Zingoni,A., Rocchi,M., Storlazzi,C.T., Bernardini,G., Santoni,A. and Napolitano,M. TITLE Isolation and chromosomal localization of GPR31, a human gene encoding a putative G protein-coupled receptor JOURNAL Genomics 42 (3), 519-523 (1997) MEDLINE 97349123 REFERENCE 2 (bases 1 to 2061) AUTHORS Zingoni,A., Rocchi,M. and Napolitano,M. TITLE Direct Submission JOURNAL Submitted (26-JUL-1996) Laboratory of Physiopatology, Regina Elena Cancer Institute, Via delle Messi d'oro 156, Rome 00158, Italy FEATURES Location/Qualifiers source 1..2061 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6q27" gene 1..2061 /gene="GPR31" CDS 499..1458 /gene="GPR31" /codon_start=1 /product="seven transmembrane G-coupled receptor" /db_xref="PID:g2065523" /translation="MPFPNCSAPSTVVATAVGVLLGLECGLGLLGNAVALWTFLFRVR VWKPYAVYLLNLALADLLLAACLPFLAAFYLSLQAWHLGRVGCWALRFLLDLSRSVGM AFLAAVALDRYLRVVHPRLKVNLLSPQAALGVSGLVWLLMVALTCPGLLISEAAQNST RCHSFYSRADGSFSIIWQEALSCLQFVLPFGLIVFCNAGIIRALQKRLREPEKQPKLQ RAQALVTLVVVLFALCFLPCFLARVLMHIFQNLGSCRALCAVAHTSDVTGSLTYLHSV VNPVVYCFSSPTFRSSYRRVFHTLRGKGQAAEPPDFNPRDSYS" BASE COUNT 442 a 565 c 501 g 553 t ORIGIN 1 ggatcccagc aagcgtcctt tatgtatgaa aaggaagaag aaaatttccc catgaaacat 61 attcaaagta gagaacaatc tttttgattc cattgttatt ttaattgtat acagacatag 121 gagtctttgc ataattagac ttttccttct ttcaggactg tggatgcaaa gccctggacc 181 cccagacgtt ataggacatt actcctcagc tttgcagccc ggtgatgtga agcgaaacac 241 catttcccct tttttatggc ggaagaaaac agaacacaac tgcaaagggg cttttccctc 301 ccctgctcat cctctttccc caaatgaatt ttggtttgct gtggactcta ttctgctgag 361 gaactgttct tgttgggcaa atgtagatct tgtctactct gtggcaggaa aaggcctttt 421 ctttcatttt gtaagaaaga gcacagagtt cctcctgtac ctgctccagc tgtgcctgca 481 gcccctcacg gccgggtgat gccattccca aactgctcag cccccagcac tgtggtggcc 541 acagctgtgg gtgtcttgct ggggctggag tgtgggctgg gtctgctggg caacgcggtg 601 gcgctgtgga ccttcctgtt ccgggtcagg gtgtggaagc cgtacgctgt ctacctgctc 661 aacctggccc tggctgacct gctgttggct gcgtgcctgc ctttcctggc cgccttctac 721 ctgagcctcc aggcttggca tctgggccgt gtgggctgct gggccctgcg cttcctgctg 781 gacctcagcc gcagcgtggg gatggccttc ctggccgccg tggctttgga ccggtacctc 841 cgtgtggtcc accctcggct taaggtcaac ctgctgtctc ctcaggcggc cctgggggtc 901 tcgggcctcg tctggctcct gatggtcgcc ctcacctgcc cgggcttgct catctctgag 961 gccgcccaga actccaccag gtgccacagt ttctactcca gggcagacgg ctccttcagc 1021 atcatctggc aggaagcact ctcctgcctt cagtttgtcc tcccctttgg cctcatcgtg 1081 ttctgcaatg caggcatcat cagggctctc cagaaaagac tccgggagcc tgagaaacag 1141 cccaagcttc agcgggccca ggcactggtc accttggtgg tggtgctgtt tgctctgtgc 1201 tttctgccct gcttcctggc cagagtcctg atgcacatct tccagaatct ggggagctgc 1261 agggcccttt gtgcagtggc tcatacctcg gatgtcacgg gcagcctcac ctacctgcac 1321 agtgtcgtca accccgtggt atactgcttc tccagcccca ccttcaggag ctcctatcgg 1381 agggtcttcc acaccctccg aggcaaaggg caggcagcag agcccccaga tttcaacccc 1441 agagactcct attcctgaca acagccagcg tcctcaacgc ccgtgtttat ggaactacct 1501 gcgacctaaa taataattac tcctactttg ggattctgga agaagaagaa gtcttaagac 1561 tgcaatacaa ggatcagagc ataaacatgg gcacagttgc tgcaggtgtg gtcttatact 1621 ttgttgacca gggtggtcct ctgtgatttt accttgtaga gtggcaaatc aaaaatgaac 1681 aagctagaac ctcctcctac ccaactatga tgcagattca gttgctgaac tgaaaagtcg 1741 ggcagctact ccatctccac acttgaagaa aatgtaattt gctaaatcag tgaaggaaga 1801 gaagaaagcc gggtgatggc atctttccaa ctcttacttg gtctcagcaa gtcattttca 1861 tttattatgc ttcagtttta aatacaaaaa aaaaactatg ttttcttccc acctgctgtg 1921 cagactgggg atgaccgaca tcagaaagtg ccctggttct aaaaagagac tctgctgtat 1981 ataaggtact gtcgtacatg ctagccttta tttggaacat aacatttttg ttttcataaa 2041 attttgcttc atttttctag a // LOCUS HSU65579 747 bp mRNA PRI 15-APR-1997 DEFINITION Human mitochondrial NADH dehydrogenase-ubiquinone Fe-S protein 8, 23 kDa subunit precursor (NDUFS8) nuclear mRNA encoding mitochondrial protein, complete cds. ACCESSION U65579 NID g1935055 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 747) AUTHORS Procaccio,V., Depetris,D., Soularue,P., Mattei,M.G., Lunardi,J. and Issartel,J.P. TITLE cDNA sequence and chromosomal localization of the NDUFS8 human gene coding for the 23 kDa subunit of the mitochondrial complex I JOURNAL Biochim. Biophys. Acta 1351 (1-2), 37-41 (1997) MEDLINE 97236430 REFERENCE 2 (bases 1 to 747) AUTHORS Procaccio,V., Depetris,D., Soularue,P., Mattei,M.-G., Lunardi,J. and Issartel,J.-P. TITLE Direct Submission JOURNAL Submitted (30-JUL-1996) BECP/DBMS, CEA Grenoble, 17 Rue des Martyrs Cedex9, Grenoble 38054, France FEATURES Location/Qualifiers source 1..747 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /clone="H27530 from the Washington University Merck EST project" gene 41..673 /gene="NDUFS8" CDS 41..673 /gene="NDUFS8" /note="mitochondrial NADH-coenzyme Q reductase" /codon_start=1 /product="mitochondrial NADH dehydrogenase-ubiquinone Fe-S protein 8, 23 kDa subunit precursor" /db_xref="PID:g1935056" /translation="MRCLTTPMLLRALAQAARAGPPGGRSLHSSAVAATYKYVNMQDP EMDMKSVTDRAARTLLWTELFRGLGMTLSYLFREPATINYPFEKGPLSPRFRGEHALR RYPSGEERCIACKLCEAICPAQAITIEAEPRADGSRRTTRYDIDMTKCIYCGFCQEAC PVDAIVEGPNFEFSTETHEELLYNKEKLLNNGDKWEAEIAANIQADYLYR" mat_peptide 143..670 /gene="NDUFS8" /evidence=not_experimental /product="mitochondrial NADH dehydrogenase-ubiquinone Fe-S protein 8, 23 kDa subunit" BASE COUNT 165 a 246 c 217 g 119 t ORIGIN 1 ctggcctgct tggcaaggca agtagcggcg gcgcttcaag atgcgctgcc tgaccacgcc 61 tatgctgctg cgggccctgg cccaggctgc acgtgcagga cctcctggtg gccggagcct 121 ccacagcagt gcagtggcag ccacctacaa gtatgtgaac atgcaggatc ccgagatgga 181 catgaagtca gtgactgacc gggcagcccg caccctgctg tggactgagc tcttccgagg 241 cctgggcatg accctgagct acctgttccg ggaaccggcc accatcaact acccgttcga 301 gaagggcccg ctgagccctc gcttccgtgg ggagcatgcg ctgcgccggt acccatccgg 361 ggaggagcgt tgcattgcct gcaagctctg cgaggccatc tgccccgccc aggccatcac 421 catcgaggct gagccaagag ctgatggcag ccgccggacc acccgctatg acatcgacat 481 gaccaagtgc atctactgcg gcttctgcca ggaggcctgt cccgtggatg ccatcgtcga 541 gggccccaac tttgagttct ccacggagac ccatgaggag ctgctgtaca acaaggagaa 601 gttgctcaac aacggggaca agtgggaggc cgagatcgcc gccaacatcc aggctgacta 661 cttgtatcgg tgacgcccca ccggcccgca gcccctgctg cccaataaaa ccactccgac 721 cccacggaaa aaaaaaaaaa aaaaaaa // LOCUS HSU65581 1548 bp mRNA PRI 26-OCT-1996 DEFINITION Human ribosomal protein L3-like mRNA, complete cds. ACCESSION U65581 NID g1638883 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1548) AUTHORS Burn,T.C., Connors,T.D., Van Raay,T.J., Dackowski,W.R., Millholland,J.M., Klinger,K.W. and Landes,G.M. TITLE Generation of a transcriptional map for a 700-kb region surrounding the polycystic kidney disease type 1 (PKD1) and tuberous sclerosis type 2 (TSC2) disease genes on human chromosome 16p3.3 JOURNAL Genome Res. 6 (6), 525-537 (1996) MEDLINE 96425699 REFERENCE 2 (bases 1 to 1548) AUTHORS Van Raay,T.J., Connors,T.D., Klinger,K.W., Landes,G.M. and Burn,T.C. TITLE A novel ribosomal protein L3-like gene (RPL3L) maps to the autosomal dominant polycystic kidney disease gene region JOURNAL Genomics 37 (2), 172-176 (1996) MEDLINE 97079677 REFERENCE 3 (bases 1 to 1548) AUTHORS Van Raay,T.J., Connors,T.D., Klinger,K.W., Landes,G.M. and Burn,T.C. TITLE Direct Submission JOURNAL Submitted (30-JUL-1996) Human Genetics, Genzyme Genetics, One Mountain Rd, Framingham, MA 01701, USA FEATURES Location/Qualifiers source 1..1548 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" /map="16p13.3 between DW1.8 and D16S84" /dev_stage="adult" /tissue_type="heart muscle" /note="expression restricted to muscle" CDS 49..1272 /note="similar to human ribosomal protein L3 GenBank Accession Number X73460" /codon_start=1 /product="ribosomal protein L3-like" /db_xref="PID:g1638884" /translation="MSHRKFSAPRHGHLGFLPHKRSHRHRGKVKTWPRDDPSQPVHLT AFLGYKAGMTHTLREVHRPGLKISKREEVEAVTIVETPPLVVVGVVGYVATPRGLRSF KTIFAEHLSDECRRRFYKDWHKSKKKAFTKACKRWRDTDGKKQLQKDFAAMKKYCKVI RVIVHTQMKLLPFRQKKAHIMEIQLNGGTVAEKVAWAQARLEKQVPVHSVFSQSEVID VIAVTKGRGVKGVTSRWHTKKLPRKTHKGLRKVACIGAWHPARVGCSIARAGQKGYHH RTELNKKIFRIGRGPHMEDGKLVKNNASTSYDVTAKSITPLGGFPHYGEVNNDFVMLK GCIAGTKKRVITLRKSLLVHHSRQAVENIELKFIDTTSKFGHGRFQTAQEKRAFMGPQ KKHLEKETPETSGDL" BASE COUNT 378 a 438 c 470 g 262 t ORIGIN 1 ggcggctagc ggcgaggccc cttcctgtac cttcagggat cggccaccat gtcccaccgg 61 aagttttccg cccctcggca cggacacctg ggcttcctgc cccataagag gagccaccgg 121 caccggggca aggtgaagac gtggccgcgg gatgacccca gccagcccgt gcacctcacg 181 gccttcctgg gctacaaggc gggcatgacc cacaccctgc gggaggtgca ccggccgggg 241 ctcaaaattt ccaaacggga ggaggtggag gcggtgacaa ttgtagaaac gccgccccta 301 gtggtggtgg gcgtggtggg ctacgtggcc acccctcgag gtctccggag cttcaagacc 361 atctttgcag aacacctcag tgatgagtgc cggcgccgat tctacaagga ctggcacaag 421 agcaagaaga aagccttcac caaggcctgc aagaggtggc gggacacaga cgggaaaaag 481 cagctacaga aggacttcgc cgccatgaag aagtactgca aggtcattcg ggtcattgtc 541 cacactcaga tgaaactgct gcccttccgg cagaagaagg cccacatcat ggagatccag 601 ctgaacggtg gcacggtggc cgagaaggtg gcctgggccc aggcccggct ggagaagcag 661 gtgcccgtgc acagcgtgtt cagccagagt gaggtcattg atgtcattgc tgtcaccaag 721 ggtcgaggcg tcaaaggggt cacaagccgc tggcatacca agaagctgcc gcggaagacc 781 cataagggcc tgcgcaaggt ggcctgcatt ggcgcctggc accccgcccg cgtgggctgc 841 tccattgctc gggccgggca gaagggctat caccaccgca cggagctcaa caagaagatc 901 ttccgcatcg gcaggggccc gcacatggag gacgggaagc tggtgaagaa caatgcatcc 961 accagctacg acgtgactgc caagtccatc acaccgctgg gtggcttccc ccactacggg 1021 gaagtgaaca acgacttcgt catgctgaag ggttgtattg ctggtaccaa gaagcgggtc 1081 attacgctga gaaagtccct cctggtgcat cacagtcgcc aagccgtgga gaatattgag 1141 ctcaagttca ttgacaccac ctccaagttc ggccatggcc gcttccagac agcccaagag 1201 aagagggcct tcatgggccc ccaaaagaag catctggaga aggaaacgcc ggagacctcg 1261 ggagacttgt aggctgtgtg gggtggatga accctgaagc gcaccgcact gtctgcccca 1321 atgtctaaca aaggccggag gcgactcttc ctgcgaggtc tcagagcgct gtgtaaccgc 1381 ccaaggggtt caccttgcct gctgcctaga caaagccgat tcattaagac aggggaattg 1441 caatagagaa agagtaattc acacagagct ggctgtgcgg gagaccggag ttttatgttt 1501 tattattact caaatcgatc tctttgagca aaaaaaaaaa aaaaaaaa // LOCUS HSU65637 2190 bp mRNA PRI 14-JAN-1998 DEFINITION Homo sapiens chondroitin-6-sulfotransferase mRNA, complete cds. ACCESSION U65637 NID g2769701 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2190) AUTHORS Williams,K.J. TITLE Atherosclerosis: cell biology and lipoproteins JOURNAL Curr. Opin. Lipidol. 7 (6), U202-U208 (1996) MEDLINE 97189336 REFERENCE 2 (bases 1 to 2190) AUTHORS Peng,T., Tabas,I. and Williams,K.J. TITLE Direct Submission JOURNAL Submitted (30-JUL-1996) Medicine, Thomas Jefferson University, 1020 Locust Street, Philadelphia, PA 19107-6799, USA FEATURES Location/Qualifiers source 1..2190 /organism="Homo sapiens" /note="part of the sequence corresponds to an EST, GenBank Accession Number R16177, which is similar to the chicken chondroitin-6-sulfotransferase cDNA sequence, GenBank Accession Number D49915" /db_xref="taxon:9606" CDS 125..1360 /codon_start=1 /product="chondroitin-6-sulfotransferase" /db_xref="PID:g2769702" /translation="MQCSWKAVLLLALASIAIQYTAIRTFTAKSFHTCPGLAEAGLAE RLCEESPTFAYNLSRKTHILILATTRSGSSFVGQLFNQHLDVFYLFEPLYHVQNTLIP RFTQGKSPADRRVMLGASRDLLRSLYDCDLYFLENYIKPPPVNHTTDRIFRRGASRVL CSRPVCDPPGPADLVLEEGDCVRKCGLLNLTVAAEACRERSHVAIKTVRVPEVNDLRA LVEDPRLNLKVIQLVRDPRGILASRSETFRDTYRLWRLWYGTGRKPYNLDVTQLTTVC EDFSNSVSTGLMRPPWLKGKYMLVRYEDLARNPMKKTEEIYGFLGIPLDSHVARWIQN NTRGDPTLGKHKYGTVRNSAATAEKWRFRLSYDIVAFAQNACQQVLAQLGYKIAASEE ELKNPSVSLVEERDFRPFS" BASE COUNT 429 a 717 c 595 g 449 t ORIGIN 1 ctgccgcact ggctgggact gccagctggg cctggagacg ctggtggctg tggactcccc 61 agcttggagc agtccctctt tgacctcacc ccttggagaa gcagccccat gaaggtgccc 121 agccatgcaa tgttcctgga aggccgtcct cctccttgcc ctggcctcca ttgccatcca 181 gtacacggcc atccgcacct tcaccgccaa gtcctttcac acctgccccg ggctggcaga 241 ggccgggctg gccgagcgac tgtgcgagga gagccccacc ttcgcctaca acctctcccg 301 caagacccac atcctcatcc tggccaccac gcgcagcggc tcctccttcg tgggccagct 361 cttcaaccag cacctggacg tcttctacct gtttgagccc ctctaccacg tccagaacac 421 gctcatcccc cgcttcaccc agggcaagag cccggccgac cggcgggtca tgctaggcgc 481 cagccgcgac ctcctgcgga gcctctacga ctgcgacctc tacttcctgg agaactacat 541 caagccgccg ccggtcaacc acaccaccga caggatcttc cgccgcgggg ccagccgggt 601 cctctgctcc cggcctgtgt gcgaccctcc ggggccagcc gacctggtcc tggaggaggg 661 ggactgtgtg cgcaagtgcg ggctactcaa cctgaccgtg gcggccgagg cgtgccgcga 721 gcgcagccac gtggccatca agacggtgcg cgtgcccgag gtgaacgacc tgcgcgccct 781 ggtggaagac ccgcgattaa acctcaaggt catccagctg gtccgagacc cccgcggcat 841 tctggcttcg cgcagcgaga ccttccgcga cacgtaccgg ctctggcggc tctggtacgg 901 caccgggagg aaaccctaca acctggacgt gacgcagctg accacggtgt gcgaggactt 961 ctccaactcc gtgtccaccg gcctcatgcg gcccccgtgg ctcaagggca agtacatgtt 1021 ggtgcgctac gaggacctgg ctcggaaccc tatgaagaag accgaggaga tctacgggtt 1081 cctgggcatc ccgctggaca gccacgtggc ccgctggatc cagaacaaca cgcggggcga 1141 ccccaccctg ggcaagcaca aatacggcac cgtgcgaaac tcggcggcca cggccgagaa 1201 gtggcgcttc cgcctctcct acgacatcgt ggcctttgcc cagaacgcct gccagcaggt 1261 gctggcccag ctgggctaca agatcgccgc ctcggaggag gagctgaaga acccctcggt 1321 cagcctggtg gaggagcggg acttccgccc cttctcgtga cccgggcggt gcgggtgggg 1381 gcgggaggcg caaggtgtcg gttttgataa aatggaccgt ttttaactgt tgccttatta 1441 acccctccct ctcccacctc atcttcgtgt ccttcctgcc cccagctcac cccactccct 1501 tctgcccctt ttttgtctct gaaatttgca ctacgtcttg gacgggaatc actggggcag 1561 agggcgcctg aagtagggtc ccgccccccc caccccattc agacacatgg atgttgggtc 1621 tctgtgcgga cggtgacaat gtttacaagc accacattta cacatccaca cacgcacacg 1681 ggcactcgcg aggcgacttc tcaagctttt gaatgggtga gtggtcgggt atctagtttt 1741 tgcactgtct tactattcaa ggtaagagga tacaaacaag aggaccactt gtctctaatt 1801 tatgaatggt gtccatcctt tccccatccc tgcctcctgc ccctgacgcc catttccccc 1861 cttagagcag cgaaactgcc ccctcctgcc cgcccttgcc tgtcggtgag gcaggttttt 1921 actgtgaggt gaacgtggac ctgtttctgt ttccagtctg tggtgatgct gtctgtctgt 1981 ctgagtctcg tggccgcccc tggaccagtg atgactgatg aatcttatga gcttctgatt 2041 gatctcgggg tccatctgtg atatttcttt gtgccaaaaa gaaaaaaaaa gagtggatca 2101 gtttgctaaa tgaacattga aattgaaatg ctttatctgt gttttctgta aataaaagag 2161 tgcaataaaa aaaaaaaaaa aaaaaaaaaa // LOCUS HSU65652 1303 bp mRNA PRI 05-FEB-1998 DEFINITION Homo sapiens HREP protein (C17ORF1) mRNA, complete cds. ACCESSION U65652 NID g2673958 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1303) AUTHORS Kennerson,M.L., Gordon,M.J., Blair,I.P. and Nicholson,G.A. TITLE Single test for two hereditary neuropathies, CMT1A and HNPP JOURNAL Clin. Chem. 41 (10), 1534-1535 (1995) MEDLINE 96067068 REFERENCE 2 (bases 1 to 1303) AUTHORS Kennerson,M.L., Nassif,N.T., Dawkins,J.L., DeKroon,R.M., Yang,J.G. and Nicholson,G.A. TITLE The Charcot-Marie-Tooth binary repeat contains a gene transcribed from the opposite strand of a partially duplicated region of the COX10 gene JOURNAL Genomics 46 (1), 61-69 (1997) MEDLINE 98066763 REFERENCE 3 (bases 1 to 1303) AUTHORS Kennerson,M.L., Nassif,N.T., Dawkins,J.L., DeKroon,R.M., Yang,J.G. and Nicholson,G.A. TITLE Direct Submission JOURNAL Submitted (31-JUL-1996) Medicine, University of Sydney, Concord Hospital, Clinical Sciences Bldg., Hospital Rd., Concord, NSW 2139, Australia FEATURES Location/Qualifiers source 1..1303 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17" /map="17p11.2-p12" /tissue_type="heart and skeletal muscle" gene 1..1303 /gene="C17ORF1" 5'UTR <1..221 /gene="C17ORF1" CDS 222..821 /gene="C17ORF1" /codon_start=1 /product="HREP protein" /db_xref="PID:g2673959" /translation="MDLCKNRLVSGGRDCQVKVWDVDTGKCLKTFRHKDPILATRIND TYIVSSCERGLVKVWHIAMAQLVKTLSGHEGAVKCLFFDQWHLLSGSTDGLVMAWSMV GKYERCLMAFKHPKEVLDVSLLFLRIISACANGKIRIYNFLHGNCMKVIKTNGRGDPV LSFRATEFQSATSAHLLKELTWDGMESNQVLQLREEMPP" 3'UTR 822..1287 /gene="C17ORF1" polyA_signal 1253..1258 /gene="C17ORF1" polyA_signal 1267..1272 /gene="C17ORF1" BASE COUNT 361 a 317 c 341 g 284 t ORIGIN 1 cgcccgggca ggtttcaaat ccaggcccct gactctaagc tgaaccatga gaatctgctg 61 cctcccaact aactaattat ctacatgggt tctacgcagt atgatccttt gggccctcag 121 gaactctcac attcttttgg cctgattcct gcagatactg ggatctgaaa agtggggttt 181 gcacacgaat cttcggtggt caccagggga ctatcacttg catggacttg tgtaagaaca 241 ggctcgtatc tggaggaaga gattgccagg taaaagtatg ggatgtagac acagggaagt 301 gcctgaagac gtttagacac aaagacccca tcttggccac caggatcaat gatacctaca 361 ttgtgagcag ctgtgagcga gggctggtga aagtgtggca cattgccatg gcccagttgg 421 taaagactct cagtggccac gagggagccg tgaaatgcct gttctttgac cagtggcatc 481 tcctctcagg aagcactgat ggcctggtca tggcctggag catggtgggg aagtacgagc 541 gctgcctgat ggccttcaag catcccaaag aggtgctcga cgtgtccctt ctcttcctcc 601 ggatcatcag cgcctgtgca aatggcaaga tccgaattta caatttcctc catgggaact 661 gtatgaaggt gataaaaacc aatggcagag gcgatcctgt gctgtccttc agggcaacag 721 aatttcagtc tgccacatca gcacatttgc taaaagaatt aacgtgggat ggaatggaat 781 cgaaccaagt gctacagctc agggaggaaa tgcctccttg accgagtgtg ctcatgtgag 841 actccacatc gcgggacact taccagcatc cgaggctgcc cgtggccgct gtccagccca 901 tgacaggcgg gatggcccca accacagctc cgacccatgt gttggcaatg ctgatccttt 961 tcagtggtgt gtagcagcag gtatacagga aaatgttgaa gagccccagg gctcctgtga 1021 gtggattcac ccccaaggtc agaatggcaa ctcctggaac agcacaacaa gtggcaaagg 1081 acacagctag caacgggctg gaaaaaggca gagaacgtgg gagtgatcat ctccaactca 1141 actcaatcat actacctaaa atcggcactg acaaagtaca aataatccac ctatccagta 1201 gaggcagcac tgatctggct gaattcctgg atatggggaa tcctcgggat gaaacaaact 1261 tacaataaga aagaatatgt ttgtgggaaa ataaaaaaaa aaa // LOCUS HSU65676 3673 bp mRNA PRI 01-NOV-1996 DEFINITION Human Hermansky-Pudlak syndrome protein (HPS) mRNA, complete cds. ACCESSION U65676 NID g1654350 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3673) AUTHORS Oh,J., Bailin,T., Fukai,K., Feng,G.H., Mao,J., Frenk,E., Almodovar,C., Tamura,N. and Spritz,R.A. TITLE A gene for Hermansky-Pudlak syndrome, a disorder associated with defects of multiple cytoplasmic organelles JOURNAL Nature Genet. (1996) In press REFERENCE 2 (bases 1 to 3673) AUTHORS Spritz,R.A. TITLE Direct Submission JOURNAL Submitted (30-JUL-1996) Medical Genetics, University of Wisconsin, 445 Henry Mall, Madison, WI 53706, USA FEATURES Location/Qualifiers source 1..3673 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="10" /map="10q23.1-q23.3; between D10S110/D10S184 and D10S2436" gene 207..2309 /gene="HPS" CDS 207..2309 /gene="HPS" /note="Hermansky-Pudlak syndrome protein" /codon_start=1 /db_xref="PID:g1654351" /translation="MKCVLVATEGAEVLFYWTDQEFEESLRLKFGQSENEEEELPALE DQLSTLLAPVIISSMTMLEKLSDTYTCFSTENGNFLYVLHLFGECLFIAINGDHTESE GDLRRKLYVLKYLFEVHFGLVTVDGHLIRKELRPPDLAQRVQLWEHFQSLLWTYSRLR EQEQCFAVEALERLIHPQLCELCIEALERHVIQAVNTSPERGGEEALHAFLLVHSKLL AFYSSHSASSLRPADLLALILLVQDLYPSESTAEDDIQPSPRRARSSQNIPVQQAWSP HSTGPTGGSSAETETDSFSLPEEYFTPAPSPGDQSSGSTIWLEGGTPPMDALQIAEDT LQTLVPHCPVPSGPRRIFLDANVKESYCPLVPHTMYCLPLWQGINLVLLTRSPSAPLA LVLSQLMDGFSMLEKKLKEGPEPGASLRSQPLVGDLRQRMDKFVKNRGAQEIQSTWLE FKAKAFSKSEPGSSWELLQACGKLKRQLCAIYRLNFLTTAPSRGGPHLPQHLQDQVQR LMREKLTDWKDFLLVKSRRNITMVSYLEDFPGLVHFIYVDRTTGQMVAPSLNCSQKTS SELGKGPLAAFVKTKVWSLIQLARRYLQKGYTTLLFREGDFYCSYFLWFENDMGYKLQ MIEVPVLSDDSVPIGMLGGDYYRKLLRYYSKNRPTEAVRCYELLALHLSVIPTDLLVQ QAGQLARRLWEASRIPLL" BASE COUNT 732 a 1135 c 1037 g 769 t ORIGIN 1 gggcgctgtg cgcgccgcga tccggtacgt gggcctccgg gctgtcccct ctgggggcga 61 tcctccctcc ggagcccccc ttcaaccctc ccggaagtga ggaccaggga tgctgtgctg 121 ctctcccatg agccagtcac cgagtcggtc tgctgcagcc ctttctgaac ctctggccgt 181 ctggatgctc cactgtgctt gccaagatga agtgcgtctt ggtggccact gagggcgcag 241 aggtcctctt ctactggaca gatcaggagt ttgaagagag tctccggctg aagttcgggc 301 agtcagagaa tgaggaagaa gagctccctg ccctggagga ccagctcagc accctcctag 361 ccccggtcat catctcctcc atgacgatgc tggagaagct ctcggacacc tacacctgct 421 tctccacgga aaatggcaac ttcctgtatg tccttcacct gtttggagaa tgcctgttca 481 ttgccatcaa tggtgaccac accgagagcg agggggacct gcggcggaag ctgtatgtgc 541 tcaagtacct gtttgaagtg cactttgggc tggtgactgt ggacggtcat cttatccgaa 601 aggagctgcg gcccccagac ctggcgcagc gtgtccagct gtgggagcac ttccagagcc 661 tgctgtggac ctacagccgc ctgcgggagc aggagcagtg cttcgccgtg gaggccctgg 721 agcgactgat tcacccccag ctctgtgagc tgtgcataga ggcgctggag cggcacgtca 781 tccaggctgt caacaccagc cccgagcggg gaggcgagga ggccctgcat gccttcctgc 841 tcgtgcactc caagctgctg gcattctact ctagccacag tgccagctcc ctgcgcccgg 901 ccgacctgct tgccctcatc ctcctggttc aggacctcta ccccagcgag agcacagcag 961 aggacgacat tcagccttcc ccgcggaggg cccggagcag ccagaacatc cccgtgcagc 1021 aggcctggag ccctcactcc acgggcccaa ctggggggag ctctgcagag acggagacag 1081 acagcttctc cctccctgag gagtacttca caccagctcc ttcccctggc gatcagagct 1141 caggtagcac catctggctg gaggggggca ccccccccat ggatgccctt cagatagcag 1201 aggacaccct ccaaacactg gttccccact gccctgtgcc ttccggcccc agaaggatct 1261 tcctggatgc caacgtgaag gaaagctact gccccctagt gccccacacc atgtactgcc 1321 tgcccctgtg gcagggcatc aacctggtgc tcctgaccag gagccccagc gcgcccctgg 1381 ccctggttct gtcccagctg atggatggct tctccatgct ggagaagaag ctgaaggaag 1441 ggccggagcc cggggcctcc ctgcgctccc agcccctcgt gggagacctg cgccagagga 1501 tggacaagtt tgtcaagaat cgaggggcac aggagattca gagcacctgg ctggagttta 1561 aggccaaggc tttctccaaa agtgagcccg gatcctcctg ggagctgctc caggcatgtg 1621 ggaagctgaa gcggcagctc tgcgccatct accggctgaa ctttctgacc acagccccca 1681 gcaggggagg cccacacctg ccccagcacc tgcaggacca agtgcagagg ctcatgcggg 1741 agaagctgac ggactggaag gacttcttgc tggtgaagag caggaggaac atcaccatgg 1801 tgtcctacct agaagacttc ccaggcttgg tgcacttcat ctatgtggac cgcaccactg 1861 ggcagatggt ggcgccttcc ctcaactgca gtcaaaagac ctcgtcggag ttgggcaagg 1921 ggccgctggc tgcctttgtc aaaactaagg tctggtctct gatccagctg gcgcgcagat 1981 acctgcagaa gggctacacc acgctgctgt tccgggaggg ggatttctac tgctcctact 2041 tcctgtggtt cgagaatgac atggggtaca aactccagat gatcgaggtg cccgtcctct 2101 ccgacgactc agtgcctatc ggcatgctgg gaggagacta ctacaggaag ctcctgcgct 2161 actacagcaa gaaccgccca accgaggctg tcaggtgcta cgagctgctg gccctgcacc 2221 tgtctgtcat ccccactgac ctgctggtgc agcaggccgg ccagctggcc cggcgcctct 2281 gggaggcctc ccgtatcccc ctgctctagg ccaaggtggc cgcagtctgc ctttgcatcc 2341 tgtcctccag ccacccttgc ttgccactgt tccccatgac gagagcctcc tgtctgcagt 2401 ggccatcctg aggatagggc agagtgccca gggtggcccc agggcttcta aaaccccacc 2461 tagaccaccc tccatgtcag gtactgagca aggccccaga tccttctctc tggaggaaga 2521 gggaagccca ggggtcctgt ttgtaaaaca acggtggcaa cagctcctct tccagagctg 2581 cctctgcctt tatcctggga gatggggagg aagccccatc tctgctgttc cctgcgtgga 2641 ggaagcccac ccagcaagct ctctcctacc ccaggtaaaa ggtgctcctt tgcctgggtt 2701 tgaattccag cgctgccact tcctctctgc acctcctggc aagtttcttc tattccccac 2761 gtttaaagcg atggcacctc cgtcccaggg tggtgtgagg attacccagt gtggtaggtg 2821 ctcaataaat gttggtcatt gttatcactg aagcccaaca tgctagtgct tctagaccct 2881 tctgtcagtg ctgataagcc cttgctaagt cccagcccct tcatgcttgg ctggcgtctg 2941 ccctagggct ggggttctca agcccctggc cctggcccag agatttggat tcccttggcg 3001 gccgtggagc ccaggctttg atgtctttca aagcttctgt ggtgcgccct ggattgagaa 3061 ccaccacccg aggggtacag cccctctctt ccaaccgaga agttcctgtc cagaatggac 3121 ccagggacaa gagaccctga gagccctggg actgggagtg tctgctcctc tgagccagga 3181 ggccggtgct gggccagaga ggacggcgtg gcgaaagtca gcgtccactg cagcacagga 3241 tcagatggcc gtgtgctgtg catgcaggag cctcgccttc tgtgtcttta gtcttgagcc 3301 aaaatttgct caaaagactg atctcttcct tgcagggaac agctttgggg ctgggggaac 3361 tagaacccac atgttggtct aaaccctgag aaggtggcag tgaggaagta tcccctcagg 3421 tgactggatc tgtgttcctc cttaacatca tctgatggaa tggcaatgaa aagcgtggat 3481 tgtggaaaat acagaaaaac ataaaggaaa aaactccaat cccctgagcc caccactgtt 3541 caggacccct gcttttgtca cctactattt ccctttagtt tttagcagcg gctggatgtg 3601 atatgtctag tttaaccagt ccccttgatc tttctatata ataaataaca caggagtgaa 3661 catcctgaat cag // LOCUS HSU65785 4503 bp mRNA PRI 25-JAN-1997 DEFINITION Human 150 kDa oxygen-regulated protein ORP150 mRNA, complete cds. ACCESSION U65785 NID g1794218 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4503) AUTHORS Ikeda,J., Kaneda,S., Kuwabara,K., Ogawa,S., Kobayashi,T., Matsumoto,M., Yura,T. and Yanagi,H. TITLE Cloning and expression of cDNA encoding the human 150 kDa oxygen-regulated protein, ORP150 JOURNAL Biochem. Biophys. Res. Commun. 230 (1), 94-99 (1997) MEDLINE 97148579 REFERENCE 2 (bases 1 to 4503) AUTHORS Ikeda,J., Kobayashi,T., Kuwabara,K., Ogawa,S., Yura,T. and Yanagi,H. TITLE Direct Submission JOURNAL Submitted (01-AUG-1996) HSP Research Institute, Kyoto Research Park, Shimogyo-ku, Kyoto 600, Japan FEATURES Location/Qualifiers source 1..4503 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="astrocytoma U373" CDS 103..3102 /function="proposed ER chaperone" /codon_start=1 /product="150 kDa oxygen-regulated protein ORP150" /db_xref="PID:g1794219" /translation="MADKVRRQRPRRRVCWALVAVLLADLLALSDTLAVMSVDLGSES MKVAIVKPGVPMEIVLNKESRRKTPVIVTLKENERFFGDSAASMAIKNPKATLRYFQH LLGKQADNPHVALYQARFPEHELTFDPQRQTVHFQISSQLQFSPEEVLGMVLNYSRSL AEDFAEQPIKDAVITVPVFFNQAERRAVLQAARMAGLKVLQLINDNTATALSYGVFRR KDINTTAQNIMFYDMGSGSTVCTIVTYQMVKTKEAGMQPQLQIRGVGFDRTLGGLEME LRLRERLAGLFNEQRKGQRAKDVRENPRAMAKLLREANRLKTVLSANADHMAQIEGLM DDVDFKAKVTRVEFEELCADLFERVPGPVQQALQSAEMSLDEIEQVILVGGATRVPRV QEVLLKAVGKEELGKNINADEAAAMGAVYQAAALSKAFKVKPFVVRDAVVYPILVEFT REVEEEPGIHSLKHNKRVLFSRMGPYPQRKVITFNRYSHDFNFHINYGDLGFLGPEDL RVFGSQNLTTVKLKGVGDSFKKYPDYESKGIKAHFNLDESGVLSLDRVESVFETLVED SAEEESTLTKLGNTISSLFGGGTTPDAKENGTDTVQEEEESPAEGSKDEPGEQVELKE EAEAPVEDGSQPPPPEPKGDATPEGEKATEKENGDKSEAQKPSEKAEAGPEGVAPAPE GEKKQKPARKRRMVEEIGVELVVLDLPDLPEDKLAQSVQKLQDLTLRDLEKQEREKAA NSLEAFIFETQDKLYQPEYQEVSTEEQREEISGKLSAASTWLEDEGVGATTVMLKEKL AELRKLCQGLFFRVEERKKWPERLSALDNLLNHSSMFLKGARLIPEMDQIFTEVEMTT LEKVINETWAWKNATLAEQAKLPATEKPVLLSKDIEAKMMALDREVQYLLNKAKFTKP RPRPKDKNGTRAEPPLNASASDQGEKVIPPAGQTEDAEPISEPEKVETGSEPGDTEPL ELGGPGAEPEQKEQSTGQKRPLKNDEL" BASE COUNT 1045 a 1177 c 1311 g 970 t ORIGIN 1 ttgtgaaggg cgcgggtggg gggcgctgcc ggcctcgtgg gtacgttcgt gccgcgtctg 61 tcccagagct ggggccgcag gagcggaggc aagaggggca ctatggcaga caaagttagg 121 aggcagaggc cgaggaggcg agtctgttgg gccttggtgg ctgtgctctt ggcagacctg 181 ttggcactga gtgatacact ggcagtgatg tctgtggacc tgggcagtga gtccatgaag 241 gtggccattg tcaaacctgg agtgcccatg gaaattgtct tgaataagga atctcggagg 301 aaaacaccgg tgatcgtgac cctgaaagaa aatgaaagat tctttggaga cagtgcagca 361 agcatggcga ttaagaatcc aaaggctacg ctacgttact tccagcacct cctggggaag 421 caggcagata acccccatgt agctctttac caggcccgct tcccggagca cgagctgact 481 ttcgacccac agaggcagac tgtgcacttt cagatcagct cgcagctgca gttctcacct 541 gaggaagtgt tgggcatggt tctcaattat tctcgttctc tagctgaaga ttttgcagag 601 cagcccatca aggatgcagt gatcaccgtg ccagtcttct tcaaccaggc cgagcgccga 661 gctgtgctgc aggctgctcg tatggctggc ctcaaagtgc tgcagctcat caatgacaac 721 accgccactg ccctcagcta tggtgtcttc cgccggaaag atattaacac cactgcccag 781 aatatcatgt tctatgacat gggctcaggc agcaccgtat gcaccattgt gacctaccag 841 atggtgaaga ctaaggaagc tgggatgcag ccacagctgc agatccgggg agtaggattt 901 gaccgtaccc tggggggcct ggagatggag ctccggcttc gagaacgcct ggctgggctt 961 ttcaatgagc agcgcaaggg tcagagagca aaggatgtgc gggagaaccc gcgtgccatg 1021 gccaagctgc tgcgtgaggc taatcggctc aaaaccgtcc tcagtgccaa cgctgaccac 1081 atggcacaga ttgaaggcct gatggatgat gtggacttca aggcaaaagt gactcgtgtg 1141 gaatttgagg agttgtgtgc agacttgttt gagcgggtgc ctgggcctgt acagcaggcc 1201 ctccagagtg ccgaaatgag tctggatgag attgagcagg tgatcctggt gggtggggcc 1261 actcgggtcc ccagagttca ggaggtgctg ctgaaggccg tgggcaagga ggagctgggg 1321 aagaacatca atgcagatga agcagccgcc atgggggcag tgtaccaggc agctgcgctc 1381 agcaaagcct ttaaagtgaa gccatttgtc gtccgagatg cagtggtcta ccccatcctg 1441 gtggagttca cgagggaggt ggaggaggag cctgggattc acagcctgaa gcacaataaa 1501 cgggtactct tctctcggat ggggccctac cctcaacgca aagtcatcac ctttaaccgc 1561 tacagccatg atttcaactt ccacatcaac tacggcgacc tgggcttcct ggggcctgaa 1621 gatcttcggg tatttggctc ccagaatctg accacagtga agctaaaagg ggtgggtgac 1681 agcttcaaga agtatcctga ctacgagtcc aagggcatca aggctcactt caacctggat 1741 gagagtggcg tgctcagtct agacagggtg gagtctgtat ttgagacact ggtagaggac 1801 agcgcagaag aggaatctac tctcaccaaa cttggcaaca ccatttccag cctgtttgga 1861 ggcggtacca caccagatgc caaggagaat ggtactgata ctgtccagga ggaagaggag 1921 agccctgcag aggggagcaa ggacgagcct ggggagcagg tggagctcaa ggaggaagct 1981 gaggccccag tggaggatgg ctctcagccc ccaccccctg aacctaaggg agatgcaacc 2041 cctgagggag aaaaggccac agaaaaagaa aatggggaca agtctgaggc ccagaaacca 2101 agtgagaagg cagaggcagg gcctgagggc gtcgctccag ccccagaggg agagaagaag 2161 cagaagcccg ccaggaagcg gcgaatggta gaggagatcg gggtggagct ggttgttctg 2221 gacctgcctg acttgccaga ggataagctg gctcagtcgg tgcagaaact tcaggacttg 2281 acactccgag acctggagaa gcaggaacgg gaaaaagctg ccaacagctt ggaagcgttc 2341 atatttgaga cccaggacaa gctgtaccag cccgagtacc aggaagtgtc cacagaggag 2401 cagcgtgagg agatctctgg gaagctcagc gccgcatcca cctggctgga ggatgagggt 2461 gttggagcca ccacagtgat gttgaaggag aagctggctg agctgaggaa gctgtgccaa 2521 gggctgtttt ttcgggtaga ggagcgcaag aagtggcccg aacggctgtc tgccctcgat 2581 aatctcctca accattccag catgttcctc aagggggccc ggctcatccc agagatggac 2641 cagatcttca ctgaggtgga gatgacaacg ttagagaaag tcatcaatga gacctgggcc 2701 tggaagaatg caactctggc cgagcaggct aagctgcccg ccacagagaa gcctgtgttg 2761 ctctcaaaag acattgaagc taagatgatg gccctggacc gagaggtgca gtatctgctc 2821 aataaggcca agtttaccaa gccccggccc cggcctaagg acaagaatgg gacccgggca 2881 gagccacccc tcaatgccag tgccagtgac cagggggaga aggtcatccc tccagcaggc 2941 cagactgaag atgcagagcc catttcagaa cctgagaaag tagagactgg atccgagcca 3001 ggagacactg agcctttgga gttaggaggt cctggagcag aacctgaaca gaaagaacaa 3061 tcgacaggac agaagcggcc tttgaagaac gacgaactat aacccccacc tctgttttcc 3121 ccattcatct ccaccccctt cccccaccac ttctatttat ttaacatcga gggttggggg 3181 aggggttggt cctgccctcg gctggagttc ctttctcacc cctgtgattt ggaggtgtgg 3241 agaaggggaa gggagggaca gctcactggt tccttctgca gtacctctgt ggttaaaaat 3301 ggaaactgtt ctcctcccca gccccactcc ctgttcccta cccatatagg ccctaaattt 3361 gggaaaaatc actattaatt tctgaatcct ttgcctgtgg gtaggaagag aatggctgcc 3421 agtggctgat gggtcccggt gatgggaagg gtatcaggtt gctggggagt ttccactctt 3481 ctctggtgat tgttccttcc ctcccttcct ctcccaccat gcgatgagca tcctttcagg 3541 ccagtgtctg cagagcctca gttaccaggt ttggtttctg agtgcctatc tgtgctcttt 3601 cctccctctg cgggcttctc ttgctctgag cctcccttcc ccattcccat gcagctcctt 3661 tccccctggg tttccttggc ttcctgcagc aaattgggca gttctctgcc ccttgcctaa 3721 aagcctgtac ctctggattg gcggaagtaa atctggaagg attctcactc gtatttccca 3781 cccctagtgg ccagaggagg gaggggcaca gtgaagaagg gagcccacca cctctccgaa 3841 gaggaaagcc acgtagagtg gttggcatgg ggtgccagca tcgtgcaagc tctgtcataa 3901 tctgcatctt cccagcagcc tggtacccca ggttcctgta actccctgcc tcctcctctc 3961 ttctgctgtt ctgctcctcc cagacagagc ctttccctca ccccctgacc ccctgggctg 4021 accaaaatgt gctttctact gtgagtccct atcccaagat cctggggaaa ggagagacca 4081 tggtgtgaat gtagagatgc cacctccctc tctctgaggc aggcctgtgg atgaaggagg 4141 agggtcaggg ctggccttcc tctgtgcatc actctgctag gttgggggcc cccgacccac 4201 catacctacg cctagggagc ccgtcctcca gtattccgtc tgtagcagga gctagggctg 4261 ctgcctcagc tccaagacaa gaatgaacct ggctgtgtca gtcattttgt cttttccttt 4321 tttttttttt gccacattgg cagagatggg acctaagggt cccacccctc accccacccc 4381 cacctcttct gtatgtttga attctttcag tagctgttga tgctggttgg acaggtttga 4441 gtcaaattgt actttgctcc attgttaatt gagaaactgt ttcaataaaa tattcttttc 4501 tac // LOCUS HSU65928 1292 bp mRNA PRI 09-OCT-1996 DEFINITION Human Jun activation domain binding protein mRNA, complete cds. ACCESSION U65928 NID g1549382 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1292) AUTHORS Claret,F.-X., Hibi,M., Dhut,S., Toda,T. and Karin,M. TITLE A new group of conserved coactivators that increase the specificity of AP-1 transcription factors JOURNAL Nature 382, 453-457 (1996) REFERENCE 2 (bases 1 to 1292) AUTHORS Claret,F.-X., Hibi,M., Dhut,S., Toda,T. and Karin,M. TITLE Direct Submission JOURNAL Submitted (02-AUG-1996) Division of Molecular Oncology, Biomedical Research Center, Osaka University Medical School, 2-2 Yamada-Oka, Suita, Osaka 565, Japan FEATURES Location/Qualifiers source 1..1292 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lymphocyte" /clone_lib="lamda gt11 Jurkat" CDS 128..1132 /codon_start=1 /product="Jun activation domain binding protein" /db_xref="PID:g1549383" /translation="MAASGSGMAQKTWELANNMQEAQSIDEIYKYDKKQQQEILAAKP WTKDHHYFKYCKISALALLKMVMHARSGGNLEVMGLMLGKVDGETMIIMDSFALPVEG TETRVNAQAAAYEYMAAYIENAKQVGHLENAIGWYHSHPGYGCWLSGIDVSTQMLNQQ FQEPFVAVVIDPTRTISAGKVNLGAFRTYPKGYKPPDEGPSEYQTIPLNKIEDFGVHC KQYYALEVSYFKSSLDRKLLELLWNKYWVNTLSSSSLLTNADYTTGQVFDLSEKLEQS EAQLGRGSFMLGLETHDRKSEDKLAKATRDSCKTTIEAIHGLMSQVIKDKLFNQINIS " BASE COUNT 406 a 256 c 306 g 324 t ORIGIN 1 gaattcccaa gagtctaggt aagagtttgt tcccgtggtg cggagggtca aggcccacac 61 ccggaaacct agcgaggtaa agttgcgtct tggttgtaga gacgacaact tctccgcttc 121 ctcggcgatg gcggcgtccg ggagcggtat ggcccagaaa acctgggaac tggccaacaa 181 catgcaggaa gctcagagta tcgatgaaat ctacaaatac gacaagaaac agcagcaaga 241 aatcctggcg gcgaagccct ggactaagga tcaccattac tttaagtact gcaaaatctc 301 agcattggct ctgctgaaga tggtgatgca tgccagatcg ggaggcaact tggaagtgat 361 gggtctgatg ctaggaaagg tggatggtga aaccatgatc attatggaca gttttgcttt 421 gcctgtggag ggcactgaaa cccgagtaaa tgctcaggct gctgcatatg aatacatggc 481 tgcatacata gaaaatgcaa aacaggtggg ccaccttgaa aatgcaatcg ggtggtatca 541 tagccaccct ggctatggct gctggctttc tgggattgat gttagtactc agatgctcaa 601 tcagcagttc caggaaccat ttgtagcagt ggtgattgat ccaacaagaa caatatccgc 661 agggaaagtg aatcttggcg cctttaggac atacccaaag ggctacaaac ctcctgatga 721 aggaccttct gagtaccaga ctattccact taataaaata gaagattttg gtgtacactg 781 caaacaatat tatgccttag aagtctcata tttcaaatcc tctttggatc gcaaattgct 841 tgagctgttg tggaataaat actgggtgaa tacgttgagt tcttctagct tgcttactaa 901 tgcagactat accactggtc aggtctttga tttgtctgaa aagttagagc agtcagaagc 961 ccagctggga cgagggagtt tcatgttggg tttagaaacg catgaccgaa aatcagaaga 1021 caaacttgcc aaagctacaa gagacagctg taaaactacc atagaagcta tccatggatt 1081 gatgtctcag gttattaagg ataaactgtt taatcaaatt aacatctctt aaacagtctc 1141 tgagaagtac tttacctgaa agacagtatg agaaaaatat tcaagtacac tttaaaacca 1201 gttacccaaa atctgattag aagtataagg tgctctgaag tgtcctaaat attaatatcc 1261 tgtaataaag ctctttaaaa tgaaaaaaaa aa // LOCUS HSU65932 1683 bp mRNA PRI 13-AUG-1996 DEFINITION Human extracellular matrix protein 1 (ECM1) mRNA, complete cds. ACCESSION U65932 NID g1488323 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1683) AUTHORS Johnson,M.R., Vos,H.L., Ortiz de Luna,R.I., Dehejia,A.M., Polymeropoulos,M.H., McIntosh,I. and Francomano,C.A. TITLE Characterization of human extracellular matrix protein 1 gene within the pycnodysostosis candidate region on chromosome 1q21 JOURNAL Unpublished REFERENCE 2 (bases 1 to 1683) AUTHORS Johnson,M.R., Vos,H.L., Ortiz de Luna,R.I., Dehejia,A.M., Polymeropoulos,M.H., McIntosh,I. and Francomano,C.A. TITLE Direct Submission JOURNAL Submitted (01-AUG-1996) MGB/NCHGR, NIH, 10 Center Dr. MSC 1852, Bethesda, MD 20892-1852, USA FEATURES Location/Qualifiers source 1..1683 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1q21" gene 32..1654 /gene="ECM1" CDS 32..1654 /gene="ECM1" /codon_start=1 /product="extracellular matrix protein 1" /db_xref="PID:g1488324" /translation="MGTTARAALVLTYLAVASAASEGGFTATGQRQLRPEHFQEVGYA APPSPPLSRSLPMDHPDSSQHGPPFEGQSQVQPPPSQEATPLQQEKLLPAQLPAEKEV GPPLPQEAVPLQKELPSLQHPNEQKEGTPAPFGDQSHPEPESWNAAQHCQQDRSQGGW GHRLDGFPPGRPSPDNLNQICLPNRQHVVYGPWNLPQSSYSHLTRQGETLNFLEIGYS RCCHCRSHTNRLECAKLVWEEAMSRFCEAEFSVKTRPHWCCTRQGEARFSCFQEEAPQ PHYQLRACPSHQPDISSGLELPFPPGVPTLDNIKNICHLRRFRSVPRNLPATDPLQRE LLALIQLEREFQRCCRQGNNHTCTWKAWEDTLDKYCDREYAVKTHHHLCCRHPPSPTR DECFARRAPYPNYDRDILTIDISRVTPNLMGHLCGNQRVLTKHKHIPGLIHNMTARCC DLPFPEQACCAEEEKLTFINDLCGPRRNIWRDPALCCYLSPGDEQVNCFNINYLRNVA LVSGDTENAKGQGEQGSTGGTNISSTSEPKEE" BASE COUNT 374 a 554 c 428 g 327 t ORIGIN 1 ctctgagtgt ccagtggtca gttgccccag gatggggacc acagccagag cagccttggt 61 cttgacctat ttggctgttg cttctgctgc ctctgaggga ggcttcacgg ctacaggaca 121 gaggcagctg aggccagagc actttcaaga agttggctac gcagctcccc cctccccacc 181 cctatcccga agcctcccca tggatcaccc tgactcctct cagcatggcc ctccctttga 241 gggacagagt caagtgcagc cccctccctc tcaggaggcc acccctctcc aacaggaaaa 301 gctgctacct gcccaactcc ctgctgaaaa ggaagtgggt ccccctctcc ctcaggaagc 361 tgtccccctc caaaaagagc tgccctctct ccagcacccc aatgaacaga aggaaggaac 421 gccagctcca tttggggacc agagccatcc agaacctgag tcctggaatg cagcccagca 481 ctgccaacag gaccggtccc aagggggctg gggccaccgg ctggatggct tcccccctgg 541 gcggccttct ccagacaatc tgaaccaaat ctgccttcct aaccgtcagc atgtggtata 601 tggtccctgg aacctaccac agtccagcta ctcccacctc actcgccagg gtgagaccct 661 caatttcctg gagattggat attcccgctg ctgccactgc cgcagccaca caaaccgcct 721 agagtgtgcc aaacttgtgt gggaggaagc aatgagccga ttctgtgagg ccgagttctc 781 ggtcaagacc cgaccccact ggtgctgcac gcggcagggg gaggctcggt tctcctgctt 841 ccaggaggaa gctccccagc cacactacca gctccgggcc tgccccagcc atcagcctga 901 tatttcctcg ggtcttgagc tgcctttccc tcctggggtg cccacattgg acaatatcaa 961 gaacatctgc cacctgaggc gcttccgctc tgtgccacgc aacctgccag ctactgaccc 1021 cctacaaagg gagctgctgg cactgatcca gctggagagg gagttccagc gctgctgccg 1081 ccaggggaac aatcacacct gtacatggaa ggcctgggag gatacccttg acaaatactg 1141 tgaccgggag tatgctgtga agacccacca ccacttgtgt tgccgccacc ctcccagccc 1201 tactcgggat gagtgctttg cccgtcgggc tccttacccc aactatgacc gggacatctt 1261 gaccattgac atcagtcgag tcacccccaa cctcatgggc cacctctgtg gaaaccaaag 1321 agttctcacc aagcataaac atattcctgg gctgatccac aacatgactg cccgctgctg 1381 tgacctgcca tttccagaac aggcctgctg tgcagaggag gagaaattaa ccttcatcaa 1441 tgatctgtgt ggtccccgac gtaacatctg gcgagaccct gccctctgct gttacctgag 1501 tcctggggat gaacaggtca actgcttcaa catcaattat ctgaggaacg tggctctagt 1561 gtctggagac actgagaacg ccaagggcca gggggagcag ggctcaactg gaggaacaaa 1621 tatcagctcc acctctgagc ccaaggaaga atgagtcacc ccagagccct agagggtcag 1681 atg // LOCUS HSU66033 2558 bp mRNA PRI 05-MAR-1997 DEFINITION Human glypican-5 (GPC5) mRNA, complete cds. ACCESSION U66033 NID g1864084 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2558) AUTHORS Veugelers,M., Vermeesch,J., Reekmans,G., Steinfeld,R., Marynen,P. and David,G. TITLE Characterization of glypican-5 and chromosomal localization of human GPC5, a new member of the glypican gene family JOURNAL Genomics 40 (1), 24-30 (1997) MEDLINE 97224481 REFERENCE 2 (bases 1 to 2558) AUTHORS Veugelers,M. and David,G. TITLE Direct Submission JOURNAL Submitted (03-AUG-1996) Center for Human Genetics, University of Leuven, Herestraat 49, Leuven B3000, Belgium FEATURES Location/Qualifiers source 1..2558 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="13" /map="13q32" /dev_stage="fetal" /tissue_type="brain" gene 15..1733 /gene="GPC5" CDS 15..1733 /gene="GPC5" /note="glycosyl phosphatidylinositol-anchored (glypiated) cell surface proteoglycan; putative cell surface receptor for growth and adhesion factors; heparan sulfate proteoglycan" /codon_start=1 /product="glypican-5" /db_xref="PID:g1864085" /translation="MDAQTWPVGFRCLLLLALVGSARSEGVQTCEEVRKLFQWRLLGA VRGLPDSPRAGPDLQVCISKKPTCCTRKMEERYQIAARQDMQQFLQTSSSTLKFLISR NAAAFQETLETLIKQAENYTSILFCSTYRNMALEAAASVQEFFTDVGLYLFGADVNPE EFVNRFFDSLFPLVYNHLINPGVTDSSLEYSECIRMARRDVSPFGNIPQRVMGQMGRS LLPSRTFLQALNLGIEVINTTDYLHFSKECSRALLKMQYCPHCQGLALTKPCMGYCLN VMRGCLAHMAELNPHWHAYIRSLEELSDAMHGTYDIGHVLLNFHLLVNDAVLQAHLNG QKLLEQVNRICGRPVRTPTQSPRCSFDQSKEKHGMKTTTRNSEETLANRRKEFINSLR LYRSFYGGLADQLCANELAAADGLPCWNGEDIVKSYTQRVVGNGIKAQSGNPEVKVKG IDPVINQIIDKLKHVVQLLQGRSPKPDKWELLQLGSGGGMVEQVSGDCDDEDGCGGSG SGEVKRTLKITDWMPDDMNFSDVKQIHQTDTGSTLDTTGAGCAVATESMTFTLISVVM LLPGIW" BASE COUNT 758 a 526 c 572 g 702 t ORIGIN 1 ccaggacggc gaggatggac gcacagacct ggcccgtggg ctttcgctgc ctcctccttc 61 tggccctggt tgggtccgcc cgcagcgagg gcgtgcagac ctgcgaagaa gttcggaaac 121 ttttccagtg gcggctgctg ggagctgtca gggggctgcc ggattcgccg cgggcaggac 181 ctgatcttca ggtttgcata tccaaaaagc ctacatgttg caccaggaag atggaggaga 241 gatatcagat tgcggctcgc caggatatgc agcagtttct tcaaacgtcc agctctacat 301 taaagtttct aatatctcga aatgcggctg cttttcaaga aacccttgaa actctcatca 361 aacaagcaga aaattacacc agtatacttt tttgcagtac ctacaggaac atggccttgg 421 aggctgctgc ttcggttcag gagttcttca ctgatgtggg gctgtattta tttggtgcgg 481 atgttaatcc tgaagaattt gtaaacagat tttttgacag tctttttcct ctggtctaca 541 accacctcat taaccctggt gtgactgaca gttccctgga atactcagaa tgcatccgga 601 tggctcgccg ggatgtgagt ccatttggta atattcccca aagagtaatg ggacagatgg 661 ggaggtccct gctgcccagc cgcacttttc tgcaggcact caatctgggc attgaagtca 721 tcaacaccac agactatctg cacttctcca aagagtgcag cagagccctc ctgaagatgc 781 aatactgccc gcactgccaa ggcctggcgc tcactaagcc ttgtatggga tactgcctca 841 atgtcatgcg aggctgcctg gcgcacatgg cggagcttaa tccacactgg catgcatata 901 tccggtcgtt ggaagaactc tcggatgcaa tgcatggaac atacgacatt ggacacgtgc 961 tgctgaactt tcacttgctt gttaatgatg ctgtgttaca ggctcacctc aatggacaaa 1021 aattattgga acaggtaaat aggatttgtg gccgccctgt aagaacaccc acacaaagcc 1081 cccgttgttc ttttgatcag agcaaagaga agcatggaat gaagaccacc acaaggaaca 1141 gtgaagagac gcttgccaac agaagaaaag aatttatcaa cagccttcga ctgtacaggt 1201 cattctatgg aggtctagct gatcagcttt gtgctaatga attagctgct gcagatggac 1261 ttccctgctg gaatggagaa gatatagtaa aaagttatac tcagcgtgtg gttggaaatg 1321 gaatcaaagc ccagtctgga aatcctgaag tcaaagtcaa aggaattgat cctgtgataa 1381 atcagattat tgataaactg aagcatgttg ttcagttgtt acagggtaga tcacccaaac 1441 ctgacaagtg ggaacttctt cagctgggca gtggtggagg catggttgaa caagtcagtg 1501 gggactgtga tgatgaagat ggttgcgggg gatcaggaag tggagaagtc aagaggacac 1561 tgaagatcac agactggatg ccagatgata tgaacttcag tgatgtaaag caaatccatc 1621 aaacagacac tggcagtact ttagacacaa caggagcagg atgtgcagtg gcgactgaat 1681 ctatgacatt cactctgata agtgtggtga tgttacttcc cgggatttgg taactgaact 1741 cttctgtcct gacatacctt actgaagtct cgatttcttc tctctctgca tatgcctgga 1801 ataagagatc ctttttcaat gtaacaatta tatttatgaa aagatatgtt acactaactt 1861 ctcagaagcc aagctgaaat attcataaag tccctaaaac tcaacgttta aatgacacac 1921 tttaaaaata tgtctttttt caatctaact gaaaaccttc ttaacttcta atatattaaa 1981 tctgaagatg tgaagggcac agaagtgact ttgaataaga agaatttagt gtatctgtaa 2041 ttttattatc aattccaagc cccttccttt ctaaattaaa aatgttttca tttgaaagtg 2101 tatttgccag acaatgaaaa cagtatgcag tatttcttaa agtattgaaa ttagaatatc 2161 atgaaataaa tcaaaacata caatggcaag tagtatgcat gcatattcaa gagactcttc 2221 catttttgca agctgtagaa ggaaatgtct gaatgtctat aagttatggg gtagattctt 2281 gagaagcatt tcatataatt tcactgaaga accttgataa ttttgaccca ctgtaactta 2341 gccactgatg aaccttaaag ctgagtattt tattaacacc tgatttgtat tctattatat 2401 tcaaaatgca tctttggtat tgtgcctctg ctcccatctc tctctttgcc tcatagattt 2461 agctatgttg ggaagcacat gcttgctcta ggaatatctc caataaagct gttaactatt 2521 tggtggaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa // LOCUS HSU66197 732 bp mRNA PRI 15-NOV-1996 DEFINITION Human fibroblast growth factor homologous factor 1 (FHF-1) mRNA, complete cds. ACCESSION U66197 NID g1563884 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 732) AUTHORS Smallwood,P.M., Munoz-Sanjuan,I., Tong,P., Macke,J.P., Hendry,S.H., Gilbert,D.J., Copeland,N.G., Jenkins,N.A. and Nathans,J. TITLE Fibroblast growth factor (FGF) homologous factors: new members of the FGF family implicated in nervous system development JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (18), 9850-9857 (1996) MEDLINE 96382556 REFERENCE 2 (bases 1 to 732) AUTHORS Smallwood,P.M., Munoz-Sanjuan,I., Tong,P., Macke,J.P., Hendry,S.H.C., Gilbert,D.J., Copeland,N.G., Jenkins,N.A. and Nathans,J. TITLE Direct Submission JOURNAL Submitted (06-AUG-1996) Molecular Biology and Genetics, HHMI/Johns Hopkins, 725 North Wolfe Street PCTB 805, Baltimore, MD 21205, USA FEATURES Location/Qualifiers source 1..732 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="3" gene 1..732 /gene="FHF-1" CDS 1..732 /gene="FHF-1" /codon_start=1 /product="fibroblast growth factor homologous factor 1" /db_xref="PID:g1563885" /translation="MAAAIASSLIRQKRQARESNSDRVSASKRRSSPSKDGRSLCERH VLGVFSKVRFCSGRKRPVRRRPEPQLKGIVTRLFSQQGYFLQMHPDGTIDGTKDENSD YTLFNLIPVGLRVVAIQGVKASLYVAMNGEGYLYSSDVFTPECKFKESVFENYYVIYS STLYRQQESGRAWFLGLNKEGQIMKGNRVKKTKPSSHFVPKPIEVCMYREPSLHEIGE KQGRSRKSSGTPTMNGGKVVNQDST" BASE COUNT 210 a 177 c 197 g 148 t ORIGIN 1 atggctgcgg cgatagccag ctccttgatc cggcagaagc ggcaggcgag ggagtccaac 61 agcgaccgag tgtcggcctc caagcgccgc tccagcccca gcaaagacgg gcgctccctg 121 tgcgagaggc acgtcctcgg ggtgttcagc aaagtgcgct tctgcagcgg ccgcaagagg 181 ccggtgaggc ggagaccaga accccagctc aaagggattg tgacaaggtt attcagccag 241 cagggatact tcctgcagat gcacccagat ggtaccattg atgggaccaa ggacgaaaac 301 agcgactaca ctctcttcaa tctaattccc gtgggcctgc gtgtagtggc catccaagga 361 gtgaaggcta gcctctatgt ggccatgaat ggtgaaggct atctctacag ttcagatgtt 421 ttcactccag aatgcaaatt caaggaatct gtgtttgaaa actactatgt gatctattct 481 tccacactgt accgccagca agaatcaggc cgagcttggt ttctgggact caataaagaa 541 ggtcaaatta tgaaggggaa cagagtgaag aaaaccaagc cctcatcaca ttttgtaccg 601 aaacctattg aagtgtgtat gtacagagaa ccatcgctac atgaaattgg agaaaaacaa 661 gggcgttcaa ggaaaagttc tggaacacca accatgaatg gaggcaaagt tgtgaatcaa 721 gattcaacat ag // LOCUS HSU66198 738 bp mRNA PRI 15-NOV-1996 DEFINITION Human fibroblast growth factor homologous factor 2 (FHF-2) mRNA, complete cds. ACCESSION U66198 NID g1563886 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 738) AUTHORS Smallwood,P.M., Munoz-Sanjuan,I., Tong,P., Macke,J.P., Hendry,S.H., Gilbert,D.J., Copeland,N.G., Jenkins,N.A. and Nathans,J. TITLE Fibroblast growth factor (FGF) homologous factors: new members of the FGF family implicated in nervous system development JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (18), 9850-9857 (1996) MEDLINE 96382556 REFERENCE 2 (bases 1 to 738) AUTHORS Smallwood,P.M., Munoz-Sanjuan,I., Tong,P., Macke,J.P., Hendry,S.H.C., Gilbert,D.J., Copeland,N.G., Jenkins,N.A. and Nathans,J. TITLE Direct Submission JOURNAL Submitted (06-AUG-1996) Molecular Biology and Genetics, HHMI/Johns Hopkins, 725 North Wolfe Street PCTB 805, Baltimore, MD 21205, USA FEATURES Location/Qualifiers source 1..738 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" gene 1..738 /gene="FHF-2" CDS 1..738 /gene="FHF-2" /codon_start=1 /product="fibroblast growth factor homologous factor 2" /db_xref="PID:g1563887" /translation="MAAAIASSLIRQKRQAREREKSNACKCVSSPSKGKTSCDKNKLN VFSRVKLFGSKKRRRRRPEPQLKGIVTKLYSRQGYHLQLQADGTIDGTKDEDSTYTLF NLIPVGLRVVAIQGVQTKLYLAMNSEGYLYTSELFTPECKFKESVFENYYVTYSSMIY RQQQSGRGWYLGLNKEGEIMKGNHVKKNKPAAHFLPKPLKVAMYKEPSLHDLTEFSRS GSGTPTKSRSVSGVLNGGKSMSHNEST" BASE COUNT 222 a 184 c 188 g 144 t ORIGIN 1 atggcggcgg ctatcgccag ctcgctcatc cgtcagaaga ggcaagcccg cgagcgcgag 61 aaatccaacg cctgcaagtg tgtcagcagc cccagcaaag gcaagaccag ctgcgacaaa 121 aacaagttaa atgtcttttc ccgggtcaaa ctcttcggct ccaagaagag gcgcagaaga 181 agaccagagc ctcagcttaa gggtatagtt accaagctat acagccgaca aggctaccac 241 ttgcagctgc aggcggatgg aaccattgat ggcaccaaag atgaggacag cacttacact 301 ctgtttaacc tcatccctgt gggtctgcga gtggtggcta tccaaggagt tcaaaccaag 361 ctgtacttgg caatgaacag tgagggatac ttgtacacct cggaactttt cacacctgag 421 tgcaaattca aagaatcagt gtttgaaaat tattatgtga catattcatc aatgatatac 481 cgtcagcagc agtcaggccg agggtggtat ctgggtctga acaaagaagg agagatcatg 541 aaaggcaacc atgtgaagaa gaacaagcct gcagctcatt ttctgcctaa accactgaaa 601 gtggccatgt acaaggagcc atcactgcac gatctcacgg agttctcccg atctggaagc 661 gggaccccaa ccaagagcag aagtgtctct ggcgtgctga acggaggcaa atccatgagc 721 cacaatgaat caacgtag // LOCUS HSU66199 678 bp mRNA PRI 15-NOV-1996 DEFINITION Human fibroblast growth factor homologous factor 3 (FHF-3) mRNA, complete cds. ACCESSION U66199 NID g1563888 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 678) AUTHORS Smallwood,P.M., Munoz-Sanjuan,I., Tong,P., Macke,J.P., Hendry,S.H., Gilbert,D.J., Copeland,N.G., Jenkins,N.A. and Nathans,J. TITLE Fibroblast growth factor (FGF) homologous factors: new members of the FGF family implicated in nervous system development JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (18), 9850-9857 (1996) MEDLINE 96382556 REFERENCE 2 (bases 1 to 678) AUTHORS Smallwood,P.M., Munoz-Sanjuan,I., Tong,P., Macke,J.P., Hendry,S.H.C., Gilbert,D.J., Copeland,N.G., Jenkins,N.A. and Nathans,J. TITLE Direct Submission JOURNAL Submitted (06-AUG-1996) Molecular Biology and Genetics, HHMI/Johns Hopkins, 725 North Wolfe Street PCTB 805, Baltimore, MD 21205, USA FEATURES Location/Qualifiers source 1..678 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17" gene 1..678 /gene="FHF-3" CDS 1..678 /gene="FHF-3" /codon_start=1 /product="fibroblast growth factor homologous factor 3" /db_xref="PID:g1563889" /translation="MAALASSLIRQKREVREPGGSRPVSAQRRVCPRGTKSLCQKQLL ILLSKVRLCGGRPARPDRGPEPQLKGIVTKLFCRQGFYLQANPDGSIQGTPEDTSSFT HFNLIPVGLRVVTIQSAKLGHYMAMNAEGLLYSSPHFTAECRFKECVFENYYVLYASA LYRQRRSGRAWYLGLDKEGQVMKGNRVKKTKAAAHFLPKLLEVAMYQEPSLHSVPEAS PSSPPAP" BASE COUNT 119 a 232 c 201 g 126 t ORIGIN 1 atggcggcgc tggccagtag cctgatccgg cagaagcggg aggtccgcga gcccgggggc 61 agccggccgg tgtcggcgca gcggcgcgtg tgtccccgcg gcaccaagtc cctttgccag 121 aagcagctcc tcatcctgct gtccaaggtg cgactgtgcg gggggcggcc cgcgcggccg 181 gaccgcggcc cggagcctca gctcaaaggc atcgtcacca aactgttctg ccgccagggt 241 ttctacctcc aggcgaatcc cgacggaagc atccagggca ccccagagga taccagctcc 301 ttcacccact tcaacctgat ccctgtgggc ctccgtgtgg tcaccatcca gagcgccaag 361 ctgggtcact acatggccat gaatgctgag ggactgctct acagttcgcc gcatttcaca 421 gctgagtgtc gctttaagga gtgtgtcttt gagaattact acgtcctgta cgcctctgct 481 ctctaccgcc agcgtcgttc tggccgggcc tggtacctcg gcctggacaa ggagggccag 541 gtcatgaagg gaaaccgagt taagaagacc aaggcagctg cccactttct gcccaagctc 601 ctggaggtgg ccatgtacca ggagccttct ctccacagtg tccccgaggc ctccccttcc 661 agtccccctg ccccctga // LOCUS HSU66200 744 bp mRNA PRI 15-NOV-1996 DEFINITION Human fibroblast growth factor homologous factor 4 (FHF-4) mRNA, complete cds. ACCESSION U66200 NID g1563890 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 744) AUTHORS Smallwood,P.M., Munoz-Sanjuan,I., Tong,P., Macke,J.P., Hendry,S.H., Gilbert,D.J., Copeland,N.G., Jenkins,N.A. and Nathans,J. TITLE Fibroblast growth factor (FGF) homologous factors: new members of the FGF family implicated in nervous system development JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (18), 9850-9857 (1996) MEDLINE 96382556 REFERENCE 2 (bases 1 to 744) AUTHORS Smallwood,P.M., Munoz-Sanjuan,I., Tong,P., Macke,J.P., Hendry,S.H.C., Gilbert,D.J., Copeland,N.G., Jenkins,N.A. and Nathans,J. TITLE Direct Submission JOURNAL Submitted (06-AUG-1996) Molecular Biology and Genetics, HHMI/Johns Hopkins, 725 North Wolfe Street PCTB 805, Baltimore, MD 21205, USA FEATURES Location/Qualifiers source 1..744 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="13" gene 1..744 /gene="FHF-4" CDS 1..744 /gene="FHF-4" /codon_start=1 /product="fibroblast growth factor homologous factor 4" /db_xref="PID:g1563891" /translation="MAAAIASGLIRQKRQAREQHWDRPSASRRRSSPSKNRGLCNGNL VDIFSKVRIFGLKKRRLRRQDPQLKGIVTRLYCRQGYYLQMHPDGALDGTKDDSTNST LFNLIPVGLRVVAIQGVKTGLYIAMNGEGYLYPSELFTPECKFKESVFENYYVIYSSM LYRQQESGRAWFLGLNKEGQAMKGNRVKKTKPAAHFLPKPLEVAMYREPSLHDVGETV PKPGVTPSKSTSASAIMNGGKPVNKSKTT" BASE COUNT 217 a 178 c 195 g 154 t ORIGIN 1 atggccgcgg ccatcgctag cggcttgatc cgccagaagc ggcaggcgcg ggagcagcac 61 tgggaccggc cgtctgccag caggaggcgg agcagcccca gcaagaaccg cgggctctgc 121 aacggcaacc tggtggatat cttctccaaa gtgcgcatct tcggcctcaa gaagcgcagg 181 ttgcggcgcc aagatcccca gctcaagggt atagtgacca ggttatattg caggcaaggc 241 tactacttgc aaatgcaccc cgatggagct ctcgatggaa ccaaggatga cagcactaat 301 tctacactct tcaacctcat accagtggga ctacgtgttg ttgccatcca gggagtgaaa 361 acagggttgt atatagccat gaatggagaa ggttacctct acccatcaga actttttacc 421 cctgaatgca agtttaaaga atctgttttt gaaaattatt atgtaatcta ctcatccatg 481 ttgtacagac aacaggaatc tggtagagcc tggtttttgg gattaaataa ggaagggcaa 541 gctatgaaag ggaacagagt aaagaaaacc aaaccagcag ctcattttct acccaagcca 601 ttggaagttg ccatgtaccg agaaccatct ttgcatgatg ttggggaaac ggtcccgaag 661 cctggggtga cgccaagtaa aagcacaagt gcgtctgcaa taatgaatgg aggcaaacca 721 gtcaacaaga gtaagacaac atag // LOCUS HSU66359 1716 bp mRNA PRI 09-NOV-1996 DEFINITION Human T54 protein (T54) mRNA, complete cds. ACCESSION U66359 NID g1663763 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1716) AUTHORS Schindelhauer,D., Hellebrand,H., Grimm,L., Bader,I., Meitinger,T., Wehnert,M., Ross,M. and Meindl,A. TITLE Long range map of a 3.5 Mb region in Xp11.23-22 with a sequence ready map from a 1.1 Mb gene-rich interval JOURNAL Genome Res. (1996) In press REFERENCE 2 (bases 1 to 1716) AUTHORS Grimm,L. TITLE Direct Submission JOURNAL Submitted (08-AUG-1996) Abteilung fuer paediatrische Genetik, Kinderpoliklinik, LMU Muenchen, Goethestr. 29, Muenchen 80336, Germany FEATURES Location/Qualifiers source 1..1716 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /map="Xp11.23; between SYP and TFE-3" gene 61..1197 /gene="T54" CDS 61..1197 /gene="T54" /codon_start=1 /product="T54 protein" /db_xref="PID:g1663764" /translation="MADSKEGVLPLTLLPLPQFHSASLARPHGGAGRLERRRGAISGG KGFLENRGREGAAECEAPGGPQGTRHPFDPELAIAGSHQPGPLQVHRYWGLADGVVSQ AVKELIAESKKSLEEGKNAGVDPTLAIPMIQKGCTPSGEGADSEPRAETVPEEANYEA VPVEAYGWPCCGAWAGNLARHRPHLQSSSEAPCQLTEAKGLGLGCQPDRAQALTPTGP SRMPRPDEEQEKDKEDQPQGLVPGGAVVVLSGPHRGLYGKVEGLDPDNVRAMVRLAVW SRVVTVSEYYCGLSPSRSLTRTPWISGNRTELPHARKTLWNQELYIQQDNSERKRKHL PDRQDGLQPRVRKQPPEVSTGCTGTCVCGLWTTCTKEANITTPR" BASE COUNT 442 a 441 c 522 g 311 t ORIGIN 1 gatctgaacc caaactaaat ttcccagcaa gcagcgcgcc ggcctgggaa aaggagcaag 61 atggctgact ccaaagaggg tgttttgccg ctgacgctgc ttccactgcc ccaatttcat 121 tcggcttcac tcgcacgtcc gcacggaggc gctggccgac tcgagagacg gcgcggggcc 181 atctccggag gaaaaggatt tcttgaaaac cgtggaaggg agggagctgc agagtgtgaa 241 gccccaggag gcccccaagg aactcgtcat ccctttgatc cagaattggc catcgcaggc 301 agccaccagc ccggccccct gcaggtccac agatactggg gccttgcgga tggggtggtg 361 tcccaggctg tgaaggagct cattgcggaa tccaagaagt ctctggaaga aggaaagaat 421 gcgggtgtcg accccacgct cgctatcccc atgatccaga aaggatgcac ccccagcggg 481 gaaggggcag acagcgaacc ccgggcagag acagtgccag aggaggctaa ttatgaggcg 541 gtccccgtgg aggcctatgg gtggccatgc tgcggggcat gggctggaaa cctagcgagg 601 catcggccgc accttcaatc aagtagtgaa gccccgtgtc aactcactga ggccaagggg 661 ttagggctgg ggtgccaacc tgaccgagcc caggccttga cccccactgg cccctcccgc 721 atgccaagac cagatgagga gcaagagaaa gataaggaag atcagcctca agggctggtg 781 cctggaggag ctgtggtggt tctttctggc cctcaccgag gcctctatgg gaaggtggaa 841 ggccttgatc ctgacaatgt tcgggccatg gttcgtctgg ctgtgtggag ccgggtggtg 901 actgttagtg agtactactg cggcctgtct cccagcagga gtttgacaag aacaccttgg 961 atctcaggca acagaacgga actgcctcat gcacggaaga ccctctggaa tcaagaactc 1021 tacatccagc aggacaactc agagaggaag cggaaacacc ttccagaccg acaggatggc 1081 ctgcagccaa gagtgagaaa gcagccccca gaagtcagca ctggttgcac agggacctgc 1141 gtgtgcggtt tgtggacaac atgtacaaag gaggccaata ttacaacacc aagatgataa 1201 ttgaagatgt cctaagccca gatacctgtg tatgtcggac agatgaaggc cgagtcctgg 1261 aaggcctgag ggaagacatg ctggagaccc tggttcccaa ggcagagggt gaccgtgtga 1321 tggtggtgct gggcccacag actggaaggg tgggacattt gctgagccgg gacagagcac 1381 ggtgacccgg gatttggtgc aactgccaag agaaaatcag gtggtggtga cttcactacg 1441 atgccatctg ccagtacatg ggccctagtg acacagatga tgactgaccc atgggactcc 1501 tcccatcccc caggctggta ccagttctgt accatatgag aaagttgcct tcagaaggtg 1561 ggaagatcat tgttccatcc tctacttctg gtgcagtcct gggacaagga caagggaaag 1621 ggatgggtga accagtaggg aagctagaaa caaacccaat atttaccaaa atttaagggt 1681 ataataaaaa ccatttcaag tacttaataa aaaaaa // LOCUS HSU66468 1175 bp mRNA PRI 12-DEC-1996 DEFINITION Human cell growth regulator CGR11 mRNA, complete cds. ACCESSION U66468 NID g1724070 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1175) AUTHORS Madden,S.L., Galella,E.A., Riley,D., Bertelsen,A.H. and Beaudry,G.A. TITLE Induction of cell growth regulatory genes by p53 JOURNAL Cancer Res. 56 (23), 5384-5390 (1996) MEDLINE 97122496 REFERENCE 2 (bases 1 to 1175) AUTHORS Madden,S.L. TITLE Direct Submission JOURNAL Submitted (08-AUG-1996) Stephen L. Madden, Molecular & Cellular Biology, PharmaGenics, Inc., 4 Pearl Court, Allendale, NJ 07401, USA FEATURES Location/Qualifiers source 1..1175 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hCGR11" /tissue_type="brain" /dev_stage="fetal" CDS 76..981 /codon_start=1 /product="cell growth regulator CGR11" /db_xref="PID:g1724071" /translation="MLPLTMTVLILLLLPTGQAAPKDGVTRPDSEVQHQLLPNPFQPG QEQLGLLQSYLKGLGRTEVQLEHLSREQVLLYLFALHDYDQSGQLDGLELLSMLTAAL APGAANSPTTNPVILIVDKVLETQDLNGDGLMTPAELINFPGVALRHVEPGEPLAPSP QEPQAVGRQSLLAKSPLRQETQEAPGPREEAKGQVEARRESLDPVQEPGGQAEADGDV PGPRGEAEGQAEAKGDAPGPRGEAEGQAEAKGDAPGPRGEAGGQAEARENGEEAKELP GETLESKNTQNDFEVHIVQVENDEI" polyA_signal 1155..1160 BASE COUNT 300 a 312 c 366 g 197 t ORIGIN 1 gggcggcgca cgagcaggag cgcccacgga gctggacccc cagagccgcg cgcccgcgca 61 gcagttccag gaaggatgtt acctttgacg atgacagtgt taatcctgct gctgctcccc 121 acgggtcagg ctgccccaaa ggatggagtc acaaggccag actctgaagt gcagcatcag 181 ctcctgccca accccttcca gccaggccag gagcagctcg gacttctgca gagctaccta 241 aagggactag gaaggacaga agtgcaactg gagcatctga gccgggagca ggttctcctc 301 tacctctttg ccctccatga ctatgaccag agtggacagc tggatggcct ggagctgctg 361 tccatgttga cagctgctct ggcccctgga gctgccaact ctcctaccac caacccggtg 421 atattgatag tggacaaagt gctcgagacg caggacctga atggggatgg gctcatgacc 481 cctgctgagc tcatcaactt cccgggagta gccctcaggc acgtggagcc cggagagccc 541 cttgctccat ctcctcagga gccacaagct gttggaaggc agtccctatt agctaaaagc 601 ccattaagac aagaaacaca ggaagcccct ggtcccagag aagaagcaaa gggccaggta 661 gaggccagaa gggagtcttt ggatcctgtc caggagcctg ggggccaggc agaggctgat 721 ggagatgttc cagggcccag aggggaagct gagggccagg cagaggctaa aggagatgcc 781 cctgggccca gaggggaagc tgagggccag gcagaggcta aaggagatgc ccctgggccc 841 agaggggaag ctgggggcca ggcagaggcc agggagaatg gagaggaggc caaggaactt 901 ccaggggaaa cactggagtc taagaacacc caaaatgact ttgaggtgca cattgttcaa 961 gtggagaatg atgagatcta gatcttaaga tacaggtacc cacagaagtc tcagtgccag 1021 aacataagcc ctgaagtggg caggggaaat gtacgctggg acaaggacca tctctgtgcc 1081 ccctgtctgg tcccagtagg tatcaggtct ttctatgcag ctcagggaga ccctaagtta 1141 aggggcagat taccaataaa gaactgaatg aattc // LOCUS HSU66469 1258 bp mRNA PRI 12-DEC-1996 DEFINITION Human cell growth regulator CGR19 mRNA, complete cds. ACCESSION U66469 NID g1724072 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1258) AUTHORS Madden,S.L., Galella,E.A., Riley,D., Bertelsen,A.H. and Beaudry,G.A. TITLE Induction of cell growth regulatory genes by p53 JOURNAL Cancer Res. 56 (23), 5384-5390 (1996) MEDLINE 97122496 REFERENCE 2 (bases 1 to 1258) AUTHORS Madden,S.L. TITLE Direct Submission JOURNAL Submitted (08-AUG-1996) Stephen L. Madden, Molecular & Cellular Biology, PharmaGenics, Inc., 4 Pearl Court, Allendale, NJ 07401, USA FEATURES Location/Qualifiers source 1..1258 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hCGR19" /tissue_type="brain" /dev_stage="fetal" CDS 28..1026 /codon_start=1 /product="cell growth regulator CGR19" /db_xref="PID:g1724073" /translation="MAAVFLVTLYEYSPLFYIAVVFTCFIVTTGLVLGWFGWDVPVIL RNSEETQFSTRVFKKQMRQVKNPFGLEITNPSSASITTGITLTTDCLEDSLLTCYWGC SVQKLYEALQKHVYCFRISTPQALEDALYSEYLYQEQYFIKKDSKEEIYCQLPRDTKI EDFGTVPRSRYPLVALLTLADEDDREIYDIISMVSVIHIPDRTYKLSCRILYQYLLLA QGQFHDLKQLFMSANNNFTPSNNSSSEEKNTDRSLLEKVGLSESEVEPSEENSKDCVV CQNGTVNWVLLPCRHTCLCDGCVKYFQQCPMCRQFVQESFALCSQKEQDKDKPKTL" polyA_signal 1231..1236 BASE COUNT 394 a 239 c 253 g 372 t ORIGIN 1 ccgggctcta cccagagcaa gaccctgatg gctgcggtgt ttctggtaac gctttatgaa 61 tactcgccgc ttttctacat cgcggtggtc tttacctgct tcatcgtgac caccggcctg 121 gtattgggat ggtttggttg ggatgttcca gtaattctga gaaattcaga agagacccag 181 ttcagcacaa gagttttcaa aaagcaaatg agacaagtca agaatccttt tggcttagag 241 atcactaatc catcttcagc ttcaattaca actggcataa ccttgacaac agattgcctt 301 gaagatagcc tccttacatg ctactggggg tgcagtgttc aaaaattata tgaagctctg 361 cagaagcatg tttattgctt cagaataagc actccccaag cattagaaga tgctctgtat 421 agtgaatatc tctatcagga acagtatttt attaaaaagg atagcaaaga agaaatatat 481 tgccagttac caagagatac taaaattgaa gactttggta cagtacccag atctcgctat 541 ccattggtag cgctattgac cttagctgat gaggatgacc gggaaattta tgatattatt 601 tccatggtgt cagtgattca tattcctgat aggacttata aactatcctg cagaatattg 661 tatcaatatt tactcttggc tcaaggtcaa tttcatgatc ttaagcaact tttcatgtct 721 gcaaataata atttcactcc ctccaacaat tcctcttcag aagaaaaaaa cacagacaga 781 agtttgttgg aaaaggtggg actctctgaa agtgaagttg agccatcgga agagaacagc 841 aaggactgtg ttgtttgcca gaatgggact gtgaactggg tactcttacc atgcagacac 901 acatgcctgt gtgatggctg tgtgaagtat tttcagcagt gcccaatgtg caggcagttt 961 gttcaggaat cttttgcact ttgcagtcaa aaagagcaag ataaagacaa accgaagact 1021 ctttgaagac atcgtaacac tgaaaagtac actttctact aaagatgcag aaattgatga 1081 tcttggaatt catcataaca tggaatctac agtactgacc atcaatgaaa attatatttt 1141 aacttcatat ttgtatggta cttggatgat aaaaattaat tattcctttc tgcttagtga 1201 atgaatactg gaatccatct gtgttgatac aataaaaatt cattcaactc ttgaaaag // LOCUS HSU66580 1160 bp DNA PRI 15-MAY-1997 DEFINITION Human putative G protein-coupled receptor (GPR21) gene, complete cds. ACCESSION U66580 NID g1753104 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1160) AUTHORS O'Dowd,B.F., Nguyen,T., Jung,B.P., Marchese,A., Cheng,R., Heng,H.H., Kolakowski,L.F. Jr., Lynch,K.R. and George,S.R. TITLE Cloning and chromosomal mapping of four putative novel human G-protein-coupled receptor genes JOURNAL Gene 187 (1), 75-81 (1997) MEDLINE 97225799 REFERENCE 2 (bases 1 to 1160) AUTHORS O'Dowd,B.F., Nguyen,T., Jung,B., Marchese,A., Cheng,R., Heng,H.H.Q., Kolakowski,L.F. Jr., Lynch,K.R. and George,S.R. TITLE Direct Submission JOURNAL Submitted (12-AUG-1996) Department of Pharmacology, University of Toronto, 8 Taddle Creek Rd., Toronto, Ontario M5S 1A8, Canada FEATURES Location/Qualifiers source 1..1160 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="9" /map="9q33" gene 41..1090 /gene="GPR21" CDS 41..1090 /gene="GPR21" /codon_start=1 /product="putative G protein-coupled receptor" /db_xref="PID:g1753105" /translation="MNSTLDGNQSSHPFCLLAFGYLETVNFCLLEVLIIVFLTVLIIS GNIIVIFVFHCAPLLNHHTTSYFIQTMAYADLFVGVSCVVPSLSLLHHPLPVEESLTC QIFGFVVSVLKSVSMASLACISIDRYIAITKPLTYNTLVTPWRLRLCIFLIWLYSTLV FLPSFFHWGKPGYHGDVFQWCAESWHTDSYFTLFIVMMLYAPAALIVCFTYFNIFRIC QQHTKDISERQARFSSQSGETGEVQACPDKRYAMVLFRITSVFYILWLPYIIYFLLES STGHSNRFASFLTTWLAISNSFCNCVIYSLSNSVFQRGLKRLSGAMCTSCASQTTAND PYTVRSKGPLNGCHI" BASE COUNT 254 a 283 c 248 g 375 t ORIGIN 1 aagcggcagc atgaagtgac agatcactcc tgagctcaag atgaactcca ccttggatgg 61 taatcagagc agccaccctt tttgcctctt ggcatttggc tatttggaaa ctgtcaattt 121 ttgccttttg gaagtattga ttattgtctt tctaactgta ttgattattt ctggcaacat 181 cattgtgatt tttgtatttc actgtgcacc tttgttgaac catcacacta caagttattt 241 tatccagact atggcatatg ctgacctttt tgttggggtg agctgcgtgg tcccttcttt 301 atcactcctc catcaccccc ttccagtaga ggagtccttg acttgccaga tatttggttt 361 tgtagtatca gttctgaaga gcgtctccat ggcttctctg gcctgtatca gcattgatag 421 atacattgcc attactaaac ctttaaccta taatactctg gttacaccct ggagactacg 481 cctgtgtatt ttcctgattt ggctatactc gaccctggtc ttcctgcctt cctttttcca 541 ctggggcaaa cctggatatc atggagatgt gtttcagtgg tgtgcggagt cctggcacac 601 cgactcctac ttcaccctgt tcatcgtgat gatgttatat gccccagcag cccttattgt 661 ctgcttcacc tatttcaaca tcttccgcat ctgccaacag cacacaaagg atatcagcga 721 aaggcaagcc cgcttcagca gccagagtgg ggagactggg gaagtgcagg cctgtcctga 781 taagcgctat gccatggtcc tgtttcgaat cactagtgta ttttacatcc tctggttgcc 841 atatatcatc tacttcttgt tggaaagctc cactggccac agcaaccgct tcgcatcctt 901 cttgaccacc tggcttgcta ttagtaacag tttctgcaac tgtgtaattt atagtctctc 961 caacagtgta ttccaaagag gactaaagcg cctctcaggg gctatgtgta cttcttgtgc 1021 aagtcagact acagccaacg acccttacac agttagaagc aaaggccctc ttaatggatg 1081 tcatatctga agtggctcag ttacggggtt cccgtgtgtg tgtgtgtgtg tgtgtgtgtg 1141 tgtgtgtatt ttatctctaa // LOCUS HSU66615 5190 bp mRNA PRI 18-SEP-1996 DEFINITION Human SWI/SNF complex 155 KDa subunit (BAF155) mRNA, complete cds. ACCESSION U66615 NID g1549238 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5190) AUTHORS Wang,W., Xue,Y., Zhou,S., Kuo,A., Cairns,B.R. and Crabtree,G.R. TITLE Diversity and specialization of mammalian SWI/SNF complexes JOURNAL Genes Dev. 10 (17), 2117-2130 (1996) MEDLINE 96397413 REFERENCE 2 (bases 1 to 5190) AUTHORS Wang,W., Cote,J., Xue,Y., Zhou,S., Khavari,P.A., Biggar,S.R., Muchardt,C., Kalpana,G.V., Goff,S.P., Yaniv,M., Workman,J.L. and Crabtree,G.R. TITLE Purification and biochemical heterogeneity of the mammalian SWI-SNF complex JOURNAL EMBO J. 15 (1996) In press REFERENCE 3 (bases 1 to 5190) AUTHORS Wang,W., Xue,Y., Zhou,S. and Crabtree,G.R. TITLE Direct Submission JOURNAL Submitted (12-AUG-1996) Howard Hughes Medical Institute, Stanford University, Beckman Center B207, Stanford, CA 94305-5428, USA FEATURES Location/Qualifiers source 1..5190 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="Jurkat T cells" gene 55..3369 /gene="BAF155" CDS 55..3369 /gene="BAF155" /note="a core subunit presents in all human SWI/SNF complexes purified so far; similar to subunit BAF170 and yeast SWI3 protein; contains a region similar to DNA binding domain of myb and a predicted leucine zipper; the C-terminus of the protein is highly proline-rich and somewhat glutamine-rich" /codon_start=1 /product="SWI/SNF complex 155 KDa subunit" /db_xref="PID:g1549239" /translation="MAAAAGGGGPGTAVGATGFGDSAAAAGLAVYRRKDGGPATKFWE SPETVSQLDSVRVWLGKHYKKYVHADAPTNKTLAGLVVQLLQFQEDAFGKHVTNPAFT KLPAKCFMDFKAGGALCHILGAAYKYKNEQGWRRFDLQNPSRMDRNVEMFMNIEKTLV QNNCLTRPNIYLIPDIDLKLANKLKDIIKRHQGTFTDEKSKASHHIYPYSSSQDDEEW LRPVMRKEKQVLVHWGFYPDSYDTWVHSNDVDAEIEDPPIPEKPWKVHVKWILDTDIF NEWMNEEDYEVDENRKPVSFRQRISTKNEEPVRSPERRDRKASANARKRKHSPSPPPP TPTESRKKSGKKGQASLYGKRRSQKEEDEQEDLTKDMEDPTPVPNIEEVVLPKNVNLK KDSENTPVKGGTVADLDEQDEETVTAGGKEDEDPAKGDQSRSVDLGEDNVTEQTNHII IPSYASWFDYNCIHVIERRALPEFFNGKNKSKTPEIYLAYRNFMIDSYRLNPQEYLTS TACRRNLTGDVCAVMRVHAGGEQWGLVNYQVDPESRPMAMGPPPTPHFNVLADTPLAC ASDLRSPQVPAAQQMLNFPEKNKEKPVDLQNFGLRTDIYSKKTLAKSKGASAGRGWTE QETLLLLEALEMYKDDWNKVSEHVGSRTQDECILHFLRLPIEDPYLENSDASLGPLAY QPVPFSQSGNPVMSTVAFLASVVDPRVASAAAKAALEEFSRVREEVPLELVEAHVKKV QEAARASGKVDPTYGLESSCIAGTGPDEPEKLEGAEEEKMEADPDGQQPEKAENKVEN ETDEGDKAQDGENEKNSEKEQDSEVSEDTKSEEKETEENKELSSTCKERESDTGKKKV EHEISEGNVATAAAAALASAATKAKHLAAVEERKIKSLVALLVETQMKKLEIKLRHFE GLETIMDREKEALEQQRQQLLTERQNFHMEQLKYAELRARQQMEQQQHGQNPQQAHQH SGGPGLAPLGAAGHPGMMPHQQPPPYPLMHHQMPPPHPPQPGQIPGPGSMMPGQHMPG RMIPTVAANIHPSGSGPTPPGMPPMPGNILGPRVPLTAPNGMYPPPPQQQPPPPPPAD GVPPPPAPGPPASAAP" BASE COUNT 1461 a 1168 c 1297 g 1264 t ORIGIN 1 ggaattcccg cgaggccggg gtgggccagg ctgtggggac gacgggctgc gacgatggcc 61 gcagcggcgg gcggcggcgg gccggggaca gcggtaggcg ccacgggctt cggggattcg 121 gcggcagccg caggcctagc tgtttatcga cggaaggatg ggggcccggc caccaagttt 181 tgggagagcc cggagacggt gtcccagctg gattcggtgc gggtctggct gggcaagcac 241 tacaagaagt atgttcatgc ggatgctcct accaataaaa cactggctgg gctggtggtg 301 cagcttcttc agttccagga agatgccttt gggaagcatg tcaccaaccc ggccttcacc 361 aaactccctg caaagtgttt catggatttc aaagctggag gcgccttatg tcacattctt 421 ggggctgctt acaagtataa aaatgaacag ggatggcgga ggtttgacct acagaaccca 481 tctcgaatgg atcgtaatgt ggaaatgttt atgaacattg aaaaaacatt ggtgcagaac 541 aattgtttga ccagacccaa catctacctc attccagaca ttgatctgaa gttggctaac 601 aaattgaaag atatcatcaa acgacatcag ggaacattta cggatgagaa gtcaaaagct 661 tcccaccaca tttacccata ttcttcctca caagacgatg aagaatggtt gagaccggtg 721 atgagaaaag agaagcaagt gttagtgcat tggggctttt acccagacag ctatgatact 781 tgggtccata gtaatgatgt tgatgctgaa attgaagatc caccaattcc agaaaaacca 841 tggaaggttc atgtgaaatg gattttggac actgatattt tcaatgaatg gatgaatgag 901 gaggattatg aggtggatga aaataggaag cctgtgagtt ttcgtcagcg gatttcaacc 961 aagaatgaag agccagtcag aagtccagaa agaagagata gaaaagcatc agctaatgct 1021 cgaaagagga aacattcgcc ttcgcctccc cctccgacac caacagaatc acggaagaag 1081 agtgggaaga aaggccaagc tagcctttat gggaagcgca gaagtcagaa agaggaagat 1141 gagcaagaag atctaaccaa ggatatggaa gacccaacac ctgtacccaa tatagaagaa 1201 gtagtacttc ccaaaaatgt gaacctaaag aaagatagtg aaaatacacc tgttaaagga 1261 ggaactgtag cggatctaga tgagcaggat gaagaaacag tcacagcagg aggaaaggaa 1321 gatgaagatc ctgccaaagg tgatcagagt cgatcagttg accttgggga agataatgtg 1381 acagagcaga ccaatcacat tattattcct agttatgcat catggtttga ttataactgt 1441 attcatgtga ttgaacggcg tgctcttcct gagttcttca atggaaaaaa caaatccaag 1501 actccagaaa tatacttggc atatcgaaat tttatgattg acagctatcg tctaaacccc 1561 caagagtatt taactagcac tgcttgtcgg aggaacttga ctggagatgt gtgtgctgtg 1621 atgagggtcc atgccggggg agagcagtgg ggactcgtta attaccaagt tgacccggaa 1681 agtagaccca tggcaatggg acctcctcct actcctcatt ttaatgtatt agctgatacc 1741 cctctggctt gtgcctctga tcttcgatca cctcaggttc ctgctgctca acagatgcta 1801 aattttcctg agaaaaacaa ggaaaaacca gttgatttgc agaactttgg tctccgtact 1861 gacatttact ccaagaaaac attagcaaag agtaaaggtg ctagtgctgg aagaggatgg 1921 actgaacagg agacccttct actcctggag gccctggaga tgtacaagga tgattggaac 1981 aaagtgtcgg aacatgttgg aagtcgtact caggatgaat gcatcctcca ctttttgaga 2041 cttcccattg aggacccata ccttgagaat tcagatgctt cccttgggcc tttggcctac 2101 cagcctgtcc ccttcagtca gtcaggaaat ccagttatga gtactgttgc ttttttggca 2161 tctgtggtgg accctcgcgt ggcatctgct gcagcaaaag cggctttgga ggagttttct 2221 cgggtccggg aggaggtacc actggaattg gttgaagctc atgtcaagaa agtacaagaa 2281 gcagcacgag cctctgggaa agtggatccc acctacggtc tggagagcag ctgcattgca 2341 ggcacagggc ccgatgagcc agagaagctt gaaggagctg aagaggaaaa aatggaagcc 2401 gaccctgatg gtcagcagcc tgaaaaggca gaaaataaag tggaaaatga aacggatgaa 2461 ggtgataaag cacaagatgg agaaaatgaa aaaaatagtg aaaaggaaca ggatagtgaa 2521 gtgagtgagg ataccaaatc agaagaaaag gagactgaag agaacaaaga actcagtagt 2581 acatgtaaag aaagagaaag tgatactggg aagaagaaag tagaacatga aatttccgaa 2641 ggaaatgttg ccacagccgc agcagctgct cttgcctcag cggctaccaa agccaagcac 2701 ctggctgcag tggaagaaag aaagatcaag tccctggtag ctctcttggt tgagacacaa 2761 atgaagaaac tagagatcaa acttcgacat tttgaagggc tggaaactat catggacaga 2821 gagaaagaag ctctagaaca acagaggcag cagttgctta ctgaacgcca aaacttccac 2881 atggaacagc tgaagtatgc tgaattacga gcacgacagc aaatggaaca gcagcagcat 2941 ggccagaacc ctcaacaggc acaccagcac tcaggaggac ctggcctggc cccacttgga 3001 gcagcagggc accctggcat gatgcctcat caacagcccc ctccctaccc tctgatgcac 3061 caccagatgc caccacctca tccaccccag ccaggtcaga taccaggccc aggttccatg 3121 atgcccgggc agcacatgcc aggccgcatg attcccactg ttgcagccaa catccacccc 3181 tctgggagtg gccctacccc tcctggcatg ccaccaatgc caggaaacat cttaggaccc 3241 cgggtacccc tgacagcacc taacggcatg tatccccctc caccacagca gcagccaccg 3301 ccaccaccac ctgcagatgg ggtccctccg cctcctgctc ctggcccgcc agcctcagct 3361 gctccttagc ctggaagatg cagggaacct ccacgcccac caccatgagc tggagtgggg 3421 atgacaagac ttgtgttcct caactttctt gggtttcttt caggattttt cttctcacag 3481 ctccaagcac gtgtcccgtg cctccccact cctcttacca cccctctctc tgacactttt 3541 tgtgttgggt cctcagccaa cactcaaggg gaaacctgta gtgacagtgt gccctggtca 3601 tccttaaaat aacctgcatc tcccctgtcc tggtgtggga gtaagctgac agtttctctg 3661 caggtcctgt caactttagc atgctatgtc tttaccattt tctctcttcc agttttttgc 3721 tttgtcttat gcttctatgg ataatgctat ataatcatta tctttttatc tttctgttat 3781 tattgtttta aaggagagca tcctaagtta ataggaacca aaaaataatg atgggcagaa 3841 gggggggaat agccacaggg gacaaacctt aaggcattat aagtgacctt atttctgctt 3901 ttctgagcta agaatggtgc tgatggtaaa gtttgagact tttgccacac acaaatttgt 3961 gaaaattaaa cgagatgtgg aaggagaacc tcagtgattt tattccctag tgaggcctct 4021 gagggcctcc acactgcctg gcagaacata ccactgaact agtatgtgct agaggagggc 4081 acaaacatcc gctccttccc taggcctgct ggctctggtt ttctatgcag atgattcatt 4141 ggattggggg tgagtgtttt gtttttctgg gggcagtgtg agctttgagg gttggaatat 4201 tgggaggcat tccttagttt cctcaactag cctggaaagt taggagtcta gggtaattac 4261 cccccaatga gtctagccta ctattcactg ctttgtgtgc atttttttct ccctctttaa 4321 aaaacccttt aaaagaaaaa aaaaagtaga tagtgctaaa tatttagctc atgaaacttg 4381 gttaggatgg ctgggggtac aagtccccaa actacctctt gttacagtag ccagggagtg 4441 gaatttcgtc aaccggtact tttaaggtta ggatgggacg ggaaaagtga agcaggatat 4501 tagctcctta taccttctcc cttccatttc tgagatctca cattccatct atcacagggt 4561 tttcaaagag atgctgaggg taacaaggaa ctcacttggc agtcagagca tcatgctttg 4621 aggtttgggg tgctcaggct gggagggtag aatgccattc cagaggacaa gccacaaaaa 4681 tgccttaatt tgagctcgta tttacccctg ctgataagtg acttgagagt tcccggtttt 4741 ttcctcttgt ccttccctcc cttctgtcct tccatgtgtg gggaaagggt gtttttggta 4801 gagcttggtt tccaaagcgc ctggctttct cacttcacat tctcaagtgg cagtttcatt 4861 atttagaatg caaggtggac atcttttgga tatctttttc tatatatttc taaagcttta 4921 catatgagag ggtataggga ggtgtttata aaacacttga gaactttttt ccttaatatc 4981 agaaagcaaa aaaataaaac cacaattgag atttgccttt caaaccctca ggtttgcctc 5041 taaccaggtg tccctggtca ccatcagagt actggaatac gggaaccgag gaggaccttg 5101 gtccttttgt ttttgttctg gactcttggg agtggaaatg ggatgagttt atccactgga 5161 gcttaagtcc catgcatttg ctccagaaag // LOCUS HSU66616 4022 bp mRNA PRI 18-SEP-1996 DEFINITION Human SWI/SNF complex 170 KDa subunit (BAF170) mRNA, complete cds. ACCESSION U66616 NID g1549240 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4022) AUTHORS Wang,W., Xue,Y., Zhou,S., Kuo,A., Cairns,B.R. and Crabtree,G.R. TITLE Diversity and specialization of mammalian SWI/SNF complexes JOURNAL Genes Dev. 10 (17), 2117-2130 (1996) MEDLINE 96397413 REFERENCE 2 (bases 1 to 4022) AUTHORS Wang,W., Cote,J., Xue,Y., Zhou,S., Khavari,P.A., Biggar,S.R., Muchardt,C., Kalpana,G.V., Goff,S.P., Yaniv,M., Workman,J.L. and Crabtree,G.R. TITLE Purification and biochemical heterogeneity of the mammalian SWI-SNF complex JOURNAL EMBO J. 15 (1996) In press REFERENCE 3 (bases 1 to 4022) AUTHORS Wang,W., Xue,Y., Zhou,S. and Crabtree,G.R. TITLE Direct Submission JOURNAL Submitted (12-AUG-1996) Howard Hughes Medical Institute, Stanford University, Beckman Center B207, Stanford, CA 94305-5428, USA FEATURES Location/Qualifiers source 1..4022 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="Jurkat T cells" gene 23..3664 /gene="BAF170" CDS 23..3664 /gene="BAF170" /note="similar to human BAF155 and yeast SWI3; contains a region similar to the DNA binding domain of myb and a predicted leucine zipper domain; the C-terminus of the protein is highly proline-rich" /codon_start=1 /product="SWI/SNF complex 170 KDa subunit" /db_xref="PID:g1549241" /translation="MAVRKKDGGPNVKYYEAADTVTQFDNVRLWLGKNYKKYIQAEPP TNKSLSSLVVQLLQFQEEVFGKHVSNAPLTKLPIKCFLDFKAGGSLCHILAAAYKFKS DQGWRRYDFQNPSRMDRNVEMFMTIEKSLVQNNCLSRPNIFLCPEIEPKLLGKLKDII KRHQGTVTEDKNNASHVVYPVPGNLEEEEWVRPVMKRDKQVLLHWGYYPDSYDTWIPA SEIEASVEDAPTPEKPRKVHAKWILDTDTFNEWMNEEDYEVNDDKNPVSRRKKISAKT LTDEVNSPDSDRRDKKGGNYKKRKRSPSPSPTPEVKEEKCKKGPSTPYTKSKRGHREE EQEDLTKDMDEPSPVPNVEEVTLPKTVNTKKDSESAPVKGGTMTDLDEQEDESMETTG KDEDENSTGNKGEQTKNPDLHEDNVTEQTHHIIIPSYAAWFDYNSVHAIERRALPEFF NGKNKSKTPEIYLAYRNFMIDTYRLNPQEYLTSTACRRNLAGDVCAISRVHAFLEQWG LINYQVDAESRPTPMGPPPTSHFHVLADTPSGLVPLQPKTPQQTSASQQMLNFPDKGK EKPTDMQNFGLRTDMYTKKNAPSKSKAAASATREWTEQETLLLLEALEMYKDDWNKVS EHVGSRTQDECILHFLRLPIEDPYLEDSEASLGPLAYQPIPFSQSGNPVMSTVAFLAS VVDPRVASAAAKSALEEFSKMKEEVPTALVEAHVRKVEEAAKVTGKADPAFGLESSGI AGTTSDEPERIEESGNDEARVEGQATDEKKEPKEPREGGGAIEEEAKEKTSEAPKKDE EKGKEGDSEKESEKSDGDPIVDPEKEKEPKEGQEEVLKEVVESEGERKTKVERDIGEG NLSTAAAAALAAAAVKAKHLAAVEERKIKSLVALLVETQMKKLEIKLRHFEELETIMD REREALEYQRQQLLADRQAFHMEQLKYPEMRARQQHFQQMHQQQQQPPPALPPGSQPI PPTGAAGPPAVHGLAVAPASVVPAPAGSGAPPGSLGPSEQIGQAGSTRGPQQQQPAGA PQPGAVPPGVPPPGPHGPSPFPNQQTPPSMMPGAVPGSGHPGVAGNAPLGLPFGMPPP PPPPAPSIIPFGSLADSISINLPAPPNLMGSPPSPVRPGTLPPPNLPVSMANPLHPNL PATTTMPSSLPLGPGLGSAAAQSPAIVAAVQGNLLPSASPLPDPGTPLPPDPTAPSPG TVTPVPPPQ" BASE COUNT 1084 a 1153 c 1051 g 734 t ORIGIN 1 ggaattcccc gagccggaga agatggcggt gcggaagaag gacggcggcc ccaacgtgaa 61 gtactacgag gccgcggaca ccgtgaccca gttcgacaac gtgcggctgt ggctcggcaa 121 gaactacaag aagtatatac aagctgaacc acccaccaac aagtccctgt ctagcctggt 181 tgtacagttg ctacaatttc aggaagaagt ttttggcaaa catgtcagca atgcaccgct 241 cactaaactg ccgatcaaat gtttcctaga tttcaaagcg ggaggctcct tgtgccacat 301 tcttgcagct gcctacaaat tcaagagtga ccagggatgg cggcgttacg atttccagaa 361 tccatcacgc atggaccgca atgtggaaat gtttatgacc attgagaagt ccttggtgca 421 gaataattgc ctgtctcgac ctaacatttt tctgtgccca gaaattgagc ccaaactact 481 agggaaatta aaggacatta tcaagagaca ccagggaaca gtcactgagg ataagaacaa 541 tgcctcccat gttgtgtatc ctgtcccggg gaatctagaa gaagaggaat gggtacgacc 601 agtcatgaag agggataagc aggttcttct gcactggggc tactatcctg acagttacga 661 cacgtggatc ccagcgagtg aaattgaggc atctgtggaa gatgctccaa ctcctgagaa 721 acctaggaag gttcatgcaa agtggatcct ggacaccgac accttcaatg aatggatgaa 781 tgaggaagac tatgaagtaa atgatgacaa aaaccctgtc tcccgccgaa agaagatttc 841 agccaagaca ctgacagatg aggtgaacag cccagattca gatcgacggg acaagaaggg 901 gggaaactat aagaagagga agcgctcccc ctctccttca ccaaccccag aagtcaaaga 961 agaaaaatgc aagaaaggtc cctcaacacc ttacactaag tcaaagcgtg gccacagaga 1021 agaggagcaa gaagacctga ctaaggacat ggacgagccc tcaccagtcc ccaatgtaga 1081 agaggtgaca cttcccaaaa cagtcaacac aaagaaagac tcagagtcgg ccccagtcaa 1141 aggcggcacc atgaccgacc tggatgaaca ggaagatgaa agcatggaga cgacgggcaa 1201 ggatgaggat gagaacagta cggggaacaa gggagagcag accaagaatc cagacctgca 1261 tgaggacaat gtgactgaac agacccacca catcatcatt cccagctacg ctgcctggtt 1321 tgactacaat agtgttcatg ccattgagcg gagggctctc cccgagttct tcaacggcaa 1381 gaacaagtcc aagactccag agatctacct ggcctatcga aactttatga ttgacactta 1441 ccgactgaac ccccaagagt atcttacctc taccgcctgc cgccgaaacc tagcgggtga 1501 tgtctgtgcc atctcgaggg tccatgcctt cctagaacag tggggtctta ttaactacca 1561 ggtggatgct gagagtcgac caaccccaat ggggcctccg cctacctctc acttccatgt 1621 cttggctgac acaccatcag ggctggtgcc tctgcagccc aagacacctc agcagacctc 1681 tgcttcccaa caaatgctca actttcctga caaaggcaaa gagaaaccaa cagacatgca 1741 aaactttggg ctgcgcacag acatgtacac aaaaaagaat gctccctcca agagcaaggc 1801 tgcagccagt gccactcgtg agtggacaga acaggaaacc ctgcttctcc tggaggcact 1861 ggaaatgtac aaagatgact ggaacaaagt gtccgagcat gtgggaagcc gcacacagga 1921 cgagtgcatc ttgcattttc ttcgtcttcc cattgaagac ccatacctgg aggactcaga 1981 ggcctcccta ggccccctgg cctaccaacc catccccttc agtcagtcgg gcaaccctgt 2041 tatgagcact gttgccttcc tggcctctgt cgtcgatccc cgagtcgcct ctgctgctgc 2101 aaagtcagcc ctagaggagt tctccaaaat gaaggaagag gtacccacgg ccttggtgga 2161 ggcccatgtt cgaaaagtgg aagaagcagc caaagtaaca ggcaaggcgg accctgcctt 2221 cggtctggaa agcagtggca ttgctggaac cacctctgat gagcctgagc ggattgagga 2281 gagcgggaat gacgaggctc gggtggaagg ccaggccaca gatgagaaga aggagcccaa 2341 ggaaccccga gaaggagggg gtgctataga ggaggaagca aaagagaaaa ccagcgaggc 2401 tcccaagaag gatgaggaga aagggaaaga aggcgacagt gagaaggagt ccgagaagag 2461 tgatggagac ccaatagtcg atcctgagaa ggagaaggag ccaaaggaag ggcaggagga 2521 agtgctgaag gaagtggtgg agtctgaggg ggaaaggaag acaaaggtgg agcgggacat 2581 tggcgagggc aacctctcca ccgctgctgc cgccgccctg gccgccgccg cagtgaaagc 2641 taagcacttg gctgctgttg aggaaaggaa gatcaaatct ttggtggccc tgctggtgga 2701 gacccagatg aaaaagttgg agatcaaact tcggcacttt gaggagctgg agactatcat 2761 ggaccgggag cgagaagcac tggagtatca gaggcagcag ctcctggccg acagacaagc 2821 cttccacatg gagcagctga agtatccgga gatgagggct cggcagcagc acttccaaca 2881 gatgcaccaa cagcagcagc agccaccacc agccctgccc ccaggctccc agcctatccc 2941 cccaacaggg gctgctgggc cacccgcagt ccatggcttg gctgtggctc cagcctctgt 3001 agtccctgct cctgctggca gtggggcccc tccaggaagt ttgggccctt ctgaacagat 3061 tgggcaggca gggtcaactc gagggccaca gcagcagcaa ccagctggag ccccccagcc 3121 tggggcagtc ccaccagggg ttcccccccc tggaccccat ggcccctcac cgttccccaa 3181 ccaacaaact cctccctcaa tgatgccagg ggcagtgcca ggcagcgggc acccaggcgt 3241 ggcgggtaat gctcctttgg gtttgccttt tggcatgccg cctcctcctc ctcctcctgc 3301 tccatccatc atcccatttg gtagtctagc tgactccatc agtattaacc tccccgctcc 3361 tcctaacctg atgggatcac caccatctcc cgttcgcccc gggactctcc ccccacctaa 3421 cctgcctgtg tccatggcga accctctaca tcctaacctg ccggcgacca ccaccatgcc 3481 atcttccttg cctctcgggc cggggctcgg atccgccgca gcccaaagcc ctgccattgt 3541 ggcagctgtt cagggcaacc tcctgcccag tgccagccca ctgccagacc caggcacccc 3601 cctgcctcca gaccccacag ccccgagccc aggcacggtc acccctgtgc cacctccaca 3661 gtgaggagcc agccagacat ctctccccct caccccctgt ggacatcacg gttccaggaa 3721 cagcccttcc cccaccactg ggaccctccc cagcctggag agttcatcac tacgtaagga 3781 aagctccttc cgcccctcca aagccctcac catgcctaac agaggcatgc attttatatc 3841 agttattcaa ggacttctgt ttaaaagatg tttataatgt ctgggagaga ggataggatg 3901 ggaatgctgc cctaaaggaa gggctggtga aggtgttata caaggttcta ttaaccactt 3961 ctaagggtac acctccctcc aaactactgc attttctatg gattaaaaaa aaaaaggaat 4021 tc // LOCUS HSU66617 2841 bp mRNA PRI 18-SEP-1996 DEFINITION Human SWI/SNF complex 60 KDa subunit (BAF60a) mRNA, alternatively spliced, complete cds. ACCESSION U66617 NID g1549242 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2841) AUTHORS Wang,W., Xue,Y., Zhou,S., Kuo,A., Cairns,B.R. and Crabtree,G.R. TITLE Diversity and specialization of mammalian SWI/SNF complexes JOURNAL Genes Dev. 10 (17), 2117-2130 (1996) MEDLINE 96397413 REFERENCE 2 (bases 1 to 2841) AUTHORS Wang,W., Cote,J., Xue,Y., Zhou,S., Khavari,P.A., Biggar,S.R., Muchardt,C., Kalpana,G.V., Goff,S.P., Yaniv,M., Workman,J.L. and Crabtree,G.R. TITLE Purification and biochemical heterogeneity of the mammalian SWI-SNF complex JOURNAL EMBO J. 15 (1996) In press REFERENCE 3 (bases 1 to 2841) AUTHORS Wang,W., Xue,Y., Zhou,S. and Crabtree,G.R. TITLE Direct Submission JOURNAL Submitted (12-AUG-1996) Howard Hughes Medical Institute, Stanford University, Beckman Center B207, Stanford, CA 94305-5428, USA FEATURES Location/Qualifiers source 1..2841 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="Jurkat T cells" gene 266..1573 /gene="BAF60a" CDS 266..1573 /gene="BAF60a" /note="alternatively spliced product; present within the 2 MD BRG1 complex; ubiquitously expressed; similar to human BAF60a, BAF60b, BAF60c and yeast SWP73 and SWP73b" /codon_start=1 /product="SWI/SNF complex 60 KDa subunit" /db_xref="PID:g1549243" /translation="MGPAPGQGLYRSPMPGAAYPRPGMLPGSRMTPQGPSMGPPGYGG NPSVRPGLAQSGMDQSRKRPAPQQIQQVQQQAVQNRNHNAKKKKMADKILPQRIRELV PESQAYMDLLAFERKLDQTIMRKRLDIQEALKRPIKQKRKLRIFISNTFNPAKSDAED GEGTVASWELRVEGRLLEDSALSKYDATKQKRKFSSFFKSLVIELDKDLYGPDNHLVE WHRTATTQETDGFQVKRPGDVNVRCTVLLMLDYQPPQFKLDPRLARLLGIHTQTRPVI IQALWQYIKTHKLQDPHEREFVICDKYLQQIFETQRMKFSEIPQRLHALLMPPEPIII NHVISVDPNDQKKTACYDIDVEVDDTLKTQMNSFLLSTASQQEIATLDNKTMTDVVGN PEEERRAEFYFQPWAQEAVCRYFYSKVQQRRQELEQALGIRNT" BASE COUNT 621 a 810 c 746 g 664 t ORIGIN 1 gaattccgcc tatcccatag tctcgctgcc ctgagcctcc cgtgccggcc ggccggccgg 61 gggaacaggc gggcgccggg gggcgctcgg ggggcggggg gagttccggt tccggttctt 121 tgtgcggctg catcggcggc tccgggaaga tggcggcccg ggcgggtttc cagtctgtgg 181 ctccaagcgg cggcgccgga gcctcaggag gggcgggcgc ggctgctgcc ttgggcccgg 241 cggaactccg gggcctcctg tgcgaatggg cccggctccg ggtcaagggc tgtaccgctc 301 cccgatgccc ggagcggcct atccgagacc aggtatgttg ccaggcagcc gaatgacacc 361 tcagggacct tccatgggac cccctggcta tggggggaac ccttcagtcc gacctggcct 421 ggcccagtca gggatggatc agtcccgcaa gagacctgcc cctcagcaga tccagcaggt 481 ccagcagcag gcggtccaaa atcgaaacca caatgcaaag aaaaagaaga tggctgacaa 541 aattctacct caaaggattc gtgaactggt accagaatcc caggcctata tggatctctt 601 ggcttttgaa aggaaactgg accagactat catgaggaaa cggctagata tccaagaggc 661 cttgaaacgt cccattaagc aaaaacggaa gctgcgaatt ttcatttcta acactttcaa 721 tccggctaag tcagatgccg aggatgggga agggacggtg gcttcctggg agcttcgggt 781 agaaggacgg ctcctggagg attcagcctt gtccaaatat gatgccacta aacaaaagag 841 gaagttctct tccttcttta agtccttggt gattgaactg gacaaagacc tgtatgggcc 901 agacaaccat ctggtagaat ggcacaggac cgccactacc caggagaccg atggcttcca 961 ggtgaagcgg ccaggagatg tgaatgtacg gtgtactgtc ctactgatgc tggattacca 1021 gcctccccag tttaaattag acccccgcct agctcgactc ctgggcatcc atacccagac 1081 tcgtccagtg atcatccaag cactgtggca atatattaag acacataagc tccaggaccc 1141 tcacgagcgg gagtttgtca tctgtgacaa gtacctgcag cagatctttg agactcaacg 1201 tatgaagttt tcagagatcc ctcagcggct ccatgccttg cttatgccac cagaacctat 1261 catcattaat catgtcatca gtgttgaccc gaatgatcag aaaaagacag cttgttatga 1321 cattgatgtt gaagtggatg acaccttgaa gacccagatg aattcttttc tgctgtccac 1381 tgccagccaa caggagattg ctactctaga caacaagaca atgactgatg tggtgggtaa 1441 cccagaggag gagcgccgag ctgagttcta cttccagccc tgggctcagg aggctgtgtg 1501 ccgatacttc tactccaagg tgcagcagag acgacaagaa ttagagcaag ccctgggaat 1561 ccggaataca tagggcctct cccacagccc tgattcgact gcaccaattc ttgatttggg 1621 ccctgtgctg cctgcctcat agtatctgcc ttggtcttgc ttggggcgtt ccaggggatg 1681 ctgttggttc aaggacaaga ccagaatgaa gagggtctca caagacacct gttatcctct 1741 tctttcaccc tatctcttcc cacccccagc ttccctttgc cccacaaagt tcccatgtgc 1801 ctgtaccctc ccctggtcta cataggacct ctagatagtg ttagagagag aacatgtagt 1861 ggtaatgagt gcttggaatg gattggcctc aggccaggtg gtcttcaagg ggaccagcta 1921 actgatccta cccttcagag acccaggagt tgggtttcgc tccttctcca agactcaggc 1981 ctgtgggcac tctataagct agttgatctt ggctctcctg ataacagaat ccaatttcct 2041 tccttccctc cacaggtttg gaacaaactc tcccttcact tgttgccctg tagcactaca 2101 gaaaccctgg ttcttggctc cactgagccc caggtcagtc cccagcctct gggttggcct 2161 gctgtcagtg cttctctcac tccttagttg gggtccacat cagtattgga gttttgttct 2221 ttattgctcc ctcccagaca ctccctgtgg ctgccctttg tgattccctc agatctgccc 2281 taatcccggg catttgggtg ggggaatctt gcctttccct ttcagagccc cagggatctc 2341 atctggggaa ctgtcattgc cagcagaggc tgttccttcc tgcagtttgg agatgtgact 2401 cattccattc actcactcca ccctgcctct gcatccctta atggagaaac gggcctaaaa 2461 ccaaacgggt aaaaaagccc tgggccatcc ctgtcttcct gtcccttgtc tgcccagttg 2521 acacctactg gtgacttcta gggcactgag gagtgaaagc gcctagggct ggagaatagc 2581 gctgagttgg gtttgtgact cttccctctc cctgcctcac aggattgtga ctccccagcc 2641 cctgccctca aagcttcaga cccctcaggt agcagcagga ccttgtgatc ttggcccctt 2701 ggatctgaga tggtttttgc atctttccag gagagcctca cattcttctt ccaggttgta 2761 tcacccccga gttagcatat cccaggctcg cagactcaac acagcaaggg tgggagacag 2821 ctgggcacaa agggggattc c // LOCUS HSU66618 2041 bp mRNA PRI 18-SEP-1996 DEFINITION Human SWI/SNF complex 60 KDa subunit (BAF60b) mRNA, complete cds. ACCESSION U66618 NID g1549244 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2041) AUTHORS Wang,W., Xue,Y., Zhou,S., Kuo,A., Cairns,B.R. and Crabtree,G.R. TITLE Diversity and specialization of mammalian SWI/SNF complexes JOURNAL Genes Dev. 10 (17), 2117-2130 (1996) MEDLINE 96397413 REFERENCE 2 (bases 1 to 2041) AUTHORS Wang,W., Cote,J., Xue,Y., Zhou,S., Khavari,P.A., Biggar,S.R., Muchardt,C., Kalpana,G.V., Goff,S.P., Yaniv,M., Workman,J.L. and Crabtree,G.R. TITLE Purification and biochemical heterogeneity of the mammalian SWI-SNF complex JOURNAL EMBO J. 15 (1996) In press REFERENCE 3 (bases 1 to 2041) AUTHORS Wang,W., Xue,Y., Zhou,S. and Crabtree,G.R. TITLE Direct Submission JOURNAL Submitted (12-AUG-1996) Howard Hughes Medical Institute, Stanford University, Beckman Center B207, Stanford, CA 94305-5428, USA FEATURES Location/Qualifiers source 1..2041 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="Jurkat T cells" gene 429..1856 /gene="BAF60b" CDS 429..1856 /gene="BAF60b" /note="highly expressed in pancreas; present in a distinct complex from BAF60a; similar to human BAF60a, BAF60b, BAF60c and yeast SWP73 and SWP73b" /codon_start=1 /product="SWI/SNF complex 60 KDa subunit" /db_xref="PID:g1549245" /translation="MLPGPALRGPGPAQYQRPGMSPGNRMPMARLAGGTPCWLPIWCS SSASTWHPTHHDGSIPKTPACAPAQPPMPAQRRGLKRRKMADKVLPQRIRELVPESQA YMDLLAFERKLDQTIARKRMEIQEAIKKPLTQKRKLRIYISNTFSPSKAEGDSAGTAG TPGGTPAGDKVASWELRVEGKLLDDPSKQKRKFSSFFKSLVIELDKELYGPDGHLVEW YWMPTTQETDGFQVKRPGDLNVKCTLLLMLDHQPPQYKLDPRLARLLGVHTQTRAAIM QALWLYIKHNQLQDGHEREYINCNRYFRQIFSCGRLRFSEIPMKLAGLLQHPDPIVIN HVISVDPNDQKKTACYDIDVEVDDPLKAQMSNFLASTTNQQEIASLDVKIHETIESIN QLKTQRDFMLSFSTDPQDFIQEWLRSQRRDLKIITDVTGNPEEERRAAFYHQPWAQEA VGRHIFAKVQQRRQELEQVLGIRLT" BASE COUNT 450 a 605 c 610 g 376 t ORIGIN 1 gaattcccgt tgggcggggc agggagttcg tagccgcctc tgggtaactc gactcgggcg 61 gccaaacctc cggagccggg gacggaaggc gggcccgcag cagatcctgg atccggaatc 121 tcccgggcag gagcggaatc tgtcccgaac cgggtctgtg agggactcgc gaacttggat 181 taggaaatcc cggagcccgg atcgacaaat cccggaaccc ggaataagat cgccaagtcc 241 cggatcgcgg agcacagagc acggagtgga ctcgacgcgg agcccggagt ccggatcgcg 301 gcaccgcggg acgggacgga gcgatgtcgg gccgaggcgc gggcgggttc cccgtgcccc 361 cgctaagccc tggcggcggc gccgtggctg cggccctggg agcgccgcct ccccccgcgg 421 gacccggcat gctgcccgga ccggcgctcc ggggaccggg tccggcgcag taccagcgac 481 ctggcatgtc accagggaac cggatgccca tggctcggct tgcaggtggg accccctgct 541 ggctccccat ttggtgcagc agctccgctt cgacctggca tcccacccac catgatggat 601 ccattccgaa aacgcctgct tgtgccccag cgcagcctcc catgcctgcc cagcgccggg 661 ggttaaagag gaggaagatg gcagataagg ttctacctca gcgaatccgg gagcttgttc 721 cagagtctca ggcgtacatg gatctcttgg cttttgagcg gaagctggac cagaccattg 781 ctcgcaagcg gatggagatc caggaggcca tcaaaaagcc tctgacacaa aagcgaaagc 841 ttcggatcta catttccaat acgttcagtc ccagcaaggc ggaaggcgat agtgcaggaa 901 ctgcagggac ccctggggga accccagcag gggacaaggt ggcttcctgg gaactccgag 961 tggaaggaaa actgctggat gatcctagca aacagaagag gaagttttct tcattcttta 1021 agagcctcgt cattgagctg gacaaggagc tgtacgggcc tgacggtcac ctggtggagt 1081 ggtattggat gcccaccacc caggagacag atggcttcca agtaaaacgg cctggagacc 1141 tcaacgtcaa gtgcaccctc ctgctcatgc tggatcatca gcctccccag tacaaattgg 1201 acccccgatt ggcaaggctg ctgggagtgc acacgcagac gagggccgcc atcatgcagg 1261 ccctgtggct ttacatcaag cacaaccagc tgcaggatgg gcacgagcgg gagtacatca 1321 actgcaaccg ttacttccgc cagatcttca gttgtggccg actccgtttc tccgagattc 1381 ccatgaagct ggcagggttg ctgcagcatc cagaccccat tgtcatcaac catgtcatta 1441 gtgtcgaccc taacgaccag aagaagacag cctgttacga catcgatgtg gaggtggacg 1501 acccactgaa ggcccaaatg agcaattttc tggcctctac caccaatcag caggagatcg 1561 cctcccttga tgtcaagatc catgagacca ttgagtccat caaccagctg aagacccaga 1621 gagatttcat gctcagtttt agcaccgacc cccaggactt catccaggaa tggctccgtt 1681 cccagcgccg agaccttaag atcatcactg atgtgactgg aaatcctgag gaggagagac 1741 gagctgcttt ctaccaccag ccctgggccc aggaagcagt aggcaggcac atctttgcca 1801 aggtgcagca gcgaaggcag gaactggaac aggtgctggg aattcgcctg acctaactgc 1861 tcagggatct ttcttcccag ccctggagcc tggagggaga ccaccctctg ggtccttgct 1921 ggggccgcag acacgtaggc tggggtgagg agtgtctgct gtcaccctct actctccagc 1981 tttagtttta taaatgtagt gataggattc cttgttgctt ggtccccaaa gccttatact 2041 t // LOCUS HSU66619 1724 bp mRNA PRI 18-SEP-1996 DEFINITION Human SWI/SNF complex 60 KDa subunit (BAF60c) mRNA, complete cds. ACCESSION U66619 NID g1549246 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1724) AUTHORS Wang,W., Xue,Y., Zhou,S., Kuo,A., Cairns,B.R. and Crabtree,G.R. TITLE Diversity and specialization of mammalian SWI/SNF complexes JOURNAL Genes Dev. 10 (17), 2117-2130 (1996) MEDLINE 96397413 REFERENCE 2 (bases 1 to 1724) AUTHORS Wang,W., Cote,J., Xue,Y., Zhou,S., Khavari,P.A., Biggar,S.R., Muchardt,C., Kalpana,G.V., Goff,S.P., Yaniv,M., Workman,J.L. and Crabtree,G.R. TITLE Purification and biochemical heterogeneity of the mammalian SWI-SNF complex JOURNAL EMBO J. 15 (1996) In press REFERENCE 3 (bases 1 to 1724) AUTHORS Wang,W., Xue,Y., Zhou,S. and Crabtree,G.R. TITLE Direct Submission JOURNAL Submitted (12-AUG-1996) Howard Hughes Medical Institute, Stanford University, Beckman Center B207, Stanford, CA 94305-5428, USA FEATURES Location/Qualifiers source 1..1724 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="Jurkat T cells" gene 181..1590 /gene="BAF60c" CDS 181..1590 /gene="BAF60c" /note="Unlike BAF60a and BAF60b this gene is highly expressed in muscle cells; similar to human BAF60a, BAF60b, BAF60c and yeast SWP73 and SWP73b" /codon_start=1 /product="SWI/SNF complex 60 KDa subunit" /db_xref="PID:g1549247" /translation="MTPGLQHPPTVVQRPGMPSGARMPHQGRPWAPRAPRTWAAPPCD PAWPPRDGARPQASSAPARQSQAQSQGQPEPTAPARSRSAKRRKMADKILPQRIRELV PESQAYMDLLAFERKLDQTIMRKGVDIQEALKRPMKQKRKLRLYISNTFNPAKSDAED SDGSIASWELRVEGKLLDDPSKQKRKFSSFFKSLVIELDKDLYGPDNHLVEWHRTPTT QETDGFQVKRPGDLSVRCTLLLMLDYQPPQFKLDPRLARLLGLHTQSRSAIVQALWQY VKTNRLQDSHDKEYINGDKYFQQIFDCPRLKFSEIPQRLTALLLPPDPIVINHVISVD PSDQKKTACYDIDVEVEEPLKGQMSSFLLSTANQQEISPLDSKIHETIESINQLKIQR DFMLSFSRDPKGYVQDLLRSQSRDLKVMTDVAGNPEEERRAEFYHQPWSQEAVSRYFY CKIQQRRQELEQSLVVRNT" BASE COUNT 376 a 546 c 510 g 292 t ORIGIN 1 gaattccggc gcaggcgccc gagccgagcg ccgagcaggg agcgggcggc cgcgctccgg 61 gccggggtcc cgggggagca gatcctcaga atggcccttg gtgctgcagg cgcggtgggc 121 tccgggccca ggcaccgagg gggcactgga tgactctcca ggtgcaggac cctgccatct 181 atgactccag gtcttcagca cccacccacc gtggtacagc gccccgggat gccgtctgga 241 gcccggatgc cccaccaggg gcgcccatgg gccccccggg ctccccgtac atgggcagcc 301 ccgccgtgcg acccggcctg gcccccgcgg gatggagccc gcccgcaagc gagcagcgcc 361 cccgcccggc agagccaggc acagagccag ggccagccgg agcccaccgc ccccgcgcgg 421 agccgcagtg ccaagaggag gaagatggct gacaaaatcc tccctcaaag gattcgggag 481 ctggtccccg agtcccaggc ttacatggac ctcttggcat ttgagaggaa actggatcaa 541 accatcatgc ggaagggggt ggacatccag gaggctctga agaggcccat gaagcaaaag 601 cggaagctgc gactctatat ctccaacact tttaaccctg cgaagtctga tgctgaggat 661 tccgacggca gcattgcctc ctgggagcta cgggtggagg ggaagctcct ggatgatccc 721 agcaaacaga agcggaagtt ctcttctttc ttcaagagtt tggtcatcga gctggacaaa 781 gatctttatg gccctgacaa ccacctcgtt gagtggcatc ggacacccac gacccaggag 841 acggacggct tccaggtgaa acggcctggg gacctgagtg tgcgctgcac gctgctcctc 901 atgctggact accagcctcc ccagttcaaa ctggatcccc gcctagcccg gctgctgggg 961 ctgcacacac agagccgctc agccattgtc caggccctgt ggcagtatgt gaagaccaac 1021 aggctgcagg actcccatga caaggaatac atcaatgggg acaagtattt ccagcagatt 1081 tttgattgtc cccggctgaa gttttctgag attccccagc gcctcacagc cctgctattg 1141 ccccctgacc caattgtcat caaccatgtc atcagcgtgg acccttcaga ccagaagaag 1201 acagcgtgct atgacattga cgtggaggtg gaggagccat taaaggggca gatgagcagc 1261 ttcctcctat ccacggccaa ccagcaggag atcagtcctc tggacagtaa gatccatgag 1321 acgattgagt ccataaacca gctcaagatc cagagggact tcatgctaag cttctccaga 1381 gaccccaaag gctatgtcca agacctgctc cgctcccaga gccgggacct caaggtgatg 1441 acagatgtag ccggcaaccc tgaagaggag cgccgggctg agttctacca ccagccctgg 1501 tcccaggagg ccgtcagtcg ctacttctac tgcaagatcc agcagcgcag gcaggagctg 1561 gagcagtcgc tggttgtgcg caacacctag gagcccaaaa acaagcagca cgacggaact 1621 ttcagccgtg tcccgggccc cagcattttg ccccgggctc cagctcactc ctctgccacc 1681 ttggggtgtg gggctggatt aaaagtcatt catctgggaa ttcc // LOCUS HSU66669 1314 bp mRNA PRI 18-OCT-1996 DEFINITION Human 3-hydroxyisobutyryl-coenzyme A hydrolase mRNA, complete cds. ACCESSION U66669 NID g1575572 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1314) AUTHORS Hawes,J.W., Jaskiewicz,J., Shimomura,Y., Huang,B., Bunting,J., Harper,E.T. and Harris,R.A. TITLE Primary structure and tissue-specific expression of human beta-hydroxyisobutyryl-coenzyme A hydrolase JOURNAL J. Biol. Chem. 271 (42), 26430-26434 (1996) MEDLINE 96421653 REFERENCE 2 (bases 1 to 1314) AUTHORS Hawes,J.W. TITLE Direct Submission JOURNAL Submitted (12-AUG-1996) John W. Hawes, Biochemistry and Molecular Biology, Indiana University School of Medicine, 635 Barnhill Dr., Indianapolis, IN 46202-5122, USA FEATURES Location/Qualifiers source 1..1314 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HIBYL-CoA-Hydrolase" /dev_stage="infant" /tissue_type="brain" CDS 64..1209 /codon_start=1 /product="3-hydroxyisobutyryl-coenzyme A hydrolase" /db_xref="PID:g1575573" /translation="MWRLMSRFNAFKRTNTILHHLRMSKHTDAAEEVLLEKKGCAGVI TLNRPKFLNALTLNMIRQIYPQLKKWEQDPETFVIIIKGAGGKAFCAGGDIRVISEAE KGKTEDSSSFLQRRIYLNNAVGSCQKPYVALIHGITMGGGVGLSVHGQFRVATEKCLF AMPETAIGLFPDVGGGYFFATTPRKTWLLPCINGFRLKGRDVYRAGIATHFVDSEKLA MLEEDLLALKSPSKENIASVLENYHTESKIDRDKSFILEEHMDKINSCFSANTVEEII ENLQQDGSSFALEQLKVINKMSPTSLKITLRQLMEGSSKTLQEVLTMEYRLSQACMRG HDFHEGVRAVLIDKDQSPKWKPADLKEVTEEDLNNHFKSLGSSDLKF" BASE COUNT 408 a 226 c 314 g 366 t ORIGIN 1 agtccgggag attctcgctc tgctgcttta gtttcggagt gtttggcgac ggggcagcgc 61 gagatgtgga ggctcatgtc gaggtttaat gcattcaaaa ggactaatac catactgcac 121 catttgagaa tgtccaagca cacagatgca gcagaagagg tgctattgga aaaaaaaggt 181 tgcgcgggag tcataacact aaacagacca aagttcctca atgcactgac tcttaatatg 241 attcggcaga tttatccaca gctaaagaag tgggaacaag atcctgaaac tttcgtgatc 301 attataaagg gagcaggagg aaaggctttc tgtgccgggg gtgatatcag agtgatctcg 361 gaagctgaaa agggcaaaac agaagatagc tccagttttc ttcagagaag aatatatctg 421 aataatgctg ttggttcttg ccagaaacct tatgttgcac ttattcatgg aattacaatg 481 ggtgggggag ttggtctctc agtccatggg caatttcgag tggctacaga aaagtgtctt 541 tttgctatgc cagaaactgc aataggactg ttccctgatg tgggtggagg ttatttcttt 601 gccacgactc caaggaaaac ttggttactt ccttgcatta acggattcag actaaaagga 661 agagatgtgt acagagcagg aattgctaca cactttgtag attctgaaaa gttggccatg 721 ttagaggaag atttgttagc cttgaaatct ccttcaaaag aaaatattgc atctgtctta 781 gaaaattacc atacagagtc taagattgat cgagacaagt cttttatact tgaggaacac 841 atggacaaaa taaacagttg tttttcagcc aatactgtgg aagaaattat tgaaaactta 901 cagcaagatg gttcatcttt tgccctagag caattgaagg taattaataa aatgtctcca 961 acatctctaa agatcacact aaggcaactc atggaggggt cttcaaagac cttgcaagaa 1021 gtactaacta tggagtatcg gctaagtcaa gcttgtatga gaggtcatga ctttcatgaa 1081 ggcgttagag ctgttttaat tgataaagac cagagtccaa aatggaaacc agctgatcta 1141 aaagaagtta ctgaggaaga tttgaataat cactttaagt ctttgggaag cagtgatttg 1201 aaattttgag gtgacatggc ttttaaggta tattttgtag catgggttgg caatctacag 1261 catgtgggcc aaatctcagc cttgctgcct gtttttatat accctgtaag caag // LOCUS HSU66838 1743 bp mRNA PRI 18-MAR-1997 DEFINITION Human cyclin A1 mRNA, complete cds. ACCESSION U66838 NID g1753108 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1743) AUTHORS Yang,R., Morosetti,R. and Koeffler,H.P. TITLE Characterization of a second human cyclin A that is highly expressed in testis and in several leukemic cell lines JOURNAL Cancer Res. 57 (5), 913-920 (1997) MEDLINE 97193609 REFERENCE 2 (bases 1 to 1743) AUTHORS Yang,R., Morosetti,R. and Koeffler,H.P. TITLE Direct Submission JOURNAL Submitted (13-AUG-1996) Hematology/Oncology, Cedars-Sinai Research Institute UCLA School of Medicine, 8700 Beverly Blvd., Los Angeles, CA 90048, USA FEATURES Location/Qualifiers source 1..1743 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="13" /map="13; between WI-3374 and D13S219" CDS 130..1527 /codon_start=1 /product="cyclin A1" /db_xref="PID:g1753109" /translation="METGFPAIMYPGSFIGGWGEEYLSWEGPGLPDFVFQQQPVESEA MHCSNPKSGVVLATVARGPDACQILTRAPLGQDPPQRTVLGLLTANGQYRRTCGQGIT RIRCYSGSENAFPPAGKKALPDCGVQEPPKQGFDIYMDELEQGDRDSCSVREGMAFED VYEVDTGTLKSDLHFLLDFNTVSPMLVDSSLLSQSEDISSLGTDVINVTEYAEEIYQY LREAEIRHRPKAHYMKKQPDITEGMRTILVDWLVEVGEEYKLRAETLYLAVNFLDRFL SCMSVLRGKLQLVGTAAMLLASKYEEIYPPEVDEFVYITDDTYTKRQLLKMEHLLLKV LAFDLTVPTTNQFLLQYLRRQGVCVRTENLAKYVAELSLLEADPFLKYLPSLIAAAAF CLANYTVNKHFWPETLAAFTGYSLSEIVPCLSELHKAYLDIPHRPQQAIREKYKASKY LCVSLMEPPAVLLLQ" BASE COUNT 475 a 399 c 437 g 432 t ORIGIN 1 ggtgttgttc cggacacata gaaagataac gacgggaaga gcggggcccc gtttggggtc 61 caggcaggtt ttggggcctc ctgtctggtg ggaggaggcc gcagcgcacg accctgctcg 121 tcacttggga tggagaccgg ctttcccgca atcatgtacc ctggatcttt tattgggggc 181 tggggagaag agtatctcag ctgggaagga ccggggctcc cagatttcgt cttccagcag 241 cagcccgtgg agtctgaagc aatgcactgc agcaacccca agagtggagt tgtgctggct 301 acagtggccc gaggtcccga tgcttgtcag atactcacca gagccccgct gggccaggat 361 cccccgcaga ggacagtgct agggctgcta actgcaaatg ggcagtacag gaggacctgt 421 ggccagggga tcacaagaat caggtgttat tctggatcag aaaatgcctt ccctccagct 481 ggaaagaaag cactccctga ctgtggggtc caagagcccc ccaagcaagg gtttgacatc 541 tacatggatg aactagagca gggggacaga gacagctgct cggtcagaga ggggatggca 601 tttgaggatg tgtatgaagt agacaccggc acactcaagt cagacctgca cttcctgctg 661 gatttcaaca cagtttcccc tatgctggta gattcatctc tcctctccca gtctgaagat 721 atatccagtc ttggcacaga tgtgataaat gtgactgaat atgctgaaga aatttatcag 781 taccttaggg aagctgaaat aaggcacaga cccaaagcac actacatgaa gaagcagcca 841 gacatcacgg aaggcatgcg cacgattctg gtggactggc tggtggaggt tggggaagaa 901 tataaacttc gagcagagac cctgtatctg gctgtcaact tcctggacag gttcctttca 961 tgtatgtctg ttctgagagg gaaactgcag ctcgtaggaa cagcagctat gcttttggct 1021 tcgaaatatg aagagatata tcctcctgaa gtagacgagt ttgtctatat caccgatgat 1081 acatacacaa aacgacaact gttaaaaatg gaacacttgc ttctgaaagt tctagctttt 1141 gatctgacag taccaaccac caaccagttt ctccttcagt acttgaggcg acaaggagtg 1201 tgcgtcagga ctgagaacct ggctaagtac gtagcagagc tgagtctact tgaagcagat 1261 ccattcttga aatatcttcc ttcactgata gctgcagcag ctttttgcct ggcaaactat 1321 actgtgaaca agcacttttg gccagaaacc cttgctgcat ttacagggta ttcattaagt 1381 gaaattgtgc cttgcctgag tgagcttcat aaagcgtacc ttgatatacc ccatcgacct 1441 cagcaagcaa ttagggagaa gtacaaggct tcaaagtacc tgtgtgtgtc cctcatggag 1501 ccacctgcag ttcttcttct acaataagtt tctgaatgga agcacttcca gaacttcacc 1561 tccatatcag aagtgccaat aatcgtcata ggcttctgca cgttggatca actaatgttg 1621 tttacaatat agatgacatt ttaaaaatgt aaatgaattt agtttccctt agactttagt 1681 agtttgtaat atagtccaac attttttaaa caataaactg cttgtcttat gacaaaaaaa 1741 aaa // LOCUS HSU66871 815 bp mRNA PRI 26-MAR-1997 DEFINITION Human enhancer of rudimentary homolog mRNA, complete cds. ACCESSION U66871 NID g1519518 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 815) AUTHORS Gelsthorpe,M., Pulumati,M., McCallum,C., Dang-Vu,K. and Tsubota,S.I. TITLE The putative cell cycle gene, enhancer of rudimentary, encodes a highly conserved protein found in plants and animals JOURNAL Gene 186 (2), 189-195 (1997) MEDLINE 97228417 REFERENCE 2 (bases 1 to 815) AUTHORS Tsubota,S.I. TITLE Direct Submission JOURNAL Submitted (14-AUG-1996) Biology, Saint Louis University, 3507 Laclede Avenue, St. Louis, MO 63103, USA FEATURES Location/Qualifiers source 1..815 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 72..386 /note="similar to Drosophila melanogaster ER; HSER" /codon_start=1 /product="enhancer of rudimentary homolog" /db_xref="PID:g1519519" /translation="MSHTILLVQPTKRPEGRTYADYESVNECMEGVCKMYEEHLKRMN PNSPSITYDISQLFDFIDDLADLSCLVYRADTQTYQPYNKDWIKEKIYVLLRRQAQQA GK" BASE COUNT 225 a 154 c 186 g 250 t ORIGIN 1 gcggcgttgt agttaagctc gtgtaacggc ggcggtgtcg gcagctgctg tagcgaagag 61 agtttggcgc gatgtctcac accattttgc tggtacagcc taccaagagg ccagaaggca 121 gaacttatgc tgactacgaa tctgtgaatg aatgcatgga aggtgtttgt aaaatgtatg 181 aagaacatct gaaaagaatg aatcccaaca gtccctctat cacatatgac atcagtcagt 241 tgtttgattt catcgatgat ctggcagacc tcagctgcct ggtttaccga gctgataccc 301 agacatacca gccttataac aaagactgga ttaaagagaa gatctacgtg ctccttcgtc 361 ggcaggccca acaggctggg aaataattgt gttggaagca ctgggggggt tggggtgggc 421 ttggaacaca ggtgtgtaca gcgtgctgta gtggaagttt tgtatcatag taatcctgtt 481 tccactttgt tatactctag ccaagattga ctgtattaga tgaaatgtga ggatcttgtt 541 caatcggaaa cccccgttac ctcctctttt tctttctctt tctttttttt ttttttactt 601 aaacattttt atgatgattt agatggaagt tgttcttcgt cacttaatgt tggttccagt 661 ccttcaactg ttcatatcta ctttataaca ttcacatact aacccttctt caagatgggg 721 tggggggtgg aaatgcagtt tagccatgtc ctcaagataa agtcttggta aaaataaata 781 aatgtccttt agttataaaa aaaaaaaaaa aaaaa // LOCUS HSU66895 606 bp mRNA PRI 29-AUG-1996 DEFINITION Human guanylate kinase (gmk) mRNA, complete cds. ACCESSION U66895 NID g1513314 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 606) AUTHORS Brady,W.A., Kokoris,M.S., Fitzgibbon,M. and Black,M.E. TITLE Cloning, characterization, and modeling of mouse and human guanylate kinases JOURNAL J. Biol. Chem. 271 (28), 16734-16740 (1996) MEDLINE 96279248 REFERENCE 2 (bases 1 to 606) AUTHORS Brady,W.A., Kokoris,M.S. and Black,M.E. TITLE Direct Submission JOURNAL Submitted (14-AUG-1996) Darwin Molecular Corp., 1631 220th St. SE, Bothell, WA 98021, USA FEATURES Location/Qualifiers source 1..606 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="peripheral blood lymphocytes" gene 7..600 /gene="gmk" CDS 7..600 /gene="gmk" /EC_number="2.7.4.9" /note="ATP:GMP phosphotransferase; This sequence corresponds to that described in Accession Number A11042" /codon_start=1 /product="guanylate kinase" /db_xref="PID:g1513315" /translation="MSGPRPVVLSGPSGAGKSTLLKRLLQEHSGIFGFSVSHTTRNPR PGEENGKDYYFVTREVMQRDIAAGDFIEHAEFSGNLYGTSKVAVQAVQAMNRICVLDV DLQGVRNIKATDLRPIYISVQPPSLHVLEQRLRQRNTETEESLVKRLAAAQADMESSK EPGLFDVVIINDSLDQAYAELKEALSEEIKKAQRTGA" BASE COUNT 132 a 170 c 205 g 99 t ORIGIN 1 ggatccatgt cgggccccag gcctgtggtg ctgagcgggc cttcgggagc tgggaagagc 61 accctgctga agaggctgct ccaggagcac agcggcatct ttggcttcag cgtgtcccat 121 accacgagga acccgaggcc cggcgaggag aacggcaaag attactactt tgtaaccagg 181 gaggtgatgc agcgtgacat agcagccggc gacttcatcg agcatgccga gttctcgggg 241 aacctgtatg gcacgagcaa ggtggcggtg caggccgtgc aggccatgaa ccgcatctgt 301 gtgctggacg tggacctgca gggtgtgcgg aacatcaagg ccaccgatct gcggcccatc 361 tacatctctg tgcagccgcc ttcactgcac gtgctggagc agcggctgcg gcagcgcaac 421 actgaaaccg aggagagcct ggtgaagcgg ctggctgctg cccaggccga catggagagc 481 agcaaggagc ccggcctgtt tgatgtggtc atcattaacg acagcctgga ccaggcctac 541 gcagagctga aggaggcgct ctctgaggaa atcaagaaag ctcaaaggac cggcgcctga 601 ggatcc // LOCUS HSU67191 4009 bp mRNA PRI 14-MAR-1997 DEFINITION Human multiple exostosis-like protein (EXTL) mRNA, complete cds. ACCESSION U67191 NID g1524412 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4009) AUTHORS Wise,C.A., Clines,G.A., Massa,H., Trask,B.J. and Lovett,M. TITLE Identification and localization of the gene for EXTL, a third member of the multiple exostoses gene family JOURNAL Genome Res. 7 (1), 10-16 (1997) MEDLINE 97189339 REFERENCE 2 (bases 1 to 4009) AUTHORS Wise,C.A., Clines,G.A., Massa,H., Trask,B.J. and Lovett,M. TITLE Direct Submission JOURNAL Submitted (16-AUG-1996) Biochemistry, The University of Texas Southwestern Medical Center at Dallas, 6000 Harry Hines Blvd., Dallas, TX 75253-8591, USA FEATURES Location/Qualifiers source 1..4009 /organism="Homo sapiens" /db_xref="taxon:9606" gene 867..2897 /gene="EXTL" CDS 867..2897 /gene="EXTL" /note="multiple exostosis-like gene" /codon_start=1 /product="multiple exostosis-like protein" /db_xref="PID:g1524413" /translation="MQSWRRRKSLWLALSASWLLLVLLGGFSLLRLALPPRPRPGASQ GWPRWLDAELLQSFSQPGELPEDAVSPPQAPHGGSCNWESCFDTSKCRGDGLKVFVYP AVGTISETHRRILASIEGSRFYTFSPAGACLLLLLSLDAQTGECSSMPLQWNRGRNHL VLRLHPAPCPRTFQLGQAMVAEASPTVDSFRPGFDVALPFLPEAHPLRGGAPGQLRQH SPQPGVALLALEEERGGWRTADTGSSACPWDGRCEQDPGPGQTQRQETLPNATFCLIS GHRPEAASRFLQALQAGCIPVLLSPRWELPFSEVIDWTKAAIVADERLPLQVLAALQE MSPARVLALRQQTQFLWDAYFSSVEKVIHTTLEVIQDRIFGTSANPSLLWNSPPGALL ALSTFSTSPQDFPFYYLQQGSRPEGRFSALIWVGPPGQPPLKLIQAVAGSQHCAQILV LWSNERPLPSRWPETAVPLTVIDGHRKVSDRFYPYSTIRTDAILSLDARSSLSTSEVD FAFLVWQSFPERMVGFLTSSHFWDEAHGGWGYTAERTNEFSMVLTTAAFYHRYYHTLF THSLPKALRTLADEAPTCVDVLMNFIVAAVTKLPPIKVPYGKQRQEAAPLAPGGPGPR PKPPAPAPDCINQIAAAFGHMPLLSSRLRLDPVLFKDPVSVQRKKYRSLEKP" BASE COUNT 723 a 1315 c 1215 g 756 t ORIGIN 1 ccgactggaa agggatggga gtgataccat agggtttggg gggcttggca caacggccct 61 ggcctctgca ctgtgatagc tccaagccaa atggtgcaag gcagctgctg ggctgcgggg 121 aagaggggct gcacttcctg agctggtgca gcggcagagt ggataaggat caggaccccg 181 agccctcctg gccctccaag gcggggggac accggctgca cccaggctgt gacctcagct 241 gacgccgatg atccacatgg gatgcagggt ctaattttag ctccagccac aggggccaca 301 gcccagaagc gtctgccagc gggcacaggg gcagccaggg ctgctcaggc aaggggagga 361 gtgggaagac cagcccagct ccctccagcc tgtccctggc caagccgcct cctgtgggag 421 ctctgactgg tccccatggc ctggcagaca gccctcctct cagttgaggg caaggtcaag 481 gggtgcagcc aggagggcag ggggacacgg ccctgcattc tggacagggg ttgcgtcagc 541 cagagcagtg cccaggggca ggggtccctg ctgggaggga aaaggctggc ttggttgtcc 601 aaaggccgag aaggcagagt cctgagagca ggggggccag gccagcaagc tgggtcccac 661 ctggcctcct cctgcctggc tggtgactca ctatctgacc ttagacaggc ggcctggtct 721 cgatgggcct cagtcttccc atctgtacaa tgacagcaca ggactagcag gtcggtccca 781 gctctggtct cccgccccag gccctgctct tcctgcttgc tggcagaggc ctcccagctt 841 ccctagccct gactgtgggt ggccacatgc agtcgtggag gagaagaaag tccctgtggc 901 tggcactgtc agcctcctgg ctcctgcttg tcctgctggg aggcttctcc cttctccgcc 961 tggcgttgcc tcccagacct cggcccgggg cttcccaagg ctggccccgc tggctggatg 1021 cagagctcct gcagagcttc tcccagcctg gagagctccc agaagatgcc gtttcacctc 1081 ctcaagcccc tcatggtggc agctgcaact gggaatcttg ctttgatacc tcaaagtgca 1141 ggggcgatgg ccttaaggta ttcgtgtacc cagcggttgg aaccatctct gagactcatc 1201 gcaggatcct ggcttccatt gagggctctc gcttctacac attcagccct gctggggcct 1261 gcctcctcct cctcctcagc ctggacgccc agactggaga gtgcagctca atgcctctgc 1321 aatggaacag gggcaggaac catctggtcc tccgtctcca cccggctccc tgccccagga 1381 ccttccagct gggacaggct atggtggctg aggccagccc cacggtggac tccttccggc 1441 ccggctttga tgtggccctc ccttttctcc ctgaagccca cccgttgcga ggtggggctc 1501 ctggccagct gcggcaacac agcccccagc ccggggtagc cctgctagcc ctggaagagg 1561 agaggggtgg gtggcgcaca gcagacactg gctcctctgc ctgcccctgg gatgggcgct 1621 gtgagcaaga ccctggacct gggcagaccc agcgccagga gacgctgccc aatgccacct 1681 tctgcctcat ctctggccac cgtcccgagg ctgcctcgcg cttcctccaa gccctgcagg 1741 ccggctgcat cccagtgctt ctcagccccc gctgggagct gcccttctcc gaggtcatcg 1801 actggaccaa ggcagccatc gtagctgatg agaggctccc acttcaggtc ctggctgccc 1861 tccaggagat gtcccctgca cgggtcctcg ccctgcgtca gcagacccag tttctatggg 1921 atgcctactt ctcctcagtg gagaaggtca tccataccac tctggaggtt attcaggacc 1981 ggatttttgg aacatcagct aacccctcac tgctgtggaa cagcccccca ggggcactcc 2041 tggccctgtc tactttttcc acaagccccc aggacttccc cttctactac ctgcaacagg 2101 gctcccgccc tgagggcaga ttcagcgccc tgatctgggt ggggccccca ggccagcccc 2161 ctctgaagct catccaggcg gtggcaggct cccagcactg tgcccagatc ttggttctct 2221 ggagcaatga gaggccactc ccatccaggt ggccggagac agctgtgccc ttgacagtca 2281 ttgatgggca caggaaggtt agtgatcgct tctacccata tagcaccatc agaacagatg 2341 ccatcctcag cctcgatgcc cgcagcagtc tttccacaag tgaggtggac tttgcctttc 2401 tggtgtggca gagcttccca gagcggatgg tgggcttcct gacgtcgagc catttctggg 2461 acgaggccca tggtggctgg ggctacactg ctgagaggac caacgaattc tccatggttc 2521 tcaccacagc cgccttctac cataggtatt accacactct cttcacccac tccctgccca 2581 aggctctgag gaccctggca gatgaggcac ccacctgtgt ggacgtcctg atgaatttca 2641 tagtagcagc agtcaccaag ctgcccccta tcaaggtgcc ctatggcaag cagcgccagg 2701 aggctgctcc actggcgcct gggggcccgg ggcccaggcc aaagccgcct gccccagccc 2761 ccgactgcat caaccagata gcggcagcgt tcggccacat gcccttgctg tcctctcgtc 2821 tgcgtctgga cccggtgctg tttaaggacc cggtgtccgt gcagcgcaag aagtaccgca 2881 gcctggagaa gccctagggg ggcgacccgc ggagacccca tcaaaggtct cagcccagct 2941 cccagggggc ccggcgcctg ccggcgggct ccgctcttgg gacaccggta aaacctatca 3001 tgtcagccag cgggcccaca cgtcggaccc cggttggcca atcacaacag gggggcgtgg 3061 ccttaccttc tcctgctcgc cctcagccgc ggagcctctg cggaggctga gccccgcgac 3121 cggagcgccg ctctccgctt ctccacccag ctaacttctg ctcgtctttc agagcattgt 3181 ctcctccagg gacttgcctg gccctcctcc ccctccgccc ccaatgggtt cggtctcctt 3241 tgcgctttcg cagcccagtg tttgccgcgg ttcccgcacg gtcaggcccc aggactggct 3301 tgcgagcccc ggggggcggg tcgtgtcgtg tccttttctc tctggctagg caggtgtgcc 3361 gcaaatattt gacgaataga tggatggcag acattaactg cgtgcctggc cctatgaggg 3421 cggcggcggg acagggtgag cggagagaca gaactgtccc cgccccaccc cgccacccat 3481 tcctagggtt cctagtcccg ggaaggcaac tacccaaaga cgcctgggtt gtcaggctca 3541 atccttgagc ctcaaccaag gagcagaggg gtgcgggtaa acgaggagtg gggcgaggaa 3601 agcgggatgg gaacaagtgg cctgtccccg cacggcccgc ggctcagcct cggccgtctg 3661 ccctgcacta ccaaggccga ccactagagg tcagcgtcgc gccgcactct tcacccgcat 3721 gcccagaggg cggccgggat gtggagggtg aacgctgtgg gcagctcggg ggctgaaagc 3781 tgggaagaga aatctgaaga acagccccat ctctttcgcc tgcacacttg gaactcagca 3841 agaagggact gtttctaggt gggctgggcg cgagcaggga ctgtcccgag ctggtatagg 3901 tgagaatgga gttggggggg gacactaggt ctccggggtt cacccacctg gagctggaat 3961 tatcacttcc gaaataaagc gcgtgtcctt gcaaaaaaaa aaaaaaaaa // LOCUS HSU67280 1035 bp mRNA PRI 27-JAN-1998 DEFINITION Homo sapiens calumenin mRNA, complete cds. ACCESSION U67280 NID g2809323 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1035) AUTHORS Liu,X., Rasmussen,H.H., Celis,J.E. and Honore,B. TITLE mRNA encoding human multiple EF-hand protein JOURNAL Unpublished REFERENCE 2 (bases 1 to 1035) AUTHORS Liu,X., Rasmussen,H.H., Celis,J.E. and Honore,B. TITLE Direct Submission JOURNAL Submitted (20-AUG-1996) Dept. of Med. Biochem., University of Aarhus, Ole Worms Alle, Bldg. 170, DK-8000 Aarhus C, Denmark FEATURES Location/Qualifiers source 1..1035 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="keratinocytes" /clone="9268.8B" CDS 63..1010 /note="multiple EF-hand protein" /codon_start=1 /product="calumenin" /db_xref="PID:g2809324" /translation="MDLRQFLMCLSLCTAFALSKPTEKKDRVHHEPQLSDKVHNDAQS FDYDHDAFLGAEEAKTFDQLTPEESKERLGKIVSKIDGDKDGFVTVDELKDWIKFAQK RWIYEDVERQWKGHDLNEDGLVSWEEYKNATYGYVLDDPDPDDGFNYKQMMVRDERRF KMADKDGDLIATKEEFTAFLHPEEYDYMKDIVVQETMEDIDKNADGLIDLEEYIGDMY SHDGNTDEPEWVKTEREQFVEFRDKNRDGKMDKEETKDWILPSDYDHAEAEARHLVYE SDQNKDGKLTKEEIVDKYDLFVGSQATDFGEALVRHDEF" BASE COUNT 314 a 193 c 294 g 234 t ORIGIN 1 gtgggtgagc ggcggccacg gcatcctgtg ctgtgggggc tacgaggaaa gatctaatta 61 tcatggacct gcgacagttt cttatgtgcc tgtccctgtg cacagccttt gccttgagca 121 aacccacaga aaagaaggac cgtgtacatc atgagcctca gctcagtgac aaggttcaca 181 atgatgctca gagttttgat tatgaccatg atgccttctt gggtgctgaa gaagcaaaga 241 cctttgatca gctgacacca gaagagagca aggaaaggct tggaaagatt gtaagtaaaa 301 tagatggcga caaggacggg tttgtcactg tggatgagct caaagactgg attaaatttg 361 cacaaaagcg ctggatttac gaggatgtag agcgacagtg gaaggggcat gacctcaatg 421 aggacggcct cgtttcctgg gaggagtata aaaatgccac ctacggctac gttttagatg 481 atccagatcc tgatgatgga tttaactata aacagatgat ggttagagat gagcggaggt 541 ttaaaatggc agacaaggat ggagacctca ttgccaccaa ggaggagttc acagctttcc 601 tgcaccctga ggagtatgac tacatgaaag atatagtagt acaggaaaca atggaagata 661 tagataagaa tgctgatggt ctcattgatc tagaagagta tattggtgac atgtacagcc 721 atgatgggaa tactgatgag ccagaatggg taaagacaga gcgagagcag tttgttgagt 781 ttcgggataa gaaccgtgat gggaagatgg acaaggaaga gaccaaagac tggatccttc 841 cctcagacta tgatcatgca gaggcagaag ccaggcacct ggtctatgaa tcagaccaaa 901 acaaggatgg caagcttacc aaggaggaga tcgttgacaa gtatgactta tttgttggca 961 gccaggccac agattttggg gaggccttag tacggcatga tgagttctga gctacggagg 1021 aaccctcatt tcctc // LOCUS HSU67369 2809 bp mRNA PRI 03-DEC-1996 DEFINITION Human growth factor independence-1 (Gfi-1) mRNA, complete cds. ACCESSION U67369 NID g1698691 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2809) AUTHORS Roberts,T.P. TITLE Complete sequence of the human GFI-1 gene JOURNAL Oncogene (1997) In press REFERENCE 2 (bases 1 to 2809) AUTHORS Roberts,T.P. TITLE Direct Submission JOURNAL Submitted (20-AUG-1996) Neurosciences NC3-150, Cleveland Clinic Foundation, 9500 Euclid Avenue, Cleveland, OH 44195, USA FEATURES Location/Qualifiers source 1..2809 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1p22" /tissue_type="bone marrow" gene 268..1536 /gene="Gfi-1" CDS 268..1536 /gene="Gfi-1" /note="transcription factor" /codon_start=1 /product="growth factor independence-1" /db_xref="PID:g1698692" /translation="MPRSFLVKSKKAHSYHQPRSPGPDYSLRLENVPAPSRADSTSNA GGAKAEPRDRLSPESQLTEAPDRASASPRQLRSSVCERSSEFEDFWRPPSPSASPASE KSMCPSLDEAQPFPLPFKPYSWSGLAGSDLRHLVQSYRPCGALERGAGLGLFCEPAPE PGHPAALYGPKRAAGGAGAGAPGNCIAGAGATAGPGLGLYGDFGSAAAGLYEKPTAAA GLLYPERGHRLHANKGAGVKVESELLCTRLLLGGGSYKCIKCSKVFSTPHGLEVHVRR SHSGTRPFACEMCGKTFGHAVSLEQHKAVHSQERSFDCKICGKSFKRSSTLSTHLLIH SDTRPYPCQYCGKRFHQKSDMKKHTFIHTGEKLHKCQVCGKAFSQSSNLITHSRKHTG FKPFGCDLCGKGFQRKVDLRRHRETQHGLK" misc_feature 1036..1101 /gene="Gfi-1" /note="encodes zinc finger motif" misc_feature 1122..1185 /gene="Gfi-1" /note="encodes zinc finger motif" misc_feature 1207..1269 /gene="Gfi-1" /note="encodes zinc finger motif" misc_feature 1290..1354 /gene="Gfi-1" /note="enc1288odes zinc finger motif" misc_feature 1374..1438 /gene="Gfi-1" /note="encodes zinc finger motif" misc_feature 1458..1524 /gene="Gfi-1" /note="encodes zinc finger motif" BASE COUNT 652 a 796 c 760 g 601 t ORIGIN 1 agattcgcgg ccgcgtcgac ggtgcgccca ccggtcccgc cgggcgcccg cgggacgcgc 61 cgccagggcc ctctccgccg ggggctcggc gctcgcccac ctcttccaaa tttaaccatt 121 acctaaatcc gaagggaaat gagcaaacct ctcggattgg gtgtcaaggt ctcctccggg 181 ctggggctga gcaagccctc ggagtgaccg tgggtgacag cggctccagg gactcttggg 241 gcgcagtggg gaaagtgccg gaccaccatg ccgcgctcat ttctcgtcaa aagcaagaag 301 gctcacagct accaccagcc gcgctcccca ggaccagact attccctccg tttagagaat 361 gtaccggcgc ctagccgagc agacagcact tcaaatgcag gcggggcgaa agcggagccc 421 cgggaccgtt tgtcccccga atcgcagctg accgaagccc cagacagagc ctccgcatcc 481 cccagacagc tgcgaagcag cgtctgcgaa cggagctcgg agtttgagga cttctggagg 541 cccccgtcac cctccgcgtc tccagcctcg gagaagtcaa tgtgcccatc gctggacgaa 601 gcccagccct tccccctgcc tttcaaaccg tactcatgga gcggcctggc gggttctgac 661 ctgcggcacc tggtgcagag ctaccgaccg tgtggggccc tggagcgtgg cgctggcctg 721 ggcctcttct gtgaacccgc cccggagcct ggccacccgg ccgcgctgta cggcccgaag 781 cgggctgccg gcggcgcggg ggccggggcg ccagggaact gcatcgcagg ggccggtgcc 841 accgctggcc ctggcctagg gctctacggc gacttcgggt ctgcggcagc cgggctgtat 901 gaaaagccca cggcagcggc gggcttgctg taccccgagc gtggccaccg gctgcacgca 961 aacaaaggcg ctggcgtcaa ggtggagtcg gagctgctgt gcacccgcct gctgctgggc 1021 ggcggctcct acaagtgcat caagtgcagc aaggtgtttt ccacgccgca cgggctcgag 1081 gtgcacgtgc gcaggtccca cagcggtacc agaccctttg cctgcgagat gtgcggcaag 1141 accttcgggc acgcggtgag cctggagcag cacaaagccg tgcactcgca ggaacggagc 1201 tttgactgta agatctgtgg gaagagcttc aagaggtcat ccacactgtc cacacacctg 1261 cttatccact cagacactcg gccctacccc tgtcagtact gtggcaagag gttccaccag 1321 aagtcagaca tgaagaaaca cactttcatc cacactggtg agaagcttca caagtgccag 1381 gtgtgcggca aggcattcag ccagagctcc aacctcatca cccacagccg caaacacaca 1441 ggcttcaagc ccttcggctg cgacctctgt gggaagggtt tccagaggaa ggtggacctc 1501 cgaaggcacc gggagacgca gcatgggctc aaatgagcac cctggctggc tgcaagcagc 1561 agctacacaa cactacagag ggcagcctcc ctgcttgcca ccactctgct ccctgcttgc 1621 ctccactccc ttctgacttt ccagacccca ggtccagtct gcagatccta ccaggttgct 1681 cctccttcgc cttacctcct ggagctgcca gaagaaatga ggtacctttt caaagtgcag 1741 ccgagagtga gaaccaagtg actccctagg cttcggacac aaataggctc ctctacacct 1801 gaagacaaag gcaaagtcaa atggggacca gaataaatct tagaccccac agtccttccc 1861 atttccagcc ttaatctaca gacaggaatg cccttcaggt ttcttccttc ccccttcttg 1921 acctacccca gatatttgtg tggaagagga ggaatcacca tttacaaggt ggacaaatgc 1981 taatattttt atctagaaag aagagtgagt gttaactttt atttttttcc ttctgggggg 2041 tctgttgact cctttctttt gggtgctgcc tataaatctt ggaggaatca tttctcctcc 2101 tcaaaaactg attcagaaac tgacttgggg aaggaattta atactttgaa gtcatgagat 2161 gcaccatcga ggctaccccc aagaagaagc agaagagaag ttggtaatga gaggggatta 2221 gaggtcctcc cttcagtagg gctgtgaaaa cctcatcact ggaggtaaaa gcacaagcaa 2281 tgcctgtgga caagatgtca ttcattcact cagcaaatgt tcatggatca ccggctacca 2341 agtaccaggc accatgttag gtattgggga agagagactg aagtcacaac ccctgactgc 2401 tcctcaaaag ctaacggttg cacctccaag tggctgggtc tgttcttact cttggaggga 2461 attctgagaa gacagcacag aattgtaaac cttccctttt gacccttttg gattttatca 2521 ggtgtaaaca aaaagctgaa cagttacttc aaagatatgt gtgtatattc agttttttat 2581 tgttaagctg atattttaaa gatttctgag ctagcaggca tgtgggaagg aaggctctgt 2641 cttcaactct ttgaccctcc atgtgtacca tagagggggg aaaggtggta ttttcacttt 2701 gatgaggttg gtaaatgttt ttagatcttc tggtaagcat tatgtttgtt aatacatatt 2761 tattagagtg atgttttaag ttaataaagt attaagagta aaaaaaaaa // LOCUS HSU67615 13449 bp mRNA PRI 25-JAN-1997 DEFINITION Human beige protein homolog (chs) mRNA, complete cds. ACCESSION U67615 NID g1685033 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 13449) AUTHORS Nagle,D.L., Karim,M.A., Woolf,E.A., Holmgren,L., Bork,P., Misumi,D.J., McGrail,S.H., Dussault,B.J. Jr., Perou,C.M., Boissy,R.E., Duyk,G.M., Spritz,R.A. and Moore,K.J. TITLE Identification and mutation analysis of the complete gene for Chediak-Higashi syndrome JOURNAL Nature Genet. 14 (3), 307-311 (1996) MEDLINE 97051925 REFERENCE 2 (bases 1 to 13449) AUTHORS Nagle,D.L., Woolf,E.A., Misumi,D.J., McGrail,S.H., Holmgren,L., Dussault,B.J. Jr. and Moore,K.J. TITLE Direct Submission JOURNAL Submitted (21-AUG-1996) Millennium Pharmaceuticals, 640 Memorial Drive, Cambridge, MA 02139, USA FEATURES Location/Qualifiers source 1..13449 /organism="Homo sapiens" /db_xref="taxon:9606" gene 190..11595 /gene="chs" CDS 190..11595 /gene="chs" /note="CHS; Chediak-Higashi Syndrome; similar to Mus musculus beige protein encoded by Genbank Accession Number U52461" /codon_start=1 /product="beige protein homolog" /db_xref="PID:g1685034" /translation="MSTDSNSLAREFLTDVNRLCNAVVQRVEAREEEEEETHMATLGQ YLVHGRGFLLLTKLNSIIDQALTCREELLTLLLSLLPLVWKIPVQEEKATDFNLPLSA DIILTKEKNSSSQRSTQEKLHLEGSALSSQVSAKVNVFRKSRRQRKITHRYSVRDARK TQLSTSDSEANSDEKGIAMNKHRRPHLLHHFLTSFPKQDHPKAKLDRLATKEQTPPDA MALENSREIIPRQGSNTDILSEPAALSVISNMNNSPFDLCHVLLSLLEKVCKFDVTLN HNSPLAASVVPTLTEFLAGFGDCCSLSDNLESRVVSAGWTEEPVALIQRMLFRTVLHL LSVDVSTAEMMPENLRKNLTELLRAALKIRICLEKQPDPFAPRQKKTLQEVQEDFVFS KYRHRALLLPELLEGVLQILICCLQSAASNPFYFSQAMDLVQEFIQHHGFNLFETAVL QMEWLVLRDGVPPEASEHLKALINSVMKIMSTVKKVKSEQLHHSMCTRKRHRRCEYSH FMHHHRDLSGLLVSAFKNQVSKNPFEETADGDVYYPERCCCIAVCAHQCLRLLQQASL SSTCVQILSGVHNIGICCCMDPKSVIIPLLHAFKLPALKNFQQHILNILNKLILDQLG GAEISPKIKKAACNICTVDSDQLAQLEETLQGNLCDAELSSSLSSPSYRFQGILPSSG SEDLLWKWDALKAYQNFVFEEDRLHSIQIANHICNLIQKGNIVVQWKLYNYIFNPVLQ RGVELAHHCQHLSVTSAQSHVCSHHNQCLPQDVLQIYVKTLPILLKSRVIRDLFLSCN GVSQIIELNCLNGIRSHSLKAFETLIISLGEQQKDASVPDIDGIDIEQKELSSVHVGT SFHHQQAYSDSPQSLSKFYAGLKEAYPKRRKTVNQDVHINTINLFLCVAFLCVSKEAE SDRESANDSEDTSGYDSTASEPLSHMLPCISLESLVLPSPEHMHQAADIWSMCRWIYM LSSVFQKQFYRLGGFRVCHKLIFMIIQKLFRSHKEEQGKKEGDTSVNENQDLNRISQP KRTMKEDLLSLAIKSDPIPSELGSLKKSADSLGKLELQHISSINVEEVSATEAAPEEA KLFTSQESETSLQSIRLLEALLAICLHGARTSQQKMELELPNQNLSVESILFEMRDHL SQSKVIETQLAKPLFDALLRVALGNYSADFEHNDAMTEKSHQSAEELSSQPGDFSEEA EDSQCCSFKLLVEEEGYEADSESNPEDGETQDDGVDLKSETEGFSASSSPNDLLENLT QGEIIYPEICMLELNLLSASKAKLDVLAHVFESFLKIIRQKEKNVFLLMQQGTVKNLL GGFLSILTQDDSDFQACQRVLVDLLVSLMSSRTCSEELTLLLRIFLEKSPCTKILLLG ILKIIESDTTMSPSQYLTFPLLHAPNLSNGVSSQKYPGILNSKAMGLLRRARVSRSKK EADRESFPHRLLSSWHIAPVHLPLLGQNCWPHLSEGFSVSLWFNVECIHEAESTTEKG KKIKKRNKSLILPDSSFDGTESDRPEGAEYINPGERLIEEGCIHIISLGSKALMIQVW ADPHNATLIFRVCMDSNDDMKAVLLAQVESQENIFLPSKWQHLVLTYLQQPQGKRRIH GKISIWVSGQRKPDVTLDFMLPRKTSLSSDSNKTFCMIGHCLSSQEEFLQLAGKWDLG NLLLFNGAKVGSQEAFYLYACGPNHTSVMPCKYGKPVNDYSKYINKEILRCEQIRELF MTKKDVDIGLLIESLSVVYTTYCPAQYTIYEPVIRLKGQMKTQLSQRPFSSKEVQSIL LEPHHLKNLQPTEYKTIQGILHEIGGTGIFVFLFARVVELSSCEETQALALRVILSLI KYNQQRVHELENCNGLSMIHQVLIKQKCIVGFYILKTLLEGCCGEDIIYMNENGEFKL DVDSNAIIQDVKLLEELLLDWKIWSKAEQGVWETLLAALEVLIRADHHQQMFNIKQLL KAQVVHHFLLTCQVLQEYKEGQLTPMPREVCRSFVKIIAEVLGSPPDLELLTIIFNFL LAVHPPTNTYVCHNPTNFYFSLHIDGKIFQEKVRSIMYLRHSSSGGRSLMSPGFMVIS PSGFTASPYEGENSSNIIPQQMAAHMLRSRSLPAFPTSSLLTQSQKLTGSLGCSIDRL QNIADTYVATQSKKQNSLGSSDTLKKGKEDAFISSCESAKTVCEMEAVLSAQVSVSDV PKGVLGFPVVKADHKQLGAEPRSEDDSPGDESCPRRPDYLKGLASFQRSHSTIASLGL AFPSQNGSAAVGRWPSLVDRNTDDWENFAYSLGYEPNYNRTASAHSVTEDCLVPICCG LYELLSGVLLILPDVLLEDVMDKLIQADTLLVLVNHPSPAIQQGVIKLLDAYFARASK EQKDKFLKNRGFSLLANQLYLHRGTQELLECFIEMFFGRHIGLDEEFDLEDVRNMGLF QKWSVIPILGLIETSLYDNILLHNALLLLLQILNSCSKVADMLLDNGLLYVLCNTVAA LNGLEKNIPMSEYKLLACDIQQLFIAVTIHACSSSGSQYFRVIEDLIVMLGYLQNSKN KRTQNMAVALQLRVLQAAMEFIRTTANHDSENLTDSLQSPSAPHHAVVQKRKSIAGPR KFPLAQTESLLMKMRSVANDELHVMMQRRMSQENPSQATETELAQRLQRLTVLAVNRI IYQEFNSDIIDILRTPENVTQSKTSVFQTEISEENIHHEQSSVFNPFQKEIFTYLVEG FKVSIGSSKASGSKQQWTKILWSCKETFRMQLGRLLVHILSPAHAAQERKQIFEIVHE PNHQEILRDCLSPSLQHGAKLVLYLSELIHNHQGELTEEELGTAELLMNALKLCGHKC IPPSASTKADLIKMIKEEQKKYETEEGVNKAAWQKTVNNNQQSLFQRLDSKSKDISKI AADITQAVSLSQGNERKKVIQHIRGMYKVDLSASRHWQELIQQLTHDRAVWYDPIYYP TSWQLDPTEGPNRERRRLQRCYLTIPNKYLLRDRQKSEDVVKPPLSYLFEDKTHSSFS STVKDKAASESIRVNRRCISVAPSRETAGELLLGKCGMYFVEDNASDTVESSSLQGEL EPASFSWTYEEIKEVHKRWWQLRDNAVEIFLTNGRTLLLAFDNTKVRDDVYHNILTNN LPNLLEYGNITALTNLWYTGQITNFEYLTHLNKHAGRSFNDLMQYPVFPFILADYVSE TLDLNDLLIYRNLSKPIAVQYKEKEDRYVDTYKYLEEEYRKGAREDDPMPPVQPYHYG SHYSNSGTVLHFLVRMPPFTKMFLAYQDQSFDIPDRTFHSTNTTWRLSSFESMTDVKE LIPEFFYLPEFLVNREGFDFGVRQNGERVNHVNLPPWARNDPRLFILIHRQALESDYV SQNICQWIDLVFGYKQKGKASVQAINVFHPATYFGMDVSAVEDPVQRRALETMIKTYG QTPRQLFHMAHVSRPGAKLNIEGELPAAVGLLVQFAFRETREQVKEITYPSPLSWIKG LKWGEYVGSPSAPVPVVCFSQPHGERFGSLQALPTRAICGLSRNFCLVMTYSKEQGVR SMNSTDIQWSAILSWGYADNILRLKSKQSEPPVNFIQSSQQYQVTSCAWVPDSCQLFT GSKCGVITAYTNRFTSSTPSEIEMETQIHLYGHTEEITSLFVCKPYSILISVSRDGTC IIWDLNRLCYVQSLAGHKSPVTAVSASETSGDIATVCDSAGGGSDLRLWTVNGDLVGH VHCREIICSVAFSNQPEGVSINVIAGGLENGIVRLWSTWDLKPVREITFPKSNKPIIS LTFSCDGHHLYTANSDGTVIAWCRKDQQRLKQPMFYSFLSSYAAG" BASE COUNT 4172 a 2621 c 2791 g 3862 t 3 others ORIGIN 1 gcggccgcgt cgacgcggcg gcggcagcgg cgtcggctcg gggttctccg ggagaggggg 61 agtgcgcggc ggccgcagct gccacaaacc aggtgaagct ttgttctaag aatatttgtt 121 tcatctagtt tatgagtcca aatgatatag actgtaaatg tcacagcagt ggtgaaagac 181 tgctcggtca tgagcaccga cagtaactca ctggcacgtg aatttctgac cgatgtcaac 241 cggctttgca atgcagtggt ccagagggtg gaggccaggg aggaagaaga ggaggagacg 301 cacatggcaa cccttggaca gtaccttgtc catggtcgag gatttctatt acttaccaag 361 ctaaattcta taattgatca ggcattgaca tgtagagaag aactcctgac tcttcttctg 421 tctctccttc cactggtatg gaagatacct gtccaagaag aaaaggcaac agattttaac 481 ctaccgctct cagcagatat aatcctgacc aaagaaaaga actcaagttc acaaagatcc 541 actcaggaaa aattacattt agaaggaagt gccctgtcta gtcaggtttc tgcaaaagta 601 aatgtttttc gaaaaagcag acgacagcgt aaaattaccc atcgctattc tgtaagagat 661 gcaagaaaga cacagctctc cacctcagat tcagaagcca attcagatga aaaaggcata 721 gcaatgaata agcatagaag gccccatctg ctgcatcatt ttttaacatc gtttcctaaa 781 caagaccacc ccaaagctaa acttgaccgc ttagcaacca aagaacagac tcctccagat 841 gctatggctt tggaaaattc cagagagatt attccaagac aggggtcaaa cactgacatt 901 ttaagtgagc cagctgcctt gtctgttatc agtaacatga acaattctcc atttgactta 961 tgtcatgttt tgttatcttt attagaaaaa gtttgtaagt ttgacgttac cttgaatcat 1021 aattctcctt tagcagccag tgtagtgccc acactaactg aattcctagc aggctttggg 1081 gactgctgca gtctgagcga caacttggag agtcgagtag tttctgcagg ttggaccgaa 1141 gaaccggtgg ctttgattca aaggatgctc tttcgaacag tgttgcatct tctgtcagta 1201 gatgttagta ctgcagagat gatgccagaa aatcttagga aaaatttaac tgaattgctt 1261 agagcagctt taaaaattag aatatgccta gaaaagcagc ctgacccttt tgcaccaaga 1321 caaaagaaaa cactgcagga ggttcaggaa gattttgtgt tttcaaagta tcgtcataga 1381 gcccttcttt tacctgagct tttggaagga gttcttcaga ttctgatctg ttgtcttcaa 1441 agtgcagctt caaatccctt ctacttcagt caagccatgg atttggttca agaattcatt 1501 cagcatcatg gatttaattt atttgaaaca gcagttcttc aaatggaatg gctggtttta 1561 agagatggag ttcctcccga ggcctcagag catttgaaag ccctaataaa tagtgtgatg 1621 aaaataatga gcactgtcaa aaaagtgaaa tcagagcaac ttcatcattc gatgtgtaca 1681 agaaaaaggc acagacgatg tgaatattct cattttatgc atcatcaccg agatctctca 1741 ggtcttctgg tttcggcttt taaaaaccag gtttccaaaa acccatttga agagactgca 1801 gatggagatg tttattatcc tgagcggtgc tgttgcattg cagtgtgtgc ccatcagtgc 1861 ttgcgcttac tacagcaggc ttccttgagc agcacttgtg tccagatcct atcgggtgtt 1921 cataacattg gaatatgctg ttgtatggat cccaaatctg taatcattcc tttgctccat 1981 gcttttaaat tgccagcact gaaaaatttt cagcagcata tattgaatat ccttaacaaa 2041 cttattttgg atcagttagg aggagcagag atatcaccaa aaattaaaaa agcagcttgt 2101 aatatttgta ctgttgactc tgaccaacta gcccaattag aagagacact gcagggaaac 2161 ttatgtgatg ctgaactctc ctcaagttta tccagtcctt cttacagatt tcaagggatc 2221 ctgcccagca gtggatctga agatttgttg tggaaatggg atgctttaaa ggcttatcag 2281 aactttgttt ttgaagaaga cagattacat agtatacaga ttgcaaatca catttgcaat 2341 ttaatccaga aaggcaatat agttgttcag tggaaattat ataattacat atttaatcct 2401 gtgctccaaa gaggagttga attagcacat cattgtcaac acctaagcgt tacttcagct 2461 caaagtcatg tatgtagcca tcataaccag tgcttgcctc aggacgtgct tcagatttat 2521 gtaaaaactc tgcctatcct gcttaaatcc agggtaataa gagatttgtt tttgagttgt 2581 aatggagtaa gtcaaataat cgaattaaat tgcttaaatg gtattcgaag tcattctcta 2641 aaagcatttg aaactctgat aatcagccta ggggagcaac agaaagatgc ctcagttcca 2701 gatattgatg ggatagacat tgaacagaag gagttgtcct ctgtacatgt gggtacttct 2761 tttcatcatc agcaagctta ttcagattct cctcagagtc tcagcaaatt ttatgctggc 2821 ctcaaagaag cttatccaaa gagacggaag actgttaacc aagatgttca tatcaacaca 2881 ataaacctat tcctctgtgt ggctttttta tgcgtaagta aagaagcaga gtctgacagg 2941 gagtcggcca atgactcaga agatacttct ggctatgaca gcacagccag cgagccttta 3001 agtcatatgc tgccatgtat atctctcgag agccttgtct tgccttctcc tgaacatatg 3061 caccaagcag cagacatttg gtctatgtgt cgttggatct acatgttgag ttcagtgttc 3121 cagaaacagt tttataggct tggtggtttc cgagtatgcc ataagttaat atttatgata 3181 atacagaaac tgttcagaag tcacaaagag gagcaaggaa aaaaggaggg agatacaagt 3241 gtaaatgaaa accaggattt aaacagaatt tctcaaccta agagaactat gaaggaagat 3301 ttattatctt tggctataaa aagtgacccc ataccatcag aactaggtag tctaaaaaag 3361 agtgctgaca gtttaggtaa attagagtta cagcatattt cttccataaa tgtggaagaa 3421 gtttcagcta ctgaagccgc tcccgaggaa gcaaagctat ttacaagtca agaaagtgag 3481 acctcacttc aaagtatacg acttttggaa gcccttctgg ccatttgtct tcatggtgcc 3541 agaactagtc aacagaagat ggaattggag ttacctaatc agaacttgtc tgtggaaagt 3601 atattatttg aaatgaggga ccatctttcc cagtcaaagg tgattgaaac acaactagca 3661 aagccgttat ttgatgccct gcttcgagtt gccctcggga attattcagc agattttgaa 3721 cataatgatg ctatgactga gaagagtcat caatctgcag aagaattgtc atcccagcct 3781 ggtgattttt cagaagaagc tgaggattct cagtgttgta gttttaaact tttagttgaa 3841 gaagaaggtt acgaagcaga tagtgaaagc aatcctgaag atggcgaaac ccaggatgat 3901 ggggtagact taaagtctga aacagaaggt ttcagtgcat caagcagtcc aaatgactta 3961 ctcgaaaacc tcactcaagg ggaaataatt tatcctgaga tttgtatgct ggaattaaat 4021 ttgctttctg ctagtaaagc caaacttgat gtgcttgccc atgtatttga gagttttttg 4081 aaaattatta ggcagaaaga aaagaatgtt tttctgctca tgcaacaggg aactgtgaaa 4141 aatcttttag gagggttctt gagtatttta acacaggatg attctgattt tcaagcatgc 4201 cagagagtat tggtggatct tttggtatct ttgatgagtt caagaacatg ttcagaagag 4261 ctaacccttc ttttgagaat atttctggag aaatctcctt gtacaaaaat tcttcttctg 4321 ggtattctga aaattattga aagtgatact actatgagcc cttcacagta tctaaccttc 4381 cctttactgc acgctccaaa tttaagcaac ggtgtttcat cacaaaagta tcctgggatt 4441 ttaaacagta aggccatggg tttattgaga agagcacgag tttcacggag caagaaagag 4501 gctgatagag agagttttcc ccatcggctg ctttcatctt ggcacatagc cccagtccac 4561 ctgccgttgc tggggcaaaa ctgctggcca cacctatcag aaggtttcag tgtttccctg 4621 tggtttaatg tggagtgtat ccatgaagct gagagtacta cagaaaaagg aaagaagata 4681 aagaaaagaa acaaatcatt aattttacca gatagcagtt ttgatggtac agagagcgac 4741 agaccagaag gtgcagagta cataaatcct ggtgaaagac tcatagaaga aggatgtatt 4801 catataattt cactgggatc caaagcgttg atgatccaag tgtgggctga tccccacaat 4861 gccactctta tctttcgtgt gtgcatggat tcaaatgatg acatgaaagc tgttttacta 4921 gcacaggttg aatcacagga gaatattttc ctcccaagca aatggcaaca tttagtactc 4981 acctacttac agcagcccca agggaaaagg aggattcatg ggaaaatctc catatgggtc 5041 tctggacaga ggaagcctga tgttactttg gattttatgc ttccaagaaa aacaagtttg 5101 tcatctgata gcaataaaac attttgcatg attggccatt gtttatcatc ccaagaagag 5161 tttttgcagt tggctggaaa atgggacctg ggaaatttgc ttctcttcaa cggagctaag 5221 gttggttcac aagaggcctt ttatctgtat gcttgtggac ccaaccatac atctgtaatg 5281 ccatgtaagt atggcaagcc agtcaatgac tactccaaat atattaataa agaaattttg 5341 cgatgtgaac aaatcagaga actttttatg accaagaaag atgtggatat tggtctctta 5401 attgaaagtc tttcagttgt ttatacaact tactgtcctg ctcagtatac catctatgaa 5461 ccagtgatta gacttaaagg tcaaatgaaa acccaactct ctcaaagacc cttcagctca 5521 aaagaagttc agagcatctt attagaacct catcatctaa agaatctcca acctactgaa 5581 tataaaacta ttcaaggcat tctgcacgaa attggtggaa ctggcatatt tgtttttctc 5641 tttgccaggg ttgttgaact cagtagctgt gaagaaactc aagcattagc actgcgagtt 5701 atactctcat taattaaata caaccaacaa agagtacatg aattagaaaa ttgtaatgga 5761 ctttctatga ttcatcaggt gttgatcaaa caaaaatgca ttgttgggtt ttacattttg 5821 aagacccttc ttgaaggatg ctgtggtgaa gatattattt atatgaatga gaatggagag 5881 tttaagttgg atgtagactc taatgctata atccaagatg ttaagctgtt agaggaacta 5941 ttgcttgact ggaagatatg gagtaaagca gagcaaggtg tttgggaaac tttgctagca 6001 gctctagaag tcctcatcag agcagatcac caccagcaga tgtttaatat taagcagtta 6061 ttgaaagctc aagtggttca tcactttcta ctgacttgtc aggttttgca ggaatacaaa 6121 gaggggcaac tcacacccat gccccgagag gtttgtagat catttgtgaa aattatagca 6181 gaagtccttg gatctcctcc agatttggaa ttattgacaa ttatcttcaa tttcctttta 6241 gcagttcacc ctcctactaa tacttacgtt tgtcacaatc ccacgaactt ctacttttct 6301 ttgcacatag atggcaagat ctttcaggag aaagtgcggt caatcatgta cctgaggcat 6361 tccagcagtg gaggaaggtc ccttatgagc cctggattta tggtaataag cccatctggt 6421 tttactgctt caccatatga aggagagaat tcctctaata ttattccaca acagatggcc 6481 gcccatatgc tgcgttctag aagcctacca gcattcccta cttcttcact actaacgcaa 6541 tcacaaaaac tgactggaag tttgggttgt agtatcgaca ggttacaaaa tattgcagat 6601 acttatgttg ccacccaatc aaagaaacaa aattctttgg ggagttccga cacactgaaa 6661 aaaggcaaag aggacgcatt catcagtagc tgtgagtctg caaaaactgt ttgtgaaatg 6721 gaagctgtcc tctcagccca ggtctctgtc agtgatgtcc caaagggagt gctgggattt 6781 ccagtggtca aagcagatca taaacagttg ggagcagaac ccaggtcaga agatgacagt 6841 cctggggatg agtcctgccc acgccgacct gattacctaa agggattggc ctccttccag 6901 cgaagccaca gcactattgc aagccttggg ctagcttttc cttcacagaa cggatctgca 6961 gctgttggcc gttggccaag tcttgttgat agaaacactg atgattggga aaactttgcc 7021 tattctcttg gttatgagcc aaattacaac cgaactgcaa gtgctcacag tgtaactgaa 7081 gactgtttgg tacctatatg ctgtggatta tatgaactcc taagtggggt tcttcttatc 7141 ctgcctgatg ttttgcttga agatgtgatg gacaagctta ttcaagcaga tacacttttg 7201 gtcctcgtta accacccatc accagctata caacaaggtg ttattaaact attagatgca 7261 tattttgcta gagcatctaa ggaacaaaaa gataaatttc tgaagaatcg tggattttcc 7321 ttgctagcca accagttgta tcttcatcga ggaactcaag aattgttaga atgcttcatc 7381 gaaatgttct ttggtcgaca tattggcctt gatgaagaat ttgatctgga agatgtgaga 7441 aacatgggat tgtttcagaa gtggtctgtc attcctattc tgggactaat agagacctct 7501 ctatatgaca acatactctt gcataatgct cttttacttc ttctccaaat tttaaattct 7561 tgttctaagg tagcagatat gttgctggat aatggtctac tctatgtgtt atgtaataca 7621 gtagcagccc tgaatggatt agaaaagaac attcccatga gtgaatataa attgcttgct 7681 tgtgatatac agcaactttt catagcagtt acaattcatg cttgcagttc ctcaggctca 7741 caatatttta gggttattga agaccttatt gtaatgcttg gatatcttca aaatagcaaa 7801 aacaagagga cacaaaatat ggctgttgca ctacagctta gagttctcca ggctgctatg 7861 gaatttataa ggaccaccgc aaatcatgac tctgaaaacc tcacagattc actccagtca 7921 ccttctgctc cccatcatgc agtagttcaa aagcggaaaa gcattgctgg tcctcgaaaa 7981 tttccccttg ctcaaactga atcgcttctg atgaaaatgc gttcagtggc aaatgatgag 8041 cttcatgtga tgatgcaacg gagaatgagc caagagaacc ctagccaagc aactgaaacg 8101 gaacttgcgc agagactaca gaggctcact gttttagcag tcaacaggat tatttatcaa 8161 gaatttaatt cagacattat tgacattttg agaactccag aaaatgtaac tcaaagcaag 8221 acctcagttt tccagaccga aatttctgag gaaaatattc atcatgaaca gtcttctgtt 8281 ttcaatccat ttcagaaaga aatttttaca tatctggtag aaggattcaa agtatctatt 8341 ggttcaagta aagccagtgg ttccaagcag caatggacta aaattctgtg gtcttgtaag 8401 gagaccttcc gaatgcagct tgggagacta ctagtgcata ttttgtcgcc agcccacgct 8461 gcacaagaga gaaagcaaat ttttgaaata gttcatgaac caaatcatca ggaaatacta 8521 cgagactgtc tcagcccatc cctacaacat ggagccaagt tagttttgta tttgtcagag 8581 ttgatacata atcaccaagg tgaattgact gaagaagagc taggcacagc agaactgctt 8641 atgaatgctt tgaagttatg tggtcacaag tgcatccctc ccagtgcatc aacaaaagca 8701 gaccttatta aaatgatcaa agaggaacaa aagaaatatg aaactgaaga aggagtgaat 8761 aaagctgctt ggcagaaaac agttaacaat aatcaacaaa gtctctttca gcgtctggat 8821 tcaaaatcaa aggatatatc taaaatagct gcagatatca cccaggcagt gtctctctcc 8881 caaggaaatg agagaaaaaa ggtgatccag catattagag gaatgtataa agtagatttg 8941 agtgccagca gacattggca ggaacttatt cagcagctga cacatgatag agcagtatgg 9001 tatgacccca tctactatcc aacctcatgg cagttggatc caacagaagg gccaaatcga 9061 gagaggagac gtttacagag atgttattta actattccaa ataagtatct ccttagggat 9121 agacagaaat cagaagatgt tgtcaaacca ccactctctt acctgtttga agacaaaact 9181 cattcttctt tctcttctac tgtcaaagac aaagctgcaa gtgaatctat aagagtgaat 9241 cgaagatgca tcagtgttgc accatctaga gagacagctg gtgaattgtt actaggtaaa 9301 tgtggaatgt attttgtgga agataatgct tctgatacag ttgaaagttc gagccttcag 9361 ggagagttgg aaccagcatc attttcctgg acatatgaag aaattaaaga agttcacaag 9421 cgttggtggc aattgagaga taatgctgta gaaatctttc taacaaatgg cagaacactc 9481 ctgttggcat ttgataacac caaggttcgt gatgatgtat accacaatat actcacaaat 9541 aacctcccta atcttctgga atatggtaac atcaccgctc tgacaaattt atggtatact 9601 gggcaaatta ctaattttga atatttgact cacttaaaca aacatgctgg ccgatccttc 9661 aatgatctca tgcagtatcc tgtgttccca tttatacttg ctgactacgt tagtgagaca 9721 cttgacctca atgatctgtt gatatacaga aatctctcta aacctatagc tgttcagtat 9781 aaagaaaaag aagatcgtta tgtggacaca tacaagtact tggaggaaga gtaccgcaaa 9841 ggagccagag aagatgaccc catgcctccc gtgcagccct atcactatgg ctcccactat 9901 tccaatagcg gcactgtgct tcacttcctg gtcaggatgc ctcctttcac taaaatgttt 9961 ttagcctatc aagatcaaag ttttgacatt ccagacagaa cttttcattc tacaaataca 10021 acttggcgac tctcatcttt tgaatctatg actgatgtga aagaacttat cccagagttt 10081 ttctatcttc cagagttcct agttaaccgt gaaggttttg attttggtgt gcgtcagaat 10141 ggtgaacggg ttaatcacgt caaccttccc ccttgggcgc gtaatgatcc tcgtcttttt 10201 atcctcatcc atcggcaggc tctagagtct gactacgtgt cgcagaacat ctgtcagtgg 10261 attgacttgg tgtttgggta taagcaaaag gggaaggctt ctgttcaagc gatcaatgtt 10321 tttcatcctg ctacatattt tggaatggat gtctctgcag ttgaagatcc agttcagaga 10381 cgagcgctag aaaccatgat aaaaacctac gggcagactc cccgtcagct gttccacatg 10441 gcccatgtga gcagacctgg agccaagctc aatattgaag gagagcttcc agctgctgtg 10501 gggttgctag tgcagtttgc tttcagggag acccgagaac aggtcaaaga aatcacctat 10561 ccgagtcctt tgtcatggat aaaaggcttg aaatgggggg aatacgtggg ttcccccagt 10621 gctccagtac ctgtggtctg cttcagccag ccccacggag aaagatttgg ctctctccag 10681 gctctgccca ccagagcaat ctgtggtttg tcacggaatt tctgtcttgt gatgacatat 10741 agcaaggaac aaggtgtgag aagcatgaac agtacggaca ttcagtggtc agccatcctg 10801 agctggggat atgctgataa tattttaagg ttgaagagta aacaaagtga gcctccagta 10861 aactttattc aaagttcaca acagtaccag gtgactagtt gtgcttgggt gcctgacagt 10921 tgccagctgt ttactggaag caaatgcggt gtcatcacag cctacacaaa cagatttaca 10981 agcagcacgc catcagaaat agaaatggag actcaaatac atctctatgg tcacacagaa 11041 gagataacca gcttatttgt ttgcaaacca tacagtatac tgataagtgt gagcagagac 11101 ggaacctgca tcatatggga tttaaacagg ttatgctatg tacaaagtct ggcgggacac 11161 aaaagccctg tcacagctgt ctctgccagt gaaacctcag gtgatattgc tactgtgtgt 11221 gattcagctg gcggaggcag tgacctcaga ctctggacgg tgaacgggga tctcgttgga 11281 catgtccact gcagggagat catctgttcc gtggctttct ccaaccagcc tgagggagta 11341 tctatcaatg taatcgctgg gggattagaa aatggaattg taaggttatg gagcacatgg 11401 gacttaaagc ctgtgagaga aattacattt cccaaatcaa ataagcccat catcagcctt 11461 acattttctt gtgatggcca ccatttgtac acagcaaaca gtgatgggac cgtgattgcc 11521 tggtgtcgga aggaccagca gcgcttgaaa cagccaatgt tctattcctt ccttagcagc 11581 tatgcagccg ggtgaatgcg aatgaacttc acgttctcca aagcacttta actccaaact 11641 agatttgttg acttcaccag ttttaggagg ttgaacctaa agaaatggat gactggacaa 11701 accatccaaa taatgataaa gtctattcat ctgcacaaaa ttctgaagag tcacatgatc 11761 ctaagaggaa agttctgttc tattttagtg ataatctgga agattgtgtc aatatgcact 11821 agccaacaag ttttaagcct cgcatggtac attaaaatga tattcttaaa attttttccc 11881 accaaggtat tccaaagaaa atattaaggt ctcccctttt tctatgattc caaaaggacc 11941 agtagaattt aaattggttg gttgatngtt tatataaaac acactaaaat tatattttaa 12001 aagtttantg ccntgaaata ctcctcccac cacacacaca tgctccaaaa gaggaaagaa 12061 aaaaagataa tttttaggac ttgataattg ctttctttga gaagcaaatt attcagtagg 12121 tgcctctgta ccaaatattt tatggaatat ctaaatacta aaataaacta tgaatgaatc 12181 tcaaaattag gcagtttttg ccagttgctt tcttagctca aaggagaacc agaatttttt 12241 tgacagccac aaacaagaat acaggtatct tggatttcag acacattctg tttcttcata 12301 aaaattttac ttaaaatctg taacgctaga tattgactat ccttagttga gtcactgagg 12361 tttaaacaca atggtaagtc ttaaagtctg ctatttacag agcattgaat ctgtaccaat 12421 ttgcaataga aagccttcag tatgcaagaa gtttgcatgg gtattaagaa cacagcctaa 12481 ataaggcatt tgatctaatc tgcaggaaga attttcttcc ccaaaacaga attataaaag 12541 cttactttaa acaggaggca gaataattct tttaggaaac catttcattc tgtttctact 12601 aacctatacc atctgagaat tcctaaacat cttggagccg tctgtctctc ccatatgatg 12661 gctgtctgta tatttttact tggggtgctg ctttattggc tttgaaaaca ctgtcagata 12721 agctcagtaa tatgttacca tgggataaaa atatgtatcc ctgcctaaga ataacttgtg 12781 catttgttat ggaaatttaa ttcatatggt gtttacagta ctacttttgt aacttccaga 12841 ctttctaaaa cattctgctt aaaaaccata taaaatataa ttccaaagtc tctgctgtca 12901 agatagattc gagagaaagc acgtggccat gtatgcttta accttaaact gcatacacat 12961 gtagtgatac ctaggctgca tttagatcac cgtgtgctca ggccaggtgt gaatcctgag 13021 gtccatggag gtgcagagat gagattactc ctattcacgt tgaagtgatt tgctttgtta 13081 acaaaaaatt gcagctattg tctagctttc atttttttac tgagaacttt aaattagtcc 13141 cctattagaa tagggttgct actcatcttt ttttaaaaac cgaatttcat catttatcta 13201 aagagaaaat atgcagaata actggtcttg ttaagagtgc aatattatat ttttatgtaa 13261 aaataaaaat taatttgggg ggattattta ttcagcatga aacctaatat gtatatgttt 13321 gaaatacttc ataatgtgca tgttgtagca aacatttctg taaattatca caagctctgt 13381 tacctttata tacgctgcct cttcaatttg gaaataaatt tcataaaaaa aaaaaaaaaa 13441 aaaaaaaaa // LOCUS HSU67733 4240 bp mRNA PRI 21-MAY-1997 DEFINITION Human cGMP-stimulated 3',5'-cyclic nucleotide phosphodiesterase PDE2A3 (PDE2A) mRNA, complete cds. ACCESSION U67733 NID g2108051 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4240) AUTHORS Rosman,G.J., Martins,T.J., Sonnenburg,W.K., Beavo,J.A., Ferguson,K. and Loughney,K. TITLE Isolation and characterization of human cDNAs encoding a cGMP-stimulated 3',5'-cyclic nucleotide phosphodiesterase JOURNAL Gene 191 (1), 89-95 (1997) MEDLINE 97354299 REFERENCE 2 (bases 1 to 4240) AUTHORS Rosman,G.J., Martins,T.J., Sonnenburg,W.K., Beavo,J.A., Ferguson,K. and Loughney,K. TITLE Direct Submission JOURNAL Submitted (21-AUG-1996) Icos Corporation, 22021 20th Ave. S.E., Bothell, WA 98021, USA FEATURES Location/Qualifiers source 1..4240 /organism="Homo sapiens" /db_xref="taxon:9606" gene 162..2987 /gene="PDE2A" CDS 162..2987 /gene="PDE2A" /function="cGMP-stimulated 3',5'-cyclic nucleotide phosphodiesterase" /note="PDE2 family; splice variant 3" /codon_start=1 /product="PDE2A3" /db_xref="PID:g2108052" /translation="MGQACGHSILCRSQQYPAARPAEPRGQQVFLKPDEPPPPPQPCA DSLQDALLSLGSVIDISGLQRAVKEALSAVLPRVETVYTYLLDGESQLVCEDPPHELP QEGKVREAIISQKRLGCNGLGFSDLPGKPLARLVAPLAPDTQVLVMPLADKEAGAVAA VILVHCGQLSDNEEWSLQAVEKHTLVALRRVQVLQQRGPREAPRAVQNPPEGTAEDQK GGAAYTDRDRKILQLCGELYDLDASSLQLKVLQYLQQETRASRCCLLLVSEDNLQLSC KVIGDKVLGEEVSFPLTGCLGQVVEDKKSIQLKDLTSEDVQQLQSMLGCELQAMLCVP VISRATDQVVALACAFNKLEGDLFTDEDEHVIQHCFHYTSTVLTSTLAFQKEQKLKCE CQALLQVAKNLFTHLDDVSVLLQEIITEARNLSNAEICSVFLLDQNELVAKVFDGGVV DDESYEIRIPADQGIAGHVATTGQILNIPDAYAHPLFYRGVDDSTGFRTRNILCFPIK NENQEVIGVAELVNKINGPWFSKFDEDLATAFSIYCGISIAHSLLYKKVNEAQYRSHL ANEMMMYHMKVSDDEYTKLLHDGIQPVAAIDSNFASFTYTPRSLPEDDTSMAILSMLQ DMNFINNYKIDCPTLARFCLMVKKGYRDPPYHNWMHAFSVSHFCYLLYKNLELTNYLE DIEIFALFISCMCHDLDHRGTNNSFQVASKSVLAALYSSEGSVMERHHFAQAIAILNT HGCNIFDHFSRKDYQRMLDLMRDIILATDLAHHLRIFKDLQKMAEVGYDRNNKQHHRL LLCLLMTSCDLSDQTKGWKTTRKIAELIYKEFFSQGDLEKAMGNRPMEMMDREKAYIP ELQISFMEHIAMPIYKLLQDLFPKAAELYERVASNREHWTKVSHKFTIRGLPSNNSLD FLDEEYEVPDLDGTRAPINGCCSLDAE" BASE COUNT 902 a 1260 c 1202 g 876 t ORIGIN 1 cagcagagct ggattggggt gttgagtcca ggctgagtag ggggcagccc actgctcttg 61 gtccctgtgc ctgctggggg tgccctgccc tgaactccag gcagcgggga cagggcgagg 121 tgccacctta gtctggctgg ggaggcggac gatgaggagt gatggggcag gcatgcggcc 181 actccatcct ctgcaggagc cagcagtacc cggcagcgcg accggctgag ccgcggggcc 241 agcaggtctt cctcaagccg gacgagccgc cgccgccgcc gcagccatgc gccgacagcc 301 tgcaggacgc cttgctgagt ctgggctctg tcatcgacat ttcaggcctg caacgtgctg 361 tcaaggaggc cctgtcagct gtgctccccc gagtggaaac tgtctacacc tacctactgg 421 atggtgagtc ccagctggtg tgtgaggacc ccccacatga gctgccccag gaggggaaag 481 tccgggaggc tatcatctcc cagaagcggc tgggctgcaa tgggctgggc ttctcagacc 541 tgccagggaa gcccttggcc aggctggtgg ctccactggc tcctgatacc caagtgctgg 601 tcatgccgct agcggacaag gaggctgggg ccgtggcagc tgtcatcttg gtgcactgtg 661 gccagctgag tgataatgag gaatggagcc tgcaggcggt ggagaagcat accctggtcg 721 ccctgcggag ggtgcaggtc ctgcagcagc gcgggcccag ggaggctccc cgagccgtcc 781 agaacccccc ggaggggacg gcggaagacc agaagggcgg ggcggcgtac accgaccgcg 841 accgcaagat cctccaactg tgcggggaac tctacgacct ggatgcctct tccctgcagc 901 tcaaagtgct ccaatacctg cagcaggaga cccgggcatc ccgctgctgc ctcctgctgg 961 tgtcggagga caatctccag ctttcttgca aggtcatcgg agacaaagtg ctcggggaag 1021 aggtcagctt tcccttgaca ggatgcctgg gccaggtggt ggaagacaag aagtccatcc 1081 agctgaagga cctcacctcc gaggatgtac aacagctgca gagcatgttg ggctgtgagc 1141 tgcaggccat gctctgtgtc cctgtcatca gccgggccac tgaccaggtg gtggccttgg 1201 cctgcgcctt caacaagcta gaaggagact tgttcaccga cgaggacgag catgtgatcc 1261 agcactgctt ccactacacc agcaccgtgc tcaccagcac cctggccttc cagaaggaac 1321 agaaactcaa gtgtgagtgc caggctcttc tccaagtggc aaagaacctc ttcacccacc 1381 tggatgacgt ctctgtcctg ctccaggaga tcatcacgga ggccagaaac ctcagcaacg 1441 cagagatctg ctctgtgttc ctgctggatc agaatgagct ggtggccaag gtgttcgacg 1501 ggggcgtggt ggatgatgag agctatgaga tccgcatccc ggccgatcag ggcatcgcgg 1561 gacacgtggc gaccacgggc cagatcctga acatccctga cgcatatgcc catccgcttt 1621 tctaccgcgg cgtggacgac agcaccggct tccgcacgcg caacatcctc tgcttcccca 1681 tcaagaacga gaaccaggag gtcatcggtg tggccgagct ggtgaacaag atcaatgggc 1741 catggttcag caagttcgac gaggacctgg cgacggcctt ctccatctac tgcggcatca 1801 gcatcgccca ttctctccta tacaaaaaag tgaatgaggc tcagtatcgc agccacctgg 1861 ccaatgagat gatgatgtac cacatgaagg tctccgacga tgagtatacc aaacttctcc 1921 atgatgggat ccagcctgtg gctgccattg actccaattt tgcaagtttc acctataccc 1981 ctcgttccct gcccgaggat gacacgtcca tggccatcct gagcatgctg caggacatga 2041 atttcatcaa caactacaaa attgactgcc cgaccctggc ccggttctgt ttgatggtga 2101 agaagggcta ccgggatccc ccctaccaca actggatgca cgccttttct gtctcccact 2161 tctgctacct gctctacaag aacctggagc tcaccaacta cctcgaggac atcgagatct 2221 ttgccttgtt tatttcctgc atgtgtcatg acctggacca cagaggcaca aacaactctt 2281 tccaggtggc ctcgaaatct gtgctggctg cgctctacag ctctgagggc tccgtcatgg 2341 agaggcacca ctttgctcag gccatcgcca tcctcaacac ccacggctgc aacatctttg 2401 atcatttctc ccggaaggac tatcagcgca tgctggatct gatgcgggac atcatcttgg 2461 ccacagacct ggcccaccat ctccgcatct tcaaggacct ccagaagatg gctgaggtgg 2521 gctacgaccg aaacaacaag cagcaccaca gacttctcct ctgcctcctc atgacctcct 2581 gtgacctctc tgaccagacc aagggctgga agactacgag aaagatcgcg gagctgatct 2641 acaaagaatt cttctcccag ggagacctgg agaaggccat gggcaacagg ccgatggaga 2701 tgatggaccg ggagaaggcc tatatccctg agctgcaaat cagcttcatg gagcacattg 2761 caatgcccat ctacaagctg ttgcaggacc tgttccccaa agcggcagag ctgtacgagc 2821 gcgtggcctc caaccgtgag cactggacca aggtgtccca caagttcacc atccgcggcc 2881 tcccaagtaa caactcgctg gacttcctgg atgaggagta cgaggtgcct gatctggatg 2941 gcactagggc ccccatcaat ggctgctgca gccttgatgc tgagtgatcc cctccaggac 3001 acttccctgc ccaggccacc tcccacagcc ctccactggt ctggccagat gcactgggaa 3061 cagagccacg ggtcctgggt cctagaccag gacttcctgt gtgaccctgg acaagtacta 3121 ccttcctggg cctcagcttt ctcgtctgta taatggaagc aagacttcca acctcacgga 3181 gactttgtaa tttgcttctc tgagagcaca ggggtgacca atgagcagtg ggccctactc 3241 tgcacctctg accacacctt ggcaagtctt tcccaagcca ttctttgtct gagcagcttg 3301 atggtttctc cttgccccat ttctgcccca ccagatcttt gctcctttcc ctttgaggac 3361 tcccaccctt tgggtctcca ggatcctcat ggaaggggaa ggtgagacat ctgagtgagc 3421 agagtgtggc atcttggaaa cagtccttag ttctgtggga ggactagaaa cagccgcggc 3481 gaaggccccc tgaggaccac tactatactg atggtgggat tgggacctgg gggatacagg 3541 ggccccagga agaagctggc cagaggggca gctcagtgct ctgcagagag gggccctggg 3601 gagaagcagg atgggattga tgggcaggag ggatccccgc actgggagac aggcccaggt 3661 atgaatgagc cagccatgct tcctcctgcc tgtgtgacgc tgggcgagtc tcttcccctg 3721 tctgggccaa acagggagcg ggtaagacaa tccatgctct aagatccatt ttagatcaat 3781 gtctaaaata gctctatggc tctgcggagt cccagcagag gctatggaat gtttctgcaa 3841 ccctaaggca cagagagcca accctgagtg tctcagaggc cccctgagtg ttccccttgg 3901 cctgagcccc ttacccattc ctgcagccag tgagagacct ggcctcagcc tggcagcgct 3961 ctcttcaagg ccatatccac ctgtgccctg gggcttggga gaccccatag gccgggactc 4021 ttgggtcagc ccgccactgg cttctctctt tttctccgtt tcattctgtg tgcgttgtgg 4081 ggtgggggag ggggtccacc tgccttacct ttctgagttg cctttagaga gatgcgtttt 4141 tctaggactc tgtgcaactg tcgtatatgg tcccgtgggc tgaccgcttt gtacatgaga 4201 ataaatctat ttctttctac caaaaaaaaa aaaaaaaaaa // LOCUS HSU67963 1192 bp mRNA PRI 02-JAN-1997 DEFINITION Human lysophospholipase homolog (HU-K5) mRNA, complete cds. ACCESSION U67963 NID g1763010 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1192) AUTHORS Upton,C. and Buller,R.M.L. TITLE Human homolog of an ectromelia virus protein has similarity to E.coli lysophospholipase JOURNAL Unpublished REFERENCE 2 (bases 1 to 1192) AUTHORS Upton,C. and Buller,R.M.L. TITLE Direct Submission JOURNAL Submitted (23-AUG-1996) Biochemistry and Microbiology, University of Victoria, 150 Petch Building, Victoria, BC V8W 2Y2, Canada FEATURES Location/Qualifiers source 1..1192 /organism="Homo sapiens" /note="EST114238" /db_xref="taxon:9606" /tissue_type="lung" /clone="ATCC #180227" gene 151..1092 /gene="HU-K5" CDS 151..1092 /gene="HU-K5" /codon_start=1 /product="lysophospholipase homolog" /db_xref="PID:g1763011" /translation="METGPEDPSSMPEESSPRRTPQSIPYQDLPHLVNADGQYLFCRY WKPTGTPKALIFVSHGAGEHSGRYEELARMLMGLDLLVFAHDHVGHGQSEGERMVVSD FHVFVRDVLQHVDSMQKDYPGLPVFLLGHSMGGAIAILTAAERPGHFAGMVLISPLVL ANPESATTFKVLAAKVLNLVLPNLSLGPIDSSVLSRNKTEVDIYNSDPLICRAGLKVC FGIQLLNAVSRVERALPKLTVPFLLLQGSADRLCDSKGAYLLMELAKSQDKTLKIYEG AYHVLHKELPEVTNSVFHEINMWVSQRTATAGTASPP" BASE COUNT 262 a 350 c 335 g 245 t ORIGIN 1 ccagcccgaa aggcagggtc tgggtgcggg aagagggctc ggagctgcct tcctgctgcc 61 ttggggccgc ccagatgagg gaacagcccg atttgcctgg ttctgattct ccaggctgtc 121 gtggttgtgg aatgcaaacg ccagcacata atggaaacag gacctgaaga cccttccagc 181 atgccagagg aaagttcccc caggcggacc ccgcagagca ttccctacca ggacctccct 241 cacctggtca atgcagacgg acagtacctc ttctgcaggt actggaaacc cacaggcaca 301 cccaaggccc tcatctttgt gtcccatgga gccggagagc acagtggccg ctatgaagag 361 ctggctcgga tgctgatggg gctggacctg ctggtgttcg cccacgacca tgttggccac 421 ggacagagcg aaggggagag gatggtagtg tctgacttcc acgttttcgt cagggatgtg 481 ttgcagcatg tggattccat gcagaaagac taccctgggc ttcctgtctt ccttctgggc 541 cactccatgg gaggcgccat cgccatcctc acggccgcag agaggccggg ccacttcgcc 601 ggcatggtac tcatttcgcc tctggttctt gccaatcctg aatctgcaac aactttcaag 661 gtccttgctg cgaaagtgct caaccttgtg ctgccaaact tgtccctcgg gcccatcgac 721 tccagcgtgc tctctcggaa taagacagag gtcgacattt ataactcaga ccccctgatc 781 tgccgggcag ggctgaaggt gtgcttcggc atccaactgc tgaatgccgt ctcacgggtg 841 gagcgcgccc tccccaagct gactgtgccc ttcctgctgc tccagggctc tgccgatcgc 901 ctatgtgaca gcaaaggggc ctacctgctc atggagttag ccaagagcca ggacaagact 961 ctcaagattt atgaaggtgc ctaccatgtt ctccacaagg agcttcctga agtcaccaac 1021 tccgtcttcc atgaaataaa catgtgggtc tctcaaagga cagccacggc aggaactgcg 1081 tccccaccct gaatgcattg gccggtgccc ggctcatggt ctgggggatg caggcagggg 1141 aagggcagag atggcttctc agatatggct tgcaaaaaaa aaaaaaaaaa aa // LOCUS HSU68019 2303 bp mRNA PRI 14-OCT-1997 DEFINITION Homo sapiens mad protein homolog (hMAD-3) mRNA, complete cds. ACCESSION U68019 NID g2522266 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2303) AUTHORS Zhang,Y., Feng,X.-H., Wu,R.-Y. and Derynck,R. TITLE Receptor-associated Mad homologues synergize as effectors of the TGF-beta response JOURNAL Nature 383 (6596), 168-172 (1996) MEDLINE 96371046 REFERENCE 2 (bases 1 to 2303) AUTHORS Zhang,Y. TITLE Direct Submission JOURNAL Submitted (26-AUG-1996) Growth & Development, University of California San Francisco, 521 Parnassus Ave., C603, San Francisco, CA 94143-0640, USA REFERENCE 3 (bases 1 to 2303) AUTHORS Zhang,Y. TITLE Direct Submission JOURNAL Submitted (14-OCT-1997) Growth & Development, University of California San Francisco, 521 Parnassus Ave., C603, San Francisco, CA 94143-0640, USA REMARK Sequence update by submitter FEATURES Location/Qualifiers source 1..2303 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..2303 /gene="hMAD-3" CDS 67..1344 /gene="hMAD-3" /codon_start=1 /product="mad protein homolog" /db_xref="PID:g2522267" /translation="MSSILPFTPPIVKRLLGWKKGEQNGQEEKWCEKAVKSLVKKLKK TGQLDELEKAITTQNVNTKCITIPRSLDGRLQVSHRKGLPHVIYCRLWRWPDLHSHHE LRAMELCEFAFNMKKDEVCVNPYHYQRVETPVLPPVLVPRHTEIPAEFPPLDDYSHSI PENTNFPAGIEPQSNIPETPPPGYLSEDGETSDHQMNHSMDAGSPNLSPNPMSPAHNN LDLQPVTYCEPAFWCSISYYELNQRVGETFHASQPSMTVDGFTDPSNSERFCLGLLSN VNRNAAVELTRRHIGRGVRLYYIGGEVFAECLSDSAIFVQSPNCNQRYGWHPATVCKI PPGCNLKIFNNQEFAALLAQSVNQGFEAVYQLTRMCTIRMSFVKGWGAEYRRQTVTST PCWIELHLNGPLQWLDKVLTQMGSPSIRCSSVS" BASE COUNT 549 a 669 c 605 g 480 t ORIGIN 1 cccggcgtcc cgtcgagccc agccccgccg ggggcgctcc tcgccgcccg cacgccctcc 61 ccagccatgt cgtccatcct gcctttcact cccccgatcg tgaagcgcct gctgggctgg 121 aagaagggcg agcagaacgg gcaggaggag aaatggtgcg agaaggcggt caagagcctg 181 gtcaagaaac tcaagaagac ggggcagctg gacgagctgg agaaggccat caccacgcag 241 aacgtcaaca ccaagtgcat caccatcccc aggtccctgg atggccggtt gcaggtgtcc 301 catcggaagg ggctccctca tgtcatctac tgccgcctgt ggcgatggcc agacctgcac 361 agccaccacg agctgcgggc catggagctg tgtgagttcg ccttcaatat gaagaaggac 421 gaggtctgcg tgaatcccta ccactaccag agagtagaga caccagttct acctcctgtg 481 ttggtgccac gccacacaga gatcccggcc gagttccccc cactggacga ctacagccat 541 tccatccccg aaaacactaa cttccccgca ggcatcgagc cccagagcaa tattccagag 601 accccacccc ctggctacct gagtgaagat ggagaaacca gtgaccacca gatgaaccac 661 agcatggacg caggttctcc aaacctatcc ccgaatccga tgtccccagc acataataac 721 ttggacctgc agccagttac ctactgcgag ccggccttct ggtgctccat ctcctactac 781 gagctgaacc agcgcgtcgg ggagacattc cacgcctcgc agccatccat gactgtggat 841 ggcttcaccg acccctccaa ttcggagcgc ttctgcctag ggctgctctc caatgtcaac 901 aggaatgcag cagtggagct gacacggaga cacatcggaa gaggcgtgcg gctctactac 961 atcggagggg aggtcttcgc agagtgcctc agtgacagcg ctatttttgt ccagtctccc 1021 aactgtaacc agcgctatgg ctggcacccg gccaccgtct gcaagatccc accaggatgc 1081 aacctgaaga tcttcaacaa ccaggagttc gctgccctcc tggcccagtc ggtcaaccag 1141 ggctttgagg ctgtctacca gttgacccga atgtgcacca tccgcatgag cttcgtcaaa 1201 ggctggggag cggagtacag gagacagact gtgaccagta ccccctgctg gattgagctg 1261 cacctgaatg ggcctttgca gtggcttgac aaggtcctca cccagatggg ctccccaagc 1321 atccgctgtt ccagtgtgtc ttagagacat caagtatggt aggggagggc aggcttgggg 1381 aaaatggcca tacaggaggt ggagaaaatt ggaactctac tcaacccatt gttgtcaagg 1441 aagaagaaat ctttctccct caactgaagg ggtgcaccca cctgttttct gaaacacacg 1501 agcaaaccca gaggtggatg ttatgaacag ctgtgtctgc caaacacatt taccctttgg 1561 ccccactttg aagggcaaga aatggcgtct gctctggtgg cttaagtgag cagaacaggt 1621 agtattacac caccggcacc ctccccccag actctttttt tgagtgacag ctttctggga 1681 tgtcacagtc caaccagaaa cgcccctctg tctaggactg cagtgtggag ttcaccttgg 1741 aagggcgttc taggtaggaa gagcccgcac gatgcagacc tcatgcccag ctctctgacg 1801 cttgtgacag tgcctcttcc agtgaacatt cccagcccag ccccgccccg ttgtgagctg 1861 gatagacttg ggatggggag ggagggagtt ttgtctgtct ccctcccctc tcagaacata 1921 ctgattggga ggtgcgtgtt cagcagaacc tgcacacagg acagcgggaa aaatcgatga 1981 gcgccacctc tttaaaaact cacttacgtt gtcctttttc actttgaaaa gttggaagga 2041 ctgctgaggc ccagtgcata tgcaatgtat agtgtctatt atcacattaa tctcaaagag 2101 attcgaatga cggtaagtgt tctcatgaag caggaggccc ttgtcgtggg atggcatttg 2161 gtctcaggca gcaccacact gggtgcgtct ccagtcatct gtaagagctt gctccagatt 2221 ctgatgcata cggctatatt ggtttatgta gtcagttgca ttcattaaat caactttatc 2281 atatgctcaa aaaaaaaaaa aag // LOCUS HSU68140 2907 bp mRNA PRI 18-SEP-1997 DEFINITION Homo sapiens nuclear VCP-like protein NVLp.2 (NVL.2) mRNA, complete cds. ACCESSION U68140 NID g2406564 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2907) AUTHORS Germain-Lee,E.L., Obie,C. and Valle,D. TITLE NVL: a new member of the AAA family of ATPases localized to the nucleus JOURNAL Genomics 44 (1), 22-34 (1997) MEDLINE 97432817 REFERENCE 2 (bases 1 to 2907) AUTHORS Germain-Lee,E.L., Obie,C. and Valle,D. TITLE Direct Submission JOURNAL Submitted (27-AUG-1996) Pediatrics, Johns Hopkins Med, 600 N. Wolfe St, Baltimore, MD 21287, USA FEATURES Location/Qualifiers source 1..2907 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1q41-1q42.2" /tissue_type="kidney" gene 1..2907 /note="alternatively spliced" /gene="NVL.2" CDS 57..2627 /gene="NVL.2" /note="NVLP; AAA protein; VCP=valosin-containing protein" /codon_start=1 /product="nuclear VCP-like protein NVLp.2" /db_xref="PID:g2406565" /translation="MKPRPAGFVDNKLKQRVIQYLTSNKCGKYVDIGVLASDLQRVYS IDYGRRKRNAFRIQVEKVFSIISSEKELKNLTELEDEHLAKRARQGEEDNEYTESYSD DDSSMEDYPDPQSANHMNSSLLSLYRKGNPDSVSNTPEMEQRETTSSTPRISSKTGSI PLKTPAKDSEGGWFIDKTPSVKKDSFFLDLSCEKSNPKKPITEIQDSKDSSLLESDMK RKGKLKNKGSKRKKEDLQEVDGEIEAVLQKKAKARGLEFQISNVKFEDVGGNDMTLKE VCKMLIHMRHPEVYHHLGVVPPRGVLLHGPPGCGKTLLAHAIAGELDLPILKVAAPEI VSGVSGESEQKLRELFEQAVSNAPCIIFIDEIDAITPKREVASKDMERRIVAQLLTCM DDLNNVAATARVLVIGATNRPDSLDPALRRAGRFDREICLGIPDEASRERILQTLCRK LRLPQAFDFCHLAHLTPGFVGADLMALCREAAMCAVNRVLMKLQEQQKKNPEMEDLPS KGVQEERLGTEPTSETQDELQRLLGLLRDQDPLSEEQMQGLCIELNDFIVALSSVQPS AKREGFVTVPNVTWADIGALEDIREELTMAILAPVRNPDQFKALGLVTPAGVLLAGPP GCGKTLLAKAVANESGLNFISVKGPELLNMYVGESERAVRQVFQRAKNSAPCVIFFDE VDALCPRRSDRETGASVRVVNQLLTEMDGLEARQQVFIMAATNRPDIIDPAILRPGRL DKTLFVGLPPPADRLAILKTITKNGTKPPLDADVNLEAIAGDLRCDCYTGADLSALVR EASICALRQEMARQKSGNEKGELKVSHKHFEEAFKKVRSSISKKDQIMYERLQESLSR " BASE COUNT 891 a 607 c 709 g 700 t ORIGIN 1 cgcccccggt tgactaggag ctggcggtcc gagctgtggc ttgaaagacc gacgcgatga 61 agcccagacc tgcagggttc gtggataata aactcaagca gcgagtcatc cagtacctta 121 ccagtaacaa atgtggcaaa tatgtggaca ttggagtctt agcgtctgat ttacaaagag 181 tgtacagtat agactatggt cgaagaaaaa gaaatgcttt taggattcag gtagaaaaag 241 tatttagcat aattagtagt gagaaggaac ttaagaattt aacagaatta gaagatgaac 301 atttggcaaa aagggcaaga caaggtgaag aggataatga gtatactgaa agctattctg 361 atgatgattc aagtatggaa gactacccag atccacagtc agcaaatcac atgaacagtt 421 ccctgctgtc tttatatcgg aaaggaaatc ctgattctgt ttcaaatact cctgagatgg 481 agcaaagaga aaccacctct tcaacaccac gaataagttc caaaacaggc tccattccct 541 tgaagacccc tgccaaagat tctgaaggag gatggtttat tgacaaaacc ccaagtgtaa 601 agaaagacag ttttttcttg gacctgtcat gtgagaaaag taatcctaag aagccaataa 661 ctgagataca ggattcaaaa gattcttctc ttttggagag tgatatgaaa cggaaaggca 721 agctaaagaa taaaggaagc aaaaggaaga aagaagatct tcaggaagta gatggagaaa 781 ttgaagctgt cctacaaaag aaagctaaag ccagggggtt agaattccag atctccaacg 841 tgaagtttga agatgtggga ggcaatgata tgacattaaa agaggtctgc aagatgctca 901 tacacatgcg tcacccggag gtgtaccacc acctgggcgt cgtgccccct cgtggagttc 961 tccttcatgg accaccaggc tgtgggaaga cattacttgc acatgcaatt gctggggaac 1021 ttgacctgcc aattttgaaa gtggctgctc cagagattgt gtctggagta tccggagagt 1081 ctgagcagaa gctgagagaa ctatttgagc aagctgtgtc aaatgcacca tgtatcattt 1141 tcattgatga aattgatgct attaccccca aaagagaagt ggcttcaaaa gatatggaac 1201 gaagaattgt agcccaactc ctaacctgca tggatgatct gaataatgtg gctgctacag 1261 cccgggtcct agttattgga gctactaatc gaccagactc gttagaccct gctttgagac 1321 gtgcgggaag gttcgaccga gaaatatgcc taggtatccc agatgaagca tccagggaaa 1381 gaatacttca aacattgtgc agaaaactga ggcttcctca agcttttgat ttctgtcact 1441 tagcacacct aactccaggc tttgttggtg ctgatctcat ggcactgtgc cgagaggcag 1501 caatgtgtgc agtcaataga gtcttaatga agctacagga acagcagaag aaaaatcctg 1561 aaatggaaga tttgccatct aaaggagtcc aggaggaaag gctgggaact gagcccactt 1621 ctgaaacaca ggatgaatta caaaggctgc tggggttgct aagagaccaa gatcccctct 1681 cagaggagca gatgcaagga ctgtgcattg aactgaatga tttcattgtt gctctatcct 1741 cagtccaacc ctctgccaaa agggaaggct ttgtcactgt ccctaatgtg acatgggcag 1801 atattggtgc cctggaagac attagagagg agctcaccat ggcaatattg gcaccagtac 1861 gcaacccaga ccagttcaaa gctcttggat tggtgactcc agctggggtc ctccttgctg 1921 gtcctcctgg ctgtgggaag actctgctgg cgaaggctgt tgcaaatgag tccggactaa 1981 attttatatc tgtcaagggc cccgaattac taaacatgta tgttggtgag agtgaacgtg 2041 ctgtgcgaca agtttttcaa cgagccaaga actcagcacc ctgtgtgata ttctttgatg 2101 aagtggatgc tttatgtcct cgaagatcag accgagagac aggggcaagt gtccgagtgg 2161 tgaatcagct acttacagag atggatggtc tggaagcacg ccagcaggtt tttattatgg 2221 cagccactaa caggccagat ataattgacc ctgcaatcct gcgcccgggc cgcctggaca 2281 aaacactgtt tgtgggttta ccgccccctg cagatcgcct tgccatctta aaaactatca 2341 caaaaaatgg taccaaacca ccactggatg cagatgtaaa tttggaagca attgctggtg 2401 accttcgctg tgattgctat acgggcgcag atctctctgc tttggtacga gaagcttcta 2461 tctgtgccct gagacaggaa atggcaagac agaagagtgg aaatgaaaaa ggtgaactca 2521 aggttagtca taagcatttt gaagaagctt tcaagaaagt aagatcatct atatcaaaaa 2581 aggatcaaat catgtatgaa cgtttgcagg agtccctcag ccggtgatgt ctccagcagc 2641 cggcttagag gagctagccc atcaagccgg cagagaatcc cccacacgct ctgaaggacc 2701 cactttcagc tggacacagg cgcggcctca tgtaaacatt ttattttcaa atgaatgagg 2761 ccaagctgaa gctgaagctg aagattcttc ccatctggcc agccttgtgt gaaaatgcct 2821 tcttcctctt taagaaaaag gacaattttt aaactgctga aataaaatga ctgttacatt 2881 tttcaaataa aacttttact ttgaact // LOCUS HSU68233 2218 bp mRNA PRI 16-SEP-1996 DEFINITION Human farnesol receptor HRR-1 (HRR-1) mRNA, complete cds. ACCESSION U68233 NID g1546083 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2218) AUTHORS Papetti,M., Wood,N, Lohmar,P.D. and Bowman,M.R. TITLE The Identification of the cDNA Coding for HRR-1, a Novel Human Farnesol Receptor JOURNAL Unpublished REFERENCE 2 (bases 1 to 2218) AUTHORS Papetti,M., Wood,N, Lohmar,P.D. and Bowman,M.R. TITLE Direct Submission JOURNAL Submitted (28-AUG-1996) Immunology and Hematopoiesis, Genetics Institute, 87 CambridgePark Drive, Cambridge, MA 02140, USA FEATURES Location/Qualifiers source 1..2218 /organism="Homo sapiens" /db_xref="taxon:9606" gene 354..1772 /gene="HRR-1" CDS 354..1772 /gene="HRR-1" /note="FXR; retinoid receptor" /codon_start=1 /product="farnesol receptor HRR-1" /db_xref="PID:g1546084" /translation="MGSKMNLIEHSHLPTTDEFSFSENLFGVLTEQVAGPLGQNLEVE PYSQYSNVQFPQVQPQISSSSYYSNLGFYPQQPEEWYSPGIYELRRMPAETLYQGETE VAEMPVTKKPRMGASAGRIKGDELCVVCGDRASGYHYNALTCEGCKGFFRRSITKNAV YKCKNGGNCVMDMYMRRKCQECRLRKCKEMGMLAECLLTEIQCKSKRLRKNVKQHADQ TVNEDSEGRDLRQVTSTTKSCREKTELTPDQQTLLHFIMDSYNKQRMPQEITNKILKE EFSAEENFLILTEMATNHVQVLVEFTKKLPGFQTLDHEDQIALLKGSAVEAMFLRSAE IFNKKLPSGHSDLLEERIRNSGISDEYITPMFSFYKSIGELKMTQEEYALLTAIVILS PDRQYIKDREAVEKLQEPLLDVLQKLCKIHQPENPQHFACLLGRLTELRTFNHHHAEM LMSWRVNDHKFTPLLCEIWDVQ" polyA_signal 2124..2129 BASE COUNT 741 a 423 c 458 g 596 t ORIGIN 1 acgagactct ctcctcctcc tcacctcatt gtctccccga cttatcctaa tgcgaaattg 61 gattctgagc atttgtagca aaatcgctgg gatctggaga ggaagactca gtccagaatc 121 ctcccagggc cttgaaagtc catctctgac ccaaaacaat ccaaggaggt agaagacatc 181 gtagaaggag tgaaagaaga aaagaagact tagaaacata gctcaaagtg aacactgctt 241 ctcttagttt cctggatttc ttctggacat ttcctcaaga tgaaacttca gacactttgg 301 agtttttttt gaagaccacc ataaagaaag tgcatttcaa ttgaaaaatt tggatgggat 361 caaaaatgaa tctcattgaa cattcccatt tacctaccac agatgaattt tctttttctg 421 aaaatttatt tggtgtttta acagaacaag tggcaggtcc tctgggacag aacctggaag 481 tggaaccata ctcgcaatac agcaatgttc agtttcccca agttcaacca cagatttcct 541 cgtcatccta ttattccaac ctgggtttct acccccagca gcctgaagag tggtactctc 601 ctggaatata tgaactcagg cgtatgccag ctgagactct ctaccaggga gaaactgagg 661 tagcagagat gcctgtaaca aagaagcccc gcatgggcgc gtcagcaggg aggatcaaag 721 gggatgagct gtgtgttgtt tgtggagaca gagcctctgg ataccactat aatgcactga 781 cctgtgaggg gtgtaaaggt ttcttcagga gaagcattac caaaaacgct gtgtacaagt 841 gtaaaaacgg gggcaactgt gtgatggata tgtacatgcg aagaaagtgt caagagtgtc 901 gactaaggaa atgcaaagag atgggaatgt tggctgaatg cttgttaact gaaattcagt 961 gtaaatctaa gcgactgaga aaaaatgtga agcagcatgc agatcagacc gtgaatgaag 1021 acagtgaagg tcgtgacttg cgacaagtga cctcgacaac aaagtcatgc agggagaaaa 1081 ctgaactcac cccagatcaa cagactcttc tacattttat tatggattca tataacaaac 1141 agaggatgcc tcaggaaata acaaataaaa ttttaaaaga agaattcagt gcagaagaaa 1201 attttctcat tttgacggaa atggcaacca atcatgtaca ggttcttgta gaattcacaa 1261 aaaagctacc aggatttcag actttggacc atgaagacca gattgctttg ctgaaagggt 1321 ctgcggttga agctatgttc cttcgttcag ctgagatttt caataagaaa cttccgtctg 1381 ggcattctga cctattggaa gaaagaattc gaaatagtgg tatctctgat gaatatataa 1441 cacctatgtt tagtttttat aaaagtattg gggaactgaa aatgactcaa gaggagtatg 1501 ctctgcttac agcaattgtt atcctgtctc cagatagaca atacataaag gatagagagg 1561 cagtagagaa gcttcaggag ccacttcttg atgtgctaca aaagttgtgt aagattcacc 1621 agcctgaaaa tcctcaacac tttgcctgtc tcctgggtcg cctgactgaa ttacggacat 1681 tcaatcatca ccacgctgag atgctgatgt catggagagt aaacgaccac aagtttaccc 1741 cacttctctg tgaaatctgg gacgtgcagt gatggggatt acaggggagg ggtctagctc 1801 ctttttctct ctcatattaa tctgatgtat aactttcctt tatttcactt gtacccagtt 1861 tcactcaaga aatcttgatg aatatttatg ttgtaattac atgtgtaact tccacaactg 1921 taaatattgg gctagataga acaactttct ctacattgtg ttttaaaagg ctccagggaa 1981 tcctgcattc taattggcaa gccctgtttg cctaattaaa ttgattgtta cttcaattct 2041 atctgttgaa ctagggaaaa tctcattttg ctcatcttac catattgcat atattttatt 2101 aaagagttgt attcaatctt ggcaataaag caaacataat ggcaacagaa aaaaaaaaaa 2161 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa // LOCUS HSU68566 1196 bp mRNA PRI 02-APR-1997 DEFINITION Human HS1 binding protein HAX-1 mRNA, nuclear gene encoding mitochondrial protein, complete cds. ACCESSION U68566 NID g1916621 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1196) AUTHORS Suzuki,Y., Demoliere,C., Kitamura,D., Takeshita,H., Deuschle,U. and Watanabe,T. TITLE HAX-1, a novel intracellular protein, localized on mitochondria, directly associates with HS1, a substrate of Src family tyrosine kinases JOURNAL J. Immunol. 158 (6), 2736-2744 (1997) MEDLINE 97211841 REFERENCE 2 (bases 1 to 1196) AUTHORS Suzuki,Y., Demoliere,C., Kitamura,D., Takeshita,H., Deuschle,U. and Watanabe,T. TITLE Direct Submission JOURNAL Submitted (30-AUG-1996) Molecular Immunology, Medical Institute of Bioregulation, Kyushu University, 3-1-1 Maidashi, Higashi-Ku, Fukuoka 812-82, Japan FEATURES Location/Qualifiers source 1..1196 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="uterine cervical adenocarcinoma" CDS 162..1001 /note="localized to the mitochondrial membrane; HS1 binding protein" /codon_start=1 /product="HAX-1" /db_xref="PID:g1916622" /translation="MSLFDLFRGFFGFPGPRSHRDPFFGGMTRDEDDDEEEEEEGGSW RRGNPRFHSPQHPPEEFGFGFSFSPGGGIRFHDNFGFDDLVRDFNSIFSDMGAWTLPS HPPELPGPESETPGERLREGQTLRDSMLKYPDSHQPRIFGGVLESDARSESPQPAPDW GSQRPFHRFDDVWPMDPHPRTREDNDLDSQVSQEGLGPVLTPQPKSYFKSISVTKITK PDGIVEERRTVVDSEGRTETTVTRHEADSSPRGDPESPRPPALDDAFSILDLFLGRWF RSR" BASE COUNT 289 a 315 c 315 g 277 t ORIGIN 1 aggtccggct taccgtcgtt tacgacagtg tcaggatcgc gggcttgctt tccggtagcg 61 tgggctgacg cctcgctcaa tttctcacag ggctgcgcag gtttcccccg tctgcgaatg 121 gaccactgga ggggttcaaa ggttcgcgtc ccagtacggg aatgagcctc tttgatctct 181 tccggggctt tttcggcttt cctggacctc ggagccacag agatcccttt tttggaggga 241 tgactcgaga tgaagatgat gatgaggaag aagaagaaga agggggctca tggcgccgtg 301 ggaacccaag gttccatagt cctcagcacc cccctgagga atttggcttc ggcttcagct 361 tcagcccagg aggagggata cgtttccacg ataacttcgg ctttgatgac ctagtacgag 421 atttcaatag catcttcagc gatatggggg cctggacctt gccttcccat cctcctgaac 481 ttccaggtcc tgagtcagag acacctggtg agagactacg ggagggacag acacttcggg 541 actcaatgct taagtatcca gatagtcacc agcccaggat ctttgggggg gtcttggaga 601 gtgatgcaag aagtgaatcc ccccaaccag caccagactg gggctcccag aggccatttc 661 ataggtttga tgatgtatgg cctatggacc cccatcctag aaccagagag gacaatgatc 721 ttgattccca ggtttcccag gagggtcttg gcccggttct aacgccccag cccaaatcct 781 atttcaagag catctctgtg accaagatca ctaaaccaga tgggatagtg gaggagcgcc 841 ggactgtggt ggacagtgag ggccggacag agactacagt aacccgacac gaagcagata 901 gcagtcctag gggtgatcca gaatcaccaa gacctccagc cctggatgat gccttttcca 961 tcctggactt attcctggga cgttggttcc ggtcccggta gccttgttaa ccctcagagg 1021 ccttcaagtc ctttccacct ctcacccatt gcccaccatt aataagctta gcttctcttg 1081 ccacctcagg ggcttggata tgtggaatag tgaactgggg ccatgtcagt ttgtcactca 1141 cccaaactga ccaataaaac ctttatttat gctaaaaaaa aaaaaaaaaa aaaaaa // LOCUS HSU68723 2667 bp mRNA PRI 25-MAY-1997 DEFINITION Human checkpoint suppressor 1 mRNA, complete cds. ACCESSION U68723 NID g2114391 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2667) AUTHORS Field,L.L., Tobias,R., Thomson,G. and Plon,S. TITLE Susceptibility to insulin-dependent diabetes mellitus maps to a locus (IDDM11) on human chromosome 14q24.3-q31 JOURNAL Genomics 33 (1), 1-8 (1996) MEDLINE 96207294 REFERENCE 2 (bases 1 to 2667) AUTHORS Pati,D., Keller,C., Groudine,M. and Plon,S.E. TITLE Reconstitution of a MEC1-independent checkpoint in yeast by expression of a novel human fork head cDNA JOURNAL Mol. Cell. Biol. 17 (6), 3037-3046 (1997) MEDLINE 97299653 REFERENCE 3 (bases 1 to 2667) AUTHORS Pati,D., Keller,C., Groudine,M. and Plon,S.E. TITLE Direct Submission JOURNAL Submitted (30-AUG-1996) Pediatrics, Baylor College of Medicine, 6621 Fannin St. MC3-3320, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..2667 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="14" /map="between 14q23-3 and 14q31" /cell_line="U118 glioblastoma" /tissue_lib="AA2M, ADANS vector library of J. Collicelli" CDS 133..1605 /note="similar to fork head; CHES1; suppresses multiple yeast checkpoint mutations including mec1, rad9, rad53 and dun1; reconstitutes the DNA damage G2 checkpoint in deficient yeast strains" /codon_start=1 /product="checkpoint suppressor 1" /db_xref="PID:g2114392" /translation="MGPVMPPSKKPESSGISVSSGLSQCYGGSGFSKALQEDDDLDFS LPDIRLEEGAMEDEELTNLNWLHESKNLLKSFGESVLRSVSPVQDLDDDTPPSPAHSD MPYDARQNPNCKPPYSFSCLIFMAIEDSPTKRLPVKDIYNWILEHFPYFANAPTGWKN SVRHNLSLNKCFKKVDKERSQSIGKGSLWCIDPEYRQNLIQALKKTPYHPHPHVFNTP PTCPQAYQSTSGPPIWPGSTFFKRNGALLQDPDIDAASAMMLLNTPPEIQAGFPPGVI QNGARVLSRGLFPGVRPLPITPIGVTAAMRNGITSCRMRTESEPSCGSPVVSGDPKED HNYSSAKSSNARSTSPTSDSISSSSSSADDHYEFATKGSQEGSEGSEGSFRSHESPSD TEEDDRKHSQKEPKDSLGDSGYASQHKKRQHFAKARKVPSDTLPLKKRRTEKPPESDD EEMKEAAGSLLHLAGIRSCLNNITNRTAKGQKEQKETTKN" BASE COUNT 743 a 699 c 667 g 558 t ORIGIN 1 gagggtgggt cgaccccggg aattcggcac gagcggcggc ggggccaccc gcgagtccag 61 cgtcgccgca gccccccaat gcggccgcga gaagcagcgg gggggcaggc gatcgaagga 121 gccttcacgt aaatgggtcc agtcatgcct cccagtaaga agccagaaag ctcaggaatt 181 agtgtctcca gtggactgag tcagtgttac gggggcagcg gtttctccaa ggcccttcag 241 gaagacgatg acctcgactt ttctctgcct gacatccgat tagaagaggg ggccatggaa 301 gatgaagagc tgaccaacct gaactggctg cacgagagca agaacttgct gaagagcttt 361 ggggagtcgg tcctcaggag tgtcagcccc gtccaggacc tggacgatga caccccccca 421 tcccctgccc actctgacat gccctacgat gccaggcaga accccaactg caaacccccc 481 tactccttca gctgcctcat atttatggcc atcgaggact ctccaaccaa gcgcctgcca 541 gtgaaggata tctacaactg gatcttggaa cattttccgt attttgcaaa tgcacctact 601 gggtggaaaa actcagtgag acacaattta tcattgaata agtgttttaa gaaagtggac 661 aaagagagga gtcagagtat tgggaaaggg tcgttgtggt gcatagaccc agagtataga 721 caaaatctaa ttcaggcttt gaaaaagaca ccttatcacc cacacccaca cgtgttcaat 781 acacctccca cctgtcctca ggcatatcaa agcacatcag gtccacccat ctggccgggc 841 agtaccttct tcaagagaaa tggagccctt ctccaagatc ctgacattga tgctgccagt 901 gccatgatgc ttttgaatac tccccctgag atacaagcag gttttcctcc aggagtgatc 961 caaaatggag cgcgggtcct gagccgaggg ctgtttcctg gcgtgcggcc gctgccaatc 1021 actcccattg gggtgacagc ggccatgagg aatggcatca ccagctgccg gatgcggact 1081 gagagtgagc catcttgtgg ctccccagtg gtcagcggag accccaagga ggatcacaac 1141 tacagcagtg ccaagtcctc caacgcccgg agcacctcgc ccaccagcga ctccatctcc 1201 tcctcctcct cctcagccga cgaccactat gagtttgcca ccaaggggag ccaggagggc 1261 agcgagggca gcgaggggag cttccggagc cacgagagcc ccagcgacac ggaagaggac 1321 gacaggaagc acagccagaa ggagcccaag gattctctgg gggacagcgg gtacgcatcc 1381 cagcacaaga agcgccagca cttcgccaag gccaggaagg tccccagcga cacactgccc 1441 ctcaaaaaga gacgcaccga aaagcccccc gagagcgatg atgaggagat gaaagaagcg 1501 gcagggtccc tcctgcactt agcagggatc cggtcctgtt tgaataacat caccaatcgg 1561 acggcaaagg ggcagaaaga gcaaaaggaa accacaaaaa attaaaaaca agtcactgat 1621 ttgttttgaa cttacgacca tttggtttca gcatgtcagg agatttctaa tgatttgtgg 1681 caatatcagc aatttttttt cttttttctt gtttggggtt tggttttctt tcttttcttt 1741 tccttttatt gggttttaat ttgccccctc ttctttgttt tggaccctta agaattttat 1801 ttttaaagga gattgaagcc atagaactca tattgacact cagctgtttt acaaaagctt 1861 ttcattatct gaagacaaaa ccgaaaaagc caaaattacc attgcttcct ccagcttgtc 1921 agaaacctgt ggctgaatcc gcagggatgt caacgtcaat atcacaggaa cacacattcg 1981 gcacctagaa ggcacgtggg caaagtaatc atcgttcagg cccaaccctt aggtttaaaa 2041 agtcaggttg tccatcccat tggggttcac tgagtgaagg cacataaagc aattgaggag 2101 gaggaggaac ccctcgtccc cctaggagca gacccaagct tgtggcacca ggcatctgat 2161 ggtgccagga aagccactgg aattgtcaca cggcgagcac agagggccgg ccaccagtcc 2221 tcgatgcttc tgaaccctga accccgatga catcttacga ggtggacgtt ggactgttca 2281 tgcgcatcgg gtgtcagtga ctcatggaga agaaatgggg taaattttta gtgatgttgc 2341 taatcattga attctgttct ctattaaatt aagaaaatgt tccaaaagcc ataagcctga 2401 agattggccc tgtgcacgca cgcacacaca cacacacaca cacacacaca cacacacaca 2461 cacgaaggag agagagagaa aactgatggg gaaaacaagc tgtgtcttct taactgccca 2521 agtgaaaagc aaccaagtcc aggaaattac aatagctgtt aaggaaagga aataatggta 2581 cagatctttt tctgtctatc aaaactattt gatccaagtg aaaaaaaaaa aaaaactaga 2641 aagctacgga acctgcaatg cggccgc // LOCUS HSU68727 3439 bp mRNA PRI 29-APR-1997 DEFINITION Human homeobox-containing protein mRNA, complete cds. ACCESSION U68727 NID g2052384 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3439) AUTHORS Chen,H., Rossier,C., Nakamura,Y., Lynn,A., Chakravarti,A. and Antonarakis,S.E. TITLE Cloning of a novel homeobox-containing gene, PKNOX1, and mapping to human chromosome 21q22.3 JOURNAL Genomics 41 (2), 193-200 (1997) MEDLINE 97288516 REFERENCE 2 (bases 1 to 3439) AUTHORS Chen,H., Rossier,C. and Antonarakis,S.E. TITLE Direct Submission JOURNAL Submitted (30-AUG-1996) Medical Genetics, Geneva University Medical School, 1 rue Michel-Servet, Geneva 1211, Switzerland FEATURES Location/Qualifiers source 1..3439 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="21" /map="21q22.3" CDS 86..1393 /codon_start=1 /product="homeobox-containing protein" /db_xref="PID:g2052385" /translation="MATQTLSIDSYQDGQQMQVVTELKTEQDPNCSEPDAEGVSPPPV ESQTPMDVDKQAIYRHPLFPLLALLFEKCEQSTQGSEGTTSASFDVDIENFVRKQEKE GKPFFCEDPETDNLMVKAIQVLRIHLLELEKVNELCKDFCSRYIACLKTKMNSETLLS GEPGSPYSPVQSQQIQSAITGTISPQGIVVPASALQQGNVAMATVAGGTVYQPVTVVT PQGQVVTQTLSPGTIRIQNSQLQLQLNQDLSILHQDDGSSKNKRGVLPKHATNVMRSW LFQHIGHPYPTEDEKKQIAAQTNLTLLQVNNWFINARRRILQPMLDSSCSETPKTKKK TAQNRPVQRFWADSIASGVAQPPPSELTMSEGAVVTITTPVNMNVDSLQSLSSDGATL AVQQVMMAGQSEDESVDSTEEDAGALAPAHISGLVLENSDSLQ" BASE COUNT 940 a 725 c 792 g 982 t ORIGIN 1 ggatccgcgt gaagattggc acccagacac cattcgcttt tcacccaaga tgatttgatg 61 tcttataaaa ctctgatgaa ccatgatggc tacacagaca ttaagtatag acagctatca 121 agatgggcaa cagatgcaag tagtaacaga gttaaagaca gaacaagatc caaactgctc 181 tgaacccgat gcagaaggag tgagccctcc ccctgtggag tctcagaccc cgatggatgt 241 ggacaagcag gccatttata ggcatccact atttccatta ttagctttgt tgtttgaaaa 301 atgtgaacaa tctacacagg gctctgaagg cacaacttct gccagttttg atgtagacat 361 cgaaaatttt gtaagaaagc aagagaagga agggaaacct ttcttttgtg aagatccaga 421 aaccgataat ttaatggtaa aagcaatcca ggttttgcgc attcatcttc ttgagctgga 481 aaaggttaac gaactctgca aagatttctg cagtcgatac attgcttgtc tgaaaacaaa 541 aatgaacagt gaaactctgt tgagtggaga gcctggaagc ccgtactcac cagtgcagtc 601 ccagcagatt caaagtgcca tcacaggcac catcagccct cagggaattg tggtgccggc 661 gtccgcgctg cagcagggaa acgtagccat ggcgacggtg gcaggtggca cagtgtatca 721 gcctgtcacg gtcgtcactc cccaaggcca agtggtcaca cagacattgt cgcctgggac 781 aattaggatc cagaactccc agcttcagtt acagttaaac caagatctca gcatcttgca 841 tcaagatgat ggttcatcta agaacaagag gggcgtcctg ccaaagcatg ccacgaacgt 901 gatgcggtcc tggctcttcc agcacatcgg gcatccctac ccaacagagg atgagaaaaa 961 acagattgct gctcagacaa atttgacact actccaagtc aacaactggt tcatcaatgc 1021 cagaagacga attcttcagc caatgttgga ttcaagttgt tcagagaccc ccaaaacaaa 1081 gaaaaaaact gctcagaacc ggccagttca gaggttttgg gctgattcta ttgcatcagg 1141 agtcgcacag ccaccgccga gcgagctcac catgtcggaa ggagctgttg tcaccatcac 1201 cacgcccgtg aacatgaacg tggacagcct tcagtctctg tcctcggacg gggccaccct 1261 ggcggtgcag caggtcatga tggcagggca gagcgaggac gagtctgtgg acagcacaga 1321 ggaggatgcg ggtgccctgg cccctgccca catcagcggg ctggtcttgg agaacagtga 1381 ctccctgcag taggggcagg agcagacgca cctgactttt tggagtttgc acagcaaaca 1441 ttttacacag ttttatttct aatatgtttt atatgtagat atagaagagt gcacttttgt 1501 atttcatagt aagcttaaag cgcgtctttg ccggtgcagc gacttctttc aagtgtgtgt 1561 gtgtgtgtgt gtgcgcgtgt gtgcgtgtgt gtggattttt aaagaaattc tttaaaggtt 1621 taacgctaga ttgtgaggaa tgacacacca ctccctcccc accttgaatc cctaattaga 1681 ttaaggaata gcgctgccat tttctaaacc gtgatgcggt tgtcacttag ttctgtggtt 1741 ccagcagatc tcagtgggct ggttgatctt gtgtggccca tggatttgaa agaagctgct 1801 gcacccgaaa ctgccagtgt gcggtgacaa cggcacacgc ctagactgag tgtggtttcg 1861 tcgtgagtgg atggacggca agcttagcaa gcctaagtcc cctcatgttc agtgagcctg 1921 tttcatttgc tatatagaaa aagaaactcc tatttttacc ttgctggaat tattggataa 1981 aaaagctatt tttataaatt cgttatgaat tggatgatga ctatattgag gataaaattt 2041 ctagagaaga aacaatacat gcttgctatt aatatttcaa tttggaatgt tctgaattga 2101 ccaaatttaa cgaacctgcc caaagttagc taccgttcca tggttctttg ctctccccgg 2161 gtagtgatga acatttacta ctataaaaga aacagctatt taatgaaatt ttgatatctg 2221 caaatttttg ttgatatgta atgctcagat tgcattttac acttgatcta aacatatatc 2281 gaaagatatc tgctaaacag gacttcaggt aagtgaggtg aaatggtagc cagtgacccg 2341 ttaggagctc tcaccgtaca tactccagtc taatttaaat ctgaccacag ttgcatggtc 2401 gctttaccat gttagctgtg tattgtttta aaagttttaa cttcaaaata tgttatgcac 2461 agaatgttta ttataaacta atataaaatg tgctctaccc cattggctca gagctagggc 2521 aaacagcaga tattcagact ttattactta actagcggac atccctggag tcccagcagc 2581 gagttggtct ggcgagggca cctcggcagc ccccacgggt tggctcctac gtttgcgttt 2641 gtggctggtc tcctggtgtc agtgttctct tgtacgttgt tgctttcgac ttttcagagc 2701 cctcctgctc acttgaccac gtgagatttg gaataactgt aggacttctg tttcctggta 2761 acaagatgaa ccgagagagt gggctgggtt ctgttttctt tggtttcgat ttgttttcat 2821 tgtttactta ggagtggtgc tttttctcag aaaacaggcc acggtgtttc atacagaatg 2881 tcttcatatc atctgaaatg gtatggctga agttcatttg tttacagggt cgggaatgtc 2941 ttcagtttct tgagagtcaa cagtaatgat tggttgtaag ccaagggaca ttttaagcta 3001 gtgaagagtt ttttctggaa ttgatttttc ccaaaagaat atattaattg aggttaagaa 3061 gtcagtggga aacacacaga aatttatttt aaaatctttc aggagcttta actgaaagac 3121 ttggttatca agtcttttgg ggagagaatg acattttttt tttgagacag agttttgctc 3181 ttgttgccca ggctggaggg caatggtgcc cccgctcccc acccttgaat tatgcacatc 3241 tcatggtttt ttttgctgct ttttttttca aaagcagctt tactgagata taatttatat 3301 tctataaaat tcacctgttt caagtataca attcaatgac tttgagtaaa tgtacagcgt 3361 tgtgtgacca tcgccacaat ctaattttag aacattttca tcacagcaaa aatgatccct 3421 tgtacccatt ccggaattc // LOCUS HSU69161 1050 bp mRNA PRI 15-NOV-1997 DEFINITION Homo sapiens CC3 (CC3) mRNA, complete cds. ACCESSION U69161 NID g2618732 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1050) AUTHORS Shtivelman,E. TITLE A link between metastasis and resistance to apoptosis of variant small cell lung carcinoma JOURNAL Oncogene 14 (18), 2167-2173 (1997) MEDLINE 97316778 REFERENCE 2 (bases 1 to 1050) AUTHORS Shtivelman,E. TITLE Direct Submission JOURNAL Submitted (03-SEP-1996) G. Brush Cancer Research Inst., California Pacific Medical Center, 2330 Clay Street, San Francisco, CA 94115, USA FEATURES Location/Qualifiers source 1..1050 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /cell_line="H146" gene 1..1050 /gene="CC3" CDS 99..827 /gene="CC3" /note="no assigned function; has properties of metastasis-suppressor for variant small cell lung carcinoma (SCLC)" /codon_start=1 /product="CC3" /db_xref="PID:g2618733" /translation="MAETEALSKLREDFRMQNKSVFILGASGETGRVLLKEILEQGLF SKVTLIGRRKLTFDEEAYKNVNQEVVDFEKLDDYASAFQGHDVGFCCLGTTRGKAGAE GFVRVDRDYVLKSAELAKAGGCKHFNLLSSKGADKSSNFLYLQVKGEVEAKVEELKFD RYSVFRPGVLLCDRQESRPGEWLVRKLFGSLPDSWARGHSVPVVTVVRAMLNNVVRPR DKQMELLENKAIHDLGKAHGSLKP" BASE COUNT 289 a 219 c 272 g 270 t ORIGIN 1 gtcctcccta acagataaac agcccttgtt cctcgggata aggactggca gtcccctgac 61 accctaagac cggcatctgt cgatgttatt tccccagcat ggccgaaaca gaagccctgt 121 cgaagcttcg ggaagacttc aggatgcaga ataaatccgt ctttattttg ggcgccagcg 181 gagaaaccgg cagagtgctc ttaaaggaaa tcctggagca gggcctgttt tccaaagtca 241 cgctcattgg gcggaggaag ctcaccttcg acgaggaagc ttataaaaat gtgaatcaag 301 aagtggtgga ctttgaaaag ttggatgact acgcctctgc ctttcaaggt catgatgttg 361 gattctgttg cctgggtacc accagaggga aagctggggc ggagggattt gttcgtgttg 421 accgagatta tgtgctgaag tctgcagagc tggcaaaagc tggagggtgc aaacatttca 481 acttgctatc ctctaaagga gctgataaat caagcaattt tttatatcta caagttaagg 541 gagaagtaga agccaaggtt gaagaattaa aatttgatcg ttactctgta tttaggcctg 601 gagttctgtt atgtgatagg caagaatctc gcccaggtga atggctggtt agaaagctct 661 ttggctcctt accagactct tgggccaggg ggcattctgt gcctgtggtg accgtggtta 721 gagcaatgct gaacaatgtg gtgagaccaa gagacaagca gatggaactg ctggagaaca 781 aggccatcca tgacctgggg aaagcgcatg gctctctcaa gccatgacca cattggagaa 841 atggttttta ttgtcaacct taacacccat caccaaatcg gtaatttcag ggtctaaaaa 901 aagtcagcat gttttaactt tgttgtttta ctatcctcag gcatccattc caatcaagaa 961 atgatggtgc tctgcatcag tggttcagag cctggttata catatagatc actcagggag 1021 ctttggagaa ataaagattt gtcagcccaa // LOCUS HSU69611 3014 bp mRNA PRI 09-APR-1997 DEFINITION Human TNF-alpha converting enzyme mRNA, complete cds. ACCESSION U69611 NID g1858021 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3014) AUTHORS Black,R.A., Rauch,C.T., Kozlosky,C.J., Peschon,J.J., Slack,J.L., Wolfson,M.F., Castner,B.J., Stocking,K.L., Reddy,P., Srinivasan,S., Nelson,N., Bioani,N., Schooley,K.A., Gerhart,M., Davis,R., Fitzner,J.N., Johnson,R.S., Paxton,R.J., March,C.J. and Cerretti,D.P. TITLE A metalloproteinase disintegrin that releases tumour-necrosis factor-alpha from cells JOURNAL Nature 385 (6618), 729-733 (1997) MEDLINE 97186574 REFERENCE 2 (bases 1 to 3014) AUTHORS Cerretti,D.P. TITLE Direct Submission JOURNAL Submitted (06-SEP-1996) Molecular Biology, Immunex Corp., 51 University St., Seattle, WA 98101, USA FEATURES Location/Qualifiers source 1..3014 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 115..2589 /note="transmembrane metalloproteinase/disintegrin; adamalysin; TACEA" /codon_start=1 /product="TNF-alpha converting enzyme" /db_xref="PID:g1858022" /translation="MRQSLLFLTSVVPFVLAPRPPDDPGFGPHQRLEKLDSLLSDYDI LSLSNIQQHSVRKRDLQTSTHVETLLTFSALKRHFKLYLTSSTERFSQNFKVVVVDGK NESEYTVKWQDFFTGHVVGEPDSRVLAHIRDDDVIIRINTDGAEYNIEPLWRFVNDTK DKRMLVYKSEDIKNVSRLQSPKVCGYLKVDNEELLPKGLVDREPPEELVHRVKRRADP DPMKNTCKLLVVADHRFYRYMGRGEESTTTNYLIELIDRVDDIYRNTSWDNAGFKGYG IQIEQIRILKSPQEVKPGEKHYNMAKSYPNEEKDAWDVKMLLEQFSFDIAEEASKVCL AHLFTYQDFDMGTLGLAYVGSPRANSHGGVCPKAYYSPVGKKNIYLNSGLTSTKNYGK TILTKEADLVTTHELGHNFGAEHDPDGLAECAPNEDQGGKYVMYPIAVSGDHENNKMF SNCSKQSIYKTIESKAQECFQERSNKVCGNSRVDEGEECDPGIMYLNNDTCCNSDCTL KEGVQCSDRNSPCCKNCQFETAQKKCQEAINATCKGVSYCTGNSSECPPPGNAEDDTV CLDLGKCKDGKCIPFCEREQQLESCACNETDNSCKVCCRDLSGRCVPYVDAEQKNLFL RKGKPCTVGFCDMNGKCEKRVQDVIERFWDFIDQLSINTFGKFLADNIVGSVLVFSLI FWIPFSILVHCVDKKLDKQYESLSLFHPSNVEMLSSMDSASVRIIKPFPAPQTPGRLQ PAPVIPSAPAAPKLDHQRMDTIQEDPSTDSHMDEDGFEKDPFPNSSTAAKSFEDLTDH PVTRSEKAASFKLQRQNRVDSKETEC" misc_feature 2128..2296 /note="encodes transmembrane domain" BASE COUNT 886 a 595 c 735 g 798 t ORIGIN 1 ggattgaggg gctaggccgg gcggatcccg tcctcccccg atgtgagcag ttttccgaaa 61 ccccgtcagg cgaaggctgc ccagagaggt ggagtcggta gcggggccgg gaacatgagg 121 cagtctctcc tattcctgac cagcgtggtt cctttcgtgc tggcgccgcg acctccggat 181 gacccgggct tcggccccca ccagagactc gagaagcttg attctttgct ctcagactac 241 gatattctct ctttatctaa tatccagcag cattcggtaa gaaaaagaga tctacagact 301 tcaacacatg tagaaacact actaactttt tcagctttga aaaggcattt taaattatac 361 ctgacatcaa gtactgaacg tttttcacaa aatttcaagg tcgtggtggt ggatggtaaa 421 aacgaaagcg agtacactgt aaaatggcag gacttcttca ctggacacgt ggttggtgag 481 cctgactcta gggttctagc ccacataaga gatgatgatg ttataatcag aatcaacaca 541 gatggggccg aatataacat agagccactt tggagatttg ttaatgatac caaagacaaa 601 agaatgttag tttataaatc tgaagatatc aagaatgttt cacgtttgca gtctccaaaa 661 gtgtgtggtt atttaaaagt ggataatgaa gagttgctcc caaaagggtt agtagacaga 721 gaaccacctg aagagcttgt tcatcgagtg aaaagaagag ctgacccaga tcccatgaag 781 aacacgtgta aattattggt ggtagcagat catcgcttct acagatacat gggcagaggg 841 gaagagagta caactacaaa ttacttaata gagctaattg acagagttga tgacatctat 901 cggaacactt catgggataa tgcaggtttt aaaggctatg gaatacagat agagcagatt 961 cgcattctca agtctccaca agaggtaaaa cctggtgaaa agcactacaa catggcaaaa 1021 agttacccaa atgaagaaaa ggatgcttgg gatgtgaaga tgttgctaga gcaatttagc 1081 tttgatatag ctgaggaagc atctaaagtt tgcttggcac accttttcac ataccaagat 1141 tttgatatgg gaactcttgg attagcttat gttggctctc ccagagcaaa cagccatgga 1201 ggtgtttgtc caaaggctta ttatagccca gttgggaaga aaaatatcta tttgaatagt 1261 ggtttgacga gcacaaagaa ttatggtaaa accatcctta caaaggaagc tgacctggtt 1321 acaactcatg aattgggaca taattttgga gcagaacatg atccggatgg tctagcagaa 1381 tgtgccccga atgaggacca gggagggaaa tatgtcatgt atcccatagc tgtgagtggc 1441 gatcacgaga acaataagat gttttcaaac tgcagtaaac aatcaatcta taagaccatt 1501 gaaagtaagg cccaggagtg ttttcaagaa cgcagcaata aagtttgtgg gaactcgagg 1561 gtggatgaag gagaagagtg tgatcctggc atcatgtatc tgaacaacga cacctgctgc 1621 aacagcgact gcacgttgaa ggaaggtgtc cagtgcagtg acaggaacag tccttgctgt 1681 aaaaactgtc agtttgagac tgcccagaag aagtgccagg aggcgattaa tgctacttgc 1741 aaaggcgtgt cctactgcac aggtaatagc agtgagtgcc cgcctccagg aaatgctgaa 1801 gatgacactg tttgcttgga tcttggcaag tgtaaggatg ggaaatgcat ccctttctgc 1861 gagagggaac agcagctgga gtcctgtgca tgtaatgaaa ctgacaactc ctgcaaggtg 1921 tgctgcaggg acctttccgg ccgctgtgtg ccctatgtcg atgctgaaca aaagaactta 1981 tttttgagga aaggaaagcc ctgtacagta ggattttgtg acatgaatgg caaatgtgag 2041 aaacgagtac aggatgtaat tgaacgattt tgggatttca ttgaccagct gagcatcaat 2101 acttttggaa agtttttagc agacaacatc gttgggtctg tcctggtttt ctccttgata 2161 ttttggattc ctttcagcat tcttgtccat tgtgtggata agaaattgga taaacagtat 2221 gaatctctgt ctctgtttca ccccagtaac gtcgaaatgc tgagcagcat ggattctgca 2281 tcggttcgca ttatcaaacc ctttcctgcg ccccagactc caggccgcct gcagcctgcc 2341 cctgtgatcc cttcggcgcc agcagctcca aaactggacc accagagaat ggacaccatc 2401 caggaagacc ccagcacaga ctcacatatg gacgaggatg ggtttgagaa ggaccccttc 2461 ccaaatagca gcacagctgc caagtcattt gaggatctca cggaccatcc ggtcaccaga 2521 agtgaaaagg ctgcctcctt taaactgcag cgtcagaatc gtgttgacag caaagaaaca 2581 gagtgctaat ttagttctca gctcttctga cttaagtgtg caaaatattt ttatagattt 2641 gacctacaat caatcacagc ttatattttg tgaagactgg gaagtgactt agcagatgct 2701 ggtcatgtgt ttgaacttcc tgcaggtaaa cagttcttgt gtggtttggc ccttctcctt 2761 ttgaaaaggt aaggtgaagg tgaatctagc ttattttgag gctttcaggt tttagttttt 2821 aaaatatctt ttgacctgtg gtgcaaaagc agaaaataca gctggattgg gttatgagta 2881 tttacgtttt tgtaaattaa tcttttatat tgataacagc actgactagg gaaatgatca 2941 gttttttttt tatacactgt aatgaaccgc tgaatatgag gcatttggca tttatttgtg 3001 atgacaactg gaat // LOCUS HSU69645 1140 bp mRNA PRI 02-OCT-1996 DEFINITION Human zinc finger protein mRNA, complete cds. ACCESSION U69645 NID g1575614 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1140) AUTHORS Drew,P.D., Gado,A.M., Nagle,J.W., Dehejia,A.M., Polymeropoulos,M.H., Biddison,W.E., Jacobson,S. and Becker,K.G. TITLE C2H2-546: A zinc finger protein differentially expressed in HTLV-1 infected T cells JOURNAL Unpublished REFERENCE 2 (bases 1 to 1140) AUTHORS Drew,P.D., Gado,A.M., Nagle,J.W., Dehejia,A.M., Polymeropoulos,M.H., Biddison,W.E., Jacobson,S. and Becker,K.G. TITLE Direct Submission JOURNAL Submitted (05-SEP-1996) Neuroimmunology Branch, NIH, Bldg 10, Rm 5B16, Bethesda, MD 20892 FEATURES Location/Qualifiers source 1..1140 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="10" /map="10q11.2" CDS 123..944 /note="C2H2 type zinc finger" /codon_start=1 /product="zinc finger protein" /db_xref="PID:g1575615" /translation="MFGFPTATLLDCHGRYAQNVAFFNVMTEAHHKYDHSEATGSSSW DIQNSFRREKLEQKSPDSKTLQEDSPGVRQRVYECQECGKSFRQKGSLTLHERIHTGQ KPFECTHCGKSFRAKGNLVTHQRIHTGEKPYQCKECGKSFSQRGSLAVHERLHTGQKP YECAICQRSFRNQSNLAVHRRVHSGEKPYRCDQCGKAFSQKGSLIVHIRVHTGLKPYA CTQCRKSFHTRGNCILHGKIHTGETPYLCGQCGKSFTQRGSLAVHQRSCSQRLTL" BASE COUNT 343 a 252 c 290 g 254 t 1 others ORIGIN 1 ctggcgctgc tggggctcgg cgncggcctt tgtctgcggg cacggccgct gcggtgctca 61 ggaacagccc atggaagaat catatgaaga ggtggtgact gaggtcgtaa gcaggagtgg 121 acatgtttgg atttccaaca gctaccctgc tggactgtca tggaagatat gcccagaatg 181 tagcgttctt caatgtgatg actgaagccc accacaaata tgaccactct gaggctacag 241 gatcctcaag ctgggatatc caaaattctt tcagaagaga gaagctggaa caaaaatccc 301 cagattcgaa gacactacag gaagattcac ctggagtgag acaaagggtc tatgagtgcc 361 aggagtgtgg aaaatccttc cggcaaaaag gtagtctaac gttacatgag agaatccaca 421 ctggtcaaaa gccttttgag tgcacccact gtggaaaaag cttcagggcc aaaggcaatc 481 ttgttacaca tcaacggata cacacgggag agaagcctta tcagtgcaag gagtgtggga 541 aaagcttcag tcaacgaggt agtctcgctg tccacgagag actccacact ggacagaaac 601 cctacgagtg tgctatttgt cagagaagct tcaggaatca gagtaacctt gctgttcaca 661 ggagagttca cagtggtgag aagccctata gatgtgatca gtgtggaaaa gccttcagtc 721 agaaaggaag cttaattgtt cacatcagag tccacacagg cctgaagccc tatgcctgta 781 cccagtgcag gaagagtttc cacaccaggg ggaattgtat tctgcatggc aaaatccaca 841 caggagagac accctatctg tgcggccagt gtggaaaaag cttcacccag agagggagtc 901 tggctgtgca ccagcgaagc tgctcacaga ggctcaccct ttgaccactt tcctgaagag 961 aagttctctt tatgaattaa gagtacaaaa tcctctgaga tgaagcaacc tatccagttc 1021 tatggaatga atggagaatc tttcagaaag accatcattg ggtagggcaa actgattttt 1081 ttcctttccc ccaaaagagt atgaaaaata aaggtcttgt ttattatcat taaaaaaaaa // LOCUS HSU69883 1686 bp mRNA PRI 02-OCT-1996 DEFINITION Human calcium-activated potassium channel hSK1 (SK) mRNA, complete cds. ACCESSION U69883 NID g1575660 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1686) AUTHORS Kohler,M., Hirschberg,B., Bond,C.T., Kinzie,J.M., Marrion,N.V., Maylie,J. and Adelman,J.P. TITLE Small-conductance, calcium-activated potassium channels from mammalian brain JOURNAL Science 273 (5282), 1709-1714 (1996) MEDLINE 96376602 REFERENCE 2 (bases 1 to 1686) AUTHORS Bond,C.T., Maylie,J. and Adelman,J.P. TITLE Direct Submission JOURNAL Submitted (09-SEP-1996) Vollum Institute, Oregon Health Sciences University, 3181 SW Sam Jackson Park Road, Portland, OR 97201-3098, USA FEATURES Location/Qualifiers source 1..1686 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" gene 1..1686 /gene="SK" CDS 1..1686 /gene="SK" /codon_start=1 /product="calcium-activated potassium channel hSK1" /db_xref="PID:g1575661" /translation="MPGPRAACSEPNPCTQVVMNSHSYNGSVGRPLGSGPGALGRDPP DPEAGHPPQPPHSPGLQVVVAKSEPARPSPGSPRGQPQDQDDDEDDEEDEAGRQRASG KPSNVGHRLGHRRALFEKRKRLSDYALIFGMFGIVVMVTETELSWGVYTKESLYSFAL KCLISLSTAILLGLVVLYHAREIQLFMVDNGADDWRIAMTCERVFLISLELAVCAIHP VPGHYRFTWTARLAFTYAPSVAEADVDVLLSIPMFLRLYLLGRVMLLHSKIFTDASSR SIGALNKITFNTRFVMKTLMTICPGTVLLVFSISSWIIAAWTVRVCERYHDKQEVTSN FLGAMWLISITFLSIGYGDMVPHTYCGKGVCLLTGIMGAGCTALVVAVVARKLELTKA EKHVHNFMMDTQLTKRVKNAAANVLRETWLIYKHTRLVKKPDQARVRKHQRKFLQAIH QAQKLRSVKIEQGKLNDQANTLTDLAKTQTVMYDLVSELHAQHEELEARLATLESRLD ALGASLQALPGLIAQAIRPPPPPLPPRPGPGPQDQAARSSPCRWTPVAPSDCG" BASE COUNT 320 a 583 c 502 g 281 t ORIGIN 1 atgccgggtc cccgggcggc ctgcagcgag cccaacccct gcacccaggt agtcatgaac 61 agccacagct acaatggcag cgtggggcgg ccgctgggca gcgggccggg cgccctggga 121 cgagaccctc cggaccctga ggccggccac cccccacaac ccccgcacag cccgggcctc 181 caggtggtag tggccaagag tgagccagcc cggccctcac ccggcagccc ccgggggcag 241 ccccaggacc aggacgatga cgaggatgat gaggaagatg aggccggcag gcagagagcc 301 tcggggaaac cctcaaatgt gggccaccgc ctgggccacc ggcgggcgct cttcgagaag 361 cggaagcgcc tcagcgacta tgccctcatt ttcggcatgt ttggcatcgt cgtcatggtg 421 acggagaccg agctgtcctg gggggtgtac accaaggagt ctctgtactc attcgcactc 481 aaatgcctca tcagcctctc cacggccatc ctgctgggtc tcgttgtcct ctaccatgcc 541 cgggagatcc agctgttcat ggtggacaac ggggctgatg actggcgcat cgccatgacc 601 tgcgagcgcg tgttcctcat ctcgctagag ctggcagtgt gcgccattca cccggtgccc 661 ggccactacc gcttcacgtg gacggcgcgg ctggccttca cgtacgcgcc ctcggtggcc 721 gaggccgacg tggacgtgct gctgtccatc cccatgttcc tgcgcctcta cctgctgggc 781 cgggtgatgc tactgcacag caaaatcttc acggacgcct cgagccgcag catcggggcc 841 ctcaacaaga tcaccttcaa cacgcgcttc gtcatgaaga cactcatgac catctgcccc 901 ggcaccgtgc tgctggtctt cagcatctcc tcctggatca tcgcagcctg gaccgtgcgc 961 gtctgcgaga ggtaccacga caagcaggaa gtgaccagca acttcctggg ggccatgtgg 1021 ctgatttcca tcaccttcct ctccattggc tacggcgaca tggtgcccca cacctactgc 1081 gggaagggtg tgtgcctgct cactggcatc atgggagctg gctgtaccgc gctcgtggtg 1141 gctgtggtgg ctcggaagct ggagctcacc aaggctgaga agcacgtgca caacttcatg 1201 atggacactc agctcaccaa gcgggtaaaa aacgccgctg ctaacgttct cagggagacg 1261 tggctcatct acaaacatac caggctggtg aagaagccag accaagcccg ggttcggaaa 1321 caccagcgta agttcctcca agccatccat caggctcaga agctccggag tgtgaagatc 1381 gagcaaggga agctgaacga ccaggctaac acgcttaccg acctagccaa gacccagacc 1441 gtcatgtacg accttgtatc ggagctgcac gctcagcacg aggagctgga ggcccgcctg 1501 gccaccctgg aaagccgctt ggatgcgctg ggtgcctctc tacaggccct gcctggcctc 1561 atcgcccaag ccatacgccc acccccgcct cccctgcctc ccaggcccgg ccccggcccc 1621 caagaccagg cagcccggag ctccccctgc cggtggacgc ccgtggcccc ctcggactgc 1681 gggtga // LOCUS HSU69961 2125 bp mRNA PRI 19-DEC-1996 DEFINITION Human solurshin (RGS) mRNA, complete cds. ACCESSION U69961 NID g1737166 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2125) AUTHORS Semina,E.V., Reiter,R., Leysens,N.J., Alward,W.L.M., Small,K.W., Datson,N.A., Siegle-Bartelt,J., Bierke-Nelson,D., Bitoun,P., Zabel,B.U., Carey,J.C. and Murray,J.C. TITLE Cloning and characterization of a novel bicoid-related homeobox transcription factor gene, RIEG, involved in Rieger syndrome JOURNAL Nature Genet. 14 (4), 392-399 (1996) MEDLINE 97099449 REFERENCE 2 (bases 1 to 2125) AUTHORS Semina,E.V. TITLE Direct Submission JOURNAL Submitted (06-SEP-1996) Pediatrics, University of Iowa, 140 Eckstein Medical Research Building, Iowa City, IA 52242, USA FEATURES Location/Qualifiers source 1..2125 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="4" /map="4q25" gene 584..1399 /gene="RGS" CDS 584..1399 /gene="RGS" /note="Rieger syndrome; bicoid-related homeodomain protein; transcription factor" /codon_start=1 /product="solurshin" /db_xref="PID:g1737167" /translation="METNCRKLVSACLQLEKDKSQQGKNEDVGAEDPSKKKRQRRQRT HFTSQQLQQLEATFQRNRYPDMSTREEIAVWTNLTEARVRVWFKNRRAKWRKRERNQQ AELCKNGFGPQFNGLMQPYDDMYPGYSYNNWAAKGLTSASLSTKSFPFFNSMNVNPLS SQSMFSPPNSISSMSMSSSMVPSAVTGVPGSSLNSLNNLNNLSSPSLNSAVPTPACPY APPTPPYVYRDTCNSSLASLRLKAKQHSSFGYASVQKPASNLSACQYAVDRPV" BASE COUNT 548 a 560 c 548 g 469 t ORIGIN 1 tgggagtccg tgctcctgct cctcggttgg ctcctaagtg ccccgccagg tcccctctcc 61 tttcgctctc ccggctccgg ctcccgactc ttcggcccgc tggcatctgc ttccctcccc 121 tgcctcgttt ctcgtcgccc ctgctcgctc cccccggcgc tcgcccgggc gctgtgctcg 181 ctcctggatc gccagccgcg cagccgggct cggccggccg cccgcgcgcc actgtgcagt 241 ggagtttggt ggaatctctg ctgacgtcac gtcactcccc acacggagta ggagcagagg 301 gaagagagag ggatgagagg gagggagagg agagagagtg cgagaccgag cgagaaagct 361 ggagaggagc agaaagaaac tgccagtggc ggctagattt cggaggcccc agtgcacccg 421 tggactcctt cggaacttgg caccctcagg agccctgcag tcctctcagg cccggctttc 481 gggcgcttgc cgtgcagccg gaggctcggc tcgctggaaa tcgccccggg aagcagtggg 541 acgcggagac agcagctctc tcccggtagc cgataacggg gaaatggaga ccaactgccg 601 caaactggtg tcggcgtgtc tgcaattaga gaaagataaa agccagcagg ggaagaatga 661 ggacgtgggc gccgaggacc cgtctaagaa gaagcggcaa aggcggcagc ggactcactt 721 taccagccag cagctccagc agctggaggc cactttccag aggaaccgct acccggacat 781 gtccacacgc gaagaaatcg ctgtgtggac caaccttacg gaagcccgag tccgggtttg 841 gttcaagaat cgtcgggcca aatggagaaa gagggagcgc aaccagcagg ccgagctatg 901 caagaatggc ttcgggccgc agttcaatgg gctcatgcag ccctacgacg acatgtaccc 961 aggctattcc tacaacaact gggccgccaa gggccttaca tccgcctccc tatccaccaa 1021 gagcttcccc ttcttcaact ctatgaacgt caaccccctg tcatcacaga gcatgttttc 1081 cccacccaac tctatctcgt ccatgagcat gtcgtccagc atggtgccct cagcagtgac 1141 aggcgtcccg ggctccagtc tcaacagcct gaataacttg aacaacctga gtagcccgtc 1201 gctgaattcc gcggtgccga cgcctgcctg tccttacgcg ccgccgactc ctccgtatgt 1261 ttatagggac acgtgtaact cgagcctggc cagcctgaga ctgaaagcaa agcagcactc 1321 cagcttcggc tacgccagcg tgcagaaacc ggcctccaac ctgagtgctt gccagtatgc 1381 agtggaccgg cccgtgtgag ccgcacccac agcgccggga tcctaggacc ttgccggatg 1441 gggcaactcc gcccttgaaa gactgggaat tatgctagaa ggtcgtgggc actaaagaaa 1501 gggagagaaa gagaagctat atagagaaaa ggaaaccact gaatcaaaga gagagctcct 1561 ttgatttcaa agggatgtcc tcagtgtctg acatctttca ctacaagtat ttctaacagt 1621 tgcaaggaca catacacaaa caaatgtttt gactggatat gacattttaa cattactata 1681 agcttgttat tttttaagtt tagcattgtt aacatttaaa tgactgaaag gatgtatata 1741 tatcgaaatg tcaaattaat tttataaaag cagttgttag taatatcaca acagtgtttt 1801 taaaggttag gctttaaaat aaagcatgtt atacagaagc gattaggatt tttcgcttgc 1861 gagcaaggga gtgtatatac taaatgccac actgtatgtt tctaacatat tattattatt 1921 ataaaaaatg tgtgaatatc agttttagaa tagtttgtgt ggtggatgca atgatgtttc 1981 tgaaactgct atgtacaacc taccctgtgt ataacatttc gtacaatatt attgttttac 2041 ttttcagcaa atatgaaaca aatgtgtttt attttcatgg gagtaaaata tactgcatac 2101 aaaaaaaaaa aaaaaaaaaa aaaaa // LOCUS HSU69962 2435 bp mRNA PRI 17-SEP-1996 DEFINITION Human delayed rectifier potassium channel protein mRNA, complete cds. ACCESSION U69962 NID g1546838 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2435) AUTHORS Schmalz,F.M. Kinsella,J. Vogalis,F., Schneider,A., Flynn,E., Kenyon,J.L. and Horowitz,B. TITLE Direct Submission JOURNAL Submitted (06-SEP-1996) Physiology, UNR-School of Medicine, 107 Anderson Building, Reno, NV 89557, USA FEATURES Location/Qualifiers source 1..2435 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 3..2423 /codon_start=1 /product="delayed rectifier potassium channel protein" /db_xref="PID:g1546839" /translation="MAEKAPPGLNRKTSRSTLSLPPEPVDIIRSKTCSRRVKINVGGL NHEVLWRTLDRLPRTRLGKLRDCNTHESLLEVCDDYNLNENEYFFDRHPGAFTSILNF YRTGKLHMMEEMCALSFGQELDYWGIDEIYLESCCQARYHQKKEQMNEELRREAETMR DGEGEEFDNTCCPDKRKKLWDLLEKPNSSVAAKILAIVSILFIVLSTIALSLNTLPEL QETDEFGQLNDNRQLAHVEAVCIAWFTMEYLLRFLSSPNKWKFFKGPLNVIDLLAILP YYVTIFLTESNKSVLQFQNVRRVVQIFRIMRILRILKLARHSTGLQSLGFTLRRSYNE LGLLILFLAMGIMIFSSLVFFAEKDEDATKFTSIPASFWWATITMTTVGYGDIYPKTL LGKIVGGLCCIAGVLVIALPIPIIVNNFSEFYKEQKRQEKAIKRREALERAKRNGSIV SMNLKDAFARSMELIDVAVEKAGESANTKDSADDNHLSPSRWKWARKALSETSSNKSF ENKYQEVSQKDSHEQLNNTFSSSPQHLSAQKLEMLYNEITKTQPHSHPNPDCQEKPER PSAYEEEIEMEEVVCPQEQLAVAQTEVIVDMKSTSSIDSFTSCATDFTETERSPLPPP SASHLQMKFPTDLPGTEEHQRARGPPFLTLSREKGPAARDGTLEYAPVDITVNLDASG SQCGLHSPLQSDNATDSPKSSLKGSNPLKSRSLKVNFKENRGSAPQTPPSTARPLPVT TADFSLTTPQHISTILLEETPSQGDRPCWALRFQRLVRDLPKGCPPGFPSRNCSLSLQ ERGGASLK" BASE COUNT 670 a 625 c 604 g 536 t ORIGIN 1 ccatggcaga aaaggcacct cctggcttaa acaggaagac ttcaaggtcg acactttccc 61 ttcctccaga gcctgtggac attatccgga gcaaaacatg ctccaggaga gttaagatca 121 atgtgggggg cctcaaccac gaagtcctgt ggagaacgct ggacaggctg cccaggacgc 181 gcctggggaa gcttcgagac tgcaacacac acgagagcct cctggaagtg tgcgacgact 241 ataatctgaa cgagaacgag tatttctttg atcggcatcc aggagccttc acttccattt 301 taaatttcta ccggaccggg aaactccata tgatggaaga aatgtgtgca ctttcgtttg 361 gccaagaact tgattactgg gggattgatg agatctactt ggagtcctgc tgccaggcca 421 gatatcatca aaaaaaagaa caaatgaacg aagaactgag gcgagaggca gagactatgc 481 gagacggaga aggagaagag tttgataata cctgctgccc tgataaaagg aagaaactgt 541 gggacttgct ggagaaacct aactcatcag tggctgcaaa gatcctggcc atcgtgtcta 601 tcctgttcat tgtgctttcc accattgctt tgtctctcaa tacgctgccg gagctgcagg 661 aaacggacga atttggacaa ctcaatgaca accgccaatt agcacacgtg gaggctgtgt 721 gtattgcatg gtttaccatg gagtaccttt tgcgattctt atcctcacca aataaatgga 781 agttcttcaa aggcccactg aatgtcattg atttgctggc catcttgccg tactatgtca 841 ccatttttct gacggagtcc aacaagagcg tgctgcagtt ccaaaacgtg aggcgcgtgg 901 tccagatctt ccgaatcatg cgcatcctca ggatcctgaa actcgccagg cattcgacag 961 gcctgcagtc tctgggtttc acccttaggc ggagttacaa tgaattgggc ttgttgatat 1021 tgtttctggc catggggata atgatatttt ccagcctggt attttttgct gagaaggatg 1081 aagatgctac caagttcacc agtatccctg catcattttg gtgggccacc atcaccatga 1141 ccactgttgg ctatggtgac atttacccta aaacattact agggaaaatt gtgggaggtc 1201 tgtgctgtat tgctggggtt ctggttattg cccttcctat cccaattatt gtgaacaatt 1261 tttctgagtt ttacaaggag cagaaacgcc aagagaaagc aattaaaagg agggaggctc 1321 ttgagcgggc caaaaggaac ggaagcatcg tttctatgaa cttaaaagat gccttcgctc 1381 gaagtatgga actgatagat gtggctgttg agaaggccgg agagtccgcc aacacaaagg 1441 actccgccga cgataatcac ctgtcgccaa gccggtggaa gtgggccagg aaggctctgt 1501 cggaaacaag ctccaacaag tctttcgaga ataagtacca ggaggttagc caaaaagact 1561 cccacgagca gctgaacaac acgttttcct ccagcccaca gcatctgagt gcccagaaac 1621 tggagatgct atacaatgaa attaccaaga cacagcctca ttctcaccca aacccagact 1681 gccaagaaaa gcctgagagg ccatctgcat atgaagaaga gattgaaatg gaagaagtgg 1741 tgtgtccaca ggagcagctg gccgtggcac agaccgaggt cattgtggac atgaagagca 1801 cctccagcat cgacagcttc accagctgtg ccaccgactt cacagagaca gagagatcgc 1861 cgctgccgcc gccctccgcc tctcacttgc agatgaagtt cccaaccgac ctcccaggga 1921 cagaagagca ccaaagagct aggggccccc cgtttctaac tctatccaga gagaaaggac 1981 ctgctgccag ggatggcacg ctggagtatg ccccagttga cataactgtg aacctcgatg 2041 ccagtggctc ccagtgtggg ctacatagtc ctttgcagtc tgacaatgcc accgacagtc 2101 ctaagagctc tctaaaaggc agcaacccac taaagtccag atccctcaaa gtgaacttta 2161 aggaaaatag aggcagtgca ccacagaccc cgcccagcac agccaggcca ctgccagtca 2221 ccacagctga cttttcgctc actaccccgc agcacatcag taccatcctc ttagaagaaa 2281 ccccctccca gggagacaga ccttgctggg cactgaggtt tcagcgcctt gtcagggacc 2341 ttccaaaggg ctgtccccca ggtttcccaa gcagaaactg ttccctttct cttcaagaga 2401 gaggaggagc ttcactgaaa tagacactgg gaatt // LOCUS HSU70136 5041 bp mRNA PRI 30-SEP-1996 DEFINITION Human megakaryocyte stimulating factor mRNA, complete cds. ACCESSION U70136 NID g1572720 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5041) AUTHORS Turner,K.J., Fitz,L.J., Temple,P., Jacobs,K., Larson,D., Leary,A.C., Kelleher,K., Giannotti,J., Calvetti,J., Fitzgerald,M., Kriz,M.-J., Ferenz,C., Grobholz,J., Fraser,H., Bean,K., Norton,C.R., Gesner,T., Bhatia,S., Kriz,R., Hewick,R. and Clark,S.C. TITLE Purification, Biochemical Characterization, and Cloning of a Novel Megakaryocyte Stimulating Factor that has Megakaryocyte Colony Stimulating Activity JOURNAL Blood 78 (Suppl. 1), 279 (1991) REFERENCE 2 (bases 1 to 5041) AUTHORS Merberg,D.M., Fitz,L.J., Temple,P., Giannotti,J., Murtha,P., Fitzgerald,M., Scaltreto,J., Kelleher,K., Preissner,K., Kriz,R., Jacobs,K. and Turner,K. TITLE A Comparison of Vitronectin and Megakaryocyte Stimulating Factor JOURNAL (in) Preissner,K.T., Rosenblatt,S., Kost,C., Wegerhoff,J. and Mosher,D.F. (Eds.); BIOLOGY OF VITRONECTINS AND THEIR RECEPTORS.: 45-52; Elsevier Science Publishers B.V. (1993) REFERENCE 3 (bases 1 to 5041) AUTHORS Turner,K.J., Fitz,L.J., Temple,P., Jacobs,K., Larson,D., Leary,A.C., Kelleher,K., Giannotti,J., Calvetti,J., Fitzgerald,M., Kriz,M.-J., Ferenz,C., Grobholz,J., Fraser,H., Bean,K., Norton,C.R., Gesner,T., Bhatia,S., Kriz,R., Hewick,R. and Clark,S.C. TITLE Direct Submission JOURNAL Submitted (09-SEP-1996) Genetics Institute, 87 CambridgePark Drive, Cambridge, MA 02140, USA FEATURES Location/Qualifiers source 1..5041 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="PHA and PMA activated human peripheral blood mononuclear cells" CDS 34..4248 /note="MSF" /codon_start=1 /product="megakaryocyte stimulating factor" /db_xref="PID:g1572721" /translation="MAWKTLPIYLLLLLSVFVIQQVSSQDLSSCAGRCGEGYSRDATC NCDYNCQHYMECCPDFKRVCTAELSCKGRCFESFERGRECDCDAQCKKYDKCCPDYES FCAEVHNPTSPPSSKKAPPPSGASQTIKSTTKRSPKPPNKKKTKKVIESEEITEEHSV SENQESSSSSSSSSSSSTIWKIKSSKNSAANRELQKKLKVKDNKKNRTKKKPTPKPPV VDEAGSGLDNGDFKVTTPDTSTTQHNKVSTSPKITTAKPINPRPSLPPNSDTSKETSL TVNKETTVETKETTTTNKQTSTDGKEKTTSAKETQSIEKTSAKDLAPTSKVLAKPTPK AETTTKGPALTTPKEPTPTTPKEPASTTPKEPTPTTIKSAPTTPKEPAPTTTKSAPTT PKEPAPTTTKEPAPTTPKEPAPTTTKEPAPTTTKSAPTTPKEPAPTTPKKPAPTTPKE PAPTTPKEPTPTTPKEPAPTTKEPAPTTPKEPAPTAPKKPAPTTPKEPAPTTPKEPAP TTTKEPSPTTPKEPAPTTTKSAPTTTKEPAPTTTKSAPTTPKEPSPTTTKEPAPTTPK EPAPTTPKKPAPTTPKEPAPTTPKEPAPTTTKKPAPTAPKEPAPTTPKETAPTTPKKL TPTTPEKLAPTTPEKPAPTTPEELAPTTPEEPTPTTPEEPAPTTPKAAAPNTPKEPAP TTPKEPAPTTPKEPAPTTPKETAPTTPKGTAPTTLKEPAPTTPKKPAPKELAPTTTKE PTSTTSDKPAPTTPKGTAPTTPKEPAPTTPKEPAPTTPKGTAPTTLKEPAPTTPKKPA PKELAPTTTKGPTSTTSDKPAPTTPKETAPTTPKEPAPTTPKKPAPTTPETPPPTTSE VSTPTTTKEPTTIHKSPDESTPELSAEPTPKALENSPKEPGVPTTKTPAATKPEMTTT AKDKTTERDLRTTPETTTAAPKMTKETATTTEKTTESKITATTTQVTSTTTQDTTPFK ITTLKTTTLAPKVTTTKKTITTTEIMNKPEETAKPKDRATNSKATTPKPQKPTKAPKK PTSTKKPKTMPRVRKPKTTPTPRKMTSTMPELNPTSRIAEAMLQTTTRPNQTPNSKLV EVNPKSEDAGGAEGETPHMLLRPHVFMPEVTPDMDYLPRVPNQGIIINPMLSDETNIC NGKPVDGLTTLRNGTLVAFRGHYFWMLSPFSPPSPARRITEVWGIPSPIDTVFTRCNC EGKTFFFKDSQYWRFTNDIKDAGYPKPIFKGFGGLTGQIVAALSTAKYKNWPESVYFF KRGGSIQQYIYKQEPVQKCPGRRPALNYPVYGEMTQVRRRRFERAIGPSQTHTIRIQY SPARLAYQDKGVLHNEVKVSILWRGLPNVVTSAISLPNIRKPDGYDYYAFSKDQYYNI DVPSRTARAITTRSGQTLSKVWYNCP" BASE COUNT 1710 a 1516 c 806 g 1009 t ORIGIN 1 gcggccgcga ctattcggta cctgaaaaca acgatggcat ggaaaacact tcccatttac 61 ctgttgttgc tgctgtctgt tttcgtgatt cagcaagttt catctcaaga tttatcaagc 121 tgtgcaggga gatgtgggga agggtattct agagatgcca cctgcaactg tgattataac 181 tgtcaacact acatggagtg ctgccctgat ttcaagagag tctgcactgc ggagctttcc 241 tgtaaaggcc gctgctttga gtccttcgag agagggaggg agtgtgactg cgacgcccaa 301 tgtaagaagt atgacaagtg ctgtcccgat tatgagagtt tctgtgcaga agtgcataat 361 cccacatcac caccatcttc aaagaaagca cctccacctt caggagcatc tcaaaccatc 421 aaatcaacaa ccaaacgttc acccaaacca ccaaacaaga agaagactaa gaaagttata 481 gaatcagagg aaataacaga agaacattct gtttctgaaa atcaagagtc ctcctcctcc 541 tcctcctctt cctcttcttc ttcaacaatt tggaaaatca agtcttccaa aaattcagct 601 gctaatagag aattacagaa gaaactcaaa gtaaaagata acaagaagaa cagaactaaa 661 aagaaaccta cccccaaacc accagttgta gatgaagctg gaagtggatt ggacaatggt 721 gacttcaagg tcacaactcc tgacacgtct accacccaac acaataaagt cagcacatct 781 cccaagatca caacagcaaa accaataaat cccagaccca gtcttccacc taattctgat 841 acatctaaag agacgtcttt gacagtgaat aaagagacaa cagttgaaac taaagaaact 901 actacaacaa ataaacagac ttcaactgat ggaaaagaga agactacttc cgctaaagag 961 acacaaagta tagagaaaac atctgctaaa gatttagcac ccacatctaa agtgctggct 1021 aaacctacac ccaaagctga aactacaacc aaaggccctg ctctcaccac tcccaaggag 1081 cccacgccca ccactcccaa ggagcctgca tctaccacac ccaaagagcc cacacctacc 1141 accatcaagt ctgcacccac cacccccaag gagcctgcac ccaccaccac caagtctgca 1201 cccaccactc ccaaggagcc tgcacccacc accaccaagg agcctgcacc caccactccc 1261 aaggagcctg cacccaccac caccaaggag cctgcaccca ccaccaccaa gtctgcaccc 1321 accactccca aggagcctgc acccaccacc cccaagaagc ctgccccaac tacccccaag 1381 gagcctgcac ccaccactcc caaggagcct acacccacca ctcccaagga gcctgcaccc 1441 accaccaagg agcctgcacc caccactccc aaagagcctg cacccactgc ccccaagaag 1501 cctgccccaa ctacccccaa ggagcctgca cccaccactc ccaaggagcc tgcacccacc 1561 accaccaagg agccttcacc caccactccc aaggagcctg cacccaccac caccaagtct 1621 gcacccacca ctaccaagga gcctgcaccc accactacca agtctgcacc caccactccc 1681 aaggagcctt cacccaccac caccaaggag cctgcaccca ccactcccaa ggagcctgca 1741 cccaccaccc ccaagaagcc tgccccaact acccccaagg agcctgcacc caccactccc 1801 aaggaacctg cacccaccac caccaagaag cctgcaccca ccgctcccaa agagcctgcc 1861 ccaactaccc ccaaggagac tgcacccacc acccccaaga agctcacgcc caccaccccc 1921 gagaagctcg cacccaccac ccctgagaag cccgcaccca ccacccctga ggagctcgca 1981 cccaccaccc ctgaggagcc cacacccacc acccctgagg agcctgctcc caccactccc 2041 aaggcagcgg ctcccaacac ccctaaggag cctgctccaa ctacccctaa ggagcctgct 2101 ccaactaccc ctaaggagcc tgctccaact acccctaagg agactgctcc aactacccct 2161 aaagggactg ctccaactac cctcaaggaa cctgcaccca ctactcccaa gaagcctgcc 2221 cccaaggagc ttgcacccac caccaccaag gagcccacat ccaccacctc tgacaagccc 2281 gctccaacta cccctaaggg gactgctcca actaccccta aggagcctgc tccaactacc 2341 cctaaggagc ctgctccaac tacccctaag gggactgctc caactaccct caaggaacct 2401 gcacccacta ctcccaagaa gcctgccccc aaggagcttg cacccaccac caccaagggg 2461 cccacatcca ccacctctga caagcctgct ccaactacac ctaaggagac tgctccaact 2521 acccccaagg agcctgcacc cactaccccc aagaagcctg ctccaactac tcctgagaca 2581 cctcctccaa ccacttcaga ggtctctact ccaactacca ccaaggagcc taccactatc 2641 cacaaaagcc ctgatgaatc aactcctgag ctttctgcag aacccacacc aaaagctctt 2701 gaaaacagtc ccaaggaacc tggtgtacct acaactaaga ctcctgcagc gactaaacct 2761 gaaatgacta caacagctaa agacaagaca acagaaagag acttacgtac tacacctgaa 2821 actacaactg ctgcacctaa gatgacaaaa gagacagcaa ctacaacaga aaaaactacc 2881 gaatccaaaa taacagctac aaccacacaa gtaacatcta ccacaactca agataccaca 2941 ccattcaaaa ttactactct taaaacaact actcttgcac ccaaagtaac tacaacaaaa 3001 aagacaatta ctaccactga gattatgaac aaacctgaag aaacagctaa accaaaagac 3061 agagctacta attctaaagc gacaactcct aaacctcaaa agccaaccaa agcacccaaa 3121 aaacccactt ctaccaaaaa gccaaaaaca atgcctagag tgagaaaacc aaagacgaca 3181 ccaactcccc gcaagatgac atcaacaatg ccagaattga accctacctc aagaatagca 3241 gaagccatgc tccaaaccac caccagacct aaccaaactc caaactccaa actagttgaa 3301 gtaaatccaa agagtgaaga tgcaggtggt gctgaaggag aaacacctca tatgcttctc 3361 aggccccatg tgttcatgcc tgaagttact cccgacatgg attacttacc gagagtaccc 3421 aatcaaggca ttatcatcaa tcccatgctt tccgatgaga ccaatatatg caatggtaag 3481 ccagtagatg gactgactac tttgcgcaat gggacattag ttgcattccg aggtcattat 3541 ttctggatgc taagtccatt cagtccacca tctccagctc gcagaattac tgaagtttgg 3601 ggtattcctt cccccattga tactgttttt actaggtgca actgtgaagg aaaaactttc 3661 ttctttaagg attctcagta ctggcgtttt accaatgata taaaagatgc agggtacccc 3721 aaaccaattt tcaaaggatt tggaggacta actggacaaa tagtggcagc gctttcaaca 3781 gctaaatata agaactggcc tgaatctgtg tattttttca agagaggtgg cagcattcag 3841 cagtatattt ataaacagga acctgtacag aagtgccctg gaagaaggcc tgctctaaat 3901 tatccagtgt atggagaaat gacacaggtt aggagacgtc gctttgaacg tgctatagga 3961 ccttctcaaa cacacaccat cagaattcaa tattcacctg ccagactggc ttatcaagac 4021 aaaggtgtcc ttcataatga agttaaagtg agtatactgt ggagaggact tccaaatgtg 4081 gttacctcag ctatatcact gcccaacatc agaaaacctg acggctatga ttactatgcc 4141 ttttctaaag atcaatacta taacattgat gtgcctagta gaacagcaag agcaattact 4201 actcgttctg ggcagacctt atccaaagtc tggtacaact gtccttagac tgatgagcaa 4261 aggaggagtc aactaatgaa gaaatgaata ataaattttg acactgaaaa acattttatt 4321 aataaagaat attgacatga gtataccagt ttatatataa aaatgttttt aaacttgaca 4381 atcattacac taaaacagat ttgataatct tattcacagt tgttattgtt tacagaccat 4441 ttaattaata tttcctctgt ttattcctcc tctccctccc attgcatggc tcacacctgt 4501 aaaagaaaaa agaatcaaat tgaatatatc ttttaagaat tcaaaactag tgtattcact 4561 taccctagtt cattataaaa aatatctagg cattgtggat ataaaactgt tgggtattct 4621 acaacttcaa tggaaattat tacaagcaga ttaatccctc tttttgtgac acaagtacaa 4681 tctaaaagtt atattggaaa acatggaaat attaaaattt tacactttta ctagctaaaa 4741 cataatcaca aagctttatc gtgttgtata aaaaaattaa caatataatg gcaataggta 4801 gagatacaac aaatgaatat aacactataa cacttcatat tttccaaatc ttaatttgga 4861 tttaaggaag aaatcaataa atataaaata taagcacata tttattatat atctaaggta 4921 tacaaatctg tctacatgaa gtttacagat tggtaaatat cacctgctca acatgtaatt 4981 atttaataaa actttggaac attaaaaaaa taaattggag gcttaaaaaa aaaaaaaaaa 5041 a // LOCUS HSU70212 4007 bp mRNA PRI 08-JUL-1997 DEFINITION Human SIM1 mRNA, complete cds. ACCESSION U70212 NID g2245351 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4007) AUTHORS Chrast,R., Scott,H.S., Chen,H., Kudoh,J., Rossier,C., Minoshima,S., Wang,Y., Shimizu,N. and Antonarakis,S.E. TITLE Cloning of two human homologs of the Drosophila single-minded gene SIM1 on chromosome 6q and SIM2 on 21q within the Down syndrome chromosomal region JOURNAL Genome Res. 7 (6), 615-624 (1997) MEDLINE 97343329 REFERENCE 2 (bases 1 to 4007) AUTHORS Chrast,R., Rossier,C. and Antonarakis,S.E. TITLE Direct Submission JOURNAL Submitted (10-SEP-1996) Medical Genetics, Geneva University Medical School, 1 rue Michel Servet, Geneva 1211, Switzerland FEATURES Location/Qualifiers source 1..4007 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6q" gene 217..2517 /gene="SIM1" CDS 217..2517 /gene="SIM1" /note="similar to the single-minded protein in Drosophila melanogaster, SwissProt Accession Number P05709" /codon_start=1 /product="hSIM1" /db_xref="PID:g2245352" /translation="MKEKSKNAARTRREKENSEFYELAKLLPLASAITSQVDKASIIR LTTSYLKMRVVFPEGLGEAWGHSSRTSPLDNVGRELGSHLLQTLDGFIFVVAPDGKIM YISETASVHLGLSQVELTGNSIYEYIHPADHDEMTAVLTAHQPYHSHFVQEYEIERSF FLRMKCVLAKRNAGLTCGGYKVIHCSGYLKIRQYSLDMSPFDGCYQNVGLVAVGHSLP PSAVTEIKLHSNMFMFRASLDMKLIFLDSRVAELTGYEPQDLIEKTLYHHVHGCDTFH LRCAHHLLLVKGQVTTKYYRFLAKHGGWVWVQSYATIVHNSRSSRPHCIVSVNYVLTD TEYKGLQLSLDQISASKPAFSYTSSSTPTMTDNRKGAKSRLSSSKSKSRTSPYPQYSG FHTERSESDHDSQWGGSPLTDTASPQLLDPADRPGSQHDASCAYRQFSDRSSLCYGFA LDHSRLVEERHFHTQACEGGRCEAGRYFLGTPQAGREPWWGSRAALPLTKASPESREA YENSMPHIASVHRIHGRGHWDEDSVVSSPDPGSASESGDRYRTEQYQSSPHEPSKIET LIRATQQMIKEEENRLQLRKAPSDQLASINGAGKKHSLCFANYQQPPPTGEVCHGSAL ANTSPCDHIQQREGKMLSPHENDYDNSPTALSRISSPNSDRISKSSLILAKDYLHSDI SPHQTAGDHPTVSPNCFGSHRQYFDKHAYTLTGYALEHLYDSETIRNYSLGCNGSHFD VTSHLRMQPDPAQGHKGTSVIITNGS" BASE COUNT 1159 a 967 c 899 g 981 t 1 others ORIGIN 1 ggatccgcgc gaattttcaa agaacatatt ttccgttcac ccccgctggt cttttactgc 61 catcaataca ctgttcttgg tgcaaatacc tcagcctctt tattcaaagt atgttttatg 121 ttttngccaa atatgatctc taattgaaag tttatttttg gttttggatg aatctgcgga 181 gcttaagttg tgagaagaaa gggggaacaa gacacaatga aagaaaagtc caaaaatgct 241 gcgcggacta ggagggagaa ggaaaacagc gaattttatg aactggctaa attactgcct 301 ttggcctcgg ctatcacctc gcaggtggac aaagcatcca taatcagact cacgaccagc 361 tatctcaaaa tgagagtggt gttcccagaa gggctcggcg aggcgtgggg ccactcaagt 421 cggaccagcc ccctggacaa cgttggccga gaactgggct cccatctgct ccagaccctg 481 gatggcttca tcttcgtggt agccccagat gggaagatca tgtacatctc agagacagcc 541 tcagtccact tgggtctttc tcaggtagag ctgaccggaa acagcattta tgaatacatt 601 cacccggcag accacgacga gatgacggcg gtgctcaccg cccatcaacc ctaccactct 661 cacttcgtgc aggagtatga gatcgagcgc tccttcttcc tgaggatgaa gtgcgtcttg 721 gccaagcgta acgccggcct cacctgtggc ggctacaagg tcatccactg cagcggctac 781 ttgaagatcc gccagtacag cctggacatg tcccccttcg acggctgcta ccaaaacgtg 841 ggcctggtgg ccgtgggcca ctcgctgcct cccagcgccg tcacggagat caagctacac 901 agcaatatgt ttatgttccg cgccagcctg gacatgaagc tcatctttct ggactccagg 961 gtggcggagc tgacggggta cgaacctcag gacctgattg agaagactct gtaccaccat 1021 gtgcacggct gcgacacctt ccacctgcgc tgcgcgcacc atttgctgct ggtgaaggga 1081 caggtgacca ccaagtacta caggttcctg gcgaaacacg gcggctgggt atgggtgcaa 1141 agctacgcga ccatcgtgca caacagtcgc tcctccaggc cacactgtat cgtcagcgtc 1201 aactatgtcc tcacagacac agaatacaaa gggctgcagc tctccctgga tcagatctca 1261 gcctccaaac cagccttctc ctataccagc agctccaccc ccaccatgac tgacaacaga 1321 aagggggcca aatcccggct ctccagctca aagtcaaaat ccaggacttc cccataccct 1381 cagtattcgg gatttcacac agaaagatcg gaatctgatc atgacagcca gtggggcgga 1441 agtcccttga ccgacacggc ctctccgcag cttctggacc ccgccgatag gcctggctcc 1501 cagcacgacg catcgtgcgc ctacagacag ttttcggacc gcagctctct ctgctatggc 1561 tttgcgcttg accactcgag gctggtggaa gagaggcatt tccataccca ggcctgtgaa 1621 ggaggccgat gtgaggcagg caggtacttc ctgggaacgc cgcaggccgg gagggagccc 1681 tggtggggct ctcgcgcagc cttgcccctg acaaaggcct ccccagaaag cagagaagcc 1741 tatgaaaaca gcatgcctca catcgcttca gtccacagga tccatgggcg aggtcattgg 1801 gatgaagata gtgtggtcag ttctccagac cctgggtcgg ccagtgaatc aggtgaccga 1861 tatcgtactg agcagtatca aagtagccca catgaaccca gcaaaattga aactcttata 1921 agagccactc agcaaatgat taaagaagaa gagaacagat tacagctaag gaaagccccc 1981 tcagaccaac tggcttccat taatggggct gggaaaaaac actccctgtg ttttgcaaac 2041 taccaacagc ccccaccaac aggtgaagtc tgccatggct ctgctcttgc caacacttca 2101 ccatgtgacc atatccagca gagagaggga aaaatgttga gcccccatga aaatgactat 2161 gacaacagtc ccaccgcact atctcggata agtagtccca attcggatcg catttcaaaa 2221 tccagtttga tcctagctaa agactatctg cattcggata tatctcctca tcagacagca 2281 ggagaccacc ctactgtctc tccaaactgc tttggctctc accggcagta ttttgacaag 2341 catgcttaca cattaactgg atatgccctg gagcacttat atgacagcga aaccattaga 2401 aactattcct tgggctgtaa tggctcacac tttgatgtaa cttcccatct gaggatgcaa 2461 ccagacccag cacaaggaca caagggaaca tctgttataa taaccaacgg aagctgatgt 2521 tttgctgaaa tattttgttc tttaaggatc tctgaaacat atttatagtt taatacccca 2581 ttaccagcat ttactatgcc acagattgtt agagagtata acttaagtta ctgggtattt 2641 gatacgtgtt cctataaaat caaagaaaac atagcactag cattcagggt tatacacaga 2701 aaagggagct aaattgaata cacaaatttc ccctctaatt atatgggaac cagaatagat 2761 aaattttgac ttgaaaaata ttcatgtaga tcaagtgtgc atatatacta catgagagga 2821 ctgatgaatg acaacattgc attgtgacta tccagtgatc ctcaaacaca caaactatta 2881 cttacaaact gcggtataca ttttacatat ggaaatatag gctatgtaat gtaaatacat 2941 caaaaatggg taattttctt tgactctgtc acactaaact tcttaacgaa atttccattc 3001 ccaaaataac tgagaaagag agagatacat cttataaact gacttctttg tggtttcaaa 3061 tcagccagct catttggttc aggcataaat tagagaaatg gttctggata tggtgcaaaa 3121 atgagttttc acctggtatc cattataaac aatcaggaag aggtaatttt tcaccttgct 3181 tttcagttag acaaggacca ggattgcact gacatggcgc tgagggtttt tctaagtaag 3241 aacactgaga tattgggaca cacatcaaaa acctggagtg ctcaattgga agtagttcta 3301 tgaatatgga aaggccagag gcagagtgaa ataaaatgct atctcaaagt ttaacacaat 3361 ttaagggctc agcataagta aacaacatat ttggggtttg cttgtaaaac caactaaata 3421 aaaaattcaa accaattcac ccagaaaaaa gaccaatagg tgcaaaaata aaaggaaaac 3481 cagtgaagtg ccacatgaca gcagtgttaa gtgtttgaaa acgtttcaaa gcacatatgt 3541 gccaatgtga caacatgtgg aaagcctcag gagagagtct aagataaaag cttaggctga 3601 tagacaagta gttaagagct aagagcagta ctctgaagga ataggcaaaa tgtttatttt 3661 ccttattgtt tgtaaacaac aaacttggtc ttacatctgt gtggtatagt agaaaggcca 3721 gctgactaga tctctggatt ctaattttgg ccctacctgt aacttaattt tgtgaccaca 3781 gttgtaccat tcaccgtgcc tgggctctag tttcctggtt tgtaaggcag ccccagcgtt 3841 catgttctgt gatagagcag aactgaactt attacctaat taactctctg ctatgagttg 3901 tcaagactga tcattctgtt ttttctgtac acagaagttt agatgctttg tgacttaagc 3961 aggtgtgtgg gctcctttag gcaggttaca gttaatttct agagtcg // LOCUS HSU70310 2290 bp mRNA PRI 10-OCT-1997 DEFINITION Human DNA repair protein XRCC9 (XRCC9) mRNA, complete cds. ACCESSION U70310 NID g2465497 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2290) AUTHORS Liu,N., Lamerdin,J.E., Tucker,J.D., Zhou,Z.Q., Walter,C.A., Albala,J.S., Busch,D.B. and Thompson,L.H. TITLE The human XRCC9 gene corrects chromosomal instability and mutagen sensitivities in CHO UV40 cells JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (17), 9232-9237 (1997) MEDLINE 97404378 REFERENCE 2 (bases 1 to 2290) AUTHORS Liu,N., Lamerdin,J.E., Ramsey,M., Tucker,J.D., Zhou,Z.-Q., Walter,C.A. and Thompson,L.H. TITLE Direct Submission JOURNAL Submitted (10-SEP-1996) BBRP, Lawrence Livermore National Laboratory, 7000 East Ave., Livermore, CA 94550, USA FEATURES Location/Qualifiers source 1..2290 /organism="Homo sapiens" /note="cDNA library from R. Legerski" /db_xref="taxon:9606" /cell_type="HeLa" /chromosome="9" /map="9p" 5'UTR <1..134 /gene="XRCC9" gene <1..2290 /gene="XRCC9" CDS 135..2003 /gene="XRCC9" /note="X-ray cross-complementing rodent repair group 9" /codon_start=1 /product="DNA repair protein XRCC9" /db_xref="PID:g1857159" /translation="MSRQTTSVGSSCLDLWREKNDRLVRQAKVAQNSGLTLRRQQLAQ DALEGLRGLLHSLQGLPAAVPVLPLELTVTCNFIILRASLAQGFTEDQAQDIQRSLER VLETQEQQGPRLEQGLRELWDSVLRASCLLPELLSALHRLVGLQAALWLSADRLGDLA LLLETLNGSQSGASKDLLLLLKTWSPPAEELDAPLTLQDAQGLKDVLLTAFAYRQGLQ ELITGNPDKALSSLHEAASGLCPRPVLVQVYTALGSCHRKMGNPQRALLYLVAALKEG SAWGPPLLEASRLYQQLGDTTAELESLELLVEALNVPCSSKAPQFLIEVELLLPPPDL ASPLHCGTQSQTKHILASRCLQTGRAGDAAEHYLDLLALLLDSSEPRFSPPPSPPGPC MPEVFLEAAVALIQAGRAQDALTLCEELLSRTSSLLPKMSRLWEDARKGTKELPYCPL WVSATHLLQGQAWVQLGAQKVAISEFSRCLELLFRATPEEKEQGAAFNCEQGCKSDAA LQQLRAAALISRGLEWVASGQDTKALQDFLLSVQMCPGNRDTYFHLLQTLKRLDRRDE ATALWWRLEAQTKGSHEDALWSLPLYLESYLSWIRPSDRDAFLEEFRTSLPKSCDL" 3'UTR 2004..2290 /gene="XRCC9" polyA_signal 2255..2260 /gene="XRCC9" BASE COUNT 490 a 637 c 648 g 515 t ORIGIN 1 ccccgagaga agcaggggag ctcggcgggg tgcagaagtg cccaggcccc tccccgctgg 61 ggttgggagc ttgggcaggc cagcttcacc cttcctaagt ccgcttctgg tctccgggcc 121 cagcctcggc caccatgtcc cgccagacca cctctgtggg ctccagctgc ctggacctgt 181 ggagggaaaa gaatgaccgg ctcgttcgac aggccaaggt ggctcagaac tccggtctga 241 ctctgaggcg acagcagttg gctcaggatg cactggaagg gctcagaggg ctcctccata 301 gtctgcaagg gctccctgca gctgttcctg ttcttccctt ggagctgact gtcacctgca 361 acttcattat cctgagggca agcttggccc agggtttcac agaggatcag gcccaggata 421 tccagcggag cctagagaga gtgctggaga cacaggagca gcaggggccc aggttggaac 481 aggggctcag ggagctgtgg gactctgtcc ttcgtgcttc ctgccttctg ccggagctgc 541 tgtctgccct gcaccgcctg gttggcctgc aggctgccct ctggttgagt gctgaccgtc 601 ttggggacct ggccttgtta ctagagaccc tgaatggcag ccagagtgga gcctctaagg 661 atctgctgtt acttctgaaa acttggagtc ccccagctga ggaattagat gctccattga 721 ccctgcagga tgcccaggga ttgaaggatg tcctcctgac agcatttgcc taccgccaag 781 gtctccagga gctgatcaca gggaacccag acaaggcact aagcagcctt catgaagcgg 841 cctcaggcct gtgtccacgg cctgtgttgg tccaggtgta cacagcactg gggtcctgtc 901 accgtaagat gggaaatcca cagagagcac tgttgtactt ggttgcagcc ctgaaagagg 961 gatcagcctg gggtcctcca cttctggagg cctctaggct ctatcagcaa ctgggggaca 1021 caacagcaga gctggagagt ctggagctgc tagttgaggc cttgaatgtc ccatgcagtt 1081 ccaaagcccc gcagtttctc attgaggtag aattactact gccaccacct gacctagcct 1141 caccccttca ttgtggcact cagagccaga ccaagcacat actagcaagc aggtgcctac 1201 agacggggag ggcaggagac gctgcagagc attacttgga cctgctggcc ctgttgctgg 1261 atagctcgga gccaaggttc tccccacccc cctcccctcc agggccctgt atgcctgagg 1321 tgtttttgga ggcagcggta gcactgatcc aggcaggcag agcccaagat gccttgactc 1381 tatgtgagga gttgctcagc cgcacatcat ctctgctacc caagatgtcc cggctgtggg 1441 aagatgccag aaaaggaacc aaggaactgc catactgccc actctgggtc tctgccaccc 1501 acctgcttca gggccaggcc tgggttcaac tgggtgccca aaaagtggca attagtgaat 1561 ttagcaggtg cctcgagctg ctcttccggg ccacacctga ggaaaaagaa caaggggcag 1621 ctttcaactg tgagcaggga tgtaagtcag atgcggcact gcagcagctt cgggcagccg 1681 ccctaattag tcgtggactg gaatgggtag ccagcggcca ggataccaaa gccttacagg 1741 acttcctcct cagtgtgcag atgtgcccag gtaatcgaga cacttacttt cacctgcttc 1801 agactctgaa gaggctagat cggagggatg aggccactgc actctggtgg aggctggagg 1861 cccaaactaa ggggtcacat gaagatgctc tgtggtctct ccccctgtac ctagaaagct 1921 atttgagctg gatccgtccc tctgatcgtg acgccttcct tgaagaattt cggacatctc 1981 tgccaaagtc ttgtgacctg tagctgccac gttttgaaga gcttgagctg ggtccccagt 2041 gggctgtctc tctgtgggga gggctttctg cttcaccatc attaggaatg tgaccattcc 2101 tatataattc ctggactggt gagattggtg gtaggcctgt gaaatttgcc ctagttacta 2161 ccattctcgt tttggaggaa acaatctctg ccaccaccaa gtcattgact ttgctcgagg 2221 cacctttttt cctgtttctc cttttctgtt gtcgagtaaa atttcatatt taaaaaaaaa 2281 aaaaaaaaaa // LOCUS HSU70321 1724 bp mRNA PRI 31-MAY-1997 DEFINITION Human herpesvirus entry mediator mRNA, complete cds. ACCESSION U70321 NID g2138189 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1724) AUTHORS Montgomery,R.I., Warner,M.S., Lum,B.J. and Spear,P.G. TITLE Herpes simplex virus-1 entry into cells mediated by a novel member of the TNF/NGF receptor family JOURNAL Cell 87 (3), 427-436 (1996) MEDLINE 97053782 REFERENCE 2 (bases 1 to 1724) AUTHORS Montgomery,R.I., Warner,M.S., Lum,B. and Spear,P.G. TITLE Direct Submission JOURNAL Submitted (10-SEP-1996) Micro-Immuno, W213, Northwestern Univ, 303 E. Chicago Ave, Chicago, IL 60611, USA FEATURES Location/Qualifiers source 1..1724 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 294..1145 /note="HVEM; member of TNF/NGF receptor family" /codon_start=1 /product="herpesvirus entry mediator" /db_xref="PID:g2138190" /translation="MEPPGDWGPPPWRSTPRTDVLRLVLYLTFLGAPCYAPALPSCKE DEYPVGSECCPKCSPGYRVKEACGELTGTVCEPCPPGTYIAHLNGLSKCLQCQMCDPA MGLRASRNCSRTENAVCGCSPGHFCIVQDGDHCAACRAYATSSPGQRVQKGGTESQDT LCQNCPPGTFSPNGTLEECQHQTKCSWLVTKAGAGTSSSHWVWWFLSGSLVIVIVCST VGLIICVKRRKPRGDVVKVIVSVQRKRQEAEGEATVIEALQAPPDVTTVAVEETIPSF TGRSPNH" BASE COUNT 331 a 548 c 512 g 333 t ORIGIN 1 ccttcatacc ggcccttccc ctcggctttg cctggacagc tcctgcctcc cgcagggccc 61 acctgtgtcc cccagcgccg ctccacccag caggcctgag cccctctctg ctgccagaca 121 ccccctgctg cccactctcc tgctgctcgg gttctgaggc acagcttgtc acaccgaggc 181 ggattctctt tctctttctc ttctggccca cagccgcagc aatggcgctg agttcctctg 241 ctggagttca tcctgctagc tgggttcccg agctgccggt ctgagcctga ggcatggagc 301 ctcctggaga ctgggggcct cctccctgga gatccacccc cagaaccgac gtcttgaggc 361 tggtgctgta tctcaccttc ctgggagccc cctgctacgc cccagctctg ccgtcctgca 421 aggaggacga gtacccagtg ggctccgagt gctgccccaa gtgcagtcca ggttatcgtg 481 tgaaggaggc ctgcggggag ctgacgggca cagtgtgtga accctgccct ccaggcacct 541 acattgccca cctcaatggc ctaagcaagt gtctgcagtg ccaaatgtgt gacccagcca 601 tgggcctgcg cgcgagccgg aactgctcca ggacagagaa cgccgtgtgt ggctgcagcc 661 caggccactt ctgcatcgtc caggacgggg accactgcgc cgcgtgccgc gcttacgcca 721 cctccagccc gggccagagg gtgcagaagg gaggcaccga gagtcaggac accctgtgtc 781 agaactgccc cccggggacc ttctctccca atgggaccct ggaggaatgt cagcaccaga 841 ccaagtgcag ctggctggtg acgaaggccg gagctgggac cagcagctcc cactgggtat 901 ggtggtttct ctcagggagc ctcgtcatcg tcattgtttg ctccacagtt ggcctaatca 961 tatgtgtgaa aagaagaaag ccaaggggtg atgtagtcaa ggtgatcgtc tccgtccagc 1021 ggaaaagaca ggaggcagaa ggtgaggcca cagtcattga ggccctgcag gcccctccgg 1081 acgtcaccac ggtggccgtg gaggagacaa taccctcatt cacggggagg agcccaaacc 1141 actgacccac agactctgca ccccgacgcc agagatacct ggagcgacgg ctgctgaaag 1201 aggctgtcca cctggcgaaa ccaccggagc ccggaggctt gggggctccg ccctgggctg 1261 gcttccgtct cctccagtgg agggagaggt ggggcccctg ctggggtaga gctggggacg 1321 ccacgtgcca ttcccatggg ccagtgaggg cctggggcct ctgttctgct gtggcctgag 1381 ctccccagag tcctgaggag gagcgccagt tgcccctcgc tcacagacca cacacccagc 1441 cctcctgggc cagcccagag ggcccttcag accccagctg tctgcgcgtc tgactcttgt 1501 ggcctcagca ggacaggccc cgggcactgc ctcacagcca aggctggact gggttggctg 1561 cagtgtggtg tttagtggat accacatcgg aagtgatttt ctaaattgga tttgaattcc 1621 ggtcctgtct tctatttgtc atgaaacagt gtatttgggg agatgctgtg ggaggatgta 1681 aatatcttgt ttctcctcaa aaaaaaaaaa aaaaaaaaaa aaaa // LOCUS HSU70322 3054 bp mRNA PRI 08-OCT-1996 DEFINITION Human transportin (TRN) mRNA, complete cds. ACCESSION U70322 NID g1613833 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3054) AUTHORS Pollard,V.W., Michael,W.M., Nakielny,S., Siomi,M.C., Wang,F. and Dreyfuss,G. TITLE A novel receptor-mediated nuclear protein import pathway JOURNAL Cell 86 (6), 985-994 (1996) MEDLINE 96404451 REFERENCE 2 (bases 1 to 3054) AUTHORS Pollard,V.W., Michael,W.M., Nakielny,S., Siomi,M.C., Wang,F. and Dreyfuss,G. TITLE Direct Submission JOURNAL Submitted (10-SEP-1996) Biochemistry and Biophysics, University of Pennsylvania, 415 Curie Blvd., Philadelphia, PA 19104, USA FEATURES Location/Qualifiers source 1..3054 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" gene 94..2766 /gene="TRN" CDS 94..2766 /gene="TRN" /codon_start=1 /product="transportin" /db_xref="PID:g1613834" /translation="MEYEWKPDEQGLQQILQLLKESQSPDTTIQRTVQQKLEQLNQYP DFNNYLIFVLTKLKSEDEPTRSLSGLILKNNVKAHFQNFPNGVTDFIKSECLNNIGDS SPLIRATVGILITTIASKGELQNWPDLLPKLCSLLDSEDYNTCEGAFGALQKICEDSA EILDSDVLDRPLNIMIPKFLQFFKHSSPKIRSHAVACVNQFIISRTQALMLHIDSFTE NLFALAGDEEPEVRKNVCRALVMLLEVRMDRLLPHMHNIVEYMLQRTQDQDENVALEA CEFWLTLAEQPICKDVLVRHLPKLIPVLVNGMKYSDIDIILLKGDVEEDETIPDSEQD IRPRFHRSRTVAQQHDEDGIEEEDDDDDEIDDDDTISDWNLRKCSAAALDVLANVYRD ELLPHILPLLKELLFHHEWVVKESGILVLGAIAEGCMQGMIPYLPELIPHLIQCLSDK KALVRSITCWTLSRYAHWVVSQPPDTYLKPLMTELLKRILDSNKRVQEAACSAFATLE EEACTELVPYLAYILDTLVFAFSKYQHKNLLILYDAIGTLADSVGHHLNKPEYIQMLM PPLIQKWNMLKDEDKDLFPLLECLSSVATALQSGFLPYCEPVYQRCVNLVQKTLAQAM LNNAQPDQYEAPDKDFMIVALDLLSGLAEGLGGNIEQLVARSNILTLMYQCMQDKMPE VRQSSFALLGDLTKACFQHVKPCIADFMPILGTNLNPEFISVCNNATWAIGEISIQMG IEMQPYIPMVLHQLVEIINRPNTPKTLLENTAITIGRLGYVCPQEVAPMLQQFIRPWC TSLRNIRDNEEKDSAFRGICTMISVNPSGVIQDFIFFCDAVASWINPKDDLRDMFCKI LHGFKNQVGDENWRRFSDQFPLPLKERLAAFYGV" BASE COUNT 899 a 625 c 685 g 845 t ORIGIN 1 agcatttcag gccccggaca ggaggcagtg ccgcttcggc cgaaggccga gccgcccgag 61 ggctctggga tggtgtggga ccggcaaacc aagatggagt atgagtggaa acctgacgag 121 caagggcttc agcaaatcct gcagctgttg aaggagtccc agtccccaga caccaccatc 181 cagagaaccg tgcaacaaaa actggaacaa cttaatcagt atccagactt taacaactac 241 ttgatttttg ttcttacaaa attaaaatct gaagatgaac ccacaagatc attgagtggt 301 cttatcttga agaataatgt gaaagcacac tttcagaact tcccaaatgg tgtaacagac 361 tttattaaaa gtgaatgttt aaataatatt ggtgactcct ctcctctgat tagagccact 421 gttggtattt tgatcacaac tatagcctcc aagggagaat tgcagaattg gcctgacctc 481 ttaccaaaac tctgtagcct gttggattct gaagattata atacctgtga gggagcattt 541 ggtgcccttc agaagatttg tgaagattct gctgagattt tagacagtga tgttttagat 601 cgtcctctca acatcatgat tcccaaattt ttacagttct tcaagcatag tagtccaaaa 661 ataaggtctc acgctgttgc atgtgtcaat cagtttatca tcagtaggac tcaagctcta 721 atgttgcaca ttgattcttt tactgagaat ctctttgcat tagctggtga tgaagaacca 781 gaggtacgga aaaatgtgtg ccgagcactt gtgatgttgc tcgaagttcg aatggatcgc 841 ctgcttcctc acatgcataa tatagttgag tacatgctac agaggactca agatcaagat 901 gaaaatgtgg ctttagaagc ctgtgaattt tggctaactt tagctgaaca gccaatatgc 961 aaagatgtac tcgtaaggca tcttcctaag ttgattcctg tgttagtgaa tggcatgaag 1021 tactcagaca tagatattat cctacttaag ggtgatgttg aagaagacga aacgattcct 1081 gatagtgaac aggatatacg gccacgtttt caccgatcga ggacggtggc tcagcagcat 1141 gatgaagatg gaattgaaga ggaagacgat gatgatgatg aaattgatga tgatgataca 1201 atttcagact ggaatctaag aaaatgttct gctgctgccc tggatgttct tgcaaatgtg 1261 tatcgtgatg aactgctgcc acatattttg ccccttttga aagaattact ttttcatcat 1321 gaatgggttg ttaaagaatc aggcattttg gttttaggag caattgctga aggttgcatg 1381 cagggcatga ttccatactt gcctgagctt attcctcacc ttattcagtg cctctctgat 1441 aaaaaggctc ttgtgcgttc cataacatgc tggactctta gccgctatgc acactgggtg 1501 gtcagccagc cgccagacac gtacctgaag ccattaatga cagaattgct aaagcgcatc 1561 ctggacagca acaagagagt acaagaagct gcctgcagtg cctttgctac cctagaagag 1621 gaggcttgta cagaacttgt tccttacctt gcttatatac ttgataccct ggtctttgca 1681 tttagtaaat accagcataa gaacctgctc attctttacg atgccatagg aacattagca 1741 gattcagtag gacatcattt aaacaaacca gaatatattc agatgctaat gcctccactg 1801 atccagaaat ggaacatgtt aaaggatgaa gataaagatc tcttcccttt acttgagtgc 1861 ctatcttcag ttgccacagc actgcagtct ggattccttc cgtactgtga acctgtgtat 1921 cagcgttgtg taaacctagt acagaagact cttgcacaag ccatgctaaa caatgctcaa 1981 ccagatcaat atgaagctcc agataaagat tttatgatag tggctcttga tttactgagt 2041 ggcctggctg aaggacttgg aggcaacatt gaacagctgg tagcccgaag taacatcctg 2101 acactaatgt atcagtgcat gcaggataaa atgccagaag ttcgacagag ttcttttgcc 2161 ctgttaggtg acctcacaaa agcttgcttt cagcatgtta agccttgtat agctgatttc 2221 atgccaatat tgggaaccaa cctaaatcca gaattcattt cagtctgcaa caatgccaca 2281 tgggcaattg gagaaatctc cattcaaatg ggtatagaga tgcagcctta tattcctatg 2341 gtgttgcacc agcttgtaga aatcattaac agacccaaca caccaaagac gttgttagag 2401 aatacagcaa taacaattgg tcgtcttggt tacgtttgtc ctcaagaggt ggcccccatg 2461 ctacagcagt ttataagacc ctggtgcacc tctctgagaa acataagaga caatgaggaa 2521 aaggattcag cattccgtgg aatttgtacc atgatcagtg tgaatcccag tggcgtaatc 2581 caagatttta tatttttttg tgatgccgtt gcatcatgga ttaacccaaa agatgatctc 2641 agagacatgt tctgtaagat ccttcatgga tttaaaaatc aagttggcga tgaaaattgg 2701 aggcgtttct ctgaccagtt tcctcttccc ttaaaagagc gtcttgcagc tttttatggt 2761 gtttaatcta atacacttaa gctgcagtcc caaaattagg ggtccttcag tcttggagac 2821 tataagggag cctctgcacc cagggaaaat gttacccttt acagggggga agggtaaacc 2881 agtagggaat acagtacaat cccaacccta ctgggagggg cgggagggag gtgttgccgt 2941 cactgtatta agtcgatgtt gggaaacgtt ttaacatctg gagcctttgt gggtggaaat 3001 atgtctccag ttacaactcc gcagtggatg tgaagaagca aaaaaaaaaa aaaa // LOCUS HSU70451 2682 bp mRNA PRI 21-MAR-1997 DEFINITION Human myleoid differentiation primary response protein MyD88 mRNA, complete cds. ACCESSION U70451 NID g1763090 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2682) AUTHORS Hardiman,G., Rock,F.L., Balasubramanian,S., Kastelein,R.A. and Bazan,J.F. TITLE Molecular characterization and modular analysis of human MyD88 JOURNAL Oncogene 13 (11), 2467-2475 (1996) MEDLINE 97115998 REFERENCE 2 (bases 1 to 2682) AUTHORS Hardiman,G. and Bazan,J.F. TITLE Direct Submission JOURNAL Submitted (11-SEP-1996) Molecular Biology, DNAX Research Institute, 901 California Ave., Palo Alto, CA 94304-1104, USA FEATURES Location/Qualifiers source 1..2682 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="dendrite" CDS 33..923 /note="myleoid differentiation primary response protein" /codon_start=1 /product="MyD88" /db_xref="PID:g1763091" /translation="MAAGGPGAGSAAPVSSTSSLPLAALNMRVRRRLSLFLNVRTQVA ADWTALAEEMDFEYLEIRQLETQADPTGRLLDAWQGRPGASVGRLLELLTKLGCDDVL LELGPSIEEDCQKYILKQQQEEAEKPLQVAAVDSSVPRTAELAGITTLDDPLGHMPER FDAFICYCPSDIQFVQEMIRQLEQTNYRLKLCVSDRDVLPGTCVWSIASELIEKRCRR MVVVVSDDYLQSKECDFQTKFALSLSPGAHQKRLIPIKYKAMKKEFPSILRFITVCDY TNPCTKSWFWTRLAKALSLP" BASE COUNT 632 a 748 c 657 g 645 t ORIGIN 1 gggtagaccc acgagtccgc ccacgggtct gcatggctgc aggaggtccc ggcgcggggt 61 ctgcggcccc ggtctcctcc acatcctccc ttcccctggc tgctctcaac atgcgagtgc 121 ggcgccgcct gtctctgttc ttgaacgtgc ggacacaggt ggcggccgac tggaccgcgc 181 tggcggagga gatggacttt gagtacttgg agatccggca actggagaca caagcggacc 241 ccactggcag gctgctggac gcctggcagg gacgccctgg cgcctctgta ggccgactgc 301 tcgagctgct taccaagctg ggctgcgacg acgtgctgct ggagctggga cccagcattg 361 aggaggattg ccaaaagtat atcttgaagc agcagcagga ggaggctgag aagcctttac 421 aggtggccgc tgtagacagc agtgtcccac ggacagcaga gctggcgggc atcaccacac 481 ttgatgaccc cctggggcat atgcctgagc gtttcgatgc cttcatctgc tattgcccca 541 gcgacatcca gtttgtgcag gagatgatcc ggcaactgga acagacaaac tatcgactga 601 agttgtgtgt gtctgaccgc gatgtcctgc ctggcacctg tgtctggtct attgctagtg 661 agctcatcga aaagaggtgc cgccggatgg tggtggttgt ctctgatgat tacctgcaga 721 gcaaggaatg tgacttccag accaaatttg cactcagcct ctctccaggt gcccatcaga 781 agcgactgat ccccatcaag tacaaggcaa tgaagaaaga gttccccagc atcctgaggt 841 tcatcactgt ctgcgactac accaacccct gcaccaaatc ttggttctgg actcgccttg 901 ccaaggcctt gtccctgccc tgaagactgt tctgaggccc tgggtgtgtg tgtatctgtc 961 tgcctgtcca tgtacttctg ccctgcctcc tcctttcgtt gtaggaggaa tctgtgctct 1021 acttacctct caattcctgg agatgccaac ttcacagaca cgtctgcagc agctggacat 1081 cacatttcat gtcctgcatg gaaccagtgg ctgtgagtgg catgtccact tgctggatta 1141 tcagccagga cactatagaa caggaccagc tgagactaag aaggaccagc agagccagct 1201 cagctctgag ccattcacac atcttcaccc tcagtttcct cacttgagga gtgggatggg 1261 gagaacagag agtagctgtg tttgaatccc tgtaggaaat ggtgaagcat agctctgggt 1321 ctcctggggg agaccaggct tggctgcggg agagctggct gttgctggac tacatgctgg 1381 ccactgctgt gaccacgaca ctgctggggc agcttcttcc acagtgatgc ctactgatgc 1441 ttcagtgcct ctgcacaccg cccattccac ttcctccttc cccacagggc aggtggggaa 1501 gcagtttggc ccagcccaag gagaccccac cttgagcctt atttcctaat gggtccacct 1561 ctcatctgca tctttcacac ctcccagctt ctgcccaacc ttcagcagtg acaagtcccc 1621 aagagactcg cctgagcagc ttgggctgct tttcatttcc acctgtcagg atgcctgtgg 1681 tcatgctctc agctccacct ggcatgagaa gggatcctgg cctctggcat attcatcaag 1741 tatgagttct ggggatgagt cactgtaatg atgtgagcag ggagccttcc tccctgggcc 1801 acctgcagag agctttccca ccaactttgt accttgattg ccttacaaag ttatttgttt 1861 acaaacagcg accatataaa agcctcctgc cccaaagctt gtgggcacat gggcacatac 1921 agactcacat acagacacac acatatatgt acagacatgt actctcacac acacaggcac 1981 cagcatacac acgtttttct aggtacagct cccaggaaca gctaggtggg aaagtcccat 2041 cactgaggga gcctaaccat gtccctgaac aaaaattggg cactcatcta ttccttttct 2101 cttgtgtccc tactcattga aaccaaactc tggaaaggac ccaatgtacc agtatttata 2161 cctctaatga agcacagaga gaggaagaga gctgcttaaa ctcacacaac aatgaactgc 2221 agacacagct gttctctccc tctctccttc ccagagcaat ttatacttta ccctcaggct 2281 gtcctctggg gagaaggtgc catggtctta ggtgtctgtg ccccaggaca gaccctagga 2341 ccctaaatcc aatagaaaat gcatatcttt gctccacttt cagccaggct ggagcaaggt 2401 accttttctt aggatcttgg gagggaatgg atgcccctct ctgcatgatc ttgttgaggc 2461 atttagctgc catgcacctg tcccccttta atactgggca ttttaaagcc atctcaagag 2521 gcatcttcta catgttttgt acgcattaaa ataatttcaa agatatctga gaaaagccga 2581 tatttgccat tcttcctata tcctggaata tatcttgcat cctgagttta taataataaa 2641 taatattcta ccttggaaaa aaaaaaaaaa aaaaaaaaaa aa // LOCUS HSU70660 502 bp mRNA PRI 20-APR-1997 DEFINITION Human copper transport protein HAH1 (HAH1) mRNA, complete cds. ACCESSION U70660 NID g1945364 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 502) AUTHORS Klomp,L.W., Lin,S.J., Yuan,D.S., Klausner,R.D., Culotta,V.C. and Gitlin,J.D. TITLE Identification and functional expression of HAH1, a novel human gene involved in copper homeostasis JOURNAL J. Biol. Chem. 272 (14), 9221-9226 (1997) MEDLINE 97238857 REFERENCE 2 (bases 1 to 502) AUTHORS Klomp,L.W.J. and Gitlin,J.D. TITLE Direct Submission JOURNAL Submitted (12-SEP-1996) Pediatrics, Washington University, One Children's Place, St. Louis, MO 63110, USA FEATURES Location/Qualifiers source 1..502 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /chromosome="5" /map="5q32-33" 5'UTR 1..113 gene 114..320 /gene="HAH1" CDS 114..320 /gene="HAH1" /note="human ATX1 homolog" /codon_start=1 /product="copper transport protein HAH1" /db_xref="PID:g1945365" /translation="MPKHEFSVDMTCGGCAEAVSRVLNKLGGVKYDIDLPNKKVCIES EHSMDTLLATLKKTGKTVSYLGLE" 3'UTR 321..502 polyA_signal 479..484 BASE COUNT 102 a 149 c 142 g 109 t ORIGIN 1 ctgtcgccct gcacggtgac ccgcgtgtgc gaggccttca tggccaggat ccgggtggag 61 aggcgctgct gacaccgccg ccacaccgcc gccacaccgc cgctgcctca gtcatgccga 121 agcacgagtt ctctgtggac atgacctgtg gaggctgtgc tgaagctgtc tctcgggtcc 181 tcaataagct tggaggagtt aagtatgaca ttgacctgcc caacaagaag gtctgcattg 241 aatctgagca cagcatggac actctgcttg caaccctgaa gaaaacagga aagactgttt 301 cctaccttgg ccttgagtag caggggcctg gtccccacag cccacaggat ggaccaaagg 361 gggcaggatg ctgatcctcc cgctggcttc cagacagacc tgggacttgg cagtcatgcc 421 gggtgatcgt gttcctgcgg agaccctcag ttgtcctatt ccttcctagc ttccctgcaa 481 taaatcaagc tgcttttgtt gg // LOCUS HSU70663 1953 bp mRNA PRI 04-MAR-1997 DEFINITION Human zinc finger transcription factor hEZF (EZF) mRNA, complete cds. ACCESSION U70663 NID g1857160 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1953) AUTHORS Garrett-Sinha,L.A. and de Crombrugghe,B. TITLE Identification and Characterization of the Human Homolog of EZF JOURNAL Unpublished REFERENCE 2 (bases 1 to 1953) AUTHORS Garrett-Sinha,L.A. and de Crombrugghe,B. TITLE Direct Submission JOURNAL Submitted (12-SEP-1996) Molecular Genetics, M.D. Anderson Cancer Center, 1515 Holcombe Blvd, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..1953 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="9" /map="9q31" /tissue_type="placenta" gene 504..1916 /gene="EZF" CDS 504..1916 /gene="EZF" /note="homolog of the mouse epithelial zinc finger; zinc finger transcription factor" /codon_start=1 /product="hEZF" /db_xref="PID:g1857161" /translation="MAVSDALLPSFSTFASGPAGREKTLRQAGAPNNRWREELSHMKR LPPVLPGRPYDLAAATVATDLESGGAGAACGGSNLAPLPRRETEEFNDLLDLDFILSN SLTHPPESVAATVSSSASASSSSSPSSSGPASAPSTCSFTYPIRAGNDPGVAPGGTGG GLLYGRESAPPPTAPFNLADINDVSPSGGFVAELLRPELDPVYIPPQQPQPPGGGLMG KFVLKASLSAPGSEYGSPSVISVSKGSPDGSHPVVVAPYNGGPPRTCPKIKQEAVSSC THLGAGPPLSNGHRPAAHNFPLGRQLPSRSTPTLGFEEVLSSRECHPALPLPPGFHPH PGPNYPSFLPDQMQPQVPPLHYQELMPPGSCMPEEPKPKRGRRSWPRKRTATHTCDYA GCGKTYTKSSHLKAHLRTHTGEKPYHCDWDGCGWKFARSDELTRHYRKHTGHRPFQCQ KCDRAFSRSDHLALHMKRHF" BASE COUNT 368 a 687 c 557 g 341 t ORIGIN 1 cgctcccacc cgcccgtggc ccgcgcccat ggccgcgcgc gctccacaca actcaccgga 61 gtccgcgccc tgcgccgccg accagttcgc agctccgcgc cacggcagcc agtctcacct 121 ggcggcaccg cccgcccacc gccccggcca cagcccctgc gcccacggca gcaatcgagg 181 cgaccgcgac agtggtgggg gacgctgctg agtggaagag agcgcagccc ggccaccgga 241 cctacttact cgccttgctg attgtctatt tttgcgttta caacttttct aagaactttt 301 gtatacaaag gaacttttta aaaaagacgc ttccaagtta tatttaatcc aaagaagaag 361 gatctcggcc aatttggggt tttgggtttt ggcttcgttt tttctcttcg ttgactttgg 421 ggttcaggtg ccccagctgc ttcgggctgc cgaggacctt ctgggccccc acattaatga 481 ggcagccacc tggcgagtct gacatggctg tcagcgacgc gctgctccca tctttctcca 541 cgttcgcgtc tggcccggcg ggaagggaga agacactgcg tcaagcaggt gccccgaata 601 accgctggcg ggaggagctc tcccacatga agcgacttcc cccagtgctt cccggccgcc 661 cctatgacct ggcggcggcg accgtggcca cagacctgga gagcggcgga gccggtgcgg 721 cttgcggcgg tagcaacctg gcgcccctac ctcggagaga gaccgaggag ttcaacgatc 781 tcctggacct ggactttatt ctctccaatt cgctgaccca tcctccggag tcagtggccg 841 ccaccgtgtc ctcgtcagcg tcagcctcct cttcgtcgtc gccgtcgagc agcggccctg 901 ccagcgcgcc ctccacctgc agcttcacct atccgatccg ggccgggaac gacccgggcg 961 tggcgccggg cggcacgggc ggaggcctcc tctatggcag ggagtccgct ccccctccga 1021 cggctccctt caacctggcg gacatcaacg acgtgagccc ctcgggcggc ttcgtggccg 1081 agctcctgcg gccagaattg gacccggtgt acattccgcc gcagcagccg cagccgccag 1141 gtggcgggct gatgggcaag ttcgtgctga aggcgtcgct gagcgcccct ggcagcgagt 1201 acggcagccc gtcggtcatc agcgtcagca aaggcagccc tgacggcagc cacccggtgg 1261 tggtggcgcc ctacaacggc gggccgccgc gcacgtgccc caagatcaag caggaggcgg 1321 tctcttcgtg cacccacttg ggcgctggac cccctctcag caatggccac cggccggctg 1381 cacacaactt ccccctgggg cggcagctcc ccagcaggag taccccgacc ctgggttttg 1441 aggaagtgct gagcagcagg gaatgtcacc ctgccctgcc gcttcctccc ggcttccatc 1501 cccacccggg gcccaattac ccatccttcc tgcccgatca gatgcagccg caagtcccgc 1561 cgctccatta ccaagagctc atgccacccg gttcctgcat gccagaggag cccaagccaa 1621 agaggggaag acgatcgtgg ccccggaaaa ggaccgccac ccacacttgt gattacgcgg 1681 gctgcggcaa aacctacaca aagagttccc atctcaaggc acacctgcga acccacacag 1741 gtgagaaacc ttaccactgt gactgggacg gctgtggatg gaaattcgcc cgctcagatg 1801 aactgaccag gcactaccgt aaacacacgg ggcaccgccc gttccagtgc caaaaatgcg 1861 accgagcatt ttccaggtcg gaccacctcg ccttacacat gaagaggcat ttttaaatcc 1921 cagacagtgg atatgaccca cactgccaga aga // LOCUS HSU70735 1060 bp mRNA PRI 08-SEP-1997 DEFINITION Human 34 kDa mov34 isologue mRNA, complete cds. ACCESSION U70735 NID g2360944 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1060) AUTHORS Asano,K., Vornlocher,H.-P., Richter-Cook,N.J., Merrick,W.C., Hinnebusch,A.G. and Hershey,J.W.B. TITLE Structure of cDNAs encoding human eIF3 subunits: Possible roles in RNA binding and macromolecular assembly JOURNAL J. Biol. Chem. (1997) In press REFERENCE 2 (bases 1 to 1060) AUTHORS Asano,K., Hershey,J.W.B. and Hinnebusch,A.G. TITLE Direct Submission JOURNAL Submitted (12-SEP-1996) Laboratory of Eukaryotic Gene Regulation, National Institute of Child Health and Human Development, Bldg. 6B, Rm. 3B315, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..1060 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 44..937 /note="Mov34 homolog" /codon_start=1 /product="34 kDa Mov34 isologue" /db_xref="PID:g2360945" /translation="MACGVTGSVSVALHPLVILNISDHWIRMRSQEGRPVQVIGALIG KQEGRNIEVMNSFELLSHTVEEKIIIDKEYYYTKEEQFKQVFKELEFLGWYTTGGPPD PSDIHVHKQVCEIIESPLFLKLNPMTKHTDLPVSVFESVIDIINGEATMLFAELTYTL ATEEAERIGVDHVARMTATGSGENSTVAEHLIAQHSAIKMLHSRVKLILEYVKASEAG EVPFNHEILREAYALCHCLPVLSTDKFKTDFYDQCNDVGLMAYLGTITKTCNTMNQFV NKFNVLYDRQGIGRRMRGLFF" BASE COUNT 269 a 270 c 294 g 227 t ORIGIN 1 caagtttggc accggggtgg atgcagcagt agtccccagc gtgatggcct gcggagtgac 61 tgggagtgtt tccgtcgctc tccatcccct tgtcattctc aacatctcag accactggat 121 ccgcatgcgc tcccaggagg ggcggcctgt gcaggtgatt ggggctctga ttggcaagca 181 ggagggccga aatatcgagg tgatgaactc ctttgagctg ctgtcccaca ccgtggaaga 241 gaagattatc attgacaagg aatattatta caccaaggag gagcagttta aacaggtgtt 301 caaggagctg gagtttctgg gttggtatac cacagggggg ccacctgacc cctcggacat 361 ccacgtccat aagcaggtgt gtgagatcat cgagagcccc ctctttctga agttgaaccc 421 tatgaccaag cacacagatc ttcctgtcag cgtttttgag tctgtcattg atataatcaa 481 tggagaggcc acaatgctgt ttgctgagct gacctacact ctggccacag aggaagcgga 541 acgcattggt gtagaccacg tagcccgaat gacagcaaca ggcagtggag agaactccac 601 tgtggctgaa cacctgatag cacagcacag cgccatcaag atgctgcaca gccgcgtcaa 661 gctcatcttg gagtacgtca aggcctctga agcgggagag gtccccttta atcatgagat 721 cctgcgggag gcctatgctc tgtgtcactg tctcccggtg ctcagcacag acaagttcaa 781 gacagatttt tatgatcaat gcaacgacgt ggggctcatg gcctacctcg gcaccatcac 841 caaaacgtgc aacaccatga accagtttgt gaacaagttc aatgtcctct acgaccgaca 901 aggcatcggc aggagaatgc gcgggctctt tttctgatga gggtacttga agggctgatg 961 gacaggggtc aggcaactat ccaaagggga gggcactaca cttccttgag agaaaccgct 1021 gtcattaata aaaggggagc agcccctgag caccgaaaaa // LOCUS HSU70862 1299 bp mRNA PRI 02-APR-1997 DEFINITION Human nuclear factor I B3 mRNA, complete cds. ACCESSION U70862 NID g1916623 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1299) AUTHORS Liu,Y.-C., Bernard,H.-U. and Apt,D. TITLE NFI-B3, a novel transcriptional repressor of the NFI family, is generated by alternative RNA processing JOURNAL Unpublished REFERENCE 2 (bases 1 to 1299) AUTHORS Liu,Y.-C., Bernard,H.-U. and Apt,D. TITLE Direct Submission JOURNAL Submitted (13-SEP-1996) Institute of Molecular and Cell Biology, National University of Singapore, 10 Kent Ridge Crescent, Singapore 119260, Republic of Singapore FEATURES Location/Qualifiers source 1..1299 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="9" /map="9p24.1" /tissue_type="foreskin" /cell_type="fibroblast" /clone_lib="Clontech HL 1052a" CDS 458..1024 /note="NFI-B3" /codon_start=1 /product="nuclear factor I B3" /db_xref="PID:g1916624" /translation="MMYSPICLTQDEFHPFIEALLPHVRAIAYTWFNLQARKRKYFKK HEKRMSKDEERAVKDELLSEKPEIKQKWASRLLAKLRKDIRQEYREDFVLTVTGKKHP CCVLSNPDQKGKIRRIDCLRQADKVWRLDLVMVILFKGIPLESTDGERLMKSPHCTNP ALCVQPHHITVSVKELDLFLAYYVQEAR" BASE COUNT 393 a 320 c 288 g 298 t ORIGIN 1 tcccgccgcc cccctccccg ccttttttga aaaaaagcat tttaccacca accaccaccc 61 caatccaacc cacaccgaac cttcgcgcac cccctacacc caacaacaac aacaacaact 121 gcaaaataga aaacaaatcc ccaaacccag gcgaaaagca gccaacaccg gcggcggcgg 181 cggcctcggc aagcacggcc agcgcgctcg gactgcaaga gggttaaaag tgtagattgg 241 atttcacccc tggaactcta gcacgccgag tgaacttgaa tctttggcta tttaaggagg 301 attgggtttg ttgtgaagtt gcggtgatcc agcgcagagc cccgtcctga ttgatcgcat 361 cgcggggctc agatgactgt aaaatgaata gatgaaattc ttgcttctcg aagattttct 421 tgggcatctc ccggaaagtg ccttttaagg cgaagtcatg atgtattctc ccatctgtct 481 cactcaggat gaatttcacc cattcatcga ggcacttctt ccacatgtcc gtgcaattgc 541 ctatacttgg ttcaacctgc aggctcgaaa acgcaagtac tttaaaaagc atgagaagcg 601 aatgtcaaag gatgaagaaa gagcagtcaa agatgagctt ctcagtgaaa agcccgaaat 661 caaacagaag tgggcatcca ggctccttgc caaactgcgc aaagatattc gccaggagta 721 tcgagaggac tttgtgctca ccgtgactgg caagaagcac ccgtgctgtg tcttatccaa 781 tcccgaccag aagggtaaga ttaggagaat cgactgcctg cgacaggcag acaaagtctg 841 gcgtctggat ctagtcatgg tgatcctgtt caaaggcatc cccttggaaa gtaccgatgg 901 agagcggctc atgaaatccc cacattgcac aaacccagca ctttgtgtcc agccacatca 961 tatcacagta tcagttaagg agcttgattt gtttttggca tactacgtgc aggaggcaag 1021 gtaagaattt cttttcttca aataaatgca tcaagatata ggtgtccttg cattaccaac 1081 aatccatagt aaaacaatca ttttacaaag ccgcatacca aaagtagact aaagtttgag 1141 ctgtcacaga tggtaggtca tttccaaact tggcctggtg ggaggaaaaa tggcctgcag 1201 atagccaggt gacagactgt tacctgaggt ttgttatata aatgtgggac ttaataaact 1261 gctatgagct ttattattca aaaaaaaaaa aaaaaaaaa // LOCUS HSU70867 4048 bp mRNA PRI 11-OCT-1996 DEFINITION Human prostaglandin transporter hPGT mRNA, complete cds. ACCESSION U70867 NID g1617589 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4048) AUTHORS Lu,R., Kanai,N., Bao,Y. and Schuster,V.L. TITLE Cloning, in vitro expression, and tissue distribution of a human prostaglandin transporter cDNA(hPGT) JOURNAL J. Clin. Invest. 98 (5), 1142-1149 (1996) MEDLINE 96379664 REFERENCE 2 (bases 1 to 4048) AUTHORS Lu,R. and Schuster,V.L. TITLE Direct Submission JOURNAL Submitted (13-SEP-1996) Medicine, Albert Einstein College of Medicine, Yeshiva Univ., 1300 Morris Park Ave., Bronx, NY 10461, USA FEATURES Location/Qualifiers source 1..4048 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 92..2023 /note="membrane transporter" /codon_start=1 /product="prostaglandin transporter hPGT" /db_xref="PID:g1617590" /translation="MGLLPKLGVSQGSDTSTSRAGRCARSVFGNIKVFVLCQGLLQLC QLLYSAYFKSSLTTIEKRFGLSSSSSGLISSLNEISNAILIIFVSYFGSRVHRPRLIG IGGLFLAAGAFILTLPHFLSEPYQYTLASTGNNSRLQAELCQKHWQDLPPSKCHSTTQ NPQKETSSMWGLMVVAQLLAGIGTVPIQPFGISYVDDFSEPSNSPLYISILFAISVFG PAFGYLLGSIMLQIFVDYGRVNTAAVNLVPGDPRWIGAWWLGLLISSALLVLTSFPFF FFPRAMPIGAKRAPATADEARKLEEAKSRGSLVDFIKRFPCIFLRLLMNSLFVLVVLA QCTFSSVIAGLSTFLNKFLEKQYGTSAAYANFLIGAVNLPAAALGMLFGGILMKRFVF SLQTIPRIATTIITISMILCVPLFFMGCSTPTVAEVYPPSTSSSIHPQSPACRRDCSC PDSIFHPVCGDNGIEYLSPCHAGCSNINMSSATSKQLIYLNCSCVTGGSASAKTGSCP VPCAHFLLPAIFLISFVSLIACISHNPLYMMVLRVVNQEEKSFAIGVQFLLMRLLAWL PSPALYGLTIDHSCIRWNSLCLGRRGACAYYDNDALRDRYLGLQMGYKALGMLLLCFI SWRVKKNKEYNVQKAAGLI" BASE COUNT 876 a 1223 c 980 g 969 t ORIGIN 1 aattccgggt cgcctctcac ccgccccggc cgctccagcc cgaggcgccc cgaccccgcg 61 ccactccgcg cccggccagc cgcccgcagc catggggctc ctgcccaagc tcggcgtgtc 121 ccagggcagc gacacctcta ctagccgagc cggccgctgt gcccgctcgg tcttcggcaa 181 cattaaggtg tttgtgctct gccaaggcct cctgcagctc tgccaactcc tgtacagcgc 241 ctacttcaag agcagcctca ccaccattga gaagcgcttt gggctctcca gttcttcatc 301 gggtctcatt tccagcttga atgagatcag caatgccatc ctcatcatct ttgtcagcta 361 ctttggcagc cgggtgcacc gtccacgtct gattggcatc ggaggtctct tcctggctgc 421 aggtgccttc atcctcaccc tcccacactt cctctccgag ccctaccagt acaccttggc 481 cagcactggg aacaacagcc gcttgcaggc cgagctctgc cagaagcatt ggcaggacct 541 gcctcccagt aagtgccaca gcaccaccca gaacccccag aaggagacca gcagcatgtg 601 gggcctgatg gtggttgccc agctgctggc tggcatcggg acagtgccta ttcagccatt 661 tgggatctcc tatgtggatg acttctcaga gcccagcaac tcgcccctgt acatctccat 721 cttatttgcc atctctgtat ttggaccggc tttcgggtac ctgctgggct ctatcatgct 781 gcagatcttt gtggactatg gcagggtcaa cacagctgca gttaacttgg tcccgggtga 841 cccccgatgg attggagcct ggtggctagg cctgctcatt tcttcagctt tattggttct 901 cacctctttc cccttttttt tcttccctcg agcaatgccc ataggagcaa agagggctcc 961 tgccacagca gatgaagcaa ggaagttgga ggaggccaag tcaagaggct ccctggtgga 1021 tttcattaaa cggtttccat gcatctttct gaggctcctg atgaactcac tcttcgtcct 1081 ggtggtcctg gcccagtgca ccttctcctc cgtcattgct ggcctctcca ccttcctcaa 1141 caagttcctg gagaagcagt atggcacctc agcagcctat gccaacttcc tcattggtgc 1201 tgtgaacctc cctgctgcag ccttggggat gctgtttgga ggaatcctca tgaagcgctt 1261 tgttttctct ctacaaacca ttccccgcat agctaccacc atcatcacca tctccatgat 1321 cctttgtgtt cctttgttct tcatgggatg ctccacccca actgtggccg aagtctaccc 1381 ccctagcaca tcaagttcta tacatccgca gtctcctgcc tgccgcaggg actgctcgtg 1441 cccagattct atcttccacc cggtctgtgg agacaatgga atcgagtacc tctccccttg 1501 ccatgccggc tgcagcaaca tcaacatgag ctctgcaacc tccaagcaac tgatctattt 1561 gaactgcagc tgtgtgaccg ggggatccgc ttcagcaaag acaggatcgt gccctgtccc 1621 ctgtgcccac ttcctgctcc cggccatctt cctcatctcc ttcgtgtccc tgatagcctg 1681 catctcccac aaccccctct acatgatggt tctgcgtgtg gtgaaccagg aggaaaagtc 1741 atttgccatc ggggtgcagt tcttgttgat gcgcttgctg gcctggctgc catctccagc 1801 cctctatggc ctcaccattg accactcctg catccggtgg aactcgctgt gcttggggag 1861 gcgaggggcc tgcgcctact atgacaacga tgctctccga gacaggtacc tgggcctgca 1921 gatgggctac aaggcgctgg gcatgctgct gctttgcttc atcagctgga gggtgaagaa 1981 gaacaaggag tacaacgtgc agaaggcggc aggcctcatc tgaccccacc ctgggccact 2041 gcctgctcca gagagtggac cttgactctt ccacacctgc ctatactcac taatgttaac 2101 acgtcatttc ctttttgtat ttttaaacaa gaaagaaaac cccagtcctc atttgccttc 2161 cctacctctt cctcccagag tcctccccac agttcctaag ggccactgtg tacccgggct 2221 gtgtgggcca gaactggggg gctgagtctt ccctggcccc ttggaagagg cccccagatg 2281 cccaggctca cttcagtgtt gagtcctcca ttgaggatgc ccactgaggc agccaggccc 2341 ctcaccagcc ctggggggaa tcctaaacag agagagaaaa agggtatctg cccttcttgc 2401 caggcagctc cactctcccg ctgactgccc acaccctgca gagtggcagg ggtgaaagga 2461 agaaggaagt ggctgagtta ttaatagcca gagccactgg gagactgggg agactggctg 2521 taaccccctt cacacctggg tttggcatca gcacagacta cgggaggggc tggctccctc 2581 cccctcagac cctcacttcc tgtacctaga ggccattctg gatgctgcca tgttgggaag 2641 tacagtctct gcccattacc tgcatgcagg caccagagca gggactgaga aaccccaagg 2701 atgggtcatc taagtgctgt ccatatgaac cctggacttt ctgtccttag atcctcacat 2761 gttatccctg tctttctggg gtacgtttca aactgaggaa gctacaacac agtgaagacc 2821 caaggaaggc ctatgaaatg gtcctgatgc ccaacctccc accccttcaa tgtggggacg 2881 agaccccctc atctcagagt aatgggaaga acctcccaca tctccctggc agcagatgag 2941 gtggcttcac atgcacttcc ctgtctggac ttcagcccgt attccgagga gtagagaggc 3001 agaagagatg tcagcaaagc aagtgatgaa gcagagtgga tgtccactgt caccaagctg 3061 gatggcaagc tgcggcccac aaaacagcca gtcaggttgg ctttcctggt ttcagacatg 3121 ctcataccat tcccattttc tcagcctctt ctctgcctcc agagaggtgg atgcctgggt 3181 tgagagacac agctgctacg tgatagatgt tgagagacag aagccaacga aggaggtcat 3241 tcatcaacaa atatatttat tggagaccga ctttgtgcaa agcaatgcta atcagggttc 3301 tccatggagc ttccctcagc tcttacctca cctccctcca tttacattag ggccttctcc 3361 cagggtgtgc tcggtgggca gtgtgggact gggggtgtgg gagttggtga gagcaggagg 3421 agaggtgggg acagcaagaa gccacagatt ggcatgaagg atcctgacct gactatccat 3481 gccatccatg gcccccagac tgactctgca cctggccctt tgccagacag ctctgtctcc 3541 ccatgtcctc tggaacagct gggcatgggt catggccatt catgaccctt aagtgccacc 3601 cttcttggaa gaccccctcc agaagcatac tggaagccac ctctggaaaa gcctcatatg 3661 gtgatatgcc aaaatattta tgtcaatgtc caaacaaagt ccaatgccat gagactgaag 3721 tctttgtgga aaccactgtt acagacaagc ttatttccaa agccacctca tttccaaaca 3781 tctcactcag gaagggaggc tcaatgtaac ctcaggggcc agttttagca tttgaaatgg 3841 ttctgcttgg aaaatgatgc cctgcaacta accctggtct ttcccatggc aatttaacca 3901 catttggaag gcactgcctt cagctgagtt tatgaacaat gaatgccaac cttcaggttc 3961 tagaagattg gttgcactcc caaaccttta ttctattata ttactattaa aatattctaa 4021 ttttgctatt gaggtaaaaa aaaaaaaa // LOCUS HSU70987 1873 bp mRNA PRI 06-MAR-1997 DEFINITION Human GAP binding protein p62dok (DOK) mRNA, complete cds. ACCESSION U70987 NID g1794260 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1873) AUTHORS Carpino,N., Wisniewski,D., Strife,A., Marshak,D., Kobayashi,R., Stillman,B. and Clarkson,B. TITLE p62(dok): a constitutively tyrosine-phosphorylated, GAP-associated protein in chronic myelogenous leukemia progenitor cells JOURNAL Cell 88 (2), 197-204 (1997) MEDLINE 97160840 REFERENCE 2 (bases 1 to 1873) AUTHORS Carpino,N., Wisniewski,D., Strife,A., Marshak,D., Kobayashi,R., Stillman,B. and Clarkson,B. TITLE Direct Submission JOURNAL Submitted (16-SEP-1996) Cold Spring Harbor Laboratory, P.O. Box 100, Cold Spring Harbor, NY 11724, USA FEATURES Location/Qualifiers source 1..1873 /organism="Homo sapiens" /db_xref="taxon:9606" gene 23..1468 /gene="DOK" CDS 23..1468 /gene="DOK" /codon_start=1 /product="GAP binding protein p62dok" /db_xref="PID:g1794261" /translation="MDGAVMEGPLFLQSQRFGTKRWRKTWAVLYPASPHGVARLEFFD HKGSSSGGGRGSSRRLDCKVIRLAECVSVAPVTVETPPEPGATAFRLDTAQRSHLLAA DAPSSAAWVQTLCRNAFPKGSWTLAPTDNPPKLSALEMLENSLYSPTWEGSQFWVTVQ RTEAAERCGLHGSYVLRVEAERLTLLTVGAQSQILEPLLSWPYTLLRRYGRDKVMFSF EAGRRCPSGPGTFTFQTAQGNDIFQAVETAIHRQKAQGKAGQGHDVLRADSHEGEVAE GKLPSPPGPQELLDSPPALYAEPLDSLRIAPCPSQDSLYSDPLDSTSAQAGEGVQRKK PLYWDLYEHAQQQLLKAKLTDPKEDPIYDEPEGLAPVPPQGLYDLPREPKDAWWCQAR VKEEGYELPYNPATDDYAVPPPRSTKPLLAPKPQGPAFPEPGTATGSGIKSHNSALYS QVQKSGASGSWDCGLSRVGTDKTGVKSEGST" BASE COUNT 390 a 543 c 598 g 342 t ORIGIN 1 gcggaaggaa ccgccggggg ccatggacgg agcagtgatg gaagggccgc tttttttgca 61 gagtcagcgc tttgggacca agaggtggag gaagacctgg gccgtgctct acccggccag 121 tccccacggc gtagcgcggc tcgagttctt tgaccataag gggtcgagct ctgggggtgg 181 ccgagggagc tcgcgccgcc tggactgcaa agtgatccgt ctggctgagt gtgtgagtgt 241 ggcccccgtc accgtggaga ccccccctga gcccggcgcc actgccttcc gcctggacac 301 tgctcagcgc tcgcacctgc tggcggccga cgcgccgtcc agtgcagcct gggtgcagac 361 gctgtgccga aacgcctttc cgaaaggcag ctggactctg gcgcctaccg ataacccacc 421 taagctttct gccctggaga tgctggagaa ctccttgtac agccctacct gggaaggatc 481 ccaattctgg gtaacggtgc agaggactga ggccgccgag cgctgtggcc tgcatggctc 541 ctacgtgctg agggtggagg ctgaaaggct gactctcctg accgtggggg cccagagtca 601 gatactggag ccactcctgt cctggcccta cactctgttg cgtcgctatg gccgggacaa 661 ggtcatgttc tctttcgagg ccggccgccg ctgcccctca ggccctggaa ccttcacctt 721 ccagacggca cagggaaatg acatcttcca ggcagttgag actgccatcc accggcagaa 781 ggcccaggga aaggccggac aggggcacga tgttctcaga gctgactccc atgaagggga 841 ggtggcagag gggaagttgc cttccccacc tggcccccaa gagctcctcg acagtccccc 901 agccctgtat gctgagccct tagactccct gcgcattgct ccatgccctt cccaggactc 961 cctatactca gaccccttgg acagcacgtc tgctcaggca ggagagggag tacaacggaa 1021 gaaacctctc tattgggact tgtatgagca tgcgcagcag cagttgctga aggccaagct 1081 gacagacccc aaagaggatc ccatctatga tgaacctgag ggcctggccc cagtccctcc 1141 ccagggcctt tatgatctgc ctcgggagcc caaggatgca tggtggtgcc aagctcgggt 1201 gaaggaggag ggctatgagc tcccctacaa ccctgccact gatgactacg ctgtgccacc 1261 ccctcggagc acaaagcccc tccttgctcc caagccccag ggcccagcct tccctgaacc 1321 tggtactgca actggcagtg gcatcaaaag ccacaactca gccctgtaca gccaggtcca 1381 gaagagcggg gcctcaggga gctgggactg tgggctctct agagtaggga ctgacaagac 1441 tggggtcaag tcagagggct ctacctgaga aggacggcaa ggctgaggtg gctaaggggg 1501 accatgggga ggtggcacta gggatcaaag aagatggtta gaaccagcag aagccagagg 1561 gtgggagggg ccatgctgtg tgagaccagg ggaccagagg gatgggagag tcaagggaag 1621 gacaatccca ggaagtccta agaagtgggg cagatggcag ggctgaggat gggctctgca 1681 tcccccaaag ccatcccttc cctacttccc caaatgaagg gacggctgtg ggaccaggtc 1741 tgtggaaagt ggtgcatggt cagaatgggt gcagtttgag gggcctgtgt ggaggcctca 1801 gggagatgtt ggactgtgcc tggatcctta ctcctgcatt gttctttgcc agagacctat 1861 ttaaaaattt taa // LOCUS HSU71092 1877 bp DNA PRI 19-DEC-1996 DEFINITION Human somatostatin receptor-like protein (SLC1) gene, complete cds. ACCESSION U71092 NID g1737178 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1877) AUTHORS Kolakowski,L.F. Jr., Jung,B.P., Nguyen,T., Johnson,M.P., Lynch,K.R., Cheng,R., Heng,H.H., George,S.R. and O'Dowd,B.F. TITLE Characterization of a human gene related to genes encoding somatostatin receptors JOURNAL FEBS Lett. 398 (2-3), 253-258 (1996) MEDLINE 97131607 REFERENCE 2 (bases 1 to 1877) AUTHORS Kolakowski,L.F.J.r., Jung,B.P., Nguyen,T., Johnson,M.P., Lynch,K.R., Cheng,R., Heng,H.H.Q., George,S.R. and O'Dowd,B.F. TITLE Direct Submission JOURNAL Submitted (17-SEP-1996) Department of Pharmacology, University of Texas Health Science Center at San Antonio, 7703 Floyd Curl Drive, San Antonio, TX 78284-7764, USA FEATURES Location/Qualifiers source 1..1877 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="22" /map="22q13.3" repeat_region 237..260 /rpt_family="dinucleotide repeat" /rpt_type=tandem /rpt_unit=CA gene 382..1590 /gene="SLC1" CDS 382..1590 /gene="SLC1" /note="G-protein-coupled receptor" /codon_start=1 /product="somatostatin receptor-like protein" /db_xref="PID:g1737179" /translation="MLCPSKTDGSGHSGRIHQETHGEGKRDKISNSEGRENGGRGFQM NGGSLEAEHASRMSVLRAKPMSNSQRLLLLSPGSPPRTGSISYINIIMPSVFGTICLL GIIGNSTVIFAVVKKSKLHWCNNVPDIFIINLSVVDLLFLLGMPFMIHQLMGNGVWHF GETMCTLITAMDANSQFTSTYILTAMAIDRYLATVHPISSTKFRKPSVATLVICLLWA LSFISITPVWLYARLIPFPGGAVGCGIRLPNPDTDLYWFTLYQFFLAFALPFVVITAA YVRILQRMTSSVAPASQRSIRLRTKRVTRTAIAICLVFFVCWAPYYVLQLTQLSISRP TLTFVYLYNAAISLGYANSCLNPFVYIVLCETFRKRLVLSVKPAAQGQLRAVSNAQTA DEERTESKGT" BASE COUNT 435 a 563 c 482 g 397 t ORIGIN 1 tccaacagac agtttctgtc tctgcttcac tcaagaagcc caggctcaga agataccaat 61 caaggaaatc cccgctagga agcctggggt agggagagct gctggcttga ccagggcaca 121 gccggcaaaa gcctctacaa gacagtcacc cacagatatg cccaagaatc agtacacagt 181 ttccaaccag agatctccaa aatgaaacac tcagggctac acataggaaa agcacgcaca 241 cacacacaca cacacacaca gacacttact tttgtgtcct tctggctatg ctgacgagtt 301 ttcctggtga agcccggggc tcacagagta atctctgcag acaactgtgg ttcttgcctc 361 tggtgcctgc aggaggcagg catgttgtgt ccttccaaga cagatggctc agggcactct 421 ggtaggattc accaggaaac tcatggagaa gggaaaaggg acaagattag caacagtgaa 481 gggagggaga atggtgggag aggattccag atgaacggtg ggtcgctgga ggctgagcat 541 gccagcagga tgtcagttct cagagcaaag cccatgtcaa acagccaacg cttgctcctt 601 ctgtccccag gatcacctcc tcgcacgggg agcatctcct acatcaacat catcatgcct 661 tcggtgttcg gcaccatctg cctcctgggc atcatcggga actccacggt catcttcgcg 721 gtcgtgaaga agtccaagct gcactggtgc aacaacgtcc ccgacatctt catcatcaac 781 ctctcggtag tagatctcct ctttctcctg ggcatgccct tcatgatcca ccagctcatg 841 ggcaatgggg tgtggcactt tggggagacc atgtgcaccc tcatcacggc catggatgcc 901 aatagtcagt tcaccagcac ctacatcctg accgccatgg ccattgaccg ctacctggcc 961 actgtccacc ccatctcttc cacgaagttc cggaagccct ctgtggccac cctggtgatc 1021 tgcctcctgt gggccctctc cttcatcagc atcacccctg tgtggctgta tgccagactc 1081 atccccttcc caggaggtgc agtgggctgc ggcatacgcc tgcccaaccc agacactgac 1141 ctctactggt tcaccctgta ccagtttttc ctggcctttg ccctgccttt tgtggtcatc 1201 acagccgcat acgtgaggat cctgcagcgc atgacgtcct cagtggcccc cgcctcccag 1261 cgcagcatcc ggctgcggac aaagagggtg acccgcacag ccatcgccat ctgtctggtc 1321 ttctttgtgt gctgggcacc ctactatgtg ctacagctga cccagttgtc catcagccgc 1381 ccgaccctca cctttgtcta cttatacaat gcggccatca gcttgggcta tgccaacagc 1441 tgcctcaacc cctttgtgta catcgtgctc tgtgagacgt tccgcaaacg cttggtcctg 1501 tcggtgaagc ctgcagccca ggggcagctt cgcgctgtca gcaacgctca gacggctgac 1561 gaggagagga cagaaagcaa aggcacctga tacttcccct gccaccctgc acacctccaa 1621 gtcagggcac cacaacacgc caccgggaga gatgctgaga aaaacccaag accgctcggg 1681 aaatgcagga aggccgggtt gtgaggggtt gttgcaatga aataaataca ttccatgggc 1741 tcacacgttg ctggggaggc ctggagtcag gtttggggtt ttcagatatc agaaatccct 1801 tgggggagca ggatgagacc tttggataga acagaagctg agcaagagaa catgttggtt 1861 tggataaccg gttgcac // LOCUS HSU71127 912 bp mRNA PRI 02-OCT-1996 DEFINITION Human low-Mr GTP-binding protein Rab32 (RAB32) mRNA, complete cds. ACCESSION U71127 NID g1575791 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 912) AUTHORS Burke,S. and Seabra,M.C. TITLE Cloning of novel Rab proteins with the yeast two-hybrid system JOURNAL Unpublished REFERENCE 2 (bases 1 to 912) AUTHORS Burke,S. and Seabra,M.C. TITLE Direct Submission JOURNAL Submitted (18-SEP-1996) Molecular Genetics, University Texas Southwestern Medical Center, 5323 Harry Hines Blvd., Dallas, TX 75235, USA FEATURES Location/Qualifiers source 1..912 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Hela" gene 14..691 /gene="RAB32" CDS 14..691 /gene="RAB32" /codon_start=1 /product="low Mr GTP-binding protein Rab32" /db_xref="PID:g1575792" /translation="MAGGGAGDPGLGAAAAPAPETREHLFKVLVIGELGVGKTSIIKR YVHQLFSQHYRATIGVDFALKVLNWDSRTLVRLQLWDIAGQERFGNMTRVYYKEAVGA FVVFDISRSSTFEAVLKWKSDLDSKVHLPNGSPIPAVLLANKCDQNKDSSQSPSQVDQ FCKEHGFAGWFETSAKDNINIEEAARFLVEKILVNHQSFPNEENDVDKIKLDQETLRA ENKSQCC" BASE COUNT 230 a 226 c 247 g 209 t ORIGIN 1 ggcacgagcg ctcatggcgg gcggaggagc cggggacccc ggcctggggg cggccgccgc 61 cccagcgccc gagacccgcg agcacctctt caaggtgctg gtgatcggcg agcttggcgt 121 gggcaagacc agcatcatca agcgctacgt ccaccagctc ttctcccagc actaccgggc 181 caccatcggg gtggacttcg ccctcaaggt cctcaactgg gacagcagga ctctggtgcg 241 cctgcagctg tgggacatcg cggggcagga gcgatttggc aacatgaccc gagtatacta 301 caaggaagct gttggtgctt ttgtagtctt tgatatatca agaagttcca catttgaggc 361 agtcttaaaa tggaaaagtg atctggatag taaagttcat cttccaaatg gcagccctat 421 ccctgctgtc ctcttggcta acaaatgtga ccagaacaag gacagtagcc agagtccttc 481 ccaggtggac caattctgca aagaacatgg ctttgccgga tggtttgaaa cctctgcaaa 541 ggataacata aacatagagg aagctgcccg gttcctagtg gagaagattc ttgtaaacca 601 ccaaagcttt cctaatgaag aaaacgatgt ggacaaaatt aagctagatc aagagacctt 661 gagagcagag aacaaatccc agtgttgctg atatatggct tctgcttctc ttgtgtgtgc 721 ctcagctctg aagaagttcc tgagaatggg ttacagatgt catgttagct gggagtcttc 781 ccacatgtgg cacttcaaaa ggcagcacca ctgggcgcct gcacttattt gaaaatggaa 841 ctttgggaga agtatccctg ctagtggctc tgtaacttaa cagatgacaa ttaggctttt 901 gtcattgttg cc // LOCUS HSU71285 3917 bp mRNA PRI 03-APR-1997 DEFINITION Human 5-methyltetrahydrofolate-homocysteine methyltransferase mRNA, complete cds. ACCESSION U71285 NID g1923220 KEYWORDS vitamin B12; cobalamin binding site; methionine synthase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3917) AUTHORS Leclerc,D., Campeau,E., Goyette,P., Adjalla,C.E., Christensen,B., Ross,M., Eydoux,P., Rosenblatt,D.S., Rozen,R. and Gravel,R.A. TITLE Human methionine synthase: cDNA cloning and identification of mutations in patients of the cblG complementation group of folate/cobalamin disorders JOURNAL Hum. Mol. Genet. 5 (12), 1867-1874 (1996) MEDLINE 97123490 REFERENCE 2 (bases 1 to 3917) AUTHORS Leclerc,D. TITLE Direct Submission JOURNAL Submitted (19-SEP-1996) Human Genetics, McGill University, Montreal Children's Hospital - Research Institute, Place Toulon, Room 222, 4060 Ste-Catherine West, Montreal H3Z 2Z3, Canada FEATURES Location/Qualifiers source 1..3917 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1q43" CDS 64..3861 /note="methionine synthase; cobalamin binding protein; cytosolic protein" /codon_start=1 /product="5-methyltetrahydrofolate-homocysteine methyltransferase" /db_xref="PID:g1923221" /translation="MSPALQDLSQPEGLKKTLRDEINAILQKRIMVLDGGMGTMIQRE KLNEEHFRGQEFKDHARPLKGNNDILSITQPDVIYQIHKEYLLAGADIIETNTFSSTS IAQADYGLEHLAYRMNMCSAGVARKAAEEVTLQTGIKRFVAGALGPTNKTLSVSPSVE RPDYRNITFDELVEAYQEQAKGLLDGGVDILLIETIFDTANAKAALFALQNLFEEKYA PRPIFISGTIVDKSGRTLSGQTGEGFVISVSHGEPLCIGLNCALGAAEMRPFIEIIGK CTTAYVLCYPNAGLPNTFGDYDETPSMMAKHLKDFAMDGLVNIVGGCCGSTPDHIREI AEAVKNCKPRVPPATAFEGHMLLSGLEPFRIGPYTNFVNIGERCNVAGSRKFAKLIMA GNYEEALCVAKVQVEMGAQVLDVNMDDGMLDGPSAMTRFCNLIASEPDIAKVPLCIDS SNFAVIEAGLKCCQGKCIVNSISLKEGEDDFLEKARKIKKYGAAMVVMAFDEEGQATE TDTKIRVCTRAYHLLVKKLGFNPNDIIFDPNILTIGTGMEEHNLYAINFIHATKVIKE TLPGARISGGLSNLSFSFRGMEAIREAMHGVFLYHAIKSGMDMGIVNAGNLPVYDDIH KELLQLCEDLIWNKDPEATEKLLRYAQTQGTGGKKVIQTDEWRNGPVEERLEYALVKG IEKHIIEDTEEARLNQKKYPRPLNIIEGPLMNGMKIVGDLFGAGKMFLPQVIKSARVM KKAVGHLIPFMEKEREETRVLNGTVEEEDPYQGTIVLATVKGDVHDIGKNIVGVVLGC NNFRVIDLGVMTPCDKILKAALDHKADIIGLSGLITPSLDEMIFVAKEMERLAIRIPL LIGGATTSKTHTAVKIAPRYSAPVIHVLDASKSVVVCSQLLDENLKDEYFEEIMEEYE DIRQDHYESLKERRYLPLSQARKSGFQMDWLSEPHPVKPTFIGTQVFEDYDLQKLVDY IDWKPFFDVWQLRGKYPNRGFPKIFNDKTVGGEARKVYDDAHNMLNTLISQKKLRARG VVGFWPAQSIQDDIHLYAEAAVPQAAEPIATFYGLRQQAEKDSASTEPYYCLSDFIAP LHSGIRDYLGLFAVACFGVEELSKAYEDDGDDYSSIMVKALGDRLAEAFAEELHERVR RELWAYCGSEQLDVADLRRLRYKGIRPAPGYPSQPDHTEKLTMWRLADIEQSTGIRLT ESLAMAPASAVSGLYFSNLKSKYFAVGKISKDQVEDYALRKNISVAEVEKWLGPILGY DTD" misc_feature 777..876 /note="encodes B12 binding region" mutation 2703..2705 /note="deletion of isoleucine (I881) in cblG patient" /replace="" variation 2819 /note="D919G polymorphism site; creates conversion of aspartic acid to glycine" /replace="g" mutation 2821 /note="mutation H920D in cblG patient; creates conversion of histidine to aspartic acid" /replace="g" BASE COUNT 1107 a 840 c 1006 g 964 t ORIGIN 1 cgtcacctgt ggagagcacg tcttctctgc cgcgccctct gcgcaaggag gagactcgac 61 aacatgtcac ccgcgctcca agacctgtcg caacccgaag gtctgaagaa aaccctgcgg 121 gatgagatca atgccattct gcagaagagg attatggtgc tggatggagg gatggggacc 181 atgatccagc gggagaagct aaacgaagaa cacttccgag gtcaggaatt taaagatcat 241 gccaggccgc tgaaaggcaa caatgacatt ttaagtataa ctcagcctga tgtcatttac 301 caaatccata aggaatactt gctggctggg gcagatatca ttgaaacaaa tacttttagc 361 agcactagta ttgcccaagc tgactatggc cttgaacact tggcctaccg gatgaacatg 421 tgctctgcag gagtggccag aaaagctgcc gaggaggtaa ctctccagac aggaattaag 481 aggtttgtgg caggggctct gggtccgact aataagacac tctctgtgtc cccatctgtg 541 gaaaggccgg attataggaa catcacattt gatgagcttg ttgaagcata ccaagagcag 601 gccaaaggac ttctggatgg cggggttgat atcttactca ttgaaactat ttttgatact 661 gccaatgcca aggcagcctt gtttgcactc caaaatcttt ttgaggagaa atatgctccc 721 cggcctatct ttatttcagg gacgatcgtt gataaaagtg ggcggactct ttccggacag 781 acaggagagg gatttgtcat cagcgtgtct catggagaac cactctgcat tggattaaat 841 tgtgctttgg gtgcagctga gatgagacct tttattgaaa taattggaaa atgtacaaca 901 gcctatgtcc tctgttatcc caatgcaggt cttcccaaca cctttggtga ctatgatgaa 961 acgccttcta tgatggccaa gcacctaaag gattttgcta tggatggctt ggtcaatata 1021 gttggaggat gctgtgggtc aacaccagat catatcaggg aaattgctga agctgtgaaa 1081 aattgtaagc ctagagttcc acctgccact gcttttgaag gacatatgtt actgtctggt 1141 ctagagccct tcaggattgg accgtacacc aactttgtta acattggaga gcgctgtaat 1201 gttgcaggat caaggaagtt tgctaaactc atcatggcag gaaactatga agaagccttg 1261 tgtgttgcca aagtgcaggt ggaaatggga gcccaggtgt tggatgtcaa catggatgat 1321 ggcatgctag atggtccaag tgcaatgacc agattttgca acttaattgc ttccgagcca 1381 gacatcgcaa aggtaccttt gtgcatcgac tcctccaatt ttgctgtgat tgaagctggg 1441 ttaaagtgct gccaagggaa gtgcattgtc aatagcatta gtctgaagga aggagaggac 1501 gacttcttgg agaaggccag gaagattaaa aagtatggag ctgctatggt ggtcatggct 1561 tttgatgaag aaggacaggc aacagaaaca gacacaaaaa tcagagtgtg cacccgggcc 1621 taccatctgc ttgtgaaaaa actgggcttt aatccaaatg acattatttt tgaccctaat 1681 atcctaacca ttgggactgg aatggaggaa cacaacttgt atgccattaa ttttatccat 1741 gcaacaaaag tcattaaaga aacattacct ggagccagaa taagtggagg tctttccaac 1801 ttgtccttct ccttccgagg aatggaagcc attcgagaag caatgcatgg ggttttcctt 1861 taccatgcaa tcaagtctgg catggacatg gggatagtga atgctggaaa cctccctgtg 1921 tatgatgata tccataagga acttctgcag ctctgtgaag atctcatctg gaataaagac 1981 cctgaggcca ctgagaagct cttacgttat gcccagactc aaggcacagg agggaagaaa 2041 gtcattcaga ctgatgagtg gagaaatggc cctgtcgaag aacgccttga gtatgccctt 2101 gtgaagggca ttgaaaaaca tattattgag gatactgagg aagccaggtt aaaccaaaaa 2161 aaatatcccc gacctctcaa tataattgaa ggacccctga tgaatggaat gaaaattgtt 2221 ggtgatcttt ttggagctgg aaaaatgttt ctacctcagg ttataaagtc agcccgggtt 2281 atgaagaagg ctgttggcca ccttatccct ttcatggaaa aagaaagaga agaaaccaga 2341 gtgcttaacg gcacagtaga agaagaggac ccttaccagg gcaccatcgt gctggccact 2401 gttaaaggcg acgtgcacga cataggcaag aacatagttg gagtagtcct tggctgcaat 2461 aatttccgag ttattgattt aggagtcatg actccatgtg ataagatact gaaagctgct 2521 cttgaccaca aagcagatat aattggcctg tcaggactca tcactccttc cctggatgaa 2581 atgatttttg ttgccaagga aatggagaga ttagctataa ggattccatt gttgattgga 2641 ggagcaacca cttcaaaaac ccacacagca gttaaaatag ctccgagata cagtgcacct 2701 gtaatccatg tcctggacgc gtccaagagt gtggtggtgt gttcccagct gttagatgaa 2761 aatctaaagg atgaatactt tgaggaaatc atggaagaat atgaagatat tagacaggac 2821 cattatgagt ctctcaagga gaggagatac ttacccttaa gtcaagccag aaaaagtggt 2881 ttccaaatgg attggctgtc tgaacctcac ccagtgaagc ccacgtttat tgggacccag 2941 gtctttgaag actatgacct gcagaagctg gtggactaca ttgactggaa gcctttcttt 3001 gatgtctggc agctccgggg caagtacccg aatcgaggct tccccaagat atttaacgac 3061 aaaacagtag gtggagaggc caggaaggtc tacgatgatg cccacaatat gctgaacaca 3121 ctgattagtc aaaagaaact ccgggcccgg ggtgtggttg ggttctggcc agcacagagt 3181 atccaagacg acattcacct gtacgcagag gctgctgtgc cccaggctgc agagcccata 3241 gccactttct atgggttaag gcaacaggct gagaaggact ctgccagcac ggagccatac 3301 tactgcctct cagacttcat cgctcccttg cattctggca tccgtgacta cctgggcctg 3361 tttgccgttg cctgctttgg ggtagaagag ctgagcaagg cctatgagga tgatggtgac 3421 gactacagca gcatcatggt caaggcgctg ggggaccggc tggcagaggc ctttgcagaa 3481 gagctccatg aaagagttcg ccgagaactg tgggcctact gtggcagtga gcagctggac 3541 gtcgcagacc tgcgaaggtt gcggtacaag ggcatccgcc cggctcctgg ctaccccagc 3601 cagcccgacc acaccgagaa gctcaccatg tggagactcg cagacatcga gcagtctaca 3661 ggcattaggt taacagaatc attagcaatg gcacctgctt cagcagtctc aggcctctac 3721 ttctccaatt tgaagtccaa atattttgct gtggggaaga tttccaagga tcaggttgag 3781 gattatgcat tgaggaagaa catatctgtg gctgaggttg agaaatggct tggacccatt 3841 ttgggatatg atacagacta actttttttt tttttgcctt ttttattctt gatgatcctc 3901 aaggaaatac aacctag // LOCUS HSU71300 1848 bp mRNA PRI 25-JAN-1997 DEFINITION Human snRNA activating protein complex 50kD subunit (SNAP50) mRNA, complete cds. ACCESSION U71300 NID g1619945 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1848) AUTHORS Henry,R.W., Ma,B., Sadowski,C.L., Kobayashi,R. and Hernandez,N. TITLE Cloning and characterization of SNAP50, a subunit of the snRNA-activating protein complex SNAPc JOURNAL EMBO J. 15 (24), 7129-7136 (1996) MEDLINE 97157503 REFERENCE 2 (bases 1 to 1848) AUTHORS Henry,R.W., Ma,B., Sadowski,C.L., Kobayashi,R. and Hernandez,N. TITLE Direct Submission JOURNAL Submitted (19-SEP-1996) Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, NY 11724, USA FEATURES Location/Qualifiers source 1..1848 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="fetal cell teratocarcinoma" gene 15..1250 /gene="SNAP50" CDS 15..1250 /gene="SNAP50" /function="transcription factor required for transcription of small nuclear RNAs" /codon_start=1 /product="snRNA activating protein complex 50kD subunit" /db_xref="PID:g1619946" /translation="MAEGSRGGPTCSGVGGRQDPVSGSGGCNFPEYELPELNTRAFHV GAFGELWRGRLRGAGDLSLREPPASALPGSQAADSDREDAAVARDLDCSLEAAAELRA VCGLDKLKCLEDGEDPEVIPENTDLVTLGVRKRFLEHREETITIDRACRQETFVYEME SHAIGKKPENSADMIEEGELILSVNILYPVIFHKHKEHKPYQTMLVLGSQKLTQLRDS IRCVSDLQIGGEFSNTPDQAPEHISKDLYKSAFFYFEGTFYNDKRYPECRDLSRTIIE WSESHDRGYGKFQTARMEDFTFNDLCIKLGFPYLYCHQGDCEHVIVITDIRLVHHDDC LDRTLYPLLIKKHWLWTRKCFVCKMYTARWVTNNDSFAPEDPCFFCDVCFRMLHYDSE GNKLGEFLAYPYVDPGTFN" BASE COUNT 530 a 372 c 431 g 513 t 2 others ORIGIN 1 gaattccggc gaacatggct gaaggaagcc gaggtggccc tacgtgtagc ggggtgggtg 61 gcaggcagga cccagtctcc ggcagtggcg gctgcaactt tccagagtat gagcttcccg 121 agctaaatac gcgcgctttc catgtgggcg cctttgggga gctgtggcgg ggccgtctgc 181 gcggggccgg ggacttgtcg ctgagggagc cgccggcatc cgctctgcct gggagccagg 241 cagctgactc cgaccgggag gatgccgcgg tggccaggga tctggactgc agcctggagg 301 cggcggctga gctgagggcg gtgtgcggcc ttgataaact gaaatgcctt gaggacggtg 361 aggatccaga agtcattccg gagaatactg acctggtgac tttgggggtt agaaaaaggt 421 tcttggaaca tcgggaagaa accattacaa tagatcgagc ctgcagacaa gaaacattcg 481 tttatgagat ggagtcacat gccataggaa aaaagcctga aaattcagca gacatgattg 541 aagaagggga gcttatccta tctgtgaata tcttgtaccc tgttatattt cataagcaca 601 aagaacacaa accataccaa acaatgctgg tgttgggcag tcaaaaactc acacaactga 661 gggattcaat tcgatgtgtc agtgacctcc agattggtgg tgaattcagc aacactcctg 721 accaagcccc tgagcacatc agcaaagacc tatacaaatc agccttcttt tattttgaag 781 gaacatttta taatgataaa agatacccag aatgcagaga tttgagcaga actatcattg 841 agtggtcaga gtcccatgat agaggctatg gaaagtttca gactgctaga atggaagatt 901 tcaccttcaa tgacttgtgt attaaactgg gttttcctta cttatactgt catcagggag 961 actgtgaaca tgtcattgtc attactgaca taaggcttgt gcatcatgat gactgcttgg 1021 ataggacatt gtatcccctc cttatcaaga agcattggct atggaccaga aaatgttttg 1081 tttgtaaaat gtatacagcc agatgggtga cgaacaatga cagttttgca ccagaggacc 1141 catgcttctt ttgtgatgtt tgcttccgaa tgctgcacta tgattcagaa ggcaacaaac 1201 tgggggaatt ccttgcttat ccttatgttg atcctggaac ctttaattaa gaatagctac 1261 actcacaaaa ataccccctc atgaaataac tgttctcttg gatggttacc ttatttctaa 1321 gaaacgccac tgaggaacag gatccacttt gaacagtccg ctaaagctat caaaaaaaag 1381 tccaaatgac agattttctt ataatgatag tatttaaatg tttataacat agtttaattt 1441 tatatttatt ccaagatagt atttaattta gcgtttttac ccattttgag ttgagttgta 1501 gtactttata tattctgact ttaaatcctt tgtcagacac acatattctt tctcccaatc 1561 catgccttcc ctattcattc tctgtccaga gttttttgct aaagatagaa ttattaatga 1621 tacatcaagt agtggaagtg ttttgaaaat tctttgaaga atgtgagagc tacaccttct 1681 accatgaggc ttccaaggta aagatccttc aaataactgt aatggaaact agggagaatg 1741 aattatttac aaataagaca naattaatag ctttgggatt atttattttt tgagacaaaa 1801 tctcgctctt gttgcccagg cctggaatac aanggcgcaa tctcagct // LOCUS HSU71321 1694 bp mRNA PRI 03-APR-1997 DEFINITION Human FK506-binding protein FKBP51 mRNA, complete cds. ACCESSION U71321 NID g1916640 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1694) AUTHORS Baughman,G., Wiederrecht,G.J., Chang,F., Martin,M.M. and Bourgeois,S. TITLE Tissue distribution and abundance of human FKBP51, and FK506-binding protein that can mediate calcineurin inhibition JOURNAL Biochem. Biophys. Res. Commun. 232 (2), 437-443 (1997) MEDLINE 97242207 REFERENCE 2 (bases 1 to 1694) AUTHORS Baughman,G., Wiederrecht,G.J., Chang,F., Martin,M.M. and Bourgeois,S. TITLE Direct Submission JOURNAL Submitted (19-SEP-1996) Regulatory Biology, The Salk Institute for Biological Studies, P.O. Box 8500, San Diego, CA 92186-5800, USA FEATURES Location/Qualifiers source 1..1694 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 154..1527 /function="FK506-binding protein" /codon_start=1 /product="FKBP51" /db_xref="PID:g1916641" /translation="MTTDEGAKNNEESPTATVAEQGEDITSKKDRGVLKIVKRVGNGE ETPMIGDKVYVHYKGKLSNGKKFDSSHDRNEPFVFSLGKGQVIKAWDIGVATMKKGEI CHLLCKPEYAYGSAGSLPKIPSNATLFFEIELLDFKGEDLFEDGGIIRRTKRKGEGYS NPNEGATVEIHLEGRCGGRMFDCRDVAFTVGEGEDHDIPIGIDKALEKMQREEQCILY LGPRYGFGEAGKPKFGIEPNAELIYEVTLKSFEKAKESWEMDTKEKLEQAAIVKEKGT VYFKGGKYMQAVIQYGKIVSWLEMEYGLSEKESKASESFLLAAFLNLAMCYLKLREYT KAVECCDKALGLDSANEKGLYRRGEAQLLMNEFESAKGDFEKVLEVNPQNKAARLQIS MCQKKAKEHNERDRRIYANMFKKFAEQDAKEEANKAMGKKTSEGVTNEKGTDSQAMEE EKPEGHV" BASE COUNT 530 a 333 c 484 g 347 t ORIGIN 1 gggccggctc gcgggcgctg ccagtctcgg gcggcggtgt ccggcgcgcg ggcggcctgc 61 tgggcgggct gaagggttag cggagcacgg gcaaggcgga gagtgacgga gtcggcgagc 121 ccccgcggcg acaggttctc tacttaaaag acaatgacta ctgatgaagg tgccaagaac 181 aatgaagaaa gccccacagc cactgttgct gagcagggag aggatattac ctccaaaaaa 241 gacaggggag tattaaagat tgtcaaaaga gtggggaatg gtgaggaaac gccgatgatt 301 ggagacaaag tttatgtcca ttacaaagga aaattgtcaa atggaaagaa gtttgattcc 361 agtcatgata gaaatgaacc atttgtcttt agtcttggca aaggccaagt catcaaggca 421 tgggacattg gggtggctac catgaagaaa ggagagatat gccatttact gtgcaaacca 481 gaatatgcat atggctcggc tggcagtctc cctaaaattc cctcgaatgc aactctcttt 541 tttgagattg agctccttga tttcaaagga gaggatttat ttgaagatgg aggcattatc 601 cggagaacca aacggaaagg agagggatat tcaaatccaa acgaaggagc aacagtagaa 661 atccacctgg aaggccgctg tggtggaagg atgtttgact gcagagatgt ggcattcact 721 gtgggcgaag gagaagacca cgacattcca attggaattg acaaagctct ggagaaaatg 781 cagcgggaag aacaatgtat tttatatctt ggaccaagat atggttttgg agaggcaggg 841 aagcctaaat ttggcattga acctaatgct gagcttatat atgaagttac acttaagagc 901 ttcgaaaagg ccaaagaatc ctgggagatg gataccaaag aaaaattgga gcaggctgcc 961 attgtcaaag agaagggaac cgtatacttc aagggaggca aatacatgca ggcggtgatt 1021 cagtatggga agatagtgtc ctggttagag atggaatatg gtttatcaga aaaggaatcg 1081 aaagcttctg aatcatttct ccttgctgcc tttctgaacc tggccatgtg ctacctgaag 1141 cttagagaat acaccaaagc tgttgaatgc tgtgacaagg cccttggact ggacagtgcc 1201 aatgagaaag gcttgtatag gaggggtgaa gcccagctgc tcatgaacga gtttgagtca 1261 gccaagggtg actttgagaa agtgctggaa gtaaaccccc agaataaggc tgcaagactg 1321 cagatctcca tgtgccagaa aaaggccaag gagcacaacg agcgggaccg caggatatac 1381 gccaacatgt tcaagaagtt tgcagagcag gatgccaagg aagaggccaa taaagcaatg 1441 ggcaagaaga cttcagaagg ggtcactaat gaaaaaggaa cagacagtca agcaatggaa 1501 gaagagaaac ctgagggcca cgtatgacgc cacgccaagg agggaagagt cccagtgaac 1561 tcggcccctc ctcaatgggc tttcccccaa ctcaggacag aacagtgttt aatgtaaagt 1621 ttgttatagt ctatgtgatt ctggaagcaa atggcaaaac cagtagcttc ccaaaaacag 1681 cccccctgct gctg // LOCUS HSU71374 1095 bp mRNA PRI 01-JAN-1997 DEFINITION Human HsPex13p mRNA, complete cds. ACCESSION U71374 NID g1613843 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1095) AUTHORS Gould,S.J., Kalish,J.E., Morrell,J.C., Bjorkman,J., Urquhart,A.J. and Crane,D.I. TITLE Pex13p is an SH3 protein of the peroxisome membrane and a docking factor for the predominantly cytoplasmic PTs1 receptor JOURNAL J. Cell Biol. 135 (1), 85-95 (1996) MEDLINE 97011155 REFERENCE 2 (bases 1 to 1095) AUTHORS Gould,S.J. TITLE Direct Submission JOURNAL Submitted (19-SEP-1996) Biological Chemistry, The Johns Hopkins University School of Medicine, 725 N. Wolfe St., Baltimore, MD 21205, USA FEATURES Location/Qualifiers source 1..1095 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1..1095 /function="docking factor for the PTS1 receptor" /note="this gene may be involved in peroxisome biogenesis disorder" /codon_start=1 /product="HsPex13p" /db_xref="PID:g1613844" /translation="MTRPGQPALTRVPPPILPRPSQQTGSSSVNTFRPAYSSFSSGYG AYGNSFYGGYSPYSYGYNGLGYNRLRVDDLPPSRFVQQAEESSRGAFQSIESIVHAFA SVSMMMDATFSAVYNSFRAVLDVANHFSRLKIHFTKVFSAFALVRTIRYLYRRLQRML GLRRGSENEDLWAESEGTVACLGAEDRAATSAKSWPIFLFFAVILGGPYLIWKLLSTH SDEVTDSINWASGEDDHVVARAEYDFAAVSEEEISFRAGDMLNLALKEQQPKVRGWLL ASLDGQTTGLIPANYVKILGKRKGRKTVESSKVSKQQQSFTNPTLTKGATVADSLDEQ EAAFESVFVETNKVPVAPDSIGKDGEKQDL" BASE COUNT 310 a 220 c 256 g 309 t ORIGIN 1 atgacaagac ctggacaacc agcacttacc agagtgcccc cacctattct tccaaggcca 61 tcacagcaga caggaagtag cagtgtgaac acttttagac ctgcttacag ttcattttct 121 tctggatatg gtgcctatgg aaattcattt tatggaggct atagtcctta tagttatgga 181 tataatgggc tgggctacaa ccgcctccgt gtagatgatc ttccacccag tagatttgtt 241 cagcaagctg aagaaagcag caggggtgca tttcagtcca ttgaaagtat tgtgcatgca 301 tttgcctctg tcagtatgat gatggatgct accttttcag ctgtctataa cagtttcagg 361 gctgtattgg atgtagcaaa tcacttttcc cgattgaaaa tacactttac aaaagtgttt 421 tcagcttttg cattggttag gactatacgg tatctttaca gacggctaca gcggatgtta 481 ggtttaagaa gaggctctga gaatgaagac ctctgggcag agagtgaagg aactgtggca 541 tgccttggtg ctgaggaccg agcagctacc tcagcaaaat cttggccaat attcttgttc 601 tttgctgtta tccttggtgg tccttacctc atttggaaac tattgtctac tcacagtgat 661 gaagtaacag acagcatcaa ctgggcaagt ggtgaggatg accatgtagt tgccagagca 721 gaatatgatt ttgctgccgt atctgaagaa gaaatttctt tccgggctgg tgatatgctg 781 aacttagctc tcaaagaaca acaacccaaa gtgcgtggtt ggcttctggc tagccttgat 841 ggccaaacaa caggacttat acctgcgaat tatgtcaaaa ttcttggcaa aagaaaaggt 901 aggaaaacgg tggaatcaag taaagtttcc aagcagcaac aatcttttac caacccaaca 961 ctaactaaag gagccacggt tgctgattct ttggatgaac aggaagctgc ctttgaatct 1021 gtttttgttg aaactaataa ggttccagtt gcacctgatt ccattgggaa agatggagaa 1081 aagcaagatc tttga // LOCUS HSU71383 2130 bp mRNA PRI 20-SEP-1997 DEFINITION Human OB binding protein-2 (OB-BP2) mRNA, complete cds. ACCESSION U71383 NID g2411474 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2130) AUTHORS Patel,N., Balasubramanian,S., Altmann,S.W., Gish,K.C., Timans,J.C., Peterson,D., Bell,M.P., Bazan,J.F. and Kastelein,R.A. TITLE Two Novel Sialoadhesin Family Members Bind Leptin JOURNAL Unpublished REFERENCE 2 (bases 1 to 2130) AUTHORS Patel,N. TITLE Direct Submission JOURNAL Submitted (19-SEP-1996) Molecular Biology, DNAX, 901 California Ave, Palo Alto, CA 94304, USA FEATURES Location/Qualifiers source 1..2130 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="TF-1" gene 143..1798 /gene="OB-BP2" CDS 143..1798 /gene="OB-BP2" /codon_start=1 /product="OB binding protein-2" /db_xref="PID:g2411475" /translation="MLPLLLLPLLWGGSLQEKPVYELQVQKSVTVQEGLCVLVPCSFS YPWRSWYSSPPLYVYWFRDGEIPYYAEVVATNNPDRRVKPETQGRFRLLGDVQKKNCS LSIGDARMEDTGSYFFRVERGRDVKYSYQQNKLNLEVTALIEKPDIHFLEPLESGRPT RLSCSLPGSCEAGPPLTFSWTGNALSPLDPETTRSSELTLTPRPEDHGTNLTCQMKRQ GAQVTTERTVQLNVSYAPQTITIFRNGIALEILQNTSYLPVLEGQALRLLCDAPSNPP AHLSWFQGSPALNATPISNTGILELRRVRSAEEGGFTCRAQHPLGFLQIFLNLSVYSL PQLLGPSCSWEAEGLHCRCSFRARPAPSLCWRLEEKPLEGNSSQGSFKVNSSSAGPWA NSSLILHGGLSSDLKVSCKAWNIYGSQSGSVLLLQGRSNLGTGVVPAALGGAGVMALL CICLCLIFFLIVKARRKQAAGRPEKMDDEDPIMGTITSGSRKKPWPDSPGDQASPPGD APPLEEQKELHYASLSFSEMKSREPKDQEAPSTTEYSEIKTSK" BASE COUNT 485 a 641 c 576 g 428 t ORIGIN 1 cattgtgttg ggcacagctc tcactcaccc tccggcttcc tgtcggggct ttctcagccc 61 caccccacgt ttggacattt ggagcatttc cttccctgac agccggacct gggactgggc 121 tggggccctg gcggatggag acatgctgcc cctgctgctg ctgcccctgc tgtggggggg 181 gtccctgcag gagaagccag tgtacgagct gcaagtgcag aagtcggtga cggtgcagga 241 gggcctgtgc gtccttgtgc cctgctcctt ctcttacccc tggagatcct ggtattcctc 301 tcccccactc tacgtctact ggttccggga cggggagatc ccatactacg ctgaggttgt 361 ggccacaaac aacccagaca gaagagtgaa gccagagacc cagggccgat tccgcctcct 421 tggggatgtc cagaagaaga actgctccct gagcatcgga gatgccagaa tggaggacac 481 gggaagctat ttcttccgcg tggagagagg aagggatgta aaatatagct accaacagaa 541 taagctgaac ttggaggtga cagccctgat agagaaaccc gacatccact ttctggagcc 601 tctggagtcc ggccgcccca caaggctgag ctgcagcctt ccaggatcct gtgaagcggg 661 accacctctc acattctcct ggacggggaa tgccctcagc cccctggacc ccgagaccac 721 ccgctcctcg gagctcaccc tcacccccag gcccgaggac catggcacca acctcacctg 781 tcagatgaaa cgccaaggag ctcaggtgac cacggagaga actgtccagc tcaatgtctc 841 ctatgctcca cagaccatca ccatcttcag gaacggcata gccctagaga tcctgcaaaa 901 cacctcatac cttccggtcc tggagggcca ggctctgcgg ctgctctgtg atgctcccag 961 caacccccct gcacacctga gctggttcca gggctcccct gccctgaacg ccacccccat 1021 ctccaatacc gggatcttgg agcttcgtcg agtaaggtct gcagaagaag gaggcttcac 1081 ctgccgcgct cagcacccgc tgggcttcct gcaaattttt ctgaatctct cagtttactc 1141 cctcccacag ttgctgggcc cctcctgctc ctgggaggct gagggtctgc actgcagatg 1201 ctcctttcga gcccggccgg ccccctccct gtgctggcgg cttgaggaga agccgctgga 1261 ggggaacagc agccagggct cattcaaggt caactccagc tcagctgggc cctgggccaa 1321 cagctccctg atcctccacg gggggctcag ctccgacctc aaagtcagct gcaaggcctg 1381 gaacatctat gggtcccaga gcggctctgt cctgctgctg caagggagat cgaacctcgg 1441 gacaggagtg gttcctgcag cccttggtgg tgctggtgtc atggccctgc tctgtatctg 1501 tctgtgcctc atcttctttt taatagtgaa agcccgcagg aagcaagcag ctgggagacc 1561 agagaaaatg gatgatgaag accccattat gggtaccatc acctcgggtt ccaggaagaa 1621 gccctggcca gacagccccg gagatcaagc atctcctcct ggggatgccc ctcccttgga 1681 agaacaaaag gagctccatt atgcctccct tagtttttct gagatgaagt cgagggagcc 1741 taaggaccag gaggccccaa gcaccacgga gtactcggag atcaagacaa gcaagtgagg 1801 atttgcccag agttcagtcc tggctggagg agccacagcc tgtctggggg aaaggacaag 1861 tcagggacca cttgctgaag cacgaagagc ccttgtggca atgttaacat taactgatgt 1921 ttaagtgctc caagcagagc agaaagaaaa cagatgatgg aattagagag gtgggctcaa 1981 atctaggccc tggcactgtc atcaagcaat tcactgcatc cctctgtgcc tcagtttccc 2041 attctgtaaa tcagagatca tgcatgctac ctcaaaggtt gttgtgaaca ttaaagaaat 2101 caacacatgg aaatcaaaaa aaaaaaaaaa // LOCUS HSU72066 3247 bp mRNA PRI 13-DEC-1996 DEFINITION Human CtBP interacting protein CtIP (CtIP) mRNA, complete cds. ACCESSION U72066 NID g1730320 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3247) AUTHORS Schaeper,U., Boyd,J.M., Lim,L. and Chinnadurai,G. TITLE Molecular cloning and characterization of the CtBP interacting protein, CtIP JOURNAL Unpublished REFERENCE 2 (bases 1 to 3247) AUTHORS Schaeper,U. and Chinnadurai,G. TITLE Direct Submission JOURNAL Submitted (20-SEP-1996) Institute for Molecular Virology, St. Louis University Health Sciences Center, 3681 Park Avenue, St. Louis, MO 63110, USA FEATURES Location/Qualifiers source 1..3247 /organism="Homo sapiens" /note="expressed in multiple tissues and cell lines" /db_xref="taxon:9606" /cell_line="HeLa" gene 300..2993 /gene="CtIP" CDS 300..2993 /gene="CtIP" /codon_start=1 /product="CtBP interacting protein CtIP" /db_xref="PID:g1730321" /translation="MNILGSSCGSPNSADTSSDFKDLWTKLKECHDREVQGLQVKVTK LKQERILDAQRLEEFFTKNQQLREQQKVLHETIKVLEDRLRAGLCDRCAVTEEHMRKK QQEFENIRQQNLKLITELMNERNTLQEENKKLSEQLQQKIENDQQHQAAELECEEDVI PDSPITAFSFSGVNRLRRKENPHVRYIEQTHTKLEHSVCANEMRKVSKSSTHPQHNPN ENEILVADTYDQSQSPMAKAHGTSSYTPDKSSFNLATVVAETLGLGVQEESETQGPMS PLGDELYHCLEGNHKKQPFEESTRNTEDSLRFSDSTSKTPPQEELPTRVSSPVFGATS SIKSGLDLNTSLSPSLLQPGKKKHLKTLPFSNTCISRLEKTRSKSEDSALFTHHSLGS EVNKIIIQSSNKQILINKNISESLGEQNRTEYGKDSNTDKHLEPLKSLGGRTSKRKKT EEESEHEVSCPQASFDKENAFPFPMDNQFSMNGDCVMDKPLDLSDRFSAIQRQEKSQG SETSKNKFRQVTLYEALKTIPKGFSSSRKASDGNCTLPKDSPGEPCSQECIILQPLNK CSPDNKPSLQIKEENAVFKIPLRPRESLETENVLDDIKSAGSHEPIKIQTRSDHGGCE LASVLQLNPCRTGKIKSLQNNQDVSFENIQWSIDPGADLSQYKMDVTVIDTKDGSQSK LGGETVDMDCTLVSETVLLKMKKQEQKGEKSSNEERKMNDSLEDMFDRTTHEEYESCL ADSFSQAADEEEELSTATKKLHTHGDKQDKVKQKAFVEPYFKGDERETSLQNFPHIEV VRKKEERRKLLGHTCKECEIYYADMPAEEREKKLASCSRHRFRYIPPNTPENFWEVGF PSTQTCMERGYIKEDLDPCPRPKRRQPYNAIFSPKGKEQKT" misc_feature 1767..1781 /gene="CtIP" /note="encodes PLDLS which is required for interaction with the cellular protein CtBP; this motif is also found in the CtBP-binding region of Adenovirus 2/5 E1A." BASE COUNT 1132 a 618 c 677 g 820 t ORIGIN 1 cgggtccggc cgctccgagc ccggccgcag cccccggctt aaagcgcggg ctgtccggag 61 ggtcggcttt cccaccgagg atttggcact ctggtgaggg ttttgggcga aagagaaaag 121 cgagcagccg tccttcacag cctcagaaag tgctcgcttc ccttcggggc tttcgcgaat 181 cccgaggcaa tctcggaggc ggtatttgac ctgtccaaag acgacttgat acctctataa 241 tgtaacagaa aaggtcagaa aatattaagc aagtagaagt gtggagcata ttaagcaaga 301 tgaacatctt gggaagcagc tgtggaagcc ctaactctgc agatacatct agtgacttta 361 aggacctttg gacaaaacta aaagaatgtc atgatagaga agtacaaggt ttacaagtaa 421 aagtaaccaa gctaaaacag gaacgaatct tagatgcaca aagactagaa gaattcttca 481 ccaaaaatca acagctgagg gaacagcaga aagtccttca tgaaaccatt aaagttttag 541 aagatcggtt aagagcaggc ttatgtgatc gctgtgcagt aactgaagaa catatgcgga 601 aaaaacagca agagtttgaa aatatccggc agcagaatct taaacttatt acagaactta 661 tgaatgaaag gaatactcta caggaagaaa ataaaaagct ttctgaacaa ctccagcaga 721 aaattgagaa tgatcaacag catcaagcag ctgagcttga atgtgaggaa gacgttattc 781 cagattcacc gataacagcc ttctcatttt ctggcgttaa ccggctacga agaaaggaga 841 acccccatgt ccgatacata gaacaaacac atactaaatt ggagcactct gtgtgtgcaa 901 atgaaatgag aaaagtttcc aagtcttcaa ctcatccaca acataatcct aatgaaaatg 961 aaattctagt agctgacact tatgaccaaa gtcaatctcc aatggccaaa gcacatggaa 1021 caagcagcta tacccctgat aagtcatctt ttaatttagc tacagttgtt gctgaaacac 1081 ttggacttgg tgttcaagaa gaatctgaaa ctcaaggtcc catgagcccc cttggtgatg 1141 agctctacca ctgtctggaa ggaaatcaca agaaacagcc ttttgaggaa tctacaagaa 1201 atactgaaga tagtttaaga ttttcagatt ctacttcaaa gactcctcct caagaagaat 1261 tacctactcg agtgtcatct cctgtatttg gagctacctc tagtatcaaa agtggtttag 1321 atttgaatac aagtttgtcc ccttctcttt tacagcctgg gaaaaaaaaa catctgaaaa 1381 cactcccttt tagcaacact tgtatatcta gattagaaaa aactagatca aaatctgaag 1441 atagtgccct tttcacacat cacagtcttg ggtctgaagt gaacaagatc attatccagt 1501 catctaataa acagatactt ataaataaaa atataagtga atccctaggt gaacagaata 1561 ggactgagta cggtaaagat tctaacactg ataaacattt ggagcccctg aaatcattgg 1621 gaggccgaac atccaaaagg aagaaaactg aggaagaaag tgaacatgaa gtaagctgcc 1681 cccaagcttc ttttgataaa gaaaatgctt tcccttttcc aatggataat cagttttcca 1741 tgaatggaga ctgtgtgatg gataaacctc tggatctgtc tgatcgattt tcagctattc 1801 agcgtcaaga gaaaagccaa ggaagtgaga cttctaaaaa caaatttagg caagtgactc 1861 tttatgaggc tttgaagacc attccaaagg gcttttcctc aagccgtaag gcctcagatg 1921 gcaactgcac gttgcccaaa gattccccag gggagccctg ttcacaggaa tgcatcatcc 1981 ttcagccctt gaataaatgc tctccagaca ataaaccatc attacaaata aaagaagaaa 2041 atgctgtctt taaaattcct ctacgtccac gtgaaagttt ggagactgag aatgttttag 2101 atgacataaa gagtgctggt tctcatgagc caataaaaat acaaaccagg tcagaccatg 2161 gaggatgtga acttgcatca gttcttcagt taaatccatg tagaactggt aaaataaagt 2221 ctctacaaaa caaccaagat gtatcctttg aaaatatcca gtggagtata gatccgggag 2281 cagacctttc tcagtataaa atggatgtta ctgtaataga tacaaaggat ggcagtcagt 2341 caaaattagg aggagagaca gtggacatgg actgtacatt ggttagtgaa accgttctct 2401 taaaaatgaa gaagcaagag cagaagggag aaaaaagttc aaatgaagaa agaaaaatga 2461 atgatagctt ggaagatatg tttgatcgga caacacatga agagtatgaa tcctgtttgg 2521 cagacagttt ctcccaagca gcagatgaag aggaggaatt gtctactgcc acaaagaaac 2581 tacacactca tggtgataaa caagacaaag tcaagcagaa agcgtttgtg gagccgtatt 2641 ttaaaggtga tgaaagagag actagcttgc aaaattttcc tcatattgag gtggttcgga 2701 aaaaagagga gagaagaaaa ctgcttgggc acacgtgtaa ggaatgtgaa atttattatg 2761 cagatatgcc agcagaagaa agagaaaaga aattggcttc ctgctcaaga caccgattcc 2821 gctacattcc acccaacaca ccagagaatt tttgggaagt tggttttcct tccactcaga 2881 cttgtatgga aagaggttat attaaggaag atcttgatcc ttgtcctcgt ccaaaaagac 2941 gtcagcctta caacgcaata ttttctccaa aaggcaagga gcagaagaca tagacgttga 3001 aacagaaaca gaaggatgaa ggacagtttt ttccttctta gttatttata gttaaagttg 3061 gtactaaaca ttgatttttt tgatcttctg taaatggatt tataaatcag ttttctattg 3121 aaaatgtttg tgatattttg cttttgcacc tttaaaacaa taaggcgctt tcattttgca 3181 ctctaactta agagttttta ctttatgtag tgatacctaa tacaattttg aaaatacaaa 3241 aaaaaaa // LOCUS HSU72206 3633 bp mRNA PRI 21-OCT-1996 DEFINITION Human guanine nucleotide regulatory factor (LFP40) mRNA, complete cds. ACCESSION U72206 NID g1621452 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3633) AUTHORS Ren,Y. and Busch,H. TITLE Guanine nucleotide regulatory factor JOURNAL Unpublished REFERENCE 2 (bases 1 to 3633) AUTHORS Ren,Y. and Busch,H. TITLE Direct Submission JOURNAL Submitted (23-SEP-1996) Pharmacology, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..3633 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="HeLa cell" gene 103..2787 /gene="LFP40" CDS 103..2787 /gene="LFP40" /note="similar to lfc oncogene product" /codon_start=1 /product="guanine nucleotide regulatory factor" /db_xref="PID:g1621453" /translation="MSRIESLTRARIDRSRELASKTREKEKMKEAKDARYTNGHLFTT ISVSGMTMCYACTKSITSKEALLCPTCNVTIPTAVKNTLRQLYQVQAEEPESAPVKNN TALQSVFVRSKTTIRERPSSAIYPSDRFRPSFVGSRRGVSSLFLAKSVSTTNIAGHFN DESPLGMRRIFSQSTDSLNMRNRTLSVESLIDEAEVIYSELMSDFEMDEKDFAADSWS LAVDSSFLQQHKKEVMKQQDVMYELIQTELHHVRTLKIMTRLFRTGMLEELHLEPGVV QGLFPLRGRASVTSIHASSASYQNADARPCALGSTGNFVIPRLGDLLISQFSGGSAEQ MCKTYSEFCSRHSKALKFYKELYARDKRFQQFIRKVTRPAVLKRHGVQECILLVTQRI TKYPLLISRILQHSHGIEEERQDLTTALGLVKELLSNVDEGIYQLEKGARLQEIYNRM DPRAQTPVPGKGPFGREELLRRKLIHDGCLLWKTATGRFKDVLVLLMTDVLVFLQEKD QKYIFPTLDKPSVVSLQNLIVRDIANQEKGMFLISAAPPEMYEVHTASRDDRSTWIRV IQQSVRTCPSREDFPLIETEDEAYLRRIKMELQQKDRALWSCCERSRLFAKMTHFQAE EAGGSGLALPTMPRGLFRSESLESPRGERLLQDAIREVEGLKDLLVGPGVELLLTPRE PALPLEPDSGGNTSPGVTANGEARTFNGSIELCRADSDSSQRDRNGNQLRSPQEEALQ RLVNLYGLLHGLQAAVAQQDTLMEARFPEGPERREKLCRANSRDGEAGRAGAAPVAPE KQATELALLQRQHALLQEELRRCRRLGEERATEAGSLEARLRESEQARALLEREAEEA RRQLAALGQTEQLPAEAPWARRPVDPRRRSSPQAMPCT" BASE COUNT 817 a 1062 c 1054 g 700 t ORIGIN 1 ccgagaccaa cgcgtgcggg ccgaacccct ccccccgcct tcccccaaca atacaggacg 61 ccggggtccg cgccgcgtcc tccctggtcc ccccgtccga ttatgtctcg gatcgaatcc 121 ctcacgcggg cgcggatcga ccggagcaga gagctggcga gcaagacccg ggaaaaggag 181 aagatgaagg aagccaagga tgcccgctat accaatgggc acctcttcac caccatttca 241 gtttcaggca tgaccatgtg ctatgcctgt accaagagca tcacttccaa ggaagctctc 301 ctctgcccaa cctgcaatgt gactatccca accgctgtaa agaacaccct tcgccaactg 361 taccaagttc aagcagaaga accggaaagc gcccctgtga agaacaacac cgccttgcag 421 tcagtttttg ttcgaagtaa gacaaccatc cgggagcggc ccagctcggc catctaccct 481 tccgacagat tccggccgtc cttcgtgggc tcccgccgtg gcgtttcttc tttgttttta 541 gccaagagtg tttctaccac caacattgct ggacatttca atgatgagtc tcccttgggg 601 atgcgccgga tcttctcaca gtccacagac tccctcaaca tgcggaaccg aaccctatcc 661 gtggaatccc tcattgacga agcagaggta atctacagtg agctgatgag tgactttgag 721 atggatgaga aggactttgc agctgactct tggagtcttg ctgtggacag cagcttcctg 781 cagcagcata aaaaggaggt gatgaagcag caagatgtca tgtatgagtt aatccagaca 841 gagctgcacc atgtgaggac actgaagatc atgacccgcc tcttccgcac ggggatgctg 901 gaagagctac acttggagcc aggagtggtc cagggcctgt ttcccctgcg tggacgagct 961 tcagtgacat ccatacacgc ttcctcagcc agttaccaga acgccgacgc caggccctgt 1021 gcccttggca gcaccgggaa ttttgtcatc cctcgcttgg gtgatctgct catcagccag 1081 ttctcaggtg gtagtgcgga gcagatgtgt aagacctact cggagttctg cagccgccac 1141 agcaaggcct taaagtttta taaggagctg tacgcccgag acaaacgctt ccagcaattc 1201 atccggaaag tgacccgccc cgccgtgctc aagcggcacg gggtacagga gtgcatcctg 1261 ctggtgactc agcgcatcac caagtacccg ttactcatca gccgcatcct gcagcattcc 1321 cacgggatcg aggaggagcg ccaggacctg accacagcac tggggctagt gaaggagctg 1381 ctgtccaatg tggacgaggg tatttatcag ctggagaaag gggcccgtct gcaggagatc 1441 tacaaccgca tggaccctcg ggcccaaacc ccagtgcctg gcaagggccc ctttggccga 1501 gaggaacttc tgaggcgcaa actcatccac gatggctgcc tgctctggaa gacagcgacg 1561 gggcgcttca aagatgtgct agtgctgctg atgacagatg tactggtgtt tctccaggaa 1621 aaggaccaga agtacatctt tcctaccctg gacaagcctt cagtggtatc gctgcagaat 1681 ctaatcgtac gagacattgc caaccaggag aaagggatgt ttctgatcag cgcagcccca 1741 cctgagatgt acgaggtgca cacagcatcc cgggatgacc ggagcacctg gatccgggtc 1801 attcagcaga gcgtgcgcac atgcccatcc agggaggact tccccctgat tgagacagag 1861 gatgaggctt acctgcggcg aattaagatg gagttgcagc agaaggaccg ggcactgtgg 1921 agctgctgcg agagaagtcg gctgttcgct aagatgaccc atttccaggc cgaagaggct 1981 ggtggcagtg ggctggccct gcccaccatg cccaggggcc ttttccgctc tgagtccctt 2041 gagtcccctc gtggcgagcg gctgctgcag gatgccatcc gtgaggtgga gggtctgaaa 2101 gacctgctgg tggggccagg agtggaactg ctcttgacac cccgagagcc agccctgccc 2161 ttggaaccag acagcggtgg taacacgagt cctggggtca ctgccaatgg tgaggccaga 2221 accttcaatg gctccattga actctgcaga gctgactcag actctagcca gagggatcga 2281 aatggaaatc agctgagatc accgcaagag gaggcgttac agcgattggt caatctctat 2341 ggacttctac atggcctaca ggcagctgtg gcccagcagg acactctgat ggaagcccgg 2401 ttccctgagg gccctgagcg gcgggagaag ctgtgccgag ccaactctcg ggatggggag 2461 gctggcaggg ctggggctgc ccctgtggcc cctgaaaagc aggccacgga actggcatta 2521 ctgcagcggc aacatgcgct gctgcaggag gagctacggc gctgccggcg gctaggtgaa 2581 gaacgggcaa ccgaagctgg cagcctggag gcccggctcc gggagagtga gcaggcccgg 2641 gcactgctgg agcgtgaggc cgaagaggct cgaaggcagc tggccgccct gggccagacc 2701 gagcaactcc cagctgaggc cccctgggcc cgcagacctg tggatcctcg gcggcgcagc 2761 tccccgcagg cgatgccctg tacttgagtt tcaacccccc acagcccagc cgaggcactg 2821 accgcctgga tctacctgtc actactcgct ctgtccatcg aaactttgag gaccgagaga 2881 ggcaggaact ggggagcccc gaagagcggc tgcaagacag cagtgaccct gacactggca 2941 gcgaggagga aggtagcagc cgtctgtctc cgccccacag tccacgagac tttaccagaa 3001 tgcaggacat cccggaggag acggagagcc gcgacgggga gggctgtagc tcgagagcta 3061 agggggcccc tcccccctgc cccgtgcccc actgaagaac attactgagg gggctaacct 3121 tggggactcc aatttgccaa tgatgaggga acatttgaaa gaactgcaaa ttgtccttgc 3181 cagctcttgg gatccttgga tacctggggc catttaagaa gctaggggaa ttaggccaca 3241 acaccccctg ggacatccga aagctacacc acagatgccg tggttcatgc cttcttcccg 3301 caactttagg aaaatttatt tatttattgt ttattagtta tggggggaga ggggagattt 3361 aaaggaccag ggacatggga accaagccat agggatcaga gggcctgtcc ttgaacacta 3421 ctggggtata ttcaggctca tccacgcagc tgctgggttc ttgcctaacg gccctcccct 3481 gcaacatccg tcttggagga gaggctgcag ccacagcacc ctactgccct ttaaataaag 3541 gagggctgtg ggcagggcca tgtccctttc tcctctcccc tcttcctctt actgctgttc 3601 tccctttctc cgtccttcat ggaagccctg gga // LOCUS HSU72209 1033 bp mRNA PRI 04-MAR-1997 DEFINITION Human YY1-associated factor 2 (YAF2) mRNA, complete cds. ACCESSION U72209 NID g1778303 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1033) AUTHORS Kalenik,J.L., Chen,D., Bradley,M.E., Chen,S.J. and Lee,T.C. TITLE Yeast two-hybrid cloning of a novel zinc finger protein that interacts with the multifunctional transcription factor YY1 JOURNAL Nucleic Acids Res. 25 (4), 843-849 (1997) MEDLINE 97169297 REFERENCE 2 (bases 1 to 1033) AUTHORS Lee,T.C. TITLE Direct Submission JOURNAL Submitted (23-SEP-1996) Biochemistry, SUNY at Buffalo, 3435 Main Street, Buffalo, NY 14214, USA FEATURES Location/Qualifiers source 1..1033 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="skeletal muscle" gene 254..796 /gene="YAF2" CDS 254..796 /gene="YAF2" /note="zinc finger protein" /codon_start=1 /product="YY1-associated factor 2" /db_xref="PID:g1778304" /translation="MGDKKSPTRPKRHAKPSSDEGYWDCSVCTFRNSAEAFKCMMCDV RKGTSTRKPRPVSQLVAQQVTQQFVPPTQSKKEKKDKVEKEKSEKETTSKKNSHKKTR PRLKNVDRSSAQHLEVTVGDLTVIITDFKEKTKSPPASSAASADQHSQSGSSSDNTER GMSRSSSPRGEASSLNGESH" BASE COUNT 309 a 217 c 240 g 267 t ORIGIN 1 ccactaaagt gcaagaatta cattgcactg tttctccact ttttattttc tcttaggctt 61 ttgtttctat ttcaaacata ctttcttggt tttctaatgg agtatatagt ttagtcattt 121 cacagactct ggcctcctct cctgaaatcc ttttggatgg ggaaagggaa ggtggggagg 181 gtccgacagt ggcggtagag aggagactcc ggctggcgac cggggactgg tggagtgggg 241 tgatagccaa gccatgggag acaagaagag ccccaccagg ccgaagcggc acgcgaagcc 301 ttcctcggat gagggttact gggactgtag cgtctgcacc ttccggaaca gcgccgaggc 361 cttcaagtgc atgatgtgcg atgtgcggaa gggcacctcc acccggaaac ctcgacctgt 421 ctcccagttg gttgcacagc aggttactca gcagtttgtg cctcctacac agtcaaagaa 481 agagaaaaaa gataaagtag aaaaagaaaa aagtgaaaag gaaacaacta gcaaaaagaa 541 tagccataag aaaaccaggc caagattgaa aaatgtggat cggagtagtg ctcagcattt 601 ggaagttact gttggagatc tgacagtcat tattacagac tttaaggaga aaacaaagtc 661 accgcctgca tctagtgctg cctctgcaga tcaacacagt caaagcggct ctagctctga 721 taacacagag agaggaatgt ccaggtcatc ttcacccaga ggagaagcct catcattgaa 781 tggagaatct cattaaagtt tattttctcc aatttcttag tcacttctgt cctaccatgc 841 aaatacacag attatgccaa gaggtaccac attttcatga cagatacatt catgcacaat 901 ccataatttg agttttacat aaaatagaaa tttgttagaa tttgttagat tttattgcaa 961 tgatgcctac caaacatttc cagacttaac attttggtct ctgcagttaa gtgccatgaa 1021 aatgtggttg aat // LOCUS HSU72245 279 bp mRNA PRI 13-MAY-1997 DEFINITION Human phospholemman chloride channel mRNA, complete cds. ACCESSION U72245 NID g1916009 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 279) AUTHORS Chen,L.S., Lo,C.F., Numann,R. and Cuddy,M. TITLE Characterization of the human and rat phospholemman (PLM) cDNAs and localization of the human PLM gene to chromosome 19q13.1 JOURNAL Genomics 41 (3), 435-443 (1997) MEDLINE 97312702 REFERENCE 2 (bases 1 to 279) AUTHORS Chen,L.K. TITLE Direct Submission JOURNAL Submitted (23-SEP-1996) Cardiovascular and Metabolic Disorders, Wyeth-Ayerst Research, Princeton, NJ, 08543-8000 FEATURES Location/Qualifiers source 1..279 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hPLM" /map="19q13.1" /tissue_type="heart" /dev_stage="adult" CDS 1..279 /codon_start=1 /product="phospholemman chloride channel" /db_xref="PID:g1916010" /translation="MAPLHHILVFCVGLLTMAKAESPKEHDPFTYDYQSLQIGGLVIA GILFILGILIVLSRRCRCKFNQQQRTGEPDEEEGTFRSSIRRLSTRRR" BASE COUNT 61 a 89 c 74 g 55 t ORIGIN 1 atggcacctc tccaccacat cttggttttc tgtgtgggtc tcctcaccat ggccaaggca 61 gaaagtccaa aggaacacga cccgttcact tacgactacc agtccctgca gatcggaggc 121 ctcgtcatcg ccgggatcct cttcatcctg ggcatcctca tcgtgctgag cagaagatgc 181 cggtgcaagt tcaaccagca gcagaggact ggggaacccg atgaagagga gggaactttc 241 cgcagctcca tccgccgtct gtccacccgc aggcggtag // LOCUS HSU72515 1842 bp mRNA PRI 24-JUL-1997 DEFINITION Human C3f mRNA, complete cds. ACCESSION U72515 NID g1673519 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1842) AUTHORS Ansari-Lari,M.A., Shen,Y., Muzny,D.M., Lee,W. and Gibbs,R.A. TITLE Large-scale sequencing in human chromosome 12p13: experimental and computational gene structure determination JOURNAL Genome Res. 7 (3), 268-280 (1997) MEDLINE 97228904 REFERENCE 2 (bases 1 to 1842) AUTHORS Ansari-Lari,M.A., Shen,Y., Muzny,D.M., Lee,W. and Gibbs,R.A. TITLE Direct Submission JOURNAL Submitted (24-SEP-1996) Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..1842 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="12" /map="12p13" misc_feature 1..1842 /gene="C3f" gene 1..1842 /gene="C3f" CDS 118..1263 /gene="C3f" /note="similar to ESTs with GenBank Accession Numbers H45806 and H45837; similar to S. cerevisiae ORF YOR175c encoded by GenBank Accession Number Z75083; see corresponding genomic sequence in GenBank Accession Number U72506" /codon_start=1 /product="C3f" /db_xref="PID:g1673520" /translation="MGRTITAVLTTFCFQMAYLLAGYYYTATGNYDIKWTMPHCVLTL KLIGLAVDYFDGGKDQNSLSSEQQKYAIRGVPSLLEVAGFSYFYGAFLVGPQFSMNHY MKLVQGELIDIPGKIPNSIIPALKRLSLGLFYLVGYTLLSPHITEDYLLTEDYDNHPF WFRCMYMLIWGKFVLYKYVTCWLVTEGVCILTGLGFNGFEEKGKAKWDACANMKVWLF ETNPRFTGTIASFNINTNAWVARYIFKRLKFLGNKELSQGLSLLFLALWHGLHSGYLV CFQMEFLIVIVERQAARLIQESPTLSKLAAITVLQPFYYLVQQTIHWLFMGYSMTAFC LFTWDKWLKVYKSIYFLGHIFFLSLLFILPYIHKAMVPRKEKLKKME" BASE COUNT 413 a 525 c 413 g 491 t ORIGIN 1 tacctcatcc acctcttcca tacctttaca ggcctctcaa ttgcttattt taactttgga 61 aaccagctct accactccct gctgtgtatt gtgcttcagt tcctcatcct tcgactaatg 121 ggccgcacca tcactgccgt cctcactacc ttttgcttcc agatggccta ccttctggct 181 ggatactatt acactgccac cggcaactac gatatcaagt ggacaatgcc acattgtgtt 241 ctgactttga agctgattgg tttggctgtt gactactttg acggagggaa agatcagaat 301 tccttgtcct ctgagcaaca gaaatatgcc atacgtggtg ttccttccct gctggaagtt 361 gctggtttct cctacttcta tggggccttc ttggtagggc cccagttctc aatgaatcac 421 tacatgaagc tggtgcaggg agagctgatt gacataccag gaaagatacc aaacagcatc 481 attcctgctc tcaagcgcct gagtctgggc cttttctacc tagtgggcta cacactgctc 541 agcccccaca tcacagaaga ctatctcctc actgaagact atgacaacca ccccttctgg 601 ttccgctgca tgtacatgct gatctggggc aagtttgtgc tgtacaaata tgtcacctgt 661 tggctggtca cagaaggagt atgcattttg acgggcctgg gcttcaatgg ctttgaagaa 721 aagggcaagg caaagtggga tgcctgtgcc aacatgaagg tgtggctctt tgaaacaaac 781 ccccgcttca ctggcaccat tgcctcattc aacatcaaca ccaacgcctg ggtggcccgc 841 tacatcttca aacgactcaa gttccttgga aataaagaac tctctcaggg tctctcgttg 901 ctattcctgg ccctctggca cggcctgcac tcaggatacc tggtctgctt ccagatggaa 961 ttcctcattg ttattgtgga aagacaggct gccaggctca ttcaagagag ccccaccctg 1021 agcaagctgg ccgccattac tgtcctccag cccttctact atttggtgca acagaccatc 1081 cactggctct tcatgggtta ctccatgact gccttctgcc tcttcacgtg ggacaaatgg 1141 cttaaggtgt ataaatccat ctatttcctt ggccacatct tcttcctgag cctactattc 1201 atattgcctt atattcacaa agcaatggtg ccaaggaaag agaagttaaa gaagatggaa 1261 taatccattt ccctggtggc ctgtgcggga ctggtgcaga aactactcgt ctcccttttc 1321 acagcactcc tttgccccag agcagagaat ggaaaagcca gggaggtgga agatcgatgc 1381 ttccagctgt gcctctgctg ccagccaagt cttcatttgg ggccaaaggg gaaacttttt 1441 tttggagaag gcgtcttgct ttgtcaccca cgctggaatg cagtggcggg atctcagctc 1501 accgcaacct ccacctcctg ggttcaagtg attttcctgc ctcagcctcc caagtagctg 1561 ggaatacagg cacgccacca tgcccagcta atttttgtat tttcagtaga aacgggattt 1621 caccacgttg gccaggctgg tctcgaactc ctgaccgcaa gtgatccacc cgcctccgcc 1681 tcccaaagtg ctgggattac aggcgtgagc caccgtgccc ggcccaaagg ggaaactctt 1741 gtgggaggag cagaggggct cacatctccc ctctgattcc cccatgcaca ttgccttatc 1801 tctccccatc tagccaggaa tctattgtgt ttttcttctg cc // LOCUS HSU72621 3249 bp mRNA PRI 20-AUG-1997 DEFINITION Human LOT1 mRNA, complete cds. ACCESSION U72621 NID g1658381 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3249) AUTHORS Abdollahi,A., Roberts,D., Godwin,A.K., Schultz,D.C., Sonoda,G., Testa,J.R. and Hamilton,T.C. TITLE Identification of a zinc-finger gene at 6q25: a chromosomal region implicated in development of many solid tumors JOURNAL Oncogene 14 (16), 1973-1979 (1997) MEDLINE 97294608 REFERENCE 2 (bases 1 to 3249) AUTHORS Abdollahi,A., Godwin,A.K., Miller,P.D., Getts,L.A., Schultz,D.C., Taguchi,T., Testa,J.R. and Hamilton,T.C. TITLE Identification of a gene containing zinc-finger motifs based on lost expression in malignantly transformed rat ovarian surface epithelial cells JOURNAL Cancer Res. 57 (10), 2029-2034 (1997) MEDLINE 97301600 REFERENCE 3 (bases 1 to 3249) AUTHORS Abdollahi,A., Godwin,A.K., Miller,P.D., Getts,L.A., Schultz,D.C., Taguchi,T., Testa,J.R. and Hamilton,T.C. TITLE Direct Submission JOURNAL Submitted (20-SEP-1996) Medical Oncology, Fox Chase Cancer Research Center, 7701 Burholme Ave., Philadelphia, PA 19111, USA FEATURES Location/Qualifiers source 1..3249 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6q25" gene 1..3249 /gene="LOT1" CDS 658..2049 /gene="LOT1" /standard_name="lost on transformation" /codon_start=1 /db_xref="PID:g1658382" /translation="MATFPCQLCGKTFLTLEKFTIHNYSHSRERPYKCVQPDCGKAFV SRYKLMRHMATHSPQKSHQCAHCEKTFNRKDHLKNHFQTHDPNKMAFGCEECGKKYNT MLGYKRHLALHAASSGDLTCGVCALELGSTEVLLDHLKAHAEEKPPSGTKEKKHQCDH CERCFYTRKDVRRHLVVHTGCKDFLCQFCAQRFGRKDHLTRHTKKTHSQELMKESLQT GDLLSTFHTISPSFQLKAAALPPFPLGASAQNGLASSLPAEVHSLTLSPPEQAAQPMQ PLPESLASLHPSVSPGSPPPPLPNHKYNTTSTSYSPLASLPLKADTKGFCNISLFEDL PLQEPQSPQKLNPGFDLAKGNAGKVNLPKELPADAVNLTIPASLDLSPLLGFWQLPPP ATQNTFGNSTLALGPGESLPHRLSCLGQQQQEPPLAMGTVSLGQLPLPPIPHVFSAGT GSAILPHFHHAFR" BASE COUNT 902 a 793 c 701 g 853 t ORIGIN 1 ggggattgtg gaagggtagg ggaaggagat ttctctctcc ccaggccact agataaaagg 61 ccatgtttgt ccacattggg aggctgatgg gaagaaagga tgccagacca gttggcctgt 121 ccctggggct tgtctcagga cctccaggaa gtgtttctga actgcaggct ctcgtgtgtg 181 tctcccattt gtcaaaactt tgacctgatc ttttcagaat ccaccttgtg aggccccagc 241 cttggaagcc actgcccatt gccagaacac ggggagtaga gcagtgagca ctgagtgggc 301 ctgaggctgc ctttcttccc tggcacacgt cctgaggagg ggaggttgtg gcggcaccag 361 gcagaagcta tgccactgct gcgctgggtc tccctcccca gaagccctca ttcttgattt 421 gctcaagctg ttcctgctta tcaacaaaca ccttaattta tcaaacacag caaagctagt 481 gacagctgag aggtccatgt ctgggtagaa ccaggcccac gatgctgcct ctcccgtgtt 541 ctggagttca gctgcaggga ttctgctgat gtgcccagca ccatcgttct gtttgtgctt 601 aaatggcaca gcatttggtc agcacatctg aaaaggaagg tgtgagaagc aaagcccatg 661 gccacgttcc cctgccagtt atgtggcaag acgttcctca ccctggagaa gttcacgatt 721 cacaattatt cccactccag ggagcggccg tacaagtgtg tgcagcctga ctgtggcaaa 781 gcctttgttt ccagatataa attgatgagg catatggcta cccattctcc ccagaaatct 841 caccagtgtg ctcactgtga gaagacgttc aaccggaaag accacctgaa aaaccacttc 901 cagacccacg accccaacaa aatggccttt gggtgtgagg agtgtgggaa gaagtacaac 961 accatgctgg gctataagag gcacctggcc ctccatgcgg ccagcagtgg ggacctcacc 1021 tgtggggtct gtgccctgga gctagggagc accgaggtgc tactggacca cctcaaagcc 1081 catgcggaag agaagccccc tagcggaacc aaggaaaaga agcaccagtg cgaccactgt 1141 gaaagatgct tctacacccg gaaggatgtg cgacgccacc tggtggtcca cacaggatgc 1201 aaggacttcc tgtgccagtt ctgtgcccag agatttgggc gcaaggatca cctcacccgg 1261 cataccaaga agacccactc acaggagctg atgaaagaga gcttgcagac cggagacctt 1321 ctgagcacct tccacaccat ctcgccttca ttccaactga aggctgctgc cttgcctcct 1381 ttccctttag gagcttctgc ccagaacggg cttgcaagta gcttgccagc tgaggtccat 1441 agcctcaccc tcagtccccc agaacaagcc gcccagccta tgcagccgct gccagagtcc 1501 ctggcctccc tccacccctc ggtatcccct ggctctcctc cgccacccct tcccaatcac 1561 aagtacaaca ccacttctac ctcatactcc ccacttgcaa gcctgcccct caaagcagat 1621 actaaaggtt tttgcaatat cagtttgttt gaggacttgc ctctgcaaga gcctcagtca 1681 cctcaaaagc tcaacccagg ttttgatctg gctaagggaa atgctggtaa agtaaacctg 1741 cccaaggagc tgcctgcaga tgctgtgaac ctaacaatac ctgcctctct ggacctgtcc 1801 cccctgttgg gcttctggca gctgccccct cctgctaccc aaaatacctt tgggaatagc 1861 actcttgccc tggggcctgg ggaatctttg ccccacaggt taagctgtct ggggcagcag 1921 cagcaagaac ccccacttgc catgggcact gtgagcctgg gccagctccc cctgcccccc 1981 atccctcatg tgttctcagc tggcactggc tctgccatcc tgcctcattt ccatcatgca 2041 ttcagataat tgatttttaa agggtatttt tcgtattctg gaagatgttt taagaagcat 2101 tttaaatgtc agttacaata tgagaaagat ttggaaaacg agactgggac tatggcttat 2161 tcagtgatga ctggcttgag atgataagag aattctcgaa ctgcatgtat tgtgccaatc 2221 tgtcctgagt gttcatgctt tgtaccaaat ttaatgaacg cgtgttctgt aatcaaactg 2281 caaatattgt cataaccaac atccaaaatg acggctgcta tatataagtg tttgtcatat 2341 ggaatttaat cgtaagccat gatcataatg ttaactaaat aactttatgt ggcactgcct 2401 agtaagggaa ctatggaaag gtttggattt ctccaaatct gggagaattt tcaaaataag 2461 aaaataacct ttatatgata tactatgact aggctgtgta tttcttttca gggatttttc 2521 taccttcagg gttggatgta gtttagttac tattaccata gccaacctgt agttttacat 2581 atacattttc ttgtggagca atagagttct ccattttaca gaagcatttt aaatgtagtt 2641 tgaatatttt ccacaagatg ctgcaatgtg agttatcact tcatttatct taaagaaaga 2701 ctaaactggt tgtcagttac atctgacaga aaaaaaaaaa aaaatcactg tgtaaccagg 2761 gttaagtggt taaaataatc cagggcgtca gtcaaaggca ttttgctgac tttaatattg 2821 attatatttt taacagggaa tttaaggaaa atattaccgg ggaattaaaa aatatatata 2881 tattaaaaca agaattttcc tttgcccctg tccagcctaa acctacctac ctcaaggctg 2941 cctaagttcc taagtattgt ttgtaatcac ccaataaata agtgcatttg taattcatca 3001 gtcattatta gcttttatta aaagaagatt acgttttaca atgtaactat aatctcttga 3061 atttggtatc ttattaatga gttttaaaga tgtaaaacct aacctttttt aaagctccat 3121 tgtcttatgt ttttagaggc ttttccgtaa acatatatct tacatataat aaacttttca 3181 aatcttgcaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3241 aaaaaaaaa // LOCUS HSU72661 1221 bp mRNA PRI 26-OCT-1996 DEFINITION Human ninjurin1 mRNA, complete cds. ACCESSION U72661 NID g1644367 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1221) AUTHORS Araki,T. and Milbrandt,J. TITLE Ninjurin, a novel adhesion molecule, is induced by nerve injury and promotes axonal growth JOURNAL Neuron 17 (2), 353-361 (1996) MEDLINE 96374367 REFERENCE 2 (bases 1 to 1221) AUTHORS Araki,T. and Milbrandt,J. TITLE Direct Submission JOURNAL Submitted (26-SEP-1996) Pathology and Medicine, Washington University School of Medicine, 660 S. Euclid Ave. Box 8118, St. Louis, MO 63132, USA FEATURES Location/Qualifiers source 1..1221 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 21..479 /note="plasma membrane protein; gene up-regulated after nerve injury in neurons and Schwann cells" /codon_start=1 /product="ninjurin1" /db_xref="PID:g1644368" /translation="MDSGTEEYELNGGLPPGTPGSPDASPARWGWRHGPINVNHYASK KSAAESMLDIALLMANASQLKAVVEQGPSFAFYVPLVVLISISLVLQIGVGVLLIFLV KYDLNNPDKHAKLDFLNNLATGLVFIIVVVNIFITAFGVQKPLMDMAPQQ" BASE COUNT 241 a 425 c 339 g 215 t 1 others ORIGIN 1 gcggcctggg cggccgcacc atggactcgg gaaccgagga gtacgagctc aacggcggcc 61 tgcctccggg cacacccggc tccccggacg cctcgccggc ccgctggggc tggaggcacg 121 ggcccatcaa cgtgaaccat tacgccagca agaagagcgc agccgagagc atgctggaca 181 tcgcgctgct gatggccaac gcgtcccagc tgaaggccgt cgtggaacag ggccccagct 241 tcgccttcta tgtgcccctg gtggtcctca tctccatctc ccttgtgctg cagatcggcg 301 tgggggtgct gctcatcttc cttgtcaagt acgaccttaa caacccggac aagcacgcca 361 agctggactt cctcaacaac ctggccacgg gcctggtgtt catcatcgtg gtagtcaaca 421 tcttcatcac ggccttcggg gtccagaagc ccttgatgga catggcaccc cagcagtagg 481 acacccagga ccctggatgc tgcctgccct gcaactcagc tgcccgaccc caggagtcgc 541 catacctgtg aggtgtccac ctccctgcac atggcactac ccagactgcc agagcccagg 601 ctggcctcat ctgcaccatg tccccggacc agcccttgct ctgactgcgg ccaagcacca 661 cgcaggaggc cactcttgtc tctcascagc tgttcccagg aggcagctcc ctcctggcac 721 atgggggctg gcacaatagc ccagagggtc agaactggac agctgcagag acctgtgccc 781 agagaagggt ctcgacccac tcaaggacac acagcaggtc cgtggatggg ctggatgagt 841 gaccagggcc agcctctgtc tcaggacatt ccagaaggac aaggagatgt ctctccctct 901 cccaaagcac cagcgtccct gcctcccgtg ggccctgtcc gggttgcccc tggtgacccc 961 agcctctgtc cacttcctaa cccagggacc ctgcacagcc agaactgcct ttggccctac 1021 ggatggccac tggctctggt ctaaagtgcc tgggcttggt ggccatcaag agggagccag 1081 tcaggcctgt gagggccgta gaccttgtat ataccctgca ccagcagtga ccgggcagag 1141 cccaaccccc tccacggggg tcccagcacc cacttttcta atcatgaatg aacaataaag 1201 cccacgctct ttgtcaggca a // LOCUS HSU72671 3000 bp mRNA PRI 05-FEB-1997 DEFINITION Human telencephalin precursor mRNA, complete cds. ACCESSION U72671 NID g1815648 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3000) AUTHORS Mizuno,T., Yoshihara,Y., Inazawa,J., Kagamiyama,H. and Mori,K. TITLE cDNA cloning and chromosomal localization of the human telencephalin and its distinctive interaction with lymphocyte function-associated antigen-1 JOURNAL J. Biol. Chem. 272 (2), 1156-1163 (1997) MEDLINE 97150880 REFERENCE 2 (bases 1 to 3000) AUTHORS Yoshihara,Y. TITLE Direct Submission JOURNAL Submitted (27-SEP-1996) Neuroscience, Osaka Bioscience Institute, 6-2-4 Furuedai, Suita, Osaka, 565, Japan FEATURES Location/Qualifiers source 1..3000 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="#12" /chromosome="19" /map="19p13.2" /tissue_type="brain" /dev_stage="adult" 5'UTR 1..65 sig_peptide 66..146 CDS 66..2840 /note="neural cell adhesion protein" /codon_start=1 /product="telencephalin precursor" /db_xref="PID:g1815649" /translation="MPGPSPGLRRALLGLWAALGLGLFGLSAVSQEPFWADLQPRVAF VERGGSLWLNCSTNCPRPERGGLETSLRRNGTQRGLRWLARQLVDIREPETQPVCFFR CARRTLQARGLIRTFQRPDRVELMPLPPWQPVGENFTLSCRVPGAGPRASLTLTLLRG AQELIRRSFAGEPPRARGAVLTATVLARREDHGANFSCRAELDLRPHGLGLFENSSAP RELRTFSLSPDAPRLAAPRLLEVGSERPVSCTLDGLFPASEARVYLALGDQNLSPDVT LEGDAFVATATATASAEQEGARQLICNVTLGGENRETRENVTIYSFPAPLLTLSEPSV SEGQMVTVTCAAGTQALVTLEGVPAAVPGQPAQLQLNATENDDRRSFFCDATLDVDGE TLIKNRSAELRVLYAPRLDDSDCPRSWTWPEGPEQTLRCEARGNPEPSVHCARSDGGA VLALGLLGPVTRALSGTYRCKAANDQGEAVKDVTLTVEYAPALDSVGCPERITWLEGT EASLSCVAHGVPPPDVICVRSGELGAVIEGLLRVAREHAGTYRCEATNPRGSAAKNVA VTVEYGPRFEEPSCPSNWTWVEGSGRLFSCEVDGKPQPSVKCVGSGGATEGVLLPLAP PDPSPRAPRIPRVLAPGIYVCNATNRHGSVAKTVVVSAESPPEMDESTCPSHQTWLEG AEASALACAARGRPSPGVRCSREGIPWPEQQRVSREDAGTYHCVATNAHGTDSRTVTV GVEYRPVVAELAASPPGGVRPGGNFTLTCRAEAWPPAQISWRAPPGALNIGLSSNNST LSVAGAMGSHGGEYECARTNAHGRHARRITVRVAGPWLWVAVGGAAGGAALLAAGAGL AFYVQSTACKKGEYNVQEAESSGEAVCLNGAGGGAGGAAGAEGGPEAAGGAAESPAEG EVFAIQLTSA" mat_peptide 147..2837 /product="telencephalin" 3'UTR 2841..3000 BASE COUNT 491 a 996 c 1024 g 489 t ORIGIN 1 ccgtcctcta gcccagctcc tcggctcgcg ctctcctcgc ctcctgtgct ttccccgccg 61 cggcgatgcc agggccttcg ccagggctgc gccgggcgct actcggcctc tgggctgctc 121 tgggcctggg gctcttcggc ctctcagcgg tctcgcagga gcccttctgg gcggacctgc 181 agcctcgcgt ggcgttcgtg gagcgcgggg gctcgctgtg gctgaattgc agcaccaact 241 gccctcggcc ggagcgcggt ggcctggaga cctcgctgcg ccgaaacggg acccagaggg 301 gtttgcgttg gttggcgcgg cagctggtgg acattcgcga gccggagact cagcccgtct 361 gcttcttccg ctgcgcgcgg cgcacactac aggcgcgtgg gctcattcgc actttccagc 421 gaccagatcg cgtagagctg atgccgctgc ctccctggca gccggtgggc gagaacttca 481 ccctgagctg tagggtcccc ggcgccgggc cccgtgcgag cctcacgctg accctgctgc 541 ggggcgccca ggagctgatc cgccgcagct tcgccggtga accaccccga gcgcggggcg 601 cggtgctcac agccacggta ctggctcgga gggaggacca tggagccaat ttctcgtgtc 661 gcgccgagct ggacctgcgg ccgcacggac tgggactgtt tgaaaacagc tcggccccca 721 gagagctccg aaccttctcc ctgtctccgg atgccccgcg cctcgctgct ccccggctct 781 tggaagttgg ctcggaaagg cccgtgagct gcactctgga cggactgttt ccagcctcag 841 aggccagggt ctacctcgca ctgggggacc agaatctgag tcctgatgtc accctcgaag 901 gggacgcatt cgtggccact gccacagcca cagctagcgc agagcaggag ggtgccaggc 961 agctgatctg caacgtcacc ctggggggcg aaaaccggga gacccgggag aacgtgacca 1021 tctacagctt cccggcacca ctcctgaccc tgagcgaacc cagcgtctcc gaggggcaga 1081 tggtgacagt aacctgcgca gctgggaccc aagctctggt cacactggag ggagttccag 1141 ccgcggtccc ggggcagccc gcccagcttc agctaaatgc caccgagaac gacgacagac 1201 gcagcttctt ctgcgacgcc accctcgatg tggacgggga gaccctgatc aagaacagga 1261 gcgcagagct tcgtgtccta tacgctcccc ggctagacga ttcggactgc cccaggagtt 1321 ggacgtggcc cgagggccca gagcagacgc tgcgctgcga ggcccgcggg aacccagaac 1381 cctcagtgca ctgtgcgcgc tccgacggcg gggccgtgct ggctctgggc ctgctgggtc 1441 cagtcactcg ggcgctctca ggcacttacc gctgcaaggc ggccaatgat caaggcgagg 1501 cggtcaagga cgtaacgcta acggtggagt acgcaccagc gctggacagc gtgggctgcc 1561 cagaacgcat tacttggctg gagggaacag aagcctcgct gagctgtgtg gcgcacgggg 1621 taccgccgcc tgatgtgatc tgcgtgcgct ctggagaact cggggccgtc atcgaggggc 1681 tgttgcgtgt ggcccgggag catgcgggca cttaccgctg cgaagccacc aaccctcggg 1741 gctctgcggc caaaaatgtg gccgtcacgg tggaatatgg ccccaggttt gaggagccga 1801 gctgccccag caattggaca tgggtggaag gatctgggcg cctgttttcc tgtgaggtcg 1861 atgggaagcc acagccaagc gtgaagtgcg tgggctccgg gggcgccact gagggggtgc 1921 tgctgccgct ggcaccccca gaccctagtc ccagagctcc cagaatccct agagtcctgg 1981 cacccggtat ctacgtctgc aacgccacca accgccacgg ctccgtggcc aaaacagtcg 2041 tcgtgagcgc ggagtcgcca ccggagatgg atgaatctac ctgcccaagt caccagacgt 2101 ggctggaagg ggctgaggct tccgcgctgg cctgcgccgc ccggggtcgc ccttccccag 2161 gagtgcgctg ctctcgggaa ggcatcccat ggcctgagca gcagcgcgtg tcccgagagg 2221 acgcgggcac ttaccactgt gtggccacca atgcgcatgg cacggactcc cggaccgtca 2281 ctgtgggcgt ggaataccgg ccagtggtgg ccgaacttgc tgcctcgccc cctggaggcg 2341 tgcgcccagg aggaaacttc acgttgacct gccgcgcgga ggcctggcct ccagcccaga 2401 tcagctggcg cgcgcccccg ggggccctca acatcggcct gtcgagcaac aacagcacac 2461 tgagcgtggc aggcgccatg ggaagccacg gcggcgagta cgagtgcgca cgcaccaacg 2521 cgcacgggcg ccacgcgcgg cgcatcacgg tgcgcgtggc cggtccgtgg ctatgggtcg 2581 ccgtgggcgg cgcggcgggg ggcgcggcgc tgctggccgc gggggccggc ctggccttct 2641 acgtgcagtc caccgcctgc aagaagggcg agtacaacgt gcaggaggcc gagagctcag 2701 gcgaggccgt gtgtctcaac ggagcgggcg gcggcgctgg cggggcggca ggcgcggagg 2761 gcggacccga ggcggcgggg ggcgcggccg agtcgccggc ggagggcgag gtcttcgcca 2821 tacagctgac atcggcgtga gccgctcccc tctccccgcg ggccggggga cgccccccag 2881 actcacacgg gggcttattt attgctttat ttatttactt attcatttat ttatgtattc 2941 aactccaagg gcgtcacccc cattttctac ccatcccctc aataaagttt ttataaagga // LOCUS HSU73328 1403 bp mRNA PRI 17-JAN-1997 DEFINITION Human DLX7 (Dlx7) mRNA, complete cds. ACCESSION U73328 NID g1657866 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1403) AUTHORS Nakamura,S., Stock,D.W., Wydner,K.L., Bollekens,J.A., Takeshita,K., Nagai,B.M., Chiba,S., Kitamura,T., Freeland,T.M., Zhao,Z., Minowada,J., Lawrence,J.B., Weiss,K.M. and Ruddle,F.H. TITLE Genomic analysis of a new mammalian distal-less gene: Dlx7 JOURNAL Genomics 38 (3), 314-324 (1996) MEDLINE 97131510 REFERENCE 2 (bases 1 to 1403) AUTHORS Nakamura,S., Stock,D.W., Wydner,K.L., Bollekens,J.A., Takeshita,K., Nagai,B.M., Chiba,S., Kitamura,T., Freeland,T.M., Zhao,Z., Minowada,J., Lawrence,J.B., Weiss,K.M. and Ruddle,F.H. TITLE Direct Submission JOURNAL Submitted (03-OCT-1996) Anthropology, Pennsylvania State University, 409 Carpenter Bldg., University Park, PA 16802, USA FEATURES Location/Qualifiers source 1..1403 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17" /map="17q21.3-17q22" /clone_lib="TF-1 leukemia cell" gene 247..750 /gene="Dlx7" CDS 247..750 /gene="Dlx7" /codon_start=1 /product="DLX7" /db_xref="PID:g1657867" /translation="MKLSVLPHRSLLAPNTVLCCPPDSEKPRLSPEPSERRPQAAAKK LRKPRTIYSSLQLQHLNQRFQHTQYLALPERAQLAAQLGLTQTQVKIWFQNKRSKYKK LLKQNSGGQEGDFPGRTFSVSPCSPPLPYLWDLPKAGTLPTSGYGNSFGPWYQHHSSD VLSPQMM" BASE COUNT 317 a 438 c 355 g 293 t ORIGIN 1 ctaaagccgc aggcgccgcg gtaccctggc tgtggccctc ggcgctttct tcctagggtc 61 acaggaccca tacgagtggg agctccctgg gagcagaact gcgtcttgta tcacctggcg 121 cggtgaacgt gggggttgaa acgctccacg cggaaggtag agggcagggg ccaagggggc 181 gatcctggtg gctgcgcttt ttgctatttg ctgccgacgg catgcagacg agatgcaaat 241 aagcttatga aactgtccgt cctaccccat cgctccctcc tcgcccccaa caccgtgttg 301 tgctgcccac cagactcgga gaagccgcgg ctgtctccgg aaccctccga gcggcgccct 361 caggccgccg ccaaaaagct ccgcaagccg aggaccatct actccagcct gcagctgcag 421 cacctaaacc agcgtttcca gcacacgcag tacctggcgc tgcccgagag ggcccagctg 481 gcagcgcagc tcggcctcac ccagacccag gtaaagatct ggtttcagaa caaacgctcc 541 aagtataaga agctcctgaa gcagaattct gggggccagg aaggggactt ccctgggagg 601 accttctctg tgtctccctg ctccccaccc ctcccctacc tctgggatct acccaaggca 661 gggaccctgc ccaccagtgg ctatggcaac agctttggac cctggtatca gcatcactcc 721 tcagatgtcc tgtcgcctca gatgatgtga atctggggaa gggcgggtca ggcccacagc 781 cttcctgcaa agcccaggac ccaggcagtc cacctgcacc ccttctgggc tgggaggaaa 841 ccagctccag atgggttttc tctggaggac aagcagttag aggagaaaag gggaatggag 901 cagagcctgt acccctaacc ctagcagcta aatcaaggac ctcagcctta tataatcatt 961 gtccccacca ctaccatgga ctggacacct tcactccagc tggacaaaga ctctggagag 1021 agagccattg gctggagttg agactgtccc cagaaccctt ggtcttgcca ctcccccact 1081 ccttcttccc tctctccctt tctcctctcc ctgctttctt gaaaaggact gaatcgccac 1141 tacagcctgg gtgcaaaatc agcaagaaac attgagtatt tttttttctt tgtatgcctt 1201 tggccttgca caacccattt gtgagcaaaa gcagaagtgg accaccatca gctcccaccc 1261 acccagcgat ttttccttgg aggtcagccc gttaccccca taactgattt acctacttac 1321 catactggga ggtagaagag atgcagagaa atgtggaatt tgtggaccta tgggtaattt 1381 atgctttcct cctaaaaaaa aaa // LOCUS HSU73379 783 bp mRNA PRI 02-MAY-1997 DEFINITION Human cyclin-selective ubiquitin carrier protein mRNA, complete cds. ACCESSION U73379 NID g2062372 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 783) AUTHORS Townsley,F.M. and Ruderman,J.V. TITLE UbcH10 JOURNAL Unpublished (1996) REFERENCE 2 (bases 1 to 783) AUTHORS Townsley,F.M. and Ruderman,J.V. TITLE Direct Submission JOURNAL Submitted (03-OCT-1996) Cell Biology, Harvard Medical School, 240 Longwood Avenue, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..783 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 41..580 /note="UbcH10" /codon_start=1 /product="cyclin-selective ubiquitin carrier protein" /db_xref="PID:g2062373" /translation="MASQNRDPAATSVAAARKGAEPSGGAARGPVGKRLQQELMTLMM SGDKGISAFPESDNLFKWVGTIHGAAGTVYEDLRYKLSLEFPSGYPYNAPTVKFLTPC YHPNVDTQGNICLDILKEKWSALYDVRTILLSIQSLLGEPNIDSPLNTHAAELWKNPT AFKKYLQETYSKQVTSQEP" BASE COUNT 192 a 212 c 180 g 199 t ORIGIN 1 ggcacgagcg agttcctgtc tctctgccaa cgccgcccgg atggcttccc aaaaccgcga 61 cccagccgcc actagcgtcg ccgccgcccg taaaggagct gagccgagcg ggggcgccgc 121 ccggggtccg gtgggcaaaa ggctacagca ggagctgatg accctcatga tgtctggcga 181 taaagggatt tctgccttcc ctgaatcaga caaccttttc aaatgggtag ggaccatcca 241 tggagcagct ggaacagtat atgaagacct gaggtataag ctctcgctag agttccccag 301 tggctaccct tacaatgcgc ccacagtgaa gttcctcacg ccctgctatc accccaacgt 361 ggacacccag ggtaacatat gcctggacat cctgaaggaa aagtggtctg ccctgtatga 421 tgtcaggacc attctgctct ccatccagag ccttctagga gaacccaaca ttgatagtcc 481 cttgaacaca catgctgccg agctctggaa aaaccccaca gcttttaaga agtacctgca 541 agaaacctac tcaaagcagg tcaccagcca ggagccctga cccaggctgc ccagcctgtc 601 cttgtgtcgt ctttttaatt tttccttaga tggtctgtcc tttttgtgat ttctgtatag 661 gactctttat cttgagctgt ggtatttttg ttttgttttt gtcttttaaa ttaagcctcg 721 gttgagccct tgtatattaa ataaatgcat ttttgtcctt ttttaaaaaa aaaaaaaaaa 781 aaa // LOCUS HSU73514 947 bp mRNA PRI 04-SEP-1997 DEFINITION Human short-chain alcohol dehydrogenase (XH98G2) mRNA, complete cds. ACCESSION U73514 NID g1778354 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 947) AUTHORS Zhuchenko,O.P., Wehnert,M., Bailey,J., Sun,Z.S. and Lee,C.C. TITLE Isolation and genomic organization of a new member of the short-chain alcohol dehydrogenase family mapped to the human X chromosome JOURNAL Unpublished REFERENCE 2 (bases 1 to 947) AUTHORS Zhuchenko,O.P., Wehnert,M., Bailey,J., Sun,Z.S. and Lee,C.C. TITLE Direct Submission JOURNAL Submitted (05-OCT-1996) Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..947 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /map="Xp11.21" gene 1..947 /gene="XH98G2" CDS 16..801 /gene="XH98G2" /note="X-linked alcohol dehydrogenase" /codon_start=1 /product="short-chain alcohol dehydrogenase" /db_xref="PID:g1778355" /translation="MAAACRSVKGLVAVITGGASGLGLATAERLVGQGASAVLLDLPN SGGEAQAKKLGNNCVFAPADVTSEKDVQTALALAKGKFGRVDVAVNCAGIAVASKTYN LKKGQTHTLEDFQRVLDVNLMGTFNVIRLVAGEMGQNEPDQGGQRGVIINTASVAAFE GQVGQAAYSASKGGIVGMTLPIARDLAPIGIRVMTIAPGLFGTPLLTSLPEKVCNFLA SQVPFPSRLGDPAEYAHLVQAIIENPFLNGEVIRLDGAIRMQP" BASE COUNT 204 a 259 c 285 g 199 t ORIGIN 1 gtggccggcg acaagatggc agcagcgtgt cggagcgtga agggcctggt ggcggtaata 61 accggaggag cctcgggcct gggcctggcc acggcggagc gacttgtggg gcagggagcc 121 tctgctgtgc ttctggacct gcccaactcg ggtggggagg cccaagccaa gaagttagga 181 aacaactgcg ttttcgcccc agccgacgtg acctctgaga aggatgtgca aacagctctg 241 gctctagcaa aaggaaagtt tggccgtgtg gatgtagctg tcaactgtgc aggcatcgcg 301 gtggctagca agacgtacaa cttaaagaag ggccagaccc ataccttgga agacttccag 361 cgagttcttg atgtgaatct catgggcacc ttcaatgtga tccgcctggt ggctggtgag 421 atgggccaga atgaaccaga ccagggaggc caacgtgggg tcatcatcaa cactgccagt 481 gtggctgcct tcgagggtca ggttggacaa gctgcatact ctgcttccaa ggggggaata 541 gtgggcatga cactgcccat tgctcgggat ctggctccca taggtatccg ggtgatgacc 601 attgccccag gtctgtttgg caccccactg ctgaccagcc tcccagagaa agtgtgcaac 661 ttcttggcca gccaagtgcc cttccctagc cgactgggtg accctgctga gtatgctcac 721 ctcgtacagg ccatcatcga gaacccattc ctcaatggag aggtcatccg gctggatggg 781 gccattcgta tgcagccttg aagggagaag gcagagaaaa cacacgctcc tctgcccttc 841 ctttccctgg ggtactactc tccagcttgg gaggaagccc agtagccatt ttgtaactgc 901 ctaccagtcg ccctctgtgc ctaataaagt ctctttttct cacagag // LOCUS HSU73524 2437 bp mRNA PRI 29-OCT-1996 DEFINITION Human putative ATP/GTP-binding protein (HEAB) mRNA, complete cds. ACCESSION U73524 NID g1644401 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2437) AUTHORS Tanabe,S., Bohlander,S.K., Vignon,C.V., Espinosa,R. 3rd, Zhao,N., Strissel,P.L., Zeleznik-Le,N.J. and Rowley,J.D. TITLE AF10 is split by MLL and HEAB, a human homolog to a putative Caenorhabditis elegans ATP/GTP-binding protein in an invins(10;11)(p12;q23q12) JOURNAL Blood 88 (9), 3535-3545 (1996) MEDLINE 97051786 REFERENCE 2 (bases 1 to 2437) AUTHORS Tanabe,S. and Rowley,J.D. TITLE Direct Submission JOURNAL Submitted (06-OCT-1996) Department of Medicine, Section of Hematology/Oncology, The University of Chicago, 5841 South Maryland Avenue, Chicago, IL 60637, USA FEATURES Location/Qualifiers source 1..2437 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11q12" gene 724..2001 /gene="HEAB" CDS 724..2001 /gene="HEAB" /note="similar to C. elegans ATP/GTP-binding protein F59A2.4 encoded by GenBank Accession Number Z34801" /codon_start=1 /product="putative ATP/GTP-binding protein" /db_xref="PID:g1644402" /translation="MGEEANDDKKPTTKFELERETELRFEVEASQSVQLELLTGMAEI FGTELTRNKKFTFDAGAKVAVFTWHGCSVQLSGRTEVAYVSKDTPMLLYLNTHTALEQ MRRQAEKEEERGPRVMVVGPTDVGKSTVCRLLLNYAVRLGRRPTYVELDVGQGSVSIP GTMGALYIERPADVEEGFSIQAPLVYHFGSTTPGTNIKLYNKITSRLADVFNQRCEVN RRASVSGCVINTCGWVKGSGYQALVHAASAFEVDVVVVLDQERLYNELKRDLPHFVRT VLLPKSGGVVERSKDFRRECRDERIREYFYGFRGCFYPHAFNVKFSDVKIYKVGAPTI PDSCLPLGMSQEDNQLKLVPVTPGRDMVHHLLSVSTAEGTEENLSETSVAGFIVVTSV DLEHQVFTVLSPAPRPLPKNFLLIMDIRFMDLK" BASE COUNT 577 a 610 c 626 g 624 t ORIGIN 1 atgactgact tgtagctgga agaaatcatc ggatttttat tcttttatta aagaaaaaaa 61 atttgaaatg ccttccatgt gccaagcact gtgtcaggtg ggagatgaca gcttggtgaa 121 acctctgtca ggctgtcttc ctccgctttc tctatccctg ggtttccccc tgcctaaaaa 181 ggattttgtg cttcgtggct tgtccaggca agcaggccgt cgcgggacct agaccgagac 241 agtgagtctc tctttctccc gggcctccct tctgtttcct gggctgcagg ggagcaggaa 301 atctggggcg agattcccgc cgcggacgcg cactgccgaa gcctggtccc tcgacctgtc 361 cctgcccagc gcgggggcgc aaccgccacg cctcctcacc cctccctccg gctgcacgaa 421 taatgacaac agccgcccct cccacctttg gcgtcacgtt caaaacaatc ctttgactac 481 aactcccaga aggccgagcg gcttagcgag tgcacccgct ctcggctgct ccggcaaact 541 acacatccca aagggcagcg ccgaccgcgt gtcctttcac agcaaagtgc ggaactgcgt 601 ttgtttccgg cgtgggtccg ggcaagaacc gcttgtagtt tggtttaaat tctgcacggg 661 aggaccttct gagtttacct gttgggctcc tggctgcgca ggcacagcag ctacacagaa 721 gagatgggag aagaggctaa tgatgacaag aagccaacca ctaaatttga actagagcga 781 gaaacagaac ttcgctttga ggtggaggca tctcagtcag ttcagttgga gttgttgact 841 ggcatggcag agatctttgg cacagagctg acccgaaaca agaaattcac ctttgatgct 901 ggtgccaagg tggctgtttt cacttggcat ggctgttctg tgcaactgag cggccgcact 961 gaggtggctt atgtctccaa ggacactcct atgttgcttt acctcaacac tcacacagcc 1021 ttggaacaga tgcggaggca agcggaaaag gaagaagagc gaggtccccg agtgatggta 1081 gtgggcccca ctgatgtggg caagtctaca gtgtgtcgcc ttctgctcaa ctacgcagtg 1141 cgtttgggcc gccgtcccac ttatgtggag ctggatgtgg gccagggttc tgtgtccatc 1201 cctggtacca tgggggccct ctacatcgag cggcctgcag atgtcgaaga gggtttctct 1261 atccaggccc ctctggtgta tcattttggt tccaccactc ctggcactaa catcaagctt 1321 tataataaga ttacatctcg tttagcagat gtgttcaacc aaaggtgtga ggtgaaccga 1381 agggcatctg tgagtggctg tgtcattaac acctgtggct gggtcaaggg ctctggttac 1441 caggctctgg tgcatgcagc ctcagctttt gaggtggatg tcgttgttgt tctggatcaa 1501 gaacgactgt acaatgaact gaaacgggac ctcccccact ttgtacgcac tgtgctgctc 1561 cctaaatctg ggggtgtggt ggagcgctcc aaggacttcc ggcgggaatg tagggatgag 1621 cgtatccgtg agtattttta tggattccga ggctgtttct atccccatgc cttcaatgtc 1681 aaattttcag atgtgaaaat ctacaaagtt ggggcaccca ccatcccaga ctcctgttta 1741 cctttgggca tgtctcaaga ggataatcag ctcaagctag tacctgtcac tcctgggcga 1801 gatatggtgc accacctact gagtgttagc actgccgagg gtacagagga gaacctgtcc 1861 gagacaagtg tagctggctt cattgtggtg accagtgtgg acctggagca tcaggtgttt 1921 actgttctgt ctccagcccc tcgcccactg cctaagaact tccttctcat catggatatc 1981 cggttcatgg atctgaagta gagatcagca ggaagccttg ctgcctggga catagagatc 2041 atctggccac ccctagaggc agatgggctg agataaaaga ctgttggggc cacctgacca 2101 gtaaactgtg gactagtaga aagttcatat tctacctcta aaaacaggta gtggtaacct 2161 gactcttcta atcttgaacc aaaaggaaaa ccatgagact gtaattggtt tcttagacca 2221 cctaagatgc cactttgaat tctctaagac cctggagaat tgcatttctt tcactgtgct 2281 actatgtggt ttttaaaaaa tcaatgcttt atattccata tgtggttctt acccatttat 2341 catggatgaa agtgtgaatt agagggactc cttccaataa agttcaaact gaaaaaaaat 2401 cattttaata aatatttttg ccatatcata aaaaaaa // LOCUS HSU73704 1837 bp mRNA PRI 20-DEC-1996 DEFINITION Homo sapiens 48 kDa FKBP-associated protein FAP48 mRNA, complete cds. ACCESSION U73704 NID g1658004 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1837) AUTHORS Chambraud,B., Radanyi,C., Camonis,J.H., Shazand,K., Rajkowski,K. and Baulieu,E.E. TITLE FAP48, a new protein that forms specific complexes with both immunophilins FKBP59 and FKBP12. Prevention by the immunosuppressant drugs FK506 and rapamycin JOURNAL J. Biol. Chem. 271 (51), 32923-32929 (1996) MEDLINE 97115832 REFERENCE 2 (bases 1 to 1837) AUTHORS Chambraud,B., Radanyi,C., Camonis,J.H., Shazand,K., Rajkowski,K. and Baulieu,E.E. TITLE Direct Submission JOURNAL Submitted (08-OCT-1996) Unite 33, INSERM, 80-rue du General Leclerc, Kremlin-Bicetre 94276, France FEATURES Location/Qualifiers source 1..1837 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Jurkat" CDS 94..1347 /note="48 kDa FKBP-associated protein" /codon_start=1 /product="FAP48" /db_xref="PID:g1658005" /translation="MAVEELQSIIKRCQILEEQDFKEEDFGLFQLAGQRCIEEGHTDQ LLEIIQNEKNKVIIKNMGWNLVGPVVRCLLCKDKEDSKRKVYFLIFDLLVKLCNPKEL LLGLLELIEEPSGKQISQSILLLLQPLQTVIQKLHNKAYSIGLALSTLWNQLSLLPVP YSKEQIQMDDYGLCQCCKALIEFTKPFVEEVIDNKENSLENEKLKDELLKFCFKSLKC PLLTAQFFEQSEEGGNDPFRYFASEIIGFLSAIGHPFPKMIFNHGRKKRTWNYLEFEE EENKQLADSMASLAYLVFVQGIHIDQLPMVLSPLYLLQFNMGHIEVFLQRTEESVISK GLELLENSLLRIEDNSLLYQYLEIKSFLTVPQGLVKVMTLCPIETLRKKSLAMLQLYI NKLDSQGKYTLFREHVTTNGLQDHS" BASE COUNT 631 a 292 c 377 g 537 t ORIGIN 1 agaagagcgg gctaagacgc cggaggaggt ggcggcggct gggagaggcg agggttctgg 61 ccgattttag catcgaaact aggagaaata agaatggctg tagaggaact tcagtctata 121 ataaagagat gtcaaatcct agaagagcaa gactttaaag aagaggattt tggcctattt 181 cagttagctg ggcaaagatg catagaagaa gggcacacag accagctatt agaaattatt 241 caaaatgaaa agaataaggt catcatcaag aatatgggct ggaatctcgt tggtcctgtt 301 gttcgatgcc ttttgtgtaa agataaagag gatagtaaaa gaaaagttta ttttttgatc 361 tttgatttat tggtaaagtt atgcaatcca aaggaattat tgttgggttt gcttgaactg 421 attgaagagc cctctggaaa acagatatcc caaagtattc ttcttttgct tcagccatta 481 caaacagtga ttcagaaact tcataacaag gcatattcaa ttggattagc attgtctacc 541 ctttggaatc agctatctct tcttcctgtt ccatactcaa aagaacaaat acaaatggat 601 gactatggcc tttgtcagtg ttgcaaggcc ttaatagagt tcactaagcc ttttgtggaa 661 gaagtcattg ataacaaaga aaactcactg gaaaatgaaa agttaaagga tgaattactg 721 aaattttgtt tcaaaagctt gaaatgccct ttgctgacag cacaattctt tgaacagtct 781 gaagaaggtg gaaatgatcc tttcaggtat tttgcatcag aaataatagg ttttttatca 841 gcaattggac accctttccc caaaatgatt tttaatcatg gaaggaaaaa gagaacttgg 901 aattaccttg aatttgaaga agaagaaaat aaacagttag cagactcaat ggcttctctg 961 gcatatctag tatttgtaca gggcatccat attgatcagc ttccaatggt cttaagccca 1021 ttgtaccttt tgcagtttaa tatggggcac attgaagtct ttttgcaaag aacagaagag 1081 tctgttatct ccaaaggatt ggagctgctg gagaatagtt tattgagaat agaagacaat 1141 agtctacttt accagtactt agaaatcaag agttttctta ctgtacctca gggcttagtg 1201 aaagtaatga cactttgccc cattgagaca ctgaggaaaa agagtttagc tatgcttcag 1261 ctgtatatta acaagttgga ttcacaaggc aaatatacat tatttagaga acacgtaaca 1321 acaaatggtt tacaggacca cagttgattt cccttcttga tttggtactt tttctcccag 1381 agggtgcaga aacagattta ctgcaaaact cagataggat tatggcttca ttaaatttat 1441 tgaggtattt ggttatcaaa gataatgaaa atgacaatca aactggatta tggacagaac 1501 ttggaaatat tgagaataat ttcttaaagc cacttcatat aggacttaat atgtcaaaag 1561 cacattatga aggcagaaat taaaaatagc caagaggccc agaaatctaa agatctttgt 1621 tctataactg taagtggaga agagatccct aatatgcctc ctgaaatgca gcttaaggtc 1681 ctgcattcag ctcttttcac atttgatttg attgaaagtg ttctagctcg agtggaagaa 1741 ctcattgaaa taaaaacaaa gtctacctct gaagaaaata ttgggataaa gtgaaagttc 1801 catttcctaa ataaaaacta ataaaatata gtacctc // LOCUS HSU73960 1077 bp mRNA PRI 02-JAN-1997 DEFINITION Human ADP-ribosylation factor-like protein 4 mRNA, complete cds. ACCESSION U73960 NID g1763290 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1077) AUTHORS Kim,J., Lee,Y., Lee,I., Kang,B., Han,Y., Kang,H. and Choe,I. TITLE Isolation and characterization of human cDNA homologous to rat ARL4 JOURNAL Thesis (1996) Immune Regulation Res. Unit, Korean Research Institute of Bioscience and Biotechnology REFERENCE 2 (bases 1 to 1077) AUTHORS Choe,I. TITLE Direct Submission JOURNAL Submitted (11-OCT-1996) Korean Research Institute of Bioscience and Biotechnology, Immune Regulation Res. Unit, Yoosung, Taijon, Korea, 305-600 FEATURES Location/Qualifiers source 1..1077 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="S01H01" /tissue_type="liver" /dev_stage="fetus" CDS 154..756 /codon_start=1 /product="ADP-ribosylation factor-like protein 4" /db_xref="PID:g1763291" /translation="MGNGLSDQTSILSNLPSFQSFHIVILGLDCAGKTTVLYRLQFNE FVNTVPTKGFNTEKIKVTLGNSKTVTFHFWDVGGQEKLRPLWKSYTRCTDGIVFVVDS VDVERMEEAKTELHKITRISENQGVPVLIVANKQDLRNSLSLSEIEKLLAMGELSSST PWHLQPTCAIIGDGLKEGLEKLHDMIIKRRKMLRQQKKKR" polyA_signal 1057..1062 polyA_site 1077 BASE COUNT 333 a 202 c 237 g 305 t ORIGIN 1 cttatccctg cgtagaaacg cctgccaatg ctttctcatt tggacccaga ctccagatcg 61 ggagcagtct tatagctgga tcagctacca agagaagttg taaaccaaga agagaaaagc 121 atttcaattt gggacattta tttgcacctg gaaatgggga atgggctgtc agaccagact 181 tctatcctgt ccaacctgcc ttcatttcag tctttccaca ttgttattct gggtttggac 241 tgtgctggaa agacaacagt cttatacagg ctgcagttca atgaatttgt aaataccgta 301 cctaccaaag gatttaacac tgagaaaatt aaggtaacct tgggaaattc taaaacagtc 361 acttttcact tctgggatgt aggtggtcag gagaaattaa ggccactgtg gaagtcatat 421 accagatgca cagatggcat tgtatttgtt gtggactctg ttgatgtcga aaggatggaa 481 gaagccaaaa ctgaacttca caaaataact aggatatcag aaaatcaggg agtccctgta 541 cttatagttg ctaacaaaca agatttgagg aactcattgt cactttcaga aattgagaaa 601 ttgttagcaa tgggtgaact gagctcatca actccttggc atttgcagcc tacctgtgca 661 atcataggag atggcctaaa ggaaggactt gagaaactac atgatatgat cattaaaaga 721 agaaaaatgt tgcggcaaca gaaaaagaaa agatgaatat caatacctat tatatctgtg 781 tggagtaggt tttctctggt ctgattttga caaatagaag agtgtctaca ccgtcctttg 841 cctgtctgcc ctcctggatg ctattaaagc tttgttttgt tgaacaatca gatgcccaac 901 tctgttgcct tgtggaagat gagtaaatgc agtgcttctt aaagtggtct cttctcccta 961 ccccacaaat cttttggtac taccatttgg ggaagccaag caaggatagt aaattgacca 1021 gaacacagtt gtgggaattt ggtctgaagt tagtgaaata aaactttaaa gagtgtc // LOCUS HSU74324 2392 bp mRNA PRI 05-NOV-1996 DEFINITION Human guanine nucleotide exchange factor mss4 mRNA, complete cds. ACCESSION U74324 NID g1658190 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2392) AUTHORS Mueller-Pillasch,F., Zimmerhackl,F., Lacher,U. and Gress,T.M. TITLE Cloning and chromosomal mapping of novel transcripts of human mss4 in pancreatic cancer JOURNAL Unpublished (1996) REFERENCE 2 (bases 1 to 2392) AUTHORS Mueller-Pillasch,F., Zimmerhackl,F., Lacher,U. and Gress,T.M. TITLE Direct Submission JOURNAL Submitted (15-OCT-1996) Internal Medicine I, Medizinische Klinik, Robert-Koch-Str.8, Ulm, Baden Wuerttemberg 89081, Germany FEATURES Location/Qualifiers source 1..2392 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1q32-41" /cell_line="pancreatic cancer cell line Patu8988t" CDS 4..375 /note="similar to mss4 from human brain deposited in GenBank Accession Number S78873; overexpressed in pancreatic cancer" /codon_start=1 /product="guanine nucleotide exchange factor mss4" /db_xref="PID:g1658191" /translation="MEPAEQPSELVSAEGRNRKAVLCQRCGSRVLQPGTALFSRRQLF LPSMRKKPALSDGSNPDGDLLQEHWLVEDMFIFENVGFTKDVGNIKFLVCADCEIGPI GWHCLDDKNSFYVALERVSHE" BASE COUNT 643 a 564 c 488 g 697 t ORIGIN 1 gcgatggaac cagcggagca gccgagcgag ttagtgtcag ccgagggccg aaaccggaag 61 gcggtgctgt gccagcgttg cggctcccgg gtgctgcagc cagggaccgc tctcttctct 121 cgccgacagc ttttccttcc ctccatgaga aagaagccag ctctgtctga cggcagcaat 181 cctgacggcg atctcctcca ggaacactgg ctggttgagg acatgttcat ttttgagaat 241 gtgggcttca ccaaggacgt gggcaacatc aagtttctgg tctgcgcaga ctgtgaaatt 301 ggaccaattg gctggcattg cctagatgac aagaacagtt tctatgtggc cttggaacga 361 gtttcccatg agtaactgag gggaggggta ctcagctcca tctccaaaga taaacctact 421 ccccacaaga actggccttt aatgtggtat aactgttccg ctgccttctt gtctgtgcca 481 atataaatac tgagtaccag catgtccatt tgaacatgca gagggttaat cctgcttcct 541 aaagcctcaa gtacatgcct cctgcttagt tcactttgta tcacatttcc taagctccct 601 tttcccccag ttttgggaca ctgtgcttac ctccaaaaat ctcatctctt ccctggcatt 661 ctccctaggc tctgttttgc ccaggggctc ccgctttttc ttgctctaga agagcagtat 721 tcaacctttt agctatgatg acacataaca aaagatgctt atgtactaat agttgaaatc 781 tgcctttttc tcattcaaga aggcatacaa atatctgaga gtgactttgt tgtatggcta 841 cccttgtgat ctacagtaat ttattctttc taaaagtaaa gcattttcaa aactcagtat 901 ttaaaccact aaccagaaac attactttgg atgcatctct aataccatgt ttgagcacct 961 ctgctctagg ttgagaatga cattttattg tgaagatggg tgtggtccct cttcccttga 1021 aatcttgtag tttcttttta tattagctcc tcactgctac agcctagaag gtgagaagca 1081 gattttaacc ccatctggca gccattcaag gaaaacccag ccctggctgt ttactcaggc 1141 tctcttagaa tgagagtgga gggtgtagga tatgagggtt aggcctttgc cacattatac 1201 aaaagtttat aatttgccac atctggacaa gtaactttct tcttttgttc ataggcaaga 1261 cttctttaat ggatagtatc atttactgaa cctactgggc atggtcctca tggagttttg 1321 gttcaactgg aatctctgtg ccaacccagg atacaaactt ccatccagat gcatggatac 1381 aaatttccat agctccgggg ctgctctaac agtttgatat cagcagacgt gtttaagcct 1441 tcaacttgat ttccaattat tccatcattt ctattctgaa atgactcact gtcgtagaag 1501 gaagtttatt atataaccaa ggtttcaggt atcctcctgt caccacctct actattcagg 1561 aggcctaaaa ttgaaataag tactaaacag aaaccttacc atctgaagtc ccttccattg 1621 ttttctagtt gcttcctcat tcctttaccc ccaaactttc taagggctcc ccgactgcag 1681 tgcaaggtgg ccttgggcac agttttgcaa ccaacctact cccctggaaa gatggcttcc 1741 ttctccaatt agcctgaatg agctttagta agcaattgag agaactgtgt ttttccgaca 1801 aacggtcaca tgtccatcgt tatggcatgt gcagaaaaac agagttatcc acacaagtca 1861 ggagcaaaac ccaagtagat gcctctaggg gcacatggcc cctcacatca caagccagaa 1921 cctaagctaa gcatttttta aattgagttt gagactagcc tgggcaacat agtgagaccc 1981 catctctatt taaaaaatga acaaaattac ctgggcatag tggtgtatac ctgtagtccc 2041 tgctacttgg gaggctcagg tgggaggatc acttgagccc caaagatggt ggctgcagta 2101 agccaagatc acaccactgg cactccagcc tgggcaacag agtgagaccc tgtctcaaac 2161 aaacaacaac agtaacaaca aaaaaatata tagcgacttg aataggaaac catagtattt 2221 cattgtttta atttgcattt attttatttc tagtgaaact tttatatacg tatcggtcat 2281 ttctatttcc ttttttgaga actgccaaat gttgccctag tagccaaaaa tatcaggttg 2341 ctattctgcc ttttcagttt gatctgaaag aaataaaggc atttaagcaa tg // LOCUS HSU74612 3492 bp mRNA PRI 06-MAR-1997 DEFINITION Human hepatocyte nuclear factor-3/fork head homolog 11A (HFH-11A) mRNA complete cds. ACCESSION U74612 NID g1842252 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3492) AUTHORS Ye,H., Kelly,T.F., Samadani,U., Lim,L., Rubio,S., Overdier,D.G., Roebuck,K.A. and Costa,R.H. TITLE Hepatocyte nuclear factor 3/fork head homolog 11 is expressed in proliferating epithelial and mesenchymal cells of embryonic and adult tissues JOURNAL Mol. Cell. Biol. 17 (3), 1626-1641 (1997) MEDLINE 97184488 REFERENCE 2 (bases 1 to 3492) AUTHORS Ye,H., Kelly,T.F., Samadani,U., Lim,L., Rubio,R., Overdier,D.G., Roebuck,K.A. and Costa,R.H. TITLE Direct Submission JOURNAL Submitted (15-OCT-1996) Biochemistry, University of Illinois at Chicago, 1819 West Polk Street, Chicago, IL 60612-7334, USA FEATURES Location/Qualifiers source 1..3492 /organism="Homo sapiens" /note="Embryonic expression pattern: liver, lung, intestine, kindey, urinary tract; Adult expression pattern: intestine (crypts), colon, testis (spermatocytes and spermatids), and thymus" /db_xref="taxon:9606" /tissue_type="colon carcinoma" /cell_line="HT-29 and Caco-2" /note="Induced during liver regeneration - present in HepG2, HeLa, A549 and H441 cell lines" gene 115..2520 /gene="HFH-11A" CDS 115..2520 /gene="HFH-11A" /note="winged helix transcription factor" /codon_start=1 /product="hepatocyte nuclear factor-3/fork head homolog 11A" /db_xref="PID:g1842253" /translation="MKTSPRRPLILKRRRLPLPVQNAPSETSEEEPKRSPAQQESNQA EASKEVAESNSCKFPAGIKIINHPTMPNTQVVAIPNNANIHSIITALTAKGKESGSSG PNKFILISCGGAPTQPPGLRPQTQTSYDAKRTEVTLETLGPKPAARDVNLPRPPGALC EQKRETCADGEAAGCTINNSLSNIQWLRKMSSDGLGSRSIKQEMEEKENCHLEQRQVK VEEPSRPSASWQNSVSERPPYSYMAMIQFAINSTERKRMTLKDIYTWIEDHFPYFKHI AKPGWKNSIRHNLSLHDMFVRETSANGKVSFWTIHPSANRYLTLDQVFKPLDPGSPQL PEHLESQQKRPNPELRRNMTIKTELPLGARRKMKPLLPRVSSYLVPIQFPVNQSLVLQ PSVKVPLPLAASLMSSELARHSKRVRIAPKVFGEQVVFGYMSKFFSGDLRDFGTPITS LFNFIFLCLSVLLAEEGIAPLSSAGPGKEEKLLFGEGFSPLLPVQTIKEEEIQPGEEM PHLARPIKVESPPLEEWPSPAPSFKEESSHSWEDSSQSPTPRPKKSYSGLRSPTRCVS EMLVIQHRERRERSRSRRKQHLLPPCVDEPELLFSEGPSTSRWAAELPFPADSSDPAS QLSYSQEVGGPFKTPIKETLPISSTPSKSVLPRTPESWRLTPPAKVGGLDFSPVQTSQ GASDPLPDPLGLMDLSTTPLQSAPPLESPQRLLSSEPLDLISVPFGNSSPSDIDVPKP GSPEPQVSGLAANRSLTEGLVLDTMNDSLSKILLDISFPGLDEDPLGPDNINWSQFIP ELQ" misc_feature 820..1155 /gene="HFH-11A" /note="encodes a winged helix DNA binding domain" exon 1090..1134 /gene="HFH-11A" /note="exon A1" /number=1 exon 1384..1497 /gene="HFH-11A" /note="exon A2" /number=2 polyA_site 3475 BASE COUNT 856 a 1010 c 867 g 759 t ORIGIN 1 ggttggagga gcccggagcc cgccttcgga gctacggcct aacggcggcg gcgactgcag 61 tctggagggt ccacacttgt gattctcaat ggagagtgaa aacgcagatt cataatgaaa 121 actagccccc gtcggccact gattctcaaa agacggaggc tgccccttcc tgttcaaaat 181 gccccaagtg aaacatcaga ggaggaacct aagagatccc ctgcccaaca ggagtctaat 241 caagcagagg cctccaagga agtggcagag tccaactctt gcaagtttcc agctgggatc 301 aagattatta accaccccac catgcccaac acgcaagtag tggccatccc caacaatgct 361 aatattcaca gcatcatcac agcactgact gccaagggaa aagagagtgg cagtagtggg 421 cccaacaaat tcatcctcat cagctgtggg ggagccccaa ctcagcctcc aggactccgg 481 cctcaaaccc aaaccagcta tgatgccaaa aggacagaag tgaccctgga gaccttggga 541 ccaaaacctg cagctaggga tgtgaatctt cctagaccac ctggagccct ttgcgagcag 601 aaacgggaga cctgtgcaga tggtgaggca gcaggctgca ctatcaacaa tagcctatcc 661 aacatccagt ggcttcgaaa gatgagttct gatggactgg gctcccgcag catcaagcaa 721 gagatggagg aaaaggagaa ttgtcacctg gagcagcgac aggttaaggt tgaggagcct 781 tcgagaccat cagcgtcctg gcagaactct gtgtctgagc ggccacccta ctcttacatg 841 gccatgatac aattcgccat caacagcact gagaggaagc gcatgacttt gaaagacatc 901 tatacgtgga ttgaggacca ctttccctac tttaagcaca ttgccaagcc aggctggaag 961 aactccatcc gccacaacct ttccctgcac gacatgtttg tccgggagac gtctgccaat 1021 ggcaaggtct ccttctggac cattcacccc agtgccaacc gctacttgac attggaccag 1081 gtgtttaagc cactggaccc agggtctcca caattgcccg agcacttgga atcacagcag 1141 aaacgaccga atccagagct ccgccggaac atgaccatca aaaccgaact ccccctgggc 1201 gcacggcgga agatgaagcc actgctacca cgggtcagct catacctggt acctatccag 1261 ttcccggtga accagtcact ggtgttgcag ccctcggtga aggtgccatt gcccctggcg 1321 gcttccctca tgagctcaga gcttgcccgc catagcaagc gagtccgcat tgcccccaag 1381 gtttttgggg aacaggtggt gtttggttac atgagtaagt tctttagtgg cgatctgcga 1441 gattttggta cacccatcac cagcttgttt aattttatct ttctttgttt atcagtgctg 1501 ctagctgagg aggggatagc tcctctttct tctgcaggac cagggaaaga ggagaaactc 1561 ctgtttggag aagggttttc tcctttgctt ccagttcaga ctatcaagga ggaagaaatc 1621 cagcctgggg aggaaatgcc acacttagcg agacccatca aagtggagag ccctcccttg 1681 gaagagtggc cctccccggc cccatctttc aaagaggaat catctcactc ctgggaggat 1741 tcgtcccaat ctcccacccc aagacccaag aagtcctaca gtgggcttag gtccccaacc 1801 cggtgtgtct cggaaatgct tgtgattcaa cacagggaga ggagggagag gagccggtct 1861 cggaggaaac agcatctact gcctccctgt gtggatgagc cggagctgct cttctcagag 1921 gggcccagta cttcccgctg ggccgcagag ctcccgttcc cagcagactc ctctgaccct 1981 gcctcccagc tcagctactc ccaggaagtg ggaggacctt ttaagacacc cattaaggaa 2041 acgctgccca tctcctccac cccgagcaaa tctgtcctcc ccagaacccc tgaatcctgg 2101 aggctcacgc ccccagccaa agtaggggga ctggatttca gcccagtaca aacctcccag 2161 ggtgcctctg accccttgcc tgaccccctg gggctgatgg atctcagcac cactcccttg 2221 caaagtgctc ccccccttga atcaccgcaa aggctcctca gttcagaacc cttagacctc 2281 atctccgtcc cctttggcaa ctcttctccc tcagatatag acgtccccaa gccaggctcc 2341 ccggagccac aggtttctgg ccttgcagcc aatcgttctc tgacagaagg cctggtcctg 2401 gacacaatga atgacagcct cagcaagatc ctgctggaca tcagctttcc tggcctggac 2461 gaggacccac tgggccctga caacatcaac tggtcccagt ttattcctga gctacagtag 2521 agccctgccc ttgcccctgt gctcaagctg tccaccatcc cgggcactcc aaggctcagt 2581 gcaccccaag cctctgagtg aggacagcag gcagggactg ttctgctcct catagctccc 2641 tgctgcctga ttatgcaaaa gtagcagtca caccctagcc actgctggga ccttgtgttc 2701 cccaagagta tctgattcct ctgctgtccc tgccaggagc tgaagggtgg gaacaacaaa 2761 ggcaatggtg aaaagagatt aggaaccccc cagcctgttt ccattctctg cccagcagtc 2821 tcttaccttc cctgatcttt gcagggtggt ccgtgtaaat agtataaatt ctccaaatta 2881 tcctctaatt ataaatgtaa gcttatttcc ttagatcatt atccagagac tgccagaagg 2941 tgggtaggat gacctggggt ttcaattgac ttctgttcct tgcttttagt tttgatagaa 3001 gggaagacct gcagtgcacg gtttcttcca ggctgaggta cctggatctt gggttcttca 3061 ctgcagggac ccagacaagt ggatctgctt gccagagtcc tttttgcccc tccctgccac 3121 ctccccgtgt ttccaagtca gctttcctgc aagaagaaat cctggttaaa aaagtctttt 3181 gtattgggtc aggagttgaa tttggggtgg gaggatggat gcaactgaag cagagtgtgg 3241 gtgcccagat gtgcgctatt agatgtttct ctgataatgt ccccaatcat accagggaga 3301 ctggcattga cgagaactca ggtggaggct tgagaaggcc gaaagggccc ctgacctgcc 3361 tggcttcctt agcttgcccc tcagctttgc aaagagccac cctaggcccc agctgaccgc 3421 atgggtgtga gccagcttga gaacactaac tactcaataa aagcgaaggt ggacaaaaaa 3481 aaaaaaaaaa aa // LOCUS HSU74667 2215 bp mRNA PRI 05-NOV-1996 DEFINITION Human tat interactive protein (TIP60) mRNA, complete cds. ACCESSION U74667 NID g1657981 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 271 to 2215) AUTHORS Kamine,J., Elangovan,B., Subramanian,T., Coleman,D. and Chinnadurai,G. TITLE Identification of a cellular protein that specifically interacts with the essential cysteine region of the HIV-1 Tat transactivator JOURNAL Virology 216 (2), 357-366 (1996) MEDLINE 96182937 REFERENCE 2 (bases 1 to 2215) AUTHORS Elangovan,B., Boyd,J.M., Subramanian,T. and Chinnadurai,G. TITLE Full cDNA sequence for the Tat interacting protein Tip60 JOURNAL Unpublished REFERENCE 3 (bases 1 to 2215) AUTHORS Elangovan,B., Boyd,J.M., Subramanian,T. and Chinnadurai,G. TITLE Direct Submission JOURNAL Submitted (07-OCT-1996) Institute for Molecular Virology, St. Louis University Health Sciences Center, 3681 Park Avenue, St. Louis, MO 63110, USA FEATURES Location/Qualifiers source 1..2215 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" misc_feature 1..355 /note="similar to EST with GenBank Accession Number T09055" gene 242..2005 /gene="TIP60" CDS 242..1783 /gene="TIP60" /note="interacts with HIV1 Tat; similar to acetyltransferase; similar to yeast SAS2, SAS3 and human MOZ, encoded by GenBank Accession Numbers U14548, Z23261 and U47742, respectively; similar to sequence with GenBank Accession Number U40989" /codon_start=1 /product="tat interactive protein" /db_xref="PID:g1657982" /translation="MAEVGEIIEGCRLPVLRRNQDNEDEWPLAEILSVKDISGRKLFY VHYIDFNKRLDEWVTHERLDLKKIQFPKKEAKTPTKNGLPGSRPGSPEREVPASAQAS GKTLPIPVQITLRFNLPKEREAIPGGEPDQPLSSSSCLQPNHRSTKRKVEVVSPATPV PSETAPASVFPQNGAARRAVAAQPGRKRKSNCLGTDEDSQDSSDGIPSAPRMTGSLVS DRSHDDIVTRMKNIECIELGRHRLKPWYFSPYPQELTTLPVLYLCEFCLKYGRSLKCL QRHLTKCDLRHPPGNEIYRKGTISFFEIDGRKNKSYSQNLCLLAKCFLDHKTLYYDTD PFLFYVMTEYDCKGFHIVGYFSKEKESTEDYNVACILTLPPYQRRGYRKLLIEFSYEL SKVEGKTGTPEKPLSDLGLLSYRSYWSQTILEILMGLKSESGERPQITINEISEITSI KKEDVISTLQYLNLINYYKGQYILTLSEDIVDGHERAMLKRLLRIDSKCLHFTPKDWS KRGKW" BASE COUNT 529 a 644 c 624 g 418 t ORIGIN 1 actcagtaga ccgccactgg ctgtgcacgt tatggggttt ccacctaggg ctcggcctga 61 ggcttgtaac actccgtttt cccccgagtc acaggggcag tcttgcccct cgcagctggg 121 tcgcggtgtc tctcaaaggt ccccctctac aggggcttcg tgaggcccgg gcccacaggg 181 cgctcggtcc cggaagtgac gtctcccaga ggggccggaa gtggcagtgg agggagggaa 241 gatggcggag gtgggggaga taatcgaggg ctgccgccta cccgtgctgc ggcggaacca 301 ggacaacgaa gatgagtggc ccctggccga gatcctgagc gtgaaggaca tcagtggccg 361 gaagcttttc tacgtccatt acattgactt caacaaacgt ctggatgaat gggtgacgca 421 tgagcggctg gacctaaaga agatccagtt ccccaagaaa gaggccaaga cccccactaa 481 gaacggactt cctgggtccc gtcctggctc tccagagaga gaggtgccgg cctcggcgca 541 ggccagcggg aagaccttgc caatcccggt ccagatcaca ctccgcttca acctgcccaa 601 ggagcgggag gccattcccg gtggcgagcc tgaccagccg ctctcctcca gctcctgcct 661 gcagcccaac caccgctcaa cgaaacggaa ggtggaggtg gtttcaccag caactccagt 721 gcccagcgag acagccccgg cctcggtttt tccccagaat ggagccgccc gtagggcagt 781 ggcagcccag ccaggacgga agcgaaaatc gaattgtttg ggcactgatg aggactccca 841 ggacagctct gatggaatac cgtcagcacc acgcatgact ggcagcctgg tgtctgatcg 901 aagccacgac gacatcgtca cccggatgaa gaacattgag tgcattgagc tgggccggca 961 ccgcctcaag ccgtggtact tctccccgta cccacaggaa ctcaccacat tgcctgtcct 1021 ctacctgtgc gagttctgcc tcaagtacgg ccgtagtctc aagtgtcttc agcgtcattt 1081 gaccaagtgt gacctacgac atcctccagg caatgagatt taccgcaagg gcaccatctc 1141 cttctttgag attgatggac gtaagaacaa gagttattcc cagaacctgt gtcttttggc 1201 caagtgtttc cttgaccata agacactgta ctatgacaca gaccctttcc tcttctacgt 1261 catgacagag tatgactgta agggcttcca catcgtgggc tacttctcca aggagaaaga 1321 atcaacggaa gactacaatg tggcctgcat cctaaccctg cctccctacc agcgccgggg 1381 ctaccggaag ctgctgatcg agttcagcta tgaactctcc aaagtggaag ggaaaacagg 1441 gacccctgag aagcccctct cagaccttgg cctcctatcc tatcgaagct actggtccca 1501 gaccatcctg gagatcctga tggggctgaa gtcggagagc ggggagaggc cacagatcac 1561 catcaatgag attagtgaaa tcaccagcat caagaaggag gatgtcatct ccactctgca 1621 gtacctcaat ctcatcaact actacaaggg ccagtacatc ctcacactgt cagaggacat 1681 cgtggatggc catgagcggg ccatgctcaa gcggctcctg cggatcgact ccaagtgtct 1741 gcacttcact cccaaggact ggagcaagag ggggaagtgg tgaccagaca ctgcccactg 1801 cagtgccaag acggcagcag gactggggct gatagcccac cccgccccca ctgcagctcc 1861 cacaaagcac tctaagggag atggggctga ggacagctca aaaaggagag gacaggcctg 1921 cagggcccac ttgcccagca ccaaggcgag ctccgggctc agaccaactc caaggtcagc 1981 tggccacagc ccaggcctcc tctgaagcag ggaccagagg gagccaggca gctgtgtaca 2041 gtgagaaggg atccggatgg gggagctctg tacagagggc tggtgattgt aaaaatttct 2101 tttgtaaagt agaagttggg ggtggggtgg gtgctggctg caaaaatttc tggcttctct 2161 tacccctatt gcccccggca ataaattgtt tctatatgcc aaaaaaaaaa aaaaa // LOCUS HSU75283 1641 bp mRNA PRI 26-MAR-1997 DEFINITION Human sigma receptor mRNA, complete cds. ACCESSION U75283 NID g1906590 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1641) AUTHORS Kekuda,R., Prasad,P.D., Fei,Y.J., Leibach,F.H. and Ganapathy,V. TITLE Cloning and functional expression of the human type 1 sigma receptor (hSigmaR1) JOURNAL Biochem. Biophys. Res. Commun. 229 (2), 553-558 (1996) MEDLINE 97127440 REFERENCE 2 (bases 1 to 1641) AUTHORS Kekuda,R., Prasad,P.D., Fei,Y.J., Leibach,F.H. and Ganapathy,V. TITLE Direct Submission JOURNAL Submitted (16-OCT-1996) Biochemistry & Molecular Biology, Medical College of Georgia, 1120 15th Street, Augusta, GA 30912-2100, USA FEATURES Location/Qualifiers source 1..1641 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="choriocarcinoma" /cell_line="JAR" CDS 48..719 /codon_start=1 /product="sigma receptor" /db_xref="PID:g1783387" /translation="MQWAVGRRWAWAALLLAVAAVLTQVVWLWLGTQSFVFQREEIAQ LARQYAGLDHELAFSRLIVELRRLHPGHVLPDEELQWVFVNAGGWMGAMCLLHASLSE YVLLFGTALGSRGHSGRYWAEISDTIISGTFHQWREGTTKSEVFYPGETVVHGPGEAT AVEWGPNTWMVEYGRGVIPSTLAFALADTVFSTQDFLTLFYTLRSYARGLRLELTTYL FGQDP" BASE COUNT 344 a 502 c 448 g 347 t ORIGIN 1 ggccccggct ccctcctgag ctgcgccgtg ccaggccgcc cgccgggatg cagtgggccg 61 tgggccggcg gtgggcgtgg gccgcgctgc tcctggctgt cgcagcggtg ctgacccagg 121 tcgtctggct ctggctgggt acgcagagct tcgtcttcca gcgcgaagag atagcgcagt 181 tggcgcggca gtacgctggg ctggaccacg agctggcctt ctctcgtctg atcgtggagc 241 tgcggcggct gcacccaggc cacgtgctgc ccgacgagga gctgcagtgg gtgttcgtga 301 atgcgggtgg ctggatgggc gccatgtgcc ttctgcacgc ctcgctgtcc gagtatgtgc 361 tgctcttcgg caccgccttg ggctcccgcg gccactcggg gcgctactgg gctgagatct 421 cggataccat catctctggc accttccacc agtggagaga gggcaccacc aaaagtgagg 481 tcttctaccc aggggagacg gtagtacacg ggcctggtga ggcaacagct gtggagtggg 541 ggccaaacac atggatggtg gagtacggcc ggggcgtcat cccatccacc ctggccttcg 601 cgctggccga cactgtcttc agcacccagg acttcctcac cctcttctat actcttcgct 661 cctatgctcg gggcctccgg cttgagctca ccacctacct ctttggccag gacccttgac 721 cagccaggcc tgaaggaaga cctgcggatg gacaggagcg ggcaggcccg cacatatcca 781 cttgctggag cccatgttta cagacaggga catacaccat gcagatcctg agttcctgct 841 gtatgagcag ggatatccat gcttatgtat ccaaacacag agacccatgg gaacaaatga 901 gacacatata gatactgaga cctgtgtgta cagtaggacc atgcactcac acccatctgg 961 agagggagcc cccggtatac caagggagcc agttgtgttc agacacacac atcacagctt 1021 gactcactaa ctgaggcctt tccatagctc cacagcttcc cacctcctcc ccaccaaacc 1081 ggggttctag agttaaggat gggggagggt attatactgc ctcagtctga ctcctcaacc 1141 cagcagcaat ttgaggggat gagggggaag aggagctgcc ttttggaggc ccccttcacc 1201 tgcagctatg atgcccttcc ccttctcccc tgtcctcacc atatgcctta tccccattct 1261 actcccctgc tatgcaagtg cccctgtggc ttgtccccaa ccccctcagc aacaaagctc 1321 agctggggaa cgagagtaat ttgaagaatg cttgaagtca gcgtcttcca ttccagaaag 1381 acccccattc ttcctttggg ggtatgatgt ggaagctggt ttcagcccag gacccaccac 1441 tgaggagagg atctagacag gtgggcctaa ttccaagggg cccttcctgg cctggagaag 1501 gccttttaca cacacacaac acatacacac acacacacac acacatatca cagttttcac 1561 acagcccctg ctgcattctc tgtccatctg tctgtttcta ttaataaaga tttgttgatc 1621 tgttaaaaaa aaaaaaaaaa a // LOCUS HSU75329 2479 bp mRNA PRI 10-OCT-1997 DEFINITION Human serine protease mRNA, complete cds. ACCESSION U75329 NID g2507612 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2479) AUTHORS Paoloni-Giacobino,A., Chen,H., Peitsch,M.C., Rossier,C. and Antonarakis,S.E. TITLE Cloning of the TMPRSS2 gene, which encodes a novel serine protease with transmembrane, LDLRA, and SRCR domains and maps to 21q22.3 JOURNAL Genomics 44 (3), 309-320 (1997) MEDLINE 97468144 REFERENCE 2 (bases 1 to 2479) AUTHORS Paoloni-Giacobino,A., Chen,H. and Antonarakis,S.E. TITLE Direct Submission JOURNAL Submitted (17-OCT-1996) Medical Genetics, University of Geneva Medical School, 1 Michel-Servet Street, Geneva 1211, Switzerland FEATURES Location/Qualifiers source 1..2479 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="21" /map="21q22.3" CDS 57..1535 /codon_start=1 /product="serine protease" /db_xref="PID:g2507613" /translation="MALNSGSPPAIGPYYENHGYQPENPYPAQPTVVPTVYEVHPAQY YPSPVPQYAPRVLTQASNPVVCTQPKSPSGTVCTSKTKKALCITLTLGTFLVGAALAA GLLWKFMGSKCSNSGIECDSSGTCINPSNWCDGVSHCPGGEDENRCVRLYGPNFILQM YSSQRKSWHPVCQDDWNENYGRAACRDMGYKNNFYSSQGIVDDSGSTSFMKLNTSAGN VDIYKKLYHSDACSSKAVVSLRCLACGVNLNSSRQSRIVGGESALPGAWPWQVSLHVQ NVHVCGGSIITPEWIVTAAHCVEKPLNNPWHWTAFAGILRQSFMFYGAGYQVQKVISH PNYDSKTKNNDIALMKLQKPLTFNDLVKPVCLPNPGMMLQPEQLCWISGWGATEEKGK TSEVLNAAKVLLIETQRCNSRYVYDNLITPAMICAGFLQGNVDSCQGDSGGPLVTSNN NIWWLIGDTSWGSGCAKAYRPGVYGNVMVFTDWIYRQMKANG" BASE COUNT 578 a 650 c 677 g 574 t ORIGIN 1 gtcatattga acattccaga tacctatcat tactcgatgc tgttgataac agcaagatgg 61 ctttgaactc agggtcacca ccagctattg gaccttacta tgaaaaccat ggataccaac 121 cggaaaaccc ctatcccgca cagcccactg tggtccccac tgtctacgag gtgcatccgg 181 ctcagtacta cccgtccccc gtgccccagt acgccccgag ggtcctgacg caggcttcca 241 accccgtcgt ctgcacgcag cccaaatccc catccgggac agtgtgcacc tcaaagacta 301 agaaagcact gtgcatcacc ttgaccctgg ggaccttcct cgtgggagct gcgctggccg 361 ctggcctact ctggaagttc atgggcagca agtgctccaa ctctgggata gagtgcgact 421 cctcaggtac ctgcatcaac ccctctaact ggtgtgatgg cgtgtcacac tgccccggcg 481 gggaggacga gaatcggtgt gttcgcctct acggaccaaa cttcatcctt cagatgtact 541 catctcagag gaagtcctgg caccctgtgt gccaagacga ctggaacgag aactacgggc 601 gggcggcctg cagggacatg ggctataaga ataattttta ctctagccaa ggaatagtgg 661 atgacagcgg atccaccagc tttatgaaac tgaacacaag tgccggcaat gtcgatatct 721 ataaaaaact gtaccacagt gatgcctgtt cttcaaaagc agtggtttct ttacgctgtt 781 tagcctgcgg ggtcaacttg aactcaagcc gccagagcag gatcgtgggc ggtgagagcg 841 cgctcccggg ggcctggccc tggcaggtca gcctgcacgt ccagaacgtc cacgtgtgcg 901 gaggctccat catcaccccc gagtggatcg tgacagccgc ccactgcgtg gaaaaacctc 961 ttaacaatcc atggcattgg acggcatttg cggggatttt gagacaatct ttcatgttct 1021 atggagccgg ataccaagta caaaaagtga tttctcatcc aaattatgac tccaagacca 1081 agaacaatga cattgcgctg atgaagctgc agaagcctct gactttcaac gacctagtga 1141 aaccagtgtg tctgcccaac ccaggcatga tgctgcagcc agaacagctc tgctggattt 1201 ccgggtgggg ggccaccgag gagaaaggga agacctcaga agtgctgaac gctgccaagg 1261 tgcttctcat tgagacacag agatgcaaca gcagatatgt ctatgacaac ctgatcacac 1321 cagccatgat ctgtgccggc ttcctgcagg ggaacgtcga ttcttgccag ggtgacagtg 1381 gagggcctct ggtcacttcg aacaacaata tctggtggct gataggggat acaagctggg 1441 gttctggctg tgccaaagct tacagaccag gagtgtacgg gaatgtgatg gtattcacgg 1501 actggattta tcgacaaatg aaggcaaacg gctaatccac atggtcttcg tccttgacgt 1561 cgttttacaa gaaaacaatg gggctggttt tgcttccccg tgcatgattt actcttagag 1621 atgattcaga ggtcacttca tttttattaa acagtgaact tgtctggctt tggcactctc 1681 tgccatactg tgcaggctgc agtggctccc ctgcccagcc tgctctccct aaccccttgt 1741 ccgcaagggg tgatggccgg ctggttgtgg gcactggcgg tcaattgtgg aaggaagagg 1801 gttggaggct gcccccattg agatcttcct gctgagtcct ttccaggggc caattttgga 1861 tgagcatgga gctgtcactt ctcagctgct ggatgacttg agatgaaaaa ggagagacat 1921 ggaaagggag acagccaggt ggcacctgca gcggctgccc tctggggcca cttggtagtg 1981 tccccagcct acttcacaag gggattttgc tgatgggttc ttagagcctt agcagccctg 2041 gatggtggcc agaaataaag ggaccagccc ttcatgggtg gtgacgtggt agtcacttgt 2101 aaggggaaca gaaacatttt tgttcttatg gggtgagaat atagacagtg cccttggtgc 2161 gagggaagca attgaaaagg aacttgccct gagcactcct ggtgcaggtc tccacctgca 2221 cattgggtgg ggctcctggg agggagactc agccttcctc ctcatcctcc ctgaccctgc 2281 tcctagcacc ctggagagtg aatgcccctt ggtccctggc agggcgccaa gtttggcacc 2341 atgtcggcct cttcaggcct gatagtcatt ggaaattgag gtccatgggg gaaatcaagg 2401 atgctcagtt taaggtacac tgtttccatg ttatgtttct acacattgat ggtggtgacc 2461 ctgagttcaa agccatctt // LOCUS HSU75330 4723 bp mRNA PRI 10-OCT-1997 DEFINITION Human neural cell adhesion protein (NCAM21) mRNA, complete cds. ACCESSION U75330 NID g2507614 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4723) AUTHORS Paoloni-Giacobino,A., Chen,H. and Antonarakis,S.E. TITLE Cloning of a novel human neural cell adhesion molecule gene (NCAM2) that maps to chromosome region 21q21 and is potentially involved in Down syndrome JOURNAL Genomics 43 (1), 43-51 (1997) MEDLINE 97369930 REFERENCE 2 (bases 1 to 4723) AUTHORS Paoloni-Giacobino,A., Chen,H. and Antonarakis,S.E. TITLE Direct Submission JOURNAL Submitted (17-OCT-1996) Medical Genetics, University of Geneva Medical School, 1 Michel-Servet Street, Geneva 1211, Switzerland FEATURES Location/Qualifiers source 1..4723 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="21" /map="21q21" gene 1..4723 /gene="NCAM21" CDS 70..2583 /gene="NCAM21" /codon_start=1 /product="neural cell adhesion protein" /db_xref="PID:g2507615" /translation="MSLLLSFYLLGLLVSSGQALLQVTISLSKVELSVGESKFFTCTA IGEPRSIDWYNPQGEKIISTQRVVVQKGGVRSRLTIYNANIEDAGIYRCQATDAKGQT QEATVVLEIYQKLTFREVVSPQEFKQGEDAEVVCRVSSSPAPAVSWLYHNEEVTTISD NRLAMLANNNLQILNINKSDEGIYRCEGRVEARGEIDFRDIIVIVNVPPAISMPQKSF NATAERGEEMTFSCRASGSPEPAISWFRNGKLIEENEKYILKGSNTELTVRNIINSDG GPYVCRATNKAGEDEKQAFLQVFVQPHIIQLKNETTYENGQVTLVCDAEGEPIPEITW KRAVDGFTFTEGDKSPDGRIEVKGQHGSSSLHIKDVKLSGSGRYDCEAASRIGGHQKS MYLDIEYAPKFISNQTIYYSWEGNPINISCDVKSNPPASIHWRRDKLVLPAKNTTNLK TYSTGRKMILEIAPTSDNDFGRYNCTATNHIGTRFQEYILALADVPSSPYGVKIIELS QTTAKVSFNKPDSHGGVPIHHYQVDVKEVASEIWKIVRSHGVQTMVVLNNLEPNTTYE IRVAAVNGKGQGDYSKIEIFQTLPVREPSPPSIHGQPSSGKSFKLSITKQDDGGAPIL EYIVKYRSKDKEDQWLEKKVQGNKDHIILEHLQWTMGMKFRLPAANRLGYSEPTVYEF SMPPKPNIIKDTLFNGLGLGAVIGLGVAALLLILVVTDVSCFFIRQCGLLMCITRRMC GKKSGSSGKSKELEEGKAAYLKDGSKEPIVEMRTEDERVTNHEDGSPVNEPNETTPLT EPEKLPLKEEDGKEALNPETIEIKVSNDIIQSKEDDSKA" BASE COUNT 1583 a 826 c 971 g 1343 t ORIGIN 1 gaattccggt ctctctccag ggctggactt aataactttg aaactgtcca ccggtgtcac 61 gtcctgaaca tgagcctcct cctctccttc tacctgctgg ggttgcttgt cagtagcggg 121 caagctcttc ttcaagtgac aatttcactt agcaaagtag agcttagtgt tggagaatct 181 aaattcttca catgtacagc gattggtgaa cctagaagta ttgattggta taatcctcaa 241 ggagagaaga taatttcaac acagagggta gtagtgcaaa agggaggtgt taggtcacgg 301 ttaaccatct acaatgcaaa tatagaagat gcagggatat atcgttgtca agcaacagat 361 gccaaaggac aaacacaaga agctacagta gttttggaaa tttaccaaaa actcactttc 421 agagaagtgg tatctccaca agaattcaaa caaggagaag atgcagaagt ggtttgccga 481 gttagcagtt cacctgcacc tgctgtcagc tggttgtatc ataatgagga agtcaccact 541 atttccgaca atcggctcgc tatgttagca aacaataacc tgcagattct caacatcaat 601 aaaagtgatg aaggtatata cagatgtgaa ggaagagtgg aggccagggg agaaattgac 661 ttccgtgata tcattgttat tgttaatgtg ccgccagcaa tctctatgcc tcagaaatct 721 tttaatgcca cagcagagag aggagaagaa atgacatttt cctgcagggc ctcaggctct 781 ccagaacccg ccatctcctg gttcaggaat ggcaagctca ttgaagaaaa tgagaagtac 841 atattgaaag ggagcaatac agaactcact gtcaggaaca taatcaatag tgatggtggt 901 ccttatgtct gcagggccac aaataaggca ggagaagatg aaaagcaagc tttcctccaa 961 gtctttgtac agcctcacat aatacagctt aaaaatgaaa ctacatatga gaatggtcaa 1021 gtcacactcg tatgtgatgc ggaaggggag cctattccag aaatcacttg gaaaagagct 1081 gtggatggct tcacgttcac tgaaggcgat aagagcccgg acggccgtat cgaagtcaaa 1141 gggcagcatg gaagctcatc actgcatatt aaagatgtga agttgtcagg ctcagggaga 1201 tatgactgtg aagctgcaag cagaattgga gggcatcaaa agagcatgta ccttgatatt 1261 gaatatgccc ccaagtttat atcaaaccaa acaatttatt actcttggga aggaaatcct 1321 atcaatataa gttgtgatgt gaaatcgaat ccaccagcat caattcactg gagaagagat 1381 aaattagtct tacctgctaa aaacacgacc aatttaaaga cttatagtac aggaagaaag 1441 atgatattag agattgcacc tacatctgac aatgactttg gacgctataa ttgcacagcc 1501 actaatcata taggaacaag atttcaagaa tatattcttg ctttggctga cgtgccatcc 1561 agtccctatg gagtgaagat catagagctg tcgcagacca cggccaaggt ttccttcaac 1621 aaaccggact cccatggagg tgtacctatt catcactatc aggtggatgt caaagaagta 1681 gcgtcagaaa tctggaaaat tgtacgctcc catggagttc aaacaatggt tgttttgaac 1741 aacctggaac caaatacaac ttatgaaatt agggttgcag ctgtaaatgg aaagggacaa 1801 ggagactaca gtaaaataga aatcttccaa acattaccag ttcgtgaacc aagtcctcca 1861 tccatacatg gacagccaag cagtggaaag agctttaaac tcagcatcac caaacaggac 1921 gatggagggg cccctatttt ggaatacatt gtgaaatata gaagtaaaga taaggaagac 1981 caatggctag agaaaaaagt gcaaggaaat aaagaccaca tcattttgga gcatctccag 2041 tggaccatgg ggatgaagtt cagattacca gctgccaata gattgggata ttctgaaccg 2101 acagtttatg aattcagcat gccaccaaag cccaacatta ttaaagacac gctgtttaat 2161 ggtcttgggc ttggagcagt aattggcctg ggagttgctg cactgctgct aattcttgtg 2221 gtaacagacg tcagctgctt ctttattcgg caatgtgggt tgctgatgtg catcactagg 2281 agaatgtgtg gaaagaaaag tggctccagt ggcaaaagta aagaactcga agaaggaaaa 2341 gctgcatacc tgaaagatgg atcaaaagaa ccaatagtgg agatgagaac agaggatgaa 2401 agagttacta atcacgaaga tgggagccca gtaaatgagc caaatgaaac cacaccactg 2461 acagaacctg aaaaattgcc tttaaaggaa gaagatggga aagaagctct aaatccagaa 2521 actatagaaa ttaaagtttc taacgacatc attcaatcaa aagaagacga cagcaaagca 2581 taacaacaat attacagggg cttgaacaac actacgaaga gtatttggat tgcgtgaccc 2641 tatgaccaaa actattccat tgaccttaat ttcttgggaa acttctagct tggaatagct 2701 tgtacacata tacatatgat caaatactcc tggcccatgg atccattccc ttttgttatt 2761 gttgttgttg ttgctgttgt tgttaatttt gttaagaatt tcaatatcaa gactgactgg 2821 caccaacact ttggtattca atttgattct atgactgaag tactggaatt tattatgtgg 2881 ctaaagtgct ctatttatta agaactatat ttaataccac caacaaatat aggggttaag 2941 gaaaaaaaac gttgagctac atgtgtaaga aggccctgca tgtgtatgag tcctattctg 3001 ggcaaataga ttcttaaagt ggctttcaac ttcaagatga aggagcttaa taatggttac 3061 tcattttatc aggggaattt cagggaacgt aggcgtcaaa gagccagtta tctttagcag 3121 atattaaaaa ttgaaaactt tggagaactc atttcaagtt atgattcagt gcattttcaa 3181 cattgatttt tgatagactg aagtgccaga tcaaaattgt tacccatttg aaagaatatt 3241 agttgtatat aaaattagat tagaaagact ttcctaaatc tctatctctt tatatatgtc 3301 ctattcattc acaatggatt atacaaaaaa aagtgtattg caagtgaaat aatattgatt 3361 tctgccctca gcttcaaata aagtaaattg aaatgggaac aatatcaata tggtgtcttg 3421 atatatttat aaatatgtga ttatcattta tttttaaaat aatttatcaa aaaacaagtc 3481 tttagtgttc aaatacttca aatcatatcc tcagatatat ttttagccca tggttttata 3541 taatctttaa gaactaattt taccactgtt ataggttcac cattaaatat aattggctaa 3601 taaaaatttt aaggttgact aaattaagaa gaaattattt aacattttaa tgtgccataa 3661 aagagtaaat gataaataat taaatgccac tatgtgttct attccggatg ttctagctag 3721 aagtcatttt aagattttga taaacaactt tggttgaaga aattccttaa gtattcaaca 3781 caaactttct aatatctttt gttagggtta taccagaata aaatgcttct ttacttccaa 3841 gctatgcaag ctcccagagg taatagagtg acacatgatt taacttatat gtaaggttta 3901 aaaaagtatt tatcattata aacatacata ccatttggga gcaggtttat taaccttgag 3961 agccaaaggt ttccttaggc cctgtaacat tcagaacctt tggtgtttca ggtggtatta 4021 tagctcaaat agtgacagga cagggaatgc gttccaaagg aatattggag caattttaac 4081 attgcagaaa cctgctctgg gtgtgtctct ctgtagagat aacctgatga ttattaaatg 4141 taaaattaag gcaactcatg aatattttta tttacaaagt gcttgaaact cagccaagga 4201 gagaaaacta agtactttta tataattcat cacttttctg gctacagcag gacagaatat 4261 gaccaccttc gtttgaaggc accaaatcgt cgcagtgtct ttgccataag ttgcagggtt 4321 aaatgcggaa atctctcctt gcgttcctgt ctggcgtatt ctgaagaaaa gaacagaatt 4381 cttgtgccta cctaagaatt tgagtagtgt ctaaacaaac aaagcagtta ggtcatttta 4441 actgacttga ttatccaact ggtctttgac agatttgact gtccatattt agtttatgtt 4501 tgtctgatca tccagtttgc tttatttggc tgtgtttaat ttggtggttg gtttggtttg 4561 gtaccagtgt actaaaacta gtcaaaatac ttgaattagt ttgtttgtgc aaagtgtaca 4621 accttagtaa agtgtccatg aagcaatagc catgaatgct aattatttct aaatagggcc 4681 acatggtttt aaactaatga tggtgaaaga aatacgatga ctg // LOCUS HSU75362 2738 bp mRNA PRI 06-NOV-1996 DEFINITION Human isopeptidase T-3 (ISOT-3) mRNA, complete cds. ACCESSION U75362 NID g1658462 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2738) AUTHORS Ansari-Lari,M.A., Timms,K., Morris,W., Brown,S.N. and Gibbs,R.A. TITLE The genomic organization of isopeptidase T-3 gene (ISOT-3), a new member of ubiquitin specific protease family (UBP) JOURNAL Unpublished REFERENCE 2 (bases 1 to 2738) AUTHORS Ansari-Lari,M.A., Timms,K., Morris,W., Brown,S.N. and Gibbs,R.A. TITLE Direct Submission JOURNAL Submitted (17-OCT-1996) Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..2738 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="3" gene 95..2686 /gene="ISOT-3" CDS 95..2686 /gene="ISOT-3" /note="deubiquitinating enzyme; ubiquitin carboxy terminal hydrolase" /codon_start=1 /product="isopeptidase T-3" /db_xref="PID:g1658463" /translation="MQRRGALFGMPGGSGGRKMAAGDIGELLVPHMPTIRVPRSGDRV YKNECAFSYDSPNSEGGLYVCMNTFLAFGREHVERHFRKTGQSVYMHLKRHAREKVRG ASGGALPKRRNSKIFLDLDTDDDLNSDDYEYEDEAKLVIFPDHYEIALPNIEELPALV TIACDAVLSSKSPYRKQDPDTWENELPVSKYANNLTQLDNGVRIPPSGWKCARCDLRE NLWLNLTDGSVLCGKWFFDSSGGNGHALEHYRDMGYPLAVKLGTITPDGADVYSFQEE EPVLDPHLAKHLAHFGIDMLHMHGTENGLQDNDIKLRVSEWEVIQESGTKLKPMYGPG YTGLKNLGNSCYLSSVMQAIFSIPEFQRAYVGNLPRIFDYSPLDPTQDFNTQMTKLGH GLLSGQYSKPPVKSELIEQVMKEEHKPQQNGISPRMFKAFVSKSHPEFSSNRQQDAQE FFLHLVNLVERNRIGSENPSDVFRFLVEERIQCCQTRKVRYTERVDYLMQLPVAMEAA TNKDELIAYELTRREAEANRRPLPELVRAKIPFSACLQAFSEPENVDDFWSSALQAKS AGVKTSRFASFPEYLVVQIKKFTFGLDWVPKKFDVSIDMPDLLDINHLRARGLQPGEE ELPDISPPIVIPDDSKDRLMNQLIDPSDIDESSVMQLAEMGFPLEACRKAVYFTGNMG AEVAFNWIIVHMEEPDFAEPLTMPGYGGAASAGASVFGASGLDNQPPEEIVAIITSMG FQRNQAIQALRATNNNLERALDWIFSHPEFEEDSDFVIEMENNANANIISEAKPEGPR VKDGSGTYELFAFISHMGTSTMSGHYICHIKKEGRWVIYNDHKVCASERPPKDLGYMY FYRRIPS" polyA_site 2738 BASE COUNT 757 a 647 c 702 g 632 t ORIGIN 1 cgcggaattc cgcggaattc cgcgccgccg ccgccggcag accccgcgct ccggctccgg 61 ctcggctcgc tcggctccgg tgcgcgccga ggccatgcag cgccggggcg ccctgttcgg 121 catgccgggc ggcagcggag gcaggaagat ggctgcagga gacatcggcg agctgctagt 181 gccccacatg cccacgatcc gcgtgcccag gtccggcgac agggtctaca agaacgagtg 241 cgccttctcc tacgactctc ccaattctga aggtggactc tatgtatgca tgaatacatt 301 tttggccttt ggaagggaac atgttgaaag acattttcga aaaactggac agagtgtata 361 catgcacctg aaaagacatg cgcgagagaa ggtaagaggg gcgtctggtg gagcgttacc 421 aaaaaggagg aattccaaga tttttttaga tctagatact gatgacgatt taaatagcga 481 cgattatgaa tatgaagatg aagccaaact tgttatattc ccagatcact atgaaatagc 541 actaccaaat attgaggagt taccagccct ggtaacaatt gcttgtgatg cagttctcag 601 ctcaaaatct ccatacagaa agcaggaccc agacacgtgg gaaaatgaat tgccagtatc 661 taaatatgcc aacaacctca cccagctgga caatggagtc aggattcctc caagtggttg 721 gaagtgtgcc agatgcgacc tgcgagaaaa cctctggttg aatctgactg acggctctgt 781 cctgtgtgga aagtggttct ttgacagctc tgggggcaac gggcatgcgc tggagcatta 841 cagagacatg ggctacccac tagccgtgaa actgggaacc atcactcctg acggggcaga 901 tgtttattct tttcaagaag aagaacctgt tttggatcct catttggcca agcacttagc 961 gcattttgga attgatatgc ttcatatgca tgggacagag aatgggctcc aggacaatga 1021 catcaagctg agggtcagtg agtgggaagt gatccaggag tcgggcacga aactgaagcc 1081 aatgtatggt cctggctaca cgggtctgaa gaacctgggc aacagctgct atctcagctc 1141 tgtcatgcag gccatcttca gcatcccaga attccagaga gcgtatgtag gaaaccttcc 1201 cagaatattt gactactcgc ctttagatcc aacacaagat ttcaacacac agatgactaa 1261 gttaggacat ggccttctct caggccagta ttcaaagcct ccggtgaaat ctgaactcat 1321 tgaacaggtg atgaaggagg agcacaagcc acagcagaac gggatctctc cgcgcatgtt 1381 taaggccttt gtaagcaaga gccacccgga attctcctct aacaggcagc aagatgccca 1441 ggaattcttc ttgcacctgg tgaatctagt agagaggaac cgcatcggct cagaaaaccc 1501 aagcgatgtt tttcgttttt tggtggaaga acgcattcag tgctgtcaga cccggaaagt 1561 ccgctacacg gagagggtgg attacctgat gcagttacct gtggccatgg aggcggcaac 1621 caacaaggat gaactgatcg cttatgaact aacgagaagg gaagcagaag caaacagaag 1681 accccttcct gagttggtac gtgccaagat accatttagt gcctgccttc aggccttctc 1741 tgaaccagaa aatgttgatg atttctggag cagtgcccta caagcaaagt ctgcgggtgt 1801 gaaaacatct cgctttgctt cattccctga atacttggta gtgcagataa agaagttcac 1861 ttttggtctt gactgggttc ccaaaaaatt tgatgtttct attgatatgc cagacctact 1921 tgatatcaac catctccgag ccagggggtt acagccagga gaggaagaac ttccagacat 1981 cagccccccc atagtcattc ctgatgactc aaaagatcgc ctgatgaacc aattgataga 2041 cccatcagac atcgatgagt catcagtgat gcagctggcc gagatgggtt tcccgctgga 2101 agcatgtcgc aaggctgtgt acttcactgg aaatatgggc gccgaggtgg ccttcaactg 2161 gatcattgtt cacatggaag agccagattt tgctgagccg ctgaccatgc ctggttatgg 2221 aggggcagct tctgctggag cctctgtttt tggtgcttct ggactggata accaacctcc 2281 agaggaaatc gtagctatca tcacctccat gggatttcag cgaaatcagg ctattcaggc 2341 actacgagca acgaataata acctggaaag agcactggat tggatcttta gccaccctga 2401 gtttgaagaa gacagtgatt ttgtgattga gatggagaat aatgccaatg caaacattat 2461 ttctgaggcc aagcccgaag gacctagagt caaggatgga tctggaacat atgagctatt 2521 tgcattcatc agtcacatgg gaacatccac aatgagtggt cattacattt gccatatcaa 2581 aaaggaagga agatgggtga tttacaatga ccacaaagtt tgtgcctcag aaaggccccc 2641 taaagacctg ggctacatgt acttttaccg caggatacca agctaaacct caaatataaa 2701 aattggcgaa aagaagccat acgccttttt aatttgcc // LOCUS HSU75370 3832 bp mRNA PRI 25-MAY-1997 DEFINITION Human mitochondrial RNA polymerase mRNA, nuclear gene encoding mitochondrial protein, complete cds. ACCESSION U75370 NID g2114395 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3832) AUTHORS Tiranti,V., Savoia,A., Forti,F., D'Apolito,M.F., Centra,M., Rocchi,M. and Zeviani,M. TITLE Identification of the gene encoding the human mitochondrial RNA polymerase (h-mtRPOL) by cyberscreening of the Expressed Sequence Tags database JOURNAL Hum. Mol. Genet. 6 (4), 615-625 (1997) MEDLINE 97252399 REFERENCE 2 (bases 1 to 3832) AUTHORS Tiranti,V., D'Apolito,M.F., Forti,F., Rocchi,M., Savoia,A. and Zeviani,M. TITLE Direct Submission JOURNAL Submitted (18-OCT-1996) Molecular Medicine, Children's Hospital Bambino Gesu'-IRCCS, Piazza S. Onofrio 4, Rome, RM 00165, Italy FEATURES Location/Qualifiers source 1..3832 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /chromosome="19" /map="19p13.3" CDS 33..3725 /codon_start=1 /product="mitochondrial RNA polymerase" /db_xref="PID:g2114396" /translation="MSALCWGRGAAGLKRALRPCGRPGLPGKEGTAGGVCGPRRSSSA SPQEQDQDRRKDWGHVELLEVLQARVRQLQAESVSEVVVNRVDVARLPECGSGDGSLQ PPRKVQMGAKDATPVPCGRWAKILEKDKRTQQMRMQRLKAKLQMPFQSGEFKALTRRL QVEPRLLSKQMAGCLEDCTRQAPESPWEEQLARLLQEAPGKLSLDVEQAPSGQHSQAQ LSGQQQRLLAFFKCCLLTDQLPLAHHLLVVHHGQRQKRKLLTLDMYNAVMLGWARQGA FKELVYVLFMVKDAGLTPDLLSYAAALQCMGRQDQDAGTIERCLEQMSQEGLKLQALF TAVLLSEEDRATVLKAVHKVKPTFSLPPQLPPPVNTSKLLRDVYAKDGRVSYPKLHLP LKTLQCFFEKQLHMELASRVCVVSVEKPTLPSKEVKHARKTLKTLRDQWEKALCRALR ETKNRLEREVYEGRFSLYPFLCLLDEREVVRMLLQVLQALPAQGESFTTLARELSART FSRHVVQRQRVSGQVQALQNHYRKYLCLLASDAEVPEPCLPRQYWEELGAPEALREQP WPLPVQMELGKLLAEMLVQATQMPCSLDKPHRSSRLVPVLYHVYSFRNVQQIGILKPH PAYVQLLEKAAEPTLTFEAVDVPMLCPPLPWTSPHSGAFLLSPTKLMRTVEGATQHQE LLETCPPTALHGALDALTQLGNCAWRVNGRVLDLVLQLFQAKGCPQLGVPAPPSEAPQ PPEAHLPHSAAPARKAELRRELAHCQKVAREMHSLRAEALYRLSLAQHLRDRVFWLPH NMDFRGRTYPCPPHFNHLGSDVARALLEFAQGRPLGPHGLDWLKIHLVNLTGLKKREP LRKRLAFAEEVMDDILDSADQPLTGRKWWMGAEEPWQTLACCMEVANAVRASDPAAYV SHLPVHQDGSCNGLQHYAALGRDSVGAASVNLEPSDVPQDVYSGVAAQVEVFRRQDAQ RGMRVAQVLESFITRKVVKQTVMTVVYGVTRYGGRLQIEKRLRELSDFPQEFVWEASH YLVRQVFKSLQEMFSGTRAIQHWLTESARLISHMGSVVEWVTPLGVPVIQPYRLDSKV KQIGGGIQSITYTHNGDISRKPNTRKQKNGFPPNFIHSLDSSHMMLTALHCYRKGLTF VSVHDCYWTHAADVSVMNQVCREQFVRLHSEPILQDLSRFLVKRFCSEPQKILEASQL KETLQAVPKPGAFDLEQVKRSTYFFS" BASE COUNT 725 a 1235 c 1259 g 613 t ORIGIN 1 cggcggcggc ggcgcctgga gcggcgtgcg taatgtcggc actttgctgg ggccgcggag 61 cggcggggct caaacgagcc ctacggcctt gcggccgccc gggactcccc ggcaaagaag 121 ggaccgccgg tggcgtctgc ggccccagga ggagctcgtc cgccagcccc caggagcaag 181 accaagaccg caggaaggac tggggccacg tggagctgct ggaggtgctc caggcgcggg 241 tgcggcagct gcaggctgag agcgtgtcgg aggtggtggt gaacagggtg gatgtggcgc 301 ggctcccaga atgtggcagt ggagatggta gcctccagcc acccaggaag gtccagatgg 361 gggccaagga tgccaccccg gtgccctgtg gccgctgggc aaagatactg gagaaggata 421 agcggaccca gcagatgcgt atgcagcggt tgaaggcgaa gctgcagatg ccattccaga 481 gcggggagtt caaggcgctg accaggcgcc tgcaggtgga gccccggctc ctgagcaagc 541 agatggccgg gtgcctggag gactgcacgc gccaggcccc cgagagcccc tgggaggagc 601 agctggcccg gctgctgcag gaggcccctg ggaagctgag cctcgatgtg gagcaggccc 661 cgtcggggca gcactcgcag gcccagctct caggtcagca gcagaggctc ctggccttct 721 tcaagtgctg cctgctcact gaccagctgc ccctcgccca ccacctgctg gtcgtccacc 781 acggccagcg gcagaagcgg aagctgctca cgctggacat gtacaacgcc gtgatgcttg 841 gctgggcgcg gcagggtgct ttcaaggagc tggtatatgt gttattcatg gtgaaggatg 901 ccggcttgac cccggacctg ctgtcctatg cggctgccct ccagtgcatg gggaggcagg 961 accaggacgc cgggaccatc gaaaggtgtc tggaacagat gagccaggag gggctgaagc 1021 tgcaggcact cttcaccgcc gttctgctgt ctgaggagga tcgggccact gttctgaagg 1081 ccgtgcacaa ggtgaagccc accttcagcc tcccgccgca gctgccgccc ccggtcaaca 1141 cctccaagct gctcagggac gtgtatgcca aggatgggcg tgtgtcctac ccgaagctgc 1201 acctgccctt gaagaccctg cagtgcttct ttgagaagca gctccacatg gagctggcca 1261 gcagggtgtg cgtggtgtcc gtggagaagc ccacgttgcc aagcaaggag gtcaagcacg 1321 cgcggaagac cctgaagacc ctgcgggacc aatgggagaa agcactgtgc cgggcgctgc 1381 gggagaccaa gaaccgccta gagcgcgagg tgtacgaggg ccggttctca ctttacccct 1441 tcctgtgcct gctggacgag cgcgaggtgg tgcggatgct cctgcaggtc ctgcaggcgc 1501 tgcccgccca aggtgagtcc ttcaccaccc tggcccggga gctgagtgcg cgcactttca 1561 gccggcacgt ggtgcagagg cagcgggtca gtggccaggt gcaggcgctg cagaaccact 1621 acaggaagta cctctgcttg ctggcctccg acgccgaggt gcccgagccc tgcctgccgc 1681 ggcagtactg ggaggagctg ggggcgcccg aggccctgcg ggagcagccc tggcccctgc 1741 cagtgcagat ggagctgggc aagctgctgg cggagatgct ggtgcaggct acgcagatgc 1801 catgcagcct ggacaagccg catcgttcct ctcggcttgt ccccgtgctc taccacgtgt 1861 attccttccg caacgtccag caaatcggca tcctgaagcc gcacccggcc tacgtgcagc 1921 tgctggagaa ggccgcggaa cccacgctga ccttcgaggc ggtggatgta cccatgcttt 1981 gccccccgct gccctggaca tcgccgcact ctggtgcttt cctgctcagc cccaccaagc 2041 tgatgcgcac ggtggaaggc gccacgcaac accaggagct gctggaaacc tgcccaccca 2101 ccgcgctgca tggcgcactg gacgccctca cccaactggg caactgcgcc tggcgcgtca 2161 acgggcgcgt gctggacctg gtgctgcagc tcttccaggc caagggctgc ccccagctag 2221 gcgtgccggc cccgccctcc gaggcgcccc agccgcccga ggcccacctg ccgcacagcg 2281 ccgcgcccgc ccgcaaggcc gagctgcgcc gtgagctggc gcactgccag aaggtggccc 2341 gggagatgca cagcctgcgg gcggaggcgc tgtaccgcct ctcgctggcg cagcacctgc 2401 gggaccgcgt cttctggctg ccgcacaaca tggacttccg cggccgcacc tacccctgcc 2461 cgccgcactt caaccacctg ggcagcgacg tggcgcgggc cctgctggag ttcgcccagg 2521 gccgcccgct cggcccgcac ggcctggatt ggctcaagat ccacctggtc aatctcacgg 2581 ggttgaagaa gcgggagccg ctgcggaagc gcctggcctt tgcggaggag gtgatggatg 2641 acatcctgga ctccgcggac caacccttga cgggccgaaa gtggtggatg ggcgcggagg 2701 aaccctggca gacgctggcc tgctgtatgg aggtggcgaa cgctgtgcgc gcctccgacc 2761 ctgccgccta tgtctcccac ctccccgtcc atcaggacgg ctcttgcaac ggcctgcagc 2821 attatgctgc tctgggccgc gacagcgtgg gcgccgcctc cgtcaacctg gagccctcgg 2881 atgtgccgca ggacgtgtac agcggcgtgg ccgcgcaggt ggaggtgttc cgtaggcagg 2941 acgcccagcg gggcatgcgg gtggcacagg tgctggaaag tttcatcacc cgcaaggtgg 3001 tgaagcagac ggtgatgacg gtggtgtacg gggtcacgcg ctatggcggg cgcctgcaga 3061 ttgagaagcg cctccgggag ctgagcgact ttccccagga gttcgtgtgg gaggcctctc 3121 actatctcgt acgccaggtc ttcaagagtc tacaggagat gttctcgggg acccgggcca 3181 tccagcactg gctgaccgag agtgcccgcc tcatctccca catgggctct gtggtggagt 3241 gggtcacacc cctgggcgtc cccgtcatcc agccgtatcg cctggactcc aaggtcaagc 3301 aaataggagg tggaattcag agcatcacct acacccacaa cggagacatc agccgaaagc 3361 ccaacacacg taagcagaag aacggcttcc cgcccaactt catccactcg ctggactcct 3421 cccacatgat gctcaccgcc ctgcactgct acaggaaggg cctgaccttc gtctctgtgc 3481 acgactgtta ctggactcac gcagctgatg tctccgtcat gaaccaggtg tgccgggagc 3541 agtttgtccg cttgcacagc gagcccatcc tgcaggacct gtccagattc ctggtcaagc 3601 ggttctgctc tgagccccag aagatcttgg aggccagcca gctgaaggag acactgcagg 3661 cggtgcccaa gccaggggcc ttcgacctgg agcaggtgaa gcgttccacc tacttcttca 3721 gctgacaccc cgtgagcctt gtcagtgtgt aaataaagct cttttgccac cccccaaaaa 3781 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa // LOCUS HSU76010 2000 bp mRNA PRI 02-JAN-1997 DEFINITION Human putative zinc transporter ZnT-3 (ZnT-3) mRNA, complete cds. ACCESSION U76010 NID g1763375 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2000) AUTHORS Palmiter,R.D., Cole,T.B., Quaife,C.J. and Findley,S.D. TITLE ZnT-3, a putative transporter of zinc into synaptic vesicles JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1996) In press REFERENCE 2 (bases 1 to 2000) AUTHORS Palmiter,R.D., Cole,T.B., Quaife,C.J. and Findley,S.D. TITLE Direct Submission JOURNAL Submitted (23-OCT-1996) Howard Hughes Medical Institute, University of Washington, Box 357370, Seattle, WA 98195, USA FEATURES Location/Qualifiers source 1..2000 /organism="Homo sapiens" /db_xref="taxon:9606" gene 84..1475 /gene="ZnT-3" CDS 84..1250 /gene="ZnT-3" /function="zinc transporter" /codon_start=1 /product="ZnT-3" /db_xref="PID:g1763376" /translation="MEPSPAAGGLETTRLVSPRDRGGAGGSLRLKSLFTEPSEPLPEE SKPVEMPFHHCHRDPLPPPGLTPERLHARRQLYAACAVCFVFMAGEVVGGYLAHSLAI MTDAAHLLADVGSMMGSLFSLWLSTRPATRTMTFGWHRSETLGALASVVSLWMVTGIL LYLAFVRLLHSDYHIEGGAMLLTASIAVCANLLMAFVLHQAGPPHSHGSRGAEYAPLE EGPEQPLPLGNTSVRAAFVHVLGDLLQSFGVLAASILIYFKPQYKAADPISTFLFSIC ALGSTAPTLRDVLRILMEGTPRNVGFEPVRDTLLSVPGVRATHELHLWALTLTYHVAS AHLAIDSTADPEAVLAEASSRLYSRFGFSSCTLQVEQYQPEMAQCLRCQEPPQA" BASE COUNT 301 a 659 c 590 g 450 t ORIGIN 1 cggcgggctg ctcggacttg gcgcggggcc ggcccggcct ctctcttcct cggtggggcc 61 tagacggtcg gggcaccggg aacatggagc cctctccagc cgctgggggc ttggagacca 121 ctcgcctggt gagcccccgg gaccgcggtg gcgccggagg cagcctgcgt ttgaagagtc 181 tcttcacaga gccctcagag cccctccctg aggagtccaa acctgtggag atgcccttcc 241 accactgcca cagggacccc cttccgccgc cgggccttac ccctgagagg ctgcatgcac 301 ggaggcagct atatgctgcc tgtgccgttt gctttgtctt catggctggg gaggtggtcg 361 gcgggtatct ggcacacagc ctggccatca tgaccgatgc agcccacttg ctggcggatg 421 tgggcagcat gatgggcagc ctcttctccc tctggctctc cacccgtcca gccacccgca 481 ccatgacctt tggctggcac cgttcagaga ctctgggggc tttggcctct gtggtctccc 541 tctggatggt cactggcatc ctcctgtacc tggccttcgt ccgcctgctg cacagcgact 601 accacatcga ggggggtgcc atgctgctga ccgccagcat cgcagtctgt gccaacctgt 661 taatggcctt tgtgctgcac caggctgggc ccccccacag ccacgggtct aggggagcag 721 agtatgcacc gctggaggag gggcctgaac agcccctgcc cctggggaac accagcgtcc 781 gggcggcatt tgtgcacgtg ctgggggacc tcctgcagag ctttggggta ctggctgcct 841 ccatcctcat ctacttcaag cctcaataca aggcagccga ccccatcagc accttcctct 901 tctccatctg tgcccttgga tccaccgctc ccaccctccg agacgttctt cgaatcctca 961 tggaaggtac cccccgcaat gtggggttcg aacctgtgcg ggatacgctg ttgtcggtgc 1021 caggagtccg ggcaacccat gagctgcacc tgtgggccct tacgctcact taccatgttg 1081 cctctgcaca cctggccatc gactccaccg ctgaccctga agccgtcctg gctgaagcct 1141 catcccggct ctactcccgg tttggattct ccagctgcac cctgcaggtc gagcagtatc 1201 agccggagat ggcccagtgc ctgcgctgcc aggaaccccc ccaagcctga gccatggccc 1261 tgccctcacc ccactgccag gccgaggctc agccccagac tctcagcatc tgctgccctg 1321 atcacagaga cgggaccgag ccaggtcata ccccttccct ctctcccctc cctaccacct 1381 gccagtttcc ccagcctcag ccccagcccc agccccagtg ggcaagacca aagtgtggcg 1441 gggagtgggg tgggagtcag gggaatagat gtgactagtt caggggcggg gactcccagg 1501 cctcagtgtg gcagggtgtg ttgaaggcct gtggtgccat ctccccatgg ttcatgtgga 1561 gccacgaaca tcctttccct gcagtccatt tgtctgtgtg gcaggctggc tggctggggg 1621 catctgcctg tctatgtgct gttggtgtgc ctatgcctgg gggaggtcag taggggcccc 1681 ctccccacat ggccctcgct ctgtctatgc aggggcccca aagcccgcac tttgtccgtg 1741 tgtcttagcc ctgtggtttt gtctgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 1801 gttcttggtg ctgtggcctg tgtgtctctg tgcctatgtg gctgtgctat ggtttctatg 1861 agtctgctcc atccatgtgt ctgtttgggg gtctatctct ccatccctct gttggtgctg 1921 tgcccttggc tatccctgaa agagggagga ctccgctgca gctccaccaa taaagttgtg 1981 tctcactgca aaaaaaaaaa // LOCUS HSU76362 1996 bp mRNA PRI 10-MAY-1997 DEFINITION Human retinal glutamate transporter EAAT5 mRNA, complete cds. ACCESSION U76362 NID g2076761 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1996) AUTHORS Arriza,J.L., Eliasof,S., Kavanaugh,M.P. and Amara,S.G. TITLE Excitatory amino acid transporter 5, a retinal glutamate transporter coupled to a chloride conductance JOURNAL Proc. Natl. Acad. Sci USA 94, 4155-4160 (1997) REFERENCE 2 (bases 1 to 1996) AUTHORS Arriza,J.L., Eliasof,S., Kavanaugh,M.P. and Amara,S.G. TITLE Direct Submission JOURNAL Submitted (25-OCT-1996) Vollum Institute, Oregon Health Sciences University, 3181 S.W. Sam Jackson Park Road, Portland, OR 97201, USA FEATURES Location/Qualifiers source 1..1996 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="retina" CDS 189..1871 /note="excitatory amino acid transporter 5; tSXV motif carboxyl terminus may interact with PDZ domains" /codon_start=1 /product="retinal glutamate transporter EAAT5" /db_xref="PID:g2076762" /translation="MVPHTILARGRDVCRRNGLLILSVLSVIVGCLLGFFLRTRRLSP QEISYFQFPGELLMRMLKMMILPLVFSSLMSGLASLDAKTSSRLGVLTVAYYLWTTFM AVIVGIFMVSIIHPGSAAQKETTEQSGKPIMSSADALLDLIRNMFPANLVEATFKQYR TKTTPVVKSPKVAPEEAPPRRILIYGVQEENGSHVQNFALDLTPPPEVVYKSEPGTSD GMNVLGIVFFSATMGIMLGRMGDSGGPLVSFCQCLNESVMKIVAVAVWYFPFGIVFLI AGKILEMDDPRAVGKKLGFYSVTVVCGLVLHGLFILPLLYFFITKKNPIVFIRGILQA LLIALATSSSSATLPITFKCLLENNHIDRRIARFVLPVGATINMDGTALYEAVAAIFI AQVNNYELDFGQIITISITGTAASIGAAGIPQAGLVTMVIVLTSVGLPTDDITLIIGV DWALDRFRTMINVLGDALAAGIMAHICRKDFARDTGTEKLLPCETKPVSLQEIVAAQQ NGCVKSVAEASELTLGPTCPHHVPVQVERDEELPAASLNHCTIQISELETNV" BASE COUNT 370 a 652 c 575 g 399 t ORIGIN 1 gaattccgcg tgtggccgcc ttagagggaa gccacacggg catggccgtg gggctggcga 61 ctggtgttta gcaactccga ccacctgcct gctgaggggc tagagccctc agcccagacc 121 ctgtgccccc ggccgggctc tcatgcgtgg aatggtgctg tgccccttgc cagcaggcca 181 ggctcaccat ggtgccgcat accatcttgg cacgggggag ggacgtgtgc aggcggaatg 241 gactcctcat cctgtctgtg ctgtctgtca tcgtgggctg cctcctcggc ttcttcttga 301 ggacccggcg cctctcacca caggaaatta gttacttcca gttccccgga gagctcctga 361 tgaggatgct gaagatgatg atcctgccac tggtgttctc cagcttgatg tccggacttg 421 cctccctgga tgccaagacc tctagccgcc tgggcgtcct caccgtggcg tactacctgt 481 ggaccacctt catggctgtc atcgtgggca tcttcatggt ctccatcatc cacccaggca 541 gcgcggccca gaaggagacc acggagcaga gtgggaagcc catcatgagc tcagccgatg 601 ccctgttgga cctcatccgg aacatgttcc cagccaacct agtagaagcc acattcaaac 661 agtaccgcac caagaccacc ccagttgtca agtcccccaa ggtggcacca gaggaggccc 721 ctcctcggcg gatcctcatc tacggggtcc aggaggagaa tggctcccat gtgcagaact 781 tcgccctgga cctgaccccg ccgcccgagg tcgtttacaa gtcagagccg ggcaccagcg 841 atggcatgaa tgtgctgggc atcgtcttct tctctgccac catgggcatc atgctgggcc 901 gcatgggtga cagcgggggc cccctggtca gcttctgcca gtgcctcaat gagtcggtca 961 tgaagatcgt ggcggtggct gtgtggtatt tccccttcgg cattgtgttc ctcattgcgg 1021 gtaagatcct ggagatggac gaccccaggg ccgtcggcaa gaagctgggc ttctactcag 1081 tcaccgtggt gtgcgggctg gtgctccacg ggctctttat cctgcccctg ctctacttct 1141 tcatcaccaa gaagaatccc atcgtcttca tccgcggcat cctgcaggct ctgctcatcg 1201 cgctggccac ctcctccagc tcagccacac tgcccatcac cttcaagtgc ctgctggaga 1261 acaaccacat cgaccggcgc atcgctcgct tcgtgctgcc cgtgggtgcc accatcaaca 1321 tggacggcac tgcgctctac gaggctgtgg ccgccatctt catcgcccag gtcaacaact 1381 acgagctgga ctttggccag atcatcacca tcagtatcac aggcactgca gccagcattg 1441 gggcagctgg catcccccag gccggcctcg tcaccatggt catcgtgctc acctccgtgg 1501 gactgcccac cgatgacatc accctcatca ttggcgttga ctgggctctg gaccgtttcc 1561 gcaccatgat taacgtgctg ggtgatgcgc tggcagcggg gatcatggcc catatatgtc 1621 ggaaggattt tgcccgggac acaggcaccg agaaactgct gccctgcgag accaagccag 1681 tgagcctcca ggagatcgtg gcagcccagc agaatggctg tgtgaagagt gtagccgagg 1741 cctccgagct caccctgggc cccacctgcc cccaccacgt ccccgttcaa gtggagcggg 1801 atgaggagct gcccgctgcg agtctgaacc actgcaccat ccagatcagc gagctggaga 1861 ccaatgtctg agcctgcgga gctgcagggg caggcgaggc ctccaggggc agggtcctga 1921 ggcaggaact cgactctcca accctcctga gcagccggta gggggcagga tcacacattc 1981 ttctcaccct tgagag // LOCUS HSU76368 2185 bp mRNA PRI 12-JUL-1997 DEFINITION Human cationic amino acid transporter-2A (ATRC2) mRNA, complete cds. ACCESSION U76368 NID g2252785 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2185) AUTHORS Closs,E.I., Graef,P., Habermeier,A., Cunningham,J.M. and Foerstermann,U. TITLE Human cationic amino acid transporters hCAT-1, hCAT-2A and hCAT-2B: three related carriers with distinct transport properties JOURNAL J. Biochem. 36, 6462-6468 (1997) REFERENCE 2 (bases 1 to 2185) AUTHORS Closs,E.I. and Cunningham,J.M. TITLE Direct Submission JOURNAL Submitted (25-OCT-1996) Pharmacology, Johannes Gutenberg University, Obere Zahlbacher Strasse 67, Mainz 55101, Germany FEATURES Location/Qualifiers source 1..2185 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" gene 195..2168 /gene="ATRC2" CDS 195..2168 /gene="ATRC2" /note="low affinity carrier for cationic amino acids; cationic amino acid transporter-2A" /codon_start=1 /product="hCAT-2A" /db_xref="PID:g2252786" /translation="MIPCRAALTFARCLIRRKIVTLDSLEDTKLCRCLSTMDLIALGV GSTLGAGVYVLAGEVAKADSGPSIVVSFLIAALASVMAGLCYAEFGARVPKTGSAYLY TYVTVGELWAFITGWNLILSYVIGTSSVARAWSGTFDELLSKQIGQFLRTYFRMNYTG LAEYPDFFAVCLILLLAGLLSFGVKESAWVNKVFTAVNILVLLFVMVAGFVKGNVANW KISEEFLKNISASAREPPSENGTSIYGAGGFMPYGFTGTLAGAATCFYAFVGFDCIAT TGEEVRNPQKAIPIGIVTSLLVCFMAYFGVSAALTLMMPYYLLDEKSPLPVAFEYVGW GPAKYVVAAGSLCALSTSLLGSMFPLPRILFAMARDGLLFRFLARVSKRQSPVAATLT AGVISALMAFLFDLKALVDMMSIGTLMAYSLVAACVLILRYQPGLSYDQPKCSPEKDG LGSSPRVTSKSESQVTMLQRQGFSMRTLFCPSLLPTQQSASLVSFLVGFLAFLVLGLS VLTTYGVHAITRLEAWSLALLTLFLVLFVAIVLTIWRQPQNQQKVAFMVPFLPFLPAF SILVNIYLMVQLSADTWVRFSIWMAIGFLIYFSYGIRHSLEGHLRDENNEEDAYPDNV HAAAEEKSAIQANDHHPRNLSSPFIFHEKTSEF" BASE COUNT 491 a 524 c 529 g 640 t 1 others ORIGIN 1 gaattccggc tctcaaattt tctatagaat caagatagaa cctttagatg tctcaccacg 61 aaactagcaa ctggaatgaa gatagaaaca agtggttata actcagacaa actaatttgt 121 cgagggttta ttggaacacc tgccccaccg gtttgcgaca naagtttctc ctgtcgcctt 181 cgtcagacgt cagaatgatt ccttgcagag ccgcgctgac ctttgcccga tgtctgatcc 241 ggagaaaaat cgtgaccctg gacagtctag aagacaccaa attatgccgc tgcttatcca 301 ccatggacct cattgccctg ggcgttggaa gcacccttgg ggccggggtt tatgtcctcg 361 ctggggaggt ggccaaggca gactcgggcc ccagcatcgt ggtgtccttc ctcattgctg 421 ccctggcttc agtgatggct ggcctctgct atgccgaatt tggggcccgt gttcccaaga 481 cggggtctgc atatttgtac acctacgtga ctgtcggaga gctgtgggcc ttcatcactg 541 gctggaatct cattttatcg tatgtgatag gtacatcaag tgttgcaaga gcctggagtg 601 gcacctttga tgaacttctt agcaaacaga ttggtcagtt tttgaggaca tacttcagaa 661 tgaattacac tggtcttgca gaatatcccg atttttttgc tgtgtgcctt atattacttc 721 tagcaggtct tttgtctttt ggagtaaaag agtctgcttg ggtgaataaa gtcttcacag 781 ctgttaatat tctcgtcctt ctgtttgtga tggttgctgg gtttgtgaaa ggaaatgtgg 841 caaactggaa gattagtgaa gagtttctca aaaatatatc agcaagtgcc agagagccac 901 cttctgaaaa cggaacaagt atctatgggg ctggtggctt tatgccttat ggctttacgg 961 gaacgttggc tggtgctgca acttgctttt atgcctttgt gggatttgac tgcattgcaa 1021 caactggtga agaagttcgg aatccccaga aagctattcc cattggaatt gtgacgtctt 1081 tgcttgtttg ctttatggcc tattttgggg tctctgcagc tttaacactt atgatgccgt 1141 actacctcct cgatgaaaaa agcccccttc ctgtagcgtt tgaatatgtg ggatggggtc 1201 ctgccaaata tgtcgtcgca gctggttctc tctgcgcctt gtcaacaagt cttctgggct 1261 ctatgtttcc tttaccccga attctgtttg ccatggcccg ggatggctta ctgtttagat 1321 ttcttgccag agtgagtaag aggcagtcac cagttgctgc cacgttgact gcaggggtca 1381 tttctgcttt gatggccttt ctgtttgacc tgaaggcgct tgtggacatg atgtccattg 1441 gcacactcat ggcctactct ctggtggcag cctgtgttct catcctcagg taccagcctg 1501 gcttatctta cgaccagccc aaatgttctc ctgagaaaga tggtctggga tcgtctccca 1561 gggtaacctc gaagagtgag tcccaggtca ccatgctgca gagacagggc ttcagcatgc 1621 ggaccctctt ctgcccctcc cttctgccaa cacagcagtc agcttctctc gtgagctttc 1681 tggtaggatt cctagctttc ctcgtgttgg gcctgagtgt cttgaccact tacggagttc 1741 atgccatcac caggctggag gcctggagcc tcgctctcct cacgctgttt cttgttctct 1801 tcgttgccat cgttctcacc atctggaggc agccccagaa tcagcaaaaa gtagccttca 1861 tggttccatt cttaccattt ttgccagcgt tcagcatctt ggtgaacatt tacttgatgg 1921 tccagttaag tgcagacact tgggtcagat tcagcatttg gatggcaatt ggcttcctga 1981 tttacttttc ttatggcatt agacacagcc tggagggtca tctgagagat gaaaacaatg 2041 aagaagatgc ttatccagac aacgttcatg cagcagcaga agaaaaatct gccattcaag 2101 caaatgacca tcacccaaga aatctcagtt cacctttcat attccatgaa aagacaagtg 2161 aattctaaca cttgcaggag cagat // LOCUS HSU76376 716 bp mRNA PRI 03-APR-1997 DEFINITION Human activator of apoptosis Hrk (hrk) mRNA, complete cds. ACCESSION U76376 NID g1923234 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 716) AUTHORS Inohara,N., Ding,L., Chen,S. and Nunez,G. TITLE harakiri, a novel regulator of cell death, encodes a protein that activates apoptosis and interacts selectively with survival-promoting proteins Bcl-2 and Bcl-X(L) JOURNAL EMBO J. 16 (7), 1686-1694 (1997) MEDLINE 97277020 REFERENCE 2 (bases 1 to 716) AUTHORS Inohara,N. TITLE Direct Submission JOURNAL Submitted (26-OCT-1996) Department of Pathology and Comprehensive Cancer Center, The University of Michigan Medical School, 1150 W.Medical Center Dr., Ann Arbor, MI 48109, USA FEATURES Location/Qualifiers source 1..716 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="9 week embryo" gene 121..396 /gene="hrk" CDS 121..396 /gene="hrk" /note="harakiri; an activator of apoptosis, interacting with Bcl-2 and Bcl-xL but not Bax, Bcl-xS and Bak; a novel BH3 protein of Bcl-2 family" /codon_start=1 /product="activator of apoptosis Hrk" /db_xref="PID:g1923235" /translation="MCPCPLHRGRGPPAVCACSAGRLGLRSSAAQLTAARLKALGDEL HQRTMWRRRARSRRAPAPGALPTYWPWLCAAAQVAALAAWLLGRRNL" misc_signal 651..655 /note="AU-rich mRNA stablization signal" BASE COUNT 151 a 208 c 263 g 94 t ORIGIN 1 gaaacttggt gtccagggga ggcccccggc ggctggagcg cggcggcagc gggcgcagag 61 gccggaggga gaggaggcga ggggcggccc gagcgcgggg cgggagcgag gccagcggtc 121 atgtgcccgt gccccctgca ccgcggccgc ggccccccgg ccgtgtgcgc ctgcagcgcg 181 ggtcgcctgg ggctgcgctc gtccgccgcg cagctcaccg ccgcccggct caaggcgcta 241 ggcgacgagc tgcaccagcg caccatgtgg cggcgccgcg cgcggagccg gagggcgccg 301 gcgcccggcg cgctccccac ctactggcct tggctgtgcg cggccgcgca ggtggcggcg 361 ctggcggcct ggctgctcgg caggcggaac ttgtaggaac gcggggcttc ttggtggggc 421 cggagccgag acccagccgg agcgagcaac aggttggtga aaaccctgtg tccttggaga 481 aagctggttc ccgttttcca gagggggagc ccagagcttg aaaggccgcg gttggcactt 541 cgagaaggaa gtggagagta aagacagcgc ctggagcgat cgtagaaaca cagaatggga 601 ctggggaagc cctttggaaa tccagctgca gaaacagaca ccccaatgct atttacatac 661 agctctatat atataaaaaa agaaaatatg aatattaaaa aaaaaaaaaa aaaaaa // LOCUS HSU76388 1895 bp mRNA PRI 29-APR-1997 DEFINITION Human steroidogenic factor 1 mRNA, complete cds. ACCESSION U76388 NID g2052387 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1895) AUTHORS de Santa Barbara,P., Desclozeaux,M., Boizet,B., Bonneaud,N., Laudet,V., Poulat,F. and Berta,P. TITLE Cloning and Characterization of the Human Steroidogenic Factor 1 (SF-1) cDNA JOURNAL Unpublished REFERENCE 2 (bases 1 to 1895) AUTHORS de Santa Barbara,P. TITLE Direct Submission JOURNAL Submitted (28-OCT-1996) Centre de Recherche de Biochimie, Macromoleculaire, Campus CNRS, 1919 route de Mende, Montpellier, 34033, France FEATURES Location/Qualifiers source 1..1895 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" CDS 16..1401 /note="SF-1" /codon_start=1 /product="steroidogenic factor 1" /db_xref="PID:g2052388" /translation="MDYSYDEDLDELCPVCGDKVSGYHYGLLTCESCKGFFKRTVQNN KHYTCTESQSCKIDKTQRKRCPFCRFQKCLTVGMRLEAVRADRMRGGRNKFGPMYKRD RALKQQKKAQIRANGFKLETGPPMGVPPPPPPAPDYVLPPSLHGPEPKGLAAGPPAGP LGDFGAPALPMAVPGAHGPLAGYLYPAFPGRAIKSEYPEPYASPPQPGLPYGYPEPFS GGPNVPELILQLLQLEPDEDQVRARILGCLQEPTKSRPDQPAAFGLLCRMADQTFISI VDWARRCMVFKELEVADQMTLLQNCWSELLVFDHIYRQVQHGKEGSILLVTGQEVELT TVATQAGSLLHSLVLRAQELVLQLLALQLDRQEFVCLKFIILFSLDLKFLNNHILVKD AQEKANAALLDYTLCHYPHCGDKFQQLLLCLVEVRALSMQAKEYLYHKHLGNEMPRNN LLIEMLQAKQT" BASE COUNT 340 a 609 c 624 g 322 t ORIGIN 1 gcggacgccg cgggcatgga ctattcgtac gacgaggacc tggacgagct gtgccccgtg 61 tgcggggaca aggtgtccgg ctaccactac ggactgctca cgtgtgagag ctgcaagggc 121 ttcttcaagc gcacggtgca gaacaacaag cactacacgt gcaccgagag ccagagctgc 181 aagatcgaca agacgcagcg caagcgctgt cccttctgcc gcttccagaa atgcctgacg 241 gtggggatgc gcctggaagc cgtgcgcgct gaccgtatga ggggtggccg gaacaagttt 301 gggccgatgt acaagcggga ccgggccctg aaacagcaga agaaggcaca gattcgggcc 361 aatggcttca agctggagac agggcccccg atgggggtgc ccccgccgcc ccctcccgca 421 ccggactacg tgctgcctcc cagcctgcat gggcctgagc ccaagggcct ggccgccggt 481 ccacctgctg ggccactggg cgactttggg gccccagcac tgcccatggc cgtgcccggt 541 gcccacgggc cactggctgg ctacctctac cctgcctttc ctggccgtgc catcaagtct 601 gagtacccgg agccttatgc cagcccccca cagcctgggc tgccgtacgg ctacccagag 661 cccttctctg gaggccccaa cgtgcctgag ctcatcctgc agctgctgca gctggagccg 721 gatgaggacc aggtgcgggc ccgcatcttg ggctgcctgc aggagcccac caaaagccgc 781 cccgaccagc cggcggcctt cggcctcctg tgcagaatgg ccgaccagac cttcatctcc 841 atcgtggact gggcacgcag gtgcatggtc ttcaaggagc tggaggtggc cgaccagatg 901 acgctgctgc agaactgctg gagcgagctg ctggtgttcg accacatcta ccgccaggtc 961 cagcacggca aggagggcag catcctgctg gtcaccgggc aggaggtgga gctgaccaca 1021 gtggccaccc aggcgggctc gctgctgcac agcctggtgt tgcgggcgca ggagctggtg 1081 ctgcagctgc ttgcgctgca gctggaccgg caggagtttg tctgcctcaa gttcatcatc 1141 ctcttcagcc tggatttgaa gttcctgaat aaccacatcc tggtgaaaga cgctcaggag 1201 aaggccaacg ccgccctgct tgactacacc ctgtgccact acccgcactg cggggacaaa 1261 ttccagcagc tgctgctgtg cctggtggag gtgcgggccc tgagcatgca ggccaaggag 1321 tacctgtacc acaagcacct gggcaacgag atgccccgca acaacctgct catcgaaatg 1381 ctgcaagcca agcagacttg agcctgggcc gggggcgggg ccgggactgg gggcgggact 1441 gggggcgggg cctgggcggg gccgcagcca caccgctggc tccgcatggt tcattttctg 1501 atgcccaccg aggagcccca gccccgtccc agaggccgct gcccctgagt tctgacactg 1561 tgtgtttggg aaggtgggtg aggctgggca gggcctggcg gaggtggagt ggccactggc 1621 acttgcctgc tgcttggagt gccccaagga ggtggctgtt aaccacccgc cccgccccct 1681 ccctgctccc agctctctct cctggagtct gaagcctgca ggtccgggga ggaggttcgg 1741 gattccctgg tgggcctcga cgtcccttgg atcagaggtc atcccttcct cctctcctgg 1801 aaacagacag ggagaagttg agcaggtatc aactagggga ggagagaggg tctccagtgt 1861 tccccccata gagaccagga gggagagcct ctgtt // LOCUS HSU76456 1189 bp mRNA PRI 14-JAN-1997 DEFINITION Human tissue inhibitor of metalloproteinase 4 mRNA, complete cds. ACCESSION U76456 NID g1773292 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1189) AUTHORS Greene,J., Wang,M., Raymond,L.A., Liu,Y.E., Rosen,C. and Shi,Y.E. TITLE Molecular cloning and characterization of human tissue inhibitor of metalloproteinase 4 JOURNAL J. Biol. Chem. (1996) In press REFERENCE 2 (bases 1 to 1189) AUTHORS Greene,J., Wang,M., Raymond,L.A., Liu,Y.E., Rosen,C. and Shi,Y.E. TITLE Direct Submission JOURNAL Submitted (28-OCT-1996) Pediatrics, Long Island Jewish Medical Center, 270-05 76th Ave, New Hyde Park, NY 11042, USA FEATURES Location/Qualifiers source 1..1189 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="heart" CDS 60..734 /codon_start=1 /product="tissue inhibitor of metalloproteinase 4" /db_xref="PID:g1773293" /translation="MPGSPRPAPSWVLLLRLLALLRPPGLGEACSCAPAHPQQHICHS ALVIRAKISSEKVVPASADPADTEKMLRYEIKQIKMFKGFEKVKDVQYIYTPFDSSLC GVKLEANSQKQYLLTGQVLSDGKVFIHLCNYIEPWEDLSLVQRESLNHHYHLNCGCQI TTCYTVPCTISAPNECLWTDWLLERKLYGYQAQHYVCMKHVDGTCSWYRGHLPLRKEF VDIVQP" BASE COUNT 281 a 323 c 300 g 285 t ORIGIN 1 cctgctgggg ccgtccagtc ccccagacct cacaggctca gtcgcggatc tgcagtgtca 61 tgcctgggag ccctcggccc gcgccaagct gggtgctgtt gctgcggctg ctggcgttgc 121 tgcggccccc ggggctgggt gaggcatgca gctgcgcccc ggcgcaccct cagcagcaca 181 tctgccactc ggcacttgtg attcgggcca aaatctccag tgagaaggta gttccggcca 241 gtgcagaccc tgctgacact gaaaaaatgc tccggtatga aatcaaacag ataaagatgt 301 tcaaagggtt tgagaaagtc aaggatgttc agtatatcta tacgcctttt gactcttccc 361 tctgtggtgt gaaactagaa gccaacagcc agaagcagta tctcttgact ggtcaggtcc 421 tcagtgatgg aaaagtcttc atccatctgt gcaactacat cgagccctgg gaggacctgt 481 ccttggtgca gagggaaagt ctgaatcatc actaccatct gaactgtggc tgccaaatca 541 ccacctgcta cacagtaccc tgtaccatct cggcccctaa cgagtgcctc tggacagact 601 ggctgttgga acgaaagctc tatggttacc aggctcagca ttatgtctgt atgaagcatg 661 ttgacggcac ctgcagctgg taccggggcc acctgcctct caggaaggag tttgttgaca 721 tcgttcagcc ctagtaggga ccagtgacca tcacatccct tcaagagtcc tgaagatcaa 781 gccagttctc cttccctgca gagctttggc cattaccacc tgacctcttg ctgccagcta 841 ataagaagtg ccaagtggac agtctggcca ctgtcaaggc agggaagggg ccatgacttt 901 tctgccctgc cctcagcctg ttgccctgcc tcccaaaccc cattagtcta gccttgtagc 961 tgttactgca agtgtttctt ctggcttagt ctgttttcta aagccaggac tattcccttt 1021 cctccccagg aatatgtgtt ttcctttgtc ttaatcgatc tggtagggga gaaatggcga 1081 atgtcataca catgagatgg tatatccttg cgatgtacag aatcagaagg tggtttgaca 1141 gcatcataaa caggctgact ggcaggaatg aaaaaaaaaa aaaaaaaaa // LOCUS HSU76560 1434 bp mRNA PRI 27-MAR-1997 DEFINITION Human peroxisome targeting signal 2 receptor (Pex7) mRNA, complete cds. ACCESSION U76560 NID g1907314 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1434) AUTHORS Braverman,N., Steel,G., Obie,C. and Valle,D.L. TITLE Human pex7 encodes the peroxisomal PTS2 receptor and is responsible for rhizomelic chondrodysplasia punctata JOURNAL Nature Genet. (1997) In press REFERENCE 2 (bases 1 to 1434) AUTHORS Braverman,N., Steel,G., Morrel,J., Gould,S., Valle,D.L., Moser,H. and Moser,A. TITLE Direct Submission JOURNAL Submitted (28-OCT-1996) Molecular Biology/Genetics, The Johns Hopkins University School of Medicine, PCTB 803, 725 N. Wolfe St., Baltimore, MD 21205, USA FEATURES Location/Qualifiers source 1..1434 /organism="Homo sapiens" /db_xref="taxon:9606" /map="6q22-q24" /chromosome="6" gene 49..1020 /gene="Pex7" CDS 49..1020 /gene="Pex7" /note="peroxisome assembly gene that is responsible for RCDP; PTS2 receptor" /codon_start=1 /product="peroxisome targeting signal 2 receptor" /db_xref="PID:g1907315" /translation="MSAVCGGAARMLRTPGRHGYAAEFSPYLPGRLACATAQHYGIAG CGTLLILDPDEAGLRLFRSFDWNDGLFDVTWSENNEHVLITCSGDGSLQLWDTAKAAG PLQVYKEHAQEVYSVDWSQTRGEQLVVSGSWDQTVKLWDPTVGKSLCTFRGHESIIYS TIWSPHIPGCFASASGDQTLRIWDVKAAGVRIVIPAHQAEILSCDWCKYNENLLVTGA VDCSLRGWDLRNVRQPVFELLGHTYAIRRVKFSPFHASVLASCSYDFTVRFWNFSKPD SLLETVEHHTEFTCGLDFSLQSPTQVADCSWDETIKIYDPACLTIPA" BASE COUNT 378 a 280 c 359 g 417 t ORIGIN 1 gaacggcttc cgcggccggg gcagcgaggg ccgggggcgg cgggcgggat gagtgcggtg 61 tgcggtggag cggcgcggat gctgcggacg ccgggacgcc acggctacgc cgccgagttc 121 tccccgtacc tgccgggccg cctggcctgc gccaccgcgc agcactacgg catcgcgggc 181 tgtggaaccc tactaatatt ggatccagat gaagctgggc taaggctttt tagaagcttt 241 gactggaatg atggtttgtt tgatgtgact tggagtgaga acaacgaaca tgtcctcatc 301 acctgtagtg gcgatggctc gctgcagctc tgggacactg ccaaagctgc agggccactg 361 caagtctata aagaacacgc tcaggaggtg tatagtgttg attggagcca aaccagaggt 421 gaacagcttg tggtgtctgg ctcatgggat caaactgtca aattgtggga tccaactgtt 481 ggaaagtctc tgtgcacctt tagaggccat gaaagtatta tttatagcac aatctggtct 541 ccccacatcc ctggttgttt tgcttcagcc tcaggtgatc agactctgag aatatgggat 601 gtgaaggcag caggagtaag aatcgtgatt cctgcacatc aggcagaaat cttgagttgt 661 gactggtgta aatacaatga gaatttgctg gtgaccgggg cggttgactg tagtttgaga 721 ggctgggact taaggaatgt acgacaacca gtgtttgaac ttcttggtca tacctatgct 781 attaggaggg tgaaattttc accatttcat gcttctgtgc tggcctcttg ctcgtatgat 841 tttactgtaa gattctggaa cttttcaaag cctgactctc ttcttgaaac agtggagcat 901 catacagagt ttacttgtgg tttagacttc agtcttcaga gccccactca ggtggctgac 961 tgttcttggg atgaaacaat aaagatctat gaccctgctt gtcttactat tcctgcttga 1021 gatacactac tttggtcaga aacagaggat gttggctgaa gaactgccta acagcaaata 1081 aattaactat ggaaaacata gacattatgc ttttatatgc tattcagatt tcaaatcttt 1141 ccaatttacc ctggaatcag ttttgaggga gctgataaag actttagctg actcgttaag 1201 cctgatacat aagccatatt taaaattcta agaaataatt aatgttatga tatatcttgt 1261 agtatctatt aaaatgtctc tgggtcataa aatggattaa aatatgggag atcagtaggt 1321 tatacttata tagatagtga tatatttcat ttttaatttg tcatttttga tgtaaaatat 1381 aatcacttct gtgataaata aactatctat tgatcattta tcattttaaa aaaa // LOCUS HSU76638 2530 bp mRNA PRI 10-DEC-1996 DEFINITION Human BRCA1-associated RING domain protein (BARD1) mRNA, complete cds. ACCESSION U76638 NID g1710174 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2530) AUTHORS Wu,L.C., Wang,Z.W., Tsan,J.T., Spillman,M.A., Phung,A., Xu,X.L., Yang,M.-C.W., Hwang,L.-Y., Bowcock,A.M. and Baer,R. TITLE Identification of a RING protein that can interact in vivo with the BRCA1 gene product JOURNAL Nature Genet. 13 (8), 430-440 (1996) REFERENCE 2 (bases 1 to 2530) AUTHORS Wu,L.C., Wang,Z.W., Tsan,J.T., Spillman,M.A., Phung,A., Xu,X.L., Yang,M.-C.W., Hwang,L.-Y., Bowcock,A.M. and Baer,R. TITLE Direct Submission JOURNAL Submitted (29-OCT-1996) Microbiology, U.T. Southwestern Medical Center, 6000 Harry Hines Boulevard, Dallas, TX 75235-9140, USA FEATURES Location/Qualifiers source 1..2530 /organism="Homo sapiens" /note="A composite sequence derived from two overlapping cDNA clones (B202 and B230) of BARD1 mRNA" /db_xref="taxon:9606" /chromosome="2" /map="2q" gene 74..2407 /gene="BARD1" CDS 74..2407 /gene="BARD1" /codon_start=1 /product="BRCA1-associated RING domain protein" /db_xref="PID:g1710175" /translation="MPDNRQPRNRQPRIRSGNEPRSAPAMEPDGRGAWAHSRAALDRL EKLLRCSRCTNILREPVCLGGCEHIFCSNCVSDCIGTGCPVCYTPAWIQDLKINRQLD SMIQLCSKLRNLLHDNELSDLKEDKPRKSLFNDAGNKKNSIKMWFSPRSKKVRYVVSK ASVQTQPAIKKDASAQQDSYEFVSPSPPADVSERAKKASARSGKKQKKKTLAEINQKW NLEAEKEDGEFDSKEESKQKLVSFCSQPSVISSPQINGEIDLLASGSLTESECFGSLT EVSLPLAEQIESPDTKSRNEVVTPEKVCKNYLTSKKSLPLENNGKRGHHNRLSSPISK RCRTSILSTSGDFVKQTVPSENIPLPECSSPPSCKRKVGGTSGRKNSNMSDEFISLSP GTPPSTLSSSSYRQVMSSPSAMKLLPNMAVKRNHRGETLLHIASIKGDIPSVEYLLQN GSDPNVKDHAGWTPLHEACNHGHLKVVELLLQHKALVNTTGYQNDSPLHDAAKNGHVD IVKLLLSYGASRNAVNIFGLRPVDYTDDESMKSLLLLPEKNESSSASHCSVMNTGQRR DGPLVLIGSGLSSEQQKMLSELAVILKAKKYTEFDSTVTHVVVPGDAVQSTLKCMLGI LNGCWILKFEWVKACLRRKVCEQEEKYEIPEGPRRSRLNREQLLPKLFDGCYFYLWGT FKHHPKDNLIKLVTAGGGQILSRKPKPDSDVTQTINTVAYHARPDSDQRFCTQYIIYE DLCNYHPERVRQGKVWKAPSSWFIDCVMSFELLPLDS" BASE COUNT 762 a 522 c 587 g 659 t ORIGIN 1 cagcttccct gtggtttccc gaggcttcct tgcttcccgc tctgcgagga gcctttcatc 61 cgaaggcggg acgatgccgg ataatcggca gccgaggaac cggcagccga ggatccgctc 121 cgggaacgag cctcgttccg cgcccgccat ggaaccggat ggtcgcggtg cctgggccca 181 cagtcgcgcc gcgctcgacc gcctggagaa gctgctgcgc tgctcgcgtt gtactaacat 241 tctgagagag cctgtgtgtt taggaggatg tgagcacatc ttctgtagta attgtgtaag 301 tgactgcatt ggaactggat gtccagtgtg ttacaccccg gcctggatac aagacttgaa 361 gataaataga caactggaca gcatgattca actttgtagt aagcttcgaa atttgctaca 421 tgacaatgag ctgtcagatt tgaaagaaga taaacctagg aaaagtttgt ttaatgatgc 481 aggaaacaag aagaattcaa ttaaaatgtg gtttagccct cgaagtaaga aagtcagata 541 tgttgtgagt aaagcttcag tgcaaaccca gcctgcaata aaaaaagatg caagtgctca 601 gcaagactca tatgaatttg tttccccaag tcctcctgca gatgtttctg agagggctaa 661 aaaggcttct gcaagatctg gaaaaaagca aaaaaagaaa actttagctg aaatcaacca 721 aaaatggaat ttagaggcag aaaaagaaga tggtgaattt gactccaaag aggaatctaa 781 gcaaaagctg gtatccttct gtagccaacc atctgttatc tccagtcctc agataaatgg 841 tgaaatagac ttactagcaa gtggctcctt gacagaatct gaatgttttg gaagtttaac 901 tgaagtctct ttaccattgg ctgagcaaat agagtctcca gacactaaga gcaggaatga 961 agtagtgact cctgagaagg tctgcaaaaa ttatcttaca tctaagaaat ctttgccatt 1021 agaaaataat ggaaaacgtg gccatcacaa tagactttcc agtcccattt ctaagagatg 1081 tagaaccagc attctgagca ccagtggaga ttttgttaag caaaccgtgc cctcagaaaa 1141 tataccattg cctgaatgtt cttcaccacc ttcatgcaaa cgtaaagttg gtggtacatc 1201 agggaggaaa aacagtaaca tgtccgatga attcattagt ctttcaccag gtacaccacc 1261 ttctacatta agtagttcaa gttacaggca agtgatgtct agtccctcag caatgaagct 1321 gttgcccaat atggctgtga aaagaaatca tagaggagag actttgctcc atattgcttc 1381 tattaagggc gacatacctt ctgttgaata ccttttacaa aatggaagtg atccaaatgt 1441 taaagaccat gctggatgga caccattgca tgaagcttgc aatcatgggc acctgaaggt 1501 agtggaatta ttgctccagc ataaggcatt ggtgaacacc accgggtatc aaaatgactc 1561 accacttcac gatgcagcca agaatgggca cgtggatata gtcaagctgt tactttccta 1621 tggagcctcc agaaatgctg ttaatatatt tggtctgcgg cctgtcgatt atacagatga 1681 tgaaagtatg aaatcgctat tgctgctacc agagaagaat gaatcatcct cagctagcca 1741 ctgctcagta atgaacactg ggcagcgtag ggatggacct cttgtactta taggcagtgg 1801 gctgtcttca gaacaacaga aaatgctcag tgagcttgca gtaattctta aggctaaaaa 1861 atatactgag tttgacagta cagtaactca tgttgttgtt cctggtgatg cagttcaaag 1921 taccttgaag tgtatgcttg ggattctcaa tggatgctgg attctaaaat ttgaatgggt 1981 aaaagcatgt ctacgaagaa aagtatgtga acaggaagaa aagtatgaaa ttcctgaagg 2041 tccacgcaga agcaggctca acagagaaca gctgttgcca aagctgtttg atggatgcta 2101 cttctatttg tggggaacct tcaaacacca tccaaaggac aaccttatta agctcgtcac 2161 tgcaggtggg ggccagatcc tcagtagaaa gcccaagcca gacagtgacg tgactcagac 2221 catcaataca gtcgcatacc atgcgagacc cgattctgat cagcgcttct gcacacagta 2281 tatcatctat gaagatttgt gtaattatca cccagagagg gttcggcagg gcaaagtctg 2341 gaaggctcct tcgagctggt ttatagactg tgtgatgtcc tttgagttgc ttcctcttga 2401 cagctgaata ttataccaga tgaacatttc aaattgaatt tgcacggttt gtgagagccc 2461 agtcattgta ctgtttttaa tgttcacatt tttacaaata ggtagagtca ttcatatttg 2521 tctttgaatc // LOCUS HSU76992 2672 bp mRNA PRI 14-NOV-1996 DEFINITION Human Tat-SF1 mRNA, complete cds. ACCESSION U76992 NID g1667610 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2672) AUTHORS Zhou,Q. and Sharp,P.A. TITLE Tat-SF1: cofactor for stimulation of transcriptional elongation by HIV-1 Tat JOURNAL Science 274 (5287), 605-610 (1996) MEDLINE 97002454 REFERENCE 2 (bases 1 to 2672) AUTHORS Zhou,Q. and Sharp,P.A. TITLE Direct Submission JOURNAL Submitted (31-OCT-1996) Center for Cancer Research, Massachusetts Institute of Technology, 40 Ames St., Cambridge, MA 02139, USA FEATURES Location/Qualifiers source 1..2672 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HL60" gene 58..2322 /gene="Tat-SF1" CDS 58..2322 /gene="Tat-SF1" /function="cofactor required for Tat activation of HIV-1 transcription" /note="similar to EWS and FUS/TLS77" /codon_start=1 /product="Tat-SF1" /db_xref="PID:g1667611" /translation="MSGTNLDGNDEFDEQLRMQELYGDGKDGDTQTDAGGEPDSLGQQ PTDTPYEWDLDKKAWFPKITEDFIATYQANYGFSNDGASSSTANVEDVHARTAEEPPQ EKAPEPTDARKKGEKRKAESGWFHVEEDRNTNVYVSGLPPDITVDEFIQLMSKFGIIM RDPQTEEFKVKLYKDNQGNLKGDGLCCYLKRESVELALKLLDEDEIRGYKLHVEVAKF QLKGEYDASKKKKKCKDYKKKLSMQQKQLDWRPERRAGPSRMRHERVVIIKNMFHPMD FEDDPLVLNEIREDLRVECSKFGQIRKLLLFDRHPDGVASVSFRDPEEADYCIQTLDG RWFGGRQITAQAWDGTTDYQVEETSREREERLRGWEAFLNAPEANRGLSVQILSLLRK AGPSRARHFSEHPSTSKMNAQETATGMAFEEPIDEKKFEKTEDGGEFEEGASENNAKE SSPEKEAEEGCPEKESEEGCPKRGFEGSCSQKESEEGNPVRGSEEDSPKKESKKKTLK NDCEENGLAKESEDDLNKESEEEVGPTKESEEDDSEKESDEDCSEKQSEDGSEREFEE NGLEKDLDEEGSEKELHENVLDKELEENDSENSEFEDDGSEKVLDEEGSEREFDEDSD EKEEEEDTYEKVFDDESDEKEDEEYADEKGLEAADKKAEEGDADEKLFEESDDKEDED ADGKEVEDADEKLFEDDDSNEKLFDEEEDSSEKLFDDSDERGTLGGFGSVEEGPLSTG SSFILSSDDDDDDI" BASE COUNT 898 a 420 c 711 g 643 t ORIGIN 1 agcgtcattt cggcctctta gttcttctga accctgctcc tgagctaggt aggaaacatg 61 agcggcacca acttggatgg gaacgatgag tttgatgagc agttgcgaat gcaagaattg 121 tacggagacg gcaaggatgg tgacacccag accgatgccg gcggagaacc cgattctctc 181 gggcagcagc cgacggacac tccctacgag tgggacctgg acaaaaaggc ttggttcccc 241 aagattactg aagatttcat tgctacatat caggccaatt atggcttctc taacgatggc 301 gcatctagtt ctaccgcaaa tgttgaagat gtccatgcta ggactgcaga ggaacctcca 361 caagaaaaag ccccggaacc cactgatgcc agaaagaagg gagaaaaaag aaaggctgag 421 tcaggatggt ttcatgttga agaagacaga aatacaaatg tatacgtgtc tggtttgcct 481 ccagatatta cagtggatga atttatacaa cttatgtcca agtttggcat tattatgaga 541 gatcctcaga cagaagaatt taaggtcaaa ctttacaaag ataatcaagg aaatcttaaa 601 ggagacggtc tttgctgtta tttgaaaaga gaatctgtgg aacttgcatt aaaacttttg 661 gatgaagatg aaattagagg ctacaaatta catgttgagg tggcaaagtt tcaactgaag 721 ggagaatatg atgcctcaaa gaagaagaag aagtgcaaag actataagaa gaagctgtct 781 atgcaacaaa agcagttgga ttggagacct gagaggcgag ccggaccatc ccggatgcgc 841 catgagcgag ttgtcatcat caagaatatg tttcatccta tggattttga ggatgatccg 901 ttggtgctga atgagatcag agaagacctt cgagtagagt gttcgaagtt tggacaaatt 961 aggaaactcc ttctctttga taggcaccca gatggtgtgg cctctgtgtc ctttcgggat 1021 ccagaggaag ctgattattg tattcagact ctcgatggaa gatggtttgg tggccgtcaa 1081 atcactgccc aggcatggga tgggactaca gattatcagg tggaggaaac ctcaagagaa 1141 agggaggaaa ggctgagagg atgggaggct ttcctcaatg ctcctgaggc caacagaggc 1201 cttagcgttc agattctgtc tctgcttcga aaggcagggc cttctagagc aaggcatttt 1261 tcagagcacc ccagcacatc taaaatgaat gctcaagaaa ctgcaactgg aatggcattt 1321 gaagaaccta tagatgagaa gaagtttgaa aagacagaag atgggggaga atttgaagaa 1381 ggtgcttctg aaaacaatgc taaggaaagt agccccgaaa aagaggctga agaaggctgc 1441 cctgaaaaag aatctgaaga gggctgcccc aaaagagggt ttgaaggcag ctgctcccaa 1501 aaagagtctg aagaaggcaa tcccgtaaga ggatctgaag aggatagtcc taaaaaagag 1561 tctaaaaaga agacactcaa aaatgattgt gaagagaatg gccttgcaaa ggaatctgaa 1621 gatgacctca acaaggagtc tgaagaggag gttggcccca caaaagagtc cgaagaagat 1681 gactcagaga aagagtctga tgaagactgc tctgaaaaac agtctgaaga tggctccgaa 1741 agagaatttg aagaaaatgg tctcgagaaa gatttggacg aggaaggttc tgaaaaggag 1801 cttcatgaaa atgttcttga caaagagtta gaagaaaatg actctgaaaa ctccgaattt 1861 gaagatgacg gctctgaaaa agtgttagat gaggaaggct ctgagagaga gtttgacgaa 1921 gattcagatg aaaaggaaga agaggaggat acatatgaaa aagtatttga tgatgagtct 1981 gatgagaaag aggatgaaga atatgcagat gaaaaggggc ttgaagctgc tgataaaaag 2041 gcggaagaag gtgatgcaga tgaaaagctg tttgaagagt cagatgacaa ggaagatgaa 2101 gatgcagatg gaaaggaagt tgaagatgct gacgaaaagt tgttcgaaga tgatgattcc 2161 aatgagaagt tgtttgatga ggaggaagat tccagtgaga agttgtttga cgattctgat 2221 gagaggggga ctttgggtgg ttttgggagt gttgaagaag ggcccctatc cactggcagc 2281 agctttattc tcagtagcga tgatgatgac gatgatattt aatcccttaa acttgctttt 2341 tagggagagt cctccatcta catttgcctg tgcttcaggg taattactag tagtgttaca 2401 tgaacatgtg catagtggta ggatgccatc agattaaagc attgaagtgt ttcattgtta 2461 cctgtaccta atggttttaa atatatgtta attgattgtt tagttaaaat gtcatagtta 2521 caatgcaagt aaactggata cttgttcttt tgtcagattt gttaaatgca tgcagaataa 2581 tatttttaag agtattgatt gaagtttgtg atattcatca ataaaaatga gttgataata 2641 tgcagaaact gaaaaaaaaa aaaaaaaaaa aa // LOCUS HSU77088 1960 bp mRNA PRI 25-MAR-1997 DEFINITION Human thymidine kinase 2 (TK2) mRNA, complete cds. ACCESSION U77088 NID g1905968 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1960) AUTHORS Johansson,M. and Karlsson,A. TITLE Cloning of the cDNA and chromosome localization of the gene for human thymidine kinase 2 JOURNAL J. Biol. Chem. 272 (13), 8454-8458 (1997) MEDLINE 97236800 REFERENCE 2 (bases 1 to 1960) AUTHORS Johansson,M. and Karlsson,A. TITLE Direct Submission JOURNAL Submitted (02-NOV-1996) Medical Biochemistry and Biophysics, Karolinska Institute, Doktorsringen 2A, Stockholm 171 77, Sweden FEATURES Location/Qualifiers source 1..1960 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="4" gene 9..713 /gene="TK2" CDS 9..713 /gene="TK2" /function="deoxyribonucleoside kinase" /codon_start=1 /product="thymidine kinase 2" /db_xref="PID:g1905969" /translation="MGAFCQRPSSDKEQEKEKKSVICVEGNIAGGKTTCLEFFSNATD VEVLTEPVSKWRNVRGHNPLGLMYHDASRWGLTLQTYVQLTMLDRHTRPQVSSVRLME RSIHSARYIFVENLYRSGKMPEVDYVVLSEWFDWILRNMDVSVDLIVYLRTNPETCYQ RLKKRCREEEKVIPLEYLEAIHHLHEEWLIKGSLFPMAAPVLVIEADHHMERMLELFE QNRDRILTPENRKHCP" BASE COUNT 471 a 465 c 525 g 499 t ORIGIN 1 gccaagttat gggtgcgttc tgccagcgtc ctagcagtga taaagaacag gaaaaagaga 61 aaaaatcagt gatctgtgtc gagggcaata ttgcaggtgg gaagacgaca tgcctggaat 121 tcttctccaa cgcgacagac gtcgaggtgt taacggagcc tgtgtccaag tggagaaatg 181 tccgtggcca caatcctctg ggcctgatgt accacgatgc ctctcgctgg ggtcttacgc 241 tacagactta tgtgcagctc accatgctgg acaggcatac tcgtcctcag gtgtcatctg 301 tacggttgat ggagaggtcg attcacagcg caagatacat ttttgtagaa aacctgtata 361 gaagtgggaa gatgccagaa gtggactatg tagttctgtc ggaatggttt gactggatct 421 tgaggaacat ggacgtgtct gttgatttga tagtttacct tcggaccaat cctgagactt 481 gttaccagag gttaaagaag agatgcaggg aagaggagaa ggtcattccg ctggaatacc 541 tggaagcaat tcaccatctc catgaggagt ggctcatcaa aggcagcctt ttccccatgg 601 cagcccctgt tctggtgatt gaggctgacc accacatgga gaggatgtta gaactctttg 661 aacaaaatcg ggatcgaata ttaactccag agaatcggaa gcattgccca taggaggcaa 721 aaggtctatg gctcatgtct gaaaaatgcc tgctgctgcc aagttagcta ttgggagcaa 781 tctggaaaaa cttgctccca ggagggcttt gtgtctggcc agcttgattt tcctaatggt 841 ctcatctcct ttgctagtgt ctttgtcatg cgtctctggc cctcgtgggt aaatgacaaa 901 cgggaccaat gggtttgcca agccctttgc tgttcgcagc cctcacattc ccccggtgcc 961 tctcccatgg ctttgtgctg ctgagtcgct ctcatgaagc ccttagggga gagcacctgt 1021 tgtgtgcctg acaccacgct ggagctgtgt accaatcgtc tcagccttca ttaggaggcc 1081 gaggtaggag tcttatatcc caggtgagga atttgaagct cagaaaggtt gaggggctcc 1141 ccagaggtca cacagcctgt gtgcagtgga gctggcacca ttcagacttt cagccgactc 1201 agcaactttc ccttgccctg ggctgcctcc tcctgagagc tgttccccac cgccctgcct 1261 cttccggttg gaggctctca tgtctctttg gggagagctg gcagtgtgcg gagctgataa 1321 cattttccca atattgagca gttcccaagg acagtcagca tttctagact tccacaaaat 1381 tatgctgcat ttggctggag cccggtgttc agtggtttcc ctgcccgagg tcgctgcagc 1441 cccatctacc acatcttcat gtggacattg agattcacat gctggctcct gaagggtgct 1501 cagtctcctt ggtgattaag gtcctgcttg aactgctgcc aactccatgt cagggaagtc 1561 gcttttggtg cctggctggt ttgcccagag ccaagctggg gcaaggggca gccagccctg 1621 gcttccaagg ctcccgtact gtctgtgtcc ttgtataagg agctttgctc ttggaattac 1681 tgaaagtctg tggccctaag agagagacac aagtggcctt aagtcttttt gaagtgttat 1741 ttcatccagg gaaatgcctc gagccataga gcctgaaatc atctttgttg gctcagaaaa 1801 taccttagct tcactcagct ggactgcatt gaaggcgagg ctgccccttg gatcaagcag 1861 aaaacaagag aaagaaagaa cgttcccttt ggggatagtc tggaaagttg ggatttgcaa 1921 ataaaggctc tggaagcatt aaaaaaaaaa aaaaaaaaaa // LOCUS HSU77129 3000 bp mRNA PRI 04-MAR-1997 DEFINITION Human SPS1/STE20 homolog KHS1 mRNA, complete cds. ACCESSION U77129 NID g1857330 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3000) AUTHORS Tung,R.M. and Blenis,J. TITLE A Novel Human SPS1/STE20 Homologue, KHS, activates Jun N-terminal Kinase JOURNAL Oncogene 14 (1997) In press REFERENCE 2 (bases 1 to 3000) AUTHORS Tung,R.M. and Blenis,J. TITLE Direct Submission JOURNAL Submitted (02-NOV-1996) Cell Biology, Harvard Medical School, 240 Longwood Ave, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..3000 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 321..2861 /function="kinase" /note="SPS1/STE20 homolog" /codon_start=1 /product="KHS1" /db_xref="PID:g1857331" /translation="MEAPLRPAADILRRNPQQDYELVQRVGSGTYGDVYKARNVHTGE LAAVKIIKLEPGDDFSLIQQEIFMVKECKHCNIVAYFGSYLSREKLWICMEYCGGGSL QDIYHVTGPLSELQIAYVCRETLQGLAYLHTKGKMHRDIKGANILLTDHGDVKLADFG VAAKITATIAKRKSFIGTPYWMAPEVAAVEKNGGYNQLCDIWAVGITAIELGELQPPM FDLHPMRALFLMSKSNFQPPKLKDKTKWSSTFHNFVKIALTKNPKKRPTAERLLTHTF VAQPGLSRALAVELLDKVNNPDNHAHYTEADDDDFEPHAIIRHTIRSTNRNARAERTA SEINFDKLQFEPPLRKETEARDEMGLSSDPNFMLQWNPFVDGANTGKSTSKRAIPPPL PPKPRISSYPEDNFPDEEKASTIKHCPDSESRAPQILRRQSSPSCGPVAETSSIGNGD GISKLMSENTEGSAQAPQLPRKNDKRDFPKPAINGLPPTPKVLMGACFSKVFDGCPLK INCATSWIHPDTKDQYIIFGTEDGIYTLNLNELHEATMEQLFPRKCTWLYVINNTLMS LSEGKTFQLYSHNLIALFEHAKKPGLAAHIQTHRFPDRILPRKFALTTKIPDTKGCHK CCIVRNPYTGHKYLCGALQSGIVLLQWYEPMQKFMLIKHFDFPLPSPLNVFEMLVIPE QEYPMVCVAISKGTESNQVVQFETINLNSASSWFTEIGAGSQQLDSIHVTQLERDTVL VCLDKFVKIVNLQGKLKSSKKLASELSFDFRIESVVCLQDSVLAFWKHGMQGKSFKSD EVTQEISDETRVFRLLGSDRVVVLESRPTENPTAHSNLYILAGHENSY" BASE COUNT 936 a 632 c 688 g 744 t ORIGIN 1 ggcgccgacc catgctggct gggaacgtgt ctcccggtga cgcagccccg ggtggggaac 61 gtggtgcggc ggaagaggcg gtggtgactg tacgcgcctc cgccgccccc gagaggacgc 121 gccgtgcagc ggctgagtgg cggcggcggc gacggcaaac ccggagctgc cggccggcgc 181 gcgggaggag gacgcgggtg cggtctagga aacggagctg cgggcggagg ctccatgttg 241 ggaagcggcg ccgttcgtgc ttgttagcgg gaatccggga gccgcggggt gagctggcgg 301 gggccgggcc ctaagtgaag atggaggccc cgctgcggcc tgccgcggac atcctgaggc 361 ggaacccgca gcaggactac gaactcgtcc agagggtcgg cagcggcacc tacggggacg 421 tctataaggc cagaaatgta cacacaggag agctggctgc agtaaaaatc attaaattgg 481 agcctggaga tgatttttct ttgattcaac aagaaatatt tatggttaaa gaatgtaaac 541 attgtaacat cgttgcctac tttgggagtt atcttagtcg ggaaaaacta tggatttgta 601 tggaatactg tggtggcgga tcacttcaag atatttacca tgttactgga ccattatcag 661 aattgcaaat agcctatgta tgcagagaaa ccttacaggg tcttgcctat ttgcatacta 721 aaggcaaaat gcatagagat atcaaaggtg ctaatatttt attgacagac catggcgatg 781 taaaattagc tgactttggt gtggctgcaa aaataacagc taccattgca aaacgaaaat 841 ctttcattgg caccccttac tggatggccc cagaagttgc agcagtagag aagaatggtg 901 gctacaacca actctgtgat atctgggcag taggaataac agcaattgaa cttggagaac 961 ttcagccacc tatgtttgat ctccacccaa tgagggctct cttcttaatg tcaaaaagta 1021 attttcagcc tccaaaacta aaggacaaaa caaaatggtc atcaacattc cataattttg 1081 tcaaaatagc actaaccaaa aacccaaaaa aaagaccaac tgctgaaaga cttctgactc 1141 acacttttgt tgcacagcca ggtctctcta gagccctagc agttgaactg ttagacaaag 1201 tgaacaatcc agataaccac gcacattaca ctgaagcaga tgacgatgac tttgagcccc 1261 atgcaatcat tcgtcatacc attagatcta caaacaggaa tgccagagct gaacggacag 1321 cttcagaaat aaattttgac aaattacaat ttgaacctcc tctgagaaaa gaaacagaag 1381 cacgagatga aatgggattg tcatcagacc caaatttcat gttacagtgg aatccttttg 1441 ttgatggtgc aaatactggc aaatcaacct caaaacgtgc aataccacct cccctacctc 1501 ctaagccaag gataagcagt taccctgaag acaactttcc ggatgaagaa aaagcatcaa 1561 ccataaaaca ttgtcctgat tcagaaagca gagctcccca aattctcaga agacagagta 1621 gcccaagttg tgggcctgtg gcagagactt cttctattgg aaatggtgat ggtatttcaa 1681 aactgatgag tgaaaataca gaaggatcag cacaagcacc acagttacca cgaaaaaacg 1741 acaaacgaga cttccctaaa ccagccatca atggccttcc acccacccca aaagttctga 1801 tgggagcatg cttttcaaaa gtttttgatg gctgtccttt gaaaattaat tgtgcaacat 1861 cctggataca tcctgataca aaagatcagt acattatttt tggaactgaa gatggtattt 1921 acacactgaa tctcaatgag ctacatgagg caacgatgga acagttattt ccacggaagt 1981 gtacttggct gtatgttatc aataatactt taatgtcatt atcagaagga aaaacctttc 2041 agctctactc tcacaatctt atagctttgt ttgaacatgc caaaaaacca ggattagctg 2101 cccatattca aactcacagg tttccagacc gaatactacc aagaaaattc gctttaacaa 2161 caaagattcc tgatacaaaa ggctgccaca aatgttgcat agtcagaaac ccttacacgg 2221 gacataaata cctctgtgga gctttacagt ctggaattgt tttacttcag tggtatgagc 2281 caatgcagaa attcatgttg ataaagcact ttgattttcc tttgccaagt cctttgaatg 2341 tttttgaaat gctggtgata cctgaacagg aataccctat ggtctgtgta gctattagca 2401 aaggcactga atcgaatcag gtagttcagt ttgagacaat caatttgaac tctgcatctt 2461 catggtttac agaaattggt gcaggcagcc agcagttaga ttccattcat gtaacacagt 2521 tggagagaga taccgtttta gtgtgtttag acaaatttgt gaaaattgta aatctacaag 2581 gaaaattaaa atcaagtaag aaactggcct ctgagttaag ttttgatttt cgcattgaat 2641 ctgtagtatg ccttcaagac agtgtgttgg ctttctggaa acatgggatg cagggtaaaa 2701 gcttcaagtc agatgaggtt acccaggaga tttcagatga aacaagagtt ttccgcttat 2761 taggatcaga cagggttgtc gttttggaaa gtaggccaac agaaaatcct actgcacaca 2821 gcaatctcta catcttggct ggacatgaaa atagttacta agcaacagaa actgatctca 2881 aatgacagga aaatgaatat actccattga aagggaaaat aaggaaattc aatacaaact 2941 gcactatgat ttgctttaac tattatgggt tatattgcaa atgatctgta ctttagggta // LOCUS HSU77413 3084 bp mRNA PRI 21-JUL-1997 DEFINITION Human O-linked GlcNAc transferase mRNA, complete cds. ACCESSION U77413 NID g2266993 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3084) AUTHORS Lubas,W.A., Frank,D.W., Krause,M. and Hanover,J.A. TITLE O-Linked GlcNAc transferase is a conserved nucleocytoplasmic protein containing tetratricopeptide repeats JOURNAL J. Biol. Chem. 272 (14), 9316-9324 (1997) MEDLINE 97238870 REFERENCE 2 (bases 1 to 3084) AUTHORS Lubas,W.A., Frank,D., Krause,M. and Hanover,J.A. TITLE Direct Submission JOURNAL Submitted (05-NOV-1996) Laboratory of Cell Biochemistry and Biology, NIDDK, NIH, 9000 Rockville Pike, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..3084 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" CDS 266..3028 /function="adds O-linked GlcNAc to transcription factors and nuclear pore proteins" /note="contains TPR motifs" /codon_start=1 /product="O-linked GlcNAc transferase" /db_xref="PID:g2266994" /translation="MLQGHFWLVREGIMISPSSPPPPNLFFFPLQIFPFPFTSFPSHL LSLTPPKACYLKAIETQPNFAVAWSNLGCVFNAQGEIWLAIHHFEKAVTLDPNFLDAY INLGNVLKEARIFDRAVAAYLRALSLSPNHAVVHGNLACVYYEQGLIDLAIDTYRRAI ELQPHFPDAYCNLANALKEKGSVAEAEDCYNTALRLCPTHADSLNNLANIKREQGNIE EAVRLYRKALEVFPEFAAAHSNLASVLQQQGKLQEALMHYKEAIRISPTFADAYSNMG NTLKEMQDVQGALQCYTRAIQINPAFADAHSNLASIHKDSGNIPEAIASYRTALKLKP DFPDAYCNLAHCLQIVCDWTDYDERMKKLVSIVADQLEKNRLPSVHPHHSMLYPLSHG FRKAIAERHGNLCLDKINVLHKPPYEHPKDLKLSDGRLRVGYVSSDFGNHPTSHLMQS IPGMHNPDKFEVFCYALSPDDGTNFRVKVMAEANHFIDLSQIPCNGKAADRIHQDGIH ILVNMNGYTKGARNELFALRPAPIQAMWLGYPGTSGALFMDYIITDQETSPAEVAEQY SEKLAYMPHTFFIGDHANMFPHLKKKAVIDFKSNGHIYDNRIVLNGIDLKAFLDSLPD VKIVKMKCPDGGDNADSSNTALNMPVIPMNTIAEAVIEMINRGQIQITINGFSISNGL ATTQINNKAATGEEVPRTIIVTTRSQYGLPEDAIVYCNFNQLYKIDPSTLQMWANILK RVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRIIFSPVAPKEEHVRRGQLADVCLDT PLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQLTCLGCLELIAKNRQEYEDIAVK LGTDLEYLKKVRGKVWKQRISSPLFNTKQYTMELERLYLQMWEHYAAGNKPDHMIKPV EVTESA" BASE COUNT 859 a 690 c 699 g 836 t ORIGIN 1 tccggaaaca gtgggggtag gaaaactcgg cctcaagttg cgccctctag gtagcacttg 61 aaaacatgac aagggcccgt agttgtttgg ataagagaac tccagcatag agccttatag 121 caactgactt cccagttaag tcccagtgta agggttggtc tttggttggc agaactgaac 181 atggtggttt gcacttgggt tctggtggcg caggcgcagg agcagccagc tgtggcagcg 241 cattagtttt ggcgcaagcg agcctatgct gcagggtcac ttttggctgg tcagagaagg 301 aataatgata tcaccttctt ccccccctcc ccccaatctt ttttttttcc ctttacaaat 361 tttccccttt ccctttacct cctttccctc ccatcttctt tcattaaccc ctcctaaggc 421 atgttatttg aaagcaattg agacgcaacc gaactttgca gtagcttgga gtaatcttgg 481 ctgtgttttc aatgcacaag gggaaatttg gcttgcaatt catcactttg aaaaggctgt 541 cacccttgac ccaaactttc tggatgctta tatcaattta ggaaatgtct tgaaagaggc 601 acgcattttt gacagagctg tggcagctta tcttcgtgcc ctaagtttga gtccaaatca 661 cgcagtggtg cacggcaacc tggcttgtgt atactatgag caaggcctga tagatctggc 721 aatagacacc tacaggcggg ctatcgaact acaaccacat ttccctgatg cttactgcaa 781 cctagccaat gctctcaaag agaagggcag tgttgctgaa gcagaagatt gttataatac 841 agctctccgt ctgtgtccca cccatgcaga ctctctgaat aacctagcca atatcaaacg 901 agaacaggga aacattgaag aggcagttcg cttgtatcgt aaagcattag aagtcttccc 961 agagtttgct gctgcccatt caaatttagc aagtgtactg cagcagcagg gaaaactgca 1021 ggaagctctg atgcattata aggaggctat tcgaatcagt cctacctttg ctgatgccta 1081 ctctaatatg ggaaacactc taaaggagat gcaggatgtt cagggagcct tgcagtgtta 1141 tacgcgtgcc atccaaatta atcctgcatt tgcagatgca catagcaatc tggcttccat 1201 tcataaggat tcagggaata ttccagaagc catagcttct taccgcacgg ctctgaaact 1261 taagcctgat tttcctgatg cttattgtaa cttggctcat tgcctgcaga ttgtctgtga 1321 ttggacagac tatgatgagc gaatgaagaa gttggtcagt attgtggctg accagttaga 1381 gaagaatagg ttgccttctg tgcatcctca tcatagtatg ctatatcctc tttctcatgg 1441 cttcaggaag gctattgctg agaggcacgg caacctgtgc ttagataaga ttaatgttct 1501 tcataaacca ccatatgaac atccaaaaga cttgaagctc agtgatggtc ggctgcgtgt 1561 aggatatgtg agttccgact ttgggaatca tcctacttct caccttatgc agtctattcc 1621 aggcatgcac aatcctgata aatttgaggt gttctgttat gccctgagcc cagacgatgg 1681 cacaaacttc cgagtgaagg tgatggcaga agccaatcat ttcattgatc tttctcagat 1741 tccatgcaat ggaaaagcag ctgatcgcat ccatcaggat ggaattcata tccttgtaaa 1801 tatgaatggc tatactaagg gcgctcgaaa tgagcttttt gctctcaggc cagctcctat 1861 tcaggcaatg tggctgggat accctgggac gagtggtgcg cttttcatgg attatattat 1921 cactgatcag gaaacttcgc cagctgaagt tgctgagcag tattccgaga aattggctta 1981 tatgccccac acttttttta ttggtgatca tgctaatatg ttccctcacc tgaagaaaaa 2041 agcagtcatc gattttaagt ccaatgggca catttatgac aatcggatag ttctgaatgg 2101 catcgacctc aaagcatttc ttgatagtct accagatgtg aaaattgtca agatgaagtg 2161 tcctgatgga ggagacaatg cagatagcag taacacagct cttaatatgc ctgttattcc 2221 tatgaatact attgcagaag cagttattga aatgattaac cgaggacaga ttcaaataac 2281 aattaatgga ttcagtatta gcaatggact ggcaactact cagatcaaca ataaggctgc 2341 aactggagag gaggttcccc gtaccattat tgtaaccacc cgttctcagt acgggttacc 2401 agaagatgcc atcgtatact gtaactttaa tcagttgtat aaaattgacc cttctacttt 2461 gcagatgtgg gcaaacattc tgaagcgtgt tcccaatagt gtactctggc tgttgcgttt 2521 tccagcagta ggagaaccta atattcaaca gtatgcacaa aacatgggcc tgccccagaa 2581 ccgtatcatt ttttcacctg ttgctcctaa agaggaacac gtcaggagag gccagctggc 2641 tgatgtctgc ttggacactc cactctgtaa tgggcacacc acagggatgg atgtcctctg 2701 ggcagggacc cccatggtga ctatgccagg agagactctt gcttctcgag ttgcagcatc 2761 ccagctcact tgcttaggtt gtcttgagct tattgctaaa aacagacaag aatatgaaga 2821 catagctgtg aagctgggaa ctgatctaga atacctgaag aaagttcgtg gcaaagtctg 2881 gaagcaaaga atatctagcc ctctgttcaa caccaaacaa tacacaatgg aactagagcg 2941 gctctatcta cagatgtggg agcattatgc agctggcaac aaacctgacc acatgattaa 3001 gcctgttgaa gtcactgagt cagcataaat aaagactgca caggagaatt acccctaaaa 3061 aaaaaaaaaa aaaagggcgg ccgc // LOCUS HSU77456 2534 bp mRNA PRI 21-NOV-1996 DEFINITION Human nucleosome assembly protein 2 mRNA, complete cds. ACCESSION U77456 NID g1679778 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2534) AUTHORS Hu,R.J., Lee,M.P., Johnson,L.A. and Feinberg,A.P. TITLE A novel human homologue of yeast nucleosome assembly protein, 65 kb centromeric to the p57KIP2 gene, is biallelically expressed in fetal and adult tissues JOURNAL Hum. Mol. Genet. 5 (11), 1743-1748 (1996) MEDLINE 97081759 REFERENCE 2 (bases 1 to 2534) AUTHORS Hu,R.-J., Lee,M., Johnson,L. and Feinberg,A. TITLE Direct Submission JOURNAL Submitted (05-NOV-1996) Department of Medicine, 1064 Ross, Johns Hopkins University School of Medicine, 720 Rutland Ave, Baltimore, MD 21205, USA FEATURES Location/Qualifiers source 1..2534 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11p15.5" CDS 150..1277 /note="hNAP2" /codon_start=1 /product="nucleosome assembly protein 2" /db_xref="PID:g1679779" /translation="MADHSFSDGVPSDSVEAAKNASNTEKLTDQVMQNPRVLAALQER LDNVPHTPSSYIETLPKAVKRRINALKQLQVRCAHIEAKFYEEVHDLERKYAALYQPL FDKRREFITGDVEPTDAESEWHSENEEEEKLAGDMKSKVVVTEKAAATAEEPDPKGIP EFWFTIFRNVDMLSELVQEYDEPILKHLQDIKVKFSDPGQPMSFVLEFHFEPNDYFTN SVLTKTYKMKSEPDKADPFSFEGPEIVDCDGCTIDWKKGKNVTVKTIKKKQKHKGRGT VRTITKQVPNESFFNFFNPLKASGDGESLDEDSEFTLASDFEIGHFFRERIVPRAVLY FTGEAIEDDDNFEEGEEGEEEELEGDEEGEDEDDAEINPKV" BASE COUNT 660 a 578 c 684 g 611 t 1 others ORIGIN 1 ggcacgagct tgctcgcgga gagaggagct aggagcctcg gccaatggga gccggcgttg 61 ttggaggcca cggcggcgca gccccaaagc gagcgaagct agggtcgccg ccactgccgc 121 aggaggcgtg aggggataaa aacattcaga tggcagatca cagtttttca gatggggttc 181 cttcagattc cgtggaagct gctaaaaatg caagtaacac agaaaagctc acagatcagg 241 tgatgcagaa tcctcgagtt ctggcagctt tacaggagcg acttgacaat gtccctcaca 301 ccccttccag ctacatcgaa actttaccta aagcagtaaa aagaagaatt aatgcattga 361 aacaacttca ggtgagatgt gctcacatag aagccaagtt ctatgaagag gtacatgact 421 tggaaagaaa gtatgcagcg ctataccagc ctctctttga caagagaaga gaatttatca 481 ccggcgatgt tgaaccaaca gatgcggaat cggaatggca cagtgaaaat gaagaggaag 541 agaaattggc tggagacatg aaaagtaaag tagtcgtcac agaaaaagca gcggcaacgg 601 ctgaagagcc agatcccaaa ggaattccag agttctggtt taccatcttc agaaatgtgg 661 acatgctgag tgaattagtc caggaatatg atgaaccaat cttgaaacac ctgcaggata 721 ttaaagtgaa attttctgac cctggacagc ctatgtcttt tgtgttagag ttccactttg 781 aacccaacga ctactttacc aactcagtcc tgacaaaaac ctacaagatg aaatcagaac 841 cagataaggc tgatcccttt tcctttgaag gtcctgagat tgtggactgt gacgggtgta 901 ctattgactg gaagaaagga aagaatgtta ctgtcaaaac catcaagaaa aagcagaagc 961 ataagggtcg aggcactgtt agaacaatta cgaaacaagt acccaatgag tcctttttca 1021 acttcttcaa tccattgaaa gcatccgggg atggagaatc actggatgaa gattctgaat 1081 tcacattagc ctctgatttt gaaattggac actttttccg tgagcggata gtcccgcggg 1141 ctgtgctgta cttcactggg gaggccatag aagatgatga caattttgaa gaaggtgaag 1201 aaggagaaga ggaggaatta gaaggtgacg aggagggaga agacgaggat gatgcggaaa 1261 ttaaccccaa ggtgtaattt ttgtctgtta atcattcata cgtttctaga aggaacccag 1321 ccagccggcg gaatgcaagc agcagtagga agcggaggcg ggtgcctggc agaccggctg 1381 tcgggactcc aggcctgtgg gcggggcctc ggtccttgcc gcagcacaat cccgtggaca 1441 gagcttactc catctaactc gttttcaagt gcatgatttt cactttcact tttccttttt 1501 ccttattatt ttgattaact tgtacagtgg caactgaaat gcatttcaga aataggaggt 1561 ttcgtccagc accctctgca gccttggtgc ctgtagctct ggacttccct gggcctttcc 1621 ctgtgggagg gccctgtaga cacatcaggg tggggtgggg gtcacttggc aaaaagggcc 1681 gaggtctggt gatgtggttc ccaggatctt ggaacctctc ccacccctcc tgcagttgga 1741 ctgaattcta ccctttcatc cgaagaaacc cacttgctgt ttccagccgc tgaatctgct 1801 gagtgtgcag cctgcatcac ctgctgtatg ccgatcatct cagaaagggc tgtgtagagt 1861 agggccctgt tctccttagg atgttgcttc ttgatttttt ttttttttta ggggtgggtc 1921 agggttgtga cacaccagcc caggtgagag ctgctgcggg tcacctcata tttatttatc 1981 ccttcttgcc tgtgaggact gcggcttttc gctgtggctc gtccttaacg tttctgaacc 2041 accttggtgc cctgagcagg aagatgtgcc acttcctagc aggcgcaagg cctgtgcgga 2101 agaaacgccg ctccctgcca ccagggctga agatgcgagc ccgtcctcat gacgcaggcg 2161 ccaccctgct gccggagccg ggcttcggca tcttctccac tgagggactg ggctgggaag 2221 tcctgcgttt cagtggagcg tatgagcgtc aagtcctgct ttctcagtag ccccattgcg 2281 gggccccacc attcatcctg tctgaaggtc ctgggtttgg tgtgaccgct tggcggctgg 2341 tgggtggggt tttcaagtgg gtgacggcgc tctccggcaa ccggggatgg ccgtgtccgc 2401 actgaccagg cctgtggaga gtgctcggcc taaccttaga acacaattgt aactgaaaac 2461 agtgttttca atttgtacag aatagttaga atattctaat aaagtggtga aacattgaaa 2521 aaaaaanaaa aaaa // LOCUS HSU77494 3344 bp mRNA PRI 21-AUG-1997 DEFINITION Homo sapiens RANBP8 mRNA, complete cds. ACCESSION U77494 NID g2337917 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3344) AUTHORS Gorlich,D., Dabrowski,M., Bischoff,F.R., Kutay,U., Bork,P., Hartmann,E., Prehn,S. and Izaurralde,E. TITLE A novel class of RanGTP binding proteins JOURNAL J. Cell Biol. 138 (1), 65-80 (1997) MEDLINE 97362061 REFERENCE 2 (bases 1 to 3344) AUTHORS Prehn,S., Gorlich,D. and Hartmann,E. TITLE Direct Submission JOURNAL Submitted (06-NOV-1996) Zellbiologie, Max-Delbruck-Centrum fur Molekulare Medizin, Robert-Rossle-Str. 10, Berlin 13125, Germany FEATURES Location/Qualifiers source 1..3344 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 171..3284 /codon_start=1 /product="RANBP8" /db_xref="PID:g2337918" /translation="MDLNRFIQALKGTIDPKLRIAAENELNQSYKIINFAPSLLRIIV SDHVEFPVRQAAAIYLKNMVTQYWPDREPPPGEAIFPFNIHENDRQRIRDNIVEGIIR SPDLVRVQLTMCLRAIIKHDFPGHWPGVVDKIDYYLQSQSSASWLGSLLCLYQLVKTY EYKKAEEREPLIIAMQIFLPRIQQQIVQLLPDSSYYSVLLQKQILKIFYALVQYALPL QLVNNQTMTTWMEIFRTIIDRTVPPETLHIDEDDRPELVWWKCKKWALHIVARLFERY GSAGNVTKEYFEFSEFFLKTYAVGIQQVLLKILDQYRQKEYVAPRVLQQAFNYLNQGV VHSITWKQMKPHIQNISEDVIFSVMCYKDEDEELWQEDPYEYIRMKFDIFEDYASPTT AAQTLLYTAAKKRKEVLPKMMAFCYQILTDPNFDPRKKDGALHVIGSLAEILLKKSLF KDQMELFLQNHVFPLLLSNLGYLRARSCWVLHAFSSLKFHNELNLRNAVELAKKSLIE DKEMPVKVEAALALQSLISNQIQAKEYMKPHVRPIMQELLHIVRETENDDVTNVIQKM ICEYSQEVASIAVDMTQHLAEIFGKVLQSDEYEEVEDKTVMAMGILHTIDTILTVVED HKEITQQLENICLRIIDLVLQKHVIEFYEEILSLAYSLTCHSISPQMWQLLGILYEVF QQDCFEYFTDMMPLLHNYVTIDTDTLLSNAKHLEILFTMCRKVLCGDAGEDAECHAAK LLEVIILQCKGRGIDQCIPLFVQLVLERLTRGVKTSELRTMCLQVAIAALYYNPDLLL HTLERIQLPHNPGPITVQFINQWMNDTDCFLGHHDRKMCIIGLSILLELQNRPPAVDA VVGQIVPSILFLFLGLKQVCATRQLVNREDRSKAEKADMEENEEISSDEEETNVTAQA MQSNNGRGEDEEEEDDDWDEEVLEETALEGFSTPLDLDNSVDEYQFFTQALITVQSRD AAWYQLLMAPLSEDQRTALQEVYTLAEHRRTVAEAKKKIEQQGGFTFENKGVLSAFNF GTVPSNN" BASE COUNT 1024 a 648 c 787 g 885 t ORIGIN 1 ggcacgaggc agagtggcgc agcaagtggc cgcaggtggc gacggtggcg gggggtgggg 61 tgtgaggtaa tccaggggtc gcggaagagg aggctgagag ggtcaaaaga aaactaaagc 121 tgcagtccgg cctactgttc cgggggccgc gggagccccc acccggggag atggacctca 181 accggttcat ccaggcgctg aagggcacca tcgacccgaa gttgcggatt gcagccgaga 241 acgagctcaa ccagtcctac aagattatca attttgcccc cagtttactt cggattatag 301 tctctgacca tgtggaattc ccagtacgac aggcagctgc catttacctg aagaacatgg 361 tgacacaata ctggccagat cgagaacctc caccaggaga agcaatattt ccattcaaca 421 ttcacgaaaa cgatcgacag cgaatacgtg ataacattgt ggaaggaata attcggtctc 481 cagatttagt gagagtccaa ttaacaatgt gtctccgtgc catcataaaa catgattttc 541 ctggtcactg gccaggagtg gtcgacaaga tagactatta cttgcaatca cagagcagtg 601 caagctggct tggcagttta ttatgcctgt atcaactggt gaagacatat gaatataaga 661 aagcagaaga gagagaacct cttataatag caatgcagat attcctgcct cgtattcagc 721 aacaaattgt tcagctcctt cctgattcct cctattattc tgtattactg cagaaacaaa 781 ttctgaaaat cttttatgca cttgttcagt atgcattgcc tcttcagcta gtgaataacc 841 aaaccatgac aacatggatg gagatcttcc gaactattat cgacaggacc gttcctcctg 901 agactctgca cattgatgag gatgatagac cagaactggt atggtggaag tgtaagaagt 961 gggcactgca tattgtagct cggctctttg aacgatatgg aagcgcagga aatgtcacaa 1021 aagaatactt tgaattttct gaattctttt tgaaaaccta tgcagtgggc attcagcagg 1081 tgctactaaa aattttagat caatatagac agaaagaata tgtagctccc cgtgttctcc 1141 agcaagcatt caactatctc aaccaagggg tggttcattc tataacctgg aagcagatga 1201 agccacacat acagaatatc tctgaagatg tgattttttc tgtgatgtgt tataaagatg 1261 aggatgaaga gctgtggcaa gaagatccat atgagtatat aaggatgaaa tttgatattt 1321 ttgaagatta tgcttctccc accacagcag cccagactct cttatatact gctgcaaaga 1381 aaagaaaaga ggtgttgcca aaaatgatgg cattctgtta tcaaatcctg acagacccga 1441 actttgaccc taggaagaaa gatggagccc tgcatgtgat tggttcccta gctgagattt 1501 tactgaagaa gagtttattc aaggaccaaa tggagctgtt tctacaaaat catgtatttc 1561 cattattatt gtctaacctg ggatatcttc gagctagatc ttgctgggta cttcatgcat 1621 ttagttcttt gaagttccat aatgagctca atctaagaaa tgccgttgaa ttagcgaaga 1681 agagcctgat tgaagataaa gagatgcctg tcaaagttga agctgccctt gctcttcagt 1741 ctttaatttc taaccagata caagctaagg aatatatgaa gccacatgtg aggcctatta 1801 tgcaggaact gttgcacatt gttagagaga cagaaaatga tgatgttact aatgtcatcc 1861 agaagatgat atgtgaatac agtcaagagg tagcctcaat tgctgttgat atgacccaac 1921 acttggctga gatatttggc aaagttcttc aaagtgatga atatgaagaa gttgaagaca 1981 aaacagtaat ggctatggga attttacata ccattgatac tatcttaaca gttgtagaag 2041 atcataaaga gattacccag cagttagaga atatctgtct acggatcatt gatcttgttc 2101 tgcagaaaca tgtaattgaa ttctatgaag aaattctttc cctggcatac agtttaacct 2161 gccacagtat ttcccctcaa atgtggcagc ttctaggtat actatatgaa gtgtttcagc 2221 aggattgctt tgaatacttt acagacatga tgcctctcct gcataattat gtgacaatag 2281 atacagatac cttactatca aatgcaaaac atttagaaat tctttttaca atgtgtagga 2341 aggtactatg tggagatgca ggagaagatg cagagtgtca tgcagctaaa cttctggaag 2401 tcatcattct tcagtgcaaa ggaaggggaa ttgatcagtg cattccactc ttcgttcaac 2461 ttgttttgga gagattaact cgaggggtca aaactagtga gcttcgtact atgtgtcttc 2521 aggttgcaat tgctgccttg tactacaacc ctgatttgct gctacatact ttagaacgaa 2581 ttcagttgcc tcacaaccct ggacctatca ctgtacagtt tataaatcaa tggatgaatg 2641 atacagattg ttttctgggg catcatgacc ggaagatgtg tataatagga ctgagtatcc 2701 ttttggaatt gcaaaatcga cctcctgcag tagatgctgt ggtgggacag attgttccct 2761 caattctttt ccttttcctt ggcctaaagc aggtctgtgc tactagacaa ctggtaaacc 2821 gggaagatcg ttcaaaagca gagaaagctg atatggaaga aaatgaggag atttcaagtg 2881 atgaagagga gacaaatgta actgctcaag caatgcagtc aaataatgga agaggtgaag 2941 atgaggagga ggaagatgat gactgggatg aagaagtatt ggaagaaacc gcgcttgagg 3001 ggttcagtac tccacttgac cttgacaata gtgtggatga atatcagttt tttacacaag 3061 ctctgataac tgtgcagagt cgagatgcag cctggtacca gctgctgatg gcaccactca 3121 gcgaggatca gaggacagca ctgcaggagg tgtacacact ggcagagcac cgacggacgg 3181 tggcagaggc aaagaagaag attgaacaac agggaggctt cacctttgaa aacaaaggag 3241 tcctctccgc atttaatttt gggactgtgc ccagcaacaa ctgaaggaaa gaacatcagc 3301 tgaccaaatg tcatcgctgc attttatttc acaagaggag tgtg // LOCUS HSU77594 770 bp mRNA PRI 25-FEB-1997 DEFINITION Human tazarotene-induced gene 2 (TIG2) mRNA, complete cds. ACCESSION U77594 NID g1848263 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 770) AUTHORS Nagpat,S., Patel,S., Jacobs,H., DiSepio,D., Ghosn,C., Malhotra,M., Teng,M., Duvic,M. and Chandraratna,R.A.S. TITLE Tazarotene-induced gene 2 (TIG2), a novel retinoid responsive gene in skin JOURNAL Unpublished REFERENCE 2 (bases 1 to 770) AUTHORS Nagpal,S., Patel,S. and Chandraratna,R.A.S. TITLE Direct Submission JOURNAL Submitted (06-NOV-1996) Biochemistry, Allergan Inc., 2525 Dupont Dr., Irvine, CA 92713, USA FEATURES Location/Qualifiers source 1..770 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="retinoid (tazarotene) treated and untreated skin graft cultures of foreskin keratinocytes and dermal fibroblasts" gene 97..588 /gene="TIG2" CDS 97..588 /gene="TIG2" /codon_start=1 /product="tazarotene-induced gene 2" /db_xref="PID:g1848264" /translation="MRRLLIPLALWLGAVGVGVAELTEAQRRGLQVALEEFHKHPPVQ WAFQETSVESAVDTPFPAGIFVRLEFKLQQTSCRKRDWKKPECKVRPNGRKRKCLACI KLGSEDKVLGRLVHCPIETQVLREAEEHQETQCLRVQRAGEDPHSFYFPGQFAFSKAL PRS" BASE COUNT 207 a 216 c 238 g 109 t ORIGIN 1 accggtccgg aattcccggg tcgacccacg cgtccggcgg gacggtcagg ggagacctcc 61 aggcgcaggg aaggacggcc agggtgacac ggaagcatgc gacggctgct gatccctctg 121 gccctgtggc tgggtgcggt gggcgtgggc gtcgccgagc tcacggaagc ccagcgccgg 181 ggcctgcagg tggccctgga ggaatttcac aagcacccgc ccgtgcagtg ggccttccag 241 gagaccagtg tggagagcgc cgtggacacg cccttcccag ctggaatatt tgtgaggctg 301 gaatttaagc tgcagcagac aagctgccgg aagagggact ggaagaaacc cgagtgcaaa 361 gtcaggccca atgggaggaa acggaaatgc ctggcctgca tcaaactggg ctctgaggac 421 aaagttctgg gccggttggt ccactgcccc atagagaccc aagttctgcg ggaggctgag 481 gagcaccagg agacccagtg cctcagggtg cagcgggctg gtgaggaccc ccacagcttc 541 tacttccctg gacagttcgc cttctccaag gccctgcccc gcagctaagc cagcactgag 601 ctgcgtggtg cctccaggac cgctgcgggt ggtaaccagt ggaagacccc agcccccagg 661 gagaggaacc cgttctatcc ccagccatga taataaagct gctctcccaa aaaaaaaaaa 721 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa // LOCUS HSU77604 589 bp mRNA PRI 29-SEP-1997 DEFINITION Homo sapiens microsomal glutathione S-transferase 2 (MGST2) mRNA, complete cds. ACCESSION U77604 NID g1747520 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 589) AUTHORS Jakobsson,P.J., Mancini,J.A. and Ford-Hutchinson,A.W. TITLE Identification and characterization of a novel human microsomal glutathione S-transferase with leukotriene C4 synthase activity and significant sequence identity to 5-lipoxygenase-activating protein and leukotriene C4 synthase JOURNAL J. Biol. Chem. 271 (36), 22203-22210 (1996) MEDLINE 96355624 REFERENCE 2 (bases 1 to 589) AUTHORS Jakobsson,P.-J., Mancini,J.A. and Ford-Hutchinson,A.W. TITLE Direct Submission JOURNAL Submitted (06-NOV-1996) Biochemistry and Molecular Biology, Merck Frosst, PO Box 1005, Pointe-Claire/Dorval, PQ H9R 4P8, Canada FEATURES Location/Qualifiers source 1..589 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="4" /map="4q28-31" gene 1..589 /gene="MGST2" CDS 12..455 /gene="MGST2" /codon_start=1 /product="microsomal glutathione S-transferase 2" /db_xref="PID:g1747521" /translation="MAGNSILLAAVSILSACQQSYFALQVGKARLKYKVTPPAVTGSP EFERVFRAQQNCVEFYPIFIITLWMAGWYFNQVFATCLGLVYIYGRHLYFWGYSEAAK KRITGFRLSLGILALLTLLGALGIANSFLDEYLDLNIAKKLRRQF" BASE COUNT 163 a 115 c 129 g 182 t ORIGIN 1 ggcacgagaa gatggccggg aactcgatcc tgctggctgc tgtctctatt ctctcggcct 61 gtcagcaaag ttattttgct ttgcaagttg gaaaggcaag attaaaatac aaagttacgc 121 ccccagcagt cactgggtca ccagagtttg agagagtatt tcgggcacaa caaaactgtg 181 tggagtttta tcctatattc ataattacat tgtggatggc tgggtggtat ttcaaccaag 241 tttttgctac ttgtctgggt ctggtgtaca tatatggccg tcacctatac ttctggggat 301 attcagaagc tgctaaaaaa cggatcaccg gtttccgact gagtctgggg attttggcct 361 tgttgaccct cctaggtgcc ctgggaattg caaacagctt tctggatgaa tatctggacc 421 tcaatattgc caagaaactg aggcggcaat tctaactttt tctcttccct ttaatgcttg 481 cagaagctgt tcccaccatg aaggtaatat ggtatcattt gttaaataaa aataaagtct 541 ttattctgtt tttcttgaaa aaaaaaaaaa aaaaaagatc tttaattaa // LOCUS HSU77629 1140 bp DNA PRI 26-NOV-1997 DEFINITION Homo sapiens Achaete-Scute homologue 2 (ASCL2) gene, complete cds. ACCESSION U77629 NID g2642464 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1140) AUTHORS Alders,M., Hodges,M., Hadjantonakis,A.-K., Postmus,J., van Wijk,I., Bliek,J., de Meulemeester,M., Westerveld,A., Guillemot,F., Oudejans,C., Little,P. and Mannens,M. TITLE The human Achaete-Scute homologue 2 (ASCL2,HASH2) maps to chromosome 11p15.5, close to IGF2 and is expressed in extravillus trophoblasts JOURNAL Hum. Mol. Genet. 6 (6), 859-867 (1997) MEDLINE 97318794 REFERENCE 2 (bases 1 to 1140) AUTHORS Alders,M. TITLE Direct Submission JOURNAL Submitted (06-NOV-1996) Human Genetics, University of Amsterdam, Meibergdreef 15, Amsterdam 1105AZ, The Netherlands FEATURES Location/Qualifiers source 1..1140 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11p15.5" mRNA 209..1140 /gene="ASCL2" gene 209..1140 /note="HASH2" /gene="ASCL2" CDS 545..1126 /gene="ASCL2" /codon_start=1 /product="Achaete-Scute homologue 2" /db_xref="PID:g2642465" /translation="MDGGTLPRSAPPAPPVPVGCAARRRPASPELLRCSRRRRPATAE TGGGAAAVARRNERERNRVKLVNLGFQALRQHVPHGGASKKLSKVETLRSAVEYIRAL QRLLAEHDAVRNALAGGLRPQAVRPSAPRGPPGTTPVAASPSRASSSPGRGGSSEPGS PRSAYSSDDSGCEGALSPAERELLDFSSWLGGY" BASE COUNT 161 a 394 c 431 g 154 t ORIGIN 1 gtaccttgct ttgggggcgc actaagtacc tgccgggagc agggggcgca ccgggaactc 61 gcagatttcg ccagttgggc gcactgggga tctgtggact gcgtccgggg gatgggctag 121 ggggacatgc gcacgctttg ggccttacag aatgtgatcg cgcgaggggg agggcgaagc 181 gtggcgggag ggcgaggcga aggaaggagg gcgtgagaaa ggcgacggcg gcggcgcgga 241 ggagggttat ctatacattt aaaaaccagc cgcctgcgcc gcgcctgcgg agacctggga 301 gagtccggcc gcacgcgcgg gacacgagcg tcccacgctc cctggcgcgt acggcctgcc 361 accactaggc ctcctatccc cgggctccag acgacctagg acgcgtgccc tggggagttg 421 cctggcggcg ccgtgccaga agcccccttg gggcgccaca gttttccccg tcgcctccgg 481 ttcctctgcc tgcaccttcc tgcggcgcgc cgggacctgg agcgggcggg tggatgcagg 541 cgcgatggac ggcggcacac tgcccaggtc cgcgccccct gcgccccccg tccctgtcgg 601 ctgcgctgcc cggcggagac ccgcgtcccc ggaactgttg cgctgcagcc ggcggcggcg 661 accggccacc gcagagaccg gaggcggcgc agcggccgta gcgcggcgca atgagcgcga 721 gcgcaaccgc gtgaagctgg tgaacttggg cttccaggcg ctgcggcagc acgtgccgca 781 cggcggcgcc agcaagaagc tgagcaaggt ggagacgctg cgctcagccg tggagtacat 841 ccgcgcgctg cagcgcctgc tggccgagca cgacgccgtg cgcaacgcgc tggcgggagg 901 gctgaggccg caggccgtgc ggccgtctgc gccccgcggg ccgccaggga ccaccccggt 961 cgccgcctcg ccctcccgcg cttcttcgtc cccgggccgc gggggcagct cggagcccgg 1021 ctccccgcgt tccgcctact cgtcggacga cagcggctgc gaaggcgcgc tgagtcctgc 1081 ggagcgcgag ctactcgact tctccagctg gttagggggc tactgagcgc cctcgaccta // LOCUS HSU77643 2000 bp mRNA PRI 02-MAY-1997 DEFINITION Human K12 protein precursor mRNA, complete cds. ACCESSION U77643 NID g2062390 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2000) AUTHORS Slentz-Kesler,K.A. and Kaufman,R.E. TITLE Molecular cloning of K12, a novel human gene on chromosome 17q25 expressed in breast cancer cells and peripheral blood leukocytes JOURNAL Unpublished REFERENCE 2 (bases 1 to 2000) AUTHORS Slentz-Kesler,K.A. and Kaufman,R.E. TITLE Direct Submission JOURNAL Submitted (07-NOV-1996) Biochemistry, Duke University, Box 3250 DUMC, Durham, NC 27710, USA FEATURES Location/Qualifiers source 1..2000 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17" /map="17q25.2-25.3" /cell_type="K562 erythroleukemic" sig_peptide 119..202 CDS 119..865 /note="type 1a transmembrane protein" /codon_start=1 /product="K12 protein precursor" /db_xref="PID:g2062391" /translation="MQTCPLAFPGHVSQALGTLLFLAASLSAQNEGWDSPICTEGVVS VSWGENTVMSCNISNAFSHVNIKLRAHGQESAIFNEVAPGYFSRDGWQLQVQGGVAQL VIKGARDSHAGLYMWHLVGHQRNNRQVTLEVSGAEPQSAPDTGFWPVPAVVTAVFILL VALVMFAWYRCRCSQQRREKKFFLLEPQMKVAALRAGAQQGLSRASAELWTPDSEPTP RPLALVFKPSPLGALELLSPQPLFPYAADP" mat_peptide 203..862 /product="K12 protein" misc_feature 551..623 /note="encodes transmembrane domain" misc_feature 752..862 /note="encodes proline-rich region" BASE COUNT 367 a 659 c 602 g 372 t ORIGIN 1 attttcctgg ggctccgggg cgcggagaag ctgcatccca gaggagcgcg tccaggagcg 61 gacccgggag tgtttcaaga gccagtgaca aggaccaggg gcccaagtcc caccagccat 121 gcagacctgc cccctggcat tccctggcca cgtttcccag gcccttggga ccctcctgtt 181 tttggctgcc tccttgagtg ctcagaatga aggctgggac agccccatct gcacagaggg 241 ggtagtctct gtgtcttggg gcgagaacac cgtcatgtcc tgcaacatct ccaacgcctt 301 ctcccatgtc aacatcaagc tgcgtgccca cgggcaggag agcgccatct tcaatgaggt 361 ggctccaggc tacttctccc gggacggctg gcagctccag gttcagggag gcgtggcaca 421 gctggtgatc aaaggcgccc gggactccca tgctgggctg tacatgtggc acctcgtggg 481 acaccagaga aataacagac aagtcacgct ggaggtttca ggtgcagaac cccagtccgc 541 ccctgacact gggttctggc ctgtgccagc ggtggtcact gctgtcttca tcctcttggt 601 cgctctggtc atgttcgcct ggtacaggtg ccgctgttcc cagcaacgcc gggagaagaa 661 gttcttcctc ctagaacccc agatgaaggt cgcagccctc agagcgggag cccagcaggg 721 cctgagcaga gcctccgctg aactgtggac cccagactcc gagcccaccc caaggccgct 781 ggcactggtg ttcaaaccct caccacttgg agccctggag ctgctgtccc cccaaccctt 841 gtttccatat gccgcagacc catagccgcc tgcaaggcag agaggacaca ggagagccag 901 ccctgagtgc cgaccttggg tggcggggcc tgggtctctc gtcccacccg gagggcacag 961 acaccggctt gcttggcagg ctgggcctct gtgtcaccca ctcctgggtg cgtgcagacc 1021 cttcccctcc accccccagg tcttccaagc tctgcttcct cagtttccaa aatggaacca 1081 cctcacctcc gcagcacccg acttaccagg acgcatgccc ctccctctgc cctcatcaaa 1141 cccacagacc cggactccct ttctgccacc ccaggctggt ccggccccag gtgtggggtc 1201 cgctctctcc actcccaggg ctccgcgccc aagtgagggg gcccctgccg gagcctcaga 1261 cacactggag ttcagggctg ggggggcctt ggcacatacc tgtcccttgg ctatgagcag 1321 gctttggggg cccttccgcg gcagccccgg gggccgaggt agggtctggg ggcttagagg 1381 ctgggatggc tcctggcccc accgccaggg ggcaagcgca ggccgggctg ggaggcggcg 1441 gcggcggctc gggctggggg gtcaggtgga cgctgcctcc ggggctggtc gcgcatccct 1501 cagtccctcg gccacccggg ggtcgctccc tcgtgcccac cgcacctgcc gagcctcttt 1561 ggacccagat ctgttcatgc ttttgtcttc gtcactgcgg cggggccctt tgatgtcttc 1621 atctgtatgg ggtggaaaaa tcaccgggaa tcccccttca gttctttgaa aaagttccat 1681 gactcgaata tctgaaatga agaaaacaaa ccgactcaca aacctccaag tagctccaaa 1741 tgcaattttt aaaatggaaa acaaaaatct gaaagaaacg tctttagtgg ctttaagccc 1801 caaaacgtcc ctaaggcgtc ctcgagatga agacgggggg gagcccccag ccaggtggag 1861 accccgcagg acgcggcggc gcccggtgac cgaggcctcg cacagccggc cgccctgagg 1921 gtcgggccgg agccagggtc caagaggggc gcgtttgtgt ctcgggttaa aataaggttc 1981 cgtccgcgtg ctgggtcaga // LOCUS HSU77664 991 bp mRNA PRI 14-MAR-1997 DEFINITION Human RNaseP protein p38 (RPP38) mRNA, complete cds. ACCESSION U77664 NID g1885378 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 991) AUTHORS Eder,P.S., Kekuda,R., Stolc,V. and Altman,S. TITLE Characterization of two scleroderma autoimmune antigens that copurify with human ribonuclease P JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (4), 1101-1106 (1997) MEDLINE 97188428 REFERENCE 2 (bases 1 to 991) AUTHORS Eder,P.S. TITLE Direct Submission JOURNAL Submitted (08-NOV-1996) Biology, Yale University, 266 Whitney Avenue, New Haven, CT 06511, USA FEATURES Location/Qualifiers source 1..991 /organism="Homo sapiens" /db_xref="taxon:9606" gene 37..888 /gene="RPP38" CDS 37..888 /gene="RPP38" /codon_start=1 /product="RNaseP protein P38" /db_xref="PID:g1885379" /translation="MAAAPQAPGRGSLRKTRPLVVKTSLNNPYIIRWSALESEDMHFI LQTLEDRLKAIGLQKIEDKKKKNKTPFLKKESREKCSIAVDISDNLKEKKTDAKQQVS GWTPAHVRKQLVIGVNEVTRALERRELLLVLVCKSVKPAMITSHLIQLSLSRSVPACQ VPRLSERIAPVIGLKCVLALAFKKNTTDFVDEVRAIIPRVPSLSVPWLQDRIEDSGEN LETEPLESQDRELLDTSFEDLSKPKRKLADGRQASVTLQPLKIKKLIPNPNKIRKPPK SKKATPK" BASE COUNT 332 a 211 c 225 g 223 t ORIGIN 1 aacggattcg ccatcgtgag ttccaggatt ttcaaaatgg ctgcagctcc tcaagcaccg 61 gggcggggat ctctccgtaa gacgagacct ctggttgtga agacgtcgtt gaacaaccca 121 tacatcatcc gctggagcgc tctggagagc gaggatatgc acttcatcct acagacgctt 181 gaggacaggc ttaaagctat tggacttcag aagattgaag ataagaagaa aaagaacaaa 241 acaccttttc tgaaaaaaga aagcagagag aaatgcagca ttgctgttga tattagtgat 301 aatctgaagg agaagaaaac agatgctaag cagcaagtgt cagggtggac gcctgcacac 361 gtcaggaagc agcttgtcat tggcgttaac gaagttacca gagccctgga aaggagggaa 421 ctgctgttag ttctggtgtg taaatcagtc aagcctgcca tgatcacctc acacttgatt 481 cagttaagcc taagcagaag tgtccctgcc tgtcaggtcc cccggctcag tgagagaatc 541 gcccccgtca ttggcttaaa atgtgttcta gccttggcgt tcaaaaagaa caccactgac 601 tttgtggacg aagtaagagc catcatcccc agagtcccca gtttaagtgt accatggctt 661 caagacagaa ttgaagattc tggggaaaat ttagagactg aacctctgga aagccaagac 721 agagagcttt tggacacttc atttgaagat ctgtcaaaac ctaagagaaa gcttgctgac 781 ggtcggcagg cttctgtaac attacaaccc cttaaaataa agaaactgat tccaaaccct 841 aataagataa ggaaaccacc caaaagtaaa aaagctactc caaagtaatc ttgcataaac 901 ttgtcatgtc atacagtttg tgaaaggaca ccttgtaaag aagccttgaa actaataaaa 961 tgagttatac ttacataaaa aaaaaaaaaa a // LOCUS HSU77665 942 bp mRNA PRI 14-MAR-1997 DEFINITION Human RNaseP protein p30 (RPP30) mRNA, complete cds. ACCESSION U77665 NID g1885380 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 942) AUTHORS Eder,P.S., Kekuda,R., Stolc,V. and Altman,S. TITLE Characterization of two scleroderma autoimmune antigens that copurify with human ribonuclease P JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (4), 1101-1106 (1997) MEDLINE 97188428 REFERENCE 2 (bases 1 to 942) AUTHORS Eder,P.S. TITLE Direct Submission JOURNAL Submitted (08-NOV-1996) Biology, Yale University, 266 Whitney Avenue, New Haven, CT 06511, USA FEATURES Location/Qualifiers source 1..942 /organism="Homo sapiens" /db_xref="taxon:9606" gene 28..834 /gene="RPP30" CDS 28..834 /gene="RPP30" /codon_start=1 /product="RNaseP protein P30" /db_xref="PID:g1885381" /translation="MAVFADLDLRAGSDLKALRGLVETAAHLGYSVVAINHIVDFKEK KQEIEKPVAVSELFTTLPIVQGKSRPIKILTRLTIIVSDPSHCNVLRATSSRARLYDV VAVFPKTEKLFHIACTHLDVDLVCITVTEKLPFYFKRPPINVAIDRGLAFELVYSPAI KDSTMRRYTISSALNLMQICKGKNVIISSAAERPLEIRGPYDVANLGLLFGLSESDAK AAVSTNCRAALLHGETRKTAFGIISTVKKPRPSEGDEDCLPASKKAKCEG" BASE COUNT 292 a 197 c 200 g 253 t ORIGIN 1 gaattcggca cgaggtggga cttcagcatg gcggtgtttg cagatttgga cctgcgagcg 61 ggttctgacc tgaaggctct gcgcggactt gtggagacag ccgctcacct tggctattca 121 gttgttgcta tcaatcatat cgttgacttt aaggaaaaga aacaggaaat tgaaaaacca 181 gtagctgttt ctgaactctt cacaactttg ccaattgtac agggaaaatc aagaccaatt 241 aaaattttaa ctagattaac aattattgtc tcggatccat ctcactgcaa tgttttgaga 301 gcaacttctt caagggcccg gctctatgat gttgttgcag tttttccaaa gacagaaaag 361 ctttttcata ttgcttgcac acatttagat gtggatttag tctgcataac tgtaacagag 421 aaactaccat tttacttcaa aagacctcct attaatgtgg cgattgaccg aggcctggct 481 tttgaacttg tctatagccc tgctatcaaa gactccacaa tgagaaggta tacaatttcc 541 agtgccctca atttgatgca aatctgcaaa ggaaagaatg taattatatc tagtgctgca 601 gaaaggcctt tagaaataag agggccatat gacgtggcaa atctaggctt gctgtttggg 661 ctctctgaaa gtgacgccaa ggctgcggtg tccaccaact gccgagcagc gcttctccat 721 ggagaaacta gaaaaactgc ttttggaatt atctctacag tgaagaaacc tcggccatca 781 gaaggagatg aagattgtct tccagcttcc aagaaagcca agtgtgaggg ctgaaaagaa 841 tgccccagtc tctgtcagca ctcccttctt cccttttata gttcatcagc cacaacaaaa 901 ataaaacctt tgtgtgaaaa aaaaaaaaaa aaaaaaaaaa aa // LOCUS HSU77718 2617 bp mRNA PRI 01-MAR-1997 DEFINITION Human desmosome associated protein pinin mRNA, complete cds. ACCESSION U77718 NID g1684846 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2617) AUTHORS Ouyang,P. and Sugrue,S.P. TITLE Characterization of pinin, a novel protein associated with the desmosome-intermediate filament complex JOURNAL J. Cell Biol. 135 (4), 1027-1042 (1996) MEDLINE 97081102 REFERENCE 2 (bases 1 to 2617) AUTHORS Sugrue,S.P. and Ouyang,P. TITLE Direct Submission JOURNAL Submitted (08-NOV-1996) Anatomy and Cell Biology, University of Florida, 1600 SW Archer Road, Gainesville, FL 32610, USA FEATURES Location/Qualifiers source 1..2617 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" CDS 31..2262 /note="desmosome associated protein; phosphoprotein" /codon_start=1 /product="pinin" /db_xref="PID:g1684847" /translation="MAVAVRTLQEQLEKAKESLKNVDENIRKLTGRDPNDVRPIQARL LALSGPGGGRGRGSLLLRRGFSDSGGPPAKQRDLEGAVSRLGGERRTRRESRQESDPE DDDVKKPALQSSVVATSKERTRRDLIQDQNMDEKGKQRNRRIFGLLMGTLQKFKQEST VATERQNRRQEIEQKLEVQAEEERKQVENERRELFEERRAKQTELRLLEQKVELAQLQ EEWNEHNAKIIKYIRTKTKPHLFYIPGRMCPATQKLIEESQRKMNALFDGRRIEFAEQ INKMEARPRRQSMKEKEHQVVRNEEHKAEQEEGKVAQREEELVETGNQHNDVEIEEAG EEEEKEIGIVHSDAEKEQEEEEQKQEMEVKMEEETEVRESEKQQDSQPEEVMDVLEMV ENVKHVIADQEVMETNRVESVEPSENEASKELEPEMEFEIEPDKECKSLSPGKENVSA LDMEKESDEKEEKESEPQPEPVAQPQAQSQPQLQLQSQSEPQPQLQPEPAQPQLQSQP QLQLQSQCHAVLQSHPPSQPEDLSLAVLQPTPQVTQEHGHFLPERKDFPVESVKLTEV PVDPVLTVHPESESETNTRSRSRGRTRNRTTKSRSRSSSSSSSSSSSTSSSSGSSSSS GSSSSRSSSSSSSSTSGSSSRDSSSSTSSSSESRSRSRGRGHNRDRKHRRSVDRKRRD TSGLERSHKSSKGGSSRDTKGSKDKNSRSDRKRSISESSRSGKRSSRSERDRKSDRKD KRR" BASE COUNT 911 a 471 c 686 g 549 t ORIGIN 1 aagcagtctc aagcctgccg cagggagaag atggcggtcg ccgtgagaac tttgcaggaa 61 cagctggaaa aggccaaaga gagtcttaag aacgtggatg agaacattcg caagctcacc 121 gggcgggacc cgaatgatgt gaggcccatc caagccagat tgctggccct ttctggtcct 181 ggtggaggta gaggacgtgg tagtttattg ctgaggcgtg gattctcaga tagtggagga 241 cccccagcca aacagagaga ccttgaaggg gcagtcagta ggctgggcgg ggagcgtcgg 301 accagaagag aatcacgcca ggaaagcgac ccggaggatg atgatgttaa aaagccagca 361 ttgcagtctt cagttgtagc tacctccaaa gagcgcacac gtagagacct tatccaggat 421 caaaatatgg atgaaaaggg aaagcaaagg aaccgacgaa tatttggctt attgatgggc 481 actcttcaga aatttaaaca agaatccact gttgctactg aaaggcaaaa caggcgccag 541 gaaattgaac aaaaacttga agtgcaggcg gaagaagaaa gaaagcaggt tgaaaatgaa 601 aggagagaac tgtttgaaga gaggcgtgct aaacagacag aactgcggct tttagaacag 661 aaggttgagc ttgcgcagct gcaagaagaa tggaatgaac ataatgccaa aataattaaa 721 tatataagaa ctaagacaaa gccccatttg ttttatattc ccggaagaat gtgtccagct 781 acccaaaaac taatagaaga gtcacagaga aaaatgaacg ctttatttga tggtagacgc 841 atcgaatttg cagaacaaat aaataaaatg gaggctaggc ctagaagaca atcaatgaag 901 gaaaaagagc atcaggtggt gcgtaatgaa gaacacaagg cggaacaaga agagggtaag 961 gtggctcagc gagaggaaga gttggtggag acaggtaacc agcacaatga tgttgaaata 1021 gaggaagcag gagaggaaga ggaaaaggaa atagggattg ttcatagtga tgcagagaaa 1081 gagcaggagg aggaggaaca aaaacaggaa atggaggtta agatggagga ggaaactgag 1141 gtaagggaaa gtgagaagca gcaggatagt cagcctgaag aagttatgga tgtgctagag 1201 atggttgaga atgtcaaaca tgtaattgct gaccaggagg taatggaaac taatcgagtt 1261 gaaagtgtag aaccttcaga aaatgaagct agcaaagaat tggaaccaga aatggaattt 1321 gaaattgagc cagataaaga atgtaaatcc ctttctcctg ggaaagagaa tgtcagtgct 1381 ttagacatgg aaaaggagtc tgacgaaaaa gaagaaaaag aatctgagcc ccaacctgag 1441 cctgtggctc aacctcaggc tcagtctcag ccccagctcc agcttcaatc ccagtccgag 1501 ccacagcctc agctacaacc tgagcctgct caacctcagc ttcagtctca gccccagctt 1561 cagcttcaat cccagtgcca tgcagtactc cagtcccatc ctccctctca acctgaggat 1621 ttgtcattag ctgttttaca gccaacaccc caagttactc aggagcatgg gcattttcta 1681 cctgagagga aggattttcc tgtagagtct gtaaaactga ctgaggtacc agtagaccca 1741 gtcttgacag tacatccaga gagcgagagc gaaaccaata ctaggagcag gagtagaggt 1801 cgaactagaa atagaaccac caagagtaga agtcgaagca gtagcagtag cagttctagt 1861 agcagttcaa ccagtagcag cagtggaagt agttccagca gtggaagtag tagcagtcgc 1921 agtagttcca gtagcagctc cagtacaagt ggcagcagca gcagagatag cagcagcagc 1981 actagtagta gtagtgagag tagaagtcgg agtaggggcc ggggacataa tagagataga 2041 aagcacagaa ggagcgtgga tcggaagaga agggatactt caggactaga aagaagtcac 2101 aaatcttcaa aaggtggtag tagtagagat acaaaaggat caaaggataa gaattcccgg 2161 tccgacagaa agaggtctat atcagagagt agtcgatcag gcaaaagatc ttcaagaagt 2221 gaaagagacc gaaaatcaga caggaaagac aaaaggcgtt aatggaagaa gccaggcttt 2281 cttagccatt ctttgcagca gaagatttct tgatgaaaaa ggattacctt tccttgtaaa 2341 gaggatgctg ccttaagaat tgcatgttgt aaaaaatctt tttggaagat acagactgtt 2401 tgtttaccag acattcttgt actttttgca taattttgta agagttattt atcaaaatta 2461 tgtgaggttc caaaatatgt aaaaatgata ataataaaaa aagattaaca tcccttgtca 2521 tcttttttaa atatcctata ctcttcagta agaatctgta tattttaata ggcaaatctt 2581 taagtctgtt cccttcaatt ctgtatcata cattgct // LOCUS HSU77735 2088 bp mRNA PRI 24-DEC-1996 DEFINITION Human pim-2 protooncogene homolog pim-2h mRNA, complete cds. ACCESSION U77735 NID g1750275 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2088) AUTHORS Baytel,D. and Don,J. TITLE Human homolog of the pim-2 protooncogene JOURNAL Unpublished REFERENCE 2 (bases 1 to 2088) AUTHORS Baytel,D. and Don,J. TITLE Direct Submission JOURNAL Submitted (07-NOV-1996) Life Sciences, Bar-Ilan University, Ramat Gan 52900, Israel FEATURES Location/Qualifiers source 1..2088 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="72m" /tissue_type="testis" misc_feature 100..102 /note="similarity at the amino acid level suggests that, as in the murine pim-2, translation might initiate from a ctg codon, which is within a Kozak sequence as an alternative initiation codon" CDS 186..1190 /note="similar to murine pim-2 product encoded by GenBank Accession Number L41495; serine/threonine protein kinase" /codon_start=1 /product="pim-2 protooncogene homolog pim-2h" /db_xref="PID:g1750276" /translation="MLTKPLQGPPAPPGTPTPPPGGKDREAFEAEYRLGPLLGKGGFG TVFAGHRLTDRLQVAIKVIPRNRVLGWSPLSDSVTCPLEVALLWKVGAGGGHPGVIRL LDWFETQEGFMLVLERPLPAQDLFDYITEKGPLGEGPSRCFFGQVVAAIQHCHSRGVV HRDIKDENILIDLRRGCAKLIDFGSGALLHDEPYTDFDGTRVYSPPEWISRHQYHALP ATVWSLGILLYDMVCGDIPFERDQEILEAELHFPAHVSPDCCALIRRCLAPKPSSRPS LEEILLDPWMQTPAEDVTPQPLQRRPCPFGLVLATLSLAWPGLAPNGQKSHPMAMSQG " BASE COUNT 436 a 594 c 528 g 530 t ORIGIN 1 gaattcggca cgagcgcgcg gcgaatctca acgctgcgcc gtctgcgggc gcttccgggc 61 caccagtttc tctgctttcc accctggcgc cccccagccc tggctcccca gctgcgctgc 121 cccgggcgtc cacgccctgc gggcttagcg ggttcagtgg gctcaatctg cgcagcgcca 181 cctccatgtt gaccaagcct ctacaggggc ctcccgcgcc ccccgggacc cccacgccgc 241 cgccaggagg caaggatcgg gaagcgttcg aggccgagta tcgactcggc cccctcctgg 301 gtaagggggg ctttggcacc gtcttcgcag gacaccgcct cacagatcga ctccaggtgg 361 ccatcaaagt gattccccgg aatcgtgtgc tgggctggtc ccccttgtca gactcagtca 421 catgcccact cgaagtcgca ctgctatgga aagtgggtgc aggtggtggg caccctggcg 481 tgatccgcct gcttgactgg tttgagacac aggaaggctt catgctggtc ctcgagcggc 541 ctttgcccgc ccaggatctc tttgactata tcacagagaa gggcccactg ggtgaaggcc 601 caagccgctg cttctttggc caagtagtgg cagccatcca gcactgccat tcccgtggag 661 ttgtccatcg tgacatcaag gatgagaaca tcctgataga cctacgccgt ggctgtgcca 721 aactcattga ttttggttct ggtgccctgc ttcatgatga accctacact gactttgatg 781 ggacaagggt gtacagcccc ccagagtgga tctctcgaca ccagtaccat gcactcccgg 841 ccactgtctg gtcactgggc atcctcctct atgacatggt gtgtggggac attccctttg 901 agagggacca ggagattctg gaagctgagc tccacttccc agcccatgtc tccccagact 961 gctgtgccct aatccgccgg tgcctggccc ccaaaccttc ttcccgaccc tcactggaag 1021 agatcctgct ggacccctgg atgcaaacac cagccgagga tgttacccct caacccctcc 1081 aaaggaggcc ctgccccttt ggcctggtcc ttgctaccct aagcctggcc tggcctggcc 1141 tggcccccaa tggtcagaag agccatccca tggccatgtc acagggatag atggacattt 1201 gttgacttgg ttttacaggt cattaccagt cattaaagtc cagtattact aaggtaaggg 1261 attgaggatc aggggttaga agacataaac caagtttgcc cagttccctt cccaatccta 1321 caaaggagcc ttcctcccag aacctgtggt ccctgatttt ggagggggaa cttcttgctt 1381 ctcattttgc taaggaagtt tattttggtg aagttgttcc cattttgagc cccgggactc 1441 ttattttgat gatgtgtcac cccacattgg cacctcctac taccaccaca caaacttagt 1501 tcatatgctt ttacttgggc aagggtgctt tccttccaat accccagtag cttttatttt 1561 agtaaaggga ccctttcccc tagcctaggg tcccatattg ggtcaagctg cttacctgcc 1621 tcagcccagg attttttatt ttgggggagg taatgccctg ttgttacccc aaggcttctt 1681 tttttttttt tttttttttg ggtgagggga ccctactttg ttatcccaag tgctcttatt 1741 ctggtgagaa gaaccttaat tccataattt gggaaggaat ggaagatgga caccaccgga 1801 caccaccaga caataggatg ggatggatgg ttttttgggg gatgggctag gggaaataag 1861 gcttgctgtt tgttttcctg gggcgctccc tccaattttg cagatttttg caacctcctc 1921 ctgagccggg attgtccaat tactaaaatg taaataatca cgtattgtgg ggaggggagt 1981 tccaagtgtg ccctcctttt ttttcctgcc tggattattt aaaaagccat gtgtggaaac 2041 ccactattta ataaaagtaa tagaatcaga aaaaaaaaaa aaaaaaaa // LOCUS HSU77845 2007 bp mRNA PRI 25-APR-1997 DEFINITION Human hTRIP (hTRIP) mRNA, complete cds. ACCESSION U77845 NID g2039303 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2007) AUTHORS Lee,S.Y., Lee,S.Y. and Choi,Y. TITLE TRAF-interacting protein (TRIP): a novel component of the tumor necrosis factor receptor (TNFR)- and CD30-TRAF signaling complexes that inhibits TRAF2-mediated NF-kappaB activation JOURNAL J. Exp. Med. 185 (7), 1275-1285 (1997) MEDLINE 97258620 REFERENCE 2 (bases 1 to 2007) AUTHORS Lee,S.Y., Lee,S.Y. and Choi,Y. TITLE Direct Submission JOURNAL Submitted (10-NOV-1996) Immunology, The Rockefeller University, 1230 York Avenue Box 295, New York, NY 10021, USA FEATURES Location/Qualifiers source 1..2007 /organism="Homo sapiens" /db_xref="taxon:9606" gene 103..1512 /gene="hTRIP" CDS 103..1512 /gene="hTRIP" /codon_start=1 /product="hTRIP" /db_xref="PID:g2039304" /translation="MPIRALCTICSDFFDHSRDVAAIHCGHTFHLQCLIQSFETAPSR TCPQCRIQVGKRTIINKLFFDLAQEEENVLDREFLKNELDNVRAQLSQKDKEKRDSQV IIDTLRDTLEERNATVVSLQQALGKAEMLCSTLKKQMKYLEQQQDETKQAQEEAGRLR SKMKTMEQIELLLQSQLPEVEEMIRDMGVGQSAVEQLAVYCVSLKKEYENLKEARKAS GEVADKLRKDLFSSRSKLQTVYSELDQAKLELKSAQKDLQSADKEIMSLKKKLTMLQE TLNLPPVASETVDRLVLESPAPVEVNLKLRRPSFRDDIDLNATFDVDTPPARPSSSQH GYYEKLCLEKSHSPIQDVPKKICKGPRKESQLSLGGQSCAGEPDEELVGAFPIFVRNA ILGQKQPKRPRSESSCSKDVVRTGFDGLGGRTKFIQPTDTVMIRPLPVKPKTKVKQRV RVKTVPSLFQAKLDTFLWS" BASE COUNT 517 a 518 c 558 g 414 t ORIGIN 1 gtgcggtgga gcgaaatttg aagcaagcgg aggcggggcg ctctacgaag ccggacctgt 61 agcagtttct ttggctgcct gggccccttg agtccagcca tcatgcctat ccgtgctctg 121 tgcactatct gctccgactt cttcgatcac tcccgcgacg tggccgccat ccactgcggc 181 cacaccttcc acttgcagtg cctaattcag tcctttgaga cagcaccaag tcggacctgc 241 ccacagtgcc gaatccaggt tggcaaaaga accattatca ataagctctt ctttgatctt 301 gcccaggagg aggagaatgt cttggatcga gaattcttaa agaatgaact ggacaatgtc 361 agagcccagc tttcccagaa agacaaggag aaacgagaca gccaggtcat catcgacact 421 ctgcgggata cgctggaaga acgcaatgct actgtggtat ctctgcagca ggccttgggc 481 aaggccgaga tgctgtgctc cacactgaaa aagcagatga agtacttaga gcagcagcag 541 gatgagacca aacaagcaca agaggaggcg ggccggctca ggagcaagat gaagaccatg 601 gagcagattg agcttctact ccagagccag ctccctgagg tggaggagat gatccgagac 661 atgggtgtgg gacagtcagc ggtggaacag ctggctgtgt actgtgtgtc tctcaagaaa 721 gagtacgaga atctaaaaga ggcacggaag gcctcagggg aggtggctga caagctgagg 781 aaggatttgt tttcctccag aagcaagttg cagacagtct actctgaatt ggatcaggcc 841 aagttagaac tgaagtcagc ccagaaggac ttacagagtg ctgacaagga aatcatgagc 901 ctgaaaaaga agctaacgat gctgcaggaa accttgaacc tgccaccagt ggccagtgag 961 actgtcgacc gcctggtttt agagagccca gcccctgtgg aggtgaatct gaagctccgc 1021 cggccatcct tccgtgatga tattgatctc aatgctacct ttgatgtgga tactccccca 1081 gcccggccct ccagctccca gcatggttac tacgaaaaac tttgcctaga gaagtcacac 1141 tccccaattc aggatgtccc caagaagata tgcaaaggcc ccaggaagga gtcccagctc 1201 tcactgggtg gccagagctg tgcaggagag ccagatgagg aactggttgg tgccttccct 1261 atttttgtcc ggaatgccat cctaggccag aaacagccca aaaggcccag gtcagagtcc 1321 tcttgcagca aagatgtggt aaggacaggc ttcgatgggc tcggtggccg gacaaaattc 1381 atccagccta ctgacacagt catgatccgc ccattgcctg ttaagcccaa gaccaaggtt 1441 aagcagaggg tgagggtgaa gaccgtgcct tctctcttcc aggccaagct ggacaccttc 1501 ctgtggtcgt gagaacagtg agtctgacca atggccagac acatgcctgc aacttgtagg 1561 tcaaggactg tccaggcagg gtttgtggac agagccctac tttcgggacc agcctgaggt 1621 gtaagggcag acaaacaggt gagggtgagt gtgacaccca gagactgctc ttcctgccct 1681 caccctgccc cactcctacg actgggagct gacatgacca gcccactgat cctgtcagca 1741 ggtcctgctc tgttgccagg ctcttgttta tagccatgat cagatgtggt cagactcttt 1801 ctgggcctgg agaccacggt cacttgttga ctgtctctgt ggaccagagt gcttgaggca 1861 tctcaggcag cctcagccca agcttctacc tgcctttgac ttgcttctag catagcctgg 1921 gccaagcagg gtggggaatg gaggatagac atgggatgta tggagaggat ggaagatttt 1981 cccgaaaaaa aaaaaaaaaa aaaaaaa // LOCUS HSU77942 1614 bp mRNA PRI 11-NOV-1997 DEFINITION Human syntaxin 7 mRNA, complete cds. ACCESSION U77942 NID g2337919 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1614) AUTHORS Wang,H., Frelin,L. and Pevsner,J. TITLE Human syntaxin 7: a Pep12p/Vps6p homologue implicated in vesicle trafficking to lysosomes JOURNAL Gene 199 (1-2), 39-48 (1997) MEDLINE 98019069 REFERENCE 2 (bases 1 to 1614) AUTHORS Wang,H., Frelin,L. and Pevsner,J. TITLE Direct Submission JOURNAL Submitted (08-NOV-1996) Department of Neuroscience, Kennedy Krieger Institute, 707 N. Broadway, Baltimore, MD 21205, USA FEATURES Location/Qualifiers source 1..1614 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="brain" CDS 80..865 /note="putative prelysosomal or intracellular syntaxin; similar to Saccharomyces cerevisiae Pep12p Swiss-Prot Accession Number P32854 and to Arabidopsis thaliana syntaxin, encoded by Genbank Accession Number L41651; syntaxin homolog" /codon_start=1 /product="syntaxin 7" /db_xref="PID:g2337920" /translation="MSYTPGVGGDPTQLAQRISSNIQKITQCSVEIQRTLNQLGTPQD SPELRQQLQQKQQYTNQLAKETDKYIKEFGSLPTTPSEQRQRKIQKDRLVAEFTTSLT NFQKVQRQAAEREKEFVARVRASSRVSGSFPEDSSKERNLVSWESQTQPQVQVQDEEI TEDDLRLIHERESSIRQLEADIMDINEIFKDLGMMIHEQGDVIDSIEANVENAEVHVQ QANQQLSRAADYQRKSRKTLCIIILILVIGVAIISLIIWGLNH" BASE COUNT 495 a 288 c 344 g 487 t ORIGIN 1 gagggagccg tggaggtcca ggtgactgct tagaaaactg cacagcatct gatgaaatta 61 gcgaataaga acatcaacca tgtcttacac tccaggagtt ggtggtgacc ccacccagtt 121 ggcccagagg atctcttcta acatccagaa gatcacacag tgttctgtgg aaatacaaag 181 aactctgaat caacttggaa cacctcaaga ttcacctgaa ttgaggcaac agttgcaaca 241 gaagcagcag tatactaacc agcttgccaa agaaacagat aagtacatta aagagtttgg 301 atctctgccc accaccccca gtgaacagcg tcaaaggaaa atacagaagg atcgcttagt 361 ggcagagttc acaacatcac tgacaaactt ccagaaggtc cagaggcagg ctgctgagcg 421 agagaaagag tttgttgctc gagtaagagc cagttccaga gtgtctggca gttttcctga 481 ggacagctca aaagaaagga atcttgtatc ctgggaaagc caaactcaac ctcaagtgca 541 ggtgcaggat gaagaaatta cagaggatga cctccgtctt attcatgaga gagaatcttc 601 tatcaggcaa cttgaagctg atattatgga tattaatgaa atatttaaag atttgggaat 661 gatgattcat gaacaaggag atgtaataga tagcatagaa gccaatgtgg aaaatgcaga 721 ggtgcacgtt cagcaagcaa atcagcagct gtcaagggca gcagattatc agcgcaaatc 781 cagaaaaacc ctgtgcatca tcattcttat ccttgtcatt ggagttgcga ttatcagtct 841 catcatatgg ggattgaacc actgaagtta taaaggagca cactgtcgca ctacattgtc 901 taaattatgt aggaagattc ctgtaatcat gtttttttaa ttattatttt aaagctattg 961 tataaaggat ggttcccata ctttgttatt tttattgggg gggttgggcg ggttcctttg 1021 gattaaatct gatattttct aatactgaaa gattttctaa atgtcactgc tgacataact 1081 cccttggtct tcaatttaat agttgttaag ttttgggcca cattgcatat gcctttcatt 1141 tataatttat ttaccctgct tgacttagtt tggggaattc ggaaatttaa ggtgtgtgta 1201 ttctgttggg atctccctgc cacgtgaaca caccaagatg tgtgttactt caagttaaaa 1261 ctccccaaaa tttaattttt gatttgcttc caccagggga aaatattctc caataatgta 1321 aaataattaa ggtccaatac atgggttgta tttttctggt tcacaacagc acaaagtgtc 1381 tttcattttt ttgttggatt tcctttaaga tcttttttac cctgaagtcg gtgaacactt 1441 ttctagttaa tttgatactc tttctgtgta tataataagc ttttgctgta gattgcctag 1501 taaaattact aaggataggt tgtttttaca tatggtctat ttaagtctga tgtttacggg 1561 ggagagtgta gttactaaaa atgtttaaca taatttggaa gaagagtatg aaca // LOCUS HSU77968 1904 bp mRNA PRI 13-FEB-1997 DEFINITION Human neuronal PAS1 (NPAS1) mRNA, complete cds. ACCESSION U77968 NID g1840055 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1904) AUTHORS Zhou,Y.D., Barnard,M., Tian,H., Li,X., Ring,H.Z., Francke,U., Shelton,J., Richardson,J., Russell,D.W. and McKnight,S.L. TITLE Molecular characterization of two mammalian bHLH-PAS domain proteins selectively expressed in the central nervous system JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (2), 713-718 (1997) MEDLINE 97165088 REFERENCE 2 (bases 1 to 1904) AUTHORS Zhou,Y.-D., Barnard,M., Tian,H., Li,X., Ring,H.Z., Francke,U., Shelton,J., Richardson,J., Russell,D.W. and McKnight,S.L. TITLE Direct Submission JOURNAL Submitted (11-NOV-1996) Biochemistry, UT Southwestern, 5323 Harry Hines Blvd., Dallas, TX 75235, USA FEATURES Location/Qualifiers source 1..1904 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /dev_stage="fetus" /clone_lib="Stratagene, Catalog No. 936206" gene 1..1904 /gene="NPAS1" CDS 25..1797 /gene="NPAS1" /note="member of the bHLH-PAS family; one of two transcripts, see GenBank Accession Number U77970" /codon_start=1 /product="neuronal PAS1" /db_xref="PID:g1840056" /translation="MAAPYPGSGGGSEVKCVGGRGASVPWDFLPGLMVKAPSGPCLQA QRKEKSRNAARSRRGKENLEFFELAKLLPLPGAISIQLDKASIVRLSVTYLRLRRFAA LGAPPWGLRAAGPPAGLAPGRRGPAALVSEVFEQHLGGHILQSLDGFVFALNQEGKFL YISETVSIYLGLSQVEMTGSSVFDYIHPGDHSEVLEQLGLRTTTPGPPTPSSVSSSSS SSSSLADTPEIEASLTKVPPSSLVQERSFFVRMKSTLTKRGLHVKASGYKVIHVTGRL RAHALGLVALGHTLPPAPLAELPLHGHMIVFRLSLGLTILACESRVSDHMDLGPSELV GRSCYQFVHGQDATRIRQSHVDLLDKGQVMTGYYRWLQRAGGFVWLQSVATVAGSGKS PGEHHVLWVSHVLSQAEGGQTPLDAFQLPASVACEEASSPGPEPTEPEPPTEGKQAVP AENEAPQTQGKRIKVEPGPRETKGSEDSGDEDPSSHPATPRPEFTSVIRAGVLKQDPV RPWGLAPPGDPPPTLLHAGFLPPVVRGLCTPGTIRYGPAELGLVYPHLQRLGPGPALP EAFYPPLGLPYPGPAGTRLPRKGD" BASE COUNT 304 a 685 c 614 g 301 t ORIGIN 1 cccgcctgag cgagcccccc ggagatggcg gccccctatc ccggcagtgg cggcggaagc 61 gaggtcaaat gcgtgggagg ccgcggcgcc agcgtcccct gggactttct acccgggctg 121 atggtcaagg cgccgtccgg accgtgcctg caggcgcagc gcaaggagaa gtcccggaac 181 gcggcgcgct cgcggcgcgg gaaggagaac ctggagttct tcgagctggc caagcttctc 241 ccgctgcccg gcgccatctc catccagctg gacaaggctt ccatcgtgcg cctcagcgtc 301 acctacctcc gcctgcgccg gttcgccgcg ctgggggcgc cgccctgggg gctgagagcc 361 gcggggccgc cagctggcct cgccccaggc cgccgcggcc ccgcagcgct ggtctccgaa 421 gtcttcgagc agcacctggg aggtcacatc ttgcagtccc tggatggctt tgtgttcgcc 481 ttgaaccagg aaggaaaatt cctctacatc tcagagacag tctccatcta tctgggtctc 541 tcacaggtgg agatgacggg cagcagcgtc ttcgactaca ttcaccctgg ggaccactca 601 gaggtgctgg agcaactggg gctgcggacg acgacgcccg gccccccaac cccgtcctcc 661 gtctcctctt cctcctcctc ttcctcttcg cttgcagata cccccgagat cgaggccagc 721 ctcaccaagg tgcccccctc ctccctggtc caggagcgct ccttctttgt ccgcatgaaa 781 tccacgctca ccaagagggg gctgcacgtc aaggcctcag ggtacaaggt catccacgtg 841 actgggcgcc ttcgggccca cgccctgggc cttgtggccc tcgggcacac gttgcccccg 901 gcccccctgg ctgagctgcc actccatgga cacatgatcg tcttccgtct cagcctgggt 961 ctcaccatcc ttgcttgtga gagcagagtc agcgaccaca tggacctggg gccctcagag 1021 ctggtgggcc gcagctgcta ccagtttgtc cacggacaag acgccacgag gatccgccag 1081 agccacgtgg acttgctgga caagggtcag gtgatgactg gttactaccg ttggctgcag 1141 cgtgccgggg gcttcgtgtg gctgcagtct gtggccacag tggctgggag cgggaagagc 1201 cccggggagc accatgtgct ttgggtcagc cacgtgctca gccaagccga gggtggccaa 1261 actcctttgg atgccttcca gcttccagcc agcgtggcct gtgaggaggc atccagcccg 1321 gggccagagc ccacagagcc ggagcctccg acggaaggga agcaggctgt cccagcggag 1381 aacgaggccc cccagaccca gggcaaacgc atcaaagtgg agcccggccc gagggaaacc 1441 aaaggttccg aggacagtgg cgacgaggat ccctccagcc acccggccac accgaggccc 1501 gagttcacct ctgtcatccg ggcaggggtc ctgaagcagg atccggtgcg gccatggggc 1561 ctggcgcctc ccggggaccc cccgcccacc ctcctgcacg cgggcttcct gccgccggtg 1621 gtgcggggcc tgtgcacacc cggcaccatc cgctacggcc ccgcggagct gggcctggtg 1681 tacccgcacc tgcagaggct gggtccgggc cccgcgctcc cggaggcctt ttacccgccc 1741 ctgggcctgc cctacccggg gcccgcgggc accaggctgc cgcggaaggg ggactgagga 1801 ctggcagagc tgccggcgcc ggaccctgcg acaaccgggg tcccccagga cagtaggccc 1861 ggctctgccc gtagccctga gaattaaacg ccggctctcc ctgc // LOCUS HSU77970 2880 bp mRNA PRI 13-FEB-1997 DEFINITION Human neuronal PAS2 (NPAS2) mRNA, complete cds. ACCESSION U77970 NID g1840059 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2880) AUTHORS Zhou,Y.D., Barnard,M., Tian,H., Li,X., Ring,H.Z., Francke,U., Shelton,J., Richardson,J., Russell,D.W. and McKnight,S.L. TITLE Molecular characterization of two mammalian bHLH-PAS domain proteins selectively expressed in the central nervous system JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (2), 713-718 (1997) MEDLINE 97165088 REFERENCE 2 (bases 1 to 2880) AUTHORS Zhou,Y.-D., Barnard,M., Tian,H., Li,X., Ring,H.Z., Francke,U., Shelton,J., Richardson,J., Russell,D.W. and McKnight,S.L. TITLE Direct Submission JOURNAL Submitted (11-NOV-1996) Biochemistry, UT Southwestern, 5323 Harry Hines Blvd., Dallas, TX 75235, USA FEATURES Location/Qualifiers source 1..2880 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" gene 287..2761 /gene="NPAS2" CDS 287..2761 /gene="NPAS2" /note="a member of the bHLH-PAS family; longest predicted reading frame; two of two transcripts, see GenBank Accession Number U77968" /codon_start=1 /product="neuronal PAS2" /db_xref="PID:g1840060" /translation="MDEDEKDRAKRASRNKSEKKRRDQFNVLIKELSSMLPGNTRKMD KTTVLEKVIGFLQKHNEVSAQTEICDIQQDWKPSFLSNEEFTQLMLEALDGFIIAVTT DGSIIYVSDSITPLLGHLPSDVMDQNLLNFLPEQEHSEVYKILSSHMLVTDSPSPEYL KSDSDLEFYCHLLRGSLNPKEFPTYEYIKFVGNFRSYNNVPSPSCNGFDNTLSRPCRV PLGKEVCFIATVRLATPQFLKEMCIVDEPLEEFTSRHSLEWKFLFLDHRAPPIIGYLP FEVLGTSGYDYYHIDDLELLARCHQHLMQFGTGKSCCYRFLTKGQQWIWLQTHYYITY HQWNSKPEFIVCTHSVVSYADVRVERRQELALEDPPSEALHSSALKDKGSSLEPRQHF NALDVGASGLNTSHSPSASSRSSHKSSHTAMSEPTSTPTKLMAEASTPALPRSATLPQ ELPVPGLSQAATMPAPLPSPLSCDLTQQLLPQTVLQSTPAPMAQFSAQFSMFQTIKDQ LEQRTRILQANIRWQQEELHKIQEQLCLVQDSNVQMFLQQPAVSLSFSSTQRPEAQQQ LQQRSAAVTQPQLGAGPQLPGQISSAQVTSQHLLRESSVISTQGPKPMRSSQLMQSSG RSGSSLVSPFSSATAALPPSLNLTTPASTSQDASQCQPSPDFSHDRQLRLLLSQPIQP MMPGSCDARQPSEVSRTGRQVKYAQSQTVFQNPDAHPANSSSAPMPVLLMGQAVLHPS FPASQPSPLQPAQARQQPPQHYLQVQAPTSLHSEQQDSLLLSTYSQQPGTLGYPQPPP AQPQPLRPPRRVSSLSESSGLQQPPR" BASE COUNT 686 a 937 c 723 g 534 t ORIGIN 1 gtttgccgcg cgagcagccg gcctctcgca ggagccgagg gacccgcgcg gctgcggccc 61 aggagcggcg gccgcggagc ccggagaccc gcagccgcgg cggcggcggc ggcggcggca 121 gcagctagag cagcgcctcc cgccgccgcc cgggaggagc tcgccgcgcc cgctcgccgc 181 ctcgtctccc agcggcggcg ggaggcgcgt ctccccggcc cagtccgcgc ccggccccgc 241 gggaccgctc cggcccgctc cgaggaaaaa ctgcatagaa aatctaatgg atgaagatga 301 gaaagacaga gccaagagag cttctcgaaa caagtctgag aagaagcgtc gggaccagtt 361 caatgttctc atcaaagagc tcagttccat gctccctggc aacacgcgga aaatggacaa 421 aaccaccgtg ttggaaaagg tcatcggatt tttgcagaaa cacaatgaag tctcagcgca 481 aacggaaatc tgtgacattc agcaagactg gaagccttca ttcctcagta atgaagaatt 541 cacccagctg atgttggagg cattagatgg cttcattatc gcagtgacaa cagacggcag 601 catcatctat gtctctgaca gtatcacgcc tctccttggg catttaccgt cggatgtcat 661 ggatcagaat ttgttaaatt tcctcccaga acaagaacat tcagaagttt ataaaatcct 721 ttcttcccat atgcttgtga cggattcccc ctccccagaa tacttaaaat ctgacagcga 781 tttagagttt tattgccatc ttctcagagg cagcttgaac ccaaaggaat ttccaactta 841 tgaatacata aaatttgtag gaaattttcg ctcttacaac aatgtgccta gcccctcctg 901 taatggtttt gacaacaccc tttcaagacc ttgccgggta ccactaggaa aggaggtttg 961 cttcattgcc accgttcgtc tggcaacacc acaattctta aaggaaatgt gcatagttga 1021 cgaaccttta gaggaattca cttcaaggca tagcttggaa tggaaatttt tatttctgga 1081 tcacagagca cctccaatca taggatacct gccttttgaa gtgctgggaa cctcaggcta 1141 tgactactac cacattgatg acctggagct cctggccagg tgtcaccagc acctgatgca 1201 gtttggcaca gggaagtcgt gttgctaccg gtttctgacc aaaggtcagc agtggatctg 1261 gctgcagact cactactaca tcacctacca tcagtggaac tccaagcccg agttcatcgt 1321 gtgcacacac tcggtggtca gttacgcaga tgtccgggtg gaaaggaggc aggagctggc 1381 tctggaagac ccgccatccg aggccctcca ctcctcagca ctaaaggaca agggctcaag 1441 cctggaacct cggcagcact ttaacgcact cgacgtgggt gcctcgggcc ttaataccag 1501 tcattcgcca tcggcgtcct caagaagttc ccacaaatcc tcgcacacag ccatgtcaga 1561 acccacctcc actcccacca agctgatggc agaggccagc accccggctt tgccaagatc 1621 agccaccctg ccccaagagt tacctgtccc cgggctcagc caggcagcca ccatgccggc 1681 ccctctgcct tccccattgt cctgcgacct cacacagcag ctcctgcctc agaccgttct 1741 gcagagcacg cccgctccca tggcacagtt ttcggcacag ttcagcatgt tccagaccat 1801 caaagaccag ctagagcagc ggacgcggat cctgcaggcc aatatccggt ggcaacagga 1861 agagctccac aagatccagg agcagctctg cctggtccag gactccaacg tccagatgtt 1921 cctgcagcag ccagctgtat ccctgagctt cagcagcacc cagcgacctg aggctcagca 1981 gcagctacag caaaggtcag ctgcagtgac tcagccccag ctcggggcgg gcccccaact 2041 tccagggcag atctcctctg cccaggtcac aagccagcac ctgctcagag aatcaagtgt 2101 gatatcaacc caaggtccaa agccaatgag aagctcacag ctaatgcaga gcagcggccg 2161 ctctggaagc agcctagtgt ccccgttcag cagcgccaca gctgcgctcc cgccaagtct 2221 gaatctgacc acacctgctt ccacctccca ggatgccagc cagtgccagc ccagcccaga 2281 cttcagccat gatcggcagc tcaggctgtt gctgagccag cccatccagc ccatgatgcc 2341 cgggtcctgt gacgcaaggc agccctcgga agtcagcagg acgggacggc aagtcaagta 2401 cgcccagagc cagaccgtgt ttcaaaatcc agacgcacac cccgccaaca gcagcagcgc 2461 cccgatgccc gtcctgctga tggggcaggc ggtgctccac cccagcttcc ctgcctccca 2521 accatcgccc ctgcagcctg cacaggcccg gcagcagcca ccgcagcact acctgcaggt 2581 acaggcacca acctctttgc acagtgagca gcaggactcg ctacttctct ccacctactc 2641 acaacagcca gggaccctgg gctaccccca accaccccca gcacagcccc agcccctacg 2701 tcctccccga agggtcagca gtctgtctga gtcgtcaggc ctccagcagc cgccccgata 2761 atgccccggc actgaagtcg ggacacaatc agctttaacc aatggatgag gggggtggcc 2821 acaggagatg gggagaggag tctgaactaa acccctggct tttgtgcaca ctgcatacgt // LOCUS HSU78082 885 bp mRNA PRI 15-NOV-1997 DEFINITION Human RNA polymerase transcriptional regulation mediator (h-MED6) mRNA, complete cds. ACCESSION U78082 NID g2618737 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 885) AUTHORS Kim,Y.-J. TITLE Direct Submission JOURNAL Submitted (14-NOV-1996) Basic Research Center, Samsung Biomedical Research Institute, 50 Ilwon-dong Kangnam-ku, Seoul 135-230, Republic of Korea FEATURES Location/Qualifiers source 1..885 /organism="Homo sapiens" /db_xref="taxon:9606" gene 14..754 /gene="h-MEd6" CDS 51..524 /gene="h-MEd6" /note="h-Med6p" /codon_start=1 /product="RNA polymerase transcriptional regulation mediator" /db_xref="PID:g2618738" /translation="MAAVDIRDNLLGISWVDSSWIPILNSGSVLDYFSERSNPFYDRT CNNEVVKMQRLTLEHLNQMVGIEYILLHAQEPILFIIRKQQRQSPAQVIPLADYYIIA GVIYQAPDLGSVINSRVLTAVHGIQSAFDEAMSYCRYHPSKGYWWHFKDHEEQGK" BASE COUNT 237 a 173 c 210 g 265 t ORIGIN 1 ctcgaggcca agaattcggc acgagggaac ctgtaaacgc tctcggaatt atggcggcgg 61 tggatatccg agacaatctg ctgggaattt cttgggttga cagctcttgg atccctattt 121 tgaacagtgg tagtgtcctg gattactttt cagaaagaag taatcctttt tatgacagaa 181 catgtaataa tgaagtggtc aaaatgcaga ggctaacatt agaacacttg aatcagatgg 241 ttggaatcga gtacatcctt ttgcatgctc aagagcccat tcttttcatc attcggaagc 301 aacagcggca gtcccctgcc caagttatcc cactagctga ttactatatc attgctggag 361 tgatctatca ggcaccagac ttgggatcag ttataaactc tagagtgctt actgcagtgc 421 atggtattca gtcagctttt gatgaagcta tgtcatactg tcgatatcat ccttccaaag 481 ggtattggtg gcacttcaaa gatcatgaag agcaaggtaa gtagaacatc cataccctcc 541 taaaacactt tttgatcctc tgagaatgaa gctgttttct ttaggaaaat ggctgttgat 601 cttttctaag tgtgtttcac tttttcatgg gatgatggct ttgttgcagc tgagattcat 661 gtaactagag tggtaataat agtttcacat aggaacagat gcaagttcac tctgttagtt 721 aactggtagt ctttgttaag gtgattcaag gttttaaaat atttggggcc aggtgtggtg 781 gctcactcct gtaatcccgg cactttggaa tgccaaggca ggtggatcac ctgagcctag 841 gagttcaaga tcagcctggg caacatagtg aaacctggtc tctgc // LOCUS HSU78107 1195 bp mRNA PRI 09-SEP-1997 DEFINITION Homo sapiens gamma SNAP mRNA, complete cds. ACCESSION U78107 NID g1685287 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1195) AUTHORS Lemons,P.P., Chen,D., Bernstein,A.M., Bennett,M.K. and Whiteheart,S.W. TITLE Regulated secretion in platelets: identification of elements of the platelet exocytosis machinery JOURNAL Blood 90 (4), 1490-1500 (1997) MEDLINE 97413351 REFERENCE 2 (bases 1 to 1195) AUTHORS Chen,D. and Whiteheart,S.W. TITLE Direct Submission JOURNAL Submitted (14-NOV-1996) Biochemistry, University of Kentucky, 800 Rose Street, Lexington, KY 40536, USA FEATURES Location/Qualifiers source 1..1195 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="melanocyte and platelet" /clone="IMAGE Consortium clone 269139" CDS 74..1012 /note="soluble NSF attachment protein" /codon_start=1 /product="gamma SNAP" /db_xref="PID:g1685288" /translation="MAAQKINEGLEHLAKAEKYLKTGFLKWKPDYDSAASEYGKAAVA FKNAKQFEQAKDACLREAVAHENNRALFHAAKAYEQAGMMLKEMQKLPEAVQLIEKAS MMYLENGTPDTAAMALERAGKLIENVDPEKAVQLYQQTANVFENEERLRQAVELLGKA SRLLVRGRRFDEAALSIQKEKNIYKEIENYPTCYKKTIAQVLVHLHRNDYVAAERCVR ESYSIPGFNGSEDCAALEQLLEGYDQQDQDQVSDVCNSPLFKYMDNDYAKLGLSLVVP GGGIKKKSPATPQAKPDGVTATAADEEEDEYSGGLC" BASE COUNT 402 a 227 c 293 g 273 t ORIGIN 1 cacgaggcgg attcttggcg ccggagaaga ggcagggtca ccctctctcc acgtcagaga 61 cctgactgtg gagatggcgg ctcagaagat aaacgagggg ctggaacacc tcgccaaagc 121 agagaaatac ctgaaaactg gttttttaaa atggaagcca gattatgaca gtgccgcttc 181 tgaatatgga aaagcagctg ttgcttttaa aaatgccaaa cagtttgagc aagcaaaaga 241 tgcctgcctg agggaagctg ttgcccatga aaataatagg gctctttttc atgctgccaa 301 agcttatgag caagctggaa tgatgttgaa ggagatgcag aaactaccag aggccgttca 361 gctaattgag aaggccagca tgatgtatct agaaaacggc accccagaca cagcagccat 421 ggctttggag cgagctggaa agcttataga aaatgttgat ccagagaagg ctgtacagtt 481 atatcaacag acagctaatg tgtttgaaaa tgaagaacgc ttacgacagg cagttgaatt 541 actaggaaaa gcctccagac tactagtacg aggacgtagg tttgatgagg cggcactctc 601 tattcagaaa gaaaaaaata tttataagga aattgagaat tatccaactt gttataagaa 661 aacaattgct caagtcttag ttcatctaca cagaaatgac tatgtagctg cagaaagatg 721 tgtccgggag agctatagca tccctgggtt caatggcagt gaagactgtg ctgccctgga 781 acagcttctt gaaggttatg accagcaaga ccaagatcag gtgtcagatg tctgcaactc 841 accgcttttc aagtacatgg acaatgatta tgctaagctg ggcctgagtt tggtggttcc 901 aggaggggga atcaagaaga aatcacctgc aacaccacag gccaagcctg atggtgtcac 961 tgccacggct gctgatgaag aggaagatga atactcagga ggactatgct agtattttgc 1021 ttgctgaaaa gaaaagggaa acaaaggtaa aatcctgaca tgccatttca aggacttggg 1081 aatagattag ggatatccgt acttcattac agtcatgatt ttggatccta ataaagacta 1141 gtttttagtt accatcttcc caaatcaaaa aaaaaaaaaa aaaaaaaaaa aaaaa // LOCUS HSU78110 594 bp mRNA PRI 14-DEC-1996 DEFINITION Human prepro-neurturin mRNA, complete cds. ACCESSION U78110 NID g1731676 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 594) AUTHORS Kotzbauer,P.T., Lampe,P.A., Heuckeroth,R.O., Golden,J.P., Creedon,D.J., Johnson,E.M. Jr. and Milbrandt,J. TITLE Neurturin, a relative of glial-cell-line-derived neurotrophic factor JOURNAL Nature 384 (6608), 467-470 (1996) MEDLINE 97100947 REFERENCE 2 (bases 1 to 594) AUTHORS Kotzbauer,P.T., Lampe,P.A., Johnson Jr.,E.M. and Milbrandt,J. TITLE Direct Submission JOURNAL Submitted (13-NOV-1996) Pathology, Washington University School of Medicine, 660 S. Euclid Ave., Box 8118, St. Louis, MO 63110, USA FEATURES Location/Qualifiers source 1..594 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 1..57 CDS 1..594 /note="neurotrophic factor" /codon_start=1 /product="prepro-neurturin" /db_xref="PID:g1731677" /translation="MQRWKAAALASVLCSSVLSIWMCREGLLLSHRLGPALVPLHRLP RTLDARIARLAQYRALLQGAPDAMELRELTPWAGRPPGPRRRAGPRRRRARARLGARP CGLRELEVRVSELGLGYASDETVLFRYCAGACEAAARVYDLGLRRLRQRRRLRRERVR AQPCCRPTAYEDEVSFLDAHSRYHTVHELSARECACV" misc_feature 58..594 /product="pro-neurturin" mat_peptide 286..591 /note="supports the survival of sympathetic neurons in culture; mature peptide sequence is predicted from the N-terminal amino acid sequence of purified chinese hamster neurturin" /product="neurturin" BASE COUNT 61 a 214 c 231 g 88 t ORIGIN 1 atgcagcgct ggaaggcggc ggccttggcc tcagtgctct gcagctccgt gctgtccatc 61 tggatgtgtc gagagggcct gcttctcagc caccgcctcg gacctgcgct ggtccccctg 121 caccgcctgc ctcgaaccct ggacgcccgg attgcccgcc tggcccagta ccgtgcactc 181 ctgcaggggg ccccggatgc gatggagctg cgcgagctga cgccctgggc tgggcggccc 241 ccaggtccgc gccgtcgggc ggggccccgg cggcggcgcg cgcgtgcgcg gttgggggcg 301 cggccttgcg ggctgcgcga gctggaggtg cgcgtgagcg agctgggcct gggctacgcg 361 tccgacgaga cggtgctgtt ccgctactgc gcaggcgcct gcgaggctgc cgcgcgcgtc 421 tacgacctcg ggctgcgacg actgcgccag cggcggcgcc tgcggcggga gcgggtgcgc 481 gcgcagccct gctgccgccc gacggcctac gaggacgagg tgtccttcct ggacgcgcac 541 agccgctacc acacggtgca cgagctgtcg gcgcgcgagt gcgcctgcgt gtga // LOCUS HSU78180 3923 bp mRNA PRI 08-MAR-1997 DEFINITION Human sodium channel 2 (hBNaC2) mRNA, alternatively spliced, complete cds. ACCESSION U78180 NID g1871167 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3923) AUTHORS Garcia-Anoveros,J., Derfler,B., Neville-Golden,J., Hyman,B.T. and Corey,D.P. TITLE BNaC1 and BNaC2 constitute a new family of human neuronal sodium channels related to degenerins and epithelial sodium channels JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (4), 1459-1464 (1997) MEDLINE 97188490 REFERENCE 2 (bases 1 to 3923) AUTHORS Garcia-Anoveros,J., Neville-Golden,J. and Corey,D.P. TITLE Direct Submission JOURNAL Submitted (11-NOV-1996) Neurobiology, Massachusetts General Hospital and Harvard Medical School, 50 Blossom St., Boston, MA 02114, USA FEATURES Location/Qualifiers source 1..3923 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="12" /map="12q12" /tissue_type="brain" gene 230..1954 /gene="hBNaC2" CDS 230..1954 /gene="hBNaC2" /note="alternatively spliced form of sodium channel 2 mRNA, GenBank Accession Number U78181; sodium channel homolog; member of the DEG/ENaC superfamily" /codon_start=1 /product="sodium channel 2" /db_xref="PID:g1871168" /translation="MELKAEEEEVGGVQPVSIQAFASSSTLHGLAHIFSYERLSLKRA LWALCFLGSLAVLLCVCTERVQYYFHYHHVTKLDEVAASQLTFPAVTLCNLNEFRFSQ VSKNDLYHAGELLALLNNRYEIPDTQMADEKQLEILQDKANFRSFKPKPFNMREFYDR AGHDIRDMLLSCHFRGEVCSAEDFKVVFTRYGKCYTFNSGRDGRPRLKTMKDGTGNGL EIMLDIQQDEYLPVWGETDETSFEAGIKVQIHSQDEPPFIDQLGFGVAPGFQTFVACQ EQRLIYLPPPWGTCKAVTMDSDLDFFDSYSITACRIDCETRYLVENCNCRMVHMPGDA PYCTPEQYKECADPALDFLVEKDQEYCVCEMPCNLTRYGKELSMVKIPSKASAKYLAK KFNKSEQYIGENILVLDIFFEVLNYETIEQKKAYEIAGLLGELLMTPVPFSCHGHGVA PYHPKAGCSLLSHEGPPPQRPFPKPCCLGDIGGQMGLFIGASILTVLELFDYAYEVIK HKLCRRGKCQKEAKRSSADKGVALSLDDVKRHNPCESLRGHPAGMTYAANILPHHPAR GTFEDFTC" exon 1527..1669 /gene="hBNaC2" /note="alternatively spliced exon" BASE COUNT 839 a 1203 c 1049 g 827 t 5 others ORIGIN 1 gagccagcga gccagcgcgc gcgggcgggc ggacagatcg gagccgagcg gggccgggcg 61 gggcgctccc tgcagggctc tgcgcggcgt gccgcggcgg ccgcgggctc cggccccggg 121 ccatgagccc ctccgcgact cggcgctgag cccgccaccg gtccagcgcc ccaggacccg 181 ccgccggctg ccggcttgcc gaagccccct caggatcccc tcaacaagga tggaactgaa 241 ggccgaggag gaggaggtgg gtggcgtcca gccggtgagc atccaggcct tcgccagcag 301 ctccacactg cacggcctgg cccacatctt ctcctacgag cggctgtctc tgaagcgggc 361 actgtgggcc ctgtgcttcc tgggctcgct ggctgtgctg ctgtgtgtgt gcacggagcg 421 tgtgcagtac tacttccact accaccatgt caccaagctc gacgaggtgg ctgcctctca 481 gcttaccttc cctgctgtca cgctgtgcaa cctcaacgag ttccgcttta gccaagtctc 541 caagaatgac ctgtatcatg ctggggagct gctggccctg ctcaacaaca ggtatgagat 601 accagacaca cagatggcag atgaaaagca gctggagata ctgcaggaca aagccaactt 661 ccgcagcttc aaacccaaac ccttcaacat gcgtgagttc tacgaccgag ctgggcacga 721 cattcgagac atgctgctct cctgccactt ccggggggag gtctgcagcg ctgaagactt 781 caaggtggtc ttcacacgct atggaaagtg ctacacgttc aactcgggcc gagatgggcg 841 gccgcggctg aagaccatga aggatgggac gggcaatggg ctggaaatca tgctggacat 901 ccagcaggac gagtacctgc ctgtgtgggg ggagactgac gagacgtcct tcgaagcagg 961 catcaaagtg cagatccata gtcaggatga acctcctttc atcgaccagc tgggctttgg 1021 cgtggcccca ggcttccaga cctttgtggc ctgccaggag cagcggctca tctacctgcc 1081 cccaccctgg ggcacctgca aagctgttac catggactcg gatttggatt tcttcgactc 1141 ctacagcatc actgcctgcc gcatcgactg tgagacgcgc tacctggtgg agaactgcaa 1201 ctgccgcatg gtgcacatgc caggggatgc cccatactgt actccagagc agtacaagga 1261 gtgtgcagat cctgctctgg acttcctggt ggagaaggac caggagtact gcgtgtgtga 1321 aatgccttgc aacctgaccc gctatggcaa agagctgtcc atggtcaaga tccccagcaa 1381 agcctcagcc aagtacctgg ccaagaagtt caacaaatct gagcaataca taggggagaa 1441 catcctggtg ctggacattt tctttgaagt cctcaactat gagaccattg aacagaagaa 1501 ggcctatgag attgcagggc tcctgggtga gctgctgatg acacctgtcc ccttctcatg 1561 ccatgggcat ggcgtggctc cctatcatcc aaaagcaggg tgctcacttc tgtcccatga 1621 gggtcctcca ccccagaggc ccttccccaa accctgttgt cttggtgaca tcgggggcca 1681 gatggggctg ttcatcgggg ccagcatcct cacggtgctg gagctctttg actacgccta 1741 cgaggtcatt aagcacaagc tgtgccgacg aggaaaatgc cagaaggagg ccaaaaggag 1801 cagtgcggac aagggcgtgg ccctcagcct ggacgacgtc aaaagacaca acccgtgcga 1861 gagccttcgg ggccaccctg ccgggatgac atacgctgcc aacatcctac ctcaccatcc 1921 ggcccgaggc acgttcgagg actttacctg ctgagccccg caggccgctg aaccaaaggc 1981 ctagatgggg aggactagga gagcgrgggg gcccccagct gcctcctcac atctgccctg 2041 ggractcccc acactccggg gcagatcttt cctcttgtct gtggtaagga aggagtcttg 2101 accatagagt cctctctctg cctctatccc attcytttta catttaacaa aactaatcta 2161 aaaaagaact aaaaagggag aacggggcaa gggacctcag gctgcccctc tctcctccat 2221 gctgcctccc ctagctccca gcctgaattc tgtctatcta gctgtctgcc atctgagtgt 2281 ccatctacat tctgctgcca ccagtcacca aaggcccttc ccagtgaggg gtggaaggga 2341 tctctggggt ctggaatttg gccccaaacc agagaatgta ccttaagggg gagggctagt 2401 gtgggggagg gaggcttccc cagccttaag agaccctctc agcccagtga ctgtccccaa 2461 acccaagtct cctggcagga actaaaacct cagccccact ctctcacacc atgtggaatc 2521 tcgtgggggt cggggatccc cttaagaagt ggtaatgggg acaagatgcg gccctggtgc 2581 tgtaggctac atcctgatac ctataagttc acccccaccc cacagctgct ggagagaaat 2641 cccaagaggc agcccttcct caccatccca ttaaagacck ggctggttag cgtccagctc 2701 agggagaagg gcgctagtgc ctaacctcac tggtccctct cccggaggcc cttgtagagg 2761 gccacgtcca taaattttct tatggaactc tcccacatcc tcttccccaa cttcatttgc 2821 ttctctcaac aacctcatct gcattttcta tttctatatg atacagactc tatattgcta 2881 tatctctgta tatactttcc cagccctgtc tgtctccacc ccatcccctc ttgtctctga 2941 gaaccattct cccaccccaa gttccacctt ctatgtttct actccctccc tggtctctga 3001 atgccttygc ctgtataaag agttggactc tctcccctgg tgtctgtact gtgtacacac 3061 atccctctga gaagcacaag gagacgacac gcgcattgta acctttgcac tgtctcagtg 3121 gcgacaaagg aagctgtgaa tcacaagctc tgcctctttc tggcctcacc ctctccccca 3181 acccgggcac cctcggccct ccctgcagcc ttaacattct cttcccctgc tcctcctatc 3241 ccattgccct ctgcccagct gacagtggca tccccaggga aggggttgct gtagagatag 3301 cccccaccca ggggatggag gtctaccctg gacactaagc caagtgtgtc agagacagaa 3361 gggagctggg gattggcgac tcctgaagtt ggggcagtgg gatgctgaca ggcagaagct 3421 gaggtcctca gtcagtggcc tttcctcctt ctgggtgccc agcccccttt cctcacctga 3481 tacccaagcc caccactttt attttctggt gaggtgggtt tgggaggaaa gagaggccta 3541 gaggaggagt tgaaagctct gctgttgtct caccctatct taatgagaga caagtgaggt 3601 ggagggcctg ccccccctcc ctccaccaga cactccttcc aggcctgagc cccaacccct 3661 cttcaggcct tccttcccta gctgtgtctt ggtcttcaat cccagaacag gacctgtgag 3721 cagctgcatt ggcctggagc tggagagtaa ggctgtagga tctttggaat ctcttggttc 3781 ctaagagttt cctcagagat catacctccc cagagggaag caggaatgag gccaaaaagt 3841 gtgcattgga taggggaaca gcaggcaggg ctctgggtga cgcatgcctc tggtctaata 3901 aactgggttt caaccaaaaa aaa // LOCUS HSU78305 2973 bp mRNA PRI 26-JUN-1997 DEFINITION Homo sapiens protein phosphatase Wip1 mRNA, complete cds. ACCESSION U78305 NID g2218062 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2973) AUTHORS Fiscella,M., Zhang,H., Fan,S., Sakaguchi,K., Shen,S., Mercer,W.E., Vande Woude,G.F., O'Connor,P.M. and Appella,E. TITLE Wip1, a novel human protein phosphatase that is induced in response to ionizing radiation in a p53-dependent manner JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (12), 6048-6053 (1997) MEDLINE 97322321 REFERENCE 2 (bases 1 to 2973) AUTHORS Fiscella,M., Zhang,H., Fan,S., Sakaguchi,K., Van de Woude,G.F., O'Connor,P.M. and Appella,E. TITLE Direct Submission JOURNAL Submitted (14-NOV-1996) LCB, NCI/NIH, 9000 Rockville Pike, Bldg 37, Rm 1B03, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..2973 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 204..2021 /function="protein phosphatase" /codon_start=1 /product="Wip1" /db_xref="PID:g2218063" /translation="MAGLYSLGVSVFSDQGGRKYMEDVTQIVVEPEPTAEEKPSPRRS LSQPLPPRPSPAALPGGEVSGKGPAVAAREARDPLPDAGASPAPSRCCRRRSSVAFFA VCDGHGGREAAQFAREHLWGFIKKQKGFTSSEPAKVCAAIRKGFLACHLAMWKKLAEW PKTMTGLPSTSGTTASVVIIRGMKMYVAHVGDSGVVLGIQDDPKDDFVRAVEVTQDHK PELPKERERIEGLGGSVMNKSGVNRVVWKRPRLTHNGPVRRSTVIDQIPFLAVARALG DLWSYDFFSGEFVVSPEPDTSVHTLDPQKHKYIILGSDGLWNMIPPQDAISMCQDQEE KKYLMGEHGQSCAKMLVNRALGRWRQRMLRADNTSAIVICISPEVDNQGNFTNEDELY LNLTDSPSYNSQETCVMTPSPCSTPPVKSLEEDPWPRVNSKDHIPALVRSNAFSENFL EVSAEIARENVQGVVIPSKDPEPLEENCAKALTLRIHDSLNNSLPIGLVPTNSTNTVM DQKNLKMSTPGQMKAQEIERTPPTNFKRTLEESNSGPLMKKHRRNGLSRSSGAQPASL PTTSQRKNSVKLTMRRRLRGQKKIGNPLLHQHRKTVCVC" BASE COUNT 817 a 667 c 696 g 793 t ORIGIN 1 ctggctctgc tcgctccggc gctccggccc agctctcgcg gacaagtcca gacatcgcgc 61 gccccccctt ctccgggtcc gccccctccc ccttctcggc gtcgtcgaag ataaacaata 121 gttggccggc gagcgcctag tgtgtctccc gccgccggat tcggcgggct gcgtgggacc 181 ggcgggatcc cggccagccg gccatggcgg ggctgtactc gctgggagtg agcgtcttct 241 ccgaccaggg cgggaggaag tacatggagg acgttactca aatcgttgtg gagcccgaac 301 cgacggctga agaaaagccc tcgccgcggc ggtcgctgtc tcagccgttg cctccgcggc 361 cgtcgccggc cgcccttccc ggcggcgaag tctcggggaa aggcccagcg gtggcagccc 421 gagaggctcg cgaccctctc ccggacgccg gggcctcgcc ggcacctagc cgctgctgcc 481 gccgccgttc ctccgtggcc tttttcgccg tgtgcgacgg gcacggcggg cgggaggcgg 541 cacagtttgc ccgggagcac ttgtggggtt tcatcaagaa gcagaagggt ttcacctcgt 601 ccgagccggc taaggtttgc gctgccatcc gcaaaggctt tctcgcttgt caccttgcca 661 tgtggaagaa actggcggaa tggccaaaga ctatgacggg tcttcctagc acatcaggga 721 caactgccag tgtggtcatc attcggggca tgaagatgta tgtagctcac gtaggtgact 781 caggggtggt tcttggaatt caggatgacc cgaaggatga ctttgtcaga gctgtggagg 841 tgacacagga ccataagcca gaacttccca aggaaagaga acgaatcgaa ggacttggtg 901 ggagtgtaat gaacaagtct ggggtgaatc gtgtagtttg gaaacgacct cgactcactc 961 acaatggacc tgttagaagg agcacagtta ttgaccagat tccttttctg gcagtagcaa 1021 gagcacttgg tgatttgtgg agctatgatt tcttcagtgg tgaatttgtg gtgtcacctg 1081 aaccagacac aagtgtccac actcttgacc ctcagaagca caagtatatt atattgggga 1141 gtgatggact ttggaatatg attccaccac aagatgccat ctcaatgtgc caggaccaag 1201 aggagaaaaa atacctgatg ggtgagcatg gacaatcttg tgccaaaatg cttgtgaatc 1261 gagcattggg ccgctggagg cagcgtatgc tccgagcaga taacactagt gccatagtaa 1321 tctgcatctc tccagaagtg gacaatcagg gaaactttac caatgaagat gagttatacc 1381 tgaacctgac tgacagccct tcctataata gtcaagaaac ctgtgtgatg actccttccc 1441 catgttctac accaccagtc aagtcactgg aggaggatcc atggccaagg gtgaattcta 1501 aggaccatat acctgccctg gttcgtagca atgccttctc agagaatttt ttagaggttt 1561 cagctgagat agctcgagag aatgtccaag gtgtagtcat accctcaaaa gatccagaac 1621 cacttgaaga aaattgcgct aaagccctga ctttaaggat acatgattct ttgaataata 1681 gccttccaat tggccttgtg cctactaatt caacaaacac tgtcatggac caaaaaaatt 1741 tgaagatgtc aactcctggc caaatgaaag cccaagaaat tgaaagaacc cctccaacaa 1801 actttaaaag gacattagaa gagtccaatt ctggccccct gatgaagaag catagacgaa 1861 atggcttaag tcgaagtagt ggtgctcagc ctgcaagtct ccccacaacc tcacagcgaa 1921 agaactctgt taaactcacc atgcgacgca gacttagggg ccagaagaaa attggaaatc 1981 ctttacttca tcaacacagg aaaactgttt gtgtttgctg aaatgcatct gggaaatgag 2041 gtttttccaa acttaggata taagagggct ttttaaattt ggtgccgatg ttgaactttt 2101 tttaagggga gaaaattaaa agaaatatac agtttgactt tttggaattc agcagtttta 2161 tcctggcctt gtacttgctt gtattgtaaa tgtggatttt gtagatgtta gggtataagt 2221 tgctgtaaaa tttgtgtaaa tttgtatcca cacaaattca gtctctgaat acacagtatt 2281 cagagtctct gatacacagt aattgtgaca atagggctaa atgtttaaag aaatcaaaag 2341 aatctattag attttagaaa aacatttaaa ctttttaaaa tacttattaa aaaatttgta 2401 taagccactt gtcttgaaaa ctgtgcaact ttttaaagta aattattaag cagactggaa 2461 aagtgatgta ttttcatagt gacctgtgtt tcacttaatg tttcttagag ccaagtgtct 2521 tttaaacatt attttttatt tctgatttca taattcagaa ctaaattttt catagaagtg 2581 ttgagccatg ctacagttag tcttgtccca attaaaatac tatgcagtat ctcttacatc 2641 agtagcattt ttctaaaacc ttagtcatca gatatgctta ctaaatcttc agcatagaag 2701 gaagtgtgtt tgcctaaaac aatctaaaac aattcccttc tttttcatcc cagaccaatg 2761 gcattattag gtcttaaagt agttactccc ttctcgtgtt tgcttaaaat atgtgaagtt 2821 ttccttgcta tttcaataac agatggtgct gctaattccc aacatttctt aaattatttt 2881 atatcataca gttttcattg attatatggg tatatattca tctaataaat cagtgaactg 2941 ttcctcatgt tgctgaaaaa aaaaaaaaaa aaa // LOCUS HSU78310 2257 bp mRNA PRI 13-JUN-1997 DEFINITION Homo sapiens pescadillo mRNA, complete cds. ACCESSION U78310 NID g2194202 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2257) AUTHORS Allende,M.L., Amsterdam,A., Becker,T., Kawakami,K., Gaiano,N. and Hopkins,N. TITLE Insertional mutagenesis in zebrafish identifies two novel genes, pescadillo and dead eye, essential for embryonic development JOURNAL Genes Dev. 10 (24), 3141-3155 (1996) MEDLINE 97138157 REFERENCE 2 (bases 1 to 2257) AUTHORS Kawakami,K., Budarf,M.L., Ciccarelli,L., Emanuel,B.S. and Hopkins,N. TITLE Assignment of the human homolog of the zebrafish essential gene, pescadillo, to chromosome 22q12.1 JOURNAL Unpublished REFERENCE 3 (bases 1 to 2257) AUTHORS Kawakami,K. and Grosshans,D. TITLE Direct Submission JOURNAL Submitted (14-NOV-1996) Center for Cancer Research, Massachusetts Institute of Technology, 77 Massachusetts Ave., Cambridge, MA 02139, USA FEATURES Location/Qualifiers source 1..2257 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="26578" /chromosome="22" /map="22q12.1" gene 59..1825 /gene="pescadillo" CDS 59..1825 /gene="pescadillo" /codon_start=1 /product="pescadillo" /db_xref="PID:g2194203" /translation="MGGLEKKKYERGSATNYITRNKARKKLQLSLADFRRLCILKGIY PHEPKHKKKVNKGSTAARTFYLIKDIRFLLHEPIVNKFREYKVFVRKLRKAYGKSEWN TVERLKDNKPNYKLDHIIKERYPTFIDALRDLDDALSMCFLFSTFPRTGKCHVQTIQL CRRLTVEFMHYIIAARALRKVFLSIKGIYYQAEVLGQPIVWITPYAFSHDHPTDVDYR VMATFTEFYTTLLGFVNFRLYQLLNLHYPPKLEGQAQAEAKAGEGTYALDSESCMEKL AALSASLARVVVPATEEEAEVDEFPTDGEMSAQEEDRRKELEAQEKHKKLFEGLKFFL NREVPREALAFIIRSFGGEVSWDKSLCIGATYDVTDSRITHQIVDRPGQQTSVIGRCY VQPQWVFDSVNARLLLPVAEYFSGVQLPPHLSPFVTEKEGDYVPPEKLKLLALQRGED PGNLNESEEEEEEDDNNEGDGDEEGENEEEEEDAEAGSEKEEEARLAALEEQRMEGKK PRVMAGTLKLEDKQRLAQEEESEAKRLAIMMMKKREKYLYQKIMFGKRRKIREANKLA EKRKAHDEAVRSEKKAKKARPE" BASE COUNT 547 a 610 c 682 g 418 t ORIGIN 1 cacgaggaag tggagctgcc tgtacgcgcg gccctagtcg gctcctcaac gtggagcgat 61 gggaggcctt gagaagaaga agtatgaacg aggctcggcc accaactaca tcacccggaa 121 caaagcccgg aagaagctcc agctgagctt ggctgacttt aggcggctgt gcattctgaa 181 gggcatttat ccccatgaac ccaaacacaa gaagaaggtt aacaagggtt ctacagcagc 241 ccgaacgttt taccttatca aagacatcag gtttctcctc cacgaaccca ttgtcaacaa 301 gttccgtgaa tacaaggtgt tcgtccggaa gctccggaag gcttatggga agagcgagtg 361 gaacactgta gagcgtttaa aggacaataa gcccaactac aaactcgacc acatcatcaa 421 ggaacggtat cccacgttca tcgatgccct gcgggacctg gacgatgccc tctccatgtg 481 cttcctgttt tccaccttcc cgcggactgg caagtgccac gtgcagacca ttcagctgtg 541 ccgccggctc actgtggagt tcatgcacta cattatcgct gcccgtgccc tgcgcaaggt 601 cttcctgtcc atcaaaggca tttactacca ggccgaggta ctggggcagc ccatcgtgtg 661 gatcactccc tatgccttct cccatgacca cccgacagac gtggactaca gggtcatggc 721 caccttcacc gagttctaca ccacgctgct gggctttgtc aacttccgcc tttaccagtt 781 gctcaacctc cactatcccc cgaagctcga gggtcaggcc caagcagagg caaaggccgg 841 tgagggcacc tacgcgttgg actccgagag ttgtatggag aaactggcag ccctcagtgc 901 cagcctggcc cgcgtggtgg tgcctgccac agaggaggag gccgaggtgg atgagtttcc 961 caccgatggg gagatgtcag cgcaggagga agaccgcagg aaggagctgg aggcgcagga 1021 gaagcacaag aagctttttg agggcctgaa gttcttcctg aaccgagagg tgccccgtga 1081 ggccctggcc ttcatcatca ggagttttgg tggggaagtg tcctgggaca aatctttgtg 1141 cattggggcc acctatgacg tcacagactc ccgcatcacc catcagattg tcgaccggcc 1201 tgggcagcag acctcagtca ttggcaggtg ctacgtgcag ccccagtggg tgtttgactc 1261 agtgaacgcc aggctccttc tccccgtggc agagtacttc tctggggtgc agctgccccc 1321 acacctttca ccctttgtga ccgagaagga aggagattac gttccacctg agaagctgaa 1381 gctgctggct ctgcagcggg gagaggaccc aggaaacctg aatgagtcag aagaggagga 1441 ggaagaggac gacaacaacg aaggtgatgg tgatgaagag ggagaaaatg aggaggagga 1501 ggaagatgca gaggctggtt cagaaaagga ggaagaggcc cggctggcag ccctggaaga 1561 gcagaggatg gaggggaaga agcccagggt gatggcaggc accttgaagc tggaggataa 1621 gcagcggctg gcccaggagg aggagagtga ggccaagcgc ctggccatta tgatgatgaa 1681 gaagcgggag aagtacctgt accagaagat catgtttggc aagaggcgaa aaatccgaga 1741 ggccaacaag ctggcggaga agcggaaagc ccacgatgag gcggtgaggt ctgagaagaa 1801 ggccaagaag gcaaggccgg agtgagtgcc tgcggcccct cacagggctg aggccagccc 1861 ctagcagctg gatgtggcag aggcaggcca gaggacctaa gtgtgatgga ccagagtcac 1921 ttctcctcct cctttctcca gccagccctg acccctcatg ctctctggct gggccagtgg 1981 gcagccctcg cttcccttgg atggagctgc cctgctggtg cctggtcaga gaagaggcct 2041 ctgtgcccag cctgattctc tgctcccagg agccagtgac atgaggtgca gaggcccacc 2101 cagcccccta cctactgccc ccattcatcc tggctttcca cagccccctc ccacacagtt 2161 ggacccgtga ttctcagggt gctgtgatgg ggtgagggta gggggagcat ttgttattaa 2221 atgactggac ttttgaaaaa aaaaaaaaaa aaaaaaa // LOCUS HSU78313 1558 bp mRNA PRI 02-JAN-1997 DEFINITION Human myogenic repressor I-mf (MDFI) mRNA, complete cds. ACCESSION U78313 NID g1763614 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1558) AUTHORS Chen,C.M., Kraut,N., Groudine,M. and Weintraub,H. TITLE I-mf, a novel myogenic repressor, interacts with members of the MyoD family JOURNAL Cell 86 (5), 731-741 (1996) MEDLINE 96390847 REFERENCE 2 (bases 1 to 1558) AUTHORS Kraut,N. TITLE The gene encoding the myogenic repressor I-mf (Mdfi) maps to human chromosome 6p21 and mouse chromosome 17 JOURNAL Unpublished REFERENCE 3 (bases 1 to 1558) AUTHORS Kraut,N. TITLE Direct Submission JOURNAL Submitted (14-NOV-1996) Division of Basic Sciences, Fred Hutchinson Cancer Research Center, 1124 Columbia Street, A3-025, Seattle, WA 98104-2092, USA FEATURES Location/Qualifiers source 1..1558 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="brain" /chromosome="6" /map="6p21" gene 150..890 /gene="MDFI" CDS 150..890 /gene="MDFI" /function="inhibitor of MyoD family" /codon_start=1 /product="myogenic repressor I-mf" /db_xref="PID:g1763615" /translation="MYQVSGQRPSGCDAPYGAPSAAPGPAQTLSLLPGLEVVTGSTHP AEAAPEEGSLEEAATPMPQGNGPGIPQGLDSTDLDVPTEAVTCQPQGNPLGCTPLLPN DSGHPSELGGTRRAGNGALGGPKAHRKLQTHPSLASQGSKKSKSSSKSTTSQIPLQAQ EDCCVHCILSCLFCEFLTLCNIVLDCATCGSCSSEDSCLCCCCCGSGECADCDLPCDL DCGILDACCESADCLEICMECCGLCFSS" BASE COUNT 288 a 505 c 503 g 261 t 1 others ORIGIN 1 cagcgagtga gaggggaagg ggcgccaggc gagcacccgg gagccagcgg gacctgggca 61 ggggcgcccg gagcaggcgc gcatggcggg ccccgcgcgg ggatccggct ggaagagagc 121 gtagcacggc tcgcacgagt ccggggccga tgtaccaggt gagcggccag cgcccctctg 181 gctgcgacgc gccctatgga gcccccagcg cagccccggg cccagcccag accctatccc 241 tccttcctgg gctggaggta gtaacaggat ccactcaccc tgcggaggca gcaccagagg 301 agggctccct ggaggaggcg gcaaccccca tgccccaagg caatggccct ggcatccccc 361 agggcctgga cagcactgac ctcgacgtcc ccacagaagc tgttacatgc cagcctcagg 421 ggaacccctt gggctgcacc ccacttctgc cgaatgactc tggccacccc tcagagctgg 481 gcggcaccag acgggcgggg aatggtgccc tgggtggccc caaggcccac cggaagttgc 541 agacacaccc atctctcgcc agccagggca gcaagaagag taagagcagc agcaaatcca 601 ccacctccca gatccccctc caggcacagg aagactgctg tgtccactgc atcctgtcct 661 gcctgttctg cgagttcctg acgctgtgca acatcgtcct ggactgcgcc acctgtggct 721 cctgcagctc ggaggactcg tgcctctgct gctgctgctg tggctctggc gagtgtgccg 781 actgcgacct gccctgcgac ctggactgcg gcatcctgga tgcctgctgc gagtccgcgg 841 actgcctgga gatctgcatg gagtgctgtg ggctctgctt ctcctcctga gcctctgtcg 901 ggggctaagc cagcctggcg cccctgcaga ttccancagg gtccctctga gtggggccag 961 gcccaggact gtcacacaag gcttgagaag ccccctctcc ctggtcctct cctacccacc 1021 catgtcctct cagaacccca gccttgaaaa tagtgggggg cactcagagg ggccacctcc 1081 tcagccgtgg gtggtgggcc catggcagag aagcctgaac tctttactgg gttaccaggt 1141 tcatacattg ctgaggacct gacaggacaa cctaggggca gggctggggt gggggccgca 1201 gagggcagcc agggctgggg aacactgtga aagttacttg gggagggtgg gccggtgggg 1261 ccgtagctct ctacctctcc ctgctcctgg tgcctgcctc tctcctccac cccaggctta 1321 gaggacagaa aaatgtgaag agacgcccca cccaccctca gccagccctc tccagtctcc 1381 tttcctaggc ttttttgggg gcctaaccca cgcagtcacc ccagagggca gggctaggcg 1441 agagcctggg gtggggcggg agggggaaca gtatggaaaa gactggaagg ggaaaggaag 1501 ggaagggagg gaggtctgtt ctatctgttg ctgtaaataa agatatttgt ccatctct // LOCUS HSU78556 3327 bp mRNA PRI 28-NOV-1996 DEFINITION Human cisplatin resistance associated alpha protein (hCRA alpha) mRNA, complete cds. ACCESSION U78556 NID g1688306 KEYWORDS Cisplatin; hCRA. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3327) AUTHORS Tanimura,H., Ledakis,P. and Fojo,T. TITLE Direct Submission JOURNAL Submitted (15-NOV-1996) Medicine Branch, National Cancer Institute, 9000 Rockville Pike, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..3327 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="cisplatin resistant cell A2780 E(80) derived from A2780 (human ovarian carcinoma cell line)" gene 247..993 /gene="hCRA alpha" CDS 247..993 /gene="hCRA alpha" /codon_start=1 /product="cisplatin resistance associated alpha protein" /db_xref="PID:g1688307" /translation="MGSVQENRMPEPRSRQPSSCLASRCLPGEQILAWAPGVRKGLEP ELSGTLICTNFRVTFQPCGWQWNQDTPLNSEYDFALVNIGRLEAVSGLSRVQLLRPGS LHKFIPEEILIHGRDFRLLRVGFEAGGLEPQAFQVTMAIVQARAQSNQAQQYSGITLS KAGQGSGSRKPPIPLMETAEDWETERKKQAARGWRVSTVNERFDVATSLPRYFWVPNR ILDSEVRRAFGHFHQGRGPVSVMVRVMAVD" BASE COUNT 751 a 838 c 866 g 872 t ORIGIN 1 cacacctttc caaggacccc caaactctgc tccgtgcacg tcaaatgctc ctttcccttg 61 tgtccaaccc cctacccctc tccctaacac ccctcttctc aacaagactc agcctctccc 121 cgaggtgggt gagcatcctt gaggtttccc acccttaact gctgtgtccc cggatggagc 181 cagagaaatg tggtgggggg gccggggcag agtttcaaca ttgcccccca gaaggaggag 241 ccagagatgg ggtctgtcca ggaaaacagg atgccggagc ccaggagtcg tcagcctagc 301 agttgcctgg cctccagatg cctcccaggg gagcagatcc tagcatgggc cccaggggtg 361 aggaagggcc tggaaccaga attgtctgga accctgatct gtaccaactt tagggtcacc 421 ttccagccct gtggatggca gtggaatcag gacactccct tgaacagtga atacgatttt 481 gccctggtca acattggacg attagaggct gtgagcggct tgtcccgagt ccagctcctc 541 cgtccagggt ccctgcataa atttatccct gaggagattc tgattcatgg ccgagacttc 601 cggctgctca gagttggttt tgaggctgga ggcctagagc ctcaggcttt tcaggtgacc 661 atggccattg tccaagccag agctcagagc aatcaagccc aacagtattc ggggataacc 721 ctgagcaagg ctggccaggg ttctggctcc agaaaaccac caattcctct catggagaca 781 gcggaagact gggagactga gcggaagaag caggcagcca gaggctggag ggtcagcacg 841 gtcaacgaga ggttcgacgt agccaccagc ctcccccgtt acttctgggt ccctaaccga 901 attctggaca gtgaggtcag gagagcattt ggccactttc atcagggccg tggaccggtc 961 agtgtgatgg ttagggtaat ggctgtggat tagagggtca tgtgggccag ggacatcgtg 1021 gagggaggaa cctctgtgag gtcagtgtgg gggcaagggt agcgtggagc taggcatttc 1081 tcccacaatg accctcttct gccccatgtg aagcgcttgt cctggcatca ccctgggggc 1141 agtgatcttc tccgctgtgg aggcttctat acagccagtg accctaacaa ggaggatatc 1201 agagcagtgg agttgatgca ccaggctggg cattcagatg ttgtcctggt agacactatg 1261 gatgagctgc ccagccttgc agatgtccaa cttgcccacc tgaggctgag ggccctctgc 1321 ctgcctgatt catctgtagc tgaggataaa tgctttcagc cctggaagga acacgatggc 1381 tggactatgt cagggcttgt cttcgaaagg ccagtgacat ttcagtatta gtgacatcca 1441 gggttcgttc tgtaatactt caaggctccg gtgtttctcc tcttccttga ttgtgtctgg 1501 cagctcctcc agcagtttcc agctgatttt gaattctctg agtttttcct tcttgctctt 1561 catgacagtg tcagggttcc tgacaccctt accttcctga gaaatacccc ctgggagcgc 1621 ggaaagcaga gcggacaggt cagtgacttc tatttttgac tcgtgttttt ttttccattg 1681 agatgtactc tctgaagttt ggtcttgatt tgttttatga gaagtgaggt ctgtgagtgg 1741 ggagggggag atttattctc attttcagga cgagactttt gccctacatc tttcctagaa 1801 taagaggtga gaatctcatg atttgtctct agatgtggga ggattgtgtg taaccatcct 1861 ttttcttgct tcctctgtcc agttaaactc ctatacacaa gtctacaccc caggatactc 1921 cagcctccag ctgggaactc ttttaacctg cagctgtctg tctgggactg ggatttacgt 1981 tatagcaatg cacagatact acaattccag aatcctggct atgacccaga acactgtcca 2041 gattcctggc tccctagacc acagccaagc ttcatggttc ctggaccccc cagttttgtg 2101 tggctcttct ctagaggagc attgaccccc ctgaatcagc tctgtccttg gcgggacagt 2161 ccttccctgc tggcagtctc ttctcgttgg ctccctcgac ctgctatctc ctctgaaagc 2221 tggctgacca ggaatggggt ctcccctcac attggggagc ttgcccttta cctccagggc 2281 tgctgctgcc tgggtatctg ggaccccaga tcaggctctg gagacgctgc tacctgaggg 2341 gaaggcctga ggtccaggta agaagggaaa atagactggg agtgggacaa gggacttgac 2401 tctgctgaac cagatgaaca ggagctggaa aggcaaggag ctgaagcctc tgggagtctg 2461 ggaagtgaag ttctactcct cttggcatca aacaaggttt gggagtgtag gaggtgcggg 2521 aaagtgcttg tggcttagat taagtggaat ttagggcata gctgaaaggg gaaacagaat 2581 taaagacacc agaagtagca gagaagcagg gggccagagc tacaacagta ttcttctctg 2641 ttcctctttg cctcctcccc agatgggcct ctcatctccc acaatctctg gcctccagga 2701 tgagctatcc catcttcagg agttattacg gaaaggacac caagaatatc tcctgaggat 2761 cactccaaga aaagagatcc acataccatt ctcaatccca ctgaaattgc tggcattctc 2821 aaaggcaggg cagaggggga tctggggtag agggagggtt ctgtctaatc tttttttttt 2881 cttttgtatc tgcacttgca gcctcagctt tcatacttca gcccttaagt tcactaagaa 2941 ggtctgagtt tctgctgcag atagtggtgt taactgctcc aactcttgtc ttgcttagtt 3001 tctacaaata tttttgcttc ttgtcatttg aaggattaag aaacaaaaac aatccagaaa 3061 ttgatcggtt tttttaggcc aatcccatcc cttctggata accagatgtt aaatcatgag 3121 atcagagatg ctgttcatca gtcccaacaa gatggcctag aaatcgcatt ctcacctcgc 3181 cttgctgctg ctttaattcc aagttctatt tcttccctta tagttttcta tgggaatgag 3241 gcggatacag gaaacaccct atctcctctg tatttttgta gtggaatttc tatttaaggg 3301 gctcattaaa gcatagtatt tatacac // LOCUS HSU78575 3713 bp mRNA PRI 20-DEC-1996 DEFINITION Human 68 kDa type I phosphatidylinositol-4-phosphate 5-kinase alpha mRNA, clone PIP5KIa1, complete cds. ACCESSION U78575 NID g1743870 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3713) AUTHORS Loijens,J.C. and Anderson,R.A. TITLE Type I phosphatidylinositol-4-phosphate 5-kinases are distinct members of this novel lipid kinase family JOURNAL J. Biol. Chem. 271 (51), 32937-32943 (1996) MEDLINE 97115834 REFERENCE 2 (bases 1 to 3713) AUTHORS Loijens,J.C. and Anderson,R.A. TITLE Direct Submission JOURNAL Submitted (18-NOV-1996) Pharmacology, University of Wisconsin - Madison, 1300 University Ave., Madison, WI 53706, USA FEATURES Location/Qualifiers source 1..3713 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="PIP5KIa1" /clone_lib="human fetal brain 5'-stretch cDNA library (Clontech # HL1149x)" CDS 401..2050 /EC_number="2.7.1.68" /note="isoforms, possibly alternatively spliced, encoded by GenBank Accession Numbers U78576 and U78577" /codon_start=1 /product="68 kDa type I phosphatidylinositol-4-phosphate 5-kinase alpha" /db_xref="PID:g1743871" /translation="MASASSGPSSSVGFSSFDPAVPSCTLSSASGIKRPMASEVPYAS GMPIKKIGHRSVDSSGETTYKKTTSSALKGAIQLGITHTVGSLSTKPERDVLMQDFYV VESIFFPSEGSNLTPAHHYNDFRFKTYAPVAFRYFRELFGIRPDDYLYSLCSEPLIEL CSSGASGSLFYVSSDDEFIIKTVQHKEAEFLQKLLPGYYMNLNQNPRTLLPKFYGLYC VQAGGKNIRIVVMNNLLPRSVKMHIKYDLKGSTYKRRASQKEREKPLPTFKDLDFLQD IPDGLFLDADMYNALCKTLQRDCLVLQSFKIMDYSLLMSIHNIDHAQREPLSSETQYS VDTRRPAPQKALYSTAMESIQGEARRGGTMETDDHMGGIPARNSKGERLLLYIGIIDI LQSYRFVKKLEHSWKALVHDGDTVSVHRPGFYAERFQRFMCNTVFKKIPLKPSPSKKF RSGSSFSRRAGSSGNSCITYQPSVSGEHKAQVTTKAEVEPGVHLGRPDVLPQTPPLEE ISEGSPIPDPSFSPLVGETLQMLTTSTTLEKLEVAESEFTH" polyA_signal 3691..3696 polyA_site 3713 BASE COUNT 919 a 865 c 882 g 1047 t ORIGIN 1 attaacaggc cgtggttagg aaggacggag aaggggcgtt cgctcctttg ggacttttca 61 tgcctcgttt ttttttcaga tgtggcttgg tctgggcgca aggtcccagc agccagctta 121 agcttactct tctgtgaaag gggaaagtat cccctgtgga aagcggttaa acttgtggag 181 ggggtgcggg acgtgagttc ttccccatgc caggcgaatg gtgtggcctt gagctggtcc 241 aggagccggc tcgacgtgtc tgagggaggc ccggaggggg cggggaggtg gcccacagaa 301 cgcgggttct gtaaagagac gttgggaaga ttcgattccg agaagaggaa gaaccggatt 361 gaaagagagc caggccgctg agggggaggg ggctgctaag atggcgtcgg cctcctccgg 421 gccgtcgtct tcggtcggtt tttcatcctt tgatcccgcg gtcccttcct gtaccttgtc 481 ctcagcatct ggaatcaaga gacccatggc atctgaggtg ccttatgcct ctggcatgcc 541 catcaagaaa ataggccata gaagtgttga ttcctcagga gagacaacat ataaaaagac 601 aacctcatca gccttgaaag gtgccatcca gttaggcatt acccacactg tggggagcct 661 gagtaccaaa ccagagcgtg atgtcctcat gcaagatttc tacgtggttg agagtatctt 721 ctttcccagt gaagggagca acctgacccc tgctcatcac tacaatgact ttcgtttcaa 781 gacctatgca cctgttgcct tccgctactt ccgggagcta tttggtatcc ggcccgatga 841 ttacttgtat tccctctgca gtgagccgct gattgaactc tgtagctctg gagctagtgg 901 ttccctattc tatgtgtcca gcgacgatga gttcattatt aagacagtcc aacataaaga 961 ggcggaattt ctgcagaagc tgcttccagg atactacatg aacctcaacc agaaccctcg 1021 gactttgctg cctaaattct atggactgta ctgtgtgcag gcaggtggca agaacattcg 1081 gattgtggtg atgaacaatc ttttaccaag atcggtaaaa atgcatatca aatatgacct 1141 caaaggctca acctacaaac ggcgggcttc ccagaaagag cgagagaagc ctcttcccac 1201 atttaaagac ctagacttct tacaagacat ccctgatggt ctttttttgg atgctgacat 1261 gtacaacgct ctctgtaaga ccctgcagcg tgactgtttg gtgctgcaga gcttcaagat 1321 aatggattac agcctcttga tgtcaatcca taatatagat catgcacaac gagagccctt 1381 aagcagtgaa acacagtact cagttgatac tcgaagaccg gccccccaaa aggctctgta 1441 ttccacagcc atggaatcca tccagggaga ggctcgacgg ggtggtacca tggagactga 1501 tgaccatatg ggtggcatcc ctgcccggaa tagtaaaggg gaaaggcttc tgctttatat 1561 tggcatcatt gacattctac agtcttacag gtttgttaag aagttggagc actcttggaa 1621 agccctggta catgacggag acactgtctc agtgcatcgc ccaggcttct acgctgaacg 1681 gttccagcgc ttcatgtgca acacagtatt taagaagatt cccttgaagc cttctccttc 1741 caaaaagttt cggtctggct catctttctc tcggcgagca ggctccagtg gcaactcctg 1801 cattacttac cagccatcgg tctctgggga acacaaggca caagtgacaa caaaggcaga 1861 agtggagcca ggcgttcacc ttggtcgtcc tgatgtttta cctcagactc cacctttgga 1921 ggaaatcagt gagggctcgc ctattcctga ccccagtttc tcacctctag ttggagagac 1981 tttgcaaatg ctaactacaa gtacaacctt ggaaaagctt gaagttgcag agtcagagtt 2041 cacccattaa gcgcaaagcc tcagaagacc tggaacaaga ttctgccatc tctgtgatcc 2101 caagatgtca gcccttgccc cagcaatgct gaattttctt ctacttggtc atcaaaaaag 2161 gagtgtaata gaagtgaggg gagctgctcc tccatcttct tcctgaagaa gaaccttctc 2221 tccttcctct tcctcatgaa tgggccttag tgcctcagag agttgaggac cgcagcatcc 2281 cctccactcc agagttgggt ggtacggatt ttcaactggc caaccctttg cctccactat 2341 tgaatttttt tcagaccccc attcttcatg ctggaaatgg gattgctgga cttggcagct 2401 ttctttcccc tcgtctttga ctaggaaccg gactcttaat ttcctcagga cagactagct 2461 ggcacattat ccctacctta gttctttctc tctgactcct ggaagaatac tcctgtaatc 2521 tctgtaaagg tttttggggg ataagggtgt ttaaccacct cccagctttc ttcttctttt 2581 ttttttctga aaaaaggaaa aagcacacag cacacaattt caagccattt tcagatcaga 2641 actccagaag tgttgacaag atgcctattc gtagagttcc ctcagaagag ccatggtgtt 2701 tatgaagaga agagtagtga ttgctctgcc agaagcagct cctctttaaa ctcctcctct 2761 cttgatgaat ttcttaaggc tgaaggaatg aagagagtgg gacatggggt aatctttatc 2821 ccttttgtta aaacaggagg cagccatggg ctgggagatc atagcccttc ctaggcagaa 2881 tcctgttcac tgccaggcta tagtaattat tactattttg caatttgaaa tatattctgg 2941 ttgtttttct aaatgtgaag acttaccaaa tgaattttag atcattctcc agaggagatt 3001 ttttttgctc ttctcatctt ttccaacagt gttctcctgt ttgtggagct aaggtaaaga 3061 ggggacactt ctgtctgttt aacagacagt ccatatctgt gaggccagca aatattttct 3121 taaactcatg gggagacagc agattcttgc cttggtgagg tcattgctgt gccatatgtc 3181 ctacccccct gtcttcatgc agggaagttg gaaatggggg ctacatatgc cctctcctcc 3241 ccgtctacaa gagttgtggt tttccatctg atccttccac tcttgtcagg ggaagaaggg 3301 ggcctggtat ctcaggcaga ttgttgaatt cctgttctat cccttctcta tcccaccctg 3361 ccttgataat atgttagccc ataccccaaa taactgtcta tattagacac ccccagccag 3421 tttctggctg cctgtctttg ctgccatgtt ttttacaaga aggaaagaat tcttgctatt 3481 tttttttcat aatttactat ttatgatgta tttaagtgtt ttattaagga cagagttctg 3541 ttaggggtgg gagggaatat ttgagggagg gctgggtctt agggaaagga atggggaagc 3601 aacattttta ttaagtgtta ctatttgcct ctactttgta ttgttcagaa atggcaaata 3661 caatataaaa gtgatatatg gttttaatgt aataaacttt aatgagttat tta // LOCUS HSU78678 756 bp mRNA PRI 31-JAN-1997 DEFINITION Human thioredoxin mRNA, nuclear gene encoding mitochondrial protein, complete cds. ACCESSION U78678 NID g1809134 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 756) AUTHORS Miranda-Vizuete,A., Gustafsson,J.-A. and Spyrou,G. TITLE Human mitochondrial thioredoxin JOURNAL Unpublished REFERENCE 2 (bases 1 to 756) AUTHORS Miranda-Vizuete,A., Gustafsson,J.-A. and Spyrou,G. TITLE Direct Submission JOURNAL Submitted (18-NOV-1996) Dept. of Biosciences at NOVUM, Karolinska Institutet, Halsovagen 7, Huddinge S-14157, Sweden FEATURES Location/Qualifiers source 1..756 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" CDS 1..501 /codon_start=1 /product="thioredoxin" /db_xref="PID:g1809135" /translation="MAQRLLLRRFLASVISRKPSQGQWPPLTSKALQTPQCSPGGLTV TPNPARTIYTTRISLTTFNIQDGPDFQDRVVNSETPVVVDFHAQWCGPCKILGPRLEK MVAKQHGKVVMAKVDIDDHTDLAIEYEVSAVPTVLAMKNGDVVDKFVGIKDEDQLEAF LKKLIG" BASE COUNT 153 a 222 c 218 g 163 t ORIGIN 1 atggctcagc gacttcttct gaggaggttc ctggcctctg tcatctccag gaagccctct 61 cagggtcagt ggccacccct cacttccaaa gccctgcaga ccccacaatg cagtcctggt 121 ggcctgactg taacacccaa cccagcccgg acaatataca ccacgaggat ctccttgaca 181 acctttaata tccaggatgg acctgacttt caagaccgag tggtcaacag tgagacacca 241 gtggttgtgg atttccacgc acagtggtgt ggaccctgca agatcctggg gccgaggtta 301 gagaagatgg tggccaagca gcacgggaag gtggtgatgg ccaaggtgga tattgatgac 361 cacacagacc tcgccattga gtatgaggtg tcagcggtgc ccactgtgct ggccatgaag 421 aatggggacg tggtggacaa gtttgtgggc atcaaggatg aggatcagtt ggaggccttc 481 ctgaagaagc tgattggctg acaagcaggg atgagtcctg gttcccttgc ccgcgtggga 541 ccccaataga actcagccct tccatgccag cccttcctgc tgcctccctc ctgtctggct 601 cctggggccc atgcttagag cccaggctcc agccctgagt gcttccgagc tggcggactg 661 cccaggggcc atcagaggat ggtggtgctg ctgctgatcc ggggaccgct gtcttccctc 721 ccatacgcct ttcatccctc cttctagggc ctatgg // LOCUS HSU78773 2770 bp mRNA PRI 03-DEC-1996 DEFINITION Human nuclear corepressor KAP-1 (KAP-1) mRNA, complete cds. ACCESSION U78773 NID g1699026 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2770) AUTHORS Friedman,J.R., Fredericks,W.J., Jensen,D.E., Speicher,D.W., Huang,X.P., Neilson,E.G. and Rauscher,F.J. 3rd. TITLE KAP-1, a novel corepressor for the highly conserved KRAB repression domain JOURNAL Genes Dev. 10 (16), 2067-2078 (1996) MEDLINE 96365472 REFERENCE 2 (bases 1 to 2770) AUTHORS Friedman,J.R., Fredericks,W.J., Jensen,D.E., Speicher,D.W., Huang,X.-P., Neilson,E.G. and Rauscher,F.J. 3rd. TITLE Direct Submission JOURNAL Submitted (18-NOV-1996) Molecular Genetics, The Wistar Institute, 3601 Spruce Street, Philadelphia, PA 19104, USA FEATURES Location/Qualifiers source 1..2770 /organism="Homo sapiens" /db_xref="taxon:9606" gene 101..2608 /gene="KAP-1" CDS 101..2608 /gene="KAP-1" /note="KRAB-associated protein" /codon_start=1 /product="nuclear corepressor KAP-1" /db_xref="PID:g1699027" /translation="MAASAAAASAAAASAASGSPGPGEGSAGGEKRSTAPSAAASASA SAAASSPAGGGAEALELLEHCGVCRERLRPEREPRLLPCLHSACSACLGPAAPAAANS SGDGGAAGDGTVVDCPVCKQQCFSKDIVENYFMRDSGSKAATDAQDANQCCTSCEDNA PGTSYCVECSEPLCETCVEAHQRVKYTKDHTVRSTGPAKSRDGERTVYCNVHKHEPLV LFCESCDTLTCRDCQLNAHKDHQYQFLEDAVRNQRKLLASLVKRLGDKHATLQKSTKE VRSSIRQVSDVQKRVQVDVKMAILQIMKELNKRGRVLVNDAQKVTEGQQERLERQHWT MTKIQKHQEHILRFASWALESDNNTALLLSKKLIYFQLHRALKMIVDPVEPHGEMKFQ WDLNAWTKSAEAFGKIVAERPGTNSTGPAPMAPPRAPGPLSKQGSGSSQPMEVQEGYG FGSGDDPYSSAEPHVSGVKRSRSGEGEVSGLMRKVPRVSLERLDLDLTADSQPPVFKV FPGSTTEDYNLIVIERGAAAAATGQPGTAPAGTPGAPPLAGMAIVKEEETEAAIGAPP TATEGPETKPVLMALAEGPGAEGPRLASPSGSTSSGLEVVAPEGTSAPGGGPGTLDDS ATICRVCQKPGDLVMCNQCEFCFHLDCHLPALQDVPGEEWSCSLCHVLPDLKEEDGSL SLDGADSTGVVAKLSPANQRKCERVLLALFCHEPCRPLHQLATDSTFSLDQPGGTLDL TLIRARLQEKLSPPYSSPQEFAQDVGRMFKQFNKLTEDKADVQSIIGLQRFFETRMNE AFGDTKFSAVLVEPPPMSLPGAGLSSQELSGGPGDGP" BASE COUNT 517 a 872 c 865 g 516 t ORIGIN 1 ggcggcggcg gcggcagcgg cccagcagtt ggcggcgagg gtctggcctc gcggcgggcc 61 cgcgccctcc tccccccctg gggcccccgg cggcgtgtga atggcggcct ccgcggcggc 121 agcctcggca gcagcggcct cggccgcctc tggcagcccg ggcccgggcg agggctccgc 181 tggcggcgaa aagcgctcca ccgccccttc ggccgcagcc tcggcctctg cctcagccgc 241 ggcgtcgtcg cccgcggggg gcggcgccga ggcgctggag ctgctggagc actgcggcgt 301 gtgcagagag cgcctgcgac ccgagaggga gccccgcctg ctgccctgtt tgcactcggc 361 ctgtagtgcc tgcttagggc ccgcggcccc cgccgccgcc aacagctcgg gggacggcgg 421 ggcggcgggc gacggcaccg tggtggactg tcccgtgtgc aagcaacagt gcttctccaa 481 agacatcgtg gagaattatt tcatgcgtga tagtggcagc aaggctgcca ccgacgccca 541 ggatgcgaac cagtgctgca ctagctgtga ggataatgcc ccaggcacca gctactgtgt 601 ggagtgctcg gagcctctgt gtgagacctg tgtagaggcg caccagcggg tgaagtacac 661 caaggaccat actgtgcgct ctactgggcc agccaagtct cgggatggtg aacgtactgt 721 ctattgcaac gtacacaagc atgaacccct tgtgctgttt tgtgagagct gtgatactct 781 cacctgccga gactgccagc tcaatgccca caaggaccac cagtaccagt tcttagagga 841 tgcagtgagg aaccagcgca agctcctggc ctcactggtg aagcgccttg gggacaaaca 901 tgcaacattg cagaagagca ccaaggaggt tcgcagctca atccgccagg tgtctgacgt 961 acagaagcgt gtgcaagtgg atgtcaagat ggccatcctg cagatcatga aggagctgaa 1021 taagcggggc cgtgtgctgg tcaatgatgc ccagaaggtg actgaggggc agcaggagcg 1081 cctggagcgg cagcactgga ccatgaccaa gatccagaag caccaggagc acattctgcg 1141 ctttgcctct tgggctctgg agagtgacaa caacacagcc cttttgcttt ctaagaagtt 1201 gatctacttc cagctgcacc gggccctcaa gatgattgtg gatcccgtgg agccacatgg 1261 cgagatgaag tttcagtggg acctcaatgc ctggaccaag agtgccgagg cctttggcaa 1321 gattgtggca gagcgtcctg gcactaactc aacaggccct gcacccatgg cccctccaag 1381 agccccaggg cccctgagca agcagggctc tggcagcagc cagcccatgg aggtgcagga 1441 aggctatggc tttgggtcag gagatgatcc ctactcaagt gcagagcccc atgtgtcagg 1501 tgtgaaacgg tcccgctcag gtgagggcga ggtgagcggc cttatgcgca aggtgccacg 1561 agtgagcctt gaacgcctgg acctggacct aacagctgac agccagccac ccgtcttcaa 1621 ggtcttccca ggcagtacca ctgaggacta caaccttatt gttattgaac gtggcgctgc 1681 cgctgcagct accggccagc cagggactgc gcctgcagga acccctggtg ccccacccct 1741 ggctggcatg gccattgtca aggaggagga gacggaggct gccattggag cccctcctac 1801 tgccactgag ggccctgaga ccaaacctgt gcttatggct cttgcggagg gtcctggtgc 1861 tgagggtccc cgcctggcct cacctagtgg cagcaccagc tcagggctgg aggtggtggc 1921 tcctgagggt acctcagccc caggtggtgg cccgggaacc ctggatgaca gtgccaccat 1981 ttgccgtgtc tgccagaagc caggcgatct ggttatgtgc aaccagtgtg agttttgttt 2041 ccacctggac tgtcacctgc cggccctgca ggatgtacca ggggaggagt ggagctgctc 2101 actctgccat gtgctccctg acctgaagga ggaggatggc agcctcagcc tggatggtgc 2161 agacagcact ggcgtggtgg ccaagctctc accagccaac cagcggaaat gtgagcgtgt 2221 actgctggcc ctattctgtc acgaaccctg ccgccccctg catcagctgg ctaccgactc 2281 caccttctcc ctggaccagc ccggtggcac cctggatctg accctgatcc gtgcccgcct 2341 ccaggagaag ttgtcacctc cctacagctc cccacaggag tttgcccagg atgtgggccg 2401 catgttcaag caattcaaca agttaactga ggacaaggca gacgtgcagt ccatcatcgg 2461 cctgcagcgc ttcttcgaga cgcgcatgaa cgaggccttc ggtgacacca agttctctgc 2521 tgtgctggtg gagcccccgc cgatgagcct gcctggtgct ggcctgagtt cccaggagct 2581 gtctggtggc cctggtgatg gcccctgagg ctggagcccc catggccagc ccagcctggc 2641 tctgttctct gtcctgtcac cccatcccca ctcccctggt ggcctgactc ccactccctg 2701 gtggccccat cccccagttc ctcacgatat ggtttttact tctgtggatt taataaaaac 2761 ttcaccaggt // LOCUS HSU78798 2264 bp mRNA PRI 12-DEC-1996 DEFINITION Human TNF receptor associated factor 6 (TRAF6) mRNA, complete cds. ACCESSION U78798 L81153 NID g1732425 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2264) AUTHORS Cao,Z., Xiong,J., Takeuchi,M., Kurama,T. and Goeddel,D.V. TITLE TRAF6 is a signal transducer for interleukin-1 JOURNAL Nature 383 (6599), 443-446 (1996) MEDLINE 96434892 REFERENCE 2 (bases 1 to 2264) AUTHORS Cao,Z., Xiong,J., Takeuchi,M., Kurama,T. and Goeddel,D.V. TITLE Direct Submission JOURNAL Submitted (28-OCT-1996) Zhaodan Cao, Tularik, Inc., 2 Corporate Drive, South San Francisco, CA 94080, USA FEATURES Location/Qualifiers source 1..2264 /organism="Homo sapiens" /db_xref="taxon:9606" gene 222..1790 /gene="TRAF6" CDS 222..1790 /gene="TRAF6" /note="TNF receptor associated factor 6; transcription factor NF kappa B; signal transduction; cytokine." /codon_start=1 /product="putative interleukin 1 signal transducer" /db_xref="PID:g1732426" /translation="MSLLNCENSCGSSQSESDCCVAMASSCSAVTKDDSVGGTASTGN LSSSFMEEIQGYDVEFDPPLESKYECPICLMALREAVQTPCGHRFCKACIIKSIRDAG HKCPVDNEILLENQLFPDNFAKREILSLMVKCPNEGCLHKMELRHLEDHQAHCEFALM DCPQCQRPFQKFHINIHILKDCPRRQVSCDNCAASMAFEDKEIHDQNCPLANVICEYC NTILIREQMPNHYDLDCPTAPIPCTFSTFGCHEKMQRNHLARHLQENTQSHMRMLAQA VHSLSVIPDSGYISEVRNFQETIHQLEGRLVRQDHQIRELTAKMETQSMYVSELKRTI RTLEDKVAEIEAQQCNGIYIWKIGNFGMHLKCQEEEKPVVIHSPGFYTGKPGYKLCMR LHLQLPTAQRCANYISLFVHTMQGEYDSHLPWPFQGTIRLTILDQSEAPVRQNHEEIM DAKPELLAFQRPTIPRNPKGFGYVTFMHLEALRQRTFIKDDTLLVRCEVSTRFDMGSL RREGFQPRSTDAGV" BASE COUNT 637 a 518 c 520 g 589 t ORIGIN 1 ccgcagctgg ggcttggcct gcgggcggcc agcgaaggtg gcgaaggctc ccactggatc 61 cagagtttgc cgtccaagca gcctcgtctc ggcgcgcagt gtctgtgtcc gtcctctacc 121 agcgccttgg ctgagcggag tcgtgcggtt ggtgggggag ccctgccctc ctggttcggc 181 ctccccgcgc actagaacga gcaagtgata atcaagttac tatgagtctg ctaaactgtg 241 aaaacagctg tggatccagc cagtctgaaa gtgactgctg tgtggccatg gccagctcct 301 gtagcgctgt aacaaaagat gatagtgtgg gtggaactgc cagcacgggg aacctctcca 361 gctcatttat ggaggagatc cagggatatg atgtagagtt tgacccaccc ctggaaagca 421 agtatgaatg ccccatctgc ttgatggcat tacgagaagc agtgcaaacg ccatgcggcc 481 ataggttctg caaagcctgc atcataaaat caataaggga tgcaggtcac aaatgtccag 541 ttgacaatga aatactgctg gaaaatcaac tatttccaga caattttgca aaacgtgaga 601 ttctttctct gatggtgaaa tgtccaaatg aaggttgttt gcacaagatg gaactgagac 661 atcttgagga tcatcaagca cattgtgagt ttgctcttat ggattgtccc caatgccagc 721 gtcccttcca aaaattccat attaatattc acattctgaa ggattgtcca aggagacagg 781 tttcttgtga caactgtgct gcatcaatgg catttgaaga taaagagatc catgaccaga 841 actgtccttt ggcaaatgtc atctgtgaat actgcaatac tatactcatc agagaacaga 901 tgcctaatca ttatgatcta gactgcccta cagccccaat tccatgcaca ttcagtactt 961 ttggttgcca tgaaaagatg cagaggaatc acttggcacg ccacctacaa gagaacaccc 1021 agtcacacat gagaatgttg gcccaggctg ttcatagttt gagcgttata cccgactctg 1081 ggtatatctc agaggtccgg aatttccagg aaactattca ccagttagag ggtcgccttg 1141 taagacaaga ccatcaaatc cgggagctga ctgctaaaat ggaaactcag agtatgtatg 1201 taagtgagct caaacgaacc attcgaaccc ttgaggacaa agttgctgaa atcgaagcac 1261 agcagtgcaa tggaatttat atttggaaga ttggcaactt tggaatgcat ttgaaatgtc 1321 aagaagagga gaaacctgtt gtgattcata gccctggatt ctacactggc aaacccgggt 1381 acaaactgtg catgcgcttg caccttcagt taccgactgc tcagcgctgt gcaaactata 1441 tatccctttt tgtccacaca atgcaaggag aatatgacag ccacctccct tggcccttcc 1501 agggtacaat acgccttaca attcttgatc agtctgaagc acctgtaagg caaaaccacg 1561 aagagataat ggatgccaaa ccagagctgc ttgctttcca gcgacccaca atcccacgga 1621 acccaaaagg ttttggctat gtaactttta tgcatctgga agccctaaga caaagaactt 1681 tcattaagga tgacacatta ttagtgcgct gtgaggtctc cacccgcttt gacatgggta 1741 gccttcggag ggagggtttt cagccacgaa gtactgatgc aggggtatag cttgccctca 1801 cttgctcaaa aacaactacc tggagaaaac agtgcctttc cttgccctgt tctcaataac 1861 atgcaaacaa acaagccacg ggaaatatgt aatatctact agtgagtgtt gttagagagg 1921 tcacttacta tttcttcctg ttacaaatga tctgaggcag ttttttcctg ggaatccaca 1981 cgttccatgc tttttcagaa atgttaggcc tgaagtgcct gtggcatgtt gcagcagcta 2041 ttttgccagt tagtatacct ctttgttgta ctttcttggg cttttgctct ggtgtatttt 2101 attgtcagaa agtccagact caagagtact aaacttttaa taataatgga ttttccttaa 2161 aacttcagtc tttttgtagt attatatgta atatattaaa agtgaaaatc actaccgcct 2221 tgaaaaaaaa aaaaaaaaaa ctcgaggggg gcccgtaccc aatg // LOCUS HSU78876 2348 bp mRNA PRI 02-FEB-1997 DEFINITION Human MEK kinase 3 mRNA, complete cds. ACCESSION U78876 NID g1813645 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2348) AUTHORS Ellinger-Ziegelbauer,H.C., Brown,K., Kelly,K. and Siebenlist,U. TITLE Direct activation of the SAPK and ERK pathways by an inducible MEKK3 derivative JOURNAL J. Biol. Chem. (1997) In press REFERENCE 2 (bases 1 to 2348) AUTHORS Ellinger-Ziegelbauer,H.C., Brown,K., Kelly,K. and Siebenlist,U. TITLE Direct Submission JOURNAL Submitted (20-NOV-1996) Laboratory of Immunoregulation, NIAID, National Institutes of Health, 9000 Rockville Pike, Bethesda, MD 20892-1876, USA FEATURES Location/Qualifiers source 1..2348 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 84..1964 /function="SAPK/ERK pathway regulator" /note="hMEKK3" /codon_start=1 /product="MEK kinase 3" /db_xref="PID:g1813646" /translation="MDEQEALNSIMNDLVALQMNRRHRMPGYETMKNKDTGHSNRQSD VRIKFEHNGERRIIAFSRPVKYEDVEHKVTTVFGQPLDLHYMNNELSILLKNQDDLDK AIDILDRSSSMKSLRILLLSQDRNHNSSSPHSEVSRQVRIKASQSAGDINTIYQPPEP RSRHLSVSSQNPGRSSPPPGYVPERQQHIARQGSYTSINSEGEFIPETSEQCMLDPLS SAENSLSGSCQSLDRSADSPSFRKSRMSRAQSFPDNRQEYSDRETQLYDKGVKGGTYP RRYHVSVHHKDYSDGRRTFPRIRRHQGNLFTLVPSSRSLSTNGENMGLAVQYLDPRGR LRSADSENALSVQERNVPTKSPSAPINWRRGKLLGQGAFGRVYLCYDVDTGRELASKQ VQFDPDSPETSKEVSALECEIQLLKNLQHERIVQYYGCLRDRAEKTLTIFMEYMPGGS VKDQLKAYGALTESVTRKYTRQILEGMSYLHSNMIVHRDIKGANILRDSAGNVKLGDF GASKRLQTICMSGTGMRSVTGTPYWMSPEVISGEGYGRKADVWSLGCTVVEMLTEKPP WAEYEAMAAIFKIATQPTNPQLPSHISEHGRDFLRRIFVEARQRPSAEELLTHHFAQL MY" BASE COUNT 557 a 670 c 682 g 439 t ORIGIN 1 ccgccgcccg ggcccccggc atgcagcccc ggctgcggag gtgacactca cggaccttag 61 ccaccgccgc cgccatcgcc accatggacg aacaggaggc attgaactca atcatgaacg 121 atctggtggc cctccagatg aaccgacgtc accggatgcc tggatatgag accatgaaga 181 acaaagacac aggtcactca aataggcaga gtgacgtcag aatcaagttc gagcacaacg 241 gggagaggcg aattatagcg ttcagccggc ctgtgaaata tgaagatgtg gagcacaagg 301 tgacaacagt atttggacaa cctcttgatc tacattacat gaacaatgag ctctccatcc 361 tgctgaaaaa ccaagatgat cttgataaag caattgacat tttagataga agctcaagca 421 tgaaaagcct taggatattg ctgttgtccc aggacagaaa ccataacagt tcctctcccc 481 actctgaggt gtccagacag gtgcggatca aggcttccca gtccgcaggg gatataaata 541 ctatctacca gccccccgag cccagaagca ggcacctctc tgtcagctcc cagaaccctg 601 gccgaagctc acctccccct ggctatgttc ctgagcggca gcagcacatt gcccggcagg 661 ggtcctacac cagcatcaac agtgaggggg agttcatccc agagaccagc gagcagtgca 721 tgctggatcc cctgagcagt gcagaaaatt ccttgtctgg aagctgccaa tccttggaca 781 ggtcagcaga cagcccatcc ttccggaaat cacgaatgtc ccgtgcccag agcttccctg 841 acaacagaca ggaatactca gatcgggaaa ctcagcttta tgacaaaggg gtcaaaggtg 901 gaacctaccc ccggcgctac cacgtgtctg tgcaccacaa ggactacagt gatggcagaa 961 gaacatttcc ccgaatacgg cgtcatcaag gcaacttgtt caccctggtg ccctccagcc 1021 gctccctgag cacaaatggc gagaacatgg gtctggctgt gcaatacctg gacccccgtg 1081 ggcgcctgcg gagtgcggac agcgagaatg ccctctctgt gcaggagagg aatgtgccaa 1141 ccaagtctcc cagtgccccc atcaactggc gccggggaaa gctcctgggc cagggtgcct 1201 tcggcagggt ctatttgtgc tatgacgtgg acacgggacg tgaacttgct tccaagcagg 1261 tccaatttga tccagacagt cctgagacaa gcaaggaggt gagtgctctg gagtgcgaga 1321 tccagttgct aaagaacttg cagcatgagc gcatcgtgca gtactatggc tgtctgcggg 1381 accgcgctga gaagaccctg accatcttca tggagtacat gccagggggc tcggtgaaag 1441 accagttgaa ggcttacggt gctctgacag agagcgtgac ccgaaagtac acgcggcaga 1501 tcctggaggg catgtcctac ctgcacagca acatgattgt tcaccgggac attaagggag 1561 ccaacatcct ccgagactct gctgggaatg taaagctggg ggactttggg gccagcaaac 1621 gcctgcagac gatctgtatg tcggggacgg gcatgcgctc cgtcactggc acaccctact 1681 ggatgagccc tgaggtgatc agcggcgagg gctatggaag gaaagcagac gtgtggagcc 1741 tgggctgcac tgtggtggag atgctgacag agaaaccacc gtgggcagag tatgaagcta 1801 tggccgccat cttcaagatt gccacccagc ccaccaatcc tcagctgccc tcccacatct 1861 ctgaacatgg ccgggacttc ctgaggcgca tttttgtgga ggctcgccag agaccttcag 1921 ctgaggagct gctcacacac cactttgcac agctcatgta ctgagctctc acggccacac 1981 agctgccggt cgccctttgc tgcatggcag ggggctgctg ctgggctcag tgaagttgct 2041 gcttctccca ggcaaggctg tggaccatgg agtggcagcc cagccagcgt cggtctgtgc 2101 cccttccgcc actggggctc agagccgggg tggggtggct gcagcctcag gactgggagc 2161 ccccagcctg tcagatccag gagctccagt gtcctgagct cagcgtggag gggtaggggc 2221 tgggaacagt gtgcaaggca gccgtgggcc ccaccctcgg ggatgtgtcc tgacactgca 2281 attggcaccg aagcccagag ggtctggggg cacaagactg acgccagggt atgaagagtg 2341 ttattttc // LOCUS HSU79115 949 bp mRNA PRI 07-FEB-1997 DEFINITION Human death adaptor molecule RAIDD (RAIDD) mRNA, complete cds. ACCESSION U79115 NID g1785556 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 949) AUTHORS Lennon,G., Auffray,C., Polymeropoulos,M. and Soares,M.B. TITLE The I.M.A.G.E. Consortium: an integrated molecular analysis of genomes and their expression JOURNAL Genomics 33 (1), 151-152 (1996) MEDLINE 96224170 REFERENCE 2 (bases 1 to 949) AUTHORS Duan,H. and Dixit,V.M. TITLE RAIDD is a new 'death' adaptor molecule JOURNAL Nature 385 (6611), 86-89 (1997) MEDLINE 97138227 REFERENCE 3 (bases 1 to 949) AUTHORS Duan,H. and Dixit,V.M. TITLE Direct Submission JOURNAL Submitted (20-NOV-1996) Pathology, University of Michigan, 1150 W. Medical Center Drive, Ann Arbor, MI 48109, USA FEATURES Location/Qualifiers source 1..949 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="I.M.A.G.E. Consortium CloneID 109053" gene 101..700 /gene="RAIDD" CDS 101..700 /gene="RAIDD" /note="RIP associated protein with a death domain; homologous to ICH-1/CED-3" /codon_start=1 /product="death adaptor molecule RAIDD" /db_xref="PID:g1785557" /translation="MEARDKQVLRSLRLELGAEVLVEGLVLQYLYQEGILTENHIQEI NAQTTGLRKTMLLLDILPSRGPKAFDTFLDSLQEFPWVREKLKKAREEAMTDLPAGDR LTGIPSHILNSSPSDRQINQLAQRLGPEWEPMVLSLGLSQTDIYRCKANHPHNVQSQV VEAFIRWRQRFGKQATFQSLHNGLRAVEVDPSLLLHMLE" BASE COUNT 217 a 241 c 260 g 231 t ORIGIN 1 ttgtgctcta aagtgcttat ggggcaggtt ccctaacagt caggattccg gttgcagttt 61 ttctcccccg ccccaaagat acgtggttgc agacggagaa atggaggcca gagacaaaca 121 agtactccgc tcacttcgcc tggagctggg tgcagaggta ttggtggagg gactggttct 181 tcagtacctc taccaggaag gaatcttgac ggaaaaccat attcaagaaa tcaatgctca 241 aaccacaggc ctccggaaaa caatgctcct gctggatatc ctaccttcca ggggccctaa 301 agcatttgat acattcctag attccctaca ggagtttccc tgggtcaggg agaagctgaa 361 gaaggcaagg gaagaggcca tgaccgacct gcctgcaggt gacagattga ctgggatccc 421 ctcgcacatc ctcaacagct ccccatcaga ccggcagatt aaccagctgg cccagaggct 481 gggccctgag tgggagccca tggtgctgtc tctgggactg tcccagacgg atatctaccg 541 ctgtaaggcc aaccaccccc acaacgtgca gtcgcaggtg gtggaggcct tcatccgttg 601 gcggcagcgc ttcgggaagc aggccacctt ccagagcctg cacaacgggc tgcgggctgt 661 ggaggtggac ccctcgctgc tcctgcacat gttggagtga tggtgcctcc agcaaccgct 721 ggggagtgtg tccctgagtc atgtgggctt gaatcctgac tttcactcag agcaggtggt 781 tttttgtgta ggtttgtttt ttatttttga tgatcttcag atggaaggag aaaacagggt 841 ttccactaga cattacttga aaggccagat tactcagcag atctcccatg ttggctcaac 901 aattctttgt ttttaattgc ttgaagattg cattgttgta attgttcag // LOCUS HSU79252 1600 bp mRNA PRI 25-MAR-1997 DEFINITION Human clone 23679 mRNA, complete cds. ACCESSION U79252 NID g1710201 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1600) AUTHORS Andersson,B., Wentland,M.A., Ricafrente,J.Y., Liu,W. and Gibbs,R.A. TITLE A 'double adaptor' method for improved shotgun library construction JOURNAL Anal. Biochem. 236 (1), 107-113 (1996) MEDLINE 96207227 REFERENCE 2 (bases 1 to 1600) AUTHORS Yu,W., Andersson,B., Worley,K.C., Muzny,D.M., Ding,Y., Liu,W., Ricafrente,J.Y., Wentland,M.A., Lennon,G. and Gibbs,R.A. TITLE Large Scale Concatenation cDNA Sequencing JOURNAL Unpublished REFERENCE 3 (bases 1 to 1600) AUTHORS Yu,W. and Gibbs,R.A. TITLE Direct Submission JOURNAL Submitted (22-NOV-1996) Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza S930, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..1600 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="23679" /sex="female" /tissue_type="brain" /dev_stage="infant" /clone_lib="Soares library 1NIB from IMAGE consortium" CDS 974..1450 /codon_start=1 /product="unknown" /db_xref="PID:g1710202" /translation="MEGPRRGPEVGGFCKYRLLRVSRALCHDTSLGLTWLRTCSVRGF VRTLPFCLKLKAKENDRRLRTELTLAPGWEAAALLDATYCKWPEYQRGGFHGQMHSRC LPLHLDHLVVFKFLVPEAKSTTCLLVTCLPAVVVDVLAGRFGISHQSFCTVLVSSI" BASE COUNT 448 a 326 c 343 g 483 t ORIGIN 1 ggccaaaagt tgacttattt tgaatggact acattatagc ttaactagat tgtacgagtg 61 cttatcaact aatatgttaa aatatggtaa ttcctttttt ttttttttga gatggagtcc 121 agcctgggtg acagagtgag accctatttc aaaaaaataa aaattggaag aagagcttaa 181 aaaagataag attttaaaga gtcccaagtt atttaagttg agtgtaattg tcatttaagg 241 aaggcaaatg agtttatcat ccttcttaaa gagcatctct tttaactgtt ggacaaaacc 301 ataactttgt cattttacaa ggaagaacct cttaagaagt cctcagaacc agaagcaatg 361 tgaactctca gcgctggtcc tggtgggttt gctgaccatg actgggcaag ccgttctttt 421 tgctgccatc ttcctcatca taaagtgtgg aacataggca attgctttga gattcttgga 481 tagaagagga caacattctg cacctgcccc cttttttaaa tctttgggga aagatgagta 541 actttcccca ctactctgcc ttcctgttca gtaactctta cttttgcctg aagtaacagc 601 atcttctact tctccatcta gagatttttg tgtgtgtgcc atcaaggtta gcaaacttta 661 tacgtagcct aacacttaaa aaatgcactc attatcttaa acctaataaa ttccagagtt 721 tattttggtt ctcctctgtt gcccttccta aaaaatgagc tgaagatgac agtatttttc 781 tttacatgct tggttatgac ttttaaagtt ttatttaaat aaatgttgaa gctcaagttt 841 aaagaagcgt tgcagaggcc cacggtctcc tgggtcccgg ccacctgtcc atattccaca 901 tttgctgact gtgctccctg cactccactc aagttgagag ttcaaatagt cttgaagggg 961 aatcagcttc aggatggaag gacccaggag aggccccgag gtgggagggt tctgtaaata 1021 cagactactg cgagtgtcca gagctctctg ccatgatact tccttgggac tgacttggct 1081 gagaacgtgt tctgtcagag gatttgttag aactctgccc ttttgtctga aactcaaggc 1141 caaggagaat gataggagac ttaggacaga gctgaccctt gcaccaggct gggaggctgc 1201 agccctttta gatgccactt actgtaagtg gccagaatac cagagaggtg ggttccatgg 1261 tcaaatgcac agtaggtgtt tacctttaca tttggatcac cttgtagtct ttaaattctt 1321 ggtccctgag gccaagtcca caacttgcct tctagtcact tgcctgcccg cagtggtggt 1381 ggatgtgtta gctggtagat ttggaatcag tcaccagtct ttctgtactg tcttggttag 1441 ctctatataa gtaggggcag cttagccctg aggcccagag acctgctgtc ctttttctcc 1501 ttgagggagg aaataaaact gcggaataca atgtccttcc atagcatggg aagaagaaaa 1561 taaacatctc ctttccaaca aaaaaaaaaa aaaaaaaaaa // LOCUS HSU79253 1203 bp mRNA PRI 25-MAR-1997 DEFINITION Human clone 23893 mRNA, complete cds. ACCESSION U79253 NID g1710204 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1203) AUTHORS Andersson,B., Wentland,M.A., Ricafrente,J.Y., Liu,W. and Gibbs,R.A. TITLE A 'double adaptor' method for improved shotgun library construction JOURNAL Anal. Biochem. 236 (1), 107-113 (1996) MEDLINE 96207227 REFERENCE 2 (bases 1 to 1203) AUTHORS Yu,W., Andersson,B., Worley,K.C., Muzny,D.M., Ding,Y., Liu,W., Ricafrente,J.Y., Wentland,M.A., Lennon,G. and Gibbs,R.A. TITLE Large Scale Concatenation cDNA Sequencing JOURNAL Unpublished REFERENCE 3 (bases 1 to 1203) AUTHORS Yu,W. and Gibbs,R.A. TITLE Direct Submission JOURNAL Submitted (22-NOV-1996) Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza S930, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..1203 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="female" /tissue_type="brain" /dev_stage="infant" /clone_lib="Soares library 1NIB from IMAGE consortium" /clone="23893" CDS 644..1054 /codon_start=1 /product="unknown" /db_xref="PID:g1710205" /translation="MCCAVSEQRLTCADQMMLFGKISQQLCGVKKLPWSCDSRYFWGW LNAVFNKVDYDRIRDVGPDRAASEWLLRCGAMVRYHGQERWQKDYNHLPTGPLDKYKI QAIDATDSCIMSIGFDHMGNYPIVLLIENADDLQ" BASE COUNT 252 a 325 c 320 g 306 t ORIGIN 1 gacgctgtga gtagagaagc taggccccga gccgggcggg actagggtgg tggttgtgtt 61 ctgccctcgc ccgtcgccaa cgctgggctg actgaaaacc tggccctttg ggtcgcgcgt 121 cttcgacttt caccgccatg cggacatggg gccgtgggag aaactctggg cgggattacg 181 gtgaaaattc ttgtttgggc gggccctgcg gcttctttct tccggctcgc cctcggcccc 241 gccctcgccc cgcccccgcc cccgctccag ctccagccgg gtcctcgctc ggccccgcac 301 gcggcccgcg gtagacggag ggcctgggca cccaccgtct ccatcccggc cccagcccag 361 tcgagcgtcc actagaaggg cgctccctgt ccggccctaa tcctcgctcc tcacggaggc 421 tctttgtcac agcgaagact gacagcccgc agtcttctgg acttctttta tggggctcct 481 ggcggtgcta ttcattcagt cattgattcg tcttgtaaat atggagcgcc tgctctgtac 541 tcgacccggc actgggtacc ggaagaacca gacagctcgg tttttgccac catttataag 601 tctgtgtcct tttcttgagt actgaaccga gatacagttt taaatgtgct gtgcggtctc 661 tgagcagcga ctcacctgtg cagatcaaat gatgctgttt ggaaaaattt cccagcagtt 721 gtgtggcgta aagaaactcc catggtcatg tgactccaga tacttctggg gctggttgaa 781 tgcagtgttt aataaggtgg attatgatcg catcagggat gttggccctg acagggcggc 841 atccgagtgg ttgctgcgct gtggggccat ggtgcgctac catggccagg agaggtggca 901 gaaggactac aaccaccttc caacaggccc tctggacaaa tacaagattc aggcgatcga 961 cgccaccgac tcttgtatca tgagcattgg atttgatcac atgggtaact accctatcgt 1021 tttgctaata gaaaatgcag atgatttgca gtgaccattt tgttgcagtt gactcacgat 1081 tatagtcata ggtatgtcct ttttgcccat ttcattatag tcaagtttgt ttcttcctgt 1141 gtttcatatt tattttcaaa ttaaattgag gttacagaca aaaaaaaaaa aaaaaaaaaa 1201 aaa // LOCUS HSU79259 1706 bp mRNA PRI 25-MAR-1997 DEFINITION Human clone 23945 mRNA, complete cds. ACCESSION U79259 NID g1710213 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1706) AUTHORS Andersson,B., Wentland,M.A., Ricafrente,J.Y., Liu,W. and Gibbs,R.A. TITLE A 'double adaptor' method for improved shotgun library construction JOURNAL Anal. Biochem. 236 (1), 107-113 (1996) MEDLINE 96207227 REFERENCE 2 (bases 1 to 1706) AUTHORS Yu,W., Andersson,B., Worley,K.C., Muzny,D.M., Ding,Y., Liu,W., Ricafrente,J.Y., Wentland,M.A., Lennon,G. and Gibbs,R.A. TITLE Large Scale Concatenation cDNA Sequencing JOURNAL Unpublished REFERENCE 3 (bases 1 to 1706) AUTHORS Yu,W. and Gibbs,R.A. TITLE Direct Submission JOURNAL Submitted (22-NOV-1996) Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza S930, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..1706 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="23945" /sex="female" /tissue_type="brain" /dev_stage="infant" /clone_lib="Soares library 1NIB from IMAGE consortium" CDS 637..1404 /codon_start=1 /product="unknown" /db_xref="PID:g1710214" /translation="MNPSTPSDGTFGQGFHCDSPSLGAPELDGKHFPPLAHPPTVFDA GLQKAYSPTCSPTLGFKEELRPPPTKLAACEPLKHGLQGASLGHAAAAQAHLSCRDLP LGHPHYDSPSCKGTAYWYPPGSAARSPPYEGKVGTGLLADFLGRTEAACLSAPHLASP PATPKADKEPLEMARPPGPPRGPAAAAAGYGCPLLSDLTLSPVPRDSLLPLQDTAYRY PGFMPQAHPGLGGGPKSGFLGPMAEPHPEDTFTVTSL" BASE COUNT 358 a 610 c 456 g 282 t ORIGIN 1 cagcgagccc aacgtgatcc tggacatctc caactacaca ccgcagaagg tgaagcagca 61 gacggctgtg tcggagacct tctctgagtc atcctccgac agcacccagt tcaatcagcc 121 ggttggtggc ggggggtttc ggcgtgccaa cagcgaggcc tcaagtagtg agggccagtc 181 gagcctgtcc agcctggaga aactgatgat ggactggaac gaggcatcat ctgcccccgg 241 ctacaactgg aaccagagtg tcctctttca gagtagctcc aagccgggcc gtggacggcg 301 gaagaaggtg gacctgttcg aggcctcaca tctgggcttc ccgacatccg cctctgccgc 361 tgcctcaggc tacccatcca aacggagcac tgggccccgg cagccgcgag gtggacgggg 421 cggtggggcc tgctcagcca agaaggagcg gggtggcgca gcggccaaag ccaagttcat 481 ccccaagcca cagccagtca acccactgtt ccaggacagt cctgacctcg gcctggacta 541 ctatagcggg gacagcagca tgtcaccact gccctcacag tcgaggcctt cggcgtggga 601 gagcgagacc cctgtgactt cataggaccc tactccatga acccgtccac gccttccgat 661 ggcacctttg gccaaggctt ccactgcgac tcgcccagcc tgggtgctcc cgagcttgat 721 ggcaagcatt tcccaccgct ggcccaccca cccacggtgt ttgacgccgg cctgcagaag 781 gcatactcgc ccacctgctc gcctacactg ggcttcaagg aagagctgcg gccaccgccc 841 acaaagctgg ctgcctgcga gcccctcaag catggactcc agggggccag cctgggccac 901 gcagctgcag cccaggccca cctgagctgc cgggacctgc cgctgggcca tccccactac 961 gattccccca gctgcaaggg cacagcctat tggtaccctc caggctcagc tgcccgcagc 1021 ccgccctatg aaggcaaggt gggtacaggg ctgctggctg acttcctggg caggacggag 1081 gccgcgtgcc tcagtgcccc tcacctggct agcccaccag ccacgcccaa ggccgacaag 1141 gagccactgg aaatggcccg gccccctggc ccaccccgtg gccctgctgc agccgctgct 1201 ggctatggct gcccactcct tagtgacttg accctgtccc ccgtgccgag ggactcgctg 1261 ctgcccctgc aggacaccgc ctacaggtac ccaggcttta tgccccaggc gcatcctggc 1321 ctgggtgggg gccccaagag cggcttcctg gggcccatgg cggaacctca ccccgaggac 1381 acattcaccg tcacatccct gtagtgccaa ctgaagtgcc gactggaccg cgaggttttg 1441 ttcctggctt tcagaaaacc aacgccaaga tccctcccag cgtccacatc gtcctctggc 1501 aggagctcct gcccctctgc ctcccaccct gccccctaca ccccctgcag acccatctcc 1561 ctccaccccc tcccacccat ctcctccacg cagaagccga aggtgagccc tttctgcaca 1621 aaaccagcaa ttgtaaatac tttttaaaaa tgtacaaaac ttaaaaacaa aacacagttt 1681 tagaaaaaaa aaaaaaaaaa aaaaaa // LOCUS HSU79266 1579 bp mRNA PRI 25-MAR-1997 DEFINITION Human clone 23627 mRNA, complete cds. ACCESSION U79266 NID g1710225 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1579) AUTHORS Andersson,B., Wentland,M.A., Ricafrente,J.Y., Liu,W. and Gibbs,R.A. TITLE A 'double adaptor' method for improved shotgun library construction JOURNAL Anal. Biochem. 236 (1), 107-113 (1996) MEDLINE 96207227 REFERENCE 2 (bases 1 to 1579) AUTHORS Yu,W., Andersson,B., Worley,K.C., Muzny,D.M., Ding,Y., Liu,W., Ricafrente,J.Y., Wentland,M.A., Lennon,G. and Gibbs,R.A. TITLE Large Scale Concatenation cDNA Sequencing JOURNAL Unpublished REFERENCE 3 (bases 1 to 1579) AUTHORS Yu,W. and Gibbs,R.A. TITLE Direct Submission JOURNAL Submitted (22-NOV-1996) Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza S930, Houston, TX 77030, USA COMMENT similar to human DNA with an HBV insertion site, GenBank Accession Number M15772. FEATURES Location/Qualifiers source 1..1579 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="23627" /sex="female" /tissue_type="brain" /dev_stage="infant" /clone_lib="Soares library 1NIB from IMAGE consortium" CDS 185..1042 /codon_start=1 /product="unknown" /db_xref="PID:g1710226" /translation="MAGRRAQPGSAPPRPAAPHPRPASRAFPQHCRPRDAERPPSPRS PLMPGCELPVGTCPDMCPAAERAQREREHRLHRLEVVPGCRQDPPRADPQRAVKEYSR PAAGKPRPPPSQLRPPSVLLATVRYLAGEVAESADIARAEVASFVADRLRAVRLDLAL QGAGDAEAAVVLEAALATLLAVVARLGPDAARGPADPVLLQAQVQEGFGSLRRCYARG AGPHPRQPAFQGLFLLYNLGESGSWRLGRAWGQEPTMTVEARWKPCMRFYSCLLPCAP ARPSARPWR" BASE COUNT 252 a 528 c 560 g 239 t ORIGIN 1 aaaacactaa ggggagcgcg cgaagctgaa cttggcgctc gatgggggcc gttagccgcc 61 ctagagcgcg cggagccgca gaggcgtagc tggactacaa cgcagtgcat ctcgggaggc 121 caactcgact ggactgggtg agaggacaga ggtggctcga tgggcggccc gaaggccggg 181 gatcatggcg ggaaggcggg cccagccagg ttcagccccg ccccgacccg ccgctcccca 241 cccccggccg gcctcgcgtg ccttcccgca gcactgccgt ccccgggatg ctgagcgccc 301 accgtctccc cgcagccccc tcatgcccgg ctgcgagctg cccgtgggca cctgcccgga 361 catgtgcccg gccgccgagc gcgcccagcg cgaaagggag caccgcctgc accgcttgga 421 ggtggtgccg ggttgccgcc aggacccgcc ccgcgcggat ccgcagcgcg cggtgaagga 481 gtacagccga cccgccgccg gcaagccccg gcccccgccc agccagttgc gtccgccctc 541 cgtgctgctg gccaccgtgc gctacctggc cggtgaggtg gcggagagcg ccgacatcgc 601 ccgcgccgag gtggccagct tcgtggcaga ccgcttgcga gctgtgcgcc tggacctggc 661 gctgcaggga gcgggcgacg ccgaggcagc ggtggtgctg gaggcggcgc tggccacgct 721 gctggccgta gtggcgcggc tcgggcccga cgcggcgcgg ggacccgcgg acccggtgct 781 gctgcaggcc caggtgcagg agggcttcgg ctcgctgcgg cgctgctacg cgcggggcgc 841 cgggccgcac ccccgccaac ccgccttcca gggcctcttt ctgctctata acctgggtga 901 gtcgggatcc tggcggctgg gcagagcgtg gggacaggag cccaccatga cagtggaggc 961 tcggtggaag ccctgcatga ggttctacag ctgcctgctg ccctgcgcgc ctgcccgccc 1021 ctccgcaagg ccttggcggt agatgctgcc ttccgagagg gcaatgctgc ccgcctgttc 1081 cgtctgctcc agaccctgcc ctacctgcca agttgcgctg tgcagtgcca tgtgggccat 1141 gcccgccggg aagccctggc ccgcttcgct cgtgccttta gcacccccaa gggccagacc 1201 ttgcctctgg gcttcatggt caacctcttg gccctggatg gactcaggga agcacgggac 1261 ctgtgccagg cccacgggct gcccttggac ggagaggaga gagttgtgtt cctgaggggt 1321 cgctacgtgg aggaagggct accgcctgcc agtacgtgca aggtgttagt ggagagcaaa 1381 cttcgaggac gtaccctgga ggaggtggtc atggcagagg aggaagatga gggcacggac 1441 agacctgggt ccccagcctg aggagggagc gtgagcctcc cagagcccca ggactgggcc 1501 agagcactta ggtttctttt tccatggttt ccaggtaata aaaggaactt gttttgttgg 1561 taaaaaaaaa aaaaaaaaa // LOCUS HSU79274 1506 bp mRNA PRI 25-MAR-1997 DEFINITION Human clone 23733 mRNA, complete cds. ACCESSION U79274 NID g1710240 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1506) AUTHORS Andersson,B., Wentland,M.A., Ricafrente,J.Y., Liu,W. and Gibbs,R.A. TITLE A 'double adaptor' method for improved shotgun library construction JOURNAL Anal. Biochem. 236 (1), 107-113 (1996) MEDLINE 96207227 REFERENCE 2 (bases 1 to 1506) AUTHORS Yu,W., Andersson,B., Worley,K.C., Muzny,D.M., Ding,Y., Liu,W., Ricafrente,J.Y., Wentland,M.A., Lennon,G. and Gibbs,R.A. TITLE Large Scale Concatenation cDNA Sequencing JOURNAL Unpublished REFERENCE 3 (bases 1 to 1506) AUTHORS Yu,W. and Gibbs,R.A. TITLE Direct Submission JOURNAL Submitted (22-NOV-1996) Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza S930, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..1506 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="23733" /sex="female" /tissue_type="brain" /dev_stage="infant" /clone_lib="Soares library 1NIB from IMAGE consortium" CDS 417..1238 /codon_start=1 /product="unknown" /db_xref="PID:g1710241" /translation="MLGQLLPHTARGLGAAEMPGQGPGSDWTERSSSAEPPAVAGTEG GGGGSAGYSCYQNSKGSDRIKDGYKVNSHIAKLQELWKTPQNQTIHLSKSMMEASFFK HPDLTTGQKRYLCSIAKIYNANYLKMLMKRQYMHVLQHSSQKPGVLTHHRSRLSSRYS QKQHYPCTTWRHQLEREDSGSSDIAAASAPEMLIQHSLWWPVRNKEGIKTGYASKTRC KSLKIFRRPRKLFMQTVSSDDSESHMSEEKKEEDLLNNFMQSMSIEEQGEHLMLT" BASE COUNT 429 a 351 c 366 g 360 t ORIGIN 1 aagcttggat acctgagtaa taacggacaa cagataaagc tccgcatcct tgcgccacgt 61 gcttcggtcc gtggtttgcg tgcagacgtt tgacctgtat ggtgaccctc gcgatttgca 121 aatgtgctga ggatctggaa tactgaagtg gaaggcacct cttgttttgg ggagcatgta 181 tatttccctc ctgtgacgca ctgcttccac agagaggtgc aaacgtgcaa acgctcagcg 241 accgcagcgc tcctgcccct cccccaccgt aactccgggg tcgcggatct gcccgccccg 301 ctctcccgaa gctgttcggg cagtgtccga acggcttcgg aggggcgaga agccagcatc 361 cgagccgcct ctccggaata ccagcagcct gacgcacgcg tgctgtcggg ggagggatgc 421 tgggacagct gctcccgcac acggctcgcg gtctcggcgc cgcggagatg cccggccagg 481 gtccggggtc cgactggacg gagcgtagct cttctgcaga gccgcccgct gtggccggga 541 ccgagggtgg cggcggcgga tcagctggat actcttgtta ccagaattcc aaaggttctg 601 atagaatcaa agatggatac aaagtgaact cacacatagc taagctgcaa gagttatgga 661 aaactcccca aaatcaaaca atccacctct ctaaatcaat gatggaggcg tcctttttca 721 agcatccaga cctcaccaca ggccagaagc gttacctgtg cagcattgct aaaatctata 781 atgcaaacta tctgaagatg ttaatgaaga ggcagtacat gcacgtactt cagcacagct 841 cacaaaagcc aggtgtcctc actcatcaca gaagccgcct tagctcccgt tactcacaga 901 aacagcatta cccttgcact acatggcgac atcaactgga gagagaggac tcggggtctt 961 ctgatatcgc agctgcatct gcacctgaaa tgctcataca gcattccctt tggtggccag 1021 tgagaaacaa agaagggata aaaactggat atgcatctaa aacaagatgt aagtcactga 1081 agatttttag aagaccaagg aaactgttca tgcaaacagt ttcttcagat gattctgaat 1141 cacacatgag tgaagaaaaa aaggaagaag atttactaaa taattttatg caatcaatgt 1201 caattgaaga acagggagaa catctgatgt taacttgaca gtcttgtctc gtgtattgaa 1261 ttcgtgccaa aggtgagggt aaggggttgt gagttgtgtc ctgtatgttt aggatggtat 1321 tgttatttat taaatcatta agtaattttg gtttgttcag aaacttaaaa caatgtaatt 1381 ggtctgatgt agttccatgt accaatgata gttatgtaag aaaatttaca tgtaacatat 1441 acttgtactt ctagctagat acaattaaaa cttttcttgc attcaaaaaa aaaaaaaaaa 1501 aaaaaa // LOCUS HSU79303 1567 bp mRNA PRI 25-MAR-1997 DEFINITION Human clone 23882 mRNA, complete cds. ACCESSION U79303 NID g1710289 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1567) AUTHORS Andersson,B., Wentland,M.A., Ricafrente,J.Y., Liu,W. and Gibbs,R.A. TITLE A 'double adaptor' method for improved shotgun library construction JOURNAL Anal. Biochem. 236 (1), 107-113 (1996) MEDLINE 96207227 REFERENCE 2 (bases 1 to 1567) AUTHORS Yu,W., Andersson,B., Worley,K.C., Muzny,D.M., Ding,Y., Liu,W., Ricafrente,J.Y., Wentland,M.A., Lennon,G. and Gibbs,R.A. TITLE Large Scale Concatenation cDNA Sequencing JOURNAL Unpublished REFERENCE 3 (bases 1 to 1567) AUTHORS Yu,W. and Gibbs,R.A. TITLE Direct Submission JOURNAL Submitted (22-NOV-1996) Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza S930, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..1567 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="23882" /sex="female" /dev_stage="infant" /tissue_type="brain" /clone_lib="Soares library 1NIB from IMAGE consortium" CDS 289..750 /codon_start=1 /product="unknown" /db_xref="PID:g1710290" /translation="MNDRSSRRRTMKDDETFEISIPFDEAPHLDPQIFYSLSPSRRNF EEPPEAASSALALMNSVKTQLHMALERNSWLQKRIEDLEEERDFLRCQLDKFISSARM EAEDHCRMKPGPRRMEGDSRGGAGGEASDPESAASSLSGASEEGSASERRR" BASE COUNT 294 a 545 c 487 g 241 t ORIGIN 1 aacggcgggc ctcacctgga cccggggact cttcaacgag gggcgctagc ctggcccgac 61 tgggcgagtc cccgcgtccc tgcccgttcc gtcgctcagt tccacgacac gctcccctgc 121 cgccccgcct gttcgcgggt gggtgggcgc ttccctcggc tccccgtgac actttgcaga 181 cgctccccgg cgccgggcat gggcgccgcc gccgtcggtc cccgagccgg attccgcgag 241 cggtgcccct gaggccctcg gctgctgggg tccgcaggaa gccgcgccat gaatgaccgg 301 agcagtcgga ggcggacaat gaaggacgat gagaccttcg agatctccat tcccttcgat 361 gaggcacccc acctagaccc acagatcttt tacagtctga gcccctctcg gagaaacttc 421 gaggagcctc cggaggctgc gtcctccgcc ctggctctga tgaacagcgt caagacccag 481 ctgcacatgg ctctggagag gaactcctgg ctgcagaagc gcatcgagga cctggaggaa 541 gagagggact tcctgcggtg ccagctggac aaattcatct cttctgctcg gatggaggca 601 gaggaccact gccggatgaa gcctgggccc aggcggatgg agggggacag ccgtggtggg 661 gctgggggcg aggcctcgga ccctgagtca gcagcctcct ccctcagcgg agcgtccgaa 721 gaaggcagtg ccagtgagag gaggcggtag aagcagaagg gaggtgctag tcggaggcgc 781 tttgggaagc ccaaggcccg ggagaggcag cgagtgaagg acgccgacgg ggtcctctgc 841 cggtacaaga agatcctggg caccttccag aagctcaaga gcatgtcgcg ggccttcgag 901 caccaccgcg tggacaggaa caccgtggcg ctgaccacgc ccatcgccga gctgctcatt 961 gtggcccccg agaagctggc cgaggtgggc gagttcgacc cctccaagga gcgcctgctc 1021 gagtactccc gccgctgctt tctggccctg gacgacgaga cgctcaagaa ggtgcaggcg 1081 ctcaagaaga gcaagctgct gctgcccatc acctaccgct tcaagcggtg atcgcaccac 1141 gcctccgcgc ctccacccgg gccttcctcc cccgtggacc ccggtggatg acctgcccct 1201 ctccccgccg cgcccctgcc cctcctcctc gctccctggg ttgggggctc ccttagccgg 1261 gcccccaagc gcgacggccc cggaccggcc gcggcccctt cccgaacgcc ggcaccccct 1321 tccgcttggg ctgcccagcc ctgtcctcgc cgggcccctt cctcctggaa aaccaggcag 1381 gcgggtgccc ccccctcgag tgggggactg tacagacccc gtctccgccc tggccccgcg 1441 gaggagctgc ccacctgatt cccggacaga cctccccaac tccgcgtgag acagagaatt 1501 attcagataa tttaaattaa aaaacgacgt gaaaatttgg aataaaaaaa aaaaaaaaaa 1561 aaaaaaa // LOCUS HSU79526 2415 bp mRNA PRI 07-MAY-1997 DEFINITION Human orphan G-protein coupled receptor Dez isoform a mRNA, complete cds. ACCESSION U79526 NID g1732342 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2415) AUTHORS Methner,A., Hermey,G., Schinke,B. and Hermans-Borgmeyer,I. TITLE A novel G protein-coupled receptor with homology to neuropeptide and chemoattractant receptors expressed during bone development JOURNAL Biochem. Biophys. Res. Commun. 233 (2), 336-342 (1997) MEDLINE 97289630 REFERENCE 2 (bases 1 to 2415) AUTHORS Methner,A., Hermey,G., Schinke,B. and Hermans,I. TITLE Direct Submission JOURNAL Submitted (25-NOV-1996) ZMNH, UKE Hamburg, Martinistr. 52, Hamburg D-20246, Germany FEATURES Location/Qualifiers source 1..2415 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="NT2" CDS 355..1476 /note="alternatively spliced" /codon_start=1 /product="orphan G-protein coupled receptor Dez isoform a" /db_xref="PID:g1732343" /translation="MRMEDEDYNTSISYGDEYPDYLDSIVVLEDLSPLEARVTRIFLV VVYSIVCFLGILGNGLVIIIATFKMKKTVNMVWFLNLAVADFLFNVFLPIHITYAAMD YHWVFGTAMCKISNFLLIHNMFTSVFLLTIISSDRCISVLLPVWSQNHRSVRLAYMAC MVIWVLAFFLSSPSLVFRDTANLHGKISCFNNFSLSTPGSSSWPTHSQMDPVGYSRHM VVTVTRFLCGFLVPVLIITACYLTIVCKLHRNRLAKTKKPFKIIVTIIITFFLCWCPY HTLNLLELHHTAMPGSVFSLGLPLATALAIANSCMNPILYVFMGQDFKKFKVALFSRL VNALSEDTGHSSYPSHRSFTKMSSMNERTSMNERETGML" BASE COUNT 590 a 641 c 606 g 577 t 1 others ORIGIN 1 gaattcggca cgagccccgg cggccagcag ggagctcagg acagagcagg ctccctggga 61 agcctccggg tgataggggt gttccagctg cggcgctctg ggggttcaga gggggatctt 121 gaatgaacaa atgaatgaac tgctttctgg gcaaacagcc acagccagag gagcctgtga 181 ttggcagaaa gaagccaggg tgtgcaagtc tccccaacag cctcgagtgg cctgcagtca 241 cagggaaccc tcaggaagac cttccgggca gagaccagag ggtgtttcta gctgtgtaca 301 gggactgatt ggctgaggac tcacattgga gagctgcaga caacataacg gtgaatgaga 361 atggaggatg aagattacaa cacttccatc agttacggtg atgaataccc tgattattta 421 gactccattg tggttttgga ggacttatcc cccttggaag ccagggtgac caggatcttc 481 ctggtggtgg tctacagcat cgtctgcttc ctcgggattc tgggcaatgg tctggtgatc 541 atcattgcca ccttcaagat gaagaagaca gtgaacatgg tctggttcct caacctggca 601 gtggcagatt tcctgttcaa cgtcttcctc ccaatccata tcacctatgc cgccatggac 661 taccactggg ttttcgggac agccatgtgc aagatcagca acttccttct catccacaac 721 atgttcacca gcgtcttcct gctgaccatc atcagctctg accgctgcat ctctgtgctc 781 ctccctgtct ggtcccagaa ccaccgcagc gttcgcctgg cttacatggc ctgcatggtc 841 atctgggtcc tggctttctt cttgagttcc ccatctctcg tcttccggga cacagccaac 901 ctgcatggga aaatatcctg cttcaacaac ttcagcctgt ccacacctgg gtcttcctcg 961 tggcccactc actcccaaat ggaccctgtg gggtatagcc ggcacatggt ggtgactgtc 1021 acccgcttcc tctgtggctt cctggtccca gtcctcatca tcacagcttg ctacctcacc 1081 atcgtctgca aactgcaccg caaccgcctg gccaagacca agaagccctt caagattatt 1141 gtgaccatca tcattacctt cttcctctgc tggtgcccct accacacact caacctccta 1201 gagctccacc acactgccat gcctggctct gtcttcagcc tgggtttgcc cctggccact 1261 gcccttgcca ttgccaacag ctgcatgaac cccattctgt atgttttcat gggtcaggac 1321 ttcaagaagt tcaaggtggc cctcttctct cgcctggtca atgctctaag tgaagataca 1381 ggccactctt cctaccccag ccatagaagc tttaccaaga tgtcatcaat gaatgagagg 1441 acttctatga atgagaggga gaccggcatg ctttgatcct cactgtggaa cccctcaatg 1501 gactctctca acccagggac acccaaggat atgtcttctg aagatcaagg caagaacctc 1561 tttagcatcc accaattttc actgcatttt gcatgggatg aacagtgttt tatgctggga 1621 atctagggcc tggaacccct ttcttctagt ggaccttggg aggccagcct tgactgactc 1681 aaagcaaaaa aggaagaatt ctcaaaagca ttgccatgaa ctgggattgg catagggcgg 1741 tgggattaag ctgcctattg tgtgtgcccc agaaatgaca ctctccaagg tccattcctg 1801 gtgtgagcag tgagggggtc agagcaaacc cagtgtgatg cagatcacac ttggcccttg 1861 tatatatatt atcagtagct ggccagaact caagtcactg ctctgtgtta aagtatgtgg 1921 aaatgcatag gtgatctggg agaggaaggt gacatatcag cccatgaact ccataggcca 1981 ttaatttggc tttcggatca gctaggaaga cagaaaactg gtcctgagag gtcctgtggg 2041 ctctttacaa gggcaagaca agtgagggat gcaacgagga cttgagattt ggcaaagaaa 2101 tgagaaagga gaaaagaacc tttaaaggat gcaggcatgg agcagctcac cctanaacat 2161 cgcaggctga tggtgcttta ggctggagct atttggggcg tcagggtggt gccagcccct 2221 gatctcacct tgcccctctt acctgggggt ggggttgtgg caggcatctt caccagcagc 2281 cctcacccca tcacagggat ttttttctgc ctttcctaat ccactcagct ctggctggaa 2341 gtgcttaaaa taaaacactg gtggtgggga attgctggga gcaaaaaaaa aaaaaaaaaa 2401 aaaaaaaaaa aaaaa // LOCUS HSU79716 11580 bp mRNA PRI 25-FEB-1997 DEFINITION Human reelin (RELN) mRNA, complete cds. ACCESSION U79716 NID g1743884 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 11580) AUTHORS DeSilva,U., D'Arcangelo,G., Braden,V.V., Chen,J., Miao,G.G., Curran,T. and Green,E.D. TITLE The human reelin gene: isolation, sequencing, and mapping on chromosome 7 JOURNAL Genome Res. 7 (2), 157-164 (1997) MEDLINE 97202106 REFERENCE 2 (bases 1 to 11580) AUTHORS DeSilva,U., D'Arcangelo,G., Braden,V.V., Chen,J., Miao,G.G., Curran,T. and Green,E.D. TITLE Direct Submission JOURNAL Submitted (26-NOV-1996) National Center for Human Genome Research, National Institutes of Health, 49 Convent Drive, MSC4431, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..11580 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" gene 176..10558 /gene="RELN" CDS 176..10558 /gene="RELN" /codon_start=1 /product="reelin" /db_xref="PID:g1743885" /translation="MERSGWARQTFLLALLLGATLRARAAAGYYPRFSPFFFLCTHHG ELEGDGEQGEVLISLHIAGNPTYYVPGQEYHVTISTSTFFDGLLVTGLYTSTSVQASQ SIGGSSAFGFGIMSDHQFGNQFMCSVVASHVSHLPTTNLSFIWIAPPAGTGCVNFMAT ATHRGQVIFKDALAQQLCEQGAPTDVTVHPHLAEIHSDSIILRDDFDSYHQLQLNPNI WVECNNCETGEQCGAIMHGNAVTFCEPYGPRELITTGLNTTTASVLQFSIGSGSCRFS YSDPSIIVLYAKNNSADWIQLEKIRAPSNVSTIIHILYLPEDAKGENVQFQWKQENLR VGEVYEACWALDNILIINSAHRQVVLEDSLDPVDTGNWLFFPGATVKHSCQSDGNSIY FHGNEGSEFNFATTRDVDLSTEDIQEQWSEEFESQPTGWDVLGAVIGTECGTIESGLS MVFLKDGERKLCTPSMDTTGYGNLRFYFVMGGICDPGNSHENDIILYAKIEGRKEHIT LDTLSYSSYKVPSLVSVVINPELQTPATKFCLRQKNHQGHNRNVWAVDFFHVLPVLPS TMSHMIQFSINLGCGTHQPGNSVSLEFSTNHGRSWSLLHTECLPEICAGPHLPHSTVY SSENYSGWNRITIPLPNAALTRNTRIRWRQTGPILGNMWAIDNVYIGPSCLKFCSGRG QCTRHGCKCDPGFSGPACEMASQTFPMFISESFGSSRLSSYHNFYSIRGAEVSFGCGV LASGKALVFNKEGRRQLITSFLDSSQSRFLQFTLRLGSKSVLSTCRAPDQPGEGVLLH YSYDNGITWKLLEHYSYLSYHEPRIISVELPGDAKQFGIQFRWWQPYHSSQREDVWAI DEIIMTSVLFNSISLDFTNLVEVTQSLGFYLGNVQPYCGHDWTLCFTGDSKLASSMRY VETQSMQIGASYMIQFSLVMGCGQKYTPHMDNQVKLEYSTNHGLTWHLVQEECLPSMP SCQEFTSASIYHASEFTQWRRVIVLLPQKTWSSATRFRWSQSYYTAQDEWALDSIYIG QQCPNMCSGHGSCDHGICRCDQGYQGTECHPEAALPSTIMSDFENQNGWESDWQEVIG GEIVKPEQGCGVISSGSSLYFSKAGKRQLVSWDLDTSWVDFVQFYIQIGGESASCNKP DSREEGVLLQYSNNGGIQWHLLAEMYFSDFSKPRFVYLELPAAAKTPCTRFRWWQPVF SGEDYDQWAVDDIIILSEKQKQIIPVINPTLPQNFYEKPAFDYPMNQMSVWLMLANEG MVKNETFCAATPSAMIFGKSDGDRFAVTRDLTLKPGYVLQFKLNIGCANQFSSTAPVL LQYSHDAGMSWFLVKEGCYPASAGKGCEGNSRELSEPTMYHTGDFEEWTRITIVIPRS LASSKTRFRWIQESSSQKNVPPFGLDGVYISEPCPSYCSGHGDCISGVCFCDLGYTAA QGTCVSNVPNHNEMFDRFEGKLSPLWYKITGAQVGTGCGTLNDGKSLYFNGPGKREAR TVPLDTRNIRLVQFYIQIGSKTSGITCIKPRTRNEGLIVQYSNDNGILWHLLRELDFM SFLEPQIISIDLPQDAKTPATAFRWWQPQHGKHSAQWALDDVLIGMNDSSQTGFQDKF DGSIDLQANWYRIQGGQVDIDCLSMDTALIFTENIGKPRYAETWDFHVSASTFLQFEM SMGCSKPFSNSHSVQLQYSLNNGKDWHLVTEECVPPTIGCLHYTESSIYTSERFQNWK RITVYLPLSTISPRTRFRWIQANYTVGADSWAIDNVVLASGCPWMCSGRGICDAGRCV CDRGFGGPYCVPVVPLPSILKDDFNGNLHPDLWPEVYGAERGNLNGETIKSGTSLIFK GEGLRMLISRDLDCTNTMYVQFSLRFIAKSTPERSHSILLQFSISGGITWHLMDEFYF PQTTNILFINVPLPYTAQTNATRFRLWQPYNNGKKEEIWIVDDFIIDGNNVNNPVMLL DTFDFGPREDNWFFYPGGNIGLYCPYSSKGAPEEDSAMVFVSNEVGEHSITTRDLNVN ENTIIQFEINVGCSTDSSSADPVRLEFSRDFGATWHLLLPLCYHSSSHVSSLCSTEHH PSSTYYAGTMQGWRREVVHFGKLHLCGSVRFRWYQGFYPAGSQPVTWAIDNVYIGPQC EEMCNGQGSCINGTKCICDPGYSGPTCKISTKNPDFLKDDFEGQLESDRFLLMSGGKP SRKCGILSSGNNLFFNEDGLRMLMTRDLDLSHARFVQFFMRLGCGKGVPDPRSQPVLL QYSLNGGLSWSLLQEFLFSNSSNVGRYIALEIPLKARSGSTRLRWWQPSENGHFYSPW VIDQILIGGNISGNTVLEDDFTTLDSRKWLLHPGGTKMPVCGSTGDALVFIEKASTRY VVSTDVAVNEDSFLQIDFAASCSVTDSCYAIELEYSVDLGLSWHPLVRDCLPTNVECS RYHLQRILVSDTFNKWTRITLPLPPYTRSQATRFRWHQPAPFDKQQTWAIDNVYIGDG CIDMCSGHGRCIQGNCVCDEQWGGLYCDDPETSLPTQLKDNFNRAPSSQNWLTVNGGK LSTVCGAVASGMALHFSGGCSRLLVTVDLNLTNAEFIQFYFMYGCLITPNNRNQGVLL EYSVNGGITWNLLMEIFYDQYSKPGFVNILLPPDAKEIATRFRWWQPRHDGLDQNDWA IDNVLISGSADQRTVMLDTFSSAPVPQHERSPADAGPVGRIAFDMFMEDKTSVNEHWL FHDDCTVERFCDSPDGVMLCGSHDGREVYAVTHDLTPTEGWIMQFKISVGCKVSEKIA QNQIHVQYSTDFGVSWNYLVPQCLPADPKCSGSVSQPSVFFPTKGWKRITYPLPESLV GNPVRFRFYQKYSDMQWAIDNFYLGPGCLDNCRGHGDCLREQCICDPGYSGPNCYLTH TLKTFLKERFDSEEIKPDLWMSLEGGSTCTECGILAEDTALYFGGSTVRQAVTQDLDL RGAKFLQYWGRIGSENNMTSCHRPICRKEGVLLDYSTDGGITWTLLHEMDYQKYISVR HDYILLPEDALTNTTRLRWWQPFVISNGIVVSGVERAQWALDNILIGGAEINPSQLVD TFDDEGTSHEENWSFYPNAVRTAGFCGNPSFHLYWPNKKKDKTHNALSSRELIIQPGY MMQFKIVVGCEATSCGDLHSVMLEYTKDARSDSWQLVQTQCLPSSSNSIGCSPFQFHE ATIYNSVNSSSWKRITIQLPDHVSSSATQFRWIQKGEETEKQSWAIDHVYIGEACPKL CSGHGYCTTGAICICDESFQGDDCSVFSHDLPSYIKDNFESARVTEANWETIQGGVIG SGCGQLAPYAHGDSLYFNGCQIRQAATKPLDLTRASKIMFVLQIGSMSQTDSCNSDLS GPHAVDKAVLLQYSVNNGITWHVIAQHQPKDFTQAQRVSYNVPLEARMKGVLLRWWQP RHNGTGHDQWALDHVEVVLVSTRKQNYMMNFSRQHGLRHFYNRRRRSLRRYP" BASE COUNT 3014 a 2696 c 2753 g 3116 t 1 others ORIGIN 1 cacgcgtggg ctcggcgggg gcccgctccc aggcccgctc ccgagcccgt tccgctcccg 61 tccgccttct tctcgccttc tctccgcgtg gctcctccgt cccggcgtct ccaaaactga 121 atgagcgagc ggcgcgtagg gcgscggcgg cggcggcggc ggcggcggcg gcggcatgga 181 gcgcagtggc tgggcccggc agactttcct cctagcgctg ttgctggggg cgacgctgag 241 ggcgcgcgcg gcggctggct attacccccg cttttcgccc ttctttttcc tgtgcaccca 301 ccacggggag ctggaagggg atggggagca gggcgaggtg ctcatttccc tgcatattgc 361 gggcaacccc acctactacg ttccgggaca agaataccat gtgacaattt caacaagcac 421 cttttttgac ggcttgctgg tgacaggact atacacatct acaagtgttc aggcatcaca 481 gagcattgga ggttccagtg ctttcggatt tgggatcatg tctgaccacc agtttggtaa 541 ccagtttatg tgcagtgtgg tagcctctca cgtgagtcac ctgcccacaa ccaacctcag 601 tttcatctgg attgctccac ctgcgggcac aggctgtgtg aatttcatgg ctacagcaac 661 acaccggggc caggttattt tcaaagatgc tttagcccag cagttgtgtg aacaaggagc 721 tccaacagat gtcactgtgc acccacatct agctgaaata catagtgaca gcattatcct 781 gagagatgac tttgactcct accaccaact gcaattaaat ccaaatatat gggttgaatg 841 taacaactgt gagactggag aacagtgtgg cgcgattatg catggcaatg ccgtcacctt 901 ctgtgaacca tatggcccac gagaactgat taccacaggc cttaatacaa caacagcttc 961 tgtcctccaa ttttccattg ggtcaggttc atgtcgcttt agttattcag accccagcat 1021 catcgtgtta tatgccaaga ataactctgc ggactggatt cagctagaga aaattagagc 1081 cccttccaat gtcagcacaa tcatccatat cctctacctt cctgaggacg ccaaagggga 1141 gaatgtccaa tttcagtgga agcaggaaaa tcttcgtgta ggtgaagtgt atgaagcctg 1201 ctgggcctta gataacatct tgatcatcaa ttcagctcac agacaagtcg ttttagaaga 1261 tagtctcgac ccagtggaca caggcaactg gcttttcttc ccaggagcta cagttaagca 1321 tagctgtcag tcagatggga actccattta tttccatgga aatgaaggca gcgagttcaa 1381 ttttgccacc accagggatg tagatctttc cacagaagat attcaagagc aatggtcaga 1441 agaatttgag agccagccta caggatggga tgtcttggga gctgtcattg gtacagaatg 1501 tggaacgata gaatcaggct tatcaatggt cttcctcaaa gatggagaga ggaaattatg 1561 cactccatcc atggacacta ccggttatgg gaacctgagg ttttactttg tgatgggagg 1621 aatttgtgac cctggaaatt ctcatgaaaa tgacataatc ctgtatgcaa aaattgaagg 1681 aagaaaagag catataacac tggataccct ttcctattcc tcatataagg ttccgtcttt 1741 ggtttctgtg gtcatcaatc ctgaacttca gactcctgct accaaatttt gtctcaggca 1801 aaagaaccat caaggacata ataggaatgt ctgggctgta gactttttcc atgtcttgcc 1861 tgttctccct tctacaatgt ctcacatgat acagttttcc atcaatctgg gatgtggaac 1921 gcatcagcct ggtaacagtg tcagcttgga attttctacc aaccatgggc gctcctggtc 1981 cctccttcac actgaatgct tacctgagat ctgtgctgga ccccacctcc cccacagcac 2041 tgtctactcc tctgaaaact acagtgggtg gaaccgaata acaattcccc ttcctaacgc 2101 agcactaacc cggaacacca ggattcgctg gagacaaaca ggaccaatcc ttggaaacat 2161 gtgggcaatt gataatgttt atattggccc gtcatgtctc aaattctgtt ctggcagagg 2221 acagtgcact agacatggtt gcaagtgtga ccctggattt tctggcccag cttgtgagat 2281 ggcatcccag acattcccaa tgtttatttc tgaaagcttt ggcagttcca ggctctcctc 2341 ttaccataac ttttactcta tccgtggtgc tgaagtcagc tttggttgtg gtgtcttggc 2401 cagtggtaag gccctggttt tcaacaaaga agggcggcgt cagctaatta catctttcct 2461 tgacagctca caatccaggt ttctccagtt cacactgaga ctggggagca aatctgttct 2521 gagcacgtgc agagcccctg atcagcctgg tgaaggagtt ttgctgcatt attcttatga 2581 taatgggata acttggaaac tcctggagca ttattcatat ctcagctatc atgagcccag 2641 aataatctcc gtagaactac caggtgatgc aaagcagttt ggaattcagt tcagatggtg 2701 gcaaccgtat cattcttccc agagagaaga tgtatgggct attgatgaga ttatcatgac 2761 atctgtgctt ttcaacagca ttagtcttga ctttaccaat cttgtggagg tcactcagtc 2821 tctgggattc taccttggaa atgttcagcc atactgtggc cacgactgga ccctttgttt 2881 tacaggagat tctaaacttg cctcaagtat gcgctatgtg gaaacacaat caatgcagat 2941 aggagcatcc tatatgattc agttcagttt ggtgatggga tgtggccaga aatacacccc 3001 acacatggac aaccaggtga agctggagta ctcaaccaac cacggcctta cctggcacct 3061 cgtccaagaa gaatgccttc caagtatgcc aagttgtcag gaatttacat cagcaagtat 3121 ttaccatgcc agtgagttta cacagtggag gagagtcata gtgcttcttc cccagaaaac 3181 ttggtccagt gctacccgtt tccgctggag ccagagctat tacacagctc aagacgagtg 3241 ggctttggac agcatttaca ttgggcagca gtgccccaac atgtgcagtg ggcatggctc 3301 atgcgatcat ggcatatgca ggtgtgacca ggggtaccaa ggcactgaat gccacccaga 3361 agctgccctt ccgtccacaa ttatgtcaga ttttgagaac cagaatggct gggagtctga 3421 ctggcaagaa gttattgggg gagaaattgt aaaaccagaa caagggtgtg gtgtcatctc 3481 ttctggatca tctctgtact tcagcaaggc tgggaaaaga cagctggtga gttgggacct 3541 ggatacttct tgggtggact ttgtccagtt ctacatccag ataggcggag agagtgcttc 3601 atgcaacaag cctgacagca gagaggaggg cgtcctcctt cagtacagca acaatggggg 3661 catccagtgg cacctgctag cagagatgta cttttcagac ttcagcaaac ccagatttgt 3721 ctatctggag cttccagctg ctgccaagac cccttgcacc aggttccgct ggtggcagcc 3781 cgtgttctca ggggaggact atgaccagtg ggcagtcgat gacatcatca ttctgtccga 3841 gaagcagaag cagatcatcc cagttatcaa tccaacttta cctcagaact tttatgagaa 3901 gccagctttt gattacccta tgaatcagat gagtgtgtgg ttgatgttgg ctaatgaagg 3961 aatggttaaa aatgaaacct tctgtgctgc cacaccatca gcaatgatat ttggaaaatc 4021 agatggagat cgatttgcag taactcgaga tttgaccctg aaacctggat atgtgctaca 4081 gttcaagcta aacataggtt gtgccaatca attcagcagt actgctccag ttcttcttca 4141 gtactctcat gatgctggta tgtcctggtt tctggtgaaa gaaggctgtt acccggcttc 4201 tgcaggcaaa ggatgcgaag gaaactccag agaactaagt gagcccacca tgtatcacac 4261 aggggacttt gaagaatgga caagaatcac cattgttatt ccaaggtctc ttgcatccag 4321 caagaccaga ttccgatgga tccaggagag cagctcacag aaaaacgtgc ctccatttgg 4381 tttagatgga gtgtacatat ccgagccttg tcccagttac tgcagtggcc atggggactg 4441 catttcagga gtgtgtttct gtgacctggg atatactgct gcacaaggaa cctgtgtgtc 4501 aaatgtcccc aatcacaatg agatgttcga taggtttgag gggaagctca gccctctgtg 4561 gtacaagata acaggtgccc aggttggaac tggctgtgga acacttaacg atggcaaatc 4621 tctctacttc aatggccctg ggaaaaggga agcccggacg gtccctctgg acaccaggaa 4681 tatcagactt gttcaatttt atatacaaat tggaagcaaa acttcaggca ttacctgcat 4741 caaaccaaga actagaaatg aagggcttat tgttcagtat tcaaatgaca atgggatact 4801 ctggcatttg cttcgagagt tggacttcat gtccttcctg gaaccacaga tcatttccat 4861 tgacctgcca caggacgcga agacacctgc aacggcattt cgatggtggc aaccgcaaca 4921 tgggaagcat tcagcccagt gggctttgga tgatgttctt ataggaatga atgacagctc 4981 tcaaactgga tttcaagaca aatttgatgg ctctatagat ttgcaagcca actggtatcg 5041 aatccaagga ggtcaagttg atattgactg tctctctatg gatactgctc tgatattcac 5101 tgaaaacata ggaaaacctc gttatgctga gacctgggat tttcatgtgt cagcatctac 5161 ctttttgcag tttgaaatga gcatgggctg tagcaagccc ttcagcaact cccacagtgt 5221 acagctccag tattctctga acaatggcaa ggactggcat cttgtcaccg aagagtgtgt 5281 tcctccaacc attggctgtc tgcattacac ggaaagttca atttacacct cggaaagatt 5341 ccagaattgg aagcggatca ctgtctacct tccactctcc accatttctc ccaggacccg 5401 gttcagatgg attcaggcca actacactgt gggggctgat tcctgggcga ttgataatgt 5461 tgtactggcc tcagggtgcc cttggatgtg ctcaggacga gggatttgtg atgctggacg 5521 ctgtgtgtgt gaccggggct ttggtggacc ctattgtgtt cctgttgttc ctctgccctc 5581 gattcttaaa gacgatttca atgggaattt acatcctgac ctttggcctg aagtgtatgg 5641 tgcagagagg gggaatctga atggtgaaac catcaaatct ggaacatctc taatttttaa 5701 aggggaagga ctaaggatgc ttatttcaag agatctagat tgtacaaata caatgtatgt 5761 ccagttttca cttagattta tagcaaaaag taccccagag agatctcact ctattctgtt 5821 acaattctcc atcagtggag gaatcacttg gcacctgatg gatgaatttt actttcctca 5881 aacaacgaat atacttttca tcaatgttcc cttgccatac actgcccaaa ccaatgctac 5941 aagattcaga ctctggcaac cttataataa cggtaagaaa gaagaaatct ggattgttga 6001 tgacttcatt atcgatggaa ataatgtaaa caaccctgtg atgctcttgg atacatttga 6061 ttttgggccc agagaagaca attggttttt ctatcctggt ggtaacatcg gtctttattg 6121 tccatattct tcaaaggggg cacctgaaga agattcagct atggtgtttg tttcaaatga 6181 agttggtgag cattccatta ccacccgtga cctaaatgtg aatgagaaca ccatcataca 6241 atttgagatc aacgttggct gttcgactga tagctcatcc gcggatccag tgagactgga 6301 attttcaagg gacttcgggg cgacctggca ccttctgctg cccctctgct accacagcag 6361 cagccacgtc agctctttat gctccaccga gcaccacccc agcagcacct actacgcagg 6421 aaccatgcag ggctggagga gggaggtcgt gcactttggg aagctgcacc tttgtggatc 6481 tgtccgtttc agatggtacc agggatttta ccctgccggc tctcagccag tgacatgggc 6541 cattgataat gtctacatcg gtccccagtg tgaggagatg tgtaatggac aggggagctg 6601 tatcaatgga accaaatgta tatgtgaccc tggctactca ggtccaacct gtaaaataag 6661 caccaaaaat cctgattttc tcaaagatga tttcgaaggt cagctagaat ctgatagatt 6721 cttattaatg agtggtggga aaccatctcg aaagtgtgga atcctttcta gtggaaacaa 6781 cctctttttc aatgaagatg gcttgcgcat gttgatgaca cgagacctgg atttatcaca 6841 tgctagattt gtgcagttct tcatgagact gggatgtggt aaaggcgttc ctgaccccag 6901 gagtcaaccc gtgctcctac agtattctct caacggtggc ctctcgtgga gtcttcttca 6961 ggagttcctt ttcagcaatt ccagcaatgt gggcaggtac attgccctgg agataccctt 7021 gaaagcccgt tctggttcta ctcgccttcg ctggtggcaa ccgtctgaga atgggcactt 7081 ctacagcccc tgggttatcg atcagattct tattggagga aatatttctg gtaatacggt 7141 cttggaagat gatttcacaa cccttgatag taggaaatgg ctgcttcacc caggaggcac 7201 caagatgccc gtgtgtggct ctactggtga tgccctggtc ttcattgaaa aggccagcac 7261 ccgttacgtg gtcagcacag acgttgccgt gaatgaggat tccttcctac agatagactt 7321 cgctgcctcc tgctcagtca cagactcttg ttatgcgatt gaattggaat actcagtaga 7381 tcttggattg tcatggcacc cattggtaag ggactgtctg cctaccaatg tggaatgcag 7441 tcgctatcat ctgcaacgga tcctggtgtc agacactttc aacaagtgga ctagaatcac 7501 tctgcctctc cctccttata ccaggtccca agccactcgt ttccgttggc atcaaccagc 7561 tccttttgac aagcagcaga catgggcaat agataatgtc tatatcgggg atggctgcat 7621 agacatgtgc agtggccatg ggagatgcat ccagggaaac tgcgtctgtg atgaacagtg 7681 gggtggcctg tactgtgatg accccgagac ctctcttcca acccaactca aagacaactt 7741 caatcgagct ccatccagtc agaactggct gactgtgaac ggagggaaat tgagtacagt 7801 gtgtggagcc gtggcgtcgg gaatggctct ccatttcagt gggggttgta gtcgattatt 7861 agtcactgtg gatctaaacc tcactaatgc tgagttcatc caattttact tcatgtatgg 7921 gtgcctgatt acaccaaaca accgtaacca aggtgttctc ttggaatatt ctgtcaatgg 7981 aggcattacc tggaacctgc tcatggagat tttctatgac cagtacagta agcccggatt 8041 tgtgaatatc cttctccctc ctgatgctaa agagattgcc actcgcttcc gctggtggca 8101 gccaagacat gacggcctgg atcagaacga ctgggccatt gacaatgtcc tcatctcagg 8161 ctctgctgac caaaggaccg ttatgctgga caccttcagc agcgccccag taccccagca 8221 cgagcgctcc cctgcagatg ccggccctgt cgggaggatc gcctttgaca tgtttatgga 8281 agacaaaact tcagtgaatg agcactggct attccatgat gattgtacag tagaaagatt 8341 ctgtgactcc cctgatggtg tgatgctctg tggcagtcat gatggacggg aggtgtatgc 8401 agtgacccat gacctgactc ccactgaagg ctggattatg caattcaaga tctcagttgg 8461 atgtaaggtg tctgaaaaaa ttgcccagaa tcaaattcat gtgcagtatt ctactgactt 8521 cggtgtgagt tggaattatc tggtccctca gtgcttgcct gctgacccaa aatgctctgg 8581 aagtgtttct cagccatctg tattctttcc aactaaaggg tggaaaagga tcacctaccc 8641 acttcctgaa agcttagtgg gaaatccggt aaggtttagg ttctatcaga agtactcaga 8701 catgcagtgg gcaatcgata atttctacct gggccctgga tgcttggaca actgcagggg 8761 ccatggagat tgcttaaggg aacagtgcat ctgtgatccg ggatactcag ggccaaactg 8821 ctacttgacc cacactctga agactttcct gaaggaacgc tttgacagtg aagaaatcaa 8881 acctgactta tggatgtcct tagaaggtgg aagtacttgc actgagtgtg gaattcttgc 8941 cgaggacact gcactctatt ttgggggatc cactgtgaga caagcggtta cacaagattt 9001 ggatcttcga ggtgcaaagt tcctgcaata ctgggggcgc atcggtagtg agaacaacat 9061 gacctcttgc catcgtccca tctgccggaa ggaaggcgtg ctgttggact actctaccga 9121 tggaggaatt acctggactt tgctccatga gatggattac cagaaataca tttctgttag 9181 acacgactac atacttcttc ctgaagatgc cctcaccaac acaactcgac ttcgctggtg 9241 gcagcctttt gtgatcagca atggaattgt ggtctctggg gtggagcgtg ctcagtgggc 9301 actggacaac attttgattg gtggagcaga aatcaatccc agccaattgg tggacacttt 9361 tgatgatgaa ggcacttccc atgaagaaaa ctggagtttt taccctaatg ctgtaaggac 9421 agcaggattt tgtggcaatc catcctttca cctctattgg ccaaataaaa agaaggacaa 9481 gactcacaat gctctctcct cccgagaact cattatacag ccaggataca tgatgcagtt 9541 taaaattgtg gtgggttgtg aagccacttc ttgtggtgac cttcattccg taatgctgga 9601 atacactaag gatgcaagat cggattcctg gcagctcgta cagacccagt gccttccttc 9661 ctcttctaac agcattggct gctccccttt ccagttccat gaagccacca tctacaactc 9721 tgtcaacagc tcaagctgga aaagaatcac catccagctg cctgaccatg tctcctctag 9781 tgcaacacag ttccgctgga tccagaaggg agaagaaact gagaagcaaa gctgggcaat 9841 tgaccacgtg tacattggag aggcttgccc caagctctgc agcgggcacg gatactgcac 9901 gaccggtgcc atctgcatct gcgacgagag cttccaaggt gatgactgct ctgttttcag 9961 tcacgacctt cccagttata ttaaagataa ttttgagtcc gcaagagtca ccgaggcaaa 10021 ctgggagacc attcaaggtg gagtcatagg aagtggctgt gggcagctgg ccccctacgc 10081 ccatggagac tcactgtact ttaatggctg tcagatcagg caagcagcta ccaagcctct 10141 ggatctcact cgagcaagca aaatcatgtt tgttttgcaa attgggagca tgtcgcagac 10201 ggacagctgc aacagtgacc tgagtggccc ccacgctgtg gacaaggcgg tgctgctgca 10261 atacagcgtc aacaacggga tcacctggca tgtcatcgcc cagcaccagc caaaggactt 10321 cacacaagct cagagagtgt cttacaatgt ccccctggag gcacggatga aaggagtctt 10381 actgcgctgg tggcaaccac gccacaatgg aacaggtcat gatcaatggg ctttggacca 10441 tgtggaggtc gtcctagtaa gcactcgcaa acaaaattac atgatgaatt tttcacgaca 10501 acatgggctc agacatttct acaacagaag acgaaggtca cttaggcgat acccatgaag 10561 aatcaaaaag tttatttttt ttcttccaac atgtgatgtg ttgctctcca ttcttttaaa 10621 tctcgcacta catctgatat caggaaatat ctgtgaagga cttggtgatt acctgaaagc 10681 ccttctcaag accgagtgta caccactttc ccacactgtg aactaatgac aagtgactta 10741 tttgctcata agtaaatgtc ttcatgttga tgtgtccgtg aaagttgtga tctgttgtaa 10801 tatcagttac agtggcagta ttgacaataa gaaacagttt aacagaaaaa tgaaatttaa 10861 gcacaaaaaa tttaagagat tttatgttta aaatggcatt tagcacagta tttaacattc 10921 ttggtcacaa agctatttaa gtggactgta tttcagctat gtctcatgtt ttatatgatt 10981 aaattatcat tgtttgtcct ttatgtattc tcttctacaa tacaacacat tgaaactgta 11041 tttacttgtt atgttgtaat attttgctgc tgaatttggg gctacttata ttctgcagaa 11101 aattaattga aatacctatt caagaagata gttgtaaaga tattgtatct cctttaatat 11161 actccttaaa aatgtatgtt ggtttagcgt tgttttgtgg ataagaaaaa tgcttgaccc 11221 tgaaatattt tctactttaa attgtggatg aagaccctat ctcccacaaa taagttccca 11281 tttccttgtc taaagatctt tttttaagtg ttctgtggct gatttactaa cagtaactgc 11341 cattttttgt ctgtgataac agagtgattt gtaaaacagt ggttgttttt tcattgtgtt 11401 ttcttcgtgg attgtttttt ctgcgggtca tattcatacc ttctgatgaa gttgtacaac 11461 accagcaaca ttataatggc cctgtagctc tgaatgctat ttgtgtaact gaaaggttgc 11521 actctagggt gaaccaagct ataaaagccc atgcttaaat aaaaattatg tccaaaagcc // LOCUS HSU79725 2793 bp mRNA PRI 04-FEB-1997 DEFINITION Human A33 antigen precursor mRNA, complete cds. ACCESSION U79725 NID g1814276 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2793) AUTHORS Heath,J.K., White,S.J., Johnstone,C.N., Catimel,B., Simpson,R.J., Moritz,R.L., Tu,G.-F., Ji,H., Whitehead,R.H., Groenen,L.C., Scott,A.M., Ritter,G., Cohen,L., Welt,S., Old,L.J., Nice,E.C. and Burgess,A.W. TITLE The human A33 antigen is a transmembrane glycoprotein and a novel member of the immunoglobulin superfamily JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (2), 469-474 (1997) MEDLINE 97165045 REFERENCE 2 (bases 1 to 2793) AUTHORS Heath,J.K. and White,S.J. TITLE Direct Submission JOURNAL Submitted (26-NOV-1996) Melbourne Branch, Ludwig Institute for Cancer Research, Post Office Royal Melbourne Hospital, Parkville, Victoria 3050, Australia FEATURES Location/Qualifiers source 1..2793 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="colon carcinoma" /cell_line="LIM1215" 5'UTR 1..344 sig_peptide 345..407 CDS 345..1304 /note="intestine-specific antigen; novel member of the immunoglobulin superfamily; transmembrane protein; contains an extracellular domain, a transmembrane domain, and an intracellular domain" /codon_start=1 /product="A33 antigen precursor" /db_xref="PID:g1814277" /translation="MVGKMWPVLWTLCAVRVTVDAISVETPQDVLRASQGKSVTLPCT YHTSTSSREGLIQWDKLLLTHTERVVIWPFSNKNYIHGELYKNRVSISNNAEQSDASI TIDQLTMADNGTYECSVSLMSDLEGNTKSRVRLLVLVPPSKPECGIEGETIIGNNIQL TCQSKEGSPTPQYSWKRYNILNQEQPLAQPASGQPVSLKNISTDTSGYYICTSSNEEG TQFCNITVAVRSPSMNVALYVGIAVGVVAALIIIGIIIYCCCCRGKDDNTEDKEDARP NREAYEEPPEQLRELSREREEEDDYRQEEQRSTGRESPDHLDQ" mat_peptide 408..1301 /product="A33 antigen" 3'UTR 1305..2793 polyA_signal 2777..2782 BASE COUNT 620 a 781 c 730 g 662 t ORIGIN 1 ctaccccttt gtgagcagtc taggactttg tacacctgtt aagtagggag aaggcagggg 61 aggtggctgg tttaagggga acttgaggga agtagggaag actcctcttg ggacctttgg 121 agtaggtgac acatgagccc agccccagct cacctgccaa tccagctgag gagctcacct 181 gccaatccag ctgaggctgg gcagaggtgg gtgagaagag ggaaaattgc agggacctcc 241 agttgggcca ggccagaagc tgctgtagct ttaaccagac agctcagacc tgtctggagg 301 ctgccagtga caggttaggt ttagggcaga gaagaagcaa gaccatggtg gggaagatgt 361 ggcctgtgtt gtggacactc tgtgcagtca gggtgaccgt cgatgccatc tctgtggaaa 421 ctccgcagga cgttcttcgg gcttcgcagg gaaagagtgt caccctgccc tgcacctacc 481 acacttccac ctccagtcga gagggactta ttcaatggga taagctcctc ctcactcata 541 cggaaagggt ggtcatctgg ccgttttcaa acaaaaacta catccatggt gagctttata 601 agaatcgcgt cagcatatcc aacaatgctg agcagtccga tgcctccatc accattgatc 661 agctgaccat ggctgacaac ggcacctacg agtgttctgt ctcgctgatg tcagacctgg 721 agggcaacac caagtcacgt gtccgcctgt tggtcctcgt gccaccctcc aaaccagaat 781 gcggcatcga gggagagacc ataattggga acaacatcca gctgacctgc caatcaaagg 841 agggctcacc aacccctcag tacagctgga agaggtacaa catcctgaat caggagcagc 901 ccctggccca gccagcctca ggtcagcctg tctccctgaa gaatatctcc acagacacat 961 cgggttacta catctgtacc tccagcaatg aggaggggac gcagttctgc aacatcacgg 1021 tggccgtcag atctccctcc atgaacgtgg ccctgtatgt gggcatcgcg gtgggcgtgg 1081 ttgcagccct cattatcatt ggcatcatca tctactgctg ctgctgccga gggaaggacg 1141 acaacactga agacaaggag gatgcaaggc cgaaccggga agcctatgag gagccaccag 1201 agcagctaag agaactttcc agagagaggg aggaggagga tgactacagg caagaagagc 1261 agaggagcac tgggcgtgaa tccccggacc acctcgacca gtgacaggcc agcagcagag 1321 ggcggcggag gaagggttag gggttcattc tcccgcttcc tggcctccct tctcctttct 1381 aagccctgtt ctcctgtccc tccatcccag acattgatgg ggacatttct tccccagtgt 1441 cagctgtggg gaacatggct ggcctggtaa gggggtccct gtgctgatcc tgctgacctc 1501 actgtcctgt gaagtaaccc ctcctggctg tgacacctgg tgcgggcctg gccctcactc 1561 aagaccaggc tgcagcctcc acttccctcg tagttggcag gagctcctgg aagcacagcg 1621 ctgagcatgg ggcgctccca ctcagaactc tccagggagg cgatgccagc cttggggggt 1681 gggggctgtc ctgctcacct gtgtgcccag cacctggagg ggcaccaggt ggagggtttg 1741 cactccacac atctttcttg aatgaatgaa agaataagtg agtatgcttg ggccctgcat 1801 tggcctggcc tccagctccc actccctttc caacctcact tcccgtagct gccagtatgt 1861 tccaaaccct cctgggaagg ccacctccca ctcctgctgc acaggccctg gggagctttt 1921 gcccacacac tttccatctc tgcctgtcaa tatcgtacct gtccctccag gcccatctca 1981 aatcacaagg atttctctaa ccctatccta attgtccaca tacgtggaaa caatcctgtt 2041 actctgtccc acgtccaatc atgggccaca aggcacagtc ttctgagcga gtgctctcac 2101 tgtattagag cgccagctcc ttggggcagg gcctgggcct catggctttt gctttccctg 2161 aagccctagt agctggcgcc catcctagtg ggcacttaag cttaattggg gaaactgctt 2221 tgattggttg tgccttccct tctctggtct ccttgagatg atcgtagaca cagggatgat 2281 tcccacccaa acccacgtat tcattcagtg agttaaacac gaattgattt aaagtgaaca 2341 cacacaaggg agcttgcttg cagatggtct gagttcttgt gtcctggtaa ttcctctcca 2401 ggccagaata attggcatgt ctcctcaacc cacatggggt tcctggttgt tcctgcatcc 2461 cgatacctca gccctggccc tgcccagccc atttgggctc tggttttctg gtggggctgt 2521 cctgctgccc tcccacagcc tccttctgtt tgtcgagcat ttcttctact cttgagagct 2581 caggcagcgt tagggctgct taggtctcat ggaccagtgg ctggtctcac ccaactgcag 2641 tttactattg ctatcttttc tggatgatca gaaaaataat tccataaatc tattgtctac 2701 ttgcgatttt ttaaaaaatg tatattttta tatatattgt taaatccttt gcttcattcc 2761 aaatgctttc agtaataata aaattgtggg tgg // LOCUS HSU79734 4714 bp mRNA PRI 07-MAY-1997 DEFINITION Human huntingtin interacting protein (HIP1) mRNA, complete cds. ACCESSION U79734 NID g2072422 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4714) AUTHORS Kalchman,M.A., Koide,H.B., McCutcheon,K., Graham,R.K., Nichol,K., Nishiyama,K., Kazemi-Esfarjani,P., Lynn,F.C., Wellington,C., Metzler,M., Goldberg,Y.P., Kanazawa,I., Geitz,R.D. and Hayden,M.R. TITLE HIP1, a human homologue of S. cerevisiae Sla2p, interacts with membrane-associated huntingtin in the brain JOURNAL Nature Genet. 16 (1), 44-53 (1997) MEDLINE 97285121 REFERENCE 2 (bases 1 to 4714) AUTHORS Kalchman,M.A., Nichol,K., Graham,R.K., Geitz,R.D. and Hayden,M.R. TITLE Direct Submission JOURNAL Submitted (24-NOV-1996) Medical Genetics, University of British Columbia, #416-2125 East Mall, NCE Building, Vancouver, BC V6T 1Z4, Canada FEATURES Location/Qualifiers source 1..4714 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" /map="7q11.2" gene 245..2989 /gene="HIP1" CDS 245..2989 /gene="HIP1" /note="putative orf; similar to SLA2 Saccharomyces cerevisiae, encoded by Genbank Accession Number Z22811, and ZK370.3 protein in Caenorhabditis elegans, encoded by Genbank Accession Number M98552" /codon_start=1 /product="huntingtin interacting protein 1" /db_xref="PID:g2072423" /translation="MSRMWGHLSEGYGQLCSIYLKLLRTKMEYHTKNPRFPGNLQMSD RQLDEAGESDVNNFFQLTVEMFDYLECELNLFQTVFNSLDMSRSVSVTAAGQCRLAPL IQVILDCSHLYDYTVKLLFKLHSCLPADTLQGHRDRFMEQFTKLKDLFYRSSNLQYFK RLIQIPQLPENPPNFLRASALSEHISPVVVIPAEASSPDSEPVLEKDDLMDMDASQQN LFDNKFDDIFGSSFSSDPFNFNSQNGVNKDEKDHLIERLYREISGLKAQLENMKTESQ RVVLQLKGHVSELEADLAEQQHLRQQAADDCEFLRAELDELRRQREDTEKAQRSLSEI ERKAQANEQRYSKLKEKYSELVQNHADLLRKNAEVTKQVSMARQAQVDLEREKKELED SLERISDQGQRKTQEQLEVLESLKQELATSQRELQVLQGSLETSAQSEANWAAEFAEL EKERDSLVSGAAHREEELSALRKELQDTQLKLASTEESMCQLAKDQRKMLLVGSRKAA EQVIQDALNQLEEPPLISCAGSADHLLSTVTSISSCIEQLEKSWSQYLACPEDISGLL HSITLLAHLTSDAIAHGATTCLRAPPEPADSLTEACKQYGRETLAYLASLEEEGSLEN ADSTAMRNCLSKIKAIGEELLPRGLDIKQEELGDLVDKEMAATSAAIETATARIEEML SKSRAGDTGVKLEVNERILGCCTSLMQAIQVLIVASKDLQREIVESGRGTASPKEFYA KNSRWTEGLISASKAVGWGATVMVDAADLVVQGRGKFEELMVCSHEIAASTAQLVAAS KVKADKDSPNLAQLQQASRGVNQATAGVVASTISGKSQIEETDNMDFSSMTLTQIKRQ EMDSQVRVLELENELQKERQKLGELRKKHYELAGVAEGWEEGTEASPPTLQEVVTEKE " BASE COUNT 1230 a 1258 c 1250 g 976 t ORIGIN 1 cagcatcaat aaggccatta atacgcagga agtggctgta aaggaaaaac acgccagaac 61 gtgcatactg ggcacccacc atgagaaagg ggcacagacc ttctggtctg ttgtcaaccg 121 cctgcctctg tctagcaacg cagtgctctg ctggaagttc tgccatgtgt tccacaaact 181 cctccgagat ggacacccga acgtcctgaa ggactctctg agatacagaa atgaattgag 241 tgacatgagc aggatgtggg gccacctgag cgaggggtat ggccagctgt gcagcatcta 301 cctgaaactg ctaagaacca agatggagta ccacaccaaa aatcccaggt tcccaggcaa 361 cctgcagatg agtgaccgcc agctggacga ggctggagaa agtgacgtga acaacttttt 421 ccagttaaca gtggagatgt ttgactacct ggagtgtgaa ctcaacctct tccaaacagt 481 attcaactcc ctggacatgt cccgctctgt gtccgtgacg gcagcagggc agtgccgcct 541 cgccccgctg atccaggtca tcttggactg cagccacctt tatgactaca ctgtcaagct 601 tctcttcaaa ctccactcct gcctcccagc tgacaccctg caaggccacc gggaccgctt 661 catggagcag tttacaaagt tgaaagatct gttctaccgc tccagcaacc tgcagtactt 721 caagcggctc attcagatcc cccagctgcc tgagaaccca cccaacttcc tgcgagcctc 781 agccctgtca gaacatatca gccctgtggt ggtgatccct gcagaggcct catcccccga 841 cagcgagcca gtcctagaga aggatgacct catggacatg gatgcctctc agcagaattt 901 atttgacaac aagtttgatg acatctttgg cagttcattc agcagtgatc ccttcaattt 961 caacagtcaa aatggtgtga acaaggatga gaaggaccac ttaattgagc gactatacag 1021 agagatcagt ggattgaagg cacagctaga aaacatgaag actgagagcc agcgggttgt 1081 gctgcagctg aagggccacg tcagcgagct ggaagcagat ctggccgagc agcagcacct 1141 gcggcagcag gcggccgacg actgtgaatt cctgcgggca gaactggacg agctcaggag 1201 gcagcgggag gacaccgaga aggctcagcg gagcctgtct gagatagaaa ggaaagctca 1261 agccaatgaa cagcgatata gcaagctaaa ggagaagtac agcgagctgg ttcagaacca 1321 cgctgacctg ctgcggaaga atgcagaggt gaccaaacag gtgtccatgg ccagacaagc 1381 ccaggtagat ttggaacgag agaaaaaaga gctggaggat tcgttggagc gcatcagtga 1441 ccagggccag cggaagactc aagaacagct ggaagttcta gagagcttga agcaggaact 1501 tgccacaagc caacgggagc ttcaggttct gcaaggcagc ctggaaactt ctgcccagtc 1561 agaagcaaac tgggcagccg agttcgccga gctagagaag gagcgggaca gcctggtgag 1621 tggcgcagct catagggagg aggaattatc tgctcttcgg aaagaactgc aggacactca 1681 gctcaaactg gccagcacag aggaatctat gtgccagctt gccaaagacc aacgaaaaat 1741 gcttctggtg gggtccagga aggctgcgga gcaggtgata caagacgccc tgaaccagct 1801 tgaagaacct cctctcatca gctgcgctgg gtctgcagat cacctcctct ccacggtcac 1861 atccatttcc agctgcatcg agcaactgga gaaaagctgg agccagtatc tggcctgccc 1921 agaagacatc agtggacttc tccattccat aaccctgctg gcccacttga ccagcgacgc 1981 cattgctcat ggtgccacca cctgcctcag agccccacct gagcctgccg actcactgac 2041 cgaggcctgt aagcagtatg gcagggaaac cctcgcctac ctggcctccc tggaggaaga 2101 gggaagcctt gagaatgccg acagcacagc catgaggaac tgcctgagca agatcaaggc 2161 catcggcgag gagctcctgc ccaggggact ggacatcaag caggaggagc tgggggacct 2221 ggtggacaag gagatggcgg ccacttcagc tgctattgaa actgccacgg ccagaataga 2281 ggagatgctc agcaaatccc gagcaggaga cacaggagtc aaattggagg tgaatgaaag 2341 gatccttggt tgctgtacca gcctcatgca agctattcag gtgctcatcg tggcctctaa 2401 ggacctccag agagagattg tggagagcgg caggggtaca gcatccccta aagagtttta 2461 tgccaagaac tctcgatgga cagaaggact tatctcagcc tccaaggctg tgggctgggg 2521 agccactgtc atggtggatg cagctgatct ggtggtacaa ggcagaggga aatttgagga 2581 gctaatggtg tgttctcatg aaattgctgc tagcacagcc cagcttgtgg ctgcatccaa 2641 ggtgaaagct gataaggaca gccccaacct agcccagctg cagcaggcct ctcggggagt 2701 gaaccaggcc actgccggcg ttgtggcctc aaccatttcc ggcaaatcac agatcgaaga 2761 gacagacaac atggacttct caagcatgac gctgacacag atcaaacgcc aagagatgga 2821 ttctcaggtt agggtgctag agctagaaaa tgaattgcag aaggagcgtc aaaaactggg 2881 agagcttcgg aaaaagcact acgagcttgc tggtgttgct gagggctggg aagaaggaac 2941 agaggcatct ccacctacac tgcaagaagt ggtaaccgaa aaagaataga gccaaaccaa 3001 caccccatat gtcagtgtaa atccttgtta cctatctcgt gtgtgttatt tccccagcca 3061 caggccaaat ccttggagtc ccaggggcag ccacaccact gccattaccc agtgccgagg 3121 acatgcatga cacttcccaa agactccctc catagcgaca ccctttctgt ttggacccat 3181 ggtcatctct gttcttttcc cgcctcccta gttagcatcc aggctggcca gtgctgccca 3241 tgagcaagcc taggtacgaa gaggggtggt ggggggcagg gccactcaac agagaggacc 3301 aacatccagt cctgctgact atttgacccc cacaacaatg ggtatcctta atagaggagc 3361 tgcttgttgt ttgttgacag cttggaaagg gaagatctta tgccttttct tttctgtttt 3421 cttctcagtc ttttcagttt catcatttgc acaaacttgt gagcatcaga gggctgatgg 3481 attccaaacc aggacactac cctgagatct gcacagtcag aaggacggca ggagtgtcct 3541 ggctgtgaat gccaaagcca ttctccccct ctttgggcag tgccatggat ttccactgct 3601 tcttatggtg gttggttggg ttttttggtt ttgttttttt tttttaagtt tcactcacat 3661 agccaactct cccaaagggc acacccctgg ggctgagtct ccagggcccc ccaactgtgg 3721 tagctccagc gatggtgctg cccaggcctc tcggtgctcc atctccgcct ccacactgac 3781 caagtgctgg cccacccagt ccatgctcca gggtcaggcg gagctgctga gtgacagctt 3841 tcctcaaaaa gcagaaggag agtgagtgcc tttccctcct aaagctgaat cccggcggaa 3901 agcctctgtc cgcctttaca agggagaaga caacagaaag agggacaaga gggttcacac 3961 agcccagttc ccgtgacgag gctcaaaaac ttgatcacat gcttgaatgg agctggtgag 4021 atcaacaaca ctacttccct gccggaatga actgtccgtg aatggtctct gtcaagcggg 4081 ccgtctccct tggcccagag acggagtgtg ggagtgattc ccaactcctt tctgcagacg 4141 tctgccttgg catcctcttg aataggaaga tcgttccact ttctacgcaa ttgacaaacc 4201 cggaagatca gatgcaattg ctcccatcag ggaagaaccc tatacttggt ttgctaccct 4261 tagtatttat tactaacctc ccttaagcag caacagccta caaagagatg cttggagcaa 4321 tcagaacttc aggtgtgact ctagcaaagc tcatctttct gcccggctac atcagccttc 4381 aagaatcaga agaaagccaa ggtgctggac tgttactgac ttggatccca aagcaaggag 4441 atcatttgga gctcttgggt cagagaaaat gagaaaggac agagccagcg gctccaactc 4501 ctttcagcca catgccccag gctctcgctg ccctgtggac aggatgagga cagagggcac 4561 atgaacagct tgccagggat gggcagccca acagcacttt tcctcttcta gatggacccc 4621 agcatttaag tgaccttctg atcttgggaa aacagcgtct tccttcttta tctatagcaa 4681 ctcattggtg gtagccatca agcacttcgg aatt // LOCUS HSU79745 2211 bp mRNA PRI 03-FEB-1998 DEFINITION Homo sapiens monocarboxylate transporter homologue MCT6 mRNA, complete cds. ACCESSION U79745 NID g2463631 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2211) AUTHORS Price,N.T., Jackson,V.N. and Halestrap,A.P. TITLE Cloning and sequencing of four new mammalian monocarboxylate transporter (MCT) homologues confirms the existence of a transporter family with an ancient past JOURNAL Biochem. J. 329 (2), 321-328 (1998) REFERENCE 2 (bases 1 to 2211) AUTHORS Price,N.T., Jackson,V.N. and Halestrap,A.P. TITLE Direct Submission JOURNAL Submitted (26-NOV-1996) Cellular Biochemistry, Hannah Research Institute, Mauchline Road, Ayr KA6 5HL, UK FEATURES Location/Qualifiers source 1..2211 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="circulating blood" /sex="male" /dev_stage="24-34 years" /note="cDNA library prepared from 4 individuals" CDS 166..1737 /codon_start=1 /product="monocarboxylate transporter homologue MCT6" /db_xref="PID:g2463632" /translation="MTQNKLKLCSKANVYTEVPDGGWGWAVAVSFFFVEVFTYGIIKT FGVFFNDLMDSFNESNSRISWIISICVFVLTFSAPLATVLSNRFGHRLVVMLGGLLVS TGMVAASFSQEVSHMYVAIGIISGLGYCFSFLPTVTILSQYFGKRRSIVTAVASTGEC FAVFAFAPAIMALKERIGWRYSLLFVGLLQLNIVIFGALLRPIIIRGPASPKIVIQEN RKEAQYMLENEKTRTSIDSIDSGVELTTSPKNVPTHTNLELEPKADMQQVLVKTSPRP SEKKAPLLDFSILKEKSFICYALFGLFATLGFFAPSLYIIPLGISLGIDQDRAAFLLS TMAIAEVFGRIGAGFVLNREPIRKIYIELICVILLTVSLFAFTFATEFWGLMSCSIFF GFMVGTIGGLTFHCLLKMMSWALQKMSSAAGVYIFIQSIAGLAGPPLAGLLVDQSKIY SRAFYSCAAGMALAAVCLALVRPCKMGLCQRHHSGETKVVSHRGKTLQDIPEDFLEMD LAKNEHRVHVQMEPV" BASE COUNT 564 a 506 c 497 g 644 t ORIGIN 1 ttgggggttt attctcttcc cttctaactt gacagggtct tgctctgtca ttcaggcaag 61 agtgcagtag tgtgatcact tcttactgcc gcctcaagct tccagcctca actcaagcaa 121 tcctcccacc tcagccaccc aagtggctgg gactacagat taagaatgac ccaaaataaa 181 ttaaagcttt gttccaaagc caatgtgtat actgaagtgc ctgatggagg atggggctgg 241 gcggtagctg tttcattttt cttcgttgaa gtcttcacct acggcatcat caagacattt 301 ggtgtcttct ttaatgactt aatggacagt tttaatgaat ccaatagcag gatctcatgg 361 ataatctcaa tctgtgtgtt tgtcttaaca ttttcagctc ccctcgccac agtcctgagc 421 aatcgtttcg gacaccgtct ggtagtgatg ttgggggggc tacttgtcag caccgggatg 481 gtggccgcct ccttctcaca agaggtttct catatgtacg tcgccatcgg catcatctct 541 ggtctgggat actgctttag ttttctccca actgtaacca tcctatcaca atattttggc 601 aaaagacgtt ccatagtcac tgcagttgct tccacaggag aatgtttcgc tgtgtttgct 661 ttcgcaccag caatcatggc tctgaaggag cgcattggct ggagatacag cctcctcttc 721 gtgggcctac tacagttaaa cattgtcatc ttcggagcac tgctcagacc catcattatc 781 agaggaccag cgtcaccgaa aatagtcatc caggaaaatc ggaaagaagc gcagtatatg 841 cttgaaaatg agaaaacacg aacctcaata gactccattg actcaggagt agaactaact 901 acctcaccta aaaatgtgcc tactcacact aacctggaac tggagccgaa ggccgacatg 961 cagcaggtcc tggtgaagac cagccccagg ccaagcgaaa agaaagcccc gctattagac 1021 ttctccattt tgaaagagaa aagttttatt tgttatgcat tatttggtct ctttgcaaca 1081 ctgggattct ttgcaccttc cttgtacatc attcctctgg gcattagtct gggcattgac 1141 caggaccgcg ctgctttttt attatctacg atggccattg cagaagtttt cggaaggatc 1201 ggagctggtt ttgtcctcaa cagggagccc attcgtaaga tttacattga gctcatctgc 1261 gtcatcttat tgactgtgtc tctgtttgcc tttacttttg ctactgaatt ctggggtcta 1321 atgtcatgca gcatattttt tgggtttatg gttggaacaa taggaggact cacattccac 1381 tgcttgctga agatgatgtc gtgggcattg cagaagatgt cttctgcagc tggggtctac 1441 atcttcattc agagcatagc aggactggct ggaccgcccc ttgcaggttt gttggtggac 1501 caaagtaaga tctacagcag ggccttctac tcctgcgcag ctggcatggc cctggctgct 1561 gtgtgcctcg ccctggtgag accgtgtaag atgggactgt gccagcgtca tcactcaggt 1621 gaaacaaagg tagtgagcca tcgtgggaag actttacagg acatacctga agactttctg 1681 gaaatggatc ttgcaaaaaa tgagcacaga gttcacgtgc aaatggagcc ggtatgacac 1741 actttcttac aacaacagcc actgtgttgg ctggagaggg atggggtggg cccaacgggg 1801 acacaaggag gcagaggagc taacccctct actccacttt caaaactaca ttttaaaggg 1861 aatgtgtatg tgaagagcac taccaacatc gcttttgttt tgttttgttt tgttttaagc 1921 tttttttttt tgcttgtttt taaagccaaa acaaaaaaca accaagcact cttccatata 1981 taaatctggc tgtattcagt agcaatacaa gagatatgta gaaagactct ttggttcaca 2041 ttccgatatt aaaatagtga catgaactgg caaagtggtt ttaaaagctt tcacgtggga 2101 taaatgattt tctttttttc ttttctttct tcctatggtc ttgtctgaat aaactactct 2161 cctgaataaa acaacatcca acccaggtca ttgaaatgaa attggccagt c // LOCUS HSU79751 2730 bp mRNA PRI 16-JUL-1997 DEFINITION Human basic-leucine zipper nuclear factor (JEM-1) mRNA, complete cds. ACCESSION U79751 NID g2257753 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2730) AUTHORS Duprez,E., Tong,J.-H., Berger,R., Chen,Z. and Lanotte,M. TITLE Direct Submission JOURNAL Submitted (26-NOV-1996) INSERM U-301, Hopital Saint-Louis, 1 rue Claude Vellefaux, Paris 75010, France FEATURES Location/Qualifiers source 1..2730 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1q24" /cell_line="t(15;17) acute promyelocytic leukemia NB4" protein_bind 242..245 /bound_moiety="SP1" gene 404..1606 /gene="JEM1" CDS 404..1606 /gene="JEM1" /note="Jem-1" /codon_start=1 /product="basic-leucine zipper nuclear factor" /db_xref="PID:g2257754" /translation="MTTKNLETKVTVTSSPIRGAGDGMETEEPPKSVEVTSGVQSRKH HSLQSPWKKAVPSESPGVLQLGKMLTEKAMEVKAVRILVPKAAITHDIPNKNTKVKSL GHHKGEFLGQSEGVIEPNKELSEVKNVLEKLKNSERRLLQDKEGLSNQLRVQTEVNRE LKKLLVASVGDDLQYHFERLAREKNQLILENEALGRNTAQLSEQLERMSIQCDVWRSK FLASRVMADELTNSRAALQRQNRDAHGAIQDLLSEREQFRQEMIATQKLLEELLVSLQ WGREQTYSPSVQPHSTAELALTNHKLAKAVNSHLLGNVGINNQKKIPSTVEFCSTPAE KMAETVLRILDPVTCKESSPDNPFFESSPTTLLATKKNIGRFHPYTRYENITFNCCNH CRGELIAL" misc_feature 794..880 /gene="JEM1" /note="encodes leucine basic domain" misc_feature 920..922 /gene="JEM1" /note="encodes leucine zipper domain" misc_feature 941..943 /gene="JEM1" /note="encodes leucine zipper domain" misc_feature 962..964 /gene="JEM1" /note="encodes leucine zipper domain" misc_feature 983..985 /gene="JEM1" /note="encodes leucine zipper domain" misc_feature 1004..1006 /gene="JEM1" /note="encodes leucine zipper domain" misc_feature 1451..1513 /gene="JEM1" /note="encodes PEST motif" BASE COUNT 927 a 483 c 543 g 777 t ORIGIN 1 aagtttaagt gaatatgcgg cttgggctcc aaaagttgct gtacggtatt ttattttaaa 61 gtagaaatct gtccgctttt cacttacgga ggccttacag tgtgtgttct gtgttacttg 121 agtttatacg cactgcagaa tttgtttatc ctgctctttt ggaaacgtat tccggaagcg 181 aaaccctgag taatcggaag tggttaggag tgagagagct gctggatatg cggagggact 241 gggcgggtcg gcttccgaat ggaagaggtc tgtgagaagt taacctggtg ataccgatcc 301 gaagagccta tcaagtgaag ccccctgaaa tacggagaat aagaatctta gaggttgttc 361 agcagaagtc ttggagtgca ttttcagtgg ttaaggtgaa aaaatgacta ctaaaaattt 421 agaaaccaaa gtcaccgtta cttcatcccc aatccgagga gcaggagatg gaatggaaac 481 tgaggaacca cctaaatctg ttgaagttac ctccggagtc caatctagaa agcatcatag 541 tcttcagagt ccatggaaga aagcagttcc atcagagagc ccaggagttc ttcagctagg 601 gaaaatgctc actgaaaaag caatggaagt taaagctgta agaatattag ttcccaaagc 661 tgctataact catgatatcc ccaacaaaaa tacaaaggtt aagtctctgg gacatcataa 721 aggagaattc cttggtcagt cagagggagt tatagaacct aataaggaac tctcagaggt 781 aaagaatgta ttggaaaagc tcaagaattc tgaaagaagg ttactacagg acaaagaagg 841 tctttcaaac cagctccgtg tacagacaga ggtaaatcgt gagttaaaaa agttactggt 901 ggcttctgtt ggggatgatc ttcagtatca ctttgaacgt ctagcccgtg agaaaaatca 961 gcttatttta gaaaatgaag ccctaggtcg aaacacagct cagctttctg aacagttaga 1021 acgtatgtca atacagtgtg atgtatggcg aagtaaattc cttgcaagca gggtaatggc 1081 agatgagtta accaactcaa gagcagcttt acagcgtcaa aaccgtgatg cacacggggc 1141 tatacaagat ctcctaagtg aacgggaaca gtttcgtcaa gaaatgatag ctacccagaa 1201 attattggag gagctcttag tttccttgca atggggaaga gagcaaactt actcccctag 1261 tgtacaaccc cacagcacag cagagctagc attaacaaat cacaagttgg caaaagcagt 1321 aaattctcat cttctgggaa atgttggcat taacaatcaa aaaaagattc catcaacagt 1381 tgaattctgc agcaccccag ctgagaaaat ggctgaaacg gttctaagaa ttttagatcc 1441 agttacctgc aaagagagtt cacctgataa tccatttttt gagtcttcac caaccacctt 1501 acttgctaca aagaaaaata ttggacgatt tcatccctat actagatatg aaaatataac 1561 tttcaattgc tgcaatcact gccggggaga actgattgcc ctttaacagt caatatgttg 1621 gaggcatgct aaggtacttc cttattaccc aagagtcatt attatttggg agctggggtt 1681 cttacaatgc tagaaataat atcacttcct atttacataa tgtatacacc caaagatatt 1741 ttatgtacta gactccagat taccctttct taataaatat ctcagggtaa ggaaagaaag 1801 aaactgtata gatatattta aaatagagaa tactttccaa gcaatacatg atacttttcc 1861 taaaagactc taaaagaaaa agattctgta actctctttt agcaccaaat tattgtttat 1921 cttgctggat attttatatg aacagtgtta atttagatgc actaaagcaa aggtaggcaa 1981 actacaacca tgagtcaaac atggccacac ccattcattt gctattgtct aagcctggtt 2041 ttggccacta caactgcaga gttgaataga tgcagcagat cctttacaga aaagttttct 2101 gacctcaatt ctaaagtaat tgtagtaggg aggctggagg actttctttc cctttatggt 2161 aattttttga gctacaaaag aggccttgca gaaatggggt gaagggatta atcttttaaa 2221 aataaatgct atatattagg aaaataaaaa atattttaga gccaagttaa caagtacttc 2281 aggaaaacat gctagtttta tgcagggcat tctgtattcc aaatggatac aatccgacat 2341 atataaaaga aacagattct taactattga ctcttattta gcaaatgcaa cagacaagaa 2401 tatccaactt gatatttata aaaggtagac tttttccaaa agtgtataag ctcaaagaaa 2461 aaatgcaacc tgtcaattaa tatatactat gtaatatata ttattgtgta tttatgatta 2521 gccatcataa atgcccattg cttggccttt aagaataatc acaaaatatt tatattaaat 2581 tatacaaatt tgttgcagaa gtgcctgtga gagaaatctt caaaagacaa acctggtcaa 2641 ataataataa ttttaatgtc aatgattttt tttgtctgac tcatctgagt tatatttagt 2701 tttcaagtgg caataaattt atctaccttc // LOCUS HSU79775 1771 bp mRNA PRI 17-JUL-1997 DEFINITION Homo sapiens NNP-1 (NNP-1) mRNA, complete cds. ACCESSION U79775 NID g2258273 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1771) AUTHORS Jansen,E., Meulemans,S.M., Orlans,I.C. and Van de Ven,W.J. TITLE The NNP-1 gene (D21S2056E), which encodes a novel nuclear protein, maps in close proximity to the cystatin B gene within the EPM1 and APECED critical region on 21q22.3 JOURNAL Genomics 42 (2), 336-341 (1997) MEDLINE 97336061 REFERENCE 2 (bases 1 to 1771) AUTHORS Jansen,E. TITLE Direct Submission JOURNAL Submitted (27-NOV-1996) Center for Human Genetics, University of Leuven, Herestraat 49, Leuven B-3000, Belgium FEATURES Location/Qualifiers source 1..1771 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" /dev_stage="fetus" /clone_lib="Clontech HL5004b" /chromosome="21" /map="21q22.3; NNP-1 is telomeric to the cystatin B gene within the EPM1 critical region" gene 39..1424 /gene="NNP-1" CDS 39..1424 /gene="NNP-1" /note="novel nuclear protein" /codon_start=1 /product="NNP-1" /db_xref="PID:g2258274" /translation="MVSRVQLPPEIQLAQRLAGNEQVTRDRAVRKLRKYIVARTQRAA GGFTHDELLKVWKGLFYCMWMQDKPLLQEELGRTISQLVHAFQTTEAQHLFLQAFWQT MNREWTGIDRLRLDKFYMLMRMVLNESLKVLKMQGWEERQIEELLELLMTEILHPSSQ APNGVKSHFIEIFLEELTKVGAEELTADQNLKFIDPFCRIAARTKDSLVLNNITRGIF ETIVEQAPLAIEDLLNELDTQDEEVASDSDESSEGGERGDALSQKRSEKPPAGSICRA EPEAGEEQAGDDRDSGGPVLQFDYEAVANRLFEMASRQSTPSQNRKRLYKVIRKLQDL AGGIFPEDEIPEKACRRLLEGRRQKKTKKQKRLLRLQQERGKGEKEPPSPGMERKRSR RRGVGADPEARAEAGEQPGTAERALLRDQPRGRGQRGARQRRRTPRPLTSARAKAANV QEPEKKKKRRE" BASE COUNT 425 a 458 c 605 g 283 t ORIGIN 1 ggcgactccg ggaacagggg gtctcggccg tcggcgtcat ggtttcgcgc gtgcagctcc 61 cgcctgagat ccagctggct cagcgcctgg cggggaatga gcaggtgacc cgggaccggg 121 cggtgaggaa gctccggaaa tacatcgtcg ccaggactca gcgggccgca ggtggtttta 181 cgcacgacga gctgctgaag gtgtggaaag gactgtttta ttgcatgtgg atgcaggaca 241 agccactcct ccaggaagaa ttaggaagga ctatttccca gctcgttcat gcttttcaga 301 ccacggaggc gcagcacctg ttccttcagg ccttctggca gaccatgaat cgcgagtgga 361 cgggcattga caggctgcgc ctggataaat tctacatgct catgcggatg gtcctgaacg 421 agtccttgaa ggttctgaag atgcaaggct gggaagaaag acagatcgag gagctgctag 481 agctgctgat gactgagatc ctgcacccca gcagccaggc ccccaacggt gtgaagagcc 541 acttcatcga gatcttcctg gaggagctga ccaaagtggg cgccgaggag cttacggcag 601 accagaacct gaagttcatc gaccccttct gcagaattgc tgcccggacc aaggattcct 661 tggttttgaa caacatcact cgaggcatct ttgagacgat tgtggagcag gccccgcttg 721 ccattgaaga cctcctgaat gaactggaca cacaggatga ggaggtggcg tcggacagtg 781 atgagtcctc tgagggtggt gagcgtggag acgcgctgtc ccagaagagg tctgagaagc 841 cgcccgcagg ctccatctgc agggctgaac ctgaggctgg tgaggagcag gcaggtgacg 901 acagggacag tggcggcccc gttctccagt ttgactacga ggcagttgct aacagactgt 961 ttgaaatggc cagccgccag agcacccctt ctcagaacag aaagcgtctc tacaaagtga 1021 tccggaagct gcaggacctg gcaggaggca ttttccctga agatgagatc ccagagaagg 1081 cctgcaggcg cctgcttgaa gggaggcggc agaagaagac gaagaagcag aagcgtctgc 1141 tcaggttgca gcaggagaga gggaaaggtg agaaggagcc cccgagcccg ggcatggaga 1201 ggaagaggag caggaggagg ggtgtagggg ccgaccccga ggcgcgggca gaggctggtg 1261 agcagccagg cacagctgag cgggccctgc tccgagatca gcccaggggc cgtggccaga 1321 gaggggctcg ccagagaagg aggacacctc ggcccctgac cagtgcccga gcaaaggcgg 1381 ccaatgtcca ggagccggag aagaagaaga aacgcaggga gtgatgtggc cgggccaagg 1441 acaggcaggg agggaggcca ggacctcgct tgcaccgcgg gacgaggctg accgggctgt 1501 tctgtagact caggaccgtg gctccagaac tctgtgccag gcgggaggga agggcggcac 1561 tggagagatg ggcccatcat taggggccag catcccagga actggacctt tccccagagc 1621 ctccgcctgt ggctgtgatg accttgggcc agaaggtcaa actccgaaga ctgaaactct 1681 gcctgcagca ggactggccg cccctgctgt ggggggttca gaaaataaaa tgccgcgcag 1741 gccttgcaag ggaaaaaaaa aaaaaaaaaa a // LOCUS HSU80034 2392 bp mRNA PRI 22-APR-1997 DEFINITION Human mitochondrial intermediate peptidase precursor (MIPEP) mRNA, mitochondrial gene encoding mitochondrial protein, complete cds. ACCESSION U80034 NID g1763641 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2392) AUTHORS Chew,A., Buck,E.A., Peretz,S., Sirugo,G., Rinaldo,P. and Isaya,G. TITLE Cloning, expression, and chromosomal assignment of the human mitochondrial intermediate peptidase gene (MIPEP) JOURNAL Genomics 40 (3), 493-496 (1997) MEDLINE 97230465 REFERENCE 2 (bases 1 to 2392) AUTHORS Chew,A., Buck,E.A., Peretz,S., Rinaldo,P. and Isaya,G. TITLE Direct Submission JOURNAL Submitted (27-NOV-1996) Genetics, Yale University School of Medicine, 333 Cedar Street, New Haven, CT 06510, USA FEATURES Location/Qualifiers source 1..2392 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="13" /map="13q12" /tissue_type="liver" CDS 75..2216 /gene="MIPEP" /EC_number="3.4.24.59" /function="maturation of octapeptide-containing precursors" /note="HMIP; mitochondrial protein precursor homolog; similar to rat MIP, yeast MIP1, and Schizophyllum commune SMIP genes, GenBank Accession Numbers M96633, U10243 and L43072 respectively" /codon_start=1 /product="mitochondrial intermediate peptidase precursor" /db_xref="PID:g1763642" /translation="MLCVGRLGGLGARAAALPPRRAGRGSLEAGIRARRVSTSWSPVG AAFNVKPQGSRLDLFGERARLFGVPELSAPEGFHIAQEKALRKTELLVDRACSTPPGP QTVLIFDELSDSLCRVADLADFVKIAHPEPAFREAAEEACRSIGTMVEKLNTNVDLYQ SLQKLLADKKLVDSLDPETRRVAELFMFDFEISGIHLDKQKRKRAVDLNVKILDLSST FLMGTNFPNKIEKHLLPEHIRRNFTSAGDHIIIDGLHAESPDDLVREAAYKIFLYPNA GQLKCLEELLSSRDLLAKLVGYSTFSHRALQGTIAKNPETVMQFLEKLSDKLSERTLK DFEMIRGMKMKLNAQNSEVMPWDPPYYSGVIRAERYNIEPSLYCPFFSLGACMEGLNI LLNRLLGISLYAEQPAKGEVWSEDVRKLAVVHESEGLLGYIYCDFFQRADKPHQDCHF TIRGGRLKEDGDYQLPLVVLMLNLPRSSRSSPTLLTPGMMENLFHEMGHAMHSMLGRT RYQHVTGTRCPTDFAEVPSILMEYFANDYRVVNQFARHYQTGQPLPKNMVSRLCESKK VCAAADMQLQVFYATLDQIYHGKHPLRNSTTDILKETQEKFYGLPYVPNTAWQLRFSH LVGYGARYYSYLMSRAVASMVWKECFLQDPFNRAAGERYRREMLAHGGGREPMLMVEG MLQKCPSVDDFVSALVSDLDLDFETFLMDSE" sig_peptide 75..179 /gene="MIPEP" gene 75..2216 /gene="MIPEP" mat_peptide 180..2213 /gene="MIPEP" /product="mitochondrial intermediate peptidase" BASE COUNT 646 a 518 c 586 g 642 t ORIGIN 1 gcggagcgcg cgctcccagc gaaagcagca gggcagggat ctgcgttgga ggaagggact 61 gctctggtgc tagaatgctg tgcgtcggaa ggctgggcgg cttgggagcc agagcagcag 121 ctctgccgcc ccgccgggcg ggccggggaa gcctcgaagc cgggatccgg gcccgaaggg 181 tcagcaccag ctggtctccc gtgggcgccg ccttcaatgt caagccccag ggcagccgct 241 tggacctgtt cggcgagcgg gcgcgtcttt ttggagttcc tgagctgagt gccccagaag 301 gatttcatat tgcacaagaa aaagccttga gaaagacaga attgcttgtg gaccgtgcat 361 gttccacccc acctgggccc cagaccgtgc tgatcttcga tgagctctcg gattccttat 421 gcagagtggc cgacttggct gattttgtga aaatcgctca ccctgagcca gcattcagag 481 aagctgcgga agaagcttgt agaagtattg gcaccatggt agagaagttg aacacaaatg 541 tggatttata tcaaagtttg caaaaattac tagctgataa aaaacttgtg gattcccttg 601 atccagaaac aaggcgagtg gctgaactgt ttatgtttga ttttgaaatt agtggaatcc 661 atctagacaa acaaaagcgt aaaagagcag tggacctcaa tgttaaaatc ttggatttga 721 gtagtacatt tcttatggga accaattttc ccaacaagat tgagaagcat ctcttaccag 781 aacacattcg tcgtaacttt acatctgctg gggatcatat cataattgat ggtctccacg 841 cagaatcacc agatgacttg gtgcgagaag ctgcttataa aatttttctt tatcccaatg 901 ctggtcaatt gaaatgttta gaagaattgc tcagcagcag agatcttctg gcaaagttgg 961 tggggtattc cacgttttct cacagggctc tccaaggaac gatagctaaa aatccagaga 1021 ctgtcatgca gttccttgaa aaactatctg acaaactttc tgaaagaact ctgaaagatt 1081 ttgagatgat acgagggatg aaaatgaaac tgaatgctca aaattccgaa gtaatgccct 1141 gggacccccc ttactacagt ggtgtgattc gtgcagaaag gtataatatt gagcccagcc 1201 tatattgccc gtttttctct cttggagcat gcatggaagg cctgaatatt ttgcttaaca 1261 gactgttggg gatttcatta tatgcagagc agcctgcaaa aggagaggtg tggagcgaag 1321 atgtccgaaa actggctgtt gttcatgaat ctgaaggatt gttggggtac atttactgtg 1381 atttttttca gcgagcagac aaaccacatc aggattgcca tttcactatc cgtggaggca 1441 gactaaagga agatggagac tatcaactcc cacttgtagt tcttatgctg aatcttcccc 1501 gttcctcaag gagttctcca actttgctaa ctcctggcat gatggaaaat cttttccatg 1561 aaatgggaca tgccatgcat tcaatgctag gacgtactcg ttaccaacac gtcactggga 1621 ccaggtgccc tactgatttt gctgaggttc cttctattct gatggagtac tttgcaaatg 1681 attatcgagt agttaaccaa tttgccagac attatcagac tggacagcca ctgccaaaaa 1741 atatggtgtc tcgtctttgt gaatctaaaa aggtttgtgc tgcagctgat atgcaacttc 1801 aggtctttta tgccactctg gatcaaatct accatgggaa gcatcccctg aggaattcaa 1861 ccacagacat tctcaaggaa acacaagaga aattctatgg cctaccatat gttccaaata 1921 ctgcctggca gctgcgattc agccacctcg tggggtatgg tgctagatat tactcttacc 1981 tcatgtccag agcggtcgcc tccatggttt ggaaggagtg ttttctacag gatcctttca 2041 acagggctgc cggggagcgc tatcgcaggg agatgctggc ccacggtgga ggcagggagc 2101 ccatgctcat ggttgaaggt atgcttcaga agtgtccttc tgttgatgac ttcgtaagtg 2161 ccctcgtttc cgacttggat ctggacttcg aaactttcct catggattct gaataaaaga 2221 aacactctac acctctaatc aaggtcatgt agtaatgact ttgttataaa tgctacagct 2281 gtgagagctt gtttctgatt gtttcattgt tcgcttctgt aattctgaaa aactttaaac 2341 tggtagaact tggaataaat aatttgtttt aattaaaaaa aaaaaaaaaa aa // LOCUS HSU80040 2738 bp mRNA PRI 11-DEC-1996 DEFINITION Human nuclear aconitase mRNA, encoding mitochondrial protein, complete cds. ACCESSION U80040 NID g1718501 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2738) AUTHORS Juang,H.H. and Chiou,B. TITLE Cloning and structural characterization of human mitochondrial aconitase JOURNAL Unpublished REFERENCE 2 (bases 1 to 2738) AUTHORS Juang,H.H. and Chiou,B. TITLE Direct Submission JOURNAL Submitted (27-NOV-1996) Anatomy, Chang Gung College of Medicine and Technology, 259 Web-Hya 1st Road, Kwei-Shan, Tao-Yuan 333, Taiwan, ROC REFERENCE 3 (bases 1 to 2738) AUTHORS Juang,H.H. TITLE Direct Submission JOURNAL Submitted (09-DEC-1996) Anatomy, Chang Gung College of Medicine and Technology, 259 Web-Hya 1st Road, Kwei-Shan, Tao-Yuan 333, Taiwan, ROC REMARK Protein and nucleotide sequence update by submitter FEATURES Location/Qualifiers source 1..2738 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="skeletal muscle" /sex="female" CDS 21..2363 /codon_start=1 /product="mitochondrial aconitase" /db_xref="PID:g1718502" /translation="MAPYSLLVTRLQKALGVRQYHVASVLCQRAKVAMTHFEPNEYIH YDLLEKNINIVRKRLNRPLTLSEKIVYGHLDDPASQEIERGKSYLRLRPDRVAMQDAT AQMAMLQFISSGLSKVAVPSTIHCDHLIEAQVGDEKDLRRAKDINQEVYNFLATAGDK YGVGFWSPGSGIIHQIILENYAYPGVLLIGTDSHTPNGGGLGGICIGVGGADAVDVMA GIPWELKCPKVIGVKLTGSLSGWTSPKDVILKVAGILTVKGGTGAIVEYHGPGVDSMS CTGMATICNMGAEIGATTSVFPYNHRMKKYLSKTGREDIANLADEFKDHLVPDPGCHY DQLIEINLSELKPHINGPFTPDLAHPVAEVGKVAEKEGWPLDIRVGLIGSCTNSSYED MGRSAAVAKQALAHGLKCKSQFTITPGSEQIRATIERDGYAQILRDLGGIVLANACGP CIGQWDRKDIKKGEKNTIVTSYNRNFTGRNDANPETHAFVTSPEIVTALAIAGTLKFN PETDYLTGKDGKKFRLEAPDADELPKGEFDPGQDTYQHPPKDSSRQHVDVSPTSQRLQ LLEPFDKWDGKDLEDLQILIKVKGKCTTDHISAAGPWLKFRGHLDNISNNLLIGAINI ENGKANSVRNAVTQEFGPVPDTARYYKKHGIRWVVIGDENYGEGSSREHAALEPRHLG GRAIITKSFARIHETNLKKQGLLPLTFADPADYNKIHPVDKLTIQGLKDFTPGKPLKC IIKHPNGTQETILLNHTFNETQIEWFRAGSALNRMKELQQ" BASE COUNT 673 a 770 c 747 g 548 t ORIGIN 1 tatctttgtc agtgcacaaa atggcgccct acagcctact ggtgactcgg ctgcagaaag 61 ctctgggtgt gcggcagtac catgtggcct cagtcctgtg ccaacgggcc aaggtggcga 121 tgacgcattt tgagcccaac gagtacatcc attatgacct gctagagaag aacattaaca 181 ttgttcgcaa acgactgaac cggccgctga cactctcgga gaagattgtg tatggacacc 241 tggatgaccc cgccagccag gaaattgagc gaggcaagtc gtacctgcgg ctgcggcccg 301 accgtgtggc catgcaggat gcgacggccc agatggccat gctccagttc atcagcagcg 361 ggctgtccaa ggtggctgtg ccatccacca tccactgtga ccatctgatt gaagcccagg 421 ttggggacga gaaagacctg cgccgggcca aggacatcaa ccaggaagtt tataatttcc 481 tggcaactgc aggtgacaag tatggcgtgg gcttctggag ccctggatct ggaatcattc 541 accagattat tctcgaaaac tatgcgtacc ctggagttct tctgattgga actgactccc 601 acacccccaa tggtggtggc ctaggaggca tctgcattgg agtagggggt gcagatgctg 661 tggatgtcat ggctgggatc ccctgggagc tgaagtgccc caaggtgatt ggcgtgaagc 721 tgacgggctc tctctccggt tggacctcac ccaaagatgt gatcctgaag gtggcaggca 781 tcctcacggt gaaaggtggc acaggtgcaa tcgtggaata ccacgggcct ggtgtagact 841 ccatgtcctg cactggcatg gcgacaatct gcaacatggg tgcagaaatt ggggccacca 901 cttccgtgtt cccttacaac cacaggatga agaagtacct gagcaagacc ggccgggaag 961 acattgccaa tctagctgat gaattcaagg atcacttggt gcctgaccct ggctgccatt 1021 atgaccaact aattgaaatt aacctcagtg agctgaagcc acacatcaat gggcccttca 1081 cccctgacct ggctcaccct gtggcagaag tgggcaaggt ggcagagaag gaaggatggc 1141 ctctggacat ccgagtgggt ctaattggta gctgcaccaa ttcaagctat gaagatatgg 1201 ggcgctcagc agctgtggcc aagcaggcac tggcccatgg actcaagtgc aagtcccagt 1261 tcaccatcac tccaggttcc gagcagatcc gcgccaccat tgagcgggac ggctatgcac 1321 agatcctgag ggatctgggt ggcattgtcc tggccaatgc ctgcgggccc tgcattggcc 1381 agtgggacag aaaggacatc aagaaggggg agaagaacac aatcgtcacc tcctacaaca 1441 ggaacttcac gggccgcaac gacgcaaacc ccgagaccca tgcctttgtc acgtccccag 1501 agattgtcac agccctggcc attgcaggaa ccctcaagtt caacccagag accgactacc 1561 tgacaggcaa ggatggcaag aagttcaggc tggaagctcc ggatgcagat gagcttccca 1621 aaggggagtt tgacccaggg caggacacct accagcaccc cccaaaggac agcagcaggc 1681 agcatgtgga cgtgagcccc accagccagc gcctgcagct cctggagcct tttgacaagt 1741 gggatggcaa ggacctggag gacctgcaga tcctcatcaa ggtcaaaggg aagtgtacca 1801 ctgaccacat ctcagctgct ggcccctggc tcaagttccg tgggcacttg gataacatct 1861 ccaacaacct gctcattggt gccatcaaca ttgaaaacgg caaggccaac tccgtgcgca 1921 atgccgtcac tcaggagttt ggccccgtcc ctgacactgc ccgctactac aagaaacatg 1981 gcatcaggtg ggtggtgatc ggagacgaga actacggcga gggctcgagc cgggagcatg 2041 cagctctgga gcctcgccac cttgggggcc gggccatcat caccaagagc tttgccagga 2101 tccacgagac caacctgaag aaacagggcc tgctgcctct gaccttcgct gacccggctg 2161 actacaacaa gattcaccct gtggacaagc tgaccattca gggcctgaag gacttcaccc 2221 ctggcaagcc cctgaagtgc atcatcaagc accccaacgg gacccaggag accatcctcc 2281 tgaaccacac cttcaacgag acgcagattg agtggttccg cgctggcagt gccctcaaca 2341 gaatgaagga actgcaacag tgagggcagt gcctccccgc ccgccgctgg cgtcaagttc 2401 agctccacgt gtgccatcag tggatccgat ccgtccagcc atggcttcct attccaagat 2461 ggtgtgacca gacatgcttc ctgctccccg tagccctcgg agtgactgtg gttgtggtgg 2521 gggggttctt aaaataactt tttagccccc gtcttcctat tttgagtttg gttcagatct 2581 taagcagctc catgcaactg tatttatttt tgatgacaag actcccatct aaagtttttc 2641 tcctgcctga tcatttcatt ggtggctgaa ggattctaga gaaccttttg ttcttgcaag 2701 gaaaacaaga atccaaaacc aaaaaaaaaa aaaaaaaa // LOCUS HSU80073 1680 bp mRNA PRI 15-OCT-1997 DEFINITION Human tip associating protein (TAP) mRNA, complete cds. ACCESSION U80073 NID g1724119 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1680) AUTHORS Yoon,D.W., Lee,H., Seol,W., DeMaria,M., Rosenzweig,M. and Jung,J.U. TITLE Tap: a novel cellular protein that interacts with tip of herpesvirus saimiri and induces lymphocyte aggregation JOURNAL Immunity 6 (5), 571-582 (1997) MEDLINE 97318898 REFERENCE 2 (bases 1 to 1680) AUTHORS Yoon,D.-W., Lee,H. and Jung,J.U. TITLE Direct Submission JOURNAL Submitted (27-NOV-1996) Microbiology, NERPRC, Harvard Medical School, 1 Pine Hill Dr., Southborough, MA 01772, USA FEATURES Location/Qualifiers source 1..1680 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="white blood cell" gene 1..1680 /gene="TAP" CDS 1..1680 /gene="TAP" /codon_start=1 /product="tip associating protein" /db_xref="PID:g1724120" /translation="MSDAQDGPRVRYNPYTTRPNRRGDTWHDRDRIHVTVRRDRAPPE RGGAGTSQDGTSKNCFKITIPYGRKYDKAWLLSMIQSKCSVPFTPIEFHYENTRAQFF VEDASTASALKAVNYKILDRENRRISIIINSSAPPHTILNELKPEQVEQLKLIMSKRY DGSQQALDLKGLRSDPDLVAQNIDVVLNRRSCMAATLRIIEENIPELLSLNLSNNRLY RLDDMSSIVQKAPNLKILNLSGNELKSERELDKIKGLKLEELWLDGNSLCDTFRDQST YISAIRERFPKLLRLDGHELPPPIAFDVEAPTTLPPCKGSYFGTENLKSLVLHFLQQY YAIYDSGDRQGLLDAYHDGACCSLSIPFIPQNPARSSLAEYFKDSRNVKKLKDPTLRF RLLKHTRLNVVAFLNELPKTQHDVNSFVVDISAQTSTLLCFSVNGVFKEVDGKSRDSL RAFTRTFIAVPASNSGLCIVNDELFVRNASSEEIQRAFAMPAPTPSSSPVPTLSPEQQ EMLQAFSTQSGMNLEWSQKCLQDNNWDYTRSAQAFTHLKAKGEIPEVAFMK" BASE COUNT 444 a 453 c 400 g 383 t ORIGIN 1 atgagtgatg cccaggatgg tccccgagta cgatacaacc cctataccac ccgacctaac 61 cgtcggggtg atacttggca tgatcgagat cgcattcatg ttactgtgcg gagagacaga 121 gctcctccag agagaggagg ggctggcacc agccaggatg ggacctcaaa gaactgcttc 181 aagattacaa ttccttatgg cagaaagtat gacaaggcat ggctcctgag catgattcag 241 agcaagtgca gtgtgccctt cacccctatt gagtttcact atgagaatac acgggcccag 301 ttcttcgttg aagacgccag tactgcctct gcattgaagg ctgtcaacta taagattttg 361 gatcgggaga accgaaggat atctatcatc atcaactctt ctgctccacc ccacactata 421 ctgaatgaac tgaagccaga acaagtagaa cagctaaagc tgatcatgag caaacgatac 481 gatggctccc aacaagccct tgacctcaaa ggcctccgtt cagacccaga tttggtggcc 541 cagaacattg acgttgtcct gaatcgcaga agctgtatgg cagctaccct gaggatcatt 601 gaagagaaca tccctgagct attgtccttg aacttgagca acaacaggct gtacaggctg 661 gatgacatgt ctagcattgt tcagaaggca cccaacctga agatcctaaa cctttctgga 721 aatgaattga agtctgagcg ggaattggac aagataaagg ggctgaagct agaagagctc 781 tggctcgatg gaaactccct gtgtgacacc ttccgagacc agtccaccta catcagcgcc 841 attcgcgaac gatttcccaa gttactacgc ctggatggcc atgagctacc cccaccaatt 901 gcctttgatg ttgaagcccc cacgacgtta ccgccctgca agggaagcta ttttggaaca 961 gaaaacttga agagtctggt cttgcacttc ctgcaacagt actatgcaat ttacgactct 1021 ggagaccgac aagggctcct ggatgcctac catgatgggg cctgctgttc cctgagcatt 1081 cctttcattc ctcagaaccc tgcccgaagc agcttagccg agtatttcaa ggatagcaga 1141 aatgtgaaga agcttaaaga ccctaccttg cggttccggc tgctgaagca cacgcgtctc 1201 aacgttgttg ccttcctcaa tgagttgccc aaaacccagc acgacgtcaa ttccttcgtg 1261 gtagacataa gcgcccagac aagcacattg ctgtgttttt ctgtcaatgg agtcttcaag 1321 gaagtggacg gaaagtcccg ggattctttg cgagccttca cccggacatt cattgctgtt 1381 cctgctagca attcagggct atgtattgta aatgatgagc tatttgtgcg gaatgccagt 1441 tctgaagaga tccaaagagc cttcgctatg cctgcaccca cgccttcctc cagcccggtg 1501 cccaccctct ctccagagca gcaggaaatg ttgcaagcat tctctaccca gtctggcatg 1561 aacctcgagt ggtcccagaa gtgccttcag gacaacaact gggactacac cagatctgcc 1621 caggccttca ctcatctcaa ggccaagggc gagatcccag aagtggcatt catgaagtga // LOCUS HSU80456 3921 bp mRNA PRI 08-JUL-1997 DEFINITION Human transcription factor SIM2 long form mRNA, complete cds. ACCESSION U80456 NID g2062416 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3921) AUTHORS Chrast,R., Scott,H.S., Chen,H., Kudoh,J., Rossier,C., Minoshima,S., Wang,Y., Shimizu,N. and Antonarakis,S.E. TITLE Cloning of two human homologs of the Drosophila single-minded gene SIM1 on chromosome 6q and SIM2 on 21q within the Down syndrome chromosomal region JOURNAL Genome Res. 7 (6), 615-624 (1997) MEDLINE 97343329 REFERENCE 2 (bases 1 to 3921) AUTHORS Chrast,R., Kudoh,J., Rossier,C., Chen,H., Minoshima,S., Shimizu,N. and Antonarakis,S.E. TITLE Direct Submission JOURNAL Submitted (29-NOV-1996) Medical Genetics, University of Geneva Medical School, 1, Rue Michel-Servet, Geneva 1211, Switzerland FEATURES Location/Qualifiers source 1..3921 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="21" /map="21q22.2" CDS 93..2096 /codon_start=1 /product="transcription factor SIM2 long form" /db_xref="PID:g2062417" /translation="MKEKSKNAAKTRREKENGEFYELAKLLPLPSAITSQLDKASIIR LTTSYLKMRAVFPEGLGDAWGQPSRAGPLDGVAKELGSHLLQTLDGFVFVVASDGKIM YISETASVHLGLSQVELTGNSIYEYIHPSDHDEMTAVLTAHQPLHHHLLQEYEIERSF FLRMKCVLAKRNAGLTCSGYKVIHCSGYLKIRQYMLDMSLYDSCYQIVGLVAVGQSLP PSAITEIKLYSNMFMFRASLDLKLIFLDSRVTEVTGYEPQDLIEKTLYHHVHGCDVFH LRYAHHLLLVKGQVTTKYYRLLSKRGGWVWVQSYATVVHNSRSSRPHCIVSVNYVLTE IEYKELQLSLEQVSTAKSQDSWRTALSTSQETRKLVKPKNTKMKTKLRTNPYPPQQYS SFQMDKLECGQLGNWRASPPASAAAPPELQPHSESSDLLYTPSYSLPFSYHYGHFPLD SHVFSSKKPMLPAKFGQPQGSPCEVARFFLSTLPASGECQWHYANPLVPSSSSPAKNP PEPPANTARHSLVPSYEAPAAAVRRFGEDTAPPSFPSCGHYREEPALGPAKAARQAAR DGARLALARAAPECCAPPTPEAPGAPAQLPFVLLNYHRVLARRGPLGGAAPAASGLAC APGGPEAATGALRLRHPSPAATSPPGAPLPHYLGASVIITNGR" BASE COUNT 922 a 1137 c 998 g 864 t ORIGIN 1 actcactata gggctcgagc ggccgcccgg gcaggtgggg ctccgcgggc ctggagcacg 61 gccgggtcta atatgcccgg agccgaggcg cgatgaagga gaagtccaag aatgcggcca 121 agaccaggag ggagaaggaa aatggcgagt tttacgagct tgccaagctg ctcccgctgc 181 cgtcggccat cacttcgcag ctggacaaag cgtccatcat ccgcctcacc acgagctacc 241 tgaagatgcg cgccgtcttc cccgaaggtt taggagacgc gtggggacag ccgagccgcg 301 ccgggcccct ggacggcgtc gccaaggagc tgggatcgca cttgctgcag actttggatg 361 gatttgtttt tgtggtagca tctgatggca aaatcatgta tatatccgag accgcttctg 421 tccatttagg cttatcccag gtggagctca cgggcaacag tatttatgaa tacatccatc 481 cttctgacca cgatgagatg accgctgtcc tcacggccca ccagccgctg caccaccacc 541 tgctccaaga gtatgagata gagaggtcgt tctttcttcg aatgaaatgt gtcttggcga 601 aaaggaacgc gggcctgacc tgcagcggat acaaggtcat ccactgcagt ggctacttga 661 agatcaggca gtatatgctg gacatgtccc tgtacgactc ctgctaccag attgtggggc 721 tggtggccgt gggccagtcg ctgccaccca gtgccatcac cgagatcaag ctgtacagta 781 acatgttcat gttcagggcc agccttgacc tgaagctgat attcctggat tccagggtga 841 ccgaggtgac gggttacgag ccgcaggacc tgatcgagaa gaccctatac catcacgtgc 901 acggctgcga cgtgttccac ctccgctacg cacaccacct cctgttggtg aagggccagg 961 tcaccaccaa gtactaccgg ctgctgtcca agcggggcgg ctgggtgtgg gtgcagagct 1021 acgccaccgt ggtgcacaac agccgctcgt cccggcccca ctgcatcgtg agtgtcaatt 1081 atgtactcac ggagattgaa tacaaggaac ttcagctgtc cctggagcag gtgtccactg 1141 ccaagtccca ggactcctgg aggaccgcct tgtctacctc acaagaaact aggaaattag 1201 tgaaacccaa aaataccaag atgaagacaa agctgagaac aaacccttac cccccacagc 1261 aatacagctc gttccaaatg gacaaactgg aatgcggcca gctcggaaac tggagagcca 1321 gtccccctgc aagcgctgct gctcctccag aactgcagcc ccactcagaa agcagtgacc 1381 ttctgtacac gccatcctac agcctgccct tctcctacca ttacggacac ttccctctgg 1441 actctcacgt cttcagcagc aaaaagccaa tgttgccggc caagttcggg cagccccaag 1501 gatccccttg tgaggtggca cgctttttcc tgagcacact gccagccagc ggtgaatgcc 1561 agtggcatta tgccaacccc ctagtgccta gcagctcgtc tccagctaaa aatcctccag 1621 agccaccggc gaacactgct aggcacagcc tggtgccaag ctacgaagcg cccgccgccg 1681 ccgtgcgcag gttcggcgag gacaccgcgc ccccgagctt cccgagctgc ggccactacc 1741 gcgaggagcc cgcgctgggc ccggccaaag ccgcccgcca ggccgcccgg gacggggcgc 1801 ggctggcgct ggcccgcgcg gcacccgagt gctgcgcgcc cccgaccccc gaggccccgg 1861 gcgcgccggc gcagctgccc ttcgtgctgc tcaactacca ccgcgtgctg gcccggcgcg 1921 gaccgctggg gggcgccgca cccgccgcct ccggcctggc ctgcgctccc ggcggccccg 1981 aggcggcgac cggcgcgctg cggctccggc acccgagccc cgccgccacc tccccgcccg 2041 gcgcgcccct gccgcactac ctgggcgcct cggtcatcat caccaacggg aggtgacccg 2101 ctggccgccc gcgccaggag cctggacccg gcctcccggg gctgcggcgc caccgagccc 2161 ggcaaatgcg cacgacctac attaatttat gcagagacag ctgtttgaat tggaccccgc 2221 cgccgacttg cggatttcca ccgcggaggc cccgcgcgcc ggtgccgagg gccgaggagc 2281 gcccgggtcc gggcaggtga ccgcccgcct ctgtcctgcg agggccggtg cgacccagtt 2341 gctgggggct tggtttcctc accttgaaat cgggcttcac gcgtcttgcc ttgtccccaa 2401 cgttccacaa cagtcccgct gggggattga agcggtttca ctccgcaaat atcctccact 2461 ttcaggaggg aaaacccacc ctaccacagt ccgctcttcc aagtggacgg cagacctggg 2521 aggggacgcc tgtgtcacga gcccttttag atgcttaggt gaaggcagaa gtgatgattg 2581 taagtcccat gaatacacaa ctccactgtc tttaaaagtc attcaagagt ctcattattt 2641 ttgtttttat ttaacccttt cttcaataca aaaagccaac aaaccaagac taagggggtg 2701 accatgcaat tccattttgt gtctgtgaac ataggtgtgc ttcccaaata cattaacaag 2761 ctcttacttc cccctaaccc ctatgaactc ttgataacac caagagtagc accttcagaa 2821 tatattgaat aggcattaaa tgcaaaaata tatatgtagc cagacagttt atgagaatga 2881 ccctgtcaag cttcattatt acgtggcaaa atccctctgg cccacacaga tctgtaattc 2941 actaggctcg tgtttgctac aaatagtgct aataaagtta aattgcacgt gcaatacgga 3001 acactgtcaa tggactgcac cttgtgaagg aaaaacatgc ttaagggggt gtaatgaaaa 3061 tgatgtagac attttaagca ttttctacac agcgagaaaa cttcgtaaga acatgttacg 3121 tgtgcaacag gtaaacagaa atcctttcat aaagcaccag cagtgtttaa aaaatgagct 3181 tccattaatt tttacttttt atgggttttg cttaaagatc tcaacatgga aaaatcctgt 3241 catggctctg aactgcacaa tgcattgaac cgccgtcctt caattttctt cacactatca 3301 acactgcagc attttgctgc tttatcaaaa tggtttattt taggaaactt tttccacctt 3361 tctgaatgga aagaggtttt cacaaatgtt ttaaactcat cgttctaaaa tcaagtgcac 3421 ctacaccaac tgctctcaaa atgtgaactg actttttttt tttttttttt gccaaccctg 3481 tgtcacttag tgaggacctg acacaatccc tacagggtgt ctgtcagtgg gcctcatggt 3541 aagagtcaca atttgcaaat ttaggaccgt gggtcatgca gcgaaggggc tggatggtag 3601 gaagggatgt gcccgcctct ccacgcactc agctatacct cattcacagc tccttgtgag 3661 tgtgtgcaca ggaaataagc cgagggtatt atttttttat gttcatgagt cttgtaatta 3721 aaccgtgatt cttgaaaggt gtaggtttga ttactaggag ataccaccga catttttcaa 3781 taaagtactg caaaatgctt ttgtgtctac cttgttatta acttttgggg ctgtatttag 3841 taaaaataaa tcaaggctat cggagcagtt caataacaaa ggttactgtt gagaaaaaag 3901 accctatcat agatttacaa g // LOCUS HSU80669 1428 bp mRNA PRI 16-DEC-1996 DEFINITION Human androgen regulated homeobox protein (NKX3.1) mRNA, complete cds. ACCESSION U80669 NID g1732377 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1428) AUTHORS Prescott,J.L., Blok,L. and Tindall,D.J. TITLE Isolation and Androgen Regulation of a Novel Human Homeobox Gene, NKX 3.1 JOURNAL Unpublished REFERENCE 2 (bases 1 to 1428) AUTHORS Prescott,J.L., Blok,L. and Tindall,D.J. TITLE Direct Submission JOURNAL Submitted (30-NOV-1996) Urology Research, Mayo Clinic/Foundation, 200 First St. SW, Rochester, MN 55905, USA FEATURES Location/Qualifiers source 1..1428 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="prostate" gene 30..734 /gene="NKX3.1" CDS 30..734 /gene="NKX3.1" /codon_start=1 /product="androgen regulated homeobox protein" /db_xref="PID:g1732378" /translation="MLRVPEPRPGEAKAEGAAPPTPSKPLTSFLIQDILRDGAQRQGG RTSSQRQRDPEPEPEPEPEGGRSRAGAQNDQLSTGPRAAPDEAETLAETEPERHLGSY LLDSENTSGALPRLPQTPKQPQKRSRAAFSHTQVIELERKFSHQKYLSAPERAHLAKN LKLTETQVKIWFQNRRYKTKRKQLSSELGDLEKHSFLPALKEEAFSRASLVSVYNSYP YYPYLHCVGSWSPAFW" misc_feature 399..578 /gene="NKX3.1" /note="encodes an androgen regulated homeobox" BASE COUNT 366 a 386 c 394 g 282 t ORIGIN 1 tgcattcagg ccaaggcggg gccgccggga tgctcagggt tccggagccg cggcccgggg 61 aggcgaaagc ggagggggcc gcgccgccga ccccgtccaa gccgctcacg tccttcctca 121 tccaggacat cctgcgggac ggcgcgcagc ggcaaggcgg ccgcacgagc agccagagac 181 agcgcgaccc ggagccggag ccagagccag agccagaggg aggacgcagc cgcgccgggg 241 cgcagaacga ccagctgagc accgggcccc gcgccgcgcc ggatgaggcc gagacgctgg 301 cagagaccga gccagaaagg cacttggggt cttatctgtt ggactctgaa aacacttcag 361 gcgcccttcc aaggcttccc caaaccccta agcagccgca gaagcgctcc cgagctgcct 421 tctcccacac tcaggtgatc gagttggaga ggaagttcag ccatcagaag tacctgtcgg 481 cccctgaacg ggcccacctg gccaagaacc tcaagctcac ggagacccaa gtgaagatat 541 ggttccagaa cagacgctat aagactaagc gaaagcagct ctcctcggag ctgggagact 601 tggagaagca ctcctttttg ccggccctga aagaggaggc cttctcccgg gcctccctgg 661 tctccgtgta taacagctat ccttactacc catacctgca ctgcgtgggc agctggagcc 721 cagctttttg gtaatgccag ctcaggtgac aaccattatg atcaaaaact gccttcccca 781 gggtgtctca tatgaaaagc acaaggggcc aaggtcaggg agcaagaggt gtgcacacca 841 aagctattgg agatttgcgt ggaaatctca gattcttcac tggtgagaca atgaaacaac 901 agagacagtg aaagttttaa tacctaagtc attcccccag tgcatactgt agcgtcaagt 961 ttttgcttct ggctacctgt ttgaagggga gagagggaaa atcaagtggt attttccagc 1021 actttgtatg attttggatg agctgtacac ccaaggattc tgttctgcaa ctccatcctc 1081 ctgtgtcact gaatatcaac tctgaaagag caaacctaac aggagaaagg acaaccagga 1141 tgaggatgtc accaactgaa ttaaacttaa gtccagaagc ctcctgttgg ccttggaata 1201 tggccaaggc tctctctgtc cctgtaaaag agaggggcaa atatctccaa gagaacgccc 1261 tcatgctcag cacatatttg catggaaggg ggagatgggt gggaggagat gaaaatatca 1321 gcttttctta ttccttttta ttccttttaa aatggtatgc caacttaagt atttacaggg 1381 tggcccaaat agaacaagat gcactcgctg tgattttaag acaagctg // LOCUS HSU80744 962 bp mRNA PRI 18-DEC-1997 DEFINITION Homo sapiens CTG4a mRNA, complete cds. ACCESSION U80744 NID g2565062 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 962) AUTHORS Margolis,R.L., Abraham,M.R., Gatchell,S.B., Li,S.H., Kidwai,A.S., Breschel,T.S., Stine,O.C., Callahan,C., McInnis,M.G. and Ross,C.A. TITLE cDNAs with long CAG trinucleotide repeats from human brain JOURNAL Hum. Genet. 100 (1), 114-122 (1997) MEDLINE 97369492 REFERENCE 2 (bases 1 to 962) AUTHORS Margolis,R.L., Abraham,M.R., Gatchell,S.B., Li,S.H., Kidwai,A.S., Breschel,T.S., Stine,O.C., Callahan,C., McInnis,M.G. and Ross,C.A. TITLE Direct Submission JOURNAL Submitted (02-DEC-1996) Psychiatry, Johns Hopkins Univ. Sch. of Med., 600 N. Wolfe Street, Meyer 2-181, Baltimore, MD 21205, USA FEATURES Location/Qualifiers source 1..962 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /tissue_type="cerebral cortex" gene 1..962 /gene="CTG4a" CDS 388..819 /gene="CTG4a" /note="polyleucine rich" /codon_start=1 /product="CTG4a" /db_xref="PID:g2565063" /translation="MDSMPEPASRCLLLLPLLLLLLLLLPAPELGPSQAGAEENDWVR LPSKCEVCKYVAVELKSAFEETGKTKEVIGTGYGILDQKASGVKYTKSDLRLIEVTET ICKRLLIIACTRRGPAAIDLPRACQRPLRHYTTWYTKGSRW" repeat_region 439..462 /rpt_type=tandem /rpt_unit=CTG BASE COUNT 200 a 252 c 315 g 194 t 1 others ORIGIN 1 gaattccgcc cagacgcagg cttcttctcg ggtcttggtc ctgcatcctc tctctcccag 61 agcctccgtt agggggtggg aaaggacttt gccataggtc gctgaggcca ccatctgctc 121 tcttactggc caagggcgta aaaagatagt cttcccatta gctagagagc aaaccccaga 181 aagcctattg gttgcgccgt ccgcgggcct tggtccgctt tgaaggcggg ctgcggctgc 241 gagaggaggg cgggcgggag gctagctgtt gtcgtggttg ctcggaggca cgtgtgcagt 301 cccggaagcg gcgaggggaa actgctccgc gcgcgcccgc gggaggagga accgcccggt 361 cctttagggt ccgggcccgg ccgggccatg gattcaatgc ctgagcccgc gtcccgctgt 421 cttctgcttc ttcccttgct gctgctgctg ctgctgctgc tgccggcccc ggagctgggc 481 ccgagccagg ccggagctga ggagaacgac tgggttcgcc tgcccagcaa atgcgaagtg 541 tgtaaatatg ttgctgtgga gctgaagtca gcctttgagg aaaccggcaa gaccaaggag 601 gtgattggca cgggctatgg catcctggac cagaaggcct ctggagtcaa atacaccaag 661 tcggacttgc ggttaatcga agtcactgag accatttgca agaggctcct gattatagcc 721 tgcacaagga gaggaccggc agcaatcgat ttgccaaggg catgtcagag acctttgaga 781 cattacacaa cctggtacac aaaggggtca aggtggtgat ggacatcccc tatgagctgt 841 ggaacgagac ttctgcagag gtggctgacc tcaagaanca gtgtgatgtg ctggtggaag 901 agtttgagga ggtgatcgag gactggtaca ggaaccacca ggaggaagac ctgactgaat 961 tc // LOCUS HSU80746 1719 bp mRNA PRI 18-DEC-1997 DEFINITION Homo sapiens CAGH4 mRNA, partial cds. ACCESSION U80746 NID g2565066 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1719) AUTHORS Margolis,R.L., Abraham,M.R., Gatchell,S.B., Li,S.H., Kidwai,A.S., Breschel,T.S., Stine,O.C., Callahan,C., McInnis,M.G. and Ross,C.A. TITLE cDNAs with long CAG trinucleotide repeats from human brain JOURNAL Hum. Genet. 100 (1), 114-122 (1997) MEDLINE 97369492 REFERENCE 2 (bases 1 to 1719) AUTHORS Margolis,R.L., Abraham,M.R., Gatchell,S.B., Li,S.H., Kidwai,A.S., Breschel,T.S., Stine,O.C., Callahan,C., McInnis,M.G. and Ross,C.A. TITLE Direct Submission JOURNAL Submitted (02-DEC-1996) Psychiatry, Johns Hopkins Univ. Sch. of Med., 600 N. Wolfe Street, Meyer 2-181, Baltimore, MD 21205, USA FEATURES Location/Qualifiers source 1..1719 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="brain" CDS 1..1077 /note="similar to human RNA-binding protein CUG-BP/hNab50 (U63289); contains polyglutamine" /codon_start=1 /product="CAGH4" /db_xref="PID:g2565067" /translation="MRTSGRCLSPSGPSTSALCSGGQMAPAKGCAFVKFQTHAEAQAA INTLHSSRTLPGASSSLVVKFADTEKERGLRRMQQVATQLGMFSPIALQFGAYSAYTQ ALMQQQAALVAAHSAYLSPMATMAAVQMQHMAAINANGLIATPITPSSGTSTPPAIAA TPVSAIPAALGVNGYSPVPTQPTGQPAPDALYPNGVHPYPAQSPAAPVDPLQQAYAGM QHYTAAYPAAYSLVAPAFPQPPALVAQQPPPPPQQQQQQQQQQQQQQQREGPDGCNIF IYHLPQEFTDSEILQMFVPFGHVISAKVFVDRATNQSKCFGFVSFDNPASAQAAIQAM NGFQIGMKRLKVQLKRPKDANRPY" repeat_region 757..780 /rpt_type=tandem /rpt_unit=CAG BASE COUNT 324 a 642 c 446 g 307 t ORIGIN 1 atgaggacgt ccggaagatg tttgagccct tcgggaccat cgacgagtgc actgtgctcc 61 gggggccaga tggcaccagc aaagggctgc gccttcgtga agttccagac ccacgctgag 121 gcccaggcgg ccatcaacac ccttcacagc agccggaccc tgccaggtgc ctcgtccagc 181 ctggtggtga agtttgctga cactgagaag gagcgaggtc tccgccgcat gcagcaggtg 241 gccacccagt tgggcatgtt cagccccatc gccctccagt ttggagccta cagcgcctac 301 acccaggccc tgatgcagca gcaggcggcc ctggtagcgg ctcacagtgc ctacctcagc 361 cccatggcca ccatggctgc cgtgcagatg cagcacatgg ctgccatcaa tgccaatggc 421 ctcatcgcca cccccatcac cccatcctca ggaaccagca cccctcctgc catcgctgcc 481 acgcctgtct ctgccattcc ggctgccctg ggcgtcaacg gctacagccc ggtgcccacc 541 cagcccactg ggcagcctgc ccctgatgct ctgtatccca acggggttca cccctaccca 601 gcccagagcc ccgcggcccc cgtggacccc ctgcagcagg cctacgcggg gatgcagcac 661 tacacagcag cctacccagc agcctacagc ctggttgcac ctgcgttccc gcagcctcca 721 gccctggtcg cccagcagcc cccaccacca cctcaacagc agcagcagca gcagcagcag 781 caacagcagc agcagcaaag agaaggccct gatggctgca acatcttcat ctaccacctg 841 ccccaggagt tcactgactc agagatcctc cagatgtttg tcccctttgg ccacgtcatc 901 tcagccaaag tctttgttga ccgagccacc aatcaaagca aatgttttgg ctttgtgagt 961 ttcgacaatc cggccagtgc ccaggctgcc atccaggcca tgaatggctt ccagatcggc 1021 atgaagcgcc tcaaagtcca gctaaagcgg cctaaggatg ccaaccggcc ctactgaggg 1081 cccccaggtc tggagatccc agaggaaggg gcgcctcaca ccctcttccc acgactggcc 1141 ccggccctct ccgcacacct gccctgggcc ttgactgggt tctggggcaa acgctgcttc 1201 gtggcccccg ggggcacaag acaccggccc ctcccacccc cctgcctctc tgaagggcca 1261 tggctatgct tccctggctc caagggccca tttcctccta gatgcccttt tggcctttgt 1321 gagggagcga ggaacaggct cgaaggctcc ggggtatctg ccttctgctg ggctcctgtg 1381 acaggccttc tgtgcccagc gtttgtactt gcctccccca acagtgggcc tgttctaccc 1441 gtgcaggccc caggagagcc gcaggggcct gccacacact cccagctcac cctcacccca 1501 gcctcttccc cacattaggg gtttcttgga agctggctct cactcccctc caccctcagc 1561 tagaggtagg atatccctga ttcctgggct cccagcccta aaactcactg cctcccccaa 1621 gggccccctc taaggagggt gtgggggagc cctgagggct gcctctctgc cagtcagcca 1681 cagagaccct cctcctttca cgaggaaaag acggaattc // LOCUS HSU80760 1655 bp mRNA PRI 18-DEC-1997 DEFINITION Homo sapiens CAGH1 alternate open reading frame mRNA, complete cds. ACCESSION U80760 NID g2565088 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1655) AUTHORS Margolis,R.L., Abraham,M.R., Gatchell,S.B., Li,S.H., Kidwai,A.S., Breschel,T.S., Stine,O.C., Callahan,C., McInnis,M.G. and Ross,C.A. TITLE cDNAs with long CAG trinucleotide repeats from human brain JOURNAL Hum. Genet. 100 (1), 114-122 (1997) MEDLINE 97369492 REFERENCE 2 (bases 1 to 1655) AUTHORS Margolis,R.L., Abraham,M.R., Gatchell,S.B., Li,S.H., Kidwai,A.S., Breschel,T.S., Stine,O.C., Callahan,C., McInnis,M.G. and Ross,C.A. TITLE Direct Submission JOURNAL Submitted (02-DEC-1996) Psychiatry, Johns Hopkins Univ. Sch. of Med., 600 N. Wolfe Street, Meyer 2-181, Baltimore, MD 21205, USA FEATURES Location/Qualifiers source 1..1655 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="brain" gene 1..1655 /gene="CAGH1" CDS 915..1331 /gene="CAGH1" /note="contains leucine repeat" /codon_start=1 /product="CAGH1 alternate open reading frame" /db_xref="PID:g2565089" /translation="MFLLKKSSEKGWYPVKESRDRGPRRWSQGLSRKKRTFPTKSWRK KTPGLLLPLPSGWQHGSLRAGQVLHLDGAGGDSEADVLMMLRRLIRGQVKGTLGWRVG IAVPTTPTLGGCPRRLEVWWWLLLLLLLLLLLLLLL" repeat_region 1290..1328 /rpt_type=tandem /rpt_unit=CTG BASE COUNT 403 a 385 c 466 g 401 t ORIGIN 1 gaattccatc cctcccccct cccccacccc aaggacaaaa agaaaaatgc tgtggctaga 61 ggacaggggt ggagagaggg aaaaaaagga caaaaataaa caaaacatca aatatgaaaa 121 ttctcagaga taatgcattt ataaacacag aaatggttac aacaaagatg gccgtgatga 181 gtgggtataa tatatttata tatatatatt tatatataaa tccgtgtccg gcatctgact 241 gtggcaccta gggagctaag tccagtcctt gagtttacct tgaactctcc cttctccgca 301 acacccctgt tttggagttt gcacagatta cacaaagcct cccacagctc cttgggggtg 361 ggttggggag acttgagagt ataggtcttt gtaggcagag aaggagagag gcttcaagga 421 aatccgtaaa accataacac acacttctaa gccacctgtg accaacttgg gaatttctgg 481 ccccttgggg accacatctc agcccttgcc cccttcaaat aaaaggaggt ctagccccta 541 ccccaaatct ccttctacca gcagtcaata ggaagcaaag tgagacgatg taggggaaga 601 aatggctctc agggactgag gcatttgaga aacctctgtt cttttgcagg caagaataga 661 acaagaggct ggttgcattt ggggctccct tttctctgtt atctgggagg gccagcctct 721 agtcttacat cagcccaaac tttgaggata aggagggtaa ggatagggta agtggccaca 781 ctggacaagg ttctatgagg tcatagcaaa tccttccctt gagcaccgac ccccagtttt 841 agaagctttg cttgggaggg gaggctgctg gatgacacca tcagctcagt attcctttgc 901 aatcaggagg gctgatgttc cttttgaaga agagttcaga aaaaggatgg tatcctgtga 961 aggaaagccg tgacagaggc ccaaggagat ggagccaagg cctgtcaagg aagaaaagga 1021 cttttcccac caagagttgg agaaagaaga caccaggact acttcttcct cttcccagtg 1081 ggtggcagca cggatctcta agagctggcc aggtgctcca cctggatggt gctggtggtg 1141 acagtgaggc agatgtcctt atgatgctcc gccgtcttat acggggtcag gtcaaaggaa 1201 cactggggtg gagggttggg attgctgtcc ccaccacccc caccctgggg ggctgcccca 1261 ggagactgga agtgtggtgg tggctgttgc tgctgctgct gctgctgctg ctgctgctgc 1321 tgctgctgtg atgcctggga ggcctgggcc tgggcctggg cttgagcctg agcctgagcc 1381 tgggcttgag cttgagcctg ggcctgggcc actgctgccg ctgctgctgc tgcctgcacc 1441 tgttgctgaa gatcaggcgg gttgtgtttg cgcatatgtt tcataaggta tgtttctgat 1501 gtgtatgccc gactgcagat agtgcaggtg tacaccttgg catgcttcac tgtgtgcgta 1561 gacaggtgca cctctagtga ggctgcatcc gtgtacgccc gatgacagtt gtggcacttg 1621 aagggtttat ctttgttgtg ttgccgtcgg aattc // LOCUS HSU81006 2391 bp mRNA PRI 19-DEC-1996 DEFINITION Human p76 mRNA, complete cds. ACCESSION U81006 NID g1737489 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2391) AUTHORS Schimmoeller,F., Diaz,E. and Pfeffer,S.R. TITLE p76 is a human member of an emerging family of multispanning membrane proteins JOURNAL Unpublished REFERENCE 2 (bases 1 to 2391) AUTHORS Schimmoeller,F., Diaz,E. and Pfeffer,S.R. TITLE Direct Submission JOURNAL Submitted (04-DEC-1996) Biochemistry, Stanford University School of Medicine, CA, Stanford 94305, USA FEATURES Location/Qualifiers source 1..2391 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 134..2125 /note="predicted molecular weight is 76 kD; contains nine potential membrane spanning domains; similar to yeast p24a precursor protein encoded by GenBank Accession Number X67316" /codon_start=1 /product="p76" /db_xref="PID:g1737490" /translation="MSARLPVLSPPRWPRLLLLSLLLLGAVPGPRRSGAFYLPGLAPV NFCDEEKKSDECKAEIELFVNRLDSVESVLPYEYTAFDFCQASEGKRPSENLGQVLFG ERIEPSPYKFTFNKKETCKLVCTKTYHTEKAEDKQKLEFLKKSMLLNYQHHWIVDNMP VTWCYDVEDGQRFCNPGFPIGCYITDKGHAKDACVISSDFHERDTFYIFNHVDIKIYY HVVETGSMGARLVAAKLEPKSFKHTHIDKPDCSGPPMDISNKASGEIKIAYTYSVSFE EDDKIRWASRWDYILESMPHTHIQWFSIMNSLVIVLFLSGMVAMIMLRTLHKDIARYN QMDSTEDAQEEFGWKLVHGDIFRPPRKGMLLSVFLGSGTQILIMTFVTLFFACLGFLS PANRGALMTCAVVLWVLLGTPAGYVAARFYKSFGGEKWKTNVLLTSFLCPGIVFADFF IMNLILWGEGSSAAIPFGTLVAILALWFCISVPLTFIGAYFGFKKNAIEHPVRTNQIP RQIPEQSFYTKPLPGIIMGGILPFGCIFIQLFFILNSIWSHQMYYMFGFLFLVFIILV ITCSEATILLCYFHLCAEDYHWQWRSFLTSGFTAVYFLIYAVHYFFSKLQITGTASTI LYFGYTMIMVLIFFLFTGTIGFFACFWFVTKIYSVVKVD" BASE COUNT 621 a 501 c 512 g 757 t ORIGIN 1 cgcaaccgga actagccttc tgggggccgg cttggtttat ctctggcggc cttgtagtcg 61 tctccgagac tccccacccc tccttccctc ttgaccccct aggtttgatt gccctttccc 121 cgaaacaact atcatgagcg cgaggctgcc ggtgttgtct ccacctcggt ggccgcggct 181 gttgctgctg tcgctgctcc tgctgggggc ggttcctggc ccgcgccgga gcggcgcttt 241 ctacctgccc ggcctggcgc ccgtcaactt ctgcgacgaa gaaaaaaaga gcgacgagtg 301 caaggccgaa atagaactat ttgtgaacag acttgattca gtggaatcag ttcttcctta 361 tgaatacaca gcgtttgatt tttgccaagc atcagaagga aagcgcccat ctgaaaatct 421 tggtcaggta ctattcgggg aaagaattga accttcacca tataagttta cgtttaataa 481 gaaggagacc tgtaagcttg tttgtacaaa aacataccat acagagaaag ctgaagacaa 541 acaaaagtta gaattcttga aaaaaagcat gttattgaat tatcaacatc actggattgt 601 ggataatatg cctgtaacgt ggtgttacga tgttgaagat ggtcagaggt tctgtaatcc 661 tggatttcct attggctgtt acattacaga taaaggccat gcaaaagatg cctgtgttat 721 tagttcagat ttccatgaaa gagatacatt ttacatcttc aaccatgttg acatcaaaat 781 atactatcat gttgttgaaa ctgggtccat gggagcaaga ttagtggctg ctaaacttga 841 accgaaaagc ttcaaacata cccatataga taaaccagac tgctcagggc cccccatgga 901 cataagtaac aaggcttctg gggagataaa aattgcctat acttactctg ttagcttcga 961 ggaagatgat aagatcagat gggcgtctag atgggactat attctggagt ctatgcctca 1021 tacccacatt cagtggttta gcattatgaa ttccctggtc attgttctct tcttatctgg 1081 aatggtagct atgattatgt tacggacact gcacaaagat attgctagat ataatcagat 1141 ggactctacg gaagatgccc aggaagaatt tggctggaaa cttgttcatg gtgatatatt 1201 ccgtcctcca agaaaaggga tgctgctatc agtctttcta ggatccggga cacagatttt 1261 aattatgacc tttgtgactc tatttttcgc ttgcctggga tttttgtcac ctgccaaccg 1321 aggagcgctg atgacgtgtg ctgtggtcct gtgggtgctg ctgggcaccc ctgcaggcta 1381 tgttgctgcc agattctata agtcctttgg aggtgagaag tggaaaacaa atgttttatt 1441 aacatcattt ctttgtcctg ggattgtatt tgctgacttc tttataatga atctgatcct 1501 ctggggagaa ggatcttcag cagctattcc ttttgggaca ctggttgcca tattggccct 1561 ttggttctgc atatctgtgc ctctgacgtt tattggtgca tactttggtt ttaagaagaa 1621 tgccattgaa cacccagttc gaaccaatca gattccacgt cagattcctg aacagtcgtt 1681 ctacacgaag cccttgcctg gtattatcat gggagggatt ttgccctttg gctgcatctt 1741 tatacaactt ttcttcattc tgaatagtat ttggtcacac cagatgtatt acatgtttgg 1801 cttcctattt ctggtgttta tcattttggt tattacctgt tctgaagcaa ctatacttct 1861 ttgctatttc cacctatgtg cagaggatta tcattggcaa tggcgttcat tccttacgag 1921 tggctttact gcagtttatt tcttaatcta tgcagtacac tacttctttt caaaactgca 1981 gatcacggga acagcaagca caattctgta ctttggttat accatgataa tggttttgat 2041 cttctttctt tttacaggaa caattggctt ctttgcatgc ttttggtttg ttaccaaaat 2101 atacagtgtg gtgaaggttg actgaagaag tccagtgtgt ccagttaaaa cagaaataaa 2161 ttaaactctt catcaacaaa gacctgtttt tgtgactgcc ttgagtttta tcagaattat 2221 tggcctagta atccttcaga aacaccgtaa ttctaaataa acctcttccc atacaccttt 2281 cccccataag atctgtcttc aacactataa agcatttgta ttgtgatttg attaagtata 2341 tatttggttg ttctcaatga agagcaaatt taaatattat gtgcatttga a // LOCUS HSU81375 2162 bp mRNA PRI 21-FEB-1997 DEFINITION Human placental equilibrative nucleoside transporter 1 (hENT1) mRNA, complete cds. ACCESSION U81375 NID g1845344 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2162) AUTHORS Griffiths,M., Beaumont,N., Yao,S.Y.M., Sundaram,M., Boumah,C.E., Davies,A., Kwong,F.Y.P., Coe,I., Cass,C.E., Young,J.D. and Baldwin,S.A. TITLE Cloning of a human nucleoside transporter implicated in the cellular uptake of adenosine and chemotherapeutic drugs JOURNAL Nat. Med. 3 (1), 89-93 (1997) MEDLINE 97140266 REFERENCE 2 (bases 1 to 2162) AUTHORS Griffiths,M., Beaumont,N., Yao,S.Y.M., Sundaram,M., Boumah,C.E., Davies,A., Kwong,F.Y.P., Coe,I., Cass,C.E., Young,J.D. and Baldwin,S.A. TITLE Direct Submission JOURNAL Submitted (09-DEC-1996) Department of Biochemistry and Molecular Biology, University of Leeds, W. Yorkshire, Leeds LS2 9JT, UK FEATURES Location/Qualifiers source 1..2162 /organism="Homo sapiens" /note="expressed in adult erythrocytes, placenta, heart, brain, mammary gland, and in fetal liver and/or spleen" /db_xref="taxon:9606" /tissue_type="placenta" gene 179..1549 /gene="hENT1" CDS 179..1549 /gene="hENT1" /note="broad substrate specificity for purines and pyrimidines; inhibited by nitrobenzylthioinosine (NBMPR), dipyridamole and dilazep; equilibrative sensitive type transporter; integral plasma membrane protein; similar to M. musculus nucleolar protein HNP36, to human nucleolar protein HNP36, to S. cerevisiae FUN26, to C. elegans ZK809.4, to C. elegans K09A9.3, and to C. elegans F16H11.3 encoded, respectively, by GenBank Accession Numbers X86682, X86681, L05146, Z68303, Z79601, and U55376" /codon_start=1 /product="equilibrative nucleoside transporter 1" /db_xref="PID:g1845345" /translation="MTTSHQPQDRYKAVWLIFFMLGLGTLLPWNFFMTATQYFTNRLD MSQNVSLVTAELSKDAQASAAPAAPLPERNSLSAIFNNVMTLCAMLPLLLFTYLNSFL HQRIPQSVRILGSLVAILLVFLITAILVKVQLDALPFFVITMIKIVLINSFGAILQGS LFGLAGLLPASYTAPIMSGQGLAGFFASVAMICAIASGSELSESAFGYFITACAVIIL TIICYLGLPRLEFYRYYQQLKLEGPGEQETKLDLISKGEEPRAGKEESGVSVSNSQPT NESHSIKAILKNISVLAFSVCFIFTITIGMFPAVTVEVKSSIAGSSTWERYFIPVSCF LTFNIFDWLGRSLTAVFMWPGKDSRWLPSLVLARLVFVPLLLLCNIKPRRYLTVVFEH DAWFIFFMAAFAFSNGYLASLCMCFGPKKVKPAEAETAGAIMAFFLCLGLALGAVFSF LFRAIV" BASE COUNT 389 a 638 c 560 g 575 t ORIGIN 1 gggctgcgct gtccagctgt ggctatggcc ccagccccga gatgaggagg gagagaacta 61 ggggcccgca ggcctgggaa tttccgtccc ccaccaagtc cggatgctca ctccaaagtc 121 tcagcaggcc cctgagggag ggagctgtca gccagggaaa accgagaaca ccatcaccat 181 gacaaccagt caccagcctc aggacagata caaagctgtc tggcttatct tcttcatgct 241 gggtctggga acgctgctcc cgtggaattt tttcatgacg gccactcagt atttcacaaa 301 ccgcctggac atgtcccaga atgtgtcctt ggtcactgct gaactgagca aggacgccca 361 ggcgtcagcc gcccctgcag cacccttgcc tgagcggaac tctctcagtg ccatcttcaa 421 caatgtcatg accctatgtg ccatgctgcc cctgctgtta ttcacctacc tcaactcctt 481 cctgcatcag aggatccccc agtccgtacg gatcctgggc agcctggtgg ccatcctgct 541 ggtgtttctg atcactgcca tcctggtgaa ggtgcagctg gatgctctgc ccttctttgt 601 catcaccatg atcaagatcg tgctcattaa ttcatttggt gccatcctgc agggcagcct 661 gtttggtctg gctggccttc tgcctgccag ctacacggcc cccatcatga gtggccaggg 721 cctagcaggc ttctttgcct ccgtggccat gatctgcgct attgccagtg gctcggaact 781 atcagaaagt gccttcggct actttatcac agcctgtgct gttatcattt tgaccatcat 841 ctgttacctg ggcctgcccc gcctggaatt ctaccgctac taccagcagc tcaagcttga 901 aggacccggg gagcaggaga ccaagttgga cctcattagc aaaggagagg agccaagagc 961 aggcaaagag gaatctggag tttcagtctc caactctcag cccaccaatg aaagccactc 1021 tatcaaagcc atcctgaaaa atatctcagt cctggctttc tctgtctgct tcatcttcac 1081 tatcaccatt gggatgtttc cagccgtgac tgttgaggtc aagtccagca tcgcaggcag 1141 cagcacctgg gaacgttact tcattcctgt gtcctgtttc ttgactttca atatctttga 1201 ctggttgggc cggagcctca cagctgtatt catgtggcct gggaaggaca gccgctggct 1261 gccaagcctg gtgctggccc ggctggtgtt tgtgccactg ctgctgctgt gcaacattaa 1321 gccccgccgc tacctgactg tggtcttcga gcacgatgcc tggttcatct tcttcatggc 1381 tgcctttgcc ttctccaacg gctacctcgc cagcctctgc atgtgcttcg ggcccaagaa 1441 agtgaagcca gctgaggcag agaccgcagg agccatcatg gccttcttcc tgtgtctggg 1501 tctggcactg ggggctgttt tctccttcct gttccgggca attgtgtgac aaaggatgga 1561 cagaaggact gcctgcctcc ctccctgtct gcctcctgcc ccttccttct gccaggggtg 1621 atcctgagtg gtctggcggt tttttcttct aactgacttc tgctttccac ggcgtgtgct 1681 gggcccggat ctccaggccc tggggaggga gcctctggac ggacagtggg gacattgtgg 1741 gtttggggct cagagtcgag ggacggggtg tagcctcggc atttgcttga gtttctccac 1801 tcttggctct gactgatccc tgcttgtgca ggccagtgga ggctcttggg cttggagaac 1861 acgtgtgtct ctgtgtatgt gtctgtgtgt ctgcgtccgt gtctgtcaga ctgtctgcct 1921 gtcctggggt ggctaggagc tgggtctgac cgttgtatgg tttgacctga tatactccat 1981 tctcccctgc gcctcctcct ctgtgttttt tccatgtccc cctcccaact ccccatgccc 2041 agtttttacc catcatgcac cctgtacagt tgccacgtta ctgccttttt taaaaatata 2101 tttgacagaa accaggtgcc ttcagaggct ctctgattta aataaacctt tcttgttttt 2161 tt // LOCUS HSU81504 3950 bp mRNA PRI 26-JUN-1997 DEFINITION Homo sapiens beta-3A-adaptin subunit of the AP-3 complex mRNA, complete cds. ACCESSION U81504 NID g2199511 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3950) AUTHORS Dell'Angelica,E.C., Ooi,C.E. and Bonifacino,J.S. TITLE beta3A-adaptin, a subunit of the adaptor-like complex AP-3 JOURNAL J. Biol. Chem. 272 (24), 15078-15084 (1997) MEDLINE 97326075 REFERENCE 2 (bases 1 to 3950) AUTHORS Dell'Angelica,E.C. and Bonifacino,J.S. TITLE Direct Submission JOURNAL Submitted (09-DEC-1996) CBMB-NICHD, National Institutes of Health, Bldg. 18T, Rm. 101, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..3950 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="pancreas" CDS 93..3377 /note="similar to cerebellar degeneration antigen beta-NAP and to the beta-1-adaptin and beta-2-adaptin subunits of clathrin-associated complexes" /codon_start=1 /product="beta-3A-adaptin subunit of the AP-3 complex" /db_xref="PID:g2199512" /translation="MSSNSFPYNEQSGGGEATELGQEATSTISPSGAFGLFSSDLKKN EDLKQMLESNKDSAKLDAMKRIVGMIAKGKNASELFPAVVKNVASKNIEIKKLVYVYL VRYAEEQQDLALLSISTFQRALKDPNQLIRASALRVLSSIRVPIIVPIMMLAIKEASA DLSPYVRKNAAHAIQKLYSLDPEQKEMLIEVIEKLLKDKSTLVAGSVVMAFEEVCPDR IDLIHKNYRKLCNLLVDVEEWGQVVIIHMLTRYARTQFVSPWKEGDELEDNGKNFYES DDDQKEKTDKKKKPYTMDPDHRLLIRNTKPLLQSRNAAVVMAVAQLYWHISPKSEAGI ISKSLVRLLRSNREVQYIVLQNIATMSIQRKGMFEPYLKSFYVRSTDPTMIKTLKLEI LTNLANEANISTLLREFQTYVKSQDKQFAAATIQTIGRCATNILEVTDTCLNGLVCLL SNRDEIVVAESVVVIKKLLQMQPAQHGEIIKHMAKLLDSITVPVARASILWLIGENCE RVPKIAPDVLRKMAKSFTSEDDLVKLQILNLGAKLYLTNSKQTKLLTQYILNLGKYDQ NYDIRDRTRFIRQLIVPNEKSGALSKYAKKIFLAQKPAPLLESPFKDRDHFQLGTLSH TLNIKATGYLELSNWPEVAPDPSVRNVEVIELAKEWTPAGKAKQENSAKKFYSESEEE EDSSDSSSDSESESGSESGEQGESGEEGDSNEDSSEDSSSEQDSESGRESGLENKRTA KRNSKAKGKSDSEDGEKENEKSKTSDSSNDESSSIEDSSSDSESESEPESESESRRVT KEKEKKTKQDRTPLTKDVSLLDLDDFNPVSTPVALPTPALSPSLMADLEGLHLSTSSS VISVSTPAFVPTKTHVLLHRMSGKGLAAHYFFPRQPCIFGDKMVSIQITLNNTTDRKI ENIHIGEKKLPIGMKMHVFNPIDSLEPEGSITVSMGIDFCDSTQTASFQLCTKDDCFN VNIQPPVGELLLPVAMSEKDFKKEQGVLTGMNETSAVIIAAPQNFTPSVIFQKVVNVA NVGAVPSGQDNIHRFAAKTVHSGSLMLVTVELKEGSTAQLIINTEKTVIGSVLLRELK PVLSQG" polyA_signal 3931..3936 BASE COUNT 1249 a 772 c 857 g 1072 t ORIGIN 1 cgagaactag ttttgttccg tgccctctgg actggaacct tttggagaga acccccggca 61 ggaccaaccc cgcacccgcc agcaccgcgg caatgtccag caatagtttt ccttacaatg 121 agcagtccgg aggaggggag gcgacggagc tgggtcagga ggcgacctca accatttccc 181 cctcgggggc cttcggcctc tttagcagcg atttgaagaa gaatgaagat ctaaagcaaa 241 tgttagagag caacaaagat tctgctaaac tggatgctat gaagcggatt gttgggatga 301 ttgcaaaagg gaaaaatgca tctgaactgt ttcctgctgt tgtgaagaat gtggccagta 361 aaaatattga gatcaagaag ttggtatatg tttacctggt tcgatatgct gaagaacagc 421 aggatcttgc actcctgtcc ataagcactt ttcagcgagc tctgaaggac ccaaaccaac 481 taattcgtgc aagcgctttg agagttctgt caagtattag agtgccaatt attgtaccta 541 tcatgatgct tgctattaag gaagcttctg ctgacttatc accatatgtt aggaagaatg 601 cagcccatgc aatacaaaaa ttatacagcc ttgatccaga gcagaaggaa atgttaattg 661 aagtaattga aaaacttctg aaagataaaa gcacattggt agctggcagt gttgtgatgg 721 cttttgaaga agtatgcccg gacagaatag atctgattca taaaaattac cgcaagctat 781 gtaacttact agtggatgtt gaagagtggg ggcaggttgt cataatccac atgctaactc 841 gatatgctcg gacacagttt gtcagccctt ggaaagaggg tgatgaatta gaagacaatg 901 gaaagaattt ctacgaatct gatgatgatc agaaggaaaa gactgacaaa aagaagaagc 961 cgtatactat ggatccagat catagactct taattagaaa tacaaagcct ttgcttcaga 1021 gcaggaatgc tgcggtggtt atggcagttg ctcagctgta ttggcacata tcaccaaaat 1081 ctgaagctgg cataatttct aaatcactag tgcgtttact tcgtagcaat agggaggtgc 1141 agtatattgt cctacaaaat atagcaacta tgtcaattca aagaaagggg atgtttgaac 1201 cttatctgaa gagtttctat gttaggtcaa ctgatccaac tatgatcaag acactgaagc 1261 ttgaaatttt gacaaacttg gcaaatgaag ccaacatatc aactcttctt cgagaatttc 1321 agacctatgt gaaaagccag gataaacaat ttgcagcagc cactattcag actataggca 1381 gatgtgcaac caacatcttg gaagtcactg acacgtgcct caatggcttg gtctgtctgc 1441 tgtccaacag ggatgaaata gttgttgctg aaagtgtggt tgttataaag aaattactgc 1501 aaatgcaacc tgcacaacat ggtgaaatta ttaaacatat ggccaaactc ctggacagta 1561 tcactgttcc tgttgctaga gcaagtattc tttggctaat tggagaaaac tgtgaacgag 1621 ttcctaaaat tgcccctgat gttttgagga agatggctaa aagcttcact agtgaagatg 1681 atctggtaaa actgcagata ttaaatctgg gagcaaaatt gtatttaacc aactccaaac 1741 agacaaaatt gcttacccag tacatattaa atctcggcaa gtatgatcaa aactacgaca 1801 tcagagaccg tacaagattt attaggcagc ttattgttcc gaatgaaaag agtggagctt 1861 taagtaaata tgccaaaaaa atattcctag cacaaaagcc tgcaccactg cttgagtctc 1921 cttttaaaga tagagatcat ttccagcttg gcaccttatc tcatactctc aacattaaag 1981 ctactgggta cctggaatta tctaattggc cagaggtggc gcccgaccca tcagttcgaa 2041 atgtagaagt aatagagttg gcaaaagaat ggaccccagc aggaaaagca aagcaagaga 2101 attctgctaa gaagttttat tctgaatctg aggaagagga ggactcttct gatagtagca 2161 gtgacagtga gagtgaatct ggaagtgaaa gtggagaaca aggcgaaagt ggggaggaag 2221 gagacagcaa tgaggacagc agtgaggact cctccagtga gcaggacagt gagagtggac 2281 gggagtcagg cctagaaaac aaaagaacag ccaagaggaa ctcaaaagcc aaaggaaaaa 2341 gtgattctga agatggggag aaggaaaatg aaaaatctaa aacttcagat tcttcaaatg 2401 acgaatctag ttcaatagaa gacagttctt ccgattctga atcagagtca gaacctgaaa 2461 gtgaatctga atccagaaga gtcactaagg agaaagaaaa gaaaacaaag caagatagaa 2521 ctcctcttac caaagatgtt tcacttctag atctggatga ttttaaccca gtatccactc 2581 cagttgcact tcccacacca gctctttctc caagtttgat ggctgatctt gaaggtttac 2641 acttgtcaac ttcctcttca gtcatcagtg tcagtactcc tgcatttgta ccaacgaaaa 2701 ctcacgtgct gcttcatcga atgagtggaa aaggactagc tgcccattat ttctttccaa 2761 gacagccttg catttttggt gataagatgg tctctataca aataacactg aataacacta 2821 ctgatcgaaa gatagaaaat atccacatag gggaaaaaaa acttcctata ggcatgaaaa 2881 tgcatgtttt taatccaata gactctcttg agcctgaggg atccattaca gtttcaatgg 2941 gtattgactt ttgtgattct actcagactg ccagtttcca gttgtgtacc aaggatgatt 3001 gcttcaatgt taatattcag ccacctgttg gagaactgct tttacctgtg gccatgtcag 3061 agaaagattt taagaaagag caaggagtgc taacaggaat gaatgaaact tctgctgtaa 3121 tcattgctgc accacagaat ttcactccct ctgtgatctt tcagaaggtt gtaaatgtag 3181 ccaatgtagg tgcagtccct tctggccagg ataatataca caggtttgca gctaaaactg 3241 tgcacagtgg gtcattgatg ctagtcacag tggaactgaa ggaaggctct acagcccagc 3301 ttatcataaa cactgagaaa actgtgattg gctctgttct gctgcgggaa ctgaagcctg 3361 tcctgtctca ggggtaacct gcttacatct ggactttaga atctggcaca caacaaaagt 3421 gcctggcatc cactactgct gcctttcatt tataataata gcccttccat ctggcagtgg 3481 gggtagaata cactcttgac attcttgtct cctgctttag aatgctagtg tgtatctatc 3541 atgtatgcaa tactttcccc ctttttgctt tgctaaccga agagcatata ttttactgtc 3601 agttgtctca actcttgaat ccatgtggcg ttttctctgt cctgctgctt cttttggcct 3661 cctcgttttc cttctctttt tcgacaatgg tagacatgaa tgagatattt aaagttcatt 3721 ggaaatcttc ttccctacag cagtaagcaa aaattagcaa agagatagtc taaatggcct 3781 ctcagcttgg tatgtgaaaa tgagatcaca tactttttaa atccaaatac aaaagcatag 3841 tctctgcaag attttgttct ttgaatttct tgatattgta attgattatt gataactgtc 3901 atcatgaaat tatctctcaa taataagata aataaactag catatgaatc // LOCUS HSU81523 1961 bp mRNA PRI 01-MAY-1997 DEFINITION Human endometrial bleeding associated factor mRNA, complete cds. ACCESSION U81523 NID g2058537 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1961) AUTHORS Tabibzadeh,S. and Kothapalli,R. TITLE Direct Submission JOURNAL Submitted (09-DEC-1996) Pathology, Moffitt Cancer Center, 12902 Magnolia Drive, Tampa, FL 33612, USA FEATURES Location/Qualifiers source 1..1961 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1q42.1" /tissue_type="placenta" CDS 34..1146 /codon_start=1 /product="endometrial bleeding associated factor" /db_xref="PID:g2058538" /translation="MWPLWLCWALWVLPLAGPGAALTEEQLLASLLRQLQLSEVPVLD RADMEKLVIPAHVRAQYVVLLRRDGDRSRGKRFSQSFREVAGRFLASEASTHLLVFGM EQRLPPNSELVQAVLRLFQEPVPQGALHRHGRLSPAAPKARVTVEWLVRDDGSNRTSL IDSRLVSVHESGWKAFDVTEAVNFWQQLSRPPEPLLVQVSVQREHLGPLASGAHKLVR FASQGAPAGLGEPQLELHTLDLRDYGAQGDCDPEAPMTEGTRCCRQEMYIDLQGMKWA KNWVLEPPGFLAYECVGTCQQPPEALAFNWPFLGPRQCIASETASLPMIVSIKEGGRT RPQVVSLPNMRVQKCSCASDGALVPRRLQHRPWCIH" BASE COUNT 399 a 572 c 580 g 410 t ORIGIN 1 ccactctgcc tcctgctccc ccagggcagc accatgtggc ccctgtggct ctgctgggca 61 ctctgggtgc tgcccctggc tggccccggg gcggccctga ccgaggagca gctcctggcg 121 agcctgctgc ggcagctgca gctcagcgag gtgcccgtac tggacagggc cgacatggag 181 aagctggtca tccccgccca cgtgagggcc cagtatgtag tcctgctgcg gcgcgacggg 241 gaccgctccc gcggaaagag gttcagccag agcttccgag aggtggccgg caggttcctg 301 gcgtcggagg ccagcacaca cctgctggtg ttcggcatgg agcagcggct gccgcccaac 361 agcgagctgg tgcaggccgt gctgcggctc ttccaggagc cggttcccca aggcgcgctg 421 cacaggcacg ggcggctgtc cccggcagcg cccaaggccc gggtgaccgt cgagtggctg 481 gtccgcgacg acggctccaa ccgcacctcc ctcatcgact ccaggctggt gtccgtccac 541 gagagcggct ggaaggcctt cgacgtgacc gaggccgtga acttctggca gcagctgagc 601 cggcccccgg agccgctgct cgtacaggtg tcggtgcaga gggagcatct gggcccgctg 661 gcgtccggcg cccacaagct ggtccgcttt gcctcgcagg gggcgccagc cgggcttggg 721 gagccccagc tggagctgca caccctggac ctcagggact atggagctca gggcgactgt 781 gaccctgaag caccaatgac cgagggcacc cgctgctgcc gccaggagat gtacattgac 841 ctgcagggga tgaagtgggc caagaactgg gtgctggagc ccccgggctt cctggcttac 901 gagtgtgtgg gcacctgcca gcagcccccg gaagccctgg ccttcaattg gccatttctg 961 gggccgcgac agtgtatcgc ctcggagact gcctcgctgc ccatgatcgt cagcatcaag 1021 gagggaggca ggaccaggcc ccaggtggtc agcctgccca acatgagggt gcagaagtgc 1081 agctgtgcct cggatggggc gctcgtgcca aggaggctcc agcataggcc ctggtgtatc 1141 cattgagcct ctaactgaac gtgtgcataa gaggtggtct taatgtaggg cgttaacttt 1201 atacttagca agttactcca tcccaattta gtgctcctgt gtgacctcgc cctgtgtcct 1261 tccattcctg tctttcccgt ccatcaccca tcctaagcac ttacgtgagt aaataatgca 1321 gctcagatgc tgagctctag taggaaatgc tggcatgctg attacaagat acagctgagc 1381 aatgcacaca ttttcagctg ggagtttctg ttctctggca aattcttcac tgagtctgga 1441 acaataatac cctatgatta gaactgggga aacagaactg aattgctgtg ttatatgagg 1501 aattaaaacc ttcaaatctc tatttccccc aaatactgac ccattctgga cttttgtaaa 1561 catacctagg cccctgttcc cctgagaggg tgctaagagg aaggatgaag ggcttcaggc 1621 tgggggcagt ggacagggaa ttgggatacc tggattctgg ttctgacagg gccacaagct 1681 aggatctcta acaaacgcag aaggctttgg ctcgtcattt cctcttaaaa aaggaggagc 1741 tgggcttcag ctctaagaac ttcattgccc tggggatcag acagccccta cctacccctg 1801 cccactcctc tggagactga gccttgcccg tgcatattta ggtcatttcc cacactgtct 1861 tagagaactt gtcaccagaa accacatgta tttgcatgtt ttttgttaat ttagctaaag 1921 caattgaatg tagatactca gaagaaataa aaaatgatgt t // LOCUS HSU81556 2004 bp mRNA PRI 25-DEC-1996 DEFINITION Human hypothetical protein A4 mRNA, complete cds. ACCESSION U81556 NID g1750283 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2004) AUTHORS Beutler,E., Gelbart,T., West,C., Kuhl,W. and Lee,P. TITLE A strategy for cloning the hereditary hemochromatosis gene JOURNAL Blood Cells Mol. Dis. 21 (3), 207-216 (1995) MEDLINE 96230927 REFERENCE 2 (bases 1 to 2004) AUTHORS Lee,P.L., Gelbart,T., West,C., Adams,M., Blackstone,R. and Meyer,A. TITLE Identification of 15 genes mapping to chromosome 6p21.3 spanning the microsatellite markers D6S306 and D6S1260. Characterization of 3 genes encoding zinc finger proteins JOURNAL Unpublished REFERENCE 3 (bases 1 to 2004) AUTHORS Lee,P., Kuhl,W., Gelbart,T., West,C. and Beutler,E. TITLE Direct Submission JOURNAL Submitted (15-NOV-1996) Molecular and Experimental Medicine, The Scripps Research Institute, 10550 North Torrey Pines Rd., La Jolla, CA 92037, USA FEATURES Location/Qualifiers source 1..2004 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6p21.3" CDS 596..820 /codon_start=1 /product="hypothetical protein A4" /db_xref="PID:g1750284" /translation="MAWWECPCCICMKPLLFRPHSHQNPACPPATPPQQCHSHLPQRS YSAKPSQGLFLWTRASLVIICCRVSSFTVC" BASE COUNT 444 a 523 c 477 g 558 t 2 others ORIGIN 1 ctatgatggc cagccaccgc tgagagcacg aagctgctgc tggctggcat ttttctctag 61 cgttgtggtg ccacctnccc ttatnacctt gggacaagaa gggaaggtgg ccattgtctt 121 tctctttgga atcataaagt ggaacagagt ccccagaact catgtggcca tttccgccag 181 catcactccc cggtgcctat ggggtcccgg tgtacctaaa gggagaagga ccccatgtgc 241 tagccagaaa tatactgtct cttgaaggaa agcaggagct cagactctta gagccaggtg 301 tggtttcgga cccaaggcct gacctaggct gctatcctaa tattgcagga ggggcctctc 361 ttccaagccc caccctaagg gttagccctt ggccaaatct tttgccgtct aggcccagcc 421 aggcttttct gactaaataa gcaataagag gctctaagct gactgagttg caaggaccct 481 ttccgccctc ccttggatct ccatgttttt ccagatggcg gaagagcatg tgccaccccc 541 tttcctaaca gacttgtcca agtgcttggc gtgggaccca tgaccaaagc ccaggatggc 601 ttggtgggag tgtccctgct gcatctgcat gaagcccctg ctttttaggc ctcactccca 661 tcagaaccct gcctgcccac ctgcaactcc cccccaacaa tgccattccc acttgcccca 721 gagaagctac tcggccaaac ctagccaggg tctgttcttg tggaccagag ccagcctagt 781 cattatttgc tgtcgggttt ccagtttcac cgtgtgttag ggtgagggat gattgtaaaa 841 tttgctcctc aaaggaatca ggccagactc aattttggag ggcaagacag ggaggaggcc 901 gcttcatccc agactctctt ctagggcttc ccaccatcag cccctcccac ttgagactgg 961 tctttgggag gcaataggcc accatgcctg gtcagcacca attcaagcca tgccaggaat 1021 ctgcctacct gccaggttca gttcttttaa ggtgcctctt caggggacac agtgtgtctc 1081 tctgattggg cttctaaatc aaaagcctga tgttcgtgtc cctctcatag ggggagcttt 1141 ggacacagga ccagtttgga aaagggtcag gtaagggttt ccactctgca cattgtagag 1201 ggaacactct gtaggcccat gggtccctta ctagagaggt tgagtgaatt tgccttcagt 1261 taacatggga ccttctgttt agcttcctct tgcttcccaa agattttaag cattttgtaa 1321 atgtataaac tcacctctgg taacagtggc ccagacgctg ctttgtgcta aaagcatggg 1381 aaatgtaaag gcagtctttc tctgggaaat ggatgctatt ctattctgct gcccctacct 1441 gttcctgagg cctcatttag aaagaaaatc ccctcagaag gctgtctggc acccagtgtc 1501 ctagccaggc caagtatatg agaaaggtaa gtccattttc cccttcaggt cctcagtgga 1561 ttacttaacc actgctgtcc ctcggtccct ttttcctaaa cgggtttagt tctgtctttt 1621 ttctcctttt ttctaaatgc tggtaaatat ttacattcag ccagggaaga ggaggccaga 1681 ggtcgggcca gctgccccat tcttttaacg ttgtagggcc tgcccatgga gcggaccctc 1741 ctctttgggc ctcgtgagct tttttgctta tcatgttcca tttcgtgccg ctttccccct 1801 tcaagatgcc atttggaggg taggggatct gcttcccact gtgactgggc tatgggattc 1861 tgactacctt gcttacagat tcatggtttg ataaatttgt tgtattccaa aacttgaaat 1921 gcaggacgcc attaagtgtc tgtttatatt tttggaatat ttgtattact tacaattaat 1981 taataaaagt gggtttaaaa aacc // LOCUS HSU81787 2305 bp mRNA PRI 11-APR-1997 DEFINITION Human Wnt10B mRNA, complete cds. ACCESSION U81787 NID g1932788 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2305) AUTHORS Hardiman,G., Kastelein,R.A. and Bazan,J.F. TITLE Isolation, characterization and chromosomal localization of human Wnt10B JOURNAL Cytogenet. Cell Genet. (1997) In press REFERENCE 2 (bases 1 to 2305) AUTHORS Hardiman,G. and Bazan,F. TITLE Direct Submission JOURNAL Submitted (10-DEC-1996) Molecular Biology, DNAX Research Institute, 901 California Ave., Palo Alto, CA 94304, USA FEATURES Location/Qualifiers source 1..2305 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="12" /map="12q13.1" /cell_line="HELA" gene 364..1533 /gene="Wnt10B" CDS 364..1533 /gene="Wnt10B" /codon_start=1 /product="Wnt10B" /db_xref="PID:g1932789" /translation="MLEEPRPRPPPSGLAGLLFLALCSRALSNEILGLKLPGEPPLTA NTVCLTLSGLSKRQLDLCLRNPDVTASALQGLHIAVHECQHQLRDQRWNCSALEGGGR LPHHSAILKRGFRESAFSFSMLAAGVMHAVATACSLGKLVSCGCGWKGSGEQDRLRAK LLQLQALSRGKSFPHSLPSPGPGSSPSPGPQDTWEWGGCNHDMDFGEKFSRDFLDSRE APRDIQARMRIHNNRVGRQVVTENLKRKCKCHGTSGSCQFKTCWRAAPEFRAVGAALR ERLGRAIFIDTHNRNSGAFQPRLRPRRLSGELVYFEKSPDFCERDPTMGSPGTRGRAC NKTSRLLDGCGSLCCGRGHNVLRQTRVERCHCRFHWCCYVLCDECKVTEWVNVCK" BASE COUNT 451 a 676 c 685 g 493 t ORIGIN 1 gcggccgcgt cgacggaggg gctgcagctc cgtcagcccg gcagagccac cctgagctcg 61 gtgagagcaa agccagagcc cccagtcctt tgctcgccgg cttgctatct ctctcgatca 121 ctccctccct tcctccctcc cttcctcccg gcggccgcgg cggcgctggg gaagcggtga 181 agaggagtgg cccggccctg gaagaatgcg gctctgacaa ggggacagaa cccagcgcag 241 tctccccacg gtttaagcag cactagtgaa gcccaggcaa cccaaccgtg cctgtctcgg 301 accccgcacc caaaccactg gaggtcctga tcgatctgcc caccggagcc tccgggcttc 361 gacatgctgg aggagccccg gccgcggcct ccgccctcgg gcctcgcggg tctcctgttc 421 ctggcgttgt gcagtcgggc tctaagcaat gagattctgg gcctgaagtt gcctggcgag 481 ccgccgctga cggccaacac cgtgtgcttg acgctgtccg gcctgagcaa gcggcagcta 541 gacctgtgcc tgcgcaaccc cgacgtgacg gcgtccgcgc ttcagggtct gcacatcgcg 601 gtccacgagt gtcagcacca gctgcgcgac cagcgctgga actgctccgc gcttgagggc 661 ggcggccgcc tgccgcacca cagcgccatc ctcaagcgcg gtttccgaga aagtgctttt 721 tccttctcca tgctggctgc tggggtcatg cacgcagtag ccacggcctg cagcctgggc 781 aagctggtga gctgtggctg tggctggaag ggcagtggtg agcaggatcg gctgagggcc 841 aaactgctgc agctgcaggc actgtcccga ggcaagagtt tcccccactc tctgcccagc 901 cctggccctg gctcaagccc cagccctggc ccccaggaca catgggaatg gggtggctgt 961 aaccatgaca tggactttgg agagaagttc tctcgggatt tcttggattc cagggaagct 1021 ccccgggaca tccaggcacg aatgcgaatc cacaacaaca gggtggggcg ccaggtggta 1081 actgaaaacc tgaagcggaa atgcaagtgt catggcacat caggcagctg ccagttcaag 1141 acatgctgga gggcggcccc agagttccgg gcagtggggg cggcgttgag ggagcggctg 1201 ggccgggcca tcttcattga tacccacaac cgcaattctg gagccttcca gccccgtctg 1261 cgtccccgtc gcctctcagg agagctggtc tactttgaga agtctcctga cttctgtgag 1321 cgagacccca ctatgggctc cccagggaca aggggccggg cctgcaacaa gaccagccgc 1381 ctgttggatg gctgtggcag cctgtgctgt ggccgtgggc acaacgtgct ccggcagaca 1441 cgagttgagc gctgccattg ccgcttccac tggtgctgct atgtgctgtg tgatgagtgc 1501 aaggttacag agtgggtgaa tgtgtgtaag tgagggtcag ccttaccttg gggctgggga 1561 agaggactgt gtgagagggg cgccttttca gccctttgct ctgatttcct tccaaggtca 1621 ctcttggtcc ctggaagctt aaagtatcta cctggaaaca gctttagggg tggtgggggt 1681 caggtggact ctgggatgtg tagccttctc cccaacaatt ggagggtctt gaggggaagc 1741 tgccacccct cttctgctcc ttagacacct gaatggacta agatgaaatg cactgtattg 1801 ctcctcccac ttctcaactc cagagcccct ttaaccctga ttcatactcc ttttggctgg 1861 ggagtcccta tagtttcacc actcctctcc cttgagggat aaccccaggc actgtttgga 1921 gccataagat ctgtatctag aaagagatca cccactccta tgtactatcc ccaaactcct 1981 ttactgcagc ctgggctccc tcttgtggga taatgggaga cagtggtaga gaggtttttc 2041 ttgggaaaga gacagagtgc tgaggggcac tctcccctga atcctcagag agttgtctgt 2101 ccaggccctt agggaagttg tctccttcca ttcagatgtt aatggggacc ctccaaagga 2161 aggggttttc ccatgactct tggagcctct ttttccttct tcagcaggaa gggtgggaag 2221 ggataattta tcatactgag acttgttctt ggttcctgtt tgaaactaaa ataaattaag 2281 ttactggaaa aaaaaaaaaa aaaaa // LOCUS HSU81800 1982 bp mRNA PRI 03-FEB-1998 DEFINITION Homo sapiens monocarboxylate transporter (MCT3) mRNA, complete cds. ACCESSION U81800 NID g2463633 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1982) AUTHORS Price,N.T., Jackson,V.N. and Halestrap,A.P. TITLE Cloning and sequencing of four new mammalian monocarboxylate transporter (MCT) homologues confirms the existence of a transporter family with an ancient past JOURNAL Biochem. J. 329 (2), 321-328 (1998) REFERENCE 2 (bases 1 to 1982) AUTHORS Price,N.T., Jackson,V.N. and Halestrap,A.P. TITLE Direct Submission JOURNAL Submitted (11-DEC-1996) Cellular Biochemistry, Hannah Research Institute, Mauchlin Road, Ayr KA6 5HL, UK FEATURES Location/Qualifiers source 1..1982 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="circulating blood" /sex="males" /dev_stage="24-34 years old" gene 1..1982 /gene="MCT3" CDS 63..1460 /gene="MCT3" /note="similar to Gallus gallus retinal epithelial membrane protein, encoded by GenBank Accession Number U15685" /codon_start=1 /product="monocarboxylate transporter" /db_xref="PID:g2463634" /translation="MGGAVVDEGPTGVKAPDGGWGWAVLFGCFVITGFSYAFPKAVSV FFKELIQEFGIGYSDTAWISSILLAMLYGTGPLCSVCVNRFGCRPVMLVGGLFASLGM VAASFCRSIIQVYLTTGVITGLGLALNFQPSLIMLNRYFSKRRPMANGLAAAGSPVFL CALSPLGQLLQDRYGWRGGFLILGGLLLNCCVCAALMRPLVVTAQPGSGPPRPSRRLL DLSVFRDRGFVLYAVAASVMVLGLFVPPVFVVSYAKDLGVPDTKAAFLLTILGFIDIF ARPAAGFVAGLGKVRPYSVYLFSFSMFFNGLADLAGSTAGDYGGLVVFCIFFGISYGM VGALQFEVLMAIVGTHKFSSAIGLVLLMEAVAVLVGPPSGGKLLDATHVYMYVFILAG AEVLTSSLILLLGNFFCIRKKPKEPQPEVAAAEEEKLHKPPADSGVDLREVEHFLKAE PEKNGEVVHTPETSV" BASE COUNT 295 a 613 c 666 g 408 t ORIGIN 1 ggcgagaggc gggctgaggc ggcccagcgg cggcaggtga ggcggaacca accctcctgg 61 ccatgggagg ggccgtggtg gacgagggcc ccacaggcgt caaggcccct gacggcggct 121 ggggctgggc cgtgctcttc ggctgtttcg tcatcactgg cttctcctac gccttcccca 181 aggccgtcag tgtcttcttc aaggagctca tacaggagtt tgggatcggc tacagcgaca 241 cagcctggat ctcctccatc ctgctggcca tgctctacgg gacaggtccg ctctgcagtg 301 tgtgcgtgaa ccgctttggc tgccggcccg tcatgcttgt ggggggtctc tttgcgtcgc 361 tgggcatggt ggctgcgtcc ttttgccgga gcatcatcca ggtctacctc accactgggg 421 tcatcacggg gttgggtttg gcactcaact tccagccctc gctcatcatg ctgaaccgct 481 acttcagcaa gcggcgcccc atggccaacg ggctggcggc agcaggtagc cctgtcttcc 541 tgtgtgccct gagcccgctg gggcagctgc tgcaggaccg ctacggctgg cggggcggct 601 tcctcatcct gggcggcctg ctgctcaact gctgcgtgtg tgccgcactc atgaggcccc 661 tggtggtcac ggcccagccg ggctcggggc cgccgcgacc ctcccggcgc ctgctagacc 721 tgagcgtctt ccgggaccgc ggctttgtgc tttacgccgt ggccgcctcg gtcatggtgc 781 tggggctctt cgtcccgccc gtgttcgtgg tgagctacgc caaggacctg ggcgtgcccg 841 acaccaaggc cgccttcctg ctcaccatcc tgggcttcat tgacatcttc gcgcggccgg 901 ccgcgggctt cgtggcgggg cttgggaagg tgcggcccta ctccgtctac ctcttcagct 961 tctccatgtt cttcaacggc ctcgcggacc tggcgggctc tacggcgggc gactacggcg 1021 gcctcgtggt cttctgcatc ttctttggca tctcctacgg catggtgggg gccctgcagt 1081 tcgaggtgct catggccatc gtgggcaccc acaagttctc cagtgccatt ggcctggtgc 1141 tgctgatgga ggcggtggcc gtgctcgtcg ggcccccttc gggaggcaaa ctcctggatg 1201 cgacccacgt ctacatgtac gtgttcatcc tggcgggggc cgaggtgctc acctcctccc 1261 tgattttgct gctgggcaac ttcttctgca ttaggaagaa gcccaaagag ccacagcctg 1321 aggtggcggc cgcggaggag gagaagctcc acaagcctcc tgcagactcg ggggtggact 1381 tgcgggaggt ggagcatttc ctgaaggctg agcctgagaa aaacggggag gtggttcaca 1441 ccccggaaac aagtgtctga gtggctgggc ggggccggca ggcacaggga ggaggtacag 1501 aagccggcaa cgcttgctat ttattttaca aactggactg gctcaggcag ggccacggct 1561 gggctccagc tgccggccca gcggatcgtc gcccgatcag tgttttgagg gggaaggtgg 1621 cggggtggga accgtgtcat tccagagtgg atctgcggtg aagccaagcc gcaaggttac 1681 aaggcatcct caccaggggc cccgcctgct gctcccaggt ggcctgcggc cactgctatg 1741 ctcaaggacc tggaaaccca tgcttcgaga caacgtgact ttaatgggag ggtgggtggg 1801 ccgcagacag gctggcaggg caggtgctgc gtggggccct ctccagcccg tcctaccctg 1861 ggctcacatg gggcctgtgc ccacccctct tgagtgtctt ggggacagct ctttccaccc 1921 ctggaagatg gaaataaacc tgcgtgtggg tggagtgttc tcgtgccgaa ttcaaaaagc 1981 tt // LOCUS HSU82130 1494 bp mRNA PRI 14-JAN-1997 DEFINITION Human tumor susceptiblity protein (TSG101) mRNA, complete cds. ACCESSION U82130 NID g1772663 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1494) AUTHORS Li,L., Li,X., Francke,U. and Cohen,S.N. TITLE The TSG101 tumor susceptibility gene is located in chromosome 11 band p15 and is mutated in human breast cancer JOURNAL Cell 88 (1), 143-154 (1997) MEDLINE 97148696 REFERENCE 2 (bases 1 to 1494) AUTHORS Li,L. and Cohen,S.N. TITLE Direct Submission JOURNAL Submitted (12-DEC-1996) Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA FEATURES Location/Qualifiers source 1..1494 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11p15" gene 121..1263 /gene="TSG101" CDS 121..1263 /gene="TSG101" /codon_start=1 /product="tumor susceptibility protein" /db_xref="PID:g1772664" /translation="MVSKYKYRDLTVRETVNVITLYKDLKPVLDSYVFNDGSSRELMN LTGTIPVPYRGNTYNIPICLWLLDTYPYNPPICFVKPTSSMTIKTGKHVDANGKIYLP YLHEWKHPQSDLLGLIQVMIVVFGDEPPVFSRPISASYPPYQATGPPNTSYMPGMPGG ISPYPSGYPPNPSGYPGCPYPPGGPYPATTSSQYPSQPPVTTVGPSRDGTISEDTIRA SLISAVSDKLRWRMKEEMDRAQAELNALKRTEEDLKKGHQKLEEMVTRLDQEVAEVDK NIELLKKKDEELSSALEKMENQSENNDIDEVIIPTAPLYKQILNLYAEENAIEDTIFY LGEALRRGVIDLDVFLKHVRLLSRKQFQLRALMQKARKTAGLSDLY" BASE COUNT 424 a 338 c 329 g 403 t ORIGIN 1 gaagggtgtg cgattgtgtg ggacggtctg gggcagccca gcagcggctg accctctgcc 61 tgcggggaag ggagtcgcca ggcggccgtc atggcggtgt cggagagcca gctcaagaaa 121 atggtgtcca agtacaaata cagagaccta actgtacgtg aaactgtcaa tgttattact 181 ctatacaaag atctcaaacc tgttttggat tcatatgttt ttaacgatgg cagttccagg 241 gaactaatga acctcactgg aacaatccct gtgccttata gaggtaatac atacaatatt 301 ccaatatgcc tatggctact ggacacatac ccatataatc cccctatctg ttttgttaag 361 cctactagtt caatgactat taaaacagga aagcatgttg atgcaaatgg gaagatatat 421 cttccttatc tacatgaatg gaaacaccca cagtcagact tgttggggct tattcaggtc 481 atgattgtgg tatttggaga tgaacctcca gtcttctctc gtcctatttc ggcatcctat 541 ccgccatacc aggcaacggg gccaccaaat acttcctaca tgccaggcat gccaggtgga 601 atctctccat acccatccgg ataccctccc aatcccagtg gttacccagg ctgtccttac 661 ccacctggtg gtccatatcc tgccacaaca agttctcagt acccttctca gcctcctgtg 721 accactgttg gtcccagtag ggatggcaca atcagcgagg acaccatccg agcctctctc 781 atctctgcgg tcagtgacaa actgagatgg cggatgaagg aggaaatgga tcgtgcccag 841 gcagagctca atgccttgaa acgaacagaa gaagacctga aaaagggtca ccagaaactg 901 gaagagatgg ttacccgttt agatcaagaa gtagccgagg ttgataaaaa catagaactt 961 ttgaaaaaga aggatgaaga actcagttct gctctggaaa aaatggaaaa tcagtctgaa 1021 aacaatgata tcgatgaagt tatcattccc acagctccct tatacaaaca gatcctgaat 1081 ctgtatgcag aagaaaacgc tattgaagac actatctttt acttgggaga agccttgaga 1141 aggggcgtga tagacctgga tgtcttcctg aagcatgtac gtcttctgtc ccgtaaacag 1201 ttccagctga gggcactaat gcaaaaagca agaaagactg ccggtctcag tgacctctac 1261 tgacttctct gataccagct ggaggttgag ctcttcttaa agtattcttc tcttcctttt 1321 atcagtaggt gcccagaata agttattgca gtttatcatt caagtgtaaa atattttgaa 1381 tcaataatat attttctgtt ttcttttggt aaagactggc ttttattaat gcactttcta 1441 tcctctgtaa actttttgtg ctgaatgttg ggactgctaa ataaaatttg tttt // LOCUS HSU82169 2184 bp mRNA PRI 26-MAR-1997 DEFINITION Human frizzled homolog (FZD3) mRNA, complete cds. ACCESSION U82169 NID g1906597 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2184) AUTHORS Wang,Y.K., Samos,C.H., Peoples,R., Perez-Jurado,L.A., Nusse,R. and Francke,U. TITLE A novel human homologue of the Drosophila frizzled wnt receptor gene binds wingless protein and is in the Williams syndrome deletion at 7q11.23 JOURNAL Hum. Mol. Genet. 6 (3), 465-472 (1997) MEDLINE 97227293 REFERENCE 2 (bases 1 to 2184) AUTHORS Wang,Y.-K., Peoples,R., Perez-Jurado,L.A. and Francke,U. TITLE Direct Submission JOURNAL Submitted (12-DEC-1996) Howard Hughes Medical Institute, Stanford Medical Center, Beckman Center B201, Stanford, CA 94305, USA FEATURES Location/Qualifiers source 1..2184 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" /map="7q11.23" gene 25..1801 /gene="FZD3" CDS 26..1801 /gene="FZD3" /note="similar to Drosophila frizzled; seven transmembrane receptor" /codon_start=1 /product="frizzled homolog" /db_xref="PID:g1906598" /translation="MAVAPLRGALLLWQLLAAGGAALEIGRFDPERGRGAAPCQAVEI PMCRGIGYNLTRMPNLLGHTSQGEAAAELAEFAPLVQYGCHSHLRFFLCSLYAPMCTD QVSTPIPACRPMCEQARLRCAPIMEQFNFGWPDSLDCARLPTRNDPHALCMEAPENAT AGPAEPHKGLGMLPVAPRPARPPGDLGPGAGGSGTCENPEKFQYVEKSRSCAPRCGPG VEVFWSRRDKDFALVWMAVWSALCFFSTAFTVLTFLLEPHRFQYPERPIIFLSMCYNV YSLAFLIRAVAGAQSVACDQEAGALYVIQEGLENTGCTLVFLLLYYFGMASSLWWVVL TLTWFLAAGKKWGHEAIEAHGSYFHMAAWGLPALKTIVILTLRKVAGDELTGLCYVAS TDAAALTGFVLVPLSGYLVLGSSFLLTGFVALFHIRKIMKTGGTNTEKLEKLMVKIGV FSILYTVPATCVIVCYVYERLNMDFWRLRATEQPCAAAAGPGGRRDCSLPGGSVPTVA VFMLKIFMSLVVGITSGVWVWSSKTFQTWQSLCYRKIAAGRARAKACRAPGSYGRGTH CHYKAPTVVLHMTKTDPSLENPTHL" BASE COUNT 329 a 737 c 729 g 389 t ORIGIN 1 ccgccttcgg cccgggcctc ccgggatggc cgtggcgcct ctgcgggggg cgctgctgct 61 gtggcagctg ctggcggcgg gcggcgcggc actggagatc ggccgcttcg acccggagcg 121 cgggcgcggg gctgcgccgt gccaggcggt ggagatcccc atgtgccgcg gcatcggcta 181 caacctgacc cgcatgccca acctgctggg ccacacgtcg cagggcgagg cggctgccga 241 gctagcggag ttcgcgccgc tggtgcagta cggctgccac agccacctgc gcttcttcct 301 gtgctcgctc tacgcgccca tgtgcaccga ccaggtctcg acgcccattc ccgcctgccg 361 gcccatgtgc gagcaggcgc gcctgcgctg cgcgcccatc atggagcagt tcaacttcgg 421 ctggccggac tcgctcgact gcgcccggct gcccacgcgc aacgacccgc acgcgctgtg 481 catggaggcg cccgagaacg ccacggccgg ccccgcggag ccccacaagg gcctgggcat 541 gctgcccgtg gcgccgcggc ccgcgcgccc tcccggagac ctgggcccgg gcgcgggcgg 601 cagtggcacc tgcgagaacc ccgagaagtt ccagtacgtg gagaagagcc gctcgtgcgc 661 accgcgctgc gggcccggcg tcgaggtgtt ctggtcccgg cgcgacaagg acttcgcgct 721 ggtctggatg gccgtgtggt cggcgctgtg cttcttctcc accgccttca ctgtgctcac 781 cttcttgctg gagccccacc gcttccagta ccccgagcgc cccatcatct tcctctccat 841 gtgctacaac gtctactcgc tggccttcct gatccgtgcg gtggccggag cgcagagcgt 901 ggcctgtgac caggaggcgg gcgcgctcta cgtgatccag gagggcctgg agaacacggg 961 ctgcacgctg gtcttcctac tgctctacta cttcggcatg gccagctcgc tctggtgggt 1021 ggtcctgacg ctcacctggt tcctggctgc cgggaagaaa tggggccacg aggccatcga 1081 ggcccacggc agctatttcc acatggctgc ctggggcctg cccgcgctca agaccatcgt 1141 catcctgacc ctgcgcaagg tggcgggtga tgagctgact gggctttgct acgtggccag 1201 cacggatgca gcagcgctca cgggcttcgt gctggtgccc ctctctggct acctggtgct 1261 gggcagtagt ttcctcctga ccggcttcgt ggccctcttc cacatccgca agatcatgaa 1321 gacgggcggc accaacacag agaagctgga gaagctcatg gtcaagatcg gggtcttctc 1381 catcctctac acggtgcccg ccacctgcgt catcgtttgc tatgtctacg aacgcctcaa 1441 catggacttc tggcgccttc gggccacaga gcagccatgc gcagcggccg cggggcccgg 1501 aggccggagg gactgctcgc tgccaggggg ctcggtgccc accgtggcgg tcttcatgct 1561 caaaattttc atgtcactgg tggtggggat caccagcggc gtctgggtgt ggagctccaa 1621 gactttccag acctggcaga gcctgtgcta ccgcaagata gcagctggcc gggcccgggc 1681 caaggcctgc cgcgcccccg ggagctacgg acgtggcacg cactgccact ataaggctcc 1741 caccgtggtc ttgcacatga ctaagacgga cccctctttg gagaacccca cacacctcta 1801 gccacacagg cctggcgcgg ggtggctgct gccccctcct tgccctccac gccctgcccc 1861 ctgcatcccc tagagacagc tgactagcag ctgcccagct gtcaaggtca ggcaagtgag 1921 caccggggac tgaggatcag ggcgggaccc cgtgaggctc attaggggag atgggggtct 1981 cccctaatgc gggggctgga ccaggctgag tccccacagg gtcctagtgg aggatgtgga 2041 ggggcggggc agaggggtcc agccggagtt tatttaatga tgtaatttat tgttgcgttc 2101 ctctggaagc tgtgactgga ataaaccccc gcgtggcact gctgatcctc tctggctggg 2161 aagggggaag gtaggaggtg aggc // LOCUS HSU82226 855 bp mRNA PRI 25-FEB-1997 DEFINITION Human calcium and integrin binding protein CIB mRNA, complete cds. ACCESSION U82226 NID g1848270 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 855) AUTHORS Naik,U.P., Patel,P.M. and Parise,L.V. TITLE Identification of a novel calcium-binding protein that interacts with the integrin alphaIIb cytoplasmic domain JOURNAL J. Biol. Chem. 272 (8), 4651-4654 (1997) MEDLINE 97184102 REFERENCE 2 (bases 1 to 855) AUTHORS Naik,U.P. and Parise,L.V. TITLE Direct Submission JOURNAL Submitted (13-DEC-1996) Pharmacology, The University of North Carolina at Chapel Hill, 1106 FLOB, CB# 7365, Chapel Hill, NC 27599, USA FEATURES Location/Qualifiers source 1..855 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="liver" CDS 47..622 /note="similar to Rattus norvegicus calcineurin B-like protein encoded by the sequence presented in GenBank Accession Number D10393 and to Rattus norvegicus phosphoprotein phosphatase beta chain: PIR Accession Number PS0261" /codon_start=1 /product="calcium and integrin binding protein CIB" /db_xref="PID:g1848271" /translation="MGGSGSRLSKELLAEYQDLTFLTKQEILLAHRRFCELLPQEQRS VESSLRAQVPFEQILSLPELKANPFKERICRVFSTSPAKDSLSFEDFLDLLSVFSDTA TPDIKSHYAFRIFDFDDDGTLNREDLSRLVNCLTGEGEDTRLSASEMKQLIDNILEES DIDRDGTINLSEFQHVISRSPDFASSFKIVL" misc_feature 392..427 /note="encodes an EF-hand motif, putative calcium binding domain" misc_feature 527..562 /note="encodes an EF-hand motif; putative calcium binding domain" BASE COUNT 168 a 249 c 246 g 192 t ORIGIN 1 tctgcgtctc gaggcgagtt ggcggagctg tgcgcgcggc ggggcgatgg ggggctcggg 61 cagtcgcctg tccaaggagc tgctggccga gtaccaggac ttgacgttcc tgacgaagca 121 ggagatcctc ctagcccaca ggcggttttg tgagctgctt ccccaggagc agcggagcgt 181 ggagtcgtca cttcgggcac aagtgccctt cgagcagatt ctcagccttc cagagctcaa 241 ggccaacccc ttcaaggagc gaatctgcag ggtcttctcc acatccccag ccaaagacag 301 ccttagcttt gaggacttcc tggatctcct cagtgtgttc agtgacacag ccacgccaga 361 catcaagtcc cattatgcct tccgcatctt tgactttgat gatgacggaa ccttgaacag 421 agaagacctg agccggctgg tgaactgcct cacgggagag ggcgaggaca cacggcttag 481 tgcgtctgag atgaagcagc tcatcgacaa catcctggag gagtctgaca ttgacaggga 541 tggaaccatc aacctctctg agttccagca cgtcatctcc cgttctccag actttgccag 601 ctcctttaag attgtcctgt gacagcagcc ccagcgtgtg tcctggcacc ctgtccaaga 661 acctttctac tgctgagctg tggccaaggt caagcctgtg ttgccagtgc gggccaagct 721 ggcccagcct ggagctggcg ctgtgcagcc tcaccccggg caggggcggc cctcgttgtc 781 agggcctctc ctcactgctg ttgtcattgc tccgtttgtg tttgtactaa tcagtaataa 841 aggtttagaa gtttg // LOCUS HSU82381 2389 bp mRNA PRI 13-DEC-1997 DEFINITION Human proline dehydrogenase/proline oxidase (PRODH) mRNA, complete cds. ACCESSION U82381 NID g2677801 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2389) AUTHORS Campbell,H.D., Webb,G.C. and Young,I.G. TITLE A human homologue of the Drosophila melanogaster sluggish-A (proline oxidase) gene maps to 22q11.2, and is a candidate gene for type-I hyperprolinaemia JOURNAL Hum. Genet. 101 (1), 69-74 (1997) MEDLINE 98046348 REFERENCE 2 (bases 1 to 2389) AUTHORS Campbell,H.D., Webb,G.C. and Young,I.G. TITLE Direct Submission JOURNAL Submitted (16-DEC-1996) MES and Centre for Molecular Structure and Function, RSBS, Australian National University, PO Box 475, Canberra, ACT 2601, Australia FEATURES Location/Qualifiers source 1..2389 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="22" /map="22q11.2" gene 1..2389 /gene="PRODH" CDS 447..1997 /gene="PRODH" /codon_start=1 /product="proline dehydrogenase/proline oxidase" /db_xref="PID:g2677802" /translation="MLEFVMREWKKSRKLLGQRLFNKLMKMTFYGHFVAGEDQESIQP LLRHYRAFGVSAILDYGVEEDLSPEEAEHKEMESCSSAAERDGSGTNKRDKQYQAHRA FGDRRNGVISARTYFYANEAKCDSHMETFLRCIEASGRVSDDGFIAIKLTALGRPQFL LQFSEVLAKWRCFFHQMAVEQGQAGLAAMDTKLEVAVLQESVAKLGIASRAEIEDWFT AETLGVSGTMDLLDWSSLIDSRTKLSKHLVVPNAQTGQLEPLLSRFTEEEELQMTRML QRMDVLAKKATEMGVRLMVDAEQTYFQPAISRLTLEMQRKFNVEKPLIFNTYQCYLKD AYDNVTLDVELARREGWCFGAKLVRGAYLAQERARAAEIGYEDPINPTYEATNAMYHR CLDYVLEELKHNAKAKVMVASHNEDTVRFALRRMEELGLHPADHQVYFGQLLGMCDQI SFPLGQAGYPVYKYVPYGPVMEVLPYLSRRALENSSLMKGTHRERQLLWLELLRRLRT GNLFHRPA" polyA_signal 2371..2376 /gene="PRODH" /evidence=experimental polyA_site 2389 /gene="PRODH" /evidence=experimental BASE COUNT 541 a 648 c 749 g 451 t ORIGIN 1 taatgagagg gaaaacaagt atgaagctgt gtggctgaaa ccgtcttggc aagttaggga 61 aagaaaacgg aagtcactgg ggctgatcac agtgctaagc atgagagcac tgcaagatga 121 ggtcacggag gtgggcaggg accggcttgt gccaggcctt gctggcaggg tgaagagttt 181 gccttttctc tgcgtacaat ggaaaggaga agaggtttta agcaagagaa tggcttggtc 241 atgtgtatgt ctttgagaca ccctggctag tctatgtatg atgcaaaagg tgggtggggc 301 agggtgacaa gaaaatactg ttccggagct tcctgtggct gtgcctataa gaggtggtgg 361 tggtggtgtg gaaggaggtg tggcagtgaa taaacagaga tgtagaaaca gcgtgtacat 421 atattttaag gaacactgag gacgtgatgc tggaatttgt gatgagagag tggaaaaaat 481 ccaggaaact tctaggacag aggctattca acaagctcat gaagatgacc ttctatgggc 541 attttgtagc cggggaggac caggagtcca tccagcccct gcttcggcac tacagggcct 601 tcggtgtcag cgccatcctg gactatggag tggaggagga cctgagcccc gaggaggcag 661 agcacaagga gatggagtcc tgctcctcgg ctgcggagag ggatggcagt ggcacgaata 721 agcgggacaa gcaataccag gcccaccggg ctttcgggga ccgcaggaat ggtgtcatca 781 gtgcccgcac ctacttctac gccaatgagg ccaagtgcga cagccacatg gagacattct 841 tgcgctgcat cgaagcctca ggtagagtca gcgatgacgg cttcatagcc attaagctca 901 cagcactggg gagaccccag tttctgctgc agttctcaga ggtgctggcc aagtggaggt 961 gcttctttca ccaaatggct gtggagcaag ggcaggcggg cctggctgcc atggacacca 1021 agctggaggt ggcggtgctg caggaaagtg tcgcaaagtt gggcatcgca tccagggctg 1081 agattgagga ctggttcacg gcagagaccc tgggagtgtc tggcaccatg gacctgctgg 1141 actggagcag cctcatcgac agcaggacca agctgtccaa gcacctggta gtccccaacg 1201 cacagacagg acagctggag cccctgctgt cccggttcac tgaggaggag gagctacaga 1261 tgaccaggat gctacagcgg atggatgtcc tggccaagaa agccacagag atgggcgtgc 1321 ggctgatggt ggatgccgag cagacctact tccagccggc catcagccgc ctgacgctgg 1381 agatgcagcg gaagttcaat gtggagaagc cgctcatctt caacacatac cagtgctacc 1441 tcaaggatgc ctatgacaat gtgaccctgg acgtggagct ggctcgccgt gagggctggt 1501 gttttggggc caagctggtg cggggcgcat acctggccca ggagcgagcc cgtgcggcag 1561 agatcggcta tgaggacccc atcaacccca cgtacgaggc caccaacgcc atgtaccaca 1621 ggtgcctgga ctacgtgttg gaggagctga agcacaacgc caaggccaag gtgatggtgg 1681 cctcccacaa tgaggacaca gtgcgcttcg cactgcgcag gatggaggag ctgggcctgc 1741 atcctgctga ccaccaggtg tactttggac agctgctagg catgtgtgac cagatcagct 1801 tcccgctggg ccaggccggc taccccgtgt acaagtacgt gccctatggc cccgtgatgg 1861 aggtgctgcc ctacttgtcc cgccgtgccc tggagaacag cagcctcatg aagggcaccc 1921 atcgggagcg gcagttgctg tggctggagc tcttgaggcg gctccgaact ggcaacctct 1981 tccatcgccc tgcctagcac ccgccagcac accctcagcc tccagcaccc cccgcccccg 2041 cccaggccat caccacagct gcagccaacc ccatcctcac acagattcac cttttttcac 2101 cccacacttg cagagctgct ggaggtgagg tcaggtgcct cccagccctg cccagagtat 2161 gggcactcag gtgtgggccg aacctgatac ctgcctggga cagccactgg aaacttttgg 2221 gaactctcct cgaatgtgtg gcccaaggcc cccacctctg tgacccccat gtccttggac 2281 ctagaggatt gtccaccttc tgccaaggcc agcccacaca gcccgagccc cttggggagc 2341 agtggccggg ctggggaggc ctgcctggtc aataaaccac tgttcctgc // LOCUS HSU82469 1733 bp mRNA PRI 06-MAY-1997 DEFINITION Human tubby related protein 2 (TULP2) mRNA, complete cds. ACCESSION U82469 NID g2072163 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1733) AUTHORS North,M.A., Naggert,J.K., Yan,Y., Noben-Trauth,K. and Nishina,P.M. TITLE Molecular characterization of TUB, TULP1, and TULP2, members of the novel tubby gene family and their possible relation to ocular diseases JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (7), 3128-3133 (1997) MEDLINE 97250501 REFERENCE 2 (bases 1 to 1733) AUTHORS North,M.A., Naggert,J.K., Yan,Y., Noben-Trauth,K. and Nishina,P.M. TITLE Direct Submission JOURNAL Submitted (17-DEC-1996) The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, USA FEATURES Location/Qualifiers source 1..1733 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="19" /map="19q13.1" /tissue_type="brain" gene 94..1656 /gene="TULP2" CDS 94..1656 /gene="TULP2" /note="related to mouse tub" /codon_start=1 /product="tubby related protein 2 TULP2" /db_xref="PID:g2072164" /translation="MSQDNDTLMRDILGHELAAMRLQKLEQQRRLFEKKQRQKRQELL MVQANPDASPWLWRSCLREERLLGDRGLGNPFLRKKVSEAHLPSGIHSALGTVSCGGD GRGERGLPTPRTEAVFRNLGLQSPFLSWLPDNSDAELEEVSVENGSVSPPPFKQSPRI RRKGWQAHQRPGTRAEGESDSQDMGDAHKSPNMGPNPGMDGDCVYENLAFQKEEDLEK KREASESTGTNSSAAHNEELSKALKGEGGTDSDHMRHEASLAIRSPCPGLEEDMEAYV LRPALPGTMMQCYLTRDKHGVDKGLFPLYYLYLETSDSLQRFLLAGRKRRRSKTSNYL ISLDPTLLSRDGDNFVGKVRSNVFSTKFTIFDNGVNPDREHLTRNTARIRQELGAVCY EPNVLGYLGPRKMTVILPGTNSQNQRINVQPLNEQESLLSRYQRGDKQGLLLLHNKTP SWDKENGVYTLNFHGRVTRASVKNFQIVDPKHQEHLVLQFGRVGPDTFTMDFCFPFSP LQAFSICLSSFN" 3'UTR 1657..1733 BASE COUNT 449 a 480 c 455 g 349 t ORIGIN 1 ggaatcctcc ctccctctga gccgtctttc ttctcctccc tatttcgcag atatcccgag 61 attaggtccc cagcttccaa agagaggatc agaatgtctc aggataatga cacattgatg 121 agagacatcc tggggcatga gctcgctgct atgaggctgc agaagctgga acagcagcgg 181 cggctgtttg aaaagaagca gcgacagaag cgccaggagc tcctcatggt tcaggccaat 241 cctgacgctt ccccgtggct ttggcgctct tgtctgcggg aggagcgcct tttaggtgac 301 agaggccttg ggaacccttt cctccggaag aaagtgtcag aggcacatct gccctctggc 361 atccacagtg ccctgggcac cgtgagctgt ggtggagacg gcaggggcga gcgcggcctc 421 ccgacaccgc ggacagaagc agtgttcagg aatctcggtc tccagtcccc tttcttatcc 481 tggctcccag acaattccga tgcagaattg gaggaagtct ccgtggagaa tggttccgtc 541 tctcccccac cttttaaaca gtctccgaga atccgacgca agggttggca agcccaccaa 601 cgacctggga cccgtgcaga gggtgagagt gactcccagg atatgggaga tgcacacaag 661 tcacccaata tgggaccaaa ccctggaatg gatggtgact gtgtatatga aaacttggcc 721 ttccaaaagg aagaagactt ggaaaagaag agagaggcct ctgagtctac agggacgaac 781 tcctcagcag cacacaacga agagttgtcc aaggccctga aaggcgaggg tggcacggac 841 agcgaccata tgaggcacga agcctccttg gcaatccgct ccccctgccc tgggctggag 901 gaggacatgg aagcctacgt gctgcggcca gcgctcccgg gcaccatgat gcagtgctac 961 ctcacccgtg acaagcacgg cgtggacaag ggcttgttcc ccctctacta cctctacctg 1021 gagacctctg acagcctgca gcgcttcctc ctggctgggc gaaagagaag aaggagcaaa 1081 acttctaatt acctcatctc cctggatcct acactcctat ctcgggacgg ggacaatttc 1141 gtgggcaaag tcagatccaa tgtcttcagc accaagttca ccatctttga caatggggtg 1201 aatcctgacc gggagcattt aaccaggaat actgcccgga tcagacagga gctgggggct 1261 gtgtgttatg agcccaacgt cttaggatac ctggggcctc ggaaaatgac tgtgattctc 1321 ccaggaacca acagccagaa ccagcgaatc aatgtccagc cactaaatga acaggagtcg 1381 ctactgagtc gttaccaacg tggggacaaa caagggttgc ttttgttgca caacaaaacc 1441 ccgtcgtggg acaaggagaa cggtgtctac acgctcaatt tccatggtcg agtcactcgg 1501 gcttcggtga agaacttcca aatcgtggat cccaaacacc aagaacatct ggtgctccag 1561 ttcggccgag tgggcccaga cacattcacc atggacttct gctttccatt tagcccgctc 1621 caggccttca gcatctgctt gtccagtttc aattagaagc tggctgttga ataactcaat 1681 aaaataccat acccttgcca gcaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa // LOCUS HSU82532 802 bp mRNA PRI 15-MAY-1997 DEFINITION Human GDI-dissociation inhibitor RhoGDIgammma mRNA, complete cds. ACCESSION U82532 NID g1772912 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 802) AUTHORS Adra,C.N., Manor,D., Ko,J.L., Zhu,S., Horiuchi,T., Van Aelst,L., Cerione,R.A. and Lim,B. TITLE RhoGDIgamma: a GDP-dissociation inhibitor for Rho proteins with preferential expression in brain and pancreas JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (9), 4279-4284 (1997) MEDLINE 97272214 REFERENCE 2 (bases 1 to 802) AUTHORS Adra,C.N., Manor,D., Ko,J.L., Zhu,S., Horiuchi,T., Aelst,L.V., Cerione,R.A. and Lim,B. TITLE Direct Submission JOURNAL Submitted (17-DEC-1996) Medicine, Harvard Medical School, Harvard Institutes of Medicine, Beth Israel Deaconess Medical Center, Hematology/Oncology Division, 330 Brookline Ave., Boston, MA 02215, USA FEATURES Location/Qualifiers source 1..802 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" /map="16p13.3" /tissue_type="brain" CDS 39..716 /function="GDI-dissociation inhibitor" /codon_start=1 /product="RhoGDIgammma" /db_xref="PID:g1772913" /translation="MLGLDACELGAQLLELLRLALCARVLLADKEGGPPAVDEVLDEA VPEYRAPGRKSLLEIRQLDPDDRSLAKYKRVLLGPLPPAVDPSLPNVQVTRLTLLSEQ APGPVVMDLTGNLAVLKDQVFVLKEGVDYRVKISFKVHREIVSGLKCLHHTYRRGLRV DKTVYMVGSYGPSAQEYEFVTPVEEAPRGALVRGPYLVVSLFTDDDRTHHLSWEWGLC ICQDWKD" BASE COUNT 133 a 247 c 285 g 137 t ORIGIN 1 cggggcgggc ggcggctcct cggcggccgc gcgccgccat gctgggcctg gacgcgtgcg 61 agctgggggc gcagctgctg gagctgctcc ggctggcgct gtgcgcccga gtcctcctgg 121 ctgacaagga gggtgggccg ccggcagtgg acgaggtgtt ggatgaggct gtgcccgagt 181 accgggcgcc ggggaggaag agcctcttgg agatccggca gctggacccg gacgacagga 241 gcctggccaa gtacaagcgg gtgctgctgg ggcccctgcc accggccgtg gacccaagcc 301 tgcccaatgt gcaggtgacc aggctgacac tcctgtcgga acaggctccg gggcccgtcg 361 tcatggatct cacagggaac ctggctgttc tgaaggacca ggtgtttgtc ctgaaggaag 421 gtgttgatta cagagtgaag atctccttca aggtccacag ggagattgtc agcggcctca 481 agtgtctgca ccacacctac cgccggggcc tgcgcgtgga caagaccgtc tacatggtgg 541 gcagctatgg cccgagcgcc caggagtatg agtttgtgac tccggtggag gaagcgccga 601 ggggtgcgct ggtgcggggc ccctatctgg tggtgtccct cttcaccgac gatgacagga 661 cgcaccacct gtcctgggag tggggtctct gcatctgcca ggactggaag gactgaaccc 721 ccagtccgtg tctcccctga cctccctcag ttgttgcaca gggaccccca agcatcccca 781 gcaccccccg tgagtgacca ga // LOCUS HSU82535 2063 bp mRNA PRI 03-JUN-1997 DEFINITION Human fatty acid amide hydrolase mRNA, complete cds. ACCESSION U82535 NID g2149155 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2063) AUTHORS Giang,D.K. and Cravatt,B.F. TITLE Molecular characterization of human and mouse fatty acid amide hydrolases JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (6), 2238-2242 (1997) MEDLINE 97225936 REFERENCE 2 (bases 1 to 2063) AUTHORS Cravatt,B.F. and Giang,D.K. TITLE Direct Submission JOURNAL Submitted (17-DEC-1996) The Skaggs Institute of Chemical Biology, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, CA 92037, USA FEATURES Location/Qualifiers source 1..2063 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 36..1775 /codon_start=1 /product="fatty acid amide hydrolase" /db_xref="PID:g2149156" /translation="MVQYELWAALPGASGVALACCFVAAAVALRWSGRRTARGAVVRA RQKQRAGLENMDRAAQRFRLQNPDLDSEALLALPLPQLVQKLHSRELAPEAVLFTYVG KAWEVNKGTNCVTSYLADCETQLSQAPRQGLLYGVPVSLKECFTYKGQDSTLGLSLNE GVPAECDSVVVHVLKLQGAVPFVHTNVPQSMFSYDCSNPLFGQTVNPWKSSKSPGGSS GGEGALIGSGGSPLGLGTDIGGSIRFPSSFCGICGLKPTGNRLSKSGLKGCVYGQEAV RLSVGPMARDVESLALCLRALLCEDMFRLDPTVPPLPFREEVYTSSQPLRVGYYETDN YTMPSPAMRRAVLETKQSLEAAGHTLVPFLPSNIPHALETLSTGGLFSDGGHTFLQNF KGDFVDPCLGDLVSILKLPQWLKGLLAFLVKPLLPRLSAFLSNMKSRSAGKLWELQHE IEVYRKTVIAQWRALDLDVVLTPMLAPALDLNAPGRATGAVSYTMLYNCLDFPAGVVP VTTVTAEDEAQMEHYRGYFGDIWDKMLQKGMKKSVGLPVAVQCVALPWQEELCLRFMR EVERLMTPEKQSS" BASE COUNT 399 a 629 c 641 g 394 t ORIGIN 1 tgccgggcgg taggcagcag caggctgaag ggatcatggt gcagtacgag ctgtgggccg 61 cgctgcctgg cgcctccggg gtcgccctgg cctgctgctt cgtggcggcg gccgtggccc 121 tgcgctggtc cgggcgccgg acggcgcggg gcgcggtggt ccgggcgcga cagaagcagc 181 gagcgggcct ggagaacatg gacagggcgg cgcagcgctt ccggctccag aacccagacc 241 tggactcaga ggcgctgcta gccctgcccc tgcctcagct ggtgcagaag ttacacagta 301 gagagctggc ccctgaggcc gtgctcttca cctatgtggg aaaggcctgg gaagtgaaca 361 aagggaccaa ctgtgtgacc tcctatctgg ctgactgtga gactcagctg tctcaggccc 421 caaggcaggg cctgctctat ggcgtccctg tgagcctcaa ggagtgcttc acctacaagg 481 gccaggactc cacgctgggc ttgagcctga atgaaggggt gccggcggag tgcgacagcg 541 tagtggtgca tgtgctgaag ctgcagggtg ccgtgccctt cgtgcacacc aatgttccac 601 agtccatgtt cagctatgac tgcagtaacc ccctctttgg ccagaccgtg aacccatgga 661 agtcctccaa aagcccaggg ggctcctcag ggggtgaagg ggccctcatc gggtctggag 721 gctcccccct gggcttaggc actgatatcg gaggcagcat ccgcttcccc tcctccttct 781 gcggcatctg cggcctcaag cccacaggga accgcctcag caagagtggc ctgaagggct 841 gtgtctatgg acaggaggca gtgcgtctct ccgtgggccc catggcccgg gacgtggaga 901 gcctggcact gtgcctgcga gccctgctgt gcgaggacat gttccgcttg gaccccactg 961 tgcctccctt gcccttcaga gaagaggtct acaccagctc tcagcccctg cgtgtggggt 1021 actatgagac tgacaactat accatgccct ccccggccat gaggcgggcc gtgctggaga 1081 ccaaacagag ccttgaggct gcggggcaca cgctggttcc cttcttgcca agcaacatac 1141 cccatgctct ggagaccctg tcaacaggtg ggctcttcag tgatggtggc cacaccttcc 1201 tacagaactt caaaggtgat ttcgtggacc cctgcctggg ggacctggtc tcaattctga 1261 agcttcccca atggcttaaa ggactgctgg ccttcctggt gaagcctctg ctgccaaggc 1321 tgtcagcttt cctcagcaac atgaagtctc gttcggctgg aaaactctgg gaactgcagc 1381 acgagatcga ggtgtaccgc aaaaccgtga ttgcccagtg gagggcgctg gacctggatg 1441 tggtgctgac ccccatgctg gcccctgctc tggacttgaa tgccccaggc agggccacag 1501 gggccgtcag ctacactatg ctgtacaact gcctggactt ccctgcaggg gtggtgcctg 1561 tcaccacggt gactgctgag gacgaggccc agatggaaca ttacaggggc tactttgggg 1621 atatctggga caagatgctg cagaagggca tgaagaagag tgtggggctg ccggtggccg 1681 tgcagtgtgt ggctctgccc tggcaagaag agttgtgtct gcggttcatg cgggaggtgg 1741 agcgactgat gacccctgaa aagcagtcat cctgatggct ctggctccag aggacctgag 1801 actcacactc tctgcagccc agcctagtca gggcacagct gccctgctgc cacagcaagg 1861 aaatgtcctg catggggcag aggcttccgt gtcctctccc ccaaccccct gcaagaagcg 1921 ccgactccct gagtctggac ctccatccct gctctggtcc cctctcttcg tcctgatccc 1981 tccaccccca tgtggcagcc catgggtatg acataggcca aggcccaact aacagtcaag 2041 aaacaaaaaa aaaaaaaaaa aaa // LOCUS HSU82613 757 bp mRNA PRI 13-JAN-1997 DEFINITION Human DNA-binding protein ABP/ZF mRNA, complete cds. ACCESSION U82613 NID g1773066 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 757) AUTHORS Tomilin,N. and Boyko,V. TITLE Human cDNA encoding a DNA-binding protein JOURNAL Mol. Gen. Mikrobiol. Virusol. (1997) In press REFERENCE 2 (bases 1 to 757) AUTHORS Tomilin,N. and Boyko,V. TITLE Direct Submission JOURNAL Submitted (16-DEC-1996) Chromosome Stability, Institute of Cytology, Tikchoretskii Av.4, St. Petersburg 194064, Russia FEATURES Location/Qualifiers source 1..757 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lymphocytes" CDS 365..685 /note="zinc-finger protein p11" /codon_start=1 /product="DNA-binding protein ABP/ZF" /db_xref="PID:g1773067" /translation="MYVENHCSRPALLLQLWGRGSPAQARGWQGVRNSPVACSSPFRQ EHCMSEHFKNRPACLGARSPPQGHKWGESPSQGTQAGAGKCRACGKRVSEGDRNGSGG GKWG" BASE COUNT 197 a 201 c 219 g 140 t ORIGIN 1 gggccacacc cggggctctg aggatttgga caaagactca gtggaaaaac tagagctggg 61 ctgtcccttc agcccccacc tgtcccttcc tatgccctca gtgtctcgaa gtacctcccg 121 cagcagtgcc aattgggaaa ggcttcggca agggaccctg aggagagacc tgcgtgggat 181 aatcaacagg ggtctggagg acggggagag ctgggaatat cagatctgac tgcgtgttct 241 cacttcgctt cctggaactt gctctcattt tcctgggtgc atcaaacaaa acaaaaacca 301 aacacccaga ggtctcatct cccaggcccc aggggagaaa gaggagtagc atgaacgcca 361 aggaatgtac gttgagaatc actgctccag gcctgcatta ctccttcagc tctggggcag 421 aggaagccca gcccaagcac ggggctggca gggcgtgagg aactctcctg tggcctgctc 481 atcacccttc cgacaggagc actgcatgtc agagcacttt aaaaacaggc cagcctgctt 541 gggcgctcgg tctccacccc agggtcataa gtggggagag agcccttccc agggcaccca 601 ggcaggtgca gggaagtgca gagcttgtgg aaagcgtgtg agtgagggag acaggaacgg 661 ctctgggggt gggaagtggg gctaggtctt gccaactcca tcttcaataa agtcgttttc 721 ggatccctaa aaaaaaaaaa aaaaaaaaaa aaaaccc // LOCUS HSU82761 2258 bp mRNA PRI 08-FEB-1998 DEFINITION Homo sapiens S-adenosyl homocysteine hydrolase homolog (XPVkona) mRNA, complete cds. ACCESSION U82761 NID g2852124 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2258) AUTHORS Volpe,J.P.G., McDowell,M., Jostes,R.F., Afzal,V., Sadinski,W., Trask,B.J., Legerski,R. and Cleaver,J.E. TITLE Complementation of chromosomal instability in the xeroderma pigmentosum variant by a gene on human chromosome 1 with homology to S-adenosyl homocysteine hydrolase JOURNAL Unpublished REFERENCE 2 (bases 1 to 2258) AUTHORS Volpe,J.P.G., McDowell,M. and Cleaver,J.E. TITLE Direct Submission JOURNAL Submitted (19-DEC-1996) Dermatology, UCSF, 3rd and Parnassus, Box 0750, San Francisco, CA 94143, USA FEATURES Location/Qualifiers source 1..2258 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /tissue_type="skin" /cell_type="fibroblasts" gene 1..2258 /gene="XPVkona" CDS 48..1550 /gene="XPVkona" /note="xeroderma pigmentosum variant" /codon_start=1 /product="S-adenosyl homocysteine hydrolase homolog" /db_xref="PID:g2852125" /translation="MATVTKAPKKQIQFADDMQEFTKFPTKTGRRSLSRSISQSSTDS YSSAASYTDSSDDEVSPREKQQTNSKGSSNFCVKNIKQAEFGRREIEIAEQDMSALIS LRKRAQGEKPLAGAKIVGCTHITAQTAVLIETLCALGAQCRWSACNIYSTQNEVAAAL AEAGVAVFAWKGESEDDFWWCIDRCVNMDGWQANMILDDGGDLTHWVYKKYPNVFKKI RGIVEESVTGVHRLYQLSKAGKLCVPAMNVNDSVTKQKFDNLYCCRESILDGLKRTTD VMFGGKQVVVCGYGEVGKGCCAALKALGAIVYITEIDPICALQACMDGFRVVKLNEVI RQVDVVITCTGNKNVVTREHLDRMKNSCIVCNMGHSNTEIDVTSLRTPELTWERVRSQ VDHVIWPDGKRVVLLAEGRLLNLSCSTVPTFVLSITATTQALALIELYNAPEGRYKQD VYLLPKKMDEYVASLHLPSFDAHLTELTDDQAKYLGLNKNGPFKPNYYRY" BASE COUNT 603 a 513 c 542 g 600 t ORIGIN 1 agctgaagca ggccaaggag atcgaggacg ccgagaagta ctccttcatg gccaccgtca 61 ccaaggcgcc caagaagcaa atccagtttg ctgatgacat gcaggagttc accaaattcc 121 ccaccaaaac tggccgaaga tctttgtctc gctcgatctc acagtcctcc actgacagct 181 acagttcagc tgcatcctac acagatagct ctgatgatga ggtttctccc cgagagaagc 241 agcaaaccaa ctccaagggc agcagcaatt tctgtgtgaa gaacatcaag caggcagaat 301 ttggacgccg ggagattgag attgcagagc aagacatgtc tgctctgatt tcactcagga 361 aacgtgctca gggggagaag cccttggctg gtgctaaaat agtgggctgt acacacatca 421 cagcccagac agcggtgttg attgagacac tctgtgccct gggggctcag tgccgctggt 481 ctgcttgtaa catctactca actcagaatg aagtagctgc agcactggct gaggctggag 541 ttgcagtgtt cgcttggaag ggcgagtcag aagatgactt ctggtggtgt attgaccgct 601 gtgtgaacat ggatgggtgg caggccaaca tgatcctgga tgatggggga gacttaaccc 661 actgggttta taagaagtat ccaaacgtgt ttaagaagat ccgaggcatt gtggaagaga 721 gcgtgactgg tgttcacagg ctgtatcagc tctccaaagc tgggaagctc tgtgttccgg 781 ccatgaacgt caatgattct gttaccaaac agaagtttga taacttgtac tgctgccgag 841 aatccatttt ggatggcctg aagaggacca cagatgtgat gtttggtggg aaacaagtgg 901 tggtgtgtgg ctatggtgag gtaggcaagg gctgctgtgc tgctctcaaa gctcttggag 961 caattgtcta cattaccgaa atcgacccca tctgtgctct gcaggcctgc atggatgggt 1021 tcagggtggt aaagctaaat gaagtcatcc ggcaagtcga tgtcgtaata acttgcacag 1081 gaaataagaa tgtagtgaca cgggagcact tggatcgcat gaaaaacagt tgtatcgtat 1141 gcaatatggg ccactccaac acagaaatcg atgtgaccag cctccgcact ccggagctga 1201 cgtgggagcg agtacgttct caggtggacc atgtcatctg gccagatggc aaacgagttg 1261 tcctcctggc agagggtcgt ctactcaatt tgagctgctc cacagttccc acctttgttc 1321 tgtccatcac agccacaaca caggctttgg cactgataga actctataat gcacccgagg 1381 ggcgatacaa gcaggatgtg tacttgcttc ctaagaaaat ggatgaatac gttgccagct 1441 tgcatctgcc atcatttgat gcccacctta cagagctgac agatgaccaa gcaaaatatc 1501 tgggactcaa caaaaatggg ccattcaaac ctaattatta cagatactaa tggaccatac 1561 taccaaggac cagtccacct gaaccacaca ctctaaagaa atatttttta agataacttt 1621 tattttcttc ttactccttt cctcttgatt tttttcctat aatttcattc ttgttttttc 1681 atctcattat ccaagttctg cagaccacac aggaacttgc ttcatggctc tttagatgaa 1741 atagaagttc agggttcctc actctagtca ctaaagaagg attttactct cccagcccag 1801 aaaggtgatt ctttctttac catttctggg gactttagtc ttaattaggt accttattaa 1861 caggaaatgc taaggtacct tctctgtgga acaatctgca atgtctaaat cgccttaaaa 1921 gagcccattt cttagctgct gaaatcagtg ctctttcact tcttcagaga agcagggatg 1981 gtacctaccc ggcaggtagg ttagatgtgg gtggtgcatg ttaatttccc ttagaagttc 2041 caagccctgt ttcctgcgta aaggtggtat gtccagttca gagatgtgat aatgagcatg 2101 gcttgttaag atcaggaggc ccacttggat ttatagtata gcccttcctc gactcccacc 2161 agacttgctc atttttcgag tttttaacta gactacactg tattgagttt aattttgtcc 2221 tctaggattt atttctgttg tccaaaaaaa aaaaaaaa // LOCUS HSU82811 747 bp mRNA PRI 07-DEC-1997 DEFINITION Human homeodomain-containing protein (HANF) mRNA, complete cds. ACCESSION U82811 NID g2662410 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 747) AUTHORS Kazanskaya,O.V., Severtzova,E.A., Barth,K.A., Ermakova,G.V., Lukyanov,S.A., Benyumov,A.O., Pannese,M., Boncinelli,E., Wilson,S.W. and Zaraisky,A.G. TITLE Anf: a novel class of vertebrate homeobox genes expressed at the anterior end of the main embryonic axis JOURNAL Gene 200 (1-2), 25-34 (1997) MEDLINE 98038973 REFERENCE 2 (bases 1 to 747) AUTHORS Zaraisky,A.G., Kazanskaya,O.V. and Ermakova,G.V. TITLE Direct Submission JOURNAL Submitted (20-DEC-1996) Group of Molecular Basis of Development, Shemyakin and Ovchinnicov Institute of Bioorganic Chemistry, Miklukho-Maklaya 16/10, Moscow 117871, Russia FEATURES Location/Qualifiers source 1..747 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="undifferentiated teratocarcinoma" gene 1..747 /gene="HANF" CDS 39..596 /gene="HANF" /note="homeodomain-containing protein; DNA-binding protein" /codon_start=1 /product="HANF" /db_xref="PID:g2662411" /translation="MSPSLQEGAQLGENKPSTCSFSIERILGLDQKKDCVPLMKPHRP WADTCSSSGKDGNLCLHVPNPPSGISFPSVVDHPMPEERASKYENYFSASERLSLKRE LSWYRGRRPRTAFTQNQIEVLENVFRVNCYPGIDIREDLAQKLNLEEDRIQIWFQNRR AKLKRSHRESQFLMAKKNFNTNLLE" BASE COUNT 246 a 157 c 169 g 175 t ORIGIN 1 ggaggccaga gctgttgctc tgtgcagacc acgagaggat gtctcccagc cttcaggaag 61 gcgctcagct cggggaaaac aaaccctcaa cttgctcctt ttcaattgag agaatcttag 121 gactggacca gaagaaagac tgtgttccat taatgaaacc ccacaggccc tgggcagaca 181 cctgcagctc atcagggaaa gatggtaact tatgtctaca tgtcccaaat cctcccagtg 241 ggatttcatt ccctagcgtg gtggatcacc caatgccaga agaaagagct tcgaaatatg 301 aaaattactt ttcagcctca gaaagactgt ctttgaaaag agagttgagt tggtatagag 361 gccgaagacc aagaactgct tttactcaaa accagattga agtgttagaa aatgtcttta 421 gagtaaactg ctatcctggt atcgatatta gagaagactt agctcaaaaa ttgaatctag 481 aggaagacag aatccagatt tggtttcaaa atcggcgtgc aaaactgaaa aggtcccata 541 gagaatcaca gtttctaatg gcgaaaaaaa atttcaacac aaatctgctg gaatagatag 601 accacattgg atgatttaac attacgaatg gtgtttaact taatatctcc agcacttacc 661 agggtactag acacagaata ggtaatgaaa atgcgaagag gctgggcaca gtggctcaca 721 cctaaatccc agcactttgg gaggacc // LOCUS HSU82812 2181 bp mRNA PRI 20-DEC-1997 DEFINITION Human scavenger receptor cysteine rich Sp alpha mRNA, complete cds. ACCESSION U82812 NID g2702313 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2181) AUTHORS Gebe,J.A., Kiener,P.A. , Ring,H.Z., Li,X., Francke,U. and Aruffo,A. TITLE Molecular Cloning, Mapping to Human Chromosome 1q21-q23 and Cell Binding Characteristics of Sp alpha: A New Member of the SRCR Family of Proteins JOURNAL J. Biol. Chem. (1997) In press REFERENCE 2 (bases 1 to 2181) AUTHORS Gebe,J.A. and Aruffo,A. TITLE Direct Submission JOURNAL Submitted (19-DEC-1996) Inflammation, Bristol-Myers Squibb, 3005 First Ave., Seattle, WA 98121, USA FEATURES Location/Qualifiers source 1..2181 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="spleen" /chromosome="1" /map="1q21-q23" CDS 61..1104 /note="secreted protein; contains three scavenger receptor cysteine rich (SRCR) domains; group B member; similar to CD6, CD5, M130, and WC1" /codon_start=1 /product="Sp alpha" /db_xref="PID:g2702314" /translation="MALLFSLILAICTRPGFLASPSGVRLVGGLHRCEGRVEVEQKGQ WGTVCDDGWDIKDVAVLCRELGCGAASGTPSGILYEPPAEKEQKVLIQSVSCTGTEDT LAQCEQEEVYDCSHDEDAGASCENPESSFSPVPEGVRLADGPGHCKGRVEVKHQNQWY TVCQTGWSLRAAKVVCRQLGCGRAVLTQKRCNKHAYGRKPIWLSQMSCSGREATLQDC PSGPWGKNTCNHDEDTWVECEDPFDLRLVGGDNLCSGRLEVLHKGVWGSVCDDNWGEK EDQVVCKQLGCGKSLSPSFRDRKCYGPGVGRIWLDNVRCSGEEQSLEQCQHRFWGFHD CTHQEDVAVICSG" BASE COUNT 560 a 501 c 562 g 558 t ORIGIN 1 ctgcttgggg acctccttct agccttaaat ttcagctcat caccttcacc tgccttggtc 61 atggctctgc tattctcctt gatccttgcc atttgcacca gacctggatt cctagcgtct 121 ccatctggag tgcggctggt ggggggcctc caccgctgtg aagggcgggt ggaggtggaa 181 cagaaaggcc agtggggcac cgtgtgtgat gacggctggg acattaagga cgtggctgtg 241 ttgtgccggg agctgggctg tggagctgcc agcggaaccc ctagtggtat tttgtatgag 301 ccaccagcag aaaaagagca aaaggtcctc atccaatcag tcagttgcac aggaacagaa 361 gatacattgg ctcagtgtga gcaagaagaa gtttatgatt gttcacatga tgaagatgct 421 ggggcatcgt gtgagaaccc agagagctct ttctccccag tcccagaggg tgtcaggctg 481 gctgacggcc ctgggcattg caagggacgc gtggaagtga agcaccagaa ccagtggtat 541 accgtgtgcc agacaggctg gagcctccgg gccgcaaagg tggtgtgccg gcagctggga 601 tgtgggaggg ctgtactgac tcaaaaacgc tgcaacaagc atgcctatgg ccgaaaaccc 661 atctggctga gccagatgtc atgctcagga cgagaagcaa cccttcagga ttgcccttct 721 gggccttggg ggaagaacac ctgcaaccat gatgaagaca cgtgggtcga atgtgaagat 781 ccctttgact tgagactagt aggaggagac aacctctgct ctgggcgact ggaggtgctg 841 cacaagggcg tatggggctc tgtctgtgat gacaactggg gagaaaagga ggaccaggtg 901 gtatgcaagc aactgggctg tgggaagtcc ctctctccct ccttcagaga ccggaaatgc 961 tatggccctg gggttggccg catctggctg gataatgttc gttgctcagg ggaggagcag 1021 tccctggagc agtgccagca cagattttgg gggtttcacg actgcaccca ccaggaagat 1081 gtggctgtca tctgctcagg atagtatcct ggtgttgctt gacctggccc ccctggcccc 1141 gcctgccctc tgcttgttct cctgagccct gattatcctc atactcattc tggggctcag 1201 gcttgagcca ctactccctc atcccctcag gagtctgaac actgggctta tgccttactc 1261 tcagggacaa gcagccccct ttgctgcctg tagatgtgag ctgttgagtt ccctcttgct 1321 ggggaagatg agcttccatg tatcctgtgc tcaaccctga ccctttgaca ctggttctgg 1381 cctttcctgc cttttctcaa gctgcctgga atcctcaaac ctgtcacttt ggtcagatgt 1441 gcagaccatt actaaggtct atgtctgcaa acattactaa tctaggtcct attactaatc 1501 tatgtctgca aacattaaag gaatgaaaca atgaaaggaa catttgaaag aaaatgtggg 1561 tagacaattt cttgcaactt gggggaaagt ttagaattct tttgattgga ctactttttt 1621 ttttttcctc aagcttcagg tgaccacaat agcaacacct ccctattctg ttatttctta 1681 gtgtaggtag acaattcttt caggagcaga gcagcgtcct ataatcctag accttttcat 1741 gacgtgtaaa aaatgatgtt tcatcctctg attgccccaa taaaaatctt tgttgtccat 1801 ccctatacaa cctgccaaca tggttgacat ttaatgagag gaatgtcaaa aatacatttt 1861 actttattca aagaaaaata tattggttac tgggaaaagg tcaagaaaga ggcagaaaga 1921 gatcagggag ggctaaagtt gtgtcttatg ccaagcggaa gtggaaaata tcacttttca 1981 ctttatcaac tgagactttg gggcctgtaa gcttgaggca agacagaaat aagagaatca 2041 agacttgatt gtaaaaattg acaactttag attctgaggc taggctgagt acttattata 2101 cggctacatt tacacattta cacttatcta ataaatcaga tttcacagtc tcaaaaaaaa 2161 aaaaagaaaa aaaaaaaaaa a // LOCUS HSU82938 1034 bp mRNA PRI 02-JUL-1997 DEFINITION Human CD27BP (Siva) mRNA, complete cds. ACCESSION U82938 NID g2228596 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1034) AUTHORS Prasad,K.V., Ao,Z., Yoon,Y., Wu,M.X., Rizk,M., Jacquot,S. and Schlossman,S.F. TITLE CD27, a member of the tumor necrosis factor receptor family, induces apoptosis and binds to Siva, a proapoptotic protein JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (12), 6346-6351 (1997) MEDLINE 97322375 REFERENCE 2 (bases 1 to 1034) AUTHORS Prasad,K.V.S., Ao,Z., Yoon,Y. and Schlossman,S.F. TITLE Direct Submission JOURNAL Submitted (20-DEC-1996) Tumor Immunology, Dana Farber Cancer Institute, 44 Binney Street, Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..1034 /organism="Homo sapiens" /note="cDNA sequence was obtained from a yeast two hybrid system" /db_xref="taxon:9606" /cell_line="HeLa" /tissue_type="thymus" gene 253..822 /gene="Siva" CDS 253..822 /gene="Siva" /function="binds to CD27" /function="apoptosis" /codon_start=1 /product="CD27BP" /db_xref="PID:g2228597" /translation="MRRPGSCVAPGPAAMPKRSCPFADVAPLQLKVRVSQRELSRGVC AERYSQEVFEKTKRLLFLGAQAYLDHVWDEGCAVVHLPESPKPGPTGAPRAARGQMLI GPDGRLIRSLGQASEADPSGVASIACSSCVRAVDGKAVCGQCERALCGQCVRTCWGCG SVACTLCGLVDCSDMYEKVLCTSCAMFET" BASE COUNT 208 a 295 c 332 g 199 t ORIGIN 1 tgctcacact gtatcccagc actttgggag gccgaggcag gcagattgcc tgaggtcagg 61 agttcaagac cagcctggcc aacatggcaa aaccctgtct ccactaaaaa tacaaaaatt 121 agccaagcgt ggtggcatgt gcctgtaatc ccagctactc aggaggctga ggcatgagaa 181 tctcttgaac cccagaggtg taggttgcag tgagcagaga ttgtgccact gcactccagc 241 ctgggcgaca gcatgaggcg gccggggagc tgcgtagctc ccggccccgc ggccatgccc 301 aagcggagct gccccttcgc ggacgtggcc ccgctacagc tcaaggtccg cgtgagccag 361 agggagttga gccgcggcgt gtgcgccgag cgctactcgc aggaggtctt cgagaagacc 421 aagcgactcc tgttcctcgg ggcccaggcc tacctggacc acgtgtggga tgaaggctgt 481 gccgtcgttc acctgccaga gtccccaaag cctggcccta caggggcccc gagggctgca 541 cgtgggcaga tgctgattgg accagacggc cgcctgatca ggagccttgg gcaggcctcc 601 gaagctgacc catctggggt agcgtccatt gcctgttcct catgcgtgcg agccgtggat 661 gggaaggcgg tctgcggtca gtgtgagcga gccctgtgcg ggcagtgtgt gcgcacctgc 721 tggggctgcg gctccgtggc ctgtaccctg tgtggcctcg tggactgcag tgacatgtac 781 gagaaagtgc tgtgcaccag ctgtgccatg ttcgagacct gaggctggct caagccggct 841 gccttcaccg ggagccacgc cgtgcatggc agccttccct ggacgagcgc tcggtgttca 901 cactgaactg tggggtcgac gggaggggtg ccttttacat gttctatttt gtatcctaat 961 gacagaatga ataaacctct ttatatttgc aaaaaaaaaa aaaaaaaact cgaggggggg 1021 cccggtaccc aatg // LOCUS HSU82939 3549 bp mRNA PRI 03-DEC-1997 DEFINITION Human NK-tumor recognition molecule-related protein mRNA, complete cds. ACCESSION U82939 NID g2656122 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3549) AUTHORS Zhou,R. and Ao,S.-z. TITLE Identification of a novel human NK-tumor recognition molecule-related protein JOURNAL Unpublished REFERENCE 2 (bases 1 to 3549) AUTHORS Zhou,R. and Ao,S.-z. TITLE Direct Submission JOURNAL Submitted (20-DEC-1996) State Key Laboratory of Molecular Biology, Shanghai Institute of Biochemistry, Academia Sinica, P.O. Box 52, Shanghai 200031, People's Republic of China FEATURES Location/Qualifiers source 1..3549 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 541..2988 /codon_start=1 /product="NK-tumor recognition molecule-related protein" /db_xref="PID:g2656123" /translation="MRQIAVRRPTTADERSLRKIQEQDIINFRRTLYRAGARVRNIED GGRYRDISAEFFRRNPACLHRLVPWLKRELTVLFGAHGSLVNIVQHIIMSNVTRYDLE SQAFVSDLRPFLLNRTEHFIHEFISFARSPFNMAAFDQHANYDCPAPSYEEGSHSDSS VITISPDEAETQELDINVATVSQAPWDDETPGPSYSSSEQVHVTMSSLLNTSDSSDEE LVTGGATSQIQGVQTNDDLNNDSDDSSDNCVIVGFVKPLAERTPELVELSSDSEDLGS YEKMETVKTQEQEQSYSSGDSDVSRCSSPHSVLGKDEQINKGHCDSSTRIKSKKEEKR STSLSSPRNLNSSVRGDRVYSPYNHRHRKRGRSRSSDSRSQSRSGHDQKNHRKHHGKK RMKSKRSRSRESSRPRGRRDKKRSRTRDSSWSRRSQTLSLSSESTSRSRSRSSDHGKR RSRSRNRDRYYLRNNYGSRYKWEYTYYSRNKDRDGYESSYRRRTLSRAHYSRQSSSPE FRVQSFSERTNARKKNNHSERKYYYYERHRSRSLSSNRSRTASTGTDRVRNEKPGGKR KYKTRHLEGTNEVAQPSREFASKAKDSHYQKSSSKLDGNYKNESDTFSDSRSSDRETK HKRRKRKTRSLSVEIVYEGKATDTTKHHKKKKKKHKKKHKKHHGDNASRSPVVITIDS DSDKDSEVKEDTECDNSGPQDPLQNEFLAPSLEPFETKDVVTIEAEFGVLDKECDIAT LSNNLNNANKTVDNIPPLAASVEQTLDVREESTFVSDLENQPSNIVSLQTEPSRQLPS PRTSLMSVCLGRDCDMS" BASE COUNT 1193 a 640 c 742 g 973 t 1 others ORIGIN 1 gagctcgcgg cgagcgcccc agccaggcct gcgccggcat cctccgagat aatggcatca 61 gctgctaagg aatttaaaat ggacaacttt tcacctaaag ctggcactag caaattgcaa 121 cagacagtac cagctgatgc atctcctgat tctaagtgtc ctatatgctt ggatagattt 181 gataatgtgt cttacttaga tcgctgctta cataagttct gaaaacgctg tgtacaggag 241 tggtcaaaaa acaaagctga atgcccacta tgtaaacagc cctttgattc tattttccat 301 tctgtgaggg cagaagatga cttcaaggag tatgtcctaa ggccttcgta taatggttct 361 tttgtcaccc ctgatcgacg atttcgctac cgtacaactc tgacaaggga acgaaatgct 421 tctgtgtatt cacctagtgg tcctgtgaac agaagaacaa caactccacc ggatagtgga 481 gtactgtttg aagggttagg catttcaaca agacctagag atgttgaaat tcctcagttt 541 atgagacaga ttgcagtaag gaggccaact acggcagatg aaagatcttt gcggaaaatt 601 caagaacaag atattattaa ttttagacga actctttatc gtgctggtgc tcgagttaga 661 aatattgaag atggtggccg ctacagggat atttcagctg aatttttccg tagaaatcca 721 gcttgccttc acagattagt cccctggtta aaacgtgaac ttacagttct ttttggagct 781 catggatctt tagtgaatat tgtccagcat attatcatga gtaatgttac tcgctatgac 841 ttggagagtc aggcatttgt gtctgattta agaccatttt tacttaatcg aactgagcat 901 tttatacatg aatttatcag ttttgcccga tctcctttta acatggcagc ctttgaccag 961 catgccaatt atgattgccc tgctccttca tacgaagaag gcagccattc tgattcttca 1021 gtcataacaa tatctccaga tgaggctgag acccaagagc tggatattaa tgtagccact 1081 gttagtcagg caccatggga tgatgaaact ccaggaccat cttactcaag ctcagagcag 1141 gtacacgtta ctatgtcttc tcttttaaat acttctgaca gttcagatga agaacttgtc 1201 acaggaggag ccacgtctca gatacaagga gtacaaacca atgacgacct aaataatgac 1261 agtgatgatt cttcagataa ttgtgtcatt gttgggtttg ttaaaccact agctgagagg 1321 accccagaac ttgttgaact gtcctctgat tctgaggact taggttctta tgagaaaatg 1381 gagacagtga agacacaaga acaggagcaa tcttacagtt ctggtgatag cgatgttagt 1441 agatgctcat ctccacactc tgtccttgga aaggatgaac aaataaataa aggtcattgt 1501 gattctagta caagaatcaa atcaaagaag gaagagaaac gatctacatc attgtcatct 1561 cccagaaacc tgaactcatc tgtaagagga gacagagtat attctccata taaccataga 1621 cacagaaaga ggggaagatc aagaagttca gattcacgtt ctcagagtag aagtgggcat 1681 gatcagaaga atcatagaaa gcatcatggg aagaaaagaa tgaaaagtaa acgatccaga 1741 agcagggaaa gtagcagacc tagagggaga agagacaaaa agagatcaag aactagagat 1801 agcagttggt ccagaagaag ccaaactctg tctctaagta gtgaaagcac aagcagatca 1861 aggtctcgta gcagtgatca tggtaaaaga agatcacgga gcagaaatag agatcgttat 1921 tatttaagaa ataattatgg aagcagatac aaatgggagt atacttatta cagtagaaac 1981 aaggacaggg atgggtacga atcatcttac aggaggagga ctctgtccag agctcattat 2041 tctagacagt cttcaagtcc agaatttaga gttcagtcct tttctgaaag aacaaatgct 2101 aggaaaaaaa ataatcacag tgagaggaag tattactact atgaaaggca cagatcaagg 2161 agcctgtcta gtaacagatc aaggactgca tctaccggga ctgaccgggt gagaaatgaa 2221 aagcctggag ggaaacgaaa atacaaaaca cggcatttgg agggtactaa cgaagtggct 2281 cagccatctc gtgaatttgc ttctaaagca aaggacagtc attaccaaaa atcttcatca 2341 aaattggatg gaaactacaa aaatgagagt gatacctttt cagacagccg atcatcagac 2401 agagagacaa aacacaaaag gagaaaaagg aagacccgga gcctaagtgt agagatagtt 2461 tatgaaggaa aagctactga tacaactaaa caccataaaa agaaaaagaa gaaacataag 2521 aagaagcata agaaacacca tggagataat gcttcacgtt ccccagttgt aattaccatt 2581 gacagtgaca gtgataagga ttctgaagta aaggaggata cagaatgtga caatagtggt 2641 cctcaagacc ctctacaaaa tgagtttttg gctccttcct tggaaccatt tgaaactaaa 2701 gatgtagtta caatagaagc tgaatttggt gtgctggaca aggaatgtga tattgccaca 2761 cttagtaaca acttgaataa tgccaacaaa actgtagata atattccacc tctggcagct 2821 tcagttgaac aaactctcga tgtaagagaa gagagcacct ttgtttctga tttggagaac 2881 cagcccagta acattgtgtc tcttcaaact gagccatcaa ggcaattgcc atcgccacgg 2941 acatcattaa tgtcagtatg tcttggtaga gactgtgata tgtcttaaaa ctgccaaagc 3001 atttcattga gaattatgat gttataaaaa ggaaaaagga agaatgtcgt ctactgcagt 3061 ctatttaaag atgacatttg gtgaaaactc tcttcctcct tacaatattt tnaatgattt 3121 tttttttggt gttaatttgt aaaaatcatt atttgttcaa aatgtatgtc ccaccctcaa 3181 agatatgcac ttttaagtga agaaaatgat actgctagca gcttatttaa aacttggggt 3241 cctttttaaa taagaaaaat tatataaatt ttagaagtta tttcataaag ccatacggta 3301 ttgacatatt tttaaggtag tcaatgagta tttttgaatt tttttttttt gagagttatt 3361 ctggaaatgt gttataagct aggagaatcc ctttggacag tctttatttt tcttcttaaa 3421 aatttatatg attcaaaacc atttcttcag gttaaattga ggcattttaa tctgcacagt 3481 ttatcttctg ccaaaataaa aatttactat ttcctttata tacttgttta tctggactgc 3541 cagtaaaac // LOCUS HSU82972 2076 bp mRNA PRI 25-MAY-1997 DEFINITION Human interleukin-16 mRNA, complete cds. ACCESSION U82972 NID g2114409 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2076) AUTHORS Baier,M., Bannert,N., Werner,A., Lang,K. and Kurth,R. TITLE Molecular cloning, sequence, expression, and processing of the interleukin 16 precursor JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (10), 5273-5277 (1997) MEDLINE 97289756 REFERENCE 2 (bases 1 to 2076) AUTHORS Baier,M., Bannert,N. and Kurth,R. TITLE Direct Submission JOURNAL Submitted (22-DEC-1996) AIDS Research, Paul-Ehrlich-Institute, Paul-Ehrlich-Strasse 51-59, Langen 63225, Germany FEATURES Location/Qualifiers source 1..2076 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="15" 5'UTR 1..180 CDS 181..2076 /codon_start=1 /product="interleukin-16" /db_xref="PID:g2114410" /translation="MDYSFDTTAEDPWVRISDCIKNLFSPIMSENHGHMPLQPNASLN EEEGTQGHPDGTPPKLDTANGTPKVYKSADSSTVKKGPPVAPKPAWFRQSLKGLRNRA SDPRGLPDPALSTQPAPASREHLGSHIRASSSSSSIRQRISSFETFGSSQLPDKGAQR LSLQPSSGEAAKPLGKHEEGRFSGLLGRGAAPTLVPQQPEQVLSSGSPAASEARDPGV SESPPPRRQPNQKTLPPGPDPLLRLLSTQAEESQGPVLKMPSQRARSFPLTRSQSCET KLLDEKTSKLYSISSQVSSAVMKSLLCLPSSISCAQTPCIPKEGASPTSSSNEDSAAN GSAETSALDTGFSLNLSELREYTEGLTEAKEDDDGDHSSLQSGQSVISLLSSEELKKL IEEVKVLDEATLKQLDGIHVTILHKEEGAGLGFSLAGGADLENKVITVHRVFPNGLAS QEGTIQKGNEVLSINGKSLKGTTHHDALAILRQAREPRQAVIVTRKLTPEAMPDLNSS TDSAASASAASDVSVESTAEATVCTVTLEKMSAGLGFSLEGGKGSLHGDKPLTINRIF KGAASEQSETVQPGDEILQLGGTAMQGLTRFEAWNIIKALPDGPVTIVIRRKSLQSKE TTAAGDS" BASE COUNT 534 a 619 c 531 g 392 t ORIGIN 1 ctgctgctac cacaggaaga cacagcaggg agaagcccta gtgcctctgc cggctgccca 61 ggacctggta tcggcccaca gaccaagtcc tccacagagg gcgagccagg gtggagaaga 121 gccagcccag tgacccaaac atccccgata aaacacccac tgcttaagag gcaggctcgg 181 atggactata gctttgatac cacagccgaa gacccttggg ttaggatttc tgactgcatc 241 aaaaacttat ttagccccat catgagtgag aaccatggcc acatgcctct acagcccaat 301 gccagcctga atgaagaaga agggacacag ggccacccag atgggacccc accaaagctg 361 gacaccgcca atggcactcc caaagtttac aagtcagcag acagcagcac tgtgaagaaa 421 ggtcctcctg tggctcccaa gccagcctgg tttcgccaaa gcttgaaagg tttgaggaat 481 cgtgcttcag acccaagagg gctccctgat cctgccttgt ccacccagcc agcacctgct 541 tccagggagc acctaggatc acacatccgg gcctcctcct cctcctcctc catcaggcag 601 agaatcagct cctttgaaac ctttggctcc tctcaactgc ctgacaaagg agcccagaga 661 ctgagcctcc agccctcctc cggggaggca gcaaaacctc ttgggaagca tgaggaagga 721 cggttttctg gactcttggg gcgaggggct gcacccactc ttgtgcccca gcagcctgag 781 caagtactgt cctcggggtc ccctgcagcc tccgaggcca gagacccagg cgtgtctgag 841 tcccctcccc caaggcggca gcccaatcag aaaactctcc cccctggccc ggacccgctc 901 ctaaggctgc tgtcaacaca ggctgaggaa tctcaaggcc cagtgctcaa gatgcctagc 961 cagcgagcac ggagcttccc cctgaccagg tcccagtcct gtgagacgaa gctacttgac 1021 gaaaagacca gcaaactcta ttctatcagc agccaagtgt catcggctgt catgaaatcc 1081 ttgctgtgcc ttccatcttc tatctcctgt gcccagactc cctgcatccc caaggaaggg 1141 gcatctccaa catcatcatc caacgaagac tcagctgcaa atggttctgc tgaaacatct 1201 gccttggaca cagggttctc gctcaacctt tcagagctga gagaatatac agagggtctc 1261 acggaagcca aggaagacga tgatggggac cacagttccc ttcagtctgg tcagtccgtt 1321 atctccctgc tgagctcaga agaattaaaa aaactcatcg aggaggtgaa ggttctggat 1381 gaagcaacat taaagcaatt agacggcatc catgtcacca tcttacacaa ggaggaaggt 1441 gctggtcttg ggttcagctt ggcaggagga gcagatctag aaaacaaggt gattacggtt 1501 cacagagtgt ttccaaatgg gctggcctcc caggaaggga ctattcagaa gggcaatgag 1561 gttctttcca tcaacggcaa gtctctcaag gggaccacgc accatgatgc cttggcaatc 1621 ctccgccaag ctcgagagcc caggcaagct gtgattgtca caaggaagct gactccagag 1681 gccatgcctg acctcaactc ctccactgac tctgcagcct cagcctctgc agccagtgat 1741 gtttctgtag aatctacagc agaggccaca gtctgcacgg tgacactgga gaagatgtcg 1801 gcagggctgg gcttcagcct ggaaggaggg aagggctccc tacacggaga caagcctctc 1861 accattaaca ggattttcaa aggagcagcc tcagaacaaa gtgagacagt ccagcctgga 1921 gatgaaatct tgcagctggg tggcactgcc atgcagggcc tcacacggtt tgaagcctgg 1981 aacatcatca aggcactgcc tgatggacct gtcacgattg tcatcaggag aaaaagcctc 2041 cagtccaagg aaaccacagc tgctggagac tcctag // LOCUS HSU82988 1040 bp mRNA PRI 15-NOV-1997 DEFINITION Human leukocyte antigen CD84 mRNA, complete cds. ACCESSION U82988 NID g2618739 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1040) AUTHORS de la Fuente,M.A., Pizcueta,P., Nadal,M., Bosch,J. and Engel,P. TITLE CD84 leukocyte antigen is a new member of the Ig superfamily JOURNAL Blood 90 (6), 2398-2405 (1997) MEDLINE 97454416 REFERENCE 2 (bases 1 to 1040) AUTHORS de la Fuente,M.A., Pizcueta,P. and Engel,P. TITLE Direct Submission JOURNAL Submitted (21-DEC-1996) Hepatology, Fundacio Clinic, Villarroel 170, Barcelona 08036, Spain FEATURES Location/Qualifiers source 1..1040 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Raji; B cell line" /chromosome="1" /map="1q24" CDS 42..1028 /note="member of immunoglobulin superfamily" /codon_start=1 /product="leukocyte antigen CD84" /db_xref="PID:g2618740" /translation="MAQHHLWILLLCLQTWPEAAGKDSEIFTVNGILGESVTFPVNIQ EPRQVKIIAWTSKTSVAYVTPGDSETAPVVTVTHRNYYERIHALGPNYNLVISDLRME DAGDYKADINTQADPYTTTKRYNLQIYRRLGKPKITQSLMASVNSTCNVTLTCSVEKE EKNVTYNWSPLGEEGNVLQIFQTPEDQELTYTCTAQNPVSNNSDSISARQLCADIAMG FRTHHTGLLSVLAMFFLLVLILSSVFLFRLFKRRQDAASKKTIYTYIMASRNTQPAES RIYDEILQSKVLPSKEEPVNTVYSEVQFADKMGKASTQDSKPPGTSSYEIVI" BASE COUNT 308 a 256 c 231 g 245 t ORIGIN 1 aattccggtg cttttccaca gaaggttaga ccctgaaaga gatggctcag caccacctat 61 ggatcttgct cctttgcctg caaacctggc cggaagcagc tggaaaagac tcagaaatct 121 tcacagtgaa tgggattctg ggagagtcag tcactttccc tgtaaatatc caagaaccac 181 ggcaagttaa aatcattgct tggacttcta aaacatctgt tgcttatgta acaccaggag 241 actcagaaac agcacccgta gttactgtga cccacagaaa ttattatgaa cggatacatg 301 ccttaggtcc gaactacaat ctggtcatta gcgatctgag gatggaagac gcaggagact 361 acaaagcaga cataaataca caggctgatc cctacaccac caccaagcgc tacaacctgc 421 aaatctatcg tcggcttggg aaaccaaaaa ttacacagag tttaatggca tctgtgaaca 481 gcacctgtaa tgtcacactg acatgctctg tagagaaaga agaaaagaat gtgacataca 541 attggagtcc cctgggagaa gagggtaatg tccttcaaat cttccagact cctgaggacc 601 aagagctgac ttacacgtgt acagcccaga accctgtcag caacaattct gactccatct 661 ctgcccggca gctctgtgca gacatcgcaa tgggcttccg tactcaccac accgggttgc 721 tgagcgtgct ggctatgttc tttctgcttg ttctcattct gtcttcagtg tttttgttcc 781 gtttgttcaa gagaagacaa gatgctgcct caaagaaaac catatacaca tatatcatgg 841 cttcaaggaa cacccagcca gcagagtcca gaatctatga tgaaatcctg cagtccaagg 901 tgcttccctc caaggaagag ccagtgaaca cagtttattc cgaagtgcag tttgctgata 961 agatggggaa agccagcaca caggacagta aacctcctgg gacttcaagc tatgaaattg 1021 tgatctaggc tgctgggctg // LOCUS HSU83171 2923 bp mRNA PRI 31-MAY-1997 DEFINITION Human macrophage-derived chemokine precursor (MDC) mRNA, complete cds. ACCESSION U83171 NID g1931580 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2923) AUTHORS Godiska,R., Chantry,D., Raport,C.J., Sozzani,S., Allavena,P., Leviten,D., Mantovani,A. and Gray,P.W. TITLE Human macrophage-derived chemokine (MDC), a novel chemoattractant for monocytes, monocyte-derived dendritic cells, and natural killer cells JOURNAL J. Exp. Med. 185 (9), 1595-1604 (1997) MEDLINE 97296313 REFERENCE 2 (bases 1 to 2923) AUTHORS Godiska,R. and Gray,P.W. TITLE Direct Submission JOURNAL Submitted (23-DEC-1996) ICOS Corporation, 22021 20th Avenue SE, Bothell, WA 98021, USA FEATURES Location/Qualifiers source 1..2923 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" gene 20..301 /gene="MDC" sig_peptide 20..91 /gene="MDC" CDS 20..301 /gene="MDC" /function="chemotactic for dendritic cells and natural killer cells" /codon_start=1 /product="macrophage-derived chemokine precursor" /db_xref="PID:g1931581" /translation="MARLQTALLVVLVLLAVALQATEAGPYGANMEDSVCCRDYVRYR LPLRVVKHFYWTSDSCPRPGVVLLTFRDKEICADPRVPWVKMILNKLSQ" mat_peptide 92..298 /gene="MDC" /product="macrophage-derived chemokine" repeat_region complement(1194..1805) /rpt_family="ALU" repeat_region complement(2335..2443) /rpt_family="ALU" BASE COUNT 605 a 861 c 669 g 788 t ORIGIN 1 gagacataca ggacagagca tggctcgcct acagactgca ctcctggttg tcctcgtcct 61 ccttgctgtg gcgcttcaag caactgaggc aggcccctac ggcgccaaca tggaagacag 121 cgtctgctgc cgtgattacg tccgttaccg tctgcccctg cgcgtggtga aacacttcta 181 ctggacctca gactcctgcc cgaggcctgg cgtggtgttg ctaaccttca gggataagga 241 gatctgtgcc gatcccagag tgccctgggt gaagatgatt ctcaataagc tgagccaatg 301 aagagcctac tctgatgacc gtggccttgg ctcctccagg aaggctcagg agccctacct 361 ccctgccatt atagctgctc cccgccagaa gcctgtgcca actctctgca ttccctgatc 421 tccatccctg tggctgtcac ccttggtcac ctccgtgctg tcactgccat ctcccccctg 481 acccctctaa cccatcctct gcctccctcc ctgcagtcag agggtcctgt tcccatcagc 541 gattcccctg cttaaaccct tccatgactc cccactgccc taagctgagg tcagtctccc 601 aagcctggca tgtggccctc tggatctggg ttccatctct gtctccagcc tgcccacttc 661 ccttcatgaa tgttgggttc tagctccctg ttctccaaac ccatactaca catcccactt 721 ctgggtcttt gcctgggatg ttgctgacac tcagaaagtc ccaccacctg cacatgtgta 781 gccccaccag ccctccaagg cattgctcgc ccaagcagct ggtaattcca tttcatgtat 841 tagatgtccc ctggccctct gtcccctctt aataacccta gtcacagtct ccgcagattc 901 ttgggatttg ggggttttct cccccacctc tccactagtt ggaccaaggt ttctagctaa 961 gttactctag tctccaagcc tctagcatag agcactgcag acaggccctg gctcagaatc 1021 agagcccaga aagtggctgc agacaaaatc aataaaacta atgtccctcc cctctccctg 1081 ccaaaaggca gttacatatc aatacagaga ctcaaggtca ctagaaatgg gccagctggg 1141 tcaatgtgaa gccccaaatt tgcccagatt cacctttctt cccccactcc cttttttttt 1201 tttttttttt tgagatggag tttcgctctt gtcacccacg ctggagtgca atggtgtggt 1261 cttggcttat tgaagcctct gcctcctggg ttcaagtgat tctcttgcct cagcctcctg 1321 agtagctggg attacaggtt cctgctacca cgcccagcta atttttgtat ttttagtaga 1381 gacgaggctt caccatgttg gccaggctgg tctcgaactc ctgtcctcag gtaatccgcc 1441 cacctcagcc tcccaaagtg ctgggattac aggcgtgagc cacagtgcct ggcctcttcc 1501 ctctccccac tgcccccccc aacttttttt ttttttttat ggcagggtct cactctgtcg 1561 cccaggctgg agtgcagtgg cgtgatctcg gctcactaca acctcgacct cctgggttca 1621 agtgattctc ccaccccagc ctcccaagta gctgggatta caggtgtgtg ccactacggc 1681 tggctaattt ttgtattttt agtagagaca ggtttcacca tattggccag gctggtcttg 1741 aactcctgac ctcaagtgat ccaccttcct tgtgctccca aagtgctgag attacaggcg 1801 tgagctatca cacccagcct cccccttttt ttcctaatag gagactcctg tacctttctt 1861 cgttttacct atgtgtcgtg tctgcttaca tttccttctc ccctcaggct ttttttgggt 1921 ggtcctccaa cctccaatac ccaggcctgg cctcttcaga gtacccccca ttccactttc 1981 cctgcctcct tccttaaata gctgacaatc aaattcatgc tatggtgtga aagactacct 2041 ttgacttggt attataagct ggagttatat atgtatttga aaacagagta aatacttaag 2101 aggccaaata gatgaatgga agaattttag gaactgtgag agggggacaa ggtgaagctt 2161 tcctggccct gggaggaagc tggctgtggt agcgtagcgc tctctctctc tgtctgtggc 2221 aggagccaaa gagtagggtg taattgagtg aaggaatcct gggtagagac cattctcagg 2281 tggttgggcc aggctaaaga ctgggagttg ggtctatcta tgcctttctg gctgattttt 2341 gtagagacgg ggttttgcca tgttacccag gctggtctca aactcctggg ctcaagcgat 2401 cctcctggct cagcctccca aagtgctggg attacaggcg tgaatcactg cgcctggctt 2461 cctcttcctc ttgagaaata ttcttttcat acagcaagta tgggacagca gtgtcccagg 2521 taaaggacat aaatgttaca agtgtctggt cctttctgag ggaggctggt gccgctctgc 2581 agggtatttg aacctgtgga attggaggag gccatttcac tccctgaacc cagcctgaca 2641 aatcacagtg agaatgttca ccttataggc ttgctgtggg gctcaggttg aaagtgtggg 2701 gagtgacact gcctaggcat ccagctcagt gtcatccagg gcctgtgtcc ctcccgaacc 2761 cagggtcaac ctgcctgcca caggcactag aaggacgaat ctgcctactg cccatgaacg 2821 gggccctcaa gcgtcctggg atctccttct ccctcctgtc ctgtccttgc ccctcaggac 2881 tgctggaaaa taaatccttt aaaatagtaa aaaaaaaaaa aaa // LOCUS HSU83246 1985 bp mRNA PRI 23-JAN-1997 DEFINITION Human copine I mRNA, complete cds. ACCESSION U83246 NID g1791256 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1985) AUTHORS Tomsig,J.L. and Creutz,C.E. TITLE Structure of the human copine I messenger RNA JOURNAL Unpublished REFERENCE 2 (bases 1 to 1985) AUTHORS Tomsig,J.L. and Creutz,C.E. TITLE Direct Submission JOURNAL Submitted (26-DEC-1996) Pharmacology, University of Virginia, 1300 Jefferson Park Ave., Charlottesville, VA 22908, USA FEATURES Location/Qualifiers source 1..1985 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 157..1770 /function="membrane-binding protein" /note="contains two C2-domains; calcium-dependent" /codon_start=1 /product="copine I" /db_xref="PID:g1791257" /translation="MAHCVTLVQLSISCDHLIDKDIGSKSDPLCVLLQDVGGGSWAEL GRTERVRNCSSPEFSKTLQLEYRFETVQKLRFGIYDIDNKTPELRDDDFLGGAECSLG QIVSSQVLTLPLMLKPGKPAGRGTITVSAQELKDNRVVTMEVEARNLDKKDFLGKSDP FLEFFRQGDGKWHLVYRSEVIKNNLNPTWKRFSVPVQHFCGGNPSTPIQVQCSDYDSD GSHDLIGTFHTSLAQLQAVPAEFECIHPEKQQKKKSYKNSGTIRVKICRVETEYSFLD YVMGGCQINFTVGVDFTGSNGDPSSPDSLHYLSPTGVNEYLMALWSVGSVVQDYDSDK LFPAFGFGAQVPPDWQVSHEFALNFNPSNPYCAGIQGIVDAYRQALPQVRLYGPTNFA PIINHVARFAAQAAHQGTASQYFMLLLLTDGAVTDVEATREAVVRASNLPMSVIIVGV GGADFEAMEQLDADGGPLHTRSGQAAARDIVQFVPYRRFQNAPREALAQTVLAEVPTQ LVSYFRAQGWAPLKPLPPSAKDPAQAPQA" BASE COUNT 465 a 538 c 511 g 470 t 1 others ORIGIN 1 accaggcaaa tattccattc agcattacaa agatggatgt tcttcagttc ctagaaggaa 61 tcccagtgga tgaaaatgct gtacatgttc ttgttgataa caatgggcaa ggtctaggac 121 aggcattggt tcagtttaaa aatgaagatg atgcacatgg cccactgcgt gaccttggtt 181 cagctgtcca tttcctgtga ccatctcatt gacaaggaca tcggctccaa gtctgaccca 241 ctctgcgtcc ttttacagga tgtgggaggg ggcagctggg ctgagcttgg ccggactgaa 301 cgggtgcgga actgctcaag ccctgagttc tccaagactc tacagcttga gtaccgcttt 361 gagacagtcc agaagctacg ctttggaatc tatgacatag acaacaagac gccagagctg 421 agggatgatg acttcctagg gggtgctgag tgttccctag gacagattgt gtccagccag 481 gtactgactc tccccttgat gctgaagcct ggaaaacctg ctgggcgggg gaccatcacg 541 gtctcagctc aggaattaaa ggacaatcgt gtagtaacca tggaggtaga ggccagaaac 601 ctagataaga aggacttcct gggaaaatca gatccatttc tggagttctt ccgccagggt 661 gatgggaaat ggcacctggt gtacagatct gaggtcatca agaacaacct gaaccctaca 721 tggaagcgtt tctcagtccc cgttcagcat ttctgtggtg ggaaccccag cacacccatc 781 caggtgcaat gctccgatta tgacagtgac gggtcacatg atctcatcgg taccttccac 841 accagcttgg cccagctgca ggcagtcccg gctgagtttg aatgcatcca ccctgagaag 901 cagcagaaaa agaaaagcta caagaactct ggaactatcc gtgtcaagat ttgtcgggta 961 gaaacagagt actcctttct ggactatgtg atgggaggct gtcagatcaa cttcactgtg 1021 ggcgtggact tcactggctc caatggagac ccctcctcac ctgactccct acactacctg 1081 agtccaacag gggtcaatga gtacctgatg gcactgtgga gtgtgggcag cgtggttcag 1141 gactatgact cagacaagct gttccctgca tttggatttg gggcccaggt tccccctgac 1201 tggcaggtct cgcatgaatt tgccttgaat ttcaacccca gtaaccccta ctgtgcaggc 1261 atccagggca ttgtggatgc ctaccgccaa gccctgcccc aagttcgcct ctatggccct 1321 accaactttg cacccatcat caaccatgtg gccaggtttg cagcccaggc tgcacatcag 1381 gggactgcct cgcaatactt catgctgttg ctgctgactg atggtgctgt gacggatgtg 1441 gaagccacac gtgaggctgt ggtgcgtgcc tcgaacctgc ccatgtcagt gatcattgtg 1501 ggtgtgggtg gtgctgactt tgaggccatg gagcagctgg acgctgatgg tggacccctg 1561 catacacgtt ctgggcaggc tgctgcccgc gacattgtgc agtttgtacc ctaccgccgg 1621 ttccagaatg cccctcggga ggcattggca cagaccgtgc tcgcagaagt gcccacacaa 1681 ctggtctcat acttcagggc ccagggttgg gccccgctca agccacttcc accctcagcc 1741 aaggatcctg cacaggcccc ccaggcctag gttcccttgg aggctgtggc aagtcctcaa 1801 tcctgtgtcc cagaggtccc tntgggccac aacccaaccc ttctcactct cctcagtgct 1861 agcactttgt attttttgat acttttatac ttgtttctgc ttttgctgct cttgatccca 1921 cctttgctcc tgacaaccct cattcaataa agaccagtga agaccaaaaa aaaaaaaaaa 1981 aaaaa // LOCUS HSU83410 2832 bp mRNA PRI 03-APR-1997 DEFINITION Human CUL-2 (cul-2) mRNA, complete cds. ACCESSION U83410 NID g1923242 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2832) AUTHORS Pause,A., Lee,S., Worrell,R.A., Chen,D.Y., Burgess,W.H., Linehan,W.M. and Klausner,R.D. TITLE The von Hippel-Lindau tumor-suppressor gene product forms a stable complex with human CUL-2, a member of the Cdc53 family of proteins JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (6), 2156-2161 (1997) MEDLINE 97225922 REFERENCE 2 (bases 1 to 2832) AUTHORS Pause,A. TITLE Direct Submission JOURNAL Submitted (30-DEC-1996) CBMB, NICHD, NIH, 18 Library Drive, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..2832 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" gene 147..2384 /gene="cul-2" CDS 147..2384 /gene="cul-2" /note="Cullin family member; Cdc53 homolog; VHL tumor suppressor protein binding protein" /codon_start=1 /product="CUL-2" /db_xref="PID:g1923243" /translation="MSLKPRVVDFDETWNKLLTTIKAVVMLEYVERATWNDRFSDIYA LCVAYPEPLGERLYTETKIFLENHVRHLHKRVLESEEQVLVMYHRYWEEYSKGADYMD CLYRYLSTQFIKKNKLTEADLQYGYGGVDMNEPLMEIGELALDMWRKLMVEPLQAILI RMLLREIKNDRGGEDPNQKVIHGVINSFVHVEQYKKKFPLKFYQEIFESPFLTETGEY YKQEASNLLQESNCSQYMEKVLGRLKDEEIRCRKYLHPSSYTKVIHECQQRMVADHLQ FLHAECHNIIRQEKKNDMANMYVLLRAVSTGLPHMIQELQNHIHDEGLRATSNLTQEN MPTLFVESVLEVHGKFVQLINTVLNGDQHFMSALDKALTSVVNYREPKSVCKAPELLA KYCDNLLKKSAKGMTENEVEDRLTSFITVFKYIDDKDVFQKFYARMLAKRLIHGLSMS MDSEEAMINKLKQACGYEFTSKLHRMYTDMSVSADLNNKFNNFIKNQDTVIDLGISFQ IYVLQAGAWPLTQAPSSTFAIPQELEKSVQMFELFYSQHFSGRKLTWLHYLCTGEVKM NYLGKPYVAMVTTYQMAVLLAFNNSETVSYKELQDSTQMNEKELTKTIKSLLDVKMIN HDSEKEDIDAESSFSLNMNFSSKRTKFKITTSMQKDTPQEMEQTRSAVDEDRKMYLQA AIVRIMKARKVLRHNALIQEVISQSRARFNPSISMIKKCIEVLIDKQYIERSQASADE YSYVA" BASE COUNT 935 a 534 c 576 g 787 t ORIGIN 1 gcgagctgac agccgccgcc gccgccgcct ccgcccacct tcctcgccgg ggcttcgtct 61 ttcactcctt cgggctgcct ccccctcccc ttgtcccctg ccccttgccc tgcttctgca 121 gaagatttca acactacact tgcacaatgt ctttgaaacc aagagtagta gattttgatg 181 aaacatggaa caaacttttg acgacaataa aagccgtggt catgttggaa tacgtcgaaa 241 gagcaacatg gaatgaccgt ttctcagata tctatgcttt atgtgtggcc tatcctgaac 301 cccttggaga aagactttat acagaaacta agattttttt ggaaaatcat gttcggcatt 361 tgcataagag agttttggag tcagaagaac aagtacttgt tatgtatcat aggtactggg 421 aagaatacag caagggtgca gactatatgg actgcttata taggtatctc agcacccagt 481 ttattaaaaa gaataaatta acagaagcgg accttcagta tggctatggt ggtgtagata 541 tgaatgaacc acttatggaa ataggagagc tagcattgga tatgtggagg aaattgatgg 601 ttgaaccact tcaggccatc cttatccgaa tgctgctccg agaaatcaaa aatgatcgtg 661 gtggagaaga cccaaaccag aaagtaatcc atggggttat taactccttt gttcatgttg 721 aacagtataa gaaaaaattc cccttaaagt tttatcagga aatttttgag tctccctttc 781 tgactgaaac aggagagtat tacaaacaag aagcttcaaa tttattacaa gaatcaaact 841 gctcacagta tatggaaaag gttttaggta gattaaaaga tgaagaaatt cgatgtcgaa 901 aatacctaca tccaagttca tatactaagg tgattcatga atgtcaacaa cgaatggtag 961 cagaccactt acagttttta catgcagaat gtcataatat aattcgacaa gagaaaaaaa 1021 atgacatggc aaatatgtac gtcttactcc gtgctgtgtc cactggttta cctcatatga 1081 ttcaggagct gcaaaaccac atccatgatg agggccttcg agcaaccagc aaccttactc 1141 aggaaaacat gccaacacta tttgtggagt cagttttgga agtgcatggt aaatttgttc 1201 agcttatcaa cactgttttg aatggtgatc agcattttat gagtgcgttg gataaggccc 1261 ttacgtcagt tgtaaattac agagaaccta agtctgtttg caaagcacct gaactgcttg 1321 ctaagtactg tgacaactta ctgaagaagt cagcgaaagg gatgacagag aatgaagtgg 1381 aagacaggct tacgagcttc atcacagtgt tcaaatacat tgatgacaag gacgtctttc 1441 aaaagttcta cgcaagaatg ctggcaaaac gtttaattca tgggttatcc atgtctatgg 1501 actctgaaga agccatgatc aacaaattaa agcaagcctg tggttatgag tttaccagca 1561 agctacatcg gatgtataca gatatgagtg tcagcgctga tctcaacaat aagttcaaca 1621 attttatcaa aaaccaagac acagtaatag atttgggaat tagttttcaa atatatgttc 1681 tacaggctgg tgcgtggcct cttactcagg ctccttcatc tacgtttgca attccccagg 1741 aattagaaaa aagtgtacag atgtttgaat tattttatag ccaacatttc agtggaagga 1801 aacttacatg gttacattat ctgtgtacag gtgaagttaa aatgaactat ttgggcaaac 1861 catatgtagc catggttaca acataccaaa tggcagttct tcttgccttt aacaacagtg 1921 aaactgtcag ttataaagag cttcaggaca gcactcagat gaatgaaaag gaactgacaa 1981 aaacaatcaa atcattactt gatgtgaaaa tgattaacca tgattcagaa aaggaagata 2041 ttgatgcaga atcttcgttt tcattaaata tgaactttag cagtaaaaga acaaaattta 2101 aaattactac atcaatgcag aaagacacac cacaagaaat ggagcagact agaagtgcag 2161 ttgatgagga ccggaaaatg tatctccaag ctgctatagt tcgtatcatg aaagcacgaa 2221 aagtgcttcg gcacaatgcc cttattcaag aggtgattag ccagtcaaga gctaggttta 2281 atcccagtat cagcatgatt aagaagtgta ttgaagttct gatagacaaa caatacatag 2341 aacgcagcca ggcgtcggca gatgaataca gctacgtcgc gtgatgtcgc tctcctccag 2401 cgtggtgtga gaagatcatt gccatcacca tttggtgtgt tcctgtggga aaaagcagga 2461 ctgtgcctcc ataatttggt catttggcag cccctgtttt ctgctgttta caacatcacc 2521 agtgccacgt catgagcgtc aaagaaaatg cctagagata tttcaagctc atgacattat 2581 gacatttctt aaaactttat taaaagaatg agtgaagtat tgctgaaaag tggaaaatcg 2641 gttgggtacc atgctttttc tccccttcac gtttgcagtt gatgtgtctt tttttttttt 2701 tttaatgtat cttaaaggac ataaaattta aaaacttaaa tattgtaata tgacagataa 2761 cctaataatt gtatctacat taaaatgaca aacatgatac tgctgcttgt caaataaaaa 2821 aaaaaaaaaa aa // LOCUS HSU83411 2100 bp mRNA PRI 06-JUN-1997 DEFINITION Homo sapiens carboxypeptidase Z precursor, mRNA, complete cds. ACCESSION U83411 NID g2160713 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2100) AUTHORS Song,L. and Fricker,L.D. TITLE Cloning and expression of human carboxypeptidase Z, a novel metallocarboxypeptidase JOURNAL J. Biol. Chem. 272 (16), 10543-10550 (1997) MEDLINE 97256770 REFERENCE 2 (bases 1 to 2100) AUTHORS Song,L. and Fricker,L.D. TITLE Direct Submission JOURNAL Submitted (30-DEC-1996) Mol. Pharmacol., Albert Einstein College of Medicine, 1300 Morris Park Ave, Bronx, NY 10461, USA FEATURES Location/Qualifiers source 1..2100 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 40..93 CDS 40..1965 /note="metallocarboxypeptidase with N-terminal frizzled domain" /codon_start=1 /product="carboxypeptidase Z precursor" /db_xref="PID:g2160714" /translation="MPPPPLLLLLTVLVVAAARPGCEFERNPAATCVDLQLRTCSDAA YNHTTFPNLLQHRSWEVVEASSEYILLSVLHQLLEGQCNPDLRLLGCAVLAPRCEGGW VRRPCRHICEGLREVCQPAFDAIDMAWPYFLDCHRYFTREDEGCYDPLEKLRGGLEAD EALPSGLPPTFIRFSHHSYAQMVRVLRRTASRCAHVARTYSIGRSFDGRELLVIEFSS RPGQHELMEPEVKLIGNIHGNEVAGREMLIYLAQYLCSEYLLGNPRIQRLLNTTRIHL LPSINPDGYEVAAAEGAGYNGWTSGRQNAQNLDLNRNFPDLTSEYYRLAETRGARSDH IPIPQHYWWGKVAPETKAIMKWMQTIPFVLSASLHGGDLVVSYPFDFSKHPQEEKMFS PTPDEKMFKLLSRAYADVHPMMMDRSENRCGGNFLKRGSIINGADWYSFTGGMSDFNY LHTNCFEITVELGCVKFPPEEALYTLWQHNKESLLNFVETVHRGIKGVVTDKFGKPVK NARISVKGIRHDITTAPDGDYWRLLPPGIHIVIAQAPGYAKVIKKVIIPARMKRAGRV DFILQPLGMGPKNFIHGLRRTGPHDPLGGASSLGEATEPDPLRARRQPSADGSKPWWW SYFTSLSTHRPRWLLKY" mat_peptide 94..1962 /product="carboxypeptidase Z" misc_feature 130..492 /note="encodes frizzled domain" misc_feature 541..1710 /note="encodes metallocarboxypeptidase domain" BASE COUNT 410 a 699 c 619 g 372 t ORIGIN 1 acatcactgc gctggccgtc caaggtccgc cgccccacca tgccgccccc gccgctgctg 61 ctgctcctta cagtcctggt cgtcgccgct gcccggccgg ggtgcgagtt tgagcggaac 121 cccgccgcca cctgcgtgga cctgcagctc aggacctgca gcgatgccgc ctacaaccac 181 accaccttcc ccaacctgct tcagcaccgg tcgtgggagg tggtggaggc cagctccgag 241 tacatcctgc tgagcgttct acaccagctc ctggaaggcc agtgcaaccc ggacctgcgg 301 ctgctgggct gtgctgtgct ggccccccgg tgtgagggcg gctgggtgcg cagaccctgc 361 cggcacatct gcgagggcct gcgggaggtc tgccagcccg ccttcgacgc cattgacatg 421 gcctggccct acttccttga ctgccaccgc tacttcacga gagaggacga gggctgctat 481 gacccgctgg agaagcttcg gggaggcctg gaggctgacg aggcactgcc ctcagggctg 541 ccgcccacct tcatccgctt cagccaccac tcctacgccc agatggtgcg tgtgctgagg 601 cggacggcct cccgctgtgc ccacgtggcc aggacctaca gcatcgggcg cagcttcgac 661 ggcagggagc tgctggtcat cgagttctcc agccgccccg gccagcacga gctgatggag 721 cccgaggtga agctcatcgg caacattcat ggcaacgagg tggcgggccg ggagatgctc 781 atctacctag cccagtacct gtgctctgag tacctgcttg gtaacccccg catccagcgc 841 ctgctcaaca ccacccgcat ccacctgctg ccctccatta accctgacgg ctatgaggtg 901 gcagctgccg agggtgccgg ctacaacggg tggacgagcg ggaggcagaa cgcgcagaac 961 ctggatctga accgaaattt cccggacctg acgtccgagt actaccggct ggcggagacc 1021 cgcggcgcac gcagcgacca catccccatc ccccagcact actggtgggg taaggtggcc 1081 ccggagacaa aggcaatcat gaagtggatg cagaccatac cctttgtgct ctcagccagc 1141 cttcatgggg gcgacctggt ggtgtcctac cccttcgact tctccaagca cccccaggag 1201 gagaagatgt tttctcccac gcccgacgag aagatgttca agctgctgtc cagagcctac 1261 gctgacgtcc accccatgat gatggacagg tcggagaata ggtgtggagg caatttcctg 1321 aagaggggga gcatcatcaa cggggcggac tggtacagct tcacgggagg catgtccgat 1381 ttcaactacc tgcacaccaa ctgctttgag atcacggtag agctgggctg tgtgaagttc 1441 ccccccgagg aggccctgta cacactctgg cagcacaaca aggagtcact cctgaatttc 1501 gtggagacgg tgcaccgggg catcaaaggt gtggtgacag ataaattcgg caagccagtc 1561 aaaaacgccc ggatctcagt caaaggcatt cgccacgaca tcaccacagc cccagatggt 1621 gactactgga gactgctgcc cccaggtatc cacattgtca ttgcccaagc ccctggctac 1681 gccaaagtca tcaagaaagt catcatcccc gcccggatga agagggctgg ccgtgtggac 1741 ttcattctgc aacctctggg gatgggaccc aagaacttta ttcatgggct gcggaggact 1801 gggccccacg acccgctggg aggtgccagc tctttggggg aggccacgga gcccgacccg 1861 ctccgggcgc gcaggcagcc ctcggccgac gggagtaagc cctggtggtg gtcctacttc 1921 acatcgctga gcacccacag gccacgctgg ctgctcaagt actagccccg gccccagcac 1981 ccgccaggat gtggagaccg aggcccatct ccgcatcccg ggctcctggc tcttgatttt 2041 gtctgccaca gacatcccac aaagccgctg ccattttatt aaagtgtttt gatccacaaa // LOCUS HSU83460 1804 bp mRNA PRI 09-AUG-1997 DEFINITION Human high-affinity copper uptake protein (hCTR1) mRNA, complete cds. ACCESSION U83460 NID g2315986 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1804) AUTHORS Zhou,B. and Gitschier,J. TITLE hCTR1: A human gene for copper uptake identified by complementation in yeast JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (14), 7481-7486 (1997) MEDLINE 97352824 REFERENCE 2 (bases 1 to 1804) AUTHORS Zhou,B. and Gitschier,J.M. TITLE Direct Submission JOURNAL Submitted (31-DEC-1996) Howard Hughes Medical Institute, 513 Parnassus, San Francisco, CA 94143, USA FEATURES Location/Qualifiers source 1..1804 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="9" /map="9q31-q32" gene 1..1804 /gene="hCTR1" CDS 153..725 /gene="hCTR1" /note="contains 3 putative transmembrane domains; similar to copper transporters found in Arabidopsis thaliana, COPT1, and in Saccharomyces cerevisae, CTR1" /codon_start=1 /product="high-affinity copper uptake protein" /db_xref="PID:g2315987" /translation="MDHSHHMGMSYMDSNSTMQPSHHHPTTSASHSHGGGDSSMMMMP MTFYFGFKNVELLFSGLVINTAGEMAGAFVAVFLLAMFYEGLKIARESLLRKSQVSIR YNSMPVPGPNGTILMETHKTVGQQMLSFPHLLQTVLHIIQVVISYFLMLIFMTYNGYL CIAVAAGAGTGYFLFSWKKAVVVDITEHCH" BASE COUNT 500 a 411 c 423 g 470 t ORIGIN 1 gcggtggtgg acacgtcgag ccgggtagaa gtggaggggc cgttcgaaga gtcgtgaggg 61 ggtgacgggt taagattcgg agagagaggt gctagtggct ggacttgacc tggaaagaat 121 cttctgctga ctctcaactt ttcctggaaa aaatggatca ttcccaccat atggggatga 181 gctatatgga ctccaacagt accatgcaac cttctcacca tcacccaacc acttcagcct 241 cacactccca tggtggagga gacagcagca tgatgatgat gcctatgacc ttctactttg 301 gctttaagaa tgtggaacta ctgttttccg gtttggtgat caatacagct ggagaaatgg 361 ctggagcttt tgtggcagtg tttttactag caatgttcta tgaaggactc aagatagccc 421 gagagagcct gctgcgtaag tcacaagtca gcattcgcta caattccatg cctgtcccag 481 gaccaaatgg aaccatcctt atggagacac acaaaactgt tgggcaacag atgctgagct 541 ttcctcacct cctgcaaaca gtgctgcaca tcatccaggt ggtcataagc tacttcctca 601 tgctcatctt catgacctac aacgggtacc tctgcattgc agtagcagca ggggccggta 661 caggatactt cctcttcagc tggaagaagg cagtggtagt ggatatcaca gagcattgcc 721 attgacatca aactctatgg cgtggcctta tcgattgcag tgggaagttg ttgaagactt 781 gaagacgtga ttcctgctcc aatcatccct tcttgctcct ctttgtgcac gtacacacac 841 acacacacac acacacacac acacacaccc ctgctcaaca gaggtttagt ttacagtctc 901 tgaactaaag tagtaacctc ccaaattgtt ttttctaata agctgagatt cccatttctc 961 ttaaggagaa gccacccatg agatgtcttt tccttctcca tcatcttaga gccaagttat 1021 atgttcttgt ctaatccatg tagctttttg ttcaatgact tgatcatctg cttccttttt 1081 gaatttttaa cagatagtaa gtaaatttgg tggttttttc ccctgggtca gtgatggaaa 1141 ggggttaact tcagccagga ttgatggcag ctgagggaaa ttcttgccca actaaaccca 1201 gaactcaaac ttaacattag aaaataaggt ccagggccgg acacagtggc ccatgcctgt 1261 aatcccagca ctttgggggg ccaaggcagg ctggatcacc tgaggacagg agttcgagac 1321 cagtctggcc aacatgggga aaccccgtct ctactaaaaa tacaaaaatt agccgggcat 1381 ggtggtgggc gcctgtaatc ccagctactc agaaggctga ggcaggagaa tcacttgaac 1441 ctaggaggcg gaggttgcag tgagccaaga tggcgccatt gcactccagc ctgggtgaca 1501 agagtgaaac tccatctcaa aaaaaaagaa aagaaggtcc agcttttgga ttcaatgagt 1561 gggaaataca ttgtgccttt ctctagatgt gatacgttat accaaaatct ttgtagtgtg 1621 cagagcggtg gtttgagact aaatacaggc ttagaacttg cagagtgtgt attcttggat 1681 ggctgatgca tcgacttgca ttcccactta acactttgat tagccatgaa cttgccaatc 1741 aaaaaatgac aatcaatttg agaaaataga aatagatatt tttaaataaa accattcaca 1801 gttt // LOCUS HSU83461 1698 bp mRNA PRI 09-AUG-1997 DEFINITION Human putative copper uptake protein (hCTR2) mRNA, complete cds. ACCESSION U83461 NID g2315988 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1698) AUTHORS Zhou,B. and Gitschier,J. TITLE hCTR1: A human gene for copper uptake identified by complementation in yeast JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (14), 7481-7486 (1997) MEDLINE 97352824 REFERENCE 2 (bases 1 to 1698) AUTHORS Zhou,B. and Gitschier,J.M. TITLE Direct Submission JOURNAL Submitted (31-DEC-1996) Howard Hughes Medical Institute, 513 Parnassus, San Francisco, CA 94143, USA FEATURES Location/Qualifiers source 1..1698 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="9" /map="9q31-q32" gene 1..1698 /gene="hCTR2" CDS 64..495 /gene="hCTR2" /note="similar to the hCTR1 product encoded by GenBank Accession Number U83460 and to yeast CTR2" /codon_start=1 /product="putative copper uptake protein" /db_xref="PID:g2315989" /translation="MAMHFIFSDTAVLLFDFWSVHSPAGMALSVLVLLLLAVLYEGIK VGKAKLLNQVLVNLPTSISQQTIAETDGDSAGSDSFPVGRTHHRWYLCHFGQSLIHVI QVVIGYFIMLAVMSYNTWIFLGVVLGSAVGYYLAYPLLSTA" BASE COUNT 422 a 416 c 386 g 474 t ORIGIN 1 tcggcacagg agcgaggaga cccgagagca gacgcgccct ggcgcccgcc ctgcgcagtc 61 accatggcga tgcatttcat cttctcagat acagcggtgc ttctgtttga tttctggagt 121 gtccacagtc ctgctggcat ggccctttcg gtgttggtgc tcctgcttct ggctgtactg 181 tatgaaggca tcaaggttgg caaagccaag ctgctcaacc aggtactggt gaacctgcca 241 acctccatca gccagcagac catcgcagag acagacgggg actctgcagg ctcagattca 301 ttccctgttg gcagaaccca ccacaggtgg tacttgtgtc actttggcca gtctctaatc 361 catgtcatcc aggtggtcat cggctacttc atcatgctgg ccgtaatgtc ctacaacacc 421 tggattttcc ttggtgtggt cttgggctct gctgtgggct actacctagc ttacccactt 481 ctcagcacag cttagatggt gaggaacgtg caggcactga ggctggaggg acatggagcc 541 ccctcttcca gacactatac ttccaactgc cctttcttct gatggctatt cctccacctt 601 attcccagcc cctggaaact ttgagctgaa gccagcactt gctccctgga gttcggaagc 661 cattgcagca accttccttc tcagccagcc tacgtagggc ccaggcatgg tcttgtgtct 721 taagacagct gctgtgacca aagggagaat ggagataaca ggggtggcag ggttactgag 781 cccatgacaa tgcttctctg tgactcaaac caggaatttc caaagatttc aagccaggga 841 gaagggttct tggtgatgca gggcatggaa cctggacacc ctcagctctc ctgctttgtg 901 ccttatctac aggagcatcg cccattggac ttcctgacct cttctgtctt tgagggacag 961 agaccaagct agatcctttt tctcaccttt ctgcctttgg aacacatgaa gatcatctcg 1021 tctatggatc atgttgacaa actaagtttt ttttattttt cccattgaac tcctagttgg 1081 caattttgca cattcataca aaaaaatttt taatgaaatg atttcattga ttcatgatgg 1141 atggcagaaa ctgctgagac ctatttccct ttcttgggga gagaataagt gacagctgat 1201 taaaggcaga gacacaggac tgctttcagg ctcctggttt attctctgat tgactgagct 1261 ccttccacca gaaggcactg cctgcaggaa gaagatgatc tgatggccgt gggtgtctgg 1321 gaagctcttc gtggcctcaa tgccctcctt tatcctcatc tttcttctat gcagaacaaa 1381 aagctgcatc taataatgtt caatacttaa tattctctat ttattactta ctgcttactc 1441 gtaatgatct agtggggaaa catgattcat tcacttaaaa tactgattaa gccatgggca 1501 ggtactgact gaagatgcaa tccaaccaaa gccattacat tttttgagtt agatgggact 1561 ctctggatag ttgaacctct tcactttata aaaaaggaaa gagagaaaat cactgctgta 1621 tactaaatac ctcacagatt agatgaaaag atggttgtaa gctttgggaa ttaaaaacaa 1681 atacatttta gtaaatat // LOCUS HSU83508 2149 bp mRNA PRI 27-MAR-1997 DEFINITION Human angiopoietin-1 mRNA, complete cds. ACCESSION U83508 NID g1907326 KEYWORDS angiogenesis; TEK; VEGF. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2149) AUTHORS Davis,S., Aldrich,T.H., Jones,P.F., Acheson,A., Compton,D.L., Jain,V., Ryan,T.E., Bruno,J., Radziejewski,C., Maisonpierre,P.C. and Yancopoulos,G.D. TITLE Isolation of angiopoietin-1, a ligand for the TIE2 receptor, by secretion-trap expression cloning JOURNAL Cell 87 (7), 1161-1169 (1996) MEDLINE 97134663 REFERENCE 2 (bases 1 to 2149) AUTHORS Davis,S., Aldrich,T.H., Jones,P.F., Acheson,A., Compton,D.L., Jain,V., Ryan,T.E., Bruno,J., Radziejewski,C., Maisonpierre,P.C. and Yancopoulos,G.D. TITLE Direct Submission JOURNAL Submitted (31-DEC-1996) Discovery, Regeneron Pharmaceuticals, 777 Old Saw Mill River Road, Tarrytown, NY 10591, USA FEATURES Location/Qualifiers source 1..2149 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 310..1806 /function="ligand for the TIE2 receptor" /codon_start=1 /product="angiopoietin-1" /db_xref="PID:g1907327" /translation="MTVFLSFAFLAAILTHIGCSNQRRSPENSGRRYNRIQHGQCAYT FILPEHDGNCRESTTDQYNTNALQRDAPHVEPDFSSQKLQHLEHVMENYTQWLQKLEN YIVENMKSEMAQIQQNAVQNHTATMLEIGTSLLSQTAEQTRKLTDVETQVLNQTSRLE IQLLENSLSTYKLEKQLLQQTNEILKIHEKNSLLEHKILEMEGKHKEELDTLKEEKEN LQGLVTRQTYIIQELEKQLNRATTNNSVLQKQQLELMDTVHNLVNLCTKEGVLLKGGK REEEKPFRDCADVYQAGFNKSGIYTIYINNMPEPKKVFCNMDVNGGGWTVIQHREDGS LDFQRGWKEYKMGFGNPSGEYWLGNEFIFAITSQRQYMLRIELMDWEGNRAYSQYDRF HIGNEKQNYRLYLKGHTGTAGKQSSLILHGADFSTKDADNDNCMCKCALMLTGGWWFD ACGPSNLNGMFYTAGQNHGKLNGIKWHYFKGPSYSLRSTTMMIRPLDF" BASE COUNT 733 a 411 c 487 g 518 t ORIGIN 1 cagctgactc aggcaggctc catgctgaac ggtcacacag agaggaaaca ataaatctca 61 gctactatgc aataaatatc tcaagtttta acgaagaaaa acatcattgc agtgaaataa 121 aaaattttaa aattttagaa caaagctaac aaatggctag ttttctatga ttcttcttca 181 aacgctttct ttgaggggga aagagtcaaa caaacaagca gttttacctg aaataaagaa 241 ctagttttag aggtcagaag aaaggagcaa gttttgcgag aggcacggaa ggagtgtgct 301 ggcagtacaa tgacagtttt cctttccttt gctttcctcg ctgccattct gactcacata 361 gggtgcagca atcagcgccg aagtccagaa aacagtggga gaagatataa ccggattcaa 421 catgggcaat gtgcctacac tttcattctt ccagaacacg atggcaactg tcgtgagagt 481 acgacagacc agtacaacac aaacgctctg cagagagatg ctccacacgt ggaaccggat 541 ttctcttccc agaaacttca acatctggaa catgtgatgg aaaattatac tcagtggctg 601 caaaaacttg agaattacat tgtggaaaac atgaagtcgg agatggccca gatacagcag 661 aatgcagttc agaaccacac ggctaccatg ctggagatag gaaccagcct cctctctcag 721 actgcagagc agaccagaaa gctgacagat gttgagaccc aggtactaaa tcaaacttct 781 cgacttgaga tacagctgct ggagaattca ttatccacct acaagctaga gaagcaactt 841 cttcaacaga caaatgaaat cttgaagatc catgaaaaaa acagtttatt agaacataaa 901 atcttagaaa tggaaggaaa acacaaggaa gagttggaca ccttaaagga agagaaagag 961 aaccttcaag gcttggttac tcgtcaaaca tatataatcc aggagctgga aaagcaatta 1021 aacagagcta ccaccaacaa cagtgtcctt cagaagcagc aactggagct gatggacaca 1081 gtccacaacc ttgtcaatct ttgcactaaa gaaggtgttt tactaaaggg aggaaaaaga 1141 gaggaagaga aaccatttag agactgtgca gatgtatatc aagctggttt taataaaagt 1201 ggaatctaca ctatttatat taataatatg ccagaaccca aaaaggtgtt ttgcaatatg 1261 gatgtcaatg ggggaggttg gactgtaata caacatcgtg aagatggaag tctagatttc 1321 caaagaggct ggaaggaata taaaatgggt tttggaaatc cctccggtga atattggctg 1381 gggaatgagt ttatttttgc cattaccagt cagaggcagt acatgctaag aattgagtta 1441 atggactggg aagggaaccg agcctattca cagtatgaca gattccacat aggaaatgaa 1501 aagcaaaact ataggttgta tttaaaaggt cacactggga cagcaggaaa acagagcagc 1561 ctgatcttac acggtgctga tttcagcact aaagatgctg ataatgacaa ctgtatgtgc 1621 aaatgtgccc tcatgttaac aggaggatgg tggtttgatg cttgtggccc ctccaatcta 1681 aatggaatgt tctatactgc gggacaaaac catggaaaac tgaatgggat aaagtggcac 1741 tacttcaaag ggcccagtta ctccttacgt tccacaacta tgatgattcg acctttagat 1801 ttttgaaagc gcaatgtcag aagcgattat gaaagcaaca aagaaatccg gagaagctgc 1861 caggtgagaa actgtttgaa aacttcagaa gcaaacaata ttgtctccct tccagcaata 1921 agtggtagtt atgtgaagtc accaaggttc ttgaccgtga atctggagcc gtttgagttc 1981 acaagagtct ctacttgggg tgacagtgct cacgtggctc gactatagaa aactccactg 2041 actgtcgggc tttaaaaagg gaagaaactg ctgagcttgc tgtgcttcaa actactactg 2101 gaccttattt tggaactatg gtagccagat gataaatatg gttaatttc // LOCUS HSU83908 1740 bp DNA PRI 07-FEB-1997 DEFINITION Human nuclear antigen H731 mRNA, complete cds. ACCESSION U83908 NID g1825561 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1740) AUTHORS Matsuhashi,S., Yoshinaga,H., Yatsuki,H., Tsugita,A. and Hori,K. TITLE Isolation of a novel gene from a human cell line with Pr-28 MAb which recognizes a nuclear antigen involved in the cell cycle JOURNAL Res. Commun. Biochem. Cell Mol. Biol. 1, 109-120 (1997) REFERENCE 2 (bases 1 to 1740) AUTHORS Yoshinaga,H., Matsuhashi,S., Kondo,T. and Hori,K. TITLE Expression of the human H731 gene product in Escherichia coli inhibits DNA synthesis JOURNAL Unpublished REFERENCE 3 (bases 1 to 1740) AUTHORS Matsuhashi,S. TITLE Direct Submission JOURNAL Submitted (06-JAN-1997) Biochemistry, Saga Medical School, Nabeshima, Saga 849, Japan FEATURES Location/Qualifiers source 1..1740 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="glioma cell line" CDS 235..1611 /function="has a role in the cell cycle" /codon_start=1 /product="nuclear antigen H731" /db_xref="PID:g1825562" /translation="MTKYPDNLSDSLFSGDEENAGTEEVKNEINGNWISASSINEARI NAKAKRRLRKNSSRDSGRGDSVSESGSDALRSGLTVPTSPKGRLLDRRSRSGKGRGLP KKGGAGGKGVWGTPGQVYDVEEVDVKDPNYDDDQENCVYETVVLPLDERAFEKTLTPI IQEYFEHGDTNEVAEMLRDLNLGEMKSGVPVLAVSLALEGKASHREMTTKLLSDLCGT VMSTTDVEKSFDKLLKDLPELALDTPRAPQLVGQFIARAVGDGILCNTYIDSYKGTVD CVQARAALDKATVLLSMSKGGKRKDSVWGSGGGQQSVNHLVKEIDMLLKEYLLSGDIS EAEHCLKELEVPHFHHELVYEAIIMVLESTGESTFKMILDLLKSLWKSSTITVDQMKR GYERIYNEIPDINLDVPHSYSVLERFVEECFQAGIISKQLRDLCPSRGRKRFVSEGDG GRLKPESY" BASE COUNT 535 a 263 c 452 g 490 t ORIGIN 1 ggggtcgggg ccggctgacc aggaacctgg gcgagcagcg gcgggggccc gagggattct 61 gaaggaagat ttccattagg taatttgttt aatcagtgca agcgaaatta agggaaaatg 121 gatgtagaaa atgagcagat actgaatgta aaccctgcag ggtattttcc ctaattctcc 181 atggtgcttc aatagcatgt tattatcata aaaatgaaca gttttgtgga atagatgacc 241 aaatatcctg ataacttaag tgactctctc ttttccggtg atgaagaaaa tgctgggact 301 gaggaagtaa agaatgaaat aaatggaaat tggatttcag catcctccat taacgaagct 361 agaattaatg ccaaggcaaa aaggcgacta aggaaaaact catcccggga ctctggcaga 421 ggcgattcgg tcagcgagag tgggagtgac gcccttagaa gtggattaac tgtgccaacc 481 agtccaaagg gaaggttgct ggataggcga tccagatctg ggaaaggaag gggactacca 541 aagaaaggtg gtgcaggagg caaaggtgtc tggggtacac ctggacaggt gtatgatgtg 601 gaggaggtgg atgtgaaaga tcctaactat gatgatgacc aggagaactg tgtttatgaa 661 actgtagttt tgcctttgga tgaaagggca tttgagaaga ctttaacacc aatcatacag 721 gaatattttg agcatggaga tactaatgaa gttgcggaaa tgttaagaga tttaaatctt 781 ggtgaaatga aaagtggagt accagtgttg gcagtatcct tagcattgga ggggaaggct 841 agtcatagag agatgactac taagcttctt tctgaccttt gtgggacagt aatgagcaca 901 actgatgtgg aaaaatcatt tgataaattg ttgaaagatc tacctgaatt agcactggat 961 actcctagag caccacagtt ggtgggccag tttattgcta gagctgttgg agatggaatt 1021 ttatgtaata cctatattga tagttacaaa ggaactgtag attgtgtgca ggctagagct 1081 gctctggata aggctaccgt gcttctgagt atgtctaaag gtggaaagcg taaagatagt 1141 gtgtggggct ctggaggtgg gcagcaatct gtcaatcacc ttgttaaaga gattgatatg 1201 ctgctgaaag aatatttact ctctggagac atatctgaag ctgaacattg ccttaaggaa 1261 ctggaagtac ctcattttca ccatgagctt gtatatgaag ctattataat ggttttagag 1321 tcaactggag aaagtacatt taagatgatt ttggatttat taaagtccct ttggaagtct 1381 tctaccatta ctgtagacca aatgaaaaga ggttatgaga gaatttacaa tgaaattccg 1441 gacattaatc tggatgtccc acattcatac tctgtgctgg agcggtttgt agaagaatgt 1501 tttcaggctg gaataatttc caaacaactc agagatcttt gtccttcaag gggcagaaag 1561 cgttttgtaa gcgaaggaga tggaggtcgt cttaaaccag agagctactg aatataagaa 1621 ctcttgcagt cttagatgtt ataaaaatat atatctgaat tgtaagagtt gttagcacaa 1681 gttttttttt tttttttttt aagcacttgt tttgggtaca aggcatttct gacattttat // LOCUS HSU84007 7367 bp mRNA PRI 04-MAR-1997 DEFINITION Human glycogen debranching enzyme isoform 1 (AGL) mRNA, alternatively spliced isoform, complete cds. ACCESSION U84007 NID g1857619 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7367) AUTHORS Bao,Y., Dawsom,T.L. and Chen,Y.-T. TITLE Human glycogen debranching enzyme gene (AGL): complete structural organization and characterization of the 5' flanking region JOURNAL Genomics 38 (1997) In press REFERENCE 2 (bases 1 to 7367) AUTHORS Bao,Y., Dawsom,T.L. and Chen,Y.-T. TITLE Direct Submission JOURNAL Submitted (07-JAN-1997) Pediatrics, Duke University Medical Center, Box 3528, Durham, NC 27710, USA FEATURES Location/Qualifiers source 1..7367 /organism="Homo sapiens" /db_xref="taxon:9606" /map="1p21" /chromosome="1" gene 401..4999 /gene="AGL" CDS 401..4999 /gene="AGL" /EC_number="2.4.1.25" /note="alternatively spliced isoform" /codon_start=1 /product="glycogen debranching enzyme isoform 1" /db_xref="PID:g1857620" /translation="MGHSKQIRILLLNEMEKLEKTLFRLEQGYELQFRLGPTLQGKAV TVYTNYPFPGETFNREKFRSLDWENPTEREDDSDKYCKLNLQQSGSFQYYFLQGNEKS GGGYIVVDPILRVGADNHVLPLDCVTLQTFLAKCLGPFDEWESRLRVAKESGYNMIHF TPLQTLGLSRSCYSLANQLELNPDFSRPNRKYTWNDVGQLVEKLKKEWNVICITDVVY NHTAANSKWIQEHPECAYNLVNSPHLKPAWVLDRALWRFSCDVAEGKYKEKGIPALIE NDHHMNSIRKIIWEDIFPKLKLWEFFQVDVNKAVEQFRRLLTQENRRVTKSDPNQHLT IIQDPEYRRFGCTVDMNIALTTFIPHDKGPAAIEECCNWFHKRMEELNSEKHRLINYH QEQAVNCLLGNVFYERLAGHGPKLGPVTRKHPLVTRYFTFPFEEIDFSMEESMIHLPN KACFLMAHNGWVMGDDPLRNFAEPGSEVYLRRELICWGDSVKLRYGNKPEDCPYLWAH MKKYTEITATYFQGVRLDNCHSTPLHVAEYMLDAARNLQPNLYVVAELFTGSEDLDNV FVTRLGISSLIREAMSAYNSHEEGRLVYRYGGEPVGSFVQPCLRPLMPAIAHALFMDI THDNECPIVHRSAYDALPSTTIVSMACCASGSTRGYDELVPHQISVVSEERFYTKWNP EALPSNTGEVNFQSGIIAARCAISKLHQELGAKGFIQVYVDQVDEDIVAVTRHSPSIH QSVVAVSRTAFRNPKTSFYSKEVPQMCIPGKIEEVVLEARTIERNTKPYRKDENSING TPDITVEIREHIQLNESKIVKQAGVATKGPNEYIQEIEFENLSPGSVIIFRVSLDPHA QVAVGILRNHLTQFSPHFKSGSLAVDNADPILKIPFASLASRLTLAELNQILYRCESE EKEDGGGCYDIPNWSALKYAGLQGLMSVLAEIRPKNDLGHPFCNNLRSGDWMIDYVSN RLISRSGTIAEVGKWLQAMFFYLKQIPRYLIPCYFDAILIGAYTTLLDTAWKQMSSFV QNGSTFVKHLSLGSVQLCGVGKFPSLPILSPALMDVPYRLNEITKEKEQCCVSLAAGL PHFSSGIFRCWGRDTFIALRGILLITGRYVEARNIILAFAGTLRHGLIPNLLGEGIYA RYNCRDAVWWWLQCIQDYCKMVPNGLDILKCPVSRMYPTDDSAPLPAGTLDQPLFEVI QEAMQKHMQGIQFRERNAGPQIDRNMKDEGFNITAGVDEETGFVYGGNRFNCGTWMDK MGESDRARNRGIPATPRDGSAVEIVGLSKSAVRWLLELSKKNIFPYHEVTVKRHGKAI KVSYDEWNRKIQDNFEKLFHVSEDPSDLNEKHPNLVHKRGIYKDSYGASSPWCDYQLR PNFTIAMVVAPELFTTEKAGKALEIAEKKLLGPLGMKTLDPDDMVYCGIYDNALDNDN YNLAKGFNYHQGPEWLWPIGYFLRAKLYFSRLMGPETTAKTIVLVKNVLSRHYVHLER SPWKGLPELTNENAQYCPFSCETQAWSIATILETLYDL" BASE COUNT 2317 a 1305 c 1483 g 2262 t ORIGIN 1 cccggaagtg ggccagaggt acggtccgct cccacctggg gcgagtgcgc gcacggccag 61 gttgggtacc gggtgcgccc aggaacccgc gcgaggcgaa gtcgctgaga ctctgcctgc 121 ttctcaccca gctgcctcgg cgctgccccg gtcgctcgcc gcccctccct ttgcccttca 181 cggcgcccgg ccctccttgg gctgcggctt ctgtgcgagg ctgggcagcc agcccttccc 241 cttctgtttc tccccgtccc ctccccccga ccgtagcacc agagtcgcgg gtcctgcagt 301 gccccagaag ccgcacgtat aactccctcg gcgggtaact cattcgactg tggagttctt 361 ttaattctta tgaaagattt caaatcctct ggaagccaaa atgggacaca gtaaacagat 421 tcgaatttta cttctgaacg aaatggagaa actggaaaag accctcttca gacttgaaca 481 agggtatgag ctacagttcc gattaggccc aactttacag ggaaaagcag ttaccgtgta 541 tacaaattac ccatttcctg gagaaacatt taatagagaa aaattccgtt ctctggattg 601 ggaaaatcca acagaaagag aagatgattc tgataaatac tgtaaactta atctgcaaca 661 atctggttca tttcagtatt atttccttca aggaaatgag aaaagtggtg gaggttacat 721 agttgtggac cccattttac gtgttggtgc tgataatcat gtgctaccct tggactgtgt 781 tactcttcag acatttttag ctaagtgttt gggacctttt gatgaatggg aaagcagact 841 tagggttgca aaagaatcag gctacaacat gattcatttt accccattgc agactcttgg 901 actatctagg tcatgctact cccttgccaa tcagttagaa ttaaatcctg acttttcaag 961 acctaataga aagtatacct ggaatgatgt tggacagcta gtggaaaaat taaaaaagga 1021 atggaatgtt atttgtatta ctgatgttgt ctacaatcat actgctgcta atagtaaatg 1081 gatccaggaa catccagaat gtgcctataa tcttgtgaat tctccacact taaaacctgc 1141 ctgggtctta gacagagcac tttggcgttt ctcctgtgat gttgcagaag ggaaatacaa 1201 agaaaaggga atacctgctt tgattgaaaa tgatcaccat atgaattcca tccgaaaaat 1261 aatttgggag gatatttttc caaagcttaa actctgggaa tttttccaag tagatgtcaa 1321 caaagcggtt gagcaattta gaagacttct tacacaagaa aataggcgag taaccaagtc 1381 tgatccaaac caacacctta cgattattca agatcctgaa tacagacggt ttggctgtac 1441 tgtagatatg aacattgcac taacgacttt cataccacat gacaaggggc cagcagcaat 1501 tgaagaatgc tgtaattggt ttcataaaag aatggaggaa ttaaattcag agaagcatcg 1561 actcattaac tatcatcagg aacaggcagt taattgcctt ttgggaaatg tgttttatga 1621 acgactggct ggccatggtc caaaactagg acctgtcact agaaagcatc ctttagttac 1681 caggtatttt actttcccat ttgaagagat agacttctcc atggaagaat ctatgattca 1741 tctgccaaat aaagcttgtt ttctgatggc acacaatgga tgggtaatgg gagatgatcc 1801 tcttcgaaac tttgctgaac cgggttcaga agtttaccta aggagagaac ttatttgctg 1861 gggagacagt gttaaattac gctatgggaa taaaccagag gactgtcctt atctctgggc 1921 acacatgaaa aaatacactg aaataactgc aacttatttc cagggagtac gtcttgataa 1981 ctgccactca acacctcttc acgtagctga gtacatgttg gatgctgcta ggaatttgca 2041 acccaattta tatgtagtag ctgaactgtt cacaggaagt gaagatctgg acaatgtctt 2101 tgttactaga ctgggcatta gttccttaat aagagaggca atgagtgcat ataatagtca 2161 tgaagagggc agattagttt accgatatgg aggagaacct gttggatcct ttgttcagcc 2221 ctgtttgagg cctttaatgc cagctattgc acatgccctg tttatggata ttacgcatga 2281 taatgagtgt cctattgtgc atagatcagc gtatgatgct cttccaagta ctacaattgt 2341 ttctatggca tgttgtgcta gtggaagtac aagaggctat gatgaattag tgcctcatca 2401 gatttcagtg gtttctgaag aacggtttta cactaagtgg aatcctgaag cattgccttc 2461 aaacacaggt gaagttaatt tccaaagcgg cattattgca gccaggtgtg ctatcagtaa 2521 acttcatcag gagcttggag ccaagggttt tattcaggtg tatgtggatc aagttgatga 2581 agacatagtg gcagtaacaa gacactcacc tagcatccat cagtctgttg tggctgtatc 2641 tagaactgct ttcaggaatc ccaagacttc attttacagc aaggaagtgc ctcaaatgtg 2701 catccctggc aaaattgaag aagtagttct tgaagctaga actattgaga gaaacacgaa 2761 accttatagg aaggatgaga attcaatcaa tggaacacca gatatcacag tagaaattag 2821 agaacatatt cagcttaatg aaagtaaaat tgttaaacaa gctggagttg ccacaaaagg 2881 gcccaatgaa tatattcaag aaatagaatt tgaaaacttg tctccaggaa gtgttattat 2941 attcagagtt agtcttgatc cacatgcaca agtcgctgtt ggaattcttc gaaatcatct 3001 gacacaattc agtcctcact ttaaatctgg cagcctagct gttgacaatg cagatcctat 3061 attaaaaatt ccttttgctt ctcttgcctc cagattaact ttggctgagc taaatcagat 3121 cctttaccga tgtgaatcag aagaaaagga agatggtgga gggtgctatg acataccaaa 3181 ctggtcagcc cttaaatatg caggtcttca aggtttaatg tctgtattgg cagaaataag 3241 accaaagaat gacttggggc atcctttttg taataatttg agatctggag attggatgat 3301 tgactatgtc agtaaccggc ttatttcacg atcaggaact attgctgaag ttggtaaatg 3361 gttgcaggct atgttcttct acctgaagca gatcccacgt taccttatcc catgttactt 3421 tgatgctata ttaattggtg catataccac tcttctggat acagcatgga agcagatgtc 3481 aagctttgtt cagaatggtt caacctttgt gaaacacctt tcattgggtt cagttcaact 3541 gtgtggagta ggaaaattcc cttccctgcc aattctttca cctgccctaa tggatgtacc 3601 ttataggtta aatgagatca caaaagaaaa ggagcaatgt tgtgtttctc tagctgcagg 3661 cttacctcat ttttcttctg gtattttccg ctgctgggga agggatactt ttattgcact 3721 tagaggtata ctgctgatta ctggacgcta tgtagaagcc aggaatatta ttttagcatt 3781 tgcgggtacc ctgaggcatg gtctcattcc taatctactg ggtgaaggga tttatgccag 3841 atacaattgt cgggatgctg tgtggtggtg gctgcagtgt atccaggatt actgtaaaat 3901 ggttccaaat ggtctagaca ttctcaagtg cccagtttcc agaatgtatc ctacagatga 3961 ttctgctcct ttgcctgctg gcacactgga tcagccattg tttgaagtca tacaggaagc 4021 aatgcaaaaa cacatgcagg gcatacagtt ccgagaaagg aatgctggtc cccagataga 4081 tcgaaacatg aaggacgaag gttttaatat aactgcagga gttgatgaag aaacaggatt 4141 tgtttatgga ggaaatcgtt tcaattgtgg cacatggatg gataaaatgg gagaaagtga 4201 cagagctaga aacagaggaa tcccagccac accaagagat gggtctgctg tggaaattgt 4261 gggcctgagt aaatctgctg ttcgctggtt gctggaatta tccaaaaaaa atattttccc 4321 ttatcatgaa gtcacagtaa aaagacatgg aaaggctata aaggtctcat atgatgagtg 4381 gaacagaaaa atacaagaca actttgaaaa gctatttcat gtttccgaag acccttcaga 4441 tttaaatgaa aagcatccaa atctggttca caaacgtggc atatacaaag atagttatgg 4501 agcttcaagt ccttggtgtg actatcagct caggcctaat tttaccatag caatggttgt 4561 ggcccctgag ctctttacta cagaaaaagc agggaaagct ttggagattg cagaaaaaaa 4621 attgcttggt ccccttggca tgaaaacttt agatccagat gatatggttt actgtggaat 4681 ttatgacaat gcattagaca atgacaacta caatcttgct aaaggtttca attatcacca 4741 aggacctgag tggctgtggc ctattgggta ttttcttcgt gcaaaattat atttttccag 4801 attgatgggc ccggagacta ctgcaaagac tatagttttg gttaaaaatg ttctttcccg 4861 acattatgtt catcttgaga gatccccttg gaaaggactt ccagaactga ccaatgagaa 4921 tgcccagtac tgtcctttca gctgtgaaac acaagcctgg tcaattgcta ctattcttga 4981 gacactttat gatttatagt ttattacaga tattaagtat gcaattactg tattatagga 5041 tgcaaggtca tcatatgtaa atgctatatg cacaggctca agttgtttta aaaatctcat 5101 ttattataat attgatgctc aattaggtaa gattgtaaaa gcattgattt tttttaatgt 5161 acagaggtag atttcaattt gaatcagaaa gaaatatcat taccaatgaa atgtgtttga 5221 gttcagtaag aattattcaa atgcctagaa atccatagtt tggaaaataa aaatcatgtc 5281 atcttctatt tgtacagaaa tgaaaataaa atatgaaaat aatgaaagaa atgaaaagat 5341 agcttttaat tgtggtatat ataatcttca gtaacaatac atactgaata cgctgtggtt 5401 cattaatatt aacaccacgt actatagtat tcttagaata cagtgctcac tgcatttaat 5461 aaatatttaa taaatgatga atgatagaag tttccatcta caatatatgt tcctaaatgg 5521 agcacagatg ttcaaactat gctttcattt tttcactgat atattaattt ttgtgtaatg 5581 aatgccaaca gtatatttta tatgatttac ttatgtgagg aaacatgcaa agcattagga 5641 aatttatttc ctaaaaacag ttttgtaaaa ttagtattga gttctattga gtattataag 5701 atagcttaca ttttcaaaat ggaaattgtc ggtcatattt ctagaacttt aaagaaaaaa 5761 gaatgttata ttagttttct aaaactcaac tatctttagt catgttcaaa aatctattgc 5821 tagatcatag tagatactgg ttttctatta actcaaaacc tacattgaca agtttaacat 5881 tgagaagaat cttaacaaaa atatggatat gaattcagta gatatcttaa attcaataaa 5941 atcactggaa gtttttcatg ataacttatt ttaagatgcc ttaaaaatct taaagtcaca 6001 aaaggaaaaa ggtttttaac atttacatga gttaacattt tttcatagaa cttatttcct 6061 agatagaatt ttttactgtt ttttactgtt ttcttaagaa aacagttaaa tcattatgca 6121 ttcagttgga agaaagtagt ggcaagaatt ctttcattgc tatataatat tcagtggctc 6181 atttatacct aataaaataa tggtatttta aaataatgct actttcaaag tagcattttt 6241 ttagttagtt tacaggttac atacccaaaa ccttaactat gactaagaaa ttaaagaaga 6301 aaaccagcaa actaaaactt ctgggcagca aaaatatata aatgcttcag atgtcaaata 6361 cccatgcttg aaagctcgtg taatttactt taagattatc tgcctgctct tcttcaaagc 6421 tgaccttgct ttagaaatag ttttaactag cttagttttc tggtttccaa aactaaaata 6481 gattaaatcc tacaaattta aggacagttg tgacagtaat ctgaccacta tctataaata 6541 cattggacat tggtttccaa atctcccttt ctcttcagtt ccttccttgt tcaatatata 6601 cccttctcta aactgtgcgg gtaaaaggaa tgactgtcct tgagagaacc attagtttat 6661 caaaggttta tgtagttttg ttgctgtacc ctaactttga tattcaggga ggtaggaaag 6721 gtaacagaaa accagcatat ttaatcaaag caagaagtaa tcgctgacag ttaaatgtga 6781 ccaaaaaaat taaaagttca caattttttt aatgtagcca tttggggtta tctctagtaa 6841 ggcagatacc cacgttggta aatttttagg atattgtgtt gcactagaaa actaagtggt 6901 tcatatttct aatgaggaag attaatgaaa gaacattgtt atattctgcg tggtatattt 6961 taaagtttaa gaaggcatgt taaacattat ttcctctatg gtagttaaaa tacagaatta 7021 gatttttaac aggtgtcatt tgactaaacg tttcggtaga atgcttcata cttgagtgat 7081 gctggataag gtattgtatt tcaacaatgg actatgcctt ggtttttcac taatcaaaat 7141 caaaattact ctttaacatg ataaatgaat ttaccagttt agtatgctgt ggtattttaa 7201 taagttttca aagataattg ggaaaacatg agactggtca tattgatgaa tattgtaaca 7261 tgtgaattgt gatccatttc tgatatgtct tgaactactg tgtctagtag gcaaatgtca 7321 ttgttacctc tgtgtgttaa gaaaataaaa atattttcta aaggtcg // LOCUS HSU84138 1764 bp mRNA PRI 22-JAN-1998 DEFINITION Homo sapiens DNA repair protein hh5Rad51 mRNA, complete cds. ACCESSION U84138 NID g2801404 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1764) AUTHORS Albala,J.S., Thelen,M.P., Prange,C., Fan,W., Christensen,M., Thompson,L.H. and Lennon,G.G. TITLE Identification of a novel human RAD51 homolog, RAD51B JOURNAL Genomics 46 (3), 476-479 (1997) MEDLINE 98110585 REFERENCE 2 (bases 1 to 1764) AUTHORS Albala,J.S., Prange,C.K., Fan,W., Christensen,M., Thelen,M. and Lennon,G.G. TITLE Direct Submission JOURNAL Submitted (07-JAN-1997) Biology and Biotechnology Research Program, Lawrence Livermore National Laboratory, 7000 East Avenue, Livermore, CA 94550, USA FEATURES Location/Qualifiers source 1..1764 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="14" /map="14q23-14q24.2" 5'UTR 1..54 CDS 55..1107 /function="DNA repair protein" /codon_start=1 /product="hh5Rad51" /db_xref="PID:g2801405" /translation="MGSKKLKRVGLSQELCDRLSRHQILTCQDFLCLSPLELMKVTGL SYRGVHELLCMVSRACAPKMQTAYGIKAQRSADFSPAFLSTTLSALDEALHGGVACGS LTEITGPPGCGKTQFCIMMSILATLPTNMGGLEGAVVYIDTESAFSAERLVEIAESRF PRYFNTEEKLLLTSSKVHLYRELTCDEVLQRIESLEEEIISKGIKLVILDSVASVVRK EFDAQLQGNLKERNKFLAREASSLKYLAEEFSIPVILTNQITTHLSGALASQADLVSP ADDLSLSEGTSGSSCVIAALGNTWSHSVNTRLILQYLDSERRQILIAKSPLAPFTSFV YTIKEEGLVLQAYGNS" 3'UTR 1108..1764 polyA_signal 1727..1732 BASE COUNT 513 a 361 c 387 g 503 t ORIGIN 1 gggaaactgt gtaaagggtg gggaaacttg aaagttggat gctgcagacc cggcatgggt 61 agcaagaaac taaaacgagt gggtttatca caagagctgt gtgaccgtct gagtagacat 121 cagatcctta cctgtcagga ctttttatgt ctttccccac tggagcttat gaaggtgact 181 ggtctgagtt atcgaggtgt ccatgaactt ctatgtatgg tcagcagggc ctgtgcccca 241 aagatgcaaa cggcttatgg gataaaagca caaaggtctg ctgatttctc accagcattc 301 ttatctacta ccctttctgc tttggacgaa gccctgcatg gtggtgtggc ttgtggatcc 361 ctcacagaga ttacaggtcc accaggttgt ggaaaaactc agttttgtat aatgatgagc 421 attttggcta cattacccac caacatggga ggattagaag gagctgtggt gtacattgac 481 acagagtctg catttagtgc tgaaagactg gttgaaatag cagaatcccg ttttcccaga 541 tattttaaca ctgaagaaaa gttacttttg acaagtagta aagttcatct ttatcgggaa 601 ctcacctgtg atgaagttct acaaaggatt gaatctttgg aagaagaaat tatctcaaaa 661 ggaattaaac ttgtgattct tgactctgtt gcttctgtgg tcagaaagga gtttgatgca 721 caacttcaag gcaatctcaa agaaagaaac aagttcttgg caagagaggc atcctccttg 781 aagtatttgg ctgaggagtt ttcaatccca gttatcttga cgaatcagat tacaacccat 841 ctgagtggag ccctggcttc tcaggcagac ctggtgtctc cagctgatga tttgtccctg 901 tctgaaggca cttctggatc cagctgtgtg atagccgcac taggaaatac ctggagtcac 961 agtgtgaata cccggctgat cctccagtac cttgattcag agagaagaca gattcttatt 1021 gccaagtccc ctctggctcc cttcacctca tttgtctaca ccatcaagga ggaaggcctg 1081 gttcttcaag cctatggaaa ttcctagaga cagataaatg tgcaaacctg ttcatcttgc 1141 caagaaaaat ccgctttttt gccacagaaa caaaatattg ggaaagagtc ttgtggtgaa 1201 acacccatcg ttctttgcta aaacatttgg ttgctactgt gtagactcag cttaagtcat 1261 ggaattctag aggatgtatc tcacaagtag gatcaagaac aagcccaaca gtaatctgca 1321 tcataagctg atttgatacc atggcactga caatgggcac tgatttgata ccatggcact 1381 gacatgggca cacagggaac aggaaatggg aatgagagca agggttgggt tgtgttcgtg 1441 gaacacatag gttttttttt tttaactttc tctttctaaa atatttcatt ttgatggagg 1501 tgaaatttat ataagatgaa attaaccatt ttaaagtaaa caattccgtg gcaactagat 1561 atcatgatgt gcaaccagca tctctgtcta gttccaaata ttttcatcac cccaaaagca 1621 agacccataa ccattatgca agtgttccta tttccccctc ctcccagctc ctggaaaccc 1681 accaatctac tttgttgcta tggctttacc tattctggat atttcatata aatggaatca 1741 tatagtgtca taaaaaaaaa aaaa // LOCUS HSU84401 2364 bp mRNA PRI 02-FEB-1997 DEFINITION Human smoothened mRNA, complete cds. ACCESSION U84401 NID g1813875 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2364) AUTHORS Stone,D., Hynes,M. , Armanini,M., Swanson,T., Gu,Q., Johnson,R., Scott,M., Pennica,D., Goddard,A., Phillips,H., Noll,M., Hooper,J., de Sauvage,F.J. and Rosenthal,A. TITLE The tumour-suppressor gene patched encodes a candidate receptor for Sonic hedgehog JOURNAL Nature 384 (6605), 129-134 (1996) MEDLINE 97064168 REFERENCE 2 (bases 1 to 2364) AUTHORS Stone,D., Hynes,M. , Armanini,M., Swanson,T., Gu,Q., Johnson,R., Scott,M., Pennica,D., Goddard,A., Phillips,H., Noll,M., Hooper,J., de Sauvage,F.J. and Rosenthal,A. TITLE Direct Submission JOURNAL Submitted (08-JAN-1997) Molecular Oncology, Genentech Inc., 460 Pt San Bruno Blvd., South San Francisco, CA 94404, USA FEATURES Location/Qualifiers source 1..2364 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1..2364 /function="candidate receptor for Sonic hedgehog" /codon_start=1 /product="smoothened" /db_xref="PID:g1813876" /translation="MAAARPARGPELPLLGLLLLLLLGDPGRGAASSGNATGPGPRSA GGSARRSAAVTGPPPPLSHCGRAAPCEPLRYNVCLGSVLPYGATSTLLAGDSDSQEEA HGKLVLWSGLRNAPRCWAVIQPLLCAVYMPKCENDRVELPSRTLCQATRGPCAIVERE RGWPDFLRCTPDRFPEGCTNEVQNIKFNSSGQCEVPLVRTDNPKSWYEDVEGCGIQCQ NPLFTEAEHQDMHSYIAAFGAVTGLCTLFTLATFVADWRNSNRYPAVILFYVNACFFV GSIGWLAQFMDGARREIVCRADGTMRLGEPTSNETLSCVIIFVIVYYALMAGVVWFVV LTYAWHTSFKALGTTYQPLSGKTSYFHLLTWSLPFVLTVAILAVAQVDGDSVSGICFV GYKNYRYRAGFVLAPIGLVLIVGGYFLIRGVMTLFSIKSNHPGLLSEKAASKINETML RLGIFGFLAFGFVLITFSCHFYDFFNQAEWERSFRDYVLCQANVTIGLPTKQPIPDCE IKNRPSLLVEKINLFAMFGTGIAMSTWVWTKATLLIWRRTWCRLTGQSDDEPKRIKKS KMIAKAFSKRHELLQNPGQELSFSMHTVSHDGPVAGLAFDLNEPSADVSSAWAQHVTK MVARRGAILPQDISVTPVATPVPPEEQANLWLVEAEISPELQKRLGRKKKRRKRKKEV CPLAPPPELHPPAPAPSTIPRLPQLPRQKCLVAAGAWGAGDSCRQGAWTLVSNPFCPE PSPPQDPFLPSAPAPVAWAHGRRQGLGPIHSRTNLMDTELMDADSDF" BASE COUNT 412 a 764 c 721 g 467 t ORIGIN 1 atggccgctg cccgcccagc gcgggggccg gagctcccgc tcctggggct gctgctgctg 61 ctgctgctgg gggacccggg ccggggggcg gcctcgagcg ggaacgcgac cgggcctggg 121 cctcggagcg cgggcgggag cgcgaggagg agcgcggcgg tgactggccc tccgccgccg 181 ctgagccact gcggccgggc tgccccctgc gagccgctgc gctacaacgt gtgcctgggc 241 tcggtgctgc cctacggggc cacctccaca ctgctggccg gagactcgga ctcccaggag 301 gaagcgcacg gcaagctcgt gctctggtcg ggcctccgga atgccccccg ctgctgggca 361 gtgatccagc ccctgctgtg tgccgtatac atgcccaagt gtgagaatga ccgggtggag 421 ctgcccagcc gtaccctctg ccaggccacc cgaggcccct gtgccatcgt ggagagggag 481 cggggctggc ctgacttcct gcgctgcact cctgaccgct tccctgaagg ctgcacgaat 541 gaggtgcaga acatcaagtt caacagttca ggccagtgcg aagtgccctt ggttcggaca 601 gacaacccca agagctggta cgaggacgtg gagggctgcg gcatccagtg ccagaacccg 661 ctcttcacag aggctgagca ccaggacatg cacagctaca tcgcggcctt cggggccgtc 721 acgggcctct gcacgctctt caccctggcc acattcgtgg ctgactggcg gaactcgaat 781 cgctaccctg ctgttattct cttctacgtc aatgcgtgct tctttgtggg cagcattggc 841 tggctggccc agttcatgga tggtgcccgc cgagagatcg tctgccgtgc agatggcacc 901 atgaggcttg gggagcccac ctccaatgag actctgtcct gcgtcatcat ctttgtcatc 961 gtgtactacg ccctgatggc tggtgtggtt tggtttgtgg tcctcaccta tgcctggcac 1021 acttccttca aagccctggg caccacctac cagcctctct cgggcaagac ctcctacttc 1081 cacctgctca cctggtcact cccctttgtc ctcactgtgg caatccttgc tgtggcgcag 1141 gtggatgggg actctgtgag tggcatttgt tttgtgggct acaagaacta ccgataccgt 1201 gcgggcttcg tgctggcccc aatcggcctg gtgctcatcg tgggaggcta cttcctcatc 1261 cgaggagtca tgactctgtt ctccatcaag agcaaccacc ccgggctgct gagtgagaag 1321 gctgccagca agatcaacga gaccatgctg cgcctgggca tttttggctt cctggccttt 1381 ggctttgtgc tcattacctt cagctgccac ttctacgact tcttcaacca ggctgagtgg 1441 gagcgcagct tccgggacta tgtgctatgt caggccaatg tgaccatcgg gctgcccacc 1501 aagcagccca tccctgactg tgagatcaag aatcgcccga gccttctggt ggagaagatc 1561 aacctgtttg ccatgtttgg aactggcatc gccatgagca cctgggtctg gaccaaggcc 1621 acgctgctca tctggaggcg tacctggtgc aggttgactg ggcagagtga cgatgagcca 1681 aagcggatca agaagagcaa gatgattgcc aaggccttct ctaagcggca cgagctcctg 1741 cagaacccag gccaggagct gtccttcagc atgcacactg tgtcccacga cgggcccgtg 1801 gcgggcttgg cctttgacct caatgagccc tcagctgatg tctcctctgc ctgggcccag 1861 catgtcacca agatggtggc tcggagagga gccatactgc cccaggatat ttctgtcacc 1921 cctgtggcaa ctccagtgcc cccagaggaa caagccaacc tgtggctggt tgaggcagag 1981 atctccccag agctgcagaa gcgcctgggc cggaagaaga agaggaggaa gaggaagaag 2041 gaggtgtgcc cgctggcgcc gccccctgag cttcaccccc ctgcccctgc ccccagtacc 2101 attcctcgac tgcctcagct gccccggcag aaatgcctgg tggctgcagg tgcctgggga 2161 gctggggact cttgccgaca gggagcgtgg accctggtct ccaacccatt ctgcccagag 2221 cccagtcccc ctcaggatcc atttctgccc agtgcaccgg cccccgtggc atgggctcat 2281 ggccgccgac agggcctggg gcctattcac tcccgcacca acctgatgga cacagaactc 2341 atggatgcag actcggactt ctga // LOCUS HSU84487 3310 bp mRNA PRI 15-MAR-1997 DEFINITION Human CX3C chemokine precursor, mRNA, alternatively spliced, complete cds. ACCESSION U84487 NID g1888522 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3310) AUTHORS Bazan,J.F., Bacon,K.B., Hardiman,G., Wang,W., Soo,K., Rossi,D., Greaves,D.R., Zlotnik,A. and Schall,T.J. TITLE A new class of membrane-bound chemokine with a CX3C motif JOURNAL Nature 385 (6617), 640-644 (1997) MEDLINE 97177111 REFERENCE 2 (bases 1 to 3310) AUTHORS Bazan,J.F., Bacon,K.B., Hardiman,G., Wang,W., Rossi,D., Greaves,D.R., Zlotnik,A. and Schall,T.J. TITLE Direct Submission JOURNAL Submitted (07-JAN-1997) Molecular Biology, DNAX Research Institute, 901 California Ave., Palo Alto, CA 94304-1104, USA FEATURES Location/Qualifiers source 1..3310 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 80..1273 /note="membrane-tethered chemokine module" /codon_start=1 /product="CX3C chemokine precursor" /db_xref="PID:g1888523" /translation="MAPISLSWLLRLATFCHLTVLLAGQHHGVTKCNITCSKMTSKIP VALLIHYQQNQASCGKRAIILETRQHRLFCADPKEQWVKDAMQHLDRQAAALTRNGGT FEKQIGEVKPRTTPAAGGMDESVVLEPEATGESSSLEPTPSSQEAQRALGTSPELPTG VTGSSGTRLPPTPKAQDGGPVGTELFRVPPVSTAATWQSSAPHQPGPSLWAEAKTSEA PSTQDPSTQASTASSPAPEENAPSEGQRVWGQGQSPRPENSLEREEMGPVPAHTDAFQ DWGPGSMAHVSVVPVSSEGTPSREPVASGSWTPKAEEPIHATMDPQRLGVLITPVPDA QAATRRQAVGLLAFLGLLFCLGVAMFTYQSLQGCPRKMAGEMAEGLRYIPRSCGSNSY VLVPV" sig_peptide 80..151 mat_peptide 152..1270 /product="CX3C chemokine" misc_feature 152..379 /note="encodes chemokine module" misc_feature 380..1102 /note="encodes glycosylation stalk" misc_feature 1103..1159 /note="encodes transmembrane helix" misc_feature 1160..1270 /note="encodes intracellular domain" 3'UTR 1274..3310 /note="alternatively spliced; short transcript deposited as GenBank Accession Number U91835" BASE COUNT 659 a 1051 c 916 g 682 t 2 others ORIGIN 1 ggcacgaggg cactgagctc tgccgcctgg ctctagccgc ctgcctggcc cccgccggga 61 ctcttgccca ccctcagcca tggctccgat atctctgtcg tggctgctcc gcttggccac 121 cttctgccat ctgactgtcc tgctggctgg acagcaccac ggtgtgacga aatgcaacat 181 cacgtgcagc aagatgacat caaagatacc tgtagctttg ctcatccact atcaacagaa 241 ccaggcatca tgcggcaaac gcgcaatcat cttggagacg agacagcaca ggctgttctg 301 tgccgacccg aaggagcaat gggtcaagga cgcgatgcag catctggacc gccaggctgc 361 tgccctaact cgaaatggcg gcaccttcga gaagcagatc ggcgaggtga agcccaggac 421 cacccctgcc gccgggggaa tggacgagtc tgtggtcctg gagcccgaag ccacaggcga 481 aagcagtagc ctggagccga ctccttcttc ccaggaagca cagagggccc tggggacctc 541 cccagagctg ccgacgggcg tgactggttc ctcagggacc aggctccccc cgacgccaaa 601 ggctcaggat ggagggcctg tgggcacgga gcttttccga gtgcctcccg tctccactgc 661 cgccacgtgg cagagttctg ctccccacca acctgggccc agcctctggg ctgaggcaaa 721 gacctctgag gccccgtcca cccaggaccc ctccacccag gcctccactg cgtcctcccc 781 agccccagag gagaatgctc cgtctgaagg ccagcgtgtg tggggtcagg gacagagccc 841 caggccagag aactctctgg agcgggagga gatgggtccc gtgccagcgc acacggatgc 901 cttccaggac tgggggcctg gcagcatggc ccacgtctct gtggtccctg tctcctcaga 961 agggaccccc agcagggagc cagtggcttc aggcagctgg acccctaagg ctgaggaacc 1021 catccatgcc accatggacc cccagaggct gggcgtcctt atcactcctg tccctgacgc 1081 ccaggctgcc acccggaggc aggcggtggg gctgctggcc ttccttggcc tcctcttctg 1141 cctgggggtg gccatgttca cctaccagag cctccagggc tgccctcgaa agatggcagg 1201 agagatggcg gagggccttc gctacatccc ccggagctgt ggtagtaatt catatgtcct 1261 ggtgcccgtg tgaactcctc tggcctgtgt ctagttgttt gattcagaca gctgcctggg 1321 atccctcatc ctcataccca cccccaccca agggcctggc ctgagctggg atgattggag 1381 gggggaggtg ggatcctcca ggtgcacaag ctccaagctc ccaggcattc cccaggaggc 1441 cagccttgac cattctccac cttccaggga cagagggggt ggcctcccaa ctcaccccag 1501 ccccaaaact ctcctctgct gctggctggt tagaggttcc ctttgacgcc atcccagccc 1561 caatgaacaa ttatttatta aatgcccagc cccttctgac ccatgctgcc ctgtgagtac 1621 tacagtcctc ccatctcaca catgagcatc aggccaggcc ctctgcccac tccctgcaac 1681 ctgattgtgt ctcttggtcc tgctgcagtt gccagtcacc ccggccacct gcggtgctat 1741 ctcccccagc cccatcctct gtacagagcc cacgccccca ctggtgacat gtcttttctt 1801 gcatgaggct agtgtggtgt ttcctgggca ctgcttccag tgaggctctg cccttggtta 1861 ggsattgtgg gaaggggaga taagggtatc tggtgacttt cctctttggt ctacactgtg 1921 ctgagtctga aggctgggtt ctgatcctag ttccaccatc aagccaccaa catactccca 1981 tctgtgaaag gaaagaggga ggtaaggaat acctgtcccc ctgacaacac tcattgacct 2041 gaggcccttc tctccagccc ctggatgcag cctcacagtc cttaccagca gagcacctta 2101 gacagtccct gccaatggac taacttgtct ttggaccctg aggcccagag ggcctgcarg 2161 ggagtgagtt gatagcacag accctgccct gtgggccccc aaatggaaat gggcagagca 2221 gagaccatcc ctgaaggccc cgcccaggct tagtcactga gacagcccgg gctctgcttc 2281 ccatcacccg ctaagaggga gggagggctc cagacacatg tccaagaagc ccaggaaagg 2341 ctccaggagc agccacattc ctgatgcttc ttcagagact cctgcaggca gccaggccac 2401 aagacccttg tggtcccacc ccacacacgc cagattcttt cctgaggctg ggctcccttc 2461 ccacctctct cactccttga aaacactgtt ctctgccctc caagaccttc tccttcacct 2521 ttgtccccac cgcagacagg accaggggat ttccatgatg ttttccatga gtcccctgtt 2581 tgtttctgaa agggacgcta cccgggaagg gggctgggac atgggaaagg ggaagttgta 2641 ggcataaagt caggggttcc cttttttggc tgctgaaggc tcgagcatgc ctggatgggg 2701 ctgcaccggc tggcctggcc cctcagggtc cctggtggca gctcacctct cccttggatt 2761 gtccccgacc cttgccgtct acctgagggg cctcttatgg gctgggttct acccaggtgc 2821 taggaacact ccttcacaga tgggtgcttg gaggaaggaa acccagctct ggtccataga 2881 gagcaaaacg ctgtgctgcc ctgcccaccc tggcctctgc actcccctgc tgggtgtggc 2941 gcagcatatt caggaagctc agggccctgg ctcaggtggg gtcactctgg cagctcagag 3001 agggtgggag tgggtccaat gcactttgtt ctggctcttc caggctggga gagcctttca 3061 ggggtgggac accctgtgat ggggccctgc ctcctttgtg aggaagccgc tggggccagt 3121 tggtccccct tccatggact ttgttagttt ctccaagcag gacatggaca aggatgatct 3181 aggaagactt tggaaagagt aggaagactt tggaaagact tttccaaccc tcatcaccaa 3241 cgtctgtgcc attttgtatt ttactaataa aatttaaaag tcttgtgaaa aaaaaaaaaa 3301 aaaaaaaaaa // LOCUS HSU84573 3503 bp mRNA PRI 31-MAY-1997 DEFINITION Homo sapiens lysyl hydroxylase isoform 2 (PLOD2) mRNA, complete cds. ACCESSION U84573 NID g2138313 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3503) AUTHORS Valtavaara,M., Papponen,H., Pirttila,A.-M., Hiltunen,K., Helander,H. and Myllyla,R. TITLE Cloning and characterization of a novel human lysyl hydroxylase isoform highly expressed in pancreas and muscle JOURNAL J. Biol. Chem. 272 (11), 6831-6834 (1997) MEDLINE 97207229 REFERENCE 2 (bases 1 to 3503) AUTHORS Valtavaara,M., Papponen,H., Pirttila,A.-M. and Myllyla,R. TITLE Direct Submission JOURNAL Submitted (10-JAN-1997) Biochemistry, University of Oulu, Linnanmaa, Oulu, FIN 90570, Finland FEATURES Location/Qualifiers source 1..3503 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..2214 /gene="PLOD2" CDS 1..2214 /gene="PLOD2" /codon_start=1 /product="lysyl hydroxylase isoform 2" /db_xref="PID:g2138314" /translation="MGGCTVKPQLLLLALVLHPWNPCLGADSEKPSSIPTDKLLVITV ATKESDGFHRFMQSAKYFNYTVKVLGQGEEWRGGDGINSIGGGQKVRLMKEVMEHYAD QDDLVVMFTECFDVIFAGGPEEVLKKFQKANHKVVFAADGILWPDKRLADKYPVVHIG KRYLNSGGFIGYAPYVNRIVQQWNLQDNDDDQLFYTKVYIDPLKREAINITLDHKCKI FQTLNGAVDEVVLKFENGKARAKNTFYETLPVAINGNGPTKILLNYFGNYVPNSWTQD NGCTLCEFDTVDLSAVDVHPNVSIGVFIEQPTPFLPRFLDILLTLDYPKEALKLFIHN KEVYHEKDIKVFFDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSVD ADVVLTNPRTLKILIEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQG NRVGVWNVPYMANVYLIKGKTLRSEMNERNYFVRDKLDPDMALCRNAREMGVFMYISN RHEFGRLLSTANYNTSHYNNDLWQIFENPVDWKEKYINRDYSKIFTENIVEQPCPDVF WFPIFSEKACDELVEEMEHYGKWSGGKHHDSRISGGYENVPTDDIHMKQVDLENVWLD FIREFIAPVTLKVFAGYYTKGFALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVG EDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP" BASE COUNT 1167 a 555 c 680 g 1101 t ORIGIN 1 atggggggat gcacggtgaa gcctcagctg ctgctcctgg cgctcgtcct ccacccctgg 61 aatccctgtc tgggtgcgga ctcggagaag ccctcgagca tccccacaga taaattatta 121 gtcataactg tagcaacaaa agaaagtgat ggattccatc gatttatgca gtcagccaaa 181 tatttcaatt atactgtgaa ggtccttggt caaggagaag aatggagagg tggtgatgga 241 attaatagta ttggaggggg ccagaaagtg agattaatga aagaagtcat ggaacactat 301 gctgatcaag atgatctggt tgtcatgttt actgaatgct ttgatgtcat atttgctggt 361 ggtccagaag aagttctaaa aaaattccaa aaggcaaacc acaaagtggt ctttgcagca 421 gatggaattt tgtggccaga taaaagacta gcagacaagt atcctgttgt gcacattggg 481 aaacgctatc tgaattcagg aggatttatt ggctatgctc catatgtcaa ccgtatagtt 541 caacaatgga atctccagga taatgatgat gatcagctct tttacactaa agtttacatt 601 gatccactga aaagggaagc tattaacatc acattggatc acaaatgcaa aattttccag 661 accttaaatg gagctgtaga tgaagttgtt ttaaaatttg aaaatggcaa agccagagct 721 aagaatacat tttatgaaac attaccagtg gcaattaatg gaaatggacc caccaagatt 781 ctcctgaatt attttggaaa ctatgtaccc aattcatgga cacaggataa tggctgcact 841 ctttgtgaat tcgatacagt cgacttgtct gcagtagatg tccatccaaa cgtatcaata 901 ggtgttttta ttgagcaacc aacccctttt ctacctcggt ttctggacat attgttgaca 961 ctggattacc caaaagaagc acttaaactt tttattcata acaaagaagt ttatcatgaa 1021 aaggacatca aggtattttt tgataaagct aagcatgaaa tcaaaactat aaaaatagta 1081 ggaccagaag aaaatctaag tcaagcggaa gccagaaaca tgggaatgga cttttgccgt 1141 caggatgaaa agtgtgatta ttactttagt gtggatgcag atgttgtttt gacaaatcca 1201 aggactttaa aaattttgat tgaacaaaac agaaagatca ttgctcctct tgtaactcgt 1261 catggaaagc tgtggtccaa tttctgggga gcattgagtc ctgatggata ctatgcacga 1321 tctgaagatt atgtggatat tgttcaaggg aatagagtag gagtatggaa tgtcccatat 1381 atggctaatg tgtacttaat taaaggaaag acactccgat cagagatgaa tgaaaggaac 1441 tattttgttc gtgataaact ggatcctgat atggctcttt gccgaaatgc tagagaaatg 1501 ggtgtattta tgtacatttc taatagacat gaatttggaa ggctattatc cactgctaat 1561 tacaatactt cccattataa caatgacctc tggcagattt ttgaaaatcc tgtggactgg 1621 aaggaaaagt atataaaccg tgattattca aagattttca ctgaaaatat agttgaacag 1681 ccctgtccag atgtcttttg gttccccata ttttctgaaa aagcctgtga tgaattggta 1741 gaagaaatgg aacattacgg caaatggtct gggggaaaac atcatgatag ccgtatatct 1801 ggtggttatg aaaatgtccc aactgatgat atccacatga agcaagttga tctggagaat 1861 gtatggcttg attttatccg ggagttcatt gcaccagtta cactgaaggt ctttgcaggc 1921 tattatacga agggatttgc actactgaat tttgtagtaa aatactcccc tgaacgacag 1981 cgttctcttc gtcctcatca tgatgcttct acatttacca taaacattgc acttaataac 2041 gtgggagaag actttcaggg aggtggttgc aaatttctaa ggtacaattg ctctattgag 2101 tcaccacgaa aaggctggag cttcatgcat cctgggagac tcacacattt gcatgaagga 2161 cttcctgtta aaaatggaac aagatacatt gcagtgtcat ttatagatcc ctaagttatt 2221 tacttttcat tgaattgaaa tttattttgg gtgaatgact ggcatgaaca cgtctttgaa 2281 gttgtggctg agaagatgag aggaatattt aaataacatc aacagaacaa cttcactttg 2341 ggccaaacat ttgaaaaact ttttataaaa aattgtttga tatttcttaa tgtctgctct 2401 gagccttaaa acacagattg aagaagaaaa gaaagaaaaa acttaaatat ttatttctat 2461 gctttgttgc ctctgagaat aatgacaatt tatgaatttg tgtttcaaat tgataaaata 2521 tttaggtaca aataacaaga ctaataatat tttcttattt aaaaaaagca tgggaagatt 2581 tttatttatc aaaatataga ggaaatgtag acaaaatgga tataaatgaa aattaccatg 2641 ttgtaaaacc ttgaaaatca gattctaact gattgtatgc aactaagtat ttctgaacac 2701 ctatgcaggt cttatttaca gtgttactaa gggaacacac aaagaattac acaacgtttt 2761 cctcaagaaa atggtacaaa acacaaccga ggagcgtata cagttgaaaa catttttgtt 2821 ttgattggaa ggcagattat tttatattag tattaaaaat caaaccctat gtttctttca 2881 gatgaatctt ccaaagtgga ttatattaag caggtattag atttagaaaa cctttccatt 2941 tcttaaagta ttatcaagtg tcaagatcag caagtgtcct taagtcaaat aggttttttt 3001 ttgttggtgg ttgtgcttgc tttccttttt tagaaagttc tagaaaatag gaaaacgaaa 3061 aatttcattg agatgagtag tgcatttaat tattttttaa aaaacttttt aagtacttga 3121 attttatatc aggaaaacaa agttgttgag ccttgcttct tccgttttgc cctttgtctc 3181 gctccttatt cttttttggg gggagggtta tttgcttttt tatcttcctg gcataatttc 3241 cattttattc ttctgagtgt ctatgttaac ttccctctat cccgcttata aaaaaattct 3301 ccaacaaaaa tacttgttga cttgatgttt tatcacttct ctaagtaagg ttgaaatatc 3361 cttattgtag ctactgtttt taatgtaaag gttaaacttg aaaagaaatt cttaatcacg 3421 gtgccaaaat tcattttcta acaccatgtg ttagaaaatt ataaaaaata aaataatttt 3481 aaaaaaaaaa aaaaaaaaaa aaa // LOCUS HSU84720 1669 bp mRNA PRI 22-MAR-1997 DEFINITION Human mRNA export protein Rae1 (RAE1) mRNA, complete cds. ACCESSION U84720 NID g1903455 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1669) AUTHORS Bharathi,A., Ghosh,A., Whalen,W.A., Yoon,J.H., Pu,R., Dasso,M. and Dhar,R. TITLE The human RAE1 gene is a functional homolog of Schizosaccharomyces pombe rae1 gene involved in nuclear export of poly(A)+ RNA JOURNAL Unpublished REFERENCE 2 (bases 1 to 1669) AUTHORS Bharathi,A., Ghosh,A., Whalen,W.A., Yoon,J.H., Pu,R., Dasso,M. and Dhar,R. TITLE Direct Submission JOURNAL Submitted (10-JAN-1997) Laboratory of Molecular Virology, National Cancer Institute, Bldg. 41, Rm. B512, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..1669 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="breast carcinoma line MDAMB435" gene 188..1294 /gene="RAE1" CDS 188..1294 /gene="RAE1" /function="nuclear export of poly(A)+ RNA" /note="mRNA export protein; homolog of Schizosaccharomyces pombe Rae1p" /codon_start=1 /product="Rae1" /db_xref="PID:g1903456" /translation="MSLFGTTSGFGTSGTSMFGSATTDNHNPMKDIEVTSSPDDSIGC LSFSPPTLPGNFLIAGSWANDVRCWEVQDSGQTIPKAQQMHTGPVLDVCWSDDGSKVF TASCDKTAKMWDLSSNQAIQIAQHDAPVKTIHWIKAPNYSCVMTGSWDKTLKFWDTRS SNPMMVLQLPERCYCADVIYPMAVVATAERGLIVYQLENQPSEFRRIESPLKHQHRCV AIFKDKQNKPTGFALGSIEGRVAIHYINPPNPAKDNFTFKCHRSNGTNTSAPQDIYAV NGIAFHPVHGTLATVGSDGRFSFWDKDARTKLKTSEQLDQPISACCFNHNGNIFAYAS SYDWSKGHEFYNPQKKNYIFLRNAAEELKPRNKK" BASE COUNT 436 a 398 c 414 g 421 t ORIGIN 1 ggcacgagcg gcacgagcgg cggtagtcag ggcagtttct acgcaggctt aaggaggctt 61 cgggctcctg ggatttctgt ccgcgctcct ggcccacgtc cttcgcgcca gagcaggttc 121 gcaaactcct cagacccttc tgctcccggc cgccgctttc cgccggggcg agacccccag 181 gttcaaaatg agcctgtttg gaacaacctc aggttttgga accagtggga ccagcatgtt 241 tggcagtgca actacagaca atcacaatcc catgaaggat attgaagtaa catcatctcc 301 tgatgatagc attggttgtc tgtcttttag cccaccaacc ttgccgggga actttcttat 361 tgcaggatca tgggctaatg atgttcgctg ctgggaagtt caagacagtg gacagaccat 421 tccaaaagcc cagcagatgc acactgggcc tgtgcttgat gtctgctgga gtgacgatgg 481 gagcaaagtg tttacggcat cgtgtgataa aactgccaaa atgtgggacc tcagcagtaa 541 ccaagcgata cagatcgcac agcatgatgc tcctgttaaa accatccatt ggatcaaagc 601 tccaaactac agctgtgtga tgactgggag ctgggataag actttaaagt tttgggatac 661 tcgatcgtca aatcctatga tggttttgca actccctgaa aggtgttact gtgctgacgt 721 gatatacccc atggctgtgg tggcaactgc agagaggggc ctgattgtct atcagctaga 781 gaatcaacct tctgaattca ggaggataga atctccactg aaacatcagc atcggtgtgt 841 ggctattttt aaagacaaac agaacaagcc gactggtttt gccctgggaa gtatcgaggg 901 gagagttgct attcactata tcaacccccc gaaccccgcc aaagataact tcacctttaa 961 atgtcatcga tctaatggaa ccaacacttc agctcctcag gacatttatg cggtaaatgg 1021 aatcgcgttc catcctgttc atggcaccct tgcaactgtg ggatctgatg gtagattcag 1081 cttctgggac aaagatgcca gaacaaaact aaaaacttcg gaacagttag atcagcccat 1141 ctcagcttgc tgtttcaatc acaatggaaa catatttgca tacgcttcca gctacgactg 1201 gtcaaaggga catgaatttt ataatcccca gaaaaaaaat tacattttcc tgcgtaatgc 1261 ggccgaagag ctaaagccca ggaataagaa gtagtggctg gagactctgg ctcagccaga 1321 gttgtttctc tccactctgc ctcatctctg tacgaatttg ggtcccagcc ttgttgggtt 1381 gtcagccatg gacatggatt tcaacccctg gagaaaacga tgtcattgtt cagcagctga 1441 gagccccagg cgtccgcggc gacttgccgt ctctccattc cactgcctgt tgcagagttt 1501 ttctgtaact aagggggttg aggttattgt agacgttaga ttgcgggcac cgccagggat 1561 tttgcagcgc ttcagtgtac gtgttagaga atattggaaa agcgtctgtg agccccgtgc 1621 tgtattttgt aataaagtct tttgcagatt gaataaaaaa aaaaaaaaa // LOCUS HSU84971 1144 bp mRNA PRI 11-JUN-1997 DEFINITION Homo sapiens fetal unknown mRNA, complete cds. ACCESSION U84971 NID g2183022 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1144) AUTHORS Dickson,M.C., Heather,L.J., Lyle,L.,., Clark,L.N.C., van Deutekom,J.C.T., Wright,T.J., Flint,J., Frants,R.R. and Hewitt,J.E. TITLE Sequence of a 40kb region flanking the FSHD-associated repeat sequence on distal chromosome 4q35 JOURNAL Unpublished REFERENCE 2 (bases 1 to 1144) AUTHORS Lyle,R. and Hewitt,J.E. TITLE Direct Submission JOURNAL Submitted (12-JAN-1997) School of Biological Sciences, The University of Manchester, 3.239 Stopford Building, Oxford Rd., Manchester M13 9PT, UK FEATURES Location/Qualifiers source 1..1144 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="6 weeks" /tissue_type="whole fetus" /clone="EP1" /chromosome="5" CDS 312..842 /codon_start=1 /product="unknown" /db_xref="PID:g2183023" /translation="MASEAPSPPRSPPPPTSPEPELAQLRRKVEKLERELRSCKRQVR EIEKLLHHTERLYQNAESNNQELRTQVEELSKILQRGRNEDNKKSDVEVQTENHAPWS ISDYFYQTYYNDVSLPNKVTELSDQQDQAIETSILNSKDHLQVENDAYPGTDRTENVK YRQVDHFASNSQVIKC" BASE COUNT 316 a 282 c 270 g 276 t ORIGIN 1 ttttccgact gcttatccga cgctcctccc tctgtctctg tagctggaga aggtagtttc 61 caggaaagtt ttccggtttg caggccgcgc acatcgggca ggggccatcc tcggtcccct 121 tgctcgttgc tcgcagcccc gttcggctac aagtgagttt cagggcgtca tggccagggg 181 ccaccgcggc cagccgggtg tgaggctgcc tttcgctgcc cgcgcgctcc agtggtctct 241 gggtccgccg gcgtccgttt cggcctgaac gcagcccctc cgcggcgacg agcagtctcg 301 cgccggagct catggcctcg gaggcgccgt ccccgccgcg gtcgccgccg ccgcccacct 361 cccccgagcc tgagctggcc cagctaaggc ggaaggtgga gaagttggaa cgtgaactgc 421 ggagctgcaa gcggcaggtg cgggagatcg agaagctgct gcatcacaca gaacggctgt 481 accagaacgc agaaagcaac aaccaggagc tccgcacgca ggtggaagaa ctcagtaaaa 541 tactccaacg tgggagaaat gaagataata aaaagtctga tgtagaagta caaacagaga 601 accatgctcc ttggtcaatc tcagattatt tttatcagac gtactacaat gacgttagtc 661 ttccaaataa agtgactgaa ctgtcagatc aacaagatca agctatcgaa acttctattt 721 tgaattctaa agaccattta caagtagaaa atgatgctta ccctggtacc gatagaacag 781 aaaatgttaa atatagacaa gtggaccatt ttgcctcaaa ttcacaggta ataaaatgct 841 aaacatgaaa ctgttgatgc ccaagaacct gtccttcttt gttgttatta tgtggaaaga 901 tagacttctt tggtccgtca cagcatatat aagggttata aacattctta tgtgtaatta 961 tgcaaagaat gttgtagttt ttccaaaacc agaattagaa cttcttcata tatgtagtgg 1021 ttttcctact acattgtgtt ccaaaatttt gtgatctaca taaactaaaa caaaatttgt 1081 ccagatattt tgactgaaca aaagaactgt ggacaacaat accacattta cacctaaaaa 1141 aaaa // LOCUS HSU85245 3743 bp mRNA PRI 05-MAR-1997 DEFINITION Human phosphatidylinositol-4-phosphate 5-kinase type II beta mRNA, complete cds. ACCESSION U85245 NID g1857636 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3743) AUTHORS Castellino,A.M., Parker,G.J., Boronenkov,I.V., Anderson,R.A. and Chao,M.V. TITLE A Novel Interaction between the Juxtamembrane Region of the p55 Tumor Necrosis Factor Receptor and Phosphatidylinositol-4-phosphate 5-Kinase JOURNAL J. Biol. Chem. 272 (1997) In press REFERENCE 2 (bases 1 to 3743) AUTHORS Castellino,A.M., Parker,G.J., Boronenkov,I.V., Anderson,R.A. and Chao,M.V. TITLE Direct Submission JOURNAL Submitted (13-JAN-1997) Pharmacology, University of Wisconsin-Madison, 1300 University Avenue, Madison, WI 53706, USA FEATURES Location/Qualifiers source 1..3743 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="brain" /clone_lib="Clontech #HL1149x" CDS 481..1731 /EC_number="2.7.1.68" /function="lipid kinase; generates phosphatidylinositol 4,5-bisphosphate from phosphatidylinositol 4-phosphate" /note="PIP5KIIB" /codon_start=1 /product="phosphatidylinositol-4-phosphate 5-kinase type II beta" /db_xref="PID:g1857637" /translation="MSSNCTSTTAVAVAPLSASKTKTKKKHFVCQKVKLFRASEPILS VLMWGVNHTINELSNVPVPVMLMPDDFKAYSKIKVDNHLFNKENLPSRFKFKEYCPMV FRNLRERFGIDDQDYQNSVTRSAPINSDSQGRCGTRFLTTYDRRFVIKTVSSEDVAEM HNILKKYHQFIVECHGNTLLPQFLGMYRLTVDGVETYMVVTRNVFSHRLTVHRKYDLK GSTVAREASDKEKAKDLPTFKDNDFLNEGQKLHVGEESKKNFLEKLKRDVEFLAQLKI MDYSLLVGIHDVDRAEQEEMEVEERAEDEECENDGVGGNLLCSYGTPPDSPGNLLSFP RFFGPGEFDPSVDVYAMKSHESSPKKEVYFMAIIDILTPYDTKKKAAHAAKTVKHGAG AEISTVNPEQYSKRFNEFMSNILT" BASE COUNT 839 a 964 c 1095 g 845 t ORIGIN 1 ttgcgggaaa gagccaaacc ctggcgttgg ggggcccggg cggggagccc ctcccgcggt 61 ccacagcgac gcctgcccag ccctcctccc cttccggctc cggcacgggg ccccgaggcg 121 ttcggaggcc aggcgggttt ctgtcaggcc cggggaggag gggcgggcgg ggcggccgct 181 gcctccccgg gacgggccgt accacgcgga cggggaggac ggggccaggg gactgcaggg 241 cggctgcacc gcccgggggc ggggtgcgga cgggccggcg ggctccccgg ggcggggcgg 301 gagggcgggg cgtggggcgg acggaaccac cggggcgggg tgggaggtaa cgggacgggc 361 gcgaccatgg cgcggtgagg gagcgggggt ggggatcggt ccgggggagg cctgaggccg 421 ctggcttgtg cgctgtctcc gccgcccccc tctttcgccg ccgccgccgc cgccccgggc 481 atgtcgtcca actgcaccag caccacggcg gtggcggtgg cgccgctcag cgccagcaag 541 accaagacca agaagaagca tttcgtgtgc cagaaagtga agctattccg ggccagcgag 601 ccgatcctca gcgtcctgat gtggggggtg aaccacacga tcaatgagct gagcaatgtt 661 cctgttcctg tcatgctaat gccagatgac ttcaaagcct acagcaagat caaggtggac 721 aatcatctct tcaataagga gaacctgccc agccgcttta agtttaagga gtattgcccc 781 atggtgttcc gaaaccttcg ggagaggttt ggaattgatg atcaggatta ccagaattca 841 gtgacgcgca gcgcccccat caacagtgac agccagggtc ggtgtggcac gcgtttcctc 901 accacctacg accggcgctt tgtcatcaag actgtgtcca gcgaggacgt ggcggagatg 961 cacaacatct taaagaaata ccaccagttt atagtggagt gtcatggcaa cacgcttttg 1021 ccacagttcc tgggcatgta ccgcctgacc gtggatggtg tggaaaccta catggtggtt 1081 accaggaacg tgttcagcca tcggctcact gtgcatcgca agtatgacct caagggttct 1141 acggttgcca gagaagcgag cgacaaggag aaggccaagg acttgccaac attcaaagac 1201 aatgacttcc tcaatgaagg gcagaagctg catgtgggag aggagagtaa aaagaacttc 1261 ctggagaaac tgaagcggga cgttgagttc ttggcacagc tgaagatcat ggactacagc 1321 ctgctggtgg gcatccacga cgtggaccgg gcagagcagg aggagatgga ggtggaggag 1381 cgggcagagg acgaggagtg tgagaatgat ggggtgggtg gcaacctact ctgctcctat 1441 ggcacacctc cggacagccc tggcaacctc ctcagctttc ctcggttctt tggtcctggg 1501 gaattcgacc cctctgttga cgtctatgcc atgaaaagcc atgaaagttc ccccaagaag 1561 gaggtgtatt tcatggccat cattgatatc ctcacgccat acgatacaaa gaagaaagct 1621 gcacatgctg ccaaaacggt gaaacacggg gcaggggccg agatctcgac tgtgaaccct 1681 gagcagtact ccaaacgctt caacgagttt atgtccaaca tcctgacgta gttctcttct 1741 accttcagcc gagaccgaga gactggatat ggggtcgggg atcgggactt agggagaagg 1801 gtgtatttgg gctagatggg agggtgggag cgagatcggg tttgggaggg ctttagcaat 1861 gagacttgca gcctgtgaca ccgaaagaga ctttagctga agaggagggg gatgtgctgt 1921 gtgtgcacca gctcacagga tgtaacccca ccttctgctt acccttgatt ttttctcccc 1981 atttgacacc caggttaaaa aggggttccc tttttggtac cttgtaacct tttaagatac 2041 cttggggcta gagatgactt cgtgggttta tttgggtttt gtttctgaaa tttcattgct 2101 ccaggtttgc tatttataat catatttcat cagcctaccc accctcccca tctttgctgc 2161 tctcagttcc cttcaattaa agagataccc agtagaccca gcacaagggt ccttccagaa 2221 ccaagtgcta tggatgccag attggagagg tcagacacct cgccctgctg catttgctct 2281 tgtctggatt aactttgtaa tttatggagt attgtgcaca acttcctcca cctttccctt 2341 ggattcaagt gaaaactgtt gcattattcc tccatcctgt ctggaataca ccaggtcaac 2401 accagagatc tcagatcaga atcagagatc tcagagggga ataagttcat cctcatggga 2461 tggtgagggg caggaaagcg gctgggctct tggacaccct ggttctcaga gaaccctgtg 2521 atgatcaccc aagccccagg ctgtcttagc ccctggagtt cagaagtcct ctctgtaaat 2581 cctgcctccc actaggtcaa gaggaactag agtacctttg gatttatcag gaccctcatg 2641 tttaaatggt tatttccctt tgggaaaact tcagaaactg atgtatcaaa tgaggccctg 2701 tgccctcgat ctatttcctt cttccttctg acctcctccc aggcactctt acttctagcc 2761 gaactcttag ctctgggcag atctccaagc gcctggagtg ctttttagca gagacacctc 2821 gttaagctcc gggatgacct tgtaggagat ctgtctcccc tgtgcctgga gagttacagc 2881 cagaaaggtg cccccatctt agagtgtggt gtccaaacgt gaggtggctt cctagttaca 2941 tgaggatgtg atccaggaaa tccagtttgg aggcttgatg tgggttttga cctggcctca 3001 ccttggggct gtttttcctt gttgccccgc tctagacttt tagcagatct gcacccacag 3061 gcttttttgg aaggagtggc ttcctcgagg tgttccacct gcttcggagc ctgccaccca 3121 ggccctcaga actgaccaca ggctgctctg gccaggagag aaacagctct gttgttctgc 3181 attgggggag gtacattcct gcatcttctc accccctcaa ccaggaactg gggatttggg 3241 atgagatatg gtcagacttg tagataaccc caaagatgtg aagatcgctt gtgaaaccat 3301 tttgaatgaa tagattggtt tcctgtggct ccctccaaac ctggccaagc ccagcttccg 3361 aagcaggaac cagcactgtc tctgtgcctg actcacagca tataggtcag gaaagaatgg 3421 agacggcatt cttggacttc actggggctg ctggattgga tgggaaacct tctggaagag 3481 gcagatgggg gtcaaaccac tgccttgccc caggaagggg ccataggtag gtctgaacaa 3541 ctgccggaag accactacat gacttaggga acttgaaacc aactggctca tggagaaaac 3601 aaatttgact tgggaaaggg attatgtagg aataatgttt ggacttgatt tccccacgtc 3661 ataatgaaga atggaagttt ggatctgctc ctcgtcaggc gcagcatctc tgaagcttgg 3721 aaagctgtct tccagggttg taa // LOCUS HSU85625 1186 bp mRNA PRI 21-JUN-1997 DEFINITION Homo sapiens ribonuclease 6 precursor, mRNA, complete cds. ACCESSION U85625 NID g2209028 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1186) AUTHORS Trubia,M., Sessa,L. and Taramelli,R. TITLE Mammalian Rh/T2/S-glycoprotein ribonuclease family genes: cloning of a human member located in a region of chromosome 6 (6q27) frequently deleted in human malignancies JOURNAL Genomics 42 (2), 342-344 (1997) MEDLINE 97336062 REFERENCE 2 (bases 1 to 1186) AUTHORS Trubia,M. and Taramelli,R. TITLE Direct Submission JOURNAL Submitted (16-JAN-1997) Genetica e Biologia dei Microrganismi, University of Milan, Via Celoria 26, Milan 20133, Italy FEATURES Location/Qualifiers source 1..1186 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6q27; between D6S193 and D6S297" sig_peptide 358..417 CDS 358..927 /codon_start=1 /product="ribonuclease 6 precursor" /db_xref="PID:g2209029" /translation="MRPAALRGRLLGCLCLALLCLGGADKRLRDNHEWKKLIMVQHWP ETVCEKIQNDCRDPPDYWTIHGLWPDKSEGCNRSWPFNLEEIKDLLPEMRAYWPDVIH SFPNRSRFWKHEWEKHGTCAAQVDALNSQKKYFGRSLELYRELDLNSVLLKLGIKPSI NYYQVADFKDALARVYGVIPKSSASTKPG" mat_peptide 418..924 /product="ribonuclease 6" BASE COUNT 275 a 314 c 374 g 223 t ORIGIN 1 cggcgactga ccgtggtcgt gggcggacgg cggctgcagc gcggaggagc tggggtcgct 61 gtgggtcgcg acggagcccg ggacgtgcgc gcttggtgca cgatcctgaa ggggagctcc 121 gaggggcccg ggtctccagg gctgctgcgg ccattcccgg agcccggcgc ggggcccgca 181 gatactggtt taggccgtcc cagggctccg ggcgcacccg tggccgctgc tgcagcgagg 241 gagcgcggcg cgggggggct cggagacagt gttttctccc ggaagtcttc ctcgggcagc 301 aggtgggaag tgggagcgga gcggcagctg gcagcgttct ctccgcaggt cggcaccatg 361 cgccctgcag ccctgcgcgg gcgcctgctg ggctgcctct gcctggcgtt gctttgcctg 421 ggcggtgcgg acaagcgcct gcgtgacaac catgagtgga aaaaactaat tatggttcag 481 cactggcctg agacagtatg cgagaaaatt caaaacgact gtagagaccc tccggattac 541 tggacaatac atggactatg gcccgataaa agtgaaggat gtaatagatc gtggcccttc 601 aatttagaag agattaagga tcttttgcca gaaatgaggg catactggcc tgacgtaatt 661 cactcgtttc ccaatcgcag ccgcttctgg aagcatgagt gggaaaagca tgggacctgc 721 gccgcccagg tggatgcgct caactcccag aagaagtact ttggcagaag cctggaactc 781 tacagggagc tggacctcaa cagtgtgctt ctaaaattgg ggataaaacc atccatcaat 841 tactaccaag ttgcagattt taaagatgcc cttgccagag tatatggagt gatacccaaa 901 tccagtgcct ccaccaagcc aggatgagaa gtacagacaa ttggtcagat agaactgtgc 961 ctcactaagc aagaccagca gctgcaaaac tgcaccgagc cgggggagca gccgtccccc 1021 aagcaggaag tctggctggc aaatggggcc gccgagagcc ggggtctgag agtctgtgaa 1081 gatggcccag tcttctatcc cccacctaaa aagaccaagc attgatgccc aagttttgga 1141 aatattctgt tttaaaaagc aagagaaatt cacaaactgc agctcg // LOCUS HSU85658 2804 bp mRNA PRI 15-MAY-1997 DEFINITION Human transcription factor ERF-1 mRNA, complete cds. ACCESSION U85658 NID g2058552 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2804) AUTHORS McPherson,L.A., Baichwal,V.R. and Weigel,R.J. TITLE Identification of ERF-1 as a member of the AP2 transcription factor family JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (9), 4342-4347 (1997) MEDLINE 97272225 REFERENCE 2 (bases 1 to 2804) AUTHORS McPherson,L.A., Baichwal,V.R. and Weigel,R.J. TITLE Direct Submission JOURNAL Submitted (16-JAN-1997) Surgery, Stanford University, 1201 Welch Road, Stanford, CA 94305-5486, USA FEATURES Location/Qualifiers source 1..2804 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="MCF7 human breast carcinoma cell line" /chromosome="20" /map="20q13.2" CDS 167..1519 /function="transcription factor" /note="AP-2 gamma homolog" /codon_start=1 /product="ERF-1" /db_xref="PID:g2058553" /translation="MLWKITDNVKYEEDCEDRHDGSSNGNPRVPHLSSAGQHLYSPAP PLSHTGVAEYQPPPYFPPPYQQLAYSQSADPYSHLGEAYAAAINPLHQPAPTGSQQQA WPGRQSQEGAGLPSHHGRPAGLLPHLSGLEAGAVSARRDAYRRSDLLLPHAHALDAAG LAENLGLHDMPHQMDEVQNVDDQHLLLHDQTVIRKGPISMTKNPLNLPCQKELVGAVM NPTEVFCSVPGRLSLLSSTSKYKVTVAEVQRRLSPPECLNASLLGGVLRRAKSKNGGR SLREKLDKIGLNLPAGRRKAAHVTLLTSLVEGEAVHLARDFAYVCEAEFPSKPVAEYL TRPHLGGRNEMAARKNMLLAAQQLCKEFTELLSQDRTPHGTSRLAPVLETNIQNCLSH FSLITHGFGSQAICAAVSALQNYIKEALIVIDKSYMNPGDQSPADSNKTLEKMEKHRK " BASE COUNT 701 a 702 c 692 g 709 t ORIGIN 1 tcgcagagcc gccgatgcgt gtccagtgac ccggacagca aggcccgcgc gcggcggggg 61 cggcggcaga cgcctggtca ccgtgacccc gattttggat ttaccgcttg ggggctgggg 121 ggatcctgga tttaactggc gactgttttg ggggacgccg gacgccatgt tgtggaaaat 181 aaccgataat gtcaagtacg aagaggactg cgaggatcgc cacgacggga gcagcaatgg 241 gaatccgcgg gtcccccacc tctcctccgc cgggcagcac ctctacagcc ccgcgccacc 301 cctctcccac actggagtcg ccgaatatca gccgccaccc tactttcccc ctccctacca 361 gcagctggcc tactcccagt cggccgaccc ctactcgcat ctgggggaag cgtacgccgc 421 cgccatcaac cccctgcacc agccggcgcc cacaggcagc cagcagcagg cctggcccgg 481 ccgccagagc caggagggag cggggctgcc ctcgcaccac gggcgcccgg ccggcctact 541 gccccacctc tccgggctgg aggcgggcgc ggtgagcgcc cgcagggatg cctaccgccg 601 ctccgacctg ctgctgcccc acgcacacgc cctggatgcc gcgggcctgg ccgagaacct 661 ggggctccac gacatgcctc accagatgga cgaggtgcag aatgtcgacg accagcacct 721 gttgctgcac gatcagacag tcattcgcaa aggtcccatt tccatgacca agaaccctct 781 gaacctcccc tgtcagaagg agctggtggg ggccgtaatg aaccccactg aggtcttctg 841 ctcagtccct ggaagattgt cgctcctcag ctctacgtct aaatacaaag tgacagtggc 901 tgaagtacag aggcgactgt ccccacctga atgcttaaat gcctcgttac tgggaggtgt 961 tctcagaaga gccaaatcga aaaatggagg ccggtccttg cgggagaagt tggacaagat 1021 cgggttgaat cttccggccg ggaggcggaa agccgctcat gtgactctcc tgacatcctt 1081 agtagaaggt gaagctgttc atttggctag ggactttgcc tatgtctgtg aagccgaatt 1141 tcctagtaaa ccagtggcag aatatttaac cagacctcat cttggaggac gaaatgagat 1201 ggcagctagg aagaacatgc tattggcggc ccagcaactg tgtaaagaat tcacagaact 1261 tctcagccaa gaccggacac cccatgggac cagcaggctc gccccagtct tggagacgaa 1321 catacagaac tgcttgtctc atttcagcct gattacccac gggtttggca gccaggccat 1381 ctgtgccgcg gtgtctgccc tgcagaacta catcaaagaa gccctgattg tcatagacaa 1441 atcctacatg aaccctggag accagagtcc agctgattct aacaaaaccc tggagaaaat 1501 ggagaaacac aggaaataaa attggaacga agaaaggtta ggagagtagg gaaggaacag 1561 gactgcaaaa atccttctcc accgcacaga ctgggaaccc ctcctggcct gggggaagag 1621 tttgttacct accttactat ttaaagagcc ttcactggtt ctgcatcacc cgcccctgga 1681 cttcttagtt gtttctctag cgctgagcta tctcctaact ttggacctat tatcagaagg 1741 tgacaagtac tggctcttta ttcattaagc tttttttttt tgaaccccat tctttccttc 1801 tctgaaagtg gtgctataag ttttagaatc ttttaaatac attccctggg ccaacagacc 1861 cacacactta gccattgaaa tgtcaaattg atgtgcccta gatcaacaga tcaacaatac 1921 cttttttttc agtgttaagg taatggttgg tttttgtgtc cgctaaatat ttaccttgaa 1981 aaaaagaaaa gtgtgtatct agcttcttca gagatcaagt cctctggtag gaggcaaagg 2041 ttctatctgc ttagcaacta gttaataagt ggtatctgac acactctaaa ccccgtgttc 2101 aaacgggggc cttctggttt taggaaactt gtagaaacga agcctgctga ttgatttttt 2161 tctccttttt tttttttttt ttttttaact ttgaaagtta actcttcaaa tgggagactc 2221 tttgaaatga catgttccct taaggtactg aagctttatt tgcatattta tttcagatgt 2281 ttcgagtaaa cttgaaaagg gtaggcacga agcaatttgt tgctgcttgt cacccccaag 2341 tccccgtgga ggttctgtat tttaagaaac agtgcgttga gtgtacagat tttatttatg 2401 cgtaatttaa tggggtctgt aaatactggt gcacttctta cgactttttt gagacatggg 2461 atccaatttt aatattaact tttaatggtg atggggtaat ctataacaca tcataaggtt 2521 ttattcatat atatacaggg tattaagaat taagaggatg ctgggctctg ttcttggctt 2581 ggaagattct atttaattga aactctctgt tcagaaagca ataactttgt ctcgttcctg 2641 ttgggctgaa ccctaaggtg agtgtgcagt acagtgtgtg tgggtgaaat ggagatttgg 2701 aattgaactc tctgcctgta aatgttcccc aaataattgt tgtgtgtatg atacgtgtat 2761 aataaaagta ttcttgttag aattgaaaaa aaaaaaaaaa aaaa // LOCUS HSU85707 2511 bp mRNA PRI 24-JUL-1997 DEFINITION Human leukemogenic homolog protein (MEIS1) mRNA, complete cds. ACCESSION U85707 NID g2058550 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2511) AUTHORS Smith,J.E. Jr., Bollekens,J.A., Inghirami,G. and Takeshita,K. TITLE Cloning and mapping of the MEIS1 gene, the human homolog of a murine leukemogenic gene JOURNAL Genomics 43 (1), 99-103 (1997) MEDLINE 97369938 REFERENCE 2 (bases 1 to 2511) AUTHORS Smith,J.E., Bollekens,J.A. and Takeshita,K. TITLE Direct Submission JOURNAL Submitted (15-JAN-1997) Medicine-Hematology, New York University Medical Center, 550 First Avenue, New York, NY 10016, USA FEATURES Location/Qualifiers source 1..2511 /organism="Homo sapiens" /note="IMAGE Consortium clones 223490 and 223513" /db_xref="taxon:9606" /chromosome="2" /map="2p13-14" gene 1..2511 /gene="MEIS1" CDS 66..1238 /gene="MEIS1" /note="TALE homeobox protein" /codon_start=1 /product="leukemogenic homolog protein" /db_xref="PID:g2058551" /translation="MAQRYDDLPHYGGMDGVGIPSTMYGDPHAARSMQPVHHLNHGPP LHSHQYPHTAHTNAMAPSMGSSVNDALKRDKDAIYGHPLFPLLALIFEKCELATCTPR EPGVAGGDVCSSESFNEDIAVFAKQIRAEKPLFSSNPELDNLMIQAIQVLRFHLLELE KVHELCDNFCHRYISCLKGKMPIDLVIDDREGGSKSDSEDITRSANLTDQPSWNRDHD DTASTRSGGTPGPSSGGHTSHSGDNSSEQGDGLDNSVASPSTGDDDDPDKDKKRHKKR GIFPKVATNIMRAWLFQHLTHPYPSEEQKKQLAQDTGLTILQVNNWFINARRRIVQPM IDQSNRAVSQGTPYNPDGQPMGGFVMDGQQHMGIRAPGPMSGMGMNMGMEGQWHYM" misc_feature 879..1067 /gene="MEIS1" /note="encodes TALE homeobox" BASE COUNT 732 a 580 c 564 g 635 t ORIGIN 1 cttttcacac tggccttaaa gaggatatat tagaagttga agtaggaagg gagccagaga 61 ggccgatggc gcaaaggtac gacgatctac cccattacgg gggcatggat ggagtaggca 121 tcccctccac gatgtatggg gacccgcatg cagccaggtc catgcagccg gtccaccacc 181 tgaaccacgg gcctcctctg cactcgcatc agtacccgca cacagctcat accaacgcca 241 tggcccccag catgggctcc tctgtcaatg acgctttaaa gagagataaa gatgccattt 301 atggacaccc cctcttccct ctcttagcac tgatttttga gaaatgtgaa ttagctactt 361 gtaccccccg cgagccgggg gtggcgggcg gggacgtctg ctcgtcagag tcattcaatg 421 aagatatagc cgtgttcgcc aaacagattc gcgcagaaaa acctctattt tcttctaatc 481 cagaactgga taacttgatg attcaagcca tacaagtatt aaggtttcat ctattggaat 541 tagagaaggt acacgaatta tgtgacaatt tctgccaccg gtatattagc tgtttgaaag 601 ggaaaatgcc tatcgatttg gtgatagacg atagagaagg aggatcaaaa tcagacagtg 661 aagatataac aagatcagca aatctaactg accagccctc ttggaacaga gatcatgatg 721 acacggcatc tactcgttca ggaggaaccc caggcccttc cagcggtggc cacacgtcac 781 acagtgggga caacagcagt gagcaaggtg atggcttgga caacagtgta gcttccccca 841 gcacaggtga cgatgatgac cctgataagg acaaaaagcg tcacaaaaag cgtggcatct 901 ttcccaaagt agccacaaat atcatgaggg cgtggctgtt ccagcatcta acacaccctt 961 acccttctga agaacagaaa aagcagttgg cacaagacac gggactcacc atccttcaag 1021 tgaacaattg gtttattaat gcccggagaa gaatagtgca gcccatgata gaccagtcca 1081 accgagcagt aagtcaagga acaccttata atcctgatgg acagcccatg ggaggtttcg 1141 taatggacgg tcagcaacat atgggaatta gagcaccagg acctatgagt ggaatgggca 1201 tgaatatggg catggagggg cagtggcact acatgtaacc ttcatctagt taaccaatcg 1261 caaagcaagg gggaaggctg caaagtatgc caggggagta tgtagcccgg ggtggtccaa 1321 tgggtgtgag tatgggacag ccaagttata cccaacccca gatgcccccc catcctgctc 1381 agctgcgtca tgggcccccc atgcatacgt acattcctgg acaccctcac cacccaacag 1441 tgatgatgca tggaggaccg ccccaccctg gaatgccaat gtcagcatca agccccacag 1501 ttcttaatac aggagaccca acaatgagtg gacaagtcat ggacattcat gctcagtagc 1561 ttaagggaat atgcattgtc tgcaatggtg actgatttca aatcatgttt tttctgcaat 1621 gactgtggag ttccattctt ggcatctact ctggaccaag gagcatccct aattcttcat 1681 agggaccttt aaaaagcagg aaataccaac tgaagtcaat ttgggggaca tgctaaataa 1741 ctatataaga cattaagaga acaaagagtg aaatattgta aatgctatta tactgttatc 1801 catattacgt tgtttcttat agatttttta aaaaaaatgt gaaatttttc cacactatgt 1861 gtgttgtttc catagctctt cacttcctcc agaagcctcc ttacattaaa aagccttaca 1921 gttatcctgc aagggacagg aaggtctgat ttgcaggatt tttagagcat taaaataact 1981 atcaggcaga agaatctttc ttctcgccta ggatttcagc catgcgcgcg ctctctctct 2041 ttctctctct tttcctctct ctccctcttt ctagcctggg gcttgaattt gcatgtctaa 2101 ttcatttact caccatattt gaattggcct gaacagatgt aaatcgggaa ggatgggaaa 2161 aactgcagtc atcaacaatg attaatcagc tgttgcaggc agtgtcttaa ggagactggt 2221 aggaggaggc atggaaacca aaaggccgtg tgtttagaag cctaattgtc acatcaagca 2281 tcattgtccc catgcaacaa ccaccacctt atacatcact tcctgtttta agcagctcta 2341 aaacatagac tgaagattta tttttaatat gttgacttta tttctgagca aagcatcggt 2401 catgtgtgta ttttttcata gtcccacctt ggagcattta tgtagacatt gtaaataaat 2461 tttgtgcaaa aaggactgga aaaatgaact gtattattgc aatttttttt t // LOCUS HSU85768 360 bp mRNA PRI 01-APR-1997 DEFINITION Human myeloid progenitor inhibitory factor-1 MPIF-2 mRNA, complete cds. ACCESSION U85768 NID g1916251 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 360) AUTHORS Patel,V.P., Kreider,B.L., Li,Y., Li,H., Leung,K., Salcedo,T., Nardelli,B., Pippalla,V., Gentz,S., Thotakura,R., Parmelee,D., Gentz,R. and Garotta,G. TITLE Molecular and functional characterization of two novel human C-C chemokines as inhibitors of two distinct classes of myeloid progenitors JOURNAL J. Exp. Med. (1997) In press REFERENCE 2 (bases 1 to 360) AUTHORS Li,H. and Patel,V.P. TITLE Direct Submission JOURNAL Submitted (17-JAN-1997) Cell Biology, Human Genome Sciences, 9410 Keywest Ave., Rockville, MD 20850, USA FEATURES Location/Qualifiers source 1..360 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1..360 /note="myeloid progenitor inhibitory factor-2" /codon_start=1 /product="MPIF-2" /db_xref="PID:g1916252" /translation="MAGLMTIVTSLLFLGVCAHHIIPTGSVVIPSPCCMFFVSKRIPE NRVVSYQLSSRSTCLKGGVIFTTKKGQQFCGDPKQEWVQRYMKNLDAKQKKASPRARA VAVKGPVQRYPGNQTTC" BASE COUNT 85 a 106 c 96 g 73 t ORIGIN 1 atggcaggcc tgatgaccat agtaaccagc cttctgttcc ttggtgtctg tgcccaccac 61 atcatcccta cgggctctgt ggtcataccc tctccctgct gcatgttctt tgtttccaag 121 agaattcctg agaaccgagt ggtcagctac cagctgtcca gcaggagcac atgcctcaag 181 ggaggagtga tcttcaccac caagaagggc cagcagttct gtggcgaccc caagcaggag 241 tgggtccaga ggtacatgaa gaacctggac gccaagcaga agaaggcttc ccctagggcc 301 agggcagtgg ctgtcaaggg ccctgtccag agatatcctg gcaaccaaac cacctgctaa // LOCUS HSU85773 2302 bp mRNA PRI 26-JUN-1997 DEFINITION Human phosphomannomutase (PMM2) mRNA, complete cds. ACCESSION U85773 NID g2218086 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2302) AUTHORS Matthijs,G., Schollen,E., Pardon,E., Veiga-Da-Cunha,M., Jaeken,J., Cassiman,J.J. and Van Schaftingen,E. TITLE Mutations in PMMM2, a phosphomannomutase gene on chromosome 16p13, in carbohydrate-deficient glycoprotein type I syndrome JOURNAL Nature Genet. 16 (1), 88-92 (1997) MEDLINE 97285128 REMARK Erratum:[[published erratum appears in Nat Genet 1997 Jul;16(3):316]] REFERENCE 2 (bases 1 to 2302) AUTHORS Matthijs,G., Schollen,E. and Van Schaftingen,E. TITLE Direct Submission JOURNAL Submitted (17-JAN-1997) Center for Human Genetics, University of Leuven, Gasthuisberg ON6, Leuven B-3000, Belgium FEATURES Location/Qualifiers source 1..2302 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" /map="16p13; near D16S406" gene 49..789 /gene="PMM2" CDS 49..789 /gene="PMM2" /note="phosphomannomutase activity is deficient in patients with the carbohydrate-deficient glycoprotein syndrome type I (CDG1)" /codon_start=1 /product="phopshomannomutase" /db_xref="PID:g2218087" /translation="MAAPGPALCLFDVDGTLTAPRQKITKEMDDFLQKLRQKIKIGVV GGSDFEKVQEQLGNDVVEKYDYVFPENGLVAYKDGKLLCRQNIQSHLGEALIQDLINY CLSYIAKIKLPKKRGTFIEFRNGMLNVSPIGRSCSQEERIEFYELDKKENIRQKFVAD LRKEFAGKGLTFSIGGQISFDVFPDGWDKRYCLRHVENDGYKTIYFFGDKTMPGGNDH EIFTDPRTMGYSVTAPEDTRRICELLFS" BASE COUNT 583 a 596 c 611 g 512 t ORIGIN 1 gttcctcgtg ccaacgtgtc ttgtaaggtg cggctagaaa ctggggacat ggcagcgcct 61 ggcccagcgc tctgcctctt cgacgtggat gggaccctca ccgccccgcg gcagaaaatt 121 accaaagaaa tggatgactt cctacaaaaa ttgaggcaga agatcaaaat cggagtggta 181 ggcggatcgg actttgagaa agtgcaggag caactgggaa atgatgtggt tgaaaaatac 241 gattatgtgt ttccagaaaa tggcttggta gcatacaaag atgggaaact cttgtgtaga 301 cagaatattc aaagtcatct gggtgaggcc ctaatccaag atttaatcaa ctactgtctg 361 agctacattg cgaaaattaa actcccgaag aagaggggta ctttcattga attccgaaat 421 gggatgttaa acgtgtcccc tattggaaga agctgcagcc aagaagaacg cattgagttc 481 tacgaactcg ataaaaaaga aaatataaga caaaagtttg tagcagatct acggaaagag 541 tttgctggaa aaggcctcac gttttccata ggaggccaga tcagctttga tgtctttcct 601 gatggatggg acaagagata ctgtctgcga catgtggaaa atgacggtta taagaccatt 661 tatttctttg gagacaaaac tatgccaggt ggcaatgacc atgagatctt cacagacccc 721 agaaccatgg gctactccgt gacagcgcct gaggacacgc gcaggatctg tgaactgctg 781 ttctcctaac gtgggagcgg gaggggcggg gtcccggctg acaagcagca tagggcattc 841 ggtggccaga gccgagggtc ctcccacacg tgctcaccca cccgcagcct aggcaggctc 901 tgcatgctat gccaggcatg tgccgtctgg acttccacct ccagtgccag aaacttccag 961 aaagaaggag aaactcttgt caagaatggc ccagaggaat gcctcgcaca aaaggtcttc 1021 cccacccacc cccagccccc tagtctaata cccaccctga tacgtgcaat catgtagttt 1081 tggcggaaat ttccccatca ttctaggatg atacagaaag aaaactgtgc ctggaccctc 1141 cctcttggtg ggtctgtgga aacataagcg gtttttttaa tgggcccctg catcaatacc 1201 aaacatgggg gtttggtaat gagaaaccag gacaggccat ctgcagtgac ccagcccagg 1261 acgaagttta caaacacctc ctggaacgaa gctcccgcct gcatgtcacc ttgatggggg 1321 ctgtgagtgg ggcagtgtga tacccagtga ctagacgcac tctgcgtttt cccgtgtttg 1381 gggctgaggc ctgctggaca gatggctggc caagtgggag cagaccctag ggagtttgca 1441 cctcggctgg gccggattcg gaccggctct gtgttcacta cactcagaat agcctgctgc 1501 ttctctgtct ccgagaccgg agtacttggg aacaacagct gggctggaga gttggtgctg 1561 gcaaaacagt ccttcccctg gggccggttc ttacccaggt ccagagaaac caacgcggga 1621 tgtcagactt caccaaaagg actttctggt tgcccctggc tggcttcctg gaggcgttcg 1681 cctctagttt ctcagggatg gagcgagagc ccagccagag aacagtaaga ggagctgctc 1741 tcctatctgc actcacccag gccttcaccc agactttacc gcggaggcgg ctgagtgcag 1801 ctacagctag gtccgcgtcc ctcactcttt tcatcttctg cacgttcttc gtgaaactgg 1861 aaggatcccg ggtctcagct agaacacggt ggaagagaac tttcctagga aacggttcat 1921 gtgtcacttt tcaggatgtg gaaacactga gccatacacc ctccattgct tggtgctggg 1981 gttgtgtggc ctccactggg cacttgccga cctgagtctg gggccagggg agcccaggct 2041 gccctgcact cctgcctccc agcccacagc caggtgcttt catcacagct aaacctggtt 2101 ccctccaaac ctcccagcca ctcgggcttg taactgtctg agccccggat ccggtggggt 2161 gaaagcagcc agctcatccc agtgactcac aggacacagc catccagcgg catctttcct 2221 tgtcgaatga tactgtaatg accttccaaa gtgaagagta gcacattaaa gtgattttat 2281 tgtttaaaaa aaaaaaaaaa aa // LOCUS HSU85946 2343 bp mRNA PRI 02-MAY-1997 DEFINITION Human brain secretory protein hSec10p (HSEC10) mRNA, complete cds. ACCESSION U85946 NID g2062604 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2343) AUTHORS Guo,W., Novick,P. and De Camilli,P. TITLE Direct Submission JOURNAL Submitted (19-JAN-1997) Cell Biology, Yale School of Medicine, 333 Cedar Street, New Haven, CT 06510, USA FEATURES Location/Qualifiers source 1..2343 /organism="Homo sapiens" /db_xref="taxon:9606" gene 196..2322 /gene="HSEC10" CDS 196..2322 /gene="HSEC10" /note="similar to S. cerevisiae Sec10p" /codon_start=1 /product="brain secretory protein hSec10p" /db_xref="PID:g2062605" /translation="MATTAELFEEPFVADEYIERLVWRTPGGGSRGGPEAFDPKRLLE EFVNHIQELQIMDERIQRKVEKLEQQCQKEAKEFAKKVQELQKSNQVAFQHFQELDEH ISYVATKVCHLGDQLEGVNTPRQRAVEAQKLMKYFNEFLDGELKSDVFTNSEKIKEAA DIIQKLHLIAQELPFDRFSEVKSKIASKYHDLECQLIQEFTSAQRRGEISRMREVAAV LLHFKGYSHCVDVYIKQCQEGAYLRNDIFEDAGILCQRVNKQVGDIFSNPETVLAKLI QNVFEIKLQSFVKEQLEECRKSDAEQYLKNLYDLYTRTTNLSSKLMEFNLGTDKQTFL SKLIKSIFISYLENYIEVETGYLKSRSAMILQRYYDSKNHQKRSIGTGGIQDLKERIR QRTNLPLGPSIDTHGETFLSQEVVVNLLQETKQAFERCHRLSDPSDLPRNAFRIFTIL VEFLCIEHIDYALETGLAGIPSSDSRNANLYFLDVVQQANTIFHLFDKQFNDHLMPLI SSSPKLSECLQKKKEIIEQMEMKLDTGIDRTLNCMIGQMKHILAAEQKKTDFKPEDEN NVLIQYTNACVKVCAYVRKQVEKIKNSMDGKNVDTVLMELGVRFHRLIYEHLQQYSYS CMGGMLAICDVAEYRKCAKDFKIPMVLHLFDTLHALCNLLVVAPDNLKQVCSGEQLAN LDKNILHSFVQLRADYRSARLARHFS" BASE COUNT 739 a 426 c 530 g 648 t ORIGIN 1 attccggagc gtttgcggct tcgcttcatg gccgctctcc cgcccctcct gggatctgtg 61 gggagctggg gagcccgcag cggcccggag ccggagctgg cgagccgagc ggagacctgt 121 gcgccgcgcc tctgaggcgc agcatgtgaa gcggagacgg catccagtgg ggggcgagcc 181 tctcagccgg ccgggatggc taccacggcc gagctcttcg aggagccttt tgtggcagat 241 gaatatattg aacgtcttgt atggagaacc ccaggaggag gctctagagg tggacctgaa 301 gcttttgatc ctaaaagatt attagaagaa tttgtaaatc atattcagga actccagata 361 atggatgaaa ggattcagag gaaagtagag aaactagagc aacaatgtca gaaagaagcc 421 aaggaatttg ccaagaaggt acaagagctg cagaaaagca atcaggttgc cttccaacat 481 ttccaagaac tagatgagca cattagctat gtagcaacta aagtctgtca ccttggagac 541 cagttagagg gggtaaacac acccagacaa cgggcagtgg aggctcagaa attgatgaaa 601 tactttaatg agtttctaga tggagaattg aaatctgatg tttttacaaa ttctgaaaag 661 ataaaggaag cagcagacat cattcagaag ttgcacctaa ttgcccaaga gttacctttt 721 gatagatttt cagaagttaa atccaaaatt gcaagtaaat accatgattt agaatgccag 781 ctgattcagg agtttaccag tgctcaaaga agaggtgaaa tctccagaat gagagaagta 841 gcagcagttt tacttcattt taagggttat tcccattgtg ttgatgttta tataaagcag 901 tgccaggagg gtgcttattt gagaaatgat atatttgaag acgctggaat actctgtcaa 961 agagtgaaca aacaagttgg agatatcttc agtaatccag aaacagtcct ggctaaactt 1021 attcaaaatg tatttgaaat caaactacag agttttgtga aagagcagtt agaagaatgt 1081 aggaagtccg atgcagagca atatctcaaa aatctctatg atctgtatac aagaaccacc 1141 aatctttcca gcaagctgat ggagtttaat ttaggtactg ataaacagac tttcttgtct 1201 aagcttatca aatccatttt catttcctat ttggagaact atattgaggt ggagactgga 1261 tatttgaaaa gcagaagtgc tatgatccta cagcgctatt atgattcgaa aaaccatcaa 1321 aagagatcca ttggcacagg aggtattcaa gatttgaagg aaagaattag acagcgtacc 1381 aacttaccac ttgggccaag tatcgatact catggggaga cttttctatc ccaagaagtg 1441 gtggttaatc ttttacaaga aaccaaacaa gcctttgaaa gatgtcatag gctctctgat 1501 ccttctgact taccaaggaa tgccttcaga atttttacca ttcttgtgga atttttatgt 1561 attgagcata ttgattatgc tttggaaaca ggacttgctg gaattccctc ttcagattct 1621 aggaatgcaa atctttattt tttggacgtt gtgcaacagg ccaatactat ttttcatctt 1681 tttgacaaac agtttaatga tcaccttatg ccactaataa gctcttctcc taagttatct 1741 gaatgccttc agaagaaaaa agaaataatt gaacaaatgg agatgaaatt ggatactggc 1801 attgatagga cattaaattg tatgattgga cagatgaagc atattttggc tgcagaacag 1861 aagaaaacag attttaagcc agaagatgaa aacaatgttt tgattcaata tactaatgcc 1921 tgtgtaaaag tctgtgctta cgtaagaaaa caagtggaga agattaaaaa ttccatggat 1981 gggaagaatg tggatacagt tttgatggaa cttggagtac gttttcatcg acttatctat 2041 gagcatcttc aacaatattc ctacagttgt atgggtggca tgttggccat ttgtgatgta 2101 gccgaatata ggaagtgtgc caaagacttc aagattccaa tggtattaca tctttttgat 2161 actctgcatg ctctttgcaa tcttctggta gttgccccag ataatttaaa gcaagtctgc 2221 tcaggagaac aacttgctaa tctggacaag aatatacttc actccttcgt acaacttcgt 2281 gctgattata gatctgcccg ccttgctcga cacttcagct gagattgaat ttacaaagga 2341 att // LOCUS HSU86136 8665 bp mRNA PRI 25-FEB-1997 DEFINITION Human telomerase-associated protein TP-1 mRNA, complete cds. ACCESSION U86136 NID g1848276 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8665) AUTHORS Harrington,L., McPhail,T., Mar,V., Zhou,W., Oulton,R., Bass,M.B., Arruda,I. and Robinson,M.O. TITLE A mammalian telomerase-associated protein JOURNAL Science 275 (5302), 973-977 (1997) MEDLINE 97172559 REFERENCE 2 (bases 1 to 8665) AUTHORS Robinson,M.O. and Harrington,L. TITLE Direct Submission JOURNAL Submitted (22-JAN-1997) Molecular Genetics, Amgen, Inc., 1840 Dehavilland Drive, Thousand Oaks, CA 91320, USA FEATURES Location/Qualifiers source 1..8665 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 80..7963 /codon_start=1 /product="telomerase-associated protein TP-1" /db_xref="PID:g1848277" /translation="MEKLHGHVSAHPDILSLENRCLAMLPDLQPLEKLHQHVSTHSDI LSLKNQCLATLPDLKTMEKPHGYVSAHPDILSLENQCLATLSDLKTMEKPHGHVSAHP DILSLENRCLATLPSLKSTVSASPLFQSLQISHMTQADLYRVNNSNCLLSEPPSWRAQ HFSKGLDLSTCPIALKSISATETAQEATLGRWFDSEEKKGAETQMPSYSLSLGEEEEV EDLAVKLTSGDSESHPEPTDHVLQEKKMALLSLLCSTLVSEVNMNNTSDPTLAAIFEI CRELALLEPEFILKASLYARQQLNVRNVANNILAIAAFLPACRPHLRRYFCAIVQLPS DWIQVAELYQSLAEGDKNKLVPLPACLRTAMTDKFAQFDEYQLAKYNPRKHRAKRHPR RPPRSPGMEPPFSHRCFPRYIGFLREEQRKFEKAGDTVSEKKNPPRFTLKKLVQRLHI HKPAQHVQALLGYRYPSNLQLFSRSRLPGPWDSSRAGKRMKLSRPETWERELSLRGNK ASVWEELIENGKLPFMAMLRNLCNLLRVGISSRHHELILQRLQHGKSVIHSRQFPFRF LNAHDAIDALEAQLRNQALPFPSNITLMRRILTRNEKNRPRRRFLCHLSRQQLRMAMR IPVLYEQLKREKLRVHKARQWKYDGEMLNRYRQALETAVNLSVKHSLPLLPGRTVLVY LTDANADRLCPKSNPQGPPLNYALLLIGMMITRAEQVDVVLCGGDTLKTAVLKAEEGI LKTAIKLQAQVQEFDENDGWSLNTFGKYLLSLAGQRVPVDRVILLGQSMDDGMINVAK QLYWQRVNSKCLFVGILLRRVQYLSTDLNPNDVTLSGCTDAILKFIAEHGASHLLEHV GQMDKIFKIPPPPGKTGVQSLRPLEEDTPSPLAPVSQQGWRSIRLFISSTFRDMHGER DLLLRSVLPALQARAAPHRISLHGIDLRWGVTEEETRRNRQLEVCLGEVENAQLFVGI LGSRYGYIPPSYNLPDHPHFHWAQQYPSGRSVTEMEVMQFLNRNQRLQPSAQALIYFR DSSFLSSVPDAWKSDFVSESEEAACRISELKSYLSRQKGITCRRYPCEWGGVAAGRPY VGGLEEFGQLVLQDVWNMIQKLYLQPGALLEQPVSIPDDDLVQATFQQLQKPPSPARP RLLQDTVQQLMLPHGRLSLVTGQSGQGKTAFLASLVSALQAPDGAKVAPLVFFHFSGA RPDQGLALTLLRRLCTYLRGQLKEPGALPSTYRSLVWELQQRLLPKSAESLHPGQTQV LIIDGADRLVDQNGQLISDWIPKKLPRCVHLVLSVSSDAGLGETLEQSQGAHVLALGP LEASARARLVREELALYGKRLEESPFNNQMRLLLVKRESGRPLYLRLVTDHLRLFTLY EQVSERLRTLPATVPLLLQHILSTLEKEHGPDVLPQALTALEVTRSGLTVDQLHGVLS VWRTLPKGTKSWEEAVAAGNSGDPYPMGPFACLVQSLRSLLGEGPLERPGARLCLPDG PLRTAAKRCYGKRPGLEDTAHILIAAQLWKTCDADASGTFRSCPPEALGDLPYHLLQS GNRGLLSKFLTNLHVVAAHLELGLVSRLLEAHALYASSVPKEEQKLPEADVAVFRTFL RQQASILSQYPRLLPQQAANQPLDSPLCHQASLLSRRWHLQHTLRWLNKPRTMKNQQS SSLSLAVSSSPTAVAFSTNGQRAAVGTANGTVYLLDLRTWQEEKSVVSGCDGISACLF LSDDTLFLTAFDGLLELWDLQHGCRVLQTKAHQYQITGCCLSPDCRLLATVCLGGCLK LWDTVRGQLAFQHTYPKSLNCVAFHPEGQVIATGSWAGSISFFQVDGLKVTKDLGAPG ASIRTLAFNVPGGVVAVGRLDSMVELWAWREGARLAAFPAHHGFVAAALFLHAGCQLL TAGEDGKVQVWSGSLGRPRGHLGSLSLSPALSVALSPDGDRVAVGYRADGIRIYKISS GSQGAQGQALDVAVSALAWLSPKVLVSGAEDGSLQGWALKECSLQSLWLLSRFQKPVL GLATSQELLASASEDFTVQLWPRQLLTRPHKAEDFPCGTELRGHEGPVSCCSFSTDGG SLATGGRDRSLLCWDVRTPKTPVLIHSFPACHRDWVTGCAWTKDNLLISCSSDGSVGL WDPESGQRLGQFLGHQSAVSAVAAVEEHVVSVSRDGTLKVWDHQGVELTSIPAHSGPI SHCAAAMEPRAAGQPGSELLVVTVGLDGATRLWHPLLVCQTHTLLGHSGPVRAAAVSE TSGLMLTASEDGSVRLWQVPKEADDTCIPRSSAAVTAVAWAPDGSMAVSGNQAGELIL WQEAKAVATAQAPGHIGALIWSSAHTFFVLSADEKISEWQVKLRKGSAPGNLSLHLNR ILQEDLGVLTSLDWAPDGHFLILAKADLKLLCMKPGDAPSEIWSSYTENPMILSTHKE YGIFVLQPKDPGVLSFLRQKESGEFEERLNFDINLENPSRTLISITQAKPESESSFLC ASSDGILWNLAKCSPEGEWTTGNMWQKKANTPETQTPGTDPSTCRESDASMDSDASMD SEPTPHLKTRQRRKIHSGSVTALHVLPELLVTASKDRDVKLWERPSMQLLGLFRCEGS VSCLEPWLGANSTLQLAVGDVQGNVYFLNWE" repeat_region 8492..8665 /rpt_family="ALU" BASE COUNT 1948 a 2360 c 2410 g 1946 t 1 others ORIGIN 1 ggtaccggtc cggaattccc gggtcgaccc acgcgtccgg aatcggacgc cccaggcata 61 tacaagctga gtttcagcca tggaaaaact ccatgggcat gtgtctgccc atccagacat 121 cctctccttg gagaaccggt gcctggctat gctccctgac ttacagccct tggagaaact 181 acatcagcat gtatctaccc actcagatat cctctccttg aagaaccagt gcctagccac 241 gcttcctgac ctgaagacca tggaaaaacc acatggatat gtgtctgccc acccagacat 301 cctctccttg gagaaccagt gcctggccac actttctgac ctgaagacca tggagaaacc 361 acatggacat gtttctgccc acccagacat cctctccttg gagaaccggt gcctggccac 421 cctccctagt ctaaagagca ctgtgtctgc cagccccttg ttccagagtc tacagatatc 481 tcacatgacg caagctgatt tgtaccgtgt gaacaacagc aattgcctgc tctctgagcc 541 tccaagttgg agggctcagc atttctctaa gggactagac ctttcaacct gccctatagc 601 cctgaaatcc atctctgcca cagagacagc tcaggaagca actttgggtc gttggtttga 661 ttcagaagag aagaaagggg cagagaccca aatgccttct tatagtctga gcttgggaga 721 ggaggaggag gtggaggatc tggccgtgaa gctcacctct ggagactctg aatctcatcc 781 agagcctact gaccatgtcc ttcaggaaaa gaagatggct ctactgagct tgctgtgctc 841 tactctggtc tcagaagtaa acatgaacaa tacatctgac cccaccctgg ctgccatttt 901 tgaaatctgt cgtgaacttg ccctcctgga gcctgagttt atcctcaagg catctttgta 961 tgccaggcag cagctgaacg tccggaatgt ggccaataac atcttggcca ttgctgcttt 1021 cttgccggcg tgtcgccccc acctgcgacg atatttctgt gccattgtcc agctgccttc 1081 tgactggatc caggtggctg agctttacca gagcctggct gagggagata agaataagct 1141 ggtgcccctg cccgcctgtc tccgtactgc catgacggac aaatttgccc agtttgacga 1201 gtaccagctg gctaagtaca accctcggaa gcaccgggcc aagagacacc cccgccggcc 1261 accccgctct ccagggatgg agcctccatt ttctcacaga tgttttccaa ggtacatagg 1321 gtttctcaga gaagagcaga gaaagtttga gaaggccggt gatacagtgt cagagaaaaa 1381 gaatcctcca aggttcaccc tgaagaagct ggttcagcga ctgcacatcc acaagcctgc 1441 ccagcacgtt caagccctgc tgggttacag atacccctcc aacctacagc tcttttctcg 1501 aagtcgcctt cctgggcctt gggattctag cagagctggg aagaggatga agctgtctag 1561 gccagagacc tgggagcggg agctgagcct acgggggaac aaagcgtcgg tctgggagga 1621 actcattgaa aatgggaagc ttcccttcat ggccatgctt cggaacctgt gcaacctgct 1681 gcgggttgga atcagttccc gccaccatga gctcattctc cagagactcc agcatgggaa 1741 gtcggtgatc cacagtcggc agtttccatt cagatttctt aacgcccatg atgccattga 1801 tgccctcgag gctcaactca gaaatcaagc attgcccttt ccttcgaata taacactgat 1861 gaggcggata ctaactagaa atgaaaagaa ccgtcccagg cggaggtttc tttgccacct 1921 aagccgtcag cagcttcgta tggcaatgag gatacctgtg ttgtatgagc agctcaagag 1981 ggagaagctg agagtacaca aggccagaca gtggaaatat gatggtgaga tgctgaacag 2041 gtaccgacag gccctagaga cagctgtgaa cctctctgtg aagcacagcc tgcccctgct 2101 gccaggccgc actgtcttgg tctatctgac agatgctaat gcagacaggc tctgtccaaa 2161 gagcaaccca caagggcccc cgctgaacta tgcactgctg ttgattggga tgatgatcac 2221 gagggcggag caggtggacg tcgtgctgtg tggaggtgac actctgaaga ctgcagtgct 2281 taaggcagaa gaaggcatcc tgaagactgc catcaagctc caggctcaag tccaggagtt 2341 tgatgaaaat gatggatggt ccctgaatac ttttgggaaa tacctgctgt ctctggctgg 2401 ccaaagggtt cctgtggaca gggtcatcct ccttggccaa agcatggatg atggaatgat 2461 aaatgtggcc aaacagcttt actggcagcg tgtgaattcc aagtgcctct ttgttggtat 2521 cctcctaaga agggtacaat acctgtcaac agatttgaat cccaatgatg tgacactctc 2581 aggctgtact gatgcgatac tgaagttcat tgcagagcat ggggcctccc atcttctgga 2641 acatgtgggc caaatggaca aaatattcaa gattccacca cccccaggaa agacaggggt 2701 ccagtctctc cggccactgg aagaggacac tccaagcccc ttggctcctg tttcccagca 2761 aggatggcgc agcatccggc ttttcatttc atccactttc cgagacatgc acggggagcg 2821 ggacctgctg ctgaggtctg tgctgccagc actgcaggcc cgagcggccc ctcaccgtat 2881 cagccttcac ggaatcgacc tccgctgggg cgtcactgag gaggagaccc gtaggaacag 2941 acaactggaa gtgtgccttg gggaggtgga gaacgcacag ctgtttgtgg ggattctggg 3001 ctcccgttat ggatacattc cccccagcta caaccttcct gaccatccac acttccactg 3061 ggcccagcag tacccttcag ggcgctctgt gacagagatg gaggtgatgc agttcctgaa 3121 ccggaaccaa cgtctgcagc cctctgccca agctctcatc tacttccggg attccagctt 3181 cctcagctct gtgccagatg cctggaaatc tgactttgtt tctgagtctg aagaggccgc 3241 atgtcggatc tcagaactga agagctacct aagcagacag aaagggataa cctgccgcag 3301 atacccctgt gagtgggggg gtgtggcagc tggccggccc tatgttggcg ggctggagga 3361 gtttgggcag ttggttctgc aggatgtatg gaatatgatc cagaagctct acctgcagcc 3421 tggggccctg ctggagcagc cagtgtccat cccagacgat gacttggtcc aggccacctt 3481 ccagcagctg cagaagccac cgagtcctgc ccggccacgc cttcttcagg acacagtgca 3541 acagctgatg ctgccccacg gaaggctgag cctggtgacg gggcagtcag gacagggcaa 3601 gacagccttc ctggcatctc ttgtgtcagc cctgcaggct cctgatgggg ccaaggtggc 3661 accattagtc ttcttccact tttctggggc tcgtcctgac cagggtcttg ccctcactct 3721 gctcagacgc ctctgtacct atctgcgtgg ccaactaaaa gagccaggtg ccctccccag 3781 cacctaccga agcctggtgt gggagctgca gcagaggctg ctgcccaagt ctgctgagtc 3841 cctgcatcct ggccagaccc aggtcctgat catcgatggg gctgataggt tagtggacca 3901 gaatgggcag ctgatttcag actggatccc aaagaagctt ccccggtgtg tacacctggt 3961 gctgagtgtg tctagtgatg caggcctagg ggagaccctt gagcagagcc agggtgccca 4021 cgtgctggcc ttggggcctc tggaggcctc tgctcgggcc cggctggtga gagaggagct 4081 ggccctgtac gggaagcggc tggaggagtc accatttaac aaccagatgc gactgctgct 4141 ggtgaagcgg gaatcaggcc ggccgctcta cctgcgcttg gtcaccgatc acctgaggct 4201 cttcacgctg tatgagcagg tgtctgagag actccggacc ctgcctgcca ctgtccccct 4261 gctgctgcag cacatcctga gcacactgga gaaggagcac gggcctgatg tccttcccca 4321 ggccttgact gccctagaag tcacacggag tggtttgact gtggaccagc tgcacggagt 4381 gctgagtgtg tggcggacac taccgaaggg gactaagagc tgggaagaag cagtggctgc 4441 tggtaacagt ggagacccct accccatggg cccgtttgcc tgcctcgtcc agagtctgcg 4501 cagtttgcta ggggagggcc ctctggagcg ccctggtgcc cggctgtgcc tccctgatgg 4561 gcccctgaga acagcagcta aacgttgcta tgggaagagg ccagggctag aggacacggc 4621 acacatcctc attgcagctc agctctggaa gacatgtgac gctgatgcct caggcacctt 4681 ccgaagttgc cctcctgagg ctctgggaga cctgccttac cacctgctcc agagcgggaa 4741 ccgtggactt ctttcgaagt tccttaccaa cctccatgtg gtggctgcac acttggaatt 4801 gggtctggtc tctcggctct tggaggccca tgccctctat gcttcttcag tccccaaaga 4861 ggaacaaaag ctccccgagg ctgacgttgc agtgtttcgc accttcctga ggcagcaggc 4921 ttcaatcctc agccagtacc cccggctcct gccccagcag gcagccaacc agcccctgga 4981 ctcacctctt tgccaccaag cctcgctgct ctcccggaga tggcacctcc aacacacact 5041 acgatggctt aataaacccc ggaccatgaa aaatcagcaa agctccagcc tgtctctggc 5101 agtttcctca tcccctactg ctgtggcctt ctccaccaat gggcaaagag cagctgtggg 5161 cactgccaat gggacagttt acctgttgga cctgagaact tggcaggagg agaagtctgt 5221 ggtgagtggc tgtgatggaa tctctgcttg tttgttcctc tccgatgata cactctttct 5281 tactgccttc gacgggctcc tggagctctg ggacctgcag catggttgtc gggtgctgca 5341 gactaaggct caccagtacc aaatcactgg ctgctgcctg agcccagact gccggctgct 5401 agccaccgtg tgcttgggag gatgcctaaa gctgtgggac acagtccgtg ggcagctggc 5461 cttccagcac acctacccca agtccctgaa ctgtgttgcc ttccacccag aggggcaggt 5521 aatagccaca ggcagctggg ctggcagcat cagcttcttc caggtggatg ggctcaaagt 5581 caccaaggac ctgggggcac ccggagcctc tatccgtacc ttggccttca atgtgcctgg 5641 gggggttgtg gctgtgggcc ggctggacag tatggtggag ctgtgggcct ggcgagaagg 5701 ggcacggctg gctgccttcc ctgcccacca tggctttgtt gctgctgcgc ttttcctgca 5761 tgcgggttgc cagttactga cggctggaga ggatggcaag gttcaggtgt ggtcagggtc 5821 tctgggtcgg ccccgtgggc acctgggttc cctttctctc tctcctgccc tctctgtggc 5881 actcagccca gatggtgatc gggtggctgt tggatatcga gcggatggca ttaggatcta 5941 caaaatctct tcaggttccc agggggctca gggtcaggca ctggatgtgg cagtgtccgc 6001 cctggcctgg ctaagcccca aggtattggt gagtggtgca gaagatgggt ccttgcaggg 6061 ctgggcactc aaggaatgct cccttcagtc cctctggctc ctgtccagat tccagaagcc 6121 tgtgctagga ctggccactt cccaggagct cttggcttct gcctcagagg atttcacagt 6181 gcagctgtgg ccaaggcagc tgctgacgcg gccacacaag gcagaagact ttccctgtgg 6241 cactgagctg cggggacatg agggccctgt gagctgctgt agtttcagca ctgatggagg 6301 cagcctggcc accgggggcc gggatcggag tctcctctgc tgggacgtga ggacacccaa 6361 aacccctgtt ttgatccact ccttccctgc ctgtcaccgt gactgggtca ctggctgtgc 6421 ctggaccaaa gataacctac tgatatcctg ctccagtgat ggctctgtgg ggctctggga 6481 cccagagtca ggacagcggc ttggtcagtt cctgggtcat cagagtgctg tgagcgctgt 6541 ggcagctgtg gaggagcacg tggtgtctgt gagccgggat gggaccttga aagtgtggga 6601 ccatcaaggc gtggagctga ccagcatccc tgctcactca ggacccatta gccactgtgc 6661 agctgccatg gagccccgtg cagctggaca gcctgggtca gagcttctgg tggtaaccgt 6721 cgggctagat ggggccacac ggttatggca tccactcttg gtgtgccaaa cccacaccct 6781 cctgggacac agcggcccag tccgtgctgc tgctgtttca gaaacctcag gcctcatgct 6841 gaccgcctct gaggatggtt ctgtacggct ctggcaggtt cctaaggaag cagatgacac 6901 atgtatacca aggagttctg cagccgtcac tgctgtggct tgggcaccag atggttccat 6961 ggcagtatct ggaaatcaag ctggggaact aatcttgtgg caggaagcta aggctgtggc 7021 cacagcacag gctccaggcc acattggtgc tctgatctgg tcctcggcac acaccttttt 7081 tgtcctcagt gctgatgaga aaatcagcga gtggcaagtg aaactgcgga agggttcggc 7141 acccggaaat ttgagtcttc acctgaaccg aattctacag gaggacttag gggtgctgac 7201 aagtctggat tgggctcctg atggtcactt tctcatcttg gccaaagcag atttgaagtt 7261 actttgcatg aagccagggg atgctccatc tgaaatctgg agcagctata cagaaaatcc 7321 tatgatattg tccacccaca aggagtatgg catatttgtc ctgcagccca aggatcctgg 7381 agttctttct ttcttgaggc aaaaggaatc aggagagttt gaagagaggc tgaactttga 7441 tataaactta gagaatccta gtaggaccct aatatcgata actcaagcca aacctgaatc 7501 tgagtcctca tttttgtgtg ccagctctga tgggatccta tggaacctgg ccaaatgcag 7561 cccagaagga gaatggacca caggtaacat gtggcagaaa aaagcaaaca ctccagaaac 7621 ccaaactcca gggacagacc catctacctg cagggaatct gatgccagca tggatagtga 7681 tgccagcatg gatagtgagc caacaccaca tctaaagaca cggcagcgta gaaagattca 7741 ctcgggctct gtcacagccc tccatgtgct acctgagttg ctggtgacag cttcgaagga 7801 cagagatgtt aagctatggg agagacccag tatgcagctg ctgggcctgt tccgatgcga 7861 agggtcagtg agctgcctgg aaccttggct gggcgctaac tccaccctgc agcttgccgt 7921 gggagacgtg cagggcaatg tgtactttct gaattgggaa tgaagatgtg ccactcggga 7981 ataatgatac cccttgtgct agagatgcaa agcctgaaga cactggtagc ttttaataat 8041 tataaaatta ataatttctt gataattata aaaatgaagt gtcaaaaaat ctcaagtgta 8101 ggcctgcctg tgttctcatg tggatttaga acaggaggat attctatgtg tatgtatatg 8161 tacattctaa tgtgtgtctc ttcttattca acattaatcc ttactagaac cacaagaaag 8221 tgaatgaaat ctttagtagg tactcttttg aaactaggtt ytagaattct tgcatcactc 8281 gcgggcccta ggaccctagg atgccattct tgccaggagg aggaatgaga gtgatgttgg 8341 ccaacattca atttgaacag agcatggaag acctttcagt tcatcgggaa agaatgaggg 8401 agggagaata agtcagtcat gcatcagggc atttagaaag agctatgttt ctgtcacaga 8461 gacagccctt ttctcagaac tacccagagg aggccgggca tggtggctca cgcttgtaat 8521 cccagcactt tgggaggccg aggtgggcag atcacgaggt caggagatca agaccatcct 8581 ggctaacata gtgaaaccct gtctctacta aaaaatacaa aaagttaacc aggcatgtag 8641 cggccgctct agaggatcca agctt // LOCUS HSU86358 879 bp mRNA PRI 11-SEP-1997 DEFINITION Human chemokine (TECK) mRNA, complete cds. ACCESSION U86358 NID g2388626 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 879) AUTHORS Vicari,A.P., Figueroa,D.J., Hedrick,J.A., Foster,J.S., Singh,K.P., Menon,S., Copeland,N.G., Gilbert,D.J., Jenkins,N.A., Bacon,K.B. and Ziotnik,A. TITLE TECK: a novel cc chemokine specifically expressed by thymic dendritic cells and potentially involved in T cell development JOURNAL Immunology 7, 291-301 (1997) REFERENCE 2 (bases 1 to 879) AUTHORS Vicari,A.P. and Zlotnik,A. TITLE Direct Submission JOURNAL Submitted (21-JAN-1997) Immunology, DNAX Research Institute, 901 California Ave., Palo Alto, CA 94304, USA FEATURES Location/Qualifiers source 1..879 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="4" /tissue_type="thymus" gene 1..879 /gene="TECK" CDS 1..453 /gene="TECK" /codon_start=1 /product="chemokine" /db_xref="PID:g2388627" /translation="MNLWLLACLVAGFLGAWAPAVHTQGVFEDCCLAYHYPIGWAVLR RAWTYRIQEVSGSCNLPAAIFYLPKRHRKVCGNPKSREVQRAMKLLDARNKVFAKLHH NMQTFQAGPHAVKKLSSGNSKLSSSKFSNPISSSKRNVSLLISANSGL" BASE COUNT 191 a 264 c 218 g 206 t ORIGIN 1 atgaacctgt ggctcctggc ctgcctggtg gccggcttcc tgggagcctg ggcccccgct 61 gtccacaccc aaggtgtctt tgaggactgc tgcctggcct accactaccc cattgggtgg 121 gctgtgctcc ggcgcgcctg gacttaccgg atccaggagg tgagcgggag ctgcaatctg 181 cctgctgcga tattctacct ccccaagaga cacaggaagg tgtgtgggaa ccccaaaagc 241 agggaggtgc agagagccat gaagctcctg gatgctcgaa ataaggtttt tgcaaagctc 301 caccacaaca tgcagacctt ccaagcaggc cctcatgctg taaagaagtt gagttctgga 361 aactccaagt tatcatcatc caagtttagc aatcccatca gcagcagcaa gaggaatgtc 421 tccctcctga tatcagctaa ttcaggactg tgagccggct catttctggg ctccatcggc 481 acaggagggg ccggatcttt ctccgataaa accgtcgccc tacagaccca gctgtcccca 541 cgcctctgtc ttttgggtca agtcttaatc cctgcacctg agttggtcct ccctctgcac 601 ccccaccacc tcctgcccgt ctggcaactg gaaagaagga gttggcctga ttttaacctt 661 ttgccgctcc ggggaacagc acaatcctgg gcagccagtg gctcttgtag agaaaactta 721 ggatacctct ctcactttct gtttcttgcc gtccaccccg ggccatgcca gtgtgtcctc 781 tgggtcccct ccaaaaatct ggtcattcaa ggatcccctc ccaaggctat gcttttctat 841 aacttttaaa taaaccttgg ggggtgaatg gaataaaaa // LOCUS HSU86602 1325 bp mRNA PRI 11-FEB-1997 DEFINITION Human nucleolar protein p40 mRNA, complete cds. ACCESSION U86602 NID g1835785 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1325) AUTHORS Henning,D., Busch,R.K., Perlaky,L., Zhu,L., Valdez,B.C. and Busch,H. TITLE Cloning and partial characterization of nucleolar protein p40 JOURNAL Unpublished REFERENCE 2 (bases 1 to 1325) AUTHORS Henning,D., Busch,R.K., Perlaky,L., Zhu,L., Valdez,B.C. and Busch,H. TITLE Direct Submission JOURNAL Submitted (22-JAN-1997) Pharmacology, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..1325 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 143..1063 /note="cell proliferation-associated protein" /codon_start=1 /product="nucleolar protein p40" /db_xref="PID:g1835786" /translation="MDTPPLSDSESESDESLVTDRELQDAFSRGLLKPGLNVVLEGPK KAVNDVNGLKQCLAEFKRDLEWVERLDVTLGPVPEIGGSEAPAPQNKDQKAVDPEDDF QREMSFYRQAQAAVLAVLPRLHQLKVPTKRPTDYFAEMAKSDLQVQKIRQKLQTKQAA MERSEKAKQLRALRKYGKKVQTEVLQKRQQEKAHMMNAIKKYQKGFSDKLDFLEGDQK PLAQRKKAGAKGQQMRKGPSAKRRYKNQKFGFGGKKKGSKWNTRESYDDVSSFRAKTA HGRGLKRPGKKGSNKRPGKRTREKMKNRTH" BASE COUNT 403 a 279 c 366 g 277 t ORIGIN 1 gcgattcggt ggcacgtgga gccacggcgt gggagtaggg ggctgaaggc aggcagcagc 61 ggccagggcc gccctctgct agccgcttgg gtctcgggat accccgtttc ttcctgtagg 121 tgtgggacgt gcgtgcggcg agatggacac tcccccgctc tcggattcgg agtcggaatc 181 cgatgaatcc cttgtcacag acagagagtt gcaggatgcg ttttcccgag ggcttctgaa 241 gccaggcctc aatgtcgtgc tagaggggcc gaagaaggcc gtgaacgacg tgaatggcct 301 gaagcaatgt ttggcagaat tcaagcggga tctggaatgg gttgaaaggc tcgatgtgac 361 actgggtccg gtaccggaga tcggtggatc tgaggcgcca gcacctcaga acaaggacca 421 gaaagctgtt gatccagaag acgacttcca gcgagagatg agtttctatc gccaagccca 481 ggccgcagtg cttgcagtct taccccgcct ccatcagctc aaagtcccta cgaagcgacc 541 cactgattat tttgcggaaa tggccaaatc tgatctgcag gtgcagaaga ttcgacagaa 601 gctgcagact aaacaggctg ccatggagag gtctgaaaaa gctaagcaac tgcgagcact 661 taggaaatac gggaagaagg tgcaaacgga ggttcttcag aagaggcagc aggagaaagc 721 ccatatgatg aatgctatta agaaatatca gaaaggcttc tctgataaac tggatttcct 781 tgagggagat cagaaacctc tggcacagcg caagaaggca ggagccaaag gccagcagat 841 gaggaagggg cccagtgcta aacgacggta taaaaaccag aagtttggtt ttggtggaaa 901 gaagaaaggc tcaaagtgga acactcggga gagctatgat gatgtatcta gcttccgggc 961 caagacagct catggcagag gcctcaagag gcctggcaag aaagggtcaa ataagagacc 1021 tggaaaacga acaagagaga agatgaagaa cagaacacac taaatagcat ctttgaatac 1081 aaagaaccaa gaaaaaggaa tgaagactcg caatttcacg acacactttg atcccttctg 1141 ttggtgtcat gttgtaaaca tttctttcaa taaactaaag aaaaattatt aaaggaacac 1201 atacctttgg ttaaatagtc tagactaaaa gattgagaag ttactttcca ttgctatcta 1261 ttgataattt agacattgag ttcaaattgc cttcatttta tgataaataa tgatttaact 1321 gaaaa // LOCUS HSU86751 3286 bp mRNA PRI 03-SEP-1997 DEFINITION Human nucleolar fibrillar center protein (ASE-1) mRNA, complete cds. ACCESSION U86751 NID g2351682 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3286) AUTHORS Whitehead,C.M., Winkfein,R.J., Fritzler,M.J. and Rattner,J.B. TITLE ASE-1, A Novel Protein of the Nucleolar Fibrillar Centre JOURNAL Unpublished REFERENCE 2 (bases 1 to 3286) AUTHORS Whitehead,C.M., Winkfein,R.J., Fritzler,M.J. and Rattner,J.B. TITLE Direct Submission JOURNAL Submitted (24-JAN-1997) Medical Biochemistry, University of Calgary, 3330 Hospital Dr. NW, Calgary, AB T2N 4N1, Canada FEATURES Location/Qualifiers source 1..3286 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="19" /map="19q13.3" gene 489..2021 /gene="ASE-1" CDS 489..2021 /gene="ASE-1" /codon_start=1 /product="nucleolar fibrillar center protein" /db_xref="PID:g2351683" /translation="MEEPQAGDAARFSCPPNFTAKPPASESPRFSLEALTGPDTELWL IQAPADFAPECFNGRHVPLSGSQIVKGKLAGKRHRYRVLSSCPQAGEATLLAPSTEAG GGLTCASAPQGTLRILEGPQQSLSGSPLQPIPASPPPQIPPGLRPRFCAFGGNPPVTG PRSALAPNLLTSGKKKKEMQVTEAPVTQEAVNGHGALEVDMALGSPEMDVRKKKKKKN QQLKEPEAAGPVGTEPTVETLEPLGVLFPSTTKKRKKPKGKETFEPEDKTVKQEQINT EPLEDTVLSPTKKRKRQKGTEGMEPEEGVTVESQPQVKVEPLEEAIPLPPTKKRKKEK GQMAMMEPGTEAMEPVEPEMKPLESPGGTMAPQQPEGAKPQAQAALAAPKKKTKKEKQ QDATVEPETEVVGPELPDDLEPQAAPTSTKKKKKKKERGHTVTEPIQPLEPELPGEGQ PEARATPGSTKKRKKQSQESRMPETVPQEEMPGPPLNSESGEEAPTGRDKKRKQQQQQ PV" polyA_signal 3235..3240 BASE COUNT 799 a 895 c 896 g 696 t ORIGIN 1 aagttctgaa cttgtgaggc atctgggcct ccccagaaga catttaacac agaaagcaca 61 gccctactaa ctagtattct tacctgtctc ttcaagaatt tcagaccaat cgaccgtcct 121 gtctctttaa ggcttaggaa gagcagtgtg gctgcccctt taaggaggcg ttgcaacaaa 181 ccatattgga cagacgatgg gggcgaccca tcgggacccg acgggcctct gactccagca 241 atacagcgaa tcagcggctt tcgggaatac atttttcgga aaaagacttc ttcctcggtt 301 ttctgctctg cacacgttga aattttcccc agtttttcct gcagatcggg agtcgagcaa 361 tgcctacccc cgcgctcccg caccagttgg gcgctcccgg atgatgccct acccctttgg 421 atccacgtgg tctgcaacct ggtgcgagca gcccgggcta cagggttgcc tgaggtgtgg 481 gtcccaggat ggaggagccc caggccggcg atgctgctcg gttctcttgt ccccccaact 541 ttaccgcgaa gcccccagcc tcagagtccc ctcgtttctc cttggaggcg ctgacgggtc 601 cagatacgga gctgtggctt attcaggccc ctgcagactt tgccccagaa tgcttcaatg 661 ggcggcatgt gcctctctct ggctcccaga tcgtcaaggg caaattggca ggcaagcggc 721 accgctatcg agtcctcagc agctgtcccc aagctggaga agcgaccctg ctggccccct 781 caacggaggc aggaggtgga ctcacctgtg cctcagcccc ccagggcacc ctaaggatcc 841 ttgagggtcc ccagcaatcc ctgtcaggga gccctctgca gcccatccca gcaagtcccc 901 caccacagat ccctcctggc ctgaggcctc ggttctgtgc ctttgggggc aacccaccag 961 tcacagggcc taggtcagcc ttggccccca acctgctcac ctcagggaag aagaaaaagg 1021 agatgcaggt gacagaggcc ccagtcactc aggaggcagt gaatgggcac ggggccctgg 1081 aggtggacat ggctttgggg tcgccagaaa tggatgtgcg gaagaagaag aagaaaaaaa 1141 atcagcagct gaaagaacca gaggcagcag ggcctgtggg gacagagccc acagtggaga 1201 cactggagcc tctgggagtg ctgttcccgt ccaccaccaa gaagaggaag aagcccaaag 1261 ggaaagaaac cttcgagcca gaagacaaga cagtgaagca ggaacagatt aacactgagc 1321 ctctagaaga cacagtcctg tccccgacca aaaagagaaa gaggcaaaag gggacggaag 1381 ggatggagcc agaggagggg gtgacagttg agtctcagcc acaggtgaag gtggagccac 1441 tggaggaagc catccctctg ccccctacga agaagaggaa aaaagaaaag ggacagatgg 1501 caatgatgga gccagggacg gaggcgatgg agccagtgga gccggagatg aagcctctgg 1561 agtccccagg ggggaccatg gcgcctcaac agccagaagg agcgaagcct caggcccagg 1621 cagctctggc agctcccaaa aagaagacga agaaagaaaa acagcaagat gccacagtgg 1681 agccagagac agaggtggtg gggcctgagc tgccggatga ccttgagcct caggcagctc 1741 ccacatccac caagaagaag aagaagaaga aagagagagg tcacacagtg actgagccaa 1801 ttcagccact agagcctgaa ctgccagggg agggacagcc tgaagccagg gcaactccgg 1861 gatccaccaa gaagaggaag aagcagagtc aggaaagccg gatgccagag acagtgcccc 1921 aagaggagat gccagggccg ccactgaatt cagagtctgg ggaggaggct cccacaggcc 1981 gggacaagaa gcggaagcag cagcagcagc agcctgtgta gtctgccccc gggaaactga 2041 ggaactaaag aaagctgaag gtgcccacct gggccaccag aaggtgacac ccccagaatc 2101 cctccccaga gactgcacca gcgcagccag caggagcctg gcctgggagg acgatttatt 2161 attacactgg gggtttcctt ggcagctggg gtcatcaggg tactttcaag aagggctcgt 2221 gcaggacatc aaacagcctc cgggcctgga tgggagggag aaaaaaatga ggaaccagtc 2281 attaaaggag ctgtttcctg ggtaaatcta gagtggggtt ttggttcttt attttcccct 2341 ataccctcaa gcatttatcc attgagttac aaacaatcca gttacaatct ttttaagtta 2401 ttattattat tattattttt tttttttttg agatggagtc tcgctctgtc gcccaggttg 2461 gagtgcagtg gcgcaatctc ggctcactgc aagctccgcc tcccgggttc acgccattct 2521 cctgcctcag cctcctgagt agctgggact acaggcccct gcccagctaa ttttttgtat 2581 ttttttttag tagagatggg gtttcaccac gttagccagg atggtctcga tctcctgacc 2641 tcctgatgcg cctgcctcag cctcccagtg ctgggattat aggtgtgagc cactgcgcct 2701 ggctaagtta ttattatttt tttgagacag tctcctggtg tcacccaggc tggagtgcag 2761 tggtgtgatc ttggctcact gcaacctccg cctcctgggt tccaacgatt ctcctgcctc 2821 agcctcccga gtagctgggc ctaaaggtgc ccaccactat acccggctaa tttttgtatt 2881 tttagtagag acaggggttt caccatattg gccaggctgg tctcgaactc ctgacctcgt 2941 gatccacctg ccttgacctc ccaaagtgct aggataacag gtgtgagcca ccgcaccctg 3001 ccaagttatt ttaaaatgta ccattattat tgactatagt cacctggttg tgttatcaaa 3061 tagtatgtct tattcattct ttctttgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtggta 3121 cccattaacc ttccccatct ccctgccagc ccctaactac cctccccagc ctccaggaac 3181 tatccatcca ctcttatctc catgagttca attgttttga tttttagata cacaaataaa 3241 taagaacatg caatgtttgt ctttctgtgc ctggcttatt tcactt // LOCUS HSU86753 2409 bp mRNA PRI 14-JUN-1997 DEFINITION Human Cdc5-related protein (PCDC5RP) mRNA, complete cds. ACCESSION U86753 NID g1854034 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2409) AUTHORS Bernstein,H.S. and Coughlin,S.R. TITLE Pombe Cdc5-related protein. A putative human transcription factor implicated in mitogen-activated signaling JOURNAL J. Biol. Chem. 272 (9), 5833-5837 (1997) MEDLINE 97190317 REFERENCE 2 (bases 1 to 2409) AUTHORS Bernstein,H.S. TITLE Direct Submission JOURNAL Submitted (24-JAN-1997) Pediatric Cardiology, UCSF, 513 Parnassus Ave., Box 00632, San Francisco, CA 94143-0632, USA FEATURES Location/Qualifiers source 1..2409 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" gene 1..2409 /gene="PCDC5RP" CDS 1..2409 /gene="PCDC5RP" /note="similar to S. pombe Cdc5" /codon_start=1 /product="pombe Cdc5-related protein" /db_xref="PID:g1854035" /translation="MPRIMIKGGVWRNTEDEILKAAVMKYGKNQWSRIASLLHRKSAK QCKARWYEWLDPSIKKTEWSREEEEKLLHLAKLMPTQWRTIAPIIGRTAAQCLEHYEF LLDKAAQRDNEEETTDDPRKLKPGEIDPNPETKPARPDPIDMDEDELEMLSEARARLA NTQGKKAKRKAREKQLEEARRLAALQKRRELRAAGIEIQKKRKRKRGVDYNAEIPFEK KPALGFYDTSEENYQALDADFRKLRQQDLDGELRSEKEGRDRKKDKQHLKRKKESDLP SAILQTSGVSEFTKKRSKLVLPAPQISDAELQEVVKVGQASEIARQTAEESGITNSAS STLLSEYNVTNNSVALRTPRTPASQDRILQEAQNLMALTNVDTPLKGGLNTPLHESDF SGVTPQRQVVQTPNTVLSTPFRTPSNGAEGLTPRSGTTPKPVINSTPGRTPLRDKLNI NPEDGMADYSDPSYVKQMERESREHLRLGLLGLPAPKNDFEIVLPENAEKELEEREID DTYIEDAADVDARKQAIRDAERVKEMKRMHKAVQKDLPRPSEVNETILRPLNVEPPLT DLQKSEELIKKEMITMLHYDLLHHPYEPSGNKKGKTVGFGTNNSEHITYLEHNPYEKF SKEELKKAQDVLVQEMEVVKQGMSHGELSSEAYNQVWEECYSQVLYLPGQSRYTRANL ASKKDRIESLEKRLEINRGHMTTEAKRAAKMEKKMKILLGGYQSRAMGLMKQLNDLWD QIEQAHLELRTFEELKKHEDSAIPRRLECLKEDVQRQQEREKELQHRYADLLLEKETL KSKF" BASE COUNT 855 a 472 c 561 g 521 t ORIGIN 1 atgcctcgaa ttatgatcaa ggggggcgta tggaggaata ccgaggatga aattctgaaa 61 gcagcggtaa tgaaatatgg gaaaaatcag tggtctagga ttgcctcatt gctgcataga 121 aaatcagcaa agcagtgcaa agccagatgg tatgaatggc tggatccaag cattaagaag 181 acagaatggt ccagagaaga agaggaaaaa ctcttgcact tggccaagtt gatgccaact 241 cagtggagga ccattgctcc aatcattgga agaacagcgg cccagtgctt agaacactat 301 gaatttcttc tggataaagc tgcccaaaga gacaatgaag aggaaacaac agatgatcca 361 cgaaaactta aacctggaga aatagatcca aatccagaaa caaaaccagc gcggcctgat 421 ccaattgata tggatgagga tgaacttgag atgctttctg aagccagagc ccgcttggct 481 aatactcagg gaaagaaggc caagaggaaa gcaagagaga aacaattgga agaagcaaga 541 cgtcttgctg ccctccaaaa aagaagagaa cttcgagcag ctggcataga aattcagaag 601 aaaagaaaaa ggaagagagg agttgattat aatgccgaaa tcccatttga aaaaaagcct 661 gcccttggtt tttatgatac ttctgaggaa aactaccaag ctcttgacgc agatttcagg 721 aaattaagac aacaggatct tgatggggag ctaagatctg aaaaagaagg aagagataga 781 aaaaaagaca aacagcattt gaaaaggaaa aaagaatctg atttaccatc agctattctt 841 caaactagtg gtgtttctga atttactaaa aagagaagca aactagtact tcctgcccct 901 cagatttcag atgcagaact ccaggaagtt gtaaaagtag gccaagcgag tgaaattgca 961 cgtcaaactg ccgaggaatc tggcataaca aattctgctt ccagtacact tttgtctgag 1021 tacaatgtca ccaacaacag cgttgctctt agaacaccac gaacaccagc ttcccaggac 1081 agaattctgc aggaagccca gaacctcatg gccctcacca atgtggacac cccattgaaa 1141 ggtggactta ataccccatt gcatgagagt gacttctcag gtgtaactcc acagcgacaa 1201 gttgtacaga ctccaaacac agttctctct actccattca ggactccttc taatggagct 1261 gaagggctga ctccccggag tggaacaact cccaaaccag ttattaactc tactccgggt 1321 agaactcctc ttcgagacaa gttaaacatt aatcccgagg atggaatggc agactatagt 1381 gatccctctt acgtgaagca gatggaaaga gaatcccgag aacatctccg tttagggttg 1441 ttgggccttc ctgcccctaa gaatgatttt gaaattgttc taccagaaaa tgccgagaag 1501 gagctggaag aacgtgaaat agatgatact tacattgaag atgctgctga tgtggatgct 1561 cgaaagcagg ccatacgaga tgcagagcgt gtaaaggaaa tgaaacgaat gcataaagct 1621 gtccagaaag atctgccaag accatcagaa gtaaatgaaa ctattctaag acccttaaat 1681 gtagaaccgc ctttaacaga tttacagaaa agtgaagaac taatcaaaaa agaaatgatc 1741 acaatgcttc attatgacct tctacatcac ccttatgaac catctggaaa taaaaaaggc 1801 aaaactgtag ggtttggtac caataattca gagcacatta cctatctgga acataatcct 1861 tatgaaaagt tctccaaaga agagctgaaa aaggcccagg atgttttggt gcaggagatg 1921 gaagtggtta aacaaggaat gagccatgga gagctctcaa gtgaagctta taaccaggtg 1981 tgggaagaat gctacagtca agttttatat cttcctgggc agagccgcta cacacgggcc 2041 aatctggcta gtaaaaagga cagaattgaa tcacttgaaa agaggctcga gataaacagg 2101 ggtcacatga cgacagaagc caagagggct gcaaagatgg aaaagaagat gaaaattttg 2161 cttgggggtt accagtctcg tgctatgggg ctcatgaaac agttgaatga cttatgggac 2221 caaattgaac aggctcactt ggagttacgc acttttgaag aactcaagaa acatgaagat 2281 tctgctattc cccggaggct agagtgtcta aaagaagacg ttcagcgaca acaagaaaga 2341 gaaaaggaac ttcaacatag atatgctgat ttgctgctgg agaaagagac tttaaagtca 2401 aaattctga // LOCUS HSU86782 1132 bp mRNA PRI 21-NOV-1997 DEFINITION Human 26S proteasome-associated pad1 homolog (POH1) mRNA, complete cds. ACCESSION U86782 NID g2073565 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1132) AUTHORS Spataro,V., Toda,T., Craig,R., Seeger,M., Dubiel,W., Harris,A.L. and Norbury,C. TITLE Resistance to diverse drugs and ultraviolet light conferred by overexpression of a novel human 26 S proteasome subunit JOURNAL J. Biol. Chem. 272 (48), 30470-30475 (1997) MEDLINE 98043754 REFERENCE 2 (bases 1 to 1132) AUTHORS Norbury,C. TITLE Direct Submission JOURNAL Submitted (23-JAN-1997) ICRF Molecular Oncology Laboratory, Institute of Molecular Medicine, John Radcliffe Hospital, Headley Way, Oxford OX3 9DS, UK FEATURES Location/Qualifiers source 1..1132 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="fibroblast" /tissue_type="lung" gene 1..1132 /gene="POH1" CDS 200..1132 /gene="POH1" /function="induces multi-drug resistance on overexpression" /note="human homolog of fission yeast pad1" /codon_start=1 /product="26S proteasome-associated pad1 homolog" /db_xref="PID:g1923256" /translation="MDRLLRLGGGMPGLGQGPPTDAPAVDTAEQVYISSLALLKMLKH GRAGVPMEVMGLMLGEFVDDYTVRVIDVFAMPQSGTGVSVEAVDPVFQAKMLDMLKQT GRPEMVVGWYHSHPGFGCWLSGVDINTQQSFEALSERAVAVVVDPIQSVKGKVVIDAF RLINANMMVLGHEPRQTTSNLGHLNKPSIQALIHGLNRHYYSITINYRKNELEQKMLL NLHKKSWMEGLTLQDYSEHCKHNESVVKEMLELAKNYNKAVEEEDKMTPEQLAIKNVG KQDPKRHLEEHVDVLMTSNIVQCLAAMLDTVVFK" BASE COUNT 352 a 197 c 284 g 299 t ORIGIN 1 gcgtcaccac agaggcaaga caagggtcca tatcgcggca tccggctccc gcccgtcttc 61 aggagagaaa gaaaaaataa aatatacttg gggaagttgt acctgccaga attagtaaga 121 gctttcttta agaagacatt tgtcaaactc aacaaattga aggttaacac cttaagagtt 181 gtagttactg accagaaata tggacagact tcttagactt ggaggaggta tgcctggact 241 gggccagggg ccacctacag atgctcctgc agtggacaca gcagaacaag tctatatctc 301 ttccctggca ctgttaaaaa tgttaaaaca tggccgtgct ggagttccaa tggaagttat 361 gggtttgatg cttggagaat ttgttgatga ttataccgtc agagtgattg atgtgtttgc 421 tatgccacag tcaggaacag gtgtcagtgt ggaggcagtt gatccagtgt tccaagctaa 481 aatgttggat atgttgaagc agacaggaag gccggagatg gttgttggtt ggtatcacag 541 tcaccctggc tttggttgtt ggctttctgg tgtggatatc aacactcagc agagctttga 601 agccttgtcg gagagagctg tggcagtggt tgtggatccc attcagagtg taaaaggaaa 661 ggttgttatt gatgccttca gattgatcaa tgctaatatg atggtcttag gacatgaacc 721 aagacaaaca acttcgaatc tgggtcactt aaacaagcca tctatccagg cattaattca 781 tggactaaac agacattatt actccattac tattaactat cggaaaaatg aactggaaca 841 gaagatgttg ctaaatttgc ataagaagag ttggatggaa ggtttgacac ttcaggacta 901 cagtgaacat tgtaaacaca atgaatcagt ggtaaaagag atgttggaat tagccaagaa 961 ttacaataag gctgtagaag aagaagataa gatgacacct gaacagctgg caataaagaa 1021 tgttggcaag caggacccca aacgtcattt ggaggaacat gtggatgtac ttatgacctc 1081 aaatattgtc cagtgtttag cagctatgtt ggatactgtc gtatttaaat aa // LOCUS HSU87223 5293 bp mRNA PRI 04-MAR-1997 DEFINITION Human contactin associated protein (Caspr) mRNA, complete cds. ACCESSION U87223 NID g1857707 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5293) AUTHORS Peles,E., Nativ,M., Lustig,M., Grumet,M., Schilling,J., Martinez,R., Plowman,G.D. and Schlessinger,J. TITLE Identification of a Novel Contactin Associated Transmembrane Receptor with Multiple Domains Implicated in Protein-Protein Interactions JOURNAL EMBO J. (1997) In press REFERENCE 2 (bases 1 to 5293) AUTHORS Peles,E. TITLE Direct Submission JOURNAL Submitted (24-JAN-1997) Research, SUGEN Inc., 515 Galveston Drive, Redwood City, CA 94063, USA FEATURES Location/Qualifiers source 1..5293 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17" /map="17q21" /cell_line="IMR-32" /tissue_type="neuroblastoma" gene 218..4372 /gene="Caspr" CDS 218..4372 /gene="Caspr" /note="neurexin-like protein" /codon_start=1 /product="contactin associated protein" /db_xref="PID:g1857708" /translation="MMHLRLFCILLAAVSGAEGWGYYGCDEELVGPLYARSLGASSYY SLLTAPRFARLHGISGWSPRIGDPNPWLQIDLMKKHRIRAVATQGSFNSWDWVTRYML LYGDRVDSWTPFYQRGHNSTFFGNVNESAVVRHDLHFHFTARYIRIVPLAWNPRGKIG LRLGLYGCPYKADILYFDGDDAISYRFPRGVSRSLWDVFAFSFKTEEKDGLLLHAEGA QGDYVTLELEGAHLLLHMSLGSSPIQPRPGHTTVSAGGVLNDQHWHYVRVDRFGRDVN FTLDGYVQRFILNGDFERLNLDTEMFIGGLVGAARKNLAYRHNFRGCIENVIFNRVNI ADLAVRRHSRITFEGKVAFRCLDPVPHPINFGGPHNFVQVPGFPRRGRLAVSFRFRTW DLTGLLLFSRLGDGLGHVELTLSEGQVNVSIAQSGRKKLQFAAGYRLNDGFWHEVNFV AQENHAVISIDDVEGAEVRVSYPLLIRTGTSYFFGGCPKPASRWDCHSNQTAFHGCME LLKVDGQLVNLTLVEGRRLGFYAEVLFDTCGITDRCSPNMCEHDGRCYQSWDDFICYC ELTGYKGETCHTPLYKESCEAYRLSGKTSGNFTIDPDGSGPLKPFVVYCDIRENRAWT VVRHDRLWTTRVTGSSMERPFLGAIQYWNASWEEVSALANASQHCEQWIEFSCYNSRL LNTAGGYPYSFWIGRNEEQHFYWGGSQPGIQRCACGLDRSCVDPALYCNCDADQPQWR TDKGLLTFVDHLPVTQVVIGDTNRSTSEAQFFLRPLRCYGDRNSWNTISFHTGAALRF PPIRANHSLDVSFYFRTSAPSGVFLENMGGPYCQWRRPYVRVELNTSRDVVFAFDVGN GDENLTVHSDDFEFNDDEWHLVRAEINVKQARLRVDHRPWVLRPMPLQTYIWMEYDQP LYVGSAELKRRPFVGCLRAMRLNGVTLNLEGRANASEGTSPNCTGHCAHPRLPCFHGG RCVERYSYYTCDCDLTAFDGPYCNHDIGGFFEPGTWMRYNLQSALRSAAREFSHMLSR PVPGYEPGYIPGYDTPGYVPGYHGPGYRLPDYPRPGRPVPGYRGPVYNVTGEEVSFSF STSSAPAVLLYVSSFVRDYMAVLIKDDGTLQLRYQLGTSPYVYQLTTRPVTDGQPHSI NITRVYRNLFIQVDYFPLTEQKFSLLVDSQLDSPKALYLGRVMETGVIDPEIQRYNTP GFSGCLSGVRFNNVAPLKTHFRTPRPMTAELAEALRVQGELSESNCGAMPRLVSEVPP ELDPWYLPPDFPYYHDEGWVAILLGFLVAFLLLGLVGMLVLFYLQNHRYKGSYHTNEP KAAHEYHPGSKPPLPTSGPAQVPTPTAAPNQAPASAPAPAPTPAPAPGPRDQNLPQIL EESRSE" BASE COUNT 1097 a 1576 c 1448 g 1172 t ORIGIN 1 caagagcgga ggaccaggaa ccagagagag agagagagaa aagagagagg agagacagag 61 cgcttggggg cgaaaggaga gagggaggga agggtgggta aggaggagag agcggtctgc 121 tgcaaacccc aggaggagag cttggagccc aagccagaac tcgagcccta gccggagccg 181 ttcacaggga ggcggctgcc gggaccgtca gccctgcatg atgcatctcc ggctcttctg 241 catcctgctc gccgcggtct caggagccga gggctggggc tactacggct gcgacgagga 301 gctggtgggt cccctgtatg cacgctccct gggcgcctcc tcctactaca gtctccttac 361 tgcgccgcga ttcgccaggc tgcacggcat aagcgggtgg tcaccacgga ttggggatcc 421 gaatccctgg ctccagatag acttaatgaa gaagcaccgg atccgggccg tggccacaca 481 gggctccttt aattcttggg actgggtcac acgttacatg ctactctacg gcgaccgagt 541 ggacagctgg acaccgttct accagcgagg gcacaactcg accttctttg gtaacgtgaa 601 cgagtcggcg gtggtgcgcc atgacctgca cttccacttc actgcgcgct acatccgcat 661 cgtgcccctg gcctggaacc cacgcggcaa gatcggcctg aggctcggcc tctatggctg 721 cccatacaag gccgacatac tctatttcga cggcgacgat gccatctcct accgcttccc 781 gcgaggggtc agccgaagcc tgtgggacgt gttcgccttc agcttcaaga ccgaggagaa 841 ggacggtctt ctgctgcacg ccgagggcgc ccagggcgac tacgtgacgc tcgagctgga 901 gggggcacac ctgctgctgc acatgagcct gggcagcagc cctatccagc caagaccagg 961 tcacaccacc gtgagcgcag gcggagtcct caatgaccag cactggcact atgtgcgggt 1021 ggaccgattt ggccgcgatg taaatttcac cctggacggc tatgtgcagc gctttattct 1081 caatggagac ttcgagaggc tgaacctgga cactgagatg ttcatcggag gtctggtggg 1141 cgccgcgcgg aagaacctgg cctatcggca taacttccgc ggctgcatag aaaacgtaat 1201 cttcaaccgc gtcaacatcg cagacctggc cgtgcggcgc cattcccgga tcaccttcga 1261 gggtaaggtg gcttttcgtt gcctggaccc ggtaccgcac cctatcaact tcggaggccc 1321 tcacaacttc gttcaagtgc ccggtttccc acgccgtggc cgcctggcag tctcatttcg 1381 cttccgcacc tgggacctca ccgggcttct ccttttctcc cgtctggggg acgggctggg 1441 ccacgtggag ctgacgctca gcgaagggca ggtcaacgtg tccatcgcgc agagcggccg 1501 aaagaagctt cagttcgctg ctgggtaccg actgaatgac ggcttttggc acgaggtgaa 1561 ttttgtggca caggaaaacc atgcagttat cagcattgat gatgtggaag gggcagaggt 1621 cagggtctca tacccgttgc tgatccggac agggacctca tatttctttg ggggttgtcc 1681 caagccagcc agtcgatggg actgccactc caaccagacg gcattccatg gctgcatgga 1741 gctgctcaag gtggatggtc aactggtcaa cctgactctg gtggagggcc ggcggcttgg 1801 attctatgct gaggtcctct ttgatacatg tggcatcact gataggtgca gccctaacat 1861 gtgtgagcat gatggacgct gctaccagtc ttgggatgac ttcatttgct actgcgaact 1921 gacgggctac aagggagaga cctgccacac acctttgtat aaggaatcct gtgaggctta 1981 tcggctcagt gggaaaactt ctggaaactt caccattgat cctgatggca gtggccccct 2041 gaagccattt gtagtgtact gtgatatccg agagaaccga gcgtggacag ttgtgcggca 2101 tgacaggctg tggacaactc gagtgacagg ttccagcatg gagcggccat tcctgggggc 2161 tatccagtac tggaatgcat cctgggagga agtcagtgcc cttgccaatg cttcccagca 2221 ttgtgaacag tggatcgagt tctcctgcta caattcccgg ctgctcaaca ctgcaggagg 2281 ctacccctac agcttttgga ttggccgaaa tgaggagcag cacttctact ggggaggctc 2341 ccagcctggg atccagcgct gtgcctgtgg tctggaccgg agctgtgtgg accctgcctt 2401 gtactgcaac tgtgacgctg accagcccca gtggagaact gacaagggac tgctgacctt 2461 tgtggaccat ctgcctgtca ctcaggtagt gataggggat acgaaccgct ccacttctga 2521 ggcccagttc ttcctgaggc ctctgcgctg ctatggcgat cgaaattcct ggaacaccat 2581 ttccttccac accggggctg cactacgctt ccccccaatc cgtgccaacc acagcctgga 2641 tgtctccttc tacttcagga cctctgctcc ctcgggggtc ttcctagaga atatgggggg 2701 cccttactgc cagtggcgcc gaccttatgt gcgggtggaa ctcaacacat cccgggatgt 2761 ggtcttcgcc tttgatgtgg ggaatgggga tgagaacctc acagtacact cagacgactt 2821 tgagttcaat gatgacgagt ggcacctggt ccgggctgaa atcaacgtga agcaggcccg 2881 gctccgagtg gatcaccggc cctgggttct gcggcctatg ccactgcaga cctacatctg 2941 gatggagtat gaccagcccc tctatgtggg atctgcagag cttaagagac gcccctttgt 3001 gggttgcttg agggccatgc gtctgaacgg agtgactctg aacctggagg gccgtgccaa 3061 tgcctctgag ggtacctcac ccaactgcac aggccactgt gcccaccctc ggctcccctg 3121 tttccatgga ggccgctgcg tggagcgcta tagctactac acgtgtgact gtgacctcac 3181 ggcttttgat gggccatact gcaaccacga tattggtggt ttctttgagc cgggcacctg 3241 gatgcgctat aacctacagt cagcgctgcg ctctgcagcc agggagttct cccacatgct 3301 gagccggcca gtgccaggct atgagcctgg ctacatcccg ggctatgata ctccgggcta 3361 tgtgcctggc taccatggcc ccgggtaccg cctgcccgac tacccccggc ctggtcggcc 3421 tgtgcccggt taccgtgggc ctgtctacaa cgttacggga gaggaggtct ccttcagctt 3481 cagcaccagc tccgcccctg ctgtcctgct ctacgtcagt tcctttgttc gtgactacat 3541 ggctgtgctc atcaaggatg atgggaccct tcagctgcga tatcagctgg gcaccagtcc 3601 ctacgtgtac cagctaacca ctcgaccagt gaccgatggc cagccccata gcatcaatat 3661 cacccgtgtt taccggaacc tcttcatcca ggtggactac ttcccactga cagagcagaa 3721 gttctcgctg ttggtggaca gccagttgga ctcacccaag gccttgtatt tagggcgtgt 3781 gatggagaca ggagtcattg acccggagat ccagcgctac aacaccccag gtttctcagg 3841 ctgcctgtct ggtgttcgat tcaacaacgt ggctcccctc aagacccact tccgaacccc 3901 tcgacccatg actgctgagc tagctgaggc ccttcgagtt cagggagaac tgtccgaatc 3961 taattgcgga gctatgccac gtcttgtttc agaggtgcca cctgagcttg atccctggta 4021 tctgccccca gacttcccct actaccatga tgaaggatgg gttgccatac ttttaggctt 4081 tttggtggcc tttctgctgc tggggctggt gggaatgttg gtgctcttct atctgcaaaa 4141 tcatcgctat aagggctcct accataccaa tgagcccaag gctgcccacg agtaccatcc 4201 tggcagcaaa cctcccctac ccacttcagg ccctgcccag gtccccaccc ctacagcagc 4261 tcccaaccaa gctccagcct cagccccagc cccagcccca actccagccc cagcccctgg 4321 cccccgggat cagaacctac cccagatcct ggaggagtcc aggtctgaat gagtcagaag 4381 ggcttctggg accaattcca gctcctgaca ttcccccagt cctgcctctc ccccatccta 4441 tcagggacat ttggctcctc ttagctggct ctgctcatcc agaggatatt cccccatccc 4501 ccccccatca agtttggtgg gcagagctac agatgggacc caagggagtg gccgagcctc 4561 actgcctaaa ccaatgccct tctcatccct gtttccccag gctcctggct gtttatctgc 4621 cccaaaggag aagcctcatg gggttgacat aggtcctttc tgccatctct gttccagctg 4681 ctgtcaggga ttaacaacag agtgtagggg agattaactg cctcccttcc aatagacact 4741 atcagcaggg acagatgtgt gggagtgcag ggctgcagag ggtatggggg gaggaggctg 4801 ctaaacccta tcccccagcc tcccccctgc cctgaagatc ttccatttgc ttccactcag 4861 ctggaggctc aagagggctt gatggctgtc ccctgccccc ctccttttgt tttgtacaca 4921 gagaccaaga ggcctcagtt tagcacctta gtacctccgc tgcttcactt gctttagcca 4981 aagccataaa aaacctgcaa cgtagagaaa ataatgcaga taccctgact agccagccct 5041 ctactcctcc aaccttttcc aagatatgca atggcctttg tgcctgccca aaggcttcgc 5101 ccctccagtg catgaggaac cctctttcct ccgctcagag atgctgcttc atttacccag 5161 gaggtcatat tctttatata tattttttgt tgcaaagtgt ctctctagag aaactctata 5221 tattattcga atttttaaat tatttgttta tatataaaag aaaagctcaa ttggcaaaaa 5281 aaaaaaaaaa aaa // LOCUS HSU87269 2561 bp mRNA PRI 26-MAR-1997 DEFINITION Human p120E4F transcription factor mRNA, complete cds. ACCESSION U87269 NID g1906601 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2561) AUTHORS Fernandes,E.R. and Rooney,R.J. TITLE The adenovirus E1A-regulated transcription factor E4F is generated from the human homolog of nuclear factor phiAP3 JOURNAL Mol. Cell. Biol. 17 (4), 1890-1903 (1997) MEDLINE 97219979 REFERENCE 2 (bases 1 to 2561) AUTHORS Rooney,R.J. and Fernandes,E.R. TITLE Direct Submission JOURNAL Submitted (27-JAN-1997) Biochemistry, St. Jude Children's Research Hospital, 332 N. Lauderdale, Memphis, TN 38105, USA FEATURES Location/Qualifiers source 1..2561 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 25..2376 /note="E4F-1; transcription factor p50E4F is an amino-terminal fragment derived by cleavage of p120E4F at an undetermined cleavage site" /codon_start=1 /product="p120E4F transcription factor" /db_xref="PID:g1906602" /translation="MEGEMAVRVTAAHTAEARPKPGGKRARVQLRRWRRPWPSGFLGL PAPFSEEDEDDVHRCGRCQAEFTALEDFVQHKIQKACPAAPPEALPATPATTALLGQE VVPAAPGPEEPITVAHIVVEAASLAADISHASDLVGGGHIKEVIVAAEAELGDGEMAE APGSPHQQGLGLAGEGEQAQVKLLVNKDGRYVCALCHKTFKTGSILKAHMVTHSSRKD HECKLCGASFRTKGSLIRHHRRHTDERPYKCSKCGKSFRESGALTRHLKSLTPCTEKI RFSVSKDVVVSKEDARAGSGAGAAGLGTATSSVTGEPIETSPVIHLVTDAKGTVIHEV HVQMQELSLGMKALAPEPPVSQELPCSRKGSRENLLHQAMQNSGIVLERAAGEEGALE PAPAAGSSPQPLAVAAPQLPVLEVQPLETQVASEASAVPRTHPCPQCSETFPTAATLE AHKRGHTGPRPFACAQCGKAFPKAYLLKTDQEVHVRERRFRCGDCGKLYKTIAHVRGH RRVHSDERPYPCPKCGKRYKTKNAQQVHFRTHLEEKPHVCQFCSRGFREKGSLVRHVR HHTGEKPFKCYKCGRGFAEHGTLNRHLRTKGGCLLEVEELLVSEDSPAAATTVLTEDP HTVLVEFSSVVADTQEYIIEATADDAETSEATEIIEGTQTEVDSHIMKVVQQIVHPRS AGHQIIVQNVTMDEETALGPRGAAADTITIATPESLTEQVAMTLASAISEGTVLAARA GTSGTEQATVTMVSSEDIEILEHAGELVIASPEGQLEVQTVIV" BASE COUNT 532 a 801 c 848 g 380 t ORIGIN 1 gccatcttct gcggccgttg cgacatggag ggcgagatgg cagtgcgggt gacggccgct 61 catacggcag aagccaggcc gaagccgggc gggaagcggg cgagggtgca gttgcggcgg 121 tggcggcggc cttggcccag cggcttcctc ggcctcccgg cgcccttcag cgaggaagat 181 gaggacgatg tgcacagatg cggccgctgc caggcagagt tcaccgcctt ggaggatttt 241 gttcagcaca agattcagaa ggcctgccca gcggcccctc cggaggccct gcctgccacc 301 cctgccacca cagcgttgct gggccaggag gtggtgccgg cagcaccagg cccagaggag 361 cccatcactg tggcccacat cgtggtggag gcggcctctc tggcagcaga catcagccac 421 gcatctgacc ttgttggtgg tgggcacatc aaagaggtca tcgtggctgc tgaggcggag 481 ctgggagacg gtgagatggc cgaggccccg ggcagccccc accagcaggg gctggggctc 541 gcaggggagg gtgagcaggc ccaggtgaag ctactggtga acaaggatgg ccgctatgtg 601 tgtgcgctgt gccacaagac cttcaagacg ggcagcatcc tcaaggccca catggtcact 661 cacagcagcc gcaaggacca cgagtgcaag ctctgtgggg cctccttccg caccaagggc 721 tcactcatcc ggcaccaccg gcggcacacg gatgagcgcc cctacaagtg ctccaagtgt 781 ggaaagagct tccgggagtc gggtgcactg acccggcacc tcaagtctct caccccctgc 841 acagagaaaa tccgcttcag tgtgagcaag gacgtggttg tcagcaaaga ggacgcacgt 901 gcaggttctg gagctggagc tgccggcttg gggacagcca catcatcggt gacaggcgag 961 cctatagaga cttcacccgt gattcacctg gtgacagatg ccaagggcac cgtcatccac 1021 gaagtccacg tccagatgca ggagctgtcc ctgggcatga aagccctggc cccagagccc 1081 cccgtctccc aggagctccc ctgctccagg aagggcagcc gtgagaacct gctgcaccag 1141 gccatgcaga actccggcat cgtccttgag cgcgctgctg gggaggaggg tgccctggag 1201 ccagctcctg ctgccgggtc cagtccccag cccctggcag tggcagcccc gcagctgccg 1261 gtactggaag tgcagccgct ggagacacag gtggccagcg aggcctcagc ggtgcccagg 1321 acccacccat gtcctcagtg cagtgagacc ttcccgacag cagccaccct ggaggcccac 1381 aagaggggcc acaccgggcc gaggccgttc gcctgcgcgc agtgtggcaa ggccttcccc 1441 aaggcctacc tgctcaagac agaccaggag gtgcacgtgc gtgagcgccg tttccgctgt 1501 ggcgactgcg ggaagctcta caagaccatt gcccatgtgc gtggccaccg gcgcgtccac 1561 tcagacgagc ggccctaccc ttgtcccaag tgtggcaagc gctacaagac taagaacgca 1621 cagcaggtgc acttcaggac acacctggag gagaagccgc acgtgtgcca gttctgcagc 1681 cgtggcttcc gagagaaggg ctcactggtg cggcacgtgc gacaccacac aggcgagaag 1741 ccgttcaagt gctacaagtg cggccgtggc ttcgccgagc acggcacgct gaaccggcac 1801 ctgcgcacca aagggggctg cctgctggag gtggaggagt tgctggtgtc tgaggacagc 1861 cccgcggcag ccaccaccgt cctcacggaa gacccgcaca cagtgttggt ggagttctcg 1921 tccgtggtag ctgacaccca ggagtatatc atcgaggcca ctgcggacga tgcggagacc 1981 agtgaggcca cggagatcat cgagggcacc cagacagagg tggacagcca catcatgaag 2041 gtggtgcagc agatcgtgca cccacgtagc gccggccacc agatcatcgt gcagaacgtc 2101 accatggacg aggagacggc gctgggcccc agaggggctg ccgccgacac catcaccatc 2161 gccacccccg agagcctgac agagcaggtg gccatgacgc tggcctcggc catcagcgag 2221 ggcactgtgc ttgccgcccg ggcagggaca agtggcactg aacaggccac tgtgaccatg 2281 gtgtcatcag aggacatcga gatcctggag catgcaggcg agctggtcat cgcctcgccg 2341 gagggccagc tggaggtgca gacggtcatc gtctagcatg aggtctgcgg ggtcctggcc 2401 gggcagggac agggcagagg actctgaagc gccccaccca tgcctgcctg gcctggtaga 2461 gaagatggca caggatggag gcgccccaag acggacagtg tacataagag tttcttgttg 2521 ctttacaata aaacatgaga acctgccaaa aaaaaaaaaa a // LOCUS HSU87309 4919 bp mRNA PRI 18-FEB-1997 DEFINITION Human hVps41p (HVPS41) mRNA, complete cds. ACCESSION U87309 NID g1842092 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4919) AUTHORS Radisky,D., Snyder,W., Emr,S. and Kaplan,J. TITLE Identification of VPS41, a gene required for vacuolar traffic and the assembly of the yeast high affinity transport system JOURNAL Unpublished REFERENCE 2 (bases 1 to 4919) AUTHORS Radisky,D., Snyder,W., Emr,S. and Kaplan,J. TITLE Direct Submission JOURNAL Submitted (27-JAN-1997) Pathology, University of Utah, 50 North Medical Drive, Salt Lake City, UT 84132, USA FEATURES Location/Qualifiers source 1..4919 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Stratagene heart cDNA library, Catalog #936208" /tissue_type="heart" gene 30..2594 /gene="HVPS41" CDS 30..2594 /gene="HVPS41" /function="vacuolar traffic" /note="similar to yeast Vps41p" /codon_start=1 /product="hVps41p" /db_xref="PID:g1842093" /translation="MAEAVEQETGSLEESTDESEEEESEEEPKLKYERLSNGVTEILQ KDAASCMTVHDKFLALGTHYGKVYLLDVQGNITQKFDVSPVKINQISLDESGEHMGVC SEDGKVQVFGLYSGEEFHETFDCPIKIIAVHPHFVRSSCKQFVTGGKKLLLFERSWMN RWKSAVLHEGEGNIRSVKWRGHLIAWANNMGVKIFDIISKQRITNVPRDDISLRPDMY PCSLCWKDNVTLIIGWGTSVKVCSVKERHASEMRDLPSRYVEIVSQFETEFYISGLAP LCDQLVVLSYVKEISEKTEREYCARPRLDIIQPLSETCEEISSDALTVRGFQENECRD YHLEYSEGESLFYIVSPRDVVVAKERDQDDHIDWLLEKKKYEEALMAAEISQKNIKRH KILDIGLAYINHLVERGDYDIAARKCQKILGKNAALWEYEVYKFKEIGQLKAISPYLP RGDPVLKPLIYEMILHEFLESDYEGFATLIREWPGDLYNNSVIVQAVRDHLKKDSQNK TLLKTLAELYTYDKNYGNALEIYLTLRHKDVFQLIHKHNLFSSIKDKIVLLMDFDSEK AVDMLLDNEDKISIKKVVEELEDRPELQHVYLHKLFKRDHHKGQRYHEKQISLYAEYD RPNLLPFLRDSTHCPLEKALEICQQRNFVEETVYLLSRMGNSRSALKMIMEELHDVDK AIEFAKEQDDGELWEDLILYSIDKPPFITGLLNNIGTHVDPILLIHRIKEGMEIPNLR DSLVKILQDYNLQILLREGCKKILVADSLSLLKKMHRTQMKGVLVDEENICESCLSPI LPSDAAKPFSVVVFHCRHMFHKECLPMPSMNSAAQFCNICSAKNRGPGSAILEMKK" unsure 3302..4919 /note="single-stranded sequence only" repeat_region 3952..4232 /rpt_family="Alu" /rpt_type=dispersed BASE COUNT 1485 a 928 c 1091 g 1415 t ORIGIN 1 ttgctgtcag gtgactctcc cgtggcgcca tggcggaagc agtggagcag gaaactgggt 61 cccttgaaga atctacagat gagtctgagg aagaagagag cgaagaggaa cccaagctga 121 agtatgaaag gctttccaat ggggtaactg aaatacttca gaaggatgca gctagctgca 181 tgacagtcca tgacaagttt ttggcattgg gcacacatta tggcaaggtt tatttacttg 241 atgtccaggg gaacatcact cagaagtttg atgtaagtcc tgtgaagata aatcagatta 301 gcttggatga aagtggagag cacatgggtg tgtgttcaga ggatggcaag gtgcaggtat 361 ttggactgta ttctggagaa gaatttcacg agacttttga ctgtcccatt aaaattattg 421 ctgtgcaccc acatttcgtg agatccagtt gcaagcagtt tgtgaccgga gggaagaagc 481 tgctactgtt tgaacggtct tggatgaaca gatggaagtc tgctgttctg catgaagggg 541 aagggaacat aaggagtgtg aagtggagag gccatctgat tgcttgggcc aataatatgg 601 gtgtgaagat ttttgacatc atctcaaagc aaagaatcac caatgtgccc cgggatgata 661 taagtcttcg cccagacatg tatccctgca gcctctgctg gaaggacaat gtgacactga 721 ttattggctg ggggacttct gtcaaggtgt gctcagtgaa ggaacggcat gccagtgaaa 781 tgagggattt gccaagtcga tatgttgaaa tagtgtctca gtttgaaact gaattctaca 841 tcagtggact tgcacctctc tgtgatcagc ttgttgtact ttcgtatgta aaggagattt 901 cagaaaaaac ggaaagagaa tactgtgcca ggcctagact ggacatcatc cagccacttt 961 ctgagacttg tgaagagatc tcttctgatg ctttgacagt cagaggcttt caggagaatg 1021 aatgtagaga ttatcattta gaatactctg aaggggaatc acttttttac atcgtgagtc 1081 cgagagatgt tgtagtggcc aaggaacgag accaagatga tcacattgac tggctccttg 1141 aaaagaagaa atatgaagaa gcattgatgg cagctgaaat tagccaaaaa aatattaaaa 1201 gacataagat tctggatatt ggcttggcat atataaatca cctggtggag agaggagact 1261 atgacatagc agcacgcaaa tgccagaaaa ttcttgggaa aaatgcagca ctctgggaat 1321 atgaagttta taaatttaaa gaaattggac agcttaaggc tattagtcct tatttgccaa 1381 gaggtgatcc agttctgaaa ccactcatct atgaaatgat cttacatgaa tttttggaga 1441 gtgattatga gggttttgcc acattgatcc gagaatggcc tggagatctg tataataatt 1501 cagtcatagt tcaagcagtt cgggatcatt tgaagaaaga tagtcagaac aagactttac 1561 ttaaaaccct ggcagaattg tacacctatg acaagaacta tggcaatgct ctggaaatat 1621 acttaacatt aagacataaa gacgtttttc agttgatcca caagcataat cttttcagtt 1681 ctatcaagga taaaattgtt ttattaatgg attttgattc agagaaagct gttgacatgc 1741 ttttggacaa tgaagataaa atttcaatta aaaaggtagt ggaagaattg gaagacagac 1801 cagagctaca gcatgtgtat ttgcataagc ttttcaagag agaccaccat aaggggcagc 1861 gttaccatga aaaacagatc agtctttatg ctgaatatga tcgaccaaac ttacttccct 1921 ttctccgaga cagtacccat tgcccacttg aaaaggctct tgagatctgt caacagagaa 1981 actttgtaga agagacagtt tatcttctga gccgaatggg taatagccga agtgccctga 2041 agatgattat ggaggaatta catgatgttg ataaagcaat cgaatttgcc aaggagcaag 2101 atgatggaga gctgtgggaa gatttgattt tatattccat tgacaaacca ccatttatta 2161 ctggcttgtt aaacaacatt ggcacacatg ttgacccaat tctactgatt caccgtatta 2221 aggaaggaat ggagatcccc aatttgagag attccttggt taaaattctg caagactaca 2281 atttgcaaat tctgcttcgt gaaggctgca agaagattct cgtagctgac tctttgtcct 2341 tactgaagaa aatgcaccga actcaaatga aaggtgttct tgttgatgag gagaacatct 2401 gtgagtcgtg cctttcccct attcttccat cagatgcagc taagcccttc agcgtggtgg 2461 tcttccattg ccggcacatg ttccacaagg agtgcctgcc catgcccagc atgaactctg 2521 ctgcacagtt ctgcaacatc tgcagtgcta agaaccgtgg accaggaagt gcaattttgg 2581 agatgaaaaa atagctcatt tctccttgtc agtctccttg tcaccactct ttttgagact 2641 gtttttgcaa caacaaaagc atttgttgac actcgtgctg ttaagagatt tgtttatgtt 2701 tatattatac tcaaaaacaa tttcttcatc tattcctgta ctaatggttt ctctttgcag 2761 ttcacagaga atttggggct ctcttcatgc cttgaaattt tggggtccat agtgaatatt 2821 ttgttattta tttgtttggc tcattcttta tatagtaatg gaaacataag tctaggagtt 2881 agaaatgaat tttttagacc ttagtaaaac catttaacca taaaatggac aactgagaat 2941 tctcccagct gcctgaaagc gtcgccaact gtggttatcc tgcaagctgc tacctgcaac 3001 ttggacgttg tttccacgtg ctctgctggc tacgattctt gcattctggg tttggctttt 3061 ttctgtgtca tcaactatgg ttatcctcta aataggcatt taatgaaaca ttgtacaaat 3121 tgtcactcat ttgatgacac ctgggaataa cattagcagg ctgatgtcct gcaccattat 3181 gtttactaat cacatgttct gtgtgctgtg acgactgtca aagagtatct ggccatggcg 3241 gacactcagc atttgttgat tgaataaatg ttagctcttc tcattgtgaa ggactcactt 3301 ttactgggat aaacaaatgc agttaagaat tctggcaccc ttgtaaggaa gaaaagagag 3361 ttcaacacct tcgagtctga gcgcttgtgg ctagagtttg ccaggaggga ggaaaccagt 3421 gaccctgaaa actgagggtg cctcaggagc agtgggacca cctgatgctg aaggacggac 3481 taatgatgtt tcctcttgcc ttctctggtg cctccattgc cctccatgga acagagcata 3541 tcatagaagg agaaaaattc caacttgtaa ttgtgtctta cagttactgg cttcatcttc 3601 cttgggatat atggtcatcc tctaatgagt gtaaaagtgc gcaaaacaca tccttattgt 3661 tcctgatctc ttagtcccat aaatgggaac aaatacagct ttctgcttct ttctttttgg 3721 ggaaaggaca gggtgctagt gagtactgac agcatgccag ctaccaaagt cacccagcca 3781 ttcccatgag cagcagttca tttaattgtc acagcgtcgc caggaagaag atctgataaa 3841 cctaggttta cagataaaga aagcaaaatg tagagatgtt gttgaggtca cagaggtgac 3901 tgcctaactt cagagcaggg cttctgatcc ctttaagaaa ttacagggcc agccgggcat 3961 ggtggctcac gcccgtaatc ccagggcttt gggaagcctt ggcgggtgga tcacctgaga 4021 tcgcacgttc gagaccggcc tgaccaacat ggagaaaccc catctctact aaaaacacaa 4081 attagccagg catggtggta catgcctgta atcccggcta ctcaggaggc tgaggcagga 4141 gaatcacttg accccaggag acatatgttg tggtgagctg aggtctcgcc attgcactcc 4201 agcctgggca acaagagcaa aactccgtct caaaaaagaa aagaaaagaa aagaaatcat 4261 agggccaagt tcaaaggaaa tgcacagaac atatcttcac attagagtta agaattctct 4321 agcaaacaac agattttttt gttgttgtta gtcacaaata cttagaactg gaaggcgctt 4381 tgttattatt gaatgtaccc ctcagccttc tcagcatttc cttatcccaa gactagtgtg 4441 ctttctgcta cactgctagt tttcagtttt gttcttaccc aattgttttt tcttttcaac 4501 attaccaatt tacagattca gtttattaca tttacattaa tcctcactta tgatttgagc 4561 aagctcattt ccagaaaagt ttactttaag atcatcaata ggatttgcta atttcagtga 4621 agtcattttg cttcaggggt aaattatcct agttaccaag tcctatttgg acataaagaa 4681 aatcctactt atagaaaagg agaaaataat taaacagtct tcatttttaa gtaactgatt 4741 taaaaggaaa ataataaaat atgttcgttt atcatttcag aaattgctgt aacacactgg 4801 aaaattcctg aacaatatag attttatcgt taataaaaaa cactagcttt cgttccttag 4861 aatgtctttt cttttgaata aacagtattg ggtgatttaa aaaaaaaaaa aaaaaaaaa // LOCUS HSU87459 752 bp mRNA PRI 16-MAR-1997 DEFINITION Human autoimmunogenic cancer/testis antigen NY-ESO-1 mRNA, complete cds. ACCESSION U87459 NID g1890098 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 752) AUTHORS Chen,Y.-T., Scanlan,M.J., Sahin,U., Tuereci,O., Gure,A., Tsang,S., Williamson,B., Stockert,E., Pfreundschuh,M. and Old,L.J. TITLE A testicular antigen aberrantly expressed in human cancers detected by autologous antibody screening JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (1997) In press REFERENCE 2 (bases 1 to 752) AUTHORS Chen,Y.-T. TITLE Direct Submission JOURNAL Submitted (28-JAN-1997) Ludwig Institute for Cancer Research, New York Branch, 1275 York Avenue, New York, NY 10021, USA FEATURES Location/Qualifiers source 1..752 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="esophageal squamous cell carcinoma expression cDNA library" CDS 54..596 /codon_start=1 /product="autoimmunogenic cancer/testis antigen NY-ESO-1" /db_xref="PID:g1890099" /translation="MQAEGRGTGGSTGDADGPGGPGIPDGPGGNAGGPGEAGATGGRG PRGAGAARASGPGGGAPRGPHGGAASGLNGCCRCGARGPESRLLEFYLAMPFATPMEA ELARRSLAQDAPPLPVPGVLLKEFTVSGNILTIRLTAADHRQLQLSISSCLQQLSLLM WITQCFLPVFLAQPPSGQRR" BASE COUNT 126 a 230 c 256 g 140 t ORIGIN 1 atcctcgtgg gccctgacct tctctctgag agccgggcag aggctccgga gccatgcagg 61 ccgaaggccg gggcacaggg ggttcgacgg gcgatgctga tggcccagga ggccctggca 121 ttcctgatgg cccagggggc aatgctggcg gcccaggaga ggcgggtgcc acgggcggca 181 gaggtccccg gggcgcaggg gcagcaaggg cctcggggcc gggaggaggc gccccgcggg 241 gtccgcatgg cggcgcggct tcagggctga atggatgctg cagatgcggg gccagggggc 301 cggagagccg cctgcttgag ttctacctcg ccatgccttt cgcgacaccc atggaagcag 361 agctggcccg caggagcctg gcccaggatg ccccaccgct tcccgtgcca ggggtgcttc 421 tgaaggagtt cactgtgtcc ggcaacatac tgactatccg actgactgct gcagaccacc 481 gccaactgca gctctccatc agctcctgtc tccagcagct ttccctgttg atgtggatca 541 cgcagtgctt tctgcccgtg tttttggctc agcctccctc agggcagagg cgctaagccc 601 agcctggcgc cccttcctag gtcatgcctc ctcccctagg gaatggtccc agcacgagtg 661 gccagttcat tgtgggggcc tgattgtttg tcgctggagg aggacggctt acatgtttgt 721 ttctgtagaa aataaaactg agctacgaaa aa // LOCUS HSU87836 320 bp mRNA PRI 10-SEP-1997 DEFINITION Homo sapiens htra2-beta-2 mRNA, complete cds. ACCESSION U87836 NID g2367403 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 320) AUTHORS Beil,B., Screaton,G. and Stamm,S. TITLE Molecular cloning of htra2-beta-1 and htra2-beta-2, two human homologs of tra-2 generated by alternative splicing JOURNAL DNA Cell Biol. 16 (6), 679-690 (1997) MEDLINE 97355681 REFERENCE 2 (bases 1 to 320) AUTHORS Beil,B., Screaton,G. and Stamm,S. TITLE Direct Submission JOURNAL Submitted (30-JAN-1997) Max-Planck-Institute for Psychiatry, Am Klopferspitz 18 a, Planegg 82152, Germany FEATURES Location/Qualifiers source 1..320 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 159..275 /note="alternative splice variant of human splicing factor htra2-beta-1; truncated protein that lacks an SR domain" /codon_start=1 /product="htra2-beta-2" /db_xref="PID:g2367404" /translation="MSDSGEQNYGERVNVEEGKCGSRHLTSFINEYLKLRNK" BASE COUNT 104 a 59 c 104 g 53 t ORIGIN 1 ggcacgagcc cgtgcggagg cggtgcggag catttcggct ctgagcggct gggcgaccgg 61 cgcgtcgtgc ggggctgcgg cggagcctcc ttaaggaagg tgcaagaggt tggcagcttc 121 gattgaagca catcgaccgg cgacagcagc caggagtcat gagcgacagc ggcgagcaga 181 actacggcga gcgggttaat gttgaagaag gaaaatgcgg aagtcgtcat ttgacaagtt 241 ttataaatga gtatttgaag ctcaggaata agtgaagctg aaatttgaaa aaaaaaaaaa 301 aaaaaaaaaa aaaaaaaaaa // LOCUS HSU87964 2122 bp mRNA PRI 02-APR-1997 DEFINITION Human putative G-protein (GP-1) mRNA, complete cds. ACCESSION U87964 NID g1916924 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2122) AUTHORS Senju,S. and Nishimura,Y. TITLE Identification of human and mouse GP-1, a putative member of a novel G-protein family JOURNAL Biochem. Biophys. Res. Commun. 231 (2), 360-364 (1997) MEDLINE 97223458 REFERENCE 2 (bases 1 to 2122) AUTHORS Senju,S. TITLE Direct Submission JOURNAL Submitted (31-JAN-1997) Division of Immunogenetics, Department of Neuroscience and Immunology, Kumamoto University Graduate School of Medical Sciences, 2-2-1 Honjo, Kumamoto 860, Japan FEATURES Location/Qualifiers source 1..2122 /organism="Homo sapiens" /db_xref="taxon:9606" gene 154..1908 /gene="GP-1" CDS 154..1908 /gene="GP-1" /codon_start=1 /product="putative G-protein" /db_xref="PID:g1916925" /translation="MDEGCGETIYVIGQGSDGTEYGLSEADMEASYATVKSMAEQIEA DVILLRERQEAGGRVRDYLVRKRVGDNDFLEVRVAVVGNVDAGKSTLLGVLTHGELDN GRGFARQKLFRHKHEIESGRTSSVGNDILGFDSEGNVVNKPDSHGGSLEWTKICEKST KVITFIDLAGHEKYLKTTVFGMTGHLPDFCMLMVGSNAGIVGMTKEHLGLALALNVPV FVVVTKIDMCPANILQETLKLLQRLLKSPGCRKIPVLVQSKDDVIVTASNFSSERMCP IFQISNVTGENLDLLKMFLNLLSPRTSYREEEPAEFQIDDTYSVPGVGTVVSGTTLRG LIKLNDTLLLGPDPLGNFLSIAVKSIHRKRMPVKEVRGGQTASFALKKIKRSSIRKGM VMVSPRLNPQASWEFEAEILVLHHPTTISPRYQAMVHCGSIRQTATILSMDKDCLRTG DKATVHFRFIKTPEYLHIDQRLVFREGRTKAVGTITKLLQTTNNSPMNSKPQQIKMQS TKKGPLTKRDEGGPSGGPAVGAPPPGDEASSVGAGQPAASSNLQPQPKPSSGGRRRGG QRHKVKSQGACVTPASGC" BASE COUNT 495 a 624 c 612 g 391 t ORIGIN 1 gccgcccgac tccacggcgg ctttgactcg gactgcagcg aggacggcga ggcgctcaac 61 ggcgagccag agctggacct caccagcaag ctggttctag tgagccctac atcagagcag 121 tatgacagcc tacttcggca gatgtgggag aggatggacg agggatgcgg agagaccata 181 tatgtcattg ggcagggatc agatgggact gagtatgggc tgagtgaagc tgacatggag 241 gcctcctacg ccacagtgaa gagcatggcg gaacagatag aggccgatgt catccttctg 301 cgggaacggc aagaagctgg gggccgcgtg cgtgattacc tggtccggaa acgagtagga 361 gacaatgact tcctggaggt cagggtagca gtggtgggca acgtggatgc tggcaaaagc 421 acgcttctgg gggtcctgac acatggggag ctggacaatg gccgaggctt tgcccgccag 481 aaactcttcc gccacaaaca tgaaattgaa tctggtcgca ccagcagtgt gggcaacgac 541 attctgggct ttgacagtga aggcaatgta gtgaacaagc ctgacagcca cggcggcagc 601 ctggagtgga ccaagatctg tgagaagtcc acgaaagtca ttaccttcat cgacttggct 661 ggtcatgaga agtacctgaa aaccactgtc ttcggcatga caggccatct gcctgacttc 721 tgcatgctca tggtgggcag caatgctggc atcgtgggga tgaccaaaga acacctgggc 781 ttggcactgg cactcaatgt acctgtcttt gtggtagtca ccaagattga catgtgtcct 841 gccaacatcc tgcaagaaac cctgaagctg ttacagcgcc tgctgaagtc accaggctgc 901 cggaagatcc ccgtgctggt gcagagcaaa gatgatgtga ttgtcacagc ctccaacttc 961 agctctgaaa ggatgtgccc gatattccag atctccaacg ttacaggcga gaacctagat 1021 ctgctgaaga tgttcctcaa cctcctctcc ccccgcacca gctacaggga ggaggagcct 1081 gctgagtttc agattgatga cacctactcc gtcccgggtg tggggacagt ggtttcgggg 1141 acaacactga gaggcctgat caagctgaat gacacgctgc tgctgggccc agaccccttg 1201 ggtaacttcc tgtccattgc tgtcaaatcc atccatcgca agcgcatgcc tgtcaaggag 1261 gtgcggggtg gccagacagc atcctttgcg ctgaagaaga tcaagcgctc gtccatccgg 1321 aagggcatgg tgatggtttc cccacgtttg aatccccaag cctcctggga gtttgaggcc 1381 gagattctcg tcctccacca ccccaccaca attagcccgc gctaccaggc catggtgcac 1441 tgtgggagca tcaggcagac agccaccatt ctgagcatgg acaaggactg tctgcgcact 1501 ggggacaagg ccactgtaca cttccgcttc atcaagaccc ctgagtacct gcacatagac 1561 cagcggctgg tgttccggga aggccgcacc aaggctgtcg gcaccatcac caagctcctc 1621 cagaccacca acaactcccc aatgaactcc aagccgcagc agattaaaat gcagtcgacg 1681 aaaaagggcc ccctgacgaa acgagacgag gggggcccgt ctggtgggcc agcagtagga 1741 gcacccccac ctggagatga agcctcctct gtaggggcag ggcaaccagc tgcgtccagc 1801 aatctccagc ctcagcctaa gcccagcagt ggaggccggc gacgaggggg ccagcgccac 1861 aaggtgaagt cccagggggc ctgtgtgact cctgccagcg gctgctgaac cttcccctgg 1921 cccaccctca ccacccaagg ggtcatcatc tctggccacc actccaccag atgggcagag 1981 cagctatgac cgccacccag ccctcccgct caggccacag ccggagcctc cgcattgccc 2041 ccacccccat tttccagggg ggttgtaatt tataagctga cgaaggtagc cagacttccg 2101 gagactgacc atctctcact gt // LOCUS HSU87967 1704 bp mRNA PRI 18-FEB-1997 DEFINITION Human ATP diphosphohydrolase mRNA, complete cds. ACCESSION U87967 NID g1842119 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1704) AUTHORS Kaczmarek,E., Koziak,K., Sevigny,J., Siegel,J.B., Anrather,J., Beaudoin,A.R., Bach,F.H. and Robson,S.C. TITLE Identification and characterization of CD39/vascular ATP diphosphohydrolase JOURNAL J. Biol. Chem. 271 (51), 33116-33122 (1996) MEDLINE 97115858 REFERENCE 2 (bases 1 to 1704) AUTHORS Robson,S.C., Kaczmarek,E., Siegel,J.B., Candinas,D., Koziak,K., Millan,M., Hancock,W.W. and Bach,F.H. TITLE Loss of ATP diphosphohydrolase activity with endothelial cell activation JOURNAL J. Exp. Med. 185 (1), 153-163 (1997) MEDLINE 97149443 REFERENCE 3 (bases 1 to 1704) AUTHORS Kaczmarek,E., Koziak,K., Sevigny,J., Siegel,J.B., Anrather,J., Beaudoin,A.R., Bach,F.H. and Robson,S.C. TITLE Direct Submission JOURNAL Submitted (30-JAN-1997) Medicine, Harvard University, 99 Brookline Avenue, BIDMC, RN,, Boston, MA 02215, USA FEATURES Location/Qualifiers source 1..1704 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="umbilical vein" /cell_type="endothelial cells" /chromosome="10" CDS 31..1563 /note="CD39" /codon_start=1 /product="ATP diphosphohydrolase" /db_xref="PID:g1842120" /translation="MEDTKESNVKTFCSKNILAILGFSSIIAVIALLAVGLTQNKALP ENVKYGIVLDAGSSHTSLYIYKWPAEKENDTGVVHQVEECRVKGPGISKFVQKVNEIG IYLTDCMERAREVIPRSQHQETPVYLGATAGMRLLRMESEELADRVLDVVERSLSNYP FDFQGARIITGQEEGAYGWITINYLLGKFSQKTRWFSIVPYETNNQETFGALDLGGAS TQVTFVPQNQTIESPDNALQFRLYGKDYNVYTHSFLCYGKDQALWQKLAKDIQVASNE ILRDPCFHPGYKKVVNVSDLYKTPCTKRFEMTLPFQQFEIQGIGNYQQCHQSILELFN TSYCPYSQCAFNGIFLPPLQGDFGAFSAFYFVMKFLNLTSEKVSQEKVTEMMKKFCAQ PWEEIKTSYAGVKEKYLSEYCFSGTYILSLLLQGYHFTADSWEHIHFIGKIQGSDAGW TLGYMLNLTNMIPAEQPLSTPLSHSTYVFLMVLFSLVLFTVAIIGLLIFHKPSYFWKD MV" BASE COUNT 468 a 392 c 403 g 441 t ORIGIN 1 gaaagaggag gaaaacaaaa gctgctactt atggaagata caaaggagtc taacgtgaag 61 acattttgct ccaagaatat cctagccatc cttggcttct cctctatcat agctgtgata 121 gctttgcttg ctgtggggtt gacccagaac aaagcattgc cagaaaacgt taagtatggg 181 attgtgctgg atgcgggttc ttctcacaca agtttataca tctataagtg gccagcagaa 241 aaggagaatg acacaggcgt ggtgcatcaa gtagaagaat gcagggttaa aggtcctgga 301 atctcaaaat ttgttcagaa agtaaatgaa ataggcattt acctgactga ttgcatggaa 361 agagctaggg aagtgattcc aaggtcccag caccaagaga cacccgttta cctgggagcc 421 acggcaggca tgcggttgct caggatggaa agtgaagagt tggcagacag ggttctggat 481 gtggtggaga ggagcctcag caactacccc tttgacttcc agggtgccag gatcattact 541 ggccaagagg aaggtgccta tggctggatt actatcaact atctgctggg caaattcagt 601 cagaaaacaa ggtggttcag catagtccca tatgaaacca ataatcagga aacctttgga 661 gctttggacc ttgggggagc ctctacacaa gtcacttttg taccccaaaa ccagactatc 721 gagtccccag ataatgctct gcaatttcgc ctctatggca aggactacaa tgtctacaca 781 catagcttct tgtgctatgg gaaggatcag gcactctggc agaaactggc caaggacatt 841 caggttgcaa gtaatgaaat tctcagggac ccatgctttc atcctggata taagaaggta 901 gtgaacgtaa gtgaccttta caagaccccc tgcaccaaga gatttgagat gactcttcca 961 ttccagcagt ttgaaatcca gggtattgga aactatcaac aatgccatca aagcatcctg 1021 gagctcttca acaccagtta ctgcccttac tcccagtgtg ccttcaatgg gattttcttg 1081 ccaccactcc agggggattt tggggcattt tcagcttttt actttgtgat gaagttttta 1141 aacttgacat cagagaaagt ctctcaggaa aaggtgactg agatgatgaa aaagttctgt 1201 gctcagcctt gggaggagat aaaaacatct tacgctggag taaaggagaa gtacctgagt 1261 gaatactgct tttctggtac ctacattctc tccctccttc tgcaaggcta tcatttcaca 1321 gctgattcct gggagcacat ccatttcatt ggcaagatcc agggcagcga cgccggctgg 1381 actttgggct acatgctgaa cctgaccaac atgatcccag ctgagcaacc attgtccaca 1441 cctctctccc actccaccta tgtcttcctc atggttctat tctccctggt ccttttcaca 1501 gtggccatca taggcttgct tatctttcac aagccttcat atttctggaa agatatggta 1561 tagcaaaagc agctgaaata tgctggctgg agtgaggaaa aaatcgtcca gggagcattt 1621 tcctccatcg cagtgttcaa ggccatcctt ccctgtctgc cagggccagt cttgacgagt 1681 gtgaagcttc cttggctttt actg // LOCUS HSU88047 2725 bp mRNA PRI 17-OCT-1997 DEFINITION Homo sapiens DNA binding protein homolog (DRIL1) mRNA, complete cds. ACCESSION U88047 NID g2529687 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2725) AUTHORS Kortschak,R.D., Saint,R.B. and Jenne,D.E. TITLE Direct Submission JOURNAL Submitted (02-FEB-1997) Genetics, University of Adelaide, North Terrace, Adelaide, SA 5005, Australia REFERENCE 2 (bases 1 to 2725) AUTHORS Kortschak,R.D., Saint,R.B. and Jenne,D.E. TITLE Direct Submission JOURNAL Submitted (16-OCT-1997) Genetics, University of Adelaide, North Terrace, Adelaide, SA 5005, Australia REMARK Sequence update by submitter FEATURES Location/Qualifiers source 1..2725 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="19" /map="19p13" gene 1..2725 /note="dead ringer-like" /gene="DRIL1" CDS 201..1982 /gene="DRIL1" /codon_start=1 /product="DNA binding protein homolog" /db_xref="PID:g2529688" /translation="MKLQAVMETLLQRQQRARQELEARQQLPPDPPAAPPGRARAAPD EDREPESARMQRAQMAALAAMRAAAAGLGHPASPGGSEDGPPGSEEEDAAREGTPGSP GRGREGPGEEHFEDMASDEDMKPKWEEEEMEEDLGEDEEEEEEDYEDEEEEEDEEGLG PPGPASLGTTALFPRKAQPPQAFRGDGVPRVLGGQERPGPGPAHPGGAAHVAPQLQPP DHGDWTYEEQFKQLYELDGDPKRKEFLDDLFSFMQKRGTPVNRIPIMAKQVLDLFMLY VLVTEKGGLVEVINKKLWREITKGLNLPTSITSAAFTLRTQYMKYLYPYECEKRGLSN PNELQAAIDSNRREGRRQSFGGSLFAYSPGGAHGMLSSPKLPVSSLGLAASTNGSSIT PAPKIKKEEDSAIPITVPGRLPVSLAGHPVVAAQAAAVQAAAAQAAVAAQAAALEQLR EKLESAEPPEKKMALVADEQQRLMQRALQQNFLAMAAQLPMSIRINSQASESRQDSAV NLTGTNGSNSISMSVEINGIMYTGVLFAQPPAPTPTSAPNKGGGGGGGSSSNAGGRGG NTGTSGGQAGPAGLSTPSTSTSNNSLP" BASE COUNT 608 a 830 c 828 g 459 t ORIGIN 1 gcatgcccca tcagccttca gcttgagccc ggcggccccc gcccccgccc cctgccaccc 61 tgcactgccc cggctccccc gcggccccca cgctgcagtg cggccgggcc ccctccccgc 121 aggggccgcc cccgccgccc acccctagcg cccgtggtgg tggtggtggt ggtggtggtg 181 gtggcccggg ccgcagggcc atgaaactac aggccgtgat ggagacgctg ttgcagcggc 241 agcagcgggc gcgccaggag ctggaggccc ggcagcagct gccccccgat ccccctgctg 301 caccccccgg ccgggcccgg gctgcccccg acgaggacag agagcccgag agtgcccgga 361 tgcagcgggc tcagatggcc gcactggcag ccatgcgggc tgcagctgcg ggcctgggac 421 acccagccag ccccggcggc tctgaggatg ggcccccagg ctcggaggag gaggacgcgg 481 cccgggaggg gacaccgggc tcacccgggc gaggcagaga agggccagga gaggagcact 541 ttgaggacat ggcctccgac gaggacatga agcccaaatg ggaggaggag gagatggagg 601 aagacctcgg ggaggatgag gaggaggagg aggaggatta cgaggatgag gaggaggagg 661 aggacgagga ggggctgggc cccccaggcc ctgccagctt gggcaccacg gcactgttcc 721 cccgaaaggc ccagccaccc caggccttcc gcggcgatgg cgttcccagg gtgctggggg 781 gccaggagcg gccggggcct ggccctgccc accccggagg ggccgcccac gtagccccgc 841 agctgcagcc gcctgaccac ggcgactgga cttacgagga gcagtttaag cagctctacg 901 aactcgacgg ggaccccaag aggaaggaat tcctggatga cttgttcagc ttcatgcaga 961 agcgagggac acctgtgaac cgcatcccca tcatggccaa acaggtcctt gacctgttca 1021 tgctgtacgt gctggtgacg gagaagggcg gcctcgtgga ggtcatcaac aagaagctgt 1081 ggcgtgagat caccaagggc ctcaacctgc ccacgtccat caccagtgca gccttcaccc 1141 tgcggaccca atacatgaag tacctgtacc cctacgagtg tgagaagcgg ggcctcagta 1201 accccaatga gctccaggca gccatagaca gcaaccgacg ggagggccgg cgccagagct 1261 ttggtggctc cctctttgcc tactcgccag gcggggcaca cggcatgctc tcctcaccca 1321 agctacccgt gtcctccctg ggcctggccg caagcaccaa tggcagctcc atcacccccg 1381 cccctaagat caagaaagag gaggactcag ccatccccat cacagtccct ggccgcctgc 1441 ctgtgtccct ggcgggccac cctgtggtgg cagcccaggc agcagctgtg caagcagcag 1501 ccgcccaagc agctgtggcc gcacaggcag ctgccctgga acagctgcgg gagaagctgg 1561 agtctgcaga gcctccggag aagaagatgg ccctggtggc cgatgagcag caacggctga 1621 tgcaacgtgc actccagcag aacttcctgg ccatggcggc ccagctgccc atgagcattc 1681 ggatcaacag ccaagcctcc gaaagccgcc aggactctgc tgtgaacctg acgggcacca 1741 acggcagcaa cagcatcagc atgtcggtgg agatcaacgg catcatgtac acaggagttc 1801 tgtttgctca gccgccggcc cccacgccaa cctctgctcc caacaaagga ggcggcggcg 1861 gcggcggcag cagcagcaac gcaggcggcc ggggaggaaa caccggaacc agcggcggcc 1921 aggctgggcc agcggggctg tccacaccct ccacatctac ctcaaataac tcgttgcctt 1981 aaccgcatca ctccccaccc gccacccacc ctggagcccg ccggcctggg cagggggtcc 2041 aggtgggcca cacaggggcc aggatggcgg aagatacggg tggggaggga agatatccag 2101 aaaggagcca cagctgacgc caaaaagaaa agaaaaaaga tatatatata tatatatata 2161 tatatacgta tatatataaa gaagaattta ataaaacagg ggaaaaccaa ggaacacttg 2221 aatttctcag gttttggaca ttcagagaga tgaattgtga gaacagcaaa gaaatccatc 2281 agaaaaacag aaagaggcag acgtttccca gggcgttcag gcagccctga tggaccgaag 2341 gctctggtgt ctggtttggc cccacagcag tgtgggccga tcctgtttac ctcatacatc 2401 cctgcactgt gtgttttcat ttttgtctgc tttagttctc ttttattttc tattcaccac 2461 acactcacca ctcccagctt ctcgtgtcca gtgaaacccc tgaaccaaga tcactgaatt 2521 tttgtttttt tcttgttgct ttgggaaatt tttttttctc tgtagggttt ttaagaggtt 2581 tcgggggttt tgttgtgtaa atattctatt ttattcttgg ggggatcaaa ccttaggaaa 2641 aggatatcta tatatctata tagctatata tttgtgttcc ttcagggaaa ctggtcttga 2701 aaaagcaaga aaaaaaagca aaaaa // LOCUS HSU88063 783 bp mRNA PRI 16-APR-1997 DEFINITION Human Agouti related protein (Art) mRNA, complete cds. ACCESSION U88063 NID g1938362 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 783) AUTHORS Shutter,J.R., Graham,M., Kinsey,A.C., Scully,S., Luthy,R. and Stark,K.L. TITLE Hypothalamic expression of ART, a novel gene related to agouti, is up-regulated in obese and diabetic mutant mice JOURNAL Genes Dev. 11 (5), 593-602 (1997) MEDLINE 97230362 REFERENCE 2 (bases 1 to 783) AUTHORS Stark,K.L. TITLE Direct Submission JOURNAL Submitted (31-JAN-1997) Molecular Genetics, Amgen Inc., 1840 DeHavilland, Thousand Oaks, CA 91320, USA FEATURES Location/Qualifiers source 1..783 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" /map="16q21" gene 301..699 /gene="Art" CDS 301..699 /gene="Art" /codon_start=1 /product="Agouti related protein" /db_xref="PID:g1938363" /translation="MLTAAVLSCALLLALPATRGAQMGLAPMEGIRRPDQALLPELPG LGLRAPLKKTTAEQAEEDLLQEAQALAEVLDLQDREPRSSRRCVRLHESCLGQQVPCC DPCATCYCRFFNAFCYCRKLGTAMNPCSRT" BASE COUNT 175 a 235 c 232 g 141 t ORIGIN 1 agctcctagg tccctgtcct gtggaaattt gtggaccctg ggcaccctct cttgctccca 61 aattttaatc ggctcctgga aacctcaccc caaattggag ataggcactc ctcttgtaga 121 acaaaaggct caggttcagg gagtgagggc ctgaactgtg cccccaccct ccaggaaggg 181 tccttcacgg cctggctgca gggatcagtc acgtgtggcc cttcattagg ccctgccata 241 taagccaagg gcacggggtg gccgggaact ctctaggcaa gaatcccgga ggcagaggcc 301 atgctgaccg cagcggtgct gagctgtgcc ctgctgctgg cactgcctgc cacgcgagga 361 gcccagatgg gcttggcccc catggagggc atcagaaggc ctgaccaggc cctgctccca 421 gagctcccag gcctgggcct gcgggcccca ctgaagaaga caactgcaga acaggcagaa 481 gaggatctgt tgcaggaggc tcaggccttg gcagaggtac tagacctgca ggaccgcgag 541 ccccgctcct cacgtcgctg cgtaaggctg catgagtcct gcctgggaca gcaggtgcct 601 tgctgtgacc catgtgccac gtgctactgc cgcttcttca atgccttctg ctactgccgc 661 aagctgggta ctgccatgaa tccctgcagc cgcacctagc tggccaacgt cagggtcggg 721 gctagggtag gggcaaggaa actcgaataa aggatgggac caacaaaaaa aaaaaaaaaa 781 aaa // LOCUS HSU88540 2366 bp mRNA PRI 02-OCT-1997 DEFINITION Human Toll-like receptor 1 (TLR1) mRNA, complete cds. ACCESSION U88540 NID g2459617 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2366) AUTHORS Rock,F.L., Hardiman,G., Timans,J., Kastelein,R.A. and Bazan,J.F. TITLE A novel family of human receptors structurally related to the Drosophila morphogen Toll JOURNAL Unpublished REFERENCE 2 (bases 1 to 2366) AUTHORS Rock,F.L., Hardiman,G., Timans,J., Kastelein,R.A. and Bazan,J.F. TITLE Direct Submission JOURNAL Submitted (04-FEB-1997) Molecular Biology, DNAX Research Institute, 901 California Ave., Palo Alto, CA 94304, USA FEATURES Location/Qualifiers source 1..2366 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="4" /map="4p14" /cell_type="erythroleukemic TF-1 cells" gene 1..2366 /gene="TLR1" CDS 1..2361 /gene="TLR1" /function="signaling receptor" /note="type 1 transmembrane receptor with an extracellular domain largely composed of leucine-rich repeats and an intracellular segment with similarity to the signaling domains of interleukin-1-type receptors, the intracellular molecule Myd88 and several plant disease resistance gene products" /codon_start=1 /product="Toll-like receptor 1" /db_xref="PID:g2459618" /translation="MTSIFHFAIIFMLILQIRIQLSEESEFLVDRSKNGLIHVPKDLS QKTTILNISQNYISELWTSDILSLSKLRILIISHNRIQYLDISVFKFNQELEYLDLSH NKLVKISCHPTVNLKHLDLSFNAFDALPICKEFGNMSQLKFLGLSTTHLEKSSVLPIA HLNISKVLLVLGETYGEKEDPEGLQDFNTESLHIVFPTNKEFHFILDVSVKTVANLEL SNIKCVLEDNKCSYFLSILAKLQTNPKLSSLTLNNIETTWNSFIRILQLVWHTTVWYF SISNVKLQGQLDFRDFDYSGTSLKALSIHQVVSDVFGFPQSYIYEIFSNMNIKNFTVS GTRMVHMLCPSKISPFLHLDFSNNLLTDTVFENCGHLTELETLILQMNQLKELSKIAE MTTQMKSLQQLDISQNSVSYDEKKGDCSWTKSLLSLNMSSNILTDTIFRCLPPRIKVL DLHSNKIKSIPKQVVKLEALQELNVAFNSLTDLPGCGSFSSLSVLIIDHNSVSHPSAD FFQSCQKMRSIKAGDNPFQCTCELGEFVKNIDQVSSEVLEGWPDSYKCDYPESYRGTL LKDFHMSELSCNITLLIVTIVATMLVLAVTVTSLCIYLDLPWYLRMVCQWTQTRRRAR NIPLEELQRNLQFHAFISYSGHDSFWVKNELLPNLEKEGMQICLHERNFVPGKSIVEN IITCIEKSYKSIFVLSPNFVQSEWCHYELYFAHHNLFHEGSNSLILILLEPIPQYSIP SSYHKLKSLMARRTYLEWPKEKSKRGLFWANLRAAINIKLTEQAKK" BASE COUNT 722 a 487 c 446 g 711 t ORIGIN 1 atgactagca tcttccattt tgccattatc ttcatgttaa tacttcagat cagaatacaa 61 ttatctgaag aaagtgaatt tttagttgat aggtcaaaaa acggtctcat ccacgttcct 121 aaagacctat cccagaaaac aacaatctta aatatatcgc aaaattatat atctgagctt 181 tggacttctg acatcttatc actgtcaaaa ctgaggattt tgataatttc tcataataga 241 atccagtatc ttgatatcag tgttttcaaa ttcaaccagg aattggaata cttggatttg 301 tcccacaaca agttggtgaa gatttcttgc caccctactg tgaacctcaa gcacttggac 361 ctgtcattta atgcatttga tgccctgcct atatgcaaag agtttggcaa tatgtctcaa 421 ctaaaatttc tggggttgag caccacacac ttagaaaaat ctagtgtgct gccaattgct 481 catttgaata tcagcaaggt cttgctggtc ttaggagaga cttatgggga aaaagaagac 541 cctgagggcc ttcaagactt taacactgag agtctgcaca ttgtgttccc cacaaacaaa 601 gaattccatt ttattttgga tgtgtcagtc aagactgtag caaatctgga actatctaat 661 atcaaatgtg tgctagaaga taacaaatgt tcttacttcc taagtattct ggcgaaactt 721 caaacaaatc caaagttatc aagtcttacc ttaaacaaca ttgaaacaac ttggaattct 781 ttcattagga tcctccagct ggtttggcat acaactgtat ggtatttctc aatttcaaac 841 gtgaagctac agggtcagct ggacttcaga gattttgatt attctggcac ttccttgaag 901 gccttgtcta tacaccaagt tgtcagcgat gtgttcggtt ttccgcaaag ttatatctat 961 gaaatctttt cgaatatgaa catcaaaaat ttcacagtgt ctggtacacg catggtccac 1021 atgctttgcc catccaaaat tagcccgttc ctgcatttgg atttttccaa taatctctta 1081 acagacacgg tttttgaaaa ttgtgggcac cttactgagt tggagacact tattttacaa 1141 atgaatcaat taaaagaact ttcaaaaata gctgaaatga ctacacagat gaagtctctg 1201 caacaattgg atattagcca gaattctgta agctatgatg aaaagaaagg agactgttct 1261 tggactaaaa gtttattaag tttaaatatg tcttcaaata tacttactga cactattttc 1321 agatgtttac ctcccaggat caaggtactt gatcttcaca gcaataaaat aaagagcatt 1381 cctaaacaag tcgtaaaact ggaagctttg caagaactca atgttgcttt caattcttta 1441 actgaccttc ctggatgtgg cagctttagc agcctttctg tattgatcat tgatcacaat 1501 tcagtttccc acccatcagc tgatttcttc cagagctgcc agaagatgag gtcaataaaa 1561 gcaggggaca atccattcca atgtacctgt gagctaggag aatttgtcaa aaatatagac 1621 caagtatcaa gtgaagtgtt agagggctgg cctgattctt ataagtgtga ctacccggaa 1681 agttatagag gaaccctact aaaggacttt cacatgtctg aattatcctg caacataact 1741 ctgctgatcg tcaccatcgt tgccaccatg ctggtgttgg ctgtgactgt gacctccctc 1801 tgcatctact tggatctgcc ctggtatctc aggatggtgt gccagtggac ccagacccgg 1861 cgcagggcca ggaacatacc cttagaagaa ctccaaagaa atctccagtt tcatgcattt 1921 atttcatata gtgggcacga ttctttctgg gtgaagaatg aattattgcc aaacctagag 1981 aaagaaggta tgcagatttg ccttcatgag agaaactttg ttcctggcaa gagcattgtg 2041 gaaaatatca tcacctgcat tgagaagagt tacaagtcca tctttgtttt gtctcccaac 2101 tttgtccaga gtgaatggtg ccattatgaa ctctactttg cccatcacaa tctctttcat 2161 gaaggatcta atagcttaat cctgatcttg ctggaaccca ttccgcagta ctccattcct 2221 agcagttatc acaagctcaa aagtctcatg gccaggagga cttatttgga atggcccaag 2281 gaaaagagca aacgtggcct tttttgggct aacttaaggg cagccattaa tattaagctg 2341 acagagcaag caaagaaata gtctag // LOCUS HSU88573 1368 bp mRNA PRI 05-AUG-1997 DEFINITION Human NBR2 mRNA, complete cds. ACCESSION U88573 NID g2304976 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1368) AUTHORS Xu,C.F., Brown,M.A., Nicolai,H., Chambers,J.A., Griffiths,B.L. and Solomon,E. TITLE Isolation and characterisation of the NBR2 gene which lies head to head with the human BRCA1 gene JOURNAL Hum. Mol. Genet. 6 (7), 1057-1062 (1997) MEDLINE 97358579 REFERENCE 2 (bases 1 to 1368) AUTHORS Xu,C.-F., Brown,M.A., Nicolai,H., Chambers,J.A., Griffiths,B.L. and Solomon,E. TITLE Direct Submission JOURNAL Submitted (05-FEB-1997) Division of Medical and Molecular Genetics, UMDS, 8th Floor, Guy's Tower, Guy's Hospital, London SE1 9RT, UK FEATURES Location/Qualifiers source 1..1368 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17" /map="17q21" /tissue_type="breast and placenta" gene 1..1368 /gene="NBR2" exon 1..161 /gene="NBR2" /number=1 exon 162..224 /gene="NBR2" /number=2 exon 225..490 /gene="NBR2" /number=3 CDS 308..646 /gene="NBR2" /function="unknown" /codon_start=1 /db_xref="PID:g2304977" /translation="MWKGGRSHPFLPCSSRRAGSGGQLDSILPHQSPAWGPWGCKDLS SGVPSFLTSSILWKSAVFAEDNGLKIHLCSYKRDDLVLFYDCTSFVLTFGPSPWFLTQ GFLNPLEFSA" exon 491..958 /gene="NBR2" /number=4 exon 959..1368 /gene="NBR2" /number=5 BASE COUNT 357 a 349 c 352 g 310 t ORIGIN 1 ataaagtgcc tgccctctag cctctactct tccagttgcg gcttattgca tcacagtaat 61 tgctgtacga aggtcagaat cgctacctat tgtccaaagc agtcgtaaga agaggtccca 121 atcccccact ctttccgccc taatggaggt ctccagtttc ggtaaaagtt tcatttgatc 181 tgaatagtat taaaataaaa tacctggatg aggaagatga agaggtgctg gaagcccagg 241 aagcacacat caaggctccc ttgccagcag ggtgctgcca ataaaaggta gtcacgtgga 301 atttggaatg tggaaaggag gtagaagtca tcctttcctc ccctgtagca gcaggcgtgc 361 aggctctggt ggtcagctgg actccatact cccccaccag tcaccagcct ggggaccgtg 421 gggctgcaag gacctcagca gcggtgtccc aagtttcctg acttcttcca tcctctggaa 481 atcagctgtg tttgctgagg ataatggcct caagatccat ctgtgttcct acaaaagaga 541 tgatcttgtt cttttttatg attgcacatc ttttgttcta acgtttggtc cttcaccttg 601 gttcctgaca caaggattcc taaatccctt ggaattttct gcatgatagg agcatccttt 661 gttctcatga ggtgactctt ggtgggctcc ttatttgggg actggtcacc aaaaatacct 721 aactatggtt ggaagcttag tgctttcagc cccattcccc atcctctggg atggggagca 781 gagctggagc tcgatcatgc ctgcgtgaca aagcctccag aaaaatcctt gaaagacagg 841 acatggagag ctgctgggtt ggcgaacaca tccatgtgcc gggaggatgg tgcaccccaa 901 ctccacaagg acccttccag acctcaccct gtgtatctct tcatctggct gttcatttag 961 cagctccgag ggcaggcatg gtggctcatg cctgtagtcc cagcactttg ggaggccgag 1021 gcaggtggtt catctaaggt ctggagttcg agaccagcct ggccaacata gcgagagcag 1081 ctccggtggc gggaggagtg gcagcggcca ggcagcccag cttcgcgaag gctgtaggca 1141 caccgcggcc agcaggcacc tggcacccac cttccctgct gccaggatgc ccaagaaaaa 1201 ggtcagctcc accgaagggg ctgccatgga agagcccaag aggagatcag cgcaattgtc 1261 agctaaacct cctgcaaaag tggaagcgaa gccgaaaaag gcagcagcga aggataaatt 1321 ttcagacaca aaagtgcaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa // LOCUS HSU88629 1923 bp DNA PRI 22-APR-1997 DEFINITION Human RNA polymerase II elongation factor ELL2, complete cds. ACCESSION U88629 NID g1946346 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1923) AUTHORS Shilatifard,A., Duan,D.R., Haque,D., Florence,C., Schubach,W.H., Conaway,J.W. and Conaway,R.C. TITLE ELL2, a new member of an ELL family of RNA polymerase II elongation factors JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (8), 3639-3643 (1997) MEDLINE 97268622 REFERENCE 2 (bases 1 to 1923) AUTHORS Shilatifard,A., Duan,D.R., Haque,D., Florence,C., Schubach,W.H., Conaway,J.W. and Conaway,R.C. TITLE Direct Submission JOURNAL Submitted (05-FEB-1997) Molecular and Cell Biology, Oklahoma Medical Research Foundation, 825 NE 13th Street, Oklahoma City, OK 73104, USA FEATURES Location/Qualifiers source 1..1923 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1..1923 /codon_start=1 /product="RNA polymerase II elongation factor ELL2" /db_xref="PID:g1946347" /translation="MAAGGTGGLREEQRYGLSCGRLGQDNITVLHVKLTETAIRALET YQSHKNLIPFRPSIQFQGLHGLVKIPKNDPLNEVHNFNFYLSNVGKDNPQGSFDCIQQ TFSSSGASQLNCLGFIQDKITVCATNDSYQMTRERMTQAEEESRNRSTKVIKPGGPYV GKRVQIRKAPQAVSDTVPERKRSTPMNPANTIRKTHSSSTISQRPYRDRVIHLLALKA YKKPELLARLQKDGVNQKDKNSLGAILQQVANLNSKDLSYTLKDYVFKELQRDWPGYS EIDRRSLESVLSRKLNPSQNATGTSRSESPVCSSRDAVSSPQKRLLDSEFIDPLMNKK ARISHLTNRVPPTLNGHLNPTSEKSAAGLPLPPAAAAIPTPPPLPSTYLPISHPPQIV NSNSNSPSTPEGRGTQDLPVDSFSQNDSIYEDQQDKYTSRTSLETLPPGSVLLKCPKP MEENHSMSHKKSKKKSKKHKEKDQIKKHDIETIEEKEEDLKREEEIAKLNNSSPNSSG GVKEDCTASMEPSAIELPDYLIKYIAIVSYEQRQNYKDDFNAEYDEYRALHARMETVA RRFIKLDAQRKRLSPGSKEYQNVHEEVLQEYQKIKQSSPNYHEEKYRCEYLHNKLAHI KRLIGEFDQQQAESWS" BASE COUNT 630 a 462 c 410 g 421 t ORIGIN 1 atggcggcgg gggggacagg gggcctgcgg gaggagcagc gctatgggct gtcgtgcgga 61 cggctggggc aggacaacat caccgtactg catgtgaagc tcaccgagac ggcgatccgg 121 gcgctcgaga cttaccagag ccacaagaat ttaattcctt ttcgaccttc aatccagttc 181 caaggactcc acgggcttgt caaaattccc aaaaatgatc ccctcaatga agttcataac 241 tttaactttt atttgtcaaa tgtgggcaaa gacaaccctc agggcagctt tgactgcatc 301 cagcaaacat tctccagctc tggagcctcc cagctcaatt gcctgggatt tatacaagat 361 aaaattacag tgtgtgcaac aaacgactcg tatcagatga cacgagaaag aatgacccag 421 gcagaggagg aatcccgcaa ccgaagcaca aaagttatca aacccggtgg accatatgta 481 gggaaaagag tgcaaattcg gaaagcacct caagctgttt cagatacagt tcctgagagg 541 aaaaggtcaa cccccatgaa ccctgcaaat acaattcgaa agacacatag cagcagcacc 601 atctctcaga ggccatacag ggacagggtg attcacttac tggccctgaa ggcctacaag 661 aaaccggagc tacttgctag actccagaaa gatggtgtca atcaaaaaga caagaactcc 721 ctgggagcaa ttctgcaaca ggtagccaat ctgaattcta aggacctctc atatacctta 781 aaggattatg tttttaaaga gcttcaaaga gactggcctg gatacagtga aatagacaga 841 cggtcattgg agtcagtgct ctctagaaaa ctaaatccgt ctcagaatgc tacaggcacc 901 agccgttcag aatctcctgt atgttctagt agagatgctg tatcttctcc tcagaaacgg 961 cttttggatt cagagtttat tgatccttta atgaataaaa aagcccgaat atctcacctg 1021 acgaacagag taccaccaac actaaatggt catttgaatc ccaccagtga aaaatcggct 1081 gcaggcctcc cactgccccc tgcggctgct gccatcccca cccctccacc gctgccttca 1141 acctatctgc ccatctcaca tcctcctcag attgtaaatt ctaactccaa ctcccctagc 1201 actccagaag gccgggggac tcaagaccta cctgttgaca gttttagtca aaacgatagt 1261 atctatgagg accagcaaga caaatatacc tctaggactt ctctggaaac cttaccccct 1321 ggttccgttc tactaaagtg tccaaagcct atggaagaaa accattcaat gtctcacaaa 1381 aagtccaaaa agaagtctaa aaaacataag gaaaaggacc aaataaaaaa gcacgacatt 1441 gagactattg aggaaaagga ggaagatctt aagagagaag aggaaattgc caagctaaat 1501 aactccagtc caaattccag tggaggagtt aaagaggatt gcactgcctc catggaacct 1561 tcagcaattg aactcccaga ttatttgata aaatatatcg ctatcgtctc ctatgagcaa 1621 cgccagaatt ataaggatga cttcaatgca gagtatgatg agtacagagc tttgcatgcc 1681 aggatggaga ctgtagctag aagatttatc aaactagatg cacaaagaaa gcgcctttct 1741 ccaggctcaa aagagtatca gaatgttcat gaagaagtct tacaagaata tcagaagata 1801 aagcagtcta gtcccaatta ccatgaagaa aaatacagat gtgaatatct tcataacaag 1861 ctggctcaca tcaaaaggct aataggtgaa tttgaccaac agcaagcaga gtcatggtcc 1921 tag // LOCUS HSU88666 3745 bp mRNA PRI 14-MAR-1997 DEFINITION Human serine kinase SRPK2 mRNA, complete cds. ACCESSION U88666 NID g1857943 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3745) AUTHORS Wang,H.Y., Wen,L. and Fu,X.D. TITLE Direct Submission JOURNAL Submitted (06-FEB-1997) Cell. Mol. Medicine, University of California at San Diego, 9500 Gilman Drive, La Jolla, CA 92093-0651, USA FEATURES Location/Qualifiers source 1..3745 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 188..2248 /note="similar to human serine kinase SRPK1, encoded by GenBank Accession Number U09564; specific for serine/arginine-rich splicing factors" /codon_start=1 /product="serine kinase SRPK2" /db_xref="PID:g1857944" /translation="MSVNSEKSSSSERPEPQQKAPLVPPPPPPPPPPPPPLPDPTPPE PEEEILGSDDEEQEDPADYCKGGYHPVKIGDLFNGRYHVIRKLGWGHFSTVWLCWDMQ GKRFVAMKVVKSAQHYTETALDEIKLLKCVRESDPSDPNKDMVVQLIDDFKISGMNGI HVCMVFEVLGHHLLKWIIKSNYQGLPVRCVKSIIRQVLQGLDYLHSKCKIIHTDIKPE NILMCVDDAYVRRMAAEPEWQKAGAPPPSGSAVSTAPQQKPIGKISKNKKKKLKKKQK RQAELLEKRLQEIEELEREAERKIIEENITSAAPSNDQDGEYCPEVKLKTTGLEEAAE AETAKDNGEAEDQEEKEDAEKENIEKDEDDVDQELANIDPTWIESPKTNGHIENGPFS LEQQLDDEDDDEEDCPNPEEYNLDEPNAESDYTYSSSYEQFNGELPNGRHKIPESQFP EFSTSLFSGSLEPVACGSVLSEGSPLTEQEESSPSHDRSRTVSASSTGDLPKAKTRAA DLLVNPLDPRNRDKIRVKIADLGNACWVHKHFTEDIQTRQYRSIEVLIGAGYSTPADI WSTACMAFELATGDYLFEPHSGEDYSRDEDHIAHIIELLGSIPRHFALSGKYSREFFN RRGELRHITKLKPWSLFDVLVEKYGWPHEDAAQFTDFLIPMLEMVPEKRASAGECRHP WLNS" BASE COUNT 1163 a 719 c 829 g 1034 t ORIGIN 1 gaattcggca cgaggccatt gaatcccagt cctaacagaa gtactgcgaa tcttgtggcc 61 tcattctgaa caaaagggat tagagaagaa aaatctcttg atataaggct tgaaagcaag 121 ggcaggcaat cttggttgtg aatattttct gatttttcca gaaatcaagc agaagattga 181 gctgctgatg tcagttaact ctgagaagtc gtcctcttca gaaaggccgg agcctcaaca 241 gaaagctcct ttagttcctc ctcctccacc gccaccacca ccaccaccgc cacctttgcc 301 agaccccaca cccccggagc cagaggagga gatcctggga tcagatgatg aggagcaaga 361 ggaccctgcg gactactgca aaggtggata tcatccagtg aaaattggag acctcttcaa 421 tggccggtat catgttatta gaaagcttgg atgggggcac ttctctactg tctggctgtg 481 ctgggatatg caggggaaaa gatttgttgc aatgaaagtt gtaaaaagtg cccagcatta 541 tacggagaca gccttggatg aaataaaatt gctcaaatgt gttcgagaaa gtgatcccag 601 tgacccaaac aaagacatgg tggtccagct cattgacgac ttcaagattt caggcatgaa 661 tgggatacat gtctgcatgg tcttcgaagt acttggccac catctcctca agtggatcat 721 caaatccaac tatcaaggcc tcccagtacg ttgtgtgaag agtatcattc gacaggtcct 781 tcaagggtta gattacttac acagtaagtg caagatcatt catactgaca taaagccgga 841 aaatatcttg atgtgtgtgg atgatgcata tgtgagaaga atggcagctg agcctgagtg 901 gcagaaagca ggtgctcctc ctccttcagg gtctgcagtg agtacggctc cacagcagaa 961 acctatagga aaaatatcta aaaacaaaaa gaaaaaactg aaaaagaaac agaagaggca 1021 ggctgagtta ttggagaagc gcctgcagga gatagaagaa ttggagcgag aagctgaaag 1081 gaaaataata gaagaaaaca tcacctcagc tgcaccttcc aatgaccagg atggcgaata 1141 ctgcccagag gtgaaactaa aaacaacagg attagaggag gcggctgagg cagagactgc 1201 aaaggacaat ggtgaagctg aggaccagga agagaaagaa gatgctgaga aagaaaacat 1261 tgaaaaagat gaagatgatg tagatcagga acttgcgaac atagacccta cgtggataga 1321 atcacctaaa accaatggcc atattgagaa tggcccattc tcactggagc agcaactgga 1381 cgatgaagat gatgatgaag aagactgccc aaatcctgag gaatataatc ttgatgagcc 1441 aaatgcagaa agtgattaca catatagcag ctcctatgaa caattcaatg gtgaattgcc 1501 aaatggacga cataaaattc ccgagtcaca gttcccagag ttttccacct cgttgttctc 1561 tggatcctta gaacctgtgg cctgcggctc tgtgctttct gagggatcac cacttactga 1621 gcaagaggag agcagtccat cccatgacag aagcagaacg gtttcagcct ccagtactgg 1681 ggatttgcca aaagcaaaaa cccgggcagc tgacttgttg gtgaatcccc tggatccgcg 1741 gaatcgagat aaaattagag taaaaattgc tgacctggga aatgcttgtt gggtgcataa 1801 acacttcacg gaagacatcc agacgcgtca gtaccgctcc atagaggttt taataggagc 1861 ggggtacagc acccctgcgg acatctggag cacggcgtgt atggcatttg agctggcaac 1921 gggagattat ttgtttgaac cacattctgg ggaagactat tccagagacg aagaccacat 1981 agcccacatc atagagctgc taggcagtat tccaaggcac tttgctctat ctggaaaata 2041 ttctcgggaa ttcttcaatc gcagaggaga actgcgacac atcaccaagc tgaagccctg 2101 gagcctcttt gatgtacttg tggaaaagta tggctggccc catgaagatg ctgcacagtt 2161 tacagatttc ctgatcccga tgttagaaat ggttccagaa aaacgagcct cagctggcga 2221 atgtcggcat ccttggttga attcttagca aattctacca atattgcatt ctgagctagc 2281 aaatgttccc agtacattgg acctaaacgg tgactctcat tctttaacag gattacaagt 2341 gagctggctt catcctcaga cctttatttt gctttgaggt actgttgttt gacattttgc 2401 tttttgtgca ctgtgatcct ggggaagggt agtcttttgt cttcagctaa gtagtttact 2461 gaccattttc ttctggaaac aataacatgt ctctaagcat tgtttcttgt gttgtgtgac 2521 attcaaatgt catttttttg aatgaaaaat actttcccct ttgtgttttg gcaggttttg 2581 taactattta tgaagaaata ttttagctga gtactatata atttacaatc ttaagaaatt 2641 atcaagttgg aaccaagaaa tagcaaggaa atgtacaatt ttatcttctg gcaaagggac 2701 atcattcctg tattatagtg tatgtaaatg caccctgtaa atgttacttt ccattaaata 2761 tgggaggggg actcaaattt cagaaaagct accaagtctt gagtgctttg tagcctatgt 2821 tgcatgtagc ggactttaac tgctccaagg agttgtgcaa acttttcatt ccataacagt 2881 cttttcacat tggattttaa acaaagtggc tctgggttat aagatgtcat tctctatatg 2941 gcactttaaa ggaagaaaag atatgtttct cattctaaaa tatgcattat aatttagcag 3001 tcccatttgt gattttgcat atttttaaaa gtacttttaa agaagagcaa tttcccttta 3061 aaaatgtgat ggctcagtac catgtcatgt tgcctcctct gggcgctgta agttaagctc 3121 tacatagatt aaattggaga aacgtgttaa ttgtgtggaa tgaaaaaata catatatttt 3181 tggaaaagca tgatcatgct tgtctagaac acaaggtatg gtatatacaa tttgcagtgc 3241 agtgggcaga atacttctca cagctcaaag ataacagtga tcacattcat tccataggta 3301 gctttacgtg tggctacaac aaattttact agctttttca ttgtctttcc atgaaacgaa 3361 gttgagaaaa tgattttccc tttgcaggtt gcacacagtt ttgtttatgc atttccttaa 3421 aattaattgt agactccagg atacaaacca tagtaggcaa tacaatttag aatgtaatat 3481 atagaggtat attagcctct ttagaagtca gtggattgaa tgtcttttta ttttaaattt 3541 tacattcatt aaggtgcctc gtttttgact ttgtccatta acatttatcc atatgccttt 3601 gcaataacta gattgtgaaa agctaacaag tgttgtaaca ataatccatt gtttgaggtg 3661 cttgcagttg tcttaaaaat taaagtgttt tggttttttt ttttccagaa aaaaaaaaaa 3721 aaaaaaaaaa aaaaaaaatt cctgc // LOCUS HSU88878 2600 bp mRNA PRI 02-OCT-1997 DEFINITION Human Toll-like receptor 2 (TLR2) mRNA, complete cds. ACCESSION U88878 NID g2459623 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2600) AUTHORS Rock,F.L., Hardiman,G., Timans,J.C., Kastelein,R.A. and Bazan,J.F. TITLE A novel family of human receptors structurally related to the Drosophila morphogen Toll JOURNAL Unpublished REFERENCE 2 (bases 1 to 2600) AUTHORS Rock,F.L., Hardiman,G., Timans,J.C., Kastelein,R.A. and Bazan,J.F. TITLE Direct Submission JOURNAL Submitted (07-FEB-1997) Molecular Biology, DNAX Research Institute, 901 California Ave, Palo Alto, CA 94304, USA FEATURES Location/Qualifiers source 1..2600 /organism="Homo sapiens" /note="cloned using I.M.A.G.E. EST clones #80633 and #117262 as probes" /db_xref="taxon:9606" /chromosome="4" /map="4q32" gene 130..2484 /gene="TLR2" CDS 130..2484 /gene="TLR2" /function="signaling receptor" /note="type 1 transmembrane receptor; conatains an extracellular domain largely composed of leucine-rich repeats and an intracellular segment similar to the signaling domains of interleukin-1-type receptors, the intracellular molecule Myd88, and several plant disease resistance gene products" /codon_start=1 /product="Toll-like receptor 2" /db_xref="PID:g2459624" /translation="MPHTLWMVWVLGVIISLSKEESSNQASLSCDRNGICKGSSGSLN SIPSGLTEAVKSLDLSNNRITYISNSDLQRCVNLQALVLTSNGINTIEEDSFSSLGSL EHLDLSYNYLSNLSSSWFKPLSSLTFLNLLGNPYKTLGETSLFSHLTKLQILRVGNMD TFTKIQRKDFAGLTFLEELEIDASDLQSYEPKSLKSIQNVSHLILHMKQHILLLEIFV DVTSSVECLELRDTDLDTFHFSELSTGETNSLIKKFTFRNVKITDESLFQVMKLLNQI SGLLELEFDDCTLNGVGNFRASDNDRVIDPGKVETLTIRRLHIPRFYLFYDLSTLYSL TERVKRITVENSKVFLVPCLLSQHLKSLEYLDLSENLMVEEYLKNSACEDAWPSLQTL ILRQNHLASLEKTGETLLTLKNLTNIDISKNSFHSMPETCQWPEKMKYLNLSSTRIHS VTGCIPKTLEILDVSNNNLNLFSLNLPQLKELYISRNKLMTLPDASLLPMLLVLKISR NAITTFSKEQLDSFHTLKTLEAGGNNFICSCEFLSFTQEQQALAKVLIDWPANYLCDS PSHVRGQQVQDVRLSVSECHRTALVSGMCCALFLLILLTGVLCHRFHGLWYMKMMWAW LQAKRKPRKAPSRNICYDAFVSYSERDAYWVENLMVQELENFNPPFKLCLHKRDFIPG KWIIDNIIDSIEKSHKTVFVLSENFVKSEWCKYELDFSHFRLFEENNDAAILILLEPI EKKAIPQRFCKLRKIMNTKTYLEWPMDEAQREGFWVNLRAAIKS" BASE COUNT 741 a 539 c 546 g 774 t ORIGIN 1 ggatccaaag gagacctata gtgactccca ggagctctta gtgaccaagt gaaggtacct 61 gtggggctca ttgtgcccat tgctctttca ctgctttcaa ctggtagttg tgggttgaag 121 cactggacaa tgccacatac tttgtggatg gtgtgggtct tgggggtcat catcagcctc 181 tccaaggaag aatcctccaa tcaggcttct ctgtcttgtg accgcaatgg tatctgcaag 241 ggcagctcag gatctttaaa ctccattccc tcagggctca cagaagctgt aaaaagcctt 301 gacctgtcca acaacaggat cacctacatt agcaacagtg acctacagag gtgtgtgaac 361 ctccaggctc tggtgctgac atccaatgga attaacacaa tagaggaaga ttctttttct 421 tccctgggca gtcttgaaca tttagactta tcctataatt acttatctaa tttatcgtct 481 tcctggttca agcccctttc ttctttaaca ttcttaaact tactgggaaa tccttacaaa 541 accctagggg aaacatctct tttttctcat ctcacaaaat tgcaaatcct gagagtggga 601 aatatggaca ccttcactaa gattcaaaga aaagattttg ctggacttac cttccttgag 661 gaacttgaga ttgatgcttc agatctacag agctatgagc caaaaagttt gaagtcaatt 721 cagaacgtaa gtcatctgat ccttcatatg aagcagcata ttttactgct ggagattttt 781 gtagatgtta caagttccgt ggaatgtttg gaactgcgag atactgattt ggacactttc 841 catttttcag aactatccac tggtgaaaca aattcattga ttaaaaagtt tacatttaga 901 aatgtgaaaa tcaccgatga aagtttgttt caggttatga aacttttgaa tcagatttct 961 ggattgttag aattagagtt tgatgactgt acccttaatg gagttggtaa ttttagagca 1021 tctgataatg acagagttat agatccaggt aaagtggaaa cgttaacaat ccggaggctg 1081 catattccaa ggttttactt attttatgat ctgagcactt tatattcact tacagaaaga 1141 gttaaaagaa tcacagtaga aaacagtaaa gtttttctgg ttccttgttt actttcacaa 1201 catttaaaat cattagaata cttggatctc agtgaaaatt tgatggttga agaatacttg 1261 aaaaattcag cctgtgagga tgcctggccc tctctacaaa ctttaatttt aaggcaaaat 1321 catttggcat cattggaaaa aaccggagag actttgctca ctctgaaaaa cttgactaac 1381 attgatatca gtaagaatag ttttcattct atgcctgaaa cttgtcagtg gccagaaaag 1441 atgaaatatt tgaacttatc cagcacacga atacacagtg taacaggctg cattcccaag 1501 acactggaaa ttttagatgt tagcaacaac aatctcaatt tattttcttt gaatttgccg 1561 caactcaaag aactttatat ttccagaaat aagttgatga ctctaccaga tgcctccctc 1621 ttacccatgt tactagtatt gaaaatcagt aggaatgcaa taactacgtt ttctaaggag 1681 caacttgact catttcacac actgaagact ttggaagctg gtggcaataa cttcatttgc 1741 tcctgtgaat tcctctcctt cactcaggag cagcaagcac tggccaaagt cttgattgat 1801 tggccagcaa attacctgtg tgactctcca tcccatgtgc gtggccagca ggttcaggat 1861 gtccgcctct cggtgtcgga atgtcacagg acagcactgg tgtctggcat gtgctgtgct 1921 ctgttcctgc tgatcctgct cacgggggtc ctgtgccacc gtttccatgg cctgtggtat 1981 atgaaaatga tgtgggcctg gctccaggcc aaaaggaagc ccaggaaagc tcccagcagg 2041 aacatctgct atgatgcatt tgtttcttac agtgagcggg atgcctactg ggtggagaac 2101 cttatggtcc aggagctgga gaacttcaat ccccccttca agttgtgtct tcataagcgg 2161 gacttcattc ctggcaagtg gatcattgac aatatcattg actccattga aaagagccac 2221 aaaactgtct ttgtgctttc tgaaaacttt gtgaagagtg agtggtgcaa gtatgaactg 2281 gacttctccc atttccgtct ttttgaagag aacaatgatg ctgccattct cattcttctg 2341 gagcccattg agaaaaaagc cattccccag cgcttctgca agctgcggaa gataatgaac 2401 accaagacct acctggagtg gcccatggac gaggctcagc gggaaggatt ttgggtaaat 2461 ctgagagctg cgataaagtc ctaggttccc atatttaaga ccagtctttg tctagttggg 2521 atctttatgt cactagttat agttaagttc attcagacat aattatataa aaactacgtg 2581 gatgtaccgt catttgagga // LOCUS HSU88879 3029 bp mRNA PRI 02-OCT-1997 DEFINITION Human Toll-like receptor 3 (TLR3) mRNA, complete cds. ACCESSION U88879 NID g2459625 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3029) AUTHORS Rock,F.L., Hardiman,G., Timans,J.C., Kastelein,R.A. and Bazan,J.F. TITLE A novel family of human receptors structurally related to the Drosophila morphogen Toll JOURNAL Unpublished REFERENCE 2 (bases 1 to 3029) AUTHORS Rock,F.L., Hardiman,G., Timans,J.C., Kastelein,R.A. and Bazan,J.F. TITLE Direct Submission JOURNAL Submitted (07-FEB-1997) Molecular Biology, DNAX Research Institute, 901 California Ave, Palo Alto, CA 94304, USA FEATURES Location/Qualifiers source 1..3029 /organism="Homo sapiens" /note="cloned using I.M.A.G.E. EST clone #144675 as a probe" /db_xref="taxon:9606" /chromosome="4" /map="4q35" gene 74..2788 /gene="TLR3" CDS 74..2788 /gene="TLR3" /function="signaling receptor" /note="type 1 transmembrane receptor; contains an extracellular domain largely composed of leucine-rich repeats and an intracellular segment similar to the signaling domains of interleukin-1-type receptors, the intracellular molecule Myd88, and several plant disease resistance gene products" /codon_start=1 /product="Toll-like receptor 3" /db_xref="PID:g2459626" /translation="MRQTLPCIYFWGGLLPFGMLCASSTTKCTVSHEVADCSHLKLTQ VPDDLPTNITVLNLTHNQLRRLPAANFTRYSQLTSLDVGFNTISKLEPELCQKLPMLK VLNLQHNELSQLSDKTFAFCTNLTELHLMSNSIQKIKNNPFVKQKNLITLDLSHNGLS STKLGTQVQLENLQELLLSNNKIQALKSEELDIFANSSLKKLELSSNQIKEFSPGCFH AIGRLFGLFLNNVQLGPSLTEKLCLELANTSIRNLSLSNSQLSTTSNTTFLGLKWTNL TMLDLSYNNLNVVGNDSFAWLPQLEYFFLEYNNIQHLFSHSLHGLFNVRYLNLKRSFT KQSISLASLPKIDDFSFQWLKCLEHLNMEDNDIPGIKSNMFTGLINLKYLSLSNSFTS LRTLTNETFVSLAHSPLHILNLTKNKISKIESDAFSWLGHLEVLDLGLNEIGQELTGQ EWRGLENIFEIYLSYNKYLQLTRNSFALVPSLQRLMLRRVALKNVDSSPSPFQPLRNL TILDLSNNNIANINDDMLEGLEKLEILDLQHNNLARLWKHANPGGPIYFLKGLSHLHI LNLESNGFDEIPVEVFKDLFELKIIDLGLNNLNTLPASVFNNQVSLKSLNLQKNLITS VEKKVFGPAFRNLTELDMRFNPFDCTCESIAWFVNWINETHTNIPELSSHYLCNTPPH YHGFPVRLFDTSSCKDSAPFELFFMINTSILLIFIFIVLLIHFEGWRISFYWNVSVHR VLGFKEIDRQTEQFEYAAYIIHAYKDKDWVWEHFSSMEKEDQSLKFCLEERDFEAGVF ELEAIVNSIKRSRKIIFVITHHLLKDPLCKRFKVHHAVQQAIEQNLDSIILVFLEEIP DYKLNHALCLRRGMFKSHCILNWPVQKERIGAFRHKLQVALGSKNSVH" BASE COUNT 953 a 638 c 534 g 904 t ORIGIN 1 gcggccgcgt cgacgaaatg tctggatttg gactaaagaa aaaaggaaag gctagcagtc 61 atccaacaga atcatgagac agactttgcc ttgtatctac ttttgggggg gccttttgcc 121 ctttgggatg ctgtgtgcat cctccaccac caagtgcact gttagccatg aagttgctga 181 ctgcagccac ctgaagttga ctcaggtacc cgatgatcta cccacaaaca taacagtgtt 241 gaaccttacc cataatcaac tcagaagatt accagccgcc aacttcacaa ggtatagcca 301 gctaactagc ttggatgtag gatttaacac catctcaaaa ctggagccag aattgtgcca 361 gaaacttccc atgttaaaag ttttgaacct ccagcacaat gagctatctc aactttctga 421 taaaaccttt gccttctgca cgaatttgac tgaactccat ctcatgtcca actcaatcca 481 gaaaattaaa aataatccct ttgtcaagca gaagaattta atcacattag atctgtctca 541 taatggcttg tcatctacaa aattaggaac tcaggttcag ctggaaaatc tccaagagct 601 tctattatca aacaataaaa ttcaagcgct aaaaagtgaa gaactggata tctttgccaa 661 ttcatcttta aaaaaattag agttgtcatc gaatcaaatt aaagagtttt ctccagggtg 721 ttttcacgca attggaagat tatttggcct ctttctgaac aatgtccagc tgggtcccag 781 ccttacagag aagctatgtt tggaattagc aaacacaagc attcggaatc tgtctctgag 841 taacagccag ctgtccacca ccagcaatac aactttcttg ggactaaagt ggacaaatct 901 cactatgctc gatctttcct acaacaactt aaatgtggtt ggtaacgatt cctttgcttg 961 gcttccacaa ctagaatatt tcttcctaga gtataataat atacagcatt tgttttctca 1021 ctctttgcac gggcttttca atgtgaggta cctgaatttg aaacggtctt ttactaaaca 1081 aagtatttcc cttgcctcac tccccaagat tgatgatttt tcttttcagt ggctaaaatg 1141 tttggagcac cttaacatgg aagataatga tattccaggc ataaaaagca atatgttcac 1201 aggattgata aacctgaaat acttaagtct atccaactcc tttacaagtt tgcgaacttt 1261 gacaaatgaa acatttgtat cacttgctca ttctccctta cacatactca acctaaccaa 1321 gaataaaatc tcaaaaatag agagtgatgc tttctcttgg ttgggccacc tagaagtact 1381 tgacctgggc cttaatgaaa ttgggcaaga actcacaggc caggaatgga gaggtctaga 1441 aaatattttc gaaatctatc tttcctacaa caagtacctg cagctgacta ggaactcctt 1501 tgccttggtc ccaagccttc aacgactgat gctccgaagg gtggccctta aaaatgtgga 1561 tagctctcct tcaccattcc agcctcttcg taacttgacc attctggatc taagcaacaa 1621 caacatagcc aacataaatg atgacatgtt ggagggtctt gagaaactag aaattctcga 1681 tttgcagcat aacaacttag cacggctctg gaaacacgca aaccctggtg gtcccattta 1741 tttcctaaag ggtctgtctc acctccacat ccttaacttg gagtccaacg gctttgacga 1801 gatcccagtt gaggtcttca aggatttatt tgaactaaag atcatcgatt taggattgaa 1861 taatttaaac acacttccag catctgtctt taataatcag gtgtctctaa agtcattgaa 1921 ccttcagaag aatctcataa catccgttga gaagaaggtt ttcgggccag ctttcaggaa 1981 cctgactgag ttagatatgc gctttaatcc ctttgattgc acgtgtgaaa gtattgcctg 2041 gtttgttaat tggattaacg agacccatac caacatccct gagctgtcaa gccactacct 2101 ttgcaacact ccacctcact atcatgggtt cccagtgaga ctttttgata catcatcttg 2161 caaagacagt gccccctttg aactcttttt catgatcaat accagtatcc tgttgatttt 2221 tatctttatt gtacttctca tccactttga gggctggagg atatcttttt attggaatgt 2281 ttcagtacat cgagttcttg gtttcaaaga aatagacaga cagacagaac agtttgaata 2341 tgcagcatat ataattcatg cctataaaga taaggattgg gtctgggaac atttctcttc 2401 aatggaaaag gaagaccaat ctctcaaatt ttgtctggaa gaaagggact ttgaggcggg 2461 tgtttttgaa ctagaagcaa ttgttaacag catcaaaaga agcagaaaaa ttatttttgt 2521 tataacacac catctattaa aagacccatt atgcaaaaga ttcaaggtac atcatgcagt 2581 tcaacaagct attgaacaaa atctggattc cattatattg gttttccttg aggagattcc 2641 agattataaa ctgaaccatg cactctgttt gcgaagagga atgtttaaat ctcactgcat 2701 cttgaactgg ccagttcaga aagaacggat aggtgccttt cgtcataaat tgcaagtagc 2761 acttggatcc aaaaactctg tacattaaat ttatttaaat attcaattag caaaggagaa 2821 actttctcaa tttaaaaagt tctatggcaa atttaagttt tccataaagg tgttataatt 2881 tgtttattca tatttgtaaa tgattatatt ctatcacaat tacatctctt ctaggaaaat 2941 gtgtctcctt atttcaggcc tatttttgac aattgactta attttaccca aaataaaaca 3001 tataagcacg caaaaaaaaa aaaaaaaaa // LOCUS HSU88880 3811 bp mRNA PRI 02-OCT-1997 DEFINITION Human Toll-like receptor 4 (TLR4) mRNA, complete cds. ACCESSION U88880 NID g2459627 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3811) AUTHORS Rock,F.L., Hardiman,G., Timans,J.C., Kastelein,R.A. and Bazan,J.F. TITLE A novel family of human receptors structurally related to the Drosophila morphogen Toll JOURNAL Unpublished REFERENCE 2 (bases 1 to 3811) AUTHORS Rock,F.L., Hardiman,G., Timans,J.C., Kastelein,R.A. and Bazan,J.F. TITLE Direct Submission JOURNAL Submitted (07-FEB-1997) Molecular Biology, DNAX Research Institute, 901 California Ave, Palo Alto, CA 94304, USA FEATURES Location/Qualifiers source 1..3811 /organism="Homo sapiens" /note="cloned using I.M.A.G.E. EST clone #202057 as a probe" /db_xref="taxon:9606" /chromosome="9" /map="9q32-33" gene 285..2684 /gene="TLR4" CDS 285..2684 /gene="TLR4" /function="signaling receptor" /note="type 1 transmembrane receptor; contains an extracellular domain largely composed of leucine-rich repeats and an intracellular segment similar to the signaling domains of interleukin-1-type receptors, the intracellular molecule Myd88 and several plant disease resistance gene products" /codon_start=1 /product="Toll-like receptor 4" /db_xref="PID:g2459628" /translation="MELNFYKIPDNLPFSTKNLDLSFNPLRHLGSYSFFSFPELQVLD LSRCEIQTIEDGAYQSLSHLSTLILTGNPIQSLALGAFSGLSSLQKLVAVETNLASLE NFPIGHLKTLKELNVAHNLIQSFKLPEYFSNLTNLEHLDLSSNKIQSIYCTDLRVLHQ MPLLNLSLDLSLNPMNFIQPGAFKEIRLHKLTLRNNFDSLNVMKTCIQGLAGLEVHRL VLGEFRNEGNLEKFDKSALEGLCNLTIEEFRLAYLDYYLDDIIDLFNCLTNVSSFSLV SVTIERVKDFSYNFGWQHLELVNCKFGQFPTLKLKSLKRLTFTSNKGGNAFSEVDLPS LEFLDLSRNGLSFKGCCSQSDFGTTSLKYLDLSFNGVITMSSNFLGLEQLEHLDFQHS NLKQMSEFSVFLSLRNLIYLDISHTHTRVAFNGIFNGLSSLEVLKMAGNSFQENFLPD IFTELRNLTFLDLSQCQLEQLSPTAFNSLSSLQVLNMSHNNFFSLDTFPYKCLNSLQV LDYSLNHIMTSKKQELQHFPSSLAFLNLTQNDFACTCEHQSFLQWIKDQRQLLVEVER MECATPSDKQGMPVLSLNITCQMNKTIIGVSVLSVLVVSVVAVLVYKFYFHLMLLAGC IKYGRGENIYDAFVIYSSQDEDWVRNELVKNLEEGVPPFQLCLHYRDFIPGVAIAANI IHEGFHKSRKVIVVVSQHFIQSRWCIFEYEIAQTWQFLSSRAGIIFIVLQKVEKTLLR QQVELYRLLSRNTYLEWEDSVLGRHIFWRRLRKALLDGKSWNPEGTVGTGCNWQEATS I" BASE COUNT 1070 a 820 c 784 g 1137 t ORIGIN 1 acagggccac tgctgctcac agaagcagtg aggatgatgc caggatgatg tctgcctcgc 61 gcctggctgg gactctgatc ccagccatgg ccttcctctc ctgcgtgaga ccagaaagct 121 gggagccctg cgtggagact tggccctaaa ccacacagaa gagctggcat gaaacccaga 181 gctttcagac tccggagcct cagcccttca ccccgattcc attgcttctt gctaaatgct 241 gccgttttat cacggaggtg gttcctaata ttacttatca atgcatggag ctgaatttct 301 acaaaatccc cgacaacctc cccttctcaa ccaagaacct ggacctgagc tttaatcccc 361 tgaggcattt aggcagctat agcttcttca gtttcccaga actgcaggtg ctggatttat 421 ccaggtgtga aatccagaca attgaagatg gggcatatca gagcctaagc cacctctcta 481 ccttaatatt gacaggaaac cccatccaga gtttagccct gggagccttt tctggactat 541 caagtttaca gaagctggtg gctgtggaga caaatctagc atctctagag aacttcccca 601 ttggacatct caaaactttg aaagaactta atgtggctca caatcttatc caatctttca 661 aattacctga gtatttttct aatctgacca atctagagca cttggacctt tccagcaaca 721 agattcaaag tatttattgc acagacttgc gggttctaca tcaaatgccc ctactcaatc 781 tctctttaga cctgtccctg aaccctatga actttatcca accaggtgca tttaaagaaa 841 ttaggcttca taagctgact ttaagaaata attttgatag tttaaatgta atgaaaactt 901 gtattcaagg tctggctggt ttagaagtcc atcgtttggt tctgggagaa tttagaaatg 961 aaggaaactt ggaaaagttt gacaaatctg ctctagaggg cctgtgcaat ttgaccattg 1021 aagaattccg attagcatac ttagactact acctcgatga tattattgac ttatttaatt 1081 gtttgacaaa tgtttcttca ttttccctgg tgagtgtgac tattgaaagg gtaaaagact 1141 tttcttataa tttcggatgg caacatttag aattagttaa ctgtaaattt ggacagtttc 1201 ccacattgaa actcaaatct ctcaaaaggc ttactttcac ttccaacaaa ggtgggaatg 1261 ctttttcaga agttgatcta ccaagccttg agtttctaga tctcagtaga aatggcttga 1321 gtttcaaagg ttgctgttct caaagtgatt ttgggacaac cagcctaaag tatttagatc 1381 tgagcttcaa tggtgttatt accatgagtt caaacttctt gggcttagaa caactagaac 1441 atctggattt ccagcattcc aatttgaaac aaatgagtga gttttcagta ttcctatcac 1501 tcagaaacct catttacctt gacatttctc atactcacac cagagttgct ttcaatggca 1561 tcttcaatgg cttgtccagt ctcgaagtct tgaaaatggc tggcaattct ttccaggaaa 1621 acttccttcc agatatcttc acagagctga gaaacttgac cttcctggac ctctctcagt 1681 gtcaactgga gcagttgtct ccaacagcat ttaactcact ctccagtctt caggtactaa 1741 atatgagcca caacaacttc ttttcattgg atacgtttcc ttataagtgt ctgaactccc 1801 tccaggttct tgattacagt ctcaatcaca taatgacttc caaaaaacag gaactacagc 1861 attttccaag tagtctagct ttcttaaatc ttactcagaa tgactttgct tgtacttgtg 1921 aacaccagag tttcctgcaa tggatcaagg accagaggca gctcttggtg gaagttgaac 1981 gaatggaatg tgcaacacct tcagataagc agggcatgcc tgtgctgagt ttgaatatca 2041 cctgtcagat gaataagacc atcattggtg tgtcggtcct cagtgtgctt gtagtatctg 2101 ttgtagcagt tctggtctat aagttctatt ttcacctgat gcttcttgct ggctgcataa 2161 agtatggtag aggtgaaaac atctatgatg cctttgttat ctactcaagc caggatgagg 2221 actgggtaag gaatgagcta gtaaagaatt tagaagaagg ggtgcctcca tttcagctct 2281 gccttcacta cagagacttt attcccggtg tggccattgc tgccaacatc atccatgaag 2341 gtttccataa aagccgaaag gtgattgttg tggtgtccca gcacttcatc cagagccgct 2401 ggtgtatctt tgaatatgag attgctcaga cctggcagtt tctgagcagt cgtgctggta 2461 tcatcttcat tgtcctgcag aaggtggaga agaccctgct caggcagcag gtggagctgt 2521 accgccttct cagcaggaac acttacctgg agtgggagga cagtgtcctg gggcggcaca 2581 tcttctggag acgactcaga aaagccctgc tggatggtaa atcatggaat ccagaaggaa 2641 cagtgggtac aggatgcaat tggcaggaag caacatctat ctgaagagga aaaataaaaa 2701 cctcctgagg catttcttgc ccagctgggt ccaacacttg ttcagttaat aagtattaaa 2761 tgctgccaca tgtcaggcct tatgctaagg gtgagtaatt ccatggtgca ctagatatgc 2821 agggctgcta atctcaagga gcttccagtg cagagggaat aaatgctaga ctaaaataca 2881 gagtcttcca ggtgggcatt tcaaccaact cagtcaagga acccatgaca aagaaagtca 2941 tttcaactct tacctcatca agttgaataa agacagagaa aacagaaaga gacattgttc 3001 ttttcctgag tcttttgaat ggaaattgta ttatgttata gccatcataa aaccattttg 3061 gtagttttga ctgaactggg tgttcacttt ttcctttttg attgaataca atttaaattc 3121 tacttgatga ctgcagtcgt caaggggctc ctgatgcaag atgccccttc cattttaagt 3181 ctgtctcctt acagaggtta aagtctaatg gctaattcct aaggaaacct gattaacaca 3241 tgctcacaac catcctggtc attctcgaac atgttctatt ttttaactaa tcacccctga 3301 tatattttta tttttatata tccagttttc atttttttac gtcttgccta taagctaata 3361 tcataaataa ggttgtttaa gacgtgcttc aaatatccat attaaccact atttttcaag 3421 gaagtatgga aaagtacact ctgtcacttt gtcactcgat gtcattccaa agttattgcc 3481 tactaagtaa tgactgtcat gaaagcagca ttgaaataat ttgtttaaag ggggcactct 3541 tttaaacggg aagaaaattt ccgcttcctg gtcttatcat ggacaatttg ggctataggc 3601 atgaaggaag tgggattacc tcaggaagtc accttttctt gattccagaa acatatgggc 3661 tgataaaccc ggggtgacct catgaaatga gttgcagcag atgtttattt ttttcagaac 3721 aagtgatgtt tgatggacct atgaatctat ttagggagac acagatggct gggatccctc 3781 ccctgtaccc ttctcactga caggagaact a // LOCUS HSU88895 887 bp mRNA PRI 24-OCT-1997 DEFINITION Human endogenous retrovirus H D1 leader region/integrase-derived ORF1, ORF2, and putative envelope protein mRNA, complete cds. ACCESSION U88895 NID g2104909 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 887) AUTHORS Lindeskog,M. and Blomberg,J. TITLE Spliced human endogenous retroviral HERV-H env transcripts in T-cell leukaemia cell lines and normal leukocytes: alternative splicing pattern of HERV-H transcripts JOURNAL J. Gen. Virol. 78 (Pt 10), 2575-2585 (1997) MEDLINE 98007634 REFERENCE 2 (bases 1 to 887) AUTHORS Lindeskog,M. and Blomberg,J. TITLE Direct Submission JOURNAL Submitted (07-FEB-1997) Department of Medical Microbiology, Section of Virology, University of Lund, Solvegatan 23, Lund S-22362, Sweden FEATURES Location/Qualifiers source 1..887 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T-cell" /cell_line="leukemia H9" /note="obtained from a PCR-amplified reverse transcribed total RNA using a 5' primer from the primer binding site (PBS) and a 3'primer from the HERV-H envelope gene" misc_feature 1..126 /note="HERV-H D1 leader region" CDS 69..401 /note="ORF derived from D1 leader region and integrase coding region" /codon_start=1 /db_xref="PID:g2104910" /translation="MTSGPQTDQPKKHLTNFKSDPQPYEDNLAGRSVLVKNLTPQTLQ PRWTGPYLVIYSTLTAVRLQDPPHWVHRSRIKLCPSDSQPNPSSSSWKSQVLSPTSLK HTRISEEQ" misc_feature 127..887 /note="HERV-H integrase/envelope region" CDS 581..865 /note="ORF2" /codon_start=1 /db_xref="PID:g2104911" /translation="MPYSCLHRRFTLFLQAITADISWCYPQTATLNSLLEWIDDLCWQ GTLQYFHPDEVLFFTFILTLILIPILMSPSTSPQLSPPHYQPYPFSPSCF" CDS 696..>887 /codon_start=1 /product="putative envelope protein" /db_xref="PID:g2104912" /translation="MIFAGKAPSNTSTLMKFYSLLLYSLLFSFPFLCHPLPLPSYLHH TINLTHSLLAASNPSLANNC" BASE COUNT 212 a 303 c 121 g 251 t ORIGIN 1 cgggggacct cccttgggag atcaatcccc tgtcctcctg ctctttgctc catgagaaag 61 atccacctat gacctcaggt cctcagaccg accagcccaa gaaacatctc accaatttca 121 aatccgatcc ccagccatat gaagacaacc tagctggacg atcagttctt gttaagaatc 181 tgacccctca aactctacaa cctcgatgga ccggacccta cttagtcatc tatagtaccc 241 tgactgccgt ccgcctgcag gatcctcccc actgggttca ccgttccaga ataaagctgt 301 gtccatcgga cagccagcct aatccctcct cttcctcctg gaagtcgcaa gtactctccc 361 caacttccct taaacacact cgtatttctg aagaacagta ataaccctta tgagcctaat 421 acatcccttc attctattag atctgttcat ccttacccta ctttttgcaa cagggcttta 481 cgaagtcacc ccaccactta ggccgagccc caaaaaacta gtcatcccta ctatcttctg 541 tccggtcata ctcctattct ccattctcaa ctacttataa atgccctact cttgtttaca 601 ccgccggttt acactgtttc tccaagccat cacagctgat atctcttggt gctatcccca 661 aactgccact cttaactccc tcttagagtg gatagatgat ctttgctggc aaggcaccct 721 ccaatacttc caccctgatg aagttctatt ctttactttt atactcactc ttattctcat 781 tcccattctt atgtcaccct ctacctctcc ccagctatct ccaccacact atcaacctta 841 cccattctct cctagctgct tctaatccct ccttagcgaa caactgc // LOCUS HSU89278 2555 bp mRNA PRI 25-MAR-1997 DEFINITION Human polyhomeotic 2 homolog (HPH2) mRNA, complete cds. ACCESSION U89278 NID g1877500 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2555) AUTHORS Gunster,M.J., Satijn,D.P., Hamer,K.M., den Blaauwen,J.L., de Bruijn,D., Alkema,M.J., van Lohuizen,M., van Driel,R. and Otte,A.P. TITLE Identification and characterization of interactions between the vertebrate polycomb-group protein BMI1 and human homologs of polyhomeotic JOURNAL Mol. Cell. Biol. 17 (4), 2326-2335 (1997) MEDLINE 97220024 REFERENCE 2 (bases 1 to 2555) AUTHORS Gunster,M.J., Satijn,D.P.E., Hamer,C.M., Den Blaauwen,J.L., De Bruijn,D., Alkema,M.J., Van Driel,R. and Otte,A.P. TITLE Direct Submission JOURNAL Submitted (11-FEB-1997) E.C. Slater Institute, Plantage Muidergracht 12, Amsterdam 1018 TV, The Netherlands FEATURES Location/Qualifiers source 1..2555 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..2555 /gene="HPH2" CDS 9..1310 /gene="HPH2" /function="interacts with the vertebrate polycomb-group protein BMI1" /codon_start=1 /product="polyhomeotic 2 homolog" /db_xref="PID:g1877501" /translation="MCLRGGCSPRAPAAAPQPRPPPALPPRPRAPVPASRPGRPLLTP ARPCGRMRRGSPGPRLGGSRGERRRPAGRDPARVGPGQGLRRPARPGPAAWTETGQGI VHALTDLSIPGMTSGNGNSASSIAGTAPQNGENKPPQAIVKPQILTHVIEGFVIQEGA DVSRWDARLLVGNLKKKYAQGFLPEKLPQQDHTTTTDSEMEEPYLQESKEEGAPLKLK CELCGRVDFAYKFKRSKRFCSMACAKRYNVGCTKRVGLFHSDRSKLQKAGAATHNRRR PAKPVCHHLPRIPRSSQQALCPFRLLLLCVTHSQEDSSRCSDNSSYEEPLSPISASSS TSAGDKASGTWSSPTCICGTWWAWDTTSCQVSHQVNVEDVYEFIRSLPGCQEIAEEFR AQEIDGQALLLLKEDHLMSVMNIKLGPALKIYARISMLKDS" BASE COUNT 538 a 796 c 724 g 497 t ORIGIN 1 ggcgccgcat gtgtctccgc ggcggctgca gccctcgagc gcccgccgcc gcgccccaac 61 cccggccgcc gcccgccctc ccgccccggc ctcgcgcccc cgtcccggcc tcgcgccccg 121 gccgcccttt gttgacgccg gccaggccgt gcggtcggat gcgccgcggc agccccgggc 181 cccggctcgg aggctcccgg ggcgagagga ggcggcccgc cggccgggac cccgcgcgag 241 tcggccccgg ccaggggctg cgtaggcccg cccggccagg cccagccgcc tggacagaga 301 cagggcaggg cattgttcat gcactgaccg acctcagcat ccccggcatg acctcaggga 361 acggaaactc tgcctccagc atcgccggca ctgcccccca gaatggtgag aataaaccac 421 cacaggccat tgtgaaaccc caaatcctga cgcatgttat cgaagggttt gtgatccagg 481 agggggcgga cgtttcccgg tgggacgctc gtctgctggt ggggaatctc aagaagaagt 541 atgcacaggg gttcctgcct gagaaacttc cacagcagga tcacaccacc accactgact 601 cggagatgga ggagccctat ctgcaagaat ccaaagagga gggtgctccc ctcaaactca 661 agtgtgagct ctgtggccgg gtggactttg cctataagtt caagcgttcc aagcgcttct 721 gttccatggc ttgtgcaaag aggtacaacg tgggatgcac caaacgggtg ggacttttcc 781 actcagaccg gagcaagctg cagaaggcag gagctgcgac ccacaaccgc cgtcggccag 841 caaagccagt ctgccaccac ttaccaagga taccaagaag cagccaacag gcactgtgcc 901 cctttcggtt actgctgctt tgcgtaacac acagccagga agactccagc cgttgctcag 961 ataactcaag ctatgaggaa cccttgtcac ccatctcagc cagctcatct acttccgccg 1021 gcgacaaggc cagcgggacc tggagctccc cgacatgcat atgcgggacc tggtgggcat 1081 gggacaccac ttcctgccaa gtgagccacc aagtgaatgt agaagacgtc tacgaattca 1141 tccgctctct gccaggctgc caggagatag cagaggaatt ccgtgcccag gaaatcgacg 1201 ggcaagccct gctgctgctc aaggaggacc acctgatgag cgttatgaac atcaagctgg 1261 ggcccgccct gaagatctac gcccgcatca gcatgctcaa ggactcctag ggctggtggc 1321 accaggattc tggcccaggg cgcctcctcc cgactgagca gagccagaca gacattcctg 1381 aggggcccag aaatggcggc gttggagggc aggggctctc cctaggggca tagctggtga 1441 ggaggtctgg gcacctcctc catggctctc aggggccttt catttctgtg ggaggggcag 1501 agaggtaggt ggcacagaag atggggcttt atgcttgtaa atattgatag cactggcttc 1561 ctccaaagtc ccaatactct agccccgctc tcttcccctc tttctgtccc ccattttcca 1621 gggggtatat ggtcagggct ccccaacctg agttggttac ttcaagggca gccagcaggc 1681 ctggatggag gcctagaaag cccttgcctt ccttcctccc acttctttct ccaggcctgg 1741 ttaactcttc cgttgtcagc ttctccccct tcagcctgtt tctgcagcag ccagggttct 1801 cccccctaca ccctctgcag gtggagagag agaagctggg cccagccgcg gtgcctgctg 1861 gccaagacgc cttaacgctg tgtgtatgac tgtgtgactg tgtgggagcc tggactgaca 1921 gataggccaa gggctactct ctggcatctc caggtgtttt gtagcaaaca gccacttagt 1981 gctttgtcct ggactccact cagcctcagg atggggaata gccaagaatg gcagcctcag 2041 cgcagaggca aggtcagaaa gagacggcgc ttcagagttt cctttccaga cacccctccc 2101 cgcactgtga agttcccctg accgccctcc tggttcacaa agagcattaa gaaagctgcg 2161 gtggtctgag caacatagcc cagacgtgga gcctcctggc ctgcctgccc gcccaccctg 2221 ggagtccagt ggtgaggctc agagaacttc taaggggaaa gaacagctgg agtttctgtt 2281 gatgtgaaga aggcagctct tggcctccca ctcccacact tctttgccta taaatcttcc 2341 tagcagcaat ttgagctacc tgaggaggag gcagggcaga agggcaaggg cctgcctctg 2401 acctgccgtg tcctttgcag gaaggaggta ggcacctttc tgagcttatt ctattcccca 2461 cccacacccc caggcagggt tggaaatgaa ggactttttt aacctttgtt ttgtttttta 2521 aaaataaatc tgtaaaatct gaaaaaaaaa aaaaa // LOCUS HSU89344 7452 bp mRNA PRI 31-MAY-1997 DEFINITION Human acetyl-CoA carboxylase (ACC2) mRNA, complete cds. ACCESSION U89344 NID g2138329 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7452) AUTHORS Abu-Elheiga,L., Almarza-Ortega,D.B., Baldini,A. and Wakil,S.J. TITLE Human acetyl-CoA carboxylase 2. Molecular cloning, characterization, chromosomal mapping, and evidence for two isoforms JOURNAL J. Biol. Chem. 272 (16), 10669-10677 (1997) MEDLINE 97256787 REFERENCE 2 (bases 1 to 7452) AUTHORS Abu-Elheiga,L. TITLE Direct Submission JOURNAL Submitted (11-FEB-1997) Biochemistry, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..7452 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /tissue_type="liver" /dev_stage="adult" /chromosome="12" /map="12q23" /clone_lib="Clontech catalog number 7300-1" gene 1..7452 /gene="ACC2" CDS 1..7452 /gene="ACC2" /EC_number="6.4.1.2" /function="synthesis of malonyl-CoA" /note="acc280" /codon_start=1 /product="acetyl-CoA carboxylase" /db_xref="PID:g2138330" /translation="MVLLLCLSCLIFSCLTFSWLKIWEKMTDSKPITKSKSEANLIPS QEPFPASDNSGETPQRNGEGHTLHKDTQPGRAQPPTKAQRSGRRRNSLPPSRQKPPRN PLSSSDAAPSPELQANGTGTQGLEATDTNGLSSSARPQGSKLVPSKEDKKQANIKRQL MTNFILGSFDDYSSDEDSVAGSSRESTRKGSRASLGALSLEAYLTTGEAETRVPTMRP SMSGLHLVKRGREHKKLDLHRDFTVASPAEFVTRFGGDRVIEKVLIANNGIAAVKCMR SIRRWAYEMFRNERAIRFVRMVTPEDLKANAEYIKMADHYGPAPGGPNNNNYANVELI VDIAKRIPLQAVWAGWGHALENPKLPELLCKNGVAFLGPPRLRPMVGLGDKIASTVVA QTLQVPTLPRSGSALTVEWTEDDLQQGKRISVPEDVYDKGCVKDVDEGLEAAERIGFP LMIKASEGGGGKGIRETESAEDFPILFRQVQSEIPGSPIFLMKLAQHARHLEVQILAD QYGNAVSLFGRDCSIQRRHQKIVEEAPATIAPLAIFEFMEQCAIRLAKTVGYVSAGTV EYLYSQDGSFHFLELNPRLQVEHPCTEMIADVNLPAAQLQIAMGAPLHRLKDIRLLYG ESPWGDSPISFENSAHLPCPRGHVIATRITSENPDEGFKPSSGTVQELNFRSSKNVWG YFTVAATGGLHEFAISQFGHCFSWGENRKEAISNMVVALKELSLRGDFRTTVEYLINL LETESFQNNYIDTGWLDYLIAEKVQKKPNIMLGVVCGALERGDAMFRTCMTDFLHSLE RGQVLPADSLLNLVDVELIYEGVKYILKVTRQSLTMFVLIMNGCHIEIDAHRLNDGGL LLSYNGNSYTTYMKEEVDSYRTIGNKTCVFEKENDPTVLRSPSAGKLTQITVEDGGHV EAGRRYAEMEVMKMIMTLNVQERGRVKYIKRPGAVLEAGCVVARLELDDPSKVHPAEP FTGELPAQQNTADLGKKLHRVFHSVLGSLTNVMSGFCLPEPFFSIKLKEWVQKLMMTL RHPSLLLDVQEIMTSRAGRIPPPVEKSVRKVMAQYASNITSVLCQFPSQQIATILDCH AATLQRKADREVFFINTQSMVQLVQRYRSGIRGHMKTVVIDLLRRYLRVETIFGKARD ADANSSGMVGGVRSLSFTSVWVVLSPPAHYDKCVINLREQFKPDMSQVLDCIFSHAQV TKKNQLVIMLIDELCGPDPSLSDELISILNELTQLSKSEHCKVALRARQILIASPSYE LRHNQVESIFLSAIDMYGHQFCPENLQKLILSETTIFDVLNTFFYHANKVVCMASLEV YVGGAYIAYVLNSLQHRQLPDGTCVVEFQFMLPSSHPNRMTVPISITNPDLLRHTTEL FMDSGFSPLCQRMGAMVAFRRFEDFTRNFDEVISCFANVPKDPPLFSEARTSLYSEDD CKSLREEPIHILNVSIQCADHLEDEALVPILRTFVQSKKNILVDYGLRRIPFLIAQEK EFPKFFTFRARDEFAEDRIYRHLEPALAFQLELNRMRNFDLTAVPCANHKMHLYLGAA KVEGRYEVTDHRFFIRAIIRHSDLITKEASFEYLQNEGERLLLEAMDELEVAFNNTNV RTDCNHIFLNFVPTVIMDPNKIEESVRYMVMRYGSRLWKLRVLQAEVKINIRQTTTGS AVPIRLFITNESGYYLDISLYKEVTDSRSGNIMFHSFGNKQGPQHGMLINTPYVTKDL LQAKRFQAQTLGTTYIYDFPEMFRQALFKLWGSPDKYPKDILTYTELVLDSQGQLVEM NRLPGGNEVGMVAFKMRFKTQEYPEGRDVIVIGNDITFRIGSFGPGEDLLYLRASEMA RAEAIPKIYVAANSGARIGMAEEIKHMFHVAWVDPEDPHKGFKYLYLTPQDYTRISSL NSVHCKHIEEGGESRYMITDIIGKDDGLGVENLRGSGMIAGESSLAYEEIVTISLVTC RAIGIGAYLVRLGQRVIQVENSHIILTGASALNKVLGREVYTSNNQLGGVQIMHYNGV SHITVPDDFEGVYTILEWLSYMPKDNHSPVPIITPTDPIDREIEFLPSRAPYDPRWML AGRPHPTLKGTWQSGFFDHGSFKEIMAPWAQTVVTGRARLGGIPVGVIAVETRTVEVA VPADPANLDSEAKIIQQAGQVWFPDSAYKTAQAIKDFNREKLPLMIFANWRGFSGGMK DMYDQVLKFGAYIVDGLRQYKQPILIYIRPMRELRGGSWVVIDATINPLCIEMYADKE SRGGVLEPEGTVEIKFRKEDLIKSMRRIDPAYKKLMEQLGEPDLSDKDRKDLEGRLKA REDLLLPIYHQVAVQFADFHDTPGRMLEKGVISDILEWKTARTFLYWRLRRLLLEDQV KQEILQASGELSHVHIQSMLRRWFVETEGAVKAYLWDNNQVVVQWLEQHWQAGDGPRS TIRENITYLKHDSVLKTIRGLVEENPEVAVDCVIYLSQHISPAERAQVVHLLSTMDSP AST" BASE COUNT 1787 a 2076 c 2066 g 1523 t ORIGIN 1 atggtcttgc ttctttgtct atcttgtctg attttctcct gtctgacctt ttcctggtta 61 aaaatctggg agaaaatgac ggactccaag ccgatcacca agagtaaatc agaagcaaac 121 ctcatcccga gccaggagcc ctttccagcc tctgataact caggggagac accgcagaga 181 aatggggagg gccacactct gcacaaagac acccagccag gccgagccca gcctcccaca 241 aaggcccaaa gatccggtcg gcggagaaac tccctaccac cctcccgcca gaagccccca 301 agaaaccccc tttcttccag tgacgcagca ccctccccag agcttcaagc caacgggact 361 gggacacaag gtctggaggc cacagatacc aatggcctgt cctcctcagc caggccccag 421 ggcagcaagc tggtcccctc caaagaagac aagaagcagg caaacatcaa gaggcagctg 481 atgaccaact tcatcctggg ctcttttgat gactactcct ccgacgagga ctctgttgct 541 ggctcatctc gtgagtctac ccggaagggc agccgggcca gcttgggggc cctgtccctg 601 gaggcttatc tgaccacagg tgaagctgag acccgcgtcc ccactatgag gccgagcatg 661 tcgggactcc acctggtgaa gaggggacgg gaacacaaga agctggacct gcacagagac 721 tttaccgtgg cttctcccgc tgagtttgtc acacgctttg ggggggatcg ggtcatcgag 781 aaggtgctta ttgccaacaa cgggattgcc gctgtgaagt gcatgcgctc catccgcagg 841 tgggcctatg agatgttccg caacgagcgg gccatccggt ttgttcgcat ggtgaccccc 901 gaggacctta aggccaacgc agagtacatc aagatggcgg atcattacgg gcccgcccca 961 ggagggccca ataacaacaa ctatgccaac gtggagctga ttgtggacat tgccaagaga 1021 atcccgttgc aggcggtgtg ggctggctgg ggccatgctt tagaaaaccc taaacttccg 1081 gagctgctgt gcaagaatgg agttgctttc ttaggccctc ccaggttgag gccaatggtg 1141 ggtctaggag ataagatcgc ctccaccgtt gtcgcccaga cgctacaggt cccaaccctg 1201 cccaggagtg gaagcgccct gacagtggag tggacagaag atgatctgca gcagggaaaa 1261 agaatcagtg tcccagaaga tgtttatgac aagggttgcg tgaaagacgt agatgagggc 1321 ttggaggcag cagaaagaat tggttttcca ttgatgatca aagcttctga aggtggcgga 1381 gggaagggaa tccgggaaac tgagagtgcg gaggacttcc cgatcctttt cagacaagta 1441 cagagtgaga tcccaggctc gcccatcttt ctcatgaagc tggcccagca cgcccgtcac 1501 ctggaagttc agatcctcgc tgaccagtat gggaatgctg tgtctctgtt tggtcgcgac 1561 tgctccatcc agcggcggca tcagaagatc gttgaggaag caccggccac catcgcgccg 1621 ctggccatat tcgagttcat ggagcagtgt gccattcgcc tggccaagac cgtgggctat 1681 gtgagtgcag ggacagtgga atacctctat agtcaggatg gtagcttcca cttcttggag 1741 ctgaatcctc gcttgcaggt ggaacatccc tgcacagaaa tgattgctga cgttaatctg 1801 ccggccgccc agctacagat cgccatgggt gccccactgc accggctgaa agatatccgg 1861 cttctgtatg gagagtcacc ctggggagac tccccaattt cttttgaaaa ctcagctcat 1921 ctcccctgcc cccgaggcca cgtcattgcc accagaatca ccagcgaaaa cccagacgag 1981 ggttttaagc cgagctccgg gactgtccag gaactgaatt tccggagcag caagaacgtc 2041 tggggttact tcacggtggc cgctactgga ggcctgcacg agtttgcgat ttcccagttt 2101 gggcactgct tctcctgggg agagaaccgg aaagaggcca tttcgaacat ggtggtggct 2161 ttgaaggaac tgtccctccg aggcgacttt aggactaccg tggaatacct cattaacctc 2221 ctggagaccg agagcttcca gaacaactac atcgacaccg ggtggttgga ctacctcatt 2281 gctgagaaag tgcaaaagaa accgaatatc atgcttgggg tggtatgcgg ggcccttgaa 2341 cgtggagatg cgatgttcag aacgtgcatg acagatttct tacactccct ggaaaggggc 2401 caggtcctcc cagcggattc actactgaac ctcgtagatg tggaattaat ttacgagggt 2461 gtaaagtaca ttctaaaggt gacccggcag tctctgacca tgttcgttct catcatgaat 2521 ggctgccaca tcgagattga tgcccaccgg ctgaatgatg gggggctcct gctctcctac 2581 aatgggaaca gctacaccac ctacatgaag gaagaggttg acagttaccg taccatcggc 2641 aataagacgt gtgtttttga gaaggagaac gatcctacag tcctgagatc cccctcggct 2701 gggaagctga cacagatcac agtggaggat gggggccacg ttgaggctgg gagacgctac 2761 gctgagatgg aggtgatgaa gatgatcatg accctgaacg ttcaggaaag aggccgggtg 2821 aagtacatca agcgtccagg tgcggtgctg gaagcaggct gcgtggtggc caggctggag 2881 ctcgatgacc cttctaaagt ccacccggct gaaccgttca caggagaact ccctgcccag 2941 cagaacactg ccgacctcgg aaagaaactg cacagggtct tccacagcgt cctgggaagc 3001 ctcaccaacg tcatgagtgg cttttgtctg ccagagccgt tttttagcat aaagctgaag 3061 gagtgggtgc agaagctcat gatgaccctc cggcacccgt cactgctgct ggacgtgcag 3121 gagatcatga ccagtcgtgc aggccgcatc cccccccctg ttgagaagtc tgtccgcaag 3181 gtgatggccc agtatgccag caacatcacc tcggtgctgt gccagttccc cagccagcag 3241 atagccacca tcctggactg ccatgcagcc accctgcagc ggaaggctga tcgagaggtc 3301 ttcttcatca acacccagag catggtgcag ttggtccaga ggtaccgaag tggaatccgc 3361 ggtcatatga aaacagtggt gatcgatctc ttgagaagat acttgcgtgt tgagaccatt 3421 ttcggcaagg caagagatgc tgatgccaac tccagtggga tggtgggggg cgtgaggagc 3481 ctgagcttta cctctgtgtg ggtggttttg tctcccccag cccactacga caagtgtgtg 3541 ataaacctca gggaacagtt caagccagac atgtcccagg tgctggactg catcttctcc 3601 cacgcacagg tgaccaagaa gaaccagctg gtgatcatgt tgatcgatga gctgtgtggc 3661 ccagaccctt ccctgtcgga cgagctgatc tccatcctca acgagctcac tcagctgagc 3721 aaaagcgagc actgcaaagt ggccctcaga gcccggcaga tcctgatcgc ctccccctcc 3781 tacgagctgc ggcataacca ggtggagtcc attttcctgt ctgccattga catgtacggc 3841 caccagttct gccccgagaa cctccagaaa ttaatacttt cggaaacaac catcttcgac 3901 gtcctgaata ctttcttcta tcacgcaaac aaagtcgtgt gcatggcgtc cttggaggtt 3961 tacgtggggg gggcttacat cgcctatgtg ttaaacagcc tgcagcaccg gcagctcccg 4021 gacggcacct gcgtggtaga attccagttc atgctgccgt cctcccaccc aaaccggatg 4081 accgtgccca tcagcatcac caaccctgac ctgctgaggc acacgacaga gctcttcatg 4141 gacagcggct tctccccact gtgccagcgc atgggagcca tggtagcctt caggagattc 4201 gaggacttca ccagaaattt tgatgaagtc atctcttgct tcgccaacgt gccgaaagac 4261 ccccccctct tcagcgaggc ccgcacctcc ctatactccg aggatgactg caagagcctc 4321 agagaagagc ccatccacat tctgaatgtg tccatccagt gtgcggacca cctggaggat 4381 gaggcactgg tgccgatttt acgtacattc gtacagtcca agaaaaatat ccttgtggat 4441 tatggactcc gacgaatccc attcttgatt gcccaagaga aagaatttcc caagtttttc 4501 acattcagag caagagatga gtttgcagaa gatcgcattt accgtcactt ggaacctgcc 4561 ctggctttcc agctggaact caaccggatg cgtaacttcg atctgaccgc cgtgccctgt 4621 gccaaccaca agatgcacct ttacctgggt gctgccaagg tggaaggaag gtatgaagtg 4681 acggaccata ggttcttcat ccgtgccatc atcaggcact ctgacctgat cacaaaggaa 4741 gcctccttcg aatacctgca gaacgagggt gagcggctgc tcctggaggc catggacgag 4801 ctggaggtgg cgttcaataa caccaacgtg cgcaccgact gcaaccacat cttcctcaac 4861 ttcgtgccca ctgtcatcat ggaccccaac aagatcgagg agtccgtgcg ctacatggtt 4921 atgcgctacg gcagccggct gtggaaactc cgtgtgctac aggctgaggt caagatcaac 4981 atccgccaga ccaccaccgg cagtgccgtt cccatccgcc tgttcatcac caatgagtcg 5041 ggctactacc tggacatcag cctctacaaa gaagtgactg actccagatc tggaaatatc 5101 atgtttcact ccttcggcaa caagcaaggg ccccagcacg ggatgctgat caatactccc 5161 tacgtcacca aggatctgct ccaggccaag cgattccagg cccagaccct gggaaccacc 5221 tacatctatg acttcccgga aatgttcagg caggctctct ttaaactgtg gggctcccca 5281 gacaagtatc ccaaagacat cctgacatac actgaattag tgttggactc tcagggccag 5341 ctggtggaga tgaaccgact tcctggtgga aatgaggtgg gcatggtggc cttcaaaatg 5401 aggtttaaga cccaggagta cccggaagga cgggatgtga tcgtcatcgg caatgacatc 5461 acctttcgca ttggatcctt tggccctgga gaggaccttc tgtacctgcg ggcatccgag 5521 atggcccggg cagaggcgat tcccaaaatt tacgtggcag ccaacagtgg cgcccgtatt 5581 ggcatggcag aggagatcaa acacatgttc cacgtggctt gggtggaccc agaagacccc 5641 cacaaaggat ttaaatacct gtacctgact ccccaagact acaccagaat cagctccctg 5701 aactccgtcc actgtaaaca catcgaggaa ggaggagagt ccagatacat gatcacggat 5761 atcatcggga aggatgatgg cttgggcgtg gagaatctga ggggctcagg catgattgct 5821 ggggagtcct ctctggctta cgaagagatc gtcaccatta gcttggtgac ctgccgagcc 5881 attgggattg gggcctactt ggtgaggctg ggccagcgag tgatccaggt ggagaattcc 5941 cacatcatcc tcacaggagc aagtgctctc aacaaggtcc tgggaagaga ggtctacaca 6001 tccaacaacc agctgggtgg cgttcagatc atgcattaca atggtgtctc ccacatcacc 6061 gtgccagatg actttgaggg ggtttatacc atcctggagt ggctgtccta tatgccaaag 6121 gataatcaca gccctgtccc tatcatcaca cccactgacc ccattgacag agaaattgaa 6181 ttcctcccat ccagagctcc ctacgacccc cggtggatgc ttgcaggaag gcctcaccca 6241 actctgaagg gaacgtggca gagcggattc tttgaccacg gcagtttcaa ggaaatcatg 6301 gcaccctggg cgcagaccgt ggtgacagga cgagcaaggc ttggggggat tcccgtggga 6361 gtgattgctg tggagacacg gactgtggag gtggcagtcc ctgcagaccc tgccaacctg 6421 gattctgagg ccaagataat tcagcaggca ggacaggtgt ggttcccaga ctcagcctac 6481 aaaaccgccc aggccatcaa ggacttcaac cgggagaagt tgcccctgat gatctttgcc 6541 aactggaggg ggttctccgg tggcatgaaa gacatgtatg accaggtgct gaagtttgga 6601 gcctacatcg tggacggcct tagacaatac aaacagccca tcctgatcta tatccgccct 6661 atgcgggagc tccggggagg ctcctgggtg gtcatagatg ccaccatcaa cccgctgtgc 6721 atagaaatgt atgcagacaa agagagcagg ggtggtgttc tggaaccaga ggggacagtg 6781 gagattaagt tccgaaagga agatctgata aagtccatga gaaggatcga tccagcttac 6841 aagaagctca tggaacagct aggggaacct gatctctccg acaaggaccg aaaggacctg 6901 gagggccggc taaaggctcg cgaggacctg ctgctcccca tctaccacca ggtggcggtg 6961 cagttcgccg acttccatga cacacccggc cggatgctgg agaagggcgt catatctgac 7021 atcctggagt ggaagaccgc acgcaccttc ctgtattggc gtctgcgccg cctcctcctg 7081 gaggaccagg tcaagcagga gatcctgcag gccagcgggg agctgagtca cgtgcatatc 7141 cagtccatgc tgcgtcgctg gttcgtggag acggaggggg ctgtcaaggc ctacttgtgg 7201 gacaacaacc aggtggttgt gcagtggctg gaacagcact ggcaggcagg ggatggcccg 7261 cgctccacca tccgtgagaa catcacgtac ctgaagcacg actctgtcct caagaccatc 7321 cgaggcctgg ttgaagaaaa ccccgaggtg gccgtggact gtgtgatata cctgagccag 7381 cacatcagcc cagctgagcg ggcgcaggtc gttcacctgc tgtctaccat ggacagcccg 7441 gcctccacct ga // LOCUS HSU89505 1620 bp mRNA PRI 13-MAY-1997 DEFINITION Human Hlark mRNA, complete cds. ACCESSION U89505 NID g2078528 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1620) AUTHORS Jackson,F.R., Banfi,S., Guffanti,A. and Rossi,E. TITLE A novel zinc finger-containing RNA-binding protein conserved from fruitflies to humans JOURNAL Genomics 41 (3), 444-452 (1997) MEDLINE 97312703 REFERENCE 2 (bases 1 to 1620) AUTHORS Jackson,F.R. TITLE Direct Submission JOURNAL Submitted (12-FEB-1997) Neuroscience, Tufts University School of Medicine, 136 Harrison Avenue, Boston, MA 02111, USA FEATURES Location/Qualifiers source 1..1620 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11q13" CDS 56..1156 /note="similar to RRM-type RNA-binding protein; similar to Drosophila melanogaster RNA-binding protein lark, encoded by GenBank Accession Number U59476" /codon_start=1 /product="Hlark" /db_xref="PID:g2078529" /translation="MVKLFIGNLPREATEQEIRSLFEQYGKVLECDIIKNYGFVHIED KTAAEDAIRNLHHYKLHGVNINVEASKNKSKTSTKLHVGNISPTCTNKELRAKFEEYG PVIECDIVKDYAFVHMERAEDAVEAIRGLDNTEFQGKRMHVQLSTSRLRTAPGMGDQS GCYRCGKEGHWSKECPIDRSGRVADLTEQYNEQYGAVRTPYTMSYGDSLYYNNAYGAL DAYYKRCRAARSYEAVQLQLPPCIITQSRPCPSCHKSRIQPWPVTSPPPLSIPTIDTC CRPQELLPQLLLQQQPLLLLLQLPLHITGGIGAPCVALQPQSPLLERATVTGMRVSCP KLQQPRGILCTTWPGMSGSSMPIGRGTQPFKA" BASE COUNT 367 a 431 c 437 g 385 t ORIGIN 1 gaggcaagaa ttcggcacga gggccctgct ggtttctgtg cgggctcttg tcaggatggt 61 gaagctgttc atcggaaacc tgccccggga ggctacagag caggagattc gctcactctt 121 cgagcagtat gggaaggtgc tggaatgtga catcattaag aattacggct ttgtgcacat 181 agaagacaag acggcagctg aggatgccat acgcaacctg caccattaca agcttcatgg 241 ggtgaacatc aacgtggaag ccagcaagaa taagagcaaa acctcaacaa agttgcatgt 301 gggcaacatc agtcccacct gcaccaataa ggagcttcga gccaagtttg aggagtatgg 361 tccggtcatc gaatgtgaca tcgtgaaaga ttatgccttc gtacacatgg agcgggcaga 421 ggatgcagtg gaggccatca ggggccttga taacacagag tttcaaggca aacgaatgca 481 cgtgcagttg tccaccagcc ggcttaggac tgcgcccggg atgggagacc agagcggctg 541 ctatcggtgc gggaaagagg ggcactggtc caaagagtgt ccgatagatc gttcaggccg 601 cgtggcagac ttgaccgagc aatataatga gcaatacgga gcagtgcgta cgccttacac 661 catgagctat ggggattcat tgtattacaa caacgcgtac ggagcgctcg atgcctacta 721 caagcgctgc cgtgctgccc ggtcctatga ggcagtgcag ctgcagctgc ctccgtgtat 781 aattacgcag agcagaccct gtcccagctg ccacaagtcc agaatacagc catggccagt 841 cacctcacct ccacctctct cgatccctac gatagacacc tgttgccgac ctcaggagct 901 gctgccacag ctgctgctgc agcagcagcc gctgctgctg ttactgcagc ttccacttca 961 tattacgggc gggatcggag ccccctgcgt cgcgctacag ccccagtccc cactgttgga 1021 gagggctacg gttacgggca tgagagtgag ttgtcccaag cttcagcagc cgcgcggaat 1081 tctctgtacg acatggcccg gtatgagcgg gagcagtatg ccgatcgggc gcggtactca 1141 gccttttaaa gcttgaggtg ggatgtgtgt gggctgaaat tccgagctgc ggttgtgcat 1201 gagaatacac ccttcgtggt accccatctc cgggacgttc tcggctctgt gcgttcagtc 1261 cctcaggaac cgtggacctt aatttacctt gctaagttca gaccttctct tcctttcctt 1321 tcctttcctc tcctgcccat tttcctgttc ttctgtcctt caatacttct gtagcttccc 1381 attcatgttc tcttctccca gcaggcctca ttgtgtgcag aaactgtggt gggggctgtg 1441 ctgtctcctc cctgcctcct gcctcctgcg gctgttggat ttgggaatga ccttggtgag 1501 agtctcactg ctccagggtc tctttttggt ccaaaggcta gacctataga gttggatcac 1561 tttttttctt tccggtgaaa taaatggttt ttcaacttaa aaaaaaaaaa aaaaaaaaaa // LOCUS HSU89606 960 bp mRNA PRI 22-APR-1997 DEFINITION Human pyridoxal kinase mRNA, complete cds. ACCESSION U89606 NID g1946348 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 960) AUTHORS Hanna,M.C., Turner,A.J. and Kirkness,E.F. TITLE Human pyridoxal kinase. cDNA cloning, expression, and modulation by ligands of the benzodiazepine receptor JOURNAL J. Biol. Chem. 272 (16), 10756-10760 (1997) MEDLINE 97256798 REFERENCE 2 (bases 1 to 960) AUTHORS Hanna,M.C., Turner,A.J. and Kirkness,E.F. TITLE Direct Submission JOURNAL Submitted (13-FEB-1997) Department of Molecular and Cellular Biology, The Institute for Genomic Research, 9712 Medical Center Drive, Rockville, MD 20850, USA FEATURES Location/Qualifiers source 1..960 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="21" /map="21q22" CDS 7..945 /EC_number="2.7.1.35" /function="required for synthesis of pyridoxal-5-phosphate from vitamin B6" /codon_start=1 /product="pyridoxal kinase" /db_xref="PID:g1946349" /translation="MEEECRVLSIQSHVIRGYVGNRAATFPLQVLGFEIDAVNSVQFS NHTGYAHWKGQVLNSDELQELYEGLRLNNMNKYDYVLTGYTRDKSFLAMVVDIVQELK QQNPRLVYVCDPVLGDKWDGEGSMYVPEDLLPVYKEKVVPLADIITPNQFEAELLSGR KIHSQEEALRVMDMLHSMGPDTVVITSSDLPSPQGSNYLIVLGSQRRRNPAGSVVMER IRMDIRKVDAVFVGTGDLFAAMLLAWTHKHPNNLKVACEKTVSTLHHVLQRTIQCAKA QAGEGVRPSPMQLELRMVQSKRDIEDPEIVVQATVL" BASE COUNT 213 a 262 c 305 g 180 t ORIGIN 1 cccggcatgg aggaggagtg ccgggtgctc tccatacaga gccacgtcat ccgcggctac 61 gtgggcaacc gggcggccac gttcccgctg caggttttgg gatttgagat tgacgcggtg 121 aactctgtcc agttttcaaa ccacacaggc tatgcccact ggaagggcca agtgctgaat 181 tcagatgagc tccaggagtt gtacgaaggc ctgaggctga acaacatgaa taaatatgac 241 tacgtgctca caggttatac gagggacaag tcgttcctgg ccatggtggt ggacattgtg 301 caggagctga agcagcagaa ccccaggctg gtgtacgtgt gtgatccagt cttgggtgac 361 aagtgggacg gcgaaggctc gatgtacgtc ccggaggacc tccttcccgt ctacaaagaa 421 aaagtggtgc cgcttgcaga cattatcacg cccaaccagt ttgaggccga gttactgagt 481 ggccggaaga tccacagcca ggaggaagcc ttgcgggtga tggacatgct gcactctatg 541 ggccccgaca ccgtggtcat caccagctcc gacctgccct ccccgcaggg cagcaactac 601 ctgattgtgc tggggagtca gaggaggagg aatcccgctg gctccgtggt gatggaacgc 661 atccggatgg acattcgcaa agtggacgcc gtctttgtgg gcactgggga cctgtttgct 721 gccatgctcc tggcgtggac acacaagcac cccaataacc tcaaggtggc ctgtgagaag 781 accgtgtcta ccttgcacca cgttctgcag aggaccatcc agtgtgcaaa agcccaggcc 841 ggggaaggag tgaggcccag ccccatgcag ctggagctgc ggatggtgca gagcaaaagg 901 gacatcgagg acccagagat cgtcgtccag gccacggtgc tgtgagggcc ccgccgcttg // LOCUS HSU89896 1767 bp mRNA PRI 10-DEC-1997 DEFINITION Homo sapiens casein kinase I gamma 2 mRNA, complete cds. ACCESSION U89896 NID g1890117 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1767) AUTHORS Kitabayashi,A.N., Kusuda,J., Hirai,M. and Hashimoto,K. TITLE Cloning and chromosomal mappping of human casein kinase I gamma 2 (CSNK1G2) JOURNAL Genomics 46 (1), 133-137 (1997) REFERENCE 2 (bases 1 to 1767) AUTHORS Kitabayashi,N.A., Kusuda,J. and Hashimoto,K. TITLE Direct Submission JOURNAL Submitted (18-FEB-1997) Division of Genetic Resources, NIH, 1-23-1, Toyama, Sinjyuku-ku, Tokyo 162, Japan FEATURES Location/Qualifiers source 1..1767 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="testis" CDS 240..1487 /codon_start=1 /product="casein kinase I gamma 2" /db_xref="PID:g1890118" /translation="MDFDKKGGKGETEEGRRMSKAGGGRSSHGIRSSGTSSGVLMVGP NFRVGKKIGCGNFGELRLGKNLYTNEYVAIKLEPIKSRAPQLHLEYRFYKQLSATEGV PQVYYFGPCGKYNAMVLELLGPSLEDLFDLCDRTFTLKTVLMIAIQLITRMEYVHTKS LIYRDVKPENFLVGRPGTKRQHAIHIIDFGLAKEYIDPETKKHIPYREHKSLTGTARY MSINTHLGKEQSRRDDLEALGHMFMYFLRGSLPWQGLKADTLKERYQKIGDTKRATPI EVLCENFPEEMATYLRYVRRLDFFEKPDYDYLRKLFTDLFDRSGFVFDYEYDWAGKPL PTPIGTVHTDLPSQPQLRDKTQPHSKNQALNSTNGELNADDPTAGHSNAPITAPAEVE VADETKCCCFFKRRKRKSLQRHK" BASE COUNT 405 a 569 c 531 g 262 t ORIGIN 1 cggcacgagc agcagaatgt ctcctgcccc cgagagcgac cccgaggcca ctgagaagag 61 cagcgcggcc tggccggccc gaacgcctgc gtctcagtag ctgggagcca cgggcccacg 121 cccgcccacc ggccgcagtg atgttctagc cacagaggag ccaagacctc aggtttccag 181 agacttggga tttgcacggc agcagagtca ccgtggagag gccagggtat cacaaactta 241 tggattttga caagaaagga gggaaagggg agacggagga gggccggaga atgtccaagg 301 ccggcggggg ccggagcagc cacggcatcc ggagctcggg gaccagctcg ggggtcctga 361 tggtgggccc caacttccgc gtcggcaaga agatcggctg cggcaacttc ggggagctcc 421 gcctaggaaa gaatctctat acaaatgaat acgtggctat caaattggag ccgatcaagt 481 cccgggcccc gcagctgcac ctggagtacc ggttctacaa gcagctcagc gccacagagg 541 gcgtccctca ggtctactac ttcggtccgt gcgggaagta caacgccatg gtgctggagc 601 tgctggggcc cagcctggag gacctgttcg acctgtgcga ccggaccttc acgctcaaga 661 cggtgctgat gatcgccatc cagctgatca cgcgcatgga gtatgtgcac accaagagcc 721 taatctaccg ggacgtgaag cccgagaact tcctggtggg ccgcccgggg accaagcggc 781 agcatgccat ccacatcatc gacttcgggc tggccaagga gtacatcgac cccgagacca 841 agaagcacat cccgtaccgc gagcacaaga gcctgacggg cacggcgcgc tacatgagca 901 tcaacacgca cctgggcaag gagcagagcc gccgcgacga cctggaggcg ctgggccaca 961 tgttcatgta cttcctgcgc ggcagcctcc cctggcaggg gctcaaggcc gacacgctca 1021 aggagcggta ccagaagatc ggggacacca aacgcgccac gcccatcgag gtgctctgcg 1081 agaacttccc agaggagatg gccacgtacc tgcgctatgt gcggcgcctg gacttcttcg 1141 agaagcccga ctatgactac ctgcggaagc tcttcaccga cctcttcgac cgcagtggct 1201 tcgtgttcga ctatgagtac gactgggccg ggaagcccct gccgaccccc atcggcaccg 1261 tccacaccga cctgccctcc cagcctcagc tccgggacaa aacccagccg cacagcaaaa 1321 accaggcgtt gaactccacc aacggggagc tgaatgcgga cgaccccacg gccggccact 1381 ccaacgcccc gatcacagcg cctgcagagg tggaggtggc cgatgaaacc aaatgctgct 1441 gtttcttcaa gaggagaaag agaaaatcgc tgcagcgaca caagtgaccc tgggcgcgtg 1501 cagccccctg aatcttctcc gtgcagcccc ttggggcgcg accttgtgcg aggccctcgg 1561 ggcccaccca cagcggccca gggccagacc ctggctggaa gccagaacgc agactgcagg 1621 ggccgcgcct ggctcaggcg gccccacccc cgggacgtgg ggtcacttcc ttcatgtaag 1681 actttggccg aaatttctac acctgtgtct agtcctcccc tccaagagca ttaactattt 1741 aaaacaagga aaaaaaaaaa aaaaaaa // LOCUS HSU90028 3257 bp mRNA PRI 06-JAN-1998 DEFINITION Homo sapiens bicaudal-D (BICD) mRNA, complete cds. ACCESSION U90028 NID g2745975 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3257) AUTHORS Baens,M., Aerssens,J., van Zand,K., Van den Berghe,H. and Marynen,P. TITLE Isolation and regional assignment of human chromosome 12p cDNAs JOURNAL Genomics 29 (1), 44-52 (1995) MEDLINE 96079090 REFERENCE 2 (bases 1 to 3257) AUTHORS Baens,M. and Marynen,P. TITLE A human homologue (BICD1) of the drosophila bicaudal-D gene JOURNAL Genomics 45 (3), 601-606 (1997) MEDLINE 98035884 REFERENCE 3 (bases 1 to 3257) AUTHORS Baens,M. and Marynen,P. TITLE Direct Submission JOURNAL Submitted (19-FEB-1997) Center for Human Genetics, K.U.Leuven, Herestraat 49, Leuven B3000, Belgium FEATURES Location/Qualifiers source 1..3257 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="12" /map="12p11" gene 1..3257 /gene="BICD" CDS 82..3009 /gene="BICD" /codon_start=1 /product="bicaudal-D" /db_xref="PID:g2745976" /translation="MAAEEVLQTVDHYKTEIERLTKELTETTHEKIQAAEYGLVVLEE KLTLKQQYDELEAEYDSLKQELEQLKEAFGQSFSIHRKVAEDGETREETLLQESASKE AYYLGKILEMQNELKQSRAVVTNVQAENERLTAVVQDLKENNEMVELQRIRMKDEIRE YKFREARLLQDYTELEEENITLQKLVSTLKQNQVEYEGLKHEIKRFEEETVLLNSQLE DAIRLKEIAEHQLEEALETLKNEREQKNNLRKELSQYISLNDNHISISVDGLKFAEDG SEPNNDDKMNGHIHGPLVKLNGDYRTPTLRKGESLNPVSDLFSELNISEIQKLKQQLM QVEREKAILLANLQESQTQLEHTKGALTEQHERVHRLTEHVNAMRGLQSSKELKAELD GEKGRDSGEEAHDYEVDINGLEILECKYRVAVTEVIDLKAEIKALKEKYNKSVENYTD EKAKYESKIQMYDEQVTSLEKTTKESGEKMAHMEKELQKMTSIANENHSTLNTAQDEL VTFSEELAQLYHHVCLCNNETPNRVMLDYYRQSRVTRSGSLKGPDDPRGLLSPRLARR GVSSPVETRTSSEPVAKESTEPSKEPSPTKTPTISPVITAPPSSPVLDTSDIRKEPMN IYNLNAIIRDQIKHLQKAVDRSLQLSRQRAAARELAPMIDKDKEALMEEILKLKSLLS TKREQIATLRAVLKANKQTAEVALANLKNKYENEKAMVTETMTKLRNELKALKEDAAT FSSLRTMFATRCDEYVTQLDEMQRQLAAAEDEKKTLNTLLRMAIQQKLALTQRLEDLE FDHEQSRRSKGKLGKSKIGSPKVSGEASVTVPTIDTYLLHSQGPQTPNIRVSSGTQRK RQFSPSLCDQSRPRTSGASYLQNLLRVPPDPTSTESFLLKGPPSMSEFIQGHRLSKEK RLTVAPPDCQQPAASVPPQCSQLAGRQDCPTVSPDTALPEEQPHSSSQCAPLHCLSKP PHP" BASE COUNT 993 a 785 c 837 g 642 t ORIGIN 1 atttccttct ccctttcccc gccagcttcg catccatctc ccccaccccg taaccccctc 61 ctgcctccat ccaccggggc tatggccgca gaagaggtat tgcagacggt ggaccattat 121 aagactgaga tagagaggct aaccaaggag ctcacggaga ccacccacga gaagatccag 181 gctgccgagt acgggctggt ggtgctggag gagaagctga ccctcaaaca gcagtatgat 241 gaactggagg ctgagtacga cagcctcaaa caggagctgg agcagctcaa agaggcattt 301 gggcagtcct tctccatcca ccggaaggtt gctgaagatg gagagactcg ggaggaaacg 361 cttctgcagg agtcagcatc gaaggaggct tactatctgg ggaagatctt ggagatgcag 421 aacgagctga aacagagccg ggctgtggtc actaatgtac aggcagaaaa cgagaggctc 481 accgcagtcg tgcaggatct gaaggagaac aatgagatgg tggagctaca gagaatacgg 541 atgaaggatg aaatccgaga atataagttc cgggaggcac ggctccttca ggactatact 601 gaattggaag aagaaaatat cacattgcag aaactagtgt ccacgttgaa gcagaaccag 661 gttgaatacg aaggcttaaa gcatgagatt aagcgatttg aggaggagac ggtactgctg 721 aacagccagc tggaagatgc catccgattg aaagagattg ctgagcacca actggaagaa 781 gccctcgaga ctttaaaaaa tgaaagagag caaaagaaca acctgcggaa ggagctctcc 841 cagtatatca gcctcaatga taaccatatc agcatctcag tagatggact caaatttgcc 901 gaggatggga gtgaaccaaa caatgatgac aaaatgaacg gtcatatcca tgggcctctt 961 gtgaaactga atggagacta tcggactccc accttaagga aaggagagtc tctgaaccct 1021 gtctctgact tattcagtga gctgaacatt tcagaaatac agaagttgaa gcagcagctt 1081 atgcaggtag agcgggaaaa ggccattctt ttggccaacc tacaggagtc acagacacag 1141 ctggaacaca ccaagggggc actgacggag cagcatgagc gggtgcaccg gctcacagag 1201 cacgtcaatg ccatgagggg cctgcaaagc agcaaggagc tcaaggctga gctggacggg 1261 gagaagggcc gggactcagg ggaggaggcc catgactatg aggtggacat caatggttta 1321 gagatccttg aatgcaaata cagggtggca gtaactgagg tgattgatct gaaagctgaa 1381 attaaggcct taaaggagaa atataataaa tctgtagaaa actacactga tgagaaggcc 1441 aagtatgaga gtaaaatcca gatgtatgat gagcaggtga caagccttga gaagaccacc 1501 aaggagagtg gtgagaagat ggcccacatg gagaaggagt tgcaaaagat gaccagcata 1561 gccaacgaaa atcacagtac ccttaatacg gcccaggatg agttagtgac attcagtgag 1621 gagttagctc agctttacca ccatgtgtgt ctatgtaata atgaaactcc caacagggtc 1681 atgctggatt actataggca gagcagagtc acccgcagtg gcagcctgaa agggcccgat 1741 gatcccagag gacttttgtc cccacgatta gccaggcggg gtgtgtcatc cccggtagaa 1801 acaaggacct catctgaacc agttgcaaaa gaaagcacag agcccagcaa agaaccaagt 1861 ccaactaaga cccccacaat ctctcctgtt attactgccc caccgtcatc tccagtattg 1921 gatacaagtg acatccgcaa agagccaatg aatatctaca accttaatgc cataatccgg 1981 gaccaaatca agcatctgca gaaagctgtg gaccggtcct tgcaactgtc tcgtcaaaga 2041 gcagcagctc gggagctagc ccccatgatt gataaagaca aggaagcctt aatggaagag 2101 atcctcaagc taaagtccct gctgagcacc aaacgggagc agatcgccac attgagggcg 2161 gtgttgaaag ccaacaagca gacagctgag gtggcgctag ctaatctcaa gaacaaatat 2221 gaaaatgaaa aagcaatggt gactgaaacc atgacgaagc ttagaaatga actgaaggct 2281 ttgaaagaag atgctgcaac cttctcatcc ctgagaacaa tgtttgcaac aagatgtgat 2341 gaatatgtca cccagttgga tgagatgcag agacagttag cagctgcaga ggatgagaag 2401 aagactctga acactttgtt acgaatggct atccagcaaa aactcgccct gacccagagg 2461 ctggaggact tagagtttga ccatgagcag tcccgacgca gcaaaggcaa acttggaaag 2521 agcaagatcg gcagccctaa agtaagtggg gaggcatcag tcaccgtgcc caccatagac 2581 acttacctcc tgcatagtca gggcccacag acacccaaca ttcgggtcag cagtggcact 2641 cagaggaaaa gacaattttc accttccctt tgtgatcaga gccgtcccag gacttcaggg 2701 gcttcctacc tacagaattt attaagagtt ccccctgatc ccacctccac agaatcattt 2761 cttctgaagg gccccccttc catgagtgaa ttcatccaag ggcaccggct cagcaaggaa 2821 aaaaggttaa ccgtggctcc accagattgt cagcagcctg ctgcctccgt accgccacag 2881 tgctcacaac tagccgggag gcaagactgc ccaactgtca gtcctgacac agctctccct 2941 gaggagcagc cacattccag ctcccagtgc gcccctctcc actgtctctc caagcctcct 3001 cacccctagt cttcatctcc tgtggacgaa catctggggt ggaagttttg tagccacaca 3061 caggatactg cccaagatcc agcgggtgtt ttcttctcgg ttgttagatg tacaattgga 3121 ttaatgtcca tcgttttgga agacgagaaa gttgagaaga acacgaagca cagaccctga 3181 tgtgataaaa cattttgtgg tttctctgag tcacagataa acttctgcca tcaaatggct 3241 acagttcatt taaattt // LOCUS HSU90142 2720 bp mRNA PRI 20-MAY-1997 DEFINITION Human unknown protein (BT2.1) mRNA, complete cds. ACCESSION U90142 NID g1899191 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2720) AUTHORS Tazi Ahnini,R., Offer,C., Bouchouata,C., Henry,J. and Pontarotti,P. TITLE Direct Submission JOURNAL Submitted (20-FEB-1997) Genetics, CNRS, Grande Bretagne CHU Purpan, Toulouse 31300, France FEATURES Location/Qualifiers source 1..2720 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6p22-p23" gene 230..1819 /gene="BT2.1" CDS 230..1819 /gene="BT2.1" /note="similar to butyrophilin protein" /codon_start=1 /product="unknown protein" /db_xref="PID:g1899192" /translation="MESAAALHFSRPASLLLLLLSLCALVSAHFIVVGPTDPILATVG ENTTLRCHLSPEKNAEDMEVRWFRSQFSPAVFVYKGGRERTEEQMEEYRGRTTFVSKD ISRGSVALVIHNITAQGNGTYRCYFQEGRSYDEAILHLVVAERLGSKPLISMRGHEDG GIRLECISRGWYPKPLTVWRDPYGGVAPALKEVSMPDADGLFMVTTAVIIRDKSVRNM SCSINNTLLGQKKESVIFIPESFMPSVSPLAVCIYWINKLQKEKKILSGEKEFERETR EIALKELEKERVQKEEELQVKEKLQEELRWRRTFLHAVDVVLDPDTAHPDLFLSEDRR SVRRCPFRHLGESVPDNPERFDSQPCVLGRESFASGKHYWEVEVENVIEWTVGVCRDS VERKGEVLLIPQNGFWTLEMHKGQYRAVSSPDRILPLKESLCRVGVFLDYEAGDVSFY NMRDRSHIYTCPRSAFSGPDTSQSGDPPEPIESIPWSHSHVDKPWSFQQPPHNTHLPA ASFTPTTDLSPSFLLLTRLCF" BASE COUNT 732 a 626 c 725 g 632 t 5 others ORIGIN 1 gccctcgagg ccaagaattc ggcacgaggc tccaaacatg gcgacctagg agaaagggaa 61 gaacaatttt ttctcctctt ttgggaaggt ttgcgtctag tagtgcctgt gcccctgggc 121 agattggaga gaagagggac gactggagaa tcgtcgagaa ccagcggaga aaagaaaaag 181 caacgtttaa ttctagaagg cctcctgtcc ctgcctgctc tgggtgctca tggaatcagc 241 tgctgccctg cacttctccc ggccagcctc cctcctcctc ctcctcctca gcctgtgtgc 301 actggtctca gcccacttta tagtcgtggg gcccactgat cccatcttgg ccacggttgg 361 agaaaacact acgttacgct gccatctgtc acccgagaaa aatgctgagg acatggaggt 421 gcggtggttc cggtctcagt tctcccccgc agtgtttgtg tataaaggtg gcagagagag 481 aacagaggag cagatggagg agtaccgagg aagaaccacc tttgtgagca aagacatcag 541 caggggcagc gtggccctgg tcatacacaa catcacagcc cagggaaacg gcacctaccg 601 ctgttacttc caagaaggca ggtcctacga tgaggccatc ctgcacctcg tagtggcaga 661 gagactaggc tctaagcccc tcatttcaat gaggggccat gaagacgggg gcatccggct 721 ggagtgcata tctagagggt ggtacccaaa gcccctcaca gtgtggaggg acccctacgg 781 tggggttgcg cctgccctga aagaggtctc catgcctgat gcagacggcc tcttcatggt 841 caccacggct gtgatcatca gagacaagtc tgtgaggaac atgtcctgct ctatcaacaa 901 caccctgctc ggccagaaga aagaaagtgt catttttatt ccagaatcct ttatgcccag 961 tgtgtctccc ctggccgtat gcatctattg gatcaacaaa ctccaaaagg aaaaaaagat 1021 tctgtcaggg gaaaaggagt ttgaacggga aacaagagaa attgctctaa aggaactgga 1081 gaaagaacgt gtgcaaaaag aggaagaact tcaagtaaaa gagaaacttc aagaagaatt 1141 gcgatggaga agaacattct tacatgctgt tgatgtggtc ctggatccag acaccgctca 1201 tcccgatctc ttcctgtcag aggaccggag aagtgtgaga aggtgcccct tcaggcacct 1261 aggggagagc gtgcctgaca acccagagag attcgacagt cagccttgtg tcctaggccg 1321 ggagagcttc gcttcaggga aacattactg ggaggtggag gtggaaaacg tgattgagtg 1381 gactgtgggg gtctgtagag acagtgttga gaggaaaggg gaggtcctgc tgattcctca 1441 gaatggcttc tggaccttgg agatgcataa agggcaatac cgggccgtgt cctcccctga 1501 taggattctc cctttgaagg agtccctttg ccgggtgggc gtcttcctgg actatgaagc 1561 tggagatgtc tccttctaca acatgaggga cagatcgcac atctacacat gtccccgttc 1621 agccttttcc gggcctgaca cttcacagag tggggaccca ccagagccta tagaatcaat 1681 tccttggtct cacagccatg tagacaagcc ctggtcattt cagcagccac cgcacaacac 1741 ccatcttcca gctgcctctt tcacacccac tacagacctc agccccagtt ttctcctcct 1801 cactaggctg tgtttttagt agttcctttg cttgtaacta tgggatggga tccaggcata 1861 gggaactagt tgttacacag ctcccagcca agaagaaagt gtgagaagtt gatgggcagc 1921 aaacctgctg tttaacatca gggtgaccac attaagccca gtattccagt tggcaccaga 1981 agatatggac ttggaatgag gcctacaggg ttcaccagga tgtaagagga gagaggaatc 2041 cacaggacca ccagagagga gagggaacca gatatgcaga tcagagatag aggaagtgga 2101 accagagagc tgggagggac caaggttgta agggtggcta agtcccacca taacagctaa 2161 ggggacctgg gagatgatgg ctcatttcca cccagcccca ggatttccag agcgcacatc 2221 cacaggcctg gacctgggat gaagatgaat gaagaacatg gatgcacgtg gatgtagttt 2281 ggctcaggtg tccctgcagt tggcaaggag tcagtactca gtccctgagt gtggctgaaa 2341 tttgagttct gggcggngcc cagggngtaa tggaccnaga tntacctcag tattcaagtt 2401 cagtggggac accagtggct tcaaactttc ctggtttcat gatatcttga gacnccttac 2461 aaatgatgga ggattccaaa gagtttttgt ttatttgggt taatatttgt tggtatttat 2521 ggcatttgag attgaaacta agaaatgttt taatttatta cctttacaac atttatttac 2581 attacataca tacatttaca acatttatta atttatatta aaatagcatg aataagccaa 2641 ttataggtta atataagtag aatgtttgtg aaaaataagt atggtatcca aagcaaaata 2701 aattttattg tgaagtgtga // LOCUS HSU90268 2004 bp mRNA PRI 03-JUN-1997 DEFINITION Human Krit1 mRNA, complete cds. ACCESSION U90268 NID g2149601 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2004) AUTHORS Serebriiskii,I.G., Estojak,J., Sonoda,G., Testa,J.R. and Golemis,E.A. TITLE Association of Krev-1/rap1a with Krit1, a novel ankyrin-repeat containing protein mapping to 7q22 JOURNAL Unpublished REFERENCE 2 (bases 1 to 2004) AUTHORS Serebriiskii,I.G., Estojak,J. and Golemis,E.A. TITLE Direct Submission JOURNAL Submitted (20-FEB-1997) Division of Basic Sciences, Fox Chase Cancer Center, 7701 Burholme Ave., Philadelphia, PA 19111, USA FEATURES Location/Qualifiers source 1..2004 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" /map="7q21-q22" gene 26..1615 /gene="Krit1" CDS 26..1615 /gene="Krit1" /note="ankyrin-repeat containing protein" /codon_start=1 /product="Krit1" /db_xref="PID:g2149602" /translation="MGYSALEIKSKMLALEKADTCIYNPLGGSDLQYTNRVDKVVINP YFGLGAPDYSKIQIPKQEKWQRSMSSVTEDKERQWVDDFPLHRSACEGDSELLSRLLS ERFSVNQLDSDHWAPIHYACWYGKVEATRILLEKGKCNPNLLNGQLSSPLHFAAGGGH AEIVQILLNHPETDRHITDQQGRSPLNICEENKQNNWEEAAKLLKEAINKPYEKVRIY RMDGSYRSVELKHGNNTTVQQIMEGMRLSQETQQYFTIWICSENLSLQLKPYHKPLQH VRDWPEILAELTNLDPQRETPQLFLRRDVRLPLEVEKQIEDPLAILILFDEARYNLLK GFYTAPDAKLITLASLLLQIVYGNYESKKHKQGFLNEENLKSIVPVTKLKSKAPHWTN RILHEYKNLSTSEGVSKEMHHLQRMFLQNCWEIPTYGAAFFTGQIFTKASPSNHKVIP VYVGVNIKGLHLLNMETKALLISLKYGCFMWQLGDTDTCFQIHSMENKMSFIVHTKQA GLVVKLLMKLNGQLMATERNS" repeat_region 1702..1997 /rpt_family="Alu" BASE COUNT 687 a 382 c 435 g 500 t ORIGIN 1 gtcagagaca gaaaactcac tacatatggg ctatagtgca ctagaaataa agagtaaaat 61 gttagcccta gagaaagcag atacctgtat ttacaaccct ttgggtggat cagatcttca 121 gtatacaaat cgggtagata aagtggtaat aaatccatac tttggtctag gagctccaga 181 ctactcaaaa atccaaatac ctaaacagga aaaatggcag agaagcatga gcagtgtcac 241 agaagacaag gaacgacagt gggtagatga ttttcctctc caccgaagcg cctgtgaagg 301 agattcagaa ttactaagcc gtcttctcag tgaaagattt tcagtcaacc agttagatag 361 tgaccactgg gcacccattc attatgcatg ctggtatgga aaagttgagg ccactcgcat 421 attgttagag aaaggaaagt gcaatccaaa ccttttaaat ggacaactta gttctcctct 481 tcattttgct gctggaggag gacatgctga aatagtacag attctcctaa accacccaga 541 aacggataga catataacag accaacaagg aagatctcca ttaaatattt gtgaagaaaa 601 caaacaaaac aactgggaag aagctgcaaa attgttgaag gaagcaatta acaaaccata 661 tgaaaaagtt cgaatataca gaatggatgg gtcatatcgt tctgttgaat tgaagcatgg 721 aaataatacc acagtgcagc agataatgga aggaatgcgt ctctctcaag aaactcagca 781 atatttcact atatggattt gttcagaaaa cctcagcctt caactcaaac catatcataa 841 acccttgcaa catgttcgtg actggccaga aatacttgct gaattgacta atctggatcc 901 tcaaagggaa acacctcagc tttttctaag aagagatgtg agacttccct tggaagttga 961 aaaacagatt gaagacccac tagctattct tattctcttt gatgaagcca gatataattt 1021 attgaagggc ttttatacag ctcctgatgc taagctgata acattggcaa gtctgctttt 1081 gcaaatagtc tatggaaatt atgagagtaa aaaacacaag caaggtttcc taaatgaaga 1141 aaatctaaaa tccatcgtac ctgttaccaa actgaaaagt aaggcacctc actggacaaa 1201 tcgcatactt catgaataca agaatctcag tacaagtgaa ggtgtcagta aagaaatgca 1261 tcaccttcag cgcatgttct tacagaattg ctgggaaatt cctacttatg gagcagcatt 1321 tttcacagga cagatattta caaaggcaag ccccagcaat cataaagtca tccctgtgta 1381 tgtaggagtg aatataaaag gacttcatct cctcaacatg gaaactaagg ctttactcat 1441 cagtcttaag tatggttgtt ttatgtggca attgggagat actgatactt gttttcagat 1501 ccatagcatg gaaaataaaa tgagctttat agtacataca aaacaggctg gtctcgtggt 1561 aaaactgtta atgaagctaa atggacagtt aatggccact gaaagaaatt catgaaagag 1621 aagtaactgt tactcaagcc accacatttt ggtgatgcag agtttccttt ccgcgaaaga 1681 tttcttaaaa tattactttt gggccctagc atggcgggtc aaccctgtaa ttccagcact 1741 ttgggagggt gggggcaggg cgggatcaac tgaagtcaga gttcaagaca gcctgggcaa 1801 catggtgaaa cctgtctcta caaaaataca aaaattaggt gggtgtggtg gggggcgcct 1861 attcatccta gctactaggg gagggcaagg tgggggagat cgcttaaccc caggaggtgg 1921 gggttgttgt gagccaagat tgcaccacgg cacgctagcc tgggtgacac aggaagactc 1981 catctcaaaa aaaaaaaaaa aaaa // LOCUS HSU90304 1825 bp mRNA PRI 21-MAR-1997 DEFINITION Human iroquois-class homeodomain protein IRX-2a mRNA, complete cds. ACCESSION U90304 NID g1899219 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1825) AUTHORS Lewis,M.T., Strickland,P.A., Ross,S., Snyder,C.J. and Daniel,C.W. TITLE IRX: A new family of human homeobox genes from the breast JOURNAL Unpublished REFERENCE 2 (bases 1 to 1825) AUTHORS Lewis,M.T., Strickland,P.A., Ross,S., Snyder,C.J. and Daniel,C.W. TITLE Direct Submission JOURNAL Submitted (21-FEB-1997) Biology, University of California, Santa Cruz, CA 95064, USA FEATURES Location/Qualifiers source 1..1825 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 376..1629 /codon_start=1 /product="iroquois-class homeodomain protein IRX-2a" /db_xref="PID:g1899220" /translation="MAVETTVHTHLSASPPQGSPYDHTPGMAGSLGYHPYAAPLGSYP YGDPAYRKNATRDATATLKAWLNEHRKNPYPTKGEKIMLAIITKMTLTQVSTWFANAR RRLKKENKMTWTPRNRSEDEEEEENIDLEKNDEDEPQKPEDKGDPEGPEAGGAEQKAA SGCERLQGPPTPAGKETEGSLSDSDFKEPPSEGRLDALQGPPRTGGPSPAGPAAARLA EDPAPHYPAGAPAPGPHPAAGEVPPGPGGPSVIHSPPPPPPPAVLAKPKLWSLAEIAT LSDKVKDGGGGNEGSPCPPCPGPIAGQALGGSRASPAPAPSRSPSAQCPFPGGTVLSR PLYYTAPFYPGYTNYGSFGHLHGHPGPGPGPTTGPGSHFNGLNQTVLNRADALAKDPK MLRSQSQLDLCKDSPYELKKGMSDI" BASE COUNT 376 a 616 c 553 g 280 t ORIGIN 1 aagagccctg acttcccttg ttttcccccc ttgcgcccaa cgtgcgtccg ctcccccgcc 61 gagcgcggag tcgcctcagt tgcccaggcc tctatctgca tggagggccg ggccgccgtg 121 accagatctg cgcacggggt acggacgtgc ccgggcagat gggggcctac ggggtgacac 181 cgaggccggg acagcttcag gggccccaga aggacctgac ccagaaattg aggtccccgc 241 tgccttctga ggagggggag gagttgctcc taggtctgaa ccccgccagc cttgccccgt 301 aggaagctgg agtgcgggcc tcgtccaccc acagaccccg gggagcgcag ggaaaagggt 361 gcttcggtcg ttccgatggc agtggagacc acggtccaca ctcacctctc tgcgtctcca 421 ccgcagggct ctccctacga ccacacaccc ggcatggcgg gctccttggg gtaccatcct 481 tacgcggcgc ccctgggatc gtacccttac ggggacccag cgtaccggaa gaacgccaca 541 agggacgcca cggctaccct caaggcctgg ctcaacgagc accgcaagaa cccctacccc 601 accaagggcg agaagatcat gctggccatc atcaccaaga tgaccctcac ccaggtgtcc 661 acctggttcg ccaacgcgcg ccggcgcctc aagaaagaga ataaaatgac gtggacgccg 721 cggaaccgca gcgaggacga ggaagaggag gagaacattg acctggagaa gaacgacgag 781 gacgagcccc agaagcccga ggacaagggc gaccccgagg gccccgaagc aggaggagct 841 gagcagaagg cggcttcggg ctgcgaacgg cttcagggac cacccacccc tgcaggcaag 901 gagacggagg gcagcctcag cgactcggat tttaaggagc cgccctcgga gggccgcctc 961 gacgcgctgc agggcccccc ccgcaccggc gggccctccc cggctgggcc agcggcggcg 1021 cggctggcgg aggacccggc ccctcactac cccgccggag cgccggcgcc cggcccgcat 1081 ccagccgcgg gcgaggtgcc tccgggtccc ggcgggccct cggttatcca ttcgccgcct 1141 ccgccgccgc ctcctgcggt gctcgccaag cccaaactgt ggtctttggc agagatcgcc 1201 acattgtcgg acaaggtcaa ggacgggggc ggcgggaacg agggctctcc atgcccaccg 1261 tgtcccgggc ccatagccgg gcaagcccta ggaggcagcc gggcgtcgcc ggccccggcg 1321 ccgtcacgct cgccctcggc gcagtgtcct tttccaggcg ggacggtgct gtcccggcct 1381 ctctactaca ccgcgccctt ctatcccggc tacacgaact atggctcctt cggacacctt 1441 catggccacc cggggcccgg gccaggcccc acaaccggtc cggggtctca tttcaatgga 1501 ttaaaccaga ccgtgttgaa ccgagcggac gctttggcta aagacccgaa aatgttgcgg 1561 agccagtctc agctagacct gtgcaaagac tctccctatg aattgaagaa aggtatgtcc 1621 gacatttaac gcgggctgcg tcggtcccgg acttttctaa tttattaaaa acatggcctt 1681 ggcagttatt tttccatcac cgagagagag agacagagag agaaaataaa ctacccctcc 1741 tattcagaag tttatagttt atggagatgg atgacataaa aatgtaaaca tctccacaca 1801 cacaaaaaaa tgttttaacc aaccg // LOCUS HSU90313 793 bp mRNA PRI 15-SEP-1997 DEFINITION Human glutathione-S-transferase homolog mRNA, complete cds. ACCESSION U90313 NID g2393721 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 793) AUTHORS Kodym,R. and Story,M.D. TITLE Cloning of the human homolog to a mouse protein, differentially expressed in lymphoma cells with different susceptibility to radiation induced apoptosis JOURNAL Unpublished REFERENCE 2 (bases 1 to 793) AUTHORS Kodym,R. and Story,M.D. TITLE Direct Submission JOURNAL Submitted (20-FEB-1997) Experimental Radiation Oncology, The University of Texas, MD Anderson Cancer Center, 1515 Holcombe Boulevard, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..793 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" CDS 10..735 /note="similar to murine glutathione-S-transferase gene, GenBank Accession Number U80819" /codon_start=1 /product="glutathione-S-transferase homolog" /db_xref="PID:g2393722" /translation="MSGESARSLGKGSAPPGPVPEGSIRIYSMRFCPFAERTRLVLKA KGIRHEVININLKNKPEWFFKKNPFGLVPVLENSQGQLIYESAITCEYLDEAYPGKKL LPDDPYEKACQKMILELFSKVPSLVGSFIRSQNKEDYAGLKEEFRKEFTKLEEVLTNK KTTFFGGNSISMIDYLIWPWFERLEAMKLNECVDHTPKLKLWMAAMKEDPTVSALLTS EKDWQGFLELYLQNSPEACDYGL" polyA_signal 753..758 BASE COUNT 226 a 171 c 211 g 185 t ORIGIN 1 tgcgccacga tgtccgggga gtcagccagg agcttgggga agggaagcgc gcccccgggg 61 ccggtcccgg agggctcgat ccgcatctac agcatgaggt tctgcccgtt tgctgagagg 121 acgcgtctag tcctgaaggc caagggaatc aggcatgaag tcatcaatat caacctgaaa 181 aataagcctg agtggttctt taagaaaaat ccctttggtc tggtgccagt tctggaaaac 241 agtcagggtc agctgatcta cgagtctgcc atcacctgtg agtacctgga tgaagcatac 301 ccagggaaga agctgttgcc ggatgacccc tatgagaaag cttgccagaa gatgatctta 361 gagttgtttt ctaaggtgcc atccttggta ggaagcttta ttagaagcca aaataaagaa 421 gactatgctg gcctaaaaga agaatttcgt aaagaattta ccaagctaga ggaggttctg 481 actaataaga agacgacctt ctttggtggc aattctatct ctatgattga ttacctcatc 541 tggccctggt ttgaacggct ggaagcaatg aagttaaatg agtgtgtaga ccacactcca 601 aaactgaaac tgtggatggc agccatgaag gaagatccca cagtctcagc cctgcttact 661 agtgagaaag actggcaagg tttcctagag ctctacttac agaacagccc tgaggcctgt 721 gactatgggc tctgaagggg gcaggagtca gcaataaagc tatgtctgat attttccttc 781 agtaaaaaaa aaa // LOCUS HSU90426 1512 bp mRNA PRI 25-MAR-1997 DEFINITION Human nuclear RNA helicase, complete cds. ACCESSION U90426 NID g1905997 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1512) AUTHORS Jelinek,W.R. TITLE Direct Submission JOURNAL Submitted (21-FEB-1997) Biochemistry, NYU Medical Center, 550 First Avenue, New York City, NY 10016, USA FEATURES Location/Qualifiers source 1..1512 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 96..1379 /note="DEAD-box family member; contains DECD-box; similar to rat liver nuclear protein p47 (PIR Accession Number A42881) and D. melanogaster DEAD-box RNA helicase WM6 (PIR Accession Number S51601)" /codon_start=1 /product="nuclear RNA helicase" /db_xref="PID:g1905998" /translation="MAEQDVENDLLDYDEEEEPQAPQESTPAPPKKDIKGSYVSIHSS GFRDFLLKPELLRAIVDCGLEHPSEVQHECIPQAILGMDVLCQAKSGMGKTAVFVLAT LQQIEPVNGQVTVLVMCHTRELAFQISKEYERFSKYMPSVKVSVFFGGLSIKKDEEVM KKNCPHVVVGTPGRILALVRNRSFSLKNVKHFVLDECDKMLEQLDMRRDVQEIFRLTP HEKQCMMFSATLSKDIRPVCRKFMQDPMEVFVDDETKLTLHGLQQYYVKLKDSEKNRK LFDLLDVLEFNQVIIFVKSVQRCMALAQLLVEQNFPAIAIHRGMAQEERLSRYQQFKD FQRRILVATNLFGRGMDIERVNIVFNYDMPEDSDTYLHRVARGGRFGTKGLAITFVSD ENDAKILNHVQDRCEVNVAELPEEIDISTYIEQSR" BASE COUNT 357 a 415 c 443 g 297 t ORIGIN 1 cggaagcgca gcaactcgtg tctgagcgcc cggcggaaaa ccgaagttgg aagtgtctct 61 tagcagcgcg cggagaagaa cggggagcca gcatcatggc agaacaggat gtggaaaacg 121 atcttttgga ttacgatgag gaggaagagc cccaggctcc tcaagagagc acaccagctc 181 cccctaagaa agacatcaag ggatcctacg tttccatcca cagctctggc ttccgggact 241 ttctgctgaa gccggagctc ctgcgggcca tcgtggactg tggcttggag catccttctg 301 aggtccagca tgagtgcatt ccccaggcca tcctgggcat ggacgtcctg tgccaggcca 361 agtccgggat gggcaagaca gcggtcttcg tgctggccac cctacagcag attgagcctg 421 tcaacggaca ggtgacggtc ctggtcatgt gccacacgag ggagctggcc ttccagatca 481 gcaaggaata tgagcgcttt tccaagtaca tgcccagcgt caaggtgtct gtgttcttcg 541 gtggtctctc catcaagaag gatgaagaag tgatgaagaa gaactgtccc catgtcgtgg 601 tggggacccc gggccgcatc ctggcgctcg tgcggaatag gagcttcagc ctaaagaatg 661 tgaagcactt tgtgctggac gagtgtgaca agatgctgga gcagctggac atgcggcggg 721 atgtgcagga gatcttccgc ctgacaccac acgagaagca gtgcatgatg ttcagcgcca 781 ccctgagcaa ggacatccgc cctgtgtgca ggaagttcat gcaggatccc atggaggtgt 841 ttgtggacga cgagaccaag ctcacgctgc acgggctgca gcagtactac gtcaaactca 901 aagacagtga gaagaaccgc aagctctttg atctcttgga tgtgctggag tttaaccagg 961 tgataatctt cgtcaagtca gtgcagcgct gcatggccct ggcccagctc ctcgtggagc 1021 agaacttccc ggccatcgcc atccaccggg gcatggccca ggaggagcgc ctgtcacgct 1081 atcagcagtt caaggatttc cagcggcgga tcctggtggc caccaatctg tttggccggg 1141 ggatggacat cgagcgagtc aacatcgtct ttaactacga catgcctgag gactcggaca 1201 cctacctgca ccgggtggcc cgggggggtc gctttggcac caaaggccta gccatcactt 1261 ttgtgtctga cgagaatgat gccaaaatcc tcaatcacgt ccaggaccgg tgtgaagtta 1321 atgtggcaga acttccagag gaaatcgaca tctccacata catcgagcag agccggtaac 1381 caccacgtgc cagagccgcc cacccggagc cgcccgcatg cagcttcacc tcccctttcc 1441 aggcgccact gttgagaagc tagagattgt atgagaataa agtgttatta tgaaatgaag 1501 aagcctcacc ca // LOCUS HSU90441 2194 bp mRNA PRI 27-SEP-1997 DEFINITION Human prolyl 4-hydroxylase alpha (II) subunit mRNA, complete cds. ACCESSION U90441 NID g2439984 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2194) AUTHORS Annunen,P., Helaakoski,T., Myllyharju,J., Veijola,J., Pihlajaniemi,T. and Kivirikko,K.I. TITLE Cloning of the human prolyl 4-hydroxylase alpha subunit isoform alpha(II) and characterization of the type II enzyme tetramer. The alpha(I) and alpha(II) subunits do not form a mixed alpha(I)alpha(II)beta2 tetramer JOURNAL J. Biol. Chem. 272 (28), 17342-17348 (1997) MEDLINE 97362215 REFERENCE 2 (bases 1 to 2194) AUTHORS Annunen,P.P., Helaakoski,T., Myllyharju,J., Veijola,J., Pihlajaniemi,T. and Kivirikko,K.I. TITLE Direct Submission JOURNAL Submitted (24-FEB-1997) Department of Medical Biochemistry, University of Oulu, Kajaanintie 52 A, Oulu, FIN-90220, Finland FEATURES Location/Qualifiers source 1..2194 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 188..1795 /codon_start=1 /product="prolyl 4-hydroxylase alpha (II) subunit" /db_xref="PID:g2439985" /translation="MKLWVSALLMAWFGVLSCVQAEFFTSIGHMTDLIYAEKELVQSL KEYILVEEAKLSKIKSWANKMEALTSKSAADAEGYLAHPVNAYKLVKRLNTDWPALED LVLQDSAAGFIANLSVQRQFFPTDEDEIGAAKALMRLQDTYRLDPGTISRGELPGTKY QAMLSVDDCFGMGRSAYNEGDYYHTVLWMEQVLKQLDAGEEATTTKSQVLDYLSYAVF QLGDLHRALELTRRLLSLDPSHERAGGNLRYFEQLLEEEREKTLTNQTEAELATPEGI YERPVDYLPERDVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWD SPHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDD PVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNR VATFLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLV GCKWVSNKWFHERGQEFLRPCGSTEVD" BASE COUNT 534 a 530 c 610 g 520 t ORIGIN 1 ggggaaggaa cactgtaggg gatagctgtc cacggacgct gtctacaaga ccctggagtg 61 agataacgtg cctggtactg tgccctgcat gtgtaagatg cccagttgac cttcgcagca 121 ggagcctgga tcaggcactt cctgcctcag gtattgctgg acagcccaga cacttccctc 181 tgtgaccatg aaactctggg tgtctgcatt gctgatggcc tggtttggtg tcctgagctg 241 tgtgcaggcc gaattcttca cctctattgg gcacatgact gacctgattt atgcagagaa 301 agagctggtg cagtctctga aagagtacat ccttgtggag gaagccaagc tttccaagat 361 taagagctgg gccaacaaaa tggaagcctt gactagcaag tcagctgctg atgctgaggg 421 ctacctggct caccctgtga atgcctacaa actggtgaag cggctaaaca cagactggcc 481 tgcgctggag gaccttgtcc tgcaggactc agctgcaggt tttatcgcca acctctctgt 541 gcagcggcag ttcttcccca ctgatgagga cgagatagga gctgccaaag ccctgatgag 601 acttcaggac acatacaggc tggacccagg cacaatttcc agaggggaac ttccaggaac 661 caagtaccag gcaatgctga gtgtggatga ctgctttggg atgggccgct cggcctacaa 721 tgaaggggac tattatcata cggtgttgtg gatggagcag gtgctaaagc agcttgatgc 781 cggggaggag gccaccacaa ccaagtcaca ggtgctggac tacctcagct atgctgtctt 841 ccagttgggt gatctgcacc gtgccctgga gctcacccgc cgcctgctct cccttgaccc 901 aagccacgaa cgagctggag ggaatctgcg gtactttgag cagttattgg aggaagagag 961 agaaaaaacg ttaacaaatc agacagaagc tgagctagca accccagaag gcatctatga 1021 gaggcctgtg gactacctgc ctgagaggga tgtttacgag agcctctgtc gtggggaggg 1081 tgtcaaactg acaccccgta gacagaagag gcttttctgt aggtaccacc atggcaacag 1141 ggccccacag ctgctcattg cccccttcaa agaggaggac gagtgggaca gcccgcacat 1201 cgtcaggtac tacgatgtca tgtctgatga ggaaatcgag aggatcaagg agatcgcaaa 1261 acctaaactt gcacgagcca ccgttcgtga tcccaagaca ggagtcctca ctgtcgccag 1321 ctaccgggtt tccaaaagct cctggctaga ggaagatgat gaccctgttg tggcccgagt 1381 aaatcgtcgg atgcagcata tcacagggtt aacagtaaag actgcagaat tgttacaggt 1441 tgcaaattat ggagtgggag gacagtatga accgcacttc gacttctcta ggaatgatga 1501 gcgagatact ttcaagcatt tagggacggg gaatcgtgtg gctactttct taaactacat 1561 gagtgatgta gaagctggtg gtgccaccgt cttccctgat ctgggggctg caatttggcc 1621 taagaagggt acagctgtgt tctggtacaa cctcttgcgg agcggggaag gtgactaccg 1681 aacaagacat gctgcctgcc ctgtgcttgt gggctgcaag tgggtctcca ataagtggtt 1741 ccatgaacga ggacaggagt tcttgagacc ttgtggatca acagaagttg actgacatcc 1801 ttttctgtcc ttccccttcc tggtccttca gcccatgtca acgtgacaga cacctttgta 1861 tgttccttgt atgttcctat caggctgatt tttggagaaa tgaatgtttg tctggagcag 1921 agggagacca tactagggcg actcctgtgt gactgaagtc ccagcccttc cattcagcct 1981 gtgccatccc tggccccaag gctaggatca aagtggctgc agcagagtta gctgtctagc 2041 gcctagcaag gtgcctttgt acctcaggtg ttttaggtgt gagatgtttc agtgaaccaa 2101 agttctgata ccttgtttac atgtttgttt ttatggcatt tctatctatt gtggctttac 2161 caaaaaataa aatgtcccta ccagaagcct taaa // LOCUS HSU90544 2281 bp mRNA PRI 02-MAY-1997 DEFINITION Human sodium phosphate transporter (NPT3) mRNA, complete cds. ACCESSION U90544 NID g2062689 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2281) AUTHORS Ruddy,D.A., Kronmal,G.S., Lee,V.K., Mintier,G.A., Quintana,L., Domingo,R. Jr., Meyer,N.C., Basava,A., McClelland,E., Fullan,A., Mapa,F.A., Moore,T., Thomas,W., Loeb,D.B., Harmon,C., Tsuchihashi,Z., Wolff,R.K., Schatzman,R.C. and Feder,J.N. TITLE A 1.1 megabase transcript map of the human hereditary hemochromatosis locus JOURNAL Unpublished REFERENCE 2 (bases 1 to 2281) AUTHORS Ruddy,D.A., Kronmal,G.S., Lee,V.K., Mintier,G.A., Quintana,L., Domingo,R. Jr., Meyer,N.C., Basava,A., McClelland,E., Fullan,A., Mapa,F.A., Moore,T., Thomas,W., Loeb,D.B., Harmon,C., Tsuchihashi,Z., Wolff,R.K., Schatzman,R.C. and Feder,J.N. TITLE Direct Submission JOURNAL Submitted (25-FEB-1997) Sequencing, Mercator Genetics, 4040 Campbell Avenue, Menlo Park, CA 94025, USA FEATURES Location/Qualifiers source 1..2281 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6p21.3" gene 419..1729 /gene="NPT3" CDS 419..1729 /gene="NPT3" /codon_start=1 /product="sodium phosphate transporter" /db_xref="PID:g2062690" /translation="MDGKPATRKGPDFCSLRYGLALIMHFSNFTMITQRVSLSIAIIA MVNTTQQQGLSNASTEGPVADAFNNSSISIKEFDTKASVYQWSPETQGIIFSSINYGI ILTLIPSGYLAGIFGAKKMLGAGLLISSLLTLFTPLAADFGVILVIMVRTVQGMAQGM AWTGQFTIWAKWAPPLERSKLTTIAGSGSAFGSFIILCVGGLISQALSWPFIFYIFGS TGCVCCLLWFTVIYDDPMHHPCISVREKEHILSSLAQQPSSPGRAVPIKAMVTCLPLW AIFLGFFSHFWLCTIILTYLPTYISTLLHVNIRDSGVLSSLPFIAAASCTILGGQLAD FLLSRNLLRLITVRKLFSSLDMQVSSWESQGDLGSSQESSLPLPLDSSSVRILSLVGG MSFSCLLQSTCLAWSFTSRLDKQNFKTGPKRGPLPASEDIKLQT" BASE COUNT 624 a 506 c 490 g 661 t ORIGIN 1 ggacagaaaa ctccctcctt ttccaagtta gccttatagt ctagggctta aaatactggt 61 ttaatggtga aggtaagtgc ttttcttctt tttgggtaga aggattatta ctaacttacc 121 aaaggtccat taaggggagg gaacagtttt aggagaagtc agagaaaaga cattaacagc 181 aacataagga tctccatctg gtaatattgc ctaattccaa aatgaagaga ctctctgaaa 241 aagataactg attcaatgaa gaccctaggg caaggcttga gaagccactg gtaccaatgg 301 acactgtgga caatggtcat ttctccaagg acgctataaa agactgtcgt agtaaaagag 361 attcagggca cagggaaact ccaccacaaa gcgtggtacc atttcccaca gaagctaaat 421 ggacgggaag cctgccacca ggaaaggtcc agatttctgt tcattacgct atgggctggc 481 tcttatcatg cacttctcaa acttcaccat gataacgcag cgtgtgagtc tgagcattgc 541 gatcatcgcc atggtgaaca ccactcagca gcaaggtcta tctaatgcct ccactgaggg 601 gcctgttgca gatgccttca ataactccag catatccatc aaggaatttg atacaaaggc 661 ctctgtgtat caatggagcc cagaaactca gggtatcatc tttagctcca tcaactatgg 721 gataatactg actctgatcc caagtggata tttagcaggg atatttggag caaaaaaaat 781 gcttggtgct ggtttgctga tctcttccct tctcaccctc tttacaccac tggctgctga 841 cttcggagtg attttggtca tcatggttcg gacagtccag ggcatggccc agggaatggc 901 atggacaggt cagtttacta tttgggcaaa gtgggctcct ccacttgaac gaagcaagct 961 caccaccatt gcaggatcag ggtcagcatt tggatccttc atcatcctct gtgtgggggg 1021 actaatctca caggccttga gctggccttt tatcttctac atctttggta gcactggctg 1081 tgtctgctgt ctcctatggt tcacagtgat ttatgatgac cccatgcatc acccgtgcat 1141 aagtgttagg gaaaaggagc acatcctgtc ctcactggct caacagccca gttctcctgg 1201 acgagctgtc cccataaagg cgatggtcac atgcctacca ctttgggcca ttttcctggg 1261 ttttttcagc catttctggt tatgcaccat catcctaaca tacctaccaa cgtatatcag 1321 tactctgctc catgttaaca tcagagatag tggagttctg tcctccctgc cttttattgc 1381 tgctgcaagc tgtacaattt taggaggtca gctggcagat ttccttttgt ccaggaatct 1441 tctcagattg atcactgtgc gaaagctctt ttcatctctt gatatgcaag tttcctcatg 1501 ggaatctcaa ggggatttgg gctcatcgca ggaatcatct cttccactgc cactggattc 1561 ctcatcagtc aggattttga gtctggttgg aggaatgtct ttttcctgtc tgctgcagtc 1621 aacatgtttg gcctggtctt ttacctcacg tttggacaag cagaacttca agactgggcc 1681 aaagagagga cccttacccg cctctgagga cataaagtta caaacttaaa tgtggtactg 1741 agcatgaact ttttaaacat tttttacttc tctccatatt cctgaccata gactcagcag 1801 ttcttaactc tggctgtgtg ttagtcttcc ctggggagcc tttataagac actgatactt 1861 gggacccact ccagagattc tgaatgaatt ggtctggggt ggaacccaga tactactaat 1921 ttttagatac tccttagagg tttctagcat gcgcccgggg ttgacaacag ctggacaaac 1981 ttgaaaagtc aattcatgtg gcctttgaat tttcctcatt ggaaagtact aaataaataa 2041 aaattcatgt gaaaatgatc actgataaat atcttcatgg tggggcaggt tattggatgc 2101 agagaagatc tgctcggaat tgtagccata tgttacagat ctcagcaccg atcagaactg 2161 taaagctata atccccagaa ttaaagtttt tattattttt tatacattgt aaaacataga 2221 cgtttattta tgtgattaaa ttctattaaa atttacatgc taaaataaaa aaaaaaaaaa 2281 a // LOCUS HSU90545 1795 bp mRNA PRI 02-MAY-1997 DEFINITION Human sodium phosphate transporter (NPT4) mRNA, complete cds. ACCESSION U90545 NID g2062691 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1795) AUTHORS Ruddy,D.A., Kronmal,G.S., Lee,V.K., Mintier,G.A., Quintana,L., Domingo,R. Jr., Meyer,N.C., Basava,A., McClelland,E., Fullan,A., Mapa,F.A., Moore,T., Thomas,W., Loeb,D.B., Harmon,C., Tsuchihashi,Z., Wolff,R.K., Schatzman,R.C. and Feder,J.N. TITLE A 1.1 megabase transcript map of the human hereditary hemochromatosis locus JOURNAL Unpublished REFERENCE 2 (bases 1 to 1795) AUTHORS Ruddy,D.A., Kronmal,G.S., Lee,V.K., Mintier,G.A., Quintana,L., Domingo,R. Jr., Meyer,N.C., Basava,A., McClelland,E., Fullan,A., Mapa,F.A., Moore,T., Thomas,W., Loeb,D.B., Harmon,C., Tsuchihashi,Z., Wolff,R.K., Schatzman,R.C. and Feder,J.N. TITLE Direct Submission JOURNAL Submitted (25-FEB-1997) Sequencing, Mercator Genetics, 4040 Campbell Avenue, Menlo Park, CA 94025, USA FEATURES Location/Qualifiers source 1..1795 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6p21.3" gene 377..1582 /gene="NPT4" CDS 377..1582 /gene="NPT4" /codon_start=1 /product="sodium phosphate transporter" /db_xref="PID:g2062692" /translation="MQVDETLIPRKGPSLCSARYGIALVLHFCNFTTIAQNVIMNITM VAMVNSTSPQSQLNDSSEVLPVDSFGGLSKAPKSLPAKSSILGGQFAIWEKWGPPQER SRLCSIALSGMLLGCFTAILIGGFISETLGWPFVFYIFGGVGCVCCLLWFVVIYDDPF SYPWISTSEKEYIISSLKQQVGSSKQPLPIKAMLRSLPIWSICLGCFSHQWLVSTMVV YIPTYISSVYHVNIRDNGLLSALPFIVAWVIGMVGGYLADFLLTKKFRLITVRKIATI LGSLPSSALIVSLPYLNSGYITATALLTLSCGLSTLCQSGIYINVLDIAPRYSSFLMG ASRGFSSIAPVIVPTVSGFLLSQDPEFGWRNVFFLLFAVNLLGLLFYLIFGEADVQEW AKERKLTRL" BASE COUNT 473 a 436 c 402 g 484 t ORIGIN 1 acgcgtccgc ccacgcgtcc gcccacgcgt ccggtcgggg ccagagcgca ggtgtacctg 61 gcggccgtgc tggagcacct gaccgccgag atcctggagc tggctggcaa cccggcccgc 121 gacaagaaga cccgcatcat cctgcgccac ctgtagctgg ccattcgcaa cggcgaggag 181 cttaacaagc tgctgggcga agtcaccatc gcgcagggcg gtgtcctgcc caacattcag 241 ggcgtgcttc tgccccagaa gaccaagagc caccacaagg ccaagggtga aaaccattca 301 ctaggagagg agaaacacaa tggccaccaa gacagagttg agtcccacag caagggagag 361 caagaacgca caagatatgc aagtggatga gacactgatc cccaggaaag gtccaagttt 421 atgttctgct cgctatggaa tagccctcgt cttacatttc tgcaatttca caacgatagc 481 acaaaatgtc atcatgaaca tcaccatggt agccatggtc aacagcacaa gccctcaatc 541 ccagctcaat gattcctctg aggtgctgcc tgttgactca tttggtggcc taagtaaagc 601 cccaaagagt cttcctgcaa agtcctcaat acttgggggt cagtttgcaa tttgggaaaa 661 gtggggccct ccacaagaac gaagcagact ctgcagcatt gctttatcag gaatgttact 721 gggatgcttt actgccatcc tcataggtgg cttcattagt gaaacccttg ggtggccctt 781 tgtcttctat atctttggag gtgttggctg tgtctgctgc cttctctggt ttgttgtgat 841 ttatgatgac cccttttcct atccatggat aagcacctca gaaaaagaat acatcatatc 901 ctccttgaaa caacaggtcg ggtcttctaa gcagcctctt cccatcaaag ctatgctcag 961 atctctaccc atttggtcca tatgtttagg ctgtttcagc catcaatggt tagttagcac 1021 aatggttgta tacataccaa cttacatcag ctctgtgtac catgttaaca tcagagacaa 1081 tggacttcta tctgcccttc cttttattgt tgcctgggtc ataggcatgg tgggaggcta 1141 tctggcagat ttccttctaa ccaaaaagtt tagactcatc actgtgagga aaattgccac 1201 aattttagga agtctcccct cttcagcact cattgtgtct ctgccttacc tcaattccgg 1261 ctatatcaca gcaactgcct tgctgacgct ctcttgcgga ttaagcacat tgtgtcagtc 1321 agggatttat atcaatgtct tagatattgc tccaaggtat tccagttttc tcatgggagc 1381 atcaagagga ttttcgagca tagcacctgt cattgtaccc actgtcagcg gatttcttct 1441 tagtcaggac cctgagtttg ggtggaggaa tgtcttcttc ttgctgtttg ccgttaacct 1501 gttaggacta ctcttctacc tcatatttgg agaagcagat gtccaagaat gggctaaaga 1561 gagaaaactc actcgtttat gaagttatcc caccttggat ggaaaagtca ttaggcaccg 1621 tattgcataa aatagaaggc ttccgtgatg aaaataccag tgaaaagatt tttttttcct 1681 gtggctcttt tcaattatga gatcagttca ttattttatt cagacttttt tttgagagaa 1741 atgtaagatg aataaaaatt caaataaaat gataactaag aaaaaaaaaa aaaaa // LOCUS HSU90547 2872 bp mRNA PRI 02-MAY-1997 DEFINITION Human Ro/SSA ribonucleoprotein homolog (RoRet) mRNA, complete cds. ACCESSION U90547 NID g2062695 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2872) AUTHORS Ruddy,D.A., Kronmal,G.S., Lee,V.K., Mintier,G.A., Quintana,L., Domingo,R. Jr., Meyer,N.C., Basava,A., McClelland,E., Fullan,A., Mapa,F.A., Moore,T., Thomas,W., Loeb,D.B., Harmon,C., Tsuchihashi,Z., Wolff,R.K., Schatzman,R.C. and Feder,J.N. TITLE A 1.1 megabase transcript map of the human hereditary hemochromatosis locus JOURNAL Unpublished REFERENCE 2 (bases 1 to 2872) AUTHORS Ruddy,D.A., Kronmal,G.S., Lee,V.K., Mintier,G.A., Quintana,L., Domingo,R. Jr., Meyer,N.C., Basava,A., McClelland,E., Fullan,A., Mapa,F.A., Moore,T., Thomas,W., Loeb,D.B., Harmon,C., Tsuchihashi,Z., Wolff,R.K., Schatzman,R.C. and Feder,J.N. TITLE Direct Submission JOURNAL Submitted (25-FEB-1997) Sequencing, Mercator Genetics, 4040 Campbell Avenue, Menlo Park, CA 94025, USA FEATURES Location/Qualifiers source 1..2872 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6p21.3" gene 22..1419 /gene="RoRet" CDS 22..1419 /gene="RoRet" /codon_start=1 /product="Ro/SSA ribonucleoprotein homolog" /db_xref="PID:g2062696" /translation="MASTTSTKKMMEEATCSICLSLMTNPVSINCGHSYCHLCITDFF KNPSQKQLRQETFCCPQCRAPFHMDSLRPNKQLGSLIEALKETDQEMSCEEHGEQFHL FCEDEGQLICWRCERAPQHKGHTTALVEDVCQGYKEKLQKAVTKLKQLEDRCTEQKLS TAMRITKWKEKVQIQRQKIRSDFKNLQCFLHEEEKSYLWRLEKEEQQTLSRLRDYEAG LGLKSNELKSHILELEEKCQGSAQKLLQNVNDTLSRSWAVKLETSEAVSLELHTMCNV SKLYFDVKKMLRSHQVSVTLDPDTAHHELILSEDRRQVTRGYTQENQDTSSRRFTAFP CVLGCEGFTSGRRYFEVDVGEGTGWDLGVCMENVQRGTGMKQEPQSGFWTLRLCKKKG YVALTSPPTSLHLHEQPLLVGIFLDYEAGVVSFYNGNTGCHIFTFPKASFSDTLRPYF QVYQYSPLFLPPPGD" BASE COUNT 892 a 584 c 688 g 708 t ORIGIN 1 gacccacgcg tccgaaaagc tatggcctca accaccagca ccaagaagat gatggaggaa 61 gccacctgct ccatctgcct gagcctgatg acgaacccag taagcatcaa ctgtggacac 121 agctactgcc acttgtgtat aacagacttc tttaaaaacc caagccaaaa gcaactgagg 181 caggagacat tctgctgtcc ccagtgtcgg gctccatttc atatggatag cctccgaccc 241 aacaagcagc tgggaagcct cattgaagcc ctcaaagaga cggatcaaga aatgtcatgt 301 gaggaacacg gagagcagtt ccacctgttc tgcgaagacg aggggcagct catctgctgg 361 cgctgtgagc gggcaccaca gcacaaaggg cacaccacag ctcttgttga agacgtatgc 421 cagggctaca aggaaaagct ccagaaagct gtgacaaaac tgaagcaact tgaagacaga 481 tgtacggagc agaagctgtc cacagcaatg cgaataacta aatggaaaga gaaggtacag 541 attcagagac aaaaaatccg gtctgacttt aagaatctcc agtgtttcct acatgaggaa 601 gagaagtctt atctctggag gctggagaaa gaagaacaac agactctgag tagactgagg 661 gactatgagg ctggtctggg gctgaagagc aatgaactca agagccacat cctggaactg 721 gaggaaaaat gtcagggctc agcccagaaa ttgctgcaga atgtgaatga cactttgagc 781 aggagttggg ctgtgaagct ggaaacatca gaggctgtct ccttggaact tcatactatg 841 tgcaatgttt ccaagcttta cttcgatgtg aagaaaatgt taaggagtca tcaagttagt 901 gtgactctgg atccagatac agctcatcac gaactaattc tctctgagga tcggagacaa 961 gtgactcgtg gatacaccca ggagaatcag gacacatctt ccaggagatt tactgccttc 1021 ccctgtgtct tgggttgtga aggcttcacc tcaggaagac gttactttga agtggatgtt 1081 ggcgaaggaa ccggatggga tttaggagtt tgtatggaaa atgtgcagag gggcactggc 1141 atgaagcaag agcctcagtc tggattctgg accctcaggc tgtgcaaaaa gaaaggctat 1201 gtagcactta cttctccccc aacttccctt catctgcatg agcagcccct gcttgtggga 1261 atttttctgg actatgaggc cggagttgta tccttttata acgggaatac tggctgccac 1321 atctttactt tcccgaaggc ttccttctct gatactctcc ggccctattt ccaggtttat 1381 caatattctc ctttgtttct gcctccccca ggtgactaag gaaaagagca gaagctcctt 1441 ggtttaacca gcacagagaa aataatataa atcccataag ggcagacgtt tggtctgttt 1501 tcttcgctgt catttcctta gtagttagac tagtgctgag attttagtgg atatataatt 1561 gatttatgtt gaatatatgg acttagcaac taaaaatacc acagatggtt aacctggact 1621 ggggcaaagc aagataatag tgatgatcgt atgttgctgt ctccatccgt ctttaatggg 1681 tcagggcttt gatttccaag ggtcttcagg tgatgagtag gggtacccac aagtcagaag 1741 gtctgcgttc tcctagtttg tttgctgcca tttgaactca tgtagggaat gaaagaaagc 1801 tgcaattatc cgccaactgc atttaaaaca aaacaaaaca gaaaaatcaa aataacattg 1861 actcttccaa ccactgacat gttgtttaat aatctaagcg gcagtcctgg aggctaccag 1921 acttactgag ttctacctga gaaacagcca agcaaagtgt gagagaaggg ttaagactgg 1981 cttacaatga gatgcttcaa atgaaaaggg aattatgagt aaaattgaac tttgatgggg 2041 gattcagttc tggaaaagaa tttggtattt tccagtctgc taggaccaat taccttgaaa 2101 tattttaaaa tctcagtaaa tagttattgc tgaaatggct gttggcagtt cttattatga 2161 ttcagagaag agcaaataga ccttaacttc attttgaaaa agaccaaatt accatacccg 2221 agtgagtaat gacaggacta caactaaaac ataaacaaca ttaatgatga ccataaaaag 2281 tcacaaaatt gctaaatgtt ataatttaga gttgacataa aaattgatgg ccaggcatgg 2341 tggctcacgc ctgtaatccc agaactatgt gaggctgagg caggtggatc acttgaggtc 2401 aggagttcaa caccagcctg gccaacatgg tgaaaccctg tctctactaa aaatacaaaa 2461 attagccggg catggtggta ggggcctgta acccagctac tcgtgaggcc aaggcaggag 2521 aattgcttga gcctgcagca gctgcagtaa gccaagatca tgctgtgcct caaggaaaaa 2581 aaaaattaat gtttactgat atttgttgaa gtcctacaac atcacctctg agaataggag 2641 aaatgaagca acagttgtgt ctagatgtca gaggcatggc tgggcctcca tctctgccta 2701 agggagatat aaaagagttc aaactattgc ccatgttccc cagggtcaga agttctaatt 2761 atgatgatag aggctgggtt gtaagtagta agtgaagggt agcagaatat gccatctttg 2821 gcataagaag tattttgagt tgaagacaat tgagaaaaaa aaaaaaaaaa aa // LOCUS HSU90653 1198 bp mRNA PRI 20-NOV-1997 DEFINITION Human DHHC-domain-containing cysteine-rich protein mRNA, complete cds. ACCESSION U90653 NID g2342646 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1198) AUTHORS Putilina,T., Wong,P. and Gentleman,S. TITLE Evidence for a human gene with a new Cys-rich domain JOURNAL Unpublished REFERENCE 2 (bases 1 to 1198) AUTHORS Gentleman,S. TITLE Direct Submission JOURNAL Submitted (24-FEB-1997) National Eye Institute, NIH, Lab. Retinal Cell Molec Biol, Bldg. 6 Room 305, Bethesda, MD 20892-2740, USA FEATURES Location/Qualifiers source 1..1198 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="pancreas" /dev_stage="adult" CDS 306..959 /codon_start=1 /product="DHHC-domain-containing cysteine-rich protein" /db_xref="PID:g2342647" /translation="MYKMNICNKPSNKTAPEKSVWTAPAQPSGPSPELQGQRSRRNGW SWPPHPLQIVAWLLYLFFAVIGFGILVPLLPHHWVPAGYACMGAIFAGHLVVHLTAVS IDPADDNVRDKSYAGPLPIFNRSQHAHVIEDLHCNLCNVDVSARSKHCSACNKCVCGF DHHCKWLNNCVGERNYRLFLHSVASALLGVLLLVLGGHICLRGVLCQPHASAHQPTL" BASE COUNT 208 a 407 c 347 g 236 t ORIGIN 1 agcccctcca gcctgctgga gccggagccg gagccggagc cggagccgga gccggagcca 61 gagccagagc tcgaggactc accggcccag tctccgtccg ggatggggcc ccgctcccgg 121 gcgcgttgcc gcccagtccc ggggaccgtc cctaccgcga gggtctgagg cgcggctgcc 181 ccggggaggg tggaaggcca ggcgtagagc ccgaacctct ggctgacttt ggaagggacc 241 atctggcacg gtctccgcgg cgcgcagctg ttttcaagtc agcaaacatt tactgaggat 301 ctactatgta caagatgaac atctgcaaca agccctccaa caagacggcc cctgagaaga 361 gtgtgtggac ggcaccggca cagcccagcg gaccctcccc tgagctgcag ggccagcgat 421 cccgccggaa tgggtggagc tggccccctc acccgctcca gattgtggcc tggctgctgt 481 acctcttctt tgctgtgatc ggctttggga tccttgttcc cctcctgcct caccactggg 541 tgcccgctgg ctacgcttgc atgggcgcca tctttgctgg ccaccttgtg gtgcacctga 601 ccgccgtctc catcgatcca gcagatgaca acgtgcggga caagagctat gcggggcccc 661 tgcccatctt caaccgaagc cagcacgcac atgtcattga agacctgcac tgcaacttgt 721 gcaacgtgga tgtgagcgct cgctccaagc actgcagcgc ctgcaacaag tgcgtgtgcg 781 gtttcgacca ccactgcaag tggctcaaca actgtgtggg cgagcggaac taccggctct 841 ttctacacag tgttgcatcc gctttactgg gcgtcctgct cctggtgctg ggtggccaca 901 tatgtcttcg tggagttctt tgtcaacccc atgcgtctgc gcaccaaccg acactttgaa 961 gtcctgaaga atcacacgga tgtgtggttc gtgttcctgc ctgccgcccc cgtggagacc 1021 caggcccctg ccatcctggc cctggccgcc ctgctcatcc ttctgggcct cctgtccaca 1081 gccgtcctgg ggcacctgct ctgcttccac atttatctca tgtggcacaa gctcaccacc 1141 tatgagtaca tcgtgcagca ccgcccacca caggaggcca agggggcccg gaatcttt // LOCUS HSU90724 3428 bp DNA PRI 16-JAN-1998 DEFINITION Human aminopeptidase P gene, complete cds. ACCESSION U90724 NID g2772609 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3428) AUTHORS Venema,R.C., Ju,H., Zou,R., Venema,V.J. and Ryan,J.W. TITLE Cloning and tissue distribution of human membrane-bound aminopeptidase P JOURNAL Biochim. Biophys. Acta 1354 (1), 45-48 (1997) MEDLINE 98041638 REFERENCE 2 (bases 1 to 3428) AUTHORS Venema,R.C., Ju,H., Zou,R., Venema,V.J. and Ryan,J.W. TITLE Direct Submission JOURNAL Submitted (25-FEB-1997) Vascular Biology Center, Medical College of Georgia, 1120 15th Street, Augusta, GA 30912, USA FEATURES Location/Qualifiers source 1..3428 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 265..2286 /EC_number="3.4.11.9" /codon_start=1 /product="aminopeptidase P" /db_xref="PID:g2772610" /translation="MARAHWGCCPWLVLLCACAWGHTKPLDLGGQDVRNCSTNPPYLP VTVVNTTMSLTALRQQMQTQNLSAYIIPGTDAHMNEYIGQHDERRAWITGFTGSAGTA VVTMKKAAVWTDSRYWTQAERQMDCNWELHKEVGTTPIVTWLLTEIPAGGRVGFDPFL LSIDTWESYDLALQGSNRQLVSITTNLVDLVWGSERPPVPNQPIYALQEAFTGSTWQE KVSGVRSQMQKHQKVPTAVLLSALEETAWLFNLRASDIPYNPFFYSYTLLTDSSIRLF ANKSRFSSETLSYLNSSCTGPMCVQIEDYSQVRDSIQAYSLGDVRIWIGTSYTMYGIY EMIPREKLVTDTYSPVMMTKAVKNSKEQALLKASHVRDAVAVIRYLVWLEKNVPKGTV DEFSGAEIVDKFRGEEQFSSGPSFETISASGLNAALAHYSPTKELNRKLSSDEMYLLD SGGQYWDGTTDITRTVHWGTPSAFQKEAYTRVLIGNIDLSRLIFPAATSGRMVEAFAR RALWDAGLNYGHGTGHGIGNFLCVHEWPVGFQSNNIAMAKGMFTSIEPGYYKDGEFGI RLEDVALVVEAKTKYPGELPDLVVSFVPYDRNLIDVSLLSPEHLQYLNRYYQTIREKV GPELQRRQLLEEFEWLQQHTEPLAARAPDTASWASVLVVSTLAILGWSV" BASE COUNT 808 a 1032 c 859 g 729 t ORIGIN 1 caccctatcc tacactacta ggaacttgca cagtccgcct cgggcagccc aaagctcctc 61 tgcccaccct ggctcccaaa accctccaaa acaaaagacc agaaaagcac tctccaccca 121 gcagccaaac gcctccttct tgacgccagc ccccaccctc tgtctgctcg agcccaggaa 181 aggcctgaag gaacaggccg gggaaggagc cctccctctc tcccttgtcc ctccatccac 241 ccagcgccgg catctggaga ccctatggcc cgggctcact ggggctgctg cccctggctg 301 gtcctcctct gtgcttgtgc ctggggccac acaaagccac tggaccttgg agggcaggat 361 gtgagaaatt gttccaccaa ccccccttac cttccagtta ctgtggtcaa taccacaatg 421 tcactcacag ccctccgcca gcagatgcag acccagaatc tctcagccta catcatccca 481 ggcacagatg ctcacatgaa cgagtacatc ggccaacatg acgagaggcg tgcgtggatt 541 acaggcttta cagggtctgc aggaactgca gtggtgacta tgaagaaagc agctgtctgg 601 accgacagtc gctactggac tcaggctgag cggcaaatgg actgtaattg ggagctccat 661 aaggaagttg gcaccactcc tattgtcacc tggctcctca ccgagattcc cgctggaggg 721 cgtgtgggtt ttgacccctt cctcttgtcc attgacacct gggagagtta tgatctggcc 781 ctccaaggct ctaacagaca gctggtgtcc atcacaacca atcttgtgga cctggtatgg 841 ggatcagaga ggccaccggt tccaaatcaa cccatttatg ccctgcagga ggcattcaca 901 gggagcactt ggcaggagaa agtatctggc gtccgaagcc agatgcagaa gcatcaaaag 961 gtcccgactg ccgtccttct gtcggcgctt gaggagacgg cctggctctt caaccttcga 1021 gccagtgaca tcccctataa ccccttcttc tattcctaca cgctgctcac agactcttct 1081 attaggttgt ttgcaaacaa gagtcgcttt agctccgaaa ccttgagcta tctgaactcc 1141 agttgcacag gccccatgtg tgtgcaaatc gaggattaca gccaagttcg tgacagcatc 1201 caggcctact cattgggaga tgtgaggatc tggattggga ccagctatac catgtatggg 1261 atctatgaaa tgataccaag ggagaaactc gtgacagaca cctactcccc agtgatgatg 1321 accaaggcag tgaagaacag caaggagcag gccctcctca aggccagcca cgtgcgggac 1381 gctgtggctg tgatccggta cttggtctgg ctggagaaga acgtgcccaa aggcacagtg 1441 gatgagtttt cgggggcaga gatcgtggac aagttccgag gagaagaaca gttctcctcc 1501 ggacccagtt ttgaaaccat ctctgctagt ggtttgaatg ctgccctggc ccactacagc 1561 ccgaccaagg agctgaaccg caagctgtcc tcagatgaga tgtacctgct ggactctggg 1621 gggcagtact gggacgggac cacagacatc accagaacag tccactgggg caccccctct 1681 gcctttcaga aggaggcata tacccgtgtg ctgataggaa atattgacct gtccaggctc 1741 atctttcccg ctgctacatc agggcgaatg gtggaggcct ttgcccgcag agccttgtgg 1801 gatgctggtc tcaattatgg tcatgggaca ggccacggca ttggcaactt cctgtgtgtg 1861 catgagtggc cagtgggatt ccagtccaac aacatcgcta tggccaaggg catgttcact 1921 tccattgaac ctggttacta taaggatgga gaatttggga tccgtctcga agatgtggct 1981 ctcgtggtag aagcaaagac caagtaccca ggggagctac ctgaccttgt ggtatcattt 2041 gtgccctatg accggaacct catcgatgtc agcctgctgt ctcccgagca tctccagtac 2101 ctgaatcgct actaccagac catccgggag aaggtgggtc cagagctgca gaggcgccag 2161 ctactagagg agttcgagtg gcttcaacag cacacagagc ccctggccgc cagggcccca 2221 gacaccgcct cctgggcctc tgtgttagtg gtctccaccc ttgccatcct tggctggagt 2281 gtctagaggc tccagactct cctgttaacc ctccatctag atggggggct cccttgctta 2341 gctcccctca ccctgcactg aacatacccc aagagcccct gctggcccat tgcctagaaa 2401 cctttgcatt catcctcctt ctccaagacc tatggagaag gtcccaggcc ccaggaaaca 2461 cagggcttct tggccccaga tggcacctcc ctgcaccccg gggttgtata ccacaccctg 2521 ggcccctaat cccaggcccc gaaataggaa agccagctag tctcttctct tctgtgatct 2581 cagtaggcct aacctataac ctaacacaga ctgctacagc tgctcccctc ccgccaaaca 2641 aagccccaag aaaacaatgc ccctaccacc caagggtgcc atggtcccgg gaaaacccaa 2701 cctgtcaccg cgtgttgggc gtaaccagaa ctgttccccc ccaccagggc ttaaaaatcg 2761 cccccacttt ttaaccatcg tccattaacc acctggtggg catagccaga gctgttcgaa 2821 cccagccagg gatgaaaaat caacccccga catggaaccc atgattccta aacccggggt 2881 aggttccatg ccaagtaaca gcagagggag ttaagccata ggaatttggc tgtggagtaa 2941 gagggaatgc ggtgaggcag tgtggaatat gaccctacca gaggttggag aacaaacttg 3001 ggcagccgga acccgtcact attttagatt cctggcattc gaggagccct ttgaactttc 3061 caaagtgcag ccacagctac aatgctgtta aatcctccca catttcttgg atgccccttc 3121 accttgtgtg gacagtgtct ggtttcccca ttttacagac aggaaaactg agcttcagac 3181 agggggtggg ctttgcctaa ggacacacaa atttggttgg gagttgatgg ggccagatga 3241 gccagcattc cagctgtttc acccttcagc aacatgcaga gtccctgagc ccacctccca 3301 gccctctcct cattctctga acccactgtg gtgagaagaa tttgctccgg ccaaattggc 3361 cgttagccac ctgggtccac atcctgctaa gacgtttaaa acagcctaac aaagacactt 3421 gcctgtgg // LOCUS HSU90875 1407 bp mRNA PRI 19-APR-1997 DEFINITION Human cytotoxic ligand TRAIL receptor mRNA, complete cds. ACCESSION U90875 NID g1945071 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1407) AUTHORS Pan,G., O'Rourke,K., Chinnaiyan,A.M., Gentz,R., Ebner,R., Ni,J. and Dixit,V.M. TITLE The receptor for the cytotoxic ligand TRAIL JOURNAL Science 276 (5309), 111-113 (1997) MEDLINE 97238921 REFERENCE 2 (bases 1 to 1407) AUTHORS Pan,G., O'Rourke,K., Chinnaiyan,A.M., Gentz,R., Ebner,R., Ni,J. and Dixit,V.M. TITLE Direct Submission JOURNAL Submitted (25-FEB-1997) Pathology, University of Michigan, 1301 Catherine Road, Ann Arbor, MI 48109, USA FEATURES Location/Qualifiers source 1..1407 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1..1407 /function="cell death receptor" /codon_start=1 /product="cytotoxic ligand TRAIL receptor" /db_xref="PID:g1945072" /translation="MAPPPARVHLGAFLAVTPNPGSAASGTEAAAATPSKVWGSSAGR IEPRGGGRGALPTSMGQHGPSARARAGRAPGPRPAREASPRLRVHKTFKFVVVGVLLQ VVPSSAATIKLHDQSIGTQQWEHSPLGELCPPGSHRSERPGACNRCTEGVGYTNASNN LFACLPCTACKSDEEERSPCTTTRNTACQCKPGTFRNDNSAEMCRKCSTGCPRGMVKV KDCTPWSDIECVHKESGNGHNIWVILVVTLVVPLLLVAVLIVCCCIGSGCGGDPKCMD RVCFWRLGLLRGPGAEDNAHNEILSNADSLSTFVSEQQMESQEPADLTGVTVQSPGEA QCLLGPAEAEGSQRRRLLVPANGADPTETLMLFFDKFANIVPFDSWDQLMRQLDLTKN EIDVVRAGTAGPGDALYAMLMKWVNKTGRNASIHTLLDALERMEERHAKEKIQDLLVD SGKFIYLEDGTGSAVSLE" BASE COUNT 328 a 355 c 434 g 290 t ORIGIN 1 atggcgccac caccagctag agtacatcta ggtgcgttcc tggcagtgac tccgaatccc 61 gggagcgcag cgagtgggac agaggcagcc gcggccacac ccagcaaagt gtggggctct 121 tccgcgggga ggattgaacc acgaggcggg ggccgaggag cgctccctac ctccatggga 181 cagcacggac ccagtgcccg ggcccgggca gggcgcgccc caggacccag gccggcgcgg 241 gaagccagcc ctcggctccg ggtccacaag accttcaagt ttgtcgtcgt cggggtcctg 301 ctgcaggtcg tacctagctc agctgcaacc atcaaacttc atgatcaatc aattggcaca 361 cagcaatggg aacatagccc tttgggagag ttgtgtccac caggatctca tagatcagaa 421 cgtcctggag cctgtaaccg gtgcacagag ggtgtgggtt acaccaatgc ttccaacaat 481 ttgtttgctt gcctcccatg tacagcttgt aaatcagatg aagaagagag aagtccctgc 541 accacgacca ggaacacagc atgtcagtgc aaaccaggaa ctttccggaa tgacaattct 601 gctgagatgt gccggaagtg cagcacaggg tgccccagag ggatggtcaa ggtcaaggat 661 tgtacgccct ggagtgacat cgagtgtgtc cacaaagaat caggcaatgg acataatata 721 tgggtgattt tggttgtgac tttggttgtt ccgttgctgt tggtggctgt gctgattgtc 781 tgttgttgca tcggctcagg ttgtggaggg gaccccaagt gcatggacag ggtgtgtttc 841 tggcgcttgg gtctcctacg agggcctggg gctgaggaca atgctcacaa cgagattctg 901 agcaacgcag actcgctgtc cactttcgtc tctgagcagc aaatggaaag ccaggagccg 961 gcagatttga caggtgtcac tgtacagtcc ccaggggagg cacagtgtct gctgggaccg 1021 gcagaagctg aagggtctca gaggaggagg ctgctggttc cagcaaatgg tgctgacccc 1081 actgagactc tgatgctgtt ctttgacaag tttgcaaaca tcgtgccctt tgactcctgg 1141 gaccagctca tgaggcagct ggacctcacg aaaaatgaga tcgatgtggt cagagctggt 1201 acagcaggcc caggggatgc cttgtatgca atgctgatga aatgggtcaa caaaactgga 1261 cggaacgcct cgatccacac cctgctggat gccttggaga ggatggaaga gagacatgca 1321 aaagagaaga ttcaggacct cttggtggac tctggaaagt tcatctactt agaagatggc 1381 acaggctctg ccgtgtcctt ggagtga // LOCUS HSU90878 1108 bp mRNA PRI 24-MAR-1997 DEFINITION Human LIM domain protein CLP-36 mRNA, complete cds. ACCESSION U90878 NID g1905873 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1108) AUTHORS Kotaka,M., Tsui,S.K.W., Fung,K.P., Lee,C.Y. and Waye,M.M.Y. TITLE Molecular cloning and sequencing of a novel LIM domain protein, human CLP-36 JOURNAL Unpublished REFERENCE 2 (bases 1 to 1108) AUTHORS Kotaka,M., Tsui,S.K.W., Fung,K.P., Lee,C.Y. and Waye,M.M.Y. TITLE Direct Submission JOURNAL Submitted (26-FEB-1997) Department of Biochemistry, Chinese University of Hong Kong, Shatin, New Territories, Hong Kong, China FEATURES Location/Qualifiers source 1..1108 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="heart" CDS 17..1006 /note="36 kDa" /codon_start=1 /product="LIM domain protein CLP-36" /db_xref="PID:g1905874" /translation="MTTQQIDLQGPGPWGFRLVGRKDFEQPLAISRVTPGSKAALANL CIGDVITAIDGENTSNMTHLEAQNRIKGCTDNLTLTVARSEHKVWSPLVTEEGKRHPY KMNLASEPQEVLHIGSAHNRSAMPFTASPASSTTARVITNQYNNPAGLYSSENISNFN NALESKTAASGVEANSRPLDHAQPPSSLVIDKESEVYKMLQEKQELNEPPKQSTSFLV LQEILESEEKGDPNKPSGFRSVKAPVTKVAASIGNAQKLPMCDKCGTGIVGVFVKLRD RHRHPECYVCTDCGTNLKQKGHFFVEDQIYCEKHARERVTPPEGYEVVTVFPK" BASE COUNT 279 a 302 c 279 g 248 t ORIGIN 1 cccgcacagc cgcgccatga ccacccagca gatagacctc cagggcccgg ggccgtgggg 61 cttccgcctc gtggggcgaa aggacttcga gcagcctctc gccatttccc gggtcactcc 121 tggaagcaag gcggctctag ctaatttatg tattggagat gtaatcacag ccattgatgg 181 ggaaaatact agcaatatga cacacttgga agctcagaac agaatcaaag gctgcacaga 241 caacttgact ctcactgtag ccagatctga acataaagtc tggtctcctc tggtgacgga 301 ggaagggaag cgtcatccat acaagatgaa tttagcctct gaaccccagg aggtcctgca 361 cataggaagc gcccacaacc gaagtgccat gccctttacc gcctcgcctg cctccagcac 421 tactgccagg gtcatcacaa accagtacaa caacccagct ggcctctact cttctgaaaa 481 tatctccaac ttcaacaatg ccctggagtc aaagactgct gccagcgggg tggaggcgaa 541 cagcagaccc ttagaccatg ctcagcctcc aagcagcctt gtcatcgaca aagaatctga 601 agtttacaag atgcttcagg agaaacagga gttgaatgag cccccgaaac agtccacgtc 661 tttcttggtt ttgcaggaaa tcctggagtc tgaagaaaaa ggggatccca acaagccctc 721 aggattcaga agtgttaaag ctcctgtcac taaagtggct gcgtcgattg gaaatgctca 781 gaagttgcct atgtgtgaca aatgtggcac tgggattgtt ggtgtgtttg tgaagctgcg 841 ggaccgtcac cgccaccctg agtgttatgt gtgcactgac tgtggcacca acctgaaaca 901 gaagggccat ttctttgtgg aggatcaaat ctactgtgag aagcatgccc gggagcgagt 961 cacaccacct gagggttatg aagtggtcac tgtgttcccc aagtgagcca gcagatctga 1021 ccactgttct ccagcaggcc tctgctgcag cttttctctc agtgttctgg ccctctcctc 1081 tcttgaaagt tctctgctta ctttggtt // LOCUS HSU90908 1871 bp mRNA PRI 30-MAR-1997 DEFINITION Human clones 23549 and 23762 mRNA, complete cds. ACCESSION U90908 NID g1913887 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1871) AUTHORS Andersson,B., Wentland,M.A., Ricafrente,J.Y., Liu,W. and Gibbs,R.A. TITLE A 'double adaptor' method for improved shotgun library construction JOURNAL Anal. Biochem. 236 (1), 107-113 (1996) MEDLINE 96207227 REFERENCE 2 (bases 1 to 1871) AUTHORS Yu,W., Andersson,B., Worley,K.C., Muzny,D.M., Ding,Y., Liu,W., Ricafrente,J.Y., Wentland,M.A., Lennon,G. and Gibbs,R.A. TITLE Large-scale concatenation cDNA sequencing JOURNAL Unpublished REFERENCE 3 (bases 1 to 1871) AUTHORS Yu,W. and Gibbs,R.A. TITLE Direct Submission JOURNAL Submitted (26-FEB-1997) Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza S930, Houston, TX 77030, USA COMMENT similar to human KIAA0053 gene deposited under GenBank Accession Number D29642. FEATURES Location/Qualifiers source 1..1871 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="female" /dev_stage="infant" /tissue_type="brain" /clone_lib="Soares library 1NIB from IMAGE consortium" /clone="23549 and 23762" CDS 1..1485 /codon_start=1 /product="unknown" /db_xref="PID:g1913888" /translation="MGQPREFWHLRSQVPVPTCHLLTVQPWCLLPHPTPAGGRERRHS ASSLLAALCLRKAVLRPHPAVLPGVGGCVVLSSGRMKAPLRGGLALGTGPAPTKLQLL HDWQSEPGPSGWQQGPSPPLFPAGTSLVQHLMTVLIRKHSQLFTAPVPEGPTSPRGGL QCAVGWGSEEVTRDSQGEPGGPGLPAHRTSSLDGAAVAVLSRTAPTGPGSRCSPGKKV QTLPSWKSSFRQPRSLSGSPKGGGSSLEVPIISSGGNWLMNGLSSLRGHRRASSGDRL KDSGSVQRLSTYDNVPAPGLVPGIPSVASMAWSGASSSESSVGGSLSSCTACRASDSS ARSSLHTDWALEPSPLPSSSEDPKSLDLDHSMDEAGAGASNSEPSEPDSPTREHARRS EALQGLVTELRAELCRQRTEYERSVKRIEEGSADLRKRMSRLEEELDQEKKKYIMLEI KLRNSERAREDAERRNQLLQREMEEFFSTLGSLTVGAKGARAPK" BASE COUNT 386 a 580 c 601 g 304 t ORIGIN 1 atggggcagc cgagagagtt ctggcatctc aggtcccagg ttccagttcc aacctgccac 61 ctgctcaccg tgcagccttg gtgcctgctt cctcacccca ccccagctgg agggcgtgag 121 cgcaggcaca gtgcttcctc cctgcttgcg gctctgtgcc tgaggaaggc ggttctgcgg 181 ccccatcctg ctgtcctgcc tggagttgga gggtgtgtgg tcctttcctc tgggcggatg 241 aaggctccct tgaggggcgg cctggctctg gggacagggc cagctcccac gaagctgcag 301 cttctccatg actggcagag cgagcctggg ccgagtggct ggcagcaagg cccttctccg 361 cctctcttcc cggcaggcac ttccctcgtc cagcacctga tgaccgtcct catccgcaaa 421 cacagccagc tcttcacggc accggtcccg gaagggccca cctccccgcg cgggggcctg 481 caatgcgcag tggggtgggg ctccgaggag gtcaccaggg acagccaagg agagcccggc 541 ggccccggcc tgcccgcgca caggacctct tccctggacg gggcggccgt ggcggtgctc 601 tccagaacag cccccacggg gccggggagc cggtgcagcc ctgggaagaa ggtgcagacc 661 ctgcccagtt ggaagtcctc cttccggcag ccgaggtccc tatcgggaag cccgaagggg 721 ggcggctcat ccctggaggt gcccatcatc tcctccggcg ggaactggct tatgaacggg 781 ctgtcctccc tgcgcggaca ccgccgggcc tcgtcgggag accggctcaa ggactcgggc 841 tccgtgcaga gactctccac ctacgacaat gtgcccgcgc cgggcctggt ccccggcata 901 cccagcgtgg ccagtatggc gtggtccggg gcctcgtcca gcgagtcgtc ggtggggggc 961 tcactcagca gctgcacggc ctgccgcgcc agcgactcgt ctgcccgcag ttccctgcac 1021 accgactggg ccctggagcc ctccccgctc cccagcagca gcgaggaccc caagtccctg 1081 gacctggacc acagcatgga cgaggcgggc gcgggtgcca gcaacagcga gcccagcgag 1141 ccggacagcc ccacccggga acacgcgcgc cgctccgagg ccttacaggg gctggtcact 1201 gagctcaggg ccgagctgtg ccgccagcgg actgagtacg agaggagtgt gaaaagaatc 1261 gaagaaggga gtgctgacct gagaaaacga atgtcccggt tagaagaaga actggaccag 1321 gaaaagaaaa aatacatcat gctggaaata aagctgcgga actctgaacg ggcgcgggag 1381 gatgcggaga ggaggaacca gctgttgcag agggaaatgg aggagttttt ttcgacccta 1441 ggaagcttga ctgttggggc aaaaggtgcc agggccccaa agtaaaagga atggcagagc 1501 tcacttctgt accacgtctg ctggtctcca gccttgtatg gagttagaag cgtctgtatc 1561 tctggagcag ccaggcgctc tggagccagc tggagagaga gagatcctga tacctctgtg 1621 gggactgtgg ggacttttgg gaccccacac actccaggtg ggatcagatg ctgctccaac 1681 catgcagttc ctggtgaggg tcagaagggg acggtaccaa gagcagcgct tagcccttac 1741 ccaggaaata tccttcatgg ccacagaaat ggagggcgcc caggatccag gcagccaccg 1801 ggaacagtca gctttcttta ttaaatgtgc tcacaaagca aaaaaaaaaa aaaaaaaaaa 1861 aaaaaaaaaa a // LOCUS HSU90919 2207 bp mRNA PRI 30-MAR-1997 DEFINITION Human clones 23667 and 23775 zinc finger protein mRNA, complete cds. ACCESSION U90919 NID g1913900 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2207) AUTHORS Andersson,B., Wentland,M.A., Ricafrente,J.Y., Liu,W. and Gibbs,R.A. TITLE A 'double adaptor' method for improved shotgun library construction JOURNAL Anal. Biochem. 236 (1), 107-113 (1996) MEDLINE 96207227 REFERENCE 2 (bases 1 to 2207) AUTHORS Yu,W., Andersson,B., Worley,K.C., Muzny,D.M., Ding,Y., Liu,W., Ricafrente,J.Y., Wentland,M.A., Lennon,G. and Gibbs,R.A. TITLE Large-scale concatenation cDNA sequencing JOURNAL Unpublished REFERENCE 3 (bases 1 to 2207) AUTHORS Yu,W. and Gibbs,R.A. TITLE Direct Submission JOURNAL Submitted (26-FEB-1997) Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza S930, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..2207 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="female" /dev_stage="infant" /tissue_type="brain" /clone_lib="Soares library 1NIB from IMAGE consortium" /clone="23667 and 23775" CDS 183..1619 /note="similar to human zinc finger protein encoded by GenBank Accession Number M91592" /codon_start=1 /product="zinc finger protein" /db_xref="PID:g1913901" /translation="MLSDELESKPELLVQFVQNTSIPLGQGLVESEAKDITCLSLLPV TEASECSRLMLPDDTTNHSNSSKEVPSSAVLRSLRVNVGPDGEETRAQTVQKSPEFLS TSESSSLLQDLQPSDSTSFILLNLTRAGLGSSAEHLVFVQDEAEDSGNDFLSSESTDS SIPWFLRVQELAHDSLIAATRAQLAKNAKTSSNGENVHLGSGDGQSKDSGPLPQVEKK LKCTVEGCDRTFVWPAHFKYHLKTHRNDRSFICPAEGCGKSFYVLQRLKVHMRTHNGE KPFMCHESGCGKQFTTAGNLKNHRRIHTGEKPFLCEAQGCGRSFAEYSSLRKHLVVHS GEKPHQCQVCGKTFSQSGSRNVHMRKHHLQLGAAGSQEQEQTAEPLMGSSLLEEASVP SKNLVSMNSQPSLGGESLNLPNTNSILGVDDEVLAEGSPRSLSSVPDVTHHLVTMQSG RQSYEVSVLTAVNPQELLNQGDLTERRT" BASE COUNT 623 a 491 c 524 g 569 t ORIGIN 1 actgacggcc ggccggcttc ccggaactgg aaggttacat tgattaccca cctagtacaa 61 catcttacgg gaagagcata gtatttccta gaggaatatg aacataacag gaaggtatca 121 ttggctctga attaaatttg aacttgtccc ctgaatagct acaggttttg gaagctgaat 181 caatgttatc agatgagtta gaatccaaac cagagctcct ggtacagttt gttcagaata 241 cgtccatccc attgggacag gggcttgtag aatcagaagc taaagatatt acttgcttgt 301 ccctccttcc cgtgactgaa gcctcagaat gcagtcggct aatgttacca gatgatacta 361 caaatcattc taactcctcc aaggaggtcc cttcctcagc tgttttgaga agccttcggg 421 tgaatgtggg tccagacgga gaggagacga gagctcagac tgtacagaaa tccccggagt 481 ttttgtccac ttcagagtct tctagcttgt tgcaagatct acagccaagt gatagcactt 541 cttttattct tcttaaccta acaagagcag gtctgggctc ttcagctgag cacttagtgt 601 ttgtacagga tgaggcagaa gattcaggga atgatttcct ctccagtgag agcacagaca 661 gtagcattcc atggttcctc cgggttcagg agttggccca tgacagtttg attgctgcta 721 ctcgtgcaca actggcaaag aatgcaaaaa ccagcagcaa tggagaaaat gtccaccttg 781 gttctggtga tgggcagtca aaagattctg ggccccttcc tcaagtggaa aagaagctca 841 agtgtacagt tgaaggttgt gaccggacat ttgtatggcc agctcacttt aaataccacc 901 tcaagactca tcgaaatgac cgctccttca tctgtcctgc agaaggttgt gggaaaagct 961 tctatgtgct gcagaggctg aaggtgcaca tgaggaccca caatggagag aagcccttta 1021 tgtgccatga gtctggctgt ggtaagcagt ttactacagc tggaaacctg aagaaccacc 1081 ggcgcatcca cacaggagag aaacctttcc tttgtgaagc ccaaggatgt ggccgttcct 1141 ttgctgagta ttctagcctc cgaaaacatc tggtggttca ctcaggagag aagcctcatc 1201 agtgccaagt ctgtgggaag accttctctc agagtggaag caggaatgtg catatgagaa 1261 agcatcacct gcagctggga gcagctggga gtcaagagca ggagcaaact gctgagccac 1321 taatgggcag tagtttgctt gaagaggctt cagtacccag taaaaacctg gtgtctatga 1381 attcccagcc cagccttggt ggagagtcct tgaacctacc aaataccaat tctatcctgg 1441 gagttgatga tgaggtgctt gctgaaggat ccccacgttc cctgtcttca gtgcctgatg 1501 tgacacatca cctggtgacc atgcagtcag ggaggcaatc atatgaagtt tctgtcttaa 1561 ctgcagtaaa tccacaagag ttactaaacc aaggagattt aactgaaaga cggacatgag 1621 cgtgggtgct gactcctgga agagcaactc tatctgatct caaaatgcgt atactgggaa 1681 caggatgcct tagcccacaa cagaaccaga atgaatcttt gaaggcacaa gactctgctt 1741 ttgccactct tcctctttcc tggtatagaa gatggatgta ggagagcttc ttttctaact 1801 accatctgat cagacaagga atgaagcaat gactgtgggc tgggaaactg tacctacctc 1861 tcttcccact gcaaatttct gggatagacc aaaagtgaat ttgattatgt gttggctgaa 1921 gttcttcatt ctgactgttg aggggaggtt ttcctttgaa gagttttcat cccagactca 1981 gctgtctttt cacatggatg aaataattcc tgctaccaac aacagagctt caccaggaag 2041 ttgagttttc aagatgcctt gttgctttga agaagggagt gatgtcaatt ctcttgttac 2101 attctccctt tagcaacctg agtaagagac tctctgccac tgggctgcaa aaaaataaat 2161 tacttgaatc tccccttgaa aaaaaaaaaa aaaaaaaaaa aaaaaaa // LOCUS HSU90920 5238 bp mRNA PRI 15-OCT-1997 DEFINITION Human PTPL1-associated RhoGAP mRNA, complete cds. ACCESSION U90920 NID g2522321 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5238) AUTHORS Saras,J., Franzen,P., Aspenstrom,P., Hellman,U., Gonez,L.J. and Heldin,C.H. TITLE A novel GTPase-activating protein for Rho interacts with a PDZ domain of the protein-tyrosine phosphatase PTPL1 JOURNAL J. Biol. Chem. 272 (39), 24333-24338 (1997) MEDLINE 97450957 REFERENCE 2 (bases 1 to 5238) AUTHORS Saras,J., Franzen,P., Aspenstrom,P., Hellman,U., Gonez,L.J. and Heldin,C.-H. TITLE Direct Submission JOURNAL Submitted (26-FEB-1997) Ludwig Institute for Cancer Research, Box 595 Biomedical Center, Husargatan 3, Uppsala S-75124, Sweden FEATURES Location/Qualifiers source 1..5238 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="skeletal muscle" CDS 184..3969 /note="PARG" /codon_start=1 /product="PTPL1-associated RhoGAP" /db_xref="PID:g2522322" /translation="MIAHKQKKTKKKRAWASGQLSTDITTSEMGLKSLSSNSIFDPDY IKELVNDIRKFSHILLYLKEAIFSDCFKEVIHIRLEELLRVLKSIMNKHQNLNSVDLQ NAAEMLTAKVKAVNFTEVNEENKNDLFQEVFSSIETLAFTFGNILTNFLMGDVGNDSF LRLPVSRETKSFENVSVESVDSSSEKGNFSPLELDNVLLKNTDSIELALSYAKTWSKY TKNIVSWVEKKLNLELESTRNMVKLAEATRTNIGIQEFMPLQSLFTNALLNDIESSHL LQQTIAALQANKFVQPLLGRKNEMEKQRKEIKELWKQEQNKMLEAENALKKAKLLCMQ RQDEYEKAKSSMFRAEEEHLSSSGGLAKNLNKQLEKKRRLEEEALQKVEEADELYKVC VTNVEERRNDVENTKREILAQLRTLVFQCDLTLKAVTVNLFHMQHLQAASLADRLQSL CGSAKLYDPGQEYSEFVKATNSTEEEKVDGNVNKHLNSSQPSGFGPANSLEDVVRLPD SSNKIEEDRCSNSADITGPSFIRSWTFGMFSDSESTGGSSESRSLDSESISPGDFHRK LPRTPSSGTMSSADDLDEREPPSPSETGPNSLGTFKKTLMSKAALTHKFRKLRSPTKC RDCEGIVVFQGVECEECLLVCHRKCLENLVIICGHQKLPGKIHLFGAEFTLVAKKEPD GIPFILKICASEIENRALCLQGIYRVCGNKIKTEKLCLALENGMHLVDISEFSSHDIC DVLKLYLRQLPEPFILFRLYKEFIDLAKEIQHVNEEQETKKNSLEDKKWPNMCIEINR ILLKSKDLLRQLPASNFNSLHFLIVHLKRVVDHAEENKMNSKNLGVIFGPSLIRPRPQ TAPITISSLAEYSNQARLVEFLITYSQKIFDGSLQPQDVMCSIGVVDQGCFPKPLLSP EERDIERSMKSLFFSSKEDIHTSESESKIFERATSFEESERKQNALGKCDACLSDKAQ LLLDQEAESASQKIEDGKAPKPLSLKSDRSTNNVERHTPRTKIRPVSLPVDRLLLASP PNERNGRNMGNVNLDKFCKNPAFEGVNRKDAATTVCSKFNGFDQQTLQKIQDKQYEQN SLTAKTTMIMPSALQEKGVTTSLQISGDHSINATQPSKPYAEPVRSVREASERRSSDS YPLAPVRAPRTLQPQHWTTFYKPHAPIISIRGNEEKPASPSAACPPGTDHDPHGLVVK SMPDPDKASACPGQATGQPKEDSEELGLPDVNPMCQRPRLKRMQQFEDLEDEIPQFV" BASE COUNT 1676 a 948 c 1071 g 1543 t ORIGIN 1 gctgtggctg cggctgcggc tgcggctgag atttggccgg gcgtccgcag gccgtggggg 61 atgggggcag cgagctccag ccctcggcgg tggcggcggc cgtaggtgtg gggcgggcgt 121 ccgcgtccgg cacgcgagat ggagcgccgt ggatttcagt ttttctgact gttacatgaa 181 aggatgattg ctcacaaaca gaaaaagaca aagaaaaaac gtgcttgggc atcaggtcaa 241 ctctctactg atattacaac ttctgaaatg gggctcaagt ccttaagttc caactctatt 301 tttgatccgg attacatcaa ggagttggtg aatgatatca ggaagttctc ccacatctta 361 ctatatttga aagaagccat attttcagac tgttttaaag aagttattca tatacgtcta 421 gaggaactgc tccgtgtttt aaagtctata atgaataaac atcagaacct caattctgtt 481 gatcttcaaa atgctgcaga aatgctcact gcaaaagtga aagctgtgaa cttcacagaa 541 gttaatgaag aaaacaaaaa cgatctcttc caggaagtgt tttcttctat tgaaactttg 601 gcatttacct ttggaaatat ccttacaaac ttccttatgg gagatgtagg caatgattca 661 ttcttgcgac tgcctgtttc tcgagaaact aagtcgtttg aaaatgtttc tgtggaatca 721 gtggactcat ccagtgaaaa aggaaatttt tcccctttag aactagacaa cgtgctgtta 781 aagaacactg actctatcga gctggctttg tcatatgcta aaacttggtc aaaatatact 841 aagaacatag tttcatgggt tgaaaaaaag cttaacttgg aattggagtc cactagaaat 901 atggtcaagt tggcagaggc aactagaact aacattggaa ttcaggagtt catgccactg 961 cagtctctgt ttactaatgc tcttcttaat gatatagaaa gcagtcacct tttacaacaa 1021 acaattgcag ctctccaggc taacaaattt gtgcagcctc tacttggaag gaaaaatgaa 1081 atggaaaaac aaaggaaaga aataaaagag ctttggaaac aggagcaaaa taaaatgctt 1141 gaagcagaga atgctctcaa aaaggcaaaa ttattatgca tgcaacgtca agatgaatat 1201 gagaaagcaa agtcttccat gtttcgtgca gaagaggagc atctgtcttc aagtggcgga 1261 ttagcaaaaa atctcaacaa gcaactagaa aaaaagcgaa ggttggaaga ggaggctctc 1321 caaaaagtag aagaagcaga tgaactttac aaagtttgtg tgacaaatgt tgaagaaaga 1381 agaaatgatg tagaaaatac caaaagagaa attttagcac aactccggac acttgttttc 1441 cagtgtgatc ttacccttaa agcggtaaca gttaacctct tccacatgca gcatctgcag 1501 gctgcttccc ttgcagacag attacagtct ctctgtggta gtgccaaact ctatgaccca 1561 ggccaagagt acagtgaatt tgtcaaggcc acaaattcaa ctgaagaaga aaaagttgat 1621 ggaaatgtaa ataaacattt aaatagttcc caaccttcag gatttggacc tgccaactct 1681 ttagaggatg ttgtacgcct tcctgacagt tctaataaaa ttgaagagga cagatgctct 1741 aacagtgcag atataacagg tccttccttt ataagatcat ggacatttgg gatgtttagt 1801 gattctgaga gcactggagg gagcagcgaa tctagatctc tggattcaga atctataagt 1861 ccaggagact ttcatcgaaa acttccacga acaccatcca gtggaactat gtcctctgca 1921 gatgatctag atgaaagaga gccaccttcc ccttcagaaa ctggacccaa ttcccttgga 1981 acatttaaga aaacattgat gtcaaaggca gctctcacac acaagtttcg caaattgaga 2041 tcccccacga aatgtaggga ttgtgaaggc attgtagtgt tccaaggtgt tgaatgtgaa 2101 gagtgtctcc ttgtttgtca tcgaaagtgt ttggaaaatt tagtcattat ttgtggtcat 2161 cagaaacttc caggaaaaat acacttattt ggagcagaat tcacactagt tgcaaaaaag 2221 gaaccagatg gtatcccttt tatactcaaa atatgtgcct cagagattga aaatagagct 2281 ttgtgtctac agggaattta tcgtgtgtgt ggaaacaaaa taaaaactga aaaattgtgt 2341 ctagctttgg aaaatggtat gcacttggta gatatttcag aatttagttc acatgatatc 2401 tgtgacgtct tgaaattata ccttcggcag ctcccagaac catttatttt atttcgattg 2461 tacaaggaat ttatagacct tgcaaaagag atccaacatg taaatgaaga acaagagaca 2521 aaaaagaata gtcttgaaga caaaaaatgg ccaaatatgt gtatagaaat aaaccgaatt 2581 cttctaaaaa gcaaagacct tctaagacaa ttgccagcat caaattttaa cagtcttcat 2641 ttccttatag tacatctaaa gcgggtagta gatcatgcag aagaaaacaa gatgaactcc 2701 aaaaacttgg gggtgatatt tggaccaagt ctcattaggc caaggccaca aactgctcct 2761 atcaccatct cctcccttgc agagtattca aatcaagcac gcttggtaga gtttctcatt 2821 acttactcac agaagatctt cgatgggtcc ctacaaccac aagatgttat gtgtagcata 2881 ggtgttgttg atcaaggctg ttttccaaag cctctgttat caccagaaga aagagacatt 2941 gaacgttcca tgaagtcact atttttttct tcaaaggaag atatccatac ttcagagagt 3001 gaaagcaaaa tttttgaacg agctacatca tttgaggaat cagaacgcaa gcaaaatgcg 3061 ttaggaaaat gtgatgcatg tctcagtgac aaagcacagt tgcttctaga ccaagaggct 3121 gaatcagcat cccaaaagat agaagatggt aaagccccta agccactttc tctgaaatct 3181 gataggtcaa caaacaatgt ggagaggcat actccaagga ccaagattag acctgtaagt 3241 ttgcctgtag atagactact tcttgcaagt cctcctaatg agagaaatgg cagaaatatg 3301 ggaaatgtaa atttagacaa gttttgcaag aatcctgcct ttgaaggagt taatagaaaa 3361 gacgctgcta ctactgtttg ttccaaattt aatggctttg accagcaaac tctacagaaa 3421 attcaggaca aacagtatga acaaaacagc ctaactgcca agactacaat gatcatgccc 3481 agtgcactcc aggaaaaagg agtgacaaca agcctccaga ttagtgggga ccattctatc 3541 aatgccactc aacccagtaa gccatatgca gagccagtca ggtcagtgag agaggcatct 3601 gagagacggt cttcagattc ctaccctctc gctcctgtca gagcacccag aacactgcag 3661 cctcaacatt ggacaacatt ttataaacca catgctccca tcatcagtat cagggggaat 3721 gaggagaagc cagcttcacc ctcagcagca tgccctcctg gcacagatca cgatccccac 3781 ggtctcgtgg tgaagtcaat gccagaccca gacaaagcat cagcttgtcc tgggcaagca 3841 actggtcaac ctaaagaaga ctctgaggag cttggcttgc ctgatgtgaa tccaatgtgt 3901 cagagaccaa ggctaaaacg aatgcaacag tttgaagacc tcgaagatga aattccacaa 3961 tttgtgtagg gatgtcaaat ttcagggttt ttttgttgtt gttgtgttat tttgtggtat 4021 tgtgcttgtt ttgtgaaaga atgttttgac agggcccctt ttgtatagga ctgccaaatc 4081 atgggttttg ccttttgttg ttgtatttat cctctgttgg taatactgaa tggtagaatg 4141 ttttgatagg gtcacatttg tgcctcactg gaattatctt taaattctgt atttttaaag 4201 ttgtgaataa gataggtgga ttcgtatttt ttaaagttca gttgactttc cccaccaaat 4261 ggtccatttg aatgcatccc taatatatga tatagtctca actaataggt gcaatttggg 4321 aaaatcaggt ttattttttg gagtggaact gttataagtg cttatttata aaaggaatgt 4381 ttctgaatgc aagtgcctaa aaagatcttt gttggtatgc atatgttttg tcacacaatt 4441 tatagtgcat ctttcaccat ttgtgctttt ttaagatagt atgtaagctc ttatttttca 4501 attggcaatt cagttaattt ttaaatgttt acataatggc cagaaggctt gcaaatctgt 4561 atttaattgc attttaatta attgccagtt tttacatgta gtagtcagtt gtacaaagaa 4621 aatgcactta aacctgtttc taaattatat attcagttat attatatttg gctttagatg 4681 gttttaatac atttgatagt ttttcacccc ttggctttat tttatataaa cttttgtttt 4741 tcagcagttc tgaacttttt agtattttat aaatggtcca aaaaatgcct gtttcagaag 4801 tttttgaatt cagtgcattt cctcttgatt tgtctgggtt aaaaccattc cttttgtatg 4861 aaatgttttg acttaggaat cattttatgt acttgttcta cctggattgt caacaactga 4921 aagtacatat ttcatccaaa tcaagctaaa atttatttaa gttgattctg agagtacagg 4981 tcagtaagcc tcattatttg gaatttgaga gaagtatagg tgatcggatc tgtttcattt 5041 ataaaaggtc cagtttttag gactagtaca ttcctgttat tttctgggtt ttatcatttt 5101 gcctaaaata ggatataaaa gggacaaaaa ataagtagac tgtttttatg tgtgaattat 5161 atttctacta aatgtttttg tatgactgtg ttatacttga taatatatat atatatatat 5221 aaaaaaaaaa aaaaaaaa // LOCUS HSU90943 1077 bp mRNA PRI 01-JAN-1998 DEFINITION Human voltage dependent anion channel form 3 mRNA, complete cds. ACCESSION U90943 NID g2735306 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1077) AUTHORS Rahmani,Z. and Siddiqui,A. TITLE Human voltage dependent anion channel form 3 JOURNAL Unpublished REFERENCE 2 (bases 1 to 1077) AUTHORS Rahmani,Z. and Siddiqui,A. TITLE Direct Submission JOURNAL Submitted (26-FEB-1997) URA 1335 CNRS, Hopital Necker-Enfants Malades, 156 Rue de Vaugirard, Paris 75015, France FEATURES Location/Qualifiers source 1..1077 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="8" CDS 1..852 /codon_start=1 /product="voltage dependent anion channel form 3" /db_xref="PID:g2735307" /translation="MCNTPTYCDLGKAAKDVFNKGYGFGMVKIDLKTKSCSGVEFSTS GHAYTDTGKASGNLETKYKVCNYGLTFTQKWNTDNTLGTEISWENKLAEGLKLTLDTI FVPNTGKKSGKLKASYKRDCFSVGSNVDIDFSGPTIYGWAVLAFEGWLAGYQMSFDTA KSKLSQNNFALGYKAADFQLHTHVNDGTEFGGSIYQKVNEKIETSINLAWTAGSNNTR FGIAAKYMLDCRTSLSAKVNNASLIGLGYTQTLRPGVKLTLSALIDGKNFSAGGHKVG LGFELEA" BASE COUNT 310 a 207 c 264 g 296 t ORIGIN 1 atgtgtaaca caccaacgta ctgtgaccta ggaaaggctg ctaaggatgt cttcaacaaa 61 ggatatggct ttggcatggt caagatagac ctgaaaacca agtcttgtag tggagtggaa 121 ttttctactt ctggtcatgc ttacactgat acagggaaag catcaggcaa cctagaaacc 181 aaatataagg tctgtaacta tggacttacc ttcacccaga aatggaacac agacaatact 241 ctagggacag aaatctcttg ggagaataag ttggctgaag ggttgaaact gactcttgat 301 accatatttg taccgaacac aggaaagaag agtgggaaat tgaaggcctc ctataaacgg 361 gattgtttta gtgttggcag taatgttgat atagattttt ctggaccaac catctatggc 421 tgggctgtgt tggccttcga agggtggctt gctggctatc agatgagttt tgacacagcc 481 aaatccaaac tgtcacagaa taatttcgcc ctgggttaca aggctgcgga cttccagctg 541 cacacacatg tgaacgatgg cactgaattt ggaggttcta tctaccagaa ggtgaatgag 601 aagattgaaa catccataaa ccttgcttgg acagctggga gtaacaacac ccgttttggc 661 attgctgcta agtacatgct ggattgtaga acttctctct ctgctaaagt aaataatgcc 721 agcctgattg gactgggtta tactcagacc cttcgaccag gagtcaaatt gactttatca 781 gctttaatcg atgggaagaa cttcagtgca ggaggtcaca aggttggctt gggatttgaa 841 ctggaggctt aatgtggttt gaggaaagca tcagattttg tccctggaag tgaagagaaa 901 tgaacccact atgttttggc cttaaaattc ttctgtgaaa tttcaaaagt gtgaactttt 961 tattcttcca aagaattgta atcctcccca cactgaagtc tagggggttg cgaatccctc 1021 ctgagggaga tgcctgaagg catgcctgga agttgtcatg tttgtgcacg tttcagt // LOCUS HSU91510 1500 bp mRNA PRI 15-OCT-1997 DEFINITION Human CD39L1 mRNA, complete cds. ACCESSION U91510 NID g2522323 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1500) AUTHORS Chadwick,B.P. and Frischauf,A.M. TITLE Cloning and mapping of a human and mouse gene with homology to ecto-ATPase genes JOURNAL Mamm. Genome 8 (9), 668-672 (1997) MEDLINE 97419269 REFERENCE 2 (bases 1 to 1500) AUTHORS Chadwick,B.P. and Frischauf,A.-M. TITLE Direct Submission JOURNAL Submitted (27-FEB-1997) MAMM, ICRF, 44 Lincoln's Inn Fields, London WC2A 3PX, U.K FEATURES Location/Qualifiers source 1..1500 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="9" /map="9q34.3" CDS 22..1440 /note="similar to Gallus gallus cell membrane ecto-ATPase encoded by GenBank Accession Number U74467" /codon_start=1 /product="CD39L1" /db_xref="PID:g2522324" /translation="MAGKVRSLLPPLLLAAAGLAGLLLLCVPTRDVREPPALKYGIVL DAGSSHTSMFIYKWPADKENDTGIVGQHSSCDVPGGGISSYADNPSGASQSLVGCLEQ ALQDVPKERHAGTPLYLGATAGMRLLNLTNPEASTSVLMAVTHTLTQYPFDFRGARIL SGQEEGVFGWVTANYLLENFIKYGWVGRWFRPRKGTLGAMDLGGASTQITFETTSPAE DRASEVQLHLYGQHYRVYTHSFLCYGRDQVLQRLLASALQTHGFHPCWPRGFSTQVLL GDVYQSPCTMAQRPQNFNSSARVSLSGSSDPHLCRDLVSGLFSFSSCPFSRCSFNGVF QPPVAGNFVAFSAFFYTVDFLRTSMGLPVATLQQLEAAAVNVCNQTWAQQLLSRGYGF DERAFGGVIFQKKAADTAVGWALGYMLNLTNLIPADPPGLRKGTDFSSWVVLLLLFAS ALLAALVLLLRQVHSAKLPSTI" BASE COUNT 249 a 525 c 434 g 292 t ORIGIN 1 ctcccgcgcg cccgcccgcc catggccggg aaggtgcggt cactgctgcc gccgctgctg 61 ctggccgccg cgggcctcgc cggcctccta ctgctgtgcg tccccacccg cgacgtccgg 121 gagccgcccg ccctcaagta tggcatcgtc ctggatgctg gttcttcaca cacgtccatg 181 tttatctaca agtggccggc agacaaggag aacgacacag gcattgtggg ccagcacagc 241 tcctgtgatg ttccaggtgg gggcatctcc agctatgcag acaacccttc tggggccagc 301 cagagtcttg ttggatgcct cgaacaggcg cttcaggatg tgcccaaaga gagacacgcg 361 ggcacacccc tctacctggg agccacagcg ggtatgcgcc tgctcaacct gaccaatcca 421 gaggcctcga ccagtgtgct catggcagtg actcacacac tgacccagta cccctttgac 481 ttccggggtg cacgcatcct ctcgggccaa gaagaagggg tgtttggctg ggtgactgcc 541 aactacctgc tggagaactt catcaagtac ggctgggtgg gccggtggtt ccggccacgg 601 aaggggacac tgggggccat ggacctgggg ggtgcctcta cccagatcac ttttgagaca 661 accagtccag ctgaggacag agccagcgag gtccagctgc atctctacgg ccagcactac 721 cgagtctaca cccacagctt cctctgctat ggccgtgacc aggtcctcca gaggctgctg 781 gccagcgccc tccagaccca cggcttccac ccctgctggc cgaggggctt ttccacccaa 841 gtgctgctcg gggatgtgta ccagtcacca tgcaccatgg cccagcggcc ccagaacttc 901 aacagcagtg ccagggtcag cctgtcaggg agcagtgacc cccacctctg ccgagatctg 961 gtttctgggc tcttcagctt ctcctcctgc cccttctccc gatgctcttt caatggggtc 1021 ttccagcccc cagtggctgg gaactttgtg gccttctctg ccttcttcta cactgtggac 1081 tttttgcgga cttcgatggg gctgcccgtg gccaccctgc agcagctgga ggcagccgca 1141 gtgaatgtct gcaaccagac ctgggctcag cagctgctga gtcgcggcta cggcttcgat 1201 gagcgcgcct tcggcggcgt gatcttccag aagaaggccg cggacactgc agtgggctgg 1261 gcgctcggct acatgctgaa cctgaccaac ctgatccccg ccgacccgcc ggggctgcgc 1321 aagggcacag acttcagctc ctgggtcgtc ctcctgctgc tcttcgcctc cgcgctcctg 1381 gctgcgcttg tcctgctgct gcgtcaggtg cactccgcca agctgccaag caccatttag 1441 gggccgacgg gggcagctgc cccatccctc ccccaacccc tgtatcccca ccccgtactc // LOCUS HSU91616 2067 bp mRNA PRI 13-APR-1997 DEFINITION Human I kappa B epsilon (IkBe) mRNA, complete cds. ACCESSION U91616 NID g1934600 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2067) AUTHORS Whiteside,S.T., Epinat,J.C., Rice,N.R. and Israel,A. TITLE I kappa B epsilon, a novel member of the I kappa B family, controls RelA and cRel NF-kappa B activity JOURNAL EMBO J. 16 (6), 1413-1426 (1997) MEDLINE 97280829 REFERENCE 2 (bases 1 to 2067) AUTHORS Whiteside,S.T., Epinat,J.C., Rice,N.R. and Israel,A. TITLE Direct Submission JOURNAL Submitted (28-FEB-1997) Biologie Moleculaire de l'Expression Genique, Institut Pasteur, 25 rue du Dr Roux, Paris 75015, France FEATURES Location/Qualifiers source 1..2067 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="stG110; between D6S426 and D6S271" /dev_stage="fetus" /tissue_type="brain" gene 34..1536 /gene="IkBe" CDS 34..1536 /gene="IkBe" /function="cytoplasmic NF-kappa B inhibitor" /codon_start=1 /product="I kappa B epsilon" /db_xref="PID:g1934601" /translation="MNQRRSESRPGNHRLQAYAEPGKGDSGGAGPLSGSARRGRGGGG AIRVRRPCWSGGAGRGGGRPWAVRLPTVTAGWTWPALRTLSSLRAGPSEPHSPGRRRP RAGRPLCQADPQPGKAARRSLEPDPAQTGPRPARAAGMSEARKGPDEAEESQYDSGIE SLRSLRSLPESTSAPASGPSDGSPQPCTHPPGPVKEPQEKEDADGERADSTYGSSSLT YTLSLLGGPEAEDPPPRLPLPHVGALSPQQLEALTYISEDGDTLVHLAVIHEAPAVLL CCLALLPQEVLDIQNNLYQTALHLAVHLDQPGAVRALVLKGASRALQDRHGDTALHVA CQRQSWPVPAACWKGGPEPGRGTSHSLDLQLQNWQGLACLHIATLQKNQPLMELLLRN GADIDVQEGSSGKTALHLAVETQERGLVQFLLQAGAQVDARMLNGCTPLHLAAGRGLM GISSTLCKAGADSLLRNVEDETPQDLTEESLVLLPFDDLKISGKLLLCTD" misc_feature 103..117 /gene="IkBe" /note="encodes serine active site" misc_feature 451..453 /gene="IkBe" /note="encodes internal initiation of translation" misc_feature 490..501 /gene="IkBe" /note="encodes CK2 phosphorylation site" misc_feature 499..504 /gene="IkBe" /note="encodes CK2 phosphorylation site" misc_feature 502..504 /gene="IkBe" /note="encodes signal induced phosphorylation site" misc_feature 514..516 /gene="IkBe" /note="encodes signal induced phosphorylation site" misc_feature 532..543 /gene="IkBe" /note="encodes CK2 phosphorylation site" misc_feature 805..903 /gene="IkBe" /note="encodes ankyrin repeat" misc_feature 910..1008 /gene="IkBe" /note="encodes ankyrin repeat" misc_feature 1009..1107 /gene="IkBe" /note="encodes ankyrin repeat" misc_feature 1138..1236 /gene="IkBe" /note="encodes ankyrin repeat" misc_feature 1240..1338 /gene="IkBe" /note="encodes ankyrin repeat" misc_feature 1339..1437 /gene="IkBe" /note="encodes ankyrin repeat" misc_feature 1447..1458 /gene="IkBe" /note="encodes CK2 phosphorylation site" BASE COUNT 419 a 622 c 669 g 357 t ORIGIN 1 ggaattccaa tgaatgaatg aatgaatgag tgaatgaatc aacgaaggag tgagtcaagg 61 cccgggaacc acagactcca agcctacgca gagcccggga agggggattc cggaggggcg 121 gggcctcttt ccggaagcgc ccgccggggg cggggagggg gcggggccat ccgcgtgagg 181 cgaccctgtt ggtccggagg ggcggggcga ggaggaggac ggccttgggc ggttcggctg 241 cccacagtaa ccgctgggtg gacctggcca gcgctccgaa ccttgtcctc gctgcgcgcc 301 ggcccctcgg agccccacag cccgggaagg aggcggccgc gggcggggcg cccgctctgc 361 caagcggacc cgcaacccgg aaaggcggcg cggcggagcc tggagccgga tcctgctcag 421 accgggcccc ggccggccag agccgcgggc atgtcggagg cgcggaaggg gccggacgag 481 gcggaggaga gccagtacga ctctggcatt gagtctctgc gctctctgcg ctccctaccc 541 gagtccacct cggctccagc ctccgggccc tcggacggca gcccccagcc ctgcacccat 601 cctccgggac ccgtcaagga accacaggag aaggaagacg cggatgggga gcgggctgat 661 tccacctatg gctcctcctc gctcacctac accctgtcct tgctgggggg ccccgaggct 721 gaggacccgc ccccacgcct gccactcccc cacgtggggg cgctgagccc tcagcagctg 781 gaagcactca cttacatctc cgaggacgga gacacgctgg tccacctggc agtgattcat 841 gaggccccag cggtgctgct ctgttgcctg gctttgctgc cccaggaggt cctggacatt 901 caaaataacc tttaccagac agcactccat ctggctgtac atctggacca accgggcgca 961 gttcgggcac tggtgctgaa gggggccagc cgggcactac aggaccggca tggagacaca 1021 gcccttcatg tggcctgcca gcgccagtct tggcctgtgc ccgctgcctg ctggaagggc 1081 gggccagagc caggcagagg aacatctcac tctctggacc tccagctgca aaactggcaa 1141 ggtctggctt gtctccacat tgccaccctt cagaagaacc aaccactcat ggaattgctg 1201 cttcggaatg gagctgacat tgatgtgcag gagggctcca gtggtaagac agcgctgcac 1261 ctggctgtgg aaacccaaga gcggggcctg gtacagttcc tgctccaggc tggtgcccag 1321 gtagatgccc gcatgctgaa cgggtgcaca cccctgcacc tggcagctgg ccggggtctc 1381 atgggcatct catccactct gtgcaaggcg ggtgctgact ccctgctgcg gaatgtggag 1441 gatgagacgc cccaggacct gactgaggaa tcccttgtcc ttttgccctt tgatgacctg 1501 aagatctcag ggaaactgct gctgtgtacc gactgaagcc aggcagggtc tgggatcctc 1561 agggctccac ctctccatct ggaagccgga gccataactg ctgcagtttg ggcccaggct 1621 atgtgctctt ctggtgccct agggactgct gtggccagag cctggggcca gccagtacag 1681 tcctgagccg aggaggaggg actgcaagtg gaagagagcc agtctggaag gaagagcttt 1741 ccaggtggac agggcttctt ggaagacccc caaagcccca ggtatcctgg gtgaagcctg 1801 tttgcctctc ttgaaaatgg caggtgctct tgttttaccc atgttgggtc agcctgaaac 1861 tgccaaccag taggaagcat ggactctcct gagtgagaag agactgaaat aggagcaagc 1921 agaaccctga gaggtgtcca tcttcttgct gttgaggacc ctgaaacacc gttgtttaaa 1981 gacttcacac agaaggctct gaactgagcc actggggaag ggaagtttca gtaacatgac 2041 actaaaatgg cagagacgtt aaaaaaa // LOCUS HSU91618 756 bp mRNA PRI 27-MAR-1997 DEFINITION Human proneurotensin/proneuromedin N mRNA, complete cds. ACCESSION U91618 NID g1907392 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 756) AUTHORS Dong,Z., Wang,X., Townsend,C.M. Jr. and Evers,B.M. TITLE Molecular cloning and nucleotide sequence of the full-length cDNA coding for human neurotensin/neuromedin N JOURNAL Unpublished REFERENCE 2 (bases 1 to 756) AUTHORS Dong,Z., Wang,X., Townsend,C.M. Jr. and Evers,B.M. TITLE Direct Submission JOURNAL Submitted (28-FEB-1997) Surgery Department, The University of Texas Medical Branch, 301 University Boulevard, Galveston, TX 77555, USA FEATURES Location/Qualifiers source 1..756 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="carcinoid cell" /cell_line="BON" CDS 29..541 /codon_start=1 /product="proneurotensin/proneuromedin N" /db_xref="PID:g1907393" /translation="MMAGMKIQLVCMLLLAFSSWSLCSDSEEEMKALEADFLTNMHTS KISKAHVPSWKMTLLNVCSLVNNLNSPAEETGEVHEEELVARRKLPTALDGFSLEAML TIYQLHKICHSRAFQHWELIQEDILDTGNDKNGKEEVIKRKIPYILKRQLYENKPRRP YILKRDSYYY" BASE COUNT 267 a 125 c 148 g 216 t ORIGIN 1 cggacttggc ttgttagaag gctgaaagat gatggcagga atgaaaatcc agcttgtatg 61 catgctactc ctggctttca gctcctggag tctgtgctca gattcagaag aggaaatgaa 121 agcattagaa gcagatttct tgaccaatat gcatacatca aagattagta aagcacatgt 181 tccctcttgg aagatgactc tgctaaatgt ttgcagtctt gtaaataatt tgaacagccc 241 agctgaggaa acaggagaag ttcatgaaga ggagcttgtt gcaagaagga aacttcctac 301 tgctttagat ggctttagct tggaagcaat gttgacaata taccagctcc acaaaatctg 361 tcacagcagg gcttttcaac actgggagtt aatccaggaa gatattcttg atactggaaa 421 tgacaaaaat ggaaaggaag aagtcataaa gagaaaaatt ccttatattc tgaaacggca 481 gctgtatgag aataaaccca gaagacccta catactcaaa agagattctt actattactg 541 agagaataaa tcatttattt acatgtgatt gtgattcatc atcccttaat taaatatcaa 601 attatatttg tgtgaaaatg tgacaaacac acttatctgt ctcttctaca attgtggttt 661 attgaatgtg tttttctgca ctaatagaaa ttagactaag tgttttcaaa taaatctaaa 721 tcttcaaaaa aaaaaaaaaa aaatggggcc gcaatt // LOCUS HSU91641 1899 bp mRNA PRI 04-SEP-1997 DEFINITION Human alpha2,8-sialyltransferase mRNA, complete cds. ACCESSION U91641 NID g2353693 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1899) AUTHORS Kim,Y.J., Kim,K.S., Do,S., Kim,C.H., Kim,S.K. and Lee,Y.C. TITLE Molecular cloning and expression of human alpha2,8-sialyltransferase (hST8Sia V) JOURNAL Biochem. Biophys. Res. Commun. 235 (2), 327-330 (1997) MEDLINE 97342494 REFERENCE 2 (bases 1 to 1899) AUTHORS Kim,Y.-J., Kim,K.-S. and Lee,Y.-C. TITLE Direct Submission JOURNAL Submitted (28-FEB-1997) Division of Molecular Glycobiology, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Yusong, Taejon 305-600, South Korea FEATURES Location/Qualifiers source 1..1899 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" CDS 48..1178 /function="involved in ganglioside biosynthesis" /note="glycosyltransferase; ST8Sia V" /codon_start=1 /product="alpha2,8-sialyltransferase" /db_xref="PID:g2353694" /translation="MRYADPSPNRDLLGSRTLLFIFICAFALVTLLQQILYGRNYIKR YFEFYEGPFEYNSTRCLELRHEILEVKVLSMVKQSELFDRWKSLQMCKWAMNISEANQ FKSTLSRCCNAPAFLFTTQKNTPLGTKLKYEVDTSGIYHINQEIFRMFPKDMPYYRSQ FKKCAVVGNGGILKNSRCGREINSADFVFRCNLPPISEKYTMDVGVKTDVVTVNPSII TERFHKLEKWRRPFYRVLQVYENASVLLPAFYNTRNTDVSIRVKYVLDDFESPQAVYY FHPQYLVNVSRYWLSLGVRAKRISTGLILVTAALELCEEVHLFGFWAFPMNPSGLYIT HHYYDNVKPRPGGHAMPSEIFNFLHLHSRGILRVHTGTCSCC" BASE COUNT 414 a 578 c 502 g 405 t ORIGIN 1 attgcctggc cgccccgtac cccccccccg cagccccggt agccaggatg cgctacgcgg 61 acccctcgcc caaccgggat ttgttgggga gccgaacttt gctcttcatc ttcatctgcg 121 cctttgcctt ggtgaccttg ctgcaacaga tcctgtatgg caggaactac attaagaggt 181 actttgaatt ttatgagggc ccttttgaat ataactccac aagatgcctg gagctgaggc 241 acgaaatatt ggaagtgaag gtgctgtcca tggtgaagca gtcagagctg ttcgacaggt 301 ggaagagcct ccagatgtgc aaatgggcga tgaacatctc tgaggccaac cagttcaagt 361 ctactctgtc caggtgctgc aacgcccctg cctttctctt caccacccag aagaacactc 421 ccctggggac aaagctcaag tatgaggtgg acaccagtgg catctaccac atcaaccagg 481 agatcttccg catgtttccc aaggacatgc cctactaccg gtcccagttt aagaagtgtg 541 ctgtagtggg caacggaggc atcttgaaga acagccgctg cgggagggag atcaacagcg 601 ccgacttcgt cttccggtgc aacctgcccc ccatctcaga gaagtacacc atggatgtgg 661 gggtgaagac ggatgtggtc actgtgaacc ccagcatcat cacagagagg ttccacaagc 721 tggagaagtg gcggcggccg ttctatcgcg tgctgcaggt gtacgagaac gcgtcggtgc 781 tgctgcctgc cttctacaac acgcgcaaca ccgacgtgtc catccgcgtc aagtacgtgc 841 tggacgactt cgaatcgccg caagctgtct actacttcca tccgcagtac ctggtcaacg 901 tgtcgcgcta ctggctcagc ctgggggtgc gcgccaagcg catcagcacc ggcctcattc 961 tggtcactgc ggcgctggag ctctgtgagg aggtgcacct ctttggcttc tgggccttcc 1021 ccatgaaccc ctcgggcctg tacatcactc accactacta tgacaacgtc aagccgcgtc 1081 ccggcggcca cgccatgccc tctgagatct tcaacttcct gcacttgcac agccgaggca 1141 tcctccgcgt gcacacgggc acctgcagct gctgctgatg gctgccagcc aggctgcccg 1201 gcaagcggca ccccctctcc tgctgtccct cctggtggaa ctgggagccc cgaacccccc 1261 gaaccgggca gcgtggggtc ctggcgttca ggctgtctct ccctccattg ttcagctctg 1321 tacttggatc ctggggtagg gaggcagggt taggaaccgg ggcagagtca gctctgtgct 1381 gagcctcctg gcctggcccc caccggtgca tccgcccagt gcgtccacct gccctggcta 1441 gcatgctgct gggccacacc ccaagatcag gggccctggg gacggcaagt gaataaagca 1501 catttccacc caattttgtc atccgagaga gagcacaaac tgcaggccct tgttgcagct 1561 gaaggaagac cctacagaga atggaattga gagtggagac attacacttc acaggacact 1621 tgagcacagt caagatttta ttcacacttt tggttcctgt gttttatatc gaagacttag 1681 ctatgaattg ctatctaaat ctcagagctt agaagccaac ccagtgacta gaccttccag 1741 tgaagaaagt gatctcaaga ggtccaggga cttaacagcc aagccacacc acccgacagg 1801 ttcttctgtg acaccgagag atctaaccca aggcctgggt tatgtcttag tcgagacatc 1861 atcatctgag aagcaccaca ccccttttct cccggaatt // LOCUS HSU91939 1400 bp DNA PRI 24-MAR-1997 DEFINITION Human putative G protein-coupled receptor (GPR25) gene, complete cds. ACCESSION U91939 NID g1905877 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1400) AUTHORS Jung,B.P., Nguyen,T., Kolakowski,L.F. Jr., Lynch,K.R., Heng,H.H., George,S.R. and O'Dowd,B.F. TITLE Discovery of a novel human G protein-coupled receptor gene (GPR25) located on chromosome 1 JOURNAL Biochem. Biophys. Res. Commun. 230 (1), 69-72 (1997) MEDLINE 97148573 REFERENCE 2 (bases 1 to 1400) AUTHORS Jung,B.P., Nguyen,T., Kolakowski,L.F. Jr., Lynch,K.R., Heng,H.H.Q., George,S.R. and O'Dowd,B.F. TITLE Direct Submission JOURNAL Submitted (26-FEB-1997) Pharmacology, University of Toronto, 8 Taddle Creek Rd., Toronto, ON M5S 1A8, Canada FEATURES Location/Qualifiers source 1..1400 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="1" /map="1q32.1" gene 80..1162 /gene="GPR25" CDS 80..1162 /gene="GPR25" /codon_start=1 /product="putative G protein-coupled receptor" /db_xref="PID:g1905878" /translation="MAPTEPWSPSPGSAPWDYSGLDGLEELELCPAGDLPYGYVYIPA LYLAAFAVGLLGNAFVVWLLAGRRGPRRLVDTFVLHLAAADLGFVLTLPLWAAAAARR PWPFGDGLCKLSTFALAGTRSAGALLLAGMSVDRYLAVVKLLEARPLRTPRCAVASCC GVWAVALLAGLPSLVYRGLQPLPGGQDSQCGEEPSHAFQGLSLLLLLLTFVLPLVVTL FCYCRISRRLRRPPHVGRARRNSLRIIFAIESTFVGSWLPFSALRAVFHLARLGALPL PCPLLLALRWGLTIATCLAFVNSCANPLIYLLLDRSFRARALDGACGRTGRLARRISS ASSLSRDDSSVFRCRAQAANTASASW" BASE COUNT 154 a 524 c 474 g 248 t ORIGIN 1 tgaagagcaa accccctcct gctcagagct cgtgccgcct gccccagggc tcgactccgc 61 gcaggcctca tagccagcca tggcccccac agagccctgg agccccagcc cggggtcagc 121 gccctgggac tactcggggt tggacggcct ggaggagctg gagctgtgtc cggccgggga 181 cctgccctac ggctacgtct acatccccgc gctctacctg gcggccttcg ccgtgggcct 241 gctgggcaac gcctttgtgg tgtggctgct ggccgggcgg cggggcccgc ggcggctggt 301 ggataccttc gtgctgcacc tggcggcagc tgacctgggc ttcgtgctca cgctgccgct 361 gtgggccgcg gcggcggcta ggcggccgtg gccgttcggc gatggcctct gcaagctcag 421 cacgttcgcg ctggcgggca cgcgctcggc gggcgcgctg ctgctggcgg gcatgagcgt 481 ggaccgctac ctggccgtgg tgaagctgct cgaggcgagg ccactgcgca ccccgcgctg 541 cgccgtggcc tcgtgctgcg gcgtctgggc cgtggcgctg ctggccggcc tgccctccct 601 ggtctaccgg gggttgcagc ccctgcctgg gggccaggac agccagtgcg gcgaggagcc 661 ctcccacgcc ttccagggcc tcagcttgct gctgctgctg ctgaccttcg tgctgcccct 721 ggtcgtcacc ctcttctgct actgccgcat ctcgcgccgc ctgcgacggc cgccgcacgt 781 gggtcgggcc cggaggaact cgctgcgcat catcttcgcc atcgagagca cgtttgtggg 841 ctcctggctg cccttcagcg ccctgcgggc cgtcttccac ctggcgcgtc tgggggcgct 901 gccgctgccg tgccccctgc tgctggcgct gcgctggggc ctcaccattg ccacctgcct 961 ggccttcgtc aacagctgcg ccaacccgct catctacctc ctgctggacc gctcattccg 1021 agcccgggcg ctggacgggg cctgcgggcg caccggccgc ctggcgcgaa ggatcagctc 1081 agcctcctcg ctctccaggg acgacagttc cgtgttccgt tgccgggccc aggccgcgaa 1141 cactgcctcg gcctcctggt agctgccccg ggccgctgga ggtgggcggc agcggagcat 1201 cgagaggagg ccagatgtcc cggaggggac tgagctcccc agacgcgcct gttctggcgg 1261 cagcaagctg ctcgggccgg catcgcattt cctcgcgcgc tgcctggact cccaaggcct 1321 cctccatcgg tttccccgga acctcagaac aattgaactc ccctaaacca ggctcctgtg 1381 actagctgtt ccctctcagc // LOCUS HSU91963 3919 bp mRNA PRI 01-JAN-1998 DEFINITION Human tolloid-like protein (TLL) mRNA, complete cds. ACCESSION U91963 NID g2735326 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3919) AUTHORS Greenspan,D.S. and Takahara,K. TITLE Sequence of human mammalian tolloid-like (mTll) and chromosomal localization of the cognate gene TLL JOURNAL Unpublished REFERENCE 2 (bases 1 to 3919) AUTHORS Greenspan,D.S. and Takahara,K. TITLE Direct Submission JOURNAL Submitted (04-MAR-1997) Pathology, University of Wisconsin, 1300 University Avenue, Madison, WI 53706, USA FEATURES Location/Qualifiers source 1..3919 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="4" /map="4q32-q33" gene 648..3689 /gene="TLL" CDS 648..3689 /gene="TLL" /note="mTll; metalloproteinase of the astacin family" /codon_start=1 /product="tolloid-like protein" /db_xref="PID:g2735327" /translation="MGLGTLSPRMLVWLVASGIVFYGELWVCAGLDYDYTFDGNEEDK TETIDYKDPCKAAVFWGDIALDDEDLNIFQIDRTIDLTQNPFGNLGHTTGGLGDHAMS KKRGALYQLIDRIRRIGFGLEQNNTVKGKVPLQFSGQNEKNRVPRAATSRTERIWPGG VIPYVIGGNFTGSQRAMFKQAMRHWEKHTCVTFIERSDEESYIVFTYRPCGCCSYVGR RGNGPQAISIGKNCDKFGIVVHELGHVIGFWHEHTRPDRDNHVTIIRENIQPGQEYNF LKMEPGEVNSLGERYDFDSIMHYARNTFSRGMFLDTILPSRDDNGIRPAIGQRTRLSK GDIAQARKLYRCPACGETLQESNGNLSSPGFPNGYPSYTHCIWRVSVTPGEKIVLNFT TMDLYKSSLCWYDYIEVRDGYWRKSPLLGRFCGDKLPEVLTSTDSRMWIEFRSSSNWV GKGFAAVYEAICGGEIRKNEGQIQSPNYPDDYRPMKECVWKITVSESYHVGLTFQSFE IERHDNCAYDYLEVRDGTSENSPLIGRFCGYDKPEDIRSTSNTLWMKFVSDGTVNKAG FAANFFKEEDECAKPDRGGCEQRCLNTLGSYQCACEPGYELGPDRRSCEAACGGLLTK LNGTITTPGWPKEYPPNKNCVWQVVAPTQYRISVKFEFFELEGNEVCKYDYVEIWSGL SSESKLHGKFCGAEVPEVITSQFNNMRIEFKSDNTVSKKGFKAHFFSDKDECSKDNGG CQHECVNTMGSYMCQCRNGFVLHDNKHDCKEAECEQKIHSPSGLITSPNWPDKYPSRK ECTWEISATPGHRIKLAFSEFEIEQHQECAYDHLEVFDGETEKSPILGRLCGNKIPDP LVATGNKMFVRFVSDASVQRKGFQATHSTECGGRLKAESKPRDLYSHAQFGDNNYPGQ VDCEWLLVSERGSRLELSFQTFEVEEEADCGYDYVELFDGLDSTAVGLGRFCGSGPPE EIYSIGDSVLIHFHTDDTINKKGFHIRYKSIRYPDTTHTKK" BASE COUNT 1089 a 871 c 1001 g 958 t ORIGIN 1 ctcacacttt tgctctcttg cagtcagttg ctttgctggc ttctgcaggc ttttaaggtc 61 tcgcggcgta gaaatgcctg gcccccaccc ccttcctcgg tctccccttt caattcagat 121 gtgctgatgt gcagaccgga ttcatcttct cggagctgcg gcggcggctt tgggctcagg 181 cggcggcggc tcgcgctcgg ccgcggagtc ctggcagcag cggggacgcg gcgcgggagt 241 ccgagctctg gtggcagctg agcccgcggg gcgccgctcg ccgagccgcg gccgcgggaa 301 gttcggcagc cagaaggacg acctggcagg ctgcgagcgc cagcgccgcc agagccgagt 361 ttgcctgcgc cctccccgcc tccgagtgca gagttcctta cctgccctcc gcccacccgt 421 gggcccctag ccaacttctc cctgcgactg ggggtaacag gcagtgcttg ccctctctac 481 tgtcccggcg gcatccacat gtttccggac acctgagcac cccggtcccg ccgaggagcc 541 tccgggtggg gagaagagca ccggtgcccc tagccccgca catcagcgcg gaccgcggct 601 gcctaacctc tgggtcccgt cccctccttt tcctccgggg gaggaggatg gggttgggaa 661 cgctttcccc gaggatgctc gtgtggctgg tggcctcggg gattgttttc tacggggagc 721 tatgggtctg cgctggcctc gattatgatt acacttttga tgggaacgaa gaggataaaa 781 cagagactat agattacaag gacccgtgta aagccgctgt attttggggc gatattgcct 841 tagatgatga agacttaaat atctttcaaa tagataggac aattgacctt acgcagaacc 901 cctttggaaa ccttggacat accacaggtg gacttggaga ccatgctatg tcaaagaagc 961 gaggggccct ctaccaactt atagacagga taagaagaat tggctttggc ttggagcaaa 1021 acaacacagt taagggaaaa gtacctctac aattctcagg gcaaaatgag aaaaatcgag 1081 ttcccagagc cgctacatca agaacggaaa gaatatggcc tggaggcgtt attccttatg 1141 ttataggagg aaacttcact ggcagccaga gagccatgtt caagcaggcc atgaggcact 1201 gggaaaagca cacatgtgtg actttcatag aaagaagtga tgaagagagt tacattgtat 1261 tcacctatag gccttgtgga tgctgctcct atgtaggtcg gcgaggaaat ggacctcagg 1321 caatctctat cggcaagaac tgtgataaat ttgggattgt tgttcatgaa ttgggtcatg 1381 tgataggctt ttggcatgaa cacacaagac cagatcgaga taaccacgta actatcataa 1441 gagaaaacat ccagccaggt caagagtaca attttctgaa gatggagcct ggagaagtaa 1501 actcacttgg agaaagatat gatttcgaca gtatcatgca ctatgccagg aacaccttct 1561 caagggggat gtttctggat accattctcc cctcccgtga tgataatggc atacgtcctg 1621 caattggtca gcgaacccgt ctaagcaaag gagatatcgc acaggcaaga aagctgtata 1681 gatgtccagc atgtggagaa actctacaag aatccaatgg caacctttcc tctccaggat 1741 ttcccaatgg ctacccttct tacacacact gcatctggag agtttctgtg accccagggg 1801 agaagattgt tttaaatttt acaacgatgg atctatacaa gagtagtttg tgctggtatg 1861 actatattga agtaagagac gggtactgga gaaaatcacc tctccttggt agattctgtg 1921 gggacaaatt gcctgaagtt cttacttcta cagacagcag aatgtggatt gagtttcgta 1981 gcagcagtaa ttgggtagga aaaggctttg cagctgtcta tgaagcgatc tgtggaggtg 2041 agatacgtaa aaatgaagga cagattcagt ctcccaatta tcctgatgac tatcgcccga 2101 tgaaagaatg tgtgtggaaa ataacagtgt ctgagagcta ccacgtcggg ctgacctttc 2161 agtcctttga gattgaaaga catgacaatt gtgcttatga ctacctggaa gttagagatg 2221 gaaccagtga aaatagccct ttgatagggc gtttctgtgg ttatgacaaa cctgaagaca 2281 taagatctac ctccaatact ttgtggatga agtttgtttc tgacggaact gtgaacaaag 2341 cagggtttgc tgctaacttt tttaaagagg aagatgagtg tgccaaacct gaccgtggag 2401 gctgtgagca gcgatgtctg aacactctgg gcagttacca gtgtgcctgt gagcctggct 2461 atgagctggg cccagacaga aggagctgtg aagctgcttg tggtggactt cttaccaaac 2521 ttaacggcac cataaccacc cctggctggc ccaaggagta ccctcctaat aagaactgtg 2581 tgtggcaagt ggttgcacca acccagtaca gaatttctgt gaagtttgag ttttttgaat 2641 tggaaggcaa tgaagtttgc aaatatgatt atgtggagat ctggagtggt ctttcctctg 2701 agtctaaact gcatggcaaa ttctgtggcg ctgaagtgcc tgaagtgatc acatcccagt 2761 tcaacaatat gagaattgaa ttcaaatctg acaatactgt atccaagaag ggcttcaaag 2821 cacatttttt ctcagacaaa gatgaatgct ctaaggataa tggtggatgt cagcacgaat 2881 gtgtcaacac gatggggagc tacatgtgtc aatgccgtaa tggatttgtg ctacatgaca 2941 ataaacatga ttgcaaggaa gctgagtgtg aacagaagat ccacagtcca agtggcctca 3001 tcaccagtcc caactggcca gacaagtacc caagcaggaa agaatgcact tgggaaatca 3061 gcgccactcc cggccaccga atcaaattag cctttagtga atttgagatt gagcagcatc 3121 aagaatgtgc ttatgaccac ttagaagtat ttgatggaga aacagaaaag tcaccgattc 3181 ttggacgact atgtggcaac aagataccag atccccttgt ggctactgga aataaaatgt 3241 ttgttcggtt tgtttctgat gcatctgttc aaagaaaagg ctttcaagcc acacattcta 3301 cagagtgtgg cggacgattg aaagcagaat caaaaccaag agatctgtac tcacatgctc 3361 agtttggtga taacaactac ccaggacagg ttgactgtga atggctatta gtatcagaac 3421 ggggctctcg acttgaatta tccttccaga catttgaagt ggaggaagaa gcagactgtg 3481 gctatgacta tgtggagctc tttgatggtc ttgattcaac agctgtgggg cttggtcgat 3541 tctgtggatc cgggccacca gaagagattt attcaattgg agattcagtt ttaattcatt 3601 tccacactga tgacacaatc aacaagaagg gatttcatat aagatacaaa agcataagat 3661 atccagatac cacacatacc aaaaaataac accaaaacct ctgtcagaac acaaaggaat 3721 gtgcataatg gagagaagac atattttttt taaaactgaa gatattggca caaatgtttt 3781 atacaaagag tttgaacaaa aaatccctgt aagaccagaa ttatctttgt actaaaagag 3841 aagtttccag caaaaccctc atcagcatta caaggatatt tgaactccat gcttgatggt 3901 attaataaag ctggtgaaa // LOCUS HSU91985 1633 bp mRNA PRI 03-MAY-1997 DEFINITION Human DNA fragmentation factor-45 mRNA, complete cds. ACCESSION U91985 NID g2065560 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1633) AUTHORS Liu,X., Zou,H., Slaughter,C. and Wang,X. TITLE DFF, a heterodimeric protein that functions downstream of caspase-3 to trigger DNA fragmentation during apoptosis JOURNAL Cell 89 (2), 175-184 (1997) MEDLINE 97262059 REFERENCE 2 (bases 1 to 1633) AUTHORS Liu,X., Zou,H., Slaughter,C. and Wang,X. TITLE Direct Submission JOURNAL Submitted (04-MAR-1997) Biochemistry, University of Texas Southwestern Medical Center at Dallas, 5323 Harry Hines Blvd., Dallas, TX 75235, USA FEATURES Location/Qualifiers source 1..1633 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa S3" CDS 57..1052 /function="triggers DNA fragmentation during apoptosis" /note="DFF-45" /codon_start=1 /product="DNA fragmentation factor-45" /db_xref="PID:g2065561" /translation="MEVTGDAGVPESGEIRTLKPCLLRRNYSREQHGVAASCLEDLRS KACDILAIDKSLTPVTLVLAEDGTIVDDDDYFLCLPSNTKFVALASNEKWAYNNSDGG TAWISQESFDVDETDSGAGLKWKNVARQLKEDLSSIILLSEEDLQMLVDAPCSDLAQE LRQSCATVQRLQHTLQQVLDQREEVRQSKQLLQLYLQALEKEGSLLSKQEESKAAFGE EVDAVDTGISRETSSDVALASHILTALREKQAPELSLSSQDLELVTKEDPKALAVALN WDIKKTETVQEACERELALRLQQTQSLHSLRSISASKASPPGDLQNPKRARQDPT" BASE COUNT 413 a 432 c 451 g 337 t ORIGIN 1 cgccgctccg gcctcccgcg acttctcgaa ggtgggcagg tcccaccttg tggaggatgg 61 aggtgaccgg ggacgccggg gtaccagaat ctggcgagat ccggactcta aagccgtgtc 121 tgctgcgccg caactacagc cgcgaacagc acggcgtggc cgcctcctgc ctcgaagacc 181 tgaggagcaa ggcctgtgac attctggcca ttgataagtc cctgacacca gtcacccttg 241 tcctggcaga ggatggcacc atagtggatg atgacgatta ctttctgtgt ctaccttcca 301 atactaagtt tgtggcattg gctagtaatg agaaatgggc atacaacaat tcagatggag 361 gtacagcttg gatttcccaa gagtcctttg atgtagatga aacagacagc ggggcagggt 421 tgaagtggaa gaatgtggcc aggcagctga aagaagatct gtccagcatc atcctcctat 481 cagaggagga cctccagatg cttgttgacg ctccctgctc agacctggct caggaactac 541 gtcagagttg tgccaccgtc cagcggctgc agcacacact ccaacaggtg cttgaccaaa 601 gagaggaagt gcgtcagtcc aagcagctcc tgcagctgta cctccaggct ttggagaaag 661 agggcagcct cttgtcaaag caggaagagt ccaaagctgc ctttggtgag gaggtggatg 721 cagtagacac gggtatcagc agagagacct cctcggacgt tgcgctggcg agccacatcc 781 ttactgcact gagggagaag caggctccag agctgagctt atctagtcag gatttggagt 841 tggttaccaa ggaagacccc aaagcactgg ctgttgcctt gaactgggac ataaagaaga 901 cggagactgt tcaggaggcc tgtgagcggg agctcgccct gcgcctgcag cagacgcaga 961 gcttgcattc tctccggagc atctcagcaa gcaaggcctc accacctggt gacctgcaga 1021 atcctaagcg agccagacag gatcccacat agcagcagcg ggaagtgtgc caaggaagct 1081 ctgtggcgtt gtgttattgg tagacaccct cagcctcatc atttgactac ctatgtacta 1141 ctctaccccc tgccttagag caccttccag agaagctatt ccaggtctca acatacgccg 1201 ttccaccaat ttttttttta gccccaccag cttcaggact tctgccaatt ttgaatgata 1261 tagctgcacc aacaatatcc cgcctcctct aattacatat gatgttctct gttcaaaagt 1321 aattggcagt gattggccag gcgcagtggc tcacgcctgt aatcccagca ctgggaggcc 1381 gaggggggcg gatcgtgaag tcaggagatc gagaccatcc tggctaacat ggtgaaaccc 1441 tgtctctact aaaaatacaa aaaaaattag ccagccatgg tggcgggcgc ctgtaatccc 1501 agctacttgg gaggctgagg caggagaatg gcatgaacct gggaggcaga gcttgcagtg 1561 agctgagatt gcgccactgc actccagcct gggcaacaga gcgagactcc gtctcaaaaa 1621 aaaaaaaaaa aaa // LOCUS HSU92314 1477 bp mRNA PRI 01-AUG-1997 DEFINITION Homo sapiens hydroxysteroid sulfotransferase SULT2B1a (HSST2) mRNA, complete cds. ACCESSION U92314 NID g1923290 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1477) AUTHORS Her,C., Wood,T.C., Eichler,E., Mohrenweiser,H.W., Siciliano,M.J., Ramagli,L.S. and Weinshilboum,R.M. TITLE Human Placental Hydroxysteroid Sulfotransferase SULT2B1: Two novel enzymes encoded by a single chromosome 19 gene JOURNAL Unpublished REFERENCE 2 (bases 1 to 1477) AUTHORS Her,C. and Weinshilboum,R.M. TITLE Direct Submission JOURNAL Submitted (05-MAR-1997) Pharmacology, Mayo Clinic/Mayo Foundation, 200 1st Street, SW, Rochester, MN 55905, USA FEATURES Location/Qualifiers source 1..1477 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="19" /clone_lib="IMAGE CloneID 141495" /tissue_type="placenta" /map="19q13.3" gene 1..1477 /gene="HSST2" CDS 376..1428 /gene="HSST2" /codon_start=1 /product="hydroxysteroid sulfotransferase SULT2B1a" /db_xref="PID:g1923291" /translation="MASPPPFHSQKLPGEYFRYKGVPFPVGLYSLESISLAENTQDVR DDDIFIITYPKSGTTWMIEIICLILKEGDPSWIRSVPIWERAPWCETIVGAFSLPDQY SPRLMSSHLPIQIFTKAFFSSKAKVIYMGRNPRDVVVSLYHYSKIAGQLKDPGTPDQF LRDFLKGEVQFGSWFDHIKGWLRMKGKDNFLFITYEELQQDLQGSVERICGFLGRPLG KEALGSVVAHSTFSAMKANTMSNYTLLPPSLLDHRRGAFLRKGVCGDWKNHFTVAQSE AFDRAYRKQMRGMPTFPWDEDPEEDGSPDPEPSPEPEPKPSLEPNTSLEREPRPNSSP NPSPGQASETPHPRPS" BASE COUNT 326 a 484 c 380 g 287 t ORIGIN 1 ccgtgatctc ggctcactgc aacctccgcc tcctgggttc aagcgattct cctgcctcag 61 cctccggagt aactgggagt acaggcatgc gccaccacgc ttggctgatt tttgtctttt 121 tagtaggggc ggggtttcac catgttggcc aggctggtct caaactcctg acctcaggtg 181 atccacccac ctctgtctcc caaagtgctg ggattacagg agtgtgccac tgcgcctgac 241 cagctttata aagtttatag ggacagtgtc accactttac agaagaggga ctgaggctct 301 gaggaggaag ttccttgcca gggtccgagt gtcgccaccc tgagaactcc agcacccacc 361 tccctactct ccctcatggc gtctccccca cctttccaca gccagaagtt gccaggtgaa 421 tacttccggt acaagggcgt ccccttcccc gtcggcctgt actcgctcga gagcatcagc 481 ttggcggaga acacccaaga tgtgcgggac gacgacatct ttatcatcac ctaccccaag 541 tcaggcacga cctggatgat cgagatcatc tgcttaatcc tgaaggaagg ggatccatcc 601 tggatccgct ccgtgcccat ctgggagcgg gcaccctggt gtgagaccat tgtgggtgct 661 ttcagcctcc cggaccagta cagcccccgc ctcatgagct cccatcttcc catccagatc 721 ttcaccaagg ccttcttcag ctccaaggcc aaggtgatct acatgggccg caacccccgg 781 gacgttgtgg tctccctcta tcattactcc aagatcgccg ggcagttaaa ggacccgggc 841 acacccgacc agttcctgag ggacttcctc aaaggcgaag tgcagtttgg ctcctggttc 901 gaccacatta agggctggct tcggatgaag ggcaaagaca acttcctatt tatcacctac 961 gaggagctgc agcaggactt acagggctcc gtggagcgca tctgtgggtt cctgggccgt 1021 ccgctgggca aggaggcact gggctccgtc gtggcacact caaccttcag cgccatgaag 1081 gccaacacca tgtccaacta cacgctgctg cctcccagcc tgctggacca ccgtcgcggg 1141 gccttcctcc ggaaaggggt ctgcggcgac tggaagaacc acttcacggt ggcccagagc 1201 gaagccttcg atcgtgccta ccgcaagcag atgcggggga tgccgacctt cccctgggat 1261 gaagacccgg aggaggatgg cagcccagat cctgagccca gccctgagcc tgagcccaag 1321 cccagccttg agcccaacac cagcctggag cgtgagccca gacccaactc cagccccaac 1381 cccagccccg gccaggcctc tgagaccccg cacccacgac cctcataata aacacgtcga 1441 ttctgtctaa aaaaaaaaaa aaaaaaaaaa aaaaaaa // LOCUS HSU92436 3160 bp mRNA PRI 01-APR-1997 DEFINITION Human mutated in multiple advanced cancers protein (MMAC1) mRNA, complete cds. ACCESSION U92436 NID g1916327 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3160) AUTHORS Steck,P.A., Pershouse,M.A., Jasser,S.A., Lin,H., Yung,W.K.A., Ligon,A.H., Langford,L.A., Baumgard,M.L., Hattier,T., Davis,T., Frye,C., Hu,R., Swedlund,B., Teng,D.H.F. and Tavtigian,S.V. TITLE Identification of a candidate tumour suppressor gene, MMAC1, at chromosome 10q23.3 that is mutated in multiple advanced cancers JOURNAL Nature Genet. 15 (4), 356-362 (1997) MEDLINE 97245711 REFERENCE 2 (bases 1 to 3160) AUTHORS Steck,P.A., Pershouse,M.A., Jasser,S.A., Lin,H., Yung,W.K.A., Ligon,A.H., Langford,L.A., Baumgard,M.L., Hattier,T., Davis,T., Frye,C., Hu,R., Swedlund,B., Teng,D.H.F. and Tavtigian,S.V. TITLE Direct Submission JOURNAL Submitted (07-MAR-1997) Research, Myriad Genetics Inc., 190 Wakara Way, Salt Lake City, UT 84108, USA REMARK Collaboration between the Departments of Neuro-Oncology and Pathology at the Brain Tumor Center University of Texas M.D. Anderson Cancer Center and Myriad Genetics Inc. FEATURES Location/Qualifiers source 1..3160 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="10" /map="10q23.3" gene 1035..2246 /note="mutated in multiple advanced cancers" /gene="MMAC1" CDS 1035..2246 /gene="MMAC1" /codon_start=1 /product="MMAC1" /db_xref="PID:g1916328" /translation="MTAIIKEIVSRNKRRYQEDGFDLDLTYIYPNIIAMGFPAERLEG VYRNNIDDVVRFLDSKHKNHYKIYNLCAERHYDTAKFNCRVAQYPFEDHNPPQLELIK PFCEDLDQWLSEDDNHVAAIHCKAGKGRTGVMICAYLLHRGKFLKAQEALDFYGEVRT RDKKGVTIPSQRRYVYYYSYLLKNHLDYRPVALLFHKMMFETIPMFSGGTCNPQFVVC QLKVKIYSSNSGPTRREDKFMYFEFPQPLPVCGDIKVEFFHKQNKMLKKDKMFHFWVN TFFIPGPEETSEKVENGSLCDQEIDSICSIERADNDKEYLVLTLTKNDLDKANKDKAN RYFSPNFKVKLYFTKTVEEPSNPEASSSTSVTPDVSDNEPDHYRYSDTTDSDPENEPF DEDQHTQITKV" BASE COUNT 853 a 742 c 788 g 777 t ORIGIN 1 cctcccctcg cccggcgcgg tcccgtccgc ctctcgctcg cctcccgcct cccctcggtc 61 ttccgaggcg cccgggctcc cggcgcggcg gcggaggggg cgggcaggcc ggcgggcggt 121 gatgtggcag gactctttat gcgctgcggc aggatacgcg ctcggcgctg ggacgcgact 181 gcgctcagtt ctctcctctc ggaagctgca gccatgatgg aagtttgaga gttgagccgc 241 tgtgaggcga ggccgggctc aggcgaggga gatgagagac ggcggcggcc gcggcccgga 301 gcccctctca gcgcctgtga gcagccgcgg gggcagcgcc ctcggggagc cggccggcct 361 gcggcggcgg cagcggcggc gtttctcgcc tcctcttcgt cttttctaac cgtgcagcct 421 cttcctcggc ttctcctgaa agggaaggtg gaagccgtgg gctcgggcgg gagccggctg 481 aggcgcggcg gcggcggcgg cggcacctcc cgctcctgga gcggggggga gaagcggcgg 541 cggcggcggc cgcggcggct gcagctccag ggagggggtc tgagtcgcct gtcaccattt 601 ccagggctgg gaacgccgga gagttggtct ctccccttct actgcctcca acacggcggc 661 ggcggcggcg gcacatccag ggacccgggc cggttttaaa cctcccgtcc gccgccgccg 721 caccccccgt ggcccgggct ccggaggccg ccggcggagg cagccgttcg gaggattatt 781 cgtcttctcc ccattccgct gccgccgctg ccaggcctct ggctgctgag gagaagcagg 841 cccagtcgct gcaaccatcc agcagccgcc gcagcagcca ttacccggct gcggtccaga 901 gccaagcggc ggcagagcga ggggcatcag ctaccgccaa gtccagagcc atttccatcc 961 tgcagaagaa gccccgccac cagcagcttc tgccatctct ctcctccttt ttcttcagcc 1021 acaggctccc agacatgaca gccatcatca aagagatcgt tagcagaaac aaaaggagat 1081 atcaagagga tggattcgac ttagacttga cctatattta tccaaacatt attgctatgg 1141 gatttcctgc agaaagactt gaaggcgtat acaggaacaa tattgatgat gtagtaaggt 1201 ttttggattc aaagcataaa aaccattaca agatatacaa tctttgtgct gaaagacatt 1261 atgacaccgc caaatttaat tgcagagttg cacaatatcc ttttgaagac cataacccac 1321 cacagctaga acttatcaaa cccttttgtg aagatcttga ccaatggcta agtgaagatg 1381 acaatcatgt tgcagcaatt cactgtaaag ctggaaaggg acgaactggt gtaatgatat 1441 gtgcatattt attacatcgg ggcaaatttt taaaggcaca agaggcccta gatttctatg 1501 gggaagtaag gaccagagac aaaaagggag taactattcc cagtcagagg cgctatgtgt 1561 attattatag ctacctgtta aagaatcatc tggattatag accagtggca ctgttgtttc 1621 acaagatgat gtttgaaact attccaatgt tcagtggcgg aacttgcaat cctcagtttg 1681 tggtctgcca gctaaaggtg aagatatatt cctccaattc aggacccaca cgacgggaag 1741 acaagttcat gtactttgag ttccctcagc cgttacctgt gtgtggtgat atcaaagtag 1801 agttcttcca caaacagaac aagatgctaa aaaaggacaa aatgtttcac ttttgggtaa 1861 atacattctt cataccagga ccagaggaaa cctcagaaaa agtagaaaat ggaagtctat 1921 gtgatcaaga aatcgatagc atttgcagta tagagcgtgc agataatgac aaggaatatc 1981 tagtacttac tttaacaaaa aatgatcttg acaaagcaaa taaagacaaa gccaaccgat 2041 acttttctcc aaattttaag gtgaagctgt acttcacaaa aacagtagag gagccgtcaa 2101 atccagaggc tagcagttca acttctgtaa caccagatgt tagtgacaat gaacctgatc 2161 attatagata ttctgacacc actgactctg atccagagaa tgaacctttt gatgaagatc 2221 agcatacaca aattacaaaa gtctgaattt ttttttatca agagggataa aacaccatga 2281 aaataaactt gaataaactg aaaatggacc tttttttttt taatggcaat aggacattgt 2341 gtcagattac cagttatagg aacaattctc ttttcctgac caatcttgtt ttaccctata 2401 catccacagg gttttgacac ttgttgtcca gttgaaaaaa ggttgtgtag ctgtgtcatg 2461 tatatacctt tttgtgtcaa aaggacattt aaaattcaat taggattaat aaagatggca 2521 ctttcccgtt ttattccagt tttataaaaa gtggagacag actgatgtgt atacgtagga 2581 attttttcct tttgtgttct gtcaccaact gaagtggcta aagagctttg tgatatactg 2641 gttcacatcc tacccctttg cacttgtggc aacagataag tttgcagttg gctaagagag 2701 gtttccgaaa ggttttgcta ccattctaat gcatgtattc gggttagggc aatggagggg 2761 aatgctcaga aaggaaataa ttttatgctg gactctggac catataccat ctccagctat 2821 ttacacacac ctttctttag catgctacag ttattaatct ggacattcga ggaattggcc 2881 gctgtcactg cttgttgttt gcgcattttt ttttaaagca tattggtgct agaaaaggca 2941 gctaaaggaa gtgaatctgt attggggtac aggaatgaac cttctgcaac atcttaagat 3001 ccacaaatga agggatataa aaataatgtc ataggtaaga aacacagcaa caatgactta 3061 accatataaa tgtggaggct atcaacaaag aatgggcttg aaacattata aaaattgaca 3121 atgatttatt aaatatgttt tctcaattgt aaaaaaaaaa // LOCUS HSU92538 1901 bp mRNA PRI 03-JAN-1998 DEFINITION Homo sapiens origin recognition complex subunit 5 homolog (Orc5) mRNA, complete cds. ACCESSION U92538 NID g2739445 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1901) AUTHORS Ishiai,M., Dean,F.B., Okumura,K., Abe,M., Moon,K.-Y., Amin,A.A., Kagotani,K., Taguchi,H., Murakami,Y., Hanaoka,F., O'Donnell,M., Hurwitz,J. and Eki,T. TITLE Isolation of human and fission yeast homologues of the budding yeast origin recognition complex subunit ORC5: human homologue (ORC5L) maps to 7q22 JOURNAL Genomics 46 (2), 294-298 (1997) MEDLINE 98086489 REFERENCE 2 (bases 1 to 1901) AUTHORS Ishiai,M. and Hurwitz,J. TITLE Direct Submission JOURNAL Submitted (07-MAR-1997) Molecular Biology, Memorial Sloan-Kettering Cancer Center, 1275 York Avenue, Box97, New York, NY 10021, USA FEATURES Location/Qualifiers source 1..1901 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" /map="7q22" /cell_line="HeLa" gene 1..1901 /gene="Orc5" CDS 89..1396 /gene="Orc5" /note="similar to yeast origin recognition complex subunit 5, Swiss-Prot Accession Number P50874" /codon_start=1 /product="origin recognition complex subunit 5 homolog" /db_xref="PID:g2739446" /translation="MPHLENVVLCRESQVSILQSLFGERHHFSFPSIFIYGHTASGKT YVTQTLLKTLELPHVFVNCVECFTLRLLLEQILNKLNHLSSSEDGCSTEITCETFNDF VRLFKQVTTAENLKDQTVYIVLDKAEYLRDMEANLLPGFLRLQELADRNVTVLFLSEI VWEKFRPNTGCFEPFVLYFPDYSIGNLQKILSHDHPPEYSADFYAAYINILLGVFYTV CRDLKELRHLAVLNFPKYCEPVVKGEASERDTRKLWRNIEPHLKKAMQTVYLREISSS QWEKLQKDDTDPGQLKGLSAHTHVELPYYSKFILIAAYLASYNPARTDKRFFLKHHGK IKKTNFLKKHEKTSNHLLGPKPFPLDRLLAILYSIVDSRVAPTANIFSQITSLVTLQL LTLVGHDDQLDGPKYKCTVSLDFIRAIARTVNFDIIKYLYDFL" BASE COUNT 552 a 391 c 386 g 572 t ORIGIN 1 cgtgggccgc cagactcggg agaggctccg tcttgtgcaa gggtcctgtg ggctggctgc 61 actggcctct gcggtggtgc ctgccagaat gccccacttg gaaaacgtgg tgctttgtcg 121 cgagtctcaa gtgtccatct tgcagtcctt gtttggagag agacatcatt tcagctttcc 181 atccattttt atttatggac atactgctag tggaaagacc tatgtaacac aaacgttgtt 241 gaaaacttta gagctcccac atgtgtttgt gaattgtgtt gaatgcttta cattgaggct 301 gcttttggaa caaattttaa acaaattgaa tcatcttagt tcttcagagg atggatgttc 361 tactgaaata acctgtgaaa catttaatga ctttgttcgc ttgtttaaac aagtaaccac 421 agctgaaaat cttaaagatc agactgtata tattgttcta gataaagcag agtatctaag 481 agatatggaa gcaaatcttt tgcctggatt tcttagatta caagaattgg ctgacagaaa 541 tgtgactgtt ctctttctca gtgaaattgt ttgggaaaag tttcgtccaa atactggatg 601 ctttgagccg tttgtcttat atttccctga ttacagcata ggcaaccttc aaaagatcct 661 gtcccatgat catcctccag agtattcagc tgatttctat gctgcctaca ttaacattct 721 tcttggagtt ttctacactg tttgtcgaga tttgaaagag ctcagacatc tggcagtact 781 taattttcct aaatattgtg aacccgtggt taaaggagaa gcaagtgaac gtgatactcg 841 caaactgtgg agaaatattg aacctcattt gaagaaagct atgcagactg tttatctcag 901 ggaaatatca agttcccagt gggaaaagct acagaaagat gacacagatc cggggcaact 961 gaaaggcctc tcagcgcata ctcatgtgga acttccatat tactctaagt tcattctaat 1021 tgctgcatac cttgcttcat acaatccagc aagaactgac aagaggtttt ttcttaagca 1081 tcatggaaaa atcaagaaaa ccaactttct aaaaaaacac gaaaagacaa gcaatcatct 1141 ccttgggcca aaaccatttc cactagacag attattagca atattatata gtatcgtgga 1201 cagcagagtt gctccaacag caaatatttt ttcccagatt acctctctag tgacccttca 1261 gctgttaacc ctggttggcc atgacgatca gcttgatgga ccaaaataca aatgcacagt 1321 gtctctagac ttcatcagag ctattgcaag gacggtgaac tttgacataa taaaatactt 1381 gtatgatttc ttgtgaaaac aagcttcaaa gccatatgga cactgtgaca atgactaagc 1441 caagctgtgt tcatccagct acttagctgg ccaaggagag gagttctttg gctctattgg 1501 atttgtccaa acaggtgctg gcccagcatg gaatctgatg aaaatattct gattggtctg 1561 ggtggatgtg agcagaagac tatttaccag ggaccctgga gtatttggaa gcaacgtgtt 1621 aattataaac agcagggttt gagcacaatc tgttctactc ttaatgatgt tatcttaaca 1681 ctgaaattgc ctgaaaccca tttacttagg actacatttt gctctgtgaa ctatcccctg 1741 cgctttgaac gtgccagcag cccttgttta tatgcccatt cttttcactt cctctccaca 1801 ggagcctctg cagtcgcttg ccaaagcaga ttttcctaag gccactgttt taaaagatca 1861 tagttgcaaa atataataaa tacaagttct ttttaaaatc g // LOCUS HSU92642 1777 bp mRNA PRI 01-JAN-1998 DEFINITION Human high-affinity lysophosphatidic acid receptor homolog mRNA, complete cds. ACCESSION U92642 NID g2735350 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1777) AUTHORS An,S. TITLE Direct Submission JOURNAL Submitted (09-MAR-1997) Medicine, UCSF, 533 Parnassus, Rm. Ub8, San Francisco, CA 94143-0711, USA FEATURES Location/Qualifiers source 1..1777 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /dev_stage="fetus" CDS 157..1287 /note="similar to the Xenopus high-affinity LPA receptor encoded by GenBank Accession Number U76385" /codon_start=1 /product="high-affinity lysophosphatidic acid receptor homolog" /db_xref="PID:g2735351" /translation="MACNSTSLEAYTYLLLNTSNASDSGSTQLPAPLRISLAIVMLLM TVVGFLGNTVVCIIVYQRPAMRSAINLLLATLAFSDIMLSLCCMPFTAVTLITVRWHF GDHFCRLSATLYWFFVLEGVAILLIISVDRFLIIVQRQDKLNPRRAKVIIAVSWVLSW VLSFCIAGPSLTGWTLVEVPARAPQCVLGYTELPADRAYVVTLVVAVFFAPFGVMLCA YMCILNTVRKNAVRVHNQSDSLDLRQLTRAGLRRLQRQQQVSVDLSFKTKAFTTILIL FVGFSLCWLPHSVYSLLSVFSQRFYCGSSFYATSTCVLWLSYLKSVFNPIVYCWRIKK FREACIELLPQTFQILPKVPERIRRRIQPSTVYCCNENQSAV" BASE COUNT 342 a 543 c 484 g 400 t 8 others ORIGIN 1 gcttactcac tatagggctc gagcggccgc ccgggcaggt cagaggtcca cagacatcca 61 ggaagatgca gccagcagac aaagaatggc agccacccaa gtgctgaggg gagccttccc 121 ggccatttct gtcccagctc caagggtctc tccacgatgg cctgcaacag cacgtccctt 181 gaggcttaca catacctgct gctgaacacc agcaacgcct cagactcggg gtccacccag 241 ttgcccgcac ccctcaggat ctccttggcc atagtgatgc tgctgatgac cgtggtgggg 301 ttcctgggca acactgtggt ctgcatcatc gtgtaccaga ggccggctat gcgctcggcc 361 atcaacctgc tgctggccac cctggccttc tccgacatca tgctgtccct ctgctgcatg 421 cccttcaccg ccgtcaccct catcaccgtg cgctggcact ttggggacca cttctgccgc 481 ctctcagcca cgctctactg gttttttgtc ctggagggcg tggccatcct gctcatcatc 541 agcgtggacc gcttcctcat catcgtccag cgccaggaca agctgaaccc gcgcagggcc 601 aaggtgatca tcgcggtctc ctgggtgctg tcctgggtgc tgtccttctg catcgcgggg 661 ccctcgctca cgggctggac gctggtggag gtgccggcgc gggccccaca gtgcgtgctg 721 ggctacacgg agctccccgc tgaccgcgcc tacgtggtca ccttggtggt ggccgtgttc 781 ttcgcgccct ttggcgtcat gctgtgcgcc tacatgtgca tcctcaacac ggtccgcaag 841 aacgccgtgc gcgtgcacaa ccagtcggac agcctggacc tgcggcagct caccagggcg 901 ggcctgcggc gcctgcagcg gcagcaacag gtcagcgtgg acttgagctt caagaccaag 961 gccttcacca ccatcctgat cctcttcgtg ggcttctccc tctgctggct gccccactcc 1021 gtctacagcc tcctgtctgt gtttagccag cgcttttact gcggttcctc cttctacgcc 1081 accagcacct gcgtcctgtg gctcagttac ctcaagtccg tcttcaaccc catcgtctac 1141 tgctggagaa tcaaaaaatt ccgcgaggcc tgcatagagt tgctgcccca gaccttccaa 1201 atcctcccca aagtgcctga gcggatccga aggagaatcc agccaagcac agtctactgt 1261 tgcaatgaaa accagtctgc ggtttagggg gtcagggggc cacagagaag gggcagctga 1321 gccccagtcc cagggtggat ctgtcctgct ctgttccctg gcatgttggt catagtctgc 1381 actttgtggt ggcaatttaa gcacaaaggt actcatttgt aatcagatga gctgcagctc 1441 ccaaatttca aattttggca cgatgaatta tttttgtttc tctttgcaga gagccaaata 1501 tggggctgat gggaactgca acgtcattaa gtcaaaaatg gantgggctg gggagtgcag 1561 aagttgggca gaaagggaag aagggggcaa cagggaatga agctggttga ntgtggggca 1621 ngaagactgg tcagtcacaa gaacttcaac ctgccctgcg anccctctct gcacctgctc 1681 aagaaaaacc tganaaacct cagtaagtgt nanctcctgg gtgttcattc attttgtttg 1741 acaccaaggt tcttcagcta ttgangtttg ttgtgtt // LOCUS HSU92971 1830 bp mRNA PRI 16-APR-1997 DEFINITION Human protease-activated receptor 3 (PAR3) mRNA, complete cds. ACCESSION U92971 NID g1938374 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1830) AUTHORS Ishihara,H., Connolly,A.J., Zeng,D., Kahn,M.L., Zheng,Y.W., Timmons,C., Tram,T. and Coughlin,S.R. TITLE Protease-activated receptor 3 is a second thrombin receptor in humans JOURNAL Nature 386 (6624), 502-506 (1997) MEDLINE 97242411 REFERENCE 2 (bases 1 to 1830) AUTHORS Ishihara,H., Connolly,A.J., Zeng,D., Kahn,M.L., Zheng,Y.W., Timmons,C., Tram,T. and Coughlin,S.R. TITLE Direct Submission JOURNAL Submitted (11-MAR-1997) CVRI, UCSF, 3rd and Parnassus, San Francisco, CA 94143, USA FEATURES Location/Qualifiers source 1..1830 /organism="Homo sapiens" /db_xref="taxon:9606" gene 145..1270 /gene="PAR3" CDS 145..1269 /gene="PAR3" /note="thrombin receptor; coagulation protease." /codon_start=1 /product="protease-activated receptor 3" /db_xref="PID:g1938375" /translation="MKALIFAAAGLLLLLPTFCQSGMENDTNNLAKPTLPIKTFRGAP PNSFEEFPFSALEGWTGATITVKIKCPEESASHLHVKNATMGYLTSSLSTKLIPAIYL LVFVVGVPANAVTLWMLFFRTRSICTTVFYTNLAIADFLFCVTLPFKIAYHLNGNNWV FGEVLCRATTVIFYGNMYCSILLLACISINRYLAIVHPFTYRGLPKHTYALVTCGLVW ATVFLYMLPFFILKQEYYLVQPDITTCHDVHNTCESSSPFQLYYFISLAFFGFLIPFV LIIYCYAAIIRTLNAYDHRWLWYVKASLLILVIFTICFAPSNIILIIHHANYYYNNTD GLYFIYLIALCLGSLNSCLDPFLYFLMSKTRNHSTAYLTK" BASE COUNT 473 a 464 c 337 g 556 t ORIGIN 1 cctgcctgca cggcacagga gagcaaactt ctacagacag accaaggctt ccatttgctg 61 ctgacacatg gaactgaggt gaaattgtgc tccatgattt tacagatttc ataacgttta 121 agagacggga ctcaggtcat caaaatgaaa gccctcatct ttgcagctgc tggcctcctg 181 cttctgttgc ccactttttg tcagagtggc atggaaaatg atacaaacaa cttggcaaag 241 ccaaccttac ccattaagac ctttcgtgga gctcccccaa attcttttga agagttcccc 301 ttttctgcct tggaaggctg gacaggagcc acgattactg taaaaattaa gtgccctgaa 361 gaaagtgctt cacatctcca tgtgaaaaat gctaccatgg ggtacctgac cagctcctta 421 agtactaaac tgatacctgc catctacctc ctggtgtttg tagttggtgt cccggccaat 481 gctgtgaccc tgtggatgct tttcttcagg accagatcca tctgtaccac tgtattctac 541 accaacctgg ccattgcaga ttttcttttt tgtgttacat tgccctttaa gatagcttat 601 catctcaatg ggaacaactg ggtatttgga gaggtcctgt gccgggccac cacagtcatc 661 ttctatggca acatgtactg ctccattctg ctccttgcct gcatcagcat caaccgctac 721 ctggccatcg tccatccttt cacctaccgg ggcctgccca agcacaccta tgccttggta 781 acatgtggac tggtgtgggc aacagttttc ttatatatgc tgccattttt catactgaag 841 caggaatatt atcttgttca gccagacatc accacctgcc atgatgttca caacacttgc 901 gagtcctcat ctcccttcca actctattac ttcatctcct tggcattctt tggattctta 961 attccatttg tgcttatcat ctactgctat gcagccatca tccggacact taatgcatac 1021 gatcatagat ggttgtggta tgttaaggcg agtctcctca tccttgtgat ttttaccatt 1081 tgctttgctc caagcaatat tattcttatt attcaccatg ctaactacta ctacaacaac 1141 actgatggct tatattttat atatctcata gctttgtgcc tgggtagtct taatagttgc 1201 ttagatccat tcctttattt tctcatgtca aaaaccagaa atcactccac tgcttacctt 1261 acaaaatagt gaaatgatct tagagaacaa ggacagccat cacagagaac gtctgttttc 1321 aagaacaaca taagcatagt gcaaggagct ccatttccga gctcctaaga aatatgcttc 1381 aaaggtcaaa cattacaaaa gcattagtag tttgtttgtt tgtttttgag actgagtctc 1441 actttatcac ccagactggc gtgcagtggc actatcttgg ctcattgcaa cctctgcctc 1501 ccaggtcagc ctcccaagta gctgggatta caccaccatg cccagctact aaaaatactt 1561 gtatttttag tagagacggg gtttcaccat gttgaccagg ctggtcttga actcctgacc 1621 tcaagtgatc ttccggcctc agcctcccaa agtgctggat tacaggcgtg agccactgag 1681 ccagccagca ttagtaattt ttaaaaacac tttatcagta ttttaaaaat gttaatgcag 1741 gagaaaagat atcacaactc tatggaaaat gacatttcca tttgccttat tgctacttca 1801 agctctttaa atcaccatct tccctatttc // LOCUS HSU93163 40352 bp DNA PRI 22-JAN-1998 DEFINITION Human MAGE-B gene cluster, MAGE-B1, MAGE-B2, MAGE-B3, MAGE-B4 genes, complete cds. ACCESSION U93163 NID g2459678 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 40352) AUTHORS Lurquin,C., De Smet,C., Brasseur,F., Muscatelli,F., Martelange,V., De Plaen,E., Brasseur,R., Monaco,A.P. and Boon,T. TITLE Two members of the human MAGEB gene family located in Xp21.3 are expressed in tumors of various histological origins JOURNAL Genomics 46 (3), 397-408 (1997) MEDLINE 98110575 REFERENCE 2 (bases 1 to 40352) AUTHORS Lurquin,C. TITLE Direct Submission JOURNAL Submitted (12-MAR-1997) Ludwig Institute for Cancer Research, Brussels Branch, and Cellular Genetics Unit, Universite Catholique de Louvain, Brussels., 74 avenue Hippocrate, UCL 74.59, Brussels B-1200, Belgium FEATURES Location/Qualifiers source 1..40352 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /map="Xq21.3" exon 3266..3364 /gene="MAGE-B2" /number=1 mRNA join(3266..3364,6278..7979) /gene="MAGE-B2" /product="MAGE-B2" gene 3266..7979 /gene="MAGE-B2" exon 6278..7979 /gene="MAGE-B2" /number=2 CDS 6283..7224 /gene="MAGE-B2" /codon_start=1 /product="MAGE-B2" /db_xref="PID:g2459680" /translation="MPRGQKSKLRAREKRRKARDETRGLNVPQVTEAEEEEAPCCSSS VSGGAASSSPAAGIPQKPQRAPTTAAAAAAGVSSTKSKKGAKSHQGEKNASSSQASTS TKSPSEDPLTRKSGSLVQFLLYKYKIKKSVTKGEMLKIVGKRFREHFPEILKKASEGL SVVFGLELNKVNPNGHTYTFIDKVDLTDEESLLSSWDFPRRKLLMPLLGVIFLNGNSA TEEEIWEFLNMLGVYDGEEHSVFGEPWKLITKDLVQEKYLEYKQVPSSDPPRFQFLWG PRAYAETSKMKVLEFLAKVNGTTPLCLPNPLRRSFER" polyA_signal 7961..7966 /gene="MAGE-B2" mRNA <23546..25194 /gene="MAGE-B3" /product="MAGE-B3" gene <23546..25194 /gene="MAGE-B3" exon 23546..25194 /gene="MAGE-B3" CDS 23607..24647 /gene="MAGE-B3" /codon_start=1 /product="MAGE-B3" /db_xref="PID:g2459681" /translation="MPRGQKSTLHAREKRQQTRGQTQDHQGAQITATNKKKVSFSSPL ILGATIQKKSAGRSRSALKKPQRALSTTTSVDVSYKKSYKGANSKIEKKQSFSQGLSS TVQSHTDPLTMKTNMLVQFLMEMYKMKKPIMKADMLKIVQKSHKNCFPEILKKASFNM EVVFGVDLKKVDSTKDSYVLVSKMDLPNNGTVTRGRGFPKTGLLLNLLGVIFMKGNCA TEEKIWEFLNKMRIYDGKKHFIFGEPRKLITQDLVKLKYLEYRQVPNSNPARYEFLWG PRAHAETSKMKVLEFWAKVNKTVPSAFQFWYEEALRDEEERVQAAAMLNDGSSAMGRK CSKAKASSSSHA" polyA_signal 25152..25157 /gene="MAGE-B3" mRNA <29748..>31474 /gene="MAGE-B4" /product="MAGE-B4" gene <29748..>31474 /gene="MAGE-B4" exon 29748..>31474 /gene="MAGE-B4" /note="it is likely that this exon extends to the polyA signal located at position 31822..31827" CDS 29808..30848 /gene="MAGE-B4" /codon_start=1 /product="MAGE-B4" /db_xref="PID:g2459682" /translation="MPRGQKSKLRAREKRQRTRGQTQDLKVGQPTAAEKEESPSSSSS VLRDTASSSLAFGIPQEPQREPPTTSAAAAMSCTGSDKGDESQDEENASSSQASTSTE RSLKDSLTRKTKMLVQFLLYKYKMKEPTTKAEMLKIISKKYKEHFPEIFRKVSQRTEL VFGLALKEVNPTTHSYILVSMLGPNDGNQSSAWTLPRNGLLMPLLSVIFLNGNCAREE EIWEFLNMLGIYDGKRHLIFGEPRKLITQDLVQEKYLEYQQVPNSDPPRYQFLWGPRA HAETSKMKVLEFLAKVNDTTPNNFPLLYEEALRDEEERAGARPRVAARRGTTAMTSAY SRATSSSSSQPM" mRNA join(31403..31474,33958..34062,35057..35139,38088..39691) /gene="MAGE-B1" /product="MAGE-B1" exon 31403..31474 /gene="MAGE-B1" /note="this first exon of MAGE-B1 is also included at the end of the MAGE-B4 mRNA" /number=1 gene 31403..39691 /gene="MAGE-B1" exon 33958..34062 /gene="MAGE-B1" /number=2 exon 35057..35139 /gene="MAGE-B1" /number=3 exon 38088..39691 /gene="MAGE-B1" /number=4 CDS 38148..39191 /gene="MAGE-B1" /codon_start=1 /product="MAGE-B1" /db_xref="PID:g2459679" /translation="MPRGQKSKLRAREKRRKAREETQGLKVRHATAAEKEECPSSSPV LGDTPTSSPAAGIPQKPQGAPPTTTAAAAVSCTESDEGAKCQGEENASFSQATTSTES SVKDPVAWEAGMLMHFILRKYKMREPIMKADMLKVVDEKYKDHFTEILNGASRRLELV FGLDLKEDNPSSHTYTLVSKLNLTNDGNLSNDWDFPRNGLLMPLLGVIFLKGNSATEE EIWKFMNVLGAYDGEEHLIYGEPRKFITQDLVQEKYLKYEQVPNSDPPRYQFLWGPRA YAETTKMKVLEFLAKMNGATPRDFPSHYEEALRDEEERAQVRSSVRARRRTTATTFRA RSRAPFSRSSHPM" polyA_signal 39674..39679 /gene="MAGE-B1" BASE COUNT 11599 a 8651 c 9775 g 10327 t ORIGIN 1 aagcttcaga gttgagagac taagcctata atggaaaagt gctcttaact ttggagaaaa 61 ctggaaagaa gctccaccag gggcagcact ggaaagtgac tatgctcttg tcccagtgca 121 gatgaacaca gtgcagaaac tagctgcttt atatacattg tatccagtga gcttcggaga 181 aataaggatg atactccctt aagaagaaat gataagaagc cgctgggtac tctgcactgg 241 gctggaagag ttaagagtca atttcattaa aaggagcatt caaattaggc tatcatgtac 301 ataatttggc aagctctagg caagttctag gtgtttggtg cttgttaaat taatgaaaat 361 aaaggtgatt tagatggaag tgacactggg agggaaggaa gttgttagtc cttgacgcaa 421 atgcgttgtg ttgcacctaa actggagaag tctacctcac attgaataat ttatcattta 481 atggggaaat attacttgat ttttcgtgct aatctgcatt ttggcttatt gggttttctt 541 tcttctagaa tgctgcatat tctctgacat ctgttggatt aagaaaaagc aatcacagtg 601 ggatgggtgg tatcatcaag ttaatcatag gtaatagctg ccatgttgaa acctcaatat 661 gggccaagta ttgtgccagg tgctttacat acattaacca catttcacca attccacaag 721 gaaggggctt atcaaaccta tttcacagat gcagagcctg aggctcacag agtttagtaa 781 tgtgcctgag ttcacatagc tggtaagtgg cagggctgaa accagacccc cagtctggat 841 taatctacat cccacactgt tccagtcttc ccattctgag gctaacctca tgttgcttca 901 ttcatttttt ctctctattt ttgaaaaatg tatctcaggc aataaaaaga aggactctgt 961 gcccatacta ctgggtcaaa tatattccca gaccaatatg aatgccaaaa taaatgagag 1021 gcataaataa aaaaaagaaa aatagtacct ggtgtcaata cgttcatcta cacaagcaat 1081 atcaggtaat ctcaaaagat cagaagctcc caaaatcatc ctggacagtc tcttccctgt 1141 agaaaataaa tgtgtcagaa cagaattttc gtgagaatct aagtctagca agatggaaaa 1201 aatcagcatg tttttttctc tgacagccca ctacctgtcc tggagcttct gccttgtgct 1261 tcagcttccc tatctcacat cactcccgga acacggctcc ttctctgcaa ggtttgcagt 1321 agaaaaactg gtcatgtttg cagggatgtg acatttccca gaaagccttt aatgaaagat 1381 gatccccagt tcagaggaca tgggaaacca atatctcaga atatatgagg ttatgcatag 1441 actgactctc atctgctctc agacctagag agctcgagac agaactgtca ggctgagcat 1501 ccactcattt aatcttcaga cctctcatgg acatggccta agaaggtcaa cttcatgtca 1561 gtagatgcag ttcagcctct gatagatatg aacgtgagga ccctgaatga agatgatgga 1621 aggacccagc cagaactgtg gggtcccaaa aagtccagct caggctgtca gcccttcaaa 1681 gtccagaaca gccctggcta aatgtagctc accacaactt ccagttcaga ggtcggcaga 1741 gggtgagtac cttgatctga tgggaacagc ctcaggtcag tggaggatgg agtgccccaa 1801 gactggccag gtgtcatggt gaggaccctg aagagaaata agagaactgt tataggacca 1861 acgagtttct gtgtcagctg caaagtaaca gacccactat aataaaacag caaagtttgc 1921 agcaaagagt ttaatgattg cagggtgcca agtgaggaga tgagaagaga ctctcaaatt 1981 catctcccca aggagttctg ggctgggatt tgcaaggaga tcatggaggg cgaggggctg 2041 gagaactgag gttgttgatt cgttggggta aaggggatga aatcatcaga atgtggaaac 2101 tgcgtttttt tttttttagt cagctcctgt ggggtctttt ggaccagctg atgtcagtag 2161 ggtcctttag agcagctggc atcagtgggg tccttcagac cagctgagtc attagaatca 2221 tcagtatgca ggaccttaag gaatatctca aagaaaaaaa gtttcattat gttaaaattg 2281 ttatctatag agcaattaag ggcaactata atctagtaac aggatctaca tgattctggg 2341 ataataggca ccaaactact atgaggaagc aggtcagaga gcagctgact tcatgattaa 2401 tgctgaatgt tctgcaagct tggcgtatct tcatttatct cccttccctc ttccctgatt 2461 aattttatac agttcatagg ggcagtttca aaaccaccca cccagaacag aggagcccac 2521 aaattccacc aatctgtcac ccttggggag cccagaagag aattatgagg tggagaaatc 2581 ccatcaatat gcctcgtgtg tttcacagag gtgaagacct tggactagtg gaaatggccc 2641 taggtcaaag agacaaaacc cacgccctaa caggaatcaa ggtgaggatc ctaagtgtta 2701 cgagggggct ctcccccttg caacaccagc agagggggcc aaacagagcc ctcgccattg 2761 ttagcaccga gagccccaaa ataagtatct ctgaatccct tcattccacc tgtatagtct 2821 cggggaagga agggcctttg tctgaagagg gtgacccagt tgtgcagaga gagagtcttg 2881 gctttcacgg gaatcaaggt gaggactctg agggcggatg agaagacctc tccccaaaaa 2941 aggcacattc acagagccct gccgctgctg tcaggcctgt gaggccaggc aggggtggcc 3001 tgtttggcac gcttagattt ccacagtggg ggctgaggga ggtgggggta ttgtttggag 3061 gctggcggat ttgggtcagc acgcatattc gtcccaggct gctagatact gaggtgagga 3121 ccctagtgga gacgaaggga ccagcaacgc tagaacagtg acgtccggta gcgtccagcc 3181 gtcagcccct cagacgccac gggctgccgg atgtgagtca tcctgacttc cgctttgaaa 3241 aaaaagaccc gagcggatgt ggctcatcct gacttccgct ttggaggcga ggacccgagc 3301 gagtgtaggg ggtgcggcgt ctggtcagcc aggggtgaat tctcaggact ggtcggcagt 3361 caaggtgagg accctgagtg taaactgaag agaccacccc cacctgtaac aaagagggcc 3421 ccactaagtc ccgcttctgc atttggtcct gagaggctcc ggtaaagccg tccggcaatg 3481 ttccacctgg aaagttccag ggcaggggaa gggtgggggg aggggcagtc gcgggggaag 3541 gaggtttgga cgcagggaat aggcctcatt ctgcacgtag ggtgggtgta ggccctaact 3601 gaaatcaatt tgagggccct aaatgtggac tgaagagaac atctcctacc cttaacaaag 3661 gtggcctcac taagtctcgt ccctgaagac ggctctggga ggccccagca aagctgtgcc 3721 tggaaagtct cagggagggg agggccttga acctaggaag cagccctgct tctgctcata 3781 tggcatcaac ctaaaaccta actggggtta aggtgaggac tctgagtatt aatgagcgga 3841 cctctaccca aaagaggagt catgctgggc agttcccagg ccgaggtgac actgggtttt 3901 agatggtccc aggctgacat gacaaatgca gagcaacctg acttcctctc ccgggactaa 3961 agaagttgag ggctacatcc aagacccaca tctgctgcca gcaataggaa ggtcagggca 4021 aagctagcaa gctgagatgc tctctagttt cctctggaag gtggtctcag ggaggtgagg 4081 gtctggtgta tggggtaggt ctaatgcgac agagaggccc agtctctaac aggaagcaag 4141 gatgtgacac tgagtgatga ggaggggact ccaaccagaa agaagtgcca tatagagccc 4201 actcttgttg tcagaccagg gagacctgag catggcatgt tgagttgtac tcactctccc 4261 caagacatct cagagaagtg aacgacccca ttaaagaggg cagcctggcc gggcgcggtg 4321 gctcacgcct gtaatcccag cactttggga ggccgagaca ggcggatcac gaggtcagga 4381 gatcgagacc atcctggcta acacggtgaa accctgtctc tactaaaaat accaaaaaaa 4441 aaattagcgg ggagcggtgg cgggcgcctg taatcccagc tactcgggag gctgaggcag 4501 gagaatggtg tgaacctggg aggcggagtt tgcagtgagc cgagatagag ccactgcagt 4561 ccggcctagg cgaaagggcg agactccgtc tcttaaaaaa aaaaaaataa aaaggcagct 4621 tgaggtcagc agagggaggg tttccaggtt gtgccagatg ttatgataag aactctgacg 4681 actgcagggg gctcccaccc catattagtg gagccacaca tcctcagtac tgtcagctct 4741 gagagacccc aggcagaaat gtgaaacaga gtgcccatca gttcccactc aagggtaaca 4801 gggaaatgag agtttaatct gagggtgtga tcacttgtca acagaagtaa tgtactaatt 4861 ggtccattct ggcattacta taaataccca agactgggta atttacaaag aaaagaggtt 4921 taattggctc acagttctgc aggctgtaca ggaagcatga tgctggcatt tgctcagctt 4981 ctgggagcag ggctcaggaa atttacaatc atggcagaag gggaagggga agcagacatg 5041 ttacatggcc agagcaggat caagaggcca ggggaggcgc tacacacttt taaatgacca 5101 gatctcatga gaagtcatcg ttaaggggaa tggtgctaaa ccattcatga aaaacctgcc 5161 ctcatgatcc agtcatctcc caccaggctc cacctccaac attgaggatt acagttagac 5221 atgagatttg gtgaggatac agatccaaac catatcaggg aagaatctta gaccctctca 5281 tgagtcaaat ttagaggctg agttgggact gtatgtggga ccatctatca cagggcagtt 5341 atctgcctgc cacagctttt acttttggga agaccacatt tggtcagata aggccaccct 5401 cacttcctcc tatgggttct cagggatgtt cactcttacc atgacgacat aggcctcagt 5461 tcaataaaaa ggagagctcc tctaccagga gtcaaattga ggaatctgag aattcaagga 5521 accactgaac ccttaacagt ggagacatca aagaatccag tccaaccctt tattttcagc 5581 cctagaaagc tcaggaaagg gttatcaggt ttagtgtccc cctctctttt ttatataaga 5641 tctaagagag gtgagattct tagtctgacg gtgtagccac cactcagcag aagggggctg 5701 gtgccaagcc ctacatgaag ccaaggtggg gaccttgaat gagagctgaa gactctaatg 5761 attccaagac atcaaagacc ccactgaact ctcagcactg ggtgtcactt ctcagggtgg 5821 tggttgaggc accccttcaa tttcttctca tgagtcccag ggtgctttga ggtgtcaaat 5881 tcacagtagc ggaagggagt ggctcaggcc tgcaagactt catatgataa tcctaaaagt 5941 taactgataa aaccccaaac accagaagac aggaggccgc actgccagta cctcctgtca 6001 ccatagtaag cccaaggaag ggctgacaga atgcagtctg aaactcactg taggggttct 6061 gtggttcttc tctggtgtct caggggaaag gataaccagg acgacaggag ccttgtggct 6121 ttctagaaca gtgccttcag ggaaacttgc aaaggcaggc ttccttcagt aaagctaaga 6181 tggggtcttc cagctgaagg tgctcacaga tctcattctc ccatctccag gtatactaac 6241 catctattct tctgcccaca tttcttggtt tacccagcca tcatgcctcg tggtcagaag 6301 agtaagctcc gtgcccgtga gaaacgccgc aaggcccgag atgagacccg gggtctcaat 6361 gttcctcagg tcactgaagc agaggaagaa gaggccccct gctgttcctc ttctgtttct 6421 gggggtgctg cttcaagctc tcctgctgct ggcattcccc agaagcctca gagagcccca 6481 accactgccg ctgctgcggc tgcgggtgtt tcatccacaa aatctaaaaa aggtgccaag 6541 agccaccaag gtgagaaaaa tgcaagttcc tcccaggcct caacatccac taagagccca 6601 agcgaagatc ctctaaccag gaagtcaggg tcgttggtgc agttcctgtt gtacaagtat 6661 aaaataaaaa agtccgttac aaagggagaa atgctgaaaa ttgttggcaa aaggttcagg 6721 gagcacttcc ctgagatcct caagaaagcc tctgagggcc tcagtgttgt ctttggcctt 6781 gagctgaata aagtcaaccc caacggccac acttacacct tcatcgacaa ggtagacctc 6841 actgatgagg aatccctgct cagttcctgg gactttccca ggagaaagct tctgatgcct 6901 ctcctgggtg tgatcttctt aaatggcaac tcagctactg aggaagagat ctgggaattc 6961 ctgaatatgt tgggagtcta tgatggagag gagcactcag tctttgggga accctggaag 7021 ctcatcacca aagatctggt gcaggaaaaa tatctggagt acaagcaggt gcccagcagt 7081 gatcccccac gctttcaatt cctgtggggt ccgagagcct atgctgaaac cagcaagatg 7141 aaagtcctgg agtttttggc caaggtaaat ggtaccacgc ccctgtgcct tcccaaccca 7201 ttacgaagaa gctttgaaag atgaagagaa agccggagtc tgagccagag ttgtagccag 7261 gccttgcact actgccatag ccaatcaatc tcccaaagcc aagtttacct gctgttctca 7321 cccccaatga ggtcttaggc agattcttta ctttgtaatt caaaaggcct gttaaccttt 7381 gttcttgtta tgcatgaata acttgttgac tttttttttt tctctttttc aactagtgtt 7441 tcaacaggtt tatttagatt cagaatgtaa atttacaaat gatatagatc accctgttat 7501 tgctgttttt cagggacagt agaaagtgtt ttgttttttg agtgaaacaa cttattaata 7561 aaaatcctta aatcactttt gtaatccagg acaagaaaat gtggcattag agtagaaata 7621 tctttggaaa tgtgaaagac cccatagtga aatatttggg atcagaagcc agaggtgtaa 7681 aagtggtcaa ttcttggttt acttcattta atctttcttt tcataaagat acatacctgg 7741 atttgtttat gttattcaag aatgtgtgag aaattaaacc atagttagtt aatcctcttg 7801 tttatggctc ttactttaaa tattttaatt gagcatctgt tctttgggag tcttcctgct 7861 tgtactggga atgtttagtc aaagaagacc aagcttgtgc tcatgcaatt gtagattcta 7921 ggagaagctg tcatcatgga aggtgagaca cttcataaaa aataaaagat aggaaaaaga 7981 agtgatgaaa gcaatagttt tcattgctaa gcagcatttt ggctgaatgg ggtttttcag 8041 tcttagaatg ttgcgtattc tcagccgggt gcagttgctc acatctgtaa ctcccagcac 8101 tttgggaggc tgaggtgggt ggatcacctg aggtcgggag ttcgagacca gcctgaccaa 8161 catggagaaa ccctgtctct actaaaaata caaaattagc caggcgtggt ggcgcatggc 8221 tgtaatgcca gctactcggg aggctaaggc aggagaatca cttgaaccca ggaggcggag 8281 gttgctgtga gccgagactg cgccattgca ctccagcctg ggcaacaaga gcgaaactcc 8341 ctctaaaaaa aaaaaaaaaa aaaaaaaaaa aaattgttgc atattctctg agatatcctg 8401 gatttttttt ttttaatctc aactggaatg aaggtaatgt ctctgatcaa aatgatagtg 8461 gtaattgcca cttcctgatg tctcagttga cgagacaggc acatttgcat gcattacata 8521 tattccacca gtactacgag atggggctta tcaaacccac tttacggtaa agagtctgag 8581 gctcactgag tttggtaatt tgccaaagtt tacagggcca gtagtgaaag ggttgggata 8641 agcctcctga tctgaattac cctagagcct accctgttcc acccctctca gcctgaggca 8701 aaccttgaaa aaatatttcc gttttcttct cattactgcg gaatgtgttt caggtgataa 8761 aaagaataac cttgtaaccc aaactagtag attggaatac atagccacag caacaataat 8821 accaaaaaaa ctgagaggta taaataagta aaagaggcaa tattcagtga cagtatgatt 8881 atctacattc gcaatgttag ataacctcaa aagatcaggg gctcccaaat catcctgaac 8941 aatctcttcc ctgtagaaaa taaatctatc agaacagaaa ttacatgaga atgtaaggct 9001 ggctgataga agaaatagat caatgtatct tatctctggc tgcccatctc caatcctagg 9061 ccccctgctt tgtgctacag ctttgcggta tcacaccact cctaggacag agacccattc 9121 tctgcaatgt ttacagagag gaaattgttc tagaatttgc agagatgtgg cattttgcag 9181 gaagcattta atccagagac agagccagat gagggctggg agaccacatc ccagaatatc 9241 aggcaggggt ttacatagac cattcccttc tgctctcaga cacggtccta ggctgagctg 9301 tcagtctgag catcccctca catattcttc aggtgtctga gagacataag agccctaagg 9361 gcagcatcct caagtcagta gggaaaggaa tcccagcctc ttataggaat gaaggtaata 9421 aacctaaatg agggtgatgg aaggacccag ccagaacagt tgggggcctg taaagtcacg 9481 ttccaattgt agtcttgtga gtccagaacc gtcctggtgg acactgactt ccaattcagg 9541 tcccagcaag gtgaggactt tggtctgaag gctatggcct caggtcagta gggagtggag 9601 tcccaggaag ggggcagagt caaggtgaag gcctggcatg tggatatggg gaccactcaa 9661 cccataactg gctggcgtgc ctctagtcat tcctttgcta ttagccctag gaggcctagg 9721 gaggagctgg caggcccagg tgcccgtgtc ttcttcctgg agaatctcaa ggagatgaga 9781 gacttgattt acaggctggc ctcccagtta agcaaagagt ggaatcccag accatgatag 9841 gtattaaggt aaaaatctca tatgaggaaa gaggcaaaca actgctagag aggtgaacag 9901 aacaaggacg tccttagaaa gttatgccgg gccagacgcg gtggctcatg cctgtaatcc 9961 agcactttgg caggccgagg tgggcggatc atttgaggtc aggagttcaa gatcagcctg 10021 gccagcatgg caaaaacccg tctctactta aaaaatacaa aaattagctg ggcgtgatgg 10081 tgggcgcctg taatcccagc tacttgggag gctgaggcag ggagaattgc ttgaacctgg 10141 gaggcgaagc ttgcagtgag ctgagatcac gccactgcac tccagcctgg gcaacagagc 10201 gagactttat cttaaaaaaa aaaaaagaaa agaaaagaaa gttatgccct gggaggccat 10261 aagcatagct ttcaggctga ggttacccca cattcgctga ggggagggga gtggtctcag 10321 ggacaggaag tccttggact aataggagag gccccattca gtagagggag gagattggct 10381 ccttaacagg attcaacatg atgacactga gtgacaatga ggggacccct ctcagaaaga 10441 agagggtcac atagaacctt gtccctattg tcagacctga gacctggaca tgatgacatg 10501 atgtatcata ctcacttctt cccaggtaat ctcgcggagg tgaaaaatat taacaaagtg 10561 ggcagcctca ggtcagtaga gagaagattt ctaggttgtt gcagaagtaa atgtgaggac 10621 cctgaagact gaggtgaaca cctaccccat aacagtgagg gtaacacaga ttccttccac 10681 tactgttagc ctagggagaa catgagcagt tatggcaata tgaagcaaat tttacttctt 10741 cttagggtgt accaggaaat tggtcttcaa ttcccttcat atggggagac aggcctcaag 10801 tcaagacagg gaggatttgc acagggtgtg acaggaataa ggatgaggaa actgaagagt 10861 atgggaccac agaccccaca tcagtaggag ccacacagaa tcctccctac tgtcagccct 10921 cagagacccc aagcagaaat gtcaggtgga gtttcccatc aggtcctact caagggtaac 10981 aagggaagtg agggtcttga tcttagggtg gtagtctcac atcaacagaa aaagaaatct 11041 tagacctgtc cagaagttaa aaatttagga ccctgagttg ggaccctgca tggaactacc 11101 catgataggg ccttgaactg cctgccacag gtttcattct tgtgagacca ttggcaggta 11161 tatagccaga tgaggccatc cttgcttcct cttgctggat cttaagaagg tgtgggcctc 11221 actttgagga gatgtcatca ggtcaagaag gaggggatcc ccaggccctt ccaggagtaa 11281 aatcagggaa tctgagtgga gactgatggt acaacatagc cttgaataga ggggacaaca 11341 aagtgtcatg ccctaccctt tgatatcagc cttagaagac tcagagcagg gctgtcaggg 11401 gaggcacctc atcacttctt tatatacggt ctaggggagg ggaaatcttt ggtctgaaag 11461 ttcagctacc agtcagcaga agagatccaa tacatgccag tcactgtgcc aagtgctttc 11521 catacattac ccacatcctg ccagttctgc cagacagggc ttaccaaacc cattccatag 11581 atgcagaacc taaggctcat aggatttagt aatgtgccca agtcacactg ctagtaatga 11641 cagggctggg atgagaacac tggtctgagt tagtcaaaag cccacactgt gccactcctc 11701 tcagacaaag gatgaccttg ttccttaatt aacttttcct ctcgttactg caaaatgtac 11761 cagaggcaat aaaaagaaag actgtggcca aacgatggac tgtaatatat agccagaaca 11821 ataaaaatat caaaataaat ggaaggtttg agtaaggaaa aaaataatat tttgatgaca 11881 taatttacat acatatgtga tgtccaataa attcaaaaaa tgaggaggta caaaaaaatc 11941 atctgagcca cttccttaga aaaagtaaat atatcacaac agaagttaca tgagacccta 12001 aatctggctg ggcaaaaata ggagatggac atttgctttt tctcttatag cctaccacct 12061 atcctggccc cctgccttgt gctgtagatt tcccatattg catcactcct agggcagggg 12121 cctgttctct gaaaggtttg cagagaggac actgctcaag agtttcgcag agatgtggca 12181 tttcccagga agcctttaat cccactgcaa gagcctgtat gaggacatac ataaatcagg 12241 actggaaaaa caaacactgc agaatattgg ttattaccaa aagactatcc cctgcctgct 12301 ttcagaccta gagggtccca ggcagaactg tcaggatgag catcctctca tttatttttc 12361 acagttgtca ggaacttgag agctttagcc taaggagggc aagcttaggt caggagagag 12421 ataagaccca agatctgtta gatacaaatg ccagatccct taaaagtaat taagaaagca 12481 cccatccaga acaccggggt cccgaagagt tcaggccctg ctgttgaccc ttggaggtcc 12541 tgaattcaat ggttggatga aatatagaga ggccttgaaa gatgaagcag agagagcctg 12601 agccagagct gcagccaggg ctgtgctact gccatggcca tggcacatac cagagccacg 12661 tccagctgct cctcctacat ctagtgagat ctgagacaga ttcttcactt tgtagttgaa 12721 aagataagtc aacattctaa gtagtggaga gtcaattttg acctagggca aacatattgt 12781 atgtcttatt tttgtttttg ctctacttga ataattggaa gatgtatctt ttttattttt 12841 ggtactttta aaatgtattc attttaatag aagatttatt tagcttcagt atctgtgttt 12901 atgaataaca tggataacat atttatttct gttttccata tataaatgta aaagtgctgg 12961 tattttttta tcaacaaact gaaaatcctt aggtctctct ttgtggtcca gaacaagata 13021 acatagcata tgaataagga cttttttatt gtataaattt aagttgtaca gtaggacgtt 13081 ttgatataca tatatggcca tgcaccacat gatgatgtct caatcaggac cacatataca 13141 atggcagtct cgtaagatta taatggagct aaaaaactgt attgtttaca tgtgtgtggt 13201 gatgctggtg aaaacaaacc tactgcacta tcaggcctat aaaagtatag cacatacaat 13261 tatgtgcagt acatagcgct tgatactgat aataaacaac aatgttactg gtttatatct 13321 ttattatact atagtttttt ttaaatcagg ttggtgcaaa agtaattaca ttttttgcca 13381 ttaaaagtaa taagaattga gatggggtag tagtagtaat aatattctag taatactaaa 13441 taagtagtaa tattaatagg tagtactatt aaaatattaa tattaattaa taagtagtaa 13501 tattaaaagt agtaataatt gggacggagt cttcctctgt tgtccaggct ggggtgcagt 13561 ggtgcaatct cagctcattg caacctctgc ctcctggatt caagcattct cctacctcag 13621 ccacctgagt agctgggatt acaggggcct gccaccacga ccagataatt tttggacttt 13681 tagtagagat ggaatttcac catgttggtc aagctggtct tgaactcctg acctcaagtt 13741 atccacccgc ctcatcctcc caaagtgctg ggattacagg tggtactttt attattattt 13801 tagagtgcac tccttctact aaaaaaaaaa atgttaactg taaaaacagt ctcaggcagg 13861 tccttcaggt tgtattccag aagaaggtgg tgttatcata agagatgaca gctccatgcg 13921 tattattgtc cctgaagacc ttccagtggg acaagatgtg gaggtagaag acagtgatat 13981 tgatgatcct gaccttgtgt aggcctaggc taatgtgggt gcttttgtct tcatttttaa 14041 caaaaacgtt taaaaattta aaaagtaaaa atagaaaaaa gcttataaaa taagaatata 14101 aagaaagaaa atatttttgt acagctgtac aatgtgctta tgttttaagc taagtgttac 14161 tacaaaagag tcaaaaagct aaaaaaaatt aagaagttta taaagcaaaa aaagttacag 14221 taagctaagg ttaatttatt gttaaagaaa gaaaactatg cttgagaatt tagtgtagtc 14281 taactgtaca atgtttatta agtctataat agtgtacagt aatgtcccag gccctcacat 14341 tcactcacta ttcaatcact ggctcaccca aagcaacttc cagttctgca agctacatta 14401 atggtaaata ctccacacag gtgtatgatt tttaacaatc ttttgtacca tattttttac 14461 tgtacctttt ctgtgtttag gtacacaaat acttagcacc gtgttacatt tgcctatagt 14521 tttcagtgca gtcacatgct gcacaggttt gtggcctagg agcaataggc tacaccaaat 14581 agcctacgtg tgtagtaggc tatataccac ctaggtttat gtaagtatac tctattatat 14641 ttgcacaaca atgaaattgc ctaacaacgc gtttctcaga aagtatcttc attgttaagc 14701 aatgcattac agtacatggt gaaatgctta ctacaggtaa gcaatttaat atatccatta 14761 tatcatatag ttaccttgtt ttgtggtaag agcagctaaa atctactctt agaaaatttg 14821 cagtatgcaa tacaatatta ctaactgtag tcctcatact gtactttaga tctctagatt 14881 tattcgtctt ataaaactgc aactttgtac tgtttgacat gcatctctgc atcccctccc 14941 caccctgcct ctggtaacta ctgttttatt ctctttttct atgtatttaa catttttttc 15001 tttttcgatt ttacatataa gggatattaa tagcatgcag catttttctt tctgtgtctg 15061 gtttatttca ttcagcataa cgtcccccaa gttcatctgt gttgttgcaa tggcagaatc 15121 tctttctttt tcaaggctaa ataacatttt tttaaatttt atcttagttt caggggtaca 15181 tgtgcaggtt gttatatagg caattgcatg tcatgggggt ctggcgtaca gattatttca 15241 tcacccaggt aataagcata gtacacaata ggcagttttt ccatcctcac cctactccca 15301 ccttccgccc ttaagtaggc cccagtgtct gttgttccct tctttgtgtc catgtgtact 15361 caatgtttaa ctcccacaaa gaacatgcag tatttggttt tctgtttcat tgtacatata 15421 taccacaatt cctttatcca ttcatccatt ggtaaacaat tttgttgttg ccgcatcttg 15481 gtcattgaga ataatactgc agtgaacatg ggggcataaa tatctacagg aggtgatgat 15541 ttcgtttcct tatgcccaga caagggattg ctgggtcata tggtagtcca ttttcaattt 15601 tttgagaaag ctcatactgt attctagaat ggctgtacca atttgcattt ccaccaatag 15661 tgtagaagcg ttccattttc tctacactct tgccaacatt catctccagt gggtttttgt 15721 ttggttgttt ttgttttatt tttttataat agccttccta acttgtgtga ggtgatatct 15781 cgctgtggtt ttgatttgca tttccctgat ggttagtgat gttgagcaca tctttgtata 15841 gctattggac atttttatgt cttctttgag aaatgtcata caagtagtga attttcttga 15901 gtactgatta tgttggaaat gtgattgaat actgtattaa aaattgttga gatcaaaaaa 15961 tttttttaaa aaaggttgtc tattcttggt ttgcctaatt ccttttagtc tttcttctta 16021 taaaattaag agttacatat ctggatttgc ttagattatt aaagaatgta ggagagatta 16081 aatcctaatt tattggactt catactcact ctcttgttta ttccttaatc attaattgag 16141 catcagctct ttggaagtct tcatgttagt actgggaatg ttttccccaa aacaaatcaa 16201 tttcttgccc atttaatttt agagtctggg agcagctgtc ataaaaaaaa aaaagatgat 16261 gaggtgctct ctaaaactta aagaacaagt acaaaggagg aatgagaaaa agaggaggtt 16321 tgagatgaga gcaatcaagt gtaaatgccc tgaggcaagg cagtttggag tgttaggaaa 16381 cctcaagtcc cccattggga ggtaatttta agggaaacta catggtgggc tggatgagac 16441 tgtgggaggg gccaggccct cagatggtgc ctctcagagg tgtgagacaa agcctggaat 16501 gggaagtagt tcttaacagt tattttgtgc tcacggataa accagagaga atctgcacct 16561 ggggcaggaa tggaatgtgc cctgtgctct tgtcccagtg cagcagaaca tagtcacagt 16621 gcacaaacta ggtgctttat acagttggtg cattcagtgt tgagagacta agcccggaat 16681 gggaaactat ccctaacatt atcactttcc tgttgttgga aaaccagaga gaacccctac 16741 ctaggacaaa agtgaaaagt gttctatgtc cctatcctag cacagtctaa tacagtgcac 16801 aacctaggtg ttctatgtac atcatctcta gtgagtttct gagaaataag ggagatgaca 16861 gcttcagggg aggtaagatg cccagaagcc accgtgctgg cactctttgt cctgggttgg 16921 agaatcaaga gcccgctcta ttaaaaggac tttcaacagg ggtgcagcca ggcatggtgg 16981 ttcatgcctg taatcccagc actttgggag gcgaggcagg tggatctctt gagatcagga 17041 gttcaagatt agtctggcca acgtggtgaa accccgtctc taccaaaaat acaaaaatta 17101 gttgggcgtg gtggtggaca tctgtaatcc cagctactag ggaggctgag gcacgagaat 17161 cacttgagcc ctgaggcaca ggttgcaatg agaagagata gtgcctctgc actccagcct 17221 gggtgacaga gtgagactcc atctcaaaaa aaataaataa ataaataata aaaatcaaaa 17281 caggagtggt taacatagaa gggtgcctag gagttagaaa aaaatattgg tcattgacaa 17341 aaattttgag acttgagttg tatccaattg gagaaggctc ttccacaaac actttataaa 17401 ttatataatt ttccttgcta agcagcattt tgtttgatta taatttcttt gtttggagtt 17461 tggtaatatg ctcaagttca cagggtgaga agtgacaggg ctgggactag actcctgttg 17521 tgaattcttc tagagcccac actcttcatt ctgttcctct ctgcctggag cagactttgt 17581 gttctttaat tcacttttct tctcaatact tccaaatata tctcaggggt taaaaagaag 17641 gattctgtgc ccaaagaatt ggattggaat acatagccaa agcaataaag aatatcaaaa 17701 taaatgaaag ctacaaataa agaggagaaa gagatattat tcagtgacaa tatgatcatc 17761 tacatatgca caagtcagat aacctcaaaa tcagggacca cacaatcatc ttgaacagct 17821 ccttcccctt aagaagtaaa tcaattaaaa gagaagtcac ataagaggct agtgctggct 17881 attaaaagag taaatcagca tcatttttct ctgacaaccc accagctacc ttggactccc 17941 tgctttgtgt tataatttct ttacctcaca ttagccctgg ggcacagact tgcaaggttt 18001 gcagtaggaa cgctgcccaa gagtttgcag ggatgtggca tttcccagga agcctttatt 18061 cagagaggca agagaaaata aggaggacac ccataaataa atactggaaa gacaaccaca 18121 ccaaaagata acggggcaat ctacccaaac ctcttctcag cctagaatgc cccaggcagt 18181 gcagtcagac tgagcatcct ctcttttacc cttcagagtt ctcagtgaag tgagagtttt 18241 gacctaaggt gacctcagtt aactagaggg aggatccagg gtctggcaga tatcagggag 18301 accttgaatg aagaacagga ggagccagaa cagtgatgtc catagaaacc cgactccgct 18361 gtctgccctc acaggaccca aacatccctg gcctaatgtg gctcatcaaa acttcagcct 18421 agatctcgga ggaccttgat ctgagcagga ttcatggcat ggaatgtgga gtccaggact 18481 ggccagcagc gaagtgaggt tcttgagtga gtaaaaaggg aaaactaaac ccacaaataa 18541 ggggacttct cggagcccaa tccatatttt taaccctgga aagccctagg cagagctata 18601 aaactggctt gctctcattt cagcctggcg atctcaggga ggggaaggct ttgtctaaca 18661 gggcagcctc agttcttcag aggtctcttg gccctaacta gagtcaagat gaagaccaag 18721 ccggtgcgca tctcaggcct gtaatcccag cacactggga ggccgaggcg agtgaactgt 18781 ttgggccccc caggagttcg aaaccagccg ggacaacatg gcgaaaacct gtctctacaa 18841 aaaaaaaaaa atggcgggca gcagtggagc accctgtagt ccagctacca ggaggctgag 18901 gtgggaggat cgcttaagcc ctaggggtca aggctgcagt gagccaaggt cacgccactg 18961 cgctccagcc tgggtgacag agagaaacac tgtctcaaat tagccgggcg tggtgcggcg 19021 cacgtgtagt cccagctatt cgggaggcgg aggcaggaga atcgcttgaa cccgggaggc 19081 agaggttgca gtgagccaag atagcgccac tgcactcccg cctgggtgac agagcgagac 19141 tccatctcaa aataaataaa taaataaata taaacttttt ttttttaatt ttaaggcagg 19201 gctctggctc acgcttgtaa tcccaacatt ttgggaggac gaagccagcc tcatcgctta 19261 agcacaggag ttcgaggact gggaggtgga ggttgtggta agtcaagatt gcgcggctac 19321 actccagcct ggaagacaga gagagcccct gtcacaaaaa aaaaaaaaaa aaaaaaaaaa 19381 aagaaagaaa gaaagaaaga aatatgaggc ccaccggtgc taaccaggga acctctctcc 19441 aaaagaaggg cccaccaaaa gccctaaccc tgtttcaggt ctttgaagcc ccaggacatg 19501 gtccgataac atgcctagac ttcccctcta gggactatgg gaggggagga ttttggaggt 19561 tggcggactt cgctcagtag aggtgttact ctgctccgct tagtatcaag gtgagaaccc 19621 tgaataagga cctagggacc actgactcca gaacagtggg gtcccagcgt gtcacccgct 19681 gctgtcagcc ctcggagacc ccgagcgggg tgtggctcag cctcacttcc gctttgaaag 19741 tgaggcagtt ggctgacggg tgcaatagct tcagtcaggt tcgtggccta gcgtgagtct 19801 tagaactgat ccggagtaaa ggtaagaacc ctcagcgggg actgaaggga caatccatgt 19861 ttttaaccct ggaaagccct aggcagagct ataaaactgg cttgcctctc atttcagctg 19921 gcgggtctca gggaggggaa ggctttctct aacagggcag cctcagttct tcagaggtct 19981 cttggcccta actagagtca agatgaagac caagccggta cggcgtctca ggcctgtaat 20041 cccagcacac tgggaggccg aggcgagtga actgtttggg cccccccagg agttcgaaac 20101 cagccgggac aacatggcga aaacctgtct ctacaaaaaa aaaaaaaaaa tggcggggca 20161 gcagtggagc acccctgtag tcccagctac ccaggaggct gaggtgggag gatcgcttaa 20221 gccctagggg tcaaggctgc agtgagccaa ggtcacgcca ctgcgctcca gcctgggtga 20281 cagagagaaa cactgtctca aattagccgg gcgtggtgcg gcgcacgtgt agtcccagct 20341 attcgggagg cggaggcagg agaatcgctt gaacccggga ggcagaggtt gcagtgagcc 20401 aagattgcgc cactgcactc ccgcctgggt gacagagcga gactccatct caaaataagt 20461 aaataaataa ataaatataa acttttttta ttttttattt taaggcaggg ctgtggctca 20521 cgcttgtaat cccaacattt tgggcggagg aagccagcct catcgcttaa gcacaggagt 20581 tcgcggactg ggaggtgagc tgcccctgta gtcccagcta cccaggaggc tgaggtggga 20641 ggatcgctta agccctagga gtcaaggctg cagtgagcca aggtcacgcc actgcgctcc 20701 agcctgggcg acagagagag acactgtctc aaattagccg ggcgtggtgg ggcacaggtg 20761 tagtcccagc tattcgggag gtggaggcag gagaatccct tgaacccggg aggcagaggt 20821 tgcagtgagc caagatcgcg ccactgcact cccgcctggg tgacagagcg agactccatc 20881 tcaaaataaa taaataaata aataaataga tataaacttt tttttttttt taattttaag 20941 gcagggctct ggctcacgct tgtaatccca acattttggg aggacgaagc cagcctcatc 21001 gcttaagcac aggagttcga ggactgggag gtggaggttg tggtaagtca agattgcgcg 21061 gctacactcc agcctggaag acagagagag cccctgtcac aaaaaaaaaa aaaaaaaaaa 21121 aaaagaaaga aagaaagaaa tatgaggccc accggtgcta accagggaac ctctctccaa 21181 aagaagggcc caccaaaagc cctaaccctg tttcaggtct ttgaagcccc aggacatggt 21241 ccgataacat gcctagactt cccctctagg gactatggga ggggaggatt ttggaggttg 21301 gcggacttcg ctcagtagag gtgttactct gctccgctta gtaacaaggt gagaaccctg 21361 aataaggacc tagggaccac tgactccaga acagtggggt cccagcgtgt caccccctgc 21421 tgtcagccct cggagaccca gagcgggacg tgtctcaccc tcacaactcc ctgcccctta 21481 caaaagggga cccacacgtc tcacccttgt ggttgaccct gggaggtcct gtttgagttc 21541 tgtctggaaa ggcacccaga gggagggtct ttcactaagg gagcagcccc cagttattca 21601 gttggcgggg acctggaccc taactggagc caaggtgaag tctccgagtg ctaaaggatg 21661 ggatctgttc ccagtagggg cgtcaacaga aagcagcagc agccctactg gttagcatcc 21721 tcctggttag caccagtggt ccttgtcttt tttatttatt tttatttttt ttcagacaag 21781 gtctcactgc tgctagccag tttagaaggc tccagacaag tgtagtcata agcggaggcc 21841 ctgacttccc catccaggga gggaagtgtt cagggagatg agggttttgt ttggaggttg 21901 gcgaactcag gtcagtagag gaagaaattt caggctgtga taagatacca aggtgaagac 21961 ccctgaaatg agaatctagg gaccagcaac tccaaaacag tgaagtctca tagagttcca 22021 cccgtgttgt cagccatcag accccaggaa gctgtgaaca gataaggctc ttcctcactt 22081 ccttggaagt gctttgaagg ggaggatctg gaggcgaggg gcacgggatc tcttcggcag 22141 agggtgaatt ccttggactg tctggagtca aggtcaggac cctgaatgtg catgaaaggg 22201 accaccatcc cccaacctgt aacaaagagg gccacactaa atcctgcccc ggaagtcttc 22261 cctgggaata ccttgaaagc tgtctgacag atatctacct ggaaagtctc agggagggaa 22321 gggccttggt ctaagaaagt agccccagtt cagcaggtgg aagagagact tgggtcctaa 22381 ctggagtcaa ggtgagggcc ctgagtgcta atgaagcaat ctctctgcaa tagaggtgtc 22441 aacacaaaca gtgtccttgc actcattttg caacctccag gcaaaggtat tcataggtga 22501 ggggtccctg acttccctgt ctagggtctt gttctgaggc tggaggactt aggttaatgg 22561 agggaagtgt cccacatcct actaagagtc aagtcagaga cctgagagaa aactaaaggg 22621 aacactctct ctaaaaagta gagtcccaca aattgtgcct cttctctcag ctccaggaag 22681 ctttggaata acgtcagccc cccttactgc ctagagagtg ccagaaaggt tgaactaatt 22741 acatgcccca gttctgcaga gggaagagtg aggagaccca gagccttaca ggggtcacag 22801 tgaggaccct gaatgaagac tagtgataca actcctccca cataaagaag agacaacaaa 22861 gattccccca ctccccactt gcagcgctga tgaggctggg catggaggtc aggtgtatgt 22921 gaactcacct tcttttaatg gcattaggtt gctgagagcc tttatctaaa gtgaggggca 22981 tcagagcagc agaggcagct gtcctaggtc ctatctagag acaagatgag gactatgaat 23041 gaggtgtcag gactccaata agcccaggaa agagtaggac tccacaatgc tgttagcacc 23101 cagttcctcc tgtcaggggt ggtaaggctg agacattcct tcacctcctc ttaggtggtc 23161 ccaaggagat gaggacattt gctggaggtg tcaaatttag tgcagcacag gggagaagac 23221 ccagccctga cagtttttat gatggtcctg agtgtgaact gagagaaacc tcctacccca 23281 gagtaaaagg agatccacaa ggactagaca tgccacggct gctctcagtc ccagtataca 23341 cagaacaggg ctggcaggct gtggcctaag gcacacttta attactttca cagggttctc 23401 agaggacaaa ctgatcagaa cagaagcctc tgggtttcca gagcagtgct ctcacagaaa 23461 actgcagagg cgaccttctt ttaaatccaa agtggtacct ctctgctgaa ggcactcata 23521 ccctctcttt ctctctctcc tccaggtgcc tgtatcacct gcccttctgc tgacactcct 23581 gcctgctgtt cctgactaca gccatcatgc ctcggggtca gaagagtacg ctccatgcac 23641 gtgagaaacg ccagcagacc cggggtcaga cccaggatca ccagggtgct cagatcactg 23701 caactaacaa gaaaaaagta tccttttcat cccctcttat tttgggggct actatccaga 23761 aaaagtctgc tggtaggtca cgtagtgctc tcaagaagcc tcagagagca ctatccacca 23821 ctacatctgt agatgtttct tacaaaaagt catacaaggg agccaacagc aaaattgaga 23881 aaaagcaaag cttctctcag ggtctatcct ccactgtgca gtctcacaca gaccctctaa 23941 ccatgaagac aaatatgttg gtgcagttcc tgatggaaat gtacaagatg aaaaagccca 24001 ttatgaaagc agatatgcta aaaattgtcc aaaaaagcca taagaattgc ttccctgaga 24061 tccttaaaaa agcttctttc aacatggagg tggtgtttgg tgttgattta aagaaagttg 24121 attctaccaa ggactcctat gtccttgtca gcaaaatgga tctccccaac aatgggacag 24181 tgactcgtgg gaggggattt cccaagacag gtctcctgct gaatctcctg ggcgtgatct 24241 tcatgaaggg caactgtgcc actgaggaga agatctggga attcctgaat aagatgagaa 24301 tatatgatgg gaagaaacac ttcatatttg gggagcccag aaagctcatc acccaagatt 24361 tggtgaagct taaatacctg gagtaccgac aagtgcccaa cagtaatcct gcacgctatg 24421 aattcctgtg gggtccaaga gcccatgctg aaaccagcaa gatgaaggtc ctggagtttt 24481 gggccaaggt caataaaact gtccccagtg cgttccagtt ctggtatgaa gaggctttga 24541 gagatgagga agaaagagtc caagctgcag ctatgctcaa tgatggcagt agtgccatgg 24601 gcagaaagtg ttccaaggcc aaggctagca gctcttccca cgcctagtga agttgaagca 24661 aattttgcat tttgtggtta aagagggcag tcactgttcc aaggagtgaa ggactgggtg 24721 ttactggagg gaacacactg tataatacct tttgtttctg ttctaaatgg ataatttgaa 24781 gttttatctg tattttgggg catatttttc aaatgttcct tttatttaac attgtaatct 24841 aagtttagga ttgatactgg tcacatttgt tgtttaagag taaaaatttt gctgttttgt 24901 aaaacagatt gagaaaaatt cgatcttatt tagtgatctg ttgcaagata acttggaatt 24961 agaataagca tttccttgaa aatgtttaaa aaaaaaaagt cagcagtaaa atgtatggca 25021 ttaagaaata gagaaagagt gtaagatggt caatatttgg tttcctaaat gcttttactc 25081 tgtgttttaa gaaaatgaaa gataaataac catatatgtc tggcttactt aagaatgtag 25141 aattaaatca taataaatta gacctcatgc tgacacactc attctccaag tgttaattga 25201 gcatctgctc ttggaaggat ccatgctaat actgggaggg ctaagaaaaa gaagacctag 25261 caactgacct taaaattata aggtcgagaa gcagctatca tctaaggaaa atggtgatat 25321 acactctaag accaaaagga tacatgataa gaagggaggg aggtggttcc acatgagaac 25381 agtcaagtat gaattatcta atcaaggcgg tgttgggcct aaggaaagtg caggtccctc 25441 aatgggagtt aatctaagtg aggctccatg gtgggctgga tgaggctgtg ggaatggcga 25501 gcaggggcca ggctcttaga agttgcctct cacgcagtgc gggcgcctgt agtcccagct 25561 actcgggagg ctgaggcagg agaatggcgt gaacctggga ggcagagctt gcagtgagcc 25621 gagattgtgc cactgcagtc cggcctgggc taaagagcgg gattccgtct caaaagaaaa 25681 aaaaaaagaa gttgcctctc agaggtgcga gacaaagctt ggaataggaa acagttctta 25741 acagttactt tgggatcatg gataaatcag ggagaatctg gggtaggaat gtgtcctgtg 25801 ctcttgtccc agtacaaata aacacagcac acactaggtg ttttgtgcac atcatctcca 25861 gccagtttct gagaaataag ggtaataccc tcaagggacg ggggaggccc agaagccact 25921 gtgctggtac tctttgccct gggctggggg atccagagcc tgttccatta aaagcgcatt 25981 caattaggtt acctcatttg tgatttggca gactctaggc aagttctaga ttttgagagg 26041 tggttaaatg aatcaaaata aaagtggttt agatggaagg atagctgaga gagagggaag 26101 tttttggtcc ttgaaactca ttttagaatc tgtgttgcat ctactgagga agtctacctc 26161 atactaatat tgaatgatct atcaagtaat gaggaaatat tatttagttt tcctttctag 26221 gtagcacttt gtctgatagg gctttctttg tcttagaata tggtataatc tctgacatct 26281 cctgcattga aaaataagtc ccagtgtagt agattgcaac gtcagggtac aaattaatca 26341 taataagaac tggcactatt atatgtctca atacatctca agcactttac atacattgta 26401 tacattttac cagtcctaca aaacagagct tatcaaaccc acttcacaga taaaaagcct 26461 gaagcccaca gtctattaat atgtccaatt tcacatggct ggtaagtgac aggactggga 26521 caagaccccc agtctgaatt agtgtaaagc acactctgtt ccatgcctcc caacctgaga 26581 ctgactgtgt ccccttattc acttttcttc tcactactcc caaacgtata tcaagtgatg 26641 aaaagattct gtacccaatc tgctggactg aaatacagaa ccagagcaag gaaattacca 26701 aaatggtcaa gaggtgtgaa ttaaggagag aagataatat ttggtgacaa taagattaca 26761 catgcaaagg cagataaact caaaagtcat ggggtcacat tctggcaatc tggactggcc 26821 ccccttttat aggaaggtaa atcctttcca agagaaatac ataagagggt agtaaagaga 26881 agaaatagga tcaatggttt tttttttttt ttttttttct ctgacagtat gacttctatc 26941 ctcggcttct gccttgtgtt acagcttccc tatcttatat cactcctggg acagggatgc 27001 attgtctgca agatttatta tagggaaact gatcaagaat ttttagggat gcggcatttc 27061 tcaggaagtc attaactaag ggacaagagc caagataaag accttcatat aggaggactc 27121 agcagaacaa catgccagac agagttctta gcctgagcat ctcctcactt attcttcaag 27181 gttctcaggg acttgtgagt tccgcatctc aggtcagtag agggaggagt gccattctct 27241 gacacatacc aagttgagga ccctgaatga agaatgaggg aagcactcac ccaaatagcc 27301 ttaacaggat gtggctgaac ctgatttcta ctccggaggt cttaagaagg taaagaccat 27361 gatctgaggc tggctgactc aggtccatag aggaaggatt cctaatgtgt gccaggagta 27421 aagtgagtac cctgaagagt gcaggaacca ccaaacccct atcagtggga tcccacagaa 27481 tcctccctat tgtcacctct gagagattta ggcatgaatg tcagaaagag gcaccctcat 27541 atcctcagta gtaacagaga agtgaaggtc ttgttctgag gtgggcaatc ccagctcagc 27601 aaagggagga atattaagcc ttcctgtgag tgagatttag gatcctgagt taggacctga 27661 gtaggacaat ccaccataag acctcaacct gtctaccaca gctgccactc ttgggagacc 27721 atggtcagct gtgccagata aggccaccct cttcctccta tcagatctca gggagtgcag 27781 cctttttaaa tggagagtct tcagtcaaaa aagaggggag ccctaggatc tgctaggagc 27841 caaatcaagg aaaccgagtg agaactgagg ggactgctca cctctgaata gagagagcaa 27901 cagagtccac ccccctgcat aactcttaaa gtgccagggc aacatgtaag gctgaggact 27961 caccatcact tttttttttt taataattga ccttcttttt tagaatagtt ttacgttcac 28021 agcaaaactg agcagaaagg agagtctcac aggcatcctg tcctcccaca gacacagcct 28081 ccttcatatc aacctcccaa gccagggtga tacatttgtt acaattgaga atacatggac 28141 acatcattat cacccaatgt ccgtagttta ggattcattc ctgttgctgc acattctgtg 28201 gttttaatat ttaaaaatgt ataatgacat atgttaccac catagaatca tccagaaatt 28261 gttttactgc cttaaaagtc tctgtgcctg atctatttat ctctcactct ccccaacctg 28321 tggaaaccac tgaactttgt actgtttcca tacttttgcc ttttccagaa tgcatataga 28381 tggaattatg caatgtatag acttttcaaa ttggcctctt tcactttgta atatgtatct 28441 aagtttcatc catgtctctt tgtggcttga tagttcattc ccttttagtg ctgaataata 28501 ttttattctg ttaatttatc cactctcctt ctgaaggaca tattgcttgc ttcccagttt 28561 gggcaattat gaataaagct actataaaca tttgtgtgca ggtttttttt gtggacatac 28621 cgttttcaac tcatctggct aaataacaag gagcgcactt gctagatcat atgatgaaag 28681 tatgcatagc tttgtaagaa acttacaaac tgtcttcaag agtggctgta tcatttttca 28741 tgtccaccag caatgacagt gtgctcctgt tgctccacat cctcatcagt atttggggtt 28801 atcagggttt ttgattttga ccattcttat aggtatgtag tggtatcaca ttattgtttt 28861 aatttgcaag tctctaatga tatgtgatgt tgaccatcat gtcatgtacg tatttgcctt 28921 ccatatatat tatttgttga ggtgcctgtt tagatctttt gtccattttt aatagggttg 28981 gtcagttctt attgttcagt tttaagacat ttttgtatgt tttggttaac agtcccttat 29041 cagatatgtc ttttgaaaat aattttttcc caacctggga gttatcttat tctctttgtg 29101 gtatctttag cagagcagaa gttttaattt agcgaagtcc agattatcaa ttattttctt 29161 tcatagatgc ccatcacatt ttataccaga actagaaaag atgaaattct tggtctgaag 29221 ttgcagttat cagtcagcag aagagacagt ccacaaccct gcttggagtc cagatgagga 29281 tcctgagtgc aaacttggga cctaaagagc ccaggacaga gagagcacta aatgcttcta 29341 ggcaggggtg gtgggttgag gggcccctag acttccctca tctgggtccc agaaaactaa 29401 agagtcaatt tcacaacacc aatagaggga ggctcaggcc ctgccaagag ctgacatgat 29461 aattctaaag gtaatcagag tggatcctct ccaagccaga acacagaaag ccccactgcg 29521 agccttgttg tcacccagtc agccccaggc agggttggca agctgcagcc taaggcacat 29581 tgtaacttcc tcagctggct tctcagggga cagaatgact aagaacaata gcccagtgaa 29641 tacttagagc agtgttctca aggaatcctg cagaggcggc ttctgaaaag ccaaggtagt 29701 atctgcctgc tgaaggtgtt ctcaggattt catttgctct tctccaggaa ccacatcacc 29761 tgcccttctg cctacactcc tgcctgctgt gcctaaccac agccatcatg cctcggggtc 29821 agaagagtaa gctccgtgcc cgtgagaaac gccagcggac ccgtggtcag acccaggatc 29881 tcaaggttgg tcagcctact gcagcagaga aagaagagtc tccttcctct tcctcatctg 29941 ttttgaggga tactgcctcc agctcccttg cttttggcat tccccaggag cctcagagag 30001 agccacccac cacctctgct gctgcagcta tgtcatgcac tggatctgat aaaggcgacg 30061 agagccaaga tgaggaaaat gcaagttcct cccaggcctc aacatccact gagagatcac 30121 tcaaagattc tctaaccagg aagacgaaga tgttagtgca gttcctgctg tacaagtata 30181 aaatgaaaga gcccactaca aaggcagaaa tgctgaagat catcagcaaa aagtacaagg 30241 agcacttccc tgagatcttc aggaaagtct ctcagcgcac ggagctggtc tttggccttg 30301 ccttgaagga ggtcaacccc accactcact cctacatcct cgtcagcatg ctaggcccca 30361 acgatggaaa ccagagcagt gcctggaccc ttccaaggaa tgggcttctg atgcctctac 30421 tgagtgtgat cttcttaaat ggcaactgtg cccgtgaaga ggaaatctgg gaattcctga 30481 atatgctggg gatctatgat ggaaagaggc accttatctt tggggaaccc cgaaagctca 30541 tcacccaaga tctggtgcag gaaaaatatc tggaatacca gcaggtgccc aacagtgatc 30601 ccccacgcta tcaattcctg tggggtccaa gagctcatgc agaaaccagc aagatgaaag 30661 tcctggagtt tttggccaag gtgaatgaca ccacccccaa taacttccca ctcctttatg 30721 aagaggcttt gagagatgaa gaagagagag ctggagcccg gcccagagtt gcagccaggc 30781 gtggcactac agccatgact agtgcgtatt ccagggccac atccagtagc tcttcccaac 30841 ccatgtgaga tctaaggcaa attgttcact ttgtggttga aagacctgct gctttctctg 30901 ttcctgtgat gcatgaataa ctcattgatt tatctctttg ttgtattttc catgatgttt 30961 cttaaaatag aaagtttatt tagattcaga atataaattt agaaatggca tgcatcacac 31021 atttattgct gtttatcagg ttggtttagt gataataatt ttgtttttga aatacaaata 31081 gaaaatcctg aaataatttt tgtgatacag agcaaaataa cacggcatgg gagtaaggtt 31141 atccttagaa atttaaaata actccacagt aaaataggta gaatctgaag atagaaaggg 31201 aagaaaagta aaagttgctt tattcgtggt ttgtcttact cagttcagtc tttttttgct 31261 cataaattta aaagttacat acctggtttg cttagattat tcaagaatgt ggaggcctgg 31321 gccaaggtca atgacagtgt ctccattgtc ttccctccat taagagaaga ctttaagaga 31381 tgagggagag agagccagag acagtgttgc aactgggcct ggcatgtttc agtgtggtgt 31441 ccagcagtgt ctcccactcc ttgtgaagtc tgaggtatat tctttacttt tgattaagaa 31501 aacacttaac cttctaatta atggagagcc aaaggggagt tggtgggaac accatgtata 31561 acatatttgt atgtaaaatg atttatcttt tctttttcct gtttttcagt gttctttttt 31621 taaattgtag atttatttag tttcagaatc taagtttatg aatggcatga atcactcatt 31681 tattaaaata tatcaggttg gagagtgaga atttttgcat tatgtaaaac aatttaaaaa 31741 tcttttaagt ctttttctgt gatctagaac aagataatat ggcattggaa tatggaattt 31801 gtgaaaagga aattaccttg caataaagtt ggtgggacca ggaagtagag aaaaaaaaag 31861 taaaatgtgg tcaattcttg ctttgtttta ttctttttag tctttctttg taaaactgaa 31921 gtatatgtac ccggatctgc ttagcttttt caggaatgtg ggggaaatta aatagcaata 31981 catttgactt cctggtcact tacacttcaa ttgtccaaat attaattgag cagctgtact 32041 ttggagggct cctggctagt accggataag ctaagaaaac aaaaaacaaa caaacaacaa 32101 aaaaacccca atccctgccc acagaatttt agaatccaac agcagctatc atataacgaa 32161 ggtaatgaga tactccttaa gacctaaaga caggtgaaaa ggggataaaa aagaaggtgg 32221 gggtattacc agcagtgaat ctagtgtaaa tgtcctaagc aaggtaactg agacttcggg 32281 aaactgaaca tactgcaata ggaggtaatt ttatgtcctg gcatagtgcc tatatggggc 32341 gcactgagca ggggtcacac actcagatgg tggatctcac acttgagaga tggagactgg 32401 aatgaaaaat tgcctttaac agttactttg ggattgtgga aaaaccagag gggatctttg 32461 cctggagcag ggattgaaag tggccggtgc tcttgtccca gtgcaagtga agtgaatcca 32521 gtacacaact aggtgtttat gagcgtagtc cccagggtgt tctgagaaat aaaggtgata 32581 ctcctttaag actggaaact cataagccac tgggctaata ctctgctatc agctggggag 32641 ccagaaccca ttccattaaa aggaattttt tttcctttca tgtgtttttg caaattctaa 32701 gggagatcta gatttttgat ggtggtaaaa tgaatgaaaa taagactggt ttggatgaaa 32761 ggccagagtt ggtccttgaa ccaaagccta gaaggtttga gttgctttca gctgagaaag 32821 acaactccac actcaaaata tgatttaata tggaaaagtt atttagagtt tctttgtaag 32881 ctgcgttttg ggctgatttt attgttccat gtcttagaat gctgtatatt ctttgttatc 32941 tcttggactt aaagaaaatc agcagggtgg atgatttctt ctgtagttga aataatctta 33001 gtaataacaa gtattgttta aagtctcaat gcacaccagg gctttgcaca ctttcaccac 33061 attccaccag tcctacaaga caaggcttat caagctcatt tcacagatgt agagcctgag 33121 gctcatagag tttattaatt gtgcccaagt tcacacagct agtaagtgat agggctggaa 33181 ctagactgct ggtctgaatt cctctagagc ccacactgtt ccactcctcc caaactgaag 33241 cttactgtga attgcttaat tcattttctt ctacaacccc aaagtgtgtc tcaggtaaga 33301 aagaagaggt ctctgtaccc caaacactgg attggaatac ataaccagag caagaagaat 33361 accaaaataa ttgacagatg tgaattaaag agaaaaagaa ataatactcg gtgacaatat 33421 gatcaccgac agaagcaaaa ttggatagct tcaaaagatc aaaagattac aaaaatcatc 33481 tggctccacc cttttctgta gaaagtaaaa cttttagaac taccctttgg ttggtgaagt 33541 gaaggaatag atcagcatgt ttttttccct gagagcacaa catctgtcct aggctctcca 33601 ccttccgttt cagtttccca acatcacatc acttctggga cagggacctc ttctctgcaa 33661 ggtttgcaaa gaggaaactg gtcaaatttt tgcagggatg tgacatttcc ctggaagagt 33721 ttgatcaaga gacagaagcc aacatgagaa ccctcatgaa tgaatggtgg ggagaaaacc 33781 actccaaaat gtaagggttc atccagaccc tcaggagagt gacctcagtt cagtagatag 33841 aggagtccca ggttctgata catgtaaagg cgaggagcat aaatgaagac tgatggaagg 33901 acccagccaa aactgtgggg ttctatggag tcctttcctt attatctgtt tttgcaggtt 33961 gcaaaaggac tgtgatcata tgaagatcat ccaggagtac aactcgaaat tctcagaaaa 34021 caggaccttg atgtgagagg agcaggttca ggtaaacaaa gggtaagtta caggtttgct 34081 cacttgtcaa ggtgaggacc tgaatgtgga ctaagggtag ctagcaccca catggcctca 34141 caaagtcccc tctgcctgtc agccctagga agccttggcg agatggcagg ctgaattctg 34201 cctggaaagt ctcagggaga tgactggttc cgtctaattg gggcagcctc agttttacag 34261 agcgaagagg ccgagaccct aacaagaatc aatgtagggc ttttaagtgt taagaggggt 34321 atccaccagc agatgagtcc ccacagaatt caccccgttt tgaggcatca gacagagcta 34381 tctacctaag gtgcctctca tttccgcctg gaaggtctca tggaggatgg ggagcgtggg 34441 gcctgaaggg agtagcttca gttctgcctg gagaggaaac cagagtcacg gtgaggactc 34501 tgagagctga tgagaaggcc tctgcccaaa acgggacttt cacagagccc tgccgctgct 34561 gtcaggcctg tgaggccagg caggggtggc ctgtgtggca cgctcagatt tccaccttgg 34621 gggctgagag aggtggggct attgtttgag gctggcggat ttgggtcagc aggcggagtc 34681 gtcccagact gctagatact aaggtgagga cccctagtgg ggacgtaggg accagcgacg 34741 ctagaacagt tacgtccaga agcgtaccac ccctgccgtc agcccggagc cacgggctgc 34801 cggatgtggc tcatcctgac ttccgctttg aaggcgagga ccccagcgag cgtaagggcg 34861 cagtgtccgc ctggcggatt tgggtcagca ggcggaagtc gtcccaggct gctagatact 34921 aaggtgagga accctagtgg ggacgtaggg accagcgaca ctagaacagt gacgtcccgt 34981 agcgtcctgc ctctgccgtc agccctcaga ggccctgggc tgccggatgt ggctcatcct 35041 catttccctt ttgaaggcga ggacccgagc gagcttaagg agtggggtgc agcgtctggt 35101 cagccgaggg tgaattctca ggactggtcg ggagtcaagg tgaggaccct gagtgtaaat 35161 tgaagagacc acccccaccc gtaacaaaga ggtcccctct aagtcccgct tctgcatttg 35221 gtcctgggag gcctcaggta accagatggg tagcaccctg actgtctctt cagcgactca 35281 gggagacgaa ggctttggcc taagccttat agactcaggt caatagaggg aggagtccta 35341 aaccctacta cccgtaatcc cagaactctg ggaggccgag gcaggcggat cacgaggtca 35401 ggatatcaag accatcctgg ctaacacggt gaaaccccgt ctctactaaa aatagaaaaa 35461 attagccagg tgtggtggtg ggctgctgta gtcccagcta ctcaggaggc tgaggcagga 35521 gaatggcgtg aaccagggaa gcggacgtta cagtgagccg agattgcacc actgcactcc 35581 agcctgggtg acagagcgag actcagtctc aagaaaaaaa aaaaaaaaaa aaaaaaaaaa 35641 aaagtcccgc tcctgctgtc ggcacacgca ggccccagtc agccttggtg ggatgtggcc 35701 cactatgact gtgaacttag gtccaaggaa tatgagaact tttgtctacg gggcatggtg 35761 ttaggagcag ttgatgggtg gagtcccaga aagagtgctg agtggaggtg gagacactga 35821 gtgaggaaat ggcggctact ctatacacga gggagacaat tgcgcctgat gctgtccctg 35881 ggagtcccag gccagagctg gcaggctcaa gtctccctgg cttctgcctt catggtctta 35941 gagagaggag ggccttggtt taaggcctca ctgccccagt tcagtagaga gataggagtc 36001 atacgccaag acaggtgtca aggtacaatt tctttatgag gaatgaggta accgtctagt 36061 ccagcaacag gcaggaacac agtcttacct tggcgttcat tccttagaag ccagggttag 36121 actctcaagt tgagatgccc ctcatttcct tcaggtactg tttcaggtac ataatggctt 36181 tggtttaaca tgaaagccct tgtttagtac agggaggagg cccagttcct aaaagatggt 36241 gacactgagg gagtatgagg ggaccccctc caagaaagag gtaacacaga tagagcccta 36301 tccccactga gacctgggag atccaggtat ggtgacatga tgagtctcac tcacttcttc 36361 ctagagcatc tcaggaagtg ggaacttcat caaagggggc agccttatgt tagcagctag 36421 agattcctag taattccagg agtcaaaaga agaccctgag aactggggaa ccactcatcc 36481 cataacagtg aaagaaacac ggaattcctt cctgattttc atccttggga gatgatggac 36541 aattgtagcc agatgggaaa agtttcactt cttcctcagg gagttgtcgg ggggtgcttt 36601 gtatgactgg ataggcgtta gatcaagaca gagaagtcac tgagggagga atgagatgag 36661 catttaccta gagagtgggc ttcaccaagt tctaccccct ctcagctctg ggaaacccca 36721 ggcagaagta cccagatgtg tcatcccctc aagactagcc ctgggtactc agagaggtga 36781 aggcttttgt atgagcctgg aagaatcaag tgggattaga gaggagctca gactctgctt 36841 gagtgaagac taaagagaac aacgacccca gaacagtgga gccccataga ggacgtggtg 36901 actggatgtg attcaggctt tctttcatct tgggactatg aggcaatgag aatcttaatc 36961 tgatgtaagg ggcctcaggt cagtagagag aggatttcca ggttgtgcca ggcctcatag 37021 agaggacttg agggaactcc cacctcatca gtggggatcc ctcagagtcc ctctgtatgt 37081 cagcgctagg aagccccgag cataaatgtc agaaatgccc ctaaattcct cttcagaagt 37141 aacagggaaa tgaaggtctt agtcagatgg gttagcagga ggggtgggag gcattttagg 37201 ccctcccagg agtcaaactg gagaccttga gtgaggacca tgagtggggc tacacatcac 37261 agggcttcaa cctgccagcc acagctttca gcctctggag aatatgggca ttgtgaccag 37321 atacagccac cctcatttcc tccattgggt ctcaggcaga tgtaggcctt actatgagta 37381 aatgtcctca ggtgtagaga gaacagagcc ctaggccctt ctgggagtcg aagtgtgagt 37441 gtggactgaa ggcaccattc ccccaagccc ccaaccccca ccccaaatag aggaaaaaca 37501 acgatgctag ccctgtctct gcacttagct ctgaaaggcc ttggccaagg gttgccaggc 37561 tgagacttta tttctttgca tcaggtctaa gggaggtgac agctttggtc tgaagatgca 37621 gcaccagtta gcagaagaca ggttcccaga acttagatat agatgagatg aggactctga 37681 attaagattg aggtccaact agcccaggac agagagagtt ccatagaact gtcagcactg 37741 ccatcccgcc agcccccggt aaggatggta ggttgaagca gtgcctcatt tttctttgtg 37801 gattccaggg agctttgaag tgtcagcttc agagcagcac aggaaggagt cccagaccct 37861 tccaagagta gatatgaaga tcctgtatat gaattgagag gccttgaaca cagaggagtc 37921 tacactgcca acctctgctg tcacccagtc agcccaggca ggtttggcaa caagaaccag 37981 tggttcctag agcaatgccc tcaagaaaac cagcagaagt gctctctaaa agccaagttg 38041 tacctccctg ctgcaagtac tcacagatct cattctctct ccttcaggtg ccacatctcc 38101 tgcctttctg ctcactttcc tgcctgtttt gcctgaccac agccatcatg cctcggggtc 38161 agaagagtaa gctccgtgct cgtgagaaac gccgcaaggc gcgagaggag acccagggtc 38221 tcaaggttcg tcacgccact gcagcagaga aagaggagtg cccctcctcc tctcctgttt 38281 taggggatac tcccacaagc tcccctgctg ctggcattcc ccagaagcct cagggagctc 38341 cacccaccac cactgctgct gcagctgtgt catgtaccga atctgacgaa ggtgccaaat 38401 gccaaggtga ggaaaatgca agtttctccc aggccacaac atccactgag agctcagtca 38461 aagatcctgt agcctgggag gcaggaatgc tgatgcactt cattctacgt aagtataaaa 38521 tgagagagcc cattatgaag gcagatatgc tgaaggttgt tgatgaaaag tacaaggatc 38581 acttcactga gatcctcaat ggagcctctc gccgcttgga gctcgtcttt ggccttgatt 38641 tgaaggaaga caaccctagt agccacacct acaccctcgt cagtaagcta aacctcacca 38701 atgatggaaa cctgagcaat gattgggact ttcccaggaa tgggcttctg atgcctctcc 38761 tgggtgtgat cttcttaaag ggcaactctg ccaccgagga agagatctgg aaattcatga 38821 atgtgttggg agcctatgat ggagaggagc acttaatcta tggggaaccc cgtaagttca 38881 tcacccaaga tctggtgcag gaaaaatatc tgaagtacga gcaggtgccc aacagtgatc 38941 ccccacgcta tcaattccta tggggtccga gagcctatgc tgaaaccacc aagatgaaag 39001 tcctcgagtt tttggccaag atgaatggtg ccactccccg tgacttccca tcccattatg 39061 aagaggcttt gagagatgag gaagagagag cccaagtccg atccagtgtt agagccaggc 39121 gtcgcactac tgccacgact tttagagcgc gttctagagc cccattcagc aggtcctccc 39181 accccatgtg agaactcagg cagattgttc actttgtttt tgtggcaaga tgccaacctt 39241 ttgaagtagt gagcagccaa gatatggcta gagagatcat catatatatc tcctttgtgt 39301 tcctgttaaa cattagtatc tttcaagtgt ttttctttta atagaatgtt tatttagagt 39361 tgggatctat gtctatgagc gacatggatc acacatttat tggtgctgcc agctttaagc 39421 ataagagttt tgatattcta tatttttcaa atccttgaat cttttttggg ttgaagaaga 39481 agaaagcata gctttagaat agagattttc tcagaaatgt gtgaaagaac ctcacacaac 39541 ataattggag tcttaaaata gaggaagagt aagcaaagca tgtcaagttt ttgttttctg 39601 cattcagttt tgtttttgta aaatccaaag atacatacct ggttgttttt agccttttca 39661 agaatgcaga taaaataaat agtaataaat tatattactt gttcagtggc tcatttattc 39721 tcaccataaa ttgagcatct gctctttgta aggctctgtg atagtagtga ttgtactaag 39781 ttaaagaaga cccttcgcct gcacacagat ttttagtcta aggacagtta ttatttaaag 39841 aagatggtga gatacactct aacatgtaca gatttttttt tttacatata aacactcatt 39901 taaaaaaaaa agaagtgaga atggtgggag aaggttcaga caagagcagt caagtgttaa 39961 tttcctagcc aaggcacttc gtggtgtggg acaatgcaag tccctcgttg ggaggtcatt 40021 ttaagttagc tccatggtga actggatgag gttgtgataa tcataagaag gtgccaaacc 40081 ctcagatcat gagccttaga gttgagagat taagcctgga aagggaaact gcccttaaca 40141 gttactttgg gattgtgggt aaagcagaga gaacctgtgt ctggaggagg agtggaagaa 40201 tacagtgctc tcgtcccagg gcagtcaaac acagtgcaca aactagttgt tttatgcaca 40261 ttgtctccag aaagtgtttg agaaataagg gttatacttc cttgaggtga gatgccaaga 40321 agccactaag ctaacactgt ttccctaagc tt // LOCUS HSU93236 2772 bp mRNA PRI 20-APR-1997 DEFINITION Human menin (MEN1) mRNA, complete cds. ACCESSION U93236 NID g1945386 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2772) AUTHORS Chandrasekharappa,S.C., Guru,S.C., Manickam,P., Olufemi,S.-E., Collins,F.S., Emmert-Buck,M.R., Debelenko,L.V., Zhuang,Z., Lubensky,I.A., Liotta,L.A., Crabtree,J.S., Wang,Y., Roe,B.A., Weisemann,J., Boguski,M.S., Agarwal,S.K., Kester,M., Kim,Y.S., Heppner,C., Dong,Q., Spiegel,A.M., Burns,A.L. and Marx,S.J. TITLE Positional cloning of the gene for multiple endocrine neoplasia-type 1 JOURNAL Science 276 (5311), 404-407 (1997) MEDLINE 97258940 REFERENCE 2 (bases 1 to 2772) AUTHORS Collins,F.S. TITLE Direct Submission JOURNAL Submitted (13-MAR-1997) National Human Genome Research Institute, Bldg 38A, Room 605, National Institutes of Health, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..2772 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="11" /map="11q13" exon 1..87 /gene="MEN1" /number=1 gene 1..2772 /gene="MEN1" exon 88..555 /gene="MEN1" /number=2 CDS 111..1943 /gene="MEN1" /codon_start=1 /product="menin" /db_xref="PID:g1945387" /translation="MGLKAAQKTLFPLRSIDDVVRLFAAELGREEPDLVLLSLVLGFV EHFLAVNRVIPTNVPELTFQPSPAPDPPGGLTYFPVADLSIIAALYARFTAQIRGAVD LSLYPREGGVSSRELVKKVSDVIWNSLSRSYFKDRAHIQSLFSFITGTKLDSSGVAFA VVGACQALGLRDVHLALSEDHAWVVFGPNGEQTAEVTWHGKGNEDRRGQTVNAGVAER SWLYLKGSYMRCDRKMEVAFMVCAINPSIDLHTDSLELLQLQQKLLWLLYDLGHLERY PMALGNLADLEELEPTPGRPDPLTLYHKGIASAKTYYRDEHIYPYMYLAGYHCRNRNV REALQAWADTATVIQDYNYCREDEEIYKEFFEVANDVIPNLLKEAASLLEAGEERPGE QSQGTQSQGSALQDPECFAHLLRFYDGICKWEEGSPTPVLHVGWATFLVQSLGRFEGQ VRQKVRIVSREAEAAEAEEPWGEEAREGRRRGPRRESKPEEPPPPKKPALDKGLGTGQ GAVSGPPRKPPGTVAGTARGPEGGSTAQVPAPAASPPPEGPVLTFQSEKMKGMKELLV ATKINSSAIKLQLTAQSQVQMKKQKVSTPSDYTLSFLKRQRKGL" mutation 175 /gene="MEN1" /standard_name="L22R" /note="Lys22Arg" /replace="g" mutation 357..360 /gene="MEN1" /standard_name="357del4" /note="4 bp deletion in kindred 0176" /replace="" mutation 416 /gene="MEN1" /standard_name="416delC" /note="1 bp deletion" /replace="" mutation 464..466 /gene="MEN1" /standard_name="K119del" /note="3 bp deletion" /replace="" mutation 512 /gene="MEN1" /standard_name="512delC" /note="1 bp deletion" /replace="" exon 556..764 /gene="MEN1" /number=3 mutation 704 /gene="MEN1" /standard_name="W198X" /note="Trp198stop" /replace="a" mutation 735..738 /gene="MEN1" /standard_name="735del4" /note="4 bp deletion" /replace="" exon 765..893 /gene="MEN1" /number=4 exon 894..934 /gene="MEN1" /number=5 exon 935..1022 /gene="MEN1" /number=6 exon 1023..1159 /gene="MEN1" /number=7 mutation 1132 /gene="MEN1" /standard_name="1132delG" /note="1 bp deletion" /replace="" exon 1160..1295 /gene="MEN1" /number=8 mutation 1197..1199 /gene="MEN1" /standard_name="E363del" /note="3 bp deletion" /replace="" exon 1296..1460 /gene="MEN1" /number=9 mutation 1416 /gene="MEN1" /standard_name="W436R" /note="Trp436Arg" /replace="c" mutation 1417 /gene="MEN1" /standard_name="W436X" /note="Trp436stop" /replace="a" exon 1461..2772 /gene="MEN1" /number=10 mutation 1689 /gene="MEN1" /standard_name="R527X" /note="Arg527stop" /replace="t" BASE COUNT 577 a 909 c 779 g 507 t ORIGIN 1 ggtgtccgga gccgcggacc tagagatccc agaagccaca gcgcagcggc ccggcccgcc 61 actatttcca ggctctgcgg ggcaggggcc gccgcccacc gcccgccgcc atggggctga 121 aggccgccca gaagacgctg ttcccgctgc gctccatcga cgacgtggtg cgcctgtttg 181 ctgccgagct gggccgagag gagccggacc tggtgctcct ttccttggtg ctgggcttcg 241 tggagcattt tctggctgtc aaccgcgtca tccctaccaa cgttcccgag ctcaccttcc 301 agcccagccc cgcccccgac ccgcctggcg gcctcaccta ctttcccgtg gccgacctgt 361 ctatcatcgc cgccctctat gcccgcttca ccgcccagat ccgaggcgcc gtcgacctgt 421 ccctctatcc tcgagaaggg ggtgtctcca gccgtgagct ggtgaagaag gtctccgatg 481 tcatatggaa cagcctcagc cgctcctact tcaaggatcg ggcccacatc cagtccctct 541 tcagcttcat cacaggcacc aaattggaca gctccggtgt ggcctttgct gtggttgggg 601 cctgccaggc cctgggtctc cgggatgtcc acctcgccct gtctgaggat catgcctggg 661 tagtgtttgg gcccaatggg gagcagacag ctgaggtcac ctggcacggc aagggcaacg 721 aggaccgcag gggccagaca gtcaatgccg gtgtggctga gcggagctgg ctgtacctga 781 aaggatcata catgcgctgt gaccgcaaga tggaggtggc gttcatggtg tgtgccatca 841 acccttccat tgacctgcac accgactcgc tggagcttct gcagctgcag cagaagctgc 901 tctggctgct ctatgacctg ggacatctgg aaaggtaccc catggcctta gggaacctgg 961 cagatctaga ggagctggag cccacccctg gccggccaga cccactcacc ctctaccaca 1021 agggcattgc ctcagccaag acctactatc gggatgaaca catctacccc tacatgtacc 1081 tggctggcta ccactgtcgc aaccgcaatg tgcgggaagc cctgcaggcc tgggcggaca 1141 cggccactgt catccaggac tacaactact gccgggaaga cgaggagatc tacaaggagt 1201 tctttgaagt agccaatgat gtcatcccca acctgctgaa ggaggcagcc agcttgctgg 1261 aggcgggcga ggagcggccg ggggagcaaa gccagggcac ccagagccaa ggttccgccc 1321 tccaggaccc tgagtgcttc gcccacctgc tgcgattcta cgacggcatc tgcaaatggg 1381 aggagggcag tcccacgcct gtgctgcacg tgggctgggc cacctttctt gtgcagtccc 1441 taggccgttt tgagggacag gtgcggcaga aggtgcgcat agtgagccga gaggccgagg 1501 cggccgaggc cgaggagccg tggggcgagg aagcccggga aggccggcgg cggggcccac 1561 ggcgggagtc caagccagag gagcccccgc cgcccaagaa gccagcactg gacaagggcc 1621 tgggcaccgg ccagggtgca gtgtcaggac ccccccggaa gcctcctggg actgtcgctg 1681 gcacagcccg aggccctgaa ggtggcagca cggctcaggt gccagcaccc gcagcatcac 1741 caccgccgga gggtccagtg ctcactttcc agagtgagaa gatgaagggc atgaaggagc 1801 tgctggtggc caccaagatc aactcgagcg ccatcaagct gcaactcacg gcacagtcgc 1861 aagtgcagat gaagaagcag aaagtgtcca cccctagtga ctacactctg tctttcctca 1921 agcggcagcg caaaggcctc tgaactactg gggacttcgg accgcttgtg gggacccagg 1981 ctccgcctta gtcccccaac tctgagccca tgttctgccc ccagcccaaa ggggacaggc 2041 ctcacctcta cccaaaccct aggttcccgg tcccgagtac agtctgtatc aaacccacga 2101 ttttctccag ctcagaaccc agggctctgc cccagtcgtt agaatatagg tctcttctcc 2161 cagaatccca gccggccaat ggaaacctca cgctgggtcc taattaccag tctttaaagg 2221 cccagcccct agaaacccaa gctcctcctc ggaaccgctc acctagagcc agaccaacgt 2281 tactcagggc tcctcccagc ttgtaggagc tgaggtttca cccttaaccc aagggagcac 2341 aggtcccacc tccagcccgg ggagcctagg accactcagc ccctaggagt atatttccgc 2401 acttcagaat tccatatctt gcgaatccaa gctccctgcc ccaaataact tcagtcctgc 2461 ttccagaatt tggaaatcct agtttcctct ccttcgtatc ccgagtctgg gacacaaaac 2521 tccgccccca gcctatgagc atcctgagcc ccgccctctt cctgacgaaa ctggccccgg 2581 atcagagcag gacctccctt ccgaccctct gggaacctcc cagaggtcca gcccatctcg 2641 gagcatcccg gaggaaatct gcagaggggt taggagtggg tgacaagagc ctgatctctt 2701 cctgttttgt acatagattt atttttcagt tccaagaaag atgaatacat tttgttaaaa 2761 aaaaaaaaaa aa // LOCUS HSU93720 1525 bp mRNA PRI 28-SEP-1997 DEFINITION Homo sapiens TEX28 mRNA, complete cds. ACCESSION U93720 NID g2443443 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1525) AUTHORS Hanna,M.C., Platts,J.T. and Kirkness,E.F. TITLE Identification of a gene within the tandem array of red and green color pigment genes JOURNAL Genomics 43 (3), 384-386 (1997) MEDLINE 97422617 REFERENCE 2 (bases 1 to 1525) AUTHORS Hanna,M.C., Platts,J.T. and Kirkness,E.F. TITLE Direct Submission JOURNAL Submitted (17-MAR-1997) Department of Molecular and Cellular Biology, The Institute for Genomic Research, 9712 Medical Center Drive, Rockville, MD 20850, USA FEATURES Location/Qualifiers source 1..1525 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /map="Xq28" /tissue_type="testes" CDS 145..1377 /codon_start=1 /product="TEX28" /db_xref="PID:g2443444" /translation="MVLKAEHTRSPSATLPSNVPSCRSLSSSEDGPSGPSSLADGGLA HNLQDSVRHRILYLSEQLRVEKASRDGNTVSYLKLVSKADRHQVPHIQQAFEKVNQRA SATIAQIEHRLHQCHQQLQELEEGCRPEGLLLMAESDPANCEPPSEKALLSEPPEPGG EDGPVNLPHASRPFILESRFQSLQQGTCLETEDVAQQQNLLLQKVKAELEEAKRFHIS LQESYHSLKERSLTDLQLLLESLQEEKCRQALMEEQVNGRLQGQLNEIYNLKHNLACS EERMAYLSYERAKEIWEITETFKSRISKLEMLQQVTQLEAAEHLQSRPPQMLFKFLSP RLSLATVLLVFVSTLCACPSSLISSRLCTCTMLMLIGLGVLAWQRWRAIPATDWQEWV PSRCRLYSKDSGPPADGP" BASE COUNT 356 a 464 c 415 g 290 t ORIGIN 1 tggtgccagc actagccccc atgtcggtct cagagaacct tctccccacc tctgagttat 61 tctctcagtg tatcgaagat atcagtcaac tatcttctgg attgcattga tgctattgag 121 aaggcagcct gcagtctaaa agtcatggtt ttaaaggcgg aacacaccag gagccccagc 181 gcaaccctcc cctccaatgt gccttcatgc cggtccctgt catccagcga agacggcccc 241 agtggccctt ccagcctcgc agatggaggc ctagcccaca acttacagga tagtgtcagg 301 caccgcatcc tctacctctc agagcagctg agagtggaga aggccagtcg ggatggcaac 361 actgtgagct acctcaagct ggtatccaaa gcagaccggc accaggtgcc gcacatccag 421 caggcctttg agaaggtgaa ccagcgcgcc tctgccacca tcgcccagat cgagcacagg 481 ctccaccagt gtcaccagca gctccaggag ctggaggaag gctgcaggcc cgagggctta 541 ctgctgatgg cagaaagcga cccagccaac tgcgagccac ccagtgagaa ggccctgctt 601 tcagagcccc ccgagccagg tggggaagac gggccggtca acctgcctca tgccagcagg 661 cccttcatct tggagagtcg cttccagagc ttacagcagg ggacgtgctt agagacagag 721 gatgtggccc agcaacaaaa cctgctgttg cagaaggtaa aggcagagct ggaagaagcc 781 aagaggttcc acatcagcct ccaggagtcc tatcacagcc taaaggagag gtctctgact 841 gacctgcagc tgttgctgga gtcccttcag gaggagaagt gtaggcaagc attgatggaa 901 gaacaggtga atggtcgcct gcagggacag ctgaatgaga tttacaacct caaacacaat 961 ctggcctgca gcgaagagag aatggcctat ctatcctatg agagagccaa ggaaatatgg 1021 gagatcacgg agaccttcaa gagccgaata tccaagctgg agatgctaca gcaagtcacc 1081 caactggagg cagcggagca cctccaaagc cgtcccccgc agatgttgtt caagttcctg 1141 agtccgcgcc tctcactggc aaccgtcctc ttggtctttg tctccacctt gtgtgcctgc 1201 ccctcgtcac tgatcagctc acgcctgtgc acctgcacca tgctgatgct gatcgggctt 1261 ggggtcctgg cctggcagag gtggcgcgcc atccctgcca cagactggca ggaatgggtc 1321 ccctccaggt gtagactgta ctccaaggac tctgggcctc cagcagatgg accttaaggg 1381 gccaggaggg ccacctgcct tagcttgcta gctccctccc tcctcctggg tgctgagggc 1441 atccagcaag cccctccaca gctcttgctt gccgattatg taaccaccag cctggtgaaa 1501 tggatataga cgcccacctg cctca // LOCUS HSU93850 2178 bp mRNA PRI 25-MAY-1997 DEFINITION Homo sapiens elongation factor-2 kinase mRNA, complete cds. ACCESSION U93850 NID g2104698 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2178) AUTHORS Ryazanov,A.G., Ward,M.D., Mendola,C.E., Pavur,K.S., Dorovkov,M.V., Wiedmann,M., Erdjument-Bromage,H., Tempst,P., Parmer,T.G., Prostko,C.R., Germino,F.J. and Hait,W.N. TITLE Identification of a new class of protein kinases represented by eukaryotic elongation factor-2 kinase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (10), 4884-4889 (1997) MEDLINE 97289688 REFERENCE 2 (bases 1 to 2178) AUTHORS Ryazanov,A.G., Ward,M.D., Mendola,C.E., Pavur,K.S., Dorovkov,M.V., Wiedmann,M., Erdjument-Bromage,H., Tempst,P., Parmer,T.G., Prostko,C.R., Germino,F.J. and Hait,W.N. TITLE Direct Submission JOURNAL Submitted (17-MAR-1997) Pharmacology, Robert Wood Johnson Medical School-UMDNJ, 675 Hoes Lane, Piscataway, NJ 08854, USA FEATURES Location/Qualifiers source 1..2178 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="glioma" /cell_line="T98G" CDS 1..2178 /function="protein kinase" /codon_start=1 /product="elongation factor-2 kinase" /db_xref="PID:g2104699" /translation="MADEDLIFRLEGVDGGQSPRAGHDGDSDGDSDDEEGYFICPITD DPSSNQNVNSKVNKYYSNLTKSERYSSSGSPANSFHFKEAWKHAIQKAKHMPDPWAEF HLEDIATERATRHRYNAVTGEWLDDEVLIKMASQPFGRGAMRECFRTKKLSNFLHAQQ WKGASNYVAKRYIEPVDRDVYFEDVRLQMEAKLWGEEYNRHKPPKQVDIMQMCIIELK DRPGKPLFHLEHYIEGKYIKYNSNSGFVRDDNIRLTPQAFSHFTFERSGHQLIVVDIQ GVGDLYTDPQIHTETGTDFGDGNLGVRGMALFFYSHACNRICESMGLAPFDLSPRERD AVNQNTKLLQSAKTILRGTEEKCGSPRVRTLSGSRPPLLRPLSENSGDENMSDVTFDS LPSSPSSATPHSQKLDHLHWPVFSDLDNMASRDHDHLDNHRESENSGDSGYPSEKRGE LDDPEPREHGHSYSNRKYESDEDSLGSSGRVCVEKWNLLNSSRLHLPRASAVALEVQR LNALDLEKKIGKSILGKVHLAMVRYHEGGRFCEKGEEWDQESAVFHLEHAANLGELEA IVGLGLMYSQLPHHILADVSLKETEENKTKGFDYLLKAAEAGDRQSMILVARAFDSGQ NLSPDRCQDWLEALHWYNTALEMTDCDEGGEYDGMQDEPRYMMLAREAEMLFTGGYGL EKDPQRSGDLYTQAAEAAMEAMKGRLANQYYQKAEEAWAQMEE" BASE COUNT 541 a 607 c 637 g 393 t ORIGIN 1 atggcagacg aagacctcat cttccgcctg gaaggtgttg atggcggcca gtccccccga 61 gctggccatg atggtgattc tgatggggac agcgacgatg aggaaggtta cttcatctgc 121 cccatcacgg atgacccaag ctcgaaccag aatgtcaatt ccaaggttaa taagtactac 181 agcaacctaa caaaaagtga gcggtatagc tccagcgggt ccccggcaaa ctccttccac 241 ttcaaggaag cctggaagca cgcaatccag aaggccaagc acatgcccga cccctgggct 301 gagttccacc tggaagatat tgccaccgaa cgtgctactc gacacaggta caacgccgtc 361 accggggaat ggctggatga tgaagttctg atcaagatgg catctcagcc cttcggccga 421 ggagcaatga gggagtgctt ccggacgaag aagctctcca acttcttgca tgcccagcag 481 tggaagggcg cctccaacta cgtggcgaag cgctacatcg agcccgtaga ccgggatgtg 541 tactttgagg acgtgcgtct acagatggag gccaagctct ggggggagga gtataatcgg 601 cacaagcccc ccaagcaggt ggacatcatg cagatgtgca tcatcgagct gaaggacaga 661 ccgggcaagc ccctcttcca cctggagcac tacatcgagg gcaagtacat caagtacaac 721 tccaactctg gctttgtccg tgatgacaac atccgactga cgccgcaggc cttcagccac 781 ttcacttttg agcgttccgg ccatcagctg atagtggtgg acatccaggg agttggggat 841 ctctacactg acccacagat ccacacggag acgggcactg actttggaga cggcaaccta 901 ggtgtccgcg ggatggcgct cttcttctac tctcatgcct gcaaccggat ttgcgagagc 961 atgggccttg ctccctttga cctctcgccc cgggagaggg atgcagtgaa tcagaacacc 1021 aagctgctgc aatcagccaa gaccatcttg agaggaacag aggaaaaatg tgggagcccc 1081 cgagtaagga ccctctctgg gagccggcca cccctgctcc gtcccctttc agagaactct 1141 ggagacgaga acatgagcga cgtgaccttc gactctctcc cttcttcccc atcttcggcc 1201 acaccacaca gccagaagct agaccacctc cattggccag tgttcagtga cctcgataac 1261 atggcatcca gagaccatga tcatctagac aaccaccggg agtctgagaa tagtggggac 1321 agcggatacc ccagtgagaa gcggggtgag ctggatgacc ctgagccccg agaacatggc 1381 cactcataca gtaatcggaa gtacgagtct gacgaagaca gcctgggcag ctctggacgg 1441 gtatgtgtag agaagtggaa tctcctcaac tcctcccgcc tccacctgcc gagggcttcg 1501 gccgtggccc tggaagtgca aaggcttaat gctctggacc tcgaaaagaa aatcgggaag 1561 tccattttgg ggaaggtcca tctggccatg gtgcgctacc acgagggtgg gcgcttctgc 1621 gagaagggcg aggagtggga ccaggagtcg gctgtcttcc acctggagca cgcagccaac 1681 ctgggcgagc tggaggccat cgtgggcctg ggactcatgt actcgcagtt gcctcatcac 1741 atcctagccg atgtctctct gaaggagaca gaagagaaca aaaccaaagg atttgattac 1801 ttactaaagg ccgctgaagc tggcgacagg cagtccatga tcctagtggc gcgagctttt 1861 gactctggcc agaacctcag cccggacagg tgccaagact ggctagaggc cctgcactgg 1921 tacaacactg ccctggagat gacggactgt gatgagggcg gtgagtacga cggaatgcag 1981 gacgagcccc ggtacatgat gctggccagg gaggcagaga tgctgttcac aggaggctac 2041 gggctggaga aggacccgca gagatcaggg gacttgtata cccaggcagc agaggcagcg 2101 atggaagcca tgaagggccg actggccaac cagtactacc aaaaggctga agaggcctgg 2161 gcccagatgg aggaataa // LOCUS HSU93868 1000 bp mRNA PRI 23-JUL-1997 DEFINITION Human RNA polymerase III subunit (RPC32) mRNA, complete cds. ACCESSION U93868 NID g2228749 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1000) AUTHORS Wang,Z. and Roeder,R.G. TITLE Three human RNA polymerase III-specific subunits form a subcomplex with a selective function in specific transcription initiation JOURNAL Genes Dev. 11 (10), 1315-1326 (1997) MEDLINE 97315201 REFERENCE 2 (bases 1 to 1000) AUTHORS Wang,Z. and Roeder,R.G. TITLE Direct Submission JOURNAL Submitted (17-MAR-1997) Laboratory of Biochemistry and Molecular Biology, The Rockefeller University, 1230 York Avenue, New York, NY 10021, USA FEATURES Location/Qualifiers source 1..1000 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" gene 1..1000 /gene="RPC32" CDS 163..864 /gene="RPC32" /codon_start=1 /product="RNA polymerase III subunit" /db_xref="PID:g2228750" /translation="MAGNKGRGRAAYTFNIEAVGFSKGEKLPDVVLKPPPLFPDTDYK PVPLKTGEGEEYMLALKQELRETMKRMPYFIETPEERQDIERYSKRYMKVYQEEWIPD WRRLPREMMPRNKCKKAGPKPKKAKDAGKGTPLTNTEDVLKKMVELEKRGDGEKSDEE NEEKEGSKEKSKEGDDDDDDDAAEQEEYDEEEQEEENDYINSYFEDGDDFGADVMTTW MRQPIRHEIFQKIFL" BASE COUNT 359 a 157 c 242 g 242 t ORIGIN 1 cgaggtggac gggcggcagt caagcgccgg cgttctctgc catcaccctt tccttgccgg 61 ccggcacttc ggctgcagag ttttgcccac gcttcgagac ttagggagca gtgcctttca 121 gaatttcaga atttgcccac tcatctggta taactggttc tgatggctgg gaataaagga 181 agaggacgtg ctgcttatac ctttaatatt gaggctgttg gatttagcaa aggtgaaaag 241 ttacctgatg tagtgttgaa accaccccca ctatttcctg atacagatta taaaccagta 301 ccactgaaaa caggagaagg tgaagaatat atgctggctt tgaaacagga gttgagagaa 361 acaatgaaaa gaatgcctta ttttattgaa acacctgaag aaagacaaga tattgaaagg 421 tatagtaaaa gatacatgaa ggtataccag gaagaatgga taccagattg gagaagactt 481 ccaagagaga tgatgccaag aaataaatgt aaaaaagcag gcccaaaacc caaaaaggca 541 aaagacgcag gcaaaggcac accactcact aatactgaag atgtgttgaa aaaaatggtg 601 gaattggaaa aaagaggtga tggtgaaaaa tcagatgagg aaaatgaaga gaaagaagga 661 agcaaagaga aaagtaaaga aggtgatgat gacgatgacg atgatgccgc agaacaggag 721 gaatatgatg aagaagagca agaagaggaa aatgactaca ttaattcata ctttgaagat 781 ggagatgatt ttggcgcaga cgtgatgaca acatggatga ggcaacctat taggcatgaa 841 atttttcaaa aaatattttt atgatgcagc ttctgaacat ttggacagac ttgatttgta 901 ttttatttct gataaggaat aagatcttgt ttctgttgtt ttggacaaaa tgttgttacc 961 aaaatatcaa aaccactttg agtttacata cagttacctt // LOCUS HSU93869 2149 bp mRNA PRI 23-JUL-1997 DEFINITION Human RNA polymerase III subunit (RPC39) mRNA, complete cds. ACCESSION U93869 NID g2228751 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2149) AUTHORS Wang,Z. and Roeder,R.G. TITLE Three human RNA polymerase III-specific subunits form a subcomplex with a selective function in specific transcription initiation JOURNAL Genes Dev. 11 (10), 1315-1326 (1997) MEDLINE 97315201 REFERENCE 2 (bases 1 to 2149) AUTHORS Wang,Z. and Roeder,R.G. TITLE Direct Submission JOURNAL Submitted (17-MAR-1997) Laboratory of Biochemistry and Molecular Biology, The Rockefeller University, 1230 York Avenue, New York, NY 10021, USA FEATURES Location/Qualifiers source 1..2149 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" gene 1..2149 /gene="RPC39" CDS 74..1027 /gene="RPC39" /codon_start=1 /product="RNA polymerase III subunit" /db_xref="PID:g2228752" /translation="MGEVKVKVQPPDADPVEIENRIIELCHQFPHGITDQVIQNEMPQ YRSPAAGSSINRLLSMGQLDLLRSNTGLLYRIKDSQNAGKMKGSDNQEKLVYQIIEDA GNKGIWSRDIRYKSNLPLTEINKILKNLESKKLIKAVKSVAASKKKVYMLYNLQPDRS VTGGAWYSDQDFESEFVEVLNQQCFKFLQSKAETARESKQNPMIQRNSSFASSHEVWK YICELGISKVELSMEDIETILNTLIYDGKVEMTIIACKRRHSWQCRWTHETVQGSQSN HPSHRFGPGHPVDSAPVFDDCHEGGEISPSNCIYMTEWLEF" BASE COUNT 669 a 397 c 446 g 634 t 3 others ORIGIN 1 cgacccgggt tccgccgctt gctaccgggc tgctccgtgc atctttcccc ccaggcgtca 61 ggaactgcgc ctcatgggcg aggtgaaggt gaaggtgcag ccgcctgacg ccgatccggt 121 cgaaatagaa aacaggatta tagaattatg tcaccagttc cctcatggaa tcacagacca 181 agtaattcag aatgaaatgc ctcaatatag aagcccagca gcgggcagta gcatcaatag 241 gttgttgtct atgggtcagt tggatctctt aaggagcaat acgggccttt tatatagaat 301 aaaggactct cagaatgctg gtaaaatgaa gggatccgat aaccaagaaa aactagtata 361 tcaaatcata gaggatgcag gaaataaagg aatatggagc agagatatcc gctataaaag 421 taatttgcca ttaacagaaa tcaacaaaat tctgaagaat ctggaaagta aaaagcttat 481 caaagctgtt aagtctgtag cagcctcaaa aaagaaggtg tatatgctct ataacctgca 541 gccagaccgg tctgtgactg gtggagcctg gtacagtgac caggattttg aatctgaatt 601 tgtagaggtg cttaaccaac agtgttttaa attcctacag tccaaggcag aaacagcacg 661 agaaagcaaa cagaacccaa tgatacaaag aaatagttca tttgcctcat cacatgaagt 721 gtggaaatat atctgcgaat tgggaatcag taaggtagag ttatccatgg aagacattga 781 aaccatcctg aatacactca tttatgatgg aaaagtggag atgacgatta ttgcctgcaa 841 aagaaggcac agttggcagt gtagatggac acatgaaact gtacagggca gtcaatccaa 901 tcatccctcc cacaggtttg gtccgggcca ccctgtggac tctgccccgg tttttgatga 961 ctgccacgaa ggtggtgaga tttcaccatc taactgtatt tacatgacag agtggctcga 1021 attttaatag agagctatga actttattga cattttgcaa atgaagttac ttagggagca 1081 gataatttaa ttcatgatgg aacacgaaat ctccttgaaa gcaaacttca caataatgga 1141 cgtagacttg ctgctatgaa aacatatttt ttttatttat gaagactaaa tttatattgg 1201 taaaatagcc agtagaatat gaaagaaata aggttagtag tgaaattcat tcttcaataa 1261 ataaaacact ttgaaactcc ggaggaccac atctttcaag acttctgatg ggcgaagccc 1321 aaagatgcca acatacccgt atttaccaag tactatgata atggctagag tataaaaatg 1381 ttctttttaa agttatttat taagttcttc attggacgct tttttttata tctggttcac 1441 taccaccatt ttctgtttcc tactttctca gtggtttcat tgaaaagaaa ttagaagggg 1501 ttaaaggcag gaatagcaaa gagtgcaaac ttggggtatg actgggggag agtggaacat 1561 gccttttccg cacaatatta attccttttt gtatcagaaa ggnnctntta ggagttatgc 1621 taccatactt acttcaaacc caatgactac tgtcaaggtc atattttcag tacataaata 1681 ctatcatttt cattctaaag aatattttca ctgttccttc tttcttaaag tcttatgttt 1741 cactctttaa ctcaaatgta ttctttgtta gaatttaccc tagattctta tttaatgtct 1801 gcagtagact gaatgtttgt gtgcccccag aattctaatg ttgaaatctc atttccaatg 1861 tgatggtatt tggaggtggg gcttttggta agtgataggt caggagagta acagcgctca 1921 tgaatgggat tagtgccctt atataaagag acccagagag ctccatcacc ccttctgcca 1981 tgtgaaaggg agaagacaaa catccacgaa ccaggaagtg ggtcctcacc agaaaacaaa 2041 tctgtaagca ccttgatctt ggacttccca gcctccagaa ttgtgagaaa taaatttctg 2101 ttgttgattt tttttttttt tttttttttt tttttttttt ttttttttt // LOCUS HSU94332 1356 bp mRNA PRI 06-MAY-1997 DEFINITION Human osteoprotegerin (OPG) mRNA, complete cds. ACCESSION U94332 NID g2072184 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1356) AUTHORS Simonet,W.S., Lacey,D.L., Dunstan,C.R., Kelley,M., Chang,M.S., Luthy,R., Nguyen,H.Q., Wooden,S., Bennett,L., Boone,T., Shimamoto,G., DeRose,M., Elliott,R., Colombero,A., Tan,H.L., Trail,G., Sullivan,J., Davy,E., Bucay,N., Renshaw-Gegg,L., Hughes,T.M., Hill,D., Pattison,W., Campbell,P., Sander,S., Van,G., Tarpley,J., Derby,P., Lee,R., Amgen EST Program and Boyle,W.J. TITLE Osteoprotegerin: a novel secreted protein involved in the regulation of bone density JOURNAL Cell 89 (2), 309-319 (1997) MEDLINE 97262071 REFERENCE 2 (bases 1 to 1356) AUTHORS Boyle,W.J. TITLE Direct Submission JOURNAL Submitted (18-MAR-1997) Department of Cell Biology, Amgen, Inc., 1840 Dehavilland Drive, Thousand Oaks, CA 91320, USA FEATURES Location/Qualifiers source 1..1356 /organism="Homo sapiens" /db_xref="taxon:9606" gene 95..1300 /gene="OPG" CDS 95..1300 /gene="OPG" /codon_start=1 /product="osteoprotegerin" /db_xref="PID:g2072185" /translation="MNKLLCCALVFLDISIKWTTQETFPPKYLHYDEETSHQLLCDKC PPGTYLKQHCTAKWKTVCAPCPDHYYTDSWHTSDECLYCSPVCKELQYVKQECNRTHN RVCECKEGRYLEIEFCLKHRSCPPGFGVVQAGTPERNTVCKRCPDGFFSNETSSKAPC RKHTNCSVFGLLLTQKGNATHDNICSGNSESTQKCGIDVTLCEEAFFRFAVPTKFTPN WLSVLVDNLPGTKVNAESVERIKRQHSSQEQTFQLLKLWKHQNKAQDIVKKIIQDIDL CENSVQRHIGHANLTFEQLRSLMESLPGKKVGAEDIEKTIKACKPSDQILKLLSLWRI KNGDQDTLKGLMHALKHSKTYHFPKTVTQSLKKTIRFLHSFTMYKLYQKLFLEMIGNQ VQSVKISCL" BASE COUNT 421 a 326 c 314 g 294 t 1 others ORIGIN 1 gtatatataa cgtgatgagc gtacgggtgc ggagacgcac cggagcgctc gcccagccgc 61 cgyctccaag cccctgaggt ttccggggac cacaatgaac aagttgctgt gctgcgcgct 121 cgtgtttctg gacatctcca ttaagtggac cacccaggaa acgtttcctc caaagtacct 181 tcattatgac gaagaaacct ctcatcagct gttgtgtgac aaatgtcctc ctggtaccta 241 cctaaaacaa cactgtacag caaagtggaa gaccgtgtgc gccccttgcc ctgaccacta 301 ctacacagac agctggcaca ccagtgacga gtgtctatac tgcagccccg tgtgcaagga 361 gctgcagtac gtcaagcagg agtgcaatcg cacccacaac cgcgtgtgcg aatgcaagga 421 agggcgctac cttgagatag agttctgctt gaaacatagg agctgccctc ctggatttgg 481 agtggtgcaa gctggaaccc cagagcgaaa tacagtttgc aaaagatgtc cagatgggtt 541 cttctcaaat gagacgtcat ctaaagcacc ctgtagaaaa cacacaaatt gcagtgtctt 601 tggtctcctg ctaactcaga aaggaaatgc aacacacgac aacatatgtt ccggaaacag 661 tgaatcaact caaaaatgtg gaatagatgt taccctgtgt gaggaggcat tcttcaggtt 721 tgctgttcct acaaagttta cgcctaactg gcttagtgtc ttggtagaca atttgcctgg 781 caccaaagta aacgcagaga gtgtagagag gataaaacgg caacacagct cacaagaaca 841 gactttccag ctgctgaagt tatggaaaca tcaaaacaaa gcccaagata tagtcaagaa 901 gatcatccaa gatattgacc tctgtgaaaa cagcgtgcag cggcacattg gacatgctaa 961 cctcaccttc gagcagcttc gtagcttgat ggaaagctta ccgggaaaga aagtgggagc 1021 agaagacatt gaaaaaacaa taaaggcatg caaacccagt gaccagatcc tgaagctgct 1081 cagtttgtgg cgaataaaaa atggcgacca agacaccttg aagggcctaa tgcacgcact 1141 aaagcactca aagacgtacc actttcccaa aactgtcact cagagtctaa agaagaccat 1201 caggttcctt cacagcttca caatgtacaa attgtatcag aagttatttt tagaaatgat 1261 aggtaaccag gtccaatcag taaaaataag ctgcttataa ctggaaatgg ccattgagct 1321 gtttcctcac aattggcgag atcccatgga tgataa // LOCUS HSU94333 3460 bp mRNA PRI 29-APR-1997 DEFINITION Human Clq/MBL/SPA receptor C1qR(p) mRNA, complete cds. ACCESSION U94333 NID g2052497 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3460) AUTHORS Nepomuceno,R.R., Henschen-Edman,A.H., Burgess,W.H. and Tenner,A.J. TITLE cDNA cloning and primary structure analysis of C1qR(P), the human C1q/MBL/SPA receptor that mediates enhanced phagocytosis in vitro JOURNAL Immunity 6 (2), 119-129 (1997) MEDLINE 97199258 REFERENCE 2 (bases 1 to 3460) AUTHORS Nepomuceno,R.R., Henschen-Edman,A.H., Burgess,W.H. and Tenner,A.J. TITLE Direct Submission JOURNAL Submitted (17-MAR-1997) Molecular Biology and Biochemistry, University of California, Irvine, 3205 BioSci. II, Irvine, CA 92697-3900, USA FEATURES Location/Qualifiers source 1..3460 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="U937 histiocytic cell line" CDS 149..2107 /function="mediates enhanced phagocytosis by human monocytes and macrophages in response to complement C1q, mannose binding lectin (MBL) and pulmonary surfactant protein A (SPA)" /note="Clq/MBL/SPA receptor" /codon_start=1 /product="C1qR(p)" /db_xref="PID:g2052498" /translation="MATSMGLLLLLLLLLTQPGAGTGADTEAVVCVGTACYTAHSGKL SAAEAQNHCNQNGGNLATVKSKEEAQHVQRVLAQLLRREAALTARMSKFWIGLQREKG KCLDPSLPLKGFSWVGGGEDTPYSNWHKELRNSCISKRCVSLLLDLSQPLLPNRLPKW SEGPCGSPGSPGSNIEGFVCKFSFKGMCRPLALGGPGQVTYTTPFQTTSSSLEAVPFA SAANVACGEGDKDETQSHYFLCKEKAPDVFDWGSSGPLCVSPKYGCNFNNGGCHQDCF EGGDGSFLCGCRPGFRLLDDLVTCASRNPCSSSPCRGGATCVLGPHGKNYTCRCPQGY QLDSSQLDCVDVDECQDSPCAQECVNTPGGFRCECWVGYEPGGPGEGACQDVDECALG RSPCAQGCTNTDGSFHCSCEEGYVLAGEDGTQCQDVDECVGPGGPLCDSLCFNTQGSF HCGCLPGWVLAPNGVSCTMGPVSLGPPSGPPDEEDKGEKEGSTVPRAATASPTRGPEG TPKATPTTSRPSLSSDAPITSAPLKMLAPSGSSGVWREPSIHHATAASGPQEPAGGDS SVATQNNDGTDGQKLLLFYILGTVVAILLLLALALGLLVYRKRRAKREEKKEKKPQNA ADSYSWVPERAESRAMENQYSPTPGTDC" BASE COUNT 764 a 943 c 996 g 757 t ORIGIN 1 aaagccctca gcctttgtgt ccttctctgc gccggagtgg ctgcagctca cccctcagct 61 ccccttgggg cccagctggg agccgagata gaagctcctg tcgccgctgg gcttctcgcc 121 tcccgcagag ggccacacag agaccgggat ggccacctcc atgggcctgc tgctgctgct 181 gctgctgctc ctgacccagc ccggggcggg gacgggagct gacacggagg cggtggtctg 241 cgtggggacc gcctgctaca cggcccactc gggcaagctg agcgctgccg aggcccagaa 301 ccactgcaac cagaacgggg gcaacctggc cactgtgaag agcaaggagg aggcccagca 361 cgtccagcga gtactggccc agctcctgag gcgggaggca gccctgacgg cgaggatgag 421 caagttctgg attgggctcc agcgagagaa gggcaagtgc ctggacccta gtctgccgct 481 gaagggcttc agctgggtgg gcggggggga ggacacgcct tactctaact ggcacaagga 541 gctccggaac tcgtgcatct ccaagcgctg tgtgtctctg ctgctggacc tgtcccagcc 601 gctccttccc aaccgcctgc ccaagtggtc tgagggcccc tgtgggagcc caggctcccc 661 cggaagtaac attgagggct tcgtgtgcaa gttcagcttc aaaggcatgt gccggcctct 721 ggccctgggg ggcccaggtc aggtgaccta caccaccccc ttccagacca ccagttcctc 781 cttggaggct gtgccctttg cctctgcggc caatgtagcc tgtggggaag gtgacaagga 841 cgagactcag agtcattatt tcctgtgcaa ggagaaggcc cccgatgtgt tcgactgggg 901 cagctcgggc cccctctgtg tcagccccaa gtatggctgc aacttcaaca atgggggctg 961 ccaccaggac tgctttgaag ggggggatgg ctccttcctc tgcggctgcc gaccaggatt 1021 ccggctgctg gatgacctgg tgacctgtgc ctctcgaaac ccttgcagct ccagcccatg 1081 tcgtgggggg gccacgtgcg tcctgggacc ccatgggaaa aactacacgt gccgctgccc 1141 ccaagggtac cagctggact cgagtcagct ggactgtgtg gacgtggatg aatgccagga 1201 ctccccctgt gcccaggagt gtgtcaacac ccctgggggc ttccgctgcg aatgctgggt 1261 tggctatgag ccgggcggtc ctggagaggg ggcctgtcag gatgtggatg agtgtgctct 1321 gggtcgctcg ccttgcgccc agggctgcac caacacagat ggctcatttc actgctcctg 1381 tgaggagggc tacgtcctgg ccggggagga cgggactcag tgccaggacg tggatgagtg 1441 tgtgggcccg gggggccccc tctgcgacag cttgtgcttc aacacacaag ggtccttcca 1501 ctgtggctgc ctgccaggct gggtgctggc cccaaatggg gtctcttgca ccatggggcc 1561 tgtgtctctg ggaccaccat ctgggccccc cgatgaggag gacaaaggag agaaagaagg 1621 gagcaccgtg ccccgcgctg caacagccag tcccacaagg ggccccgagg gcacccccaa 1681 ggctacaccc accacaagta gaccttcgct gtcatctgac gcccccatca catctgcccc 1741 actcaagatg ctggccccca gtgggtcctc aggcgtctgg agggagccca gcatccatca 1801 cgccacagct gcctctggcc cccaggagcc tgcaggtggg gactcctccg tggccacaca 1861 aaacaacgat ggcactgacg ggcaaaagct gcttttattc tacatcctag gcaccgtggt 1921 ggccatccta ctcctgctgg ccctggctct ggggctactg gtctatcgca agcggagagc 1981 gaagagggag gagaagaagg agaagaagcc ccagaatgcg gcagacagtt actcctgggt 2041 tccagagcga gctgagagca gggccatgga gaaccagtac agtccgacac ctgggacaga 2101 ctgctgaaag tgaggtggcc ctagagacac tagagtcacc agccaccatc ctcagagctt 2161 tgaactcccc attccaaagg ggcacccaca tttttttgaa agactggact ggaatcttag 2221 caaacaattg taagtctcct ccttaaaggc cccttggaac atgcaggtat tttctacggg 2281 tgtttgatgt tcctgaagtg gaagctgtgt gttggcgtgc cacggtgggg atttcgtgac 2341 tctataatga ttgttactcc ccctcccttt tcaaattcca atgtgaccaa ttccggatca 2401 gggtgtgagg aggctggggc taaggggctc ccctgaatat cttctctgct cacttccacc 2461 atctaagagg aaaaggtgag ttgctcatgc tgattaggat tgaaatgatt tgtttctctt 2521 cctaggatga aaactaaatc aattaattat tcaattaggt aagaagatct ggttttttgg 2581 tcaaagggaa catgttcgga ctggaaacat ttctttacat ttgcattcct ccatttcgcc 2641 agcacaagtc ttgctaaatg tgatactgtt gacatcctcc agaatggcca gaagtgcaat 2701 taacctctta ggtggcaagg aggcaggaag tgcctcttta gttcttacat ttctaatagc 2761 cttgggttta tttgcaaagg aagcttgaaa aatatgagaa aagttgcttg aagtgcatta 2821 caggtgtttg tgaagtcaca taatctacgg ggctagggcg agagaggcca gggatttgtt 2881 cacagatact tgaattaatt catccaaatg tactgaggtt accacacact tgactacgga 2941 tgtgatcaac actaacaagg aaacaaattc aaggacaacc tgtctttgag ccagggcagg 3001 cctcagacac cctgcctgtg gccccgcctc cacttcatcc tgcccggaat gccagtgctc 3061 cgagctcaga cagaggaagc cctgcagaaa gttccatcag gctgtttcct aaaggatgtg 3121 tgaacgggag atgatgcact gtgttttgaa agttgtcatt ttaaagcatt ttagcacagt 3181 tcatagtcca cagttgatgc agcatcctga gattttaaat cctgaagtgt gggtggcgca 3241 cacaccaagt agggagctag tcaggcagtt tgcttaagga acttttgttc tctgtctctt 3301 ttccttaaaa ttgggggtaa ggagggaagg aagagggaaa gagatgacta actaaaatca 3361 tttttacagc aaaaactgct caaagccatt taaattatat cctcatttta aaagttacat 3421 ttgcaaatat ttctccctat gataatgcag tcgatagtgt // LOCUS HSU94352 2042 bp mRNA PRI 20-JUN-1997 DEFINITION Human manic fringe precursor mRNA, complete cds. ACCESSION U94352 NID g2204346 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2042) AUTHORS Johnston,S.H., Rauskolb,C., Wilson,R., Prabhakaran,B., Irvine,K.D. and Vogt,T.F. TITLE A family of mammalian Fringe genes implicated in boundary determination and the Notch pathway JOURNAL Development 124 (11), 2245-2254 (1997) MEDLINE 97330691 REFERENCE 2 (bases 1 to 2042) AUTHORS Johnston,S.H., Rauskolb,C., Wilson,R., Prabhakaran,B., Irvine,K.D. and Vogt,T.F. TITLE Direct Submission JOURNAL Submitted (18-MAR-1997) Molecular Biology, Princeton University, Lewis Thomas Laboratories, Washington Rd., Princeton, NJ 08544, USA FEATURES Location/Qualifiers source 1..2042 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HNBAA16" /cell_type="B-cell" sig_peptide 171..281 CDS 171..1136 /codon_start=1 /product="manic fringe precursor" /db_xref="PID:g2204347" /translation="MQCRLPRGLAGALLTLLCMGLLCLRYHLNLSPQRVQGTPELSQP NPGPPKLQLHDVFIAVKTTRAFHRLRLELLLDTWVSRTRELTFVFTDSPDKGLQERLG SHLVVTNCSAEHSHPALSCKMAAEFDTFLASGLRWFCHVDDDNYVNPRALLQLLRAFP LARDVYVGRPSLNRPIHASEPQPHNRTRLVQFWFATGGAGFCINRKLALKMAPWASGS RFMDTSALIRLPDDCTMGYIIECKLGGRLQPSPLFHSHLETLQLLRTAQLPEQVTLSY GVFEGKLNVIKLQGPFSPEEDPSRFRSLHCLLYPDTPWCPQLGAR" mat_peptide 282..1133 /product="manic fringe" BASE COUNT 380 a 632 c 579 g 450 t 1 others ORIGIN 1 ccatggctca gcctctgggt ccagagcctc agctcctacc tcttccctcc ttgccagccc 61 ctgatgcctg ccagactttt gcctctgctg gagcccctgc ctgaccagct tcccctccct 121 gtctggttgg gatttggggg ctgagctgtc tggggtccca gggccaacca atgcagtgcc 181 ggctcccgcg gggcctggct ggagccctcc tcaccctcct gtgcatgggg ctcctgtgtc 241 tgcggtacca cttgaacctg tccccgcagc gggtacaagg gacccccgag ctgagccagc 301 cgaacccggg gccccctaag ctacagctac acgatgtctt cattgcagtg aagacgaccc 361 gggctttcca ccgcttgcgc ctggagctgc tgcttgacac gtgggtttcc aggaccaggg 421 aactgacatt tgtcttcacc gacagcccag acaaaggcct ccaggagaga ctggggtccc 481 accttgtggt caccaactgc tccgcggaac acagccaccc agctctgtcc tgcaagatgg 541 ctgctgagtt cgacaccttc ttggccagtg ggcttaggtg gttctgccat gtggacgatg 601 acaactatgt gaacccaagg gcgctgctgc agcttctgag agccttcccg ctggcccgcg 661 acgtctatgt gggaaggccc agcctgaacc ggcccatcca tgcctcagag ccacagcccc 721 acaaccgcac gaggctggta cagttctggt ttgccactgg gggtgctggc ttctgcatca 781 atcgcaaact ggctttgaag atggctccgt gggccagtgg ctcccgtttc atggacacat 841 ctgctctcat ccggctgcct gatgactgca ccatgggcta tatcattgag tgcaagctgg 901 gcggccgcct gcagcccagc cccctctttc actcccacct ggagaccctg cagctgctga 961 ggactgcaca gctcccagaa caggtcaccc tcagctacgg tgtctttgag gggaaactca 1021 acgtcattaa gctacagggc cccttctccc cggaggagga cccctccaga tttcgctccc 1081 tccattgtct gctctatcca gatacaccct ggtgtcccca gctgggtgcc cgatgaatcc 1141 tgaactgctg ggcaaaggtt gggcagagac ttctgggtgt gccttggctc ccaaggtggc 1201 actgtgggtc cctggcaagt gtcttgtgat aggcagtccc tggcagggcc ttcgggtggt 1261 tggcaagccc aggatctgag tggcaattgg cactgaaggc accccaggcc cctgggaggt 1321 gagttagaca gcccagggga ccaggtggac caggtggtgg ccagagaggc tccaggggct 1381 agactccctc aggaggctga attgaaaaag ggcagggggc acttgagctg ggctggggct 1441 caggggtcct aaccctttag gcagtgacat ggcctctggg tggggtctgn ccgttggccc 1501 tggctaatgt ctctcagtca ttccccctgg ggctcaagcg ctgggccgcc cactcctgcc 1561 tccctcatct gtgtcccgag ttcctgaagg gacatgggtg gaatgatggc agaatccagg 1621 gtcctgcagc acctgctgtt gttgccaacc agtctcccaa agctccttgc tccccacccc 1681 ttgcgaacag gaccagattt tgtttggagc ctcagcatgc cggggcccag atgatggagc 1741 ataacgggtc ccagccaatt gtgatgatcc tttttgctca tttcccagcc tttcttgctg 1801 ttaggggcta ccatgggacc agctctggcc agagggaact aagcaaatcc aatagagatg 1861 tttctgggga aggttttgca gcccactccc catcttcctg ctataaatgt gggtgtgatg 1921 gctggatctg gggcagccac cttgctacca tgaaggaaag gccaagacaa tcatccacag 1981 ctattccctc cagcatctgg ttctgtacaa aaattaaatg cttatttgtt taagtcaaaa 2041 aa // LOCUS HSU94362 3267 bp mRNA PRI 15-NOV-1997 DEFINITION Homo sapiens glycogenin-2 alpha (glycogenin-2) mRNA, complete cds. ACCESSION U94362 NID g2618765 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3267) AUTHORS Mu,J., Skurat,A.V. and Roach,P.J. TITLE Glycogenin-2, a novel self-glucosylating protein involved in liver glycogen biosynthesis JOURNAL J. Biol. Chem. 272 (44), 27589-27597 (1997) MEDLINE 98010589 REFERENCE 2 (bases 1 to 3267) AUTHORS Mu,J. and Roach,P.J. TITLE Direct Submission JOURNAL Submitted (18-MAR-1997) Department of Biochemistry and Molecular Biology, Indiana University School of Medicine, 635 Barnhill Drive, Indianapolis, IN 46202-5122, USA FEATURES Location/Qualifiers source 1..3267 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="7'" /tissue_type="liver" /clone_lib="Clonetech catalog No. HL3006a" gene 1..3267 /gene="glycogenin-2" CDS 284..1789 /gene="glycogenin-2" /note="self-glucosylating protein; associated with glycogen and involved in glycogen biosynthesis; one of three liver isoforms" /codon_start=1 /product="glycogenin-2 alpha" /db_xref="PID:g2618766" /translation="MSETEFHHGAQAGLELLRSSNSPTSASQSAGMTVTDQAFVTLAT NDIYCQGALVLGQSLRRHRLTRKLVVLITPQVSSLLRVILSKVFDEVIEVNLIDSADY IHLAFLKRPELGLTLTKLHCWTLTHYSKCVFLDADTLVLSNVDELFDRGEFSAAPDPG WPDCFNSGVFVFQPSLHTHKLLLQHAMEHGSFDGADQGLLNSFFRNWSTTDIHKHLPF IYNLSSNTMYTYSPAFKQFGSSAKVVHFLGSMKPWNYKYNPQSGSVLEQGSVSSSQHQ AAFLHLWWTVYQNNVLPLYKSVQAGEARASPGHTLCHSDVGGPCADSASGVGEPCENS TPSAGVPCANSPLGSNQPAQGLPEPTQIVDETLSLPEGRRSEDMIACPETETPAVITC DPLSQPSPQPADFTETETILQPANKVESVSSEETFEPSQELPAEALRDPSLQDALEVD LAVSVSQISIEEKVKELSPEEERRKWEEGRIDYMGKDAFARIQEKLDRFLQ" BASE COUNT 749 a 838 c 902 g 778 t ORIGIN 1 cgcgggcttg cgggcagggg ctgcagggag gaggagagga cccgcgcccg cgggggctgg 61 gcggaggcgg ggccgggctt ccggacagag gccaatcgct gccctcgggg cctccagcgc 121 cggctctggg ccgaggcagc cagagcgcgg aagaggcctg gaaatccacg cggattcccg 181 gagacggcgc ctctgctctg cgggttcgtg gcgaggaagt ccacccactg ctcccgggcg 241 caggtctgca ggtccgcgcc cactgcccgc ggcgccactg accatgtcgg agacagagtt 301 tcaccatggt gcccaggctg gtctcgaact cctgaggtca agcaattcac ccacctcagc 361 ctcccaaagt gctggaatga cagtgactga tcaggctttt gtcacactag ccaccaatga 421 catctactgc cagggcgccc tggtcctggg gcagtcactg aggagacaca ggctgacgag 481 gaagctggtg gtgttgatca ctcctcaggt gtccagcctg ctcagggtca tcctctcgaa 541 ggtgttcgat gaagtcattg aagtgaatct aatcgatagt gccgactaca tccacctggc 601 ctttctgaag agacctgagc tcgggctcac cctcaccaag cttcactgtt ggactctcac 661 tcactacagc aagtgtgtct tcctggatgc agacactctg gtgctgtcca atgtcgatga 721 gctgtttgac aggggagagt tttctgcggc cccggacccc ggatggccgg attgcttcaa 781 tagcggggtg tttgtcttcc agccttctct ccacacgcat aaactcctgc tacagcacgc 841 catggaacac ggcagctttg acggggcaga ccaaggctta ctgaatagtt tcttcaggaa 901 ctggtcgacc acagacatcc acaagcacct gccgttcatc tataacttga gtagtaacac 961 gatgtacact tacagccctg ccttcaagca attcggttcc agtgcaaagg tcgtccactt 1021 tttggggtcc atgaaacctt ggaactacaa gtacaatcca cagagtggct cggtgttgga 1081 gcaaggctca gtgtccagca gccagcacca ggcggcattc cttcatctct ggtggacggt 1141 ctaccagaac aacgtgctgc ccctttataa aagcgtccaa gcgggggaag cacgcgcgtc 1201 tcctggtcac acactttgcc acagtgatgt gggggggccg tgtgcggatt cagcctctgg 1261 tgttggagag ccgtgtgaaa attcaacacc cagtgcgggc gtgccgtgtg caaattcacc 1321 actgggttct aaccagcctg ctcagggcct tccggagccg acccagatag tggatgagac 1381 cctgtcccta cctgaaggac gccgttcaga agatatgata gcttgtcctg aaactgagac 1441 tcctgccgtg ataacgtgtg acccactgtc ccagccttcc cctcagcctg cagacttcac 1501 agagactgaa accatcttgc agccagcaaa taaagtcgaa agtgtctcat ccgaggaaac 1561 cttcgaacca agccaggaac tccctgctga ggctctcagg gaccccagtc tgcaggatgc 1621 actggaggtc gacctggccg tctctgtttc ccagatctcc atcgaagaga aggtgaagga 1681 attgagcccc gaggaagaga ggaggaagtg ggaggaaggc cgtatcgact acatggggaa 1741 ggacgcgttt gctcgcatcc aggagaagct ggaccggttc ctgcagtaat ccggcagctg 1801 gtgggcgttg tgtgtagtta gacaatgtcc tgttgggtgg tcctgttgcg tggagatctc 1861 ctctggtcct ttcaaaggga aacgctgttg aaccttgtgc ctctatttat gcttaatcca 1921 tttgagtgcc tcacacaaaa aacgtagagt atagaaatcc accttaaagc ccctcgcccc 1981 aacttctcca ccaacgcctt ctgggctttc ttcagaggtc acttctaccc ttgaagctgt 2041 cggcaaaagc gagcagtaat aacattctag tagactctcg atggtggtct ccgctcttgc 2101 ccgaaggacc tctgaagtac gctggatctg tgttgtacag gtgctgtgag acctacccta 2161 ttcagaatta aacctcactg caaatttcct cccatcacga agctaacaac actaatatac 2221 gtatttagca cctctgaggc tttgccatgg agaccatttc tgtagggcta aggaaacatt 2281 tagacgtggt gactgacttt catttggact tggcgaagtg tatctgagaa acacctcggc 2341 tgtggtctct ctgctttaaa tcctaacagg acttcctaga gcgttgacag aaattctact 2401 cgtggacgtt gggaagaaag attgtaggtg gcttggggaa tgtgggtggc ttagaggatc 2461 taaaccgatt cacttcctgg ttgagaagca acgagggctt gctctaaatc gtttagagga 2521 taacaggatc tagagatgct ctctgcttga caacaaaagt cagggtgcag tcggtccacc 2581 cttgactgct cttggcttgg tctctaccct cactacctca gttctcaata acttagtgaa 2641 tcactgccct cctcaaagcc atttccactc agctctttcc agagaattct cagttttatg 2701 agacgggaaa ctttatttca cgagaaagcc tcattgtcag aagtatcttc attcaatggg 2761 cacaatatgc tgtgtatctc accaggtagc tgtcaggggc caccgagagt gtcgttaaaa 2821 atgggcatcg ttgtaataaa ggaggaaagt gcgacttttg aaatgtttgg aaggtttatt 2881 tctcatgcac attccaggga aaagcagaga gtaaattaga gacgggatag gaaggccgtg 2941 ggagaactcg atcctagcct gtgtcagctg gatgtgttta cgtggagagg cgtggccact 3001 ttttaggtca cctgaagcag tttagccttt ggatagagga acctgcctga atttatggca 3061 ttagtggtgg catttttttg tgtacaagat gtgggtgatg gaggggctgt ttctttttcc 3121 gtgtgggtgg ttaataatcg tcagtctcgg agggcgatgc tcgtaggata tttcaggtga 3181 gtcagggttg gatggtcatc ggctttcaga gggagaccac gggaatgttc agggaaacaa 3241 tgtcagcttc tctgaggacc agaattc // LOCUS HSU94586 518 bp mRNA PRI 22-APR-1997 DEFINITION Human NADH:ubiquinone oxidoreductase MLRQ subunit mRNA, complete cds. ACCESSION U94586 NID g1946691 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 518) AUTHORS Kim,J.W., Lee,Y., Lee,I.A., Kang,H.B., Kang,B.S. and Choe,I.S. TITLE Cloning and expression of human NADH:ubiquinone oxidoreductase MLRQ subunit gene JOURNAL Unpublished REFERENCE 2 (bases 1 to 518) AUTHORS Kim,J.W., Lee,Y., Lee,I.A., Kang,H.B., Kang,B.S. and Choe,I.S. TITLE Direct Submission JOURNAL Submitted (19-MAR-1997) Mol. Cell. Biol. Research Group, Korean Research Institute of Bioscience and Biotechnology, Yoosung, Taejon 305-600, Korea FEATURES Location/Qualifiers source 1..518 /organism="Homo sapiens" /isolate="Korean" /db_xref="taxon:9606" /tissue_type="liver" /dev_stage="fetus" CDS 91..336 /codon_start=1 /product="NADH:ubiquinone oxidoreductase MLRQ subunit" /db_xref="PID:g1946692" /translation="MLRQIIGQAKKHPSLIPLFVFIGTGATGATLYLLRLALFNPDVC WDRNNPEPWNKLGPNDQYKFYSVNVDYSKLKKERPDF" polyA_signal 489..494 polyA_signal 497..502 BASE COUNT 154 a 115 c 103 g 146 t ORIGIN 1 ccgtagtgtc tcattgcaga taatttttag cttagggcct ggtggctagg tcggttctct 61 cctttccagt cggagacctc tgccgcaaac atgctccgcc agatcatcgg tcaggccaag 121 aagcatccaa gcttgatccc cctctttgta tttattggaa ctggagctac tggagcaaca 181 ctgtatctct tgcgtctggc attgttcaat ccagatgttt gttgggacag aaataaccca 241 gagccctgga acaaactggg tcccaatgat caatacaagt tctactcagt gaatgtggat 301 tacagcaagc tgaagaagga acgtccagat ttctaaatga aatgtttcac tataacgctg 361 ctttagaatg aaggtcttcc agaagccaca tccgcacaat tttccactta accaggaaat 421 atttctcctc taaatgcatg aaatcatgtt ggagatctct attgtaatct ctattggaga 481 ttacaatgat taaatcaata aataactgaa aaaaaaaa // LOCUS HSU94703 1594 bp mRNA PRI 25-MAY-1997 DEFINITION Homo sapiens mitochondrial DNA polymerase accessory subunit precursor (MtPolB) mRNA, nuclear gene encoding mitochondrial protein, complete cds. ACCESSION U94703 NID g2114435 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1594) AUTHORS Wang,Y., Farr,C.L. and Kaguni,L.S. TITLE Accessory subunit of mitochondrial DNA polymerase from Drosophila embryos. Cloning, molecular analysis, and association in the native enzyme JOURNAL J. Biol. Chem. 272 (21), 13640-13646 (1997) MEDLINE 97298065 REFERENCE 2 (bases 1 to 1594) AUTHORS Kaguni,L.S. TITLE Direct Submission JOURNAL Submitted (20-MAR-1997) Biochemistry, Michigan State University, Wilson Road, East Lansing, MI 48824-1319, USA FEATURES Location/Qualifiers source 1..1594 /organism="Homo sapiens" /db_xref="taxon:9606" gene 422..1540 /gene="MtPolB" CDS 422..1540 /gene="MtPolB" /codon_start=1 /product="mitochondrial DNA polymerase accessory subunit precursor" /db_xref="PID:g2114436" /translation="MVDLGGGVHGAVFPVDALHHKPSPLLPGDSAFRLVSAETLREIL QDKELSKEQLVAFLENVLKTSGKLRENLLHGALEHYVNCLDLVNKRLPYGLAQIGVCF HPVFDTKQIRNGVKSIGEKTEASLVWFTPPRTSNQWLDFWLRHRLQWWRKFAMSPSNF SSSDCQDEEGRKGTNFTTIFPWGKELIETLWNLGDHELLHMYPGNVSKLHGRDGRKNV VPCVLSVNGDLDRGMLAYLYDSFQLTENSFTRKKNLHRKVLKLHPCLAPIKVALDVGR GPTLELRQVCQGLFNELLENGISVWPGYLETMQSSLEQLYSKYDEMSILFTVLVTETT LENGLIHLRSRDTTMKEMMHISKLKDFLIKYISSAKNV" BASE COUNT 441 a 314 c 422 g 417 t ORIGIN 1 gccaagcttg gcacgaggtg gcacgagggg cttgttggga tccgttgagt gatgggagag 61 tgtgctcttt aacttcggag agagatgcgc tctcgtgtag ccgtcagggc ctgccataag 121 gtctgcaggt gcctgttgtc tgggtttggg ggtcgagtag atgcggggca gccggagctg 181 ttgacggaaa ggagtagccc caaaggaggg catgtgaagt cgcacgcgga ctcgagggga 241 acggcgagca cccagaagcc cccgggtctg gagagggaag cgaggcgctg ttagagatct 301 gtcagagaag gcatttccta agtggaagca agcagcagct tagccgggat tctcttctga 361 gtgggtgcca tcccggcttc ggacccttgg gcgtagagtt gcggaagaac ctggccgcag 421 aatggtggac ctcggtggtg gtgttcacgg agcggtattc ccggtggacg ccctccacca 481 caaaccaagc cctttgctac ccggggacag tgccttcagg ttagtttctg cagaaactct 541 acgcgaaatc ttgcaagaca aagagctgag taaggaacag ctagtagcat ttcttgagaa 601 cgtattaaaa acttctggga aactacggga gaaccttctt cacggtgcct tggaacacta 661 tgttaattgc ctggatctgg taaacaagag gctaccttat ggccttgctc agattggagt 721 gtgttttcat cctgtttttg acactaagca gatacgaaat ggtgttaaaa gtattggtga 781 gaagactgaa gcttcgttag tatggtttac tcctccgaga acttcaaacc agtggcttga 841 tttctggtta cgtcatcgac tccagtggtg gagaaagttt gccatgagtc catctaactt 901 cagcagcagt gactgtcagg atgaagaagg ccggaaagga acaaacttta ctacaatttt 961 cccctgggga aaggagttaa tagaaaccct gtggaactta ggagatcacg aacttttaca 1021 catgtatcct ggcaatgtgt ctaaattaca tggccgagat ggacgaaaaa atgtggttcc 1081 ttgtgttctc tctgtaaatg gggacctaga ccgaggcatg ctggcctacc tctatgattc 1141 tttccagctg acagagaact cctttacaag aaagaaaaat cttcatagaa aggtacttaa 1201 acttcaccct tgtttagccc ctattaaggt tgctttggat gtaggaagag gccccacatt 1261 ggaactaaga caggtttgtc aagggctatt taatgagtta ctagaaaatg ggatttctgt 1321 gtggcctggt tatttggaaa ctatgcagtc ctcattggaa caactttatt cgaagtatga 1381 tgaaatgagt attctcttca cagttttggt tactgaaact actttggaga atggattaat 1441 acatctgaga agcagagaca ccacaatgaa ggaaatgatg catatatcca aattaaaaga 1501 ctttttgatt aagtatatat catcagctaa gaatgtatag atttttatat ttgtataata 1561 aatattcttc tctcctaaaa aaaaaaaaaa aaaa // LOCUS HSU94747 1367 bp mRNA PRI 02-AUG-1997 DEFINITION Human WD repeat protein HAN11 mRNA, complete cds. ACCESSION U94747 NID g2290529 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1367) AUTHORS de Vetten,N., Quattroccio,F., Mol,J. and Koes,R. TITLE The an11 locus controlling flower pigmentation in petunia encodes a novel WD repeat protein conserved in yeast, plants and animals JOURNAL Unpublished REFERENCE 2 (bases 1 to 1367) AUTHORS de Vetten,N. and Koes,R. TITLE Direct Submission JOURNAL Submitted (20-MAR-1997) Genetics, Free University, de Boelelaan 1085, Amsterdam 1081 HV, The Netherlands FEATURES Location/Qualifiers source 1..1367 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" CDS 73..1101 /note="WD repeat protein; similar to petunia AN11" /codon_start=1 /product="HAN11" /db_xref="PID:g2290530" /translation="MSLHGKRKEIYKYEAPWTVYAMNWSVRPDKRFRLALGSFVEEYN NKVQLVGLDEESSEFICRNTFDHPYPTTKLMWIPDTKGVYPDLLATSGDYLRVWRVGE TETRLECLLNNNKNSDFCAPLTSFDWNEVDPYLLGTSSIDTTCTIWGLETGQVLGRVN LVSGHVKTQLIAHDKEVYDIAFSRAGGGRDMFASVGADGSVRMFDLRHLEHSTIIYED PQHHPLLRLCWNKQDPNYLATMAMDGMEVVILDVRVPCTPVARLNNHRACVNGIAWAP HSSCHICTAADDHQALIWDIQQMPRAIEDPILAYTAEGEINNVQWASTQPDWIAICYN NCLEILRV" BASE COUNT 302 a 388 c 357 g 320 t ORIGIN 1 gatctcaggc tcggctcccc gcccgccgca gcccactgtt gacccggccc gtactgcggc 61 cccgtggcca ccatgtccct gcacggcaaa cggaaggaga tctacaagta tgaagcgccc 121 tggacagtct acgcgatgaa ctggagtgtg cggcccgata agcgctttcg cttggcgctg 181 ggcagcttcg tggaggagta caacaacaag gttcagcttg ttggtttaga tgaggagagt 241 tcagagttta tttgcagaaa cacctttgac cacccatacc ccaccacaaa gctcatgtgg 301 atccctgaca caaaaggcgt ctatccagac ctactggcaa caagcggtga ctatctccgt 361 gtgtggaggg ttggtgaaac agagaccagg ctggagtgtt tgctaaacaa taataagaac 421 tctgatttct gtgctcccct gacctccttt gactggaatg aggtggatcc ttatctttta 481 ggtacctcaa gcattgatac gacatgcacc atctgggggc tggagacagg gcaggtgtta 541 gggcgagtga atctcgtgtc tggccacgtg aagacccagc tgatcgccca tgacaaagag 601 gtctatgata ttgcatttag ccgggccggg ggtggcaggg acatgtttgc ctctgtgggt 661 gctgatggct cggtgcggat gtttgacctc cgccatctag aacacagcac catcatttac 721 gaagacccac agcatcaccc actgcttcgc ctctgctgga acaagcagga ccctaactac 781 ctggccacca tggccatgga tggaatggag gtggtgattc tagatgtccg ggttccctgc 841 acacctgtcg ccaggttaaa caaccatcga gcatgtgtca atggcattgc ttgggcccca 901 cattcatcct gccacatctg cactgcagcg gatgaccacc aggctctcat ctgggacatc 961 cagcaaatgc cccgagccat tgaggaccct atcctggcct acacagctga aggagagatc 1021 aacaatgtgc agtgggcatc aactcagccc gactggatcg ccatctgcta caacaactgc 1081 ctggagatac tcagagtgta gtgttggtgg cgctgtgccc acgaggcagg ggcttttgta 1141 tttcctgcct ctgccccacc cccaaagtaa gaagaaacat gtttccagtg gccagtatgt 1201 ctttcattgc tttgcaccca ctgttaccag aagctgctct aggagttcct ggccagtcac 1261 cccatcgccc tctgtggcag actcagtgct gtgtggcgcc ctcctagccc agggctgagt 1321 tttaagattt tctctccttt cctcttctcc tttggttcct caattaa // LOCUS HSU94780 3676 bp mRNA PRI 20-NOV-1997 DEFINITION Human meningioma-expressed antigen 6 (MEA6) mRNA, complete cds. ACCESSION U94780 NID g2231998 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3676) AUTHORS Heckel,D., Brass,N., Fischer,U., Blin,N., Steudel,I., Tureci,O., Fackler,O., Zang,K.-D. and Meese,E. TITLE cDNA cloning and chromosomal mapping of a predicted coiled-coil proline-rich protein immunogenic in meningioma patients JOURNAL Hum. Mol. Genet. 6 (12), 2031-2041 (1997) REFERENCE 2 (bases 1 to 3676) AUTHORS Heckel,D., Brass,N. and Meese,E.U. TITLE Direct Submission JOURNAL Submitted (20-MAR-1997) Inst. for Human Genetics, University of Saarland; Medical School, Oskar-Orth-Strasse, Homburg, Saarland 66421, Germany FEATURES Location/Qualifiers source 1..3676 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="meningioma" /note="PCR and in-situ hybridization mapping locate the gene to chromosomes 2,3,6,7,9,13,14" gene 1..3676 /gene="MEA6" CDS 315..2729 /gene="MEA6" /note="meningioma-expressed antigen 6; this sequence is similar to MEA11, GenBank Accession Number U73682, but is longer; it may be an alternate splice variant or it may be from a separate gene" /codon_start=1 /product="MEA6" /db_xref="PID:g2231999" /translation="MEEPGATPQPYLGLLLEELRRVVAALPEGMRPDSNLYGFPWELV ICAAVVGFFAVLFFLWRSFRSVRSRLYVGREKKLALMLSGLIEEKSKLLEKFSLVQKE YEGYEVESSLKDASFEKEATEAQSLEATCEKLNRSNSELEDEILCLEKELKEEKSKHS EQDELMADISKRIQSLEDESKSLKSQVAEAKMTFQIFPMNEERLKIAIKDALNENSQL QESQKQLLQEAEVWKEQVSELNKQKVTFEDSKVHAEQVLNDKESHIKTLTERLLKMKD WAAMLGEDITDDDNLELEMNSESENGAYLDNPPKGALKKLIHAAKLNASLKTLEGERN QIYIQLSEVDKTKEELTEHIKNLQTEQASLQSENTHFENENQKLQQKLKVMTELYQEN EMKLHRKLTVEENYRLEKEEKLSKVDEKISHATEELETYRKRAKDLEEELERTIHSYQ GQIISHEKKAHDNWLAARNAERNLNDLRKENAHNRQKLTETELKFELLEKDPYALDVP NTAFGREHSPYGPSPLGWPSSETRAFLSPPTLLEGPLTLSPLLPGGGGRGSRGPGNPL DHQITNERGESSCDRLTDPHRALSDTGFLSPPWDQDRRMMFPPPGQSYPDSALPPQRQ DRFCSNSGRLSGPAELRSFNMPSLDKMDGSMPSEMESSRNDTKDDLGNLNVPDSSLPA ENEATGPGFVPPPLAPVRGPLFPVDARGPFLRRGPPFPPPPPGAMFGASRDYFPPGDF PGPPPAPFAMRNVYPPRGFPPYLPPRPGFFPPPPHSEGRSEFPSGLIPPSNEPATEHP EPQQET" misc_feature 1854..1982 /gene="MEA6" /note="possible alternatively spliced exon" polyA_signal 3135..3140 /gene="MEA6" /note="first polyA_signal" polyA_signal 3640..3645 /gene="MEA6" /note="second polyA_signal" BASE COUNT 1148 a 689 c 794 g 1045 t ORIGIN 1 tgcccggcgg aaaccaaacc gagggatggg gtggcgagga cagggtacgt cgcaggcttg 61 tgcgggtcgg gttcggacct gcgctgcctc gggatgtaaa gtataacaag agggtcggga 121 tgggcagcgt aggcctgtga ggcctgcggg tgcccctgtc ccccagctcc ccccgcagcc 181 ggctccgcag tggtccactc cggttgccgg gtgcggattc gggttccgga ccgaaggctg 241 tgtgttctcc gccgttcatt gtggccccga caggccgggg ttactgtggc gaccacgaga 301 gcagctttgg cgctatggag gagcccgggg ctacccctca accgtatttg gggctgctcc 361 tggaggagct acgcagggtt gtggcagcac tgcctgaagg tatgagacca gattctaatc 421 tttatggttt tccatgggaa ttggtgatat gtgcagctgt tgttggattt tttgctgttc 481 tctttttttt gtggagaagt tttagatcgg ttaggagtcg gctttatgtg ggacgagaga 541 aaaagcttgc tctaatgctt tctggactaa ttgaagaaaa aagtaaacta cttgaaaaat 601 ttagccttgt tcaaaaagag tatgaaggct atgaagtaga gtcatcttta aaggatgcca 661 gctttgagaa ggaggcaaca gaagcacaaa gtttggaggc aacctgtgaa aagctgaaca 721 ggtccaattc tgaacttgag gatgaaatac tctgtctaga aaaagagtta aaagaagaga 781 aatccaaaca ttctgaacaa gatgaattga tggcggatat ttcaaaaagg atacagtctc 841 tagaagatga gtcaaaatcc ctcaaatcac aagtagctga agccaaaatg accttccaga 901 tatttccaat gaatgaagaa cgactgaaga tagcaataaa agatgctttg aatgaaaatt 961 ctcaacttca ggaaagccag aaacagcttt tgcaagaagc tgaagtatgg aaagaacaag 1021 tgagtgaact taataaacag aaagtaacat ttgaagactc caaagtacat gcagaacaag 1081 ttctaaatga taaagaaagt cacatcaaga ctctgactga acgcttgtta aagatgaaag 1141 attgggctgc tatgcttgga gaagacataa cggatgatga taacttggaa ttagaaatga 1201 acagtgaatc ggaaaatggt gcttacttag ataatcctcc aaaaggagct ttgaagaaac 1261 tgattcatgc tgctaagtta aatgcttctt taaaaacctt agaaggagaa agaaaccaaa 1321 tttatattca gttgtctgaa gttgataaaa caaaggaaga gcttacagag catattaaaa 1381 atcttcagac tgaacaagca tctttgcagt cagaaaacac acattttgaa aatgagaatc 1441 agaagcttca acagaaactt aaagtaatga ctgaattata tcaagaaaat gaaatgaaac 1501 tccacaggaa attaacagta gaggaaaatt atcggttaga gaaagaagag aaactttcta 1561 aagtagatga aaagatcagc catgccactg aagagctgga gacctataga aagcgagcca 1621 aagatcttga agaagaattg gagagaacta ttcattctta tcaagggcag attatttccc 1681 atgagaaaaa agcacatgat aattggttgg cagctcggaa tgctgaaaga aacctcaatg 1741 atttaaggaa agaaaatgct cacaacagac aaaaattaac tgaaacagag cttaaatttg 1801 aacttttaga aaaagatcct tatgcactcg atgttccaaa tacagcattt ggcagagagc 1861 attccccata tggtccctca ccattgggtt ggccttcatc tgaaacaaga gcttttctct 1921 ctcctccaac tttgttggag ggtccactca cactctcacc tttgcttcca gggggaggag 1981 gaagaggctc acgaggccca gggaatcctt tggaccatca gattaccaat gaaagaggag 2041 aatcaagctg tgataggtta accgatcctc atagggctct ctctgacact gggtttctgt 2101 cacctccatg ggaccaggac cgtaggatga tgtttcctcc gccaggacaa tcatatcctg 2161 attcagccct tcctccacaa aggcaagaca gattttgttc taattctggt agactgtctg 2221 gaccagcaga actcagaagt tttaatatgc cttctttgga taaaatggat gggtcaatgc 2281 cttcagaaat ggaatccagt agaaatgata ccaaagatga tcttggtaat ttaaatgtgc 2341 ctgattcatc tctccctgct gaaaatgaag ccactggccc tggctttgtt cctccacctc 2401 ttgctccagt cagaggtcca ttgtttccag tggatgcaag aggcccattc ttgagaagag 2461 gacctccttt ccccccacct cctccaggag ccatgtttgg agcttctcga gattattttc 2521 caccagggga tttcccaggt ccaccacctg ctccatttgc aatgagaaat gtctatccac 2581 cgaggggttt tcctccttac cttcccccaa gacctggatt tttcccccca cccccacatt 2641 ctgaaggtag aagtgagttc ccctcaggtt tgattccacc ttcaaatgag cctgctactg 2701 aacatccaga accacagcaa gaaacctgac aatatttttg ctctcttcaa aagtaatttt 2761 gactgatctc attttcagtt taagtaactg ctgttactta agtgattaca cttttgctca 2821 aattgaagct taatggaatt ataattctca ggatagtatt ttgtaaataa agatgattta 2881 aatatgaatc ttatgagtaa attatttcaa ttttatttta gacggtataa ctatttcaat 2941 ttgattaatc cactattata taaacaatag tgggagtttt atatatgtaa tcttgcaggt 3001 ggggaggctt taaattctga agtctgtgtc tttatgccaa gaactgtatt tactgtggtt 3061 gtggacaaat gtgaaagtaa ctttatgctt aaataaatta tagttgattt aaagatttgt 3121 ttggcattga taataataaa atcagtagtt tttctataac tatggctcta ttaattaact 3181 tttttccttt taccaataac tttgaggtgc aaaactcaaa cttatgtggg tcttttgtgt 3241 tcaattatgt tatgacaaat gtgctctctt tcttgtaaat agacatgagt ggcccaaagc 3301 aacaaattaa tacactttta aaagtcaaaa ttgattatat tttaaagata accaggatat 3361 tatctaatgg tgaattgtag aattttgatc ttcttattca ctgagtttct tgcacggttt 3421 ctttattgct ttttttcccg cctgttcttt tgtaaggtat ttactatttt ctgtggagga 3481 tattgagatg tactacagga taactgtagt gaatgatgtg tcatcatttt gagctttgga 3541 ctcaatatct ttagtgtttc cctaaatcag atttgtaggt catgttaagc ttcttgcaca 3601 ttaatatgat tatggaagga aaggcagtga agcataacta ataaacatca taatacttaa 3661 aaaaaaaaaa aaaaaa // LOCUS HSU94831 2114 bp mRNA PRI 09-OCT-1997 DEFINITION Homo sapiens multispanning membrane protein mRNA, complete cds. ACCESSION U94831 NID g2276459 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2114) AUTHORS Chluba-de Tapia,J., de Tapia,M., Jaggin,V. and Eberle,A.N. TITLE Cloning of a human multispanning membrane protein cDNA: evidence for a new protein family JOURNAL Gene 197 (1-2), 195-204 (1997) MEDLINE 97473513 REFERENCE 2 (bases 1 to 2114) AUTHORS Chluba-de Tapia,J., de Tapia,M. and Eberle,A.N. TITLE Direct Submission JOURNAL Submitted (19-MAR-1997) Laboratoire de Chimie Enzymatique et Vectorisation, CNRS URA 1386, Faculte de Pharmacie, 74, route du Rhin, Illkirch F-67401, France FEATURES Location/Qualifiers source 1..2114 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="melanoma" CDS 16..1836 /note="hMP70; predicted molecular weight is 68 kDa; contains nine potential membrane spanning domains; similar to yeast precursor protein p24a encoded by GenBank Accession Number S25110" /codon_start=1 /product="multispanning membrane protein" /db_xref="PID:g2276460" /translation="MTVVGNPRSWSCQWLPILILLLGTGHGPGVEGVTHYKAGDPVIL YVNKVGPYHNPQETYHYYQLPVCCPEKIRHKSLSLGEVLDGDRMAESLYEIRFRENVE KRILCHMQLSSAQVEQLRQAIEELYYFEFVVDDLPIRGFVGYMEESGFLPHSHKIGLW THLDFHLEFHGDRIIFANVSVRDVKPHSLDGLRPDEFLGLTHTYSVRWSETSVERRSD RRRGDDGGFFPRTLEIHWLSIINSMVLVFLLVGFVAVILMRVLRNDLARYNLDEETTS AGSGDDFDQGDNGWKIIHTDVFRFPPYRGLLCAVLGVGAQFLALGTGIIVMALLGMFN VHRHGAINSAAILLYALTCCISGYVSSHFYRQIGGERWVWNIILTTSLFSVPFFLTWS VVNSVHWANGSTQALPATTILLLLTVWLLVGFPLTVIGGIFGKNNASPFDAPCRTKNI AREINPQPWYKSTDIHMTVGGFLPFSAISVELYYIFATVWGREQYTLYGILFFVFAIL LSVGASISIALTYFQLSGEDYRWWWRSVLSVGSTGLFIFLYSVFYYARRSNMSGAVQT VEFFGYSLLTGYVFFLMLGTISFFSSLKFIRYIYVNLKMD" BASE COUNT 434 a 542 c 527 g 611 t ORIGIN 1 tccactgcct taaggatgac agtcgtaggg aaccctcgaa gttggagctg ccagtggttg 61 ccaatcctga tactgttgct gggcacaggc catgggccag gggtggaagg cgtgacacac 121 tacaaggccg gcgaccctgt tattctgtat gtcaacaaag tgggacccta ccataaccct 181 caggaaactt accactacta tcagcttcca gtctgctgcc ctgagaagat acgtcacaaa 241 agccttagcc tgggtgaagt gctggatggg gaccgaatgg ctgagtcttt gtatgagatc 301 cgctttcggg aaaacgtgga gaagagaatt ctgtgccaca tgcagctcag ttctgcacag 361 gtggagcagc tgcgccaggc cattgaagaa ctgtactact ttgaatttgt ggtagatgac 421 ttgccaatcc ggggctttgt gggctacatg gaggagagtg gtttcctgcc acacagccac 481 aagataggac tctggaccca tttggacttc cacctagaat tccatggaga ccgaattata 541 tttgccaatg tttcagtgcg ggacgtcaag ccccacagct tggatgggtt acgacctgac 601 gagttcctag gccttaccca cacttatagc gtgcgctggt ctgagacttc agtggagcgt 661 cggagtgaca ggcgccgtgg tgacgatggt ggtttctttc ctcgaacact ggaaatccat 721 tggttgtcca tcatcaactc catggtgctt gtgtttttac tggtgggttt tgtggctgtc 781 attctaatgc gtgtgcttcg gaatgacctg gctcggtaca acttagatga ggagaccacc 841 tctgcaggtt ctggtgatga ctttgaccag ggtgacaatg gctggaaaat tatccataca 901 gatgtcttcc gcttcccccc ataccgtggt ctgctctgtg ctgtgcttgg cgtgggtgcc 961 cagttcctgg cccttggcac tggcattatt gtcatggcac tgctgggcat gttcaatgtg 1021 caccgtcatg gggccattaa ctcagcagcc atcttgttgt atgccctgac ctgctgcatc 1081 tctggctacg tgtccagcca cttctaccgg cagattggag gcgagcgttg ggtgtggaac 1141 atcattctca ccaccagtct cttctctgtg cctttcttcc tgacgtggag tgtggtgaac 1201 tcagtgcatt gggccaatgg ttcgacacag gctctgccag ccacaaccat cctgctgctt 1261 ctgacggttt ggctgctggt gggctttccc ctcactgtca ttggaggcat ctttgggaag 1321 aacaacgcca gcccctttga tgcaccctgt cgcaccaaga acatcgcccg ggagattaat 1381 ccccagccct ggtacaagtc tactgacatc cacatgactg ttggaggctt cctgcctttc 1441 agtgccatct ctgtggagct gtactacatc tttgccacag tatggggtcg ggagcagtac 1501 actttgtacg gcatcctctt ctttgtcttc gccatcctgc tgagtgtggg ggcttcgatc 1561 tccattgcac tcacctactt ccagttgtct ggggaggatt accgctggtg gtggcgatct 1621 gtgctgagtg ttggctccac cggcctcttc atcttcctct actcagtttt ctattatgcc 1681 cggcgctcca acatgtctgg ggcagtacag acagtagagt tcttcggcta ctccttactc 1741 actggttatg tcttcttcct catgctgggc accatctcct ttttttcttc cctaaagttc 1801 atccggtata tctatgttaa cctcaagatg gactgagttc tgtatggcag aactattgct 1861 gttctctccc tttcttcatg ccctgttgga ctctcctacc agcttctctt ctgaatgact 1921 gaattgtgtg atggcattgt tgccttccct ttgccctttg ggcattcctt ccccagagag 1981 ggcctggaaa ttataaatct ctatcacata aggattatat atttgaactt tttaagttgc 2041 ctttagtttt ggtcctgatt tttcttttac aattaccaaa ataaaattta ttaagaaaaa 2101 ggaaaaaaaa aaaa // LOCUS HSU94832 3009 bp mRNA PRI 30-APR-1997 DEFINITION Human KH type splicing regulatory protein KSRP mRNA, complete cds. ACCESSION U94832 NID g2055426 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3009) AUTHORS Min,H., Turck,C.W., Nicolic,J.M. and Black,D.L. TITLE A new regulatory protein, KSRP, mediates exon inclusion through an intronic splicing enhancer JOURNAL Genes Dev. (1997) In press REFERENCE 2 (bases 1 to 3009) AUTHORS Min,H., Turck,C.W., Nicolic,J.M. and Black,D.L. TITLE Direct Submission JOURNAL Submitted (20-MAR-1997) Molecular Biology Institute, University of California, at Los Angeles, 675 Circle Drive South 5-748 MRL, Los Angeles, CA 90095-1662, USA FEATURES Location/Qualifiers source 1..3009 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="LA-N-5 neuroblastoma; WERI-1 retinoblastoma" CDS 94..2229 /note="RNA binding protein; KH type RNA binding domain; alternative splicing regulator; cooperative complex formation" /codon_start=1 /product="KSRP" /db_xref="PID:g2055427" /translation="MSDYSTGGPPPGPPPPAGGGGGAGGAGGGPPPGPPGAGDRGGGG PCGGGPGGGSAGGPSQPPGGGGPGIRKDAFADAVQRARQIAAKIGGDAATTVNNSTPD FGFGGQKRQLEDGDQPESKKLASQGDSISSQLGPIHPPPRTSMTEEYRVPDGMVGLII GRGGEQINKIQQDSGCKVQISPDSGGLPERSVSLTGAPESVQKAKMMLDDIVSRGRGG PPGQFHDNANGGQNGTVQEIMIPAGKAGLVIGKGGETIKQLQERAGVKMILIQDGSQN TNVDKPLRIIGDPYKVQQACEMVMDILRERDQGGFGDRNEYGSRIGGGIDVPVPRHSV GVVIGRSGEMIKKIQNDAGVRIQFKQDDGTGPEKIAHIMGPPDRCEHAARIINDLLQS LRSGPPGPPGGPGMPPGGRGRGRGQGNWGPPGGEMTFSIPTHKCGLVIGRGGENVKAI NQQTGAFVEISRQLPPNGDPNFKLFIIRGSPQQIDHAKQLIEEKIEGPLCPVGPGPGG PGPAGPMGPFNPGPFNQGPPGAPPHAGGPPPHQYPPQGWGNTYPQWQPPAPHDPSKAA AAAADPNAAWAAYYSHYYQQPPGPVPGPAPAPAAPPAQGEPPQPPPTGQSDYTKAWEE YYKKIGQQPQQPGAPPQQDYTKAWEEYYKKQAQVATGGGPGAPPGSQPDYSAAWAEYY RQQAAYYGQTPGPGGPQPPPTQQGQQQAQ" BASE COUNT 700 a 913 c 857 g 539 t ORIGIN 1 tgtggagcga agccttgttc ccgcgttgag ccgccgccgc cgccgccgcc tcctcagctt 61 cagcctccgc gccaggcccg gccccgccgc gccatgtcgg actacagcac gggaggaccc 121 ccgcccgggc cgccgccgcc cgccggcggg ggcgggggag ccggaggcgc cgggggaggc 181 cctccgccgg gcccgccagg cgcgggggac cggggcggcg gcggtccctg cggcggcggc 241 ccgggcgggg ggtcggccgg gggcccctct cagccacccg gcggaggcgg cccgggaatc 301 cgcaaggacg ctttcgccga cgccgtgcag cgggcccgcc agattgcagc caaaattgga 361 ggcgatgctg ccacgacagt gaataacagc actcctgatt ttggttttgg gggccaaaag 421 agacagttgg aagatggaga tcaaccggag agcaagaagc tggcttccca gggagactca 481 atcagttctc aacttggacc catccatcct cccccaagga cttcaatgac agaagagtac 541 agggtcccag acggcatggt gggcctgatc attggcagag gaggtgaaca aattaacaaa 601 atccaacagg attcaggctg caaagtacag atttctccag acagcggtgg cctacccgag 661 cgcagtgtgt ccttgacagg agccccagaa tctgtccaga aagccaagat gatgctggat 721 gacattgtgt ctcggggtcg tgggggcccc ccaggacagt tccacgacaa cgccaacggg 781 ggccagaacg gcaccgtgca ggagatcatg atccccgcgg gcaaggccgg cctggtcatt 841 ggcaagggcg gggagaccat taagcagctg caggaacgcg ctggagtgaa gatgatctta 901 attcaggacg gatctcagaa tacgaatgtg gacaaacctc tccgcatcat tggggatcct 961 tacaaagtgc agcaagcctg tgagatggtg atggacatcc tccgggaacg tgaccaaggc 1021 ggctttgggg accggaatga gtacggatct cggattggcg gaggcatcga tgtgccagtg 1081 cccaggcatt ctgttggcgt ggtcattggc cggagtggag agatgatcaa gaagatccag 1141 aatgatgctg gcgtgcggat acagttcaag caagatgacg ggacagggcc cgagaagatt 1201 gctcatataa tggggccccc agacaggtgc gagcacgcag cccggatcat caacgacctc 1261 ctccagagcc tcaggagtgg tcccccaggt cctccagggg gtccaggcat gcccccgggg 1321 ggccgaggcc gaggaagagg ccaaggcaat tggggtcccc ctggcgggga gatgaccttc 1381 tccatcccca ctcacaagtg tgggctggtc atcggccgag gtggcgagaa tgtgaaagcc 1441 ataaaccagc agacgggagc cttcgtagag atctcccggc agctgccacc caacggggac 1501 cccaacttca agttgttcat catccggggt tcaccccagc agattgacca cgccaagcag 1561 cttatcgagg aaaagatcga gggtcctctc tgcccagttg gaccaggccc aggtggccca 1621 ggccctgctg gcccaatggg gcccttcaat cctgggccct tcaaccaggg gccacccggg 1681 gctcccccac atgccggggg gccccctcct caccagtacc caccccaggg ctggggcaat 1741 acctaccccc agtggcagcc gcctgctcct catgacccaa gcaaagcagc tgcagcggcc 1801 gcggacccca acgccgcgtg ggccgcctac tactcacact actaccagca gcccccgggc 1861 cccgtccccg gccccgcacc ggcccctgcg gccccaccgg ctcagggtga gccccctcag 1921 cccccaccca ccggccagtc ggactacact aaggcctggg aagagtatta caaaaagatc 1981 ggccagcagc cccagcagcc cggagcgccc ccacagcagg actacacgaa ggcttgggag 2041 gagtactaca agaagcaagc gcaagtggcc accggagggg gtccaggagc tcccccaggc 2101 tcccagccag actacagtgc cgcctgggcg gaatattaca gacagcaggc cgcttactac 2161 ggacagaccc caggtcctgg cggcccccag ccgccgccca cgcagcaggg acagcagcag 2221 gctcaatgaa tcgaatgaat gtgaacttct tcatctgtga aaaatctttt ttttttccat 2281 tttgttctgt ttgggggctt ctgttttgtt tggcgagaga gcgatggtgc cgtggggagt 2341 actggggagc cctcgcggca agcagggtgg gggggacttg ggggcatgcc gggccctcac 2401 tctctcgcct gttctgtgtc tcacatgctt tttctttcaa aattgggatc cttccatgtt 2461 gagccagcca gagaagatag cgagatctaa atctctgcca aaaaaaaaaa aaacttaaaa 2521 attaaaaaca caaagagcaa agcagaactt ataaaattat atatatatat attaaaaagt 2581 ctctattctt caccccccag ccttcctgaa cctgcctctc tgaggataaa gcaattcatt 2641 ttctcccacc ctcggccctc ttgtttttaa aataaacttt taaaaaggaa aaaaaaaagt 2701 cactcttgct atttcttttt tttagttaga ggtggaacat tccttggacc aggtgttgta 2761 ttgcaggacc ccttccccca gcagccaagc cccctcttct ctccctcccg ccctggctca 2821 gctcccgcgg ccccgcccgt cccccctccc aggactggtc tgttgtcttt tcatctgttc 2881 aagaggagat tgaaactgaa aacaaaatga gaacaacaaa aaaaattgta tggcagtttt 2941 tactttttat cgctcgtttt taacttcaca aataaatgat aacaaaacct caaaaaaaaa 3001 aaaaaaaaa // LOCUS HSU94836 4018 bp mRNA PRI 01-MAY-1997 DEFINITION Human ERPROT 213-21 mRNA, complete cds. ACCESSION U94836 NID g2058690 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4018) AUTHORS LaPlante,J., O'Rourke,F.A., Lu,X., Matthews,E., Olsen,A., Choi,J.S., Rose,E. and Feinstein,M.B. TITLE ERPROT 213-21 JOURNAL Unpublished REFERENCE 2 (bases 1 to 4018) AUTHORS LaPlante,J., O'Rourke,F.A., Lu,X., Matthews,E., Olsen,A., Choi,J.S., Rose,E. and Feinstein,M.B. TITLE Direct Submission JOURNAL Submitted (20-MAR-1997) Pharmacology, University of Connecticut Health Center, 263 Farmington Avenue, Farmington, CT 06060, USA FEATURES Location/Qualifiers source 1..4018 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="19" /map="19p13.1 (12296 on FISH map)" /cell_type="PMA stimulated erythroleukemia cells" /clone_lib="prepared by Dr. M. Poncz, University of Pennsylvannia" CDS 89..2743 /codon_start=1 /product="ERPROT 213-21" /db_xref="PID:g2058691" /translation="MTMEKQKDNPKFSFLFGGEFYSYYKCKLALEQQQLICKQQTPEL EPAATMPPLPQPPLAPAAPIPPAQGAPSMDELIQQSQWNLQQQEQHLLALRQEQVTAA VAHAVEQQMQKLLEETQLDMNEFDNLLQPIIDTCTKDAISAGKNWMFSNAKSPPHCEL MAGHLRHRITADGAHFELRLHLIYLINDVLHHCQRKQARELLAALQKVVVPIYCTSFL AVEEDKQQKIARLLQLWEKNGYFDDSIIQQLQSPALGLGQYQATLINEYSSVVQPVQL AFQQQIQTLKTQHEEFVTSLAQQQQQQQQQQQQLQMPQMEAEVKATPPPPAPPPAPAP APAIPPTTQPDDSKPPIQMPGSSEYEAPGGVQDPAAAGPRGPGPHDQIPPNKPPWFDQ PHPVAPWGQQQPPEQPPYPHHQGGPPHCPPWNNSHEGMWGEQRGDPGWNGQRDAPWNN QPDAAWNSQFEGPWNSQHEQPPWGGGQREPPFRMQRPPHFRGPFPPHQQHPQFNQPPH PHNFNRFPPRFMQDDFPPRHPFERPPYPHRFDYPQGDFPAEMGPPHHHPGHRMPHPGI NEHPPWAGPQHPDFGPPPHGFNGQPPHMRRQGPPHINHDDPSLVPNVPYFDLPAGLMA PLVKLEDHEYKPLDPKDIRLPPPMPPSERLLAAVEAFYSPPSHDRPRNSEGWEQNGLY EFFRAKMRARRRKGQEKRNSGPSRSRSRSKSRGRSSSRSNSRSSKSSGSYSRSRSRSC SRSYSRSRSRSRSRSRSSRSRSRSQSRSRSKSYSPGRRRRSRSRSPTPPSSAGLGSNS APPIPDSRLGEENKGHQMLVKMGWSGSGGLGAKEQGIQDPIKGGDVRDKWDQYKGVGV ALDDPYENYRRNKSYSFIARMKARDECK" BASE COUNT 886 a 1349 c 1084 g 699 t ORIGIN 1 gaattccccc cccgatgacc aggagcttcg aaatgtcatc gacaagctcg cccagttcgt 61 ggctcgcaat gggcccgagt ttgagaagat gactatggag aagcagaagg acaaccccaa 121 attctcgttt cttttcggag gcgaattcta cagttactac aagtgcaagc tggcgctgga 181 gcagcagcag ctcatctgca agcagcagac cccggagctg gagccagccg ccaccatgcc 241 acccctgcca cagcccccgc tggcccccgc cgcgcccatc ccgccggccc agggcgcgcc 301 atccatggac gagctcatcc agcagagcca gtggaacctc cagcagcagg agcagcactt 361 gctggcgctc agacaggagc aagtgacagc ggccgtggcc cacgcggtgg agcagcagat 421 gcagaagctt ctggaggaga cccagctaga catgaacgag tttgacaacc tcctgcagcc 481 catcatcgac acgtgcacca aggacgccat ctcggccggg aagaactgga tgttcagcaa 541 tgccaagtcc ccgccgcact gtgagctgat ggccggccac ctccgccacc gcatcacggc 601 tgatggggca cacttcgagc tgcggctgca cctcatctac ctgatcaatg acgtgctgca 661 ccactgccag cgcaagcagg cccgggagct gctggccgcc ctgcagaagg tcgtggtgcc 721 catctactgc accagcttct tggccgtgga ggaagacaag cagcagaaga tcgcccggct 781 cctgcagctc tgggagaaaa acggctactt cgatgactcc atcattcagc agctacagag 841 cccagccctg gggcttggtc agtaccaggc caccctcatc aacgagtact cctcagtggt 901 ccagccggtg cagctggcct tccagcagca gatccagacc ctcaagacgc agcacgagga 961 gtttgtcacc agcctggccc agcagcagca gcagcagcaa cagcagcagc agcagctcca 1021 gatgccgcag atggaggctg aagtcaaggc cacgcctcca ccgcctgctc cacccccggc 1081 cccagcacct gcccctgcca tcccgcccac cacccagcct gatgacagca agcctcccat 1141 ccagatgcct ggctcttcag agtacgaagc tccaggaggg gtccaggatc ctgcagctgc 1201 gggcccccgg ggccccgggc cacacgacca gatcccacca aacaagcccc cttggtttga 1261 ccagcctcac cccgtggctc cttggggcca gcagcagccg ccagagcagc caccctaccc 1321 gcaccaccag ggcggcccac cccactgccc cccctggaac aacagccatg agggcatgtg 1381 gggcgagcag cgcggtgacc ccggctggaa cggccagcgc gacgcgccct ggaacaacca 1441 gcccgacgcc gcctggaaca gccagttcga gggcccctgg aacagccagc acgagcagcc 1501 gccctggggc gggggccagc gcgagccacc cttccgcatg cagcggcccc cacacttccg 1561 ggggcccttc ccgccccacc agcagcaccc gcagttcaac cagcctccgc acccccacaa 1621 cttcaaccgc ttcccgcccc gcttcatgca ggacgacttc ccgccacggc accccttcga 1681 gcggccgccc tatccccacc gcttcgacta cccccagggg gacttccctg ccgaaatggg 1741 gccccctcac caccaccctg gccaccgcat gcctcatcct ggcatcaacg agcacccgcc 1801 ttgggctgga ccccagcacc ctgacttcgg ccctcccccc catggcttca acgggcagcc 1861 cccacacatg cggcgacagg gcccacccca catcaaccac gatgacccca gcctggtccc 1921 caatgtgccc tacttcgatc tccctgctgg gctgatggcc cccctcgtga agctggaaga 1981 tcacgagtac aagcctttgg accctaaaga catccgcctc ccacccccca tgccgcccag 2041 cgagaggctg ctggctgcag tggaggcctt ctacagcccc ccgtcccacg acaggcccag 2101 gaacagtgaa ggctgggagc agaacggcct ctatgagttc ttccgagcaa aaatgcgggc 2161 ccggcggagg aaaggccagg agaagaggaa cagcggaccc tcgaggtctc ggagcagatc 2221 caagagtcga gggcgttctt cctcccgctc caactcaaga tcctccaagt cttcaggctc 2281 gtactcaagg tcaaggtcgc gctcctgctc ccgttcctac tcccgctcca gatctagaag 2341 tcggagcagg tcgcgctcct ccagaagccg ctcccggtcc cagtcgcggt cccggtccaa 2401 gtcgtactcc ccaggaagaa gacgccggtc acggtccagg agccccaccc cgccttcctc 2461 tgctggtctg ggttctaatt cggcgcctcc catccctgac tcaaggctcg gagaagagaa 2521 caaaggccat cagatgctgg tgaagatggg ctggagcggc tcaggcggcc tcggtgcgaa 2581 ggagcaaggg atccaggacc ccatcaaggg cggggacgtc cgggataagt gggaccagta 2641 taaaggcgtg ggcgtggctc tggatgaccc ctatgagaac taccgcagga acaagagcta 2701 ctccttcatc gcccgcatga aggccaggga cgagtgtaag taggcgccca tgccgggagc 2761 cgcgccggtg gccagcggtg ccggctgtgg gaccttcctg gctgactggc agaggaagat 2821 tgcagtgaca gctcagttct tacaccgtct ccacttgtgg agagccacag gaagagaagg 2881 aagaccagcg catgcccagt gggaacgccg tgctccatgg cgtggagggc acgggtgcca 2941 cccacaaacc acaccaggag gcgctgcagc caccagggcc ccaggctcgt tttcctttat 3001 agcgaagcaa aaaacacaag accctccgcc cgagtaaatt cttcagccac gaagggatgg 3061 atgtgcatct gccctaagtc agattgaagc ctcctcctag gctccaggag gagcatgtgc 3121 aggaagggtt ggcctgagcc agagccggca ccccagctcc ttcctccagc tccccgcacc 3181 caccggccct gacctggctt cccctcggct gtgtagggac aaagccgaag cccagtgcca 3241 tgactgcccg cggattgaaa aacagaaaca caaaactttg acttgacttg cgaagtgaag 3301 cagggttttc atttttactc tccttggttt aggtctagaa agaagaatac tgacctgaga 3361 ggaggcccag tgagattctg aaacctaatt cttcgaaggc gtactggccc ttagttcagt 3421 tattttagtt gtataagttg atattttttt tctggaatgt agccatttgc tgttatctgg 3481 gaaacaagat tctaacagga aaccagccta agacacttca ggttgagcgc tgcctcggag 3541 tctgtgcccg tcgcgtcccc tgcttgagtt ttgcacttgg aagaaccctg caccggctgg 3601 cgtgtgcgac ggcccagtcc catccagagc atggagcccg accccagcca gcgccttcca 3661 ctccatcatt tcatttcaca cccccgaagg gaggggaggc caggagggga gctgctcctg 3721 ccagaagctg gtgggtgacc gttgggaatc ggccacacct ggtgtccatg ggcagcctgg 3781 tgcaattcca ttcatttgta cagaaacatt tttgaaaaat tcttttcaat aagatgcaaa 3841 atcttccaac ttttcaaccc aacgtgatga atatttgatt ttgttctaga tttcctgtag 3901 ctgtgaattg ttaaaatgta tgattcagga taaaacgtaa acacgtgctg ttagtaattt 3961 cttgtggatt tcattgtttt gccttcaaat aaagcctttt tttaaaaaaa aagaattc // LOCUS HSU94855 1231 bp mRNA PRI 30-APR-1997 DEFINITION Human translation initiation factor 3 47 kDa subunit mRNA, complete cds. ACCESSION U94855 NID g2055430 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1231) AUTHORS Asano,K., Vornlocher,H.-P., Hinnebusch,A.G. and Hershy,J.W.B. TITLE Structure of cDNAs encoding human translation initiation factor eIF3 JOURNAL Unpublished REFERENCE 2 (bases 1 to 1231) AUTHORS Asano,K., Vornlocher,H.-P., Hinnebusch,A.G. and Hershy,J.W.B. TITLE Direct Submission JOURNAL Submitted (21-MAR-1997) Laboratory of Eukaryotic Gene Regulation, NICHD, Blg. 6B, Rm 309, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..1231 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 7..1080 /note="eIF3 p47 subunit" /codon_start=1 /product="translation initiation factor 3 47 kDa subunit" /db_xref="PID:g2055431" /translation="MATPAVPVSAPPATPTPVPAAAPASVPAPTPAPAAAPVPAAAPA SSSDPAAAAAATAAPGQTPASAQAPAQTPAPALPGPALPGPFPGGRVVRLHPVILASI VDSYERRNEGAARVIGTLLGTVDKHSVEVTNCFSVPHNESEDEVAVDMEFAKNMYELH KKVSPNELILGWYATGHDITEHSVLIHEYYSREAPNPIHLTVDTSLQNGRMSIKAYVS TLMGVPGRTMGVMFTPLTVKYAYYDTERIGVDLIMKTCFSPNRVIGLSSDLQQVGGAS ARIQDALSTVLQYAEDVLSGKVSADNTVGRFLMSLVNQVPKIVPDDFETMLNSNINDL LMVTYLANLTQSQIALNEKLVNL" BASE COUNT 303 a 355 c 317 g 256 t ORIGIN 1 gacaagatgg ccacaccggc ggtaccagta agtgctcctc cggccacgcc aaccccagtc 61 ccggcggcgg ccccagcctc agttccagcg ccaacgccag caccggctgc ggctccggtt 121 cccgctgcgg ctccagcctc atcctcagac cctgcggcag cagcggctgc aactgcggct 181 cctggccaga ccccggcctc agcgcaagct ccagcgcaga ccccagcgcc cgctctgcct 241 ggtcctgctc ttccagggcc cttccccggc ggccgcgtgg tcaggctgca cccagtcatt 301 ttggcctcca ttgtggacag ctacgagaga cgcaacgagg gtgctgcccg agttatcggg 361 accctgttgg gaactgtcga caaacactca gtggaggtca ccaattgctt ttcagtgccg 421 cacaatgagt cagaagatga agtggctgtt gacatggaat ttgctaagaa tatgtatgaa 481 ctgcataaaa aagtttctcc aaatgagctc atcctgggct ggtacgctac gggccatgac 541 atcacagagc actctgtgct gatccatgag tactacagcc gagaggcccc caaccccatc 601 cacctcactg tggacacaag tctccagaac ggccgcatga gcatcaaagc ctacgtcagc 661 actttaatgg gagtccctgg gaggaccatg ggagtgatgt tcacgcctct gacagtgaaa 721 tacgcgtact acgacactga acgcatcgga gttgacctga tcatgaagac ctgctttagc 781 cccaacagag tgattggact ctcaagtgac ttgcagcaag taggaggggc atcagctcgc 841 atccaggatg ccctgagtac agtgttgcaa tatgcagagg atgtactgtc tggaaaggtg 901 tcagctgaca atactgtggg ccgcttcctg atgagcctgg ttaaccaagt accgaaaata 961 gttcccgatg actttgagac catgctcaac agcaacatca atgacctttt gatggtgacc 1021 tacctggcca acctcacaca gtcacagatt gcactcaatg aaaaacttgt aaacctgtga 1081 atggacccca agcagtacac ttgctggtct aggtattaac cccaggactc agaagtgaag 1141 gagaaatggg ttttttgtgg tcttgagtca cactgagata gtcagttgtg tgtgactcta 1201 ataaacggag cctacctttt gtaaaaaaaa a // LOCUS HSU95006 708 bp mRNA PRI 04-MAY-1997 DEFINITION Human D9 splice variant A mRNA, complete cds. ACCESSION U95006 NID g2071992 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 708) AUTHORS Scott,L.M. and Collins,S.J. TITLE Nucleotide sequence of two human D9 transcripts JOURNAL Unpublished REFERENCE 2 (bases 1 to 708) AUTHORS Scott,L.M. and Collins,S.J. TITLE Direct Submission JOURNAL Submitted (24-MAR-1997) Program in Molecular Medicine, Fred Hutchinson Cancer Research Center, 1124 Columbia Street, Seattle, WA 98104, USA FEATURES Location/Qualifiers source 1..708 /organism="Homo sapiens" /db_xref="taxon:9606" 5'UTR <1..3 CDS 4..195 /codon_start=1 /product="D9 splice variant A" /db_xref="PID:g2071993" /translation="MEGAGAGSGFRKELVSRLLHLHFKDDKTKEAAVRGVRQAQAEDA LRVDVDQLEKLLPQLLLDF" 3'UTR 196..708 polyA_signal 675..680 BASE COUNT 165 a 193 c 216 g 134 t ORIGIN 1 gtcatggagg gagcaggagc tggatccggc ttccggaagg agctggtgag caggctgctg 61 cacctgcact tcaaggatga caagaccaaa gaagcagcag tccgcggcgt gcggcaggcc 121 caggcagaag acgcgctccg tgtggacgtg gaccagctgg agaagctgct tccgcagctg 181 ctcctggact tctagggatc tcagacgtgg cttgagccac cccagaggag cccctggtcc 241 acagaagcag gccttgtgtt tccagcggct tctgataaga ggcagggaag gacctgaagg 301 atttggagtt gattcaaaca agatctctgg gagtctccag cctgtgcaag aaggggcagg 361 actgcagtgc actgcgggcc ttggaagtgt ccagtgggga cactggtgtg ggaaggggca 421 gcacctgggg agtccctgcc tctcctccct gggacaatag tgtgcatgcc acccggggtc 481 ctacaggcag gtgctgggaa aggcctggcc agcaggtagc ctgtgtgttt gacaaacagc 541 agctggcagc gctgcctcct gcccacattc ctgccacccg acatcaaagc tggcgtgtga 601 cctttccagc catgcgatat tccccttgga agatgcttcc ccaggctata aatttgttct 661 cacaaagcaa catcaataaa tcaaaactgt ctctcccaaa aaaaaaaa // LOCUS HSU95020 1823 bp mRNA PRI 01-MAY-1997 DEFINITION Human voltage-dependent calcium channel beta-4 subunit mRNA, complete cds. ACCESSION U95020 NID g2058726 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1823) AUTHORS Williams,M.E. TITLE Molecular Characterization of the Beta-4 Subunit of the Voltage-Dependent Calcium Channel from Human Brain JOURNAL Unpublished REFERENCE 2 (bases 1 to 1823) AUTHORS Williams,M.E. TITLE Direct Submission JOURNAL Submitted (24-MAR-1997) SIBIA Neurosciences, Inc, 505 Coast Boulevard South, Suite 300, La Jolla, CA 92037, USA FEATURES Location/Qualifiers source 1..1823 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="neuron" /tissue_type="brain" CDS 69..1631 /codon_start=1 /product="voltage-dependent calcium channel beta-4 subunit" /db_xref="PID:g2058727" /translation="MSSSSYAKNGTADGPHSPTSQVARGTTTRRSRLKRSDGSTTSTS FILRQGSADSYTSRPSDSDVSLEEDREAIRQEREQQAAIQLERAKSKPVAFAVKTNVS YCGALDEDVPVPSTAISFDAKDFLHIKEKYNNDWWIGRLVKEGCEIGFIPSPLRLENI RIQQEQKRGRFHGGKSSGNSSSSLGEMVSGTFRATPTSTAKQKQKVTEHIPPYDVVPS MRPVVLVGPSLKGYEVTDMMQKALFDSLKHRFDGRISITRVTADISLAKRSVLNNPSK RAIIERSNTRSSLAEVQSEIERIFELARSLQLVVLDADTINHPAQLIKTSLAPIIVHV KVSSPKVLQRLIKSRGKSQSKHLNVQLVAADKLAQCPPEMFDVILDENQLEDACEHLG EYLEAYWRATHTTSSTPMTPLLGRNLGSTALSPYPTAISGLQSQRMRHSNHSTENSPI ERRSLMTSDENYHNERARKSRNRLSSSSQHSRDHYPLVEEDYPDSYQDTYKPHRNRGS PGGYSHDSRHRL" BASE COUNT 546 a 440 c 432 g 405 t ORIGIN 1 agcccagcct cgggggccag ccccctccgc ccaccgcaca cgggctggcc atgcggcggc 61 tctgaacgat gtcctcctcc tcctacgcca agaacgggac cgcggacggg ccgcactccc 121 ccacctcgca ggtggcccga ggcaccacaa cccggaggag caggttgaaa agatccgatg 181 gcagcaccac ttcgaccagc ttcatcctca gacagggttc agcggattcc tacacaagca 241 ggccgtctga ctccgatgtc tctttggaag aggaccggga agcaattcga caggagagag 301 aacagcaagc agctatccag cttgagagag caaagtccaa acctgtagca tttgccgtga 361 agacaaatgt gagctactgc ggcgccctgg acgaggatgt gcctgttcca agcacagcta 421 tctcctttga tgctaaagac tttctacata ttaaagagaa atataacaat gattggtgga 481 taggaaggct ggtgaaagag ggctgtgaaa ttggcttcat tccaagtcca ctcagattgg 541 agaacatacg gatccagcaa gaacaaaaaa gaggacgttt tcacggaggg aaatcaagtg 601 gaaattcttc ttcaagtctt ggagaaatgg tatctgggac attccgagca actcccacat 661 caacagcaaa acagaagcaa aaagtgacgg agcacattcc tccttacgat gttgtaccgt 721 caatgcgtcc ggtggtgtta gtggggccgt cactgaaagg ttacgaggta acagacatga 781 tgcagaaagc cctctttgat tccctgaagc acaggtttga tgggaggatt tcaataacga 841 gagtgacagc tgacatttct cttgctaaga ggtctgtcct aaataatccc agcaagagag 901 caataattga acgttcgaac acccggtcca gcttagcgga agtacaaagt gaaattgaaa 961 gaatctttga gttggcaaga tctttgcaac tggttgttct tgatgcagac accatcaatc 1021 acccagcaca acttataaag acttccttag caccaattat tgttcatgta aaagtctcat 1081 ctccaaaggt tttacagcgg ttgattaaat ctagaggaaa gtcacaaagt aaacacttga 1141 atgttcaact ggtggcagct gataaacttg cacaatgccc cccagaaatg tttgatgtta 1201 tattggatga aaatcagctt gaggatgcat gtgaacatct aggggagtac ctggaggcgt 1261 actggcgtgc cacccacaca accagtagca cacccatgac cccgctgctg ggaaggaatt 1321 tgggctccac ggcactctca ccatatccca cagcaatttc tgggttacag agtcagcgaa 1381 tgaggcacag caaccactcc acagagaact ctccaattga aagacgaagt ctaatgacct 1441 ctgatgaaaa ttatcacaat gaaagggctc ggaagagtag gaaccgcttg tcttccagtt 1501 ctcagcatag ccgagatcat taccctcttg tggaagaaga ttaccctgac tcataccagg 1561 acacttacaa accccatagg aaccgaggat cacctggggg atatagccat gactcccgac 1621 ataggctttg agtctaatga aacaaaaaat attcatctgt tgacaatttg ccatagcagt 1681 gctaggataa accaatcatc ttaacttggc taacatagca cagtatttac tgtgctaatg 1741 ggctgctgtc attttatgct aagtaagggg caaaaaaaaa aattacatta tgcccttgag 1801 tctagatgga tattagatgc ccg // LOCUS HSU95032 2009 bp mRNA PRI 02-JAN-1998 DEFINITION Human growth-arrest-specific protein 2 mRNA, complete cds. ACCESSION U95032 NID g2738231 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2009) AUTHORS Collavin,L., Buzzai,M., Saccone,S., Della Valle,G., Brancolini,C. and Schneider,C. TITLE Characterization and chromosomal mapping of the human homolog of murine gas2 JOURNAL Unpublished (1997) REFERENCE 2 (bases 1 to 2009) AUTHORS Collavin,L., Buzzai,M., Saccone,S., Della Valle,G., Brancolini,C. and Schneider,C. TITLE Direct Submission JOURNAL Submitted (25-MAR-1997) Lab. Naz. C.I.B., Consortium for Interuniversitary Biotechnologies, AREA Science park - Padriciano 99, Trieste 34012, Italy FEATURES Location/Qualifiers source 1..2009 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 67..1008 /note="hGas2; gas2" /codon_start=1 /product="growth-arrest-specific protein 2" /db_xref="PID:g2738232" /translation="MCTALSPKVRSGPGLSDMHQYSQWLASRHEANLLPMKEDLALWL TNLLGKEITAETFMEKLDNGALLCQLAETMQEKFKESMDANKPTKNLPLKKIPCKTSA PSGSFFARDNTANFLSWCRDLGVDETCLFESEGLVLHKQPREVCLCLLELGRIAARYG VEPPGLIKLEKEIEQEETLSAPSPSPSPSSKSSGKKSTGNLLDDAVKRISEDPPCKCP NKFCVERLSQGRYRVGEKILFIRMLHNKHVMVRVGGGWETFAGYLLKHDPCRMLQISR VDGKTSPIQSKSPTLKDMNPDNYLVVSASYKAKKEIK" BASE COUNT 663 a 371 c 390 g 585 t ORIGIN 1 gcacgagtct gtataaaaag caattgctct tttgtttcca aaacaggtat tacaagtgga 61 taaataatgt gcactgctct gagcccaaag gtacgcagtg ggcctggcct ctctgatatg 121 catcagtata gccaatggct agccagcaga catgaagcta atttgctacc aatgaaagaa 181 gatctggcct tgtggttaac caatctatta gggaaggaga ttacagcaga aacttttatg 241 gagaagttgg acaatggtgc cttgctctgt caacttgcag aaactatgca ggagaaattc 301 aaggagagca tggatgctaa caagcccaca aagaatctac cgttgaagaa gatcccatgc 361 aaaaccagtg caccctcggg ctcctttttt gccagagaca atacagcaaa tttcttatcc 421 tggtgccgag atttaggggt ggatgaaacg tgtctatttg aatcggaagg tttggtcctc 481 cacaagcaac ccagagaagt gtgtctctgt ctgctagagc ttggccggat tgcagccagg 541 tatggtgtgg agcctcctgg tttgataaag ctggaaaaag agattgaaca agaagaaaca 601 ctttctgccc cttctccttc accttctcct tcatcaaagt cttctggaaa aaagagtaca 661 ggaaacttac tggatgatgc agtgaaacga atttctgaag atcctccttg caaatgccca 721 aacaagttct gtgtggagcg gctctcccaa ggaagatacc gagtgggaga aaagatcctc 781 ttcattagga tgctgcacaa caaacatgtc atggtccgtg tgggaggagg ctgggaaact 841 tttgcagggt atttgttgaa acacgacccc tgccgaatgc tgcagatctc ccgtgtggat 901 ggcaaaacat cccctatcca aagcaaatct ccaactctaa aggacatgaa tccagataac 961 tacttggtgg tctctgccag ttataaggct aagaaggaaa ttaagtgaaa caaattggtc 1021 atgacaaggg gaccctcata atggcctgta tccacttctc cagtatagtc agtttagttc 1081 atatgttctg aaaactgttt tggagaaaga tagacagaaa aatgtcatca tattgaaaaa 1141 tgttcaaaga gtagcactat ttatttttaa tgcttctgta gaatatgacc tattaaaaga 1201 aaatctaaac tcaaatttaa attatccaaa ttttaggcaa attattattt ctcaatatgc 1261 gaacacagta tttagaacac aatattagaa acacaattct aactaaagat aaataaggag 1321 aaaacattta ggatttgcag ataagtcaac agaaagtgcc tgctttacac tcatgagtaa 1381 aagcactcca gacattttta taattgcaag atttttatgc tttttttttt tttttttttt 1441 acaaaatgat gattagtgtg ataatgtgct gcaaaaaata tccaacagca ctacacataa 1501 gtacagagta ttcacaatag taatatgtta ctggaaaggc cacttccgaa aagtttacat 1561 tcacttggaa ggctgcctta aagtatatct cttatctatg accaaatatc tggggtactt 1621 ttagaagttt tcagtacacc cccttagaga atagcttatg agttatttca cacattcctg 1681 agcacatggc tgtgtttaga atgatctagt gtaattcata catgagctga catacgttga 1741 tactttgacg agtaaattgt acagtcaaat gttcattgat ttcatgttat ttcaaagcat 1801 tttcttatta aaaatatatc tttagttaat cctattttgc tatgctttca agtaaagtaa 1861 attcactgta tctaatgatg ttacagactt acgtataccc ttgtatacct gggacagggc 1921 tgttttatac caataaacag caaaatattg tccttactca ttcaagaatt aaaaggataa 1981 attttaaaaa tgttaaaaaa aaaaaaaaa // LOCUS HSU95044 1838 bp mRNA PRI 02-JUL-1997 DEFINITION Human zinc finger protein (FDZF2) mRNA, complete cds. ACCESSION U95044 NID g2232012 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1838) AUTHORS Wu,G. and Yu,L. TITLE Cloning of a novel KRAB-containing zinc finger gene, FDZF2 JOURNAL Unpublished REFERENCE 2 (bases 1 to 1838) AUTHORS Wu,G. TITLE Direct Submission JOURNAL Submitted (25-MAR-1997) Institute of Genetics, Fudan University, Han Dan Road 220, Shanghai 200433, Peoples Republic of China FEATURES Location/Qualifiers source 1..1838 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /chromosome="19" 5'UTR 1..220 /gene="FDZF2" gene 1..1838 /gene="FDZF2" CDS 221..1645 /gene="FDZF2" /codon_start=1 /product="zinc finger protein" /db_xref="PID:g2232013" /translation="MTTFKEAVTFKDVAVFFTEEELGLLHPAQRKLYQDVMLENFTNL LSVGHQPFHPFHFLREEKFWMMETATQREGNSGGKTIAEAGPHEDCPVQQIWEQTASD LTQSQDSIINNSHFFEQGDVPSQVEAGLSIIHTGQKPSQNGKCKQSFSDVAIFDPPQQ FHSGEKSHTCNECGKSFCYISALRIHQRVHLREKLSKCDMCGKEFSQSSCLQTHERVH TREKPFKCEQCGKGFRCRAILQVHCKLHTGEKPYICEKCGRAFIHDLKLQKHQIIHTG EKPSKCEICGKSFCLRSSLNRHYMVHTAEKLSKCEECGKGFTDSLDLHKHQIIHTGQK PYNCKECGKSFRWSSYLLIHQRIHSGEKPYRCEECGKGYISKSGLNLHPEGHTGERPY NCKECGKSFSRASSILNHKKLHCRKKPFKCEDCGKRLVHRSFCKDQQGDHNGENSSKC EDCGKRYKRRLNLDIILSLFLNDM" 3'UTR 1646..1838 /gene="FDZF2" BASE COUNT 590 a 358 c 428 g 462 t ORIGIN 1 cgggggagcc tgcggttgct acataaccgc gtagtttgag ccatttctgc gtctggcggg 61 tccttctgaa cttgtcacct tcgcttgggg tcgcaacgac ccgatgatcg atgatccaag 121 caagggaaaa gaagccttgg cggagagcgg aggcacaatt ccaccttcct ttgtgctcca 181 ttactcaaga cactgaagac tccaaaaagt aggaggaaaa atgaccacat tcaaggaggc 241 agtgaccttc aaggatgtgg ctgtgttctt cactgaggag gaactggggc tactgcaccc 301 tgcccagagg aagctgtacc aagacgtgat gcttgagaac ttcacgaacc tgctgtcagt 361 ggggcatcaa ccattccacc ctttccactt cctaagggaa gaaaagtttt ggatgatgga 421 gacagcaacc caaagagaag gaaattcagg cggcaagact attgcggaag caggaccaca 481 tgaagactgc cctgtccagc aaatctggga acaaactgca agtgacttaa cccagtctca 541 agactccatc ataaataatt ctcacttctt tgaacaaggt gatgtcccct cccaggttga 601 ggcaggacta tctataattc atacaggaca gaaaccttca cagaatggga agtgtaaaca 661 gtccttcagt gatgttgcca tctttgatcc tcctcagcag ttccactcag gagagaagtc 721 tcatacatgc aatgagtgtg gaaaaagctt ctgttacatc tcagctcttc gtattcacca 781 gagagttcac ttgagagaga aactctctaa gtgtgacatg tgtggtaagg aattcagtca 841 gagctcatgt ctgcaaactc atgagagagt ccacactaga gagaaaccat tcaaatgtga 901 gcaatgtggg aaaggcttca gatgtagagc gatacttcaa gttcactgca agttacacac 961 aggagagaaa ccttatattt gtgagaaatg tgggagggcc ttcattcacg atttaaagct 1021 tcagaaacat cagataattc acactgggga gaagccttcc aaatgtgaaa tatgtggtaa 1081 gagcttctgc cttaggtcaa gtcttaatag gcattacatg gtccacacag cagagaaact 1141 gtcaaaatgt gaggagtgtg gaaaaggctt cactgatagc ctagatttgc ataagcatca 1201 gataattcac acaggacaga aaccgtacaa ttgtaaagaa tgtgggaaga gcttcagatg 1261 gtcctcatat cttttgatcc atcagcgaat ccacagtgga gaaaaaccat acagatgtga 1321 ggagtgtggg aagggctaca ttagtaagtc aggtcttaac ttgcacccag agggtcatac 1381 tggagagaga ccttataatt gtaaggaatg tgggaagagc tttagccggg cttcaagtat 1441 tttgaatcat aagaaactcc actgccggaa aaaacccttc aaatgtgagg attgtggaaa 1501 gaggcttgta caccggtctt tctgtaaaga ccaacaagga gaccacaatg gagaaaactc 1561 atccaaatgt gaggactgtg ggaagcgcta caagaggcgc ttgaatctgg atataatttt 1621 atcattattt ttaaatgata tgtaagttgt acatatatat aggatatggt atgaaatttt 1681 aatatgtgta tataatacgt aatgatcaaa ttgatgtaat tagtgtatta ccttaaacaa 1741 ttaacatttc tttgttttgg gaaaattcaa atccattgtt ctagcgattt gaaaagaaac 1801 aataaattat tgttgattat agtcaaaaaa aaaaaaaa // LOCUS HSU95301 1020 bp mRNA PRI 01-AUG-1997 DEFINITION Human calcium-dependent group X phospholipase A2 mRNA, complete cds. ACCESSION U95301 NID g2289236 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1020) AUTHORS Lennon,G., Auffray,C., Polymeropoulos,M. and Soares,M.B. TITLE The I.M.A.G.E. Consortium: an integrated molecular analysis of genomes and their expression JOURNAL Genomics 33 (1), 151-152 (1996) MEDLINE 96224170 REFERENCE 2 (bases 1 to 1020) AUTHORS Cupillard,L., Koumanov,K., Mattei,M.G., Lazdunski,M. and Lambeau,G. TITLE Cloning, chromosomal mapping and expression of a novel human secretory phospholipase A2 JOURNAL Unpublished REFERENCE 3 (bases 1 to 1020) AUTHORS Cupillard,L., Koumanov,K., Mattei,M.G., Lazdunski,M. and Lambeau,G. TITLE Direct Submission JOURNAL Submitted (27-MAR-1997) IPMC, CNRS, 660 Route des Lucioles, Valbonne 06560, France FEATURES Location/Qualifiers source 1..1020 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" /map="16p13.1-p12" /tissue_type="lung" /dev_stage="fetal" CDS 441..938 /EC_number="3.1.1.4" /codon_start=1 /product="calcium-dependent group X phospholipase A2" /db_xref="PID:g2289237" /translation="MGPLPVCLPIMLLLLLPSLLLLLLLPGPGSGEASRILRVHRRGI LELAGTVGCVGPRTPIAYMKYGCFCGLGGHGQPRDAIDWCCHGHDCCYTRAEEAGCSP KTERYSWQCVNQSVLCGPAENKCQELLCKCDQEIANCLAQTEYNLKYLFYPQFLCEPD SPKCD" BASE COUNT 212 a 304 c 291 g 213 t ORIGIN 1 ggccttccaa agtgctggga ttacaggcgt gagtcaccgc gcccggccaa ataaaataaa 61 atgttaaagc aaattcagga ctacccctcc tccaagtctt ctgttccctt tgggcgccca 121 ggtgagcggg ggaggggctg ggggagtaat aacatcaaaa gagcgccttt tcctccctta 181 ttccgaggag acttccctgg gcctgactcc cggtcctgtc cccagcgccc cgcggcctct 241 ggagcccctt cagtgaccaa gatacagaga tcaggacgcc tttgcgccgc cccaggtgcc 301 cgcccctagc tggctctgct tgggccgcga gggaaggtga ggtcgggggc ggagccgggg 361 cgtgacagcc ggggtgtgtg tccgccgggc ttggtgcctc cggtggccct gcagcaccgt 421 cccacctctg ccaccctccg atggggccgc tacctgtgtg cctgccaatc atgctgctcc 481 tgctactgcc gtcgctgctg ctgctgctgc ttctacctgg ccccgggtcc ggcgaggcct 541 ccaggatatt acgtgtgcac cggcgtggga tcctggaact ggcaggaact gtgggttgtg 601 ttggtccccg aacccccatc gcctatatga aatatggttg cttttgtggc ttgggaggcc 661 atggccagcc ccgcgatgcc attgactggt gctgccatgg ccacgactgt tgttacactc 721 gagctgagga ggccggctgc agccccaaga cagagcgcta ctcctggcag tgcgtcaatc 781 agagcgtcct gtgcggaccg gcagagaaca aatgccaaga actgttgtgc aagtgtgacc 841 aggagattgc taactgctta gcccaaactg agtacaactt aaagtacctc ttctaccccc 901 agttcctatg tgagccggac tcgcccaagt gtgactgact accttgactt gaaatgctct 961 tttgcacaag gaaataaagc gtcctctcag taatgaaaaa aaaaaaaaaa aaaaaaaaaa // LOCUS HSU95367 3282 bp mRNA PRI 17-JUN-1997 DEFINITION Human GABA-A receptor pi subunit mRNA, complete cds. ACCESSION U95367 NID g2197000 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3282) AUTHORS Hedblom,E. and Kirkness,E.F. TITLE A novel class of GABAA receptor subunit in tissues of the reproductive system JOURNAL J. Biol. Chem. 272 (24), 15346-15350 (1997) MEDLINE 97326112 REFERENCE 2 (bases 1 to 3282) AUTHORS Hedblom,E. and Kirkness,E.F. TITLE Direct Submission JOURNAL Submitted (27-MAR-1997) Department of Molecular and Cellular Biology, The Institute for Genomic Research, 9712 Medical Center Drive, Rockville, MD 20850, USA FEATURES Location/Qualifiers source 1..3282 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="5" CDS 157..1479 /codon_start=1 /product="GABA-A receptor pi subunit" /db_xref="PID:g2197001" /translation="MNYSLHLAFVCLSLFTERMCIQGSQFNVEVGRSDKLSLPGFENL TAGYNKFLRPNFGGEPVQIALTLDIASISSISESNMDYTATIYLRQRWMDQRLVFEGN KSFTLDARLVEFLWVPDTYIVESKKSFLHEVTVGNRLIRLFSNGTVLYALRITTTVAC NMDLSKYPMDTQTCKLQLESWGYDGNDVEFTWLRGNDSVRGLEHLRLAQYTIERYFTL VTRSQQETGNYTRLVLQFELRRNVLYFILETYVPSTFLVVLSWVSFWISLDSVPARTC IGVTTVLSMTTLMIGSRTSLPNTNCFIKAIDVYLGICFSFVFGALLEYAVAHYSSLQQ MAAKDRGTTKEVEEVSITNIINSSISSFKRKISFASIEISSDNVDYSDLTMKTSDKFK FVFREKMGRIVDYFTIQNPSNVDHYSKLLFPLIFMLANVFYWAYYMYF" BASE COUNT 926 a 727 c 678 g 951 t ORIGIN 1 gggacagggc tgaggatgag gagaaccctg gggacccaga agaccgtgcc ttgcccggaa 61 gtcctgcctg taggcctgaa ggacttgccc taacagagcc tcaacaacta cctggtgatt 121 cctacttcag ccccttggtg tgagcagctt ctcaacatga actacagcct ccacttggcc 181 ttcgtgtgtc tgagtctctt cactgagagg atgtgcatcc aggggagtca gttcaacgtc 241 gaggtcggca gaagtgacaa gctttccctg cctggctttg agaacctcac agcaggatat 301 aacaaatttc tcaggcccaa ttttggtgga gaacccgtac agatagcgct gactctggac 361 attgcaagta tctctagcat ttcagagagt aacatggact acacagccac catatacctc 421 cgacagcgct ggatggacca gcggctggtg tttgaaggca acaagagctt cactctggat 481 gcccgcctcg tggagttcct ctgggtgcca gatacttaca ttgtggagtc caagaagtcc 541 ttcctccatg aagtcactgt gggaaacagg ctcatccgcc tcttctccaa tggcacggtc 601 ctgtatgccc tcagaatcac gacaactgtt gcatgtaaca tggatctgtc taaatacccc 661 atggacacac agacatgcaa gttgcagctg gaaagctggg gctatgatgg aaatgatgtg 721 gagttcacct ggctgagagg gaacgactct gtgcgtggac tggaacacct gcggcttgct 781 cagtacacca tagagcggta tttcacctta gtcaccagat cgcagcagga gacaggaaat 841 tacactagat tggtcttaca gtttgagctt cggaggaatg ttctgtattt cattttggaa 901 acctacgttc cttccacttt cctggtggtg ttgtcctggg tttcattttg gatctctctc 961 gattcagtcc ctgcaagaac ctgcattgga gtgacgaccg tgttatcaat gaccacactg 1021 atgatcgggt cccgcacttc tcttcccaac accaactgct tcatcaaggc catcgatgtg 1081 tacctgggga tctgctttag ctttgtgttt ggggccttgc tagaatatgc agttgctcac 1141 tacagttcct tacagcagat ggcagccaaa gataggggga caacaaagga agtagaagaa 1201 gtcagtatta ctaatatcat caacagctcc atctccagct ttaaacggaa gatcagcttt 1261 gccagcattg aaatttccag cgacaacgtt gactacagtg acttgacaat gaaaaccagc 1321 gacaagttca agtttgtctt ccgagaaaag atgggcagga ttgttgatta tttcacaatt 1381 caaaacccca gtaatgttga tcactattcc aaactactgt ttcctttgat ttttatgcta 1441 gccaatgtat tttactgggc atactacatg tatttttgag tcaatgttaa atttcttgca 1501 tgccataggt cttcaacagg acaagataat gatgtaaatg gtattttagg ccaagtgtgc 1561 acccacatcc aatggtgcta caagtgactg aaataatatt tgagtctttc tgctcaaaga 1621 atgaagctcc aaccattgtt ctaagctgtg tagaagtcct agcattatag gatcttgtaa 1681 tagaaacatc agtccattcc tctttcatct taatcaagga cattcccatg gagcccaaga 1741 ttacaaatgt actcagggct gtttattcgg tggctccctg gtttgcattt acctcatata 1801 aagaatggga aggagaccat tgggtaaccc tcaagtgtca gaagttgttt ctaaagtaac 1861 tatacatgtt ttttactaaa tctctgcagt gcttataaaa tacattgttg cctatttagg 1921 gagtaacatt ttctagtttt tgtttctggt taaaatgaaa tatgggctta tgtcaattca 1981 ttggaagtca atgcactaac tcaataccaa gatgagtttt taaataatga atattattta 2041 ataccacaac agaattatcc ccaatttcca ataagtccta tcattgaaaa ttcaaatata 2101 agtgaagaaa aaattagtag atcaacaatc taaacaaatc cctcggttct aagatacaat 2161 ggattcccca tactggaagg actctgaggc tttattcccc cactatgcat atcttatcat 2221 tttattatta tacacacatc catcctaaac tatactaaag cccttttccc atgcatggat 2281 ggaaatggaa gatttttttg taacttgttc tagaagtctt aatatgggct gttgccatga 2341 aggcttgcag aattgagtcc attttctagc tgcctttatt cacatagtga tggggtacta 2401 aaagtactgg gttgactcag agagtcgctg tcattctgtc attgctgcta ctctaacact 2461 gagcaacact ctcccagtgg cagatcccct gtatcattcc aagaggagca ttcatccctt 2521 tgctctaatg atcaggaatg atgcttatta gaaaacaaac tgcttgaccc aggaacaagt 2581 ggcttagctt aagtaaactt ggctttgctc agatccctga tccttccagc tggtctgctc 2641 tgagtggctt atcccgcatg agcaggagcg tgctggccct gagtactgaa ctttctgagt 2701 aacaatgaga cacgttacag aacctatgtt caggttgcgg gtgagctgcc ctctccaaat 2761 ccagccagag atgcacattc ctcggccagt ctcagccaac agtaccaaaa gtgatttttg 2821 agtgtgccag ggtaaaggct tccagttcag cctcagttat tttagacaat ctcgccatct 2881 ttaatttctt agcttcctgt tctaataaat gcacggcttt acctttcctg tcagaaataa 2941 accaaggctc taaaagatga tttcccttct gtaactccct agagccacag gttctcattc 3001 cttttcccat tatacttctc acaattcagt ttctatgagt ttgatcacct gattttttta 3061 acaaaatatt tctaacggga atgggtggga gtgctggtga aaagagatga aatgtggttg 3121 tatgagccaa tcatatttgt gattttttaa aaaaagttta aaaggaaata tctgttctga 3181 aaccccactt aagcattgtt tttatataaa aacaatgata aagatgtgaa ctgtgaaata 3241 aatataccat attagctacc caccaaaaaa aaaaaaaaaa aa // LOCUS HSU95735 741 bp mRNA PRI 16-OCT-1997 DEFINITION Human SNARE protein Ykt6 (YKT6) mRNA, complete cds. ACCESSION U95735 NID g2529436 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 741) AUTHORS McNew,J.A., Sogaard,M., Lampen,N.M., Machida,S., Ye,R.R., Lacomis,L., Tempst,P., Rothman,J.E. and Sollner,T.H. TITLE Ykt6p, a prenylated SNARE essential for endoplasmic reticulum-Golgi transport JOURNAL J. Biol. Chem. 272 (28), 17776-17783 (1997) MEDLINE 97362273 REFERENCE 2 (bases 1 to 741) AUTHORS McNew,J.A., Sogaard,M., Lampen,N.M., Machida,S., Ye,R.R., Lacomis,L., Tempst,P., Rothman,J.E. and Sollner,T.H. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) Cellular Biophysics and Biochemistry, Memorial Sloan-Kettering Cancer Center, 1275 York Ave, New York, NY 10021, USA FEATURES Location/Qualifiers source 1..741 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="pancreas" gene 1..741 /gene="YKT6" CDS 60..656 /gene="YKT6" /note="prenylated v-SNARE protein" /codon_start=1 /product="SNARE protein Ykt6" /db_xref="PID:g2507637" /translation="MKLYSLSVLYKGEAKVVLLKAAYDVSSFSFFQRSSVQEFMTFTS QLIVERSSKGTRASVKEQDYLCHVYVRNDSLAGVVIADNEYPSRVAFTLLEKVLDEFS KQVDRIDWPVGSPATIHYPALDGHLSRYQNPREADPMTKVQAELDETKIILHNTMESL LERGEKLDDLVSKSEVLGTQSKAFYKTARKQNSCCAIM" BASE COUNT 198 a 198 c 188 g 157 t ORIGIN 1 gctgcttccc tgagaacggg tcccgcagct gggcaggcgg gcggcctgag ggccggacca 61 tgaagctgta cagcctcagc gtcctctaca aaggcgaggc caaggtggtg ctgctcaaag 121 ccgcatacga tgtgtcttcc ttcagctttt tccagagatc cagcgttcag gaattcatga 181 ccttcacgag tcaactgatt gtggagcgct catcgaaagg cactagagct tctgtcaaag 241 aacaagacta tctgtgccac gtctacgtcc ggaatgatag tcttgcaggt gtggtcattg 301 ctgacaatga atacccatcc cgggtggcct ttaccttgct ggagaaggta ctagatgaat 361 tctccaagca agtcgacagg atagactggc cagtaggatc ccctgctaca atccattacc 421 cagccctgga tggtcacctc agtagatacc agaacccacg agaagctgat cccatgacta 481 aagtgcaggc cgaactagat gagaccaaaa tcattctgca caacaccatg gagtctctgt 541 tagagcgagg tgagaagcta gatgacttgg tgtccaaatc cgaggtgctg ggaacacagt 601 ctaaagcctt ctataaaact gcccggaaac aaaactcatg ctgtgccatc atgtgatgca 661 gcctccagag gcccaatgct ggaatggcac catcattcac atcagaactg cagcccctgg 721 aaaagaagag acatcgttcc t // LOCUS HSU95740 149490 bp DNA PRI 21-AUG-1997 DEFINITION Human chromosome 16p13.1 BAC clone CIT987SK-362G6 complete sequence. ACCESSION U95740 NID g2340040 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 149490) AUTHORS Adams,M.D., Loftus,B.J., Zhou,L., Phillips,C., Brandon,R., Fuhrmann,J., Kim,U.J., Kerlavage,A.R. and Venter,J.C. TITLE Human chromosome 16p13.1 BAC clone CIT987SK-362G6 complete sequence JOURNAL Unpublished REFERENCE 2 (bases 1 to 149490) AUTHORS Adams,M.D. TITLE Direct Submission JOURNAL Submitted (28-MAR-1997) The Institute for Genomic Research, 9712 Medical Center Dr., Rockville, MD 20850, USA REFERENCE 3 (bases 1 to 149490) AUTHORS Adams,M.D., Loftus,B.J., Zhou,L., La Bombard,M., Kim,U.J. and Venter,J.C. TITLE Direct Submission JOURNAL Submitted (21-AUG-1997) The Institute for Genomic Research, 9712 Medical Center Dr., Rockville, MD 20850, USA COMMENT BAC clone CIT987SK-362G6 is located in band 16p13.1 of chromosome 16. Genes were identified by a combination of five methods: XGRAIL (available by anonymous ftp from arthur.epm.ornl.gov), Genefinder (available by anonymous ftp from colin@u.washington.edu), GENSCAN (availible using the e-mail server at genscan@gnomic.stanford.edu), searches of the EST database at TIGR (http://www.tigr.org/tdb/hcd/hcd.html) and searches against a peptide database. Repeats were identified using Censor (Jurka, J., Klonowski, P. Dagman, V., Pelton, P. Censor-a program for the identification and elimination of repetitive elements from DNA sequences. Computers Chem 20: 119-121 (1996); available by anonymous ftp from ncbi.nlm.nih.gov). FEATURES Location/Qualifiers source 1..149490 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" /map="16p13.1" /clone="362G6" repeat_region 339..767 /rpt_family="L1" mRNA join(838..961,1838..2010,6542..6740,7459..7573,9246..9408, 10431..10609,14807..15043,16337..16512,19564..19784, 20607..20749,21233..21366,22616..22818,23823..24050, 27943..28173,29599..29841,30138..30350,31289..31364, 31788..31913,33317..33487,35404..37953) /gene="362G6.1" gene 838..37953 /gene="362G6.1" CDS join(838..961,1838..2010,6542..6740,7459..7573,9246..9408, 10431..10609,14807..15043,16337..16512,19564..19784, 20607..20749,21233..21366,22616..22818,23823..24050, 27943..28173,29599..29841,30138..30350,31289..31364, 31788..31913,33317..33487,35404..35648) /gene="362G6.1" /codon_start=1 /product="unknown protein CIT987SK_362G6_1" /db_xref="PID:g1930141" /translation="MQVTVAHINATAKNAADDKLRQSLRRFANTHTAPATVVLVSTDV NFALELSDLRHRHGFHIILVHKNQASEALLHHANELIRFEEFISDLPPRLPLKMPQCH TLLYVYNLPANKDGKSVSNRLRRLSDNCGGKVLSITGCSAILRFINQDSAERAQKRME NEDVFEKKDKEETVFQVSYPSAFSKLVASRQVSPLLASQSWSSRASPLAFNIANSSSE ADCPDPFANGADVQVSNIDYRLSRKELQQLLQEAFARHGKVKSVELSPHTDYQLKAVV QMENLQDAIGAVNSLHRYKIGSKKILVSLATGAASKSLSLLRFGHKLNVSDLYKLTDT VAIREQGNGRLVCLLPSSQARQSPLGSSQSHDGSSTNCSPIIFEELEYHEPVCRQHCS NKDFSFPDCYIAEFGDLEVVQENQGGVPLEHFITCVPGVNIATAQNGIKVVKWIHNKP PPPNTDPWLLRSKSPVGNPQLIQFSREVIDLLKSQPSCVIPISHFIPSYHHHFAKQCR VSDYGYSKLIELLEAVPHVLQILGMGSKRLLTLTHRAQVKRFTQDLLKLLKSQASKQV IVREFSQAYHWCFSKDWDVTEYGVCELIDIVSEIPDTTICLSQQDNEMVICIPKRERT QDEIERTKQFSKDVVDLLRHQPHFRMPFNKFIPSYHHHFGRQCKLAYYGFTKLLELFE AIPDTLQVLECGEEKILTLTEVERFKALAAQFVKLLRSQKDNCLMMTDLLTEYAKTFG YTFRLQDYDVSSISALTQKLCHVVKVADIESGRQIQLINRKSLRSLTAQLLVLLMSWE GTTHLSVEELKRHYESTHNTPLNPCEYGFMTLTELLKSLPYLVETAAAFDLQIELKFA GGYSLLLEEILEYIPPPFLSPLPSFLPPPPFFFFVFFVIFIFFIFVFVQFVVFVFFIV VFVFFIVVFVFTNDKMEECVKLTSLYLFAKNVRSLLHTYHYQQIFLHEFSMAYTKYVG ETLQPKTYGHSSVEELLGAIPQVVVQFLQKNGKELSCENYLNLYAKAGRLSSLSLSPA NHENQPSEGERILEVPESHTASELKLGADGSGPSHTEQELLRLTDDSPVDLLCAPVPS CLPSPQLRPDPVILQSADLIQFEERPQEPSEIMILNQEEKMEIPIPGKSKTLTSDSSS SCISAAVPVPPCPSSETSESLLSKDPVESPAKKQPKNRVKLAANFSLAPITKL" repeat_region complement(1204..1493) /rpt_family="ALU" repeat_region 1234..1477 /rpt_family="SVA" repeat_region complement(3211..3291) /rpt_family="SVA" repeat_region 3212..3297 /rpt_family="ALU" repeat_region complement(3623..3802) /rpt_family="SVA" repeat_region 3627..3919 /rpt_family="ALU" repeat_region complement(4728..5004) /rpt_family="ALU" repeat_region 4838..5114 /rpt_family="SVA" repeat_region complement(5060..5272) /rpt_family="ALU" repeat_region 5115..5261 /rpt_family="SVA" repeat_region complement(5371..5664) /rpt_family="ALU" repeat_region 5373..5658 /rpt_family="SVA" repeat_region complement(8019..8299) /rpt_family="SVA" repeat_region 8125..8420 /rpt_family="ALU" repeat_region complement(8336..8414) /rpt_family="SVA" repeat_region complement(8757..9040) /rpt_family="SVA" repeat_region 8763..9052 /rpt_family="ALU" repeat_region 9660..9940 /rpt_family="ALU" repeat_region complement(12170..12460) /rpt_family="ALU" repeat_region 12181..12449 /rpt_family="SVA" repeat_region complement(13058..13348) /rpt_family="ALU" repeat_region complement(14106..14324) /rpt_family="MIR" repeat_region complement(15429..15759) /rpt_family="MER7" repeat_region complement(16711..16998) /rpt_family="ALU" repeat_region 16722..17131 /rpt_family="SVA" repeat_region complement(17004..17128) /rpt_family="ALU" repeat_region 17262..17539 /rpt_family="ALU" repeat_region complement(17275..17529) /rpt_family="SVA" repeat_region complement(17542..17836) /rpt_family="ALU" repeat_region 17552..18005 /rpt_family="SVA" repeat_region complement(17853..18140) /rpt_family="ALU" repeat_region complement(18672..18972) /rpt_family="ALU" repeat_region 18793..18962 /rpt_family="SVA" repeat_region complement(20887..21164) /rpt_family="ALU" repeat_region 20926..21153 /rpt_family="SVA" repeat_region complement(21829..21998) /rpt_family="MER8" repeat_region complement(23094..23381) /rpt_family="ALU" repeat_region 23105..23387 /rpt_family="SVA" repeat_region complement(23561..23723) /rpt_family="L1MA10" repeat_region complement(24785..25074) /rpt_family="ALU" repeat_region 28868..28994 /rpt_family="ALU" STS complement(37535..37954) /db_xref="dbSTS:H09273" repeat_region complement(39128..39411) /rpt_family="ALU" repeat_region 39245..39401 /rpt_family="SVA" repeat_region complement(39986..40245) /rpt_family="ALU" repeat_region complement(40456..40651) /rpt_family="ALU" repeat_region complement(40742..41030) /rpt_family="ALU" repeat_region 40774..41025 /rpt_family="SVA" repeat_region complement(41762..42066) /rpt_family="ALU" repeat_region 42055..42382 /rpt_family="SVA" repeat_region complement(42084..42232) /rpt_family="ALU" repeat_region complement(43121..43411) /rpt_family="ALU" repeat_region 43121..43414 /rpt_family="SVA" repeat_region complement(43543..43732) /rpt_family="ALU" repeat_region complement(43733..43797) /rpt_family="ALU" repeat_region complement(43811..43969) /rpt_family="ALU" mRNA complement(join(44083..45624,49103..49185,51010..51124, 64285..64358,116913..117193)) /gene="362G6.2" gene complement(44083..117193) /gene="362G6.2" CDS complement(join(45512..45624,49103..49185,51010..51124, 64285..64358,116913..>117193)) /gene="362G6.2" /note="unknown protein CIT987SK_362G6_2" /codon_start=1 /db_xref="PID:g1930142" /translation="GVLTNTHLRHRLKSVLKVYDGRRARLCPGTMSGSVPWCSLGVLM VVFPVSCTDQLDIISMAETTMMPEEIELEMAKIQRLREVLVRRESELRFMMDDIQLCK DIMDLKQELQNLVAIPEKEKTKLQKQREDELIQKIHKLVQKRDFLVDDAEVERLREQE EDKEMADFLRIKLKPLDKVTKSPASSRAEKKAEPPPSKPTVAKTGLALIKDCCGATQC NIM" repeat_region 46480..46771 /rpt_family="SVA" repeat_region complement(46506..46791) /rpt_family="ALU" repeat_region complement(47776..48063) /rpt_family="ALU" repeat_region 47798..48052 /rpt_family="SVA" repeat_region complement(48426..48599) /rpt_family="MER45" repeat_region complement(48756..49047) /rpt_family="ALU" repeat_region 48785..49031 /rpt_family="SVA" repeat_region complement(49624..49827) /rpt_family="ALU" repeat_region complement(49933..50225) /rpt_family="ALU" repeat_region 49951..50206 /rpt_family="SVA" repeat_region 51157..51434 /rpt_family="ALU" repeat_region complement(51485..51771) /rpt_family="SVA" repeat_region 51497..51783 /rpt_family="ALU" repeat_region 51938..52047 /rpt_family="ALU" repeat_region 52049..52236 /rpt_family="ALU" repeat_region 52253..52394 /rpt_family="L1PA11" repeat_region 53482..53754 /rpt_family="MLT1A" repeat_region 53980..54194 /rpt_family="ALU" repeat_region complement(54000..54205) /rpt_family="SVA" repeat_region 54948..55107 /rpt_family="LTR9" repeat_region 55156..55589 /rpt_family="LTR9" repeat_region 55593..55788 /rpt_family="MLT1C" repeat_region 55995..56292 /rpt_family="ALU" repeat_region complement(57256..57545) /rpt_family="ALU" repeat_region 57267..57539 /rpt_family="SVA" repeat_region 57755..57970 /rpt_family="LTR8" repeat_region 57971..58342 /rpt_family="THE1B" repeat_region 58367..58606 /rpt_family="THE1B" repeat_region 58714..59088 /rpt_family="LTR8" repeat_region complement(59249..59613) /rpt_family="THE1B" repeat_region complement(59761..59914) /rpt_family="L1MB7" repeat_region complement(60255..60520) /rpt_family="MER4C" repeat_region complement(60623..60690) /rpt_family="MER4A" repeat_region 60687..60973 /rpt_family="ALU" repeat_region complement(60958..61078) /rpt_family="MER4C" repeat_region complement(61152..61777) /rpt_family="MER41" repeat_region 62733..63122 /rpt_family="MSTA" repeat_region complement(63118..63406) /rpt_family="ALU" repeat_region complement(63697..63985) /rpt_family="ALU" repeat_region 63701..63810 /rpt_family="SVA" repeat_region 63815..63965 /rpt_family="SVA" repeat_region complement(64572..64875) /rpt_family="ALU" repeat_region 64692..64855 /rpt_family="SVA" repeat_region complement(65143..65439) /rpt_family="ALU" repeat_region complement(65671..65835) /rpt_family="ALU" repeat_region complement(66487..66774) /rpt_family="ALU" repeat_region 68461..68570 /rpt_family="MSTA" repeat_region 68589..68814 /rpt_family="MSTA" repeat_region 68823..69103 /rpt_family="ALU" repeat_region complement(68845..69092) /rpt_family="SVA" repeat_region complement(69107..69241) /rpt_family="ALU" repeat_region 69611..69823 /rpt_family="MER20" repeat_region complement(70030..70315) /rpt_family="ALU" repeat_region complement(70442..70730) /rpt_family="ALU" repeat_region 70773..71064 /rpt_family="ALU" repeat_region complement(70775..71032) /rpt_family="SVA" repeat_region 71086..71374 /rpt_family="ALU" repeat_region complement(71541..71831) /rpt_family="ALU" repeat_region 71556..71820 /rpt_family="SVA" repeat_region complement(72323..72507) /rpt_family="MER30" repeat_region complement(74290..74576) /rpt_family="ALU" repeat_region 74296..74406 /rpt_family="SVA" repeat_region 74410..74580 /rpt_family="SVA" repeat_region 75458..75744 /rpt_family="ALU" repeat_region complement(75469..75733) /rpt_family="SVA" repeat_region complement(76155..76443) /rpt_family="ALU" repeat_region 76171..76432 /rpt_family="SVA" repeat_region 76970..77251 /rpt_family="ALU" repeat_region 77376..77664 /rpt_family="ALU" repeat_region complement(79031..79341) /rpt_family="SVA" repeat_region 79034..79323 /rpt_family="ALU" repeat_region complement(79342..79448) /rpt_family="ALU" repeat_region complement(79894..80179) /rpt_family="ALU" repeat_region 79953..80182 /rpt_family="SVA" repeat_region 80453..80686 /rpt_family="THE1B" repeat_region 81704..81983 /rpt_family="ALU" repeat_region complement(81715..81982) /rpt_family="SVA" repeat_region 82102..82381 /rpt_family="ALU" repeat_region complement(82113..82234) /rpt_family="SVA" repeat_region complement(82236..82367) /rpt_family="SVA" repeat_region complement(82656..82942) /rpt_family="ALU" repeat_region 82776..82931 /rpt_family="SVA" repeat_region complement(82985..83284) /rpt_family="ALU" repeat_region 84057..84329 /rpt_family="ALU" repeat_region complement(84079..84327) /rpt_family="SVA" repeat_region complement(84708..84828) /rpt_family="SVA" repeat_region 84753..84990 /rpt_family="ALU" repeat_region complement(84834..85306) /rpt_family="SVA" repeat_region 85030..85318 /rpt_family="ALU" repeat_region complement(86054..86336) /rpt_family="ALU" repeat_region 87069..87358 /rpt_family="ALU" repeat_region complement(87080..87369) /rpt_family="SVA" repeat_region complement(88514..88802) /rpt_family="ALU" repeat_region complement(90285..90717) /rpt_family="SVA" repeat_region 90440..90729 /rpt_family="ALU" repeat_region 90737..90903 /rpt_family="ALU" repeat_region complement(90988..91398) /rpt_family="MER21" repeat_region complement(91808..92080) /rpt_family="ALU" repeat_region 91820..92092 /rpt_family="SVA" repeat_region complement(92189..92461) /rpt_family="ALU" repeat_region 92194..92291 /rpt_family="SVA" repeat_region 92307..92780 /rpt_family="SVA" repeat_region complement(92492..92779) /rpt_family="ALU" repeat_region 92880..93026 /rpt_family="MER1A" repeat_region 93027..93410 /rpt_family="MER1A" repeat_region complement(93882..94170) /rpt_family="ALU" repeat_region 93883..94267 /rpt_family="SVA" repeat_region 94384..94671 /rpt_family="ALU" repeat_region complement(94435..94516) /rpt_family="SVA" repeat_region complement(94521..94959) /rpt_family="SVA" repeat_region 94688..94976 /rpt_family="ALU" repeat_region 96263..96616 /rpt_family="THE1B" repeat_region complement(96685..96975) /rpt_family="ALU" repeat_region 96701..97085 /rpt_family="SVA" repeat_region complement(96983..97273) /rpt_family="ALU" repeat_region complement(97780..98022) /rpt_family="ALU" repeat_region 97784..98020 /rpt_family="SVA" repeat_region complement(98793..98911) /rpt_family="SVA" repeat_region 98810..98937 /rpt_family="ALU" repeat_region complement(99487..100088) /rpt_family="SVA" repeat_region 99497..99785 /rpt_family="ALU" repeat_region 99814..100099 /rpt_family="ALU" repeat_region 100948..101068 /rpt_family="ALU" repeat_region complement(101062..101162) /rpt_family="MSTA" repeat_region 101135..101424 /rpt_family="ALU" repeat_region complement(101163..101413) /rpt_family="SVA" repeat_region complement(101443..101743) /rpt_family="THE1B" repeat_region 101742..101921 /rpt_family="ALU" repeat_region 101934..102034 /rpt_family="MLT1D" repeat_region 102162..102496 /rpt_family="MLT1D" repeat_region 103017..103306 /rpt_family="ALU" repeat_region complement(103031..103303) /rpt_family="SVA" repeat_region complement(103313..103618) /rpt_family="MLT1A" repeat_region 103888..103990 /rpt_family="MLT1A" repeat_region 104098..104260 /rpt_family="MSTC" repeat_region complement(104729..105018) /rpt_family="ALU" repeat_region 105018..105277 /rpt_family="L1MB3" repeat_region complement(105670..105957) /rpt_family="ALU" repeat_region 105690..105786 /rpt_family="SVA" repeat_region 105791..105942 /rpt_family="SVA" repeat_region 106155..106353 /rpt_family="MER3" repeat_region complement(106534..106572) /rpt_family="MLT2D" repeat_region complement(106629..106919) /rpt_family="ALU" repeat_region 106750..106913 /rpt_family="SVA" repeat_region complement(106921..107016) /rpt_family="MLT2D" repeat_region complement(107038..107082) /rpt_family="MLT2C2" repeat_region complement(107084..107360) /rpt_family="MLT2D" repeat_region complement(107563..107855) /rpt_family="ALU" repeat_region 107563..107915 /rpt_family="SVA" repeat_region 108794..109055 /rpt_family="LTR1" repeat_region 109056..109663 /rpt_family="LTR1" repeat_region 109665..109845 /rpt_family="MIR" repeat_region 111093..111545 /rpt_family="LOR1" repeat_region complement(111544..111838) /rpt_family="SVA" repeat_region 111555..111850 /rpt_family="ALU" repeat_region 112658..112940 /rpt_family="ALU" repeat_region complement(113306..113429) /rpt_family="MIR2" repeat_region complement(114826..115114) /rpt_family="ALU" repeat_region 114839..115103 /rpt_family="SVA" repeat_region 115520..115697 /rpt_family="MIR" repeat_region complement(115914..116199) /rpt_family="ALU" repeat_region complement(116211..116359) /rpt_family="ALU" repeat_region complement(116486..116606) /rpt_family="MIR" repeat_region complement(117252..117542) /rpt_family="ALU" repeat_region 117254..117833 /rpt_family="SVA" repeat_region complement(117550..117831) /rpt_family="ALU" repeat_region complement(118066..118338) /rpt_family="ALU" repeat_region complement(118440..118727) /rpt_family="ALU" repeat_region 119445..120088 /rpt_family="SVA" repeat_region complement(119455..119744) /rpt_family="ALU" repeat_region complement(119748..119880) /rpt_family="ALU" repeat_region complement(120322..120612) /rpt_family="ALU" repeat_region 120338..120608 /rpt_family="SVA" repeat_region 121341..121630 /rpt_family="ALU" repeat_region complement(122110..122560) /rpt_family="MLT1C" repeat_region complement(123507..123785) /rpt_family="ALU" repeat_region 125418..125869 /rpt_family="LOR1" repeat_region 125872..126157 /rpt_family="ALU" repeat_region complement(125874..126146) /rpt_family="SVA" repeat_region 126176..126258 /rpt_family="LOR1" repeat_region complement(126448..126735) /rpt_family="ALU" repeat_region 126569..126730 /rpt_family="SVA" repeat_region 127591..127885 /rpt_family="ALU" repeat_region 128518..128682 /rpt_family="MIR" repeat_region complement(128673..128745) /rpt_family="MER2" repeat_region 128746..129050 /rpt_family="ALU" repeat_region complement(128757..128929) /rpt_family="SVA" repeat_region complement(129052..129320) /rpt_family="MER2" repeat_region complement(130420..130845) /rpt_family="SVA" repeat_region 130421..130564 /rpt_family="ALU" repeat_region 130566..130857 /rpt_family="ALU" repeat_region 130866..131038 /rpt_family="ALU" repeat_region complement(130886..131033) /rpt_family="SVA" repeat_region complement(131191..131431) /rpt_family="MER33" repeat_region 131390..131679 /rpt_family="ALU" repeat_region complement(131432..131637) /rpt_family="SVA" repeat_region complement(132936..133145) /rpt_family="ALU" repeat_region 132941..133087 /rpt_family="SVA" repeat_region 133088..133145 /rpt_family="SVA" repeat_region complement(133399..133595) /rpt_family="MER20" repeat_region complement(134695..134775) /rpt_family="MLT1E" repeat_region complement(134799..135253) /rpt_family="LOR1" repeat_region complement(135254..135455) /rpt_family="MLT1D" repeat_region 135505..135791 /rpt_family="ALU" repeat_region complement(135511..135780) /rpt_family="SVA" repeat_region 135857..136019 /rpt_family="MIR" repeat_region 136677..136966 /rpt_family="ALU" repeat_region complement(136688..136954) /rpt_family="SVA" repeat_region 136967..137126 /rpt_family="ALU" repeat_region complement(136987..137129) /rpt_family="SVA" repeat_region 137159..137448 /rpt_family="ALU" repeat_region complement(137169..137432) /rpt_family="SVA" repeat_region complement(139024..139268) /rpt_family="ALU" repeat_region 139037..139309 /rpt_family="SVA" repeat_region complement(139416..139705) /rpt_family="ALU" repeat_region 139449..139701 /rpt_family="SVA" repeat_region 140870..140978 /rpt_family="ALU" repeat_region 141193..141488 /rpt_family="ALU" repeat_region 141760..142044 /rpt_family="ALU" repeat_region complement(142166..142449) /rpt_family="SVA" repeat_region 142178..142465 /rpt_family="ALU" repeat_region complement(143517..143790) /rpt_family="ALU" repeat_region 143611..143771 /rpt_family="SVA" repeat_region complement(144197..144493) /rpt_family="MER41" repeat_region complement(145345..145570) /rpt_family="MIR" repeat_region complement(146133..146382) /rpt_family="ALU" repeat_region 147457..147608 /rpt_family="L1MB7" repeat_region 147639..147928 /rpt_family="ALU" repeat_region complement(147650..147912) /rpt_family="SVA" repeat_region 148077..148229 /rpt_family="MSTA" repeat_region 148233..148486 /rpt_family="MSTA" repeat_region complement(148617..148938) /rpt_family="SVA" repeat_region 148661..148950 /rpt_family="ALU" repeat_region 149398..149488 /rpt_family="MIR" BASE COUNT 39770 a 34238 c 35031 g 40450 t 1 others ORIGIN 1 aagcttgcat gggtagcatg gacagttgac ataaaacaca gccttaaatc ttattttagt 61 catttgtctg atctttacca cctccaaatt gttggcagaa gctattagag caaatttaac 121 tgagttcaaa tttcacttcc tatattctac ttattttctg aaaggaggat gaaagaaatg 181 caaaaacaga agattggaga tgagagcagg gttagtcaca aacccagttc atatcaagaa 241 ccaatgttga gattactgag atctgttgta attttttacc attcccaata gtaataagaa 301 tttttttctc aagaattact tattaaaata tgggcagaca tcagtgatag actggataaa 361 gaaaatgtgg cacataatac acgatggaat actatgcagc cataaaaaag aatgagataa 421 tgtcctttgc agggacatgg atgaagctgg aagtcattat tctcagcaaa ctaatgtagg 481 aacagaaaac caaacagtgc atgttcttgc cttataagtg ggagttgaac agtgagaaca 541 catggacaca gggaggggaa aaacacacac tggggtctgt ctgtcggggt tgggggaagg 601 ggaggaagag cattagaaca aatacctaat acatgtgggg gcataaaacc tagatgatgg 661 gttgataggt gcagcaaacc accacggcac atgtatacct atgtaacaaa cctgcacgtt 721 ctgcacatgt accccagaac ttaaagtata ataaaattta aaaaataaaa aatatgggca 781 gaatgtgtag caattttcta gcctcagctc ctgcttattc acccttatta taaattaatg 841 caggtaaccg ttgcccacat caatgctact gcaaagaatg ccgctgatga taaactgcgg 901 cagagtctcc gcagatttgc aaatacacac actgctccag ccacagtggt tcttgtgtca 961 agtaagtaca gaaacatctg ctaatttcag ctaaactcac tgtgtgggaa attatcatag 1021 aagccagcaa catgtgaaag taaaatgtat ccccaaagag gctgaagtac atgagacatg 1081 tttatggtta ttgttgagaa ctgtttagaa aaaaaattgg gtctaggtgg atttctgtat 1141 ttcctccttg gaacttgagt tattgtggga ccaggtttat tttttatttt atttttttgt 1201 ttatttattt tgagatggag tttcgctttt gttgcccagg ctggagtgcg atggcgcaat 1261 ctcggctcac tgcaacctcc gcctcccagg ttgaagtgat tcttctgcct cagcctccca 1321 agtagctagg attacaggca tgttccacca cgcccagcta attttgtatt tttagtagag 1381 atgggggttt ctccagttgg tcaggctggt ctcaaactcc tgacctcagg tgatccaccc 1441 tcctcggcct cccaaagtgc tgggattaca ggcgtgaacc accacgcccg gccgggacca 1501 ggtttattaa agaaaaatgt gatttttttt ttttttttta ccttgtgctc agtattaata 1561 atttttaaag tttaagttaa ttttcaaatt tgagtctccc caacatattt gctccttttt 1621 aatactgaaa gtaacatatg tgtcttggaa ttcttttggt aagattgtct tacattcttc 1681 tgatgtgaaa cgtgtttctt tcctaagaaa cgggggaagc aattttgaat atagtccagc 1741 ctttcctttc ttctcagtgg gtgtcaatag agttgtctgt ctttgtcaag tgtttctaat 1801 gtttggggat ggggttggtc tgtacctttt ctttcagctg atgtcaattt tgcattggaa 1861 cttagtgacc tgagacacag gcatggtttc cacattattt tggtccataa aaaccaggcc 1921 tcagaagcac tgctgcatca tgctaacgag ctgatcagat ttgaagagtt catttccgac 1981 ttgcccccca ggttaccact aaaaatgcca gtaagtgggt ttgcgttatt tttgccattt 2041 tccaatatta cattgtggag gctgaaagac agcccataat aagggggttg ccaggcccag 2101 atggggctgt tctttgtaag aggtgggtta gattttgaag taaggcttag aaaccttcgg 2161 ttcttctcac aaatatacag ataattggta tgtatgaagt ttgcttttat ttatttcaaa 2221 atatcatatg aaattgactt gtagatttac cttcccaact tgccagtgtc ttcaaagcta 2281 gcagtgtttt gtgtcaggtg gatagaaata acggattaaa agtcatgatt ctttttgctt 2341 ccaagtatta tgggtagaga atatggcgag ggttctcaca tacttgtgtg gttggcacca 2401 aataagtgga gtcaattgtg catttttttc ggtatttaat ccttaagctt cagcgggcct 2461 ctgagtctgt gctttctatt gtgcattccc tgtggtgcat ggacacaatt agcaggagtg 2521 gttattgctc cacctgtgtg tagcctgttt atgccatata ctgggaggag ggtgtgagca 2581 catttccagg actagcattt ctgatagact tgacttggga taagaatgtt tattgcagtg 2641 ctgggctgtg tttaagccgc tggcgaatat gtgtgtaact ctgaaagaat taatgataat 2701 ggagaaggag ccatggtgtg ggtgatccag gagagtgggc aatcccatac gttaaatgaa 2761 agtcttcaca tataatttaa taactgagag ttacagaagt acgttagaca tgatctcatg 2821 caaagtcttc atttaaggga gaaagaaatc agctcttaga aaaagtaaac gtgtcattgg 2881 tgattgcagc agttaagaga gtgggccccg ggtatcctga ttccggcact tgcctgctag 2941 tcacctgtgt ccccgatacc caacaagccc cctgcggtct ggcttaagtt gaaactcatg 3001 ctatggacag ttgagagttg tggataccag aggcttaccc agggagtaag catatacagg 3061 ccttacctta atcttttgag gctctttcct aactcacctt tgtcattatt tgtttttagt 3121 cacttggctg tagttacatt tcttcccttc tgcttctgtt tgacggtcac ttttctaact 3181 ttgaaggcct tatgtaaata cggtgttttt tcctgggagg tgaaggttgc agtgagctga 3241 gatcgcgcca ctgcactcca gcctggtcga cagagcgaga ctctgtttca aaaaaaaaaa 3301 aaaaagtaaa tgtagtgttt taggtaagaa cttgcccact ttggagattc ttattcctta 3361 tggttaaaaa aaccaaaaca atgttttgat aattcaaacc atttagtttt gctggatgtt 3421 ttttcccatc cactagagtg ataatttaag atacatttct ttttatgttc tttagtctgt 3481 tcctttaatt ttaataaatt aaacatacat tggtactgcc tgggttttga gtcccaggtg 3541 taataggaaa agtgtagatt ttgttgataa ttggtcagta cccctcaaat aaaaagaaaa 3601 gcctctgatt agaaaagtat ggtgtgggcc gagtgatgtg gctcacgcct gcaatcccag 3661 tactttggga agctgaggtg ggcagatcac caatcaccta aggtcaggag ttcgagacca 3721 gcctggccga tgtggcgaaa ccccatctct actaaaaata caaaaattag ccaggtgtgg 3781 tggcgggcac tggtagtcct agctgcttgg gaggctgaga caggagaatc acttgaacct 3841 gaaagtggag gttgcagtaa gccgagatca caccactgca ctccagcctg ggtgacagag 3901 tagactgtgt ctcaaaaaat ctaaaaaaaa aagaaagaaa agaatagtat ggtatgatca 3961 ttgtaaaaac aaacaaaaca caatatttag aaatgtggaa agtgaaaacc ccacaattct 4021 accctccaga agtaaccact gttaatattt tggtattgcc ctccagattt aaaaaatgta 4081 cgtttaaaaa aaaatgtgtc acatatacag tgttctgtgg cttttggaaa aaaactatgt 4141 cagagcttgg atgtcttttc atgttaatat atattaatat aaatctgcca cattttaaaa 4201 accagaactt tggtttgaaa agtacttgta tggctggtct ttatttaacc agtctacctg 4261 ttagtggaca ttttgtttat acccagtttt gtgttttgtt ttgtttttta ccatttccag 4321 caaggtaatt tatatcttta ggctcctggg ctgacatacc atagggtaat ttccaaggca 4381 taagagttct gggttagaac tttactagca agatccatgt cttcctccag gaaggtggta 4441 ctaatttaca ctcataagag tgtctgtttt aaatgtttgc caggccagta agtgaaaaac 4501 agcatcatct tataattgac ttttaatctt taacacagtg caaaatatcc ttttatgatt 4561 atcgttgttt gctttttatt tcttctgtaa attgttcaca tcctttgcct gttgttagtg 4621 atttctctta tggtttctgg atttaaccct ttatataatt atcttaagtt ttctcctggt 4681 gctttaatgg ttctgctttt taaaatatta ttcttattta aaaaaaattt ttttgagact 4741 ctgtcaccca ggctagagtg cagtggcgtg atctccgctt actgcagact ccacctgccg 4801 ggttcaattg attctccttc ctcagcctcc cgagtagctg ggattacagg cgcccgccac 4861 cacgcctggc taattttgta tttttagtag agagggggtt tcactgtgtt ggccaggctt 4921 gcctcgaact tctgacctca ggtgatcacc cgctttggcc tcgcaaagtg ctgggattac 4981 aggcgtgagc caccgtgccc agcctctggc tagttttttg tagacgaggt ttcaccatgt 5041 tggccaggct ggtcttgaac tcctgacctc aagtgatcca cctgccccag catccctaag 5101 tactgggatt agaggagtga gccacaatgc ctggcaattt ttaaaaattt ttaatagaga 5161 tggggtctca ctgtgttgct caggctggtt ttgaactcct gggctcaagt gatccccccg 5221 cctcggcctc ccaaagtgct gggattacag gcatgagcca ccatgcctgg ccagttcgtt 5281 cgttcgttcg tttgttcgtt ctttctctct ctctctctct tttctttctt tttctccagt 5341 tctacttcct tctttctttc tttctttttt tttttttttt cacagaatct cgctctgtcg 5401 cccaggctgg agtacagtgg tgccatctca gctcactgca acctccgcct cctgggttca 5461 ggcaaattat tctgcctcag cctctggagt agctgggatt acaggcgtct gccactttgc 5521 tcagctaact ttttttgtat ttttagtaga gatggggttt caccatgttg gtcaggctgg 5581 tcgccaactc ttgacctcag gtgatccacc tgccttggcc tcccagagtg ttgcgatgac 5641 aggcgtgagc cactgtgcca ggccaagttc tacttcttaa tacataaatt tcaacttatc 5701 tggaagaatt atgactaccc actgccaaat aaaacttcca gcctaactac tggcaatctg 5761 ctgttgaagc tgaacaggct ctaaattgtt gggttttaaa aaaatttttg tttacttaag 5821 cctatagctc ctagtattcc caggcattct cctatgcaaa aaccaaccag gactgaccct 5881 gctcaccttc tggctctata agttattatt cagcagacct gcagaataaa tagactttta 5941 aaaaacaact tcgttgagat atgatttaca tattacaaaa ttcagctctt ttaagtgtac 6001 aataattttt agtaaattga gttgtacaat tttagaatat ttttgtcacc tcagtaaatc 6061 tattatgcta atttataatt aatccccttc cccactccca ggcactacta atctttctgt 6121 ctctgtagat ttgtattttc tggatgcttt atagaaatgg aatcatatag tatacagacc 6181 tctgtgccta gcatattctt ttttattttt attttttgag acagtgtctc actctgtcac 6241 tcagagtgga gtgcagtggc acagtcactc cagcctcaat cttcctgggc tcaggtaatc 6301 ctcccacctg agtagctggg actgtaggca tgcaccacca tgcccagcta aagcatattc 6361 tttaaaacaa tattcagtta ctttgtcaac actaaataat ttaccctctc ccatctaaat 6421 cactgtcatc ttctcttcaa agttgttaac tgaagtttgg aggtcaccca ttgttcttat 6481 tttgaaattg aatgtgtgcc cttgaattaa cctgacgtaa cacttctttc ctctgtatta 6541 gcagtgccac actctgctct atgtttataa cctaccagca aataaggatg gcaagagcgt 6601 cagcaacagg ctcagacgcc tgtccgataa ttgtggtggg aaagtgctga gtatcacagg 6661 ctgcagtgca attctccgct tcataaacca agatagtgca gagcgcgctc agaagcgaat 6721 ggaaaacgaa gatgtctttg gtaataggat cattgtgtca tttactccaa aaaatagaga 6781 actctgtgaa acaaagagtt caaatgcaat tgctgataaa gtgaagtctc ccaaaaaact 6841 taagaatcca aaattgtgcc tcatcaaaga tgcaagtgaa caatcttcca gtgccaaagc 6901 cacgcctgga aaagggtcac aggcaaattc tggatctgct acaaaaaata caaatgttaa 6961 aagtttacag gtaattttga tacctcttgc tttctgaagt ttatggtagg tttggtttgt 7021 ttctgtgttt tacgtgcccg cttgcttttg gcgtgtccct ttttgatttc agtgtttgat 7081 gatactcaaa gtcaatcgtt ttctgtaaag gatctaacac atcttgggta cttaaaattt 7141 aaaccccact gtgcttgtgt ctttgaagga gctgtgccgc atggagtcaa aaactggtca 7201 tagaaacagt gagcaccagc aaggtcacct gaggctggtc gtacccactc acggtaactc 7261 aagtgctgca gtgtcgacgc cgaaaaactc gggggtggca gaacccgttt acaaaaccag 7321 tcagaagtat gtgaaactac tctttctcat agctgcttct tgaatgtgaa ggaagacata 7381 tgaccttttg ccttcaattt tccccttctg ttagaaagga gaacctcagt gcccgaagtg 7441 ttaccagttc tcctgtagag aaaaaagata aagaggagac tgtattccaa gtgagttacc 7501 cgtctgcttt tagcaagtta gttgcatcca ggcaagtcag tcctctgctc gcatctcagt 7561 cttggtcttc taggtgagtc cagcttcttg acccatgggg gtggttacat gtaataggaa 7621 ggattatgaa atagtatgtt tgttgaaaag aattcagata gtactgaaat gcacaaagta 7681 agtctcccat cactcttctg ccagtccccc tctcctcagg taatcatcgt taacatttag 7741 gggcaatgct ttcaccattt cttcttcctc catctggaaa ttgtggtaag cagaggtcag 7801 aaagagaagc ctgttggaga gtccctggcc agtgcagggg ggcatgtaaa gatgtcccta 7861 tgttcgctgt gttgaggaaa aagcccaaac gagccgtgca ttggtggttc tgattcatat 7921 ggttctgtat aggaagctat ttggaagtaa agttttctgt aaaccatggt aattatttat 7981 cactgtctct cacaccaagg aattcttgat gcttgggaga cagagaagcc gtaaggccgt 8041 gttgcattcc catgagctcc ttttagcatt gtgtttgttc tgctgagttg gtggcagatc 8101 actgtagaaa gtgactatgg atcagccggg cgtggtggct cacgcctgta atcccagcac 8161 tttgggaggc cgaggcggga ggatcatgag gttaggagat caagaccatc ctggctaacc 8221 ggctaacacg gtgaaacccc atctctacta aaaatacaaa aaattagccg ggcgtggtag 8281 tgagtgcctg cagtcccagt tacttgggag gctgaggcag gagaatggca tgaacctggg 8341 aggtggaggt tgcagtgagc tcagatcgca tgactgcact ccagcctggg tgacagagcg 8401 agactccgtc tcaaaaaaaa aaaaaaaaaa agagaaagtg actatggatc tataaaatta 8461 catctaaaaa gccacataaa agttttcctt atgtgcagaa agaactgtaa catgtaggca 8521 agaacgttta actctttcag agacacagag gaggttagac gccttaaatt ggtatcacag 8581 gctgatttcc taaggataat gcacaaagca gaaggttaat tcatcaggac tgcctctaat 8641 tcctgccttt agctttaacc caaaggaaga atatatatta attggttggc ggtaattagg 8701 ggtcagtaga tagtactttt gacaatgtct gaaatcaatg tgtatttaag attaaaaaca 8761 cagctgggca cagtggctca cgcctgtaat cccagccctt tgggaggctg aggcaggtgg 8821 atcacttgag gtcaggagtt taagaccagc ctggccaaca tggtaaaacc ccatctctac 8881 caaaaataaa aaaaattagc tgggtgtagt ggtgcatgcc tgtagtccca gctactgggg 8941 aggctgagac aggagaattg cttgaatcca ggaggtggag gttgcagtaa gccgagatcg 9001 tgccattgca ctccagcctg ggtgacagag cgagactctg tctcaaaaaa aaaaaaaaaa 9061 aaaaaaagat taaaaacaca tttgaaaacc aaatgtttgg tttggttgtg ttgtggaaaa 9121 tctgcttctg tatatagttt caaacaaagc ctaaatgata aaatctaaat atacagaaca 9181 gttttttaaa ataatgtcaa tatgtgtgtt ttaagcagga gtatgtctcc aaacctttta 9241 aacagagcat ccccgcttgc tttcaacatt gcaaattcga gcagcgaagc cgactgccca 9301 gacccatttg caaatggtgc tgatgtccaa gtcagcaaca tagactacag attatcccgg 9361 aaggagctgc agcagctcct gcaggaagca tttgccaggc atggcaaggt aactttttcc 9421 ctcttgtgct tgctttgtta tactctctga tggagtctct agtcaagagg acctaatttg 9481 atatacataa ccacagccgg gatagaagtg tgactgaaca agtttgaact ccttcctttt 9541 ccctttggtc tgctcagggt ggtgctataa atccagtttt tacaatgtta gccccagaga 9601 gcaagacgcc tagcagtgaa aagctcttgt cttaggcaga aattgaaatg gcaatttgtg 9661 gccaggcatg gcggctcatg cctgtaattc cagcactttg ggaggccaag gcggccagat 9721 cacctgaggt caggagtttg agaccagcct ggcgagtgaa accccgtctc tactaaaaaa 9781 aaattagccg ggcgtggtgg tgcacgcctg tagtcccagc tccttaggag gctaaggcag 9841 gagaattgct taaacacggg aggcagaggt tgcagtaagc tgagatgacg ccactgctct 9901 ctggcctggg agacagagtg agactccatc tcagaaagaa aaaaaaaaaa aggcaatttg 9961 ataagtaaaa cattgatgga tttcagttag ctcttcctgc tgaaatacag gtaggattct 10021 aaaataatgt tggctggtct tgtttatgcc ttattatatt tgactcattt tttaggtgca 10081 attttgctaa aaatgcacct ttttagtaga ttcttaaaaa tggccatgtc ctgggttttg 10141 tagagtctag aaaagacccc aaacactctt cccatgtgtt ggcctctgag cctgccttgg 10201 aaacctgtgt gtgccagagc acacaggcag gccgggccag tgtcacatct ctaaaggtca 10261 ccctatcctc actttcttac gtcctggtca gcagaagagc agagttcaga catactcctt 10321 catggaagcg ctgcagtcga caaatgtgtt ccttcatggt tcaccttcag ttttaagacg 10381 acagggttct gggcggacat agcgtttagt tttcgctttt cctgttgtag gtgaagagtg 10441 ttgagctcag cccccataca gattatcaac tcaaggctgt tgtgcaaatg gaaaacttac 10501 aagatgcgat cggtgcagtg aatagcctcc acagatacaa aattggcagc aaaaagatcc 10561 tggtctcact tgccaccggg gctgccagca aatcactctc tttactgagg taagaaacaa 10621 gcaagctgtt tatttcagga aataacattt gacccagaaa caattttaga aataataaaa 10681 atagattcaa tcacattccc acccccttgt gatgattgca tgtggaataa tgcttattct 10741 agattgatac agttagaaac aagccatttg actcttgtct aagtcattaa tcaaagctgg 10801 aatacagctt aaacttgaca ggcacttggg agtaaaataa tacgcaacat gttcaaatag 10861 gttctaaaaa tagtagaccc ctacaataaa agtttataag cctgaaaagt ctaaaatatt 10921 accttgagat cttaggtaga aggtagattt gtgtgtaaag actagatcta caaatcttca 10981 agctctccat ttgcagatat atccttagtg tcaaatagat actcagcccc ataagtctac 11041 cttaagacta tctcagtctc aagcgtctgg tgatggtttt attcatagta aaatgtgata 11101 cttctttggg tctccctgtt tcggggagac tggcaaaaca ggacccactc ctgagtgttt 11161 ccttggatga tcagagatct tttttctaac ttgtacttca ctattactta aattacgagt 11221 tacgggtttt ttttttgttg ttgttctcag aataaaaatg atgcttattg aagaaaattt 11281 ggaaagttta gaaacataaa agaagcaggg gaaaaaaaac catccacagg cctatggccc 11341 aaagccaatt gcacttaaca gtttggcatc ttcttttagt tttgtttgga ttttttccct 11401 ttgagcgttt tttggtgcct ctggtttttt aaaactgtag ttgagattat gttgtattca 11461 gttgtgtatc cttatctatg ggaacgccaa tcacatatat ctaacaaact gcagtgtggc 11521 aggagatatt taagctcttc tcccttttgc tctgaattca tcatgctttt ctgctccact 11581 ggtgtggaaa accctatgat tgatactaat tctactgtgt aacgtaatct tctttcattg 11641 ccttgtccct aaaatccttt ctttaaattt ttctggaaaa tacattctga cataccagaa 11701 atccgtaaat ttattttgca tatgcactgg gttaatttta tgaactctgt ttatataact 11761 tgtaatatgg aggaaatggt ctgttcccta atcaaacttc tgttttacag tgcagaaaca 11821 atgtctgttc ttcaggatgc ccctgcctgt tgcctgcctc tgtttaaatt tacagatatc 11881 tatgaaaaaa agtaagttag gtatattttt tctttatcca gtcagtactg attggtggct 11941 tacatgcaca aaaccatggt aatgactctt gaggaacttc agcctggtag aggagaggag 12001 acagagacac tggcattcag atttacatgt caatgaatgt ttagcagaca gttttaaaac 12061 gtcttttatt tttaacaaag ttagaccttt ggtcctcagt gttgtgctta tttttcccct 12121 aattgactag ctttagtttg ttttttttgt ttgtttgttt gttttttaat ttttttggag 12181 acagagtctc gctctgttgc cgaggctaga gtgcagtggc acaatctcaa ctcactgcaa 12241 cctctgcctc acaagttcag gtgattctca tgcctcagcc tcccgagtag ctgggactac 12301 aggcatgtgc caccaagcct tgctaagttt tcgtattttt agtagagaca gggtttcgcc 12361 atattagcca ggctggtctt aaactcctga cctcaagtga tctgcccact ttagcctccc 12421 aaagtgctag gattacaggt gtgagccact gcgcctggcc aagttttagt atttattgga 12481 tgaacatttt taagcactag actataagct ccaggacagc agggactagc tctgccataa 12541 tactgtatcc taagcaccca gcccatagta gatgcttagc agggactagt gagtacaaga 12601 ccaaggcact gcagagaagg cagaactggg caagacttag gtcttgtcct tgaggagctt 12661 atgtctagcg ggaagagaaa atgctcgtga atccttatgg acagtgaaga aaactgattg 12721 gaaaggagga gaaaaagctt tgagaacaca gaggaaggga agaacaagga acttggccat 12781 tatcccacag gtaataggga gccactgaag tgctggagga aggaggggag gggaaaggtg 12841 tggtgatggg aatcatgctg caggaagact cttctcgtag cagtggtgtc tgagagggca 12901 ggaacagggg agagaaggag gcagagaggg aaagtagcta catcagtcca aattcagagc 12961 agatactgtt gggacagaac tcgatgttac agaaatgtca aaatgtcaaa tgtcattgaa 13021 actagagtgg gtaactaatt ttttttatta aaaacaattt ttattgagcc agggtctcac 13081 tctgtcaccc agggtgtagt gcaggggcac gaccctaggt cattgcagcc tcagactcct 13141 ggactcagat gatcctccca tctcagcctt ttgagtagct gggaccacag gcatccgcca 13201 tcctgcccag ttaatatttt ttaatttttg taatgagggg gttttgctat gttgcccagg 13261 gtggtcttga actcctggct tcaagtgatt ctccctccta ggcctcccaa agtgctggaa 13321 ttacaggcac gggccaccac gtccagccag agtaactagt tttatgtagc aagcaaggta 13381 aaagggagag gtcattttga ttctgagtgt tttgccttgc atgactggga tgatggttgc 13441 acagttaaac tggaaatagg gaaataaagc atggcaagga ggtacagcaa gtttttggtc 13501 ttggttttgg acattcaggt tttaggggcc attggagtag ctgggtggag aggtcacata 13561 gtcttctgca cattgaggga cttctttggg agagaagtag gctctggaga tagggcagtc 13621 atttgccttt aagacgattg ggacagacta gagagcagat aaggttggga gggggaagga 13681 gaaatagagg gcaggaatgg atggaatgcc accagggcca gggatgtggg cgagcaccca 13741 ggatgtgtgc caagagagca catcaggaat ggggggatcc aaggccatga ggaaatgcat 13801 gaggtgtggg gatgaggcag aggccgctga taacttttga gtcatcacta cagagtcctg 13861 tggctaaaat ccagtaggga aggggctgag agactcgggg gtaacacagc agaagcagct 13921 cctgagcagc tggatgtttc cattacaggg acagggtaga gtacgagact gagaatgaga 13981 gtacaaggga gagtgtgtct ctccgggagg tggaagagga ccaggggctg gtttggtaga 14041 tgggaatctg agcatatgta ggacactggg aagaaaacac tggtcagaca gtgagaagac 14101 cagggaacac ttactgagca cttactatga gccaagttct gtccaggcct aaccacactt 14161 aatcctcaca ttgaccctgt gagataggcg ctatcatcac cccattttac agatggaaaa 14221 agtgaggccc cgattggtcg tggaacttga ctaaggtcac acaggttgaa gtggagtcag 14281 gatgtgaact caaatctgcc agcctccagg gccagtgcct ttaagttcca agtaagaaag 14341 atacaggtca ggaggagaac tggggagtcc agtgtcagag ggtgtgtgga tcatgcagtt 14401 gaggcatagg agagttgaga gggtcacgga aagcacatat acttctgaga ggggagaatg 14461 gggatggtga gaagtaacac agagaattgg aagtgagggg agctgcatcg agagaggttt 14521 ccaactcaag agaatttggt gaggtcactt aaagaaactg ggcaggggct gggggtggct 14581 ttcaaatgaa gtgagcattg tcatggtcgg agccagagac gactcaggag taccaagcat 14641 cactggtttg ctagtgaatg aatttgtagt agaattattc cagttttacc tttgaagaga 14701 tggaggaacc attcttggtt attgttcttt atttaaacag gatgcgtgtt tctgtataat 14761 caaagaaaaa ccttctaagt cagcgtctct cttctcttgc ttttaggttt ggacacaagt 14821 tgaatgtgtc agatctatat aaattaacag acacggtcgc aatccgtgaa caaggaaacg 14881 gacggctggt gtgtctccta cccagcagtc aggcccgcca gagccccttg gggtcttccc 14941 agtcacacga cggctcctcc acgaattgca gcccaattat atttgaagag ttagaatatc 15001 acgagcctgt ctgcagacag cattgttcca ataaggattt caggtgtgcc gttcgttttc 15061 ctttagaagt aattcttttc tataagattg tggaacatgt taggatgacc ttgtgaacat 15121 cttagacccc tgcggaatgt cagcttccaa ttgtgttctc ttttcttcag cgaacatgaa 15181 tttgatccag actcttacaa gattcctttt gtgattcttt ctttgaagac atttgcgccc 15241 caggttcaca gtcttctcca gacccacgag ggcaccgtgc ctttattgag gtgaaccatc 15301 tcaaagcctt tgtcgatata atgccctagt tctgcttttt ataggaagcc tgccctttct 15361 gcccacatct ctttacaaat aagtactcta caatgcttca tcctcatgtg tttagccttc 15421 agtcaaagta cagtcatgca ccacataaca aaattttggc caaccatgga ctgtatatag 15481 gacagtggtc ccataagatt ataatcacat attttcacta gacctttcga tgattagata 15541 cacaaatacc attgtgttat agttgctcac agtgtgtagt acagtaacag gctgtccaga 15601 tgtgtagctc gggaaccatg ggctgcacca gcctaggtgt gcagtgggcc gcagcatcta 15661 ggtttgcata agtgcacctg atgatgttct caggtgacag aatcacccag caacacatta 15721 ttgagtacat atccccatca tgaagcagca cgtgactgtt tatctgaaat gggatccatt 15781 cactcagtac ttcaacaaga aaatgtgtgt ttgcgctaca ttttagaaac gtaaacgtga 15841 ggagggcaga ttcttccaca gaggggagat gtatcataag tgctgcagtc gaaatctcca 15901 caagtgttga gagtacacag cttcaaatat gcatacattt caagacttgt gagagaaaag 15961 tattcttcag aagttcattt aatcactcca tgagccacaa cagtagactt aacacaaagg 16021 ccaacacagg actcagtggc ctgcacgtcc ctgctgatgt ggaagcagac gcgtaactga 16081 cttggagccc agttgcccta tttttaatcc ccctcctgga aaactaccca gaaaatgcca 16141 tcagaatgca tcagagaggc cagtgcctca gtgcttgcct ccctgggtcc ttaattttaa 16201 ccttcaaacc tggtggtggg cattcagaca aatgattaac acatacatta aaccaaagcc 16261 ttctgtttag caagcgcctc cttttctgtt attttgattt aacctttcct ttttttgttg 16321 ttttttccta tttcagcttt ccagattgtt acattgcaga gtttggcgat ctagaagtag 16381 tgcaagaaaa ccaaggaggt gttcccttag aacacttcat tacctgtgtt ccaggtgtaa 16441 acattgccac tgctcagaat ggcatcaaag tggttaaatg gattcacaac aagcccccgc 16501 ctcccaacac tggtaggtgt caggtatttt gtcctggagc taccacctgt ccactatgag 16561 accagaggag gctattggct tttattttaa acagcagcac aatttgccaa gtgtgaaaac 16621 accatggatg tgtcctttga tcttttgaaa attcctaaaa tgcttgaggg gttggcgtta 16681 ctggactttg ttttattttt tatttattta ttttttttga gacggagtct tgctctgtcg 16741 cctagctgga gtgcagtggt gcgatctcgg ctcactgcaa cctctgcctc ccgggttcaa 16801 gtgattctcc tgcctcagcc tcccgagtag ctgggactac aggcaccagc caccacgccc 16861 agctaatttt ttgtattttt agtagagaca gggtttcacc atattggcca ggatggtctt 16921 gatctcttga cctcattatc cacttgcctc cgcctcccaa agtgctggga ttacaggcct 16981 gagccacagc aaccggccta gactttgttt tttagatagg gccttgctgt gttgtccagg 17041 ctgctcttga actcctgagc tgaagatgtc gtcccactat ggcctcccaa aatgctggga 17101 ttacaggcat gagccaccac acctggccag tttgaaagta ccaacataag ctatgtaaaa 17161 aatttgaaac tcaaatttca catttacaga tccttttagc aacctaattt tgtgaagctt 17221 ttggcttgcc agttgactga ccttttaaaa atagtttcag gggctgggcc tgttggctca 17281 cacctgtaat cccagcactc tgggaagctg aggtgggtaa atcacctgag gtcaggagtt 17341 caagaccagc ctggccaaca tggtgaaacc ccatctctac taaaaataga aaaatcaggc 17401 atgatggtgg gcacctgtaa tcccagctac tcgggaggct gaggcaagag aatcacttga 17461 acccaggagg tagaggttgt ggtgagctaa gatcaagcca ttgcactcca gcctgggtga 17521 caagagtgaa gctctgtctt tttttttttg agacggagtc ttgctctgtt gcccaggctg 17581 gagtgtagtg gcacaatctt ggctcactgc aagctccgcc tcctgagttc acgccattcc 17641 cctgcctcag tctcctgagg agctgggact acaggcgccc gccaccacgc ccggctaatt 17701 tttttttttt tctgtatttt tagtagagac gggtttcacc ttgttagcca ggatggtctt 17761 gatctcctga cctcatgatc cgcccgcctc agcctcccaa agtgctggga ttacaggtgt 17821 gagccaccgc gcccaggatt tttttttttt tttttttttt gagacggagt caccctctgt 17881 caccaggctg gagtgcagtg gccggatctc agctcactgc aagtccgcct cctcggttca 17941 taccattctc ctgcctcagc ctcccgagta gctgggacta cagacgcctg ccaccacgcc 18001 cggctatttt ttttgtattt ttagtagaga cagggtttca ccgtgttagc aaggatggtc 18061 tcgatctcct gacctcgtga tccgcccgcc tcagcctccc aaaatgctgg gattacagac 18121 gtgagccacc gcacctggcc gaaactctgt ctttaaaaaa aaaagtactt tcaggtattt 18181 gaaggaaata ccacttaatg cagatttacc ttcgaaagtc attttggtcc acgttttgga 18241 ctttgataaa tgtgaaaata aatttaaaag accaaggtat ttggacctag acttgggtga 18301 gtgtctgagt tacgggcatg tcctgagttg cagatatcca aggctgctgc ccgtgaacga 18361 cactttactt gttgccttcc tgtgtataca cacatacttc ctcctctttg tccccaacac 18421 ttttgcttct cccacctccc ttaccctaca ggaaaaaagg tgaggataac caaggaattc 18481 tttctgagtc agcctgctct taggaatgtg tgtactcagg tttcttacgg gacacctcct 18541 ttcttttccc tctgtcttag acttctcagg ggccatagat tatgccactc ctcctttctc 18601 ctaagccatg agcccacaag cctccacttt ttattttatt ttattttatt ttatttattt 18661 atttatttat ttatttattg agacggaatc tcgctctgtg tcccaggctg gagtgcagtg 18721 gtgcaatcta ggctcactgc aagctccacc tcctgggttc atgccattct ccttcctcag 18781 cctcctgagt agctgggact acaggtgcct gccaccacgc ccggctaatt tttttgtatt 18841 tatttttgta tttttagtag agacggggtt tcaccttgtt agccaggatg gtctcgatct 18901 cctgacctcg tgatccgccc gcctcagcct cccaaagtgc tgggattaca ggcatgagcc 18961 gccgcgacca gcaagcctct ccacctttct ataaagattt ccgtttgatt acttatacct 19021 ttttaatagg tataagcagt caagatggtg caaaaacctg ttccttacac ccttttatgt 19081 catgctaatc ttacaacatt atcactttct gtggttttct gggtaacgtt ttcagctcca 19141 ctatggatta attttgctct tctgggccac ctacctgatg accatttata actttgtgga 19201 aaggtacgtt ccacatgacc aagtttcaga aggatcttgg aaatataact agtttctaaa 19261 ctgggagctt cttgtgtact ttgacttctg ctataaataa cactatagta gactctgctt 19321 taccttggcc gcttaacttt gaggattagt gggacagcat ttgacagccc gtgaataagt 19381 tagttagcta gttattggac tatgagacct gaatatagaa actgagacgt aaagcagcta 19441 ctgtatttgg ttttgtttgc ttccccatga tcgaactcaa gccagggact gcagggccat 19501 tttcccttct ttcagctggt gaaatctgtg ctttctgtct tatacacgtt ctttttcttc 19561 tagacccttg gcttctgcgt tcgaagagtc ctgtaggtaa cccccagctg atccagttca 19621 gtagagaagt gattgacttg ctgaaaagcc agccatcttg tgtcataccc atcagtcatt 19681 tcatcccatc ctatcaccat cattttgcaa agcagtgccg agtgtcagac tacggatact 19741 ccaagctgat tgagttatta gaagcagtgc ctcatgtatt gcaggtaagg cgtgtcacgg 19801 gattacttgt ttacaggcag gaatgttcct ccatggcttt ggctgcctcc atcaaagaaa 19861 cagtttaatt cctaagaagt cagttctggg cagaacatcc actaaaaact ttcttcagag 19921 ggcaggaata acacacacag gcggcttctc aatttttcag aacgctgcta gtttccagtt 19981 ttagaggaac tccttggagg actgaggagt attttccctc taagctctat agaccttctt 20041 catcccagca cagctttctc cagaactgtt catgattaca aggaagctcc catgaatagc 20101 cacatacttg tatgttggtg cctatgtgag agtcaagggc ttaacctttc ttctaaattt 20161 aactagttgg agacaagtta ctgggctaga gtgtagcact ggatcctatg aagactttta 20221 ccagttaact cacagtgtta taacagcttc ccactcacta aaaacggctt gaaaaacttg 20281 tagaccaatt tgtctggaca accaatacag ttttcatttc tgtactcacg acttttcttt 20341 tcaaaagatt acacttttca tttttatact caacgactgt ttgggagaag gtgtgccagg 20401 ccaaaaacga taagagtttc agaaggcctt ggacatttgt tccttttgac caactacatt 20461 ttgttgatcg acatgtgttt caaattgcac acattctaca taaagggagt catgctctga 20521 ggctagtaac gagagggtga gccagcagca aggaagtcgc tgtctgctga ggtcttatgt 20581 gtataaataa actatttgct ttgcagattc ttggaatggg ctccaaacgt ctgctgaccc 20641 ttacccacag ggcccaggtg aagcgcttta ctcaggattt actaaaactt ctcaaatccc 20701 aggccagcaa acaggtcatt gtgagagaat tctcacaggc ttatcactgg tgagttgatg 20761 aaaagaacag aaatgaaagt aaatatttta ggacatttta tcttctgcca aactctaaat 20821 agttatctag tatttctatt aagcattctt ttggaaaact tgtagtagtt ttttgttttt 20881 tttttttttt ttttgagacg gagtctcgct ctgtcgccca gtggcacgat cttggctcac 20941 tgcaagctcc gcctcccggg tttatgccgt tgtcctgcct cagcctccag agtagctggg 21001 actacaggcg cccgccacca cacccggctt attttttcta ttttttagta gagacggggt 21061 ttcaccatgt tagccaggat ggtcgcgatc tcttgacctc gtgatccgtc cacctcggcc 21121 tcccaaagtg ctgggattac aggcgtgagc cccagcgccc ggccggaaaa cttgtagttc 21181 ttctaccgat gatactactc attccgtatc cacttccgtt gttaaaacct aggtgtttct 21241 caaaggactg ggatgtcact gaatatggtg tttgtgagtt gattgacatc gtatcagaga 21301 ttccagacac aaccatctgc ttgtcccaac aagataatga aatggtgatt tgtatcccca 21361 aaagaggtat gtgacttaac atttgctgga aggatattgt ctatctcctg ttattttaac 21421 atagtgtggc atggaagatg tggttttccc tccctgtccg tgcttcctga agattccaca 21481 gaaagcacag tgtgaaaact gctgctcttt ggattctgtg ttcttcaaag taggtgaagc 21541 tgttctcaag aacaaaagca ggctggcttt ttagaattga ctctaagggg aaaaaggggc 21601 atcatcagca ctttgctcaa atgccttctg cagcctggct gtggctacat gagccttagt 21661 catccagttg agaatctgtc accgtgcctg ttggctttgc tcttctgttg aactctgagc 21721 tactttattt actgtgcaca cttgagaagg gaagacaact cgaatgtttt tttttctttt 21781 gcctttcaaa ttttaatagt tgaagtcaga actgcacagg tccagttacg ggtagaaagt 21841 acagtatttg agggatgcga aactcatgga tactgggggc caatttttcg tgtctgcagt 21901 ttccgcgggg cagaccgcaa gacttgagta ggtgcggatt ttggtatcca ggcagggtcc 21961 tggacccgat cccttgagaa caccaagggg tgactgtatg gtcgtatcct ccttctatta 22021 tctggttctt aaacggtgct cctgcagaac attgatgtcc atgaaatcat agaattccag 22081 ggcccattct cacttaaaaa ttaatagaaa caatctcagc ttaagctttc cctcctgttt 22141 atttttgttc aacttttgaa gcctgattct ctcttctgct agcaagtttt aactgaagat 22201 cagatgaata aacaaaaaaa gtggggatat tcagaagcgt gatttcatgg gaataatttc 22261 tagagtactt gcttaagtac agggacctgg gtgtgtgtca tgttaataga tatggggcag 22321 gggctagaaa attgtttcta gcttataaga tgacattgtt tctagcttat aagatgaata 22381 tatatgaatt tcataatata ataatatcat aacttcctgg tgttctgccc tggaacatgt 22441 ttgagaacca cttgcaaaga gggaagagta tatgtgtttg ttttaggttg tgtgtgtttg 22501 gaaggcaagg atgatatcgt ttataaattc ccaaaaggat ttactctagt gtggatgttg 22561 aacacaaatg ggtagaaaaa aactgtgatt ttatgttttt tggttttttt aaaagaacgc 22621 actcaggatg aaatagaaag gacaaaacag ttctccaagg atgtcgttga tttgctgcgt 22681 caccaacccc atttccggat gccctttaat aaatttatcc cttcttacca tcaccacttt 22741 ggccggcagt gcaaacttgc gtactatggg tttaccaaac tacttgaact ttttgaagcc 22801 atacctgata ctttacaagt gagtgaagaa attctaattt tataaagtat ttttctgaac 22861 aacgtatagg aaatactttt tcacgttcca gacctgtttt tctgggaagt agggagttac 22921 gaaagaggat gaaaatctaa gctattccag tcagagaagc catattcctg actcattagt 22981 atttttagat aaatatggct ttttccttca ataaaggatt attcaaactt gttttggggg 23041 taattattga aaaaaagatt cagtaggttt tctttttttt gtttgtttgt ttttgttttt 23101 tgagacggag tttctttctt gtcgcccagg ctggagtgca atggcgcaat cttggctcac 23161 tgcaacctcc gcctgcgggt tcaagcagtt cacttgcctc aacctcccga gtagctgggg 23221 attacaggtg cccgccacca tgcccagcta atttttctat ttttagtaga gatggtgttt 23281 caccacgttg gccaggctgg tcttgagctc ctgacctcag ttgatccgcc tgtctcagcc 23341 tcccagagtg ctgggattac aggcgtgagc caccacacct gctgatttct tacactgaat 23401 atttgttcat tacttttata cttttaaaat ccttcgtgtt gaaggtaggt ctgagtacat 23461 gaggtaacat cagtgctaag ctgttcgctg ttgctaagtt aaaccaaatt tctgttgttg 23521 attttatttt ctttttttac tactgaagtg ctttttaaaa attgttttat tgtggcacaa 23581 tacacataac atagaactta ccttcgtaac tatctttata tctcctgtcc agtagtggtc 23641 agcacattca cattgttatg caaccgatct tctgaacttt ttcatcttac acaactgaaa 23701 ctttataccc attcagtaac gctgacttgt tttttcatta ctagctgcat actttttttg 23761 ctattctgta aacacccgtg atttgtttat tttccctttc ctccccccgt gtgttttcaa 23821 aggtattgga atgtggagaa gaaaagatcc ttactctgac agaggtggag cggttcaaag 23881 ctctagctgc ccagtttgtt aaactccttc ggtcccaaaa agataactgc cttatgatga 23941 cagatctcct tacagaatat gctaaaactt ttggttatac atttcgtctc caagactatg 24001 atgtcagttc catttcggct ttaactcaga aactctgcca tgtcgtgaag gtaaatgttt 24061 tttttttttt tttttttttt ccctcatggg ataaaactca cagtgaccta attctcatta 24121 ggacgtttgc gtgtgctgta gcgtttcttt atttacacct tgaactactt aactgtcaca 24181 gcattatgac tcgttttgtg tcattagtgc ttattaggta atagaagcct tggtttggaa 24241 gttgagagtt ttggtcttcc tcttgtgtgt ttctgaatgg ccttgccaaa gcaatgcctt 24301 tgtgaactga tgaggtcact cttcacccca gggttcggtc atgaggtgca gctgcctcag 24361 cactgctggc ctgagcttga ctccttccag tgtgagtttt gcctgtgacg ttcttgggag 24421 cccgtaatca cacagtgtcc tctccctcca cattctcacc cacttaggag taggcttagg 24481 agcaagcgag ggtgcagagt ggcctttgtc actggcagag ctgtagtaga gagggcagct 24541 cgtgctgttt gctgtcctgg actgcacttg acccttcctg tcctaggcag agcagctgcc 24601 agttttcggc ctcctccccc acaggaccag aagaagtcac tgccaggagg aaggaatgga 24661 aagggaagca tgaaggacgg caccctcagt tacaggccca ttactagctg cacacatttc 24721 tgtgtcattt aattccgtaa acagccagaa cgggttcact ctccctttcc ttcccactgt 24781 ggggtttttt tggagttgga gtcttgctct gtggcccagg ctagagtgca gtggtgcgat 24841 cttggctcac tgcaacctcc gcctcccagg ttcaagagat tcttcttcct cagcctcctg 24901 ggtagctggg attacaggcg cccgccacta cgcccagcta gtttttgtat ttttagtaga 24961 gacagggctt caccgtgttg gccaggctga tctcgaactc ctggcctcag gtaatccgcc 25021 cacctcagcc tcccagagtg ctgggattac aggcatgagc cactgcaccc agcccccact 25081 gttttttaat gtattggact gcagctaaga agacgttctt accttggcag aggtaggaac 25141 gctcagagct ctccttaggt actgattggt tggaggagga ggtgagagag caggaagcac 25201 aaccaggaga gccgaacatc cctgagccaa ggcagaggaa tggccagcag catcgcagga 25261 gggctggtca gagagggact gcctgacttc caggtttagc tcagtgggaa gagtcggtgc 25321 tgctggccag gtcagtggca gtgttggttg tgtagacaga agccaccttg gagtgtggtg 25381 aggagtgagg agaggtgaga aaatggccca ggtcatctgg acagcatcct ggtttgtctg 25441 cttgtgaaga tggagggctg tggcagcagc tagaggctgt ggggctgagg gagcaaggga 25501 ggatttgggc tgttggtttt cctacctgga aaaacttgag catgtctgaa agccattgat 25561 gtggatctgt ctagagagag aagatgaaca tacaagggcg agaaggggta agacaggagg 25621 gccttgaggt gtgggagact aggaaacagc acacagagga aggaggaagt atacagggca 25681 ataggctggg ggcaggcatg ggcaggtctg atgtttatca tcaggaatag aagggagtcc 25741 tcttccaaga gtggttttcc actagagaga gggactgatc tctgcttaga gagagagagg 25801 agaggcaggg ctcaggactt aggagtgggg agaaggtcca cagttgtcat ggtgagtgga 25861 gagtggcttg atcaggatgt cacgtggagt gcaggggtat cgagagctca ctgcaagttg 25921 gcaccgtaga ctctgggaga gcagcagttg gggtgtggct atgcaggtgt cggttcaggg 25981 aaagcaggtg gttggattca aggtgggagt ttaccgatgg gcgtcaggaa aaggcatggg 26041 atgagcgaat ttaaatattg acaagaatgt cagtgccagc atgacaaaag aggagagtga 26101 agcaagggag tgggctggca aattgggagc agataggagg cagtgatgct gctgaggttt 26161 gagaaggggc ggagttgagc gggtagagtc tctttctgca ctcccacccc aacagcagct 26221 ccagacttgc ttttgcaagt gggatgctca ggaggtttga atgatttatg ttggatgggg 26281 taatgacagg tttgcactgt gaccctggag atgggtgtct gcagtgaaat tgaacacagg 26341 aggatgtcgt cggagatgac agaggcaagg cattgaaagg ctatacggct gtggtgtggg 26401 acaggtccct gggtgttcag ttgcccagga tgatgaaagg ggtggactca ggtggatagg 26461 aagaccccac gccatgacct agggctttgc tgggtgagtg tgatgggtgg gcagctgggg 26521 tgtgagcagc aagaatgggg agagctgaga gaggctccgt ggtgcctccc tggctgccca 26581 ctgggtctcc tgggagcttc tccaacctta cctttcagat tcttatagaa ttagtcccag 26641 cacactctga gcattggtgt ttttaaaagc tccccaggca attctaatgc atagccggga 26701 ttgagaatca agggatggat ccaagtcgca ccagaggagc aggctctggt ttggtttctt 26761 tcctaagcag ctgtagtgat ggtgggggtc tgggtggcta agatgactgg gatcccctgg 26821 acttgtatcg gttggttgat taaacccgct tctgaggagg aaatctatac agtcccataa 26881 tcatgataag tagggtggga ccctgccagc ctccctcctg ccttggctct gggcacatgt 26941 cagaaccaga taaagatgcc ctggtcttgg gatggctcct caagtgccac ccacctctct 27001 ctgcctctca gaatccacgg tccaaagagt gccaacttgt ggtgacttag ccctgccaag 27061 acctttggcc ctggcaggaa aaccagataa agggggcaaa caagctgtca ggatgagcat 27121 tggtttgtga gagcagtgcc ggagccgtgt gtgtgccaaa gctctgcctg tttttgtgcc 27181 aggaaaggct cacgccacag cctcaccctc gccacccaca gcctgctggc tcagtcacca 27241 gatgcgtgtt tcttcacatg gtgtttggtc cccaagtgac aatctctgtc ttttttgtaa 27301 ggtttccctc tcagcgcatt taaccgctta aaccctttat ctccttgaga atctgaaata 27361 attgaggagt ggacagttac tagtcagatg aaaataccag ctttgtggtg agtaatagca 27421 gactagaaaa atgcagcttg ccttgtaatc agagtaggcc ctctctcacc tccccagccc 27481 gccctgggga cgcatatggg caagtcgggt ctgcacagaa gccatgtgag tgttgcttag 27541 agtctagcct ccccatccct accccccttc gagcacagca acccacactt cacaagcttg 27601 gaccactggc catcagctgg ccagtctgtg tccttcttat agtggggttt ggaaagaatg 27661 aatgaaaaat aaagagaatg atgcaatttc ctcccccatt attatcttgt tgagagtttt 27721 cagttatgga ccggcagggg gaggacagtt tctgcttcca tgttggtctg gtgtcccaaa 27781 gacgctaaaa acaaacccac ttgctttatt tcagtgtaag agaaaaaaat tagagatttt 27841 tcatctgttg gtctcattaa ccaaaatgaa gtgagctgag ttgggtataa cctagggtgt 27901 ctctcttgag ctctgaaaag tgtgaattct cgttttcccc aggttgccga tatagaatct 27961 ggcagacaga ttcagctgat caaccgaaag tctctgcgat ctctcactgc ccagttgctg 28021 gtattgttga tgtcttggga aggaaccacc catctttctg ttgaggagct caagagacat 28081 tacgaaagta cccacaacac tccccttaac ccctgtgaat atggattcat gaccttgacc 28141 gaactgctga agagcctgcc atacttggtt gaggtaggca cgttaatggc tctttagaac 28201 tatcattgaa aactatcgat tgggcatttc cggattacct gatggtctag attagagtgt 28261 gtagagctgg aagggacccg agagaccatt tcatggatga acacgccgag acctcaggca 28321 gtgaagcagc cctgcctaaa tgggtggcag aagccagtga aagcctggtc tcctcatcct 28381 ggcactgggg tgtgtttgtc ccgttctagg atgaaagcct ggtctcctga tcctggctct 28441 ggggtgtgtt tgtcccattc caggatggat ttcgctactg ttttatcttt ctgaatgaga 28501 tggtttatgt ggtggcttga agagagagat gagtaaggtg agggcagaaa ggaaggttgg 28561 ccagccagtc ccccaggcag cctttcaaga atggcgctcc aagattgaga gaatcttaag 28621 agaatactgc atgcaattct cgggcgacag atataaaaac ctagagtaaa taaatgaaat 28681 cctggaaaaa ataagttacc aaaattcact caagaagtta aaaattagac caattaccat 28741 aaaagagact ataaaagatc actgaaatat accagtgaaa aaggcatcag gaccaagtga 28801 gataacaatt gaatttcaac caaccctaaa agaacagata aatctacctt ttaaaaaacc 28861 attctggggc ccggcaaggt ggtacacgtc tataatccca gctactaaga aggctgaggc 28921 tagagaatca cttgagccca ggagtttgag gccagcctgg gcaacatagc aagaacccat 28981 ccccaagaaa aaaaatactg ttctaggctg ctgaaaaaga aaaaaggaaa gcttaccatt 29041 tatttttacg aagccaaaat aaatttaata ccaaaaacat aagatagtcc cttaaaatga 29101 tcctaaataa gatattaaca aatgagctcc agcaatttaa aagaatgata tgccattatc 29161 atctaactgt gggacacatc attaaataat gaaaggatat acatccagca ttttctaaac 29221 tctgttaata tctatcctaa gaacaccagt tctggacagg tgttaatgca tatttcagga 29281 caaaggactt tccctgtcca gctaagattg ggaatatcct ttctcttggc gattccatag 29341 catgttaaac actttgaaaa ggtttgtggt aaagaaatct gcttccttac cctgcataca 29401 caagggtttt gtgtttgttt cttgttgctt ttgatgtgtt tttaaagaat gctagtgtca 29461 tcctcttatc acagtgtttc ttggaaaact ggtttggagg tgcagtttaa tggccccagt 29521 gttcactgtg tgtaccgtga tttgccaaca cagtttgcct cccccgattt cttccctact 29581 cacacactgt ctcttcagac cgctgcagcc ttcgacttgc aaatagaact gaagtttgct 29641 ggaggctata gccttctctt ggaggaaatc cttgaatata tccctcctcc ttttctttct 29701 cctcttcctt ctttccttcc tcctcctcct ttcttcttct ttgtcttttt tgtcatcttc 29761 atcttcttca tctttgtctt tgtccaattc gtcgtcttcg tcttctttat cgtcgtcttc 29821 gtcttcttca tcgtcgtctt cgtcttgtct tcgtcttctt gttctttgtc ttcatctttg 29881 tcgtctttat cttcgttttt gtctcgtcgt ctttgtcttc atcttctttg ttgttgtctt 29941 ctttatcttt gtcgtctcct tcttcttcgt catctttgtc tccttcttca tcttcatctt 30001 ctcatcttcg ttgccttctt tttcttcttt gtcttcgtct tcaactctgg gccttttccc 30061 tcagtgggag gccttcactt tccacggggc agggccagtc actaatattt ctaatgctgc 30121 gttttctttt taaccaggtt ttcactaatg ataagatgga agaatgtgtg aagctcacaa 30181 gtctgtattt gtttgcaaag aatgtgcggt ctttacttca tacttaccac taccagcaga 30241 ttttccttca tgagttttcc atggcctata ccaagtatgt cggagaaact ctgcagccca 30301 agacctacgg ccacagcagc gtggaggagc tcttgggagc aattccacag gtgggcattt 30361 ttctcagctt ccgggagagc atcttctgaa cagccacagg ctaactgtcc tgaacagaaa 30421 aataaaatgc tgccagaata atggaacggc aggcattttg aattgatcct ttcaaaacat 30481 actgttggtc taaagtaaat gattttagca tttctccttg attctttctg taaaccttta 30541 ttaaatccct atatgccagg tgagtaagtt tcaggctgta actctagaaa ggtgttggtt 30601 aatgggagag tcaagacccc caagcaggca actagtcagg gcggtcaggg cagtgggcgc 30661 gctgaggagg gccgtgtcag ggcccacttc ccaagcctgg ggagcctgtt gcaaaacctt 30721 caaggatgtg cctgagggaa gcacgtgaca aggaggggag agagcattct gggcaaaggg 30781 gatgatgaca tgagcagatg cagggttgtg acaagtggca tggagtgtcg agaggctaag 30841 agggagctga gagctcggtt agggagggcg tggctcctcc aggtaaggag ggaggactgt 30901 gttggagaga gctggagccc aggtttgcat ttgaaaagca ccccagtggt gctgtccatg 30961 gtagagccga gagaagcagg catggagaag ggaactgggc taggatcctg gccactgcag 31021 aggtctgggt cagtgttaga tggagggatt gagccagagt aaaaatagta gaatagagga 31081 agtaggaagg acccaggcaa agcttgcctg ggagctgaga tcagtgacct ggcgacggtg 31141 tgcacatgta gggggcctgg gagcctagct ccactactac cactctgctt ggctgagatg 31201 gagagcggca ccatcaggga gaggacttta agatgctgat aatgaactta gtttaaaggg 31261 tctgagagat tgctcttaat tcttttaggt ggttgttcag tttctgcaaa aaaatggcaa 31321 agagctctca tgtgagaact accttaattt atatgccaaa gctggtgttt ggaagacagt 31381 ctttttaaga attataaagg atatttctga aactgtgtct catgattttt gaactactct 31441 gctttttata ggggtatgtt tctgtcagct tctgttgctt taaaagctaa ggagagagta 31501 gagaaacatg cttcttggca gtaactcagc tcctaaattc tagcctaagg caggctccgt 31561 aggtcacact cttcctgaat gttagggctt ctgtcccctc cttttccgct gcttaaccag 31621 tagtatgtct tgtgtgcagg tggtctggat aaaaggacat ggtcataaga gaattgtagt 31681 gttaaaaaat gacatgaaaa gtaagtaagc acccatcccc tcccccctta aaaaatcacg 31741 gttctctcac tgacatttct ctctgacggg tgctctttgt cctgtaggtc gtttgagttc 31801 actcagtctc tcccctgcca atcatgaaaa ccagccctcg gagggcgagc gcatcctgga 31861 ggtgcccgaa tcgcacacag cctcggaact caagcttgga gctgacggca gtggtaagag 31921 aggaaaagca gagacatagg agcttgtgaa attctaggag aaacggcttt ggggtcgggg 31981 agagcgaggg aaaggactcc agcaagtcat ccaatctgct gaagttacat agccacacat 32041 ttttcaaagc agttttggta tttagttagt aacaggataa aattgacctt tatctgttag 32101 cattgcatgg ttgactggtt aaagtccctg aaggatgttg gcacccccat aatgatcatc 32161 ttaccccagt tcagatgttg aagaaaaaaa ttattccgac acttattata atggatattc 32221 atgacacttg caacagggga gagagagcca actcccaata catagcctag gagcagagtg 32281 agggctcagt gggtggaaaa ttactaagag gattcttgct ataggcatgc aggccagcca 32341 aggatctaga catcagtggt tgggaataag gaattcgatc agctgtgaag ggtgagggga 32401 ttctcactaa actgactcaa gctaagactc tagagcgagc cggcaagttc agcaacagac 32461 atggaaggcc aaggtcaggc ctgatccgga agagggctca gggtgcctga ctcatggctg 32521 gtcaaggcgt ctttgtcaca ggacagccct aggcaggatt tgatatacta atgcattaat 32581 attacccaaa attacactga gtcacataga tttcagatag atccacctct gtagtagata 32641 gaggccaatt cttggaatgt tatagcagtt ctaacttttc tgaaggactt ttgaattcat 32701 cgttataggt gagaattttg tcttcataac atgcggcgag atagcacgtt accgtcccct 32761 gtgcccttag gataactgag ttacacagca tttgacgttt tatgaagcgc cctcctggga 32821 atcatctgcc gaagctgccc cacagccttc agcagtgctg acctccgttg tgcagcggag 32881 gctcggaaat ctcaggcagt cacgtattac agctcaccca gagtcagcag ctaggaagcc 32941 acggaaccag gcccagggct tcggattctg agcttccact tgctgttaac tacgtacacc 33001 ccgtctcagg ctgtcccttc tctttggaag ctgactgcag gacccttttc tctggttggg 33061 tcttttgagg ggccagtgtt cctaagaaca cggatcttca gaagaactag cttattaagg 33121 cgttccatta ccaggatctt cctgttagtg ggttttgggt ttttgttttt tttttttttt 33181 ttttttaata ccttaaatac ttcttgactc tcttatcttt aaggcttaga aggaaaactt 33241 caggtgtgag cccagggtgt gtgctggggg tgggcgtgga ggtcctgtgg tgtcacggcc 33301 ctccttgttt gcccagggcc cagtcacaca gagcaggagc ttctccgcct gaccgacgac 33361 tcccccgtcg acctcctgtg tgcgcctgtc ccctcgtgcc tgccgtcccc tcagctgaga 33421 ccagaccccg ttatcctcca atctgctgat ctcattcagt ttgaggagcg ccctcaagag 33481 ccttctggtg ggtgactcca tgttgtcatg gggattttct tcggggttta aaacaaaacc 33541 acgatgagat actgatatag taaatgacgg aggtgggaag ggactacccc ttttcaggtg 33601 gaaaacactg tgctgattat tcgaccttct ttccccacct gggttcctgc agaatgggat 33661 gcacagcctg aggttgcctg gcgagcatca ggcaagggcg atttggcagc tctggagggc 33721 tggtggctcg gggctgtggg cgagtccaca ggggtgggtg gggcctcctg ggcttctgct 33781 ctcccagact ttgtgaaggt tcagctgctc tcaggcacac ctgtcaccag atgtacacgt 33841 cacccatggt gggggtccag agggcccaag ggctttataa aggggtgaat ggagggaggt 33901 gtctgtggct ggagccccgg actctgccca gtgatataac agacaggaag ctgagcacac 33961 tgagtcccag cctagtgcag gcattggcac tgaccctgtg ctcagggctg acagacacca 34021 tcacatttaa ctcttaccac ggctccaggt gagagccgtg gaataaaggc agctgcttta 34081 ttccacaggc tgcgtgacta aggcacagct ggtgccgtca ctagagaggc agtggccccg 34141 ggctgctggg ctagcaaggg cagagccggc attggaaccc aagccctctt gtcctgcggt 34201 gtctcctggc ctctggagag gatggcccat ctgccctgta cagatccctc cctgctcttc 34261 atctgttgga acccaactcc tccccactgc tggtgctgct ttccaggtca ccaggaacaa 34321 cccgtggact cctggcttgg cttttaccat cttggttgtt cttggcagcg ttcagcactg 34381 ttgcccagga aaagaactga ggctgttgac ttggaaaata ggagatgggg aagtaactgc 34441 ctgcggatag aaccagcctg tgcagtcaga cctagaccat gccctgctcc tgtgcagtgc 34501 ccccctaggg ttggtggcac attgatgtta gctcgtttgt ggggttgtgg cctttgtgcc 34561 ttcagtctgg gcaagttcaa ctgcatgcac cggacaagag aggaaaataa ttaaaacact 34621 gtgggcagtt ctgtcagccg ctcttctctt gagggtcagc ctaggcaggg agctggctct 34681 gctgtcccct ctgccactct caccttccat gggcctttgg gagcaggagc cctgtgccct 34741 atgcagtggg agtctaaggc ataatgatgc atctctttca agcagcgtgg ccctgtgtgt 34801 agagcaggtg aattacatgt ggctcagcca cagaaatggc agtccagaag cactgtggga 34861 aagctggctg tgctcaactt cctgggttct gagggtaatt ctcactctgc gtagtccttc 34921 cttatccccg gtaatagacc catggggagc ataggtcctc tctaaatgga gcaggccagg 34981 agctccctga cacccagatg ctccgacccc accccctacc acccacggcc tcccgcctca 35041 ggaagtctct aagtcgtgga tagctttatg ggtccttaat atctggcagg cagaatgaat 35101 gggcagagaa tccaaaatgg aaaaacaaaa agttttaacc tttgctcatt catttatagc 35161 tcaaaataga gatctttcca actttgtaat ccagtgggag aggtgggatg cagtttatag 35221 atacttgatt tctgtgccaa gttcactggc cacacaactc atatgtaaac tgtggcatag 35281 atgtagcatt tagaagcttt ttattactat tattatcttt tgagaaaata tgagcaagga 35341 agagaaaacc cacacttctt acagttcctg gatctgaatc tgtttgcttt tctgtgttta 35401 cagaaattat gattttaaac caagaagaaa aaatggagat tccaatacca ggaaagagca 35461 aaactctgac ctctgactcc agctcgtcct gcatctcagc agctgtcccc gtgcctccct 35521 gcccctcctc ggaaacctcc gagtcactgc tcagcaagga ccccgtggaa agcccggcca 35581 aaaagcaacc caaaaataga gtcaaattgg cagccaactt ttccttagca cctataacca 35641 agctttaact cccatttgga atatagaatt aggatgggaa aacactgtct gattctgcac 35701 acaaaagtgg gtttcaaaaa tacatccttt tctgtgtcat gaaaaccccc cgaagccatt 35761 gacttcatct tacctgtgtt ctatcatgtt tttcttttcc attgaacaca gctttgagct 35821 gaagtctttc ttttcttctt cctttttctt tcttttttca attatttgaa gaacttgtgg 35881 catttgttaa agaatcctta atatatttta ccaaattttt agtaactatt atgaaataca 35941 atggtgttcc aaaagaagaa agcactaaaa actcaactag caggaagcgg ttttgcttcc 36001 attgagccac gtcggtggtg tcatcctaca tgtagcataa tagagtctct cagcctttgc 36061 ttactggtgt ttccgacagt attaacccag tgcgtggcat gtctttgaag caaagcctgt 36121 cccgctagcc tgtgtgttca cacaccaaag gagaagggta ggcgaggggt tcggagtgta 36181 ttttcaggtt ggaatgtgaa ggttcctggc ttccatgtga cttgtaagtg tgccttgttt 36241 tttatttaaa ctatgtctgt agttgacaaa agtggcttca tcacaaattt ttttaaacgt 36301 ttctactttg gggttccatt taggagcttt ctagaaagtt gaggatgttt gaagcttccc 36361 ttctgtgttc cagtttgata tgggttaggt taaggaaaag gagatggtct cgatgtcatg 36421 gatttaaagt cagcatttgg attacaacac acattattgt gtgtcgagag gcagcattgg 36481 aaagagcctg attttctaaa atatgcatca agtgcataaa ttacaaatca gagctgagtg 36541 gggaggtgct tcagcagcag cgggatcaat acgatttcca ctgggaagag acaggaattc 36601 tacctacaca taggaaggcc aaggttgatc cacttaacct tgttattgta attttaaagc 36661 tggtatcggt agctgggaag ctttatgtgt gtgtatgtca gctcagcatc ttttaatcgt 36721 gtctgatttt actctcgtta cttctcgctg ttggaagcat ctctgggttt cttgtggcgt 36781 ctggtgtgga atggccttcc tctctaggtg cctgaggtgt ccgctgacgg gtggcctctc 36841 acccccctct agagaaacct gacatttaca atggattgta tttgtccggc aaaaaaggcg 36901 gattcattca tagagcagaa agaacctgtt ggcgtaaggt cttccactag ttagatggtt 36961 tcttttgggg aaatgtttat atttggtaac ttctagaaag ccagaaaatg gactctgatt 37021 tcatagcatt ttgataaaat gccacaaaat aagggctata gagagatttt tacaagtcag 37081 attttttgtt ccccaaatct tttaaatgaa gccagtgact cccagttctc aacacatggc 37141 cccagtcaag acagaacacc atggaaaaaa ccctgtaagt attatggcac tctccgaatc 37201 ctcctgaagg catttcctct acagacatag tgttacagga agtgaaatgc aaatgtgcaa 37261 ttttttttaa aggagctttt aaacaaagta ataataattg gtgttgagaa catcagttgc 37321 cttattctaa gatttcataa ggctgagttt gcacgcttgg acttcagttc tgctagcatg 37381 taaagagtgg tggactttaa aaccgtaaga ggtatcatta caagtcacct ggaacagact 37441 caaagaacag gttctttggg caacagacaa agaagaatca agttctgtgc tgtcccgtga 37501 gtgtgtctga gtaccattca ctggagttgc tgcttaggtc tgggacgtgt gtgtgactct 37561 taacaattgc tgtctgaaaa tgagagagaa gacttcggaa gcacattgtg ttcataatgt 37621 actccacaat ggccagtcca attgctatct atttttttat ccagaggctt aattaaatga 37681 tgtggtaaaa tgatgtttga gcattaagac aatgtaattc tttatttctg ggtggaaaga 37741 tataccggat taaatttatc tgtttaaaaa aatgacaaaa gttatcacca aaaccccctt 37801 tcccatcttg cactgtttgt tttggattgg gtttggggga aagagatgtt tttcttagtt 37861 gtctactttg tttgaacact tttgttgtgg ttcaagtgct gttttgtgtg ttgggaccaa 37921 acagttgtca ataaacttta caagcgagca tctattttga gtttcccaag tgagtggttt 37981 tgtctgtctg tctgtctgtg gggaggaatt acaaaccaat acagttctgc catctttcat 38041 tttgggtctc ctcaacttct ctccccttct ggaaatagaa ttgtgaggtg gagagtaaat 38101 ccgtctacat cttagaggcc ttttaaactc agcacacgcc gaagcaggtg gctggatggt 38161 gggcccttcg tcgtcaggat cctccaagtt ttccagcctt tgttccagct tataattaca 38221 tgtgggaaaa tacagtaacc acaaaaacct tccagttaga aaactctatt aattcctagg 38281 ctctctcttc ttgtgaaaga aatccagcgt gcccgctgtc tcgtttttgg tcactgctaa 38341 gcatgctttt tgaataagcc aatttggaaa agagttattc cacagataca gttgtgaagc 38401 cctcccatcc cctgagaccg ccattacccc gtcacccctg ccaccagcac cctctggcaa 38461 gaatttaaaa ggacccagaa gagacttgag taaattgctt cttgatgaca tttttttcaa 38521 gttcagcaat gttaaaagct gctaaagacc tttccaaaca tttcacgatg gcaaatttgt 38581 gcagaaaaag ttgagtagcc atcgtgcccc ttttgagcgg cgtcaggagg tcttgttagg 38641 tgtggactgg ggctggaacg gtggctttgc atttatcttt gtttgtttcc cacctaggaa 38701 ggtgattcag aggtgggggc ctagaagaca gctgtctggg ggagagcaga cctgtccgta 38761 gcagaggggg ctgcgctgat tgaagaagat ctgttccctc tctgttccct ttgccagaag 38821 gagcgttgcg ctttgggatt caccttcact ttctagctgc ttccatgggc agagggtgag 38881 tcatggcgca gaagccctca gcggcctcca cccacctgct gtccccattg ctgggggggg 38941 gggcggtggg cggcaaaact tcctcaattc tcctagtgtc cctggctggg ccttaaactg 39001 acaaagacat taataggaaa aaagcaaaca aattttactt cataccagtt ttgcgagagt 39061 cttcataagg aaatgaagac cccagagaaa tggctaagcc tgagtggttt tttgttttgt 39121 tttttggttt tttgagacag tctcactctg tcacccaggc tggagtgcag tggcataatc 39181 tcagctcact gcaacctcca cctcccaggt tcaagcgatt gtcctgtctc agcctcccga 39241 gtagctggga ttacatatgt gcaccatcac actggcaaat ttttgtattt ttagtagaga 39301 tgaggtttca ccatgttggc caggctagtt ttgaactcct gagctcaagt gatcctctgg 39361 gctcagcctc ccaaagtgct ggaattacag gcatgagcca ccacgcctgg ctgagaacgc 39421 ttttttttgg gggggggggg ggcgggggga aaagggctag agagcttaga aggtgccaag 39481 ccagagcacc aagagccaga actttcctta gcaagatgaa gagaagaata cacacgttcc 39541 gggaaggaac gccttggaag tgggggagct gaggtgcaga ggaagaaggc attgcagggt 39601 ggaaggggcc gttcttagag ggaaatcctg ctcagtggct gggctgctca gctcctcact 39661 ccgcacgagg tggtgaacca accaggagca tccgcagggg agcagtacca gggccagcag 39721 ggaacatatt gggcctgctt ggcaccgtct cttccatccg cctcaggcag ctcatgtgat 39781 ggcctccaaa ggcttcccag gtgcagggcc tgggcatgag ggaacccccg ctcctaccac 39841 ccacctgccc ccactgcagt ttctggtatc agcaagtgca gacaaatcca cactggggct 39901 gcagggtcgc cgcccttggt atttcagcag ccagagtctg atgcatcaag ggcaggtgtg 39961 gcaagaatca ggaaatacat tttttttttt tttgagacag agtctcactc tatcgcccag 40021 actggggtgc agtggcacga tctcggctca ctgcaacctc ctaccgggtt caagcgattc 40081 tcatgcctca gcctccgaac agctgggatt acagatgctg ccaccacacc tggctaattt 40141 tttttgagac ggggttttgc catgttgctc tggctcgtct caaactcctg gcctcaagtg 40201 gttcacttgc ctcggcctcc caaagtgcgg ggattacagg cgtaatataa tatagatatt 40261 atatatagat atagatatag atgtctctat aatatctata tataaataga tatatctatt 40321 atatataata gagatatata tctatatata atatatagat atctatatat tatatataga 40381 tatatatcta tatattatgt atagatataa tatagtcaca cagtcaacca tgcccggaac 40441 ccccaagccc ccactttttt tttccgagat ggagtctcac tctgttgccc aggctggagt 40501 gcaatggtgc gatctcggct ctttgcaacc tccgcctcct ggtttcaacc gattctcctg 40561 ccttagcctt ccgggtagct gggattacaa gcatgtgcca ccacacttgc taatttttgt 40621 atttttagta gacatggggt ttcaccctgt tttttaaatt ctctaaaagg aagaggacac 40681 tttttttgtt gttgttgttg ttgtgttgtg ttgtgtgtgt gtgtgtgtgt gtgtgtgtgt 40741 gtgtgttttg agacaaagtc ttgctgtgtc tccagactgg agtgcagtgg cgcaatctcg 40801 gctcactgca acctctgcct cccaggttca agcgattctc ctgcctcagc ctcccaagta 40861 gctgggacca caggcgccca ccaccatgcc caggtaattt ttttgtattt ttagtacaga 40921 tgggttttca ccatgttagc caggatggtc tcaatctcct gatctcgtga tttgcccgcc 40981 tcggcctccc aaagtgctgg gattacaggc gtgagccacc gtgtccagcc ctgtttgctt 41041 ttaaatggga gagacctgtt taaatccctc cgatgggaag agaaagaaca attggggcta 41101 cagtggtggg gagggaggga gatgaatgat gactctgtca ctaacaaccc ctgggaccgt 41161 gggcagtgga tttactgcca cttcctcatc tgtaaagtga gaagtgataa caggatctga 41221 ctcactgttt caagtgttta ttctgcctcc gcttgcaacc tcttgcaacc tccgcttgaa 41281 aagggtaatg gatttactgc tacttcctca tctgtaaagt gagaagtaat aacaggatct 41341 gactcatcag gctgatgtga ggagggaacc agccagtgtg agcacattgc tcaaggcata 41401 gagataagct acaataaatg ttattcccat tatcaacctc ctccatggct tctaaaagga 41461 gcaagtctag gtgtgcagga ccccacagtc tgcaaggttt cccagcttgg aaggtggccc 41521 tggtaatctg gaagcgtctg tgacagctta aataaattgg attttttaat gttttctttt 41581 tttaaaaaaa tcagtaggaa ttacaggaac atacagtagg tgtacccaga gcttagcgga 41641 agtaccagag gcaggatttt gatgagggtt tgatctcctc tggttcgttg gatctgtgag 41701 tgactgctgt ataactggcc tgaggtcaga cctcgctgac ccccccactc ccctctcccc 41761 tttttttttg agatggagtc tcgctctgtc gcccaggctg gagtgcagtg gcacaatctc 41821 tgctcaccgc aagctctgct cactgcaagy tccgcctccc gggttcacgc cattttcctg 41881 cctcagcctc ctgagtagct gggactacag gcgtccacca ccacgcccgg ctaatttttt 41941 gtatttgtag tagagatggg gtttcaccgt gttagccagg atggtctcga tctcctgaac 42001 tcgtgatccg cccgcctcgg cctcccaaag tgctgggatt acaggcttga gccaccacgc 42061 ccggaccccc caagcccccc cttttttttt tccgagatgg agtctcactc tgttgcccag 42121 gctggagtgc aatggtgcga tctcggctct ttgcaacctc cgcctcctgg tttcaaccga 42181 ttctcctgcc ttagcctccc gagtagctgg gattacaagc atgtgccacc acacttgcta 42241 atttttgtat ttttagtaga catggggttt caccatgtta gccaagttgg tcatgagctc 42301 ctgacctcaa gtgagcctcc cacctcggcc tcccaaagtg ctgggattac aggcatgatc 42361 cacagtgccc agccttgacc cacatttttt atggggaaaa gagggcggtg ggccctttgc 42421 agcctgagtt ttcactgcac gttggagcgg gaggggtctg tggtgggtgg aattgggtcc 42481 tccccaaatg catgtctacg tggagcctca gaatgtgacc ttggttggaa agagggtttt 42541 tgcagaagta attaaggaaa gggtccaaat gagatcatac tgcattcggg tggccctcac 42601 tccaatgact ggtgtctttt taagatgaga gggtgcacag aggcacatat acagcggagg 42661 cggccacgtg gagacagagg tgaagactgg agtggtgcag ccagggaact gccaaggtgc 42721 caggagccac caggagctgc aagaggcaaa agaagattct tccccacaga gggagcagag 42781 caccgcagac aggttgattt caggcttctg gcctttagga ctgtgaaagg agacggttcc 42841 actgttttaa gtcacccagt agaaggtcat tcattatggc atccccagta cacgaagagt 42901 ctggcttcat gattggtcta ggttggtgaa ggaggacacg gggtcctctg aggccattgt 42961 ggtggcactg agtagcccaa gggacccttt tcagcatcag aatggtggga agtggtgtgc 43021 agaccaaagc cgcgaacacg cagagtggga ggactggggt ctagcagcct gctgggaggg 43081 cctgtccaac caaagacctt ttgatctaca aatttttttt tttttttcga gacgaagtct 43141 cgctgtgtca cccaggctgg agtgcagtgg cgcaatctca gctcagtgta acctctgctt 43201 cccatgttca agagattctc ctgcctcagc ttcttgagta gctgggatta caggcacaag 43261 tcaccacacc cagctaattt ttgtattttt tggtagagat ggggtttctc tgtattggcc 43321 aggctggtcg cgaactcctg gcctcaagtg atcagcctgc ctctgcctcc cgacgtgctg 43381 ggattacaga tgcgagccac cgcacccagc caatttacaa atattttgaa aatgctactt 43441 atgaatggcc caataatttt acttctagga gtttatctta aggaaaaaat aaaaaattca 43501 gatgaaacct tatcccccta gatgtttatc aagacactat tttttttttc cagggggtgc 43561 gatcttgctc tgttgcccag gctggaatac agtggtgtga tcatggctca ctgcagctgg 43621 aacctcccag gctccagcaa tcctcccacc tcccaagtag ctgggatcac aggcccgcgc 43681 caccgcatat gcccagctat ttgtattttt agtagagaca gggttttgcc atgttgccca 43741 ggctggtctc gaactcctgg actcaagcaa tcctcccgcc tcagcgcctg gcccaagaca 43801 ctttttttgt ttttttttga gatggagtct tgctctgtca cccaggctgg agtgcaatgg 43861 cacgatctcg gctcaccgca acgtctgcct cctgggttca agtgattctc ctgccttagc 43921 ctcccaagta gctgggatta caggcaccca ttaccatgcc tggctaatcc aagacactat 43981 ttataagaaa acctagaacc agtctacaca tccaatagtg aggaaatgat tagatgaatt 44041 acagtacatc cacaagatgg attactaaac agacattggc atgatttatg aagagtgttt 44101 aatgtttaca aagagctttt aatgatgtgg aagatggtgg tataataagt aggaaaaaac 44161 tggttataaa tgataccaca atatgatgtc aacgatgaaa aaatgcacaa aaaaagactt 44221 ttttaaaacg tcgaaacatc agagtggtct ctagttggag gaataatttt tttttttttt 44281 aagacaaact tcttcatact ttttgtactt ttaaaatgat ttactatgta catggattac 44341 ttccataatc aagaaaaaaa gtaaaaactt ttaaaatgct gtgaacgtgg tttccatttt 44401 tattttctct gatgggtgac tgcacatcag catcttgagt tatgctaact acttaggctg 44461 gtcacgacac acacacacat acagagaagt cacacaggct gtaccgggct ggctgttgca 44521 aatgtacatg aattacacgg gttgaagttg tttggcagct agttaacata tgccacccaa 44581 tgacgggaca caatctggtc acacgctgcc tctcctgcct gttgctgttc tttacaaagg 44641 tccgtgtgtc gggctggcaa agtggcatcc tgctggcatc tctgtggtgg ccccggaatg 44701 gtggagtgcc caggtcagca tctactctgg gtcccattct caactgctct ggccttgaca 44761 tcagagcttc cggttccctg ctggctcttc tgcacactgg atccatgcca gtccaggccc 44821 cctccaggca cacacctgag gcagactgag agcacaatgg acgctgtcct gatgtgaact 44881 gtgcccagac acaggatggg gggaagcggg agatgtgacc acgacaccac acacaagctc 44941 gatctttaca gatgggccct ctgggactgc tatatgggtg gtgcccgaga gacacggggg 45001 ccatgtggag ccctccaggg cacagttgct gggagaggac agacgggagg ggacctgtgc 45061 ccatgtgcct agctctgaga gcagagctag gagggtggat ggggaacgtg tccgacatgg 45121 tgctagattg ttgagaagtc acgactacac acagctgggg agcatgcccc tctcccagtt 45181 cattcacacg gtgggttaga tacagctaga gagaagtgga gatatgcttt tttgagggag 45241 gacctttcct ggtgtttgga taaacagtga gacagactcc tgggttcggc atgctctggg 45301 gaggaccatg cgacggtgag agactcccta ctgctacact ctgtacaggg tggccaggcc 45361 cgcctgccgt gcccatggcc tctggtgaca gtcacgggct tgccttcaac cccttctgcc 45421 ttggagagct cttgggcccc ggtgaaatgg gggctataaa gacaagaggg tggggggggg 45481 tccccatggc ccagggcacc ccacgtgggg gctacatgat gttgcactgg gtggccccgc 45541 aacaatcctt gatcagtgcc agccccgtct tggccaccgt gggcttgcta ggtgggggct 45601 ctgctttctt ctctgcccgg gagcctgcag caagatgtgg gcacagggag agggggagaa 45661 aggaacagag agagagaggt gaggtgagca gagccctgac catctgcagc ccagccccac 45721 tgcaacccct ggccaggtct tgttccaggc cttgttcagg tggtatggag agctgatgca 45781 catttcatat gatcctcagc tggtcctcat tttatagtcc tcattttata tggtcctcaa 45841 tgctcagctt ccaaccctaa ctttcagccc ttctcctagt cagctgttat catcattgta 45901 cctgccaagc ctagcacagg gcctgacatc caagaagtat ttggtaaata gataaatggg 45961 aaattttgtt caagtcactt caatctgagg gctcatattt ttcaattttt gaaaatgctc 46021 atacattgtc ttttcagata cctccacctc ccattctctg ttttatctcc tagaatgcct 46081 atcacatgta agttggacct tctcactcta tcctctcggt tttataactt ctctttcata 46141 tttctcatct ctttttctct ttgtgctgca ttctgtgtaa ctttctcagg gctgtcttcc 46201 aatgtataaa ttctcccttc agctgtgtct ctgtatttaa tccattgagt ttttacttta 46261 ttgataaaaa aaaatttcag aaggccagtt ttttgcatgt ctacttattc ttttttcata 46321 gcatcttact atttcatttt gggttttatt ccttctctta gctctttgat catttaaaag 46381 tacttagttt attttccttt ttaaatagtt ctatattttt ggttcttggg aggacattct 46441 ctttggtgca tctactgatt ctcctcatgg tgattcagtt cttcatgggt ttttcttttc 46501 ttttcttttc ttgagatgga gtctcgctct gtcacccagg ctggagtgca gtggtgtgat 46561 cttggctcac tgcaacctcc gcatcctggg ttcaagcgat tctcctgctc agcctcctga 46621 gtagctggga ttacaggtgt gcgccaccat gcctggctaa tttttgtact tttagcagag 46681 acaaggtttc accatattgg tcaggctggt ctcgagctcc tgacctcatg atctgcccac 46741 cttggcctcc caaagtgctg ggattacaga cataagccac tgtgcccggc cgggtttttc 46801 atttttattg agggctcatc ttcaaggggg ttgctttttt ctgtgagaat actgtatacc 46861 atgggatcca gaagaatccc tactgagcag ttttgcattt gctaaatctg ctaggatccc 46921 agagatttca ctggttttac aatagttttt atttttcaac ttggaattcc agtaccagat 46981 gagtagtgta aagttggccc cttgcatgtg gtgcatgctt ggtgttttga tttctcgcca 47041 gaatcatttc ttttttccac ccataatcca atagagaagg caagtttcct tgctcctcct 47101 ccatgctgct aagtggagtt tttctacctt gcctttcatg aaagggtcca ctgaggatgc 47161 tggccttcgg cagtgatcac gtgccttgtg gttataggag ccagcagaag agatcattgc 47221 ccttctcttt ccttactccc tccaaccatc tttcacagga tgttaatgaa aactgggttt 47281 gtagttttat agacagtggc agcttttggt ttttgtcttc ccagcaagat ttcaagtccc 47341 tcttcctcta acagctcatg tccctgtggc tgcaggagtg ggcatgttat acattcctgg 47401 ccaatcccag tacccccagc ctcttggtga tggtaatgga tctaagaggt gggaacataa 47461 tcccagaagg gcgggtgaga atatttccct gagacttatg tatttcgata ccagggaaaa 47521 gaacctgtgg tattgcttca gagttgctaa gctatggttc tagggttctg gtcccatgga 47581 gaaggcctgt gttgtaggaa aaaatgatgc ttacatgcaa agagaggcag agatgagaaa 47641 aagaagaaaa ggaggggagg taatagtgtt cagtccttga ggctctagtt cttgttgctc 47701 tttctccaac aaattccttt ttttgcttag accagtttga attgggcatc catttctttt 47761 ctttctttct tttttttttt tttgagatgg agtcttgctg tgtcacccag gctggagtgt 47821 agtggtgtga tctcggctca ctgcaagctc cgcctcccca gttcacgcca ttctcctgcc 47881 tcagcttccc gagtagctgg gaatacaggc gcccaccacc aagcccggct aattttttgt 47941 atttttagta gagacggggc ttcaccgtgt tagccaggat ggtgtcgatc tcctgacctg 48001 tgatccgccc gcctcggcct cccaaagtgc tggcattaca ggcgtgagcc accgcgccca 48061 gccaggcatc catttcttct aactgaaaga ggcctcacta atacagcagt taaacacatt 48121 tgagaaacgc tgggtcaaca aagaaaagac agtttcttga cagcaggatc catcatcact 48181 ttaatatcct aatggacact gggactctcc aaatacaggg tcaagtaaac agcatttctc 48241 aaacttattt catataaaac cctctttgct ctgaggatct gatgaggtta atattctgag 48301 acatatgctt ctggcttctc cagagattcc agacctaaga gaggccagag ctccctgagc 48361 ctcagtagct acccaattct ttctatgaaa tgcaccaaac tttgcaatta gggaggaaca 48421 actaccagga ccaggtacgt agtttgcaga gttcaatcca atatgaaaat gtagggcccc 48481 atgttccaaa cgtattaaca atttctagac agcaacagga gaacattaaa ccaagcatgg 48541 cacccttcta ggcatgaggc cccaggcaac tgcctacagg cccatgaggc tggtcaggca 48601 tgaggcccca ggcaactgcc tgcaggcccg tgaggctggt cctgcctgct gtcatcccat 48661 gctgaaatct gatctctaga tctgatctct agaaagcatt ttgttgcgcc ccttatggct 48721 cccagacttc tctgaaatgg cctttttttt tttttttttt tttgagacaa agtcttaccc 48781 tgtcacccag gctggagtgc agtggcccaa tctcagctca ctgcaacccc cacctcccgg 48841 gttcaagcaa ttctcctgcc tcagcctctc gagtagctgg gatttcaggt gtgtgccacc 48901 gtgcccagct aatttttata tttttggtag atacggggtt tcaccatgtt gaccaggctg 48961 gtctcgaact cctgacctca ggtgatccac ccaccttggc ctctcatagt gctgggatta 49021 caggcgtgat ccactgtggg cctggcctga aatatacttt tctcttttgt gggggagggg 49081 aatgaataat gtatatactc actggctgga gatttggtta ctttgtctag aggttttaac 49141 ttgattctca ggaaatcagc catttccttg tcttcttctt gctccctgtg aaagtagatg 49201 ggtaaaagaa aagaagatac acagcaagca aggaaagcat caaggaacca atgaaataaa 49261 ccaataggtt aagcaacagg ctttgtggat ggggcatgta tgttgtcacg tgttatgcgt 49321 gtacagtatg tatatgttgt atctggtgaa cctgctctga accagagaca catatcaatg 49381 atggggtaaa tgcttaccca gaggtaggta tgtgacccca gggcagaggc agggacatta 49441 caggaccctg ctggggggtc agtgagtgag gcagcacctc tcctgacctt gtattttgaa 49501 gccaccctat tcacctgtgc tcccacaaag tatgaggcag gtctcagaga gcgagtcacc 49561 ccagctcttc caaggatcac caagataaag taggtgggtg cagacttgct gaactgcctt 49621 gaagagacac ggtctttctt tgccacccag gctggagtgc aatggtgcaa tcgtaactca 49681 agaagccttg aactcctggg ctcaagagat cctcccacct cagcctccca agtagctgag 49741 actacaggag cactccacca ctcctagtta attaaaaaaa aaaattgtag agataggggg 49801 gtctcactat gttggggcag cttgtctgcc tgcccaagac tggcattgat aatttcttct 49861 agatctggaa gcagaaccgg aggtaaggag aatataatat ttgttgtagt tttctgtatt 49921 ttttgttttg ttttgttttt gagatggaat ctcactctgt cacccaggct agagtgcatt 49981 ggtgcaatct tggctcactg caacctctgc ctcctgggtt caaatgattg tttcctgcct 50041 cagcctccca agtagctggg attacaggca tgcaccacca cgcccggcta atttctgtag 50101 ttttagtaga gacggggttt caccatgttg cccaggctgg tcttgaactc ctgacctcag 50161 gtgatccacc caccttggcc tcccagaagc gctgggatta caggcgtcag ccactgtacc 50221 cgaccataga atataatatt tggataggag ggaaaagttg gtttctctca ttaaccagaa 50281 gtccattgca gcatgaagaa gggcagtgga actgctagtc ccaggaccct tgaagaaagg 50341 tctctgagaa cacagacaat aaagagcaag ttgccctctg ggttggaagg gcctttttcc 50401 ttgtgaaagg ccctaaaagc tttgctctgg tgtaaggaaa caaaaatcaa gaaccaacat 50461 gtacagcttt ccacattttc aacatttaca acatgttttt acactgcagc tccagggctg 50521 tggttgtcga ggtcaaggca gcagatgcaa agagaagacc ctgatactta gaaaagggca 50581 ttggcgccac taaggggaag ggaaagggga gaacaaagct gggaggttgg gaggacagag 50641 gttgggagca agtcagttca ggtggcagag tttcggtaga aactgatagc agaagtttca 50701 agacattgcc ttcaggagtg acaggatgtg aaggaagctc tgccatgtat tatgggatgt 50761 gactcctccc tggtcacttc ctggaacaac attccctcaa gaggcacaga tttgaaattc 50821 tcttatatac aaaatgaatt ctcaagccag aaggctgggg ttccttggga ttcagtgtcc 50881 agagactgtg gatcaaggcc atgcctgtta ggcaaggctg tgagtcccag gagcagaaga 50941 tgggttctgg aaggaagcag aaggttgctg tcctgcagcc cagtgatcgg gtacccgcag 51001 tgcactcacc ttaaccgctc gacctccgca tcgtccacca ggaagtctct cttctgcacc 51061 agtttgtgga tcttctggat tagctcatcc tctctctgct tctgcagttt ggttttttct 51121 ttttctggaa agaagacatc aacaaggaga cagatgagca ttttgggagg ctgaggtggg 51181 cggatcactt gaggtcagga gtttcagacc agcctggtca acatggtgaa acaaccccat 51241 ctctaccaaa aatacaaaaa ttagccaggc gtggtggcac acacctggaa ttccagctac 51301 tcgggaggct gtcagccacc acctcgggag gcgggagaat tgcttgaacc tgggaggcgg 51361 aggttgcagt gagctgagac tgcgccactg cactccagcc tgggcaacag aatgagactt 51421 catttcaaga aagaaaaaaa aaacaaaaaa ttacagagag acagatgaga gggtgaccac 51481 taaaagcact tgagctggcc gggtgcggtg gctcacacct gtaatcccag cactttggga 51541 ggccaaggtg ggcggatcac gaggtcagga gatcgagacc atcctggcta acagggtgga 51601 accccgtctc tactaaatac aaaaaattcg ccgggtgtgg tggcgggtgc ctgtagtccc 51661 agctactcgg gaggctgagg caggagaatg acatgaaccc gggaggtgga gcttgcagtg 51721 agctgagatc gcaccactgc actctagcct gggtgataga gcgagactct gtctcaaaaa 51781 aaaaaaaaaa agcacttgag ctgagcggcc aggattgatt ggagacctga ctctgccctt 51841 ccaccctctg gatgaacttg gagaagcaac ttgacctctt caagcttcag ttttctcatc 51901 tgtaaaatac acagtgaggc ttaaaatggg agtaatgggc tgggcatagt gcctgccacc 51961 tgtaatccca gcactttggg agaccaaggc aggaggatca cttgaggcca ggagttcaag 52021 accagcctgg gcaacatagt gagaccctcc tatgtctaca aaaattttct ctgtaaatta 52081 gccacatgtg atggagtgtt cctgtagtct cagctacttg ggaggctgag gtgggtggat 52141 ctcttgagcc caggagttct aggctgcagt gagttatgat tgcaccattg cactccagcc 52201 tgggtaacag agaaagaccc tgtctcttaa aaaaaaaaaa aaaaattgag agggcagggt 52261 gagtgtgggg agggagagca tcaggaaaaa tagctaatgc atgctgggct taatacctat 52321 gtgtcgggct gacaggtgca gcaagccacc atggcccacg tttacctatg taataaacct 52381 gcacatcctg catactaaat tgcatacaat tttgaaatgc agtagtgtta gtaaatggct 52441 tagtggagtg cttggccttg gctgagagcc ttcagtaaat ggtcatgaca aagccagggt 52501 cttctgcaaa ggcagggggt gaggagaggc tcagataggg gtgctgggag ggaactagtt 52561 gtttttttcc ctttggtgaa ggtaaaccaa tttatcagaa gaaaatcatc catgccaatg 52621 acactaaccc tgtcccaaac accaagtgct aggatgccaa agagaaaaaa taaaaaaaag 52681 agagagggct gaagatgagg tttgagaaaa ttaagaggca aaaacttaaa cccacttggg 52741 ctctttctgc tcaaatccct tcagaataaa aaatgtggga cagcttgtct gcctgcccac 52801 gactgacgtt gatcattttt tctggatctg gaagcagaac caggggtaag gagaacataa 52861 tatttggata ggacagaaag ttggtttctc tcattaacca ggagtccacc actgcacgaa 52921 gaagggcaat aggcttgcca gccgctcccc attcccagcc attggtgcat tggatgatgg 52981 tgtcactgct cctgccgtgg taaatcttgt gggttgggtc tgattcccca aatcactcgt 53041 cagcaccgca gagccgctcc ctgccctctc caagcacgca ccaacaccca tcgactcacc 53101 aggacccctg ctcactcagg agctcgtggt gtgactttac tctccaaata accattgaca 53161 gcactctgac attcccaagt gccctccact gctctctaga tcttagccag aatcacccaa 53221 caactggctg ggcccccaac atctgcaaac ctctgtctcc aagccattca tccatgtgtt 53281 ctgtcactca ccaatagagg cattccccag cctcagctgc agcaactccc aggagaaacc 53341 acctactcac acaagtggcc tgcgtcagct cacagaccta tgactagccc aagtctccaa 53401 gtcattggcc aatgctgctc acgtgcgtgt gtgccacata cctttcccat catcttccac 53461 ttgcctacag gtgccaccca ctgaaatcct aacctccatg gtcatagtat taagaaatga 53521 ggcctttggg aagtgagtag attatgaagg tagaaccctc atgaatggaa ttagtgccct 53581 tataaaagag acccaaggga gcttgtttgc cccttccacc acacaaaggc acagtgagaa 53641 ggtaccatct ataaggaaca ggccgtcacc aggcactgta tctgccaaca ccttgatctt 53701 gcacctccca gcctccagaa ctgtaataaa tgttgtttac aagcccccgg tctagagcaa 53761 gccacagaga ctaaggcacc cccaaagttc atgtgtgccc tatttggaaa tagttttccc 53821 cagtttatat ctcttctatt tggaaatact ctatttccaa agtaatttgt aagttgcttt 53881 gcacatgtaa ttataatagt taagctaaaa tgaggtcata cttattagca tggaccctaa 53941 acccaatgac ttgtgtcctt atgagaagag gaaagagacc caagcatggt ggcctgaacc 54001 tgtaatccca gctacttggg aggctgaagc aggaggatca cttaagcctg ggagttctag 54061 accagcctgg gcaacacagc cagaccctgt cttttagaaa aaaaaaaaaa aaaagaggag 54121 aagagaagac acacagacac acacagggtc acgtgaagat ggaggcagag attgaagtgg 54181 tgcaaccaga aggcaagcaa tgccaagaat tgtgtgtgtt actgttagag taggtagcca 54241 ggcagccatg agcaagaaag gagaggggat ttccccccca aggaatgcca ggcaaccatc 54301 aggtgatgtc aggcagttgt taaaattgtc tctctaaaat aataattggt cacagctggt 54361 gccagggaaa ggccatctcc caacagacag aaaacacctg aagttatcag cggtttcccg 54421 atgagatctc aggagttggg caactgggct caagcatgca cattaagagg caaaatggcg 54481 gagtctaact ggtatatgac cttcctctag gaacactcaa ctggtgaggg aaaaacatct 54541 caaataatca ggcccatgac ttctgtaaat acactgtgtt tgcagcctct cccaagcgct 54601 ggcaggccac tgcgcatgcg cacaacctgc cctgagggaa aatgaaggga ggagagacat 54661 aaaaccctgg aatatgccaa catataaaac cccggaacat gccaacatat aaaaccccaa 54721 atcaaaggtc aagcggtgca cttggatctc tcaaatcacc cacttggccg tcttccaagt 54781 gtcctttact gcctttcatt cctgctctaa aacttgttaa taaacttttg ctccttctct 54841 aaaacttgcc tcggtctctc actctgcctt atgcccgctg gtggaattat ttcctctgag 54901 gaggcaagta tcaagttgct tcagactcgt atggatttgc tactgctaat gatacaggag 54961 tgaagaagaa attatttagg cagatagggt aaggaagtcc tcggtaaggt tttcctttta 55021 atgaaaagca acccccaaat cattttcttt tctaacaagg agcagccagt aaaatcgagc 55081 tgtagacata gtcaagggag ctggaagctt acacgggtga atgctggcag ctgtgcccat 55141 aggaaaaggc cacctggtct aggtatgttc aaaatggcgg ctccacgctc ccttctcttt 55201 gccagccatg tgtacagtaa ggagaagaca acatggcgcc gcccaagtgg aaagttcatt 55261 tgcatgataa gattagggtg gggtggccag ccttcccatg cactatgcaa acgtcacacc 55321 tgctccaacc aatctgtggg ccctatgtaa atcagacacc acctcctcaa gcttgtctat 55381 aaaatttggt gcagtctgcc acaggccgga attcccattc agggacttct cgagagagac 55441 cgagagagct gttccctttt ctctttcttt tgcctattaa acctccactc ctaaactcac 55501 tcctctgtat gtcagtgtct gtgtccttaa tctgcttgga gagagacaac gaaccttggg 55561 tatttacccc agtcaacaac actacttcac taacactacc agaagctaag aggcaaagga 55621 gtctcctcta gagccttagc agggaacaag gccctgttga cacctcgatt ttggatgtct 55681 ggcctccaga accgaaagag aatacatcca gaaccgagag agaatacatt tctgttgctt 55741 aagccatcca gtttgcacca atttgttagg cagctctggg aaatgaatgt aatgccctag 55801 ggcttcagga gactccactg tccttccagt aattgtttgt tttgtgctta cataagcttg 55861 gatgggtttc tgtacctttc agccagacta atccttggag aattgtgaaa tgagagaggg 55921 gaacaggcct ggcggttcct aaccgtccct gcttatcaca atcaggtgca aagctttaaa 55981 aattgccaat gctcggctag gcgcggtggc tcacacctgt aattccagca ctttggtagg 56041 ccgaggcggg cagatcacga gatcaggagg ttgagaccat cctggctaac acagtgaaac 56101 ccagtctcta ctaaaaatac aaaaacaaaa tttgccgggc atggtggcgg gcgcctgtag 56161 tcccagctac tcaggaggct gaggcaggag aatcacttca cttgaacccc ggaggcggag 56221 actgcagtga acccagatct cgccactgca ctccagcctg ggtgacagag caagactctg 56281 tctcaaaaaa aaaaaaaaaa aaattactaa tgctgagatg gatcaggtga ggcaggagaa 56341 tatggtctgg aggcagggaa cctagggcca acttcctaga actaaatcaa aagaaaaacc 56401 ccaactttcc acacataagt aacaaaagga ctggaggcta ctccctttgc aaaccctcct 56461 ccttttctgt gtgtcacatg agaaattgaa agtatatctg attggtcgca gaaagcaact 56521 gcaaaagctt tctgcgacca atcagactaa ttgtgggcca ctacttcatt tacatagggt 56581 gtacaccaag tagccaacgg gaaacctcta gagggtattt aaacctcaga aaattctgta 56641 ttggggctcc tgagccccta tgctcaggca cactcccacc ctatggagtg tactttcatt 56701 ttcaataaat ttctgctttt gttgcttcat tctttcctcg cttattttgt ttgtgtgtgc 56761 attttgtcca attctttgtt caaaacacca agaacctgga caccttccac cagtaacaca 56821 gggactccct tttggggtct gtgggagacc cccaccccaa gcatggaaat aaaggaaaat 56881 cttgagttcc ctcaaaggaa attctaggca cctagctagc cttgaaaagt caataagcaa 56941 cttatcagta agaaggtaat agtagcctaa aacaatagcc aaggaagtta gaatctggag 57001 atgttttgtt ttctatagaa actgaagata aacatcttag catatatgtc tgagttgttt 57061 ttcagaaacc caggacccta ccaaatggat ccactggcat atagacgtca gataaaggaa 57121 gactaaggac tgaattctga ccatctttcc ttgttctaag tgtcttccag agaggcctgg 57181 agggagtcac acccatgggc cagggatgcc attcttttct gctgaccccc gatttttatt 57241 atttttattt atttatttat ttgaggcaga gtcttgcttt gtcgcccagg ctggagtgca 57301 gtggcacaat cttggctcac tgcaaccccc ctctcccgag ttcaagcaat tctcctgcct 57361 cagtctcccg agtagctgga agtacaggca ggcgccacca cacgcggcta atttttgtaa 57421 ttttagtaga gacgggggtc tcgccatgtt ggccaggctg gtctccaact cttggcctca 57481 agcaatctgc ccacttcggc ctcccagggt gctgggatta tacgcgtgag ccactgcacc 57541 tggccttgac ccccgatttt taaacaaagc ttctcttcct taaccaattg caaattagaa 57601 aatctttcaa tctacctatg acctgtgagc cccagcttca acgtatgcag cccttttagg 57661 ccaaaattga tatgtaatct caatgtattg atttacaatt ttgcctgtaa cttctgcttg 57721 actgaaattt acccctgcct ttaaaaacct gcactgaaat cgccattgca aaattataag 57781 agacagtgaa agagatctga cctaatcaac tccatcttgc ttctaacctc caagctgtcc 57841 ttgttcatta ctgggcatag gctgaactaa ctatgggagg aacttacttt atcatttaaa 57901 acaaagacga taacagcctt ttcccaaaac aaactccctt cttgcctggg aactagactg 57961 catttgtagg tgatacggtt aggctttgtg tcgccaccca aaactcatct tgaattataa 58021 tccccatcat ccccacgtgt caaaggagag accaggtgga ggtgactgaa tcacgcaggc 58081 agtttcccca tgctgttctc gtgatagtga gcgagttccc atgagatctg atggttttat 58141 aagggttggg cagttcctcc tgcgttcatt ctccttcctg tcaccttgtg aagagagtgc 58201 cttgcttctc cttcaccttc caccatgatt gtaagtttcc tgaggcttcc ccagctatgc 58261 cgaactgtga gtcaatgaaa cctctttcct ttatacccag tcttgggcag ttcttgacag 58321 cagtgtgaaa atggacttaa caaattaacc agaagattag gaattatgat agtgagtgag 58381 ttccatgaga tctgatggtt ttataagggt tgggcagttc ctcctgcgtt cattctcctt 58441 cctgtcacct tgtgaagaga gtgccttgct tctccttgac cttccaccat gattgtaagt 58501 ttcctgaggc ttccccagct atgccgaact gtgagtcaat gaaacctctt tccgttatac 58561 ccagtcttgg gcagttcttg acagcagtgt gaaaatggac ttaacaaatt aagcacaaga 58621 ttaggaatta tggtttagga atcatgcagc tggaggctac aagatgctga ccctccccag 58681 attactcctg aagataacac cactattgta aaaaccaaga tcaatgcttg aggtattttg 58741 cagaccgtgc acttgatggg tcagtgggta ccacccagat aaagtggctc atctgacctt 58801 gtggccccaa cccaggaact gactcagtgc aagaggacag cttcggctcc ctgtgatttc 58861 atctccgacc caaccaatca gcactccatt ctcactgcac ccccaccccc atccaccaaa 58921 tcgtctttaa aaactctgat cctggaatgc tcaggggcac tgatttgagg aataataaaa 58981 ctccagtctc cctcagccgg ctatgcatga attactcttt ctctattgta attccccagt 59041 cttgataagt caggtttgtc taggcagagg gcaaggtgaa cctgttggac agaacattac 59101 ctataagcca ttgaggaggt caggcttgaa gcattagctg cccagtcctc ctggcttgga 59161 gccttgcaaa taaactgttt cctttttacc actgcaaaaa ctctggggag acagctggtc 59221 ttactgtgcc aggaaagcag accctagttg tattagtcca ttttcacgct gctgataaaa 59281 acatacccgc aactgggtaa tttataaaga aaaagaggtt taatggactc acagttccac 59341 gtggctgggg aggcctcaca atcacggcag aaggcaaaag gcacgtctta cttggcagca 59401 ggcagcacag aaaatgagaa tcaagcgaaa ggggaaaccc cttataaaac catcagctct 59461 catgagactt cttcactacc atgagaatag tatgggggaa actgtccccc atgattcaat 59521 catctcctac tgggtccctc ccacaacaca cggggattat aggagctaca attcaagatg 59581 agatttgggt gggaacacag ccaaaccata tcacaagttg agttctatta caatgccaga 59641 gtccaaagcc aacaatgcca gatcaattgc attggggttt ctagtgggtg gagaatagac 59701 tttggtactt ttccctcatt ccttcgttgt tttatgtttg acatttaact ttttttggta 59761 acagctttat tgcaatatga ttcacatacc atgcagttca tctgttccaa gtgtatttgc 59821 agagttgtgc aaccatcacc acaatccatt ttagaacatt ttaatcaccc ccagaagaaa 59881 ccccataccc actagctgtc gctctctttc aactaggaaa gtctattcac tcgtttatcc 59941 ttagggccta cacagtgctg ggcacctagt aaatgctcac gaaagctaga ggattcagtt 60001 ctactgtcat tgcgcacact acccttacaa caagactgtg tggaagactc ctacaccttg 60061 cctttgttat ttatgggata ttcacaacaa ccatgcgggg ggtagtatta ttggcctcat 60121 tttcccaaca aggaattcaa ggctcagaga aattaagtcg cgaactttca cgaagttcac 60181 ctgcctgcta aggctgaatt caaacccaga ttgctgcact ctaattacta aagtagtagt 60241 tcttatgtgg gagctgtaag ctgcaaagtg tctgagaagg tctcaatcaa tctggaggtt 60301 tattttgcca aagtcgagga ggcacctggg aaaaagagac acaggcagca ataggatccc 60361 cggcctgtgc ttttttctgc agaggatctt aaggacttca atatttaaag gggaaagaga 60421 gggcaggaag gggaggattg cattctctgg ccctcagtga atttgcattt tacataggat 60481 aaaggaaacg cggagtagag gaggaagtca aatgtgcatt tgtctccggg tgggcggagg 60541 gacgatgatt tatagtcttg tcttagtccc ctacctgtta ggataagctg ttaatttaca 60601 ttgacagggt gagggaggcc cctgggaaga taggtggcct ttctatcttg cagctatctg 60661 tttaggaaaa aaagaaaggc agttttggct gggtgtggtg gctcacacct gtaatcctag 60721 cactttggga ggcttgaggt gggtggatca tgaggtcagg agatcgagac catcctggct 60781 aacacggtga aaccccacct ctactaaaaa tacaaaaaat tagccgggcg tggtggcagg 60841 tgcctacagg ctcccagcta ctcgggagac tgaggctgag gcatgaaccc gggaggcgga 60901 acttgctgtg agcagagatc gcgccactgc actccagcct gggggacaga gcgagactcc 60961 gtctcaaaaa aaaaaaaaaa agtaggtttt tgtatgactc atagttccca agcttaactt 61021 ttccttttgg tatagtgaat tggggtcctg ggactttatt ttcctttcat agggcaatgt 61081 tgcacacccc cctgacctcg cttggggtac attcagcaac atctgtacat ctatagacat 61141 ttttggttgt cactgtcaga agggtttgaa ctacagtgac tccatcttga ataggggctg 61201 ggtaaaataa ggctgagacc tactgggctg aatgcccagg aggttaggca ttctaagtca 61261 caagatgaga taggaggtcg gcacaagata caggtcacaa agaccttcct gataaaacag 61321 cattagtaaa gaaggctgcc aaaacccacc aaagccaaga tggcgatgaa agtgacctct 61381 ggtcgtcctc actactcatt atatgctaat tataaggtat tagcatgcta agaggtactc 61441 ccatcagtgc catgacagtt tacaaatgtc atggcaacat caggaagtta ccctacatgg 61501 tctaaaaagg ggaggaaccc tcagttgtgg gaattgccca cccctttccc agaaaactca 61561 tgaataatct accccttgtt tagcatataa tcaagaaata gctataagta caatcaatcg 61621 aggagcccaa gctgctaccc tgcctacaga gtagccattc ttttattcct ttgctttctt 61681 aataaacttg ctttcacttt acggactcgc cctaaattct ttcttgctct tggtccaaga 61741 actttggggt ctggatcagg atccctttca ggtaacatca ccattgaggg gtagcggaga 61801 ggtacgttgt gctaatggcc tgtagtgggt agaggccagg gatgaaactc aacatcccac 61861 aagacacaac acagagcccc gtacccccaa caaaaaatga tcaaccccaa aggaagatac 61921 tgacatttgt caaccccaaa tgtcaatagt tgaaggttgg gaaacgctgc tttaaggctc 61981 atcatgctgc acttgtagct cagtaagaac tgccacttat gggcacccac tgtgtgctgg 62041 gcccagtctc tttctgctaa cctgggggct ccttgagagt ggagaaccgt tctctgttca 62101 tcactgcaag cccggctcct aggttcagag cttgctgcag aacagcctct tgggacatag 62161 ttattgaact gacagcatgg caaaaacatt caagtcaagg gtaatggtgg gtgcttatct 62221 ggcctatttg aaaacacctt tgcaaaaatg ataacagtga gaaaattagg acagtgaaag 62281 ggatctgatc taaccaaccc ccatcttgcc tttaacctcc aagctgccct taatcattcc 62341 caggtttagg ccatgctagc tttgggagac atttagttta tagtttaaat gattataacc 62401 ttttccacca actaaactgc ctttgtaaag ctaataaaag gcccccatgt tagaaggatg 62461 ggaggaggct gaattctgct aaggtgtaga cataaatgac tactgccatt attccagagg 62521 tcacaagatt tgcaacttcc ccaattactc ctgcagataa catcactatt gcagaatcta 62581 agatcagcct tttgagatat ctctttaggt tttgcatttc tgatgaatga tgatccacgt 62641 ggacctgcca cccagaagtg gacttagcac ccctgagaat catttttcac acacctatga 62701 ttgcatcccc aaccaatcag cagcatccat tctgatatag tttggatgtg tgtccccgcc 62761 caaatctcat attgaaatgt aatacccagt gttggaggcg tggcctgatg agaggtgatt 62821 ggatcatgat ggaggtttct catgatggtt tagccccatc ctccttggta ctgtccttgt 62881 gatagtgagt tctcatgaga tttggtcttt taaaagtgtg tggcacctcc cgcctcactc 62941 tcttgctcct gctctgggca cgtgacatgt ctgctccctt gtcttccgcc atgattataa 63001 gtttcctgag gccttccagc agctgagcag acaccagcat catgcttctg gtacagcctg 63061 caggactgtg agccaattaa acttattttc tttataaatt actcagttct ctctttcttt 63121 tttttgagtc ggagtctcgc tctgtcaccc aggctgaagt gcagtggtga gatctcacct 63181 cactgcaacc tccacctcct gggttcaagc aattctccca cctcagcctc ctgagtagct 63241 ggaattacag gcacccgcca ccacccctgg ctaattttca tatttttact agagatgggg 63301 tttcaccatg tttgccaggc tgttctcaaa ctcctgaact caagtgatct gccctctcag 63361 cctcctaaac tgctgggatt acaggcgtga gccaccatgc ctggcctggg actagcattt 63421 tgataagggt aagccccaca gagggcaact aacacagcct gatgggcagg ggaaaatttt 63481 ttggagaaga tgccaaataa gtcaggcatt attaactacc aaatctgggg tgacttcaaa 63541 gttagagttt gtacacactt catctgttcc caagtaggat aataatggtc ataattgcta 63601 atatttactg gactcttgag gccaaatatc aagctaagca cttagtttgc atcatctcat 63661 cgatttttat gatatttaat ttaatttaat tatatatttt tttgagacag tgtctcgctg 63721 tgttgcccag gctggagtgc aatggcacaa tctcagctca ctgcaacctc cacctcccaa 63781 gctcaagtga ttctcctgcc tcagcctgcc ggttacctgg gattacaggt gtgtgccacc 63841 acacctggct aatttttgta tttttagtag agatggggtt ttgccatgtt ggccaggctg 63901 gcctcaaact cctgacctca agtaagccac ccatctcagc ctcctaaagt gctgggatta 63961 caggcattag ccaccgtgcc cagcctcttt aggatattta tactcaatga attagaccct 64021 cttactatta ctagccaatt ttcagaggag gagaaattga gtctcagaga agtacagtga 64081 cttgtccaag gacaagaagg tgaccagaat tcactaggac ttgaacccag gtcttcctga 64141 ctcgaagttg tgtttgacac atgcagtcag ggagcactgc attgcaagaa tgaggctctg 64201 tcttccagac ccaggcgtac agatgggcca ggctgggggc ccccaagcac ctgtttagca 64261 caaggtgaag ttgcaaatgg ttacctggga tggcgaccaa gttctgcagc tcctgcttca 64321 agtccatgat gtccttgcag agctggatgt catccatcct ggggaaacag gacaccatca 64381 acagcaggtt acatcagcag agcattacgc cctgagcatc acccggttaa cacaaagaag 64441 ctattaaagg tgggttatct tttgaagagg ccatacatgc tcattgtaga ggttttccct 64501 caaaaggaaa aagccaaaca ctgctcaatt atcccaccac ctagaaatga tcactgtttt 64561 ttgtttgttt gtttgtttgt gacagagtct cactctgttg cccaggctgg agtgcagtgg 64621 catgatctcg gctcaccgca acctccacct tccgggttca agcgattctc ctgcctcagc 64681 ctcccaagta actgggatta taggtgcctg ccaccatgcc cagctaactt tttttgtatt 64741 tttagtagta tttttagtag agacagggtt tcaccatggt ggccaggctg gtcttaaact 64801 cctggcctca agtgatctgc acacctcagc ctcccaaagt gctgggatta caggcataag 64861 ccgcggtgcc tggccatcag tcactgttaa taacacctcg atagagctcc ttccagtctt 64921 catgtgcctg acttaggttt gcttatataa gtgggttcat atggcatatt cagtttcgtt 64981 tttcagctaa agtttacgtt ttttaaacac tgcagaaatt cacaacatag aggaaagtct 65041 ccctctgtcc ttcggagaga atcattgtca acggtttagt atgtattcct ctggattttt 65101 taaaaacatt tttattttga aaatgtttaa ttttttattt tttatttttg gagatagagt 65161 gtcgctctgt tgcccaggat ggagtgcagt ggcgtgatca tagcttactg cagacttgaa 65221 ctcctgggct caagggatcc tcccacctct gcacaacccc tcccaacccc agccccctac 65281 ggagtagctg ggactacagg ctaattttta aacagttttt tgtagagaga gggacctcgt 65341 tatgttgccc ggctggtctt ggattcctgg cttcaagcaa tcctcccacc tcagcctctc 65401 aaagtgctgg gatataggta tgagccactg cacccagcct cctccagatt taaaaaaaaa 65461 gcgtatgctt ataatatcat atatgacacc attgttattg cacaaaaatt ttttgcaatt 65521 tgcttttttc aatttaataa tctttaatag tcttgatcat ctttccatat cagtatatgc 65581 aaatctccac atatttagtg gctgcatagt ataccattat attgatatac agtcatctat 65641 ttttattttt taaatttatt tttattttta ttttttttag acagtctcac tgtatcaccc 65701 aggctggagt gcagtggtgt gatcttggct caccgcaacc tctgcctccc aggttcaagt 65761 gattctctgg cttcagcctc cagagtagct agtagctggg attacaggcg tgagccactg 65821 cactcagctg attatacagt gatctattta actataattt atttaactat tcccctattc 65881 atagacatct gggttgtttc ctaatacatt tctaggatag attcatagaa atgcaattgc 65941 tgggtctagt ggcatgaaca cttacattgt agtaggaatt tccaaattgc cctccggaga 66001 gatcatgtgt gacattacca acagtgtggc catgggggac agtgggtgct ggctcctgtc 66061 atctcagtaa ttgggttgct tttgcaacct agttttgcat tacacattat aattttgcat 66121 tacacattat ggttttgtgt gtgtgctaca aaaattaact catcttgctg taaacaatat 66181 ttgtaatggc tgtctgcaat ttcacaaatg agattcatcg ggattgtgct aattacaccc 66241 tcatcttgat tactaaggct ggcttttcag gtttgagcaa ttttggccac tgcaggatga 66301 acatcagttc ctctaaggct tcttctccat gttggattat ttggggttta gattctcaga 66361 agtggaatca ctgtgtacaa agggcttgat atttccatag ctgttgagat gtatcttcaa 66421 ataacttccc cgtgatacac aggttcacat tcactctgcg tgtctttgtc tttgttgttc 66481 ttgttttgtt ttttgagaca gggtcctgct gtatcaccca aggcacagtc atagctcatt 66541 gcagccctga actcctgggc tcaagtgatc ttcccacctc agcctctgaa gtagctggca 66601 ctacagctgt gcagccatca tgcccagctc atactttttt taatttttaa aattttcata 66661 gagatggggt cttgctgttt accaggttgg tcttgaactc ctggcctcaa gtgatcctcc 66721 cgccttggcc ttccaaagtg ctgggattac aggcgtgagc cactgtacct ggcctgcacg 66781 tctttattga acatctacta tgttgggcat tgttgtcatc tctgaggctt gcaacagcaa 66841 agaagaccaa gccctcagga agatgacatt ttgctagcag agacagaagg taaacccaaa 66901 tacaatattt cagatgttac aaagaagact gaagcaggta agggatgtag tgggtcagct 66961 tggagccagt atcttacttt gctcctgaca actatcctag gaaaaaccag aggacagaaa 67021 tcatattttc attttactca tgcggaaaca ggtaaagagg ctcaactgag tacacacagc 67081 tcatgggggc aggaccagaa ctagaaatcg acccttcgga gtctgagtcc actgatcttt 67141 ctgcctcgtt cctcagcgtc tccatcaagg gccggggctg ctgactcctt tcagcagtgc 67201 catctgtata ccagcatggg gccctatctt agtcccctga gtgttgctat caaggaatgc 67261 ctgtggctga gtgctttttt ttttttttgc ataaaggaat ataaaactat ttattaacca 67321 ctgttcacca gtatttacga taaagtaaac aatatacagt tggataacat tctgattact 67381 acaaagttgt tcttcctggc ttttgctgaa ccagtaaagc aaactgaaga ttgaggctac 67441 atgtaaggaa tgagctgggg taaagaaaaa acatgcaggt cagtaggtta gattacaaaa 67501 ggttgttcac acatttatgg cagcaggtcc taaactgcca gcatctctaa ccatctgatt 67561 aggtttctat gagccaagtc ttacatattc cattcaacat gatcttttag tcaatgtagc 67621 aacagggatt tcaacatttt gttaaggaat ggcccactag ggaaattttt aaatattcat 67681 ttaacttagt tttgtttagc tagttaaaac acactagcat ttgtcttgtt ttctcatctg 67741 gatgtggaaa cctgctgtga tggcagtgat aaaatttttc ctttcaggaa ttttgcaaat 67801 aaaccaatta tagacgcttt aaaattatcc aatttaaatt gtcctattta gaattactta 67861 tttcacttga aatgtatggc ttcaggaaaa ttttcaattt accttgaagt gattatctct 67921 tatttagctc ggaataatgg catctcagaa atatgggttt acctgtgatt ttttgtttgg 67981 gtgaatgctt aaaaacaaaa aaaaatttat gtatgcattt tatagataca cacacacaaa 68041 aaaacatgta aaaaatctag aatggtcctt aggcttatgg gaacacaagt tttgattgag 68101 taatgactat ggacatttcc cccaacattt agaaaagctg ttctttaatg aagaggaaat 68161 aatatcttta taaagacaag aggtttattt ggctcatgat tctgctggct ggaagatggg 68221 acatttggca tgggcctcag gctggctaca ttcatagaag gtgaaaggga gctcatgtgt 68281 gcagaagtca cagggcaaga gtggaagcaa gagagagggg gaggtgccag gcttttttta 68341 acaaccagct gtccaggaac taaaagagtg agaactcact cacccccacc tcccaggaag 68401 agcttaatct attcatgggg gatccacccc cgtgacccaa acacctccca ctggggccta 68461 tgatgtagtt tggatctgtg tccctgccca catctcatgt caaatcataa tccccagtgt 68521 tggaggaggg gcctgggggg aggtgattgg atcatggggg tggacgtctc ccttgctgtt 68581 ctcatgatag tgagttctca cgagatctgg ttgtttaagt gtgtgtagca cctccctctt 68641 tgctctcttg ttccttctcc agccacgtaa gacttcccct tcgccttcca tcatgattgt 68701 aagtttcctg aggcctcccc agccatgctt cctgtacagc cagtggaact gcaagtcaat 68761 taaacttctt ttctgtataa attacccaat ctgaggtagc ctgtttaaaa attttttttt 68821 ctggctgggc acggtggctc atacctgtaa tcccagcact ttgggaggcc gaggtgggca 68881 gatcacgagg tcaggagatc gagaccatcc tgtctaacat ggtgaaaccc cgtctctact 68941 aaaaatacga aaaattagcc gggcgtggtg gtaggcacct gtagtcccag ctactcggga 69001 ggctgaggca ggagaatgtg tgaacccagg aggcggaggt tgcagcgggc caagattgcg 69061 ccactgcact gggcaacaga gcgagtctcc gtcccaaaaa aaaatttttt tgtttcttta 69121 taatagagat agagttacac catgttgccc aagctggttg ccaactcctt ggctcaagca 69181 atccacccac ttcagcctcc caaagtgctg ggattacagg cgtgagccac tgcgcccggc 69241 ctcaggtagt tctttataga aatgcgagaa cagactaatt tcaacatgag atttggagga 69301 gacagacata caaaccacag caggacccct tgccttttgc cactcaagaa caactcatgc 69361 tgtccattct aatgccagca tcccagtttc aggacaagag gcaggaatgc cttgcttcag 69421 cctgcatctc ttgttgtcag aggcacgcca agcatttttc aagtcgtgcc ctggaatgtc 69481 ctagcaggtg acagctgcct cgaatgagcg tggacttgcc gggagggcac agactgttcc 69541 cgtgagtttc tatcagcgat cgttcaactg gagagactcc agacgctttt gcagagatca 69601 ggctacatca gggcaatctt ggtccccagg aacatttggc aatgcctgga gacatatttg 69661 gttattataa ttgagtgtgt gttggggagg ggtgtggttg ctactggcat ctagtgggta 69721 gagaccaggg acactgctaa acatcctact gtgcacagaa cagctcccac aacagaaagt 69781 gagccagcca ccatgtcaat cacactgaag ttgaggaatc ttggattcta accacagaat 69841 tatcaaatta ttgataattt ttgagaggta agagaggaga agatggagaa aggacaaaag 69901 gagagacagg tagtgattag gaggcagagt tttggcacca aacaactctg gctcaaaggg 69961 ccatggtaca gtgacttcct tttctgtctc acctaactca tttttaatta tttatttatg 70021 tttttttttt tttttttgaa aaagagtttt gctctcgttg cctaggctgg agtgcaatgg 70081 cgcgatcttg gctcattgca acctttgcct cccgggttca agtgattctc ctacctaagc 70141 cttccaagta gctgagatta caggcatgca ccaccaggcc cagctaattt tttttttttt 70201 ggacgagatt tcaccatgtt gatcaggctg gtctcgaaca cctgatctca ggtgatccac 70261 ctgcctcagc ttcccaaagt gctgggatga taggcgtgag ccaccgcacc tagccgtcat 70321 gttttaaata ggaataatgg cagtactatt tgctgggatt attatgcaca tcaagcatta 70381 agcacagtgc ctgggaccta cacagtcaaa tgcccagtaa atgctatcta ctattattat 70441 gttttttaga gacagggtct agctctatca cccaggctgg agttcagcag tgcaatcata 70501 gctcactgca gcttcgaacg ccaggggtaa aaggatttgc ctgcctcagc cttccaagta 70561 gctaggactt taggcatatg cccctacact cagctaattt ttaaattttt tgtagagaca 70621 gaatcttgcc atgttggcca agatggtcta gaactcctag cctcaagcga tcctcccatg 70681 tcagcctccc aaagcactgg gattacaggc atgagccacc acccttggcc tttactatta 70741 ttatcattag ctgtgagaaa gaagcagaga atggctgggt ggggtggctc acacctgtaa 70801 tcccagcact ttgggaggcc gaggtggaag gatcacttga agtcaggagt tcaagaccag 70861 cctggccaac gtggtgagac cctttctcta ctaaaaatat aaaaattagc caggtgtggt 70921 ggcacatgcc tgtaatcccc agttctcagg aggctgaagc aggagaatcg cttgaaccca 70981 ggaggcagag gttgcagtga gtcaagatca tgccactgca ctccagcctg gccaacagag 71041 tgagactcct tctcccaaaa aaaaaaaaaa aaaaaaaaaa aaaatggccg gcgtggtggc 71101 tcacacctgt aatcctagca gtttggaagg ctgaggtggg caggtcacaa ggtcaagaga 71161 tcaagaccat cctggccaac ttggtgaaat cctgtctcta ctaaaaatac aaaaattagc 71221 cgggcatggt tggtagtggg cgcctgtaat cccagctact caggaagctg aggcaggaga 71281 atacttgaac ccaggaggca gagattgcgg tgagctgaga tcgtgccact gcactccagc 71341 ctggcaacag agcgagactc cgtctcaaaa aaaaaaaaaa aaaaaaaagc agcagagaaa 71401 aaaggaaaga gaaaatgatg tggaagaagc actatttctg aggaatggag ctaacaagta 71461 aaaccttttt aaaaataact ttcagtgctt atactcttct ttgtaaaagt ttttattttt 71521 atttatttat tttattttat tttattttga gatggagtct ggctctgtca cccaggctgg 71581 aatgcagtgg cacaatctca gctcactgca atctctgcct cctgggctca agccatcctc 71641 cctcctcagc ctccagagta gctgggacta cagggaggcg ccaccacacc tggagaattt 71701 tttgtatttt tggtagagaa ggggttttgc catgttgccc aggctggtct caaactcctg 71761 gactcaagca atccacccac ctcagcctcc caaagtgctg ggattacagg catgagccac 71821 tatgcctggc caataaaagt ttgtaaatgg caattttatc tgattataga atggatccct 71881 aatcattgta gaaattttgg aaaatacaaa caagtacaaa gaaggaagat gaaagtatgc 71941 agaaccttga cctacgtgta gcagaggcca cttctactcc gagggttctg gtgtatttca 72001 cagcaatctt tttcctccat gtaacctaag ggtggcgggc atgccctggg ggcctttgcc 72061 ctggcccttt gtccacctcc aaggctgccc ttagcatgtc attgaatcct cccaaaagat 72121 ccaagcccac aacccattga caagcgaggt ggagctctac agaactgctg acgggattca 72181 cgctgcagga ctccatcaac gtccctatga gatggctctg ttttcctcat gtctcatctc 72241 ttctctctct ctctctacct cctccacaca ccttccgcta aaggaggacc atcaggtcct 72301 aaatcaagct tgtccaaatt aggcccagga tggcttttga aggcagccca acacaaattt 72361 gtaaaccttt ttaaaacatg agattttttt tgtgtgtgat ttttttttta agctcatcag 72421 ctattgttac tataagtgta ttttatgtgt ggtccaagac aattcttcca atgtggccca 72481 gggaagacaa aagattggac acccctgtcc tgaattactc agctctgttc ttcaacttac 72541 aaaccccagt tggcaaaacc aaaagcttca tggtcctttg aacatccttc caagaaagaa 72601 gcttgtcccc cgaccccaag ctgtccatca gggttaggtg tccattcacg taactaaaaa 72661 tcaagtcttt ttctggaata tggaaggatg ggttctgcca ttaaggcaaa caagaagtca 72721 gcatttacgg agcatttaca tgtaaatgca ctttctgtgc ataaactctt ctagaccgtc 72781 acagcagtcc aggaagagaa acattgtttt tagctctatt ttacagatga gtacatggag 72841 gttaagcaac acacccaaag ctctgcactg gtaagtggcc ttccagattc agccctagct 72901 cctgctcttg acaatttcac cgtgttgtct cctctccatg tggctccatg agttgcagcg 72961 ggcatgactg gttgtagacc ttcccctccc gctttctccc agagctctga tttgtccagg 73021 aggcaacctg ccctgccttg atggaagcca gtcacagaat cccatttgct ttggctagac 73081 acatactttt ccaggctctc ttgcaagtgg aggtagctat gacatctact ctgggccaat 73141 gaaatgtaag agaacaactt taagcaacca ttaccatagt cacgtatgtg tcatagtgcc 73201 atataggtga catatgacat atatgtaaca taatgacata tagtgacata tataaggaag 73261 ccagtggagg gtttagttaa atatcatttg gctgcttgac ttttagctat ataagtttta 73321 agatcagcta gaagtaagtg gttaaaagca agcaaggcaa gttaagtttg gtgggcttag 73381 agaaagccct taagagaaac tccgcttggg caggcttctg ggaaggataa ttagggcctt 73441 cggttctccc ctgtcctggt cctaagctcc tacagccact cacgtgagaa tggcgaaacc 73501 gaacgctaac ctgatgaagc ccaccaaact taacttgcct tgcttgcttt taaccactta 73561 cttctagctg gtcttaaaac ttatatagct aaaagtcaag cagccaaatg atatatacct 73621 aaccccccca ccggcttcct tacatatgtc actatatgtc attatggtac atatatgtca 73681 tatgtcacat atatgtcgct acgacacata tgtgactacg gtaatggttg cttaaagttg 73741 tttttcagga ctatgggggc agctcctgtc cagttcgaac ccgttgagat caatgaccct 73801 tcaactgaac ctgtgcaaat gccggagaag tgaccttttg acgtcagagg gcccaaaact 73861 ccccgagatc atgctaatgc tgccattttc taaacatgca ccctatgaag atccaggaag 73921 cttggctatg tatgtgcaga ttgccaatga cctcactttt ccttgcctcc aatcacctct 73981 tcccacactt tagaccaccc tgctccttta ttccataaag atccctaaat tccatcttca 74041 gggaggcaag tatgagacct tttctcctgc ctccttgctt ggctgcgttg gtgtgaataa 74101 attcttttct tttgccaaat ccatcataac agcgattggc ttactaccca tgggcagaag 74161 gggacaggtt cgctatctag ggcagagggg aggaaaggaa acagccagac ccttgatagc 74221 atcctggggc caccccactg accctcgggc cacctgcatc tggactgctt atcaagtcaa 74281 aaaaaatttt tttttttgag acggagtcta gctctgttgc ccaggctgga gtgcagtggc 74341 atgatctcgg ctcactgcaa cctccgcctc acaggttgaa gccattctct gcctcagctt 74401 cccgagtagc tgggattaca ggcatccacc accaagcctg gctaattttt gtatttttag 74461 gagagatgga gtttcaccat cttggccagg ctggtcttga actccttacc tcgtgatcca 74521 cctgcctcag cctccgaaag tgttgggatt acaggtgtga gccactgtgt ccagcctcaa 74581 gtaaatatta aatgtcctta tgacttaagt agtgtgaaca actgacttgg tttgcctggg 74641 gctgtcccag tttctgcact gcagtttcca gatccttgga aacctctcag tcccaggcaa 74701 acagagacag ctggtcaccc aagatttaag acactagtcg tcggacgctg tgttacttgc 74761 agctgaagac attttcacag atacatatgt gtggataggt gcatttttaa tgtgtattgg 74821 atcaagacat acagctgtcc tgctcattgc aagcatgaca tcatgatgat tttccaatgc 74881 cattaaatca tattgttcaa aagacatgtg aatagttcat gcaaagtgct ttaaaagcat 74941 agctggaagt tcgaataagc catgctttag aagcatggct ggaactttga ataagccaca 75001 ttagtccagg tctctgtacc attagggctg aacctggaaa tcaaactctg ctgctgccgg 75061 ctcgttccag agagagatgt tatcttcaag aacctgatta atgttcggaa ggaaaagaaa 75121 gccaacccgt gtcattcagc atcttggtct ttcacgctgg gacgataggg gaaagcgggt 75181 catttttatc agcgttgggt gcggagaggt gatcaggccc tgctggaata atagagctgc 75241 tgctttacaa aatgtttgtg aaaacgacat gaaatggagg catcagcagc ttgagttgca 75301 gcctaatgga aacccagcag agaccacgac aaaggcagag gtaggttttc agaggcaagg 75361 attgactttt aaaaatagtc tttattgtct ccttttaaaa ataagttaca gaggctgcag 75421 gtatcttaag acaggtgaga aaaagatgtc atgtctgggc cgggcgcggt ggctcacgcc 75481 tgtaatccca gcactttggg aggccgagga gggcggatca cgaggtcaag aaatcgagac 75541 cagcctggcc aacatggtga aaccctgtct ctactaaaaa tacaaaaatt agctgggcgt 75601 ggtggcacgt gcttgtagtc ctagctactc aggaggctga ggcaggagaa tcacttgaac 75661 ctgggaggtg gaggttgcag tgagccgaga ttgcaccact gcactccagc ctggcaacag 75721 agcgagactc cgtctcaata aataaataaa taaataaaaa taaataaata agatgagggg 75781 tttagatgag ccctcgagcc ccattctgca agcacagaga ggatacaacc cgctttagac 75841 acagccagat ctcaatgcag atctcagtac ccccattgac tggctatgac ctactttgct 75901 gctctcagcc tcagtgctgt cactgttgaa aagggattac acctttgtct tggggttgta 75961 tgagaatcaa acaggcttgt gtgtgtggtg caggcagccc acggttttgc acaggcagtg 76021 tgggtgtgta ggaacagcca cttccatcca ctcctccacc ctaggtaaaa agtcagctca 76081 cagagactct cctggactat ctcagccaac atttctgtgc aaaatgtgta tctcactatc 76141 acatgctatt tttttttttt ttgagatgga gtttcgctct tgtcgtccag gctgtagtgc 76201 aatggcggga tcttggctca ctgcaacctc acctcccagg ttcaggcgat tctcctgcct 76261 cagcctccca agtagctgag ataacaggca cttgccacca tgcccagcaa atttttgtat 76321 tttggtagag aggggatttc actatgttgg ccaggctggt ctcgaactcc tgacctcagg 76381 tgatccaccc gcctcagctt cccaaagtgc tgggattaca ggcatgagcc accgcgcctg 76441 gccagcatgt gctattacaa tttcctcaaa tcaaatggtt ccaagatgat aaacatgtgc 76501 attggttcag ctttcgtagg attcatgagc ttaaacgcta tccctgggta aggataagtg 76561 ttgataagtg tccagggagc ccctggcctc agcaatttct cagcatcttg ctggggacca 76621 agggaccgga gaggtgcctg ccagtgtaca taggaaggga atgccctcat gttgcaccct 76681 ggaaggccaa tatgtctgtt ttctctctgt ggccatttta atggactgtg gatttacatc 76741 cacctgtcac actggccttt cctttgattg cagtcagaca ctcgttccca aacaaatgtc 76801 accttcttca agacccttgt tcccactctg ttcccaaatg tcaacttcct caagcgcagt 76861 ttattcttct ctcttccaag ggctttctag actgccttag ttttcatttt tcttacaata 76921 acaactacta atactgctct tttctgtatg tatgatataa aaaaattatg gccaggcaca 76981 gtggctcatg cctgtaatcc cagtactttg aggctgagat gggaggatcg atagagtcca 77041 ggagttcgag attagcctag gcaacatagc gagactctgt ctctacacaa taaaaaaatt 77101 agctatgcct ggtggtgtgt acctgtggtt ccagctacca gagaggctga ggtaggagga 77161 tcacttcagc ccaggaggtt gagctcctgg gtgagccatg atcacaccac tgtactccaa 77221 cctgggaggc aaggattgac ttttgaaaat agtcaatttt cttgacgtgg gtttccagct 77281 gtctgccttg acagagtgag accctgtctc aaaacaaaca aacaccagta ttgcttagtt 77341 acctttgttt tagacctaaa agaatacttg ctcctggccg ggcacggtgg ctcaagtctg 77401 taatcccagc actttgggag gccgaggcgg gcggatcaca aggtcaggag atcgagacca 77461 tcctggctaa cacggtgaaa ccctgtctct actgaaaata caaaaaatca gccgggcgtg 77521 gtggcgggca cctgtagtcc cagctacttg ggaggctgag gcaggagaat ggcgtgaacc 77581 tgggaggtgg agcttgcagt gcgctgagat catgccactg cactgcagca tgggcgacag 77641 agcgagacac catctaaaaa aaaaaagaaa aaaaaaagaa tacttggtcc tttttttttt 77701 cttttttaga ggcagggtct tactatattg cccaggctgg acttgaactc ctgggcttaa 77761 gggatcctcc tgcttcagcc tcccaagtag ctgggaacac aggcatatac caccacaccc 77821 agcttggtct atttaaaaaa ttatacaaat attacataaa tatggtctca ctataaaaac 77881 atcaaacaat acagaagtat aattgctagt acttattgaa tgctttcata tgattggtgt 77941 caggctctgg attaagcatt ttacacaaat catctcagtg aactcttgca gcaactctat 78001 gaggtaagga ctattattat tatctcttag acaggataaa gaacttgttc aacgtcacag 78061 agtaagtggt aggaacttga actcgggcag tctgatctga gcccatgctc tcgactggga 78121 ctgtgaaagc tccagccagt gactggagtt caaggaagac agagatcagg atagtctggg 78181 agaaggctgc tcaaatcgag atgcaactta caggggaggg cacaacttgg aggggtcagg 78241 aaaagattcc aggcactcag agagaagggg gagctgagac aaatattagg aatgggtgtg 78301 gtatgccagg gtaggtgaca atgaagagtc tgcccaatta ggacatatca gggacatatg 78361 tgtctgcgtt gtaaagaacc ttaacatggg agccatctta ggttctagag tgagagcatg 78421 gcatggagat gtgtctatga aacccagttt ggaagacatg gtgatcagta tttccacacc 78481 aaggagagag tgcactctcc ctttgacctt ctgtctaaga cgatccaagg aattcctgga 78541 agatgcgttt gctcctcaat ctttagctgg ggttggcaaa ctatagccca cagtcaaaat 78601 ctggcccact gcctgttttg gtacaactca cgaactaaga atagtttaat tttcattttt 78661 aaatacttgg ggaaaatcaa aatcataaca ttttctgata aatgaaaatt atatgaaatt 78721 caaagttcag catctgtgaa taacatgcta ttggaacgaa gtcatgtcca ttcatttcag 78781 cattgtctgt ggctgttttt cccgacagtg gcagaggtga gtaatcacga cagagactgt 78841 acactctgta gagcctaagg tgtttagtac ttggcccttt acagaaatat tttgctgaac 78901 ttacaggagt gtaatcatag ctcactgcag ccttgacctc ctggctagag caatcctcct 78961 gcctcagcct tccaagtagt tgggactaca ggcgcatgcc accacgcctt ggtaatttta 79021 aaaaaacgtt ttgggccagg cacggtggct catgcctgta gtcacagcac ttcgggaggc 79081 cgaggcaggt ggatcaattc aggtcacgag ttcaagacca gcctagccaa catggcgaaa 79141 ctccatctct actaaaaata caaaaactag ccgggcatgg tggtgcatgc ctgtgatccc 79201 agttactcgg gaagccgagg caggagaatt gcttgaacct gggagatgga ggtttcagtg 79261 agctgagatc gtaccactga actccagcct gggcaacaga gtgagactcc atctcaaaaa 79321 aaatgtctgg tagagatggg ggtctcacta tgttgctcag gctggtctca aactcctggg 79381 ctgaagccat tctcctgcct tggcctccca aagtactgag ataacagggg tgagccactg 79441 tgcctggctc taaagaaatt tctgcagcta gaagagctct caccttacat gaatgaagaa 79501 atcaatgccc agagaggtga atgctgttca ccaattcact gactcatcca gaaaacagtt 79561 cttgagcgcc tactacgtgt cagccctcct gatctatttt cactttcaag agacattgag 79621 acataggcca gctaccaagg ccatgtggcg agacagcagg gctcttccca ccccactgct 79681 tcgactgaga ccagagcttg ttttcttctc tgggctctgg cagggaactg ggccgtgccc 79741 agtgcatgca aggttggcca acagatttcc tgcacctgct cactgatgct acctggcagt 79801 ttctgggacc gcacttagcc ctctgcacat ccttcactcc tttgaaggag acagtaagtg 79861 ttgtgagttc atttcttctt cttcttcttc tttttttttt tgagatggag tctcgttctg 79921 tcgcccaggc cggagtgcag tggtgcaatc tcagctcact gcaacctctg cctcccaggt 79981 tcaagcaatc ctcgtgcctc agcctcctga gtagctggga ttacaggggc acaccaccat 80041 accctgctaa tttttgtatt ttcagtagag acggggtttc accatgttga cgaggctggt 80101 cttgaactcc tgacctcaag tgatccaccc acctcggcct cccaaagtgc tgggattaca 80161 ggtgtgagcc accgcgtcca cttgttattt ttgatttttt ttcccccaga ctcttaccag 80221 tgcctagaat gagacacata gtccttccct tggagggccc agcgagctgt gaagtggata 80281 gaagactggc ccttggtgtc cctaacccta cccctaaggc ctttggctaa gctggcccag 80341 aataattaaa gatgtgaaca gagattaagc tccaggtggg cacagcagct cctgcctgtt 80401 atcccagaac taagggaggc cgaggcagga ggatcgcttg agtgcaggag ttcaagagag 80461 ctgctggttt taaatgtggc acttcctaca ctcattttct cttgtgctgt ctctctcctg 80521 ccgccacgta agacgtgctt gctttccctt cgccgtctgc catgactgca agtttcctga 80581 ggcctcccca gccatgcaga actgtgagtc aattaaactt ctttccttta taaatgaccc 80641 agtctcgggt agtatcttta cagcagtgtc agaacagact ggtacacagc ccatgtgcat 80701 ccctgtgtaa agcctggccc caaggagttc tcttgaactc ctgcactcaa gcctttgccc 80761 aactcctccc tcagctcaga aatctttgtc tcaaactaag tgctctgcag cttcactaat 80821 cccttctcct tctggaatct tcctgatgcc ctcagctaca tcaggtcctc ttccttcctg 80881 cagccatagc ccccttgtta tcccattgac atctaggcat tccagaacca tctaggcatt 80941 tgtgtttctg cctcaacaga ctgcacctcc tttaagggac tttgcctttt tccttcatgg 81001 ttctagcaca ccaccccgtc cctgatacat tgcagtaaat acacaaggaa ggaaggaagg 81061 ctggcaggca tggagggtac acacagtact gcattttgtt gaaacaaggc ctaaagactg 81121 caggtgcagg tggaaaaatt tcaagtctca tgatagacac cacgagggca atcgcatcta 81181 tcaaaggcgg tgagtcagcc ctggagaaat ccaattaaac atccgtcatt cgagtgatac 81241 tgctttgcat tctgctatca ataccgtttc tattctgagc tggctcagca tctgggctct 81301 cttgtgcctt ccactctcag tcacctgctg cgagttcctg ccactgtgtg gcgtcctctc 81361 ccccaaatcc accctctcca ctcccaagtc catccctcct ctcaagctga tgccaacaaa 81421 agctgatttg tatccagccc aatctcattg aatcctcatc atctgtaggg tggtattctt 81481 attctccact ttacagaact ctcagagagg ttaagtaaat cgcccgagat cagacagctt 81541 gcaagatttg aaagaagaca gtctaactcc agagccctga caattgccac tacaacagac 81601 tgcctacgca tgtcacccag atataccagg ctgtcatctt ctgttcccat acccccttac 81661 tttatcccgt tctttcagtc aggagccttc taaaatgctt tcaggccggg cgcggtggct 81721 catgcctgta atcccagcac tttgggaggc tgaggcaggc agatcacttg agctcaggag 81781 tttgagatca gcctgggcaa tacggtgaaa ccccgtctct acttaaaaaa aaaaaaaaaa 81841 attagccagg catggtggtg ggcacctgta gtcccagcaa cttgcggggc tgaggtggga 81901 ggatcgcttg aacctgggag gcagaggttg cagtgagctg agattgtgcc actgcactct 81961 ggcctgggtg acagagtgag actttcacca gtggcagttc caaggagtgg gctttcttag 82021 ggatggcagg gaatggtttc agtcccacct cgtaatttcc attagccttc aaaccacctc 82081 aacatttaaa aagtgctttt gggccgggcg cggtggctca cacctgtaat cccagcactt 82141 cgggaggccg aggcggggtg gatcacgagg tcaggagata gagaccatcc tggctaacac 82201 agtgaaaccc tgtctctact aaaaaacaca aaaaatggtg gtaggtgctt gtagtcccag 82261 ctactcggga agctgaggca ggagcatggc gtgaacccgg gaggtggagc ttgcagtgag 82321 ccgagatggc accactgcac tccagcctgg gcgacagagc aagactgcat ctcaaaaaaa 82381 aaaacgtgct tttgaaatgg gaaatatcag gtaggataga agcaataaga gaaaatgaaa 82441 taaagaaatg ggaaatagac atttttccca attttgtgga aagtgtaaga aaaaaacttg 82501 atagaaacta tgcaatttta attaatttgt acgacacaaa tctcactaaa aaccatggac 82561 tccctactgc gaaggcagct ctaaggtaag tcccctgatt gtttcatgtg accaactggc 82621 tggccctaag cccctcctgt ttaggaattt tttttttttt ttttagacaa tgtctcactc 82681 tgtcgccagg atggagtgca gtggcgtgat ctcagctcac tgcaacctcc gcctcctggg 82741 ttcaaggaat tctcctgcct cagcctcctg agtagctggg actacaggcg cccgccacca 82801 cgcccagcta atatttgtat ttttagtaga gacagggttt caccacgttg gccaggatgg 82861 actccatctc ttgacctagt gatccgcctg cctctgcctc ccaaagtgct gggattacag 82921 gcgtgagcca ccgcacccgg ccctgtttaa gaattttaaa gcaatcactc catttttttt 82981 tttttttttt ttgagataga atcttgcttg ctctgttgcc caggctggaa tgcagtggtg 83041 tgatcttgat ctgggcttac tgcaacctcc acctcctggg ttcaagcaat tcctctggct 83101 cagcctccca agtagctggg attacaggca ccagccacca tgctcagcta atttttgtat 83161 ttttagtaga gttggggttt taccagtttg gccaggctgg tctcaaactc ctgacctcag 83221 gtgatctgcc cacctcggcc tccctaagtg ctgggattac aggcatgaac caccacatcc 83281 ggccgcagtc attcctttca gctgagcttc taaggactga aggaaaaaac tcttaagaac 83341 acaatctgca ggccccgggg aagcctcccc tgccctcttc ccacccattc ctgcttagcc 83401 catggactgg gccactagga ccactgtctc agcttagccc tgctcctgcc aaagcacaga 83461 gcacacttct ctgaagccat cacctatcta ccatcaccac tggcaattcc catcagagtc 83521 attgcaccag cagaagcaga taagcttttg ataataagta aattaatgtt taagtcaact 83581 tcatgtttat tcagcatgct agtgattagt ggtctccttt gtaggcaagc gagaccctgg 83641 ggcttgataa accacaaaga aaacgcaagt aatctctgtt ggatgcccca aagtagggca 83701 ggctaggact agaatgagtg atgtgctggg gaatgtttaa ctggctggaa gcaggggagg 83761 gaggtagaag ccttgctgtg aatcatttgc caatttccac ggtgtaaata ctcccactgc 83821 tgcagatttt gagctaccca cagttgaaca actggttctc cagattccta aaaatttaac 83881 atcactggct agaatcaaca agtgggcaga tttaggggtg cttcttcaac actctaaagt 83941 attcattcaa cgtttactta gaagagaagc tgagtattat ttggggaaaa ggttacctgt 84001 ggcgatgatg atgacaatga ttttggctac catttactcg gaattcacca cgcacagggc 84061 gggtgcagtg gcttacacct gtaatcccag cactttgaaa ggccaagacc agcagatcac 84121 ttgagatcag gggttcaaga ccagcctggc caacctggtg aaatcccgtc tctactaaaa 84181 atacaaaaat tagccaggca tggtggtgca cagctgtaat cccagctact cgggaggctg 84241 aggcaggaga atcacttgaa cttgggaggc agaagttgca gtgaaccaag atcatgccac 84301 tgcgctccag cccgggcgac aaacagaatt taccatgcac cttttacacc ctaagtgctt 84361 tatacactaa gaagtgcttc ataggcacta aatgctttac acagatcatc tcaatcctga 84421 taaccactgt ttgaggtggg cactacgtgg ttgtctcttt tatacaggaa gatcccacag 84481 ctcagagagg ttaagtggct ccatcggaat cacacagcta gtcaatggtg gagctgggac 84541 actgacctgt ggatcctgat tcggacaccc acattatctc aatgaagaac atctccttca 84601 ttgtgtggca ggtcaggtct cactaacgca ggcctccatg acagctattt cagcagggac 84661 tgtgtggtta agttaaacat taaaagctga aagtggctgg gcaggcagtg actcatgcct 84721 gtaatcccag cactttggga gcccaaggcg gatgcatcac cttgttcagg agttcaagac 84781 cagcctggcc aacatggtga aaccccgtct cacctaaaaa tacaaaaaaa attagctggg 84841 cgtggtggca ggcacctgta atcctagcta ctttgggagg ctgcagcagg agaatcgctt 84901 gaacctggga ggcggaggtt gcagtgagcc gagatcctgc cattgcactc cagcctgggc 84961 gacaagagca aaactctgtc actaaaaaaa aaaaaaagaa gaagaaaaaa acaaagctga 85021 aggagcctac tgggcatggt ggctcatgcc tgtacaatcc cagcacttta gaaggcagag 85081 gcgggtgaat cacctgatgt caggagttta agaccagcct ggccaacata gtgaaatcat 85141 gtctctacaa aaaatacaaa aattagctgg gcatggtggc gcatgcctgt aatcccagct 85201 actcggaagc tgaggcagga gaatcgcttg aacctaggag gcagaggttg tagtgagcag 85261 agatcgcacc actgcactcc agcctgggcg acagagcgag actctgtctc aaaaaaaaaa 85321 aaaaaaagct aaaagagcca gtgcccttat acaaaggctg gaatgtaaca gaaatcctcc 85381 aacagttttg ctcaggcctt tcctgggcct tgaaacatga cgagataacg aggcaattct 85441 taacaggaca cgtttaggat taaacaagtt ttattggggg tgtgaagaaa ctccccaggc 85501 ctccacaaac aagtttattg ggagtgtgaa ggaagtcccc aaacctccat gatttagcag 85561 gagacaagat aagggtaatc accccagcaa ctggacccat ttagattaag taaatttact 85621 gaggcttcag aggaaggtct tcaggactca gaccttagtt agactagaag aagctgatta 85681 tttaggtctt taggtgaacg cacacttaca cgtggacata tagtttagaa ggtatgtaag 85741 ctctgggaaa ctttgtaatt ttgagttggt ctggtggtat tttccaggct ttttccctgt 85801 aactggtaac agaaatagaa actccctcct ttcccagttt atttgcatct cgttattggg 85861 tcactagaat aagcagccca accctcagtt tgatccagga acaattgttc caagcaattt 85921 aattctgata ccagttcagc ataatgctct tggaaaaaaa tcaaaatcct gcagatagca 85981 ggactcactt tgcttagaac gacggggttt atgtgtatgt gtttttgttt gttgtttgtt 86041 tgtttgtgac agggaaacag ggtcttgctc tgtcatgtag gctggagtgc agttgcttga 86101 tcatggctca ccacagcctc aatctcccag gctcaagaca tcctcctgcc tcagcctcct 86161 gagtggctgg aaatacaggc gtgccctagc atgcttggct aatttttata ttttttatag 86221 agagggggtt ccaccatgtt gcccagtctg gtctcaaact tctggcctca aagcaatcct 86281 cccatctcgg cttcccaaat tgttgggatt acaggcgtga gcccctgtgc ctggccccca 86341 ccctcttctt tgacctctgt caaagaagac cttctttggc caccattcct ggccatggaa 86401 ggctgtttac agactaccta ttgttactta cacatggtct tgggtggggt tgtgtctccc 86461 tcctgtaatg gaaaaaagtt ggaatttttg gctgctcaac ctccaaaccc tagttttagg 86521 gaaagcattc actgtgtcag tcattattag agaagttcaa caagggagca cccccacttc 86581 ctcagactca gccaatcaga agctcatgtc caagactttg accacagagc aaatgacaga 86641 gataaaagga tggagggatt agacttgtgc tgtccagcaa tggcggcttc ctggccagat 86701 tggtcctgtg gccagtctct ggggctttct tgatccatac tcatttccaa gcctgtcctt 86761 ccgactcctg agcccccatc cccactcata gccttctaca atcccatttt ttggttgagt 86821 caggcagagt tgatttctgt tgcttgtggc caacaaccct aaccaaaaga agcgtcagag 86881 tgttcccacc aaccacaccg ggtcttctct ctaagggaag tctggatggg gacaattttc 86941 tccatcatat atcactttag tgagtctgta ctgtcttcta tcgattggta ctgggctagc 87001 ttgcacctaa tcagctgtgg gcagatatca gtaagagggt gttagaaagc acttattaga 87061 aaatttgggg ccaggtgtgg tggctcacac ctgtaatcct agcactttgg gaggccgagg 87121 caggtggatc acttgaggtc aggagttcga gaccagcctg gccaacatgg tgaaaccccg 87181 tctctactaa aaatacaaaa gttagtcagg cgtggtggcg ggtgcctata atcccagcta 87241 ctcgggaggc tgaggcggga gaattgcttg aacccggggg gcagaggttg cagtgagccg 87301 agattgcacc actgcactcc agcctgggtg atagagcgag actccgtctc aaaaaaaaag 87361 aaaaagaaaa gaaaagaaaa tttgggcttg tgttaggtgc tttgggggac agtttaaagg 87421 aagtgagcat ttactctgga ttggatgttg tcaggaagtg gaggtaattc tctgatagag 87481 tgtcttccta attctttcct ttaaagggag aagaacagag taagactgaa actgtgattg 87541 atggagaact agcagtcact catatgagct aggaaatggg gctattgggt gatttttgtg 87601 gctgggacaa tgttcatgtc tttatctgcc ttcaggagta gtgttatttt gtcttgatct 87661 atcacggtct caggatggcc ttgtctgatg ccggcaatct gcaaaattgt ttacattcaa 87721 caggagaaca ctttggccaa gctgtgagtg tcaagccagc tcccggctgc caggggcagc 87781 ttttctcttt ctcacagtgt aggaagtaat ttatatactt cttttctgcc cataagtaat 87841 tactaaatct gcaactatag ggaccagtcc tacccatttt agtcttccct gcggggccca 87901 ccatgatgct ttgcatactg aacgtgctct gggaatgtgc ccagtgaact aaattatggg 87961 cacagagaac tctctgggac ttggcattat taagatcatg ctgaagtata ttttcaaaat 88021 ttaaaatgtg actcatacaa atgtggaatg aacacttgtc aaaaaattta tttaacccat 88081 taatgaggga accagtaaaa tggtaaagct cgctccaagg gcatttaaaa agtggactca 88141 tcagcatctt taatgaaaac cttagcataa gattgctaaa ttcgagagaa atctggttaa 88201 catgctataa gggcaataaa accataacct ttaatgtcgt cttttttcta ctggacagaa 88261 atctacaagt taacgtgtaa tttcactatg tcagtgctct aatcaaataa taaataacaa 88321 agtcagatac atgtactctc tccaagataa ttaatcatcc gtgggcaggt cattaagcag 88381 tgcctcagta tgacatggaa aagacattct tccttatttc caatttttag ttaattttca 88441 tttaagaatc ataacaaaag actcttttgt ggcttcttag gaacatattc aactgaaatt 88501 tttgttttaa ttgtgtttta gagacagggt cttgctctgt ggtccaggct ggcatgcagt 88561 ggtataatca tagctcactg cagccttgac ctctcaggtt taagcaatct tcccacctca 88621 gcctcctgag tagctgggat tatgggtaca cactaccatg cccagctcat ttcaaaattt 88681 tttgtagaga ggagttcttg tgatgttgcc caggctggtc tcaaacccct aggctcaagg 88741 agtcctcctg cttcgacctc ccaaagtgct gggattatag gattgagcca ctgtgcccag 88801 cctgaaattt ttctaatgcc cacctcacac ctggcaattc cccaggtatc agcagcagca 88861 gtagcatcct gctgatacgg acaggcggca gaaggacagg gtcccaggtg agggctccac 88921 cctcaagcct ggacctgcag ccctaaatga gaacaggcat tcctgttttc acacccaaat 88981 gttgcctttt ccaaaaccgc tctggaccac cctgccccca tcctatgccc ttaagaaccc 89041 caaaccccag gctccacaat cagaatagca ccagagtggc gtggcagaga aggagagaag 89101 agaagaagtg tctgaacatc gagaggagtt tggctgggga cagtcagaga ggagatcatc 89161 tgaggatggc tgaactccag gggaagatta ccttcccact ccatcccctt tccagctccc 89221 cttccactga gagccacttc cactacttaa taaaaaattt gtattcacca tccttcaagt 89281 ccatgtgacc tgattcttcc tggacaccag acaagaaccc aggtaccaag agggcagggt 89341 gtaaaaggct atcaccctga atctccactg aactggttaa cacttagtca tctgcgtaca 89401 gcaactgcta aaagagtatt aattgtaaca catccctaga cgctgccatg ggaccagagc 89461 ccaaaagcac tcaccccagc cccaatctgc tcacctgcat gctccccctc ctgcaagggg 89521 tttgatgcag tggcagccaa gtaagcgagc cacacccctg tcacaagtcc tgcaaagggg 89581 acaggggagc tcccccattt cactgctaac atttactgaa cagttattat gtggttaggt 89641 attgtaatat cttttttatg tatcttatct catctgctcc tgaccaaaaa aaaaaaaaaa 89701 aaaaaaaaat ccctaggagg taattcatta agccactagt gtggttaata aaatatctat 89761 ttggtcttca tccctggttc ccagcacaga gctcctaaag cccctggagt ttcctgagcg 89821 ataggactgt cttttgttat ccatagcaat ccccattgta ctacatctga gtgtatgcta 89881 atgagggcac tcaggctgag ttccctagag agatggcttc aggttggggc ctcgtcacca 89941 gaagagcaaa tgtgtgataa agacagggga gaggggtggg cagggcacgg tggctcacgc 90001 ctgtaatcac agcactttgg gaggtccagg ccggcagatc acgaggtcag gagttcgaga 90061 ccagcctggc caatatggtg aaacccccgt ctctactaaa aatacaaaaa ttagctgggt 90121 gtgatggcac gtgcctgtag tccaagctac ttgggaggct gaggcaggat aattgcttga 90181 acctgggagg cggaggttgc agtgagccgc aatcatgcca ctgcactcca gcctaggcta 90241 tagagtgaga ctccgtctca aaaaaaaaaa aaaaaagaca ggtggtcggg ggcactgccg 90301 gatgcagagg ctcatgcctg taatcccagc tactcaggtg gctgagccag gaggatcact 90361 tgagctcagg agtttgagac cagcctgcac aacatagcaa gactccatct ctaccaaaaa 90421 aaaaaaaaaa aaaaatttag gccgggtatg gtggctcaag cctgtaatcc cagcactttg 90481 ggaggccgag gcgggcggat catgaggtca ggagatcgag accatcctgg ctaacatggt 90541 gaaaccccac ctctaccaaa aatacaaaaa aattagccag gcatggtggc aggcgcctgt 90601 aggcccagct actcgggagg ctgaggcagg agaatggcat gaacctggga ggcggagctt 90661 gcagtgagcc aagatcgcgc cactgcactc cagcctgggt gacagagcaa gactctgtct 90721 caaaaaaaaa aaaaaaaaaa attaaaaatt agccaggcat gttggcaccc acctgtgtag 90781 tctcagctac ttgtgaggct gaagcaggag gctcactcct aggaggtcaa ggctgtagtg 90841 agctatgatc tcaccactgc actccagcct gggtgacaga gtgagaccct gtctcaaaaa 90901 aaaaaaaaaa aaaaaaacaa ttcggcgggc aacttccagt cccactcgct gacctgccag 90961 gaagaggagg aggctggcga ctgaattata aaagctcttg aacagtgaga tccagggagc 91021 ctctaggttg gtggacacac tggtgtgctg ggagggcagt gcacccagag agggcatcat 91081 ggaagctctg cacggctccc tctccacacc ttgcccaatg catctctctt ccatttggct 91141 attcctgagt tctgtccttt gtaataaacc agtaaacata agtgaagggc tttcctgagc 91201 tctgtgagtc attccagcaa actatcaaac ccgaggaggt ggtcgtggga acccccaagt 91261 ttgtaattgg ccaggcagaa gggtgggtgg ctccggactt gcgactggca tctgagatag 91321 gggcagtctg gtgggatggg gtcttttaac ttgcaggacc tgatgctaac tccaggacat 91381 agtgtgatag ctgaattgaa ttgcttgggc acccagttgg catccaagaa tcagagactt 91441 agtgtagaaa aatggcacgt atttggtatc agaaaaaaca catttggtgt cagaagtggc 91501 gtcagaaaac accacacaga gtaacagaga agtcgagcaa tttgctcaag accacacagc 91561 cggtaagcat ctccgctggg acacagaccc tggcgcaggc aaagtctgtg actgtaactg 91621 ctacgtgtgc tgcctcttgc tgcgcctgac ctcctcacac cagccaccag gtgcaacatc 91681 cttatcgcct tgcaggcaag aaggaaaagt gtggctcagg gtagcaaagc cagtgtttcc 91741 tgaacatcag tcacttgcac gttaccatgg caacagttgc tatgcctcaa gccctctcta 91801 ttattattat tatttgagac agggtctcac tctgtcaccc aggctggagt gcagtggcac 91861 aatctcagct cactgcaacc tccacctctc aggttcaagt gattctcatg cttcagcctc 91921 ctgagtagct tggattatag gtgcatgcca ccgagagccg ctaatttttg tatttttagt 91981 agagacgggg ttttgccatg ttagacaggc tggtcttgaa ctcctggcct caagtgatct 92041 ccccacctca gcctcccaaa gtgctgggat tacaggcgtg ttctctttta ttgtctattt 92101 aatacttttc tttatgttga ttctcttttt tacatcacca taaatgcaaa cctaatattg 92161 ttcctaaaag acatgaaaat tatatatatt tttttcagca gggtctcact ctgtcgtcca 92221 agttggagtg cagtggcgtg atatcagctc actgcagcct gggttccagc aatcctcccg 92281 cctcagcctc ctgggtagct gggggtacta caggtgcact ccagcacact gggctaattt 92341 tttgtaaaga cggagtttcg tcatgttgcc caggctggac tcaaactcct aggctcaagt 92401 gatcctccac tttggccttc caaaatgcta ggattacaag catgagccac taggccttgc 92461 ctgaaaataa attctttttc tttttctttt tttttttttg agacggaatc ttgctctgtc 92521 accaggctgg agtgcagtgg tacgatctcg gctcactgca acctccacct cccgggttca 92581 agagattctc ctgcctcagc cttccaagta gctgggacta caggtgtgca ccaccacgcc 92641 cagctaattt ttgtattttt agtaaaggct gggtttccac catgttggct aggctggtct 92701 tgatctcttg acctcatgat cctcccacct aggcctccca aagtgctgag attataggca 92761 tgagctacca tgcctggccg aaaaaaaatt cttaattaat gttttcctaa atgctgccta 92821 aaatcatccc agagcgtacc caactttggg aaaaactgga ccaactaagt ctgataaagc 92881 aggggttccc aacccctgag ccaccaccag tacaggtcct tggcctgtta ggagccaggc 92941 cgcacagcag gaggtgagtg gcgggtgagt gagtgaagct tcatctgcat ttacggccac 93001 tccccatgct cacagtaccg gctgagctcc ccctcctgtc agatcagcgg ctgcattaga 93061 ttctcatagg agtgcaaacc ctattgtgaa ctgtgcatgt gagggatcta ggttgcacgc 93121 tccttatagg aatctaatgc ctgatgatct gccactctct cccatcgccc acagatggga 93181 ccatcttgtt gcaggaaaac aagctcaggg ctcccactga ttccacatta tggtgagttg 93241 tataattatt tcattataga ttacaaagta ataatgatag aaataaagtg cacagtaaat 93301 gtaatgcact tgaatcatct caaaaccatg cccccacccc atccctggtc catggaaaaa 93361 ttgtcttcca tgaaaccagt ccttggtgcc aaagaggttg gggatcgctg ctataaagtc 93421 aaacagccaa gatatcaacc caggctcctg cctccaagtc tagggctgtt ttagctacaa 93481 catgggcccc ttcaccacat tgtgtttctg gttaactgtg tcgggcagca agtggagagg 93541 aactggaaat ttctcaactt ggctctaaat gagaccaaag agaaaaacca aatagtgatg 93601 gcagacaaca gatggtgaga gggctcactc ctccaggcgg ttaatgatgc cctgggcaag 93661 ctttccgaat ttaatcacca gagcgattct ggactatgct tcagtccaga atactctgat 93721 tgttctagga tccgctggaa ttttcctctg gaataactca aggtcagttc taacaggtct 93781 aagcagcagc atgagcattg aagacatgga acagtagcaa acagtcacca tcttcaaggg 93841 ggaatgagag cactgacttc caatctaaga gccttttttt tttttctttg agacagagtc 93901 ttgctctgtc acccaggctg gagtgcactg gcacatctcg gctcactgca atctccgcct 93961 ctcaggttca agtgatcctg gtgcctcagc ctcccaagta gctgggatta caggtgcaca 94021 ccaccatgcc cagttaattt ttgtattttt agtaagagat ggtgtttcac catgttggcc 94081 aggctggttt ggaactcctg acctaagtga tccacccacc ttgggctccc aaagggctgg 94141 gattccaggc atgagccaac gcatccggcc catctaaggg ctttttaatg acctggtata 94201 gctcaccaaa ctccacgtgg ggaagcctcc tatcgactca agatgtcagt ttattcaggc 94261 atttcccaaa gtgttgcttt caaaggtaaa ttaaacttca ttatctaaat gagcagtttt 94321 cgtcttgggt caaactatag ctccaaaatg ctttcatagg tggcagacat ttaagaattt 94381 tcagccacgc acggtggctc atgcctgtaa tcccatcatt ttgggagact gaggcaggag 94441 aatcacttga ggccaggagt ttgataacag cctgggcaac acagcaagac cccatctcta 94501 caaaaaaatt aaaaaattgg ccaggcatgg tggcatgtgc ctatagtccc cactactcaa 94561 ggggctgagg caggaagatc actggagccc aggagttcaa ggctgcagta agccatgatt 94621 catcactgca ctccagcctg ggcaacagag tgaaagcccg tctcttaaaa atacatatat 94681 atttctcggc cgggcatggt ggctcatgcc tgtaatccca gcactttggg aggccgaggt 94741 gggtggatca tttgaggaca ggagttcgag accagcctgg ccaacatggt gaaaccccgc 94801 ctctactaaa aatacaaaaa ttagccaggc atggtggcgg gcacctgtaa tccccgctac 94861 tggggaggct gaggcaggag aatcgcttga acttgggaga caaggttgca gtgagccaag 94921 attgtgccac tgcactccag cctgggcaac agagcgagat gctgtctcaa aaaaaaaaaa 94981 aattaagaat aagaaaaatc tatcttctgc tctgctcttg gccagctcac ttgatttctt 95041 taaaaaaata aaaataaaag gagaccctgt gttggttttt agaagtccaa aacaggaaca 95101 aatggaaagc aatagtcatt gaatgatgac aaattctgtt tggttggaca ctgtcagaaa 95161 gcctgtgctt ataattttct tgataaatat agccccaggt tcaatcaaag gcataatcaa 95221 cggacttcaa taacggatgt gatgatgaag acaacagatg gcatcggaat tatctgacgc 95281 cacgggagac cacaaacaca catttcatta tgattctgat cacgatccta ggctacaggg 95341 gagagagaga ggctcccact gctgtgccaa ggagtcaaac taacataggg cagattttgt 95401 actttttatt aacatctgaa gcaagttttc gtcaagaaag gattaagaag atgagtcatc 95461 tataagcagc tgatggacac tagggttctt ctatttcagc cagcgaagag gtgaggggat 95521 gtctgtgccg tgggaggtga gggggcatca gcctacctgc ttctaagact ctcaatgacc 95581 aggctatgcc ttggaacaac tagacaagac tctggctgta agaccaggcc ccagcatttt 95641 ctttaaagtt ctccatgtga ttgcaatagg cagtcaaggc tgagacgcgg aagcatgggg 95701 ggatgttggc ctggtgcaag atggacacat tgcctccaac ttctggctac ttctgtaaag 95761 gctgttgagg gatattagtc agtgctcaat cggggaagca gggtcactac gtgctctggg 95821 gcaagggatg cgttatagga attagacctt ctacaattgt ccagggagct ggggagatgg 95881 cggtctggga agggtggaga ggatgagaga agtcaccgac agctgatatg agagtcagat 95941 gcgtccagct gttaagatgg agtggcagag gggaaacagg tagaggggtc cctagggggc 96001 tgctgcctcg tctgccaagc atccagaggc gggggggctg ctgttggtca gcaggcccag 96061 cggtcaggaa gacaagctgc aagggacaca ggggagggtg aggacaagct ggaacctctg 96121 tggcccctct gtcagatcac ctccttgtct catctggaca gccttcaaag agtaaacact 96181 gatgtcacct tccttccact tcccaaatct cccgcaccat cctcttctgc ctacttttaa 96241 cccacaaccc tacagggaaa aggatatggt ttggctctgt gtccccacca aaatctcact 96301 gcaaattgta atccccacgt gtcgagggag ggaagtgatt ggatcatggc ggcagtttcc 96361 tccatgctgt tctcgtgata aagggtgagt tctcatgaga tctgagggct ttataagtgt 96421 ctggcatttc ccctgctggc tcttattccc tcctgctgcc ttgtgaagaa ggtgcctgct 96481 tccccttctg ccatgattgt aagtttcctg aggcctcccc agccatgcga aactgtgagt 96541 caattaaacc tctctatttt ataaattact caatctttgg tatttttttt ttccagtgtg 96601 aaaatggact aatacaaaat gggatatggg aaataaagtt ctcggtatag ctaagctttc 96661 ttttttcttt tctttccttt tttttttttt ttgagatgga gtcttgctct gtcgccaggc 96721 tggagtactg tggcacgatc tcagctcact gcaacctccg ccttctgggt tcaagtgatt 96781 ctcctgcctc agcctcctga gcagctgggt ctacagggac acaccactgt agtcacctaa 96841 tttttttttg tatttttagt agagatgggg tttcaccatg ttggccagga tggtctccat 96901 ctcttgacct catgatccac ccaccttggc ctctcaaagt gctgggatta caggcgtgag 96961 ccaccgcacc tggccctctt tcttcttttt gagacagagt ctcactctgt tgcccagact 97021 ggagtgcagt ggcacaagca tagctcactc cagccttaga ctcctgggct caagcaatcc 97081 tcccacctca gcctctagaa tagctgaagc tacaggcatg caccactatg cccagctaat 97141 tatttagttt ttttgtagag atggggtctt gctatgttgc ccaagctggt cttgaacttc 97201 taggctcaag cgatcctccc acctaagcct ctcaaagtgc agggattaca ggtgtgagcc 97261 actgcattca gcctaaattt ttaatagaat attgcagcac agactagaag gaacagaaat 97321 ccactcaagc tcacttgaga aaaatatagt ttagtaaagg tgtaagaaaa tgaagggaaa 97381 tgctagcatc ttgacagatg gtacccatat gtatcttggg gagataatgg tttttgcttg 97441 gaattatggc ttgacaaaca cctggggtgg tacttgcaag agctcagcat aaattggtgc 97501 cgtggcttct tagaagaggg caaagctgct gtggataaga caggagaagt aagtgtcttc 97561 cttggaatgg agggagctga gataaaggag aagagaaaaa gaggtgatga aactggaccc 97621 cgtgtgtgga aaaggccaag aaaaaggaaa ccaccccact gccttatggc acagggtggt 97681 gaaaaagatc ttgttaacca ggaatgtcaa atgtggattt ctagttctcc tagttatggc 97741 catatggata ccacgatgcc ctttactgga acattgactt tttttttaag acagagtctc 97801 actctcttgc ccaggctgga gtgtagtggc acaatctcgg ctcactgcaa cgtcttcctc 97861 ccgggttcaa gggattctcc agcctcagcc tccctaatag ctgggattac aggcatgtac 97921 ctccacaccc agctaatttc tgtatttctt agtagagaca gggtttcgcc ttggcctccc 97981 aaagtgctag gattacagac gtgagccacc gcgcctggcc ttttcttttt tttttttttt 98041 tttttttttt ttttgcaatt aaaaaaaaaa tcaaggtctt tgcagggctg tggctggggc 98101 tagtcctctg gggctggagg gacagttaca tggtgagcca ggagacgatt gtcactggtc 98161 atggttttta agctgtctgc tgtgaagaga tcaatgacac cctttttgtg atcctcattg 98221 tattgctcct caaatgacct gtgggagtag gatagggcat tgttttattc tgttgttatg 98281 gaaaccagtg aagcatctga gtttcatatt cagttggatg gaggtacagg actcaaggaa 98341 ccaagcctag gagcagagaa gacttgagga tggaagcccc tggaaatgta cagacattgt 98401 acagctggat tttggatttc cctaggcggt ggcaacacgg ggtgggaggt atttctccaa 98461 gggtctggag aaaggatgcg cagggttgaa caaggtgcaa catcttatgg actacccagt 98521 tcatctggac tgcaatggga atgtcatggc tgagttcctt ctctgctttc aaaggacact 98581 gtctctcttg ttattcttga ggcagacgct ctgtcctctt tcccttcaaa gactggcttt 98641 ctctgcttat gcatagactt cggtttgcca ctccctcatc atccacctgg gctaatctct 98701 ctgcatccca ggagaaagac tctgattggc ccagactgag caaatttgga ccaatcagag 98761 aggagtgagg agaggctcaa gcccccctac acctgaggct gctacttggt cccagctact 98821 cagaaggctg acgcaggagg attgtttgaa cctgggaggc agaggttgca gtgaaccaag 98881 attgaaccac tgcactccag cctgggcaac acaatagact ctatctcagg aaaaaaaaaa 98941 aaaaagaaag aaacgaactt cctcttccac gtaatatcct tcagatattt tgagacaaat 99001 cttaggtgcc ccttgagact tctctcgtcg tttcaacaac cttgacatgg aaaagatcat 99061 cattccaccc tctaatctcc cccttctctt gtgtctcctc ttccaaacaa agaaaccagt 99121 gtctacactt atgcctgtat ctatccattt ccaaactcaa tttcttcctg gtggtaaagg 99181 aaagtaacat ttaatgagca cttacgtggt agcaggcact ggactgagca cttaatatgc 99241 ttgatctctt ttactcctca tgaccaccct gcgagatact ggatgcatgg ctatttaaca 99301 gatgaggcca ccgaggctca gagaaggtaa gatatttgtg caagattgca cagcctataa 99361 atggcagagg tagaatgtaa atctaggtgt aatttaatcc ggagcaagta aacacactta 99421 ccaggcacga ggtctattta tcagaaagca accccatcca agaccaggat atctagtaat 99481 gaaaaaacag ggatctggcc aggcgcggtg gctcacgcct gtaatcccaa cactttggga 99541 ggccgaggcg ggcagatcac gaggtcagga gattgagacc atcctggcta acacagtgaa 99601 accctgtctc tactgaaaat acaaaaaatt agccgggcat ggtggcaggt gcctatagtc 99661 ccagctactt gggaggctga ggcaggcgaa tggcgtgaac ctgggaggca gagcttgcag 99721 tgagccgaga tcgtgccact gcactccagc ctgggcgaca gagcgagact ccgtctcaaa 99781 aaaaaaaaaa aaaaaagaaa aagaaaaaaa aaaggccggg catagtggct caagcctgta 99841 atcccagcac tttgggaggc cgaggcaggc gaatcacgag gtcaggagtt caagaccagc 99901 ccggccaata tggtgaaacc cggtctctac taaaaataca aattagctgg gagcagtggc 99961 gggtgcctgt aatcccagct actagggagg ctgaggcagg agaatcgctt gaacccagga 100021 ggcagaggtt gcagtgagcc aagatcaagc cactgcattc cagcccaggt gacagagtga 100081 gactttgtct caaaaaaaaa caaaaaaaac aaaaaaacaa aacagggatc ctaccacgtt 100141 tggcagaacc caaccaggaa tgtcttggat tgatctaata taacttgaga tccccgagga 100201 gtttagggga ctgaggacag acacagtaga ctggaattcc aggagacaac aagatctaag 100261 taactgatgg ctcaatgaat agcagccaat tacataacat gaaatatgta agtcaggata 100321 ttgccttaag ggggtgctcg ggaatctggt tgtgtagacc caaaacacca aggcagctct 100381 gctaagtcac ttgcagtttt attccctccc atttctcttt gggcatcctg tcctagaact 100441 gtcaagaaag ggttggtatt ttatttgaag gaaaatgtgg gaaggaggct cagccaagca 100501 ttatcctatt caattcagac catcacgtga tagggacaag cccttctcaa tctctgctgc 100561 cactcacatc agcagctttc cagaaataag caaagggaat ggaaaaaccc ttaatcgcca 100621 ggccacggta catgcaattt actggtatct attgccctgt ctttctaaga agcttctttt 100681 ttgggagatt tcaaaaaaat gatgagaacc cggcacagga tgaagctatg agggtgcccc 100741 atgtagtgtg tcatcttccc ctttccccag cctcaccagg gccgtctctg tcattgttgc 100801 tgccacactg tggctcctgc tctgggcttg gcccttatat gaattgctgt ggtcactcaa 100861 tatggtagac tgaagaatgg tccctgaaga tatccatgtc ctaatgaacc tatgaatctg 100921 ttaccttaca cggcaaaagg gattttgggc caggcatggt ggctcatgct tctaatccta 100981 ggactttagg aggcctaggc gggaggatca cttgaaccta ggagttcaag accagcctgg 101041 gcaaggtgat gagatcccaa ctgtattagt ccattctcat gctgctatga agaaataccc 101101 gagaaggagt aacttgtaaa gaaaagaggt ttaaggctgg gcgtgatggc tcacacctgt 101161 aatcccaaca ctttgggagg ctgaggtggg cggatcacga ggtcaggagt ttgagaccag 101221 cctggccaac acagtgaaac cccgtctcta ctaaaaatac aaaaaaaaat tagctgggca 101281 tggcagcggg tgcctgtaat cccagttact tgggaggctg aggcgaataa ttgcttgaac 101341 ccgggaggtg gtggttgtag tgagccaaga ttgagccact gcaatccagc ccagtgacag 101401 tgcgagactc cgtctcaaaa aaaaaaaaaa aaaaaaagaa agaagaaaag aggtttaatt 101461 gactcacagt tccacatagc ttggaaggcc tcaggaaact tacaatcatg gcagaaggtt 101521 cctcttcacc aggcggcagg agagagaatg aatgccagca ggggaaatgc cagatgctta 101581 taaaaccatc agatctcatg agaactcact atcatgataa cagcatggcg caaccaccca 101641 catgattgaa ttaccttcca ccgggtccct ctcatgacat atggggatta tgggattaca 101701 attcaggata agatttgggt ggggacacag ccgaactatg tcaccatctc tacaaaattc 101761 aaaaattagc tatgtgtggt ggcacgtgcc tgtggtccca gcttcttggg aggctgaggc 101821 agtaagattg tttgagccca ggaattaaag gaagcaatga gctatgaccg tgtcactgca 101881 ctccagcctg gacaacagag caagacttcg tctctgaaaa atatatattt ttaaaaagga 101941 aatttgcaga tgtgattaaa ttaaggggtc ctgagatggg gagatcatcc tgggttatcc 102001 aggtgggccc tcaatgtcat cacaagtgcc cctacccatt tatgcctgag gttgcaattt 102061 tttgaatttg agaaatcaga cctggtgata gccttgagaa gtagcatata aataactccc 102121 acatgcttag tgttccaata atggaatgct aggcatacat ttaagagaaa aacagaggga 102181 tttgacacag aagagcaaag gcgacaggat gacctcaatc gagagggact ggactatgcc 102241 atgctgctgg cttcggagat ggaggaggga aacacaagcc aagaaatgca aagagtacgg 102301 ttctagaagc tggaatgaat gaggaagagg cttctcctct agagcctcca gaaagcacgg 102361 ctctgccaac acctgatttc agctcagtga aatcgatttc agacttccag cctccagaac 102421 tgtaagataa aagaaatgtg tgttgtttta acacatttgc agctgtttgt tacagtagcc 102481 acaggaaaat aacgcactta gctgcattat cactctgttc cttcctatga atggacccac 102541 tgttcaccct aaccatcaga gagccaggca gttctcacca cccagacctg ccacatggaa 102601 gggaaccctt gagattatgt gaccgtggga gaagcaggaa gatgagccct cattaattca 102661 ttcaatcaat cttcactcag agtgcctacg ctgagtttgg ctgtgtgctg ggtaccgtca 102721 acatttttgg gagtgaaaat agctcagcct ctgtcctcat ggaggtcagc atctaggagg 102781 agagtcagat attgaacata cactcacacc aatgaacaca aatttacaaa aggcttctaa 102841 ggaaaggatt gcagggatgg taatgggaca gggactgggg aagtgctggg aaggtgtccc 102901 tccagaaatg ccctagatct cagtcctcga ggggacagag tttatcaggt gccttagtcc 102961 atttgggctg ccattctccc tttcttagcc taggtggctt ataaacaaca aacaccgccg 103021 ggcgcggtgg ctcacgcctg taatcccagc acgttgggag gccgaggcag gcggatcacg 103081 aggtcaggag atcaagacca tcctggctaa cacggtgaaa ccctgtctct accaataaat 103141 acaaaaaagt tagccgggcg tggtggtggg cgcctgtagt cccagctact cgggaggctg 103201 aggcaggaga atggcgtgaa cctgggaggc agagcttgca gtgagccaag atcgcaccac 103261 tgcactccag cctgggcgac agagctagac tccgtcaaac aaacaaacaa acaaacaaac 103321 aaacacttat ttttcatggt aaaagaggct ggaaagtcca agatcaaggt gtcagcagat 103381 tcagtgtctg gtgaggacct cttcctcata gatagtgagt tctcactgtg tcctcacaca 103441 gtgaaaggga gaaacaagct accctgggcc tcttttacat gggcacgaat cccatccatg 103501 aggatctgcc ctcatgacct aatcacctcc tgaaagcccc acctcttgat attattacat 103561 tgggaattag atgtcgacat atgtatttag gggtgataca aacattcaga tcatagcacc 103621 agggaaggag aggggggtgg gaggagaggg atccaggcag agggaacatc acaaacaaaa 103681 aggcccccac ggcaggaggg gacctggtgt gtgtatcggg gggtggggag gtggagacta 103741 aaaggaggcc caagggagtg gtgtgtgtgt gtaagggagg gagctggggg tgaggaatgg 103801 atgaagaaaa gaagttggag aagaggtgtg agcagatcca gcaagatctc atgggcttta 103861 tcatgctaac aaattctgga aggtgtttgc tatggtgtaa atgtttatga ctcttcaaaa 103921 ttcatgttga aacttaattt ccaatgcaat agtattaagg ggtgagggcc attaggaggt 103981 gggatcagtg cactaacaaa agtgctcgag agagcaagtt cagtcctttt tgcccttctg 104041 tccattctgc ttgtgagaag acggtgccct tcctctctgg agatcacaga agcaaggcgc 104101 catcttggaa gcagagaatg agcccttcta agtatggaat ctgctggcat cttcatcttg 104161 gacttcccag ccagactcca gaactgtgag aattaaattt ctatttataa attacccagt 104221 ctaaggaatt ttgttacagc aacaggaacg gactaagaca gtgtttaaat ctgagagtta 104281 taatcagatt cacatggtga gaaaatccct ctggctgcag gacagagatg ggcatgtgaa 104341 cctgggtggt ggccatggag atggagaaag atggacagct gagcattatt ttaggaggta 104401 aaacaaacac cagttgatga tgaactgaat atatgaggtg gagggttggg aggggtcaag 104461 aatgatgtct aattttggaa tttgagattt ccatggatag atggtgatgc ttgaccattt 104521 gctgagctag ggctggtggg tattgggact agatgggggg atcatgagtt tgacttggac 104581 agattgagtt acaggtatct ttgagacatt taagtggaga tatcagatac tgtaccttcc 104641 cctgcctgtc tttctcagca ttttcccctt agaaatgtgt gtccaaaaag aaaaataccg 104701 tgtgattcca cttctttttt ttttatttta ttttttgaga cggattctca ctctgtagcc 104761 tgagctagag tgcagtggca caacctcagc tcactgcaac ctctgcctcc aggtatcaag 104821 tgattttcat ccctcagcct cctgagtagc tgggactaca ggtgtgcgcc accacgcctg 104881 gctaattttt ggtattttag tagagacaga gtttcaccat gttgcccagg gtggtctcga 104941 actcctgagc tcaggcgatc cacccacctt ggcctcccaa agtgctggga ttacaggcgt 105001 gagccacagc gcctggcctg attcctgttt tatgaagtat gtagagtcat caaaatcaga 105061 gacagaaagc agaatgctgg ttaccaggag ccaggggaaa gggaaaatgg ggagtgagtg 105121 ttgaatggta cagagtttca gtttgggaag atgaaaaaaa ttctggggag ggatggtgat 105181 ggttacacac agtatgaatg ttcttaatga catagaactg tactcttaga aatggtttca 105241 gatagtaaat attatatata ttttgccaca atttaaaaaa tagaagtgtg tgtgcaaact 105301 acatataagt gtgcataaaa tgttttattt ttatagtaaa atttatataa aatatactat 105361 aataataata aaatataata aaatatgatt tattatattt tattattatt tactattatt 105421 tcatacttca tagaattatt attatattat ttttatggaa taataaagca gttattttcc 105481 caacatccta cttaagtaaa aatctgtgtg tgacttttag acggccccgt gtgccccatt 105541 ccctcatatt tttctcattc cctcagaggt aggcactatc acaaacttgt ggtaatgctt 105601 tttattttta tcatttacca cccatatatg aagcccggaa caattatatt ttttagtttt 105661 tttttttttt tttttttgag atgcagtctc gctctgttgc ccaggctgga gtgcagtggc 105721 gggatctcag ctcactgcaa cctccacctc ctgggttcaa gcgattctcc tgcctcagcc 105781 tcccgactag ctgggactac aggcatacac caccacaccc agctaatttt tgtattttta 105841 gtagaaacgg ggtttcgccg tgttagccag gatggtctca atctcctgaa cttgtgatcc 105901 acccacctca gcctcccaaa gtgctgggat tagaggcgtg agccaccacg cccggcctat 105961 attttttact tttatacact ttatataaac ctgttgctca acatcttgtt tttgagatgc 106021 tttcatgttc atacctgttg ctcattcatt tttcactgct gtatagtatt ctgtggtaca 106081 gctagaccac aaacgaggca gttatgaaca ttctggaaca tgcctcctga tgctcatgtg 106141 tagtttctct agggcagtgc tggtgaatag aatgttctac aacaatagaa atgtcctata 106201 caaatgtatg ctttccaata gggtagccac tagccattat ttactaatga gtgcttgaaa 106261 tatggctcag gctaccgaga gactgaattt tacatgctat ttaattttaa ttaatttaaa 106321 ctgaaatagc cacatgaagc tactggctaa ccaggggtag acccaggagc aaaactgcag 106381 gttataatca cctgttcatc tttactaggt aacaatgaac tgttttccaa agagataagg 106441 ccaatttccc tatattttca ttttatccaa cttgctaggt ctttgctaac ctgataaact 106501 atccaaatga aaatttctga gaataaaaag ggctctatca gtcagggttc tccagagaaa 106561 cagaaccaat actactgaag agatttattt tatttgtttt ttctttattc ttttcttttt 106621 tctttttttt ttttctgaga tgggatctca ctctgttgcc caggctggag tgcactggca 106681 taatcttggc tcactgcaac ctccacctcc caagttcaag cgattctcct gcctcgactt 106741 tccgagtagc tgggactgaa agcacacgcc accacagccg gctaattttt gtatttttag 106801 tagaggcagg gattcaccac attggccagg ctggtctcaa aactcctgac ctcaaatgat 106861 ctgcccacct tggcctccca aagtgctgga attacaggcg tgaaccactg cacctggcct 106921 ggaaaaagat ttatgataag gaattggctc atgggatttt ggaggccaag aagtcccaag 106981 atctataggt ggcagctgga gacccaggag agctgacggt gttaagttcc agtccaaagg 107041 ctggcaggct caagacccaa gaagagctga tgtttcagtt tgcgttcaaa ggccggaaaa 107101 gacccatgtc ccagcccaag caatcaggca gcaggagttc cctcctactc accctttctg 107161 ttgtagtcag gccttcaact gattggacgg ggcccaccca cctctacatt agggagggca 107221 gtctgcttta ctcagtttac caattcaaat gttaatttca tccaccaatg ctcttgcaga 107281 cacatacatc gaagaatctt tggccaaaca tctgggaacc ctgtggccca ctcaagttga 107341 tgcataaaat tgataatcac cagggtctta catggtaatc tccctcccaa cccaatggag 107401 tctaggtggt tcagaggtgg tcctctgagc agaatgtaat acagttctag gaagacggaa 107461 cttctagaga gtattttcct tctcgaaact tgttctgacc atgaaaggca actcgtagca 107521 gcaagagctt gtcaagtaca gaacatttat ttgatttatt tattttcttt gacacagagt 107581 ctcattctgt cacccaggct ggaatgcagt ggcacaatct tggctcatgg aaacctctgc 107641 ctcccaggtt aaagcaactc ttgtgcctca agcctctcga gtagctggga tttcaggtgc 107701 acaccaccat gcctagctaa tttttttgta tttttagtag agatggggtt ttgccatgtt 107761 ggccaggctg gtctggaact cctgatttca ggtgatccac ctgcctcagc ctcccaaagt 107821 gctgggatta caggtatgag ccactgcgcc cagactaaca tgtatttctc cattcaatca 107881 acacagattc ccaccttcct ccccacctcc cagtacattt agaaattcag gaggagggtc 107941 cttccattca gaagtctagc ttagaaattc tccctcctct ctcctccctt tagtctctcc 108001 gtttcttctt ccttcctttc cctcccctac accccaactt cttgtatatt ttgtccaaga 108061 caaactgctt cttagatctt ctgcaattaa gggcatctct tcccatccga tatgtacagt 108121 ggccccggca atggtgacct gttcctggaa gatgctattt aaaaccctgt gtgcaactaa 108181 agtgacgcag agctcctgaa tgtggctgaa atgtgtcaag ttacataaac gcacatgctt 108241 cattttgccc ttggcttgaa gctcttccta acagacttca gaattatcag ctcattaaag 108301 gggaaaaaat gcaggtttat aaaaaaatct taaataatca tcataataaa atcaaggccc 108361 cagggtagag ttgcacttaa tggttgagag ccactctcaa gagagttggc acctcccaag 108421 aggatctcaa ggaaaccaca gtgaatccca tcttctcttg gggagaggaa tattttctgg 108481 gaattctctg gagacaacca taagcagccc tgatgcccta ctctcccacg aagcacaatg 108541 ttaacaatga aaggaaagaa gggaattttg aattgcttac tacctaggaa gggccaccca 108601 atttcatacc tgtgtctgtt ttcatccctt gctatgatgc tgcagggaga tatcactgtc 108661 ccccttgtac tgatggggaa gccgaggctg aaacaggaca aagtatttgc tccaggccac 108721 acagttggta agaagtagaa ttcaaagcta gttgagagac cccacagctt gatgactgag 108781 accatggact ctgtgatatg caagagaggt gataggaagg gaagggcatg gtcgctttaa 108841 atgctacaga aggaaggaag ggaagtgctg ggtagaggag ggtgtggtcc ctggtaaggg 108901 ctccaccccc acgcctgtgc ccacagtcct aggtgaggac aggcattttt gttttcctgc 108961 ccaaatgttg catttcccaa gaccaccctg gcctgccatg cccccatcct gtatctataa 109021 aaaccccaag accctagcag gtgacacaca agctgctgga cgtcaagagc tgcagatcgg 109081 caaagaagac acaagcgtct ggatgtcaag aagacgtcaa gaggaacacg ctggcagaag 109141 agcacatgac agatgctggc aggccatcta ctggcggaac aaagtggagt ttggcctggg 109201 cagtcggagg agtgcccagg ctgctgggcg gcctgactcc agggggaaac cttcccactc 109261 catccccttc tggcttcccc catctgctga gagctacctc cactcaataa aaccttgctc 109321 tcattctcca agtccaggtg tgatccgatt cttctggtgc accaaggcaa gaacccagga 109381 tacagaaagc cctctgtcct tgggacaacg tagagggtct aattgagcta gttaacacaa 109441 gccgcctata gatagcaaaa ctaaaagagc accatgtaac acacacacac tgaagcttca 109501 ggagttgtaa acatccaccc ctagacactg ccgtgggatc agatcccgac aactggccca 109561 tctctatggt cccctagagg tttgagcagc ggggcactga agaagcgagc cactccgaca 109621 tcacacaccc tgcgaggggg acaagggaac ttttcccatt tcatctggac ttagaatgct 109681 atttcatgga aactgtacaa ccatgggcaa gagacttgac ctctcagtgc ctcagtttcc 109741 tcatctataa aataaaggct atgtgatatt ttctgcctgg atggtgatag tgaggattat 109801 atgaatcaaa acaataaaac actagaaaac tagccagcat agaaaatgtt ttaaaagaac 109861 tgctatctac aatgcacaca ctttccgtgt gtgtgtgtgt gtgtgtgtgg ttttcaacac 109921 aatgcatgca ctattacccc attttgaatc tctgccaggt tcccagggcc tgtcctcttt 109981 accacacttg gtacctctta ataaaacgaa gttactggat ttcaagccca ggctctggaa 110041 atattggaac tcaaaggctg tgctcttgaa atatgtcctt cgcacgccta ccttccctgc 110101 ctgctgcagt tctgtctctg gctgattaaa acctgctttt gcccagagca agctgagtgg 110161 cagggcatgc tatccctttt tatgaaaatc cagcaagcgg gtgtaaaagg aagataacca 110221 cacccctgcc ccatgtggag gccaaagata cttaattatc tgaaacccag ataaacagca 110281 ggagcgggtt ggtagactag atgtggaaaa gctgaagcga gccagagggg ctgcaagaag 110341 aattattaaa aactaatgat gtgattcagt gctggaagga aggcgagaaa atgagtctga 110401 ggttgttgac agcctctgag agttcttgcc tgcagtaaga tgggggcagg atgggtaggg 110461 agatggggta gcagtgggag gtactaggag gtgggtggaa catagggtca agaagtgaga 110521 aaacacaggc gtttcaagca ttatctgcca gatgtttgct gcctgttttc cggaggcagt 110581 tttgtgttac tgccctgttc cagtaaagtt gataaaattg ccaagcctca agaaagagat 110641 ctctaaatgc cattttgcac attggaaacc aggggtcaag caaagggaag tcagcaaggg 110701 aacacacact ggaaattcag agtgaggagc acactcagag ttcacacctg cctgctggga 110761 gctcacttat ggctggtctg tgaacttgac ctcctgggcc tgattcaggg caagtgggtg 110821 tcctgggttg gggtggttcc tgatgcatct ttctgttagt aagcatttga tgtgggttat 110881 gtgagcatgt gtgtaaaggg cttagcacaa gcctaggtcc acaataaggg gtctgtaaat 110941 gccaactgtt gtgggcctta agagttatta cgggaaccgg gttctttttc tacagcacat 111001 tcccagcaga ctgacttcac aaactttgac tgtctgcatt ggagctcatc attttggtct 111061 ctgaaataaa cagtttatgt gcagagaaat ggtgatactg aaacaccagg ggtttggtct 111121 aggtcctgct gctcgccgca tagaaagccc atcactgaca agatgagtat tgccaaggaa 111181 gattttaatt gcgtgctgca gccaaggaga tgggagctca gtctcaaatc catctccctg 111241 accaactaaa actaggagtt tacatagcag ggaagaaatg taacaatgta taagaaaaca 111301 ggaactcggg aggagcaagg aagcaatcat gaagaaaaca ggaactcggg aggagcaagg 111361 aagcaatcat gatgaatgag gggtctggta tctcatctct ggatgtggta atctggtgag 111421 tttcagttct ttgatatttt ttgagaggcc tgggggtccc ttcctgagga aataactcag 111481 ataagacaaa tgagggcttc aagctttaag atcagaaagg tcaacttctg tatttatcca 111541 aaagagtatc tatgggccgg gtgcagtggc tcatgcctgt aatcccagca ctttgggagg 111601 ccaaggcggg aggatcacga ggtcaggaga tcgagaccat cctggctaac acggtgaaac 111661 cccgtctcta ctaaaaaata caaaaaaaag ttagccaggc gtggtggcgg gcgcctgtag 111721 tcccagctac tcgggaggct gaggcaggag aatggcgtga acccgggagg tggagcttgt 111781 agtgagtcca gatcgtgccc cactgcactc cagcctgggt ggcagatcga gactcttgtc 111841 tcaaaaaaaa aaaaaaaaaa aaagaacatc tatgatggga ctattgggtc agtttcaatg 111901 gtacagaaaa gtccaaagcc tgacttagta aaatctgtcc aagttccacc acacacaatt 111961 caacattgca gaacaagaca agaccaagtg tgggactggg ggcatctaat cgggggtttg 112021 tccttgcgtt gtaaaagagt aaggcaaaga gaaaaggggt cctggcattg agaataaaga 112081 taacagaaca gcttaaggtt aagggctctg cccccacgag acagacacct tgggttcaaa 112141 tcctggtact actttatcct agctggtgac ctcgaacaag tcactttgcc tctccgacct 112201 caatttcctc attcataaaa taaaagagat agaacttact ctattggggt tcttgcgagg 112261 actcaagaag ctaaaattaa ggggtgaatg catgcagggg acgtagcact cagggtggca 112321 tatcagtgcc cgcgggaaac agcaactctg tgtgctccct catgtgtcca aggtcacaga 112381 ccaaggacgt cagagagcca ggatgatgac ctctgatccc acatctcccg agttctcacc 112441 attccctggg agtagggaaa tctccctcct cttcttcctc actgtcgtgg caggaaaaga 112501 ggaagaacag gcaggggctg tggatcttgg gggaaaggac aagttaaatc agctgagaaa 112561 aaggaagccc atgaaataat ctaaatgtga gggtgccaga gctttggggc tggggcaggc 112621 agtccccccc ggactcatgc ttaaaagagt caatgcaggc tgggtgcagt ggctcacacc 112681 tgtaatccca aggaggccaa ggagggcaga tcacttgagg ccaggagttc aagaccagcc 112741 tggccaacat ggtgaaacct tgtctctact aaaaatacaa aaattagctg ggcatggtgg 112801 tgggcacctg tagtctcagc tacttggaag gccgaggcac aagaatcgct tgaacccagg 112861 agacggaggt tgcagtgagc cgagatcgca ccactgcact ccagcctggg cgacagagcg 112921 agactctgtc acaagaaata aaaataaaag agttaatgct gcgagaagca ttggaagtga 112981 gacacattct caaccatgaa ataggggctg actcacatgg tctctgcagc agctgatggg 113041 cattaaggtg attacttcgc agcaggtgtg acaaaacaaa tctcaccgcg gctattacac 113101 tacggccacc gggagtcttt cagcccaggg ctgtcacatt cctgtcactt aaagcttcct 113161 cgcctgagcc tgaaaatgat cagtagggtc tgaaattttt ttctctccta ctctgtgcca 113221 agagtgtctc acgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtaaga ggattgttgg 113281 ggtaaggatg gagtgagaaa gaataattta ttgcacattt gacaaatatt catgacactc 113341 agtgccaggc attgttccaa atgctgggaa gacgttgtga ggaaattgga taaggtctct 113401 gccctcatgg agtttataaa cttgaaggaa ttatcttggt agtttatact caaagcttcc 113461 ttccttcctc cttttctttc ctcccttccc tcccttcccc tgtccgcctc tcccctccct 113521 ctccccctcc cttctctccc actcccttcc ctccttccct gattcactta ataaacattc 113581 tggagtcaga ctgcatccta gcttggctac ttattagcta atgcagcatc aggcaaattt 113641 tgtaacctcc cattacttcc catccaaaat ggagataaga acaggacaca ctcacaggat 113701 aagccccaca gcttcgggaa aaaccatctc tgaggaacag ccaagaaagt gaattcaagc 113761 agggccttgg ctcaaagcct gttggcctgg agggctgagg atgatggatg aggactttgt 113821 gcagatctga gcatactgtg tcctttgtcc tggttgcctt agtgtccacc gtctatctct 113881 gagattgatg aggcaggtgc accactgggc aagctacaca ccatacaagg ctgagcctgg 113941 ttggggagcg tcatcaccat ccaatactgt ctgtaggact ccattccagc aagacatctg 114001 ggggcctgga agggggtccc tcccctggaa aatgtcacag aatccctgat ctatgtccta 114061 acaacactca ggtccttctg tatcctttag ataatttgat ttatcctagg gtctactgtt 114121 gacatttgtc cctaagtggt cagattcatg ggctagatga ggacattcaa tgagatcaca 114181 agtcaagagc ttactgcaga gctgcctctt aagaaatgct agttattctt gtccttaagc 114241 ctgggctaag tgacaggcat gtgcacatcc atacacacac acatatccat acatccatac 114301 ccacatccac acacgcacac atccacaccc acacacacat ccacacatgc acacacatcc 114361 acacgcacac atacacaccc acatccacac acacacccac acatgcacac acacatcctc 114421 acacacacat acacacacac agatccacac acacacatcc acacacccac acacatccac 114481 acacacacat ccactataca catatacaca cacatccaga cacccacata cgaacacccc 114541 cacacacatc cacacatcca catccacaca catccacaca catccacaca aatatccaca 114601 cacacgcgca cacagtaaag gtcagtaagt gcacagtggt aaccattgtt cctgcagcag 114661 caccattagg taaataaaaa taaatatata tttttaaaat ttttaatttg tatttgtatt 114721 tttaaatttt ttgtattttt gtagtttttt ttttaaacaa agggatcttg atcttgatta 114781 cttaaaagca agactcatgt tctctgtctt ccttttattt tattattatt ttgggatagt 114841 ttctctctgt tgcccagact ggagtgaaat ggtataatct cagctcgctg tagcctccgc 114901 ctcccaggtt caaatgattc ttgtgcctca gcctcccaag tagctgggat tataggcgca 114961 cgccaccaca cgtggctaaa ttttttgtat ttttactaga gacgggtttt tgccctgttg 115021 gccaggctgg tctcaaactc ctggcctcaa gtgatctacc tacctccgcc tctcaaagtg 115081 ctaggattat agatgtgagc caccacacct ggcctctctg tcttcctttt aaaacacagg 115141 gaggaaagga ttgtttctga atgctcacgt atcattttag cttatttcca gacaaaaggg 115201 aagatatttc ttagccatga agtggaaatg ttctagaacc ttcctcagag atgtctgtga 115261 aaaggaatta tgcatgggaa gcatttagct cacggtcttg cacatgcaaa gggctggata 115321 aatgacagct gtgaatactg cagttcattt tatgtggcga gagggaaggt catataacgt 115381 cccagtgaga gtatgggctt tcaggagttc gaggctgcag tgagctatga cggcctcact 115441 gcattccacc ctggacgaca gagcaagacc ctttatctaa ataagtaact aaaaatttat 115501 aaagaagagg tgttttaaaa gcagaaagag ctgagtctga accctaactc cccatttact 115561 agctgggagg ccttgggcag gctgtttcat ttcttcaagc ctcagtctcc tcatctgcaa 115621 aatgggatga tcacagactc agcctcttgg tgttgtgaaa aatcagtgaa agaatagggg 115681 aggggcctag cctgcctaaa agatttgatg ttcctgctgc ttccagtgga gggtgggtgg 115741 gtgggtgagc tgacaggcag cgcagggaat tttccagcca ggttgcctct taacctctgg 115801 tttggtttgg ttttgcttgt aacctttcac tgtttgtcgg caaaagccca atttttaaaa 115861 aaactttttc attttaattt gatgttttta tattttttgt atttttaatt tttttctttt 115921 agcgacaggc tctcactgtg tctcccaggt tggagtgcaa tggtatgatc atagttcact 115981 gcagtcttga acccctgggc taaaccgatc cccccacctc atcctcctga gtagctggga 116041 caaaggcaca cacagccaca cacagctaat tttatttttt gtagagacag ggtctggcta 116101 cattgccctg tctggtcttg aactcctggg ctcaagcgat catcccgcct cagcctccca 116161 aagtgctgtg attacaggtg tgagccactg cacctggcct taatttgtgc tttttagaga 116221 cagtgcctct ctctgtcacc caagatggag tgtggtgatg cgatcatagc tcactgcagc 116281 cttgaactcc aggactcaag cagtcctccc acctcagcct cctgagtagc tgggaccaca 116341 ggtgtgcacc accacacact cagagcccaa ctttttaaat gagagtttgt cttgaaagac 116401 tacctgcaac tcttcaggaa tgctgtgctc tgtagataat gaacgttcaa cccatgtgac 116461 tctgatgttg ctagagaccc catcttgtta tttttcccca ttttagaaat gagggaactg 116521 aaattcagag agtccaagta acctgcccaa gctgacaggc ttaaaaaaaa ctcaggccta 116581 tctgagagca gagtgtgcgc tcgtgaaaga caccagcaca gggtacagcc tctgcattcc 116641 cttccctctt tcttccccgt gggtgataca gaaagaacaa aggttttgga gtcagattgg 116701 cctgaattca attccaactt cagggtcttg cttaaaacag gagatgacaa tatccacttc 116761 cctgggtggt ggtgagaatc aaatgaggct tttaaaaacg tgcataatgc ctttatctca 116821 gtgcctggca cacagtagac gatcaacaaa tattaggttt tcctctctcc tccacacact 116881 tctttccttg gccacatccc caaaacactc acatgaacct gagctcagac tcccggcgga 116941 ccaagacttc ccggagacgc tgaatttttg ccatctccag ctcaatctcc tctggcatca 117001 tggttgtctc cgccatggag atgatgtcca gctgatctgt gcaggagaca ggaaatacaa 117061 ccatcaggac cccaaggctg caccatggga cagaccctga catggttcca gggcacagcc 117121 ttgctctgcg accatcatac accttgagga cagacttcag cctgtgccgc aaatgtgtat 117181 tcgtgagcac accctttggg aaccagaaaa tgtgttttcc aaagtaattt acttcgtttg 117241 ctttgttttg ttttgttttt agacggagtc tggctctgtc gcccaggctg gagtgcggtg 117301 aagcgatctc ggctcactgc aagctccatc tcccaggttc acgccattct cctgcctcag 117361 cctcccgagt agctgggact acaggtgccc gacacctcgc ctggctaatt tgttgtatat 117421 tttttagtag agacggggtt taccatgtta gccaggatgg tctcgatctc ctgaccttgt 117481 gatctacccg cctcggcctc ccaaagtgct gggattacag gcgtgagcca ctgcacccgg 117541 ccttgttttt gttttttgag acacagtcct gctctgttgc ccaggctgca atgcagtggt 117601 gtgatctcaa tctcagctca ctgcaacctc tgcctctgag gttcaggcga ttctcccacc 117661 tcagccttat aagtagctgg gattacaagt gcacaccacc aggcccggct aatttttgta 117721 tttttagtag tgatggggtt tcaccatgtt ggccaggctg acgtcaagtg atccgcccgc 117781 ctcaacctcc caaagtgctg ggattacagt cgtgaggcca ccacgcctgg ctgggaactt 117841 tgttatgatg gctagtggag gctcacaagt gggctcacaa aagcctgatt aaggccttgt 117901 cagctcctgc ctggtcccag attctaagcc tctgctcccc ctgaccacga ccactttgcc 117961 ctgtggtcta agcctctaag accagccgac tctaggcaga agaaagtttt atcataatta 118021 tcttatagta atactactac caagtagtgt taagtatttt ttcatttgtt ttggagacta 118081 ggcctcactc tgtcatccaa gctggggtgc agtggtgcaa tcatagctca ctgccgcctc 118141 gaaccctggg ctcaagggac ccttccgcct cagtctccca agtagctgga actacaggcc 118201 cacctaggta actttttaat ttttattttt ttgtagagat gggggtgtca ctatgttgcc 118261 aaggttggtc tcaaactcct ggcctcaggc gatcctccca acttggcctc ctgaagtgct 118321 gggattacaa gcgtgagcat acatacggat atataaatat gtgtgtgtat atacatatat 118381 gaatatatgt ggttttttat atatataata tgtgtaatat acacagacac actttttttt 118441 cttttttgag acggagtctc gccctgttgc ccaggctgga gtgcagtggc acgatcttgg 118501 ctcaatgcaa ccttcacctc ccgggttcga gtgattctcc tgcgtcagtc tcccaagtag 118561 ctgggataac aagtttatgc cacgacaccc ggctaatttt tgtatttttc gttgagatgg 118621 ggtttcacca tgttggccag gctgatcttg aactcctgac ctcatgaccc acccgcctca 118681 gcctcccaaa gtgctaggat tataggcatg agccaccgcg cctgggcaga cacactttat 118741 aaagactgaa aaagtccact aaaatgttag cagttaaaat gttaatatat atgcgtgtga 118801 atgtatattt atactattat atgcatatac atatatccac tgtgatgtag gtcttatttt 118861 tctaatataa agttaggacc tcactatttc gttatgtgtt cttttcacct agcaacatgc 118921 cttgagcgct cctgcctgtc cttggatggt cttcctcggc agcagcatgg aaggcggggc 118981 ggttttctat ttgtggctgt atcacgaagc acttaagcaa tccacagaca tgtaggttgt 119041 ttcctatttt ttccccactg taatcatgag gaaaaaaatg taagaattat tttcaccgta 119101 agaatgttcc agtgaagaac gttctcaaag tgaaaacttc gcacacatct acagggttcc 119161 ttcgggctat gtttctaggg tgaatggctg agatataagt gatgtgctat tcaaggcctt 119221 tgatcggtag tgggggagac tcggggcagg agatgttttt ggcgtaccct gagacaccat 119281 cacatcatct gtatctgcta attccttcca cacccgcccc caagatggag cccaccccac 119341 acctcagttc tacaagggcg tctgtctttt tccttatact caaggaataa tgaaagttgg 119401 tataccttat acccatgatt tggaaagttg ttctcttttt cctttttctc tttttttttt 119461 ttgagacgga gtatccctct gttgcccagg ctggagtgca gtggcacaat atcggctcac 119521 tgcaacctcc gcctcccgga ttcaagcagt tctcctgcct cagcctcctg agtagcttgg 119581 gttacaggct cgtgccacca tgcccagcta atttttgtat ttttagtaga gttggggttt 119641 caccatgttg gccaggctgg tcttgaactc ctgacctcaa gtgatcagcc tgccttggcc 119701 tcccataatg ctgggattac aggtgtgagc caccacacca ggtcttcttt tttctttttg 119761 tagagatgag gtctcgctat gtggcccaga ctggtctgga acttctggcc tcaagcaatc 119821 ctcctgcctc agcctcccaa aatgctggga ttacttacag gtgtgagtca ctgtgcccag 119881 agaaagttgt cctgtttcta cctgtccaac tgccttctat gtcctggggc cctggccctt 119941 gcacacgtgc acagggttta gggcgctgtg atttgcaggc ttccctgaag tcagtcccta 120001 cctgcttgga tctctagata gaacagcgcc ctcttctagg agtccacctg atgcatttgt 120061 ccacctgtgg gactctcccg gccctgagtt gttcagcaga cgttggattc tcagtaaatt 120121 ggagttcctt tccattacct accccgtgat gtgtccgttc catgtgccaa aattatctgc 120181 ttcccctcca ccaatcaaaa caccttattt tgctaaagag acagaaagaa ggtaattctc 120241 tggagttgat tctcacattg aaatgaatgg cctctcttca ttctctctga gacaattccc 120301 cctttttttt tttttttttt tttttttttg agagcaagtc ttgctctgtt gcccaggctg 120361 gagcgtggtg atgcaatctc tgctcattgc aacctctgcc tcccaggctc cagcaattct 120421 cctgcctcag cctcccaagt atgtggatct acaggcgcat gtcaccacgc ctggctaatt 120481 ttttgtattt ttagtagaga cagggtttcg ccattttggc caggctggtt ttgaactcct 120541 gacctcaggt aatctgccag ccttggcttc ccaaagtgct gggattacag gcgcgagcca 120601 ccgtgcctgg cctgacaact cccagttttc aagagaaaca gtaacaaagg caatggaaaa 120661 ggccaagcaa ttatatgggg agaggaaaat atttgacctg tcagtcatca gcctattgaa 120721 gattttctcc tctttgagtt cagcatctca gagcaatgtg tgcttctttt atattttcag 120781 ggagcaacaa tacttcatga aatagataag tgcagaacat aaatagtgca gaatctgcta 120841 ttgggttgga gagggggtca ctgaacaact gtataagtgg gaggaagaga gtggaggtgg 120901 gtgaatggga aaaacatgaa attggagaga ttcacaatca cctcctcctt gccttctaaa 120961 gatggcgggt cccgaagatt atctggccac ctggccgagc ctctgccagg gaagggtgaa 121021 aagggagaca cagtggaggg agaggaggga gtgtcaaaga gaagcaggaa ggagagaggt 121081 cagaggagcc acctcaggct gtggcttggg gagagggagc gatgtctgtt tgcagaggat 121141 ttgtaggaaa tgtctcatgt gatgtgctgc ctaaagatct cctggggcca tctggttatg 121201 cttcaaggcc cctggaatcc aatttcgggt cctactactg tccctgacac catgctgtgg 121261 tggctcctgg gattgaagca gtgaggacag gcttgagaaa ggagggaagg agggaggaat 121321 aggatcgaaa tattaatatt ggctgggcac ggtggctcac gcctgtaatc ccagcacttt 121381 gagaggccga agcgggcgga tcacctgagg tcaggagttc gagaccagtg tgaccaacac 121441 ggtgaaatcc agtctctact aaaaatacaa aacttagcct ggcatgctgg caggtgcctg 121501 taatcccagc tgcttgggag actgaggcac gagaaccaca tgaacctggg aggcagaggc 121561 tgcagtgagc cgagatcata ccactacact acagcctgag ggacagagtg agactacgtc 121621 taaaaaaaaa aagataatta atactgagtg ctactgtgtg ctgtgatgct accaagctag 121681 aagaccagcc ttaccagatc agatggcctg ggtttgagtt ccagttctgc ccctccctgg 121741 ctgtgtggcc ttggggagtt ggttaacctc tcagcagctc tgtttctcct ctgtaaggga 121801 gcccaccacc caaaacagat gtgagggatg aattaccccg gcacgtcacg tttaatatat 121861 gcttgcaatt gccctcgtta tgaccgccgc acctgctctg cccatcacct cttttaatcc 121921 tcgccataag ccagcaagtt cagtaggttc tcattttcgc aaagaaacaa aggctcagcg 121981 ctggcaaggg aaagagctga gtccaggtct ggcttcaaat tcatgctttt tgaccctcac 122041 agctgcctct ggtccaggca gctgttctaa gtccgacacc ccccattcat tcctatctca 122101 gagtcaagcg tggtgccata acaattcatg gccagtgggg aggcttaaaa caaaaggaat 122161 ttaatctctt ggagttcggg aagccagaag tcggaaacca aggtgttggc agggtcacgc 122221 gctctccaga ggctctaggg atggatcttt tcttggccct tgcagccccc gctggctcct 122281 tggtttgtag tattaggttg gtgcaaaagt aattacggtt tttgccactg gatgtaatag 122341 catcacttca acctctgcct ctgttttcac aggttcttcc tccgggtggc tctgtgtgtc 122401 ttcttttctg tctcattaga tttagggccc accctaatcc aatgtgagct catctggatc 122461 ctgatctgaa ttacccctgc aaagacctta tttccaaata aggtcacatt ctgatgtttt 122521 aagatgcaca tgaacttcag gaggccacaa acccacaaga atgaatggcc acagccttgc 122581 aagcttcctg acaacaatgt cactctctcc aggggcttga gatcctgcac acagagcaca 122641 gccttggctg gaagcagcag gggtcccagc gggagtcacc agggggtgag ggaagtgcat 122701 gggggaagtg gggtctggca aaaaacagct aggcgggggc agtgtcagaa atcctcattt 122761 gacactagtc atcagccgca agacctcgaa tgagtccctt tatgtctttg cgtcactttc 122821 ctcctctgta caagaattca atagttacct gtgctggagg ctttctgtgt attatctcat 122881 ttaatctcta ttttactgag atgactctga ggcttagaga aggaaagtga cttgttcaac 122941 atcacacagc cagatcattg cagagatagt attcaaaccc agcttgtgct ctctctctcc 123001 ccccccaccc cccttccact ccacaaccac gtgaccttaa ccacgtaatc cagcctcctg 123061 gtgagggctg aatgcatttg tgcctgcgaa agtgcgtagt taaccacccc acgatatgag 123121 tatccttgtt tttctcatca cagttcagat gggcaacctg atttccatgg gacagatgct 123181 gaaaacggca acacaccctg gggcgaggcg tgggacgaag agaattaatc acatcccagg 123241 aactggcatt ttacctccat ccccggggga ggatggaggc ggtggtgcca ggctccaggt 123301 gcttagcttg gatttggggc aaagaatgga acaaagacct atgttgaaaa gtcttgctca 123361 aggatgtttt tcttccccaa gagaaaggaa cgaagcctaa agcttgaatc agatacaaat 123421 cgttagaatc ctttcttgcc atgagctgtc tctgcatcag gcactgtgtt gggctcattg 123481 taaacctcat cttttttttt tttttttttt tttggtgggg cagggtctcg ctctgtcttc 123541 caggctggat tgcagtggca tgatcttggc tcactgcagc ctcaacctcc cgggctcaag 123601 caatcctctc acttcagcct cccgagtagc tgaaattaca ggttcactcc accacaccca 123661 gtgacttttt ttgtactatt tgtagagatg gggttttgcc atgttgccca ggctggtctc 123721 ccaactcctg agctcaagtg atctgcccat ctcagcctcc caaagcgctg ggattacagg 123781 aatgaatctc atctcattta gtctgcacaa taccttaggc caggggttga tatcctatgg 123841 ccaggggacc aagcgcagcc ccatgcttga tgttgtaagt agagttgttc tggaacactg 123901 atcacgctca ttcatttaca tgtcatttat ggctgctttt gcattgcaaa aactgagtag 123961 ttgccacaga gactccctgg cccacgaagc ctcaaatatt cactacctgg ctctttatag 124021 aaaaagttgg ccaacccctt ccctgggtgg tatactgtaa tctctccagt ttaaaagtta 124081 ggaaagccag gtttatccag taacgataat aactaacatt tatcgagcat ttatcacatg 124141 ccagcacctg tgcattacct tatctgattc tcaaaacagc cctgggtggt cagtgctgtt 124201 atcccatttc atagatgggg aagtcgaggc tcaggaaagg gagtaagtga tggaggtgag 124261 atttaaaaca agatctctct tactccaaag ctcttgttgt aatcaaccat attcatgtaa 124321 gaactacctc cctacgtaag taccatcatc atcataacaa aactgagtag actgtattgc 124381 tttagacaca gagggagtag ttattcccag acatcagtgg gcaaggtgaa tatctaggga 124441 ggttgtttaa gaaacaattc cctgcctcca cctccaagga ttaggataaa cctgggttgg 124501 ggtgttgtgt ccagggacct taatgaagga ggtcttcaga tcagagtgaa aaatgctgat 124561 acaggacctg aataaatgag tatcacacac agtcagtttc tttccagtct gaacttgtgt 124621 gatagaaatt agcctgctag actgttcctt tcttgatcac attttccaac tagaaggcag 124681 gcatccgaca gctgccaagg aatgtctcca cgaggaggac aaatcacggg aagaaacggg 124741 aggagcattt caagctggct agagacaaca agtctctcac gcacacttcc aagcaagagc 124801 ttgcttgctg tgtggagaca gcaaagcatg aaggcacaga cgtccccagg gctccaactg 124861 ccactgctgt cattgagctt ccaggagctg agtcacttct gggggaagag gcagctaaat 124921 ccatcttccc atccccaaat gccaccccag ggagatgcat gcttcctgag cggtggcacc 124981 agcaaacaag agctgagcac ccgggcagcc ctggtatgca ccagcaggtt taatcactga 125041 tctcaaaaca aaacaaaaag taaggctacc cagggacagg agtggtggag gtgatagagg 125101 atgagggctg gtgatctaaa aacttctagt ttaaatacaa gcttggaggg ttaaactgct 125161 ggggaccaga gagatacctt tcctagccca tcttttccaa tcaggggtca tggtaggaat 125221 ttaagctcag ggtgccctgc acaacctctg cctttacttc tgctttataa ctttacgtgc 125281 ctagtactgg ccccagagct tgctcatttt gggtgtttac cggagccatg atctagctca 125341 tatcaacctc tctttctaga ttaataaaat gaggcttaga gaggtcataa gtgattggtc 125401 aaactcactc actcaattgt tgcagcaaca tcaagggttc tgtctagatc ctgctactcg 125461 ctgcacagaa agccaatcac tgaggcaatg agtattgcca gagaaggctt taattgggtg 125521 ctgcagctga ggagatggaa agagactcaa atccatctcc tcaacccact aaaattagag 125581 gtagcaagga agaaacgcaa ccatgtgtta cattgtgata catccctatt cttttttttt 125641 ttttctggga aattattagg aaggggtaag gaagagaatc tggtcaacag gatgcaggtg 125701 gttggttagg caatcgtgat ggttgaggag tctggcatct cattgtccac atatagtgat 125761 ctggtaaatt tcaattcctt aatactatct ggaaggccga gaggtcagtt tcttgagaaa 125821 ggaactcagg taaaatgaat gtaactttct aaaattttaa gattaggagg gggctgggca 125881 cagtggctca cgcctgtaat cccagcactt tgggaggctg aggcaggtgg atcacaaggt 125941 cagatagaga ccatcttggc taacacggtg aaaccctgtc tctactaaaa atacaaaaaa 126001 ttagccgggc gtggtggcgg gcacctgtag tcccagctac ttgggaggct gaggcaggag 126061 aatggtgtga acctgggagg cggagcttgt agtgagcaga gatcgcaccg cggcactcca 126121 gcctgggtga cagagcaaga ctccgtctca aaaaaaaaag aaaaaagaaa aacaaaagat 126181 taggaaggta aatttttgtg tttattcaaa caaaacaaaa caccgtaaac atcagttcta 126241 aggggcaatt ggggcagtgc tataatgagt ggtagaaatg aaactcaaac ccagatctat 126301 ctgatcccaa tgcccatgct ctaaggtcaa aagagctaca aaacagtgtt tccaaaactc 126361 tcaacactgc atagaattta ccccagctac atattctgta ctctgtcata taccactcag 126421 tatgtcatga cattcacttt tttttttttt tttttgagac ggcgtctcgc tctgtcaccc 126481 aggctggagt gcagtggcgt gatcttggct cactgcaagc tctgcctcct gaattcacgc 126541 cattctcctg cctcagcctc ccaagtagct gggactacag gcgcccacca ccacaccagc 126601 ctaatttttt gtacttttag tagagacggg tttcaccacg ttagccaggg tggtctcgat 126661 ctcctgacct catgacccac ccacctcggc ctcccaaagt gctgggatta caggcgtgag 126721 ccactgcgct tggccgacat tgactttttt tactcaaact ccctaacgga aatgtcccat 126781 gacatccaca gacaataaat gactcttact tggcataatt agaaaatagc catgaaaata 126841 aacctgagga aacaaagcaa tactgttaaa ttcttgctaa atactatagc ctgcttaagg 126901 ttcagaatga agcttgaatt ttctttctta acaaggtgaa tcagcaagta ctaggggcat 126961 taaagacaaa gctagcacca aattgtggct ttctccttta ggtaatcaaa aaggttgata 127021 attggggaga aaatgcgtag ctttcctgaa gggaaattca ttgctgtttg atggtacaag 127081 catgaaagta ccactggggg tacctatccc atcccctgtt gtagaataga attccagtcc 127141 tttgcgttgg aaaggtctta aacgacgttt tggtcagcct catacctgtg ttggatagga 127201 tgtaaaatga cttggcaaga tgtggttccc agggtccctc aacaaacggt tctcattcaa 127261 acttcagtga gaactcaagc tagaaggaca attctcagcc atgggttaag tcctactcaa 127321 agaggaatca gccattttga gcgattattt caacccagct catataacaa aggcaattaa 127381 atgaagggct gggggtgggg gtgctattca tttcagtcat tggaatatac ttgggtcaca 127441 gccgctgcat ttagggaagg acatgaggga ctcaggtagg gggagtcagt gttgggggag 127501 aatcaacttt cacctaatat gtccttttct cttgtttggg tttttaaata tacttttttt 127561 ttttaatact tttatatttc tagaaaatgt ggctggatgt ggtggctcat gtctgtaatc 127621 ccggagcttt ggaaggctga gggaagagga tcacttgaag ccatgagttt aagaccagcc 127681 tgggcaacag agcaagaccc catctctaca aaaataaaat gtaaacatta gccagatgtg 127741 gtggcacaca gctgtggtcc cagctacttg ggaggctgag gtgggaggat cgcttgggtc 127801 caggagtttg agggcttcag tgatctatga tggtgccact gcactccagc ctaggtgaca 127861 gagtgagaca ctatctttaa taaaaaaaaa aaaaaaattt aagaagagtt gcaaagagca 127921 taaagaattc ccctctactc ctcattaaat ttccactatt gttaacatta ccaagaagtt 127981 gacattggtg cattgttatt aactatattc attgaaattt tagcaattta ccaatggtca 128041 gatgttacat ttgcaaaaca gatttaaaat taagaagaga gaaaagaaaa ggggttggcg 128101 tgtgcaggca ccatggggag ccaggccagc ccttgaagag ggcaagagag gggagtggtt 128161 ctgggcatgg gcttttgggt ttcagagtcc agctctcacg cttgagccat gcatcagtga 128221 ctctgggcac ctcagtttct tcatctgcaa cactggggaa caagcatacc catctcacag 128281 gtgattcatg taagtgacaa gcatacagaa agttctctgt aacatatgtt tgctcttata 128341 tacatttaat aactatgcat tgagaacctg cttcgtggca gaccctgctc taagcaatgg 128401 ggaaggagca atgagcaaaa cagacaaaaa tctctgccct tgtggacctg agcttctagc 128461 aagaggctta cactgttgtg gagaacagtt gctctaataa tctgcagggg tcattggctt 128521 aagacagacg tcgcttcaag ccctggctct atcactaatt gtgcgacctg ggcaaattac 128581 ttgatctccc taagtctcag tttcctcttc tgtaaaatgg gactaataac tcctgcctca 128641 tgaagttgct gctgtaggga tgaaatggga tgatatactt gtccctcagt atccgtgtgg 128701 gattggttcc aggcctccct ttggacacca aaatccttgg atgttggccg ggcgcggtgg 128761 ctcatgcctg taatcccagc actttgggag gccgaggtgg atggatcacg aggtcaggag 128821 atcgagacca tcctggataa cacggtgaaa cgccgtctct actacaaata caaatacaaa 128881 aaaaaaaaaa aaaatagccg ggcgtggtgg caggcgcctg tagtcccagc tactcaggag 128941 gctgaggcag aagaatggcg tgaacccggg aggcggagtt tgcagtgagc cgagatcgcg 129001 ccactgcact ccagcctggg caacagagcg agactccgtc tcaaaaaaaa aaaaatcctt 129061 ggatgttcaa gacccggata taacagggtg aaatatttgc atagaagccc cacacatcct 129121 cctgtagact ttaagtcatc tctagattac ttataatccc taataggatg cctacacatc 129181 acttcattta cacagattca gcataatact caaagttttg cttcttggta atttgtggat 129241 ttttttcctg aatattttcc atccataatt ggttgaatct aaggatgcgg aacccacaga 129301 tacagaggac caactgtata caacatgtgc ttggtgtagt gcctagacat cacaaagagt 129361 ggtcagtagc attattcctg ccatcacagt gcctttccct gaaagaagcc agactaacac 129421 tggggagaaa ataaaattgg agccaccatc aaatttgaag aacacagact catggtcatg 129481 gggaccatct cttggcccct ctaggtgaga ccacagcccc gcagctccaa cagacccatg 129541 cctgccctga gaagagaatc tagagggggc cccctccagc cttttgtcaa gaagccagtt 129601 gctgctattg acccaaaaga gagtcttgaa tcccaaggga atagatctgc gccccatgac 129661 cagggagcca atatctgaat gcatttcaaa ggctcctggt ggcactctag acttcgttgt 129721 catcaatcat tcatgtagga aagaagctca ttccaacccc caagcatcct attacgggat 129781 gcaccaagtg cctcgccttc ttgctataac gtgaatgtaa tgggaagccg cccgttgctt 129841 ttaatgagtc accgtttccc aaagagatga agtcactgct tttaacccag aagctcctgg 129901 ctgctcgatt tctcccagcc tgcctcgtgg agaggatgcc actgggcccc caagagcgat 129961 catacctttc tttgcactcc gccgtcccat acatcccgtc tctgtgaggt cgccccacat 130021 ggaagaattt gcggctgagg aaactgacag gcttcagatc ctggctatga ttgttattct 130081 ccagtggcac ctcaagcaca tttttttaac ttctgcgagt ttctgtttcc ccagccacat 130141 ctgacgtgtc aaaggacaca aaagtaattc aggaggacgc accaacagca cgtgaagacg 130201 caggcattgg cgtcaagcag acatgagtag atggtccggc atctttctaa cggccgacct 130261 cggcaaatgc cctaacctct ttgagactca gtttccttat ctgtaaatcg ggggtcataa 130321 aagagagaat gtatagatca gtgcggtcca attgaaatag aacactagcc acatatgtaa 130381 ttttgaacat tccaggaacc acattctaaa aaataaaaaa ggctgggtgt ggaggcacat 130441 gcctgtaatc ccagcacttt gagaggccaa gggtggtggg catatcactt aaggtcagga 130501 gctcaagacc agcctggcca acatggtgaa accccatctc tactaaaaat ataaaaatta 130561 gccgtggcag agcacggtgg ctcactcacg cctgtaatcc tagcattttg ggagtccgag 130621 gcggtggatc acctgaggtc aggagttcaa gaccagcctg gccaacatgg cgaaagcccg 130681 tcttcctaaa aatacaaaaa ttagccaggt gtggtggtgt gtgcctgtaa tcccagctat 130741 tcaggacgct gaggcaggag aattgcttga actggggagg cggaggctgc agtgagccga 130801 gatcgtgcca ctgcactcca gcctgggcaa cagagggaga ctctgtctca acaagaacaa 130861 caagaacaac aaacaaaaaa aattaactgg gcatggtggc aggtgactat aatcgtagct 130921 actcagaagg ttgaggcagg agaatcgctt gaacctggga ggcggaggtt gcagtgagcc 130981 aagatcgagt cactgcactc cagcctgggt gacagaacgg actccatctc aaaaaaaaaa 131041 aaaaaaaaaa aaagtaaaaa gagctgggtg cggaggcaca tgcctgtagt ctcagctact 131101 ctggaggctg aggcaggggg atcacttgag cccaggagtt taagtccaac ctaggcaaca 131161 tagcaacgag accctgtctc ttaaaaataa aaaaaaagat aaaaagaaat gggcgagatt 131221 aatttcacta acatatttta tttaatccaa tagatctaaa ggattatcag ttcaaaatgt 131281 aatcaatata ttaaagatac gaatgagatg ttttatatcc tttttggagg gagtatggaa 131341 gtcttaagtc cagtatgtat attttaccct ttaaatagat ctcaattcag gccaggcacg 131401 gtggctcacg cctgtaatcc cagcactttg ggaggccgag gcaggtggat cacttgaggt 131461 caggagatcg acaccagcct ggacaacatg gtgaaacccc atctctacta aaaatacaga 131521 aaatagccac gtctggtggc acgtgcctgt catcccagct acttgggagg ctgaggcaca 131581 aggatcacgt gaacctgggg gatggaggtt gcagtgagcc cagactgcgc cactgcacta 131641 cagtttgggt gacagagtga gactccgtct caaaactaaa taaaggaaga acagatctca 131701 atttggaatg gccacatttc aagtgctccc aagtcccaag tcattgttca ccatcctggg 131761 accctccatt catctccttc tttcttttct ccctgatacc ttccccttcc tcaaagattc 131821 tgtctccctg aggtgtttct ccatttcagg actcttctgc ttccctctta actgctctca 131881 tgaaaggact tctgatggac cctctcgtcc cacctccagc ccctggcagc ctatgggtgg 131941 ccctgccttt ggtcagggcc tcagcacgtt caacctgggt ggagccacag gtcacacgag 132001 accctcccct tcagcaggga ctgtgaggaa acccttagaa ggagtgtgtg cggtgcaggc 132061 cccacaggtc ggctgccccc taaatcctaa taggactggc tcagctgtga cccagaatcc 132121 cctgcagaat gaatgcctga cttcagctct ctcaggcagt gtagtctgtt gggagccagg 132181 aaggtgggga gaaggacaca ggtatctctg tggaggggac tccacagcag catctgagca 132241 aaggcaccag aaaactggag atgggcagtg ataagttgca aagagccaac agaattcccc 132301 tctacccccc tcactcaatt tccactattt gttaacttta ccaagaaact gacattggca 132361 cattgctatt aactatattg attgaatata ttgattgtgt gataaggggc agctgggatt 132421 ttccagtacc ctcatccctt tgtttcctta ctccctaccc cgactcctta tctggagctg 132481 gaaggtcttt gatgagctga agaaagacac gggttattaa ataataatgc tgcgacatct 132541 tcccgcaaga ggcggtcagt gttggcaaga gaaaagctcg tcttccaaat gctctgaaca 132601 acctgaatga tgagctggaa gaaatgtcca gggcctcaga aatcaaacac catccctcta 132661 ggaaggaact gggaggaagg aaagggaggc agagagaagg tgtccacaga tgcccttccc 132721 aggtggactc aaggagaagc tgtggtcctt cccctctcgt gaaccacaag attatgcaga 132781 actggctgat tgtaggaatt tctttctgga aacatgctct tcctctgtga cctcatagaa 132841 gtccaagatc taagaggtcc aagtacaaag aaatgacaaa gggagcagag gagagcattt 132901 gatgctgttg actttttttt tttgtttttt atttttattt ttttagatgg agtctcattc 132961 tgtctcccag gctggagtgc agtggtgtga tctcggctca ctgcaacctc cacctcccgg 133021 gttcaagtga ttctcctgcc tcagcctccc aagtagctgg gattacaggt gcctgccact 133081 tcgcctgact aacttttgta tttttggtag agacagagtt tcaccatgtt ggccaggctg 133141 ggcccttgac ttttgcagtg aaataggaaa taagggatga aggtgaagag gtgagattga 133201 ggagtagagg agaaactaag atagcccatt tggggaggaa tcctgggagg acccaacact 133261 tgagacaaag agtttctttc acaagaaaga caaattccag cctaatcctc tggaagtgta 133321 atctgagaac atctgaagac tcattctgag tgcaggagcc aaggctagag caagagcttc 133381 atctcactgc tgctccatag cctgtgagtg ctttgacatt tggagttgga cgcttctttg 133441 tgctgggggc tgtgctaaac actgtaggct gtgcagcagc atccctggcc tctgcccact 133501 ggctgccagt agcatcctct ccccaaccca aaatgtcctc agacactggc aagtatcccc 133561 tgggggacaa aatcaccctt tgttgaaaac cacgggttta gactcagagt ctttctcctt 133621 ctttcctttc tctcttttcc ttcctttttc tttctttcct cctttcttcc ttccttagtt 133681 ccatcttccc ttccttcctt cctctttctt cctttctctt tcttttttcc ttccttcctt 133741 tctcccttcc cttcccttcc catttccttt tgcccctttc ctttcttttt tcctttcttt 133801 cctttcctct ttttatctga aacaaggtct cactctatta ccaagtctgg agttcagtgg 133861 tgtgatcata cctcactgca tcctcaagct tctgggctca agcaatcctc ccacttcagc 133921 ctcctgaata ggtgggacta caggtgtgca ccatcacgtc cttttaattt tgtagagacg 133981 gagtctcacc atgtcaccca ggctgatctt aaaccggcct caagcgatcc tcccatctca 134041 gacccccaag ctgatgggat tacaagcatg agccactgca cacagcctaa acccagagtt 134101 tttttattgt gctcccaggg cacaggtatg agacttacct aggtgtgatt aaaaaaaaaa 134161 aatccatcta cctgggcttc tcctggacct actgaatcaa aatctccctg gaggagaccc 134221 agcaagctgc ccttcaagca ggtgattctt cattaaacac tctatgcctc agtttcctga 134281 aatgggaagg ataatgataa catctatctc aagggttggt atggaggtta agtgagggga 134341 ttcacacatt gagcacagtg tgctgaatgt tagctattat cactcacatt aaaggtggag 134401 aagcactggt ttggagagat atgcattgag tgctcttgaa gagagggaag aacaggaacc 134461 cgggggcagg agagtgaacg ctgctgtgat aaccactctg ctcaaatgcc agccaagtac 134521 tgtggctaat gatgaggggg cacaatgggg attatcccgc catcctaggg ctttcctccc 134581 tgcagatcat cattgttcct gttgacttca tgctccagct gcctccccag ctaaaacaaa 134641 tactaacaac tgatgataca aggcaagagg ggtgacagtg gaaaaaaatg ccaatctgtg 134701 tctcagttac aaataaacac aaacttagtg gcttaaaaca acacacattt attatagttc 134761 tggaggacag aggcccaacg gacgtgaaaa tcagggtgtt gaaaccaaca caagtagtcc 134821 catagacagt ttttttcttt cttgataaac atagaaattg acctttctgg tcttaactct 134881 tgaaacttaa agtttgtttt atctgagttc cttcctcagg agggaattcc ttgcctctca 134941 aataagtatc aaagaactga aactcaccag attacagcat ccagacaatg agatgccaga 135001 cccctcattc atcgggattg cttccttgcc cgtcccaagt tcctgttttg ttacacattg 135061 ttacatctct tccctgctgt ataaaccctt gattttagtt ggtcagggag atggatgtga 135121 gactgagtta ctggctcctt ggctgcaaca cctgagtaaa gcctccttcc ttagcaataa 135181 ttgttgtctc agtgtcggcc ttctgtgcag tgagcagcag gacctagacg aatcccctgg 135241 tgttttggta acagtgtcag caggattgcc tgtccttctt ggggctctaa ggaagaatga 135301 gtttccttgc cttttctagc ttctacattc cttggccacc tacattcctt ggcttatggc 135361 cccttgcttc atgttcaaag ccagcaatgg tgaatggtga aaggctggct ctgcccctgc 135421 ctggctctgc ccctgcctgg ctttgtgccc ctggacaagt tatttaaatt ctctgagttt 135481 ccactttaaa aaatttccct tgtgggcctg gcacagtggc tcatgcctgt aatcccagca 135541 ctttgggagg ccgaggcggg cagatcacga ggtcaggaga tcaagaccat cctggctaac 135601 atggtgaaac cccgtctcta ctaaaaataa aaaaaattag ccgggcgtgg tggcgggcgc 135661 ctgtagtccc agctactcgg gaggctgagg caggagaatg gcgtgaaccc gggaggcgga 135721 gcttgtagtg agccaagatc gcaccactgc actccaggct gggtgacaga gcaagactgt 135781 ctcaaaaaaa aaaaaaaatt tcccttgtaa agtgggtcta ttaatcctta gcttgtaata 135841 tttgttcatt tcataatagc tagttgtgtg gctggtaagt gagttatgta actttcccaa 135901 tcctcagtct tcattacctg ttaactgggg ataataatag agggtaccct tggatgcagt 135961 gaggattgaa tgacagatcg tatataaggc tcttaatgtg gtgcctggta catcataagg 136021 acagtgagga cttggtgaat gtcagttatc acaatcactg tctctatcct caccataatc 136081 actatcacta tcataatcat caccactatc accattatct ttaccatcac catcaccact 136141 atcactatca ccaccatcac tgtcattatc accatcacta gcacatcact gtcatcgtca 136201 ccattatcac tatcaccatt atcaccatca ttactatccc caccatcacc atcaccatca 136261 tcacgatcac tgtcatcatc accatcacca ccatcattac cacccctttc atcatcacca 136321 tcaccactat caccaccacc accaccatca acatcactat catcatcatt gccaccacca 136381 ccatcaccac catcaacaaa agaaaaagca gccaaactca gaattccaca tcagctggaa 136441 ctagggttgg ggaagcacag aggacattag cccatgggaa ttctttgccc caacttcacc 136501 tccctcccag aactcccttg gctgggaata tcccacctca tacaccatcc cattctgtgc 136561 cctcttggaa gatgagaagc accaaaggtc ttgtgtccca cctgcaatgt ccacaaatac 136621 taatattagg cttagagaag agaaatgaga agagtgatat taagataata aaagtaggcc 136681 aggcgcggtg gctcacgcct gtaatcccag cacttcggga ggccgaggtg ggcagatcac 136741 gaggtcagga gattgaaacc atcctggcta acatggtgaa accccacctc tactaaaaaa 136801 tacaaaaaat tagccaggcg tggtggcaga tgcctgtagt cccagctact tgggaggctg 136861 aggcaggaga atggtgtgaa cccgggaggc ggagcttgta gtgagccaag atcatgccac 136921 tgcactccag cctgggtgac agagcgagac tctgtctcaa aaaacaaaaa acaaaaaact 136981 tagctgggca tggtggtgca cgcctacagt cccagctact tgggaggctg aggcaggaga 137041 atcgcttgaa cccaggaggt ggagatcacg ccactgcact ccagcctggc aacaggcaac 137101 agagcgaaac tccgtctcaa acaacaacaa taataataat aataataaaa gtaaattagg 137161 ccgggcgtga tggctcacac ctgtaatccc agcactttgg aaggccgagg taggcggatc 137221 acttgaggtc aggagttcga gaccagcttg gccaacgtgg tgaaacccca cctttactaa 137281 aaatacaaaa attagccagg ggtggtggcg catgcctgta atcccagcta ctctggaggc 137341 tgaggcacaa gaatcactta aacccgggag gcagagattg cagtgagctg agatcgagcc 137401 actgcactcc agcctgggca acagagcaag actccttctc aaaaaaaaaa aaaaaaaaaa 137461 aaaagataaa aaaaaggaaa tttaccaaca tttcttgagc cttttaaaac ttttttaagg 137521 ctccaataag atcattaagt cctttctgga ataaggagag gtagtaataa acaaattctt 137581 tgttgctttc aaatcagttt atgcgttact caagaaaagc agttaaaata actttacatt 137641 aaatttcatt tgatcaacca ctttattcaa acaccttggc ccagaatgac tttggttatt 137701 ccaaaaagca aactccgtct aaggcaaagt tttatcacat taaggatatt cacaagactt 137761 tgccacagat tgggactgca gttcccaggt gggacttcca gaaagattgc atttgaatta 137821 cgtgtgtgtt atttccaggg tgagactttg aaagagatga cacaagagtt ccggcaaatt 137881 atttaacaaa ttgtaatgtc attttacatt actcagtgtc ctctccccaa cacttcaaag 137941 tgaccttcaa tgaatcacca aaccatgctg tcatatttct tccaatgaca agcagaaggg 138001 ctccgattgc tttctaaggc cctttgggac catatgagag aataaagtat atacttccta 138061 attcaaagca aatcattagg gctgtggaat taaaattgtg gaaattgata aacctataat 138121 aacaacaatt gcccggttaa aattgcataa acaacaaatc tttttcggca cttctgtagg 138181 ttcattctgt tactcaaggt ccatttgttc tgaaatatcc cccatccctg caatctcaga 138241 atgagttcat gaaaacagac aattataaat ctatcctgag ctatgattca tgcatcttaa 138301 ccgactccct aattcttcca atttacaaac caattcagaa gtcagtcaag ggcccaggtc 138361 acgcaaggat ggttagagga gcagatggag ctgcttaaaa ccatgtgctt ggcggggtta 138421 aaatgatgag aatccagagg aaaataaatt gggcctgtct aaagtaatat ttcagttttt 138481 aaaaactgct tagcctgaaa atcatgtgcc tcatttaatt ccctgcaggc tggaaatttg 138541 acaggagaga taaattgcac caggaacatt attttcattc agaaaggtaa gcccaagaaa 138601 tagggtggga tatatcctca tctgaatgag ggtggtctgc ggagggggga cgcagggaag 138661 agcagccagg ggaaaaacag gatgtgtgca ggcgtgaacg gctccagttc aatgtgaagt 138721 cagagggaga gtcagatcat ggcagaagaa ggaaggaagc aaatctggaa gcattcatgt 138781 cggactgaaa atccacacca gaagggagat gtggctctaa acagggatct caagcacaaa 138841 ggcaaagcat cagtactgga aatccttagg ggatttaaaa aaagattaaa atgcatactc 138901 tgttcagatt tccttgtttt tccgccaatg ttctttttcc attccaggat cccatcgagg 138961 acagtgcatg acattcagtc cttgtgtctt cttagactct tctacactgt gacagtttct 139021 caggtgcagt agcatgatct cagctcactg caacctccgc ctcctgggtt caagcgattc 139081 tcctgcctca gcctcccgag tagggggcag tacaggcatg caccaccacg cctggctaat 139141 tttgtatttt tagtggagac ggggtttctc catgttggtc aggctggtct tgaactcctg 139201 aactcaggtg atctgcccac ctcagcctcc caaagtgctg ggattacagg catgagccca 139261 gcctgccctt gacagtttta aggggtactg cacaggtatt ttgtagtgga atacccttta 139321 attttgagtt tttctgatgg tgagacaagt tgtgagtttg gagagaagac tgcagaggta 139381 agtcaccatt ctcatcactt tcttcttttt tttctttttt tttgagatgg aatctcatat 139441 tgtcactcag gctggagtgc agtggcatga tctcagctca ctgcaacctc cgcctcctgg 139501 gttcaagcga ttctcctgcc ttagcctccc gagtagctgg gactacaggt gtgtgccacc 139561 atgcctggct agtttttgta tttttagtag agacaaggtt tcgccatgtt ggtgaagctg 139621 gtctcaaacc cctgacctca ggtaatccac ccgcctcagc ctcccaaagt gctgggaata 139681 caggtgtgag ccatggtgcc cggccaccat tctcatcact tcacatccgt ggtacatccc 139741 atcaagatga gttggcactg ttgatgctga ccttgagccc ctggctgagg gagtgtctgc 139801 caggtttctc cattgtaaag tttctccttt cacttcctct ttggaaggaa gtcactatac 139861 acagcccaca cttatagggt agggagttat gagtgcttcc ttgagagtgg agtatggtgt 139921 tatttttata attaaagaaa aagatgcctg caatggaaga tgaagaggat cacctgctgg 139981 tggagcaaaa atccagaacc tcgtgtgtgt ctcagctaga actggtggtt tgggaagggt 140041 cgactgccca gcagtgcagc gggagatgca aaacctgagg gttcaagtcc aaaggttgac 140101 ctggctttaa aagccaatga tgtcaaaaaa acaaaaccaa aaacactgag cacaaaagct 140161 aggcaatggt gcagccagtt gggatacttt tcataggcac tgatttttac tctctctgga 140221 attcctttca aagacacagg gtctgctggt ggatggtcgc tcctctgcag ggactggggc 140281 acatggcatg gcctcctatg tcatttgggg gctctttgtt tattcaacat gcccgccaca 140341 caccaggcaa tattctagga gctaggcatg cagaaatgaa acatccatcc tcctgatgaa 140401 gacagacagc aagccaaaag ataaacaaac aacatcattt aggtggacag ggctaggaag 140461 agagtaaaat gccattacgg gggtagacag tgattgatcc cggggactcc tccagatagc 140521 ggggtcacag aagacctcta tgagaaagtg gcatttgagc tgccactgaa tgaccagcag 140581 aagctggcct gggaagacgg agggggaaga atgcttcaga tagaggatcc acgagtgcag 140641 aagctgtgag aggtgacaac acagtgagat cctgtctcta aaagaaagag aaaaggcaga 140701 tgtgggtgga gcaaaatgag gaagggaagg ggtgttccaa gatggagaga caggaagtgg 140761 ccccatgatg cccggcctat aggccacact gaaaagtttg gattttagtc ctagtggtca 140821 acctattgat ggtatttcaa ttttaatttt aaaaaataca atcaggctgg gcgtggtggc 140881 aggtaatccc agctacttgg gaggctgagg cagtgagcca agatctcacc attgcactcc 140941 agactgggca acaagaggga aactctgtct gaaaaaaaga aagaaagaaa aaaaaaagtc 141001 actgtgtttc ccatgagagt gtgatctact aagggagcaa gtatagaaat aggaagacct 141061 gctactgcct caaagaattg ttaaaggatc gaacgcctac taatgtgtgc aaagcctggg 141121 acatagcagg aatgcagtaa atgcaataaa agaaaatagc tgcagtaagt gctatcagta 141181 aatgcactac tggactgggc gtggtggctc acgcctgtaa tcccaacact ttgggaggcc 141241 gaggcgggaa gattacttga gctcaggagt tcgagaccag cctgggcaat atatcaagac 141301 ctcatctcta caaaaaatac aaaaaccaaa atagccgggc atggtgggtg tgcctctggt 141361 cccagctact cgggaggctg aggtgggagg attgctcaag cccagaaggc ggtggaggtt 141421 acagtgagct gagatcatgc cactgcacca cagcctgggt gacagagcaa gaccctgtct 141481 caaataaata aataaaaata aaaaataaga ataaaaatca ataaatacac tactgcactt 141541 atgcagatga acttcacttc ttccgactta gccacacaca actttccagc atcacagatc 141601 tttgcagagg gtgggacagc cgggaaggtg ctgttcgcac agaattggct gaatgaccag 141661 gtgtttgatg gatgaatccc aggtagccag ggcgtctctg ggcacacgca cccaggatca 141721 cacctggaca agaaagtggg gtctaaagta gcagtccgag gctgggtgca gtggctcatg 141781 cctgtaatcc caaaactatg ggaggttgag gcaggagggt tgcttgagac caagagttga 141841 agatcagcct gggcaacata gcaagacccc atctctacaa aattttgaaa aattagctag 141901 gtgtggtggc atatgccagt ggtcccagct actcagaagg ttgaggtagg aggatcactt 141961 gagcctagga gttaaaggct gcagtgggcc atgatcacgc cactgcactc tattttgagt 142021 gacagagatc ctgtctccaa aaaatacata tgccagtgca ggcctggtat gtgtcctgag 142081 agttccctgc ctggcatctc ttgaagtggg caccagtaac cagaagctga agtcctggtt 142141 tggcaaagga tggaggaaaa agatattgga tctggagggc tgggtgcagg tggcttacac 142201 ctgtaatccc agcacttggg gaggccgagg tgggcagatc atgtggtcag gagatcgaga 142261 ggatcctggc tcatacggtg aaactccgtc tctacaaaaa atacaaaaaa ttagccggat 142321 gagggcgggc gcctgtagtc ccagctactc ggaaggctga ggcaggagaa tggcgtgaac 142381 ccgggaggcg tagcttgcag ccagccgaga tcgagccact gcactgcagc ctgggcgaca 142441 gagcaagact ccatctcaaa aaaaaagata ttggatctgg aatggatcgc tgaaaataaa 142501 agaaaactgg ggtgaaaaag aagtgtacat gggataaact ggaggtaatg agttatgttt 142561 gggggaagat agaaaaagtt agactctgtg gcgtgggagg agcagagtaa atgtcaacag 142621 tcccctcgag cctggaaaac tccaccatga gtcatggagg acccacgtca gaaaatagct 142681 attttgagcc cagctagaaa gtttggctct gtctgctttt cctgagagtg gtgagtggtt 142741 gacccaggct gtaagctctt caggtcattg cttccccaaa caaaatttca atcaagcgag 142801 tgccatgacc ctgcacagaa gtccagaact aaacctcgat gtgggactgg gcttgccaaa 142861 ccagaaggtc tgttctaaga gaaagaagct ggaacctgcg aggggaattt tgccaatggg 142921 aggaacagca atttctccaa aaagaggggg taggaccagc aatttcccgt tcctggaagg 142981 tcccctatac attttgttca gagcatttag attctaaatg tcacagtgtg atgtcctcat 143041 gtgcaatgga catttgttga tttgcctccc agtatccatt cacctttctt aggtaatagg 143101 acctgtttcc tttggggaac caactggcac caccgctccc aggacttgca aatttaatgg 143161 ggcttaccct acccaggtgt ggcctaaggt gctcactgtg ttcccttccc gttcagattc 143221 agatagatat aggatcagat acacaccaga agtagtcaga cccagaactt tgctgtgaac 143281 ccccctgggg gaaaagaaac cctgaatcct aagaacaaag ccctcatgga ggggataaaa 143341 gctgagggat aaacagatag actttggact cctgcctcaa ccacccctag agtgtagaaa 143401 ccccggttag tacattcctt tttttttttt tttttttttg catatgcagt ttgggttgag 143461 ttttctgttg cacaaaacca agaaagtcac acaaatacta tatatctttg tttttttttt 143521 ttttgagacg gagtctcgct ctgttgccca ggctggagtg cagtggcgcg atctcggttc 143581 acgccattct cctgcctcag cctccggggt acctgggacc acagacaccc accaccacgc 143641 ccagctaatt tttgtatttt tttttttttt ttagtagaga tggggtttca ctgtgttagc 143701 caggacggtc ttgctctcct gacctcgtga tccacccgcc tcggcctccc aaagtgctgg 143761 gattacagat gtgggccacc gcgcccagcc ccacaaatac tatgtatctt ttaagagaaa 143821 cgcatgatga gaagcagcag ggggtcaaca tgcatctggg gtgttagaac ccaagaggca 143881 gtagacagtc atcttgtacc tcccctgcct gctgtcagaa tgctaatgca ctgtctccta 143941 aagcagggta tgaaaaatat ggccccaagg ccatctccag cccatcactt ctttttgtgc 144001 agcctgtgag acaagaatgg cttttacatt tgtgaatatt tggacggggg gcaaggggtc 144061 agcgtggaat caaatgaaaa ctgcaaaaaa ttcagatttc agcatctata aataaagctt 144121 tattagaaca catccatgcc catttgttta ggtactgttg atgactgctt ttgtgttaca 144181 gcagggctga gtagctggga cagagaccat actgtcagaa gcgtttgaac cagaacaaca 144241 ccatcttgaa taggggctgg gtaaaatgag gcggagacct attgggctgc attcccagga 144301 ggttaggcat tctaagtcac aggatgagct aggaggacgg cacaagatac aggtcacaaa 144361 gacctgaccg ataaaacagg ttgcggtaaa gaagctggcc aaaagccacc aaaaccaaga 144421 gggcaacgca catgatctct ggtcatcctc actactcatt ccacactaat tataatgcat 144481 taccatgcta aaagacccgc ccaccagcgc caggactgtt tacaaatgcc atggcaatgt 144541 cagttacctt atatagtctt aaaagtggag gaaccctcag ctgtgggaat tgcccacctc 144601 tttcccagaa aactcatgaa taattcgccc cttgtttagc atataattta gaaaaaactg 144661 ttaagcatta tgagttgagc agtccaagct gctgttctgg ctgtggagta gccattcttt 144721 tatttcttta ctttcttaat aacttaagaa acttaataaa taaacttgct ttcattttac 144781 tctatggatt cgccttgagt tctttcttgc acaatatcta agaatcctct cttggggtct 144841 ggatcgggac ccctttcctt taacaatatg gccctcaaag acaaaaatat ttaccatgag 144901 tcctttaagc aaaggttcac agactcctgc cctaaaagat actgtaattc taagacatcc 144961 aggaaagaga cattctgtaa cttgtctttt ataaactaag atattaaagg ctttaaattt 145021 caatcttatg ctggcaaact tgataaaaca attggttttt cctgctaggt aagatggcaa 145081 gaattgctat tattttatat aaggctgttt agaagaaacc cttactaaag cttttaaggt 145141 tataaatcaa aagtaccgtc tagaattcaa ataatatgaa ttcaaattat agaattcaag 145201 ggcaatagca atgatatatt caatcttagt aatggttatt cagccctatt ttcccattat 145261 catagaaatt cccttttaaa catcttctat attttgaata atctgaaaaa caatctgtca 145321 taatgaaatg aactttacag taataataat agctatttgt taaatcttta ctatttgcca 145381 ggcagagttc tgagcatttt acttacatta acacatttaa tggctccaac catgctgtca 145441 ggtaagtaat attattatac ccattttgta gatgacaaaa cgaaggcaca gagacaggtt 145501 aagtaatttg cctaagatca cacagcaaat attggggcca ggattcaaac cccagtaatc 145561 tagacccaga aactgaacta tgctaggggt tagaccatct cagatttatt taaaagaggt 145621 ggaaatgaca ataatatttc actatagttt agaatttaat tgatatcact gtgtaacata 145681 cacctttttt tccctcagaa gtatttctgg accactaaca cgatcatgtt gatacaatgg 145741 aactcacatc ccctctccat atttgattgg atcaggtgtc aacatctggc ccaagctgga 145801 ccaataggct ccttccctga gcttttggtg ctgagattaa gattccagtc tcaacctgac 145861 ttctgtcact ctcccatgag atagaagaat agaggagaca tggagaaaaa cagtcaagag 145921 gtagaggtgg tgggagggac ctccttgatt cacaatggac tctatcccct tcctgaattc 145981 cagtggaacc atggccctgt gttctgtgag acaccctaga atccttcaac aatatccact 146041 tttggctgaa cccagtgggt tttggtcacc tagaaccaaa agaatcctca aacgtaaccc 146101 gattaagagc aaaacaacct ggcttttttt tttttttttt gagatggagt cttgctctgt 146161 tgcccaggct ggagtatagt ggcacgatct cggctcactg cagtctctgc ctcccaagtt 146221 ccagcgattg tgccaccaca cctggctaat ttttgtattt ttagtagaga caaggtttca 146281 ccatgttggc cagcctggtc ttgaactcct gacctcaggt gatccaccca cctcagtctc 146341 ccaaagtgct aggattacag gcatgagcca ctacacctgg ccaacctgcc attttatata 146401 ttctctgagt catcagcatg ggcctctcaa atgatggact tgctggattt ctgtttattt 146461 cctttgctgt gcactaagta ggtcagctcc caagaatgac tagtttggag atggagggac 146521 tagtggaata tgcttagaat ggtgcctaag ccaagctggg aaccaacaaa ttggttttgg 146581 gctctacttt agtttttctt gttgttgttt gtttgctttt tttttttggc ctggggggga 146641 aaaaacccac aaatgtcttt ccccagtttc ctccagtaac acagatggaa atagtagttt 146701 atacttggga tggtgctcgg ggaaatagtg tctcagaacc aattactgaa atgaccagaa 146761 tggatataaa atgtggctgc ataatttcct aagtcccatt tctcagggta aaaatggttc 146821 agaccatcca aatgcttctc cgaagaaatt acacaagcga gatggaaaaa gaaagtattg 146881 gtttctcagc gctggtaata attaggactt gaatgagtgg tgataaaccc atttacagac 146941 aggaagtgtg aaggcctgaa tggccaggag atttgctaag tgatgacaag atcaagatca 147001 ggcctctggc ctctgagccc agctccctaa gcctagtgaa ctttgaatga cacctgattt 147061 tttgccccat ccagcaccca ttttctgaat gcaatttgga ggaatgaaca tacgacccag 147121 aacctggcca atcagaatat tctaccgcca cctcttcccc tctcccccac catggtaatc 147181 agttcagcca tgggcaaatt atccaagttg aggcaataat actcagtccc atgacttcca 147241 ttgaaacttc taatgaaacg atctctctct gttgggattg ctacctgatg gctaatgtaa 147301 gcttagagcc actgggattg ccatatggaa agcatctgcc tgagagtgaa gccaacacaa 147361 gtagatgcag aaaaactgag tcctactgat ggcatttgat cccttggatc cagccacgct 147421 ggatatggga tatgggaaac cttgaagtac agtgactact gtggataagg ggtcaatggg 147481 taagtggttt ccttgtggga tgatgaaaat gtttggcatc tagaggtgat ggttgcacaa 147541 gactgtgaat gtacataaag ccactgaagg gtacacttta aaatgatgaa tgctatgtta 147601 tgtgaatttc gccttctttt tttaaaaaag gggaaagagg ccaggtgcgg tggctcacgc 147661 ctgtaatctc agcactttgg gaggccaagg tgggcagatt acctgaggtc aggagctcga 147721 gaccagcctg gccaatatgg tgaaaccctg tctctactaa aaatacaaaa attagccggg 147781 tgtggcggtg tgtgcctgta gtcccagcta tttgggaggc taaggcagga gaatcgcttg 147841 aacccaggag gcgaagtttg cagtgagttg agattgcacc actgcactcc tgcctgggca 147901 gcagagcgag actctttctc aaaaaaaaaa aagaggttgg gggaagagca taattttaac 147961 cacgcagata tcagagattt cacaaaagga gtgtcactga agctagatat ggtggtcaag 148021 aggcaggaga agggggagat gctttctagg cctaataaca aggtgctaag cccttttgat 148081 gtggtctgaa catctgttcc ctccaaatct catgttaaaa tttgatcccc agtgttggaa 148141 gtggggcctg ttgagaggtg tttggatgat gggagcagat ccctcgtgaa tggcttcatg 148201 ccatgtttgc aggattgagt gagttctcac ccttaattcc caagagatct ggctgttaaa 148261 gaagctggca tcttcctctt ttctcttcct ctctcttgac atgtgatgtc tgttcccctt 148321 caccttccac agtgattgta agattcttgg gcccctcacc agaatcagat gctggcacca 148381 tgtttcttgt ataacctgca gaactatgag ccaaataaac ctcttttatt tataaattac 148441 ccaacctccg gtattccttt acagcaacat gaaatggact aagatacctc tacctacagt 148501 aattacatcg ttaatcctca cagcaaccct tgaggtagta aatattactt tagccttgtt 148561 ttataggtaa gtatgcaagg gccagaaagg ggacataact tgcccaagtt tacatagttg 148621 tagatcagca gagagctggg actcaaacat agatttttct ggccaggcac ggtggctcat 148681 gcctgtaatc ctagcacttt gggaggccga ggcgagtgga tcacctgagg tcaagagttc 148741 gagaccagcc tggccaacat ggggaaacct catctctaat aaaactacaa aaattggcca 148801 ggtatggtgg tggacatcta taaccccagc tacttgagag gctgaggcag gagaatcact 148861 tgaacccagg aggcagaggt tgcagtgagc tgagatcatg ccattgcact ccagcctggg 148921 cttcagagag aaactctgtc tcaaaaataa ataaataaat aaaaataggt ttttacaact 148981 ctcaaccact ccgttcaagt gcctcctagt agagggaaga gcatagaaac acagaacatt 149041 tctttagaag aatgtatcat tccatgttct tcctagagga ataatcggag tttagcccac 149101 taagataact cacaacagca acctttataa cagtggaaaa attggaaaca tacaatgttt 149161 ggccatatgg actagttaag ttattgaaga atatgtgatg atttgggact aggatcataa 149221 tttagagctg agtgaaaaag gcaggatatg ggaaagtaaa atagttggat attagctaca 149281 caggaaaaat attaaaaggt tactagaaaa acacgcatca ggatgttggt tgactttggc 149341 gggtagaatt gtgggtgatt tttttctatg tgttttacgc gcgtgtgtgt gtgtatagac 149401 tcagtttcca tatctgtaaa atgggtataa taacagatct cccccactga atggtcagaa 149461 acactaagtg agacaacgca tgtaaagctt // LOCUS HSU96114 3475 bp mRNA PRI 29-MAY-1997 DEFINITION Homo sapiens Nedd-4-like ubiquitin-protein ligase WWP2 mRNA, complete cds. ACCESSION U96114 NID g2072502 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3475) AUTHORS Pirozzi,G., McConnell,S.J., Uveges,A.J., Carter,J.M., Sparks,A.B., Kay,B.K. and Fowlkes,D.M. TITLE Identification of novel human WW domain-containing proteins by cloning of ligand targets JOURNAL J. Biol. Chem. 272 (23), 14611-14616 (1997) MEDLINE 97313427 REFERENCE 2 (bases 1 to 3475) AUTHORS Pirozzi,G. and Uveges,A. TITLE Direct Submission JOURNAL Submitted (02-APR-1997) Cytogen Corp., 201 College Road East, Princeton, NJ 08540, USA FEATURES Location/Qualifiers source 1..3475 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 109..2721 /note="Nedd-4-like ubiquitin-protein ligase; WW domain-containing protein" /codon_start=1 /product="WWP2" /db_xref="PID:g2072503" /translation="MASASSSRAGVALPFEKSQLTLKVVSAKPKVHNRQPRINSYVEV AVDGLPSETKKTGKRIGSSELLWNEIIILNVTAQSHLDLKVWSCHTLRNELLGTASVN LSNVLKNNGGKMENMQLTLNLQTENKGSVVSGGKLTIFLDGPTVDLGNVPNGSALTDG SQLPSRDSSGTAVAPENRHQPPSTNCFGGRSRTHRHSGASARTTPATGEQSPGARSRH RQPVKNSGHSGLANGTVNDEPTTATDPEEPSVVGVTSPPAAPLSVTPNPNTTSLPAPA TPAEGEEPSTSGTQQLPAAAQAPDALPAGWEQRELPNGRVYYVDHNTKTTTWERPLPP GWEKRTDPRGRFYYVDHNTRTTTWQRPTAEYVRNYEQWQSQRNQLQGAMQHFSQRFLY QFWSASTDHDPLGPLPPGWEKRQDNGRVYYVNHNTRTTQWEDPRTQGMIQEPALPPGW EMKYTSEGVRYFVDHNTRTTTFKDPRPGFESGTKQGSPGAYDRSFRWKYHQFRFLCHS NALPSHVKISVSRQTLFEDSFQQIMNMKPYDLRRRLYIIMRGEEGLDYGGIAREWFFL LSHEVLNPMYCLFEYAGKNNYCLQINPASSINPDHLTYFRFIGRFIAMALYHGKFIDT GFTLPFYKRMLNKRPTLKDLESIDPEFYNSIVWIKENNLEECGLELYFIQDMEILGKV TTHELKEGGESIRVTEENKEEYIMLLTDWRFTRGVEEQTKAFLDGFNEVAPLEWLRYF DEKELELMLCGMQEIDMSDWQKSTIYRHYTKNSKQIQWFWQVVKEMDNEKRIRLLQFV TGTCRLPVGGFAELIGSNGPQKFCIDKVGKETWLPRSHTCFNRLDLPPYKSYEQLREK LLYAIEETEGFGQE" BASE COUNT 820 a 993 c 966 g 696 t ORIGIN 1 gaattcgcgg ccgcgtcgac cgcttctgtg gccacggcag atgaaacaga aaggctaaag 61 agggctggag tcaggggact tctcttccac cagcttcacg gtgatgatat ggcatctgcc 121 agctctagcc gggcaggagt ggccctgcct tttgagaagt ctcagctcac tttgaaagtg 181 gtgtccgcaa agcccaaggt gcataatcgt caacctcgaa ttaactccta cgtggaggtg 241 gcggtggatg gactccccag tgagaccaag aagactggga agcgcattgg gagctctgag 301 cttctctgga atgagatcat cattttgaat gtcacggcac agagtcattt agatttaaag 361 gtctggagct gccatacctt gagaaatgaa ctgctaggca ccgcatctgt caacctctcc 421 aacgtcttga agaacaatgg gggcaaaatg gagaacatgc agctgaccct gaacctgcag 481 acggagaaca aaggcagcgt tgtctcaggc ggaaaactga caattttcct ggacgggcca 541 actgttgatc tgggaaatgt gcctaatggc agtgccctga cagatggatc acagctgcct 601 tcgagagact ccagtggaac agcagtagct ccagagaacc ggcaccagcc ccccagcaca 661 aactgctttg gtggaagatc ccggacgcac agacattcgg gtgcttcagc cagaacaacc 721 ccagcaaccg gcgagcaaag ccccggtgct cggagccggc accgccagcc cgtcaagaac 781 tcaggccaca gtggcttggc caatggcaca gtgaatgatg aacccacaac agccactgat 841 cccgaagaac cttccgttgt tggtgtgacg tccccacctg ctgcaccctt gagtgtgacc 901 ccgaatccca acacgacttc tctccctgcc ccagccacac cggctgaagg agaggaaccc 961 agcacttcgg gtacacagca gctcccagcg gctgcccagg cccccgacgc tctgcctgct 1021 ggatgggaac agcgagagct gcccaacgga cgtgtctatt atgttgacca caataccaag 1081 accaccacct gggagcggcc ccttcctcca ggctgggaaa aacgcacaga tccccgaggc 1141 aggttttact atgtggatca caatactcgg accaccacct ggcagcgtcc gaccgcggag 1201 tacgtgcgca actatgagca gtggcagtcg cagcggaatc agctccaggg ggccatgcag 1261 cacttcagcc aaagattcct ataccagttt tggagtgctt cgactgacca tgatcccctg 1321 ggccccctcc ctcctggttg ggagaaaaga caggacaatg gacgggtgta ttacgtgaac 1381 cataacactc gcacgaccca gtgggaggat ccccggaccc aggggatgat ccaggaacca 1441 gctttgcccc caggatggga gatgaaatac accagcgagg gggtgcgata ctttgtggac 1501 cacaataccc gcaccaccac ctttaaggat cctcgcccgg ggtttgagtc ggggacgaag 1561 caaggttccc ctggtgctta tgaccgcagt tttcggtgga agtatcacca gttccgtttc 1621 ctctgccatt caaatgccct acctagccac gtgaagatca gcgtttccag gcagacgctt 1681 ttcgaagatt ccttccaaca gatcatgaac atgaaaccct atgacctgcg ccgccggctt 1741 tacatcatca tgcgtggcga ggagggcctg gactatgggg gcatcgccag agagtggttt 1801 ttcctcctgt ctcacgaggt gctcaaccct atgtattgtt tatttgaata tgccggaaag 1861 aacaattact gcctgcagat caaccccgcc tcctccatca acccggacca cctcacctac 1921 tttcgcttta taggcagatt catcgccatg gcgctgtacc atggaaagtt catcgacacg 1981 ggcttcaccc tccctttcta caagcggatg ctcaataaga gaccaaccct gaaagacctg 2041 gagtccattg accctgagtt ctacaactcc attgtctgga tcaaagagaa caacctggaa 2101 gaatgtggcc tggagctgta cttcatccag gacatggaga tactgggcaa ggtgacgacc 2161 cacgagctga aggagggcgg cgagagcatc cgggtcacgg aggagaacaa ggaagagtac 2221 atcatgctgc tgactgactg gcgtttcacc cgaggcgtgg aagagcagac caaagccttc 2281 ctggatggct tcaacgaggt ggccccgctg gagtggctgc gctactttga cgagaaagag 2341 ctggagctga tgctgtgcgg catgcaggag atagacatga gcgactggca gaagagcacc 2401 atctaccggc actacaccaa gaacagcaag cagatccagt ggttctggca ggtggtgaag 2461 gagatggaca acgagaagag gatccggctg ctgcagtttg tcaccggtac ctgccgcctg 2521 cccgtcgggg gatttgccga actcatcggt agcaacggac cacagaagtt ttgcattgac 2581 aaagttggca aggaaacctg gctgcccaga agccacacct gcttcaaccg tctggatctt 2641 ccaccctaca agagctacga acagctgaga gagaagctgc tgtatgccat tgaggagacc 2701 gagggctttg gacaggagta accgaggccg cccctcccac gccccccagc gcacatgtag 2761 tcctgagtcc tccctgcctg agaggccact ggccccgcag cccttgggag gcccccgtgg 2821 atgtggccct gtgtgggacc acactgtcat ctcgctgctg gcagaaaagc ctgatcccag 2881 gaggccctgc agttcccccg acccgcggat ggcagtctgg aataaagccc cctagttgcc 2941 tttggcccca cctttgcaaa gttccagagg gctgaccctc tctgcaaaac tctcccctgt 3001 cctctagacc ccaccctggg tgtatgtgag tgtgcaaggg aaggtgttgc atccccaggg 3061 gctgccgcag aggccggaga cctcctggac tagttcggcg aggagactgg ccactggggg 3121 tggctgttcg ggactgagag cgccaagggt ctttgccagc aaaggaggtt ctgcctgtaa 3181 ttgagcctct ctgatgatgg agatgaagtg aaggtctgag ggacgggccc tggggctagg 3241 ccatctctgc ctgcctccct agcaggcgcc agcggtggag gctgagtcgc aggacacatg 3301 ccggccagtt aattcattct cagcaaatga aggtttgtct aagctgcctg ggtatccacg 3361 ggacaaaaac agcaaactcc ctccagactt tgtccatgtt ataaacttga aagttggttg 3421 ttgtttgtta ggtttgccag gtttttttgt ttacgcctgc tgtcactttc ctgtc // LOCUS HSU96131 2203 bp mRNA PRI 26-JUL-1997 DEFINITION Homo sapiens HPV16 E1 protein binding protein mRNA, complete cds. ACCESSION U96131 NID g2232018 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2203) AUTHORS Yasugi,T., Vidal,M., Sakai,H., Howley,P.M. and Benson,J.D. TITLE Two classes of human papillomavirus type 16 E1 mutants suggest pleiotropic conformational constraints affecting E1 multimerization, E2 interaction, and interaction with cellular proteins JOURNAL J. Virol. 71 (8), 5942-5951 (1997) MEDLINE 97366654 REFERENCE 2 (bases 1 to 2203) AUTHORS Yasugi,T., Vidal,M., Sakai,H., Howley,P.M. and Benson,J.D. TITLE Direct Submission JOURNAL Submitted (02-APR-1997) Pathology, Harvard Medical School, 200 Longwood Ave., Boston, MA 02115, USA FEATURES Location/Qualifiers source 1..2203 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T-cell" CDS 46..1344 /note="16E1-BP; similar to thyroid receptor interactor encoded by GenBank Accession Number L40384" /codon_start=1 /product="HPV16 E1 protein binding protein" /db_xref="PID:g2232019" /translation="MDEAVGDLKQALPCVAESPTVHVEVHQRGSSTAKKEDINLSVRK LLNRHNIVFGDYTWTEFDEPFLTRNVQSVSIIDTELKVKDSQPIDLSACTVALHIFQL NEDGPSSENLEEETENIIAANHWVLPAAEFHGLWDSLVYDVEVKSHLLDYVMTTLLFS DKNVNSNLITWNRVVLLHGPPGTGKTSLCKALAQKLTIRLSSRYRYGQLIEINSHSLF SKWFSESGKLVTKMFQKIQDLIDDKDALVFVLIDEVESLTAARNACRAGTEPSDAIRV VNAVLTQIDQIKRHSNVVILTTSNITEKIDVAFVDRADIKQYIGPPSAAAIFKIYLSC LEELMKCQIIYPRQQLLTLRELEMIGFIENNVSKLSLLLNDISRKSEGLSGRVLRKLP FLAHALYVQAPTVTIEGFLQALSLAVDKQFEERKKLAAYI" BASE COUNT 625 a 498 c 506 g 574 t ORIGIN 1 cggcggccgc gccctggttg ggtccccact gctctcgggg gcgccatgga cgaggccgtg 61 ggcgacctga agcaggcgct tccctgtgtg gccgagtcgc caacggtcca cgtggaggtg 121 catcagcgcg gcagcagcac tgcaaagaaa gaagacataa acctgagtgt tagaaagcta 181 ctcaacagac ataatattgt gtttggtgat tacacatgga ctgagtttga tgaacctttt 241 ttgaccagaa atgtgcagtc tgtgtctatt attgacacag aattaaaggt taaagactca 301 cagcccatcg atttgagtgc atgcactgtt gcacttcaca ttttccagct gaatgaagat 361 ggccccagca gtgaaaatct ggaggaagag acagaaaaca taattgcagc aaatcactgg 421 gttctacctg cagctgaatt ccatgggctt tgggacagct tggtatacga tgtggaagtc 481 aaatcccatc tcctcgatta tgtgatgaca actttactgt tttcagacaa gaacgtcaac 541 agcaacctca tcacctggaa ccgggtggtg ctgctccacg gtcctcctgg cactggaaaa 601 acatccctgt gtaaagcgtt agcccagaaa ttgacaatta gactttcaag caggtaccga 661 tatggccaat taattgaaat aaacagccac agcctctttt ctaagtggtt ttcggaaagt 721 ggcaagctgg taaccaagat gtttcagaag attcaggatt tgattgatga taaagacgcc 781 ctggtgttcg tgctgattga tgaggtggag agtctcacag ccgcccgaaa tgcctgcagg 841 gcgggcaccg agccatcaga tgccatccgc gtggtcaatg ctgtcttgac ccaaattgat 901 cagattaaaa ggcattccaa tgttgtgatt ctgaccactt ctaacatcac cgagaagatc 961 gacgtggcct tcgtggacag ggctgacatc aagcagtaca ttgggccacc ctctgcagca 1021 gccatcttca aaatctacct ctcttgtttg gaagaactga tgaagtgtca gatcatatac 1081 cctcgccagc agctgctgac cctccgagag ctagagatga ttggcttcat tgaaaacaac 1141 gtgtcaaaat tgagccttct tttgaatgac atttcaagga agagcgaggg cctcagcggc 1201 cgggtcctga gaaaactccc ctttctggct catgcgctgt atgtccaggc ccccaccgtc 1261 accatagagg ggttcctcca ggccctgtct ctggcagtgg acaagcagtt tgaagagaga 1321 aagaagcttg cagcttacat ctgatcctgg gcttccccat ctggtgcttt tcccatggag 1381 aacacacaac cagtaagtga ggttgcccca cacagccgtc tcccagggaa tcccttctgc 1441 aaaccaaacg ttacttagac tgcaagctag aaagccacca aggccaggct ttgttaaaag 1501 aagtgtattc tatttatgtt gttttaaaat gcatactgag agacaaacat cttgtcattt 1561 tcactgtttg taaaagataa ttcagattgt ttgtctcctt gtgaagaacc atcgaaacct 1621 gtttgttccc agcccacccc cagtggatgg gatgcataat gccagcaagt tttgtttaac 1681 agcaaaaaag gaagattaat gcaggtgtta tagaagccag aagagaaact gtgtcaccct 1741 aaagaagcat ataatcatag cattaaaaat gcacacatta ctccaggtgg aaggtggcaa 1801 ttgctttctg atatcagctc gtttgattta gtgcaaaaat gttttcaaga ctatttaatg 1861 gatgtaaaaa agcctatttc tacattatac caactgagaa aaaaatggtc ggtaaagtgt 1921 tctttcataa taaataatca agacatggtc ccatttgcag gaaaagtgca gactctgagt 1981 gttccaggga aacacatgct ggacatccct tgtaacccgg tatgggcgcc cctgcattgc 2041 tgggatgttt ctgcccacgg ttttgtttgt gcaataacgt tatcacattt ctaatgagga 2101 ttcacattaa tataatataa aataaatagg tcagttactg gtctctttct gccgaatgtt 2161 atgttttgct tttatctcac agtaaaataa atataattaa aaa // LOCUS HSU96629 167343 bp DNA PRI 22-AUG-1997 DEFINITION Human chromosome 8 BAC clone CIT987SK-2A8 complete sequence. ACCESSION U96629 NID g2341008 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 167343) AUTHORS Adams,M.D. TITLE Human chromosome 8 BAC clone CIT987SK-2A8 complete sequence JOURNAL Unpublished REFERENCE 2 (bases 1 to 167343) AUTHORS Adams,M.D. TITLE Direct Submission JOURNAL Submitted (07-APR-1997) The Institute for Genomic Research, 9712 Medical Center Dr., Rockville, MD 20850, USA REFERENCE 3 (bases 1 to 167343) AUTHORS Adams,M.D., Loftus,B.J., Zhou,L., La Bombard,M., Kim,U.J. and Venter,J.C. TITLE Direct Submission JOURNAL Submitted (22-AUG-1997) The Institute for Genomic Research, 9712 Medical Center Dr., Rockville, MD 20850, USA COMMENT BAC clone CIT987SK-2A8 is located on chromosome 8. Genes were identified by a combination of five methods: XGRAIL (available by anonymous ftp from arthur.epm., Genefinder (available by anonymous ftp from colin@u.washington.edu), GENSCAN (available e-mail server at genscan@gnomic.stanford.edu), searches of the EST database at TIGR (http://www.tigr.org/tdb/hcd/hcd.html) and searches against a peptide database. Repeats were identified using Censor (Jurka, J., Klonowski, P. Dagman, V., Pelton, P. Censor-a program for the identification and elimination of repetitive elements from DNA sequences. Computers Chem 20: 119-121 (1996);available by anonymous ftp from ncbi.nlm.nih.gov). FEATURES Location/Qualifiers source 1..167343 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="8" /clone="2A8" repeat_region complement(20..308) /rpt_family="Alu-Jo" repeat_region complement(370..1051) /rpt_family="L1" repeat_region 1070..1355 /rpt_family="Alu-Jo" repeat_region complement(1382..2775) /rpt_family="L1" repeat_region complement(3086..3283) /rpt_family="Alu-Jo" repeat_region 3861..3991 /rpt_family="Alu-Jo" repeat_region complement(4198..4490) /rpt_family="Alu-Jb" repeat_region complement(4878..5157) /rpt_family="Alu-Sz" repeat_region complement(5650..5945) /rpt_family="Alu-Sx" repeat_region complement(7305..7586) /rpt_family="Alu-Sz" repeat_region complement(7613..7717) /rpt_family="L1MB3" repeat_region 8787..9080 /rpt_family="Alu-Jo" repeat_region complement(9234..9495) /rpt_family="L1MB3" repeat_region complement(9630..9902) /rpt_family="Alu-Y" repeat_region complement(9914..10017) /rpt_family="L1MB7" repeat_region 10304..10598 /rpt_family="Alu-Sq" repeat_region 11339..11669 /rpt_family="Alu-Jb" repeat_region complement(12169..12457) /rpt_family="Alu-Sz" repeat_region complement(12844..13048) /rpt_family="MLT1B" repeat_region complement(13100..13188) /rpt_family="Alu-J" repeat_region complement(13790..14082) /rpt_family="Alu-Sx" repeat_region complement(14138..14415) /rpt_family="Alu-Jo" repeat_region complement(14996..15257) /rpt_family="Alu-Jo" repeat_region complement(15294..15582) /rpt_family="Alu-Sx" repeat_region complement(15906..15984) /rpt_family="MER31" repeat_region 16308..16403 /rpt_family="MIR" repeat_region 16420..16580 /rpt_family="L1" repeat_region 16598..16694 /rpt_family="Alu-Jb" repeat_region 16695..16975 /rpt_family="Alu-Jo" repeat_region complement(17050..17344) /rpt_family="Alu-Sx" repeat_region complement(17582..17871) /rpt_family="Alu-Sz" repeat_region 18635..18924 /rpt_family="Alu-Jb" repeat_region complement(19052..19565) /rpt_family="MLT2C2" repeat_region complement(19711..19999) /rpt_family="Alu-Sx" repeat_region complement(20040..20356) /rpt_family="Alu-Sx" repeat_region 20613..20713 /rpt_family="Alu-Jo" repeat_region 20851..21135 /rpt_family="Alu-Sxz" repeat_region 21254..21434 /rpt_family="MER5A" repeat_region 21515..21858 /rpt_family="THE1B" mRNA join(21859..21967,24521..24701,30613..30673,37496..37667, 40528..40652,42553..42720,47399..47497,48453..48590, 50157..50816) /gene="2A8.2" gene 21859..50816 /gene="2A8.2" CDS join(21859..21967,24521..24701,30613..30673,37496..37667, 40528..40652,42553..42720,47399..47497,48453..48590, 50157..50333) /gene="2A8.2" /codon_start=1 /product="unknown protein CIT987SK_2A8_1" /db_xref="PID:g1930149" /translation="MDQASLKNSDVLVLTGLTQIPTANPDGMVGEFCSNLALTVRNGG NVLVPCYPSGVIYDLLECLYQYIDSAGLSSVPLYFISPVANSSLEFSQIFAEWLCHNK QSKVYLPEPPFPHAELIQTNKLKHYPSIHGDFSNDFRQPCVVFTGHPSLRFGDVVHFM ELWGKSSLNTVIFTEPDFSYLEALAPYQPLAMKCIYCPIDTRLNFIQVSKLLKEVQPL HVVCPEQYTQPPPAQSHRMDLMIDCQPPAMSYRRAEVLALPFKRRYEKIEIMPELADS LVPMEIKPGISLATVSAVLHTKDNKHLLQPPPRPAQPTSGKKRKRVSDDVPDCKVLKP LLSGSIPVEQFVQTLEKHGFSDIKVEDTAKGHIVLLQEAETLIQIEEDSTHIICDNDE MLRVRLRDLVLKFLQKF" repeat_region 21967..22392 /rpt_family="MLT2B2" repeat_region complement(22810..23089) /rpt_family="Alu-Jb" repeat_region 24458..24747 /rpt_family="Alu-Sz" repeat_region complement(25510..25694) /rpt_family="MER5A" repeat_region 27021..27409 /rpt_family="MSTA" repeat_region 27886..28172 /rpt_family="Alu-Sc" repeat_region 28186..28457 /rpt_family="Alu-Jo" repeat_region 28831..29113 /rpt_family="Alu-Sx" repeat_region complement(29159..29615) /rpt_family="L1MA9" repeat_region complement(29616..29892) /rpt_family="Alu-Sg" repeat_region complement(29911..29941) /rpt_family="L1MA9" repeat_region 29965..30243 /rpt_family="Alu-Sx" repeat_region complement(30252..30477) /rpt_family="L1MA10" repeat_region 30492..30781 /rpt_family="Alu-Sp" repeat_region complement(30782..31066) /rpt_family="L1MA10" repeat_region complement(31067..31170) /rpt_family="L1" repeat_region complement(31174..31463) /rpt_family="Alu-Sz" repeat_region complement(31464..32119) /rpt_family="L1" repeat_region complement(32156..32238) /rpt_family="L1MA2" repeat_region 32253..32537 /rpt_family="Alu-Y" repeat_region complement(32539..33095) /rpt_family="L1MA2" repeat_region 33193..33469 /rpt_family="Alu-Jo" repeat_region complement(33485..33809) /rpt_family="MER42C" repeat_region complement(33812..34098) /rpt_family="Alu-Sg" repeat_region complement(34119..34204) /rpt_family="MER42C" repeat_region complement(34285..34342) /rpt_family="MER42C" repeat_region complement(34344..34492) /rpt_family="Alu-Jo" repeat_region complement(34725..35015) /rpt_family="Alu-Sz" repeat_region 35060..35195 /rpt_family="Alu-Jo" repeat_region complement(35656..35820) /rpt_family="MIR" repeat_region 36034..36320 /rpt_family="Alu-Sz" repeat_region complement(36921..37214) /rpt_family="Alu-Sq" repeat_region complement(37265..37496) /rpt_family="MIR" repeat_region complement(38935..39011) /rpt_family="MLT1F" repeat_region complement(39075..39231) /rpt_family="Alu-Sxzg" repeat_region complement(39232..39294) /rpt_family="MLT1F" repeat_region complement(39339..39425) /rpt_family="L1PA15" repeat_region complement(39428..39801) /rpt_family="MLT1F" repeat_region 40477..40764 /rpt_family="Alu-Sp" repeat_region complement(42022..42312) /rpt_family="Alu-Sx" repeat_region 43355..43575 /rpt_family="MIR" repeat_region 43592..43885 /rpt_family="Alu-Sx" repeat_region complement(44001..44263) /rpt_family="Alu-Sg" repeat_region complement(44791..45047) /rpt_family="Alu-J" repeat_region complement(45079..45398) /rpt_family="L1ME3A" repeat_region complement(45432..45706) /rpt_family="Alu-Sz" repeat_region complement(45707..45808) /rpt_family="L1ME3A" repeat_region 46415..46695 /rpt_family="Alu-Sx" repeat_region 47430..47560 /rpt_family="Alu-Spqxz" repeat_region 47575..47864 /rpt_family="Alu-Sz" repeat_region 47874..48046 /rpt_family="Alu-Jb" repeat_region complement(48389..48667) /rpt_family="Alu-Sg" STS complement(50541..50816) /gene="2A8.2" /db_xref="dbSTS:G06197" repeat_region complement(53539..53812) /rpt_family="Alu-Sg" repeat_region 54296..54432 /rpt_family="Alu-Jo" repeat_region 57961..58197 /rpt_family="MIR" repeat_region complement(59386..59442) /rpt_family="L1" repeat_region complement(59486..59692) /rpt_family="Alu-Jo" repeat_region complement(59693..60228) /rpt_family="L1" repeat_region complement(60767..60910) /rpt_family="MIR2" repeat_region complement(62779..63068) /rpt_family="Alu-Sq" mRNA complement(join(64788..67820,75259..75387,80808..80952, 87121..87248,100257..100436,100836..>101015)) /gene="2A8.3" gene complement(64788..>101015) /gene="2A8.3" CDS complement(join(67611..67820,75259..75387,80808..80952, 87121..87248,100257..100436,100836..>101015)) /gene="2A8.3" /codon_start=1 /product="hereditary multiple exostoses gene isolog" /db_xref="PID:g1930150" /translation="LPYQDMLQWNEAALVVPKPRVTEVHFLLRSLSDSDLLAMRRQGR FLWETYFSTADSIFNTFTVVMLTYEREEVLMNSLERLNGLPYLNKVVVVWNSPKLPSE DLLWPDIGVPIMVIEKRTVVRTEKNSLNNRFLPWNEIETEAILSIDDDAHLRHDEIMF GFRVWREARDRIVGFPGRYHAWDIPHQSWLYNSNYSCELSMVLTGAAFFHKYYAYLYS YVMPQAIRDMVDEYINCEDIAMNFLVSHITRKPPIKVTSRWTFRCPGCPQALSHDDSH FHERHKCINFFVKVYGYMPLLYTQFRVDSVLFKTRLPHDKTKCFKFI" repeat_region 71356..71642 /rpt_family="Alu-Jo" repeat_region complement(72515..72794) /rpt_family="Alu-Sx" repeat_region 73349..73673 /rpt_family="Alu-Sg" repeat_region complement(74448..74502) /rpt_family="MIR2" repeat_region complement(74832..75119) /rpt_family="Alu-Sc" repeat_region complement(75414..75634) /rpt_family="MER20" repeat_region complement(75760..76050) /rpt_family="Alu-Sx" repeat_region complement(76616..76757) /rpt_family="MER5A" repeat_region 77465..77762 /rpt_family="Alu-Jo" repeat_region 77879..78165 /rpt_family="Alu-Jb" repeat_region 78189..78473 /rpt_family="Alu-Sg" repeat_region 78589..78690 /rpt_family="Alu-Jo" repeat_region complement(81264..81598) /rpt_family="L1MB7" repeat_region complement(81606..81903) /rpt_family="Alu-Sx" repeat_region complement(81913..82230) /rpt_family="L1MB7" repeat_region complement(82290..82548) /rpt_family="Alu-Jo" repeat_region complement(82549..82702) /rpt_family="L1MB7" repeat_region complement(82703..82988) /rpt_family="Alu-Sx" repeat_region complement(82993..83039) /rpt_family="L1MB3" repeat_region 83301..83565 /rpt_family="Alu-Spqxz" repeat_region 84689..84774 /rpt_family="L1ME3A" repeat_region 84776..85049 /rpt_family="L1MB3" repeat_region complement(85689..85981) /rpt_family="Alu-Jo" repeat_region 87133..87408 /rpt_family="MER33" repeat_region complement(88214..88498) /rpt_family="Alu-Jb" repeat_region complement(89168..89451) /rpt_family="Alu-Sg" repeat_region complement(89834..90067) /rpt_family="Alu-Sz" repeat_region complement(90333..90627) /rpt_family="Alu-Jb" repeat_region 92155..92244 /rpt_family="MIR" repeat_region 92735..92789 /rpt_family="MIR" repeat_region 93250..93356 /rpt_family="Alu-Sz" repeat_region 93383..93489 /rpt_family="Alu-Sbcg" repeat_region 93492..93662 /rpt_family="Alu-Sz" repeat_region 93663..93841 /rpt_family="Alu-Sbcg" repeat_region 93921..94057 /rpt_family="Alu-Jo" repeat_region complement(95560..95844) /rpt_family="Alu-Sc" repeat_region complement(96613..96901) /rpt_family="Alu-Sz" repeat_region 96965..97250 /rpt_family="Alu-Y" repeat_region 97714..97828 /rpt_family="Alu-Jb" repeat_region complement(98576..98835) /rpt_family="Alu-Jo" repeat_region 104336..104622 /rpt_family="Alu-Sq" repeat_region complement(104958..105211) /rpt_family="L1ME2" repeat_region complement(105217..105815) /rpt_family="L1MD2" repeat_region complement(105857..106379) /rpt_family="L1MC2" repeat_region complement(106984..107800) /rpt_family="L1PA2" repeat_region complement(107826..108818) /rpt_family="L1" repeat_region complement(108881..108964) /rpt_family="L1MA10" repeat_region complement(109026..109712) /rpt_family="L1MB3" repeat_region 109974..110212 /rpt_family="Alu-Jo" repeat_region complement(110600..111731) /rpt_family="L1" repeat_region complement(113125..113423) /rpt_family="Alu-Sg" repeat_region 114130..114234 /rpt_family="MLT1F" repeat_region complement(114259..114531) /rpt_family="Alu-Y" repeat_region 114532..114573 /rpt_family="MLT1F" repeat_region complement(115218..115508) /rpt_family="Alu-Sg" repeat_region complement(120069..120104) /rpt_family="Alu-?" repeat_region complement(120736..121025) /rpt_family="Alu-Jb" repeat_region 121108..121384 /rpt_family="Alu-Sg" repeat_region 121392..121678 /rpt_family="Alu-Y" repeat_region 122047..122336 /rpt_family="Alu-Y" repeat_region complement(122426..122509) /rpt_family="Alu-Sxz" repeat_region complement(122510..122688) /rpt_family="Alu-Spqxz" repeat_region complement(122700..122780) /rpt_family="Alu-Sxzg" repeat_region complement(122781..122976) /rpt_family="Alu-Sxz" repeat_region complement(124225..124516) /rpt_family="Alu-Sz" repeat_region 128991..129117 /rpt_family="MER30" repeat_region 130015..130110 /rpt_family="L1PA15" repeat_region 130124..130330 /rpt_family="L1MB3" repeat_region 130366..130933 /rpt_family="L1MC2" repeat_region 131487..131724 /rpt_family="Alu-Jo" repeat_region 131748..132003 /rpt_family="Alu-Y" repeat_region complement(133811..134397) /rpt_family="MER42C" repeat_region complement(134398..134555) /rpt_family="MER42C" repeat_region complement(134652..134744) /rpt_family="MER42C" repeat_region complement(134747..135102) /rpt_family="L1ME3A" repeat_region complement(135134..135434) /rpt_family="Alu-Jb" repeat_region complement(135571..135858) /rpt_family="Alu-Y" repeat_region complement(136276..136477) /rpt_family="MER46" repeat_region complement(136958..137205) /rpt_family="MER42A" repeat_region complement(137271..137560) /rpt_family="Alu-Sq" repeat_region 137704..137743 /rpt_family="MER42C" repeat_region 138455..138724 /rpt_family="Alu-Jb" repeat_region complement(139252..139461) /rpt_family="Alu-Jo" repeat_region 141344..141465 /rpt_family="MER42A" repeat_region 141471..141755 /rpt_family="Alu-Sx" repeat_region complement(143607..143661) /rpt_family="MER5B" repeat_region 144031..144182 /rpt_family="MLT1A" repeat_region 144196..144483 /rpt_family="Alu-Sg" repeat_region 144508..144679 /rpt_family="MLT1A" repeat_region 144861..145149 /rpt_family="Alu-Sx" repeat_region complement(145868..146044) /rpt_family="MER5A" repeat_region complement(147349..147631) /rpt_family="Alu-Spqxz" repeat_region 147714..147809 /rpt_family="Alu-Sxzg" repeat_region 147848..148137 /rpt_family="Alu-Sg" repeat_region 150039..150330 /rpt_family="Alu-Jb" repeat_region complement(150454..150740) /rpt_family="Alu-Jb" repeat_region 151674..151950 /rpt_family="Alu-Jb" repeat_region complement(152766..153045) /rpt_family="Alu-Jo" repeat_region 154779..155069 /rpt_family="Alu-Jo" repeat_region complement(157035..157312) /rpt_family="Alu-Sg" repeat_region complement(157791..158089) /rpt_family="Alu-Jb" repeat_region 158795..158901 /rpt_family="L1MA9" repeat_region complement(159971..160255) /rpt_family="Alu-Sz" repeat_region complement(160264..160552) /rpt_family="Alu-Jb" repeat_region 161425..161683 /rpt_family="Alu-Jb" repeat_region complement(162605..162890) /rpt_family="Alu-Sc" repeat_region complement(164259..164472) /rpt_family="MER46" repeat_region complement(164986..165830) /rpt_family="L1PA2" repeat_region complement(165847..167322) /rpt_family="L1" BASE COUNT 49550 a 37098 c 35753 g 44929 t 13 others ORIGIN 1 atctaccatg atcaagtggg cttcatccct gggatgcaag gctggttcaa tatacgcaaa 61 tcaagaaatg taatccagca tataaacaga accaaagaca aaaaccacat gattatctca 121 atagatgcag aaaaggcctt tgacaaaatt caacaaccct tcatgctaaa aactctcaat 181 aaattaggca ttgatgggac gtatctcaaa ataataagag ctatctatga caaacccaca 241 gccaatatca tactgaatgg gcaaaaactg gaagcattcc ctttgaaaac tggcacaaga 301 cagggatgcc ctctctcacc actcctattc aacatagtgt tggaagttct ggccagggca 361 attaggcagg agaaggaaat aaagggtatt caattaggaa aagaggaagt caaattgtcc 421 ctgtttgcag acgacatgat tgtatatcta gaaaacccca ttgtctcagc ccaaaatctc 481 cttaagctga taagcaactt cagcaaagtc tcaggataca aaatcaatgt acaaaaatca 541 caagcattct tatacaccaa taacagacaa acagccaaat catgagtgaa ctcccattca 601 caattgcttc aaagagaata aaatacctag gaatccaact tacaagggat gtgaaggacc 661 tcttcaagga gaactacaaa caactgctca atgaaataaa agagggtaca aacaaatgga 721 agaacattcc atgctcatgg gtaggaagaa tcagtatcgt taaaatggcc acactgccca 781 aggtaattta tagattcaat gccatcccca tcaagctacc aatgactttc ttcacagaat 841 tggaaaaaac tactttaaag ttcatatgga accaaaaaag agcccacatc accaagtcag 901 tcctaagcca aaagaacaaa gctggaggca tcacgctacc tgacttcaaa ctatactgca 961 aggctacagt aaccaaaaca gcatgttact ggtaccaaaa cagagatata gatcaatgga 1021 acacaacaga gccctcagaa ataacgccac atatctacaa ctatctgatc tttgacaaac 1081 ctgagaaaaa caagcaatgg ggaaaggatt ccctatttaa taaatggtgc tgggaaaact 1141 ggctagccat atggagaaag ctgaaactgg atcccttcct tacaccttat ataaaaatta 1201 attcaagatg gattaaagac ttaaacgtta gacctaaaac cataaaaacc ctagaagaaa 1261 acctaggcat taccattcag gacataggca tgggcaagga cttcatgtct aaaacaccaa 1321 aagcaatggc aayaaaagcc aaaattgaca aatgggatct aattaaacta aagagcttct 1381 gcacagcaaa agaaactacc atcagagtga acaggcaacc tacaaaatgg gagaaaattt 1441 tcgcaaccta ctcatctgac aaagggctaa tatccagaat ctacaatgaa ctcaaacaar 1501 tttacaagaa aaaaacaaac aaccccatca aaaagtgggc aaaggacatg aacagacact 1561 tctcaaaaga agacatttat gcagccaaaa aacacatgaa aaaatgctca ccatcactgg 1621 ccatcagaga aatgcaaatg aaaacyacaa tgagatacca yctyacacca gttagaatgg 1681 caatcattaa aaagtcagga aacaacaggt gctggagagg atgtggagaa ataggaacac 1741 ttttacactg ttggtgggac tgtaaactag ttcaaccatt gtggaagtca gtgtggcgat 1801 tcctcaggga tctagaacta gaaataccat ttgacccagc catcccatta ctgggtatat 1861 acccaaagga ctataartca tgctgctata argacacatg cacacgtatg tttattscgg 1921 cactattcac aatagcaaag acttggaacc aacccaaatg tccaacaatg atagactgga 1981 ttaagaaaat gtgkcacata tacaccatgg aatactatgc agccataaaa aatgatgart 2041 tcatgtcctt tgtagggaca tggacgaaat tggaaatcat cattcacagt aaactatcgc 2101 aagaacaaaa aaaccaaaca ccgcatattc tcactcatag gtgggaattg aacaatgaga 2161 acatatggac acaggaaggg gaacatcaca ctctggggac tgttgtgggt kgggggaggg 2221 gggmgggaca gctttagggg acatacctaa tgctaaatga cgagttaatg ggtgcagcac 2281 accagcatgg cacatgtata catatgtaac taacctgcac attgtgcaca tgtaccctaa 2341 aacttaaagt ataataataa taaaattttt aaaaaaggaa aaaaaaaaga aagtcagttt 2401 tgctagatat atagtccttg gcatgcattt tctttctttg agtatcttaa atatgttctc 2461 atattttttt ctaatattaa acattgctat taaaaacact gataaaatct aattttcttt 2521 ccttgtaagt cacttgttct tttcctagat cccaaaggtt tgcttgtagt ctaaatattt 2581 tccagaatat gtctgttgtt cattgttctg ggtcagtatt ctcaagtgta cactgtgttc 2641 ttttagtgtg tagtttcgtg tctcttcatt ttagcaatta tagtatttag taattgaata 2701 ttatgagtgt taattattat tctcacttgg ttttctgtga tgccacataa gattccctta 2761 tgtggcatct tgcttatctg tcttcaacat ttgttaggtt cttttgaatt gtttaaatct 2821 cttcatttct ttttggtatt ttttattaat ctactcttgt gtttctatta caggttgagt 2881 gtcccttatg tgaaatactt gggaccaaag tgtttcagac ttcagacttt ttccgatttt 2941 ggaatattgc tgattgagca tcccaaatcc aaaatccaaa gtaatccagt gagcatttcc 3001 tttaagcgtc atgtttgcct caaaaagctg cagattttag accatttctg acttcaggtt 3061 ttcagatttg ggatggtcaa catgtagttt agtcttcatt tccaaaatga tgttttcttt 3121 tatttctaat tctttattga gttttgtcac ctcatttata agctttgctg gtttttcatg 3181 tatgtacctc tttcatgttt gtataacttt taaatctttt tagcttattt gaaattctgg 3241 tgtattgttg gcatgctttc actctctata tgacattgta tttctaattt gtaacagctc 3301 tttttattct cttaatcttt tattttgtag caatctcttc tcatttctta gctatactat 3361 cttatttttc taacgatagt aaggacaagc tgttcttaaa gttttcttct acctgcctaa 3421 tttatttctt ctaatttccc tgcctgctcc tctgccccca cttgaggcct ttattatttt 3481 agagactttt ctcaaattta tggtagtcct tggctattgg ctcatgttta agagttgaac 3541 gattaaaaaa actaattaga aagtctatgt gccatgggta gggcttgttc acttccacac 3601 tttaccataa agtaatctga ttgagctgtt tctttgtgga atcctctgcg ttagaatctt 3661 ttcattaatt ttttttcttt gaggctgatc ggattcttca gagaagattc tttcagcccc 3721 ctaccctgag gggaataagc ttactcatag tgctttggca gccaaatgag gagaggaaca 3781 ttgttcctct gtaaattttt gtttaggaag gctgtctcag ttgatggttt cccgtagtcc 3841 agactttcat ttttactccc tccagagaac aacctctggt agcatacctg agaggagaag 3901 ggacatctgc tgagctatat ggaaggaatg aggagatctg gaaggttcta agtatctcgt 3961 ctcttttttc aacagttcct cttgttttta ggttgattca acttcctgat acacctgttg 4021 ttttcagttg ccatattttt tgtgggttct gcagtagaaa ttaaacgttt gcattgaact 4081 ttcctgggcc tatgaagtca gttatcattt gtctgtctac tttctaaaat gccttgctat 4141 tgtctcttct ctcattctct ttgtcttaag ggtgtgtgtg tgagagtgtg tgtgtgtgtg 4201 tgtgtgtgtg tgtgtgtgtg tgtgtgagaa gccctgttca gtgttgtttc aggagagaga 4261 ggagaggcta atggcatgca ttcatttcac cccagtactt ggacctgtat tgtacagtga 4321 atgtcaggga agttactctt caggtctcct gattcttttg gagcaaatga taaaacgttt 4381 ttctgttgac acattttggg cgacatagca agaccatgtc tctatttttt tttttttttt 4441 aaaaaaagaa atggctgagc acggtggctc atgcctgtaa tcccagcact ttgggaggcc 4501 gagttgggcc tatcacaagg tcaggagatt gagaccatca tggccaacat ggtgaaaccc 4561 catctctact aaaaatacaa aaattagccg ggcatggtgg tgggcgcctg taatcccagc 4621 tacttaggag gctgaggcag gagattcgct tgaacccggg aggtggaggt tgcagtgagc 4681 cgagatggcg ccatagcact ccagcctggt gacacagtga gactctgtct caaaaaaagt 4741 aaaaataaaa acagagaaat ggtcataaag gaatcctatg aacaattata tgccagtaaa 4801 ttaaaccatt tggatcaaat ggacaaatta ctagaaagga atgctgtaga acatgaagaa 4861 atgttcacct ggtagttgac attgtgatcc atttgcaggc tgttaccttc tcctctcaag 4921 gatgcagtgg aagtctcaac ctggagaaga tgctatacaa tgcaagaggt gaactctgcc 4981 cttagtaaaa tccagctggt gggatattct cagaaaattg tgagtattca tattacattt 5041 cagttattca tgaatgcttt ccattcatat tgttgtttgt tgtttggaag aatcctatag 5101 ttacgttttt aaagccattc cattgctgag gatccagagc ctctgttctt tcctccgttc 5161 cgcgcaggat tttattggtg ctctttcccc accctcacat ctccatcacc agccagcatt 5221 cgattggcca gcgtgcaggg agtccggaga aaggcgtctc atcctgttca cattagattt 5281 tatagatttt ggatgggtga aacgggaaga gagaagagtt tgtcaagtgt gacttttgag 5341 ctctgaccta aatgataagc cttcccattt cttactgtca tcctgtgccc agagctactc 5401 agtaccgaac aacaagggcc taacacctaa ctgaaaatga aaaaggaaag ccaaagtgtg 5461 tgagtctttg gtctgtttgg taatatttca tctctccctt ttaatgtgtg aaccttgagt 5521 gcctggggac atggaagaga gctgaagctc tcaggtgaca agtaaatatt ataggattgc 5581 tttctttgtc tgccagttga tctgcatcat ctttctgttt tccttaaaac tttctagttt 5641 actttattga ttgattgact gagacaaggt cccactttgt tacccaggct ggagcgcagt 5701 ggtacaaaca tggctcactg cagcctcaac ttcccgggct ccagtgatcc tcctgcccca 5761 agtagctgct tgaggactac aggcatgtgc caccatgccc agctaatttt tgtatttttt 5821 tgtagagaca gggtttcacc atgttgccca ggctggtctt gaactcctgg cctcagcctc 5881 ccaaagtgct gggattacag gcgtgagcca ttgcacccag tctctggttt actttaaaat 5941 aatttttgtt tttaaactga ggatatttct gttgtttttc cctgcagaat tacctcatgt 6001 gactgtcact gtaagctcat tgcacattct tactgtggtt ctcttttagg agctttttgg 6061 tgcggtccag gtgactcctc tgagctctgg ctatgccctt gggagctcca actggatcat 6121 ccagtctcat tacgagaaag tgtcttatgt ctctggatcc tccttgctta ccacacaccc 6181 ccaggtaatt ccaaattctc ttctagcaac tcagcttttt ggttacttaa gtcaaattca 6241 gaatgtatcc aaggaaccat cagccatttt taaatcttcc aaatatggtt ttctacagat 6301 actctctagc caaggtagac tatttgagtc tcaacatttt gacctacagg tttctctgaa 6361 atagtcctgc taccttgagg gtcactccta ggattctgaa atcccccagg ccttccaaag 6421 accatagcct gatgtgggac acagatggtt atgcatttac tcagcaaata ttaactgttt 6481 aaaatccttc ccaagggcca agtgtcaagt gtcatgcaca catctgggta ttggggattc 6541 agtggtgacc aacgggcaaa gcatgtgccc gtagatctta tgttgtaggg gagttgatga 6601 tgttggggag aggatggtgt atagtaggta aacaaataaa gtgcctggtc atttccgatt 6661 gagatacaag tactgaaaac agtaaagcag ggtgattttc agaatgatgg ccattggttt 6721 agattgggtg cccaggaaag ccaatgggaa gatctcactt gaactgagac ctggagagat 6781 aaaccatgtc ggctgggcgc ggtggctcat acctgtaatc ccatcatttt gggaggccga 6841 aatgggataa ctgcttgagc ctaggagttc aggaccggcc tgggcaatat ggcaaaactc 6901 tgtctctaca aaaaatacaa aaattacccg ggtgtggtgg cacacgctgt ggtcccagct 6961 actcaggaag ctaaggcaga aggatcgctt gagcctggga agcggaggtt gcagtcagcc 7021 gagattgcgc caccgcactc cagtgcgggt aacagagtga gattatgcct caagaaaaaa 7081 aaaaaaaggc cgggtatggt ggctcatgcc tgtaatccca gcactttggg aagccaaggc 7141 gagtggatca ctttaggtca ggagttcaag accaacctgg ccaacatggt gaaaccccat 7201 ctctactaaa aatacaaaaa ttaggtgtga tggtgtgcac ctataatccc agctacttgg 7261 gaggctgagg cgggagaatc acttgaactc gggagacaga ggttgcagtg agctgagatc 7321 atgctgctgt acccagcctg ggtgacagag tgagactcca tctcaacaaa aaaaaaaaaa 7381 aagagagaga aagaaaaaag aaaaacagag aaattagcca cgtaaagccg tgagtgtttg 7441 tattacaaag ggatggccag tgaagggccc ctaaagtaag aataagctgg gcatgtttga 7501 agggcagaga aggctattgt ggtcacagcg tggaggtcag cagtgaggtc caagagagtg 7561 gcagacacca tgtcatgtag tgttagcagg ctgtgaggag gaattttggt tttattttaa 7621 tatggagagg gaaactattg gaacgtttta agttattcat tccagtcata tttggcaaga 7681 agcctagcac atataaacat tgttatgaat gtgatactta ctcctttttg gtatttgtaa 7741 ataatttact gttcatttcc tgaatgttgg ttatttctat gtttgtaata gggagtgggg 7801 ggacattagt tagctgttga atgggtatat agatacatta ggtaacttgt ggaagtccat 7861 attacatttg tttatctaca tctatttacg gagagagaga gagagagaga aggtcttgtt 7921 ctgtcacccg gactggagta cagtggtgta gtcatagctc actgtaatct caaactcctg 7981 ggctcaagca atcctcccaa gtagctagga ctatagccac cacacctggc ctatttattt 8041 tttaacataa cctcaaattt ttattgtctt cataataaaa ccaaaaatga agctaagaac 8101 tggatcactt ggccttttct ccttttatcc cttcccagtt aaaaatactt gtatctctta 8161 gtagccagca ttctcctaga tctgcagttg ggcccaacac ttaagcttta gcacaatctc 8221 gtttgtagtt ttagcctttt tccagaagat tggcttggtc tgcctacata gccacccctt 8281 cctgccatta agccactttc ccttggcata cagatcatct tttcccttct tgtaccatgt 8341 cactctgtgg ggttggtgcc aaccatgctt cttacacaaa gtccagtggg tttgaagaac 8401 attcaccatg ttagagcact atcagtaaag aaagaaagaa attattcatt ttttaattac 8461 aaataaaaat tgtatatatt tatggtatgc atgatgtctt gatatgtgca tgcattatgg 8521 aatggctaag tcaataatta acagacccca ttttaataca gggagaacca tgctgtgctc 8581 tagtgttgaa caataggatg tctgagctgc cattctgtat tatttcttta taccttcttt 8641 tatagccaag tttcatctca agatctagag gggacgttgc tattttttcc tgcatctggc 8701 ggaattctgg gcccttcctg gttattgaaa tcaaaagccc atcaatgtca ccatcatctg 8761 cttcattgaa tcaaaatttt ttattggcag cttctatcgt tcctgatatg ttcttccata 8821 aaagacagaa agatgacttg gttgccaact ctcgcgattt gtcctgctta gttcaaagcc 8881 tttacagtac tattgatgta atttccagta aattattctt acaaggtcca taaatttaaa 8941 gggaaaataa tgtcttgaaa gtaatgagca acatacctaa gtaattaatt ttaattttta 9001 gctggcaacc tgtgttatat gtaaaaaaga aaaaaattag atttttctct acccacgtaa 9061 ttggattgtg tattgaattg gcagggatga gaaaagtttt ggtttgaaaa acttgataga 9121 ctaatgcaga tgttagcaaa ctgtggcctg ggcactaaat gtagcatgcc acctattttg 9181 gcatataata ttttgttgaa gtacagccac acccacttgt ttatggaatg tttatggctg 9241 aatatacacc gtaggctgga caaggtggct catgcctgta atcacagcat tttgggaggc 9301 caaggcaaga tgattgcttg agcccaggaa ttggagacca gcctgggcaa catggcaaga 9361 tcccatctct acgaaaagtt aaaataaaat aaaaaaaagc caggtgcggt ggcatgcgcc 9421 tgtggtccca gctactcggg aggctgaggc atgaggattt cttcagcctg ggaggttgag 9481 gctgcagtga gccatgtttg tgccattgta ctctagcctg ggcaacagag caagaccctg 9541 tctcaaaaaa aaaaaaaaag ttataatggc agaattctac tttaaatgtt agagcaaact 9601 ttgctaaccc ctggtctact tgagtacaat ctttactaac taggaagaat atcacaggct 9661 gctgtagaat tctgataaac atggggaaat aaggctttgg attaagcctg aggcagtaag 9721 aatggagaaa agagttaaaa cattggcggg tctttaatgc aagaaacatt tgttgaatgc 9781 ccactgtctt cagaaaagaa agaataaaag ttacagatct tatgtctgca tgacattgag 9841 aatggtgtta atggccattc cagttaacaa ggaagagttg gcagagggac atttgttgca 9901 gaagagggta gtaggtttca tgaatgtgaa tttgagagaa cattagacag atgtaaatat 9961 ggggctggaa ctgggatgtg gaggcaagtc tggagacaaa ctggagagtt gtcacgtttt 10021 aaaaatctaa ccgggcacgg tggcacacac ctgtaatcct agcactttgg gagaccaagg 10081 caggcagatc acaaggtcag gagttcaaga ccccaacatg gtgaaacccc atctctacta 10141 aaaatacaaa aattaaccgg tgtgatggtg ctcacctgta atcccaaata ctcgggaggc 10201 tgaggcagga gaatcgcttg aacccaggag gtggaggttg cagtgagccg agatcgcact 10261 attacacttc agccagggca acagagagag actccgtctc aaaaaaaaaa aaaaaatcta 10321 aataaagggc tgagggccaa agactgatcc atagggaact tttaccaaca gacagtggaa 10381 gaaagaaaaa tagtcttgtg taagaatgga tggagagtta aaggaaaatt gaggccaaag 10441 agtgcaacct cccaaaggga gaaggaagag aactagcctt tactgagcat gaggtctcag 10501 tattaatttt ttaattgact tgatatttag caaccatgct gaattctctt aattctaata 10561 atctattgat attatcttgc caaagaagta acagttttct cacctctctt ctaacctttg 10621 tatcttttat ttttcttatc ttgtgactga gccctataat actacgttgc acagcaatga 10681 tgatagtgga catccttgtc ttgtataagg ctgtaaaagg aaagcttttg tagtttcttc 10741 gttaaacatc acgcttactg caccatgttt atttgtcaag ttaaggagtg tctcctttat 10801 ccccaacttt ctgatttttt aaaagtcaga tataagtgtt ataccttatc aaatgctttt 10861 gagcatgtga gatcaacttt gatttctctc ctttgagacc attaatgtag tgaactgcag 10921 tgttagcttt tctcacattt aaccatccaa tattcctggg ataaatcttg cttgattaca 10981 atctattctt tttaaaatac tctccaggaa tgagttggtg aatattttat tgaagtttat 11041 aatctatagt cataggtgaa aaatgggccc atacattatt ttcttgtact acctttgttt 11101 gttggaagcc aaggtgtatt agtctcataa ggtgatttgg gagcctttcc ctctttttct 11161 aatgtcagaa aaaagtatat gagataggga ttatcttttc ctgaaagttt ggtcaaatgt 11221 tccataaaac tgtctggacc tggattacca ttattgaact atattttctg ggccaaaatt 11281 gtgccagaat tttggcagag atttgtcctt tttgcttagg ttttcaaaat cataggcata 11341 gagctattta taatcctctt ttatttgttt aacctttttt gtgtaagtct gttttcattc 11401 taaattttat tttcatcatc atcttgatca gacttgctag aagtttgtct gtattattga 11461 ttttattcaa aaaataagtt tttgctttta atcgttttgg ttgtattttc atcctttgtt 11521 ctgcccttta tctccttcct tccttcttta ctttggattt actctgttta atacttgcta 11581 agtgtgtttc agtgtttgct tttcgataaa tgtatttaaa gcaaccggtt tcttagtata 11641 attttactct gttacatttt tgatactcag tgctttgtca ttcatctcta agtatgtcat 11701 aattttctct ataatgttca tgatttaaat aacyaaaggt tattttacag tataattgtt 11761 tgtttctagt ccatccagtc tgattagacg taggattaga ggaaatgttt ttaagcatat 11821 gtttcaggat tctaatcttt tgcattataa taaacatatc ctgatggact gaaatttgat 11881 tagtcttcct ttgaagcaca atctattttt gtaaatgttc tacgtgtctt ggaaaagaat 11941 gtgtattcac tgttgggtaa aatatttcta tatgtatttg agttttttgc attattcaag 12001 tcttatatct ttgcttagct actgatttct gaaaagggtg tgttagttgt tgatttatct 12061 gtttctcact gtagtttgcc aatttttact tttttaatat ttctaagctg tatactcagg 12121 agtccatata ttcatgatca ttgtgtttta tcaatcagtt attcttttta tcaggatgct 12181 tcgatgcttt ctttttttct ctataaaaac tgcattaaaa gctaagaggc tttttcccat 12241 ttcatatgtg cctggttttt tttgttttgt tttgtttttt tgagacaagg tcttgctctg 12301 tcgctcaggc tagagcacaa tggtgcaatc tcaactcact gcagcctctg cctccgcagt 12361 tcaagcagtc ctcccacctc agcctcccaa gtagctggga ctacaggcac atgtcaccgt 12421 gccttggcta atttttgttt tttttgtaga gacaggatct tcctatattg cccaggctgg 12481 tctcaaactc ctggcctcaa gcgatcggtc cacctttggc ctcccaaagt gctgggatta 12541 caggcatgag ccaccgtgct tggccgggat ttttttttta atctagtgtc tcttggttgg 12601 tgagcctgtt tgtgtttctt gtgatgacta ttgtagtttt accatcttct ttcatgtttt 12661 tagttcattc ttttcctagt cctttcttgc cttcctttag aagtgtaaat ttccttctgt 12721 atatgtgaaa atgcacattt tatttttatt cttctgagtt atttcttagt ttattttttc 12781 tgtgactatc ttacttatca gtatctgtat ctttcctccc aaagccacac tgtcctcatc 12841 tcccctatct cccctcatct cttcctttgc acatcatacc ctatgatgac catggtgaaa 12901 ccatctagaa ttttagttct gggtcgttta gaacatacat aatacggtgg tgaatatatt 12961 ccttactgca acaacagtga tcttcattga gatatattgt aagtttttca accttacttt 13021 ccataaacag gatctcataa catcctgcta gattgacttt tcttcttcca ggaatgcttg 13081 aggaatggga atctagaggg tcttgaagtg gtaagcctgt gaggccttga attattaaga 13141 atgtctttaa tttttttctc acatttaaat gatagcttgg atggattaaa aatcaaaggc 13201 aaaaaacttc gataggataa agctttggaa atatgacttc attttccact tgtatcgctt 13261 gttgtcatta agaaccctga agccatttag atttgcgttc cattatatgg gatctgcttt 13321 tagaattttc actttaatat ttgtaagttt taaaattatt tctcttcaat gtgtgttttt 13381 cctgtgaatg tagtatctgt gagatcttcc aatttccttt aacttaaata aattcttagt 13441 catattttaa attacttact cctggttgat tttcttttcc ttttaaggaa tttctagtat 13501 tatagatact gacacttctg tgtattgcat gtcttttttc ttgtgtattt cccacctact 13561 tcatgaagcc tcctggaaaa aatcttccag cccctgaatt cattctcagc cgtattcatg 13621 ctgctcctca gcctatctat tgaactcttc atttccacaa ctatactttt gttcacagta 13681 tttctaggtg tttctcttta tacctgctca ttttaattgc cctctgtgta tttttgggac 13741 attttaatac atatattcct actctctggt tcactaattc tccctgtggg gatagatttt 13801 agctcaccat gtttagtaga tgctgccttc cttggtgttc ttgtttgatt ccctgtgagc 13861 tcttcttgct tgaccctcag ggaccctcct ctcataccac tgcttcaggc attgtttctc 13921 ctgagtgtct ccctgacttg tcaccacttt gcccttgtgg tgtgagggaa caagcaagga 13981 gtggcttggt gttctgtgaa ccttcatccc actgttctgg catttccttc ctcatgcagg 14041 ggggcggggg gtattgaacc ttccacaatc tgccaactgt aatacggagg aaagaaaaaa 14101 ggacaaaggg tttttaccca gcctctcctc cacccgcagt agaggcgatt gcctgccatt 14161 ttgtcctcat tgcaagaccc ctagtttccc caggaattta tcccagtttt gatttagttt 14221 ctcaaatttg tcagctgccc ttgcttctga gcgtctctgt cctctaagtt tagattctgg 14281 gagtgtggca gagcatattg gctcatgcct gtaatcccaa caccttggga ggccaaggtg 14341 ggaggattgc ttgagctcag gagtgttcaa gaccagcttg gacaatatag tgggaccccg 14401 tctctacaaa aaatcaagaa agaagctggg cgtggtggca catacctgtg gtcccagcta 14461 ctcaggatgc tgaggtggga ggatcgcttg agttagggag gttgaggctg cagtgagctg 14521 tgactgcacc agtgtgctcc agcctgggca acaaagtgag accctgtctc aaaataaata 14581 aataaaaata aaaatagatt ctgggagcat gccagcagtt catgcccatg tgtggtcttg 14641 tcaggagtta taatagacat cttattttga aataatatta ttttcttcta tttctgatta 14701 gaaaatttta atttgtattt attgtaataa ttttggaaaa tacaaaaatc tcagagaaaa 14761 gataaaaact atatgaatcc tgacattaag agctatttgc agcctgcttt tctactcttt 14821 ctgatgaact gtatagtgaa ctttacttag gtcatcatgg attctaccac atgacatatg 14881 atatctgttt ggtggtctgt cgcgtggata taccatgaaa tgtttaactc ttccactgtt 14941 ggacatttaa atggcttaaa acttttttcc ttaaaaaaac ttatttcaaa cagttgtaca 15001 gtctgcccag aaaaagggcc caggacacag tttaaaaatg gtaatactaa tagaacaaaa 15061 caagcagcac ctgttggaaa gatcccataa acgtattggc aataactagc aagcactttt 15121 gattattgaa gccgcagcct ttctggccct ggctaatcaa atgaatggat ttgcttgtga 15181 cctgcgaacc tgtatttgaa tactacattt tgtattatgt tggtttgaaa agtcaactta 15241 atagtcatat tatttcaata gcttcttggc tactctgtct gacttcaggg gtagacttga 15301 gtttgagatg tgaaattccc cagcatagta tagcaaaagc tacatatacc tagacgttag 15361 ggcttggttt tattatttac ttactttatt tatttatttt tgagacagtc tcactctgtt 15421 gcccaggttg gagtgcagtg gcatgatcat gactcactgc aacctcaaac tctatgggct 15481 cagatgatcc tcccacctca gcctcccaaa tagctgggac tacagtgcac cagcacatct 15541 ggctaatttt tttttttttt tttttgtaga aacggggttt taccatgttg cccagggtgg 15601 tcttgaactc ctgggctcaa gtgattcacc catctcagcc tcccaaagtg gtgggattac 15661 aggcatgagc taggcctggt tagttttaga aacttatcta taatagaatg tgacactgat 15721 gtccttacca ggctaagatt tgaagtatgg aaaattgtag ggcgtggtag aatattttgt 15781 tgttactctt ggcagtatgt tttcatttgt gtttaggttt agtttgttta ttgttttgat 15841 cttttctcat ctttctgacc acaaaagaaa cctggaaagt atccatccta cgcctttagc 15901 tcttacctga aggccttgaa gactctccag caccaacacc ttggtctctg ttctggaatg 15961 aatttggaaa accaagcaca gccagtcaaa tgggctgttt ccttcccata taacttttgg 16021 ccttgaagct aagacacgtg gttctctggt ttctaaggtt ccttgggtct atgagggaga 16081 aggagaggag agattatttg aaagcaagga ttccacaggg ggatgtctgc cttcgagcag 16141 tggttcttaa cattttgtgg gtcattaacc aaaagcctga tagtaagaat ctgagagaac 16201 tactccaaaa aaagtaataa aacatttatg cacattgaca cagacttcgc tttttatttc 16261 tggggaccct gagtttatgg agtcctcaga agcccattgt tatttatcag gttaagaatc 16321 tctggcttag aattttggaa ataatttgtt taagaaatga aataaaagaa aatgaattgg 16381 cattttccac ccagtcattc cctgagctta tgatgtttta ttcttcactg tgggaattcc 16441 ttcttatcca tgggattgga aggcggtgat tggcctatga gaatgtctcc tagagctggc 16501 acaattcccg cacctgtact tcatgatcct tttccctttg aaggtcaggg gaatgctcct 16561 attggctcat tttcttgagg tcttaaagac tctggcactg gttgggcctg gtggctcccg 16621 cctgtaatcc cagcactttg ggaggtcgag gcaggaggat tgcttgagcc caggagtttg 16681 agaccaggct gggcaacatg gtaaaactcc atctctacaa aaaatacaaa aattagctgg 16741 ccatggtggc acacacctgt ggtcccagct acttgggaag ctgaggtggg agtcttactt 16801 tagcccaagg aggttgaggc tgcagtgagc tgagatcacg ccattgcact ccagtctgag 16861 caacagggca agattctgtc tcaaaaataa ataaataagt aaataaagac tggcagtaat 16921 gtagtttctt aaatctaaag aaaatatctt aaatttggat ttcttgtatc aaggtttttg 16981 ttttttgggt tttttttgtt ttttttttgt ttgtttgttt tgagacagag tcttactctg 17041 tcactcaggc tggagggcaa gggcatgatc tcagttcact gcagcttctg cctcctgggc 17101 ttaagagttc ctcccatctc agcctcctga gtagctagag gtataggcgc acaccaccat 17161 gccaggctaa tctttttgta ttttttgtag agatggggtt ttgccatgtt gctgaggctg 17221 gtttcaaact cctgggctca agcgatccac ctgccttggc ctcccaaagt tctgggatta 17281 taggcgtgag ccaccgtgcc cagccgaatc aaatttttaa gaactaaggc agttgctatg 17341 taggtttgtt ttgttttttt gtaatgattt cttccccctg aatttcccca aatgttttgc 17401 tgtttctgca atactatgct ctgatctgga agctctacag taaaagttaa acctaatata 17461 tttgggggct agggtggcag gtaggctgag ctactaatag tccatggatc agttggaggt 17521 tggttccatg aagcaaggag ggggagactg gacaatttac tggccctcca cctgtttctt 17581 tccacgcttg ctatcttgtt tgtcttatct ggctgtacag cttctctctg cagaatattt 17641 ccttctctca gaagtaacgt ataccattta tgtgcatttg tttagttgtt cattcattac 17701 ctcacatagt tagtgatatt tcctaaaccc ctactttggg gaacagagtt aactaggcta 17761 taggagaaac atgaaattta cagatgttat aataggggga gaagatgtgt acatgcagaa 17821 cttttctcca gggtgcaggt gatccgtcaa gtggatctgc tgcttccatc tcctcacctg 17881 ccatgacatt ataatttgtt tctcctgtct ggactgctat atgggcctta aaaatgttct 17941 ctgtctgttt gctctcaccc acctcctttg gtgaaatctc ctgtaattgc tgttaccaga 18001 atgtcatttg ctgcttcaga ctgttggctc ctcactgcct gctctgtcag tgggcatgat 18061 cctgaccttt ttggcccttt accaattgca ctctctttac tcaactcctt tctccggccc 18121 aaagtacact ctccatcctg gccaagtaca ttcatttggc atatgcatgc tgccttgccc 18181 tgcccatgcc ctcccgcctc ctgcagtctg catgcttccc ctcaccttcc tgactcccac 18241 tgcactctcc cagtgtgaaa ttctgatgtt tcctaccaga ccatgttctt tttatatatt 18301 catctgttca gcaaatgttt gtttagtaaa tgctgtatgc caggcatttt gctaggcaac 18361 agggaaacaa agttcttgcc ttcacggagc ttcagagtcc tgtgggggac acagacaagt 18421 aaatagtact ttcagtttgg agtgatcagt gctgagatag aaagtattag atgccccagg 18481 gcacatatta aagggacaac ttggtatagg ggaagggaga gatgtccggg agatgttcca 18541 aaggcagtga gtgacccagg ctgttgaaat tgagtattaa gttccttagc caaggagtga 18601 aagaaaactg gagcaaaaca tcatctgcca aaaagccatg tattactgac ctcagcacac 18661 caatgtggct gagtgaggcc cgagttgggt gttgctggct aggggtcccc ggcttgcaaa 18721 gtgaccaaga agaagaatca cttgtttgtg actttcaact ttgtaaggta ttttaagttg 18781 gtacttggac aagatggctt tttctttgtg tgtgtatttg aacaaaatgt tcccgtttgc 18841 agcactcatt gagtggtcat tgacaccagt aatctataca tttgcccttt agtggtgaaa 18901 tggagttgtt tgaggtgtca gcttggtttg gagtgtcact aaaagccttt taagcctgct 18961 tcatcacagt agccctggga atcaacgaga aatgtctctg agttaagagc taaaattaca 19021 aacatccagt ctgacctgat catgaggtat cttacaatgg ttccaactcg gtgacattcg 19081 acattcgtac tgtagcactg cctctgtttg tttgttagtg gtcatttaac attcaaagga 19141 agaagatgct aatggccaag gttcagagat aatgtttcta gagtttgctc tgtgttatat 19201 gttttgtttt gtttgagacg gagtttcgct cttgttgccc aggctggagt gcaatggtgt 19261 gatcttggct cactgcaacc tccgcctccc gggttcaaac aattctcctg cttcagcctc 19321 ccgagtaggt gggattacag gtgcccgcca ccacgcctag ctaattattt gtatttttag 19381 tagagactgg gtttcgctat gttggccagg ctggtctcga acgcctgacc tcgtgatcca 19441 cccgccttgg cctcacaaag tgctgggatt acaggtgtga gccactgagc ctgacctgtg 19501 ttatatattt ttatctggat cagtaggtct tttgttttat ttgagaggga gagagtcttg 19561 cactgccacc caggctaaag cgcagtggtg caaacatagc tcactgcagc ctcaaatgtc 19621 agagttcaag tgtgaatcag tagttcttca tctttttggg gtcatggccc catttcacca 19681 cccagttaaa tttatggaaa agtatacaca gaggctggtc gtggtggctc acgactgtaa 19741 tcccagcact ttgggagatc aaggcaggca gatcgcttga ggtcaggagt acaagaccag 19801 cctggccaac atggtgaaaa gttttctcta ctaaaaatac aaaagttagc cgggcttggt 19861 gatgagcacc tgtaatccca gctactcagg aggctgaggc aggagaattc cttgaaccca 19921 ggaggtggag gttgcagtga gccgagatgg caccactgca ctccagcctg ggcaacagag 19981 ctgtctcaaa gaaaaaaaag aaaaaagaaa agtttacaca ggcacacaca gaattgtata 20041 taccatttta gaaggttcct ggatcctcta aagtccctca tctcccttta gccctcggga 20101 tcattattgg ttcattctaa caaggtccat ataaaatgat tgccatttta agctaactgt 20161 gctatccatt gatgccttgg ttcctttctc accattctgg tttccttgca gttgataact 20221 cgcacacgag aaacagtctg aggcccctta cacatctgct gctaagaatc actgtcctgt 20281 acttcccttc ctctcttctc tggaaataat ggatgcatat gtatttgttg gagaagtaca 20341 aatagatgag ttctgcccaa gcagagaaaa agctcttaca tatttgtgtg aatatacttg 20401 tgcaaataga aaatagaagc tattcacata tagctgtctt caccactggc ctttttctgt 20461 ttccatatta aatgtttttc aggttataaa gccgcttata acgtaagatc aaaattgtgt 20521 tatttaaaaa ataatgaagc tcatgtatcc atgcttatat ataatagaag gtgaaaggaa 20581 aatactgaag gcacagctac tcggagacca caatgcagat gttgagactt tgctattatt 20641 tggaatttta tttactgcga aattgggtgg gagagaaaaa agaggagtaa gccttcttag 20701 taaactgtgt tgctggcttt tttcttctga cgatccactg ggtattttca atggagatga 20761 ggaaaggatg tgtttcagat ggaaaccttt atgaactctc ctgtgagctc tccagcttct 20821 caatccatgg gccctcattt tggtttctta ttttaatcct aatttattta gaaagggtaa 20881 tattttttga aatgctttga aaacaatcaa aattacattc aagctgtggt gagtaaaaat 20941 aaaaacacag catcctaaga atcacatagt agtgtgccct gggagttcct agttcacaag 21001 aagatcatgg atgttaacct gagagactta ctgaagtcat ctaggggaga tgggtcaaga 21061 aatagcccca ttttatagga aatccagctc agagctgtga ctgaggtcat gaggctggtc 21121 atggaattgg gagtagattt gaccttctag ttcccaatcc agggttcttc atggcttcta 21181 tgccactggg acttagtgta aatctcctta cctctttgag tcctaaattc catattccga 21241 tagtgtatgc ttatttcctg tgcttcagag ttattctgag aatcaaattc tataacgtat 21301 gcttctcaaa gtgtgattcc ccaggccggc aatggcagca tctcctggga agatgtgaaa 21361 atgcagattc tcaggcccca ccccaacctg aatctgaaac tctgggaggg gcccaacaat 21421 ccgtgtttta gcacaccgtc caggggattc tgactcatga agcttgagag ccactgatga 21481 cacgtgagat agcattttga aaagaagaaa gcattacaga aatacaagat accttgtttt 21541 aatggaggta aaatgtatat atggtgaaac acaaagatct taaatgtgta atactgaatt 21601 ttgatataat cagtgcccca gtgaagatac agaacttgtt catcccttat aaagctccct 21661 cttgcctcct cccatcagtc cccacccaac ttaggcagcc agtggttaag gacagactat 21721 tccttagaga acataagaga actcgatgat gggttaaacg tagaaagagc aatgtctgtg 21781 ttctcgtatt ctttcactat ttgtaggtaa tgttcctttt aaaattacta accatatttc 21841 tgtgttcttt ttcagcccat ggaccaagct tctctcaaaa acagcgatgt tcttgttctg 21901 acagggctta cccagatccc cactgcaaac ccagatggaa tggtgggaga gttctgcagc 21961 aacctaggtg tgcaaccgtc tctcatctta cgttggatga tctatcttgc atttatttta 22021 caataataaa tataatattt tacaataatg ggggaaggag tgcttacagg gtagcagttg 22081 tcaaaggagg gaggcagtat atctttgcaa ataatagcac agaaaagagt gttacacttt 22141 gaactcacag cagcgataca gtgaacagat agatatgtat gaatgtttgt gtgtttgttt 22201 ttgagacaga gtcttctctg tcacccaggc ttgagtgcag tggcataatc ttgggttact 22261 gcaacctctg tctcctgggt tcaagcagtt ctcctgactc aatctcctga gtagctggga 22321 ctacaggcgt gtgccaacac acccggctaa tttctgtatt ttttgtagag acatggtttc 22381 accatgttgg ccaggctggt ctggaactcc tgacctcagg caatccgccc gctttggcct 22441 cccaaaatgc tgggatttcc ggcatgagcc acagtgcccg gccaaacagg tatatttttt 22501 ccccactaat atttggttgg ttttattttt tcttcttttg aggaaaggct aaattaagag 22561 aggtatgggg cattttctac ctggaagaaa tttattttcc ttcggatata actgtcacta 22621 aatctggaag ttctgcttct catttagaca aataggttgg ttactgtctt agttagtttg 22681 ggctgccgta acaaaatact gcagacatta acttctcaca attctggaga ctgggaagtc 22741 tgagattagc gtgccagcat ggtcgtttct tgatgcagat gattgccatc ttgcagtgtc 22801 ctcatgtgga gaagagggga agctctggtg tctcttcctc ttcttttttt tttttttttt 22861 ttttttttga gacggagtct tgctctgttg cccaggctag agtgcagtgg cacgatcttg 22921 gctcactgca acctccgcct cccaggttca agcgattctc ctgcttcagc ctcccgagta 22981 gctgggacta caggtgtgcg ccactgtgcc cggctaattt ttgtattttt agtagagaca 23041 aggtttcact atgttggccc atctggtctc gaactcctga cctcatgatc cgtccgcctc 23101 ggcctcccaa agtgctggga ttacaggtgt gagccaccat gcctggcctc tcttcctctt 23161 cttatgaggg catgaatccc atcatggggc ctgcaccctc gacctcatct aaacctaatc 23221 acttcccaaa gtccctgcct ctctgtacca tcacagtggg ggttaggcca acatgagaat 23281 cttgtggggg acacacacat tcagtccgta acagctacca aagaggtatt aatgagctca 23341 gaccttcagc tccagcaact ttaagtgata ttacttctgc tctaggaaga agaagtggtc 23401 atcttatatt tacacggaag gcactgttct tagaaattaa acttagccat gctaataaac 23461 atagtctgtt tttgttcttt gatactaatg caaaggtaat ttatttgtac cttagaaaaa 23521 taattggact aatctcaaat agagtcttgg tttgtatgtt tgtttataat ctagaatcac 23581 agactcaaag aactttaggc ttgaaaggaa ccttacattt aattcagtct cccaaagtgg 23641 ggtccactaa ccgcattccc ttaagaccaa tgggattact tattaaaaat gcaaatttgg 23701 gggccctacc ttagacctag taagtcagaa tctctgggga aaggagactt ccagaagaaa 23761 agttgcattt tcaacatatt ctctggcatt ttccacgcaa actaaagctt gaaaattact 23821 gatctaattc attcttttca tgtaactgat gcagaaactg aggccaagga aggttgtagt 23881 ggctttcctg tggtcctgtg ggttgggaca aaggtaggat ttgagacagg ctcttgagct 23941 atgaccagcg atgttgattt tctccactgt atcctactct agtaccatac tctagtaata 24001 gcaagtccac cagccctcaa gttatagcat ctaggtgagc ctaagtactt aaagtatagg 24061 ggattttcct gcagacaaat gttaatgaaa gaaaatacta ctaactcctg cagacaaacg 24121 ttagtcaaac agaaaaactc ggcctatttt cttataggtc attcagccat ggtcagagac 24181 tgaacagaga caaatccagc aaatttttga gcaggatcta aaacgggaag gagcttggag 24241 gctctgtcct gaagctcagc tgccattggt aaaaacccaa acccgtagtc acatgctcta 24301 ttcccaggga cctagattag acaatgatga gaaaatcatt atcagcctat agcatcccct 24361 gctttgatgt gttcttcaaa agaagcagct tattagacat gtaagtaaat cataaaaaca 24421 gaagtaggaa aacaagtgca aatcttattt tacaagttta tctttataac actgcccttt 24481 tgatatgatg ttttttctcc tctggcatcc acttttctag ctctgacagt ccggaatgga 24541 ggaaacgtgt tggttccctg ctacccttct ggagtgatct atgacctcct ggagtgccta 24601 tatcagtaca tcgactcagc cgggctttcc agcgtccccc tctacttcat ctcccctgtg 24661 gccaacagtt cactggagtt ttcccagatc tttgctgagt ggtatgtccg tggttttttt 24721 ttttgtgtgt gaattttatt tgattcagga cattcaagca gtaagaataa aaataatcct 24781 gttttttctc acattactgt ggaaatttca ttttgttgtt tttctgtctg tgataagatt 24841 gcattattaa aagccaaatc tgttgcattg ctaagtttag aataatagtt gtcaaagagg 24901 gaagaatgca aggcagagac ttaccttagc ccagcacttt caaaactggt aacaaaaatc 24961 ttatatactt atcacatgtc accctctgcc tgttactagg tgaaatgaca ttctaaaagt 25021 taaaaaaatt ttcaagccca atctcatgtt gtctaaaatg tatagtgcca aatctgagaa 25081 gaaaaactag atttttaaaa attgcaatag tatgatattt gacaaaattt tattacatca 25141 gaaaattgat caaatcctag agttggcaaa atatgaaaca atatgaaatt agtgaacctt 25201 tttagagtta tttaggtgca tgtttgaatg taactcacct gaccaaaaat aaagggagaa 25261 gaggaaaata acttttacaa tatccccagt ggtgccttag aatggtgctt cccaaacgtt 25321 ccgggactgt gacacaggca gtctaggctg catttaatcc cttttagtca tgaggtagcc 25381 gatagacaca gcatgtactg agtttctaat taaaaaggaa tttgtacatc atcttctcat 25441 gatatattca gttacgctgc ccccaaccct tgcttttgta aagtactttt ttcattccct 25501 tctgtggtcg tttttttccc ccctgtgttt agactcatac aggcgtctct atcccatgta 25561 caaattattc ttctttgtca cttttttttt ttttttgaga cggagtcttg ctctgttgcc 25621 caggctggag tacagtggca caatctccgc tcactgcaac ctccgcctcc tgggttcaag 25681 caaatctcct gcctcagcct ccgaagaagc tgggattaca ggcacccgcc accatgcccg 25741 gctaattttt gtattcttag tagagacagg gtttcaccat gctggtcagc tggtctcgaa 25801 ctcctgacct caggtgatcc acccgcctcg gcctcccaaa gtgctgggat tacaggcatg 25861 agccactgcg cccaccctta aataacatta gtacattatt attaactctg aatctttatt 25921 ctgattgcac cagtttttcc acaaattttt tttttttgtt tttgtttggg atccaatcca 25981 gggtaacaca ttgcatttag gcctttgatt tttttgtttt tttgcaagaa gtttttttta 26041 gttttttata ctgatagttt tagtctcttt tgcagtttct tctgttgata ctatgtttag 26101 aaaattcttg cctctatagg tgtcacatgg ctaaacatac tttctttcag ttttattgta 26161 gcttctttct ttcttttttt acatcacccc ttaactattt tatctggaat ttgttttagt 26221 atatagtatg aagagaagca ctaatttcat tttttcccaa gtagtcaagt acttacctgt 26281 ccaagtacta tttattgagt aatgttaact ttttcagctg atttgtatta atgccatatg 26341 ccagactttc atatgcacca ggttttgttt ctagactatc ctgattgagt gatccattca 26401 ttctttggcc aacatgatgc taatatattt taataactgc agcctcactt ataattgtac 26461 tctgtggtaa agtacatttc tccattattt ttcttagaat tcttggagct atttttgctt 26521 acttattttt gtggaagaat tgtggaatca ctgtatcagt tttcagaata tctttttgag 26581 tccacaaaac ctataaatta cagtttgcag tagttttccc atgctgagac atgggatgtg 26641 tgtctgtctt ttaagctttt caaatattcc tcccgtagac tcttaaactc agtgatcata 26701 ttattcttgt ttccatcgat agttctattt gcttaaatcc ataaaccttt aagtgccaaa 26761 gcactgagga tacaaagagg tccctgacct tgaggaatct gtaccatgaa ggaagaggca 26821 gctgtgtaaa cctcttacca ctcggaagta atctgatgga aatatataca cacataccca 26881 cacacacacc tacgtatatc tgtatggtat tcagagaagg ggtgggtggt gaccccattt 26941 ggggggttaa gaaaggcatt ctggaaggag gtgctcctga agaataacca agaatcagcc 27001 agacagaaac actatttaag gatgagttgg gtggtctgcc ggcggtgatg tgtgggtgga 27061 gaggataaca caagccaaga catagatggg aggttagaat ggtttggttt gttcagagaa 27121 ctatccatag ttctttattg ttacagtatg aagttcaggg tggggagtgg cagggtatga 27181 ggctagaggg atcctgtcca tgggggggat tcattggagg attctaagca ggaaatgaac 27241 atgattatat gtgcatttta tatagagcct tctgcattta tgtgaagttt gttgggaggt 27301 ggtgggaggg ggtgcaactg aagtacaaga caagagtctt tgcagaagtc gagggactga 27361 agactccagt ctctaccatc ctggaggaaa gcaaggcagg aacccatatg agaggtgatt 27421 aggaaataca aggggcagga cttactggtt acttgataca gaaaaggtag caatcaagat 27481 tgacaccaca atttctagtg tagtagatcg tgttgacccc aaacaaaata ggttctacaa 27541 aggaagggta ggttcataca gcaagtgtgg ttagcttagt ttggttttgt ccctgagggc 27601 attgacggtg cctgaggcag gggatgtgca ggtgaaactt gtccaatcca aagatctgag 27661 aagcccaggc tggagtcata ggttggggtg tcctcagcgt tgaggtagtt gagtggctgg 27721 gattgccaca agaatgaatg ggattgtctg gggagaggat ttgaggttag aagaacaggc 27781 agtggggaaa ggatggactt aagtaatgcc tgcatttttg gggtcattag agaacaaata 27841 tttaggaaaa gtgtgaagac aaatagttaa agaagtagaa gaggccgatc agggtggctc 27901 acacctgtaa tcccagcact ttaggaggcc aaggcgggag gattgcttga ggccaggagt 27961 tcgagatcag cctgagcaac atagcaagac ctcatttcca caaaagatta aaatattagc 28021 agggtatggt ggtgcatgtc catagttcca gctactcggg aggctgaggc aagaggattt 28081 cttgagcctg ggggatttct ctgtgtttct gtttcactgt gctgttctct ttcatgcagc 28141 cttgctgtaa ggcacccttt ttccctaaat aaggaactca gttaccaaaa tggagagctg 28201 ctagctccag acttgcatta acttagcaag tcccagcccc ccatgccagg accaccacaa 28261 gcctgtgctg agggtttggc ttcctctcct ctttggtgtt ctgaacgggt gcttcacagc 28321 ctggctgctc tgtgctcagc ctcaggcccg gcctgctgtt ccctatcact ctggttccct 28381 ggctctgtgc ttcccgttct caggggttct gctctggctt ctacatggtc ctgctttgat 28441 gcctgcagaa gcccagcccc ttgctgtcca gtgtctgccc ttgctccgag ctaaggggct 28501 tggttgtttg ggttggtttt gtttttgcag gggatggaga tgggagggaa tagctcttga 28561 aagacctctc tgatcttttg gagtttggag tgttggggtt cggagtgttg gttggttggt 28621 ttttgagaca ggctctcact ctgtcgccca ggctggagtg cagtagcaca atcacggctc 28681 actgcagcct caacctcctg gtctcaagcg atcctcccac ctcagcctcc tgagcacctg 28741 ggactacagg tgtcaccatc atgcccagct aatttttgta cagacaaggt tgcatctcgt 28801 ctgaacccat gaactcctgg gttcaagtga tctgcccgcc ttggccttcc agagtggtgg 28861 gattacagtc ctgagccaca gtgcctggct ctgatccttt tttgaacaag cagtggaaga 28921 gtgtgcggta cctgaggtct ggccatcagg gagcaggagg gtctgtcaca ttcccaatta 28981 gagataatcc tagaagcgcc atttattctt cattcttcct gataatctgg tatacacaga 29041 tctccttttg aactctaaca gctaccccca gaagaagcaa actctaatca ggtccttcag 29101 cctctgtctt agaaaggggg tgggtccctg tctgctgtgc ctgcatgagg attctagagc 29161 agagtatgga ggatctgtta gcagaactgg cctaagcatt atgtaggtgg gcttcacaat 29221 ctctaatcat attgtaatct cttctgtatc cctaatctct gcctttaatg catgtaggat 29281 aatgtccttt ggaacaatca aaataagttt agaaccaagc tcttatattt gtctccctga 29341 gctagaaata aagacagaac tagtgtctat ttagataata taaggtaacc ctccaaaagc 29401 atcttgctct tccatattta tatcttccaa gtagggtata aagtgatgtt tttttaaacc 29461 aaacttaaac gaaactaagg gtaggaaaaa ttagatacaa tgtattaata caaaatccaa 29521 gccctgaagt cctgagctcc tcccctcaaa gtagtgacta tttttttaaa tgtcaaacct 29581 gcacaacacc cacatatatt gatttatcaa ctgtgaactt tttgccacat ttgctttatc 29641 cagacatctc agtattgtaa agtcataact gactaggaaa aagcaaatgt aaattaccaa 29701 aaacattcac attgtctcta gcctgtgatc ctttgttctt ctctagttgg agttaccaat 29761 gctgctgtta aaaagagtgt gagggccagg cacagtggct cacgcctgta gtctcagcac 29821 tttgggaggc cgaggcgggt ggatcacctg aggtcagcag tttgagacca gcctggccaa 29881 catggtgaaa ccccgtctct actaaaaata caaaaattcg ccgagtgtgg tggcaggtgc 29941 ctgtaatccc agctacttgg gaggctttgg caggagaacc actggaaccc aggaggtgga 30001 ggttgcagtg agccgagatc gcgccattgc actccagctg ggcaacaaga gcgaaactct 30061 gtctccaaaa aaaaagtgca tggacaaaaa cagaagccat gtctcaaggt gtagatcact 30121 ttctttgtga aattgaccac aactaaatgc aatatgatac cacggattgg atcctggaac 30181 agaaaaggga catgactgga aaaactagtg aaatctgaat gaagtctgga gtttagttga 30241 ttgtcattgg cctgatgtta atttcttagt tgacgactgt gccagtcata tcagatgtta 30301 actctgggga catagggtga agaggccatg gaaactctgt actgtctttg cagcttttct 30361 ttaaatctaa aattattcca aaataacaag tttatatttt aagaaaaaat gtattgagaa 30421 attctaaagt ttaaaaacat acaagataca tctcttctct gtaggcactg gatttcattc 30481 acagtgaaat tcactggcgg gaaattttta aataaacttc agtatttaat atttgcactg 30541 ctgccactag gtggcaacag atgccaccgt atgctcttcc tcacatgctg atgtgttttt 30601 cctctttaat aggctttgtc acaacaaaca gagtaaggtg tatcttccag aaccaccttt 30661 tcctcatgca gaggtaagaa aacaaaatca ctgggacatg ggaaggaagc aatgtggata 30721 acctgatgca gatgcagaca gcaggtcatt agatgaaata gattgctgtg taaacctgta 30781 gacccctttg cctcccaagt cagacacagg gaagtatttt aactcaagct tcacttgctt 30841 tcctcctatt aacactttct attgcgcacg tggagcagcc cttctccaaa atgttgtgga 30901 ccgcagaatt gtttcagact tgggattcgg gaatatactt actggttgag catcccaaat 30961 ttgaaagtct gaaatcaaaa tgctccaatg agcatttcct ttgagcatca tgttggtgcc 31021 caaaaagttc agatactgga acattttgga ttagggatgc tcagcctgta ccatgttcat 31081 gcaattcata gcctgcttct gttctactga ctgcatgatg aattgtattt cgatacatat 31141 tactaccttt ttaaattggg tttatgtatt gtcagagtgt tctttccagt tatgtcagtc 31201 atatatgtac atttttagtg acgaaaataa catttcagtt caacaaataa aaggcttctt 31261 cctccctcac agaacaaatg ggtgttttct atatagctga atacctagct ttgttgtcag 31321 gttcttttca cccaagggta tattatgaac gtttttctgc gtctcatgtt attattgctc 31381 tactacaatg aagctaacag acaatagtta ctcctcattt ttggttatat tttcactcaa 31441 agattctcta aattggtatc accaccttag aaaactgaca gtattggctg ggctcggtgg 31501 ctcacgcctg taatcccagc actttgggag gccaaggcgg gtggatcaca aggtcaggag 31561 atcgagacca tcctggctaa cacagtgaaa ccccgtctct actacaaata caaaaaatta 31621 gccaggcgtg gtggcgggtg cctgtagtca caactgctcg ggaggctgaa gcaggagaat 31681 ggcgtgaacc tgggaggcgg agcttgcagt gagcccagat cgcgccactg cactccagcc 31741 tgggtcacag agtgagactc cgtctcaaaa aaagaaaaaa agaaaactga cagtatctgc 31801 taaagctgaa caatgtactc tatgcctccg cagttttgtt cctaaagtat acattgaaca 31861 gaaatgcata gagatgttac caaaagacac acacacaaat ctagaatttg gtcaggtgcg 31921 gtggctcaca cctataatcc caacactttg ggaggctgaa gtgggaggat cactggaggc 31981 caggaatttg agaccaacct tgacatcatg gcaaaaccct gtctctacaa aaaaatacaa 32041 aaaattagcc cggtgtggtg gcacatgcct gtagttctag ctaccctaga ggctggggtg 32101 ggaggatcac ctgaagctga gggagttcga ggctgctgca gtgaactgca atcgtgctac 32161 ttactgcaca ccagtctggg tgacagagca agaccctgtc tcaaaaaaaa aaaaaaatct 32221 aaaatttttg gtaatagtac tgaaatatac tcaaattccc atcaacaata gcatggattt 32281 tgtggtatac tcacacggtc ccttacatca ctgtgaacaa ataagctcca attatatgca 32341 gtgtagataa actgcacaaa cataatgtga gtgaaagatc cagatataaa agagtagata 32401 tggtatgatt ttatttacat aaaagttcaa aaacacaata aactgatctg tggtattaga 32461 tgccagtgtg gtagtgatcc tggaggggag gggacagtag tgacaggaag gggacaaaga 32521 gggatttctg aggagctagt aatgctttat ttcttgatgt acatgtgttc accttgtaaa 32581 aaatccatca aggtgtagag agttagatat aaggaaagag tgaaggctgg aatgaatcct 32641 gtgctgttgg atagaattga tggtattggt gtgaactcct attttcaata tatgtagata 32701 cagaaagaaa tccacttgtg catgtgtgtg tatgtgtgtg tctgtgcaca tacgtatctt 32761 ccagctctgg ccacacagag ggcctgggag cagtgacatg ccactaactg aggaacacat 32821 ttagctccca catgttggtt tctagatacc attctccact aaaaggaacc aggcctcttt 32881 ggaaaataca agatgaggct gtaagatctt gctgtatgct cagagaaaga tggggacatg 32941 tcagaagcca catctgagat cactggaaca tcaaaataaa taatgctagt aatgaatata 33001 atccactgaa taacagaaac tcctgcatcc atagtgaggt aactgagtac ataggcaaga 33061 ggggaaagtt cttccaacag taaactcata attaacatag gaaagaacct tagaattaga 33121 aaatcaccat ttggcagcca ccgcagtaat aatttattcc tgcaagaaac accagtgggt 33181 gctaaaacca gtgggtgaaa atgttatgaa gaactagatc atttatagtc ccaaaaagta 33241 tgtccccaca aaagtcatgt ttattacaaa gacagaaata gtaactggag tttggacaaa 33301 cttgacatat gcaatcaacg ttaacatcac cagtaattgg actaactgac attgcgtggc 33361 tcttaacaca aattattgag aaagcagcat gatttctgtg atcctgctgc taaaaatgct 33421 tcacctgaat ctagtgagca ttcagaccca agtcgaggat gctcaacaaa ataactgacc 33481 tgtacccttt gagaatgtca gagacctaga ggacaaggga agactgagga actgccgaga 33541 gaatgaagag atgtgacaga tagatgtact ccatggccat gggctggatc tggaaatgga 33601 agaagaaaga tctagtttgt ttgctattag gagcattgat aacagttggt aaagtctgaa 33661 tcgggtgtgt agatgagagg gggcagtgtt gtgtcactgt tcattccctg cttttgatgg 33721 ttgtactgtt ataatacatc catgttaact gcgattatct ccccacactc atttctttga 33781 ttgtcatatt tataacccct cctcaactaa ggcaggtaga ctgtttttac ttacagcatg 33841 tcagtgcaga tagatatgtt tagggattta gttgttttgt tttatagtta actaacacgt 33901 atttcaacaa atgtcctgct aattacttta aatgtaattg ctgttttcat actgtaaagg 33961 ataggtcttt tatgaaccag gatgccaagt agaaggtttt gaagaagtta ttttttggtc 34021 cctgtagtct aaatagtatt ttggcagcca gggtttttgc aagctgtgtc aatgccatag 34081 tgaaacacag gctagaaata ttataaaaat gtcagaaaat taagtgtggc aaaacatctt 34141 gtggtggact ttgctcttga atgtctgttt tgcttccttt gcagtcagcc ttgctgtaga 34201 gcttgttttc taggagtgtg atcacattct cactcacaca cctgtcacaa atgacctggt 34261 gccatttaga gttaggaatg tgagtagact gtggtcgtac catgagggtt cctcaggtgc 34321 acttgtcgtt gttagggcat gagggagtca acccttggta atgttaccaa tgcccatgag 34381 aaacggtggt tccaccctta gtactggtaa caaattactg ttcagaattc ctgccccaca 34441 gcttcatttc cactggtcaa atgcagtaag ttggctagaa aggtagatcc aattggcaaa 34501 aaacgatgaa tttatcttag tttctgtgca ttgatcagta gagctacagg aactatagat 34561 aatgcttaaa agtgacttac gtgtgcagag acctgctgct attcttagaa tcacattcat 34621 catcttgaca tcttaggata caatagaccc tttttgacag ccactcaccc atttaactga 34681 gacaactaat gattttggcc atatagttta taaaaagaat gtcagttcaa cttgcagact 34741 acctggaagg aacgtgggaa ttcgatgttt gctccggctt tactattcat attccatcca 34801 agcatgcgac agctgatgaa gatctccagg atagtgttag tgtcttccta atacaaccag 34861 gtctcttcaa ttaaagatga ggtcttcaag gtgaagagag tttggcttct gtttggggta 34921 tgtcctattc tggccacatc cccactctta gggtgacttc atttgcactt caaggtgttg 34981 cccagggccc tctcatgcac aacatgtggc aacaggattg agcctatcac aggccattgc 35041 tttatccatg aaacagcctt ccagagcagt gcttcctttg gcctggttga tatttagggt 35101 ctgtgaagtc tgggtgtcta gcctctggat gctggggtgg ggcaaggagg cctgggcagc 35161 aggcacagtg tctgagacgt tacaagatgc catctagtca taactgtctt tgctattgcc 35221 ttgaatgggc ctgacactgg gagatgattg tcaagtgttg tgctgcaggg gagactcttg 35281 gttcaacacg tacacttgaa agaaagcttt gaggctgcgg ggcacctgct tctttttttt 35341 tttttttgag acggagtctc actgtcgccc aggctggagt gcagtggcgc catctcggct 35401 cactgcaagc tccgcctcct gggttcatgc cattctcctg cctcagcctc ccgagtaacg 35461 gactacaggt gtccgccacc aggcccagct aattttttgt atttttagta gagacggggt 35521 ttcaccatgt tagccaggat ggtctccatc tcctgacctt gtgatctgcc cacctgagca 35581 tcccaaagtg ctgggggttt ttttgtgtgt gtatgtgttt tttttagtga cagggtctca 35641 gttacccatg ccagaataca gcgttgcaat catagattac tgcaaccttg aactcctggg 35701 ctctagccac agtatccaac aacttttttt attttttgta gagacagggt cttgctttgt 35761 tgcccagcct ggtctcaaac ttctgggctc aagcaatcct cttgtctttg tctcccaaag 35821 tgctggaatt acaggcgtaa gccattgtgc ctagcccatt tcttaatata actgtctgtg 35881 ttaccaggac atcacatttc taaaagccaa tttgatcttt gtcgtgcatg tgtgtgtgcg 35941 tgtatgtgtg catgtgtgca cacatgtcca catgctgtac acattcagag aagcttctct 36001 agtagcaaac aacagaaatg atccctgaaa gtacagtctt tggtcttggt ccttattcag 36061 ttgctgcagt agcttaacac agctctagct ttgcaggagg aggtcctgta ctggcaaaca 36121 gtgtttctgg tgtgacagat gtggttactg tcaccaggac ttggtgattc acgagtgttg 36181 ggaaagtcac ttgtacttca aacaagaagt gataatgaga acttcaggcc tggtgtggag 36241 tgtcaggcag cttataaagg aagagtccag ctaaagcagg ccataacaat ctgaatatgt 36301 ttccaggaag tatgtcagta ttaccagaaa gacttgactt gcccatgtgt tccacaaatc 36361 acattctggg taaaaactat tttaataaga ttcacttgta tttttttaaa ttaataagtg 36421 ttacttttca cagcagtttt aggttcacgg caatcatatg cccctgcccc acacacgcag 36481 ttgcccactg caccatccca caccagagag gtgcgtttgc tacggctgat gaacccacat 36541 tgacacgtca ctctcgccca aagcccagag tttacagtag gggttccctt ggcgttgtgc 36601 tttctatggt tttgaacaaa tgaacagtga cctggatcca ccattacatc atcacacaga 36661 ggagcttcct cactctgcag atcctctgtg ctcagcctgt tcatttcact ctccacgaat 36721 ccctggtgac cgctgagcct tttactatct gtatagtttt gccttttcca gaacgtcata 36781 cagttggaat cataggggcc ttggcttttc agagtggcgc ccttcactta ggaataggtt 36841 ccttcatgtc ttttcgtagc ttggcagctc atttcttttt tagggctgaa taatattcca 36901 ttgtctggat gcatcagttt catccttcac ctgctgaagg acacatcttg gttgtttcca 36961 cgttttagca attaggacat tcatgtgcag gtttcttgtg gacatgattt ttcaaaatat 37021 ctttcaaagt ggctgtatcc ttttgcattc ccaccagcag tgaatgagag tccttgttct 37081 tccatatcct tgttagcatt tggtgctgtg agtgttctgg attttggcca ttttattata 37141 acaggtgtat agtggtatct catcatttta atttgcagtt tcctaatgac atacggtgtg 37201 gagcattttt tcgtatgctc atttgccatc tctcttctct gatgaggtgt ctgttcaggt 37261 tttttgccca ctttttaata gggctgttca tttctttttg ctgaggtttc ggagttcata 37321 gattctgggt cacagtcctc tctcaggtgt gacttttgca ggtattttct cccaatccgt 37381 ggcttgtctt ctttgttggt attttagatc cagtcccgct caccctcccg tactttggtt 37441 cccccttcag cctgggcagg ctcacatttc tttgtatttt ttctatattt tccagctcat 37501 tcagaccaat aagctgaagc actaccccag catccacgga gacttcagca acgactttag 37561 acagccctgt gtggtgttca ccgggcaccc ttccctccgc ttcggggacg tggtccactt 37621 catggagctc tggggaaaat ctagtctcaa taccgtcata ttcacgggta agtgaaaaaa 37681 ataaagaaac aaattggttc tctccactga ggccatgagt gaatgcacct acaaggtaga 37741 gacccaggga aggattttgc agtgagacat aaatacaaac attattctac tgtaggtacc 37801 aaagaatgaa gaaaccgcag agaaagagtg aagcagtgtg tgccattgga cagctgggca 37861 tccagcgagg ccttcatgcc tgtgttttca gatttctcca agacagaatc ctgctgagtg 37921 cttttgctag gatatcgtaa gccatttcaa gaagtgcagt gattcagtaa cggtcttgtt 37981 ttacctgtta ggaattgttt acagaggtag atctttttct tctgattgtg gtttactcta 38041 actgtggatt ttcttctgga gacaaatccc tcaggggaaa aaattccttt gataaggtca 38101 agtagagtgt ttacatagat aatgactgta tcattttatc agtgtagcgt gcccagccct 38161 ttgaatgcta ggtctttttt gcttatctgt gataggggat atcttggaaa ttatgcacag 38221 accttttttt tttttttttt tttttttttt ttagctcatc agtcatcatt agtgttagtg 38281 tattttatgt ggggcacgag atagttcttc ttccagtgtg gcccaaagaa gccaaaagct 38341 tggacaccca tgtgttaggg tcttcagtcg gccttgggtt ttagaaatct tacaggctat 38401 gaagaaaaaa gaaaaaaaaa aaaaaaacat tgatttgaaa tctggcccag cttgcagcaa 38461 cctcagccaa ttcaccagca agcatgactg tccccacagt aaatgggact gtcagtagct 38521 acctctgtgg gtcactctgg gcaccaggca cagaacccgg cacatggcgg ctgttgggaa 38581 agcactgtca ccagctccct tcctagcttt aggagctggg aatccagtta caccagaagc 38641 actggggtga cgcttcagcc cttcccccag ctttcatttg tgacctagag gccaccagga 38701 acacgcctgt ggtcaaacca agttgggttt attgcctcat ttcagcaagg ggaacacaca 38761 ccatgggtaa aagaaaagca aaaagacctt gcaggactcc ggctggtgtt cggtgatgcg 38821 caggtgttcg cggaggtgag gcgtcaccct gtattgggtg gcgtcaggat gcagggtcat 38881 tctgcgatgg gtttcttaac tcattcttat ctagaacaca ggaagaatgg agccggcata 38941 gcgggaagtt tgcttatgct gtggtcagga cagttctgtg ttccgtgttc aggatgatta 39001 cagaggggtc ttgtctttgg ccggatccat cattgtcaga caaggtgttg gtgttccagg 39061 aagttgcgtt cacacagcag gaggacacat ggctttgctg tgggtgccag gccggctctt 39121 gctgatacca ggccaggcag aaagtgccag gagaggcccc ggtcaccagg actgctttcc 39181 tcttctcagg cctgctttgg gctaaaggtg gaggaagttg ggccacaaga tattgattga 39241 caacacccag aacttcatag ctgccaagat ttcattaatt aggaggttgt ccagagaatg 39301 tcctatgtag tggggctgag gttggtgtct cctgctcctg ctgctgagtg gtgactcgac 39361 atttgacatg acagtggtga cagcatctac acagcacagt agataacctg gcctttagta 39421 caaatgtttc ttcagctaaa aggaaatcag gactgtgtga tttcctgtga caactctggg 39481 taatgggttt gcatttaaac tggtttatgg ggcttccagg gcagaagttg tgtctgggag 39541 aggttggggc catctttttt tattgttttg tgactcctgg atacatgaaa agggggtcag 39601 tattctcaga gaagcacaat ccactggaat gggcatttat gtacctggca gctctgccag 39661 tttgtcctga caacagtgga gacgtctctg tgtctggtgt gcctaagcca gggtccctcg 39721 tcgctgggca cagactgtgc tgggaatcaa agtgtcacat cagttaggac cgagcgaggt 39781 cttttggctc aaggcaggca gctccctcga gttgggggaa tgttccctgc caagcaggct 39841 gcagcagccc tcaggagaca ggctgagcag agggcgagga ctcttcccgg tctgaggggc 39901 tggggctgct ggggagcatc ccagtctcag tctacagacc attcacgggc ctggaggcgg 39961 ggccgtgcgc ttgtcttccg ggtgcatctc acacctgggc gttaactcag agctgattct 40021 aggttcccgg gtctgtacca ggcctctcca ctgtgaagtc agtttttccc attgtattaa 40081 atcagtacct tgtgggggac tctttgaaac tatatacata ttctgttctc cctcaaaatg 40141 gtatctgata tttttagcat ttgttgatga ttttcatctg aataagtgat gaactgtaat 40201 ggttgccaaa cggtggtttt ggtttttatt tcatcgtttg tttcttggca tttcgttgta 40261 aaaagagctt tcttttctcc cccacatatg tatttctccc tcatttacct catctgcctc 40321 tgctgaagct tggagcccac ccacagggtc catcccagcc tgcccctcct tccacggggc 40381 ccctttgacc tccgtccccc acgtgtgctt cctggctccc tcctgacccc ctgactgtct 40441 gtgggccctc agcgccccag ttgctgtctg gcttggcagc tcctgtgtag tctgcattgt 40501 aagatttctt tcttgtactt tccctagaac cagacttctc ctacctggaa gccctggctc 40561 cttaccagcc gctggccatg aaatgcatct actgccccat cgacacccgg ctgaacttca 40621 tccaggtgtc aaagctgctt aaagaagtgc aggtaatgaa ggacactgct tgtgccttca 40681 cgtagtcatg tcaccttggt gtggctcatg cttgtgtggg gtgaggggag agagatctag 40741 ctgtgtttga ttcttgtctt cagttctcac gcatctgcag aatgctggga cacatgccag 40801 cccccctcca cactgaaaag gagtggtctt tacaccctga ccgcagtttc cattctaaag 40861 aaatcagatg tggaagggaa agaaaaccat ctgtgtccgc ttaaaagcaa accctctcac 40921 ccctgccaaa aaaaaaaaaa gtcattctag aaacatactc actaagctga gacagtttaa 40981 atgaaacgcg ttactggggc cgtgtcgcac gtgtaggctg gtaccacaaa cagtgctgtc 41041 gggtttgggt tttgtggcag tttttggtca tttgtttcac ttcacatttt ctgccctgga 41101 gaaagggaag aagtagctgg ggtgcagtgt agaccaggag gcgcgcgtag caggaaggca 41161 gggccacgga accactgtgc tggctcagcc actgctcgct gggtttctgg ctcttgagag 41221 tcgggagagg aactggaatt ggcaaggagg acagctgaca ccggcgagga agagctctcc 41281 ctttccactc cctggtgttc ccaggagtga gatgagggtg gaggggccca gcacagcacc 41341 ttcaacctca ggatgagaga ggccctttca caaaactcta aggcagggga acaggaaaca 41401 gagaaagccg gagaacccca ggagggcccc aagagcggat tctggtgatt attaatgtgc 41461 ttgcccaatg aagaaagaat actggcactc tctaggtatg atgagagcag acagcaaacg 41521 tggggcctgt ctacagtgat tcgctacccc aatgtatgct catccacgtt agaagcagca 41581 gtgaaaggcg tgttgctttt cattattaac ttcaaatccc agtccctaaa ccagctcttg 41641 acgcccctct gtcaggtgct aatcctggaa actggaggcc acctggtctc cactttaggt 41701 gaggaaaacc tgggagaagc catcagactg cacctgtggc atgagatgct ttgagacagg 41761 tcaagaggag gagcaaaggg cagtttggag gagaaaagta ttagccctaa ggaacaagtg 41821 cttttggaag ctcagcccgg tcagcctggt ggaaagccgt cttcagcagg gaattcaggg 41881 cttggtccaa gctcttaagt agaagcaggg acaacacagt gcccctgtgg gctgccagca 41941 ttccttttca tttgggtgat atttgtgcaa agtaaaaatt ggtttactaa tctttttttc 42001 tcaagataac aaaaagagac attttgttta aaaaaaaaaa aacaaaaaaa actctgcctc 42061 tgctccttgg ttgcacatgg tgagcacatg agctgaggag tgcccactgc ctaataccag 42121 ctgacctgca gatccagcgg aaactccaaa cccacagcgc cagcccggca cgaaaagcca 42181 cagctcttgg taatcagcca agagcttata atagcaggca tgtgggaatg ttagagaaag 42241 accgtgcccc gaggaagccc agagaccgct gggagcagac acatggaagt taccgtgaaa 42301 cttatgtaaa cagtaagaaa gataaattaa gctgaggcag tttaggggtt tccgagatgt 42361 ttcttctgcc ccagtgcctt cacgttccct ctcctgtcta cggttcattg ggcttgagag 42421 gatgaaagtt caccttggcc tggaagtggt gagcctgtaa tggcggggag tggatcgggg 42481 tcaggaatgg gccttccaca ggggccactg tacttcacac cacctttctc aactgtccca 42541 ttggttcctc agcccctgca cgtggtgtgt cctgagcagt acactcagcc gcccccagcc 42601 cagtcccaca ggatggacct catgatcgac tgccagcccc ccgccatgtc ctatcggcgg 42661 gctgaggttc tcgccctgcc cttcaaacgt cggtacgaga agatcgagat catgccagag 42721 gtgagctgtt ctccttccta gggttaaact agagctttcc acagaggctc ttggagatcg 42781 tgcaggggtg gccttctttt ggatttatgt caagtataaa tgaaccaggc tgcgcgcagt 42841 agctcacgcc tataatccca gcactttggg cggccaaggt gggcggatca cttgagggca 42901 ggagttcgag accagcctgg ccaacccagc ccagccaata tggcaaaacc ccatctctac 42961 taaaaataca aaaaaagtag ccaggtgtgg tggcacgcat ctgtaatccc agctactcgt 43021 gaggctgaag cctgagaatc gcttgaacca ggaggtggag gttgcagtga gccgagatca 43081 caccactgca ctccagcctg ggcaacagag tgagactcca agtatgaatg aacaaagaac 43141 atggaccctt aaccaagtaa ccgggaagag gggggatttt cagggccttc ttgtttttca 43201 actaataaaa taacagctgt tagtcaggac tgctccttac ctagcattca gcagcgtgag 43261 ccctgggcca catcatgggt cagagccctg ggaagtggag atgctgacac ccgctctgtc 43321 cctaaatacc ataggatggt gacttttctc ttccttcctg gacctcagtt atgagtgagt 43381 gtcaagagtt tgctgaattc agaggtagat gggggagata acaggaacca aaaaataagg 43441 attgtaaact tggttattta tatcctcttg agcatacttg caggttttgg tctatcaaag 43501 tctaagtatt ttataggtct gtgaactctt agcttcagtt ttagcaggga aagagccaaa 43561 gcatgctgtc catgttgaac agctgtggca tgctgcgctt gggccactcc tctgagaggg 43621 agacagagag ggacgcggcc tctcctgaaa gacagcgttg aggatggttg gaggctacct 43681 ctggcttcct ttcacctctt gaggcaactt gaatgtgttt tcaacagaca ggaaaaagaa 43741 atataaaaac ttattgttaa aaccagtgtg cccaaacttc ttttggagtt tgaggttcag 43801 aaatggcctc cagaccttgg gttggaggtc ttggctcctg aatgtgactc atttccatga 43861 gcctggagag gctgctaggg accaccaggt gccatcttta tggttgttta atgtttaata 43921 tgtttttatc attttgttat gattttttca ctttctctgg attgtttttg tctggtattt 43981 tacaggggct gggattgacg gccttggttt agatttcaac tctctaagcc agcattcctt 44041 aaaccttttg gtctcagaca tccttacaaa tagaactcca aagaggtttt gtttatgtgg 44101 gttatgtcta ttgatgtttg ctatatgaga aattaaaact aagacatttt aaaaatattc 44161 acttaataat acaaacctat tatatgttaa cataactaag ggataaagac aaaagcaaaa 44221 atcagtccca gtgccaggga taaatgttaa gattttgatg tatttgcctt gtctgttcac 44281 tgtgtgtgtg cctactggaa tcacacctca tacactgtcg tctttttcac ctatcagtaa 44341 gtacattata tcatttaaga tatttcagcc aggcatggta gctcactcct gtaatcctag 44401 cactctggga ggccgaggcg ggtggacaat gaggtcagga gttcaagact agcctggcca 44461 agatggtgaa accccatctc cactaaaaaa aattagctgg gcgtggtgtc acacacctgt 44521 aatcccagct acttggaggc tgtggcagag aattgcttga accgggaggc agaggttgca 44581 gtaagccaag atcatgccac cgcactccta cgtggatgac agagcgagac tctgtctcaa 44641 aaaatatata tttcagctgg gcatggtggc tcatgcctgt aaaccccagc acttcaggag 44701 gctgaggcgg gggtgaatca cttaaggtca cgagttcaag accagcctgg ccaacatgat 44761 gaaaccttgt ctctaataaa aaaaacaaaa attagccaca ggcgtggtgg caggcgcctg 44821 taatcgcagc tactcgggag gctgaggttg cagtgagcca aaatcgcgcc actgcactcc 44881 agcttgggca acatagcgag actccgtctc aagaaaaaaa aaaaagatat ttcaaaagct 44941 tcagctttaa tggttgcata atggtctgtc ataatttaac agttcctttt ttcatagatt 45001 tttttttttt tttttgagac ggagtctcgc tctgtcaccc aagctggagt gcattggcgc 45061 gatcttggct cactgcaagc tccgcctccc agcttcatgc cattctcctg cctcagcctc 45121 cctagtagct gggaccacag gcacccgcca ccatgcccag ctaatttttt tgtattttta 45181 gtagagacgg ggtttcatcg tgttagccag gatggtctca atctcctgac cttgtgatcc 45241 acccgccttg gcctcccaga gtgctgggat tacaggcgtg agccactgcg cctggcccct 45301 tttttcacag attttcattt ctggtttttc tgtgttataa ataacacttt taggagcatc 45361 cttttacata aatctttgtc catatatgtt tatttccata agaaaatttt ctgaagttag 45421 aatttctggg tcaaagatta tgaacatccc tttctggctc gaggctatat attgccagct 45481 tgtcctctag aatgagtgtg acagtttata ctcccacagc agagctggag acagctctta 45541 cttctgcctc cttgctaata ttgaatgttg tcctttttta gttattttcc aattttattc 45601 aagtcttttc cagttatata agtatacact gttatctaat tttaaattgt atgtcttttt 45661 ttttcttttt ttgagacgga gtctcgctgt gttgcccagg ctgaagtgca gtggtgagat 45721 ctctgctcac tgcaagctcc acctcctgag ttcacgccat tctcctgcct cagcctcccg 45781 agtatctggg actacaggca cctgccacca cacctggata atttattgta tttttagtag 45841 agacagggtt tcactgtgtt agccaggatg gtcttgatct cctgaccttg tgatctaccc 45901 acctcggcct cccaagtcct gggattacag gcgtgaacca ccgtgcccgg ccctatgtct 45961 ttttttgaga cggagtcttg ccgtgttgcc caggctggag tgtagtggca cagtcttggc 46021 tcactgcaac ctctgcctcc cgggtgcatg cagttctcct ccctaggctc tcgagtagct 46081 gggattatag gcacatgcca ccaatcctag ctaatttttg tatttttggt agagatgggg 46141 tttcaccata ttggccaggc tggtctcaaa ctccagtctg cccaccgtgg cctcccaaag 46201 tgctggaatt acaggcgtga gccaccgcac ccagccaaac tgtacgtctt tgatcattaa 46261 tggaggtaac tgtctcaatc caacttgcta cagtaattgc ctttaaaatg gacattatgg 46321 ccaggcacat tggctcaggc ctgtaatccc agcccttggg aggccaaggc aggaggatca 46381 cttgatgcca ggagttcaag accagcctgg gcaacacagc aagacccccg tatctacaaa 46441 aaaataataa attagccagg cgtggtggtt catgcctgta gtcccagcta ctggggaggc 46501 tgaggaggga acatcacttg agcccaggag gttgaggttg caatgagcta tgatcacacc 46561 accacactcc agcctgggca gcagagtgag gccccatctc aaaaaaaaaa agactccttc 46621 agagtcgtct tggaaatagt gcatggctgc ccagggagag cgcagaacgc catccccaaa 46681 gctcccaccc cagccttgtg cagggaggag gggcctgtgt ggaggaggcc tcaggtgaag 46741 aacgggatct ggcgcacacc ctgctcctcg gcaagggccg cttcacgctc gccataggcc 46801 gttttcttat ttcatgaaac aggcctcacg taccacttgc caatctgctt aagtatccta 46861 agctgcttcc tctgcccgtt tggtattgat cttcatgttt acataatggc ctcttgcatg 46921 tttttgtttt taaataaagg tggcttggct aggtaggggt ctacatgtct taaaaaccat 46981 gcagctaaac ccagcaacag agcacctaat aaggtcaggc tgcacggcag ggcacccatc 47041 aggtgcaggt ggtcggaaag ataccacccc ccaggtaaag ccgtggctcc caccatcagg 47101 agaagtcaga ctttcaggaa gagagagctc cctcaaccgc catgctgctg tccccgtcct 47161 tcctgccact ggtcacctgg agaggggatg agggtgaagt aaaggccaga atgaatgaaa 47221 ggctgcactt ggtgtgtcac ctgggcgaca gagcaagact ccatctcaaa aaaaaaaaaa 47281 ttgtttacct ttaaagttat ttcatctttt tagactgcag tgatgtaaat acagattaaa 47341 ggaagagtaa tggtcatcat taaaggcccc cagcctgaac tgcgcccttt gctttcagct 47401 cgcagattca ctggtgccca tggagatcaa gcctggcatc tccttggcaa ctgtctcggc 47461 cgtgctgcac accaaagata acaagcactt gcttcaggta gggggtgctg ggtgggagtg 47521 caggggaccc tctccccagc aagaaaccag accacctaac agattatatt tgaaatagcg 47581 cttcatgtga attcttgttg aagaattatt tccctggcca tgtgcctcag agaggctgct 47641 gtgcccagag atgaggccgc acgtcatccc aagggctgcc acaggcacat tctgttgggg 47701 agcgctgcca cacgaggcag ggctgtgggg agacgtgcag ggtggcaggt gcagccctgc 47761 ccttgggggc tggaaccgga gggcacctgc gtgaggctgt ggctacctga gagcctggtc 47821 ctaccaatga cccacacaca ggtgggtggc acttcagctc cagggcaggc actgtgtctt 47881 aagaattcct ttcagatctg gactgtgtca cctttatgcc acatgtagag ttgctcctag 47941 ctaccactta aagtctatta gaccctgtgc tgggtccttg acccgccttg tcttactgag 48001 ccgtcagaat tcactgctgt catcatttcg taggcagctt ctctaacctt ggccagatgg 48061 tggcaaaggt ggggtttccc cctttggtct gaccccacag ccagtgtgcc cagccacggg 48121 gtcatgatgt acctgcagca cgacacagtg tattctggag aatttactca gcagatactg 48181 aagtgaacca cctgaaaatt taaaaatgga tcttgataga aggcagagat cttagcgaat 48241 aaggtgttgg taggctggac agttgagcat tagagcgcgt ggatctgggg ctcccggcag 48301 ccagggaacc tgaaccgagt gccggctgag gaaaccgggc cggggctctg tggcctgtga 48361 ggacaggata gtctcaggct ctcagtgtgg cctgcggtgg cccctgctgc tcagaggaag 48421 ctcatgaaag ccactctttc cttctgctct agccccctcc tcggcccgcc cagcccacga 48481 gcgggaagaa gagaaagcgg gtgagcgatg acgtaccaga ctgcaaagtc ctgaagcctt 48541 tgttgagcgg ttccatccct gtggagcagt tcgtgcagac cctggagaag gtgagctggt 48601 ttcgctggtg ccgtgaaaac tccacacgtg gcagcctttc cctggctcac tatggccccc 48661 tggctgcagg gagtggatgt tgctgcttgt cacttagtcc ccactgtcct gtggcatctg 48721 tttggtctaa ggtcctgctg ggagacccag gagaaagaaa gcagagtgag gagtgcccca 48781 tccttcctcc cagcacgagg tcaccagaag gcctctccag actgaagaaa aagctgcttc 48841 cacacacaca tgtgacgagt ggggcagggt agtgaggcca ggacaaagag ggacccggcc 48901 ctgccagagt cttgcacttc cacagatgac tccttgctgt cagaggggag ccaagtctcc 48961 agtcgactgt caggatttgc aggaggcagt cgggggaggg gacactggcc cttcccctct 49021 gtctcagcag ccctgatggc tgcttctccc agagatgaga tttcttgact atgattaaaa 49081 gaaaaaaatc taaccttaaa ggttgtaatt ttggcttcag tcacaggact tcagagatga 49141 ctttattagg attatagaat ctttgatagg aagaaggaat tggctaaagg taatactgtt 49201 catgctgctg cttgcaagaa ctgcaacaaa ttacaatcat tacaaggaag gagatttcta 49261 tgaactttct atccaatgta aatatcacag ttgccgactt tcaaatctta aaggctttcc 49321 ctttcctagg attggttttc tccacctgtc tttgattttc ccgtagggaa aaaggctctg 49381 gctgggtggt tgcggctctc ttccaccctc cctgaagacc ttgcagggct cctgggccct 49441 gttaatgggc ctcaagctgg acttttaaaa acttaagatg aggaccttct gcctggccca 49501 gcctatgtcc tgacccagtg ttccatcccg gctcctctct gcagaaggag caagcacctg 49561 tccaagtccc taggggagcc tgcagccatg aagtacaggt ggcctcccca caccgaggcc 49621 cttcacctgc tgtgtgtctg tttcaggcac atgcctcctt tccatgtcac gtctgatttg 49681 taaggaattt ctgtccttag cattagcaat agctgagaag tttgcactgc tgccttctct 49741 ccttcactct tgagagggct ctgccaagtc ccacaggggt atcttggtgt cacctggcat 49801 tttcctggga gctcagacag ctgaaactta ggagggagct gtcaccaggg aacggcatgg 49861 tgcaagcagc tgagcgtccc agactcctga acacagtgct tggacgtgcc ctcaaagaac 49921 tcacaaaagc ttagccaggt tgtggaaatt ctgttgtttt gcatgagctt ttgcatgttt 49981 agggtctctt ttcaagtata agaaactatc actatcatag gcctatgact agtctgaaga 50041 attgtgttga gacgtgtcag tttctagaaa gttcagtcga gtctgtgaag tgtcatttac 50101 agatctcaca gatgtgcagt ctgcccagcc cacctctttc ttttcttctg gagcagcatg 50161 gcttcagtga tattaaggtg gaggacacag ccaagggcca tatcgtcctg ctccaggagg 50221 ctgagacgct catccagatt gaagaagact cgacccatat catctgcgac aatgacgaga 50281 tgctcagagt gcgactgcgg gaccttgtcc tcaaattctt acagaagttc tgagtgggcc 50341 atctgagcta cttccctgaa atcctgcagt ccctcactgg ctgccctcac aagccacctg 50401 aggagtggca tgagaggcca ttaactgtgt ctttgtggtg tcctctggct taaggagtga 50461 agaggtggct cttgagggaa atggtctgga cttattccca gcactgtttc aggcaagaac 50521 tttccctttc aacttcaggc tcattttctt ctcaactctg gctctctcaa ggagctggag 50581 ggtggcagaa gtgggacagg agaagttttc caagaggttc atgggaggcg gaggtgactg 50641 gctggctgtc ttgcatcagt cccaggcctc ggccagggga gccagccttt ggtttcgttt 50701 acttgcctac agtgctgtac gcaataagat gatgatccca aaatatggta aagtgaaccc 50761 atctgtctgc attttctact ctgagcccat ttgttaataa acacttattt ttatataatt 50821 agctgtcctc tgttgaacct accatctata tattgattta gtagctgaaa aaatatgaaa 50881 atatacagaa cagcatgaac ttagaaaaca ccacaggaaa ttgaattttg atgtgtatgt 50941 taaatcatat aatttgcact gtttataaaa acacagatct gtttctcctt acattgcata 51001 agaaggtgct cacctttaag ctgtggctgc acggagagtg atgcaggtcg gtacaccagc 51061 ctcaggctcc acctgcaccg cctctcccac agatcctcag tctctgcatt aaaccgggcg 51121 ttactcacag ataccctcag agccactggt cgtaggaagc tttcagacaa aagtaacctc 51181 acaaaagatg actgcttttg aaatgtataa aaccaacagt taccaggtga aatagcacga 51241 gctgtgacac ccaggccaac tttgcgagta ttaagaacaa gtcttagccc tggcaggcga 51301 tgctagatag tatgcccagc gcaggctatt cttaaccatc ttgttggagt gattgattga 51361 ttgaaattca ctcagaagtc agtcctccaa ctcggctgac aactaaacag cacacaggga 51421 tttagtgacc caataaatac ataacatgaa cagctgcaga actgactgct ctggctttat 51481 ggcgcattat cactcctctt ggaacaatcg tattggtggg aatgagtgct tcgctaaagc 51541 agggaaaaga ctacttcatg tttgccatct ccaaccttgc caaacctggg catgggaatg 51601 cttaagtagg tttctaattt tccaaggttt gggtccactc cagtcaaggg ataggctaca 51661 gaataaacga gaggcttcca accatggggc aggactgaca ttacaagaga tgaatgtgcc 51721 atggctatga acatttagtt ttctttttag aattgcaaat agacatccca agcaggcata 51781 cttccaatag aacctttgaa agaatcaagt gaaattaaat tttaaaaaca tctgagggcc 51841 aggcatggtg gctcacacct gtaatcccaa cactttggga ggtcaaggca ggcggatcac 51901 aaggtcagga gttcgagacc agcctggcca acatggtgaa accccgtctc tactaaagat 51961 acaaaaaaaa ttagccgggc atgatggcac acacttgtaa tcccagctac tggtgaggct 52021 gaggcaggag aatcacttga acccggcagg tggaggttgc agtgagccga gatcatgcca 52081 ttgcactcca gcctgggcaa cagagcaaga ctccatatca aaaaaaaaaa aaaaaatctg 52141 aaatgcaaaa acagtgtaag ctagagctca ggagaaacca aaaatggtta ttttatttaa 52201 atgtcctagc aatgctatct aggaatgatg ggatctgtca agcctgtctg ccgtgaaagg 52261 gcttgatcag agagcccagt gctggtccct tgaggggggt tgcaaaagaa gtgagcagta 52321 agaacaagcg agtcagtggg tgcccgatga acagggtgca acttagtagg ttttaatcaa 52381 gtcatcacca cccacttagt ggcagaagtc agaggcagga agcagcaaag actcatgctt 52441 tataaaaagc agagagaaaa tccagagccg gcctttccag gtatgagaag agcagttatg 52501 agtaactgcc taaagttcag gtatttggat accatgccag gttggttaga agactccaaa 52561 gaagtggcat aagtggcaga cgtggcctgg ctctatcaga aatgcggccc accgacatta 52621 actgacattg actgacactg acatcaacct ggcgaagact ctgacatcca gaaaagtttg 52681 tactcaaacc cagtggaatc ctaatgatta attgaaaaaa acttaatagt gcagagacct 52741 catattattt aagtcttagt acaaagtgat atattaggta tctattgcac aacaaattac 52801 cccaaaacac ggtggctcac gcctgtaatc ccagcacttt gggaggccga ggcgggcaga 52861 tcacgaagtc aggagatcga gaccatcctg gctaacacgg tgaaacccca tctctactaa 52921 aaatacaaaa aattagccag gtgtggtggg cgcctgtagt cccagctact ccggaagctg 52981 aggcaggaga atggcgtgaa cccaggaggc ggagcttgca gtgagccaag atcgtgccac 53041 tgcactccag cctgggcgac agagcgagac tccgtctcaa aaagaaaaaa aaaaaaaaaa 53101 agaaaacctg acttttctca tctcactgtt tctgtggtcg ggaatctggt gtagtgtggc 53161 ttagctggtc gaccctggct cagggtctcc tctccacacg gctgcagtca gctgttgggt 53221 gagggaacag agcttaagta actttccgca gaaccgccag tgagtggcct ctgccttacc 53281 gcaacaccgt gggtgagtat caggtcagca gccagccagg aaatggcaat ctgtctttta 53341 ggccattgct ttccaagtca catctactcc atctctcctg atccctgaag agcttgaagc 53401 ttttggccct cacagttgtc ctataaaggc atttccaaac tgtaatgaag tatcaacaga 53461 aacaagagtg aagaaacctt taaacctgca taatgacata ttaacaagag tcaagcaacg 53521 agtgggaagg gaaggaggac acttttcctc tggccctgag tccagttttt ttcctgcagc 53581 caagaggagt agttaatgct gtctcactgc tttatgccat ctataagaag gtagacaaca 53641 cttatctttc aaatgcactg cagtgggact acacataaat aacagtagtc ttctttgaac 53701 ctaaaataga gtggaaataa ccaatgacaa ttatggagga agtcacaggt aaatcctgga 53761 gaccagcagt gccaagctga gccacagggc cattctcact gtagacttga gccagcctcc 53821 atcaggaact gatcttctaa agatcaaata ccagagtctc cactgctcct tggcagccca 53881 ttatgggttt taatcacatc ataaagcatt atatacatta tggccaggta cagtggctca 53941 cacctctaat cccagcactt tgggaggcca aggtgggtga atcacaaggt cagaagttca 54001 agaccagcct ggccaagatg gtgaaacccc atctctatta aaaatacaaa aattagccag 54061 gcgtggtggc agatgcctgt aatcccagct actcaggagg ctgaggcaga gaaatgctta 54121 aacccgggga gggggcgggg ggatggaggt tgcagtgagc caagatcgca ccactgcact 54181 cccgcctggg agacagagcc agactctgtc tcaaaaaaaa aaacaaaaaa aaaaaccatc 54241 tatctatcta tctatatata tacatgtgca cacacacaca cacatgcaca cgttaaatgt 54301 aaacttttga gacacaggac cacagatctt tgaaaggggt gtaaacgccc atctccttag 54361 gcatgtagaa tatttcttgc ttctcttctg ttggcattgc aggccattga aaaaaatgtg 54421 caaagccccc gtgtaatggt gtttgtgtta gaaggattta ccctttacct ttttctacaa 54481 taaacattcc taacccatgt gtaagcctcc ctgatgtagt tatcaaatca atcaccagta 54541 aaaagtaact taattctcct acaataaatt ctgagttacc aaacacatta tcaattaaaa 54601 taagtttgct aacgtttcct taaattatcc aatataagtt tttactctag taactattta 54661 catttgcttc acatactttg gaaataatgg actttcattt cacaaagcct ttcccaatca 54721 tcagtaagca ccttccagtc atcagtgggc attagtcggc agctgctcac atattcggtg 54781 tgttgtgccc tctctcatgg ctttagctca ccgtcacaga taagcatttc tcccagactt 54841 acagctagag aggagcacat ttccaggacc atgagcaccc tgggggcagg gtctgttttt 54901 tccaccttgt cccagcatga ggcttgtgga agaaggtaag gaaagaaaat ttcagaaata 54961 tttaggaatt acaggccaaa acaacatttc ctggtgggtc agttttttaa ctgcaatgtt 55021 ctaaacatgg gaacctgcac ataagtgtaa aaatccctat catttagccc atgctttaaa 55081 atagctactc gattcagtgg gcagcttcct gatgagatga atcagaggtt ggtaactgtg 55141 gccgaaaagc caaatctggc ccacaagcag agttgttaga aaaaagatgc aacagaaatc 55201 acatgtggcc cacaaagcct aaaacactgg ctgacccttt acagaaaaag tatgccaatc 55261 cctgctcaag tgctgtgtgt gggaacattt ctgtagttta ttcaagtaaa ggtcaaataa 55321 tggaatggca atgtaacagc tcccatcaga cctgaccctc ctagaggtaa aactataaac 55381 tccagacgta tgtagttacg taagtaggta gatagaacaa cctaccacaa aaaaacaatt 55441 ccattagaga ttttatcacc cttgtaataa ttattaaaac aactagacaa aaaaaaagtc 55501 atagatgacc tgaacaaaac tgtcaaaaac tttgacttaa ttgatacttt ttagaatact 55561 tgctctgcag cagcagaatg tttactatga aaaccatatg ctaggtgata aatctcatta 55621 catctgaaag gaccgaacgc atacacaaaa ccttctccca ccacaatgga attaaattca 55681 aactcaacga agtattttgg aaaaccacaa atatttagaa attaaacact tctaaaatag 55741 ctcatggatc aaagaagaca tcccaaaatg aattggaaag tattttgaac agaaaattaa 55801 agctcaacat gtacaggata ctgctaaagt agtgcttaaa agtcatctta tacctttaaa 55861 tgcttacaga aaaaatgaaa gacctaaact tgatctaaat ttttacctta gaagactata 55921 aaaagagcca aataaaccca aagaaagtag aggaaagaaa tcataaaaat aagcaaaaca 55981 tgagcaaaac agaacagaga aaactaacaa agccaaaagc tgatttttta aaacatcagc 56041 agaactgata cacacctcat tagactgatc aaggaaagac aggaccgact gcccatatgg 56101 gcagtgaaaa aactttggtt atcactacag atcctacgga tatgaagaag acagccaatc 56161 agaaaggaaa gaggggtatt actaaagagc ctacaaatat taaagggata aaaagaacac 56221 caacttatgc caacagattt accaccacag ataaaatgga aaatttcctt tgaagacaca 56281 aatagacaaa gctcattcaa taagaaaaag aacttgatat tcacttaaga aattaaattt 56341 attatcttct cacaaggaaa actccaggcc tagatggttt ccctgggaaa ctatcaaaca 56401 tttaaggaag aaataacacc aatcttgtat aacctctatc aaaaagagga agggggaata 56461 ttccagtccc ttttaagggg ccagcataac tctaatacca aaaccttata aagtcattac 56521 caaaaaagaa aatgagaggt aaatatctct catgaacatc aatgcaaaaa aaaaaaaaaa 56581 aacttaccag caacctgaat ccagcaatac acaaatagga taatatgaca tgaccaagta 56641 gggtttatcc ctggaatgca aggataatta aatatttgaa agccaatcta atttataata 56701 gaatagagga tcatttcaat agatacagga aaaaaagcat ttgatgaaat tctctaacag 56761 cactcagcag acaggaataa aagggaacat actcaacctg ataaaggtta tgtatgaaaa 56821 acttaacagc tcagtgaaat actagagctt ttccccaaat attgagagca aagcaaggtg 56881 ccgatccata ctactgttct atggtgttct cggagtccca gtcattgcaa taaggcaaaa 56941 ttgaagagga aaaggcaggc aggcatacaa acagataaag cataaaggta ggaaagaagt 57001 aaaactgttt tcagatgaga ctttttacat agaaagttct aagaaatcta gaaaactact 57061 ggaataagct cacaagactg caaaatacaa ggttggtatc caaaagtcaa ctgtatttta 57121 tatattaaca agtttttgag agagagtctt actttgtcac ccaggctgaa gtgcagtggc 57181 acagtcatgg ctcactgcag ccttaaactc tcagggtcaa gtgatactcc cacctcagtt 57241 tcctgagtag ctgggatcac aggcacatgc cactgcatcc agctaatttt ttttttcttt 57301 ttacttttat agagacccac cttggcttcc caaagtgctc ggattacagg tgtgaggcac 57361 aacacctggc cagaaataaa atgtttttaa aacagcaact tcattcataa tagtgtgaga 57421 taacttttga aaagatatgt aagatctcta cactaaaagt ctcaaaacct tgctgataaa 57481 aattaacgat ttgaataaat ggagaaatat gccatattga tggattagaa tactcaatac 57541 taacatttta attctgccta ttgatttatg gatttgatgc aataccatcc cagcagacag 57601 ccacaccaca acctaaccca atgttttaag taggtaaagg acttgaataa acatttttcc 57661 aaagatgata cacagatggc caatagcaca taaagagata ttcaacactg gtcattaggg 57721 aaatgaaaat caaacccatg accaggtacc acttcacacc tactaggatg gctgtaccat 57781 ttttttaaat ttttatcaga aagtaagtgt tgggagaagt ggagaaattg gaaccttcat 57841 acgctgctag tggaatgtaa aatgacacag ccgctacgga agacggtttg gcagttcctc 57901 aaaaagttaa atacagaatt accatattgt ccagcaactc cactcctcta tagataccca 57961 aaagaattga gagcagggac tcaaatattt ggccacctat gttcttagca atattattca 58021 ccaccttagt aaccaaaaga tggatgcaac ccaagtatcc accaacagat aaacagataa 58081 aacaaaatgt ggaacataca cacaatgaaa tattatccac tcatagaaaa gaatgagatt 58141 ctgatacatg ctgcaacggg tgaaccttga aaacatgcta agtgaaataa gccagacaca 58201 aaagaccaca tattttatga tttcatttat attcaaatat ccagaataga tgaatccata 58261 gagagagaat agaggttatc agaggctgga agtagtgggg gaatgggaag ttactgttta 58321 atgagtacag aatttgttcg caatgaaaca gttttgtaac tagctagtgg tgagggttac 58381 acaacattgt gaatatactt aatggaacta aattgtacac ttcaaaatgg ctaacatggc 58441 aaattttatg tttaaatttt tttaatctga taatgccagg tttcttagaa gagactgggc 58501 agtattgaga tgaattttat gtaagcataa gagctaatgt acaaaaatca caagcattct 58561 tatacaccaa taacagagag ccaaatgatg agttgaatgc tcattcacaa ttgcttcaaa 58621 gagaataaaa tacctaggaa tccaacttac aagggacgtg aaggacctct tcaaggagaa 58681 ctacaaacca ctgctcaatg aaataaaaga ggatacaaac aaatggaaga acattccatg 58741 ctcatgggta ggaagaatca atatcatgaa aatggccata ctgcccaagg taatttatag 58801 attcaatgcc atccccatca agctaccaat gactttcttc acagaattgg aaaaaactac 58861 tttaaagttc atatggaacc aaaaaagagc ccacattgcc aagtcaatcc taagccaaaa 58921 gaacaaagct ggaggcatca cgctacctga cttcaaacta tactacaagg ctacagtaac 58981 caaaacagca tggtactggt accaaaacag agatatagac caatggaaca gaacagagcc 59041 ctcagaaata acaccgcata tctacaacta tctgatcttt gacaaacctg agaaaaacaa 59101 gcaatgggga aaggattccc tatttaataa atggtgctgg gaaaactggc tagccacatg 59161 tagaaagctg aaactggatc ccttccttac accttataca aaaattaatt caagatggat 59221 taaagactta aacgttagac ctaaaaccat aaaaacccta gaagaaaacc taggcattac 59281 ccttcaggac ataggcatgg gcaaggactt catgtctaaa acaccaaaag caatggcaac 59341 aaaagccaaa attgacaaat gggatctaat taaactaaag agcttctgca cagcaaaaga 59401 aactaccatc agagtgaaca ggcaacctac aaaatgggag aaaattttcg caacctactc 59461 atctgacaaa gggctaatat ccagaatcta caatgaactc aaacaaattt acaagaaaaa 59521 aacaacccca tcaaaaagtg ggccaaggac gtgaacagac acttctcaaa agaagacatt 59581 tatgcagcca aaaaacacat gaaaaaatgc tcaccatcac tggccatcag agaaatgcaa 59641 atgaaaacta caatgagata ccatctcaca ccagttagaa tggcaatcat taaaaagtca 59701 ggaaacaaca ggtgctggag aggatgtgca gaaataggaa cactttttac actgttggtg 59761 ggactgtaaa ctagttcaac cattgtggaa atcagtgtgg tgattcctca gggatctaga 59821 actagaaata ccatttgacc cagccatccc attactgggt atatacccaa aggactataa 59881 atcatgctgc tataaggaca catgcacacg tatgtttatt ccggcactat tcacaatagc 59941 aaagacttgg aaccaaccca aatgtccaac aatgatagac tggattaaga aaatgtggca 60001 catatacacc atggaatact atgcagccat aaaaaatgat gaattcatgt cctttgtagg 60061 gacatggatg agattggaaa tcatcattct cagtaaacta tcgcaagaac aaaaaaccaa 60121 acaccgcata ttctcactca taggtgggaa ttgaacaatg agaacatatg gacacaggaa 60181 ggggaacatc acactctggg actgttgtgg ggttggggga ggggggaggg atatcattag 60241 gagatatacc taatgctaaa tgacgagtta atgggtgcag cacaccagca tggcacatgt 60301 atacatatgt aactaacctg cacattgtgc acatgtaccc taaaacttaa agtaaaaaaa 60361 aggaatatat tatgaaatta taaaattgaa aagaaaagga gctaatgcca tagaactaat 60421 tctaaaattt acagagaaat acaaagtaac tataatattg aaagcaatct tggagatgaa 60481 caaagttgga aagctgcatt catcaagacc gtatggaact ggcacgagga tgaacaaagc 60541 agcataacaa caaagatggt tcagaaacag agccccactt ctataatgac caccttttca 60601 acaaagggaa gggaaagtct ttttaacaaa tggtgctgca atgcccatat agaagaagta 60661 tcagaaacct gaccactgcc acacaccata aacactgaga tggatcttta attataagag 60721 ctaataccat aaagcatttg gtgaaaaaca ctgaaaatat cttcatgatg ttgggtaggc 60781 acaggtttct tgggtcacag aaagtagtaa caagagaatt gtatctcctc aaaattgaaa 60841 acttctgcta atcagacgac accatacaga aaatgattag gcaagccaca aattaaaaaa 60901 ataatttaca aaacatatct gacaatggac tagtgtccag cgcaaaaaat tcctgtaact 60961 cagcaataaa aaagactaaa tacatccata cgatactatt cattgagaaa agaaactggt 61021 tatcaaaccg ggaaaaagat acaaaagaac cttaagtgca tattatatta catgaaagca 61081 gccaatgtga aaaggctaca tgctgtatga ttttatgtga cattctggaa aaggccatag 61141 tgtgaaaaca gtaaaaagat cagtggttgc cagagattca gagagggagg gagggaccaa 61201 taggtgcagc acaggaagtt tttaggggag tgagactgtt ctgtgtgaga ctgtaatggt 61261 gaatatatat cattacatat ttgtcaaaac ccatagaaca tacaacacaa tgaatgaagc 61321 ctaatgtaaa cccatgggct tgagtgaata atgtgtcaac actggctcat caattgtatc 61381 aaatctatca cactaatggc agatgttaat aaaggacaag tgaggggtga ggtggaagaa 61441 gaagtctctt tgtacttctc atgcagtttt gctgtaaatc tgaaactgct ccccccgaaa 61501 tctattaaaa atgtaggaag aaaagaaagc aattcaaaaa aggacaatcc agtttttctt 61561 aatgggcaaa agatgtgtac agataattca caaaggaaat atatataaat ggcgtaaaca 61621 catgaaaagg tgcttaaatc accagtcatc aggaaaacgc agaatgaaat aagacaccat 61681 tactcaccag aatggctaaa attaaaaaga ctgaccagac catggatcag tgaggatgtg 61741 gaactgggag tctcataatt actggtggaa gtacacaatg gaatgatcgc attgagaaaa 61801 ggtctagaag tttcttacaa aactaaacat gtatacatct accatattac ccaacaattc 61861 cactcctagg tatttaccca agagaaataa aaatccacag aaagacttgc acatgaatgt 61921 tcacagaaac tttattcata atatccaaaa actggaaaaa gccccagtac ctatataata 61981 gaacggacag attttactca attcatacaa gggaatacta agcaataaaa agtaactaat 62041 caccaatcta ttcagcaacg atggatgcat ctccaaaacg ttatgctggg tgtgtagaag 62101 acggacacac acaagagtag aaattatagg acaccattta tatgaaattc tagaatatgg 62161 aaaactaatc caaaatgaaa aaaaccatca gcattggcta tgtctgagga tggaggacgt 62221 ggggactgac taggaggaag gagcaggagg ggactttctg ggttgatagt agtgttccat 62281 atattgagag gggtctgggt tacacaggtg tgtgcatttg tcagaactca aaagaatgca 62341 cactgaagat gtgtgcatta cagtgtgcac gtttaaaata aagtttacat taaaaacaca 62401 aacattgacc tataatgaac agttgtatgc ccatgtattt agaaggaaat gcattgatgt 62461 tgccagttta ctcagaaatg tacctcaaca gtgcaccatg aaaggatgaa tggcaggatg 62521 ggtgaaggga cggggcatgg gtagatggga cgctccaagg cgggtccagt aaaatgacat 62581 agacatttat gccctagaaa tgatttcaac attgccgtat gtttgaaatg tgggaccagt 62641 cgtttaaatc aatagaatgt aagtagtttc aatgctaaca tgacagtcct acaacaggac 62701 cagcagctgt actttttttt tatttttatg agacggagtt tcttgttgcc caggctagag 62761 tgcaatggcg caaatcacag ctcactgcaa cctccgcctc ctgggttcaa gcaattctcc 62821 tgcctcagcc tcctgagtag ctgggattac aggcacgtgc caccacacct ggctaatttt 62881 tgtattttta gtagagaagg ggtttcgcca ttttggccag gctggtctca aactcctgac 62941 ctcaggtgat ccacccgcct tggcctcccc aggtgctggg attacaggcg tgaaccaccg 63001 cacccagcct gtactctttc ataaacgtca agacagatga agaaaggtaa aacaatttgc 63061 ctaagctgtg atttctaagt gaccctcttc actttgtcaa agcattcatt catgagaaaa 63121 ctatggaact cctgtgttct tgagaggctg cagtccggtg tgggaggcag agcagtggcc 63181 agcacacagc atggtgaggc gacagagcgt gggggctcta ataggaggtg agcagggcac 63241 tcagccaggc gctggcgctc aaacctagtg gaaggcagaa agagccatga agaagtggac 63301 actattttac tccagtaata gttcattttt attgtgtcaa acagtggact ctacgtatat 63361 tatattattt aacttttaac atatgcttaa gagatgggca caacttttgc caccgtatgg 63421 tgggattaga gcctaaaata gtaatagata acttgctctc caccagtgtg atgggcagcc 63481 caagatctgc acccagtctg ttccagggcc cagaccttta cccactacat tctcctttct 63541 tcttttcagt atcttcataa cattctaatt tttttgtaga gatgggggtc ttgctatgtt 63601 gcccagactg gtcttgaact ggcctcatgt gatcctccca cttctgcctc accaaatgct 63661 gagattaaga tgttaggcac cacacaccac catcaacatt cttcttaaca catttttgta 63721 aaccttgtgg agccttccac ttcagtgatg atcccatcaa cagctaacat ttaccacctt 63781 ggcagaccgt aagtccaaga cacaactcga caggtataga ctcaaagcag acatcatatc 63841 tctgtgtata ggaagacaca ttttctacag cctcatgcca ccttctcaag tctctctggt 63901 cccaggacaa tcgtaacatg gagatggatg gctggaagaa caggagcttg acagccaaaa 63961 ctccagaccc aaagaggaat gcccctcgat gacatctcac ccatcagctg ctgcaaactt 64021 gcctgatcag tcgtgaaccc cacttgagga gggacaccaa ctgttaagtc tcacccattc 64081 ttaggactgt cagtgtgacc aaagctgcca cctgcagagc ccaggagagg agtcctcgcc 64141 tttaccccct ttcccatctc catccttctc cccgaagccc acagctcagt gccctctcct 64201 gaggaagcct ctgatcccac agccaagcac aagatctagg cctgtgggca ccaacaggat 64261 ggggctctgc agtcagggag cgtcagctcg gtgcaggtac aggtgcctta gtgacctata 64321 ggtcaggggc atgacctatg gaccgaatcg agccattcac agtgaggcct cacctgtcct 64381 gggatcgcag gcacacacag ctccccacaa ccactacaca cacacacaca cacacacaca 64441 cacgcatttt aaattcccat gaaaaaatta actttgcata tatgggccac atgcccttcc 64501 acatcctgct taaagcacct caacagcccc taagttcctg ttttgtcaaa atgacttgcc 64561 ctggaaccgg gcacaggcaa ggctgcccat gtgagtgtga gtctgttcac ccatctctgg 64621 tccacagccc acaccagggc ctggtcaggc tgcctcccat cgtcttctgc gagcaggccc 64681 agctggcata cacaggtggc gacctggaat caagcaatca agcaggtgcc ttctctcagg 64741 tcactcttcc atacttgctg aggaaaacca caaaagacct ccaagctgct tgagttaaag 64801 tctccattta tttttatttt tttacaaaaa tccaatgtaa gaccattgtg ctcgtgacga 64861 aaaggggtgg ggtggatgga cgtggcatgg atatcaaagc ttccccccac aaactaggag 64921 ctccccactc tgtccggcgc agctcccaga aagatcccat ccttccggac aggaccccag 64981 ctggtgagcc ctggcctgag gcacagtcca cacggaggag cactgcccag ggagccagcg 65041 ctcacagtgg cctgcagagc cctgggacgg tgttatggta agacagccca aaccggagca 65101 gcaagccggc cacccagaga acgaggcgct cctgcaccct gcgagccagg acaaggtggc 65161 caggggcggc ccacagacag ccaaggagac ccggggtctg tggcgccgct ttcccatctc 65221 aagcgagtca caggtcggcg gctttcccgt ggtgagaagc acctgaccag tgacactgtg 65281 gccaccttgc tgcctctcgc tgaggagggc gtgcccctca gagcctgtct gcagtccttc 65341 aagccagtgt tcctttcagg gtcaaggagg gctgtccttg ttggaagcac cggcaccaca 65401 gccctccctg cggcatgttt tggtgtcaga ccactcagcc cttcttagat ccaccagtga 65461 cattcggggc ccgacaacct ggctccacta aagggagagg ccctggctcc accacacaga 65521 cggccccagc tcactgagtc ccgctaaagg gggtcccacc acacagacgg ccccggctca 65581 ccgagtccca ctgaagtcag tatgtgagtt cctcacatta aaagaaacca gatgaaatag 65641 cagccacaat atagcgccac acaccacact ctttggctcc ccgagggaag aaggctactg 65701 ctaaaaggaa tacaagtcag gagtcaggta gagggcaact agaaagttct gaggaagggc 65761 gtctgacccc cactgctggg aacataacca cactgcctca gcaggggagc tacaggctga 65821 tgctggggtt gggggcgggg aacctttgga aacacagtcc tggcggcggc cgggtccggt 65881 ttgccaatgg ggagagttcc cttaagccga gctagcccta caggtgggtg ggagctacac 65941 aaaagagccc agcttcaaaa cagtacttga agaggaccca cgtggtacag gcaggtcaga 66001 ggagaacgta ttccaagaaa tagaagcaca ggatgccaag gtctagggaa gacggaactg 66061 gcttaaggca tgtgcatgac caggacaaac ctgagctttt gttcagttgc tagaaaactt 66121 ccagagtcaa ctccacttcc agaaagtagg gttcaagaaa cacgtcatgg gctaaatccc 66181 tgacaaatgc cactcacacc ctcctaggtt cccctactgc caccatgacc caaaaaatta 66241 gcttatttca gtttcagccc agggaacaga atcctaagca gggagtggaa agtggtaact 66301 cgggttgtga atgcccgtta gattccaagg ctggatgtga gcttacacag caaatcacag 66361 cctcccattg ttctagcaca taccaaacct cggggagtcc tacagccaag ctgacattag 66421 gggtccaaaa accacagata acacaggatg gggctccaga cagaggcggg gggaaggtga 66481 atttcaccaa ggaattatcc caaggcaggc gccttgctgt aaaacttccc ggccagccgg 66541 gtgggttcct cgaaggacac tggcttgctc tacactaggg agaggaggct gacctgcaaa 66601 ccacttcaga ccacagcaga tgtgcacgct gctgatctcc tgtccaatcc aagaaagagc 66661 acttcagaaa cgcctgaggc ccacagcacg tgtgtttcaa cagaagagca ggatagaaag 66721 agccatctgg gagtggcgtc ttcagcccct attctttctc actctttgct tcctcattct 66781 ctctcaaaca agagagaaat gggagagcag ggataagtac ggaggcaagc ctggcctaaa 66841 gataaatcct caaaaatcgc tggccccagc agcaggaagc tgaacagccc accagggtca 66901 ggcgctccca gggattcact gggaagagaa tgtgagttac aggttgctga ctggcaacag 66961 aaagggtaag gaagagacct tgtccaggcc cgcaagaggg ccaagttcat ccctttctgg 67021 ttgctgcaca cagatggcgc tggggaggat gggagatgat ctttaaggat aagccagtga 67081 cacaaggcca ggacccatct ccgccagaat acagaacaaa ggagcctgcg cggtccctcc 67141 cttagaaagg caaaactcac actcccccag ccaaaaatat atatgtatgc aagtgtgtgc 67201 atgtatttat atacacacac atatatataa ataagccttg aatggcaaat ctgaaacttt 67261 ctctttttaa ataatcataa tagttgttat tgaatgtaaa aaccacgaac cagctgtcct 67321 gggcgtacga acggtgtgag tgactctgca gagtcgccac agtcctcagt gtaagctatc 67381 agtcagtgcc ctgtgtgggg aaccccgggg actccgccca gggctccagg cccagtgtgg 67441 ctgacttcaa gataaaggca gcggtttcct tccactcctc ctgctgcccc ttccagcaga 67501 ggctctgggc cacccaccag cagatgtgcc caaggtcctg caatgcctag gaaccttggg 67561 agccatcttc ctccctctgc tcatcctctt ccccagaccg tgcgctgccc ctagatgaac 67621 ttgaagcact tggtcttgtc atggggcagg cgtgtcttga agagcacaga atccaccctg 67681 aactgcgtgt acaggagggg catgtagccg tacaccttca cgaagaagtt gatgcacttg 67741 tgccgctcgt ggaagtggga gtcatcatga gacagggcct gagggcatcc tgggcatcgg 67801 aatgtccacc gtgaggtcac ctggaaacgg gagagagaga cagagtggga atcccagcta 67861 atactgacag aacccttgca gctgagccga tcccacactc ccatgtccat ggtgaagacg 67921 ctgatcccct caggggcaac atccctgcag agcatggcag gaaccagagc ccggccccag 67981 gcctcctgcc taccagatgt ctccagaaca ttgtcaggta ttctgttgag atggcctacg 68041 cttctcagat gccaaaagcc ttaacgtgtg tagtgtcagc tgtctcagta agtctactcc 68101 tagtatgtac ttggttgcag agccataggt aggtaccgag ttgtttgttt catcaatgtt 68161 ttgaatcaaa atattgaaga ctacccaaag aggggctttg ggtattgaag actacccaaa 68221 gaggggctag tcaaagaggg gctatcattc ttgaatactg tccataaaaa agatgcttaa 68281 ctacatttaa agccatggga aagtggccat actacagtct agtcatatta ttattaatta 68341 gaaaatgtct aactaaaaaa gtatgaagag ggacagcttc attacaatgt ggcaggccga 68401 atggcataaa aacccctcag aacacctgaa catgcaagaa gaaatacata aaccatctct 68461 ttaaatacag ggcagagcct gtaataagaa atgaaattac ctggtgatta attccagcac 68521 tttgggaggc caaggcagga agatcgcttg agcccaggag tacaaaacca gcctgggcaa 68581 caaagcaaaa cctcatctcc acaagagata aaaatattag ctgcgtgtgg cagcaggcca 68641 gctatctggt gtagtcccag ctacttggga ggctgagatg ggaggctgct tgagcccacg 68701 agtttgaggc tgcaatgagc tatgatggta ccactgcact ccagcctggg tgacagtgag 68761 accctgtcac tcactcacat acatacatgc atgcatgaat aaacaatgaa taatgaatga 68821 atgaatgaat gaatgaatga atgaaatcct cagaggccaa acaatgaaaa agcaaatcct 68881 gcaagatagc catgaacttg ggttttaaat gggctggaga agtgacacct gcaaagcggg 68941 ctgggggcct ttggaaacac tggctccatg gaggggagca gggaggggtg gacgcctcac 69001 aaagaaagat ggggaagaag tgtctttaaa tttatcttct acttcctttt cttttcacct 69061 aagtctgatc tttttatccc atttcactga aatttaataa ctatgattct cattttcaat 69121 agttccattt agggctttcc aatctgtttg ttctttttgg agtgatttgt tgctttttta 69181 tgttttcagg ttactaattt taagcctact tgttttatag tctatctaat ggctttatta 69241 tttgaaatcc ttggagaact ataacctgtt tgttatatgt gttcactcct gctcatgatc 69301 agctgttttc ttggtggctg actgttgact ttacatttca agctcatctt caatgaggct 69361 ttacctgtgc gtgtcctatg tgacctgagg tgaagaaatt tctctttttc ttaagtggga 69421 acttcctctg ctgagagtaa tttctcctta taacagattt ttggttttat tttgtcaaac 69481 agtccaaggg tatcgacgac tgggtctagt tttctttttt gttttttccc tggggactcc 69541 ccatattgcc caggctggtc tggaactcct ggcctcaaga aatcctcctg cctcagcctc 69601 tcaacatgtt gggattacag acttgagcca tctcatgtgg ccctgggtct agatttcata 69661 cagaatgagt ccctaagccc atggaggctc aaaagactat ttaacattct caacctacac 69721 ttccccaaca acctgtcaga gtcaaggtta aaataaacaa ggtatgtgtc atctccccgg 69781 ggcaacgggt aggagatctc cattctaatt ctccaccctt aacaggctct acactccttc 69841 acatgagtga taaaatccaa gcctctagac aactaaggtg agagcagccc cccatggtgg 69901 cctcagtgat gccaccacgc ttgccaccct aagttttagt cctcccacct gcttcctttc 69961 tggcaattct cttacctttt tattagctca actatacact gaaaaaataa gtttgttact 70021 tatagtgatc aggttttcaa actacctaat ccactatagt acaaaaccca aaaatttact 70081 gtcaagtttt tttttttttt tgagacagtc tcactctgtc tcccaggctg gagtgcagtg 70141 cggtgatctc ggctcactac gaactccgcc tcccaggttt atgccattct cctgcctcag 70201 cctcccgagt agctgggact acaggcgcct gccaccacac ctggctaatt ttttgtattt 70261 ttagtagaga ttggttttgc tgtgttagcc aggatggtct cgatctcctg acctcgtgat 70321 ctgcccgcct cagcctcccc aagtgttggg attacaggca tgagccacag cgcccagcct 70381 actgtcaagt ttttaaaaag cagactgcaa atcaagtata taaatttaaa atataaaaat 70441 aaggccagat gtggtggttc ccacctgtaa tcccagcact ttgggaggcc aaggtgggcg 70501 gatcacttga gctcagtttg aggccagcct ggccaacatg gcaagaccct gtttctacta 70561 aaaatacaaa aaaattagct gggcatggcg acacatgcct gtaatcccag ctgctgtgga 70621 ggcttaagca ggaaaatcac ttgaacccgg gaggcagagg ttgcagtgac ctgagatcgt 70681 gccactgcac tgcagcctgg gtgacagagc gaggctccat ctcaaaaaaa aaaaaaaaaa 70741 agaaaaagaa aaaatacata tatacgtatt tttacacaca tatgtgtata tatatatgta 70801 tgtataaata aataagtcac cacgatagac aggataccag agaaccaaaa gaaataagcc 70861 aaaagttttg gtacttttga tttctttctg catgtctatc ttttctcaaa taatttttaa 70921 atttccatta taaattaagg ggaaattttt taattgaaag acacatccca taacttaata 70981 gtggaagagt aatcattgtg tacagccagt atgcgccgtc agagcccagg tcccagagtt 71041 taaactggga ggagacacag gccagtgctc aaagggtggc tcccctcaga accgagtctc 71101 tggacagtca tgacctccac aggtccccct ccagggtccc acctgtctcc tcacttctcc 71161 cctcactcac tgctgctctc ttagaaccct tcggggtcac gtcagcactg agttattgct 71221 cttccacggt tcccactgga gcaggatgta ggggtcagga atctggggaa ggatgttctc 71281 aaacagcatc tatgtccagt attccatggg gctctcactg gatctaaaaa cctttctcat 71341 cattccagac accagaatcc aaccccagga gaaatgccct ttaacctgca cattattcca 71401 tgtgacacaa aaggtgactt tataactgtt gttttcacgg aagcagtggt ttccaaatgt 71461 ttttaatcat gtaatccatc agtaaaaaaa acatttaagc tgggtgcggt ggctcacacc 71521 tgtaatccca gcactttggg aggccaaggc gggcagatca cgaggtcaag agatcgagac 71581 cagcctggcc aacatggtga aaccccttct ctactaaaaa tataaaaatt agcggggcgt 71641 ggtggcacac gcctatagtc ccagctactc agaagactga ggcaggaaaa tcgcttgaac 71701 ccgggaggca gaggttgcag tgagccgaga ttgcaccact gcactccagc ctagcaaaag 71761 agcgagactc catctcaaaa aaagaaacaa aaaaccattt aagactgcat ccccaatata 71821 tttgtaaata tataactgtg ttacataata aaacatgcaa aaaatttaaa aagaatgaag 71881 caactataat attaactgaa gtctggacat ttacttattt aaccaatatc gtggatcaca 71941 gtttacatgg aagattccag gtaactcaat ctaagaaaaa tattcgtttt atgcttagta 72001 acaatgagga aaatccttga tagctgccaa gaacctatat caccccagag aaccaagacg 72061 ttcacttgca tttcggcttc cttaccacct aagccatctg ttttctcaaa actttacagg 72121 tgacttttca atctcttatc ctgaatgaag cctatttata ttctgtgttc tccttgcaaa 72181 agtagtacat tattcaaaga aataatatga cattaactcc ccattcgtta gtcaatatta 72241 agatattaac attattgaaa gaacactgcc aatcatacga agcagtcaaa cctccctaac 72301 tcaaacaagg aatagtttga cagtaaaaat ttgaggtatt taaagcacaa caaaaaaatt 72361 actatttttg aacataaaat agtacatata cctgatacca ttaaaattag gtaaataaaa 72421 tatttaattc aaactggttc tttattatga agtaaataat tagattcata agttgaagga 72481 attactaaga gttagaaaac actcttaatt tcagcctttg aatttgaaaa gtcatcccaa 72541 tcttgaattc ttcatatatt ccagaaagat gaagaaaatt cacagagaat actcagtttt 72601 gaagttttca cttggtaaga atcatgtgca ccatgtctaa attacttcca cctgcactga 72661 agagatggct taactaatga aacactggcc taataatgca gtagacaaac acactttaac 72721 aaagatgaaa aattccccat gtctgtgcct gctcaggtaa ctgatgctat tattaggtac 72781 ctaatcactc agatacttta aattttcatg gaccatgtct tctggtctac tagagaggca 72841 taaattgatg catacatctt gactcaagtc cagtccctgg ctacataaga aaggatatat 72901 aaggaagaga aaattgcacc catcattaat tgctttctaa aacctttgcc tccctacctc 72961 aaagtctaca aaatcttttc actgtttaat atgagaccta ccactgtacc tggaaaacat 73021 actgttttta tataaatact tgtgactatt tttcacaatt taaaaaaatt gatacattat 73081 gttgctaatt attcttctct tgtgaggctt tagcagaagt ctcggcaaca gatgaaaccc 73141 tgggacaatc aggagtgaca tcctacgcag gggccacagt tggcctccac atgcatttct 73201 ttgttatgct ttgctgcatg gaaccagcgt cctctggtgg ccaccctgct tagcactcaa 73261 gctacgactt ctttctcact acaatgccca ggctggagtg cagtggctat tcacagacac 73321 gcccatggca cattcagcct tgaactcctg gattcaagca atcctcctgg ctcagcctcc 73381 tgagtagctg agactaccag gcatgtgcca ctacacccag cttctaaaga tgatttcatt 73441 atcgttatta gtacatgctg gtgggtactt agtctagaac acaattatta ttattattat 73501 tttctttttg agacggagtc tcactcagtc acccaggctg gagtgcactg gcatgatctc 73561 agctcactgc aatctctgcc tcctggattc aagcgattct cctgcctcag cctgctgagt 73621 agctgggatt acaggcgcat gctactgtgt gtgcgtgtgt gtgtattttt tttttttttt 73681 gagatggagt ctcgctctgt cacccaggct ggagtgcagt ggcgcgatct tggcttactg 73741 caacctccgc ctccaggttc aagtgattct cctgccttgg cctcctgagt agctgagact 73801 acaggtgcgt gccaccacgc ctggctaatt ttttatattt ttagtagaga caaggtttca 73861 ccgtgttagc caggatggtc ttgagctcct gaccttgtga tccacctgcc tcagccttcc 73921 aaagtgctgg gattataggc gtaagccact gcgcccagcc taatttgtat attttttagt 73981 agagtcgggg tttcaccatg ttggccaggc tggtcacgaa ctcctgacct caagtgatcc 74041 gcctgcctca gcctccaaaa gtgctgggat tacaggcatg agccaccgca cccagtcgaa 74101 cacaactatt tactcatggc aatgtcaccc atgaaggtaa acctatttca taaaattaaa 74161 taatatgcct ttttgataat aatgaaaata agacctcatt agtttgttga cccttctaag 74221 gacatcaggt ataaatctct tactggaatt tagcattttc ttcaattatg aaacagacaa 74281 acacagacga agcacagtca caaatattca tttggagtga cagattctat agcattattg 74341 gttctaataa catctgcttc tgtgaggact gagctatcct aacccttacc agcatgctct 74401 aacttgctga cagagcccac aaagatgaca ggaagggggt ggaaccaggc tttctgtgca 74461 ctgagtgtat gtgttaatac ctccaagaaa aaaacacaac aataccctca gaacttctag 74521 aattctgagg gtatttttgg ttgtgagcaa ataatttata tagtacttat gtgccaggca 74581 ctattcttag agctttacat atattaactc agaaattctt aagttttttg tttgatggac 74641 atcgcctgtg cctctggctt ggcaatctgg tcaagactgt agactcctca aagtaatgtt 74701 tttaggtata taaactacaa tacacaggat gacaaaggaa acgagttaca gtaaaacaca 74761 gtgacataca tgctcttttc ttaatgtatt aaatcacaac atctagggga aagggagtaa 74821 ctgccgtgaa ttcaaagcag taacaaatac aaacaatact ttttgcagat attgcaataa 74881 aggtattgtg atatgaagat atcagtgatt tctactggtg acaaatcagt tactacaaat 74941 actcttatga attatagcct gtttcataac tgaagaaaat gctttattcc agtaagacat 75001 taataaaaat aatgatgcaa catctttccc acccaagttc caaaccttct gatttctatc 75061 cattgccctt aggaatgaag ggcccctgta gtaacaactc atttaagctc acagacaatc 75121 ctttgatgag gtaggtagta tcatccctat tgtacaaatg aggactctga ggtacagtgc 75181 agttacgtgc tgcactactg caaaacaagt gaagtaaaca tgcacgcatc cacagcccca 75241 ccagtggtgg gacctcacct tgatgggggg cttccgagtg atgtgggaga caaggaagtt 75301 catggcaatg tcctcacagt tgatgtattc atccaccatg tcccggatgg cctggggcat 75361 cacataagaa tacaggtagg cataatactg tcaggggaag aaaaagaacc acatgctgtg 75421 ttacaagaca caggttgttg gctttcagcc aaaatatgca tggatggagg ggctgtttgg 75481 gtgtggcagt aactaggagg tattactggc acttagggac tggggcaggg gattcgagac 75541 atcctgtcgt gtggatcttc tgcagtgagg aattatccca ttcaaactgc catcatcacc 75601 ccctttagta acagaatgtc atatcatctc cctggtaccg cagtgatttt gaaatcaata 75661 caaagatttg tcaaactagg tcagatgctg gttcaattga acactatttt atctctaaca 75721 atggccaaaa aaaaaaaaaa agataagtga gagaaaaaag cctggttatt ttctcagacc 75781 tcaataaatc acagaaccat gaaacacacg atccctcact gcctcctgta cagattcttg 75841 agtctggtca gtactcgcca tcggccctgg ctactccctg ctgccaacca ccttcgtctc 75901 ttgcctggat tctccacatc agctcctaaa tattctccct gctgccatat tctcttcccc 75961 atgtgctagt cccagcgcag caggtgattg tgttaacact caaaccaact gaacatatca 76021 ccccttcgct ccaaagcctc caacacttcc catctcactc agagtaaaag gcaaagttct 76081 cagactgtcc tacaaggccc acagaggggt gtgttggagc cactcacacc tgctcatgaa 76141 tggcgctttc tacattttca gaatgttgtc agcttgttgt taaacatagc cattattaaa 76201 gatgtaatta cataaacttc aaattaaata aattaaaatt atattaaaaa tccatgcaat 76261 aaacacctta aactcattac ttcctagtta atattttact attaacttga ggttacctat 76321 atctactgtt gatgttgaaa ttactatgta atggtgtaca actgtgtatc tcttcccaaa 76381 tccgtgttca gtgactcatg ttgataactt caaatcagcc aaggtaagag tatttatacc 76441 atagaaatca gcaaatacta caagacaggg cacatgttaa ctgctatatg ttgcaatttg 76501 ctgtaatgaa caaatgaata ggtgaggtgc ccagttaaac tgattaactg atgaacatat 76561 tgcattacct acaatataat atgttgagtg aaataatgat aaaatttttt tgtaacacag 76621 aataaatgtg ctaattatct tatagcaaag tacttaagag ttggtgaact taaaaaaatg 76681 aattgtaatt ttttttttaa caaaaaggtg atccaggctg ggcatggtgg ctcatgcctg 76741 taatcccaac actttgtttg ggaggccaag gtcggtgaaa tgcttgagcc cagaagttca 76801 aggccagcct gggcaacaca gggagaagac cccatagcta caaaaaaata aaaaattggc 76861 cagatgtagt ggcatgtgcc tgtactgcct gctactcagg aggctgaggt gagaagatca 76921 cttgagcctg ggagttctag gctgcactga gccatggttg tgccactgca atccagcctg 76981 ggtgacagtg agattctgtc tcaaaaaaaa agagtaagaa taaataaaat aaaataaata 77041 cactttttaa aaaaggtaat tcaaatttat tgacccttta aatggccagt gactgtcctt 77101 cgtatgctga tgagaatata ttaacataac acgtcttgaa agaaatgaca ttttaacaat 77161 aagaactgcc ttttaataat aatttaaaaa aaactgatga aagcattatc agaataactg 77221 ttcagaggta tttccatcag tatgtggttt tgctgtcaaa aatgatttat gttgaccagg 77281 cgcagtggct cacgcctata atcccagcat tttgggaggc caaggcgggt ggatcacttg 77341 aggtcaggag ttcgagacca gcctggccaa cagggtgaat cccagctact ggggaggctg 77401 aggcagaaga attgcttgaa cccaggaggc agagactgca gtgagccaag attgcactac 77461 tgtactccag cctggagaaa gaagcgagaa gactccatct caaaaagaag agaaaaaaaa 77521 aagtttaatt tagaaacaga cctgacttgc tataacacac agtatccaat caagattttc 77581 aaaaataata aaacatattc aatcctactg ctttcactaa aatttaagaa ttgagtgatc 77641 acattatttt aaagttttgt ttcatcgtta tttcaacctc taaaaaatat ctatcagtaa 77701 tatacacatg cataaaattt ataagtaaat atacatatat attaggtaca ggtctaaaaa 77761 gtgttattga caggcactta tgattttaaa aaaaaagaaa aaaacttgac agctgttgat 77821 cagagaggac caatctaact gctttcgtgg accaagcaag taagacaaat gagtgtaaag 77881 aaatgggtgt aggccgggtg ctgtggctca cgcctgtaat cccaacactt tgggaggcca 77941 aagcgggcgg atcatgaggt caggagttca agaccagcct gaccaacatg gtgaaaaccc 78001 atctctacta aaaatacaaa aattagccag gtgtggtggc atgctcctgt aatcccagct 78061 actcgggagg ctgaggcaga attgcctaaa cctaggaggt ggaggttgca gggagccgag 78121 atggtgccac tgcactccag cctgggccac acagcaaaac tcagtctcat aaaaataaaa 78181 aaagaaatag gtgtaagaaa aacgaggagc cacaggcagg tgagcgcatg aaggccccat 78241 catgggcctc aactacagga gcagccgcca tgacgcccca gacaggacct cagaggacct 78301 gatcttcatt tgtattgcag ctcaggtctt tttgtgaaat cttgtgattt ttagaagttg 78361 tcagtgcata ggacaacact agagggccca aaaatctctc tgtaagccaa ctgaggtttg 78421 ggcgctgcta gtctgtaatc ttctttatag attttcacac aggaaaaata ctaaatttca 78481 ttaagtaaat gatttcttga aagtagaggt acctgaccat tcatggtttt aaagaacagt 78541 ctgaatctgg gaaggcaatt cagaagataa gtacatcctc aaggtatgag tagacgctgc 78601 taagatcagt ggctccttct tagctgagca agtgtgaaaa tcttggccag ttgctgacac 78661 cctaatcctc tgactctact tgcaatcctc agtccaaaca aggcccaccg aaggaaagga 78721 agtcctgagg tgaagtgcaa gaatgggatg agtgtatcaa cttcacacat taagttttta 78781 aaagaaaaag aacagctgaa agtttaacga ctgcttaggc tggttcaaac gtccctatat 78841 gtcaggcacg gttcctcaca tctgtaatcc caacactttg ggaggctaag gcgggcagat 78901 cgcttgagtc caggagttcg agaccagcct aagcaacatg gcgaaactgc atctctataa 78961 aaattaccaa aaaaaattag ccaggtgtgg tgatgcgtgc ctgtagtccc agctacccag 79021 gagacagagg caggagggtc acctgggccc aagaggtgga ggctaaaatg agctgagacc 79081 ccaccattac actccaacct gggcgacagt gagaccctgt cttaaaaaat taaaaaagtc 79141 cctataaaaa tgaattttat tgttctattt gaggtgactg gcaagatgcc accatctgag 79201 atgggagata tgtaagggag aaaagacttc aaggagctag ggagagacgg tgagctttcc 79261 tgggaaaagt ttacctgaag tgtctgaggg acaaacggga gatatgctgg aaacaatgaa 79321 atatacaaac gcagacctca gcaagaaagg ccaaggctgg aatacagatg aggaaattac 79381 cagcctgcag atgctaagaa aagcctcaaa accttgtgtg tgagacagaa cgcctaggga 79441 aaataagaag agcaacagag gctagacccc gggacacttc accattcatg cagagagagt 79501 ggtgggaggg tcttccgtga ggacagtgga ggcaccagaa ccatggaggg catggatgca 79561 gacaaagaga aggaggcagg tgccaccgtc tttggtgact gtcagggcac gatgaaaagg 79621 ctggttgatg gcagcaagac agacgacagg agctgcaaat gagactttat gtgacagctg 79681 ggagggaagt gtcattggta agcaatgaaa atgttcccta cacctgccct gtgccaaagc 79741 acagatgtgg ggaaatgagt gcctcaaagt ctacaggaaa aggctaatgg gagcactgtc 79801 ctcagagaag actcagggca cagaagaggt gctctgtgtg gtgggcagtg ggggtaatgc 79861 cagggtaatc ttagaacagg gactcctcag ggcccgggaa cacttcagga gggaggtaga 79921 gagcggcact cacggacaca gaaggcaaac cacatacagc actgtaaact ttctagaagc 79981 tacatcgtta aaaagtaaaa agagacagta aaaatcaata actgtattta acccagtaat 80041 ccaaactaac tgcatttcaa gatgcaatca acacaaacaa ttactgagct atctgacacc 80101 ctttgttaca agtttttgaa agctgttgtg cactttacac tgaacagcac gtctccattc 80161 tgaccagtca tgcaccaggt gatcagcagc cacttgtggt caggggccac tttacaggat 80221 ggaagaggta gagagggaag atgggccagg agaaaaaaac agaatacaga acagtagagg 80281 aggaaagact gcagggtcct aagcttcaga tattcagtga aaatcagatt aggaggcaca 80341 gtgaaagtaa taagcactaa agcatcacaa agaactggca gagccacaca gaggctcatc 80401 gtggggcccg ggacaggcat ggtatatcta agtcagaaaa gtgcccaggt caccttctga 80461 tggctgggcc atatctaggg tggcagtgtt aaaactggaa ggtatttgag gtgtctttta 80521 gcccagtgcc ctcagtttta caaacggaga gccaacgccc agaaagataa agtggtttcc 80581 aaatggccta tgtgcaactg tacaggcagc cctctcatct tgacttttta tcccagagtt 80641 gctctaagca tcttgatcat tgtctgtaaa aatagaaaaa actgacttct agcacaaaag 80701 aaacatgtaa gaagcgttag gagagctaag ctgagggcag cattccgcta ccacacaaag 80761 gtgaaactct caccaagtcg atgccattat taccagcttt ttcttacctt gtgaaagaag 80821 gcagcacctg tcagcaccat ggacagctca caggagtagt tggagttgta gagccaggac 80881 tgatggggga tgtcccatgc gtggtaacgg ccagggaagc ccacgatgcg gtcccgagct 80941 tctctccaca cccttgaaaa acacaagtgc atacacagac ctgaatacag agctctaggg 81001 tcatcagaag tgttcacagt tattgcctcc accttacaag ctctggccct taggctttta 81061 cttctcgtat cctttcaaaa taaaacaaaa tcaacaacaa gccaaacagg ataaaagcaa 81121 ataaggtatc atattcagct tccttaataa gcacctgcac attgtccctc tagcagtcag 81181 catcctccag cccttccaga aagaataagc cctaagtttg gaaaggggat ctccagaatg 81241 gggtatgtac aatatctact aaggagggct ccagaatggg gtatatacaa taatctacta 81301 agcagcagaa agatgatatc aatttcaatt ctttttttag cttatttaat ttccaagaaa 81361 gggcttggtg ggatggctca tgcctgtaac ctcagcactt gggaggccaa cacaggagga 81421 ttgcttgaag caaggagctg gagaccagcc tgggcaacat agcaagatcc tgtctctaca 81481 aaaaaaaatt tttgtttgta attagctggg tatggtggag cacacctgta ctaccagcta 81541 cttgggaggc tgaggtggaa ggactgcctg attctaggag ttcaaggctg cagttagcta 81601 tgattgcacc acctcccttg gcctgagcaa cagagcaaga tctggctcta aaaatgaatg 81661 aatgaacaag cattttctaa gaaaggcttt gtttttaata agcatgacat attagttcag 81721 aaggacatgt atgtaattta catatattgc acacttttct tttacagaga aggaacataa 81781 taaaaaggtt tagagagcac tggtttaacc acagaagact actgaactgc accactccta 81841 attccaaatt tgagcagggc tgacggagaa acatgtatga tgagaagtgg cctacagaac 81901 catacaactg aaaggtttca ttaaatggaa gaaataaatg gagacttcag tatgtttcag 81961 tagaaacttc tatatcatct ccaaatttat aggtaaatta gaacaaataa aattggtccc 82021 cagtttcaca ggataaattg gagaactgaa agcgtttaag ctccacagga cctgacaggc 82081 ctgcagaaag gctgccagag atttaaactg cctgcaaact ccctcatcac ttacatggaa 82141 cttcagttcc taagacacag aagattttat ttcaacagag ttcctctcct aataagtcta 82201 gaagcatcta atctaatcca aaagaggaga aatcacaact tctatcacaa tgtaacagcc 82261 ttctaggtgg gtttttttag acaactgatt ttttttaaat tgtggcaaaa caaacataaa 82321 atataccatc ttaatcattt ttaaatgtat ggttcagtgg cattacggac attcacagtg 82381 tcgtgcaacc atccctgcca tccatctcca gaactctttc atcttcccaa acggaaactc 82441 tgtccccatt aaacactaat ccccactccc accttcccac agcccggcag cccctattct 82501 actctccgtc tctatgaatg actacctagg ggcctcacat aatggaacca cagtatttat 82561 ccctctgagt tgtttgcact tctgttacaa ataacgctgc tctggccatt tgtgtattcc 82621 tttctgtatg gacacatgct ctcaagtctc ttggtatacc ttttctgtcc cttatgattg 82681 attgtatctg cctctttctt ggctacctaa gttgaagtga gtcaagatct atctttgcca 82741 gaagaaagaa ttcttagact taccctttcc tttgaactta ggtctgtttc attcccatta 82801 aggtgaaata agcaaattgg ggagattaat aagagaaagg ttttagatca aaggatgccc 82861 aaatgcatga gaaaagggtc agggtaggaa aaggttagga tgtatagaca gcaatgataa 82921 ttcaccagct ccattaccag aggctaaatc tcaaacatga atgacagtta agagacacat 82981 taaaaggctt cccattattc tctcaccacc tgcaaatctg ctggaaaata gcacgggcaa 83041 ggtaagaagt ccctaaatca ggggcttgga agctatgtta atgccagcta tgttaatagg 83101 cttcaaactc cttaaagctg ggctccttat caaaatcatt cttggatcta agggttggca 83161 gttctcctgt taacactcca cgactatgct caccacgcca gtccttcggc acgctccaaa 83221 ctgcatcacg ctgcagcata aacacactcc ctaccgccca cccccaccac taccacctgc 83281 agcagcaaag atcatgcctg gagttactgc atggcttttt tcctttcata aaaacaagtg 83341 gagagagtca gctacttatt atcgtgtaaa aaaaatacac ctcggtttac caggattttt 83401 tttttaatca cagctgtcaa cagacttggt tcaataatac actaagcaag aggtcaaagg 83461 aaatgtgaga ggctgggtgg gggagaataa gaacagatgt tctaattttt cagaaatgtg 83521 tcaaatcatt ctttacagat ggatttaaga cagatgagca ataaagcctc tgctcctttt 83581 atctgagcat ctgctcttac aagcctaagc caaaggcagc tccagagcca ggtaggtcag 83641 gttaggcctt cagtgaacag aatggaagca cagagaaaga actctctcta tcctggatcc 83701 acacttaatt tgaaaaagat cgccaaagaa atctactcca gtgttttttt tgttttgttt 83761 tgttttgttt tgttttagct ctgttgccca ggctagaagt ggcatgatct tggctcactg 83821 caacctccac ctcctgggtt caagcaattc tcctgtctca gcctcctgag tagctgggat 83881 tacaggcgca cgccaacacg ccccgctaat ttttatattt ttagtaaagg cagggtttca 83941 ccatgttggc caggctggtc tcaaactcct gacctcaggt gatccacctg ccttggcctc 84001 ccaaaatgct gggattacag gcgtaagcca ctacacccgg cctccagtgg ttttcaaatg 84061 atgtggggaa gaactaattt ttccccaaaa ttattataga ttaatacttt ggtaaaatac 84121 aacaaaaatg aactgcctgg tttcttaaat atgacatcca aagcacaagc aaccaaagaa 84181 aatagatcca ctgaacttca aaacacgaac cctgtgcttc aaataatacc atcaagaaag 84241 caagaaaata acccatggaa tgggagaaaa ttgtgcaact ccaatcactg ataatggact 84301 tgcatctaga atatataaag aactcttata acgtgataat aaaaagacaa tcctggcctg 84361 gtgcggtggc tcatgcctgt aatcccagca ctttgggagg ccgaggcggg cagatcacct 84421 gaggtcagga gttcgagacc agcctgacca acatggtgaa accctgtctc tactaaaaat 84481 acaaacatta gccaggcatg gtggcaggcg cctgtagtcc cagctacttg ggaggctgag 84541 gcaggagaat ggcgtgaact cgggaggtgg agcttgcagt gagccaagat cacaccactg 84601 cactccagcc tgggtaacag agcgagactc tgtgtcagaa aaaaaaaaaa aaagacgaca 84661 atccaaacaa aaatgggcaa agaatgtgaa aagccgtttc tccaaagaag atatacaaag 84721 gctaactgat caataagcgc atgaaaagaa gctcaacatc attgagagaa atgcaaatca 84781 caactgtacg gccgggtgct gtggctcatg cctgtaatcc cagcacttgg gaggcttgct 84841 cgaggccagg agtttcagac cagcttgaac aataaagtga gaacccatct gtacaaaaaa 84901 aaaaaaaaaa tgtaaagatt agccaggtgt ggtaatgtga gcctgtagtc cccgctactc 84961 aggaggatca cttgagccca ggagttcaag gttaccacat gctaagattg caccactgca 85021 ctccagcctc agcaacaatg tgagacccca tctgtgtgtg tgtgtatata tacacacata 85081 cacacacaca cacatttata tataaaatta gttatcactt tacaatgact aggacggcta 85141 taaattttga aaatggaaaa taacaagcat tgacgaagat gtggagaagc tagaaccttc 85201 atacactgct ggtgagaatg caatatgggg ctgccaccgt gaaaaacagc ctgaccggct 85261 caaaatgtta aagcagctat catgatccac ccacattact cttaggtatc cactcaagag 85321 gaatgacatg ttcatacaaa aacttgcgca tgaaggttca cagcattatt cataatagcc 85381 aagaaataga aatgacccaa atatccatca acagaaaatg aatgaagaac tggtacctgg 85441 gctgggcacc gtggctcatg cctgtaatcc cagcactctg ggaggccgag gcgggcaggt 85501 tgcctgagct caggagttca agatcagcct gggcaacatg gtgaaacccc atctctacta 85561 aaatacaaaa aataaaatta gcttggcatg gtggtggtcc atacctgtaa tcccagctac 85621 tcgggaggct gacatgaaag aatcgcttga acctgggagg cagaggttgc aatgagctga 85681 gatcaagcca ctgcactcca gcctgcgcaa cagagtgaga ctccatctca aaataaaaaa 85741 gaactggtac ctgctacaag atggatgaac cttgaaaaca tcatgttccg tgaaagaaga 85801 gagtcacaaa aggccatgca tcgttgtaca gttctattta tagaagatgt ccagaatagg 85861 caaatctata gagatgcaaa gattgagtgg ctacctagga ctgaggggtt tggagaaaaa 85921 ttgggagtgg ctgttaatag gtacagggtt tctttcagtg gtgatgaaga tttctaaaat 85981 taaccatggt gatgtttgca caactctgaa tatactaaaa ccactgaatt gtacacttaa 86041 atgagtgaat tttatggggt atgaattata ttgaagaaat gttgcaaaaa aaagaactgc 86101 aagaaaaata atcatatact tggatttcat agtaaatgtc aaattgcttt acaagtttct 86161 gaatgcttac cctcaatttt tgtacttacc tcaccattaa caggtaacaa actgtcccta 86221 aaccaacatc ccagtccctg agatacctgg agtagccttc atctactcca tcctcttccc 86281 tgcagtgacc ctcaagtggg atccttcagc aattcctaag actcaagaag gcaggagagt 86341 tgaaggccgg gtgcaggttg ggagtgtgac aaacctgcat ttgaacccag agctctgctg 86401 ccactttcta gcttctacgt ggttctgttc tcttctatct caatttactc ctacatgaaa 86461 tggagacagc tacaatttat gtcatcaaat tttagaagga tgaatgagat aagacaaagt 86521 cctaggctag tccctggcac acagtacggg ttcaacatat gtttaccatc atcatcatca 86581 tcatcattac caccacctcc ttttcctcct ccccttcttt ttccttttaa atcattgctt 86641 ctgacaccct ccttccccca aatctttttg ggtccaggat cctggcactg ttccattgct 86701 ccaacacaca gcaacatgtc acttttgcct tcccattcct ctaaaaacaa aaccctccta 86761 tttcctttag agaactaccc tacccgttgc ctctactctc tgcccatgtg gtttggattt 86821 aaggatgata cacctgcagc accaggaaca ggcaggtaac cagggtctag ccaatcaaag 86881 aattccacct tcctggccac agaggaaagg cctgtgggaa cacagagcgg agcctacaga 86941 tgaagagaga tggactcctc caacgccatc tacgagcctg catccagcca cgtcctacat 87001 cagccctgac tatctgcaag gggttctcag ttaccatcag ccaaaaaatt cattttgcag 87061 cctaatccag gttttctgtc acttgcaacc taaagttttg attggaaatt agtctctcac 87121 cggaacccaa acatgatttc gtcatggcgg aggtgagcat cgtcatcaat ggacaggatg 87181 gcctctgtct caatttcatt ccagggtaag aatcggttgt tcaaactgtt cttctcagta 87241 cggaccacct gtgatgagga aggaaaaaca ttaaaaatta aggctgtgtt atgaaaggcc 87301 aaacaaaatc tgtatttagg tccaaggaga ccatggctgg atttactgaa taattttgcc 87361 tgatctccgc gcttgtaaaa tctagcatat gcctttcagg aataaaagct gccttatact 87421 tcaataaatg tatatagatt taccttttaa gcttcattca ttagttagct aattttcttg 87481 tgaatcaagc aaaagctgaa gattatttta tacacgcaat aaacacgatg tagggaaatt 87541 aaaaacaact ctcccaagag aacacaaggt ggcagagtgg atctgagatt ccaatggcta 87601 tggaattccc agcatgcttg ttaattttaa aacccaactc agaaacctca tgagtctgtc 87661 acttctgact ccccaattct aacgcctttt tgggatataa atcccaaaaa agagcacagc 87721 ccatctggtc gagattagtt acttcacctt tgaaattcct acctacaatg ctgactactc 87781 gtacacaaac tttttccttc ttttcaaggt atcatgtact caagtacaac agcttctgcg 87841 tcttcagcaa atcccaattc aaaacacatc taagtgattc aacatacatg caaagcagta 87901 tttccttcat aaaacagaaa ctggtgcttc aaatagtaca actacataat gaaacaattt 87961 ttatttaacc atatctcagt taagtatagt ttacctacag tgtgggtgag tagctgtgtt 88021 attcaccttg ccacctaata ctcatataaa tgatgaccac agccagtact tggatggctc 88081 attttattct tagagtgtct ttgtctaatt agtccaacca aaggggaacc attattttgt 88141 tctcaaaccc caaaaacaaa gagcatctca tgaagaataa tctttttaga atgccacgaa 88201 aaatcacctt acttccaaca gactatttta cttgtactga gaacaacctc tacctggcat 88261 gatgaattaa ctgcatccga ggacttaaat ttatgaatgg tttccaagga gctctgtgac 88321 ctactagcat gtctcttcaa cttcaaatac cttctcttcc atcctccccc tggaggtcca 88381 gttcagatgc ctcttgccac accctccttg ccaggagaat catttattct atattcttaa 88441 tgcagagccc tcatacttca attaattcat tctagcacac ttaaaatcca attaattcag 88501 agctagaagg gctttggaga ctatagcgtc tgtcacttta cacatgcaga aactgaggcc 88561 cagagtgatg tcatacaact ggcaagttgc aagagccaaa actctaattc ataactttaa 88621 aaaaaaaaaa aaagcgagtt ctcgaagtct catcactatg ttccccccag gcgtctcgaa 88681 ctcctgagct caagagatcc tcctatctcg gctccgaaag tgcaaggatt acaggcatga 88741 gccaccacac ccggtcctaa ctcatacttt gattccaaac ccagtccttt tcctgataaa 88801 cttttgttaa ctttataaac ttcttcaaac caaagccacc atagaaaatg cttttttttt 88861 tttttttttt tttttttgag atggagtctc actctgtcac ccaggctgga gtgcagtcgc 88921 gcaatcttgg ctcactgcag cctctgccct ctgagttcaa gtgattctcc tgcctcagcc 88981 tcccaagtag ctgggattac aggcgcctac caccacgcct ggctattttt ttgcattttt 89041 agtagagacg gggtttcacc atcttggcca ggctggtctt gaaatcctga cctcatgatc 89101 cgcccacctt ggcctcccaa agtgctggga ctacaggcac gagccactgc gcccagacat 89161 tttttttttt tttttttttt tttttgagat agagtctcac tgtggcccag actggaatgc 89221 agtggtgtga tctcggctca ctacaacttc cacctgccag gctcaagtga tcctcctgcc 89281 tcagcctccc aagtagctgg aactacaagc agataccacc atgcccagct aattttttta 89341 tctttgtaga gacagggttt caccatattg cctaggctgg tctcgaactc ctgatctcat 89401 ggcatctgcc tgcctcagcc tctcaaagtg ctgggattac aggcatgagt caccacacct 89461 ggcctgaaaa tgcattatta atctgtgtac catcaagaaa aaacaatgtt gccaattaag 89521 aaggcatgtg aaattgatga tcccttgttt acttgattac agaacttaaa tttttttttc 89581 ttttaagaga tggagtcttg agttgtcacc taggctggag tgcaatggtg ctatcatagc 89641 tcactgcagc ctagagctca ttagctcaag tgattgatcc tcttgtctca gctccccaag 89701 tagctgggac ctacaggcat gcaccaccac acttgggtaa tttcaaaaaa aacttgtaga 89761 gacacgttct ggctatgtag ccttgactgg cctcaaactc ctggtctcaa gcattccccc 89821 tccctcagcc ttccaaaaaa agtgacagga ttacaggcaa gagtcaacac tcttggccag 89881 agctttcttt aagacttcac ctcagcccca gaggaggtcc tgcccaactc aagacaaaga 89941 aggatctgta acagattcac caccacagtt aacagatgtc caagccaagc aacagaccga 90001 gaaatccacc ttgccctgca gcatgtctga ccagcataaa aattcccaag tgtacagccc 90061 agggtatcct aagctcagag tccacaatga caaaacgaag gaccgagtga ggcctaggtc 90121 agacgagaga gcagcaagga gagcagatgc caagtgctca ccttagcagc tgtcggttcc 90181 actcgccaaa gggcgggagg gtggcaagaa ggggccggac ttgaatggca agctcagcaa 90241 tggtaagagg ccatccattg taagacacat ctcaatttca gagatgacaa aatgtaaaat 90301 aaggtccgcc ttggaaatga cggcatatgg tagctgttca caaactccct caacaaactc 90361 ccctcgaaca ttcactttac ctaacacacc tagcattcac tcagtacaga actgattctg 90421 ccaattcagc caaacaaagc tccccctcac acagcttaaa atgaagaaaa accacttcag 90481 ttcttgaata ttggcttgta gattatcagt tttgtgggtt aaccttcagg tggattatct 90541 acggcacaat tagtaaacca ggaatatagc aaggagcttc agagttcaaa gtgtgaggcg 90601 aagaccagca gcacacacca ccggagcctg taggagtgca ggcacacccc aagcccactg 90661 agtcagaatc tgcattttaa catgcccctg ggggattcct gtgcacatta aatggggaga 90721 agcactggta cagagggaga aagcatggct ttggggccaa tcagaaaagc ttgggttcaa 90781 attccaactc ctcctcttac tagacgtgtg aatgccagca ccctctctgc taaatcaaca 90841 tagcaccaca ctgtttgcaa aatctgaagt tacttatcag ccaaacttga caatcctata 90901 aacaacctaa ctctgcacct gaaaacgaaa aacaagaaaa actacaatga tttgatatct 90961 agatcatatc caaaattatc taatttacaa acaaccaaat caagagaacc tacttgtgct 91021 ttagaagact taggtggggt catgcagctg gaggtcaaat atcaaagtgt tttggcctag 91081 atttcacact agtttttttt agtaagttta ttaaagtcca ttacttagat atcaagaagc 91141 aacaagagaa caactactaa ggactccagg aacacagggc gcctgccatc tctgctcacc 91201 ctctgagcac aactgctctg ggctggatga caacagctgt tcaggtatag caaactgcat 91261 tttaacaatc agaacagcaa tcagaataaa agggccaggc atggtggctc acacctgtaa 91321 tcccagcact ttgggaggcc aaggcgggtg gatcacctga ggtcaggagt tcaagaccag 91381 cctggctaat atggcaaaac cccatctcta ctaaaaataa ttttttaaaa atctagccag 91441 gcatggggga gggcacctgt aatcccagtt actcaggagg ctgaggcagg agaatcgctt 91501 gaacccagga agtggaggtt acagtgagcc aagattgcac cactgcactc cacgctgggc 91561 aagtgattcc gtctcaaaaa aaaaaaaaaa aaaaaaagaa aagaaaagct gttaaagatt 91621 cacagaaaca caacaccaag cactacagtt ttgtcagtta gctgacaaaa ctaactgcag 91681 tcagtaagtc agctttaaga attcagagca gtggttctca accaggaaca attttgcctc 91741 gggctacatg tggcaatgtc tgaagggatt tttggttgtc acaactggag aaaagggtgc 91801 gctacttgcg tctagtatct agtgggcaga agccagggat gctgccagat cctatagtgc 91861 acaagacagc ccccacaaca gagaattatc tgacccaaaa tgtcactgtg ccactgctga 91921 aacaccctga tttagagtca acctgcagga agacagtaaa ccaaaacagc acttggaaga 91981 ctaactatag ttcattacct aagatgttcc ccttttccct atagccgcaa aaagatttct 92041 gccctcacaa actttgcaaa cgccaactaa aactaaatgg gtggaagagt aaaagttttc 92101 ttctaacagt tttgcttcaa agctgcagtg cttaatggct aaacaaaagc tcagcaaacc 92161 aactattatc cattctggca ccaaaatcag aagaacagaa aggctcaaac atttctaaat 92221 gcaggccggg cgcagtggct cacgcctgta atcccagcac tttaggaggc cgaggcgggc 92281 ggatcacaag gtcaagagat ccagaccatc ctggccaaca tagtgaaacc cagtttttac 92341 taaaaataca aaaattagcc gggcgtggtg gtgtgcgcct gtaatcccag ctactcagga 92401 ggctgaggca ggagaattgc ttgagcccgg gaggcagagg ctgcagtgag ccgagattgt 92461 gccactgcac cacagcctgg gtgagagagc gagactccat ctcggaaaaa aaaaaaaaaa 92521 aacacttcta aatgcagact cacagatcag cacggcctct aagaatctga gaaaagacag 92581 atcgaacata aaagaaacaa gtcaaccaga gggactgtgt catatttagg aaaggttctc 92641 atttttgttg atgttgtttt gtttcaaatc aaaccaacac tcttccctca accccacaat 92701 actggctatt tcttcatgtt actacagcat attgctatta gatgccttat gattacatct 92761 tagtaacttg caaacaggaa gactcacttt caagtgattg ctttaattac tggtatgaca 92821 ttaaccaaaa tgaatagacc acagtgcctg gcaatatagc agatgttcaa caaatgtttt 92881 ataaatgaat gaatgggcag aaaatagaac ataatttagc cctgccattc tatttacaga 92941 atatgaaata aagacttgag aagtttctag atcaaaatta taggtaaaca ttcaatatct 93001 ttaataatct taaagaatga tagagaggaa ttaggaaacc tcttagtatt tagtgtagtt 93061 ttctatagca aaaaacccat ccacctccat caagccagga gcaatgccca ctctttgctt 93121 ggcctgtctc acacacaggg ctccctgacg gtgcctcgct agctcttctg cacaatatca 93181 ttcacgggac ccttgacctt ctcctatcac aaaggaaaag ggacagcaat cgtggcctgg 93241 aacctgccac ctatgaaatt tggccattta aatacacttg aaatgcccct tttcagatta 93301 catccggccc agccaagccc gacaatctcc atcctccaac aaaacatata tacgtacata 93361 atacatccct atagcaaatc catatctgag aatgaaactt aacatcaagc catcacacag 93421 gcaagaaagg aaacagcaac tgaccttagt tctccatcat ccccttcctc caacttaaaa 93481 gaggaaccat cagagaactc aggaatgagg aaaatgagat ccaggaagag gcacacagtc 93541 atgcccaccc agctcaggag gacctaggta acagagcttg aagtgagtgg ggagggaggt 93601 gagcgatggg agggaggtga gcgacagaga gaagatgata gaaagaggac tacatcatca 93661 tcatcattat tattattgag atggagtctt gccctgtcac ccagactaga gtgcagtggc 93721 acgatctcgg ctcactgcaa cctctgcctc ctgggttcaa acgattctcc tgcctcagcc 93781 tcctgagtag ctgggattac aggcgtccgc cactgcacct ggctaatttt tgtatttttt 93841 tttctttttt tcttcttctt cttttttttt ttttaaagca gagacagggt ttcaccatct 93901 tggccaggct ggtctcaaac tcctgacctc gcgatccacc catctcggcc tcccaaagtg 93961 ctgggattac aggcgtgagc caccacaccc agccaaggac tacattattt aagggattca 94021 ttcaataaac gtcaagtgat ggggcagaaa gcaagaaaac gcaaaggaag aaaagagaat 94081 aagaaggtaa cagtgcattg gttttccatt tataacttta cacagggatg tcatacagta 94141 caaacaaaat tgtacatgtt ttagatgaga caaatctgtt ttaacttata agagaaaaag 94201 ttgccaatga tcccagtgca agtgcaggta agaaagccta ggttagcagg tcaacaaatg 94261 agagaatgca gataaagacc atccacagtg cctagcacac agaaaatgcc caaaaactgt 94321 taacaattat tataacatga tattagcagt ctctatttta attttcatac attttacatg 94381 tatatttcat attctgtatg tattttaatt tttatacatt ttctatattt tatacatatc 94441 tttatttaaa aaaacaagtt tgtgcttctc caagaaattt acacgtggaa aaaaaaaaag 94501 aaaaaaaata catatctatt gtcagaagtc ctaagacctg gtgctggtgg tggctcacac 94561 ctgtaatccc agtactttgg gaggcagaaa tgggcagatc acctgaggtc aggagttcga 94621 gaccagcctg gccaccatgg caaaatcctg actctactaa aaatacaaaa attagccagg 94681 cgtggtggta tgcgcctgta gtcccagcta caaaagaggc tgaggtacaa gaatcactta 94741 aacctgggag gtggagactg cactgagcca agatcacacc actgtgctcc agcctgggca 94801 acagcgtgag actctgtctc aaaaaaaaaa aaaaaaaaaa acagtcctga gccctcattc 94861 taatacaggt atcagttagt caagtgacct gaagcaacag aattcttaca gtctcagatt 94921 ccttactttg aattagtaaa aagagtacac atacactaag aggggaagac attacctcaa 94981 gaatcaattt gctgcaatta gtaaattatg caacatgact ttccagcaat tgctttaaac 95041 ttctgtattt cttagtattc atttttggtt cggggtagcc ttgttttata taattttcct 95101 ttgcagccat acagcccatt cgcaaacaga aacccacagc tatagccacc aagttattaa 95161 gtaaaatgtt gtcaaagaga aagaccaacc acccagatgt gccagctcct agtgaagtgc 95221 accagacctt gcacagtctt ggacctggag aagctggaca aggtttttcc tgctggcttc 95281 acctagctat cacaatttta ggaaattatc gtctcattcg ttcaagggat atttttaaaa 95341 gtagagtggg cagaaataaa aaaatacagc ttaccaacac tttaaggagt aagccctgag 95401 aatgatctcc actctcttgc ctgaggtcta gccagaagcc aagcctctta gcctgagagg 95461 cggagtcccc agccagaaag ttcctgacgc caagagtgca ctacggatgc agcttctctt 95521 ccagtcttcc cttttcccta atagactact ggggagagga tgaaaataac tcccctggaa 95581 tgatatttat attacccaaa aaaagaactc tccctgttca atttgaatat caagggctgg 95641 gacagaggga aaagggcatt gaaaaataat aatcttgtat ctctcttttt tttttttttt 95701 tttttttaga gacagggtct ccctctatca cccaggctgg agcgcggtgg cacaatcaca 95761 gctcactgca gccttgactt accaggctca agcaatcccc tcacctcggc ttcccaagag 95821 cctggattac agacatgcat gatgcctggc taattttttc tatttttttg tagagatggg 95881 gtctccctat gttgcccagg ctggtctcaa acccctaggc tcaagcagtc cacccacctc 95941 agtctcccaa agtgctggga ttacaggcgt gagccactgc gcccggcact atcattttca 96001 tttggaaaaa aaatggtgca ttctgacctc atcacttcca cagagacctt gcagtctgca 96061 aggatgtgtg ctatgctgat ctctgaactg gttctctcta ccaccgctcc tcgcctaggc 96121 tactgcaagt ctttttgctt ctgctctttt ccccatagtt ccataaaaat catgtgcctc 96181 ctctgctcaa caccctccaa gggcatccta aggcagacag gataaaaccc agacttccta 96241 accacgacct gcactgtcct gcacctgctg gtcccactgc cttctccaac ctccttcaaa 96301 cgcgccaccc ggatgcactg ggcatcgctt ctggtgcttg ccattcccca tacatccctc 96361 cagaatttac atggcctcct ctctcgcttc attcaggctt ctgctcaaat gtcacccctt 96421 ctaaaagccc ccttccaagg caccctgcgt caattagcca taccctttat gaagaagaga 96481 atgaaaacct aagactcagg gacgggctgc caagagactg tctcagcagt cagtgagtat 96541 acagtgtgaa gggaagtgat gccttgagtg agctagacta cactgttagt aaatgaaaga 96601 tgtccctttc ctaacagccc acatgttaca actccaaaag gacagactct aaaacagcca 96661 ccctacttac tattttccag agtataaagc agagtaagga agatgtgtaa actggtcaga 96721 ataaagtagt aactcaaacc aaaattttta atgggactat ctatcaagaa gggattactt 96781 ggcatttctg cctccagaag agttcagtaa gcccctgcca gacccagtcc tccctcagat 96841 gacaactata acctctgcac aaaaatacca aaaaaagaat ttccagaagg cactagagag 96901 tgaacaaaag acaaccaatt atggaggggt gctaaaattc agagggaggg aattactgac 96961 acagggagaa ttactgttgc tttcaccctg agagtaggcc agagttggta ccaagaaaga 97021 cagctaaaac tctcatacaa aacccatggt ctttctggcc tgtaaaggaa atgtgtaagg 97081 taaccacagc ctgtagaaag aatggagaaa attccagaca ggagaaagcc agagagaggg 97141 agctccaagt tctgcgtaga aactgctctg tctctggccc acccctaagc catgcatgct 97201 tggtgcaggc tgtaagcaga ccagctacat ataaaagaac tcaacatgag agtggccatt 97261 cacgagacag ggctttcagt ctgagtcaat acagctaacc acctactaaa acaaaaatat 97321 caacactttc cagaataaaa atcaaagaaa accatgctaa ggcataccac agtcaaactg 97381 ctgaaaacca aatacaaaga aaaaatttca aaagtagcca gagaaaacca caccttacat 97441 ataaggaaac aaaaatttga aaaccactga tatatcctca gaaacaacgg aggcctggaa 97501 acagtggaac atctttcagg tgccatgaaa gtggtggtcc ccaacatagt ggagagtttt 97561 tcaaaaggct agacctcaaa tcccttggca taataatact ctctagttgg ctttcattag 97621 atatctttgt tgttatgctc caggagctaa gggacctgat ccatgtgttt acaaaatata 97681 caagagcaag ggagagagca gacactcacc atcgcccctc tgtttggtag tcctacccca 97741 ttcaacggga gacccacttc aactggtggc acttctccca tctctctgca agtcctgtct 97801 ccttgccccg ccaccatccc atttgtgctg aagttctctt tacacagagc atttcaatca 97861 ggagtgttca agggtggtgg caataaagat caccttcact ctaagctaga tctttttatg 97921 caaataatta tttaaaagaa tgaggaattt taacatataa gtcctatggg gcacccctaa 97981 gacaatcctt ctccacttaa aatagctggg gctcaataca cttcacagcc cacaaacacc 98041 cagcacttat gcctgttgct tagtgggaac ctaaacataa gaggagcccg tattgcccag 98101 cactttctga aatggcacgg aggttcctgg gtagattcac tgatgcctgg gaacaaccct 98161 ggtgctaaat ttataaaaat taaccttagc gtattgaatt ggctacgtct acatctagaa 98221 gaaaaaccca ctctgaggtg tatcacagta gtgccctttt tctatagcag agagagctac 98281 cagtctcttt ctagctctga tagctgggta catccgagat gtcagcaact tcaactgttc 98341 cccagaacac ccgcctctcc tagatagaaa gcacaaccac aatatttaca ggatggagtg 98401 aaattctcca tctgaagcta tttcctcttt tttaaaagga ccagaaaaaa aacttgtatt 98461 gctaatatga gaaagctgtt tagaatagcc tatctgtaaa gtttctggca ttttccaatt 98521 aggtattatt gcgatgggct gcccaatagt caggactact tatttcccat cagagttttt 98581 aaaaaaagat tcattctggt aagttcttga tgaatttcag tcaacttaac tggtatggca 98641 ccagcttctc tacatcccta tcaaaatcaa agaaaactca ggaaaaatgg aaaacaatgg 98701 ctgctctatt attctactat ttgaggagca tctatctttc acagagcaaa tgctttctta 98761 atttcaatga cataaagttg taacagaaag aaaaaaagtg aactttgaga agtctattaa 98821 aaaaattccc tatttacaaa acttaatata caaaatacac tgggataaaa aggatttata 98881 accctacagt ctttgaatag cttctaatta taaattcaat taaatttaaa aaaagattag 98941 cagcagtaag aaaaaattta aagcaaagag gcactttgca cagaaggaag taggcagtaa 99001 caactatgac acaaacagaa atgatgtaga gaggatacaa gaagccttta tgagtgaagt 99061 cagttaaagc tgcccagagc ataggcaaga caaacatact ggcttccatc tcctttaaca 99121 ttaagggata agaaggaatc aaataagtga gcacctttct agaatcctta gttgtcttat 99181 cgtagtttcc tctttaatgc tgagatcaaa aaagctaatt atcaaagatc acgaaatgac 99241 tacttaatcc caggtctgta tcactccaaa tctcatactt attacaccat gctgctgctt 99301 agaaaaataa ttcaaatgaa ttggcctccc agtgaagtac attttttaaa aaccgagact 99361 tctagcaacg tgtggcccat cagacttttt gctaccttct gccaggaagt aactacatat 99421 gcagcaggtt aaataggtgg cagtctgctt aagacctgct ctaaggctgc acatttaaga 99481 gagatggtcg ccatctctct cctagaatgc caagtttaat tctgaagatg gtaaactcct 99541 cagaactaaa gccctgtcct gcatatttag ctattatttt tcctgtaaaa tacagcactt 99601 aaccatgaga tggagtaaag aatgagaaag aacctacaag acaccctgga aggttcaatt 99661 ggagttggtt ctccaactcc aaaatatcaa accccaactc cagtcttcca aaagacttct 99721 atgaatacct ggaaaatgac acaggccttc ctaaaccctt tgggaggtga ctaaagctgc 99781 cgcttctgga atcagacata ctaaagctca gcttctccat tactcaccat gatcttgggc 99841 aaattcatta acctaagctt cagctcccat acgaataagc tgaagaaaca gcgataatat 99901 gtcacaaaat gcttattata gtgcctagag gctaagtgct tcctaaatgg tagcttatca 99961 ttatcatcat catcttgtta tgacatggaa gtctacggga taaccgacaa ggttttgtat 100021 attgaatata aaatcagttt cagttttggg gatcctcgat ttaggaagtg agtaacagcc 100081 acagaactgc caaggcttga aaaagcagca agcaaaccct tcaggaagaa aggaatctat 100141 acaggttttt catgagtatg catgtattct cctctgctag aagttacgat tgctaaagtg 100201 aaggaagttg gaaaagggat taagagtgaa atactatttc atgcaccaaa acgaactgtt 100261 cgtttctcta ttaccatgat ggggacgcca atgtcaggcc acagaaggtc ctctgatggc 100321 agcttgggag aattccacac caccacgacc ttgttcaggt aagggaggcc attcagcctc 100381 tctaaagagt tcataagcac ttcctcccgc tcataagtca acatcaccac cgtgaactgc 100441 tctcggggaa cattgcctcc aagcgctgcc tgaaattcct tgccagaacc cccagctcca 100501 ccaccaatag gccgaaagcc agtccctgag cccaagaatt tggcctctga gggcaacaca 100561 gggtcaaagg gagtgtgggg gaaaagatgg aaaggccctg gagcacagtt ccagctgcgg 100621 taaaagtcag tgacagtcag agtgaaattg cggaggtatc tgggtgaggc gtagggcggc 100681 tccgtctcca ctggccccag gtccaggtcc ccgttgtcag ccatgttggg gtcagttcca 100741 gccgccttgc ctgaacggtg ggggatctca gctgccgcct cttcccggat gggagcggct 100801 gggatctgga tgcgagtcct aatcatagcc agcacggtat taaaaatact gtcagcagtg 100861 gagaagtaag tctcccagag aaagcggcct tgccgcctca tagccaggag gtcactatcg 100921 gagaggcttc tgagcaggaa atgaacctcg gtaacacgag gctttggcac caccagggcc 100981 gcctcgttcc actgcagcat gtcctggtag ggaagctgga cctgctcccc cagcaccacc 101041 gggacggcac cgacttccag ggcttcgaag agccgtgttg cacacccaga ggaaataacc 101101 aagcgagggt ccccgggggt aatgatgagg gcgaaggtgg agagcttcag caattccaag 101161 cggtcctccc gctctccaca cagtgcccac tcagttggca ggctgggttt gggctggttt 101221 ttgcaggtga attccaccag gacctgatcc agcttgctgt cctgcaccgc cttcagggtg 101281 gcaatgatcc ggtcatcgta gtcggcggga gggtcgccct ccatttcctc ttcgaaggag 101341 cgggcctcct gaaggctaga cctcagagac tcaatcttct cgccctggaa ggtgaagaga 101401 tatttccgct tcaccggcac ctgtggtggg atttccatga agttgggctc agacatggca 101461 tggaccagcg gtgatacgac caagtcaaag ccaggtctgt actggacagt gtagaaggtg 101521 gactgggcca ccatggcacg gccagtactg acgttataga gaaggttctg tgtatctgac 101581 ttacgtgaca gattgatgat gacatggttg tgtccatccg tccgccagtg tggcagggaa 101641 tacaactgct tctccagctc agcaggccgc agcaccaccg gctcctgcat ctctcccact 101701 agtatcacgt aaaggcaggc gatgtctgca ttttctgtaa cataaacgtt agctcgtgct 101761 gtcgcctgaa aagcctgctt gaccaaggga tccaggtagc tgccaaagac aaactggtca 101821 ctgtcataga cgtagaccgg gaagccagag gtgagagggc aacgagaata atcaaagcag 101881 ttgtgtagcc ggcagccccg agtggccttc gggggaggga ggccggcatc gtccttctct 101941 gggagcagtc ggatgggcag ggacagcttg ggctggttct gggccatgag ctccttgtag 102001 gaatgctcgg tctggctgat gacattcttg agctggagca ggtcctgctt ggcgttctca 102061 atgctcttct tacaggcttc gatcttcaga ttcagcttgg cgatctcgct gttcagctct 102121 tggcgcttgg cctccagctg caggagctct tcactcaccg actcccggat gcggcacaga 102181 tccagcacgt gcttcacctc gcacagctcg ttccccaccc ggggaccaaa aatccgcttg 102241 cctgcctcat cagcctcatc cagagtggtg aggtaatagt gggcgatgag cgggaagaag 102301 accaggatga caaagagcgt gaagctgagc cacgtgaggc ggatgcggtt ggaccagcgc 102361 agcatgcagg tctgacctcc gttccccgcg cccccattcc gcagcatggt atagcctgtc 102421 atgagtcctc tgcagcctgc cccccagatc acgtcgggtc actcgccata accatgggtt 102481 gctattccac aaaacgatct ctgtttcact gacacgtttc cagaagagtt agtgtgctcc 102541 ccagacaagg caccaaataa aatgaacatt tcattttcct cagctgcagc tgaaatggtc 102601 tctgacccta ttccagcaga ttttaagttc tggctgttga ccaaagaaca tgtccttaat 102661 ctttatcaaa cgataaaagg tgccacattc ttgctgagat gaaagggagg aggtacctga 102721 tgatgaaacc caggaaaaac accctggaat cagacagact ttttcaaatg ccatagctct 102781 tgtttcttgg ttttgctgac caacaaatat gcatagtgtc tattcacagt tatacagtaa 102841 taggttagaa cagaaataaa tgccagcttc ttatgatgcc tttgccaaca atcaggcctg 102901 caaaagaaag agaaccatgt cagtcttgaa gaagttatgt tcaacacccc tgccaccata 102961 catttctaga aaatgcttaa atcttagatg gaacaatggc tggaacactg gctgtgtctc 103021 aaagaacatt ataatgacaa tgcagagatg ttgtttgctg tttggtatag gtcttttact 103081 tggggtaata aatggataag tgccccaaaa agctgcagtt tacaacccct ccccacttct 103141 tatttaactg gatctagagc ggcattatag ccctgtaaca cgatgaccaa ctaaattcat 103201 gggacaaaga tgtccatggt cttttcttat cctgttccac acctgggcat catctttaga 103261 tgaacagaaa taccttccta gccaacctgg gtagtttatg tttattccta acctataagt 103321 cttctttgga aatactttac aaaaaaagac tctgaaaagc tcaatttgtt aaatgtagag 103381 ttgaaagggt tgaagagaac tcttttgatc tttatccagt agtagatgca gtaatcctga 103441 gacaaaatgt atttcccagt ttgcttctca tttatcttcc attagcagac atcatgtgct 103501 ctttcttaaa atataaatag taacttgctc ttttagaaag aacactatac ttagaaatga 103561 gaggcattcg ttctccttct ttgctgacag atttgctatc agaccttggt ttcctaatct 103621 tctaaaatgg agataggtgc acggagacgg caatgcacca cgttgctgtg atacaaagtg 103681 cagtggatgg gaggacgctt gtagcgactc agtccctcag caacactccc agccctgctc 103741 tctcaccaag cttcactgcc actggctgca gaggcttgcc acttgctttc cctcaaattc 103801 aacacagcta gaaacaaatc ataatattct atgccaggga atattcccgg tttctttttt 103861 taattcttcc aaaaaatatt caccatactc ttaacagggc taagacatgc taagtataac 103921 tgtgggagaa tctagggtgt ataatccttg acctcatgga acttccctta ccctaagaga 103981 taagatataa acaaacaagg gtacacgtag cataaaatga gtaggacttc acagaggcac 104041 aaccactttc tctagcttct acctctgtca aagatgttta actattaaag gtgtaatagt 104101 cttctctcct ttttaccatt tttataaaca taattttaat tatgtttcag aataaagatt 104161 cctttaaaca ttctaacatt ttttcaagta acatttgatt tcatcgtaac attggacatt 104221 aaattttaat ctgtcaataa attataataa caatttctaa agacaagggg atattaggct 104281 gggcatggtg gctcacacct gtaatcccag cactttgaga ggccgaggcg agcggatctc 104341 ctgaggtcag gagtttgaga ccagcctggc caacatggca aaaccccatc tctactaaaa 104401 atacaaaatt agctgggtgt ggtggcacgc aactgtaatc ccagctactc aggaggctga 104461 ggcaggagaa tcgcctgaac ccgggaggtg gaggttgcag tgagccgaga tcgcaccatt 104521 gcactccagc ccaggcaaca agagtgaaat accatctcaa aaaaaaaaaa aaaaaaaaga 104581 aagaaagaaa agaggatatt agaatcagct aacagcaaag aatgagagga gggaaatgat 104641 ggtgtgagtc actttgtcca ttacaaagaa cacctgacaa gacatcagac ctaaagttga 104701 tgataatatt actaaaaggt ttaagtattt ggataatcta aacttggata attagcagct 104761 gaccaaatac tcaaatttac attatccttg tgattcaaat gtttaaatct cttgctttca 104821 aaagaatctt ctttgcactt atgaccaaat tgtaacaaag aaacaacaga atggaagaaa 104881 aagaaaagaa ggcgtaatca cagcaatcca gctgactcat tccttcctca ccatgtgttt 104941 caggaccctt ccttcctctg acttgtgtag cattacacct cagcacacga cttcttgaaa 105001 gagtgaacct ccagggcttg ctctcctgat ttaaaaaaaa aaacaaaaaa caaaaataga 105061 acagtgacat actattagaa aaatactcaa tactgaaagt gctattaaag aacctattta 105121 ctgtccccta tgaaaagatt tctcttatgt acatgaggtc accaaataat ttactgtcca 105181 aacagagact ctttgaagtg gaaagggaga ctattaataa atacactggg acaagaggta 105241 tacacgggga ctctggcagg caaaccgtcc agacagacgt tacctattta tgtgctctaa 105301 gggggaataa aaccaaacac taaaatatgg aaaagtcctt acttgttgaa agtatatact 105361 gagatattta cagatgaaat gatatacctg gaatttgctt caaaataaac aggatgaggg 105421 tggcggggaa tgtttgcggg tagaaatgaa cccaagatcg gccgtgagct gactgctgtt 105481 gacactgaat gatgggtacc catgggggct tattatatca ggctctcttt tgtctaagtt 105541 tgaaattttt cataccaaaa attctaaaag atactacata cagagtctaa acagaggtta 105601 ttaaaaagtc atttggagac tgactatagt tagtctaata tttctagtgc taccaactta 105661 catataagca gagctgaggg cagaaacaaa tgttctcaca gaaaccaata attcaacaat 105721 gattcaaaag aatgcatccc cactaaattc ccatctcttt tactggagcc aggcaaaagc 105781 atcatccatg tccaatagca tgagcattcc ttcctaaaca gctaattaaa ttatttcaag 105841 cacaaaagaa aaaggatacc ctcagaatct cttctgtcat tctctggaaa atgacaataa 105901 acatatcagc ctctagaaat aaatgtcact gaaacaatga taaggagccc ttcagatttt 105961 ttttattcca tatacaatgt acatgtctaa ttcattctca gtcacctgcc acagcatttc 106021 atgcttaact tgccagctgg cctccattcc tgcccctaca atgcactcca tacacagcaa 106081 ccaggaccat cttgaaacat gagtcaggcc acgcctcccc tctcaatatt ttcaaggctg 106141 cccactgtac tgccgggctc cccagaccca tctcagttac catcgctctt ccccttgctc 106201 tctcagcttc agccacactg gcctcctctt acctcctcga ctgtgccaag cttctcgctc 106261 tcaaaacttt atgcctgttt tgtctgaaat gttcttcccc aggcttctgc ctggcagact 106321 ctttctcatc cttcaggcct caactttcct ggcattacca tttaaagttg cctttcttac 106381 cccccgatgc tctctggcac cgacccactg atttacttcc taatatcttg taatttatta 106441 attccctccc ttccccacca aagcctaatc ctcgagggga ggaacccttt gtgtctggat 106501 cactgctgcg tggccagcac ccagcccagt gtccagcaca ctgtaaacac tctataaata 106561 tttgttaaat aaatgaatcc tatcactgat cacttcctca tcctacaaac tctcaattct 106621 cccctggact tccatgaagc tgtgtttttt tagtgttcca tctacttccc tgactcatcc 106681 tccctttctg ctttgctggg acccagtcct cctacctcat actgaaagtg ttccccatgg 106741 ctctcaacat aatgttaatg aatccattaa caaataatat attgtattga atacattata 106801 aactacagag agagaacttc agagccagga ggcagctgga tggccatatg gacctgcagc 106861 tagactaccc ggctcaggat gcagctcagc cttgagaatt tgggaatgtt acataatctc 106921 cctgagctca tttcctcctt tgtaaagtga gtctgaaaat ctctacctac cgccagggtt 106981 attgcacaaa ttaagtaaga tattatagat ggaagaaaaa aaaatgggaa catggctaaa 107041 acagtgctaa gaggaaaatt tatgcataaa ttcttgcatt gaagaaaagt ctcaaatcaa 107101 taacctatgc tcctccttca agaacccaga aaaaaaacaa aacaaaccta aagagcagaa 107161 atcaacgaaa tcgaaaacag aaaagcagaa gagaaaaatc aagaaaacaa agaggtttgt 107221 cactggtttg aaaaacctac aagaatgaca aagaaaaaag ggaaaagaca caaatttcca 107281 atagcaggaa tgaaacaggg gctatcacca cagtccctgc aggctacaaa caactctata 107341 cacttcagtg aaatagacca actccttgga aaacacaaag taccacaact catccaatag 107401 ggaataatct gaattagttt tataactatt aagtaaactg acttcatact tttgaaaatc 107461 ccaaaaaaga aatctccagc cccagatggt tcactgaaga attctactga acatttaaag 107521 aaaaataaac acctactcta cactgtctct tccagaggaa ggaacacttc ccagttcatt 107581 ttataaacct agcattgccc tgactaaagc cagacaaaga cagtaccaaa ataaagaata 107641 ccacaagcca ggcgctgcgg cttatgcctg taatcacacc actccagaag gctgagggga 107701 gaggatgact tgagaccagc cctggcaaca cagtgagacc ccatctctac caaaaaaaaa 107761 aaaatttaaa ttagccaggc atggtcccag ctactagagg ctgaggtggg aggtgagatc 107821 acacctgggt gacagagcaa gaccttgcct caaaaaaaaa aaaaaaaaag aaagaaagaa 107881 aactacaaaa aaaaaatctc tcatgaatat agacataaaa atacttaaca caatattagg 107941 gtaatcctat ccagaagcat aaaaattctc cccacttaca ccttcatttc tcctatcaaa 108001 gtgtcttgcg ttctcaccca tgctgtgcac ctcatattaa gtcagtctgc attttacact 108061 tcctgcccat gtcctctcct gcttctcttt ctctgacccc ttttcaccac tccccaaatg 108121 tagctgttcc tgcaggcttg tcctcaacct cttttctgcc ttcacctccc agagcttgcc 108181 aatgagcttc gcttagcccc ctgattggct gactctcaaa tttacttttc ccatcttcac 108241 ctccctcctg ataatccttt ttccagtggt cagcaacaca gacatctaca cctcagacgt 108301 tcaatggcag caagcacatc ttctatgact agaacaggat catgacagtg tcttctccca 108361 ggggaaaaaa aattaaaata gttgtataca gagatttatc attcagattg tggccagcat 108421 tctacctttt actcttttcc ctaatcagac atttttgctg acaaatgcaa agcagaagtc 108481 gccatctgct agctcctcat tggagggctg aaccaagcag tagccctgga aagctgtaat 108541 gtaatcactc cattcgagag tctgagcggt gggctgagaa gtcggggctc agagttccaa 108601 tccagaactg tgcacgtgct ggtgttcccc ttcaccttct cgcccctcca cctccacgta 108661 ccagggccct cctcctctca catcccttat cacaatagca aactgcgatt atctgcagga 108721 acattactca cggccttgct ttcaagagtt tgttgatata acaaccatcc tacagactcg 108781 acttttctcc ttgtaaaact aaaacactga tattgaaact tcccattgcg gatctgggat 108841 atgtctctat ttaggtcttc ttttgcatct tttaataaaa ctgtaaattt ttttatatgc 108901 agaaaattat cagactactc caaaagaaag aaaaaaagtt aaactacact aaaacactca 108961 cccggagaga caggagagac aggaggcgcg acagggaaga agggagtcac tgctccatct 109021 ggctgttatg ccttccacgt ggaaggtatg aagggagaac agagtgagaa acagagagag 109081 aggctagacg ctttccagat gttcccaatg aaaccttcaa cggcctctaa tatcttaaat 109141 aattatgata atagctaaca ggtattgaat gcttactgta tgccgggtta aacctattac 109201 catatattcc tcaacacact cacttaatcc tcacagcaat cccgtgaagt gggtttactg 109261 ttattcctgt tctgtacacg aggaaaccaa agcacagagg ctaatgagcc atgggtcacc 109321 catgttatgt ggtaaaactt gaattcaaac caaagcaagc tggctgtaaa gctcatacct 109381 ttaatgcctt aattatgtta cactgtctat attaattcaa gtaagagtgc gagcaggcac 109441 acacacacat gcctatcatg tgtatcattt ttacattctc catatcactg ctactccgct 109501 gtaaccatga ataataatta caattgacac acataatatt cctctaaaac ccaaaaccaa 109561 cactatattc aaagtattta cctgctaaag agaatagcag actcagaaca aaagatgttt 109621 gccactgtgc ctatggccca cctgtatatc tgtgcttgta gtactatttt ctctttttca 109681 tttaggtcaa aataggccca tcaagtggca gaactccatg acaacccagg tgcgggttct 109741 acagagctgt ctgcatgctg ctgtcattgc tgccatcacc aggagccctt ccaattaggt 109801 aaagagagtt ctccacagga aaccatttca gtgaggtcac tgaaagcagt atttcagagg 109861 attgttttgt ttttaagtac taacaaccca aaaaaacatc atttcctgat ttcctaacta 109921 caggcatgac aaacagcctg tcaaggcaag acagtaccta gttcgtgaag tcaggaagta 109981 tgttaataag cactaaaaca catttcccaa cactatcact gatttgtctt ctgtttaaaa 110041 aaaaaaaaaa aaaaaaaagg cacttcccag ggaaactaat tgtagataaa gagtaagctc 110101 taagaactac atgtagacac ttcccaagtt acaggagacc aaggccctat gtttttcaca 110161 atccaacgac cacagtggtt tcttactgtg taacctagcc tggatgaaaa aagggaaaca 110221 gaacatcctc agcaattaaa aagcaaaacg aagtgtgaaa aactggttgt gccttgacct 110281 actgactgaa gagtgaagat tatgatgcaa ccagagaacc agagtttgag ccgcccttat 110341 tacagggctg tttgaaaggg aaaacaattt attctttggg cttaagagta ggtttctaaa 110401 tcccaaggtg ttccacaaat gccactagca gacaaatcac aaaatacaaa aggaactcat 110461 caataagtgg tgagcattcc ttccgctgct gaatatatag atattaacaa ggaaaatgag 110521 gctattgatt actccaagtt atctgtttac ttggcaacaa acctgggccc agaagtctca 110581 actcccagga taagtcctca atttgaaaat tatgccattg ccttatctgc ttcccttccc 110641 accagttcgc taatgtccca caaatccaaa tcgtattgtt ttaccagtca gtttaattat 110701 gtgtaaaaat cagattcacc acttaagaat tttttcaaat aacaaaccgg gaccgtgcta 110761 cattaactaa atcagaattc ctaggtgtgg gggaaaactc ctgcagtttg acaaagttcc 110821 caggtgattt taatgcagag cacacaaccc taactccaaa actattggtc taatgaagaa 110881 ttgatagtaa tggagattca gattgatggc agctcaatca acatagacag ctaaggaaga 110941 caaacagcac tatcccttag ctaacgcaga aagtccgcac ttcaatgcac cacataccct 111001 tggaagatgg ggaggagagg gctttttcat aattgctact gatttatatt tacagtgtgc 111061 taggcacagt actctagata acacacttca cacatacatt tcatcagcca catgggagta 111121 ctgtcatttc cacttcaccg atgaagcagt ggtgtatcac cgaggatagg aaacttgttc 111181 aaggcaatac agcaaccaag ttacaaatcc aggtccgtat gacctacagc cctgtatact 111241 gcttcttgct tatctaccat ttgtttactt agaggattca ttttgtctta attcatttta 111301 caatcattat gtattacttt tgtaattaaa aatattacct tgttgcaatc tttttaaaga 111361 acacctcatt acatttttca ataaataatg tgacacatct atttgggaaa aaaaataaag 111421 tcagattact gcatgacaaa ccaaatccaa aaataagttc caggtggatt caagagttaa 111481 ttataataaa tgaaccgtaa caagaaaagg aaaatataca tgtaatttca tctcaagtac 111541 agccactttt ccaggaatcc aagcaaaagt aaaatccaga aatgttcaac aggtttgact 111601 atataagaat caaatgattc tatgtattca gaaggaaaaa aaaaaagctt aaatttgatt 111661 aaaaatgggg aagcctgctc aatatgacag aattaaaaga aagcaatcaa cagtggtcaa 111721 cggacataaa taagaagtta cacaaaaaaa gggttcaagt gataaacatg tttatatgtt 111781 taaccttcct agcgatcaaa gaaatacaca tttcaaacaa gatactgtga tattttccac 111841 taataaatca tcaaagtatt gtaaaattat aatatctggt gctaagcagg atccagggta 111901 aacattccca cacttggctg ctgggattgc aaattggcac acctttctgg agcacaattt 111961 ggcagtaata aaaacactga aactgtgtct atcctctttc cctgtaattc tatccgagaa 112021 attattctta aagaatcatg agtgagaaaa aagatttaac ttccaaaatg ctcatactaa 112081 aacattaaaa tagtgattaa agtacagtac aactctgaac tatgctggct gctacaatgt 112141 ggcaggtact cttgtgttag tagaaaggta aactgaaaag taatttgcca tttgtaagaa 112201 aaaaaccttc aaaattttct tatctctgat tcagcaattt cactttctag gaatatattt 112261 taggtgagca agatttgtat gtaaagatgc aatcacctca ttattcttta tcatctgtat 112321 aaaatatata aattaaatgt ccaagactag gagcaaggtt aaacaaagtg tgactgtcac 112381 tgatatgact atgataccat taggaagctt ttcaatggtt ttaaataaaa tgaaaacatg 112441 ttcacaatgt tagctggaaa aatacagatt caaagccata tatgcagtat aacatgttta 112501 aaatgcatat gtatatattt ctgaatagaa aaacaaacag aagcaaaaac accaacagag 112561 gcacttctag attgtgaaat tataggtgat ttctgcattc ttcctatctt tctcactctc 112621 cctcctaaaa tgagatgcgt cattttcata agggctgggt agcgatgtag aaacaaggtt 112681 ttcaaataag gtcttcagat ggattttgct aacttattct cagaacagtc aacttagtat 112741 gcaagtgcct agaatataaa ctaatctaac ggttttcgct tctcaaacat acatgatttt 112801 tattttatgc tgtggaggca tacaattgat atcgttagtg ccctgggcct ccctgaatga 112861 gatagagaaa gtgaagcaag tttgctaagc catacataaa tcaggttttt cctttttttt 112921 tttttttaag agacagggtc ttactataat gttgctcaag ctggtcttga actcctggac 112981 tcaaggtgat cctctcacct ccgcctccca aagtgctggg attacaggtg tgagccaccg 113041 tgcccagcct taaatcagct tatgactcgg gcattctcct tcaccctttg tgggtgaatt 113101 cagcttgaga cgctttacca tcccatcatc attaccatat ttctgattca tcaggtcccc 113161 taacttccca attcctcgtt cttgactcat aagctccttg tcctttgtta actcgtaaat 113221 taaggggtta gaccggatga cctcaaagat ccttttagac tctaggccct cactgacaat 113281 tgccttgctc ccaggaagca caaaaacatg ttttgctgtg gggaaaattt caccacccta 113341 cctactcaag gcagcaaggc cattcccaag acctccttct cgtttcacct ccaagatttc 113401 aggcataagg ctttaaggcc ccccttaatt ttccacagac tccattaata atttgggatc 113461 ccatcaacta ttttctccat tcgaagccac tgtgctttta tattttacag ctctacttca 113521 gaaacaaagg aagccggatg cggcggctca cgcctatatc ccagcacttt gggaggctga 113581 ggtgggtgga agttcaagac cagcctggcc aacttggtga aacccagtct ctactgaaaa 113641 tacaaaatta gccgggtgtg gtggcacaca cctgtaatgc cagctacttg ggaggttgag 113701 gcaggagaat tacttgaacc tgggaggcgg aagtttgcag tcacctgaga tcatgccatt 113761 gcactctagc ctgggcgaaa agagcgagac gccgtctcaa tagaaaaatt gaaaaaaaaa 113821 agaaaaagaa aagaagccat gctggaaaga gtaggtcaaa attgctgaaa aaacatttaa 113881 aagcaagttg gaaaagagac tttaaaggga aaatggtcaa aaaagcaaac atccaggacg 113941 ttaaccatta atattattga ccagtccaaa aggtattgga cacagccaaa tgaaggaata 114001 taccaaagga aaggcatgtg tgtgaggggt ggcactctaa ggcaggcacc cgcaagcggc 114061 agctgcctgc ttttgtagat aaagtttcac tggaatacag ctttgctcat tcagttatgg 114121 attccgtttg tatggctgcg tatagtaggc attcttatat attatgtata tgatgctttc 114181 actctccaac agattctaca gttcatcttc ctatggctcc acttctagac ttttgatggg 114241 tcatttgggt gcatgtgagt agtatcctac actgcacttt atggcctaac tgtgggagag 114301 ggaagtatgt tagtaatgag tctccccaat cctcttctat tttcaagatc acaggttttt 114361 taaatcctgc ttctcttctc cctagtaaca tcacccaaga ggtctgaatg actgaaaatt 114421 taaaaggact gtgcaactgg ttcaggcaag aaaagaaaag atgaagctta caggtgagcc 114481 cacctctgtc cctcttgagc tcacaaactc tctctgcctg ggctatgcta tttccatgaa 114541 acctccaaac gtgaaaaatc ctttcttccc tctcagtcag ctgccctatc attgaaagtc 114601 ttcgaaatga tagttgccga aatgaagggg taacaaaaat aaaatagaaa tatgttaata 114661 gaagttttct gagctaaact taataaccag cgaatggagt aggcagtttt aggacgttat 114721 gaaacgtcct ggtttcatat tcctcgcctc actctagagt aacatacaaa ggcgctcgaa 114781 cctttaccaa gagtaggtct gatgggactt catttttctc ctaacacctg agtctacatc 114841 agggaatccc tcccaccctc ctccagaaga ccaccagtct caactgagac aaggactccg 114901 catcactcct gcagcccctc atcacccata accctccaat ccacagctgg cctagggcct 114961 gcggaaaaga acaggtctct ctctagtctt ctgctggctt caaaccaccc tctggacttg 115021 ccctctctcc tagaaataca tttcccatgc tcggcctggc ccctgactta cttctctcca 115081 aactgttccc ttaaaatctt tttactccga ggtcaaaact cttgaggcct aatcactgaa 115141 agatcccaac tacacaccaa gtattaacag ggttttcccc cactagaaaa gcgagaagtg 115201 gagggataca gacatacgcc tgtcaatcat tttttaggta ggtatgcccc tcacatctct 115261 ggacattaag cacgtttccg gaagtctgaa gagccacaat tctgactctt ccagaaagca 115321 cttaggctcg attctctctt gctcgtgagt tcttatgatt cctccggctc cccacaagca 115381 aacgaatggg aaattcccac aggataaggt atttttaaca catcaaataa cagtttaaga 115441 aaacggtttt tctttcatca caaaatattt caaagtccct ctgctaaata gcaagtcgct 115501 gagaaggctt cgcttcgctc cagactctgt gccccgcagt tactatccca gcacacaggt 115561 cacagcgata gtcactgtat cagaatgcag gactcactgc cgaacaaaat acagaaaact 115621 gcagagtctg catggctgca acacacaaag cctttaaaaa caaaagaaag cacggggagc 115681 tctgccagta aaaatgaagc tacctaaatt ggacaaagaa taggacaaag tgacaagaaa 115741 tgctaaagac gactcttaag taaatcacat atgggggaaa taatggacat gttgtggtgt 115801 tctgcgcttc ctcctccacc aaaggagtcg aaccaagagg acttgatgaa gcttttagag 115861 tttttaaaaa gggaagaaaa atccaggttg cggggaaggg cgggggtggg gtggtgcggg 115921 tggcggggga ggggcaaaat ccacaaaatt taagtcttct gagagccaaa cagattttat 115981 taataaaagg agccgaagct ctcgctcaat gtggggaaga gaaagcagca cccatcagca 116041 gccgggcagc cctggctcgc ctccgagggg ctcggaatag gtgctgtccc cgtcgctggg 116101 ctcggagctc cgccgcgcac acacgccccg cgcacccctg tccggtccag cccgtgcagc 116161 gcgaggccgg ctctagggga gctgggcctg ggagccaggg tcctgcagca cctggaccct 116221 cggacaggaa gcggctcctc tgactgtggc tcctgaaagg aggcgagccc ggcaaaaaga 116281 gccagcgggg agggcagcag gcgactgcgt gtagaagcgg ggggcagatg tgggaaggtg 116341 tgctcgggaa ggggtggggg tagtccggag ctgcgcctcc gccgacagaa gatgctccgg 116401 gccagcagcc agagaaacgc cgcgggtcac agagggtgga gggcttcagg gagcagagga 116461 agcccaacag ctgcagccga gcgtccaaaa aaaggtggag gcgggtcccg agcagcccaa 116521 actgggacga gagagggcgt gtgggggcgg ggagggggtg ccccagccca gggacccgtt 116581 agccctcccg gctgccggcc gagggcctgg cggcctctcc ccgggccccc gagccaccgg 116641 gcaggcctac tccgctcgga ggctgcatgc ctcccgccgc cgggcagcag cagcctcccc 116701 ggggcacggc ggacccggtc cctcccgccg cgtccccagc gctcggggcc agccccggca 116761 ccctcccatg agcccttccg ggcgcggccc ccgctcctcg ggctcacgcg cggccagcag 116821 tcctaccggc ttccagctca gggacccgcc gccgccgccg ccgccgcctg cgcgaaagtc 116881 ggcgtcccag aagccgttct ggctgccggc cgcccgcctt ccaggccgcg cctgatccgc 116941 cgctccccct gccggccggc agccatttcc gacaggcgac tgcggaactt gccgaagggc 117001 gccgcgccgg aaatggccga agccggcgtt cgcgagcggg ggcgcggacg cgggcgcgcg 117061 ctcgccactt tcccgaccgc gtccgaagac cgccgaggcc tcccgcagct ccgcggtgac 117121 acccgggtca ggggcgcggg gccgggcgcc ggggattgtg ggaggcgcgg gggggcgcgc 117181 cggccgcctt cggagccccc caactcgcgt cctgcaaagg ccgccgggcc ctgtcgagaa 117241 gacccgaccg cagatggcgg ggaggatgct cccggcggcg tgggaaccgg gtctgactcc 117301 cgagccaccg ccgcttccgc aggggcgccg gccccgggaa agtcaagtca taaatccctg 117361 aatctaaaac tccattctca gagaaaaggc ctccaaggac gggcgccgtg cgcggcaact 117421 gcctgcagtt ttgaagccct ttgactattt cataacaaag acaaggccgg gcggcttgga 117481 cgcttaggaa aatcctgggg ctttgcaaaa acaacaggtt aatctagtcg tgtgggatga 117541 tcaccaaaac aagacaggaa agaagaacac cgtgtcaatg ctgaaaagcc agcccctgtg 117601 agccccaaag tgcacgtttt ccacagtccc aaggaacacg tgactgtgtg tttccacact 117661 tgagaagtca ggataagacc ccttggataa tggaacaggg gatgggggtg ggagcaagca 117721 ccctacctgg tcacctgctt aacttagaaa ccagctttta aaacctgtaa ctgcagtatg 117781 agctacgatc aaatttgtct taacgtattt tttttaatgt ttttaatacc cagaacacag 117841 ggcttctact ccagggtttc ctcgccaggg aaccccaaac acacaggacc tggagaagcc 117901 gggtagagct ggctcctggc cctgcgcttg ggtggtcggc tgccttaaga agaactgcac 117961 cccagagaca ggctcgcagc tgccgacctt atccactcgc cctttctgct ggagcccagg 118021 cccagtgctc cagcaaggag gctgagaaaa tgctgaagac tgatgcccac gggggacagc 118081 ttgggctaag gataacgttt gcaaaacaaa cctttaaaaa cccatagcaa cctgtttcct 118141 agagcacact cttcatctct ccacccccaa actagtcccg actcggatcc tccttttcct 118201 atcctctttc tcttgctctc ccgtctccta ttcacttttc ctctcctttc ctcttgatta 118261 ttataaacaa atgctttcca agtcttaccg ccatcatatg tgtacatatg caacccttac 118321 tgttaccaat ttgttgaagt caagacagga ggaggcaaag tttaaaaatc agaagcattg 118381 caggaaatga aaatggagtg agtgttgcct gggtatcata attttttttt ttttttaaca 118441 gttcctctac ttggctctcc tccaaaggta cgcggccaca gcaggcaggg gcttggcagt 118501 gtgggaggag acaccacaga agacagggaa gaactaccag gccttggttc atctccacac 118561 tggcgagaga ggacgtgcag ttacctgcta cctgttcgac tcagtctttt acgttggagt 118621 aacaacacat tgctgccctt aactttgact tacttgcttt taaagatgat gaagctggcc 118681 aggcgccgtg actcatacct ataatcccag cattttggga ggcccaggca ggtggatcac 118741 gaggtcagca gttcaagacc agcctggcca acatggtgaa accctgtctc taccaaaaat 118801 acaaaaatta gctgggcgtg gtggcgcgtg cctataatcc cagctactca ggaggctgag 118861 gcaggagaat cacttgaacc cgggaggcag aggttgcagg gagccgagat cgcaccactg 118921 cactccagcc tgggcaatag agcaagtctc catctaggga acaacaacaa caaaaagatt 118981 atgaagcctt aggaagaaca gggatattca cctgctgctg agcccccctc cgctttgatc 119041 ttgtgagtct gcactctcct gctcccctgt ctgtctcctc tagctcctgt tccttctcct 119101 accttgtgtt ctctgccaat gatatgactg gggctacttt cttttttcct tctcacactc 119161 tcttcttgct aatttcaacc aatttccctg catcatctcc acctgcaagc tggtccttta 119221 cagcagagct tggggccctg ctgcccagta gcactctgga caccctcaca tcatcatcat 119281 catcctattt ttatttattt tttggaaaca gggtcttgct ctgtcgccca cactggagtg 119341 tagtagtgca gtagtgcgat cacggctcac tgcagccccg atgtccctgg gctcagatgt 119401 tcctcccgcc tcagcctctg gaataactgg gaccatagat cccttccact gtgcctaatt 119461 tttgtttttt gtttttgttt ttgttttgag acggagtctc actcttcttg cccaggctgg 119521 agtgcagtgg catgatctcg actctctgca aactctgcct cccgggttca agtgatctcc 119581 tgccccaccc tcccgagtaa ctgggattac aggcacgcac tactgtgccc agctaatttt 119641 tgtattttta gtagagacag ggtttcacca tgttggccag gctggtctca aactcctgac 119701 ctcaagtgat ccgcccacct cggcctccca aagtgctggg attacaggca tgagccacca 119761 cgcccggcct aatttttgtt ttgttttgtt tttttgtaga gacggggttg caaccatgtt 119821 gaccaggctg gtctcaaatt cctgagctta agcaatcagc ctgtcttggc ctcccaaagt 119881 gctaggatac aggcgtgagc caccacgcgg ggccttcatc accctattaa tatatacttt 119941 ctgatactta attgccaggc aataagctaa acccttttat tcactgtctc actttaatcc 120001 ttacagggaa gtatcggctg ccagataggg agctgagact tcaagaagct aaataaggtg 120061 tccaacacca cagagcatgg agcaaaggac acgggactgc aaatcttcct aactcgtgtg 120121 ctcatctggc tatctcacca gggccttaaa tttaatatat cccaaactga actcatcttt 120181 accccttccc actttgcact cctcaaatgt ccttgtttaa aatagttacc tttatctttc 120241 ctaacccaga aactcaaaac ctggcatcat ctttgacttc tctctttacc ttcacattca 120301 acagtttcca agacttaaag gctttatttg taggatctct accactgatc ctctacagtt 120361 tcacacctac atcccattct cgttcccaaa tccccataac tcctctcctg gcccatccct 120421 taacactgaa atcctggctt ggaaaatatg gtcacattca cagcagctgt ccccaagaag 120481 gaagccaagg caacagtatg cacaatgaag tgagtcttca ctgatctctc catattttga 120541 cattttacag cacttattat ctctactttg tattttgaaa ctgaatccaa aatagttttg 120601 catttgttgt ttaacagtca tgtatgtagt tttttttttt tttttctttt tttttggaga 120661 cagagtctgg ctctgtcacc caggctggag tgcagtggcg tgattttggc tcactgcaac 120721 ctccgccttc tgggttcaag cagttctcgt gcctccctga gcagctggga atacaagcat 120781 acaccaccat gcccagctaa tttattttta gtagagatgg gatttcacca tgttgcccag 120841 gctgatcttg aactcctgag gtcaggcaat ctgcccacct cagcctccca aagtgctggg 120901 attacaggca tcagccacca cacccagccc ctccatgtgt gtagatattt atccacatcc 120961 aaaaattagg aaaagcagga cgcattgaac ctttggtacc cagcagcagg agcctgtggg 121021 tcttctgtct ggagcacaat cacaaggacc gagcatcagc agcatccact gtcctttcag 121081 ctccaaattt taaactcccg taagagagac attattggcc cagcttgggt cgtgtgtcca 121141 cccctttaat caatcagctt tggccaagca gcaggtcatc ctggtccaaa catcacagtt 121201 gggggcctca cttgtaaata gagcttgttc ccaaaaaaga gggaggcaca caccattcat 121261 ttgtttattc attcattcaa tcagcaaata gttgagcatc tatagaaata tatttaaggt 121321 tctattatgt acacaaaatg tataaaacat ggccctgccc tcacaccatg aaagttacca 121381 cataaaaaga agtcaccaga taaaaaaagc ataacagtat tcataagtac tcatgagtga 121441 ccatcaattc agttacacat gatggaagat aattcattat acctagtata agccagtgac 121501 ggtaaaaata gttagcagca atgtgtacat gatcaacaaa agctcacagc agcaccattt 121561 acacaaaaac agaaaagtac ccagatgtcc atcagaggta gaccagataa aatataaaat 121621 ataccaccac acaatggcta acacctgtaa tcccagcact ttgggaggct gaggccggca 121681 gatcacttga ggtcaggagt ttgagaccag cctgatcaac atggtgaaac cctgtctcta 121741 ctaaaaatac aaaaattagc cagttgtcat ggcatgtgcc tgtaatccca gctactcagg 121801 aggccgaggc aagagaatcg cttgaacctg ggaggccaag gttgcagtga gccgagatca 121861 caccactgca ctccagcctg ggtaaaaaag cgagattcca tctcgaaaaa aaaaaagtgt 121921 atatgtatag tgtatgcatg cacagaatac tttacagcaa taagaatgag tgttctgcaa 121981 atatacacaa tattgctgac tctcccaatg ttaaacaaaa gcatccagac acacaacaat 122041 gtgtacagta tattattcca ttgatagaaa gcttaaaaac aggcaaaatt aattcaccct 122101 tatggagtct taagtaaggg gaacaaaagg ggccatctgg gcagtgataa tgctgtttct 122161 tgagctgggt gctgggttca caggtgtgtt cagtttgtca cattcatcaa gcttacactt 122221 ctcatacatc ttcttttcta tatgtatgtc atccttcaat aaaaagtttt taaaaaataa 122281 ataattgggc ttgtgtggtg ggctcacacc tgtaatccta gcactttggg aggctgatgt 122341 gggagaagca cttgagtcca ggagtttgac cagcctgggc aacacaggaa gaccctgtct 122401 ccacaaaaaa tttttaaaag cctggcatgg tggcacactt aggtgggtaa ggtgggagga 122461 tcgcttgagc caggaggttg aggctgcagt gagccgtgat cgcaccactg cactccagcc 122521 tgagtgacaa agtgagacca tgtcttaaaa aaataaaaat aaataattgg cactcaaagt 122581 aagacacctt taatctccct tgaacatcag caccatgatt atcctggagt tgccaattat 122641 tcccacactc cccacctcct ccccatcacc accaccatta tgcccccttc ttagacacat 122701 aagacactgg agcctttgga aggagccact atatttaccg catgacctcc ttccctctgg 122761 tcccagccta ctggacttct tacctggaat tgtgggaaca ggtcactgta actaagtcac 122821 gtgacagagt gcttgatcta ttaatttaca catatttgca agaaagaatt tctgggcatg 122881 tgcacagtga taagctcaga aagctggtct gcagaaaaca gaagcaaata gagtcagcat 122941 agagagggaa acaaacaaac ccaccagaga tggagaagcc tcagaggctg ttgacattga 123001 cctgtggtac ccacatgtcc caggtgacac tgggtgtcca cgtgattgct tatgtagcct 123061 tactatttaa aaaatcctca taatcccagc actttaggag gccgaggcgg gtgtatcaca 123121 aggtcaggag ttcaagacca gcctgaccaa catggtgaaa ccccatctct actaaaaata 123181 caaaaattag ccaggcatgg tggtgggtgc ctgtaatccc agctactcgg gaggctgagg 123241 cagagaatca cttgaaccca ggaggcagag gttgcagtga gccaagatgc cgccactgca 123301 ctgtagcctg agtgacaaga gcaaaactcc gcctcaaaaa aaaaaaaaaa aatcctcatt 123361 tacttaaact aacatgaata cgtttctgtc tccggccacc aaacatgacc ctgcatgttc 123421 ttccctggaa gaaactaagt agttattttg tttgtttgtt tatttggaga cagagtctta 123481 ctctgccacc caggctgaag tgcagtggcg tgatctcagc tcagttttgg caacctctgc 123541 ctcctgggtt caagaaattc tcctgcttca gcctcccgag tagctggatt acaggcatgt 123601 gccaccacgc ccagctagtt ttctgtattt ttagtagaaa tggggtttcg ccaggttgcc 123661 cagtctggtc tcgaactcct gagctcaggc aacctgcctg ctttggcctc ccaaagtgct 123721 gggattacag gtgtgagcca ctgtgcccag ccccttagtt atttcagagc cagactctta 123781 agcactttgc atgtgtcatc ccatgtgctc ctttaacgac cctaaacaat aaggaccatt 123841 attagtcctt tgtcacaaat gagaaaaatg aagcccaggg aggttaacta atttgcctaa 123901 atcaccagcc tagtaagtgg tggtgccagg ttttggaccc tgacagtcta actccagagc 123961 ctgaaacttt accagctgtg ctccgctgtg gtgcaagaga aatgctgacc atggcgatgt 124021 gaattgtctg ctgcattagt agatttaaca aaggcatttg atttgttaaa tgagttcaaa 124081 tgtagaaatg atacaaaaga tcggctgtct agagaagctg gtgcacacat ttctttcaca 124141 agggaattat cgtttgaggt atacaagcca gagaaatgta aactgcatag agtgtgacag 124201 atatgccaaa caagtctgtg ttctcttacc aataaattag tttacagatt tcagcaaatg 124261 ctctcttggg ggcccccact gattgcttat ttttccccac gtgtttaata tccaggagaa 124321 ggggatttga gtcccacaga aggagaaact ggtgataaca gttacttcaa gtctcagaga 124381 gggaggtgcc tcattttcca tgttaatggc tgccagcccc acaatccact cagcaagcct 124441 tctagatcaa tcccaaacaa gccattggtg acccccagca atcttcaaag ggaattatca 124501 gtgaggttaa gtcagataag aacttagtct atttgtaagg ctttgatttt aaaagaaagt 124561 gctgacagcc actattcaag atcttttcta tatataaatg actgagcaat tttgtggctt 124621 ataattagaa caatgcatga caatttctag attgaggttc caaggttact cttctctttg 124681 gtctatcagt gccaaaaagc caaaaggtca tcttctaagg ctccagggat agcactcatt 124741 accctgataa atggctcact ctagaagtcc tggctttgat gttacctttt aaaagtggct 124801 ggtttttgtc tggccaaagg tggggccatt tgggtggctc acagataatt tgtggcaaca 124861 ctgagttaat atcagtttca agacaaaaca cattttattg ttaagaaact atttgttaac 124921 tcattacctc atgtcatagt attctctgcc ttgccatgtg gctataaaaa aaaaaataaa 124981 cattcaagtt tcacattaga aagcttagcc tgattcaaat ctgttttctg tggctgggca 125041 ctgtggctca tgcctataat cccagcactt ttgggaggca gaggtggggg gatcacctga 125101 agtcaggagt ttgagaccac actggccaac atggcaaaaa cccacctcta ctgaaaatac 125161 aaaaattatc ctggtgtggt ggcgggcgcc tgtaatccca gctacttagg agcctgaggc 125221 aggagaattg cttgaacctg ggaggcggag ggtgctgtga gccgagatta tgccattgca 125281 ctccagcctg ggtgacagag caagactcca tctcaaaaaa aaaaaaaaaa aaatctgtta 125341 tctgcataag acacctaacc tgtaatgacc aattaagact caaattagct agcgccaaca 125401 gcgggtatca aaatgccatc aaaattttct aagcttgcac ctacaaatgt tccctaaggc 125461 aagcataaag gcatctaaca tttaccctaa attatgccag tgagtagcaa aaatgtgctc 125521 agttagacgc aacatgtcac aacatggtct gactgttgga agaacttagt gcagggagag 125581 ctatacccag aggaaagaag taaaattagg cagagtgttg atggctgagt tccagtgtca 125641 catttatata cagctcaatg actctagaat tgtccttaca ccaaaaaaaa gttattcata 125701 gattcaaaaa atcaactgct cactactttc atttaaaaat gccttgtgtg aacaaggcgt 125761 tccaactgaa aactggcaga attcatagag gttcttaaag aacatcaatt agattcttag 125821 tcaaccaatt tggctgtaaa atcaaaactg aaagtgcaat ttccaaaact aattatgcta 125881 aatactttta aatatatata acttgataat aacatttgga ctttatgtat ggaaagaaac 125941 agtagtttcc accacaggaa ttttcaaaag aaaaatatat aggttttaaa ccaatttatg 126001 aagatctgca ataagatttt attgaagaga aagttttccc ctattttcct aaatattact 126061 caaaattaat tctcaaccca aaaggtgaca gcatgattct agtagggtcc aagtcaatcc 126121 cagaacacaa taataattga tcccttcccc aacccaagcc ttcagccttg caaacactat 126181 gccatagatc aaaagtggaa ccaaatgaaa atgtgaccat atttctacaa atccatcaat 126241 ttggagggca aaaaaccaac aatccaaagc ccatctctaa tggacagtgt tagatatttc 126301 accctcatgt caaaagaaac atgtataatt acatcatcta ggttactaag aaaagcatat 126361 ctttaaagtg aaggggtatt tagaaaaagg atacttgaca taaatgatgc aaatactcaa 126421 aaaatatatt aaatatctgt gaaatgtgtt aactatgaaa gctttttaaa agcacatgct 126481 gagccttgtc ttactttcgt gtacatttaa ccaggcttca ataatgctct atttatcttt 126541 atttcattaa ttaaataata aatatctaaa tttttttatt ttttgagaca gagtttcgct 126601 gttgcccccc aggctggagt gcaacagtgt gatctcggca caccacaact tctgcctccc 126661 gggttcaagt gattctcctg cctcagcctc ccgagtagct gggattacag gctcgcgcca 126721 ccacgcctgg ctaattttgt atttttagta gagatggggc ttctccatgt tggtcaggct 126781 ggtctcgaac tcccgacctc aggtgatcca cccacctcag cctcccaaag tgctgggatt 126841 acaggcgtga gccaccgtgc ccggccaaca tctacatatt agtaggaaca caatagcaaa 126901 aaaaaaaaaa aaaaaaaaaa tcacaaaaac tgataaatat ttaccaactc tgtggcttcc 126961 ttccagctca tgagcataat tttataaaat tgctatctct atgtgtcaac catttcaagt 127021 ccttcttttt cacttacttt gaatgaagta ttatgtttct acatgatctt cacagtcatc 127081 ttgaaagtta ctggagcatc ctatggtcta gctcagtgat tcctgaataa cagtttattg 127141 accaagctag gatgaagttt tcatcagtcc acagttaaat gcgaaaagca cagacaagtt 127201 tgtgagtttt taacaaagct gaatgattca attgaaagga ttagacttta ttctgagatt 127261 atgttattct ccctttttta tgttaaaatg tgtttttatg aaatgaccat ggtggtggtc 127321 aacggcagct ttttctgtat ctttctcact caacaaaaca ctgaaatata ctaattttgg 127381 tatcccctac ccagttattt tttattttac tggtctatta aacctaaaag tctggtaact 127441 ataataccag tctagcctgt ctaacaacac acatatatat taaggcatac acttcccccc 127501 aacttcaccc ctgcaataca gaatgttttt ggagactccc atggcagcca gcctctgaaa 127561 gggcccccaa tgatccctgc cccctggtat tcacacagtt gtgaagtctc cacccacacc 127621 ctaactagga tccatctgtg tggccaatgg aacacagcaa aagtgaaggt atgtcactcc 127681 caggattaaa cgacacaagg catttcagct tccatcttgg ttgctttctc cttcttagat 127741 cactctggga gaaactcact gccatgttgt gacaacacta tggagacgcc caggtgaggg 127801 actgaggctt cctgccaaca gccacatgaa taagattggg aacagatcct ccagccccag 127861 tcaagccttc agatgactgc agtctcatga aagaccctgt gccaaaacca cccagcttga 127921 tgaaataatc tgtacaacaa acccccatga cacaagttta ctacaacaaa cctgcacatg 127981 tacccctgaa cttaaaagtt aaaacaaaac caccaccacc accaccacca cccagaaaaa 128041 acacccagct aagccacttc tgaattccta acctacagaa actatgaaat aataaatatt 128101 tgtattttca aaattagctg ggtgtggtgc catgtgctta taatcccagc tacttgagag 128161 gctgaggcat gagaatcact tgaacctgag aggcagaggt tgcagtgagc caagattgtg 128221 ccactgcaat ccagcctggg cagcagagcg agactctctc aaaaaaaaga aaaaagaaag 128281 aaagagagaa gaaaaattaa aattaatgtg tagaatattt tttaaattaa agttaaataa 128341 ataaatattt gtactttcaa ccatcaagtt tgaggtaatt tgttattgac caatagataa 128401 taaatacaac ccttttatcc tatttcagcc acaaaatgag catccctgta gccccccagg 128461 gatgcaatgt ggtgcaatgc agaaactgta tttatggctg agttggaaga gagatcggat 128521 cagcaaagac tgtgatctcc tttaccctgg ctttagttta catactctga cttttttctt 128581 ctctgttgct ttttctactt ttcttgtatt gaccagggta ctcagtaaac tgaataatcc 128641 atctctagca agggactcaa tcctgcaagt ttatatgctt aaaggaatta ctttatgtaa 128701 atatggtatt ttatgaaatt ttagaaaact ggtaaatgtc tattgacaga atccctaacc 128761 ccagctgtcc aaatctttgc tagactcatc cataccttaa aagaggagca tgtcttatat 128821 ttcactaaga aaatagaaga caacagatat gaactctttg aaatgccttc cttccacctt 128881 taaaactata agtattgagg tgaaaactat tattttagta gatgctagag ttcttaggga 128941 tggaaaatgc cttatttagg aaactacttt gaaatgacat ttgaagtatg gaaaaagaga 129001 gaatgactta gaataaaact ctgaagcaaa gagacagcta gtcagatcta tattttttaa 129061 aatccaaaaa catggggact ggaggagagg aaatggaggt ggataagaag agatggggct 129121 caaataacag tgtgggaggc tggagctgcg ggagagagtt cccagtgata ggggagccgg 129181 agaatgttta aaatagagat atctattgtc ggaattttaa gttatttgtg ttgctaagga 129241 tataaaatcc cctaagcctt cagtaatatc tgtcacatgc acaaatgcct tatgtgagtg 129301 atttggggga gaattacgaa aaaagattgc aaggggctga gctccacaac tgggtcagca 129361 aagaaccaag aaatgagaac agccacagaa gttcagatac aagtaagata aagaatttaa 129421 tggaagcaga aactcaaagc caaagaaacc ataagaagga gagcttccag gaattcacag 129481 aaatcttgga ttgagtttcc caatggatgc agaatgggga cttaagccaa tgttacttaa 129541 atctcagaaa agaatgttgc cttaagctga cagctgagta catattcact gattcttctt 129601 tcatctcttc cggcccttga caaagagatg tccttaactc ctttctgaaa ctaggtgctc 129661 catttttgaa tgtgatctaa tatccttcct ttaactcttg cttgatcagt tattctcttt 129721 gctacataca tggtcaataa cctccttact atagcgtttt acccccattc tgcttataaa 129781 caggttcagt ctcaggcctg gggaaaataa gagaataact cagctcaagc taccatcatc 129841 ttacaacacg ggctctgaac ccagaaagat ttagatttga atccttgttc cactatgtat 129901 tcatggtgga acaccctggg catattacat aacctctcta tactctctcc actacaattt 129961 cctcatcaga acatggggat aataacggta cctacccata ggagtagtgt aaggattatc 130021 ccagataatg catgtaaatt gttagtccag ggcctggtat acagtaagcc ttcactaaca 130081 tcaactgctg tcatcatcat catttgccca aattcttgag tcatctcagg ctgggcacag 130141 tggctcatgc ctgtaatccc aggactttag gaggccaagg tggacggatc acctgaggtc 130201 aggagttcga gaccagcctg gccaacatgg tgaaaccccg tctctactaa aaatacaaaa 130261 aaaattagcc aggtgtggtg gcaggcacct gtaatcccag ctacttggga ggctgagaca 130321 ggagaattgc ttgaacctgg gaggcagagg ttgcagtgag ccaagatcgt gccactgcac 130381 tccagcctgg gtgacaaaag cgaaactccg tctcaaaaaa aaaaaaaaaa aagtcatctc 130441 ttctctactg tcattcactc tttaatccct ggggggctgg ctgctgtcaa tttactgaaa 130501 ctgctctcat taagataacc agtgatcact tctaatatga ggttatagaa aaaacaaatg 130561 aaaacacaaa atgaaaaaaa gaaccagcaa cttcctaaat tcgttatccc acttaatctt 130621 tcaggccttt ggaactcttc tttagaattt aacagaccta gtcactcacc ttcttgaaat 130681 ggtccagtct ttgctttgca tggcattgcc tctccccatc ctttctcttt tctttcatta 130741 agtcttaatt ctccaccatc ccttaaatgc ttgtgtgtct gggtctccac ccttagccat 130801 ctttttatca ctaggtgaac acttctaaga cttcagcagc caaatctcta tctttagccc 130861 agaccttcct tctgagctct tgagccaaac tgtccactaa atttattgtc taaggttttc 130921 acagtcatcc aaaccaaatt tatagagact attaactaaa tcattatttt ctctcccttc 130981 cccaattctt tcccttccct agtaatcatt ttcttttttt cctttttgag atggagtctc 131041 gctctgttgc ccaggctgga gtgcagtggt gtgatctcgg ctcactgcaa cctccacctc 131101 ctgggttcaa gcgattctcc tgcctcagcc tcccaagtag ctgggattac aggcgcatgc 131161 cgctgcacct ggctaatttt tgtattttaa gtagaggcga ggtttcactg tcttggccag 131221 gctggttacg aactcctgac ctcaagtgat ccatccacct tggcctccca aagtgctggg 131281 attacaggcg tgagccaccg caaccagccc ctactaatca ttttctcaag tttccagctt 131341 ggactggaat gtcattgtta tagtctagcc aggagtccaa gctggaaaca tcagttgtta 131401 tccttatatc tccctcaccc agcatgtcca actggctatc agggcctgac agtcccacct 131461 caaagtctca tggcttcccc gagtcctgct ccatcctaca tgaccccact gtatttcaga 131521 gtgggcttta gagtcacatg ggcctgggtt caaatattaa ctatgccata aacctactaa 131581 tgactgtttt tggtcaagtg acttaacctc tctgacctca gctttttgtg ataattaaat 131641 gagatatcat atgtaaaata gctggcacac agtaagcact caacaaacat tccgctgcat 131701 ccccttcctt tgggtctcca ttgctaccgg gtggaatgca atatctacct acttggtcta 131761 tcttgtcctt tctcctccta attgccctag agttaatttt tctaaaataa ataaataaat 131821 aaatctggta ctatcatcgc tggctttaaa accttcaaca ttttcttttt tcctgtggaa 131881 tgaagtctca attccttaac ataagtggta agttccagct gcctttctgg tccctgctcc 131941 ccaagcccat ttactccaaa acattggctt tttgccagcc acttcatgta catacgggct 132001 taatctccac acatgaagag ccctttgact aattcccttc cccacaccaa gttctgtcca 132061 attggcaaga acctcaaggc ccacttcaaa aactatcata taaagggtga tacctattct 132121 taagtggttc aatttttttc ttttcttttt tttttttttg agagagagag aggatactgt 132181 tatgttgctc aggctggtct tgaactcctg ggctcaagtg atccaccccc atgtcagcct 132241 cccaaaatgc tgggattaca agtgtgagcc tctgcacctg gcctggttca attttttaaa 132301 actatttttt acatatacgc aaacataggc caggcaccgt ggctcacgcc tgtaatccca 132361 gcactttgga aggccaaggc aagcgaatca cttgatgtca ggagttagag accaacctga 132421 aaaacatggt gaaaccccat ctctactaga aatacaaaca ttaactgggc atggtggcag 132481 tcacctgtaa tcccagctac tcaggaggct gaggcaggag aattgcttga acccgggagg 132541 cggaggttgt aggtgaggcg agatggtgcc actgcactcc agcttgagtg acaagacaag 132601 actctgtctc aagaaaaaaa ataaaaataa aaaataaata aaaatataaa atatgtatat 132661 atatacacac acacatacat aatatacata tatacacaca cacaaaggaa gagagagaga 132721 aaaagtgcta aaatgtggat gtggcaaaac atcaaaaact ggtgaatctg ggtaaaaatt 132781 tcaaatgtac aaaaaacttg caaaatgcca tataattctg gcaacatttc tgtaaatttg 132841 aaaatatttc aaaagaaaaa agaaaggacg ggcagggtgg tttgtgcctg taatcccagc 132901 cctttaggaa gcggaggcag gaggatcact tgagcccagg agctcaagat tacagtgagt 132961 tatgatcctg ccacttcact ccagcctgta caacagggcc aaacaactag cctatgtttt 133021 aaaaatgtca atgtcgtcaa aaaaagcaag ggcagaagga aggaaaggag gaagagggag 133081 aaggggaggg ggagaggaag gaaaagggag acaggaagaa agaaggggaa gctgaagaaa 133141 cgttcaagat tagagaagac aaacatgaga gctaaatgcg atgtgtgatc ctggattgga 133201 tgttaaattg gcattaaaaa aaactgctat aaaatacatt acttggctgg gcatggtggc 133261 tcacgcctgt aatcccagca ctttgggagg ccgaggtggg tggatcacga tgtcaggagt 133321 tcaagaccag cctggccaac atggtgaaac tccatctcta cttaaaatat aaaaattagc 133381 taggcgtggt ggcacgtgcc tgtaatccca gctactcagg aggctgaggc aggagaatcg 133441 cttgaaccca ggagacagaa gttgcagtga gctgtgactg tggcactgca ctccagcctg 133501 ggggacagag caagactcca tctcagaaaa aaaaaaacaa cattattgga acaagtggtg 133561 aaatttgcaa attgactctt tattatataa tagcattata acaatgctaa atgtttttaa 133621 aagttattct gtagttatgt aagagaatgg ccttgtgctt taaaaaattc atgctaaaat 133681 atttaagggc aaaggatcat gatatgtgca actttaaaat gtttcagata aatagtctgt 133741 gttcgtatgt gtgtctagag agagaaaaaa tatagcaaaa tgttaacaat tgataaatct 133801 gtattaagat ttaccacttt tacaactttt ctgcacgttt gaaatgtttt caaaattaac 133861 ttttttaaaa aatatttttt ctgaggcagg gtctcactct gttgcccagg ctgcagtgca 133921 gtgccaaaat cacagctcac tgcagcctca aattcctcgg ttcaagtgac cctcttaccc 133981 cagcctcccg agtagctggg actacagcca tgtaccacca tacccagcaa cattttttat 134041 tttctataga aacaggtctt gctgtgttgc ccaagctggt ctccaactcc tatcctcaag 134101 caatcctccc acctcagcct cccaaagtac tgggattaca agggtgagcc atcatgcatc 134161 gtgcccactg aaaataaaaa aatattttta cagaaccacc tcagatagaa ataatgcctt 134221 ctgaaaacca aaaagcactg atgatagata gtacaaccac tgtgaagagt tttgaggttc 134281 ctcaaaaaac taaaaataga actaccatat gatccaccaa tcccactgct gggtatatac 134341 tcaaaagaaa gaaaatcagt atatcaaaaa ggtagctgca ctcccatgtt taactgaggc 134401 actattcaca atagccaaga tttggaagca acctaagtgt tcaccagtag acaaacagat 134461 aaggaaaatg tggtgcatat acacaaggga ggactattcc gccatataaa aatgagaccc 134521 tgtcacctgc agcaacatgg atagaaacag aggtgattat gttaaatgaa attagccagg 134581 cacaaaaaga caaacttcac ggtctcacgt atttgtggga gctaagaatt aaaacaactg 134641 aattcatgga gtagagagta gaacaacaat ggttacctga ggctagaaag ggcagcggtg 134701 ggggaaaggg gggatggtta atgggcacaa aaatatagtt agaaacaatg aataagatct 134761 agtatttgat agcacaacag ggtgactata gacagcaata attttttttt ttttgagacg 134821 gagtctcaca ctgtggccca ggctggagtg cagtggggca atctcagctc actgcaagct 134881 ccgcctcctg ggttctcgcc attctcctgc ctcagcctcc tgagtagctg ggactacagg 134941 cgcgtgccac tacgcctaat tttttgtatt tttagtagag acagggtttc accatgttag 135001 ccaggatggt ctcgatctcc tgaccttgtg atccacctgc ctcggcctcc caaagtgctg 135061 ggattacagg tgtgagctac ctcacccggc caacagcaat aatttattgt acattttaaa 135121 ataactaaaa gagtataatt ggattgtttg aaacataaag gataaatgtt tgaggtgaca 135181 gatatccccc caaaaaatca atgaaagaaa ttacagacac aaataaatgg aaaaatatcc 135241 tttgttcatt gaatggaaaa attaatgttg ttaaaatgat catattacta aagtgatcta 135301 cagattccat gcaatcccta tccaaattcc aatgacattt ttcataaaaa tagaaaaaat 135361 aatcctaaag tccatatgaa aacacaaaag accctgaata gccaaaacaa tcttgaatga 135421 aaagaacaca tcacgacctg atttcaaaat atactgcaaa gctacagcaa tcaaaatagc 135481 atggtactgc tatgaaaaca gacacataga ccaatggaac agaatagaga gcccagaaat 135541 aaatccacac atttatagtc aattgctctt ccacaaaagt actgagaaca tacaacggga 135601 aaaagagagt cttttcaata aatggcactg ggaaaactgg atatccacat tcaaaagaat 135661 gaaattagac ctttatctca cacaatatac aaaaatgaat tcaaagtaga ttaaagactt 135721 aaacacaaaa cctgaagctg taaaactact agaagaaaac acaggagaaa agcttcttga 135781 cattggtttg ggcaatgatt ttttggatat gaccctaaaa cacaggcaac aaaagcaaaa 135841 atagacaaat gggattgcat cagactaaaa agctgccgca gcctgggtgc agtgactcgt 135901 gcctgtaatc ccagcacttt gggaggccaa ggtgggggca tcacttgagg tcaggagttt 135961 aggaccagcc tggccaacat ggtgaaacct catctctact agaaatacaa aaaattagcc 136021 aggcatggtg gcacacgcct gtagtcccag ctacttggga ggctgaggca ggagaatcgc 136081 ttgatcctgg gaagcagtgg ttgcagtgag ccgagatcgc acaattgcac tccagcctgg 136141 gcaacagagc aagactccat ctcaaaaaaa taaaataaaa ataaaaagct gctgcacagc 136201 aaaggaaaca atcaacagtg aagagacaac ctacagaatg ggagaaaata tttgcaaacc 136261 atacatctga taaggggtta atagccgaaa tatataagaa ctcaactcaa cagcaaggaa 136321 actaataacc caatttaaaa atgagcaaag gacctgaaca gatatttctc aaaaaatatg 136381 caaaaatggc caacaagtat atacatatac aaaaaaatgc tcaacttcgc taatcattag 136441 gaaaatgcaa attaaaacca caatgaaata tcatctcaca cctgttagaa tagccattat 136501 caaaaagaaa acaaatgttg atgtagacgt aaaaaaaagc aaaccttata tattgttgtt 136561 gtttgagacg gagtttcgct cttgttgccc agactggagt gcaatagtgc aatctcagct 136621 caccgcaacc tccacctccc gggttcaagc gattctcctg cctcagcctc ccgagtagct 136681 ggaactggga ctacaggcat gtgccaccac gcctggctaa ttttgtattt ttagtagaga 136741 cagggtttct ccatgttggt caggctggtc tcgaattccc aacctctggt aatccgcctg 136801 cctcagcctc ctaaagtgct gggattacag gcgtgagcta ccatgcccag cctatattgt 136861 tgataagaat gggacatggc acaatcatta tggaaaaaca gtatggagac tcctcaaaaa 136921 attaaaaata gaactaccat atgacccagc aatcgcacgt ctgtagtatt tacccaaagg 136981 aaatgaaatc agcatgttaa agatatatct gcactctctt gttcattgca gtgctattta 137041 caatagccaa aatatgaaat caacccgagt gtctatcaag ggatgcatga attttattta 137101 ttttttgaga cagagtctcg ctctgtcatc caggctggag tgcagtgaca caatctcagc 137161 tcactgcaac ctctgcctcc agggttcaaa tgattctcat gtttcagcta cctgaatagc 137221 tgggattaca gacacgtgcc accatgccca gctaattttt ttgctatttt tagtagagac 137281 agggtttcac aatgttggcc aggctggtct ggaactcctg acctcaggtg atctgcctgc 137341 ctcagccgcc caaagtgctg ggattacagg cgtgagccag tgtgtctgtc tgggatgcat 137401 gaatttttaa aattggaata ctattcagcc ttataaaaaa gaaggaaaat tggcaaggcg 137461 cagtggctca cgcctgtatc ccagcactgt gggaggccga ggtgggcgga tcacaaggtc 137521 aggagtttga gaccagcctg gccaacatgg tgaaaccgtc tctactaaaa atacaaaaat 137581 tagccaggca tggtggtggg tgcctgtaat cccagctact caggaggctg aggcaggaga 137641 atcgcttgaa cccaggcggc ggaggttgca gtgagctgag atcgtgtcac cgcactccag 137701 cctgggcgac agagtgagac tttgtctcaa aaagaaggaa atcttatcat ttgtaacaac 137761 aaggatgaac ctagagacat tatgctaagt gaaataagcc aggcacagaa agacaaatac 137821 tgcattgatc tcacttatat gtagaatcta aataagtcaa actcataaaa gtagagaata 137881 gaatggtggt tgtgaggact gggggtatgg ggagatgtta gtcaaagggt accaagttgc 137941 agttaggatc aattagttcc ggagatctgc tgtacagcat ggtgactata attaatgtat 138001 atttataaat tgctaagaga ttgatcttaa atgttctcac cacacacaca cacaaataag 138061 tatgtgaggt gatggatgtg ttaattcatt tgatttaatc attttacaat gtgtacataa 138121 aacatcatgt cataccctgt aaatatacac aacttttatt tatcagttac acactaataa 138181 agctgggata aagaaaagaa gaaataaata gtatgctgtt tttttttttt ttttttttga 138241 gacagagtct gtgttgccca ggctggagtg caatggtgtg atcttggctc actgcaacct 138301 ccacctccca ggttcaagtg attctcctgc ctcagcctcg gagtagctgg gattacaggc 138361 acctgccatc atgcccagct aatttttgta tttttgtaga gatggggctt caccatgttg 138421 gccaggctgg tcttgaactc ctgacctcag gtgatctgcc cgccttggcc tcccaaagtg 138481 ctgggattat aggcataagc caccgagccc ggctgaggaa ttccttcttt tttaaggcaa 138541 tagtatttgt cttacaccgg aaaaaaaaaa agcacaaata ttaaattcta gcttgctttt 138601 caaaaaataa aaaagaacta atgctgcttg gtttaagctg ctgtaaatgt ttttactttt 138661 actataaaaa gcctggattg agttgtaatt attggtttaa gcatttgtct tattctatta 138721 gactgacagc ttcttgatgc aagaacttaa attgcctttt ggaattgaat agtgagacaa 138781 gtatcctaat tcagggcagt attattttcc tggcatggca ttattagagt actaatatgc 138841 tacaatttag gatcatagta aacaaggctg gacattcttt tttttttttt ttttaagagg 138901 tagggtcggg tcttgctttg tcactcaagc tggaatgcag tggcatgatc atagctcact 138961 gcagccttga actcctgggc tcaagcgatc ctcctgcata gatgggacta catgagtgcc 139021 tcacgacacc tagctatgtt tagttttttg tagaaacagg gtctccctgt gttgcccagg 139081 ctgctcttga atgcctgccc tcaatgaatc ctcccacctt ggcctcccaa agtgctggaa 139141 ttataagcat gagccaccag actggacatt cttttttttg agacagcatc ttgctctgtc 139201 accaggctgg agtgtagtgg cacgatcttg gttcactgta acctctgcct cccaggttca 139261 agcgattctc ccgccttagc ctcccgagta gctgggacta caggcacgcg ccaccacact 139321 cagataattt ttgtattttt agtagagacg ggatttcacc atgttagcca ggatggtctc 139381 gatctcttga cctcgtgatc tgcccgcctc agcctcccaa agtgctggga taacaggcgt 139441 gaaccggcat gcctggccta gactggacat tcttaaaacg ggaacaagaa tagaaaatga 139501 ccctgtggtt tggagcatag aacagtgctg gcattaatct actcaatgta ctgttctgtg 139561 tctttacaga accttctgca ggcaagactg gaaagtccac ccctggtccc aggcagatgc 139621 acaaagaagc tggtataagg gagaggcctc atgaaagttg gagctgaatt tgccattgat 139681 gcctaggatt gcaacccctg gtatttgttt tatcacttcc actacacaca gtgcaggagg 139741 gcagcccatc cttagttggc cagaggtttt actttaaaac ccatgggcta agacaccaaa 139801 cagttggaac atatagggga aatcatgctc ttcccttctc cccatgcttg ttttgatcaa 139861 gaagctagga aactttctct tctccacagt attgaagcga tggcatctgt cttagtccat 139921 ttgtgttgct acaaaggctg ggtaattaat ttataaagaa aaaaaggttt atttggctcg 139981 tggttctgca ggctgcacaa aaagcatgcc accagcatct gcatctggtg agggtctcag 140041 gctgctttca ctcatggggg aagttgaagg ggagccagcg tgtgcagaga tcacatggag 140101 agagaaaaag caaagagaga ggggagaggg gtgccaggct ctttttaaca ccagttctct 140161 cagaaactaa tagagtgaga actcacccac tccttctacc attaatctat tcctaaatga 140221 tccaccccca ttacccaagc atctctcatt aggcttcacc tccaacattg ggaatcgaat 140281 ttcaacatga gatttggagg ggacagacat ccaaactatc tcagcatcca tccttctctc 140341 tgcgtactct gctgacttac tcttccttgt agaagaaaac aattcagtgt gtgatcgatg 140401 agactaggtg cagggtcact gcacactcac cactcaggct gcctttgaat tcctcttttg 140461 tagatgtctg cccacaggcc acgtgccttc ttctctcctc cattcagcag cagatacagc 140521 agtttccggc gactatgcct atgaccaagg tcaagttcaa ttcatggaga aagaaatgag 140581 aagcctgttt tggccttgga tccaagccac cttctccagg ccagcttcag tagcaatcaa 140641 gctgacattt taaacccagt ctgattcctg tgactgtacc atttggttca ggactcaaaa 140701 gagagaagaa gatgaaggac ctctcagaat cccaacagta ttttactaat ctttggatcc 140761 cagcacctct cctggtgctt gttctattac aagccctcaa taaattttgt tgtcttgaac 140821 tcagagtgtg cagcacacag gcagatagct gctcacagct attattgggg tggttgtgtt 140881 tttttttcgt aacagaacag agtgattttt gatgcttttc tagtttgtca gagggctctg 140941 aggctataca gaagcagctt tagtgaacag aggagagcga gctgtgtctt tgtgcttcac 141001 aatgattgca atgccagaga gtgatgtccc aggggagctg tcaaacagct tgacagcaat 141061 tctagcaaga agtggtagaa acacaatttt gcaataatga tcatacgttt tttgaaattt 141121 tcctttatcc ttgaaatgcc ttgtgttgtc gaaaatctat tcattactgt tcagtcatct 141181 gtagcgagtc atccctttag gtctctgtac tcggaagtta cagccctggg agtattttgg 141241 cagagagaca aaggctccta ggcacagtgg gggagtcaga aaggtacaag taaatagcgg 141301 ctccaaggag ttagattttt aaaaaaataa taaaaggacg ggaagtgaca agaaatcatc 141361 ttcctcaaag cggctttagt tttctaaaag caggcaccat agctctttga tatttttacc 141421 atgcacatct ctggtgcttt cattttcttt ttcctctaat cccttccatg catttccttc 141481 attaattatc ccttttctct ccaggatgtt caacttctcc ctgtctctac tgcctccttc 141541 acctcgacct ataaacatgt acaagtttct tacatcctca gaaacttcca gctaccctca 141601 aatgctcact ctcttccctt ctctttgtag ccaagagacg agcctattcc agtgctaccc 141661 aaagcatggt ctgcagacca gcagcaccag catcccaggg aagccagatt tgaaatgcag 141721 ttctcacgct cacccagacc tactgaatcc gaatctctgt gggtggggtc caagaatctg 141781 tttcaacaca ctctccaggt gatgcttagg cacacggggg tctgagaagc actgcctcta 141841 cttcctgtct ctggtcacca ctttgggcga tcttcctctg tccctttaag gtgtgcacct 141901 tccccagggc tctgtcctgg gccttggctt cattgcactc aatcatttcc ctacgtgatc 141961 tcatccacca aaggttgatt tggttatttg tgtgttttaa cataggttta taccagtgat 142021 tctcaaattt atgtctctat cccagacctc tttctctgag ccctaagaat gtccagttgc 142081 tttctggact tgtttaccaa aatgttgcac agttctctaa actatgtcta aaaccaactt 142141 agtatctcct aaacccactc tgcatcaatg tcaataatct gggttgtgtg acagctttgc 142201 cacccccttg gcgcctgcca ccctgggatc cagctacacc cactgccttt atgcttccca 142261 gttcactgac tgaagtgcac accacaaggt ctggcctata gacaagagca atcacagagc 142321 tcttcaagga tgccagggca cccctcatat atttatttct cacattcttg atgaaatgta 142381 tgccttctag accctcccag ggtgggtgag taggcctcaa atgacaattg cactgtaact 142441 gccagtccct taagtctttg aatcccttcc tccacattaa accaagacat gtccaccatc 142501 tccagttcac tcacgtggac cacctttgag tctatgtttc agccagccaa ccaaccaatc 142561 agattcaaca cttccttttt tcttcttttt tttttttttt ttttgagatg gagtctcact 142621 ctgtcaccca ggctggagag cagtggcatg atcttggctc actgcaacct ccgcctccca 142681 ggttcaagcg attctccagc ctcagcctcc caagcagctg ggattacagg cgtgcaccac 142741 cgcacccagc taatttttgt atttttagta gagatggggt ttcaccatgt tggtcaggct 142801 ggtctcgaac tcctgacctc aagtgatctg accgccttgg cctcccaaag tgctgggatt 142861 acaggcatga gctgccgcgc ccagccagat tcaacatttt ctaacgccca aagctgcaac 142921 gctaaatgga gaatccctgc ttagtgagcc catgtcaaaa cattcagccc catccaactt 142981 tatgttcctt ccacctactg ggtgaagtgt cagagcccca gcatcagaaa gtggtcagct 143041 catgggtagt agggtagtaa gaagaattta ctgacaacag tataggttag aaaaagacag 143101 ttttattaga tagaagagtg tagctgggca ctactgcaag agaggaccga gcgtgctgca 143161 gtggactttt ccttaggggt atttatgaat cttaaagagg gagcttaacg gtaattggac 143221 tatactgacc acagaggtca tgatacatga ttacatttgt agacattttg gtgccttgat 143281 gtcagcaagt gttgcacgat gagtttcgac atgcatgcat tctggagatg tatagaaatt 143341 ctagttattt atacattttg gagaaagcag cccataccag atgcctgctt tagatcatag 143401 ggaatctctt atttctaaat ccctcagctg aggagtttgg cctctggatg gactgtttgg 143461 tgcctctccc aggtgatctt tgctctcctc accaccatta tcccacactc atagtatcca 143521 ttcccataca cattccctga atttctgtct gtagaaattt aaaaagtcaa gtagttcagt 143581 ggagtgcagc acacctctta tgggccagtc acacagtgta cctcatcttc aggggctgct 143641 ggactgaagt ctaacaaaga ggagtggtgg ggtgggtcct gaggagttca acattgtgtt 143701 gctcagcacc tgcctcaggg gaggccatta ctatttcctc aggcaatgca ggcttcatcc 143761 tctcagaggt ggaaagacca ataccactga gggttgggaa tgccactgtt gctggggttg 143821 ttgggaagca aaggtgggag tgctccttca ctgataaagg agacatcaga atttaggggc 143881 tcaatgtcct cagctttatc aaagttttcc caaacatccc catcccaact tgcaagatcc 143941 cattctttcc caattaatgc tctcacttta actgcacata gcctgcaaag ctgtgagttc 144001 aacttgcgtt gtaattcagc cacttgcagg atgaggttct gcatttgact ttcagcaatt 144061 tccgcccttc tgtacagtaa ataaaggtct ccctcagggc acacataaaa gttcctaggt 144121 catttttgtg gtgcatgaac taggaatgtg aatccctgac ctcatccttt ccttccacca 144181 gcatgacatt agggttccaa ccagcatcat tatattcatt cattttccaa aatgttcgaa 144241 agtatcatat ataagccagg catggtggct cacacctgta atcccagcat tttgggaggc 144301 caaagtggga ggatcacttg agcccaggag tttgagaaca gcctgggcca catggcaaga 144361 cccttgtctc taaaaaaaaa aagctgggca aagtggcaca tacctgtagt cccagctact 144421 caggaagctg atgtgggagg atcacttgag cctaagcagt caaggctgca gtgagccatg 144481 attgtgctac tgcactccag ctggggtgac agagtaagac tctacctcag aaaacaaaca 144541 aacaaacaaa caaaaggtat catatataac attactgagc tcattgattc tatagttggt 144601 tgattaggag tatccaacac agtattctgt gtatctctac aaacagctca cgttatggac 144661 tattagcact ctttttacta ctggaaatac agtcattagt gcctttaaat ctaatcagat 144721 tagagagcca attctagaaa ccccagaacc agttcagaaa attcatcctt aaaattctgc 144781 tcctctagaa gcactctcag tgccaaaatc tatacaaagt tttccagaga aacagaacaa 144841 gaaggagata tctctatata tagatagaca tagagatatc tccagatatc tccttctggt 144901 cctgtatata gatagataca gagagctagt ctcatccaca aacactctca aagacacaat 144961 gaaaaagaga gagggattga ttaattgtaa ggaattgact cacacgatta tggatagtaa 145021 gtcccatgac cagcctttct gtaagccaga gacccaggaa agctcatggt ataattaagt 145081 ctgcatccaa agtcctgaga accagggaac caacggtgtg taaatcccag tctggagatg 145141 ttccagctca agcaggcagg caggaaacca aaacagggca aactccttct tcctctgcct 145201 tttgttctct tcaggccctc catcgatcag atgatgcctg ctcacattag ggaaggcaat 145261 ctactttaca gaatccaatg tcaatcttag ccagaaacac ccgcaaagac acatcaggaa 145321 ataatgttta ttctgggtat cccatggcta gtcaagttga cagataaaat taaccatttc 145381 atgggcatat gactaaactg agcaaccaca cagtgatgaa aatgcctgct aaaaggaaga 145441 gtgtcatcta tacagttttg aagttctcta gaattctgct tactctatta gtccattttc 145501 aggttgctga taaagacata cccaagactg ggtaatttat aaagaaagag gtttaatgga 145561 ctcacagttc catgtggctg aggaggcctc acaatcgtgg tggaaggcta aaggcacatc 145621 ttacatggcc acaggcaaga gcaaatgaga gtttgtgcag ggaaactccc ctttataaaa 145681 ccatcagatc tctctatctc aagaactgca cagggaagac ccaccccccg attcaattac 145741 ctcccaccgg gtccctccca tgacacgtga gaattgtgga agccacaatt caagatgaga 145801 tttggatggg gacacagcca aaccatatcg gttacctttc taggttttag gtcaatttca 145861 agatgcatac atcaccacca agcaactaca cagcaaatat actcagtccg tgattctgaa 145921 acatgggcat gcatcagagt cacctgggtg gcttgttaca atgcagattt ctagggtcca 145981 cccctagagt ttctgattta gtcggttttg gatgggacct gagatttcct agtgctaaca 146041 aatccccagg tgatattgat gctgatcaaa ggaatacact ttgagaacca gtaaattcaa 146101 gagtacaatt gctacacctg acaatcttca cagccaagag aagctaatct gatctccctt 146161 aataaaacca tattattttt tttctttctc cccccgcccc cccaccccga gaaggagtct 146221 cgctcggttg cccagactgg agtgcagtgg cacgatctcg gctcactgca agctccgcct 146281 cctggtttca tgccattctc ctgcctcagc ctcccgagta gctgggacta taggtgccca 146341 ccaccatgcc cggctaattt ttttgtattt ttagtagaga cagggtttca ccatgttagc 146401 caggatggtc tcgatctcct gacctcacgt gatccaccca ccttggcctc ccaaactgct 146461 gggattacag gcgtgcacca aacgctcctg gccagaaaac catattctaa ggaaagcaaa 146521 cagttatcac aattacacac ttcagcaacc tccatctcct ctttgctact taagggatga 146581 aaacatcaac tgtgtatgta aaagttaaat gttgggaaag cggaggaaca taagtttttg 146641 ttttgtttgt agagacaggg ttctcattat gttacccagc cttgtctcaa actcctgggc 146701 tcaagcactt tacctgcctt agcctcccaa atgagttcta acactttaaa ttctgttcat 146761 ctctgaaaaa atcactgcaa ggctgaattc accgtacgat aaagaaatca tgcccacaat 146821 gttatttttc tagggttccc ttttcctcac aaagtggtgc cagtggaaag cagcatttca 146881 gtaactccta cctttatcct agtttagtga ctgatgcatt aacatggggt gagtttgatt 146941 aaagggggca gccaacattt acaggtacaa ttaaaatagg agctatgggc tgggcatgga 147001 ggctcatgcc tgtaatccca gcactttggg aggcgaaagc aggtgaccac ctgaggtcag 147061 gagttcaaga ccagcctggc caacatggtg aaaccccatc tctactaaaa acacaaaaat 147121 tagccaggca tggtggcaca cacctgtaat ctcacctact ccagaggttg aagcacaaga 147181 atcgcttgaa ctcaggaggc agaggttgcc gaaatcttga gaggttgcgg aggagagagt 147241 gagcagagat cgtgacactg cactccagcc taggcaacag agagagagtc ggtctcaaaa 147301 aaaaaaaaaa aaaaaacaaa aaacaaaaca taaaaataaa attaggccag gcacagtggc 147361 tcatgcctgt aatcccagca ctttgggagg ccaaggtggg catatcacct gaggtcagga 147421 gttcaagact agcctagcca acatggtgaa actccgtctc tactaaaaat acaaaaaatt 147481 agctgggcgt ggtagcacac acctgtaatc ccaactactg gcgaggcaga ggcaggagaa 147541 tcgcttcaac ccgggaggcg gaggctgcag tgagccaaga ttgtgccact gcactccagc 147601 ctaggtgaca gagcaagact ccgtctcaaa aaataaatta attaaaaaaa aaaaacagaa 147661 gctatggtgc tatcaggaaa gggagtaaag atttgctctc attctattct ctcctttatg 147721 tttcagacag ttgaagggac tacccaaata ccaaaatgat attgaggagg aggcactttg 147781 tgatggctaa ttttatgtgt cagcttgatt gggtcaggag tgtccaaaca ttgggtcaga 147841 cgttattcag gtgtctgggg atgacattaa cattggaatc gagagactga gtaaagcctg 147901 ctgtgcttgg gcctcatcca aacagttgaa gacctgacta gaacaaaatg gctgagtatg 147961 aaagaactcc tgcctcactg ttgagcatca cagttgacat cagctgtttc ctgcctttag 148021 acttgaactg agacatcgct tcttccttct gacttgaact gagacatcac ctcttccttc 148081 agacttgcac ggacacatca gctcttcttg agtctcaagc ctgctggttt tcgaactaga 148141 atttacatca ccagcccttc tgggtctcca gccatccaac tgcaaatcct gggacttgtc 148201 agccttcata attgtgtgag tcaattctat actaaatctt tatacactca catactctgt 148261 tggatctgtt tctctggcaa tcccttaata cagaactgga ccaaaaattc cttctaaatc 148321 actgtttgct gccttaattt ctacctcact aaaaattagc actattccta gcaacctgtc 148381 tcaaagtccc ccatctcccc ccaacctttt tttttttttt tttttttgag acagagtctc 148441 actctgctgc ctaagctgga gtgcagtggt gcaatctcag ctcactgcaa tctctgcctc 148501 cctggctcaa gcgatccttc tgcctcagct ccccaagtag ctgggaccac aggcacacaa 148561 catcatgccc agctagtttt tgtatttttg gtcgagacgg ggttttgcca tgttgcccag 148621 gttgctctca aactcctggg ctcaggtgat ccacctgtat cagcctccca aagtgctcag 148681 atcacaggca taagccactg cacccggcct caaagtccct ttaaaggaca tctgcaacct 148741 ggcatctcag tacaggtgat tcagattcaa tgactcagtg gtgatttcag ccctgttgtg 148801 ccatcagccc tgggagtgaa gccaaggttg aggcttgctg aaagtggaac gcatgttcat 148861 ttagacaccc attgtaatat tctgggtgat gctaattttt cttgcttaat atcagagaac 148921 agagaagtta gagatgatat caaaaatgga aacaacatgt acagtcccca taatttgtga 148981 attatgggga cagattccat ttctgtcttt tgtcttgagc ttctatgtga gctactacaa 149041 aaatgacagg gctttctgcc ctccatttcc cccttagttt gcacaacaca cacacccctt 149101 ctcaaacttc tgaaagctct cagacatact tttgaaagta aagaggctat agaggacata 149161 tcaatttatc taatagagta atagcattat gcaggaaatg gtaacttgaa gagaagcatt 149221 tgataggcat gaaagagcag caaagctgca tagcattaac accccactcc actttaagta 149281 ctgatgtagg taactgctgc aataattatg ccattaagaa agagtgttcc aatggccttg 149341 atacatgcta ccatcggaat aaagttagga cattttcctt atagttagtg cagtgcgaat 149401 tgaagaagac caagaaatgc ttttcagagt aagagaggta ccataaaggg cctcagagat 149461 ttgcttctat caggccaggc acagtgactt atgcctgtaa tcccagtatt ttgggaggcc 149521 aaggcaggtg gatcacttaa ggtcaagagt ttgagaccag cctggccaac atggtgaaac 149581 cctgcctcta ctaaaaatac aaaaattagc tgggcatggt ggcacacacc tgtagtccca 149641 gctactcagg aggctgaggc aggagaattg cttgaaccca ggagacggag gttgcagtga 149701 gctgagatca tgccaatgca ctccagcctg ggcaacacag taagactctg tctcaaaaaa 149761 aaaaaaaaaa gagattctat caaaggaggc aggggtatgc tattggttac tggtgcatat 149821 tagatgcttg ccagatgcca agcctaggta aacttgtaca ctagccatga tatgagaagt 149881 atgttggggc tgatgctggc ttcaggagat ctacatggtg tgagtctgga tcaataaaat 149941 gtgaaaatta atggtagctt ccatttagtg aataataaca tcaatagtta acaactctgg 150001 gctaggcaca gtggctcacg cctgtaatct cagcattttg ggaagccgag gcaggcagat 150061 caactgaggt cacaagttcg agaccatcct ggccaacatg gggaaacccc gtctctacta 150121 aaaatacaaa aattagccag gcatggtggt gggcactgtg gctgtaatcc cagctactgg 150181 tgaggctgag gcaggagaat tgcttgaacc tgggacgcgg aggttgcagt gagccgagat 150241 tgcaccactg cactccagcc tgggtgacag agtgagactc tgtctcaaaa aaaaaaaaaa 150301 aaaaaaaaaa agtaacaact ctggaaagaa agtattcttt gtcttttctt ttttcttttc 150361 tttttttttt ttttttgaga caggacctca tattttgttg gagtgcactg gtgcaatcat 150421 acctcactgc agccttgaac tcctgggctc gagcaatcct ctcacgtcag cctcacaagt 150481 agctgccact acaagtgcat gccaccatgc ccgaataatt ttttcagttt tattttgtaa 150541 agacaatgtc tcagcatctt gcccaggctg gtcttgaact cctggactca agagattctc 150601 ccacctcaat cccccaaagt gctaggatta caggcgtgag tcactgagct tgcccaggct 150661 gcttttgaac tcctagacta aagagattct gctgcctcaa ttccccaaag tgttgggatg 150721 acaggtgtga gccaccacgc ccagccaagg gaagaaaata ttcttttttt tttttttata 150781 ctttaatttc tagggtacat gtgcacaatg tgcaggtttg ttacatatgt atacatgtgc 150841 catgttggtg tgctgcaccc attaactcgt catttacatt aggtatatct cctaatgctg 150901 tccctccccc ctccccccac accaagggaa gaaaatattc ttaagtgacc tgcccaaagt 150961 catacagcta ataagtggca gagacaagat ctgaacctaa gtgcttctga ttccaaagcc 151021 tgggcttaaa cacaatttga ttctgcttgc caaagcatta cagctgagta agctttaagg 151081 aaacctcacc aatcggaacc atgcaaaata aagaaatatc agaggcctga gctatcaagt 151141 ccagtgagga gggtagccac ttggccaaga ggcccagtat tgaacagaaa tattcacagt 151201 accttgaatg aaggaggggc caacagtgac tcctggtcct tgaccaaact tgagtcaggc 151261 tcctctgaat gctcttcttg accaggcctc atccttggcc tgctgaatct ggttctgcaa 151321 gaatccccca cccttgttac tttaccaagt tccttgcatt acttttccat ccactggccc 151381 ctgcaccttg tccattgtct acaaatcccc agctgccact gttatattca gggttgagtc 151441 ttgaccccca atgcaatagt cttgaaaaaa gttttctttg cctacttaac ttgttcagcg 151501 caatttttct ctgacaggta aacaatgagg gagctccatt agcacaacca gagtctttca 151561 tccttgccgc cccagaggat ctggtgtctg ggtcaacaga ctgaccagca caggaagctc 151621 ccacaccttc aagttgagtc tgccagagga ctctccaggt tgcattgctg tggggacctt 151681 tatgcaaggt aaggagacaa accagggagt cgaaggcagg aggagaggac tggaatacaa 151741 ttttaagaaa ggagtggctg gggctgggcg tggtggctca tgcctgtaat cccagcgctc 151801 tgagaggccg aggcaggcag atcacctgag gtcaggagtt cgagaccagc ctggccaaca 151861 tggtgaaacc ccatctctac taataataca aaattagctg ggtgtggtgg catgtgcctg 151921 taatcccagc tactggggag gctgaggcac aagaatcact tgaacccagg aggcgggggt 151981 tgtagtgagc caagatcacg ccactgcact ccagcctggg cgacagagtg aaactctgtc 152041 tcaaataaaa aaaaagaaag aaaagaaaag agtggctggg cgtaagcacg cctatagtcc 152101 cagcactttg ggaggccaag gtgggaggat tgcttaagtc caggagtttg agaccagcct 152161 gggcaacata gtgagactcc atcaaaaaaa attagccagg cttggtggta cacgcccatg 152221 gtcccagcta ttcaggaggc tgaggcagga ggatcacttg agcccagttg tttgagaatg 152281 taggaagcca tgatcatgcc actgcagtcc agcctgggtg acagagtgag acattgtcta 152341 aaaacaaaaa gaaagaagga aggaaggaaa agaaaagaaa agaaaagaga cagcaagaaa 152401 gcaagaaaga accttccgga gtttaaactg atgcactgag tacctaagat ctctctcatc 152461 tcccattcaa ggacccattg aaatgatgaa aaaggcattt tgaaaaagag tgaaataata 152521 agaggcgcaa aaagaaaggc tgccatcagc aggcaagaaa tcttaaaaac tcctggaggg 152581 cagaaagcat taggatgaga ttgacaaaga agcagacaag aaaaccacag attcaaacgc 152641 caccaggaag gccagatctt gaaaagaagt ccatggaagc ttctaactgg atgacgccag 152701 acagaaggca cagaagtgca ccatggcaat cattaggata attcattaaa gctgggagag 152761 ttgggactgc cagtgtctta aacacattca gcttttgccc tccagctaaa catagaaaac 152821 ctatccagaa aagaataaaa aagcgtactt ggtaattaag gtatgattac agggcataag 152881 aaaaaaaatc agatggcagg actgccttcc ttagaatgta cacaagtagg acaggcacag 152941 tggctcatgc ctgtaatccc agcactttgg gaggttgaga tggacggatt gcccgagccc 153001 aggagtttga gccatgggca acatggtgag accgcatctc tacaagaaat acaaaaatta 153061 gcttggtgtg gtgccatgtg cctgtagtcc caactacttg ggaggctgag gtgggaggat 153121 cacttgagcc caggagattg aggctgtagt gagccatgac cacactccag ccagggtgac 153181 agagcaagac cctgtctcaa aaaaaaaaaa aaaaaaaaag taaacaagtg acgactgagc 153241 ttgagatatg aaagtaaagg tggccagacg tggtggctca cgcctataac cccaggactt 153301 tgggacgcct aggtgggtgg atcacctgag gtcaggagtt tgagaccagc ctggctaaca 153361 tggcaaaacc ccgtctctac taaaaataca aaaatgagtc aggcatggtg gtggcaggca 153421 actgtaatct cagctactcg ggaggctgag gcatgagaat cactctaacc tgggaggtgg 153481 agcctgcagt gaactgatgt cacaccatcg caccccagtc tgggcgatag agtgagatac 153541 cctctcaaaa aaaaaaaaaa aaaaaaaaaa aaaagtaaag gaaaactttc agaataaaaa 153601 ggaaacagac aaaaataggt aaatgtgaga gaaaaggctc aagggtgata gagtcaggta 153661 gtccaatatt cctttcatag gaattccaaa ggagacaaag aaggaagggg aggaaatcat 153721 caaagatatg agagaaaaag accctgagct gaagaggaac tcatcttcag attacaatgt 153781 ccactgactg ctgtacagag tgaattaaaa aagacctaat ggtgttgcat tcttgtgaaa 153841 tttcagaacg ctgggcaatt ttgaaagctt ccggggggag atgtatataa aaaggaaagg 153901 aaagggaatt aaactgccat caaatttcat caacaatact ggttgctgga agacaatgga 153961 acaatatctt caaatgcctg gggaaaggaa tatcttgaac tctggattct ataaagaatc 154021 atccgacaca gttcaagaat caatatgaaa aaaaatattg agacctgtca aaactcacat 154081 tgtttaccac cactcattcc acgtgaaaaa agtactttag gtgtttgctt actcaaaatg 154141 aaaaaagacc ccagaggccg gatgcagtgg ctcacgtctg tgagccatga tcacgtcact 154201 tcactccagc ctgggtgaca cagcaagacc ctgtctcaaa caaacaaaca aacaaacaaa 154261 caaagatgga aagaaagatt ctgtctctgc ccatgcactc accaagggaa ggccacatgg 154321 gcacacaatg acaggcagcc acctgcaagc cagggagagg gtccctacca gaatgtgacc 154381 atgctggcac cctgatccca gacttccatc ctccagaatg gtgagaaaat aaatgccggc 154441 tgttgaagcc acccagcctg ctgtggtatt ttgttagggc agcccaagca gaccatgaca 154501 gcccgccaaa tccgggtctt tctctctgct cattctgtaa cccactgcct gtcaactgtg 154561 tcttcaccaa tagtcattcc gtcactggtg aagaaggtgt cacctggtca gggcccacgt 154621 gtattttcaa aagataaaga gacagcaatg ttttctcact tattttcttc ctcttttccc 154681 aggagtctat tcacttcgta acgcctgtct aactgagcag ccaaatttag cctgccgcca 154741 gcaatggcag cctcctcagc cctgccccag agaggaaaac tgagagacac cagcctctgc 154801 ctgaaactgt cttgctgagg ggaggtttga gaacgctgtc ttgtaaagtg gaagagatta 154861 ggggtttcaa agaatagtgg tcttcaggcc aggcacagtg gctcacacct gtaattccag 154921 cactttggga ggctgaggtg ggcggatcac ttgaggtcag gagttcgaga ccagcctggc 154981 caacatggtg aaacctcgtc tctactaaaa atttaaaatt tagctgggtg tggtggtgtg 155041 cacctgtaat tctagctact caggaggctg agacaggaga attgcttgaa cccaggaggt 155101 ggaggttgcg gtgagccaag atcacgccac tgtactctag cgtggcgaca cagcgagaca 155161 ccatcacaaa taaaaataaa agaataatgg tcttcaaatg gaggtataag aacacttcct 155221 cttcagtaca agggcaccaa cagtttgaaa ggaattgatt tccaggcccg cttttctgca 155281 actgatctgc ctgagccctt gcctgcgagg gaggggcagg gtcttacttt ccccagtagc 155341 ccttttctac tttataaaaa gaagaggaca ccccttaccc atcctaatct taccatggca 155401 tgtttcctgg ggcaccaaac ccaatcctgg tattagtgct gaaccaacat ataaccacaa 155461 ggactgagta aaatttgctt ttgcaaagtc aggggctttc caacattttt cctttccctc 155521 aagcctaagg agatctcatt gaattgcatg tggatagagc attaaaaatt atttttgacg 155581 ataaatcagc atagggtttt tggctcagaa tgagctcaaa gaattaactg atagtacggt 155641 aatacaatta tttccatttc tatctacttt ttaatttttt ggagacaggg tttcactctg 155701 tcttccaggc tagagtgcag tggcacaatc gtggttcact gcagcctcaa acaactgggc 155761 aatggtgcaa tcgcagctca gctcactgca gcctggacct cctgggttca aggagctccc 155821 acctcagcct ccccagtagc tgggaccaca ggcacgtgcc accacgcctg gctaattttt 155881 gtatttttta gagacaggat ttcaccatgt tgcccaggct ggtctcgaac ccctggactc 155941 taattatcca cccgccttgg cctcccaaag tgctgggatt acagacgtga accaccaagc 156001 ctggctctac tttttataca aacaggtttc ctctgcagtg tcatggagaa acagaattga 156061 ttctagcagt gagtaggaac caaacctaga cacataaact aactggagaa aaaggccaac 156121 tgtcccatta aggaagatat ttctaactta aatctaactc cctatttaat aggacttatt 156181 cattggaaat acatattgtt gttttggcca atttgtatta ctactactga tgacaacttc 156241 atcagaagaa atgattaaac gcttgttcaa tggtcacagg aaataaaaat atcaatatag 156301 gtctatactt tttgtgcagt atgatagggt gaccagcaaa agactttcaa ggataaaaat 156361 atatgtgagg aaaagctgtg tgggaagtgg aatggaaatt caaatttaga aaaaaaaatg 156421 atataacatt tcttatgttt caaggagagc ttgtccaggt attattttaa tggatgatgg 156481 caggaatcaa acacgatgag attcctttgt ataccatcaa aaaaaataat aatgtaacag 156541 gtttctgtgc atgcgtaggt tacactcata tatacacata catctataca catatttaag 156601 gacctattat ttaccctcta tagtttatat aagtatatat tttatattgt attatatatt 156661 tatacttttc atatttaata ttgtttatgt aatatgtgaa acaatatgta atatatacat 156721 ttatatttta tcttttattt taattttttt tttgagaagg agtttcactc tgttgcccag 156781 gctggagtgc agtggcgcaa ccttggctca ctgcaacctc tgcctcccgg gttcaagcaa 156841 ttttcctgcc ttagcctcct gagtagctgg gagtacaggt gcctgccacc acaaccagct 156901 aatttttttt ttgtattttt agtagaggcg gggtttcacc atgttggcca ggctggtctg 156961 gaactcctga cctcaaatga tccacccacc tcggcctccc aaagtgctgg gattacaggc 157021 atgagccacc tcacctggcc tacatatata atttatataa catacagcct taatatcaat 157081 acatatgtat actatatata tatgtgtgtt tatatacgcc ccaacatata tatattcatg 157141 ttaaggcttt atatttaggt atgtgtattt agatattttt tattatgtat acatatactt 157201 atctattcat atgcatatat gcatttgtat ttatgctaaa gctttatata atacatatat 157261 tgtgtgtata tgtgtgtgtg tatatatata tataaaacat aaagctcata tacataaagc 157321 ctcaacatga atatgctctg attgtgatga gattatacag ctgtatacaa tgaccaaaat 157381 tatcaaatta tacacttcaa attggtagac tttattgtat gtaaacaata gaaacaaaca 157441 atcacacctg taatcccagc actttgggag gctgaggcgg gcggatcacg aagtcaggag 157501 atcgagacca tcctggctaa cacgatgaaa ccccgtctct actaaaaata caaaaaatta 157561 gcctggcgtg gtggcaggca cctgtagtcc cagcgacttg ggaggctgag gcagaagaat 157621 agcgtgaacc cgggaggcgg agcttgcagt gagcagagat cgcgccactg cactccagcc 157681 tgggcaacag agcaagactc tgtctcaaaa aaaaaaaaaa aaaagaaacg aacaaaagag 157741 aggaaaactt tccccattaa aatagcaata gcaaaaaaaa aaaaaaaaaa aaaaaaaaag 157801 ccaaaaatcg gaatagaggg ctatttcctt agcatgggat aagtaagtaa tattgtacgt 157861 gcctatgtga ggcacacaga atagtgagaa tcaaaggcag agagtggagt gggagttgcc 157921 gggggatggg gaatggagag ttagtattta gtgggtacag agtttcagtt ttacaagatg 157981 aaaagagttc tagagaagga tagtggtgat ggttgcacaa gattatgaat gtatttaata 158041 ccactgaact gtacacttaa aagtgattaa gatgataaat tgtgttatgt atattttaac 158101 acaataaaaa ttgggagtgt gtgtatgtgt atatatatat gtctgtgtgt acacacacac 158161 atatatataa ttgggagtgt gtgtgtatat acagtatgtg tatgtttgta tgagagctta 158221 acgtacatac acttgtgtac atgtctacct aacaactttt tttttttttt ggagacaagg 158281 tctcactgct ctgtcgcccg ggctggagtg ccgcagtgca atcacagctc actgcagcct 158341 caacctccct agctcaagca atcctcccac ctcagccttg taagtagctg gtactacagg 158401 tgtacaccac tacactgggc taatttttta aattttctgt agtgatgagg tcttggtatg 158461 ttacccaggc tggtctcaaa ctcctggcct caaccgatct tcctgccttg gcctcccaaa 158521 gcactgggat tacaggcatg agccgctgta cccggcccaa ctttattttt taaactaagt 158581 tgagtgtcaa tattgacaat attctgtaaa acatatcctt acaactattt aaacgtatag 158641 taaaatgttg catgtagatt gtcaacatgc gagggggcat gcaattttac aaagttcttt 158701 caggggatat tcaagccaaa gagtgtgaaa acccctggac ccccaggcag aattagacac 158761 aggggagact ccagtacagt ggcaactgag acaacaaaga aacactgagg acattttcac 158821 taccaggata taggcaaacg aaactgcaat gatgtcatgt ttgcatatgt ggcagataca 158881 aaaagcttaa aagcagctct ttgttctctt gctgagtttg gggcaggcac tggcacaaat 158941 tgaggaaagt aagtgacagg accggcagca attagacttg ctgatgttgg ggcgaccctg 159001 gggttgcatc tgggaaaccg acacccggat ccaggataga agctgacata gaagtaagca 159061 aaactgctgt aggccccggt caagggctct cctctcagga ttcctcccat aactacctga 159121 aacaaggatt tggaatacct tgactttgga gagagaaatc gaaatcagtt caactgaact 159181 ctaatcaggc gtgagaatcc tcttgtcatt caagtttaat tggcttaatc tcccaaatga 159241 tactgaaggc agtagtagtt cttatgctcc tagggtgcag tattatatta tttataaatg 159301 cagacacttc aaaagcaata aaacacttgg cccttgctct caataaactt gccctctaac 159361 tgggaggaca gcatccaaat ggaaaaaaaa aaaaatgaag aacagttcaa agcaacatat 159421 aagaagtatg taataatccc ccaagagaaa caaagactgc attgcatact ttcccagtag 159481 aagtacaaat tggcacagca ccccatggag ggaagtgggc cacagagatc agaattacaa 159541 atgagtatct cctttgacct ggtaatttaa cttctgggaa tttatccttc agccgtactt 159601 aggaaataac atatactcta agttactcac tgtagcattg ttcaaaataa caaaagattg 159661 gaaagaaggc aaatatcctt gagtagaaga ctgatgaaat acattgtgct acatacatac 159721 aatggaatat ttcgaaggta taaaagtgca tgaggagggc cgggtgcagt ggctcatgcc 159781 tataatccca gcactttggg aggctgaggt gggtggatca cttgaggttg ggagttcaag 159841 acaagcctga aaaacacaac acaaccccat ctctactaaa aatacaaaaa ttagccaggc 159901 atggtggtgg gcacctgtaa tcccagctac tcaggaggct gaggcagaag aatcacttga 159961 acccaggagg cagaggttgc agtgagctga gattgtgcca ctgcactcca gcctgggcga 160021 cagagcgagc tcaaaaaaag agtgcatgag gaaactttca aggtacagat atttttaaag 160081 tctccaagat aagtgcgggg gcaggggggg aacagcaagg tacagaaaag gtgtataaga 160141 cacttccttt tgtttacaag gaagggaaaa aaagaatata gaatatattt ttatgtgctt 160201 tagtattcac aaataaagtc tagatgaata cacacagaaa tgaaaagctg attacctgga 160261 gtggattagg gagggtgaaa acagggtgga tggggctgag caggagggag acttctgctc 160321 catgaaccat gtgactgtgt tcctactcaa aacaattaag agaataatga aaaaatatcc 160381 cctgctgagg cctgacataa taagcaggaa gttggtttct gagggacccc cccacccacc 160441 gtccggtgtc aagcatatgc cctcagcttt ggctggctct gaacagcagg gaaaatgtga 160501 gagcaggacc acgtggcttc tgcacgggca gccctgtgtc caggcccctg cccagctgct 160561 gagcttcctg cccggtgccc ctgcatcagc cagagtccaa ccccaccctc tcagcctgcc 160621 ctcttgccag cgggctcaga atcagctgtc ctcaccagtt accagaatcc tcaagcagct 160681 ggctttaatt gtgtctatgg gaaggcagaa agaggaaggg aaggtcgatt aagtaaacct 160741 ctattaaggg aggagtgaag cccaggaggt caaagagccc aggatagaag caaggctagc 160801 tgccaagcca agcttggaac tctcccaaaa gataccacag agaaatatgc ccaaatgtga 160861 atgctactgg cttcaagttg tgtaataatg ggtaggtttt ttccccccgg gtctttatgc 160921 tttgatgtgc tttccaattt tttttttaaa taagcacaga tgactcttac aaagcaaaaa 160981 aatagagtgt acaatgtgaa agatgtatac attaaaaata aaaaccaaac catgattgtt 161041 accaaaccat gtagtccaga aaccttgaag gataaaaaag gaagctcaga tggacagcat 161101 aagaatgtta cagctctaaa caaaattaaa atattacaat aaaaaaaatg ttcccataat 161161 gctgaagatg tcattggaca gcaggtcagt ggggcccact tagtcgggcc aggcagagtg 161221 gagctgtcca aggtgccaga gtaagaaagg gcagtggatg cagagatgac tgcgttactc 161281 agtgcactgg caaggccaat agctcctccc cagtcttcct cccactgagt ttaaaactct 161341 ctatccagca attcaaacca ctttcttcct tatacttgct aaagtccata atgagactgg 161401 gcacagtggc tcatgtctat aatttcagca ctttgggagg ccgaggcagg tggatcacct 161461 gaggtcagga gttcaagagc agcctggcca acatggcgaa acctccactt taccaaaaaa 161521 tacaaaaaaa aattagctgg gtgtggtggt ggtggtgggc gcctgtagtt ccacctactt 161581 gggaggctga ggtggaagaa tcacttgaac ccagaggcag aggctgcagt gagccaagat 161641 catgccactg cactccagcc ttggcaacag agtgagaccc tgtctcaaaa taaacaaaaa 161701 aaaagtaaga gagagagaga gtgtgaagaa agaaagaaag aaagaaagaa agaaagaaag 161761 accaaccata ataatggtca cattcatctc agaaacaaca aataattttt tagtcttcat 161821 caattttttt tctcagctct ttaggggtta tgaaaggagt aagcaaatat ttaaactatt 161881 tgaggaggtt ttaggcatat ttgaagctag caaagtttcc caccatttaa cacaaggctt 161941 tacatgaagt cagtaaaatt agatgcaaaa tcaagcccct gaatacttga aaaaatacag 162001 tagaccttga cgtgtgcaag gtatttatcc caaaaccttt cctaatccca aggttgggaa 162061 cagccctata gcaaaaaact tcccccttta ttagtcagga ctcttttgat tataaattat 162121 agaaactcaa atgacacaga ggggaatgaa ttggaggata aaatttaaaa aatagttgaa 162181 caggttgggc gcagtggctc atgcctataa tcccagcact ttgggaagct gagtcaggca 162241 gattacttga ggtcaggagt ttaaaaccag cctgggcaac aatggtgaaa tcctaaaaat 162301 acaaaaatta gccgggtgtg gtggctcacc tgtaatccca gctactcaag aggctgaggc 162361 aggagaatca cttgaacctc ccaggaggca gaggctgcag cgagccaaga tcatgccact 162421 gcaccccaga ctggatgacg ggagagaaat cttatctcaa aaaaaaaaaa tggttgaaca 162481 accttctgat tgctcacaga taataaatta taaattataa atgaccaggg tctagcatgc 162541 cacagagaaa ataagtttta atggcagttg cttccctgaa atggatttat tgtctaaaag 162601 gcagaaggtt ctcaatgatc ctgcatctgg actcatcttg acaccacctg ctctttctca 162661 cccacccatc atcaactaac tcctattatt tctaagccaa taataggtct ccaattagtc 162721 ccttcctctc tctcaactac tgtccttgtt caggccgcca tcatgaccag gttgaatcat 162781 tctgtaaata gcagattgag aaatgtgatg cctgggcttg ttagctaaat acctattaag 162841 aaagaatgat ttaggccagg tgcagtagct catgctacaa tcctagtact ttgggaggcc 162901 gaggctggtg gatcgcttga gcccaagagt tcaagacaag cctaggaaac atagcaaaac 162961 cttgtcctct actaaaagta caaaaaacta gccaggtgtg gtggcacaca cctgtggtcc 163021 cagctactcc agaggctgag gtgggaagat cgcctaagcc cagggaggtc aaagatgcag 163081 tgagctatga tcgtgccact gcactccagc ctgtgcaaca ggtgtgagac gctgtctcaa 163141 aaaaaaaaaa aaaaaaaagg aagattttta ttctcaaggt atattaaaga agactaggaa 163201 aatcacaaga gcatgggttt cagaatcaga tcgttccggc ttaaatgtag ctctatcact 163261 tactctatgg atgaccatgg caaagtattc aatctgagtt gactttctta taaaataggc 163321 ataataatat ttgtcttgca gaattttttt tctttcttct ttttcttaga cagagtgcct 163381 cactctgtca cctaggctgg tcttgaattc ctggactcaa gtgatcctcc caccttggcc 163441 tcccaaagtg ctaggattac aggtgtgagc cagcagggct ggctttgtga acttattatg 163501 aagattaaat caggtggaag atttttaaag tgctcaaaat attgagagaa tattcaatat 163561 atgctgctaa tatcagaggc ctcatgctaa ccttacaaaa gtcaataaac aaacacaagg 163621 taaatgatga gggtcagaaa aatacatcgg ccttactctt ctcaccttgc tttgcctccc 163681 aaacaaaggt ctgccaccat tttatttctc taagcccaaa aggtttgact aaataatagt 163741 tctctgtttg ccttgttagg cagtgtttga tgtggcacca ttacctgaag aatgaagtca 163801 agagtcattc ttggaagagg gttagaatgt ttgaatgttc aggtttgaat gtttgcagaa 163861 ttacaacaaa attggggtat gaaaaagaag atggggctcc agaaagtcaa acatctaaag 163921 tgtttgttct atattattat atgatataga ctgcaatgtg gatataataa tagaagatgg 163981 tattagagat gatattacaa tattgaacat ggattcaaca ataatatctt cctgaaagat 164041 tttttttaaa gctagactcc ccagcctggg caacatagta agaccccatc tttacaaaat 164101 ataaaaagtt ggctagaagt gatggtgagt agtcctagct actcaggtgg ccaaggtagg 164161 agaattgctt gagcccaaga ggttgaggcc gcagtgagct atgatgatgc cactgtactc 164221 cagcctgggc aacaaagcaa gatcctgtct ttaaaaaagc aaaacaaaaa caaacaaaca 164281 aacaaaaaga ataaaaccat tcagcacaga gtaaactcaa tgaaatcaac aaaatctcct 164341 aagaatctga aagccataca agtttctttt tcaccttgtt taataattct caaaaaccat 164401 gactggggaa accaattctg gtattaaaaa taaatactgc tttctccctt tttagctaaa 164461 ctttataaga ctcagcatct cagaaagacc ctcttatatt ctagagatat gctactgtct 164521 tcctagagag catcagcaaa caactaactt aaaatgtaat cagtgaaaaa atataaaaca 164581 tttccaaaag aaattttaac aagacccaaa taaattgaaa gacatcccat gttcatggat 164641 tggaagactt aatattgtta ggatgagaat actatccaaa gctttataca gatccaatgc 164701 aatccctatc aaaatctcaa gagcatcttt tgcagaaatg aaaaatccca ttctaaaatt 164761 cataaagaat taagagactc aaaatagcca aaaataatct tgaaaaagaa aaacaaagtt 164821 ggagggctca catgttctga tttcaaaacg tattacaaag ctacagtaat caaaaaagtg 164881 taatcaaaac agcactaagt gtggtgctgg cataaaaata gacatatcaa ccaatggaat 164941 aaaatttaga acccagaaat aaacccaaat gtctctagtc aattgatttc agcaagagtg 165001 tcaaggccac tcaatgggaa aaagagagtg ttttcaacaa atggtgctga aaaaactgga 165061 tatccacatg cgaaatgaag ttagaccctt accctatacc atatataaaa actaacagtg 165121 aatcaaaagc ctaaatttaa gaggcagaac tataaaactc ttaaaagaaa acatggggca 165181 aatctgcatg gtcttagatt aggcagtggt ttcttaagta tgacacttaa aaagcacagg 165241 taacaaaaga atatatagat aaactaaact ttttgaaaat aaaaaacttg tatgcatcaa 165301 tggacactat caagagagta aaaacacaat ccacagaatg ggagaaaata tgtataaatc 165361 atatatccta taagggtttg atgtccagaa tacgtaaaaa actcctacaa ctgaacaaca 165421 caaaaacaat cccattttaa aatgtgcaaa gggagggatt agcaggaagg aagaaatgaa 165481 taggatgagc acagaggatt tttagggcag taaaactatt ctatatgcta ctatcatgtg 165541 gattcatgtc attatacact catcaaaact tgcataccaa caccaagagt gacctctaac 165601 gtaaatatgc attctgggtg ctaatgatat gtcaatttgg ttaatcaatt gtattagatg 165661 taccactctg atgagggatg ttgaatgtgg gtcagcctat gcatgtgtgg aggtgagagg 165721 tatatgggaa ttctctactt tctgctcagt tttgctgtta acttaaaaac tactctaaaa 165781 aataatacag tggggagaaa aagaggacaa agagcttgaa cagacatttc tccaaagaag 165841 atatacaaat gaccaataaa cacaggaaaa gatgctcaac attgctaatc attaaggaaa 165901 tgcaaatgaa aaccataatg agatagcatt tcacacctaa gatggctata tatatatata 165961 tatggctata tataaatata tctatatatt ttttttgaga caggatctca ctttgtcgtc 166021 tgggctacag tgcagtggca cgatcatggc ttactgcagc ctccacctcc tggggtcaag 166081 tgatcctccc acctcagcct cttgagtagc tgagtccata ggcatgcacc accacagcca 166141 gataattttt ttttttgtag ctatggggcc tccctgtgtt gcgcaggctg gcctggaact 166201 cctgggctca agcaatcctc ccaccttggc ctccaaaaat gctgggttta caggcatgag 166261 ccacaacacc aggctataat ttttttttaa aggaaaatag caaatgtgga agaggatgtg 166321 gaaaaatggg aacccttgga cattgctggt gggaatgtag cgacgcaacc actgtggaaa 166381 acagcttggc agttcctcaa gaagttaaac atagaattac catatgatcc agcaacttca 166441 ctcctatgaa aacacccaga agaagtaaaa aggactcagg caaatacttg cataccaatg 166501 ttcattgagg tattattcac cagagccaaa agctagaaac aactgaaatg cccaacatgg 166561 gaagaaacaa aacgtggttc agtatacata cacacacaca cacacacaca cagacacaca 166621 cacacacaca cacaatggaa tattattcag ccgtcaaaat taagctctga tgcatgctac 166681 aatatggatg gaccttgaag acatgctaaa tgaaagaggc tagacacaaa aggaccatac 166741 tgtatgattc cacatatagg aagagacgca aattcgtaga tacagaagtc taatggtagt 166801 tgccagaagc tgggaggaga aaggaattgg gagttattaa ccttggttaa tgggaagaga 166861 gttttgtcag agtagtgatg cttgcacaga ttatgaatgt aatgaatgcc actgagttat 166921 acacaaaagt ggcttaagtg ggaaatttta tgttatatgt atttcaacac attttttaag 166981 agaaaagtaa tatgtgcaaa atgacctatg aatacaggaa ttagagactg ttgctggtca 167041 ggcatggtgg ctcatgctta taatcccagc actttggaag gctgaggcag gaggatcact 167101 tgagcccagg agtttgagat tagcctgggc aacataagga gagcatgtct ctacaaaaaa 167161 taaaaaatta gccgggtgtg gtggcatatg cctgtagtac tagttattct ggaacctgag 167221 gcgggaagat ttcctgagcc taggagttcg aggctgcagt gagtcatgat agtgccactg 167281 cactccagcg ttggggacaa agttagaccc tgtctttgaa aaaaacagaa gaaactgttc 167341 tga // LOCUS HSU96759 1783 bp mRNA PRI 02-JAN-1998 DEFINITION Homo sapiens von Hippel-Lindau binding protein (VBP-1) mRNA, complete cds. ACCESSION U96759 NID g2738243 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1783) AUTHORS Brinke,A., Green,P.M. and Giannelli,F. TITLE Characterisation of the gene and transcript for the von Hippel-Lindau binding protein (VBP-1) and isolation of the highly conserved murine homologue JOURNAL Unpublished REFERENCE 2 (bases 1 to 1783) AUTHORS Brinke,A., Green,P.M. and Giannelli,F. TITLE Direct Submission JOURNAL Submitted (09-APR-1997) Division of Medical and Molecular Genetics, United Medical and Dental Schools of Guy's and St. Thomas' Hospitals, 8th floor, Guy's Tower, London SE1 9RT, UK FEATURES Location/Qualifiers source 1..1783 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /map="Xq28" /clone_lib="Burkitt's lymphoma library constructed by Sideras et al. 1992 (J. Immunology 149:244-252)" gene 319..801 /gene="VBP-1" CDS 319..801 /gene="VBP-1" /codon_start=1 /product="von Hippel-Lindau binding protein" /db_xref="PID:g2738244" /translation="MKQPGNETADTVLKKLDEQYQKYKFMELNLAQKKRRLKGQIPEI KQTLEILKYMQKKKESTNSMETRFLLADNLYCKASVPPTDKVCLWLGANVMLEYDIDE AQALLEKNLSTATKNLDSLEEDLDFLRDQFTTTEVNMARVYNWDVKRRNKDDSTKNKA " BASE COUNT 574 a 307 c 383 g 513 t 6 others ORIGIN 1 acgcttgtca nttacgtgag cgcagccaat cagcactaga ggttgggata tntcggncca 61 aaggagaacg grgacttgtg gggaygctcc tgcgcaccaa tgaatgtgca tggagatgga 121 gaggcgggcc tgcaagtgcg aacaagccaa tcacggaatc ccggcggccg gcggcccggg 181 aggcagtcgc gcgctcgcat ccccaagatg gcggccgtta aggacagttg tggcaaagga 241 gaaatggcca cagggaatgg gcggcggctc cacctgggga ttcctgaggc cgtgtttgtg 301 gaagatgtag attccttcat gaaacagcct gggaatgaga ctgcagatac agtattaaag 361 aagctggatg aacagtacca gaagtataag tttatggaac tcaaccttgc tcaaaagaaa 421 agaaggctaa aaggtcagat tcctgaaatt aaacagactt tggaaattct aaaatacatg 481 cagaagaaaa aagagtccac caactcaatg gagaccagat tcttgctggc agataacctg 541 tattgcaaag cttcagttcc tcctaccgat aaagtgtgtc tgtggttggg ggctaatgta 601 atgcttgaat atgatattga tgaagctcag gcattgttgg aaaagaattt atcgactgcc 661 acaaagaatc ttgattccct ggaggaagac cttgactttc ttcgagatca atttactacc 721 acagaagtca atatggccag ggtttataat tgggatgtaa aaagaagaaa caaggatgac 781 tctaccaaga acaaagcata atgctggcaa ttaaaaatgt ggtttagttt tccaaacatg 841 ttatcttaaa taccccttta tccttacagg ttgacataac tttgaatgtt ttaacagcaa 901 gaattttaag aaaagataaa caccatttta tttatttata aaaacaaaat tagtttcaaa 961 tatttttgac attgtgattt ttttttccac atttctcagc aaagctaatg gtattttaat 1021 cattattttt gcctgtcata agaaaactct tagctgaaat ggccgaaaac tgtgagacat 1081 gctatggaag ctgaatgccg gacgctagca cagtttactt tttccctttc taattggctg 1141 atgttactct cacttgatgt ggttaaacca tgttagaggt agagaagaca gacagtttga 1201 atattagtaa acttgttatt ctttagtata tttaggactt agtggtwctc tgttgctatt 1261 gtcttctata agtggagttt catgacttac tgcttaacga ataactaagt actatgatat 1321 tctggacatt ttaggaaatg gtaatttgcc ttgctacaca ttaagagggc tattaagact 1381 acattttttc taacctcaga taagtgcagt gtctttgcaa tgccaacata agggagatct 1441 tggccaacgt gaaataaaat tactcattca aaactctgcc taaggtgatt ttgtagttct 1501 taacagttct ccagagcatc ttgaacagga atattaagat aaatgtgaat ctgcaatggc 1561 tgaaaagagt tgtgagcttt tttattcatg ataaaacctt ataggaatag tataaaaaat 1621 ccctgtggaa agctactagt acattgacca gcgctgggtg atacagattc tgataaaaac 1681 ataaatgtat tagttcatct ccatgtagta aaaagtatac ttatacaatg ttttgtactt 1741 gtatttcatg aaattaaaac agtgatgcta aaactaaaaa aaa // LOCUS HSU96915 736 bp mRNA PRI 25-MAY-1997 DEFINITION Homo sapiens sin3 associated polypeptide p18 (SAP18) mRNA, complete cds. ACCESSION U96915 NID g2108209 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 736) AUTHORS Zhang,Y., Iratni,R., Erdjument-Bromage,H., Tempst,P. and Reinberg,D. TITLE Histone deacetylases and SAP18, a novel polypeptide, are components of a human Sin3 complex JOURNAL Cell 89 (3), 357-364 (1997) MEDLINE 97294379 REFERENCE 2 (bases 1 to 736) AUTHORS Zhang,Y. TITLE Direct Submission JOURNAL Submitted (09-APR-1997) Biochemistry, HHMI/RWJMS/UMDNJ, 663 Hoes Lane, Piscataway, NJ 08854, USA FEATURES Location/Qualifiers source 1..736 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="senescent fibroblasts NbHSF" gene 17..478 /gene="SAP18" CDS 17..478 /gene="SAP18" /note="SAP18p" /codon_start=1 /product="sin3 associated polypeptide p18" /db_xref="PID:g2108210" /translation="MAVESRVTQEEIKKEPEKPIDREKTCPLLLRVFTTNNGRHHRMD EFSRGNVPSSELQIYTWMDATLKELTSLVKEVYPEARKKGTHFNFAIVFTDVKRPGYR VKEIGSTMSGRKGTDDSMTLQSQKFQIGDYLDIAITPPNRAPPPSGRMRPY" BASE COUNT 223 a 152 c 162 g 199 t ORIGIN 1 ggaacgaggg aggaagatgg cggtggagtc gcgcgttacc caggaggaaa ttaagaagga 61 gccagagaaa ccgatcgacc gcgagaagac atgcccactg ttgctacggg tcttcaccac 121 caataacggc cgccaccacc gaatggacga gttctcccgg ggaaatgtac cgtccagcga 181 gttgcagatc tacacttgga tggatgcaac tttgaaagaa ctgacaagct tagtaaaaga 241 agtctaccca gaagctagaa agaagggcac tcacttcaat tttgcaatcg tttttacaga 301 tgttaaaaga cctggctatc gagttaagga gattggcagc accatgtctg gcagaaaggg 361 gactgatgat tccatgaccc tgcagtcgca gaagttccag ataggagatt acttggacat 421 agcaattacc cctccaaatc gggcaccacc tccttcaggg cgcatgagac catattaaat 481 tctatttact atttgttgaa tttatttttc cgtcagttat gtaaaataaa catactcttc 541 ttcctcccct gattattgcc attaagcctt taaattctaa acaaattata atgcatcatc 601 tatttaggag ttagatttgg atgtgctatt gtatgattac gaatagtctg tatgtttcaa 661 gcccttctgt aaaatatgaa gaaaagtctc ttagcattct gtgtaaaact gtactgttaa 721 atatatgtgt gtaatc // LOCUS HSU96922 2937 bp mRNA PRI 07-OCT-1997 DEFINITION Homo sapiens inositol polyphosphate 4-phosphatase type II-alpha mRNA, complete cds. ACCESSION U96922 NID g2232036 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2937) AUTHORS Norris,F.A., Atkins,R.C. and Majerus,P.W. TITLE The cDNA cloning and characterization of inositol polyphosphate 4-phosphatase type II. Evidence for conserved alternative splicing in the 4-phosphatase family JOURNAL J. Biol. Chem. 272 (38), 23859-23864 (1997) MEDLINE 97442457 REFERENCE 2 (bases 1 to 2937) AUTHORS Norris,F.A., Atkins,R.C. and Majerus,P.W. TITLE Direct Submission JOURNAL Submitted (10-APR-1997) Hematology, Washington University, 660 S. Euclid Box 8125, St. Louis, MO 63110, USA FEATURES Location/Qualifiers source 1..2937 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" CDS 122..2896 /codon_start=1 /product="inositol polyphosphate 4-phosphatase type II-alpha" /db_xref="PID:g2232037" /translation="MEIKEEGASEEGQHFLPTAQANDPGDCQFTSIQKTPNEPQLEFI LACKDLVAPVRDRKLNTLVQISVIHPVEQSLTRYSSTEIVEGTRDPLFLTGVTFPSEY PIYEETKIKLTVYDVKDKSHDTVRTSVLPEHKDPPPEVGRSFLGYASFKVGELLKSKE QLLVLSLRTSDGGKVVGTIEVSVVKMGEIEDGEADHITTDVQGQKCALVCECTAPESV SGKDNLPFLNSVLKNPVCKLYRFPTSDNKWMRIREQMSESILSFHIPKELISLHIKED LCRNQEIKELGELSPHWDNLRKNVLTHCDQMVNMFQDILTELAKETGSSFKSSSSKGE KTLEFVPINLHLQRMQVHSPHLKDALYDVITVGAPAAHFQGFKNGGLRKLLHRFETER RNTGYQFIYYSPENTAKAKEVLSNINQLQPLIATHADLLLNSASQHSPDSLKNSLKML SEKTELFVHAFKDQLVRSALLALYTARPGGILKKPPSPKSSTEESSPQDQPPVMRGQD SIPHHSDYDEEEWDRVWANVGKSLNCIIAMVDKLIERDGGSEGSGGNNDGEKEPSLTD AIPSHPREDWYEQLYPLILTLKDCMGEVVNRAKQSLTFVLLQELAYSLPQCLMLTLRR DIVFSQALAGLVCGFIIKLQTSLYDPGFLQQLHTVGLIVQYEGLLSTYSDEIGMLEDM PVGISDLKKVAFKIIEAKSNDVLPVITGRREHYVVEVKLPARMFESLPLQIKEGQLLH VYPVLFNVGINEQQTLAERFGDVSLQESINQENFELLQEYYKIFMEKMPPDYISHFQE QNDLKALLENLLQNIQSKKRKNVEIMWLAATICRKLNGIRFTCCKSAKDRTSMSVTLE QCSILRDEHQLHKDFFIRALDCMRREGCRIENVLKNIKCRKYAFNMLQLMAFPKYYRP PEGTYGKADT" BASE COUNT 946 a 606 c 663 g 722 t ORIGIN 1 caatttcaga gtgggatatc agatctttag tgtgaagata catctacatt aaaccaggaa 61 tcactagaac tgacatttgg acaagaaaat ttggaaaatt ttaaaactgt gaaggttgat 121 catggaaatt aaagaggaag gggcatcaga agaagggcag cactttcttc ctacagccca 181 ggccaatgat cccggggact gtcagttcac aagtatccag aagactccaa atgaaccgca 241 gttggaattc atccttgcat gcaaggatct cgtggctcct gtccgtgatc gtaaactgaa 301 tacactggtg cagatctccg taatccaccc cgtggagcag agtctgacaa gatactccag 361 caccgaaatt gtggagggaa caagggaccc actgtttttg actggtgtca cattcccatc 421 tgagtatccc atctatgagg agaccaaaat aaaactaaca gtctatgatg tcaaggataa 481 gtctcatgac accgttcgaa ccagtgtcct accagaacat aaggatcccc cgccagaagt 541 tggccgaagt ttcttgggct atgccagttt taaagtggga gagctgctga agtcaaagga 601 gcaattgctg gtcctgagcc tgagaacttc agatggtggc aaagtggttg gcaccataga 661 agtcagtgtc gtgaagatgg gggagattga ggatggggaa gccgaccaca tcaccacaga 721 tgtacaggga caaaagtgtg ccctggtatg tgaatgtaca gccccggaaa gtgtgagcgg 781 aaaagataac ttaccttttt tgaattcagt gttaaagaac ccagtatgta aattatatag 841 atttcccaca tctgacaata agtggatgcg aattcgagag cagatgtcag agagcattct 901 ttcctttcat attcctaagg aattgatttc ccttcacatt aaagaagatt tgtgcagaaa 961 ccaggagata aaagaacttg gtgagctttc tccacattgg gacaatctgc gaaaaaatgt 1021 ccttacgcac tgtgatcaaa tggtgaatat gttccaagac attctgacag aacttgccaa 1081 ggaaacaggg tcctctttca aatcaagcag cagcaaagga gagaaaacat tagaatttgt 1141 tccaataaat ctacatctgc aaagaatgca ggtacacagc cctcacttga aagatgctct 1201 ctacgatgtc atcactgtgg gagccccagc tgcccatttt cagggattta agaatggtgg 1261 tcttcggaag ctactccata gatttgaaac agaaagaaga aataccggat accagtttat 1321 ttactattca cctgaaaaca cagccaaagc aaaggaagtt ctcagcaaca tcaatcaact 1381 acaacctctt atagcaaccc atgcagacct actgcttaat tctgcaagcc agcattctcc 1441 agacagcttg aagaattctt taaagatgct ttcagaaaaa acagagcttt ttgtacatgc 1501 cttcaaggat caacttgtca ggagtgctct tttagcactc tacactgcaa ggccaggagg 1561 cattcttaag aagccaccct ctcctaagag cagcacagag gagagcagtc cccaagacca 1621 acccccagtg atgagagggc aggactccat accacatcat tcagactatg atgaggaaga 1681 gtgggacagg gtgtgggcca atgtggggaa gagcctgaac tgcattattg ctatggtgga 1741 caaactgatt gaaagagatg gtggcagtga aggcagtggt ggcaacaatg atggagaaaa 1801 ggaaccttca ttaacagatg ccattccctc tcacccaaga gaggactggt atgaacagtt 1861 gtatcccctc atccttaccc tgaaggactg catgggagaa gtggtgaacc gagccaagca 1921 gtccctgaca tttgtgctcc ttcaggaact tgcgtacagc ttgccccagt gtctgatgct 1981 gacgctaaga agagacatcg tcttcagcca agcacttgct ggattggttt gtggttttat 2041 catcaaatta cagacaagtc tgtatgaccc aggcttccta cagcagcttc acacagtggg 2101 gttgatagta caatatgaag gattgttaag tacatacagc gatgaaattg gaatgttaga 2161 ggacatgccc gttggcattt ccgatttaaa gaaagttgca tttaaaataa ttgaagccaa 2221 atccaatgat gtattgccag ttataacagg aagacgagaa cattacgtgg tagaggtcaa 2281 gcttccagcc agaatgtttg agtcactacc tctacagatt aaagaaggac agttgcttca 2341 tgtgtatcca gtacttttta atgttggaat caatgaacag caaactctgg ctgaaaggtt 2401 tggagatgtc tctttgcaag aaagtattaa tcaggaaaac ttcgaacttc tacaagaata 2461 ttacaagata tttatggaaa agatgcctcc tgattatatt tcacattttc aggaacaaaa 2521 tgatttaaaa gcattgctag aaaatctcct tcaaaatatc caatccaaaa aaagaaagaa 2581 tgtagaaatt atgtggctgg ctgcaacgat ttgccgcaaa ctgaatggta ttcgtttcac 2641 ctgttgtaaa agtgccaaag acaggacatc gatgtcagtg acacttgaac aatgctcaat 2701 cttgagagat gagcaccagt tacacaagga cttctttatc cgagcgctgg attgcatgag 2761 aagagaagga tgccgcatag agaatgtact gaagaatatc aaatgcagaa agtatgcttt 2821 caacatgcta cagctgatgg ctttccccaa gtactacaga cctccagagg ggacttatgg 2881 aaaagctgac acctaagttt accaacatgt taataaacag gaacacaaat acatttc // LOCUS HSU97018 3962 bp mRNA PRI 18-MAY-1997 DEFINITION Homo sapiens echinoderm microtubule-associated protein homolog HuEMAP mRNA, complete cds. ACCESSION U97018 NID g2104768 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3962) AUTHORS Eudy,J.D., Ma-Edmonds,M., Yao,S., Talmadge,C., Kelley,P.M., Weston,M.D., Kimberling,W.J. and Sumegi,J. TITLE Isolation of a novel human homologue of the gene coding for echinoderm microtubule associated protein EMAP from the Usher syndrome type 1a locus at 14q32 JOURNAL Genomics (1997) In press REFERENCE 2 (bases 1 to 3962) AUTHORS Eudy,J.D., Ma-Edmonds,M., Yao,S. and Sumegi,J. TITLE Direct Submission JOURNAL Submitted (09-APR-1997) Pathology/Microbiology, University of Nebraska Medical Center, 600 South 42nd Street, Omaha, NE 68198-5660, USA FEATURES Location/Qualifiers source 1..3962 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="14q32" /map="D14S78-D14S250" CDS 363..2516 /note="Usher syndrome type 1a candidate; similar to the sea urchin 77-kDa echinoderm microtubule-associated protein" /codon_start=1 /product="HuEMAP" /db_xref="PID:g2104769" /translation="MALCYQRYLLALYHPPPGSGKILLCQQPKVTSRGPALLNECLLG VEGKAMGIPEETGIAQAPPAALPVAKKNSESKPKEPVFSAEEGYVKLFLRGRPVTMYM PKDQVDSYSLEAKVELPTKRLKLEWVYGYRGRDCRNNLYLLPTGETVYFIASVVVLYN VEEQLQRHYAGHNDDVKCLAVHPDRITIATGQVAGTSKDGKQLPPHVRIWDSVTLNTL HVIGIGFFDRAVTCIAFSKSNGGTNLCAVDDSNDHVLSVWDWQKEEKLADVKCSNEAV FAADFHPTDTNIIVTCGEITSLLLDTRRKLPLIRSKDYSRTRKAKVVLCVTFSENGDT ITGDSSGNILVWGKGTNRISYAVQGAHEGGISPLCMLRDGTLVSGGGKDRKLISWSGN YQKLRKTEIPEQFGPIRTVAEGKGDVILIGTTRNFVLQGTLSGDFTPITQGHTDELWG LAIHASKPQFLTCGHDKHATLWDAVGHRPVWDKIIEDPAQSSGFHPSGSVVAVGTLTG RWFVFDTETKDLVTVHTDGNEQLSVMRYSPDGNFLAIGSHDNCIYIYGVSDNGRKYTR VGKCSGHSSFITHLDWSVNSQFLVSNSGDYEILYWVPSACKQVVSVETTRDIEWATYT CTLGFHVFGVWPEGSDGTDINAVCRAHEKKLLSTGDDFGKVHLFSYPCSQFRAPSHIY GGHSSHVTNVDFLCEDSHLISTGGKDTSIMQWRVI" BASE COUNT 1090 a 910 c 962 g 1000 t ORIGIN 1 aggaattccg gtgagctgag cgcggcgcgc ggccgggccg gggagcgggc gcgccggcgg 61 cctcagcatg gaggacggct tctccagcta cagcagcctg tacgacacgt cctcgctgct 121 ccagttctgc aacgatgaca gcgcttctgc tgcaagtagc atggaggtga cagaccgcat 181 tgcttcactg gagcagagag tccagatgca agaagacgac atccagctgc tcaaatcagc 241 tctagctgat gtggttcggc ggctgaacat tactgaggaa cagcaggccg tgcttaacag 301 gaaaggacct accaaagcaa gaccactgat gcagaccctg ccttttagat ccacggtcaa 361 caatggcact gtgttaccaa agatacctac tggctctcta ccatccccct ccgggttcag 421 gaaagatact gctgtgccag caaccaaaag taacatcaag aggaccagct cttctgaacg 481 agtgtctcct gggggtcgaa gggaaagcaa tggggattcc agaggaaacc ggaatcgcac 541 aggctccacc agcagctctt ccagtggcaa aaaagaacag tgaaagcaaa cccaaggagc 601 ctgtattcag tgcagaagaa ggctatgtaa aattgtttct tcgtggacgc cctgttacca 661 tgtacatgcc caaagatcaa gtggattctt acagcttgga agcaaaagta gaacttccaa 721 ccaagagact caagctggaa tgggtctatg ggtacagggg tcgagactgc cgtaacaacc 781 tgtacttgct tccgacggga gagaccgtct acttcatcgc atccgtggtg gtgttataca 841 acgtggagga gcaactgcag aggcattacg ctggccacaa cgatgacgtg aagtgcctag 901 cagttcatcc tgatcggatc acgatagcaa caggacaagt tgcgggcaca tcgaaggatg 961 gaaaacaatt gcccccacat gtgcgcatct gggattctgt gacattgaat actctccacg 1021 tcattggaat aggttttttt gaccgagcag tcacctgtat tgcattctca aaatctaatg 1081 gaggaaccaa tctctgtgct gtggatgact ccaacgacca tgtgctctct gtatgggact 1141 ggcagaaaga agaaaaacta gcagatgtga agtgctctaa tgaagctgtg tttgctgcgg 1201 atttccaccc cacggacacc aacatcatag ttacttgtgg agaaatcaca tctctacttt 1261 tggacactag aaggaagctc ccattaataa gaagcaagga ttattcgaga acaagaaaag 1321 ccaaagttgt cctctgtgtg actttctctg aaaacggtga caccattact ggagattcaa 1381 gtggcaacat cttagtatgg ggaaaaggta caaatcgaat aagctatgca gttcaggggg 1441 cccatgaggg tggcatttct ccactttgta tgttaagaga tggcacactg gtgtcgggag 1501 gtgggaaaga ccgaaagctc atttcttgga gcggaaacta tcaaaaactt cgtaaaacgg 1561 agattccaga acagtttggt ccaatacgga cagtggccga ggggaaaggc gatgtgatct 1621 tgattggcac aactcgaaac tttgtcctgc agggcactct gtcaggggac ttcacaccca 1681 ttactcaggg tcacactgat gagctctggg gactggccat ccatgcctca aaacctcagt 1741 tcttgacctg tgggcatgac aagcatgcca ctctctggga cgctgtgggt caccgtcccg 1801 tctgggacaa aataatagag gatccagctc agtcttctgg ttttcatcct tcagggtctg 1861 tggttgcagt cggaacactc actgggaggt ggtttgtgtt tgacacagaa acaaaagact 1921 tggtcaccgt tcacacagat ggaaacgaac agctctctgt aatgcgatac tcaccagatg 1981 ggaatttctt agccataggc tcacatgaca actgcatcta tatatatggc gttagtgaca 2041 acgggaggaa gtacacgcga gtgggcaagt gctcgggtca ttccagcttc attactcacc 2101 tggactggtc tgtaaactca cagttcctcg tgtcaaattc cggagactac gaaatcctct 2161 actgggttcc ctctgcctgt aagcaagtcg taagtgtgga aactacaaga gacattgaat 2221 gggctaccta tacctgcact ttgggattcc atgtttttgg agtgtggcca gaaggctcgg 2281 acggaaccga catcaatgcc gtctgtcggg cccatgagaa gaaactcctg tcaacaggcg 2341 acgactttgg caaagtgcac ctcttctcat acccctgctc gcagttcagg gctccaagcc 2401 acatctacgg cgggcacagc agccatgtca ccaatgtcga tttcctctgt gaagacagcc 2461 acctcatctc cacgggcggg aaagacacaa gcatcatgca gtggcgcgtc atttagtacc 2521 caccgagagc tgtggggagc agcatgggca aggaagacac agactcgcat tacccttggt 2581 cactgtgatt tctgttttgt ttaaaaaatt cttacaaacc tcaggaaaac tgtgccctcc 2641 gccggctacc ttagcttagc gtgtcagcgg gcgccacagc ggaatcagcg gttccgtgtt 2701 cacttttgtt gtacaatata tgacacagtg cacattgaat accaacaagg ttgcaacgtt 2761 tacattatag ccacatcaac agaagtaact gggtatattc ttagtaactt ttctatggaa 2821 ctcttcaaaa atgggtcaca ggatggcctt ttaaaacatt gtatattatc ttcactgttt 2881 tcacctttta ggttgctaag ttcaatattt gtgatgataa tgaggtactg aaccacgatg 2941 gctgttgagg aattggtcct aaaaggacag atcacttcag aagagtgaat aactgatttg 3001 cacagctgaa tcaggagaca caaagatgag actgtgtttg gttacatttt ccaaagtttc 3061 attgcattct cccttgggga ggctgtgaga gagggcttgt atccctcttg tgctaagcag 3121 actctactcc taactgactt caatatttca gcagggtaca caggcgtttc caagtttcag 3181 tgacaccgtc ctgcctaacc agatgcggtc agcctcttca cacccacctg gcttgcatcc 3241 cccatccctt gttcacacgc cctgattcac ggtgagacat tttgccacct tcttgtgtat 3301 attacttggc atgagatgat attgtacttg tataggattc tagcaattca taataaatat 3361 gtaagactag gctttactgt cttatgctta tggacattgt atatttgtat tttatgacca 3421 agtagaccaa gtcagaaaga tctctctcga gcgcaccata aacctgcaga gagaagtctc 3481 gaaaggctcc accaaggtac caagggcagc tgcttttcct gtcttttgtg catgggcgac 3541 ccattacagt atgagataag attgagttct gatgcgttaa acggaggtgg cagaaatttg 3601 tcaagaaggc cttatccatt tcgattgtgt gacagattga aatttattgt ttacattggg 3661 gaatgtatct caaattttta aatagaagag taataaacag actttaaagc aaatattaag 3721 atttttactc attcaaggca agtaaatgaa tggaattatc tgagctctat ggcactggtt 3781 gtttagagtg actgatgaag tgcacctttc aaaaacattt ttgatgccat caccagccta 3841 ctgcagaagt gcagggcaca gtaaacacca tgtattattg aagatgatct gttttgtatg 3901 tatccttgtc aaatatattc tataatggaa taaaaaatcc tggaaagtgg gggtttcctt 3961 aa // LOCUS HSU97188 4181 bp mRNA PRI 20-MAY-1997 DEFINITION Homo sapiens putative RNA binding protein KOC (koc) mRNA, complete cds. ACCESSION U97188 NID g2105468 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4181) AUTHORS Mueller-Pillasch,F., Lacher,U., Wallrapp,C. and et,a.l. TITLE KH-domain containing transcript overexpressed in cancer JOURNAL Oncogene (1997) In press REFERENCE 2 (bases 1 to 4181) AUTHORS Mueller-Pillasch,F., Lacher,U., Wallrapp,C. and et,a.l. TITLE Direct Submission JOURNAL Submitted (11-APR-1997) Medizinische Klinik, Internal Medicine I, Robert-Koch-Str.8, Ulm 89081, Germany FEATURES Location/Qualifiers source 1..4181 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" /map="7p11.5" /cell_type="pancreatic cancer" /cell_line="PaTu8988t" gene 251..1990 /gene="koc" CDS 251..1990 /gene="koc" /function="possible proliferation of cancer tissues" /note="putative RNA binding protein; KH-domain containing protein overexpressed in cancer" /codon_start=1 /product="KOC" /db_xref="PID:g2105469" /translation="MNKLYIGNLSENAAPSDLESIFKDAKIPVSGPFLVKTGYAFVDC PDESWALKAIEALSGKIELHGKPIEVEHSVPKRQRIRKLQIRNIPPHLQWEVLDSLLV QYGVVESCEQVNTDSETAVVNVTYSSKDQARQALDKLNGFQLENFTLKVAYIPDEMAA QQNPLQQPRGRRGLGQRGSSRQGSPGSVSKQKPCDLPLRLLVPTQFVGAIIGKEGATI RNITKQTQSKIDVHRKENAGAAEKSITILSTPEGTSAACKSILEIMHKEAQDIKFTEE IPLKILAHNNFVGRLIGKEGRNLKKIEQDTDTKITISPLQELTLYNPERTITVKGNVE TCAKAEEEIMKKIRESYENDIASMNLQAHLIPGLNLNALGLFPPTSGMPPPTSGPPSA MTPPYPQFEQSETETVHQFIPALSVGAIIGKQGQHIKQLSRFAGASIKIAPAEAPDAK VRMVIITGPPEAQFKAQGRIYGKIKEENFVSPKEEVKLEAHIRVPSFAAGRVIGKGGK TVNELQNLSSAEVVVPRDQTPDENDQVVVKITGHFYACQVAQRKIQEILTQVKQHQQQ KALQSGPPQSRRK" BASE COUNT 1303 a 830 c 851 g 1181 t 16 others ORIGIN 1 ggtggatgcg tttgggttgt agctaggctt tttcttttct ttctctttta aaacacatct 61 agacaaggaa aaaacaagcc tcggatctga tttttcactc ctcgttcttg tgcttggttc 121 ttactgtgtt tgtgtatttt aaaggcgaga agacgagggg aacaaaacca gctggatcca 181 tccatcaccg tgggtggttt taatttttcg ttttttctcg ttattttttt ttaaacaacc 241 actcttcaca atgaacaaac tgtatatcgg aaacctcagc gagaacgccg ccccctcgga 301 cctagaaagt atcttcaagg acgccaagat cccggtgtcg ggacccttcc tggtgaagac 361 tggctacgcg ttcgtggact gcccggacga gagctgggcc ctcaaggcca tcgaggcgct 421 ttcaggtaaa atagaactgc acgggaaacc catagaagtt gagcactcgg tcccaaaaag 481 gcaaaggatt cggaaacttc agatacgaaa tatcccgcct catttacagt gggaggtgct 541 ggatagttta ctagtccagt atggagtggt ggagagctgt gagcaagtga acactgactc 601 ggaaactgca gttgtaaatg taacctattc cagtaaggac caagctagac aagcactaga 661 caaactgaat ggatttcagt tagagaattt caccttgaaa gtagcctata tccctgatga 721 aatggccgcc cagcaaaacc ccttgcagca gccccgaggt cgccgggggc ttgggcagag 781 gggctcctca aggcaggggt ctccaggatc cgtatccaag cagaaaccat gtgatttgcc 841 tctgcgcctg ctggttccca cccaatttgt tggagccatc ataggaaaag aaggtgccac 901 cattcggaac atcaccaaac agacccagtc taaaatcgat gtccaccgta aagaaaatgc 961 gggggctgct gagaagtcga ttactatcct ctctactcct gaaggcacct ctgcggcttg 1021 taagtctatt ctggagatta tgcataagga agctcaagat ataaaattca cagaagagat 1081 ccccttgaag attttagctc ataataactt tgttggacgt cttattggta aagaaggaag 1141 aaatcttaaa aaaattgagc aagacacaga cactaaaatc acgatatctc cattgcagga 1201 attgacgctg tataatccag aacgcactat tacagttaaa ggcaatgttg agacatgtgc 1261 caaagctgag gaggagatca tgaagaaaat cagggagtct tatgaaaatg atattgcttc 1321 tatgaatctt caagcacatt taattcctgg attaaatctg aacgccttgg gtctgttccc 1381 acccacttca gggatgccac ctcccacctc agggccccct tcagccatga ctcctcccta 1441 cccgcagttt gagcaatcag aaacggagac tgttcatcag tttatcccag ctctatcagt 1501 cggtgccatc atcggcaagc agggccagca catcaagcag ctttctcgct ttgctggagc 1561 ttcaattaag attgctccag cggaagcacc agatgctaaa gtgaggatgg tgattatcac 1621 tggaccacca gaggctcagt tcaaggctca gggaagaatt tatggaaaaa ttaaagaaga 1681 aaactttgtt agtcctaaag aagaggtgaa acttgaagct catatcagag tgccatcctt 1741 tgctgctggc agagttattg gaaaaggagg caaaacggtg aatgaacttc agaatttgtc 1801 aagtgcagaa gttgttgtcc ctcgtgacca gacacctgat gagaatgacc aagtggttgt 1861 caaaataact ggtcacttct atgcttgcca ggttgcccag agaaaaattc aggaaattct 1921 gactcaggta aagcagcacc aacaacagaa ggctctgcaa agtggaccac ctcagtcaag 1981 acggaagtaa aggctcagga aacagcccac cacagaggca gatgccaaac caaagacaga 2041 ttgcttaacc aacagatggg cgctgacccc ctatccagaa tcacatgcac aagtttttac 2101 ctagccagtt gtttctgagg accaggcaac ttttgaactc ctgtctctgt gagaatgtat 2161 actttatgct ctctgaaatg tatgacaccc agctttaaaa caaacaaaca aacaaacaaa 2221 aaaagggtgg gggagggagg gaaagagaag agctctgcac ttccctttgt tgtagtctca 2281 cagtataaca gatattctaa ttcttcttaa tattccccca taatgccaga aattggctta 2341 atgatgcttt cactaaattc atcaaataga ttgctcctaa atccaattgt taaaattgga 2401 tcagaataat tatcacagga acttaaatgt taagccatta gcatagaaaa actgttctca 2461 gttttatttt tacctaacac taacatgagt aacctaaggg aagtgctgaa tggtgttggc 2521 aggggtatta aacgtgcatt tttactcaac tacctcaggt attcagtaat acaatgaaaa 2581 gcaaaattgt tccttttttt tgaaaatttt atatacttta taatgataga agtccaaccg 2641 ttttttaaaa aataaattta aaatttaaca gcaatcagct aacaggcaaa ttaagatttt 2701 tacttctggc tggtgacagt aaagctggaa aattaatttc agggtttttt gaggcttttg 2761 acacagttat tagttaaatc aaatgttcaa aaatacggag cagtgcctag tatctggaga 2821 gcagcactac catttattct ttcatttata gttgggaaag tttttgacgg tactaacaaa 2881 gtggtcgcag gagattttgg aacggctggt ttaaatggct tcaggagact tcagtttttt 2941 gtttagctac atgattgaat gcataataaa tgctttgtgc ttctgactat caatacctaa 3001 agaaagtgca tcagtgaaga gatgcaagac tttcaactga ctggcaaaaa gcaagcttta 3061 gcttgtctta taggatgctt agtttgccac tacacttcag accaatggga cagtcataga 3121 tggtgtgaca gtgtttaaac gcaacaaaag gctacatttc catggggcca gcactgtcat 3181 gagcctcact aagctatttt gaagattttt aagcactgat aaattaaaaa aaaaaaaaaa 3241 aaattagact ccaccttaag tagtaaagta taacaggatt tctgtatact gtgcaatcag 3301 ttctttgaaa aaaaagtcaa aagatagaga atacaagaaa agttttnggg atataatttg 3361 aatgactgtg aaaacatatg acctttgata acgaactcat ttgctcactc cttgacagca 3421 aagcccagta cgtacaattg tgttgggtgt gggtggtctc caaggccacg ctgctctctg 3481 aattgatttt ttgagttttg gnttgnaaga tgatcacagn catgttacac tgatcttnaa 3541 ggacatatnt tataaccctt taaaaaaaaa atcccctgcc tcattcttat ttcgagatga 3601 atttcgatac agactagatg tctttctgaa gatcaattag acattntgaa aatgatttaa 3661 agtgttttcc ttaatgttct ctgaaaacaa gtttcttttg tagttttaac caaaaaagtg 3721 ccctttttgt cactggtttc tcctagcatt catgattttt ttttcacaca atgaattaaa 3781 attgctaaaa tcatggactg gctttctggt tggatttcag gtaagatgtg tttaaggcca 3841 gagcttttct cagtatttga tttttttccc caatatttga ttttttaaaa atatacacat 3901 aggagctgca tttaaaacct gctggtttaa attctgtcan atttcacttc tagcctttta 3961 gtatggcnaa tcanaattta cttttactta agcatttgta atttggagta tctggtacta 4021 gctaagaaat aattcnataa ttgagttttg tactcnccaa anatgggtca ttcctcatgn 4081 ataatgtncc cccaatgcag cttcattttc caganacctt gacgcaggat aaattttttc 4141 atcatttagg tccccaaaaa aaaaaaaaaa aaaaaaaaaa a // LOCUS HSU97198 1778 bp mRNA PRI 02-OCT-1997 DEFINITION Homo sapiens CG1 mRNA, complete cds. ACCESSION U97198 NID g2459798 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1778) AUTHORS Van Laer,L., Van Camp,G., van Zuylen,D., Green,E., Verstreken,M., Schatteman,I., Van de Heyning,P., Balemans,W., Coucke,P., Greinwald,J.H., Smith,R.J.H., Huizing,E. and Willems,P. TITLE Refined mapping of a gene for autosomal dominant progressive sensorineural hearing loss (DFNA5) to a 2 cM region, and isolation of a candidate gene that is expressed in the cochlea JOURNAL Unpublished REFERENCE 2 (bases 1 to 1778) AUTHORS Van Laer,L., Van Camp,G., van Zuylen,D., Green,E., Verstreken,M., Schatteman,I., Van de Heyning,P., Balemans,W., Coucke,P., Greinwald,J.H., Smith,R.J.H., Huizing,E. and Willems,P. TITLE Direct Submission JOURNAL Submitted (11-APR-1997) Medical Genetics, University of Antwerp, Universiteitsplein 1, Antwerp 2610, Belgium FEATURES Location/Qualifiers source 1..1778 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="7" /map="7p15" /clone="za27e05; yw33a03; yg04b09; c-0jd01; c-0ab06" gene 1..1778 /gene="CG1" CDS 220..1491 /gene="CG1" /codon_start=1 /db_xref="PID:g2459799" /translation="MAICQFFLQGRCRFGDRCWNEHPGARGAGGGRQQPQQQPSGNNR RGWNTTSQRYSNVIQPSSFSKSTPWGGSRDQEKPYFSSFDSGASTNRKEGFGLSENPF ASLSPDEQKDEKKLLEGIVKDMEVWESSGQWMFSVYSPVKKKPNISGFTDISPEELRL EYHNFLTSNNLQSYLNSVQRLINQWRNRVNELKSLNISTKVALLSDVKDGVNQAAPAF GFGSSQAATFMSPGFPVNNSSSDNAQNFSFKTNSGFAAASSGSPAGFGSSPAFGAAAS TSSGISTSAPAFGFGKPEVTSAASFSFKSPAASSFGSPGFSGLPASLATGPVRAPVAP AFGGGSSVAGFGSPGSHSHTAFSKPSSDTFGNSSISTSLSASSSIIATDNVLFTPRDK LTVEELEQFQSKKFTLGKIPLKPPPLELLNV" BASE COUNT 532 a 379 c 389 g 478 t ORIGIN 1 aaagcgtcag agcggtccgc tcttctcaag tcctattggc tcaagtccat gcgtctcatg 61 gcactgccgg gtgagaggcg gaaaaaggcc ttcccaggcg cgagaagatg acgtcacagt 121 agcccggccg ggcgccgagg ttgccttagg gctctctgcc cagtaacagg catcgaacgg 181 tgcagactga agacgccctc cgtcagcgac gccgtcgcaa tggccatttg tcaattcttc 241 cttcaaggcc ggtgccgctt tggagatcgg tgctggaacg aacatcccgg tgctaggggt 301 gcaggaggag gacggcagca accgcagcag cagccttcag gtaataatag acgtggatgg 361 aatacaacta gccagagata ttccaatgtc atccagccat ccagtttctc caaatccaca 421 ccatgggggg gcagcagaga tcaagaaaag ccatatttca gttcttttga ttctggagct 481 tcaactaaca ggaaggaagg ctttggattg tctgagaacc catttgcttc acttagtcct 541 gatgagcaga aagatgaaaa gaaacttctg gaaggaattg taaaagatat ggaggtttgg 601 gaatcatcag ggcagtggat gttttctgtt tattcaccag tgaaaaagaa acctaatatt 661 tcaggtttta cagacatttc accagaggaa ttgaggcttg aataccataa cttcttaacc 721 agcaataact tacagagtta tctaaattct gtccaacgtt taataaatca atggaggaac 781 agggtaaatg aactgaaaag tctaaatata tcaactaaag tagctttgct ctctgatgta 841 aaggatggag taaatcaagc agcacctgca tttggatttg gcagcagtca agcagcaaca 901 tttatgtcgc caggctttcc agtcaataac agcagcagtg ataatgctca gaactttagt 961 tttaaaacaa actctggatt tgctgctgcc tcttctggaa gccctgctgg ttttgggagt 1021 tccccagcat ttggagctgc agcctctacc agttcaggta tctctacttc tgctccagct 1081 tttggatttg ggaagcctga agtcacatcg gctgcatcat tttcattcaa aagccctgca 1141 gcttccagtt ttggatcacc tggattttca ggacttccag cttccttggc aacaggtcct 1201 gtcagagctc cagtggcccc agcctttgga ggtggcagtt ctgtggctgg ttttggtagt 1261 ccgggctcac attctcacac tgctttttct aagccatcca gtgacacttt tggaaatagc 1321 agcatatcca cttctctgtc agcctcaagc agcatcattg caacagataa tgtgttattc 1381 acacccagag ataaactaac agtagaagaa ctggaacaat ttcaatccaa gaaatttact 1441 ctgggaaaaa ttccattaaa gcctccacct ctggaacttc taaatgttta aaagggcaat 1501 tttaaataca aaaaagaatg atgtttaaaa ttgctttgag tgattcatac agagatgtat 1561 atatgcatac atgtatatat tcataaggaa tataagcttc catcaatagt gattttaaat 1621 ttgatttttt tcttaactct aaatatttaa gtaaaaagta acaaaaactc tgcaagcaag 1681 ggaatttttt tgtactgtaa ttttgaatgg aactgaaaaa ttatgcacga ataaagtact 1741 tttctcatgc cgggaccaaa aaaaaaaaaa aaaaaaaa // LOCUS HSU97519 5869 bp mRNA PRI 25-JUN-1997 DEFINITION Homo sapiens podocalyxin-like protein mRNA, complete cds. ACCESSION U97519 NID g2213812 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5869) AUTHORS Kershaw,D.B., Beck,S.G., Wharram,B.L., Wiggins,J.E., Goyal,M., Thomas,P.E. and Wiggins,R.C. TITLE Molecular cloning and characterization of human podocalyxin-like Protein ORTHOLOGOUS RELATIONSHIP TO RABBIT PCLP1 AND RAT PODOCALYXIN JOURNAL J. Biol. Chem. 272 (25), 15708-15714 (1997) MEDLINE 97332652 REFERENCE 2 (bases 1 to 5869) AUTHORS Kershaw,D.B., Beck,S.G., Wharram,B.L., Wiggins,J.E., Goyal,M., Thomas,P.E. and Wiggins,R.C. TITLE Direct Submission JOURNAL Submitted (14-APR-1997) Pediatrics, University of Michigan, F6865/Box 0297 1505 Simpson Road East, Ann Arbor, MI 48109-0297, USA FEATURES Location/Qualifiers source 1..5869 /organism="Homo sapiens" /db_xref="taxon:9606" 5'UTR 1..251 /evidence=not_experimental sig_peptide 251..265 /evidence=not_experimental CDS 251..1837 /codon_start=1 /product="podocalyxin-like protein" /db_xref="PID:g2213813" /translation="MRCALALSALLLLLSTPPLLPSSPSPSPSPSPSQNATQTTTDSS NKTAPTPASSVTIMATDTAQQSTVPTSKANEILASVKATTLGVSSDSPGTTTLAQQVS GPVNTTVARGGGSGNPTTTIESPKSTKSADTTTVATSTATAKPNTTSSQNGAEDTTNS GGKSSHSVTTDLTSTKAEHLTTPHPTSPLSPRQPTLTHPVATPTSSGHDHLMKISSSS STVAIPGYTFTSPGMTTTLPSSVISQRTQQTSSQMPASSTAPSSQETVQPTSPATALR TPTLPETMSSSPTAASTTHRYPKTPSPTVAHESNWAKCEDLETQTQSEKQLVLNLTGN TLCAGGASDEKLISLICRAVKATFNPAQDKCGIRLASVPGSQTVVVKEITIHTKLPAK DVYERLKDKWDELKEAGVSDMKLGDQGPPEEAEDRFSMPLIITIVCMASFLLLVAALY GCCHQRLSQRKDQQRLTEELQTVENGYHDNPTLEVMETSSEMQEKKVVSLNGELGDSW IVPLDNLTKDDLDEEEDTHL" variation 315..320 /note="missing in one of two clones" /replace="" variation 433..435 /note="TGC in one clone" /replace="tgc" variation 836..838 /note="TCG in one clone changing L to S" /replace="tcg" misc_difference 963 /note="A region of 96 base pairs which encodes a 32 amino acid span similar to Alu-derived amino acid sequences was found in two of three clones" /replace="tagagacagtgtttcaccatgtcagccaggctggtcttgaactcctga cctcgggtgatctgcccaccttggcctcccaaagtgctgggattacag" BASE COUNT 1508 a 1540 c 1482 g 1338 t 1 others ORIGIN 1 aaacgccgcc caggacgcag ccgccgccgc cgccgctcct ctgccactgg ctctgcgccc 61 cagcccggct ctgctgcagc ggcagggagg aagagccgcc gcagcgcgac tcgggagccc 121 cgggccacag cctggcctcc ggagccaccc acaggcctcc ccgggcggcg cccacgctcc 181 taccgcccgg acgcgcggat cctccgccgg caccgcagcc acctgctccc ggcccagagg 241 cgacgacacg atgcgctgcg cgctggcgct ctcggcgctg ctgctactgt tgtcaacgcc 301 gccgctgctg ccgtcgtcgc cgtcgccgtc gccgtcgccg tcgccctccc agaatgcaac 361 ccagactact acggactcat ctaacaaaac agcaccgact ccagcatcca gtgtcaccat 421 catggctaca gatacagccc agcagagcac agtccccact tccaaggcca acgaaatctt 481 ggcctcggtc aaggcgacca cccttggtgt atccagtgac tcaccgggga ctacaaccct 541 ggctcagcaa gtctcaggcc cagtcaacac taccgtggct agaggaggcg gctcaggcaa 601 ccctactacc accatcgaga gccccaagag cacaaaaagt gcagacacca ctacagttgc 661 aacctccaca gccacagcta aacctaacac cacaagcagc cagaatggag cagaagatac 721 aacaaactct ggggggaaaa gcagccacag tgtgaccaca gacctcacat ccactaaggc 781 agaacatctg acgacccctc accctacaag tccacttagc ccccgacaac ccactttgac 841 gcatcctgtg gccaccccaa caagctcggg acatgaccat cttatgaaaa tttcaagcag 901 ttcaagcact gtggctatcc ctggctacac cttcacaagc ccggggatga ccaccaccct 961 accgtcatcg gttatctcgc aaagaactca acagacctcc agtcagatgc cagccagctc 1021 tacggcccct tcctcccagg agacagtgca gcccacgagc ccggcaacgg cattgagaac 1081 acctaccctg ccagagacca tgagctccag ccccacagca gcatcaacta cccaccgata 1141 ccccaaaaca ccttctccca ctgtggctca tgagagtaac tgggcaaagt gtgaggatct 1201 tgagacacag acacagagtg agaagcagct cgtcctgaac ctcacaggaa acaccctctg 1261 tgcagggggc gcttcggatg agaaattgat ctcactgata tgccgagcag tcaaagccac 1321 cttcaacccg gcccaagata agtgcggcat acggctggca tctgttccag gaagtcagac 1381 cgtggtcgtc aaagaaatca ctattcacac taagctccct gccaaggatg tgtacgagcg 1441 gctgaaggac aaatgggatg aactaaagga ggcaggggtc agtgacatga agctagggga 1501 ccaggggcca ccggaggagg ccgaggaccg cttcagcatg cccctcatca tcaccatcgt 1561 ctgcatggcg tcattcctgc tcctcgtggc ggccctctat ggctgctgcc accagcgcct 1621 ctcccagagg aaggaccagc agcggctaac agaggagctg cagacagtgg agaatggtta 1681 ccatgacaac ccaacactgg aagtgatgga gacctcttct gagatgcagg agaagaaggt 1741 ggtcagcctc aacggggagc tgggggacag ctggatcgtc cctctggaca acctgaccaa 1801 ggacgacctg gatgaggagg aagacacaca cctctagtcc ggtctgccgg tggcctccag 1861 cagcaccaca gagctccaga ccaaccaccc caagtgccgt ttggatgggg aagggaaaga 1921 ctggggaggg agagtgaact ccgaggggtg tcccctccca atccccccag ggccttaatt 1981 tttccctttt caacctgaac aaatcacatt ctgtccagat tcctcttgta aaataaccca 2041 ctagtgcctg agctcagtgc tgctggatga tgagggagat caagaaaaag ccacgtaagg 2101 gactttatag atgaactagt ggaatccctt cattctgcag tgagattgcc gagacctgaa 2161 gagggtaagt gacttgccca aggtcagagc cacttggtga cagagccagg atgagaacaa 2221 agattccatt tgcaccatgc cacactgctg tgttcacatg tgccttccgt ccagagcagt 2281 cccgggcagg ggtgaaactc cagcaggtgg ctgggctgga aaggagggca gggctacatc 2341 ctggctcggt gggatctgac gacctgaaag tccagctccc aagttttcct tctcctaccc 2401 cagcctcgtg tacccatctt cccaccctct atgttcttac ccctccctac actcagtgtt 2461 tgttcccact tactctgtcc tggggcctct gggattagca caggttattc ataaccttga 2521 accccttgtt ctggattcgg attttctcac atttgcttcg tgagatgggg gcttaaccca 2581 cacaggtctc cgtgcgtgaa ccaggtctgc ttaggggacc tgcgtgcagg tgaggagaga 2641 aggggacact cgagtccagg ctggtatctc agggcagctg atgaggggtc agcaggaaca 2701 ctggcccatt gcccctggca ctccttgcag aggccaccca cgatcttctt tgggcttcca 2761 tttccaccag ggactaaaat ctgctgtagc tagtgagagc agcgtgttcc ttttgttgtt 2821 cactgctcag ctgatgggag tgattccctg agacccagta tgaaagagca gtggctgcag 2881 gagaggcctt cccggggccc cccatcagcg atgtgtcttc agagacaatc cattaaagca 2941 gccaggaagg acaggctttc ccctgtatat cataggaaac tcagggacat ttcaagttgc 3001 tgagagtttt gttatagttg ttttctaacc cagccctcca ctgccaaagg ccaaaagctc 3061 agacagttgg cagacgtcca gttagctcat ctcactcact ctgattctcc tgtgccacag 3121 gaaaagaggg cctggaaagc gcagtgcatg ctgggtgcat gaagggcagc ctgggggaca 3181 gactgttgtg ggaacgtccc actgtcctgg cctggagcta ggccttgctg ttcctcttct 3241 ctgtgagcct agtggggctg ctgcggttct cttgcagttt ctggtggcat ctcaggggaa 3301 cacaaaagct atgtctattc cccaatatag gacttttatg ggctcggcag ttagctgcca 3361 tgtagaaggc tcctaagcag tgggcatggt gaggtttcat ctgattgaga agggggaatc 3421 ctgtgtggaa tgttgaactt tcgccatggt ctccatcgtt ctgggcgtaa attccctggg 3481 atcaagtagg aaaatgggca gaactgctta ggggaatgaa attgccattt ttcgggtgaa 3541 acgccacacc tccagggtct taagagtcag gctccggctg tagtagctct gatgaaatag 3601 gctatccact cgggatggct tactttttaa aagggtaggg ggaggggctg gggaagatct 3661 gtcctgcacc atctgcctaa ttccttcctc acagtctgta gccatctgat atcctagggg 3721 gaaaaggaag gccaggggtt cacatagggc cccagcgagt ttcccaggag ttagagggat 3781 gcgaggctaa caagttccaa aaacatctgc cccgatgctc tagtgtttgg aggtgggcag 3841 gatggagaac agtgcctgtt tgggggaaaa caggaaatct tgttaggctt gagtgaggtg 3901 tttgcttcct tcttgcccag cgctgggttc tctccaccca gtaggttttc tgttgtggtc 3961 ccgtgggaga ggccagactg gattattcct cctttgctga tcctgggtca cacttcacca 4021 gccagggctt ttgacggaga cagcaaatag gcctctgcaa atcaatcaaa ggctgcaacc 4081 ctatggcctc ttggagacag atgatgactg gcaaggacta gagagcagga gtgcctggcc 4141 aggtcggtcc tgactctcct gactctccat cgctctgtcc aaggagaacc cggagaggct 4201 ctgggctgat tcagaggtta ctgctttata ttcgtccaaa ctgtgttagt ctaggcttag 4261 gacagcttca gaatctgaca ccttgccttg ctcttgccac caggacacct atgtcaacag 4321 gccaaacagc catgcatcta taaaggtcat catcttctgc cacctttact gggttctaaa 4381 tgctctctga taattcagag agcattgggt ctgggaagag gtaagaggaa cactagaagc 4441 tcagcatgac ttaaacaggt tgtagcaaag acagtttatc atcaactctt tcagtggtaa 4501 actgtggttt ccccaagctg cacaggaggc cagaaaccac aagtatgatg actaggaagc 4561 ctactgtcat gagagtgggg agacaggcag caaagcttat gaaggaggta cagaatattc 4621 tttgcgttgt aagacagaat acgggtttaa tctagtctag gcrccagatt tttttcccgc 4681 ttgataagga aagctagcag aaagtttatt taaaccactt cttgagcttt atcttttttg 4741 acaatatact ggagaaactt tgaagaacaa gttcaaactg atacatatac acatattttt 4801 ttgataatgt aaatacagtg accatgttaa cctaccctgc actgctttaa gtgaacatac 4861 tttgaaaaag cattatgtta gctgagtgat ggccaagttt tttctctgga caggaatgta 4921 aatgtcttac tggaaatgac aagtttttgc ttgatttttt tttttaaaca aaaaatgaaa 4981 tataacaaga caaacttatg ataaagtatt tgtcttgtag atcaggtgtt ttgttttgtt 5041 tttttaattt taaaatgcaa ccctgccccc tccccagcaa agtcacagct ccatttcagt 5101 aaaggttgga gtcaatatgc tctggttggc aggcaaccct gtagtcatgg agaaaggtat 5161 ttcaagatct agtccaatct ttttctagag aaaaagataa tctgaagctc acaaagatga 5221 agtgacttcc tcaaaatcac atggttcagg acagaaacaa gattaaaacc tggatccaca 5281 gactgtgcgc ctcagaagga ataatcggta aattaagaat tgctactcga aggtgccaga 5341 atgacacaaa ggacagaatt cctttcccag ttgttaccct agcaaggcta gggagggcat 5401 gaacacaaac ataagaactg gtcttctcac actttctctg aatcatttag gtttaagatg 5461 taagtgaaca attctttctt tctgccaaga aacaaagttt tggatgagct tttatatatg 5521 gaacttactc caacaggact gagggaccaa ggaaacatga tgggggaggc aagagagggc 5581 aaagagtaaa actgtagcat agcttttgtc acggtcacta gctgatccct caggtctgct 5641 gcaaacacag catggaggac acagatgact ctttggtgtt ggtctttttg tctgcagtga 5701 atgttcaaca gtttgcccag gaactggggg atcatatatg tcttagtgga caggggtctg 5761 aagtacactg gaatttactg agaaacttgt ttgtaaaaac tatagttaat aattattgca 5821 ttttcttaca aaaatatatt ttggaaaatt gtatactgtc aattaaagt // LOCUS HSUBCH5 459 bp RNA PRI 06-JUL-1995 DEFINITION H.sapiens UBCH5 mRNA for ubiquitin conjugating enzyme. ACCESSION X78140 NID g460809 KEYWORDS UBCH5 gene; ubiquitin conjugating enzyme. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 459) AUTHORS Scheffner,M., Huibregtse,J.M. and Howley,P.M. TITLE Identification of a human ubiquitin-conjugating enzyme that mediates the E6-AP-dependent ubiquitination of p53 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91 (19), 8797-8801 (1994) MEDLINE 94377440 REFERENCE 2 (bases 1 to 459) AUTHORS Scheffner,M. TITLE Direct Submission JOURNAL Submitted (11-MAR-1994) M. Scheffner, Deutsches Krebsforschungszentrum, FS:Angewandte Tumorvirologie, Abt. 0662, Im Neuenheimer Feld 242, 69120 Heidelberg, FRG FEATURES Location/Qualifiers source 1..459 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="primary human foreskin keratinocytes" gene 16..459 /gene="UBCH5" CDS 16..459 /gene="UBCH5" /codon_start=1 /product="ubiquitin conjugating enzyme" /db_xref="PID:g460810" /translation="MALKRIQKELSDLQRDPPAHCSAGPVGDDLFHWQATIMGPPDSA YQGGVFFLTVHFPTDYPFKPPKIAFTTKIYHPNINSNGSICLDILRSQWSPALTVSKV LLSICSLLCDPNPDDPLVPDIAQIYKSDKEKYNRHAREWTQKYAM" BASE COUNT 144 a 106 c 84 g 125 t ORIGIN 1 cgccatccct gacccatggc gctgaagagg attcagaaag aattgagtga tctacagcgc 61 gatccacctg ctcactgttc agctggacct gtgggagatg acttgttcca ctggcaagcc 121 actattatgg ggcctcctga tagcgcatat caaggtggag tcttctttct cactgtacat 181 tttccgacag attatccttt taaaccacca aagattgctt tcacaacaaa aatttaccat 241 ccaaacataa acagtaatgg aagtatttgt ctcgatattc tgaggtcaca atggtcacca 301 gctctgactg tatcaaaagt tttattgtcc atatgttctc tactttgtga tcctaatcca 361 gatgacccct tagtaccaga tattgcacaa atctataaat cagacaaaga aaaatacaac 421 agacatgcaa gagaatggac tcagaaatat gcaatgtaa // LOCUS HSUBCH6 582 bp RNA PRI 07-MAY-1996 DEFINITION H.sapiens mRNA for ubiquitin conjugating enzyme, UbcH6. ACCESSION X92963 NID g1064913 KEYWORDS UbcH6 gene; ubiquitin-conjugating enzyme. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 582) AUTHORS Nuber,U., Schwarz,S., Kaiser,P., Schneider,R. and Scheffner,M. TITLE Cloning of human ubiquitin-conjugating enzymes UbcH6 and UbcH7 (E2-F1) and characterization of their interaction with E6-AP and RSP5 JOURNAL J. Biol. Chem. 271 (5), 2795-2800 (1996) MEDLINE 96162027 REFERENCE 2 (bases 1 to 582) AUTHORS Nuber,U. TITLE Direct Submission JOURNAL Submitted (10-NOV-1995) U. Nuber, Deutsches Krebsforschungszentrum, FS: Angewandte Tumorvirologie, Abt.: 0662, Im Neuenheimer Feld 242, D-69120 Heidelberg, FRG COMMENT Overlapping sequence: T19225. FEATURES Location/Qualifiers source 1..582 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="primary foreskin keratinocytes cell line" gene 1..582 /gene="UbcH6" CDS 1..582 /gene="UbcH6" /codon_start=1 /product="ubiquitin-conjugating enzyme UbcH6" /db_xref="PID:g1064914" /translation="MSDDDSRASTSSSSSSSSNQQTEKETNTPKKKESKVSMSKNSKL LSTSAKRIQKELADITLDPPPNCSAGPKGDNIYEWRSTILGPPGSVYEGGVFFLDITF TPEYPFKPPKVTFRTRIYHCNINSQGVICLDILKDNWSPALTISKVLLSICSLLTDCN PADPLVGSIATQYMTNRAEHDRMARQWTKRYAT" BASE COUNT 180 a 150 c 116 g 136 t ORIGIN 1 atgtcggatg acgattcgag ggccagcacc agctcctcct catcttcgtc ctccaaccag 61 caaaccgaga aagaaacaaa cacccccaag aagaaggaga gtaaagtcag catgagcaaa 121 aactccaaac tcctctccac cagcgccaag agaattcaga aggagctggc ggacatcact 181 ttagaccctc cacctaattg cagtgctggt cccaaaggcg ataacatcta tgaatggaga 241 tcaaccattc tagggcctcc aggatccgtg tatgagggtg gtgtattctt tctcgatatc 301 acttttacac cagaatatcc cttcaagcct ccaaaggtta catttcggac aagaatctat 361 cattgtaata ttaacagtca aggtgttatt tgcttggaca tattgaaaga taattggagt 421 ccagcactaa ccatttctaa agtcctcctt tctatctgct cacttcttac agactgtaat 481 cctgccgacc ccttggtggg aagtattgcc actcagtata tgaccaacag agcagaacat 541 gacagaatgg ccagacagtg gaccaagaga tacgctacat aa // LOCUS HSUBCH7 465 bp RNA PRI 07-MAY-1996 DEFINITION H.sapiens mRNA for ubiquitin conjugating enzyme, UbcH7. ACCESSION X92962 NID g1064915 KEYWORDS UbcH7 gene; ubiquitin-conjugating enzyme. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 465) AUTHORS Nuber,U., Schwarz,S., Kaiser,P., Schneider,R. and Scheffner,M. TITLE Cloning of human ubiquitin-conjugating enzymes UbcH6 and UbcH7 (E2-F1) and characterization of their interaction with E6-AP and RSP5 JOURNAL J. Biol. Chem. 271 (5), 2795-2800 (1996) MEDLINE 96162027 REFERENCE 2 (bases 1 to 465) AUTHORS Nuber,U. TITLE Direct Submission JOURNAL Submitted (10-NOV-1995) U. Nuber, Deutsches Krebsforschungszentrum, FS: Angewandte Tumorvirologie, Abt.: 0662, Im Neuenheimer Feld 242, D-69120 Heidelberg, FRG COMMENT Overlapping sequences: R91081, T30242. FEATURES Location/Qualifiers source 1..465 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="primary foreskin keratinocytes cell line" gene 1..465 /gene="UbcH7" CDS 1..465 /gene="UbcH7" /codon_start=1 /product="ubiquitin-conjugating enzyme UbcH7" /db_xref="PID:g1064916" /translation="MAASRRLMKELEEIRKCGMKNFRNIQVDEANLLTWQGLIVPDNP PYDKGAFRIEINFPAEYPFKPPKITFKTKIYHPNIDEKGQVCLPVISAENWKPATKTD QVIQSLIALVNDPQPEHPLRADLAEEYSKDRKKFCKNAEEFTKKYGEKRPVD" BASE COUNT 158 a 108 c 105 g 94 t ORIGIN 1 atggcggcca gcaggaggct gatgaaggag cttgaagaaa tccgcaaatg tgggatgaaa 61 aacttccgta acatccaggt tgatgaagct aatttattga cttggcaagg gcttattgtt 121 cctgacaacc ctccatatga taagggagcc ttcagaatcg aaatcaactt tccagcagag 181 tacccattca aaccaccgaa gatcacattt aaaacaaaga tctatcaccc aaacatcgac 241 gaaaaggggc aggtctgtct gccagtaatt agtgccgaaa actggaagcc agcaaccaaa 301 accgaccaag taatccagtc cctcatagca ctggtgaatg acccccagcc tgagcacccg 361 cttcgggctg acctagctga agaatactct aaggaccgta aaaaattctg taagaatgct 421 gaagagttta caaagaaata tggggaaaag cgacctgtgg actaa // LOCUS HSUBPQPC 518 bp RNA PRI 19-JAN-1995 DEFINITION Human mRNA for mitochondrial ubiquinone-binding protein (QP-C). ACCESSION X13585 NID g37579 KEYWORDS ubiquinol-cytochrome c oxidoreductase; ubiquinone-binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 518) AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T. TITLE Cloning and sequencing of a cDNA for human mitochondrial ubiquinone-binding protein of complex III JOURNAL Biochem. Biophys. Res. Commun. 156 (2), 987-994 (1988) MEDLINE 89050136 COMMENT This is a nuclear-encoded gene for a mitochondrial protein. FEATURES Location/Qualifiers source 1..518 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="fibroblast" /cell_line="GM637" /clone_lib="pcD" CDS 33..368 /note="ubiquinone-binding protein (AA 1 - 111)" /codon_start=1 /db_xref="PID:g37580" /db_xref="SWISS-PROT:P14927" /translation="MAGKQAVSASGKWLDGIRKWYYNAAGFNKLGLMRDDTIYEDEDV KEAIRRLPENLYNDRMFRIKRALDLNLKHQILPKEQWTKYEEENFYLEPYLKEVIRER KEREEWAKK" misc_feature 467..472 /note="pot. polyA signal" BASE COUNT 178 a 73 c 127 g 140 t ORIGIN 1 cggagaaggc aacgcttctc tttctggtca aaatggctgg taagcaggcc gtttcagcat 61 caggcaagtg gctggatggt attcgaaaat ggtattacaa tgctgcagga ttcaataaac 121 tggggttaat gcgagatgat acaatatacg aggatgaaga tgtaaaagaa gccataagaa 181 gacttcctga gaacctttat aatgacagga tgtttcgcat taagagggca ctggacctga 241 acttgaagca tcagatcttg cctaaagagc agtggaccaa atatgaagag gaaaatttct 301 accttgaacc gtatctgaaa gaggttattc gggaaagaaa agaaagagaa gaatgggcaa 361 agaagtaatc atgtagttga agtctgtgga tgcagctgtt atgaagatgg ttaaacttga 421 aacaaacaat tttaagaatt atttggtctg aagatgtttt actttaaata aatgtctatt 481 gtaatggctg gagtttttga attccaaacc ttatactg // LOCUS HSUBQPRTS 4022 bp RNA PRI 07-MAR-1997 DEFINITION H.sapiens mRNA for herpesvirus associated ubiquitin-specific protease (HAUSP). ACCESSION Z72499 NID g1545951 KEYWORDS ubiquitin-specific protease. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4022) AUTHORS Everett,R.D., Meredith,M., Orr,A., Cross,A., Kathoria,M. and Parkinson,J. TITLE A novel ubiquitin-specific protease is dynamically associated with the PML nuclear domain and binds to a herpesvirus regulatory protein JOURNAL EMBO J. 16 (3), 566-577 (1997) MEDLINE 97186723 REFERENCE 2 (bases 1 to 4022) AUTHORS Everett,R.D. TITLE Direct Submission JOURNAL Submitted (21-MAY-1996) Everett R.D., MRC Virology Unit Church Street Glasgow United Kingdom G11 5JR FEATURES Location/Qualifiers source 1..4022 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="fibroblast" /cell_line="HeLa" /clone_lib="lambda ZAP" /sex="Female" CDS 200..3508 /codon_start=1 /evidence=experimental /product="herpesvirus associated ubiquitin-specific protease (HAUSP)" /db_xref="PID:e244584" /db_xref="PID:g1545952" /translation="MNHQQQQQQQKAGEQQLSEPEDMEMEAGDTDDPPRITQNPVING NVALSDGHNTAEEDMEDDTSWRSEATFQFTVERFSRLSESVLSPPCFVRNLPWKIMVM PRFYPDRPHQKSVGFFLQCNAESDSTSWSCHAQAVLKIINYRDDEKSFSRRISHLFFH KENDWGFSNFMAWSEVTDPEKGFIDDDKVTFEVFVQADAPHGVAWDSKKHTGYVGLKN QGATCYMNSLLQTLFFTNQLRKAVYMMPTEGDDSSKSVPLALQRVFYELQHSDKPVGT KKLTKSFGWETLDSFMQHDVQELCRVLLDNVENKMKGTCVEGTIPKLFRGKMVSYIQC KEVDYRSDRREDYYDIQLSIKGKKNIFESFVDYVAVEQLDGDNKYDAGEHGLQEAEKG VKFLTLPPVLHLQLMRFMYDPQTDQNIKINDRFEFPEQLPLDEFLQKTDPKDPANYIL HAVLVHSGDNHGGHYVVYLNPKGDGKWCKFDDDVVSRCTKEEAIEHNYGGHDDDLSVR HCTNAYMLVYIRESKLSEVLQAVTDHDIPQQLVERLQEEKRIEAQKRKERQEAHLYMQ VQIVAEDQFCGHQGNDMYDEEKVKYTVFKVLKNSSLAEFVQSLSQTMGFPQDQIRLWP MQARSNGTKRPAMLDNEADGNKTMIELSDNENPWTIFLETVDPELAASGATLPKFDKD HDVMLFLKMYDPKTRSLNYCGHIYTPISCKIRDLLPVMCDRAGFIQDTSLILYEEVKP NLTERIQDYDVSLDKALDELMDGDIIVFQKDDPENDNSELPTAKEYFRDLYHRVDVIF CDKTIPNDPGFVVTLSNRMNYFQVAKTVAQRLNTDPMLLQFFKSQGYRDGPGNPLRHN YEGTLRDLLQFFKPRQPKKLYYQQLKMKITDFENRRSFKCIWLNSQFREEEITLYPDK HGCVRDLLEECKKAVELGEKASGKLRLLEIVSYKIIGVHQEDELLECLSPATSRTFRI EEIPLDQVDIDKENEMLVTVAHFHKEVFGTFGIPFLLRIHQGEHFREVMKRIQSLLDI QEKEFEKFKFAIVMTGRHQYINEDEYEVNLKDFEPQPGNMSHPRPWLGLDHFNKAPKR SRYTYLEKAIKIHN" polyA_site 4013 BASE COUNT 1183 a 845 c 1012 g 982 t ORIGIN 1 gtacgtgcgc gtctccctgc cgccgccgcc gcccgccgcg ggccgccccg gggccgccgt 61 cgccgacgac gcgcgggagg aggaggagga ggccgccccg ccgccgccgc cgccgccgcc 121 gccccggctc gccgccgccc gcccgccggg ctcgcagccc cggcccccgg ccgcaggcga 181 ggcccaggcc gcggccgaca tgaaccacca gcagcagcag cagcagcaga aagcgggcga 241 gcagcagttg agcgagcccg aggacatgga gatggaagcg ggagatacag atgacccacc 301 aagaattact cagaaccctg tgatcaatgg gaatgtggcc ctgagtgatg gacacaacac 361 cgcggaggag gacatggagg atgacaccag ttggcgctcc gaggcaacct ttcagttcac 421 tgtggagcgc ttcagcagac tgagtgagtc ggtccttagc cctccgtgtt ttgtgcgaaa 481 tctgccatgg aagattatgg tgatgccacg cttttatcca gacagaccac accaaaaaag 541 cgtaggattc tttctccagt gcaatgctga atctgattcc acgtcatggt cttgccatgc 601 acaagcagtg ctgaagataa taaattacag agatgatgaa aagtcgttca gtcgtcgtat 661 tagtcatttg ttcttccata aagaaaatga ttggggattt tccaatttta tggcctggag 721 tgaagtgacc gatcctgaga aaggatttat agatgatgac aaagttacct ttgaagtctt 781 tgtacaggcg gatgctcccc atggagttgc gtgggattca aagaagcaca caggctacgt 841 cggcttaaag aatcagggag cgacttgtta catgaacagc ctgctacaga cgttattttt 901 cacgaatcag ctacgaaagg ctgtgtacat gatgccaacc gagggggatg attcgtctaa 961 aagcgtccct ttagcattac aaagagtgtt ctatgaatta cagcatagtg ataaacctgt 1021 aggaacaaaa aagttaacaa agtcatttgg gtgggaaact ttagatagct tcatgcaaca 1081 tgatgttcag gagctttgtc gagtgttgct cgataatgtg gaaaataaga tgaaaggcac 1141 ctgtgtagag ggcaccatac ccaaattatt ccgcggcaaa atggtgtcct atatccagtg 1201 taaagaagta gactatcggt ctgatagaag agaagattat tatgatatcc agctaagtat 1261 caaaggaaag aaaaatatat ttgaatcatt tgtggattat gtggcagtag aacagctcga 1321 tggggacaat aaatacgacg ctggggaaca tggcttacag gaagcagaga aaggtgtgaa 1381 attcctaaca ttgccaccag tgttacatct acaactgatg agatttatgt atgaccctca 1441 gacggaccaa aatatcaaga tcaatgatag gtttgaattc ccagagcagt taccacttga 1501 tgaatttttg caaaaaacag atcctaagga ccctgcaaat tatattcttc atgcagtcct 1561 ggttcatagt ggagataatc atggtggaca ttatgtggtt tatctaaacc ccaaagggga 1621 tggcaaatgg tgtaaatttg atgacgacgt ggtgtcaagg tgtactaaag aggaagcaat 1681 tgagcacaat tatgggggtc acgatgacga cctgtctgtt cgacactgca ctaatgctta 1741 catgttagtc tacatcaggg aatcaaaact gagtgaagtt ttacaggcgg tcaccgacca 1801 tgatattcct cagcagttgg tggagcgatt acaagaagag aaaaggatcg aggctcagaa 1861 gcggaaggag cggcaggaag cccatctcta tatgcaagtg cagatagtcg cagaggacca 1921 gttttgtggc caccaaggga atgacatgta cgatgaagaa aaagtgaaat acactgtgtt 1981 caaagtattg aagaactcct cgcttgctga gtttgttcag agcctctctc agaccatggg 2041 atttccacaa gatcaaattc gattgtggcc catgcaagca aggagtaatg gaacaaaacg 2101 accagcaatg ttagataatg aagccgacgg caataaaaca atgattgagc tcagtgataa 2161 tgaaaaccct tggacaatat tcctggaaac agttgatccc gagctggctg ctagtggagc 2221 gaccttaccc aagtttgata aagatcatga tgtaatgtta tttttgaaga tgtatgatcc 2281 caaaacgcgg agcttgaatt actgtgggca tatctacaca ccaatatcct gtaaaatacg 2341 tgacttgctc ccagttatgt gtgacagagc aggatttatt caagatacta gccttatcct 2401 ctatgaggaa gttaaaccga atttaacaga gagaattcag gactatgacg tgtctcttga 2461 taaagccctt gatgaactaa tggatggtga catcatagta tttcagaagg atgaccctga 2521 aaatgataac agtgaattac ccaccgcaaa ggagtatttc cgagatctct accaccgcgt 2581 tgatgtcatt ttctgtgata aaacaatccc taatgatcct ggatttgtgg ttacgttatc 2641 aaatagaatg aattattttc aggttgcaaa gacagttgca cagaggctca acacagatcc 2701 aatgttgctg cagtttttca agtctcaagg ttatagggat ggcccaggta atcctcttag 2761 acataattat gaaggtactt taagagatct tctacagttc ttcaagccta gacaacctaa 2821 gaaactttac tatcagcagc ttaagatgaa aatcacagac tttgagaaca ggcgaagttt 2881 taaatgtata tggttaaaca gccaatttag ggaagaggaa ataacactat atccagacaa 2941 gcatgggtgt gtccgggacc tgttagaaga atgtaaaaag gccgtggagc ttggggagaa 3001 agcatcaggg aaacttaggc tgctagaaat tgtaagctac aaaatcattg gtgttcatca 3061 agaagatgaa ctattagaat gtttatctcc tgcaacgagc cggacgtttc gaatagagga 3121 aatccctttg gaccaggtgg acatagacaa agagaatgag atgcttgtca cagtggcgca 3181 tttccacaaa gaggtcttcg gaacgttcgg aatcccgttt ttgctgagga tacaccaggg 3241 cgagcatttt cgagaagtga tgaagcgaat ccagagcctg ctggacatcc aggagaagga 3301 gtttgagaag tttaaatttg caattgtaat gacgggccga caccagtaca taaatgaaga 3361 cgagtatgaa gtaaatttga aagactttga gccacagccc ggtaatatgt ctcatcctcg 3421 gccttggcta gggctcgacc acttcaacaa agccccaaag aggagtcgct acacttacct 3481 tgaaaaggcc attaaaatcc ataactgatt tccaagctgg tgtgttcaag gcgaggacgg 3541 tgtgtgggtg gccccttaac agcctagaac tttggtgcac gtgccctcta gccgaagtct 3601 tcagcaagag gattcgctgc tggtgttaat tttattttat tgaggctgtt cagtttggct 3661 tctctgtatc tattgactgc cctttttgag caaaatgaag atgtttttat aaagcttgga 3721 tgccaatgag agttatttta tggtaaccac agtgcaaggc aactgtcagc gcaatggggg 3781 agaagaggtt agtggatcgg gggtccctgg ctcaaggtct ctgggctgtc cctagtgggc 3841 acgagtggct cggctgcctt cctggggtcc cgtgcaccag ccctgcagct agcaagtctt 3901 gtgtttaggc tcgtctgacc tatttccttc agttatactt tcaatgacct tttgtgcatc 3961 tgttaaggca aaacagagaa actcacaacc taataaatag cgctcttccc ttcaaaaaaa 4021 aa // LOCUS HSUCEH1 761 bp RNA PRI 22-APR-1994 DEFINITION H.sapiens (23k/1) mRNA for ubiquitin-conjugating enzyme UbcH2. ACCESSION Z29328 NID g474826 KEYWORDS UbcH2; ubiquitin-conjugating enzyme. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 761) AUTHORS Kaiser,P., Seufert,W., Hofferer,L., Kofler,B., Sachsenmaier,C., Herzog,H., Jentsch,S., Schweiger,M. and Schneider,R. TITLE A human ubiquitin-conjugating enzyme homologous to yeast UBC8 JOURNAL J. Biol. Chem. 269 (12), 8797-8802 (1994) MEDLINE 94179285 REFERENCE 2 (bases 1 to 761) AUTHORS Kaiser,P. TITLE Direct Submission JOURNAL Submitted (10-JAN-1994) Peter Kaiser, Department of Biochemistry, University Innsbruck, Peter-Mayr-Strasse 1a, Innsbruck, A-6020, Austria FEATURES Location/Qualifiers source 1..761 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="UbcH2, cDNA clone 23k/1, shortest transcript" /cell_type="HeLa" CDS 54..605 /codon_start=1 /product="Ubiquitin-conjugating enzyme UbcH2" /db_xref="PID:g474827" /db_xref="SWISS-PROT:P37286" /translation="MSSPSPGKRRMDTDVVKLIESKHEVTILGGLNEFVVKFYGPQGT PYEGGVWKVRVDLPDKYPFKSPSIGFMNKIFHPNIDEASGTVCLDVINQTWTALYDLT NIFESFLPQLLAYPNPIDPLNGDAAAMYLHRPEEYKQKIKEYIQKYATEEALKEQEEG TGDSSSESSMSDFSEDEAQDMEL" polyA_site 761 BASE COUNT 242 a 156 c 173 g 190 t ORIGIN 1 ccgggccgtg acagacggcc ggcagaggaa gggagagagg cggcggcgac accatgtcat 61 ctcccagtcc gggcaagagg cggatggaca cggacgtggt caagctcatc gagagtaaac 121 atgaggttac gatcctggga ggacttaatg aatttgtagt gaagttttat ggaccacaag 181 gaacaccata tgaaggcgga gtatggaaag ttagagtgga cctacctgat aaataccctt 241 tcaaatctcc atctatagga ttcatgaata aaattttcca tcccaacatt gatgaagcgt 301 caggaactgt gtgtctagat gtaattaatc aaacttggac agctctctat gatcttacca 361 atatatttga gtccttcctg cctcagttat tggcctatcc taaccccata gatcctctca 421 atggtgacgc tgcagccatg tacctccacc gaccagaaga atacaagcag aaaattaaag 481 agtacatcca gaaatacgcc acggaggagg cgctgaaaga acaggaagag ggtaccgggg 541 acagctcatc ggagagctct atgtctgact tttccgaaga tgaggcccag gatatggagt 601 tgtagtagaa aaagcacctg cttttcagaa agactattat ttcctaacca tgagaagcag 661 actataatat tcatatttaa acaaagcaat tttttttatt actaaacaag gtttttatga 721 ataatagcat tgatatatat atattatata tcacccttta g // LOCUS HSUDGM 1224 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for uracil-DNA glycosylase. ACCESSION X52486 NID g37586 KEYWORDS glycosylase; uracil-DNA glycosylase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1224) AUTHORS Caradonna,S.J. TITLE Direct Submission JOURNAL Submitted (06-MAR-1990) Caradonna S.J., University of Med. and Dent. of New Jersey, Dept of Biochemistry, 675 Hoes Lane, Piscataway New Jersey 08854, U S A REFERENCE 2 (bases 1 to 1224) AUTHORS Muller,S.J. and Caradonna,S. TITLE Isolation and characterization of a human cDNA encoding uracil-DNA glycosylase JOURNAL Biochim. Biophys. Acta 1088 (2), 197-207 (1991) MEDLINE 91159471 COMMENT Data kindly reviewed (07-NOV-1990) by Muller S.J. FEATURES Location/Qualifiers source 1..1224 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T-cell" /cell_line="Jurkat" /clone_lib="lambda gt11" /chromosome="5" CDS 80..1060 /note="uracil-DNA glycosylase" /codon_start=1 /db_xref="PID:g37587" /db_xref="SWISS-PROT:P22674" /translation="MEPLPSFELLSPLREVTLYDALCTAPGPNPGVPAQQARSVSSFH CVSLSELFLPLSWEAPRFFLALPSLPQLPLHPKPSGPASPPPSRQVTAESRCKLLSWL IPVHRQFGLSFESLCLTVNTLDRFLTTTPVLQTASSCLGSPPCSSLANRWRCTRRAWK QLLALCCGAFSRQQLCNLECIRAAQAALHPGCATISFFLTFQHLLSAARPPKLWKRKP WRRGVAELSLADYAFTSYSPSLLAICCLALADRMLRLAARGLATGRPPGGGAGGLYGQ VAAAGGHKQYFLDSHAARSDLREVQPAPELEIKQILRFLLVPGPAAGPLP" misc_feature 1203..1208 /note="polyA signal" BASE COUNT 205 a 408 c 314 g 297 t ORIGIN 1 gtttccgggt tgagtctgga ctcctaccga actcgggcaa gttatttaac acccctgagc 61 ttctttcttc atctgcaaaa tggaacccct cccttccttt gaattgctga gcccattacg 121 cgaggtaacg ttgtacgacg ctctctgcac agcgccaggt cccaatcccg gtgttccggc 181 tcagcaggcc aggagcgttt ccagtttcca ttgcgtttct ctttctgaac tcttcttgcc 241 cctctcctgg gaagctcccc gattcttctt ggcccttcct tcccttccac agctccctct 301 ccaccctaag cccagcgggc ccgcctcccc ccctccttcc cggcaggtga cggcggaatc 361 ccgctgtaag ctgctcagct ggctgatccc ggtgcaccgc caattcggcc tctccttcga 421 gtcgctgtgc ctgacggtga acactctgga ccgcttcctc accaccacgc cggtgctgca 481 gactgcttcc agctgcttgg ggtcacctcc ttgctcatcg cttgcaaaca ggtggaggtg 541 cacccgccgc gcgtggaagc agcttctggc cctctgctgc ggcgccttct cccggcagca 601 gctctgcaac ctcgagtgca tccgtgctgc acaagctgca cttcaccctg ggtgcgccac 661 cattagcttc ttcctgacat ttcagcacct cctgagcgca gctaggcctc cgaagctctg 721 gaagcgcaag ccctggcgcc ggggggtggc agagctgagt ctggccgact atgccttcac 781 cagctactcc ccttccctcc tggcgatctg ctgcctggcg ctggcggacc gcatgctgcg 841 tctcgcggcc cgtggacttg cgactgggag accacccgga ggcggcgctg gaggactgta 901 tgggcaagtt gcagctgctg gtggccataa acagtacttc cttgactcac atgctgcccg 961 ttcagatctg cgagaagtgc agcctgcccc cgagctcgaa ataaaacaga tccttcgttt 1021 ccttttagtc cctggcccgg ctgctggacc tctcccgtag cctcagaaga gtgcagtact 1081 ggtccacaga gaaggcttca ggacctgctt ggtcagctgc aggttgtaaa tagtgtacga 1141 tactagcatc tggtatttta tttattttgc agcgagcaca tgaggaagct gagtctttca 1201 ccaataaaca gttgtggttt gtct // LOCUS HSUNKPROT 2483 bp RNA PRI 07-OCT-1997 DEFINITION H.sapiens mRNA for unknown protein. ACCESSION Y09858 NID g1729768 KEYWORDS unknown protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2483) AUTHORS Laval,S.H., Reed,V., Blair,H.J. and Boyd,Y. TITLE The structure of DXF34, a human X-linked sequence family with homology to a transcribed mouse Y-linked repeat JOURNAL Mamm. Genome 8 (9), 689-691 (1997) MEDLINE 97419273 REFERENCE 2 (bases 1 to 2483) AUTHORS Laval,S.H. TITLE Direct Submission JOURNAL Submitted (04-DEC-1996) S.H. Laval, University of Oxford, Department of Psychiatry, Warneford Hospital, Oxford, OX3 7JX, UK FEATURES Location/Qualifiers source 1..2483 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /map="p11.23-cen" /dev_stage="fetal" CDS 494..1192 /codon_start=1 /product="unknown protein" /db_xref="PID:e284880" /db_xref="PID:g1729769" /translation="MTKKKVSQKKQRGRPSSQPCRNIVGCRISHGWKEGDEPITQWKG TVLDQVPINPSLYLVKYDGIDCVYGLELHRDERVLSLKILSDRVASSHISDANLANTI IGKAVEHMFEGEHGSKDEWRGMVLAQAPIMKAWFYITYEKDPVLYMYQLLDDYKEGDL RIMPESSESPPTEREPGGVVDGLIGKHVEYTKEDGSKRIGMVIHQVETKPSVYFIKFD DDFHIYVYDLVKKS" BASE COUNT 726 a 481 c 705 g 571 t ORIGIN 1 gaattcggca cgagggcagt gggggttaca gacagcacca gaggcactca gggcaggagg 61 gtagtgggtg ctggtcaagg ccgctttgca ggggattggg agtggatgct agtggcagta 121 aagggtaaag ggagggcagt gggctctgag ctaggaggag gtggcccagc cacttctgca 181 gagttcctaa ccccagacag ctctactgca ccaccccagc ccttccccac agcccaaagg 241 aacttaagag ggagagcact gtgggcatgc agccaggatt tccagcattg cttccaggct 301 cctaggggaa acttaggcac tagcgcagac tacatccaga catggctata gggaaaagtt 361 tgccttgtgg cacactgggt atgcctctct atcctctctc ctcccccagc aggcatgaag 421 acccccaacg cacaggaagc cgaaggggca acaaaccagg gcagctgcag gacgggccac 481 tgggtctgca aacatgacaa agaaaaaagt ctcccaaaag aagcagagag gcagaccttc 541 atcccaaccc tgcaggaaca tcgtgggctg cagaatttct catggatgga aggaaggaga 601 tgagcccatc acgcagtgga aaggaaccgt tctggatcag gtgcctataa atccctctct 661 ttatctggtg aaatatgatg gaattgactg tgtctatgga ctggaacttc acagagatga 721 aagggttttg tctcttaaaa ttctttctga cagggtggca tcatctcaca ttagtgatgc 781 caaccttgca aataccataa ttggcaaagc agtggaacac atgtttgagg gagagcatgg 841 ttctaaggat gaatggaggg ggatggtctt agctcaagca cctatcatga aagcctggtt 901 ttatattacc tatgagaaag atcctgtctt gtacatgtac cagcttctag atgattataa 961 ggaaggtgac ctccgcatca tgccagaatc cagtgagtct cctccaacag agagggagcc 1021 aggaggagtt gtagatggcc taataggtaa gcatgtggaa tataccaaag aagatggctc 1081 caaaaggatc ggcatggtca ttcaccaagt ggaaaccaaa ccctctgtgt atttcatcaa 1141 gtttgatgat gatttccata tctatgtcta cgatttggtg aaaaagtcct aactgttagg 1201 gtaaaatttg gcacatgtgt ggaaacaaat gtataatttg tagacatgca aaaaatgttg 1261 cctttcagtg tactgaaagc ttatggaatc cctgataact aaacatcttt gccagcatta 1321 actgttgttt tgctctaaaa aatacaaatt tgtgaataca tgacatgctg tctgtaagcc 1381 ctttgtcttg ttgaaaagtt cgggtgtgtt tggtagatgg ggcatggaag gaacgaacag 1441 ctgtcaattt cggctgtgaa taaagttcag ctagaatcat aatcagtcat ttaaaaatgg 1501 cactggattt agctggtctg gtctggaggg ggcaggggaa ggaacaagag attggctgtc 1561 ttggggagag aggaggaggt gactgcttag gaagaggtgg aaaagggcca gaaagggagg 1621 ggctccttgg gggaggggat gacctatgag aaggaaacat ccaactcaga aggaagacta 1681 acagagggag agtgagatct tggggcttag aggaaaatag aataaactga gtagagagga 1741 acacaaagaa gcttagaaga gggtccagag taaggtggag aaggatcaac ctgacaggga 1801 tctgggagag gctaaaggat acacaaagga agaggctatg tgggttaggg tgttggaggg 1861 attagaagag ggccactgag gaaggaggaa ttcaaagaag aaagactgac agagggagtg 1921 agaatacagg aaaagagttg tggggtatgg ggcccatgag agatcggaag gcccaggaaa 1981 atgttggact tctggtagtg tccacattgc tcttcctggc tctatgcaca aaggaagtgg 2041 caactgtcaa agatgccaga tgcattggag gttcactaga agagcatgtt gacaggaatg 2101 actcagtcaa gttaatcagg agccaagttt atgcccataa ggcttttcct gtttcaggga 2161 gttaatgaca cacttgtcca gcagaaaccc aatgctcagc ctaaacacgc taacatggtg 2221 ccccacatgt gctgggctgt gggctgcctc ctcacatttg tcctgcgcta gataagcaat 2281 cttggtagag gatgggttag tggggctctc agaacttaag agatgtgcca tcttgcattt 2341 ggaaagtact ttatccagaa aaaatgaaat cttactgaac tcaggtacag cagtctcatc 2401 actcctattc cttctgctgg aatgtttgtt ttctatctag agatgagggg catcagagga 2461 ctgagcctag tgtttctctc gag // LOCUS HSUPAR 1097 bp RNA PRI 18-FEB-1994 DEFINITION H.sapiens mRNA for urokinase plasminogen activator receptor. ACCESSION X74039 NID g456192 KEYWORDS urokinase plasminogen activator receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1097) AUTHORS Pyke,C., Eriksen,J., Solberg,H., Nielsen,B.S., Kristensen,P., Lund,L.R. and Dano,K. TITLE An alternatively spliced variant of mRNA for the human receptor for urokinase plasminogen activator JOURNAL FEBS Lett. 326 (1-3), 69-74 (1993) MEDLINE 93314820 REFERENCE 2 (bases 1 to 1097) AUTHORS Pyke,C. TITLE Direct Submission JOURNAL Submitted (08-NOV-1993) C. Pyke, Finsen Laboratory, Strandboulevarden 49, Bldg 7.1, DK-2100 Copenhagen, DENMARK REMARK revised by [3] MAT REFERENCE 3 (bases 1 to 1097) AUTHORS Pyke,C. TITLE Direct Submission JOURNAL Submitted (18-FEB-1994) C. Pyke, Finsen Laboratory, Strandboulevarden 49, Bldg 7.1, DK-2100 Copenhagen, DENMARK FEATURES Location/Qualifiers source 1..1097 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HT-1080" CDS 47..892 /codon_start=1 /product="urokinase plasminogen activator receptor" /db_xref="PID:g433901" /translation="MGHPPLLPLLLLLHTCVPASWGLRCMQCKTNGDCRVEECALGQD LCRTTIVRLWEEGEELELVEKSCTHSEKTNRTLSYRTGLKITSLTEVVCGLDLCNQGN SGRAVTYSRSRYLECISCGSSDMSCERGRHQSLQCRSPEEQCLDVVTHWIQEGEEGRP KDDRHLRGCGYLPGCPGSNGFHNNDTFHFLKCCNTTKCNEGPILELENLPQNGRQCYS CKGNSTHGCSSEETFLIDCRGPMNQCLVATGTHERSLWGSWLPCKSTTALRPPCCEEA QATHV" sig_peptide 47..112 mat_peptide 113..880 /product="urokinase plasminogen activator receptor" misc_feature 801..1097 /note="alternative spliced 3' end" BASE COUNT 287 a 301 c 302 g 207 t ORIGIN 1 agagaagacg tgcagggacc ccgcgcacag gagctgccct cgcgacatgg gtcacccgcc 61 gctgctgccg ctgctgctgc tgctccacac ctgcgtccca gcctcttggg gcctgcggtg 121 catgcagtgt aagaccaacg gggattgccg tgtggaagag tgcgccctgg gacaggacct 181 ctgcaggacc acgatcgtgc gcttgtggga agaaggagaa gagctggagc tggtggagaa 241 aagctgtacc cactcagaga agaccaacag gaccctgagc tatcggactg gcttgaagat 301 caccagcctt accgaggttg tgtgtgggtt agacttgtgc aaccagggca actctggccg 361 ggctgtcacc tattcccgaa gccgttacct cgaatgcatt tcctgtggct catcagacat 421 gagctgtgag aggggccggc accagagcct gcagtgccgc agccctgaag aacagtgcct 481 ggatgtggtg acccactgga tccaggaagg tgaagaaggg cgtccaaagg atgaccgcca 541 cctccgtggc tgtggctacc ttcccggctg cccgggctcc aatggtttcc acaacaacga 601 caccttccac ttcctgaaat gctgcaacac caccaaatgc aacgagggcc caatcctgga 661 gcttgaaaat ctgccgcaga atggccgcca gtgttacagc tgcaagggga acagcaccca 721 tggatgctcc tctgaagaga ctttcctcat tgactgccga ggccccatga atcaatgtct 781 ggtagccacc ggcactcacg aacgctcact ctggggaagc tggttgccat gtaaaagtac 841 tactgccctg agaccaccat gctgtgagga agcccaagct actcatgtat aaatgccatg 901 tggagataga gccccagatg tttcagccat ctcagcccag gcaccagaca agtgggtgaa 961 gaagccacct tggacatgta gccccagcag atgtgatata gagaagaaac aggaaacttg 1021 gctatattag tttcctaggg ctgcctgtga taaattatta caaactttat aaaaaaaaaa 1081 aaaaaaaaaa aaaaaaa // LOCUS HSUREATP 2053 bp RNA PRI 07-AUG-1996 DEFINITION H.sapiens mRNA for urea transporter. ACCESSION X96969 NID g1483515 KEYWORDS urea transporter. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2053) AUTHORS Olives,B., Martial,S., Mattei,M.G., Matassi,G., Rousselet,G., Ripoche,P., Cartron,J.P. and Bailly,P. TITLE Molecular characterization of a new urea transporter in the human kidney JOURNAL FEBS Lett. 386 (2-3), 156-160 (1996) MEDLINE 96228053 REFERENCE 2 (bases 1 to 2053) AUTHORS Bailly,P. TITLE Direct Submission JOURNAL Submitted (28-MAR-1996) P. Bailly, INSERM U76, INTS, 6, rue Alexandre CABANEL, F- 75739 PARIS CEDEX 15, FRANCE FEATURES Location/Qualifiers source 1..2053 /organism="Homo sapiens" /note="race=Caucasian" /db_xref="taxon:9606" /sex="female" /dev_stage="20-years old" /tissue_type="kidney" /clone_lib="lambda gt11, Cat. HL 1123B" /clone="HUT2" /chromosome="18" /map="18q12.1-q21.1" CDS 274..1467 /codon_start=1 /product="urea transporter" /db_xref="PID:e242299" /db_xref="PID:g1483516" /translation="MEESSEIKVETNISKTSWIRSSMAASGKRVSKALSYITGEMKEC GEGLKDKSPVFQFFDWVLRGTSQVMFVNNPLSGILIILGLFIQNPWWAISGCLGTIMS TLTALILSQDKSAIAAGFHGYNGVLVGLLMAVFSDKGDYYWWLLLPVIIMSMSCPILS SALGTIFSKWDLPVFTLPFNITVTLYLAATGHYNLFFPTTLLQPASAMPNITWSEVQV PLLLRAIPVGIGQVYGCDNPWTGGIFLIALFISSPLICLHAAIGSTMGMLAALTIATP FDSIYFGLCGFNSTLACIAIGGMFYVITWQTHLLAIACALFAAYLGAALANMLSVFGL PPCTWPFCLSALTFLLLTTNNPAIYKLPLSKVTYPEANRIYYLSQERNRRASIITKYQ AYDVS" BASE COUNT 508 a 596 c 466 g 483 t ORIGIN 1 tccggagaca cagtgggatc caaaaaagta ccatgtaata acatacatga tttctgcagc 61 aaaatgttac actaagcagg acaaagatgt aaagtccctc acacatcaat tgaggttatt 121 gtgctgccct atcaatttac ttaaacttcc aggggatttt cagttttaac actgaatttc 181 cgggaaaagg cgaacaccag gaaagacaaa acaaagaccc atttccctat cgataccgga 241 atgcccacag tcgagctgct tgatctggac accatggagg agagctctga gataaaagtg 301 gaaacaaaca tttccaagac atcctggatt cggagttcca tggctgccag tgggaaaagg 361 gtcagcaaag ccctcagcta catcacagga gagatgaagg agtgtggaga gggacttaaa 421 gacaagtccc cagtgttcca gttctttgac tgggtcctcc gaggcacatc tcaagtgatg 481 tttgtgaaca accccctcag cggcatcctc atcatcctcg gcctcttcat ccagaacccc 541 tggtgggcga tctcaggctg cctgggtacc atcatgtcca ccttgacagc cctcatcctg 601 agtcaggaca agtcggccat cgctgcagga tttcacggct acaatggggt gctggtgggg 661 ctgctgatgg ccgtgttctc agacaaaggt gactactact ggtggctgtt gctacccgtc 721 atcatcatgt ccatgtcttg ccccatcctc tccagtgccc tgggtaccat cttcagcaag 781 tgggacctcc cagtcttcac actgcccttc aatatcactg tgactttgta cctggcagcc 841 acaggccact acaacctttt cttccccaca acgctgctgc agcctgcatc cgccatgccc 901 aacatcacct ggtcagaggt ccaagtgccc ttgcttttga gagccatccc cgttggaatt 961 ggccaagtgt acggctgtga taacccctgg actggaggca tcttcctcat agctctgttc 1021 atatcctcac ctctcatttg cttgcatgca gcaattggat ccaccatggg gatgctagca 1081 gcactcacta ttgcgacgcc ctttgactcc atctacttcg gcctgtgtgg cttcaacagc 1141 accctcgcat gcatagcgat aggaggcatg ttctacgtca tcacctggca gacgcacctc 1201 ctcgccatcg cctgcgcact gtttgctgcc tacctgggtg ctgccctggc taacatgtta 1261 tctgtgtttg gattgccgcc ctgcacttgg cccttctgtc tctcagctct caccttcctg 1321 ctcctgacga ccaataaccc cgccatctac aagctcccgc tcagcaaagt cacctaccca 1381 gaggccaacc gcatctacta cctgtcccag gagagaaaca gaagggcatc aatcataaca 1441 aagtatcaag cctacgatgt ctcctaagtt tccctgtcta aaacacatca gtgtaaattc 1501 aggcttcagc acgccgtcca gatccccagg ataagagacc acttagcctt ccctttggtc 1561 tgttctgtga ctctctcccc aaacacaaag aagcgtgtat gtagtcacca ttccagaacc 1621 tctcttttct aagatgcaca acacttatca aagatatgtt tagtttagac tttataccct 1681 tagctttccc ataagagctc cctttgtggg gaacttgccc tcttctgcga aataagcctc 1741 atccttaaag agaagtcacc ggccgggcac ggtcgtcacg cctgtaatcc cagcactttg 1801 ggaggccgag gcgggtggat cacgaggtca ggagatcgag accatcctgg cgaacatggt 1861 gagaccccat ctctactaaa aatacaaaaa attagccagg catggtggcg ggcacctgta 1921 gtcccagcta cttgggaggc tgaggcagga gaatggcgtg aacccgggag gtggagcttg 1981 cagtgagcca agatcacgcc actgcactcc agcctgggca acggagtgag actctgtctc 2041 aaaaaaaaaa ccg // LOCUS HSUROPLAK 932 bp RNA PRI 09-JUN-1997 DEFINITION Homo sapiens mRNA for uroplakin II. ACCESSION Y13645 NID g2190406 KEYWORDS uroplakin II. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 932) AUTHORS Smith,B.A. JOURNAL Unpublished REFERENCE 2 (bases 1 to 932) AUTHORS Smith,B.A. TITLE Direct Submission JOURNAL Submitted (06-JUN-1997) B.A. Smith, University of Leeds, ICRF Cancer Medicine Research Unit, St James' University Hospital, Beckett Street, Leeds, West Yorkshire, LS9 7TF, UK FEATURES Location/Qualifiers source 1..932 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="ureter" CDS 39..593 /codon_start=1 /product="uroplakin II" /db_xref="PID:e321535" /db_xref="PID:g2190407" /translation="MAPLLPIRTLPLILILLALLSPGAADFNISSLSGLLSPALTESL LVALPPCHLTGGNATLMVRRANDSKVVTSSFVVPPCRGRRELVSVVDSGAGFTVTRLS AYQVTNLVPGTKFYISYLVKKGTATESSREIPMSTLPRRNMESIGLGMARTGGMVVIT VLPSVAMFLLVLGFIIALALGSRK" polyA_signal 894..899 polyA_site 918 BASE COUNT 182 a 330 c 216 g 204 t ORIGIN 1 gaaagcctgc cagcacctat tccacctccc agcccagcat ggcacccctg ctgcccatcc 61 ggaccttgcc cttgatcctg attctgctgg ctctgctgtc cccaggggct gcagacttca 121 acatctcaag cctctctggt ctgctgtccc cggcgctaac ggagagcctg ctggttgcct 181 tgcccccctg tcacctcaca ggaggcaatg ccacactgat ggtccggaga gccaatgaca 241 gcaaagtggt gacgtccagc tttgtggtgc ctccgtgccg tgggcgcagg gaactggtga 301 gtgtggtgga cagtggtgct ggcttcacag tcactcggct cagtgcatac caggtgacaa 361 acctcgtgcc aggaaccaaa ttctacattt cctacctagt gaagaagggg acagccactg 421 agtccagcag agagatccca atgtccacac tccctcgaag gaacatggaa tccattgggc 481 tgggtatggc ccgcacaggg ggcatggtgg tcatcacggt gctgccctct gtcgccatgt 541 tcctgctggt gctgggcttc atcattgccc tggcactggg ctcccgcaag taaggaggtc 601 tgcccggagc agcagcttct ccaggaagcc cagggcacca tccagctccc cagcccacct 661 gctcccaggc cccaggcctg tggctccctt ggtgccctcg cctcctcctc ctgccctcct 721 ctcccctaga gccctctcct ccctctgtcc ctctccttgc ccccagtgcc tcaccttcca 781 acactccatt attcctctca ccccactcct gtcagagttg actttcctcc cattttacca 841 ctttaaacac ccccataaca attcccccat ccttcagtga actaagtccc tataataaag 901 gctgaggctg catctgccaa aaaaaaaaaa aa // LOCUS HSUSFMR 1739 bp RNA PRI 15-FEB-1991 DEFINITION Human usf mRNA for late upstream transcription factor. ACCESSION X55666 NID g37614 KEYWORDS DNA-binding protein; helix-loop-helix domain; transcription factor; upstream stimulatory factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1739) AUTHORS Gregor,P.D. TITLE Direct Submission JOURNAL Submitted (27-SEP-1990) P.D. Gregor, THE ROCKEFELLER UNIVERSITY, 1230 YORK AVENUE, NEW YORK NY 10021, USA REFERENCE 2 (bases 1 to 1739) AUTHORS Gregor,P.D., Sawadogo,M. and Roeder,R.G. TITLE The adenovirus major late transcription factor USF is a member of the helix-loop-helix group of regulatory proteins and binds to DNA as a dimer JOURNAL Genes Dev. 4 (10), 1730-1740 (1990) MEDLINE 91065519 FEATURES Location/Qualifiers source 1..1739 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="lymphoid" /cell_type="B-cell lymphoma" /cell_line="Namalwa" /clone_lib="lambda-Zap" /clone="dI2" mRNA <121..>1053 /gene="usf" gene 121..1053 /gene="usf" CDS 121..1053 /gene="usf" /codon_start=1 /product="upstream stimulatory factor" /db_xref="PID:g37615" /db_xref="SWISS-PROT:P22415" /translation="MKGQQKTAETEEGTVQIQEGAVATGEDPTSVAIASIQSAATFPD PNVKYVFRTENGGQVMYRVIQVSEGQLDGQTEGTGAISGYPATQSMTQAVIQGAFTSD DAVDTEGTAAETHYTYFPSTAVGDGAGGTTSGSTAAVVTTQGSEALLGQATPPGTGQF FVMMSPQEVLQGGSQRSIAPRTHPYSPKSEAPRTTRDEKRRAQHNEVERRRRDKINNW IVQLSKIIPDCSMESTKSGQSKGGILSKACDYIQELRQSNHRLSEELQGLDQLQLDND VLRQQVEDLKNKNLLLRAQLRHHGLEVVIKNDSN" BASE COUNT 426 a 435 c 491 g 387 t ORIGIN 1 ggaattcctt gaaaattttc cttggatagg aaaggactta gcactcaggc ctgtgaatct 61 aggagataca aagacctcca aaaaaggacc agttcctcgg atgtgccccc tcacagagag 121 atgaaggggc agcagaaaac agctgaaacg gaagagggga cagtgcagat tcaggaaggt 181 gcagtggcta ctggggaaga cccaaccagt gtggctattg ccagcatcca gtcagctgcc 241 accttccctg accccaacgt caagtacgtc ttccgaactg agaatggggg ccaggtgatg 301 tacagggtga tccaggtgtc tgaggggcag ctggatggcc aaactgaggg aactggcgcc 361 atcagtggct accctgccac tcaatccatg acccaggcgg tgatccaggg tgctttcacc 421 agtgatgatg cagttgacac ggaggggaca gctgctgaga cgcactatac ttacttcccc 481 agcacggcag tgggagatgg ggcagggggt accacatcgg ggagtacagc tgctgttgtt 541 actacccagg gctcagaggc actgctgggg caggcgaccc ctcctggcac tggtcaattc 601 tttgtgatga tgtcaccaca agaagtactg cagggaggaa gccagcgctc aattgcccct 661 aggactcacc cttattcccc gaagtcagaa gctccccgga cgactcggga tgagaaacgc 721 agggctcagc ataatgaagt ggagcgtcgc cgccgagaca agatcaacaa ctggatcgtg 781 cagctctcca agataatccc agactgctct atggagagca ccaagtctgg ccagagtaaa 841 ggtgggattc tatccaaagc ttgtgattat atccaggagc ttcggcagag taaccaccgc 901 ttgtctgaag aactgcaggg acttgaccaa ctgcagctgg acaatgacgt gcttcgacaa 961 caggtggaag atcttaaaaa caagaatctg ctgcttcgag ctcagttgcg gcaccacgga 1021 ttagaggtcg tcatcaagaa tgacagcaac taactatggg gattcagggg ctttgggccc 1081 aagaactgca gatagcccag gagcaacagc ctaatcccgt gcccctttcc ttcactgccc 1141 cacttctggc atgggacagg gggaagttca gaaggtgtgt ccttgaactg aggccctgtg 1201 atatggcggc ctgcagtggt gtgaagcaca caatgtggaa cgtgcactga cagccttgcc 1261 cacccccacc atgcagcccc tgggccttgt gctcctctcg cacaatgcat gtgctgtctc 1321 catgctggat actggacaca ctaaactctg gggcttgtcc tgtgcttgct tagagtgccc 1381 agcagaggtt tgctgacagg tgatgctctg gcttgcccca ggactctggc acttccattg 1441 gttcttcctt tccctggagc tgaggtttag atgtgcaacc tgtggctcag gggagcaagc 1501 ttacacaaga agtgagggaa ggatgtttag cagtggctgg tgcccatgaa gaggagattg 1561 gccagtgaga agctgaggcc tatgcagaca tctctggagc cagagagaac aacaggcagg 1621 ggcccacttg gggccttccc ccttgtgggg ggtcgttttt tttttttctt ttcttttttt 1681 tttttttttt tttttttttt aagataaaat tgttcaaagc caaaaaaaaa aggaattcc // LOCUS HSV7LSP 3340 bp RNA PRI 04-NOV-1996 DEFINITION H.sapiens V7 mRNA for leukocyte surface protein. ACCESSION Z33642 NID g1658309 KEYWORDS leukocyte surface protein; regulatory protein; V7 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3340) AUTHORS Ruegg,C.L., Rivas,A., Madani,N.D., Zeitung,J., Laus,R. and Engleman,E.G. TITLE V7, a novel leukocyte surface protein that participates in T cell activation. II. Molecular cloning and characterization of the V7 gene JOURNAL J. Immunol. 154 (9), 4434-4443 (1995) MEDLINE 95238941 REFERENCE 2 (bases 1 to 3340) AUTHORS Ruegg,C.L. TITLE Direct Submission JOURNAL Submitted (18-MAY-1994) Curtis L. Ruegg, Molecular Immunology, Activated Cell Therapy, Inc., 291 North Bernardo Avenue, Mountain View, CA, 94043, USA REMARK Revised by [3] REFERENCE 3 (bases 1 to 3340) AUTHORS Ruegg,C.L. TITLE Direct Submission JOURNAL Submitted (17-OCT-1996) Curtis L. Ruegg, Molecular Immunology, Activated Cell Therapy, Inc., 291 North Bernardo Avenue, Mountain View, CA, 94043, USA FEATURES Location/Qualifiers source 1..3340 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="V7" /cell_type="CD8+ T cell clone" /cell_line="P54" /clone_lib="P54L" /chromosome="1p13" mRNA 1..3340 /gene="V7" gene 1..3340 /gene="V7" CDS 22..3087 /gene="V7" /codon_start=1 /product="leukocyte surface protein" /db_xref="PID:e280469" /db_xref="PID:g1658310" /translation="MAGISYVASFFLLLTKLSIGQREVTVQKGPLFRAEGYPVSIGCN VTGHQGPSEQHFQWSVYLPTNPTQEVQIISTKDAAFSYAVYTQRVRGGDVYVERVQGN SVLLHISKLQMKDAGEYECHTPNTDENYYGSYRAKTNLIVIPDTLSATMSSQTLGKEE GEPLALTCEASKATAQHTHLSVTWYLTQDGGGSQATEIISLSKDFILVPGPLYTERFA ASDVQLNKLGPTTFRLSIERLQSSDQGQLFCEATEWIQDPDETWMFITKKQTDQTTLR IQPAVKDFQVNITADSLFAEGKPLELVCLVVSSGRDPQLQGIWFFNGTEIAHIDAGGV LGLKNDYKERASQGELQLSKLGPKAFSLKIFSLGPEDEGAYRCVVAEVMKTRTGSWQV LQRKQSPDSHVHLRKPAARSVVVSTKNKQQVVWEGETLAFLCKAGGAESPLSVSWWHI PRDQTQPEFVAGMGQDGIVQLGASYGVPSYHGNTRLEKMDWATFQLEITFTAITDSGT YECRVSEKSRNQARDLSWTQKISVTVKSLESSLQVSLMSRQPQVMLTNTFDLSCVVRA GYSDLKVPLTVTWQFQPASSHIFHQLIRITHNGTIEWGNFLSRFQKKTKVSQSLFRSQ LLVHDATEEETGVYQCEVEVYDRNSLYNNRPPRASAISHPLRIAVTLPESKLKVNSRS QGQELSINSNTDIECSILSRSNGNLQLAIIWYFSPVSTNASWLKILEMDQTNVIKTGD EFHTPQRKQKFHTEKVSQDLFQLHILNVEDSDRGKYHCAVEEWLLSTNGTWHKLGEKK SGLTELKLKPTGSKVRVSKVYWTENVTEHREVAIRCSLESVGSSATLYSVMWYWNREN SGSKLLVHLQHDGLLEYGEEGLRRHLHCYRSSSTDFVLKLHQVEMEDAGMYWCRVAEW QLHGHPSKWINQASDESQRMVLTVLPSEPTLPSRICSSAPLLYFLFICPFVLLLLLLI SLLCLYWKARKLSTLRSNTRKEKALWVDLKEAGGVTTNRREDEEEDEGN" BASE COUNT 894 a 838 c 850 g 758 t ORIGIN 1 ctctaaagct ttagagccca aatggcaggc atctcatatg tggcatcttt ctttctcctt 61 ctgactaagc tcagcattgg ccagagagaa gtaacagttc agaaaggacc actgtttaga 121 gctgaaggtt acccagtcag cattggctgc aatgtaactg gccaccaggg accttctgag 181 cagcatttcc agtggtctgt ttacctgccg acaaacccga cccaggaagt ccagatcatt 241 agcaccaagg atgctgcctt ctcttacgca gtatatacgc agcgggtgcg aggcggagac 301 gtctacgtgg agagggtcca gggcaactca gtcttgttgc acatctcaaa actccagatg 361 aaggatgctg gcgagtatga gtgtcacaca ccaaacactg atgagaatta ctatggaagt 421 tacagagcaa agactaatct aattgttatt ccagataccc tctctgccac catgagttct 481 cagactctcg gtaaggagga aggtgagcca ttagccctca cctgtgaggc atccaaagcc 541 acagcccaac atactcacct ctctgtcacc tggtacctaa cacaggatgg aggaggaagc 601 caagccactg agattatttc tctctccaaa gattttatat tggtccctgg gcccttgtat 661 acagagcggt ttgcagccag tgacgtacag ctcaacaaac tgggacccac tacattcagg 721 ctgtccatag agaggctcca gtcctcagat cagggtcagc tgttctgtga ggcaacggaa 781 tggattcagg atccagatga aacttggatg ttcatcacca aaaagcagac cgatcaaacc 841 actctgagga tccagccagc agtgaaagat tttcaagtca acattacagc tgacagcttg 901 tttgctgaag ggaaaccctt agaactggtt tgcctggttg taagcagtgg ccgtgaccca 961 cagcttcaag gcatttggtt cttcaatggg actgaaattg ctcacattga tgctggtgga 1021 gtcctgggcc tgaagaatga ctacaaagag agagcaagtc aaggagagct ccagctttca 1081 aagttaggcc ccaaggcttt ctctctcaag atcttctctc tgggcccaga ggatgaaggc 1141 gcctacagat gtgtggtagc agaggtcatg aaaacacgca caggttcctg gcaggtgctt 1201 cagagaaagc agtcaccaga cagccacgtg cacctgagga agccagcagc aagaagtgtg 1261 gtcgtgtcta ccaagaacaa gcagcaagtt gtgtgggaag gagagacact cgcctttctc 1321 tgtaaggctg gtggagctga aagtcccctg tctgtgagct ggtggcacat cccacgggac 1381 cagacacagc ccgagtttgt ggctggcatg gggcaggatg gcattgtgca gctgggtgcc 1441 tcctatgggg tacccagtta ccatggcaac acaaggctgg agaaaatgga ctgggccacc 1501 ttccagctgg agatcacctt cactgccatc acagacagtg gcacatatga gtgcagagta 1561 tctgagaagt ctcggaacca ggccagagat ctgagctgga ctcagaagat ttcagttact 1621 gtaaagtctc tggagtcaag tttacaagtt agtctgatga gccgtcagcc gcaggtgatg 1681 ttaaccaaca cctttgacct gtcctgtgtc gtgagggccg gttactctga cctcaaggtg 1741 ccactcactg tgacgtggca gttccagcca gctagctctc acatattcca ccagcttatt 1801 cgaatcaccc acaatggcac tattgaatgg gggaatttcc tatcccggtt ccaaaagaag 1861 acgaaagtgt cgcagtcttt atttcgttca caactcctag tccatgatgc cactgaggaa 1921 gagacaggag tgtatcagtg tgaagtagaa gtttatgaca gaaattccct atacaacaac 1981 cgccccccga gggcttctgc catctctcac ccactgagga tagccgtcac tttaccagag 2041 agcaagctaa aagtgaattc aaggagtcaa gggcaagagc tctccatcaa ctccaacact 2101 gatatagaat gtagcatctt gtcccggtcc aatggaaacc ttcagttagc cattatttgg 2161 tatttttctc ctgtttccac taatgcctct tggctaaaga tcctggagat ggaccaaacc 2221 aatgttataa aaactgggga tgagtttcac accccacaga gaaaacaaaa atttcatact 2281 gagaaggttt cccaagactt atttcagctg cacattctga atgtggaaga cagcgatcgg 2341 ggcaaatatc actgtgctgt ggaggaatgg ctcctgtcta caaatggcac ttggcacaag 2401 cttggagaaa agaagtcagg actaacagaa ttgaaactca agcccacagg aagtaaggta 2461 cgtgtctcca aagtgtactg gaccgaaaat gtgactgagc acagagaagt ggccatccgc 2521 tgcagcctgg agagtgtagg cagctcagcc actctgtact ctgtgatgtg gtactggaac 2581 agagaaaact ctggaagtaa attgctggtg cacttgcaac atgatggctt gctggagtat 2641 ggggaagagg ggctcaggag gcacctgcac tgttaccgtt catcctctac agactttgtc 2701 ctgaagcttc atcaggtgga gatggaggat gcaggaatgt actggtgtag ggtggcagag 2761 tggcagctcc atggacaccc aagcaagtgg attaatcaag catccgatga gtcacagcgg 2821 atggtgctca cggtgctgcc ttcagagccc acgcttcctt ccaggatctg ctcctcggcc 2881 cctttactct atttcctgtt catctgtccc ttcgtcctgc tcctccttct gctcatctcc 2941 ctcctctgct tatactggaa ggccaggaag ttgtcaacac tgcgttccaa cacacggaaa 3001 gaaaaagctc tctgggtgga cttgaaagag gctggaggtg tgaccacaaa taggagggaa 3061 gacgaggagg aagatgaagg caactgaatc ccaagaggca cctgcagcca ggaaggaaag 3121 ccccgtgtgg aatgtggtga cctagtcacc tggaaccagc tcctgacaga ccccggcaac 3181 ttctagatga acccaagtga actttcctca ttaccatcct gaagtcacta ccccaggggg 3241 agctatagct tcatgaccgt aacatgtgac ctgtgtgctg gcaggacgac tcactgcggc 3301 tgcgccactg ggacccctcc cctacatgca ccaatgcacg // LOCUS HSVAC 1560 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for vascular anticoagulant. ACCESSION X12454 NID g37636 KEYWORDS phospholipid-binding protein; vascular anticoagulant. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1560) AUTHORS Hauptmann,R. TITLE Direct Submission JOURNAL Submitted (15-AUG-1988) Hauptmann R., Ernst Boehringer Institut fuer Arzneimittelforschung, Dr. Boehringer-Gasse 5/11, A-1121 Wien, Austria REFERENCE 2 (bases 1 to 1560) AUTHORS Maurer-Fogy,I., Reutelingsperger,C.P., Pieters,J., Bodo,G., Stratowa,C. and Hauptmann,R. TITLE Cloning and expression of cDNA for human vascular anticoagulant, a Ca2+-dependent phospholipid-binding protein JOURNAL Eur. J. Biochem. 174 (4), 585-592 (1988) MEDLINE 88271329 COMMENT Data kindly reviewed (11-Nov-1988) by Hauptmann R. FEATURES Location/Qualifiers source 1..1560 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placental" /clone="lambda P11/3 and others" misc_feature 124..132 /note="Kozak consensus (tcgccatgg)" CDS 129..1091 /note="VAC protein (AA 1-320)" /codon_start=1 /db_xref="PID:g37637" /db_xref="SWISS-PROT:P08758" /translation="MAQVLRGTVTDFPGFDERADAETLRKAMKGLGTDEESILTLLTS RSNAQRQEISAAFKTLFGRDLLDDLKSELTGKFEKLIVALMKPSRLYDAYELKHALKG AGTNEKVLTEIIASRTPEELRAIKQVYEEEYGSSLEDDVVGDTSGYYQRMLVVLLQAN RDPDAGIDEAQVEQDAQALFQAGELKWGTDEEKFITIFGTRSVSHLRKVFDKYMTISG FQIEETIDRETSGNLEQLLLAVVKSIRSIPAYLAETLYYAMKGAGTDDHTLIRVMVSR SEIDLFNIRKEFRKNFATSLYSMIKGDTSGDYKKALLLLCGEDD" misc_feature 186..386 /note="structural repeat (67 AA)" misc_feature 402..602 /note="structural repeat (67 AA)" misc_feature 651..854 /note="structural repeat (68 AA)" misc_feature 879..1079 /note="structural repeat (67 AA)" misc_feature 1539..1544 /note="polyA signal" polyA_site 1560 /note="polyA site" BASE COUNT 429 a 330 c 357 g 444 t ORIGIN 1 ttagcgtctg catctcggcg tcgcccgcgt acccgtcgcc cggctctccg ccgctctccc 61 ggggcttcgg ggcacttggg tcccacagtc gggtcctgct tcaccttccc ctgacctgag 121 tagtcgccat ggcacaggtt ctcagaggca ctgtgactga cttccctgga tttgatgagc 181 gggctgatgc agaaactctt cggaaggcta tgaaaggctt gggcacagat gaggagagca 241 tcctgactct gttgacatcc cgaagtaatg ctcagcgcca ggaaatctct gcagctttta 301 agactctgtt tggcagggat cttctggatg acctgaaatc agaactaact ggaaaatttg 361 aaaaattaat tgtggctctg atgaaaccct ctcggcttta tgatgcttat gaactgaaac 421 atgccttgaa gggagctgga acaaatgaaa aagtactgac agaaattatt gcttcaagga 481 cacctgaaga actgagagcc atcaaacaag tttatgaaga agaatatggc tcaagcctgg 541 aagatgacgt ggtgggggac acttcagggt actaccagcg gatgttggtg gttctccttc 601 aggctaacag agaccctgat gctggaattg atgaagctca agttgaacaa gatgctcagg 661 ctttatttca ggctggagaa cttaaatggg ggacagatga agaaaagttt atcaccatct 721 ttggaacacg aagtgtgtct catttgagaa aggtgtttga caagtacatg actatatcag 781 gatttcaaat tgaggaaacc attgaccgcg agacttctgg caatttagag caactactcc 841 ttgctgttgt gaaatctatt cgaagtatac ctgcctacct tgcagagacc ctctattatg 901 ctatgaaggg agctgggaca gatgatcata ccctcatcag agtcatggtt tccaggagtg 961 agattgatct gtttaacatc aggaaggagt ttaggaagaa ttttgccacc tctctttatt 1021 ccatgattaa gggagataca tctggggact ataagaaagc tcttctgctg ctctgtggag 1081 aagatgacta acgtgtcacg gggaagagct ccctgctgtg tgcctgcacc accccactgc 1141 cttccttcag cacctttagc tgcatttgta tgccagtgct taacacattg ccttattcat 1201 actagcatgc tcatgaccaa cacatacacg tcatagaaga aaatagtggt gcttctttct 1261 gatctctagt ggagatctct ttgactgctg tagtactaaa gtgtacttaa tgttactaag 1321 tttaatgcct ggccattttc catttatata tattttttaa gaggctagag tgcttttagc 1381 cttttttaaa aactccattt atattacatt tgtaaccatg atactttaat cagaagctta 1441 gccttgaaat tgtgaactct tggaaatgtt attagtgaag ttcgcaacta aactaaacct 1501 gtaaaattat gatgattgta ttcaaaagat taatgaaaaa taaacatttc tgtccccctg // LOCUS HSVACB 1940 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for vascular anticoagulant-beta (VAC-beta). ACCESSION X16662 NID g37638 KEYWORDS annexin; calcium binding protein; phospholipase a2 inhibitor; phospholipid-binding protein; vascular anticoagulant. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1940) AUTHORS Hauptmann,R. TITLE Direct Submission JOURNAL Submitted (27-SEP-1989) Hauptmann R., Ernst Boehringer Institut fuer Arzneimittelforschung, Dr. Boehringer-Gasse 5-11, A-1121 Vienna, Austria REFERENCE 2 (bases 2 to 1940) AUTHORS Hauptmann,R., Maurer-Fogy,I., Krystek,E., Bodo,G., Andree,H. and Reutelingsperger,C.P. TITLE Vascular anticoagulant beta: a novel human Ca2+/phospholipid binding protein that inhibits coagulation and phospholipase A2 activity. Its molecular cloning, expression and comparison with VAC-alpha JOURNAL Eur. J. Biochem. 185 (1), 63-71 (1989) MEDLINE 90032687 FEATURES Location/Qualifiers source 1..1940 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /clone_lib="lambda gt11" variation 78 /note="g is c in variant clone" variation 81 /note="a is g in variant clone" variation 97 /note="a is g in variant clone" CDS 107..1090 /note="vascular anticoagulant-beta (AA 1 - 327)" /codon_start=1 /db_xref="PID:g37639" /db_xref="SWISS-PROT:P13928" /translation="MAWWKAWIEQEGVTVKSSSHFNPDPDAETLYKAMKGIGTNEQAI IDVLTKRSNTQRQQIAKSFKAQFGKDLTETLKSELSGKFERLIVALMYPPYRYEAKEL HDAMKGLGTKEGVIIEILASRTKNQLREIMKAYEEDYGSSLEEDIQADTSGYLERILV CLLQGSRDDVSSFVDPALALQDAQDLYAAGEKIRGTDEMKFITILCTRSATHLLRVFE EYEKIANKSIEDSIKSETHGSLEEAMLTVVKCTQNLHSYFAERLYYAMKGAGTRDGTL IRNIVSRSEIDLNLIKCHFKKMYGKTLSSMIMEDTSGDYKNALLSLVGSDP" variation 122 /note="g is u in variant clone; gcc (Ala) is changed to ucc (Ser)" misc_feature 1919..1924 /note="pot. polyA signal" polyA_site 1940 /note="polyA site" BASE COUNT 514 a 516 c 511 g 399 t ORIGIN 1 aggcctgctc actcctcagc tgcaggagcc agacgtgtgg agtcccagca gaggccaacc 61 tgtgtctctt catctccgtg agaaaggtgc ccccgaagtg aaagagatgg cctggtggaa 121 agcctggatt gaacaggagg gtgtcacagt gaagagcagc tcccacttca acccagaccc 181 tgatgcagag accctctaca aagccatgaa ggggatcggg accaacgagc aggctatcat 241 cgatgtgctc accaagagaa gcaacacgca gcggcagcag atcgccaagt ccttcaaggc 301 tcagttcggc aaggacctca ctgagacctt gaagtctgag ctcagtggca agtttgagag 361 gctcattgtg gcccttatgt atccgccata cagatacgaa gccaaggagc tgcatgacgc 421 catgaagggc ttaggaacca aggagggtgt catcattgag atcctggcct ctcggaccaa 481 gaaccagctg cgggagataa tgaaggcgta tgaggaagac tatgggtcca gcctggagga 541 ggacatccaa gcagacacaa gtggctacct ggagaggatc ctggtgtgcc tcctgcaggg 601 cagcagggat gatgtgagca gctttgtgga cccggcactg gccctccaag acgcacagga 661 tctgtatgcg gcaggcgaga agattcgtgg gactgatgag atgaaattca tcaccatcct 721 gtgcacgcgc agtgccactc acctgctgag agtgtttgaa gagtatgaga aaattgccaa 781 caagagcatt gaggacagca tcaagagtga gacccatggc tcactggagg aggccatgct 841 cactgtggtg aaatgcaccc aaaacctcca cagctacttt gcagagagac tctactatgc 901 catgaaggga gcagggacgc gtgatgggac cctgataaga aacatcgttt caaggagcga 961 gattgactta aatcttatca aatgtcactt caagaagatg tacggcaaga ccctcagcag 1021 catgatcatg gaagacacca gcggcgacta caagaacgcc ctgctgagcc tggtgggcag 1081 cgacccctga ggcacagaag aacaagagca aagaccatga agccagagtc tccaggactc 1141 ctcactcaac ctcggccatg gacgcaggtt gggtgtgagg ggggtcccag cctttcggtc 1201 ttctatttcc ctatttccag tgctttccag ccgggtttct gacccagagg tggaaccggc 1261 ctggactcct cttcccaact tcctccaggt catttcccag tgtgagcaca atgccaacct 1321 tagtgtttct ccagccagac agatgcctca gcatgaaggg cttggggact tgtggatcat 1381 tccttcctcc ctgcaggagc ttcccaagct ggtcacagag tctcctgggc acaggttata 1441 cagaccccag ccccattccc atctactgaa acagggtctc cacaagaggg gccagggaat 1501 atgggttttt aacaagcgtc ttacaaaaca cttctctatc atgcagccgg agagctggct 1561 gggagccctt ttgttttaga acacacatcc ttcagcagct gagaaatgaa cacgaatcca 1621 tcccaaccga gatgccatta acattcatct aaaaatgtta ggctctaaat ggacgaaaaa 1681 ttctctcgcc atcttaataa caaaataaac tacaaattcc tgacccaagg acactgtgtt 1741 ataagaggcg tgggctcccc tggtggctga ccaggtcagc tgccctggcc ttgcacccct 1801 ctgcatgcag cacagaaggg tgtgaccatg ccctcagcac cactcttgtc cccactgaac 1861 ggcaactgag actgggtacc tggagattct gaagtgcctt tgctgtggtt ttcaaaataa 1921 taaagatttg tattcaactc // LOCUS HSVASCAS 6371 bp RNA PRI 31-MAR-1995 DEFINITION H.sapiens mRNA for voltage-activated sodium channel. ACCESSION X82835 NID g758109 KEYWORDS sodium channel alpha subunit; voltage-activated sodium channel. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6371) AUTHORS Klugbauer,N., Lacinova,L., Flockerzi,V. and Hofmann,F. TITLE Structure and functional expression of a new member of the tetrodotoxin-sensitive voltage-activated sodium channel family from human neuroendocrine cells JOURNAL EMBO J. 14 (6), 1084-1090 (1995) MEDLINE 95237189 REFERENCE 2 (bases 1 to 6371) AUTHORS Hofmann,F. TITLE Direct Submission JOURNAL Submitted (21-NOV-1994) F. Hofmann, Institut fuer Pharmakologie und Toxikologie, Technische Univ. Muenchen, Biedersteiner Str. 29, 80802 Muenchen, FRG FEATURES Location/Qualifiers source 1..6371 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="thyroid" /cell_line="medullary thyroid carcinoma" /cell_type="medullary thryroid carcinoma" /clone_lib="pcDNA3 (Invitrogen)" gene 49..5982 /gene="hNE-Na" CDS 49..5982 /gene="hNE-Na" /note="voltage-activated sodium channel" /codon_start=1 /product="sodium channel alpha subunit" /db_xref="PID:g758110" /translation="MAMLPPPGPQSFVHFTKQSLALIEQRIAERKSKEPKEEKKDDDE EAPKPSSDLEAGKQLPFIYGDIPPGMVSEPLEDLDPYYADKKTFIVLNKGKTIFRFNA TPALYMLSPFSPLRRISIKILVHSLFSMLIMCTILTNCIFMTMNNPPDWTKNVEYTFT GIYTFESLVKILARGFCVGEFTFLRDPWNWLDFVVIVFAYLTEFVNLGNVSALRTFRV LRALKTISVIPGLKTIVGALIQSVKKLSDVMILTVFCLSVFALIGLQLFMGNLKHKCF RNSLENNETLESIMNTLESEEDFRKYFYYLEGSKDALLCGFSTDSGQCPEGYTCVKIG RNPDYGYTSFDTFSWAFLALFRLMTQDYWENLYQQTLRAAGKTYMIFFVVVIFLGSFY LINLILAVVAMAYEEQNQANIEEAKQKELEFQQMLDRLKKEQEEAEAIAAAAAEYTSI RRSRIMGLSESSSETSKLSSKSAKERRNRRKKKNQKKLSSGEEKGDAEKLSKSESEDS IRRKSFHLGVEGHRRAHEKRLSTPNQSPLSIRGSLFSARRSSRTSLFSFKGRGRDIGS ETEFADDEHSIFGDNESRRGSLFVPHRPQERRSSNISQASRSPPMLPVNGKMHSAVDC NGVVSLVDGRSALMLPNGQLLPEGTTNQIHKKRRCSSYLLSEDMLNDPNLRQRAMSRA SILTNTVEELEESRQKCPPWWYRFAHKFLIWNCSPYWIKFKKCIYFIVMDPFVDLAIT ICIVLNTLFMAMEHHPMTEEFKNVLAIGNLVFTGIFAAEMVLKLIAMDPYEYFQVGWN IFDSLIVTLSLVELFLADVEGLSVLRSFRLLRVFKLAKSWPTLNMLIKIIGNSVGALG NLTLVLAIIVFIFAVVGMQLFGKSYKECVCKINDDCTLPRWHMNDFFHSFLIVFRVLC GEWIETMWDCMEVAGQAMCLIVYMMVMVIGNLVVLNLFLALLLSSFSSDNLTAIEEDP DANNLQIAVTRIKKGINYVKQTLREFILKAFSKKPKISREIRQAEDLNTKKENYISNH TLAEMSKGHNFLKEKDKISGFGSSVDKHLMEDSDGQSFIHNPSLTVTVPIAPGESDLE NMNAEELSSDSDSEYSKVRLNRSSSSECSTVDNPLPGEGEEAEAEPMNSDEPEACFTD GCVRRFSCCQVNIESGKGKIWWNIRKTCYKIVEHSWFESFIVLMILLSSGALAFEDIY IERKKTIKIILEYADKIFTYIFILEMLLKWIAYGYKTYFTNAWCWLDFLIVDVSLVTL VANTLGYSDLGPIKSLRTLRALRPLRALSRFEGMRVVVNALIGAIPSIMNVLLVCLIF WLIFSIMGVNLFAGKFYECINTTDGSRFPASQVPNRSECFALMNVSQNVRWKNLKVNF DNVGLGYLSLLQVATFKGWTIIMYAAVDSVNVDKQPKYEYSLYMYIYFVVFIIFGSFF TLNLFIGVIIDNFNQQKKKLGGQDIFMTEEQKKYYNAMKKLGSKKPQKPIPRPGNKIQ GCIFDLVTNQAFDISIMVLICLNMVTMMVEKEGQSQHMTEVLYWINVVFIILFTGECV LKLISLRHYYFTVGWNIFDFVVVIISIVGMFLADLIETYFVSPTLFRVIRLARIGRIL RLVKGAKGIRTLLFALMMSLPALFNIGLLLFLVMFIYAIFGMSNFAYVKKEDGINDMF NFETFGNSMICLFQITTSAGWDGLLAPILNSKPPDCDPKKVHPGSSVEGDCGNPSVGI FYFVSYIIISFLVVVNMYIAVILENFSVATEESTEPLSEDDFEMFYEVWEKFDPDATQ FIEFSKLSDFAAALDPPLLIAKPNKVQLIAMDLPMVSGDRIHCLDILFAFTKRVLGES GEMDSLRSQMEERFMSANPSKVSYEPITTTLKRKQEDVSATVIQRAYRRYRLRQNVKN ISSIYIKDGDRDDDLLNKKDMAFDNVNENSSPEKTDATSSTTSPPSYDSVTKPDKEKY EQDRTEKEDKGKDSKESKK" BASE COUNT 1948 a 1231 c 1374 g 1818 t ORIGIN 1 ctcttatgtg aggagctgaa gaggaattaa aatatacagg atgaaaagat ggcaatgttg 61 cctcccccag gacctcagag ctttgtccat ttcacaaaac agtctcttgc cctcattgaa 121 caacgcattg ctgaaagaaa atcaaaggaa cccaaagaag aaaagaaaga tgatgatgaa 181 gaagccccaa agccaagcag tgacttggaa gctggcaaac aactgccctt catctatggg 241 gacattcctc ccggcatggt gtcagagccc ctggaggact tggaccccta ctatgcagac 301 aaaaagactt tcatagtatt gaacaaaggg aaaacaatct tccgtttcaa tgccacacct 361 gctttatata tgctttctcc tttcagtcct ctaagaagaa tatctattaa gattttagta 421 cactccttat tcagcatgct catcatgtgc actattctga caaactgcat atttatgacc 481 atgaataacc cgccggactg gaccaaaaat gtcgagtaca cttttactgg aatatatact 541 tttgaatcac ttgtaaaaat ccttgcaaga ggcttctgtg taggagaatt cacttttctt 601 cgtgacccgt ggaactggct ggattttgtc gtcattgttt ttgcgtattt aacagaattt 661 gtaaacctag gcaatgtttc agctcttcga actttcagag tattgagagc tttgaaaact 721 atttctgtaa tcccaggcct gaagacaatt gtaggggctt tgatccagtc agtgaagaag 781 ctttctgatg tcatgatcct gactgtgttc tgtctgagtg tgtttgcact aattggacta 841 cagctgttca tgggaaacct gaagcataaa tgttttcgaa attcacttga aaataatgaa 901 acattagaaa gcataatgaa taccctagag agtgaagaag actttagaaa atatttttat 961 tacttggaag gatccaaaga tgctctcctt tgtggtttca gcacagattc aggtcagtgt 1021 ccagaggggt acacctgtgt gaaaattggc agaaaccctg attatggcta cacgagcttt 1081 gacactttca gctgggcctt cttagccttg tttaggctaa tgacccaaga ttactgggaa 1141 aacctttacc aacagacgct gcgtgctgct ggcaaaacct acatgatctt ctttgtcgta 1201 gtgattttcc tgggctcctt ttatctaata aacttgatcc tggctgtggt tgccatggca 1261 tatgaagaac agaaccaggc aaacattgaa gaagctaaac agaaagaatt agaatttcaa 1321 cagatgttag accgtcttaa aaaagagcaa gaagaagctg aggcaattgc agcggcagcg 1381 gctgaatata caagtattag gagaagcaga attatgggcc tctcagagag ttcttctgaa 1441 acatccaaac tgagctctaa aagtgctaaa gaaagaagaa acagaagaaa gaaaaagaat 1501 caaaagaagc tctccagtgg agaggaaaag ggagatgctg agaaattgtc gaaatcagaa 1561 tcagaggaca gcatcagaag aaaaagtttc caccttggtg tcgaagggca taggcgagca 1621 catgaaaaga ggttgtctac ccccaatcag tcaccactca gcattcgtgg ctccttgttt 1681 tctgcaaggc gaagcagcag aacaagtctt tttagtttca aaggcagagg aagagatata 1741 ggatctgaga ctgaatttgc cgatgatgag cacagcattt ttggagacaa tgagagcaga 1801 aggggctcac tgtttgtgcc ccacagaccc caggagcgac gcagcagtaa catcagccaa 1861 gccagtaggt ccccaccaat gctgccggtg aacgggaaaa tgcacagtgc tgtggactgc 1921 aacggtgtgg tctccctggt tgatggacgc tcagccctca tgctccccaa tggacagctt 1981 ctgccagagg gcacgaccaa tcaaatacac aagaaaaggc gttgtagttc ctatctcctt 2041 tcagaggata tgctgaatga tcccaacctc agacagagag caatgagtag agcaagcata 2101 ttaacaaaca ctgtggaaga acttgaagag tccagacaaa aatgtccacc ttggtggtac 2161 agatttgcac acaaattctt gatctggaat tgctctccat attggataaa attcaaaaag 2221 tgtatctatt ttattgtaat ggatcctttt gtagatcttg caattaccat ttgcatagtt 2281 ttaaacacat tatttatggc tatggaacac cacccaatga ctgaggaatt caaaaatgta 2341 cttgctatag gaaatttggt ctttactgga atctttgcag ctgaaatggt attaaaactg 2401 attgccatgg atccatatga gtatttccaa gtaggctgga atatttttga cagccttatt 2461 gtgactttaa gtttagtgga gctctttcta gcagatgtgg aaggattgtc agttctgcga 2521 tcattcagac tgctccgagt cttcaagttg gcaaaatcct ggccaacatt gaacatgctg 2581 attaagatca ttggtaactc agtaggggct ctaggtaacc tcaccttagt gttggccatc 2641 atcgtcttca tttttgctgt ggtcggcatg cagctctttg gtaagagcta caaagaatgt 2701 gtctgcaaga tcaatgatga ctgtacgctc ccacggtggc acatgaacga cttcttccac 2761 tccttcctga ttgtgttccg cgtgctgtgt ggagagtgga tagagaccat gtgggactgt 2821 atggaggtcg ctggtcaagc tatgtgcctt attgtttaca tgatggtcat ggtcattgga 2881 aacctggtgg tcctaaacct atttctggcc ttattattga gctcatttag ttcagacaat 2941 cttacagcaa ttgaagaaga ccctgatgca aacaacctcc agattgcagt gactagaatt 3001 aaaaagggaa taaattatgt gaaacaaacc ttacgtgaat ttattctaaa agcattttcc 3061 aaaaagccaa agatttccag ggagataaga caagcagaag atctgaatac taagaaggaa 3121 aactatattt ctaaccatac acttgctgaa atgagcaaag gtcacaattt cctcaaggaa 3181 aaagataaaa tcagtggttt tggaagcagc gtggacaaac acttgatgga agacagtgat 3241 ggtcaatcat ttattcacaa tcccagcctc acagtgacag tgccaattgc acctggggaa 3301 tccgatttgg aaaatatgaa tgctgaggaa cttagcagtg attcggatag tgaatacagc 3361 aaagtgagat taaaccggtc aagctcctca gagtgcagca cagttgataa ccctttgcct 3421 ggagaaggag aagaagcaga ggctgaacct atgaattccg atgagccaga ggcctgtttc 3481 acagatggtt gtgtacggag gttctcatgc tgccaagtta acatagagtc agggaaagga 3541 aaaatctggt ggaacatcag gaaaacctgc tacaagattg ttgaacacag ttggtttgaa 3601 agcttcattg tcctcatgat cctgctcagc agtggtgccc tggcttttga agatatttat 3661 attgaaagga aaaagaccat taagattatc ctggagtatg cagacaagat cttcacttac 3721 atcttcattc tggaaatgct tctaaaatgg atagcatatg gttataaaac atatttcacc 3781 aatgcctggt gttggctgga tttcctaatt gttgatgttt ctttggttac tttagtggca 3841 aacactcttg gctactcaga tcttggcccc attaaatccc ttcggacact gagagcttta 3901 agacctctaa gagccttatc tagatttgaa ggaatgaggg tcgttgtgaa tgcactcata 3961 ggagcaattc cttccatcat gaatgtgcta cttgtgtgtc ttatattctg gctgatattc 4021 agcatcatgg gagtaaattt gtttgctggc aagttctatg agtgtattaa caccacagat 4081 gggtcacggt ttcctgcaag tcaagttcca aatcgttccg aatgttttgc ccttatgaat 4141 gttagtcaaa atgtgcgatg gaaaaacctg aaagtgaact ttgataatgt cggacttggt 4201 tacctatctc tgcttcaagt tgcaactttt aagggatgga cgattattat gtatgcagca 4261 gtggattctg ttaatgtaga caagcagccc aaatatgaat atagcctcta catgtatatt 4321 tattttgtcg tctttatcat ctttgggtca ttcttcactt tgaacttgtt cattggtgtc 4381 atcatagata atttcaacca acagaaaaag aagcttggag gtcaagacat ctttatgaca 4441 gaagaacaga agaaatacta taatgcaatg aaaaagctgg ggtccaagaa gccacaaaag 4501 ccaattcctc gaccagggaa caaaatccaa ggatgtatat ttgacctagt gacaaatcaa 4561 gcctttgata ttagtatcat ggttcttatc tgtctcaaca tggtaaccat gatggtagaa 4621 aaggagggtc aaagtcaaca tatgactgaa gttttatatt ggataaatgt ggtttttata 4681 atccttttca ctggagaatg tgtgctaaaa ctgatctccc tcagacacta ctacttcact 4741 gtaggatgga atatttttga ttttgtggtt gtgattatct ccattgtagg tatgtttcta 4801 gctgatttga ttgaaacgta ttttgtgtcc cctaccctgt tccgagtgat ccgtcttgcc 4861 aggattggcc gaatcctacg tctagtcaaa ggagcaaagg ggatccgcac gctgctcttt 4921 gctttgatga tgtcccttcc tgcgttgttt aacatcggcc tcctgctctt cctggtcatg 4981 ttcatctacg ccatctttgg aatgtccaac tttgcctatg ttaaaaagga agatggaatt 5041 aatgacatgt tcaattttga gacctttggc aacagtatga tttgcctgtt ccaaattaca 5101 acctctgctg gctgggatgg attgctagca cctattctta acagtaagcc acccgactgt 5161 gacccaaaaa aagttcatcc tggaagttca gttgaaggag actgtggtaa cccatctgtt 5221 ggaatattct actttgttag ttatatcatc atatccttcc tggttgtggt gaacatgtac 5281 attgcagtca tactggagaa ttttagtgtt gccactgaag aaagtactga acctctgagt 5341 gaggatgact ttgagatgtt ctatgaggtt tgggagaagt ttgatcccga tgcgacccag 5401 tttatagagt tctctaaact ctctgatttt gcagctgccc tggatcctcc tcttctcata 5461 gcaaaaccca acaaagtcca gctcattgcc atggatctgc ccatggttag tggtgaccgg 5521 atccattgtc ttgacatctt atttgctttt acaaagcgtg ttttgggtga gagtggggag 5581 atggattctc ttcgttcaca gatggaagaa aggttcatgt ctgcaaatcc ttccaaagtg 5641 tcctatgaac ccatcacaac cacactaaaa cggaaacaag aggatgtgtc tgctactgtc 5701 attcagcgtg cttatagacg ttaccgctta aggcaaaatg tcaaaaatat atcaagtata 5761 tacataaaag atggagacag agatgatgat ttactcaata aaaaagatat ggcttttgat 5821 aatgttaatg agaactcaag tccagaaaaa acagatgcca cttcatccac cacctctcca 5881 ccttcatatg atagtgtaac aaagccagac aaagagaaat atgaacaaga cagaacagaa 5941 aaggaagaca aagggaaaga cagcaaggaa agcaaaaaat agagcttcat ttttgatata 6001 ttgtttacag cctgtgaaag tgatttattt gtgttaataa aactcttttg aggaagtcta 6061 tgccaaaatc ctttttatca aaatattctc gaaggcagtg cagtcactaa ctctgatttc 6121 ctaagaaagg tgggcagcat tagcagatgg ttatttttgc actgatgatt ctttaagaat 6181 cgtaagagaa ctctgtagga attattgatt atagcataca aaagtgattg attcagtttt 6241 ttggttttta ataaatcaga agaccatgta gaaaactttt acatctgcct tgtcatcttt 6301 tcacaggatt gtaattagtc ttgtttccca tgtaaataaa caacacacgc atacagaaaa 6361 aaaaaaaaaa a // LOCUS HSVASP 2207 bp RNA PRI 13-FEB-1995 DEFINITION Homo sapiens encoding vasodilator-stimulated phosphoprotein (VASP). ACCESSION Z46389 NID g624963 KEYWORDS skeletal protein; vasodilator-stimulated phosphoprotein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2207) AUTHORS Walter,U. TITLE Direct Submission JOURNAL Submitted (25-OCT-1994) Walter U., Medizinische Universitaetsklinik, Klinische Forschergruppe, Josef-Schneider-Str. 2, 97080 Wuerzburg, Germany REFERENCE 2 (bases 1 to 2207) AUTHORS Haffner,C., Jarchau,T., Reinhard,M., Hoppe,J., Lohmann,S.M. and Walter,U. TITLE Molecular cloning, structural analysis and functional expression of the proline-rich focal adhesion and microfilament-associated protein VASP JOURNAL EMBO J. 14 (1), 19-27 (1995) MEDLINE 95129547 FEATURES Location/Qualifiers source 1..2207 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="promyelocytic leukemia" /cell_line="HL-60" /clone_lib="cDNA (P. Murphy, NIH, Washington, D.C.)" 5'UTR 1..254 CDS 255..1397 /function="proline-rich cytoskeletal protein" /codon_start=1 /product="vasodilator-stimulated phosphoprotein (VASP)" /db_xref="PID:g624964" /translation="MSETVICSSRATVMLYDDGNKRWLPAGTGPQAFSRVQIYHNPTA NSFRVVGRKMQPDQQVVINCAIVRGVKYNQATPNFHQWRDARQVWGLNFGSKEDAAQF AAGMASALEALEGGGPPPPPALPTWSVPNGPSPEEVEQQKRQQPGPSEHIERRVSNAG GPPAPPAGGPPPPPGPPPPPGPPPPPGLPPSGVPAAAHGAGGGPPPAPPLPAAQGPGG GGAGAPGLAAAIAGAKLRKVSKQEEASGGPTAPKAESGRSGGGGLMEEMNAMLARRRK ATQVGEKTPKDESANQEEPEARVPAQSESVRRPWEKNSTTLPRMKSSSSVTTSETQPC TPSSSDYSDLQRVKQELLEEVKKELQKVKEEIIEAFVQELRKRGSP" 3'UTR 1398..2207 polyA_signal 2177..2182 BASE COUNT 466 a 679 c 631 g 431 t ORIGIN 1 ccccttcctg tggggttcat tggggcatcc cctttctgct gcaggaacct ctcatcagac 61 cgcctgaggg aagcggcgcc cggagacccg ccccggcccg gtccacattc tccccaggaa 121 gccggactct atggggcggg accctggggg agcctgagcc gagcccggag ccagccccga 181 acccctgaac ctccagccag gggcgccccg ggagcagcca gcccgtgggc gagccgcccg 241 cccgccgagc agccatgagc gagacggtca tctgttccag ccgggccact gtgatgcttt 301 atgatgatgg caacaagcga tggctccctg ctggcacggg tccccaggcc ttcagccgcg 361 tccagatcta ccacaacccc acggccaatt cctttcgcgt cgtgggccgg aagatgcagc 421 ccgaccagca ggtggtcatc aactgtgcca tcgtccgggg tgtcaagtat aaccaggcca 481 cccccaactt ccatcagtgg cgcgacgctc gccaggtctg gggcctcaac ttcggcagca 541 aggaggatgc ggcccagttt gccgccggca tggccagtgc cctagaggcg ttggaaggag 601 gtgggccccc tccaccccca gcacttccca cctggtcggt cccgaacggc ccctccccgg 661 aggaggtgga gcagcagaaa aggcagcagc ccggcccgtc ggagcacata gagcgccggg 721 tctccaatgc aggaggccca cctgctcccc ccgctggggg tccaccccca ccaccaggac 781 ctccccctcc tccaggtccc cccccacccc caggtttgcc cccttcgggg gtcccagctg 841 cagcgcacgg agcaggggga ggaccacccc ctgcaccccc tctcccggca gcacagggcc 901 ctggtggtgg gggagctggg gccccaggcc tggccgcagc tattgctgga gccaaactca 961 ggaaagtcag caagcaggag gaggcctcag gggggcccac agcccccaaa gctgagagtg 1021 gtcgaagcgg aggtggggga ctcatggaag agatgaacgc catgctggcc cggagaagga 1081 aagccacgca agttggggag aaaaccccca aggatgaatc tgccaatcag gaggagccag 1141 aggccagagt cccggcccag agtgaatctg tgcggagacc ctgggagaag aacagcacaa 1201 ccttgccaag gatgaagtcg tcttcttcgg tgaccacttc cgagacccaa ccctgcacgc 1261 ccagctccag tgattactcg gacctacaga gggtgaaaca ggagcttctg gaagaggtga 1321 agaaggaatt gcagaaagtg aaagaggaaa tcattgaagc cttcgtccag gagctgagga 1381 agcggggttc tccctgacca cagggaccca gaagacccgc ttctcctttc cgcacacccg 1441 gcctgtcacc ctgctttccc tgcctctact tgacttggaa ttggctgaag acacaggaat 1501 gcatcgttcc cactccccat cccacttgga aaactccaag ggggtgtggc ttccctgctc 1561 acacccacac tggctgctga ttggctgggg aggcccccgc ccttttctcc ctttggtcct 1621 tcccctctgc catccccttg gggccggtcc ctctgctggg gatgcaccaa tgaaccccac 1681 aggaaggggg aaggaaggag ggaatttcac attcccttgt tctagattca ctttaacgct 1741 taatgccttc aaagttttgg tttttttaag aaaaaaaaat atatatatat ttgggttttg 1801 ggggaaaagg gaaatttttt tttctctttg gttttgataa aatgggatgt gggagttttt 1861 aaatgctata gccctgggct tgccccattt ggggcagcta tttaagggga ggggatgtct 1921 caccgggctg ggggtgagat atccccccac cccagggact ccccttccct ctggctcctt 1981 ccccttttct atgaggaaat aagatgctgt aactttttgg aacctcagtt ttttgatttt 2041 ttatttgggt aggttttggg gtccaggcca ttttttttac cccttggagg aaataagatg 2101 agggagaaag gagaagggga ggaaacttct cccctcccac cttcaccttt agcttcttga 2161 aaatgggccc ctgcagaata aatctgccag tttttataaa aaaaaaa // LOCUS HSVATPA 1433 bp RNA PRI 27-JAN-1994 DEFINITION H.sapiens mRNA for subunit C of vacuolar proton-ATPase V1 domain. ACCESSION X69151 NID g37642 KEYWORDS vacuolar proton-ATPase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1433) AUTHORS van Hille,B. TITLE Direct Submission JOURNAL Submitted (09-NOV-1992) B. Van Hille, CIBA-GEIGY, Research Dept, Pharmaceuticals Division, Ciba-Geigy Ltd, 4002 Basel, SWITZERLAND REFERENCE 2 (bases 1 to 1433) AUTHORS van Hille,B. JOURNAL Unpublished REFERENCE 3 (bases 1 to 1433) AUTHORS van Hille,B., Vanek,M., Richener,H., Green,J.R. and Bilbe,G. TITLE Cloning and tissue distribution of subunits C, D, and E of the human vacuolar H(+)-ATPase JOURNAL Biochem. Biophys. Res. Commun. 197 (1), 15-21 (1993) MEDLINE 94071935 FEATURES Location/Qualifiers source 1..1433 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 167..1315 /note="subunit C, VI domain" /codon_start=1 /product="vacuolar proton-ATPase" /db_xref="PID:g37643" /db_xref="SWISS-PROT:P21283" /translation="MTEFWLISAPGEKTCQQTWEKLHAATSKNNNLAVTSKFNIPDLK VGTLDVLVGLSDELAKLDAFVEGVVKKVAQYMADVLEDSKDKVQENLLANGVDLVTYI TRFQWDMAKYPIKQSLKNISEIIAKGVTQIDNDLKSRASAYNNLKGNLQNLERKNAGS LLTRSLAEIVKKDDFVLDSEYLVTLLVVVPKLNHNDWIKQYETLAEMVVPRSSNVLSE DQDSYLCNVTLFRKAVDDFRHKARENKFIVRDFQYNEEEMKADKEEMNRLSTDKKKQF GPLVRWLKVNFSEAFIAWIHVKALRVFVESVLRYGLPVNFQAMLLQPNKKTLKKLREV LHELYKHLDSSAAAIIDAPMDIPGLNLSQQEYYPYVYYKIDCNLLEFK" BASE COUNT 439 a 266 c 328 g 400 t ORIGIN 1 ggtagaggaa gccgtgaggc cggagcttag gtcgggaagg gatggatcgc tgagccgata 61 gcgtccgcta ggctgtctgc ctcggtacct gttactgctg ctacttcctc gtttgacacc 121 ttcctggaat ctctcttgat ttttgaggaa atacctagta acaaacatga ctgagttctg 181 gcttatatct gctcctgggg agaaaacctg tcagcaaaca tgggagaaat tgcatgcggc 241 aacttcaaag aacaataatc ttgctgtcac ttccaagttc aatattcctg acttaaaggt 301 tggcacgttg gatgtcttgg ttggcttgtc agatgaactg gctaaactgg atgcatttgt 361 agaaggagtg gttaagaaag tagctcaata catggctgat gtattggaag atagcaaaga 421 caaagttcaa gagaatctgt tggctaatgg agtggacttg gttacttata taacaaggtt 481 ccagtgggac atggccaaat atccaatcaa gcagtccctg aaaaatattt ctgaaataat 541 tgccaaggga gtaactcaga ttgataatga cctgaaatct cgagcatctg catacaataa 601 cctgaaagga aatcttcaga atttggaacg aaagaatgca ggaagtttgc taactagaag 661 tctagcagaa attgtgaaga aggatgactt tgttcttgat tcagagtatc tcgtcacatt 721 actggtagta gttcccaagt taaaccacaa cgactggatt aagcagtatg aaacactagc 781 cgaaatggta gttccaaggt ctagcaatgt tctttcagag gaccaagaca gttacctgtg 841 taatgtcacc ttgtttagga aggcagttga tgacttcaga cacaaagcca gagaaaacaa 901 attcattgtt cgtgacttcc agtataatga agaggagatg aaagcagata aagaagaaat 961 gaacaggctt tctactgata agaaaaaaca atttggacca cttgtacggt ggctgaaagt 1021 gaattttagt gaagcattta ttgcatggat tcacgtgaaa gcattacggg ttttcgttga 1081 gtctgtttta aggtatggct tgccagtgaa cttccaagca atgctacttc agcccaataa 1141 gaaaactttg aagaaactga gagaagtatt acatgaattg tataaacatc tagacagcag 1201 tgcagcagct attattgatg ctcctatgga tattccaggt ttaaacctga gtcaacaaga 1261 atactacccc tatgtgtact acaagattga ttgcaacttg ctggaattca agtgaaaatg 1321 ggctcctccc ccgacaatcc tgtccttgtg tttgtgtgtg ctaacagaaa taagttgcag 1381 tatggtcgta cttttaactc tagtatcctt tgcttgcttc ttaccccctt tcc // LOCUS HSVAVPO 2757 bp RNA PRI 17-FEB-1997 DEFINITION Human mRNA for vav oncogene. ACCESSION X16316 NID g37644 KEYWORDS glycoprotein; oncogene; phosphoprotein; vav oncogene; zinc finger protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2757) AUTHORS Katzav,S., Martin-Zanca,D. and Barbacid,M. TITLE vav, a novel human oncogene derived from a locus ubiquitously expressed in hematopoietic cells JOURNAL EMBO J. 8 (8), 2283-2290 (1989) MEDLINE 90005432 REFERENCE 2 (bases 1 to 106) AUTHORS Romero,F. TITLE Direct Submission JOURNAL Submitted (29-NOV-1994) F. Romero, INSERM, U-363, Hopital Cochin, 27, FBG. St. Jaxques, 75014 Paris, FRANCE COMMENT The nucleotides 1 - 167 are derived from pSV2neo. Data kindly reviewed (23-APR-1990) by Barbacid M. FEATURES Location/Qualifiers source 1..2757 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="3rd cycle nude mouse tumor" /clone_lib="lambda gt10" /clone="pSK65" promoter 1..17 /note="SV40 early promoter" misc_feature 18..167 /note="Tn5 bacterial transposase" gene 111..2504 /gene="vav" CDS 111..2504 /gene="vav" /codon_start=1 /product="VAV" /db_xref="PID:g37645" /db_xref="SWISS-PROT:P15498" /translation="MNVSYWAIWTRENASAKRKQFLCLKNIRTFLSTCCEKFGLKRSE LFEAFDLFDVQDFGKVIYTLSALSWTPIAQNRGIMPFPTEEESVGDEDIYSGLSDQID DTVEEDEDLYDCVENEEAEGDEIYEDLMRSEPVSMPPKMTEYDKRCCCLREIQQTEEK YTDTLGSIQQHFLKPLQRFLKPQDIEIIFINIEDLLRVHTHFLKEMKEALGTPGAPNL YQVFIKYKERFLVYGRYCSQVESASKHLDRVAAAREDVQMKLEECSQRANNGRFTARP ADGAYAASSQISPPSPGAGETHAGGDGARKLRLALDAMRDLAQCVNEVKRDNETLRQI TNFQLSIENLDQSLAHYGRPKIDGELKITSVERRSKMDRYAFLLDKALLICKRRGDSY DLKDFVNLHSFQVRDDSSGDRDNKKWSHMFLLIEDQGAQGYELFFKTRELKKKWMEQF EMAISNIYPENATANGHDFQMFSFEETTSCKACQMLLRGTFYQGYRCHRCRASAHKEC LGRVPPCGRHGQDFPGTMKKDKLHRRAQDKKRNELGLPKMEVFQEYYGLPPPPGAIGP FLRLNPGDIVELTKAEAEQNWWEGRNTSTNEIGWFPCNRVKPYVHGPPQDLSVHLWYA GPMERAGAESILANRSDGTFLVRQRVKDAAEFAISIKYNVEVKHTVKIMTAEGLYRIT EKKAFRGLTELVEFYQQNSLKDCFKSLDTTLQFPFKEPEKRTISRPAVGSTKYFGTAK ARYDFCARDRSELSLKEGDIIKILNKKGQQGWWRGEIYGRVGWFPANYVEEDYSEYC" misc_feature 168..2757 /note="vav human proto-oncogene" conflict 927^928 /gene="vav" /citation=[2] /replace="cc" conflict 1021 /gene="vav" /citation=[2] /replace="a" conflict 1023 /gene="vav" /citation=[2] /replace="g" conflict 1024^1025 /gene="vav" /citation=[2] /replace="a" conflict 1028 /gene="vav" /citation=[2] /replace="c" conflict 2116..2118 /gene="vav" /citation=[2] /replace="" polyA_signal 2739..2744 /note="potential" polyA_site 2757 BASE COUNT 693 a 713 c 809 g 542 t ORIGIN 1 ctaggctttt gcaaaaagct tcacgctgcc gcaagcactc agggcgcaag ggctgctaaa 61 ggaagcggaa cacgtagaaa gccagtccgc agaaacggtg ctgaccccgg atgaatgtca 121 gctactgggc tatctggaca agggaaaacg caagcgcaaa gagaaagcag ttcctgtgcc 181 ttaagaacat tagaaccttc ctgtccacct gctgtgagaa gttcggcctc aagcggagcg 241 agctcttcga agcctttgac ctcttcgatg tgcaggattt tggcaaggtc atctacaccc 301 tgtctgctct gtcctggacc ccgatcgccc agaacagggg gatcatgccc ttccccaccg 361 aggaggagag tgtaggtgat gaagacatct acagtggcct gtccgaccag atcgacgaca 421 cggtggagga ggatgaggac ctgtatgact gcgtggagaa tgaggaggcg gaaggcgacg 481 agatctatga ggacctcatg cgctcggagc ccgtgtccat gccgcccaag atgacagagt 541 atgacaagcg ctgctgctgc ctgcgggaga tccagcagac ggaggagaag tacactgaca 601 cgctgggctc catccagcag catttcttga agcccctgca acggttcctg aaacctcaag 661 acattgagat catctttatc aacattgagg acctgcttcg tgttcatact cacttcctaa 721 aggagatgaa ggaagccctg ggcacccctg gcgcaccgaa tctctaccag gtcttcatca 781 aatacaagga gaggttcctc gtctatggcc gctactgcag ccaggtggag tcagccagca 841 aacacctgga ccgtgtggcc gcagcccggg aggacgtgca gatgaagctg gaggaatgtt 901 ctcagagagc caacaacggg aggttcactg cgcgacctgc tgatggtgcc tatgcagcga 961 gttctcaaat atcacctcct tctccaggag ctggtgaaac acacgcagga ggcgatggag 1021 caaggaaact gcggctggcc ctggatgcca tgagggacct ggctcagtgc gtgaacgagg 1081 tcaagcgaga caacgagaca ctgcgacaga tcaccaattt ccagctgtcc attgagaacc 1141 tggaccagtc tctggctcac tatggccggc ccaagatcga cggggaactc aagatcacct 1201 cggtggaacg gcgctccaag atggacaggt atgccttcct gctcgacaaa gctctactca 1261 tctgtaagcg caggggagac tcctatgacc tcaaggactt tgtaaacctg cacagcttcc 1321 aggttcggga tgactcttca ggagaccgag acaacaagaa gtggagccac atgttcctcc 1381 tgatcgagga ccaaggtgcc cagggctatg agctgttctt caagacaaga gaattgaaga 1441 agaagtggat ggagcagttt gagatggcca tctccaacat ctatccggag aatgccaccg 1501 ccaacgggca tgacttccag atgttctcct ttgaggagac cacatcctgc aaggcctgtc 1561 agatgctgct tagaggtacc ttctatcagg gctaccgctg ccatcggtgc cgggcatctg 1621 cacacaagga gtgtctgggg agggtccctc catgtggccg acatgggcaa gatttcccag 1681 gaactatgaa gaaggacaaa ctacatcgca gggctcagga caaaaagagg aatgagctgg 1741 gtctgcccaa gatggaggtg tttcaggaat actacgggct tcctccaccc cctggagcca 1801 ttggaccctt tctacggctc aaccctggag acattgtgga gctcacgaag gctgaggctg 1861 aacagaactg gtgggagggc agaaatacat ctactaatga aattggctgg tttccttgta 1921 acagggtgaa gccctatgtc catggccctc ctcaggacct gtctgttcat ctctggtacg 1981 caggccccat ggagcgggca ggggcagaga gcatcctggc caaccgctcg gacgggactt 2041 tcttggtgcg gcagagggtg aaggatgcag cagaatttgc catcagcatt aaatataacg 2101 tcgaggtcaa gcacacggtt aaaatcatga cagcagaagg actgtaccgg atcacagaga 2161 aaaaggcttt ccgggggctt acggagctgg tggagtttta ccagcagaac tctctaaagg 2221 attgcttcaa gtctctggac accaccttgc agttcccctt caaggagcct gaaaagagaa 2281 ccatcagcag gccagcagtg ggaagcacaa agtattttgg cacagccaaa gcccgctatg 2341 acttctgcgc ccgtgaccgt tcagagctgt cgctcaagga gggtgacatc atcaagatcc 2401 ttaacaagaa gggacagcaa ggctggtggc gaggggagat ctatggccgg gttggctggt 2461 tccctgccaa ctacgtggag gaagattatt ctgaatactg ctgagccctg gtgccttggc 2521 agagagacga gaaactccag gctctgagcc cggcgtggcg aggcagcgga ccaggggctg 2581 tgacagctcc ggcgggtgga gactttggga tggactggag gaggccagcg tccagctggc 2641 ggtgctcccg ggatgtgccc tgacatggtt aatttataac accccgattt tcctcttggg 2701 tcccctcaag cagacggggg ctcaaggggg ttacatttaa taaaaggatg aagatgg // LOCUS HSVCAM1 2220 bp RNA PRI 31-MAR-1995 DEFINITION Human mRNA for vascular cell adhesion molecule 1 (VCAM-1). ACCESSION X53051 NID g37648 KEYWORDS cell adhesion molecule; vascular cell adhesion molecule; vascular cell adhesion molecule 1. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2220) AUTHORS Polte,T.R. TITLE Direct Submission JOURNAL Submitted (21-MAY-1990) Polte T.R., Otsuka America Pharmaceutical, Inc.,, 9900 Medical Center Drive, Rockville, MD 20850, USA REFERENCE 2 (bases 1 to 929; 1204 to 2220) AUTHORS Polte,T., Newman,W. and Gopal,T.V. TITLE Full length vascular cell adhesion molecule 1 (VCAM-1) JOURNAL Nucleic Acids Res. 18 (19), 5901 (1990) MEDLINE 91016951 COMMENT See M30257 for related sequence. FEATURES Location/Qualifiers source 1..2220 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="umbilical vein" /cell_type="endothelial" sig_peptide 1..24 /note="signal peptide" CDS 1..2220 /codon_start=1 /product="precursor peptide" /db_xref="PID:g37649" /db_xref="SWISS-PROT:P19320" /translation="MPGKMVVILGASNILWIMFAASQAFKIETTPESRYLAQIGDSVS LTCSTTGCESPFFSWRTQIDSPLNGKVTNEGTTSTLTMNPVSFGNEHSYLCTATCESR KLEKGIQVEIYSFPKDPEIHLSGPLEAGKPITVKCSVADVYPFDRLEIDLLKGDHLMK SQEFLEDADRKSLETKSLEVTFTPVIEDIGKVLVCRAKLHIDEMDSVPTVRQAVKELQ VYISPKNTVISVNPSTKLQEGGSVTMTCSSEGLPAPEIFWSKKLDNGNLQHLSGNATL TLIAMRMEDSGIYVCEGVNLIGKNRKEVELIVQEKPFTVEISPGPRIAAQIGDSVMLT CSVMGCESPSFSWRTQIDSPLSGKVRSEGTNSTLTLSPVSFENEHSYLCTVTCGHKKL EKGIQVELYSFPRDPEIEMSGGLVNGSSVTVSCKVPSVYPLDRLEIELLKGETILENI EFLEDTDMKSLENKSLEMTFIPTIEDTGKALVCQAKLHIDDMEFEPKQRQSTQTLYVN VAPRDTTVLVSPSSILEEGSSVNMTCLSQGFPAPKILWSRQLPNGELQPLSENATLTL ISTKMEDSGVYLCEGINQAGRSRKEVELIIQVTPKDIKLTAFPSESVKEGDTVIISCT CGNVPETWIILKKKAETGDTVLKSIDGAYTIRKAQLKDAGVYECESKNKVGSQLRSLT LDVQGRENNKDYFSPELLVLYFASSLIIPAIGMIIYFARKANMKGSYSLVEAQKSKV" mat_peptide 25..2217 /note="vascular cell adhesion molecule 1" repeat_region 109..927 /note="internal repeat 1" repeat_region 973..1791 /note="internal repeat 2" BASE COUNT 671 a 442 c 529 g 578 t ORIGIN 1 atgcctggga agatggtcgt gatccttgga gcctcaaata tactttggat aatgtttgca 61 gcttctcaag cttttaaaat cgagaccacc ccagaatcta gatatcttgc tcagattggt 121 gactccgtct cattgacttg cagcaccaca ggctgtgagt ccccattttt ctcttggaga 181 acccagatag atagtccact gaatgggaag gtgacgaatg aggggaccac atctacgctg 241 acaatgaatc ctgttagttt tgggaacgaa cactcttacc tgtgcacagc aacttgtgaa 301 tctaggaaat tggaaaaagg aatccaggtg gagatctact cttttcctaa ggatccagag 361 attcatttga gtggccctct ggaggctggg aagccgatca cagtcaagtg ttcagttgct 421 gatgtatacc catttgacag gctggagata gacttactga aaggagatca tctcatgaag 481 agtcaggaat ttctggagga tgcagacagg aagtccctgg aaaccaagag tttggaagta 541 acctttactc ctgtcattga ggatattgga aaagttcttg tttgccgagc taaattacac 601 attgatgaaa tggattctgt gcccacagta aggcaggctg taaaagaatt gcaagtctac 661 atatcaccca agaatacagt tatttctgtg aatccatcca caaagctgca agaaggtggc 721 tctgtgacca tgacctgttc cagcgagggt ctaccagctc cagagatttt ctggagtaag 781 aaattagata atgggaatct acagcacctt tctggaaatg caactctcac cttaattgct 841 atgaggatgg aagattctgg aatttatgtg tgtgaaggag ttaatttgat tgggaaaaac 901 agaaaagagg tggaattaat tgttcaagag aaaccattta ctgttgagat ctcccctgga 961 ccccggattg ctgctcagat tggagactca gtcatgttga catgtagtgt catgggctgt 1021 gaatccccat ctttctcctg gagaacccag atagacagcc ctctgagcgg gaaggtgagg 1081 agtgagggga ccaattccac gctgaccctg agccctgtga gttttgagaa cgaacactct 1141 tatctgtgca cagtgacttg tggacataag aaactggaaa agggaatcca ggtggagctc 1201 tactcattcc ctagagatcc agaaatcgag atgagtggtg gcctcgtgaa tgggagctct 1261 gtcactgtaa gctgcaaggt tcctagcgtg tacccccttg accggctgga gattgaatta 1321 cttaaggggg agactattct ggagaatata gagtttttgg aggatacgga tatgaaatct 1381 ctagagaaca aaagtttgga aatgaccttc atccctacca ttgaagatac tggaaaagct 1441 cttgtttgtc aggctaagtt acatattgat gacatggaat tcgaacccaa acaaaggcag 1501 agtacgcaaa cactttatgt caatgttgcc cccagagata caaccgtctt ggtcagccct 1561 tcctccatcc tggaggaagg cagttctgtg aatatgacat gcttgagcca gggctttcct 1621 gctccgaaaa tcctgtggag caggcagctc cctaacgggg agctacagcc tctttctgag 1681 aatgcaactc tcaccttaat ttctacaaaa atggaagatt ctggggttta tttatgtgaa 1741 ggaattaacc aggctggaag aagcagaaag gaagtggaat taattatcca agttactcca 1801 aaagacataa aacttacagc ttttccttct gagagtgtca aagaaggaga cactgtcatc 1861 atctcttgta catgtggaaa tgttccagaa acatggataa tcctgaagaa aaaagcggag 1921 acaggagaca cagtactaaa atctatagat ggcgcctata ccatccgaaa ggcccagttg 1981 aaggatgcgg gagtatatga atgtgaatct aaaaacaaag ttggctcaca attaagaagt 2041 ttaacacttg atgttcaagg aagagaaaac aacaaagact atttttctcc tgagcttctc 2101 gtgctctatt ttgcatcctc cttaataata cctgccattg gaatgataat ttactttgca 2161 agaaaagcca acatgaaggg gtcatatagt cttgtagaag cacagaaatc aaaagtgtag // LOCUS HSVD3HYD 2107 bp RNA PRI 03-NOV-1993 DEFINITION H.sapiens CYP 27 mRNA for vitamin D3 25-hydroxylase. ACCESSION X59812 NID g414120 KEYWORDS hydroxylase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2107) AUTHORS Guo,Y. TITLE Direct Submission JOURNAL Submitted (26-MAY-1991) Y. Guo, Queen's University, Dept. of Biochemistry, Kingston, Ontario, K7L-3N6, Canada REFERENCE 2 (bases 1 to 2107) AUTHORS Guo,Y.D., Strugnell,S., Back,D.W. and Jones,G. TITLE Transfected human liver cytochrome P-450 hydroxylates vitamin D analogs at different side-chain positions JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90 (18), 8668-8672 (1993) MEDLINE 93391416 FEATURES Location/Qualifiers source 1..2107 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /cell_type="hepatocyte" /cell_line="HepG2" gene 200..1793 /gene="CYP 27" sig_peptide 200..296 /gene="CYP 27" /note="Vitamin D3 25-hydroxylase" CDS 201..1793 /gene="CYP 27" /codon_start=1 /product="Vitamin D3 25-hydroxylase" /db_xref="PID:g414121" /translation="MAALGCARLRWALRGAGRGSAPTGRAKAAIPAALPSDKATGAPG AGPGVRRRQRSLEEIPRLGQLRFFFQLFVQGYALQLHQLQVLYKAKYGPMWMSYLGPQ MHVNLASAPLLEQVMRQEGKYPVRNDMELWKEHRDQHDLTYGPFTTEGHHWYQLRQAL NQRLLKPAERALYTDAFNEVIDDFMTRLDQLRAESASGNQVSDMAQLFYYFALEAICY ILFEKRIGCLQRSIPEDTVTFVRSIGLMFQNSLYATFLPKWTRPVLPFWKRYLDGWNA IFSFGKKLIDEKLEDMEAQLQAAGPDGIQVSGYLHFLLASGQLSPREAMGSLPELLMA GVDTTSNTLTWALYHLSKDPEIQEALHEEVVGVVPAGQVPQHKDFAHMPLLKAVLKET LRLYPVVPTNSRIIEKEIEVDGFLFPKNTQFVFCHYVVSRDPTAFSEPESFQPHRWLR NSQPATPRIQHPFGSVPFGYGVRACLGRRIAELEMQLLLARLIQKYKVVLAPETGELK SVARIVLVPNKKVGLQFLQRQC" mat_peptide 297..1790 /gene="CYP 27" /product="Vitamin D3 25-hydroxylase" protein_bind 1147..1191 /gene="CYP 27" /bound_moiety="substrate" misc_binding 1623..1625 /gene="CYP 27" /bound_moiety="heme" BASE COUNT 462 a 610 c 607 g 428 t ORIGIN 1 gtggatatcc ccgagtcacc gcgtccctct cctgcagctc ccgcgtcgct gggaggagcg 61 agggagcgag cgggaagggg tctagctggc ctttgctcgg ccctccccag cgcccggctt 121 tgaacccgcc ctgcactgct gtctgggcgg gtccggggac tcagcactcg acccaaaggt 181 gcaggcgcgc gagacaaccc atggctgcgc tgggctgcgc gaggctgagg tgggcgctgc 241 gaggggccgg ccgtggctct gcccccacgg gcagagccaa ggccgcgatc cctgccgccc 301 tcccctcgga caaggccacc ggagctcccg gagccgggcc tggtgtccgg cggcggcaac 361 ggagcttaga ggagattcca cgtctaggac agctgcgctt cttctttcag ctgttcgttc 421 aaggctatgc cctgcaactg caccagttac aggtgcttta caaggccaag tacggtccaa 481 tgtggatgtc ctacttaggg cctcagatgc acgtgaacct ggccagtgcc ccgctcttgg 541 agcaagtgat gcggcaagag ggcaagtacc cagtacggaa cgacatggag ctatggaagg 601 agcaccggga ccagcacgac ctgacctatg ggccgttcac cacggaagga caccactggt 661 accagctgcg ccaggctctg aaccagcggt tgctgaagcc agcggaacga gcgctctata 721 cggatgcttt caatgaggtg attgatgact ttatgactcg actggaccag ctgcgggcag 781 agagtgcttc ggggaaccag gtgtcggaca tggctcaact cttctactac tttgccttgg 841 aagctatttg ctacatcctg ttcgagaaac gcattggctg cctgcagcga tccatccccg 901 aggacaccgt gaccttcgtc agatccatcg ggttaatgtt ccagaactca ctctatgcca 961 ccttcctccc caagtggact cgccccgtgc tgcctttctg gaagcgatac ctggatggtt 1021 ggaatgccat cttttccttt gggaagaagc tgattgatga gaagctcgaa gatatggagg 1081 cccaactgca ggcagcaggg ccagatggca tccaggtgtc tggctacctg cacttcttac 1141 tggccagtgg acagctcagt cctcgggagg ccatgggcag cctgcctgag ctgctcatgg 1201 ctggagtgga cacgacatcc aacacgctga catgggccct gtaccacctc tcaaaggacc 1261 ctgagatcca ggaggccttg cacgaggaag tggtgggtgt ggtgccagcc gggcaagtgc 1321 cccagcacaa ggactttgcc cacatgccgt tgctcaaagc tgtgcttaag gagactctgc 1381 gtctctaccc tgtggtcccc acaaactccc ggatcataga aaaggaaatt gaagttgatg 1441 gcttcctctt ccccaagaac acccagtttg tgttctgcca ctatgtggtg tcccgggacc 1501 ccactgcctt ctctgagcct gaaagcttcc agccccaccg ctggctgaga aacagccagc 1561 ctgctacccc caggatccag cacccatttg gctctgtgcc ctttggctat ggggtccggg 1621 cctgcctggg ccgcaggatt gcagagctgg agatgcagct actcctcgca aggctgatcc 1681 agaagtacaa ggtggtcctg gccccggaga ccggggagtt gaagagtgtg gcccgcattg 1741 tcctggttcc caataagaaa gtgggcctgc agttcctgca gagacagtgc tgagctgagt 1801 ctccgccttg ctggggcttg tcctagaggc tccagctctg gcacagtggt tcctggctgc 1861 tgccatgtct cagatgagga gggagagaag gaggccgcca gactcgagag gtgggaggaa 1921 ctccttgcac acaccctgag cttttgccac ttctatcatt tttgagcaac tccctctcag 1981 ctaaaaggcc acccctttat cgcattgctg tccttgggta gaatataaaa taaagggact 2041 tttatttctt attggaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2101 aaaaaaa // LOCUS HSVD3R 1335 bp RNA PRI 25-SEP-1992 DEFINITION H.sapiens mRNA for 1,25-dihydroxyvitamin D-3 receptor. ACCESSION X67482 NID g37653 KEYWORDS 1,25-dihydroxyvitamin D3 receptor; steroid/thyroid hormone receptor superfamily. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1335) AUTHORS Goto,H., Chen,K.S., Prahl,J.M. and DeLuca,H.F. TITLE A single receptor identical with that from intestine/T47D cells mediates the action of 1,25-dihydroxyvitamin D-3 in HL-60 cells JOURNAL Biochim. Biophys. Acta 1132 (1), 103-108 (1992) MEDLINE 92379083 COMMENT Related sequence: J03258. This sequence contains one base difference ('c' instead of 't') from the human VDR sequence cloned from T47D human breast cancer cell line at base 1106 (base 1056 from initiation codon). FEATURES Location/Qualifiers source 1..1335 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="HL-60 promyelocytic leukemia cells" /clone_lib="lambda gt11 cDNA library" /clone="S3C10P211a (base 1059-1137) and S11C7P1.4 (base 543-3'UTR)" mRNA <1..>1335 /gene="HL-60 VDR" gene 1..1335 /gene="HL-60 VDR" CDS 51..1334 /gene="HL-60 VDR" /codon_start=1 /product="HL-60 1,25-dihydroxyvitamin D-3 receptor" /db_xref="PID:g37654" /db_xref="SWISS-PROT:P11473" /translation="MEAMAASTSLPDPGDFDRNVPRICGVCGDRATGFHFNAMTCEGC KGFFRRSMKRKALFTCPFNGDCRITKDNRRHCQACRLKRCVDIGMMKEFILTDEEVQR KREMILKRKEEEALKDSLRPKLSEEQQRIIAILLDAHHKTYDPTYSDFCQFRPPVRVN DGGGSHPSRPNSRHTPSFSGDSSSSCSDHCITSSDMMDSSSFSNLDLSEEDSDDPSVT LELSQLSMLPHLADLVSYSIQKVIGFAKMIPGFRDLTSEDQIVLLKSSAIEVIMLRSN ESFTMDDMSWTCGNQDYKYRVSDVTKAGHSLELIEPLIKFQVGLKKLNLHEEEHVLLM AICIVSPDRPGVQDAALIEAIQDRLSNTLQTYIRCRHPPPGSHLLYAKMIQKLADLRS LNEEHSKQYRCLSFQPECSMKLTPLVLEVFGNEIS" BASE COUNT 300 a 416 c 351 g 268 t ORIGIN 1 acagaagagc acccctgggc tccacttacc tgccccctgc tccttcaggg atggaggcaa 61 tggcggccag cacttccctg cctgaccctg gagactttga ccggaacgtg ccccggatct 121 gtggggtgtg tggagaccga gccactggct ttcacttcaa tgctatgacc tgtgaaggct 181 gcaaaggctt cttcaggcga agcatgaagc ggaaggcact attcacctgc cccttcaacg 241 gggactgccg catcaccaag gacaaccgac gccactgcca ggcctgccgg ctcaaacgct 301 gtgtggacat cggcatgatg aaggagttca ttctgacaga tgaggaagtg cagaggaagc 361 gggagatgat cctgaagcgg aaggaggagg aggccttgaa ggacagtctg cggcccaagc 421 tgtctgagga gcagcagcgc atcattgcca tactgctgga cgcccaccat aagacctacg 481 accccaccta ctccgacttc tgccagttcc ggcctccagt tcgtgtgaat gatggtggag 541 ggagccatcc ttccaggccc aactccagac acactcccag cttctctggg gactcctcct 601 cctcctgctc agatcactgt atcacctctt cagacatgat ggactcgtcc agcttctcca 661 atctggatct gagtgaagaa gattcagatg acccttctgt gaccctagag ctgtcccagc 721 tctccatgct gccccacctg gctgacctgg tcagttacag catccaaaag gtcattggct 781 ttgctaagat gataccagga ttcagagacc tcacctctga ggaccagatc gtactgctga 841 agtcaagtgc cattgaggtc atcatgttgc gctccaatga gtccttcacc atggacgaca 901 tgtcctggac ctgtggcaac caagactaca agtaccgcgt cagtgacgtg accaaagccg 961 gacacagcct ggagctgatt gagcccctca tcaagttcca ggtgggactg aagaagctga 1021 acttgcatga ggaggagcat gtcctgctca tggccatctg catcgtctcc ccagatcgtc 1081 ctggggtgca ggacgccgcg ctgatcgagg ccatccagga ccgcctgtcc aacacactgc 1141 agacgtacat ccgctgccgc cacccgcccc cgggcagcca cctgctctat gccaagatga 1201 tccagaagct agccgacctg cgcagcctca atgaggagca ctccaagcag taccgctgcc 1261 tctccttcca gcctgagtgc agcatgaagc taacgcccct tgtgctcgaa gtgtttggca 1321 atgagatctc ctgac // LOCUS HSVECAD 4000 bp RNA PRI 21-SEP-1995 DEFINITION H.sapiens VE-cadherin mRNA. ACCESSION X79981 NID g599833 KEYWORDS VE-cadherin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4000) AUTHORS Breviario,F., Caveda,L., Corada,M., Martin-Padura,I., Navarro,P., Golay,J., Introna,M., Gulino,D., Lampugnani,M.G. and Dejana,E. TITLE Functional properties of human vascular endothelial cadherin (7B4/cadherin-5), an endothelium-specific cadherin JOURNAL Arterioscler. Thromb. Vasc. Biol. 15 (8), 1229-1239 (1995) MEDLINE 95353875 REFERENCE 2 (bases 1 to 4000) AUTHORS Breviario,F. TITLE Direct Submission JOURNAL Submitted (30-JUN-1994) F. Breviario, Istituto di Ricerche Farmacologiche, 'Mario Negri', Via Eritrea 62, 20157 Milan, ITALY FEATURES Location/Qualifiers source 1..4000 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="endothelium" /cell_type="HUVEC" gene 25..2379 /gene="VE-cadherin" CDS 25..2379 /gene="VE-cadherin" /codon_start=1 /db_xref="PID:g599834" /translation="MQRLMMLLATSGACLGLLAVAAVAAAGANPAQRDTHSLLPTHRR QKRDWIWNQMHIDEEKNTSLPHHVGKIKSSVSRKNAKYLLKGEYVGKVFRVDAETGDV FAIERLDRENISEYHLTAVIVDKDTGENLETPSSFTIKVHDVNDNWPVFTHRLFNASV PESSAVGTSVISVTAVDADDPTVGDHASVMYQILKGKEYFAIDNSGRIITITKSLDRE KQARYEIVVEARDAQGLRGDSGTATVLVTLQDINDNFPFFTQTKYTFVVPEDTRVGTS VGSLFVEDPDEPQNRMTKYSILRGDYQDAFTIETNPAHNEGIIKPMKPLDYEYIQQYS FIVEATDPTIDLRYMSPPAGNRAQVIINITDVDEPPIFQQPFYHFQLKENQKKPLIGT VLAMDPDAARHSIGYSIRRTSDKGQFFRVTKKGDIYNEKELDREVYPWYNLTVEAKEL DSTGTPTGKESIVQVHIEVLDENDNAPEFAKPYQPKVCENAVHGQLVLQISAIDKDIT PRNVKFKFTLNTENNFTLTDNHDNTANITVKYGQFDREHTKVHFLPVVISDNGMPSRT GTSTLTVAVCKCNEQGEFTFCEDMAAQVGVSIQAVVAILLCILTITVITLLIFLRRRL RKQARAHGKSVPEIHEQLVTYDEEGGGEMDTTSYDVSVLNSVRRGGAKPPRPALDARP SLYAQVQKPPRHAPGAHGGPGEMAAMIEVKKDEADHDGDGPPYDTLHIYGYEGSESIA ESLSSLGTDSSDSDVDYDFLNDWGPRFKMLAELYGSDPREELLY" polyA_signal 3965..3970 polyA_site 3994..4000 BASE COUNT 998 a 1160 c 1015 g 827 t ORIGIN 1 gcacgatctg ttcctcctgg gaagatgcag aggctcatga tgctcctcgc cacatcgggc 61 gcctgcctgg gcctgctggc agtggcagca gtggcagcag caggtgctaa ccctgcccaa 121 cgggacaccc acagcctgct gcccacccac cggcgccaaa agagagattg gatttggaac 181 cagatgcaca ttgatgaaga gaaaaacacc tcacttcccc atcatgtagg caagatcaag 241 tcaagcgtga gtcgcaagaa tgccaagtac ctgctcaaag gagaatatgt gggcaaggtc 301 ttccgggtcg atgcagagac aggagacgtg ttcgccattg agaggctgga ccgggagaat 361 atctcagagt accacctcac tgctgtcatt gtggacaagg acactggtga aaacctggag 421 actccttcca gcttcaccat caaagttcat gacgtgaacg acaactggcc tgtgttcacg 481 catcggttgt tcaatgcgtc cgtgcctgag tcgtcggctg tggggacctc agtcatctct 541 gtgacagcag tggatgcaga cgaccccact gtgggagacc acgcctctgt catgtaccaa 601 atcctgaagg ggaaagagta ttttgccatc gataattctg gacgtattat cacaataacg 661 aaaagcttgg accgagagaa gcaggccagg tatgagatcg tggtggaagc gcgagatgcc 721 cagggcctcc ggggggactc gggcacggcc accgtgctgg tcactctgca agacatcaat 781 gacaacttcc ccttcttcac ccagaccaag tacacatttg tcgtgcctga agacacccgt 841 gtgggcacct ctgtgggctc tctgtttgtt gaggacccag atgagcccca gaaccggatg 901 accaagtaca gcatcttgcg gggcgactac caggacgctt tcaccattga gacaaacccc 961 gcccacaacg agggcatcat caagcccatg aagcctctgg attatgaata catccagcaa 1021 tacagcttca tcgtcgaggc cacagacccc accatcgacc tccgatacat gagccctccc 1081 gcgggaaaca gagcccaggt cattatcaac atcacagatg tggacgagcc ccccattttc 1141 cagcagcctt tctaccactt ccagctgaag gaaaaccaga agaagcctct gattggcaca 1201 gtgctggcca tggaccctga tgcggctagg catagcattg gatactccat ccgcaggacc 1261 agtgacaagg gccagttctt ccgagtcaca aaaaaggggg acatttacaa tgagaaagaa 1321 ctggacagag aagtctaccc ctggtataac ctgactgtgg aggccaaaga actggattcc 1381 actggaaccc ccacaggaaa agaatccatt gtgcaagtcc acattgaagt tttggatgag 1441 aatgacaatg ccccggagtt tgccaagccc taccagccca aagtgtgtga gaacgctgtc 1501 catggccagc tggtcctgca gatctccgca atagacaagg acataacacc acgaaacgtg 1561 aagttcaaat tcaccttgaa tactgagaac aactttaccc tcacggataa tcacgataac 1621 acggccaaca tcacagtcaa gtatgggcag tttgaccggg agcataccaa ggtccacttc 1681 ctacccgtgg tcatctcaga caatgggatg ccaagtcgca cgggcaccag cacgctgacc 1741 gtggccgtgt gcaagtgcaa cgagcagggc gagttcacct tctgcgagga tatggccgcc 1801 caggtgggcg tgagcatcca ggcagtggta gccatcttac tctgcatcct caccatcaca 1861 gtgatcaccc tgctcatctt cctgcggcgg cggctccgga agcaggcccg cgcgcacggc 1921 aagagcgtgc cggagatcca cgagcagctg gtcacctacg acgaggaggg cggcggcgag 1981 atggacacca ccagctacga tgtgtcggtg ctcaactcgg tgcgccgcgg cggggccaag 2041 cccccgcggc ccgcgctgga cgcccggcct tccctctatg cgcaggtgca gaagccaccg 2101 aggcacgcgc ctggggcaca cggagggccc ggggagatgg cagccatgat cgaggtgaag 2161 aaggacgagg cggaccacga cggcgacggc cccccctacg acacgctgca catctacggc 2221 tacgagggct ccgagtccat agccgagtcc ctcagctccc tgggcaccga ctcatccgac 2281 tctgacgtgg attacgactt ccttaacgac tggggaccca ggtttaagat gctggctgag 2341 ctgtacggct cggacccccg ggaggagctg ctgtattagg cggccgaggt cactctgggc 2401 ctggggaccc aaaccccctg cagcccaggc cagtcagact ccaggcacca cagcctccaa 2461 aaatggcagt gactccccag cccagcaccc cttcctcgtg ggtcccagag acctcatcag 2521 ccttgggata gcaaactcca ggttcctgaa atatccagga atatatgtca gtgatgacta 2581 ttctcaaatg ctggcaaatc caggctggtg ttctgtctgg gctcagacat ccacataacc 2641 ctgtcaccca cagaccgccg tctaactcaa agacttcctc tggctcccca aggctgcaaa 2701 gcaaaacaga ctgtgtttaa ctgctgcagg gtctttttct agggtccctg aacgccctgg 2761 taaggctggt gaggtcctgg tgcctatctg cctggaggca aaggcctgga cagcttgact 2821 tgtggggcag gattctctgc agcccattcc caagggagac tgaccatcat gccctctctc 2881 gggagcccta gccctgctcc aactccatac tccactccaa gtgccccacc actccccaac 2941 ccctctccag gcctgtcaag agggaggaag gggccccatg gcagctcctg accttgggtc 3001 ctgaagtgac ctcactggcc tgccatgcca gtaactgtgc tgtactgagc actgaaccac 3061 attcagggaa atgcttatta aaccttgaag caactgtgaa ttcattctgg aggggcagtg 3121 gagatcagga gtgacagatc acagggtgag ggccacctcc acacccaccc cctctggaga 3181 aggcctggaa gagctgagac cttgctttga gactcctcag cacccctcca gttttgcctg 3241 agaaggggca gatgttcccg gagatcagaa gacgtctccc cttctctgcc tcacctggtc 3301 gccaatccat gctctctttc ttttctctgt ctactcctta tcccttggtt tagaggaacc 3361 caagatgtgg cctttagcaa aactgacaat gtccaaaccc actcatgact gcatgacgga 3421 gccgagcatg tgtctttaca cctcgctgtt gtcacatctc agggaactga ccctcaggca 3481 caccttgcag aaggaaggcc ctgccctgcc caacctctgt ggtcacccat gcatcattcc 3541 actggaacgt ttcactgcaa acacaccttg gagaagtggc atcagtcaac agagaggggc 3601 agggaaggag acaccaagct cacccttcgt catggaccga ggttcccact ctggcaaagc 3661 ccctcacact gcaagggatt gtagataaca ctgacttgtt tgttttaacc aataactagc 3721 ttcttataat gattttttta ctaatgatac ttacaagttt ctagctctca cagacatata 3781 gaataagggt ttttgcataa taagcaggtt gttatttagg ttaacaatat taattcaggt 3841 tttttagttg gaaaaacaat tcctgtaacc ttctattttc tataattgta gtaattgctc 3901 tacagataat gtctatatat tggccaaact ggtgcatgac aagtactgta tttttttata 3961 cctaaataaa gaaaaatctt tagcctgggc aacaaaaaaa // LOCUS HSVHATPE 1319 bp RNA PRI 01-FEB-1994 DEFINITION H.sapiens mRNA for vacuolar H+ ATPase E subunit. ACCESSION X76228 NID g452657 KEYWORDS ATPase epsilon subunit; vacuolar H+ ATPase E subunit. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1319) AUTHORS Lipinski,M. TITLE Direct Submission JOURNAL Submitted (02-NOV-1993) M. Lipinski, Lab. de Biologie des Tumeurs Humaines, CNRS URA 1156, Inst. Gustave Roussy, 94805 Villeuif Cedex, FRANCE REFERENCE 2 (bases 1 to 1319) AUTHORS Baud,V., Mears,A.J., Lamour,V., Scamps,C., Duncan,A.M., McDermid,H.E. and Lipinski,M. TITLE The E subunit of vacuolar H(+)-ATPase localizes close to the centromere on human chromosome 22 JOURNAL Hum. Mol. Genet. 3 (2), 335-339 (1994) MEDLINE 94272476 FEATURES Location/Qualifiers source 1..1319 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /tissue_type="Ewing tumour" /cell_line="IARC-EW11" /clone="61EW" /chromosome="22q11" CDS 76..756 /codon_start=1 /product="vacuolar H+ ATPase E subunit" /db_xref="PID:g452658" /db_xref="SWISS-PROT:P36543" /translation="MALSDADVQKQIKHMMAFIEQEANEKAEEIDAKAEEEFNIEKGR LVQTQRLKIMEYYEKKEKQIEQQKKIQMSNLMNQARLKVLRARDDLITDLLNEAKQRL SKVVKDTTRYQVLLDGLVLQGLYQLLEPRMIVRCRKQDFPLVKAAVQKAIPMYKIATK NDVDVQIDQESYLPEDIAGGVEIYNGDRKIKVSNTLESRLDLIAQQMMPEVRGALFGA NANRKFLD" polyA_signal 1243..1248 BASE COUNT 415 a 279 c 303 g 322 t ORIGIN 1 ttgccgattt ctctcacctc acctttcaaa cctaaactcg agcctgctgt tcaccggcct 61 agcattgctc tcgccatggc tctcagcgat gctgacgtgc aaaagcagat aaagcatatg 121 atggctttca ttgaacaaga agccaatgag aaagcagaag aaatagatgc aaaggcagaa 181 gaagagttca acatagagaa aggtcggctt gtgcaaaccc aaagactaaa gattatggaa 241 tattatgaga agaaagagaa acagattgag cagcagaaga aaattcagat gtccaatttg 301 atgaatcaag cgagactcaa agtcctcaga gcaagagatg accttatcac agacctacta 361 aatgaagcaa aacagagact cagcaaggtg gtaaaagata caaccaggta ccaagtgctg 421 ctggatggac tggttctcca gggtttgtac cagttgctgg agccccgaat gattgttcgt 481 tgcaggaaac aagatttccc tctggtaaag gctgcagtgc agaaggcaat tcctatgtac 541 aaaattgcca ccaaaaacga tgttgatgtc caaattgacc aggagtccta cctgcctgaa 601 gacatagctg gtggagttga gatctataat ggagatcgta aaataaaggt ttccaacacc 661 ctggaaagcc ggctggatct catagcccag cagatgatgc cagaagtccg gggagccttg 721 tttggtgcaa atgccaacag gaagtttttg gactaagcct tcaggaggtg gagttcgtcg 781 tcagctctcc tgctgtgatg tggaagcttc tgatatttga agaaacacga atgtctctgt 841 agcttcctct tcactgcccc agtattgctc tgtatttatc agcgatgccc ctctgtcact 901 catgccttgc ctaattgttc acaatggtgg aaagcttcat gtaatatgat caggacccac 961 ctccagttct tctgaaagtg tgacagtgtc cagccggttc tgcagcacta ggggaggggg 1021 caaatggtgg ttgcatgggc ttcctgggtc tccactctcc gtctggccta aaggtgatgt 1081 atttggtgtt tggccctgca gtccccactc ttgaggctta aggcgcatgt ggcacaccac 1141 tccttccagc agtagtcgct ttactgttac ctgtttaggc ctagaagttt tccctcatct 1201 gtaaatgtga tttaaaatct aagccatgaa tatgctttat ttattaaaag agttatgcgg 1261 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaa // LOCUS HSVHATPI2 2594 bp RNA PRI 02-APR-1992 DEFINITION H.sapiens mRNA for isoform 2 of vacuolar H+ATPase Mr 56,000 subunit. ACCESSION X62949 NID g37793 KEYWORDS H+ ATPase subunit; vacuolar H(+)-ATPase; vacuolar H(+)-ATPase Mr 56,000 subunit. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2594) AUTHORS Guo,X., Masood,K. and Gluck,S.L. TITLE Complete sequence of isoform 2 of human vacuolar H+ATPase Mr 56,000 subunit cDNA and deduced primary protein structure JOURNAL Unpublished REFERENCE 2 (bases 1 to 2594) AUTHORS Bernasconi,P., Rausch,T., Struve,I., Morgan,L. and Taiz,L. TITLE An mRNA from human brain encodes an isoform of the B subunit of the vacuolar H(+)-ATPase JOURNAL J. Biol. Chem. 265 (29), 17428-17431 (1990) MEDLINE 91009188 REMARK (sites) REFERENCE 3 (bases 1 to 2594) AUTHORS Gluck,S.L. TITLE Direct Submission JOURNAL Submitted (31-OCT-1991) S.L. Gluck, Jewish Hospital, Washington University School of Medicine, Renal Division Dept of Medicine, 216 South Kingshighway Blvd, St Louis MO 63110, USA COMMENT . FEATURES Location/Qualifiers source 1..2594 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" mRNA 1..2594 5'UTR 1..25 CDS 26..1561 /codon_start=1 /product="vacuolar isoform 2 of H+ATPase Mr 56,000 subunit" /db_xref="PID:g37794" /db_xref="SWISS-PROT:P21281" /translation="MALRAMRGIVNGAAPELPVPTGGPAVGSREQALAVSRNYLSQPR LTYKTVSGVNGPLVILDHVKFPRYAEIVHLTLPDGTKRSGQVLEVSGSKAVVQVFEGT SGIDAKKTSCEFTGDILRTPVSEDMLGRVFNGSGKPIDRGPVVLAEDFLDIMGQPINP QCRIYPEEMIQTGISAIDGMNSIARGQKIPIFSAAGLPHNEIAAQICRQAGLVKKSKD VVDYSEENFAIVFAAMGVNMETARFFKSDFEENGSMDNVCLFLNLANDPTIERIITPR LALTTAEFLAYQCEKHVLVILTDMSSYAEALREVSAAREEVPGRRGFPGYMYTDLATI YERAGRVEGRNGSITQIPILTMPNDDITHPIPDLTGYITEGQIYVDRQLHNRQIYPPI NVLPSLSRLMKSAIGEGMTRKDHADVSNQLYACYAIGKDVQAMKAVVGEEALTSDDLL YLEFLQKFERNFIAQGPYENRTVFETLDIGWQLLRIFPKEMLKRIPQSTLSEFYPRDS AKH" 3'UTR 1562..2594 BASE COUNT 657 a 561 c 624 g 752 t ORIGIN 1 gaattccggg gacagaggag acaagatggc gctgcgggcg atgcggggga ttgtcaacgg 61 ggccgcaccc gagctacccg tccccaccgg tgggccggcg gtgggatctc gggagcaggc 121 gctggcagtc agtcggaact acctctccca gcctcgcctc acatacaaga cagtatctgg 181 agtcaatggt ccactagtga tcttagatca tgttaagttt cccaggtatg ctgaaattgt 241 ccatttgacc ttaccggatg gcacaaagag aagtgggcaa gttctggaag ttagtggttc 301 caaggcagta gttcaggtat ttgaagggac ttcaggtata gatgctaaga aaacgtcctg 361 tgagtttact ggggatattc tccgaacacc ggtgtctgag gatatgcttg gtcgggtatt 421 caatggatcg ggaaaaccca ttgacagagg tcctgttgta ctggccgaag acttccttga 481 tatcatgggt cagccaatca accctcaatg tcgaatctac ccagaggaaa tgattcagac 541 tggcatttcg gccatcgatg ggatgaacag tattgctagg gggcagaaaa ttcctatctt 601 ctctgctgct gggctaccac acaatgagat tgcagctcag atctgtcgcc aggctggttt 661 ggtaaagaaa tccaaagatg tagtagacta cagtgaggaa aattttgcaa ttgtatttgc 721 tgctatgggt gtaaacatgg aaactgcccg gttcttcaaa tctgactttg aagaaaatgg 781 ctcaatggac aatgtctgcc tctttttgaa cttggctaat gacccaacca ttgagcgaat 841 tatcactcct cgcctggctc taaccacagc tgaatttctg gcgtaccaat gtgagaaaca 901 tgtattggtt attctaacag acatgagttc ttatgctgaa gcacttcgag aggtttcagc 961 agccagggaa gaggtacctg gtcgacgagg ttttccaggt tacatgtata cagatttagc 1021 cacgatatat gaacgcgctg ggcgagtgga agggagaaac ggctcgatta ctcaaatccc 1081 tattctaacc atgcctaatg atgatatcac tcaccccatc ccagacttga ctggctacat 1141 tacagagggg cagatctatg tggacagaca gctgcacaac agacagattt atccacctat 1201 caatgtgctg ccctcactat cacggttaat gaagtctgct attggagaag ggatgaccag 1261 gaaggatcat gccgatgtat ctaaccagct atatgcgtgc tatgctattg gaaaggatgt 1321 gcaagccatg aaagctgtcg ttggagaaga agcccttacc tcagatgatc ttctctactt 1381 ggaatttctg cagaagtttg agaggaactt cattgctcag ggtccttacg aaaatcgcac 1441 tgtctttgag actttggaca ttggctggca gctactccga atcttcccca aagaaatgct 1501 gaagagaatc cctcagagca ccctcagcga attttaccct cgagactctg caaagcatta 1561 gctgctgctt ctgcattgct ccgcgctctt gtgaaatact ggttctgttt tctttattcc 1621 ttttgcactc tcggttccca cctttgtgtt ggagtttacc atgttaccct gtaattaaaa 1681 acaaagaata ggtaacatat tgtgccagtg ttgcaacgtt ttaaactgct aacagacctt 1741 aaaatatccc cctacctggg tcctcagtgc tatgtttaaa gtgctgcagg gatggagtgg 1801 cgttttctta ttgctgtatg tattgtacat agtggagtag ttagttacct gataacagtc 1861 ttgttatttg ggtctcttag accttacctc tcaactccct caagagtacc agtctctgaa 1921 gttataatgc tttggtctct acattagggg caagatccag tctgagagaa gtctcctttg 1981 agaagggcca agaggctctt tcctgagtgt ttgctttcgg tttgttggta tgcctgtatt 2041 gctgggctgt gctgctgctc gaagcagatg gttttgactg tctttttgct ctttcctata 2101 taatgaatag atgagtgaaa ggagttttct ttttctcttt agtacttacg tattgggatt 2161 cctgtgtctt acagctctcc ctctccaaat aatacacaga atcctgcaac tttttgcaca 2221 gctggtatct gtctggtagc agtgagaccc cttgtcttgg tgatccttac tgggtttcca 2281 agcagaggag tcacatgatt acaattgcca gtagagttgt tgtttggggt acaagatgag 2341 aagaaagaaa aacctacagc ctttctacat tctgacatgc taacagtggt ttaagtttct 2401 aaagtgttta ccagatgctg aaggcaaggg gagggagcag aagcacttat gtttacggat 2461 attttaaact ctgttagaga gcagcctttg aaaatcccca atttggttct gctttttgac 2521 ctctctctac cttttcaggg taatctttgt ggcacaaacg atagcatttc caagctttag 2581 agttttctga attc // LOCUS HSVHNF1 2816 bp RNA PRI 02-NOV-1993 DEFINITION Human mRNA for variant hepatic nuclear factor 1 (vHNF1). ACCESSION X58840 NID g414047 KEYWORDS DNA-binding protein; hepatic nuclear factor 1; transcription factor; variant hepatic nuclear factor 1. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2816) AUTHORS Bach,I. TITLE Direct Submission JOURNAL Submitted (09-APR-1991) I. Bach, Inst Pasteur - UA 1149 du CNRS, Unite de Virus Oncogene, Dept de Biotechnologie, 25 Rue du Dr Roux, 75724 Paris Cedex 15, FRANCE REFERENCE 2 (bases 1 to 2816) AUTHORS Bach,I., Mattei,M.G., Cereghini,S. and Yaniv,M. TITLE Two members of an HNF1 homeoprotein family are expressed in human liver JOURNAL Nucleic Acids Res. 19 (13), 3553-3559 (1991) MEDLINE 91305097 FEATURES Location/Qualifiers source 1..2816 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="liver" /clone="HV19, HV17" /chromosome="17" /map="17q12.2-q21.1" mRNA 1..2816 /note="variant hepatic nuclear factor 1 (vHNF1)" /evidence=experimental CDS 195..1868 /codon_start=1 /product="variant hepatic nuclear factor 1 (vHNF1)" /db_xref="PID:g414048" /db_xref="SWISS-PROT:P35680" /translation="MVSKLTSLQQELLSALLSSGVTKEVLVQALEELLPSPNFGVKLE TLPLSPGSGAEPDTKPVFHTLTNGHAKGRLSGDEGSEDGDDYDTPPILKELQALNTEE AAEQRAEVDRMLSEDPWRAAKMIKGYMQQHNIPQREVVDVTGLNQSHLSQHLNKGTPM KTQKRAALYTWYVRKQREILRQFNQTVQSSGNMTDKSSQDQLLFLFPEFSQQSHGPGQ SDDACSEPTNKKMRRNRFKWGPASQQILYQAYDRQKNPSKEEREALVEECNRAECLQR GVSPSKAHGLGSNLVTEVRVYNWFANRRKEEAFRQKLAMDAYSSNQTHSLNPLLSHGS PHHQPSSSPPNKLSGVRYSQQGNNEITSSSTISHHGNSAMVTSQSVLQQVSPASLDPG HNLLSPDGKMISVSGGGLPPVSTLTNIHSLSHHNPQQSQNLIMTPLSGVMAIAQSLNT SQAQSVPVINSVAGSLAALQPVQFSQQLHSPHQQPLMQQSPGSHMAQQPFMAAVTQLQ NSHMYAHKQEPPQYSHTSRFPSAMVVTDTSSISTLTNMSSSKQCPLQAW" misc_feature 878..1151 /note="homeobox" BASE COUNT 707 a 879 c 668 g 562 t ORIGIN 1 gtggcgatca tggcaagtta gaagttttct gactcctttc ggaggagcct ccgggacccc 61 ggggagtaac aggtgtctgg aggctgaagg gtggaggggt tcctggattt gggttttgct 121 tgtgaaactc ccctccaccc tcctctctcg cacccaccca ccccctcacc cccttctttt 181 tccgtccttg gaaaatggtg tccaagctca cgtcgctcca gcaagaactc ctgagcgccc 241 tgctgagctc cggggtcacc aaggaggtgc tggttcaggc cttggaggag ttgctgccat 301 ccccgaactt cggggtgaag ctggagacgc tgcccctgtc ccctggcagc ggggccgagc 361 ccgacaccaa gccggtcttc catactctca ccaacggcca cgccaagggc cgcttgtccg 421 gcgacgaggg ctccgaggac ggcgacgact atgacacacc tcccatcctc aaggagctgc 481 aggcgctcaa caccgaggag gcggcggagc agcgggcgga ggtggaccgg atgctcagtg 541 aggacccttg gagggctgct aaaatgatca agggttacat gcagcaacac aacatccccc 601 agagggaggt ggtcgatgtc accggcctga accagtcgca cctctcccag catctcaaca 661 agggcacccc tatgaagacc cagaagcgtg ccgctctgta cacctggtac gtcagaaagc 721 aacgagagat cctccgacaa ttcaaccaga cagtccagag ttctggaaat atgacagaca 781 aaagcagtca ggatcagctg ctgtttctct ttccagagtt cagtcaacag agccatgggc 841 ctgggcagtc cgatgatgcc tgctctgagc ccaccaacaa gaagatgcgc cgcaaccggt 901 tcaaatgggg gcccgcgtcc cagcaaatct tgtaccaggc ctacgatcgg caaaagaacc 961 ccagcaagga agagagagag gccttagtgg aggaatgcaa cagggcagaa tgtttgcagc 1021 gaggggtgtc cccctccaaa gcccacggcc tgggctccaa cttggtcact gaggtccgtg 1081 tctacaactg gtttgcaaac cgcaggaagg aggaggcatt ccggcaaaag ctggccatgg 1141 acgcctatag ctccaaccag actcacagcc tgaaccctct gctctcccac ggctcccccc 1201 accaccagcc cagctcctct cctccaaaca agctgtcagg agtgcgctac agccagcagg 1261 gaaacaatga gatcacttcc tcctcaacaa tcagtcacca tggcaacagc gccatggtga 1321 ccagccagtc ggttttacag caagtctccc cagccagcct ggacccaggc cacaatctcc 1381 tctcacctga tggtaaaatg atctcagtct caggaggagg tttgccccca gtcagcacct 1441 tgacgaatat ccacagcctc tcccaccata atccccagca atctcaaaac ctcatcatga 1501 cacccctctc tggagtcatg gcaattgcac aaagcctcaa cacctcccaa gcacagagtg 1561 tccctgtcat caacagtgtg gccggcagcc tggcagccct gcagcccgtc cagttctccc 1621 agcagctgca cagccctcac cagcagcccc tcatgcagca gagcccaggc agccacatgg 1681 cccagcagcc cttcatggca gctgtgactc agctgcagaa ctcacacatg tacgcacaca 1741 agcaggaacc cccccagtat tcccacacct cccggtttcc atctgcaatg gtggtcacag 1801 ataccagcag catcagtaca ctcaccaaca tgtcttcaag taaacagtgt cctctacaag 1861 cctggtgatg cccacacacc acttacttcg tgcgcaacaa caaggaccct gttttccaca 1921 ccatcaccct ctgggcagct gtcatggaaa agcccagtga cctgaccagc acctgcgaga 1981 ggtccctgct tacctgacgg acgtcctgct ggcacctcag acaatccact ctcaggagcg 2041 cagcccgaag cccagtttcc cttctatgca gtattgccac aatgcctctc ccacgatgtc 2101 aaggactcct gtctgtcctg gaggtgggag acaaggaacc tccgaagagg aagcaagaaa 2161 gccgtactgt ctatgttgtg atccttcatc gaacaaactg atgcgaaaac ttgaatctgt 2221 tactgaaatg aggagagaag gacatgtgct attgaactga gccaaacaca ctgtaaatat 2281 ccacagactc cctcccctgc ccccatccca aatgatcttg agatttcttt taaagaagta 2341 aatttgtcca atggctgtaa actataaact actgtaatta agtgcaattt cccctctgtg 2401 tcctctcccc tctgccctgt atataatact aaagtgtcta ttagttttct ttgtaaaggt 2461 cagagtcaaa atttcaaaag tgatctgtcc cctctcccct catggagaaa catcctaagt 2521 gggaagtgaa gccccttgtc ctctcccgcg aggcctggac acttatgggg acagcatacc 2581 ttggactgac taccagctaa ctccagtctc ctgacattaa gacacacctc tggatccctg 2641 gaggggctga atgtagtgtg tcagagtaac atgccagctt cctgtgggcc aggagctcag 2701 ccgtgcactc cctaagaaac cccagggcag ggaaactggc tgtttgatag cagaagaaaa 2761 agttgcagtc tcagaaagcc ttccattaaa acaatttatt ttatcactaa aaaaaa // LOCUS HSVIMENT 1766 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for vimentin. ACCESSION X56134 NID g37849 KEYWORDS intermediate filament protein; vimentin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1766) AUTHORS Honore,B. TITLE Direct Submission JOURNAL Submitted (04-OCT-1990) Honore B., Institute of Medical Biochemistry, Aarhus University, Ole Worms Alle', Build. 170, DK-8000 Aarhus C, Denmark REFERENCE 2 (bases 1 to 1766) AUTHORS Honore,B., Madsen,P., Basse,B., Andersen,A., Walbum,E., Celis,J.E. and Leffers,H. TITLE Nucleotide sequence of cDNA covering the complete coding part of the human vimentin gene JOURNAL Nucleic Acids Res. 18 (22), 6692 (1990) MEDLINE 91067467 COMMENT V2; Data kindly revied (26-N)V-1990) by Honore B. FEATURES Location/Qualifiers source 1..1766 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="SV40 transformed fibroblasts" /cell_line="MRC-5" /clone_lib="lambda gt-11" /clone="M98C1" CDS 44..1444 /note="vimentin" /codon_start=1 /db_xref="PID:g37850" /db_xref="SWISS-PROT:P08670" /translation="MSTRSVSSSSYRRMFGGPGTASRPSSSRSYVTTSTRTYSLGSAL RPSTSRSLYASSPGGVYATRSSAVRLRSSVPGVRLLQDSVDFSLADAINTEFKNTRTN EKVELQELNDRFANYIDKVRFLEQQNKILLAELEQLKGQGKSRLGDLYEEEMRELRRQ VDQLTNDKARVEVERDNLAEDIMRLREKLQEEMLQREEAENTLQSFRQDVDNASLARL DLERKVESLQEEIAFLKKLHEEEIQELQAQIQEQHVQIDVDVSKPDLTAALRDVRQQY ESVAAKNLQEAEEWYKSKFADLSEAANRNNDALRQAKQESTEYRRQVQSLTCEVDALK GTNESLERQMREMEENFAVEAANYQDTIGRLQDEIQNMKEEMARHLREYQDLLNVKMA LDIEIATYRKLLEGEESRISLPLPNFSSLNLRETNLDSLPLVDTHSKRTLLIKTVETR DGQVINETSQHHDDLE" BASE COUNT 479 a 482 c 449 g 356 t ORIGIN 1 cgcgccaccg ccgccgccca ggccatcgcc accctccgca gccatgtcca ccaggtccgt 61 gtcctcgtcc tcctaccgca ggatgttcgg cggcccgggc accgcgagcc ggccgagctc 121 cagccggagc tacgtgacta cgtccacccg cacctacagc ctgggcagcg cgctgcgccc 181 cagcaccagc cgcagcctct acgcctcgtc cccgggcggc gtgtatgcca cgcgctcctc 241 tgccgtgcgc ctgcggagca gcgtgcccgg ggtgcggctc ctgcaggact cggtggactt 301 ctcgctggcc gacgccatca acaccgagtt caagaacacc cgcaccaacg agaaggtgga 361 gctgcaggag ctgaatgacc gcttcgccaa ctacatcgac aaggtgcgct tcctggagca 421 gcagaataag atcctgctgg ccgagctcga gcagctcaag ggccaaggca agtcgcgcct 481 gggggacctc tacgaggagg agatgcggga gctgcgccgg caggtggacc agctaaccaa 541 cgacaaagcc cgcgtcgagg tggagcgcga caacctggcc gaggacatca tgcgcctccg 601 ggagaaattg caggaggaga tgcttcagag agaggaagcc gaaaacaccc tgcaatcttt 661 cagacaggat gttgacaatg cgtctctggc acgtcttgac cttgaacgca aagtggaatc 721 tttgcaagaa gagattgcct ttttgaagaa actccacgaa gaggaaatcc aggagctgca 781 ggctcagatt caggaacagc atgtccaaat cgatgtggat gtttccaagc ctgacctcac 841 ggctgccctg cgtgacgtac gtcagcaata tgaaagtgtg gctgccaaga acctgcagga 901 ggcagaagaa tggtacaaat ccaagtttgc tgacctctct gaggctgcca accggaacaa 961 tgacgccctg cgccaggcaa agcaggagtc cactgagtac cggagacagg tgcagtccct 1021 cacctgtgaa gtggatgccc ttaaaggaac caatgagtcc ctggaacgcc agatgcgtga 1081 aatggaagag aactttgccg ttgaagctgc taactaccaa gacactattg gccgcctgca 1141 ggatgagatt cagaatatga aggaggaaat ggctcgtcac cttcgtgaat accaagacct 1201 gctcaatgtt aagatggccc ttgacattga gattgccacc tacaggaagc tgctggaagg 1261 cgaggagagc aggatttctc tgcctcttcc aaacttttcc tccctgaacc tgagggaaac 1321 taatctggat tcactccctc tggttgatac ccactcaaaa aggacacttc tgattaagac 1381 ggttgaaact agagatggac aggttatcaa cgaaacttct cagcatcacg atgaccttga 1441 ataaaaattg cacacactca gtgcagcaat atattaccag caagaataaa aaagaaatcc 1501 atatcttaaa gaaacagctt tcaagtgcct ttctgcagtt tttcaggagc gcaagataga 1561 tttggaatag gaataagctc tagttcttaa caaccgacac tcctacaaga tttagaaaaa 1621 agtttacaac ataatctagt ttacagaaaa atcttgtgct agaatacttt ttaaaaggta 1681 ttttgaatac cattaaaact gctttttttt ttccagcaag tatccaacca acttggttct 1741 gcttcaataa atctttggaa aaacta // LOCUS HSVIPRE 2684 bp RNA PRI 27-JUN-1994 DEFINITION H.sapiens HIVR mRNA for vasoactive intestinal peptide (VIP) receptor. ACCESSION X75299 NID g407461 KEYWORDS vasoactive intestinal peptide receptor; VIP receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2684) AUTHORS Couvineau,A. TITLE Direct Submission JOURNAL Submitted (28-SEP-1993) A. Couvineau, INSERM U-239, Fac. Med. X. Bichat, 16 rue H. Huchard, 75018 Paris, FRANCE REFERENCE 2 (bases 1 to 2684) AUTHORS Couvineau,A., Rouyer-Fessard,C., Darmoul,D., Maoret,J.J., Carrero,I., Ogier-Denis,E. and Laburthe,M. TITLE Human intestinal VIP receptor: cloning and functional expression of two cDNA encoding proteins with different N-terminal domains JOURNAL Biochem. Biophys. Res. Commun. 200 (2), 769-776 (1994) MEDLINE 94235025 FEATURES Location/Qualifiers source 1..2684 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="Jejunum" /clone_lib="cDNA lambda gt11" /clone="HIVR8" gene 12..1394 /gene="HIVR" CDS 12..1394 /gene="HIVR" /codon_start=1 /product="intestinal VIP (Vasoactive intestinal peptide) receptor" /db_xref="PID:g407462" /db_xref="SWISS-PROT:P32241" /translation="MRPPSPLPARWLCVLAGALAWALGPAGGQAARLQEECDYVQMIE VQHKQCLEEAQLENETIGCSKMWDNLTCWPATPRGQVVVLACPLIFKLFSSIQGRNVS RSCTDEGWTHLEPGPYPIACGLDDKAASLDEQQTMFYGSVKTGYTIGYGLSLATLLVA TAILSLFRKLHCTRNYIHMHLFISFILRAAAVFIKDLALFDSGESDQCSEGSVGCKAA MVFFQYCVMANFFWLLVEGLYLYTLLAVSFFSERKYFWGYILIGWGVPSTFTMVWTIA RIHFEDYGLLRCWDTINSSLWWIIKGPILTSILVNFILFICIIRILLQKLRPPDIRKS DSSPYSRLARSTLLLIPLFGVHYIMFAFFPDNFKPEVKMVFELVVGSFQGFVVAILYC FLNGEVQAELRRKWRRWHLQGVLGWNPKYRHPSGGSNGATCSTQVSMLTRVSPGARRS SSFQAEVSLV" polyA_signal 2656..2661 BASE COUNT 549 a 820 c 700 g 615 t ORIGIN 1 cagggcagac catgcgcccg ccaagtccgc tgcccgcccg ctggctatgc gtgctggcag 61 gcgccctcgc ctgggccctt gggccggcgg gcggccaggc ggccaggctg caggaggagt 121 gtgactatgt gcagatgatc gaggtgcagc acaagcagtg cctggaggag gcccagctgg 181 agaatgagac aataggctgc agcaagatgt gggacaacct cacctgctgg ccagccaccc 241 ctcggggcca ggtagttgtc ttggcctgtc ccctcatctt caagctcttc tcctccattc 301 aaggccgcaa tgtaagccgc agctgcaccg acgaaggctg gacgcacctg gagcctggcc 361 cgtaccccat tgcctgtggt ttggatgaca aggcagcgag tttggatgag cagcagacca 421 tgttctacgg ttctgtgaag accggctaca ccattggcta cggcctgtcc ctcgccaccc 481 ttctggtcgc cacagctatc ctgagcctgt tcaggaagct ccactgcacg cggaactaca 541 tccacatgca cctcttcata tccttcatcc tgagggctgc cgctgtcttc atcaaagact 601 tggccctctt cgacagcggg gagtcggacc agtgctccga gggctcggtg ggctgtaagg 661 cagccatggt ctttttccaa tattgtgtca tggctaactt cttctggctg ctggtggagg 721 gcctctacct gtacaccctg cttgccgtct ccttcttctc tgagcggaag tacttctggg 781 ggtacatact catcggctgg ggggtaccca gcacattcac catggtgtgg accatcgcca 841 ggatccattt tgaggattat ggtctgctca ggtgctggga caccatcaac tcctcactgt 901 ggtggatcat aaagggcccc atcctcacct ccatcttggt aaacttcatc ctgtttattt 961 gcatcatccg aatcctgctt cagaaactgc ggcccccaga tatcaggaag agtgacagca 1021 gtccatactc aaggctagcc aggtccacac tcctgctgat ccccctgttt ggagtacact 1081 acatcatgtt cgccttcttt ccggacaatt ttaagcctga agtgaagatg gtctttgagc 1141 tcgtcgtggg gtctttccag ggttttgtgg tggctatcct ctactgcttc ctcaatggtg 1201 aggtgcaggc ggagctgagg cggaagtggc ggcgctggca cctgcagggc gtcctgggct 1261 ggaaccccaa ataccggcac ccgtcgggag gcagcaacgg cgccacgtgc agcacgcagg 1321 tttccatgct gacccgcgtc agcccaggtg cccgccgctc ctccagcttc caagccgaag 1381 tctccctggt ctgaccacca ggatcccagc ccaagcggcc cctcccgccc cttcccactc 1441 gcagcagacg ccggggacag aggcctgccc gggcgcgcca gccccggccc tgggctcgga 1501 ggctgccccc ggccccctgg tctctggtcc ggacactcct agagaacgca gccctagagc 1561 ctgcctggag cgtttctagc aagtgagaga gatgggagct cctctcctgg aggatgcagg 1621 tggaactcag tcattagact cctcctccaa aggcccccta cgccaatcaa gggcaaaaag 1681 tctacatact ttcatcctga ctctgccccc tgctggctct tctgcccaat tggaggaaag 1741 caaccggtgg atcctcaaac aacactggtg tgacctgagg gcagaaaggt tctgcccggg 1801 aaggtcacca gcaccaacac cacggtagtg cctgaaattt caccattgct gtcaagttcc 1861 tttgggttaa gcattaccac tcaggcattt gactgaagat gcagctcact accctattct 1921 ctctttacgc ttagttatca gctttttaaa gtgggttatt ctggagtttt tgtttggaga 1981 gcacacctat cttagtggtt ccccaccgaa gtggactggc ccctgggtca gtctggtggg 2041 aggacggtgc aacccaagga ctgagggact ctgaagcctc tgggaaatga gaaggcagcc 2101 accagcgaat gctaggtctc ggactaagcc tacctgctct ccaagtctca gtggcttcat 2161 ctgtcaagtg ggactctgtc acaccagcca ttcttatctc tctgtgctgt ggaagcaaca 2221 ggaatcaaga gactgccctc cttgtccacc cacctatgtg ccaactgttg taactaggct 2281 cagagatgtg cacccatggg ctctgacaga aagcagatcc tcaccctgct acacatacag 2341 gatttgaact cagatctgtc tgataggaat gtgaaagcac ggactcttac tgctaacttt 2401 tgtgtatcgt aaccagccag atcctcttgg ttatttgttt accacttgta ttattaatgc 2461 cattatccct gaattcccct tgccacccca ccctccctgg agtgtggctg aggaggcctc 2521 catctcatgt atcatctgga taggagcctg ctggtcacag cctcctctgt ctgcccttca 2581 ccccagtggc cactcagctt cctacccaca cctctgccag aagatcccct caggactgca 2641 acaggcttgt gcaacaataa atgttggctt ggaaaaaaaa aaaa // LOCUS HSVMT 1732 bp RNA PRI 08-DEC-1993 DEFINITION H.sapiens mRNA for vesicular monoamine transporter. ACCESSION X71354 NID g296188 KEYWORDS monoamine transporter; vesicular transport. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1732) AUTHORS Lesch,K.P., Gross,J., Wolozin,B.L., Murphy,D.L. and Riederer,P. TITLE Extensive sequence divergence between the human and rat brain vesicular monoamine transporter: possible molecular basis for species differences in the usceptibility to MPP+ JOURNAL J. Neural Transm. 93, 75-82 (1993) REFERENCE 2 (bases 1 to 1732) AUTHORS Lesch,K.P. TITLE Direct Submission JOURNAL Submitted (02-APR-1993) K.P. Lesch, Dept. of Psychiatry, University of Wuerzburg, Fuechsleinstr. 15, 8700 Wuerzburg, FRG FEATURES Location/Qualifiers source 1..1732 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /cell_type="platelet" CDS 64..1608 /codon_start=1 /product="vesicular monoamine transporter" /db_xref="PID:g296189" /translation="MALSELALVRWLQESRHSRKLILFIVFLALLLDNMLLTVVVPII PSYLYSIKHEKNATEIQTARPVHTASISDSFQSIFSYYDNSTMVTGNATRDLTLHQTA TQHMVTNASAVPSDCPSEDKDLLNENVQVGLLFASKATVQLITNPFIGLLTNRIGYPI PIFAGFCIMFVSTIMFAFSSSYAFLLIARSLQGIGSSCSSVAGMGMLASVYTDDEERG NVMGIALGGLAMGVLVGPPFGSVLYEFVGKTAPFLVLAALVLLDGAIQLFVLQPSRVQ PESQKGTPLTTLLKDPYILIAAGSICFANMGIAMLEPALPIWMMETMCSRKWQLGVAF LPASISYLIGTNIFGILAHKMGRWLCALLGMIIVGVSILCIPFAKNIYGLIAPNFGVG FAIGMVDSSMMPIMGYLVDLRHVSVYGSVYAIADVAFCMGYAIGPSAGGAIAKAIGFP WLMTIIGIIDILFAPLCFFLRSPPAKEEKMAILMDHNCPIKTKMYTQNNIQSYPIGED EESESD" BASE COUNT 401 a 449 c 433 g 449 t ORIGIN 1 cggcgttgcc ggcacgagac ccgggcaggc atcgcaagcg accccgagcg gagccccgga 61 gccatggccc tgagcgagct ggcgctggtc cgctggctgc aggagagccg ccactcgcgg 121 aagctcatcc tgttcatcgt gttcctggcg ctgctgctgg acaacatgct gctcactgtc 181 gtggtcccca tcatcccaag ttatctgtac agcattaagc atgagaagaa tgctacagaa 241 atccagacgg ccaggccagt gcacactgcc tccatctcag acagcttcca gagcatcttc 301 tcctattatg ataactcgac tatggtcacc gggaatgcta ccagagacct gacacttcat 361 cagaccgcca cacagcacat ggtgaccaac gcgtccgctg ttccttccga ctgtcccagt 421 gaagacaaag acctcctgaa tgaaaacgtg caagttggtc tgttgtttgc ctcgaaagcc 481 accgtccagc tcatcaccaa ccctttcata ggactactga ccaacagaat tggctatcca 541 attcccatat ttgcgggatt ctgcatcatg tttgtctcaa caattatgtt tgccttctcc 601 agcagctatg ccttcctgct gattgccagg tcgctgcagg gcatcggctc gtcctgctcc 661 tctgtggctg ggatgggcat gcttgccagt gtctacacag atgatgaaga gagaggcaac 721 gtcatgggaa tcgccttggg aggcctggcc atgggggtct tagtgggccc ccccttcggg 781 agtgtgctct atgagtttgt ggggaagacg gctccgttcc tggtgctggc cgccctggta 841 ctcttggatg gagctattca gctctttgtg ctccagccgt cccgggtgca gccagagagt 901 cagaagggga cacccctaac cacgctgctg aaggacccgt acatcctcat tgctgcaggc 961 tccatctgct ttgcaaacat gggcatcgcc atgctggagc cagccctgcc catctggatg 1021 atggagacca tgtgttcccg aaagtggcag ctgggcgttg ccttcttgcc agctagtatc 1081 tcttatctca ttggaaccaa tatttttggg atacttgcac acaaaatggg gaggtggctt 1141 tgtgctcttc tgggaatgat aattgttgga gtcagcattt tatgtattcc atttgcaaaa 1201 aacatttatg gactcatagc tccgaacttt ggagttggtt ttgcaattgg aatggtggat 1261 tcgtcaatga tgcctatcat gggctacctc gtagacctgc ggcacgtgtc cgtctatggg 1321 agtgtgtacg ccattgcgga tgtggcattt tgtatggggt atgctatagg tccttctgct 1381 ggtggtgcta ttgcaaaggc aattggattt ccatggctca tgacaattat tgggataatt 1441 gatattcttt ttgcccctct ctgctttttt cttcgaagtc cacctgccaa agaagaaaaa 1501 atggctattc tcatggatca caactgccct attaaaacaa aaatgtacac tcagaataat 1561 atccagtcat atccgatagg tgaagatgaa gaatctgaaa gtgactgaga tgagatcctc 1621 aaaaatcatc aaagtgttta attgtataaa acagtgtttc cagtgacaca actcatccag 1681 aactgtctta gtcataccat ccatccctgg tgaaagagta aaacccaaag gt // LOCUS HSVPAM92 836 bp RNA PRI 01-NOV-1997 DEFINITION Homo sapiens mRNA for vacuolar proton-ATPase subunit M9.2. ACCESSION Y15286 NID g2584788 KEYWORDS vacuolar proton-ATPase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 836) AUTHORS Ludwig,J., Kerscher,S., Brandt,U., Pfeiffer,K., Getlawi,F., Apps,D.K. and Schagger,H. TITLE Identification and characterization of a novel 9.2 kDa membrane sector subunit of vacuolar proton-ATPase from chromaffin granules JOURNAL Unpublished REFERENCE 2 (bases 1 to 836) AUTHORS Ludwig,J.H. TITLE Direct Submission JOURNAL Submitted (29-OCT-1997) J.H. Ludwig, Universitaetsklinikum Frankfurt, Gustav Embden-Zentrum der Biologischen Chemie (ZBC), Institut fuer Biochemie I, Theodor Stern-Kai 7 Haus 25 B, 60590 Frankfurt, FRG FEATURES Location/Qualifiers source 1..836 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="Female" /tissue_type="placenta" /clone_lib="Soares placenta Nb2HP" /clone="I.M.A.G.E. EST ID:143553" promoter 60..66 /gene="M9.2" gene 60..815 /gene="M9.2" CDS 63..308 /gene="M9.2" /codon_start=1 /product="vacuolar proton-ATPase subunit M9.2" /db_xref="PID:e1169569" /db_xref="PID:g2584789" /translation="MAYHGLTVPLIVMSVFWGFVGFLVPWFIPKGPNRGVIITMLVTC SVCCYLFWLIAILAQLNPLFGPQLKNETIWYLKYHWP" sig_peptide 63..65 /gene="M9.2" mat_peptide 66..305 /gene="M9.2" /product="vacuolar proton-ATPase subunit M9.2" polyA_signal 810..815 /gene="M9.2" BASE COUNT 192 a 196 c 196 g 252 t ORIGIN 1 gacacttcct ggtgggatcc gagtgaggcg acggggtagg ggttggcgct caggcggcga 61 ccatggcgta tcacggcctc actgtgcctc tcattgtgat gagcgtgttc tggggcttcg 121 tcggcttctt ggtgccttgg ttcatcccta agggtcctaa ccggggagtt atcattacca 181 tgttggtgac ctgttcagtt tgctgctatc tcttttggct gattgcaatt ctggcccaac 241 tcaaccctct ctttggaccg caattgaaaa atgaaaccat ctggtatctg aagtatcatt 301 ggccttgagg aagaagacat gctctacagt gctcagtctt tgaggtcacg agaagagaat 361 gccttctaga tgcaaaatca cctccaaacc agaccacttt tcttgacttg cctgttttgg 421 ccattagctg ccttaaacgt taacagcaca tttgaatgcc ttattctaca atgcagcgtg 481 ttttcctttg ccttttttgc actttggtga attacgtgcc tccataacct gaactgtgcc 541 gactccacaa aacgattatg tactcttctg agatagaaga tgctgttctt ctgagagata 601 cgttactctc tccttggaat ctgtggattt gaagatggct cctgccttct cacgtgggaa 661 tcagtgaagt gtttagaaac tgctgcaaga caaacaagac tccagtgggg tggtcagtag 721 gagagcacgt tcagagggaa gagccatctc aacagaatcg caccaaacta tactttcagg 781 atgaatttct tctttctgcc atcttttgga ataaatattt tcctcctttc tatgga // LOCUS HSVPATPD 1630 bp RNA PRI 27-JAN-1994 DEFINITION H.sapiens mRNA for vacuolar proton ATPase, subunit D. ACCESSION X71490 NID g313011 KEYWORDS vacuolar proton-ATPase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1630) AUTHORS van Hille,B., Vanek,M., Richener,H., Green,J.R. and Bilbe,G. TITLE Cloning and tissue distribution of subunits C, D, and E of the human vacuolar H(+)-ATPase JOURNAL Biochem. Biophys. Res. Commun. 197 (1), 15-21 (1993) MEDLINE 94071935 REFERENCE 2 (bases 1 to 1630) AUTHORS van Hille,B.J.M. TITLE Direct Submission JOURNAL Submitted (16-APR-1993) B.J.M. Van Hille, CIBA-GEIGY, K 681-403, Research Dept. Pharmaceutical Division, CH-4002-Basel, SWITZERLAND FEATURES Location/Qualifiers source 1..1630 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="osteoclastoma" gene 257..1081 /gene="P39" CDS 257..1081 /gene="P39" /note="subunit D" /codon_start=1 /product="vacuolar proton ATPase" /db_xref="PID:g313012" /db_xref="SWISS-PROT:Q02547" /translation="MVVEFRHMRNHAYEPLASFLDFITYSYMIDNVILLITGTLHQRS IAELVPKCHPLGSFEQMEAVNIAQTPAELYNAILVDTPLAAFFQDCISEQDLDEMNIE IIRNTLYKAYLESFYKFCTLLGGTTADAMCPILEFEADRRAFIITINSFGTELSKEDR AKLFPHCGRLYPEGLAQLARADDYEQVKKLADYYPEYKLLFEGAGSNPGDKTLEDRFF EHEVKLNKLAFLNQFHFGVFYAFVKLKEQECRNIVWIAECIAQRHRAKIDNYIPIF" BASE COUNT 354 a 490 c 432 g 354 t ORIGIN 1 gggaccggcc gctcccgcag cagccatgtc gttcttcccg gagctttact ttaacgtgga 61 caatggctac ttggagggac tggtgcgcgc ctgaagggct gggcgagctc agccaggccg 121 actacctcaa cctggtgcag tgcgagacgc tagaagactt gaaactgcat ctgcagagca 181 ctgattatgg taacttcctg gccaacgagg catcgctctg gcggtgtcag tcatcgatga 241 ccggctcaag gagaagatgg tggtggagtt ccgccacatg aggaaccatg cctatgagcc 301 actcgccagc ttcctagact tcattactta cagttacatg atcgacaacg tgatcctgct 361 catcacaggc acgctgcacc agcgctccat cgctgagctc gtgcccaagt gccacccact 421 aggcagcttc gagcagatgg aggccgtgaa cattgctcag acacctgctg agctctacaa 481 tgccattctg gtggacacgc ctcttgcggc ttttttccag gactgcattt cagagcagga 541 ccttgacgag atgaacatcg agatcatccg caacaccctc tacaaggcct acctggagtc 601 cttctacaag ttctgcaccc tactgggcgg gactacggct gatgccatgt gccccatcct 661 ggagtttgaa gcagaccgcc gcgccttcat catcaccatc aattctttcg gcacagagct 721 gtccaaagag gaccgtgcca agctctttcc acactgtggg cggctctacc ctgagggcct 781 ggcgcagctg gctcgggctg acgactatga acaggtcaag aagctggccg attactaccc 841 ggagtacaag ctgctcttcg agggtgcagg tagcaaccct ggagacaaga cgctggagga 901 ccgattcttt gagcacgagg taaagctgaa caagttggcc ttcctgaacc agttccactt 961 tggtgttttc tatgccttcg tgaagctcaa ggagcaggag tgtcgcaaca tcgtgtggat 1021 cgctgaatgt atcgcccagc gccaccgcgc caaaatcgac aactacatcc ctatcttcta 1081 gcgtctggcc caaggctctc aagtgtgtgt gcgtgtgtgt gtatgtggtc tgttacaagc 1141 ctgtggctca cctgcctgtc cggggtgtag tacgctgtcc tagcggctgc ccagttctcc 1201 tgaccctctt agagactgtt cttaggcctg aaaaggggct gggccccccc ccccccacca 1261 aggatggatg gatgaagacc ccctccagag caaggaggcc ccctcagccc tgtggttaca 1321 gccgctgatg tatctaagaa gcatgtcact ttcatgttcc tccctaactc cctgacctga 1381 gaaccctggg gcctgggggc agtttgagcc tcctctccct tctgtgggtc gctcccagag 1441 ccatggccca tgggaaggac agagtgtgtg tgtccttggc ctggggggtg ttgctcctca 1501 cgtccctccc tcagccctgc ccctctgaga caataaaact gccctctcta agcccaaaaa 1561 aaaaaaaaaa aaaaaaaaaa aaccggaatt cgagctcgcc cggggactcc tctagagtcg 1621 acctgcagcc // LOCUS HSWHYDR 1335 bp RNA PRI 12-SEP-1993 DEFINITION Human mRNA for tryptophan hydroxylase (EC 1.14.16.4). ACCESSION X52836 NID g37954 KEYWORDS melatonin; serotonin; tryptophan 5-monooxygenase; tryptophan hydroxylase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1335) AUTHORS Boularand,S. TITLE Direct Submission JOURNAL Submitted (01-JUN-1990) Boularand S., CNRS, Bat. 32, Avenue de la Terrasse F91198 GIF sur Yvette, France REFERENCE 2 (bases 1 to 1335) AUTHORS Boularand,S., Darmon,M.C., Ganem,Y., Launay,J.M. and Mallet,J. TITLE Complete coding sequence of human tryptophan hydroxylase JOURNAL Nucleic Acids Res. 18 (14), 4257 (1990) MEDLINE 90332431 FEATURES Location/Qualifiers source 1..1335 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="carcinoid tumor" /cell_type="serotonin secreting" /clone_lib="Lambda ZAP II" CDS 1..1335 /note="tryptophan hydroxylase (AA 1 - 444)" /codon_start=1 /db_xref="PID:g37955" /db_xref="SWISS-PROT:P17752" /translation="MIEDNKENKDHSLERGRASLIFSLKNEVGGLIKALKIFQEKHVN LLHIESRKSKRRNSEFEIFVDCDINREQLNDIFHLLKSHTNVLSVNLPDNFTLKEDGM ETVPWFPKKISDLDHCANRVLMYGSELDADHPGFKDNVYRKRRKYFADLAMNYKHGDP IPKVEFTEEEIKTWGTVFQELNKLYPTHACREYLKNLPLLSKYCGYREDNIPQLEDVS NFLKERTGFSIRPVAGYLSPRDFLSGLAFRVFHCTQYVRHSSDPFYTPEPDTCHELLG HVPLLAEPSFAQFSQEIGLASLGASEEAVQKLATCYFFTVEFGLCKQDGQLRVFGAGL LSSISELKHALSGHAKVKPFDPKITCKQECLITTFQDVYFVSESFEDAKEKMREFTKT IKRPFGVKYNPYTRSIQILKDTKSITSAMNELQHDLDVVSDALAKVSRKPSI" BASE COUNT 409 a 271 c 274 g 381 t ORIGIN 1 atgattgaag acaataagga gaacaaagac cattccttag aaaggggaag agcaagtctc 61 attttttcct taaagaatga agttggagga cttataaaag ccctgaaaat ctttcaggag 121 aagcatgtga atctgttaca tatcgagtcc cgaaaatcaa aaagaagaaa ctcagaattt 181 gagatttttg ttgactgtga catcaacaga gaacaattga atgatatttt tcatctgctg 241 aagtctcata ccaatgttct ctctgtgaat ctaccagata attttacttt gaaggaagat 301 ggtatggaaa ctgttccttg gtttccaaag aagatttctg acctggacca ttgtgccaac 361 agagttctga tgtatggatc tgaactagat gcagaccatc ctggcttcaa agacaatgtc 421 taccgtaaac gtcgaaagta ttttgcggac ttggctatga actataaaca tggagacccc 481 attccaaagg ttgaattcac tgaagaggag attaagacct ggggaaccgt attccaagag 541 ctcaacaaac tctacccaac ccatgcttgc agagagtatc tcaaaaactt acctttgctt 601 tctaaatatt gtggatatcg ggaggataat atcccacaat tggaagatgt ctccaacttt 661 ttaaaagagc gtacaggttt ttccatccgt cctgtggctg gttacttatc accaagagat 721 ttcttatcag gtttagcctt tcgagttttt cactgcactc aatatgtgag acacagttca 781 gatcccttct ataccccaga gccagatacc tgccatgaac tcttaggtca tgtcccgctt 841 ttggctgaac ctagttttgc ccaattctcc caagaaattg gcttggcttc tcttggcgct 901 tcagaggagg ctgttcaaaa actggcaacg tgctactttt tcactgtgga gtttggtcta 961 tgtaaacaag atggacagct aagagtcttt ggtgctggct tactttcttc tatcagtgaa 1021 ctcaaacatg cactttctgg acatgccaaa gtaaagccct ttgatcccaa gattacctgc 1081 aaacaggaat gtcttatcac aacttttcaa gatgtctact ttgtatctga aagttttgaa 1141 gatgcaaagg agaagatgag agaatttacc aaaacaatta agcgtccatt tggagtgaag 1201 tataatccat atacacggag tattcagatc ctgaaagaca ccaagagcat aaccagtgcc 1261 atgaatgagc tgcagcatga tctcgatgtt gtcagtgatg cccttgctaa ggtcagcagg 1321 aagccgagta tctaa // LOCUS HSWNT13 2385 bp RNA PRI 07-OCT-1996 DEFINITION H.sapiens Wnt-13 mRNA. ACCESSION Z71621 NID g1524104 KEYWORDS Wnt-13 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2385) AUTHORS Katoh,M., Hirai,M., Sugimura,T. and Terada,M. TITLE Cloning, expression and chromosomal localization of Wnt-13, a novel member of the Wnt gene family JOURNAL Oncogene 13 (4), 873-876 (1996) MEDLINE 96358637 REFERENCE 2 (bases 1 to 2385) AUTHORS Terada,M. TITLE Direct Submission JOURNAL Submitted (11-JUL-1996) Masaaki Terada, Office of Director, National Cancer Center Research Institute, Tsukiji 5-chome, Chuo-ku, Tokyo, 104, Japan FEATURES Location/Qualifiers source 1..2385 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal brain, kindey and lung" /clone_lib="MKN28 cDNA (human gastric cancer)" /chromosome="1p13" gene 492..1610 /gene="Wnt-13" CDS 492..1610 /gene="Wnt-13" /function="putative secreted glycoprotein" /codon_start=1 /db_xref="PID:e252735" /db_xref="PID:g1524105" /translation="MLDGLGVVAISIFGIQLKTEGSLRTAVPGIPTQSAFNKCLQRYI GALGARVICDNIPGLVSRQRQLCQRYPDIMRSVGEGAREWIRECQHQFRHHRWNCTTL DRDHTVFGRVMLRSSREAAFVYAISSAGVIHAITRACSQGELSVCSCDPYTRGRHHDQ RGTFDWGGCSDNIHYGVRFAKAFVDAKEKRLKDARALMNLHNNRCGRTAVRRFVKLEC KCHGVSGSCTLRTCWRALSDFRRTGDYLRRRYDGAVQVMATQDGANFTAARQGYRRAT RSDLVYFDNSPDYCVLDKAAGSLGTAGRVCSKTSKGTDGCEIMCCGRGYDTTRVTRVT QCECKFHWCCAVRCKECRNTVDVHTCKAPKKAEWLDQT" BASE COUNT 638 a 592 c 580 g 575 t ORIGIN 1 aaacccactc caccttacta ccagacaacc ttagccaaac catttaccca aataaagtat 61 aggcgataga aattgaaacc tggcgcaata gatatagtac cgcaagggaa agatgaaaaa 121 ttataaccaa gcataatata gcaaggacta acccctatac cttctgcata atgaattaac 181 tagaaataac tttgcaagga gagtcaaagc taaggccccc gaaaccaggc gagctaccta 241 agaacagcta aaagagcaca cccgtctatg tagcaaaata gtgggaagat ttataggtag 301 aggcgacaaa cctaccgagc ctggtgatag ctggttgtcc aagatagaat cttagttcaa 361 ctttaaattt gcccacagaa ccctctaaat ccccttgtaa atttaactgt tagtccaaag 421 aggaacagct ctttggacac taggaaaaaa ccttgtagag agagtgtcag cccaattcca 481 cacttttcca catgttggat ggccttggag tggtagccat aagcattttt ggaattcaac 541 taaaaactga aggatccttg aggacggcag tacctggcat acctacacag tcagcgttca 601 acaagtgttt gcaaaggtac attggggcac tgggggcacg agtgatctgt gacaatatcc 661 ctggtttggt gagccggcag cggcagctgt gccagcgtta cccagacatc atgcgttcag 721 tgggcgaggg tgcccgagaa tggatccgag agtgtcagca ccaattccgc caccaccgct 781 ggaactgtac caccctggac cgggaccaca ccgtctttgg ccgtgtcatg ctcagaagta 841 gccgagaggc agcttttgta tatgccatct catcagcagg ggtgatccac gctattactc 901 gcgcctgtag ccagggtgaa ctgagtgtgt gcagctgtga cccctacacc cgtggccgac 961 accatgacca gcgtgggact tttgactggg gtggctgcag tgacaacatc cactacggtg 1021 tccgttttgc caaggccttc gtggatgcca aggagaagag gcttaaggat gcccgggccc 1081 tcatgaactt acataataac cgctgtggtc gcacggctgt gcggcggttt gtcaagctgg 1141 agtgtaagtg ccatggcgtg agtggttcct gtactctgcg cacctgctgg cgtgcactct 1201 cagatttccg ccgcacaggt gattacctgc ggcgacgcta tgatggggct gtgcaggtga 1261 tggccaccca agatggtgcc aacttcaccg cagcccgcca aggctatcgc cgtgccaccc 1321 ggagtgatct tgtctacttt gacaactctc cagattactg tgtcttggac aaggctgcag 1381 gttccctagg cactgcaggc cgtgtctgca gcaagacatc aaaaggaaca gacggttgtg 1441 aaatcatgtg ctgtggccga gggtacgaca caactcgagt cacccgtgtt acccagtgtg 1501 agtgcaaatt ccactggtgc tgtgctgtac ggtgcaagga atgcagaaat actgtggacg 1561 tccatacttg caaagccccc aagaaggcag agtggctgga ccagacctga acacacagat 1621 acctcactca tccctccaat tcaagcctct caactcaaaa gcacaagatc cttgcatgca 1681 caccttcctc caccctccac cctgggctgc taccgcttct atttaaggat gtagagagta 1741 atccataggg accatggtgt cctggctggt tccttagccc tgggaaggag ttgtcagggg 1801 atataagaaa ctgtgcaagc tccctgattt cccgctctgg agatttgaag ggagagtaga 1861 agagataggg ggtctttaga gtgaaatgag ttgcactaaa gtacgtagtt gaggctcctt 1921 ttttctttcc tttgcaccag cttcccgaca cttcttggtg tgcaagagga agggtacctg 1981 tagagagctt ctttttgttt ctacctggcc aaagttagat gggacaaaga tgaatggcat 2041 gtcccttctc tgaagtccgt ttgagcagaa ctacctggta ccccgaaaga aaaatcttag 2101 gctaccacat tctattattg agagcctgag atgttagcca tagtggacaa ggttccattc 2161 acatgctcat atgtttataa actgtgtttt gtagaagaaa aagaatcata acaatacaaa 2221 cacacattca ttctctcttt ttctctctac cattctcaac ctgtattgga cagcactgcc 2281 tcttttgctt acttgctgcc tgttcaaact gaggtggaat gcagtggttc ccatgcttaa 2341 cagatcatta aaacacccta gaacactcct aggatagatt aatgt // LOCUS HSWP34 1560 bp RNA PRI 10-JUN-1992 DEFINITION H.sapiens mRNA WP34 for phosphorylated lymphocyte differentiation and activation antigen. ACCESSION X55188 NID g37963 KEYWORDS lymphocyte antigen. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1560) AUTHORS Kadiyala,R.K., McIntyre,B.W. and Krensky,A.M. TITLE Molecular cloning and characterization of WP34, a phosphorylated human lymphocyte differentiation and activation antigen JOURNAL Eur. J. Immunol. 20 (11), 2417-2423 (1990) MEDLINE 91071278 FEATURES Location/Qualifiers source 1..1560 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="hematopoietic" /cell_type="T-cell, fetal liver and thymus" /clone_lib="cDNA expression library" /clone="WP34" gene 48..1067 /gene="WP34" CDS 48..1067 /gene="WP34" /function="differentiation and activation" /codon_start=1 /product="lymphocyte antigen" /db_xref="PID:g37964" /db_xref="SWISS-PROT:P33241" /translation="MAEASSDPGAEEREELLGPTAQWTVEDEEEAVHEQCQHERDRQL QAQDEEGGGHVPERPKQEMLLSLKPSEAPELDEDEGFGDWSQRPEQRQQHEGAQGTLD SGEPPQCRSPEGEQEDRPGLHAYEKEDSDEVHLEELSLSKEGPGPEDTVQDNLGAAGA EEEQEEHQKCQQPRTPSPLVLEGTIEQSSPPLSPTTKLIDRTESLNRSIEKSNSVKKS QPDLPISKIDQWLEQYTQAIETAGRTPKLARQASIELPSMAVASTKSRWETGEVQAQS AAKTPSCKDIVAGDMSKKSLWEQKGGSKTSSTIKSTPSGKRYKFVATGHGKYEKVLVE GGPAP" polyA_signal 1538..1543 polyA_site 1560 BASE COUNT 348 a 476 c 481 g 255 t ORIGIN 1 cactccaggg atctgccagc accctgtggg gcccagacta caggctgatg gcggaggctt 61 cgagtgaccc gggtgctgag gagcgggaag agttgctggg gcccactgct cagtggacgg 121 tggaggacga ggaggaggcc gtccacgagc aatgccagca tgagagagac aggcagcttc 181 aggcccagga cgaggaggga ggcggccatg tccccgagcg gccgaagcag gagatgctcc 241 tcagcctgaa gccctcggag gcccctgaac tggatgagga cgagggcttt ggcgactggt 301 cccagaggcc agagcagcgg cagcagcacg agggggcgca gggcaccttg gacagcggag 361 agccccccca gtgcaggagt cctgaggggg agcaagagga caggcccggc ctgcatgcct 421 acgaaaagga ggacagtgat gaagtccacc tggaggagtt gagtctgagc aaggaggggc 481 caggcccaga ggacactgtc caggacaacc tgggggccgc aggggctgag gaggaacagg 541 aggagcacca gaaatgtcag cagcccagga cacccagccc cttggtcttg gaggggacca 601 tcgaacagag ctcgcctccc ctgagcccta ccaccaaact catcgacagg accgagtccc 661 taaaccgctc catagagaag agtaacagtg tgaagaaatc ccagccagac ttgcccatct 721 ccaagattga tcagtggctg gaacaataca cccaggccat cgagaccgct ggccggaccc 781 ccaagctagc ccgccaggcc tccatagagc tgcccagcat ggctgtggcc agtaccaaga 841 gtcggtggga gacgggtgag gtacaggctc agtctgcggc caagactccg tcctgcaagg 901 atattgtggc tggagacatg agcaagaaaa gcctctggga gcagaaggga ggctccaaga 961 cctcatcaac aattaagagc accccatctg ggaagaggta taagtttgtg gccaccgggc 1021 atgggaagta tgagaaggtg cttgtggaag ggggcccggc tccctaggcg tcccatctcg 1081 cttcctgggt ctgcaggtcc agccggctgg caccctccat gtacccaggg gagattccag 1141 ccagacaccc gccccccggc cctggctaag aagttgcttc ctgttgccag catgacctac 1201 cctcgcctct ttgatgccat ccgctgccac ctccttttgc tcctggaccc tttagcctct 1261 ctgcccttcc actctctgac caccgccccc gccctcccca cctgccgctt cttgttactt 1321 gggggaggaa agaaactcct gatcattggc caaagggact tacccctcca gtgccaagtg 1381 ccttctatag gaagttaggt tgacgagcag cctgtgcaga gagtgtcacc cccccagatc 1441 aaggggaaac tgcaggtcgc ctgttgccgc ccaagggctg ataacggcca tgcaggatgc 1501 ttgatgctcg gtcccccccc accccgccat tttgtataat aaagctccct gtgtattctc // LOCUS HSX99050 3510 bp DNA PRI 09-OCT-1997 DEFINITION H.sapiens mRNA; UV Radiation Resistance Associated Gene. ACCESSION X99050 NID g2102666 KEYWORDS UV radiation resistance associated gene; UVRAG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3510) AUTHORS Canaani,D. TITLE Direct Submission JOURNAL Submitted (04-JUL-1996) D. Canaani, Tel-Aviv University, Biochemistry, Faculty of Life Sciences, Tel-Aviv University, Ramat-Aviv, Tel-Aviv 69978, ISRAEL REMARK Revised by [3] REFERENCE 2 (bases 1 to 3510) AUTHORS Canaani,D. TITLE Direct Submission JOURNAL Submitted (13-MAY-1997) D. Canaani, Tel-Aviv University, Biochemistry, Faculty of Life Sciences, Tel-Aviv University, Ramat-Aviv, Tel-Aviv 69978, ISRAEL FEATURES Location/Qualifiers source 1..3510 /organism="Homo sapiens" /note="between markers D11S906 and D11S916" /db_xref="taxon:9606" /cell_line="KCL22" /cell_type="hematopoetic" /chromosome="11" /map="q13" /clone="6a" /clone="7" /clone_lib="lambda ZAP II/KCL22" mRNA <1..>3510 gene 176..2122 /gene="UVRAG" CDS 176..2122 /gene="UVRAG" /note="UV Radiation Resistance Associated Gene" /codon_start=1 /evidence=experimental /product="p63 (processed form)" /db_xref="PID:e354226" /db_xref="PID:g2102667" /translation="MSASASVGGPVPQPPPGPAAALPPGSAARALHVELPSQQRRLRH LRNIAARNIVNRNGHQLLDTYFTLHLCSTEKIYKEFYRSEVIKNSLNPTWRSLDFGIM PDRLDTSVSCFVVKIWGGKENIYQLLIEWKVCLDGLKYLGQQIHARNQNEIIFGLNDG YYGAPFEHKGYSNAQKTILLQVDQNCVRNSYDVFSLLRLHRAQCAIKQTQVTVQKIGK EIEEKLRLTSTSNELKKKSECLQLKILVLQNELERQKKALGREVALLHKQQIALQDKG SAFSAEHLKLQLQKESLNELRKECTAKRELFLKTNAQLTIRCRQLLSELSYIYPIDLN EHKDYFVCGVKLPNSEDFQAKDDGSIAVALGYTAHLVSMISFFLQVPLRYPIIHKGSR STIKDNINDKLTEKEREFPLYPKGGEKLQFDYGVYLLNKNIAQLRYQHGLGTPDLRQT LPNLKNFMEHGLMVRCDRHHTSSAIPVPKRQSSIFGGADVGFSGGIPSPDKGHRKRAS SENERLQYKTPPPSYNSALAQPVTTVPSMGETERKITSLSSSLDTSLDFSKENKKKGE DLVGSLNGGHANVHPSQEQGEALSGHRATVNGTLLPSEQAGSASVQLPGEFHPVSEAE LCCTVEQAEEIIGLEAQVSPQVIS" polyA_signal 3489..3494 BASE COUNT 974 a 795 c 849 g 892 t ORIGIN 1 tccagcggcg gcaacggcgg cagcggcggc agcggcggcg gctactgtct gggctgagca 61 gtagtgcctc tcgggtggcg ggtttctagg ctgcaggggc ttggtaggtg gtggcaaggg 121 ggcggcggcg gatgccggaa gagtgcccgc cccgcttggc ggcccctgga tcgagatgag 181 cgcctccgcg tcggtcgggg gccccgtccc ccagccaccc ccgggcccgg ccgctgctct 241 gcctcccggt tctgccgcgc gggccctgca tgtggagctg ccgtctcagc agcggcgtct 301 tcgacatctt cggaacattg ctgcccggaa cattgttaat agaaatggcc atcagctcct 361 tgatacctac tttacacttc acttgtgtag tactgaaaag atatataaag aattttatag 421 aagtgaagtg attaagaatt ccttgaatcc cacgtggcga agtctcgatt ttggaattat 481 gccagaccgt cttgatacat ctgtgtcttg tttcgtggtg aagatatggg gtggaaagga 541 gaacatctac cagctgttga ttgaatggaa agtctgtttg gatgggctga aatacttggg 601 tcagcagatt catgcccgaa accaaaatga gataattttt gggctgaatg atggatacta 661 tggtgctcca tttgaacata agggttattc aaatgctcag aagactattc ttctgcaggt 721 ggatcagaac tgtgttcgca attcttacga tgtcttctct ttgctacggc ttcatagagc 781 ccagtgtgca attaaacaga ctcaggtaac tgttcagaaa attggaaagg aaattgaaga 841 aaaactaaga ctcacatcta caagcaatga actgaaaaaa aaaagtgaat gcctgcagtt 901 aaaaattttg gtgcttcaga atgaactgga acggcagaag aaagctttgg gacgggaggt 961 ggcattactg cataagcaac aaattgcatt acaagacaaa ggaagtgcat tttcagctga 1021 gcacctcaaa cttcaactcc agaaggaatc cctaaatgag ctgaggaagg agtgcactgc 1081 aaaaagagaa ctcttcttga agactaatgc tcagttgaca attcgttgca ggcagttact 1141 ctctgagctt tcctacattt accctattga tttgaatgaa cataaggatt actttgtatg 1201 cggtgtcaag ttgcctaatt ctgaggactt ccaagcaaaa gatgatggaa gcattgctgt 1261 tgcccttggt tatactgcac atctggtctc catgatttcc tttttcctac aagtgcccct 1321 cagatatcct ataattcata aggggtctag atcaacaatc aaagacaata tcaatgacaa 1381 actgacggaa aaggagagag agtttccact gtatccaaaa ggaggggaga agttgcagtt 1441 tgattatggt gtctatcttc tgaacaaaaa tatagcacag ctaagatatc aacatggact 1501 agggactcca gacttgcggc aaacccttcc caacctgaaa aacttcatgg agcatggact 1561 aatggtcagg tgtgacagac atcacacctc cagtgcaatc cctgttccta agagacaaag 1621 ctccatattt gggggtgcag atgtaggctt ctctgggggg atcccttcac cagacaaagg 1681 acatcgaaaa cgggccagct ctgagaatga gagacttcag tacaaaaccc ctcctcccag 1741 ttacaactca gcattagccc agcctgtgac caccgtcccc tccatgggag agaccgagag 1801 aaagataaca tctctatcct cctccttgga tacctccttg gacttctcca aagaaaacaa 1861 gaaaaaagga gaggatctag ttggcagctt aaacggaggc cacgcgaatg tgcaccctag 1921 ccaagaacaa ggagaagccc tctccgggca ccgggccaca gtcaatggca ctctcctacc 1981 cagcgagcag gccgggtccg ccagtgtcca gcttccaggc gagttccacc cagtctcaga 2041 agctgagctc tgctgtactg tggagcaagc agaagaaatc atcgggctgg aagcacaggt 2101 ttcgcctcag gtgatcagct agaagcattt aactgcatcc cagtggacag tgctgtggca 2161 gtagagtgtg acgaacaagt tctgggagaa tttgaagagt tctcccgaag gatctatgca 2221 ctgaatgaaa acgtatccag cttccgccgg ccgcgcagga gttccgataa gtgaagtgag 2281 caggtcaaca gtaggactgg ggcagaagct ctgcctaaaa tgaagtgaaa gctgcactta 2341 accctttgtg ataatgatga cacaaaatga atattaatgg aggatattcc tcggaaaaac 2401 agactttggg aatgaaggag ggactcagga tcattgttat cagtgggcca aagttagatt 2461 ttgctttcaa gatttgcttt tcgggcctga tgattttaaa gcaaaaatca ccctctagtt 2521 gaaagagctt acagctcgag tcacctttta gctatttgtc tgctttttat ttacccttgt 2581 atgttatcct cagagggaag atgataatat ataataatat aatgaacaca cccttagttt 2641 ctcataagca tttgccctca ccatggttta taaaactttg ggaaaacgga atattcagaa 2701 ataggtttcc gccatgtact gaaaggtctg tggccatctg tgaggtagat gaagaagcag 2761 catagtggtc tccttacatc taggcctaac tgtccctctt cctgcccccg ggtaccacag 2821 tccaccttta gaccctactg tcgccccatc ttctccgtgg atgggccatg cgttcctgaa 2881 aacaggacat caagattcac tggttctgta acccagtagc tgtgacgttc catctcttct 2941 aaccagccat ggccttcccc tcctctgcca tacccttaat gcggccctca gattagatga 3001 aaaacttgct cctggtggat cccaagggac cctcaaggac ctcgaggtta ctgcagtcag 3061 atgccatctc atccctgtgg gggccaaagt ttttatgtgg gcagatgctg tggtcaggaa 3121 ctaggcatgc tttctggcaa tgcactcacc agacaaaaat ccttgatgta aatcccatgt 3181 taatttatta aatttagtca gaaggtcagc atttacatga cagaatgtat gtagagagtt 3241 ggggtgtctg gtaggcaaac tgcaaggcag ttgagatagt tggattaaga ggctagacga 3301 gacatagaat actattggtg atgtgtgcaa tttcatgaat attaaattat gtttcgaagt 3361 ccagttgtca ttcccgcatt cagatttcat ttgctgatga ctttatacgt tacgtaccca 3421 aggacattgc ctcagggttg caaactcttt aaaggcaaaa tttatccata tatccatgta 3481 ttatatagaa taaaaattga agtttacttc // LOCUS HSXAP4 2225 bp RNA PRI 02-MAR-1995 DEFINITION H.sapiens XAP-4 mRNA for GDP-dissociation inhibitor. ACCESSION X79353 NID g695584 KEYWORDS GDP-dissociation inhibitor; XAP-4 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2225) AUTHORS Sedlacek,Z., Konecki,D.S., Korn,B., Klauck,S.M. and Poustka,A. TITLE Evolutionary conservation and genomic organization of XAP-4, an Xq28 located gene coding for a human rab GDP-dissociation inhibitor (GDI) JOURNAL Mamm. Genome 5 (10), 633-639 (1994) MEDLINE 95152170 REFERENCE 2 (bases 1 to 2225) AUTHORS Sedlacek,Z. TITLE Direct Submission JOURNAL Submitted (18-MAY-1994) Z. Sedlacek, DKFZ, Im Neuenheimer Feld 280, 69120 Heidelberg, FRG FEATURES Location/Qualifiers source 1..2225 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="foetus" /tissue_type="brain" /chromosome="X" /map="Xq28" gene 81..1424 /gene="XAP-4" CDS 81..1424 /gene="XAP-4" /codon_start=1 /product="GDP-dissociation inhibitor" /db_xref="PID:g695585" /translation="MDEEYDVIVLGTGLTECILSGIMSVNGKKVLHMDRNPYYGGESS SITPLEELYKRFQLLEGPPESMGRGRDWNVDLIPKFLMANGQLVKMLLYTEVTRYLDF KVVEGSFVYKGGKIYKVPSTETEALASNLMGMFEKRRFRKFLVFVANFDENDPKTFEG VDPQTTSMRDVYRKFDLGQDVIDFTGHALALYRTDDYLDQPCLETVNRIKLYSESLAR YGKSPYLYPLYGLGELPQGFARLSAIYGGTYMLNKPVDDIIMENGKVVGVKSEGEVAR CKQLICDPSYIPDRVRKAGQVIRIICILSHPIKNTNDANSCQIIIPQNQVNRKSDIYV CMISYAHNVAAQGKYIAIASTTVETTDPEKEVEPALELLEPIDQKFVAISDLYEPIDD GCESQVFCSCSYDATTHFETTCNDIKDIYKRMAGTAFDFENMKRKQNDVFGEAEQ" polyA_signal 2199..2204 BASE COUNT 488 a 632 c 606 g 499 t ORIGIN 1 cggcggcggc ggtggcggcg gcgactgctg cggtgaagga ggaggaggag ccgagcgggc 61 gctggcaccg aggcctgacc atggacgagg aatacgatgt gatcgtgctg gggaccggtc 121 tcaccgaatg catcctgtcg ggcatcatgt ctgtgaacgg gaagaaggtg ctgcacatgg 181 accggaaccc ctactacggg ggcgagagct cctccatcac acccctggag gagctgtata 241 agcgttttca gttgctggag gggccccctg agtcgatggg ccgaggccga gactggaatg 301 ttgacctgat tcccaaattc ctcatggcta acgggcagct ggtaaagatg ctactgtata 361 cagaggtgac tcgctacctg gacttcaagg tggtggaggg cagctttgtc tacaaggggg 421 gcaagatcta caaagtgccg tccactgaga ctgaggcctt ggcttccaat ctgatgggca 481 tgtttgagaa acggcgcttc cgcaagttcc tggtgtttgt ggcaaacttc gatgagaatg 541 accccaagac ctttgagggc gttgaccccc agactaccag catgcgtgac gtctaccgga 601 agtttgatct gggccaggat gtcatcgatt tcactggcca tgccctggcg ctctaccgca 661 ctgatgacta cctggaccag ccctgccttg agaccgtcaa ccgcatcaag ttgtacagtg 721 agtccctggc ccggtatggc aagagcccat atttataccc gctctacggc ttgggcgagc 781 tgccccaggg ttttgcaaga ttgagtgcca tctatggggg gacatatatg ctgaacaaac 841 ctgtggatga catcatcatg gagaacggca aggtggtggg cgtgaagtct gagggagagg 901 tggcccgctg caagcagctg atctgtgacc ccagctacat cccggaccgt gtgcggaagg 961 ctggccaggt tatccgcatc atctgtatcc ttagccaccc catcaagaac accaacgacg 1021 ccaactcctg ccaaataatc atcccccaga accaggtcaa caggaagtca gacatctacg 1081 tgtgcatgat ctcctatgca cacaacgtgg cggcccaggg caagtacata gctattgcca 1141 gcactactgt ggagaccacg gaccctgaaa aggaggtgga gccggctctg gagctgttgg 1201 agcccattga ccagaagttt gtggctatca gtgacttgta tgagcccatt gatgatggtt 1261 gtgagagcca ggtgttctgt tcctgctcct acgatgccac cacacacttt gagacaacct 1321 gcaacgacat caaagacatc tacaaacgca tggctggcac ggcctttgac tttgagaaca 1381 tgaagcgcaa acagaacgac gtctttggag aagctgagca gtgattgtgg ccgcccccag 1441 cccctgctgc cccagcctgt gtctgttctc ctcgagggct ccagcatcct ctgcttcccc 1501 caccacgttc ccatcaccca cctcattgat ccactgacca aatccttaac cctagcgatg 1561 gcttgggaga tggggggttg gatagcatcc tctttcttgg cccttcctta tcctaggaaa 1621 agagggttcc tctccttgtg tgtgtctctt ccccccaccc ctaattcttc tgctctgttt 1681 gggaagacgt ggaggaaaag gtgacttctg cccccaccgc tcttaccccc actgtagtgg 1741 cctttggaga tgcccccacc tcccccccac caactctcgc gtgttggaga gaaggggccc 1801 tcccagcaca aagttgcatt cctccccccc taatttattc taatttatta actttgaccc 1861 accctttctg agcctgcagc cttcccgtgt ggcctgaggg ctgtcgagtg agctgcccca 1921 gccccctccc agcccttgcc cagcctgggg gagtggggaa ggcttgggca tggccccgtt 1981 ggaggttgat ttgctgtttt gtttcttgtc tttgtgttct gtggtacttg ctgagagaaa 2041 agaaaagtga gccaagcaga aggaggtgga aaacggaccc aaaccccagt gtgccctgcc 2101 ccatgccttt cctttagtgg tgggaaaccc ttatcttgca aagtgaatgt gtccccttcc 2161 ccaccctcta gtgtatttca cagaaaacaa aacctcccaa taaaacggtt gaaacctgaa 2221 aaaaa // LOCUS HSXCGD 4267 bp RNA PRI 16-SEP-1994 DEFINITION Human mRNA of X-CGD gene involved in chronic granulomatous disease located on chromosome X. ACCESSION X04011 NID g37983 KEYWORDS chronic granulomatous disease-associated gene; glycoprotein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4267) AUTHORS Royer-Pokora,B., Kunkel,L.M., Monaco,A.P., Goff,S.C., Newburger,P.E., Baehner,R.L., Cole,F.S., Curnutte,J.T. and Orkin,S.H. TITLE Cloning the gene for an inherited human disorder--chronic granulomatous disease--on the basis of its chromosomal location JOURNAL Nature 322 (6074), 32-38 (1986) MEDLINE 86257405 COMMENT Data kindly reviewed (23-FEB-1987) by Orkin S.H. FEATURES Location/Qualifiers source 1..4267 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 208..1728 /note="precursor polypeptide" /codon_start=1 /db_xref="PID:g37984" /db_xref="SWISS-PROT:P04839" /translation="MLILLPVCRNLLSFLRGSSACCSTRVRRQLDRNLTFHKMVAWMI ALHSAIHTIAHLFNVEWCVNARVNNSDPYSVALSELGDRQNESYLNFARKRIKNPEGG LYLAVTLLAGITGVVITLCLILIITSSTKTIRRSYFEVFWYTHHLFVIFFIGLAIHGA ERIVRGQTAESLAVHNITVCEQKISEWGKIKECPIPQFAGNPPMTWKWIVGPMFLYLC ERLVRFWRSQQKVVITKVVTHPFKTIELQMKKKGFKMEVGQYIFVKCPKVSKLEWHPF TLTSAPEEDFFSIHIRIVGDWTEGLFNACGCDKQEFQDAWKLPKIAVDGPFGTASEDV FSYEVVMLVGAGIGVTPFASILKSVWYKYCNNATNLKLKKIYFYWLCRDTHAFEWFAD LLQLLESQMQERNNAGFLSYNIYLTGWDESQANHFAVHHDEEKDVITGLKQKTLYGRP NWDNEFKTIASQHPNTRIGVFLCGPEALAETLSKQSISNSESGPRGVHFIFNKENF" sig_peptide 208..257 misc_feature 409..411 /note="glycosylation site; putative" misc_feature 460..462 /note="glycosylation site; putative" misc_feature 733..735 /note="putative; glycosylation site" misc_feature 1303..1305 /note="putative; glycosylation site" polyA_signal 4248..4253 polyA_site 4267 BASE COUNT 1242 a 838 c 913 g 1274 t ORIGIN 1 cttcctctgc caccatcggg gaactgggct gtgaatgagg ggctctccat ttttgctatt 61 ctggtttggc tggggttgaa cgtcttcctc tttgtctggt attaccgggt ttatgatatt 121 ccacctaagt tcttttacac aagaaaactt cttgggtcag cactggcact ggccagggcc 181 cctgcagcct gcctgaattt caactgcatg ctgattctct tgccagtctg tcgaaatctg 241 ctgtccttcc tcaggggttc cagtgcgtgc tgctcaacaa gagttcgaag acaactggac 301 aggaatctca cctttcataa aatggtggca tggatgattg cacttcactc tgcgattcac 361 accattgcac atctatttaa tgtggaatgg tgtgtgaatg cccgagtcaa taattctgat 421 ccttattcag tagcactctc tgaacttgga gacaggcaaa atgaaagtta tctcaatttt 481 gctcgaaaga gaataaagaa ccctgaagga ggcctgtacc tggctgtgac cctgttggca 541 ggcatcactg gagttgtcat cacgctgtgc ctcatattaa ttatcacttc ctccaccaaa 601 accatccgga ggtcttactt tgaagtcttt tggtacacac atcatctctt tgtgatcttc 661 ttcattggcc ttgccatcca tggagctgaa cgaattgtac gtgggcagac cgcagagagt 721 ttggctgtgc ataatataac agtttgtgaa caaaaaatct cagaatgggg aaaaataaag 781 gaatgcccaa tccctcagtt tgctggaaac cctcctatga cttggaaatg gatagtgggt 841 cccatgtttc tgtatctctg tgagaggttg gtgcggtttt ggcgatctca acagaaggtg 901 gtcatcacca aggtggtcac tcaccctttc aaaaccatcg agctacagat gaagaagaag 961 gggttcaaaa tggaagtggg acaatacatt tttgtcaagt gcccaaaggt gtccaagctg 1021 gagtggcacc cttttacact gacatccgcc cctgaggaag acttctttag tatccatatc 1081 cgcatcgttg gggactggac agaggggctg ttcaatgctt gtggctgtga taagcaggag 1141 tttcaagatg cgtggaaact acctaagata gcggttgatg ggccctttgg cactgccagt 1201 gaagatgtgt tcagctatga ggtggtgatg ttagtgggag cagggattgg ggtcacaccc 1261 ttcgcatcca ttctcaagtc agtctggtac aaatattgca ataacgccac caatctgaag 1321 ctcaaaaaga tctacttcta ctggctgtgc cgggacacac atgcctttga gtggtttgca 1381 gatctgctgc aactgctgga gagccagatg caggaaagga acaatgccgg cttcctcagc 1441 tacaacatct acctcactgg ctgggatgag tctcaggcca atcactttgc tgtgcaccat 1501 gatgaggaga aagatgtgat cacaggcctg aaacaaaaga ctttgtatgg acggcccaac 1561 tgggataatg aattcaagac aattgcaagt caacacccta ataccagaat aggagttttc 1621 ctctgtggac ctgaagcctt ggctgaaacc ctgagtaaac aaagcatctc caactctgag 1681 tctggccctc ggggagtgca tttcattttc aacaaggaaa acttctaact tgtctcttcc 1741 atgaggaaat aaatgtgggt tgtgctgcca aatgctcaaa taatgctaat tgataatata 1801 aataccccct gcttaaaaat ggacaaaaag aaactataat gtaatggttt tcccttaaag 1861 gaatgtcaaa gattgtttga tagtgataag ttacatttat gtggagctct atggttttga 1921 gagcactttt acaaacatta tttcattttt ttcctctcag taatgtcagt ggaagttagg 1981 gaaaagattc ttggactcaa ttttagaatc aaaagggaaa ggatcaaaag gttcagtaac 2041 ttccctaaga ttatgaaact gtgaccagat ctagcccatc ttactccagg tttgatactc 2101 tttccacaat actgagctgc ctcagaatcc tcaaaatcag tttttatatt ccccaaaaga 2161 agaaggaaac caaggagtag ctatatattt ctactttgtg tcatttttgc catcattatt 2221 atcatactga aggaaatttt ccagatcatt aggacataat acatgttgag agtgtctcaa 2281 cacttattag tgacagtatt gacatctgag catactccag tttactaata cagcagggta 2341 actgggccag atgttctttc tacagaagaa tattggattg attggagtta atgtaatact 2401 catcatttac cactgtgctt ggcagagagc ggatactcaa gtaagttttg ttaaatgaat 2461 gaatgaattt agaaccacac aatgccaaga tagaattaat ttaaagcctt aaacaaaatt 2521 tatctaaaga aataacttct attactgtca tagaccaaag gaatctgatt ctccctaggg 2581 tcaagaacag gctaaggata ctaaccaata ggattgcctg aagggttctg cacattctta 2641 tttgaagcat gaaaaaagag ggttggaggt ggagaattaa cctcctgcca tgactctggc 2701 tcatctagtc ctgctccttg tgctataaaa taaatgcaga ctaatttcct gcccaaagtg 2761 gtcttctcca gctagccctt atgaatattg aacttaggaa ttgtgacaaa tatgtatctg 2821 atatggtcat ttgttttaaa taacacccac cccttatttt ccgtaaatac acacacaaaa 2881 tggatcgcat ctgtgtgact aatggtttat ttgtattata tcatcatcat catcctaaaa 2941 ttaacaaccc agaaacaaaa atctctatac agagatcaaa ttcacactca atagtatgtt 3001 ctgaatatat gttcaagaga gagtctctaa atcactgtta gtgtggccaa gagcagggtt 3061 ttctttttgt tcttagaact gctcccattt ctgggaacta aaaccagttt tatttgcccc 3121 accccttgga gccacaaatg tttagaactc ttcaacttcg gtaatgagga agaaggagaa 3181 agagctgggg gaagggcaga agactggttt aggaggaaaa ggaaataagg agaaaagaga 3241 atgggagagt gagagaaaat aaaaaaggca aaagggagag agaggggaag ggggtctcat 3301 attggtcatt ccctgcccca gatttcttaa agtttgatat gtatagaata taattgaagg 3361 aggtatacac atactgatgt tgttttgatt atctatggta ttgaatcttt taaaatctgg 3421 tcacaaattt tgatgctgag ggggattatt caagggacta ggatgaacta aataagaact 3481 cagttgttct ttgtcatact actattcctt tcgtctccca gaatcctcag ggcactgagg 3541 gtaggtctga caaataaggc ctgctgtgcg aatatagcct ttctgaaatg taccaggatg 3601 gtttctgctt agagacactt aggtccagcc tgttcacact gcacctcagg tatcaattca 3661 tctattcaac agatatttat tgtgttatta ctatgagtca ggctctgttt attgtttcaa 3721 ttctttacac caaagtatga actggagagg gtacctcagt tataaggagt ctgagaatat 3781 tggccctttc taacctatgt gcataattaa aaccagcttc atttgttgct ccgagagtgt 3841 ttctccaagg ttttctatct tcaaaaccaa ctaagttatg aaagtagaga gatctgccct 3901 gtgttatcca gttatgagat aaaaaatgaa tataagagtg cttgtcatta taaaagtttc 3961 ctttttatct ctcaagccac cagctgccag ccaccacgag ccagctgcca gcctagcttt 4021 tttttttttt ttttttttag cacttagtat ttagcattta ttaacaggta ctctaagaat 4081 gatgaagcat tgtttttaat cttaagacta tgaaggtttt tcttagttct tctgcttttg 4141 caattgtgtt tgtgaaattt gaatacttgc aggctttgta tgtgaataat tctagcgggg 4201 gacctgggag ataattctac ggggaattct taaaactgtg ctcaactatt aaaatgaatg 4261 agctttc // LOCUS HSXIAPAF1 1326 bp RNA PRI 01-MAR-1997 DEFINITION H.sapiens mRNA for XIAP associated factor-1. ACCESSION X99699 NID g1869900 KEYWORDS XIAP associated factor-1. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1326) AUTHORS Toji,S., Yano,M. and Tamai,K. TITLE Identification of novel XIAP associated protein JOURNAL Unpublished REFERENCE 2 (bases 1 to 1326) AUTHORS Tamai,K. TITLE Direct Submission JOURNAL Submitted (01-AUG-1996) K. Tamai, MBL co.Ltd. Ina Laboratory, R&D division, 1063-103 Terasawaoka, Ina City, Nagano, 396, JAPAN FEATURES Location/Qualifiers source 1..1326 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" CDS 1..954 /codon_start=1 /evidence=experimental /product="XIAP associated factor-1 (ZAP-1)" /db_xref="PID:e257708" /db_xref="PID:g1869901" /translation="MEGDFSVCRNCKRHVVSANFTLHEAYCLRFLVLCPECEEPVPKE TMEEHCKLEHQQVGCTMCQQSMQKSSLEFHKANECQERPVECKFCKLDMQLSKLELHE SYCGSRTELCQGCGQFIMHRMLAQHRDVCRSEQAQLGKGERISAPEREIYCHYCNQMI PENKYFHHMGKCCPDSEFKKHFPVGNPEILPSSLPSQAAENQTSTMEKDVRPKTRSIN RFPLHSESSSKKAPRSKNKTLDPLLMSEPKPRTSSPRGDKAAYDILRRCSQCGILLPL PILNQHQEKCRWLASSKRKTSEKFQLDLEKERYYKFKRFHF" BASE COUNT 421 a 301 c 301 g 303 t ORIGIN 1 atggaaggag acttctcggt gtgcaggaac tgtaaaagac atgtagtctc tgccaacttc 61 accctccatg aggcttactg cctgcggttc ctggtcctgt gtccggagtg tgaggagcct 121 gtccccaagg aaaccatgga ggagcactgc aagcttgagc accagcaggt tgggtgtacg 181 atgtgtcagc agagcatgca gaagtcctcg ctggagtttc ataaggccaa tgagtgccag 241 gagcgccctg ttgagtgtaa gttctgcaaa ctggacatgc agctcagcaa gctggagctc 301 cacgagtcct actgtggcag ccggacagag ctctgccaag gctgtggcca gttcatcatg 361 caccgcatgc tcgcccagca cagagatgtc tgtcggagtg aacaggccca gctcgggaaa 421 ggggaaagaa tttcagctcc tgaaagggaa atctactgtc attattgcaa ccaaatgatt 481 ccagaaaata agtatttcca ccatatgggt aaatgttgtc cagactcaga gtttaagaaa 541 cactttcctg ttggaaatcc agaaattctt ccttcatctc ttccaagtca agctgctgaa 601 aatcaaactt ccacgatgga gaaagatgtt cgtccaaaga caagaagtat aaacagattt 661 cctcttcatt ctgaaagttc atcaaagaaa gcaccaagaa gcaaaaacaa aaccttggat 721 ccacttttga tgtcagagcc caagcccagg accagctccc ctagaggaga taaagcagcc 781 tatgacattc tgaggagatg ttctcagtgt ggcatcctgc ttcccctgcc gatcctaaat 841 caacatcagg agaaatgccg gtggttagct tcatcaaaaa ggaaaacaag tgagaaattt 901 cagctagatt tggaaaagga aaggtactac aaattcaaaa gatttcactt ttaacactgg 961 cattcctgcc tacttgctgt ggtggtcttg tgaaaggtga tgggttttat tcgttgggct 1021 ttaaaagaaa aggtttggca gaactaaaaa caaaactcac gtatcatctc aatagataca 1081 gaaaaggctt ttgataaaat tcaacttgac ttcatgttaa aaaccctcaa caaaccaggc 1141 gtcgaaggaa catacctcaa aataataaga gccatctatg acaaaaccac agccaacatc 1201 atactgaatg agcaaaagct ggagcattac tcttgagaag tagaacaagg cacttcagtc 1261 ctattcaaca tagtactgga agtctcgcca cagcaatcag gcaagagaaa gaagtaaaag 1321 gcaccc // LOCUS HSXKMTP 5096 bp RNA PRI 27-OCT-1997 DEFINITION H.sapiens XK mRNA for membrane transport protein. ACCESSION Z32684 NID g2570027 KEYWORDS membrane transport protein; XK gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5096) AUTHORS Ho,M., Chelly,J., Carter,N., Danek,A., Crocker,P. and Monaco,A.P. TITLE Isolation of the gene for McLeod syndrome that encodes a novel membrane transport protein JOURNAL Cell 77 (6), 869-880 (1994) MEDLINE 94273191 REFERENCE 2 (bases 1 to 5096) AUTHORS Ho,M.F. TITLE Direct Submission JOURNAL Submitted (21-APR-1994) Meng F Ho, Human Genetics, Imperial Cancer Research Fund, Institute of Molecular Medicine John Radcliffe Hospital Headington, Oxford, OXON, OX3 9DU, United Kingdom REMARK revised by [3] REFERENCE 3 (bases 1 to 5096) AUTHORS Ho,M.F. TITLE Direct Submission JOURNAL Submitted (21-OCT-1997) Meng F Ho, Human Genetics, Imperial Cancer Research Fund, Institute of Molecular Medicine John Radcliffe Hospital Headington, Oxford, OXON, OX3 9DU, United Kingdom FEATURES Location/Qualifiers source 1..5096 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /clone="XK" gene 83..1417 /gene="XK" exon 83..327 /gene="XK" /number=1 CDS 83..1417 /gene="XK" /citation=[1] /codon_start=1 /product="Membrane transport protein" /db_xref="PID:e1154296" /db_xref="PID:g2570028" /translation="MKFPASVLASVFLFVAETTAALSLSSTYRSGGDRMWQALTLLFS LLPCALVQLTLLFVHRDLSRDRPLVLLLHLLQLGPLFRCFEVFCIYFQSGNNEEPYVS ITKKRQMPKNGLSEEIEKEVGQAEGKLITHRSAFSRASVIQAFLGSAPQLTLQLYISV MQQDVTVGRSLLMTISLLSIVYGALRCNILAIKIKYDEYEVKVKALAYVCIFLWRSFE IATRVVVLVLFTSVLKTWVVVIILINFFSFFLYPWILFWCSGSPFPENIEKALSRVGT TIVLCFLTLLYTGINMFCWSAVQLKIDSPDLISKSHNWYQLLVYYMIRFIENAILLLL WYLFKTDIYMYVCAPLLVLQLLIGYCTAILFMLVFYQFFHPCKKLFSSSVSEGFQRWL RCFCWACRQQKPCEPIGKEDLQSSRDRDETPSSSKTSPEPGQFLNAEDLCSA" exon 328..590 /gene="XK" /number=2 exon 591..1417 /gene="XK" /number=3 polyA_signal 5036..5041 polyA_signal 5063..5068 polyA_site 5092 BASE COUNT 1392 a 1063 c 1023 g 1618 t ORIGIN 1 ctcgggcaac ggccgccgcc gccacagcca cacagccgcc gccactgcgt ccgtccccgg 61 tgagcgccgc tgacgcgcgg agatgaaatt cccggcctcg gtgctggcgt ccgtgttcct 121 gttcgtggcc gagacaacgg cggcgctcag cctgagcagc acctaccgct cgggcgggga 181 ccgcatgtgg caggcgctga cgttgctttt ctcgctactg ccttgcgcgc tcgtgcagct 241 cacgcttctc ttcgtacacc gcgacctcag ccgcgaccgc ccgctcgtac tgctgctgca 301 cctgctgcaa cttgggcccc ttttcaggtg ttttgaagtc ttctgcatct actttcagtc 361 aggcaacaat gaagagcctt atgtcagtat caccaagaag aggcaaatgc caaaaaatgg 421 cctctcagag gagattgaga aggaggtggg ccaggcagaa ggcaaactaa tcacccaccg 481 atcagcgttc agccgggcgt cggtgatcca ggctttcttg ggctcagccc cccagctgac 541 cctacagctg tacataagtg tcatgcagca ggacgtcact gttggaagaa gtctcctcat 601 gaccatatcc ctgttgtcca ttgtgtatgg agccttgcgc tgcaacatcc tagccatcaa 661 aatcaagtac gatgagtatg aagtcaaagt gaaggctctg gcctatgtct gtatcttcct 721 gtggaggagc tttgagattg ccactcgagt tgtagtcctg gtcctcttta cctccgtcct 781 gaagacctgg gtggtggtta taatactcat caacttcttc agtttcttct tgtacccctg 841 gatcctcttc tggtgcagtg gttccccatt ccctgagaac atagagaagg ccctcagtag 901 agtgggcacc accattgtac tatgctttct aactttactc tatactggta tcaacatgtt 961 ctgctggtct gctgtacagc tgaaaattga cagccctgac ctcatcagca agtcccataa 1021 ttggtaccag ctactggtgt attacatgat aagattcatc gagaatgcca tcctcctcct 1081 cctgtggtat cttttcaaga ctgacatcta tatgtatgtg tgcgcacctc tgttggtcct 1141 gcagctgctc attgggtact gcacagccat tctcttcatg cttgtattct atcagttctt 1201 ccacccttgc aaaaagctct tttcttccag tgtttctgaa ggctttcaga ggtggctcag 1261 gtgtttttgc tgggcctgca ggcagcaaaa accctgtgag ccgataggaa aggaagatct 1321 acagtcatcc agagatagag atgagacacc ttctagcagt aaaacaagtc ctgagcctgg 1381 tcagttcttg aatgctgaag atctctgctc tgcttaatgg gacccaaggt ctcagagcac 1441 aggcatatta ttttctgggt ttgatactcg ttattcatac aaataatgag ccctacacag 1501 ggaacaaggc aggaagttag ctgttaactc cttgtgagct gcttctcttt atagagctct 1561 tgggtatgta gaactgtatg ggaagaagcc aggaaaacct ctgagtgttg aaggggcaac 1621 ccaaggcatc acagttcaca ggtaaccatg ttgtgttctt ctaggcatta ctggcctttt 1681 cactgacaag ttacccctga aatctgagtt gtactggtta gattcattag gttgaatgag 1741 gagaggggct tacctgttct ctagttttcc agactgctgg tttgggtaac ctgaagtttt 1801 atgatccctg aacatagttt tcacccaaga gctgtcctgg tgaccaaaaa agattactag 1861 ggttttgcca tcagcaacat cattcctgtc agagctttca gggagggctg ttcaagtttg 1921 gtttttgaat agacagaggt ttcattttca tctcattagg gcttttttgt acatagccaa 1981 ctgtagccac cctggcatgc tgtctctaat tgattcaagg gcctagattt ggagactttg 2041 ttccttagca tccatggatg cagaaatgag tgataaattt ggccatgaaa gccctggaat 2101 tttaggaaag ttgtgattct ttgatatgtt gatgaaattc tggatcagga ggggtgttaa 2161 aacatcaaaa tgatggcgac ttgtataagt agaattctta ccatcttact cctgcctccc 2221 cattccttgc ctgtcaggac cattttagaa gagttacctt ggttaactta agggtttttt 2281 agaaaacccc acagaaatct ctcacccagt tcaggtgtat agggaatttt cattttcatt 2341 atttcaagga atagagtttt cctcactacc acttgtaatt agagattgac ttcagaaact 2401 ttttctaaat tataaagacg gaatgaccag aaaatttttg tctttcatgg aatgcaattt 2461 gacttggcat aattgtgagt taatttgata aagatctcca gttgtatcct ctgacaccct 2521 ttaatgttct atataatttt tcctttaatc ggagacccta tttgtttcat taaagaggat 2581 tgggtagcat atcctcaatt atcttggaaa aatgcgtaga tctagtgcca ttatttctac 2641 cattaaaatc ttgaaagcga aatacttgaa aagaggtgac cataaagatc acaccaattt 2701 agaatgaata ttaaagtctg agaaatttac aaaagggact ctatggactc gattttacca 2761 tatggaatag ggatccactt ctctgttaac ctacaaaacg tatttatctg gccattgcac 2821 ttcggataat tttcattttt aattcctctt ggtgtcaaat ttttaagtaa tttcatgcac 2881 acaggagaaa atgcaatttg ataatcagtt tccactacat tcggaggtca agaaagcagt 2941 atatattctt tcacaaggca tccatactgt ttaatattta tagagttcat agaaatacat 3001 caataagctc ctaaattata gagcctcaat tttggttttt ctcatttgaa atgtttttga 3061 tttatcaata tttgtggttt aactttgtgg gggtgtccag gcagtagatc attttttgtc 3121 tatttttaac caaataagaa taacactcaa gactatgttc tccatcccaa cacatcactt 3181 accacttgca ctattcttaa taggcctaaa atagtgagca tatataatca aaataatact 3241 tcatacaaca tatacttcca tgcactcttc atgtaatcac atatgtgcat agatacacaa 3301 tcgtattaca taaaacaccc ttgaaaatat atgactttga ataaaataaa gctaccatgt 3361 taatacatca ttttttgttt cttcctaatt aacaaaaggg ctctataatg attaggagag 3421 ctgtaaggcc attctttagt cattccaagt gtatatttgt atcacaccag ttacttgtgt 3481 tcattgacac caaatatttt caagcttttt gtgagaaaga gattcttgcc ttgaggctct 3541 gcagtggcaa atggtagggg caatagcgct cccttttcac ctgaaatgcc ttcagctcag 3601 gccttgggag tgaggtgtgg gagtagccac cagcctccca ctgactaggc atgttaggaa 3661 ttagacaagg atggagagac tggttttttc cagccttggt ttggaagctt gtctcaaaaa 3721 ccattacaaa atcttaacct tcaaatacat tctaaatcac tttaatttag aagataggtg 3781 agcaacttga gtgatcccaa accaaactaa cagtgctggt ttgtccgaca cccaaagccc 3841 acttcacaca tttataagga cagagatttg gtaactcatt tgggtatttt gtcagtttta 3901 acccttaata tacaggtatc ccagctttgg gcacatgtat gtccattcac tttcaggaag 3961 gggtgtcagt actttctttc tgtgtggtga aaacataagc catggtgaac tgataatttt 4021 tgtttttaaa ttatttatat tactgaattg gcaaaattac agttcacact gcagagctag 4081 gttgtcctat tggcataaaa caaaccagca acttttctca tgtgtttgga gtttggagtt 4141 aaaatgtgtt tttcagttat aatttcaatt tttaaatttc ttggtgcttt gtctaattaa 4201 ttacatccca taacttgtca ggatagttgc tccaactgaa ttgctatcat ttggctggat 4261 gggggtgaag tatgtcttca aaatatattg aagtaagttc atcaaacatg tttagacttc 4321 ctctcattgc tggagacagc ccagtgtaga gattggcact tccctgtcca gcactacaac 4381 ttaattgctc tatgcctcag tttccccata ataccctttt ccaacctacc tcacaaggtt 4441 atcagaaaga tcaaattaaa taatagatgt gaaaatgctt tgaaaatgtt tttaagtgct 4501 atgcatattt atagagttat tatagttatt caaatccata agcaggttat ttttattttt 4561 ttactgtttg aaaagaaata agtagtgtca ttcatatgag ttcctgagtg gatatgcgaa 4621 tctttggtct tctcgacagg tgccctttct cttaccacta gttgcttaca aaaatagcac 4681 ccaaacacat ggcacttgga agaaaaagac ccactgaatc tgagaaagta ctttctggtg 4741 gcaaatgaat ttatcatttt agtatagttt tcgataaagg atatagcaac atcttttgat 4801 aattgtgctt tacaacttga atgctgattc aaggcattat ttggatgtga gtttaatctt 4861 ttcagccttg taaatgggaa aatatatatt aaaattgatt gtcaaataca gcatttcttt 4921 ggaaaccacc ttaaaacatt agtgctatgg ttatgagtgt atgtgccagt acttaccagt 4981 caatgcattg tggatatgag ctttcgttga ctgcttctct gcagtcgttg atgctaataa 5041 atattgtcct gtttcttcat ataataaatt aatttttcaa tttctccttg taaaaa // LOCUS HSXMEF2 1500 bp RNA PRI 13-OCT-1992 DEFINITION H.sapiens mRNA for myocyte-specific enhancer factor 2 (XMEF2). ACCESSION X68502 NID g37991 KEYWORDS alternative splicing; DNA-binding protein; MADS box; muscle specific protein; transcription factor; xmef2 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1500) AUTHORS Breitbart,R. TITLE Direct Submission JOURNAL Submitted (25-SEP-1992) R. Breitbart, Children's Hospital, Harvard Medical School, Dept. of Cardiology, 300 Longwood Ave., Boston, MA 02115, USA REFERENCE 2 (bases 1 to 1500) AUTHORS Yu,Y.T., Breitbart,R.E., Smoot,L.B., Lee,Y., Mahdavi,V. and Nadal-Ginard,B. TITLE Human myocyte-specific enhancer factor 2 comprises a group of tissue-restricted MADS box transcription factors JOURNAL Genes Dev. 6 (9), 1783-1798 (1992) MEDLINE 92387551 COMMENT Related sequences:MEF2 and MEFa are isoforms of the same gene that also encodes the human SRF-related clones RSRFC4 and RSRFC9 (acc# x63381). FEATURES Location/Qualifiers source 1..1500 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="heart & skeletal muscle" /clone_lib="lambda gt11+ZAP II" misc_feature <1..>1500 /note="linker sequence" 5'UTR 2..249 gene 250..1347 /gene="xmef2" CDS 250..1347 /gene="xmef2" /codon_start=1 /product="myocyte-specific enhancer factor 2 (XMEF2)" /db_xref="PID:g37992" /db_xref="SWISS-PROT:Q02080" /translation="MGRKKIQISRILDQRNRQVTFTKRKFGLMKKAYELSVLCDCEIA LIIFNSANRLFQYASTDMDRVLLKYTEYSEPHESRTNTDILETLKRRGIGLDGPELEP DEGPEEPGEKFRRLAGEGGDPALPRPRLYPAAPAMPSPDVVYGALPPPGCDPSGLGEA LPAQSRPSPFRPAAPKAGPPGLVHPLFSPSHLTSKTPPPLYLPTEGRRSDLPGGLAGP RGGLNTSRSLYSGLQNPCSTATPGPPLGSFPFLPGGPPVGAEAWARRVPQPAAPPRRP PQSASSLSASLRPPGAPATFLRPSPIPCSSPGPWQSLCGLGPPCAGCPWPTAGPGRRS PGGTSPERSPGTARARGDPTSLQASSEKTQQ" 3'UTR 1348..1473 polyA_signal 1444..1449 BASE COUNT 313 a 529 c 435 g 223 t ORIGIN 1 cggagccgga gatgcagctc aaggggaaga aagcgccgtg aagaacctgg tggacagcag 61 cgtctacttc cgcagcgtgg agggtctgct caaacaggcc atcagcatcc gggaccatat 121 gaatgccagt gcccagggcc acagcccgga ggaaccaccc ccgccctcct cagcctgatc 181 ctggaagaga ctcggggccc cccagcctcc gccaacccag acaaagatca ttccactcag 241 cctgggacga tggggaggaa aaaaatccag atctcccgca tcctggacca aaggaatcgg 301 caggtgacgt tcaccaagcg gaagttcggg ctgatgaaga aggcctatga gctgagcgtg 361 ctctgtgact gtgagatagc cctcatcatc ttcaacagcg ccaaccgcct cttccagtat 421 gccagcacgg acatggaccg tgtgctgctg aagtacacag agtacagcga gccccacgag 481 agccgcacca acactgacat cctcgagacg ctgaagcgga ggggcattgg cctcgatggg 541 ccagagctgg agccggatga agggcctgag gagccaggag agaagtttcg gaggctggca 601 ggcgaagggg gtgatccggc cttgccccga ccccggctgt atcctgcagc tcctgctatg 661 cccagcccag atgtggtata cggggcctta ccgccaccag gctgtgaccc cagtgggctt 721 ggggaagcac tgcccgccca gagccgccca tctcccttcc gaccagcagc ccccaaagcc 781 gggcccccag gcctggtgca ccctctcttc tcaccaagcc acctcaccag caagacacca 841 cccccactgt acctgccgac ggaagggcgg aggtcagacc tgcctggtgg cctggctggg 901 ccccgagggg gactaaacac ctccagaagc ctctacagtg gcctgcagaa cccctgctcc 961 actgcaactc ccggaccccc actggggagc ttccccttcc tccccggagg ccccccagtg 1021 ggggccgaag cctgggcgag gagggtcccc caacccgcgg cgcctccccg ccgacccccc 1081 cagtcagcat caagtctgag cgcctctctc cggcccccgg gggccccggc gactttccta 1141 agaccttccc ctatcccttg ctcctcgccc ggtccctggc agagcctctg cggcctgggc 1201 ccgccctgcg ccggctgccc ttggccgacg gctggccccg gtaggagatc acccggtggc 1261 accagcccag agcgctcgcc aggtacggcg agggcacgtg gggaccccac ctccctccag 1321 gcctcttcag agaagaccca acagtgacgc ccccctccgc ggtgggggct tggaggtggg 1381 cggctggact caatccaccc tggggggctc ctttccttct tcctatttgt gtgtatatcc 1441 acaaataaaa cgcgcgtggc gtccgtggac cagaaaaaaa aaaaaaaaaa aaaaaaaaaa // LOCUS HSXPCC 3455 bp RNA PRI 26-APR-1996 DEFINITION H.sapiens mRNA for xeroderma pigmentosum group C complementing factor (XP-C). ACCESSION X65024 NID g37995 KEYWORDS DNA repair factor; Xeroderma Pigmentosum Group C Complementing gene (XPCC); XP-C factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3455) AUTHORS Legerski,R.J. TITLE Direct Submission JOURNAL Submitted (11-MAR-1992) R.J. Legerski, The University of Texas, M.D.Anderson Cancer Center, Mol Genetics Box 11, 1515 Holcombe Blvd, Houston TX 77030, USA REFERENCE 2 (bases 1 to 3455) AUTHORS Legerski,R. and Peterson,C. TITLE Expression cloning of a human DNA repair gene involved in xeroderma pigmentosum group C JOURNAL Nature 359 (6390), 70-73 (1992) MEDLINE 92396218 REMARK Erratum:[Nature 1992 Dec 10;360(6404):610] [see comments] FEATURES Location/Qualifiers source 1..3455 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="cDNA library H8" /clone="pXPC-3" gene 192..2663 /gene="XPCC" CDS 192..2663 /gene="XPCC" /codon_start=1 /product="Xeroderma Pigmentosum Group C Complementing factor" /db_xref="PID:g37996" /db_xref="SWISS-PROT:Q01831" /translation="MNEDSNEEEEESENDWEEVEELSEPVLGDVRESTAFSRSLLPVK PVEIEIETPEQAKTRERSEKIKLEFETYLRRAMKRFNKGVHEDTHKVHLLCLLANGFY RNNICSQPDLHAIGLSIIPARFTRVLPRDVDTYYLSNLVKWFIGTFTVNAELSASEQD NLQTTLERRFAIYSARDDEELVHIFLLILRALQLLTRLVLSLQPIPLKSATAKGKKPS KERLTADPGGSSETSSQVLENHTKPKTSKGTKQEETFAKGTCRPSAKGKRNKGGRKKR SKPSSSEEDEGPGDKQEKATQRRPHGRERRVASRVSYKEESGSDEAGSGSDFELSSGE ASDPSDEDSEPGPPKQRKAPAPQRTKAGSKSASRTHRGSHRKDPSLPAASSSSSSSKR GKKMCSDGEKAEKRSIAGIDQWLEVFCEQEEKWVCVDCVHGVVGQPLTCYKYATKPMT YVVGIDSDGWVRDVTQRYDPVWMTVTRKCRVDAEWWAETLRPYQSPFMDREKKEDLEF QAKHMDQPLPTAIGLYKNHPLYALKRHLLKYEAIYPETAAILGYCRGEAVYSRDCVHT LHSRDTWLKKARVVRLGEVPYKMVKGFSNRARKARLAEPQLREENDLGLFGYWQTEEY QPPVAVDGKVPRNEFGNVYLFLPSMMPIGCVQLNLPNLHRVARKLDIDCVQAITGFDF HGGYSHPVTDGYIVCEEFKDVLLTAWENEQAVIERKEKEKKEKRALGNWKLLAKGLLI RERLKRRYGPKSEAAAPHTDAGGGLSSDEEEGTSSQAEAARILAASWPQNREDEEKQK LKGGPKKTKREKKAAASHLFPFEKL" BASE COUNT 970 a 814 c 966 g 705 t ORIGIN 1 gaaagaggaa aagaggctgc ggtcatcctg ggggttcagc agatggtcca gcaaaaaaga 61 aagtggccaa ggtgactgtt aaatctgaaa acctcaaggt tataaaggat gaagccctca 121 gcgatgggga tgacctcagg gactttccaa gtgacctcaa gaaggcacac catctgaaga 181 gaggggctac catgaatgaa gacagcaatg aagaagagga agaaagtgaa aatgattggg 241 aagaggttga agaacttagt gagcctgtgc tgggtgacgt gagagaaagt acagccttct 301 ctcgatctct tctgcctgtg aagccagtgg agatagagat tgaaacgcca gagcaggcga 361 agacaagaga aagaagtgaa aagataaaac tggagtttga gacatatctt cggagggcga 421 tgaaacgttt caataaaggg gtccatgagg acacacacaa ggttcacctt ctctgcctgc 481 tagcaaatgg cttctatcga aataacatct gcagccagcc agatctgcat gctattggcc 541 tgtccatcat cccagcccgc tttaccagag tgctgcctcg agatgtggac acctactacc 601 tctcaaacct ggtgaagtgg ttcattggaa catttacagt taatgcagaa ctttcagcca 661 gtgaacaaga taacctgcag actacattgg aaaggagatt tgctatttac tctgctcgag 721 atgatgagga attggtccat atattcttac tgattctccg ggctctgcag ctcttgaccc 781 ggctggtatt gtctctacag ccaattcctc tgaagtcagc aacagcaaag ggaaagaaac 841 cttccaagga aagattgact gcggatccag gaggctcctc agaaacttcc agccaagttc 901 tagaaaacca caccaaacca aagaccagca aaggaaccaa acaagaggaa acctttgcta 961 agggcacctg caggccaagt gccaaaggga agaggaacaa gggaggcaga aagaaacgga 1021 gcaagccctc ctccagcgag gaagatgagg gcccaggaga caagcaggag aaggcaaccc 1081 agcgacgtcc gcatggccgg gagcggcggg tggcctccag ggtgtcttat aaagaggaga 1141 gtgggagtga tgaggctggc agcggctctg attttgagct ctccagtgga gaagcctctg 1201 atccctctga tgaggattcc gaacctggcc ctccaaagca gaggaaagcc cccgctcctc 1261 agaggacaaa ggctgggtcc aagagtgcct ccaggaccca tcgtgggagc catcgtaagg 1321 acccaagctt gccagcggca tcctcaagct cttcaagcag taaaagaggc aagaaaatgt 1381 gcagcgatgg tgagaaggca gaaaaaagaa gcatagctgg tatagaccag tggctagagg 1441 tgttctgtga gcaggaggaa aagtgggtat gtgtagactg tgtgcacggt gtggtgggcc 1501 agcctctgac ctgttacaag tacgccacca agcccatgac ctatgtggtg ggcattgaca 1561 gtgacggctg ggtccgagat gtcacacaga ggtacgaccc agtctggatg acagtgaccc 1621 gcaagtgccg ggttgatgct gagtggtggg ccgagacctt gagaccatac cagagcccat 1681 ttatggacag ggagaagaaa gaagacttgg agtttcaggc aaaacacatg gaccagcctt 1741 tgcccactgc cattggctta tataagaacc accctctgta tgccctgaag cggcatctcc 1801 tgaaatatga ggccatctat cccgagacag ctgccatcct tgggtattgt cgtggagaag 1861 cggtctactc cagggattgt gtgcacactc tgcattccag agacacgtgg ctgaagaaag 1921 caagagtggt gaggcttgga gaagtaccct acaagatggt gaaaggcttt tctaaccgtg 1981 ctcggaaagc ccgacttgct gagccccagc tgcgggaaga aaatgacctg ggcctgtttg 2041 gctactggca gacagaggag tatcagcccc cagtggccgt ggacgggaag gtgccccgga 2101 acgagtttgg gaatgtgtac ctcttcctgc ccagcatgat gcctattggc tgtgtccagc 2161 tgaacctgcc caatctacac cgcgtggccc gcaagctgga catcgactgt gtccaggcca 2221 tcactggctt tgatttccat ggcggctact cccatcccgt gactgatgga tacatcgtct 2281 gcgaggaatt caaagacgtg ctcctgactg cctgggaaaa tgagcaggca gtcattgaaa 2341 ggaaggagaa ggagaaaaag gagaagcggg ctctagggaa ctggaagttg ctggccaaag 2401 gtctgctcat cagggagagg ctgaagcgtc gctacgggcc caagagtgag gcagcagctc 2461 cccacacaga tgcaggaggt ggactctctt ctgatgaaga ggaggggacc agctctcaag 2521 cagaagcggc caggatactg gctgcctcct ggcctcaaaa ccgagaagat gaagaaaagc 2581 agaagctgaa gggtgggccc aagaagacca aaagggaaaa gaaagcagca gcttcccacc 2641 tgttcccatt tgagaagctg tgagctgagc gcccactaga ggggcaccca ccagttgctg 2701 ctgccccact acaggcccca cacctgccct gggcatgccc agcccctggt ggtgggggct 2761 tctctgctga gaaggcaaac tgaggcagca tgcacggagg cggggtcagg ggagacgagg 2821 ccaagctgag gaggtgctgc aggtcccgtc tggctccagc ccttgtcaga ttcacccagg 2881 gtgaagcctt caaagctttt tgctaccaaa gcccactcac cctttgagct acagaacact 2941 ttgctaggag atactcttct gcctcctaga cctgttcttt ccatctttag aaacatcagt 3001 ttttgtatgg aagccaccgg gagatttctg gatggtggtg catccgtgaa tgcgctgatc 3061 gtttcttcca gttagagtct tcatctgtcc gacaagttca ctcgcctcgg ttgcggacct 3121 aggaccattt ctctgcaggc cacttacctt cccctgagtc aggcttacta atgctgccct 3181 cactgcctct ttgcagtagg ggagagagca gagaagtaca ggtcatctgc tgggatctag 3241 ttttccaagt aacattttgt ggtgacagaa gcctaaaaaa agctaaaatc aggaaagaaa 3301 aggaaaaata cgaattgaaa attaaggaaa tgttagtaaa atagatcagt gttaaactag 3361 attgtattca ttactagata aaatgtataa agctctctgt actaaggaga aatgactttt 3421 ataacatttt gagaaaataa taaagcattt atcta // LOCUS HSXPGAA 3854 bp RNA PRI 23-JUN-1993 DEFINITION H.sapiens mRNA for XP-G factor. ACCESSION X69978 NID g298110 KEYWORDS xeroderma pigmentosum. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3854) AUTHORS Clarkson,S.G. TITLE Direct Submission JOURNAL Submitted (11-JAN-1993) S.G. Clarkson, University of Geneva, Dept of Genetics & Microbiology, Centre Medical Universitaire, 9 avenue de Champel, 1211 Geneva 4, SWITZERLAND REFERENCE 2 (bases 1 to 3854) AUTHORS Scherly,D., Nouspikel,T., Corlet,J., Ucla,C., Bairoch,A. and Clarkson,S.G. TITLE Complementation of the DNA repair defect in xeroderma pigmentosum group G cells by a human cDNA related to yeast RAD2 JOURNAL Nature 363 (6425), 182-185 (1993) MEDLINE 93247645 FEATURES Location/Qualifiers source 1..3854 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="B cells" /cell_line="Raji, Mann" /clone_lib="cDNA in lambda gt10" /clone="2.1,2.2,7.3,7.4,20" gene 198..3758 /gene="XPGC" CDS 198..3758 /gene="XPGC" /note="RAD2 related protein" /codon_start=1 /product="XP-G factor" /db_xref="PID:g298111" /db_xref="SWISS-PROT:P28715" /translation="MGVQGLWKLLECSGRQVSPEALEGKILAVDISIWLNQALKGVRD RHGNSIENPHLLTLFHRLCKLLFFRIRPIFVFDGDAPLLKKQTLVKRRQRKDLASSDS RKTTEKLLKTFLKRQAIKTAFRSKRDEALPSLTQVRRENDLYVLPPLQEEEKHSSEEE DEKEWQERMNQKQALQEEFFHNPQAIDIESEDFSSLPPEVKHEILTDMKEFTKRRRTL FEAMPEESDDFSQYQLKGLLKKNYLNQHIEHVQKEMNQQHSGHIRRQYEDEGGFLKEV ESRRVVSEDTSHYILIKGIQAKTVAEVDSESLPSSSKMHGMSFDVKSSPCEKLKTEKE PDATPPSPRTLLAMQAALLGSSSEEELESENRRQARGRNAPAAVDEGSISPRTLSAIK RALDDDEDVKVCAGDDVQTGGPGAEEMRINSSTENSDEGLKVRDGKGIPFTATLASSS VNSAEEHVASTNEGREPTDSVPKEQMSLVHVGTEAFPISDESMIKDRKDRLPLESAVV RHSDAPGLPNGRELTPASPTCTNSVSKNETHAEVLEQQNELCPYESKFDSSLLSSDDE TKCKPNSASEVIGPVSLQETSSIVSVPSEAVDNVENVVSFNAKEHENFLETIQEQQTT ESAGQDLISIPKAVEPMEIDSEESESDGSFIEVQSVISDEELQAEFPETSKPPSEQGE EELVGTREGEAPAESESLLRDNSERDDVDGEPQEAEKDAEDSLHEWQDINLEELETLE SNLLAQQNSLKAQKQQQERIAATVTGQMFLESQELLRLFGIPYIQAPMEAEAQCAILD LTDQTSGTITDDSDIWLFGARHVYRNFFNKNKFVEYYQYVDFHNQLGLDRNKLINLAY LLGSDYTEGIPTVGCVTAMEILNEFPGHGLEPLLKFSEWWHEAQKNPKIRPNPHDTKV KKKLRTLQLTPGFPNPAVAEAYLKPVVDDSKGSFLWGKPDLDKIREFCQRYFGWNRTK TDESLFPVLKQLDAQQTQLRIDSFFRLAQQEKEDAKRIKSQRLNRAVTCMLRKEKEAA ASEIEAVSVAMEKEFELLDKAKRKTQKRGITNTLEESSSLKRKRLSDSKRKNTCGGFL GETCLSESSDGSSSEHAESSSLMNVQRRTAAKEPKTSASDSQNSVKEAPVKNGGATTS SSSDSDDDGGKEKMVLVTARSVFGKKRRKLRRARGRKRKT" misc_feature 3366..3416 /gene="XPGC" /note="bipartite nuclear localisation signal" polyA_signal 3826..3831 polyA_site 3851 /note="in clone 2.2" BASE COUNT 1232 a 787 c 966 g 869 t ORIGIN 1 gccagagtct ctccgcttta atgcgctccc attagtgccg tcccccactg gaaaaccgtg 61 gcttctgtat tatttgccat ctttgttgtg taggagcagg gagggcttcc tcccggggtc 121 ctaggcggcg gtgcagtccg tcgtagaaga attagagtag aagttgtcgg ggtccgctct 181 taggacgcag ccgcctcatg ggggtccagg ggctctggaa gctgctggag tgctccgggc 241 ggcaggtcag ccccgaagcg ctggaaggga agatcctggc tgttgatatt agcatttggt 301 taaaccaagc acttaaagga gtccgggatc gccatgggaa ctcaatagaa aatcctcatc 361 ttctcacttt gtttcatcgg ctctgcaaac tcttattttt tcgaattcgt cctatttttg 421 tgtttgatgg ggatgctcca ctattgaaga aacagacttt ggtgaagaga aggcagagaa 481 aggacttagc gtccagtgac tccaggaaaa cgacagagaa gcttctgaaa acatttttga 541 aaagacaagc catcaaaact gccttcagaa gcaaaagaga tgaagcacta cccagtctta 601 cccaagttcg aagagaaaac gacctctatg ttttgcctcc tttacaagag gaagaaaaac 661 acagttcaga agaggaagat gaaaaagaat ggcaagaaag aatgaatcaa aaacaagcat 721 tacaggaaga gttctttcat aatcctcaag cgatagatat tgagtctgag gacttcagca 781 gcctgccccc tgaagtaaag catgaaatct tgactgatat gaaagagttc accaagcgca 841 gaagaacatt atttgaagca atgccagagg agtctgatga cttttcacag taccaactca 901 aaggcttgct taaaaagaac tatctgaacc agcatataga acatgtccaa aaggaaatga 961 atcagcaaca ttcaggacac atccgaaggc agtatgaaga tgaagggggc tttctgaagg 1021 aggtagagtc aaggagagtg gtctctgaag acacttcaca ttacatcttg ataaaaggta 1081 ttcaagctaa gacagttgca gaagtggatt cagagtctct tccttcttcc agcaaaatgc 1141 acggcatgtc ttttgacgtg aagtcatctc catgtgaaaa actgaagaca gagaaagagc 1201 ctgatgctac ccctccttct ccaagaactt tactagctat gcaagctgcc ctgctgggaa 1261 gtagctcaga agaggagctg gagagtgaaa atcgaaggca ggcccgtggg aggaacgcac 1321 ctgctgctgt agacgaaggc tccatatcac cccggactct ttcagccatt aagagagctc 1381 ttgacgatga cgaagatgta aaagtgtgtg ctggggatga tgtgcagacg ggagggccag 1441 gagcagaaga aatgcgtata aacagctcca ccgagaacag tgatgaagga cttaaagtga 1501 gagatggaaa aggaataccg tttactgcaa cacttgcgtc atctagtgtg aactctgcag 1561 aggagcacgt agccagcact aatgagggga gagagcccac agactcagtt ccaaaagaac 1621 aaatgtcact tgttcacgtg gggactgaag cctttccgat aagtgatgag tctatgatta 1681 aggacagaaa agatcggctg cctctggaga gtgcagtggt tagacatagt gacgcacctg 1741 ggctcccgaa tggaagggaa ctgacaccgg catctccaac ttgtacaaat tctgtgtcaa 1801 agaatgaaac acatgctgaa gtgcttgagc agcagaacga actttgccca tatgagagta 1861 aattcgattc ttctcttctt tcaagtgatg atgaaacaaa atgtaaaccg aattctgctt 1921 ctgaagtcat tggccctgtc agtttgcaag aaacaagtag catagtaagt gtcccttcag 1981 aggcagtaga taatgtggaa aatgtggtgt catttaatgc taaagagcat gagaattttc 2041 tggaaaccat ccaagaacag cagaccactg aatctgcagg ccaggattta atttccattc 2101 caaaggccgt ggaaccaatg gaaattgact cggaagaaag tgaatctgat ggaagtttca 2161 ttgaagtgca aagtgtgatt agtgatgagg aacttcaagc agaattccct gaaacttcca 2221 aacctccctc agaacaaggc gaagaggaac tggtaggaac tagggaggga gaagcccctg 2281 ctgagtccga gagcctcctg agggacaact ctgagaggga cgacgtggat ggtgagccac 2341 aggaagctga gaaagatgcg gaagattcgc tccatgaatg gcaagatatt aatttggagg 2401 agttggaaac tctggagagc aacctcttag cacagcagaa ttcactgaaa gctcaaaaac 2461 agcagcaaga acggatcgct gctactgtca ccggacagat gttcctggaa agccaggaac 2521 tcctgcgcct gttcggcatt ccctacatcc aggctcccat ggaagcagag gcgcagtgcg 2581 ccatcctgga cctgactgat cagacttccg gaaccatcac tgatgacagt gatatctggc 2641 tgtttggagc gcggcatgtc tatagaaact tttttaataa aaacaagttt gtagaatatt 2701 atcaatatgt ggactttcac aatcaattgg gattggaccg gaataagtta ataaatttgg 2761 cttatttgct tggaagtgat tataccgaag gaataccaac tgtgggttgt gtaaccgcca 2821 tggaaattct caatgaattc cctgggcatg gcctggaacc tctcctaaaa ttctcagaat 2881 ggtggcatga agctcaaaaa aatccaaaga taagacctaa tcctcatgac accaaagtga 2941 aaaaaaaatt acggacattg caactcaccc ctggctttcc taacccagct gttgccgagg 3001 cctacctcaa acccgtggtg gatgactcga agggatcctt tctgtggggg aaacctgatc 3061 tcgacaaaat tagagaattt tgtcagcggt atttcggctg gaacagaacg aagacagatg 3121 aatctctgtt tcctgtatta aagcaactcg atgcccagca gacacagctc cgaattgatt 3181 ccttctttag attagcacaa caggagaaag aagatgctaa acgtattaag agccagagac 3241 taaacagagc tgtgacatgt atgctaagga aagagaaaga agcagcagcc agcgaaatag 3301 aagcagtttc tgttgccatg gagaaagaat ttgagctact tgataaggca aaacgaaaaa 3361 cccagaagag aggcataaca aataccttag aagagtcatc aagcctgaaa agaaagaggc 3421 tttcagattc taaacgaaag aatacatgcg gtggattttt gggggagacc tgcctctcag 3481 aatcatctga tggatcttca agtgaacatg ctgaaagttc atctttaatg aatgtacaaa 3541 ggagaacagc tgcgaaagag ccaaaaacca gtgcttcaga ttcgcagaac tcagtgaagg 3601 aagctcccgt gaagaatgga ggtgcgacca ccagcagctc tagtgatagt gatgacgatg 3661 gagggaaaga gaagatggtc ctcgtgaccg ccagatctgt gtttgggaag aaaagaagga 3721 aactaagacg tgcgagggga agaaaaagga aaacctaatt aaaaaatatg tatcctctat 3781 aattagttat gacagccatt tgtaatgaat ttgtcgcaaa gacgtaataa aattaactgg 3841 tggcacggtc aaaa // LOCUS HSY08564 1737 bp DNA PRI 01-MAY-1997 DEFINITION H.sapiens GalNAc-T4 gene. ACCESSION Y08564 NID g1934911 KEYWORDS GalNAc-T4 gene; UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1737) AUTHORS Bennett,E.P. JOURNAL Unpublished REFERENCE 2 (bases 1 to 1737) AUTHORS Bennett,E.P. TITLE Direct Submission JOURNAL Submitted (01-OCT-1996) E.P. Bennett, Dental School, University Of Copenhagen, Norre Alle 20, 2200 Copenhagen N, DENMARK REMARK Revised by [3] REFERENCE 3 (bases 1 to 1737) AUTHORS Bennett,E.P. TITLE Direct Submission JOURNAL Submitted (10-APR-1997) E.P. Bennett, Dental School, University Of Copenhagen, Norre Alle 20, 2200 Copenhagen N, DENMARK FEATURES Location/Qualifiers source 1..1737 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="genomic P1 library, #6212" gene 1..1737 /gene="GalNAc-T4" CDS 1..1737 /gene="GalNAc-T4" /note="fourth member of the GalNAc transferase gene family" /codon_start=1 /product="UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase" /db_xref="PID:e307951" /db_xref="PID:g1934912" /translation="MAVRWTWAGKTCLLLAFLTVAYIFVELLVSTFHASAGAGRAREL GSRRLSDLQKNTEDLSRPLYKKPPADSRALGEWGKASKLQLNEDELKQQEELIERYAI NIYLSDRISLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLE TSPAVLLKEIILVDDLSDRVYLKTQLETYISNLDRVRLIRTNKREGLVRARLIGATFA TGDVLTFLYCHCECNSGWLEPLLERIGRYETAVVCPVIDTIDWNTFEFYMQIGEPMIG GFDWRLTFQWHSVPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVW GGENLELSFRVWQCGGKLEIHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE HFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHVPEDRPGWHGAIRS RGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFNSVTELCAEVPE QKNYVGMQNCPKDGFPVPANIIWHFKEDGTIFHPHSGLCLSAYRTPEGRPDVQMRTCD ALDKNQIWSFEK" BASE COUNT 482 a 354 c 437 g 464 t ORIGIN 1 atggcggtga ggtggacttg ggcaggcaag acctgcctgc tgctggcgtt tttaacagtg 61 gcctatatct tcgtggagct cttggtctct acttttcatg cctccgcagg agccggccgt 121 gccagggagc tggggtcaag aaggctctca gacctccaga aaaatacgga ggatttgtct 181 cgaccgcttt ataagaagcc ccctgcagat tcccgtgcac ttggggagtg ggggaaagcc 241 agcaaactcc agctcaacga ggatgaactg aagcagcaag aagaactcat tgagagatac 301 gccatcaata tttacctcag tgacaggatt tccctgcatc gacacataga ggataaaaga 361 atgtatgagt gtaagtccca gaagttcaac tataggacac ttcctaccac ctctgttatc 421 attgctttct ataacgaagc ctggtcgact ttgctccgta ccattcacag tgttttagaa 481 acttctcctg cagttctttt gaaagagatc atcttggtgg atgacttgag tgacagagtt 541 tatttgaaga cacaacttga aacttacatc agcaatcttg atagagtacg cttgattagg 601 accaataagc gagaggggct ggttagggcc cgtctgattg gggccacttt cgccactggg 661 gacgtcctca ctttcctgta ttgtcactgt gagtgtaatt ccggttggct ggaaccgctt 721 ttggaaagga ttgggagata tgaaacagca gttgtgtgtc ctgttataga cacaattgat 781 tggaatactt ttgaattcta tatgcagata ggggagccca tgattggtgg gtttgactgg 841 cgtttaacat ttcagtggca ttctgtcccc aaacaggaaa gggacaggcg gatatcaaga 901 attgacccca tcagatcacc taccatggct ggaggactgt ttgctgtcag caagaaatat 961 tttcagtacc ttggaacgta tgacacagga atggaagtgt ggggaggtga aaaccttgag 1021 ctgtctttta gggtgtggca gtgtggtggc aaattggaga tccacccgtg ttcccacgtg 1081 ggccatgtgt tccccaagcg ggcaccatat gctcgcccca atttcctaca gaatactgct 1141 cgggcagcag aagtttggat ggatgaatac aaagagcact tctacaatag aaaccctcca 1201 gcaagaaaag aagcttatgg tgatatttct gaaagaaaat tactacgaga gcggttgaga 1261 tgcaagagct ttgactggta tttgaaaaac gtttttccta atttacatgt tccagaggat 1321 agaccaggct ggcatggggc tattcgcagt agagggatct cgtctgaatg tttagattat 1381 aattctcctg acaacaaccc cacaggtgct aacctttcac tgtttggatg ccatggtcaa 1441 ggaggcaatc aattctttga atatacttca aacaaagaaa taaggtttaa ttctgtgaca 1501 gagttatgtg cagaggtacc tgagcaaaaa aattatgtgg gaatgcaaaa ttgtcccaaa 1561 gatgggttcc ctgtaccagc aaacattatt tggcatttta aagaagatgg aactattttt 1621 cacccacact caggactgtg tcttagtgct tatcggacac cggagggccg acctgatgta 1681 caaatgagaa cttgtgatgc tctagataaa aatcaaattt ggagttttga gaaatag // LOCUS HSY09501 1012 bp RNA PRI 27-NOV-1996 DEFINITION H.sapiens mRNA for NADH-cytochrome b5 reductase. ACCESSION Y09501 NID g1695154 KEYWORDS NADH-cytochrome b5 reductase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1012) AUTHORS Voice,M.W. JOURNAL Unpublished REFERENCE 2 (bases 1 to 1012) AUTHORS Voice,M.W. TITLE Direct Submission JOURNAL Submitted (18-NOV-1996) M.W. Voice, University of Dundee, Biomedical Research Centre, Ninewells Hospital & Medical School, Dundee, DD1 9SY, UK FEATURES Location/Qualifiers source 1..1012 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="liver" CDS 20..925 /EC_number="1.6.2.2" /codon_start=1 /product="NADH-cytochrome-b5 reductase" /db_xref="PID:e283650" /db_xref="PID:g1695155" /translation="MGAQLSTLGHMVLFPVWFLYSLLMKLFQRSTPAITLESPDIKYP LRLIDREIISHDTRRFRFALPSPQHILGLPVGQHIYLSARIDGNLVVRPYTPISSDDD KGFVDLVIKVYFKDTHPKFPAGGKMSQYLESMQIGDTIEFRGPSGLLVYQGKGKFAIR PDKKSNPIIRTVKSVGMIAGGTGITPMLQVIRAIMKDPDDHTVCHLLFANQTEKDILL RPELEELRNKHSARFKLWYTLDRAPEAWDYGQGFVNEEMIRDHLPPPEEEPLVLMCGP PPMIQYACLPNLDHVGHPTERCFVF" BASE COUNT 207 a 351 c 271 g 183 t ORIGIN 1 gagcgcggcg cgggccacca tgggggccca gctcagcacg ttgggccata tggtgctctt 61 cccagtctgg ttcctgtaca gtctgctcat gaagctgttc cagcgctcca cgccagccat 121 caccctcgag agcccggaca tcaagtaccc gctgcggctc atcgaccggg agatcatcag 181 ccatgacacc cggcgcttcc gctttgccct gccgtcaccc cagcacatcc tgggcctccc 241 tgtcggccag cacatctacc tctcggctcg aattgatgga aacctggtcg tccggcccta 301 tacacccatc tccagcgatg atgacaaggg cttcgtggac ctggtcatca aggtttactt 361 caaggacacc catcccaagt ttcccgctgg agggaagatg tctcagtacc tggagagcat 421 gcagattgga gacaccattg agttccgggg ccccagtggg ctgctggtct accagggcaa 481 agggaagttc gccatccgac ctgacaaaaa gtccaaccct atcatcagga cagtgaagtc 541 tgtgggcatg atcgcgggag ggacaggcat caccccgatg ctgcaggtga tccgcgccat 601 catgaaggac cctgatgacc acactgtgtg ccacctgctc tttgccaacc agaccgagaa 661 ggacatcctg ctgcgacctg agctggagga actcaggaac aaacattctg cacgcttcaa 721 gctctggtac acgctggaca gagcccctga agcctgggac tacggccagg gcttcgtgaa 781 tgaggagatg atccgggacc accttccacc cccagaggag gagccgctgg tgctgatgtg 841 tggcccccca cccatgatcc agtacgcctg ccttcccaac ctggaccacg tgggccaccc 901 cacggagcgc tgcttcgtct tctgagggcc gggcaccggt cacacggcca ccgcccccgc 961 gcaccccacg ccctgttcac gctcacccag tcacctcccc acatcgcaca ct // LOCUS HSY10805 1435 bp RNA PRI 29-JAN-1997 DEFINITION H.sapiens mRNA for arginine methyltransferase, splice variant, 1435 bp. ACCESSION Y10805 NID g1808643 KEYWORDS arginine methyltransferase; HRMT1L2 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1435) AUTHORS Scott,H.S., Lalioti,M.D., Rossier,C. and Antonarakis,S.E. TITLE Isolation and mapping of two human genes (hHMT1 and hHMT2) homologous to a yeast arghinine methyltransferase JOURNAL Unpublished REFERENCE 2 (bases 1 to 1435) AUTHORS Scott,H.S. TITLE Direct Submission JOURNAL Submitted (27-JAN-1997) H.S. Scott, University Of Geneva Medical School, Department Of Genetics And Microbiology, 1 Rue Michel-Servet, 1211 Geneva, SWITZERLAND COMMENT Related sequence D66904. FEATURES Location/Qualifiers source 1..1435 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="19" /map="q" gene 190..1233 /gene="HRMT1L2" CDS 190..1233 /gene="HRMT1L2" /codon_start=1 /product="arginine methyltransferase" /db_xref="PID:e294099" /db_xref="PID:g1808644" /translation="MVGVAEVSCGQAESSEKPNAEDMTSKDYYFDSYAHFGIHEEMLK DEVRTLTYRNSMFHNRHLFKDKVVLDVGSGTGILCMFAAKAGARKVIGIVCSSISDYA VKIVKANKLDHVVTIIKGKVEEVELPVEKVDIIISEWMGYCLFYESMLNTVLYARDKW LAPDGLIFPDRATLYVTAIEDRQYKDYKIHWWENVYGFDMSCIKDVAIKEPLVDVVDP KQLVTNACLIKEVDIYTVKVEDLTFTSPFCLQVKRNDYVHALVAYFNIEFTRCHKRTG FSTSPESPYTHWKQTVFYMEDYLTVKTGEEIFGTIGMRPNAKNNRDLDFTIDLDFKGQ LCELSCSTDYRMR" BASE COUNT 304 a 421 c 414 g 296 t ORIGIN 1 gtgggcagcc gaggccgcga actgcatcat ggagaatttt gtagccacct tggctaatgg 61 gatgagcctc cagccgcctc ttgaagaagt aacccccctt tgcccttccc tgtgtctgcc 121 cccattttcc ttcccctccc ctccccagct gtgggctgag ctagagacgg ggtcagagag 181 actggagaga tggtaggcgt ggctgaggtg tcctgtggcc aggcggaaag cagtgagaag 241 cccaacgctg aggacatgac atccaaagat tactactttg actcctacgc acactttggc 301 atccacgagg agatgctgaa ggacgaggtg cgcaccctca cttaccgcaa ctccatgttt 361 cataaccggc acctcttcaa ggacaaggtg gtgctggacg tcggctcggg caccggcatc 421 ctctgcatgt ttgctgccaa ggccggggcc cgcaaggtca tcgggatcgt gtgttccagt 481 atctctgatt atgcggtgaa gatcgtcaaa gccaacaagt tagaccacgt ggtgaccatc 541 atcaagggga aggtggagga ggtggagctc ccagtggaga aggtggacat catcatcagc 601 gagtggatgg gctactgcct cttctacgag tccatgctca acaccgtgct ctatgcccgg 661 gacaagtggc tggcgcccga tggcctcatc ttcccagacc gggccacgct gtatgtgacg 721 gccatcgagg accggcagta caaagactac aagatccact ggtgggagaa cgtgtatggc 781 ttcgacatgt cttgcatcaa agatgtggcc attaaggagc ccctagtgga tgtcgtggac 841 cccaaacagc tggtcaccaa cgcctgcctc ataaaggagg tggacatcta taccgtcaag 901 gtggaagacc tgaccttcac ctccccgttc tgcctgcaag tgaagcggaa tgactacgtg 961 cacgccctgg tggcctactt caacatcgag ttcacacgct gccacaagag gaccggcttc 1021 tccaccagcc ccgagtcccc gtacacgcac tggaagcaga cggtgttcta catggaggac 1081 tacctgaccg tgaagacggg cgaggagatc ttcggcacca tcggcatgcg gcccaacgcc 1141 aagaacaacc gggacctgga cttcaccatc gacctggact tcaagggcca gctgtgcgag 1201 ctgtcctgct ccaccgacta ccggatgcgc tgaggcccgg ctctcccgcc ctgcacgagc 1261 ccaggggctg agcgttccta ggcggtttcg gggctccccc ttcctctccc tccctcccgc 1321 agaagggggt tttaggggcc tgggctgggg ggatggggag ggcacatcgt gactgtgttt 1381 ttcataactt atgtttttat atggttgcat ttacgccaat aaatcctcag ctggg // LOCUS HSY11416 2234 bp RNA PRI 02-SEP-1997 DEFINITION H.sapiens mRNA for P73. ACCESSION Y11416 NID g2370175 KEYWORDS p53 transcription factor; P73 gene; transcription factor; tumor suppressor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2234) AUTHORS Kaghad,M., Bonnet,H., Yang,A., Creancier,L., Biscan,J.C., Valent,A., Minty,A., Chalon,P., Lelias,J.M., Dumont,X., Ferrara,P., McKeon,F. and Caput,D. TITLE Monoallelically expressed gene related to p53 at 1p36, a region frequently deleted in neuroblastoma and other human cancers JOURNAL Cell 90 (4), 809-819 (1997) MEDLINE 97433090 REFERENCE 2 (bases 1 to 2234) AUTHORS Caput,D. TITLE Direct Submission JOURNAL Submitted (21-FEB-1997) D. Caput, Sanofi-Elf-Bio-Recherches, Labege Innopole - BP 137- Voie N 1, 31676, Labege Cedex, FRANCE FEATURES Location/Qualifiers source 1..2234 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="adenocarcinoma colon" /cell_line="HT29" /chromosome="1" /map="p36" mRNA <1..>2234 /evidence=experimental variation 78..175 /note="exon 2 deletion; second splice variant" /replace="" variation 81 /note="allelic variant" /replace="a" variation 91 /note="allelic variant" /replace="t" gene 111..2021 /gene="P73" CDS join(111..1594,1689..1704) /gene="P73" /note="first splice variant" /codon_start=1 /evidence=experimental /db_xref="PID:e308620" /db_xref="PID:g2370177" /translation="MAQSTATSPDGGTTFEHLWSSLEPDSTYFDLPQSSRGNNEVVGG TDSSMDVFHLEGMTTSVMAQFNLLSSTMDQMSSRAASASPYTPEHAASVPTHSPYAQP SSTFDTMSPAPVIPSNTDYPGPHHFEVTFQQSSTAKSATWTYSPLLKKLYCQIAKTCP IQIKVSTPPPPGTAIRAMPVYKKAEHVTDVVKRCPNHELGRDFNEGQSAPASHLIRVE GNNLSQYVDDPVTGRQSVVVPYEPPQVGTEFTTILYNFMCNSSCVGGMNRRPILIIIT LEMRDGQVLGRRSFEGRICACPGRDRKADEDHYREQQALNESSAKNGAASKRAFKQSP PAVPALGAGVKKRRHGDEDTYYLQVRGRENFEILMKLKESLELMELVPQPLVDSYRQQ QQLLQRPSHLQPPSYGPVLSPMNKVHGGMNKLPSVNQLVGQPPPHSSAATPNLGPVGP GMLNNHGHAVPANGEMSSSHSAQSMVSGSHCTPPPPYHADPSLVRTWGP" CDS 111..2021 /gene="P73" /function="tumor supressor" /codon_start=1 /product="P53-like transcription factor" /db_xref="PID:e308621" /db_xref="PID:g2370176" /translation="MAQSTATSPDGGTTFEHLWSSLEPDSTYFDLPQSSRGNNEVVGG TDSSMDVFHLEGMTTSVMAQFNLLSSTMDQMSSRAASASPYTPEHAASVPTHSPYAQP SSTFDTMSPAPVIPSNTDYPGPHHFEVTFQQSSTAKSATWTYSPLLKKLYCQIAKTCP IQIKVSTPPPPGTAIRAMPVYKKAEHVTDVVKRCPNHELGRDFNEGQSAPASHLIRVE GNNLSQYVDDPVTGRQSVVVPYEPPQVGTEFTTILYNFMCNSSCVGGMNRRPILIIIT LEMRDGQVLGRRSFEGRICACPGRDRKADEDHYREQQALNESSAKNGAASKRAFKQSP PAVPALGAGVKKRRHGDEDTYYLQVRGRENFEILMKLKESLELMELVPQPLVDSYRQQ QQLLQRPSHLQPPSYGPVLSPMNKVHGGMNKLPSVNQLVGQPPPHSSAATPNLGPVGP GMLNNHGHAVPANGEMSSSHSAQSMVSGSHCTPPPPYHADPSLVSFLTGLGCPNCIEY FTSQGLQSIYHLQNLTIEDLGALKIPEQYRMTIWRGLQDLKQGHDYSTAQQLLRSSNA ATISIGGSGELQRQRVMEAVHFRVRHTITIPNRGGPGGGPDEWADFGFDLPDCKARKQ PIKEEFTEAEIH" CDS 255..2021 /gene="P73" /note="second splice variant" /codon_start=1 /evidence=experimental /db_xref="PID:e339489" /db_xref="PID:g2370178" /translation="MDVFHLEGMTTSVMAQFNLLSSTMDQMSSRAASASPYTPEHAAS VPTHSPYAQPSSTFDTMSPAPVIPSNTDYPGPHHFEVTFQQSSTAKSATWTYSPLLKK LYCQIAKTCPIQIKVSTPPPPGTAIRAMPVYKKAEHVTDVVKRCPNHELGRDFNEGQS APASHLIRVEGNNLSQYVDDPVTGRQSVVVPYEPPQVGTEFTTILYNFMCNSSCVGGM NRRPILIIITLEMRDGQVLGRRSFEGRICACPGRDRKADEDHYREQQALNESSAKNGA ASKRAFKQSPPAVPALGAGVKKRRHGDEDTYYLQVRGRENFEILMKLKESLELMELVP QPLVDSYRQQQQLLQRPSHLQPPSYGPVLSPMNKVHGGMNKLPSVNQLVGQPPPHSSA ATPNLGPVGPGMLNNHGHAVPANGEMSSSHSAQSMVSGSHCTPPPPYHADPSLVSFLT GLGCPNCIEYFTSQGLQSIYHLQNLTIEDLGALKIPEQYRMTIWRGLQDLKQGHDYST AQQLLRSSNAATISIGGSGELQRQRVMEAVHFRVRHTITIPNRGGPGGGPDEWADFGF DLPDCKARKQPIKEEFTEAEIH" variation 1595..1688 /gene="P73" /note="exon 13 deletion; first splice variant" /replace="" BASE COUNT 462 a 798 c 646 g 328 t ORIGIN 1 aggggacgca gcgaaaccgg ggcccgcgcc aggccagccg ggacggacgc cgatgcccgg 61 ggctgcgacg gctgcagagc gagctgccct cggaggccgg cgtggggaag atggcccagt 121 ccaccgccac ctcccctgat gggggcacca cgtttgagca cctctggagc tctctggaac 181 cagacagcac ctacttcgac cttccccagt caagccgggg gaataatgag gtggtgggcg 241 gaacggattc cagcatggac gtcttccacc tggagggcat gactacatct gtcatggccc 301 agttcaatct gctgagcagc accatggacc agatgagcag ccgcgcggcc tcggccagcc 361 cctacacccc agagcacgcc gccagcgtgc ccacccactc gccctacgca caacccagct 421 ccaccttcga caccatgtcg ccggcgcctg tcatcccctc caacaccgac taccccggac 481 cccaccactt tgaggtcact ttccagcagt ccagcacggc caagtcagcc acctggacgt 541 actccccgct cttgaagaaa ctctactgcc agatcgccaa gacatgcccc atccagatca 601 aggtgtccac cccgccaccc ccaggcactg ccatccgggc catgcctgtt tacaagaaag 661 cggagcacgt gaccgacgtc gtgaaacgct gccccaacca cgagctcggg agggacttca 721 acgaaggaca gtctgctcca gccagccacc tcatccgcgt ggaaggcaat aatctctcgc 781 agtatgtgga tgaccctgtc accggcaggc agagcgtcgt ggtgccctat gagccaccac 841 aggtggggac ggaattcacc accatcctgt acaacttcat gtgtaacagc agctgtgtag 901 ggggcatgaa ccggcggccc atcctcatca tcatcaccct ggagatgcgg gatgggcagg 961 tgctgggccg ccggtccttt gagggccgca tctgcgcctg tcctggccgc gaccgaaaag 1021 ctgatgagga ccactaccgg gagcagcagg ccctgaacga gagctccgcc aagaacgggg 1081 ccgccagcaa gcgtgccttc aagcagagcc cccctgccgt ccccgccctt ggtgccggtg 1141 tgaagaagcg gcggcatgga gacgaggaca cgtactacct tcaggtgcga ggccgggaga 1201 actttgagat cctgatgaag ctgaaagaga gcctggagct gatggagttg gtgccgcagc 1261 cactggtgga ctcctatcgg cagcagcagc agctcctaca gaggccgagt cacctacagc 1321 ccccgtccta cgggccggtc ctctcgccca tgaacaaggt gcacgggggc atgaacaagc 1381 tgccctccgt caaccagctg gtgggccagc ctcccccgca cagttcggca gctacaccca 1441 acctggggcc cgtgggcccc gggatgctca acaaccatgg ccacgcagtg ccagccaacg 1501 gcgagatgag cagcagccac agcgcccagt ccatggtctc ggggtcccac tgcactccgc 1561 caccccccta ccacgccgac cccagcctcg tcagtttttt aacaggattg gggtgtccaa 1621 actgcatcga gtatttcacc tcccaagggt tacagagcat ttaccacctg cagaacctga 1681 ccattgagga cctgggggcc ctgaagatcc ccgagcagta ccgcatgacc atctggcggg 1741 gcctgcagga cctgaagcag ggccacgact acagcaccgc gcagcagctg ctccgctcta 1801 gcaacgcggc caccatctcc atcggcggct caggggaact gcagcgccag cgggtcatgg 1861 aggccgtgca cttccgcgtg cgccacacca tcaccatccc caaccgcggc ggcccaggcg 1921 gcggccctga cgagtgggcg gacttcggct tcgacctgcc cgactgcaag gcccgcaagc 1981 agcccatcaa ggaggagttc acggaggccg agatccactg agggcctcgc ctggctgcag 2041 cctgcgccac cgcccagaga cccaagctgc ctcccctctc cttcctgtgt gtccaaaact 2101 gcctcaggag gcaggacctt cgggctgtgc ccggggaaag gcaaggtccg gcccatcccc 2161 aggcacctca caggccccag gaaaggccca gccaccgaag ccgcctgtgg acagcctgag 2221 tcacctgcag aacc // LOCUS HSY11739 2697 bp RNA PRI 14-NOV-1997 DEFINITION H.sapiens mRNA for whn transcription factor. ACCESSION Y11739 NID g2315191 KEYWORDS transcription factor; whn gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2697) AUTHORS Schorpp,M., Hofmann,M., Dear,T.N. and Boehm,T. TITLE Characterization of mouse and human nude genes JOURNAL Immunogenetics 46 (6), 509-515 (1997) MEDLINE 98025083 REFERENCE 2 (bases 1 to 2697) AUTHORS Boehm,T. TITLE Direct Submission JOURNAL Submitted (11-MAR-1997) T. Boehm, German Cancer Center, Division 0425, Im Neuenheimer Feld 280, D-69120, Heidelberg, FRG COMMENT Related sequences Y11740-46 (genomic DNA molecule). FEATURES Location/Qualifiers source 1..2697 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17" /tissue_type="thymus" /map="q11-12" gene 30..1976 /gene="whn" CDS 30..1976 /gene="whn" /codon_start=1 /product="transcription factor" /db_xref="PID:e1173451" /db_xref="PID:g2315192" /translation="MVSLPPPQSDVTLPGPTRLEGERQGDLMQAPGLPGSPAPQSKHA GFSCSSFVSDGPPERTPSLPPHSPRIASPGPEQVQGHCPAGPGPGPFRLSPSDKYPGF GFEEAAASSPGRFLKGSHAPFHPYKRPFHEDVFPEAETTLALKGHSFKTPGPLEAFEE IPVDVAEAEAFLPGFSAEAWCNGLPYPSQEHGPQVLGSEVKVKPPVLESGAGMFCYQP PLQHMYCSSQPPFHQYSPGGGSYPIPYLGSSHYQYQRMAPQASTDGHQPLFPKPIYSY SILIFMVLKNSKTGSLPVSEIYNFMTEHFPYFKTAPDGWKNSVRHNLSLNKCFEKVEN KSGSSSRKGCLWALNPAKIDKMQEELQKWKRKDPIAVRKSMAKPEELDSLIGDKREKL GSPLLGCPPPGLSGSGPIRPLAPPAGLSPPLHSLHPAPGPIPGKNPLQDLLMGHTPSC YGQTYLHLSPGLAPPGPPQPLFPQPDGHLELRAQPGTPQDSPLPAHTPPSHSAKLLAE PSPARTMHDTLLPDGDLGTDLDAINPSLTDFDFQGNLWEQLKDDSLALDPLVLVTSSP TSSSMPPPQPPPHCFPPGPCLTETGSGAGDLAAPGSGGSGALGDLHLTTLYSAFMELE PTPPTAPAGPSVYLSPSSKPVALA" BASE COUNT 556 a 980 c 668 g 493 t ORIGIN 1 acggctttct ttgaggccag gactgggtga tggtgtcgct acccccgccg cagtctgacg 61 tcacgctgcc gggccccacc agactggagg gcgagcgcca aggggacctc atgcaggcac 121 cgggcctccc aggctcccct gccccacaga gtaagcatgc cggcttcagc tgctcgtcat 181 ttgtgtccga cggccctcca gagaggacac cctcactgcc cccacacagc ccccgcattg 241 cgtcaccagg gcccgagcaa gtccagggcc actgcccagc cggccccggc cctgggccct 301 tcaggctctc accctcagac aagtatcctg gctttggctt tgaggaggcc gcagcaagca 361 gccctgggcg attcctcaag ggcagccacg cgcccttcca cccgtacaag cggcctttcc 421 atgaggacgt cttcccagag gccgagacca ccctggccct caaaggacac tcctttaaga 481 ccccagggcc gctggaggcc ttcgaggaga tcccagtgga cgtggcggag gccgaggcct 541 tcctgcctgg cttctcagca gaggcctggt gtaacgggct cccctacccc agccaggagc 601 atggccccca agtcctgggt tcagaggtca aagtcaagcc cccagttctg gagagtggtg 661 ctgggatgtt ctgctaccag cctcccttgc agcatatgta ctgctcctcc cagcccccct 721 tccaccagta ctcgccaggt ggtggcagct accccatacc ctacctgggc tcctcacact 781 atcagtacca gcgaatggca ccccaggcca gcaccgatgg gcaccagcct ctcttcccaa 841 aacccatcta ttcctacagc atcctcatct tcatggtcct taagaacagt aaaactggga 901 gccttcccgt cagcgagatc tacaatttta tgacggagca ctttccttac ttcaagacag 961 cacccgatgg ctggaagaat tctgtccggc acaacctatc cctcaacaag tgcttcgaga 1021 aggtggagaa caaatcagga agttcctccc gcaagggctg cctgtgggcc ctcaatccgg 1081 ccaagatcga caagatgcaa gaggagctgc aaaaatggaa gaggaaagat cccattgctg 1141 tgcgcaaaag catggccaag ccagaagagc tggacagcct cattggagac aagagagaaa 1201 agctgggctc cccactcctg ggctgtccgc cccctgggct gtccggctca ggccccatcc 1261 ggcccctggc acccccagct ggcctctccc caccactgca ctcactccac ccagctccag 1321 gccccattcc tggcaagaac cccctgcagg acctacttat ggggcacaca ccctcctgct 1381 atgggcagac atacttgcac ctctcaccag gcctggcccc tcctggaccc ccgcagccat 1441 tgttcccaca gccggacggg caccttgagc tgcgggccca gccaggcacc ccccaggact 1501 cgcctctgcc tgcccacacc ccacccagcc acagtgccaa gctactggcc gagccttccc 1561 cagccaggac tatgcacgac accctgctgc cagatggaga ccttggcact gacctggatg 1621 ccatcaatcc ctcactcact gacttcgact tccagggaaa cctgtgggaa cagttgaagg 1681 atgatagctt ggccctcgac cccctggtac tggtgacctc atccccgaca tcatcttcga 1741 tgccaccacc ccagccacca cctcactgct tcccccctgg gccctgtctg acagagacag 1801 gcagtggggc aggtgacttg gcagccccgg gcagtggtgg ctccggggca ctgggtgacc 1861 tgcacctcac caccctctac tctgccttta tggagctgga gcccacgccc cccacggccc 1921 ctgcaggccc ctctgtgtac ctcagcccca gctccaagcc cgtggccctg gcatgagctg 1981 tgcccagctt cgtcagctcc agcgtttgcc tggtctggaa gtcctggccg gccgcccaca 2041 tcgggctcac cttaaaggtc aaggaaggaa aatactacct gtcccctatg ccactaagcc 2101 aacgtgtgtg tcagctggta gctgggggcg cagaggacat cacctggggt gctgcctctc 2161 acacatttct gccacgtggt ggggcagctc ctcacccagg gcccccaaag agcaagcgtc 2221 tgggcaagag gaaaatgccc tgtccctagc tcacactcat ccacacttaa gccctcgtgc 2281 acacacacaa attattcaga tgtacaccca cccacatatc ttacagccag aggaaccagc 2341 actccatcac tgagagcccg acttcgtttc tggggcaact gagagctgag cgctttgctt 2401 accaaaagct cagggccctg tgccaggcca aagatccccc cagaccccca ttctgacatc 2461 cacatgctct gcagtcctgg ccccctcgtc attttctttc ccagaagcgc cctgtattta 2521 ttcccccatc ttcatcccaa cagcccagca agaaggagga gacagagagc tcctccctgg 2581 gttgtctgtg gaccccccca ggagctgcta attggcagca cccactcagc cattctctac 2641 ccatccttag tacatgctct gtccagcttt ccccagggtg acatacagaa ggggcaa // LOCUS HSY11997 3045 bp RNA PRI 08-JAN-1998 DEFINITION H.sapiens mRNA for A-kinase anchoring protein AKAP95. ACCESSION Y11997 NID g2765141 KEYWORDS akap95 gene; kinase A anchor protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3045) AUTHORS Larsen,T., Coglan,V., Orstavik,S., Holsve,C., Solberg,R., Skalhegg,B.S., Larsen,T., Coglan,V., Orstavik,S., Holsve,C., Solberg,R., Skalhegg,B.S., Lamb,N.J.C., Langeberg,L., Fernandez,A., Scott,J.D., Jahnsen,T. and Tasken,K. TITLE Molecular cloning, chromosomal localization and cell cycle-dependent subcellular distribution of AKAP95 JOURNAL Unpublished REFERENCE 2 (bases 1 to 3045) AUTHORS Tasken,K. TITLE Direct Submission JOURNAL Submitted (21-MAR-1997) K. Tasken, University of Oslo, Institute of Medical Biochemistry, P.O. Box 1112, Blindern, N-0317 Oslo, NORWAY FEATURES Location/Qualifiers source 1..3045 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="19" /clone_lib="Clontech Lambda gt11, cat. no. HL1128b" /clone="1" /clone_lib="Stratagene Lambda ZAP express, cat no. 939202" /clone_lib="Clonetech Maraton RACE-ready cDNA, cat. no. 7401-1" /clone="4.1" /map="p13.1-q12" 5'UTR <1..61 /gene="akap95" gene 1..3045 /gene="akap95" CDS 62..2140 /gene="akap95" /codon_start=1 /product="A-kinase anchoring protein 95" /db_xref="PID:e1227613" /db_xref="PID:g2765142" /translation="MDQGYGGYGAWSAGPANTQGAYGTGVASWQGYENYNYYGAQNTS VTTGATYSYGPASWEAAKANDGGLAAGAPAMHMASYGPEPCTDNSDSLIAKINQRLDM MSKEGGRGGSGGGGEGIQDRESSFRFQPFESYDSRPCLPEHNPYRPSYSYDYEFDLGS DRNGSFGGQYSECRDPARERGSLDGFMRGRGQGRFQDRSNPGTFMRSDPFVPPAASSE PLSTPWNELNYVGGRGLGGPSPSRPPPSLFSQSMAPDYGVMGMQGAGGYDSTMPYGCG RSQPRMRDRDRPKRRGFDRFGPDGTGRKRKQFQLYEEPDTKLARVDSEGDFSENDDAA GDFRSGDEEFKGEDELCDSGRQRGEKEDEDEDVKKRREKQRRRDRTRDRAADRIQFAC SVCKFRSFDDEEIQKHLQSKFHKETLRFISTKLPDKTVEFLQEYIVNRNKKIEKRRQE LMEKETAKPKPDPFKGIGQEHFFKKIEAAHCLACDMLIPAQPQLLQRHLHSVDHNHNR RLAAEQFKKTSLHVAKSVLNNRHIVKMLEKYLKGEDPFTSETVDPEMEGDDNLGGEDK KETPEEVAADVLAEVITAAVRAVDGEGAPAPESSGEPAEDEGPTDTAEAGSDPQAEQL LEEQVPCGTAHEKGVPKARSEAAEAGNGAETMAAEAESAQTRVAPAPAAADAEVEQTD AESKDAVPTE" 3'UTR 2141..>3045 /gene="akap95" BASE COUNT 772 a 785 c 866 g 622 t ORIGIN 1 tgaacgcatg cgtgctgtgg tcgcctagta aacggggctg ctggtgggcc gcgtcgaaga 61 catggaccag ggctacggag gctacggggc gtggagtgct ggacctgcca acacccaggg 121 tgcatatgga actggtgtgg ccagctggca aggttatgaa aactacaatt actatggcgc 181 ccagaacacc agtgtcacca caggcgcaac ctacagctac ggcccagcct cgtgggaggc 241 cgccaaggcc aatgatggcg gcctggcggc cggggcccct gccatgcaca tggcctctta 301 cggcccagag ccatgcaccg acaattccga ctccctcatt gccaagatca accagcgttt 361 ggacatgatg tccaaggaag gaggcagggg cgggagcggc ggcggtgggg agggcataca 421 ggaccgggag agctccttcc gcttccagcc gttcgagtcc tatgactcca ggccctgcct 481 gccggagcac aacccctacc gccccagcta cagctacgac tatgagttcg acctggggtc 541 cgaccgcaat ggcagctttg gggggcagta cagtgaatgc cgagacccag cccgggagcg 601 gggctccctt gatggcttca tgcggggccg gggccagggc cgcttccagg accggagcaa 661 ccctggcacc ttcatgcgca gcgacccctt cgtgcccccc gctgcgtcct ctgagcccct 721 gtccacgccc tggaacgagc tgaactacgt gggtggacgg ggcctgggag ggccctcccc 781 cagccggcca cctccgtccc tcttctccca gtccatggct cccgactacg gcgtgatggg 841 catgcagggg gcgggcggct atgacagcac catgccctac ggatgtggcc gctcgcagcc 901 tcggatgcgg gatcgggatc ggcccaagag gagagggttt gaccgcttcg gaccagatgg 961 cacgggcagg aaacggaagc agttccaact ttacgaggag ccagacacca aactggcccg 1021 ggttgacagt gaaggagatt tctccgaaaa tgatgacgca gctggtgact tccgctcagg 1081 agatgaagaa ttcaagggtg aggatgaact ctgcgactct gggaggcaaa gaggagagaa 1141 ggaggacgag gacgaggatg tgaagaagag aagggaaaag caaaggagaa gagacaggac 1201 gcgggaccgt gcagccgaca gaattcagtt tgcctgttct gtatgcaagt tccgtagctt 1261 tgatgacgaa gagatccaga agcatctgca aagcaaattt cacaaagaga ccctgcggtt 1321 cataagcacc aagctgcccg acaagaccgt ggagttcctc caggaataca ttgtaaacag 1381 aaataagaaa attgagaagc ggcgtcagga attgatggag aaagaaaccg caaaaccaaa 1441 accagatcct ttcaaaggga ttggccagga gcacttcttc aagaagatcg aggctgctca 1501 ctgcctggcc tgcgacatgc taattcctgc acagccgcag ctcctccagc ggcacctgca 1561 ctccgtggac cacaatcaca accgcaggtt ggctgctgaa cagttcaaga aaaccagtct 1621 ccatgtggct aagagtgttt tgaacaacag acatatagtg aagatgctgg aaaaatacct 1681 caagggtgag gaccctttca ccagtgaaac tgttgatcca gaaatggaag gagatgacaa 1741 tttaggaggt gaggataaga aagagacacc tgaggaggtg gccgcggacg tcttagcaga 1801 ggtgattaca gcagcagtga gggccgtaga tggggaagga gcgcccgctc cagagagcag 1861 cggggagccg gctgaggacg aaggccccac ggacacagcg gaggccggta gtgatcctca 1921 agccgaacag ctgctggaag agcaggtgcc ctgtggaacg gcacatgaga agggcgtccc 1981 caaggccaga agtgaggctg cagaggctgg aaatggcgcc gagacaatgg cagcagaggc 2041 agaaagtgcc caaaccagag ttgctcctgc cccagctgcc gcggatgctg aagtggaaca 2101 aactgatgca gagtctaaag acgctgttcc cacagaatga tgctcatttc cctgttccag 2161 ggaaggcgtt gggatgatgg atgcgttggt ctttctccct tggtttgtaa gcagtacaag 2221 ggcgtgtgct cccagaatat gctgtaatct aattttggtg aagagaccca gcgtttcctc 2281 ctgagcagtg cctctcacgg cttgtctcat gcagtcgtgt ggcttcttgc ccaggtttca 2341 aagctgaagt acattgtcct tagcggctgt aacatgtctc ttgacagtag tgcacttgga 2401 ataataaagg ttgggtgatt atatcttgat gatacattac ttgttcaata cagccactga 2461 tggaatgctt ccttttttat ttttttcctt aatttttttt tttatttggt tgggaacagc 2521 tgaatactag gaatatatct tgctctatag aggatttttt tttgtatgtt tcaagcttca 2581 gcctttaacc tatacctttg tagtgcacca tatggtgtgt gactttcaca ggacttcgca 2641 gcacctggtt cacatgtggc actgaccgcg tcacatccac gcactcccaa aggccagaag 2701 tatctgaccg acctacgcca ctggaaacac acccaccgca acctcaagaa ccagactgtg 2761 cagagggcat tgcgtcccaa tctttagtcc ttgctgaatc agttctctaa tattttacct 2821 catttgtgtt ccacctctag attacttcag gtttttttcc tttaaaatta gttactacca 2881 ctcaaatgta tttacaaaga gaatttggcc aggcacggtg atgcatacct ataatcccag 2941 cacttgggga ggccgtggtg agaggatagc ttaagcccag gagttcaaga ccaacctgga 3001 caacatagca agaccccatc tcttaaaaaa aaaaaaaaaa aaaaa // LOCUS HSY12478 1358 bp RNA PRI 10-OCT-1997 DEFINITION H.sapiens mRNA for CHD5 protein. ACCESSION Y12478 NID g1946204 KEYWORDS CHD5 gene; heart. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1358) AUTHORS Egeo,A., Mazzocco,M., Arrigo,P., Nizetic,D., Rasore-Quartino,A. and Scartezzini,P. JOURNAL Hum. Genet. In press REFERENCE 2 (bases 1 to 1358) AUTHORS Scartezzini,P. TITLE Direct Submission JOURNAL Submitted (07-APR-1997) P. Scartezzini, EO Ospedali Galliera, Department of Pediatrics, Via Mura delle Cappuccine 14, 16128 Genova, ITALY REMARK revised while preliminary 21-APR-97 FEATURES Location/Qualifiers source 1..1358 /organism="Homo sapiens" /db_xref="taxon:9606" /map="21q22.3" /dev_stage="embryo" /tissue_type="heart" gene 43..567 /gene="CHD5" CDS 43..567 /gene="CHD5" /codon_start=1 /product="congenital heart disease 5 protein" /db_xref="PID:e313002" /db_xref="PID:g1946205" /translation="MSSAAADHWAWLLVLSFVFGCNVLRILLPSISSFMSRVLQKDAE QESQMRAEIQDMKQELSTVNMMDEFARYARLERKINKMTDKLKTHVKARTAQLAKIKW VISVAFYVLQAALMISLIWKYYSVPVAVVPSKWITPLDRLVAFPTRVAGGVGITCWIL VCNKVVAIVLHPFS" BASE COUNT 394 a 253 c 285 g 425 t 1 others ORIGIN 1 tgagctgccg tagcggaccc agcacagcca ggagcgtccg ggatgagctc agccgcggcc 61 gaccactggg cgtggttgct ggtgctcagc ttcgtgtttg gatgcaatgt tcttaggatc 121 ctcctcccgt ccatctcatc cttcatgtcc agggtgctgc agaaggacgc ggagcaggag 181 tcacagatga gagcggagat ccaggacatg aagcaggagc tctccacagt caacatgatg 241 gacgagtttg ccagatatgc caggctggaa agaaagatca acaagatgac ggataagctc 301 aaaacccatg tgaaagctcg gacagctcaa ttagccaaga taaaatgggt gataagtgtc 361 gctttctacg tattgcaggc tgccctgatg atctcactca tttggaagta ttattctgtc 421 cctgtggctg tcgtgccgag taaatggata acccctctag accgcctggt agcctttcct 481 actagagtag caggtggtgt tggaattacc tgttggattt tagtctgtaa caaagttgtc 541 gctattgtgc ttcatccgtt cagctgaaca ggaggatgga tacagccgcg aggctaaaaa 601 acggatttcc tcttcctagc ttaaaatctg atttacactg ttttgttttt taagaaacaa 661 aagtgcatag tttagatttt ttttttgttg aatatgtttg ttcttggact ttatgagata 721 gtcttataag aatcacgatt ttctacacct gtcattgagc caagaaagtc cagtttatga 781 cacgtatgta ctagtgaaca ccgtcctcga tctgtcgaaa tgtgaaatgt ttagggacat 841 ctccatgctg tnacttgtga tttgccctct tatgtatttt ggtcatattg ccaactggaa 901 agtcaaaatt ttaacaactt taagtaagtt ctttgaagac ttagtgctgt ttttaatcca 961 gttagaaagt aacttaattt taataccgct actaaaaatt cgaaaatttc ttctttaatc 1021 acattcaata tggttaaaag aacaacacta attgacattg cgtgggcttt ttctcccttt 1081 gtttaaaatg tcatttgttg agcaagagtt gtatagtatt atctacttac ttgaaggctg 1141 ttaatttttc attacagtgt tttgtaaatg tatccacgag accatgatgt cattgttttg 1201 tgctcaactt gtgttttgta tttaaagcat tttgaatgaa gtgtatttta taagcattta 1261 atatttatgc tctttagaat ggaacacaga aaacaaacct tataagcctg attaaattaa 1321 tctgaaccaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa // LOCUS HSY13286 2274 bp mRNA PRI 06-FEB-1998 DEFINITION Homo sapiens mRNA for GDP dissociation inhibitor beta. ACCESSION Y13286 NID g2853173 KEYWORDS GDP dissociation inhibitor beta. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2274) AUTHORS Sedlacek,Z., Munstermann,E., Mincheva,A., Lichter,P. and Poustka,A. TITLE The human rab GDI beta gene with long retroposon-rich introns maps to 10p15 and its pseudogene to 7p11-p13 JOURNAL Mamm. Genome 9 (1), 78-80 (1998) MEDLINE 98096592 REFERENCE 2 (bases 1 to 2274) AUTHORS Sedlacek,Z. TITLE Direct Submission JOURNAL Submitted (19-MAY-1997) Z. Sedlacek, Deutsches Krebsforschungszentrum, Im Neuenheimer Feld 280, 69120 Heidelberg, FRG COMMENT Related sequence: D13988. FEATURES Location/Qualifiers source 1..2274 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="10" /dev_stage="adult" /tissue_type="skeletal muscle" /map="p15" mRNA 1..2274 /evidence=experimental CDS 153..1490 /codon_start=1 /product="GDP dissociation inhibitor beta" /db_xref="PID:e1250447" /db_xref="PID:g2853174" /translation="MNEEYDVIVLGTGLTECILSGIMSVNGKKVLHMDRNPYYGGESA SITPLEDLYKRFKIPGSPPESMGRGRDWNVDLIPKFLMANGQLVKMLLYTEVTRYLDF KVTEGSFVYKGGKIYKVPSTEAEALASSLMGLFEKRRFRKFLVYVANFDEKDPRTFEG IDPKKTTMRDVYKKFDLGQDVIDFTGHALALYRTDDYLDQPCYETINRIKLYSESLAR YGKSPYLYPLYGLGELPQGFARLSAIYGGTYMLNKPIEEIIVQNGKVIGVKSEGEIAR CKQLICDPSYVKDRVEKVGQVIRVICILSHPIKNTNDANSCQIIIPQNQVNRKSDIYV CMISFAHNVAAQGKYIAIVSTTVETKEPEKEIRPALELLEPIEQKFVSISDLLVPKDL GTESQIFISRTYDATTHFETTCDDIKNIYKRMTGSEFDFEEMKRKKNDIYGED" polyA_signal 2251..2256 polyA_site 2274 /evidence=experimental BASE COUNT 678 a 459 c 506 g 631 t ORIGIN 1 ggcggtcggt ctcgccttgt cgccagctcc attttcctct ctttctcttc ccctttcctt 61 cgcgcccaag agcgcctccc agcctcgtag ggtggtcacg gagcccctgc gccttttcct 121 tgctcgggtc ctgcgtccgc gcctgccccg ccatgaatga ggagtacgac gtgatcgtgc 181 tgggcaccgg cctgacggaa tgtatcctgt caggtataat gtcagtgaat ggcaagaaag 241 ttcttcatat ggatcgaaac ccttactacg gaggagaaag tgcatctata acaccattgg 301 aagatttata caaaagattt aaaataccag gatcaccacc cgagtcaatg gggagaggaa 361 gagactggaa tgttgacttg attcccaagt tccttatggc taatggtcag ctggttaaga 421 tgctgcttta tacagaggta actcgctatc tggattttaa agtgactgaa gggagctttg 481 tctataaggg tggaaaaatc tacaaggttc cttccactga agcagaagcc ctggcatcta 541 gcctaatggg attgtttgaa aaacgtcgct tcaggaaatt cctagtgtat gttgccaact 601 tcgatgaaaa agatccaaga acttttgaag gcattgatcc taagaagacc acaatgcgag 661 atgtgtataa gaaatttgat ttgggtcaag acgttataga ttttactggt catgctcttg 721 cactttacag aactgatgat tacttagatc aaccgtgtta tgaaaccatt aatagaatta 781 aactttacag tgaatctttg gcaagatatg gcaaaagccc atacctttat ccactctatg 841 gccttggaga actgccccaa ggatttgcaa ggctaagtgc tatttatgga ggtacctata 901 tgctgaataa acccattgaa gaaatcattg tacagaatgg aaaagtaatt ggtgtaaaat 961 ctgaaggaga aattgctcgc tgtaagcagc tcatctgtga ccccagctac gtaaaagatc 1021 gggtagaaaa agtgggccag gtgatcagag ttatttgcat cctcagccac cccatcaaga 1081 acaccaatga tgccaactcc tgccagatca ttattccaca gaaccaagtc aatcgaaagt 1141 cagatatcta cgtctgcatg atctcctttg cgcacaatgt agcagcacaa gggaagtaca 1201 ttgctatagt tagtacaact gtggaaacca aggagcctga gaaggaaatc agaccagctt 1261 tggagctctt ggaaccaatt gaacagaaat ttgttagcat cagtgacctc ctggtaccaa 1321 aagacttggg aacagaaagc cagatcttta tttcccgcac atatgatgcc accactcatt 1381 ttgagacaac gtgtgatgac attaaaaaca tctataagag gatgacagga tcagagtttg 1441 actttgagga aatgaagcgc aagaagaatg acatctatgg ggaagactaa cagcagtaca 1501 tgttattatg taattaggac acatttaaaa tttggcaaat aatgcatata atgaaatcaa 1561 tattgtaagg cctgcttttg taatgaaaat ggagagaatg aagagcgctg tgccagtaaa 1621 tactcccctt cacctttcta attattaact tgttttcatg gagtggctat tcagcattgg 1681 cagttaccac attctgttca atttaaccaa actggctttt ttttttttct agtgaagtta 1741 aacaaacatt gggataccga cacagacaac ttgagacagt ttttttaatc ttttaatcag 1801 tgtagctatg tgctgctgct gctcaagagc tggatccata caggttgtgt gtcatctgtc 1861 ctcttgaggt tcaggagatt ctaaattgaa tattccacgg tttgggacag catccggaag 1921 ttttccctat gactttatat tttgtattat gtcaaatgtt atggcagggc ccaaatagca 1981 tagccacaag tttggtttat gtgggcataa aattctaacc aaaccccaga catagggagt 2041 catttggaga aagcctgtat gtggtgtttt aacctaataa agttgatgag agagaagggg 2101 agaggaagcg aacataaagc gggtcaagtg tagtgcatct tttgtatctc aggcgtggct 2161 tcttcagtgg accaagctgc aatgcagtat tgacttgaca gagcctctac ttctgtctca 2221 aaatggctcc aaatgatttc tgtactgcaa aataaagcca aattctggaa acta // LOCUS HSY13936 1932 bp RNA PRI 04-AUG-1997 DEFINITION Homo sapiens mRNA for protein phosphatase 2C gamma. ACCESSION Y13936 NID g2315201 KEYWORDS protein phosphatase 2C gamma. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1932) AUTHORS Travis,S.M. TITLE Direct Submission JOURNAL Submitted (19-JUN-1997) Travis S.M., Internal Medicine, University of Iowa, 500 EMRB, Iowa City IA 52242 USA REFERENCE 2 (bases 1 to 1932) AUTHORS Travis,S.M. and Welsh,M.J. TITLE PP2C gamma: a human protein phosphatase with a unique acidic domain JOURNAL FEBS Lett. 412 (3), 415-419 (1997) MEDLINE 97420453 FEATURES Location/Qualifiers source 1..1932 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="Skeletal muscle" CDS 25..1665 /codon_start=1 /evidence=experimental /product="protein phosphatase 2C gamma" /db_xref="PID:e323054" /db_xref="PID:g2315202" /translation="MGAYLSQPNTVKCSGDGVGAPRLPLPYGFSAMQGWRVSMEDAHN CIPELDSETAMFSVYDGHGGEEVALYCAKYLPDIIKDQKAYKEGKLQKALEDAFLAID AKLTTEEVIKELAQIAGRPTEDEDEKEKVADEDDVDNEEAALLHEEATMTIEELLTRY GQNCHKGPPHSKSGGGTGEEPGSQGLNGEAGPEDSTRETPSQENGPTAKAYTGFSSNS ERGTEAGQVGEPGIPTGEAGPSCSSASDKLPRVAKSKFFEDSEDESDEAEEEEEDSEE CSEEEDGYSSEEAENEEDEDDTEEAEEDDEEEEEEMMVPGMEGKEEPGSDSGTTAVVA LIRGKQLIVANAGDSRCVVSEAGKALDMSYDHKPEDEVELARIKNAGGKVTMDGRVNG GLNLSRAIGDHFYKRNKNLPPEEQMISALPDIKVLTLTDDHEFMVIACDGIWNVMSSQ EVVDFIQSKISQRDENGELRLLSSIVEELLDQCLAPDTSGDGTGCDNMTCIIICFKPR NTAELQPESGKRKLEEVLSTEGAEENGNSDKKKKAKRD" BASE COUNT 500 a 468 c 584 g 380 t ORIGIN 1 tgaggccgcc ggccagccgc cgccatgggt gcctacctct cccagcccaa cacggtgaag 61 tgctccgggg acggggtcgg cgccccgcgc ctgccgctgc cctacggctt ctccgccatg 121 caaggctggc gcgtctccat ggaggatgct cacaactgta ttcctgagct ggacagtgag 181 acagccatgt tttctgtcta cgatggacat ggaggggagg aagttgcctt gtactgtgcc 241 aaatatcttc ctgatatcat caaagatcag aaggcctaca aggaaggcaa gctacagaag 301 gctttagaag atgccttctt ggctattgac gccaaattga ccactgaaga agtcattaaa 361 gagctggcac agattgcagg gcgacccact gaggatgaag atgaaaaaga aaaagtagct 421 gatgaagatg atgtggacaa tgaggaggct gcactgctgc atgaagaggc taccatgact 481 attgaagagc tgctgacacg ctacgggcag aactgtcaca agggccctcc ccacagcaaa 541 tctggaggtg ggacaggcga ggaaccaggg tcccagggcc tcaatgggga ggcaggacct 601 gaggactcaa ctagggaaac tccttcacaa gaaaatggcc ccacagccaa ggcctacaca 661 ggcttttcct ccaactcgga acgtgggact gaggcaggcc aagttggtga gcctggcatt 721 cccactggtg aggctgggcc ttcctgctct tcagcctctg acaagctgcc tcgagttgct 781 aagtccaagt tctttgagga cagtgaggat gagtcagatg aggcggagga agaagaggaa 841 gacagtgagg aatgcagcga ggaagaggat ggctacagca gtgaggaggc agagaatgag 901 gaagatgagg atgacaccga ggaggctgaa gaggacgatg aagaagaaga agaagagatg 961 atggtgccag ggatggaagg caaagaggag cctggctctg acagtggtac aacagcggtg 1021 gtggccctga tacgagggaa gcagttgatt gtagccaacg caggagactc tcgctgtgtg 1081 gtatctgagg ctggcaaagc tttagacatg tcctatgatc acaaaccaga ggatgaagta 1141 gaactagcac gcatcaagaa tgctggtggc aaggtcacca tggatgggcg agtcaacggg 1201 ggcctcaacc tctccagagc cattggggac cacttctata agagaaacaa gaacctgcca 1261 cctgaggaac agatgatttc agcccttcct gacatcaagg tgctgactct cactgacgac 1321 catgaattca tggtcattgc ctgtgatggc atctggaatg tgatgagcag ccaggaagtt 1381 gtagatttca ttcaatcaaa gatcagccag cgtgatgaaa atggggagct tcggttattg 1441 tcatccattg tggaagagct gctggatcag tgcctggcac cagacacttc tggggatggt 1501 acagggtgtg acaacatgac ctgcatcatc atttgcttca agccccgaaa cacagcagag 1561 ctccagccag agagtggcaa gcgaaaacta gaggaggtgc tctctactga gggggctgaa 1621 gaaaatggca acagcgacaa gaagaagaag gccaagcgag actagcagtc atccagaccc 1681 ctgcccacct agactgtttt ctgagccctc cggacctgag actgagtttt gtctttttcc 1741 tttagcctta gcagtgggta tgaggtgtgc agggggagct gggtggcttc actccgccca 1801 ttccaaagag ggctctccct ccacactgca gccgggagcc tctgctgtcc ttcccagccg 1861 cctctgctcc tcgggctcat caccggttct gtgcctgtgc tctgttgtgt tggagggaag 1921 gactggcggt tc // LOCUS HSY14391 1868 bp RNA PRI 08-JAN-1998 DEFINITION Homo sapiens mRNA for putative GTP-binding protein. ACCESSION Y14391 NID g2765410 KEYWORDS GTP-binding protein; PGPL gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1868) AUTHORS Gianfrancesco,F., Esposito,T., Montanini,L., Ciccodicola,A., Mumm,S., Mazzarella,R., Rao,E., Giglio,S., Rappold,G. and Forabosco,A. TITLE A novel pseudoautosomal gene encoding a putative GTP-binding protein resides in the vicinity of the Xp/Yp-telomere JOURNAL Unpublished REFERENCE 2 (bases 1 to 1868) AUTHORS Forabosco,A. TITLE Direct Submission JOURNAL Submitted (31-JUL-1997) A. Forabosco, Sezione Istologia, Embriologia e Genetica, Universita di Modena, Via Del Pozzo 71, 41100 Modena, ITALY REMARK revised by [3] REFERENCE 3 (bases 1 to 1868) AUTHORS Forabosco,A. TITLE Direct Submission JOURNAL Submitted (19-DEC-1997) A. Forabosco, Sezione Istologia, Embriologia e Genetica, Universita di Modena, Via Del Pozzo 71, 41100 Modena, ITALY FEATURES Location/Qualifiers source 1..1868 /organism="Homo sapiens" /note="maps to chromosomes X and Y" /db_xref="taxon:9606" /cell_line="uninduced male teratocarcinoma cell line" mRNA 1..1868 /gene="PGPL" gene 1..1868 /gene="PGPL" CDS 214..1542 /gene="PGPL" /note="putative" /codon_start=1 /product="GTP-binding protein" /db_xref="PID:e1227622" /db_xref="PID:g2765411" /translation="MRTRTPRRSCCGESLCCRRGPKRVCLVHPDVKWGPGKSQMTRAE WQVAEATALVHTLDGWSVVHTMVVSTKTPDRKLIFGKGNFEHLTEKIRGSPDITCVFL NVERMAAPTKKELEAAWGVEVFERFTVVLHIFRCNARTKEARLQVALAEMPLHRSNLK RDVAHLYRGVGSRYIMGSGESFMQLQQRLLREKEAKIRKALDRLRKKRHLLRRQRTRR EFPVISVVGYTNCGKTTLIKALTGDAAIQPRDQLFATLDVTAHAGTLPSRMTVLYVDT IGFLSQLPHGLIESFSATLEDVAHSDLILHVRDVSHPEAELQKCSVLSTLRGLQLPAP LLDSMVEVHNKVDLVPGYSPTEPNVVPVSALRGHGLQELKAELDAAVLKATGRQILTL RVRLAGAQLSWLYKEATVQEVDVIPEDGAADVRVIISNSAYGKFRKLFPG" BASE COUNT 357 a 577 c 632 g 302 t ORIGIN 1 gcgggccgcc gtacgcccgg ggctgcggct ctcccgcgtg ggccgcggcc gctcggctcc 61 gcgggcagcc gcgccgtcct gccccgcgcg cgcgctagcc gctgtcggcc gcaggagccc 121 cgggaatctg gaggggccgt ggggcggagg gaggggcctg cgggcggaag gcggacgaaa 181 acagaacgga agacgacaag gaggagccgg aagatgcgga cgagaacgcc gaggaggagc 241 tgctgcgggg agagcctctg ctgccggcgg ggacccaagc gcgtgtgtct ggttcaccct 301 gacgtcaagt ggggcccagg gaaatcgcag atgactcgag ccgagtggca ggtggcggag 361 gccacagcgc tggtgcacac gctggacggc tggtccgtgg tgcacacaat ggtcgtgtcc 421 accaaaacgc cggacaggaa gctcatcttt ggcaaaggga actttgagca cctgacagaa 481 aagatccgag ggtctccaga catcacgtgc gtcttcctga acgtggagag gatggctgcc 541 ccgaccaaga aagaactgga agccgcctgg ggcgtggagg tgtttgagcg cttcacggtc 601 gtcctgcaca tcttccgctg taacgcccgc acgaaggagg cccggcttca ggtggccctg 661 gcggagatgc cgctgcacag gtcgaacctg aaaagggacg tcgcccacct gtaccgagga 721 gtcggctcgc gctacatcat ggggtcagga gaatccttca tgcagctgca gcagcgtctc 781 ctgagagaga aggaggccaa gatcaggaag gccttggaca ggcttcgcaa gaagaggcac 841 ctgctccgcc ggcagcggac gaggcgggag ttccccgtga tctccgtggt ggggtacacc 901 aactgcggaa agaccacgct gatcaaggca ctgacgggcg atgccgccat ccagccacgg 961 gaccagctgt ttgccacgct ggacgtcacg gcccacgcgg gcacgctgcc ctcacgcatg 1021 accgtcctgt acgtggacac catcggcttc ctctcccagc tgccgcacgg cctcatcgag 1081 tccttctccg ccaccctgga agacgtggcc cactcggatc tcatcttgca cgtgagggac 1141 gtcagccacc ccgaggcgga gctccagaaa tgcagcgttc tgtccacgct gcgtggcctg 1201 cagctgcccg ccccgctcct ggactccatg gtggaggttc acaacaaggt ggacctcgtg 1261 cccgggtaca gccccacgga accgaacgtc gtgcccgtgt ctgccctgcg gggccacggg 1321 ctccaggagc tgaaagctga gctcgatgcg gcggttttga aggcgacggg gagacagatc 1381 ctcactctcc gtgtgaggct cgcaggggcg cagctcagct ggctgtataa ggaggccaca 1441 gttcaggagg tggacgtgat ccctgaggac ggggcggccg acgtgagggt catcatcagc 1501 aactcagcct acggcaaatt ccggaagctc tttccaggat gaacggacgc ccacagaggc 1561 ctgcggggtg ggggcattgc tgcctgggga gctgaggcgt taccgctgtg ttgggggcag 1621 cttggtgtca ggtgcagcag ggtcctcctt gtctggttct gcacccgtct cgctcccagc 1681 catttgctgg gatgaccgtg caggccggtg acacggccgc acctgcccca aagcgggccg 1741 cccgagcgtc cactccaagc ctgagcatcc acacaattcc agtgggccct cggtgcctgc 1801 tgtgaactgc tttccctcgg aatgtttccg taacaggaca ttaaaccttt gattttaaaa 1861 aaaaaaaa // LOCUS HSY15014 3054 bp mRNA PRI 14-JAN-1998 DEFINITION Homo sapiens mRNA for UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta- galactosyltransferase. ACCESSION Y15014 NID g2791314 KEYWORDS UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta- galactosyltransferase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3054) AUTHORS Kolbinger,F., Streiff,M.B. and Katopodis,A.G. TITLE Cloning of a human UDP-galactose:2-acetamido-2-deoxy-D-glucose 3beta-galactosyltransferase catalyzing the formation of type 1 chains JOURNAL J. Biol. Chem. 273 (1), 433-440 (1998) MEDLINE 98079080 REFERENCE 2 (bases 1 to 3054) AUTHORS Kolbinger,F. TITLE Direct Submission JOURNAL Submitted (30-SEP-1997) F. Kolbinger, Novartis Pharma AG, Transplantation Preclinical Research, CH-4002 Basel, SWITZERLAND FEATURES Location/Qualifiers source 1..3054 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /clone_lib="lgt10" CDS 890..2158 /codon_start=1 /product="UDP-galactose:2-acetamido-2-deoxy-D- glucose3beta-galactosyltransferase" /db_xref="PID:e1237254" /db_xref="PID:g2791315" /translation="MLQWRRRHCCFAKMTWNAKRSLFRTHLIGVLSLVFLFAMFLFFN HHDWLPGRAGFKENPVTYTFRGFRSTKSETNHSSLRNIWKETVPQTLRPQTATNSNNT DLSPQGVTGLENTLSANGSIYNEKGTGHPNSYHFKYIINEPEKCQEKSPFLILLIAAE PGQIEARRAIRQTWGNESLAPGIQITRIFLLGLSIKLNGYLQRAILEESRQYHDIIQQ EYLDTYYNLTIKTLMGMNWVATYCPHIPYVMKTDSDMFVNTEYLINKLLKPDLPPRHN YFTGYLMRGYAPNRNKDSKWYMPPDLYPSERYPVFCSGTGYVFSGDLAEKIFKVSLGI RRLHLEDVYVGICLAKLRIDPVPPPNEFVFNHWRVSYSSCKYSHLITSHQFQPSELIK YWNHLQQNKHNACANAAKEKAGRYRHRKLH" BASE COUNT 1104 a 537 c 545 g 868 t ORIGIN 1 cggcaaaaag acaacatact tagaaataaa gttaaataag gaggtaaaag actcgtacac 61 tgaaaactat aaaacattgg tgaaagaaat taaagaaatg aataagtgaa agatggccca 121 tgttcatgga ttagaagaac taatattgtt aaagtgtcca tactacccaa aatgatctac 181 aaattagatg caatccctat caaaactcca atggcatatt tacagggtta gaaaaaacaa 241 ttataaaatt cacatgaacc cacaaaagag ctcaaatggc caaatcaatt gtgagaaaaa 301 aagaacaaag ctggaggcat cacagttctt aatttcaaaa tatattagaa agctatggca 361 atgagaacag tatgctactg gcataaaaat agacatgtgg accgatggat cagaagagag 421 accccagtag gaaatttgca catatacagt gatctgatct gcaacaaggg tgccaagaat 481 aaacagtgct acaagcagaa cgggcaacta cagctctttt gtttaacgaa agagagaaaa 541 tgaaagaaag ggaaaatttc agaagactag gacccatatg aacaaggagg gtaactcgaa 601 gacaagcaga cagatggaca ctttggatac tgtgaaaagc aatcgcagga ggcagactgt 661 tgggggatgt gcgcatgttc gatagcatct tttttgctga agtgatggcg tgccaaaagt 721 attttcagtg ggcataatcc tcttcacata aatggcctga ccaaggaaga atgactacaa 781 gagagacaat gtgactgaat tagaaaatga ttgccaaaga atagtattaa ggagaagaaa 841 acatttttgg tcaccaatct ctcatatacc actactggat atttacaaca tgcttcagtg 901 gaggagaaga cactgctgct ttgcaaagat gacctggaat gccaaaaggt ctctgttccg 961 cactcatctt attggagtac tttctctagt gtttcttttt gctatgtttt tgtttttcaa 1021 tcatcatgac tggctgccag gcagagctgg attcaaagaa aaccctgtga catacacttt 1081 ccgaggattt cggtcaacaa aaagtgagac aaaccacagc tcccttcgga acatttggaa 1141 agaaacagtc cctcaaaccc tgaggcctca aacagcaact aactctaata acacagacct 1201 gtcaccacaa ggagttacag gcctggagaa tacacttagt gccaatggaa gtatttacaa 1261 tgaaaaaggt actggacatc caaattctta ccatttcaaa tatattatta atgagcctga 1321 aaaatgccaa gagaaaagtc cttttttaat actactaata gctgcagagc ctggacaaat 1381 agaagctaga agagctattc ggcaaacttg gggcaatgaa agtctagcac ctggtattca 1441 aatcacaaga atatttttgt tgggcttaag tattaagcta aatggctacc ttcaacgtgc 1501 aatactggaa gaaagcagac aatatcatga tataattcaa caggaatact tagatacgta 1561 ctataatttg accattaaaa cactaatggg catgaactgg gttgcaacat actgtccaca 1621 tattccatat gttatgaaaa ctgacagtga catgtttgtc aacactgaat atttaatcaa 1681 taagttactg aagccagatc tgcctcccag acataactat ttcactggtt acctaatgcg 1741 aggatatgca cccaatcgaa acaaagatag caagtggtac atgccaccag acctctaccc 1801 aagtgagcgt tatcctgtct tctgttctgg aactggttat gttttttctg gagatctggc 1861 agaaaagatt tttaaagttt ctttaggtat ccgccgtttg cacttggaag atgtatatgt 1921 agggatctgt cttgccaagt tgagaattga tcctgtaccc cctcccaatg agtttgtgtt 1981 caatcactgg cgagtctctt attcgagctg taaatacagc cacctaatta cctctcatca 2041 gttccagcct agtgaactga taaaatactg gaaccattta caacaaaata agcacaatgc 2101 ctgtgccaac gcagcaaaag aaaaggcagg caggtatcgc caccgtaaac tacattagaa 2161 aagacaattt tttttcaaat gtgcaatttg taaatattgc taaaagcatg tatagttaga 2221 actgattaca tccgtaggac aagttttagt taaaactcat cacataaaga aattcaagaa 2281 gtattttttt aatttctgaa gaagttaatt cttaaaacta taacattata taacaaaaag 2341 gtttcccaaa acaatctatt taaaaaactg tataaggaga ttctgtgtat taacatgcaa 2401 taacaagcat gcataaatca atggttcaag tcttctgtta ggggccaata aaatgtatct 2461 gcatatgttt tccacataaa ttttaattca agaaatgaca gtcaaaagat ccttcatttt 2521 agattaagct tttcatttta atatataatt taatgtaaat aaaacatcac tatcaatttt 2581 aaggaaacct tttaattgtg caaaggataa attttttgac ctattttagg gttctaaatg 2641 caataagatt tagttgagtt attccacaaa cacattataa agttcagatg tttcatcaat 2701 gcagttctca cgaaagtatt tactttttaa aaataactga gatattattt taaatttctt 2761 ttattaatac tttcttttat taatatatgg gggaaaatta ttttgacatg acgtggtaaa 2821 atgtgaaaaa ctaatgtgtc tcaggctcaa gtttttatag ttattaaatg tttcaaaata 2881 gacaagtttt gtttcctcat tgatgttaag aaccaaactc ctatttcaat gagttattgg 2941 attagaccaa ttactgcact cttaaacagc accaccattt aatttcatgt atatctaact 3001 tcgaatatat ctgtaaagat aatcgaagca aaagtaatca cttaaaggca cccg // LOCUS HSY15227 1003 bp RNA PRI 04-DEC-1997 DEFINITION Homo sapiens mRNA for leukemia associated gene 1. ACCESSION Y15227 NID g2664278 KEYWORDS Leu1 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1003) AUTHORS Liu,Y., Corcoran,M., Rasool,O., Ivanova,G., Ibbotson,R., Grander,D., Iyengar,A., Baranova,A., Kashuba,V., Merup,M., Wu,X., Gardiner,A., Mullenbach,R., Poltaraus,A., Hulstrom,A.L., Juliusson,G., Chapman,R., Tiller,M., Cotter,F., Gahrton,G., Yankovsky,N., Zabarovsky,E., Einhorn,S. and Oscier,D. TITLE Cloning of two candidate tumor suppressor genes within a 10 kb region on chromosome 13q14, frequently deleted in chronic lymphocytic leukemia JOURNAL Oncogene 15 (20), 2463-2473 (1997) MEDLINE 98055620 REFERENCE 2 (bases 1 to 1003) AUTHORS Ivanova,G.M. TITLE Direct Submission JOURNAL Submitted (24-OCT-1997) G.M. Ivanova, Radiumhemmet, Karolinska Hospital, Stockholm, S-17176, SWEDEN FEATURES Location/Qualifiers source 1..1003 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="13" /map="q14.3" exon 1..385 /number=1 gene 268..486 /gene="Leu1" CDS 268..486 /gene="Leu1" /codon_start=1 /db_xref="PID:e1202443" /db_xref="PID:g2664279" /translation="MRPCIWIHVHLKPPCRLVELLPFSSALQGLSHLSLGTTLPVILP ERNEEQNLQELSHNADKYQMGDCCKEEI" exon 386..980 /number=2 polyA_signal 963..968 polyA_site 981 BASE COUNT 288 a 207 c 217 g 291 t ORIGIN 1 gcacatgcgc agaatcatcg tggtgcaggg ctctcccttt gcttcttcgg ttgcagtcct 61 cttgcttctt gcgcgtgcgt gtagcgcttt tgcaaagccg cggaggtgaa gtgaacttag 121 aggttgtggg gccgaggggt cgtcttatag ctaccagccc acaggcattt agtctacgtt 181 gggaggtaaa caaatacggg tcctgcttag gagaaaagaa aaacgtctta cagccagtgt 241 ctaaactcca aacaacggaa tgtatcaatg agaccttgta tatggataca cgtgcattta 301 aaaccgccct gccggcttgt agagcttttg ccgttctcca gcgctttaca ggggttatcg 361 cacttaagcc tcggaacaac tttaccagtg attctaccag aaaggaatga agaacagaac 421 cttcaggaat tgagtcacaa tgcagacaaa tatcaaatgg gagattgttg caaggaagag 481 atttgatgat agtattttct actagccatt gggaagataa aaggagacag aagattgaag 541 cctttgccag ccattctttc cctttttgct tccaaactcc tcaactggga accttcatat 601 gtgcagtatt tatattggat catactggtg attataaaag ttcctaggag gctagaagag 661 ccaaccaaca gagaagggaa agcagtctgt tctgaacata gggacataag ttcattcatg 721 ccaagtatct ttccagcatg tttctcccat ttagaatatc tagcatgtaa ggcctttcaa 781 tattaatata agcccaatat cagctctttc tctttgtatt tcatctcttt ctactctcct 841 atttgtattt tgtgttccta tcaaagtgtc gtatctggga gatgacctgc cttatcctgt 901 tctataacag ttttgtttgg tgctgtgtct ttagaacagt gcctggcaca cagtaagcac 961 tcaataaatc tttgatgaat gaaaaaaaaa aaaaaaaaaa aaa // LOCUS HSY15228 1750 bp RNA PRI 04-DEC-1997 DEFINITION Homo sapiens mRNA for leukemia associated gene 2. ACCESSION Y15228 NID g2664280 KEYWORDS LEU2 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1750) AUTHORS Liu,Y., Corcoran,M., Rasool,O., Ivanova,G., Ibbotson,R., Grander,D., Iyengar,A., Baranova,A., Kashuba,V., Merup,M., Wu,X., Gardiner,A., Mullenbach,R., Poltaraus,A., Hulstrom,A.L., Juliusson,G., Chapman,R., Tiller,M., Cotter,F., Gahrton,G., Yankovsky,N., Zabarovsky,E., Einhorn,S. and Oscier,D. TITLE Cloning of two candidate tumor suppressor genes within a 10 kb region on chromosome 13q14, frequently deleted in chronic lymphocytic leukemia JOURNAL Oncogene 15 (20), 2463-2473 (1997) MEDLINE 98055620 REFERENCE 2 (bases 1 to 1750) AUTHORS Ivanova,G.M. TITLE Direct Submission JOURNAL Submitted (24-OCT-1997) G.M. Ivanova, Radiumhemmet, Karolinska Hospital, Stockholm, S-17176, SWEDEN FEATURES Location/Qualifiers source 1..1750 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="13" /map="q14.3" exon 1..156 /number=1 exon 157..253 /number=2 gene 241..495 /gene="Leu2" CDS 241..495 /gene="Leu2" /codon_start=1 /db_xref="PID:e1202445" /db_xref="PID:g2664281" /translation="MRLRFNNDRMKTTIKETTILSSAILTFLTYLMKMSFERCTARNK MFVNSPFYPRVDNYCTSSWKKFYLKCYFSLNTIKKEKKMT" exon 254..386 /gene="Leu2" /number=3 exon 387..1735 /number=4 polyA_signal 1717..1721 polyA_site 1735 BASE COUNT 582 a 264 c 279 g 625 t ORIGIN 1 gatgcctgat ctcatcaatc tagcgggaga gacaggataa cctgtccgag agtatagcgc 61 cacttatgac tccgccggaa aaattacttt aaaaatcgcc aaaaattact tggagcaaag 121 ggcagtccgg cggcgttcgc caaggtggcg cagtcggttt tgacctgtag cagagaacca 181 attctggaga acagcctcac ttctttgatt gaatacttac ataatgcatt ggaacatgac 241 atgagattaa ggtttaataa tgatagaatg aagaccacaa taaaagagac cacaatcctt 301 agctcagcaa ttcttacctt tcttacctat ttgatgaaga tgtcttttga aaggtgtact 361 gcaaggaaca aaatgtttgt aaattctccc ttttacccaa gggtggataa ttactgtacc 421 tcctcatgga aaaagtttta tttaaagtgt tatttctcat tgaatactat caaaaaggaa 481 aaaaaaatga cctaaacttt tgagatagat ttggctctag taagtattta gttatatcac 541 ttgcatatct gggagaagaa ataagagact atcatcagta cattcccatc tactaaaaaa 601 atttatttta cacatgtcaa gggattactt ataacttcca ttttattact aatagcttga 661 acccttttaa tgaagaccta actcctccac cagaaattta agtttatgtt cttactttgt 721 ttacttataa aatacatctc aggtatttcg gatgtctttt ttttttctaa gcctatatga 781 aatgaaaaat atattggcaa agtaaatgtt taaacctttt acgttaaaat tactttgaaa 841 gatgaaaagt tagtgctgtt tttgtcacgt tatactgaaa ttaaatgttt ataatttata 901 ttttgggttt atgtataaat catggaattt atgcaaaaat atgagtagta cagattctcc 961 tctaattctg taggactttg aataatgtga tatttttctt ataattggac ccttgtgttt 1021 tgaagaaatg ccaactgctt gaagaatctc cttgttattt gtattatttg ctatagggtt 1081 agatgttgag aaattctgct gacaaaaaat tttaagccag ttttacacta aatgttcctc 1141 agtctgatta atttgttatt ggatgtattc tgtatctttc ttttgtaatt tgtgactttt 1201 atccacttag cacgaatgat tctattaaag aaaatcatta ggaagtggta gaaactttaa 1261 atcgccccag agtttgcctg tttccatatt ttattatctt ataatcttcg ggagtgctta 1321 cacttatgga gctaacattt tcagagatac agcttcttat agtaacacta aaactttctt 1381 cctctttgga ctgaatacct ataattataa ctatatggta gtttaagttt ccttgtgatt 1441 agtcaaaaat accattttag tatgaagcaa tgaagtctat tatttgttgt cccataattg 1501 agaaagctta aatacacctt ttatgtaaga gtttagtaag attctagctt agtctacaca 1561 gatttttata tcaatttgtt tatattttta ttaatgtcat ttctggaagt gtgaaaatgt 1621 taatgttcaa caagcaacat taaaaataga tttgaaacat ttatatatag agaggtacac 1681 atttatttac tgtttaggta ctgaagatta tcacttaata aaaaatatat atcccaaaaa 1741 aaaaaaaaaa // LOCUS HSY15409 2040 bp RNA PRI 08-JAN-1998 DEFINITION Homo sapiens mRNA for putative glucose 6-phosphate translocase. ACCESSION Y15409 NID g2765460 KEYWORDS G6PT gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2040) AUTHORS Gerin,I., Veiga-da-Cunha,M., Achouri,Y., Collet,J.F. and Van Schaftingen,E. TITLE Sequence of a putative glucose 6-phosphate translocase, mutated in glycogen storage disease type Ib JOURNAL FEBS Lett. 419 (2-3), 235-238 (1997) MEDLINE 98088917 REFERENCE 2 (bases 1 to 2040) AUTHORS Gerin,I. TITLE Direct Submission JOURNAL Submitted (04-NOV-1997) I. Gerin, Universite Catholique de Louvain and International Institute of Cellular and Molecular Pathology, BCHM-GRM 75.39, 75, Avenue Hippocrate B-1200 Brussels, BELGIUM FEATURES Location/Qualifiers source 1..2040 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="bladder" /cell_line="LB831-BLC" gene 170..1459 /gene="G6PT" CDS 170..1459 /gene="G6PT" /note="putative" /codon_start=1 /product="glucose 6-phosphate translocase" /db_xref="PID:e1228711" /db_xref="PID:g2765461" /translation="MAAQGYGYYRTVIFSAMFGGYSLYYFNRKTFSFVMPSLVEEIPL DKDDLGFITSSQSAAYAISKFVSGVLSDQMSARWLFSSGLLLVGLVNIFFAWSSTVPV FAALWFLNGLAQGLGWPPCGKVLRKWFEPSQFGTWWAILSTSMNLAGGLGPILATILA QSYSWRSTLALSGALCVVVSFLCLLLIHNEPADVGLRNLDPMPSEGKKGSLKEESTLQ ELLLSPYLWVLSTGYLVVFGVKTCCTDWGQFFLIQEKGQSALVGSSYMSALEVGGLVG SIAAGYLSDRAMAKAGLSNYGNPRHGLLLFMMAGMTVSMYLFRVTVTSDSPKLWILVL GAVFGFSSYGPIALFGVIANESAPPNLCGTSHAIVGLMANVGGFLAGLPFSTIAKHYS WSTAFWVAEVICAASTAAFFLLRNIRTKMGRVSKKAE" misc_feature 1445..1450 /gene="G6PT" /note="encodes retention signal for transmembrane protein in the endoplasmic reticulum" BASE COUNT 413 a 561 c 561 g 505 t ORIGIN 1 caggcttaat gattgtccag aaggcggcta taaagggagc ctgggaggct gggtggagga 61 gggagcagaa aaaacccaac tcagcagatc tgggaactgt gagagcggca agcaggaact 121 gtggtcagag gctgtgcgtc ttggctggta gggcctgctc ttttctacca tggcagccca 181 gggctatggc tattatcgca ctgtgatctt ctcagccatg tttgggggct acagcctgta 241 ttacttcaat cgcaagacct tctcctttgt catgccatca ttggtggaag agatcccttt 301 ggacaaggat gatttggggt tcatcaccag cagccagtcg gcagcttatg ctatcagcaa 361 gtttgtcagt ggggtgctgt ctgaccagat gagtgctcgc tggctcttct cttctgggct 421 gctcctggtt ggcctggtca acatattctt tgcctggagc tccacagtac ctgtctttgc 481 tgccctctgg ttccttaatg gcctggccca ggggctgggc tggcccccat gtgggaaggt 541 cctgcggaag tggtttgagc catctcagtt tggcacttgg tgggccatcc tgtcaaccag 601 catgaacctg gctggagggc tgggccctat cctggcaacc atccttgccc agagctacag 661 ctggcgcagc acgctggccc tatctggggc actgtgtgtg gttgtctcct tcctctgtct 721 cctgctcatc cacaatgaac ctgctgatgt tggactccgc aacctggacc ccatgccctc 781 tgagggcaag aagggctcct tgaaggagga gagcaccctg caggagctgc tgctgtcccc 841 ttacctgtgg gtgctctcca ctggttacct tgtggtgttt ggagtaaaga cctgctgtac 901 tgactggggc cagttcttcc ttatccagga gaaaggacag tcagcccttg taggtagctc 961 ctacatgagt gccctggaag ttgggggcct tgtaggcagc atcgcagctg gctacctgtc 1021 agaccgggcc atggcaaagg cgggactgtc caactacggg aaccctcgcc atggcctgtt 1081 gctgttcatg atggctggca tgacagtgtc catgtacctc ttccgggtaa cagtgaccag 1141 tgactccccc aagctctgga tcctggtatt gggagctgta tttggtttct cctcgtatgg 1201 ccccattgcc ctgtttggag tcatagccaa cgagagtgcc cctcccaact tgtgtggcac 1261 ctcccacgcc attgtgggac tcatggccaa tgtgggcggc tttctggctg ggctgccctt 1321 cagcaccatt gccaagcact acagttggag cacagccttc tgggtggctg aagtgatttg 1381 tgcggccagc acggctgcct tcttcctcct acgaaacatc cgcaccaaga tgggccgagt 1441 gtccaagaag gctgagtgaa gagagtccag gttccggagc accatcccac ggtggccttc 1501 cccctgcacg ctctgcgggg agaaaaggag gggcctgcct ggctagccct gaacctttca 1561 ctttccattt ctgcgccttt tctctcaccc gggtggcgct ggaagttatc agtggctagt 1621 gaggtcccag ctccctgatc ctatgctcta tttaaaagat aacctttggc cttagactcc 1681 gttagctcct atttcctgcc ttcagacaaa caggaaactt ctgcagtcag gaaggctcct 1741 gtacccttct tcttttccta ggccctgtcc tgcccgcatc ctaccccatc cccacctgaa 1801 gtgaggctat ccctgcagct gcagggcact aatgaccctt gacttctgct gggtcctaag 1861 tcctctcagc agtgggcgac tgctgttgcc aatacctcag actccaggga aagagaggag 1921 gccatcattc tcactgtacc actaggcgca gttggatata ggtgggaaga aaaggtgact 1981 tgttatagaa gattaaaact agatttgata ctgaaaaaaa aaaaaaaaaa aaaaaaaaaa // LOCUS HSY16132 2006 bp RNA PRI 08-JAN-1998 DEFINITION Homo sapiens mRNA for angiopoietin-like factor. ACCESSION Y16132 NID g2765526 KEYWORDS angiopoietin; CDT6 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2006) AUTHORS Peek,R., van Gelderen,B.E., Bruinenberg,M. and Kijlstra,A. TITLE Molecular cloning of a new angiopoietin-like factor from the human cornea JOURNAL Unpublished REFERENCE 2 (bases 1 to 2006) AUTHORS Peek,R. TITLE Direct Submission JOURNAL Submitted (07-JAN-1998) R. Peek, The Netherlands Opthalmic Research, Institute, P O Box 12141, 1100 AC Amsterdam, NETHERLANDS FEATURES Location/Qualifiers source 1..2006 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="cornea" /clone_lib="lambda UNI-ZAP" CDS 8..1048 /note="angiopoietin-like factor" /codon_start=1 /product="CDT6" /db_xref="PID:e1228713" /db_xref="PID:g2765527" /translation="MLKKPLSAVTWLCIFIVAFVSHPAWLQKLSKHKTPAQPQLKAAN CCEEVKELKAQVANLSSLLSELNKKQERDWVSVVMQVMELESNSKRMESRLTDAESKY SEMNNQIDIMQLQAAQTVTQTSADAIYDCSSLYQKNYRISGVYKLPPDDFLGSPELEV FCDMETSGGGWTIIQRRKSGLVSFYRDWKQYKQGFGSIRGDFWLGNEHIHRLSRQPTR LRVEMEDWEGNLRYAEYSHFVLGNELNSYRLFLGNYTGNVGNDALQYHNNTAFSTKDK DNDNCLDKCAQLRKGGYWYNCCTDSNLNGVYYRLGEHNKHLDGITWYGWHGSTYSLKR VEMKIRPEDFKP" polyA_signal 1900..1905 polyA_signal 1935..1940 polyA_signal 1987..1992 BASE COUNT 581 a 459 c 500 g 466 t ORIGIN 1 acaaaagatg ctgaaaaagc ctctctcagc tgtgacctgg ctctgcattt tcatcgtggc 61 ctttgtcagc cacccagcgt ggctgcagaa gctctctaag cacaagacac cagcacagcc 121 acagctcaaa gcggccaact gctgtgagga ggtgaaggag ctcaaggccc aagttgccaa 181 ccttagcagc ctgctgagtg aactgaacaa gaagcaggag agggactggg tcagcgtggt 241 catgcaggtg atggagctgg agagcaacag caagcgcatg gagtcgcggc tcacagatgc 301 tgagagcaag tactccgaga tgaacaacca aattgacatc atgcagctgc aggcagcaca 361 gacggtcact cagacctccg cagatgccat ctacgactgc tcttccctct accagaagaa 421 ctaccgcatc tctggagtgt ataagcttcc tcctgatgac ttcctgggca gccctgaact 481 ggaggtgttc tgtgacatgg agacttcagg cggaggctgg accatcatcc agagacgaaa 541 aagtggcctt gtctccttct accgggactg gaagcagtac aagcagggct ttggcagcat 601 ccgtggggac ttctggctgg ggaacgaaca catccaccgg ctctccagac agccaacccg 661 gctgcgtgta gagatggagg actgggaggg caacctgcgc tacgctgagt atagccactt 721 tgttttgggc aatgaactca acagctatcg cctcttcctg gggaactaca ctggcaatgt 781 ggggaacgac gccctccagt atcataacaa cacagccttc agcaccaagg acaaggacaa 841 tgacaactgc ttggacaagt gtgcacagct ccgcaaaggt ggctactggt acaactgctg 901 cacagactcc aacctcaatg gagtgtacta ccgcctgggt gagcacaata agcacctgga 961 tggcatcacc tggtatggct ggcatggatc tacctactcc ctcaaacggg tggagatgaa 1021 aatccgccca gaagacttca agccttaaaa ggaggctgcc gtggagcacg gatacagaaa 1081 ctgagacacg tggagactgg atgagggcag atgaggacag gaagagagtg ttagaaaggg 1141 taggactgag aaacagccta taatctccaa agaaagaata agtctccaag gagcacaaaa 1201 aaatcatatg taccaaggat gttacagtaa acaggatgaa ctatttaaac ccactgggtc 1261 ctgccacatc cttctcaagg tggtagactg agtggggtct ctctgcccaa gatccctgac 1321 atagcagtag cttgtctttt ccacatgatt tgtctgtgaa agaaaataat tttgagatcg 1381 ttttatctat tttctctacg gcttaggcta tgtgagggca aaacacaaat ccctttgcta 1441 aaaagaccca tattattttg attctcaaag gataggcctt tgagtgttag agaaaggagt 1501 gaaggagcca ggtgggaaat ggtatttcta tttttaaact ccagtgaaat tatcttgagt 1561 ctacacatta tttttaaaac acaaaaattg ttcggctgga actgacccag gctggacttg 1621 cggggaggaa actccagggc actgcatctg gcgatcagac tctgagcact gcccctgctc 1681 gccttggtca tgtacagcac tgaaaggaat gaggcaccag caggaggtgg acagagtctc 1741 tcatggatgc cggcacaaaa ctgccttaaa atattcatag ttaatacagg tatatctatt 1801 tttatttact ttgtaagaaa caagctcaag gagcttcctt ttaaattttg tctgtaggaa 1861 atggttgaaa actgaaggta gatggtgtta tagttaataa taaatgctgt aaataagcat 1921 ctcactttgt aaaaataaaa tattgtggtt ttgttttaaa cattcaacgt ttcttttcct 1981 tctacaataa acactttcaa aatgtg // LOCUS HSYMTRNH 2779 bp RNA PRI 03-DEC-1996 DEFINITION H.sapiens mRNA for yeast methionyl-tRNA synthetase homologue. ACCESSION X94754 NID g1702931 KEYWORDS methionyl-tRNA synthetase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2779) AUTHORS Lage,H. and Dietel,M. TITLE Cloning of a human cDNA encoding a protein with high homology to yeast methionyl-tRNA synthetase JOURNAL Gene 178 (1-2), 187-189 (1996) MEDLINE 97080567 REFERENCE 2 (bases 1 to 2779) AUTHORS Lage,H. TITLE Direct Submission JOURNAL Submitted (22-DEC-1995) H. Lage, Humboldt University, Institute of Pathology, Charite, Schumannstr. 20/21, D- 10117 Berlin, FRG FEATURES Location/Qualifiers source 1..2779 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="EPG85-257RNOV" CDS 8..2710 /codon_start=1 /product="yeast methionyl-tRNA synthetase homolog" /db_xref="PID:e218477" /db_xref="PID:g1702932" /translation="MRLFVSDGVPGCLPVLAAAGRARGRAEVLISTVGPEDCVVPFLT RPKVPVLQVDSGNYLFSTSAICRYFFLLSGWEQDDLTNQWLEWEATELQPALSAPLYY LVVQGKKGEDVLGSVRRALTHIDHSLSRQNCPFLAGETESLADIVLWGAQYPLLQDPA YLPEELSALHSWFQTLSTQEPCQRAAETVLKQQGVLALRPYLQKQPQPSPAEGRAVTN EPEEEELATLSEEEIAMAVTAWEKGLESLPPLRPQQNPVLPVAGERNVLITSALPYVN NVPHLGNIIGCVLSADVFARYSRLRQWNTLYLCGTDEYGTATETKALEEGLTPQEICD KYHIIHADIYRWFNISFDIFGRTTTPQQTKITQDIFQQLLKRGFVLQDTVEQLRCEHC ARFLADRFVEGVCPFCGYEEARGDQCDKCGKLINAVELKKPQCKVCRSCPVVQSSQHL FLDLPKLEKRLEEWLGRTLPGSDWTPNAQFITRSWLRDGLKPRCITRDLKWGTPVPLE GFEDKVFYVWFDATIGYLSITANYTDQWERWWKNPEQVDLYQFMAKDNVPFHSLVFPC SALGAEDNYTLVSHLIATEYLNYEDGKFSKSRGVGVFGDMAQDTGIPADIWRFYLLYI RPEGQDSAFSWTDLLLKNNSELLNNLGNFINRAGMFVSKFFGGYVPEMVLTPDDQRLL GHVTLELQHYHQLLEKVRIRDALRSILTISRHGNQYIQVNEPWKRIKGSEADRQRAGT VTGLAVNIAALLSVMLQPYMPTVSATIQAQLQLPPPACSILLTNFLCTLPAGHQIGTV SPLFQKLENDQIESLRQRFGGGQAKTSPKPAVVETVTTAKPQQIQALMDEVTKQGNIV RELKAQKADKNEVAAEVAKLLDLKKQLAVAEGKPPEAPKGKKKK" BASE COUNT 670 a 717 c 763 g 629 t ORIGIN 1 cggcgaaatg agactgttcg tgagtgatgg cgtcccgggt tgcttgccgg tgctggccgc 61 cgccgggaga gcccggggca gagcagaggt gctgatcagc actgtaggcc cggaagattg 121 tgtggtcccg ttcctgaccc ggcctaaggt ccctgtcttg caggtggata gcggcaacta 181 cctcttctcc actagtgcaa tctgccgata tttttttttg ttatctggct gggagcaaga 241 tgacctcact aaccagtggc tggaatggga agcgacagag ctgcagccag ctttgtctgc 301 tcccctgtac tatttagtgg tccaaggcaa gaagggggaa gatgttcttg gttcagtgcg 361 gagagccctg actcacattg accacagctt gagtcgtcag aactgtcctt tcctggctgg 421 ggagacagaa tctctagccg acattgtttt gtggggagcc caatacccat tactgcaaga 481 tcccgcctac ctccctgagg agctgagtgc cctgcacagc tggttccaga cactgagtac 541 ccaggaacca tgtcagcgag ctgcagagac tgtactgaaa cagcaaggtg tcctggctct 601 ccggccttac ctccaaaagc agccccagcc cagccccgct gagggaaggg ctgtcaccaa 661 tgagcctgag gaggaggagc tggctaccct atctgaggag gagattgcta tggctgttac 721 tgcttgggag aagggcctag aaagtttgcc cccgctgcgg ccccagcaga atccagtgtt 781 gcctgtggct ggagaaagga atgtgctcat caccagtgcc ctcccttacg tcaacaatgt 841 cccccacctt gggaacatca ttggttgtgt gctcagtgcc gatgtctttg ccaggtactc 901 tcgcctccgc cagtggaaca ccctctatct gtgtgggaca gatgagtatg gtacagcaac 961 agagaccaag gctctggagg agggactaac cccccaggag atctgcgaca agtaccacat 1021 catccatgct gacatctacc gctggtttaa catttcgttt gatatttttg gtcgcaccac 1081 cactccacag cagaccaaaa tcacccagga cattttccag cagttgctga aacgaggttt 1141 tgtgctgcaa gatactgtgg agcaactgcg atgtgagcac tgtgctcgct tcctggctga 1201 ccgcttcgtg gagggcgtgt gtcccttctg tggctatgag gaggctcggg gtgaccagtg 1261 tgacaagtgt ggcaagctca tcaatgctgt cgagcttaag aagcctcagt gtaaagtctg 1321 ccgatcatgc cctgtggtgc agtcgagcca gcacctgttt ctggacctgc ctaagctgga 1381 gaagcgactg gaggagtggt tggggaggac attgcctggc agtgactgga cacccaatgc 1441 ccagtttatc acccgttctt ggcttcggga tggcctcaag ccacgctgca taacccgaga 1501 cctcaaatgg ggaacccctg tacccttaga aggttttgaa gacaaggtat tctatgtctg 1561 gtttgatgcc actattggct atctgtccat cacagccaac tacacagacc agtgggagag 1621 atggtggaag aacccagagc aagtggacct gtatcagttc atggccaaag acaatgttcc 1681 tttccatagc ttagtctttc cttgctcagc cctaggagct gaggataact ataccttggt 1741 cagccacctc attgctacag agtacctgaa ctatgaggat gggaaattct ctaagagccg 1801 cggtgtggga gtgtttgggg acatggccca ggacacgggg atccctgctg acatctggcg 1861 cttctatctg ctgtacattc ggcctgaggg ccaggacagt gctttctcct ggacggacct 1921 gctgctgaag aataattctg agctgcttaa caacctgggc aacttcatca acagagctgg 1981 gatgtttgtg tctaagttct ttgggggcta tgtgcctgag atggtgctca cccctgatga 2041 tcagcgcctg ctgggccatg tcaccctgga gctccagcac tatcaccagc tacttgagaa 2101 ggttcggatc cgggatgcct tgcgcagtat cctcaccata tctcgacatg gcaaccaata 2161 tattcaggtg aatgagccct ggaagcggat taaaggcagt gaggctgaca ggcaacgggc 2221 aggaacagtg actggcttgg cagtgaatat agctgccttg ctctctgtca tgcttcagcc 2281 ttacatgccc acggttagtg ccacaatcca ggcccagctg cagctcccac ctccagcctg 2341 cagtatcctg ctgacaaact tcctgtgtac cttaccagca ggacaccaga ttggcacagt 2401 cagtcccttg ttccaaaaat tggaaaatga ccagattgaa agtttaaggc agcgctttgg 2461 agggggccag gcaaaaacgt ccccgaagcc agcagttgta gagactgtta caacagccaa 2521 gccacagcag atacaagcgc tgatggatga agtgacaaaa caaggaaaca ttgtccgaga 2581 actgaaagca caaaaggcag acaagaacga ggttgctgcg gaggtggcga aactcttgga 2641 tctaaagaaa cagttggctg tagctgaggg gaaaccccct gaagccccta aaggcaagaa 2701 gaaaaagtaa aagaccttgg ctcatagaaa gtcactttaa tagataggga cagtaataaa 2761 taaatgtaca atctctata // LOCUS HSYPT31 701 bp RNA PRI 06-APR-1995 DEFINITION H.sapiens YPT3 mRNA. ACCESSION X79780 NID g763129 KEYWORDS ypt3 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 701) AUTHORS Zhu,A.X., Zhao,Y. and Flier,J.S. TITLE Molecular cloning of two small GTP-binding proteins from human skeletal muscle JOURNAL Biochem. Biophys. Res. Commun. 205 (3), 1875-1882 (1994) MEDLINE 95110337 REFERENCE 2 (bases 1 to 701) AUTHORS Zhu,A.X. TITLE Direct Submission JOURNAL Submitted (21-JUN-1994) A.X. Zhu, Beth Israel Hospital, Harvard Medical School, Div. of Endocrinology and Metabolism, 330 Brookline Ave, Boston, MA 02215, USA FEATURES Location/Qualifiers source 1..701 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="muscle" /cell_line="human fetal muscle" gene 7..663 /gene="YPT3" CDS 7..663 /gene="YPT3" /codon_start=1 /db_xref="PID:g763130" /translation="MGTRDDEYDYLFKVVLIGDSGVGKSNLLSRFTRNEFNLESKSTI GVEFATRSIQVDGKTIKAQIWDTAGQERYRRITSAYYRGAVGALLVYDIAKHLTYENV ERWLKELRDHADSNIVIMLVGNKSDLRHLRAVPTDEARAFAEKNNLSFIETSALDSTN VEEAFKNILTEIYRIVSQKQIADRAAHDESPGNNVVDISVPPTTDGQKPNKLQCCQNL " BASE COUNT 166 a 228 c 202 g 105 t ORIGIN 1 cggacaatgg ggacccggga cgacgagtac gactacctat tcaaagtggt gctcatcggg 61 gactcaggcg tgggcaagag caacctgctg tcgcgcttca cccgcaacga gttcaacctg 121 gagagcaaga gcaccatcgg cgtggagttc gccacccgca gcatccaggt ggacggcaag 181 accatcaagg cgcagatctg ggacaccgct ggccaggagc gctaccgccg catcacctcc 241 gcgtactacc gtggtgcagt gggcgccctg ctggtgtacg acatcgccaa gcacctgacc 301 tatgagaacg tggagcgctg gctgaaggag ctgcgggacc acgcagacag caacatcgtc 361 atcatgctgg tgggcaacaa gagtgacctg cgccacctgc gggctgtgcc aactgacgag 421 gcccgcgcct tcgcagaaaa gaacaacttg tccttcatcg agacctcagc cttggattcc 481 actaacgtag aggaagcatt caagaacatc ctcacagaga tctaccgcat cgtgtcacag 541 aaacagatcg cagaccgcgc tgcccacgac gagtccccgg ggaacaacgt ggtggacatc 601 agcgtgccgc ccaccacgga cggacagaag cccaacaagc tgcagtgctg ccagaacctg 661 tgacccctgc gtcctccacc cagcgtgcgt gcacgtcctc c // LOCUS HSYY1NFE1 1659 bp RNA PRI 18-JAN-1995 DEFINITION H.sapiens mRNA for YY1/NF-E1 protein. ACCESSION Z14077 NID g38010 KEYWORDS GLI-Krupple related protein; YY1/NF-E1 protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1655) AUTHORS Whitson,R.H., Huang,T., Dang,J. and Itakura,K. TITLE Observed and predicted DNA binding of a zinc finger protein which recognizes upstream sequences in the human cytomegalovirus major immediate early gene JOURNAL Unpublished REFERENCE 2 (bases 1 to 1659) AUTHORS Park,K. and Atchison,M.L. TITLE Isolation of a candidate repressor/activator, NF-E1 (YY-1, delta), that binds to the immunoglobulin kappa 3' enhancer and the immunoglobulin heavy-chain mu E1 site JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88 (21), 9804-9808 (1991) MEDLINE 92052179 REMARK (sites) REFERENCE 3 (bases 1 to 1659) AUTHORS Shi,Y., Seto,E., Chang,L.S. and Shenk,T. TITLE Transcriptional repression by YY1, a human GLI-Kruppel-related protein, and relief of repression by adenovirus E1A protein JOURNAL Cell 67 (2), 377-388 (1991) MEDLINE 92005716 REMARK (sites) REFERENCE 4 (bases 1 to 1659) AUTHORS Whitson,R.H. TITLE Direct Submission JOURNAL Submitted (13-JUL-1992) Robert H Whitson Jr, Molecular Genetics, Beckman Research, Institute of the City of Hope, 1400 East Duarte Road, Duarte, California, 91010, USA FEATURES Location/Qualifiers source 1..1659 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="foreskin" /cell_type="fibroblast" /clone_lib="HFF lambda gt 11 E. Huang" /clone="EH-2_8-1" CDS 256..1500 /citation=[1] /citation=[2] /citation=[3] /codon_start=1 /product="YY1 /NF-E1" /db_xref="PID:g38011" /db_xref="SWISS-PROT:P25490" /translation="MASGDTLYIATDGSEMPAEIVELHEIEVETIPVETIETTVVGEE EEEDDDDEDGGGGDHGGGGGHGHAGHHHHHHHHHHHPPMIALQPLVTDDPTQVHHHQE VILVQTREEVVGGDDSDGLRAEDGFEDQILIPVPAPAGGDDDYIEQTLVTVAAAGKSG GGGSSSSGGGRVKKGGGKKSGKKSYLSGGAGAAGGGGADPGNKKWEQKQVQIKTLEGE FSVTMWSSDEKKDIDHETVVEEQIIGENSPPDYSEYMTGKKLPPGGIPGIDLSDPKQL AEFARMKPRKIKEDDAPRTIACPHKGCTKMFRDNSAMRKHLHTHGPRVHVCAECGKAF VESSKLKRHQLVHTGEKPFQCTFEGCGKRFSLDFNLRTHVRIHTGDRPYVCPFDGCNK KFAQSTNLKSHILTHAKAKNNQ" misc_feature 1147..1215 /note="region coding for first zinc finger" misc_feature 1234..1296 /note="region coding for second zinc finger" misc_feature 1319..1386 /note="region coding for third zinc finger" misc_feature 1408..1476 /note="region coding for fourth zinc finger" BASE COUNT 408 a 489 c 508 g 253 t 1 others ORIGIN 1 cgaggcgagg cgaggagggg gagccgagac gcagcggccg aggcgggggc gcgggcgcac 61 cgaggcgagg gaggcgggga agccccgccg ccgccgcggc gcccgcccct tcccccgccg 121 cccgcgccct ctccccccgc cgcgctcgcc gccttcctcc ctcgcttcct tccccacggc 181 cggccgcctc ctcgcccgcc cgcccgcagc cgaggagccg aggccgccgg ggccgtggcg 241 gcggagccct cagccatggc ctcgggcgac accctctaca tcgccacgga cggctcggag 301 atgccggccg agatcgtgga gctgcacgag atcgaggtgg agaccatccc ggtggagacc 361 atcgagacca cagtggtggg cgaggaggag gaggaggacg acgacgacga ggacggcggc 421 ggtggcgacc acggcggcgg gggcggccac gggcacgccg gccaccacca ccaccaccat 481 caccaccacc accacccgcc catgatcgct ctgcagccgc tggtcaccga cgacccgacc 541 caggtgcacc accaccagga ggtgatcctg gtgcagacgc gcgaggaggt ggtgggcggc 601 gacgactcgg acgggctgcg cgccgaggac ggcttcgagg atcagattct catcccggtg 661 cccgcgccgg ccggcggcga cgacgactac attgaacaaa cgctggtcac cgtggcggcg 721 gccggcaaga gcggcggcgg cggctcgtcg tcgtcgggag gcggccgcgt caagaagggc 781 ggcggcaaga agagcggcaa gaagagttac ctcagcggcg gggccggcgc ggcgggcggc 841 ggcggcgccg acccgggcaa caagaagtgg gagcagaagc aggtgcagat caagaccctg 901 gagggcgagt tctcggtcac catgtggtcc tcagatgaaa aaaaagatat tgaccatgag 961 acagtggttg aagaacagat cattggagag aactcacctc ctgattattc agaatatatg 1021 acaggaaaga aacttcctcc tggaggaata cctggcattg acctctcaga tcccaaacaa 1081 ctggcagaat ttgctagaat gaagccaaga aaaattaaag aagatgatgc tccaagaaca 1141 atagcttgcc ctcataaagg ctgcacaaag atgttcaggg ataactcggc catgagaaaa 1201 catctgcaca cccacggtcc cagagtccac gtctgtgcag aatgtggcaa agcttttgtt 1261 gagagttcaa aactaaaacg acaccaactg gttcatactg gagagaagcc ctttcagtgc 1321 acgttcgaag gctgtgggaa acgcttttca ctggacttca atttgcgcac acatgtgcga 1381 atccataccg gagacaggcc ctatgtgtgc cccttcgatg gttgtaataa gaagtttgct 1441 cagtcaacta acctgaaatc tcacatctta acacatgcta aggccaaaaa caaccagtga 1501 aaagaagaga gaagaccctt ctcgaccacg gnaagcatct tccagaagtg tgattgggaa 1561 taaatatgcc tctcctttgt atattatttc taggaagaat tttaaaaatg aatcctacac 1621 acctaaggga catgttttga taaagtagta aaaaaaaaa // LOCUS HSZFPBF1 9020 bp RNA PRI 12-SEP-1993 DEFINITION Human PRDII-BF1 gene for a DNA-binding protein. ACCESSION X51435 NID g38017 KEYWORDS DNA-binding protein; PRDII-BF1 gene; zinc finger protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 9020) AUTHORS Fan,C.M. TITLE Direct Submission JOURNAL Submitted (11-JAN-1990) Fan C.-M., Harvard University, Dept of Biochemical & Molecular Biology, 7 Divinity Avenue, Cambridge MA 02138, U S A REFERENCE 2 (bases 1 to 9020) AUTHORS Fan,C.M. and Maniatis,T. TITLE A DNA-binding protein containing two widely separated zinc finger motifs that recognize the same DNA sequence JOURNAL Genes Dev. 4 (1), 29-42 (1990) MEDLINE 90169514 FEATURES Location/Qualifiers source 1..9020 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="MG63, HL60" /clone_lib="lambda gt11" /clone="PRDII-BF1" CDS 325..8478 /note="PRDII-BF1 protein (AA 1-2717)" /codon_start=1 /db_xref="PID:g38018" /db_xref="SWISS-PROT:P15822" /translation="MPRTKQIHPRNLRDKIEEAQKELNGAEVSKKEILQAGVKGTSES LKGVKRKKIVAENHLKKIPKSPLRNPLQAKHKQNTEESSFAVLHSASESHKKQNYIPV KNGKQFTKQNGETPGIIAEASKSEESVSPKKPLFLQQPSELRRWRSEGADPAKFSDLD EQCDSSSLSSKTRTDNSECISSHCGTTSPSYTNTAFDVLLKAMEPELSTLSQKGSPCA IKTEKLRPNKTARSPPKLKNSSMDAPNQTSQELVAESQSSCTSYTVHMSAAQKNEQGA MQSASHLYHQHEHFVPKSNQHNQQLPGCSGFTGSLTNLQNQENAKLEQVYNIAVTSSV GLTSPSSRSQVTPQNQQMDSASPLSISPANSTQSPPMPIYNSTHVASVVNQSVEQMCN LLLKDQKPKKQGKYICEYCNRACAKPSVLLKHIRSHTGERPYPCVTCGFSFKTKSNLY KHKKSHAHTIKLGLVLQPDAGGLFLSHESPKALSIHSDVEDSGESEEEGATDERQHDL GAMELQNVHIIKRMSNAETLLKSSFTPSSPENVIGDFLLQDRSAESQAVTELPKVVVH HVTVSPLRTDSPKAMDPKPELSSAQKQKDLQVTNVQPLSANMSQGGVSRLETNENSHQ KGDMNPLEGKQDSHVGTVHAQLQRQQATDYSQEQQGKLLSPRSLGSTDSGYFSRSESA DQTVSPPTPFARRFPAQNKTLEGVTDPLQLLSPRQHPLLCHREKALLLPGQMRPPLAT KTLEERISKLISDNEALVDDKQLDSVKPRRTSLSRRGSIDSPKSYIFKDSFQFDLKPV GRRTSSSSDIPKSPFTPTEKSKQVFLLSVPSLDCLPITRSNSMPTTGYSAVPANIIPP PHPLRGSQSFDDKIGAFYDDVFVSGPNAPVPQSGHPRTLVRQAAIEDSSANESHVLGT GQSLDESHQGCHAAGEAMSVRSKALAQGPHIEKKKSHQGRGTMFECETCRNRYRKLEN FENHKKFYCSELHGPKTKVAMREPEHSPVPGGLQPQILHYRVAGSSGIWEQTPQIRKR RKMKSVGDDEELQQNESGTSPKSSEGLQFQNALGCNPSLPKHSVTIRSDQQHKNIQLQ NSHIHLVARGPEQTMDPKLSTIMEQQISSAAQDKIELQRHGTGISVIQHTNSLSRPNS FDKPEPFERASPVSFQELNRTGNSGSLKVIGISQEESHPSRDGSHPHQLALSDALRGE LQESSRKSPSERHVLGQPSRLIRQHNIQVPEILVTEEPDRDLEAQCHDQEKSEKFSWP QRSETLSKLPTEKLPPKKKRLRLAEIEHSSTESSFDSTLSRSLSRESSLSHTSSFSAS LDIEDVSKTEASPKIDFLNKAEFLMIPAGLNTLNVPGCHREMRRTASEQINCTQTSME VSDLRSKSFDCGSITPPQTTPLTELQPPSSPSRVGVTGHVPLLERRRGPLVRQISLGI APDSHLSPVHPTSFQNTALPSVNAVPYQGPQLTSTSLAEFSANTLHSQTQVKDLQAET SNSSSTNVFPVQQLCDINLLNQIHAPPSHQSTQLSLQVSTQGSKPDKNSVLSGSSKSE DCFAPKYQLHCQVFTSGPSCSSNPVHSLPNQVISDPVGTDHCVTSATLPTKLIDSMSN SHPLLPPELRPLGSQVQKVPSSFMLPIRLQSSVPAYCFATLTSLPQILVTQDLPNQPI CQTNHSVVPISEEQNSVPTLQKGHQNALPNPEKEFLCENVFSEMSQNSSLSESLPITQ KISVGRLSPQQESSASSKRMLSPANSLDIAMEKHQKRAKDENGAVCATDVRPLEALSS RVNEASKQKKPILVRQVCTTEPLDGVMLEKDVFSQPEISNEAVNLTNVLPADNSSTGC SKFVVIEPISELQEFENIKSSTSLTLTVRSSPAPSENTHLSPLKCTDNNQERKSPGVK NQGDKVNIQEQSQRPVTSLSLFNIKDTQQLAFPSLKTTTNFTWCYLLRQKSLHLPQKD QKTSAYTDWTVSASNPNPLGLPTKVALALLNSKQNTGKSLYCQAITTHSKSDLLVYSS KWKSSLSKRALGNQKSTVVEFSNKDASEINSEQDKENSLIKSEPRRIKIFDGGYKSNE EYVYIRGRGRGKYICEECGIRCKKPSMLKKHIRTHTDVRPYHCTYCNFSFKTKGNLTK HMKSKAHSKKCVDLGISVGLIDEQDTEESDEKQRFSYERSGYDLEESDGPDEDDNENE DDDEDSQAESVLSATPSVTASPQHLPSRSSLQDPVSTDEDVRITDCFSGVHTDPMDVL PRALLTRMTVLSTAQSDYNRKTLSPGKARQRAARDENDTIPSVDTSRSPCHQMSVDYP ESEEILRSSMAGKAVAITQSPSSVRLPPAAAEHSPQTAAGMPSVASPHPDPQEQKQQI TLQPTPGLPSPHTHLFSHLPLHSQQQSRTPYNMVPVGGIHVVPAGLTYSTFVPLQAGP VQLTIPAVSVVHRTLGTHRNTVTEVSGTTNPAGVAELSSVVPCIPIGQIRVPGLQNLS TPGLQSLPSLSMETVNIVGLANTNMAPQVHPPGLALNAVGLQVLTANPSSQSSPAPQA HIPGLQILNIALPTLIPSVSQVAVDAQGAPEMPASQSKACETQPKQTSVASANQVSRT ESPQGLPTVQRENAKKVLNPPAPAGDHARLDGLSKMDTEKAASANHVKPKPELTSIQG QPASTSQPLLKAHSEVFTKPSGQQTLSPDRQVPRPTGLPRRQPTVHFSDVSSDDDEDR LVIAT" misc_feature 8793..8798 /note="polyA signal" BASE COUNT 2745 a 2164 c 1930 g 2181 t ORIGIN 1 cgctgctggg cggcggcagg gtggcggacg gagcggggga ccggggagcg gcggccgcga 61 ggaggttatg tttgtgtttg gggttgtcaa gtgaaggagg gatcccaggc gccgccgccg 121 ccgccgcggg ggtcgcgaga tcccgagccg cggccgccgc catcagcagc gcagtccagg 181 gccggctgca gcggcagctc cgccgggcgt cctggcagca gcacatggat taattgatgt 241 atgttgagtt tatggagctg ccttttggtg gcttgcttta tctgcagttt ttaagaagaa 301 aaagaaggcc ctgagtcaaa gaagatgcct cgaactaaac aaattcatcc cagaaatcta 361 agagacaaaa ttgaagaagc acaaaaagaa cttaatgggg cagaagtttc aaaaaaagaa 421 atcttacagg ctggtgttaa aggaacttcg gaatccctta aaggtgtgaa acgcaaaaag 481 atcgtagctg agaatcacct gaaaaaaata ccaaaatccc cactgagaaa tcctcttcag 541 gcaaaacata aacaaaatac agaagagtca tctttcgccg ttcttcatag tgcttcggag 601 tctcacaaga aacagaatta tattcctgta aaaaatggga agcagtttac caaacaaaat 661 ggagaaacac ctggaataat tgctgaagcc tcaaaatctg aagaatctgt ctccccaaag 721 aagcccttgt ttctgcagca accatctgaa ctgcgtagat ggagatccga aggcgctgat 781 cctgccaaat tcagtgacct cgatgaacaa tgtgactcaa gttccttgtc aagtaaaacc 841 aggactgaca atagcgaatg catctcttct cattgtggca ctacgtcccc ctcctataca 901 aacactgcat tcgatgtctt actgaaagca atggagccag aactgagcac cttgtcacaa 961 aagggctcac cttgtgcaat taagacagaa aaactgaggc caaataaaac tgcacgttcc 1021 cctcccaaat taaaaaacag ttcaatggat gccccaaatc agacttcaca ggaattggtt 1081 gctgaatcac agtcttcttg tacctcatac acagtccata tgtctgctgc tcagaagaat 1141 gagcaagggg caatgcagtc agcttctcat ttgtatcatc aacatgaaca ctttgttccc 1201 aaatccaacc aacataatca acagcttccg gggtgttcag gtttcacagg atcactgaca 1261 aatctgcaaa atcaagagaa tgccaaactt gaacaggttt ataatatagc agtgacatca 1321 tctgtaggcc taacttcacc ttccagtaga tctcaggtta ctcctcaaaa ccagcaaatg 1381 gattctgctt cacctttgtc aataagtccg gctaattcta cacagtcgcc ccccatgcca 1441 atctataatt caactcatgt tgcctctgtt gttaatcaaa gcgtagagca aatgtgcaat 1501 cttcttctga aagatcagaa gccaaaaaaa caaggaaaat atatttgtga gtattgcaat 1561 agagcatgtg caaagcctag tgtgctttta aagcatatcc gctcccacac tggagagcga 1621 ccctatccct gtgtgacttg tggattttca tttaagacta aaagtaatct gtataagcac 1681 aagaaatccc acgcacatac tatcaaactg ggtcttgtct tgcaaccaga tgctggtggc 1741 ttgttcttgt cccacgagtc ccccaaagca cttagtattc attcagacgt agaagacagt 1801 ggggagagcg aggaggaagg cgccactgat gagagacagc atgacctggg cgccatggag 1861 ctgcagaatg tgcacataat aaagaggatg tcaaatgctg aaactttact aaaatcaagc 1921 ttcactccaa gcagtccaga aaatgtgata ggtgactttt tgctacagga cagatctgca 1981 gaatcacaag ctgtgacaga gttaccgaaa gttgtggtcc accatgtcac tgtgtccccc 2041 ttaagaactg acagtccaaa ggccatggat cccaagcctg aactttctag tgcacaaaag 2101 cagaaggacc ttcaggtgac aaacgtacag ccactttcag ccaacatgtc ccagggtgga 2161 gtctccaggt tggagactaa tgagaattcc caccagaaag gcgacatgaa tccactggaa 2221 ggaaagcaag actctcacgt aggaacggta cacgcccagc tacaaaggca gcaggctacc 2281 gattactccc aagagcagca aggaaagctc ctgagtcctc gaagtttagg aagtacggat 2341 tctggttact tttcacgttc tgaaagtgcc gatcaaacag tgagtccacc aactcccttt 2401 gccagaaggt tcccagcaca gaacaagact ctggaaggag taacggaccc tctgcagctc 2461 ttgtcaccac gtcaacaccc tctgctttgc cacagggaaa aggcattgct tttaccaggt 2521 cagatgcgcc cacctttggc cacaaaaaca cttgaggagc ggatatcgaa gcttatctca 2581 gacaatgaag ctttggtaga tgacaagcaa ctggatagtg tgaagccgcg gagaacctca 2641 ctgtcaagac gaggaagcat tgattccccc aaatcataca tatttaaaga ttctttccag 2701 tttgatttaa aaccagtggg acggagaaca agttcaagct ctgatatacc gaagtcacct 2761 ttcaccccta ctgaaaaatc aaagcaagtg tttcttctgt ctgtaccttc acttgactgt 2821 ttacctatca caagaagtaa ttccatgccg accacaggtt attcagcagt acctgcaaat 2881 ataatacctc ctcctcatcc actaagagga agtcagtcat ttgatgacaa aattggcgct 2941 ttctatgatg atgtctttgt atcgggacct aacgctcctg tgccccagag tgggcatccc 3001 cgtacacttg tgagacaagc agccatagaa gactcttcag caaatgaaag tcatgttctt 3061 ggtactggac agtccctgga tgagagccac caaggatgcc atgctgctgg tgaagccatg 3121 tcagtgagga gcaaggcact ggcacaaggc ccacatatag aaaaaaagaa gtctcatcaa 3181 gggcgaggga caatgtttga gtgtgaaact tgtagaaaca ggtataggaa actggaaaat 3241 tttgaaaatc ataagaaatt ttactgttct gagttacatg gaccaaaaac aaaggtagcc 3301 atgagagaac ctgagcacag ccctgtgccc ggcggtctgc aacctcagat tctacactac 3361 agagtcgctg ggtcctccgg catctgggaa cagacgcccc agataagaaa aaggaggaaa 3421 atgaaaagtg ttggggatga tgaagaactt cagcaaaatg aaagtggaac atctccaaaa 3481 agttctgaag gccttcagtt tcagaatgct ctgggctgta atcccagttt gcctaaacat 3541 agtgttacca taagaagtga ccagcagcat aaaaatatac agttgcaaaa ctcccatatt 3601 caccttgttg ccaggggccc tgagcagacc atggatccca agctgtcgac catcatggaa 3661 caacagataa gttcagcagc ccaggacaag atagaactgc agagacacgg aactggaatc 3721 tctgtcatcc agcacaccaa ctccctgagc aggcccaact catttgacaa gcctgagcct 3781 tttgaaagag cctccccagt ttctttccag gagctgaata gaacggggaa ttccgggtct 3841 ctaaaagtga taggaatctc ccaagaggaa agtcaccctt ctcgggacgg gtctcatcct 3901 caccagcttg cactatcaga cgctctcaga ggagaacttc aggaaagctc cagaaagagt 3961 ccaagtgaac gacatgtgtt aggacagccc tcaagactta tccggcagca caacatccaa 4021 gttccagaga ttttggtcac agaagaacca gatcgagacc tggaagctca atgccatgat 4081 caagaaaagt cagagaagtt cagttggccc cagcgtagtg aaaccttgtc aaaattgcca 4141 acagagaaac tgccacccaa aaagaaaagg ctccgtctgg ctgagataga acattcctca 4201 acagaatcga gctttgattc cactctctcc aggagtctaa gtagggagag cagtttatct 4261 cacacttcaa gtttctcagc ctctttagac atagaggacg tttctaaaac ggaggcttcc 4321 cccaaaatcg attttctaaa taaagccgag tttcttatga ttccagctgg cttgaatact 4381 ctgaatgttc ctggatgtca ccgggaaatg aggcgtactg catcagaaca gattaattgc 4441 acgcaaacgt caatggaggt ctctgatctc agaagcaaat cattcgattg tggaagcatc 4501 accccacccc agacaacacc acttactgaa ttgcagcctc catcttcacc ttctcgagtg 4561 ggagtgactg ggcatgtgcc tctcttagaa agaaggagag gcccactggt acggcaaata 4621 tctttgggca tagccccaga tagtcatctg tctcctgtac acccaacatc tttccaaaat 4681 actgctcttc ccagtgtgaa tgcagtgcca tatcaggggc ctcagctcac tagtacatct 4741 ttagctgagt tttctgcaaa tactttgcac tctcagactc aggttaagga tctgcaggca 4801 gaaacatcaa actccagctc taccaacgtt tttcctgttc aacagctctg tgatatcaat 4861 ttgttaaatc aaatccatgc accgcctagc caccagagca cacagctatc tctgcaagtg 4921 tctacgcagg gtagcaagcc agataaaaat tctgttttat ctgggtcttc taaaagtgag 4981 gattgctttg ctcccaaata ccaattgcat tgtcaggttt tcacttcagg cccatcttgc 5041 tcttctaatc ctgtgcattc tttgccaaat caagttattt cagatccagt tggaacagat 5101 cattgtgtga catcagcaac attaccaacc aaattaattg acagcatgtc taattcgcat 5161 cctctgctac caccagagct caggcccctt ggaagtcagg tgcagaaggt gccatcatca 5221 ttcatgctgc ccatacgcct gcagagtagt gttcctgctt actgttttgc tacactcaca 5281 tccctgccac aaatactagt gacccaagat ctgcccaatc agccaatttg ccagactaat 5341 catagtgtag tgccaatcag tgaagaacaa aattctgtgc caacattaca aaaaggtcat 5401 cagaatgctt tgccaaaccc agagaaggaa tttctatgtg aaaatgtttt ttcagagatg 5461 agccaaaatt cttctctatc agaatccttg cccataactc agaaaatatc tgttggtcga 5521 ctttcccctc aacaagaatc ttcagcttcg agtaaaagga tgctttcccc agcaaatagt 5581 ttagacattg ccatggaaaa gcaccagaag cgggccaaag atgaaaatgg agctgtttgt 5641 gcaacagacg tgagaccttt agaggctttg agttcgagag ttaatgaagc tagtaaacag 5701 aagaagccta ttttagtgag acaggtttgt actacagagc ccctggacgg tgtgatgttg 5761 gaaaaggatg ttttttctca acctgaaatt agtaatgagg ctgttaattt gacaaatgtt 5821 ttaccagctg ataattcatc aacaggatgc tctaaatttg tcgttataga acctataagt 5881 gaattgcagg aatttgaaaa catcaagtca tccacatcat taactcttac agttcgaagt 5941 tcacctgctc cttcagaaaa tactcatctt tctcctttga aatgtacaga caataaccaa 6001 gaaaggaagt ctccaggggt taaaaatcaa ggtgacaaag tgaacatcca agagcaaagt 6061 caacggccag tcacttctct ttcattgttt aacatcaagg acacccagca gctggctttc 6121 cctagcctga aaactacaac caactttaca tggtgttatc tcttaaggca gaagtcgttg 6181 catttgcctc agaaggacca gaaaacttca gcctatactg attggacagt aagcgccagt 6241 aatccaaatc cactcggttt gcccacaaaa gttgcacttg ctctccttaa ttcaaaacag 6301 aacactggaa aatcactata ctgtcaagca ataactaccc attccaagtc agacttattg 6361 gtctattcaa gcaagtggaa aagcagctta agcaagagag cattaggtaa tcaaaagtcc 6421 acagtagttg aattcagcaa taaagatgcc tctgaaatta acagtgagca agataaagaa 6481 aattccttaa tcaaaagtga accaagaaga attaaaatat ttgatggagg atataagtca 6541 aatgaagagt atgtatatat ccgaggcagg ggaagaggaa aatacatttg tgaagaatgt 6601 ggaatacgtt gtaagaaacc tagcatgtta aagaaacaca tacgaaccca tacagatgtc 6661 cgcccctacc actgcactta ctgtaacttc tcctttaaga ctaaaggaaa tctgacaaaa 6721 cacatgaagt ccaaggcaca tagcaagaaa tgtgtggatt taggcatctc agtaggttta 6781 atagatgaac aggatacaga agaatcagat gaaaaacaga gattcagtta tgagcgatct 6841 ggatatgatc ttgaagaatc tgatggccca gatgaggatg acaatgaaaa tgaagacgat 6901 gatgaggaca gccaggctga atcagtcctg tcagccacac cctcagtcac agctagcccg 6961 cagcaccttc catctagaag tagccttcag gaccctgtga gtactgacga ggatgtcagg 7021 atcaccgatt gcttttctgg ggtacacacg gacccaatgg acgttctgcc cagggcgctg 7081 ctcaccagaa tgactgtcct gagcacagca cagtctgact acaataggaa gacactctct 7141 ccggggaagg ccaggcagcg tgctgcgaga gatgaaaacg acacaattcc gtctgtagac 7201 acttccaggt ccccgtgtca tcagatgtct gtggactacc ctgagtcaga agaaattctg 7261 agaagttcta tggcaggaaa agctgttgct ataacacaga gcccatcatc tgtaagactt 7321 cctcctgctg cagctgagca cagcccccag acagcagcgg ggatgccttc tgtggcctca 7381 ccacatcctg accctcaaga acagaagcag caaataactc tacagccgac tccaggcttg 7441 ccttctcccc acactcattt gtttagccac cttcctttgc attcccagca gcaatcgagg 7501 acaccttata atatggttcc agttgggggg atccatgtgg tacctgctgg cctcacatac 7561 tccacgtttg tgccccttca ggctggacca gtgcagctca cgatccctgc tgtcagtgtc 7621 gttcacagaa ctttgggtac tcataggaat acggtcacag aagtgtctgg cactacaaac 7681 cctgctggag tggctgaatt aagcagtgtt gtgccatgta ttcctatcgg ccaaatccgc 7741 gtgccaggcc ttcagaacct aagtacccca ggcttgcagt cactcccctc gttaagcatg 7801 gaaaccgtca atattgtagg cctagccaat acaaatatgg ccccacaagt ccatccacca 7861 ggactggctc tgaatgctgt cggactgcag gttctgactg caaacccttc atcacaaagc 7921 agccccgccc ctcaggcaca cattccaggt ctccagatct tgaacatagc attgcccacc 7981 ttaatcccct cagtcagtca agtagccgtt gatgcacagg gagctccaga aatgccagct 8041 tcccaaagca aagcatgcga gacacaaccc aagcagactt ctgtagccag cgcaaaccag 8101 gtcagcagga ccgagtctcc tcaggggtta cctacagtcc agcgggaaaa tgcaaaaaaa 8161 gttctgaatc cacctgcccc tgcaggtgac catgcaaggc ttgatggcct gagtaaaatg 8221 gacacagaga aggctgcctc ggcaaatcac gtgaagccca agcctgaact cacttccata 8281 cagggccaac cagcgtccac gtcacaacct ctgctgaagg cacattctga agtttttaca 8341 aagccctcag gccagcagac tctctctcca gacagacagg ttcccaggcc cacaggacta 8401 ccgcggaggc agcccactgt gcacttcagc gacgtgagca gcgatgatga cgaggacagg 8461 cttgtgatag caacctgatg gattttattt tttatttgct ttttttttat ataacactta 8521 aaggtttctt tgaaaaccct cctttcctta aagcacattt ttctgacata aactcatgac 8581 taatctttgt gcaatcatga acttttgacc aataattgtt gttttgtgtc agctccagcc 8641 atttttgtac atgttgtata gacaattgtg ccttttagga gctttatgtt tagaaactgt 8701 acagattgtt gaatatctat atacataaaa atatattata tatgtatatg aaaaccaggt 8761 agttatttgt gtttagtaag gaaaacctgt caaataaatc aaatgattaa attatatgtt 8821 ccactgttga atataaattt tatggctatg gggcagagtt tctgtgtata aattagtatg 8881 taaactccat atttatgtat tcatattagt ctttgaaaat gggtctgtcc tccttgtgta 8941 agacagtaac tttacacttc agacagattt tctgtgttat gaaatgtttc agtaaaatat 9001 tgtttactga cctttaaaaa // LOCUS HSZFX1 2088 bp RNA PRI 14-JUN-1991 DEFINITION Human ZFX mRNA for put. transcription activator, isoform 1. ACCESSION X59738 X17312 NID g38019 KEYWORDS sex determination; transcription activator; ZFX gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2088) AUTHORS Schneider-Gadicke,A., Beer-Romero,P., Brown,L.G., Mardon,G., Luoh,S.W. and Page,D.C. TITLE Putative transcription activator with alternative isoforms encoded by human ZFX gene JOURNAL Nature 342 (6250), 708-711 (1989) MEDLINE 90081847 FEATURES Location/Qualifiers source 1..2088 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA 1..2088 /gene="ZFX" /note="alternatively spliced isoform 1" /evidence=experimental gene 1..2088 /gene="ZFX" CDS 192..1919 /gene="ZFX" /note="alternatively spliced isoform 1" /codon_start=1 /product="ZFX product, isoform 1" /db_xref="PID:g38020" /db_xref="SWISS-PROT:P17010" /translation="MTMDTESEIDPCKVDGTCPEVIKVYIFKADPGEDDLGGTVDIVE SEPENDHGVELLDQNSSIRVPREKMVYMTVNDSQPEDEDLNVAEIADEVYMEVIVGEE DAAAAGHAPVHEQQMDDNEIKTFMPIAWAAAYGNNSDGIENRNGTASALLHIDESAGL GRLAKQKPKKRRRPDSRQYQTAIIIGPDGHPLTVYPCMICGKKFKSRGFLKRHMKNHP EHLAKKKYRCTDCDYTTNKKISLHNHLESHKLTSKAEKAIECDECGKHFSHAGALFTH KMVHKEKGANKMHKCKFCEYETAEQGLLNRHLLAVHSKNFPHICVECGKGFRHPSELK KHMRIHTGEKPYQCQYCEYRSADSSNLKTHVKTKHSKEMPFKCDICLLTFSDTKEVQQ HALIHQESKTHQCLHCDHKSSNSSDLKRHIISVHTKDYPHKCDMCDKGFHRPSELKKH VAAHKGKKMHQCRHCDFKIADPFVLSRHILSVHTKDLPFRCKRCRKGFRQQSELKKHM KTHSGRKVYQCEYCEYSTTDASGFKRHVISIHTKDYPHRCEYCKKGFRRPSEKNQHIM RHHKEVGLP" BASE COUNT 647 a 458 c 488 g 495 t ORIGIN 1 cggggagctg ggccgctttt tgtcagctcc gagctcggcc cctcctccct ccctccgccc 61 gccccaccag ccggagcccg gcccagtgct ccagagaaag gccggcctgc agcacccgcc 121 accgtcgcgg ccgcccgcaa cgtccgtccg tggatgatgc tggaaaaata gaacacgatg 181 gttcttctgg aatgaccatg gacacagagt cggaaattga tccttgtaaa gtggatggca 241 cttgccctga ggtcatcaag gtgtacattt ttaaagctga ccctggagaa gatgacttag 301 gtggaactgt agacattgtg gagagtgagc ctgagaatga tcatggagtt gaactgcttg 361 atcagaacag cagtattcgt gttcccaggg aaaagatggt ttatatgact gtcaatgact 421 ctcagccaga agatgaagat ttaaatgttg ctgaaatcgc tgacgaagtt tatatggaag 481 tgatcgtagg agaggaggat gctgcagcag caggacacgc gccggtgcac gagcagcaaa 541 tggatgacaa tgaaatcaaa accttcatgc cgattgcatg ggcagcagct tatggtaata 601 attctgatgg aattgaaaac cggaatggca ctgcaagtgc cctcttgcac atagatgagt 661 ctgctggcct cggcagactg gctaaacaaa aaccaaagaa aaggagaaga cctgattcca 721 ggcagtacca aacagcaata attattggcc ctgatggaca tcctttgact gtctatcctt 781 gcatgatttg tgggaagaag tttaagtcga gaggtttttt gaaaaggcac atgaaaaacc 841 atcccgaaca ccttgccaag aagaaatacc gctgtactga ctgtgattac actaccaaca 901 agaagataag tttacacaac cacctggaga gccacaagct gaccagcaag gcagagaagg 961 ccattgaatg cgatgagtgt gggaagcatt tctctcatgc aggggctttg tttactcaca 1021 aaatggtgca taaggaaaaa ggagccaaca aaatgcacaa gtgtaaattc tgtgaatacg 1081 agacagctga acaagggtta ttgaatcgcc acctcttggc agtccacagc aagaactttc 1141 ctcatatttg tgtggagtgt ggtaagggtt ttcgtcaccc gtcagagctc aaaaagcaca 1201 tgagaatcca tactggggag aagccgtacc aatgccagta ctgcgaatat aggtctgcag 1261 actcttctaa cttgaaaacg catgtcaaaa ctaagcatag taaagagatg ccattcaagt 1321 gtgacatttg tcttctgact ttctcggata ccaaagaggt gcagcaacat gctcttatcc 1381 accaagaaag caaaacacac cagtgtttgc attgcgacca caagagttcg aactcaagtg 1441 atttgaaacg acacataatt tcagttcaca cgaaagacta cccccataag tgtgacatgt 1501 gtgataaagg ctttcacagg ccttcagaac tcaagaaaca cgtggctgcc cacaagggca 1561 aaaaaatgca ccagtgtaga cattgtgact ttaagattgc agatccattt gttctaagtc 1621 gccatattct ctcagttcac acaaaggatc ttccatttag gtgcaagaga tgtagaaagg 1681 gatttaggca acagagtgag cttaaaaagc atatgaagac acacagtggc aggaaagtgt 1741 atcagtgtga gtactgtgag tatagcacta cagatgcctc aggctttaaa cggcacgtta 1801 tttccattca cacgaaagac tatcctcacc ggtgtgagta ctgcaagaaa ggcttccgaa 1861 gaccttcaga aaagaaccag cacataatgc gacatcataa agaagttggc ctgccctaac 1921 aatacttcta cagaacgttt gtagagatat tggccttgaa gcagaaaatt cattttaaag 1981 ccaatcagtc tcattcacat acaatactgt atattgattt atgctgtgta caaatagaat 2041 tattacttct agttgacttt tttttaaata tacattttgc tcagtagt // LOCUS HSZIDMRNA 3495 bp RNA PRI 17-OCT-1994 DEFINITION H.sapiens mRNA for ZID protein. ACCESSION X82018 NID g558598 KEYWORDS POZ domain; zid gene; zinc finger protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3495) AUTHORS Bardwell,V.J. TITLE Direct Submission JOURNAL Submitted (03-OCT-1994) V.J. Bardwell, Imperial Cancer Research Fund, Room 529, 44 Lincolns Inn Fields, London, WC2A 3PX, UK REFERENCE 2 (bases 1 to 3495) AUTHORS Bardwell,V.J. and Treisman,R. TITLE The POZ domain: a conserved protein-protein interaction motif JOURNAL Genes Dev. 8 (14), 1664-1677 (1994) MEDLINE 95047323 FEATURES Location/Qualifiers source 1..3495 /organism="Homo sapiens" /strain="HeLaS3" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="placenta" /cell_line="HelaS3" /clone_lib="Vp16 tagged yeast expression library SD10 and clonetec placental lambda gt11" gene 53..1327 /gene="zid" CDS 53..1327 /gene="zid" /codon_start=1 /product="ZID, zinc finger protein with interaction domain" /db_xref="PID:g558599" /translation="MAAESDVLHFQFEQQGDVVLQKMNLLRQQNLFCDVSIYINDTEF QGHKVILAACSTFMRDQFLLTQSKHVRITILQSAEVGRKLLLSCYTGALEVKRKELLK YLTAASYLQMVHIVEKCTEALSKYLEIDLSMKNNNQHTDLCQSSDPDVKNEDENSDKD CEIIEISEDSPVNIDFHVKEEESNALQSTVESLTSERKEMKSPELSTVDIGFKDNEIC ILHVESISTAGVENGQFSQPCTSSKASMYFSETQHSLINSTVESRVAEVPGNQDQGLF CENTEGSYGTVSEIQNLEEGYSLRHQCPRCPRGFLHVENYLRHLKMHKLFLCLQCGKT FTQKKNLNRHIRGHMGIRPFQCTVCLKTFTAKSTLQDHLNIHSGDRPYKCHCCDMDFK HKSALKKHLTSVHGRSSGEKLSRPDLKRQSLL" BASE COUNT 1072 a 607 c 622 g 1189 t 5 others ORIGIN 1 gcagacgctg ccgtggaatc cttgactcta gttctctgag tcgattgtga tcatggctgc 61 tgagtctgat gttctgcatt tccagtttga acagcaagga gatgtggtct tgcagaaaat 121 gaatcttttg agacagcaga atttattttg tgatgtatca atttacatta atgacactga 181 gttccagggg cacaaggtga ttttggctgc ttgctccact tttatgagag atcagttttt 241 actcacacag tcaaaacatg tcagaatcac catcttacag agtgcagaag ttggcagaaa 301 attgttactg tcttgctata ctggagcact tgaagttaaa aggaaagagc ttttgaaata 361 cttgactgct gccagttacc ttcagatggt tcacattgtg gaaaagtgca cagaagcttt 421 gtcaaagtat ctggaaattg atctttctat gaaaaacaac aaccaacaca ctgacctgtg 481 tcagtcttct gatcctgatg ttaagaatga agatgaaaat tctgataaag actgtgagat 541 aattgaaatt tcagaagata gtcctgtaaa catagatttc catgttaaag aagaggaaag 601 caatgctttg cagtctacag tagagagtct gacatcagag agaaaggaaa tgaagtcacc 661 agagctgtct acagtagaca taggttttaa agacaatgaa atttgtatcc ttcatgtaga 721 atccatcagt acagctggtg tcgaaaatgg gcagttttca cagccttgta cctcttcaaa 781 agcaagcatg tatttctctg aaacacagca ttcattgatc aattctacag ttgagagcag 841 agtggctgaa gttcctggga atcaagatca gggcttattt tgtgagaata ctgaaggaag 901 ttatggtaca gtgagtgaga ttcagaatct ggaggaaggt tattcactga ggcaccagtg 961 ccccaggtgt cctcgaggct ttcttcatgt tgaaaactat ctgcgccacc ttaaaatgca 1021 taaactattc ttatgcttac agtgcggaaa aacatttaca cagaagaaaa atctcaaccg 1081 acacatccga ggacacatgg gcatacggcc ctttcagtgt actgtgtgct tgaagacatt 1141 tactgccaaa agcacacttc aggaccactt gaacatacac agtggggatc ggccatacaa 1201 atgccactgt tgtgatatgg atttcaagca caagtctgct ctcaaaaagc acttaacctc 1261 tgtccatggc agaagcagtg gtgaaaaact atctaggcct gatctcaaaa ggcaaagtct 1321 actataatta taatcacaga cttgttatat aamgtttgta ttattctatt acgcaagtct 1381 ttctgtagag atgcagcatt tgtaatattt catcccccca atctttgttt cttatatttt 1441 gtcggcttac acataatcat tctttctgaa cttctaacag ttccaaagtt atgggaggca 1501 cagtaaagct gatgggattc tgtctcatct cataccttac atataagtct cagtagtatc 1561 cctagagaaa tagtgttcct ctcttatata ttcttgattc tagtgacaaa ggatccagca 1621 gtatttgcag attccgattt ggagtctcta ggtaattgat ttaagtataa aatccaatca 1681 tataaaaact ttaatgaatt aggataacaa gaagagaaaa taaaggtaaa aacactatta 1741 cctgtttcta gtcactttag aacagatggc actaaaaaaa ttttttttta actttttacc 1801 agctgttctt ggattccaaa ctaaacacca agatactttt cattttaaaa tgtataaagt 1861 aacaattcag attcacttgg gaatagcaga atttttttat acagggttaa taatccttaa 1921 aatactttta tttccatgcc ttattagtat acatacttaa gaaatggttc attgggcccg 1981 agtttcttga ttttatttaa tacaatcgga tttaataatt ttgtggccag taaaattata 2041 ccattggcat gaagctgtga aaaatgagaa attctacctt taccagggat atctgttcta 2101 ctgttttaaa agttcctctc tattttattt tttatttaac atgaagtaaa attgaaattt 2161 cttcttggtg tatagttctg tgaattttaa cagaaagatt tgtgcaccac tactgcagtc 2221 ggtacagaac agttccttca ctcccaatag ctctcttatg ctgtcccttt gtagtcagct 2281 ccttctagct ctaatcctgg caacagttct ccgtcactat tattttgcct tttctggact 2341 gtcatataaa cagaatctta tagtacgttc cctcttgaaa tgctttttca cccagcataa 2401 tgactttgag gttcatccat gtcattttaa attttgaatt ggcaatatat attctagctg 2461 ttactagatg agcaatttaa aaaggtgtag ctattagtgt tgtatttttt catttacatt 2521 aatgtaattt ccattcacta tgtgttgata aaaatcttaa ccatccctag agttttatat 2581 ttccaaaaat atgccaaaag gaaaactatg actgtatgat tattttagcg agagtcatat 2641 ttgagatggt gttctgagcc atagttctct aacatctccg tctacatata acacaaaact 2701 tttataaatt attttaacaa agattaaaat atactgcagg gcaaataata ttctctaaaa 2761 cacaggaact gcttttcgta aagaagactg actagatgca tctttctttg aagctgttct 2821 agtcctcctc tagttaactt agctttatgt agaatytcta ctgtatttaa tttytattga 2881 agcccttctg tattttaaaa tgtaatatac tttgtaccat ttataaaaat tgtgtgatga 2941 atgaatcaca tgttcgttgt tataagtgtt aacctttttt tgacaatttt gtgggaaaga 3001 cattctccac tgaattttaa ctttccttga gactttcaag aatgcaggtc tcgtttgatg 3061 aaaaaaccga cttcataacc yaatgttact agttagcgac attgtgctga attctccttg 3121 aactttttaa ctttatatgt aaaataactt tgatcttggt catttgaaag atcatttcca 3181 ggtctaaaat tccatgattt tgtatttaat tctycctgag aacagaagag gggaaagcaa 3241 gagaggacgg ttttgtgtct ttatgaactt tgttacttaa gtactccatt ggagacataa 3301 tttttctaat tagtggatct ttttccactg gaaagtgtga tagctcattg ctaatttttc 3361 attcatagtt cattgcttcc atttgaccta agattggcca tttgttggtg tgaaaatatt 3421 ttcttttcag aaatagttta ctagtgattg aagtactttt cttatttttg cttcctatta 3481 taaaatggcg tgggg // LOCUS HSZNF183 1349 bp RNA PRI 21-JUL-1997 DEFINITION H.sapiens ZNF183 gene. ACCESSION X98253 NID g2274981 KEYWORDS RING finger motif; ZNF183 gene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1349) AUTHORS Frattini,A. TITLE Direct Submission JOURNAL Submitted (03-JUN-1996) A. Frattini, ITBA CNR, Via Ampere 56, I-20131 Milan, ITALY FEATURES Location/Qualifiers source 1..1349 /organism="Homo sapiens" /note="mRNA and DNA" /db_xref="taxon:9606" /clone_lib="genomic X3000.11" /clone_lib="cDNA of infant brain" /clone="genomic t083" /clone="cDNA HIBBB91" /map="Xq25-26" /chromosome="X" 5'UTR 1..210 /gene="ZNF183" gene 1..1349 /gene="ZNF183" CDS 211..1242 /gene="ZNF183" /codon_start=1 /db_xref="PID:e247370" /db_xref="PID:g2274982" /translation="MAEQLSPGKAVDQVCTFLFKKPGRKGAAGRRKRPACDPEPGESG SSSDEGCTVVRPEKKRVTHNPMIQKTRDSGKQKAAYGDLSSEEEEENEPESLGVVYKS TRSAKPVGPEDMGATAVYELDTEKERDAQAIFERSQKIQEELRGKEDDKIYRGINNYQ KYMKPKDTSMGNASSGMVRKGPIRAPEHLRATVRWDYQPDICKDYKETGFCGFGDSCK FLHDRSDYKHGWQIERELDEGRYGVYEDENYEVGSDDEEIPFKCFICRQSFQNPVVTK CRHYFCESCALQHFRTTPRCYVCDQQTNGVFNPAKELIAKLEKHRATGEGGASDLPED PDEDAIPIT" misc_feature 994..1107 /gene="ZNF183" /note="RING finger C3HC4" 3'UTR 1243..1349 /gene="ZNF183" BASE COUNT 349 a 335 c 403 g 262 t ORIGIN 1 cggccgcccg cgagcggcgt gccacgtacg gctccggccg aagcgacggc ctctgctagg 61 gcacaagaga gacgggcgct cgcgtctcgc agtcctcttc cgtcagtgtc ttttgcttcg 121 actcccggcg gagcgcgcaa cgtggagtga cgtgcagggg ccaagtgcaa cccaggcagc 181 cacggctgtt tcggagctca ggactctaaa atggcagagc agctttctcc aggaaaggcg 241 gtggatcagg tgtgcacctt ccttttcaaa aagcctgggc ggaaaggggc tgctggacgc 301 agaaagcgcc cggcctgcga cccagagccc ggagaaagcg gcagcagtag cgacgaaggc 361 tgcactgtgg ttcgaccgga aaagaagcgg gtgacccaca atccaatgat acagaagacc 421 cgtgacagtg gtaaacagaa ggcggcttac ggcgacttga gcagcgaaga ggaagaggaa 481 aatgagcccg agagtctcgg cgtggtttat aaatccaccc gttcggcgaa acccgtggga 541 ccagaggata tgggagcgac agctgtctat gagctggaca cagagaaaga gcgcgatgca 601 caagccatct ttgagcgcag ccagaagatc caggaggagc tgaggggcaa ggaggatgac 661 aagatctatc ggggaatcaa caattatcag aaatacatga agcccaagga tacgtctatg 721 ggcaatgcct cttccgggat ggtgaggaag ggccccatcc gagcgcccga gcatctacgt 781 gccaccgtgc gctgggatta ccagcccgac atctgtaagg actacaaaga gactggcttc 841 tgcggcttcg gagacagctg caaattcctc catgaccgtt cagattacaa gcatgggtgg 901 cagatcgaac gtgagcttga tgagggtcgc tatggtgtct atgaggatga aaactatgaa 961 gtgggaagcg atgatgagga aataccattc aagtgtttca tctgtcgcca gagcttccaa 1021 aacccagttg tcaccaagtg caggcattat ttctgcgaga gctgtgcact gcagcatttc 1081 cgcaccaccc cgcgctgcta tgtctgtgac cagcagacca atggcgtctt caatccagcg 1141 aaagaattga ttgctaaact agagaagcat cgagctacag gagagggtgg tgcttccgac 1201 ttgccagaag accccgatga ggatgcaatt cccattactt aggtttccca taattcttaa 1261 atttaaaaaa taaacgtttt gttcttttgg aagtcttact tcgtgtcctt cctttgtaga 1321 gaaagtgaca gagcaggcgg gtggtgaag // LOCUS HSZNFNPRA 2499 bp RNA PRI 14-DEC-1993 DEFINITION H.sapiens mRNA for zinc finger protein. ACCESSION Z21943 NID g297025 KEYWORDS LAZ-3 gene; zinc finger protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2499) AUTHORS Kerckaert,J.P., Deweindt,C., Tilly,H., Quief,S., Lecocq,G. and Bastard,C. TITLE LAZ3, a novel zinc-finger encoding gene, is disrupted by recurring chromosome 3q27 translocations in human lymphomas JOURNAL Nature Genet. 5 (1), 66-70 (1993) MEDLINE 94035122 REFERENCE 2 (bases 1 to 2499) AUTHORS KERCKAERT,J.P. TITLE Direct Submission JOURNAL Submitted (02-MAR-1993) KERCKAERT J. P., INSERM U.124, Molecular Onco-Hematology, Place de Verdun, LILLE CEDEX, FRANCE, 59045 FEATURES Location/Qualifiers source 1..2499 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /dev_stage="adult" /tissue_type="skeletal muscle" /clone_lib="Clontech" /clone="pM47 and pM55" /chromosome="3" exon <1..35 /gene="LAZ-3" /number=1 gene 1..2205 /gene="LAZ-3" exon 36..245 /gene="LAZ-3" /number=2 CDS 85..2205 /gene="LAZ-3" /function="putative transcription factor" /note="6 C2H2 zinc finger repeats; positions 1642-2127." /codon_start=1 /product="zinc finger protein" /db_xref="PID:g297026" /db_xref="SWISS-PROT:P41182" /translation="MASPADSCIQFTRHASDVLLNLNRLRSRDILTDVVIVVSREQFR AHKTVLMACSGLFYSIFTDQLKCNLSVINLDPEINPEGFCILLDFMYTSRLNLREGNI MAVMATAMYLQMEHVVDTCRKFIKASEAEMVSAIKPPREEFLNSRMLMPQDIMAYRGR EVVENNLPLRSAPGCESRAFAPSLYSGLSTPPASYSMYSHLPVSSLLFSDEEFRDVRM PVANPFPKERALPCDSARPVPGEYSRPTLEVSPNVCHSNIYSPKETIPEEARSDMHYS VAEGLKPAAPSARNAPYFPCDKASKEEERPSSEDEIALHFEPPNAPLNRKGLVSPQSP QKSDCQPNSPTESCSSKNACILQASGSPPAKSPTDPKACNWKKYKFIVLNSLNQNAKP EGPEQAELGRLSPRAYTAPPACQPPMEPENLDLQSPTKLSASGEDSTIPQASRLNNIV NRSMTGSPRSSSESHSPLYMHPPKCTSCGSQSPQHAEMCLHTAGPTFPEEMGETQSEY SDSSCENGAFFCNECDCRFSEEASLKRHTLQTHSDKPYKCDRCQASFRYKGNLASHKT VHTGEKPYRCNICGAQFNRPANLKTHTRIHSGEKPYKCETCGARFVQVAHLRAHVLIH TGEKPYPCEICGTRFRHLQTLKSHLRIHTGEKPYHCEKCNLHFRHKSQLRLHLRQKHG AITNTKVQYRVSATDLPPELPKAC" BASE COUNT 602 a 773 c 614 g 510 t ORIGIN 1 gatgcaagaa gtttctagga aaggccggac accaggtttt gagcaaaatt ttggactgtg 61 aagcaaggca ttggtgaaga caaaatggcc tcgccggctg acagctgtat ccagttcacc 121 cgccatgcca gtgatgttct tctcaacctt aatcgtctcc ggagtcgaga catcttgact 181 gatgttgtca ttgttgtgag ccgtgagcag tttagagccc ataaaacggt cctcatggcc 241 tgcagtggcc tgttctatag catctttaca gaccagttga aatgcaacct tagtgtgatc 301 aatctagatc ctgagatcaa ccctgaggga ttctgcatcc tcctggactt catgtacaca 361 tctcggctca atttgcggga gggcaacatc atggctgtga tggccacggc tatgtacctg 421 cagatggagc atgttgtgga cacttgccgg aagtttatta aggccagtga agcagagatg 481 gtttctgcca tcaagcctcc tcgtgaagag ttcctcaaca gccggatgct gatgccccaa 541 gacatcatgg cctatcgggg tcgtgaggtg gtggagaaca acctgccact gaggagcgcc 601 cctgggtgtg agagcagagc ctttgccccc agcctgtaca gtggcctgtc cacaccgcca 661 gcctcttatt ccatgtacag ccacctccct gtcagcagcc tcctcttctc cgatgaggag 721 tttcgggatg tccggatgcc tgtggccaac cccttcccca aggagcgggc actcccatgt 781 gatagtgcca ggccagtccc tggtgagtac agccggccga ctttggaggt gtcccccaat 841 gtgtgccaca gcaatatcta ttcacccaag gaaacaatcc cagaagaggc acgaagtgat 901 atgcactaca gtgtggctga gggcctcaaa cctgctgccc cctcagcccg aaatgccccc 961 tacttccctt gtgacaaggc cagcaaagaa gaagagagac cctcctcgga agatgagatt 1021 gccctgcatt tcgagccccc caatgcaccc ctgaaccgga agggtctggt tagtccacag 1081 agcccccaga aatctgactg ccagcccaac tcgcccacag agtcctgcag cagtaagaat 1141 gcctgcatcc tccaggcttc tggctcccct ccagccaaga gccccactga ccccaaagcc 1201 tgcaactgga agaaatacaa gttcatcgtg ctcaacagcc tcaatcagaa tgccaaacca 1261 gaggggcctg agcaggctga gctgggccgc ctttccccac gagcctacac ggccccacct 1321 gcctgccagc cacccatgga gcctgagaac cttgacctcc agtccccaac caagctgagt 1381 gccagcgggg aggactccac catcccacaa gccagccggc tcaataacat cgttaacagg 1441 tccatgacgg gctctccccg cagcagcagc gagagccact caccactcta catgcacccc 1501 ccgaagtgca cgtcctgcgg ctctcagtcc ccacagcatg cagagatgtg cctccacacc 1561 gctggcccca cgttccctga ggagatggga gagacccagt ctgagtactc agattctagc 1621 tgtgagaacg gggccttctt ctgcaatgag tgtgactgcc gcttctctga ggaggcctca 1681 ctcaagaggc acacgctgca gacccacagt gacaaaccct acaagtgtga ccgctgccag 1741 gcctccttcc gctacaaggg caacctcgcc agccacaaga ccgtccatac cggtgagaaa 1801 ccctatcgtt gcaacatctg tggggcccag ttcaaccggc cagccaacct gaaaacccac 1861 actcgaattc actctggaga gaagccctac aaatgcgaaa cctgcggagc cagatttgta 1921 caggtggccc acctccgtgc ccatgtgctt atccacactg gtgagaagcc ctatccctgt 1981 gaaatctgtg gcacccgttt ccggcacctt cagactctga agagccacct gcgaatccac 2041 acaggagaga aaccttacca ttgtgagaag tgtaacctgc atttccgtca caaaagccag 2101 ctgcgacttc acttgcgcca gaagcatggc gccatcacca acaccaaggt gcaataccgc 2161 gtgtcagcca ctgacctgcc tccggagctc cccaaagcct gctgaagcat ggagtgttga 2221 tgctttcgtc tccagcccct tctcagaatc tacccaaagg atactgtaac actttacaat 2281 gttcatccca tgatgtagtg cctctttcat ccactagtgc aaatcatagc tgggggttgt 2341 gggtggtggg ggtcggggcc tgggggactg ggagccgcag cagctccccc tcccccactg 2401 ccataaaaca ttaagaaaat catattgctt cttctcctat gtgtaaggtg aaccatgtca 2461 gcaaaaagca aaatcatttt atatgtcaaa gcgggggag // LOCUS HSZNFPT17 2580 bp RNA PRI 24-AUG-1995 DEFINITION H.sapiens mRNA for Zinc-finger protein (ZNFpT17). ACCESSION X65233 NID g505545 KEYWORDS zinc-finger protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2580) AUTHORS Lania,L. TITLE Direct Submission JOURNAL Submitted (17-MAR-1992) L. Lania, Dpt of Genetics, General and Mol Biology, University of Naples, Via Mezzocannone 8, 80134 Naples, ITALY REFERENCE 2 (bases 1 to 2580) AUTHORS Huebner,K., Druck,T., LaForgia,S., Lasota,J., Croce,C.M., Lanfrancone,L., Donti,E., Pengue,G., La Mantia,G., Pelicci,P.G. et,al. TITLE Chromosomal localization of four human zinc finger cDNAs JOURNAL Hum. Genet. 91 (3), 217-222 (1993) MEDLINE 93239177 REFERENCE 3 (bases 1 to 2580) AUTHORS Di Cristofano,A., Strazullo,M., Longo,L. and La Mantia,G. TITLE Characterization and genomic mapping of the ZNF80 locus: expression of this zinc-finger gene is driven by a solitary LTR of ERV9 endogenous retroviral family JOURNAL Nucleic Acids Res. 23 (15), 2823-2830 (1995) MEDLINE 95388494 FEATURES Location/Qualifiers source 1..2580 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T-cell" /cell_line="T-cell lymphoma (PEER)" /chromosome="3p21-3qter" CDS 513..1334 /codon_start=1 /product="zinc-finger protein (ZNFpT17)" /db_xref="PID:g963060" /translation="MSPKRDGLGTGDGLHSQVLQEQVSTGDNLHECDSQGPSKDTLVR EGKTYKCKECGSVFNKNSLLVRHQQIHTGVKPYECQECGKAFPEKVDFVRHMRIHTGE KPCKCVECGKVFNRRSHLLCYRQIHTGEKPYECSECGKTFSYHSVFIQHRVTHTGEKL FGCKECGKTFYYNSSLTRHMKIHTGEKPCKCSECGKTFTYRSVFFRHSMTHTAGKPYE CKECGKGFYYSYSLTRHTRSHTGEKPYECLEHRKDFGYHSAFAQQSKIHSGGKNL" misc_feature 663..1248 /note="zinc-finger domain" BASE COUNT 794 a 548 c 566 g 672 t ORIGIN 1 gaattcgggc cggagccagc agtggcaacc cgctggggtc cctttccaca ctgtggaagc 61 tttgttcttt cgctctttgc aataaatctt gctacttctc actctttggg tccacactgc 121 ttttaccagc tgtaacactc accgcaaacg tctgcagctt cactcctgaa gccagcgaga 181 ccacgagccc accgggagga acgaacaact ccagacgcgc cgccttaagc cttaagagct 241 gtaacactca ccacgaaggt ctgcagcttc actcctgagc cagcgagacc acgaacccgc 301 cagaaggaag aaactccgaa cacatccgaa catcagaagg aacaaactct agacgcgcca 361 ccttaagagc tgtaacactc accacaaggg tccgcggctt cattcttgaa gtcagtgaga 421 ccaagaaccc accaattccg gacacagttg gaaatgcaga agggcacttg agtttccctg 481 cagtctccca cacggacacc ttctggagga agatgagccc taaacgcgat gggttgggga 541 caggtgatgg tctgcactca caggttttac aggagcaggt ctccacagga gacaatctcc 601 atgaatgtga ctcccaggga ccaagtaaag acactttggt tcgtgagggg aagacctaca 661 aatgcaagga atgtgggagc gtgtttaaca aaaacagcct ccttgttcga catcagcaga 721 ttcacactgg ggtgaagcct tatgaatgcc aggagtgtgg aaaagccttt cctgaaaagg 781 tcgacttcgt tcgacacatg aggattcaca caggggagaa gccctgtaag tgcgtggagt 841 gcgggaaggt cttcaaccgc aggtcgcacc tcctgtgcta ccgccagatt cacactggag 901 agaagcccta tgagtgcagc gagtgtggaa agaccttcag ctatcactct gtcttcatcc 961 agcatcgtgt gacccacact ggagaaaaac tctttgggtg caaagaatgt ggaaaaacct 1021 tttactacaa ctcttcctta acccggcaca tgaagattca cactggagag aagccctgca 1081 agtgcagtga gtgcgggaag accttcacct accgctctgt tttcttccga catagtatga 1141 cccacactgc aggaaagccc tacgagtgca aagaatgtgg gaaaggtttt tactacagct 1201 attccctcac tcgacataca aggagtcaca ctggagagaa accttatgag tgccttgaac 1261 atagaaagga ctttggctac cactctgctt ttgcccaaca gagtaagatc cactctggag 1321 gaaaaaacct ttgagtgcaa atgatgtggg aatttgtggg ttttcttttt ttttttttct 1381 ttgatacata gagagccaca ctgaggagaa gcactctgaa ttcagtggca gtaggaaagc 1441 tgtgacctgc agctcatcct cacttggcat tgaagaattc cttccagaga taaactctat 1501 gagatatgtg ggaaggcctt ttgcaagtgg gccctcatca gtcgacgtca gagaacttaa 1561 actgcgaaaa gacattttga caaaaactag tgaagaaaat cttttggcca caggaaatat 1621 cttcctcagc atctgagaat ttttccagga gggagacctg ttggttattg tgcacttaaa 1681 agaaccaccc acacctctct ccatagttta tatcactaag ccacttcagg attttttttt 1741 ttttaagtaa gaagggaact gtaaaagaga aaaaaagaaa ttcatgcata gcttctgtca 1801 gacttctctg aaagaaatgc ctatcaaaat ttgaattcag ttcttcattt tgcagatggc 1861 tgaaagggac gagagaatga tggtgaagct gtgttgcttt cccaggtgag ccttgattat 1921 ttcctctgtg ctttctcagc ttctttggcc ttctgtaaga ggggcttctc caagagcatg 1981 tcactagttt tctagaagtt aaacaagagc taagacaaac aaaacaaaac acagcttatc 2041 taatctaatg agaaactcag aaaaaaatgt tttgatgaat acaatgtatt aaaatataaa 2101 acatgtagat cctttgacat gttaattcaa ttctagaatc ctcaagaaat gtttttatat 2161 tatgcagaga tgtgcgtgca aagaataatc gctggagttt tgtttttatg tgtgaacgtt 2221 gagacaatct aaatgtttat cagtggaagt aagtaggtac actgtctatg aaatactgtg 2281 aaacaataag gtatctctgt atgtactgat atcaaaagat ctctaagata cagtatttag 2341 taaaagagaa agtttcagga aataaaacta catctagaag tataagcata aaactgttaa 2401 tagtggtttt ctgcaggaaa tgggatggac cagggttata tgcaatgttc agtttttact 2461 tttatactta agtattgttt atgtttgaaa aataaaaagc atttattgtt tttgcaataa 2521 atgtttaaaa tatttcaata aagatagcat gttagaaaaa aaaaaaaaaa aaccgaattc // LOCUS HSZYGHOMO 2548 bp mRNA PRI 09-JAN-1998 DEFINITION H.sapiens mRNA for ZYG homologue. ACCESSION X99802 NID g2769561 KEYWORDS ZYG homologue. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2548) AUTHORS Wu,Y.Q., Levy,I., Pawlak,A. and Guellaen,G. TITLE Characterization of a human ZYG homologue JOURNAL Unpublished REFERENCE 2 (bases 1 to 2548) AUTHORS Guellaen,G. TITLE Direct Submission JOURNAL Submitted (06-AUG-1996) G. Guellaen, Unite Inserm 99, Hopital Henri Mondor, Creteil, 94010, FRANCE REMARK Revised by author 09-JAN-1998 FEATURES Location/Qualifiers source 1..2548 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="testis" /dev_stage="adult" CDS 39..2339 /codon_start=1 /product="ZYG homologue" /db_xref="PID:e1231236" /db_xref="PID:g2769562" /translation="MASDTPESLMALCTDFCLRNLDGTLGYLLDKETLRLHPDIFLPS EICDRLVNEYVELVNAACNFEPHESFFSLFSDPRSTRLTRIHLREDLVQDQDLEAIRK QDLVELYLTNCEKLSAKSLQTLRSFSHTLVSLSLFGCTNIFYEEENPGGCEDEYLVNP TCQVLVKDFTFEGFSRLRFLNLGRMIDWVPVESLLRPLNSLAALDLSGIQTSDAAFLT QWKDSLVSLVLYNMDLSDDHIRVIVQLHKLGHLDISRDRLSSYYKFKLTREVLSLFVQ KLGNLMSLDISGHMILENCSISKMEEEAGQTSIEPSKSSIIPFRALKRPLQFLGLFEN SLCRLTHIPAYKVSGDKNEEQVLNAIEAYTEHRPEITSRAINLLFDIARIERCNQLLR ALKLVITALKCHKYDRNIQVTGSALLFYLTNSEYRSEQSVKLRRQVIQVVLNGMESYQ EVTVQRNCCLTLCNFSIPEELEFQYRRVNELLLSILNPTRQDESIQRIAVHLCNALVC QVDNDHKEAVGKMGFVVTMLKLIQKKLLDKTCDQVMEFSWSALWNITDETPDNCEMFL NFNGMKLFLDCLKEFPEKQELIRNMLGLLGNVAEVKELRPQLMTSQFISVFSNLLESK ADGIEVSYNACGVLSHIMFDGPEAWGVCKPQREEVEERMWAAIQSWDINSRRNINYRS FEPILRLLPQGISPVSQHWATWALYNLVSVYPDKYCPLLIKEGGMPLLRDIIKMATAR QETKEMARKVIEHCSNFKEENMDTSR" BASE COUNT 576 a 741 c 700 g 531 t ORIGIN 1 atccttgtcc tggagtggcc cacctgcttg cccccagcat ggcgtccgac actcccgagt 61 cgctgatggc cctctgtact gacttctgct tgcgcaacct ggatggcacc ctgggctacc 121 tgctggacaa ggagaccctg cggctacatc cggacatctt cttgcccagc gagatctgtg 181 accggctcgt caatgagtat gtggagctgg tgaacgctgc ctgtaacttc gagccacacg 241 agagcttctt cagcctcttt tcggaccccc gcagcacccg cctcacgcgg atccacctcc 301 gtgaggacct ggtgcaggac caggacctgg aggccatccg caagcaggac ctggtggagc 361 tgtacctgac taactgcgag aagctgtccg ccaagagcct gcagacactg aggagcttca 421 gccacaccct ggtgtccttg agcctcttcg gctgtacaaa cattttctat gaggaggaga 481 acccaggggg ctgtgaagat gagtacctcg tcaaccccac ctgccaggtg ctggttaagg 541 atttcacctt cgagggcttc agccgcctcc gcttcctcaa cttgggccgc atgattgatt 601 gggtccctgt ggagtccctg ctgcggccgc ttaactccct ggctgccttg gacctctcag 661 gcattcagac gagcgacgcc gccttcctca cccagtggaa agacagcctg gtgtccctcg 721 tcctctacaa catggacctg tccgacgacc acatccgggt catcgtgcag ctgcacaagc 781 tgggacacct ggacatctcc cgagaccgcc tctccagcta ctacaagttc aagctgactc 841 gggaggtgct gagcctcttt gtgcagaagc tggggaacct aatgtccctg gacatctctg 901 gccacatgat cctagagaac tgcagcatct ccaagatgga agaggaagcg gggcagacca 961 gcattgagcc ttccaagagc agcatcatac ctttccgggc tctgaagagg ccgctgcagt 1021 tcctcgggct ctttgagaac tctctgtgcc gcctcacgca cattccagcc tacaaagtaa 1081 gtggtgacaa aaacgaagag caggtgctga atgccatcga ggcctacacg gagcaccggc 1141 ctgagatcac ctcgcgggcc atcaacttgc tttttgacat cgcccgcatc gagcgttgca 1201 accagctgct gcgggccctg aagctggtca tcacggccct caagtgccac aaatatgaca 1261 ggaacattca agtgacaggc agcgcgcttc tcttctacct aacaaattcc gagtaccgct 1321 cagagcagag tgtgaagctg cgccggcagg ttatccaggt ggtgctgaat ggcatggaat 1381 cctaccagga ggtgacggtg cagcggaact gctgcctgac gctctgcaac ttcagcatcc 1441 ccgaggagct ggaattccag taccgccggg tcaacgagct cctgctcagc atcctcaacc 1501 ccacgcggca ggacgagtct atccagcgga tcgccgtgca cctgtgcaat gccctggtct 1561 gccaggtaga caacgaccac aaggaggccg tgggcaagat gggctttgtc gtgaccatgc 1621 tgaagctgat tcagaagaag ctgctggaca agacatgtga ccaggtcatg gagttctcct 1681 ggagtgccct gtggaacatc acagatgaaa ctcctgacaa ctgcgagatg ttcctcaatt 1741 tcaacggcat gaagctcttc ctggactgcc tgaaggaatt cccagagaag caggaactca 1801 ttaggaatat gctaggactt ttggggaatg tggcagaagt gaaggagctg aggcctcaac 1861 taatgacttc ccagttcatc agcgtcttca gcaacctgtt ggagagcaag gccgatggga 1921 tcgaggtttc ctacaatgcc tgcggcgtcc tctcccacat catgtttgat ggacccgagg 1981 cctggggcgt ctgtaagccc cagcgtgagg aggtggagga acgcatgtgg gctgccatcc 2041 agagctggga cataaactct cggagaaaca tcaattacag gtcatttgaa ccaattctcc 2101 gcctccttcc ccagggaatc tctcctgtca gccagcactg ggcaacctgg gccctgtata 2161 acctcgtgtc tgtctacccg gacaagtact gccctctgct gatcaaagaa ggggggatgc 2221 cccttctgag ggacataatt aagatggcga ccgcacggca ggagaccaag gaaatggccc 2281 gcaaggtgat tgagcactgc agtaacttta aagaggagaa catggacacg tctagataga 2341 ggcctccgtc cccatggccg ccaccgctct ggaccacagg cggggaggaa gcatgctcaa 2401 gcagcccagc ggggggcccc ttccgaggga gcctcccacg gagtaaggag acatggggga 2461 cttttgcaca accgacgctt ttccttaatg ttagtgagat atatatatat tatatatata 2521 tatttttttt ttggttagga agtgtgaa // LOCUS HSZYX 2166 bp RNA PRI 19-DEC-1996 DEFINITION H.sapiens mRNA for Zyxin. ACCESSION X94991 NID g1155087 KEYWORDS zyx gene; zyxin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2166) AUTHORS Beckerle,M.C. TITLE Direct Submission JOURNAL Submitted (11-JAN-1996) M.C. Beckerle, University of Utah, Department of Biology, 201 South Biology Building, Salt Lake City, Utah, 84112, USA REFERENCE 2 (bases 1 to 2166) AUTHORS Macalma,T., Otte,J., Hensler,M.E., Bockholt,S.M., Louis,H.A., Kalff-Suske,M., Grzeschik,K.H., von der Ahe,D. and Beckerle,M.C. TITLE Molecular characterization of human zyxin JOURNAL J. Biol. Chem. 271 (49), 31470-31478 (1996) MEDLINE 97094926 FEATURES Location/Qualifiers source 1..2166 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="endothelial" /clone_lib="cDNA library" /tissue_type="umbilical vein" gene 11..1729 /gene="zyx" CDS 11..1729 /gene="zyx" /codon_start=1 /evidence=not_experimental /product="zyxin" /db_xref="PID:e218260" /db_xref="PID:g1155088" /translation="MAAPRPSPAISVSVSAPAFYAPQKKFGPVVAPKPKVNPFRPGDS EPPPAPGAQRAQMGRVGEIPPPPPEDFPLPPPPLAGDGDDAEGALGGAFPPPPPPIEE SFPPAPLEEEIFPSPPPPPEEEGGPEAPIPPPPQPREKVSSIDLEIDSLSSLLDDMTK NDPFKARVSSGYVPPPVATPFSSKSSTKPAAGGTAPLPPWKSPSSSQPLPQVPAPAQS QTQFHVQPQPQPKPQVQLHVQSQTQPVSLANTQPRGPPASSPAPAPKFSPVTPKFTPV ASKFSPGAPGGSGSQPNQKLGHPEALSAGTGSPQPPSFTYAQQREKPRVQEKQHPVPP PAQNQNQVRSPGAPGPLTLKEVEELEQLTQQLMQDMEHPQRQNVAVNELCGRCHQPLA RAQPAVRALGQLFHIACFTCHQCAQQLQGQQFYSLEGAPYCEGCYTDTLEKCNTCGEP ITDRMLRATGKAYHPHCFTCVVCARPLEGTSFIVDQANRPHCVPDYHKQYAPRCSVCS EPIMPEPGRDETVRVVALDKNFHMKCYKCEDCGKPLSIEADDNGCFPLDGHVLCRKCH TARAQT" polyA_signal 2132..2137 /evidence=not_experimental polyA_site 2154 /evidence=not_experimental BASE COUNT 410 a 789 c 573 g 394 t ORIGIN 1 cggcccggcc atggcggccc cccgcccgtc tcccgcgatc tccgtttcgg tctcggctcc 61 ggctttttac gccccgcaga agaagttcgg ccctgtggtg gccccaaagc ccaaagtgaa 121 tcccttccgg cccggggaca gcgagcctcc cccggcaccc ggggcccagc gcgcacagat 181 gggccgggtg ggcgagattc ccccgccgcc cccggaagac tttcccctgc ctccacctcc 241 ccttgctggg gatggcgacg atgcagaggg tgctctggga ggtgccttcc cgccgccccc 301 tcccccgatc gaggaatcat ttccccctgc gcctctggag gaggagatct tcccttcccc 361 gccgcctcct ccggaggagg agggagggcc tgaggccccc ataccgcccc caccacagcc 421 cagggagaag gtgagcagta ttgatttgga gatcgactct ctgtcctcac tgctggatga 481 catgaccaag aatgatcctt tcaaagcccg ggtgtcatct ggatatgtgc ccccaccagt 541 ggccactcca ttcagttcca agtccagtac caagcctgca gccgggggca cagcacccct 601 gcctccttgg aagtcccctt ccagctccca gcctctgccc caggttccgg ctccggctca 661 gagccagaca cagttccatg ttcagcccca gccccagccc aagcctcagg tccaactcca 721 tgtccagtcc cagacccagc ctgtgtcttt ggctaacacc cagccccgag ggcccccagc 781 ctcatctccg gctccagccc ctaagttttc tccagtgact cctaagttta ctcctgtggc 841 ttccaagttc agtcctggag ccccaggtgg atctgggtca caaccaaatc aaaaattggg 901 gcaccccgaa gctctttctg ctggcacagg ctcccctcaa cctcccagct tcacctatgc 961 ccagcagagg gagaagcccc gagtgcagga gaagcagcac cccgtgcccc caccggctca 1021 gaaccaaaac caggtgcgct cccctggggc cccagggccc ctgactctga aggaggtgga 1081 ggagctggag cagctgaccc agcagctaat gcaggacatg gagcatcctc agaggcagaa 1141 tgtggctgtc aacgaactct gcggccgatg ccatcaaccc ctggcccggg cgcagccagc 1201 cgtccgcgct ctagggcagc tgttccacat cgcctgcttc acctgccacc agtgtgcgca 1261 gcagctccag ggccagcagt tctacagtct ggagggggcg ccgtactgcg agggctgtta 1321 cactgacacc ctggagaagt gtaacacctg cggggagccc atcactgacc gcatgctgag 1381 ggccacgggc aaggcctatc acccgcactg cttcacctgt gtggtctgcg cccgccccct 1441 ggagggcacc tccttcatcg tggaccaggc caaccggccc cactgtgtcc ccgactacca 1501 caagcagtac gccccgaggt gctccgtctg ctctgagccc atcatgcctg agcctggccg 1561 agatgagact gtgcgagtgg tcgccctgga caagaacttc cacatgaagt gttacaagtg 1621 tgaggactgc gggaagcccc tgtcgattga ggcagatgac aatggctgct tccccctgga 1681 cggtcacgtg ctctgtcgga agtgccacac tgctagagcc cagacctgag tgaggacagg 1741 ccctcttcag accgcagtcc atgccccatt gtggaccacc cacactgaga ccacctgcgc 1801 ccacctcagt tattgttttg atgtctagcc cctcccattt ccaacccctc cctagcatcc 1861 caggtgccct gacccaggac ccaacatggt ctagggatgc aggatccccg ccctggggtc 1921 tggtcctcgc ccatcctgca gggattgccc accgtcttcc agacacccca cctgaggggg 1981 gcaccaggtt tagtgctgct gctttcactg ctgcacccgc gccctcggcc ggccccccga 2041 gcagcctttg tactctgctt gcggagggct gggagaccct ccaggacatt cccaccctcc 2101 cccatgctgc caagttgtag ctatagctac aaataaaaaa aaaccttgtt ttccaaaaaa 2161 aaaaaa // LOCUS HTLV1RES 2151 bp DNA PRI 12-SEP-1993 DEFINITION Human HTLV-I related endogenous retroviral sequence (HRES-1/1). ACCESSION X16660 NID g38034 KEYWORDS endogenous retrovirus; long terminal repeat. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2151) AUTHORS Perl,A. TITLE Direct Submission JOURNAL Submitted (23-AUG-1989) Perl A., Roswell Park Memorial Institute, 666 Elm Street, Buffalo, NY 14263, USA REFERENCE 2 (bases 1 to 2151) AUTHORS Perl,A., Rosenblatt,J.D., Chen,I.S., DiVincenzo,J.P., Bever,R., Poiesz,B.J. and Abraham,G.N. TITLE Detection and cloning of new HTLV-related endogenous sequences in man JOURNAL Nucleic Acids Res. 17 (17), 6841-6854 (1989) MEDLINE 89386040 COMMENT Data kindly reviewed (08-JAN-1990) by Perl A. FEATURES Location/Qualifiers source 1..2151 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T lymphocyte" /cell_line="MA-T cell line" /clone_lib="lambda DASH" /clone="HRES-1/1" repeat_unit 310..313 /note="inverted repeat A" misc_feature 310..993 /note="put. retroviral LTR" repeat_region 370..380 /note="direct repeat 1" misc_feature 463..470 /note="TAR sequence (trans-activation region)" repeat_region 470..479 /note="direct repeat 2" repeat_region 529..538 /note="direct repeat 2" repeat_region 663..673 /note="direct repeat 1" promoter 782..788 /note="TATA-box" misc_feature 839..844 /note="pot. polyA signal" repeat_unit 990..993 /note="inverted repeat A'" misc_feature 997..1014 /note="tRNA primer binding site" CDS 1144..1815 /note="open reading frame p25 (AA 1-223)" /codon_start=1 /db_xref="PID:g38035" /translation="MRCAHAPAPRTRYPTRAPSGPRPPSRSQAQTPPRSVPRLRPRHR HPQDPRSPGPAPRHRRPPRPDPRAPPARASYRRFRTWPSATSWERRRLSPGHRALARG PPARLGGEGPGAGDRRREGPDRSPRQPPVLPAAAAQPDSSSAQAPGPSTLRPAATARR KRRWATRGPAHPAFARAHGEAGAGRVRTSARAGSTCAGWALWRCALRWAERQVGALGA ESRFP" CDS 1383..1799 /note="open reading frame p15 (AA 1-138)" /codon_start=1 /db_xref="PID:g38036" /translation="MAVCDILGAAPPLAGSPCTRPRSAGASRWGRTWGGGPEEGGARP VPTATAGPPSGCGPAGLQLRAGARALHPPSGRDREEEEEMGYARPGPPRVRACARGGR GGAREDFGARRKHVRGLGALAVCAEVGRAAGGGVGG" BASE COUNT 512 a 629 c 605 g 405 t ORIGIN 1 cttacaagac aaactgacgc cttcctcaag accctctcct ctctgcctcc tcatccacgc 61 tctccactgt actcaaaaca cacccttctt gagttaccct tcccacaggc ctgagatgtt 121 ctgcgtgctg cctgttaatg cctccctgac acccctgtct ggtcatcttt ctttctaaca 181 cactttcacg gaccacactc cttcagaact cttgcctggt ttttaattat tcactcatta 241 gtatcacaat tggattaata ctgcattccc tcaatagact acgctccatg agaccagagc 301 cgtgtctgct gtgtttctgc aggatgtaca gccctcccag cacggtgctc gcttccctaa 361 tgccattcca cagtatttgt tgcagagaag gaaggcagca tgccaggaca gatggagaca 421 ggactaattt ggcctgaggt atgtaatttt gaacttgagc ctctctctgg gactgtaaaa 481 ctccaaatca aagctaatct gagaatacat acatctgaaa gatgattagg actgtaaaca 541 tctattaata ttcaactctg atacaaagta caagttgttt attcttacca cgcaaggcca 601 aaaaagggga gaaaaaaaaa aaagcacaca gcatatgcac tggaaagttt cgcttattca 661 aaacagtatt tgtcaagcac ctccagtctg gtgctgcagg ggaaacaaag attaaacagc 721 caggcggaca ctgctctgct tccaaggtgc ttacggtctt aagaaggaga caagacatgt 781 ttataaatag ccaaaatgca acccagaaaa ggctaaaaaa cactgagagg gagggagaaa 841 taaacgaagc aagaggtctc cggaggaaga gatgaatgaa ttagcctatt aataactccg 901 tcactgtaat cccaatgtaa agcaagaatt ccaaaccagg aaaggtcaaa ctgaagtatt 961 tgaggaacac aggcgtcgcc taagcccttc acaggctcgc tggggataat atcgcagctg 1021 aaaccgaaca tcaagctggc ttttcaaacc ttcagttatt tacattagac tgagagggga 1081 aaagcgctaa gagtgcaaga gcagagaaag agcaagagga aacttgaaaa ggcggatcac 1141 gcaatgcgct gtgcacacgc ccccgccccg cgcacccgct accccacccg agcccccagc 1201 ggcccgcgtc ccccatcccg gtcccaagcc cagacacctc cgagatccgt cccgcggctc 1261 cggccgcgcc accgccaccc tcaagacccc cgcagcccag gcccggcccc gcgccaccga 1321 cgccccccgc ggcccgaccc gcgcgccccg ccagcccggg cctcgtaccg taggtttcgg 1381 acatggccgt ctgcgacatc ttgggagcgg cgccgcctct cgccgggtca ccgtgcactc 1441 gcccgcggtc cgccggcgcg tctcggtggg gaaggacctg gggcggggga ccggaggagg 1501 gaggggcccg accggtcccc acggcaaccg ccggtcctcc cagcggctgc ggcccagccg 1561 gactccagct ccgcgcaggc gccagggccc tccaccctcc gtccggccgc gaccgcgagg 1621 aggaagagga gatgggctac gcgaggcccg gcccaccccg cgttcgcgcg tgcgcacggg 1681 gaggccgggg cggggcgcgt gaggacttcg gcgcgcgccg gaagcacgtg cgcgggctgg 1741 gcgctctggc ggtgtgcgct gaggtgggca gagcggcagg tgggggcgtt gggggctgag 1801 tcccgatttc cctgagggag ggtcgggtag aggcgggcgg tgggcaggtt tgggggtgac 1861 agagggctgg ggacagtggg gtccagttgc cggacagagg aggaaggtgc ccgcactggg 1921 gaggaaagca gtccatttgc caaattggcc cgtctcagtt aagacgttgt cttcggtcat 1981 catctgcggc tgtcagccag gaaaaaactt ccctgacgct gttacgatgg aggccagaac 2041 ttggttaatg tgtaacaagg aggcagtagg ccccaggtgt ccagccagag gccgccctgt 2101 gaatgggagg caggttcatt tacccgttgg acccgttggg aagctaagct t // LOCUS HUAC002302 259894 bp DNA PRI 17-DEC-1997 DEFINITION Homo sapiens Chromosome 16 BAC clone CIT987-SKA-345G4 ~complete genomic sequence, complete sequence. ACCESSION AC002302 NID g2576341 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 259894) AUTHORS Adams,M.D., Loftus,B.J., Zhou,L., LaBombard,M., Fuhrmann,J., Brandon,R., Kim,U.J., Kerlavage,A.R. and Venter,J.C. TITLE Homo sapiens Chromosome 16 BAC clone CIT987-SKA-345G4 #complete genomic sequence JOURNAL Unpublished REFERENCE 2 (bases 1 to 259894) AUTHORS Adams,M.D. and Loftus,B.J. TITLE Direct Submission JOURNAL Submitted (19-JUN-1997) The Institute for Genomic Research, 9712 Medical Center Dr., Rockville, MD 20850, USA, Email: mdadams@tigr.org REFERENCE 3 (bases 1 to 259894) AUTHORS Adams,M.D. TITLE Direct Submission JOURNAL Submitted (30-OCT-1997) The Institute for Genomic Research, 9712 Medical Center Dr., Rockville, MD 20850, USA REFERENCE 4 (bases 1 to 259894) AUTHORS Adams,M.D. TITLE Direct Submission JOURNAL Submitted (17-DEC-1997) The Institute for Genomic Research, 9712 Medical Center Dr., Rockville, MD 20850, USA COMMENT Address all correspondence to: Mark Adams The Institute for Genomic Research 9712 Medical Center Dr, Rockville, MD 20850, USA e-mail address: mdadams@tigr.org. The bac location is on chromosome BAC clone is located on human chromosome 16p12.2-p12 . The orientation of the sequence is from SP6 end to T7 end. Genes were identified by a combination of five methods including: XGRAIL (available by anonymous ftp from arthur.epm.ornl.gov), Genefinder (Phil Green, University of Washington), Genscan (Chris Burge, http://gnomic.stanford.Edu/~chris/GENSCANW.html ) searches of the complete sequence against a peptide database, and the Human gene Index database at TIGR (http://www.tigr.org/tdb/hg i/hgi.html). A gene with homolgy to another protein is annotated as the isolog of that protein. Genes without pepetide homolgy having spliced EST hits are termed 'u nknown protein'. Genes encoding tRNAs are predicted by tRNAscan-SE (Sean Eddy, http://genome.wustl.edu/eddy/tRNAscan-SE/). FEATURES Location/Qualifiers source 1..259894 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" /map="16p12.2-p12" /clone="A-345G4" repeat_region complement(687..988) /rpt_family="AluSp" STS 782..899 /db_xref="dbSTS:G02122" repeat_region complement(989..1290) /rpt_family="AluJo" mRNA join(2534..2600,3145..3217,3331..3411,3547..3677, 3872..3933,4686..4808,5015..5206) /gene="A-345G4" gene 2534..5206 /gene="A-345G4" CDS join(2534..2600,3145..3217,3331..3411,3547..3677, 3872..3933,4686..4808,5015..5068) /gene="A-345G4" /codon_start=1 /product=" Unknown protein product with similarity to Calcineurin B subunit" /db_xref="PID:g2695572" /translation="MGSRSSHAAVIPDGDSIRRETGFSQASLLRLHHRFRALDRNKKG YLSRMDLQQIGALAVNPLGDRIIESFFPDGSQRVDFPGFVRVLAHFRPVEDEDTETQD PKKPEPLNSRRNKLHYAFQLYDLDRDGKISRHEMLQVLRLMVGVQVTEEQLENIADRT VQEADEDGDGAVSFVEFTKSLEKMDVEQKMSIRILK" repeat_region 4139..4358 /rpt_family="MER20" repeat_region complement(5235..5705) /rpt_family="MER1A" repeat_region complement(6635..6810) /rpt_family="MER63A" repeat_region complement(6847..6926) /rpt_family="(GAAA)n" repeat_region complement(7198..7495) /rpt_family="AluJb" repeat_region 7784..8074 /rpt_family="AluSx" repeat_region 8081..8116 /rpt_family="(CAA)n" repeat_region complement(8463..8627) /rpt_family="AluJo/FRAM" repeat_region complement(8630..8800) /rpt_family="AluSg/x" repeat_region complement(8801..9102) /rpt_family="AluSx" repeat_region complement(9103..9165) /rpt_family="AluJ/FLAM" repeat_region 9822..9854 /rpt_family="AT_rich" repeat_region 9976..10072 /rpt_family="L1MA5" repeat_region 10074..10366 /rpt_family="AluSq" repeat_region complement(10541..10756) /rpt_family="AluJo" repeat_region 11513..11855 /rpt_family="MER7A" repeat_region complement(11984..12277) /rpt_family="AluSx" repeat_region 12403..12531 /rpt_family="MIR" repeat_region complement(12806..13105) /rpt_family="AluSx" repeat_region complement(13176..13463) /rpt_family="L1PA13" STS 13594..13721 /db_xref="dbSTS:Z16998" repeat_region complement(13624..13660) /rpt_family="(CA)n" repeat_region 14128..14399 /rpt_family="AluSx" repeat_region 14814..15051 /rpt_family="AluJo" repeat_region 15054..15354 /rpt_family="AluY" repeat_region 15372..15647 /rpt_family="AluJb" repeat_region 15740..15760 /rpt_family="AT_rich" repeat_region 15891..15925 /rpt_family="AT_rich" repeat_region 15929..16230 /rpt_family="AluSx" repeat_region 16528..16609 /rpt_family="LTR16A" repeat_region complement(16655..16889) /rpt_family="AluJb" repeat_region 16911..17120 /rpt_family="LTR16B" repeat_region complement(17675..17995) /rpt_family="AluJo" repeat_region 18089..18162 /rpt_family="(TAAAA)n" repeat_region complement(18350..18611) /rpt_family="AluJo" repeat_region 18632..18852 /rpt_family="AluSq" repeat_region complement(19349..19709) /rpt_family="MER54" repeat_region complement(19713..20003) /rpt_family="AluSq" repeat_region complement(20006..20080) /rpt_family="MER54" repeat_region 20189..20317 /rpt_family="(TA)n" repeat_region complement(20354..20480) /rpt_family="(TAAA)n" repeat_region complement(20532..20654) /rpt_family="MIR" repeat_region complement(20964..21133) /rpt_family="LINE2" repeat_region 21137..21425 /rpt_family="AluJo" repeat_region complement(21453..21711) /rpt_family="LINE2" repeat_region complement(22022..22309) /rpt_family="AluSx" repeat_region complement(22924..23199) /rpt_family="AluSg" repeat_region complement(23202..23228) /rpt_family="AT_rich" repeat_region complement(23719..24017) /rpt_family="AluJo" repeat_region 24067..24363 /rpt_family="MLT1F" repeat_region 24424..24570 /rpt_family="MLT1F" repeat_region complement(24590..24614) /rpt_family="AT_rich" repeat_region complement(24615..26749) /rpt_family="L1PA2" STS 24956..25087 /db_xref="dbSTS:G19948" STS 25676..25846 /db_xref="dbSTS:G28812" STS 25984..26161 /db_xref="dbSTS:G02335" repeat_region 26949..27253 /rpt_family="AluSx" repeat_region 27511..27682 /rpt_family="MIR" repeat_region 27821..28140 /rpt_family="AluSp" repeat_region 28142..28315 /rpt_family="MER58A" repeat_region complement(28645..28945) /rpt_family="AluJb" repeat_region complement(29305..29600) /rpt_family="AluSx" repeat_region complement(29716..29828) /rpt_family="FLAM_A" repeat_region complement(30570..30897) /rpt_family="AluSx" repeat_region complement(30912..30962) /rpt_family="Alu" repeat_region 31185..31485 /rpt_family="AluJo" repeat_region 31486..31538 /rpt_family="(CAAAA)n" repeat_region complement(31568..31850) /rpt_family="L1PA2" repeat_region 31826..31925 /rpt_family="L1PA3" repeat_region complement(31929..32153) /rpt_family="L1MD2" repeat_region 32178..32229 /rpt_family="MER35" repeat_region complement(32929..33011) /rpt_family="MIR" repeat_region complement(33022..33087) /rpt_family="MLT1F" repeat_region complement(33090..33380) /rpt_family="AluJb" repeat_region complement(33402..33521) /rpt_family="FLAM_A" repeat_region complement(33904..34060) /rpt_family="MIR" repeat_region complement(35407..35520) /rpt_family="MER41B" repeat_region 35554..35851 /rpt_family="AluSq" repeat_region 36249..36302 /rpt_family="LINE2" repeat_region 37240..37395 /rpt_family="MIR" repeat_region 37411..37714 /rpt_family="AluJb" repeat_region 37923..38216 /rpt_family="AluJb" repeat_region 38369..38505 /rpt_family="MER49" repeat_region complement(38968..39193) /rpt_family="AluJo" repeat_region complement(39227..39534) /rpt_family="AluJb" repeat_region complement(40399..40700) /rpt_family="AluJo" repeat_region complement(41017..41318) /rpt_family="AluSx" repeat_region complement(41593..41880) /rpt_family="AluJo" repeat_region 42049..42292 /rpt_family="AluSq" repeat_region 42296..42349 /rpt_family="MER21B" repeat_region complement(42368..42508) /rpt_family="L1MC/D" repeat_region 42510..42753 /rpt_family="LTR8" repeat_region complement(42900..43209) /rpt_family="AluSx" repeat_region 43303..43595 /rpt_family="LTR8" repeat_region 43978..44336 /rpt_family="MER79" repeat_region complement(44337..44637) /rpt_family="AluJb" repeat_region 45809..46108 /rpt_family="AluSx" repeat_region complement(46146..46429) /rpt_family="AluJo" repeat_region 46828..47124 /rpt_family="AluSp" repeat_region complement(47232..47362) /rpt_family="L1P1" repeat_region complement(47429..47473) /rpt_family="MIR" repeat_region complement(47773..48072) /rpt_family="AluSg" repeat_region complement(48101..48167) /rpt_family="MIR" repeat_region 48180..48573 /rpt_family="MLT1C" repeat_region complement(48592..48726) /rpt_family="AluJo" repeat_region complement(48727..49023) /rpt_family="AluSx" repeat_region 49044..49095 /rpt_family="MLT1B" repeat_region complement(49101..49177) /rpt_family="MIR" repeat_region complement(49425..49576) /rpt_family="POLY_G" repeat_region complement(49583..49879) /rpt_family="AluSg" repeat_region 49937..50016 /rpt_family="POLY_A" repeat_region 50268..50578 /rpt_family="AluJo" repeat_region complement(50905..50956) /rpt_family="AT_rich" repeat_region complement(51485..51601) /rpt_family="L1MC/D" repeat_region complement(51727..51790) /rpt_family="(GGA)n" repeat_region complement(52354..52492) /rpt_family="(TA)n" repeat_region complement(52516..52800) /rpt_family="AluSx" repeat_region complement(52801..52828) /rpt_family="AT_rich" repeat_region 52830..52907 /rpt_family="AluSp" repeat_region complement(52923..52951) /rpt_family="AT_rich" repeat_region complement(52954..53252) /rpt_family="AluJb" STS 53481..53604 /db_xref="dbSTS:G17958" repeat_region complement(53581..53869) /rpt_family="L1ME" repeat_region complement(53877..54162) /rpt_family="AluSx" repeat_region complement(54180..54435) /rpt_family="AluSq" repeat_region 54559..54685 /rpt_family="L1MC/D" repeat_region complement(55307..55602) /rpt_family="AluSx" repeat_region 55789..56087 /rpt_family="AluJb" repeat_region 56352..56499 /rpt_family="L1PA10" repeat_region 56508..56806 /rpt_family="AluSx" repeat_region complement(57425..57604) /rpt_family="AluJ" repeat_region complement(57636..57759) /rpt_family="AluJ" repeat_region complement(58903..58954) /rpt_family="(CAGA)n" repeat_region complement(58992..59112) /rpt_family="(GA)n" repeat_region complement(59115..59411) /rpt_family="AluSx" repeat_region 59769..59903 /rpt_family="AluSq/x" repeat_region 59904..60201 /rpt_family="AluSq" repeat_region complement(60688..60747) /rpt_family="(GAAAA)n" repeat_region complement(60749..61049) /rpt_family="AluSx" repeat_region 61391..61651 /rpt_family="L1MA2" repeat_region 61675..61994 /rpt_family="L1MA3" repeat_region complement(62008..62714) /rpt_family="L1MD2" repeat_region complement(62741..63851) /rpt_family="L1MB2" repeat_region complement(63859..64072) /rpt_family="L1MC/D" repeat_region complement(64074..64367) /rpt_family="AluJo" repeat_region complement(64368..64673) /rpt_family="L1MC/D" repeat_region complement(64938..65237) /rpt_family="AluSx" repeat_region complement(65319..65390) /rpt_family="MER2" repeat_region 65394..65693 /rpt_family="AluJo" repeat_region complement(65696..65886) /rpt_family="MER2" repeat_region complement(66243..66290) /rpt_family="(GA)n" repeat_region 66593..66889 /rpt_family="AluJb" repeat_region 67347..67664 /rpt_family="AluSx" repeat_region 67665..67931 /rpt_family="AluSq" STS 67966..68123 /db_xref="dbSTS:G17919" repeat_region complement(68228..68522) /rpt_family="AluJo" repeat_region complement(68535..68841) /rpt_family="AluSx" repeat_region 69654..70083 /rpt_family="L1ME3" repeat_region complement(70147..70457) /rpt_family="AluJo" repeat_region 70579..70649 /rpt_family="(TAAA)n" repeat_region 70850..70982 /rpt_family="AluJ" repeat_region 70986..71272 /rpt_family="AluY" repeat_region 71289..71464 /rpt_family="AluJ" repeat_region 71465..71523 /rpt_family="(GAAAA)n" repeat_region 72051..72285 /rpt_family="MIR" repeat_region complement(72457..72557) /rpt_family="LINE2" repeat_region 72587..72946 /rpt_family="MLT1A2" repeat_region complement(73227..73268) /rpt_family="MIR" repeat_region complement(73435..73728) /rpt_family="AluSx" repeat_region 74655..74960 /rpt_family="L1ME3" repeat_region 74955..75314 /rpt_family="L1MB3" repeat_region 75321..75538 /rpt_family="L1ME3" repeat_region 75563..75837 /rpt_family="AluSg" repeat_region complement(75842..76094) /rpt_family="AluSx" repeat_region complement(76095..76310) /rpt_family="L1MB3" repeat_region complement(76311..76711) /rpt_family="MSTC" repeat_region complement(76739..76953) /rpt_family="L1MB3" repeat_region complement(76977..77024) /rpt_family="L1M4" repeat_region complement(77031..77334) /rpt_family="AluSx" repeat_region complement(77335..78887) /rpt_family="L1M4" repeat_region complement(78897..78955) /rpt_family="(GGAA)n" repeat_region complement(78961..79257) /rpt_family="AluJo" repeat_region complement(79287..79386) /rpt_family="MER33" repeat_region 79407..79708 /rpt_family="AluSx" repeat_region complement(79760..79832) /rpt_family="MIR" repeat_region 79846..80020 /rpt_family="MLT1C" repeat_region 80041..80338 /rpt_family="AluSq" repeat_region 80385..80646 /rpt_family="MLT1C" repeat_region 80837..81492 /rpt_family="LINE2" repeat_region 81600..81623 /rpt_family="(CAA)n" repeat_region 82261..82385 /rpt_family="MIR" repeat_region complement(82498..82617) /rpt_family="LINE2" repeat_region complement(83057..83091) /rpt_family="MIR" mRNA join(83163..84091,85118..85149,235726..>235808) /gene="A-345G4" gene 83163..>235808 /gene="A-345G4" repeat_region complement(83231..83261) /rpt_family="(CA)n" repeat_region complement(83759..83876) /rpt_family="(CGG)n" CDS join(83919..84091,85118..85149,235726..>235808) /gene="A-345G4" /codon_start=1 /product="Protein kinase C beta (PRKCB1) (3'partial)" /db_xref="PID:g2695573" /translation="MADPAAGPPPSEGEESTVRFARKGALRQKNVHEVKNHKFTARFF KQPTFCSHCTDFIWGFGKQGFQCQVCCFVVHKRCHEFVTFSCPGADKGPASD" repeat_region 84096..84205 /rpt_family="GC_rich" repeat_region complement(86199..86222) /rpt_family="AT_rich" repeat_region complement(87021..87311) /rpt_family="AluJo" repeat_region 87434..87612 /rpt_family="MIR" repeat_region complement(88822..89116) /rpt_family="AluSx" repeat_region complement(89226..89391) /rpt_family="AluJo" repeat_region complement(89397..89698) /rpt_family="AluSx" repeat_region complement(89713..89825) /rpt_family="AluJo" repeat_region 89944..90076 /rpt_family="MIR" repeat_region 91437..91463 /rpt_family="AT_rich" repeat_region 91534..91577 /rpt_family="POLY_A" repeat_region 91697..91736 /rpt_family="(CA)n" repeat_region complement(92117..92252) /rpt_family="(TGGA)n" repeat_region complement(92257..92549) /rpt_family="AluY" repeat_region complement(92550..92844) /rpt_family="AluSx" repeat_region complement(92873..92972) /rpt_family="LINE2" repeat_region 93242..93376 /rpt_family="FLAM_C" repeat_region 94071..94371 /rpt_family="AluSx" repeat_region complement(95062..95321) /rpt_family="(TGGA)n" repeat_region complement(95346..95546) /rpt_family="LINE2" repeat_region 96406..96554 /rpt_family="MIR" repeat_region complement(97457..98097) /rpt_family="L1MA7" repeat_region complement(98112..98409) /rpt_family="AluSp" repeat_region complement(98410..100422) /rpt_family="L1MA7" repeat_region complement(100426..100563) /rpt_family="L1ME3A" repeat_region complement(101578..101939) /rpt_family="THE1B" repeat_region complement(102061..102185) /rpt_family="MLT1G" repeat_region complement(102346..102537) /rpt_family="MLT1G" repeat_region 102661..102722 /rpt_family="MIR" repeat_region 102871..102944 /rpt_family="(GA)n" repeat_region 103025..103165 /rpt_family="MIR" repeat_region 103166..103220 /rpt_family="(CAAA)n" repeat_region complement(103363..103681) /rpt_family="LTR16A" repeat_region complement(103907..103962) /rpt_family="MIR" repeat_region complement(104276..104571) /rpt_family="AluJo" repeat_region complement(105446..105503) /rpt_family="(CA)n" repeat_region complement(105599..105766) /rpt_family="MIR" repeat_region complement(105816..105863) /rpt_family="(CAT)n" repeat_region complement(105887..106014) /rpt_family="MIR" repeat_region complement(106560..106677) /rpt_family="MIR" repeat_region 106678..106890 /rpt_family="AluJb" repeat_region complement(106897..106972) /rpt_family="MIR" repeat_region 107616..107664 /rpt_family="(GA)n" repeat_region 107845..107984 /rpt_family="MIR" repeat_region complement(108085..108252) /rpt_family="LINE2" repeat_region 108351..108665 /rpt_family="AluSx" repeat_region complement(108918..109213) /rpt_family="AluJb" repeat_region complement(109774..109960) /rpt_family="L1M4" repeat_region complement(110440..110489) /rpt_family="MIR" repeat_region 110491..110798 /rpt_family="MER41C" repeat_region complement(110814..110860) /rpt_family="(TA)n" repeat_region 110865..111031 /rpt_family="(CATA)n" repeat_region 111023..111236 /rpt_family="MER41C" repeat_region 112058..112183 /rpt_family="FLAM_C" repeat_region complement(112206..112330) /rpt_family="LINE2" repeat_region complement(112492..112811) /rpt_family="L1PA16" repeat_region 112799..112990 /rpt_family="L1" repeat_region complement(112995..113045) /rpt_family="(CA)n" repeat_region complement(113048..113225) /rpt_family="AluJo" repeat_region 113226..113364 /rpt_family="(TA)n" repeat_region complement(114279..114638) /rpt_family="L1ME" repeat_region complement(114682..114868) /rpt_family="(GGAA)n" repeat_region complement(114875..115168) /rpt_family="AluSx" repeat_region complement(115179..115263) /rpt_family="L1P5" repeat_region complement(115454..115510) /rpt_family="MIR" repeat_region complement(116171..116358) /rpt_family="MIR" repeat_region complement(116713..116760) /rpt_family="MIR" repeat_region 117495..117781 /rpt_family="AluSx" repeat_region complement(118151..118274) /rpt_family="MIR" repeat_region complement(119323..119585) /rpt_family="AluSx" repeat_region 119613..119786 /rpt_family="LINE2" repeat_region 120058..120284 /rpt_family="AluJo" repeat_region 120292..120495 /rpt_family="MIR" repeat_region 120544..120564 /rpt_family="AT_rich" repeat_region 120794..121139 /rpt_family="AluYb8" repeat_region 121629..121649 /rpt_family="AT_rich" repeat_region complement(122785..123085) /rpt_family="AluJb" repeat_region 123229..123356 /rpt_family="(TA)n" repeat_region complement(123365..123512) /rpt_family="(CATA)n" repeat_region complement(123516..123573) /rpt_family="(TA)n" repeat_region 124903..125095 /rpt_family="AluJo" repeat_region complement(125579..125880) /rpt_family="AluSx" repeat_region 125887..126497 /rpt_family="MER82" repeat_region 126650..126937 /rpt_family="AluSx" repeat_region complement(127246..127538) /rpt_family="AluSx" repeat_region 127568..127820 /rpt_family="MIR" repeat_region complement(129001..129299) /rpt_family="AluSx" repeat_region complement(129308..129329) /rpt_family="AT_rich" repeat_region complement(129412..129691) /rpt_family="L1MB3" repeat_region complement(129692..129821) /rpt_family="(GAAA)n" repeat_region complement(129822..130128) /rpt_family="AluJo" repeat_region complement(130130..130282) /rpt_family="AluJo" repeat_region complement(130285..130498) /rpt_family="L1MB6" repeat_region complement(130502..130633) /rpt_family="FLAM_C" repeat_region complement(130645..130686) /rpt_family="L1MB6" repeat_region complement(130758..131481) /rpt_family="LINE2" repeat_region complement(131572..131893) /rpt_family="MER33" repeat_region complement(132284..132587) /rpt_family="AluJo" repeat_region 132690..132829 /rpt_family="MIR" repeat_region complement(133054..133353) /rpt_family="AluSg" repeat_region complement(133703..133874) /rpt_family="MIR" repeat_region complement(134158..134459) /rpt_family="AluSx" repeat_region complement(134792..134879) /rpt_family="(GAAA)n" repeat_region complement(134950..135599) /rpt_family="L1PA15" repeat_region complement(136790..136933) /rpt_family="(TGGA)n" repeat_region 136937..137198 /rpt_family="MIR" repeat_region 138684..138801 /rpt_family="(TA)n" repeat_region complement(138803..139099) /rpt_family="AluJo" repeat_region complement(139149..139202) /rpt_family="(CATA)n" repeat_region complement(140497..140551) /rpt_family="MIR" repeat_region complement(141661..141761) /rpt_family="MLT1G" repeat_region complement(142105..142216) /rpt_family="L1P3" repeat_region complement(142536..143179) /rpt_family="L1" repeat_region 143502..143552 /rpt_family="MIR" repeat_region 143917..143943 /rpt_family="AT_rich" repeat_region complement(144089..144466) /rpt_family="LINE2" repeat_region complement(144614..144777) /rpt_family="(TGGA)n" repeat_region complement(144781..144841) /rpt_family="MIR" repeat_region complement(145551..145619) /rpt_family="(TGGA)n" repeat_region complement(145644..145814) /rpt_family="AluSg/x" repeat_region complement(145815..146113) /rpt_family="AluY" repeat_region complement(146120..146467) /rpt_family="LINE2" repeat_region 146571..146743 /rpt_family="MIR" repeat_region complement(147351..147392) /rpt_family="(GA)n" repeat_region complement(147392..147415) /rpt_family="(CA)n" repeat_region complement(147422..147570) /rpt_family="AluJo" repeat_region complement(147600..147711) /rpt_family="AluJo" repeat_region 147782..147975 /rpt_family="MER20" repeat_region complement(148466..148497) /rpt_family="POLY_A" repeat_region complement(148515..148690) /rpt_family="L1ME" repeat_region 149663..149755 /rpt_family="LINE2" repeat_region complement(150029..150197) /rpt_family="L1MD3" repeat_region complement(150198..150324) /rpt_family="MIR" repeat_region complement(150458..150549) /rpt_family="(GGAA)n" repeat_region complement(150550..150877) /rpt_family="AluJo" repeat_region complement(151006..151056) /rpt_family="(CA)n" repeat_region 151444..151577 /rpt_family="LINE2" repeat_region 151593..151714 /rpt_family="LINE2" repeat_region 151926..152124 /rpt_family="MIR" repeat_region 152129..152428 /rpt_family="AluSx" repeat_region 152443..152504 /rpt_family="MIR" repeat_region complement(152724..152803) /rpt_family="(GAAAA)n" repeat_region complement(153176..153360) /rpt_family="MER53" repeat_region 153432..153524 /rpt_family="LINE2" repeat_region complement(154170..154197) /rpt_family="AT_rich" repeat_region complement(154360..154526) /rpt_family="L1" repeat_region complement(154790..154996) /rpt_family="MER20" repeat_region complement(155033..155312) /rpt_family="AluJo" repeat_region complement(156279..156564) /rpt_family="AluY" repeat_region complement(157036..157117) /rpt_family="MLT1C" repeat_region complement(157226..157594) /rpt_family="L1PB3" repeat_region complement(157598..157763) /rpt_family="MLT1A2" repeat_region complement(158396..158879) /rpt_family="L1MB6" repeat_region complement(158881..159174) /rpt_family="AluJo" repeat_region complement(159175..159271) /rpt_family="L1MB6" repeat_region complement(159578..159873) /rpt_family="AluSx" repeat_region complement(160258..160559) /rpt_family="AluSx" repeat_region complement(160572..160827) /rpt_family="AluJo" repeat_region complement(161751..162034) /rpt_family="AluJo" repeat_region 162246..162437 /rpt_family="MIR" repeat_region 163150..163456 /rpt_family="AluJo" repeat_region complement(163476..163505) /rpt_family="AT_rich" repeat_region 163656..163682 /rpt_family="AT_rich" repeat_region complement(163800..164105) /rpt_family="AluJo" repeat_region complement(164514..164818) /rpt_family="AluJb" repeat_region complement(164833..165123) /rpt_family="AluSx" repeat_region complement(166143..166443) /rpt_family="AluSx" repeat_region complement(167312..168064) /rpt_family="L1MB2" repeat_region complement(168586..168943) /rpt_family="THE1B" repeat_region 169004..169112 /rpt_family="MIR" repeat_region 169268..169554 /rpt_family="AluSx" repeat_region 171757..172140 /rpt_family="THE1B" repeat_region 172409..172623 /rpt_family="L1MA8" repeat_region complement(173384..173740) /rpt_family="L1PB3" repeat_region 173741..174019 /rpt_family="AluJo" repeat_region 174020..174248 /rpt_family="AluSg/x" repeat_region complement(174265..174320) /rpt_family="L1MA9" repeat_region complement(174322..174431) /rpt_family="(TA)n" repeat_region complement(174435..174571) /rpt_family="L1MA9" repeat_region complement(174684..174817) /rpt_family="LINE2" repeat_region complement(175001..175043) /rpt_family="LINE2" repeat_region complement(175248..175777) /rpt_family="LINE2" repeat_region complement(175871..176175) /rpt_family="AluSq" repeat_region complement(176464..176764) /rpt_family="AluSq" repeat_region 177080..177318 /rpt_family="MIR" repeat_region 177386..177471 /rpt_family="LINE2" repeat_region 178036..178324 /rpt_family="AluJb" repeat_region 178482..178554 /rpt_family="MIR" repeat_region complement(178582..178714) /rpt_family="MIR" repeat_region complement(181379..181570) /rpt_family="(TGGA)n" repeat_region complement(181599..181738) /rpt_family="(TGGA)n" repeat_region complement(181760..181808) /rpt_family="(TGGA)n" repeat_region 182882..183055 /rpt_family="MER5A" repeat_region 183339..183773 /rpt_family="L1PA16" repeat_region complement(184178..184459) /rpt_family="AluJb" repeat_region complement(184493..184588) /rpt_family="L1ME" repeat_region complement(184948..185034) /rpt_family="MER5B" repeat_region 186572..188152 /rpt_family="L1MC/D" repeat_region 188301..188342 /rpt_family="(CAAAA)n" repeat_region 188340..188444 /rpt_family="(TA)n" repeat_region 188562..188700 /rpt_family="MIR" repeat_region 188819..189016 /rpt_family="MER58A" repeat_region 189161..189836 /rpt_family="L1" repeat_region 190337..190419 /rpt_family="MIR" repeat_region 190850..190984 /rpt_family="MIR" repeat_region 190991..191276 /rpt_family="AluJo" repeat_region complement(191549..191632) /rpt_family="MIR" repeat_region complement(192110..192174) /rpt_family="AT_rich" repeat_region 193430..193483 /rpt_family="MER63B" repeat_region 193484..193524 /rpt_family="POLY_A" repeat_region 193526..193641 /rpt_family="MER63C" repeat_region complement(193758..193900) /rpt_family="MIR" repeat_region 194204..194371 /rpt_family="MIR" repeat_region complement(194429..194582) /rpt_family="MER5A" repeat_region complement(194602..194903) /rpt_family="AluSx" repeat_region 195634..195659 /rpt_family="AT_rich" repeat_region complement(196153..196454) /rpt_family="AluSx" repeat_region complement(196455..196634) /rpt_family="MIR" repeat_region 196933..197017 /rpt_family="(GAAA)n" repeat_region complement(197346..197456) /rpt_family="LINE2" repeat_region complement(197527..197617) /rpt_family="MIR" repeat_region complement(197881..197911) /rpt_family="POLY_A" repeat_region 198193..198302 /rpt_family="MIR" repeat_region complement(198996..199172) /rpt_family="L1ME" repeat_region complement(199181..199465) /rpt_family="AluJb" repeat_region 199705..199833 /rpt_family="LINE2" repeat_region complement(200748..200919) /rpt_family="MIR" repeat_region complement(201441..201497) /rpt_family="(TGAA)n" repeat_region complement(201525..201582) /rpt_family="LINE2" repeat_region complement(201635..201804) /rpt_family="AluSx" repeat_region complement(201805..201876) /rpt_family="L1PA4" repeat_region complement(201877..201997) /rpt_family="AluSx" repeat_region 202357..202519 /rpt_family="MIR" repeat_region 202520..202661 /rpt_family="AluSq/x" repeat_region 202847..202967 /rpt_family="AluSg/x" repeat_region 202968..203001 /rpt_family="(CA)n" repeat_region complement(203870..204090) /rpt_family="AluJb" repeat_region complement(204133..204418) /rpt_family="L1P3" repeat_region 204606..204716 /rpt_family="L1" repeat_region 204729..204757 /rpt_family="POLY_A" repeat_region 205590..205708 /rpt_family="MIR" repeat_region complement(205710..205755) /rpt_family="(TAA)n" repeat_region complement(205757..206039) /rpt_family="AluSx" repeat_region complement(206053..206350) /rpt_family="AluSg" repeat_region 206367..206419 /rpt_family="MIR" repeat_region complement(206432..206530) /rpt_family="(GGA)n" repeat_region 206634..206859 /rpt_family="MIR" repeat_region complement(207544..207662) /rpt_family="MIR" repeat_region complement(208218..208476) /rpt_family="L1PA10" repeat_region complement(208825..208951) /rpt_family="FLAM_C" repeat_region 209392..209628 /rpt_family="MER46" repeat_region 209877..210003 /rpt_family="LINE2" repeat_region complement(210503..210886) /rpt_family="L1PB2" repeat_region 210896..211110 /rpt_family="MIR" repeat_region complement(211799..212006) /rpt_family="MER3" repeat_region complement(212069..212233) /rpt_family="MER58A" repeat_region 212234..212278 /rpt_family="LINE2" repeat_region complement(212307..212377) /rpt_family="MIR" repeat_region complement(212423..212717) /rpt_family="AluSx" repeat_region complement(212754..213180) /rpt_family="L1PA16" repeat_region 213199..213259 /rpt_family="LINE2" repeat_region 214085..214301 /rpt_family="MIR" repeat_region complement(214504..214618) /rpt_family="LINE2" repeat_region complement(215361..215398) /rpt_family="(GAAA)n" repeat_region complement(215400..215702) /rpt_family="AluSx" repeat_region 215910..215981 /rpt_family="MIR" repeat_region 216685..216937 /rpt_family="AluSx" repeat_region complement(217104..217206) /rpt_family="L1ME1" repeat_region 217243..217399 /rpt_family="MIR" repeat_region 217400..217461 /rpt_family="(CAT)n" repeat_region 219569..220066 /rpt_family="L1ME1" repeat_region complement(220535..220642) /rpt_family="LINE2" repeat_region complement(220666..220863) /rpt_family="MIR" repeat_region complement(220928..221230) /rpt_family="AluSx" repeat_region complement(221237..221345) /rpt_family="MIR" repeat_region complement(221431..221707) /rpt_family="AluJb" repeat_region complement(221993..222253) /rpt_family="LINE2" repeat_region 223804..224062 /rpt_family="AluJo" repeat_region complement(225084..225245) /rpt_family="MER5B" repeat_region 226488..226688 /rpt_family="MER3" repeat_region complement(226823..227037) /rpt_family="MIR" repeat_region 227468..227576 /rpt_family="MIR" repeat_region 227590..227852 /rpt_family="AluJo" repeat_region complement(228190..228352) /rpt_family="MIR" repeat_region complement(228546..228573) /rpt_family="AT_rich" repeat_region complement(228791..229236) /rpt_family="MLT1D" repeat_region complement(229307..229423) /rpt_family="(GAAAA)n" repeat_region complement(229637..229945) /rpt_family="AluJb" repeat_region complement(230801..230869) /rpt_family="LINE2" repeat_region complement(230941..231121) /rpt_family="AluSg/x" repeat_region complement(231415..231662) /rpt_family="MIR" repeat_region 231785..231892 /rpt_family="(GAAA)n" repeat_region complement(232309..232613) /rpt_family="AluSx" repeat_region 232883..233366 /rpt_family="L1" repeat_region complement(233470..233767) /rpt_family="AluJb" repeat_region complement(234118..234138) /rpt_family="AT_rich" repeat_region 234353..234465 /rpt_family="L1PA8" repeat_region complement(235647..235720) /rpt_family="(GA)n" repeat_region complement(236177..236481) /rpt_family="AluY" repeat_region complement(236504..236601) /rpt_family="AluJ/FLAM" repeat_region complement(236725..236763) /rpt_family="AT_rich" repeat_region 237395..237695 /rpt_family="AluY" repeat_region complement(237699..237725) /rpt_family="POLY_A" repeat_region complement(237761..238065) /rpt_family="AluJb" repeat_region 239118..239474 /rpt_family="MLT1A2" repeat_region 239478..245115 /rpt_family="HERVL" repeat_region 245116..245612 /rpt_family="MLT2CA" repeat_region 245702..245931 /rpt_family="MIR" repeat_region 247614..247819 /rpt_family="MIR" repeat_region 248054..248291 /rpt_family="MIR" repeat_region 248754..248902 /rpt_family="L1" repeat_region complement(249183..249310) /rpt_family="MLT1G" repeat_region complement(249517..249579) /rpt_family="(CA)n" repeat_region complement(249580..249856) /rpt_family="AluSx" repeat_region complement(250601..250902) /rpt_family="AluSx" repeat_region 251636..251894 /rpt_family="AluJb" repeat_region 252254..252386 /rpt_family="L1ME2" repeat_region complement(252489..252511) /rpt_family="AT_rich" repeat_region 254275..254495 /rpt_family="MIR" repeat_region complement(255265..255457) /rpt_family="MIR" repeat_region complement(255881..256492) /rpt_family="L1MA10" repeat_region 256491..256644 /rpt_family="L1M4" repeat_region complement(256645..256829) /rpt_family="AluSg/x" repeat_region 256833..256949 /rpt_family="L1P5" repeat_region 256950..257237 /rpt_family="AluSx" repeat_region complement(258089..258385) /rpt_family="AluJb" repeat_region 258642..258715 /rpt_family="MIR" repeat_region complement(258735..258961) /rpt_family="AluJo" repeat_region 259004..259146 /rpt_family="MIR" repeat_region complement(259410..259640) /rpt_family="MIR" repeat_region 259643..259854 /rpt_family="LINE2" BASE COUNT 68538 a 56003 c 57421 g 77882 t 50 others ORIGIN 1 aagcttgtgt atgatgtgtg catgggagtt cctaattact ctccggccta atttactttg 61 gtgtagcact gactccaata ttaaggctgg agaagccctt tccaagatca gcatcccaag 121 ctggcctgcg gtgcttatca cagcttctta gggctctcat ctcctcctgg gttcagaact 181 gaacccaaac tacatgcaca gcctgactca cagctgtggg ttgcagtgat tgctgggctt 241 cattagggag gagtgtggtt tcccgcatgc ttagtctttg gggggagtct gaaagagaat 301 caagcattcc ttatgccagt tgcttaatcc ttgggggtat ctgaaagaag caagcacttc 361 ttagatcaat attttcaaga cttatctgat gcaggcctct gcctggggct ggaggataga 421 ctgtttcttc agcttctctc tctccagggt ggatatgaac ttgagccctt gcaatgaact 481 gcttgcatgt tttcccccct gggaaatttg tgtatgtatg tgcaattatg atgggggaaa 541 cggtttaaag atactttcaa aagtggacac ttctagtgtt tgtcccttga caaatgagtt 601 tctcgacctg tttttttttc attatcaacg cctcccctag tctttttaat ttccccaccc 661 tcacccctac cccatgaaat tttattttct ttcttttttt ttttttgaga ccaagtttcg 721 atctttttgc ccaagatggc gtgcaatggc gcgatctagg ctcacctcaa tctccacctc 781 ccgggttcaa gcgattctcc tgcctcagcc tcccgagtag ctggaattac aggcatgcac 841 caccacgccc agctaatttt gtatttttag tagagactgg gtttctccat gttggtcagt 901 ctggtctcaa actcctgacc tcaggtgatc cgcccgtctc gacctcccaa agtgctggga 961 ttacaggcat aagccaccgc gcccggcctc ttttcttttc aagacagggt ctcactctgt 1021 cacccaggct ggagtgtaat ggcgtgatca tggctccctg cagcctcaac ctcctgggct 1081 caagtgatcc tcctgtctca gactcccaag tagctgagac tacaggcatg tgccacgaca 1141 cctggttaat tttttttttt tgtattcttt tgtagagatg gggtctcact atgttgccta 1201 ggctggtctt gagctcctgg gctcaagcaa tcctcctgcc tctacctccc aaagtgctgg 1261 gattacaggc atgagcctcc acgtctgccc tgaaatttca atatcacagc tatactgtat 1321 atctatttat gtgctgtatg tatgctgtgc tttgtacata aaaaatatgt tttatttttt 1381 gccccgttgg ggtgagatag agaccgcact gagaatgcac gctttacata attagttcac 1441 aacttgcccc cggcagcctc caccacgtaa aattagaaga aagcagccca gcagggccta 1501 caaggaatgt gtgaacggat gcacgtaaag tgcttagcaa agcgtggtaa acaagagccc 1561 ttgttgtcat tcttgcttta agcctagtcc cagtgatgtc tgtaaccctc tcttgggcca 1621 aggagtccca tggctctgga gggctgttca gaggactaag tcattcccag tctggagctc 1681 tacgggatca gtaggtacct aagggagtga attgggcttg gagggactgt ggccagaaga 1741 cagctctggg cctggtggtg atagaacttc tgatttctca ggaaaagcca agaatgtgga 1801 ttttgggtta aatctcctga attttaagca ttggctcaat taaacaacaa taaggcaatg 1861 tgaaccgaac aaaagggatg aaaccaagat ttactgactt tcccagagtc acgggcacgg 1921 ctgggattcc aacaaggact cccaaaccag tgctctttcc acttaaccta acgcacaccc 1981 ttcatctcgc cgcccctccc cacttccccg tctttgggag ttcttcgatg cctcttcccc 2041 cctcccccca aggattccct ccccataggc cacccgcctt ccagcagggc gtggtctcta 2101 cagaggaggt caggtctccc aggcttcctc agcgtcgacg ccggggagcg ccctggggtt 2161 gggagcgccc ggggcccgcg aaggaggagg tggctcaggt gtcagggcgc accgtgggaa 2221 ccggccccgg gggaggcgtg gaaacgcggg gcctgggact cgaccagcct ctcccgtgcg 2281 gatcgcaaaa tctcggagct gaaacagccc gttgttcgca gcctccctct gacccgccac 2341 cctgcacatt gttttccatt cgcccgggtc gcgggtggga ggagaggcac gccggggttc 2401 tggagcttgg ccgcgcgcca ggcttgtggc cttcgtcccc ctggggccac tggggcggcc 2461 acgcctctcc ggcgggagga gagaacgcgt gggtccgggt ggctgctccg gcccttccgc 2521 ctccagctcg gccatggggt cgcgcagctc ccacgccgcg gtcattcccg acggggacag 2581 tattcggcga gagaccggct gtgagtgcgc ccgcgtcggc ggctgcggag gggacggggc 2641 gaacccaggc gtctggggct agggaagggg ttgggttgaa ggatggacga acttacaagt 2701 ctggggtccg gggctccccg gagctggaag accaaggccc ctgtgcctgg gatcgctggg 2761 ttaagggcgg gttaacctag gggtcccagc ctccaagtct ggggaggatc cgggttcacg 2821 gggtcggagt ccagaggaat ccaggcaccc aggtgtcctc cagcccggct cgaagctgaa 2881 ggcagagctg acggcggttg gaagggatgg ctttggttct ttggttttcg ggaggttgcg 2941 agccgcccgg gtcttgaacc tggatcttcg cggggtcgtg tcactctccc tccgccccgg 3001 ccagctcaac ccctgattcc ccgggatgat cgccccttcc agccagcccc cgctgttctg 3061 gcccatcttc ctggcctgtt ggggcgggtt tctggagctg gccctgcaga gtcacacacc 3121 cccaccccac tctccttccc gcagtctccc aagccagcct gctccgcctg caccaccggt 3181 tccgggcact ggacaggaat aagaagggct acctgaggtg agggggagcc ggcctcataa 3241 cttctggcct ctgtctctct gactccatgt ccctctttga ccctccgtct ggcatctctg 3301 ccttaccctc tttgaaattt ggctttgcag ccgcatggat ctccagcaga taggggcgct 3361 cgccgtgaac cccctgggag accgaattat agaaagcttc ttccccgatg ggtgaggctt 3421 gctgggcgtg gggaggtgaa ggcgggaaaa ccggtgtgtg agtgggtggg aggggaggtt 3481 agagacggag gcaaagtgat ggccaaggtg accaccacct ctttccattc tgtcccgtct 3541 ccccaggagc cagcgagtgg atttcccagg ctttgtcagg gtcttggctc attttcgccc 3601 tgtagaagat gaggacacag aaacccaaga ccccaagaaa cctgaacctc tcaacagcag 3661 aaggaacaaa cttcactgtg agtttgtgag gacctgcaca agtgagaatg cagatgtacc 3721 cacaccaggg acaggctcca gggatctccc actcccttcc tgagggcatt gacaacctcc 3781 cccctcctcc aaggtggggg gaaggaaggt ggctgctgga agagagccag ggaagaccta 3841 ccttcctttc cccctcccac cctctcccca gatgcatttc agctctatga cctggatcgc 3901 gatgggaaga tctccaggca tgagatgctg caggttggca gaaagcgaga gcaagagatg 3961 tgatgtgtga aggatgggat ggttaaatgc aatggtgtga gagatggggt gggatgattg 4021 gggaactggg gcgcagataa ttggggtatt ggttgggaaa tgggaatggg tgcagttaga 4081 gtgtgggttt gttgggagtc ggagtgggga gcagttgact atgaggttgg gaatagatca 4141 gtgattctca actgggggtg attttgctcc ctgagtgaat atctggcaat atctggagat 4201 gctttttgtt gtcacaatgg gggagagagt gtgctactgg catctagtgg gtagaggtca 4261 ggaatgtggc tatatatcat gcaataccca ggagagcctc ttccaacaag gagttacctg 4321 gctccaaatg tcaacagtgc caagattgag aaactctgga acagatgtga tggaatgagg 4381 atggaattag aaacccttag tggctaaggt ggagaatggt gtggaggatg gatggaggtt 4441 gggtgataaa gttgggtaat gagtttgagc atattttgag ggtagtgcag gatggtgagg 4501 gagagatagg agattggttg ttgaagcagt agggaaaatt attgggggga tatggtgaaa 4561 aatggatgaa ggatggatta taaaattagc aataactttt gggatgaggt gggcaaggtt 4621 taggagatgg ggagttgcag ccttcgtgcc ccctccttat ggctgcctct tcactcatct 4681 ctcaggttct ccgtctgatg gttggggtac aggtgacaga agagcagctg gagaacatcg 4741 ctgaccgcac ggtgcaggag gctgatgaag atggggatgg ggctgtgtcc ttcgtggagt 4801 tcaccaaggt cagagtgccc ttggggattg gggagttgag atcaggagtt ctggggcaga 4861 cagacttggg aaacgttcaa cggaggaggg tggaaagata ggggttgcct gggaatgatg 4921 gggtctctgg aatgggtaga accctggggg aggaggttgg gagcagggag agctgagagt 4981 caagcctctt gcctgccatt tgtttttccc tcagtcctta gagaagatgg acgttgagca 5041 aaaaatgagc atccggatcc tgaagtgact ccgtttgtgc cttgggcttg ctcctgcaac 5101 cagtatctcc ttggaattca tccaaagccc ccatggacgc atggacgcag ggcgacaata 5161 aactgtattt tcgtttctaa ctctatttag ggccaagaga agaaagctgg aaggatgtgt 5221 actaaagtct agctcagcag tccccaacct ttttggcatc agggacagtt tttccacgga 5281 tgggtgacag gggatggttt tgggatgatt caagtgcatt acatttattg tgcactttat 5341 ttctattatg attacattgt aatatataat gaaataatta tacaactcac cataatgtag 5401 aatcagcagg agccctgagc ttgttttcct gcaattagac ggtcccatat gggagtgatg 5461 ggagacagtg acagatcatc aggcattaga ttctcataag gagtgcacaa tctagatcct 5521 ttggtgtgca gttcacagta ggatttgggc tcctatgata atctaatgcc actgctgatc 5581 tgacaggagg cagagctcag gcggtaatgc aagcaatggg gagtggctgt aaatatagat 5641 gaagcttcag ctcgcctgcc gctcaccttg tgctgtgcag cccggttcct aacagaccac 5701 agaccccaca ccaggtctat ctcatttggt ctcagagctg tgaatcagcc agcaatattt 5761 tagttgcaaa tcactgaaaa cccaactcaa agtgacttaa gtcagaaaga aattttatga 5821 attcaggtaa ttaaaaagtc cagaagtatc tgcctttagg cacagctgga tccaagggca 5881 caaatgatgt catcaggctc cagttattct ccatctccca gctcagcttt ttctgtctgt 5941 aagcctgatt ttcaggaagg ctctttccta gtgatggaga tgaccaccat cagctccagg 6001 cttctatcct gctaacccag taacccagtg ggaagagatt tacttattcc aataattcca 6061 agtggagagt gtcattgacc cgtttggggt ctcatctcta cttctagggg aatgaaacac 6121 tctgagtggc caggcctgtg tcatgtgcta attcctagag ccagggaaat aaggtctgag 6181 gattcaggat ggggtgaaag gtggttgctt aaaggaaaat gaaatacaat tagcagaata 6241 aggggaaacg agtggtctgc tctgctcggg caaaacaaga gatgcccatt actgtgaggg 6301 acccttgaag tctggactct taaatgggtt tttgctgatt tcctgggtgc atgctaggat 6361 gatggggctt gatgcagtag ggaagagacg atgtaaaaat aataaacaat atataccttc 6421 ctagagtgtg aatgcattcg aagattctaa gagtaagctt gttttcatgg taagatgttc 6481 aaacattgat ataagatgga atggagcagg aagtcccata aaactatgag actctatgtc 6541 tgcagcagag ctctctggtt gcacctgtgg gagacactgt aattgccctg ctgtttttct 6601 ccctcccaga gctggtcttt ggttgtcttg tatcatcaac acatgagaac cagttgtgct 6661 cataacatct caactctgct ttcaatgata tgttagcagc ttgaaactgg ctatagcagg 6721 aatatttaca ccacagacat tggtaactgc aacaggttaa ggctttttcc tcctggaaag 6781 ccagttgtta aacatttatc tgcacactac ctgctagaca tttcccatta ctttgcatgc 6841 caatggtttc tttccttttt tctcccttct ttctttcttt ctttctttct ttctttcttt 6901 ctttcttttt ctttctttct tttcttatcg gttaatatca gttcaactgt cttaaacaga 6961 aaggaaaatt aatcagctca cataacaaaa acttccaagg tagcacctct tcagggttag 7021 taaattcagt ggttcaatgt catcatagag aatccttccc atcttctgcc atcctcaggc 7081 tagtacccct catagtccca agatagcaga gttctagggg tcgccaagat agactgcaaa 7141 aattgctgca ctacatttta tcccctccca taaagaaata tggtctattt ctttcctttt 7201 tttttttttt tttttttgag acagggtcta gctttgtcac ccaggctgga gtgcagtggc 7261 agtcttggct cactgcaacc tctgcctctc gggctcaagc aatgctccca cctcagcctc 7321 ttgagtagct tggactacag gcacatgcca ccatgcctag ctaatttttt ttttaaaaag 7381 aggtggggtt tcaccatgtt gcccaggcta gtcttgaact cctagactca agtgattcac 7441 ccacctcggc ctcccaaagt gctgggatta taggcatggg ccactgcacc cagccttaca 7501 tgctagtggc ttctttttgc aagcactgtg atgccttgct ccagggcttt gcacatattc 7561 agcctgtata aagggcaggg tggaagtgcc aggatttaat gacattagga gtagctctca 7621 actagtgaca gatggaaatt ggttggataa atacccatta cttcatcatt caggtggcac 7681 aatttagata catgttttct taattttcat agcaatttcc agcagatttg agcccccatt 7741 gctcacaggg ttaattcatt tgcttgttaa atcaccctgt cctggccggg ggcagtggct 7801 caggcctgta atcccagcac tttgggaagt caaggtgggc agatcacctg aggtcaggag 7861 ttcgagacca gcttggccaa catggtgaaa ccctgtctct actaaaaata caaaaattac 7921 tcaggcgtgg tcacatgcac ctgtaatccc agctacttgg gaggctgggg caggagaatt 7981 gcttggacct gggaggcaga ggttgcagtg agccgagatt gtgccactgc acttcagcct 8041 aggcaacaga gtaagactgt ctcaaaaaac acaaccccaa aacaacaaac aacaacaaca 8101 acaacaacaa caaaaaaccc accctgtcct ggcttccttc ccttcctgta tcacgggcgt 8161 acttccctat tagtgcttcc cggatcacct cacaaataaa cttactgtca aacctcaggg 8221 tgcttctggc aaaacccaac ctaagagaca ctagaccttc cagaacataa gctcacactc 8281 aaccatggtt gtgagtactc agtgggacat tttgaggaag cattgcacag ctgctggcac 8341 tttctgattt tagaagtttg aggtcatgta aacacgacac actttggggc cccaggaacc 8401 tggaattgtc cggggttaac cacaatgtga aatgtggatt aaccacattt caacttttaa 8461 catttattat tattattatt tgagatgggg tcttgcttta cagcccaggt tggagtgccg 8521 tggtgggttc tcagctcact gcaacctcca cctcctgggt tcaagcgatt ctcctgcatc 8581 agcctcctga gtagtgggga ctacaggtgt gcgccaccat gcctggctat tttttttttt 8641 tttttttttg agatggattc ttgctctgtc gccaaggctg gagtgcagtg gtgtgatctt 8701 ggctcactgc aacctccacc tcctgggttc aagcaattat cctgcatcag cctcccgagt 8761 agctgggact aaaggtgtgc accaccatgc ctggctaatt tttgttcttt tgtttttttg 8821 gagttggagt ctcgctctgt ttccaaggct ggagtgcagt ggtgtgatct cggctcactg 8881 caatctcagc ctcctgggtt caaatgattc tcctgcctca gcctcctgag tagctgggac 8941 tacaggcgtg tgccaccatg cccggctaat ttttgtattt ttagtagaga cggggtttcg 9001 ccatgttggc caggctgctc ttgaactcct gacctcaagt gatccgcccg cctcggcctc 9061 ccaaagtgct gggattacag gcatgagcca ccacacctgg tctgaacccc cttttttggc 9121 ttcccaaagt actgggatta taggcaagag acagtgtgcc caacccagat atatatattt 9181 ttaaataagt gattctgata ccagaagatg aaggaaaaaa cttttaaaaa cattgaccta 9241 aagcttccgg ccaggctcct agtgactgct atctataaga aggtctccaa cagccataag 9301 gcactgagca tagcaccggg cccacaggct ctaaagaaag gtctccccat ccattcatgc 9361 ccacaccttc tctaggaact cagtaccaat cactagtgct cctcatctaa taagggttac 9421 tagagggcct gagggatccc caggattcca ctcgctgctc tggttgttgg gggcacaaac 9481 cactctcagc acgaaagtcc ccagatccca cccaaacctc aggaggggag gcttttgtca 9541 tgcaggtgag gggctgcagt ctggctgctg ggcagaaact gctgaacgcc accctccgga 9601 tgtcaagtct ggcagatgcc tgtggatttg ttctagaggg gtcctgatga gaagcccctc 9661 tcctgtgagc agcagttgac aatgttcagc gatttgttta ccatcgtcat tattgttgct 9721 gtacagtatc tatgtcctgt actgttactc attgtttgtc tataaacaca ctcatttaaa 9781 atttatacat ttgagtttgt aaaggaaatt gtatttactc ctataaattt tattttcttt 9841 ttaaaatata ttttctgcat ccaatcatcc attatcactc ctataaaata aaagcagcat 9901 caatggctat aaataaaata actacacaaa tacagtacat gcaaagaaac aaaaaataca 9961 catataaggg aatatcaaca caaagaaatg atatatgttt gaggtgaaga atatgctaat 10021 taccctcatt tgagaattac acattgtatg catgtatcaa aatatcacat tgagctgggc 10081 gcagtggctc atgcctctaa tcccagcact ttgggaggct gaggtgggca gatcacctga 10141 ggtcaggagt ttgagaccag cctggccaac atggtgaaac cctgtctcta ctaaaaatac 10201 aaaaattagc caggcatggt ggtgggcacc tgtaatccta gctacttggg cggctgaggc 10261 aggagaatca cttgaaccca ggaagtggag gctgcactga ggtgagattg caccactgca 10321 ctccagcctg ggaaacactc aaacaaaaac aaaaacaaaa acaaaatatc aactgtattg 10381 cataaataag tatgattatt atatgtcaat caaaaataat actttcttta caggggggag 10441 attaacaagt gttaaagagt tttaaaagat aaacctgcct caaacagaga tttctctttg 10501 gtgatctgaa gaatgcaaaa aagaattgag aagggtacaa tttttttttt ctttttttga 10561 gaccagatct tgccctgtca ccaagcctgg agtgcagtgg tgcaattttg gcacactgca 10621 gcctctactt cctgtgctca agtgatcctc tcacctcagc ctcctgaata gttgggatca 10681 caggtacatg ccactacacc tcgttaattt tgctattttt tgtagagatg gggtcttgct 10741 atattgccta ggctggaatt acaagtgtga gctaccgcac ctggttagga ataacctttt 10801 cactgtgtga ttcaatgtta ttgagctgtg ttgttactac ccaaaatcat cttgtgtatg 10861 tgagtggtgt gtttcccaca ccttgggaac ctctggctgg agcaatctat gaggaaataa 10921 acttgggttg agtcttccct tcctgtccct tcagggagca tctggaggca gccatcagtg 10981 ggctgaacat cgtgatgctc aggttggcca ctggtcaggt tgggtttggc agaggttggc 11041 catcgtagcg gaattcactt atcaggccag gtccagggac aagtaacagg acattatgac 11101 agatgcttcg tggagctgga aatattaaaa gcaacagccc tctgctgaca ttcgtctcca 11161 gaacctggta agacttgtaa gtttggagac agggcccatg aacttgccca cttgctgggc 11221 taagatggca gagtccccaa gggagaaact ggaggtggct ctagtcctgg tgcttgcgat 11281 cctcatttga gagggagggt gcttcccagc caggagtggg aagggcactg gagtggtccc 11341 agcttgttcc actgtggctg ctgggaaagg ccttggctat cctggaattt tctagaataa 11401 ctcatctatg tatctcaaag aggagacggg agttccagaa gaagcacctc tccttgccaa 11461 ggagacaaaa cttgagaggg tatgtaggga gccctcggaa gagaagcaag tacaatcatg 11521 catcacttag cgactgggaa atgaaatgcg ttgttggtga tttcgtcatt gtatgaacat 11581 cataaagcgt acttacacaa acctagatgg tgtagcctac tacacgccta ggcgatacgg 11641 tgtagctgat tgctctcagg cgaggagaaa cctgcacggc atgttactgc attaaatact 11701 gtaggcaact gcaacataat ggtaagtatt tatgtatcta aagatagcta aacataggaa 11761 tgttacagta aaactgtggt attacagtcc tgtgggacca tcgtcatata tgtggtccat 11821 cattgaccaa aacgtcatga cgcagcgcat ggctgtagaa ggacgtaggt agtgatgttg 11881 ggtagacctg gttatgctgc cattgcccct tgctaactga taaggaaaat agttaggagt 11941 tctgaccttg tcctcagccc tgcctaagca acaccaataa aactcttttt ttgtgagtga 12001 tggagtctca ctctgttgcc caggctggag tgcagtggcg ggatctcagc tcactgaaac 12061 ctctgcttcc cgggtttaag tgattctcct gcctcagcca accgagtagc tgggattaca 12121 ggtgcccact atgacgccgg gctaattttt gtatttttag tagagatggg gttttgccat 12181 gttggctagg ctggtcttga actcctgacc tcaggtgatc cacctgcctt ggccttctaa 12241 agtgctggga ttacaggcgt gagccaccac gcctggctcc aataaaactc ttaatgttta 12301 agaggcaaca tacagacttg attcctggta gagccctacc ctgagaatca gtgtggttta 12361 gtaaaaccac actggcgata cagtatagct gagtcttgtt aggcctgggt ttgaatctgg 12421 acccttataa tgagctgccg tacaaccttg ggtaagtgat tttaaccctt ggggctttag 12481 cttctgtcat ctgtgacaga gagagatact agcagctgcc ctccagagct gcatcttatc 12541 ctttctctct ccccctctcc ctcaaaccta cttattttag ctgaggccct tggcctcatt 12601 tgtaagcctg gactctggcc tgcaccagct gggattcttt aattgttaac tggctaataa 12661 aataatagaa atttattggt tcatataact gaaaaaaaag tccaaaggta caccaggcaa 12721 aggtgagtcc caaaggctca atgatattac cagaaactag tctttcattt cttaactctg 12781 ctgtcctctg tgttggtttc attcattcat tcattctttt ttttgagaca ggtctcattc 12841 tgttgcccag gctggagggc agtggtgtga tgttggctca ctgcaacctc tgcctcccgg 12901 gttcaaggga acctcctgcc tcagtctccc aagtagctgg gattacaggc atgcgccacc 12961 acactgggct aatttttgta tttttagtag ggacagggtt tcaccatctt ggccaggctg 13021 atcttcaact cctgacctca agtgatccac ccacctcagc ctcctaaagt tctgggatta 13081 ccagcgtgag ccactgagcc cagccaatgt gttggtttca ttcttaagca gtaaaataaa 13141 ataccaataa agagagcctc tctctctttt ttaaaaactt ctattttaag ttcaggggta 13201 caagtgcagg tttgttaaac aggtaaactt gtataatggg aggtggctac acagattatt 13261 tagtcaccca ggtattgagc ctagtaccca ttagttattt ttcctgatcc tccttctcct 13321 tccacccacc accctccaaa agaccccagt atgtgttgtt cccctctggg tgtccatgtg 13381 ttctcatcat ttagctccca cttataagtg aaaacatgtg gtatttggtt ttctgtttct 13441 gtgttagttt gctaaggata atgagtctct ctttcccaac agttctgtca aaagtcccag 13501 aacaaaacct cattggtcca tccattgtag ccaggtaaat gcaacactct ggttagccag 13561 acctgctcac atgcttgccc cagtactatg gactgaagtc aatcccactt gaaccaaatg 13621 caatgtgtat gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt tgattcccta aaggaatatc 13681 agaatgttgt tatgggaaga aggaggctgc atgttggaca ggcaaaatcc acagataact 13741 atatactagt tagtttgctt ttggctacaa ataacagaat tgccagctaa cagtggctta 13801 aataattaaa acaattatct cacattataa gatgtctctg gtggtaggca gtcttagagt 13861 tattgaatga ctcatcaata taattgagac tccaagatgt tttatccttc tattcctcag 13921 tatgttggct tttaatgctc aggtgtgttg actcatggtt acagaatggc tgccatagct 13981 ctagccatca catcctcaca ggaacagtgt ctagaaaagg cttctggaga gagtgtccta 14041 agaaacagga agtgtaagac cccagtgtct taaagcctga gaaagtggca caatgtcact 14101 tctgttataa tgtattgaga caaaacaggt ggctcatgcc tgtaatccca gcactttggg 14161 aggctgaggt caggagttcg agaccagcct ggccaacata gggaaaccct gtctctacta 14221 aaaatacaaa aattagccgg gtatggtggc aggtgcctgt aatcccagct acacgggagg 14281 ccgaggcagg agaatcactt gaacgtggga ggtggaggtt gcagtgagct gagactgggc 14341 cactgcactg cagcctgggt gacaaagtga gactttgtct tgaaagaaaa aaaaatgaat 14401 tgagacaaaa cagtcctaga gcacacagaa gaggaaatat agactccatc tctggatggc 14461 aggagtactg atgaatttat ggtcatcttt aatctgacat tgtggttaca ggtggggatg 14521 cataattttc ttgcagggaa ggaaagtaga cattaaaaaa aatgaataat aaaatgggcc 14581 ttatcttacc ggttaacatc tatcatttat cacgaaacta aagtaaataa aatggttggt 14641 ttgggaagcc tgtaaagttt aggcaactct atgtaaactg ctttgtgaat ggatttttct 14701 catgcttatt aatgcaaagc actaaatagg tagatgtaaa agagctttgt gtatataaat 14761 aatgtaaagt atctagcaaa ctattacagt attcaaaaaa atataataaa atacactttg 14821 ggaggtcgag atgggaggat cgcttgaggc taggcatttg agaccagctg ggcaacatag 14881 tgagatcatg tctctaaaaa gtaaaacatt agctggcagt gcacacctgt agtcctagct 14941 acttaggaga ctgaggcagg aggatcactt gagcccaagg aggtccaggc tgctatgagc 15001 tgtgatcatt ccttgcattc cagcctgggc aacagaataa gaatccatct ctaggctggg 15061 cgtggtggct cacacctgta atcccagcac tttgggaggc cgaggtgggt ggatcacgag 15121 gtcaggagat cgagaccatc ctggctaaca cagtgaaacc ccatctctac tacaaataca 15181 aaaaattagc cgggggtggt gatgggcgcc tgtggtccca gctaatcggg aggctgaggc 15241 aggagaatgg tgtgaaactg ggaggcggag cttgcagtga gccgagatag cgccactgca 15301 ctccagcctg ggtgacagag caagactcca tctcaaaaaa aaaaaaaaaa agaaaagaaa 15361 aggagtataa tagcacttta ggaggctgag tgggaggatc acatgagctc aggagttcaa 15421 gactagactg ggcaacataa tgagaccctg tctctacaaa aaatacaaaa attagccgag 15481 tgtggtggtg tacacctata gtcccagata cttaggaggc tgaggcagaa gaattgcttg 15541 agcccagggg gttgaggctg cagtggagtg agctgtaatc acatcactgc actccagcct 15601 gggtgacaga gacagatctt gtctcttaaa aaaaaaaaga aagaaaaaaa gtgtgattat 15661 taatgtatta atataatcct gagatgcgga agctttccta agaagaatca taaagtaaaa 15721 tattaatagt gttgactaca taaaaattta aaaaattaaa ggtaaaagac aaactggaga 15781 attatttgca tcatgcatat cagacaacag gttgcaatct tttacatata agtagctctt 15841 tcaattggaa gaatatgacc atcctatatc caacaatatg aacaagaagg tttttttctt 15901 tttttttttt tagaaatata aatatctagg ctgggtgtgg tggctcaagc ctgtaatccc 15961 agcactttgg gagaccaagg caggtggatc acctgaggtc aggagttcaa gaccagcctg 16021 gccaacatgg aaaaaccctg tctctactga aaaatacaaa aattagctgg gcatggtgat 16081 gggcacctgt aatcccagct acttgggaag ctgaggcagg agaattgctt gaacccagga 16141 gacagaggtt gcagtgagcc gagatcgtgc cactgcactc cagcctggac aacagagcga 16201 gactctatgt caaaaaaaaa aaaaaagaaa gaaagaaata taaatgtcta aatgcatatg 16261 aaaacattca acctcaatgt tgattaaaat ttgattaaaa tcacaactct tctgaggccc 16321 agtctagttt tatatatcaa aagccctaaa aatggggaga ttatttgtcc cagatcctca 16381 tttctaggca tgtatcctaa gcaaataatt aaggatgtgc aaaaagatat gacctaccag 16441 attgttcact taggtcttgt ttataacaaa caaagtggat catcagagtc atccaactgt 16501 gcgccaatat gggtggggca gctctagggt gtgctgttct ggtctccctt caagaaggaa 16561 cttgcccttc ggctgtgagg aatgctggga gctgacagat tccagctatt tttatttttc 16621 tatttttgag gcatgggcct tgctctgtca cccaggctca atgcagcttt gaccttctgg 16681 gctcaagtga tcctcccacc tcagcctcct gagtagctgg gactacaggt gtgcaccact 16741 acactcagat aatttacttt tttattttta gtagagatga cgtctcacta tgttgccctg 16801 gctggtgttg aactcctggg ctcaagtgac cttcccacct cggactccca aagtgctggg 16861 attgcaggtg tgagccactg aacctggcct ctgcctcacc tgttaagctg aggttgtgct 16921 gttctcaggc agccccagcg aaggactgag caaggtactg ggactagtga ttatagctca 16981 ctgtgggatt cctttggtgg gaaatctttg ctccagaact ccccactggg ttggatgaac 17041 attttccaga tctgcatctc gatctgatat tctccctggc caatcctcct tcttcaccat 17101 tttatctttc atggatgtta ctcccaaata aacctgatac taccgcagag aacccaacta 17161 acacattgca ttagcattca attggagttt gatcagagga gcagaaccac catggctgac 17221 atgcaatgag agacttatta catgtgggga ttagtcctca tgcaactgtg ggggcaagtt 17281 atcagtcaat gtgggcggtg gctcttctga atctgatgtt gcatctgaag tcatcagggc 17341 cagcagctgg acaggaaaaa atgtacatga agtaggggat gcaaagacca acaggaacct 17401 acaaggacaa cgtggaacct gagtctgttg ccacctccaa tcttgatgat gtgagtgact 17461 ggcaggagga gccagtgccc atgacaggca cctggtgtag ggtttggaga agctgaagga 17521 gacctggagg agctggaggt accacaggcc tggctgatgc ctcacaccca caggtgaagc 17581 atcaggctag ctgcaaattg tgtaagtggc aagaatgcca tgcacctgtg ctcagagcat 17641 agcaattgct gcaccttcag tcctgtctgc caaatctttt ttttttttta agcgacagaa 17701 tctcactctg tcactcaggc tggagttaag tggtgtaatc atagctcaca gcagcctcaa 17761 acctctagac tcaggttatc ctcctgcctc agcctcccga gtaacaggga ccacaggccc 17821 atgtcaccat gcctggctta tttatttatt tatatttatt ttcttatttt ttttgtagag 17881 atgggggatc tcgttatgtt gcccaggctg gtctcgaact cctgggctca agcgattctc 17941 cagcctcagc cttccgaagt gttgggatta caggcgtgag ccactgcatc cggccagtct 18001 gtcaaatctt gcatatgttt ttctcatggc caacaataac atagaccata gagaagggat 18061 ttcttggaaa tatagttcca gttttgctaa gttaaataaa attaaataaa cctaccacaa 18121 taggattggt taaaataaaa tgatatattc ataaaatgaa atggtattca gccatttaaa 18181 acgatgtcat agatgaatat tatcattaag aaaacaatat aaatagcata aactaatttg 18241 caaacatgca tataaacata taaagaaaag tattttgagg ggttatgttc taagatgtta 18301 catttttaag tcaaaacaag tcagtctggg tcacttctct tctttaaact ctttttttta 18361 gagacagggt cttactttgc tgccaggctg gagtgcagtg gcacaataca gctcactgca 18421 gcctcaacct cctgggctca aataatcttc ccacctcagc ctcctgagta tctgggacta 18481 caggtgtgtg tcatcatact ggggttaagt tttctttttt tttttttttt tagagatagg 18541 gtcttgttgt gttgctcggg ttggtctcga ctccctggtc tcaagtaatc ctcctgcctc 18601 aaccttccta atctccttta aattcttaaa tagtactttg ggaggccaag gtgtgtggat 18661 cacctgaggt caggaatttg agaccagcct gaccaacata gtgaaactcc gtctctacta 18721 aaaatacaaa aattatctgg gcgtggcggc gggtgcctgt aatcctattt actcgggagg 18781 ccgaggcagg agaatcgctt gaacccggga ggtggaggtt gcagtgagct gagatcacac 18841 cattgtactc catctcaaaa aaaattctta aacagttttg cacaattgat caaatctaag 18901 tatttttata gcttccaggt cctctgcaat gcagatgacg tatgcggtct ttccagtttc 18961 atctctgact cccacgtgcg tgtcatttca tgttctagga atgccagctg tttgaggttc 19021 ttcctttcta tgctctgctg tttctcactg actttggctt ttctctttgg ttggactatc 19081 tttcacccca acctttgcct gattagttcc tgcttattat tcaataataa ttttctggta 19141 gtgctttatt gagccatttg cagctcaata gttccccaaa gtaattattt tgaacatcaa 19201 aataaattac ctgggaaact tacaaaaata ctgatacttg ggcgtcacta tagatcttct 19261 acattataat ttccagagac aggacctgca agtctgcata tgattttgaa tatagacttg 19321 tctgcatgtg atagagatcc aatggttgtg tggtgaattt ccatagtttt atgttgcctt 19381 ggtacccatt ttgaatataa gttagacttt ctcatatcag acgcagggcc cagtcaacct 19441 tgacacagtt cttcacctct tcccagttcc tcaatgtggc tgatccagac acctgcctta 19501 tatctcctcc tggtgtagat acaacctact tgtctcaccc cacctaattg actcacccct 19561 ctggccccta caccttgcat aggttgcaca gatatgctac agtgaccacc tcttagtcac 19621 agtgtggctt tgcagagctc ctgcctgctt agtctaaacc caccaattag aacttctctt 19681 gggaaatctg ctggggtaat acccaagacg tatttttttt ttttgagata gagtttcact 19741 cttttgccca ggctggagtg caatggcatg atctaagctc accacaacct ctgccttctg 19801 gttttaagca attctcctgc ctcagcctcc cgagtagctg ggattacagg cacccgccac 19861 catgcccagc taatttttgt atttttagca gagacggggt ttccccatgt tggccaggct 19921 ggtctcaaac tcctgacctc gtgatctgcc tgcctcagcc tcccaaagtg ctgggattac 19981 aggcatgagc caccgtgcct ggctgtaccc aggaccttaa taaaggcttt gagccacagg 20041 tctctctctt tctctcttgc tttccactta caggttcaac acgctgctgt ctccagactt 20101 cccatcagtc cttgcaggca cccttggcct gatgaccaaa gttttaaaat agcttcattg 20161 ttttaaatgg tgccatttta aatatgaaat atatatatat gtgtatatat atatacacat 20221 acacatatac atatatatgt atatacaata tatatgtatt ttaaatggta ccattttaaa 20281 tgtgaaatat atatgtatat atatttgtat atatgtaata tatatttcac atttaaaatg 20341 gtaccattta aaatacatat atatttattt ttaatacata tgtattattt aaatatatgt 20401 atatatgtat aaatatacat atatatttca catttaaaat ggtaccatat gtatatatgt 20461 gtgtgtatat atattttttt taccaaattt taatgataaa ggcataatgg taatggctac 20521 ttcctattga gctaagccct acacaaacat aacctccttt aattctaaca ataaacccat 20581 gagttaggta tttttggtaa ccccacatta tgaatatgaa gtaatattct caaggtcatg 20641 cagtcagtaa gtggtgaagt gcttgctttt agtcattaag cttttcagtc tgttgagaaa 20701 ttgctggttt ccaatcacat acatttttct tttaagtaaa cattccctct gatttttcca 20761 caattttgtg tatgaatttt cataattaga cttcagtgta tactaatttg atatttaaga 20821 ggtctgtttg tcgcttccta gcatgaaaat aattcacaaa agatatttaa tttaatactc 20881 ggatgatcaa cttctcattt taagccatac caaactcttt cccaactctg agcaatattt 20941 tcagattctg ttttaagcaa atgattaatt aataaataat taatattcat gcacactttg 21001 gcatgccaga cacggagcta aataaaagca ggtgttacag ggataagcaa gacagacaac 21061 cccactgcca tggagtttat attctagaga ggtaggcaga ccacacaaca aataaatgta 21121 tgacaaagta agttcaggct ggacagagtg gctcatgtaa tcccagtgct ttgggaggct 21181 gagatgggag gaccgggctc acaggaattc aagacaagcc tgggcaacat agtgagactc 21241 tgtctctaca aaaaattaaa aaattagtgg ggtgtggtgg tacacatctg ttgtcctagg 21301 tacttgggag gttgaggtgg aagaatggtt tgaatccagg agtttgaggc tgcagtgagt 21361 atgattgcac cactgtactc cagcctgggc aatagagcaa gactctgtct atgaaatgaa 21421 aataacattt atttttttaa aaaagcagga tgagggagac tcggtggagg ctatggcaat 21481 agtttaggta gggaggaggt ggtgacagta gaggttagat ttggaaaggc atgggacata 21541 ttttggaggt agagtcaaga aaacatgctc atggtttgca tgtggggcta aagtaaaaga 21601 agaatcagag atggtttgta gattcttggt ttgagcatgt ggttggagaa tggtagcatt 21661 tctagagcta gggatttcta gggatcaata tgcttggaaa aatgaagagt tttgtgcaaa 21721 gtaatttttt ttttgaatga gtgcccattg cagagactgc taggtgtcta ctatagcctt 21781 ttcattcccc tcttttttta ataaaggaaa gccaatttta ccgggggtgg caaagtgtct 21841 ggagaaaaca taacatttct tagtttcctt tgtagctaac ctcttttgct tgtctattac 21901 gtgagctcca tatacataat gtgaatgtga ataaccttga gttttgaaat tacatgctta 21961 gactaagaga aggagcctct atgccaaata actttgcaga gttgccatat cagttctagc 22021 cttttttttt tttttctcac cctgtcaccc aggctggagt gcagtagcgt gatctcagct 22081 cacggcaacc ttagcctccc agcgtcaagc aattctccca cctcagcttc ctgtatagct 22141 ggggctacag gtacacacca ccatacctgg ctaatttttg tagtttttag tagagacacg 22201 atttcaccat gttagccagg ccagtctcaa tctcctgacc tcaagtgatc cgcccacctc 22261 agcctctcaa agtgctggga ttacatacgt gatccaactg tgcccagcca attctagcct 22321 tttatatcag agaaaaaact ctaaattgct taatctacaa taattggaac cctgttacta 22381 gtagttgaat acaattccta tctgatacca tactgattct agaaaaaaat tattaagcta 22441 taatttccag agtttgtccc tgaattccca ttctcctcca agcagaacat gaagttgccc 22501 agaaatagtt tatctgtggt atgcaaatcc agtgtattag aaaatatttt ctgaatctac 22561 ttgcggttcc attatcctca accaacatat gtcagcaaaa taaaacaagc atttgtattt 22621 ttagtatgtc actgttgtgc agtctcctaa attttgatgt tttaagcact atgcctttaa 22681 cattcactaa agctgataac atggaagagg ttgattatgg ttccccgggt gccaagaatt 22741 atacgggttt ctgttccaat gttgaggaca gaaatccttc agcgcagatg gaaacatggt 22801 tggattctgt agatacattt cttattcaca caacagagag tctaatagag acgggacatt 22861 attatttttt tgaagcctct acaaatgccc acttaaaaag ctacaagcat tatatacaga 22921 tacttttttg ttttttttga gacaacgtct cattctgtca cctaggctgg agtgcagtgg 22981 cttgatctct gctcactgca acctctacct cctgggttca agcgattctc gtgcctcagc 23041 ctcccgagta gctgggactg taggcgcctg ccatcaagcc ggctaatttt tgtatttttg 23101 gtagagatgg ggtttcacca cattggccag gctggtctcg aactcctgac ctcggcctcc 23161 caaagtcctg ggactacagg cctgaccatt gtgcctggct cattattact tttatttttt 23221 aaaaaataga tgtgggttct gtcatttagg gtggaggctt gggtttgtgc gtggagtgca 23281 tagttccttt cttttgggaa gggccatctg ggtgagctct tttgctttgg gcagggacaa 23341 tgcaaggaca aataccattt tcatagggga cactcttgct ctctttccaa ggttggggct 23401 ggaaggaatt tctgggccta gtgggagggg gaggccagct cccagcatgt ggagtctgag 23461 gctggggcct gtgccccttg gagcagccca gctggacagt gcatctcacg acctgagggt 23521 gtctaattgg gtctaatcag aggcattaga gagtgacaag tctcctggag gccacagatt 23581 gcagaggcct aagtgtggta acaaagacag agactgaatt caaaagccca gttatgcttc 23641 aaactcaacc cagaaaccag cagtccctgc cgcactctct tcctccccag ttccagtcta 23701 gagaggcaac ctctataatt tttttgtttt tttgagacgg agtcttgctc tgttgtccag 23761 gctggagtgc agtgttgtgt tcatggctca ctgcagcctc aagctcctag gctcaagcaa 23821 tcctcttgcc tcagcctcct gagtagctgg gaatacaggc acatgccacc acacctggtt 23881 aatttttaat ttttgtgtag agatgggggt ctcactatgt tgtccaggct ggtcttgaac 23941 tcctggcctc aagtgatgtt cctgccttgg cttcccaaaa tgttggaatt acaggcataa 24001 gccattgcaa cccagccaat ttcaccacta ggttttgcta cactgctgta ataattggaa 24061 cacctttgtg gtaggcggcc tcaaagatga tccccccagt gatccctgtt tcctgatatt 24121 cacaccctgt gtagtcctct ccaaccttgt accagagttc atctctgttg accagcagaa 24181 tagatcagaa gtgagggcat attagttcta agattagtat ataagagaca ttgtaacttc 24241 tgtcttggtc tttgcttcat ccccatctct atttgattat tcactctaga gggagccaac 24301 tgccatatgg agcagcccta tggagaggcc caggtgataa gtaattgagg gatcttgcca 24361 gcaattttgg gaatgaatag atctttcagc ccctgtcaag ccttttgatg acttcagcct 24421 ccgcttaact gcaacttcat aaaagactga accagaggct gctccctgat ttttgaccct 24481 tagagagtgt gtgagatgat atatatttgt tgttttaagc tactagcttt tggggtgtca 24541 tttgctacac agcaatagct aactgataca acctcctatt tggcttccct ttttaaaata 24601 tatatatata tattttttat tatactttaa gttcaagggt acttgtgcac aatgtgcagg 24661 tttgttacat atgtatactt gtgccatgtt ggtgtgctgc acccattaac tcatcatata 24721 cattaggtat atctcctaat gttatccctc cccccttcct ccaccccaca acatgttcca 24781 gtgtgtgatg ttccccttcc tgtgtccatg tgttctcatt gtttaattcc cacctatgag 24841 tgagaacatg tggtgtttgg ttttttgtcc ttgcgatagt ttgctgagaa tgatggtttc 24901 cagcttcatc catgtcccta caaaggacat gaactcatca ttttttatgt ctgcatagta 24961 ttccatggtg tatatgtgcc acattttctt aatccagtat atcattgttg gacatttggg 25021 ttggttccaa gtctttgcta ttgtgagtag tgctgcaata aacatacatg tgcatgtgtc 25081 tttatagcag catgatttat attcctttgg gtatataccc agtaatggga tggctgggtc 25141 aaatggtatt tctagttcta gatccctgag gaatcgccac actgacttcc acaatgtttg 25201 aactagttta cagtcccacc aacagtgtaa aagtgttcct atttctccac atcctctcca 25261 gcacctgttg tttcctgact ttttaatgat caccattcta actggtgtga gatgatatct 25321 cattgtggtt ttgatttgca tttctctgat ggccagtcct gatgagcatt tcttcatgtg 25381 tctgttggct gcataaatgt cttcttttga gaagtgtctg ttcatccttc gcccactttt 25441 tgatggggtt gtttgttttt tttcttgtaa atttgtttga gttctttgta ggttctggat 25501 attagctctt tgtcagatga gtagattgaa aaaattttct cccattctgt aggttgcctg 25561 ttcactctga tggtagtttc ttttgccgtg cagaagctct ttagtttaat tagatcccat 25621 ttgtcaattt tggcttttgt tgccattgct tttggtgttt tagacatgaa gtccttgccc 25681 atgcctatgt cctgaatgat attgcctagg ttttcttcta gggtttttat ggttttaggt 25741 ctaacatgtc agtctttaat ccatcttgaa ttaatttttg tataaggtgt aaggaaggga 25801 tccagtttca gctttctaca tatggctagc cagttttccc agcaccattt gttaaatagg 25861 gaatcctttc cccattgctt gtttttgtca ggtttgtcaa agatcagata gttgtagatg 25921 tgtggtatta tttctgaggg ctctgttctg ttccattggt ctatatctct gttttggtac 25981 cagtaccatg ctgttttggt tactgtagcc ttgtagtata gtttgaagtc aggtagcatg 26041 atgcctccag ctttgttctt ttggcttagg attgacttgg caatgcaggc tcttttttgg 26101 ttccatatga actttaaagt agttttttcc aattctgtga agaaagtcat tggtagcttg 26161 atggggatgg cgttgaatct ataaattacc ttgggcagta tggccatttt cacaatattg 26221 attcttcata tccatgagca tggaatgttc ttccattttt ttgtgtcctc ttttatttca 26281 ttgagcaatg gtttgtagtt ctccttgaag aggtccttca catccctttt aagttggatt 26341 cctaggtatt ttattctctt tgaagcaatt gtgaatggga gttcactcat gatttggctc 26401 tctgtttgtc tgttattggt gtataagaat gcttgtgatt tttgcacatt gattttatat 26461 cctgagactt tgctgaattt gcttatcagc ttaaggagat ttcgggctga gacgatgggg 26521 ttttctagat atacaatcat gtcatttgca aacagggaca atttgacttc ctcttttcct 26581 aattgaatgc cctttatttc tttctcctgc ccgattgccc tggccagaac ttccaagact 26641 atgtggaata ggagtggtga gagagggcat ccctgtcttg tgccagtttt caaagggaat 26701 gcttccagtt tttgcccatt cagtatgata ttggctgtgg gtttgtcatt ggcttcccaa 26761 cttattgctt cagtgtgaac agacctgctg tacattctgg gccacttttt actctcatcc 26821 tagggattcc ctttcctttt tcctgcttcc actgcttcct ggaccccctg ttttcctctt 26881 tatagcttta atttctcatt ttgatagagc agactaccta gtaacctcct aagaaagagt 26941 gcattggtgg tcaggtgtgg tggctcgcat ttgtaattcc agcactttgg gaggctgagg 27001 tgggtggatc atttgagatc aggagttcaa gaccaacctg gccaacatgg tgaaaccccg 27061 tctctattaa aaatacaaaa attagctggg catgttggca ggtgcctgta atcccagcta 27121 cacaggaggc tgaggcagga gaattgcttg aacctgggag gtagaggttg cagtgagctg 27181 agatgattgc tccactgcac tccagcctgg gtgacagagc aagactccat gaaaaaaaaa 27241 aaaaaaaaaa aaaaaagagt gcattggtga aaaatatttg agagcttgca tgatttataa 27301 tgtctttatt ctgccattac gcttgatggt aaggtgtcag ggtctagaat tttaggttga 27361 aaatagcctt tctctcaaaa aaaaaaaagg aaaagtattg cttcattttt cttctagctt 27421 cttatgatgc tcttgaagag tattagtttt tgtatgtgcc tattatgttt ctttccccca 27481 ctcctgaaaa cttctaggtg ccattattgg tggcagttaa gtgtgcaaac tctagaagca 27541 gactacctgg gttcaaatct cagttccaac aatcattggt tatgtgactc tgggaaattt 27601 gtttaactgc tctgtgcctc agtttcccca tctgtaaaga taacagtacc tacctcacag 27661 agttgctgtg aggatcaaat gaccccctac ctttctgtgc tcctctgtat ccgttgatgt 27721 aaaccttcct caagttgggt tggttccaca caccatttct ggagaggggt agggtctaga 27781 acaaggcggg caagctttac ctgtgaaaga gcagatagta ggctggctgt ggaggctcat 27841 atttgtaatc ccagcatgtt gggaggctgg ggtgggtgga tcacctcagg tcaggagttc 27901 gagaccagcc tgaccaacat ggtgaagccc tgtctctaaa aaaaaaaaaa aaaaaaaaaa 27961 aaaaaaacaa aattagccag gcatggtagc acatgcctgt aatcccagct acttgaggag 28021 gctgaggcag gagaatcgct tgaacccggg aggcagaggt tgcagtgagc cgagatcacg 28081 ccattgcact ccagcctggg caacaagaac aaaactccat ctcaaaaaat aaaaaaaaaa 28141 gcagatagta aatagttttg actttgagag ctataaagct agtgtttcag ctatcgaact 28201 ttaccattgt agggcaaaag ctgccataga caaaataaaa gtgaatgaac ctggctgtgt 28261 tccaataaaa cttttgttta tacagctgca ggcagatttg gctcgctgac cgtaggcatc 28321 tgactgtgaa acactatcct ctcctttttg tacttatttt atccgaccct tcttagaggt 28381 atagagatga ctcctagttg gagacatctc agactaaaaa ctgactacct tatattataa 28441 cttctacaca gtgctctaag tggtccgatg attgagttat atcaaatgca gggtttcagt 28501 tgcaacatta atgaaatagc ctgagtaaac ctttcgtgat gacaagagcc agccaccaat 28561 tatctcatcc ctggattgac attattgtaa gccacaaagt accaagcttt ccttgtgctt 28621 aaactgttga ggaaaatgct tgactctttt tttttttttt ttttgagaca gagtctccct 28681 ctgtcaccta ggctggagtg cagtggcacg aacatggctc actgaatcct tgacctcctg 28741 ggttcaggtg agcctttcac ctcagcctcc caagtaggtg gaactacagg tgcaagccac 28801 catacaggct aatttaaaaa attatttcta gatacgaggt cttactatgt tgcccaggct 28861 ggttttgaac tcctgggctc aagtgatcct cctgcctcag cctcccaaag ggtgggtttg 28921 caggcatgaa ccaccatgcc tgcccagcca actctttctt tgtaacattt ttatacaaat 28981 aaaaacagcc atgctatggc acttttatcc attaattcta ggtcaagtga catgctggga 29041 ctggaaagat gaaaatcact gtgtcaccct caaaggtttg agtaagcaag caatttagat 29101 agaacatccc ctgtgtcaca ataaagccct tttcgccaat cagattagca aagactaaaa 29161 aacaaacaaa agacacaatc aatgcttgtt aggacatgat gaaaagggca ctttgtagat 29221 ttctgatgtt aaataaattg caacccagtc tgactacatt aactctgtgc tgttatgttc 29281 atcccaactc tgttttcttt gtcttttctt tttttctttt ttttgagaca gggtcttcct 29341 ctgccatcca ggctggagtg cagtggcata atcttggctc actgcaatct ctgcctcctg 29401 ggctcaagca attctcctgc ctcaccctcc taaatagctg ggactacagg tgcacaccac 29461 catgcctgac taatttttta gtagagatgg ggttttgcca tgttgtccag gctggtcttg 29521 aactcctgag ctcaggtgaa tctgcccgtc ttggccctcc aaagtgcagg gattacaggc 29581 gtgagccacc atgcctggcc ccaactctgt tttctgctct ggccctctgt gtcttgcctt 29641 gggggtatgg gtgggtagat gggggggagt ttcctctttg ttccccactg actctttttt 29701 tttttttttt taaggttatc tatttattta gagatggagg tctcagtatg ttgcccaggc 29761 tggtcttgaa cttccgggct caaacaatcc tcccacctca gcctcctgag tagctaagac 29821 cactggtgac ccaactgatt cttggggaag gcagccccaa gtgagttgac aacaaaagtg 29881 cctccctagc ctggggccag tgagacccca aatatcaccc ccctcccttc gctgggtttt 29941 tcctgcccat ctttacctga ccattggtca ctggaggagc ctcttaaacc ttctaaccgg 30001 gaactgctcc tgctattgta ttttcaccca cctacacaat tactggacca cagcatctcc 30061 taccacgctc tcacttctca cattgtcagt acatccatcc ccagtccttg cctgccaaat 30121 tcccccatct gtcatcaaca cccccccgcc atagattttc tttccccttc catcgacaaa 30181 aaccactttt cccttctccc aatgttatta tgctagtatt ggttaacact gagctatttt 30241 catttatttg cttcatttga aattttcaaa tacattcacg ctccttgtaa aaattaaagt 30301 aattcagaag tacatgaagt aaaaaggtga gagtccctcc ttcgctcaac tcatctcagt 30361 ccccagaaat aaccactgat agtagttttg ctttcctgtt tccagaacta ttttgcaaaa 30421 ttcaatttgg actaatgggg ttttagtgct catatgcata atttatagaa tatggtggat 30481 taagaaaatc acatctctgg gaattgggct gattcaatcg atctgcatct tatatgactt 30541 tgctgcattt tgacattggc caaactagct tttttttttt ttttttttga gatggagttt 30601 cactctgttg cccaggctgg agtgcagtgg tgcaatctca gctcactgaa gcctctgcct 30661 cccaggtaga agcgattctc ctgcctcagc ctcccgagta gctgggacta taggtgtgca 30721 ccatcacatc tggattttat atatatatat atatatatat atatatttgt attttttagt 30781 ggagatgggg tttcaccacg ttggcaaggt tggtcttgaa ctcctgacct caagtgatct 30841 ggctgcctcg gcctcccaaa atgctgggat tataggcgtg agccaccgcc cctgaccaaa 30901 gccaaactgt attttttttt tttttttttt gagacagaat ctcgctctgt cgcctaagct 30961 ggccaaactg tttttgagac ttgactcctt ctaggcttcc tgagcaccat tccctggctg 31021 ctctttctca tttccttgga ggctttctct tatctgtttc tcccctaaat gctggtattc 31081 ctcagggctc tgttctcatc actcttggtg gattaggcac agagcaaatc atcccttctt 31141 tagacttgca aactcacgtc tactagtgaa atatgtgttg agcaggcagt gcatggtggc 31201 tcgcgcctgt aatcccaaca ctttgggagg ctgaggtggg aggattgctt gagtccagga 31261 gtttgagagc agcctgggca acatcatgag actctgtctc tacaaaaaat ttaaaaagtt 31321 agccaggggt ggtagcacac acctgtggtc ccagctactt gggaggctga ggcaggagga 31381 tcacttgagc ataggaggta gagactgcag tgagctatga tcgtgccact atactccagc 31441 ctgggcagca gagcaagact ctgtctcaaa aaaaacaaaa taaaacaaaa caacaacaac 31501 aaaaacaaac ccaaatctcc aaagccaaac aaacaaaaat actttttttt aaaaatttaa 31561 ttttattttt tattatactt taagttttag ggtacatgtg cacattgtgc aggttagtta 31621 catatgtata catgtgccat gctggtgcgc tgcacccact aactcatcat ctagcattag 31681 gtttatctcc caatgctatc cctccccgct ccccctaccc cacaatagtc cccagagtgt 31741 gatgttcccc ttcctgtgtc cctgtgatct cactgttcaa ttcccaccta tgagtgagaa 31801 tatgcggtgt ttggtttttt gttcttgcag ccataaaaaa tgatgagttc atgtcctttg 31861 tagggacatg gatgaaactg gaaatcatca ttctcagtaa actatcgcaa gaacaaaaaa 31921 caaaaatact tttaaatgag tgttttcttt cacttaactt tatatctgtg aagttcatgc 31981 atgtcattgt gtggagtgtt tgtgcacttt cactgttcta tggtattcca ttgtgtgaat 32041 aaatgacaat taaaaaaact cattctcccg ttgaaggata cttgcattgt ttccaatgtt 32101 tggcttttat aaacaatgct gtctgcagaa ctgtttggat aagagatttt gtgattgatg 32161 gggaagcgag cagatttgca gtgatggagc gatggggaaa aggtggtcct gtgcagaatc 32221 cacctcacct ggggcacccc cagaactggt caccttgtgg gcactagggg atggtacctc 32281 aaaaccagca taggcatcct caaaagagac actctcagct gtggctagtt tatgtggctt 32341 cctgaacctc aatctgaagc cagagagaga aagaacttgg caatttttgt gtttttagat 32401 tgcagtggaa aactggtgtg tgagtgcata tgaggttgct ttttgaaagt aggaatgggg 32461 tgcaggtggg aggctcagag gcattgtaat tcaggaatac gtgtaaatgt tccatatatg 32521 aatccactgg aaacaataat ccacaggcct gctgactagt ggggaagggt ctgggccacc 32581 tggggtgggt gggaatgttt ctcactgggt ctggcctgag cagtgacagg tttttgttgg 32641 tgttgctgca gtagaagggc ttgcttcttt cccattggct aactctgaag tagctgttac 32701 tcaccaaaat ggtgttggaa tgagatactc atgatctgtg cagataagca atgactaatg 32761 gtgcttttgc aagttctgga gtccctccct tggggaagga actagacttg gaaccagact 32821 acgggttggc aaaattaagc ataatttctt tctgtacgtg ctccacttca gcagagatca 32881 tcccggaagt ttcctagtat cctgacttca cagatcacat gatgtgatca acattgattg 32941 aatacttatt atgtatcagg cactggttga agcaccttaa tatacattta aaactcaaac 33001 atcactatga gtggggcaat gtgtatcagt tagcttttgc tgtataacaa accttcccaa 33061 gttttagtgt cttaaaacag cagacatcat tttattttta ttgagataga gtcttgctct 33121 gttgcccagg ctggagtgca gtggtgtgaa catggctcac cgcagcctca acctcctgca 33181 ttcaagtgat cctcctgctt cagcctcctg agtagctggg accacaaacc cgtgccacca 33241 tggccagata attgttaaat ttttgtagat acagcgtccc actatgtttc ccaggctgtc 33301 ttgaactcct gggctcaagt gatctcctgc ctcagcctcc caaagtgctg ggctttcagg 33361 catgagcctc tgtgcctggc attgttttat cttttattga attttttttt ttaagagacg 33421 gggtcttgct atgttgccct ggctggtctt aaattcctgg gcccaagcga tcctccctcc 33481 ccagcttcct gagtagttga gactataagc atagccactg caactgacta agcaatagac 33541 attatttctc acagttctgt gaattttctg ggagtttatt ctgatttggg ctagcttggc 33601 tgcaactgaa tggcctagga tagtctcgct cacgtatctg gcaattgcct caatgtcagc 33661 tggggcgaca gccatgtgtc tgtcagtatc agcccacact ggttcacatg atggtaggaa 33721 agttctcagc agcaagagag taaatcctaa tgtgcaagaa ggcttccagc ctctgcttgc 33781 atcacattgg ttaatgtccc attggtcaaa gcaagtcaca tgactaagac cagattatac 33841 atagaggtga ccttctcaag atatggatgt ggggaggggg tggattatgg tagtcatctt 33901 ataaacaacc agctgcgtac tcttattatc ctcattttac tgatgatggt tagataagtt 33961 aagaatcttg ttacaggtca cacagctgct ataaggtaga gctcagcctc aactttaggt 34021 ctatatggtg atgaagcctg tgccttaagt cctgtgctat ttgtattgct tccttcctca 34081 gacctggact gttgtgtctc agcccaggga agaggagaga tgagacactt aactagcctc 34141 ctccagggac cactgatcat agagcctagc cattgcttgt gcagccagct ggacttagca 34201 acccctgcaa tgctgaagag agtaattctc atgaatctaa caattccgcc attgcactgg 34261 attctgtacc ttcagattag tattggagat ctattacata gaccgtagga aatagatgga 34321 aaggataaag gatggcagca gaaatagggc cccacatttg cttcatgggc aatcatcaag 34381 catctactgt gtctgggcta gattgtggtt attcaggcaa agtgcaggaa ggggaacagt 34441 tgctctctaa agtacatttt caacaagaga acatgttgac tgagggacat taaagactga 34501 gatgaaaggc acttagatgc agaagctgca agacagtaag accttgagat ttgtagggaa 34561 gagcagagga gagaagacat ttcttaataa taaccagctg ctttaccagt gctggtgcag 34621 acactacccc ctctcaccag gtggttatag aaactgcacc ttcttctatc caatttcaag 34681 cccaccatct cttaaattta aggtctccaa aagaacagtt tttgatgaca ctgggaagag 34741 ggtgagcagg cagatttggg gtgggaattg aaggaatcaa atcccaaact ccattttaga 34801 tttgcaatca agctgaaact ctcagcaccc ccagagtatg cctttaaaag gaaaaaaaaa 34861 aggctatttg taagaaaaaa agaccaaggt ttgcatctga atggagtgcc aatcagcact 34921 cattagtggg aacacaatgc ggccccatta gcacgaaagg agctttggca attgctcatt 34981 agctttgtag attcaaataa ttattttaag ggatgcttcg gtgtttgatg agatccagca 35041 gtgtcaatga tatgccactt ttctccccaa agaaagggaa tgttcccagg agtaattgta 35101 ctctcttatc cccctccctt ctcactcaca cctcaatgct tctctatata ctcccggggg 35161 tacaggatct caacaaaggg ctgaaggtgg atagaaactt ctccatcaag atattactca 35221 caggaaaact gtggtcccga gagtacagga acacaaaagt agttatctga agagctggga 35281 gaatctgttt attacacgca ctcccttctt ccattgacag gagggaatct cgatgagctg 35341 ggggaagaat agaaacatgc aatagggtct tgagtacata ctcaagagag aggagtgtgg 35401 gcataccaag ggaaggaata ttcatgaaaa ttcctggaaa aagctggaga tttctcagaa 35461 ttatggtgcc accaattttt acatcaatat gggtgctcct ggaactgtcc tggcactggt 35521 gggtttgtga ttaagaatgt taatgagtat ataggctggg tgcagtggct cattcttgta 35581 atcccagcac tttgggaggc cggggcagat cacctgaggt caggagttca agaccagcct 35641 gaccaacatg gtgaaaccct gtctctacta gaaatacaaa aattagccag gtgtgaatgt 35701 gggcacctgt aatcccagct acttgggagg ctgaggcagg agaattgctt gaacccagga 35761 agcagaggct gcagtaagct gagatcacgc cattgcactc cagcctgagt gacaagagtg 35821 aaactccatc tcaaaacaac aacaaaacaa acaaacaaac aaaaacataa ggttctaggt 35881 gaaacctagg tcaaatccag tgccaattaa gttcaattgg tcttagccag catggtccac 35941 accctggttt ttcagggttt tatcagaccc tagcttctgc agctatttca acatctgcct 36001 tttgcctagc catgtgaaag tgctgcctgg aattttctgt tctcctgtga ccaccctgta 36061 ttattcctgt ctcactttga tgaaaatttc aaagatcaga actctgtcca aggggctgga 36121 aaattggggt gggcacaact tttttttttt tttggctgga aggaacattc tggaggataa 36181 ggtgggtcac aggggaccta tgaaggggaa cttcttaggc ctaattctgc tgtaggcatt 36241 aagggagtgc tcctagagca gtgcttgaca cagggtaaga gctcaataag acaatggtta 36301 aaagcccagg tgttaaaata gatggattta ggctcaagtt ccatcattct cagagatctg 36361 ccctttcacc aaacttctag ggtgattctt atgcacatta ttgtttgaga actgctgaga 36421 cagacagtac tatttctgat aattaatgct ggtcctctgg agcttgtatt tggggaggaa 36481 gaaggatgaa tcccactgtg aggtccactt ctgtggttag aatgtggcaa aaacttggtt 36541 tcctcttcct ctttcccaat aaaacctcca tttatttttt gctgtctact gtcctccacg 36601 tggtcatgtt ctttacagga agcggtctct gacctcaaga agttaatagt aattggtcta 36661 aaccaaccat gttttcctat ttcccttgct tagtgatagg ctcagcatgg gcacatgaaa 36721 taattctggt caatgagaca cgagtgggga gtgtattgga gtttctggca aaggtcttcc 36781 ttaatgattg aaaggagatt gattgaaaga gggaagaaac agccccttta tgaaagaggg 36841 aagaaacagc ccctttactg tctctggtta tttgagaaca tgctggctgg agcttgggca 36901 gccatcttgt gcccatgacg ggggctgatt taggtcaagg ataacagtgg aaatccacaa 36961 agaagtgggc ctttactaat gtaattgagc tgtcaaatca atcaaccatg aagcttcccc 37021 attttattat gtgagataat aacatttctt cagtgtgtaa ggtcagatca gttgcaggct 37081 ctgtttcttg ctctgaaagt tttccagaga ggagggatgg agatgtatac tgagaaggcc 37141 tgaagaagga gggactgaat agagggagaa gcctgagaat ggctaggaac tggagtgcat 37201 ctttaaggag aggccctatg agtggcccag actttactgc agcacagtgg ctaaaatcaa 37261 ggactttaga atcagaatgt cttggttcaa attctacctc caccacttac tagttatatc 37321 accttgggta aattacttaa cctcagtttc attgtttgta aaacaggatc atggtggtat 37381 taacctcaca gggtttaaga aggtgagtag ggctgggcac agtggctcac ccctgtaatc 37441 ctagcagttt gggaggccaa ggcaggtgga tcacttgagc tcaggagttt aagaccagcc 37501 tggacaacat ggtgaaaccc catctctgca aaaaaataca aaaaattagc caggcatggt 37561 ggcacatgcc tatagtccca gctacttggg aggctgaggt gggaggatgg cttgagccca 37621 ggaggcggag gttgcaggga acagggatca caccactgca ctccagcctg ggtgacacag 37681 ccagaccctg tctcaagaaa aaaataaaaa ataaaaaaag attagttaat aatgtaaagt 37741 acctagtgta gtgtgatctg agagacctat gtgtaatgtg ctcagagact gaaacagaca 37801 cccctttatc aactaagaca agcccaaggt taaggaaaca aaagttacct acctaaggtc 37861 aagggttcag gtcctggcag gcatggcaaa tttctaaatt cctatagcta taagaaaaac 37921 cagctgggcg ctgtggctca tgcctgtaat cccagcactt tggcatacct aggagggcag 37981 attgcttgag tccaggagtt tgagaccagc ctcggcaaca tggcaaaatc tcatttctac 38041 aaaaaatgca aaaattagtt ggatgtgggc atacctgtat gcccagctac tcaggaggct 38101 gtgagaggat tgcttaagcc tgggaggtgg aggctgcagt gagccatgat tgcaccactg 38161 ctctccagcc tgggtgacag agcgagaccc tgtctcaaaa aagaaaaaga aaaagaaaaa 38221 agtaaaatca cagccttgct aaactcccta accataggag ctatatcagg tgaatttacg 38281 gcccagaccg ctacaactct ggacagggac cagccttata aacattgttt tcttatatgg 38341 aactgcagac cttaagccag tttcagcggc ttatagaggc tgcacacaag ctttctttgt 38401 gtcctgtagg tcaccttttg acgtaacgag tcaaattcca cctcatttta atgctaaaac 38461 ctgccccgaa gtgaacatgg gatgtatgct acgtcccgtg tgtgtgtgat taagaatgtt 38521 aaaaagcata taggctgggt gcagtgcctc atgccactgc agattcccct tcctgcactt 38581 tgcctgaata accagaatct acccgagaca cagtaggtgc ttaatgattg cccatgaagc 38641 aaatgtgggg ccctatttct gctgccaatg tttacccatt gctcatgcat ttggctcctc 38701 tcataaatat gtatagcgtt ctcccaaacc tgctgaatat gcatgactat tgtataataa 38761 gaccttgtgt gacacaaaac ccaacctgcc ctttctctct tcaaagctaa gtgtggtggc 38821 ttatgcctgt aatcccagta ctttgggagg ctgaggtagc cagattgctt gagcccagga 38881 gtttgagacc agcctgggca acatggtgaa accttgtctc tgcaaagaaa atacaaaaat 38941 tagccaggtg tggtgggggc atgcctgtag ctcactgtag cctcgaattc ctggactcaa 39001 gtgatcctcc tgcctcagcc tcctgagtag ctaggactgt gatgtgtgct accatacttg 39061 actatttttt ttttcttttt gtacagatga gggtctttgt gtatttccca ggctggtctt 39121 aaactcctgt cctcaagtga tcctcggcct cccaaagtgc tgggattaca ggtataggcc 39181 agcatgccca gcctaaccca ttttagacaa ccaataagtt ttatttttta tttttaaatt 39241 ttatttttgg agacagagtc ttgctctgtt gcccaggctg gaatgtagtg gcgtgatcat 39301 ggctcactat agactccacc tcttgggctc aagcaatcct cttgcctcag cctgccaagc 39361 agctgggact acaggcacat gccaccacac ctggctaatt ttttttctta tttttataga 39421 gacgaggtct cactatgttt tccaggcttt tatcgaactc ctgggttcaa gtgattctcc 39481 tgccttggcc tcccaaattg ctgggattat aggtgtgagc cgccatgtcc agccctaata 39541 aattttaata gccagctcca cttctctagg agaatgtctc tttgcccatt gctggacatt 39601 tcagcctgta cagtgtgccc cttggtctga agaaatgaca gttagctgcc caaatccatg 39661 tactattttt gatcctgttt tttaatggca tcgtgaatag ttgcatcttc cattgggtaa 39721 gcaaggccca gtgcagagtc agtgtctatt attgtcaaga cccacttgtg gtcccccagg 39781 gctaccagtg tcggtctcac ttgccagcta cgttcagggc cttcccacca aggaatcttt 39841 tccatagccc tcagcagctt ctgtctctct tgctgacaaa cagaacagtc tcactggcat 39901 tttgtacctc agagggcaca agaagaacct gtctagattc agcccgtctc tgcactgctg 39961 cagggccctg tctcatagac ccagctggcc cccacaaggg agcaaatggg gatatctgct 40021 tgtcgattcc aatcaccttc caaacccgga aggaggttct tctgatgggc agcccatgtc 40081 ttactttaat gtactctcac acttccatag ggctgtcccc tataagggca tctctttaat 40141 aggccaggtt tccatgccct cctgcctgag tatggttagg tctctgacca ctgcccatga 40201 gttagtaaaa acccaaacac aaagactttt tattactgtt caatctttcc atcactgcta 40261 ggaaaccacc atgcaattca gcccactgag ctgatttgct tttactgtct ttgatcagag 40321 tggcatcttt ccaaataaga tgttgtctat tcatcttgga acttggaact gtcacccata 40381 ctccatgcag tttttttgtt tgtttgtttg tttgttttga gacagggtct cactctgtca 40441 cccaggcacc caggctggag tgcagtggca tgaacacggc tcattgcagc cttgaactcc 40501 tgggctcaag cgatcctccc acctcagcct tctgagtagc tgggaccaca ggcatgcact 40561 tggctaattt ttaaattgtt tgtagagagg aggtcttacc atgttgccca ggctggtctc 40621 caactcctgg gctcaagcga tcttcctgct tcagcctccc aaagtgttgg gattaccagc 40681 gtgagccacc atgcccagcc ccaaccagct gttatttgag aactgtttgt agggcactat 40741 ccaagtgaca atagaatcca gcaacgcctc ccacaattcc aaagtctatc tgaggggaaa 40801 agaggctctc tgcgtgtatt tcttcctcgc attccccagg tagaatgatc cgctaaacca 40861 tttccatttc atcatggatc tctgctgcac agtgtcacac tcattacagt gtttctctga 40921 tgtcactcca gtcatcgagg gtgttttcca ccttcctggg atgaatcctt tgactttctt 40981 gtctgcattt ccccattctc ttattaatta atctgatttt tgttattgtt gttgttgaga 41041 ctgagtctct ctctgtcacc cagcctggag tgcagtgtga tctcagctga ctgcaacctc 41101 tgcctcccag gttcaagcga ttttcctgca tcagcctcct gagtagtggg actacaggca 41161 tgtgccacca agccctgcta agttttcttt gtatttttag tagagacggg gtttcaccat 41221 gttggccagg ctggtcttga actcctgacc tcaaatgatt tgcccaccta ggcctcccaa 41281 agtgctggga ttacaggcat aagccaccgt gactggccca gaaaacatga taaggcttct 41341 tgaactacct tttgattttg cagcagcaag gttatacggg gcatccacat tacaggggtg 41401 cccttaacca cagcatttac tatgaccttg gcaatgagca cattcagctg gtgaatatcg 41461 cgattctcat aaagccaggc ccacatgact tgcatatgaa acacatcagc tgcttcttcc 41521 gagatgcact actggatatt tataggagga gacagagtcc cccttctcag ggagaacaga 41581 tcttacagtg gctttttttt ttctttttga gatagggtct cactctgtca ccaaagctgg 41641 ttgcagtggc gtgatcttgg catactgtaa cctccgcctc ctaggctcaa gtgatccttg 41701 cacttcagcc tcccaaatag cttggacttc aggtgcacac taccatgcct ggttaatttt 41761 ctttttcttt tggaagggat ggagtctggc cacgttgccc aggctggtct caaactcctg 41821 atctcaagca atccacccac ctctgcctcc cgaagtgctg ggagtacagg catgaggcac 41881 aatatccagc ctatagtggc ttttatccag tccaccaggc tggctgttct cttgggaata 41941 accacctgtg tgtctgcctc ccatataacc atttgtgatt attttgtagt gagctgtggg 42001 tcaacccaaa cgtgctcttt cactgtagta tttaaaacta aagacatagg ccgggtgcgg 42061 tggctcatgt ctgtaatccc agcactttgg gatggtgagg tgggtggatc acttgaggcc 42121 aggaccagcc tggccaacat ggcaaaaccc tgtctctact aaaaatgcaa aaattacccg 42181 ggcgtggtgg tgggcgcctg taaactcagc tacttggaag gctgaggcac gggaatcgtt 42241 tgatccagga ggtagaggtg gcagtgagct gagatcgtgc cactgtactc taagtattct 42301 aaggatttta agagttgtat gccaggaaat gaggtccaaa accaaatata attttacaat 42361 atcacaagtt gtttccaatg gttgcttatt aagagtagag ctgtgataaa cattcatgta 42421 gaggtttttg tatgaatata aatttttatt tctctgggat gaatattcaa gagtgcaatt 42481 gctaggttat atagtagttg catgttcagt ttgcatagat tatgacaatg agacagcgag 42541 agaagtctaa tgtggctgac tccatcctca cagcctggat gtctttgccc attcctgggc 42601 ataggccaag ctaaccatgg gaggagttta atttacagtt taaccttgaa gcaaggatgg 42661 gaatagtctt accttgaaat ggatcctttc cttgtgcagg ggctgaaact gtctttgtaa 42721 gattaataaa agaccacaag attaggatta tggaaggggc ataaatttta aaatgtagat 42781 atagttttta taatcctttt actgctctgg tatcatgtaa ccagaggtca cgagatttga 42841 gactttgcta attgctcctg tagatgacaa cactatttgt agaatcaatc taaggttgat 42901 cttttttttt ttcttttggg acagagtctc actgcgtcac ccaggctgga gtgcagtggt 42961 gcaatcttgg ctcactgcaa cctctgcctc ccaagctcaa gcaattctca tgcttcagcc 43021 tcccaagtag ctgggacaat aggcatgtgc cactgcaccc agctattttt tgtttttgtt 43081 tttgttttta gtagagatgg ggtttcacca tgttggtcag gaaggtcttg aactcccagc 43141 ctcaagtgat cagcctgcct cagccttcca aaatgctgga attacaggca tgagccactg 43201 catctggcca agattgatct tttgagatgt ttttcaggct tttacattct gagtaaagaa 43261 tggattgact cccactgaac ccatgactca tgattcaatc agtcccgtga cctccaccca 43321 gaggcagact cagtgcacga gggccatttt ccgcacacct gtgattgcat ttccaatcca 43381 tcaacaggac ccattcccta atcccccacc caccaatcta tccttgaaaa accctaacct 43441 ccgagccttt ggagagaatg atttgagtga taactccagt tctcccacgt ggccggcctc 43501 acattaatta aaccctttct ttactgcaat ttctttactg caatatcaca gtctcagtga 43561 actggttttg tttgtgtagt gcacaggaag aacccaacag ataattatgc atggatacca 43621 aaattggggg atctccatta cttctgggcg agagggggat ttcttgctcc ccactaggcc 43681 tccatggata ccttcctaga aagggcagga atggctccat actgcttccc acatggcttt 43741 tactgacatc atggaccagg ggtggggtgg ggagatagga gaatagaggg ttggggggat 43801 aggaagttgg gggatggggg ggtttcttct catgactgct gggaggccat aaaaatccta 43861 actttccact agacctacta tgaccccacc cagcagagag ggagaccttg ctactgactt 43921 gtggagataa aaggcccagt tccctacctg cccttctctg acaccacttc tagacatctc 43981 attacagcct agggagtgtg gaattctaga ctattcactc agctttgctg catggctgtg 44041 ggtgcattca gttttttttt ttttttcctc ccctgtggca tttcctggag aagagttatt 44101 atcatttaaa agttttctgt cttgctaggc tgcctctgcc tgctccctta gctacagaga 44161 acacttttgg aagagcatat ttatctgcac ctcttggcat ttctgggttg ctggcttctc 44221 cagctccaag tctaggctat atcaggcaaa aagaaaaccc agagaactca acaatatgtc 44281 attccttgag tcccgaggtt cttaactggc ctctcttctc tctgcctctc agagcctttt 44341 tttttttttt gagcgagggt ctcactcttg ttacccagat tgtagtgcag tggcatgatc 44401 acagctcact gcagcctcct cctcctgggc tcaggtaatc ctcccacctc agcctccaga 44461 gtagctggga ctacaggcga gtgccatcac tcctgactaa tttttgtatt tttttctctg 44521 tagagacagg gtctcattat attgcccagg ctggtctcaa actcctgagc tcaaatgatc 44581 ctcccccctt ggcttcccta agtgctggga ttacaggcat gagccaccat gcctggctga 44641 cttttttctt cgggaattgt aatgtgccca gctgtcaaga atggatcatg acttgtgtag 44701 gccaaccatt gctatcctat tcccctttgc cagatactca tgttctcagc ctcccttgca 44761 cctaagggtg gccgtgtgac ccagttctaa caaatgatat gcaaaagaag atctttctgg 44821 agccttctat gagtattttt gcattttcat tgtcttcttt cctgctgggg acaaaagaga 44881 gagggcatga aggctggagg tgcagcagct atcttgtgac cttgaggtga taagcctaag 44941 agaaaggcca ggtgaatggt tacagatgag ctcccagaga ctgcagccat gcataagctc 45001 ctgattagac tcattaccta gctccaggtt ctcactgaga aaaccagaaa tgtcttcaca 45061 gtctaaatct tcgctggttg gggttctttt tttggttgtt gttacttgca accagacata 45121 ttttgattga ttcagggagg gaggagttta tgatgcaggg aaatagctgc taataaatta 45181 attacagagt ggatgtaaca caaactgatg tttctcaagt atttcaaagg aagatgattc 45241 ttgaaaggga aggagggaag ctggaactag ggttttggat cataaacact gaataaatta 45301 tatgttttca caataataga cgtgagccag caatccaggg tgggaaggga gattggagtg 45361 aaagaagcca cttgagttct ctctggcctt aagggcctgg aactgggagt tatttctagt 45421 gaaagctccc atttgagagg gccctgagga gctacccctg ggagaactct gtcttagcag 45481 cagtcacaag agatgcaaaa atgtggtcag gggctaaaga aagatggacg ctcttcagga 45541 ttcagtaaac taccgaggga agcaaaggat gcttcttgat ttccaacaat aaggacgttt 45601 ttgatctcaa ataataactc aagaagtaga cagctatggg ttagtttgga gaatacctag 45661 atcaccagga acatcaaggt tgtgtgacca gttgcctgcc atccacatgg aaaatccagc 45721 ttgggaaatg gactctgtct ctcagtctct cacattcact caatacaagt gctcctgggt 45781 caaagaaaga aaagaggctc tggaacatgg ctgggcgcag tggctcacgc atgtagtccc 45841 agcattttgg gaggccgagg caggtgggtc atttgaggac aggagttcga gactagcctg 45901 accaacatgg tgaaaccctg tctctattaa aaatacaaaa aattggtaag acgtggtggc 45961 acatgcctgt agtcccagct acttgggagg ctgaggcagg agaatcgctg gaacccgaga 46021 tgtggaggtt gcagtgagcc aagagagtgg gtgccattgc actccagcct gggcgacaga 46081 gtgagactct gtctaaaaaa aagagagaca ctctggaaca ttctggaaag gggatattta 46141 aaagattatt attattttag agacagggcc ttggtctgtc ttccaggctg gagtgcaatc 46201 atagcttact gcagccttga actcttgggt tcaagcaatt ctcaaccttc caagtaactg 46261 tgactacaga tgtaccccat cacagtggat taattttttt tttttttttt tttttgggta 46321 gagacagggt cttgctgtgt tgcttaggct ggtcttgaac tcctgagttc aagcaattct 46381 cctgcctcag tttcccgaag tgttggtatt acaggtgtga gccactgtgt gtgaccttaa 46441 cagtttaaaa tagagttgga agaactagag gaaagaagga aatgttggga aaggtgtagt 46501 ggctcgaggg atgatccttg cagatttgaa aacgttgctt ctcagaagtg atatctctgg 46561 aaatgaggat ttctgtaaag tgggagggta attgttcacc aggatacgaa atatttctaa 46621 aggggaaata atgtcacatt taatcttttt ttccttccaa taataatttt ctctcattgt 46681 aaaagtaggg catgtcctta aagaaaatgt agaaagtaca agtacaagaa aaaaaagaag 46741 aaaaataaat cgctcatatt tctgctacac taacaaagcc gcttccttta ttgattatgt 46801 ttattttaca ttatttttaa agccataggc tgggtgccat ggctcaagcc tgtaatccca 46861 gttctttggg gggttgaggt gggcagatca cttgaggctg ggagttcaag accagcctgg 46921 ccaccatagt gaaacccatc tctgctgaaa atacaaatat tagctgggtg tggtggcaca 46981 tgcctgtaat cccagctact caggaggctg aggcaggaga atcgcttgaa cccagaaggt 47041 ggaggttgca gtgagccgag atcgtgccat tgcactccag cctgggcaac aggaataaaa 47101 aatgaaaaac aaaaacaaaa acaaaccatg ctacaaagct tgatattttg tgttctgttt 47161 tctttttact ttcccatttc cattatattg tgaacagttt tgcttattat tattacagac 47221 ttttggccaa tatcattttt aatgccgctt actaatatat gaagtgcatg tactataatt 47281 tcctgaataa atcttttcca ttagttggac aattaggttg tttccagttt ctctctacta 47341 aaattgatgt ttcagtgaaa attgccatga gaatgcccac cgtcaccact caccaagcac 47401 ttattgtgct gcgtggcttc cattccacat ttatttcatc ctcacaacaa cccaatgagg 47461 cagacgttat tgtggtttat ttccttttct ttttattaaa ttatgaaaat aaaatgtgtc 47521 tctgtaaaag caattcaagc agaaccaaag gggcctaagg ggaaagacca agcccttcct 47581 cccctttccc tccaatctca ccgctgcaat tccccagaga taacactcat acatttcttg 47641 tatacccttc cagaaatttt ttgtccatat ccaaacacat tgggaaatgg actctgtact 47701 ggcatttcat ggcatgtatt ttactctgaa acttcctatt agtctgagtg acatatctca 47761 aatgtctttt tttttttttt tttttttttt ttgagacgga gtctcgttct gtcttccagg 47821 ctggagtgca gtggtgcaat cttggctcac tgcaacctcc gcctcctggg ctcaagcggt 47881 tctcatgcct cagcctccca agtagctggg actacaggtg tgtaccacgg caccctgcta 47941 atttttgtat ttttagtaga gactggcttt tgccatgttg accaggctgg tctcaaactc 48001 ctgacctcat gatctgcctg cattggcctc ccaaagtgct gggattacag gtatgagcca 48061 ccgtgcctgg cctcaaatgt ctttctatgt cagaaaattt caatcctatg agctggctat 48121 aattattttc attttacact tgactaaatg aaggcttagg aggtgaaaga ctatgctgtt 48181 gaattgcatt gtgtgcccag aaaaagatat gttgatgtcc ttcttatcct ccagtacccc 48241 agatgtgagc ttaatttgga aatagagtca ttgaggtaat tagttaagat gaggctgtac 48301 aggagtaagg tgggccataa tccaatacaa ctggtgtgct tataaaaagg ggcgatgtgg 48361 acacaaaata gacaggcata ggaggaagat gtcataagag acacagaaag aaggccatgt 48421 gaagatggaa gattggagtg atacatctgc agccatggag agattgctgg aaaccaccag 48481 aaactgggaa gaagcaaagg agtcccctgc aggtttcaga gaaggcatgg cctggccgac 48541 accttgattt cagatttctt gcctccaaaa ctgaaacata ctttattatt tatttattta 48601 tttattttgt agagataggg tcttgctatg ttgcccaggc tggtcttgaa ttcctgggct 48661 caagtgatcc tcctgcctcc aattcccaaa gcactgggat tacaggtgtg agccactgtg 48721 cctagctctt tttttttttt ttgaaatggg gtctccctct gtcgcccagg ctggagtgca 48781 gtggtgcaat ctcagcccac tgccacctct gcctcctggg ctcaagcgat tctcctgcct 48841 cagcctctca agtagctgag attccaggtg ccctccacga cgcctggcta atttttgtat 48901 ttttagtaga gatcaagttt ctccacattg gccaggctgg ttttgaactc ccgatctcaa 48961 gtgattggcc tgctttggcc tcccaaagta ctgggattac aggcgtgaac cacagcgacc 49021 gactggctca gctcattttt taaaaaccac ccagtttgtg tgactttgtt aaggcagccc 49081 taagaaaccc aaacacatgt caaagtttca aaactcgtag gtagcagagc tggaatttga 49141 actcaggtag gtgtgaccct aaggccaatg catttaaacg aacaggatct atctattttc 49201 ttttttgagc gaacccctgt atatagtgct tagcctgtgc caggctctag gctaagcatt 49261 ggcatgtatc tctctgaaac tggcatttcg ccttctgggc agtgcagtaa tttcctcctt 49321 gaggttgtct ggtggcttcc accagggggc agacctgggc tttcgtgcac aggcaggaaa 49381 ataacacaca aggcaggagg tggacttttc ccttcccttc ccttcccttc ccttcccttc 49441 ccttcccttc ccttccctcc cctcccctcc cctcccctcc cctcccctcc cctcccctcc 49501 cgtcttccct cccctcccct ccccccccct tcccttccct tcccttcttt ccttctttcc 49561 tttctctctt tctctctttc tatctttctt tctttctttc tgacggagtc ttgctctgtc 49621 ccccaggcta gagtgcagtg gcgctatctt ggctcactgc aacctcggtc tcctgggtta 49681 aagcgattct cctgcctcag cctcccaagc agctgggatt acaggcgtct gccaccacac 49741 gtggctaatt tttgtctttt tagtagagac gggatttcac catgttggcc aggctggtct 49801 cgaactcctg acctcatgat ccacccacct cggcctccca aagtgttggg attacaggtg 49861 tgagccactg tgcctggcca ggaggtggac tttctggcag ttccctctgt agagattgtc 49921 tgtaggtttg gtctaccaaa gcaaagacca cattaaaaac aaaaaaacaa aaaacaaaaa 49981 acaaaaaaaa aaaaaaaaaa aaaagagaga gagagaggat cgacccagac gattgagtgc 50041 aaataaactt ccatgttttc tgagggagag agagccaaac accaggggta caggcagtct 50101 gatttccaga gagggactct gtcatggatg tgatgggaga tgtgttatct gctttttctc 50161 caaacagatc ctgaacccat tctcactcag caagcttctg atgaaaaaac aacgaacatt 50221 tgtgtcatag gcacaattct agagatggtc atccaataat taatcttgcc gggtgcggtg 50281 gttcaagcct gtaaccccag cactttggga agccaaggca ggaggatctc ttgagtccaa 50341 gagtttgaga ccagcctggg taacatagaa agaccctgtc tctaaaaaaa aaaaaaaaaa 50401 aaattaaaaa attagccaga catggtggtg cacgcctgta gtctcggcta cttgggaggc 50461 tgaattggga ggatcactcg agtcctgtag tttgaggctg cagtgagcta tgatcatgcc 50521 actgccctcc agcctgggca acaaagccag accttgtctc taaaaaacaa ataaaaaatt 50581 aatctgcctt tttgtgcaat ggtctcagca gcagcagccg tatctatacc acatgacctt 50641 atacccctag ctgaagcaga gcttagggtt ttgacatgag agatgtaaaa gtggcccctt 50701 aggaggaagt ttgcacattc ctgctgctga gtttctgcag ctgccctggt tgctggccct 50761 gctaagccct ggttgttcag cccttccttg gattctgtta atatacccag tatccatccc 50821 ataatctttc tttttacaat gccttttaag ccagtcgtag ttagtttcta ttacttgcaa 50881 aagcacaaaa caaaaaaaac cctgaattta tatgaataag aatagaaaaa ataaatgatt 50941 aagtatatta tttaaagcca tgcaggcaat taaaaagtta taaataataa agtcattaga 51001 acaaccagca cccagaaaag aggtggaggt tctcaataaa tatttattca gtgaccgcag 51061 gaatttaccc acaataatat caaacaaaac tgatgtagaa acaagggaaa aggggataat 51121 aaggctgaac taaatttcgc tcttttcaat ggagacaata tggttttaag ttgatcaatc 51181 aaaacacaga agcacaatca tattatttag aattatagtg tgaacgccag agttaaaaca 51241 agaaaacgtt ttctgaggag tgggagagtg tttttctgtc tccttctaaa catctccttc 51301 taaactctca accctttaat aatttggcag agggttctaa gcttgtttgg aaacccaggc 51361 tgcgagggag taggactgaa atccagaact cgcccgggtc accggcacga ggcacatcct 51421 cccagaaggg ggtgctgctt tctgatgtgc acagaggtgc tcacaggcca gactccacct 51481 cctgtttgcc agtcccatag gagacaatgg tatctcatta ttgtttcaat gtgcatctcc 51541 ttaattgtgt gtgagatcga gtggtttctc ccgtgtctat aagccatgtg aattgcctct 51601 tcatgtcctt tgcactaaat ggaagctgtg tctcatctcg ttcagaactt tctttttctt 51661 atctgatcgt tttattactc agctcctcag tttagctcat ttttggcata ctagctgggt 51721 tttaaattct cctttcctct tccttcttct cctctacccc tctcttcctc ctctatccct 51781 ctctccttct ttacagattt gattttgtaa atacctccca gaatcctaac actcctgctt 51841 agttccttgc aacagtgttt ggggaatcat acataccaat gacaggaata catctcgtat 51901 cggttacaaa atataaaatg accatcttcc ttttgttcag tgccatttac ttctgcaaac 51961 atccctccat tctttctagc tcttccacaa accctctgca ttctcctatg ggcttagatt 52021 tttagggcaa aagccaagac atcttggaga ttattgtgct tttcagacct gccacaaacc 52081 ccctcagaga ctaggacttt caaagttgac tttctgcttg cttggctggg ggaaagacag 52141 tggggagagg aaaagtgtat aaaaatccat aaattaaagt caaaatgtta ttcaattaca 52201 tttgggtaaa cccagaaaca attctggaag cacttgaagg caatatttat cagtgattag 52261 cccgggggga ttttatgtct tactttgtat aattttctat aaattaggat tatgcatgag 52321 ttttacttta ttttcccccc ttgggcatct ctcatatata tatatataca cgtatatata 52381 tatatataca cgtatatata tatacacttg tatatatata cacatatata tacacgtata 52441 tatatataca cacgtatata tatacatata tatacgtgtg tatatatata tacgtgtgta 52501 tatatatata tataattttt tttaagtgag gtcttgctct gttgcccagg ctggagtgca 52561 gtggtgcaat cttggctcac tgcaacctcc gccccccagg ttcaaacgat tctcctgtct 52621 cagccttctg agtagctgag attacaggca tgtgccacca cgtctggcta atttttgtat 52681 ttttagtaga gacgggtttc cccatgttgg ccagctggtc tggagctcct aacctcaggt 52741 gatctgcctg cctcggcctc ccaaagtgct gggattacag gcatgagcca ctgcacctgg 52801 aaatattttt ctttttaaaa atttttttct ggccgggcac agtagctcat gcctgtaatc 52861 ccagctactc aggaggctga ggcaggagaa tcacttgaac ctgggagaaa aaaaaggaaa 52921 ggaaatttta aaaaaatgtt ttttaaaaat tcctttcttt tttttttttt tgaaacaggg 52981 tctcactctg taacccaggc tggagtgcag tggcacaatc ttgacttgct gcatcctcaa 53041 cctcccgggc tcaagcaatt ttcccacttc aacctcctaa gtagctgggg tgacaggctt 53101 gcaccaccat gtctggccaa ttttataatt tttggtagag atgaggtctc actatattgc 53161 ctaggctggt tttgaactcc tggacttaaa tgatcctccc gtctctgcct cccaaagtgc 53221 tggaattaca ggcatgaacc accactcccg gctctcctaa tattttttat ttgaatattt 53281 tctagaaaaa taacatcgta aaattcagag taacaaaatg tatacaatga acacctaatt 53341 tctctcttac tttcacccaa gaaacaatta caattaatgt tctcttatgt aaattccttg 53401 tgatatgatc tttgtagtgt ctctatgtgt gtgtctctat ctcaatctct aactctggat 53461 atcatcctcc attcttttta cacaagtggg tatgtcacct acagactttt atgtccccag 53521 tgctacaact tgtaaattgc ttaatgtcag agcccataaa aatctacctc atttaaaaaa 53581 atggctgtgt agcattcttc tgtggaagtt ctgtaatata acaaacactc tcccattgag 53641 tgggaaacat tgtttctcat cttttgctac tattaataat gtcaaaagaa ataactttga 53701 gcaaatatct ttactcagat tccagcttat gtctaggata aattctcaaa actagagttc 53761 ctgggtcaaa ggattttttt ttcttcctca tttctaattt ttgatagcta tttcagaatc 53821 tccttccaaa gaagttgtac cagtttatgc tcacaccaac aatgtgtgaa gctggatttt 53881 tttttttttt taagacaggg tctcactctg tcacccaggc tggagtgcag tggttcgatc 53941 tcggcttcca ggctcaagct attctcctgc ctcatcctcc tgagtagctg ggattacagg 54001 catgtgtcac cacacctggc taatttttgt attttttgtt ttagtagaga cagggtttag 54061 ttatgttggc caggctggtc tcaaactcct ggcctcaagt gatctgcctg ctttggcctc 54121 ccaaagttct gggattacag acatgagcca ccacacctgg ccagaagctg aagctggttt 54181 tttttttttt tttttttttg agatggagtt tcactcttgt cgcccaggct gtgaactcag 54241 ctcaccgcaa cctccgcctc ccgggttcaa gtgattctag tgcctcagcc tccctagtag 54301 ctgggactac agatgtgagc caccacatcc aactaatttt tgtattttta gtagggacgg 54361 ggttttacca tgttggccag gcttgtctcg aactcctgac ctcaggtgat ctgcccgcct 54421 cagcctccca aagtgaagac atttatttat ataaatgaag aaagtataca gataaattac 54481 acaaacacaa attagagaaa ctttgatctc tctcttctac ttactaaatt agcaacaaca 54541 ggtaataccc ctgcccaagt ggtgatgcta taaaatgtta cagctctttg ggaatacaat 54601 agggcaaaca taagccatac aattgttcat attattctat ttaataatcc cactcccatg 54661 aatttatcct aataaaatta aaaactactt aagtattcag taatactgat gccctctgtt 54721 gatgaatcca ttttcgcccc cacttatttg aaataccact tttatcttat ttaggggttc 54781 aatgacatgg atactcacag aaagtcttgg gtaactgaac tctgagatcc tgttcatttt 54841 tattgctgac atttttgtct ttcagatggt aaaggaagta tctagaggaa ctgttgcctt 54901 gggccaaatt ctggccaacc aggtggaaca agctggtgag gggaagtgac aaggacctta 54961 gtagaaagtg actgtgtggg tctagacacc ctgtctgcct cctaccactt tggttaaccc 55021 tccagctctt tccagcttag ctgggaacat cccttgtggt cagtatcctt gacagcctag 55081 cactgtctct aataacagaa agtttgacca atgagttccc caaatgtgcc ctcctttctc 55141 ttgtctccac gtttttgcct ctgcccttcc ctttgcctgc aatattttcc ctcactttta 55201 gcctagctaa ctcttcagcc ttcacatctg tttaaaggtt acctgcaatg tgaaggagat 55261 aacatgctaa gctagacact ccataatacc ttctattttc ttcttcttct tttttttttt 55321 ttcttgagac agtctcactc tgtcacccag gctgtagtgc agtggtatga tcttggctca 55381 ctgcaacttc tgcctcctag gttcaagtga ttctcctgcc tcagcctccc aagtagctga 55441 gactataggc acccaccacc acgcctggct aatttttgta tttttagtag agacaaaatc 55501 tcctgttgcc caggctggtc tcgaactcct gacctcaggt gatccacttg cctcagcctc 55561 ccaaagtgct gagattacag gtgtgagcca cagcgcctgg cctgcttcta ttttcttaat 55621 ggtctctatc caagatggat ggggtgcagg caagaaagcc actcaaattc caggggcctg 55681 atgatctgga aagtgggctg aggtaaatcc caagtaaata ctttccattt cctgtttgcc 55741 tcaggactag aaccagctac ataaaagccc aatacaaaat gaaaccgtgc tgggcacagt 55801 ggctcatgct tgtaatccca gcattttggg aggcccaggc gggaagatca cctgaggcaa 55861 ggagttcgag accagcctgg gtagcacagt gaaacctcca tctgtacaaa aaattaaaaa 55921 aatcagctgg acattgtggc atgcacctgt agttccagct actcatgggg ctgaggtggg 55981 aggattgctt aagcttgtga ggttgaggct gctgtgagcc aagattgcac cactgcactc 56041 catcctgggc aacagagcga gaccctttaa aaaaaaaaga aagaaaatga aaagaaagaa 56101 agaaaaagcc acgcttcttt gaggtgtcat ggaagtctag ctgaagattt gtggaagtag 56161 gaaatattta gtgaacaaag tctccaatga gataagcatg agttccagag ccagaggcaa 56221 ggatagcacg gaacaggatt tggagttgga gggatacttc tttagaccct tctgtgtgac 56281 tatcaggact ggatgaagat atttagaatc acaaagagct atcttcatat atgataaaaa 56341 cgccaacagt ggggagagca ttagggaaaa gagctaatgc atgctgggct taatacctag 56401 gtgatgagtt gacaggtgca gcaaaccacc atgacacacc tttacctatg taacaaaact 56461 gcatgtcctg cacctgcacc gtggaactta aaaagaaaaa aaaaccagcc aggtgcggtg 56521 gctcacacct gtaatcccag cactttgggg ggccgaggcc ggtggatcac ctgaggtcag 56581 gagttcaaga ccagcctgac caacattgcg aaacttcgtc tctactaaaa atacaaaaat 56641 ttgccgggca tggtggtggg cacctgtaat cccagctact cgggaggctg aggcaggaga 56701 atcgcttgaa cctgggaggc ggaggttgca gtgagctgag atcgcgccat cgtactccag 56761 cctgggtgac agagtgaccc tgtctcaaaa aaaagaaaag aaaaaaaaaa gaaaagaaac 56821 atcaaccgta agtcataaag gatggatagc acacaaaaga catgcttctc accccctctt 56881 aggtctgctt ttttactcta gccaacactg tggtgcctgg gtgtgcacag ccataaacgg 56941 ggagcatctc cctttggttg cattgtgtat ctttcattct ctgtctccgg gcttttctgc 57001 tttctccaga tggtaggact gtaggaatct gcatgaatga accacactga ctcattcaca 57061 cacctggaaa ggcaagggag ctaacagccc atgcagcaac ccttgacaga tgaaggaggc 57121 ctccatggat aaattgctct tctctggagc ctcctgtttg gagaggattc accatcattc 57181 ctctaaggac ctcagcagga ctgagccgca gctgcccatg gtggcaacca gctcaataac 57241 gtatccgtgg atccgtgatc actccttggt ttcatgcttc tccagcctct gctttcctgg 57301 aatcactttc cagattaaac tggctgcctg caagccctta tctcaagctc tgctctttgg 57361 aagaacctag gctaagacaa caggtgtgag aaattctacc aacgttttag tttttccatg 57421 accgtttttt attttttatt ttcaagacag ggtctcactc tgtggcccag gctggagtgt 57481 agtggtgcaa tcatggctca ctgcagccta gacctccttg ggttcaagcg atcctcccac 57541 ctcagcttcc cgagtagttg ggactgcagg tgcacactac aactgactaa tttttttagt 57601 gtttttcttt tttctgtttt ttgtttgttt gtttgtttgt tttttgtttt tgagacaggg 57661 tttcactatg ttgcccaggc tggactcaaa ctgaagagat cctcttgttt tggcctccga 57721 aagtgctggc attacaggca tgagccactg cgcccagcct ccttgactat ctcaacgttc 57781 cggagtcact gctgttgctg ctactgctgc tgcctgaact ggttcaagag ggcaggaaaa 57841 aagataggct caggaaagtt tccaaacagg gtggggagaa ttcaggatga ggctgaggcc 57901 attggggcct agtaagagac ttgcagaaag gatggaaaga actgggtgga caggatcagc 57961 catgcatgtg gcagagttag gacacattat taggagtata gcattgaaaa caaaggaggc 58021 aaaagttctc cattcttcat ggatctgatc ataggctgga acactgtgcg tcggttctgg 58081 acactatact ttaataaaga caacgaaaaa ctgaagcaaa ttaactcaaa agtcaacaaa 58141 ggtttttcag gacttgttac catgttatac aaagaaataa ggagatttac ttgcagaact 58201 gctaggagaa agggagggat aggggagcta tttttcaacg tgtgaagggt tgtcatgtgg 58261 aagagggaat aggtttattc tgcaggtgag aaataatcag tggaagttag aaggagctgg 58321 atttatctca gcttaaggag aaactctgaa acctgggggc cgccttctaa ctcccaaagg 58381 gtccaaatgc ctggggctac tatgcatggt atggtataca acccaagaag aaccaaaggc 58441 aatggaatca aagtttctta gcttgtctct ttcctgtcgc ctccattcat tccatcaaat 58501 tactgagcat ctcaaatttg catgaaggtc ttccttacaa agaattaaaa acctttgggt 58561 aactttcatc ttttgtccct aggctcttct ctcaaaagga aatacaagtc tccagtgttg 58621 cttcgttttg ccagagtcct tcttgaaatg tggttcatgg aactgagtgt gtctgtgata 58681 gccacgtcca cacaccttgg cagcaagagt catttgtatt agttcatcac ctaggatact 58741 tttttgtaaa caacagaaac agaccctggc taatcctatg ggaaaaaaaa taaatttatt 58801 agattgcaaa ataggtacag gtaaaactaa aggttaatag ttgatgctgg aacatggaag 58861 gttctaaggg tctgggtatc aggaacaaag gaacctatat tgtgcatatc tccctctctc 58921 cctctgtatc tgtctgtctt gtctatctgt ctatgaaccc tgtgctcaca accacaatgt 58981 tgtagaaaga atctttcttt tcttctttct ttctttttct ttctttcttt ttctttcttt 59041 ctctctctct ctttctttct ttctttctct ttctttcttt ctctctctct ctctctttcc 59101 ttctttcttt cttctttctt ttttttactt tttgacacag tctcactcca ctgcccaggc 59161 tggagtgcaa tggcacgatc ttggctcact gcaacctctg cctcccgggt tcaagcgatt 59221 ctcctgcctc agcctcccaa gtagatggga ttacacgcac ctgccaccac actggctaat 59281 ttttgtattt ttagtagaga cagcgttttg ccaagttggc caggctggtc tcaaactcct 59341 gacctcaggt gatcctcccg catcggcctc ccaaagtcct gggattacag gcatgagcca 59401 ctgtgctcgg ctgtagaatc tttttcacag gaagtcaact ttacaacctt accttagttg 59461 aaacctagct ccccatggag tttatttctt ctctcccttt ccattaccat cctgagaaat 59521 tttagtgttc ttgcagatga ctttcccagc atgctgggtg gttgttctgc aatcttctca 59581 acgagtgacc ttctcctctg tttcctcatg gccacaccct ggacttggac atcagctaga 59641 gctgctctac ttgaaaaatg tcagaaatgg atacttctat ctggtcatag cttcctatct 59701 cttcacctct tctacacctt cattcccagt taacctattt ttcaacctca tcaaagccac 59761 tggttccagc caggcttagt ggctcatgcc tgtaatccca gcactttggg aagccaagaa 59821 gggcagatca cttgaggtca ggagtttgag accagccttg ccaacatggc gaaaccttgt 59881 ctctactaaa aatacaaaaa ttagctgggc gcagtggctc acatccataa tcccagatct 59941 ttgggaggcc aagacgggca gatcacttga ggtcaggagt ttgagaccag tctggccaac 60001 atggtaaaac cctgtctcta ctaaaaatat aaaaattagc cttgtatggt ggcaggcacc 60061 tgtaatccca gctacttggg agtctgagat aggagaattg cttaaacctg ggaggtgagg 60121 gttgcagtga gctgagatcg ccccactgca ctccaacctg ggcaagagtg agactctgtg 60181 tcaaaaaaaa aaaaaaaaaa atagccactg gttcctcaac tcctcttggc ttcaccttct 60241 tccccagctg ggagtctgtg gttagtgaaa tgaacatccc tgtgaccaat aattccagtt 60301 cctccacacc tttgcctttc taaccattca ttttctcagt tcctatttcc aggctttttg 60361 gcatgactgg agaaaaatca cagtatcaaa ctctttggct tcatgaaaaa ttcttagttt 60421 atatccttag cttgatcttt acactatcaa caatcctttt atgaattctt agatgttctc 60481 cttctcatac aagcctttac cacattcctg aaggtctcta ctccattgta ttatttgatt 60541 cccgagatga ctttacttac agagcaaata taagccaaca atttaaaaca acctcaactt 60601 tcctttcctc cattttgaaa tcagtctgta ttttcctaga ccttaccata atcatggcat 60661 tttcagtagg acaaggatac tgtctaattt tcttttcttt ttttcctttt ctttcttttc 60721 ttttctctct tctcttctct tcttttcgtt tttttttttt tttttttgag acagagtctc 60781 actctatcac ccagcctgga gtgcagtggt ggcatcttgg ctcactgcaa tctctgcccc 60841 ctgggctcaa gcgattctct cgcctcagcc tccagagtag ttgggattat aggcatgcgc 60901 caccacgccc agataatttt catattttta gtagagacgg ggtttcacca cgttggccaa 60961 gatggtcttg aactcctgac ctcaagtgat tcatcctcct ctgcctccca aagtgctggc 61021 attacaggcg tgagccactg cacccggccg gatgctgtct gattttcaag gatgtgttca 61081 cctctttctt caccatgtct cctatcttct ccccaggaac cttgcttcag caattaattc 61141 tctatccacg tactgtttct tcttttcctg ctatgcctgg tttctccctc atcttatatc 61201 tatttcatgt ctccatcatc taaaaattta aaatccagca attccagccc ctttacttaa 61261 tcctatgttc gcctcaaatc atgatcatat tgctccccgc ataatcatcc aaacctctgg 61321 aaaaaatgca tatagtatac attttaaaaa tgtatctcct ttgagataat tgtagattca 61381 catgccattg taaaaaatat tacagaacac atacataatg cagtactatg gagccataaa 61441 aaagaatgag atcctgccat ttgtgacaac gtggatggaa ctggaagtca ttatgtttat 61501 gaaataagcc aggcacagaa agacaaactt catgtgttct aatttaactt gtttgtggga 61561 actaaaagtt aaaacatttg aactcgtgga gatcaagagt agaaggatgg gtaccagagg 61621 ctgggaagag tagtgggggt ggggggatgg gtaccagagg ctgggaaggg tagtgggggt 61681 ggggggatgg gtaccagagg ctgggaaggg tagtgggggc gcggggagaa agtgggaatg 61741 attaatgggt acaaaaaata gaatgaataa tatctagtat ttgatagcac aacagggcat 61801 ctatagtcca tggtaatcta actgtacaat ttaaaataat taaaatagta taattggatt 61861 gtttgtaaca caaaggataa atgcttgagg ggatggatac cccagttacc ctgatgtaat 61921 tactatacat tgtatacctg tatcaaaata ccccataaat acatacatct actatgtacc 61981 cacaaaaatt agaaattaaa aataaaaaaa aaataatata gagagatccc atataccctc 62041 cacccagttt tctccaacag aaacaccctg tgtcacagta gggcgatact ccaaccaggg 62101 aattcacatt gatataatca attgatctta gtccgatctc accagcgttt cagccattca 62161 tttggtgtgc gtgtgtgtgt gtgtatttag tttcttgaga ttttatcaca tgtgtagact 62221 cctgtgacca ccagcagtga agacaagaac aattccatca ctgcagagat cctcgtgcta 62281 tttattcttt tatagccaca cggccatctc ccttcctcta tctccttgtc cctggcaacc 62341 cctcatacat tttccatttc tatagttttg tcattttaag aaagttatat aaatgcagcc 62401 atgcagcatg aaacctttgg gattggtgtt ttccactcag aataattctc ttatgatcta 62461 tccaagttgc tgtgcatatc aatagtttgt taccttttat ttctgagtgt atttctattc 62521 tatggtatgg gtatactgta gtttgttgaa cgattcatcc ttagaaagac atttgggttg 62581 tttccaaatg tctttccagt tttttttttc aattacaaat aaagtggcta tgaaaatttc 62641 tgtatggatt tttgtgtgaa cataaatttt catttgtcta ggatatacct ccaacagtgc 62701 agttgctggg caattgcatg tatttagttt gattgattga ttgattatgg caaaaaacat 62761 ataccataaa tttaccctcc caacgacttt taagtgtaga gttcaggagt gttaaatata 62821 ctcacattgt tgtgaaacaa atgtccagaa tgttttcatc ttgcaaatct gaaactcttt 62881 acccattaaa caaccacgcc tcttttcttc cttccctcag cccctgaaaa tcactattct 62941 attttctgtc tctatgaatt tgactcctca gggtttctca tgccagtgga atcatacagt 63001 atttttcttt tttgtgactg gcttatttca tctagcacag tatcctcaag attcatctat 63061 gtttaacatg ggtcagaatt tccttccttt ttaagtctgg ataatattcc gttgtatgta 63121 tataccatat tttgcttatc catttcgcca gtaatgggca tttgggttcc ttctacatgc 63181 tggctgttgt gaataatgct actatgaaca cggtgtacaa atatcgcttt gagatccaga 63241 tccaattatt ttggataatt gaagtggttt tgctggatcg cgtggtagtt ctgtttttaa 63301 ctttttgagg aacctccaca ttattttcca tgctatgttg gaataatttt acaatggtac 63361 aacaaaaatg ttgcatcatt ttacaattcc accaacagtg ctgaagggtt ccaatttctc 63421 cacatcctca ccaacatttg ttattttccc tcttcctctt cctcaccaac atttgttatt 63481 ttccctcttc ctcttcctct cctgctcctc cttcttgtag ccagcctaat gggtataaag 63541 tgatatctca ctgtggtttt gatttgcatt tgtttgatga ttagtgatgt tgactacctt 63601 ttcatatgct cattgtgttt tgaatatcat ctttggagaa atagctattc aagagctttg 63661 ccttgaatat tttaaaattg ggttatttga ttttttattc ttgaattgta ggtgttcttt 63721 atgtattctg gatatctacc ccttgtcaga tagatgatta gcaaatattt tctccaattc 63781 tgtaggttgc ctttccaatc tatcgattat gtcctttgat acacaaaagt ttttaacttt 63841 gatgtagttc caaccacatt tagtttgtaa gaaattgcca tgctgttttc gagagtggct 63901 gtgccattct acagtcccac cggcaatgga tgtatgatcc tgcttttcca atttcttacc 63961 agaatttggt gttatcactt tttttaaaat tttagtcatc ctcataggtg tgtggtgctt 64021 tctcactatg gctttaattt gtattactct aatggttaat catgttgaac atattttttt 64081 tttagatagg gtctttctct gttgcccagg ctggatcaca gttgcgttat catggctcac 64141 tgcagcctcc acctcctggg ctcaagcaat tctcccacct cagccccact agttcctggg 64201 actacaggga tgtaccatca tgcctggcta atttctttat tgatttttag tagagaagag 64261 gtctcactat gttgctcagg ctcgtcttta acttctgggc tcaagcaatc ctcttgccct 64321 agcttcccaa agtgctggga ttacaggcat gagccaccat gccaggctga acatattttc 64381 atgtttttat ttgtcattta tatatcctct ctcatgaaat gtctgctgat gttttttttt 64441 tttctcactt tccaattggg ttgttagttt taatactgct gagttgtgag ggttctttat 64501 gcgtggtaga tactagtctt ctgtgagtta cgtggtttgt gaatactttc tcctagtatg 64561 tagtttgttt ttgcatctct taacaaggtc tttcaaagaa aaaattttta aaaatttcat 64621 gaggtccaat ttatcaattt ttccttgctt ttggtgtcaa gtctaaggaa tctattatat 64681 attttttaac cctttactta gcatttcctt tttgagacaa acattgctcc gcttatttca 64741 taaaaatcct gttcaatccc tgtggcttga aggctaacaa gggcacagca aatcagaata 64801 ctccatctcc caggctggag tgatagggtc aggcgtagtc atgtctccca agctggacca 64861 tgcagaattg tcctccctga gacttttgct acagtcactg gggaggctgt gctccacttc 64921 tccatgtgga gagagacttt tttttctttt ttttgagaca gagtatcatt ttgtcgtcca 64981 ggctggagtg cagtgatgtg atcttggctc actgcaacct ctgtctccca ggctcaaatg 65041 tttctcctgc ctcagtctcc caagtagttt ctgagattat aggtacatgc caccatgcct 65101 ggctaatttt tgtattttta ttagagacgg ggtttcacca ggttagccag gctggtcttg 65161 aactccagac ctcaagtgat ccacccacct cggcctccca aagtgctggg attacaggtg 65221 tgagccacca tgcccaggat ctatgtggag ggagactttc tgagaagaag gcaagtagaa 65281 ccaagaaatg gagagaaaga cagtgtttcc attatgtaca gtgggttctc catatccaag 65341 gattctgcat ccttggattc aaccaattgc agcccaaaaa tattaaaaaa cagggctgag 65401 aatggtggct cacacttgta atctcagcat gttgggaagc tgaggtggaa ggatcatgag 65461 actaggagct cgaggccagc ctgggcaaca tagagagacc tcatctctac aaaagaaaaa 65521 caaaaaattt agccgggcat ggtagtgtgt gcttgtagtc tcagctactc aggagactga 65581 tgtgagagga tcacttaagc ctaggagttt gagcggtgag ctatgattgc acctctgcac 65641 tccagcctgg gtgatggagt gagaccctgt ctcaaaaaac aaacaaacaa aaaaccaata 65701 cagtataaca gttatttaca tagctgttac attatattag atatataagt aatctagaga 65761 ggcttcaaag tatataaaat tatattcatt ggttatatgc aaatcttaca tcattttata 65821 taacagtttt gagcatcgtc agattctggt atctgagagg gtcccggaac aaattcccca 65881 cagataatat ggagtgactc tagtttgaac ctctgaatcc aagtacgtct gatgctagat 65941 actctttgag tttcagttat aagagctgtt aaatctacac cctcaaacta agcggtttaa 66001 ggtgggatta tgaggcacat aatcgaaaga gtccaagata atgtgacatc ttatatttat 66061 tattcatcct ttcttttctg tagtttactc ttcgattcct catagtctgc ttttcatgtc 66121 acagcactga aactgtcctt gctatgttga ctggtgacaa gaattttatg gtctcttttg 66181 agcccttttt atacttaccc tatttgttat atttggcatt aaataaaatc ttttttctaa 66241 tatctctctc tctctctctc tctctctttc tctctctctc ttctgtttct tgggattgta 66301 acgtggggta ttttctacta cacagcagag gccatgatgg gcatcatcac ttaaattata 66361 tcatctaccc ctccatctgt ttcacatgtg ttatgccatc tttctatgta tgtgtgtatt 66421 tggaattttg gacctcaaaa ctggactttc cttctcatct tctggactct atgtaattta 66481 aggatgtata aatttgcatc tgatttttgg gtaaccatcc ctcaaatttg gctcatatta 66541 tgcaggcagt cagcctggac tccttcacag aaacttaaaa aatatccagc caggctgagc 66601 gcggtagctc atgcctataa tcccagcact ttgggaggct gaggcaggtg gattacttga 66661 gctcaggagt tcgagaccaa cctgggcaac acggtgaaac cccttctcta caaaaaacac 66721 aagaattagc agggcatggt ggtgtgcacc cgaagtccca gctactcaag aggctgaggt 66781 gggaggatgg cttaagccca ggaagtggag gttgcagtga gccaagatca tgccactgca 66841 ctccagcctg ggtgacagca agactgtcta caacaacaac aacaacaaat attcagccaa 66901 tttgggatgt ctggcagtgg gtgaaccacc tcttatcgtc tgggaactga cctccccttg 66961 taacacctca actgatatac aaaaataata tacatcggtt aatatcagtt caactgtctt 67021 aaacagaaag gaaaattaat cagctcacat aacaaaaact tccaaggtag cacctcttca 67081 gggttagtaa attcagtggt tcaatgtcat catagagaat ccttcccatc ttctgccatc 67141 ctcaggctag tacccctcat agtcccaaga tagcagagtt ctaggggtcg ccaagataga 67201 ctgcaaaaat tgctgcacta cattttatcc cctcccataa agaaatatgg tctatttctc 67261 cacccaacca ggcttagcca cgtggcttgc tttggtcaat gggacaacag caaatgtgat 67321 gcttgaagat atttgaaaag tgcttgggca gggcgcggtg gctcacacct ttaatcccag 67381 cactttggga ggctgaggcg ggtggatcac ttgaggtcag tagattgaga ccagcttggc 67441 caacatggtg aaaccccttc tctactaaaa atacaaaaat tagctgggca tgatggcaca 67501 tgcctttaat cccagctact tgggaggctg aggcaggaga atcacttgaa cctgggagtg 67561 gaggttgcag tgagccgaga ttgttgcagt gagccgagat tgtaccactg cactccagcc 67621 tgaaggacag agtgagactt tgtctaaaaa aaaaaaaaaa aaaaggcctg acgcgggggc 67681 tcacacctgt aatctttgag aggccgaggc aggcaggcag atcaggctgg ccaacatggt 67741 gaaaccttgt tctactaaaa acacaaaaaa ttagccaggc gtggtggtgg gcacctgtct 67801 tcccagctac tcgggaggct gaggcaggag aatcacttga acccaggagg cggaggttgc 67861 agtgagctga gaattgcact ccagcctggg caacaagaga gaaattctgt ctcaaaaaaa 67921 aaataaataa ataaaaagaa aagaaaagtg cttccccgga cccctgccca accctcttac 67981 tgtactttga gagcctgaca tatcatgtta ggaagcctgg actaggttcc tgatgtagtc 68041 aagatgagcc atcctcagct gtcccttgta gatcaaacag cttgtgaact gctagacatg 68101 tgaatgaagg catcctagac catgcagtcc tagctgagct agctcagatc agaaccagct 68161 ggctaaccta caaaatcatg agaaataata ggtgattctg tattgtcgct gaatttgtat 68221 gtgtgtgttt tttcttttcg agacagggtg ttgctctgtc acccaggctg gagtgcagtg 68281 gcacaatcat agcttactgc agccttgacc tcttgtgctc aagcaaccct cctgccccag 68341 cctcccaatt aactggacta caggcatccg ccgctgtgcc tggctaattt ttatttttat 68401 ttttgtagag acagagtctc gctatgtggc tcaggctggt tttgaactcc tggccttaaa 68461 tgatcctccc actcttcctc ccaaagtgct gagattacag gtgtgagcta ctgcaactgg 68521 cctagggttt tttttttgtt gttgttgttg ttttgagacg gagtctcact ctatcaccca 68581 ggttgcagtg cagtggcacg gtctcggctc actgcaacct ccgcgattct cctgggttca 68641 ggcgattctc ttgcctcagc ttcctgagta gctgggatta caggtgaggt gcgtgccacc 68701 acgcccggct aatttttgta tttttagtag agacaggatt tcgccatgtt ggccaggctg 68761 gtctcgaact tctgacctca ggtgatccgc ctgccttggc ttcccaaagt gctgggatta 68821 caggcgtgag acatcgcacc ccactgtagg gtggttttta atgtagcgaa agctaattga 68881 tagaatcaca cacagacaca acaacataca gtaacaggag agtggccgtc ttgacattgt 68941 atctattttt agctgtgagg aatgtttcct tatgtatcat tagctagaac tgaatcacag 69001 gtctatttct aaacccatta ctagcaaagg gcacggaatt accatgatta gcttggaaaa 69061 gcactggggc catacagaga agagtagact atccttcaca tggggctttg caaaaacata 69121 ggagccctct atctttttgg gagaatcaca ctgggagaag ggatggtcag tgaagggacc 69181 cttggtttcc tattggcatt gcattatcgg ggggaagcaa ttgtattttc tgattcttat 69241 cgttctccca cctcccacta tggcaaattg gactagggat gcactgtgga agaaagatta 69301 atcaattaga gtctctctct gggaatttgg aattgagact atgtgagagc atgtgcgggg 69361 gatggagatt gtgtcaggta cactgggggg atgtggggag gcacacacac tcctacactc 69421 aggaaggagg tatttcccaa tttggaaaga gaaaagtaga tgagaggaga ggagagagag 69481 atcaatccaa ttcacatttt ttgttttcta gatctggggg cctcgctgga ttgctgctct 69541 tggcggcatg agacacctct taattcttgt agtaatttcc ccttcatttg cttgagttag 69601 ccctaaagag cttctgtcac tcacaataaa agtgttttta atgtgccatt cttatgaaga 69661 gatgctcaac ctcattcata ataagataaa tgaaaattaa aattccacca agagactatt 69721 tatcacttat cagcttggcc aaaatagaaa agcatgacaa tgttgtgcgt accgctgtgg 69781 gggaaactgg cattgtcatc atgagtagga tgagtacaga aggccatagc ctctgtggag 69841 ggcaagctgg ggacatctat cagccttgaa aatgcactta ccctttgacc tagcaatccc 69901 tagttctggg aatttattcg acagttgcac tgctgtgggg aggagatgat gtatgtccaa 69961 ggtcatttgt tgtagcatca tttgcaatag caaaagtttg gagatgaccc aagtttttac 70021 tgatgaggga ctggttaaac taacagttat acagaatatg atacagctgt aatgattgat 70081 gaagttctct atgttctgat acagaaagaa ctccagaaaa gttgtgagtt ttttgttttt 70141 cttttgttta ttttatttga tttgtagaga cagggtctca ctctgttgcc caggttggag 70201 tgcagtggtg caatcataac tcactgcagc ctcaaactcc cgggctcaag tgattctccc 70261 acctcagcct cctgaatagc caggactata ggcgtatgcc accacaccca gctaatttta 70321 actttttttt tttttttaat agtgacaggg tctcactatg ttgctaaggc tggtcttgaa 70381 ctcctgagct taagcaatcc tcccacctca gcctcccgaa gtgttgggat tacaggtgtg 70441 agccactgca cccaacccag aaagattgtt acgtgaaaaa ataaagtgca gaatatttta 70501 catagtgaga tacttttttt tctgtaagaa aagaggaaag gaagaatata tcagaatata 70561 tgcagcattg atttttaaaa atatttaaat aagtaaattt aataaaggaa gtatggaaag 70621 aagaataaga aacgataaac taataaatat gattacctct agggggtgaa gagaaaggct 70681 aggacgggag atgctcctct ccatgaaaac ctttttatat tattttagct cttaaaaaat 70741 gaacacatta actgttctaa aaatgaatgc aagtaaaacg tcatatttta aagtgaattc 70801 aaatattaag taattctatt ctatccaatt gtctaaagaa ttgagggcag gccaggcatg 70861 gtggctcatg tctgcagtcc tggcactttg ggagcccaag gtgggagaat ggcttgagcc 70921 caagagtttg tgaccagctt gggcaacaaa atgagaacct gtctctacaa aaaatagaaa 70981 aaatgggctg ggtgcagtgg ctcaagcctg taatcccagc actttgtgag gctgaggtgg 71041 acggatcatg aggtcaggag atggagacca tcctggctaa cacggtgaaa ccctatctct 71101 actaaaaata tgaaaaatta gccgaacttg gtggcatgcg cctgtagtcc cagctactcg 71161 ggtggctgag gcaggagaat cgcttgaacc tgggaggcag aggttgcagt gagctgcact 71221 ccagcctggg caacagagtg agatttggtc tcaaaaaaaa aaaaaaaaac aaaaacaaaa 71281 aaacaaaaaa gaaaagaaaa aatgagccag ggtgtagtgg tgtgtgcctg tagtcccatc 71341 tattcagcag gctgaggtgc gaagatcact tgaccccagg aggtcgaagc tgcagtgact 71401 cataatcaca ctttagcctg ggtgacagag tgagacccta tctaaaagaa aagaaaagaa 71461 aagaaaagaa aagaaaagaa aagaaaagaa aagaaaagaa aagaaaagaa aagacaaaaa 71521 gaattcaggg ccagggcagt ggaagaaaac atactatgtt tataccttga catttgttta 71581 aaaaaaacac ccaaacctgg aatgtaccaa tgaaaataca aaagcagaga aagtgtgtgg 71641 taagtatccc tttaactgag tagtttgtgc tttctacatt tggcgctgtg gttgcttgtt 71701 gaaacctccc ctctccaggt cacactacat ccggaagcag tattatggat gggcattctg 71761 cttggccagt tgaagattaa gctgaccagc caagcataga aaagaattcc aggacagaga 71821 gagatgttgt attaaacatt tcttttcaac ctgaagggat caggtggcct cttagagaaa 71881 tgacagaacc tattttctcc ttctcatctg ttctcgccgt tggcaatctc ttgttgtagg 71941 agggagtaag taaagattgc ctgggggtgg aggggtaata gagctggcat cttttcacag 72001 atgtgggcgg gagggttctg ttagcctggg agccccatct cattggatac tcaggagcat 72061 agattcggga gtcagagtgg ggtggatgga gcacagctgt cctacgctag taactgagtg 72121 accttgggca cacacttgac tcctctgtgc ctcagtttct tcatctttaa aaggggaaag 72181 tgatatcacc tacctatatg attaccatga ggattaaatg agttgataca ttgaaaaatg 72241 ttagtagaat gctcagcaca tatgaagtcc tcaataaatg ttagcactga aagaaataat 72301 gataatttta ttattatatg acttcagact aaataagaga gttgacaagt atgtaaatgg 72361 ttgccactcc acatctctcg ccaaccaatt tactgcatcc ttgtgaagaa ggttagagga 72421 gagatgtgtg gacactgatt cacactgatc taagagaata aatatagttc cagtagctat 72481 tatgtgtgag gcactgtgtt tggcccaggg aaggataaat tggtaagcaa aataggacta 72541 tctctgcctt cacagaggtc ctaagctacc cagggagatg aatgtgatgg actgagtatt 72601 tgtgtttccc ccaaattcat gtgttgaagc cctaaggcct gtgtgatggt atttggaggt 72661 gaggtccttg agagataatc aggtttaggg gaggtcgtgg tcctcctgat gggattaacg 72721 tcattctaag aagaggaaga ggccagaact ctctttttcc atcttgtgaa gataaactga 72781 gaaggttgct gcctgcaaga ggggaaggga gccctcatca gactccggat caggtggcac 72841 cttgaacttg gacttcccag cctccagaac tgtgagaaat aaatgtctgt tgttcaagct 72901 acctaatctg tgatattttg ttctggcggc ctgagttgat gaagaccatg gccaagtaaa 72961 cagacaacac aatgctgtgt aagtgcaggc cagttcagga tgccagagga tcacagggaa 73021 ggaacatcta atttaaaccc taggaatcag gaaaaacttc ttacaggcaa ggacaggaac 73081 agcagacccc cactccactt ggtaataaaa caactatctc tttcctacag caaaaatagt 73141 agtaatagta atagtagtag ctgtagtagt agcagtagta gtagtatggt aacgataatg 73201 ctaataatta aaacacattg agggcttgtt ctaagcatgt tacatgcatt agctcattta 73261 atcttaacgt cttagtggag ctactctgca cttacatgac ttggccttac cctaattctg 73321 cccaccataa cctatattta ttttaaaagt tgatatttcc ctcatcaggg attttttgca 73381 ttagttttga tgtttttaaa tattgcatta aatgttgttt atcttgctta ctgatttttt 73441 tttttccttt ttgagatgga gtctgtcgcc caggctggag tgcagtggca tgatctcagc 73501 tcactgcaac ctccacgtcg tgggttcaag tgattctctt gcctcagcct cccaagtagc 73561 tgggatgaca ggcatatgcc accacacctg gctaattttt gtatttttag tagagacagg 73621 ctttcgccat gttggccagg ctggtctcaa actgctgacc tcaagttatc cacccatgtt 73681 ggcctcccaa agtgctggga ttataggcat gagccaccac gcctggcctt acttactgat 73741 tttttggcat ccctttaatt ttgcactcag ggcaagcatc tttctctcct tactctagtc 73801 ctggccctgc tgaggaacac catgtgggga agggtaagca gagctgtcta gcctgcccaa 73861 ttcaggctgt ctatgcagaa ggcatttgcg gttcctcctt tccagacagc atcatgggag 73921 gaaagagaaa gagaacgcat tccctatttg gactcagtca ccccttagca tcctccttcc 73981 caaatagggt ccctgcaatg gcttttagta ttactttaac gtggcatcaa gtcccactga 74041 atctcacaat gagagtcctt ccttatttaa cttgctttca attattctaa aaccatcaag 74101 gagaaagtgc cagatcggtg ctaataggtt tttaacacct ctctaatact taatgagcag 74161 gcctctgact ttcctcccct gcctccttga atgaagaggg ttatggagct ggaattttag 74221 aatcacaagt ctgtccttga atctgagccc cattactgta ctattactgc tgggaggacc 74281 agacgactgt gatgggaatg atgcttcagt ctggatgtct gacctctttg tttgtgccct 74341 cgtaccttca tttttttcag tttccctttc tagaatttag taaaccaaac aagaactttc 74401 cctgacctga atactgggtt caattgtgct gtgttcatta ttacgcagca ttttacccac 74461 gattacttcc tgttcaagtt cctgtttgtt gctgtactct gagaccccac ctaggtactg 74521 gaggctcttc aagggaagag gaagacaaaa cctcctttct gaaggggaac ttagctcagg 74581 gagtggggta gatacaggac cacgtggcat cacagatcca actgttcaga gtcagtctaa 74641 aggacttaaa ttttagtaat tctacttctg ggtgtattcc taagagaaat aggcacgtat 74701 gtccacaaaa agacatgtac aaggatgctt atggtagggt tttccaaagt agtcctaaac 74761 tggaaataac ccaagtaact atcaacattt aatggataag tacactgtga tatattcatg 74821 aacagcaata caaaagaacg aacacagata cacacaaaag catggatgaa tcttactgac 74881 attatgtcaa gtgaaagaat ccagaaaccg tatgatttta ttcatatgaa actcaagaac 74941 agggaaaact aatctaccat ctcaacctta aaaaggaagg aaatgctgac acacaatggt 75001 acatgaatga accttgaaga cgttatgcta agtgaaatta cagaaagaca aacactgtat 75061 gagtctactt ttacgaggta cctggagtag tcaaattcat agagacagaa agtaaaatgg 75121 tgattgccag gagcttgggg cagaggaaat ggatagttgt ttaatgggta tagagtttca 75181 gttttgcaag atgaaaagag ttctgcagat tggttgcaca gcaatatgaa tgtacataac 75241 accactgaac tacaaactta aaaaggctaa gatggtaaat gttatgttat atatttatac 75301 cacaatttaa aagattctaa caacaacagc aacaaaacct aatctatgat gatgaaagtc 75361 agacagtggc tcccttttga ggtaatattt cctgggaaaa ggtacagaag agtttctggg 75421 gaactagaac ttttctgtat gttgacctgg tggaagtcat gtgggtatat acatatataa 75481 aaaacatagt ctgttgcatg tacagcttat tgcatagatg ttatactata acaaaaaagt 75541 attttaaaga aggtagccat ctgctgggca cagtggctca tgcctgtaat cccagcactt 75601 tgggaagccg aggcgggcag atcacgaggt caggagtttg agactagcct ggccaacatg 75661 gtgaaacccc gtctctacta aaaatacaaa aaattagctg ggcatggttg tgagcacctg 75721 taattccagc tgctcaggag gctgagacaa gagactgctt gaaccctgga ggtggaggtt 75781 gcagtgagct gagatcgcac tactgcactc cagcctgggt gagtggctcc atctcgagcc 75841 actggagtgc agtggctcaa acttggctca ctgtaacctc cgcctcccag gttcaaatga 75901 ttcttctgcc tcagcctccc gagtagctag gattacaggc acatgccagc atacctatct 75961 aatttttgta tttttagtag agatggggtt ataccatgtt ggccaggctg gtcacgaatt 76021 cctgacctca agtgatctgc ctgccttggc ctcccaagtg ctgggattac aggtgtgagc 76081 caccgtgccc ggcccatctt agctattttt aagtgtagtc cagtgatgtt aagtgcattc 76141 atgttgttgt gccaccatca ccaccatcca tctccggaac tcttttcgtc ttgcaaagcc 76201 aaagctatac ccattaaaca ataatgctcc atttcttcct gctcccagcc cccagcaacc 76261 accattctac tttctgtccc tatgattttg actactctag gtacctcata tgtattagtc 76321 cattttctgt tgcttataac agaatacctg aacctgggta atataaagaa aaggaattta 76381 tttcttacag ttatgaaggc tgagaagtct aaggttgagg ggccacatct ggtgagagcc 76441 ttcttgttgg tgaggactct ctgaagattg cagaggtagc tcagggtatt atatggtgag 76501 ggggctgacc atactaatgt acttgttcag atctctcttc ctcttcacat aaagccacca 76561 gttcccctcc cattataaga cattaattta ttaacccatt aattcactca tgagggcaga 76621 ccacatcatg atccaaccat cacttaaggg ccccacctat taatactacc acactgggga 76681 ttaactttcc aacacatgaa atctgggggg gtgggggagg cacattaaaa caatatcgta 76741 taagtgcaat catacagtat ttgtcttttt atgacttgct tctttcactt agcatcatgt 76801 tctcaaggtt catacatgtt gtagcatgtg tcagaatttt cttctttttg aggctgaata 76861 atatttcatt gtatgtatat gctacatttt gcttatcttt tcatgtttta gctactgtga 76921 ctaatgctgc tgtcaacatg ggtacaaata tttctttcag accctgcttt cacatttttt 76981 aatttcaacc tatttgtgtc tttggatcta aagtaggatc tctttattat ttatttcttt 77041 atttattttt gagatggagt cttgctgtgt cacccaggct gaagtgcagt ggcttgatct 77101 cggctcattg caaagtccac ctaccagggt caagtgattc tcctgcctca gcctctgtag 77161 tagctgggat tataggtgcc caccaccacg cctggctatt tttttttgtg tttttagtag 77221 aatgaggttt cactacgtta gccagcctgg tcttgaactc ctgatctcag gtgatctgcc 77281 tgccctggcc tcccaaagtg ctgagattac aggtgtgagc aaccatgcct ggcctaaggt 77341 gagttctctt gttgtagaca gcatatagtt aggtcgcatt tttgaaaatc tattctgcca 77401 atctctgtct tttgatttga gagtttaatc catttacatt taaagtaatt actgagaatg 77461 agagacttac ttttctcatt ttgatatttg ttttttatat aacttatagc ttttcttgtt 77521 cttcttttcc tgcattgcta tcttattttg tggttagctg attttttttt ttttttttgc 77581 agtgaaatgt tttaattccc ttctcattcc cttgtgtgta tattctataa ttattttcta 77641 tgttgttacc atggggatta cattaaaatc ctaaagtcag aatattctaa tttgaattca 77701 taccaactta actttaataa catacaaaat ctctgcttct aatacctctg tccccatccc 77761 tttcagttat taatgtcata aaattacatc ttcatacatt gtgcacccaa aaacataaag 77821 tagtaatttt aaataatgca ttagtctctt aaatcacgga gaaaacagta agtggatttt 77881 atgatgatat tagcttttat aattgcccat gtatttactt ttactgaaat atttatttct 77941 ttacattgct tcaagacact gtccagtgca ctttgatttc aacctgctgg acttccttta 78001 gtttagtatt tcttgcatgg cagctctagt ggtgacaaac tcagcttttg cttatgtggg 78061 gatgtcttaa tttctctcac ttttgaagga cggttttgac agttagagaa ttttgggttg 78121 attatttttt cttttagcat tttgaatgca tcagcccact accttctaac tgctgaagtt 78181 ctgataagaa tctgcagata atcttattga ggatcccttg tatatgatga gtcacttctc 78241 tcttacactt taaagattct ctttttgtct ttggcttcag acagtctgat tataatgtct 78301 tggtgtgggt ctttttgagt tattcctact tgaagtttgc tgagcttctt ggatgtttat 78361 attcatgtct tttatcaaac ttggggagtt tctggccatt atttcttcaa ataagctctc 78421 aaataattct cctttctctt tctctttttc tgaaattccc atggtacgct gttggtttgc 78481 ttgattgtgt cccacaagtg tttcaggctt tgttcacttt tttcaatcat ttttcttttc 78541 gttcttcaga ctcaataatt tcaattgtcc tatatttaaa tttgtgcatt cttttttctg 78601 cttgcttaaa tctgcctttg aaaccattta gtgattttta aatttcattt attgtatttt 78661 cacctctaga atttcttttt ggtttctttt tgggttttct atttttttat tgacatttct 78721 gttatgttta tgcatttttt taactttctc catgtcttcc tttagttctt tgagcatctt 78781 caatatagtt gctttaaagc cattgtctag taggctcacc ttctgatcat tctctgggac 78841 agtttctgtt ggtttatttt ttccctttga aggggccata ctcccctctc tcccttcctc 78901 tctccctgcc ttcctgcctt ccttcccctc tatctctttc cttctttctc cccctgtctc 78961 tcttcctgtc tttctttttt gagataaggt ctcactctgt cacccaaact ggagtgtggt 79021 ggcacaatca tggctcactg caacccccaa ctctcaggtt caaatgatcc tcctgcctca 79081 gcctcctgag tagctgggac tacaggtgtg gactacctgt ctggctaatt aaaaaaaagt 79141 ttttgtggag atggggtctc actatattgc ctcaactggt ctcaaactcc tagactcaag 79201 tagtcttccc accttggctt cccaaagtgg aggcataagc ctccaccaca cctagccttt 79261 tcttgtttct tactataggt ttttagttac aattagtagc agttctaagt gatcttgaga 79321 gttgccacat ttcaaatgct cagcagccac atgtgggtac tgtgacaact gtgttgaaga 79381 gtgcagcctt agaacaaagg gattgaggcc gagcgcggtg gctcatgcct gtaatcccag 79441 cactttggga ggctgaggcg ggtggatcac ctgaggtcag aaatttcaga ccagcctggc 79501 caacatggtg aaaccctgtc tctactaaaa atacaaaaat tagctgtgca cggtggtgca 79561 cgcctgtaat cccagctact tgggaggccg aggcaggaga atgggttgaa cctgggaggc 79621 ggaagttgca gtgagccgag gtcgcatcac tccactccag cctgggtgac agagcaagac 79681 tctgtctcaa aaaaaaccaa aaccaaaaca aaacaaaaca aaaaacaaaa gaataaaggg 79741 gttgagaggc aaactactgt atccttgtcc taccttagag aagaggaaaa tgagacacag 79801 gaggaagtaa gtgacttgcc cagggttatg cactgtagtc atttaaatgg tagcccccaa 79861 gaaggtacat ccatgttata attcctggaa cttgtgaatg tcattttatt tggcgaaggg 79921 gtctttgcag gcataactat attaaggatt ttgagatgaa gagatcatct cggattatcc 79981 agtgaagccc taaatccaag gaccagtgtt catataaaag atatacaagg tatagggcca 80041 gtgcagtggc tcacgcctgt aatcctagca ctttgggagg ctgaaggcag gcagatcacc 80101 tgaggtcagg agtttgagac cagcctggcc aacatggtga aaccctgtct ctactaaaag 80161 tacaaagatt agccgggcat ggtggcgggt gcctgtaatc ccagctactt gggaggctga 80221 ggcaggaaaa tcacttgaac ccgggaggca gagactgcag tgagccgaga ccatgccatt 80281 tcactccagc ctgggcaaca agagcaaaac tccatcttgg aaaaaaaaaa aaaagaaaag 80341 aaaaaagaga tatacagggg atattagaga aacagaaggg gtgaagatag gcacacagag 80401 gagagggtga tataaagaca gaggaggatg ggagcgatgt gaccacaagc cagggaagcc 80461 agggaatgct gacagcctcc agaagccaca aaaggcaagg aagcgttccc ctctgaaacc 80521 tccagaggga atgtggccct gctgacactg tgacttatgg tttctggcct ccagaatgat 80581 gagagaataa atttttgtta tttctagcca ccctgttctg ttataatttt ttatggcagc 80641 cacaggtctg gagcaaaggg accttggact ccattctcag ccacagcaac ctgccccctc 80701 actgcctctg aaatctttca agtgaccagc attcactcct tcatgagcag agtgggcctg 80761 gggggctctg gtggcaagga ggacccatgc ccaatctacg gggagctgct gccctgcagc 80821 tcctgttggc actatgccat gttgccctaa tctcttgcct gatctgctca gttgacctct 80881 tcactgctct tcctgcctcc accctttctc ctctaatttc cacatagtag ctagggtaag 80941 cttttaaaat gaaaacccca taagttgtat caggccactc ctcagcttcc aaccctccaa 81001 tggcttccat tcactggagt gaaatccaaa ctccttatca catttgcaaa gccctttggg 81061 gccctgctac ctccctgacc tcatctcata agtcctctct cctctctgtt catccaacca 81121 ctgaggcctt ctcgctattt ctccaacata ccaagcttga tactccccta ggctctttgc 81181 attttctgtt ttctctgtct ggaaactctc gatagtcccc tggctcaccc tctcacttta 81241 ttcaggcctt actcagatgt ccccatatta taaaggattt ccctgtggtc tttataataa 81301 tgctttattt ttatttacaa acattctatt atacaccacc aggtatggat tcgttagttg 81361 gttaggttat tactccccct tccccattag actataagct ccactagggc agggaccttt 81421 gtctggtttg tttaacagta tctggaacag cacacagtag gtgcttaata agcaatttgt 81481 tgattgaaag aacaaatgta tgagtgaggg aggaatgtag gctccattta gctctatctt 81541 ttttcctaga taagtcagaa tctggatatt taagtaaaat tctatgaccc gtgaacttaa 81601 aaaacaacaa caacaacaac aacccactgg gctagcttaa aaaaaataga gcatctggag 81661 ccaccagttt gagaccctga cctacaggaa tctgggtcca atgggaacca catttctcat 81721 cctcacttgg tgcaacgcct tcaggaacca agtggctctt ttccatccac agcccttgca 81781 tccccaggcc ccggcgccat gaggtactgt ttgttgttac agaccaaagt catccagggc 81841 acttcattca aaaggcagga tgtgggctgt gagcttccag ctcatcagtt caaaggaaat 81901 ctacttcagc agcaatgtct cctgggatgt tttcccaagc ctccctggga ggtggtgggg 81961 cctcttcaag gactaataat agacaggcac atccagtttc ccaaacatgt caaccttaaa 82021 agtgaatgct attttttcct ccttaccttt gaattactat gtaggagagg gacattctat 82081 tattgcagcc cttgctctcg acatgctacc taataatgac ttaccactca gcagcttcca 82141 ggcaaaattg tcatagtggg acctgagctc gtgcaactcc agcctgcccc ccttcatcca 82201 ccactactca ggttcacccc tttgtcacca ggatgctttg tagtcaggcc agcccaatca 82261 ccattcagca acggtatgac tgtgggtgta tcttaatcac tctgaacctc cccatcctta 82321 tccataatga aagttaccag atctacatag ggttgttgcg ggggctaaat aagacagcac 82381 aagtacctga tgcccagcaa tcacttgata tttattatta aacacctact gggtgtgagt 82441 gggggtaaga agtgccgacc tttaagtctt cattcattca cttgttcatc catgccctca 82501 ctcaatcaac caatatttac tggggtcttg ggaggtgtca ggcactgagc ttggtgctgg 82561 gcgacatggt ggtgaacagc ccccctgctc tcacagagcc taggtctggt gaaggagctc 82621 acagcctaat gaagctctgc gttcccagag tgggggacat ggtggcagtg ggtatcttta 82681 ggccttgtgc atgaacaggg cgctaaataa cattgaatta cacattgtag acaattatta 82741 ccctttcagt taccttcaga tcacatcaag gagagagtcc tggttggata gtaatgtctt 82801 taacacccct ctagcattta ttaatttcct ctcttaacaa ataaaagatg acttcagtcg 82861 aagatgctta ggacagatga cggcacctgg agatatttta ataatgtaga tacctcttgc 82921 tgttcaaact cagaccaaaa gagataggct ttttttcccc cagagggtgc acaaatacga 82981 ccagaatttg tgaagacgag tcagaaatga atgaaatttg gaaaaatatt gatctactga 83041 aatccttcct ccccacacta ttagccctat gttacagttg gggaaacgga gtcgttttgc 83101 agaggggatg gacagaaggt agggagttct cttccaaacg tgcaggaggc aagcaaagcc 83161 aagaatcttc tctgtggtgg agttagagac atataaaata aagatcgctc ctcccctacc 83221 tctgcagaac gtgtgtgtgt atgtgtgtgt aagtgtgtgc ggccacaagc ctttccgaat 83281 gagtgacagc gggagcccat ccctccagga gacgcgtgca gaatgaccaa tgggatggat 83341 gggggtggat gggtaccagt ctccgcagag gccggggtgg aattcgctgc gccccacccc 83401 ttccacccgc tccccttcgc cccgtaggtc tttccactct cgctcctccc ctgggcacat 83461 ctcctgaacg cagccccggg ggccgaggac ggggtggggt ggggggcgag gctcgggtcc 83521 gacgaccccg ggctgcggtc ccggcgctgc agagctgcgg ctgtgcacgc ttagccgcga 83581 ggcccgcggt agcccgggcg ccgatatgta aagcagctgg cagcgctggg cggggcctgg 83641 gcgcgatgca aatgaggagg gcggggctgg cccggggctc cgcctccctc ccccgcagct 83701 ggggccagcg gtgccaagcg cagctggacg agcggcagca gctgggcgag tgacagcccc 83761 ggctccgcgc gccgcggccg ccagagccgg cgcaggggaa gcgcccgcgg ccccgggtgc 83821 agcagcggcc gccgcctccc gcgcctcccc ggcccgcagc ccgcggtccc gcggccccgg 83881 ggccggcacc tctcgggctc cggctccccg cgcgcaagat ggctgacccg gctgcggggc 83941 cgccgccgag cgagggcgag gagagcaccg tgcgcttcgc ccgcaaaggc gccctccggc 84001 agaagaacgt gcatgaggtc aagaaccaca aattcaccgc ccgcttcttc aagcagccca 84061 ccttctgcag ccactgcacc gacttcatct ggtgagcgcg cgcgcgcagg gcaccttccc 84121 gggcccccga gggcagcgcc gcgccaggga ccccctctcc gcgccctctg cgccctccgc 84181 gccctccgca ccctgggacc ccgcgtctcc ggactcccgg ctccggaccc tgctgcccgg 84241 gactcccgga tggacagtcc tgccgttgcc ctgtccccac cctggtccca gacgggccgc 84301 cgcggggcgc ctcctgccct ctcctgctct caggcgcctc tagagcgccc aggggcggcg 84361 tcgcgggagc ctttgctcca cctgactagg agcgcgcggg gtctgtgcct gccctggagg 84421 gcagcgcctc gggtgctctc cgacccgggg ttccctatct ctccgcctgc ttccgggcgc 84481 gaggagccct cgccccccac cccttgtttc cggggggggc ggcgccctgg gtgtccttct 84541 ctatctccct gcgggcatgg gacatccttt ctcactcctc tgtgcctccg ggcagcgccc 84601 tgtgttatct cccattgccc ctccccgagg gcctgggttc ccctttccac tcctcggtca 84661 catcactgcg ggcccctttc ttccccagtc cctccagtag tggggcatcc tttcctcctt 84721 cccagtcccc ctcccagagg acaccaccgc cgcggggtca ctctcgccct ccctctgaat 84781 gcgtctttat ctcttctctt ttcccgaggg tgctcggggc atctatgggt acatctgtcg 84841 cctgccttca gcccctaccc cgacggaaac gctccccact atcccgccac ctggtggtcg 84901 cagcctcctc tcttctgcag gagtgaaggc agatgggggt tacagccgag ctcccaccta 84961 cccccacaaa ggcggaagac tcttgggcac ccgcctgtgg ctgggagttt gcacctgggg 85021 tacagaggca gggaggaagg cgggtgactc tgtgggtaac tagctggagg ctgggccccc 85081 cgggctgcct gacatacacc tccttctgct tttgcagggg cttcgggaag cagggattcc 85141 agtgccaagg taggctctgg ggctttgggg atgctatttg tgggaagaga gggtgaaaaa 85201 tactttatag aagaagttac tgagttaggc agagagtgaa agaatcacgt tggtcggagt 85261 gacctcccag gctaggaatt cttcaccaca acagggtcct ttcaaggggt gtgtgtgtga 85321 ctggggccga tggcgcttgg gagtcttaca tgccaaggaa gttcacctac ccttcctgcc 85381 ttcccggctc tggaagagtc aaagcggtct cctgaagcaa tcctggcatg gtcagttccg 85441 ctgggggaga aagtgttttc ccgggacgtt tctgggagaa cctgtcctcc ttagttcccc 85501 tttctctgcc catgtccctt ccttatctca cccagggagg ttctgcctct ccctgccttg 85561 cagaggtctc tgcaggtggc tgccgctcct ctgcagatgg tgcatcccct agaaaggcga 85621 ctgtgtttgt cgcctgggcc tcctctctct ctgaaagaag tatttccaga gggagtgttt 85681 ctcagatcct ggtttaaatc accctatctt ctgggttgaa acagaaaggt ctccagggca 85741 ctggttacct gagttcttgc atctggcttg agatcctgga ctctaaccag gggtatcaaa 85801 cctgctggtg gtgagttgat cacgtgtaaa gtgtccccct cccccgtctt tggggctcat 85861 gtctaaaaga cagattgcaa attggctctc caggaccaaa agttacccgc agctctgtgt 85921 agcccttagg gtgtcttact tgtgaatttg atgtgaacgt tttagaactg ggagatttca 85981 catacacatc ctaatttcca actttaactt tgctgagaaa tggtctcact ttcgcttgaa 86041 aacaatcgtt tggagctgag agccgtcgtc gtagatgggg aacagcctta ccattttgtc 86101 tcagttgatc ccagtgggga acacttcacc catttctggc ccctgtgggc atttaagttt 86161 gcaacctctc atctggaaag tttgtgtgtt atgttttctt tttaatttaa attttaattt 86221 ttcactcttg ggcttggtaa gattgagttt ggtaagaaac ggcacatttc tggggtattg 86281 tatctttagg ggcaaagtga ccaccaccct ggtgagctgg gtttaaatca aaagaaagta 86341 tttgagtgtg gaagagtctg acgtgattta gcactgactg acagttttgt ggctgcacct 86401 ccaccctcca gggaaaagta aagtgatccc tagtgagtgt aggtggcatc gtcagtggtg 86461 cagaatgttt gcttgctggc agcatggcac tgtgctcctc tgagactctt gcatactctt 86521 ctgagatcgg ggtcccggtg ggctcagtga gccctgccaa cctacccagc ctgtcttacc 86581 aagatttgga tcctctgcac cacaccagag gactcctcaa gtctctttaa gtggcaacaa 86641 cttaggccct ggtgcaaaca aatttggatt gttttctaga actttgaccc ctcaaacaaa 86701 atgccaatgg tgtcttggca ccggaatgtg taatttcttc tttgaggttg cctttcactt 86761 ccatgaatca gtttcagtct ttgtctcttg ctgagatcgg gggcgatttt taaacatttt 86821 taccacggat cttctatgcc acctgctctg ctgttattga tggatgggaa agtcactttg 86881 gaatgtgcaa ggcatgacac atttgaaata ggagatcctt taactcaaag tctggaagat 86941 aataggaatt ttctgatgtt aagagaaaaa ggcagaatta aagtttgtaa tgaaatatgg 87001 gcgctctaga gataggagtg tctttttttt ttttttgaca gggtctcact cttttgcccc 87061 ggctggagta cagtggtatg atcagagctc actgcagcct caaactcatg ggctcaagtg 87121 atcctcctgc atcagcctcc tgattaattg ggactacaag tacgcaccac catgcctggc 87181 taattttaaa acaacttttg tagagaaggg gtcttgtgat gttgcaagac tggtcttgaa 87241 ctcctggctt caagcaaccc tcccacctca gcccccaagt gttgggatta taggtgtgag 87301 ccactatgcc cctctggagt gtctattttt taaattgcct ttctttttct cttccctgcc 87361 tccccaccaa gaggcagcgt agagaagtgg gcatttgcac aggctaccag gcaagtccct 87421 gtcggtctga atcctagctt tgagacttcc caggcatatt acttagcctc tctgtatttc 87481 agcctccttg tctataaaac ggagtcagta acagtaccat agtgtggctg agcagctcca 87541 actcattgca atatccaaga ttcttggacc agagcttgac acagagtatg ctcagaaaat 87601 atctgctctt atgattgtcc ccagactacc ccaatggtgg ggcagctggc acgagagaag 87661 actggctctg cagggttgct ttgctatgga actgcctgcc aggtaagagt caaaactcag 87721 ccatgaactt ctggtcactc tgtgtagctg ctttgacagc atattccatt tgggcagcag 87781 tagggaggag tgaggggagg ggcagattgg cagtccctga aattgagatg tttagaatgg 87841 gaacatatag gaaagaggga aatttatccc tggtactcag cattgcctgg gactaattct 87901 ggaaccactg aggagttcct tagtatctga ttgttcttgg tatcagatat ttcctctgaa 87961 agtaaatcca cttacgatgg aatcaacagt ttatacaaaa cacataaggt tttatgatct 88021 cacttaagtc agcaatccag agcccagggc ctcaggttcc tttgagtttt ctgctccact 88081 atccctgggg agtgacaatt atctcatggt ccaaaaagga tgctagacgc tggagtcctc 88141 acacccttct ttcagctagc aggaaggaag aaaggggtaa agaagggcat atagcacatg 88201 ctcggtgtca cttgtttaag ttgctcaaga tccttctacc tacatcctct tagctagggt 88261 ttaatcacat acccacacct tgctgtgagg gagcatgtgg gaaatgtagt gtttatttca 88321 ggcagccact atttagagga tgtttccatt tcaaaaatac aagcagagag aggatgttgt 88381 ctctgccact ctggtaggag gtgtgtagga tgatcgcaag gcaaagaaac cattgagagt 88441 aaccttcagg ctctattaca ttgtcatgga aattggggac actgctatgg gttgggagag 88501 gaaggatgga aaactcataa accttgggaa attatggatt tctgtgtgct caggggagct 88561 taagaataga ttttcttgct ttcgtatgat gataggacga aactctaggt tgactgcctc 88621 ataatttctc cctttttgtt tccactctgt ctatttctta taaaaccact ctctgggagg 88681 taatttaaga atgtgtttga tgacagggaa ctggggctga agatccctat tgtaaaattg 88741 cctaaattct agagtagaag tggaactcct ttgacaagca aggcagattt ctctctatgg 88801 catttttttc tttgcctttt atttatttat ttatttattg agacagagtc tcactctgtt 88861 gcccaggctg cagtgcagtg gcaccatttc gactcacggt aacctccatc tcctggggct 88921 caagcgattc ttctgcctca gcctcccagg ttgctgggat tacgggggcc tgccaccacg 88981 cctggctaat ttttgtattt ttagtagaga tggggtttca tcattttggc caggctggtc 89041 ttgaactcct gacctcaggt gatccacccg cctcggactc ccaaagtgtt cagattacag 89101 gcatgagcta cagtgctggc tgtttctgtg gcattcttta caggaactta ggacctgcat 89161 ttgagcttta attttaatta aagtcttctc taagaagatt tttggtagag agaaaaacct 89221 tcttctttta ttttaaattt tgtttttaga gacagggtct cactgtgtgg cccaggctgg 89281 agggcagtga catgatcata gctcactgca gcctcaaact cctgggctca agtgattctc 89341 ctgcctcagt gtcctgagta gctgggactg caggtgtgtg tcaccacggc cctttttttt 89401 tttttttttt ttttttgaga cggagtctta ctctgtcatc caggctggag tgcagtgacg 89461 tgatcttagc tcactgcaac ctccacttcc caggttcaag tgattctcgt gcctcagcct 89521 cttgagtagc tggaactaca ggcacgtgcc accatgcccg gctaattttt atatttttag 89581 tagaggtggg gtttcgccac gttggttagg ctggtctcga actcctgacc tcaggtgatc 89641 tgcccgcctt agcttcccaa agtgctagga ttacaggtgt gagccactga acctggccaa 89701 atacatattt tttaatttta aaattttttg tagagatagt ggatcttgct ttgctgcctg 89761 tgcctggtct caaactgcca gcctcaagcc aggagtctca ttctgctttg gcctcccaaa 89821 gtgtttgttg tttttgtaaa ggaagtgact ttggtagaaa tagagaggca ataaaatata 89881 gagattcaga attcttagca atgagttgga gtccagtaga attgggtatg tatctctgat 89941 ggagccgctt actccctgtg tgaccttggg caagtcgtcc cccagcagca actttggtct 90001 cctcatctgt aaaatgggga taatagcacc taactcatag ggttatggtg aagaataatg 90061 aggtggtgca tgagaaatag ccacagcaag tatcaatgac tctaagtgag accatcagta 90121 ttacttgatt agctgtagga aactgtgtca aagtgggacg gagttcattc tggaacttgt 90181 taaatgacta ggcctctttc atgtacttag atagtttctc ttcttgggta tatttcttcc 90241 tgggtattgg gtaacaccag tgagtgctca gttcagtgtc tgggcccaga aacccatgcc 90301 tggttagcat tgtcttttct tctgagaaag atctgaactt gccatggact ctctagaaaa 90361 accccagtgt acaaattttc tatgaaaagc tagaattgat gattataatg agtaagttgg 90421 ctcttccttt tcccatgtag attagaaaac atgtttttaa atcttctaat tgaaattatt 90481 cttaaaaaac cccaaagtct gacatgactc caactattgt tgttttcatc aatatcaatc 90541 ctatcaaatc aaagtttatt ttagtgtgta attaacacat gtagactaga aaagtttgca 90601 cttttttata tcacaaaaaa ggttaatttt attttctatg gaaagctaga attgatgata 90661 atgagtaagt cagctcttct ttttcccatg tagattagaa aacaggtttt taaatcttgt 90721 agttgaaatt attcttaaaa aaccccaaat catgacatga atccaaccat tcttgttttt 90781 atcaatattc accctatcaa atataaagtt tattttagtg ggtaattaac atgtgtagat 90841 tagcatagca tatgtgtatt tatataacaa aaaatgaatc aaaagttaat tttattttgg 90901 gcagtgttcc ctagaaactt ataatatggc cccatgcact tcaggttgaa ttttcttagt 90961 aaaatcataa agaggtccat tttttcaaaa aatcaccaag tcaggctgca agtttagagg 91021 ttacttctat tgtttcactt ctctttcttc ccagaagcag accccagaga cctgtgagcc 91081 aaactccact caccttttag cttattctaa atgtgtcctg cgggcttctc tgtgaataaa 91141 aggcagaggc tttttgactt cgtgtttgat gatttggctt ttttcttttt tttaccacct 91201 gacgggggat ttcaggctca atgtcccaat cgaaataaga atctttgagt gggttaacat 91261 tgagtcttaa aaaggagggc tgatttgttg gagaggcaac aacgaattca tttttcactg 91321 tagagcactc aggaggagtc tgaagctttg tcactgggca acagaatgcc tttctaagta 91381 aatatggaaa atatgacctt tgaaaatcta gctgatatga tatatagatt cttaggatat 91441 aatttttaaa aaatatatta tttccaagca ctaatgatac tccagagata ctgcccacaa 91501 ccagatgcag catgtttata gtgtatctag gccaaaaaaa aaaaaaaaaa aaaaaaaaaa 91561 tagaagatac agaagaaggc aattggtcaa aagattaact gaaggtttga aaataatttt 91621 atacacattt gtactgttgg ggtctctctt ctctctgtct ctgcttatcc ttctgtctct 91681 gtctctttgt ttccatcaca catgcacaca cacacacgtg tatacacaca catatatgta 91741 agtagatcaa tatgttataa ttttcacaat acgcatcttc ttttgctact tttcagatac 91801 aggtcattaa attactaagt ataatttaaa atatatggtc tggcctttta actctgcagt 91861 tcaattaatt cttgggtaga tagggggatc tgctgaaaag tttgtctctg acgccttcaa 91921 aggtaagtcc cttatgttct ggcatttgct ctaataacat gaattctttg acttcattca 91981 tccaatctct tcattgaaat tgtgtgaaat tccatagata gaatgttagc ctttttacag 92041 agaatacaac atccacagtg tttacatatg gcatttggag gtgtgatggc tgcttgaaaa 92101 tccagtagga aatttgtcta ttcaccacat acattcattt attcatccat ccatccctcc 92161 atttatccac ccacctctct attcatcatc cgtttttctg tttatctatt tgtccatcca 92221 tgcatccatc ccttcactcc attctccttc taaaagttct tttttttgag acggagtctt 92281 gctctgtcgc ccaggctgga gtgcagtggt gcaatctttg gctaactgca agctccacct 92341 cccaggttca cgccattctc ctgcctcagc ctcccgagta gctgggacta caggcgccca 92401 ccacctcgcc cggctaattt tttgtatttt tagtagagat ggggtttcac catgttagcc 92461 aggatggtat cgatctcctg acattgtgat cctcctgcct cggtctccca aagtgctgag 92521 attataggcg tgagccaccg cgcccagtct cttttttttg tttagacaaa ctctcactct 92581 ttcgcccagg ctggagtgca gtagtacgat cttggttcac tgtaacctcc gcctcccagg 92641 ttcaagtgat tctcctgcct cagcctccca agtagctgcg attacaggtg cccaccacct 92701 cgcccagcta atttttgtat ttttagtaga gacagggttt caccatgctg gccaggctgg 92761 tctctaactc ctgacctcaa gtgatctgcc cacctcaacc tcccaaagtg ctgggattac 92821 aggagtgagc caccacacct ggcctcactg attaagttct atgtgtcaag ttctatttgt 92881 caggaactat tttaggtgac agaatacaat ggtgaatgaa actgagaaaa tttctgttgt 92941 catggagctt atgatgtagc ggggggagat aagacattct acagatcatt tcactcatat 93001 ttaacctcat ttttattagg tactttgaag aaaatggact gggtaccggg agaatatata 93061 atgggaagct aacctatgta atggctgggg atgcagtcag tggtctggga attaaaaaaa 93121 aaaagcccaa cttctcattc tcattttctg aggtaactgg agaaagtctg agttgtcaag 93181 gaactagagc taaaaatctg tattgtaaaa ctgcccatat cctggataat atatgcaaca 93241 gggctgaagg cagtggctca tgtctgtaat cacagcactc cggaaggcca aggcaagagg 93301 attgcttgag gccaggagtt cgagaccagc ctgggcaaca taacaagacc ctatctcaat 93361 taaaaacaaa acaaaatatg tgcaggagac ttatctttgt gacattctat ggagaaaccg 93421 taaatattta tttgtatcat ggctgcaagt aagatgagta agtatttggt agaactagcc 93481 tgttaccttg agatggcttc tttcccagtt tgacagagaa cttcatctcc tcttagttaa 93541 taactcagat aatgagctga cttccttttt gtttagggtt tacagtttga ggtggctgtt 93601 ctaaagaaga aaaatcagac aatttcagag gtggaggagc aagtctgggc caatagattg 93661 aaaataaata aaaagagaga gcacggcatt tcagtgagag gcattttgag ctgaaagtca 93721 gttttcaggg agcacaagat tactcacatt gcctgccaaa taccttcctc accctgtgtc 93781 catctctgat atattacaac tcttcattcc tttggcttag gcagtatttt ctcttgttca 93841 gaagatactt tttaagtagt ttaacaataa acagtgttga ctcacttata gagaaaggac 93901 ttttgtcaag tctgtctttc tgatttcatt aaggagaaag tctcagttgg tgttaatata 93961 tctttaacaa ccctctaaca caaatctggg cacttagtag atatctgttg aatattcttt 94021 ttttttaatt ataaaggaac tacagggaat tatgagaatg taaaactatg ggctgggtat 94081 ggtggctcat gcctgtaatc ccagcacttt gggaggctga ggtgggtgga tcatctgagg 94141 tcaggaattc gagaccagcc tggccaacat ggtgaaactc cttcactact aaaaatacaa 94201 aaattagcca ggtgtggtgg catgtgcttg tagtcccagc ttctcgggag gctgaggcag 94261 gagaatcgct tgaacccggg aggcagaggt tgcagtgagc caagattgcg ccactgcact 94321 ccagcctggc gacagagcga gactccgtct caaaaaaaaa aaaaaaaaaa aaaaagttag 94381 tttctaccta taaaccaact tttttttgag tcagcgaggg tacactttta cctaatgttg 94441 agcctggttc tgttctaccc tgagcttgat tctgtactgg ctattcctcc ttgtgaacag 94501 agaagggaag ttcagtgtct ttcctggaaa tttttggtgt tgtaaggatg ctcagtgttc 94561 ctgaaggttg agaggaacag caaggacagg atctggtaga gatgtcgctt ttattcagaa 94621 gtgctaagct gttaatcatt accatggttg ttattacctt ttaattgcca tttgtcctgc 94681 ttgtttgtgg tctcctgggc atctccttcc ccctcagccc acaagcagaa agttgcccta 94741 cagagtcagg aggagaactc aaacccagac cagagagctc tgatccactc ctatgggtct 94801 tggaactggc taagtagatg gaaattaact gattcaagat atgcaacacc attagttcaa 94861 ccatagctgt gttcctgggg tgaattctag ctttcttttt tctttcatct tgctacttcc 94921 cagagaccag actttgggcc tctagggaag gagtatcaaa ttgggcttga gagttcaagc 94981 tcaaaacaga cctgatttta aatcttggtt ctgcttctcc atatcttcat gacctttggg 95041 tgggtaaaat atgatcctgc ctctatctat ctatctatct atctatctat ctatctatct 95101 atctatctat ctatctatct gtccatctat ctatctatcc atccatccac ctatccacca 95161 atacatccat ccgtctgtcc atccatccat ccatccatcc atccatccat ccatccaccc 95221 atccacccag ctattcttcc attcattcat ccaaacatcc acccatccac ccagctattc 95281 ttccattcat ccatccaacc atccatccat ccatccatcc atccgtctgg ctatccaacc 95341 gtttttccat tcatttaata tatatttatt ggatgcctac cacatatcag gtactgttct 95401 aggcattgga actagaacag tgaacagaac agcatcgctg gcttcatgga atttacattc 95461 tagtagaata acagtaataa gtgaataaca tagaacgaat atattgtata atgtcaggtt 95521 gccatatggg ctttgaagag aagtaagaca ctgtcctcac ccttaaagat cttcctggtc 95581 ttttaaaagg agaacagcac atggccaatt agttctgaga cataatgtaa agtgctacac 95641 caagaatgta aaaacacttt ggatggcatc cctactcctc ctcggaccgg tgaagattcc 95701 acagaggagg tgacatttgg gaacagtctt acagcaggag tgggagttcc cggattgaca 95761 aggggatgag ggcaggggag ggtgtttcag gcacaggaca ggacaacgca gttgagatag 95821 tgaggagaga gaatagggca catgcaggga acaggaagaa gagggtgtga gggagggagg 95881 agcaggtggg aggtgaacgt catatcagag gggttggccc ctgtgaaggc agtgtagtga 95941 ctaggaagtg gttgttgtca ctgttttaaa gtgggatgtg tgggggtgag tgaaatggcc 96001 aaatgtttga aaaagaggag catgagatgc ctttggcaac aatacttgcc atgactcaaa 96061 tcctgaaatt ctaggcctgc agtgatggaa gtcaggcagg agaaactgaa atggccattt 96121 agaagcagtg ggggtgcctt accggagcac ctcgtggctg ttgctatttt tcacacccta 96181 aagagacatt tggaatttca tgtcagtcat attgttgatt gttctatttt gagggtggtt 96241 actagttatt tttatggttc taacaatagg gacatctctc ttcactgtat ccttagacgc 96301 ggagggccgt agaaatgaat tttcctctga aaggcatcct cccactccct ctccctctcc 96361 actgctgtca cccagtgggt acgtgtttat tgagcacatg ctgcacatgg tggtgaagag 96421 ttggtgtttt ggagaaagaa aggcttcgat ttaaatcttg acttcttgct gtgggacctt 96481 aagcaagtta cttaacctct tcgagcttct gttttttcat agctaaaagg atataataat 96541 aaaacctact ttatgtgaag tgtttcttag caccctacct gacatatata cagtaagcac 96601 ttgataaatg gtagctattt ctgttcttcc taggactggg cactgtcctt gtccttactg 96661 agtttataat cctgttgggg aggcaagatg aatgctcttg aatcaaggag acgacgcaag 96721 aatggatgtg cttacatgtc gtggtcaata gcattagctg caccttacag tggatcttag 96781 agagaaatgc ttggttgggg ccggagaggt gatttgagtg tgagtgagca cagtgctttc 96841 tgagaaaccc tttcctggcc tgcaacactt taatttgtgg gaaacctcag gaagtagaat 96901 tgcaatctgt gatcgcccca tcggctgacc tcacttcaaa aatgtattgg tagcatttca 96961 actttaaata gatttgtgac catttcaccc ttctaaattt gggttccctt caccctttgg 97021 gtggagtgtt tatatccctt ttcggttgaa gaagctgaca gatagtgtag gtgaaggtgg 97081 agggtgaagt ctgaaggacg gtcgcttgtc tcattgggat agtcagggca gtgatttttg 97141 tgggatggcc ctatattgaa gatcttcttt ccctccctaa aggtagagac cattctttag 97201 agtttagtga gtgctgcaaa tgaagatttg atcgttattt ggattttttt gcatgtgtat 97261 gtgtggattt tttttggtag ttatccagat acattttgtg gcaaaattta tggctgaggg 97321 tattgataat tgtttctgag acaaacagtg cacttaggga aacaggattg acatatagag 97381 tagagaagga cttcttttta tttcagttgg atattcatct tttggtggct agcttttttt 97441 aaaaaaacta ttaatattat ttttgattga caaatcataa ttgatggggt acaatgtgat 97501 gttttgatac atatatacaa tgtgtcatgg ttaattcaaa ctaattaata taaccatcag 97561 cttgtttact tattttttat ggtgagacat ttgaaattta cttagttatt tgaaatatat 97621 gatacactta ttattgacta tgttaccctg ctgtgcaata gatctccaaa cctattcccc 97681 ttgtctatct gaaactttgt atcctttgat caacaactcc ttccctcatc ccagattctg 97741 gtaaccatcc tcctactctc tacttctatg actacgactt tattagattc cacatatacg 97801 tgagatcatg ctgtatttgt ctctgtgttt ctggcttatt ttacttagca taatgtcctc 97861 cagattcatg tatattgtca caaatgatat aatttctctc ttttttttaa agtctgggta 97921 ggattctatt gtgtgtatac accatacttt ctttattcat tcatcctctg ttgaacactt 97981 aggttgattc cctatctttg ctattgtaag tagtactgca acaaacatgg gaatgcacat 98041 attctttgac atattgattt cagttgcttt aaatatatgc ccagaagtgg gattacttaa 98101 aaaaaattta atttaatttt ttttaatttg agactgagtt ttgctcttgt tgcccaggct 98161 ggagtacaat ggtgcgatct cagctcactg caacctccac ctctcaggtt caagacattc 98221 tcctgcctcg gcctcccaag cagctgggat tacgggcatg caccaccacg cctggctaat 98281 tttgtatttt tgctagagac ggggtttcac tgtgttggtg aggctggtct caaactcctg 98341 acctcaggtg atccacctgc cttggccttc caaagtgctg gaattacagg tgtcagccac 98401 catgctcggt ggtagttcac ttttttgttt ttttgagaaa cttttagtac tgtttttcat 98461 aaggctatac taacttacat tcccaccaac agtgtatggg tccccttttc tctacatctt 98521 ctccaacatg ttatttttta tctttttgat aaaagccatt ctaacaggtg tgaggtgata 98581 tctcactgtg gttttaattt gcatttccct agtgcttagt gtgctgaaca ttttttcaag 98641 tacctattgg ccatttgtgt gtcttctttt gagtaatttc tgttcagatc ccatttttaa 98701 atcaggttat ttgtttactt gctattgagt tgtttgagtt ctttgtatat tttggatatt 98761 aacttcttat cagatacagc atttgcaaat atttttccca ttctgtgggc tgtctcttca 98821 ctttgttaat tgtttccttt gctgtgagga agctttcagt ttgatgccct cccatttgtc 98881 tccttttgct tttgcttaat cattgcccaa accaatgttg tggagttttt cccttatgtt 98941 tttctctagc agttttatag tttcaggttt taagtcttta atcaattttg agttgattgt 99001 gtatgtggtt tgtaagataa aggtctaatt gcattcttct acatgtggat atccagtttt 99061 cccaacacca tttactgaag agactgtcct ttccctcatt gtgtgttttt ggcaccttta 99121 tcaaaaatca gttggctata aatgtgcagg tttatatctg ggctttctat cctgttccat 99181 tggttgatgc atctgttttt atgctagtac cgtgctgttt tgattacaat tgctttataa 99241 tgtattttga aattaagtcg tgtgattcct ccagctttgt tctttttgct caatgttgtc 99301 ttggctattt ggggtctttt gtggtcccat atgaacttag ggattgtttt ttctatttct 99361 gtgaaaaatg acattggaat tttgataaga attgcactga atctgtagat tgctttggct 99421 agtatggaca ttttcacaat attaatttgt ctagtccatg aacacggaat atcttttcat 99481 ttatttgtgt tttcctcagt ttctttcatc agtgttgtat agttttcagt atacagatag 99541 tttgcttcct tggatacatt tacacctaag taattttttt tttgctgcta ttgtaaatta 99601 gtttgttttc ttaattttct ggtctggtag tttgttatta gataagctag tatataaaat 99661 tagaataatt agataatcta atatagaaac actactgatt gtatgttggt tttgcatcct 99721 gcagttttat tgaatttgtt gatcagttct aacagctttt ttagtggatt ctttagggtt 99781 ttctgtatat aaaatcacct tgtcaacaaa tagagacaag ttcacttctt cccttcctat 99841 taggatgcct tttatttctt tctctttcct aattactctg gctgggactt ccagtactct 99901 attgaaaaga aagggtgaga gtgggcgtcc tgatcttaga ggaaaggctt tcaacttttc 99961 agtgctgaga aggatgttag ctacgggttt gtcataatag agcccatagt gtttattgtg 100021 ttgagtgcat ttcctttata ccttattgct gagtactttt ttcatgaaag aatgttaatt 100081 tttttcaagt gctttttttt cctgcatcta ttgagatgat catatgattt ttatccttta 100141 ttttgttaac atgggtatca catttattga tttctatgtg ttgaaccatc cttgcatccc 100201 agggagaaat cccacttgat catggtgaat aaaattcttt taatgtgttc ttcaatttgg 100261 tttgctaata ttttgttgag gatttttcca tctatgttca tcagggatat tggtttatag 100321 ttttcttctc ttgtgttttt gtttggcttt ggtatgatag tcttggataa tgagtttaga 100381 aatactccct taattttttg gaagagttta agaagaattg attgacatcc agatcaagaa 100441 acagaacatg ctggtaccct agaagcatca tctcacaatc cctatcaacc gtatccttca 100501 ggagtaatca ctatcctaac ctctatcacc tatgattact ttcatctgtt cctgaactta 100561 tgcccacttt tctgttatgc aactccattg acttatgccc atttttctgt taaaaaattt 100621 aaaaagtgca gagctttaag ttactgaggc tgagcctgag aaatgttagc tgggactttt 100681 ttagttttgc atagaggaag tgaaatccgt atgggagaca ggcaccttct ggaatcaccc 100741 attggggaat ggtgccgttt gccttctttg gggaacatct gcttcctgtt tcaatgttgc 100801 ccatcggttc cagactgttc aacaaggaag ccagggcatt ctgacagttt ttccacacaa 100861 taatttagct ctgtaaatac taggaaaact tgagctcaca aactattttt agggcttaac 100921 atttcaaaaa ttctgtatta gtaagaatct attaaagctg ctgtaacaga caaaccttga 100981 agtctctgtg gcttgacaga agggagttta tttcttactc atgtcactaa ccagtgcagg 101041 tggtcctggt caggacatct tccacagagt cacacaggac cccagggtcc tccccattgg 101101 gccagtggaa ggggaaaggg aaattgagaa agcacactca tttcacaaca acttcagctg 101161 gggaaggaag tcacatcact tctgctcaca ttctatttgc cagaacccag gcaaagaggc 101221 aaccaacgga agggggtcct gggacatgtc gactagctct atgtccagga ggaaaagaaa 101281 acaggttttg atgaactcag taaatctcac tccagtctcc aaagccataa tttcctttgt 101341 ctttctttgc ggtgatgtat ctagttttat ctctcacttg actggtaaaa ctaagatgca 101401 aaactctgga cgtctttatc ttatcaaaca tagcagggaa agagtgagtt ctatttgtgt 101461 tcctaaaagt gaaataaaga agcacttttc ttgtttttag ttattagaac aatctggctc 101521 ttgtcagact ttctagtttt tcaatcagtc agaattcttg cctctctggt tgcctgttgt 101581 attaatccat tctcatgctg ctaataaagg cataactgag actgggtaat ttataaagga 101641 aagaggttta attgactcac aattcagcat ggctggggag gcctcacaat catggcagaa 101701 ggcaaaggaa ggtcaaggca cttcttacat ggcagcaggc aagagagcgt gcacagggga 101761 actgcccttt ataaaaccat cagatctcgt gagacttatt cactatcacg agaacagcat 101821 tagaaaaccc acccccatga ttccattacc ttccgccagg tccttctcat gacacatggg 101881 gattatgaga gctacaattc aagatgagat ttgggtgggg acacagccaa accatatcac 101941 ctgtggacat cccaagcaga tctctttgag gccgggttag ttctcttcta agctgacttg 102001 ttatatagaa gaatcctttg ttggttctac ttccagatat ttgactggac acccaacaaa 102061 tcaattatct attgctggat agcacataac ctcaaggctt agtggcttaa agtagccact 102121 gttttattta gttttagttt ctgtgggtca gtaatttggg ctgggctcag ctgggtggtc 102181 tgtctgatct gggcagggat tggctgttct tggctgggct catttacctt cctggggcct 102241 cagcagggat gtctaggaca actggggcct ctcttcaggt ctttcatctt ccagtgggct 102301 atcctgagca tggtcatatg ctgatgggtg cattctctgg gcatgagcaa gggctgcagg 102361 gcttcttgaa gccaaggctc agaacatgca taacatcact tctgccatat tccctggtca 102421 aagcaaatca caagcccacc ccagaagtaa aggaggaaat agagaagaga atagagatgc 102481 ctcctcttga tgggagaagc tgcaaataat ttgtgatatt ttttggaacc caccacacta 102541 atttcagtgt ggtcctcgca attgcttggc tgtctaaaaa catactccag tctagtgtgc 102601 atttctgaaa atgaacttga tcaaaggcag gccccaaatg agtgctcaac atttacacct 102661 gataataata ctacctacct cctagggaaa tggtagggcc agatgagatg ctgtatgcaa 102721 aggatacggc caggacttag tgcacacgca acactgaata ctaccaccat gactactgaa 102781 tactgccatc aaatactacc gttactacca cttggcactc cctaaaagct tgagattcag 102841 agaaaggaag acaaagcccc tgtacttgag gagggagaga ataaagagag ccggcggggg 102901 agggggtggc aggagagaga gagagagaga ggcgtgtgag tgagttcttg caacacagtg 102961 tggtcagtgt gatcacagag ctgtgatgga gtgcggatga ggtctgatga cttactggtc 103021 aagagcatgg gttcaggcct cagttcctcc ccaatcacat gtggggtctt aggcatcttt 103081 ctttttaatc ttcagtttct ttgtctgtga aatgaagata atggcacaga gcctacctcc 103141 cagagttcta tgtggattaa atgagacagg caaacaacaa caaaaaccac aaacaaacaa 103201 aaaacaaaaa tgaaaacaaa aatcatgtga ttgcttatta aatgttagtt atagaaaaaa 103261 atataagttc ttaaatgtgc ccttgtaaaa atgtcctaaa gacctcatct ttatttagtt 103321 cttctgcaac tctggctagc atcattgata aattgtctta aaagatgaag atttgtatgc 103381 aggtggttta ttggggatgt agctcttggg aacaacatct ataagggata gagggaagca 103441 gggaagaggt agttgaactg tggtgcaata gttgtaatag aggccaaagc caattccatg 103501 gctggagctg agacggccct tcagaactgg gacaagccaa tgcatcaagt gggcattgat 103561 tgtgggctgc tctttggaag gcagcattac cttgggcaaa gcagctccct tcttcagggg 103621 aggagtccca gggaggacac agctgtgagc aattggtttt cccagcagct gggggaatga 103681 gtgccttgtc cctatagatg gttttgaggg ggctgcattt gggggaatgc cttctggggt 103741 tcactgcgtg cttctatgtc tttcttagag gttcatttca aactatacct tggttcattc 103801 acagtactga tttagttact atccagattc atagactgta cttgctggca ctggaatatc 103861 atggacagag agtcctaatc tgactaggcc tagggaatga catgggaata gcttgtcctg 103921 ggtcatccag ctaccagaga ggagagctgg gattcatatc caaccaacga tactttatac 103981 ctcacattac ttctgttatg ctacacactc ctgagtcaca ggagattcca acttgtattt 104041 caagcccaga ctgcatgact tcaatgtcct gttgccctta acccctgtat cagattgcag 104101 ccccactctg gaaagttttc ttgaattctt ctatgtatgt tttttaaact gtgggtcatc 104161 actcattaga agttcatgaa gtcaatataa taggtcaaga ataccatgaa gaaagcagaa 104221 cagaaaatat cacagtgcat cactctcagg aagggtagat attgtttcac gaaacttttt 104281 ttttttttga gacagggtct ttgtctgtcg cacaggctgg agtgcagtgg tacaatcata 104341 gctgactgta tcctcaacct cccaggccca agagatcctc ccacctcatt ctcatgagta 104401 gctgggactt gaggcacatg ccatcatgcc tggctaattt ctaaaacttt ttgtagagat 104461 agggtctccc tatgttgctc gggctggtct caaactcctg agctcaaaca gtcctcctac 104521 cttggcttcc caaagtactg ggattatagg catgagccac tactcctggc ctcatacaat 104581 ttttatttga tataaacata tatagccatg agtgaactgg gtcaaaatgt aaaatgtgtc 104641 tcttactgca ggtaactgtc aatgtagttc gaatatcact gtgaaatata tagagaagag 104701 gctctggggg ctcatgaact tggcccatgg actgttgagg cttttttttt aatctttaga 104761 gatgatataa atgtagtgct ttgaattctt tataattaga agttttattc ttaccccatc 104821 accatgtccc aagataattt ttattatcat ctttcagctt taacatatca cagacaaaag 104881 ttatttttcc aaatagaatt ttaagtatgg tataagaggt ttttagaaat ggaatgattg 104941 gttttggcgg aactgcctgg cctgacctgt cactcgggtg aagtgtcaga gagctgtgtc 105001 gaatgctggc ggagaactgt ccctctcctg cagagagaag gagctgctta tcctggggga 105061 taatgtgggt tgagaaagaa gtgttttatg agcagaatca gcacagaggg agggaggagg 105121 aggtgtttag gggctgcaga gagccgccag aagatgtggg aacagtatgc cagcacggag 105181 gtgggtcaga tccccatggt gggactctgc tgagggttgc ctccagccga gacttccaag 105241 ggctgggatt cagtgtgtgt ttatgttctc tggtgtgcct gataggtggg ggcccacatg 105301 acagctggcc tctctgtgtt tcccttctgg ccgtctctga tttggggccc aagggccata 105361 ggggggccag ggcactccag atgggacgaa aagtcaagtc caagaaaaaa atgaaatccg 105421 ggctaggctt gtccctgctt gaggatgtga gagagagaga gagagtgtgt gtgtgtgtgt 105481 gggtgtgtgc gcacgtgcgt ctgcagtccc tagaaggatc tctattcatt ggacagagag 105541 gccaagggaa ttttccattt cctggatgtc ttcaaagatt tcaaatgcca ttaacttgag 105601 ctgacattta ttgagtactc actcggtgac tggcactgtt ctaagtactt tacatgtatt 105661 aacttattta atcctcacaa cagccctgtg tggctgctac tgttattgtt gccattttat 105721 gctgagaaaa cagcacaaag aattgagaaa tttttccaaa gttacagtta ggattcaaaa 105781 atagacttgg ttttaggatt gtgtaacttc agaattgatg gtgacaatga cgttgatgat 105841 gaggatgatg atgatgatga tgaaggcagc taatatttca gtgtctgcaa tgagctgggc 105901 attttcacaa actttatccc tttgaatctt taccttccac taagatttgc tgctattctt 105961 atcccttttg tagataagaa agcaagctta gagaagataa aacatttgct taagtaactg 106021 cagagcgtag attgaattca ggtccattcg agtcagactg tctgttcacc aaggtgaagg 106081 gctccggggg tcacagagga gcaccatggt gaggaaggcc ccgtaagagt gtggcagaat 106141 ggcaaccagg aaggatctgg tatacgggat ggtgataagg cccatctctc atggccagga 106201 aagtgtgcct gccacagcaa tgaatgaaac tggcaacatt tgacaggaag gggaaattct 106261 ctagtgggag gaatacttcc atcaaactgt tttttttttt tcccagtggg ggagtttgca 106321 catgtgtaga tttttttagc catagccttg aataaggaat tcagaaagtg gcttctagtt 106381 tgaggacttt cattggccac gtgatcacag taggctgatt tcttcatgcc taagttacca 106441 ggctgggagg acaggctgag aactatgcaa gctgttgagt cacttggaag gaagagggtt 106501 acatacagca ggcactttca tggaataata cttccctgaa gagctgcact aatttattga 106561 ggcactgagc tggatgcttc acacacatga tctcattgaa tgtccacagc accctaggag 106621 gttgatctca ttattaactt catgaagtga ctgaggctta gagatattaa atagcttcca 106681 gcctgggaaa catagtgaaa ccccatctct gcaaagaata aaaaaaaatg agctaggtgt 106741 ggtggctcgt gcttatattc acagctactt aggaggctga ggtgggagga tcccttgagc 106801 ttgtgagggt tgaggctgta gtgagctttg atggcaccac tgcactccag tctgggtgac 106861 agagtgagac cctctctcta aacaaacaaa tcccaggaga cattaattaa cttgccactg 106921 gtctctaagc taagtagttg aagagggcaa caaacccagg tggtctgact cctaggcttg 106981 aaggtccagg ttcagctact ccaccagtat ccctggtcag tctttggcca gggcacagaa 107041 ggtggcatat caagttgtgg ggtctgattt ggaaagtgtg agatccctcc tttgtatctt 107101 ttccaaagtc cactcagaag agagtagtgt aaaaggagtg atagcacagc gtggtctggc 107161 tggcatagtc agccgccctg tgatgggcag ggggaagctg ctttctgtcc tcaagcagcc 107221 cctaaacttg taggccagac tcataaatca cacattgata actcagtgtg atgtgtgctg 107281 taatggtgga gaggggggag aaatgccagg agcagctcac ctcccagggt cctggggagg 107341 ctgcacagag gagcctgagt tctcagttgg gttttgaggc agaggaagca gtgaaagaga 107401 gaggaagaat gtcagtgagg tgcggggaga agcaagagag agaagacact tggcaggaag 107461 aagtagttga agtacccggg gatgtttagc ttgaagaata gaaaatttgg aggaggcaga 107521 gaaaaggcag gaaactttaa aagcaggaga gaaagacctc caggttccca aagggagcat 107581 ggcacttctc aagggggtga ggttgacagc attaggagag agagagagag aaagagagag 107641 agagaaagag agagagcgag agagttgtca aggaagggaa ggacccatgg aggggagagc 107701 acaggtgaaa atgagaagtg atgggaaaat gggtgggaaa taaatgaagt tttgaggcca 107761 gttgagaaat cttgcttggt ggtttctatt ttctttgtga agcagaaaaa aagaggaatt 107821 agacagacct gcattcaaat ctcatgaccc caggcaaatt tcttatcctt agcttcttca 107881 ttttgaaaaa ggggtgaatg ataataggac ctacctaata agagttgtca ggatcaaata 107941 tgttagtgct tgtgtaagat gcagtggcac cctggcactt agtaccaaaa aaaagtacta 108001 taaatgttgt cacctattga gggagtccgg gtttgggtgc ttgatacata gaaatatcat 108061 gaatagcagc agagaataca agaagaagag aaagcttggg ggagaagata gagaattatg 108121 cgttgggtat gttgaggatg aggtgcctgt gccatatccg ggtgtaggtg cccaaaggca 108181 actggaagtc atgagaagtc agggatagag gtaaatgttt gggagttgtt aatacagggg 108241 tgcattttta aaatatctga agtatgttac tggtggcact caaagatttt taatgatgta 108301 aggtattcaa caacatggaa ctacgtagtt ttaaaagaaa ttctcttgta ggccgcgtgt 108361 ggtggctcac acctgtaatc ccagcacttt gagaggccaa ggtgggtgga tcacttgagg 108421 tcaggaattc cagaccagcc tggtgaacat ggtgaaaccc catctctact aaaaaaaaaa 108481 agaaacaaaa caaatcaaaa caaaaaaacc gggcatggtg gtgtgtgcct gtactcccag 108541 ctactcggga ggctgaggca ggagaattgc ttggacccgg agtcagaggt tgcagtgagc 108601 caagatcgcg ctactgtact ctagcctagg caacagaatg agactcagta taaaaaagaa 108661 aaaaatgttt ttttctttta attcttcggg ttaaggagag tgtcttagtt tggtgctatc 108721 atgtcctaaa catctttcta atgcatgcaa atctcccttt taagctaaaa gaatgaagca 108781 atctcagact cagagctatg tacctgctag aatttaataa tttaaatttt cattgtgttc 108841 ttcatactca ccttctattt atagcaagtg ttactagctc tccaattctg gaggttgttt 108901 cagataggat gcattagttc tttttttttt tttgagacaa ggtcttgctc tgtcacccag 108961 gctggagtgc agtggcacaa acacagctca ctgcagcctc aacctcccag gctcaagcaa 109021 acctcccact tcagcctccc aggtagctgg gactacaggc atgtcccacc attcctagct 109081 aattatttta ttttttatag aaacagggtc tcactgtgtt tcccaggctg gtcttgaatt 109141 tgggctcaag caatcctcac acctccgcct cctaaagtgc tgggattaca ggcatgagcc 109201 accatgcctg acctgctttg gttcttaaaa acagaatatc tggcgagcct aacaaggagg 109261 ctggtggttc cagggttggt ttaatcaaag acttaacact gtcatcaagc gcctctcagt 109321 tatttgctct gccatcctca aaatgtgatc agtgtctccc ttcttggtcc caggaaggct 109381 gtgatagctc ccagaatcac atactcacag ggcaaagcag gaaggaaggg gacataggac 109441 aaaggatttg tgtttaagta tttcatgtca catttgcatg tggtgcttgg agctgtggca 109501 gctatgtttt gaccatgagg gacaaggttg gcaagctgta aatggcagtg taaaaagagg 109561 gagaggatct gtctccttca caatatcctt gagcccctga attcactaat cttgatatag 109621 ctctatctct tgaaattttg ttttgttaga taattctctt actgcttaaa acgtcttaag 109681 atgggttttc tgttctttcc agtagaatgc attgtatctc acacaatttg aggttattgc 109741 ctcaacctac tctgaatcat tgctcctccc ccttttttcc tggttgatta gcttctcttg 109801 agattgtgtc ttatcttcct ggttttttgt atgtcgggta attttggatt gtatcctgga 109861 cactgtaagt gttgtgctgt ggagactctg gattcattcg tatttctctg aaaagcattg 109921 atgtttttgt tttagtaggc cattgacttg gttggacagt agactgtctc tgggacacca 109981 gctcagatgt cagttctgtt ttttatttta tccttgagtc ttctttggga ctgccccatg 110041 catgcatggt tcaagagtca gccagacatt tgggtggggt ttagacacag aatttggggc 110101 tcttctctct tggctgtctc tattatggaa ttccttcctc actttccaga agccgtgagt 110161 gacctgaaac ctgtcctctg gttctgtaca ccagaaagac ttgaggcttc atgttgaagt 110221 tttagctgac agcagcttgc gttcagggga aaagctgtaa aatgaccatt cccttcttcc 110281 aagtttcaac ctctctgcag aatctgcttt cttttgttca ctttcgagta atttcaggaa 110341 aaaatttttt tttcgttgag agtttttact tgttacctat ggaaaggttg ggtctagctg 110401 aagcttgctt gcccataaca gaaacgagac tcttttctgt ctaacacatg ggaaaactaa 110461 cgcccagaga ggtgaagtca tttccccaat tgtcagaggc acttgagcca gagtgactcc 110521 atcttgagta agggctagga aaataagacc gggacttgct gggctgcatt cccagaaaga 110581 aaggtattcc tagcctctag atgtctacag ttaagggaac agattgataa tgtttactaa 110641 acagagccag gcttggcagt gtccagctat cccgatatct tgagaacaaa ggcattccta 110701 attttgcttt aaggataata atattgatta ttgcaaaaca gagtaattaa gaaaattaat 110761 cctttatcac aaacccttgt agcagaacac atctctccat gatttttaaa atcatatata 110821 tatgatttat atatatgatt atataatata taacatatat gatcatataa tcatatatat 110881 gcatgtgtat gtgtatatat atacacatgc atatatatgt atatatgtgt atacatacat 110941 acgtatatat gtgtatatat acatacatat atatgtgtat atatatacat atatatacac 111001 atacacacac acacacacac acacacacac acgaatattt tacctagggt ggacgtgctc 111061 cttctcttac ttttaggaac cccctactct gtctatggag taactgttct ttcaccactt 111121 tactttctta ataaacttgc ttttgctttg cactgtggac tcaccctgaa ttctttcttg 111181 tgcaagatcc aagaaccctt tcttggggtc tggatcagga ctcctttcct gtaatacaat 111241 gtcacagagc tcttaagtgg cagagcttgg ggccaaactt aggctgctgc acccgatgtc 111301 ccagctcctc tgccctatct agctgtggat ttgtgcctcg aaatagaatt gcattttgaa 111361 tgcagtttca ttttcgtatg gaaagtgtct atcctgtaaa tgtaactgta gcagccccaa 111421 agctgagctg ggcagtttta tggcagcctg gagtcagtct tgtccgcttc caatcagggt 111481 tcctggactg gctcctatat taaaattggt attagtagac aatgttccag gcagagtttt 111541 ggatagaaac tatccatcag caccaagaat gtgcctgatt tggtaatctt tttaccaaag 111601 taaaagtcag tagtaagttg tggttaaaag agaggaagtt cagagttcag tacgttattt 111661 ctctctctct ctctctttgt ttaaataaaa gaggaagtta ctgctcttat ttttgtacca 111721 gcccgtcagt aatgggtaca gagtaaaatc ttagctaagc tgaggtcaaa cttcaaaact 111781 tagtaataag gatctccctt gttcctttgt agggcgacta acatgaaggt tctgactctt 111841 aacccataga gccacaaaaa atagtcctag gaaaatcata tgattttatt tattttagga 111901 acagaaatgg tcccttccga agtagaggca tttctacttc tttcctaaac accatctctg 111961 ctcagtgagc tctcccaccc tcaggtactg tgtttcaggc ttattgctgt attgctagaa 112021 gtatcctcac cttcgagatt aagaaaagat atttattgcc gggcacagtg gctcacgcct 112081 gcaatcccag tcttttggga agctgagacg ggaggatcac ttgagcccag gatcttgaga 112141 ccagccaggg cagtatagtg agaactcatc ttgaaaagaa aaatttattg agctcctact 112201 gtatactggg gatacagcaa taggcaagat agcccttgct cctcatgtaa cctacagata 112261 ataaacaaga ttacaaacaa acaaaataag aggtcgttct agattatgat aagtgatgga 112321 aaagaaaaaa cctgtatcta cttcttatag cagagacctg cattgggagt tgctgtgatc 112381 aaacaggaaa cggaaggtgg agtttgcttc tgtatccctc catccctgag tggaggctgg 112441 gtgcttctca cgaccagggc ctctgagatt tctttttttg ttttttttaa attattttag 112501 attaagggag tacatgtgca gtttcgttac gtggttatgg agcgtgatgc taaggttagg 112561 gtttctattg gtcgcgccac tcacatagtg aacctagcat ctgataggta gttttttgac 112621 ccccttcccc tatgccctct ccctcttggg gtccctagtg tcggttgttt ccatctttat 112681 gcccctgtgt acccaatgtt tagttcccac ttcgaagtga gaacatgtgg tatttggttt 112741 tctgtttctg cattaatttg cttaggataa tggcctccag ctgcatccat gttgctgcaa 112801 aggacacaat ccaaataagc acaatcagaa atgaccaagc tgacattatg actgatccca 112861 cagaaacaca ggagatcctc agggatgact atgaacacct ctatgcacac aagttagaaa 112921 acctagagga aatgggtaaa tttcctttgc acacaacctc acaagattga acgaggaaga 112981 aatagaaatc aagatttatt ttctgttttg tttgtgtgtg tgtgtgtgtg tgtgtgtgtg 113041 tggttttttg tttgtttgtt tgttttttgg acaggatctc attctgtcac tcagactgag 113101 gtacagtagc acaatcacac ctcactgcaa ccctgacctc ctgggctcag gtgatcctcc 113161 cacctcagcc tccagggcag ctggaactac aggcatgtac aacaatgccc ggcatatata 113221 tatatatata tatatatata tatatatata tatatatata tatatgtgtg tgtgtgtgtg 113281 tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtata tgtatattta aatttttata 113341 tataaatata tatctatctt tgtagaaatg gggttttgcc atgttgtgca ggctgaaatt 113401 gagatttcct aatgtaagaa acagagcccc aaggggctct tcttgggctg gtcgctgtat 113461 ttcatcaggt gcctctgtgt tggacactta cttccctcca ttgcatagat tggatccttt 113521 atgcttagcc tagactctgc cctttgggat gcaaggagct gagtgaaggt tctgggagag 113581 taggcttaag cccagaagag ggaggcctct ctctggatgt catgctagag cctgcctttt 113641 agtctttgtg tttccactct ccatcctcct cttctggagt tcagaacccc ttcctggtca 113701 aattgccccc gggctctatg ggatcctgtg ggattctatc ctttgagtct accccaccct 113761 tcaagtactt cctgtctttt cacacctaag tgaaagaaag taggataaga acagcagttg 113821 ttaacttgat tgattgtaaa ttgccttcat tagaaaatgt taattatcgc ccctgggaag 113881 ggattaatcc tttaggtctc taggttacta gactggcaaa agccaagtgt gccttggttg 113941 actgtttctt tctgagagta aagagaaaac atcaggggag gtagaaacct cagtcatggg 114001 gtaatcatta tggccatgtt tatccccata cacgtatctt ctgctgattc tggagtcagc 114061 tagattttgt gacccagcat agatgatata gtgacattgt cctgctgtgt taaaaaattt 114121 aagttcataa gaaatgcata agtacaatct catcaggata aattacacaa catagataag 114181 attgaagacc ttttaactct tttgacgtct caatttagtc ccctctacct tccccagaag 114241 cctattgttt gtaaatttgg tgtctcttat agaactctgt agcattccaa agtctgaact 114301 ttctcccatg catatggcca cttccctgtt gatgaacatt tagattcttt tttggagggg 114361 ttgtgtctat tacaaacagc atggcaatga gcatctttct acacatctca tcaatatgca 114421 agtgtttctc gtgttcctcg gaagtataat ggcttagccc tggggtatac acgattcagc 114481 ttttaatagg ttttgccaaa tgttctccag aatagctgga ccactgtata ctcccaccag 114541 cagtgtgtgt ttccgtctca ctgcatcttt cccagaccat gatattatca ggccttaaca 114601 cttttgccaa tctgatggga gaaaaatggc atttaatttt aatgatcctt tcctgtcccc 114661 ttctcttctc ttctcttccc ttcccttccc cttcctttct tccttccctt ccttcccttc 114721 cttcccttcc ttcccttcct tcccttcctt ccttccttcc ttccttcctt ccttccttcc 114781 ttccttcctt ccttccttct ctctctctct ctttctttct ttctctttct ttctttctct 114841 ttctttcttt cttttctttc ttcatttcct ttcctttctt tcctttctta ctttccgagt 114901 ctcactctgt cgcccaggct ggaatgccgt ggcatgatct cagctcactg caacctccgc 114961 ctcccaggtt caagcgattc tcctgcctta gcctccttag tagctgggat tactgccacc 115021 tcacctggct gatttttttt gtatttttag tagagacggg gtttcgccat gttggccagg 115081 tgggtcttga actcctgacc tcaggtgatc caccggcctc ggcctcccaa agtgctggga 115141 ttacaggcgt gagccgctac tcccagccac ctcttcttac tagtgagact gaacatcttt 115201 ttacatattt attggacaat tgtatttcat cttctatgaa tgtctccctc cccacttttt 115261 tataatttca tattgatttg taataatttt tatttcaatc tactagacac tgatcctttg 115321 ctcgttgttt gtgtttgtgt tgtctatttt catcaagtcc tgggtcctgt gtcactttag 115381 tgccataagg gaatgacaga ctcccttatt attggtttag ctcttgaagc ctaacaaagt 115441 gtttccaaac agttcattat tcccacttta aggatgtgaa gactgaggct cagggtcaca 115501 acttgtgcaa tcttgggtag actttgaatc tctgagcaga cttagaagta gaccaaactg 115561 ctgtgagcca cagggagatt attattgcta acttttccct gttagccaaa cacttctgct 115621 tggtccctct agagagaagt ctttaattat tcatctcttt ccaagtaatt ggcgacaagt 115681 cctgcagctg ggagttggaa taaacttggt tgctaaaagc atccatcaca cataaaacag 115741 gggagatctg gctccaaaga atggtagatg tagatcttcc cctacggatt tgttttctga 115801 agatctcctt atacactgac ctcagccttt tgctgaccta tccttgtgaa ctgaaggctg 115861 aaaaaaccaa tgtagccatg gtccacatcc atttatcact ttcagaaatg gcccctgcac 115921 tttatatttc ttcaggtcct gcaaaaaata ttgaaaatcc tttacaaata gaggcaaggt 115981 caggccatcc ctgtgggatt tccagggagc agtgcactga cccgttttgc agaatgggaa 116041 gcaaaaaata aaactcagac ttagacagct ggcataagca aggacttccc tattctgagt 116101 gccaaatgtc aatttgaaat acttgaactt ttattctctg ggacatgtac tggcattagt 116161 aagaatttct atttattgag catttatttg gggcaggcag atgataagtt atacacatta 116221 attatcttat ttgattttca caatgaccta tcagttacat aattgattgt ctgcatttta 116281 tggataaata acctgaggct cagttatttg acttgctcaa ggtgatgtaa caagtaaatg 116341 gcagagccat gatttaaata acaaaacgca tgctcttaat caccacatta atgtcttgat 116401 atcaattaat agaaccattt cccctccatt ttctctttag tttcagaact ggtaagagtt 116461 tgtctaataa gatttttata aagtggtgac atgaatggct tgtggggatg gagactgggg 116521 ccggctgcca agggtaccca tctcacacga agcatcaggt ccacttgggt ttgtttttgg 116581 ctgattacaa tcttgaatca aacttttcag tacagctggt ctcccaagaa ctttctaatg 116641 ttgatttgag cttccctgaa tgaatctcag aaaataagaa agggagagag aatgcctgtg 116701 tattgagctt gacccatttc acagatgccg aaatagaagt gaggcgactt gcccagggtc 116761 cctgtggtgg attgtgttgt tgtctcaatt atttgctccc acctcataca aggattatac 116821 atttcacaca tggctatgtg acatacatgt ctacttgcag aagggacaca ctttccttcc 116881 cactgatgtg gagcattggc caatggaacg tgagcagaca caatctactc tacatttaaa 116941 cagaggcttt gagaagctat tgtgtggttt agccattgtt cttcccacct ctgccacaag 117001 accaagaaag ggctgttgcc tcagctgcgt cctgggatgg aaaagacaca ggaagtagaa 117061 tcacagagcc acagacaact gcagtgacac gtagcacgag taagagctaa acttcgaata 117121 ttgtaagtca ctagaatttt ggggttgctt attattgcac ttacctacca aaagctgact 117181 aatacagtca tatagccagt aacttgacta agctggagct tgagccaggt cgaactccca 117241 atcctatcca tggtcttttt cttatttcag tcttccctaa tattccattc taacttacat 117301 catattttct gtgtctgtgt gctgtctgtc cttttttctt ttaaataaat ctttttctac 117361 caaagatata gaacaggaaa taatattttt ctttaaagag tctgtttttt catttcaacc 117421 catttatttt gaaggaaatt ttatcatcgt ataaatagaa aaacagtgtc atcacaaata 117481 attcataact ataagcagtg gctcacgcct gtaatcccag cactttggga ggctgaggtg 117541 ggcggatcac ctggggtcag gagtttgaga ccagcctgac caacatggtg acactccatc 117601 tctactaaaa atacaaaaat tagccaggca tgctggcaga ggcctgtaat cccagctact 117661 caggaggctg aagcaggaga atcgcttaaa cctgggaggc ggaggttgcg gtgagccgag 117721 atcgcgccat tgcactctag cctgggcgac agagtgagac tccatctgaa aaaaaaaaaa 117781 agggtaattt aaacaatatt atgaagttct agctaggtac tgttggcaga caagaatctg 117841 attttgaaat atcgtctctc tatgttccaa aaaggagttt ggcaaatatt agaatggtgt 117901 tcaaaacata caagcaccaa actgatactt tctcctttga taaatcagga agattaaaaa 117961 agaatcagaa gaacaatttc ttcctagtat ggcttggtgt gatgagtgtc atctgtattc 118021 catataaaat tctttcagat atcaccaatg agtgatatgt tcattacaat ggggggtgtc 118081 cctttctttg gttcatgtat aataactgga aacatgtata tagcatgtgg tatgtgccag 118141 gtactcttca agatacttta gttatttaat ctattgtcag ataaggaaac agaggcttgg 118201 agaggctaaa caacttatcc tagctagaaa gaggcagaat tggggcttca agtcaggtct 118261 gtttgattcc agaggcaaca tcctaaagca aagtttcctt ctaactgctc acttgggact 118321 tttaaacgaa gccattaggt gcttcacttt cttgtggcat gccagagtgg tagctgcttc 118381 agggcatcag cctggggcca catcaggacc tacctgcacg taggttgatc aggaccctgg 118441 cagggctgtc aggtacagcc acaaagcaag ctgtttctga ctgtgttccc acactttctc 118501 aggcatccac cctggtgtat gaatttgctt cttctttata acaaactacc tctaaactgc 118561 actctagtct acagatcagc tgggcaatgc tgttgatctc ggctgggctt gctcgtgtgt 118621 ctttggtcag ctagcaggtt atctggggcc tgattggtct aggatggctt cacttgcaca 118681 tctggcactt agatgtgtgt tagctggagc aatagggggc ctggaccccg tgggtctcct 118741 tttccagcag gccacttggg cttgttcaca gggcagatgg gcccagttca aagggtctga 118801 gaacacaacg gtactgcagg atgagctttg gaactgctgc accgtctctt ccactgcatt 118861 ctgcatacgg gccctaaaca agtcccaagc cttctcagat tcaagacatg ggggatgggc 118921 tgtatctctt atggagagag tcacatagca agggtgtggc tctagggagg aatgaagagt 118981 ggtcaagtgc agcctgccat gctggctcct tcaccagcct gtcagttata tgagccaatg 119041 ggctgaatca tagtcatctt tctctccaca tgcctggccc gtgaaggatt tgcctgccca 119101 gcagagaccg tggaatgaat gtgtgctcca ggcactctgc cttctctctc ttctttgact 119161 tctctaagct cgttcctgcc cctgtccccc tctgtctctc ttccccagga tcatggcatt 119221 agtcggatcc ttcccatcat tttcatttta gctccagtat cacctctttg agccttccct 119281 gactaccctt ccagcattct ctaatcccat caccttgttt tttttatctt ttcttttttt 119341 ttgagacgga gtctcactgt gttgcccagg atgaagtgca ggggcacaat ctcggctcac 119401 tgcaacctgt gcctcctggg ttcaggtgat tctctggttt cagccacccg agtagttgtg 119461 attacaggca tgtgccacca tgcctggcta atttttgtat ttttagtaga gacagggttt 119521 catcatgttg gtctggctgg tttcgaactc ctgacctcag gcaatctgcc caactcagcc 119581 tctcagcctc ccaaagttat acaggttttt tttttttttt tttttttaaa tcttttcata 119641 gcatctgtaa ctgtttaaaa tattagtttg acttcctttt ctagaatgta agctttggga 119701 gagcggggtc cttgcctgtc ttgttctctg ttacttctcc agctcccaga atggtggctg 119761 gcactcaggg ggtgctcaac acacataatt gtcaagaaca tgctacatca aggcctgagt 119821 ggctttgcca cggctcttct tgtgactgca ggccttggtg ccagcagcca ccccagttcc 119881 aagaaatggt ctcttgctgg ccaacttagc aggaagaact ggcagcgttc ctgttacagc 119941 acttccaggt ggctgctttt ccctgagtcc atgtggcttc tccatgtctc tctgttggag 120001 tcacaagttg tggaaattca cttaggccaa cttagaccaa gaagtggaat agttaatagt 120061 taaatgtcag tctgggcaac atagcaagac ttcgtcttta caaaaaattt ttaaaaatat 120121 tagctggtgc ggtcgtgtgc acctgtagtc ctacctactc ggggggctgt ggtgggagga 120181 tcgcttgagc tcaggagttg gaagctgcag tgaactatga ttgtgccact gcactccagc 120241 ctggatgaca gagcaagacc ctttatctaa aaaacaagaa aaaattaata aataaaagtt 120301 aaatataagg tctctggagg tatcctgttt gggttcagat tctggctctg ccacttccta 120361 gttgggtcac ttttgggaag ttacttaacc tctttgtgcc tcagtttttc acctacaaaa 120421 atagagcaaa gtgtaaaaat atgaactacc tcacagggtt gtcaggaaaa ctaaattttg 120481 tgtatggaaa gcactgattg ctgagcagat ggtgagcatc tctgagtagg agctcctgct 120541 gccattatta ttattattat tattggaggt tatcatgtgt gattgaagag caggaacttc 120601 agctgggctt caggagcaac tggttagaaa aacggaagac ttggcccggc gtggtgtctc 120661 atgcctgtaa tcccagcact ttgggaggtc gaggcgggca gatcatctga ggttaggagt 120721 tcaagaccag cctgaccaac atggtgaaac cccgactcta ctaaaaataa aataaaaaat 120781 aaaaaaaaaa ttagctgggt gtggtagcgg atgcctgtaa tcccagcatt ttgggaggtc 120841 gaggtgggtg gatcaactga ggtcaggagt ccaagaccag gctggccaac gtggctaaac 120901 cccgtctcta ctaaaaacac acacacacac acacacaaaa aaaaaaaaaa aaaaaagaaa 120961 aaaaaagtta gctgagtgtg gtggtgggcg cctgtaatcc cagctacttg ggaggagact 121021 gaggcaggag aattgcttga gcctgggagg tggaggttgc agtgagccga aatcgtgcca 121081 ttgcactcca gcctgggcga tggagtgaaa ctctgtcaaa aaaaaaaaag agagaaaaaa 121141 gaaaaaaaga aaaatggaag acttatcggg aatgcaggca gctgtgcgat ttcagcttct 121201 ctttgttttt ggaccatggg ttttctggtt cttctttggt tcctgcaggt ctgactttct 121261 cctttttctg ttgatggtct ttcttggcct gtgtttgcaa tgggtcaacc atgtgctttc 121321 tggccatact tcacacttac ttcccaactg ggacagatag catttcttaa gtctaattcc 121381 aaacttctgg gataatctga ttgcccttca ttgagtcaga agccagtgga tagggacatg 121441 gaagatgcta tagctgggct ccacccttgc aggtgagagg agcagtttca ggaaagtctg 121501 ccaggcatga gctgagacct taagagatgt cagctagaat tagccactgt cagtatatct 121561 gcttctgggt cccactcttc atcctgcttt tcaaggtttt caaatataca ctcctactca 121621 gtctcatgaa aattattatt tttaaaatac cctttagcac atttacctct aatgatacta 121681 gattcttgcc tcctgcctct gtaaatcact cggtcacttc ccgatcttat atttggctgt 121741 caagaatgtg gaaatctcac agaacccttg aatttcaaag ttggaaggga aaattgcttg 121801 tctatgctta taaaaaatac acacctacac ccacgtacac atgtatttac taaaatagta 121861 ttaggctaaa tattcagttg tacagctttc tgatttcatt aacaatgtct gtctttccgt 121921 gtcagaagag atagacattt gaaaaaacgg cagcgtagga ttttgttcta tggatgagct 121981 aaaagttctt tcacctgtcc acaaacttcc acctagtccg tgtgccttat aaaacacaaa 122041 ttctgaaatc ccatgtaacg tgtcagctag attctgccta tcactgagta gagatgaacc 122101 atcacatctc atgttctaga tttggggtta caaattcaaa ttcctacagg ggccaggcaa 122161 ctgtcttaga tgtgtgaaac gtcctagata taagaaatgc ctgtaaatga aaccttaaga 122221 gtttttataa gcatacaaat aagtatgtaa tattttgaaa catacttatt ttttggtgta 122281 cccacacagg cctagcatca ccagattctt ctctcttttt tttttttttt gttttgcccc 122341 aagagaggca ggaaatgagt atatttatgg gagctttcct aattgtgaag tgttggtaag 122401 ttcattattt ttaaaaaact aaaaaccaac ttgcaggcca aacaaaatgt gggccaccaa 122461 tgtgtaatca ctgatgtaga taatgcactt caattaatct aaaatcatca actattttgg 122521 cagccctctc agactgctga ctcactgagc ttttcatctg atgaaatcct taagtgattc 122581 acaaatactt aatctgaaaa cctcttaatc taacttgtgt tcctggactt gtaaattggg 122641 tttctggacc ttaatacttg ctcatgttaa atccattctg ttacttatgt ttaaggtcct 122701 acatctgtct tccagcctga tcacccatct tttcagccat gtgttgtttt tgaattcaac 122761 tcatttgatc ctttctctat gtcctttttt tttttttttt tgagacgggg tcttgctctg 122821 ttgcccaggc tggagtgtgg tagcacgatc aggactcaat gcaacctcga cttcctgggc 122881 ttaagcgatc ctcctgcctc agcccccaca agtggttggg actacaagca cgtgccacca 122941 cacctggcta atttttttga attttagtag agatgaggtc tcactatgtt gcccaggctg 123001 gtcttgaact cctgagctca agtaatcctc ccagctcagt ctcccaaagt gctgggatta 123061 caagcgtgaa ccaccatgcc cagcctctct gtgtcttatg ataaaaaaaa tactggatga 123121 gacaagactg aggaggaaga cccagggacc ctttttagca gagcttacca gcgggtgtcc 123181 aaggattgaa tttcccttcc tagacatgtt ttgcttcaac taaaaagata tatatatata 123241 tatatatata tatgatgtat atcacacaca tatgtgtata tcacacacat atgtatgtat 123301 atcacacaca tgtgtatatc aaacacatat gtatatcaca cacatatgta tgtatatcac 123361 acacatatat gtatgtatat cacacatata tgtatgtata tcacacatat atatgtatgt 123421 atatcacaca catatatgta tgtatatcac acatatatat gtatgtatat cacacatata 123481 tatgtatgta tatcacacat atatgtatgt atatcacaca tatatgtgtg tatatcacac 123541 atatatgtgt gtatatcata tatatatatg catgacccat atttaaaagt aggatatttc 123601 ataacgaaat accagtatta ggtatctttt gaaaaagtgt cgtaactgac agcccaagtc 123661 tgcattcctt catggcacca gggctagaac tgagaagtag ctgacttttt ggggtggata 123721 tgggctcccc tagtcatgct atatgcctca ttttatgtga ctggccgggc cccataggcg 123781 tttatgtttg ccactctttt agaggtccct ttaaggttga catatattca ttgattgatc 123841 aactcattta cacacgcatg cagtcatttt ttggacccct gtctatgtgc tctgcacaat 123901 gcgaggtgtg ggatataatg gtgaccaaga cagattccaa ccactcactc acagatccca 123961 cagcctacca gggaagccag atatggaaca attatatacg taattttact tgtaagtgcc 124021 acaaagacaa attgtattaa gtagtaccat gtggctgtgg aagtttaact tgtcactaat 124081 ccaccaaact ctgagaagac agcatatttt ccgtcattcc caaaagacta tcacaggcaa 124141 cattgaggaa tgctttgtta aactccaggg atatattcca catcacactt cccccgtcca 124201 ccagattagc aactgtccaa ggacaaaaca aaacacacac taaaaaggaa atggggttag 124261 tccgatatgt ctccatctcc ttatgctggt tcttttgtct gcagaagcca cggtttcttt 124321 ttctttttga aaaaaaaaaa tgacatttgc gtacctctct gcgtttctct ttctccagga 124381 ttcttctggg atcctaggca gtgtggtttg gtgatctcac tggtgagatc tcttggagcc 124441 ctggaatgca attcatctgg gccagaagag ttgaactaat ttaaggacat ctgagtgctt 124501 cactgatgta tttgttctgc tctgtccctt caaaatacca ctactcaaaa caatgttcag 124561 ctggagaaag tggttaacac tggaattttc tttttttctg agacctaagt gacagggaag 124621 ggtggatgaa ggaaacatat cctttgaaaa tcctttagtt agaatccttt ggttgcaagt 124681 aacataaatg aactctgatt aatgtaagca aaaaagaatt tgttagaagg atatgggggg 124741 ggagggatgg acagagatgt gaaggaaaag atgaagaacc aggctttaga aagtcctgaa 124801 accaaatgaa ttctagggat ctaggaagca gggactaatg actgtttcct tcaggacacc 124861 agagatgggg tgaataactc tgtaaaagat gcaaatttcc cacctgggta acatagtgag 124921 accctgtcct taaaaaaaaa cagtagccag gcacagtgcc tgtagtccca gctatgtgga 124981 ggctgaggca gaaggatcac ttaagcccag gagctcaaga ctgcagtgag ctatgatggc 125041 actgctgcac cccagcctgg gcagcagagg gagaccctgt ctcttaaaaa acaaatgcta 125101 aattttagag agaatagaga gaatcttttc ttaaaaaggt gctaagtctg gggaaggagc 125161 ttctgattgt cctgccttga gtcacacgac catctctgac ttgttgaccc ctctacaatc 125221 ccatataaga ctccccaaga gtagattctg agcaggtaaa gcccatgggt gttcccaaaa 125281 gtcggcctca acctgaagcc acttattcac tctgagctca tcaccaatgg catgccgcac 125341 tgtcaaaact ctctttctgt gcatttgtct gggccactca actcaggaaa gttagttgtt 125401 ctgtgtgagt ccccatgttc tcactgattt catttcagtc tttaagctaa aaatctgtgt 125461 aaatggtaat ggagaatggc tacatcacac ttagtatgtg tcaaagcagt atcctaagtg 125521 ccttagtggt atcagcttat ttaagactga cagtactctg aggtgcagtc acttcttttt 125581 ttttttttcc tttttttcaa aacagagtct cactccgtca tccacactgg agtgcagtgg 125641 catgatctcg gctcaccgca acctccgcct cccagggtca ggtgattctc atgcctcagc 125701 ctcctgagta gttgggacta caggcatgca ccaccactcc tggctaattt ttgcattttt 125761 agtagagatg gggtttcacc acgttggcca ggccggtctt gaactcctga cctcaggtga 125821 tccacccacc tgggcctccc aaagtgctgt aattacaggc gtgagccacc gcacccggcc 125881 tgtcacttct tgtcattcac agtagttatg ttctgtaaag tcactgcaaa cactgaatta 125941 ataaacactg aaccattgtt cctaggtgaa atacaaagtt gggttcctgt gagcctttgg 126001 tcacagcatt ttcatccact gatcaacccg taagcttatt ctgtgtgtgt ttctgtttaa 126061 agacacctta atatatattg ttgactcatt aacatctaac tcactgccaa cagcactcca 126121 actcgtgcct gatcaaagct tatataacac acgcatgaca catctcggac tccttgcact 126181 taggaacacc agatagcact tcaccattac cctatgggac cattttacag caacgtcacc 126241 ctcaaaaagc agaaaaatgg gaaaaagcat gctgctaata taccacagag agaatgcttg 126301 tttacagcat gagagctgaa gtctgaaggt agaacatggt tttgttcatc ctcagctggg 126361 aacacatgtt ggctgactca aatattttgc cactccgtac atgtccgtga gtgatcatga 126421 aagaaccttg agtatggatt tggcggttat aaataaattt tagcaagagg tgaatttgca 126481 aataatgagg attgactata acccctgtta ttagctctag ttaacagatg ggggaatgga 126541 ggtaaacagc tctctgcagg tcttgcagcc agggagtgta gaggaggctt tgtcattctg 126601 ggtgcacagc ctgcgtcctt aagaggcaat gtagtgttgg ctgggtgtgg tggtggttca 126661 tgctgtaatc ctagcacttt gggaggctgg ggtgggtgga ttacttgagg ttaggagttc 126721 aagaccaggc tggccaacat ggtgaaaccc cgtctctact aaaaatacaa aaaaatcagt 126781 ggggcgtggt ggcacgcgcc tgtaatccca gctactcagg aggttgaggc aggagaatca 126841 cttgaacttg ggacgcggag gttgcagtga gctgagagcg tgtcactgca ctccagcctg 126901 ggcgacagaa tgagactcca tctcaaaaaa aaaaagaggc agtgtagtgg ttgagtgcag 126961 tgtagctcct gctgagagct ggttttcatg ttttgtgggc acacaggaag tagagaatgc 127021 tgagagctgg gcactgaagc agtaggagat atgggagtgg gaaagtggtt ttctttagta 127081 agtctgggct gttttcatag cctaggatga agttcagcaa tactcaacct tgaggttttc 127141 caggcacctt ctccttgaca tgaatctcct gcaaacagat gtcatagtaa tctgggaata 127201 aacctttgtt gttggtcctt tagctatcct tttttttttt tttttttttt tttttttttt 127261 tttttgagac ggagccccac tgttgccagg agtgcagaag catgatctca gctcactgca 127321 acctccgcct ccccagttca agtgattctc cagcctaggc ctcccgagta gctgggatta 127381 caggcatgca ccaccatgcc tggctaagtt ttgtattttt agtagagacg gggtttcacc 127441 atgttggcta ggctggtctc gagctcctga cctcaagtga tccgcccgcc tcgacctttc 127501 aaagtgttgg gattataggc gtgagccact gagcccggtg ccctggtcct ttagctttct 127561 aagacagcag catagtagtt cagagcatag gctcttgggc catagcttcc tgagttcgaa 127621 tcctagctct gctgtttttt aaactctgtg accttgggca agtcttaacc tctctgtgcc 127681 tcagttttct tgtctgcaaa atgggattaa taaaaacacc gatctcatag gattgttgtg 127741 aaaatcgagt gagtatgttt atagtgttta ggaaagttcc tggtacacaa gacatgctct 127801 gaaagtgtat taatattatt tgtccttaga ctcagtagta ctctggggga atcgaatccc 127861 cacccctaag ccaggcagca tggtagaagc taggtcagct ggcaggctcc tgggtctttt 127921 ccattctgct gcaggagatg cctgactcac ctcttgtgct cacacagatg acagctaaat 127981 agaagggctc tttcctccta gtattagctt aattataggg gctccctaga acagtgggga 128041 aaatgagctt tgagaatccc cccaccacac acacaaggcc cacactagca cctgaaagag 128101 gagtgaacat tgcttcctgc ctccttctcc ccctccttct cacatatgcc tttgtctctg 128161 tgattctgca cgtttaattt ttaatcccgg gatgggttct attagttttt gtgtattttc 128221 tctgccctgg tttgagctcc aaaaggtctg tttgggtctc tgtgggtgtc tgacagcctc 128281 tgcaggcctt tctgaattat ataactcaag ctttgtattt tctatttgtt actgggggtt 128341 ctttctgtgc tataatttta ctcttagaaa atgaagtagg gtactagaat tttctctggg 128401 atcccctagg gtttccttct gaatattttt tcctgccata ttttcttcgc taaaacattt 128461 ttttttctta tttcagaagc aatacatctt cattatagaa cattacaaca agatgcagag 128521 aaagaaaata atcttatcaa gcagaaatgt taataccttg ctatctaccc atgtctctct 128581 gtctctgtct cctcaaacac acacaaatgg ggtcaagatg acataagaga ctatctgctt 128641 ttcaattcac attttacgat cttttttttt gtatggttct gtagtctcca ctgcttggct 128701 tgacgatctt taacaggtgc cctccggatg gtcatggggg tattttccat ttctcactgt 128761 tacaagcagt gctctgatga aacacctttt gctggcattc ctaattatgt cctgagagca 128821 gacttctaga agttttcagc agtgaggtta aagggaatgt ttgcctctcc tcttcactct 128881 gtgatgctcc tgggggccca ggcccatgag tctctgtgtc catcttccca agatgaaaga 128941 caaagaatgg gcagagcatt gacagggccc tgtgttgact ccttttctct cttttcccag 129001 tttttttttt tttttttgag acagagtctc gctctgttgc ccaggctgaa gtgcggtggc 129061 gcggtcttgg ttcactgcta catttgcctc ccaggttcaa gcaattctcc cgcctcagcc 129121 tcccaagtag ttgggattac aggcactcac caccacaccc agctaatttt gtatttttag 129181 gagagacggg gtttcatcat gttgcccagg ctggtcttga actcctagcc tcaagtgatc 129241 cactcacctt ggcctcccaa agtgctggga ttacaggcgt gagccactgc accctggccc 129301 attcccgttt tttaaaaaaa tattttattc cttttagact atttagccac agttttccat 129361 tctattcctt gggataagtg aaaacttgat ttttaaaaaa taattacaaa attttaaaat 129421 catggtaaaa aacacataac ataaaattta ccatcttaac tattttttta agtgtacagt 129481 atcattttaa gtacatttac attgttgtgc aacagttagg acttttcatc tcgtaaaact 129541 gaaactctat acccactggc aataacttcc catttcttcc tccccaccga gcctctggta 129601 accaccattc tgcttcctgt ttctatgagt ttggctattt tagatgcctc atgtaagtgg 129661 actcttgcag tatttgtctt ttcatggctg gtttcttcct ttttttcttt tcctttcttt 129721 ctttctttct ttctttcttt ctttctttct ttctttcttt ctttctttct tccttccttc 129781 cttccttcct tccttccttc cttcttcctt tcctttcttt ctttcctttt tttttttttt 129841 tgacagggtc tccctgtttc tcgcaggcca ggctggagtg cagtggggca atcacagctc 129901 atggcagcct tgacctcctg tgctcaaggg atcctcccat ctcagcctcc tgagtaactg 129961 ggacttcagg cacacactac catgcccggt taattttttt cttgattttt aatagagaca 130021 aggtcttgct atgttgccca gggttgtctc aaactcctgg cctcaagaga tcctcccatc 130081 tcagcctcct gagtagctgg gattacaggt gcacactacc atgcccagtt aatttttttt 130141 cttgattttt aatagagaca aggttcttgc tatgttgccc aggctggtct caaactcctg 130201 gcctcaagtg atcctcccac ctcagcctct caaggtgaag ggatgagcct cccaaagtga 130261 aggcgtgagc cattgctcct gggatgactg gtttatttca cttagtgtaa tgccctcaga 130321 gttcattcat gttgtagcat atgtcagaat tttcttcctc tttaaggttg attaatattc 130381 cattgtatga ataggcaaca ttttgcttat atattaatcg gtcaatggac atttgagttg 130441 ctttcacctc ttggctattg tgaataatgc tgttgtgaac atgggtatac aaatatcttt 130501 tttttttttt tttttgtaga gatggggtct ccctatgttg gccaggcttg tctcaaactc 130561 cttggcccaa gcaatcctcc tacctcagcc tcccaaagtg ctgagattac aggtgtgaac 130621 tactgcacct ggctgagact ccacttccaa ttcttttgga tatagatcct gaagtggaat 130681 tgctgatggt agaaactgtc tcgggaagat ttttctgggg atttggcttc taactgcacc 130741 acagctccag tggaaacatc tattcattaa actggcattt attgaggacc agctttgttc 130801 cagtcactga acaaggctgg gggcatagcg tggcttacat aggtccccac cctccatgga 130861 gctgatattc agcaagaagc cagacaatac acaggtaagc aaattgaaaa aaacaggatg 130921 agatccactg gcgataaatg tcatgatact gaatcaggat ttgacaacag agtgatgtgg 130981 tgagtggcta ctttagatca gaaaggcctt cctaactggg tgacaggtga cctgagaact 131041 acaaggtggg gaggagctag ccacacagaa tttgggaaaa taccattctg ggccctggga 131101 tgagcaatgg aaaggtcttg aggaggaaag tggctaggcg tgatccttgt tgctggggcg 131161 tggtgggtgt ggggccagat cataaggggc cccacaggat gagcctctgt gctaagtgca 131221 gtgggaagcc actggagagt tttaggtagg cgagatgcaa aggatcagat ttgtatttaa 131281 cacaccttcc attggctgcc aagtagagaa tggattgttg gggactggag cagaagtggg 131341 aggatgagtc aggagtcaat tgcagatttc taggtgagag atggtgtctt tgtctgcacg 131401 ggcagcagtg cagagggaga gacacaggca gatttgggag gtaaaaggac ttactggtaa 131461 atgggatatg aattggaaga gggaagcaaa tgcacagctg ggcacctggt acactttaca 131521 tatgacgtct ttgaacttcg tagcaatccc ataaccacat tttgtagatc tgtgttgtcc 131581 agtagaaata taacatgagc cataaacctg agctgtgtat gtaatttaaa atgttcttag 131641 caatcacatt aaaatggtca aaggaaacag gtgaaatgaa ctgtgagaat gttttcttta 131701 actcagcata ttccaaatat tcccatatca acatgtaatc aatataaaat tattaatgag 131761 atattttact tccccgcata gcaagtcttc aacatttgat atgtatttta cacgtacagc 131821 acatctcaat tcaaccacac ttcaagtact cagtagccac atatggctag tggttaccat 131881 attggtagca cagttataga tgaggaaaga gagtctcaga gagtataagt tcttatgcaa 131941 gcttatagaa ttaggacttg aaccctaatt tgactctggc acccgtgttg tgctcaaagc 132001 actgcatcca cccccaatga tgaagacatt ttcaatgtaa gcaaaagtcc tggagatttc 132061 cccacatcat atacatgggg acataggaac taacatttgt catatattta tattttcctt 132121 tccaaagcag atcaacccca agttaccttc ccttagctct cgttttctgc ctctctttgc 132181 aaatatcact tcattaaaaa ttgtgaacat tctctttgcc ttcccttatg ggcttcctgt 132241 ttgaacacca catccagggc accctgggaa ctgaaccatc tccttttctt tttctttttg 132301 agtcagggtt tcactctgtc gcctagactg gaatgcagtg gtgtgaccgt agctcactgc 132361 agcttcaatc tcttggccta aaacaatcct cccacctcag cctcctaagt agctgggact 132421 acaggaacac accaccatgc ctggctaatt tttttatttt tattttttgt agctgcaagg 132481 tctcactgtg ttgcccagac tggtctcaaa ctcctggact caagcagtcc tcccacctca 132541 gcctcccaaa gtgctaggat gacaggtgtg agccacaatg cctagcccat atctcctttt 132601 cttttgtggc attctagtag ccctttggtt catgcgatca tttagttgtt cagcaaatat 132661 ggatgattga atggatataa taggtgcagg gcactcccag gttctcaaga tggatgatct 132721 tgggcaagtt atttaatttc tctggatttt agtttccttg ccagcaaact caagataata 132781 gtaatgccta cttttaagta gttattgtga agattataag gaattaacaa taacgataat 132841 aataagtaat aatgatacat gttgtttatt agagcctgct atatgccaga cactaactct 132901 atgatttaac cttctcctgc taccctttac tcttattatt tctttaggag tgaaattcac 132961 taatcattaa gctttttaac taactgaatc aattctgttt cttaagcaga tgggagatcc 133021 tgtggctttt taccactatc cacagctctt tttttctttt tcttttcttt tttgagacgg 133081 agtcttgctc tgtcacccag gctggagtgc agtggcatga tctcggctca ctgcagtctc 133141 cgcctcccta gttcaagcca ttctcctgcc tcagcctccc gagtagctga ggttataggc 133201 gcccgccacc acgcccagct aatttttgta tttttagtag agatggggtt tcaccatctt 133261 ggccaggctg gtctcaaact cctgacctag tgatctacct gccttggcct cccaaagtgc 133321 tgggattaca ggcatgagcc actgtgcctg gcctatccac agctggtttc ccctctttgc 133381 ccccttttct ggcattctct tctgattcac tctcaactct gatttccaag ctacccagct 133441 ccttccccta gtgtgaatct gtatttgcac tggatttatt tttaaatggg aaatgaaagc 133501 cactgtgtgc tggaaaatat atatttcaga cacgaggaag ctggcgggga aatatttttt 133561 ggatgtcttt ctagatctct taaaaatcct gtgagccatc accaggttga tcttgtagta 133621 aggtggcctt ggccttagtg tatttctttt tgactcgata ttgggcggca ggaggtttga 133681 ggattgtgat actaacaaca ttaacaacaa caaactttca ttgattgatt gccatgtgct 133741 ggaaatgtgc taataaatta catttatttt ctctttgaat cctcaaacca actctttgag 133801 acaagtgtgt tgtattacag ctgtagaaac tgaggtccta agaggtcagt agatttgctg 133861 aaggtcacag agctgagatc caaatctaag catcacagca agaagtatcc ttaaccatca 133921 tctttgctgt tggttggcat tagtctggct gtgctaagaa agtttccttt gacttgcaga 133981 tggatcaagg cttttcttga gagaggcttc tcttgcctgg ttctccttga ttcttttact 134041 ctaggagact cagttcttct caaatccctg tctgtgggac tttgggcaaa acattcaacg 134101 tcatctttac ctcttagagt cactgcgagg gttggactat ggtgtgttag gtgttttttt 134161 tttttttttt tttttttttg atgaagtctc actctgtcgc ccaggctgga gtgcaatggc 134221 atgatcttgg ctcactgcaa cctccgcctc ctgggttcaa ccgattcttc tgcctcagcc 134281 tcctgagtag ctgggattac aggcacgcac caccacgcct gactaatttt tgtattttta 134341 gtagagacga ggtttcacta tgttggccag gctggtctcg aactcctgac ctcaagtgat 134401 cggcctgcct tggcctcccg aagtgctggg attacaggcg tgagccaccg tgcccagccg 134461 tgttatgagt tgaacagatt tcttggcttc agcaggggct cagtatgggc ctatcatcat 134521 tttcatttat ctccttttat gcctcatgct ctatctgtcc ccttcatctc tgcctgtagc 134581 agccacagac cctctctttc cttggtgcat gtccctcctt cccactttcc taatttcagc 134641 ctgacttagt tctctacctg tcagcttgga cttaaggaaa ggaagttctc ttttcttggg 134701 gggtcactgg ggccatgggc gtctttccac tccatttctt ttccaaagct gcctctggaa 134761 cgaatctgta aatcccaggg cattgcatga gttttctttt ctttctttcc ttctttcttt 134821 ctctctcttt ctttccttct tgctttcttg ctttctttct tgctttcttg ctttctgtca 134881 ctgaaagcaa atattgtaga cagatatgct tgtaggatag aaataaaatg ccccattctt 134941 tgaaacatat tttgtttttt attttagatt catggagtat gtgtgcaagt ttgtcacatg 135001 ggtatattgc gtgatgctga ggtttaggca atgaacttgt tacccaaata gtgagcatag 135061 tacccagtag gtagttttta aaccgttacc cccttctctc cctcctttcc gagttcccag 135121 tgtctgttgt ttccatattt gtgtctgtgt ataccccttg tttagcttcc actataggtg 135181 agaacatgtg gtatttggtt ttctgttttt ctgtgataat tcacttagga taatggcctt 135241 cagctgcatc catgttgctg caaaggacat gatttcattt tttttatggc tgtgtagtat 135301 tccatagtgt atatgcacca cattttcttt atccaatcca ttgttgatgg gcacctaggt 135361 ggattccatg tttttgctgt tatggacagt gatgcaataa acatgagagt gaaggtgtct 135421 ttttggtgga atgatgtgtt ttcctttggg tatataccca gtaatgggaa tgctggattg 135481 aatggcaatt ctatttttag ttctttgaga agtctctaaa ctgcttccca gaggggctga 135541 actaatttac aatttcacca acagtgtaag agcgtccttt tttctccgta atctcaccaa 135601 ggcactgcat aagctatctg agtggccagc aggcagaatg cttgaaggat tctccagctg 135661 ggggtgctgg gcccaaaggc tggcagctgt atggtcctgg cctttttctc tccagcaggg 135721 tccccatgat cttcccaggt ctagtttttt cttgagcaca cctgcatctg acagttaatt 135781 gcaagaatct gacttgctga actctcattt ctccctcctc ctttagcgca tcaagtcccc 135841 ttctttgttt tcccagcttg gttcttgcaa ggactaccta gaggaggact tcaggcaatc 135901 cctttctccc tggtgggctg ctgtgttctc accaataaat caatgctctc tctagtttca 135961 ggctgcctct taggacagat gctgatacca agtgactttt gaggttgttg agttgagaca 136021 cttggaccaa acagcggaag gcacagcttg gctcagctaa aggaagctcc gtgaaagctc 136081 tctgagtggg cctattggct ccatgagttg gcaggctggc cctcctctag gagatggtgg 136141 gtcctcatgg gctgtagcag ggccaggcca caggagcaga ctgctggagg agaacattta 136201 gggccatctt ccttgctgcc tttctgtgat ctcttcaccc aggaagctga atgctgttgg 136261 caagcaaaag ggtgcttggt aaccatttct tttttaagtg tgtgctggtg ccagggaggg 136321 agaaatcctc aaaacccatt gtgtctgact caggcaccac tggtttggcc ataaccgcta 136381 gtttatcctc cctccctctg ccaccctcgt catgtttctc gaccacaacc tgtctggacc 136441 ttccctgggt tggccagggc ttgataatct ttttctgtaa ggagttatcc cttcctcatc 136501 tgccaggatg gggctactgt gcagggactg ggtaacaggc ttgaagttgc aagagggctg 136561 gaagggtctc actagcaagc acacattatg tcttatatag gtccctggcc ttgtgcttca 136621 gaccctatca cagcttttcc acagctgggg ctgttgagaa tgtaagtggt gacatagctg 136681 gatggctaag atggggggag tatgagacct gcaagcgtgt gagtctcctt ttgctagcag 136741 agtgtgaatg aaatgtattt attttcccaa taaattcgac atgactgatc attcatccac 136801 ccacttaccc acccatctac tcactaaccc acccactcag tcatctacac agctgttcac 136861 tcgtccatcc atccatccat ccatccatcc atccatccat ccatccatcc atccatccgt 136921 ccatccacat tcaatgtagt gtaatgatta agagcataga gtttggagtt atcctgctta 136981 ggtttgaatc ccagctctgc cactcaccaa gcatgtgatc ttggttcagt tttaaaacct 137041 ctccttgtct cagtctcctc agctataaaa tgagattgat gttgaaatta ataacagagc 137101 ctaccttacg gcattataaa ggttatgtga gtcagtatgt gtaaggtgct ttcagtagtg 137161 cctgggtcat agcagtgcta tctaagtgtt agctattagc attatggcat ctactcacct 137221 gtttatcatc tataatagtc attaaaggtc ttcaataaat atttctggtc tttggccttg 137281 caggccaatg gtaggattgc actgcttgcc tcccttgtgg gtaggtgggg ccgtatgagt 137341 agctctgacc agtgagttgt gatccaaagt gatgtatgtt cacttttatg ctgggacatg 137401 taattgctgg ttctagatta tccagaaaaa atctttgagt tatggctgct ctatcaactt 137461 gctcctggag acatagaact tcttccttag ctgatcaggc atgaggctgt agtatgtgag 137521 aaaaacaaac ttttacctat tttgagggtt attactgcag gataacccag ttctatctct 137581 gtctctctca tctatcatct atctatttac aataattcca gtctcatttt atttcccttc 137641 ccttttcttt ttggcaccta ttactagctg gcattttctt atattttttg gagtttcctc 137701 ccaaaattta tttgttatta attataagaa tatgttttcc atgagaaaaa ggacttattc 137761 acttatctaa ttccaatagt tagaatggta ctttacacat agtagacact cagcacatat 137821 ttttaaaaca ataaatcagt taatgaatgg atctacttga cttttagtag gtcctattga 137881 acactgcatt ccacagctcc ttgaaaatcc ttactgaggc agctaatatt taagaaggaa 137941 caccctgacc tttcactgtc cagttttatt gcctgaatca agaaaaatat ccctgagata 138001 gacctcactg cctctctgca aacctcttaa tacccttagt caattgtaag aagctgcttg 138061 gaaccctgta agatggtctt tggccagatc aagtcttgct cctcttcctc tgaatcttgt 138121 aggggccttt aatctttctg taactgtgat tccccagaac aggatgcctt tgttagccta 138181 gcatcttttc tgttacttct atgaaacaag gagagacgct ccctgggacc cttaactatt 138241 tctggaagca aaatgaccct tgccccactg aagacttagg agcctggtca cttccaaccg 138301 ggcttttcat ctttgtgact cccagtcatg ggcaagaggc tagctagctc caatcctaat 138361 gcccgtgcca tcaactgctt catttctggt ttcttccttg cttttgtgat accctcattg 138421 actctttcca ctgtaccctg tggacttgcc tctatgggac acttcctccc catcctgcct 138481 gaagaggaaa gagcctgcct ttcttctggg aacatcatcc cattcagaaa caacaacctt 138541 tctggagtca ggagtttgct ttcctcttgt tctttattgc tgcttcctgg agattgcatt 138601 cttgtttaaa agctcacctg taagactctg atcaaatgtc cattcttccc tctcacttca 138661 tccctgaaga tgaatgggac atttatcttt atatatgtat gtatgcctag gggaaaaaac 138721 tgaaggaaca ttatgtatat atataatgcg tgtgtgtgtg tgtgtgtgta tatatataat 138781 gtgtgtgtat atatatatat aatttttttt gagacagggt cttgctagac tggagtgcag 138841 tggagtgatc atggctcact gcagcctcga cctcctgggc ttaagtgagc ctcccaactc 138901 agcctcccta gtagctagga ctacaggcat gtgccaccat acccagctaa ttttaaaatt 138961 tgtatttatt attatttttg gtagagatgg ggtctcacta tgctgcctgc gctgttcttg 139021 aactcctggg ctcaagcaat cctcctgtct cagcctccca aagtgctgga aatacaggca 139081 tgagacactg catccagtct atgatattct tatgttttta caatgtgctt attattgagt 139141 agggaaatta tgagtgatgt gtgtgtatgt atgtgtgtgt atacagatgt atgcatctgt 139201 gttcaatttt tcctgttgag agtgagtaaa caaatatttt tgaaaataat aaaatacata 139261 aatgaggggc aggggctctc ttggacactt ccaaaagtac tttgctcaat tttctatgtt 139321 aggcctatct gcaagaagca taagggtttg tcaggagcta ttatgattat ttattctaca 139381 aatcccagtt caacaaagaa gggtgaggct tgagtgttca attagagaag ctggccagtt 139441 gtggtatcat atcagctcta catccaatgc tcagttttca aggggtcatg ttgctctgaa 139501 aggcctggtg aggttgatga gtgtcttaat gaagccactt gtgaaagata atgtgacatg 139561 tttgtccatc acattgacgc aggaaaggag ctttcacaaa agacagcaca gaaattttgc 139621 aggctcggac cagaagggaa tttgtcagtg tttagcttgg ctgacatccc acaccctacc 139681 ttcttgtgca gttagaatcc cttctagtgt aatgtgtgaa aaaccatagg ccccctgccc 139741 acaccctcct agagctgtat caccaacagc tttcaagtgt gtgcaaattg ggagacggct 139801 ggatcccaca tctctgggtg acatttgctg aaacaaatgc ctgttaataa gtggcacgat 139861 tcccttggtc tcagtgataa agatgtacct gttactggga gcccttgaga gaaatctttg 139921 gtgaaggcct tcacactcat tttaattggt gatagacaga tgtgttgaga aaacttacat 139981 tttttttctc catccgtctc aattagaatg ttatgaaata gcaccccctt tgctcttcgt 140041 gctttatgcc ctagacttta aagactgaca acgagtggtt aaaaaatctc cattgcttca 140101 ggatgcagta tcagctgatt tggtggatta aatcgtttta aggtcctgag ctttaattct 140161 tacttcagtg attctcaaac cttgttttgc aaaaccccag agctttaatc ttaagggatg 140221 agcctgctat ttccaagaca acaaaattat ttctacttca tattaataca aaacattaaa 140281 atatacattg tctttgtaat atagtttatt aattcaaagc acttgcattc ccatttatct 140341 ggtcttcata caaaaccatg tatgagtgag taggcagatg gtcttattta ttacttcgtt 140401 caatatatag atttttgagt gcctaccggt atacagcact gaacgaaaga gaaaaatact 140461 agtttggatg ggagctcaac acaatctgag gggatatatt ggacccattt tgcagaggaa 140521 gaaactttgg ttcaggaagg gaaagtgaac tctgaagttt gacataaagt tagcttcaga 140581 actagaatat ctgccttctg atctcagcct atcacctatc tcaggttctg tctccacgat 140641 accatgtgat tggaactggc tcattaacct ggggatggat caatcagcat cttttttttt 140701 tttttaacag gattggatgg aaatctgcat tccttgatca ctcagtctat agggtgaact 140761 tacaatggcc ctcatgctta atcttgaaat acagggatgg tctgatggag tgtgctggag 140821 accctgctga gatctttgga caactcctct ccagtccact acctgaatcc tttctcttct 140881 cctcaaaaaa atctgagact atccaaatga agagtaaata aagggagaag taagaaggaa 140941 acatgagaca gagaggcagc aggagcagaa tgaaacttgg ggagcagagg gagaagccaa 141001 gaaaacttac tggacttgga gccaataaat tggccatggc atgaaggttt ctgccagcaa 141061 tatggcagcc cattccagct ctttccattt tcttttccca gctgtgctgc caagcaccat 141121 gcttaggctg gtttaacaga gattatgcta ggaggatttt ctaacattca tgtcctgatt 141181 accccttatc tgattcctta gcagtagcct tcagtggcct cagagaattc caggttcagc 141241 tgtctgaaca gttcaatgac tgtttgtaag ttctgactta cgatcaaggc aggaaatgtg 141301 aaaagcattt ttaaaaagga aaaaaaccat ccggcgttgt taacattttt gttattccag 141361 tattttccca tgcatataaa tatgtgtact tcttttattt ttcacaagag tactctcaca 141421 ccataatatt gttttataac cctcttttcc actgtgctct agtgtaaaca ttttactgta 141481 ccaaatcttt ttccacaaaa ggatttttaa ttactgcata gatttctatt gaacggatgt 141541 actaaaattt acctaaacag ctcttattat tcaacatgat tcaaccattg ttacaaataa 141601 tatagttgta tatttatctt tatggaaagc tatttctgaa tactcctaag gataagttca 141661 tgtattagtt aggttttgct gtgtaacaaa tcatcgcaca acttagtggt gtcaaacaat 141721 gactatttac ttacctctgc aggtctgcat tttgggccag gtagttctgc tggactaggc 141781 tggaatcagt ggacataggc tctggttact tatgcatttg caatcagctg ggggactgca 141841 tgggatgatg gtggtatgtc atcttccagc aggccaggtt gtttccatag cagttagagt 141901 tgcaagagca gcaaggaggc aaaccctgat gcacaaactt tttttcaagt tttggttgca 141961 tcacatttgc ttctgtccca taggccaaag catgtgacat ggccaagccc ggagacagtt 142021 tgagaagaca ttaccaaagg catggctaca aaaggcatga aagtctcagg accattgttg 142081 taatcaactc atcacagtcc ccaaaatggg agtttctggg tcaaaaagtg tggccattta 142141 aaggttttca ttgtcacatt gctctccaga gaaattgtaa ctaatttaca caccacctag 142201 agtacatgca tttccaaatt cacttgatca ctggatataa tcattggttt aatttccccc 142261 cagaaaataa tattattgct tttgaagagt acatgctttg tgagggaagc tgtcttgaat 142321 gtccagcttc caggtgcaag cgcctgcatg tgtgcacagg tgacaccatg tagaaccgaa 142381 gaaccacctg gctgagtcta gccaacccag agaattgtga gggaaaagaa tattttttta 142441 aggcagaaga aaaatgtatg ttcattgtaa aaacctgaag caatgtggta cctttatcat 142501 caatatcttc cccagtattt gattcattct gacattatgt tgttgggtta catttatttc 142561 tattacattt aggacctttg agtctatatt cattagtgaa atttgtctgc catttactct 142621 ttttttttgg tttgctgtct ttgacaagtt tttgctattt tcatcaaaac atctgggaag 142681 tcttccatct tttcttgttt tttaatgctt aggaagagca taaatggcaa aggatttaga 142741 atatttttta gaagtttgag tggatttacc atacccccaa agcaaaacaa tttctgggcc 142801 tggtgccttt tccaagatta gttatttgat aattttttct ggtgtatttc atgatcattt 142861 ttggtatact ttcaaatgtg ttatgttttt ttaagaataa gcatttaatt gagtctctaa 142921 tttattagca taaaactata tatattgtcc tgtaatgatg tgaaaaattg ccttgccatc 142981 tgcaattctc ttttctcatt cttactgagt tcctattttt agtttttgaa aattttctta 143041 tgagctttgc cagagtgtca ctgttttatt tttcccctca aacaaccacc tcttaaattt 143101 attcttcagt tcaacttttt tctgtttcca gatgcattaa tttctgcttt ttaattattt 143161 cctcagtttt atcactcttc ctagtttctt gagctaaatg tccctatgta ctttcaatta 143221 tatctactga tttgtcaaaa attggatagt atgctcgtct cctactatga tagtgatttt 143281 tcaacacctt tgtatttttt taaacaaatt ttgttttgta catttctaat gtacttttat 143341 gttataggag ttagagtttc atgattctcc agtcttattg tgaagttact cagatctgag 143401 ttcaaattct cctgtgctgc ctaccaaatg atgaaccagt tttagattct cctaaaagct 143461 tggtgataat aatgccagtg ttgcttattt ttaatactta aattaaatga tataattgat 143521 gttaagtgct catcatagtg ccaggcacat ggaacacaca ctgctattgg ttatattgtt 143581 attattgatg tactctttat gaatacaaaa tatccatctt tgtcctatgt aatattttta 143641 tgctttacat tttatttgtc tgataattaa tatttctgct catggcagct tttcccacct 143701 tttcattttc attttggatc attttatttt cagcaagtct catgggaacc atgtatattt 143761 gaattctatt ttttctttaa attatgaatc tttgtccttt aatgggataa ttcaatccat 143821 tcacattttt tttttttgca agacacagca tttggcaagt gtttacttct ttcttctccc 143881 aatccacttt cctgcctttg aagattttag aaccagaatt tatgaaaaaa aataaatata 143941 tttgatttcc catgaattgc tattgacaca tgcattattt catgcaattc aagacttttt 144001 taaagatgtg actctcaaac ttacttcatt actcactttt tatgtttttt ggacccattt 144061 ccctcttatt gaattaattg attgattcat taattcaatc aacaaatatt tattgaaaac 144121 atactgtatg cagacactgt gctagggtct gagagaaacg tatttgatgt aaacctggtc 144181 tgtgtcttca ctgaatctac atttagattc acaaaattag agacacaaaa atagatgaca 144241 aattaaagca gatataaagg aaaccaataa gggtctcaag gagagattag tgaacgtgca 144301 tgtgagtatc tatttagcaa gtttaatgag gggatgcttt ttgtgaaggt gacatttatg 144361 ctggcagctt aagaagtggc attagctgtg tgaagagagg aaggaagaaa ggcattccag 144421 gcagaaggac tagcatatgc aaagggcttg aaaggggaaa gagccttatc ctaatgtttt 144481 catagaaatt atttttaaat actcaacaat tagcttttga attaaaggaa tgcacacaag 144541 tgaagttaat ctcttatagg caaaaccaaa tgaaggattg gaaaatgcct ctgaagagtc 144601 agctggaagg ctcttcatcc atccatccaa ccatccatcc atccatccat ccatccatcc 144661 atccatccat ccactcatcc atccattcat ccatccatcc acccacccat ctacccaccc 144721 atccatccat ccattcaccc acccactcat ccatctatcc attcatccat ccatccaata 144781 agcagtcatt gagtgcctac tgtgtgccag accctgtgcc aggtgttgat cacaaataat 144841 acaagcacct cctggagagt agagcctagt tgtgtacatg ggatggaggg cccatgataa 144901 taaagctggt acttctcatg gagtgggatg ggccaggggc atggggaagg cacacgtgct 144961 gtgctctgta gagctctgag cagggaggga cgcagtggga aagagctgag aacaattcct 145021 gccagatggc tttattcttt cccctgaagg aagtggggac ttcgtgctgt gacatgatac 145081 acaagaacag tgtttcctgt gggtttgcaa gggcattttg aatataaaca ctacagctcg 145141 ctgatgtgtg aaaaaattat tttttttgct ttttcagact ttgtgaaggg ataatcagat 145201 caattttata agtctggaaa acaggtattt ggggtgaatt ttaatgcaca cattatagtg 145261 gctggaaccc taaccaaagg cagctgtcac ctgttggtgg gttctcttag gcacgttgtt 145321 tgttcatcta gctgtcacct ggcagcttat gccctccagt tcctggaact gaccagttat 145381 ctgcaactga ctaagttctg gtactggatc ctgtgtgatt tgactctaac ctccaagaac 145441 cttactgtct ttagctgtga aatgcggaga agactggact acttgatctc tcaacactct 145501 atatatcttt aaagttaaat catttgttca gtcatttgcc ctttcttctg ccatttctct 145561 ttccctctct cccatccatc catccatcca cccatccacc cacccatcca tatatctatt 145621 taatatttat attatttatg ttatttattt atttatttat tttgcaatgg agtcttgctc 145681 tgttgtccag gctggagtgc aatggcgtga tctcggctca ctgcaacctc cgcctcccag 145741 gctcaagcga ttcttctgcc tcagcttcct gagtagctgg gattacaggc acatgcacca 145801 ccatgcccgg ctaatttttt tttttttttt gagacagagt cttgctctgt cgcccaggcc 145861 agagtgcagt ggtgcgatct cggttcactg caagctccgc ctcccaggtt cacgccattc 145921 tcctgcctca gcctcccgag tagctggaac tacaggcgcc caccaccaca cctcgctaat 145981 tttttgtatt tttagtagag acagggtttc accttgttag ccaggatggt cttgatctcc 146041 tgacctccag tgatccgccc atctcggcct cccaaagtgc tgggattata ggcgtgagcc 146101 accgcactgg accctgtcta atatttaatt gagtgcctgt tgtgtgctaa gtactgtgct 146161 gggttctcat ccagtggcca gtaaattaga caatcctccc atcatagagc ttgccatcta 146221 gtagggagac agacaagcaa ataattacac aggtaataat acaattacag ttatgataag 146281 tattacaaat gagaagaatg gggtaggtga gcatgtgtat taaggctatc ggtctcattt 146341 tgagggccag agaactcttt cttgacaaaa tgaagtttaa acaaaaacta ggtgaagatt 146401 ggcagaaaag catcccaggc agagggcaca gccagtgtga aggccttggg ataggaaggg 146461 gcatggccac atcaagggag gtaaaaatac ataacaaaat aataatatga gtaacaatgg 146521 caactactac tgaaggagga ccgcctattt gccagggacc atgcgatcct taactgtgtg 146581 gttttgatca aatcatctaa actccttgca ccttggtcta tgcatctata aaatggggtg 146641 atagtgatcg tgtatacttc atcaggttct tctgaggcgc caatgaagta acgtctggga 146701 aatccttagg tctatgcctg gaccatagga agtgtttaag agacagccgt tgtgactgtt 146761 atttttatga atggtattgg gtgagaaaaa gttcgtgttg catctcaggg cttgccttgg 146821 accctgttga tgatttctat gttgaggtca ctcttggagt ttctgcagaa gtagaactct 146881 tggaagtctg ctattcagca gatgagtgaa acaagtgctt ctccctccgc tcccacagaa 146941 agaggggcag agatgccagc cacagcttgg ctttcaggca gcaggcacat tgggatcagt 147001 gattggcatt tgcaacattg gttggcatca cagagagcat ggaaggtcaa ggtaataatt 147061 atcaaggaca gtttaaggga aacagtacat tccattcctt atccattgta agaccagtac 147121 attccctgtt ctttaactct tgggaagaca tctcccccag atctctctgc tgaggatcag 147181 ggccatttct ctgttaaaaa tgtcaaacag caacttacca gaggaaacag aatggtggaa 147241 atgccatctc agcattcaag aagccccagt accaaacttt gcatgtatat acgtttgaaa 147301 caagctttgt aaattacttt tgttcatttg taattactta tccataagaa ctctctctct 147361 ctctctctct ctctctctct ctctctctct ctgtgtgtgt gtgtgtgtgt gtgtggtttg 147421 ttgtcacctg ggctggagtg cagtgatgcg accatggctc actatattca aaatcctggg 147481 ctcatgcgat cctcccacct cagccttctg aggagctggg accgtaggca cgtgccacca 147541 ggcccagcaa ataaaattta tttatttatt tatttattta tttatttatt tatttattta 147601 tttatttatt tattgtagag atagggcctt gctatgttgc tcaggctggt ctcgaactcc 147661 tggcctcaag tgatcctcct gtctcagcct cccaaagtgc agggattaca gttaggtatg 147721 gagagagaag gaaataagga agacacagtt attactcttt aggacccata atttaactca 147781 agtgattttg cccccaggtg gtatttggca atgtctggag acattttgtt tgtcacaact 147841 tggggtggtg ggggttgcta ctggaatcta gtgggtagag gccagggatg ctactgaaca 147901 aaatatacaa gacagcctca caacaaagaa ttgaccagtc cacagtgtcc acagtgccat 147961 ggttgagaaa ccttgatcta actgcttatg acataccact tccataaagc cgagagatat 148021 caattcttat gcagttatca ctgtgtacag gaaattgtgt gtttgtacat gtgtgtgcat 148081 gaatgtgtag atataaaggg ataaaacctt tatcttcctt tcctaggtat ttttacctct 148141 gtattatcaa ttgaaataat ttgtttttga atatacaatg caaagtttga aatcctacag 148201 gtacaacaag gtatacagtg aataaagaat ctccccattc cacttctgtc ctcaagtctt 148261 cagttctctg gaggagtact actgtcatta gtctcctata tattagtaga gagacattcc 148321 gcatatgaac cttataccct tagtagcacc tcacacatat ttttttcctt gttgcctttt 148381 tttttcctat gtgatgacgt atcttagaga tattttcata tcagcgcata tagagcagcc 148441 tcattcattt taacagctgc acaggttttt tttttttttt tttttttttt tttttttgtg 148501 gctgtcccat acggtcttct gttgatgggc ataggttatt tctagtcttt tgcataacaa 148561 acagtgctgg agtggatctc cttgtaccta actgtggaca cacatgtgca agtacgtctg 148621 taatccaaaa tcctgcaaga ggaatttcta ggttaagatg gcatgtgcat tttaaatttg 148681 gctagagatt tctgcagtga tgtctgatcc tgtgcaccgt gcattcatca tgtgacccat 148741 ttctagtcct tcttgaccgc cccctctcct atggggatcc tgagctgata gggtcacttc 148801 tgtctctccc ccacactgga acagctctct ctaatctaca agattaattt tttttcctaa 148861 tcaaaacaca attgccaagg atagtttgaa aaatattaat tgtgtctgtc atcctggggc 148921 tctgagcacg tgagcgtttg ggatataatc tttgcaaatt gcctatagtt catacagcac 148981 agcaagagcc tctctctctg cagccagtcc tttatgatcg atcataccca ccccctctcc 149041 caatgggctg tatcctaagg gtaagtagac agcatgggtt ctcttgctta aaaaaaatcg 149101 ttaactcaac gggggcgcct gctgcttgat tgggagtatg tcggcattta gagcctgctg 149161 gctgcaatct cactacgcag actggaagtt gtggggtgtt tttatcagca ggaaacagct 149221 ttgagagcgg aaggggagtg agagtagatg gatagatgag ggaattcgaa gaggaggaag 149281 ccgtgggaga aatgtcttct tttaaagaca gacaaggcag atttctccag cccatctctc 149341 tggaatgtca agtaggaatg aaatgggaag ttgggttaat tctaccaggc ctgcccagca 149401 accagcctgt tcaggttccc caggagttgc agcctgttgt ctgagactgt gaggttctct 149461 tccctgaagt gtcttgtggg aattggctgg tggcagcatt ttctctgagg tcccaaaaat 149521 tcgagtggtc aagaagacac cctgcaatga tcattcccca acattggagg ccataaaagt 149581 cagtctgggt ttgcaaatac acactggaag tctcagagct ggtgcagaca tctcagagct 149641 ggtgcagaca gacactgtgg ctgagtgtgt gtgccttgtt cgttatggtt tcactgctac 149701 tggattggta cctggtacac agtaggcact gaaaaaatat tcactgaaag aaagagaaaa 149761 cttgatggca gttgtaggtc tgggtctgtt ggaattttta cccccaggaa gagatttgat 149821 tttaactcaa agccagttga taaacagcct cacccagcag acagagggca gcttggggga 149881 aactggaggg tatgtagggt ggggtggggc tcttatctgt atggacaatg ccatatattt 149941 tctgaaactt ggatttttct ccttataagg gagaagatcc aagaagacaa tgaaaaaaca 150001 tttgcccctg cctttagaaa aaataacctt ttaatttttg gaataatttc agatttttag 150061 aaaagttgca aaacatttac agagagttcc actgcatacc cctcacccaa agtttccccc 150121 taatgctaac atcttatgca actgtgatgc ctttgcccct accattttga ggttaacaat 150181 taagttagta taatacttat tctgtaccga gctctattct atatcatttt cctgcagttg 150241 tttatttgac cctttcagca tcctttgagg aataagcact gttatagcag tccccatttt 150301 acagacgggg aagctgaggc tcagctagac tctagctagt gttcgaaacc taagttgtct 150361 ggctccagtc tctcacccct caccattatt atctgcacag taatgatggt aactgtcatc 150421 tcttgatgat aatgcgttca ggaactgagc tcaaaagttt ttccctccct cccttcctcc 150481 cttccttcct tccttccttc cttccttcct tccttccttc cttccttcct ccctccctcc 150541 ctcccttcct tccttccttc ctttttttga gatagggtct cactctgtca cccagtctgg 150601 agtgcagtgt cacgatcgta gttcactgta accttgaact actgggctca agccatcctc 150661 ccacctcaac ttcttgagta gctttgacta caggcataag ccatcatgtc tacctaattt 150721 ttagtttttt cttttctttt tttttttttt tcctttttct gtagagacaa agtcttgcta 150781 tgttgcatgg ctggtctcaa actcctggcc tcaggagccc tcctgcttca gcctccccaa 150841 gttctgggat tacaggtgtg agctgccacg cccagcctac ttatgttatt tctaaaccct 150901 cccctgactc agagctgtgg taaagagccc cgcttttgtt gtcctgctca ggacaaaaat 150961 acatggcccc aggcttttgt gagtaggatc cggaagggaa cttaagtgtg tgtgtgtgtg 151021 tgtgtgtgtg tgcacccacg tgtgtgtggg ggagtgatag agattctcct ggtttctgtc 151081 ctagtttttt ctaagcgtat ggagacaggt ctgctccgta catctgggga ttggtattac 151141 cacatcattt cccccctaac agcagcaaag attctcaaag cttcatcaga gcatgaacaa 151201 tcagaaatgc cttaattttg ttagtcacaa taaataatac tttgattaat tagcaagcac 151261 atgccttggg aatatgagaa agtaaatgat ttctacccta gatctttcag atagctggga 151321 gggaagagag tattaaaata cgcttttaac tatgggtttt ggatggagaa gaggagaatg 151381 ccggagaggg gaaagcagca gtggaattgt tcttggagat aggagaaact gaaactcttt 151441 ccacccaggg tctttgcaga aagtggttcc cactgcctgg aatacttttt cttctcttta 151501 catggtgagc tctcactcta gtctcaactt aaatgtctcc ccctttgaca ggcccttgct 151561 aacatcctgt ttcaagttcc ttctggatat tgattaggat ggtaagcttc atgaggttaa 151621 ggatctctgt ttcatccact gctccatcct ctggtgtctg caacgacgct ggcacaccat 151681 aggcccttat taaatatgtg tcgactaaat gaatatgtgg tcttattaaa tatgtgtcga 151741 ctaaatgaat atgtggtcta tttcaagtcc tcgtttgtgt ctgccctaga acttagcact 151801 gtttgccatt agcttgtcta tttccttgtg aatagctaga ttccctcatg agctataagg 151861 gccacgatgg cagggggtgt gtctaccttg ttcaccaagg gtacccagga tgaacctggc 151921 ctgcagctaa gggggtgggc tctagggtca gattacttgg gctcaaatcc cagatttcct 151981 atgttccagc tgtgtgattt ctgctaattg tcaagccact ctgtgcctca ttttccacat 152041 aggtgcaaga ggggtgataa tagcatctat ctcttggggt tgttgaaaga attaaatatg 152101 tcagtattgg ctaaaacact tagaggaagc tgggcgtggt ggctcatgtc tgtaatccca 152161 gcactttggg aggctgaggc aggtggattg cttgaggcca ggagttcaag accagcctgg 152221 ccaacatggc gaaaccccat ctctactaaa aatacaaaaa ttagccagac atggttgtgg 152281 gtgcctgtga tttcagctac ttgggaggct gaggcagaag aatcgcttga acccgggaag 152341 tgaaggttgc agtgagctga gatcgcacta ctgcactcca gcctgggtga cagagcgagt 152401 cttcatctca aaagaacaac aacaacaaca acaaaaaacc acaacactta gaagagtatc 152461 tagcatatag taagtgcttc agtgctttat acatgtcagt tattggtagt actcaaaata 152521 tttgggattt aaaagttggg ggatttttac ataaaaggca cttttcctct ttgagtggta 152581 aaatctcccc taagaaagag gcaaggacag atggagatga tctgagaggc ccaagagatc 152641 cttaaaacca ggctttttaa gaaatctgat gttttatttc tccgtattgt catggaaaag 152701 gccagctttt agcatgcctg gtgtttctgg tgttttcttc tcttcactgt tttctacatt 152761 tctttctttg cattggtcct ttttttttct tttttttttt tttaataaaa aaaaaatagc 152821 acagctcgga gtcttgcttg ctgacttact gctagagtgt gggatccatt gcttccattt 152881 agagaagcag ccggccctgg aatgtggccg cagggctgcc aggctgactc tgtgagctga 152941 gctggcgagg aatgcacttt gcagcttgca ctgagtttgg ccttacgtct gtggaatttc 153001 ttctctcttc ttactcattc gtgtcgttaa ataaataaca acacaagcag tgcatcctcc 153061 cggttataat ggcaaacagt acaatgtgtt acaacacaca gtagcttgaa gaggatatct 153121 ctctcctgtg gtagatgctt cattccccag aacttcactc ctcaccccca gaataggttg 153181 cccagttaac aggatgcctg gttcagtctg cattttgggt aaacaaggaa caccttttta 153241 gtataagtat gtcccaaata ttgcatagca ccaacttatc ctaaaacact atttgttgtt 153301 tatgtgctat tcaggtttaa ctgggcattc tgtagtttta tttgctaaat ctgacaactc 153361 cagagagtct cctagaacca gtttcctgta ttttcttcca gtaattttcc aagcatgtgc 153421 aacctatgga tatcttggca gacttactcc tggtagaatt cccatcacct agaacagtgc 153481 ttaggacatt gtaggtgctt aataaatatg ggttgaatga gtgattgact ctgtacccac 153541 cccatttttc ttcacataag tggaaatata ttatagtgtt gtttcatatg tagcttttcc 153601 ttttaataat gtctcatggg gattgataca tagagctgcc tcatactcct ctaagggttt 153661 cctagcattc ccttttatga tgcataatgt atttaaccag tcccctgatg atcgacattg 153721 gtgctgttga gagtctttag ctattataaa caataaggta gtgagcattc ctatatatgt 153781 ttacctctgt gggtttgacg acgtgcaagg ataggtgtcg cattggaatc atgaggacgg 153841 aggatatgtg ctttaaaatg tggattgatg ttgctgaatt gctttctgta caggatctgt 153901 tcacacttcc ctcagacagc ttgttcccca tctccttacc aacatagaat attatcaaac 153961 tttttcattt tgccaatctg atcagtgaaa aactagtatc tcgctatcat cttaatttac 154021 atctctttgc ttatgagaac atttttatgt ttgaaagctc tatttctttt tctgtgaact 154081 gcttcttctt ttttgcctat ttttttctac ttattgattt ttaaccggtt tttgtttgtg 154141 aaagaacatg tctgttagcc tggcatatgt tttatatata atgtaattta taaaattggt 154201 gtttaacttt gaaaaagtga tttattatgg attttttaaa ctctgtagac attaaaaaat 154261 tcctcagtag tcaatttatg agtcttttct tccacatatt ctacattttg tgtcttgctt 154321 agaaaggcct cttactcaat gagattatag aggaattctc ccatattttc ttttactact 154381 ttaacttctt tttttttgca tttaaatctt taatccatct ggaatttatt ttggttttag 154441 gagggaggtg aggatctagc tttatttttt accagatggc cagccagttg tcccaatacc 154501 atttattgaa taatccatcg cttcccacag aattgaaatg ctgcctttat catatattaa 154561 attgtggagg agggatttta tactcatttt ctgacagcct ccagtgccag ctaggcagca 154621 aggattttag ggggtttcag ctgactagag gcaaagatcc tgacctgaag gataaccgca 154681 atttattaaa ccagaaatac tctgtgtagg aacagtgggt ttcctgggtg gggacagggt 154741 tggggtgggg gaaagtgagt gtcccgtgct gggaggaatt tagttacagc aagatttctc 154801 aaccttggca ttattgacat tggggggtgg aataggtctt tgttggagcc tgtcccatgt 154861 agtgtaggat gtttagcagc atccttggtc tctacccagt atacaccagt attaacttca 154921 ccctgatttg taacaatcaa aaatgtctcc aaacattgcc ctgggtggca aaatcatctt 154981 cagttgagaa ttactgcttt atcaagcctg ggtcgagcag acctcccatc catttctttt 155041 aaacagaggc agggtcttgt tccgttgccc atgcagtgca tgatgtgatc atagctcact 155101 gcagcctcaa attcctggct caagccatcc tcctgcctca gcctctgggg aagctaggac 155161 tataccacat gcccagctaa ttttttagtt tttgtagaga cagggtctta ctatgttgca 155221 taggctggtc tcaaactgct ggccacaagc aatcctcttg cctaggcctc ctgaagcact 155281 gggattatgg gcgtgagcca ctgtgcccag cctgaacaga ccgttgatta gatgcatgtg 155341 aggccctttc ttgttctgag agtctgtcat cagcagtatc aatggctcca gatgtctggg 155401 aggcaggaag ggaggtgcct atctatctca gagaccagaa cacccagatt tctggtatga 155461 gtgtggggtc tgtggggaaa gggtggaacc ataccggtcg actccgggag gaacagaggt 155521 tcccacctag ccctacaggt gaatcttgca aaactcccta ggacctgcct gggagcctag 155581 ccattccccc tgctaaccac acccccagac aacaaggcct ggccatggat ccaggaacaa 155641 tgactctgag aaccggaggt ggcagatgtt ttgtgattta acagcatatc ggtttattgt 155701 cttatttttc ttaggagtaa aagagtccat gggtgtattg ttgggggcag caggaagcac 155761 acgtgtctat tcggagattt ggggcagcag gtggggagag aaagtccatc tagacgaggg 155821 aggtgtggga gtgagatgga agaggagagg gaagtgggtg gaatccctgg gagctggggg 155881 aaggaggaaa acacagagag gagctggctg aggtttgagc aaggcctctg gagattggtg 155941 agagctgggg cggcggggat tctttatgac agtgctgcag agggaaaaag gattgttgct 156001 gagttgggtg gaggtaggta ggagtgattt gaagaaggag gtccccttgg gaagttgcat 156061 tagtgggtga atgggggttt tataggccag gcagagaata tggggagggc aactccagca 156121 gatgaacagc ttttgcaaga cacctgtgtg gagcatgtgg caccacctga gcgtcctccc 156181 caggtgcggc tctaacacag gtgcaggtta tgttgggaga tagagagcca gaactcagga 156241 gtgccttggg tgccttgctg taagatttgg gctttccctt tatttctttt tttttgagac 156301 ggattctcgc tctgtcgccc caggctggag tgcagtggcg ccatctcggc tcactgcaag 156361 ctccgcctcc cgggttcatg ccattctcca gcctcccgag tagctgggac tacaggtgcc 156421 caccaccgtg cccagctaat tttttttttt gtatttttag tagagacggg gtttcaccgt 156481 gatctcgatc tcctgacctc atgatccagc cgcctcagcc tcccaaagtg ctgggattac 156541 aggcgtgagc caccgcgccc agcctgggct ttccctttat ttcaaaacac tcctcaccca 156601 ctcaatatag aaaaccaaaa aatagaggtg agcaaaagga agagagtgaa gaacagcgtg 156661 gtgtcaccca cccagtggcc acctccataa acgctgtcca gtttcctctg cacgtcctta 156721 gcatgctccc tgcacaccca ctgggacacc agctttgtga aggcaggggt ttgttttgtt 156781 ttgtttacag agctaggcga acagttggca ctcagtaact gtgtaatgaa tggatctgta 156841 ctgcaaacct gatatcactt gctcttttca cttcgaagtt catgattcgc ttccttgtta 156901 gtagatacct ttctgggaca ggtttttaat gggcacaaat tattggactg ggtaaatagg 156961 ccacatttca cggaaccaac accctcctgc tttttatcag ttttcttcca cattcctcta 157021 tcacaagtgc tttcatttgt ttgggctgct gtaaccatgg gtggcttaag tgacggaaat 157081 gtattttctc acagttctgg agctggaggt ctgaggttct tggcgagacc ctcctcttgg 157141 cttatagata acctctttct ccttgtattc tcacgtggca gagagcagag agaggaaatc 157201 agctctctca tgtccctttt taaaaattca catacattta agttcaagtg cagttttatt 157261 acatgggtat attgcatggt ggtgaagtct gagctttttg agtaaccatc acccaaataa 157321 tgtacattgt atctactaag taatttctca tccctcaccc tcttcccacc ctcccaagtc 157381 actatcattc cacactctat gtccgtgcac acacattata agctcccact tgtaagtgag 157441 aacatgcggt atttggcttt ctgtttctga gttgtttcag ttaaaatatt gacctctagt 157501 tccttccaca ttgctgcaaa agatgcgatt tcattccttt tatggctgag tagtattctg 157561 tggtgtatat agaccacatt tgcttcatcc attttgagtc ccttcttata agggccttga 157621 tcccattcag aaagtctcca tgatctgatc taatctcatg atctaatcac ctcccaaagg 157681 ccccacctcc taaaagcaac acactggggg gttaggcttc aaccaattaa tctgtagagg 157741 acaaacacgt tcagtccata gcaggaagca gtgccgccat ggggaagagc attgtcgtga 157801 aaccttttgc ccctcttgtt gagttttctc ctccagcaca ccctgaacat gctgacagtg 157861 tcaccttcct ctttacttgt ccctcttccg ctccaagttg cctcctggct tcccagtgtc 157921 tctgtgggct tgagaccaga gcttgtcata agagatagat gacgtcactt gatggaaaga 157981 gtcaagtgca gtgaccagca catagaaggc ggccatcaac attggtttca tcctttagag 158041 caaagtttcc ccaagcgcct ggtgtgcact ggtgagattt ggggacgggg tggtacttaa 158101 gacctggcac taattaacac tagatcactg tgtgggaaac atagtccctt ttctgttctc 158161 ttcaaatcct tctgatcggg ctgaggagaa agcctcaggt tggtgctgat gtgctttttt 158221 ctcccagacc cccatcctaa cagaaacaga gcaggtctca ggctcgggcc gttaggcaat 158281 gatgtcaagc cagaatgaac taacattatt ttgttctcgt tgtgttttat tttcatggga 158341 aacttagagt tatattcaga gatattgatt tttttaatcc catttaaaaa taatgtttta 158401 attgagataa aattcatata atgtaaaatt cacattttaa ccatttaaaa tgtacaattc 158461 agtggtttta aacatattta taatgttgta agccatcacc actatttaat tccagaacat 158521 tctcatcacc cccgaaagaa accctgaacc ccattagcag tcgctctcca ttttttttct 158581 ctcttccctg cggcccctgg taaccaacaa actaattccg tttctgtaga tttgcctatc 158641 ctagacattt tgtataaata gaatcataca atgaatgatc ctttgtgact ggcttcttag 158701 aataatgttt tcaaggctca tccaaggcat gtatcagtac ctcattcctc ttaatggtta 158761 gatcactttc cattgggtgg atggaccata ttttgtctat ttattcatca attagtgagc 158821 atttgagttg tttccacttt ttggctgtta taaatagcca taaacgtata tatgcttttt 158881 tttttttttt tttttttttt gagacaaggt cacccaggct ggagcgcagt ggcacaatca 158941 cagctcattg cagccttgaa ctcccaggct caagtgatca tcttccacct cagcttccca 159001 gatagctggg actacaggca tgtgccacta tgcccaactg atttgtttat tttttgtaga 159061 cacagggtct cactttattg cccaggctgt tcttcaactc ctgagctcaa gtgatccacc 159121 cacctcggcc tcccaaagtg ctgggattac aggcatgcgc taccatgcct ggtccaattt 159181 tttgtataag catgttttca attctcttgg gtgtcttcct aggagtggaa ttgctgggcc 159241 ctataagagc tctgtgttta acttttgcag gccatttcct gagtgacttt aagttgaaaa 159301 gtaagtcagt ttaaagaaaa ctattaatca tattgcagat ggtgtgtgga catgacaaac 159361 gtcataaacg tggtttgcag tgatggaaga gtagaaatcc tgcttggaca tgacctcctc 159421 catcaactgt ttcccactca cctttccagc ctgtggtgcc tctgttccca ggagcctcag 159481 tgcagccacg ctggcctctg aactgtctta catgagatgt ccttcttcaa agatactgcc 159541 gtttgtcttg tcctgggata tgcgcgaccc acattttttt tttttttttt tttttttgag 159601 atggagtctc actgtttgcc cgggctggag tgcagtggca ggatctcggc tcactgcaac 159661 ctccgccttc tgggttcagg tgattctcct gcctcagcct cccgagtagc tgggatttca 159721 ggtgcgcacc accatgccca gctaagtttt gtatttttag tagagacgga gttttgccat 159781 gttagccacg ctagtctcga actcctgacc tcaggtgatc cacccacctc cgcctcccaa 159841 agcgctggga ttacagacat gagccactgt acctaactgc gccacccctt tataccctcc 159901 agggagctgg acttttcaga gatcatcttt cacttagaat ctaccgttta acccttaatt 159961 gcatttcccc tcatattatt atactcacta atgattttgt tggagaatct tttgcctctg 160021 ctataagcca cttgagggta gcatctatat tcaagtagtc cattgttcct ctcatagtgc 160081 agcctcacct gatcaatggt gaacagctgc tgatgaacag aaccttccct ggaggcccct 160141 gttctactga atcttggtca gccccgtgtc ctgatgcttg ccctgcttgc tgttactttt 160201 tgcctgtggt tttactgttt agttaacttc ttcttttcct ttctttcttt cttctttttt 160261 tttttttttt tttttttgag acagagtctc tctctgttgc ccaggctgga gtgcagtggc 160321 gtgatctagg ctcactgcag tctccgcctc ctgggttcca gcgattctcc tgcctcagcc 160381 tccagagtag ctgggattgc agacgtgtgc cacctcaccc ggctaatttt tgtattttta 160441 gtatagatgg ggtctcaccc tgttgggcag gctggtctcg aactcctgac ctcaggtgat 160501 ccaccctctt tggcctccca aagggctggg attacaggtg tgagccaccg tgcctggcct 160561 ttatttattt atttatttat ttattttttg atatggagtc atgctctgtt gcctaggctg 160621 gagtgcagtg gtgcaatcaa agctcactgc agcctcgaac tcctgggctc aagttgtcct 160681 cctgcctcag cctcccaagt agctgggact acagatatgt gccaccacgc ccagctaatt 160741 gttgtatttt ttatagagac tggtcttgaa ctcctgggct taagcgatct gcccaccttg 160801 acctctccaa atgctgggat tgcaggcaaa tcagattgtt tcacagactt cagaatggag 160861 gctgaggttc aaggaggtgg acaccaggca gccctgggag gccatcttcc atccagtatg 160921 tagagtaagg aacaggaaag caggtgtacc gagaggcaag aaccatgaat atggacaagg 160981 agaagccgag agaacacaag gggagaccca actgtcttcc agttcccagt tctcttcctt 161041 tccttcagcc agaccgacct tcctcctatt gggcccaaat catgtcaaaa aaaaaaaatc 161101 tgactggcca ggcttagtca aggaggggat catagagtcc cagtgccatg ggtgagggag 161161 gcagtcagtg tagacatcat atcactgggg agataagcct tcaatggtgc ccactattgg 161221 gtgaaaaggc agacctggag ctctgaactt cagcttcccc ttcttcccaa ggttgtagaa 161281 gtgggaaagc caagctagtc atttctgagt gcgggaggag aataatgtga caaaaatggt 161341 taacatcaga ctagctagcc ttgaaagctt gagtctgaac ggtgatatct taggagaggg 161401 gaaaaaataa gcatgaagct gtggctctgc tccatttctg gggaaggaag ctcttctgac 161461 ttttctggct cagtgcagga agggttgaag aatctcttaa aatctcctga gtcctctttg 161521 tccttcatct cagaattcct gtgcaggtga tgtttggctg tatgctccac gaaaggaata 161581 gggaacttgg ctccctccct ccagggactg ttgctgctcc tctgagtttc cttagcttaa 161641 gctgtgcccc ctgtgtgcca tttgccttgg gacagagatg gcccccattt tccaatcctg 161701 ttgtaggggt gggaatgctc cccagtatgt gaaaatatcc ccaagctgtg tttttttttt 161761 cttttctttt ggggcaggct cttgtgttgc ccaggctgga gtgcagtagc acaatcatag 161821 ctcactgcag cctcaaattc ccaggctcaa gcaatcttgc ttcagcctcc cgagttccta 161881 agtagctgag actacaggca tgtgcctccc tgcccagcta attttcattt tattttatta 161941 tttttgggat gaggtcttgt tatgttgccc gggctcatgt caaactcctg gcctcaagtg 162001 atccttctgc ctcagcttcc tgaacagctg aaatatccct aggcttttaa gcctcagtta 162061 cttaatcaac acagtggggc cagtacagcc catcattggc atttgtggat ttggcagtgg 162121 tggttccttt tctcattagc tacccagaag atccttagca cagcatcata agtcactgtc 162181 ttgaaactcc accctgaatt ggatgcttgg tgagcatctg ggagaggcag aatggtgttg 162241 gaaccaagat tgtgggctct ggagtgagat ttctctgtaa gtctggacac caccccttgg 162301 ttgggtgatc ttgggcaaat taattaatgt ctctatgcct cattttcctt gcccataagt 162361 gcggatgaca tagcacccac ctcacaaggc tgttgggagg tttaaatgag ttaagacctg 162421 cacagccctt agaacaggga gctttcaata tgtgttggcc acagctcttg tgcccaatcg 162481 atttcactca tttgggcaat gcctgggttc tagatgaagg gggaggaggc tgcctgtagt 162541 gtcttcttca tgggccagtc cagctctgca gtggactgta ctggtgatgt ccaccgtgcc 162601 tcagctttta agcatatatt ctccttgaaa actatttaag cagaacacac tggggtcaag 162661 ccagtccaag ccagatgttt ggggtcctgg gtgcttctgt tttctccatg tctgtttacc 162721 tttaacaaga tggatggggg ctgtgaagcc cctctgggca gccttttctg gctgtgacat 162781 gcaccagaaa cccacttttg gcctgtttgc actgtgttag aaaacagcag gtgcagttgg 162841 cagcctctcc cctacccctt ctcccgttag cccctgatct ctggggactc cttagactaa 162901 tgcagtttta atgaatacac attactgttt acatgactga aatgagaggc tggctgctta 162961 ttttatgctg aggtttacag ccctgccttt taaaaccctc tctcccttgc tctgcttctg 163021 gccatggcga tattttctgg cggcccctat gctgtccctc caaataagga attcaccatg 163081 aagtcctgca gcagtgtcag agccattgcc ttgccccgga aggagtgaac tgtgtagaac 163141 aaaagcagag gctgggagct gtagttcacg cctgtaaccc cagcacttcg ggaggcagag 163201 gcagaaggat tgcttgagcc caagaggtct agaccagccc tgacaacaca gtgggacccc 163261 agtctctata aaaaatacaa aatacaaaaa tacaaaaatt gatgacctgc acctgtagtc 163321 ccagctacta gtggacactg aagtaggatg attgcatgag cccaggaggt ggaggctgca 163381 gtgagctttg attgcaccac tgcattccag cctgggcaag agtgagactc tctatcaaaa 163441 aaaaaaaaaa aaaaaaaaag cagaaagatg gattctttat ttttttaatt taaaaatact 163501 ttttacaatt cttaaatgct tgcttatgat agaacaatta gaaaatgaag gtatacaaag 163561 agaagaaaat aaaaatcacc tctaattcca ctactcagaa aaaactcgat cgttaacatt 163621 cctgtttatg ttttagcata ttttaatcaa tctacattta tatgtatttt aaattaaaat 163681 tagaaaccaa acttaataca tgttattcag aaatctgctt tttaccctac ctcacaatta 163741 atatatgatg aatgtctttc cgagttaaga aatatagatc tgcagtatct tttctttaat 163801 tttaatttta attttttaga gacaggatct tgctcagttg cccaggctgg agtgcagtgg 163861 ggtgatcaca gctcactgca gcctcaatct cctgggctca agcgattctc ccacctcagc 163921 ctgccgagta gctgggacta caggtgcatg ccaccacatg tggctaattt tttttttttt 163981 tttttttttg agacgaggtc ttactatgta gcccaggctg gtcttgaact cctggccaca 164041 aacaagcagt tctcccacct cagccttcca aagtgctggg attacaggct tgagcaccat 164101 gctcgatatt gcagtatctt tttttatgca tcattgtatc ctattctata attgtttcta 164161 tggtctcatt tgtagaaagg ttaacattaa atttcgaata gttatatgta ctcactggtt 164221 caaaattaaa aaggcacaag agctttaata cttttattga aagatttctc tcctatccct 164281 agcccttagg ccactctgtt cacctccttt ttggcaccca atattcaact aataatacca 164341 atttattgtg tatacttctg gaaatagttc ataatgtaca tgcacatcat atacatttat 164401 ttttctcttc ctcaaacacg tacattcaaa tgttagctta ccttgctttc ttcacttcac 164461 aatatcctgt atcttggtgt ttgttccatg tcagtgcata tagaacatcc tcattctttt 164521 ttttgaaaca gagtctggct ttgtcaccca ggctggagtg cagtggtgtg atcttggctc 164581 actgcagcct ctgcctcctg ggctcaagca atcctcctac cttagcctcc taagtagctg 164641 ggactacagg cacacaccac catgactggc tgatttttgt attttttttt ttcattgttg 164701 ttggtagaga cagggtttgt ccctgttgcc caggctggtc ttaaactcct gggctcaagt 164761 gatagggtca cctgggcctc ccaaagtgtt ggggttacaa gagtgagcca ctgcacccaa 164821 tgtctcattc tatttttatt tatttatttt ttgagacaga gtctcgctct gttgcccaga 164881 ccggagtgca gtggtcttgg gtcactgcaa cctccacttt ccaggttcat gtgattcttc 164941 cacctcagtt tcctgactag ctgggattac aggtgtgtgc taccacaccc ggctaatttt 165001 tgtgttttta gtagagatag ggttttgcca tgttgcccga gctggtctca aactcctgac 165061 ttcagatgat ccacctgcct cccaaagggt tgggattact ggcgtgagcc accacgcccg 165121 gccacctcat tctgttttat gattgcatcg tggtccatag tatgggcatc cctggggtgt 165181 ttccaaaatt gggatatatt gactcagaaa cccttcctgt ggttctgtcc aagctccttg 165241 ttcatatgac ctggcctggg gtgactggag gaaggatgtc cattcggaga ttaggaggga 165301 ctgcagctct cgattatgaa tgggattggc ctccacaggc ctgagtgaca aggacaaagc 165361 agagctggtg agccccatgc caggtggcgg tcactggtgg aaggccattt gtgacttctc 165421 tgcagttctc cagatggacc acatttaaaa cagaggcact gccattcatt tctacttggt 165481 tgaggaaagg tttcatcgca ggcccagaga gaggatcaga tgttttttgc agtttctggc 165541 aagacattag tgggtatcag agggtaagtg atctcgtaaa gctgactcaa agcaagagaa 165601 aatttcgtgt ctgtcttaga aacacatttg cctccaaagg gaaggcaggg tgtggaggac 165661 taatcaacgc atttcacatt tggtttcttt gtctttcttg gtgatcttac gtgactagcc 165721 ctgaatattt tgcatatgtc aaggaaacat ttcaaattac taagatgaga ggaaaatatg 165781 gggaatgaat aaaatggggt ttatcagcaa gaacagcaaa agacttctta gagccggctg 165841 gggattgttt tgtgaatctc taaacaagag gggcagtttg acttggggat gggggatgtt 165901 ttgacacttg gagggatggg agggagccaa caaagatgta aagcaaaact gggatggtgg 165961 tgatggagaa gcagtggggg tttctatggg aagacatagg aaataatggt ctgtccttca 166021 cagagtaggt gaggtctact agaagatact tatcttttcc ctaaatttca gtcaacttca 166081 catgattaag tttccattat taaagacttg ttctttgtgt taccttttta ttattattat 166141 tattattatt tttttttttt tgaaatagag tttcactctg tcacccaagt tggagtgaat 166201 tggcgtgatc ttggctcact gcaacctcca cctcctgggc tcaagagatt ctcgtgcctc 166261 agccacccga gtagctggga ctacaggcac acaccaccag gcctaactaa tatttgtatt 166321 tttagtagag gtggggtctt gccatgttgg ccaggctggt ctcgaactcc tgggctcaag 166381 taatccaccc gcctcggcct cccaaagtgc tgggattaca ggcatgaggc accatgccag 166441 gccttttgtg ttacttttgt aagagatgaa atgaaactaa tatctaatca aatctaagca 166501 tcaagcaatt aaaaatattt tcttaaataa ctattgagtt aaataggact caaaagtaca 166561 attgcagaga ggctataaat gaaaaaaaat agagtgctat ataaaacaat ggataagatc 166621 agccaatgct gtgcttgcag ataagttcat agctttaata cttttatgat taaggattga 166681 gatgacttgc tttgagcata atctggctat tcacttattc taaggaactt gtcatggctc 166741 aaacatgttt ctagtatcct ctttagaaat tccgtttaga tacagttttc tttattaaac 166801 cttctaaggt gactatttca aagatgaaac attcattttg ttgtttaaat taacacgtat 166861 tttttaaagg gttatgtaaa tgtgttatcc atcaagtata tctttgagcc ttcatcacag 166921 tgtgatgtaa gggatataag ttgttaagga tccagcagta gagaaaattt ctgcacaaaa 166981 gtaactatag ctctattact tttggaatat tccctaggat acatccaaaa ccaaatcaaa 167041 ttatactact tataaaacac atttggaaag atactatgtg aatatgcttt gaatactgag 167101 gcttatgtta gaacagagtc atggacgaat ttgcaaaaaa aaaaaaaacc tccaaaattt 167161 cctggagtct gacaaaaata ccggataggg ttcctgaata tattttcatg atctttgcta 167221 tataaccaga tatctctata ttttgaaatg gatcatattt tattagtttt tcttagtaca 167281 caagcaacat gctatcattg cagatgccag aaagtaggat cacattgttg tgctatagat 167341 ctccagaact tctttatctt gcaaaactca aaaactctat catttctcaa actcaaaact 167401 caacaattct ccattccctc ttcctccacc ccatggccac caccattcta ctttgtctct 167461 atgaatttga atactatagg tatctcatat aagtggaatc atccagtatt tgtcatctta 167521 agatttcttt ctttgtgtaa cgttatcaag gttgatccac attgttgcgt gtgtcagaag 167581 ttcccttctc ttttaaaggc tgaatatttc attgctttat agactgcact tttttatcca 167641 ttcatttgtc aatggacagt tggattgctc tcatcttttg gctattgtga ataatgctgc 167701 tatgaacagg agtgtacaaa tatctcttca aggccctgtt ttcaattctt ttggctatga 167761 acccagaagt ggaattgcta gattaaatgg taattctatt tttaatttgt tgagaaccac 167821 catacttgtt ttcatagtgg ctgcaccatt ttgcattccc atcaacagtg cacaaggatt 167881 tttccacatc ctcaccagca catgttattt tctttttctt tttttgatag tagccattta 167941 atgggtatga ggtggtatct cattgtggtt tttgatttgc attttgctaa tgagtgatgc 168001 tgagcatctt ttcatgtgct tattggccat ttatgtgtct tctttggaga aattctattc 168061 aagttgactt ctatgatttt gcattatcac atctgtattg attcaagcaa gccaactaga 168121 ataaattctg gcatttaaac cgattttgtg gtttttctgc aaataaattc tgcccccaaa 168181 taacctccaa ctttctggaa gcagtcagca ggagtacagt tctgaagata actttcttta 168241 aaaaaggaaa ttcataaaat atcatgcatc ttcctttttt gacactaatg gaacaattta 168301 atgtaatttc agagggaagc agagcccctg gaaaggctgg tgtgataagg gaaggttacc 168361 cagctttcct gtcaggcggt gtgtgggagc agagagtggc attctctgca tactcttggg 168421 gagaagagtg ggtgagacag gctgctcagg gctggggcag agcccagggg aaggggatgg 168481 aaggggaaga acagcccttc aagagtcctg cagaaattgg tggaagttat ttaacagaag 168541 tgttcggctc cacccagcac attctgttgc cttctacata cagagtgtgt tagtctgttc 168601 tcatgctgct aataaagaca cacctgagac caggtaattt ataaaggaaa gaggtttaat 168661 ggactcacag ttccacatgg ctggggaggc ctcacgatca tggcagaagg caaatgaggg 168721 gcaaagtcac gtcttacatg gtggcaggca ggagacagca tgtgcagggg aactcccatt 168781 tatgaaaccg tcagatcacc tgagatttat tcactaccat gagaacagta tgggggaaac 168841 cacccccatg attcagttat ttccacctag ccccaccctt gacatgcagg gattattaca 168901 attcaaagtg agatttgggt ggggccacag ccaaaccata tcacagggta atgaagaatg 168961 tgtgcccaag tagtagaggg cttaagaaaa ccactcttgg gctctgagtc tctctaggtc 169021 tcagtttcct catctttcaa atggcaatat taataagacc cacctcatag ggattgtgtg 169081 gggtttaaat gagaaaagac aggtaaggtg ctggtacctt ataagtgatt aagttgccat 169141 taaggtattg gggacttagt gcttgtcctt gaagagcttt cggtctcgtg aggagacagc 169201 ctgatcgtta taaactatta tagaaaatgg gaagaaataa gggttgaggt gtgataaaga 169261 tgtgctaggc caggcaagat ggctcacgtc tgtaatccca gcactttggg attacaggtg 169321 ggcagatcac ttgaggccag gagttcaaga tcagcctgac caacatggca aaaccctgtt 169381 tctactaaaa atacaaaagt tagctgggtg tggcggcacc catctgtcat cccagctact 169441 tgggaggctg aggcaggaga attgcttgaa tctgagaggc agaggttgca gtgagccaaa 169501 attgcacaac tgtactccag caagactctg tcaaaaaaag aaaaaataaa taaaaaaaga 169561 tatgctatac cctcatatca caataatgcc aggaaataga tcattaattc tgaaccctaa 169621 ttcttgggtg ggcatcaagg caggcctcac aagaagaagg catttgagtt gaatcttata 169681 ggctcagctg ggttccaaca gcaaagactt agggaaaggg cagagcaggc aggggaacag 169741 taagagcaaa ggcttgaagg catggaagtt catgggtaat tactgactga gatgtttggt 169801 gttatggtgc atagaacact atcggatgat aatgaaaaca ttgaattcca gcatgtttag 169861 aaggacattg gctgataaaa atgacgtctt aatagttatc tgtggacaaa tttgtaaatg 169921 taaaagagtt gtagactcaa atcctacaga aattaggcag gagacctaaa tgagtgaagt 169981 gggtgggtat aaaacagagg tggtggcagg aacccatgtt gaattgaata gtgggtgttt 170041 tgtttaaagg ggcaccagta gtaatttttc ctctgttgga atatgatcct agggctgcca 170101 gctcttctac tttttcaaga gaagccagaa atctggattt catgcaaaat ttcttgtttt 170161 tctttttatt aaagacatgg tatggaccaa aaaaagatgt ctctaggcta gatctctgca 170221 acctctcatc tatactctat ttgcattttt aaagactata tgcaaatgtt atttagcaga 170281 gctacctagt ttttttgtgt gatgatagca tgtgaaagag ctatgataca tcttggaaat 170341 gtcaactttg aaatatacag tgactgtgtt ggtcactaca gaatgatctc tgcagaaact 170401 tcagttacat tttctctaaa tgacaccttt gctttgacaa cacattttca aaaagaaaaa 170461 tagcaaacta cccttcctcc agggccttag ccacagtagc tcctgccagc ataacacaat 170521 gctgcacctt actgagggat caggaaatgt ggtttgattt ataattatgg cttcatgggt 170581 agaatggtgg ccaatgtcca catgagccac actcagtggc taagatccag agggcattta 170641 atttaggtag ccagtgttac agacatagtt tagatttttg cagccttgac caatcttaat 170701 agatctgtct gttctttaga gttacctttt tttcctgcct cctagctacc tacacctttt 170761 aatgctaaat gtagttagct agtgaatgct gttatttgaa tcttaaaatt tcaggttggg 170821 acagaacctt aaaattcatg tatttcaacc accctttgta tgaggcagga atcttggtta 170881 tgacaggtga cagaaaattc aaactagctt aattcagaag ggaatttatt gcctttataa 170941 aaccaaaaag tccaagggta gctgccttca ggtattgtct gatccagggg tttgttatgt 171001 ttccaggaca tggtttcctt tattccatct ctgcttatgt attggctccc ttctcagatg 171061 gaagagactc ccctcatcaa ggtgagatgg ctatggaagc tctactcaca tttcctctag 171121 gttcaagtcc atgtagagaa gagcacctct ttctcttgtt atatctcaca ggctctaaat 171181 tagctactag ggccagtgaa aaagcaatgt gctgattgac cacccttatc tactcttgga 171241 gttccggatg gaattaacac tacctgaatt atgtggacca agagtgggaa aggaatagtt 171301 ctccagaagg atatttggta ttgttattaa aataaggtta attgaatact cactgaacca 171361 tctaatgctt gaattccctt catagcacgc cattggtgac tgtccaatct tcagttacta 171421 acatgctacc cccaaaatgg tgtgtttgtg cagctctaat aatgacaggg tttctcctaa 171481 tactgagatg atattggcat ctccatagtt ccagacatta tttctggtcc caaccataat 171541 caacctgttt tgacacgggt gaacatgtaa gggaagggat tgctcattgt ggtcacagaa 171601 gtaacctacc caatagaagc acaccaactt gaagcatgct tccatgatca ctgagggagg 171661 aaaatgattg gcaagtctcg tattggtcct tgaagcgctc cctataaaca cttgtcactg 171721 ccaagcgtgt atcactggcc aaagcaagtc acattgtgat atgattagac tttgtgtccc 171781 cacccaaatc tcatcttgaa ttgtaatccc cataatcccc atgtgtctag ggagagaact 171841 ggtgggaggt gaatggatca taggggcagt ttctcccatg ctgttctcat gatactgagt 171901 gagttctcac aagagctgat ggttttataa ggggctcctc ccccttcact cctcactctt 171961 ctctctcccg tagccatgtg agaaggtcca agcttgcttc cccttcgcct tctgtcataa 172021 ttgtaagttt cctgaggcct ccccaaccat gcggaactgt gagtcattta aatctctttc 172081 ctctagaaat tacccagtct cgggtagtat caagatagca gtgtgagaac cgactaatac 172141 gcatggccat gcctaacttc aaagcggctg ggcaagtatc atattgctgt gtacttggga 172201 ggaggggaga atcagaatat ttgatctggc gaccaccaca caactgaact tctctatttc 172261 tgtgttcttc ttgttaacca tctagtttct gtttcctctt gtgacacata tattaattaa 172321 tttttcaata cccaaacaca agcctctaat atttatcttt gtttagttac ttcctaataa 172381 actaagatat ctctacatcc tgatcctaaa attcaaaggc tccacgtata ccatggtgac 172441 tagttaattc caatatattg catagttgaa aattgccaag agagtagatt ttaagtgttc 172501 tcactacgaa aaaatgcata tgttaaatag cttgatttag ccatttcact atgtatacat 172561 gtaggaaaac atcatgttgt acactacaga tgtatacaat ttttacttgt caattataaa 172621 caacaacaac aaaactgcaa agtctcctta gacccaccca atcacatgtt gtctgctcat 172681 ttaatgagta aatgctatat aggttcattc tggacctggg taaaatgatt tacggactgg 172741 gaccaagtgg gaggacccta tttcaattca ttaggtaagg tcttccaggt caatatttga 172801 ccattgattg gcacatctgg ggtatagtta tttaactgcc ttctagttta cctcaatatg 172861 acccctcagt ggagggaaaa tggagcacaa tgactagaag aatgttgaga gatgaggaaa 172921 gagagaatgg gatggaaaga acataaaata ggaaagcatt gacattgtca cttcttggct 172981 atttcttcct gtaaaatttt gcctggagtg ccctacaatg gggtccccat cctctgccct 173041 gataacatct tagtggatca ctcctctctc ccttttcatt ctgcaggact tgcccattgg 173101 ggctctttca gacctgactg gtcattgggt atctgtggcc atgataatca ctctcttcct 173161 ggacgtgacc ctcctctcgt taaccaggat gcctgtgttc ttaaggttgg ccgtttccaa 173221 ggacagagag cttgcatccc atttctaggg gactggggat ccaaggactc cattcttgac 173281 attgtcttgt cagccttgaa gtgccgaaac ccagcatccc tgtctgaggt catcctgtct 173341 ctcttccaaa catgaggact tcttaaattt tttaaaaaaa gtttttactt gtataaattg 173401 agaaggcaca agtgcaattt tgttacatgg atatgttatt tagtggtaga atctgtgctt 173461 ttggtatgtc catcacccag atagggtacc ctgtactcat taagtgattt cttatccttc 173521 actcccccac ccaacccttc caagaaggat gactattact caacagtcat tcagtagcca 173581 ttggagaatg actattattc catactctgt gtctatgtgt attatatagt tcccatttat 173641 aactaagaat atgcattatt tgactttctc tttctgagtt tttttttttt aagataatgg 173701 ctttgggttc catcctgttg ctgcaaaaga catgatttca gactgggcat ggtggctcat 173761 gcctgtcata ttagcacact gggaggctga cgaaggaaga tcacttgagc ccaggagttt 173821 gaaaccagcc tgagcaactt ggtgagactc cggctctata aaaattaaat tagccaggca 173881 tcgtgttgtg tacctgtggt cccagtttct tgagaggctg aggcaggagg atctcttgag 173941 cccaggagtt caaggctgca gtgagctatg atcacacaac tgccccccag cctgggtgac 174001 agagtgagac cctgttttag tcaggacttc aagaccagcc ttgccaacat ggtgaagccc 174061 catctctgca aaaatacaaa aattagctgg gcatgatggc gggtgcctgt agtcccagct 174121 actcaggagg ccgaggtgga agaatcactt gaacccggga ggtggaggtt gcagtgagcc 174181 gagattgccc cactgcactc cagcctgggc gacagagtga gactgtttca aaaaaaataa 174241 ataataaata aataaataaa aataaaatca aaggtcttac tcttttttat ggctgagtag 174301 tattctattg tgtgtatgta tatatagata tacatacata taccacacac atacgtgtat 174361 atatgtatat atatggtggt atgtgtgtct atatacacac aatgatatgt atattaatat 174421 atatgtatat ataaatatat atacattttc tttatccagt catccattga ttgatgctta 174481 agttgattcc atatctttgc tattgtactg tgataaacgt atgagtgcag gcatctttta 174541 tataataatt tcttctttgg atagctgccc aaaattcagg taaaagatct cagtctacaa 174601 ggcagctccc aagcttcaaa aaattgtcct gccagatttt taaaagcaag ttttctcaaa 174661 agcttcgatg tatacattgg tcgatttatt tatccattaa agtatttatt gaatacttct 174721 tactgcgtat ggtggtggtg gtgctgaggt gtctcacagt gaacacaaca ggcgtagcct 174781 ctgctattac caaaattatt gtctggtaaa gaatgcagat ggctaaccag gcaggtgcct 174841 gtacggtgag atatgtgctg tgatggctgt aagcgcatgg tgtgagcgta gagttgggtg 174901 cttaattgaa ggcatcagga aggcttccag gactaaatga tatcaaagtc ccaccttaag 174961 gatgacaggt ttaactaagt aaaggaggat gtggaaagaa gggggagagt gctccaggca 175021 gaggaaatag tatatacaaa agcacaagag gattgaagaa ccaaaatcac tttggcatgg 175081 ctggaatgaa gggcatgttt gtgtattttc agggtagggg atgtgtttag ggacaagctc 175141 agagaaaagg aagtaaggaa ccctgtcata aagggctcat ctgtcctatg aaagagtgtg 175201 tacattttat tctgagagta cagagccttt tcttttttct aagcattaca tgattagatt 175261 tgtcttttag aaaccactct ggctgccgtt aaaaatgaat ttcacagtga caaggctgga 175321 ggaaggatct tgggttagga ggcggtggta gtaatctagg caggacttga ggacgcctga 175381 gtggtggtga ggaaggagag aggtggatgg aagattcaag agatatttag ggggtgttat 175441 tgagagaact tagtgattga tttggcgtgg gaggaatgag aaagtcaggg atggctccct 175501 ggtttctggc ttgggtacta ggtgcgctgg tggtgctttt gacagagata gaaccatggg 175561 aggaggaaca tgttttgagg gaaggctgtc tcccttttag acacctagtt ggtgtccaaa 175621 tggcacagac cagttggcgt gtggatatct tggggcttag ggcgtggatc tgggctcagg 175681 gtggagatct gagccttagc tgcacatcag tagtcattgc agtcatggag tgtgctgggg 175741 ttttgccaag agctcatgta gattcaggcg agaagagagg aggggcccag gctgggaata 175801 ctggctttgg aaatactctt gaaatatata gaaatgaaag aatgtatagg gctgtttcca 175861 cagacgttta ttttttattt ttattttttt gagatggtgt ttcactcttg ttgcccaggc 175921 tggagtgcaa tggcatgatc tcagctcact gcaacctcca cctcccaggt tcaagcaatt 175981 ctcctgcctc agcctactga gtagctggga ttatagatgc ccacgaccat gcccggctaa 176041 tttttttgta tttttagtag agacacggtt tcaccatgtt ggccaggctg gtctcgaact 176101 cctgacctca ggtgatccac ccaccttggc ctcccaaagt gctgggatta caggcatgag 176161 ccaccgcgcc cggcccacag acatttactg agcaagagct tgtgcaaggc actctggtat 176221 gcatgcagtc ggggaccctc tgatgaatgg aactcaggcc ttgtataagg aagtgacctc 176281 catcagaggg tggagacgct gtcagtgaac atccatcaca gacaagtgac tgagctagga 176341 gctggggtaa ggtagaaaga ggaggacagt gggcgatatg agctgggaaa tgcacctctg 176401 ttgggggact ggggagggct tcctggagga ggtgattttg gtatctagtg gttttgtttt 176461 gttttgtttg ttttttttgt tttgagatgg ggccttgctc ttgttgccca ggctggagtg 176521 caatggcatg atctcggctc actgcaacct ccgcctccca ggttcaagcg attctcctgc 176581 ctcagcctcc caagtagcta ggattacagg cgcccacaac catacctggc taatttttgt 176641 atttttagta gagacggggt tttgccatgt tggccaggct gttcccgacc tcctgacctc 176701 gtgatccgcc cgcctcggca tcccaaagtg ttgggattac aggcgtgagc cactgcgcca 176761 ggcctgtatc cagtgtttaa ggagatgttt tgtcagataa agctggagtt ttgtaatgga 176821 gttttggaga tggggaaggg cagactggct taagagggtg ctgcataaga gatggctcag 176881 aggagagaaa gtgtacgatg ttttcaggac aaattagtcc ttcaccatgg caggaatcta 176941 ggcggccgaa ggacatttgt gggaaattgt agcagaaaat acgaggagga ctttgatgcc 177001 aggccacaga gtttgcagtc ctttttggag actcagggga gttattgaac aattagaaac 177061 taaaggctga tgtgacctcc acaggatact ggctaaacgc atgaactctt gaaccagatg 177121 gacctggatt taagtctgga ttctgccatt tccttgctgt gtggtcttag gcagattatt 177181 aacctctctg agcctcagtt ttctcatctg tcaaatagga gtgaaatacc tcatagaggg 177241 ttgtgaagag tgaagtaaac taatgtggag aaggcattct gcagtgtctg gaattcagcg 177301 acccctcagt acatgttaac acactaatac tttctacgta atcatacctg taacaacaca 177361 gagacgatgg tattatcttt acagcttgtt cgcttctgta tcctgagcat ctaggacagt 177421 gcctggcatg taatggatgc tggagaaata tttcttgggc aagaaatgaa ttgcaccagt 177481 ttctttttgt aagggatcta tttttttctt gtctggaaaa atactttgat aaaaccttag 177541 ggaaaagatg aaattccctt tcttctggag cctagatttc aaagaggaga ggttgagtgc 177601 tctgcaaagt agaaggagaa gattttgggc agtgaggagg agagaattga tgagatggtg 177661 gggcacagcg tactatgaaa tgtgactctt cccccgggac ccccgaaaaa atcttctgta 177721 acattttata tgtgtaagac atctctctac catctgctat ggtcgaaatg ggatttgtag 177781 agagctcatg tgattctcaa acctcctagt aactctgctt acatggagag acagctgtct 177841 ccaagccagc tgttgtcttt tggaaagctt taatctgggg gaagataaca ccttgagaaa 177901 agatgtcagg attggaggca gatgttgacg gagccagttt ctaatgcccc aaggcagcca 177961 cactttgtat ggattgtatg gatgggcagg tgcacaaggt gagttatttt tttccctacc 178021 cagaaattta tggaagcctg gcatggtggc tcacccctat aatcccagca ctttgggagg 178081 ccaaggtggg cagatcactt gagcccagga gttcgaaacc agcctgggca acatggtgaa 178141 accctatctc tactaaaaat acaaaaatta gcctggtatg ggggtatgca cctgtagtct 178201 cagctactca ggaggctgag gtgggagaac tacttggacc caggaggtcg aggcttcagt 178261 gaaccatgtt catgccacca cacttcagcc tgggtgagag agtgagaacc tgtctaaaaa 178321 aaaatatgaa gttgctcttt tgaagagcat tctaatgact tatctggaga ttctaggaaa 178381 gaaattaaat tcagctttca atgtctgtct ttctagattt cttccttcta acgttattgt 178441 tgttatgtcc tcaaacaaaa acatttagta cttattatga gcttcatagg gttgttgtgg 178501 attaatacat gtaaagctct tagtgcaata cacgacacat agataacatt taattttatt 178561 atcgttactg ttgggtagca ctcagttcat gaaattccca cacacagtca agtggtaaat 178621 gctattactt tgtaaatatc atcgatgagg aacctgaagc tcacagaggt tgatgacttg 178681 cctaggatcg ctcagaaagc aggcagcaga gctgattgcc atggtaacca tttccaagct 178741 tcagtgccca gagatggacc cagtctggca tgggcttatc ccctggagca ctgtgagtgt 178801 cttgacagat aagtaaagga agtctttaat ttctgaaaaa tgcccagctg tgattggcat 178861 tgttaggggt cagtgaagac acgctcaaag accttactgt ccagcttttt tatctttcac 178921 ctttaatgcc gctgttagcc ccaggagcta accaaagggg agttttcttt atcagttcca 178981 agagggatgc catggttgtc ccaagattct ggagctaaag gtgggaccag gaccagaatt 179041 cagtttctgt gtttccaaat tcagttctat tctcctctca gtagaccagg aggcctccag 179101 acaagacaga tggcagacag ctacagtagt ttaaaatagc ctgagctcac ttctgaggta 179161 gacatgtgcg tggagtgagc tggaagacag ggacccctgt gtgggaaaga ttttggcaaa 179221 gagaagccca gtggggagtt ctgatgctag gctgagtgtc atcaggttgg ggtgagggtg 179281 tgagaagaga cctgagcagg tgccaacctg aggctgagtg gggaggagag aagcgagagg 179341 acaggtgcca ggcaagatgc ggcactgcca atcagtagaa gcttcctccc tcctttccag 179401 ctgtgctacc gagacgggga ggtcaggtgg tcgttctggg agggctggtg ccttttactt 179461 cccggccttc tggcctgctg ggtcttggag tccagtgcaa aataatgaag ttcctataaa 179521 gacagtttgt atagatcctt ttaggatgac ttgaaagcct atgcatttcc ttaacaatga 179581 tcaattcttg gctaatcctc tttcatccat atccgcaccc aatgcttttc actccatttc 179641 catactggat gttttgaaat aaatcccaga caccatataa tttcagtaat tatcaatctc 179701 aatatcacag aaagtcaaat agacattata tatcttctaa tatgatgcaa atagaagtgg 179761 ccaatgctat ctatgcaata ttattactaa agatctaacc aaatttacag gaaatacaga 179821 gaatatgtag gaaattctac atagaaataa cttgatttct tcacgaaacc aatggcatgg 179881 aagtgtgaaa aaaatagaat taaaagatat agcaatctaa gatagtgtgt ggacctcatt 179941 tggaaataaa ttcaaacaaa tcatccataa agaaaacatt gtgagataat caaggagaag 180001 gaaacattaa ctgaaatata aaatgataat aaagaattgt tattgctttt actgggtggg 180061 ataatggaat tgtggttgct tttttgttgt tcttatctgt ggagatatgc tgaagaattt 180121 atgggtaaaa taaaactaag cgatgctagg tggggaggaa cctatgcatt tcctattttt 180181 tggcaaaaag gccccaaggt gtagtcccta gcacctatgt tcgaacaatg tgacccagca 180241 ttcctgctga tccctgactg aaccaggtat ggctccactt gatcctctct ctggggactg 180301 ggagttagca ttcagaaatg ccagctagcc tccatcaaca gcagactgga aggatcaggc 180361 atctcaggaa tcatgcaatg tgcatcaaaa aatagagaaa gcttatcttc agagagacag 180421 acaagtgaag ctggctcaga ggtcatgtgg accagaggga tgggggaagc agttgccttg 180481 gttcctgaca atcctgcatt gaagtcaaac tgctgttcct ttccctggtt tctgtttcct 180541 tgtaacagac ttattccttt gcttgaactg ggggctttta gtggggttct gttgcttgca 180601 aatcttaagt actctgagaa acagctaagg agatgaagat ggaaaacttt gtcatgaagg 180661 gaatgggtta ctggaatctt tctggaggga gaatttcaga attacgagga tgcagttctg 180721 attgaagggg gtcttgtggg tatctgggat ttggaagtga gaatgaatgc tgtagcagat 180781 gctattagct acccatccaa tggcgattcc ttctttgttt ctagttttgt tcggtggcaa 180841 tgtgcccagg ggacagtgca tgcatgaccc atgattattt ccctttcagt gattggtctg 180901 gaagtgggca ctggacccag tttgggtcat gtaggaaaga ctctttcttt tcatcctttc 180961 ttcctgctct ggaggctgtt aggtgaggac ataattgttg gagctatggc agccacatta 181021 tgactatgaa caggtgccta gatttgtagc aacactgact gagagcactg tcattcttga 181081 gacactcata cctccttctt tcagacttta ttattttggg gatgatattt attgatgagc 181141 cccttttggt taagttgcat atggttggct cttctgttaa ttcaagccag aagattctta 181201 attgatacag ttaggcttca taatttggag gtttgggagg aaaatgaggt gaaagagcat 181261 gcattgttgg ggaagataat attttaaatc agaaaattaa cattttctac cagaagacct 181321 tcaattatta tattatagac tggtttgcag agccttttga gagtggtgtg aactgatatc 181381 catccatcca tccatccatc catccatcca tccatccatc catcatcttc tgtttgttga 181441 ttttctatcc atccatccat tcatccatcc atctatccat ccatccattc gtccatccat 181501 ctatccatct atccactcca tccatccatc catccatcca tccatccatc catccaccca 181561 tccatccatc ttctgtttgt cgattttcta ttcatccatc catccatcca tccatccatc 181621 catccatcat cttgtttgtc aattttctac ccatccatcc atccatcctt ccattcatcc 181681 atccatccat ccatccatcc acctacctca tccatccatc catccatcca tccatccatc 181741 atcttctgtt tgttgatttt ctacccactc atccatccat ccatccatcc atccatccat 181801 ccacccattt gactttcttt ttacgaaact ttccctttga tcaatggagg atttccttcc 181861 tggacttgct tggttctgtc tattatcagc tgggtataaa atgcttttcc ttcttcagtt 181921 aggtcacgtg tagagaattg atattagata gctatggtga ggagtgaatg ggattctgcc 181981 tggcctctct ctagggacac tttgccttct gcagtgtctg gcatttctaa gaaacgttga 182041 tttttaggca ttgaggaaaa aagctgtttt tttttttttt ttttgacatt aaaagctcta 182101 gttttccctt attttattcc ctggagcttc tttcaacata ttcaagcact gttggcttca 182161 ggcgtggctt tgtgggctgg aaagggccag ggtatgtttt cagaggaatc ctgtgggatt 182221 gaattaggaa tgctgccttc aggggataag tattatttta ataggaagat attctttatt 182281 ttttgattct taggaattgc tgtttctcat gtagatgtac ttacctttgt tacattaata 182341 agttccataa aatattttca gcgtatagaa tttttatata tctagaccat tgtctgtttc 182401 aggttaggaa ttgtcagtgt ttatacgtgc gcctaatccc ttgtagtctg ttcgttactg 182461 ggatggtggt gatgggggca ctagagggtg gaatttgcgc acatggagag aggagatggt 182521 gcatgtcagt gtgttttcca gttgaggccc ttggggttgt gggtggggag gaaagaggag 182581 gaggaggaag ccagtgtggt aggagtgtga tccacattct caggaagaac catatttgat 182641 ttcatttaga aaacggaatt cttacttctt gaacaaagtt ggcttttctt ctctgaacgt 182701 ggcctttgca aatgcttgag tctttgagta tggtgatgac tgagtgagag gagggatttg 182761 cttggggcag caggcacatc tggaaccatt gaccacaaac ctggctgagc acagagccac 182821 ccggatgatg ggcttaatag gtacaaactg ggaacttccc ctcaggcatg ccaactcagg 182881 aggttgtcac cagtggctgc agcttggagt gatctgggga tcttttaaat gccccaatgc 182941 cctgcctcta cctctgtcag tgaaagcagg ctgtctggga gtagaatcag agcagcagta 183001 ttcattaaag gctctccagg tgactgtcat cagcatccag gtttgagagc cactgtgggg 183061 aaataatctg tactttctat aaagccagaa gcagtcagtt ttctgaacat cactgattag 183121 accgtagcca ttttccagta tggacaaagt aggttctctc ctgccatccc ccagctgcag 183181 caaacttctc actaatgggc atgctttgag aaacaccaaa ctgcagggac ccaaaggagc 183241 atacaattaa cgttaaatca caccagcttt gggcatatat atccctcctc caccccaaaa 183301 tctcttttaa aatagtgttt taaatgtatt ttttgtagta tgttcacagg actcaaaaaa 183361 aaatacaaaa tgctgcacta ttcacaataa caaagatatg gaatcaaccc aggtgcccat 183421 cagtggcaga atactatgca gtcataaaag agaatgaaat catatccttt gcagcaacat 183481 ggatggagct ggaggccatt attttaagca aattagagca agaacagaaa atcaaatact 183541 gcatattctc actgataagg gagagctaag cattgagcac acatagacat aaatatgaaa 183601 ctagacactg gactactgga agttagaggg aggggtgggt ttaaaaacta cctgtcgagc 183661 actgtgctca ctgccagggt gacgggatct gtacaccaaa cctcagcatc ttgcattatt 183721 cccgtgtaac aaatctgcac atgtacctac tgtatctaaa atgaaagttg aaatttaaaa 183781 aatgtttaaa atccacacag ttttaaagtc tcttattctc ttctgttctc catccaccca 183841 tttcccaccc ccttctctgg atatcttttc agaaatcttt tccttcaccc cctttgttgt 183901 cacacaaaag atgacatact gcaaccccta ttctccacct tgtttttttt ttttcctgct 183961 aatatttggc tagtaagcac cagcatggtg taatgactgg catttgttct gatttgttga 184021 gcatcctcag tgactggcag aagctcacat ggtgctgcca gtctttagat atttgaatta 184081 cttcccctgt ccaccccacg tctagaagat ctctctaagc aatgcaaaga cagcttcctt 184141 atactacttt ttaaaataga tgtagagtat atatatattt gagacagggt ctcgctctca 184201 cccaggttgc agtacattgg tgcgatcttg gcttactaca acctctgcct cccaggcttg 184261 agtgatcctc ccacctcagc ttcctgcgta ggtgaggtca caggtgtgca ccaccacacc 184321 caactaattt tttttttttt tttttttttt ttgcattttt ggtagagaca gagtttcacc 184381 atgttgctca ggctggtctc gaactcaagc gatccacctg ccttggcctc ccaaaatgct 184441 gggattacag atgtgagcct ggcccaagta ttcatttttt aatagatgta aagtatggat 184501 aattcataat ttattaaaca agtcctgtag ggatggactt taggtgattt ccaatttggg 184561 gccattacaa acaatgccat taagaatagt gtggccaatg catcacttca gatgtgtaga 184621 aatccatctg tagagttccc ataagtgcta tttcttggtc aaggggcatg tgtagagtat 184681 ctttagcgac gggaaatttc acgaaggcat cacattttcc agtttgctgt agcaggcagc 184741 tctgcttctg aaattgtagt aggtacaggg ttttcttttc tttttttcca ttggtggctt 184801 tgaggaactt tcagatggga tagttgaaaa atgtacatct aatggtttta gaggagatgc 184861 ttctggagct gcagttctcc tagggccttt cttgaatgac ctgcaggcac catctggtgt 184921 ttgttaaaaa tgtggaccct gagccctatt agactctttg aggcaaggcc caataaattt 184981 ttaacacaag cttcccaagc gatgcttctg tgcactgaag tttcagaacc actgttctaa 185041 gggatttgtt gtagaacaat gccgtgagag gacatctttt catatgatta gcacagtgga 185101 accacactgg tttactcggg acaaagtctg cgactgccat gaacattgca agctcaataa 185161 tgccaagcaa agactgagtc ttggcatcgg gttgaaaata acatttggct atcaacctct 185221 gatgcagcat ttacaagttt gccctcaaat gcccatagaa taggaggaac agggtcaaca 185281 acactgaaaa tagtcagata accagttgct ggaatctcag tgaaatatcc attaactcaa 185341 aggcagatgg cactggattg gatctaccct tcacagtcaa cacctccaac aagatggaac 185401 aagaccaact ggaaacagct ttgtctatct tccactgtcc tattcctaga ctaagagact 185461 cagcttcgta ccaagtggga agtcactctc ttctgctcaa aattaagttt ttgaaatgac 185521 cacaggcctc ttacgatggc taattttcct acattcattc agagaagcca gcaagggaag 185581 gggaggcagt taacttgtgc agtggtttgt gtagggaact gctggtggtg aattgaacgt 185641 tcagttgtgt ggctgccata cgttaggtgc atagaagaga ccgaaggaac tagggtgtga 185701 gtgcattgtg gaatcagaga atcagattgt ggaggtggga gaagagtctg aggcagttca 185761 agagagctgc agtggatgag tgcttgctaa aatcccagtg ctatatgtgt agttctgccc 185821 tccaaccatg gcttgagaag aggactggac cattactagc gaaaacatca ctaggaaggg 185881 cagcctatac aaccaatctt ctggcagaca cagggcctgg atatatcttt ccaaagaata 185941 acccagtact taagacaatg cagctgcgtt tacagagttt tgaagggaaa tgttatcatc 186001 tacatatttg attctcagtc aaactgtcac ttatgtgtga aggcaacaga aagacagttt 186061 tcagatataa aaaggcactg caaatctacc accactgaaa aagttaattg aagatgtact 186121 ctgattgaga gcagatccaa aattaggaac ttaagaatga agaagtcatg gtatggagga 186181 actgggatga gcacttccag taattaaaca taatattgag tccaaataat tgttataagt 186241 ctccaattaa gtgtcagaca acaataatta tttgtgaatg agtagctata tggtgtaaaa 186301 ttagcttcat gatctatttg agtgcaaaat cctagcatat aaatgaagac tgggaagtag 186361 ataacatagg agcaaggata tgaaaacttt acttaatcta tcatctttta ttggttttac 186421 ttaatttttc atctttcatt gtggaagaca gtcagaaaac atgctgttaa ctttctttaa 186481 tcacaatgaa gtaaaactag aagttaataa caaaggttat aaaaaatcaa tgacttaaat 186541 ataaaatact ttccttaatg accctgggac gtgttcaaca tagtgctgta tgttcctgtt 186601 ggtccttacc attgcaatac ggcaaaaaag aaagaaagaa agaaagcata caaagaaaga 186661 aataaaactg tatttatttg caaacgacat gcgtatctgt gcagaaaata ccaaggaatc 186721 tataaaacaa accaatcaac cataaacaag ccatcctaca acacataagt gaattcagca 186781 aggttgccag atacagggga aacatacaaa aaacaattgt atttctataa actagcaatg 186841 aatacgggaa gactgaaatt gaaaatataa tactattgac aattgattgc tcaaaggaaa 186901 aaaataatta gctgtaaatc caacaaaaca tttgcaggac ttatgtactg aaaactgtaa 186961 aatgttgatg aaagaaataa agaatatcta cataaatgaa gacatactgt gttcatgaat 187021 tgaaagactc aatatagtaa aggtagcagc tatccccaaa ttgatataca ggtttaatgt 187081 gattcttata aaaatcccaa caaaattact ttgtagatat aaataagatt attctataat 187141 ttatatgaga aggcaaagga actagcatag ctaaaacagt tttgaagaag tagaataaaa 187201 tgggaggaat cagtctaccg tagtcaagac tatgtgatat tggtggaggg acagacacat 187261 aaatcaatag aacagaatag ataacccaga aatacaccca cacaagtatg ctcaattgat 187321 ttttgacaaa ggcacaaaag gagttcaatg gagaaatgaa ttttcaacac atagtgctag 187381 aacaattgga tatttattgg ccaaaaaatg aacctcagcc caagtcttgc actttataca 187441 aaaattaact caaaattgat catgctatta aatgtaaaat ataaaactat acaactttta 187501 gaagaaaaca taggagaaaa tttttggaat ctagggtcag ggaaaatttc ttagacttta 187561 caccaaaatc atgatccata aaaggaaaaa ttgaatagtc ggacttcatc gaaactaaac 187621 gctattgttc tgcaaaacgc tgttgagaga atgaaaaggc aaactacaat ctggaccaaa 187681 tatttgcaaa tcacttatct gacaaaggac tattatctag gatacataaa gagccctcaa 187741 gtttcagctg taagaaacaa acaatccaat taaaaaaggg caaaagctat caacagacat 187801 ttcaccaaag agggaacaca gatgacaaat aaacacatga aaagatgttc tacatcattg 187861 gcttttagga aaatgcaaat taaaagcaca atgagatatc actacagatc aaaatgtcaa 187921 aaatgaaaaa taatggtaat gccaaattct agcaagggag gctgaggaac aactagatca 187981 cccacacatg gttgctgaga atgggaactg cagtagccac tctggaaaac tggcagtttt 188041 ttaaaacatg ccaactgccc tatgacccag cagttgcact cttgggcatc tatcctagat 188101 aaatcaaagc ttaaaggctc atacaaaaac ctctgtgtga atatttatag caagcaagga 188161 aaaaaaaacc cacaagagaa acctaaggtg aactggagta agaatataga tgagaatata 188221 taatatttta tattctgatg aaatatagtg ttaggatata taaaaaggat tgtgttgata 188281 cctctaagag ttagtcctgg aaaacagaac aaagcaaagc aaacaaaaga gccaataaaa 188341 catctacttc taattatata tttaatcata tatatagatc atatatatat cacgcatata 188401 tttatgtaac atatacaatg aatatatata gcaaatatat atatgaacaa catgcaagtt 188461 agaagtgaga aaggacttat aaatccagag atagaaaaat tataagacga gatgtgctat 188521 tacaatggtg ttcagactgg tactctcgga tttatgtcgt agcttttcca ctactgtttt 188581 ctgaccttgg gtgattattt actctctcag tctcaatctc ctccactata aaacagggat 188641 aataacgcag tagttgtatg gatttggtta aataatgcat gcaaatagct caacccagtg 188701 ttggatgcct catagaacta ggatgatagc tgtttttcaa tttttggtat taagtttgaa 188761 agtccaagaa gacagaaatg ctcaaattgt ctaccataaa agatagactg cctagattag 188821 gggttggcaa atgttttctt acaggactag atagtaaata ttttaggttt atggggcatc 188881 tgagcaacta ctcaatgcta ctattgtagc atgaaggcag ccgtggacaa tttataaaga 188941 agtgagcatg gctgtgttcc aataaatctt tatcaaaaca ggtggcaggc ctgcaggctg 189001 tagtttactg gcctctagtc taaaccaata acaagaagga agtaatgaaa gttttcaaaa 189061 tcctcaccaa aacctatata acaaatgatc cttttggttt ttaaaacatt tctctagacc 189121 atagggaaaa aggacactgt ctcacttctc tcctcttacc ctagcatcat acatacacaa 189181 aaatctaaca aagataaggc aaaaaataaa gccatggtct aatttcactt ataaacacag 189241 agaaaacagt cctaaataaa atcttggcaa gttgaatcct ctaggatatt aaaagaacaa 189301 tgcatcatga tcaaggaagc ttcattctag aaatacatta tgatttatta ttaggaagtc 189361 tattaatacg tcccatcaat atgtcaaagg aaaaatatga tcatagatgc aaaaaaaagc 189421 atttgttaaa atttcaatat ttgttcttgg tgaaaattct tggagaaagg actaagaaca 189481 aaaagatact ttttcagcat gatgaagact atatgtggcc aaccagtagc caaaaatata 189541 cttaattttg agatacttaa ggaattccca ttaaagttaa gaataaatga aagatggttg 189601 atagcaacac tgttatttgt cattgttttg gatgttctag tcaatgaaac aagaaaagaa 189661 acatataaat actaattaca tattggaaaa atgaatacca aattaacact attcacaggt 189721 tatattattt attttagcaa ctctaaagga atccattgaa aaactattag tacaaataac 189781 agaattagtt aagagggctg attacaaaac atgtcaacca aaatcaataa ctttcttaga 189841 gagcatctct aacttggtgg atgacataat ggaaaaataa tcccttttgg tctccagatg 189901 ccaccagctc ccttttcctc ctccgcatgc ttggcagaag gccagaggaa gtactgtagt 189961 cctctactac agagggaagt gtaggaaact ggaatattaa ccatggattg gcaatctgtg 190021 tgtctaaagg aaagtggttt acattttctt ttaaacgtca cacttggcat gccagatccc 190081 catggcgttg gagcatacca aagtgataga gcgtgggcat aatatgtttc aggcaaaatg 190141 tttagcattt gcacagttag gttctcacca atggtgtgtt ccttccatag acatgtgagc 190201 ctggagcttg gagcattgga gtttgagaaa tggcttgatg agggcattga gaagcactag 190261 aaaggatatg gatagcaact cgattcaaat tttatttttg tctacttttc taagtttgtt 190321 tacttcaagc acttaaactc cacttctttg agcctttgtt gccttgccta catactgagg 190381 ataattaaac ctttctccta gggtcttgta gcagtaaata tgattggatg aaaaatacga 190441 aagcatgtca acaactgctt gttaaaagtt agttaccttt ctgcttcctt cttatgcttg 190501 actagctccc ttgactcttt gaaaacatca gaaagcctag ccttaaactc ctttttggaa 190561 actaagcatt gggtcccaga gcctgaggtt tcttaacgtc tgcccctagt ctggttcagc 190621 ttctgcctgt ttctcaactt tgctctcatt tcaggaaagg agacaaatga attattcacg 190681 attattgtct cttgaggggc agtgactgtc tcttctatct tgttcttcca ctcccctccg 190741 gagcatccag agcaactcta gaaataggag tgtagaaata aaacccagga aagagggact 190801 ttagaaacaa gaggcaatgc catggacagg ggtggcagaa ctgttgccct tccagtcttt 190861 gctctgccat ccaccagact tgttgcatgg gaaactcagt caaactctga agcttcggag 190921 tgttctccac tataaaagga gaaaaatgat catctacctt agggtggtgt tgtaaaggtc 190981 acatatgtgt ggccaggcac agtggcttat gcctgtaatc ctagtgcttt gggaggctga 191041 gatgggaaga tcacttgagg tcaggagttc aagaccaacg tggacaacat agcaagaccc 191101 catctctaca aaaaaagtta aacaatacta accaggctag ttgacataca cctgcagtcc 191161 cagctacttg ggaggctgag gcgggaggat cactggagcc caggaggtgg aggttgcagt 191221 gagccatgat tgtgccactg catttcagcc tgggtgatag agtgagactc tctctctctc 191281 tctttttaaa gatacatgtg tgaatgtgtc ttatttaact atgttacaca tacatataaa 191341 tataaattct gtttattaat agctcatata atatacagac tctgcgcctg tacaaatgta 191401 aaggcaatgc attttacgta ggaaaaatgg ttgagtcgaa tgatttaatt tttttacaaa 191461 tcacacaata cttgcatgaa atagcaacag gtgcgaacct cctaggcaga taggagtgca 191521 gtgctttgtt tcacagatac catcaagcct aacctttatt gtgggcttac tatgtgtcag 191581 gcctgttcta ggatcttgtt aatgaactga ttcatttttt tctcataaca acatagaggg 191641 gaaaagatca catggggaag gcaccaggtc cctggacttt gcctcgcgtg ctcctggcat 191701 tttgtagaga ggtaaggact tggttcctta tggactttta agctcatgat tggtttggtc 191761 tggtttcaat tctcatcttc atgtatgctc ttagaaataa tctcatttta ggagagaaat 191821 tcagagtcaa cacggaaact gaaaagctct agtactgcct gaagtctcag aataaagggt 191881 gccttcattt tcacttttcc aaataatcta tgagagggac ttttagtaga tccgtaagat 191941 gtgatactaa catagtatct gcatattgtg aaaaatttat aactgtatcc ataaaggtta 192001 tttgactcca ttgcctcaaa gtctatataa ttagaattcc atgatgtcat ctttgagctt 192061 taatatagct taaaattatc cttctgtagg ttgtgtttct ttctgaaaca ttttaaattt 192121 agaaaattaa cttgttaaac ttttttaagt taaatcttta gatttttttt tattcaccct 192181 gctgtggggg agtcagaatg acaaatgctt gaatcatcaa agtcaaatgt tagggcattt 192241 tagcccctct gggaaagcta aggtctccat ggcccaagaa tctaactcta cttatgggga 192301 tgatgatctc acatgatctc atgatctctg tgttttgcca acattccttc ttggcaggac 192361 ttaggagtgc agatgggccc cagagagcat gccttgctca agccaggtcc ccctcctgct 192421 gccactttgg gtggctggaa aaacctcaca aagggtgcag tgtctgcggt tgggagactg 192481 gtctcaattt tcctccatca cctgctcata aactctcatg ggtttctggg tcacagtgca 192541 gccctaggtc aatcttccaa aagctgattt gctgaatttc atgtgaattg actagcttca 192601 tgtcagcatt ttagggaaca tttgattggt cttcatgcct gctttctgcc tttgcaggct 192661 gagaaacaat gaggggagag aatgagggac agagagagga ggggactgag gggccaagga 192721 ggttccgatg agcaaaaatg gaagagaaag aacgtcagag aggaggctat tgggggatga 192781 ggagagagac cagagatcaa gaagcatgaa ggtattgagg gccagagagg ggacttctag 192841 ggacatggag aggctgcagg acagagaaag ggctagggga acgggacagt ggttctttca 192901 tttcaatgat caaagttccc agctttttga caccacaggg gcaccctgac aattctggca 192961 ataagaacat gaaaggcctg gtctttattt cactcaattc ctgctatgtg tggtgagtgt 193021 gggtgagcca aggggaaggt gatcctattg tcaggaggta atttaccatg aataggggat 193081 gatatggaaa taatgtgtgt gatccttccc ctgccactgt tgggatgtct ttttaatttc 193141 cttccctcat ttgtcacagc cgtgaaaata ctttttctga tatgatgaat gacagatggc 193201 agggtgccgg cagcccttct ggagggatgg gaggttgtgt gtgtccacga taggggccca 193261 ataagtactg gctgaatgag aaaatgagga gcctcactgt gggctttctt tggggtgaat 193321 ggaggtgctg agtgacctct cagcttccta gaagtcacag gccagaagcc gtggaatctc 193381 agtggtggaa agtcctactg atttgaggat cagggaggga gagaatcagc aatggtgtgc 193441 tgataaatgt ttagtagttg gctctctggt aaaaaagaaa aagaaaaaga aaaagaaaac 193501 aaaaacaaaa caaaacaaaa aaaacaaaca atgaacaacc ctggtatgca gtgcttgcca 193561 ctggtcagtt tccatggtca gtttctcacc atgggcaatt tcatgtgcca tcactgaaca 193621 cagagtaggg aagagatcca cgccattggc tcacaagctg gccctaccac accaccagga 193681 ggaatatata ttgggcaacg actaatagca atagaaaaac tcatgcttct ttctcctggc 193741 atgacttcat gtatatgaca tacatcctct cattttatgc ttgtagcaac tctgaaacct 193801 aagatttatt tcctcattgc aggtggggaa gccaaagctc agaggggtta aataatgttt 193861 ctgttattgt acagcaaatc agtggcagag tggattctaa gattcgtgcc ttctctactc 193921 acttcactgg gctgtcagag gttaagggaa gagttacata agccacctct gattattaga 193981 gaatgacagg gctggctaat tctgcctgga tagtattgaa gaggagcttc attcaggggc 194041 accgctttgc taaagagacc acccaaagaa tgagtagttt tgagtcacca ggtaccctga 194101 agtggtacct gaaggtacag ctggtatgat ggatgacaga tgccggtacc ttgctgtcat 194161 ccatcatatc agttgtcagt taccaggtag acctagcggt aaagggtaag ttatatctct 194221 tgcctaaatg tcagtttcct catctgtaaa atgggaccct gaaagtctac tttagagggc 194281 tatagggttg ttaagattta atgagttaag tccataaaga acctaacgta gtgtcagaca 194341 cataatagat ccagagaaat gctgattctt agcaggctaa ctttttcttt ttgaattcct 194401 ctttaggcca gaactgacta gtataggctt ttcaaacttt agcgtgcaga agaatcacct 194461 gtggatctaa ttaagatgca ggttgtgatc cagtaggtct ggggaggggc cgggcctgga 194521 atcctacatt tctaacaagc tgccagatgg tgctgattct gctagtccat agatcacacg 194581 ttttctttca ttctttcttt gtctttgtct tttttttttt tgagacaggg tcttgtgccg 194641 tcatccaggc tgaagtgcag tggtgtgatc tcagctcact gcaacctcca ccttctaggc 194701 ttaagcgatt ctcttgcctt agcctcccaa gtagctggga ttacaggcat atgcaaccac 194761 tgctggctaa tttttgtatt tttagtagag acggggtttt accatgttgg ccaggctggt 194821 ctcgaacttc tgatctcaaa cgatccacct gccttggcct cccaaagcgc tggcattaca 194881 ggcgtgagcc accatgccca gcccatggat cacactttga gtagcaaggg gctaagggat 194941 cctgacatag cattttgggg tcctgcagct tcaacttact ctgctgcctg ttcccatgac 195001 ctctaagact gctcttctcg agacagcatc cttgttctgc actggttcct ttggagcagc 195061 tgctcagccc tgctaaaggc atcttgctca gtgcaatatt agtttgctgc ggaagatctg 195121 ggacctgggc tcctctcctc attctccttg atcttagttt tggttctgcc aggactttgc 195181 tgggcaatct ttgtcatgtc ctttcccctt tttagattca atttcttgat ttgcaaaaca 195241 agacagtagg agagagtcac tttctagttt aggcattctg aagttctgtg aatctgtgat 195301 accagctggt tggtgcattt tcctgccagg ccataagcag tacttgcctt tatttctggg 195361 ctccagtggg ggtcttcaga gatacataga tgcatatagt caggacaaga gttaggagga 195421 gtaaaggtaa gggttcaatc caaacctttt ggaatagtaa cagggtggcc cagagagatg 195481 aagaatgatt ggggcacagg aatcttcttg ctttggggcc taaacgacat ctccagggaa 195541 tggctactca ttgggcacat atgagccagg cactgtgcca agagctaatg ggagacactt 195601 gtcattcttt ctggccacac actctggagc cgcttttttt tttttttttt ttaatatttg 195661 tgaattcgcc acctcataaa cccatgtctc tttaatgcac aactcagaaa cccactcgcc 195721 caacctccct tgcagctaga gcacagacat gtgacctagg atctgtcaat caggtgagtc 195781 cttgctggac tctgaattgg tggctagaag cagcagacac tgtgtttgat ctgttctgga 195841 gggggaggtg ttaagtatgt ccagactgca gagtcagcta tggtggggct tctagcaaga 195901 tctgtccctt gtgaatgttg aatattcaca gcatctgtag tgcagattgc ctggtctgga 195961 ctctggagtg gtggcagcgg gttctgtact gtggtcaaga tgtttcttct ggctttctaa 196021 atgtgtccta attggattct cagggcctcc caaagcttgt ctttcctggt taaactggtt 196081 agagtgggct tccatgttta taaataagaa tcttgactgc ttacaacact ttatgtgtgt 196141 tatctcattg aattcttttt ttaatttttg agatggagtc ttgctctgtt gcccaggctg 196201 gagtgcagtg gtgcgatctt ggctcactgc aacctccacc tccagggttc aagcgagtct 196261 catgccgcag cctcccaggt agctgggatt acaggcacac agcaccatgc ctggctaatt 196321 tttgtatttt tagtagagac taataaggtt caccatgttg gccaggctgg tctcaaactc 196381 ctgacctcaa gtgatccacc tgccttggcc tcccaaagtg ctgggattac aggtgtgagc 196441 caccgtgcct ggccctcctt gaattctcac agtaacctag tgaggtaggt gttattatcc 196501 tcttcttgta actgaagaaa ctgagtcaag aagagatgaa ctaacttgct caaggttcca 196561 ctgtcgataa gtggaggccc aggatttgag tccagatcct ctggttccaa agcacacatt 196621 cttaaccacc atgcatctcc tgtcagagat ttgtgggtgt ggaaaggtcc ttgcaaatca 196681 cttgtcagta gtacagggtt tcatgagatc tacacctggg ctgggcacgc ctgttggttt 196741 tttactgaca acgcataatt tttcgaggag tggtaatcag aggcaaccgt tctagtgaca 196801 gtgattcagg cgtaaacact tggctttggc ccagaggttg cactgtgtga gaccacaggt 196861 gaacactaga atgtttacac tcttccggag tctgtaggtg gatgccaggg acagagaatg 196921 agattgtaac ccagagaaag aaagaaaaga aagaacgaac aaacaaaaga aagaaagaac 196981 gaacgaatgg acaaaagaaa gaacgaatga aagaaagggc cagcccctgg ggcctcctta 197041 gataaggaag agaaaggaga ccttgcctca agctcagaaa ctggttaaaa cacgagccca 197101 gtcacctagt gataagcagt gcttggttct attaactctg acaacatagt actatcttgg 197161 gcagagctct ttggagaagc aggttggcta actggccaac ctcagttgat tctcatggtc 197221 ttggaacttt cctgggggtc caacttcctc cgttctgccc ctcttttttc tctaccttct 197281 atcgttttat tctttgtctc ctttattttt cttcaacaaa tatttttgag cacctattgt 197341 taacaattca accaatattt attgggtgtt tgctcggtgc cacttacagt tctaagtgct 197401 ggagatacag cagtgaacta gccagacaag ctgtcataga gtttccaatt tagtggcagt 197461 ggggtagtcc agtaggcatg aagccaggca ggtaccagtt gttgaggtgg tggtagttcc 197521 acccagtact aggaagataa cattcttatc cccattttac agatgaagac actgaggcaa 197581 gagagcttaa attaattgcc tgaggtctca caggtaggct gggatctctg tgagatctag 197641 gagtgccaga gccacttggg tccagccacc cacacctctt ccctgtgcct caggctggga 197701 tgacacccag agcctctcct ttctccctca cctgggccca gagaccacct ccttccctct 197761 ccttgcattg ctgctgccca tgctaaccga aggagcctcg ccgtggggcg gatgtggcct 197821 ccccagtggc ccacttcctc ctgcttctgc caacagcagc attggcccct atgatttgaa 197881 tttttttttt tttttttttt ttttttctgt tgatccctct agcggcacca gatgtgggtg 197941 gggtggaggg tgggtacacc cacactccct taagcctgga agtaacctag gttccagact 198001 ctgtgtgtcc ctgaatggta atagtgactc tcttcttagc tgtctgtccc actgggtcct 198061 cccactcccc aactggtgct cctgcttttc ctcagaatgg gggacataaa tctccttcta 198121 gtcccaaccc attgtcaggc agggcttcct cctggagggc tgacttccca ggactgggcc 198181 agctaaaagc taagcagtat gggtaagagc acggggttct ggagtcagac agacctaagt 198241 tcaaacccca gtactgccgt ggacaagctg tgtggcctca gacaagttga ttaacctctc 198301 tgagttcctt ttaaaatagg ctgaggtctg tagggataag gtacctgagg tcttagtgcc 198361 tggcacagag taaacccacg tgttggctgt ttccattatt gggtagaaca gaggggcctc 198421 ctggcatcag tgggcatttt gggcctgctg ctgtcttgtc tatctaggcc ttggttttgc 198481 ctttcaggct ggtgcctctt tgcagtttca ctttagcaag tgaaacttta tttttcctgt 198541 ttctcctgat ctgtttcctc cttattttcc atagtgaata atcacacata agcctttctt 198601 ttctcagggc aaggttgctg tctgtgaagg agtggaatgc ctgagcatcc ctggtaggaa 198661 aggaagagtg accagcatca atggggtact ctattccctg agtagcaagc cctcttcccg 198721 ccttgtccct ggctcccaca cctgttatcc tgccggcagt catctccagt atcccagagg 198781 agccataggg atgggtgttt gaatataaga acactctgca gctcagatat gcggcattac 198841 ttttcctgtt tgactcagtt ctgcagaggg ttgctgagtg actgtcatgt gcaaagcacg 198901 tgggaggaaa gatgaataat gagagcaatc gaatgcccat tcccttgcat ccagggtgct 198961 ctactagacc tgcgttcttc catcattctt tcagcccacg tggttgccaa tactgggagt 199021 tatcagtctt ttccattttt gccaatctaa tggataaaat atcttgtatt aatttgcatt 199081 tcccagatat tcatgagatt aggcatacgc tttttgggca cttgtatttt tttctctggg 199141 aattccctct tcgtattact ttgtcaattt ttctagtggc ttttttattt gagacagggt 199201 cttcctctgt cacccaggct ggggtacagt gtccagatca cagctcactg taacctcaac 199261 tccagtgatc ttcccacctt agcctcccta gtaaatggga ctacaggcac acaccactat 199321 gccaggcttt tttttttttt ttttttgtag agacagggtt ttgccatgtt gcccaggctg 199381 gtcttaaact cctgggatca actcatcctc ccaacttagc ctcccaaggt gctgagatta 199441 caggcatgag ccactgcacc tggtcccctg tctttttttt tttaattaat tgatagtaac 199501 tctttatata ttctagatgc caatccgtga tttagtctaa aaataaatcc ccttggtctg 199561 tcctatgtct tttcttttgg tttatgaaat cttttgtaat acaggaacgt cacacttcca 199621 cagtcaaatg tgtcagtctt ctaaatagat ataggtacct tgtatatact ccttgatata 199681 tcacattgat tggtggtgac ctgtttgtct ccctggctgt cagactgtga gaaacttgag 199741 ggcaggggct gtactttatt cttctctgtc tctctaggga ctagtgcaaa gctgggcaca 199801 gagttttgtg cctgaatgag ttttttttgt gtgtaaacgt ggtataaaac aggtcttgtg 199861 actgaatgag ggaatgaatg aaaaagccct gtcctgccct ccctgcagag atggggtctg 199921 ctgcagagat gggagtgagt cagggagtga atcacaagct cttccgtggc tcagctgaca 199981 tttcaaaagc cctgagcaga gaagcccccc tcattccaag agagcctccg cctctttgag 200041 caggcacagt gaagctctgc tgattatcaa actctgtgtg tgtatgtgca cgcacgtcag 200101 gggtttagca gttaaaactc ttagtgctgc atattctgcg tatgtgtgaa caaggcttta 200161 aaaaatctct cagaggacct gtgtatgtgt gtgggtgctg ggggctgcat agagagagag 200221 ttaaaagaga tgagggggtg tttagaatcg ttgcaatgta ttgcatttgt ttctgggagg 200281 gtggggcgac tggagggggt taaatagctc ccagaggcgc tcttgtgagc ggagcttcct 200341 gaaatctctt tatctgtggc cacatgctgt tcttggttgt ccacgtggtt ccctggtgtc 200401 cctgttgctg aaggcacttc tgttttgttc ttggaggctt ggcttctcat tgccattttc 200461 taaagcagta gagagtgcca gttagattgt tagacccaaa taatctgaaa tgtgtgaaaa 200521 aatagtaaat atttactgta ccttttgtgg aacaggttat tgccatggaa atgcactgta 200581 cacgaatctg gatgatggag ggaagctggc tcaggagcta tgggagtgtt tggataccat 200641 gccgcctgaa ttggcaggtt taggagtgag tgccgtgggt ttagaatcac agtgtggttg 200701 ggtgaggctg aaacacttgg tccccagtca acctgtactg agaggggcat ttattcattg 200761 cttcccacat gcagggtgct tcctagagca ttctccatgc attattgcgt ttaatcatca 200821 tgccacccca atgaggtgcc attatcctcc ctgttttacg gagacacaga gtgtttgatc 200881 tacttgcttg aaggtacgca gtcaatgaat ggcagagccc cctgactctg ttctctggag 200941 cttgatcctt ccctgctcag ctgtttctct cactgtgatt taccctagag aaagctaacc 201001 agctgtctac cggggtttta ttttccaata ttgacttcgg ggctgacatt tagaaaccat 201061 caacaatttc cagcttctct tgaaaatgtg cacaatctgg ctactgtgtg gggcttccct 201121 agtggctgct cccgagcagc catccaggtt agccgggcat cagctctgca gttggcgctg 201181 ggcatcctcc tgttgcagct ggggagaaag catggggtat cacacccagg ggctccttga 201241 attaattgag aactgtctgt tgagagctcc tcatcctgtt agacttgaca tttaatttag 201301 ctgttgtttt catttgccaa ttagagcttt tgagattctt ttgattttct tcctgtggtc 201361 tcaaggaatt tgtgggtggt tgagtgaaga tcttcatggc aggaaaggaa ggctgcaaca 201421 ggagatgtaa tgcgtgagca cattcattca ttcattcatt catttacttg tttattcatt 201481 tgttcaacat attcattgag tattggatgt tgaagttaca acagtggaag tatggcaata 201541 aataagacag acccagtcac tgctctcaag gagcatatag tcaaagaggg cagtcagaca 201601 tcaactatga tgcacgttac agggaaggca tccatctttt tttttttttc tgtgtcaccc 201661 aggctggaat gcagtgatgt gatctcagct cactgcaacc tctgcctccc gggttcaagc 201721 gattctcctg cctcagcctt ccaagtagct gagattacag gcacccacca ccatgcccgg 201781 ctaatttttt tttaaatttt attattatta tattttaagt tttagggtac atgtgcacaa 201841 tgtgcaggtt tgttacatat gtatacatgt gccatgtagt agagacgggg tttcatcatg 201901 ttgaccaggc tggtcttgaa ctcctgacct caggtgatct gcccaccttg gcctctcaaa 201961 gtgctgggat tacaggcatg agacaccacg cctgaccagg gaaggcatcg atcttagact 202021 gacttagtgg tgtctaggtc tctgaaggag gtgtcagaga aggtctcctt ggctaatctt 202081 tgtgaccacc ggagctctgt tagtttcata agacataagt ctcagcaact gttgcagaaa 202141 gacccttcct cagtaaggat gatgggctcc tcgggctctt gcccttcccg tcctgggagg 202201 aaccttctat caagagccat ggctagtcat ttcatccatt gggaatgagc ttactggctc 202261 catttatgag agttagttta gactgaggag taaaggagga gtgaaggagt ttgggagttg 202321 tcagtgataa gcatgcacat ttggagttga caggacagca tggacactag agcctgaaca 202381 ccttagttca gcttagctgc actgcctatt atgtgatact gggctggttt cttaactgtt 202441 ctatgcctca gtttccccat ctgaagttgg gaataaaata ctcatctcat agggttattg 202501 tgaagagtaa atgagtcaag ctgggtgtag cggctcatgc ctgtaatccc agcactttgg 202561 aaggccgaag tgggtggatg gcttgaggtc aggagtttga gaacagtctg gacaacacga 202621 tgaaaccctg tctctactaa aactacagaa cattagctgg gattacacat atgcaggtgt 202681 gtaatcccat gaaacacatt tcaaataatt gctaaaattt ttcaagctgt ggcataagag 202741 acaacagtcc tgaagaggac tgaaaggtct tgggatgcag gggagcttaa tgtattaaaa 202801 ggtgacattt tggatcagta gggaaaaaat tacacacagg tgtgtgtgta atcccagcta 202861 cttgggaggt tgaggcacga caatcacttg aatctgggag gtggaggttg tagtgagcca 202921 agatggcacc actgcactcc agcctgggcg agcgagactg tctcaaacac acacacatgc 202981 acacacacac acacacacac agccaggcaa aggatgaaga gaagcagtta ataaatgtga 203041 gccattcacg tgtgtattat gattatcacg tgatatcagc cagtcctgct tgagactcac 203101 ttttccgaca gttctttctt cggtggactg tacccatgcc cctgtgtgcc atatatttgt 203161 gatttccagc cttcctttct gataacttca attcattgct cttcccaggc aagtccttca 203221 atggttatgg ggtgccgcgc ttgccctcat ccccagctga gtgttctgcg tttggggact 203281 tgtgtttata agtgtgggag gctgtcccct gggggttact gcccagacag ttagggttgt 203341 ccataataac agaaaagtgc atggagagag aaggtgcagg tccgcctatt tgatgttggg 203401 taggatgtgc tgcttgatga ccggtgaaac atttacttct cccaccatgg gcgagacctt 203461 ggatgtgtcc tttacctctc tctttgtcac ttctgtcctg aactgcccat gagtgaccag 203521 gggcttccga cagaagtgtc tcacaatcca accacacagc agaaaggcat tttcctttgg 203581 atttgagctt tgtccctgca ctgacttttt gtgtcatctc ctcttcattc aatgtgagat 203641 actcaccttg attcatagaa tgttttctca ttccaggttt gtattttttt aatcaccaaa 203701 agaagcactt atcataaaag actgtgatga tgtataactg tgtatttagt ttgaaaatct 203761 tcaccttata gaagcaataa acaatattta ggtgtattat cctgtgaaat ttttataaat 203821 ataagacttt attacgaata aacaaaaaaa gagatcatat cctatatact ttttttataa 203881 ttttttttga gagagggcct tgctgtgtca tctaggctga agtgcagtgg tgtgatctca 203941 gctcactgca acctccacct ccagggctca atcctcccac ctcagactct caagtagctg 204001 ggactacagg cacgcgccac cacgtccagc taatttttgt aatatttttt ttttgtagag 204061 atggggtttt gccctgttgc ccaggctagt aactttttcc cttattaata ccttatttag 204121 atccacctta tgttttaaat ggctgcagag tgtttcataa tatggccacc atattattta 204181 cattcatatt tataatttgt ctaaaacatt agcaataggc atttggatta tttccaagtt 204241 ttctctataa tgaacagtgc aacagtgaac actcttgggc actgtagtgg ggatttccat 204301 agaacaattc ttagaaatgg ggtgtttagg gcaaaggata ttaacatttt tagttgtaat 204361 agatattgct aaattgctct caaatgttgt gccaatttat gcacccacca acagcgcacg 204421 aaagtgcctc ttagaaaatg tcttttagaa gataaaatac ctattgttaa gccatgaaac 204481 acatttcaaa taattgctaa aatttttcta gctgtggcat aagagacaag agtcctgagg 204541 aggactgaaa ggtcttggga tgcaggggag cttaatgtat taaaaggtga cattttggat 204601 cagtaggaaa aaaaatggct tatttaatac atggtgctgg cataattggg aggccatgtg 204661 gaaggatgct gagtttgatt cctaactcat acaaaaataa attacaggtg gagtaagcta 204721 taggcagtaa aaaaaaaaaa aaaaaaaaaa aaaagaatat tagaggaagt tttaggagca 204781 tacttctgta accttgagtg agggaaatcc agaagccata aagaaaaaga ttagacagtt 204841 tcaattatgt aaaaataatt taatctttta aaaggagaat gtattcatga attgcttatg 204901 taataagaaa gactattaaa aataggtcca gtagataatg tacaatggtt tccaatgtga 204961 ataaagttga agttaataaa gaaaagaaag ttttaggaca tgtcagggag tttttgttca 205021 aggattgaga ttctgcaatg tttattctaa ataaagatga atgatagact agtgcacatg 205081 tagtaaacta gtaaatgtgt agcccataaa ccttgtaagc cctggaaggg taaagaaaaa 205141 gggaaggaaa tgttgttttc tgcctcttta ttttttgttg ttttgagaaa gttgagtacc 205201 cacccaaagc cccaggtgtc ccaggagaga gaagattgaa aacatgccat caaaacagga 205261 gcttcagcct cagccacaag ctccagtgtt tactgggggc cagtgccctt attcccacgt 205321 gaagcatctg ccttttttgc ccccttgaag tctgagtctc tcgtctgttc ctggagattt 205381 ttagccaagc ccctcctccc cacccctcac cttagtattt gtaccaccct ctgaatcaac 205441 cccaaagaca cccctttgta ctcttccctt catctggcca tcaccagggt actgtggtca 205501 ttgaagctga gtgtgtgccc tcactttccg taaactcaca gttcctttat tctgcacctt 205561 cagggattgc aaatcgttaa gggtatgcct tcaaatacca gccctctgac ttcctagctc 205621 aataacctca cacagctcta tatctcagtt tcctcatctg aaaaacagtc atgaaatggc 205681 accacctcct agcattaatg tgaagattgt tattattatg tgttattaat tattattatg 205741 ttatcctatt aatatatgag acagaatctc actctgttgc ccaggctgga gtgcagtggc 205801 acaatcttgg ctcactgcaa cctctgcttc ccaagttcaa gggattctcc tgcctcagcc 205861 tcctgagtag ctgggattac aggtgtgtgc caccacgcct ggctaatttt tgtattttta 205921 gtagagacag ggtttcgcca tgttggccag ggtggtcttg aactcctgat ctcaggtgat 205981 ctgcccacct tggcctccca aagtactggg attacaagtg tgagccactg cgcctggcct 206041 atcttattat tattattatt attacttttt gagatgggat ctcactctgt cacccaggtg 206101 ggagtgcagc agtgcaatct cagctcactg caatgtctgc ctcctgggtt caagcgattc 206161 ttctgcctca gcctcctgag tagctgggac cacaggcacc caccaccatg cccggctaat 206221 ttttgcattt ttagtagaga tggggtgtca ccatattggc caggctgatc tcgaactcct 206281 gacctcgtga tccacccacc tcggcctccc aatgtgctgg gattacaggt gtgagccact 206341 gcgcccagcc tctttttatt attatactta gcacggcact tggcacttag gaaatgttca 206401 gtatatgtta gctattattg gctactgtta tttcttcttc ttctcctcct cctcttcttc 206461 tttctcctct tattcctcct cctcccccac tcccttccac cccctcttct tattcctcct 206521 ccttctcctc attattattt taagtacctc cttaccaatg cctacagaaa tagactgacc 206581 tagtaccagg atctagagga gttcaatcaa gttcgtctgt tgccttcaag gtaacagcgt 206641 agtggtggtt ccgagccaga tgccagaccc agtctgcctt tgttcaaatc ctggttttac 206701 acttaatact gtaggtacct cgggcaagtt acctaacatt tcttcacctc aatttctcca 206761 tttgtaaaat gtatataatg atagtaccta ccacgcatgg ttgttaggaa gattaaatca 206821 attaatattt gtaaagtaga gcagccggct cattgtaagt attgggagta ttggttaaaa 206881 taattacaaa ataaacaatc tcacttagat acatccctta ccccctgtgg attatgtcaa 206941 gggtgaaaga aacagtgtgc ttgaaatcct gctgtgcgcc gagctgtata tcagaggtgt 207001 tatggtgatt cgtcatcatt ctcctgacca ctttccattt ttcttttaga gtcaacactg 207061 gccaatcttt agggaggagc tctgatcatt gagaaactga cactgtggaa gggggcattt 207121 ctgactctct cttcctagct ccatttgatc ttcctaaatc tttcaataat gtgtcattgg 207181 ggtaggacca aagaaaaacg gggtgaatgt acggttgctg ctttgtttcc catagtggga 207241 gtgactctga tgtttcacaa aggcaggggt ggagaggcag cgcctcgcag gaaagccttc 207301 ttgctaaatc gaggacttct gcacagaaat agatacaaac ttgcttactg tttaaataga 207361 atattagcga ttatctttta tccagctgga gtctcctcgg tttggtactg cctcaaatga 207421 gcagcctgca atcttgggtg tcacttcctc actggttctg ctttccataa ctgtgttgaa 207481 atcactcccc cagaaggtca ccaaaaggga aaatattagt gtgtgtttat tggcacgatc 207541 ttgcaggtgc tgggctaagt gtagacatga tttattttca tttaatcttc aaggtagttc 207601 taggcgatgt ccccccactt tgcagatggg gacactgatg tgcaggaaat gaagtgacat 207661 gctgtagcac tgttggtgct ctgctgtgtc ccctcaccct tccatttcat tgcattctgg 207721 cctggcttcc aatgggcagc aactgcagct gttcacctga gggcttgctc tgccgtagtc 207781 cattcggtcc ataggcatgt caggctgtgg aagcacagga ggagaggtga gggatggtgc 207841 aggagagagg agaggatgat taatgtcccc cagggcccat cctcaaccaa tgacttatgg 207901 gaaacaacct cctagagttg tattaagctc cagtgaccca gaatggtaac aagctcacaa 207961 acacttcctt ttttaaaaaa tttcacaaaa atactttatt atacccatac ctccttgttt 208021 tagcccatcc cagtacggtg gtatcttcag aacacttcca ctttcccctc tcctcacctc 208081 ttccctccca cccttgagat gtaatctttg gaccactttt acttctttgc tttctctcag 208141 caccctttct cccctactct atggtgactt tgtgtttttg aattgaagtc gtttcatgtc 208201 tgtgtctttt ttttttcttc aactgttatt ttaagttcta gggtacatgt gcaggatgtg 208261 cagatttgtt acataggtaa acacgtgcca tggtggtttg ctgcacagat cgacccatca 208321 cctaggtatt aagcccagca tccattagct attcttcctg atgctctccc tcttccctcc 208381 ccactctgac atggcccagt gtgtgtcatt ccccaccacc aaatgtccat gtgttctcat 208441 tattcagcgc ccacttataa gtgagaacat gcagtaaaac actgccttta taggcttcct 208501 tcccttcact gtcttacttc cccactccct actgctgctc tctgggatca cctctcacag 208561 aagctgtttg tactttaatc attgtctgag agtctggggt atcccaaact aagataattg 208621 cccaaggtct tagctaaatt caacagccct atttgtcttt atcctccttg tggcaatgcc 208681 actactaatg gatgtccctt cttaaaattc ttccccattt tggcttttgt gacacgtaat 208741 agaattcttc tacttctgat tatttctttc ctttttcatt aattggcagc tctttctttt 208801 taaaaaaatt tgctagattt ttgcttattt ttttttttaa gaaatgaggt ctcattatgt 208861 tgcgcagact ggccttggaa tcctaggctc aagcgatcct cccatctcgt cctccgaagt 208921 gctgggatta caggcgtgag ccactgtgcc catctttctt ttcctttctt tagccaccca 208981 gttttaggta cctccagtat cccagtattt tagttgtaag tgacagaaaa tctgactcaa 209041 attggtttat gtataacaaa atgtaagtta ttggatcaca tacttgggaa tcttaggata 209101 ggtggcttca gggaagatat gatcaaggtg ttcagcaatc tgattctctc tccatcatga 209161 ttcagctttc ctctgtattg cttgattctc aggcatgtgc ttgactaatg aggcaacaac 209221 aacttgcagg tccaatctta tcttcatcaa tggtatctta actacatcag tggtatagga 209281 ctgcttcttt ccttagagtc ccactgtaag ttttgggttt cattctcatt ggcctgtgtt 209341 gagtgatgta tacatttcta aaccaatcac agttgccacg gagatgtaat acaggttgag 209401 catctcttat ccaaaatgct tgggaccaga agcatttctg agtgtggatt ttttttttct 209461 tcagattttg aactatttgc attttacttg ccagttgagc atcccaaatc caaaaagtta 209521 aaatccaaaa tgctccaatg agcctttcct tggagcatca tgtcagtgct caaaaggttt 209581 tgggttttgg agcatttcac attttgggtt cctggatttg gggtgccctc atttcccagg 209641 ctgggtcaca taatcaccct tggagctaag gttagagttg agccctactc cataaggact 209701 gagagtgggg agaggtgatt tcgcaaagga aagtcaaagt gctgtcacat gaggggatgg 209761 gtactggaga ggcaaaaaca acagatgttc atactccgca tctctgggtt catttatctg 209821 ctcttctctt tctctgtgaa tcatctgatg atttccactc ctgcctgcct gagtaggact 209881 ctgaaactcc tgcctccaga tctgactcca cttctgagtt gtagcctcct ttcccccacc 209941 ctctagagtt aggcatttcc atttggtggt gccatgcaga cctgaaactc agtgtgccac 210001 aaccctcgcc acctcccacc tggtttcgct ccctaacgtc cctgtctttg tcagtggtgc 210061 cagttgggtt tccatctctc cctcatccgt cccaaccccc acctaatttc cagccccatt 210121 gtgctgcttt tgattccttg aatgcacctt tgcaatgctt ttccttggtt tgtaaccctc 210181 tttagcatat tcttggtttg gtgaacccct ggccatattt agatatcagc tcagaattca 210241 cttatttggg aacctctaac ctggtcaccc atctctgcca agcctgagtt agatgctctt 210301 tccctgagtt ctactaggcc ttcatctcgt ctaccacagc acttctcact cagactccaa 210361 tgactggctc cattgcctgc cttccccgta agggcagggg cacatctgtg taattcacca 210421 ttgtatcttc agcagtccag cctgacaatt gagtctaggc tggtgtccaa caaaccagtc 210481 tttgtttttt ttttctttta ttttaatttt gatagttttg ggggagcaag tggttttttt 210541 ggttacatgg ataagttctt tcatggtgat ttctgagatt ttggtgcacc catcactgta 210601 acagtatatg tggtatccaa tatgtagtct tttattcttc accccctccc accattcccc 210661 ggagtcctca aagtccatta tatcactctt atgcttttgc atcctcatag cttagcttca 210721 tttacaagtg agaatatatg gtatttggtt ttccagtcct gagttacttc ccttagaaca 210781 atggcctcca acttcatcca aggtgctgca aaggccatta tttcgttccg ttttatggct 210841 gagtagtatt ccatcatgta tgtgtaccac attttcttta tccactagag ataaaccagg 210901 ttttgaattc aggatccacc tctcctgagc tgtgagcaca tgggcaagtt tcctcaccag 210961 tccaagcctg ttttctcatc tctaaaatgg agacaatgac ggttcctccc tcactgagca 211021 cttgtgagaa gtaagtggtg atcacggtta caaagtgttc agcacagtgc ctggaacacg 211081 gtaagcgttc aatcagtgcc aatgatcatt tattatactc actatgaaca ttatttttac 211141 ctcatgcagt gtctagaaca taatcaactc tcaacaaatg acagctctta ttattagcca 211201 agaatggaac ctgaaacata gtacgctcca tggatcattg aagctttgtt tttgaaagtt 211261 tgaaattaac aaaaagagcc tttgatgttt cccttgctgc acattcttcc cctccaggtg 211321 gagtgggcag tttccactgg gtgctggaac caaccctcct gtagttggag gtgatcttct 211381 ctgatgtcac tagggtgccg aggtgatgtg tcaccagcac ttgtttgctt gtgtgatgta 211441 atgagggtgg acttgcttct cctagggaag agcctcatta aggttagcaa ggggcttgtt 211501 cctgggaagt tctaaggacc gcctggggct gccacataag gaactcccct tttctcattt 211561 tgtctccggt aagttgttat tgatctgtac atgatgtttc tctttcttac cagccgtggg 211621 aatgggccat cctctggcat cactgttgcc ttgaacgtta ccatcaccac cgttgccatg 211681 aagtcatgac agcagcacca ttggctctgg tttattgggt gtctaccatg gacagtagta 211741 gggctagaca tttagtatac tccagggcta ctgcggggat taaatgaggt atgaagcatc 211801 tataacccca cggtccaata gggtagccac tagccatgtg tgcctactga gatatacatt 211861 ttaattaaag ttaaataaag aaaaaattca atttatcatg cacagtagcc acatttcaag 211921 tgctcagtat gtgctcatgg ctgatgtagc tggcagtgaa gatggggaac attgtcatca 211981 tcacagaaaa ttcttccaca gcgctgatct agatagaaca ggattcagtc cgctgtggct 212041 tgtggatcca tggactgaat tcatacacac agaattttgt gggaacacag ctgtgcctgt 212101 ttgtttatgc cttatctatg gctgcctttg cgctacaagg atggaattga gtagttgtga 212161 cagagaccat gtggcctgca aaacctaaaa tactcagtct cttatccttg aagaaaaagt 212221 ttgtcagcct ctggtctaga acattgcctg gctcttggta aggttaaata catgtttgga 212281 actttttatt tattgaaaat aatatacatt ttgttctcca cctaaccttt taaggtaagt 212341 attgttatcc ctgtcctaga gatgagaaaa ctgggactgg aattctgtac tgtcacttcc 212401 agggggcaaa cacctcatga ggtttttttt ttttttttgg cagtcttgct ctgtctccca 212461 ggctggactg cagtgtcgcg atctcagctc actgcagcct ctgcctcctg gggttcaagt 212521 gattctcctg cttcagcctc cccagtagct gggattatag gtacctgccc cctctccccg 212581 ctaattcttg tatttttagt agagataggg tttcatcatg ttggccaggc tggtcttgaa 212641 ttcctgacct caagtgatcc acccgccttg gcctcccaaa gtgctgggat tacaggtgtg 212701 agctaccatg cctggtctca tgagtcttgt tcatcactta cctccttttg ttcttccagc 212761 ttcagtttca ggttcagggg gtacatgtac aggtttgtta catgggtaaa ttgcatgtcc 212821 ctggagtttg gtgtactaat gatttcatca cccaggtagt gagcatagta cctgatagta 212881 gtttttgatc ctcaccttcc accctcaagg aggtctcagt gtctgttgtt cccttctttg 212941 tgtcatgagt gcctgatgtt tagctcccac ttataagtga gaatatgcag catttggttt 213001 tctgttcctg tattaactca cttaggatag tggcctccag ctgcgtccat gttgcagcaa 213061 agagcatggt ttcattcttt tttatgcctg catagtattc catggtgttt atataccaca 213121 ttttctttat cccatgcacc gtggatgggc acctccgttg atttcttgtc tttgctattg 213181 catcacttac tcttgaatct ggagcagtgc ctagcacata atagacattc aggacatgtt 213241 tgttgagcaa tgaatggatg agttaggaat caattacata taaatcaata atcaataatc 213301 atcaatgatg aatttctatg aacaaatatg aaaattagat cggccgcatt gtagagatga 213361 ggagacagcc ctgtagaagg aaagtgcctc atcaaaatgc aacatcgctg tgaaagtgta 213421 gacttggagc tccagctgaa gtcatctgtg tattttctaa gctgtcccct ctaccatcct 213481 gccttctccg ggagctgcca aagtgcagcg ctgatgcctc ctctctctgt cccacagcct 213541 gacttttgtt cagctcctgc ttctcttctt caccaactgc cagcttcagc tgctttgtgg 213601 caccctgggg cttgaagctt catttctggt gctgtggcac cacaaagctc tgccccagtg 213661 tcacttgagt ccaagtagat cagccaagtt tgaatgatct ctctcttggc cacttcagtc 213721 tatgaagctg gagttgagac catctgtgat tttacacatt cgttacagca aaaccaagag 213781 ggccatgggg taagagggcc attaagctag aggctgagca aaactctgct cttgaagcac 213841 cttctgagat ggcttctcat ggaacgtctg cgcaaagtgg cttcacatcc aattaagtca 213901 aagatgcatt aaatgaatcc agtttgtaac tgcttccaac catgaagtca cagttccatc 213961 tcttgccctg ggaggggctt ctttcaggct ctgaggactg gagccctgga gactttattg 214021 ccttacggtt tcctcctcta ccatcagatg tttctttttc atgtgttctg gcctggattt 214081 gaatagactg cctggatttg aatgctgcct tcacctcttt ctggtgtgta atcttggaca 214141 agttaccttc tttctcagtg cctcagtttc ctcttcagta caataaggat aataataaga 214201 cctaccacat agctgtagag aggattaaag gagccaatat gtggaaagta tttacaacag 214261 tgtactgcat agtagctttt ctataagtga ttattattat tatcccttcg aatctaaaat 214321 ccctctagtt tgtttcagag gagcaaggat tgtgtctcgc ctgcctttaa atccctttaa 214381 gttgggtgtc tcaagcccaa ccatctgcag gagactagaa ggggacttac atgagtgaat 214441 tgaatcaggt gtggatttct gtgaactaga gaatgcaagg ctgtctgaaa ggggaactat 214501 tccttattca tgcatcacat atacactggt gtgccaagcc ctgttctgca tctggggaca 214561 cagcagagaa tgaaccgggc aaaattccct gcccccttgg agttgatgtt gtagtggact 214621 aagctctgat gactctgttg ctgcatgaga atgtgggccc actgctgaca aaacttctga 214681 tttttctgta agaaaagccg gaaacctgga tttgtaaaaa ccgtgaactc tcctagctta 214741 agaattggca gtggtttcca tttttttttt tttttaaagc acagtacatg ctgtgaaccc 214801 aagagccagt ttgcagcccc tgccttaaga cactcctcta attgctctgg gaggaggcct 214861 gtctctgccc tgcaaatcag cagctggtcg gatttcaagg tcgttggtgt ggagggcgcc 214921 tcctccctct cctgcgtgga ggttattgtg gctttttctg cccacagagc agatgacgct 214981 gccaccaggc agtgagttag aatattcctt ctcttatcct tccttttgga tggtggcctt 215041 ggctgggttc ccagccccct gacagcagaa gacaaaagga tggaagcagc ttcctcatag 215101 tcacccatgg gggcgatagg gtggtgatgg tgatgtggag aaaatgcatt gggtcaggac 215161 agccaccttt gaagccagaa aattaatgat ctctgttgcc tgaatctttt caaagtggag 215221 aagtaattta aaagcttgtg tgggaaggat tgagaatggg attgagaaaa gaagtttagg 215281 ttgctatcag gaacaattcc taacagcagg aggtttttgg agaggcagtg gtgaaattcc 215341 ttggaaattt ttgtgtggtg tctttttttc ttttttcttt ctttctttcc tcttttttgt 215401 ttttgttttt ctttttttga gatagagtct cattctgtca cataggctgg agtgcaatgg 215461 cacaaccccg gcccactgca acctccacct cccaggttca agagattctc ctgcctcagc 215521 ctcccgagta gctgggacta caggcacccg ccaccatgcc cagctaattt ttgtatgttt 215581 agtagagaca gggttttgcc ttgttggcca ggttggcctt gaactcctga cctcaagtga 215641 ggtccacctg ccgtggcctt ccaaagtgtt tggattatag gcgtgagcca ctgcgccggg 215701 ccttttatgg tgtctttgaa gtgcttgttt aaatttacat gtggttgcct ctcagtgggg 215761 tgcatgctct ctggttttcc ccaagcccca ccactccctg ttgtgcttta cccacttgcc 215821 tgacttagtt atgcttactg gcctggcctt tgtgggtctt tgagtttgtg ccttctgctc 215881 tgaaggaagc atgcttgaga tgatggaatc taggttcaaa cctacctgcc ctatctgccc 215941 tttactgact tgtgtgactg tggacaagtt cctccacccc tgaggttgtc aggaccctgc 216001 aggaaataga accattcagg tggctgcaag aagaaagttt catgaaagga ctatttaggg 216061 agacttgggc aaggttaaga ttacaaataa ggaatgtgga agcacctagg gccttgcaac 216121 agcaggaggc ggttaccgtc tgtaaagagg aagaagcaag cggagggaat agtttggagc 216181 ctattgagag ctggggctaa gaaggaaggc atcgccagca ggaaccatgg tcctggaggg 216241 atgaaccgct tcccaaatag cagtaccgag gcagggagga agagggggaa gaaacacccc 216301 agcttctctt cacctctccc attgcctgtt ggcctcaccc actggctgaa cccagctgga 216361 aaccagaagg accagatgat gaattctgta agggtcagcc ttttggggct cagctcacag 216421 cagaatcaga gagaaaaaaa tggatctgag aggtgaaaca aggaacaagc agcacgtaac 216481 tctcagcttc cctaccttga aagtggagga gaatgagata actgaccttg gaaggttatt 216541 gggtggctca tgtactctcg aagggaaggg cctggtcaat attaagtact cataaatgct 216601 gagattcagt gtgggtttca aaccgtttct aggagctaaa ggtcctgaga aagggcttca 216661 gtttcagtat caagagacat tctcaggtgg gcagatctct tgaagccagg agttcgagac 216721 cagcctgccc atcatggtga gaccccatct ctactaaaaa tacaaaaatt agctggatgt 216781 ggtggtgggt gcctgtaatc caagctactc aggaggctga ggcaagtgaa ttgcttgaac 216841 ctgggaggcg gaggttgcag tgagctgaga tcgcgccact gcactccagc ctgggcgaca 216901 gggtgagacc ctgtctcaaa aaccaaaacc aaaaagacat tctcctcctt cccctgcagt 216961 ggggcccact gggaatggtt gcatcttcca tggggagttt aaagtgaatg tgtagtggct 217021 aagagcaaag accctagaac caagtaggtg gacactgtgg tagtccatca gtgtggtaga 217081 cactgatctt tctgttacta tagattaatg gccttgtaca gtatgcacta ttttgtgtct 217141 gccttctttc acttaaaata atgtttttga gactcacgca tgcatgtgcc tgtattggag 217201 tctattgaga gctggtttcc agtcctgatt gccactgaca gtcaagttac ttaacttttc 217261 atgccttatt ttctcatctg taaaatgagg atgttagtag tttctacctc attgggttct 217321 gaggattaaa atgagttaat ctacgtaaag agcttaacac agtgtctaac acatagtaga 217381 caatcaatag atgttagcta ttgtaattgc catcatcgtc atcatcatca tcattatcat 217441 catcatcgtc accatctcca tggatcagct caaatcaaaa ctgacccatt gtcaatagaa 217501 aaattctctg tgggatgaag tccatgccac caggcctctc gggaacttca ttgatcatac 217561 cattatcttc tcatcctccc acagatatga tatttcaggg agtgattgat aggtgtgtgt 217621 gtgtgtacgt ggggaagggt gggcttgtgc ctgactccag ggacagagtg ctttgttatg 217681 cagaatagct tgcctgtcat cccactacag gccccttcct tgaaaagttc caggctattt 217741 ttgagccctc atggctcaca tgttcctttc tacatcatgt caatgccagt acagaaaaat 217801 tgaaaccttt tggcagcatc taaagcaaaa gagatttcaa agtgcatgga ggatgagtgt 217861 tgctcaaatg acagggtatt aaaacgtgac aagaaggctg agttaagaga ctgcaggcag 217921 cttgtgtgtt tgtatttgga ccttgctgga tttcttggga aatcagtgga gttgtccaag 217981 gttaatgtca tctccaagaa ggcaccaaag ggtagtaaca ctccttcacc ctcatttgta 218041 catacactct gcaagagcgg aagatgatgc agccagtgga gtgggcggtt gtcccagagc 218101 accacctggg ctctgagcta caggattctg caggggaaag cacctgagga cagagtgagg 218161 tgggttggtg ccacctggga gctccaggct tttgtgtgct gatgtgcagg cagatactgg 218221 agccagaccg aattgtgtgc gaaatgtggg tctaagagtg aaggcaccaa accctcactt 218281 tttttacttt cattcagcct tcccattttt cttcaaaggg cagaatatga ctttggagtc 218341 aggtagacct gaattcaaat ccctcttctt cttcagtgag tagctaaatc tttccaatgc 218401 tttatttaaa aggaatcacg atcacacttt cagggttgtt gagaggttta gagattgtgg 218461 tgttacaggg cccctggggc tcaaaaggtg tttgctgaac ccctgtcttg tgttaaatag 218521 tcacaatagt cccaagggga ggatgaacag gtagcactgc catttcataa acaggtgact 218581 gaggtgcaga gtggcttatc cataatgaca caggaggtga atgctggagc tggattgaag 218641 gctgagtcat tggactctcc tctgagggag gcagtacaga gtagtggtca gggcttgacc 218701 aacctctaga gtccatgggc agattttctg ggcctctggc ttagttttcc tcatgcataa 218761 atggaatgtg gggattatag tagatgtact ttctatggtt gtgaaggtta catatacatg 218821 agataattct ataaattctt agcatgtgct tgagttacag taaatatgca atgaagtatt 218881 cactactgtt ttagtattat ttttattaga agccgtggca gtagttacca tcattgtcac 218941 catctccatc gatcagctct aatcaaaact gacccattgt caacagaaaa attccctgtg 219001 gatgaagtac atgttgtagg tgtagcgggc aatcgtgatc gccatcagaa attcatattg 219061 gggtttggtt tacaagtgtt ggcctctggg agtgtttcct acgtgctcct tctctcctcc 219121 cctttgagga cggatggtgt gatggttcct ctccatttgg tccctcctct gggaccagca 219181 ttctcctgcc ttctttgtcc tactctgtgt cccaggaggt agacccctgt ggactgtatc 219241 tcctggactc ccttgctggt tgacttttgt ctgagttaag ccaaagggag gtggcagcaa 219301 gagacgagaa ggtgggagga gagagagttt ggggtatttc cacctgcagc tccctctgtc 219361 catctctggc cccgctctgt gatgctagcc cccgcgaggt tgcccatctc tcagggtttc 219421 ctctcttact tgcccctgtt acgtcatttc atgggactat acctttagct ttaggtgcag 219481 aacagttttc cacagttaaa tgtctttaat tgaacatcta gaatgcaaaa catattataa 219541 gataatgtgt tggattgttt tcctttaaaa aatggtacaa ccactttgca aaacaatggc 219601 agtttcttat taagttaaac atagagttac agtatgatac agcaattcca ctcctaggta 219661 tttatccaag aggaattaaa ttgtatgccc acacaaaaaa tgaatgaata aattaagact 219721 tataaatgaa tgttcatagc agctttattc ataataacca gatactgtta aaaaaacccc 219781 agaatattga tcagtagatg aatggataag caaactttag aataatcata taatggaata 219841 ttattcagca gtaaagatga actaactgct aatacagaca cgtgcatgcg cgagtctcaa 219901 aaacattatt ttaagtgaaa gaaggcagac acaaaatagt gcatactgta cgaggccatt 219961 aatctatata gtaacagaaa gtggatcagt gtctaccaca ggctgggggt gtgtacaggg 220021 aaacttttta ggtggtgccc atgctctgta tctcaatttt ggtggtacat gcgtataata 220081 catttgtatt gtcttgtatc aaagtatgcc tcatgaagtt caggtatatg aaaatttgta 220141 tttgctttat tatatttaaa tatcagagaa tgcaaaattg taaaaagaaa aagaaatcct 220201 gaaagtcctt ttaactagag acaaccacca ccatccacat tttatctatt tcctttcagt 220261 ttttaatttt ttcctgtact taggttttcc ccttcactgt cttatttaaa taaaaaagaa 220321 agcaaatgga aagtctccca tacgaaggcc tttttgctat gagtttacct tgtctaaggg 220381 catctagcat gatggtgaga aggttttgtc ttgtataact aaaatgtatt gggaggaaat 220441 gaaaatgttt agcctggtga aggggaaaca ttggggagaa aagtgaggga gagtgtggtc 220501 tacagtcgtg agctttaaat cacaaccaat cagccaatta aatactgagc tcctgttatg 220561 tgtcagtatt aggaagacag tagtgaacaa aacatatggc tcctctgatt taatggagtt 220621 aacagtctgg gagggaaaga tatttttatt atttgcaaca acaatagtaa tagctaatat 220681 ttatgatgag ctagtgccta ttatggtgtt aaggtttagt tttttctttt agtgttttta 220741 atgtgttgtc acatttaatc ccagcaactc catgagtagg tactcttaat tctcatttca 220801 tggttgaaaa taataaagtg gcttggagat gttaagtcaa atggggtcat acagtgagca 220861 tgttaaacct agactggcag gtcaggacgg aagtgatatt taatcctcag tgaaatgatt 220921 tttttctttt taatttttat ttttattgag acagaatctc actctgttgc ccaggctgga 220981 gtgcagtggc acaatcttgg ctcactgcaa ccttcacctc ccgggttcaa gcaattctcc 221041 tgcatcagcc tcccaagtag ctgggattac aagcacccgc caccacgccc ggctaatttt 221101 tgtattttta gtagagatgg ggatttcacc atgttagcca ggctggtttc taactcctga 221161 cctcaggtga ttcacctgcc tcggcctctc aaagtgctgg gattacaggc gtgagccacc 221221 gggcccggcc tctttcagtg tttttaatgt gttgtcacat ttaatcccag caactctatg 221281 agtaggtact ctttttattc ctactttatg gttgaaaata acaaagtgac ttggagacat 221341 taagtcaaat gggaacatac agtgagcatg ttaaacctag attggcacgt caggacagaa 221401 gtgatattta atccccaaga aaataatata ttttttttct ttttttgaga cagggtctca 221461 ctctgttgcc caggctggag tgcattggca tgatcacggc tcactgcagc ctcgacctcc 221521 tgggctcaag tgatcctccc accccagcta cctgggtaca ggcatgcacc accttgcccc 221581 gctactgttt tgtgtttttg cagagatagg gttttgctgt gttgcccagg ctggtctcaa 221641 acacctgggc tcaagcaatt cacctttctt ggcctcccaa agtgctggga ttacaggcgt 221701 aaggcactct gaccagctgg aaatgatatt taatctgata tctgaaggat gataggagtt 221761 aaacaagaat caagtgtcag agaccagaga tatgaggtct catatatata tatttgctca 221821 gggcaggggg ctggttttgg gtgataatta ggggctagac tgaacagaat tagtgtttgg 221881 aagtgagtgg tggtggtggt gagggaggag agaagggtga gtcccagggt aagggatgca 221941 ggcacatcct gagaaatgag gcagaagaat gagaacttgg cttggggcat gttctgttca 222001 tctgaatggt ggtgctgagg agacaggtgg acagagaggt gtggggttca gggaggagca 222061 caggactgag agggaccagc aggaatagaa ttcctgaagt cccttggatt gacaagatgc 222121 ttcagtgaga gagagtccac agggtagaga tcagcaccta ggacagagcc tcgagaactt 222181 cctctaggat agggaagggc tggagaacag gaggaggagc caaggagagg aggaaggggc 222241 caagagggca ggatcaggac tgaggcaggc cgacttctgc gcagttgaag aactctaatg 222301 tcagagctgt ccttaggagg cagagaaccc ctcgtcctgg aagagtgcag ggagctgctt 222361 ggtgaccatc ctttggggat gtcatggaaa agatatttgc atcaatctgg ggtgagatta 222421 ggtgtctgga ttggttcagg ttcttttgtc atttgtaagc acacaaaaga actctagtta 222481 tctaaaggaa aaaaaaaact ggaaatatta tgggttggca cacgagtggt gccaggcctc 222541 agagagtaca gggcccaggg tggctctggg gatctgggtg gcaggagcga atggatgacc 222601 ttcacagggt gacaccatcg ggaggaatca gctccagctg ttttctgtcc ttgtcccaag 222661 aatcaaatcc cagggagaac agtgtgattc cccaatgttg tgaggttcag tcatgtgcaa 222721 ggggaaagcc gggcacctcg attgacagcc ccaccaggac tgcataccgt ggagaaagga 222781 tggcttccca aagaaaaaca acattccagc taccagagaa aagggagtgg acgctgggca 222841 ggcaaagcca acagacactt tttgtagtct ccatggtcct gtgtatgatt cagtgatttt 222901 tgcagtattc acttagcaag tcattcctcg ttgcagaatt gttgctagtg tttttcctgg 222961 ctggggatta agctttttcc atttgcctat tttagagtga gttgggggcc tgggattgac 223021 acgcatttgt ctctgggctt caaggtagcc cagagattat gccacctcag gaggcacaat 223081 ttggtggctg tcactccttg tcatgtcaac aaagggtgtg gcttgtggct ctgtggcctc 223141 tgctttacca gcaaggggag gatgctggta gtagcaaagg gtctaaatcg ctgttgcatc 223201 agcgaggtgg atgtgcctat cctaaaggca cctactgaga agatgcccaa gtaacagaca 223261 ctcatgggca cagtagacct gtcaaacctc atagttcctg gctgactctc tttgggtgac 223321 ctgagtcaca cgcccacgcc tgaacaccca ctgaaactcc taggtgcatt acgtcttctg 223381 caatatctaa aggaattaag ggaaggatga aactctgtat ctcaactttc ctgaattcac 223441 cagtatggac tgatggaatt gagtgctagt aacaccaagc tcctaaataa taataataaa 223501 ggcttactta taggaacgta cgaagcacaa gagaaataag ggcccccacg gcccaccctt 223561 caacttctca catttggttt gttagtcagg actcctttgg tcacaagaga caaaaaaaac 223621 aattcaagtt ggtttaggaa aagaggcatt tattggcttg tacagaggag ttgagagttc 223681 tttaggtctg gctggattta ggtgctccag catggtcatg gggcatctct tggctcagct 223741 tttctctgtg ttggcttcat tctcaaaaaa acttttcctt ggtgatggta aagatggcag 223801 atgcagcact ttgggagcct gaggcaagaa gaccacttga ggccaggagt tcgaaaccag 223861 cctggtcaac ctagtgagac cccatctcta caaaagaaaa aaaaattcag gtgtaatggt 223921 gtgcacctgt agtagctact agggatgtta agtgggagtt gcatgaatcc aggtgctcaa 223981 ggttgcagtg agccataatc gcaccattgc actccacctt gggtgacaga gtgagattct 224041 gtctcaaaaa aacccacaaa aaccgccccc ccaaaaacgg atggcagatg caaatttatt 224101 ccctaccagt ttaacaactc tgctggaaaa taaaccctct ttcctaataa ttccagcaaa 224161 agtcctgaag ctgattctct ttgggtgtcc tgagtcacac gcccatgcct gaatccccac 224221 tgtggcccag gtcccatgtc cacccttaga gccagaggat gagggcagcc ttagctggcc 224281 cacacaatca gtgagccagt gaagagcgat tcctcacagg agaagcatgg aggatgggtg 224341 ctgtcaggca gaaacaagac atgccaccac acttggtttt ggggttaggc ctgcttctgg 224401 tctgagcaat ctgctgctgg atatggctgg tgagtgaaga gatattgatg tgtcatcccc 224461 ttagaccatg gcattgtgtc ctgctctctg catctttggg ggcctgatca aaagcgggaa 224521 gtcatggggt caaggacagt ctccacgtgc cacccaaaca ggaaggatgt ccccagcagg 224581 tgttctgctc agggtctggc tctcccaggg ttgttactgc agagggaagg gtaggggcag 224641 ctttcttctg gtggatttat ttacttctca ggcctgttct tttcctcatg ttggctcaac 224701 tgttttatct tgcttagtta aataaatttg gtaacattat aaaggaaaac agtcttttcc 224761 ggcagttgct gaccactgca aaattactgc tctttctttt gtttttcctt tttagtgctg 224821 ggctgtgaat aagcaaatga agtttgctat ctcttcctcc cctcttccag gaagaaataa 224881 tcaccatcag aactttaaga gtttcccata agactgggca aattagagga aggacctcta 224941 tacggtggaa cagatgcaat tgtgcaaaca gaatggggtt attctctaca tatggagatg 225001 gaacagaccc tcccagatgt gttagtgtgg ggaaaagcag gtggtggagg tgcgttcaga 225061 gtagcagtgt tcagctctcc caagcacatt caaatcaccc aggagctttt aaaacgtact 225121 gatgcctgcc caccaccctc tcctcacagc cccattaaat cagaatctct gggagtggga 225181 cttgagcact tttcatttta aaagctccct ggtgactcct ggggagccag gactgagaac 225241 tagtgctgta gaattgtata tgggcaaaac agcagggatc tagataaaaa aagaaaactg 225301 aagtagggaa tatctcagtc cgtacctaca gatagcctca cacatgtggt cgcgcataca 225361 gagaacgtct ctagaaggat acacaggaga ctggaaagaa cggttctctc cagtgatgaa 225421 gtacagtggt gattctcaac tgtatactcg tttgtaatgt tggaattttg aagccacaag 225481 cataaatgct cactgtgtac tgaaagctgg ccagacaggg ctgtaagatg acccctgaac 225541 tctagcttgt ctgtgttggg gccggctgcc cacagattca cacaaaccat ctccctgccc 225601 gtggtctctg cttgccctcc tctactcagc tgcaatgagt ggtttccgag tgtgggagag 225661 accagaagag ctcattcatt cattcattcg ctcacatttt tagtaagaat taggtataag 225721 ccaggcacgt ggtaggcacg tgaatctgat ctcaccttca caaccgtatg tgaaattggt 225781 ggtcttatct ccattctaag attggggata ttttaacaca gaaattcata caaccaagct 225841 gtaaatattt gaggccgaat gaccagctaa gagagcttat aaactggcag cacttagatg 225901 tgtttggatg gcctgtacag aactttttaa aaaaatccta tttcacgact ctggaccagt 225961 gaacccaggg ccactgttgg ctagagctga gtgtcccttg gaagggcata tgcccctcag 226021 ttcagcgcag tccccccata tcccttccct tctgtgtcac accagccttg catcactcag 226081 tcacaggcct agcctctgca ggcatttgcg tttgtcaccc ctgtgttaag gtaggtagga 226141 tgagtgaact gtcgtctgtc ctgcagggat gctgagctga agtgtcctgt ggaatgtgcc 226201 atttctattc caatctggcc ctctgtctgc tggctggaaa accaagctcc tttgctttaa 226261 gtttcatttg tggactgcat gggatgggct gtgggtattg atttgtttgt ggaaaagatg 226321 tgcacaatag ggaatgtgta aaatacatca tggaaatgga tggttaatgc cgcctggctg 226381 gagtgtgggg cagtggagtg acaagagatg agagtcaagt tagatggtgg ggtatactgt 226441 gggtagatgc cgggctaagc attttgaact ttaaccttta ggctactgct gtccaataag 226501 actttctatg gtgctggaaa tttcttatat ctgtactatc caatgttgtc tccagacgca 226561 tgtggcatca agcacttgaa aagtagctgg catgaatgaa gatttttatt ggcattttag 226621 attttattta attaatttac ttttacatag ctccatgtgg ctggtagcta ccacactggg 226681 cgctgcagat ttaggcaata gcagtggatg tgttggagca gggaaataat atcaaagctg 226741 aatcttcaga aggtgatttt tccaggagtg tacaggccgc atggggtgag cagaagtagg 226801 tagggataat attgacactt cttagcatct attaggcact tcttgtgtcc acacactgtg 226861 ctaagataac tcacttattc ctcataacaa ctctatgagg aaagtactat cattgtccct 226921 attttataga ctagaaaatg agacacagca aagacaatgg gcatattcaa ggtcacacag 226981 ctaatcaaca gtggcacacg gtgtagaatc caggcagctc agctgtagag cttgtaccat 227041 gaatccttcc atcctggaag gaaataaatg aaccttcatt gagaatatac cttctgctgt 227101 cccctggtgc tgggcattca cacgtaaaat ctcatttaga tttttgttaa cactcatgtg 227161 ccattcggtg gatttagatc ctttgtgttt atggaaactt gggctcatag ggatgagata 227221 tcaaggcaac tgcaatctct gtggacacca gttaaggctt tattgtaaaa ttccatgcct 227281 gtgggggaaa tcacaggctg tgggagtcgg ggggaatgta caggaagata tggcataaaa 227341 ggtgttgcga aggaagaatc agtgggattt agcaactgac tgaatggcgg gggtgaggaa 227401 agcgggggag gctttctcct ggaacagtga gatgaatgac aaacacacat gtaactgcac 227461 tgtgtgagtt gagagtgtgg gctctagagg aagacaaact ggtttcaaat acaagtgtta 227521 acacggacta gctgaacaac cttgggcaaa ttacttctct ccgagcatca gtgtccctgt 227581 taatataacg aggaccgaag gcaggaggat cacttgaggt caggagtttg agaccagcct 227641 gggcaacata gtgagatccc gtctctagaa aaaacttaaa aaattagcca ggcatggtga 227701 tgcatgcctg tagtcccatc tactcaggag gctgaggcca gaggatcact tgagcccagg 227761 agtttgcggc tacagtgagc tgtgattgca ccattgcact ccagcctggg tgacagagca 227821 agactttgtc tctataaaag atataatgag aatcatagca ccttcctcac aggactgtga 227881 tgaggctcag gttaagcacg gatatgaaag ccctttgcaa actcaatagc atgaccctgt 227941 gagtcactct tcttagtgct ggtgagacca gccattgggc agaaatggtg accagaactt 228001 tctgtctttg atcatggaaa cagttcggga aattgcctgg gggaacccca cttcatctta 228061 ggtacttggt agagtttaat gcggataagt ctgtgggtca gctttgacct aggaggaaag 228121 agtggttctc ccagcagctg tagagcaaaa tctgcatgcc agtggctcca acttaaatgc 228181 ccttttccac caaacagttt gcatgtgttg tcttatttga gctgcacaac aaccttatat 228241 acataggtac tgtgataaaa taaggctcag agaggtggag acactgagct aatgtccttc 228301 agcttatagg tagtaaaatg agtgttcaaa cgaaggtcct cctgcttcca gaaacttagc 228361 ctgtaattcc tgtaggtgat gggcttctca tttcttctcc taccacagat atctgcaact 228421 agcgctagaa tttctatccc agattgacat ggatttaagg gtaaaggttg cttcctaaga 228481 atgtcagttt tccactatag accccagtaa gaattgtgta caagagggga atcgactcca 228541 aacagaatat atatatatat atgatttaat ttagtcctct ttggaatgca atgagtaaat 228601 tttagtggtt agagtaaacc actaaaattt gggagttagg ggtgaagtgc agtgtgaagc 228661 agagttgaga tctcagtgct tttcctgggg cagaaacatc cgagagggcc cctgtcaccc 228721 aagcttttgg ctacagtcta ctccttccat ctgggattaa ccaggcgttt cttatccttg 228781 ctgatctagt gtttcttatt gctgctgtaa caaattacca caaatttcct ggcctaaaat 228841 ggcacacatt tattatctta tacttctaga tgtcaacatt ctgaacagca ggtcttccgg 228901 ggctgaaatc gaggtgtcag cagggctgcc ttccttctgg aggctctagg ggacaatctg 228961 tttcttacct tttctagagg ccacccacat tccttggctt gcgttccctt ccagaaatgg 229021 cacaattcca acctctgctt ccatcatcac gtctcatcct ctgactccga gtctcttccc 229081 tccctcctat aaggacccct gtgatcacac tgggctcatc cagataatct agtataaccc 229141 cctttctcaa gatccttaat ttaatcacat ctgcaaaatc aattttgtca tgtaagtaat 229201 atattcacag gtcccaggat taggacacgg atatctggtg gtggtgtcag ggggtcgggg 229261 gtaggggatg ggaacattat tctgcctaca gagctggcag gtggcctttc ttctttcctt 229321 tcttttcctt tcctttcctt cccttccctt ccctttcctt tccctttccc ttcccttttt 229381 tccctttctt ttctccttcc cttccttttt tccctttctc ttccccttcc ccttcccttc 229441 cccttccctt ccccttccct ttcccttccc cttcccttcc ccttcccttc cccttccctt 229501 ctcttccctt nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 229561 cccttccctt tcccttcccc ttccctttcc cttgcctttc ccttcccctt cgcttctgct 229621 tcccttcccc ttctcttctc ttttcttttt tttctggagt cagggtctca ttctgttgcc 229681 caggctggag tgcagtggtg ctaacacagc tcactacagc cttgatttcc cagcctcccg 229741 ggttcaacca attcccccca cctcagcttc ccaagtagct gggaccacag gtgcccacca 229801 ccaagcctgg ctaattttaa aaatttttta gagatggagt cttgccatgt tgcccaggat 229861 ggtcttgaac tcctgggctc aagcaatcct cctgcctcga cctcccaaag tgctaggatt 229921 ataggtgtga gccactgcac tcagctgcag gtggcctttc tgatcttgcc tgtaataagg 229981 gagggtggga ggtggataga gccggagaca tctttttcat tctctgctgg agctgagtag 230041 caatttcccc ttttaaacag gccatgtgtg cagtccccta cctggccgtc agacattttc 230101 attgctcacc tggcccctag gcatttgagt ttcagacccc attttgtagt tctagtaaag 230161 ccttgctata gactggggtc agagggctgc aggctttcaa cctttgtaga gcaaggggtt 230221 attgaataaa ttatgaagca tctggagatt ctgggctctt aactgggtac ctgtttgaaa 230281 gtagggtcta gtacattttt ttcagtagct cagtaaatgt tgcctcttct taaaaaaaaa 230341 aaaaacagaa aacaaaaact ggtcttactc tgaatttgcc aaatagaaat gagggggcgt 230401 ttatttgtgt cacaaaagga tttgatttct ttaaataggc agccctacta ttttcttcaa 230461 agtgttttag ttaatgggaa atttgatttg cacaactttt cagaaacaca ctgtgtaaag 230521 gaattacctg caccagaagc tcatgaggtt tttcaaagtc tgttctcagc ttctctgcac 230581 atcatgctgg ggttcacttg gcctattgat tctgcctcag tggggatgga ctattcattt 230641 ctgtgggtaa ttggaagcct acctaaagtg accttgtcaa tggttggatt gacaaatatc 230701 caggctggtg ggatgtcagg cgatcatcca ggagctacat ctaccatgaa gttggttatg 230761 ttttctcctt tttctatgac tgatgatatt cattttttgg ttcattcatt caccaaacat 230821 ttatttactg aatatcaatt atgtaccagg aattgtggta gacactagga gactagagat 230881 gagctatctt tatccccttc tgtctccaaa ggggataaag atgaataaga taataattga 230941 tttttttttt tgagacagag tctcgctctg tctcttaggc tggagtgcag tggcgtgatc 231001 tcggctcact gcagcctctg cctccctggt tccagtgatt ctcctgcctc agtctcctgg 231061 gtagctggga ttaccggggc ctgccaccac actttgctaa tttttgtatt ttttgtagag 231121 aataactgat attttagatg tttggaagaa aggttgtaga ataaagcacc tacttgaaaa 231181 tgggggaaac tactttccta aatatttctg ggtcaaagaa taaattaaaa ccatactaat 231241 agaccactta gaaacttgat acaatgatgg gactaaatat aaaacatagg atgcagccaa 231301 aactgtactt aaaagagggt ttttagccat aacatcttta aatgtttaat gaagaggaat 231361 gggaataaat taattgagca gtcaattcaa gaagctttaa aaagaacagt ggataatatt 231421 agctgacatt tgtagaatag tcatgatatg gttggcatgg ttctgggcat ggaataactc 231481 atttagttct gacaacacca ctgtgatatg ggtcctattg ttgctgtcat tttccagaga 231541 ggaaactgaa gtacaggaag atgaggtagt gtacctagtc ttagagtttg caaatgatag 231601 acacaggata tacacctgat catcagactc cagaattcac attcccaacc aataagtaat 231661 acaaatacaa gtaagcaata caatcacaga cacacagaac aaaaaaagtt aaagtagcaa 231721 ttaataaagt aggctaaaac caggtagagt tgataaatat aatcataatc tgcttcttta 231781 acatgataga aagataaaaa tagacaagaa agaacacaaa cttctggaaa aaaaataacg 231841 agagactacc taacaatgca atatgaagaa agaaacaaag aaagagaaaa ggtgtaatat 231901 catcccagat aagatgctac aatttaaaca cgttgagaag aacactgtgg ataatttcca 231961 aggaaaatat aaattccaaa agtttgtttg agtagaggtg aaaaactttc ctgtgtggtt 232021 gaagtgtaga gagggagagg gaggtgaagc tggggaaagc tcagggatgg ggcgagacta 232081 tgctgtcatc cctttacttt catgttgaat aaaagggaaa gttaaacatg gaactacgac 232141 agggggaaaa gagagaataa taaaaatatt atttggggta atacggtttt ctatctggaa 232201 aactcaaagg aagcaactaa aactccacaa aagctatttg agttttttta aaaaatgctt 232261 taaaataaat atgctaacaa tctattgctt tcctattgtt tttgtttgtt tgtttgtttt 232321 gtgttttgag atggagtctc gctctgttgc ccaggctgga gtgcagtggc atgatattgg 232381 ctcacttcaa cctccgcctc ctgggttcaa gcaatctact gcctcagcct tccaagtacc 232441 tgggactact acaggcacct accaccacgc ccagctaatt ttttttgtat ttttaataga 232501 gacggggttt caccatgttg gccaggctgg tctcgaactt ctggcctcaa gtgatctgcc 232561 tgcctcagtc tcccaaagtg ctaggattac aggcatgaac cactgcgcac agctatttgc 232621 tttcctattt ttaaacaatg accagttggg aaacaaaatg aagacatgtc aaatttacaa 232681 tagcataaaa aagaatccct tagatgatgc taaagggaaa atgtgtagga cttatatgga 232741 gaaaataaga gttcaacaag gggacaatca tttcctgatg gaaagactca ttgttgtcaa 232801 ggcatcaatt atcttgaatt taatttctaa atttaacaca atttcaatca aagtcccaat 232861 cattttttgg aggctgaggg ttgactgaat acaatcccaa taaaaattcc attggtattt 232921 ttaaaatctt gataaactga ttctaaagtt tacttggaag aataaatatg cagtgatagc 232981 caagaaaatg ttgaaaataa ggataatgag ggtggacttg ccctgccaag atataacata 233041 ttacagagct actataatta aagcccatat tgctatacta gaagacagat cagtggcaca 233101 gaatagaagg cccaaagaca gaccccagta tatataatta ttaaacatat aataaagttg 233161 gcatttgaaa tcagtgggga agggatgaat tattcaataa aggatgctgg gacaattggt 233221 tacctctttg gaaaaaaatg gcaaattgga ttcccatttc acatcagatt ccaagaaatt 233281 agacatggat caaagagttt aaagtggaaa tgatgccata taaaagtaca aaaaaactcc 233341 aaagagtaat atttatatga ccttggagca tgaagaaggc tgtttgaggt gtacttggac 233401 tgtctttctt tgtccagttt tcccttttag tggtttaaaa ggaatacata gtagttttta 233461 tatttttatt tatttatgta tttatttttg aaacagggtc tcattaggtc acccaggctg 233521 gagtgcagtg gtgcagtcat ggctcactgc agccccaact tcctgggctc aagcgatcct 233581 cccacctcag cctccggagt agctggactg caggcatgca ccaccatgtc tggctaatct 233641 tttatttttt gtagagatgg ggtctcacta tgttgcccag gttggtctta aactcctggg 233701 ctcaagtgat cctcctgcct tggtgtccca aagtgctggg attacagatg tgagccacca 233761 cacccggttt ctattcttat ttttaatact taattgttaa cacgtatatt taaacttata 233821 atgtcctata tgcaatttct caatatcaat aatcttttga ggaatataag actcagcaca 233881 ttttcatctt gtaaccctcc tgcaactccc atttcttatg acttgggttt tctctacttc 233941 tctatctctt cctctgttga aaacttttca tattatggaa aaaattaaac atatataaaa 234001 gtaacattag tataatgcat ccttgtgtat gcatcaccca gttcaataat gacccatcat 234061 gatcaatttc tttttcatct gtaatcctcc cctttctctc cccaccttgg catactgaat 234121 tattttaaat tattttatcc ataaagatgt taatgtatcc tcaaagataa ggattctata 234181 agcaaagaaa aacaagacca ttatcacacc aaatattttt cattccctaa tatcagtaac 234241 aataattccc ttatgtcatc agaaagtgag tatatccatt tccctgattg ccttacagtt 234301 ttttaaactg ttggtttttt tccacttaga atccaaacaa ggtccacatg ttaaatttag 234361 ttgatatggg ttgataggtg cagcaaacca ccatggcaca tgtatacctg tgtaacaaac 234421 ctgaatgttc tgcacgtgta tcccagaatt taaagtaaaa ttaaaaaaat ttagttgata 234481 tgtttcttaa attcctttga aaatatacgt ttctcgtcct attttctccc cacttgcagt 234541 cctctcttgt agtctatttg ttgaagatac catattcttt gccttgtagt tttctacatt 234601 ctgggtattt ggttggggtg ggggggggtg tgtttgctga tcgcatccct gtggtgtcat 234661 ttaatatgtt cctctgtcac ttgtattttc tgtaaatctg tatttgcatc aagagacttg 234721 aaccaatcat atttgattat ttgacaagtg tacttcatag gtagtgaagt gagctttcct 234781 ctgtaggcaa gtggtatctt tttgtgacct tagcaaccat agatgaatgt cacctagact 234841 cattgtgtca ctagggatgg ccttgaggtt tcctgttcta ttttcattta ctaagcatgc 234901 actttagaaa ggctctgagc tgctccctga agcatggcac ccgtcctcca gaaacccgca 234961 gtctagaggt tgtaaaagaa gatttgaaca ccaagactgt aaaagatgat ttgaacaaca 235021 acaacaacaa caaagcagtt ttattttgta tacattcttt cattgtacct cccccagagg 235081 gatataacct ccaaataggg gagccatacc atagatgcat taaattcaca ccaattcaaa 235141 tgtgtgtata tatgtatttg cttgcagact actaaagtga caatgtaaac gttgctgcct 235201 atttttccca ctcttctatt agattgacgt tgcttttggt gagatggggc atgagtccac 235261 catcctgtcc cctctgtttc cacatgaccc tcatcgtatg aacacctctc ttaagtgtga 235321 ttctaggatt taaagataca tatttcttat gcattatgca cctaaacaca agccaactac 235381 ttcatctggc acctgatcca agaaacaaca ttctgttcca aaatcaatag aagacaaggc 235441 actttaaggg atcttcaagg gagtgtaatt ttgagcatat ggaatctttc taaccctagc 235501 taactttaga agtttaactc ctatgcaaga tgaagcatct atgttcccaa gagtttgact 235561 tcagcaaagg acaagctgca gaagctgaca ttattgttta ttttggcata aaggctttac 235621 ttgttggtga agttcaagtt taatgatctc ttcctccctt tctctctctt cctcctctcc 235681 gctttccttc ttcccttcct cccaccctga ttttctcttt tgcagtttgc tgctttgtgg 235741 tgcacaagcg gtgccatgaa tttgtcacat tctcctgccc tggcgctgac aagggtccag 235801 cctccgatgt aagtaatggg catcgattgc ttttctctgt ccacagtcaa tgctgccttg 235861 tgattaaatg tgagtgagca ttcttagacg agtaagtgtg gacgatctca ctctaaggca 235921 tgtggtggtc cctatagctt ttgagacagt tttcctttta gtggaaaaca aaattttgga 235981 gggagtgtgg ctggatagga agtcagctag tttgtctact atgattttgc taatgtcttc 236041 actttgagag ctggagcacg gagggtgaga ggcaggggcc tacttggtag tgtggctgca 236101 aaaaggtagt ttcggtggta ctttgcagtg cacgttatgg cactttgtta gggtgttctt 236161 tcttttgtta ttgttattat tatttttatt atttttcaga tggagtcttg ctctatcgcc 236221 caggctggag tgcagtggag caatctcagc tcactgcaag ctccgcctcc tgggttcatg 236281 ccattctcct gcctcagcct cccgaggagc tgggactaca ggcacctgcc actatgcctg 236341 gctaattttt tgttttgtat ttttggtaga gacggggttt caccgtgtta gacaggatgg 236401 tctcgatctc ctgacctcgt gatccgccgc ctcggcctcc caaagtgctg ggattacagg 236461 tgtgagtcac tgtgcccggc ctattattat tattatttaa gagtgttgcc caggctggtc 236521 tcaaattcct gggctcaagc aatcctcctg ccttggcctc ccaaagtgtt gggattccag 236581 gcgtgagcca gtgcaccctg cttgttaggg tgttcctaat atcagggccc aggaatgagg 236641 aggaaatgaa tttatcctgc taccatgaat ctcaaagtat gactactgct agtgggacaa 236701 gagatgcttg taggtaaaat gtgcaaaaaa tttttgtttt aatcattata aatgtatttt 236761 aatgtggatt gggaaaaatg tatctagtag tggttttgca tttatgataa ggatatagaa 236821 tttcctacct aagtaaagat ttttaggtta aacaaaagat aaactgattt gaggacaata 236881 ttaagtgaat gacagtatgg gtgatataag gatatggcaa aagttttggt acaaatgtgt 236941 ttaaagttta gggtacattg ttgattggcc tacccagcag ccattcacag ccttcatcct 237001 tctggcagag gctacctctc atgatggaag ctaaaaatgc ctaatacttg cttttcccgg 237061 ctctcttgta gctgctgaat gaggatgtga cccaatcctg ctaatgggaa ctgaggggaa 237121 gacctgggaa agagtctagg aaagattttt cctttctgat aagagagaga catcccttcc 237181 cttgtcctct tgggcatttt tgtacaagga tataatgctt ggagttgtgg cagccctttt 237241 gggtcaatga gggacacatt gccgacatcc tgagcatggc agagtggaaa acttaaaaga 237301 accgtgggtc catgatgatg ctgatgagct gttgatacca ctcacagtcc ttttatcccc 237361 aagctggata gatgggagaa gaagcttttt ttttggccag gcacggtggc tcacgcctgt 237421 aatcccagca ctttgggagg ctgaggcggg cagatcacaa ggtcaggaga tcgaaaccat 237481 cctggctaac atggtgaaac cccgtctcta ctaaaaatac aaaaaattag ccaggcatgg 237541 tggtgggcgc ctgtagtcct agctccttgg gaggctgagg caggagaatg gcgtgaacct 237601 gggaggcaga gcttgcagta agcggagatc gcaccactgc actccagcct gggcgacaga 237661 gtgagactct gtctcgaaaa aaaagaaaaa aaagaagctt tttttttttt tttctttttt 237721 tttttaagca aagtcttact ctttgaagcc ttctctttta tttttgtttt tttctctttt 237781 gagacagaat cttgctctgt tgcccaggcc agagtgtggt ggcatgatca tgtctcactc 237841 actgcagctt aacctccctg gctctagcaa cccccccacc tcagccgccc acgtagctgg 237901 gcctacaagt gcatgccacc acacctggct aattttttat ttgcttgtaa agataggttt 237961 tccctatgtt gcccaggctg gtctcgaatt cctgggctga agccctcctc ctaccttgac 238021 ctcccaaagt gctagaatta caggcatgag tcactgtacc tgaccctggg ggcattcttt 238081 tacatgcagc tgaaactgtc ctagctgaga aggctttcca ccatgcacag tctttcattg 238141 cccaataatg aactgcattc agcatacccc aatgcagcca tcacccctag ttctcttact 238201 ttgtttctgc tttgaatccc cttcttgaaa tttctcccat tggctttgtc tccattactg 238261 gttctgttac ccacttagac ttgacagctg acagcttgcc caaagtgagt agtgactatc 238321 ttctgtcatt ggatttcttt agctcctgct ggtcttctgc aaatggtcat ctgccaattt 238381 gctgcaaact ggggtgggag tttgtgggga gggcatatct gttgccatag caacacaaag 238441 atgctggagc atatgttgga tattctttgc ttggaaaatg tggtaattga ggttaagaac 238501 tccaagaaac atggaacaac caaagtcctt ggaaatctgt ttgcaaagac aaagtttgag 238561 actgtttgtc atgttctctc agatcctatg ccctagccct tgtagtgcaa tttcttctct 238621 cctccctagc aaaataggca ggggagtttg gggagactgc cctactcagt atgcagctct 238681 gagtgttcat gcagactgaa acttgatact caacttgcag ccaagaagat ggtgtttccc 238741 atattctatc tatctgctga atgtttctgt tgcctaagac agaaactcag ctcaaactga 238801 cttatatgaa aaggaaaatt ttgggtcctt gaaatagaaa aggcctaagg gttgacttgg 238861 cttcaagcac agtgaatcaa agttctaaat aatgtcctca agaacctgtt tctccccttc 238921 ttggcttagc ctttgtctgt gttgatttta ttttcaggaa ggcttgtctc ctatgcgagc 238981 atggtggctg caacaccttt aatcttatat cttatcagtg ctgcagtgca gtaacaagag 239041 ctaaagtctc attagaccat gaatcacata ttgccagaag aatgggttat gctgattggc 239101 caggcctggg tatatgctgc tgtggtgtga atgttttgtt tctccaaact tatatcctga 239161 aatcctaaca cccaaggtat ggtgttagaa ggtgggcccc ttgggaggtg atcagttcat 239221 gggggaagag ccctcataaa tgagatagtg cccctataaa aaagacccca gagatatccg 239281 tcaccccttc taccatgtga ggttgcaatg agaagacagt tgtgtatgag aaagtgagct 239341 ctcactggag accgaatctt ccagtgcctt gatcttgtac tttctagcct ccagaactgt 239401 gaggaataag tttatcattt aaaaatcact caggttttgg tattttgtta cagcagccag 239461 aagggactag gacagaaaat tttggtacta acagtgtata gaggaacaaa attttaagga 239521 tgggttttct gaattggttt cggggatttg gtaattggct gccaaatctg attagattta 239581 agaatgctaa ttaccctatt tctagtagta aagagagcct ggatagtccg tggcatgatc 239641 tgtttataga gatacacaaa tcatctgcat tggatactcc taatcaacca tttataagaa 239701 gcaaggagcc aagtgattcg atatatgata cccaaacact tttggaaaac taaggcatat 239761 aatgacttgc tcctaatgtc tctggacaaa gtggagaaag aaaaagatga gctcagggat 239821 ttgaattccc agctcaaaga gtttcataaa tgatctaaga gcttctaggt gtgccctgaa 239881 ggagagcctt ctagcctgta gttgcagggc tgaaatttct gaaaattaaa tgcacaacct 239941 catcctgcaa ttggctgaat tacaatgcaa attggactcc ccaccacaca gggtgtctgc 240001 tctaaaagca agggcaaggg cattgatcag aaaagaatgg gatcccataa ggtgggaaga 240061 ctctgatgaa gaccctaatg aagctgggga cattgagccc ctaaattctg acaagtcttt 240121 tttttaccag tggaagtggc ctccttacct caagcaaaag tgtcatcccc actcagggtg 240181 gcattggcct ttccatcttg tctaagggaa ctaaccctgc attgccctaa gaaacagtaa 240241 tggctttccc tgaggcagtt gccatgccag gcaatgctga ttcttgtcag caccgaatcc 240301 caccacccct tcaagtccta gcaggctcct aaacgtgaga tgcactgtgt gacccacggg 240361 gaggtatgct ataccccaaa agaactattt gagttttcta atttatacaa gcagaaccat 240421 agggaatatg catgggaatg atattaaggg ggtaggataa tggtggaagt aacatgaagt 240481 tgaatcaggc tgaatttatt gatagaagtt cactaagcag agattctgaa tgtaatgttc 240541 cagcttaagg agttaggaag ggctctaaca gtttagctgg ttggctgaaa catggattaa 240601 aagatggcct atcatgagtg agttggagat gcctgacctc ccctggttta atgtagagga 240661 aaggattcaa aagcttaggg agattagaat gccagagtgg atttgtcatt taatacccgc 240721 tcacccatgc tgggaaggtc agacatacct ttaagaaagg tatgtctctg tcaatacttt 240781 gagaaatata tttgtgaggg gagccccagc gtcctttaag agctctgtga tccctcttct 240841 ctatagacca aaccttacta tgggaaccac agccacttaa ttggaaaact taaatgcaat 240901 gggaataatt gggtcctgga gtatcagggg ccatttggtg gcactcaact gttaaaggca 240961 aggtaggtgc ggttacctta atggatcgca gagttacaac aacaatcgga atagcctgac 241021 tcatgcagac ctatggcact gcctaatcat ggtgttgcta gaagtgaaat aggtgggaag 241081 cctactaaat tcttacttga tctgtatagc agaaaaattc taggtgaagt gaacagaagt 241141 ctaacttgaa tcataaaaat agagaatcat agcccctcag tccattccca gacttgagcc 241201 agtttataga cccagaaccc tttgaatgaa agggaggtaa aatttccttg agaaagcgat 241261 tatactgcta aaagtttatc ctgttaatct ttctcccagc cttgcccaaa gggacctact 241321 gctttttatc aggataactg tgcactagag aaaaggaaat gatgagacct tttggggact 241381 actgaacact agttctgaac tgacactgat tctaggagac ctaaaatgtc acttgtccct 241441 ccagtcagaa taggggctta tggtggtcag gtgatcaatg gaattttagc tcaggtctgt 241501 cttacagtgg atccagtgag tccccaaaca catcctgcag ttatttcccc agttccagaa 241561 tggataatta gaatagacat acttagcagt tggcagaatc cccatactgg ttctttgacc 241621 tgtagagtga ggcccactat gatgggtgaa gccaagagga agccattaga actgccccag 241681 cctaggaaaa cagccaacca agagcaatac tacatcctta gagggacagc agagatcagt 241741 gccatcatca atgacttgaa agatgcagag gtagtgattc ccaccatatc ccttattcag 241801 ctctcttatt tggcttctgc agaagacaga tgaatctaga agaatgtcag tggataatca 241861 taagcttaac caagtgatca ctccaagtgt agctgctgta tcaaatgtgg tttcattggt 241921 tgagcaaatt aattcatcct ctggtagcta gtatgtagct attgatttat caaatgcctt 241981 tttctccatc tctgtccata agacccacca gagcagtttt attgcagctg gtaaggctag 242041 caatacacct tcactgtcct acttcagggg tatatcaact cttcagtcgt atgtcataat 242101 ttagtttgca gggatcttga tcattttttc cttccacaag ctatgacaat ggtcctttac 242161 attgatgaca ttatgctgat ttgtcctagt gaatgagaag tagcaactac tctagacatt 242221 ggtaagacat ttgtgtgtta gaggatggga aataaatcca actaaaattt agggaccttc 242281 ttcctcagtg aaatttctag gggtcagtgg tgtgtggcat gttaagatac cccttttaac 242341 gtgaaggata agttgttgca tctggaccct tctacatccg agaaagagac agatggccta 242401 ctgggcctac ttggattttg gaggcaacac attcctcatt taagtgtgtt atgctggcct 242461 acttaccaag gggccccaag agcttctggt tttcagtagg gcccagaaca gtagaaggct 242521 ctgcaacagg tccagactgc tgtgcaagct gctttgccac ttgggccatc tgacccagca 242581 aatccaatgg tggttggtta gtggcagaga gggatactgt gtggggcctt tggcaagcct 242641 ttataggtga attgaagcac aagtatttaa gattttggaa tatagtcctg ccataatcca 242701 cagatagcta gtttcctatt aagagacagc tcttggcctg ctaccgggcc ttcgtggaaa 242761 ctgacgattt gaccatgggc cactaaattg ccgtatgacc cgaattgctc ttcattaact 242821 gggtcttatc taaaccaccg agccataaag tgggcatgca cagcagcact tcatcatcaa 242881 atggaagtgg tatgtatgtg atcaagcctg ggcaggctct aaaggcacaa gtacgataca 242941 tgaagaaatg gcctggatgc ctgtgctccc cactcttgct accctgcctt ctctctccca 243001 acttgcacct ttgggctcat gaggagttag ttccctatga tcagttgaca aaggaagagc 243061 agactaggac ccagtttaca gatgtttctg tacggtgtgc agacaccacc cagaagtgga 243121 cagctacagc actgtagtca ctttaggaca tccctgaggg acaatagtga agggaaatct 243181 tctcggtggg cagaactttg agcagtgcac ctggttgtgc actttgcttg ggaggagaag 243241 tggccataca tgtgactata tcctgattca tgggctgtag ccaattattt gactggctgg 243301 gcagggactt ggaaggaata tgattgaaaa aattggtgac aaaaaagtgt ggggatgagg 243361 tgtgtggata gacatctctg agtgggcaat aaacatgaaa atatttgtgt cctaggtgag 243421 tgcttaccaa agggtgacct cagcagagga ggattttaat aattgacttg atagaatgac 243481 ccgttctttg gataccaggc agcctctttc cccagacacc ctgtcatcac acaatgggct 243541 catgaacaaa gtggccgtgg tggcagggat ggaggttatg catgggctca gcagcataga 243601 cttccactca tcaaggctga cctggctatg gccaccgctg agtgctcaat ctgccagcag 243661 cagagaccaa cactgagttc ctgatatggc accatgccta ggggttatca gccagctacc 243721 tggcggcagg ttgattacat tggacccctt ccattatgga agaggcagca ttttatcctt 243781 atggaaatag acatgctgga tacagatttg ccttccctgc ctgcagtgct tctactgaaa 243841 ctgccattca tggacttaga gcatgcttta ttcactgtta tgaaattcca cacagcattg 243901 tgtctgatca aagaaatcac ttcaaagcca aagaggtggg cagtgaactc acactcatgg 243961 aattcactgg tcttaccatg ttccccatca ttctgaagca gctggcttga tagaatggtg 244021 gaatggcctt ttgaagagtc agttatgttg ccagctaggt agcaatacct tgcatgcctg 244081 ggcaagattc tccagaaggc tatatatgct ctgaatcagc atctagtata tggtgctttt 244141 tctctgatag ccagaaattt atggatctgg gaatcaagga agtggaatgg gagtggtacc 244201 actcattatt acccctagta attttaccac cagcaaaatt ttgctttctt ttcccatgac 244261 tttatgctct cccagcatgg aggtctcagt tccagaggga ggaatacttc cagcaggaga 244321 cagaacaatg attctattga tagttaacac tgccacccag ccactttggg gttcccatgc 244381 ctctgagtta acaggccaag aaggaagtta tgaagttggc tggagtgatt gatctggact 244441 actaagggga aattggacac tactctacaa tggagaaaag gaaaaatgtg tctggataca 244501 ggagatccct aaagacatct cttagtatta ccattacctg tgattaagat caatgaaaaa 244561 ctacaacaac tcaatacagg caaaactatg aatggcccag accctttaag aatgaaggtt 244621 tgggtcaccc tgccaggtaa ggaaccacca ccagctgagg tgcccactga aggtaaaggg 244681 aatacagaat aggtaataga agaaagtagt tacaaatacc agctatgacc gtgtgaccag 244741 ttacagaaac aaggactgtg attggcatga ttattggcat tattttgtta tggatatgtt 244801 tgtgtgtata tatacatatg gtgagcaaat atctttgttt tctttgctct tttagttctt 244861 tatcatgtaa cgtaagatgt attgacttta tatcagtatt tttatgtctt agtatttaag 244921 ttataggata ccaggacaag aggaatcatt actcaaggac ttcatctcct tttctgggga 244981 ggggattatt gcattttcag ttgtacaaag tctaattgta tcacatcagg tggaattatg 245041 tccttgttat tctttatttg gaggttcttt atttaagaag gtgtgtatgg atgccaagtt 245101 gagaaagggt ggacttgtga tggttaattt tgcacatgca gttaaccttg actgggctaa 245161 ggggggtacc cagatagctg gtaatacatg atttctgggc atgtctgtga gggtgttttc 245221 aagaaattat catttgaatc agtagactga gtagaaaaat tgccttcatt gacataggca 245281 ggcaccatct aatctgttga ggactcagaa caaaaaggca gaggaagggt aaattcactc 245341 tctctttttg agctggaaca ttaatttttt cccgcctttg ggtgttggtg ctccaggttc 245401 tcagtccttc agacgtggac caggacctac tccattggca cctatggttc tcaggtcttt 245461 gggctcagat ttcaaacatt ggactaaact atacccctgg ctttcttggt tctctggctt 245521 gcagatggaa gactgtagga cttcttggct tccataatta tgtacaccaa ttcacgttat 245581 tgattttttt ttggagaacc ctgactaata cattgtgcct acaccgtaga tgagggcgtg 245641 agtgcccctt ccatggattg agagtgggga agaggtagtt tctctgagga aagctagaga 245701 aacggtattg tgtcctggtt aaaatcacgg attatggagc cagcattcct agcctcagat 245761 ctgtgtgacc ttggacaaat aacttaaatc catttatgcc ttagattcct cagctaaaaa 245821 atgagttaaa taatagtttc tacttgatag cattgttatg agtatcaaat acatagatat 245881 ctgtaaaagc acttaggaca gtgtctggtg catggcaaga gctatatatg tgatgatgat 245941 gatgacaatg acaatgaata acagtgcaca catgctgagc agccagcaag gacaggtgtc 246001 ttgtcagaca gaccaggaaa gggtgaagaa tgagcaggtt ctctgtctgc cacctggcag 246061 tccttcctag aagtgtggag gctctgtaag tggcatgaga cccctctctg tatgaatgaa 246121 ggagtgcagc cattcattgc cttgagcatg aggagagcag gaagtactcc ctgcagccag 246181 catgcatagg atttggcagc ctacctgaaa tgattctatc tctggataca gttccaaaaa 246241 ggtcagttgg caccaaggat gatgaattgg tccagacacc tgaactggct tgactcagtt 246301 cattgacctt tttctgctca tggttggagg catgattaaa gacctagcca cacctgcttg 246361 cccacctggc cagactctcc tttagctgcc cctgatactc atgcaggtat ggaggggtgc 246421 cctctccttc tgcatgacct tggaggaact ggtgtgcagc aagttcgtga agcagaaaca 246481 cacaggcggt gaccttggac aaggctccat ccacgataaa ggatgtgctt ggttgcatcc 246541 tctaccctct gctgttccaa gcccagtctg agccaggcaa tattatggat attagtagag 246601 ctttgtgctg ttggcattat cctacagcta tgttcacatc atatcacatt gaggcaattt 246661 tatgaaccag gcactgtccc gtgaacctag ctactgagcc aaccattcaa taacaatgag 246721 aatagacaac atttatgggc actcgccatt ggagggccac aaagtgtggc agagttcatg 246781 gatgatgctc ctggtcaccc acagtcatcc ttcttgttct tgatctcacc aggcaggaaa 246841 tcctcaattc atctgtctct tgtacctctt agagagacca ggtctctggt actgtcataa 246901 gcattactta tggacattat aggttattca gcactcagaa tactgaccac atacacagcc 246961 tcctgtgaca tctcagaaca agaaacagag gagttgtctc tgagggttgg gactagggtg 247021 agtcaagtga ggtactttcc ctgggcacag tttaaggggt tcttgtctgt gtctgtggat 247081 agctggttgg ttggctggtg ggtggatgat ctaagatggc ccctttcaca tgactgacga 247141 cgagcaggtt gtcggcctgt gtgatttggc tctcgtccat gtggcatcat cctccatcca 247201 actggctagc tctgacttgt tcatatggtg gtttccaggc tccaagaaga gcaagagagc 247261 atctcccaac actttttaag cctttgtatc atgtttgcta atgtcctaaa aagctagtct 247321 ctggccaagt ccagattcaa gtggtagaaa aatagattcc agtgcctcat tggagaggaa 247381 gtatctatgg ttaatctatt gcagatcctg ttaacccaag aggccatagc agaacagaac 247441 ccctggagac catcacactg gactttataa aaatctgcat gttatagtgt ggctcagagt 247501 catagatggc ctcatcaaaa gaatgcacct aatttattcc ctgcctcaat cgccagtgga 247561 cttaaccacc tctagctata agtggttaat gtggcagttt attgatcacg tcaatggtgg 247621 ttaagagtat ggactctgaa atcagacata cctgaatttg agtcttgatt ctgcccagat 247681 actcattagg aaaacttggg catgtcgttc gacttctcta tgccctagtc tccttatatg 247741 taaaatagga tgttagtacc aaccacatag gtttatagtg ataattttaa aaatatttaa 247801 aatgagaaag agttcacaaa caaaagacaa atgaatttca aagcccttac atgaaagcgc 247861 ttggcacagt gcatagcaag cccttggtac atgtagctaa ggccatttcc attgctccac 247921 ctaggtccat aatcaagaat gtactgacaa ttctctacat tcttccaccc agagcatgcc 247981 cttatgaggt gattacagtt tccatttcca ttacttcaca caaacctcaa attctgtgat 248041 gcagtgctaa gaaaagaact caaactttgg agccagaaaa tgctggtgtc aaattccaaa 248101 tggtctctgc tatgtattgc tccaatgacc ttgggaaaat aactttaagt tctgtgaccc 248161 tcagtttatg tctttaaaat agagaaaata atacctaaat aaagcggttt tttttttttt 248221 ggtaactatt caataggaga ctgcttgaac catccctaga ataatgtctg gcacaaagaa 248281 ggcactcaat acctggtttt aaaatcatta ttaatgtctg gaggagcctc tcctgctcta 248341 taagggatac aggttaccct ctgacctgcc aagaaataat agctaataat aatagctaac 248401 tttattgaac agggtctctg ggcaggcagt acaaaaggac tctttgtctg gtttgtctaa 248461 agataccttt ttcaacactt tatggcttta gaaataattt cttatgattt atagtcaagt 248521 tactgagtaa aacccactgt taggtcctgt gatctcctaa ttccatcact aaatcaggtc 248581 aattagagat ggtagacact aatgttgacc ttatcctttt gatcaagtct cactttgtga 248641 taagatagat taaaaatcat aaacttgtta aacttgttaa atgttgaaga tttgataaaa 248701 cagaaatgaa agtattaaat gtatgtatac attaaagtat gtaaataagt gagaacatgt 248761 ttgaaaatct ggttgaaatg gatgattttc tcataaaatt gtcaattttg tctcaaaaag 248821 aaattgaaaa ccagaatagg caaataaccg agaaagaaat gaaaacatta ttcaaggatc 248881 ttttcttctc gtaaaacccc agacaaatat gtttcatggg taaaacttct taaattttag 248941 agcaacgaag aatttctgtt catttaaaat tgtacctgat tacaaagaag aaataaacct 249001 ttacaatttt ttttaaatga aaatgagaga aatccagctc aaattagcag taggaaaaga 249061 gggcccattg actcctgaga aggacattgg gagtagagta agagcttcag gaccaaggac 249121 ctcaaaggct ggaactgaag ctccagtact tgcaagatac ttttccattt ctgctttatt 249181 ggttagttat tttgctctgt aacaaaccat tccaaaattt agtggcttga aacatcagtt 249241 atttgtggtt tctcatgatt gtgtgggtta cttgggtgat tcttctgacc tgggtttatt 249301 tgggtaatcc ctgctgggct ctgtcctggg tcttggtcag ctagtggctg ggccgatgct 249361 ggattaccta gaacagcctt gctagcattg actgagggct ggcaggctgt cgatctgggc 249421 atttcatttt tcctccatgc gtcctctttt cctccaccag gctagcttag tttcacggtt 249481 ccaagcacgg caagagaggg aaatcccaaa gcacaggtgt gtgtgtgtgt atgtgtgtgt 249541 gcgtgcgtgt gtgtgtgtgc gtgcgtgcgt gtgtgtgtgt gaaatggagt ctctttctgt 249601 tacccaggct ggagtgcagt ggtgtgatct cggctcaccg caaccctccc agatacaagt 249661 gattctcctg cctcagcctc ctgagtagct cggattatag gcacccacca ccacgcctag 249721 ctaatttttg tattttttgt agagacgggg tttcaccatg ttggccagct ggttttgaac 249781 tcctgacctc aagcgttctg cctgcctcag cctcccaaag tgctaggatt acaggtgtga 249841 gccactgcgc ctggccagca taggtgtttt ttaagccttt gtttgtgtca cagtggttat 249901 ggacaagatc agatttgggg tggagaaaca aacttcatct ctggatggga agaatagcaa 249961 agtcacatgg ccaaggagca tgcggggatg ggagacattg gtggccattg tgcagtctac 250021 cacactgacc cttatctgct gtgccagatg cctcgtttgc ccttccaatc cactctgctt 250081 tctctgagag actaagccat aatgactgct tcaactgggg ctcccttacc ctctgacctg 250141 acctctggtt ggactcaagg ggaggggcta ataggagact tggggagggg agagtgagac 250201 cctccccgag tcagtcagta tttattatgg agtcatcttc actctacctt ttgactatag 250261 gttgctgctc ctctacagat catcttctct atgtaattct ctccttctgg gttccaggag 250321 ctactccctt cctttgtcct ctggctctta ctagcctcca atcccttata ctttcccaac 250381 accatgccca cacctttaga aacagtcctt tcattgaacc tttcttaaac actcttcact 250441 ttagtatgcc atctgtttcc tgcggggccc ctgactgatg catctgcttt gtcttgcctc 250501 atttgctcct gctgtagatg gaacatgtct gcaaacatcg ttacagttga ccccatggaa 250561 agatggcaac cctagtggag gaattagcct tcttcctgca ttcgtttttt tttttttttt 250621 gagacaaagt ctcgctctgt tgcccaggct ggagtgcagt ggtacgatct cagctcactg 250681 caacctctgc ctcctgaatt caagcaattc tcctgcctca gcctcccaag tagctgggac 250741 tacagatgtg caccaccata ctcagctaat ttttgtattt tcagtagaga taaggtttca 250801 gcatattggc caggctggtc tcgaactcct gatctcaagt gatctccctg tctcagcctc 250861 tcaaagtgct aggattacag gcgtgagcga ctgcacccag ccatcttccc tgcattcata 250921 attgatctca gaaaatgttc tgatggagtt ggttctctat cactgggaga acgaactctt 250981 ataacctcag acctggatcc catggcatcc ctttcaccca ggagcagtgt atgttataag 251041 aagagtgaag gagaaacttt gatgcgctga tgaaagtaat aactaattat tataagcttg 251101 cttaaccaca ataaccgcta ttacctagta aagttggcat aaaaagtaaa ctacaagcca 251161 gtctcataca tgagtctaga tgaacatatc ccaaataaaa tagtgaatgc tattttattg 251221 ttgtcacgga cattatgcat tcaaataatg tggaggaaac aatctagaga taaaacaaca 251281 aagaaaatta tgatcaatat aacaaaatat aggaataaag ataaaaatat gccagcaagg 251341 aaccgtggaa tctggaaagc ataaatttag atggtagaag tgatagcaga atgttaccca 251401 accaatgtaa atggattgaa attctctact gaaaggcaaa gactatgaga ttggatgagc 251461 aatgtacaac tcttggcagt gtataaaaga tgcttaaagc acaggtttta aaaataaaag 251521 ggacaaaagt gtatcatgaa aatacatata taaaaatgaa acatggtctt aataatgata 251581 tcaaaagaaa taggactcaa aacaaaaagc ataaacgggg gcagatgtcc cagcagatca 251641 cttgagctca ggagtttgag accagcctgg gcaacatggc gaaaccaagt ctctacaaaa 251701 aaaaaaaaaa aaaaaagaaa aaattagcca ggcatggtgg catgcacctg tagtcccagc 251761 tactcaggag gctgatgtgg gaggatcact taagcctggg aggtggaggc tgcaatgagc 251821 tgtgattgca ccactgcact ccagcctggg caacagagtg agattctgtc tcaaataaac 251881 aaaactaaac aaaaacataa aacagggaag gagaaaacat gaactctaat gcattactgg 251941 tgagtggcaa attagtatat gttttctaaa gagcaaattt tccatacata tccagagctt 252001 taaacacatt tttgacctaa gaattactaa atctaatcat tttgacctaa gaatgacaat 252061 tctaggaatt tttggtaaga aaccatttca ggataaatgc atagatttat ctagagtgat 252121 attcattgca tcagtattta taatatcaaa aatttgaaaa tgacccaagt ctcactacca 252181 attagagttt atataagtaa aatatattac agtcatatgg tggaatgctg gccagtgatt 252241 cagtgtcaca gtgtctgggg ataaggggaa tgttctgtgt cttgactggg atggtgccta 252301 tacaggtgta cacatttttc aaaactcatc aaactatatg cttaaactct gtacattact 252361 gtatgtaaat tatactttaa taaaaagtca cagtgattaa aatgccattg gcctggaaat 252421 gcagtcataa tatattgcta aatgaaaaaa aaaaagcaga caacaaaaca ttaggtcagg 252481 tatgatagat ttaaaaatat aaaaatatat acacatgtaa agccaaaggc cacattccct 252541 aaataaattt tcgtgaatac ttaattctgt tcactgtctt atgccagact tgttacctgc 252601 agagctctga aggtcacttt gggatgttct gtagctggtt tctccagttt tgtttgtttc 252661 tggggccact ccacctctag tgcagactca ggtttcccac ttagttcagt gaggaccaaa 252721 cttgagtggc ggttccggcg taatcgggag tcacttacag ttccttccaa atgtggttac 252781 tcttgaggtc ctccatccgc tgggagtcag ctcagttcct ggactgagca tcttcaggtg 252841 cttctaataa gattctggcc tctgataagt cttgacttca attcattttt ggccccagag 252901 aaggggaatc ttggctctgc caacaaggcc ctgtccattt cctgcctcag tgtgagtggc 252961 aggaagacca ccttgatctt cagggcaaag gctcgcttcc tcctcttggt gtcatccggt 253021 ctccttggct cagtaaagca acgtaacagt caagtgcact tttctcaacg cctggtttct 253081 tctgtcttac ataaccccaa gtggggacac ccacagtgaa ctttcaaaaa taatgctcct 253141 tccagggagc gaagcagcaa tgagggttga acatctctct tgggatgacc atcacctgga 253201 ccaaaagacc catttacatc tctcaaggat aagtaatgtc aggggcaatt tagtctaaat 253261 gatacagtct ccacctccct tcctcccttc ctcccttctt ccctctctca cttatctatt 253321 atctccctac atctgaaagt gcaccatcct gctgaaatca catcctgttc acataacact 253381 gcagtccata gcttctgaat ccacacagaa gttctcgtgg gcgataccaa atgccaagag 253441 gatgccttag aaggagtctg gagatactcg aggattggcc tctctagctc agaggttaca 253501 ctgtcatcaa atgtccaaag agctcttgta cagaattttt gtggtgcagt tctaaaggag 253561 aagtaggact gaggtccatc tgtgcaggct aacgaagaac agcccatact gaccaactgg 253621 ttgctacata ataatattag ctccccatcc gaggatagta aagctcaatt gccaggcatg 253681 ctgtagaggg catagctgtg gtagatgtgg gtttctacca tgttggtggt tctactagct 253741 tttgttgttt tgttgcatgc aacagaaacc atctttggtt aacataagtt ggaaaggaat 253801 tcattggaaa agatgtggca tccttcagag cgttgaaaga aaacttgaac aaccagacca 253861 gttctatctg attttggttc ttgtgttaca ttcagcctaa attcaaattc cagaaggaga 253921 gagattatct ggccgagctc aagttgccta acaaactctt gggtcgggct tttaaaatgt 253981 tctttcaaag actaaacgct gtgggggtag gatggctttt caaaggagaa tcgaagtact 254041 gatacgggga gcagggggtg gggaggggga agaaggcaca gatgttgcgc caattggaac 254101 aacgtgtttg ctacaatttc taaatgctac attgcagatt gcagagagat tggatgacag 254161 gcttcacctg aagagataga gatcaggacc tgggaatcca tgtgttaaaa agttcttggg 254221 taattctgat ggatagctgg gtttagaagc cagtgtaagg caatatgtat aaaagctctg 254281 aaaccagaat gactgagttt gaattccagt ttggccacta actagctgat aatcttgggc 254341 tagttaccta acctccctgc ttcagtttcc tcatctataa aaagaggata aaatggcacc 254401 cacctcatag ggtccacgtg aggattaaat gagttaacct agatggtaag agcttcatgc 254461 cgtaactggc acataggaca tgctagttaa tattattgcc acaaatgatc ctttcaagtc 254521 ttagattgtg catttcaaaa gtccatcctt atttttggaa ttgcatgcat ttatcttgct 254581 ttctgcaaaa tggaaaggca tctcccatgg gctggttgga gagaggggct gataaaatga 254641 tggtaatgtg ataaaatgca gttctcttga ttgtcaagtg aattctgcat gattcttctg 254701 ctctcatttg ccacgggctg actgcttctg caatgcagaa cccgcttgta aaggtcagaa 254761 tgaattgctt ccaaacatcc ctttgtccct cactgtcacg aagacacaga gctgcatgct 254821 gtggaggaag aagcaaatcc atgatgctgg aaggccagag aacagaaacc aaccgtgcac 254881 agagaggcga gctgctggga cttgaatgaa gctgatgcgt cccatctata aattattgtg 254941 gcttctccta gttgggaggc aggaccagcc tcacaatttc ccagaagacc gtaatttctg 255001 gagcctgcag gcctgggggc aaatggagaa atcaaaaggc agatggcctt ggagataggc 255061 agccagaaag gatgtgaaag gccctggttc tggactcaag agacctcacg tttcctagtc 255121 ttctttctgt ctctgctctt ttgtgttcat gaggttcatg ccctcatgag ggctccccca 255181 cctccagtgg tctaggattc taggatgtct tggtattctt tttcaaaaaa aaaaaactaa 255241 gagccagcat ttatagatat aattctgtcc taaacacttt acatataata attcttcatt 255301 cttcattaca ccctgtgaga taggttatca ttgttataat tccatttcac agatgaggaa 255361 actgaggtac tcagagacta aatgacttgc tcaaggtcac ttgtgagagg tccagctcta 255421 ggccctctgg cccgtggggc tatgctctta cctagtacct caaactgcct ggtagacctc 255481 aaccctgtgt acctgcacaa atggtcctga agcaggcagc tttcttccct gagagctgcc 255541 tttccagata tgcactgcca aaggtctgtg accctacagc cttggcctga tttggtccag 255601 atgcgtgtgg acatctctcc caaacaggat caaattggcc tcttctagga gtttagaatt 255661 tggaatcaag ggtgctggtt ttagtctgga ctggtcacac agtttttgac aactgtttat 255721 tgtacgggaa aagtgtccag cttgatgaat tctcacacag tgaatttttc tttgtaagca 255781 gagccccaga agcccttctt tgctccttct aatctctaca cgtccctccc catccatcaa 255841 aataaccact accagaccag caatcctttg gctgcttttc cccccacccc cagtgatcta 255901 ttttatttta ttttggaaag aacacgtaac atgaaatcta tcctcttaac aaagttttaa 255961 gtgtaaaatg catcattgtt gatgatggat gcagtacgag acagcagatc tctggggctt 256021 attcatcatg cttgactgaa acttgatgcc cattggtaag taattcccca ttttgccctt 256081 cttccaggct gtagccacca ccctttcact ctgattttct atcaatttga ctatttttgt 256141 tactttacgt aagtggaatc atctagtatt tgttttcctg tgactagtgt ttttcattta 256201 ccataatgtc ctcagggttc atctatgttg tctcattgca gaacttcctt ctttttgggg 256261 ggctgaaaaa tattccatca tatgcataaa cacattttcc ttatccattt gttcgtcagt 256321 ggacacctag attgcttcca cgtttagctg ttgtgaagag tgctgcaatg aacatggaac 256381 tgctaacatt tcttcaggat gttaatttca attcttttgg gtaaataccc agaggtgagg 256441 gtgctgaatc atatggtagt tctattttta atgttttgag gaaccttcac acgggctaag 256501 gacttgaata ggcagttgtc cgaggaagac atacaaatga ccaaaaggca tatgaaaatt 256561 taaaaagcta aacatcacta atcactaggc aaatgcaaat caaaaccaca atgagatatt 256621 atctcacact tgttaggatg actatttttt tttttttttt gagatggagt cttgctctgt 256681 tgcccaggct ggagtgcaat ggcgtgatcc tggctcactg caacctccac ctcccgggtt 256741 caagtgattc tgctgcctca gcctcccaga gtgctgggat tacaggtggg agccaccacg 256801 cccggcctat tttttttttt aaaaaagaca agtgttagca aagatatgga gagattggaa 256861 cccttgtaca ctgttggcag gaatacaaaa tgttgcagat actatggaaa acagtttgcc 256921 tgcttttgag cacttaaaaa acaaaactag gctgggcccg gtggctcaca tctgtaatcc 256981 cagcactttg ggaggccgag gtagggggga tcacttgagg tcaggagttt gagaccagcc 257041 tgaccaacat ggcagaaccc cgtccctgct gaaaatacaa aaactagcct ggcatgatgg 257101 tttttgtatc tgtaatccca gctactcagg aggctgaggc aggagaatcg cttgaggttg 257161 cggtgagccg agatcatgct actgcactcc agcctgggta acagagcaag actccatctc 257221 aaacaaaaac caaaaaacag aattatagag tagaccatct cttgtgtctg gctggtttct 257281 ttcatatgtg tgtgtgtgtg gccacatgat attgaaagga cacatagcat cctgttcatc 257341 aggatgtcca gcttcagagc cagcaggggc caggcaggtg ggtgaatgag tgacgtgtgt 257401 gaccacaggc atgtcccatt ggcccattga acaccctatc tgaaaggggc agcggcactc 257461 agctccgttt cctaattcct gctgtttaaa ggatgaaggt ctaagttttc ccattttttt 257521 ccagagaaat ctgaaatccc ctaaatttag attttgacag agaattcaag aattacaact 257581 gtggtgtatt ttaaacacat ttttggactg gattccttct gtgaacctcc ggtttacaac 257641 ttctgcattc gtaggtgggg actgtcagtg gaaggtctcc cccaccccac atcggagggg 257701 agaagggaaa atgtaaattt agtcaaatgg gctttctttg gagagtagag agtgtgcaat 257761 agtccaaggc agggagcttt aaaagaatgc agaagtcttt tttctctctt ggtgtctgag 257821 ctctaaattg gctgggacat atccttagtt aggagtcagt ggctatagcg ccccctgctg 257881 tcctttagca ccctcttcgt gtccagtgag atctgcaggt gagcctggaa gccacttgct 257941 tctcctgcag gttttgggag gaagattcat cattttaaat attgctcaag tgcctgctgc 258001 tgccagtgac ttcataggct cctgttagac tcaccttcac ttatatttgg tcaattacct 258061 actccctcag tcagaccttg ccactcgatt ttttttttct ttttgataca aggtctcact 258121 gtgttgccta cgagtgcaat ggtaccatca cggctcactg cagcctctgc ttcccaggct 258181 taagtgatcc tcctgcctca gtctcccaag tagctggggc tacaggcatg tgccgctacg 258241 cctggttgat tttttaattt ttttgtagag atggggtttc atcatgttgc ccaggctggt 258301 ctcaaactct tgggctcaag caatcctccc accttggcct cccaaagtgc tgggactact 258361 caggcataag ccaccatgcc cagccaccac tctatttttt tttaaattgg aggtagggta 258421 agaaggactg ggccaggtcg tccttcccac aattccagtc tcactgtttc tatctattga 258481 aaagcatttc tggcatttgc ttggtacctt tccctggttt ctgggctgaa atagatttta 258541 tcccattttt aaacaattcc ttctgatact ttcaggggta gtttgtggat gagagcagag 258601 tgaaatgcat aggctcaggt ttgtggcctt gcaagttaat aaagagtgtg gactttggaa 258661 ctagcctgcc tgggctccaa tcttggctct ggtacctact agctgtgtat cattgttttc 258721 attgttttgt tttgttttat tttgttttgt ttgagacagg atctcactct gttgttgggg 258781 ccagagttca gtggtgcaat catagctcac tgcagcctcc aattcctggg atcaagtgat 258841 cctcccacct cagcctcctg agtggttagg accacaggca tacaccacca cacctggcta 258901 attttaaatt tttttgtaga gatggggtct tgctatgttg cccggatggg tcttagcctc 258961 ccaaagtgct gggattacag gcatgaacca ttgtgtccca ccttagctgt gcactttaaa 259021 taagtcactt gacctctctg ttcctgtttc ttcatttaca aggtgaagat gataatatac 259081 tacttcttag ggtgagtatt aaataagtta acctatgcaa agaggttaaa acagtgccta 259141 gcatatcata gaaatgtttg cttgacccag aggcctgaac ctgctttgcc ttcagtgttg 259201 gtctttaggt cctggcttca ctccttaatg ctgcatctca gcctttcctc atgaaacttt 259261 ctgtctttga tccttttcac tccttaactg tgaatctccc aaattccact gagcagtgat 259321 caattcttgg ggcatcttgc tctgtgttcc cagacattcc ttcaccctca ttttctctcc 259381 atggccaaat cctatagtgc ctcctaaata attatagctg acatgtatgt agcacctatc 259441 atatgccacg tgctgttcta agtgattgat atatgctaac tcatttaatc ctcaccctat 259501 gagaaagatg cttttttaac acctatttta tagaggagga aactgaggcc cagagaagtg 259561 aactgacttg ccaggtcaca taactaataa gcagagtctg gatttgagtt tggctctaga 259621 gcctgctctt agccaccatg tcccatcctc actgagagga cccttgttca actccactaa 259681 ccctctctgg acaagtagcc tctggtcctg tctccctgct tcctctctgt ctcactggcc 259741 taccctttcc acacctttct aaaaaccaaa tgtgatcatg tcatcctctt tgttgaaaac 259801 atgtcaatgg cttcccattg ttcttgggac aaagtaaata ctctttgaag tggctgcgaa 259861 gcagggtgga catacgtcct cctctgccaa gctt // LOCUS HUAC002400 138839 bp DNA PRI 17-DEC-1997 DEFINITION Homo sapiens Chromosome 16 BAC clone CIT987-SKA-735G6 ~complete genomic sequence, complete sequence. ACCESSION AC002400 NID g2576344 KEYWORDS HTG. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 138839) AUTHORS Adams,M.D., Loftus,B.J., Zhou,L., LaBombard,M., Fuhrmann,J., Brandon,R., Kim,U.J., Kerlavage,A.R. and Venter,J.C. TITLE Homo sapiens Chromosome 16 BAC clone CIT987-SKA-735G6 #complete genomic sequence JOURNAL Unpublished REFERENCE 2 (bases 1 to 138839) AUTHORS Adams,M.D. TITLE Direct Submission JOURNAL Submitted (07-AUG-1997) The Institute for Genomic Research, 9712 Medical Center Dr, Rockville, MD 20850, USA REFERENCE 3 (bases 1 to 138839) AUTHORS Adams,M.D. TITLE Direct Submission JOURNAL Submitted (30-OCT-1997) The Institute for Genomic Research, 9712 Medical Center Dr., Rockville, MD 20850, USA REFERENCE 4 (bases 1 to 138839) AUTHORS Adams,M.D. TITLE Direct Submission JOURNAL Submitted (17-DEC-1997) The Institute for Genomic Research, 9712 Medical Center Dr., Rockville, MD 20850, USA COMMENT Address all correspondence to: Mark Adams The Institute for Genomic Research 9712 Medical Center Dr, Rockville, MD 20850, USA e-mail address: mdadams@tigr.org. The bac location is on chromosome BAC clone is located on human chromosome 16p12 . The orientation of the sequence is from SP6 end to T7 end. Genes were identified by a combination of five methods including: XGRAIL (available by anonymous ftp from arthur.epm.ornl.gov), Genefinder (Phil Green, University of Washington), Genscan (Chris Burge, http://gnomic.stanford.Edu/~chris/GENSCANW.html ) searches of the complete sequence against a peptide database, and the Human gene Index database at TIGR (http://www.tigr.org/tdb/hg i/hgi.html). A gene with homolgy to another protein is annotated as the isolog of that protein. Genes without pepetide homolgy having spliced EST hits are termed 'u nknown protein'. Genes encoding tRNAs are predicted by tRNAscan-SE (Sean Eddy, http://genome.wustl.edu/eddy/tRNAscan-SE/). FEATURES Location/Qualifiers source 1..138839 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="16" /map="16p12" /clone="A-735G6" repeat_region complement(247..538) /rpt_family="AluSx" repeat_region complement(553..844) /rpt_family="AluSx" repeat_region complement(846..1071) /rpt_family="MIR" misc_feature 1107..1223 /note="exon predicted by xgrail, quality good" repeat_region complement(1158..1420) /rpt_family="LINE2" misc_feature 1660..1742 /note="exon predicted by xgrail, quality excellent" misc_feature complement(1737..1826) /note="exon predicted by xgrail, quality good_shadowexon" repeat_region complement(2464..2742) /rpt_family="AluSx" repeat_region complement(2765..3066) /rpt_family="AluSx" repeat_region 3101..3394 /rpt_family="AluSc" repeat_region 3404..3466 /rpt_family="AluSx" repeat_region complement(3568..3868) /rpt_family="AluJb" repeat_region 4197..4500 /rpt_family="AluSx" repeat_region 4658..4877 /rpt_family="AluSx" repeat_region complement(4886..5845) /rpt_family="LTR5" repeat_region complement(5921..6070) /rpt_family="LINE2" repeat_region 6366..6489 /rpt_family="AluY" repeat_region complement(6609..6899) /rpt_family="AluJo" repeat_region complement(6960..7269) /rpt_family="AluSq" repeat_region complement(7351..7646) /rpt_family="AluSc" repeat_region 7867..8401 /rpt_family="LTR18B" misc_feature 8364..8452 /note="exon predicted by xgrail, quality excellent" repeat_region complement(8512..8545) /rpt_family="AT_rich" repeat_region complement(8609..8744) /rpt_family="FLAM_C" mRNA join(<9024..9206,18007..18129,19907..19994,22917..23016, 24119..24287) /gene="A-735G6.1" gene <9024..24287 /gene="A-735G6.1" CDS join(<9024..9206,18007..18129,19907..19994,22917..23008) /gene="A-735G6.1" /codon_start=1 /product="ACYL CARRIER PROTEIN, MITOCHONDRIAL (ACP) (NADH-UBIQUINONE OXIDOREDUCTASE 9.6 KD SUBUNIT) (FRAGMENT(5' partial))" /db_xref="PID:g2576345" /translation="WVGVAMASRVLSAYVSRLPAAFAPLPRVRMLAVARPLSTALCSA GTQTRLGTLQPALVLAQVPGRVTQLCRQYSDMPPLTLEGIQDRVLYVLKLYDKIDPEK LSVNSHFMKDLGLDSLDQVEIIMAMEDEFGFEIPDIDAEKLMCPQEIVDYIADKKDVY E" misc_feature 9039..9206 /gene="A-735G6.1" /note="exon predicted by xgrail, quality marginal" repeat_region 9252..9343 /rpt_family="GC_rich" repeat_region 9450..9570 /rpt_family="MIR" repeat_region complement(9745..9971) /rpt_family="MER8" repeat_region complement(10044..10358) /rpt_family="AluSg" repeat_region 10542..10677 /rpt_family="AluJb" repeat_region 10678..10956 /rpt_family="AluY" repeat_region 10981..11053 /rpt_family="MER5B" repeat_region 11149..11448 /rpt_family="AluY" repeat_region complement(11611..11707) /rpt_family="MER5A" repeat_region complement(11806..11884) /rpt_family="LINE2" repeat_region complement(12027..12297) /rpt_family="AluJo" repeat_region complement(12954..13131) /rpt_family="AluJo" repeat_region 13317..13410 /rpt_family="LINE2" misc_feature complement(13569..13830) /gene="A-735G6.1" /note="exon predicted by xgrail, quality good" repeat_region complement(13764..13824) /rpt_family="LINE2" misc_feature 14164..14317 /gene="A-735G6.1" /note="exon predicted by xgrail, quality excellent_shadowexon" repeat_region 14223..14341 /rpt_family="MIR" misc_feature complement(14240..14313) /gene="A-735G6.1" /note="exon predicted by xgrail, quality good" repeat_region 14377..14678 /rpt_family="AluSx" repeat_region complement(14794..15040) /rpt_family="MIR" misc_feature 15151..15222 /gene="A-735G6.1" /note="exon predicted by xgrail, quality marginal_shadowexon" repeat_region complement(15238..15347) /rpt_family="AluJo" repeat_region complement(15371..15673) /rpt_family="AluSx" misc_feature complement(15668..15741) /gene="A-735G6.1" /note="exon predicted by xgrail, quality good_shadowexon" repeat_region complement(16011..16308) /rpt_family="AluSg" repeat_region complement(16332..16619) /rpt_family="AluSx" repeat_region complement(16853..17153) /rpt_family="AluSx" misc_feature 18007..18129 /gene="A-735G6.1" /note="exon predicted by xgrail, quality excellent" misc_feature complement(18064..18104) /gene="A-735G6.1" /note="exon predicted by xgrail, quality good_shadowexon" repeat_region 18379..18678 /rpt_family="AluSp" repeat_region 18694..18988 /rpt_family="AluSg1" repeat_region 19101..19396 /rpt_family="AluSg" repeat_region 19544..19803 /rpt_family="L1PA6" misc_feature 19907..19994 /gene="A-735G6.1" /note="exon predicted by xgrail, quality excellent" repeat_region 20656..20808 /rpt_family="MIR" STS 20702..20827 /gene="A-735G6.1" /db_xref="dbSTS:G25314" repeat_region 20853..21069 /rpt_family="L1MB7" misc_feature 20961..21009 /gene="A-735G6.1" /note="exon predicted by xgrail, quality marginal" repeat_region 21089..22513 /rpt_family="SVA" repeat_region 22593..22700 /rpt_family="L1MB7" misc_feature 22917..22996 /gene="A-735G6.1" /note="exon predicted by xgrail, quality excellent" repeat_region 23401..23639 /rpt_family="MIR" repeat_region complement(23727..24028) /rpt_family="AluSx" misc_feature 24119..24238 /gene="A-735G6.1" /note="exon predicted by xgrail, quality excellent" STS 24130..24231 /gene="A-735G6.1" /db_xref="dbSTS:G27632" misc_feature 24843..24874 /note="exon predicted by xgrail, quality excellent" misc_feature complement(24999..25049) /note="exon predicted by xgrail, quality excellent_shadowexon" repeat_region complement(25106..25228) /rpt_family="L1PA15" repeat_region complement(25229..25531) /rpt_family="AluSq" repeat_region complement(25553..25768) /rpt_family="L1PA9" repeat_region complement(25771..25794) /rpt_family="(CA)n" repeat_region complement(25801..26096) /rpt_family="AluSx" repeat_region complement(26099..26262) /rpt_family="L1PA9" repeat_region complement(26272..26543) /rpt_family="AluJb" repeat_region 27574..27777 /rpt_family="L1MC4" misc_feature 27579..27653 /note="exon predicted by xgrail, quality good" repeat_region 27783..27942 /rpt_family="AluJb" repeat_region 28013..28127 /rpt_family="AluJo/FRAM" repeat_region 28130..28425 /rpt_family="AluSq" repeat_region complement(29495..29785) /rpt_family="AluSp" repeat_region complement(30035..30333) /rpt_family="AluSx" repeat_region complement(30335..30468) /rpt_family="AluJo" misc_feature complement(32019..32150) /note="exon predicted by xgrail, quality marginal_shadowexon" misc_feature 32287..32394 /note="exon predicted by xgrail, quality good" misc_feature 32628..32773 /note="exon predicted by xgrail, quality good" misc_feature 33885..34103 /note="exon predicted by xgrail, quality good" misc_feature complement(33940..34106) /note="exon predicted by xgrail, quality good_shadowexon" mRNA complement(join(34341..34809,38222..38304,42561..42666, 43018..43083,45613..45821,47010..47339,47522..48218)) /gene="A-735G6.2" gene complement(34341..48218) /gene="A-735G6.2" CDS complement(join(34699..34809,38222..38304,42561..42666, 43018..43083,45613..45821,47010..47339,47522..48218)) /gene="A-735G6.2" /codon_start=1 /product=" Unknown protein product with similarity to Ubiquitin binding enzyme" /db_xref="PID:g2576346" /translation="MGSRCLNPPPPAHSDTTGKDSFGNIRGAETGQGASACSVTSARV TCGAGSEPHSHRNPGISAQVGLAPSYGAARGRRRPLALQQSPQERRHVGWNSTRGLLP ASLPGTASSQSASATASAALPLKVTGPLARNPTPPWTAAAALATRGQRPEKGLFPGPA PFSLGKRKRGRGRTWERRRRVSIETSTCFRPGCERLGAAAGANLSQLASSQRPLRERW VLYTIIMAAAGAPDGMEEPGMDTEAETVATEAPARPVNCLEAEAAAGAAAEDSGAARG SLQPAPAQPPGDPAAQASVSNGEDAGGGAGRELVDLKIIWNKTKHDVKFPLDSTGSEL KQKIHSITGLPPAMQKVMYKGLVPEDKTLREIKVTSGAKIMVVGSTINDVLAVNTPKD AAQQDAKAEENKKEPLCRQKQHRKVLDKGKPEDVMPSVKGAQERLPTVPLSGMYNKSG GKVRLTFKLEQDQLWIGTKERTEKLPMGSIKNVVSEPIEGHEDYHMMAFQLGPTEASY YWVYWVPTQYVDAIKDTVLGKWQYF" repeat_region complement(35008..35190) /rpt_family="MER5A" repeat_region 35207..35307 /rpt_family="L1ME" repeat_region complement(35600..35653) /rpt_family="AT_rich" repeat_region complement(35717..36018) /rpt_family="AluSx" repeat_region complement(36449..36751) /rpt_family="AluSg" repeat_region 36833..37131 /rpt_family="AluSc" misc_feature 37470..37604 /gene="A-735G6.2" /note="exon predicted by xgrail, quality marginal" misc_feature 37930..38012 /gene="A-735G6.2" /note="exon predicted by xgrail, quality good" misc_feature complement(38222..38304) /gene="A-735G6.2" /note="exon predicted by xgrail, quality excellent_shadowexon" repeat_region 38620..38931 /rpt_family="AluSq" repeat_region 39579..39640 /rpt_family="MIR" misc_feature 41231..41347 /gene="A-735G6.2" /note="exon predicted by xgrail, quality marginal_shadowexon" repeat_region 42274..42314 /rpt_family="MER58A" misc_feature complement(42561..42666) /gene="A-735G6.2" /note="exon predicted by xgrail, quality good" misc_feature 42624..42685 /gene="A-735G6.2" /note="exon predicted by xgrail, quality excellent_shadowexon" repeat_region 42724..42933 /rpt_family="AluSx" misc_feature complement(43018..43083) /gene="A-735G6.2" /note="exon predicted by xgrail, quality excellent" repeat_region 43233..43401 /rpt_family="MER3" misc_feature complement(45421..45510) /gene="A-735G6.2" /note="exon predicted by xgrail, quality excellent" misc_feature complement(45613..45801) /gene="A-735G6.2" /note="exon predicted by xgrail, quality excellent" repeat_region 45914..46111 /rpt_family="MIR" repeat_region 46232..46505 /rpt_family="AluS" repeat_region 46604..46744 /rpt_family="MER5A" repeat_region 46793..46847 /rpt_family="MIR" misc_feature complement(46985..47339) /gene="A-735G6.2" /note="exon predicted by xgrail, quality excellent" repeat_region complement(47168..47253) /rpt_family="GC_rich" repeat_region complement(47485..47544) /rpt_family="GC_rich" misc_feature complement(47522..47546) /gene="A-735G6.2" /note="exon predicted by xgrail, quality marginal" misc_feature 47946..48084 /gene="A-735G6.2" /note="exon predicted by xgrail, quality good_shadowexon" repeat_region complement(48214..48259) /rpt_family="MIR" repeat_region complement(48317..48607) /rpt_family="LINE2" repeat_region 48631..48936 /rpt_family="AluSp" misc_feature 49071..49197 /note="exon predicted by xgrail, quality good_shadowexon" repeat_region 49397..49597 /rpt_family="AluSg/x" repeat_region 49717..49767 /rpt_family="MIR" repeat_region complement(49782..50085) /rpt_family="AluJb" repeat_region 50090..50172 /rpt_family="MIR" repeat_region complement(50385..50543) /rpt_family="MIR" repeat_region 50970..51262 /rpt_family="AluJb" repeat_region complement(51377..51675) /rpt_family="AluSg" repeat_region complement(51681..51814) /rpt_family="AluJb" repeat_region complement(52140..52362) /rpt_family="AluJb" repeat_region 52364..52498 /rpt_family="AluSq/x" repeat_region 52499..52787 /rpt_family="AluSx" repeat_region complement(52799..52875) /rpt_family="AluJb" misc_feature 52985..53140 /note="exon predicted by xgrail, quality excellent" repeat_region complement(53350..53647) /rpt_family="AluY" repeat_region complement(53656..53947) /rpt_family="AluJo" repeat_region 54553..54853 /rpt_family="AluY" repeat_region complement(54902..55210) /rpt_family="AluJo" repeat_region complement(55227..55497) /rpt_family="LINE2" repeat_region 55534..55784 /rpt_family="AluJb" repeat_region 55789..55921 /rpt_family="MER52" repeat_region 55939..56316 /rpt_family="MER52" misc_feature complement(56187..56240) /note="exon predicted by xgrail, quality excellent_shadowexon" repeat_region complement(56211..56351) /rpt_family="LTR20" repeat_region 56368..56437 /rpt_family="MER52" repeat_region complement(56382..56465) /rpt_family="LTR1" repeat_region 56641..56698 /rpt_family="AluJb" repeat_region complement(56989..57037) /rpt_family="(TAAA)n" repeat_region complement(57038..57339) /rpt_family="AluY" repeat_region complement(57341..57474) /rpt_family="AluJb" repeat_region 57501..57779 /rpt_family="AluSg" repeat_region complement(57780..58261) /rpt_family="L1MB6" repeat_region 58433..58734 /rpt_family="AluJb" repeat_region complement(58773..58925) /rpt_family="MIR" repeat_region complement(59186..59284) /rpt_family="MIR" repeat_region 59303..59619 /rpt_family="AluJo" repeat_region 59649..59804 /rpt_family="MIR" repeat_region complement(59877..59916) /rpt_family="MIR" repeat_region 60148..60441 /rpt_family="AluSx" repeat_region 60457..60569 /rpt_family="MIR" misc_feature 60586..60775 /note="exon predicted by xgrail, quality excellent" repeat_region complement(60818..61216) /rpt_family="MLT1F" repeat_region 61217..61303 /rpt_family="MER46" repeat_region complement(61542..61823) /rpt_family="AluSx" repeat_region complement(61860..62151) /rpt_family="AluJo" repeat_region complement(62153..62454) /rpt_family="AluSx" repeat_region complement(62511..62547) /rpt_family="L1MC3" repeat_region complement(62635..62661) /rpt_family="AT_rich" repeat_region 62676..62971 /rpt_family="AluSg" repeat_region 62995..63131 /rpt_family="AluSx" repeat_region 63132..63428 /rpt_family="AluSx" repeat_region 63429..63594 /rpt_family="AluSx" repeat_region complement(63913..63943) /rpt_family="(CAAAA)n" repeat_region complement(63945..64256) /rpt_family="AluSx" repeat_region complement(64280..64589) /rpt_family="AluSx" repeat_region complement(64590..64633) /rpt_family="Alu" repeat_region complement(64637..64831) /rpt_family="L1ME" repeat_region complement(64964..65108) /rpt_family="AluJo" repeat_region complement(65117..65420) /rpt_family="AluSx" repeat_region complement(65428..65558) /rpt_family="AluJo" repeat_region 65618..65640 /rpt_family="AT_rich" repeat_region 66098..66313 /rpt_family="AluSg/x" repeat_region complement(66426..66712) /rpt_family="AluJo" repeat_region 66852..67118 /rpt_family="AluSx" repeat_region 67208..67498 /rpt_family="AluJb" repeat_region 67519..67699 /rpt_family="AluJb" gene complement(67697..68205) /gene="A-735G6.5" /pseudo repeat_region 67923..68030 /rpt_family="AluJb" misc_feature complement(68144..68262) /note="exon predicted by xgrail, quality excellent_shadowexon" repeat_region 68309..68608 /rpt_family="AluSg" repeat_region complement(68618..68831) /rpt_family="AluJo" repeat_region complement(68840..68966) /rpt_family="FLAM_C" repeat_region complement(69140..69244) /rpt_family="MER5A" repeat_region complement(69787..69875) /rpt_family="MIR" mRNA join(69858..70401,72524..72632,75392..75545,75657..75791) /gene="A-735G6.3" gene 69858..75791 /gene="A-735G6.3" CDS join(69858..70401,72524..72632,75392..75545,75657..75791) /gene="A-735G6.3" /codon_start=1 /product="Glutamyl tRNA synthetase" /db_xref="PID:g2576348" /translation="MTEVGLLSINLSINSTHAALLPIRYDNRCRNMSQEQVAQKLAKD PKPAIRFRLEQVVPAFQDLVYGWNRHEVASVEGDPVIMKSDGFPTYHLACVVDDHHMG ISHVLRGSEWLVSTAKHLLLYQALGWQPPHFAHLPLLLNRDGSKLSKRQGDVFLEHFA ADGFLPDSLLDIITNCGSGFAENQMGRTLPELITQFNLTQVTCHSALLDLEKLPEFNR LHLQRLVSNESQRRQLVGKLQVLVEEAFGCQLQNRDVLNPVYVERILLLRQGHICRLQ DLVSPVYSYLWTRPAVGRAQLDAISEKVDVIAKRVLG" misc_feature 69902..70383 /gene="A-735G6.3" /note="exon predicted by xgrail, quality excellent" repeat_region 70436..70528 /rpt_family="MIR" repeat_region complement(70530..70637) /rpt_family="LINE2" repeat_region 70621..70761 /rpt_family="MIR" repeat_region complement(71007..71126) /rpt_family="MIR" repeat_region complement(71220..71515) /rpt_family="AluSq" repeat_region 71516..71610 /rpt_family="MER5B" repeat_region 71848..71972 /rpt_family="AluJ" repeat_region complement(71974..72000) /rpt_family="(CATA)n" repeat_region complement(72001..72283) /rpt_family="AluSg" repeat_region 72284..72468 /rpt_family="AluJ" misc_feature 72524..72632 /gene="A-735G6.3" /note="exon predicted by xgrail, quality excellent" repeat_region 72712..72910 /rpt_family="MIR" repeat_region 73177..73463 /rpt_family="AluSx" repeat_region complement(73618..73727) /rpt_family="MIR" repeat_region 73729..74028 /rpt_family="AluY" repeat_region 74093..74392 /rpt_family="AluSq" repeat_region complement(74397..74699) /rpt_family="AluJb" repeat_region 74719..74836 /rpt_family="AluY" repeat_region 74875..75161 /rpt_family="AluJo" misc_feature 75392..75545 /gene="A-735G6.3" /note="exon predicted by xgrail, quality excellent" misc_feature 75657..75787 /gene="A-735G6.3" /note="exon predicted by xgrail, quality excellent" misc_feature complement(75705..75896) /note="exon predicted by xgrail, quality good_shadowexon" misc_feature 75864..75933 /note="exon predicted by xgrail, quality excellent" repeat_region 76066..76140 /rpt_family="MIR" repeat_region complement(76475..76670) /rpt_family="AluJo" repeat_region complement(77052..77408) /rpt_family="MLT1A1" repeat_region 77438..77737 /rpt_family="AluY" repeat_region 78024..78321 /rpt_family="AluJo" repeat_region complement(78823..78844) /rpt_family="AT_rich" repeat_region 78866..79178 /rpt_family="AluSx" repeat_region complement(79274..79570) /rpt_family="AluSx" repeat_region complement(79577..79627) /rpt_family="MIR" misc_feature complement(79679..79877) /note="exon predicted by xgrail, quality good_shadowexon" misc_feature 79944..80045 /note="exon predicted by xgrail, quality excellent" repeat_region complement(80174..80312) /rpt_family="LINE2" repeat_region 80345..80642 /rpt_family="AluSx" misc_feature 80835..80903 /note="exon predicted by xgrail, quality good" repeat_region complement(81241..81635) /rpt_family="MLT1B" repeat_region 82096..82605 /rpt_family="MLT1F" STS 83044..83233 /db_xref="dbSTS:G22951" repeat_region complement(83160..83415) /rpt_family="MIR" repeat_region 83539..83616 /rpt_family="L1MB1" repeat_region 83618..83889 /rpt_family="AluJo" repeat_region 83902..84186 /rpt_family="L1MB3" repeat_region 84199..84502 /rpt_family="AluSx" repeat_region complement(84616..84781) /rpt_family="FRAM" repeat_region 84919..85014 /rpt_family="MIR" repeat_region complement(85016..85323) /rpt_family="AluSp" repeat_region 85947..86247 /rpt_family="AluJb" repeat_region 86312..86476 /rpt_family="LINE2" repeat_region 86777..87074 /rpt_family="AluJb" repeat_region 87080..87371 /rpt_family="AluSg" repeat_region 87391..87548 /rpt_family="AluY" repeat_region 87761..88061 /rpt_family="AluYa5" repeat_region 88062..88203 /rpt_family="AluJo/FRAM" misc_feature 88496..88589 /note="exon predicted by xgrail, quality good" repeat_region complement(88637..88807) /rpt_family="LINE2" repeat_region complement(88881..88977) /rpt_family="MLT2" repeat_region 88978..89277 /rpt_family="AluJo" repeat_region 89309..89447 /rpt_family="AluJo/FRAM" repeat_region complement(89457..89596) /rpt_family="MLT2CB" repeat_region complement(89628..89683) /rpt_family="MSTB" repeat_region complement(89696..89974) /rpt_family="AluSx" repeat_region complement(89983..90113) /rpt_family="FLAM_A" repeat_region 90120..90142 /rpt_family="AT_rich" repeat_region 90300..90458 /rpt_family="MER5A" misc_feature 90312..90550 /note="exon predicted by xgrail, quality good" repeat_region 90492..90549 /rpt_family="MER39" repeat_region 90551..90833 /rpt_family="AluJb" repeat_region 90834..90879 /rpt_family="MER39" repeat_region 90882..91185 /rpt_family="AluSx" repeat_region 91186..91454 /rpt_family="MER39" repeat_region complement(91455..91485) /rpt_family="(GAAAA)n" repeat_region complement(91486..91786) /rpt_family="AluJb" repeat_region 91788..92074 /rpt_family="MER39b_j" repeat_region 92285..92580 /rpt_family="AluSx" misc_feature 92678..92850 /note="exon predicted by xgrail, quality good" misc_feature complement(93381..93505) /note="exon predicted by xgrail, quality excellent_shadowexon" repeat_region 93694..93786 /rpt_family="AluSg/x" repeat_region 93831..94138 /rpt_family="AluSg" repeat_region 94149..94249 /rpt_family="AluJb" repeat_region 94250..94272 /rpt_family="(CA)n" repeat_region 94294..94351 /rpt_family="AluJb" repeat_region 94352..94378 /rpt_family="(CA)n" repeat_region 94379..94578 /rpt_family="AluJb" repeat_region 94579..94649 /rpt_family="MER53" repeat_region 94710..94770 /rpt_family="GC_rich" repeat_region 94826..94924 /rpt_family="(CGG)n" mRNA join(94878..94968,109511..109595,110911..110986, 111831..111929,113489..113612,116580..116683, 118499..118579,119137..119274,122285..122366, 124519..124644,125402..125524,126378..126406, 126788..126921,130228..130385,135124..135293, 136293..136403,137589..138258) /gene="A-735G6.4" gene 94878..138258 /gene="A-735G6.4" CDS join(94878..94968,109511..109595,110911..110986, 111831..111929,113489..113612,116580..116683, 118499..118579,119137..119274,122285..122366, 124519..124644,125402..125524,126378..126406, 126788..126921,130228..130385,135124..135293, 136293..136403,137589..137699) /gene="A-735G6.4" /codon_start=1 /product=" Unknown protein product with similarity to KIAA0154" /db_xref="PID:g2576347" /translation="MAATAVAAAVAGTESAQGPPGPAASLELWLNKATDPSMSEQDWS AIQNFCEQVNTDPNGPTHAPWLLAHKIQSPQEKEALYALTVLEMCMNHCGEKFHSEVA KFRFLNELIKVLSPKYLGSWATGKVKGRVIEILFSWTVWFPEDIKIRDAYQMLKKQGI IKQDPKLPVDKILPPPSPWPKSSIFDADEEKSKLLTRLLKSNHPEDLQAANRLIKNLV KEEQEKSEKVSKRVSAVEEVRSHVKVLQEMLSMYRRPGQAPPDQEALQVVYERCEKLR PTLFRLASDTTDDDDALAEILQANDLLTQGVLLYKQVMEGRVTFGNRVTSSLGDIPVS RVFQNPAGCMKTCPLIDLEVDNGPAQMGTVVPSLLHQDLAALGISDAPVTGMVSGQNC CEEKRNPSSSTLPGGGVQNPSADRNLLDLLSAQPAPCPLNYVSQKSVPKEVPPGTKSS PGWSWEAGPLAPSPSSQNTPLAQVFVPLESVKPSSLPPLIVYDRNGFRILLHFSQTGA PGHPEVQVLLLTMMSTAPQPVWDIMFQVAVPKSMRVKLQPASSSKLPAFSPLMPPAVI SQMLLLDNPHKEPIRLRYKLTFNQGGQPFSEVGEVKDFPDLAVLGAA" repeat_region 95048..95106 /rpt_family="GC_rich" repeat_region complement(95259..95383) /rpt_family="MIR" repeat_region complement(95514..95548) /rpt_family="(CA)n" repeat_region complement(95549..95613) /rpt_family="FLAM" repeat_region complement(95632..95750) /rpt_family="MIR" misc_feature 95884..95987 /gene="A-735G6.4" /note="exon predicted by xgrail, quality good" repeat_region 96934..97225 /rpt_family="AluSx" repeat_region complement(97470..97647) /rpt_family="AluSg/x" repeat_region complement(97664..97930) /rpt_family="AluSq" misc_feature complement(98408..98497) /gene="A-735G6.4" /note="exon predicted by xgrail, quality good_shadowexon" repeat_region complement(98610..98796) /rpt_family="MER20" repeat_region 99155..99292 /rpt_family="MIR" STS 99194..99323 /gene="A-735G6.4" /db_xref="dbSTS:G29538" repeat_region complement(100860..100939) /rpt_family="LINE2" misc_feature 100939..101015 /gene="A-735G6.4" /note="exon predicted by xgrail, quality marginal_shadowexon" repeat_region 101102..101158 /rpt_family="MIR" repeat_region complement(101192..101489) /rpt_family="AluSp" repeat_region 101863..101916 /rpt_family="AT_rich" misc_feature 102013..102184 /gene="A-735G6.4" /note="exon predicted by xgrail, quality excellent_shadowexon" repeat_region complement(102452..102528) /rpt_family="MER20" misc_feature complement(103444..103511) /gene="A-735G6.4" /note="exon predicted by xgrail, quality good" misc_feature complement(103634..104096) /gene="A-735G6.4" /note="exon predicted by xgrail, quality good" misc_feature 105368..105472 /gene="A-735G6.4" /note="exon predicted by xgrail, quality marginal_shadowexon" repeat_region 105529..105645 /rpt_family="AluY" repeat_region complement(105649..105686) /rpt_family="AluJ/FLAM" repeat_region 105846..106143 /rpt_family="AluSg" repeat_region complement(106229..106309) /rpt_family="(GGGA)n" repeat_region complement(106705..106840) /rpt_family="AluJo" repeat_region complement(106845..106995) /rpt_family="AluJo" repeat_region complement(106996..107290) /rpt_family="AluSx" repeat_region complement(107293..107437) /rpt_family="AluJo" repeat_region complement(107439..107489) /rpt_family="MIR" repeat_region complement(108167..108315) /rpt_family="AluJ" repeat_region complement(108316..108614) /rpt_family="AluSx" repeat_region complement(108620..108742) /rpt_family="AluJ" repeat_region complement(108967..109271) /rpt_family="AluSx" repeat_region complement(109290..109353) /rpt_family="LINE2" misc_feature 109511..109595 /gene="A-735G6.4" /note="exon predicted by xgrail, quality excellent" repeat_region 109693..109823 /rpt_family="FLAM_C" misc_feature 110119..110150 /gene="A-735G6.4" /note="exon predicted by xgrail, quality good" repeat_region complement(110194..110489) /rpt_family="AluSx" misc_feature 110911..110986 /gene="A-735G6.4" /note="exon predicted by xgrail, quality excellent" misc_feature complement(111153..111202) /gene="A-735G6.4" /note="exon predicted by xgrail, quality good_shadowexon" misc_feature 111831..111929 /gene="A-735G6.4" /note="exon predicted by xgrail, quality excellent" repeat_region 112377..112801 /rpt_family="MER21B" misc_feature complement(112493..112582) /gene="A-735G6.4" /note="exon predicted by xgrail, quality good_shadowexon" repeat_region 112860..112932 /rpt_family="MER21B" repeat_region 112972..113195 /rpt_family="MER21B" repeat_region 113484..113536 /rpt_family="L1M4" misc_feature 113489..113612 /gene="A-735G6.4" /note="exon predicted by xgrail, quality excellent" repeat_region complement(113660..113739) /rpt_family="L1M4" repeat_region complement(113963..114147) /rpt_family="AluJ" repeat_region 114156..114192 /rpt_family="(CA)n" repeat_region complement(114192..114235) /rpt_family="(GA)n" repeat_region complement(114243..114354) /rpt_family="AluJ" misc_feature 114381..114472 /gene="A-735G6.4" /note="exon predicted by xgrail, quality excellent" repeat_region complement(114536..114835) /rpt_family="AluSx" repeat_region complement(114842..114952) /rpt_family="L1MA5" repeat_region 115750..116050 /rpt_family="AluJo" misc_feature 116580..116683 /gene="A-735G6.4" /note="exon predicted by xgrail, quality excellent" repeat_region complement(117675..117752) /rpt_family="LINE2" repeat_region complement(117813..117977) /rpt_family="MER20" repeat_region 118010..118307 /rpt_family="AluSc" misc_feature complement(119111..119268) /gene="A-735G6.4" /note="exon predicted by xgrail, quality good_shadowexon" misc_feature 119137..119274 /gene="A-735G6.4" /note="exon predicted by xgrail, quality excellent" repeat_region complement(119299..119908) /rpt_family="L1ME1" repeat_region complement(119962..120841) /rpt_family="L1MB6" repeat_region complement(120852..121163) /rpt_family="AluSx" repeat_region complement(121164..121282) /rpt_family="AluSq/x" repeat_region 121344..121647 /rpt_family="AluSp" repeat_region complement(121895..122064) /rpt_family="AluSq" misc_feature 122285..122366 /gene="A-735G6.4" /note="exon predicted by xgrail, quality excellent" misc_feature 122447..122610 /gene="A-735G6.4" /note="exon predicted by xgrail, quality good" repeat_region complement(123006..123304) /rpt_family="AluSx" repeat_region complement(123497..123798) /rpt_family="AluSx" repeat_region complement(123902..124186) /rpt_family="AluSq" misc_feature complement(124260..124321) /gene="A-735G6.4" /note="exon predicted by xgrail, quality marginal_shadowexon" misc_feature 124519..124644 /gene="A-735G6.4" /note="exon predicted by xgrail, quality excellent" misc_feature complement(124573..124688) /gene="A-735G6.4" /note="exon predicted by xgrail, quality good_shadowexon" repeat_region 124961..125092 /rpt_family="MER5A" misc_feature 125402..125524 /gene="A-735G6.4" /note="exon predicted by xgrail, quality good" repeat_region complement(127036..127070) /rpt_family="POLY_A" repeat_region complement(127071..127333) /rpt_family="AluSx" repeat_region complement(127392..127512) /rpt_family="THE1B" repeat_region complement(127523..127827) /rpt_family="AluY" repeat_region complement(127829..128053) /rpt_family="THE1B" repeat_region 128054..128326 /rpt_family="AluJb" misc_feature complement(128926..129033) /gene="A-735G6.4" /note="exon predicted by xgrail, quality good" repeat_region complement(129340..129667) /rpt_family="AluSx" repeat_region complement(129668..129800) /rpt_family="MIR" repeat_region 129805..130099 /rpt_family="AluSx" repeat_region 130538..130650 /rpt_family="AluJb" repeat_region 130671..130927 /rpt_family="AluSx" repeat_region 131443..131591 /rpt_family="L1MD3" repeat_region 131662..131953 /rpt_family="AluSx" misc_feature complement(131852..131930) /gene="A-735G6.4" /note="exon predicted by xgrail, quality good" repeat_region complement(132239..132536) /rpt_family="AluY" repeat_region complement(132562..132732) /rpt_family="AluJb" repeat_region 132792..133097 /rpt_family="AluSx" repeat_region complement(133361..133670) /rpt_family="AluJo" repeat_region 134149..134462 /rpt_family="AluSp" repeat_region complement(134521..134807) /rpt_family="AluSx" misc_feature 135124..135293 /gene="A-735G6.4" /note="exon predicted by xgrail, quality excellent" repeat_region complement(135882..136028) /rpt_family="MIR" misc_feature 136293..136390 /gene="A-735G6.4" /note="exon predicted by xgrail, quality good" repeat_region 136517..136810 /rpt_family="AluSx" repeat_region 136814..137109 /rpt_family="AluSc" repeat_region 137111..137189 /rpt_family="(GAAA)n" misc_feature 137589..137699 /gene="A-735G6.4" /note="exon predicted by xgrail, quality excellent" repeat_region complement(137855..137923) /rpt_family="(CA)n" misc_feature complement(138456..138629) /note="exon predicted by xgrail, quality excellent_shadowexon" misc_feature 138497..138620 /note="exon predicted by xgrail, quality good" BASE COUNT 35584 a 32067 c 34079 g 37109 t ORIGIN 1 aagcttgcac tgtttgagag cacaagagtg gagaggtggg caccttaatt ggcagtagag 61 ggttggggag agactgtcaa agaaaacctt agggagtttc caaagataag ttaggagtta 121 ggtacgcaaa gttgggtggg aaccagggat aaaaagtatc atgcagttca ttcatataat 181 actctttgga gagctggcca tgcctcaggc accagtgata cagcagtgaa taataataat 241 agctaatttt ttttttgaga cggagtttct ctctgttgcc caggctggag tgcagtggca 301 tgatctcagc tcactgcaac ctctgcctcc caggttcaag cgatcctctt gcctcagccc 361 ccctagtagc tgggattaca ggcacgtgcc accatgccca gctaattttt gtatttttag 421 tagagacggg gtttcaccat gttggccagg ctagtcgtga actccttacc tcaggtgatc 481 tgcctgcctt ggcctcccaa agttctggga ttaccggcgt gagccaccag gcctggccac 541 aataacagct aatttttttt tttggagaca gggtctcact ttatcgccca ggttggagtg 601 cagtggcacg atcttggctc actgcaacct ccacctccca tgttcaagca attcttgtgc 661 ttcagcctcc cgagtagctg ggattacagg catgtgccac cacgcctggc taattttgta 721 tttttagtag agaccgggtt tctccatgtt ggccaggctg gtctcaaact cccaacctca 781 ggtgatccac ccgccttggc ctcccaaagt gctgggatta caggcgtgag ccaccacacc 841 cggctaataa tagctaattt ttattgagca cctcctgtgt gccaggcact gcgctaagcc 901 ttttatagga actgtttcat ttaacctcac cacagtcctg tcaagtatgt gctcttaaaa 961 tcatccctat tgtactgata aagttaaata atttgcccag ggttgcacag ctcttacttt 1021 gtagagttag aatttaaacc caggtagctg actgcagagc ccttgtctaa atgaaaaagg 1081 caaccaggta ggtttctgct tcatagatgg catcttttgt gggggaaatt cccactgaat 1141 atggaaacag ggtaatctca aatagtgaaa agtgcaatga agataaaaac atatatgtca 1201 taaagatgga acagatcccg tatgtgtgtg cacactgcct gagatagagg attaggggtg 1261 acctttctat ggacataaca ggcctgaatg atgacaagaa ctcagccatg ctaagagctg 1321 gggtgagagc ctgtcaggga gaaagaatgg caagagcagt gacactgagt tgggactgag 1381 tttggccaga gtgatggaga gtggaaggaa ggccactgtg cgccagcctt caattgctac 1441 aagatgcatc agagagattg gcaggaactg gacatgtgat tctgtccaaa actgcaacac 1501 aatagccaac agacctctaa ggctacaaat gtaggcattc atagttacag ggatttttgt 1561 tcctgttgct ggttttggga acatggtttt gacctttttt tttttttaat tgttttttgg 1621 atatgtaatc tgaattatat cttctttgta tgctatcagg ttcctggaag gtgacgtgaa 1681 agatcactgt gcagcagcaa tcttgacttc tggaacaatt gccatttggg acttacttct 1741 cggtcagtgt actgccctcc tcccacctgt ctctgaccaa cattggtctt ttgtgaaatg 1801 gtcgggtaca gactctcatt tgctggctgg acaaaaagat ggaaatatat ttgtatacca 1861 ctattcataa gttagggtaa agtgaaaaca caattttctg gatatattgg gcctcttagt 1921 attttttgga gttttaaata taaaggagaa tatctgaatg acacttaaaa tgattgcttg 1981 tttatgtcca gacagactta ttttttattc taatgatggt agcaccactg atcttggatg 2041 tacatttatg tatactttga gaaaaagggt tttaggttga tttttgtaat ttcccacatt 2101 tgtacatgtg cttttaaagg tgtacataaa gcttcaaatg gcaataaata tttattttta 2161 tacattctgc ttggcatgtt attgtttccc attctttcaa gatcatttgc agaagcaaga 2221 actaattatt atacaaccag gatatttaat caatagtctt tgcctagtaa gtgtaaattt 2281 cagttcagtt aacttactac actaatatgc aatatacttt ttgtgctatt tgtaatactt 2341 tatttttaca tacatactaa aataaaatga aacctactaa aatatcctaa tttagggata 2401 ctcttactct ttttcattct catgcatctt tcatgccagt gctgcattat cactagagtc 2461 accttttttt gagatggagc ctcactggtc acccaggcgg gagtgcagtg gcatgatctc 2521 ggctcattgc aacctctgcc tcccaggtta agtgattccc ctgcttcagc ctctcaagta 2581 gctgggatta caggcaccca ccaccacacc tggctaattt ttgtatttta atagagatgg 2641 gggtctcacc atgttggtca ggcaggcctt gaactcctga cctcaagtga tccacctgct 2701 ttgacctccc aaagtgctgg gattacaggc atgaaccact gccgctcttt tttttcttgg 2761 ttttttgttt tttgtttttt ttttagaacg gagtctccct ctatcaccca ggctagagtg 2821 cagtggggtg atctgggctc actgcaacct ccgcctccca ggttcaagtg attctcatgc 2881 ctcagcctcc tgagtagctg ggattacagg catgctctac tatacccggc taatttttat 2941 attgtttgta gagacagggt ttcactgtgt tggccaggct agtctggaac tcctgacctt 3001 aggtgatctg cctacgtcgg cctcccaaag tactgggatt acagggatga gctaccgtgc 3061 ccagccaaag aagattatat tttttaaaga caaatttttg ggccgggcgc agtggatcac 3121 gcctgtaatc ccagcacttt gggaggccaa ggtgggcaga tcactaggta aggagatcaa 3181 gaccatcctg gctaacacag tgaaacccca tctctactaa aaatacacaa aattagccgc 3241 gcatggtggc acacgcctgt agtcccagct actcgggagg ctgaggcagg agaatcactt 3301 gaacccggga ggcggaggtt gcagtgagct gagatcacac cactgcgttc cagcctggac 3361 agagcaagac tctgtctcag aaaaaaaaaa aagacacatt tttggccagg tgcagtgcta 3421 taatcccagc actttgggaa gctgaggcgg gtggatcaca agaggtgggt ggatcacaca 3481 cacaaaaaaa gatacatttt tatcatactg tagtttttgc tcatttacaa attctggtat 3541 ttttttaatt ttctttttca aaggatgtct attatatttt ctttgagaca gtctcactct 3601 gttgcacagg ctggagtaca gtggtatcac agctcactgc agtctcaacc tccccacact 3661 caggcgatcc tccaacctca gcctccccag tagctaggac taaggtatgg gctaccatat 3721 ccagctaatt aaaaaaaaat tttttgtgag acagagtctc accatgttgc ccaggctggt 3781 ctcgaactcc tgggctcaag ccatctaccc actgtggcat cccctcccaa aatgctaaga 3841 ttacaggtgt aagcctccac acccagcctc ttttacaaat tctataccca caatttttaa 3901 cctcaaatta tttgctagtg tatcatagac actggagaac tttttgtaaa gatttaattt 3961 actcatgaac cttgagaaac ctgtattatt atcttgataa taacaagagt caacaaaact 4021 aaggcacaaa gtaattctag tcagtaatgg ttgcacaaac tgagcatctg cttgacacag 4081 taggtatgta aatcattgct gtgagacctc tgccattgag aagtacacag tctagtggga 4141 gagactgtcg cataaagcag tcaggactgt actttaaatt taaggggtat gtttcagacc 4201 cggcacggtg gcacacacct ataatcccag cactttggag gccgaggtgg gcagatcatt 4261 tgaggttagg agtttgagac cggcctggcc aacatggtga aacattgtct cttaaataca 4321 aaaagtagct gggtgtggta gcacacacct gtaatcccag ctactcgaga gactgaggca 4381 ggagaatcgc ttgaacctgg gaggcagaca ttgcagtgag cagagattgc accactgcac 4441 tccagcctgg gcaacagagc gagactctaa ttaaaaaaaa aggaagctaa aaaaaaaaaa 4501 atttaaggga tatgttccag gtgctattat ggaagctcaa ggctgagcag tttggacaac 4561 ccaggagtcc cagaaaaact tcatgggaga tggcaattta gctgagcctt gcgggataag 4621 taggtcaaag cagaagcaac ggatattctg ggcagaaagg acagcctggc caaaatggtg 4681 aaacccagtc tctactaaaa atacaaaaat tagccgggag tggtggcagg cgcctatagt 4741 ctcagctact cgggaggttg aggcagagaa tcgcttgaac ccaggaggca gaggttgcag 4801 tgagccaaga tcgcaccatt gcactccagc ctgggcgaca gagcgagact ctgtctcaaa 4861 caatgacaaa aagaaaaagt gtaactgtag gggtgggttg cccctccaca cctgtgggtg 4921 tttctcgtaa ggtggaatga gagacttagg aaagaaaaag acacagagac aaagtataga 4981 gaaagaaata aggggacccg gggaaccagc gttcagcata tggaggatcc cgccagcctc 5041 tgagttccct tagtatttat ttatcattca tgggtgtttc tcgaagaggg ggatgtgtca 5101 gggtcacaag acaattgtgg ggagagggtc agcagacaaa cacgtgaaca aaggtctttg 5161 catcatagac aatgtaaagg attaagtgct gtgcttttag atatgcatac acataaacat 5221 ctcaatgctt tacaaagcag tattgctgcc cgcaggtccc acctccagcc ctaaggcggt 5281 ttttccctat ctcagtagat ggagcataca atcgggtttt ataccgagac attccattgc 5341 ccagggacag gcaggagaca gatgccttcc tcttgtctca actgcaagag gcattccttc 5401 ctcttttact aatcctcctc agcacagacc ctttacgggt gtcgggctgg gggacggtca 5461 ggtctttccc ttcccacgag gccatatttc agactatcac atggggagaa accttggaca 5521 atacctggct ttcctaggca gaggtccctg tggccttccg cagtttttgt gtccctgggt 5581 acttgagatt agggagtagt gatgactctt aaggagcatg ctgccttcaa gcatctgttt 5641 aacaaagcac atcttgcacc gcccttaatc cattcaactc tgagttgaca cagcacatgt 5701 ttcagagagc acggggttgg gggtaaggtc acagaatctc aaggcagaag aatttttctt 5761 agtacataac aaaatggagt ctcctatgtc tacttctttc tacacagaca cagtaacaat 5821 ctgatctctc ttgcttttcc ccacatgtaa cagcttgtct atgaaaggta ggaggaatgg 5881 tattagagat ctataaaggg tcagcggagg gtcctcttac tttgttttta ggctaagagg 5941 aactttgatg tgtttatggg ttcaggagaa agagccagaa gagagacctt tgagatgtag 6001 ggaagaagag ctaactggtg gtataaggtc ccattgggaa taagagggga tgagactacc 6061 gtgcaaatgg gctggcagag ggagcgatca tggtgggttg gtgggatagt ttgtgctggg 6121 aagctgacag cttttatgct tatccatcct ttgatgggaa tgttactgaa aagcaccggt 6181 gttgtctttt aactcttcct catctgctaa ggatgtgatc ttaatattgg gacttttacc 6241 ctacaggcca tgtaggctga tattttgtct ttatttgttt gctgaaggga aaagtagctc 6301 tcccaaatta tttaaataag agatgatttg ttatgtttta ccaaaaggat taaatgtaac 6361 actgaggccg ggtgcagtgg ctcacgcctg taatcccagc actttgggag gccgaggcgg 6421 gcagatcatg aggtcaggag tttgagacca ccctggctaa cacggtgaaa ccccatctct 6481 ataaatacat acatacatac atatatacat acatacgtgt aacactgaaa gactagatag 6541 gtcatgggaa gagatggccc caattgtgaa cttagtagga ttaggttctt cctaagctaa 6601 gagctacatt tttttgggaa acagagtctc actctgttgc ccaggctgga gtgtagtggt 6661 gtgatcatag ctcactgcag ccttgacctc ctgggctcaa ttggtcctcc cacctcagct 6721 tcccaagcag ctgggaccac agacatgcat caccatgcct ggctaatttt ttaaattttt 6781 tatggagaca aggtctccct gtgttgccta ggttgatctc aaactcctgg gctcaagcga 6841 tccacctccc ttggcctccc aaagtgctgg gattacggag gtgagccact gaaccaggct 6901 tgcctgtctg gcttgctcac agttgcagcc tcaacaatta gtacgtaaca gatactcaat 6961 ttttttttga gacttttttg agatggagct tcactcttgt tgcccaggct agagtgcaat 7021 ggcgcgatct cagctccctg caacctctac ctcccgggtt caagcaattc tcctgtctca 7081 gcttcctgcc tcctgagtag ctgggattac agtcacctgc caccacaccc agctaatttt 7141 tgtattttta gtagagacgg ggtttcacca tgttgtccag gttggtctca aactcctgac 7201 atcaggtgat ccacctgcct cagcctccca aaatgctggg attacaggtt tgacccaccg 7261 tgcctggccc ccagacactc gagttttaaa ggaatgctat aataaataga aattatttta 7321 caacagtggg atgatactat gttttttttg ttttcgtttt ttttttttga gatggagtct 7381 tgctcttttg ccagcctgga gtgcagtggc gcaatctcgg ctcactgcaa cctccgcctc 7441 ccgagttcaa gcgattctcc tgcctcagcc tcctgagtat ctgggactac aggcgcgcac 7501 caccacaccc aactaatttt tgtattttta gtagaaatgg ggtttcacca tgttgaccag 7561 gattgtctcg atctcttgac ctcgtgatcc acccaccttg gcctcccaaa gtgctgggat 7621 tacaggtatg agccaccgcg cctggcagca taccattata ttttatttta cttggattta 7681 acaaaaataa aaagaaattg aggtaaaact ggtgacatta ttggatcatt cattcaggca 7741 gatttcagaa tttggagtca ttatcccatg tgtttccact tcagtcaata tcctttagct 7801 ccctgcttag aaataggaat tagccttctt aaactcctcc ctcgcttacc tatttttgtt 7861 agctattgta aggaacatgg ctgtgctttg gtcaaggata ggccaaggta agatgtttac 7921 atcctgcgtg actcagcaag tttagacaga ggcgtataac tccacttgtt atcacagcca 7981 tgtaggcata acatgggaag gtcatcactt ggctctaagc cactattttc tgtaaaaggt 8041 ataattgccc tgctgatact gtacaggcgt tcgtgcccag agaaagagag agccagagct 8101 gtcagtcttt gcagacagac agggggagcc agatcacagc tgagagagag ttaagctgct 8161 gaccctgaag gcaagggaga gccagcctcg cagttgtgtg tgggagccat gggactaagc 8221 agctgagaca aggcagacag tgtgagagaa ctagtgtaag taagctgctg atgagagctg 8281 ctgctgaata aaatcatctt tcacctgcct accgtcccct gagtgttctt tcagctatct 8341 gttcatccac ccactcccct cagacctcga tatgggctgg aacctggctg tgggcatgac 8401 agttggcata gtcgtggacc tgacagctat attatacact gtcaagtttt aaaacatgta 8461 tattgtttta atcgtaattc tcacatttat tagtcttagg tattgaaatt gattattttt 8521 tcacattatt atatatattt ttttacctat tcaataaatg ttaatacttg tggaagaatc 8581 gtggaatcaa aactttgtcc tttttttctt ttaatctttt ttgtaataga gacggagtct 8641 tgccgtgttg gccaggttgg tcttgaactc ctgagctcaa acagtcctgc caccttggcc 8701 tcccaaagtg ctaggattaa aggcgtcagc caccgcaccc ggccccaaac gtttaaatag 8761 atgtatttca gaactttcag ataaaaaatg aaagtaggtt tctcgtgccg tgtcgtttag 8821 ggaaagacgg cgacgtctgc acagtttagt tgtgacacca agcagttgtt cgtccccgaa 8881 cctggatggg ctccgagggc gcagtgcctc caggagccgc caggggccgc caccgccggc 8941 ggtgcccgac agcgctgccg tgtgcgtcgc ctgcgcgccg ccggaaagga accctggtcc 9001 ggaggcggcg gcgcagtgca tcctgggttg gcgtagccat ggcgtctcgt gtcctttcag 9061 cctatgtcag ccgcctgccc gcggcctttg cgccgctgcc ccgggtccgg atgctggccg 9121 tggcccggcc tctcagcacc gctctgtgct ccgcggggac ccagacgagg ctcgggactt 9181 tgcagccggc cttagtgctc gcgcaggtga ctcgtggtga ctttgctgcc gcttggcaag 9241 ggtcagggat tgggcccggt gggggtgggg gccgggcatc cggggcgcgc cagctgcagg 9301 caggggtgac ccggggggca gccggctcgc tccgcagccc cgctgcctaa aggtgtttag 9361 ttgacccctg acgggaggcc gggtgacctc gtaataacga gcaagagcac tgactcgagt 9421 ccagcactgc tgggctgccg cttaccgaaa ctttggacaa gtctcctagc ctggctcaca 9481 ctcacctgca aaattgggat cctggtctct accttgcgga gtagcaatga ggattagatg 9541 agttaatgcg gctagagcac ctcgtacggt ctcaaaatag cggtcaccgt attgtagtga 9601 aagcagcatg caaagaggca gccacactgg cctacagcct tctccagagc ggttttctct 9661 ctgcagttct gtccttactc ttccctgcag caccttctgc ctgtgtccag gactcctgga 9721 agtaattcta ggaaacaagc caagtgacct ttgaacaata aggatttgaa ttgcgcgggt 9781 ccacttatat gcggattttt tttcaaccaa acgcgaccag tatttacggg atgtgccgcc 9841 ccgcgtataa ggagggcagg ctcttcctat aagcggattc tgcaaggctg actgcgggac 9901 gtgagtgtgc ggggatttgg gtatagtcgg gggtcctgga accaatctgc gcaaatacca 9961 aggaacgact gtacctgtag tatatagctt gagctgatga aatctgcgtg aggctgaaac 10021 gccgggaggc agtgatttcg gtgttgttgt tgttgttttt gagacagagt ctcgctctgt 10081 cacccaggct gtaatgcagt ggcgtgatct cagctcactg caacctccgg ctaacggcaa 10141 cctccgcctc tcaggttcaa gcaattctcc tccctcagcc cccccgagta tttgggacta 10201 caggcgcccg ccaccatgcc cggctaagtt ttgtattttt agtggagacg gggtttcatc 10261 atattggcca ggttggtctt gaactcccga cctcgcaatc cacccgcctc ggcctcccaa 10321 agtgctggga ttacaggcgt gaaccaccgc gccctgcctt ttttttgttg ttttctgagt 10381 tctccagatg actctgaagt gcagttggga ataaaaataa ccgtcttaag gtaggataga 10441 ctcccaaggt caagttatga aacccctttt caaagtcata gacttcaggt aatagtggtt 10501 tcaaccttga ctgcatgtta gtgtgaaaaa taagtgtgtc tggccaggcg tggtggctca 10561 cgcctgtaat cccagcactt tgggaggccg aggcgggcga atcgcttgag cccaggagtt 10621 tgagaccctc ctaggcaaca tggtggaacc ccgtctctac aaaaaataca gaaattaggc 10681 cgggcgcggt ggcttatgcc tgtaatccca gcactttggg atgccgaggc gggcggatca 10741 cgaggtcagg cgatcgagac cattctggct aacacagtga aaccccgtct ctactaaaca 10801 tacaaaaaat tatccgggcg tggtggcacg cgcctgtagt cccagctact caggaggctg 10861 aggtgggagg atcgattgaa cccaggaggc agaggtttta gggagctgag atcatgccac 10921 tgcactccag tctgggaggc agaccgtctt aaataagcca gccagcccgt gccagatcga 10981 cagaatctag aagtgagtac taggcgacta tttttagaag tcatcaagtg tttcttttgt 11041 gaagccagtg ttgctgttca ctgacagtgt aatggattca ttcattgggc actgccctgt 11101 tggctgtatc aaaaggtgaa aagtgtcacc ttgaaattaa gggttaaagg ccaggcgcga 11161 tggctcacgc ctgtaatccc agcagtttgg gaggccgaga cgggcggatc acgaggtcag 11221 gagatggaga ccatcctggc taacacggtg aaaccccgtc tctactaaaa atacaaaata 11281 attagccggg cgtggtggcg ggcgcctgta gtcccagcta ctcgggaggc tgaggcagga 11341 gaatggcatg aacccgggag gcagagcttt cagtgagccg agatcgtgcc actgcactcc 11401 agcctgggcg acagagccag actccatctc aataaataaa taaataaata aataaataaa 11461 taagggttaa aaaggcagta gttagaattg atgagtgtct ggttcctaag attgatttca 11521 acattgggct aaaatgtgaa gtgactgtct ttgttcttac agcattgatt ctcagttgat 11581 tggacataca gatcacttag gttggaatac agattctgat tcagttctgg gaaggggcca 11641 gagagtttcc acctccagtg ggcttccaga agatcctgct gcgggtgttc cttggaccat 11701 gctttgaatg gcaaggagct cagaaatgtt ccactatggc ggcttggctt gcatatatag 11761 cagaggaacc attttattta tatttacaaa gtggttttga ctaagttcat tcaaagatat 11821 tgagggccag ctgtgtgcca ggactcatgc tagctgttga gagatacaga gatggataca 11881 aaaggaagac attagcttac atcttcattg aacgctttgc agcccttatg tccaacttgc 11941 tggctccttc tctgctgctg tcttcttccc taggctgttg tgacactgtt ctcccttggt 12001 tggtttgtcc tcctaccttg ttgggacact ttgcctttca ggctggagtg cagtggcatg 12061 atcgtggctc actgcagcct tgacctcctg gactcaaatg gtcctcctgc ctcagcttcc 12121 tgagtagctg ggactacagg catgagccac cacccctaat tttttgatgt ctgtcttctt 12181 tttagagacg gaatctcctt atgttgccta ggctagttgt gaactcttgg gctcaagcaa 12241 tcctcctgcc tcagcctccc aagtgctggg attacaggca tgagccatca cacctggtgt 12301 gcctaaccct taaatgttgg tgggctcccg agttctctta ccacacatgc tccctggact 12361 cttcactcct ggtttctgtt attgctgtat atcccagacc tccgcctcca gtccttgttt 12421 atctcatttg gaccattgct catgctacct attccctgct gtgaaatttc ctctcctcaa 12481 atccatccac tatgccaact tcagagcaac tagtcactca tttacaccct ttttcccaaa 12541 aggagctaag gcagttttcc tgggtataaa aataccagat gatataaaat agaaggtaag 12601 gaaaaaacaa aaaaacaaaa accaagggca atggcctaaa atgcagcctg gaagtcttat 12661 acttgatcca ggtaggcccc gaatctacac tgtctagcag atggagcaca aagataattt 12721 ccagcagttc tatggggcat ggtacgcaga tgaagacagc acttgtacaa aaaaaagcta 12781 gctattccag gactgggacc aggaccagac aaagtcccag tcccttccca tggcccttaa 12841 ggctctgcag acctgggccc tccctcttag cctccaagcc tgtcatctgt gtaccatctc 12901 ccaacatgag ctgatgtctt ctcacataga cacaggcact gcactttcat gtcttttttt 12961 tttttttttt ttccagacaa ggtctctgtc acttaggctg gagtgcagtg gcatgatcat 13021 atatcagtgc agcttcaaac tcctgggctc aagcaatcct tctgcctcag tcccttgact 13081 agctgggata actggcctgc gctaccacgc ctgacctcat ttcgttttct tgacctgttc 13141 ttccctctcc ttgaattatc agtctcagct cttttcctca ccaaatccaa gtctggcatt 13201 ctcctttccg gccactcaag gactgagtga caacttttgt tattgcatca gcctacagat 13261 acttctgtca tggtgcttgc catattttta tttcaactaa aatgatctgt ggattggtct 13321 tttccattag agtgtaagct tcttttgggg gaaggttgtg tcttattcat ctttgcattt 13381 ccagcccatt gcacagttct tggcacacag gtaaagttcc caggcttctg agacagtcgt 13441 gtaaataaag gcaaggcatt gtgaaacagg tgtgtgtagt gcaaaagcaa aaagagatac 13501 ttgaccaacc tctcagtgaa cgtgatcttg aaggcctgtg cagaggccag gaagagcctg 13561 gcacttacct ccttgtggtt gactcttagg gtgtctggaa gggaacattg ttggggggaa 13621 gagagtgagc atatcatgaa gggagctgta tgctgcagcg aggaattgtg ggtattctct 13681 gcaaaggtca tagggagtga acatgaaaca gggcttattg aaggaataag ccccaggtgg 13741 cggagaagga acagaggtct gtaagcagga gttttggaga caggtcagga gttcatttcc 13801 aaactgttga gtttgagagg tctgggctgc ctgtgtagag tggaccagtc ggcagtttgt 13861 agagacggcc tttaaagccc tctccttagc ctcctctgag tttgcctcag gcagaagtga 13921 gccccagtgg agtaccggga ttctgatgga acctcatctg tttgaattac tagcccagag 13981 ggtcatcact ctttacctgc aaacagtacc ttctctgatg tctgggagag gtggtttatt 14041 tcccatatac ttgttaagtg tagatcttgg ggaagaacaa ctaacaccag aaacatcaca 14101 tgttggctgt tggggaggtg cttgtccatt ttgtttccct tttatttttt tcccaatcaa 14161 cagagatcca gttagaagga gcagcaagac cttccaggag gccatgctgg aaggacatgg 14221 ggtgaggagg atacctaacc ttcccttgcc tcagtttctc atctgtaaaa aggagggtga 14281 tagtatcatg gatctcacca gaatactgga gagctgaaga aagatgatgt gtgtaaagtg 14341 caatatcata gcctgagctg ataagaaatt aatggaggcc gggacgtggt ggcttatgca 14401 tgtaatccca gcactttgag aggctgaggt gagtggatca cttgagacca ggagttcagg 14461 accagcctgg ccaacatggt aaaacctcat ctccactaaa aatacagaaa ttagccaggc 14521 atggtggtgc acgcctgtaa tcccagctac ttgggaggca gaggcataag aatcacttga 14581 acctgggagg cagaagttgt ggtgagccaa ggtcacaaca ctgcactcca gcctgggcaa 14641 cacagtgaga ctctgtctga aaaaaaaaaa aaaaaaaaga agaaatagag accagaggca 14701 ggagcatgag agggtagtag gatcaagtat tttggggcta cgctacagcc aaggtatcaa 14761 actagctgct gctggtcatt gttctcagga tcctactaaa taaccctaaa tagcacttac 14821 tgtgttaagt gccagccaat gtccaagcct ttacttcatc tcactgatgc ttcacaacag 14881 tcctgtgagt gtggtactgt tgtttatccc cattttacag atgaggaaac tgagatgtag 14941 aaggttaagt aacccaccca aggccaaaca actagtgaga gccagcacca tggtgcaggc 15001 ataggtggtc tagtttcagc gtctgagctc ttgcccacta acttactgcc tcacgctgat 15061 cctgatgcag cacccactct agattggcat gtggcctcca gtcagagact catttaattt 15121 aatcacatgg tgcccacctg gcccttgtag gtacttgccg gagctgctga ggacaatgct 15181 gacaacctag ttggcctcat gagtagttgt gtacgaacat agaatatacc ctcaccaggg 15241 gtcttgcttt gttgcccagg ttggtctgga actcctgggc tcaagcgatc cttccacttt 15301 ggcctcccaa agtgttggga ttacaggcgt gagccaccac acctggcatc gtaccatttc 15361 tttttttttt tttttttttt tttctttttg gagacagagt ctcaccctgt cgcccaggct 15421 ggagtgtaga gccatgatct cagttcactg caacctctgc ctcctaggtt caagtgatcc 15481 tcccacttca gcctcctgat tagctgggat tataggcaca cgccacgatg cccggctaat 15541 ttttggactt tcagtagaga tggaggtttc actgtgttgg gcaggctggt cttaaactcc 15601 tgacctcaag agatccgccc accttggcct cccaaagtgc tgggattaca ggcgtgagcc 15661 accacacctg gcctcgtacc atttcttgat aagcacacag cctctgttag atccctccaa 15721 ctgaaatttc ttttctgcca ttgccatatt acctcatttc ttctccccct ttctctttat 15781 tcctttcttc tccattttat ttgtgacttg gaaacctatt tgagtcattg ttaagtaggc 15841 cggtcatgtg cctgtataaa aggtctcctt aacccatgta taatacgaaa ggagactaga 15901 catttagaca aagaaacatg ctctaaaact cttgaaagta ctcagaaagg ctcgttagaa 15961 tatctatcat actctccaac tcatctcttt agtttgctga ttaaacctga tttttttttt 16021 ttttttcttg agacggaatc tcactctgtc acccatgctg gagtgtaaat ggcgtgatct 16081 tggctcactg caacctccga ctcccaagtt caagcgattc ttctgcttca gcctcctgag 16141 tagatgggat tacaggcaca tgttaccacg cccagctaac ttatattttt ggtagagacg 16201 gggtttctcc atgttggtca ggctagtgtc gaactcctga cctcgtgatc cgcctgcttc 16261 ggcttcccaa agtgtggcaa ttacaggtgt gagccactgc gcccagccta aacctgattt 16321 taataactga cttctttttc ttttcttttt tgagacagag tctcgctctg tggcccaagg 16381 tggagtgcag tggcacagtc tcggctcact gcaacctctg tctgggttca agcaattctc 16441 atgcctcagc tacctgaata gctgggatta caggcacaca ccaccacgcc cagctaattt 16501 ttgtattttt agtagagacg aggcttcacc atgttggcca ggctggtctc gaactcctgg 16561 cctcaagtga ttggctcacc tcggcctccc aaagtgctgg gattacaggt gtgagacaca 16621 catcggccca atatctgttc tttatgcaag atgaatgttt cttctacaga ggttatactg 16681 gtttggttct ctgggcagta tttattatga tttttcatgg acctttgtga gtttcaggga 16741 cctgagccct aacaagcagc cccactttct atagggaaat gcctgaagaa cgccaagcct 16801 cggacttaaa atcaacttca ggaggtaaac aaaccaattt agattaggga tttctttctt 16861 tttctttttt ttgagacgga gtcttgcttt gtcgcccagg ctggaatgca atggcgcagg 16921 ctcggctcac tgcaacctct cctcccacat tcaagcgatt ctccttcctc agtctcctga 16981 gtagctggga ttacaggcgc ccaccaccac gcctggctaa tttttgtatt tttagtagag 17041 acggggtttc accatgttgg ccaggctggt ctcgaactcc tgacctcagg tgatctgctc 17101 accgtggcct cccaaagtgt tgtgattata ggcgtgagcc accacgacta gccagattag 17161 ggatttcagt agtgagcaaa actgccagat accctgtttt gttttctctg attctgcgtc 17221 gtaccttcca cagcattaaa ccccctgacc tgtaattgca gctcattcca catttgttga 17281 atgaaatctg gtcaataccc taatacctaa acattgagcc gactaatacc agagatacgt 17341 gaggtattgc tctaacatgg cacaggccag aagggctcat ggtttggctg gtggccatga 17401 ggcaagcctc gctggaaggg aaacagattg gaatgtagtg actgtaagct ccctcatccc 17461 ctctttgggt gggaggccag cttcctttgt cctggctgag atcagggaga taactgctgg 17521 aaccaaacta gacctttctg gtgtatggtc ctcaatttga attgcctttg ggtttaccat 17581 gacagctctt tccttttggg ttattttgac ttgggtcagc ctattaaagg tggtagcacc 17641 ctgtagaggc ctcgggatgc tgaagatgcc atgagaagac acaggcattc tgcagacgct 17701 agacaatttt agtggcagtt agtgtcgcag agcagtatga atgtccagag cctgagggtt 17761 gaagccagga ggcagagggg ccggtcctga ctcccaggaa ggaggcctga gatgctgctg 17821 aactgttggg gcttctgcga ttacccagtc ctgcaactta gtgtgtcctg gtggccacaa 17881 tgagaatcag atgctgtggg ttctctaaag ccccgaaagt tctgtgaaag ttacctatag 17941 gcttgtgcat tgatttagag gtattttcca ttcaatgaca gtgtttcctt cctgccgtcg 18001 ctctaggttc ctggtagagt tacacagttg tgccgccagt atagcgacat gcctcctttg 18061 acgttagagg gcatccagga ccgtgttctt tacgtattga aactctatga caagattgac 18121 ccagagaagg taactgatgg tggtttgaat ttttttcttt aaatatagag tatttaatta 18181 catgctagaa aatgcaaagg gaaagatact cagtaaaaat ctccctggct cccagacacc 18241 caggcccatt ctgggagata gccagcgcta ccagtttcat gtatgtttcc aaaggtactc 18301 aatgcatatg ttaaagaagt atacacacgt gtgtatatta tatgtatgta tctctcaccc 18361 tttttaaaaa acacaactgg ccgggcactg tggctcatgc ctgtaatccc agcactttgg 18421 gaggccgagg cgggcggatc acctgaggtc gggaattcga gaccagcctg actaacatgt 18481 agaaacttca tctctactaa aaatacaaaa ttagccgggc atggtggcac atacctgtaa 18541 tcccagctgc tcgggaggct gaggcagcag aatcacttga acccgggagg tggaggttgc 18601 agtgagccga gatagcgtga ttgcagtcca gcctaggcaa caagagtgaa actccatctc 18661 aaaaataata ataaataata aaaacacaag tggggccggg cgtggtagct cacgcctgta 18721 atcccagcac tttgggaggc cgaggcgagt ggatcatgag gtcaggaatt gaagagcagc 18781 ctggccaaga tggtgaaacc ccatctctac taaaaataca caaattagac aggcgtggtg 18841 gcagacacct gtaatcccag ccactcggga ggatgaggca gagaattgct tgaacccggg 18901 agggagaggg tgcagtgagc cgagatcatg ccactgcaca ctgcagcctg gatgacaggg 18961 tgaaactcca tctaacaacg aaaaaaaatt gaatctgttg tgtaggtaag gcagtttttt 19021 ttttgatttg ggtttttgaa gaatcattcc tggtgatcct gaggtctaaa ttagagtgat 19081 agggtgatag taaacacttt ggccaggcgc agtggctcac gcctgtaatc ccagcacttt 19141 gggaggccaa ggtgggtgga tcacgaggtc aggggttcga gaccagcctg accaacatgg 19201 tgaaaccccg tctctactaa aaatacaaaa attagctggg cgtggtggtg cacgcctata 19261 atcccagcta ctcgggagtc tgaggggtga gaattgcttg aacccgggag gccgagtttg 19321 cagtgagccg agatcgtgcc attgcactcc agcctgggtg acagagcaag actttctcaa 19381 aaaaccaaaa aacagacaac attagagtga aatccttcat tagctatgta tatattgttc 19441 acttcaaaac agagtaagat aaagcaaacc atatacatac tagcataggt tacatataaa 19501 atgtcagtta tagcctctgt atgttaaaaa aaattcgatg tagtcacaag gacagaaaac 19561 caaacactgt gtgttctcgc tcataggtgg gaactgaaca atgaggtcac ttggacagag 19621 ggtggggagc atcatttacc agggcctgtc aaggggtagg gggcttgggg agggacagca 19681 ttaggagaaa tacctaatgt aaatgatgag ttgatgggtg ccgcaaacca acatggcaca 19741 tgtataccta tgtaacaaac ctgcacgttg tgcacatgta ccctaaaact taaagtataa 19801 taaaaaaatt caatatagtg attaagtggc attgtaccac cctgttatct tgggaagcag 19861 atgctttcat ggactaaatt ttgtgtttgg tgctcctttc ttgcagcttt cagtaaattc 19921 tcattttatg aaagacctgg gcttagacag tttggaccaa gtggagatta tcatggccat 19981 ggaagacgaa tttggtaagg ctcagtcttc tatctacaca ctgcgatttt tgattaaggt 20041 ggattaaact gaaagaagtg gtgggcgtct gaattggaat tagaattaaa tattaataga 20101 aacagtagca tccactgccc ttcacacaca ccttcctgcc cacttaccat caggtgaagt 20161 aagctttagg gatggtacat ttattcagag agcaccttgc acaagcccag cagagtgccc 20221 agtgtggtgc caagagccga atgaccagag atggttccag gggactgatt cttcagacag 20281 ggttctaaga caaagggcaa aagcctcccc ggaacacatg ctgatgacat gtcagcccct 20341 cctgagctcg ggtaacgcga accaggtaga agtaatgtgt gccaggacaa gagacgcacc 20401 cagagcctct ggccttggtc tcctatcctg gctcacaagc acactccaag tttgggactg 20461 tgtgttgggc aggtgagacc tttataaggt tccatggctg tttctgcgtg agttggctga 20521 ataggaaatc aaaggacttc tgtgctgttc tgcagagcag ggcctagtta aaattcctca 20581 cagatgatgc tgactaggat ggaatccctt tgcctagtgc ttaagagtac caggtctgga 20641 attgagctta gtcaggtatg agcctggaca agtaactgta tctctcaaag cctgtttcct 20701 cagctgtaaa gtgggggtga taatagtagg aaccccagga ttggaattgc tgtgagaatt 20761 agagatgatg tgtaaaatgc tcagcacagt gtttgggcca aagtaagcta actaccaaaa 20821 cccagccatt tataaaaacc caaccatcat aaaaaaaaaa atgcatgggt gacgtgtgtt 20881 acagcacacg aaccttgaga acgtgctgag tgaaagaagc cagacacaga agatgacacg 20941 ttatatacca tttacatgaa atgtcaagaa gaggcaaatc cagagataga aagtacacga 21001 agtacacggg tggttgccag agctagggag gatggggagt gactgctgac gggcactggg 21061 tttctttttt tttttttttt tttttttttt ttttattgat cattcttggg tgtttctcgc 21121 agagggggat ttggcagggt cataggacaa tagtggaggg aaggtcagca ggtaaacaag 21181 tgaacaaagg tctctggttt tcctaggcag aggaccctgg ggccttccgc agtgtttgtg 21241 tccctgggta cttgagatta gggagtggtg atgactctta acaagcatgc tgccttcaag 21301 tatctgttta acaaagcaca tcttgcacca cccttaatcc atttaaccct gagtggacac 21361 agcacatgtt tcagagagca ccgggttggg ggtaaggtca tagatcaaca gcatcccaag 21421 gcagaagaat ttttcttagt acagaacaaa atggagtctc ctatgtctac ttctttctac 21481 acagacacag caacaatctg atttctctat cttttcccca catttccccc ttttctattc 21541 gaaacaaccg ccatcgtcat catggcccgt tctcaatgag ctgttgggta cacctccctg 21601 acggggtggc tggccgggcg ggggctgcac cccacctccc ggacggggcg gctgccgggt 21661 ggagacgctc ctcacctccc agatgggctc ctcacttctc agagggggca gccgggcaga 21721 gacgctcctc acctcccaga tggggtcgcg gccgggcaga ggcgctcctc acatcccaga 21781 cggggcagtg gggcagaggc gctccccaca tctcagacga tgggtggccg ggcagagacg 21841 ctcctcactt cctagacggg atggtggccg ggaagaggcg ctcctcactt cccagactgg 21901 gcagccgggc agaggcgctc cccacatctc agacgatggg tggccgggca gagacgctcc 21961 tcacttccta gacaggatgg cggccgggaa ggggcgctcc tcacttccca gactgggcag 22021 ccgggcagag gggctcctca catcccagac gatgggcggc caggcagagg ctgcaatctc 22081 ggcactttgg gaggccaagg caggcagctg ggaggtggag gttgtagcta gccgagatca 22141 cgccactgca ctccagcctg ggcaacattg agcactgagt gaacgagact ccatctgcaa 22201 tcccggcacc tcgggaggcc gaggctggca gatcactcgc ggttaggagc tggagaccag 22261 cccagccaac acagcgaaac cctgtctcca ccaaaaaaat acgaaaacca gtcaggcgtg 22321 gcggcgcgtg cctgcaatcg caggcactgg gcaggccgag gcaggagaat caggcaggga 22381 ggttgcagtg agcagagatg gcggcagtac agtccagctt cggctcggca tcagagggag 22441 accatggaaa gagagggaga gggagaggga gaccgtggga agagggagag ggagagggag 22501 accttgggga gagggagagg gagagggaga ccgtgggaag agggagaggg agagggagac 22561 cgtggggaga gggagaggga gctgggtttc ttttacacaa tggtgatggt tgcacaatgt 22621 tgtgaatata ctaaaaacta ccaaattata tggtatatac aaggaggaat tttctattat 22681 gtgaattctg tctcaacaaa caaacaaaca aacaaacaaa caaacatacc tccaaagcat 22741 cgtacccaac ctgtatgcat actgtctaga agctaactgg cccacaacct atatgggatt 22801 caaggttgct gaaagttttg atatacttgt agctgttgta gaagggtcag gtgtgttcag 22861 ttctaaagga tttttttttt ttaactctct cacattgttg tatatttttc ttttagggtt 22921 tgaaattcct gatatagatg ctgaaaagtt aatgtgtcca caagaaattg tagattacat 22981 tgcagataag aaggatgtat atgaataaag tatcaggtaa atagacttaa aaaatgatgt 23041 tcacccatag atttgtctgt ggttcttcaa taaaggaaac ttaagaagaa acagaaaatg 23101 gaaatgcttg accaagattt ttcactactt ataaaatgca agccctttcc tgttgtctgc 23161 cactcaaagg ggcccagccc tggtggtgat ctttgagagg ttgagaactg tcattcctgg 23221 atagaactcc cttgattgtc agctgtcctc acaaaagcag aacattgtac aaagtactga 23281 acagggtaac ctggaaacag acctctgtct ccctcaatta ctcagccaag gtaagcccag 23341 cctgctgggt aagacactga agtgacctgg ctcttaagta ggcaagagag gcctcatgta 23401 taagaggatg gactttggag tcagagacac ccaagtttat tctcagcttt gccacacact 23461 ataagcctgg tggtctcagg caggctgctt agctctgtaa gcctcagttt cctcatatgt 23521 aatatggtgg taatgatagg acctacctgg ggcttttgtg aaagcaaaat tgaagagagt 23581 atatatatca cttgatataa tgatgtaaag aagcctctca gtaaatagta gcttttgtta 23641 cttgggacct agaataatca aataagactt tttgaattaa aacattgtat tagtgttttt 23701 agtctttagc ttttagtgac cctgcatttt tttttttttt tttttaaaga tggaaccttg 23761 ctctgttgcc caggctggag tgtagtggca caatctcggc tcactgcaac ctccgcctcc 23821 tgggttcaag caattctccg gcctcagcct cctgagtaac tgggattaca ggcgtgcacc 23881 tccacgccca gctaattttt atatttttag tagagatggg gtttcaccat gttggccagg 23941 ctggtctcga actcctgacc tcaggtgatc cgcccagctt ggcctcccaa agtgctgaga 24001 ttacgggcat gagccactgc actcagccag aggctgcatt tttagcacct gaaagatgtc 24061 acttatcatt agtaagaata catcaaaaat aacattgaac ttattttgtc ttttccagac 24121 cctttggctt tgctgagaga ggactcagat gatagtgacg aatgtctggc agtgaggaca 24181 cattttggca ttcttgctga ctctgacaga gtgattctga tggacttgta tttaaattgt 24241 atgtgtttta ctctttgaaa ataaatctat aaaaccaaca ttttccccca tcttcagttt 24301 ttcaatgatg tctataaagt gcttttttat gtttgtatac tgacattgct taatattttt 24361 aaaggtctga tttgtcattt gtattcatgc atgtcaaaaa caatggcttc cccagccatg 24421 tacttgaagt aaggctgtct ccccaccttc gtatcaggag tgaatatatt catggaaatg 24481 gactttttgc ttcaaaaaaa actttgtttt ggtttccatt agcttattat cagatgtgtc 24541 agctgatatc agcaggtggg actagttttt catcactgtc ctaaagatcc cttttaatac 24601 agtttaacag ttttggagaa tgggaggctc tctttcctcc tggaaggtca gtttgacatt 24661 gccacacatc aaaccacagt ttgatttgtc acacatcatt attggttctg ccccttttac 24721 tcttagcact gtgttggttc cttggggttg atgtaaataa catgtatgca ttgcttagct 24781 cttgaagagt ttagtttgag agttacatca aaataattcc tctcctttta ctgactttct 24841 agtatgggga aggaagagga accaaaggcc tacagtaagt agtgtctttg aggcagacgg 24901 actaaggtga ggggccctca gccagcagag aatctcatgg ctgagagatg aggtggtcga 24961 aggatgggag tcagctctgg gattgggtcc acactcacat ctgcttccct tcctgcaggt 25021 accaatacgc tggttccttc tgggttgctc tagagttatt ccaggcatgt taggcagaca 25081 tagtcctttt tttttttttt ttttttttta acttttattt aggttcaggg gtacatgtac 25141 atgtttgtta tataggtaaa ttatgtgtca tggggtttgg tgtgcagatt atttcatcac 25201 ccaggtaata agcataatac cccatgggtt tttttatttg agaaggagtt tcgctcttgt 25261 cgcccagact ggagtgcagt ggcacaatct tggctcactg caacctccgt ctcccaggtt 25321 caagcgattc tcctgcctca gcctcccaag tagctgacgt tatacccgcc accatgcacc 25381 cgccaccacc tggctaattt tttgtatttt tagttgagac ggggtttcac catgttggcc 25441 agcctggtat tgaactcctg accttaggtg atccgcccac ctccgcctcc caaagtgctg 25501 ggattacaga tgtgagccac tgtgcccagt ccctgatggg tgttttttgg agcctcaccc 25561 tccaccctcc agtaggcccc agtgcctgtt gttcccttct gtccatgttt actcaatgtt 25621 tagctcccac ttaaagtgag aacgtggtat ttggttttct gatcctgtga tagttcactt 25681 aggatactgg ttccagctgc atccatgtta ccataaagga catgatctca ttttttatgg 25741 ctgcatagta ttccatggag agtgtgtgtg gtgtgtgtgt gtgtgtgtgt gtatacacac 25801 tttctttttt ttttttttca gacagtctca ctctgtcgcc caggctggag tggtatggtg 25861 ctatcttggc taactacaac ctctgcttcc ccagttcacg tgattctcct gcctcggcct 25921 cctgaatagc tgggattaca ggcatccgcc accatgcccg gatactttgt atttttatta 25981 gagatggggt ttcaccatgt tggccagatt ggtttcgaac tcctgacctc aaatgatccg 26041 cccgccttgg cctcccagag tgctaggact acaggcgaga gccactgcac ccagcctaca 26101 cattttcttt acccagtctg cagttgatgg gcatttaggt tgattccatg tctttgttat 26161 tgtgaatagt gttgcagtga acacaggcat gtgtgtgtct ttatggtaga atgatttata 26221 ttcctttatg tatatccagt aatgggattg ctgggtcgaa tgcaaacctc gttttctttg 26281 agttctttga gacattcact gtcacccaat ctgagtgcag tgacacgact gtggctcacc 26341 gcagcctcaa cctcccaggc tcaggccatc ctcctgcctc agcctcccaa gtaggcacac 26401 cactgctaat tttttaagtt atttatagag atggggtctc actgtgttac cccggctggt 26461 ctcgaactcc tgggctcaaa agatgctccc gcctccactt cccaaagtgc tggaattaca 26521 ggcatgaggc actgcacctg gccccttttt agcaatagaa agcatatcct cagttatgaa 26581 aatcatacat tgttacacta gacctttttt atttttttgc aatacttagc aacatatctt 26641 ggaaaactgc caaattggag gataaggctc ttcctcattg tttttttaca actgcattta 26701 tttaacaata gcttgagcac ttactaagca ctggtgatca aatagggagt aaacaggctg 26761 tgtgttattc ctttgattgg aacgtctagt agtttaacac tgggggtatt ttggtggtag 26821 tccccaagtt gttctggcta ttttaaatcc catggcagct gtacatttta ccaaaaaaaa 26881 gcgtctgtga ttgtgtctgc ctaagatcac aaagtctgac tgtactgcca cttgagtgcc 26941 tgctttgcta ctatcttcac cagacgcctg cagtgaaaca gcccattcac atcatccagt 27001 agctggtgta agccccaaag aagccagaat gagggaagac atttactgga tctcacacac 27061 acacatcatt catccccaag gccatttagg gtctggttgt tcaagcaata gcaaaggcct 27121 ttctgttctt tctgctctaa gagcttatca gaatgttctt ttgagagact tctgtgccac 27181 tataggtagg cagcaaactg gccaaatgta gcattccaaa gtgtgtgtga attcaagaac 27241 aggataaatg gaattgtatg catatggtat ttaatagctg ctgaaaacac ctcagaggat 27301 ctgtgcctct gttgtacacc gttcccactg ggaaagggcg gtttgtgaag cctgtcatga 27361 cccagtccca gtcctgtctc cagccatccc caccatatct ccagtggagc tgcttcagtc 27421 ttctcactgt gttctccata cagcaagttc actcactcgc tccacacaca aggaagtacc 27481 cactgtgtgt taagcactcc tggagcctga gtacacagtt tcacaccata tagtttgtct 27541 cccatctcca ggagctcacc accttgtaag ggaggtaggt acaacctagg tggaggtatc 27601 atactcagca tcaccaatga tacaacaagt caacctccta tgctgctcct gatgtgaggc 27661 actgaggtca tgtcacctct gaggcactct tgctaaaatc atctaacctg aatctgatta 27721 tgaggaaaca aatccagaag gtgggagttt ataaggcctg gactttaaaa agtgtcatga 27781 gaggcccggc atggtggctc atgcctgttc tgttaatcct agcactttga tttgaggcca 27841 ggagtttgag accagcctgg gcaatatcgt gagaccctgt ctcttcaaaa atattaaaaa 27901 ttagctgagt gtggtgtcac ctgcctgtag tagcagttag tcccagatgt ggagattggg 27961 ataaagagag atggatgggc aagagattct gggaataaaa tggcagaacc tgggaggatc 28021 acttgagcca gggaattaga ggctacggtg agctatgact gcaccatggc actccagcct 28081 gggcaacaga gggagactct gtctctaaaa aagaattgaa aaataaatag gcggggtgca 28141 gtggctcaca cctgaaatcc cagcactttg ggaggctgag gtgggtggat cacctgaagt 28201 caggagtccg agaccagcct ggccaacatg gcgaaacctc gtctctacta aaaatacaaa 28261 cattagccag gcatggtggc aggtgcacgt aatcccaact acttgggagg ctgaggcagg 28321 agaatcgcct gaacccggga ggcagaggtt gcagtgagcc gagatcatgc cactgcattc 28381 cagcccgggc aacagaaact ccatctcaaa ataaataaaa taaaacaaat aaatggcaga 28441 atctgatcag ttgtgtctaa gctatctccc aacataccag tactgggtga gttgcggtct 28501 aaagcaggta ccagaggatt ggatttgatg cattgaattt gaggtgtttg taggatacct 28561 gagcaggcaa cagataatga gcaacagttc aagaaaaatg tcaggcttgg agtctgagcg 28621 ctggtgaggt gcttccccag tgccggctcc ctcctgctaa gttcccattg aggagtcccc 28681 tcaccctact ctaggaagtg gcttaagtgg gaatgattgg ccctctgccc ctgggatgcg 28741 cacttaagta cttgagcgta atgtgaattc ttcttcaggg atgatttggt gctggggttg 28801 agaagctagg agaactgagt gaaagactta cctatcacct gaaattcttt atcaagtgca 28861 tgtgttactg ttttaatttt taaagtggta gccaactaat aaataaagcc aattaaacaa 28921 atgtaactga caaattaact ttattaatat gcaaagctaa ttaataaagc caacaaacaa 28981 atgccaaggt tcaaagttta aaaaagcaac tcacacaccc aacttagtgc cctgaagaca 29041 cacagcaaaa ccaaggaaag aaaaaaattt gaaagcaaaa taaagcagct gggttctttc 29101 ctaacttctc aagtaccagg ccagtatcta agagcacaga ctttagcatc aggcaaagct 29161 gggctctgaa ttgtagcttc actcggtaac aggcatgtga tgttggctaa gccatctaac 29221 ttctctgaac ctaagatttc ctctagtggc tgaagtgagg gtcacaggag ggatgtggtg 29281 cagtgttgac tgtataatgc aagtaacatc cttcacactg ttaagacata aggaagaagc 29341 tgatgtcaca ggacctgaat gcagagtgtt tagagcaagt acgtagtgga agagcgccca 29401 aaggtgatgt ccattcttcc ctccagggct ggaggcctag agaaacagca tttgcctggc 29461 acacagcaag aggtcaaaac accctgctag gtggtctctt tttttttttg agatggagtt 29521 tcactcttgt tgcccaggct ggagtgcaat ggcacgatct tagctcactg caacctccac 29581 ctcccaggtt ctagtgatcc tcctgcgtca gcctccctag tagctggaat tacaggcatg 29641 cgccaccacg cctaattttg tatttttagt agacatgggg tttctccatg ttggtcaagc 29701 tggtctcgaa ctcccgacct taggtatctg cccgcctcag cctcccaagt gttggcatta 29761 caggcatgag ccatcacact cagcctagaa gtaccagata gcagattctc cagatgggca 29821 tcagccaacg tatagcatgg gatacaatcc caatgatccc cctctcccat gtaggagtca 29881 gatccctaag catccagccg tattgtttca ggattatgag gtactgtgaa aaagtgaaga 29941 gctgaccccc tccgcctgct aggaagaagt gagaccagca gggacaaact ccatcactct 30001 tcatatacca aggattctgt tttagtcccc tgcctttttt tttggcgggg gggacaggat 30061 ctcactctgc cacccaggct ggagtgcagt ggcacaacct cagctcactg cagcttctgc 30121 ctctggggtt caggcgattc ttgtgcctca gcctcccaag cagctgggac tacaggcacc 30181 tgacaccaca tctggctaat ttttatattt tctagtagag gcagggtttc accatgttgg 30241 ccagtctggt ctcgaactcc tgacctcagg tgatccatcc acctcagcct cccaaagtgc 30301 taggattaca ggcgtgagcc accgtgcctg gccgatatct gtattttttg tagagatggg 30361 gctcttgcta tgttgcccag gctggtctca aactcctggg ctcaagtgat cctcccacct 30421 tggtttccaa agtgctggga ttaccggtgt gagccagtga gcctggccca ttttttggta 30481 acagattttc ttcctcccga ctcctgggtc tttatagctt gtggctaaat ctgtagctcc 30541 aatctcagtg ttttctaaaa catgtcctca aaggctaaat ggctaggtgg ggttgcccaa 30601 ggcactagtc aggaggtgtg agtacaggga ggcagcttct cttccaacat gtggaagcca 30661 gggcccacca aggagcagac catgctgctc caacaggagt gtggctcact gtcacaccgg 30721 tcactttcac cattttgcag agtaaatgtc aactgtttca gcagagaata catttgaata 30781 tacttaaagc caaataagac cacaactggc aaagaaatgg ctccaggtcc aattccaaat 30841 gttggttaat ccccagtatg gattaggcgc atgaagaatt ggaaggaaaa agagacgcaa 30901 ttaattatca aactttaatc gattgactaa tcttacagga taaacaataa gacagctatt 30961 caaggttaca ctttaaaaaa aaaatccagc agtgcaaatc taatagacta aatacgttca 31021 tctcctccca aacaggcatt tgatttgaaa acagaaacca gaacagagca tacaccacaa 31081 ggaaacatgg tacttacaaa tattgtcaaa taattcagca gaactcttta agaaaaaaaa 31141 aaggttctta gaaattgaga aagcagtaga accacccagc ctggcagatc aatctaccag 31201 atgttctgca tgcgagaggg ctcccaacca cctgtaactt cagaaaggcc gccaggcact 31261 attacgaagg aaacaggtgc agatcctaac tggagatgac tttatcacaa agccttaaga 31321 caaagggacc atcctcttcc aagacaacag gtgtgaacag aggcaaggag cgagaggaga 31381 gggctccctc agcagatggg tggaaagcaa gcctgccctt gcacggccct tggcacactc 31441 aatctacccc ctttcccttc tctgcccctg gcaaagaaag gctgggcaag tgcaagagcc 31501 cagaatggct cctccaggca catggttaaa gtaagagtaa gaggaaaacg agccccttct 31561 gggtgaaata tggcaacttt tctgagacat ggcaacgagg cagagttaat ctctttggcc 31621 ctcatcaggt aaagcagcaa ctaaggagcc tactcagggc cttacgtggg ccccttctat 31681 gtccacaaga aagtgcatcc ttagctcagt tcaacattta aaaagagcta acgctacttg 31741 caaaaggccc accatggatt caaacactta ataccaaaga ggtccagcaa ctcgcgagtc 31801 tgtctgtgtt ctgtgatctg aagcagagct aaggggggcc tggctcacag ggtcacgctc 31861 ttctgccaag gcgcctcagg gagggaaaaa aaggcatgat gatgtctgtg agccagaacc 31921 tgcaggagga ccaccggtag agattctaga agacaaagaa gccctcagga ggctctgaga 31981 ggtgtccaat gtctaagaga cggccacccc ctccatctct agcttcgagt caacgctgtt 32041 acagtcactc tccggaaacc agctcttcct ttatccacct gcacggcccc tgccacagtg 32101 agggaaagat ctggtccctc tgacagtaag gaaattgtat agaaattatc ctgaggaaaa 32161 aaaaaaccac tgtgacgcta cctaaaagaa agagtatttc aatttgctta cacagattta 32221 gtcagtacac tccagactgt aagaaatgta tgaagagact gcttacggaa ctcaaagact 32281 aaacagtgaa tactactgtg gcccagacac ggatcgaaaa cacttctccc agtggcttct 32341 ggtgctatgt ctgctctccc gagagcgtcc agcaaggtct tccatgcaga cagggtgagg 32401 agccacagag ctcaagattg ttgaagctga tttagtgagc tctgtaaagc aatactcaaa 32461 cagctccgta cgaaaaggtg tcatttacat tattatttat atatagtatc tcgaagtaag 32521 gaaaaaagca ccattttact gaaaataaaa aggctcacgc aggctttggt cctcctgaaa 32581 tgacaccagc ctgagtgaga agcagttggc ccttggggag cggggagatg atggaacgct 32641 gtcaccggca agaagggagg ccaagtggcc cggcaagcca caagaatggg gcaataaggg 32701 gagcaggtgg aggggcctcg gagagcagcc aggctagtct ggggatggag ccacctgctc 32761 tgagagatca aaagtaggtg tgcggcactg tcaggacaag tctctaggac tggggcattg 32821 gcaaacaaaa ggggcaatgt ctgacccgag atgatgttct acttaatgca tctcatttcc 32881 cccaaaacct gtcacctgca gacagaggct tggacaagta gctcagatgt actcttttcc 32941 ccaggaactg caggccgaca aggacagaaa agctgcaatg actgctcctt tctcaaggga 33001 tgaaaaaaca ggctgtttct tggatgctgc acttcacaaa tgtactaatt gcccccaaaa 33061 aagcaaagag tagattttcc tgcctagatg catcacttgg cattgactac accatggaaa 33121 gcaagagtct gtctatgtgg gctggattcc tgaactacag cttagaaggg aagggcgtag 33181 aatatcatca ttaactggct cacaaggaat agtcatcttg ggcgaacaca tgctaatacg 33241 tttcacagaa tgaggggtga cagcctagaa tccaaaatgt gatgtttgga acatttctcg 33301 ccaaacctta ctatgaaagc ctgcaaccag tgagaggaga actgcaaaga cggcagcaga 33361 gaaacagcaa acagcacaaa agagataatt taccaagaga ctttccctcc tttcagaggc 33421 aaccaagcca gaaggagatg acagaagttc acgtgtacag agaacagaag ggctaaaacc 33481 agaagaaggt gagctaatcg tgaaagcaat ttatcccatt tttgttgttc aataacattg 33541 gccaacctgc tagggtcaga tgtcttcttc agaaaggcag catctctgaa cctgggataa 33601 aggacacttg aactgaatgc atgagcagaa ggacccaatg gagaagcata attgaaatca 33661 tccttggact gcttaagaat ttcttgaaaa cttccaaaga aagaaggtta tctggatctg 33721 cagctatatg gctgtataac tatcttctac ctggctcctc ctggaagagg tctttgtctg 33781 cttgggagct ggggctgttg agcggcacag cccaggcaca accatgccag gctgggcctc 33841 taggccaagc tcagtgaggg gtgcagacaa tcatccaacc agcaatggaa ctccttggag 33901 atgggggccc aaagggcaag acatttaaac aggagtcact gggagatact ctgctgctca 33961 acgaacacca cagcaagtgt gagagctcct tgcccaagac tgagcagctc ttcctctccc 34021 accgtgcgtt cacaggcagc agcagcacgg cgggagggag cccaggaaag gaagcgttga 34081 tgaggtacca aaagaagaac tgacatcttt gggttcttaa aacaaaaact caaattgagg 34141 aagcgagaga atgaaaaagg gaaagaatta aatctgaatc aatggacaga aatatgtggc 34201 ttactaaccc acccccattt tgcatacaat tttatacaaa aaaggaaaaa aaaaacaccc 34261 caactcattg caatgcagtc caaatgtttg caaccaaaga tgtatgaaaa tatttcaatt 34321 gaaactgctt tttttttttt taaaaaaaca aagcttttga tcattccctt gtcaacataa 34381 ttggcagatc tactctcaaa cacacttaag agtattccac acaacatgtt cggaattcag 34441 atttattgat ttccattgta ttgactttaa aaatggtatg aagctgactg caagaaaggg 34501 aactggtaat tttaagttaa aattaagcta atggtgaaag tacatcacct ctggcttcag 34561 tcaccacatc acctcagatt gaatatagat tttttatgaa caaagttcca gaactctgaa 34621 atccagggat gctgcaggcc tctcccggca atgtccttca ctttgggtca gtctcctggg 34681 ccagaggtga aagtgctttc aaaaatactg ccatttcccc agcacagtgt ctttgattgc 34741 atccacatat tgagttggaa cccagtacac ccagtagtaa gaggcttccg tggggcccaa 34801 ctgaaacgcc tggaagggaa aaaggaaaac caagcggtac ttcagaggtc gggagaccaa 34861 gcaggcagga caaacatggg tgagggttgg gatgtgagtt aattaaaaag ggaagaaaaa 34921 ccaaacaaaa ccttgcacgt ggggagcatt ctcccaacca aaaacaaaaa caggcaggct 34981 gagggttaag ctgagccaac cctaaagcag cagttctcaa acctgagtgg tcatcagagc 35041 catccggggg gcttgctgga acatggatca gcaggctcca accctgtgaa gtttctaatt 35101 tggggaggtc tggggtgggg cctgagaaac tagcaagctc ccaggtgatg caacactact 35161 gatccaggga ccacagttgg agaaacgctg ccctagaaaa cctgtcacac atgctcaaga 35221 aaacaggtat gaatgttcat tgcagcactg tctgaaacca caaaaacctg gggcaaccta 35281 catatccatc agtaagggaa tgcagaattc aaagtgacaa ggcttagtca caccaatact 35341 ggagctcttt ttaaagaggt atcaactggg tccatatgga tctacagtta atactgacag 35401 attcttatgt tgaaccaaag aaaagcaagt tgtagaacaa tacataatac catttacgaa 35461 aactggaagc acaaaacaac actccatctt gttcgtggat acacgtgtaa atcaaagaat 35521 ggatagatgg gaaggacgca cgtgacacat gacgatggta atggggggag gggaagacag 35581 tggggccatc attgctgtga ataaacttta tttcttttat ttaaaaaaat taaagcaaat 35641 tacaaaatat ttacaattgc cacttctcag tgatggatat atgggtttgt gacagttatt 35701 ctctttcttt tctgtatttt atttttattt tttctgagct ggagtctctc tctgttgccc 35761 aggctggagt gcagcggtgc aatctcagct cactgcaacc tccgcctccc actttcaagc 35821 gattctcctc ccttagcctc ccaagtagct gggattacag gtgcgcgcca ccatgcccag 35881 ctaaatattt gtattttcag tagagacagg gtttcaccat gttggccagg ctggtcttaa 35941 actcctgacc tcaagtgact cacccacctc ggcctcccaa agtgctggga ttacaggtgt 36001 gagccactgc gcccagcctc ttttctgtat tttaaaaaca tttctctaaa aatgaaaaag 36061 cacagccaca gttttccaac tgtgaatgat gacatgttaa tttagctaga gaaatagcat 36121 ctgcccaaag agataactaa gtactttcat taggacaaag gtaaaggtgt ttggaaaaga 36181 gaaggggact actagagcta tcctctgcca gaaataactc gagagaaatg aacatttcaa 36241 tgaaatacta agtcattacc catccacacc aattcctcca aagacgtcca aaacagtcca 36301 ctcaacttcc ttggactccc taccaccaca ctgcttgaaa agtcaggcta caaaccctac 36361 agggtgtgac tggcaacaca aactattcaa aaaggagtga tggggcccag ttaaaaacat 36421 tcaagtccaa ctgttttttt ttgttttgtt tttttgtttg ttttttgaga cagagtctct 36481 gtcacctagg ctggagtgta gtggcacgat ctctgctcac tacaaccttc gcctcccagg 36541 ttcaagcaat tcaagcagtt ctcctgcctc agcctcctga gtagctggga ttacaggtgc 36601 ccgccaccac gcccagctaa tctttgtaat tttagtagag acggggcttc accatgttgg 36661 tcaggctggt ctcaaactcc tgacctcatg atccgcccac ctcagcctcc caaagtgctg 36721 ggattacagg catgagccac cacgcctggc ccaactgtta aaacagaaaa aaataactga 36781 ccaggccaaa ccaatttgtc tgcaatcctt gtaataaaga ctacagagag agggctgggc 36841 atggaggctc acgcctgtaa ccccagcact ttgggaggct gaggtgggcg gatcgccagg 36901 tcaagagatt gagaccatct tggccaacat ggtgaaaccc catctctact aaaaatacaa 36961 aaattagctg ggtgtggtgg cgtgagcctg taatccctgc tactcaggag gcagaggcag 37021 gagaatcgcc tgaacctggg aggcagaggc tgcagtgagc cgagatagcg ccactgcact 37081 tcagcctggt gacagagcga gattccgtct caaaaaaaaa aaaaaaaaaa aaagactaca 37141 gagagagtag gggtggaagg gctgccaagg ggcatcatca cttccagaag acaaaacccc 37201 aagagcccca cagcctaggg actgcactcc cagctacagc cttgctacag ccacactagg 37261 caggatgcca acagggctgc ctcccttgcc caccatattt taagcccctt tgattttttt 37321 ctatagcaag aggatcttcc aacacatctc tctgcctggg acttgttatc ccaatctact 37381 ggtcaaacca ctagttcctt cttttccctg ggatgggctg atgtggcttc atcactgtgg 37441 gatggacact gaaaggcctg cctccttgga tgaacaatgg ccggaggccc cgccctcaca 37501 ttacagctgc tcaggccagg cccctgatgg gcaactgcag aatcaacttc gtggtcaatg 37561 tggctcctcc aaagagcact gggaaaattt tggaaactgc ctaagctgct cattttacca 37621 gatgaatgaa ttttttgctt tgttctgcag acctgcacaa cacctccaaa cattctgtct 37681 tgggaaagag tagcctgctc tctgcagagt gggttctatg cagaatagac gctccacaga 37741 gcccgtcttc cccacgaggc cctcagccac tgggcctgga agagcggcat cgttattaca 37801 gaaaagaggc tgtctggtgc cctcaccaac accaaggcta agggcaagtt ttgttttata 37861 agcaagataa aacttataat gaaaatccac aagccattcc tcctctttat atttattttc 37921 ctggtctaga aacaacacac gacaagaggg aggcaagaga gacttttcta aaagaggtaa 37981 gagaaaacat atccccttta aactgatgga aggtgtgtgt gggggggtgt aaaataaaca 38041 gaactaggaa tagaggccat ggattaagtt ctaagtagat gaagaaaaca tcagggtctt 38101 aaaaaagtgc atcttctgga gactgagttt tcttccgtta aacaagctca gttaaaactg 38161 ttccctgcca ggtgaagtgg gagtctagtg agaggggctg aactcagcca gcgctactta 38221 ccatcatgtg gtagtcttca tgtccttcga taggttcact gaccacattt tttatggagc 38281 ccatgggcaa tttctcagtc cgctctaagt tgagagggat tacagtgatt attcaaaggg 38341 ccctcctgtt ctaacttctc tgactcagca gtccctcatc agggctggtg gctgcgggat 38401 ctacagtaaa ctcaagtgac tggcgccatg ccctctgggg acaacacgac agggtcttct 38461 gaccagtgca catagggctt cgtgtgggag ctgggaactc cgtaggagca agtgggaaac 38521 tgggtttggc aatggtatag taaaggtctc tgagggcaga attgtatctt ccctgggatt 38581 attttttaaa agtctgatgg ttaataaata tccaagaaag gccaggcgcg gtggctcaag 38641 cctgtaatcc taacatcttg tgagtcagga tttggccaag gtgggtggat cacctgagat 38701 caggagttcg agaccagcct ggccagcatg gtgaaacccc gtctctacta aaaatacaaa 38761 aaattagcca ggcgtggtgg tgggcaccta taatcccagc tactcaggag gctgaagcac 38821 gagaatcgct tgaacccggg gggcagaagt tgcagtgagc cgagatcatg ccactttact 38881 tcagcctggg ccaaagagcg aaactccatc tcaacaaaaa aaaatataaa ataaataaat 38941 acccaagaaa ttcctttgta accttatgca agcccagcat tgactaaggt cttgggcagt 39001 agggagagtc aagaggccta cgaacaaaga ggcaacacgt tcccctgtgg ctctcacaag 39061 ggaaggggag tgcctggatt ttagggctag attctatacc acgaccctgc tcccttcccc 39121 ggcctccctc ctcgagtcct ccgcactcta tattcacaca cttttatgaa gttctcttaa 39181 aacatttata accttttgtg attaacactt gagataccac tgatcacatg gggtcagaag 39241 acattgaggc gagtttaagg aggatggcag gtaagagaag ggggcagcaa gggatgcctg 39301 gagtgagatg agactgccgg gagtgagcca agtgaagcgg gacttggtga ggtcagggga 39361 ggcgaggtgc acctgcacaa ggggacatgg ccatcctgga gcagcagaca cctcagggac 39421 gggggcgcag ggggaggggg gcggggaggc tcagcaaaca gctaaaccaa gtggaaaaaa 39481 acttgcagcc tttgcagaaa gaaaaatctg gaaagataga gtctaaaaat agataagttg 39541 gccaaacaga ctaggaattt agagaccagc gttctggtct agctggggac ttaagacatt 39601 ttacttcagt tctgtgggcc tcagtatctg aaaaatgggg gctggggaga agggacctga 39661 agatgtttat ttaggtttgg acttctagtg gctctacaac agtcggccct aaagcaggac 39721 ctgggcagca gctgtcttta agatggaaac ttttaaaaca cctttcatcc tgaaatgtct 39781 ggagatttca caaaaatata caacagattt ctggcttctc ttgaaaactc caaaaattgg 39841 gcaattctgg acccacactc ctgatcacta catcagacct tcagccagat tcaactttcc 39901 tacgagcagg gtacatgctc tcctccagcg gcttccggtt tcctgcctgg agcttgttct 39961 cattcttttg aggctgttag aactgcctcc tcctacacaa gctagcagag tttagtcaag 40021 agtagcagcc cttaagggtc acttactcag caaagcgccc cacgaagagg ccaactcgct 40081 gctgggatgg tcattccata gccgcttctt cttgccagca gaaagacgaa gccagtggct 40141 agcagccaac tcccagaaca agctgtcctc ttatgagtgc ttatgacact taggtaacag 40201 gactgggcaa gtgacaagac tgcctgcccc ttgggctcca gaggtacagt gctatgggtg 40261 ggaatgaaaa tcttaacctt gttcaaccct agaaaacatg agtagtccat ggccatgctc 40321 ctcaacaaga ctggtgagta aggaggcttt caagggctca gaagccccaa catattgagc 40381 ttcttctaca gctctccctt taggccaaag caggaaggcg gctgctccta ctagcaaaaa 40441 gaaaacttcc aatacagggt caggaagttc ttgttctgtc agttcctgca gttatgatgt 40501 gttccctata caacttcctg acacagaaat attccccaag gaaccaatga gcacttacag 40561 acttagttcc catccagact tctctgcctg cacctatctt catggcaggg tctttccagg 40621 tcagccgtag caactgcctt caatgaggct gcccacaagc aaaagtagag gtccagcaag 40681 aggcaggaag tctatgcact tcagcaagtg aatgtgatga gacagaacta agcaaaatca 40741 cctgcaaagt tgacagaggc tgggcagctt actactgata aagggaagat ggacaggctc 40801 tacagaagag aatgatctag cagagggttc aaaggccctt gcaaaaactg gggggaaccc 40861 tcagatattc tgcaaaactt gctttgacca tgcagttaga gttgaatgtg aaatgaagaa 40921 aatagcaagt tatcataaaa ttgatagaaa actcaagcat ttttggcatt tcaatttttg 40981 ttcattttaa caatcagtat agggttgaac actgattttt gtgttttaga tgattaaaaa 41041 ctctctcttc ccaccacccc tgccccaatt attctccata aaacttggtc ccattttaag 41101 tggagtctca ctaaatggtc tgtccctgaa accagtgaac gtcagatgat gggaccccct 41161 atacttctct ttagcattta cctcgaacac tgaataccta cagtgctggt cagaaacagg 41221 tgatttaggc atgttcggtt ccctccagga atctaacagg gggacagaca agtcagagag 41281 taatgagtgc tctggggata taaacgaagc gctctatgaa cagaagcggc agcaagtcat 41341 ttggagggtg gggggagaat caaggaaggc ttcattcaga gaggacattt aagtagggcc 41401 ttggaagatt ttattaataa caaggaatgg gaaaaaggca ttccaggtaa gggcacagca 41461 tgagcaaaga caaggagctg tgcaaagcac actgggagct tcacagagag atgcggctaa 41521 aatacacagg ggaccacagg gcaaacgggc cagaagctgg aaccaaaatt taaactaagg 41581 cccagtgttg agaacactga acgctatggt taggcatttg aatgtgtcct taaacagcca 41641 aaggtccttg agcacgggta cccaggataa aaagcagtag gtaagatgga caggacagaa 41701 gaaaacacag atggggcagt gaaacagggt aactcgcttt gaacagacca agtttaaaga 41761 tgcttgggaa ctttaactgg cacaacagtg tggcaaagaa agggacagac agttctgcag 41821 tgtaactaac aagactttgt aaccatctgg gggaagggag agggaggaca gtcagagaca 41881 cttcccagca aacagcaagg ctgatccaga caccgtgcac aggcgggggt ggggcaggca 41941 gagggcaacc aggcggggag ggagaggcag attcctcggc agcggttagg aaaacttcac 42001 ctactatagg gtcaaactct gacctacagg ggccaggcag gtgacaaatg aacaaagctg 42061 gctacttcta agataaatgg gaccatgaag actgtagtat ggcccactgg aaaccaccac 42121 tcggctctag tggatcgatg ccatgtgaga atgcaggccc caaagggcca gatcatttga 42181 gttttcaaga gaagccaaaa aaaaaaaaaa aatcagatta gtgaaatctc ccaattttta 42241 cagattgtgt gtcatcccac aaaacatttt tctgtggacc agatttggcc cttgggccac 42301 gtgtttggga cccccaaatc tgtgtgcaac tgaagagcca ctgtttggtg gatcagccct 42361 aagatcagac tgaaaacaat gagggaaaac ggtccaaaca aaaggcgagc atctaaaagc 42421 aaatttaaaa gccagaagca agcaaagcac agtggagaga gaggtttcag aaggaggggt 42481 gaaatcgcac tctaaaatgt gagtgctgct gggtgccaga ggcaggcagt gggccagcaa 42541 ggaggcgagg aagaacatac ctttagtgcc aatccacagc tggtcttgtt ctagtttaaa 42601 ggtgagtctc acttttcctc cagatttatt gtacatgccg gacagcggta ccgttggcag 42661 gcgctcctga aggcacgaga gaatggtaag agatggaggg acagtcagaa aaaaagagaa 42721 gggggccaag cgcggtggct cacgcctgta atcccagcac actgggaggc caaagcgggc 42781 agatcacctg aggtcaggag ttggagacta gcctgggcaa catggtaaaa ccccgtctct 42841 actaaaaata caaaaattag ccaggtgtgg tggcacaccc cagcctgggt gacagagcaa 42901 gactgtctca aaaaaaaaaa aacagggaaa aaagagaagg gacaggattg gaggaacctc 42961 aagtcagacg gtggggctgc ctcattttca gaatgctcac aaagaaatcc gccttacctg 43021 ggccccctta acagatggca tcacatcttc aggttttcct ttatccaaca ctttcctgtg 43081 ttgctataag tcagaagaga gaatgaaatg aaacagctga ctacatttcg tcgtgttact 43141 attcccttga aagagaggat gaagaaagcc ttagtgtttt ggctcaaaag cttttaatga 43201 cgacagacta tggatccttt gaaactatag cacagtgctg tccagcagaa ctttctggga 43261 gggcgaatat gttcgtacct gtgctagcca ctacctactg ctgtgactat tacacacttg 43321 aaatgcagct actatgactg aggaactgaa ttcttaattt cattttaacc agccacatgt 43381 ggatagctac catattagat atcaaagttc tacagcaaac gggatttcca gattgccctc 43441 ctcccctcaa aaaaaaaaaa aaaaatcacc actgaaatga ttaaaccatc aaactaaaga 43501 gggcgacaaa gggtgttgct ctcctagcag acaaaaaatg gcacccacct ttttcaactt 43561 aaacttccag agaacactat aagatgctgg taatccgtct aactgcctag gaggagcagt 43621 tcttcaacaa atggagaccc agcccatcgc cgaagttttc cccactgccc tacccttaac 43681 tccagctaaa caggaacctg gaagagccac ctttcagaaa gaaatgctta ttactctata 43741 acctgacatt aagaagcctc taaactccac ccctaccacc tgccccaaga actctgaggc 43801 agctctttaa ggattctaag ttcagggccc cttaaacagt tctcctgcct ggcccaaatg 43861 cctgagtctt ctttcttctc catctgtggc cttataggca tagccactgc ttggaaggct 43921 ggaactaggg gacaatgaat ggatagaagg taacaattta atcaaatgta agcactaggc 43981 agccacaagc cagcccaaat cctcactggc atggtggaat ccctctggga ggaataaact 44041 cagcactgcc agactgctag caaaaagtca ctccagacat cacccccaga ccccttaata 44101 agctggtaaa aaggcaaggg actatatccc tctcattagg tcatgtgatg catcataatt 44161 ggcctcaatg aaacacagca gcatgggctg gaagtacaat tcgagtcaca cctttcaagt 44221 accttttgaa caagatttcc tgtactgtgg agaacagttc taagcctgtt aagctaccat 44281 tagtaactaa aaacctgtat ttaggaggca tgttatgctc caagggtgtg accattgcga 44341 ccactcctat ctgacgtgct cacatttcac tatttcatac atgtggcaaa gaaaaacagg 44401 aatatgagaa gaaactcaat cggtttcaag acactgagac cccatatgtg ggggtgaagt 44461 ctccaaatca acacctttta gataattaat ataggtttat aaccccttat ctgaaaccct 44521 agagttttga actttcaggg ttttggaatt tttataaaga taatattgct ttcaaataca 44581 taacaatctt ctctgtgggg tctggggcag taccttgtca ccaaagagga aaattcacac 44641 taattctggt caggttttgc tgttaaattt ataaaaacac ttctcgtttt tatactcctt 44701 tggggttttg gaagcacaca tgggggatta tggatctgta acacaaactg tgtccttttc 44761 ttactgttgg tcatatatct tagtgtctat ggtcactaaa tgatgttgat tgaagtttat 44821 ctactctact tcagacacag ttgagaatac ctgtcttcct tggaacacta ggaagaacga 44881 tcaggaggct acacagcaat cagtataatt tgccaggtaa aaagataaca agtaatgatt 44941 cttacaacct gttccttcca tagtaaaata atctcacttt tcctcgttac gtgaaagaat 45001 gttaatggct atgatgagca caagacaagg caggatctct tgcagccaaa agaacagaga 45061 acgtaaaaaa ccaagctcct atttcacaga tcaggacaaa aaaataaata aacaaatcca 45121 tcagtgtagt tgtcttccta ctcagcttca tttagcttcc tgaaacggtc agctttaggg 45181 gaacctccga gagttgttga tttccatgcc tagccaggct tgttgacttg gggaaagcca 45241 caaaccaaaa agccaaatgt aagcacttgt acaaaatgag ggcttacttc tagtactggt 45301 agttttctct ggcactaaat aagagagaga aacccacgtc tgattccttt caacacatgg 45361 tctgaagcct ccctttacca aggtcttatc ctaaagtagg tagacttgag acacactgac 45421 ccgccagatg agacaaagtt gcaaatctca ggtagttgtc acctccactc tgcggcaccc 45481 aactgctgcc ataaatttcc atcatgggcc ctgtccaaaa aggagcttat tctagggaga 45541 atataaccag ccacggctca agaggacaaa gcctcaaatt tctcaagacc aagaagcaca 45601 agatggactc actttctgcc tgcagagagg ctccttcttg ttctcttcgg cctttgcatc 45661 ctgctgcgca gcatctttgg gtgtgtttac tgctaaaaca tcattgatgg tggagccaac 45721 caccatgatc ttggccccac tggtcacttt tatttctctc aatgttttat cctcggggac 45781 gagtccctta tacatgactt tctgcatggc aggcgggaga ccttgggaaa aggtagaagg 45841 tggacagtga aaggaaggaa ggacagaaaa atgtatggat actacacagc tctttggtaa 45901 cagaccaaaa agtgaatact gcctccacca cctattagct ttgtgaactt gggagattta 45961 caacctctga acctcagttt ctagaactgt agaagattta ttagagtaag tccttctgta 46021 cagacctgtt tcaaggatta agagaccaat gcggggcatt tagtacggtg gctagcattg 46081 aataattgtt cagtaagtac tagctctcat tctttccaag ttgttcctca gtcatctaaa 46141 tcattcacat attaagtgag aatccacagg catcagagct acagaagcca atagggacaa 46201 aactcctgtt cttaatggag attacaaggc tacatgtaga aagctttaaa aaaaaaaaaa 46261 agttcaaggc caggagtgat ggcttatgcc tgtaatccca gcacttcggg aggccgaggt 46321 gggtggataa aataaaaaaa actagcccgg cggggtagcg ggcacctgta atcccagcta 46381 ctcgggaggc tgaggcagga gaatcgcttg aacctgggag gcggaggctg cagtgagcca 46441 agatggtgcc agtacactcc agcctgggca acagagcaag actccgtctc aaaaaaaaaa 46501 aaaaatagga aaaagaaaag aaatctcaga tggcaaagag ctatgaacaa ggtaaaatta 46561 agaaacagaa tcgactgtga agaaccagca tcaaagtcaa tagcagcgtt tcctaaactt 46621 cacttagaag aattacctaa gatatttgct aaaaataaat tcctggctac atcccagacc 46681 cactgaatca gaatctcctg ggaaagggcc cagaagctgt aaaacaagcg ccttaggtga 46741 ttttttaatt agagatgtta gagcaacatc aatgagaaac tggggcctaa acgtcagact 46801 tcggctttaa tcccagctct gcaacttact agctctataa acttgagtct ctttccccag 46861 ctgagtaatg ggcatccaag tatccttcag aagactgctg taaggatggg gggacctgat 46921 gcaaggaaag ccaccaggtg cctagtaggg tgcaagggcc atcaggaggc ggggtgggga 46981 agactggctg tcagcgccac aggaattacc tgtaatcgag tggatcttct gtttcagctc 47041 ggagcctgtg ctgtccaggg ggaacttcac gtcatgcttg gtcttattcc agatgatctt 47101 caagtccacc agctccctgc ccgcgccgcc gcccgcgtct tcgccgttgc tgaccgaggc 47161 ctgggctgcg gggtccccag ggggctgggc cggggccggc tgcaggctgc ctcgtgcggc 47221 gccggagtcc tcggccgccg cccccgccgc ggcttcagcc tccaggcagt tgacgggccg 47281 cgcgggagcc tcagtagcca cagtctcggc ctccgtgtcc atgccaggtt cctccatgcc 47341 tgcaaggggg ttagggggtg ggcgctcgcc gcgggctggg cttgggaacg gacgctgtcc 47401 gggcgccgcc ggacctccgt tcccgagccg cggttctccc tcccgggccg ggccgcatcc 47461 cccagaggag cgaccgcaac ctcagccccg gcccggcgcc cgccaccccc gccgcactca 47521 ccatccgggg ccccggccgc cgccatgatg attgtgtaca acacccaccg ctcccgcaga 47581 ggccgctggg aagaggcgag ctggctgaga ttggcccccg cagcagcccc taatctctca 47641 cagcccggcc ggaaacaggt ggaggtttct atggagaccc gcctcctccg ctcccaggtc 47701 cggcccctgc cacgctttcg ctttcccaag ctgaaggggg cgggtccggg aaatagccct 47761 ttctctggcc tctgccccct ggtggcgagt gcggccgcag ccgtccaggg gggcgtggga 47821 tttcgcgcta gggggcctgt gactttaagg ggcaaggctg cgctggctgt ggcggaggcc 47881 gattggctgg aagcagtccc cggaagtgac gcagggagaa gcccacgtgt gctattccat 47941 cccacatggc ggcgctcctg aggagactgc tgcagcgcga gaggccttcg gcggcctctg 48001 gccgccccgt aggacggcgc gaggccaacc tgggcactga tgccggggtt gcggtgcgag 48061 tgcggttcgc tcccagcccc acaggtaacc ctggcggacg tgacgctaca ggccgaggcc 48121 ccctgtcccg tctccgcccc acgaatgttc ccgaatgaat cctttccggt ggtgtcagag 48181 tgggcgggcg gaggagggtt aaggcagcgc gagcccattt tacagaggcg gaaactgagg 48241 tgcaggagcg gaaataattc aaattgccag gagtttcttg tgttttttgt gccatcccat 48301 ctctttcgat ctttttttca tgcattccat acatgtgcat tgatgacaac ctgtgtaaca 48361 ggtgttgtag acgctgggat acagtgatca ttgagaaagg taggggcctt gctctgtgga 48421 agtctgcatt ctagtggctg ccgcagacac ttataaaaaa taagtgtaat tttagagtca 48481 catgtgccat gcgggaagtg aaacaggata ggagactggg gattaagaag gacgctccag 48541 aaggctttac tgagcaggta acatttgagc tgagatccaa atgacaagaa ggagcacgct 48601 atgtgaagcc tatagaagga accttctgga ggccaggcgc ggtggctcac gcttgtaatc 48661 ccagcacttt gggaggcaga ggtgggtgca tcacttgagg tcaagtggat cacttgagag 48721 ccgtctgacc aacatggtga agccctgtct ctactaaaaa tacaaaatta tctgggcatg 48781 gtggcgcatg cctgtaatcc cagctacttg ggaggctgag gcaggagaat cgcttgaacg 48841 caggaggtgg agattgcggt gagccaagat cgcgccattg cactccagcc tgggcaacaa 48901 gagcgaaact gtgtctcaaa aacaaaaaag aaggaacctt ctgggtagag gaaatcgcag 48961 gtaccaaggc actgtgctag tgctgttggg agcttagcaa gctagagggt gacagaggcc 49021 cattgtatag cctgttggag atctcgtcag ggccttttgg ttcccaccag taaggctacc 49081 ttctacgttc tctacacgga gtatgcctct catcccaaat cccaggatga acatacaaag 49141 atctttgtag taaaatccca gctatacatg acaggatctt tctggcaggc agtatagcct 49201 agagcagtgc ctttcaaacc atctgttgtg gaagtgctgg ttttaaattt ccaatcaatt 49261 gtgtatcaat atttgcacaa aattaatata atgaccagaa aaataaaatt ttgaaaacat 49321 ccaaaataca aaccaatctc attttatttg ggttagcaga ggtaaaatca ttctgccaga 49381 ttgctatgta agtttcgcct ggccaatatg gcaaaacctt gtctctacta aaatacaaaa 49441 attagccggg catgtggccc aggcctgtaa tcccagctac tcaggaggct gaaaatcgtt 49501 tgaacccagg agaaagaggt tgcagtgagc cgagatcgtg ccaccgcact gcagcccggg 49561 tgacagagtg agactccatc tcaaaacaac aacaaaagtt tctaaacctt tactctcagt 49621 ttctgtactt gattgtgaac tgacaactgt tttcagacca gcactggatt gcacactttg 49681 ctttgaagag cacttatgta gaatgccaaa gcctgggatt gcctgggtcc aaatttctgt 49741 cgtggccact tattagatgt gtaacttact agatgtgcaa attattattg ttattttgag 49801 acagggtttg gctctgtctg tcgtccaggc tggagtgcag tggtgcaatc tcggctcact 49861 gcaacctcag cctccggggc tcaagtgatc ctcccacctc aacctcccga gtagctggga 49921 ctacaagtgt gtgaccacta cgcccagcta atttttgtat tttcttgtag agatggggtt 49981 ttgccatgtt gcctgggcgg atcttgaact cctgagctca agtgatccac ctgccttggc 50041 ctccgaaagt gctgggatta taggtgtgag ccatcacacc tggcctagat gtgcaaatta 50101 tttaatgaat ctgagcctca gtttcaccat ttgtcaaaac ggggctaata ataatatctg 50161 tcttatgggg ttaagctcat gcagccgaac cacttagtaa ggtctagtac acccagaaag 50221 cacttaatac acagcctcta ttatgatgat tgtacttctc atttccttgc tcatttgcgg 50281 ggaaagccca tgtgggaaag gcttttgact ataaacagat aaaagaatta tcaggcacag 50341 atcattacag tagctgaaat taattgaggg tttactttgg tcagcattta gacctcagag 50401 caaccctata ataaagaagg ttcttatttt tttgctctta atccctctga cagatgagga 50461 aatagaagct cataaaagtt aagtagagtt ccttctcaag gggtagcaac taataaatgg 50521 tggatgcaga atttgaatcc aggtgtactt gatgtaagag gctttgtttt cagcagctgg 50581 gttcctcttt tgtcttgggg actcgtggga ctcagtgcca ctgaatacca ctggttttct 50641 ggtgtccttg cagtaactat ccgaaggtta cacacatcag tgttgaagca gggtcggcct 50701 ggccccagaa gagctttttc tctcctctgc tgcctcttgt acttgtaaat gtagatgagc 50761 tcattgtgga gacttagggc tgtgggctgg tccgaagaag gtacggtaag aggtaatgac 50821 aactctggaa aagctgttaa ctttcagttt tcagacagga atgatgaaaa aggcaaatct 50881 taggagttta aactgatgaa gagaaataca gtggaatcaa acactacttt tggttagtgt 50941 ttaaagccag caattagggt gaaaatcttg gcaaggcatg gtggctcatg ccagtaatcc 51001 cagcactttg ggaggctgaa gtgtaaggat tgcttgagcc caggagtttg agaccagcct 51061 gggcaacata gtgagacctt gcctctgcaa aaaaaaaacc aaaaaacaaa tattagccgg 51121 gcatggtggt gtacacttgt agttccagct acttgggagg ctgacgtggg aggattgatt 51181 gagcctgggt ggttgaggct gcagtgagcc gtgactgtac cactggactt cagcctgggc 51241 aagagtgaga ccccatctca aatttcaaaa atgaattttg tttctttaaa aaagtaaaaa 51301 tctcaagtag atgaaattta aaatgtacga cgtggacagt catgcacact gcatagactg 51361 catatgcaaa atgtgatttt tttttttttt ttggagatgg agtctcgccc tgtcgcccag 51421 gctggagtgc aatggtgcga tctcggctca ctgcaacctc tgtctcctgg gttcaagcag 51481 ttctcctgcc tcaacctccc aagtagctgg gattacaggc gcccaccacc atgcccagct 51541 aatttttttt gtggtatttt tagtagagac ggggtttcaa catgttggcc aggctggcct 51601 caaactcctg actttgtgat ccacccgcct cggcctccca aagtgctggg attaaaggtg 51661 tgagccacca tgccccaccg attttttttt tctttgtaga gacaggatct cacaatgttg 51721 cccaggctgg tctcaaattt agagacccag gtgatcctcg tgcctctgcc tccaaaagtg 51781 ctgggattag aggtgtgagc tactgcacct ggcccacaat atgattgtac agtattttaa 51841 atttctagtt tgtttgtaat taattttttt cccagcgcac atcattgaat ttctgtaagt 51901 taaatgtaca tgatatggcc tcaatgaata actctctgct tctctttcag gcgaaacttg 51961 gccttgagac aaatatcaaa atagagtggt ggatgcagtt ggggaaaatt ggagcctttt 52021 aataaaatga catgctccaa atcctgggtg gtttccattt tcaggatgag ggattaaatg 52081 tgtctggaga cagggcacag gggtaactag ttcttagtct ttgaaaaaaa attttttaat 52141 ttttaatttt ttttgtttga gacagagtct cactctactg ctgcccaggc tggagtgcag 52201 aggcgtgatc atggctcgct gcagcctcag cgtccccagg ctcaggtgat cctcccacct 52261 cagtaactga aaccacaggc atgtaccacc atgcccagct aaatttttat attttttgct 52321 aagacaaggt ctcaccatgt tgcccaggca gatctcaaac tcaggccaag cacaatggct 52381 cacgcttgta atcccagcac tttgggaagc caaggcagat ggatcacttg aggtcaggag 52441 ttcgagatca gcctgaccac catggcaaaa tcctatcttt actaaaaata caaaaattgg 52501 ccaggtgtga tggctcttgc ctgtgatccc agcacttggg aggccgaggc tggggggatc 52561 acttgaggcc aggagttcaa gaccagcctt gccaacatgg cgaatgcctg tctctaccaa 52621 aaatacaaac atgtagctgg acgtggtggc acacacctgt aatcccagct actcgggagg 52681 ctgggcatga gaattgctta aacacaagag gcagaggttg cagtgagcca agactgtgcc 52741 actgcactcc agcttgggtg acagagtgag actcttgtct taaaaaacct acaggaacct 52801 cctgggctca agtgatcccc ccggccttgg cctcccaaag tggtaggatt acaggcgtga 52861 gccaatgtgt cctgctggtc ttttaaattt attgcaacat tctgtcacag tagtgcttac 52921 ttgccccttg ctgaggccta ttagcttatc ttccctgtct tatctgagct gttctcttct 52981 ccaggcttct tgcacctggg tggcctccgc actgccttgt acaactacat ctttgctaag 53041 aagtaccagg ggagcttcat cctgaggcta gaggacacag atcagactcg cgttgtgcct 53101 ggggcagcgg agaatattga ggacatgctg gagtgggcag gtaagcctgg cagagaaagg 53161 atctttctgc agggaaggaa caggaagcag gaaaaggagc ctgtgggtct tctcatggag 53221 ttctgagtgg aaaatgggga gttcctggag aagtggctac atttgtactg ccctgcccgg 53281 ggtggcagat gacacctgaa aaaatgccga agctctctga aacacagctc caagactggt 53341 ttctttattt ttatttattt attgtttttg agacagagtc ttgctctgtt gcccaggctg 53401 gagtgcagtg gtgcgatctc ggctcactgc aagctccgcc ttccgggttc acgccattct 53461 tctgcctcag cctccaagta gctggaacta caggcgcccg ccaccacgcc cggctaagtt 53521 tttttgtatt tttagtagag acggggtttc actgtgttag caaggatgat cttgacctcc 53581 tgacctcgtg atccacctgc ctcgacctcc caaagtgctg ggattatagg catgagccac 53641 cgcgcccaac ttggtttctt tattttttaa ttttttagag tcagggtctc attttgttac 53701 ccaggctaga gttccatggc acaatcataa cttactgtag cctcgacctg ggctcaagtg 53761 atcctccacc ttagcctccc aagtagttgg gactacaggc acgcaccacc acgcctggct 53821 aatttttaat ttttttgtag agatgagatc tcactgtgtt gcccaagcta ttctcaaact 53881 cctgggctca agcaatcctc ctgctacagc ctcccaaaga gctgggatta caggtgtgag 53941 ccactgccaa ggctgttttg agattcctgt tttcacctca gaatctcaag ttcattccct 54001 acaggggcca ggcaggaaac acagggccaa gtgggaactg gaggagctgg gccagtgctg 54061 agctgtgaaa accacacctc atccagagtt gggagtgcct ctgagctcca gctccttgtg 54121 cccatgcggg aatgctggcc cagcttgtcc aatcctctga gttgcaagag gagccagaaa 54181 tccagttttt ctggatgaat acttaaccat tattaaatct gggctcagat tttttttaac 54241 catgttgtat aagccacaca aaacacgtac agcttctgaa gttgatcttg tgggaagttt 54301 tatcctgagg acagctaagg tcagggagaa caaagaggag tgtgtttcct tttcatgggt 54361 cgacagttgt ctacctgtat ggacgggtgt gactaggaaa ggggagattc ccaaggaaaa 54421 ataccaagac tctgctagtc tagagattga aaaaaaaaga aagaaaaacc ccaagactat 54481 tttgagatgt gtgttctaat ctcaggtaaa ttactcacta aactctggat tcctaaagag 54541 tataaaatgg ttggccgggc gtggtggctc acgcctgtaa tcccaaaact ttgggaggcc 54601 gaggctggtg catcacgagg tcaggagatc gagaccatcc tggctaacat agtgaaaccc 54661 catctctact aaaaatacaa aaaattagcc aggcgtggtg gcgggtgcct gtagtctcag 54721 ctactcagga ggctgaggca ggagaatggc gtgaacccgg aaggcggagc ttgcagtgag 54781 ccgagattgc atcactgcac tccagcctgg gtgacagagt gagactctgt ctcaaaaaaa 54841 aaaaaaaaaa aaagaccatg tgcttagtgc caagggaaaa gacagatgag tggattaatt 54901 atttcttttt tttttttttt aacaaacaaa atcttgctct gttgcccagg ctagagcgca 54961 gtggttcaaa cacagctcac tgtaatctca aattcctgag ctcaagggat cctctcacct 55021 tagcctcctg ggtagctggc accatagctg tgtgccacca tgcccagcta attttattaa 55081 cttatgtttt tgtagaggta gggtcttgct atgttgccca ggctgctgtt gaacccctga 55141 cctcaagtta attctcctgc ctcggcctcc caaagtgctg gcatcacggt catgaaccat 55201 cgctcctggc tcctaattca ttcttcatag gatactaaat gataataacc agtatggaga 55261 aaagagtaac aggcttagaa actagggcga gggatgccct tgtagatagg gcagttttct 55321 cttaggaggt gatgctggag cagagacctg catgaagtga ggagccagcc ccgtggtgac 55381 ctgggggaag aatgttcaaa gcaaaaagaa tagccaatgc agaggcccca cagcaggagt 55441 gttctagatg tatttgagga gcatcaaaaa gaccagcttg gtctaaccat agtgagcaga 55501 atgaccaggg atgtgaggat agaaggcttg ccaggtcggg cacaggggta cacgcctgga 55561 atcccagcac tttgggaggc caagaagaga ggatcacttg gacccaggag tccaagacca 55621 acctgggcca catagtgaga cctcatctct acaaaaaatt acccaggtgt agtggtgcac 55681 tcttatagtc ccagctactt gggaggctga ggtgggagca tcacctgagc ctggggaagt 55741 tgaggttgag gctgcagtga gctgtgatca tgttacagaa ttccttcagt gccactttgc 55801 cagccagaaa tctctgcagc tgccctcacc tctgtcgggg cctcgcttgg gccttctggg 55861 cccatttggc ccagcaggct gcacttggct catgcccact tggatcccac acctgccatg 55921 gctccacact cagcccatgg ctggactggg tgtgccacga gtggcttctg tgttgggtgc 55981 tggcatctag atgaggggaa catggtggca cccaaaaact tgggaaatac catcagttgt 56041 ggagctccaa ggggtgttat agctcttgtt tggggagtcc tgagatctga gtccccagca 56101 ctgtctcagc tcttcactcc tgtagcttgg tgagtgggag cgtgttacag ctctctttct 56161 cctgtagtct ggtgagcggg agcgtgttac agctctttta ttcccgtcac ccgcagcttg 56221 gcgagttttg ggttcttgtc ctgcaaccaa gaggaatgag gtgtgcaggt accatagagt 56281 gagtaaggca gagaagaatt ttattgaatg acagaaggga aactctccac tgcgagaggg 56341 gaccctgaaa gtggtagcca tctgtgaggc tgagtctatc atttttatgg gcttagaatg 56401 ggggcatgcc tgccgattgg tccatgagtg gtcttggaaa aaggaccatt cagttggtta 56461 aaacatcatc cggaaggaac caatcgagag agtgggtgag acagggaaac aagttctcac 56521 tccagtcatg gactctatcc tgacttgaca gtttggtttt caggctgtct ttggcttaga 56581 agtcagattt caccagggac ctgtccctga ctgcctagga atttgtctgt ctcctgtcag 56641 tcatgccatt acactccagc ctggagcgac agtgtgagac cctgtctcaa aaaaaaaagg 56701 ctttccagga gccagattac atggtgcctc acagaccaag atgaaggctt cagtttcaca 56761 gaggatctgc tacttatcag gtaccacgcc aggcatgagg gatacagtga tgaacaacgg 56821 tgatggtggc gggtcccacc ctcgtggaat tcctggggca gcatcattgt gagaatgagg 56881 cttgcagccc agagcttgtc ccccaccctg gtgttttggc gaggcagagg gcaggtattc 56941 aggcacctca tggtggctga cttctggagc accttacttt gggagtcatt ctttcttttt 57001 taattttttt cattttattt atttatttat ttatttattt atttatttat ttattgagat 57061 ggagtctcgc tctgtcgccc aggctggagt gcagtggtgc gatctcggct cactgcaagc 57121 tccacctccc gggttcatgc cattctcccg cctcagcctc ccgagtagct ggaactacaa 57181 gtgtctgcca ccaagcccag ctaattttgt ttttgtattt ttagtagaga cggggtttca 57241 ctgtgttagc caggatggtc ttgatctcct gacctcgtga tctgcccgcc tcagcctccc 57301 aaagtgctgg gattacagac gtgagccacc gcgcctggct attattatta tttttttaga 57361 ggcggtgttt caccatgttg cccaggctgg tttcgaactc ctgagctcaa atgatccacc 57421 tacctcggcc tcccaaagtg ctgggattac aggcatgacc cagcatgccc agccttttaa 57481 aaataatgac atataagcaa ggcgtggtgg ctctcacatg gggtggctct tgtctgggca 57541 ggcggatcac aaagtcagga gttccagacc agcctggcca acatgttgaa accctgtctc 57601 taataaaaat acaaaaatta gttgggcgtg gtggcacatg cttgtaatcc cagctactca 57661 ggaggctgag gcaggagaat tgcttgaacc gggaggtgga ggttgcagtg agccgagatc 57721 gtgccactgc actccagcct gggtgacaga gctagactct gtcttcgggg aaaaaaaaat 57781 gacatataat tcacatgata tgaaatttac catatcaaag tgtacaattc agtggttttc 57841 aatatattta ctgtgttgcg caaccgtcac cactaattcc agaacatttc catcacccct 57901 aaaagaaact cttccttatt agcagttgtt ttccattcta ccctctccta ccacctggca 57961 actactgatc tctttctgcc tccatggatt tgcctcttct ggacgtttca tatgaatgag 58021 atcatataat atatggcctt ctgtgactgg cttccttcac ttaccataag cgcatctgtg 58081 ggctatcatg cttcattcct ttttatagcc aaataatatt catcatatgg ttataccaca 58141 ttttgtttat tcatcactag tgaatgagca ttttagttac tctggttttt ggctgtccaa 58201 taaaactcta tgaacattta catgccagag tttaagttaa catgttttca gttctcctag 58261 gggagtccct cttaaaccct ggggcattcc aagataccac attaaagaga cttgtttagc 58321 attagatcag gaagggatcc ccccaaattt acccgattta gccctccttt ctgacaaatg 58381 aagaaatgga ggtcccatga gagggagcca agtaacactg ataatcacag atggctgggt 58441 gccatggctc acgcctataa tcccaacact gggaagccga ggcaagagga tcacttgagc 58501 ccaggagttt gagaccagcc tgggcaacat aggcagacct tgtctctata gaaaacttaa 58561 aaaaagttag ccaggcctgg tggcatgtac ctgtagtctc agctacttgg gaggctgagg 58621 tgggaggatc acttgagcct gagaggttga ggcttcaatt agccgagaac gtgccactgc 58681 actccaacct gggcaacaga gtgagaccct gtctcaaata aataaataca taaataccaa 58741 agcatagaga gcaccggtca caggccactt gatgagtact ttacttagat catcttactc 58801 tagtctcatg atgatcctca tgagaaagtt actacttgtt tccttctttg acagatgaga 58861 acccagagcc cagaaagatg acttgcccaa gatctctgag ctggctggta cgtttgaacc 58921 caggcccact cagctcagtt tgtgttagca ttgagggtag caaggtgaac aatgtcctta 58981 tcttcccaat tcctttgaaa gctctttgag ggcagaaaca ccatcttcca gttcttgtgt 59041 ccctgtgccc tctcctgaaa gcattcataa tgaaatttat cctatgtaat cggagggaat 59101 tggactttgg gtttggggtt tggcttacag tagatcatat ttaatctatt ctatagcata 59161 tattgaaact gtcctaaata tacaccactg ggctaagtgc ttataagcac tgacacattt 59221 gttctcacaa ttccctctgc acaggaggct gctagctcca cttttcaaaa gaggaaactg 59281 aggcttagaa agattaagtc tagctgggca tggtggcaca ttcctataat cccagctact 59341 cgagaggctg aggtgggagg atcacttgag cctagaaatt ggagactgta gtgagctatg 59401 attgcactac tgcaccctag cctgagtgac acagcaagac tcgctctcta ctaaaaaaat 59461 agccagatgt ggtgatgtgc acctgtagtt ctagctactc gggaggttga ggcaggaggg 59521 tggcttgagc ccggaagttc gaggctgcag tgtgctatga tcacaccact gcactccagc 59581 tgggtgatag agtgagaccc catctcttag aaaaagaaag gctaagtaac acaaacagca 59641 gtgtaatgag tggttaagaa tatcggctcc ggtgcccatt agtctgggtt tgaagctgat 59701 cgtgcccctt ctactcggcc aggtgacaaa gcatctctgt gcctcagtgt cctcacagag 59761 tgggggaata accgcaccta cctggtaggg tgggtttgag gcttcaggga aaagtgtctt 59821 ttatggcact cagcaccctg gtgctcacac gttggtagca cctcaggatc ccttgtcaca 59881 gctactaagt ggcagagcca tgagtcccac cagggcctga tgggctgttc tctgaaccat 59941 gcttttttct cctaagtcag ctgttcccag gcctgggctt gtttctccct ctccagggaa 60001 tcccatcagg gattgagagg acatacaatt agggatgtgg actttggagc ccgtacacct 60061 ccgtttgctc cttagtccat gagcccatgc gactttgagt gcttcacctc tctgtgtttt 60121 aacttttcat ctctaaaagg gacattgggc cgggtgcagt ggctcacgcc tgtaatccca 60181 gcactttgag aggccgaggc aggtggatca cctgaggtcg ggagttcaag accagcctgg 60241 ccaacatggt gaaacgctgt ctctactaaa aatacaaaaa ttagctgggt gaggtggcac 60301 atgcttgtaa tcccagctat tcgggaggct gaggcaggag aaccgcttga acttgggagg 60361 tggaggttgc agtgagctaa gattgcacca ctgcactcca gccttggcga catcagagca 60421 agactctgcc tcaaaaaaaa aggcggcggg gaggacattg tactaatgcc aacctcatgg 60481 attgttatga ggattaaagc tttaatatgt gcataaagtg cttagagcaa tgcctgagaa 60541 cgagcgctgt gtgcacctta gctgtcatta ttccctgtgt tccaggcatc ccgcctgatg 60601 agagcccccg ccggggcggt cctgctgggc cctaccagca atctcagcgg ttggagctgt 60661 atgcccaggc cacagaagcg ctgctgaaga ccggagctgc ttacccctgt ttctgctcac 60721 cccagcggct ggagctcctg aagaaggagg ccttgcggaa ccaccagacg ccccggtaag 60781 aacctcagct tgttgcagat gcctcatcaa cattgggtgt gtttgttacc ttgtgctgcg 60841 taagaaattc ccctaaacat gtattacctc tcagtttctg tgggtcaggc atctgagcgt 60901 ggcctcacct gtttctttgg ctcagggatt ctctcaaagc cacaagcaac tacaggtcct 60961 agccagtcct ggagttctct taaagcttag cttggggaag atctacttcc atctcctctc 61021 agaaggctat tggccaacct caggtccttg ctagctgttg cctggagaca tcagttcttt 61081 gccatgcgag cctctccata gagcagctta cagcgtggca gccagcattt ctcaaagcca 61141 gccaaggaga gagtgagcac gtacgagctg gagtctacct agttgtggaa gcaacttctc 61201 accacttttg ttttataggc tatgtattcc ttctctggaa cgcttgggac cagaagtgtt 61261 ttggatttca gattttttgc atattgaaat atctgcatat atataatgag atatcttggg 61321 ggtgggaccc aagtctaaac atgagattca tttatgtttc atgtacgcct tatacacata 61381 ggctggaggt gattttatgc aatattttaa ataattttgt gcacgaaaca aagttttgac 61441 tattttgact gtagcctgtc acatgtgttt gggtgttgta ttttccactt atggcatcat 61501 gtcagcactc aaaaggtttc aaatgtcaga tcagttgtgg cttttttttt ttttttttga 61561 gacagtctca ctctgtcgcc caggctggag cgcagtggtg caatcttggc tcactgcaac 61621 ctctgcctcc tgggttcaag cagttatcct gcctcagcct cctgggtagc tgggactaca 61681 ggtgcccgcc accacaccca tttaattttt gtatttttag tagagacggg tgtcactgtg 61741 ttggccaggc caacctcagg tgatccacct ttctcagatt cccaaagtgt taggattata 61801 ggcgtaaacc accgtgccca gccagatcag gttttttggt tttttttttg ttgttgttgt 61861 tgttgttttt tttcatagag tcagggtctt actttgtcac ccaggctgga gtgcaatggt 61921 gtgatgttgg ctcactgcag cctccaactc ctggactcac gcagtgcccc catctcagcc 61981 tcccaagtaa ctaggactgt actcctggct aatttttttt tttaaacttt tcataaagat 62041 ggggatctca ctatgttgcc caggctggtc ttgaactcct ggcctcaagc aatcctccca 62101 ccttggcctc ctaaagtgct gggattatag atgtgagcca ctgcacctga cttttttttt 62161 tttttttttt tggtgagaca gagtcttgct ctgctaccca ggctagagtg cagtggtgca 62221 atctcagctt actgcagtct ccacctcccg ggttcaaggg attctcgtgc ctcagcctcc 62281 tgtgtagctg ggactacagg cacatgccac cacacctggc taatttttgt gtttttaata 62341 gagagagggt ttcgccatgt tgcccaggct gttctcaaac tcctgaactc gaacaatctt 62401 cccacctcag cctcccaaag tgctgggatt acaggcgtga accactgcac ctggaccatt 62461 ggagcatttc agattaggca tatttggcat tatttttgtt cattagacaa aagtcactag 62521 gcccagccca cactcaaggg gaggggacta cacaaggaca gaaacaccag aaggcgagaa 62581 tgactggggc tatgttagaa gccaagattt ggtctcagaa agtaatgtgt aaagtaattt 62641 taattatttg ttatttttaa tgagcaaggt ccacaggccg gacgtggtgg ctcacgcctg 62701 taatcccagc actttgggag gctgaggtgg gtggatcacg aggtcaagag ttcaggacca 62761 gcctggccaa tgtggtgaaa ccccgtctct gctaaaaata caaaaattag ctgggcgtgg 62821 tggtgcgcat gtagtcccag ctgctcggga ggctgaggca ggagaatcat ttgaacccag 62881 gaagtggagg ttgcagtgag ccaagactgc accactgcac tccagcctgg gcgatagagc 62941 gacactgtct caaaaaaaaa aaaaaaaaaa aagaaaaatg aaaggtccac aaagggccaa 63001 gtatggtggc tcacgtctgt aattccagca ctttgggagg gcggaggtgg gcagatcact 63061 tgaggtcaga agttcgagac cagcctggcc aacatggtga aaccccatct ttactaaaaa 63121 tacaaaaatt aggcctggag tggtggctca tgcctgtaat ccccgtacgt tgggaagccg 63181 aggtgggcag atcacctgag gtcaggagtt caagaccagc ctggccaata tggtgaaacc 63241 ccatctctac aaaaatacaa aaattagctg ggcatgatgg cgggtgccta taatcccagc 63301 tacttgggag gctgaggcgg gagaataatt tgaacctggg aggcgaaggt tgcagtgagc 63361 tgagatcatg ccattgtact ccagcctggg caacagagca agactccatc tccaaaaaaa 63421 aaaaaaaatt agctgagctt ggtggtgcac gcctgtaatc ctagctacct gggaggttga 63481 ggcactagaa tcgcatgaac ccagaaggtg gaggttgcag tgagctgaga ttgtgccatt 63541 gtactccagc ctgggtgaca gagggagact gtctcaaaaa aaaaaaaaag aagatccaca 63601 aagatatata ttgaaagaac tctcttccaa tcttgactct ggccactcaa ttctgctctc 63661 cagagataac cagtgttaaa attttcttat gtctcctcca gagacattca gtacatatac 63721 ctataagtac atattatttt ctctttttcc ttattcacat ggtagcatgc aatgtactat 63781 gtacatgcta tgtggtgcaa tttaccatgt ccgcttacca tcttgaggat cagatatcag 63841 tccataataa tgcataatat tccatttgtg gatataactt catttactta atcagtttcg 63901 tcttaacaaa catttatttt tgttttgttt tgttttgttt tatgttatgt tgtgttattt 63961 tagaaatgga gtcttgctct attccccagg ctggagtgca gtggcaagat cttggctcac 64021 tgcaatctcc atctcccagg ttcaagtgat tctctggcct cagcctcccg agtagctggg 64081 attacaggtg cccgccacca caccctgcta atttttgtat ttttttggta gagatggggt 64141 ttcaccatgt tggtcaggct ggtctcaaac tcctgacctc aagtgatctg cccaagtgat 64201 cctcaagtgg cctcccaaag tgctgggatt acaggcatga gctacgacac tcggccttaa 64261 taaaccttta tgttatttat ttatttattt attttttgga gacggagtct tgctctgttg 64321 cccaggctgg tgtgcagtgg tgtgatcttg gctcactgca acctccactt cctggataca 64381 agcaattctc ctgtctcagc ctcctgagga gctgggacta gaggtgcatg ccaccacact 64441 tggctaattt aattttcttt tatattttag tagagacggg gtttcaccgt gttacccagg 64501 ctggtcttga actcttgagc tcaggcaatc cacctgcctc ggcctcccaa agtgctagga 64561 cgagctcagg caatccacct gcctcggcct cccaaagtgc taggactact ggcatgagcc 64621 actgcgctca gcccaaacct ttatgttatt tctaatattt ggttataaaa gaagagagct 64681 gtgatgacta atgttgacat atgtcatttt gcacttgtgc agtatgtttg ctgaatacct 64741 tcctagaggt ggaattgcag agtcgagttg tagctttgat ggatactgcc aagtggctgg 64801 ccctttactt gtttacactc ccatctgcaa tagcattgta acatttacat tgcagttcca 64861 gtgaactatt tggtcaaggg ctcagctgtt tggccaaggt catgacgttt acggtgacat 64921 tgccttttta ttataaacaa tgtatttgtc tcttgacaga atattttttt tctcatctca 64981 ttctgtgacc caggctggag tgcagtagca tgatcacagc tcattgcagc cttgaatctc 65041 ctgggttcaa gcaatcctac atcctcagcc tctcaagtag ctgtgaccac agccatgtac 65101 caccacacac agagtgttat gttttgtttt gttttgagat ggggtctcac tctgtcaccc 65161 agattggagt gcagtggtgt gatattggct caaggcaact tctgcctcct gggctcaagc 65221 agtcctctca tctcagcctc ccaaacagct ggaaccacag gtgtgtgccg ccatgcccgg 65281 caaatttttt ttgtattttt tggtagagac ggggtttctc cgtgttgcct aggctggtct 65341 caaactcctg agcataggag atccacccac ctcagcctcc caaagtgctg ggattatagg 65401 tgtgagccat catgcccggc tggcacacag ctagttttta aatttttgta gagatggaat 65461 tcccctattt tgcccaggct ggtatgaaac tcaagcgatc ttcccacctc agcttcccaa 65521 agtgctggga ttataggcat gagccaccat gcctggccaa cagatttttt ttttttgtta 65581 aattatagtc atgccctgtt gtggaatcct tgcagccatt tatttattta tttattttaa 65641 gctttcagga tttgtattaa atcctagtct aatttaacgg tatctgatgt tacacacatc 65701 atctcatggt gaacgtgttt aataagcgaa agcaaatcag acagcttatc taagtcgtta 65761 ttttttgtgg actaaacagt aaggtaacaa ctacccagaa ccctatgggt catgatggac 65821 acttgctcag ttttcataca agctgtgttg gttatcaaag tatatctgct aatatttaat 65881 aaagtgaaat gtcattgggg tggaaaagtc aagcctagac atttggttgg aaaagaacag 65941 atcaagtatg gatttcacaa accaaaagtt tataaactca atgcaataca agtcctttct 66001 attgtaaaag cttagttgaa actaaaagat ctgtaaaaac tattactttg ggccttaaac 66061 agtactagct cttatgagca aaaaaaggac acaactgcag cctgggcaac gtggtgaaac 66121 cctatctata ttaaagtaca aaaattagct gggcttggtg gtgtgtgccc ataatctcag 66181 ctacttagga ggctgaggca ggagaatcag ctgaactagg gagatggagg ttgcagtgag 66241 ctgagattgt gccactgcac tccagcctgg gagacagaat gagaccctgt ctcaaaaaaa 66301 aaaaaaaaaa aaaaaggagg ggggccataa ctgttgagaa tgtattactt ggttttattt 66361 tacattaggt ggtaggtaca aagcaatcct tcttaataaa gctgacagtt agcttccctt 66421 aggacttttt tttccagaca gggtcttgct gtgtcaccca ggctggagcg atgcaatcat 66481 gactcaccac agccttgact tcctgggctc aagcaatcct gcttcagtct cccaagtggg 66541 tggaactaca cacatgtacc caccatgccc ggctaattct ttttttttct taaatttcta 66601 gtagagacaa ggtctcgtta cccaggttgg ttttgaactc ctgggctcaa gcagttctcc 66661 tgccttgtcc tcccgaaatg ctgggattac aggagtgagc caccgcactt ggacagaaca 66721 tattcaataa tgcacattta aaacaagtat tcatcttaca agttgttctg taatccaaac 66781 atatgacagc ttggagaaca acatttagaa aacagaagcc aatgtaaaaa gacagattaa 66841 tacaactaga agccaggcac ggtggctcac gcctgtaatc ccagcacttt gggaggctga 66901 ggtaggcaga tcacctgagg tcaggaattc gagaccagct tggccaacgt ggtgaaaccc 66961 catctctgct gaaattagct gggcatggtg gcgtgcacct gtaatcccag ctacttggga 67021 gcctgggagg cagaggctac agtgagccaa gatcacgccg ctgcactcca gcctgggtga 67081 cagagcgaga ctccatctca aaacaaaaaa caaaaaaacc caactaggat agtgtaggtt 67141 ttgtatggct cagactttac agttttctta ctgcatcatc aatgtatcga ttagaatatc 67201 tgttccagct aggtgcggtg gctcatgcct gtaatcccag cactttggga ggccaaggtg 67261 ggcagattgc tagagcccag ccttcgcaac atggtgagcc cccatttcta ctaaaaaaaa 67321 tacaaaaatt agccaggcat ggtggtgcac gcctgtagtc ccagctactt gggaggctga 67381 ggtgggagaa tcactttacc cttgggaatg tctaggctgc agtgagctgt gatcactcta 67441 ctgtactcca gcctgggtga cagagtgaga ccctgactca aaaaaaaaaa aaaagaaaga 67501 aaaagaaaag aaaagaaaga aaccctgtct ctactaaaaa tacaaaaaat tagctgggta 67561 tagtcgtgca tacctgtagt cctagctgct cgggaggctg agatgagaga atcacatgag 67621 cccaggaagt caagactgca gtgagccatg attgcaccac tgcattccag cctgatcaac 67681 gagagtgaga ccttgtcccc tgtctcagta aatgaatgaa tgaatgaatg tttcttcagc 67741 tggccccatt gctgtggatt taaagaaata ccttttcttc ctggttttat ttaacctttg 67801 agatccatcc aataatctct aataccatca attagcactt cccctttaac atcccaaaca 67861 ctaaggtacc tcattttccc aatctgaaac atgttatcag ctgagtacag tggcctgtaa 67921 tctaatcaca gcactttggg aggctgagga gggtggatca cttgagacta ggagttcgag 67981 accagcctgg gcaacatggt gaaaccccat ctctatatga aatttaaaaa agaaaataaa 68041 catgttatta tgtttactgc cgctgctctg tttggaagat gacagagctc ccaaagtttt 68101 gccagtcttt tgttccttta caagtttttc tggagcaact tactttttct tctttaactt 68161 tttgtcaaac tcactgtcag aattcctgcc agaagagctt gaagaaacaa gttcctttga 68221 ttcagacata atgctgccct gcttctgcag ctgaacaccc tcctgttcac tcatgtgcaa 68281 ctgactcatt tgtaaaaatt tcttttttgg ccgggcacgg tggctcacac ctgtaatccc 68341 agcactatgg gaggccgagg cgggtagatc acgagatcag gagttcgaga ccagcctgac 68401 caacgtggtg aaaccccatt tctactaaaa atataaaaat tagccggttg tggtggcagg 68461 cacctgtaat cctagctact caggaggctg aggcaggaga attgcttgaa ccctggaggt 68521 ggaggttgca gtgagtcaag gtcacgacac tgcactccag cctgggtgac acagcgagac 68581 tctatctcaa aaaaaaaaaa aaaaaagaaa aaaaaagttt tttttttgag acagggtctc 68641 gctgtgtcac ccaggctaga gtgcagtggt gccatcatag tttactgcag cctcaacctc 68701 ctgggctcca gtgattcttc caccttggcc tcccacagag ctgggactac aggaatgcac 68761 caccacacct ggctgatttt gttttgtttt gttttgtaga gacagagtct cactatgttg 68821 cccaggctgg tattttaaat ttttaaataa gagacaggat tttgctatgt tacccacgct 68881 ggtcttgaac ccttgaggtc aggctgtcct tctaccttga cctcccaaaa cactgggatt 68941 acagacgtga gcaactgtgt ccagccctgt gcaccattta ttgattgatt gaacacttag 69001 ctgtggcaaa gactaacctt acaagcattc catctgtgtc tcagaacgct tttgggtgca 69061 gttagtggaa taccctgtta gaagggacaa acgatacttt aactcacaga tggatcacct 69121 ggggacctag ttaatgttca gattctgact cagtaggtct ggggtggagc ctaagagtct 69181 gcatttctgg cagactccca agaggctgat gctaggggcc cacagaccac actttcagta 69241 gccagggcat caatcataga catttgtctt ttgattctgc acaaggctgg aggtgggtga 69301 ctccaggatg agactagcag tgtagagaca ccagggctct attttggctt ctctctgact 69361 tcattgcctt ccctcctgca gttgcaagta gctgccgcag gtccaagcat gctcccctta 69421 caggacaaca tccaaagagg acagaagtgg ggcagtggga aattttaaga agggattttc 69481 ctcctgtggc tttctcttct tgtcctttgg agagaaaaac ctttcccgga accttaaacc 69541 tttcttactc cccaggcatg cttttctttg catcccagtg actaggatgg ggttacattc 69601 ctgcctggac tggtcactgg catggagcaa taggattgca agattggcta taccaatcag 69661 gattcaaccc ctcggactaa gcattttgct attttcttgt tagcaaggaa gaagcgtgga 69721 atggctggta gataggcaga cattagtgtc tgctacacca tgttatctca gttttccaca 69781 caactgtacc atcatccctg ttatagactg aggaagcagc ttgcagcagt aactgacttg 69841 cttaaggtca cacagctatg acagaggtag gactcctgtc catcaacctt tccatcaaca 69901 gtactcatgc tgctctgctc cccatcaggt atgacaatcg gtgcaggaac atgagccagg 69961 agcaggtggc ccagaagctg gccaaggacc ccaagcctgc gatccgcttc cgcctggagc 70021 aggtggtgcc agccttccag gacctggtct atggctggaa taggcatgaa gtggccagcg 70081 tggagggaga cccagtcatc atgaagagcg acggcttccc cacataccac ctggcctgcg 70141 tggtggacga ccaccacatg ggcatcagcc acgtgctgcg aggctctgag tggctcgtct 70201 ccactgccaa gcacctgctc ctctaccagg ccctgggctg gcagccaccc cacttcgccc 70261 acctgcccct gctcctcaac agggatggca gcaagctctc caagaggcaa ggggacgttt 70321 tcctggagca ctttgctgct gatggcttcc tgcccgattc cttgttggac atcatcacca 70381 actgtggctc aggttttgca ggtacgtgcc cacctgaata gtcctggcag cagagagcat 70441 ggccaggagg agcagggctt tggctcagac agtcctgctc tttggtccca gctttgccac 70501 ctaccagcca ggtgaccttg gacaagttta ttcttttatt caataaatgt ttattttgcc 70561 agacaccaca ttaagtgctg gggctacagt agtgggtgag atagacatga tccctgccct 70621 caagggactt atagtctctg agcctcagtt gcatatctgg aaaatgggga taagtgtcta 70681 cctcattggg ttaggattca ctgagctcag gagtgtaagg tacccagccc agcacctgtc 70741 acactgtaag cacaaaaaat gacagaccct atgattgtga caattattac aagcatgtat 70801 gctaacagta tgactagaat taatatggat ttagtttctt tttttccttc ttcattttac 70861 tgccagatga atctagttta tttcaagctg agcacttacc catggtccaa gacttagacc 70921 caaaccttga agaagcaagc atcaaatagg atgacctgaa ggatttctga aaatgtgtcc 70981 agaatagcct gcctgctggt ctcatttcat ttaacacaac atactattaa ggggtagata 71041 catatctcca ctgtttgtaa gaggattgtg aagctcagag aagctgagta acttgccaaa 71101 gtcacacacc tagtgaggga tatagcgctt taggcctatg cctgatcacg atgcctgctg 71161 accgtatggg actgagctgc tgctcacaag aagcagctcc cagcactacg gttcttaact 71221 tttttttttt ttttttgaga cagaacttct tgtcccccat tctggagtgc aatggcgcga 71281 tctcggctca ccgcaacctc cgtctcctgg gttcaagcga ttgtcctgcc tcagcctccc 71341 gagtacctgg gattacaggc acccgccacc acgcttggct aatttttgta tttttagtag 71401 agatggggtt tcaccatgtt gtccaggctg ttctcgaact cctgacctca ggtgatccac 71461 ccgccttggc ctcccaaagt actgggatta caggcgtgag ccaccgtgcc tggcctggtt 71521 cttaactctt gatcgtgctt tggaatcagc taggaagctt ctaaaatatg ctcagacccc 71581 aaccaaaccc agttgactca gagtctctgg tacagccagg caatgggcgt actgttctgc 71641 agctcgtgtc ttgagagtaa tggagatggg tttagcctga ttgttttcag accttttttt 71701 ctttaatgat ggaacccttt ctgaaaatga aagctcattc aggctcttga tatataaata 71761 gctagaagtt aaatcccttt gggcaaagtg gaatgggggg cccagggctt gctcactcag 71821 cctcccttat gtttgaaagt agaaggaggg cgggcacagt ggctcatgtc tgtaatccca 71881 gcactttggg aggccagtgc aggaggatca tttaagcctg ggagtttgag accaacctgg 71941 gcaatatagt gaaactccat gtctacagaa aaatgtatgt atgtatgtgt gtatgatgta 72001 tttatttgat agagtctcgc tctgttgccc aggctggagt gcaacggcac aatcttagct 72061 cactacaatc tccgcctctc aggttcaagt gattttcctg cctcagcttc ctgagtagct 72121 gggattacag gcatgcgcca ccacacctgg ctaatttttg tatttttagt agagatgggt 72181 ttcaccatgt ttgccaggcc agtctcgaac tcctgacctc atgatctgcc gacctctgcc 72241 tcccaaaatg ctggtattac aggtgtaagc caccatgcct ggctctacag aaaaatttaa 72301 aaattagcca ggtgcggtag tatgtgcctg tagtcccagc tacttggaag acaggctgga 72361 ggattgcttg agcctgggag tttgaggtta cagtgagcta ggattgtacc attgcactcc 72421 aacctgggca gcagagcaag atcctgtctc ttaaaaaaaa gagataaagt ggaaggagga 72481 agggagagag gaagctgagt ggctgggcct ctcttctttg cagagaacca aatgggcagg 72541 accctgccgg agctgatcac acagttcaac ctgacacagg tcacctgtca ctcagccctg 72601 ctggacctgg agaagctccc agaattcaac aggtgagtgg ggagcatgga gatcccctgg 72661 tggcaaaggg ctttcttgct tatgatgatc cctacagcag agggtataat gggctctggt 72721 acctaattct gtgtgaccat ggctaaatca cctagcctct ttgagcctct gttttctctt 72781 ctctaaaatg ggattaataa cagtactgat cttatatcct tgttctatga aagtatcttg 72841 tattaagtgt tgcatgaaaa tatcttgtat agccctggca catacaaagt gctcagtaaa 72901 tggtacgtat atttttatca ttaacattat gttcctagcc accccttgca tggactctgt 72961 taacaaccaa agcctggaga ggtgtatctt cattttagac tccctcccca ccctcagcct 73021 tacatccagc caacacagtg agtgagcact gtctcatggc atctacttta tatttcatgt 73081 atttgatatt gcaagtagct cttcacttgg aaacctccaa atatcgttcc aactcagatc 73141 atagttaaca tgtctgagtc aaagggtatt agtgtcggct gggtgtggcg gctcacacct 73201 ttaattctag cactttgaga ggccaaggtt ggcagatcac ctgaggtcag gagttcgaga 73261 ccagcctggc caacatggtg aaaccctgtc tctactaaaa atacaaaaat tagctggaca 73321 tggcacatgc ttgtagtccc ggctacttgg gaggctgagg cacgaggatc acttgaacct 73381 gggaggcaga ggttgcagtg agacaggatc acaccactgc actccaacct gggcaacaaa 73441 gtgatactct gtctttaaaa aaagttatta gtggtttgca agaccatcag ccaggtgtgc 73501 acctggggtg agggtcttcc tgatactttg cctagagcac tggtttgggg acacagtatc 73561 agtgcattta cttgcatgag cccagagaat taacatcagt tgcagatatt ttagggctat 73621 tactttattt acttctccca tcagacctat gaggtcagtt gtatgatttt gccctgtttc 73681 acgtatgagg aacccaggac atagagaggt taagaaacta gccaaagtgg ccaggcatgg 73741 tggctcacac ctgtaatccc agcactttag gaggccgagg caggcagatc acaaggtcag 73801 gagatcgaga ccatcctggc gaacacagtg aaaccccgtc tctactaaaa atacaaaaaa 73861 cttagccggg tgtggtggcg ggcacctgta gtcccagcta ctcaggaggc tgaggcagaa 73921 gaatggtgtg aacccaggag gcggagcttg cagtgagccg agatcgctcc actgcactcc 73981 agcctgggtg acagaacaag actccgtctc aaaaaaaaaa gaaagaaact agccaaagtt 74041 cacacagtaa acagtgggca tcttttgaat ggggcattca agagtggatg taggctgggc 74101 gcggtggctc atgcctgtag tcgcagcatt ttgggaggct gaggcaggtg gatctcctga 74161 gtcaggagtc gagatcagcc tggccaacat ggtgaaacca catttctact aaaaatagaa 74221 aaaaaaaaaa aaatcagccc gacgtggcgg gcgcctgtag tctcagctac tcaggaggct 74281 gagacaggag aatcacttaa acctgggagg tggaggttgc agtgagccga aatcacacca 74341 ttgcactcca gcctgggcaa caagagtgaa actccatctc aaaaaaaaaa aatgcctctt 74401 tctatatatt ttagtggctg ggtctcgccc tgtcacccag gctggagtgc agtggtgaca 74461 tctcggttca ctgcagcctc agactcccag gctgaagtga tccttccccc tcagcctcca 74521 gaccagctga gtctataggt gtgtaccacc atgccaggct aattttgtta ctttaaattt 74581 tttttgtaga gatggcatct taccatgttg ctaggttggt cttaaactcc tggcctcaag 74641 tgatcctccc ttcttggcct cctaaagttc tgggattaag tgagccactg cacctggcca 74701 aaagcttctc caggtccagc aggactgagt ggcaggtgac ctgggaagcg gaggttgcag 74761 tgagcggagg tcgcaccact gcactccagc ctgggtgaca cagtgaggct ccgtctcaaa 74821 aataaataaa taaaaataaa aggtcagcta ctattttgaa aacactgaaa gcgtggctgg 74881 gcatggtggc ccacacctgt aatcctggca ctttgggagc ccagctgggt ggattgcttg 74941 ttgttgccta gcctggacaa catagcaaga ccctgtctct acaaaaaact aaaaaattag 75001 ctgggcatgg tgatgtgtgt ctgtgggccc agctacttgg gagggtgaga tgggaggatc 75061 gcttgagcct aggaggtcta ggctgtagtg agccatgatc gtggtactgc actccaacct 75121 gggcaacaga gcaagactct caaacaaaaa caaaaaccaa acccctgaaa agaaagcacg 75181 tctccgagtt gcttgggact ttcaggctgt ttgtgacctt agtcacatcc ctttcttttt 75241 ataaacgtga agagattctt ctagcttgcg aaagggaggg ctcatgggac ctgatatgtc 75301 attcttccaa cttcaccctt tccctggggg agagacggtt tgggtgggtt ggggtgttga 75361 cagcagggca gccccttgat ttcttccgca gactgcacct ccagcggctg gtgagcaatg 75421 agagccagag gcgccagctg gtggggaagc tgcaggtcct tgtggaggag gcctttggtt 75481 gccagctgca aaacagggat gtcctcaacc cagtctacgt ggagaggatc ctcctgctga 75541 gacaggtgtg gtgtcaggat tctgggaagc tgagggaggg gttactgggt gcctcaattc 75601 ctcctgcctt ccagagccct gtccctagtg cattcaatga ctgcctcctt ccccagggtc 75661 acatttgccg cctgcaggac ttggtgtccc cagtatactc ttacctgtgg actcgccctg 75721 cagtaggtcg agcacagctg gacgccatct cggagaaggt ggatgtgatt gccaagcgtg 75781 tgctggggtg agtacccgca ggctgagctc agggttccac accctccttc cctctctctg 75841 tcccatggct ctctttcctt cagggctggg ctgttggggc cccagcacaa ctccatggcc 75901 agtgaagaag tgacccttca ctggctttcg ctggtgagga gggcttaccc ggagaacccg 75961 ggctttctct ggacataggc atgtgttcac tgcttgcacc aacctggttg catcccagct 76021 ctgccgtgta ctcaccagat tatgtcacct tgctcaggcg gcttcacttt ggccaagtta 76081 ctgtacctat ttgagcttcc ttttccttct cagtgagaga gagatgatgc taattcttac 76141 tgtaacagta ataactttca attacaaacg caaccaaagc tggcttaggc aagaggggaa 76201 taattgctta tgtaactgta gatgctagaa gaagggctgc ttgaggcaca gcttaatgca 76261 ggagtcggta ctccattacc aagattcgct ttcgctctct gcctttcccc ttctgcccct 76321 actcgccgat gctgaccact tttcctgcag aggctgtgta tcaggtaact agagggcaag 76381 actctctacc agacaaaaga aatgacagcg catgtccttt gtggaagtac atttctccat 76441 tatgtagcaa tctgcaggaa gcagctcagg gctcatcctc ccacctcagc ctctttagaa 76501 actgggacta caggcacaag ccaccacgcc caactaattt ttttattttt tgtagagaca 76561 aggtcttgcc atgttgccca ggctggtctc aaactcctag gctcaagcaa tcttcacacc 76621 tcagcctcac agagtgctgg gattacaggc gtgagccata gtgtctggcc agggcctatt 76681 agttctaatt caccaggatt cagtgtcgta actgcctgtc tccaccatga ggttgggttt 76741 tagatttgct ctgggactcc ctctcgagtc aggaaagctg atacccaaat gttcaggagc 76801 tgccagatca ctagaaatcc tttctgtgcc tatgccatgg ttcttaactc tgggccagct 76861 ctagcaggtc tggaactgcc attgttcccg gcctggggcc tgtcattgtt tcatcagctc 76921 ctcggcagcc tgcttccctg cagtgccatt ggggagagtg tctgcttttt tcccacatta 76981 tattttttgt cttcagtatt tcatctcctg atttttgtaa atagcagtac tcatgtgacc 77041 cagttaataa ttgtctcagt ccatggggtt gctatagcaa aataacatag actggtggct 77101 tcaacaacag acatttattt ctcacagatc tagaggctag aactccaggg tcaaggtgcc 77161 gccgattcag tgtctggtga gagccttgtt cctgggccct tcccttttcc agcccttcgc 77221 tgcatcctca cgtggtggag ggacgaggga ctctctgggg tgtcttttat aaaggcactc 77281 atctgtgcat aagggctcgg cctgcatgac ctgttcacct cccaaagccc acacctgtga 77341 atactgtccc cttgggggtt aaaatttcaa catatgaatt taggggaaca caaacattcc 77401 gaacatagag atgatgcagt aagaacgatc atcctttgct gggcgcggtg actcacgcct 77461 gtaatcccag cactttgtga ggcccaggcg ggccgatcac gaggtcagga gatcaagacc 77521 atcctggcta acacggtgaa accccgtctc tactaaaaat acaaaaattt agccgggtgt 77581 ggtggtgggc gcctgtagtc ccagctactc ggaaggctga ggcaggagaa tggcgtgaat 77641 ccgggaggcg gagcttgcag tgagctgagg ttgcaccact ggactccagc ctgggcgaca 77701 gagcaagact ctgtctcaaa aaaaaaaaaa aaaaaaaaga acgatcactc ttgtctgaaa 77761 ttgaaactca tctgcttcag tgctgacagt tagtaaggag atctgtctgc actgaaaggc 77821 tgcttcttca aatcacttat gaacccagtt ccctgagaca gtagtggtcc atactgagtt 77881 gactttggaa tgcagttgcc tggaccatgt ttgagcaggc agggtcagaa ctgacatcac 77941 agaccatctg caccagaaag atgtccagcc ccagttcctc ttcagtgttt ccatgttgga 78001 atctccagta ataactgaag ggaggctgga cactgtggct tgtacctgta atcccagtat 78061 tctgggaggg tgaatgggga ggatcacttg aagccaggag tttgagacca gcctgggcaa 78121 catagcttga ccccatctct acaaaaggta aaaataaaat tagccaggca tcgtggtggg 78181 tgcctgtggt ccaagctgct caggcagctg aggtgggagg atcgtttgag cccaggagtt 78241 tgaggctgca gtgagctgtg attgcaccac tgcattccag cctaggtgac agagcaagat 78301 cctgtcttaa aaaaaaaata attaattatt tttaaaaact gagaggaact ggaaagttgc 78361 agtgaaatat actcatcccc agcctattcc catctcttga ctaactcaat cagggccgag 78421 gcaaattttt gttaagtcca cctctaactg aatggagaat ttctcctgaa acaaaagctt 78481 ttgggcagag catcttgctc agctggactg ccaggatgtg ggggttggtt gctggcccgc 78541 tggaatccta gaggcttcaa tttgtggaag aatggccata gcaagttggt ggcagagcca 78601 ttgctggagc tgggtggggt cttctgattt cttctcttct tggctacgtc tccaaagaag 78661 tctctctcta ccaatcttaa agctgtattt cagataattt cccagaaatt atccattcta 78721 tgtcaaatct attcctaagt ttcttttttg ttgtgcctgc aagcagaact gtttcttttt 78781 agatcctttt gtaatgtctc aaagctcttg attgtactta actaattttt aaaaatttta 78841 aattcaaata cgaaatgtaa tacttggccg ggcatggtgg ctcacgccta taatcctgcc 78901 tataatccca gcactttggg aggccaaggt gagtggatca cctgaggtca ggtgtttgag 78961 accggcctga tcaacatggt aaaaccctgt ctctactaaa aatacaaaat tagctgggcg 79021 tggtggcacg tgcctgtaat cccagctact caggaggcta aggcaggaga atcgcttgaa 79081 cctggaacgt ggaggttgca gtgagctgag atcgtgccat tgcactccag cctgggcaac 79141 agagcgagac tccatctcca aaaaaaaaaa aaaaaaaagg taatatttct gagtatttat 79201 catgtgttca ggcacttggt gctttataag ttttaactct ctacgatagc cttgtgcagg 79261 acagatacta ttattttttt ttttttattt tgaaacaaag tctcgctctg tcacccaggc 79321 tggagtgcaa tggtgcaatc tcggctcact gcaacctcct cctcccaggt tcaagcaatt 79381 cttctgcctc agcctcctga gtagctggga ctacaggcac ccgccaccat ccctggctaa 79441 tatttttgta tttttagtag agatggggtt ttaccatgtt ggccaggttg gtctcgaaca 79501 cctgacctca agtgatctgc ctgccttggc ctcccaaagt gctgggatta caggtgtgag 79561 ccactgcgcc agcctcagat actattattc tccccacttt acaaatgaga aaactgaggc 79621 ataaagattg catgacacag aggtagcaat gagtaaagcc tcttatggca cctcttacct 79681 ccctgcattg tgaccactgg cttctctgtg tgtggtccct tggtggtttt tccttcccga 79741 cctggataag ggagaaagtg gtcaatatct tcctcacctc cctcaaaaca ggcttgaaca 79801 ttggttgtgc tgctcgtact gtagctgatc aataaccttc tgaagccctt cctcccaccc 79861 acttgccatt ggcccatgca ggccctgtaa actggccctc ttcttctagg cttctagaaa 79921 gatctagtat gagcttaact caggatatgc tgaatggaga actgaagaag ctatcagaag 79981 gtctggaagg caccaagtac agtaatgtga tgaaactcct tcggatggcc ctcagtggac 80041 agcaggtgag gcagggacac gggttggatt gttccctgga gcccctcatt gatcctttga 80101 acttacattt cctggcaggc actgagctca atattgagta tacaaaggtg aatgaaacat 80161 gatcactccc ctcatttagt caatcaacag acactattga gtacctacta tgtgcaagca 80221 ttgtgttagg tacagtgaat acaatataga attaaacaca catacataat ctctgcctcc 80281 atggggtttg cagtttattg gggataacag acctagcaac agcttactta atatgggaaa 80341 atcaggccag gcacggtggc tcacacctgt attcccagca ctttgggagg cctaggcagg 80401 tggatcacct gaggtcagga gtttgagacc agcctgggca acatggtgaa accttgtctc 80461 tactaaaaat acaaaaatta gctgggggtg gtggtgcgcc tgtaatccca gctacttggg 80521 agggtgaggc acgagaattg cctgaacctg ggaggcagag gttgcagtga gccaagatcg 80581 caccgttgta ttccagcctg ggtgatagag tgagactcta tctcaaaaaa aaaaaaaaaa 80641 aagggcggtg gttggaggag aaacaggctc ttaggaccca tccttttcca gggaaataac 80701 acaagcactg atcctcagaa cacaggcagc agtgtggggg ctggcctggg aataatttct 80761 gcagactaag gctgagcttc agttctttag gctgtttaag gtagtaaggc aatatttgct 80821 ctttttgttc ttagcaagga cctcctgtag ctgagatgat gttggccttg ggaccaaagg 80881 aagtacggga acggatccag aaggtggttt ccagctaggg agaggatgtt tcgggcagtg 80941 gagatcgccc taagaacctg tgagcttaga aacagctttc agagaccaga aggaggcctg 81001 gggcccgtcg ggaagtttgc tgaaggaact aagagtggaa acaatctctg actgcacaca 81061 agtgcatttg tgatccacca caggcacgtt tttgtcaatt ggctggggaa gcccaaccca 81121 tctaacctcc tggactctgg aaaggcattc tccgggtctg gtttctcaca cattacacag 81181 cagaggtttg gagttagttc acagaaaagt taacatcccg gaacagtgca gaaggcttgg 81241 gtttcagttt gctagggccc ccataacaaa atgccacaga accaggtggt ttaaacaaca 81301 gagatttatt ttctcacatt tggaggctag aagttcaaga tcaaggtgtc agcaggcctg 81361 gtttttccta gaggcctctt ttgttggctt gcagacagct gtcttctcac tgtgtccttg 81421 tgcggtcttc cctccatgca cacacatctc taagtttcat tatgtgccca acgttcctct 81481 tctaagaaca ccagttggac tggattttaa ctttaaccag cctcatttaa gttaatcacc 81541 tctttaaatg ctcaatctcc aagtacagtc tcattctgag gttccagggg tttctcaacg 81601 taagaattta gggggacaga attcagcccg tagcagctgg gcagcaggac tcatgggtcc 81661 cagttctcag gccccaagga ctcagagcag caaaggatac gtgacacaag cagggctgaa 81721 ctggacacca gactttcttt ggggttgtaa aaggagaagt gtgactccac tgccaagggt 81781 aagcactctt gtcttatgct accctttatt cttgaacctt ggggcagcct gtgtctccct 81841 tccctaaacg tcatctgttc cgtttttcca gtcttccaga cctggtcctt ggactggaga 81901 ccttcaccct tcctattttg tggctttctc ttcaaatcca cctcgtcctt cctccatgac 81961 ttataagtca tcatctctgc tggtttctga aatgagatat gcttcttggg gtcttcagtg 82021 aagaggaata gcaagtgaaa ggaggatggt ggcagggtac atgaacccct accaggcttc 82081 taggcttccc ctgcatggca gccagcctct aagttggccc ccaaccatcc ctgccttctt 82141 ctattcacac ccttgtgtgg ttccttccca cattgaatag gcctgacttg tgtaaccact 82201 acactattga agaaatgaga gtttgacttc tgagggtaag tcataaaata catcgtggct 82261 tctgtcttgt tttctttgat cgcttacttt gggggaagcc agcttccagg tcatgaaaat 82321 actcaagcag ccttatggag tgctccacat gacaagggat cgaggcctcc caccaatagc 82381 catgggagtg tgcaatcttg aaagctggtc cttcaattcc agccaagcat tcacatgacc 82441 gcaatcctag ctgatctcgc ttgtgacctc tatgggagac cctaagccag aaccactcag 82501 agaagctgct ccggaacgtc agacacacat gaactgtgag atagtaaacg tttattgttt 82561 aaagctactt ggtttgggga taatttgcgt ttcagcagta gacaatatat ttattttgtc 82621 caggggagtg gaaaggcgat tacagagttt gttgagcccc atagagtcct agtctttgct 82681 taaagaaata taccagttct ttctaataca gaatccatta agttcgttgg ttgcaaacaa 82741 tagaaaccaa ctctgcctca cttatgttta ttggaggcca taagagagcc acagcatcta 82801 ccaaaaagcc aaaaaaaaaa caggttttgg aaaagacagg aactagctac aggagctaat 82861 ggccaatctt gtctgagttc tttgatggcg acacatgaac tacagccgtt ttgttgttgt 82921 taatttctcc gtgttttccc ctcagttttg gatgactctg ctcaaggttc agattctgag 82981 gagcaagtat ctgttgggct atctgaggtc ctgctaccat tccttggcca gtaataggta 83041 gtttcacagg gattatatgc aatggaggaa aagttgttct caaggaaaac tagggagctt 83101 tctccagaag atgatgtaat ggcagaaaca acaggtggtt gaccacagaa ggggacaata 83161 ataatacttg taatttattg tgcgcttata tgagcctcac cctctcccag gtgtcagata 83221 atgctcgtaa atgccttagt ccatcatctc agtaaatttt ttttataaca ctatgagaga 83281 agcagtattt tttttttctg ttacagatat ggaaaccgag gttctgaaaa gttaactaaa 83341 acttccgagg ttatagtacg aggcaaaacc aagatttaaa ctcagatctg tgccgatcca 83401 gagtccaagc ttaactctga tctactgagt gttgtcagag catctgtttc taagacatta 83461 agaatggtta aaaggagctt cttggctcat ggtcaagtga gtgtttttaa caaagtgagt 83521 ggtcaaaata ctaagtcgat tctggtatat ctacaacatg gatgaacctc taggacttca 83581 tgctaaatga aatgtcagtc acaaaaggac aaatacgtcc taatgctttg tgaggcaaag 83641 gtggtaggct tgtcaaaggc caggagtttg gagaccagcc tgggcaacat agtgagatcc 83701 tgtctctaca aaaaatttaa aaattagcca ggtgaggtga catatgcctg tagtcccagc 83761 tacttgggag actaaggtag gaggatcact tgagctgagg cgttaaggct gcagtgagct 83821 atgaatgcac tactgcactc taacctgggc aacagagcaa gatcctgtgt ctaaaaaaaa 83881 aaaaatgaat aaaataaagt taaaaggata gatactgtaa gattctactt atttgaatta 83941 cttaaaataa tcaaaattat acaggcagaa aataagatgg tggttgtcag acttggggag 84001 gcagaaatgg gaagttactg tttagggtat agagtttcag ttttgcaaga tgtagttaca 84061 ggggtcaatg gtggtgataa ttgcacaaca ttataaagat acgtaatgcc gctgaactgt 84121 atatttaaaa atgactgaga tggtaaattt tatatttttc tgtattttac cacagttttt 84181 aaaaaatact aaattagagc tgggcctggt ggctcacacc tagaatccca acattttggg 84241 aggccaaggc aggcagatca cttgaggccg ggagttcaag accagcctgg ccaacatggt 84301 gaaaccccat ctctactaaa agtacaaaaa ttagccaggt gtggtggtac atacctataa 84361 tcccagctac ttgggtggct gaggcacagg aattacttga acccccacag aacgggaggt 84421 tgcagtgagc caagatcaca ccattgtact ccagcctggg gaacagagcg acactctgtc 84481 tcaacaagaa attaataaaa aagcgaaatt aataaagctg ttgctacttc ctcaagtgaa 84541 gtgtgctgtg ctatcttgaa agcagctcca gacgtctgtc ttctgtttga gggttttgct 84601 caggatcagt aagacttttt tttttttttt taagaggatg ttgctctgtt gcctgcccag 84661 actggagtgc agtggcacaa ttatatttca ctgtagctct aactcctggg ctcaagagat 84721 cctcctatct cagcctcttg agtagctgga atcacaggtg catgccacca cacctggcta 84781 agtaaaagtg tttgaaggaa tttcatacat tttaaataac ctcaagccca tccccagtgt 84841 tttgagggta tagtgcagca gtgaagggca cagaggctgt ctagattcag aaacattctg 84901 ccccttattc acatgtgact ttctgtacct caattttctt gtttgtaaaa taggaataag 84961 aattgtaccc atctcaaaga gttgtaagga ttggaataga taacatatac gaagttcttt 85021 tttttttttt tttttgagac gtagtttcac tcttgttgcc caggctggca tgcaatggcg 85081 tgatcttggc tcactgaaac ctccacttcc cgggttcaag tggttttcct gcctcagcct 85141 cccaagtagc tgggattaca ggcatgtgcc atgtgccgcc atagccagct aattttgtat 85201 ttttagtaga gacggggttt ctccatgtgg atcaggctgg tctcaaactg ccgacctcag 85261 gtgatccgcc tgcctcggcc tcccaaaggg ctgggattac aggcatgagc caccacgcct 85321 ggcaacatat atgaagttct tagaacatta catggcattt aataaatgtt aggtaactat 85381 caccattatt ttcatcctta gaaacagtga tcttggtggc aatgcataga aacacactca 85441 gactaactga aaccatgaaa ggaatttatt ggaagaatta ggactctcac agatctaaag 85501 ggcaggactt ccaactggac ctcacaaaag aatggaatct gtacctagaa acctgttggg 85561 accaaaggca gccactcggc cctctgagat ggcacagttt ctaacattca cccctctgtg 85621 tgtctgctcc ccttgttttc cctctgcagg ctgcctttct ctttatctct gcagttggcc 85681 aacagctgcc ctagccctga aggtaaatca tctctcttct cctacatcac caattaaata 85741 accaatattc cacatacaaa ctcgcagggg agaatctgat tggcctagcc tgggtcaggt 85801 gtctacccct ggtccaatca gctgtggtca atgggagtca tattctatga tcatggctgc 85861 aggtggctca ctttagtgtg tgggaaacca gttccttccc ctgagcgtgt tctttttcgg 85921 tgttgtttag agagtatctt ttccagggct gggcacggcg gctcatgcat gtaatcccag 85981 cacattggga ggccaaggcc agtggatcac ttgagcctag gagttcgaga ccaacctaga 86041 aaacatggtg aaaccctgtt tctacaaaaa atgcaaagat tagtggtgtg tagtgatgca 86101 cacctgtaat cccagctact ggcgtggctg aggtgggagg atcacctgag cccagggagg 86161 tggaggctgc agtgagccag gatagtgcca ctgtactcca gccagggtga cagagcaaga 86221 ctgtctcaaa aaaaaaaaaa aaaaaaagac agtatccgtt ctagaaaatt acatttttat 86281 agccaaattt ggaatcccat tcatactgcc tttatcctac tttatttttg ccttcatagt 86341 actttttact atatgaaatt atatatttgt acatatttac tctcttactc aaaagaatat 86401 gagattatgg caggaagtct gtctttttgt ataaccatag cacctggcac gttgccagat 86461 acatagtaag ggtcaagatc tcatgatgaa aaagtgtttg aggactatgt tagtaagatc 86521 agtttcttca gaaatggcac tcatcagctg gtctaactgc cagactctta acagttacca 86581 gtatcacttc catttaagcg gggaagtctt tcctcttttg caagatttgc cagtataatt 86641 agcagtccag tctgttgctt gtaatttgca ttgaatccag ctgtctggga gaaactacct 86701 ttgacaaagc ttggaagtga aggctgggga cagatgactg acatgtgttt agtggggact 86761 aaaattagaa gttgttggct gggtgtggtg gcttactcct gtagtcccag cactttggga 86821 ggccaatggg ggaggattgc ttgagcacag gcgtttaaga ccagcctggg caacatggtg 86881 aaaccccccc atctttacaa aaaaatacag aaattagctg agcattgtgg tgagcacttg 86941 tggtcccagc tacttgagag gctgaggtgg gaggatagct ggagcccagg aagtcgaggc 87001 agcagtgagc tgtgtctgtg tcactgtact ccatccctgg tgacagagtg agattctgtc 87061 tcaaaaaata taaatgtcag ccaggcgtgg tggctcatgc ctgtaatcct agcactttgg 87121 gaggtgaagt caggtggatc atgaggtcag gagtttggga ccagcctgac caacatggtg 87181 aaatgccgtc tctactaaaa atacaaaaat tagccaggca tggtggtgcg cacctgtaat 87241 cccagctact taggaggctg aggcaggaga atcgcttgaa tccgggaggc agaggttgca 87301 gtgagccgag atcacaacac tgcactccag cctgggtgac agactccatc tcaaaaaata 87361 aaaataaata agtaaataaa taaataaata ggccaggcgc ggtggctcat gcctgtaatc 87421 ccagcatttt gggaggccga ggcaggcaga tcacgaggtt aggaaatcga gaccatcctg 87481 gctaacacag tgaaaccctg tctctactaa aaacacaaaa aattagctag gtgtggtggc 87541 atgcacctca ggggttctga gggatgagta ggagtttccc aggtagatgg gtgttggaag 87601 gctattccaa ggagagggat gaaaaataaa gcttcacaaa gttgttcagg actggaccaa 87661 gagtaggagt caggggagag gtagaagtgg ctgctgagaa accaggcaga tgtgagatcc 87721 aaataataaa acccagattc aatagaaaaa aattagctgg ggccgggcgc ggtggctcac 87781 gcctgtaatc ccagcacttt gggaggccga ggcgggcgga tcacgaggtc aggagatcga 87841 gaccatcccg gctaaaacgg tgaaaccccg tctctactaa aaatacaaaa aattagccgg 87901 gcgtagtggc aggcgcctgt agtcccagct acttgggagg ctgaggcagg agaatggcgt 87961 gaacccggga ggcggagctt gcagtgagcc gagatcccgc cactgcactc cagcctgggc 88021 gacagagcga gactccgtct caaaaaaaaa aaaaaaagaa aaaaattagc tgggcatggt 88081 ggcatgtgcc tatagtccca cctactaggg aggctgaggc aggaggatca cttgagccca 88141 ggaagtcaag gctgcagtga gctgtgattg caccactgca ctccagccta ggcaagagag 88201 caataaatct taaataagat ttattatcgt acttagcagg aagttcattg gtaaggtggg 88261 ctgcagggtt ggttaattac acgtgtggtg acatcagtga ggacccaggg tccttctctc 88321 tctccaccct cctggccaca gcatctgcag cagccacagt tggctgcctg cagcagtcag 88381 gcctctgcac ttcctccttc atgtcagctg gggagagggt ggctttctat ggctgtctcc 88441 caacagcaag gaaactccta agtaccctcc ctctactcaa cttccaaatt tttagccatc 88501 cctgctgggg aagtgagggt ctccataaat cagtcagatc cagagctgga gatgaaggga 88561 gtctgcgtgt ccctgaggtg catggccagg tggggagtgg caggcacctc agcaaagtta 88621 gggttcaatt aatgaaggca aggagttggg tcttcataag gagggaggtg cagagccagt 88681 gaaatgtttt aggcagggaa atgacatgag cagattgtgg tttttgaaag gtcactctgg 88741 caactatgcc aaggagagac ttaagcagga gcagatggaa ggaaggagcc ccagtaggag 88801 aagattggaa aatgaggcac agaaaagggc tattttcacg ataaagccat gttctctcat 88861 cacagcctct aagggcctag gttagggttc tccagagaaa cagacccaac aggatatata 88921 tctaggaaga gagatttatt attagatagt gcctcaccaa attatagaag ctgaaaaggc 88981 cgagtgtggt ggctcatgcc tgtaatccca gcactttggg aagccgaggc aagaggattg 89041 cttgaaccca ggagttcgag accagtctgg gcaacatagt aagaccccat ctttacaaaa 89101 aatagaaaaa aaaattggcc gggcatggtg gcgtgtgcct gtagtcccac ctattaggga 89161 ggctgaggcg ggaggatcac ttgagcccag gcagttgagg ctgcagtggg ctgtgattgt 89221 accactgctc tccagcctgg gcaacagagc aagaccctgt ctcaaaaaaa aaaaaaatag 89281 gtcagaggtg atggcttatg cctgtaattg taatcccagc actgtgggag gctgaggtgg 89341 aaggatactt gaacccagga gttggaggct gcaagtgacc tatgattgtg ccactgcact 89401 ccagcctggg caacagagct agactctgtc tgtagaaaaa ataaaaattg atagataagc 89461 tgaaaagtca tctgatctgc ttgtgtaagc tggagaccca ggaaagccag tggtgtagtt 89521 gaagggcctg aaagctggag agccagtagt gtagattcca atccaaacct gaatgcctga 89581 aatccaggag taccaaggac agaagattgg tatttcagct taagctgtgt gttagtctgt 89641 cattgtgttg ctgtaaagaa atatcggagg ctgggtaatt tatttttaat tttaattttt 89701 tttttttttt tttttgcagg ctggagtgca gtggtgcaat ctcagctcac tgaagcctcc 89761 acctcctggg ttcacacgat tctccggtct cagcctcctg agtagctggg attacaggca 89821 cacaccacca tgcccagcta atttttgtat ttttagtaga gaaggggttt tgtcatgttg 89881 gccaggctga tcttgaactc ctgacctcat ataatccatc tgccttggcc tcccaaagtg 89941 ctgggattac aggcgtgagc caccatgcct ggccttttaa tattttttgg ttttgtagag 90001 atggttgtca ttatgttgcc caggctgatc ttgaacccct gggctcaagc aatcttccta 90061 tctgagcctc ccaagtagct gagactacag ggacttgcac caccacaccc agctgataca 90121 tatttatata tatatatatt atctctttta gattaaggaa aaaaaaaacc ccatgaaata 90181 ggcacttatc ttatttctta gatgaaaaca acatagttct aatcacaatg cctgcacacc 90241 caaccactac attaaactcc tcaaccactt cctacccact tagcctttat cctttctgtt 90301 gtggtctcca gaccatcagc atcaatgtca gatggagatc tattacaaag acaaattctg 90361 gggcctcatt ccagacgtac tagatctgaa actgtgtggc ggggccctgc aatttgcaac 90421 aaacctgatt ctgacacatg ctagtttgaa aaccattgat ctagatgttg tgggacatct 90481 ataccttaca tctttcctca cctctccaaa gggtcatggc caacactcct ataacaaaaa 90541 gacgggttaa gaccaggtat ggtggcttat gcctgtagtc ccagcacttt gggaggattg 90601 cttgagccca ggagtttgag accagcctgg gcaacatggt gagaccttat ctctacaaaa 90661 ctaaataaat tagctgggag cggtggtgtg tgtgcctgtt atcccagcta ctctctgagg 90721 actgaggttg cgaagatcat ttgagcctgg gaggtcaagg ctgcagtgag ctctggtggt 90781 gccacggaac tccagcctgg ttgacagagc aagaccctgt ctcaaaaaaa aaaggctaac 90841 aagagaaaag cataacagat ttatttaatt aaagttttac aggccgggca cagtggctta 90901 ctcttgtaat cccagcattt tggaaggctg aggtgggcgg atcatttgag gtcagcagtt 90961 caagaccagc ttggccaaca tggtgaaatc ccgtgtctac aaaaaaattc caaaattagc 91021 caggcatggt ggcggggtgc ctgtattccc agctactcgg gagactgagg caagagaatc 91081 aattgaacct gggaggcaga ggttgcagtg agccaagatc gagccactgc actccagcct 91141 gggcgacagg acttcatcaa aaaaagtttc atcaaaaaaa aaaaagtttt atgttgacat 91201 gggagctttc agaaataaag accgagggaa ggctgtctac ttttatattt aggtttgatg 91261 accaacagcc aggtagaaat aggattgaac aaaatggtat gagtgatcta ttagacttag 91321 gtggggggat ccagcaaggc ctgtccaaat tcttcttggc ttacctgtat atcactcctt 91381 cttcctaggt atagggcagg aataaggggg catgaccacc tgttatcaga caaagtagct 91441 cagataattg cttttttttt cttttctttt cttttctttt tttctttctt tttttttttt 91501 tttttgagac agagtctcat tccgtgccca ggctggagtg cagtggtgtg atctcggctc 91561 actgcaacct ccgcctccca ggctcaagtg gtcctcctgc ctcagcctcc tgagtagctg 91621 ggactacaag tacacaccag cacacccagc taatttttgt atttttttgt agacacgcgg 91681 tttcaccatg ttgcccaggc tagtcttgaa cacctggacc caagtgatct gcacgcctca 91741 gcctcccaaa gtgctgggat tacaggtgtg agccaccatg cccgactgag aatttcttta 91801 tggccaactc ttaagcagaa ataaaggaag gctggagtaa tatgtctagg ttttgtggct 91861 ggctttaggg gagaggggtt tgagtttccc tgacccatct tgggaaagag gaattgagtt 91921 tctgtggctt gccttgggag aagaaagacg ggcaggaggt cagagaacct ttgctctgag 91981 gctgcttctg agggtttcca ctgtctttga gttcaaagta ctcgctatgc caaaacacca 92041 tgctctggga tatggttttc tgagctccaa caatgtaaaa atgttaaagg attgcaggtg 92101 cttaaaataa gtaaagtttc aggagttgaa ggatggtcct tggcaaactg tagagctcat 92161 gccccctgta gcatgtggag actctagtca cgtccaggtt ttgttgaaat gtggaaagcc 92221 gggtccagtg ttgccagatg ttcagaatct tcaagagata aatatgtatt aaagatttta 92281 tttaggccag gtgcagtggt tcacgcctgt aattccgaca ctttgggagg ccgaggtggg 92341 cggatcacct gaggttagga gttcgagagc agcctggcca acctggtgaa acgctgtctc 92401 tactaaagat accaaaatta gccgggtgtg gtggtgggcg cctgtattcc cagctgctca 92461 ggaggctgag gcaggggaat cacttgaacc caggaggcag aggttgcagt gaactgagat 92521 tgcaccattg cactccagcc tgggcaacag agcgagactc aaaaaaaaaa aaaaaaaaaa 92581 aaaaaaaaga ttttctttaa ctccttaatg aggaaactgg tgagatgttc aaaccagtcc 92641 aaagaatgga catcatgtat agatcaggaa tgttgaaatg gatgtgcaag tgggggcaga 92701 actggttttt ccacaagaag tctgtgtgtt cccagaacag tattcatttg cttgtgtgtg 92761 gacaagaatc atcacccatt gctattcagt tatctacaag gaagaatttg tgcttctgta 92821 tggcatacgt acctatggtc taagaagaat gtgtgttctg agttccaagc tacagaatca 92881 cagagtggcc agcccagagg ttcactcttt atctaagagg aacatctgaa cccttggccc 92941 catcctgtgg aacgcaggcc atacggggga tcaaggccct ttggggttaa attgaggttg 93001 ccagatggag gttgctaggg gaagggtgct agttgaaaat actatataaa ctgatgctct 93061 ttacaaatgg tagtggtcct cctgtccagc ccactgccac tagaccaccc cgtaagtccc 93121 ctcagtaaac ctgcctcatt cagtggctct gggtctcttc ttccgccttt caaacatggt 93181 gccatcccta ctgaagttaa taggggtacg gcacaacagc ttcaatggaa aattatgtgt 93241 tgtctagaca gcacaacctt ccaggttcta gcctttcttt gtctttgaga agtctcataa 93301 ctctgttaag ttgataccaa tgcaaaataa ttcttgccta taaataggta aattccgctc 93361 actggaggga cagctctcac cccggtcgaa aagccagttt tgcctccatt tgcacattcg 93421 tcttatcttc ctttttttcc agataaattt gcccttcttt gacttcactt ttggattgat 93481 tataacatct gcttggttcc ctcatgtaat taaataaaat atttaacacg tattcatttc 93541 ataattatga taatcatgac atactttttt gggaaaatta tcttttaaca gaataagcaa 93601 gtaaataata catcgatggt gtactgtgtg atgcagaaaa taaagcagag tattgaggaa 93661 tgcaggacat gttagtttag aagtgttcca ggaaaccagg gaggtggagg ttgcaatgag 93721 ccgagatagc accactgcac tccagcctgg gcgacagagc gagactccat ctgaaaaaaa 93781 aaaaaagtgt tccaggaaaa cccttctgtg aataaaaatg caagcacata ggccgggcgt 93841 ggtggctcac ccctgtaatc ccagcacttt gggagggcga agcgggcaga tcacgaggtc 93901 aggagttcga gaccagcctg gccaacatag tgaaccctcg tttctactaa aaatacaaaa 93961 aaaaaaaaaa tagctgggtg tggtggcatg cacctgtagt cccagctact cgggacgctg 94021 aggcaggaga atcgcttgaa cccaggaagt ggaggttgca gtgagccgag atggcacccc 94081 tgcactccag cctaggcgac agagtgagac tccgtctcaa aaaaagaaaa aaagaaaaat 94141 aaaaaataaa tgcaagcact ttgggaagtc gaggtaagag gatcttgtag cccaggagtt 94201 cgagaccacc cgggcaacat ggtgaaaccc agtctctaac acacacacac acacacacac 94261 acacacacac acggtgaaac ccagtctcta acaccaggag ttcgagacca cccgggcaac 94321 atggtgaaac ccagtctcta acacacacac acacacacac acacacacac acacacacac 94381 acggtgaaac ccagtctcta acacacacac acacacacgc caggcgtggt ggtgccctcc 94441 cctagtccca gctcctccgg agactgaagc gggaggatcg cttgagccta ggaggtggag 94501 gctgcagtga gccgtgatcg cgccattgga ctccagcctg agtgacagag caagaccccg 94561 tctcttaaaa aataaaaatg ctaaaagaat tattcgtgtt cttctgaaaa tcaaatttaa 94621 ctgggcgtct tgtattttat ctggcaaccg tacggtgagt ggtcttagaa gaccagcgct 94681 gggtggaggg atctggacta agaaatgttc cggccggtcc cgcgtggagg gggggcctgg 94741 ggtgggcgtg gcgccagcgg ccccgccccg tcacgtgtcc accgctcctg ccgcgcagtc 94801 agcagaggag agcgccagga cgctacagcg gctgaagagg cagtggcgcc cgcggccgca 94861 gcgtcggggc tggagcgatg gcggcgaccg cggtggcggc ggctgtggcg ggaaccgagt 94921 cggcccaggg tcccccgggc ccggcagcgt cgctggagct gtggctcagt gagtagcctg 94981 gcaggccttc ggcgcagccg ctgcgccgca cgtgaggcct cgcttcccgc ctcccggggc 95041 cttgcttcgg ggcgggcggc cgcaggggag gccggcggcc gggtaggggg cctgggcccg 95101 gcgcggtggg gaagatcgcc caggggtccc ccgagaggag ccccaagcat ccgccggggc 95161 agcggccccc ttctgagctc tgcccctccc cctccgccca ccccccaact tagcagcagc 95221 acaggacccc gacccccacc ccaatggcta cgtgacctgc gctttcccct gcgccaggca 95281 ctgtgctcac agcggcttag tcctgcttaa tcctcattcc caccccgagc tggcttctat 95341 tattaacccc actacggaag cggggacctt gaggcttaag gagacgcagt cgtgcaagac 95401 atgcagtctg tttggggctg agaaagatct ctatctggct accgcgtcaa aaagaaattt 95461 tttacctttt gtgtttactt tggccagact ttttttttta aacacacaca cacgtgtgtg 95521 tgtgtgtgcg tatatatata tatgtgtgtt ttttttttcc tttaagagac ggggtctcat 95581 tatgttaccc tggttgggct caaacccttt ggccagaatt cttctgcatg tctcctgtaa 95641 gcttcacagt catcttaggg gttattatta tctccgtttt accaatgagg gattgaggtt 95701 caaaggggat aaagaatatt ccccttgtac ccagctctgc taggtgccag gaacactgag 95761 atgcataaca cagggtcctt gctcaccaaa acatttttta gtgggtacaa gccggcagtc 95821 acatcacctt actaacggga ttggcgagga ggaggagggc gttcgtttca cctgtggtgg 95881 taggagttgg tttacctggg ggtcgtcaca gccgaccgca ttccaggcag aagaaactgc 95941 gtgtgcaaaa gctggggaca agagaatgtg gagcactccg ggaacaagta ggtcagtccg 96001 ctggagtgca agcctaaagc tcaggaatcc gggaaagggg ctgcacggct aattggggcg 96061 agatcttcaa gagccttgtg tgctgcagaa aggagtttat gttttatccg agggcggggt 96121 tgctgtttgg gagttttcag ccggagaatc tgatcaagga agcctgttaa aggcgtcact 96181 tgcagccata tggtgctaaa ttgggcggag agggtgagac ttcaggcagg gagagcggct 96241 aaaaagctct tttggtggtg gttgggggcc gcaggaggac taaggcagtg gggagaggga 96301 gaagcaatga gcaggcgtag tcagcagcct cggtgccttg gaatgcagac aggggacaag 96361 gagaaatcta ggctagatcg cgggtttcag gctcgaggaa ctggtgggtt acagatgcct 96421 cccctgtgag gaggagctgg tttaggcaga gaaggttgag atgtggagtt tgaggtgctt 96481 gtggcatagt caaaaaagag atgccctcta ggcagctggt tggcactgga gagggcaagt 96541 gtgggacgta cggtttggga gtctggttca ttggcagtga gcttgcggga gaagcttgcg 96601 tatattaaga gcagaggttt ggaaagagag cgtggggagc cccagcattg aagaggcaga 96661 tgggaagaga agactgaagg ctgagcagaa acaggcaggg ggagacctgc aggggcaccc 96721 actgatagca aaggagggat gtggttgtaa cgtagttttg ctccttaaag atggaggaga 96781 atgaggaagg agaagagaga actttgaatt tggtaaccaa ccagttattt aatgagacat 96841 ccttgagtta aacagtagag acagaaggta gtagttttag gaggcaatgg agaacagaag 96901 gcttagactg gagttgcaaa aattagaaaa aaaggctgtt catgcctgta atcgcagcac 96961 tttgggaggc tgaggtgggc caatcctttg aggccaggag tttaagacca gcctggccaa 97021 catggtgaaa ccccgtctct actaaaatac aaaaaatagc catgtgtggt ggcgcacacc 97081 tgtaattcca gctacttggg aggccgaggt gggaggatca cttgaaccct tggggcagag 97141 gttgtggagg tggagccaag attgcaccgc tgcactccag cctaggtgac aaggcgagat 97201 tctgtctcaa aaaaaaaaaa aaaaagtagc cggagtaaga gaaagtagct gcctcaggtg 97261 gaggagataa aaggagtgcc tggaactgag gctgcccaag gacacagttc ttcaactcca 97321 gctttttttg aaacagagtc tcactgtgtc acccaggctg gagtgcagcg gcatgatctc 97381 agctcactgc agcttacccc tctcctgggc tcaagcattt ttcccacatc agcctcccca 97441 gttctagctt tttttaaatg tatttttatt tatttattta tttatttttg agatggagtc 97501 ttgctctgtt gcccaggctg gagtgcagtg gtgccatctc ggctctctgc aacctctgcc 97561 tccttggttc aagcaattct cctgcctcag cctcccaagt agctgggatt acgggcacac 97621 gccaccacaa ccggctaatt tgtgtgtgtg agagagagag agagagatgg agtttcattc 97681 ttgtccaggc tggagtacag tggcacgatc tcagctcact gcaacgtccg cctcccaggt 97741 tcaagcaatt ctcctgcctc agcctcccaa gtagctggga ttacaggcat tagccaccat 97801 gtctggctaa tttttgtatt tttagtagcg atagggtttc accatgtttg tcaggctggt 97861 ctcaaactcc tgacctcaag ttatctgcca gtcttggcct cccgaagtgc tgggattata 97921 ggcatgagcc tggcctaatt tgtgttttat tattatggtt tttttcatgg catatacatt 97981 gtattaagct ccagcttttg ggggcagtgt cccaaagttg ctagatcttc ctgtttttca 98041 ggaacggcta gaacctatat tcttaagtga aatatcgtgg gttttcagaa gttggtgcct 98101 actttggccc ataatttggg gaaggccagg cagaataaat gtgtggggag ggtgcagcca 98161 gtggcctcct cagctgtttt tcatgagtct tgaatgtaga aggaggggga gagaatagcg 98221 agagggaatt taggagtaaa ggagattatt agaaggagag ggggacatgt gagcccctct 98281 tcatgttgat gttccattgg ggaactgccc ctcccccatt ctgggtccag tgtcccatcc 98341 attgcagagg ggcctgaagg tgctgaagga gctcagagcc agagcaaaaa gggggacctg 98401 gcctcacaga gaggaaggac accttttgtt tttctgactg tctggcgaag gagatcaaga 98461 tgattgcaca tgcaaacaag ttcgtcagtg ccaccattgc cacctgagta ttgggtgctc 98521 aagtggaaca ggggacttga ggaaggtggg gaagcgttgg ggagtggctg gtgaggcaaa 98581 ccgaagtggg cccacccgga cggagagctg ggtttctcaa cctttgcacg agtgacatct 98641 tgggcccgat aattctgtgt tgtgggggct gacctgtgca ctgtaggatg tttagtggca 98701 tccctgggct aaatccactg gataccaaag ctcacaccct tcctcccagt cataacagcc 98761 aaaaatgtca ccagatactg ccatgtttcc ccagggttga gtgggatggg atcactccta 98821 cccatctccc cgctgagttc ctgagtgagg actgcagaat gctgactgga catcaggaat 98881 gtgggttgca gtcttcatgg ctgtatttgt tgttgttttc ttctgggagt aggagcagag 98941 aagatgaagt gaacgatggg ttaagtcaga tttgttgggg atggtgggcc attggtgctg 99001 caatggaggg ataagggggt cgtgggattg atagtatggc caagacatgg gtgtagttga 99061 aggcaaaagc tcatgggtct gagctacatg aagtcaccag ggggtggtgt ctgaggactg 99121 gccaagatca ggtccctgca aacaaggcag ctgtatcttt aagatgggaa gagagtaata 99181 aaacctcttc ttagggttgt tgagagaatc aaaggcttta atacacagaa agcacttaaa 99241 atagtgcctt actatgcttg tagtaagtgc ccaagaagcg ctagctatta ttatcattag 99301 gcttttatag ctgcaagtaa ttgaaactaa ctcataccca tacccgctta ccaaaaaaag 99361 gaaagtaaag aaagtcccag agtaggggta ggctcagtcc aggagctcca cggttcactt 99421 ctttgattga attttctctg ctttgctctt ctccctactg caatctcagc agaatttctt 99481 ttccccattg atttgcaaga gttattgttc ttccgcctgt tggcttgggt tttatgtcca 99541 ccttggagtc agcatgacct ggggtcatga aatatgaaga tggccagacc tggggtgggg 99601 tggggtgggg gacagtaccc aacccagagc caaaggtggg gagggtgaat gggggacctg 99661 gaattgagga gaggggttgt tcccagagga gtatcaggaa gccgttacca gaatctcatg 99721 ccttaaaagt acaaacggca gcgcctgtgc attctgtcat tcagttctgt ggtgtttttt 99781 tccacttcat gtattttggt cattattcca catgactgca gatgtagccg ccttgttctt 99841 ttaaatgaaa agccttagac gccccgttgt atgggtgtcc taggatccat ttaaccgggt 99901 ccctgttggt agacatctta ggtgttgcta gccttgcgct attaaaaaca gcactcagtc 99961 ctcttgggct ctagaattct agctctggta gcgctaccag ggaggccatg gcttagtaaa 100021 agttgggtgc tcagtaaatc tgtttcttcc ttccacttgg tcctactccc aggtgatgag 100081 taggacattc ccatgctgag tcttggaggt tttcacatgt gctgaattgt atgattgccg 100141 gactcttggg gtccacgtgt ccctcccagg gaaggagtcc ccctgcagcg tggactgctg 100201 aaggattgca ttctgctcct cggggtgacc acttcctcac agtcttgggg ctccccttgc 100261 tagcacaggc ggctgctggt gccagggtca tgaccccggt ggttggttgg cagctgacgt 100321 gggatggttt ggcttctgaa ttagtcactc ttggatgagc catttcactc acagccggag 100381 gtctcagctg ttttcttaac acctctggcc ctgtagcacg cttgaaatga atcatacaaa 100441 cactgctttt agtgaggaaa tacatggcag cagtgctatt tctcagcacc cctctgtcct 100501 gctcaagtgc cagctataaa taaaccacct tggagtctgg gatagcccaa aacagagaag 100561 gcatcctttg aatttctccc tggccgtgcc ctgtctggcc cctagctagt ccttggagca 100621 gattggtgtc aaacagacct acggcggttg ctgtggcctg gtcagaccca gctgctggcc 100681 ccctgggggt cattttttgg agctgctcaa gtcttctgaa atagtttgtg caggctggga 100741 ttatgtgttt aggggcccag ttatttcaga actgagatag ccatctctcc gtgttatcag 100801 tatgtccagc cagctggcca aacactgtct agcaaaagag gaatcgttca ttcagtagat 100861 acctgccagc aactcttcta ggtactagag cctgtgagag tgaatgagac aggcctgctt 100921 tttgccttca aggagtttat gagcacgaag atcaagctgg cttccaagag aggccctaac 100981 tttgtcaagg aactgaagac ctcaccgcaa gagtggttag aaacacatcc aggctgtgga 101041 gctgggccag cctatgactt aatttttgtt tctgccactc ccttgctttg tggttgggta 101101 ggtttcttca tctgtaaaat ggtgtagatg atcgtgccaa cctggattgt tctgaggaca 101161 tctggcatgg tatctgaccc atagtaggta cttttttttt tttttttgag acggagtttc 101221 actcttatta cccaggctgg agtgcaatgg cacgatcttg gctcaccgca acctccgtct 101281 cccgggttca agcaattctc ctgctttagc ctctcaagta gctgggactg caggcatgca 101341 gcaccatacc tggctaattt tgtattttta gtagagacgg ggtttctcca tgttggtcag 101401 gctggtcttg aactcccaac ctcaggtgat ccgcccacct tggcctccca aaatgctggg 101461 attacaggca tgagccacta cacccggccc catattaggt acttttaaaa atgacggtta 101521 ctgttacttt tgagccatga taaatctttt ttccccccat aggaaaatcc gttttccttt 101581 tattcagtat gcagtggagc taaagtgcag ggcacgaagt attttacaca acattttaag 101641 ttcttgtgta taggttttca tggaaattta aaaatttaac aatgatggga taatagataa 101701 taaactcatc gtaagtacat tatcactagg tttcagcaat tcttattttg ccatttattt 101761 tcaactatca atccaaatat cctagtagtg ttttaaatca aatataagat ataatatcat 101821 taattttgga aacatttaag taactatatc ttacaagtaa agataaaaaa gaaaatatca 101881 ttattacaac taatacaatt tttaatatat aaaatagatc aatttgtgta ttcaccaaag 101941 agtagactct acaaccactg aaggctttgc atgaaatgtt ttaacatgtt tcatctaata 102001 tttcatcttc agacacaaaa aatcaataca tgaaaatatt caattacgta taaagggcta 102061 tagactatgg ttggaggagg aaatcgcaaa tcgacagtgg gatattccaa accactgtat 102121 tgtgaaagca aacacaagaa agaaaatagg gattcctatt gttcccgata aaaggaaagg 102181 aaaggtaact cccagagagc cactccctca ctttgccata ttcaaatgtt ctgaacaagg 102241 cacaggtctg gccataccca aggaaagatg aggattcagg gtgtaacaaa agacagcaga 102301 gatcatgggg cgtcttagga ttctgccccc aatatccacc cttccacaaa aacttaatct 102361 gacatctcac attggtgggg atctggtgtt cttgtaagaa aaaaagaaag tctcagaaag 102421 tgtttccctt gtagactctg gggcagttga tcctcagcac tttctatttt ggaagccaga 102481 aaattccttt ttgactgaga aggggtgcag tcctgtgcac cataggattc ccattgttga 102541 ctgcatgtca gtgggagcca ccagctatta caaccagaaa tgtctgggca aacttcctca 102601 ggatgcagaa gctgctcccg tgagaaccgc tgctcctctc catatgtgaa cagactgaca 102661 gatgcaggca gcaccagcaa tctctttaac cactgaggtg cactggaaaa tgtgagttgc 102721 agtgtgtccc tgtgttagaa ttattaagag gcttttgttg acaattgctg agaccataca 102781 atgcaccaca gcaagattag ctgggtaaat gatcttctga taaaggtttg aacccagatg 102841 ttcctctttt gttgttttcc tggaagtggg taaattcagt tttcacttcg gtagctcccc 102901 taacaatccc aagaccaggt cttttactgc ttccctaggt ggcatgacca ttatcggcct 102961 ggggtaagat gaactgattt catctcttcc tttataatca tgacgtaggg gcctcatttc 103021 acatgcacag attccctcca aagagtagat gagcaggcct gggccaccct ggtgcttgga 103081 gtaatgacag ccttgccccg atgccactct ctcttcccac cattgttgag ttttgggcca 103141 tcatgcattc agtgcctcag tgggcagaga accataccca ggtgctctca caaatctgga 103201 gccaattcta aagttttttg gactccagct cagctcagac cactgtcaca tgctagggac 103261 tgtgggtgac tgtggacatc actggcacct gcttgagact gtgggtgatt gtggactctg 103321 agcagggatg tgctgtgtcc atgtgacact ccttctcccc aagagttgtt caagaaggtg 103381 gtgccctagg aatgccatct ggacccagtg gaacaagaag ggcagaaagc acctggcacc 103441 cactgttcat accaccatca cccagttcct tcaagtggct aacttgaaga tgtttcctct 103501 cgatgaaaat cctaaacaac aaaaaaagaa aatcctaatt ttgttgtttc tttcagcaga 103561 agtgatattt tctagtcttt acagttgaga agtttaggat cttataatgt taatttctct 103621 ctttaagatg gcattagtgc accgaatacc aatgaaattc atgctgacag ccctttatag 103681 cagcggcatg ttgatgcaag tctttgtatt cacacggcca tgacacagcc tccagtttct 103741 gtgctttatc tttgtgagga aaactaaata gaaaatacag gtcacggtca tttgccaagg 103801 cacacaggtc gaagtaatat cttgcataaa cactggcaga cacatgaata ttaaactcaa 103861 gtagctccaa aaaacacttc tccatcttgc tcatgttctc aactgcaatg tccttgggat 103921 tctggctgtc atccacactc cacagaccat gatttctcca aaccttggag gcaagaagca 103981 tggctccaag gacaatcttt ttccaattag taggacacag agcgatgcta ccattagtta 104041 aaagtcgttc gatgtacacc aaagctacga ttgcacatgg agctgtagtt ttatgacttg 104101 aaaaagagta cagaaatgtc tgaaagtaca gttgtgtttg ggatcatgct cgaaggattt 104161 ccctggaacc ttctcttgtg aaagtgggtg tattggcttg tcaaaaatag ccagagatct 104221 atttgcgtat ctgtgcttta tgttgtaata tattgcccaa gtcacacttt ctaatgtgtg 104281 tcaaagatca ggctggctga ctgttctgtt atctagcaat atggtgtaac acgaggtgta 104341 ttttctatca agatgccatg gtatatataa tataggtgag agctcctctt ttcttgcaca 104401 ctcatttgat attttctcaa aaaaagcgta ctttcccttg gatggtcaga agcaaaatat 104461 ttgggcatct cctggtcact gatgtgctgc aggtcatggc ccttgcgggc tcctaaatcc 104521 gactgggcag gctccacagc agtggggtgt ggcgctactg ccacctccgc ctgcgtgatg 104581 ttatagctgc agggcggttc cacccgccac accctgcact ggctttgctt ggggctggca 104641 ttggggagca gtatgttcca tctgcgacgc ctctgctcct ctctgccttg gcctgcacct 104701 ttgcttgacg ctgcctcata gatgtcagca ctgcaggaca gctccccagg ccccgcacac 104761 tgccccgctt ggggctggca ttgggcaaca gcgtgttcca tctgctgtgc ctccactcct 104821 ctctgccttg gcctgcacct ctgcttgcca ccgcctcata gatgtcagag ccacagtaca 104881 gctccgcaga cccccctggg ccgaccctac ctggggctgg cattggtgct caagcaacag 104941 ctcagcatgt tccccgtgag gcagctcggc tcctggctcc tctccggtag cttgggcctt 105001 cccttgcttc caagtctgca gagctcagca cttatacctc ttcccagcaa aggcacgctg 105061 gcagggccag cccttggctg tggggctgct ccccactagc agcccgcacc ccacctcagc 105121 tgcctcccct gagctgcctt gtctacaagg gtcccttggg aacctgtttg gtttccttag 105181 aatgggtcta gatacaacaa gcctgagagc tgtgattgcg gagaagagag cacagaggga 105241 gctcctcttc cttccactgc ttcccaccaa ctgaaggagg ttgccctgcc aaccaccaga 105301 gctctctcct ggttcccagg aaccaatctg gcccagagtc ccctccacat gccctgttcc 105361 catcgtgatg cgggagccta ggcacgtggt tcaccaggcc ccgccctgcc agcggcccca 105421 cccctacctt tcattcattg ctggctgctg ggagctttca ggtttctaca ctgtgaggaa 105481 ggtcttatgg tttcaacaaa tgaagaggct tatgtaaaag aaaactgagc cgggcacagt 105541 ggctcacatc tgtaatccta acactttggg gggccgaggc gggcggatca cgaggtcagg 105601 agatcgagac catcctggct aacatggtga aaccccgtcc ctactcggga tcttcccgcc 105661 tcggcctctc aaagttctgg gattaccaat aaatcttgat tgagtcctca tgatacaagc 105721 tctagcaaat aagaaagaaa cctccttaca ttttgtggat tgtattgctg cttggccttg 105781 gttaagggtt tggaacgtaa tttaacctca gagaggtttt ttctttctta aagaaagctg 105841 gtgccggccg ggtgtggtag ctcacacctg taatcccact actttgggag gctgaggtgg 105901 acggatcacg aggtcagtag ttcgagacta gcctggccaa tatagtgaaa ccccatctct 105961 actgaaaata caaaaaatta gccagtcatg ttggtgggca cctgtgatcc tagctactct 106021 agaggctgag gccggagaat cgcttgaacc cgggaggcgg aggttgcagc gagccgagat 106081 tgtgccgctg cacaccagcc cgggtgacag agcgagactc catatcaaaa aaaaaaaaaa 106141 aaaggtggtg cccgtcagtg ctgtgtctct ttcacttggc agccagggag gagtgagaag 106201 gctctggatt ctgtgagggg agcttgggcc atgcctcctg ttctctgcct ctcctgttct 106261 ctccctgcct ccccctgttc tcttcctgcc tctgctcttc tctccctcca ggtccagtca 106321 catgctgaag tcagaattct tttgaggtgt caaccttgac cttgtagtga actgaaacag 106381 aagcttaaat ggtgtagact ttaggtgatt cagacacata atcgctgaga atcactgctt 106441 taaataattt atttgatgag tttctatgtt gtacgtatgg gtaaggcact gtagggaaga 106501 tatggcttct gcccttaagg tttcattctc tttaggtagg acaagacatg tctacatatt 106561 acagtaaaag ccagagactg tacgtaataa taaacaatta tcctttgtag acctcttact 106621 gcatgctgct gtactaagca ttttagaccc ttccccttta aatcacatga cagtcccatg 106681 aggaaggtac tattttcctt gttttatttt ttgtaaacat atttgtagag atggggcctc 106741 actatgttgc ccaggcttgt cacgaagtcc tgggttcagg cgatcctcct accttggcct 106801 ccaaagtgct gggattacag gcatgagaca ctgtgcccag actgttttta ttattatttt 106861 tgagacgggg tctcattttg tccccccagg ctggagtgca gttgcacaat catagctcac 106921 tgtagcctcc acctcctggg ctcaagtgat gctcctgagt agctaggact gcaggtgtgt 106981 gccaccatgc ctggcttttt ttttttttgg agacagagtc tcaccttgtc acccaggctg 107041 gagtgcagtg tcacgatctc agctcactgc aacctctgcc tcccgggttc aagcgattct 107101 cctgcctcac cctactgagt agctgggatt atagatgcac gctgccacgc ccagctaatt 107161 atttgtattt tggtagagac gggatttcac cgtgttgccc aggctggcct ccaactcctg 107221 agttcaggca atccaccctc ctcggcctcc caaagtgcta ggattaaggc atgagccact 107281 gcgcccagcc ttctgactaa ttttttttaa aaatttctgt ggagattggg tctcgttatg 107341 ttgctcaggc tggtctcaac ctcctaggct caagcggtcc tcctgccttg gcttcccaga 107401 gtgctgggat cataggcatg agctgcagca cctggcccta ttttctttat tttatagttg 107461 aggaaactga ggctcagcga gtccaagtac atgtgactaa tgggtgtagc tggagatccc 107521 tatcggagaa ggggcagata aaatatctgt tcagaggagg aggagacagc ttttgtttta 107581 tggggacggg agataaaaag gtcagagaag tctttagggt ggaatgtgac ttgggactca 107641 agggtagata ggattttaag atacggaaca tgggagagag ctttgcgggg gtggggagga 107701 gaatatagga atcaatcact gatccttccc tcgcccttgt gttctctctt cacttagaat 107761 ggagtcagtg ggcaggaaga tggtgattac ctgtccgttg accctcccta tctagacaca 107821 tgacttccac gtgacataaa tgtgactgca gtcaggaaaa attcaactcg gactgtgctg 107881 tgttgagtaa tcagagggag aagcagtggg gttggcgcag cgggaggaag ttttctgcag 107941 gagctggaaa ttcagatggg tctggaagaa tatggatagg gcactgagag gaacagagtg 108001 gacactcggg ccaggaggat aataatgtca gaagtactgc cccgagaagc aacgcctcat 108061 ttccagagcc taaaagagct gctctgtcag aaccagtacg gtttctcatc agagaggctg 108121 ggcaggccag gaccagatgg ccagagggga gttggaaggg tggttgtttt ttttttttgt 108181 ttttgaggca ggatcttgct ctcccaccca ggttggagtg cagtggcatg atcttggctc 108241 actgcagccg cagtctttcc acttcagctt cctgagtagc tgtgactaca gtcacatgcc 108301 accacgccca gctaattttt tttttttttt tttagatgga gtctcgctct gttgcccagg 108361 ctgagtacag tggtgccatc tcagctcact gcaacctcca cctcctgggt tcaggcaatt 108421 ttcctgcctt agccacctga gtagctggga ttacaggaac actccaccac gcccagctaa 108481 ttttttgtat ttttagtaga cacagggttt caccatgttg gcaaggctgg tctcaaactc 108541 ctgaactcaa gtgatctacc tgcctcggct tcccaaagtg ctgggattac aggcgtgagc 108601 catcgcgcct ggtcttaatt ttttaaaatt ttagtaggca tgtgatttcc ccatgttgcc 108661 catgctggtc tcaaactcct gggctcaagc gatgcacctc agccccatga agtgctggga 108721 ttacaggtgt gagccaccat gctgccactg gaagttttat gaacggtcaa gatggtattc 108781 cttctggtgg tgtgttcttg tcttcctgtc acacagctga gacacttcca ggtggcctct 108841 cttctgtctc cgcttgtcac aggccaggtg ggggcatccc aactcgagtg tgaagccaac 108901 acccgggcca acctctgcac taaaagtgga gccgctttca ggggaataac ttgtcagtta 108961 attttttttt tttttttttt ttttttgaga cagagtctcc ctctgtctcc tgggctggag 109021 tgcagtggcg caatcttggc tcactgcaac ctccacctcc caagttcaag cgattctcct 109081 gcctcagcct cccaagtagc tgggattaca ggcatggatc accatgcctg gctaattttt 109141 ttgtattttt agtagagatg gggtttcatc atgttggtca ggctagtctc aaacttctga 109201 cctcaaatga ttccccccac ctcggcctcc caaaatgccg ggattacagg catgagccac 109261 tgtgcccagc cagcttgtta atttgctggt tcattcaatc agcaaatatt cagtaagggc 109321 ataccatgtg cccaatgctg tgctgggtgg tggtaggcag cgcaggcaca attcctctcc 109381 ttccagctgt ctgcccactg gtctgatcct gtagatctgt gataagacag ctcttggcca 109441 ccatgtgtat ggggaggggt gtaaatttct tcctattatg tactctctaa ctttattatt 109501 tatttgacag acaaagccac agacccaagc atgtcggaac aggattggtc agctatccag 109561 aatttctgtg agcaggtgaa cactgacccc aatgggtaag gtaacttctc acacacatag 109621 ttgactctgg gggcacccca cactgagtgt aggcctttgt ctctcaagac tgttttaggg 109681 ttaagctctc taggcaagga tcggtggctc atgcctataa tcccaacagt ttgggagacc 109741 aaggcaggag gattgcttga gcccaggagt ttgagacctg cctgggtgac agagtgagag 109801 actctgtctc aaaaaaatag aaatgcagtc aggctggcat tcattctctc attttcccct 109861 ataatcctga gaacattatt tttctaaaat tagcaaattc atgtggggga aaaaaaacac 109921 aggatagctc acttgcaact ggcagttatt aattttaaat actgttataa aacactaatg 109981 ctgaggagaa ctctgaatgt tactgataga ctatagacac gttacctaat taaaagagat 110041 ctgtgtattc tccaccattt aaatgtaatt ttaagaaaaa ggaaaagggg aaggttttat 110101 tgaattctgt ttccacagca tctggatagg acctgaagaa tcagaatcaa gtgagtcatc 110161 cagatgtttt gtttttgttt tgttttcttt tcttttcttt tttttttttt cctgagacag 110221 agtgtctcaa gactggagtg caagggcacg atctcagctc actgcaacct ccacctcccg 110281 ggttcaagtg gttctcctgc ctcagcctcc caagtagctg ggattacagg cgcgtgcctc 110341 catgcctggc taattttttg tattttttta gtagagatgg ggcttcacca tgttggccag 110401 gctggtctcg aactcctgac ctcaggtgat ccacctgccc tgacctccca aagtgctggg 110461 attacaggcg tgagccactg tgcctggccc atccagatgt ttaatctgtg tggtgtcgca 110521 tatttggcca gtttgaaccc agagtttgat cgcactttct gctctagcat ttcctaggta 110581 tttatcacag ccatacaggg cctgtgctgc tgtggctact catgggactg ggctgggact 110641 ggcagcttgg agtttggggg ctgggggttg gctgagggga gggcagtgtg ccctgggcag 110701 gaggaggacc ttgggtgctg taggtgcaac cagggtgggt gcgaggtggc cctctccagg 110761 gcccccgcgt ggcctgtttg tcaggcacgc tccacctctg cttttggaca ccagcccctg 110821 gaagaaaagt gattttactg agcatgctca gccactcggg ttagtttctt tttgcccgcc 110881 cagagtctag gtctttttgt ttccctgtag ccccacacat gcgccctggc tactggccca 110941 caagatccag tctccgcaag agaaggaagc tctttatgcc ttaacggtga gtttggcttg 111001 ccttgtagcc taacctttcc tgtccttgtg ctatagagag ccgagaggcg ctttgcttcc 111061 acacagatcc ttttcactgg agaagggagg tcccctgaga ggtttacctt cacagcaggt 111121 tggaggaggg agatctgggc cagggtcccc acccttcttc ctcctgtttc tcgtctgttt 111181 cttcagggca gcattgagtc tactgtggga gaaggaaagg aaggttttat ttcatttgcc 111241 cttcttagac cagggctggc aagttcatgt gccttttcta gagagggcag ttctagcaca 111301 gggcccacta gctgttggca ggtatgagta tgggctcagg gttgctagtt taggtctttg 111361 tatatgtttg agtgtttaga tgttggtgac taatttgcag tgttgaaagc cacagtatgg 111421 acgagaggag gggtctgcag gctcagcttg cctcttgagt gccagtgggt gacatccggc 111481 tggtcttggg cactggatac cagatctctg cctctgggga ggggttggtt ctttagccac 111541 tttctcctct caaagggtct ccttaaaagg tggggtgacc tcagggtttg acatacctca 111601 ctaagctatt tcctgggcat aaacctcagg cagttatagc aagattgaac aggggacatt 111661 tttttcttga tctgaaaacg tcctcttcat caaaacttca gtgtcctcat aggcttgagg 111721 cagggttagc cttggttcca gcgcagtctc cagcaagtga ttaccagcct ggaggggagc 111781 ctgggaccac tggctttcca ccttctcctg ggggtcctgt gtctgtgcag gtgctggaga 111841 tgtgcatgaa ccactgtggg gagaagttcc acagcgaggt ggccaaattt cgtttcctga 111901 acgaactgat caaagtgttg tccccaaagg tgagtacctg ggcttggctc tgctgactgt 111961 gtcgcaccta cgcccactgt caaactcctg gctccctcct caggccagag gcaggcccac 112021 gcatggacaa aacctgctcc tgcctgtgaa agaagcgctc tcctctccaa gctctaggaa 112081 gaggctctgc tcattagctg acccctgccc ctcactgtta gcattttcac atttcccttt 112141 taagaaggcc tggggaacgt gtggcctaga ggagatgagg aactccagct tggaggcaca 112201 ggttcctgat agagacagtg agagatccca ggggtggcgt gaaagtttcc tatcacctcc 112261 tgccttgtcc tctgtacctg tcctcctggc acctgtctta gctcagccag tctcaggtaa 112321 ccaacattga gccctgcctt gttgtgttcc cacaacagtg ccgatgctgg ccatgggtcc 112381 tacagttcat ttcagttctg acactcccct gagttgcatg ctacccggac cccacgcagt 112441 gagggctcag tgctgcttag gcttcctgtg tttcagctgc cagccgtgag tctcatgtgt 112501 cctcaagctg ctgcacttct gcatccccag ctgcagattc aggggttcct atgaccccca 112561 ccgccccata ttccataatt ccctagaatg attcacgtac acaggaaagc actatacttt 112621 ctatccctgt tttattgtaa aggatacaaa tggaagcgat acataggcaa ggtcttggtg 112681 gggcaagagg ggacagagct gctgtgccct cccctggact ctgagcttgc catcctccca 112741 acgtaggttc aactgagagg ctctctgagc ttccttgttt cagtgttttc atctgaggtt 112801 tttttgtttt gttttgtttt tttacacatc attgctggct tgaactagca ctaggtttta 112861 ttacctaggc atgatcggtc aaatcattgg ccacatagtt gaactcaatc tctagttcac 112921 acccccctgc ccgccccacc acctccacag gtggggggat gtgggggcac cggtcagggg 112981 atggggctga aagtttcagc cctctgatca caaggttgat ctctttggag tggccagccc 113041 ttgaagctct ctaggcactc actgtaagtc attagtgaaa actcaggtgc accctgcgag 113101 tgtgtctctt gaataacaag acactcttgt gactcaggaa attccaagga tttttgaagt 113161 tctgtgccag gaacggaaac aaagaccaaa ttttattatt atattcttgt ccaaggcaaa 113221 aaatgactgt ccccagtggt gtagaacagg ctctgaccca ggctcttgca tggtggggtt 113281 tagtgtgagt attgccgaga tgaattcctt gaatcccagg aagggctccg gggctgggcg 113341 ccagatggtg tggcctctag tctctcctca cactcatacc gcgcctggga gcttgtgtct 113401 ctgggcctca gcagctacct ctttagttaa agtgtcggtc tcccaagtcc cttccagctc 113461 tagaattctc tttccttttt actttcagta cctggggtcc tgggccacag gaaaagttaa 113521 aggaagagtc attgaaatac tcttcagttg gacagtctgg tttccggaag acatcaagat 113581 tcgagacgct tatcagatgc tgaagaaaca aggtctgact tgccttactg tgtcttattt 113641 cctcttgact aggcctgtat cttcttgctt tttcatctgc cttataattt tttgttgaaa 113701 gctgtacttc ttatgtagga ccatagacac tgaggtagag tttttttttt ttttttttac 113761 gcaataaatc ttttatctac ttacctttaa aagccatagt gcaaacctgg aaagatgatg 113821 tagggtttgc ttttttccct gttgcctgga aatcatagga aaccacagca tgaaaaaggt 113881 caaaatacaa gctacctatg tcactaaagc acttatttat cttttaaaag acttgtttgc 113941 ctaaaaacta accttttggt tttttttttt tttttttttt ttgagacaag gtgttgccct 114001 gttacctagg ctggagcaca gtggtgcagt catggctcac tgtagcctcg accttctgga 114061 ctccagccat tttcctgcct cagtctccca agcagctggg actacaggcg tgtgccacca 114121 tacccaacta attttttcat ttattattat tctcccaccc acaccccaac acacacacac 114181 aaacacacac actctcttcc tctctctctg tctctgtctc acttgctgtc tcacttacac 114241 caaatatttg tattttttgt agagacaggg tcttgcaatg ttacccacgc tggtctcaaa 114301 ctactgggct caaatgatcc tcccacagcc tcccaggcgt gagccaccat gcccatcttt 114361 aacctctttt cctgaaatag tggaacctta ctatgaagta cagataaatg ctacctatga 114421 gctagagttt gaaaaatact attgtgataa aagtaacaac gtgcagacat aaaatatttt 114481 atagttacaa aagatctttc agtaatcatt tagtggattt atttttattt acttcttttt 114541 tttttttttc ttttgagaca gagtctcact ctgttgccca ggctggagtg cagtggggcg 114601 atctctgttc actgcaacct ccgcctcctg ggttcaagta attcttgtgc ctcagcttct 114661 cgagtagctg ggattacagg cacctgccac catgcctggc taatttttgt atttttagta 114721 gagatggggt ttcaccacat tgctcaggct ggtctcaaac tcccaacctc aaatgatccg 114781 cctgcctcag cctcccaaag tgctggaatt acaggcgtga gccaccgctc ccggctgtgg 114841 atttattttt aattgacaca tacttataca catttattgg agtacaatgt gatattttgg 114901 tacatgcata caatgtgtag tgatcaaatc agggtaattg gcatatccat cagtgaggta 114961 aggagttttt aatccctgga attgggtaga gctttccttt tggtagacag gatttgagct 115021 gggtgtgaga tttgttgttg ctgtagttac tatatcagac ttcagattcc tctagtgtta 115081 gctgatgttt aggttggggg ctagctagtt tgttagtttt cttcagtatc tgttctctca 115141 gctgtaggtc ttcactttcc tctgcacctc agagagagcc tttctccttg ttcttgccac 115201 tcgtagcagt agacagccat cccttgtcag agggataggg tggcgggcag ggccagtctc 115261 tgttctgatt cagcctggtt cttaggcagg cactgcatcc tgggtcttgg taggatctgt 115321 ttgtgatcct gcctcacccc agtgtagatc tgggcccagg acataattcc tgctcctcct 115381 ccaagggtac tagggtttgt ttttgttttt tctcccctgt tttttctcta gctgcagtgg 115441 gttccaccgg tgctctaagg ctgcagggtt tgttactctt ctccccctaa tcccgccttc 115501 aggccttccc gctgttttcg tgagcctctg gagaagagct tattttcttt ttgttgcagc 115561 ctccagaaat tctgtattct cttggtagct cacactggcc tgtagacaat ttttaaataa 115621 ttctagctta gttgttacta ttggtattgc cttttgtcaa tccttggagg ttgttttttt 115681 ccttagattt caggttactt ggttgttgcc ctgtgatctg ggctctctga tgtattcagg 115741 gaagctgtag gctgggcatg gtggctcacg cctgtgatcc cagcactttg ggagtttgag 115801 gtgaggaatc acttgaggcc aggagtttga gaccagcctg ggcaacatag cgagaccccc 115861 atctctacaa aaaatttttt aaaaaaatta cctgggtgtg gtggtgccca cctgtagtcc 115921 cagctacttg ggaggccaag gtgagaggac tgcttgacct taggagtttg aggctgcagc 115981 gagccatgat cacaccactg cattctagcc tgggagacag agcaagaatc tgtctcaaaa 116041 acaacaaaaa tattgtgaat ttgcacactt tccagtgttt tgttgttgtc acgatggggg 116101 tgatagttcc cacctttgtg tgtcctgttt ggaagatggg aatctgaatt cctttttatg 116161 ttgggctctt aaaagaagga actcaggttc aggatggact gttgttccct gtcacataca 116221 ctcagcattt tcaggagctt catcagggac cctccagaaa cttttgatcc ctgggaggtt 116281 atgtagctgg ataaactgat ccatgctttt acagcataag gcccacagac tgcaggcctt 116341 caggactggc tcttctctga catagttggc tctgctgtcc cggccaccat ggtcccttgg 116401 attctcctaa gagatgatgc aaaacttcgt tcatacactg ggtgttgtct ttgaaggctg 116461 ttgacagtgc tttgctagca tggtattagg aaccaggggt ttccagactt tatactggat 116521 tctgactgaa gtttctgggt accatatcgg tgtcttaact ttgtaaattg catttttagg 116581 aattataaaa caagacccta aactaccagt ggataaaatc ttacccccac catctccctg 116641 gcccaagagc tccatctttg atgctgatga agaaaagtcc aaggtaagga aacagaccta 116701 catggtgtct gatcagacct tttctcttag gcactgtaaa tgccctttga tgcttgaatc 116761 ggagtatgga ggggggcagg ttacatgggt tctcctctcc agacaggccc ttaggagtga 116821 ggtagcagaa ggtttgggtg cgttcttcat ggataagtca actaataagg gccagtctgt 116881 tttgattcag cctggttctt aggcagacac tgcatcctgg gtcttggtag gatctgtttg 116941 tgatcctgcc ttaccccagt gtagatctgg gcccaggaca taattcctgc tcctccttca 117001 agggtacttt gtgccctctc ccttttgtgc acagccttcc tgggggaaag aaggtgcaga 117061 aggatagccc gtggggagtg tgggtgatgg atgggtcatg tcctgtgtgg gtgtggtgtt 117121 ggggtgggga cagtggaaga acttgctgct gtgtgttgct tcagctcttt ctcagtgcca 117181 gtgagcccgg ccctatccat tttccatcca ccatgaactc cccaaaccgt tattccttgg 117241 acatcagtat ctagagggac aagaagaaga aggtggccaa gacccaggga taactgtggt 117301 gcctgtgagt ggggagccct ttggcctagt gctggcggtg cttttggtca tgactgttct 117361 tggtcagtca caaggtctag ttcatggagt gttggaaggc tacacgctgc gtactcgttt 117421 ctgcgtggta tgctctgcct gtcaactgca ccctctccct gtacccactc cttgggccta 117481 gatactctgt ggccatcagg tccctgttgt tagaatgagc aaggctagtc taggttccag 117541 gccacctttc tccgctttcc tcctgcttgg tgccttaggt ttgcagaggg tgggaggagc 117601 taggaaggag gtatggtgtg acttctgaga aacaatcctg tgtcaccagc cctcattatc 117661 cccacttttc ctgccatccg tcccacaaag attgagtgac ttctctgtgc caggcatcct 117721 gctgggcgct ggaaatgaag ggatgaacaa gatcaccagg gtctgttgtg tcataaagca 117781 catggtccag ggcaactgca gacctagtcc ccttttgggc cagagaattc tctgttgttg 117841 gggactgtcc tgtgcactgt aggatgttca cagcattcct cactctaccc acgagctgcc 117901 agtagcaccc cctctctagt tgtgacaacc aaatttctcc agacttagcc agacatcccc 117961 tgggagaagt aacccccaca tccccctacc ccactgaaaa ccagtggcag gccaggcacg 118021 ctggcttacg cctgtaatcc cagcattttg ggaggccgag gtgggtggat cacgaggtca 118081 agagattgag accatcctgg ccaacatggt gaaaccctgt ctctactaaa aacacaaaaa 118141 ttagctgggc atggtggcac gcgcctatag tcccagctac ttgggaggct ggggcaggag 118201 aattgcttga acccgggagg cagaggttgc agtgagtcga gattgcgcca ctgcactcca 118261 gcctggcaac agagcaagac cccgtctcaa aaaaaaaaaa aaaacaacag aaaagaaaac 118321 cagtggtgta ttgggatagt cagagtgcct gcatcgtagt gtccttggac catttggtga 118381 gatggtgcat gtaaaatgtg tgcactgttg ttagtgttaa ctcttctgtt gctgctgctt 118441 ctgctctagg cctgagggag ttgtggtctc acctactctt cctgttcact ctctgcagct 118501 tctgacaagg cttctaaaga gcaaccaccc cgaggacctt caggctgcaa accggttaat 118561 caagaatttg gtcaaggagg tgggcattct tccagttggt tcagtagaag tagcatttga 118621 cagaggctga aggacagtgc atatgagcct atgcttgagg ggtagaggaa cggttgcctg 118681 ggagaaagag tggctcctag ccctgaactc tggggaacag ccatagtctc cctcaagggc 118741 agcaaggtgg tgattggaac cagaaatccc gctgccgtac ttctcagctg ctttcatcca 118801 tgctgctcaa agcatttcct agacaccagc ttgggtccca gcaacaggta ttggtatgga 118861 caaagtccag gtttaggaat ctgcttttca gaggacttga ggctggtggc acaaaggcag 118921 aggtcatggt caccagtccc caaactgcat tttcctcccc aggaactctg ggcttttgag 118981 tattttttca cccactgtgt gagttctccc atgcacttaa catcctggtg atgcccctga 119041 gcgagctctc tcaatagtga ctctttccca tcccttgttc aggcttcagc cagggtttct 119101 gctccctcac tcccatcctc ctctttttcc caaaaggaac aagaaaaatc ggagaaggtg 119161 tccaagaggg tcagtgcggt ggaggaagtg cgaagccatg tgaaggtgct gcaggagatg 119221 ctgagcatgt accgcaggcc agggcaggcc ccgcccgacc aggaggccct gcaggtaata 119281 caggtgtaac acagggggct gcacatgttt actaaagtgg gtttgaaatg ctttcacctg 119341 tttgcaccca tgaagccatc tcagagtcaa gacggtaaac atatctgaca cctccccaaa 119401 tgtcgttgtg ctgctttgtc ctccctttgt ctgtcccttg ctccccagtc cccaggcagc 119461 tgctaatttg ctttctgttc tgattggctg acaactccca gactcttgta taaatggaat 119521 gatgcagtac ataaccctct atttttgcct ggtttctgga tctcactcga gatccattca 119581 cgttgctgtg tgtagcagta gcttgtttct tattgccggg tactgtcccg tcacctggat 119641 gtaccacagc atgtacctgt tgatgggcct tgcgttgttt ccagctttgg gtgattgtga 119701 atcatgctac tgtaaacaat tttgtacggg attttgtgtg aacatcgatt ttcatttctt 119761 atgggtaaat gtctgggatt actaggtcat atgctaagta tatatttaac ttgttaggaa 119821 acaaagtggc taagctatct tcaagggtga ttgcaccatt ttacagtcct aacagcagca 119881 tatgcaagtt ctagctcctc cacatcttga gatcttggtt tttatatagc atctttatga 119941 gataattaac ataccataaa agcaaccatc acacctacct agttatagaa tatcttaatc 120001 acccctaaaa gaaaccttgt acccattaac agtcattccc cctttcccaa actccccttt 120061 ccccagtccc tggcaaccac taatccactt tctgtctctc tggatttgct tatttggatg 120121 cttcgtccaa atgcggttgc acaatacgtg ggctttcgtg tctggcttcc tttacgtagc 120181 ctaatgtttg caggtgcatc catctggtaa tgtgtcactg cttcattcct tttcatggct 120241 ggatagcgtt ccattgtgtg gctatgccac agtttgtttc tccatttatc agctgatggg 120301 catttgggtt gttcctactt tttggctatt atgaatgatg cctttgaatt attcattacg 120361 aataatgact ttgaactttg tccaagtttt tatgtgaaca taggttttaa attctcttgg 120421 gtatatacct agaaatggaa ttgctggatc ttggggtaac tccatattta acattttgag 120481 gaactgccaa gctgttttcc aaagtggcca tgccatttta cattcccagc agcaatgtat 120541 tggctccagt tctctgcatc ctgccaacat ttgctgttat ctttttgatt ggagtcctcc 120601 tggagggtgt gaagtactgt ctcattgcaa ctttctgttt cttgatggct tatttatttc 120661 ctatgctcat tggcagcttg tgcatcttag aagacatgtc tgtgcaaatc cattgcccat 120721 cttttggatt gtcttttttg agttacaagc gttttttata tattctggat acaggttcct 120781 tatcagacag ttttctccct tcctttgggc tgtcttttca ctattttgat agcatctttt 120841 gaagcgggag ttttgttttg ttttgttttt tgagacaagg tctcgctctg tcgcccaggc 120901 tggagtgcaa ttgcgtgatc tcagctccgc acgacttctg cctcccgggt tcgagcgatt 120961 ctcctgcctc agcctcccga ggagctggga ttacaggaat gtgccagcac gcctggctaa 121021 ttctgtattt tttttttttt ttaagtagag acagggtttc tctatgttgg ttaggctggt 121081 ctcgaactcc tgacctcagg tgatccaccc gtctcagcct cccaaagtgc tgggattaca 121141 ggtgtgagcc actgtgcctg gccagtagag acagggtttc accatgttgg ccaggctggt 121201 cttgaaccct ggcctgaggt gatccacctg ccttggcctc ctgaagtgct gggattacag 121261 gcatgagcca ccatgcctgg cctttttttt tttttaatgt cttcctggtg cttttgttat 121321 cacatctaag aaaccacagc ccagccgggc gcagtggctc atgcctgtaa tcccagcact 121381 ttgggaggcc gaggcaggcg gatgacctga gttcaggagt tcaagaccag cctgaccaac 121441 atggagaaac cccatctcta ctaaaaatac agtatcagcc gggcgtggtg gcgcatgcct 121501 gtaatcccgg ctactcagga ggctgaggca ggaagaatca tttgaaccca ggaggcggag 121561 gttgcggtga gctgagatca caccattgca ctccagcctg ggcaacaaca gtgaaactct 121621 gtctcaaaaa aaaaccaaaa aaacaaacaa tcaaaaagaa accacagcct gtctgtgcca 121681 ggttcgaggc agccatgctg ttggtctccc ttggaactct gttctccacc tccttctgca 121741 ggtctgtttc agccatctcc ctctgcattc cagtttttta tgtctctctg cacaggagga 121801 gagccagcct ccacgtaact tctttattta cctgtccttc cacttcacaa gattagtaga 121861 aatgaactac tgtctttcca aattcctaag agaaagcctc ccgagtagct gggattacag 121921 gcacccacta acttttgtat ttttagtaga gacagggttt caccatgttg gccaggctgg 121981 tcttgaactc ctgacttcag gtgatcagcc caccttggtc tcctaaagtg ctgggattac 122041 agacgtgagc caccctgccc ggtcagatta ttttattttc agaaatcata cacttgtagc 122101 ctgtagtctt tttctgaaac tctgtcatgt tacttcaaaa ctgcagtctc taagtcagga 122161 ccattgggag gccttctgaa gccccagact ctgccgtctg cctgatcgaa ggctcctggg 122221 gccctggagg ggggcgttat ggggtgcctg ggcgtgtgtc atgaaccaag ctctgctctt 122281 tcaggtcgtg tatgagaggt gtgaaaagct gcggcccacg ctgttccggt tggcgagtga 122341 caccactgat gacgatgatg cactcggtaa gttttctttc cttgggtgta gctagcagcc 122401 caggtgcaag acagactcgg tccctcctgt cacactgctg tgccaggaac aagacagagt 122461 ggactttcgg ttccgtggag cttgggcttg gggagggtgt ggcctgagtt gcctcagaac 122521 tgacttccag caggatccag gcttcagtaa gaaatcagca gcaggtctaa agcctgaaaa 122581 gaagtggaaa ttgaccatgg agattggggt gtgagaggta gaagaaagat cccactggag 122641 gcttcaggaa aagctgacca atcgcagggg ccagagaaac agctgtgggt ggtgtctcaa 122701 ggggatcacg ggcttttgta ccttcagggg ccaggtaggt caactaagag ggccaggccg 122761 atgggaaatg gaaggtacat ctgtaaggaa gtggagaatt gtccacagtt actaaaaaag 122821 aattttaact ctttattgtg gaaaatctcg aaacatgaga gaggggaata gtaaaaccta 122881 tatattcttc actcatcttt aataattacc agctcatagc tgattttttt aatcttactt 122941 catctgtacc tctctttgta cacacccatt ccctagtctg ttttgaagca aattctagac 123001 atttcttttt cttttaatta ttttggagat ggagtctcgc tctgtcgtcc aggttagagt 123061 gcagttgtgc agtcttggct cactgcaacc tctacctccc tggttcaagc cattctcctt 123121 cctcagcctc ccaagtagct gggattatag gcacccacca tcacgcctgg ctaattttgt 123181 atttttgtaa gatgacattt caccattttg gccaggctgg tttcaaactt ctggcctcta 123241 gtgatccgtc tgcctcggct tcccaaagtg ctgggattac aggcgtgagc cacgatcccc 123301 ggcctctgga catttcagaa taatcaaaga ttcagtcagt gttcagattg ccctgataaa 123361 atttctcctt aatcatctct tgctaactta aaaaaaaaaa attgttctaa atcagacata 123421 aaagaaaagc catataattg gtttatattt cttctgtctc tcttagtgga tagcatgtgc 123481 atgccctctg caatcctttt tccttttttt tgagacagtc tcactctgtt gctcaggctg 123541 gagtccagtg gcgtgctcag ctcactgtag cctctacagc ctttacatcc tgtgttcaag 123601 cgattctcat gcctcagcct cctgagtagc tgggactaca gttgcgcacc accacacctg 123661 gctaattttt gtacttttag tagagatgga gttttgccat gttggccagg ctggtctcaa 123721 actcctgggc tcaagtgatt tgcccgcctt ggcttcccaa agtgctggga ttacaggcgt 123781 gaaccaccac acacagcctt ttttccttgc atgaaaatag gccctctgaa ttgacagttc 123841 tctcggtctc agttttgctc gtggcatccc ccatgatgct gctcatgttc tgtttctgat 123901 atttcttttt ttcttgagat ggagtttcgc tcttgctgcc ccggctggag tgcaatggtg 123961 cgatctcagc tcgctgcaac ctccgtctcc tgggtttaag caattctccc acctcagcct 124021 cccgaatagc tgggattaca ggcacctgcc acggctgatt ttttgtattt ttagtagaaa 124081 ctgggtttca ccattttgac caggatagtc tcgaactcct gacctcaggt gatccacccg 124141 cctcggcctc ccaaaatgct gggattatag gcatgagcca ctgcacagag ccagcagctt 124201 cagattctga gttaaagtta aaagtttgtt attgaaaggg ccttaatttg agtgcttact 124261 gctggccatt actctggaca gtacaggtgt agaaggtgaa cagatttgct tccttctgca 124321 ttgagaacta ctaacatgtt aggactcagg ttaactgact aggcctgata gggtccgatg 124381 acacaacctg gaggcaggag gtgtggataa actgaacata gtctagccat atgaaggctg 124441 ttttatggaa aagcttagaa aagactgaga ttgttggttg ccaaaccagg gggttctgag 124501 tcttccctca ttcttcagcg gaaattctcc aggcaaatga cctcctcacc caaggagttc 124561 tgctgtacaa acaggtgatg gagggccggg tcacctttgg aaacagagtg accagctcat 124621 tgggagacat ccctgtctcc agaggtatgt ttgaaagctt ctccttgcct tctctgcctt 124681 ggggcagcct gtaattccca gaaaagaaat gggtcctaaa catattctgg gttaaggaat 124741 agaaatgaga gaactcgccc tttgttcccc ttcctgtgaa agtggatatg ttgccacacc 124801 cagccatagg aattgtcatg agctggttgc cgggctggac cccagctttg actacacaca 124861 aaagcagatg gctaagctgt gcctgcgttt gaattctcat aaatgcttgc atacaggagc 124921 tagagacaaa agaaatgatc ttccttagtg gttcctagtg ccaggtcagc tgtatcagaa 124981 tcacctggag cactggttga aatgcggtcc ctggcttgga ccttctgaat caagactgct 125041 tggagctggg cctggaggtg tgcgtctttc aagcactgtg ggtgattcta atcagctgtg 125101 gtcagaagcc tcagaatggc ctgagctgga gaaggccgaa tctgctttcc tccagagtga 125161 tgggatcttt gcaggagggc ctatgatccc tggggtagga aggcaggcct tggctccctc 125221 ttgtcttgga tgccagctga gcttgtgtgt caggacactc agctgagtgg aaggttcttg 125281 gaagggcagg gagggcgtta ccatttgatt aggaggtggt ctgagggaag ctgataggat 125341 ttatgtggag actttgctct catgtccaac tgacttctct tcctaaccct gggctttcta 125401 gtctttcaga atccagcagg ctgcatgaag acctgccccc tgattgactt ggaggtggac 125461 aatggacctg cgcagatggg gactgtggtg ccatctttgc ttcatcagga cctggcagcc 125521 ttgggtgagg gcaggtgctt gtggggagca ggcacaggat gggtgcgagt aggggagtga 125581 gctggttccc ttcgcggagc acatgcctgc tgctgaggga gtcgagatgg ttgtgaatag 125641 gaagagtgag ctggttccct ctgggagcac gtgcttgctg ctgagggagt caggatgggt 125701 gtgagtaggg gagtgagctg gtccccttgg gaagcacgtg tctgctgctg agcaagtcag 125761 ggagagtgtg agtaggggag tgagctggac cctcggagaa cacgtgtctg ctgctgaggg 125821 agcacatacc tgctgctgag ggaatcaggg tggttgtgag taggagagtg agctggaccc 125881 tcagggagca catgcctgct gctgagggaa tcgggatggg tgtgagtagg ggagtgagct 125941 ggttccctct gggagcaagt gcctgctgct gagagaatca gcagggttct gagtagggga 126001 gtgagctggt ctcctcagga agcacatgcc tgctgctgag ggaatcaggg tggttttgag 126061 taggggagtg agctggtccc ctcaggaagc acatacctgc tgctgaggca atcagggtgg 126121 ttgtgagtag gagagtgagc tggttccctt ggggagcaca tacctgctgc caagggaatc 126181 aggtgttctg ggcagttcag agggtccccc ttgcatatta gtcacagatt ggccatggac 126241 gagtcacatg accttcagta agttgcttat ttgtctagac tttctggatg tcctagaaag 126301 gatgggtgtg gcattactct tcatctggaa gcaggtggag ttactgttag gcatctcact 126361 ctactccttt cttacaggaa tcagtgatgc tcctgttaca ggcatggtaa ggatgccctt 126421 cccagggccc ggagagagaa tgggagggtc tcagacctgc ccttgtgagg cagcactgcc 126481 tggcgagcag agaggtgctc ctgcactgtc ccggtgggat catggccttg gaggtccaaa 126541 gcctcttcct cacccatact gtctggctga tacctggggg agacaggttg caccaggctc 126601 ctgcttcctg gaggtctgcc caggagctgc atgtgatctt gccctttctt tcccctcttg 126661 ccccaagaga tgtcaccagt ctgggccaca gcagcttggg tgggctcagg agagctgtgc 126721 cttggtttta ctgtggctga tggtcattca taccctggtc ttattttagc cattcctgat 126781 ttgacaggtt tctggtcaga attgctgtga ggaaaagagg aatccctcct ccagcacgct 126841 gccaggcggt ggtgttcaga acccttctgc agacaggaat ttgctggacc tcctctcagc 126901 acagccagct ccgtgccctc tgtgagtctt ctgggcaagg ggagtgggag agtgtggctc 126961 ctgagtgcgg ttccttctca tgggacaagg gccagagaat ggtgctctaa gacagcctct 127021 cgccctggag aaacatcttt ctttctttct tttttttttt tttgtctttt tttctttttt 127081 tttttttttt gagacggggt ctcactctgt tgcccaggct agagtgcagt ggcacaatct 127141 cggctcacta cagcctttac ctcctgagtt caagcgattc tcctgcctca gcctcccaag 127201 tagctgggat tacaggcacg cgccaccatg cccagctaat tttttttttt taatttttag 127261 tagagacggg gtttcaccat gttggccagg ctggtctcaa actcctgacc tcaggtgatc 127321 cgtctgcctg gccaagaaac atctttctat tgaaagttta ggccaggtgc agtggctcat 127381 gccttgtact tgttttcaca ctgctgataa agacataccc aagactgtgt aatttaaaaa 127441 caaaaagagg tttaatggac tcatagttcc atgtggctgg ggaggcctcc cagtcatggc 127501 ggaaggcaaa aggcatgtgt ttttgtttgt ttgtttgttt ttgagatgga gtctcgctct 127561 gtcgccctgg gtggagtgca gtggcgcgat ctcagctcac tgcaacctct gcctcccagg 127621 ttcaagcgat tcttttgcct cagcctcctg agtagctggg attacaggcg tgtgccacca 127681 cacccggtta attttttgta ttttttttta gtagagatgg ggtttcactg tgttagccag 127741 gattgtctcg atctcttgac ctcatgatcc gtctgcctca gcctcccaaa gtgctgggat 127801 tacaggcatg agccaccaca ctcagccaaa aggcgtgtct tatgtggcag cagacgagag 127861 aatgagagcc aaacgaaaaa gggtttcccc ttataaaacc atcaggtctc atgagactta 127921 ctaccgtgag aacagtatgg ggaaactgca cccatgattc agttatctct tactgggtcc 127981 ctgctacaac atgtgggaat tgtgggtgct acaattcaag atgagatttg ggtggggaca 128041 cagccaaacc atatcatacc tgtaatccca gaactttggg aggccaaggc aggtggatcg 128101 cttgagccct ggagttcaag accagcctgg gagatatggc aaaaccccat ctctacaaaa 128161 aatactaaaa ttagccaggc gtggtaggtc atgactgtag tcccagctgc tcaggaggct 128221 gaggtagaag gatcacttga gcccggaaag tcgaggctgc agtgagccat tactatgcca 128281 ctgcattcca gcctgggcag cagagtgaga ccctgtctca aaaaaagttt aatgtgctgc 128341 ttcccttttc tcattttact tcttttccta agtcgtagga aaatacctgc tttggccaag 128401 acatctgaaa agaggtttat ttaaggacta ggagtagtag aacatgaggt ttttgccgtt 128461 tgtttttgtt ttcccttcta ataaccatat aattgaagtt aagcaaaatt tgatccatgg 128521 agaaacatac aacaaagtgc aaagtgccac tccctaaaaa caaggttgaa cgttcctttt 128581 ttgctttttt tgtatacatg tagtggtaca ttcctgtgtc ttgttacaca aacaggattt 128641 taatttataa accgcactcc agcatgacct tttcatatta tatctgtgat gcatttcata 128701 tcagtttcta taaatatgcc tcagtcttaa tagctacatt acaatttatt gcatggatgt 128761 atttgatcat ttcccttttt tggacattta cataatttct ggtttggagg tgcttggggg 128821 agtgccttat agtatagatc cctgaaggga gaggttctct aaggtgtgaa tgtaacaaca 128881 ttttcataga tcctgtccct ttgcctctgg aaaagtgcag cttaccttca cccaaacagc 128941 ctgaaagtgc ccatgttcct gatagaaacc accaggctca ggcagaagtg ggatgtcctg 129001 ggagttcatt cccatttgcc acccagaggt catttcttgc agtcttcttt ttttcagcca 129061 ctcgggcagg gtgggttatt tctcctctca ggactttagt tttctaaaat gttttcagtt 129121 tgtgtcccaa atgaacaaaa tcatggagtg gatagtagct ggtataattt gtgtatcatc 129181 tgcctggctc aagtcccttt gtgatcacac ctgggaaaca acctgactga gctacttatt 129241 gtcagggact gcccgacagg tacaacatga acaaccttca taacaaaggc tctaggttat 129301 tgggttctta ttatgtgtgg acattttaag tgcttttgtt tttttctttt ttttttcctg 129361 agatggagtc tcactctgtt gcccaggctg gagtgcagta gtgtgatctc ggctcaccac 129421 aacctccgcc tcccaggctc tagccattct cctgcctcag cctcctgagt agctggtatt 129481 acaagcgtgt gccaccacca cctggctaat tttttatttt ttattttttt ttaatttttg 129541 tatttttagt agagatgggg tttcaccatg tcggccgggc tggtcttgaa ctcctgacct 129601 caaatgatcc acctgcctcg gcctcccaaa atgctgggat tacaggcgtg agccaccgtg 129661 cccagcctct aagtgctttt catacatcgt ctcatttgat tcttacagta gccctaggag 129721 gtaaattact tctgttattc ccatttaaca gatgaggaag ctgaggtatg agttcgatta 129781 cttgacaaaa gtcatataac atctggcgca ttagctcacg cctgtaatcc cagcactttg 129841 ggaggccgag gcggacggat cacctgaggt caagagtttg agaccagcgc ggccaacgtg 129901 gtgaaaccct gtctctaata aaaatacaaa aattagtcgg acatggtggc aggtgcctgt 129961 aatcccagct actcgggagg ctgagtcagg agaatcgctt gagcctggga ggcagaggtt 130021 gcagtgagcc gagactgttc gattgcactc cagcctgggc aacagagcga gatactgtat 130081 caaaaaaaaa aaaaaaaaag gcatataaca tctaactctg tgtgtgtgtg tgtgttattt 130141 tttccttgtt ctttaataag cctaacccag attccaggaa atcgctacaa tttttgctag 130201 tctttgatat tttttatttc tttgtaggaa ttatgtttcg cagaaaagtg tccccaagga 130261 agtgccacca ggtactaagt cctctccagg ttggtcctgg gaggctggcc cgttggctcc 130321 ttccccatct tcacagaata cacctctggc tcaagtgttt gtccctttgg agtctgttaa 130381 gcccagtaag tacaatttca ctccccacct tttttttttt ttccctggga atctattaga 130441 ctcccagctt tttccactcc agtttgatag aggtggccct taaatttggt tgataaatgg 130501 gagagatttt gcatgagggc agaaagtaag aggttgtggc caggtatggt gactcacgcc 130561 tgtaatccca gcactttggg aggctgaggt gggaggatca cttgaggcca ggagtttgag 130621 accagcctgg gcgacatagc cttgtctcta tttggggggg aagaaaaaaa ttgggaggcc 130681 aaggcaggtg atcacttgag gccaggagtt ccagaccagc ctggccaaca taacgaagcc 130741 ctctctctac aaaaaataca aacattagct ggacatggtg gtgcatgcct gtaatcccag 130801 ctacttggga ggctgaggca ggagaactgc ttgaatccag gaggtggagg ttgcagtgag 130861 ccaagatcat gccaccacac tctagcctgg gtgacaaagt gagactccgt ctccaaaaaa 130921 aaaaaaagtt tgcttctttg tcttttagct gatttcttag ccttccttag tattgacact 130981 gtcaaggata tgggcaggca gaatttaatt aaaaggctta cttaggcaca taatgttcgt 131041 tgatttgggt tgggattcaa gaagggaatg tttaatgtgg cctgaaatga ctggcaagag 131101 gtagaattca aacgagtgga tgaatttggc tagtgatcct ccttctctag ggatcctcct 131161 taaggcagaa aagtggatat tatctctgtt aaaagggaag gaaatggaag gctcggctaa 131221 ttaaggtgcc taccgtctct acaccagtat gcagagcagg acacaactca ggtcgtctgt 131281 tccagggcct ggactcttta gcacagagcc ctcctatggc tctttgtggt gaggttagat 131341 tgcagagagc cctgaaagcc aggcagacaa atttataaat aaacagagtt gataaacaat 131401 gggagctctt tgcacatttc tttattgatt tttgatttaa gaaataatac atgttcaatg 131461 ttaattttct gaattcaata actgtattgt agttacgatg ttaacatctg gggaagctac 131521 atgaagggta tatagaaatt tttgtattag ttttataact ttttataagt ctgaaattat 131581 tacaaaacaa actttaaaaa ataaaaaagt taaaaacaaa aatacatgtt caaaatgtag 131641 agttttatta aagatagaat aggccagacg cagtggctca tgcctgtaat cccagcactt 131701 tgggaggcca aggtgggcag atcacttgag gtcaggagtt cgagaccagc ctggccaaca 131761 tggtgaaacc ctaaaaatat aaaaattagc tgggtgtggt aatgtacacc tgtaatccca 131821 gctactcagg agcctgagcc tggagaatca cctgaaccca gatggcagaa gttgcagtga 131881 gctgagatcg tgccactgcg cttcagtctg agcaacagag aagactctgt ctaaaaaaaa 131941 aaaaaaaaaa aaaaaaaaga ctagaaaagt aaaatcatct ataccattaa cagccagaga 132001 tggaaattct cttcatgtct tgatagattt tcttgtctta tcatatacat atatatgtac 132061 atacagttct taataatttg ataccacagt gtgtatagaa tgttttatac tctatttact 132121 gttactttat aagcctatac catttttatg cccttagttg ttctttaaaa acttgatttt 132181 taatgaactt atatatatat gcactattaa gtagaggtac tgtaatctgt ttattcagtc 132241 tttttttttt tttttttgga gacagagtct cgctgtcacc caggctggag tgtagcagca 132301 tgatctcggc tcactgcaag cttcacctcc tggattcaca ccattctctt gcctcagcct 132361 cccgagtagc tgggacacag gtgcccacaa ccacgcccag ctactttttt gtatttttag 132421 tagagacagg gtttcaccgt gttagccagg atggtctcga tctcttgacc tcgtgatctg 132481 cccacctcgg cctcccaaag tgctgggatt acaggcgtga gccactgcgc ccggccctat 132541 tcagtctttt tttttaaatt attttatttt attttatttt tgagacaagg tctggcccta 132601 tcacccaggc tggagtgcag tggcgccatc tcagcttact acaacctcca tcctccaggc 132661 tcaagccatc ctcccacctt ggcctcccaa gtagctggga ctacaggcat gcgccactat 132721 gtccagctta ttaagtcctt ctctgaaggt caatttcttt ctacctgttt gctgttaaaa 132781 acagggctac aggctgggtg tggtggctca tgcctgtaat cccagcacgt tgggaggcca 132841 aggtgggtgg agcacctgag gtcaggagtt tgagaccaac ctggccaaca tggtgaaacc 132901 ctgtctctac taaaaactac aaaaaaaatt agccaggcgt ggtggtgggc gcctgtagtc 132961 ccagctactt gggaggctga ggcaagagaa tcgcttgaac ctgggaggcg gaggttgcag 133021 tgagctgaga ttgcgccact gcactccagc ctgggtgaca cagtgagact gtgtctaaaa 133081 aaaaaaaaaa aaaaaaaaaa aaaaccaggg ctacaaagaa catctttgga tgccagggct 133141 acaaagaaca tctttggatg cctttcttta taccttcctg attatgagga gatcataata 133201 ccagtgaaat gccacatgta gacatttaaa attttgatct gcattaccaa tttcccgtct 133261 gaaaaggttg ctttttttcc ccatactctt gccaccatac cccactggta ttttcagtct 133321 cttaaatctt tggtagtttc attgttaaac tatgttttaa ttttttttta aattttttta 133381 gagatggggc ctcgctccat cgcctaggca ggagtgcggt ggcacagtca tagctcactg 133441 cagcctcata ttcttcttgg gctcaagcaa tcctcctgcc tcagaccccc aggtagctag 133501 aatgacaggc atgcaccacc acacccagca aatttttatt gtctttttga gggtgggagg 133561 cagatctcac tgtattgccc aggctggttt tgaactcctg acctcaagca gttttcccat 133621 ttcagcctcc tgagtagctg ggattacagg catgagacac catacccagc tctaagctat 133681 attttagtaa tgtatatttt gatattaagg tttttttttt agctgattaa ccagtctgag 133741 tgccattcat tgagccagcc ctgatttttg aaggatatgt tatgttacag tctcatttag 133801 atctgttcca cacgttttgt ttatgtcact ttacttgaat tgtaatttta tctgagtatt 133861 tgctatagcc ctgcccccct ttttaaattt attggtcagt cttacatagt tttcccccct 133921 caaataaaca ttagagtcac tccttagcta aattaaaagt ctctttagaa tttccattgg 133981 aatgttatta aacttattaa tttgaggtga tttggtatct acagttttgt acaattaaat 134041 ataaggtggt atattgtgtg tgtgtgagca tatgtgtgtg tttgtactca tataaaagag 134101 accagaagga agtaagtgct tatttctggg tagaattaag aatgcattgg ccgggtgcag 134161 tggctcacgc ctgtaatccc agcactttgg gaggtcaagg tgggcggatc acctgaggtc 134221 gggagtttga ggcctgacct cccccagcct gaccaacatg gagaaatccc atctctacta 134281 aaaacacaaa aattagccag gcgtggtggt gcatgcctgt aatcccagct actagggagg 134341 ctgaggtggg agaatcgttt gaatccggga ggcagaggtt gcggtgagcc aagatcatgc 134401 cattgcattc cagcctgggc aacaaaagcg aaagtctgtc caaaaaaaaa aaaaaaaaaa 134461 aaagcattta tttcattttt taatgataat cctgtttttg tgtgtgtgtg gctttttgtt 134521 ttttgtttgt ttgtttgttt gagacagagt ctcgctctgt cacttaggct ggagtgcagt 134581 tcattgcaac ctccgcctcc cgagttcaag tgattttcgt gcctcagcct ctcaaatggc 134641 tgggattaca ggtgtgtacc accatgccca gctaattttt gtatttttag tagagatggg 134701 gttttgccat gttgtccagg ctggtcttga actcttaacc tcaagtgatc cacccgcctt 134761 ggcctcctaa agtgctggga ttacaggcgt gagcccccac gcctggctga taattctcta 134821 ttttctatag tgttcatgca ttattttaat aatgatcgta tcttcctcaa aaatgtgatt 134881 tttaataact ggtggtgtgt gtatgatggt tggatggtgg ggaaaggtca gtttggaggg 134941 tcataggagt ccagctagag tcatgaggac ttggattatg gtagtgcaga ggaattcagt 135001 cctggacatg ttctgcatgg agaagcaaca gcatctgggg ataacatgta catctctttc 135061 cagcctggct gtccgggggg tttgtagtgg tgttaaacct tctgcttgtg ccctttatac 135121 caggcagcct gccgcctctc attgtgtatg accggaatgg attcagaatt ctgctccact 135181 tctcccagac gggagcccct gggcacccag aggtacaggt gctgctcttg accatgatga 135241 gcacggctcc ccagcctgtc tgggatatca tgtttcaagt ggctgtgcca aaggtgagtc 135301 atctgttggg gatttctcag taatggctgt cgtttttgag tgccaggttc ttttctggga 135361 cttacctgtc gagtgtcaag gaaagaatct tcactcaaac taactgaaga taaagtggga 135421 atgtaatggc ttacctgact ggggattcct gaatatctga cttcagatat gagtgaatcc 135481 aaataggttc ttaaacaatg ccttaagaat ctgtatctct tgactctgct tttctttggc 135541 accattctga cagtttctcc acaaacagga aagtggctac ccacagctcc agggttgcat 135601 ggtccttaac cctctttatc tcagaaactc tcaccgggca tgaagcacat atattgatta 135661 cccaggaaag ggctctgatt gggcttgttg gtgttacttc cataccttgg accaggaact 135721 tgtatcctgg gatatgagac acttggccaa tgtgagataa catccatccc ttttcaaggg 135781 aagtggggcc ctactgatta acaggcaccc cacagtcaca ggggaagggg cctgtcctga 135841 aaggaaaaga ttcgagacag acaaaaatat gacctgcagt ctctacttct tacggtagtc 135901 ctatgggcaa tgttctgcta ctacctctgt tttataagtt aacccaaggc tcagaaatta 135961 aatattttac tcaaggtcac agagctagtg agaagcagtt agaattcaac ccaagatgtg 136021 ccctggctgt tgatgctgcg tctcagatgc cggttattga ccacacaact cccgatccat 136081 tctgtgtgtg gaaggaaagg cctcttagat ccttttatca attagtggga gtctgtctgc 136141 ttgggctccc ataggtgcct cggggtacca cgtttaagag ggccatttcc tcttggaggg 136201 gtgttgggga ggtccagctc cccctcacct gatctccatc ctagaagctg tggttaggaa 136261 tgagttacat gttggttttc ttcccctgat agtcaatgag agtgaagctg cagccggcat 136321 ccagctccaa gcttcctgca ttcagtcctt tgatgcctcc agctgtgata tctcagatgc 136381 tgctgcttga caatccacac aaagtatgtt ccagtgtctg tggtagcagg gagatggggg 136441 cccctgaact gtaaggggag gtaactttgt gggtagagag aggcacattg aggccttgct 136501 ctaaagtctc tggtttgctg ggcacagtgg ttcacccctg taatcccagc actttgggag 136561 gctgaggcag gtggaccacc tgaggtcagg agtttgagac cagcctggcc aacatggtga 136621 aaccccatct ctactaaaaa tacaaaaatt agctggttgt gggggcacgc acctttagtc 136681 ctagctgctc aggaggctga ggcaggagaa ttgcttgaac ccaggaggtg gaggttgcag 136741 tgagccgaga tcgcactact gcactccagc ctgggcaaca gagcaagact ccatctcaaa 136801 aaaaaataaa gtcggccagg tgcagtggct cacgcctgta atcccagcac tttgggaggc 136861 tgaggcagat ggatcacgag atcaagcgat cgagaccatc ctggccaaca tggtgaaacc 136921 ccatctctac taaaaataca aaaattagct gggcatggtg gcgtgcgcct gtagtctcag 136981 ctactcggga gagtgaggca ggagaattgc ttgaacccaa gaggcggagg ttgcagtgag 137041 ccaagatcgc accactgcac tccagcctgg tgacagagcg agactctatc aaatagatag 137101 atagatagac agacagacag atagatgaaa gagagagaga gggagggaag gaggaaagga 137161 aggaaggaga agagaaaaag aaaagaaagt cagtctctgg tttggttggg tgttaggggt 137221 atatgtggaa tatccctcgc tggagtctgt ctttggggct cactgaagga agatcagatg 137281 gattgtcata gctggggacc aagggaggga cacatgaact ctccctaatc ctctggagtc 137341 ctctgaaagg acaaggtctg tcttccttag ctggtctctg aattgcagcc catctcagga 137401 tttcaggact gcagaattat cgaagggttc ccaggatatg tgcctgtttc cccttgtttg 137461 tagtcccaaa gagagtatcc caggtatttt gtgtcagaat tgtttgaaac actcacttga 137521 cttaacattg acccctgttc ctgggttgct ctccactgat ttgacatctt ctgtctcttc 137581 tgtcccagga acctatccgc ttacggtaca agctgacatt caaccaaggt ggacagcctt 137641 tcagcgaagt aggagaagtg aaagacttcc cagacctggc tgtcttgggc gcagcctaac 137701 ttttcacaag atggaccctt catttcaagc ttaggctggc gttacttttg ctgtctagtc 137761 aggactaatc acggtgtttc agtgcggagt gccaagagtc ctatcctgac gtcaggctct 137821 gggtgtcaac ctctgactta ttctgcagat gctctgtgtg tgtgtgtgtg tgtgtgtgtg 137881 tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgttcgggga gagggtggta 137941 gcacagggct tgggatatcg gcagtgtggg aaatgcgaag catttctcat catcaccatc 138001 tctgctacag tcatgtttct gcatgtcagc gagcgacact gtccctgcct caggttggag 138061 gttttatcag ccaaagtgtt tttttcatgt atcgttcgtt ccattcatcc actctgtgcc 138121 ttgtcagcct ttgaaaggct tggttgctcc caggctgctg ttctcaggga ccttaaaagg 138181 gacctggtta gtcttggggc agagagtatc tacttgggca ctctcttcca agaaagacct 138241 tgtctccatt ttcattagac aatgcttctt gtgtgtgttc tggaagatct tctaaatgga 138301 atgcttgttg cactgttccc aggcgagtgg ctgccatgag acctgaggac cacacttggg 138361 ggaccaatca tgtccttcac cactgtgcct tagaatcgcc cctggacaga gttcctgggc 138421 agaggggaaa gcagctccca ggccttactc aggcctcagg tccatgggtt gggcagccag 138481 tctgggccct tctcaggatc ctcatctcca tcctcatcct cttccttcac agcatttact 138541 tggagctctt tgtgacacac catgtcagtc atgatgaatc ggccaacagc cagcccttgc 138601 cagctgacgt cacagtctaa gatgggaaac tggtacagat agacatgaag agagcttagc 138661 agtgattgag gtggtgacta aatatacagt cattgaataa ataccatgta gcaagtgtac 138721 tttgtggagt gttgagtaag tggaaaatgg aaagccagtt gcatttagag atgataggcc 138781 taaagggaac tgtcttctgt cgagaagtaa aggaaacttc atgaaggatg tagaagctt // LOCUS HUB384D8 139887 bp DNA PRI 02-JUL-1996 DEFINITION Chromosome 22q13 BAC Clone CIT987SK-384D8 complete sequence. ACCESSION U62317 NID g1399959 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 139887) AUTHORS Adams,M.D. TITLE Complete sequence of a chromosome 22q subtelomeric BAC JOURNAL Unpublished REFERENCE 2 (bases 1 to 139887) AUTHORS Adams,M.D., Kerlavage,A.R., Fuldner,R.A., Phillips,C.A. and Venter,J.C. TITLE Direct Submission JOURNAL Submitted (26-JUN-1996) The Institute for Genomic Research, 9712 Medical Center Dr, Rockville, Maryland 20850, USA COMMENT BAC clone CIT987SK-384D8 is the most distal BAC clone on the q arm of the chromosome 22 map described in Kim, U.J., et. al, A BAC-based framework contig map of human chromosome 22q. Proc. Natl.Acad. Sci., in press. Genes were identified by a combination of three methods: XGRAIL (available by anonymous ftp from arthur.epm.ornl.gov), searches of the EST database at TIGR (http://www.tigr.org/tdb/hcd/hcd.html), and searches of the complete sequence against a peptide database. Repeats were identified with CENSOR (Jurka, J., Klonowski, P., Dagman, V.,Pelton, P. Censor - a program for identification and elimination of repetitive elements from DNA sequences. Computers Chem. 20: 119-121 (1996); available by anonymous ftp from ncbi.nlm.nih.gov). Alu classification was performed by J. Jurka, Genetic Information Research Institute. FEATURES Location/Qualifiers source 1..139887 /organism="Homo sapiens" /db_xref="taxon:9606" repeat_region complement(646..947) /rpt_family="Alu-Sx" repeat_region 1144..1423 /rpt_family="Alu-J" repeat_region 2549..2868 /rpt_family="Alu-Sxz" repeat_region 3093..3231 /rpt_family="Alu-Jo" repeat_region complement(3262..3555) /rpt_family="Alu-Sq" repeat_region 4147..4513 /rpt_family="THE1B" repeat_region complement(5085..5374) /rpt_family="Alu-Sc" repeat_region complement(5743..5857) /rpt_family="Alu-Sxzg" repeat_region 7318..7469 /rpt_family="Alu-Sbcg" repeat_region 7473..7791 /rpt_family="Alu-Sg" repeat_region complement(8919..9238) /rpt_family="Alu-Sq" repeat_region complement(9243..9516) /rpt_family="Alu-Sx" repeat_region complement(9859..9910) /rpt_family="MER41" repeat_region complement(9984..10200) /rpt_family="MER41" mRNA join(12304..12896,13046..13286,13400..13618,13692..13861, 14174..14298,14389..14516,14771..14873,14988..15406) CDS join(12673..12896,13046..13286,13400..13618,13692..13861, 14174..14298,14389..14516,14771..14873,14988..15307) /codon_start=1 /product="arylsulfatase A" /db_xref="PID:g1399961" /translation="MSMGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHP SSTTPNLDQLAAGGLRFTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGL PLEEVTVAEVLAARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQN LTCFPPATPCDGGCDQGLVPIPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQD RPFFLYYASHHTHYPQFSGQSFAERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEET LVIFTADNGPETMRMSRGGCSGLLRCGKGTTYEGGVREPALAFWPGHIAPGVTHELAS SLDLLPTLAALAGAPLPNVTLDGFDLSPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVR TGKYKAHFFTQGSAHSDTTADPACHASSSLTAHEPPLLYDLSKDPGENYNLLGGVAGA TPEVLQALKQLQLLKAQLDAAVTFGPSQVARGEDPALQICCHPGCTPRPACCHCPDPH A" repeat_region complement(15474..15584) /rpt_family="MIR" repeat_region 16082..16376 /rpt_family="Alu-Sp" repeat_region 16384..16498 /rpt_family="Alu-Jo" repeat_region complement(16516..16835) /rpt_family="Alu-Sx" repeat_region 16837..17011 /rpt_family="Alu-Jo" repeat_region complement(18353..18563) /rpt_family="LTR10" repeat_region complement(19930..20229) /rpt_family="Alu-Sp" repeat_region 20600..20805 /rpt_family="Alu-Sxzg" repeat_region 20806..21169 /rpt_family="TIGGER1" repeat_region complement(21175..21443) /rpt_family="Alu-Sq" repeat_region 21444..21631 /rpt_family="TIGGER1" repeat_region 21632..21927 /rpt_family="Alu-Y" repeat_region 21928..22172 /rpt_family="TIGGER1" repeat_region complement(22208..22340) /rpt_family="Alu-Jb" repeat_region 22341..22431 /rpt_family="TIGGER1" repeat_region 22437..22727 /rpt_family="Alu-Jo" repeat_region 22728..23043 /rpt_family="TIGGER1" repeat_region complement(23045..23203) /rpt_family="Alu-Sc" repeat_region complement(23204..23318) /rpt_family="Alu-S" repeat_region complement(23319..23449) /rpt_family="Alu-Sc" repeat_region complement(23462..23737) /rpt_family="Alu-Sc" repeat_region complement(23741..24050) /rpt_family="Alu-Sq" repeat_region 24052..24127 /rpt_family="TIGGER1" repeat_region complement(24133..24297) /rpt_family="Alu-Jb" repeat_region complement(24393..24523) /rpt_family="Alu-Jb" repeat_region 24527..24633 /rpt_family="TIGGER1" repeat_region complement(24863..25162) /rpt_family="Alu-Y" repeat_region complement(25509..25827) /rpt_family="Alu-Y" repeat_region complement(25959..26288) /rpt_family="Alu-Jb" repeat_region complement(28257..28561) /rpt_family="Alu-Sx" mRNA complement(join(28907..29742,30139..30237,30434..30531, 30627..30735,33458..33528,33684..33791,34515..34624, 34715..34887,35006..35081,35385..35833,35909..35973, 36949..>37314)) CDS complement(join(29670..29742,30139..30237,30434..30531, 30627..30735,33458..33528,33684..33791,34515..34624, 34715..34887,35006..35081,35385..35833,35909..35973, 36949..37314)) /note="hypothetical protein 384D8_2" /codon_start=1 /db_xref="PID:g1399960" /translation="MLPDFPSPSTWAPGLLLPSGPALLSPSVLQDSLSLGRSEQPHPI CSFQDDFQEFEMIDDNEEEDDEDEEEEEEEEEGDGEGQEGGDPGSEAPAPGPLIPSPS VEEPHKHRPTTLRLTTLGAQDSQDPEAAAGPGGVELVDMETLNPTRDTITPLWAAPGR AARPGRACSAACSEEEDEEDDEEEEDAEDSAGSPGGRGTGPSAPRDASLVYDAVKYTL VVDEHTQLELVSLRRCAGLGHDSEEDSGGEASEEEAGAALLGGGQVSGDTSPDSPDLT FSKKFLNVFVNSTSRSSSTESFGLFSCLVNGEEREQTHRAVFRFIPRHPDELELDVDD PVLVEAEEDDFWFRGFNMRTGERGVFPAFYAHAVPGPAKDLLGSKRSPCWVERFDVQF LGSVEVPCHQGNGILCAAMQKIATARKLTVHLRPPASCDLEISLRGVKLSLSGGGPEF QRCSHFFQMKNISFCGCHPRNSCEAPQGAAFQWERGVDRKRVLQTRGNVQPHLGAGQG AALNRATEGSSTGSEKGEWTPLVIMELTQSVNSCYFGFITKHPLLSRFACHVFVSQES MRPVAQSVGRAFLEYYQEHLAYACPTEDIYLE" repeat_region complement(31050..31187) /rpt_family="MIR" repeat_region 40595..40893 /rpt_family="Alu-Sc" repeat_region complement(40903..41017) /rpt_family="Alu-Jo" repeat_region complement(41133..41451) /rpt_family="Alu-Sx" repeat_region 41671..41893 /rpt_family="Alu-Sz" repeat_region 41959..42036 /rpt_family="MIR2" repeat_region complement(42421..42708) /rpt_family="Alu-Jo" repeat_region complement(43869..44163) /rpt_family="Alu-Sz" repeat_region complement(44203..44425) /rpt_family="MLT1F" repeat_region complement(44490..44803) /rpt_family="Alu-Sx" repeat_region 45351..45644 /rpt_family="Alu-Y" repeat_region complement(45772..46069) /rpt_family="Alu-Sz" repeat_region complement(46498..46777) /rpt_family="Alu-Y" repeat_region complement(46860..47174) /rpt_family="Alu-Jb" repeat_region complement(47559..47752) /rpt_family="Alu-Jb" repeat_region complement(48644..48941) /rpt_family="Alu-Jb" repeat_region 49451..49748 /rpt_family="Alu-Jo" repeat_region complement(50779..51091) /rpt_family="Alu-Sp" repeat_region complement(51455..51602) /rpt_family="L1PA7" repeat_region complement(51608..51975) /rpt_family="L1MA5" repeat_region complement(52372..52456) /rpt_family="Alu-S" repeat_region complement(55756..56062) /rpt_family="Alu-Jo" repeat_region complement(56071..56360) /rpt_family="Alu-Sz" mRNA join(<57549..57886,58087..58195,58582..58695,58891..59024, 59784..59879,60028..60086,60173..60254,60362..60470, 60614..60717,60937..61018,61189..61483) CDS join(57549..57886,58087..58195,58582..58695,58891..59024, 59784..59879,60028..60086,60173..60254,60362..60470, 60614..60717,60937..61018,61189..61263) /note="choline kinase isolog 384D8_3" /codon_start=1 /db_xref="PID:g1399962" /translation="MTGEAQAGRKRSRARPEGTEPVRRERSAAWPGARSSPRHGGRGD SCGRKRGCCGCLAKDGLQQSKCPDTTPKRRRASSLSRDAERRAYQWCREYLGGAWRRV QPEELRVYPVSGGLSNLLFRCSLPDHLPSVGEEPREVLLRLYGAILQGVDSLVLESVM FAILAERSLGPQLYGVFPEGRLEQYIPSRPLKTQELREPVLSAAIATKMAQFHGMEMP FTKEPHWLFGTMERYLKQIQDLPPTGLPEMNLLEMYSLKDEMGNLRKLLESTPSPVVF CHNDIQEGNILLLSEPENADSLMLVDFEYSSYNYRGFDIGNHFCEWVYDYTHEEWPFY KARPTDYPTQEQQLHFIRHYLAEAKKGETLSQEEQRKLEEDLLVEVSRYALASHFFWG LWSILQASMSTIEFGYLDYAQSRFQFYFQQKGQLTSVHSSS" repeat_region 58335..58443 /rpt_family="Alu-Jb" repeat_region 59179..59353 /rpt_family="Alu-Jb" repeat_region 59364..59662 /rpt_family="Alu-Sx" mRNA join(<62529..62669,62981..63120,63410..63587,63807..63908, 64109..64246,64332..64409,65844..65951,66025..66109, 66728..66923,67383..67568,68135..68240,68321..68437, 68904..69068,69151..69285,69400..69552,70037..70150, 70775..70867,71022..71107,71362..71581) CDS join(62529..62669,62981..63120,63410..63587,63807..63908, 64109..64246,64332..64409,65844..65951,66025..66109, 66728..66923,67383..67568,68135..68240,68321..68437, 68904..69068,69151..69285,69400..69552,70037..70150, 70775..70867,71022..71105) /note="carnitine palmitoyltransferase isolog 384D8_4" /codon_start=1 /db_xref="PID:g1399963" /translation="MAEAHQAVAFQFTVTPDGVDFRLSREALKHVYLSGINSWKKRLI RIKNGILRGVYPGSPTSWLVVIMATVGSSFCNVDISLGLVSCIQRCLPQGCGPYQTPQ TRALLSMAIFSTGVWVTGIFFFRQTLKLLLCYHGWMFEMHGKTSNLTRIWAMCIRLLS SRHPMLYSFQTSLPKLPVPRVSATIQRYLESVRPLLDDEEYYRMELLAKEFQDKTAPR LQKYLVLKSWWASNYVSDWWEEYIYLRGRSPLMVNSNYYVMDLVLIKNTDVQAARLGN IIHAMIMYRRKLDREEIKPVMALGIVPMCSYQMERMFNTTRIPGKDTDVLQHLSDSRH VAVYHKGRFFKLWLYEGARLLKPQDLEMQFQRILDDPSPPQPGEEKLAALTAGGRVEW AQARQAFFSSGKNKAALEAIERAAFFVALDEESYSYDPEDEASLSLYGKALLHGNCYN RWFDKSFTLISFKNGQLGLNAEHAWADAPIIGHLWEFVLGTDSFHLGYTETGHCLGKP NPALAPPTRLQWDIPKQCQAVIESSYQVAKALADDVELYCFQFLPFGKGLIKKCRTSP DAFVQIALQLAHFRDRGKFCLTYEASMTRMFREGRTETVRSCTSESTAFVQAMMEGSH TKADLRDLFQKAAKKHQNMYRLAMTGAGIDRHLFCLYLVSKYLGVSSPFLAEVLSEPW RLSTSQIPQSQIRMFDPEQHPNHLGAGGGFGPVADDGYGVSYMIAGENTIFFHISSKF SSSETNAQRFGNHIRKALLDIADLFQVPKAYS" repeat_region 65136..65451 /rpt_family="Alu-Y" repeat_region 65477..65641 /rpt_family="Alu-Jo" repeat_region 67661..67769 /rpt_family="Alu-Sz" repeat_region 67837..68039 /rpt_family="Alu-Sz" repeat_region complement(71869..72166) /rpt_family="Alu-Sq" repeat_region complement(72274..72416) /rpt_family="Alu-Jo" repeat_region 72596..72860 /rpt_family="MER7A" repeat_region 72893..73189 /rpt_family="Alu-Y" repeat_region 73226..73495 /rpt_family="Alu-Y" repeat_region 74351..74645 /rpt_family="Alu-Sc" repeat_region 74956..75239 /rpt_family="Alu-Jo" repeat_region 75999..76298 /rpt_family="Alu-Sx" repeat_region complement(79949..80199) /rpt_family="Alu-Sz" repeat_region complement(80462..80768) /rpt_family="Alu-Sg" repeat_region 80992..81289 /rpt_family="Alu-Sx" repeat_region complement(81965..82275) /rpt_family="Alu-Sq" repeat_region 82962..83279 /rpt_family="Alu-Sq" repeat_region complement(83286..83355) /rpt_family="MIR" repeat_region complement(84129..84310) /rpt_family="MER4B" repeat_region 84325..84623 /rpt_family="Alu-Y" repeat_region complement(84625..84988) /rpt_family="MER4B" repeat_region complement(84998..85058) /rpt_family="Alu-S" repeat_region complement(85169..85282) /rpt_family="MER4B" repeat_region complement(85284..85576) /rpt_family="Alu-Y" repeat_region complement(85577..85704) /rpt_family="Alu-Sc" repeat_region complement(85705..86005) /rpt_family="Alu-Sx" repeat_region complement(86208..86505) /rpt_family="Alu-Y" repeat_region 86537..86815 /rpt_family="Alu-Sg" repeat_region 87153..87190 /rpt_family="Alu-?" repeat_region complement(87191..87501) /rpt_family="Alu-Jb" repeat_region complement(87531..87717) /rpt_family="MLT1A" repeat_region complement(87821..88046) /rpt_family="Alu-Sc" repeat_region 96275..96526 /rpt_family="Alu-Sx" repeat_region 96627..96931 /rpt_family="Alu-Sx" repeat_region 97055..97360 /rpt_family="Alu-Y" repeat_region 98244..98545 /rpt_family="Alu-Sx" repeat_region 98546..98650 /rpt_family="Alu-Jo" repeat_region 98800..99095 /rpt_family="Alu-Sx" repeat_region 99826..99959 /rpt_family="Alu-Spqxz" repeat_region complement(100156..100454) /rpt_family="Alu-Sz" repeat_region complement(101288..101606) /rpt_family="Alu-Y" repeat_region complement(102713..103001) /rpt_family="Alu-Sz" repeat_region complement(103162..103480) /rpt_family="Alu-Sp" repeat_region 103686..103820 /rpt_family="Alu-Jo" repeat_region complement(104217..104405) /rpt_family="Alu-Sxzg" repeat_region 104531..104814 /rpt_family="Alu-Sc" repeat_region 104956..105245 /rpt_family="Alu-Jb" repeat_region complement(105253..105553) /rpt_family="Alu-Y" repeat_region complement(105598..106234) /rpt_family="LTR13" repeat_region complement(106235..106526) /rpt_family="LTR13" repeat_region 106623..107087 /rpt_family="LTR2" repeat_region complement(107194..107410) /rpt_family="Alu-Jo" mRNA join(110584..110917,111075..111277,111803..111901, 112696..112825,113130..113248,113675..113837, 113937..114167,114272..114412,114495..114658) CDS join(110704..110917,111075..111277,111803..111901, 112696..112825,113130..113248,113675..113837, 113937..114167,114272..114412,114495..114643) /codon_start=1 /product="endothelial cell growth factor 1" /db_xref="PID:g1399964" /translation="MAALMTPGTGAPPAPGDFSGEGSQGLPDPSPEPKQLPELIRMKR DGGRLSEADIRGFVAAVVNGSAQGAQIGAMLMAIRLRGMDLEETSVLTQALAQSGQQL EWPEAWRQQLVDKHSTGGVGDKVSLVLAPALAACGCKVPMISGRGLGHTGGTLDKLES IPGFNVIQSPEQMQVLLDQAGCCIVGQSEQLVPADGILYAARDVTATVDSLPLITASI LSKKLVEGLSALVVDVKFGGAAVFPNQEQARELAKTLVGVGASLGLRVAAALTAMDKP LGRCVGHALEVEEALLCMDGAGPPDLRDLVTTLGGALLWLSGHAGTQAQGAARVAAAL DDGSALGRFERMLAAQGVDPGLARALCSGSPAERRQLLPRAREQEELLAPADGTVELV RALPLALVLHELGAGRSRAGEPLRLGVGAELLVDVGQRLRRGTPWLRVHRDRPALSGP QNRALQEALVLSDRAPFAAPSPFAELVLPPQQ" repeat_region 112109..112384 /rpt_family="Alu-Jb" repeat_region 112462..112644 /rpt_family="Alu-Jb" mRNA complement(join(116417..117178,117247..117396, 117491..117592,117693..117742,117844..117912, 117998..118070,118162..118232,118374..118427, 118565..118672,118804..118870,119377..119448, 121093..121223,121681..121764,122135..122214, 122361..122440,122604..122672,122752..122836, 122931..122986,123865..>123924)) CDS complement(join(117038..117178,117247..117396, 117491..117592,117693..117742,117844..117912, 117998..118070,118162..118232,118374..118427, 118565..118672,118804..118870,119377..119448, 121093..121223,121681..121764,122135..122214, 122361..122440,122604..122672,122752..122836, 122931..122986,123865..123924)) /note="hypothetical protein 384D8_6" /codon_start=1 /db_xref="PID:g1399965" /translation="MNFIEAALLIQGSACVYSKKVEYLYSLVYQALDFISGKRRAKQL SSVQEDRANGVASSGVPQEAENEFLSLDDFPDSRTNVDLKNDQTPSEVLIIPLLPMAL VAPDEMEKNNNPLYRGAFMLEPEGMSPMEPAGVSPMPGTQKDTGRTEEQPMEVSVCRS PVPALGFSQEPGPSPEGPMPLGGGEDEDAEEAVELPEASAPKAALEPKESRSPQQSAA LPRRYMLREREGAPEPASCVKETPDPWQSLDPFDSLESKPFKKGRPYSVPPCVEEALG QKRKRKGAAKLQDFHQWYLAAYADHADSRRLRRKGPSFADMEVLYWTHVKEQLETLRK LQRREVAEQWLRPAEEDHLEDSLEDLGAADDFLEPEEYMEPEGADPREAADLDAVPMS LSYEELVRRNVELFIATSQKFVQETELSQRIRDWEDTVQPLLQEQEQHVPFDIHTYGD QLVSRFPQLNEWCPFAELVAGQPAFEVCRSMLASLQLANDYTVEITQQPGLEMAVDTM SLRLLTHQRAHKRFQTYAAPSMAQP" repeat_region 119551..119851 /rpt_family="Alu-Sx" repeat_region 119862..120176 /rpt_family="Alu-Jb" repeat_region 120244..120547 /rpt_family="Alu-Y" repeat_region 124718..125008 /rpt_family="Alu-Y" repeat_region complement(125011..125134) /rpt_family="Alu-Jb" repeat_region complement(126279..126452) /rpt_family="Alu-Sxzg" repeat_region complement(126614..126932) /rpt_family="Alu-Sz" repeat_region complement(126961..127176) /rpt_family="Alu-Spqxz" repeat_region 127272..127362 /rpt_family="MIR" repeat_region 127569..127882 /rpt_family="Alu-Jb" repeat_region complement(128175..128299) /rpt_family="MIR2" repeat_region 128328..128458 /rpt_family="Alu-Sg" repeat_region 128467..128765 /rpt_family="Alu-Y" repeat_region 128766..128807 /rpt_family="Alu-Sg" repeat_region 128826..129125 /rpt_family="Alu-Y" repeat_region 129126..129214 /rpt_family="Alu-Sg" repeat_region 129268..129562 /rpt_family="Alu-Y" repeat_region 129685..129967 /rpt_family="Alu-Sx" repeat_region 130014..130130 /rpt_family="Alu-FLA" repeat_region 130193..130500 /rpt_family="Alu-Y" mRNA join(<133354..133613,133906..134123,134195..134373, 134593..134734,134871..135005,135074..135179, 135255..135351,135424..135606,135681..135849, 135957..136068,136503..137458) CDS join(133354..133613,133906..134123,134195..134373, 134593..134734,134871..135005,135074..135179, 135255..135351,135424..135606,135681..135849, 135957..136068,136503..136641) /note="hypothetical protein 384D8_7" /codon_start=1 /db_xref="PID:g1399966" /translation="MTCPPTGLYGPEGILPARRTLRPQGKGRWQQLWETPTLLWEAPR LGLDTAQGLELLSLLGALVALGALLLSPLRHPVIYLLLWAAYLDSLLLETGFLAVLVA PLRPASHRKEAPQGRQAGALPHEDLPFWLVRWLLFRLMFASGVVKLTSRCPAWWGLTA LTYHYETQCLPTPAAWFAHHLPVWLHKLSVVATFLIEIAVPPLFFAPIRRLRLAAFYS QVLLQVLIIITGNYNFFNLMTLVLTTALLDDQHLAAEPGHGSRKKTATSWPKALLATL SLLLELAVYGLLAYGTVHYFGLEVDWQQRTIHSRTTFTFHQFSQWLKTLTLPTVWLGV ASLVWELLSALWRWTQVRGWLRKLSAVVQLSLVGTATVALFLISLVPYSYVEPGTHGR LWTGAHRLFGAVEHLQLANSYGLFRRMTGLGGRPEVVLEGSYDGHHWTEIEFMYKPGN LSRPPPVVVPHQPRLDWQMWFAALGPHTHSPWFTSLVLRLLQGKEPVIRLVQSQVARY PFHKQPPTYVRAQRYKYWFSQPGEQGQWWRRQWVEEFFPSVSLGDPTLETLLRQFGLQ VRGCQPGWGRWQG" repeat_region 138253..138568 /rpt_family="Alu-Sx" repeat_region complement(138590..138700) /rpt_family="Alu-Jo" repeat_region complement(138746..139045) /rpt_family="Alu-Y" repeat_region complement(139118..139187) /rpt_family="L1MB7" repeat_region complement(139210..139352) /rpt_family="Alu-Sxzg" repeat_region complement(139353..139378) /rpt_family="L1MB7" repeat_region complement(139379..139509) /rpt_family="Alu-Spqxz" repeat_region complement(139523..139867) /rpt_family="L1MB7" BASE COUNT 32246 a 38845 c 37337 g 31392 t 67 others ORIGIN 1 gcccaaataa accctccgct catgttaatg ttgcttcagc ttctttcttt taggtcaacg 61 ttcagtacgc gtctgtcatt tccatttgac tgttaaacaa actggagact atagaggcca 121 agggaaaact tgccctctgc cctcaggaga ttcacggaaa atcaactgac aaagtaagag 181 aaaaggcata cagatttatt catgcacaca gggagaacca gagtgatgac cccaacctcc 241 caatggggtt cggaagctta tataccatct tgaggttaca gaaagaacat gggctcaaag 301 tatggccaca gtttatggtg gaaaatcagg tgaccagtgg caagaccggt tacaggaggg 361 agagcagagg aggcctggct agcaaaggtg gccttgttct acagatgaaa cctcacaggt 421 agcagccctg agagagaaca gatggtgaat gttcccttca ggcctttaaa ggcgtcagac 481 tgttactctt cccagatcag gcaaggaggg cctcagagga ggcctggccg catcagtgta 541 gattctctcc acagatgcaa atctccccca caaaagacag cttttcagct attcttctat 601 ttccagtcct tctgagtaac tgtcttgaac cgtgtcaagg aaatattttt gttgttgttg 661 tttgtttgtt ttgaggcaga gtctcactcc gtcacccagg ctggagtgca gtggtgcaat 721 ctcagctcac tgcaacctcc acctactggg ttcaagtgat tctcctgctt cagcctccct 781 gggattacag gcatgcacca ccatgcctgg ctaatttttg tatttttagt agagacgggg 841 tctcgccatg ttggccagcc tggtctcaaa ctcctgacct caggtgatct gcccacctcg 901 gcctcacgaa gtgctgggat tacaggcgtg agtcaccatg cccagccatg gaaatacatt 961 ttggggcgaa atattttggt ttccttcagt ccccactttg agactttcag aaagttgccc 1021 atataaaaag aaagttgata gctttgaaga gatttgagtt agaggttgtt agataagaga 1081 caggcagagg cgggcaaaaa gaaattggga taagcagaaa aagacaaatt taaatgtcat 1141 cctggcaggg cacagtggct catggctgta aatctcagca ctttgggaag ctgaggcagg 1201 aggagcactt gagcccaggt gttcagacca gcccgggcat cacagggaga ctctattgct 1261 acaaaaaata aaaaattagc caggcatggt agtttgtgtc tgtggtccca gctacctggg 1321 aggctgatgc agcaggatga cctgagcctg ggaagtcaag tctgcagaga gctgtgatca 1381 caccactttg tgggtgagtg agtgagaccc tgtctctgaa aaatagatat gtatcatgcc 1441 atattttctt gaatccgtct cttagtcctg agactagatc agctcagtta aacagctgta 1501 tcctattcca agaagtggca ttgcagatgg gctgggacct cgtatataat tcacacaaac 1561 agatctttaa caagatacaa ttctatggaa acagaagaaa aaaaacggtg aatgtctggt 1621 gtcatccata gactagcttt tctagctctc ctcaatctga ggcatctgga gcatcttcag 1681 attgcaatgg caatttgaca gatttttctg gattgtagtt ctaattgggt gttcaagtga 1741 actttctgaa tagtccatat atcaacaggc acaaaggcta tttatacata agtcgctgtg 1801 atgatttctc ctgaagatta taagtaattt agcttcagtt gcagggcttc aagaaaagcc 1861 attttaattt ccagtgattt taaatcagaa aaatgggaga aaaactggaa agcattagtt 1921 tagagacttg gagctaggaa agaatgcagg attcagttca aactataggc aaatgataac 1981 aactcaaaaa caagtcagtc cggactctcg taacaggtgt caacagtcag tccggactct 2041 aataacaggt gtcaacagtc agtccggact ctaataacag gtgtcaacag tcagtccgga 2101 ctctaataac aggtgtcaac agtcagtccg gactctcctc acggcaggtg tcaacagtca 2161 gtccggactc taatgacagg tgtcaacagt cagtccggac tcttgtgaca ggtgtcaaca 2221 gtcagtccgg actctcgtga caggtgtcaa cagtcagtcc ggactctaat aacaggtgtc 2281 aacagtcagt ctggactcta ataacaggta tactatagtt ttcttctaaa acatattttt 2341 tctctctcca gtctcccttt tccacaaaag acaaatcaga gtaagaacaa ctgatttaca 2401 aaataagttt tagtcttatt ataattggcc tgattatttg cataaagtac agtaataata 2461 gtgactggcc attcaacctc ttttaaagtt ggctttgcca gaactttttc ataaggaatc 2521 tcagattaga cttttaaaag catcttgagg ccgggcatgg tggctcacgc ctgtaatccc 2581 agcactttgg gaggccgagg cgggcggatc gtgggctcag gagatcaaga ctatcctggc 2641 taacacagtg aaaccccatc tctactaaaa atacaaagaa aatcagccac gtgtggtggc 2701 gggcgcctgt agtaccagct actcaggagg ctaaggcagg agaatggcat gaacccggga 2761 gacggagctt gcagtgagcc aagaccgcgc cactacactc cagcctgggc gacagagcga 2821 gatcggtctc aaaaaaaaaa aaccaaaaac caaaaaaaaa aaaaaaaaaa aaaagtactt 2881 gtctgggtac tttacatgag tttccttgag gaagaagcaa gtcttgaact gtagctaatt 2941 ataagctgct ttttaaaaag aatcaaagta aaacagtaat tgactgtgga tgacagatga 3001 cttagactaa tgatggttac agatgcaatt cacaaggaga tttgtttttt tgtgtgacat 3061 acaacaattt aacataataa ttataatttt gagccaggcg tggtggctca tacctgtaat 3121 ccctgcactt tgggagacca aggcaggagg gctgcttgag ctcaggagtt cgagatgagc 3181 ctgggtggca cagcaagatc ctgtctccaa aataataaat aaataaataa agacagcatg 3241 aggcaaactc tctactcctc cttttttttt gtgagacaga gtctcactct tgttgcccag 3301 gctggagtgc agtggcacaa tctcagctca ctgcaacctc cgcctcccgg gttcaagcga 3361 ttctcctgcc tcggcctcct gagtagctgg gactacaggc aggcgccacc acgcctggct 3421 aatttttgta tttttagtag agatggggtt tcaccatctt ggccaggcta gtctcaaacg 3481 cctgacctca ggtgatccac ccacctcggc ctcccaaagt gctgggatta caggtgtgaa 3541 ccaccgtacc gggcctactc ctccttttct tgtagtttac tcaaaaggta aacaaaaatc 3601 ttttactgtc tcttattaat actacatgaa aatcttactc aaaaatgaaa actaaattct 3661 acctttacat tagcatatta ttaatactaa agctaatttt aataaaaact taaaaacaga 3721 tccattcaat ctcaatcagc tttgaccaca taagatttcc ataaaccttt aataacctct 3781 taaaatttgt tccattcttt gtttctcaaa ctttctatat atattcagtt ttatctatca 3841 tttttttatt cctttaattt aaaacaacct ttaaaaacat caaaactaga caaaattact 3901 tttcctttaa caaaaacctc attctcatgc cttctttata accttcctta ccaaaaacac 3961 atcctacttt cttataaact ttgcatacag aagtgtttct cttatatcta gtagttttaa 4021 atacatatat taattacaat gttaactctt aggaagccta attttcagtg aaaaacatgg 4081 gaagtaagca attttaattg ttattaaaga tgcaaagccc aggacaaagg acagagcttt 4141 aaagactgat atggtttggc tgtgtcccca cccaaatctc atcttgaact gtagttccca 4201 aaatccacag gtgttgtggg agggacccag tgggaggtga ttagatgaag ggggcggttc 4261 ccccacgctg tttttgtgct agtgagttct cacgaggtct catggtttca tgaggtgctt 4321 ccccactctt cgctctgcac acctccttcc tgacctcatg tgaagaacgt gtttgcttcc 4381 tcttccgcca tgattgtaag tttcctgagg cttcccagcc acgctgaact gtgagtcaat 4441 taaacctctt tcctttacaa atcacccagt ctcaggtatg tctttattag cagtgtgaga 4501 atgggctact acaaagacaa tgcctggagg actcaacccc tcccagcatg accggggggc 4561 acagctgggc cagggaggac aatgctcggg cactgcagac acacagcatg accggggggc 4621 acagctgggc aagagaggac aatgctcggg cactgcagac acacagcatg gctggggcgc 4681 acagctgggc tagggaggac ggggctcagg cactgcagac acacagtatg gccggggggc 4741 acagctgggc tagggaggac ggggctcggg cactgcagac aaacagcatg accggggggt 4801 acagctgggc cagggaggat ggggcttagg tactgcggac acacagatgt ctccaggcca 4861 catcatgtcc acttgtcttg accccagaat gtagaggctc aaagccaaag acctaagttc 4921 acagacaaat taagcaagta tcagaaatat gacagaagca aattttatga ccttaaagca 4981 tctaacagag actgtctgaa cctgtccgac caatgggccc aagcaaagat gtctcaatta 5041 tatttaagac tgacgacttt gaagatattc taattttagc atcatttttt tttttgagat 5101 ggagtctcac tctgtcaccc aggctagagt gcagtagtgc agtctcggct cactgcaacc 5161 tctacctccc aggttcaagt gattctcctg cctcagcctc ctgagtagct aggactacag 5221 gcgcatgcca ccacgcctgg ctaatttttg tatttttagt agagacgggg tttcaccatg 5281 ttggccagga tggtctcgat ctcttgacct cgtgatttgc ccgcctcagc ctcccaaagt 5341 gctgggatta caggtgtgag ccactgcacc cagcagcaac aattttaaaa ctaagtttat 5401 ttgccaaaga ttactagtca catgaactag aaaagcatct gggtttagtt acatcattca 5461 tgagcactta tttatttata cgtcaatttg ctactatgta cataatatac aaacaggcat 5521 gtacacataa aaatacagac acaaatcaag attttatagc tttagtttta agattttagc 5581 cacgaatcgg gtaaaaatca ctagtttaaa aggatactta aaattgtgcc tctgtaaatg 5641 gaacaaggta aaatttatct gtctcacacg gccaaagccc ttaccaagtt ttagagaaaa 5701 caaggcaaca aatttacatc ttgaagcaca gagaaaaaaa aattattttt atttactgaa 5761 ttttttcaag acggagtctt gctctgtcgc ccaggttgga gtgcaacggc acgatcttga 5821 ctcactgcaa cctctgcctc cttggttcaa gcaattccac agggagaaag tttaagcttt 5881 gatttagtat gttaaaggaa gattttaaat ggatgctgag gtaacaaaaa gtcatagaaa 5941 ttcaccacag gattttataa ggacaacaat gttatttaaa tatgtggcta ttaatttatt 6001 ctccattttt gaactagacc actgggctca ggacagagct cattaatgaa catggccaaa 6061 aaagcatttg cagtttggag gacctaatat ttaaatatgt gaaaaacagg tgcagctgga 6121 aggcagagca tctagatctt ttaaaatcaa ggatcccaca tttacactga atgctgagtt 6181 ccctctaaaa agagatatat atgagacaaa gccatacagt gtttccacag tgtacctcac 6241 tatgaagaca ttcccctgag gctggtggga gacccacagc aatcagccca cggagtctca 6301 gcctttgacg ccaagtgctt ccatagtctc caagtgttca gattgtgcct ttctcatcta 6361 aacatgcaga gaaacaagca gccactgcag gaacaaccat tcactacaac tgctttcagc 6421 cacctccaaa actgtgaggc agccctcgcc agtgacctgc cggccatcac acacactcag 6481 gtcatgcact gtctcacagt gcaacgtaat ccctggtacc cccaaagcca aagagatcca 6541 gtgacacagt gccaaagaca gccaagcttt agacccgaga ggaagctacc cacgactcct 6601 caggccccat aaggaagaca aaagcggggc gcgtggcacc tttctttgcg ttccccaagg 6661 ggtctctaag tcatcagaag tcccttgtag atcccttcat taggtaccaa agatggcaaa 6721 ggggatggag gagcacggag gggtagaagt aaatgggact gcaatcctta gaggagccaa 6781 tttggaaaaa tgttaaggtt ctaaaaggcc aatacaattt tacatttttc tcatcaaaat 6841 tataccaaca aaggaaccaa acagaaggac caaacacaca atttaaaggg gtttcagtca 6901 cctgaaaaaa aaattcccag aaacagggtc caaaagcaga aaaggctgta gatacatggc 6961 ttgaagatca gctcacctgt ttttaattaa gccaacttct gaccacagag ctctttttta 7021 aaatcctttc aaatatctta ttatcagatt ttagctgaga ccaacagctg atagccatgg 7081 ctttgagacc ttttttttta aaaccaaagg tacctcccaa gtgactcacc aaaaccaata 7141 agccttaact aaagctatgg acttaaccaa ggacacataa gccatctcca aagaggcgca 7201 aagcagtcct gatgacatcc agagccaccc caaagagctc aaagaaagga aatcaaaagc 7261 tgccagtgga gggggaaagg gtcaacaaca aatgagttcc gcacaaagtc aaaagttggc 7321 tgggtgcgtg gctcacgctg gtaatcccag cactttggga ggccgaggca gccggatcac 7381 aaggtcagga gttcgagacc agcctggcca acacggtgaa accccatgtc tactaaaaat 7441 acaaaaaaaa aaaagaaaaa aaaagaaaat tagctgggcg tggtggctca cacctgtaat 7501 cccagcactc tgggaggcca aggcgggagg atcacgaggt caggagttcg agaccagcct 7561 ggccaacatg gtgaaacccc atctctacta aaaatacaca cacacaaaaa aatcagccgg 7621 gcgtggtggc tcacacctgt aatcccagct actccagagg ctgaggcagg agaattgctt 7681 aaacccagga ggcagagggt gcagtgagcc aagatcatgc cactgcactc cagcctgggc 7741 gacagagcaa gactccgtct ccaaaaaaaa aacaaaagaa agaattcaaa agtcacacaa 7801 atatcaaacc aaaagggact ggttcctcta ttgggaattg aacccaggcc atggtagtga 7861 aagtacagaa ttttaagtag tttccaagat gcagcagtct tcattgtgaa tcctgcaggg 7921 atccaaagca atggtttgcg cgcacaaagg attttaactt gttacaggtc agatttttgc 7981 tctttaattt tgtgaagagg atttctaagg ctagccacaa cattattatg tgtctttcct 8041 ttaatctccc cacaaataca aataaggcaa ttgtttagaa tgagagacct ctaaaatctt 8101 tttttttttt taatttaggg atctttctaa tgtaaaggat tcatcttttg gctattgaca 8161 atcagaattt ccaatgatgt atttattcca atagcaactc aatccaagaa gccccttcat 8221 ggaaagccca aaagattatt ttccaggttt agagagggca tagaagagga ggttccaatg 8281 atgcccccaa aattcactcc caggaatagg caaagacagc aaaagactct tgttcccaga 8341 gacagttaag gaacgtgttt gtacatacag tgcctccagt aacacacaat ctgtcagggg 8401 ctgccagtca cagacccatt aacctgtaac acagggtagg cctgttggga ttgggctttc 8461 tgaggactga ccaggcaaca aacattggga tgacaaaagc cccttatgga tgggcctgag 8521 gagagcaaaa accacctggc gaccatcaaa catcaaacag gccctccgag gcaaaactcc 8581 ttatctgggg aaaatcagaa gtaactaaac ctccctatta tctaaagcag gcatctggtt 8641 ccagatttct tcccccaaaa aacgtgtaag taactataat ttctatgtgt ctccagaata 8701 ccatgacgaa actcactgta caaccgtcgc tgacattaag gcaccaaaat tactataaat 8761 gtaatcattt atcatgactt acgtggctaa tatggtccaa attactgtta agcctccact 8821 ttaaggccta taaatgccct taaggagaaa tccacccggc gtactcagtc ctcttgctga 8881 ggcacctcat tgcactcttc tgcagcattc taataaactt tctttctttt cttttttttt 8941 cttttttttt tttttgagat ggagtttcac tcttgttgcc caggctggag tgcaatggta 9001 tgatctcagc tcaccacaac ctccatctcc caggttcaag cgattctcct gcctcagcct 9061 cccaagtagc tgggaataca ggctcccgac accatgccca gctaattttt tgtattttta 9121 gtagagatgg ggtttcatca tgttggccag gctggtcttg aactcctgcc ctcaggtgat 9181 ccacctgcct cagcatccca aagtgctggg attacaggcg tgagccaccg tgcccggcaa 9241 actttctttt ttcttttttt ttttgagaca gtcttgctct gtctcagctc actgcaacct 9301 ccgcctccca ggttcaagca attctcctgc ctcagccttc caagtagctg ggattacagc 9361 cacgcaccac cacacctggc taagttttat actttcagta gagacagggt ttctccatgt 9421 tggccaggct ggtctcaaac tcctgacctc aggtgatcca cccaccttgg acttccaaag 9481 tgctgagagt acaagcatga gccactatgc ccggccaaac tttccttttt caaacctatt 9541 gtcggtaaac tctttttacc aacccgcaag ttgaccacca ctcccagtgc cggggctctg 9601 acagctcgcc tggcagggac ttcttaagac aaactaaaag cttgacacat tcagaacaaa 9661 aacggtgctg cttaaaatat taaatgtctt gggttcccaa ccattttcag actgaccaca 9721 aaatgtgacc tgcgagtcac tacctcggat tggcagaggc caagagagtg ctcctgaggt 9781 aggaggtggg acttagttga cccagttgag gattggctaa aacagggttg gggcaatagc 9841 agctttcagt cagatatgcc cgccactgtg ccatgtcagt ttaccattgc catggcaaca 9901 cccaggagtt gccacccctt tccatggcaa tgacccaatg atgactactc cttccctaga 9961 aacttctgca taaaccatcc tttaatctac atgcaatcaa aagtgcgtat aaatgtgact 10021 gcaaaactgc cctgagctgc tactctcagc ctacggggtg gccctgctct gcaggagcag 10081 tcacagagct ctaacgccac ctgttcaata aagctgtttt cttctacctc tggcttgtct 10141 ttgaattctt tcctgggcaa agccaaaacc ctcactggct aagctccact ctggggtctg 10201 cctggcctgc atcattccca ctcgctcaca actcaagctt tcaaggacat caaaagagag 10261 gagaatctca tccagttttc agttcaggga cccacagcaa agtctgtcta actagacgca 10321 tccgatcaga gctgcaaaac tgactagcct gcaggacggc cccaagacgg ggtttacagg 10381 ggttttaggc ctgtgttcta ccctctttat gacagaaaaa cacagaaaga caagaatgaa 10441 agacgactat ttctggggaa aaaaagcaat caaacaatat gaatacaaaa ctaatcacat 10501 aaatacattt ctctcattaa aactttgaag aggaaaacaa ggtaaacaga catttttacc 10561 gtcggcttga caagattcca gagaggccgg gagcctggct gctaagacct tcttaccctt 10621 ctttctgcca gcttgtcggc tcctgggttc ccctgacgga ggctcccagg agagtgactt 10681 tagttatcct accgactgca ccaaaactgt aggagcctag agaaaaactt ttctgcattc 10741 tgagttcaca gaaaatcaac tgacaaaagc caggttagta agaaaaaagg catacagatt 10801 tattcatgca cacagggaga accagagtga tgaccccaac ctcccaatgg ggttcggaag 10861 cttatatacc atcttgaggt tacagaaaga acatgggctc aaagtatggc cacagtttat 10921 ggtggaaaat caggtgacca gtggcaagac cggttacagg agggagagca gaggaggcct 10981 ggctagcaaa ggtggccttg ttctacagat gaaacctcac aggtagcagc cctgagagag 11041 aacagatggt gaatgttccc ttcaggcctt taaaggcgtc agactgttac tcttcccaga 11101 tcaggcaagg agggcctcag aggaggcctg gccgcatcag tgtagattct ctccacagat 11161 gcaaatctcc cccacaaaag acagcttttc agctattctt ctatttccag tccttctgag 11221 taactgtctt gaaccgtgtc aaggaaatat atttggaggt gacatattgt gttttccttc 11281 aagatcaaaa aggttaagtc caaggctgcc caacgaggga gcgaggaccc aatcacgttg 11341 tcctcagatg agtttttccc cagcagcgta accctgcaac aggccagact ggaccccaga 11401 cacacattcc acaccacacc acgcctccac caccacacgc acacaacaga cacaccctaa 11461 acaccagaca cacacacaca cctcaaccac cgcaaacaca catacgccac acctctaaca 11521 ccacacacac ccaccacaca catacgtgcc tcaaacacta cacacacctc aaccaccata 11581 cacctcaacc acacacaccc ctcaaacacc aaacacaccc ctcaaccaca cacacacacc 11641 cctcaaacac cacacaccca cctcaaacac cacacacaca tatccctcaa acactacaaa 11701 aacacacaca cacacctcaa acaccacaca tgcacacaca cctgaaacac cacacacctg 11761 cctcaaacac tccacacaca ccacacacag acacgtacac agccccccgc acccagccac 11821 agcttacacg cacacagaca cgcgtacgca ggagaaaggg tgtcaaagct tttgctagtc 11881 gtgaaggaag agcccgcagt ccatggggct gcacggtttt ccctgccatt tgctcattca 11941 ccgggtgccc ataacgctcc gcgctcaggt ctcttacagg agggcccgag tctcccagta 12001 ccggaggtct gggggcgcac gagccgctcc tcctctggag aagctccgga ccgagaggac 12061 accggacact gcgcagcgcc gagccccgcg cgcagcccgg gacgcctcag ccagggccga 12121 ccgcgcagag gaagctccca gagcccgttt caagaccgca gccaacagcc tcaggcgcac 12181 acggcggcct cggagcgagc acgcgcagca acgcccctcg ccccggcccg cccccggccc 12241 cgcccccggc cccgcccccg gccccgcccc gcaagggtca caggtcacgg ggcggggccg 12301 aggcggaagc gcccgcagcc cggtaccggc tcctcctggg ctccctctag cgccttcccc 12361 ccggcccgac tccgctggtc agcgccaagt gacttacgcc cccgaccctg agcccggacc 12421 gctaggcgag gaggatcaga tctccgctcg agaatctgaa ggtgccctgg tcctggagga 12481 gttccgtccc agcccgcggt ctcccggtac tgtcgggccc cggccctctg gagcttcagg 12541 aggcggccgt cagggtcggg gagtatttgg gtccggggtc tcagggaagg gcggcgcctg 12601 ggtctgcggt atcggaaaga gcctgctgga gccaagtagc cctccctctc ttgggacaga 12661 cccctcggtc ccatgtccat gggggcaccg cggtccctcc tcctggccct ggctgctggc 12721 ctggccgttg cccgtccgcc caacatcgtg ctgatctttg ccgacgacct cggctatggg 12781 gacctgggct gctatgggca ccccagctct accactccca acctggacca gctggcggcg 12841 ggagggctgc ggttcacaga cttctacgtg cctgtgtctc tgtgcacacc ctctaggtaa 12901 agagggggcc gcgcctcttc cccgccccga ccctccatcc ctttcctccc aatggattgc 12961 aggggggcgg gaaaaacgtc tgtctctctc tctagggaag gccacatttc tgtctgtctc 13021 agggactctg tgacttgtcc cgcagggccg ccctcctgac cggccggctc ccggttcgga 13081 tgggcatgta ccctggcgtc ctggtgccca gctcccgggg gggcctgccc ctggaggagg 13141 tgaccgtggc cgaagtcctg gctgcccgag gctacctcac aggaatggcc ggcaagtggc 13201 accttggggt ggggcctgag ggggccttcc tgccccccca tcagggcttc catcgatttc 13261 taggcatccc gtactcccac gaccaggtag gaaccacccg ggccctcagc caccctccca 13321 cctcccaaag tcccccagcc cttgatgctc ccgcagcccc acctgccagc ccagccctca 13381 cggcagctgc ccgcctcagg gcccctgcca gaacctgacc tgcttcccgc cggccactcc 13441 ttgcgacggt ggctgtgacc agggcctggt ccccatccca ctgttggcca acctgtccgt 13501 ggaggcgcag cccccctggc tgcccggact agaggcccgc tacatggctt tcgcccatga 13561 cctcatggcc gacgcccagc gccaggatcg ccccttcttc ctgtactatg cctctcacgt 13621 aagtgatctt ggcccaaccc cctggctgcc cgtgacccct acccagtgct aactccagtc 13681 tttgccccca gcacacccac taccctcagt tcagtgggca gagctttgca gagcgttcag 13741 gccgcgggcc atttggggac tccctgatgg agctggatgc agctgtgggg accctgatga 13801 cagccatagg ggacctgggg ctgcttgaag agacgctggt catcttcact gcagacaatg 13861 ggtatgccag cagggcagct gggtgctccg gccctgtcac gggccagggc cctggaggcc 13921 ttgcagttca gctgcttgcc aagaacatag tgggtgaggg ggtgccagga gatgctggcc 13981 acgttgcagg ggcccaaggt gtagtcagga gacacagtgc acagagagct ggtcttggta 14041 ggcctgggag gtgccgggct catgctgggc acctccgggc aagctttgtg acttagaggt 14101 gtggggccac tggtcaccct cggtggctca gaggctgtgg ctccatggct catgagcgcc 14161 tcctgtgtcc cagacctgag accatgcgta tgtcccgagg cggctgctcc ggtctcttgc 14221 ggtgtggaaa gggaacgacc tacgagggcg gtgtccgaga gcctgccttg gccttctggc 14281 caggtcatat cgctcccggt cagtccgcag gccctctcct tggaaccctg gccccaccac 14341 cccaaccttg atggcgaact gagtgactga ccagcctcct gcccccaggc gtgacccacg 14401 agctggccag ctccctggac ctgctgccta ccctggcagc cctggctggg gccccactgc 14461 ccaatgtcac cttggatggc tttgacctca gccccctgct gctgggcaca ggcaaggtag 14521 ggccggtgac ccctgatccc agatccttgg cccctgtcct ggccttcccc tggggtgagt 14581 gtgggcagtg cctgagagtc tgtgcctcag tgcctcctgc actgagtggc atccaagtgg 14641 cgccacctct caggttcctg ggtgggcaag aagcggtgca cgtccagggc ctcccaccag 14701 ggctggcagc cccaggtatg tgcagtgctt ggggcctgcc ccgccccgtg acccctgact 14761 ctgcccccag agccctcggc agtctctctt cttctacccg tcctacccag acgaggtccg 14821 tggggttttt gctgtgcgga ctggaaagta caaggctcac ttcttcaccc agggtaaccc 14881 ctccccgtgg atccctcccc ccgacctgct gacccctccc cggagcccta gatccctggc 14941 ccctcctctc gcccttgccc tgtgcacaga attggccccc tccccaggct ctgcccacag 15001 tgataccact gcagaccctg cctgccacgc ctccagctct ctgactgctc atgagccccc 15061 gctgctctat gacctgtcca aggaccctgg tgagaactac aacctgctgg ggggtgtggc 15121 cggggccacc ccagaggtgc tgcaagccct gaaacagctt cagctgctca aggcccagtt 15181 agacgcagct gtgaccttcg gccccagcca ggtggcccgg ggcgaggacc ccgccctgca 15241 gatctgctgt catcctggct gcaccccccg cccagcttgc tgccattgcc cagatcccca 15301 tgcctgaggg cccctcggct ggcctgggca tgtgatggct cctcactggg agcctgtggg 15361 ggaggctcag gtgtctggag ggggtttgtg cctgataacg taataacacc agtggagact 15421 tgcagatgtg acaattcgtc caatcctggg gtaatgctgt gtgctggtgc cggtcccctg 15481 tggtacgaat gaggaaactg aggtgcagag aggttcagga cttgtacaag atcacccagc 15541 cagaaagagg ttgggctggg atttgaaccc tggtgtcgtg gctctggaag ctgccctggc 15601 gccttggtga tctgcgtggg tcagtgcaca caggcacacg tcagcctcaa ggacatgggc 15661 acatctgttc acaggagcag cgccacgtgc ctttgagtgc caggaacggg gtgggagggt 15721 gggagggtgt gagggccaga agactcagaa gatgcaaagt gcctgagaga gacgggatat 15781 tcccccagaa gaagcattct tagagacaca ggcactggac ctccttggtt cttataagaa 15841 acctgtctga agctgggtga tgagttgcac actccaggtg gggctaaggg gcctggagcc 15901 cctgctggct cctaggaagg cacagcagca ggccctgaga cggctcctct ggggcccctc 15961 caccctccca ggcctctgca tttcacctgt gcccacactt ctgtctcctg ccttcacctt 16021 ttgacccact actaacgatt ctccacccag cagacaaagt gatctcttaa aaatatctgt 16081 tggctgggca cggtggctca cgcctgtaat cccagcactt taggaagccg aggcgggtgg 16141 atcacctgag gtcgggagtt cgagaccagc ctgaccaaca tggagaaacc ccatctctac 16201 taaaaataca aaattagcca ggtgtagtgg tgcatccctg taatcccagc tacttgggag 16261 tctgaggctg gagaatcact tgaacctggg aggcggtggt tgcagtgagc cgagatcgca 16321 ccattgcact ccagcctggg caacaagaga aaaactctgt ctcaaaaaac aaaaaatctg 16381 ttaggctgca cacggcgatt cactcctgta ttcccagtgc tttgggaggc tgaggtgaga 16441 ggatgcctga ggccaggaat tcagaccagc ctgggcaaca tagtgagacc ccagctctaa 16501 agatttgttt ttgttttttt tttttttttt tttttttttt tttttttttt tttgagacgg 16561 agtctcgctc tgtcgcccag gctagagtgc agtggtacca tctccgctca ctgcaacctc 16621 cgcctcccgg gttccaggga ttctcctgcc tcagcctccc tagtagctgg aactacaggt 16681 gtgtgctgcc atgcccagct aatttttttt tatttaatag agacaagatt tcaccatgtt 16741 ggccaggctg gtctcaaact cctgacctca ggtgatccac ccgcctcagc ctcccaaagt 16801 gctgggatta caggtgtgaa ccaccacacc tggccaacaa tatttgtttt aattagccag 16861 gcgtggtagc atttgtccta gcaatttggg aggttgaggt gggagaatca cttcagccca 16921 ctaggtcgag gctgtagtga gctataattg taccactgca ctccagcctc ggggacagag 16981 tgagaccctg tctgcaaata aacaaataaa acatcaggct gggcttgagc atctattcct 17041 gctcaaaatt tcgcaggctt ctcagaagaa aatccaaacc ccttacagtg acccagtttg 17101 cccttgaggc ctccacccac acccccttcc ccccagtctt agggggtggc ctggctgttc 17161 ccttcaacgg caacgctctg cctccattgt tggcctcctc tgcagggagg gactgtctga 17221 gcacctgccc gtgtctgtgc agcatggcac actgacgtca ggcccacgtg catgcccagg 17281 tggccagtca cacgccaggt gctccctcag tgttggccaa gtgagaggag cacaccttcc 17341 gggcgttcag acacctcccc gtggcagaca ccgttcgttg ctaccaaaca gccacctcct 17401 tcctaatggg ctcccatttt tcagtgctgg gcaaaggtcc cttgatcttg gagttgcagc 17461 ctctttctct ccaaggaggg cggtgaccag cctgagccag tcaatccagt gattggttca 17521 ggagtagcct gtgaccagga gtcctggtag tgaacgactg gggcagccct gggggtgagg 17581 accttgcgca gccgtcacag gccctgattg gacactgggc agctgctaac ccagtgtctc 17641 cagctgccta cctggagagc tccaagcgta agaaaataaa ccctgcctgt tgaagccact 17701 gctagtgagg gttctgttat ttgcagccaa aagccttgct ggaatgtggc ctatgaatgg 17761 ttgtgtggcg ggcacatgtg cctgcgtgag cctgtggtgt cagacagtgt gctgaggact 17821 ctccacgcaa ccgtttcatc ccttttcatc tggtttgagg gctgcattga ccacacccca 17881 aatcctgctg tgtttggaat cacccccata aggtggtggt ttccttgggg tgcccagcag 17941 cccctgatgc tggcataagg ggggcccttg tgcacttgtg gtccgagtgg aggtgctggc 18001 cccaggtggc gctgtagaaa gtgaagcaga ggttcctctt cagactttcc tccccatcta 18061 ggagtaaata gtaacttctc ttaaaagcaa aatttattca aagacctgta ctaacattct 18121 taaataactg ctagccgaat aaagaactca atgtccttta tgttcttagc tcccacaatt 18181 tagcctaaat atctgccctg gcatgcttaa actggcccaa gcaagcatta cgtcacagcc 18241 tgttcctctt ccttatttga aggtgttttt acctttctca tcattccaca agttacttcc 18301 tccttccttt gttctcctct ctcttttcct cttttaaaaa gtgctaagtt gctagccaat 18361 cgggacaaat acagaacgtg aggtcccgtt ccagccgatg gaaaccggac acggcagtaa 18421 ggtggacgca tcaggttaca aatgaccctg tccccttttg ttcggtgtac tctcacggca 18481 aaactgctgg cgagtgtacc ctttctgcag aaagtataaa agaaaattaa atttatgttc 18541 aagtgctgtt tctttacggc accgaggaac aagcatttca aacagcgctg tggttcaaga 18601 tgagggcaga gaggaactga actggaggca tgattccagt atcacacgaa agaccaaacc 18661 ctctagcaag aactaaatga atactaacac ccaaccacag tcttcaggca cctcctccag 18721 ctctgctgct ccacttcctc tcacttgggt agtgtgcata actgagttaa tcccacccgt 18781 gtttatgcat cagcaccgtg cagtgctggg catggggttg tcagaggaga cccactctgc 18841 actcaccggg cttacagtct agcaaaggag atgagcttaa tcccgggatg tcagatgcac 18901 ctgaaatttt cagtgtgatt gaagtcactg cctctggtag ggtggggtgt gtggagttgc 18961 ctgaccaggg acaccagaga acctgtgggt gctacaaatc ttccaaatct taaggggtca 19021 taacatgatg catgcatatg tataaagcga tcaacccaca ctcgcaagct gtgcacactt 19081 tatgacgtca ttgcgtattt tctatttcag taacaatcta accataataa acgctacaaa 19141 cgaagggtac ctggtcttac aagaacataa ccaagaggtg ggggtggaaa gataaatgag 19201 ctctgaggag ggtgttcaaa tgcactctgc acccaaggat ttttccactc ccccattccg 19261 cttttggttt tgctctgatt tgtttgtgtt cagaaatgct tttcaccagt tagctgtttg 19321 agccggatgc accctcctca cacaggcatg tacacgtatg cgtatgtgca catgtgtatg 19381 catacagaaa tacgcacgca tgcacactca cattcacgtg cacggaggca catacacatg 19441 catccgggga tacgtgcccc ccggggacgt gcgcacgtga acacacaagg actcacccgt 19501 gcacacaagc atgcatatac gtgcatattt gcagtgacat acacactgtt actgttgatt 19561 tacaaaacag cccctttcct gtgagcctaa gataatctat tcaccaggct ggcagccatc 19621 cattcccaca gagcagcttc ctcagggagc tcagtgtcct ctcgaagggg ctaagagtgt 19681 tttgaaatgg acagaacaaa caaacaagtt gattctgtgg tttcccctca ggtgcgttag 19741 gaaggaagag gggatatccg gcgcccaagt gattgcgtga acgacagggt aagagggggc 19801 tcctgcaggg aaactcccct cccctccctg tggcccctcc ttggctgtgt gctcgtctgt 19861 cccccaagtg accagaaggg ggagcccagg gaacgtttca atggaacaat gtttaaattc 19921 acctctgaat tttttttttt tttttttgag acggagtttc gctcttgttg cccaggctgc 19981 agtgcaatgg cgcgatctca gctcaccgca acctccgcct cccaggttca agtgattctc 20041 ctgcctcagc ctcccgagca gctgggatga caggcatgtg ccgccacacc cggctaattt 20101 tgtatttttt agtggagacg gggtttctcc atgttggtca ggctgatctc aaactcccga 20161 cctcaggtga tccgcctgcc tcggcctccc aaagtgctgc gattacaggc gtgagccact 20221 gtgcctggca cctctgaata tttttaatta cgaaagtaat gcatattcat tagagaaact 20281 caagtaatgc ccgctcatct cagtcccagg tagccaatgc tatcagttta gggtttgggg 20341 ggcgggcaca caccacattg ttctccacat tcatgcaaat tatgtatttt ggccccaggc 20401 gttgattcaa ctctaacctc accccctgct gcaccctgga cactgcctct cagctgtctg 20461 atgtgtcccc tcccttttgc agccctgagc ctggtgcagg gggtcagtag taatgaaggg 20521 gggagtgcat ttatttatgc cccctccagc ctccttggct cctttgtgtg aaggccgaga 20581 gccttccttg tggatgcagc aacatggtga aacccagtct ctactggaat acaaaaaatt 20641 ggccaagtgt ggtggtgtgc gcctgtaatg ccaggtactc aggaggctaa gacaggagag 20701 tcacttgaac cctggaggcg gaggttgcag tgagccacga ttgtgccact gcactccagc 20761 ctgggcgaca gagcaagact ctgtctcaaa aaaaaagaaa gaaaagaaaa aaagaaaaac 20821 aaagaaagaa actgtggcag ccttattgct gtttaaggag aaagtttaag tggtctgcat 20881 agaagatcaa acaagccaaa acatttcctt aacccaaagt ctaatccaga gcaaggccct 20941 aattccctct gattctgtga aagctgggag agatgagaaa gctgcaggag aaaagctgga 21001 aggtagcaga ggttggttca tatggtttaa ggaaagaagc tgtctccatg atgtaaaagt 21061 gcaaggtggg gcagcaagtg ctgatggaga agctgcagca agctctccag aagatccagc 21121 taaggtaagt gatgagggtg gctacagtaa acaatagatc ttcagtgtac gcaatttttt 21181 tttttttttt ggagacggag ttttgctctt gttgcccagg ctggagtgca gtggcgcaat 21241 ctcggctcac tgcaacctcc gcctcccggg ttcaagtgat tctcctgtct cagcctcctg 21301 agtagctagg attacaggca tgcgccacca ggcctggcta attttgtatt ttttgtagag 21361 atggggtttc accatgttgg ccaggcagga cttgaactcc tgacctcagg tgatctgcct 21421 gcctcggcct ctcaaagtgc tgggattaca ggtgtgagcc accgcactcg gcaccatctg 21481 tgactttcat agctagagag aagtcaatgc ctggttttaa agcttcaaag gacagtctga 21541 ctctcttgtt aggggctaat gcagctggtg gctttaagtt gaatgctcat ttaccattct 21601 gaaaatccca gggcccttaa gaaagatgct aggctgggca tggtggctca cgcctataat 21661 cccagcactt ggggaggccg aggcgggtgg atcacgaggg caggagatgg agaccatcct 21721 ggctaacacg gtgaaacccc atctctacta aaattacaaa aaatcagctg ggcgcggtgg 21781 cgggcacctg tagtcccagc tactcgggag gctgaggcag gagaatggcg tgaacctggg 21841 aggcggagct tgcagtgagc cgagatcgcg ccactgcacc ccagtctggg caaccgagtg 21901 agactctgtc tcaaaaaaaa aaaaaaaaag aatactcatg attcacagga agaggtcaaa 21961 atatcatcat cattaacagg gcttgaaaga agttgattcc aactctcatg gatgactttg 22021 aggtgacaag tccagtggag gaagtaactg caaatgtgtt gaaagtatca taatagcaag 22081 agaattagag ttaaacgtgg gcctgaagat gtgactgaat tgctgccatc tcacgataaa 22141 acttgaacag atgaagaatt gattcttatg gacaaagaaa gtttttttta aattttcttg 22201 tttatttatt tatttattta tttaggtaga gatgggggtc tcaccatgtt gcacaggctg 22261 gtcttgaact cctgggctca agtgattctc ctgccttggc ctcccaaatt gctgggatta 22321 caggcatgag ccaccgtgcc tggctgaaag tggtttcttg agatagaatc tactcttggt 22381 gaagatgcta cggacattgt tgaaatgaca acaagggatt agaatattgc acaggaccga 22441 gtgtagtagc tcaggcctgt aatcccaaca ctttgggagg ctgaagtggg aggatcgctt 22501 gagaccagga atttgagacc tgcctggaca acatagtgag acctcatctc tccaaaaaaa 22561 cgttggctag gcaccgtggc acttgcttta ggtcctggct acccgggagg ctaaggtggg 22621 aggatcactt gaacccagga tatcgaggct gcagtgagcc attattgtac cactgcactc 22681 cagcctaagt gacagaacaa gaccctgtct cttaaaaaaa aaaaaaaaga atattacata 22741 aacttagttg ataaagcagc agcagtgttt gagagggttg acttcaattt tgaaagaagt 22801 tctactatga ataaaatgct atcaaacagt gtcacatgct atagagaaat atttcatgaa 22861 agagtcagtg tggcaaactt cattgtctta ataaattgcc acagctgctc caaccttcag 22921 caacgaccac cccgatcagt caacagccat caacactgac gcagtacctt ctaccggcag 22981 aaagattatg actcactgaa ggtggagatg atcactggca tttttttagc aataaggtgt 23041 tttctttttt tttcaaaaca gagtctcgct ctgatgccca ggctggagtg cagtggcatg 23101 atcttggctc actgcagcct ccgcctcccg ggttcaagcg attctcctgc cttagcctcc 23161 agagtaggtg ggactacagg tgcgtgccac cacgcctggc taattttttt ttttttaaga 23221 cggagtcttg ctctgtcacc cagactggag tgcagtggcg tgatctcggc tccccaagta 23281 gctgggacta caggtgcccg ccaccaggcc cggctcattt tttgtatttt tagtagagac 23341 ggggtttgac cgtgttagcc aggatggtct cgatctcctg acctcatgat ccgcctgcct 23401 cagcctccca aagtgctggg attacaggtg tgagccactg agcctggcca cgcctggcta 23461 attttttata tttttagtac agatggagct tcatcatgtt ggccaggatg gtctcaatct 23521 cctgacctca tgatccgcct gcctcagcct cccaaagtgc tgggattaca ggtgtgagcc 23581 actgagcctg gccacgcctg gctaattttt tatattttta gtacagatgg agcttcatca 23641 tgttggccag gatggtctca atctcctgac ctcgtgatcc ccccgccttg gcttcccaaa 23701 gtgctgggat cacaggcatg agccaccaag cccggcccta ttttatttta tttcttttga 23761 gatggagttt tgctcttttt gcccaggctg gggtgcaatg gcgcaatctc ggctcactgc 23821 aacctctgtc tcccgggttc aagcaattct cctgcctcag cctcccaagt agctgggatt 23881 acaggcgcct gccaccacat ctggctattt tttttttttt ttgtattttt agtagagatg 23941 gggtttcacc atgttggcca ggctggtctc aaactcctga cttcaggtga tccacccacc 24001 tcggcctccc aaagcgctgg gattacagtt gtgagtcatc gtgcccaggc tcaataaggt 24061 attttcaaat taaggtatat actttaaaag ctattgcaca ttcaatagac tacagtatag 24121 tggaaacata actttttttt tttttttgag acaagttctc actctgtcat ccacactgga 24181 gtgcagtggt gtaattacag ctcactgcag cctcaacctc ccaagctcaa gcaattctct 24241 cacctcagct tcctgaatag ctggggctac aggcacatgc caccacaccc agctaatata 24301 tatatatata tataaaatat gcatgtattt atatatatta tatacatgta tatctatata 24361 ctacatacgt gtatatctat aaaatataca ggtatacaat atatatacag accgggtttt 24421 tttatgttgt ccctgctagt cttaaactcc tgggctcaag cgatctgctt gcgttggcct 24481 cccaaagtgc tgagattaca ggcatgagcc accattcctg gccaacataa cttttatata 24541 cgctggaaaa cagaaaattt gtgtggctca cgttgttgca atcttagtgt ggtggtctag 24601 aaccaaacct gcaaaatctc caaggtatgc ctgcagcgac ctctgctatt ttgcttttat 24661 cttttgcctt aaattcttct tgattcccct aacctccctc tgccttcaac cctcaagtag 24721 aagggagact ggaaaaaata caccaaaata ttagaaacag ttatctctgg gccataaaaa 24781 tacagttgat tttagccttt cttctatcat ctgcagtctc caaaatgtct atgatatgtg 24841 ttactcttta tagggaacgt aatttttttt tttttctttt agacagagtc tcactctgtc 24901 tcccaggctg gagtgcagtg gtgcggtctc cactcactgc aagctccgcc tcccgggttc 24961 acgccactct cctgcctcag cttcccgtag ccgggaccac aggtgcccac cactatgccc 25021 agcgaatttt gtttttgtat ttttagtaga gacggggttt caccgtgtta gccaggatgg 25081 tctcgatctc ctgacctggt gatccacccg cctcgtcctc ccaaagtgct gggattacag 25141 gtgcaagcca tcacgcccag ccagggaaag taatattttt aaaaataggg aaatcctact 25201 tttatagtgt ttataaaaat aataacttac tcaagcacta cccttggcct gtctgactgg 25261 cagattcatt taagccttat gataacctta tggggtggta tttttattgt agccaaaata 25321 acttttttgt ttgagtttga ttaatttatc ttaacttttt tatctttagt ttttctgtgg 25381 atatttccat ttgcacttgg aggtctctaa taagtagcaa ataatacttc aaaaaattaa 25441 gacatatttg ctgcaggaaa aaaaggaaga gaaacaccct cagctcaact tcagcaaaaa 25501 tgaggtggtt tttttttttt tttttttttt tttttttttt ttttttgaga cggagtcttg 25561 ctctgtctcc caggctggag tgcagtggcg cgatcttggc tcactgcaag ctccgcctcc 25621 caggttcatg ccattctcct gcctcagcct cccgagtagc tggaactaca agcgcctgcc 25681 accacacccg gctaattttt tgtattttta gtagagatgg ggtttgaccg tgttagccag 25741 gatggtctcg atctcctgac ctcatgatcc gcctgcctca gcctcccaaa gtgttgggat 25801 tacaggcatg agccactgcg cccggccaaa aatgactttt aacatttttt gtgtttcagc 25861 atttttcaac atatagtttc tagaatgcta catatgaccc tgcctatttt ctttctttct 25921 ctttctttct ttcatctctc tctctctctc tctctctctt tctctctctc tctctctctt 25981 cttttctttc attgtggtta cagggtcttg ctctttccag ccagagtgtg gtggcatgat 26041 catggctcac tgcagcctca acctcctagg ctcaagctat cctcctccct tacccaccca 26101 agtaactggg actacaggtg tgtgccacca cacctagata atttttgtat tttttttttt 26161 ttttttttgt agagatagag gtttcgcttt gttgctcagg ctggtctcga actcccagac 26221 tcaagggatc tgcccacctt ggcctcccaa attgctaaga ttacacagta gcatgagctg 26281 ctgtgcccag aaatttattt ccacttaata gcataacaga aattttctgt gtaattacaa 26341 acattggaac tttttttcac atctacccaa tatttcaatt aggtaaatat accttaatca 26401 attaacccta ttggtcaata ttcagattac ttttatgctt acattatttc aaagaagaat 26461 gcaataaaca cttttctgta gaacgcgttt tatttagaac gccatccttg ggctagattg 26521 gcggaagttg gaggcgtcat gcagcgcctc ctgcctggga gccaggcgat ccgccaggtt 26581 ctgggaggac ctaaggtcaa cagccacgcc tgctctctgg gctcacagtc agcagccgct 26641 accaggagag caagtcctca gaaggcccga gaccagggcc cgggagcgcc cacaaggacg 26701 ggaggtcacc cagtgacaaa cagggaaggg cagcaggtgc gcccaggaac ccagaggctc 26761 ccagagcaga agtgggatgc aggggtgggg gcgggtcatg ctgaggttca acagggacct 26821 gcttcctggt tcaacaggga gctggccaga gagagggatc tgctgaggct tctgtttaaa 26881 aaccctggcc tgggtgtggg tgcaagtgga tttgcaggtg cagtgaggca gcggccggag 26941 caggctgttg cacgaagacc caggaaagtc agggtggttt gatccaggca gaagaggggg 27001 aaatggagcc tggggcagac gtaaaggtgc tggggtgagg ataagggtga caggagggag 27061 gagtctgggg tgaccaaggc ctgcagctca ggtgactggg aaaaaacata aaacagaaca 27121 cagatgaatg caggggtcgg gacgaaggct cctgtgagtc tgaagtgact gtgagtggca 27181 ccctctggcc agccacagtg gctctcacag ctcattctgc agagcaccgg cccagtgatg 27241 agccgaggaa gtgcccctgg aggtcagctg gatctgccca gtggtctgca cagggggcgg 27301 ctgccagggg cctgagcaca gggaaaccct cgaaactgtg ggagggatga agccagagag 27361 gtagacccca gaaccgacca agcccagcat ggggcgtgga gggcggccaa gctcaagaag 27421 accatgagat tggagaagga gctgcagaga ggtggaaggc agacaggcga aggactcacg 27481 agcacgtcca acagccgtcc taagggccgg ctggagggaa agagacctga agacacgttg 27541 gggggcgggg ccgggcaggg gcggggccag gccgggcagg ggcgggacgg gggcgccgga 27601 gggcagggct tggagaattc tcacctgagt ggaagagcag agggaggagg tgggtgggtg 27661 gcagcggttt ccaggcagca gctacagggg actgagggcc acaccccgaa tgccctagcc 27721 cagggggcac aagtttcaaa cgtacacacc cagtggggct atctgcctct ttacagcttt 27781 atctcaatta gtaggaaagt tttctcataa acgttctgcg tttccgtcgt gccactggtc 27841 tgcgtcctcg gtctctctgg ggcctcaggt ctgtgcgctg cacacgggct gaacctctgg 27901 atcatgatgc acttcctaac actgggcagc tatttctctg cttgttggct ctgtgaagac 27961 gtgtttttta cggacagcac tttctaatgt ttatgtagta agtttgtgta tacttccctt 28021 agctcactaa ggaaccttca actgttgtct tcctggtgtt gcacgaatcc tgggctcaga 28081 cagagggacc ccggccatgc tgacccctgc ctgggaggcg ggctcaccca ggagggcctg 28141 ggagggaaca ttgccgggca tctgcagtca cctaagcaca tagtgaaaaa gtcacctatt 28201 ttcttctggc tttctctgga gaatttttta aatatccatt ttaagatcta agtagattgt 28261 ttttcttttc ctttttctga gacggagtct ccctctgtcg cccaggctgg agtgcagtgg 28321 tgcaatcttg gctcactgca acctccgcct cccaggttca agcgattctc atgccttagc 28381 ctcctgagta gctgggatta caggtgtgcg ccaggacacc tagctaattt ttgtattttt 28441 tgtggagatg gggtttcacc atgttggcca agctggtctc aaactcctga cctcaggtga 28501 tacgccccgg ctcggcctcc caaagtgttg ggattacaag cgtgagccac cgtgcccggc 28561 cagattgttc ttcaaggagt tcatgccccc actacacact gacctgagcc gtgctggccc 28621 ccagtccgtg cacgtggaaa ggacacccca ttgcacagcc tcggattcct ctgtggccaa 28681 tcgagtgtat ctagggagaa ttcccaaggg tgcttctaga tacggcacct cagataaggc 28741 cccagagcag caccgcagga cccccggcac aatttcggaa agagctaaaa gggtgagatt 28801 gccacaacac tcccgccact ggggactctg tccctaccac aatgggcagc ccctgggcac 28861 ctggccccat gtcttagcac cctctagaaa cagccaatgt cacaccaagc agagagttta 28921 ttgaggccat gggcagggcc caggaacaca gcacggggcg tgcgtggggt ggatgcacgc 28981 tggactgcag gggtgcgcta gcagcggcaa gcggaggggt attgggggcc caggagggcc 29041 ccccctccta gcagcgtcta ggggcttcag gtgagcatca agaaccactc ctagcctccg 29101 cagggctctt ttgggaaccc ctccctgctt ccccctgagc tccatgtggg gtggggctgg 29161 ggcgaggggg cccctcttgt ccctgtggtc aaggatggga tgatcccttg agattcacta 29221 taaaaattaa aattccctta taaatcgcag gggagtggca ggagggcatc acttggaagc 29281 cgacattacc tgtccctaga gccgcacctc ctccacaccc ccgaagttct gaccccaaag 29341 tgaagaatag aggataaggt ttgcaggcac agagaggggc agagacacca ggagagagaa 29401 aagggagact gggaagggaa ggaggaggac aacacagaga aagaacggat ctgaggaaaa 29461 gcaggggcac gggggctcca ctcccccgca gccccaacag ctgcgccctg gggaccgagg 29521 acgagagttc ccagcccgca gacccccaca agcgtaaggt cccatgtccc cccaatccag 29581 tccttgccaa agccatggtg gcctcttgag acatcctcag ccccagctgc gcctggagca 29641 gacggccagg agacagagcg gtgggcaggc tactccaggt agatgtcctc cgtggggcag 29701 gcgtacgcca ggtgctcttg gtagtactcc aggaaggcgc ggctggggaa agacggtcct 29761 ccaccagcca ctcagggccg gtcctcaacc cccaaactgc cttccacaga gcctctgagc 29821 tctccctcct tcctccctct gcagccatcc ataacccccc acccaggcat ggccatgtca 29881 cccccaagtg tgaccctcag ctctcctggc ccagcaattc tacccctgcc agagtgcctg 29941 ggccatgtgt ctgtgcctgt tcccacccca tgcccagcac cccaaactgt cacccaccac 30001 tgccctctct acacagcaca tctggttctc cttccccaga cctcccctgc caggctcctg 30061 cccccacctg ctcacctccg ctccccacct ccgtaaccct gcacccattc tgcacccaca 30121 ccctggggcc ccactcaccc cacactctgc gccaccggcc tcatggactc ctgggagaca 30181 aagacgtggc aggcgaagcg gctcagcagg gggtgtttgg tgatgaagcc gaaatagctg 30241 cgggcaagtc agtgtggagg gcacgtcaga gccctggccc agccttggcc accccacaaa 30301 tgatgtagag agaagaggca ggaacaaagg gcttggggag agggtaaggg acaggagtgt 30361 gggggtcgca caatttcaag atctgaagtt gtcttttttt ggtttcagcc caagcacagg 30421 gctgggcacc taccaagagt tcacgctctg agttaattcc atgatgacca ggggcgtcca 30481 ctctcccttc tctgagccgg tgctgctacc ctctgtggcc ctattcaggg cctgccccag 30541 gaccacagag caaggcctct gcctggctac tggagaccat atctcctgcc aactcaggcc 30601 tcctcctgcg ggcctagcag actgacagcc ccttgcccag ctccgaggtg gggctggacg 30661 ttacccctgg tctgtaaaac cctcttccgg tctacccctc gctcccactg gaaagcagct 30721 ccttgcgggg cctcgctcat ggagtcactg cccagctcct ccacccagac attagcaaaa 30781 atcaaaaatc gtcgtccaga gggagaggca ggcagagttc tgcagcagga cgctgcccca 30841 cccccacccc tgaaactggc cgacccgtcc attcccccat ggcttcgggg ctttgcgatt 30901 ccaccaaatg ggaagccaca tgatggtctg cttgccagga cagtgtgtgc acgctgtcca 30961 cagacagagc acagggcccg cctcaaatgc cgcgttcaag tcatgatggt aataagcgcc 31021 gacccacaga tctggcttcc tgcattactc aggcaccatc ctctgccctt tccattaact 31081 cacagaagcc tcacaacagc cctctgaggt agggccctga tggagagcta ggcacagagg 31141 ttcaggagct tgcccaggac acacaggcag gatgggcagc cctgggagaa aaggctacag 31201 tctgtgatcc caacaccaga catcctaatg ccagatccca acacagcaca ccccacgctc 31261 tcctgaccta ccaccaccgc catgaagaaa cgagttcatc gttcacctca cacggtttcg 31321 tccgtgcctc ctcgtggcaa gggaccgggg acggggggtc tggggtgaca aatattagga 31381 aggtctgggc cagggcagca gctcctccac ctcctcccac acccctgcca catcttgact 31441 ccaaacccca aactcgactg accaccttcc tctcacgaaa ttctagctgg ggccctgtcc 31501 agcctgagac ctgagcttca cccagtttcc ttctgatgag aaatttgggt cattcttcct 31561 ctcctggcag cgtccgctgt cccagtgacc ccagagctgt ccccactgtc tggtctgcgc 31621 cccgtccgct gtcccagtga ccccagagct ggccccaccg tctggtctgc gccccgtccg 31681 ctgtcccggt gaccccagag ctggccccac tgtctggtct gcgcccgtgc ccactgtccc 31741 acctctccac cccagagctg tccctgctgt ctggtctgcg cccctgcccg ctgtcccacc 31801 tctccacccc agagctgtcc ctgctgtctg gtctgcgccc cgttcgctgt cccggtgacc 31861 ccagagctgg ccccactgtc tggtctgcgc ccctgcccac tgtcccacct ctccacccca 31921 gagctggccc cgctgtctgg tctgcgcccc tgcccgctgt cccacctctc caccccagag 31981 ctgtccctgc tgtctggtct gcgccccgtt cgctgtcctg gtgaccccag agctggcccc 32041 actgtctggt ctgcgcccct gcccactgtc ccacctttcc accccagagc tggccccgct 32101 gtctggtctg cgcccctgcc cgctgtccca cctctccacc ccagagctgg ccccgcagtc 32161 tggtctgcat cgctgcccgc tgtcccacct ctccacccca ctgagcagcg tttccgcatg 32221 gatcccatca gaccatctct gcactcggcg gggacaacac aaaagggctc cgcaccttcc 32281 agcaggcagc accctcactc tcgaccacag ggagagtcca caaggccccc aagatgcact 32341 gacgtctgtc ccacgcctca gtcctggcag caccgtgccc tgctgtgtct tgcctgtctg 32401 tgagcctgct tcctccaact ccccgcaagt cctttccagc ctctggctgg ctctgcctct 32461 ctggttcctc ccagatctcc ttcctctagg gtgctgggac acagagggca ggggcctgtg 32521 cctggaggga agctggcagc cccccttggc ctgctgctcc tatgccagca cctcctcctc 32581 ccctcctccc ctcaccctgc ctggccattc ccctctcgtc tcagtgatct acctgctctg 32641 acctggtatc ttccctccag gtctccctcc taaactccag acccatggga ccaacggcca 32701 cttcagtctc accgtttgtg attgatcccg catcccctct cggtgtgtgg gctgcagtac 32761 aaccatcccc cagggagacg tcaccctggt ctcttccagg tcaccctcta cagccggtca 32821 acctaggtcc tgcctgtact gcctcttcag tgcctcttgc atctgcacac ttggtctcag 32881 gacatagcac ctgtataacc tcagcagccc ccaggtcccc tgtgtcacct ccttccatta 32941 ggagccttct atggcctcat taccagcaag attaagtaca gctccttggc ataatcaaaa 33001 gggtccaggt cttccccggc atttctagct gctcccctta cacgcgggcc ccagagccta 33061 tatagccccc gaccagtgca tcccacaccc ttggctcagc tagaccccac caggaagatc 33121 cacttcccca ccgctgtcac ctgctgtgtt tccaggctgg ccatcctgag ccccaggccc 33181 actcaggcgt cctctgcacc cacacaggct tctcacatcc ttcgcccgtg tgcctctctc 33241 cacccagact ggatgcaagc ctgtctgacc cacgcctcct gagtccagct cagagcaggc 33301 gccactgcag gctcagctca cagagctaag tgggtgcagg tggagctctc agggctgggc 33361 acgggcagcc ctcagggtct gtggggactc agggggcggt tggactctca gggtttgtgg 33421 gggcggttgg gggactcggg gacggtcaga ctctcaccag ctgttgcggg gatggcagcc 33481 gcagaaggag atgttcttca tctggaagaa atggctgcag cgctggaact agagcagagg 33541 tgggcagaag tccctgtgtc aaaggaggcc tggcgccacc cctgcctccc acctactgga 33601 cacccctgcc agcctggccc cgtcggactc cgtggcctaa ccctatcccc cagtccccgg 33661 tccccagtcc cccacactcc cacctcgggc cctcctccgc tcagactcag cttgaccccc 33721 cgaagagaga tctcgaggtc acaggaggca ggagggcgca ggtggacggt cagtttccgg 33781 gcagtggcaa tctacagaag aagttggggg agaagagggt cttgaggagc ccacaagggc 33841 ctttctttcc ccttggtggg actacgcctc tgctcctttt agcttatgtt tgaggacaca 33901 catgctgagg accctgagat gcaattctgg aagacataag gggtgtggcc ttggccactc 33961 ctcaggctca ggaggagcaa tcgcctctct gcagtcagct tgcttcatct tccttctgct 34021 cctccaggca ggctgctgcg cttggctcag gagccaggaa cagagacctc acccgccctg 34081 ggctgctctg ccctgctccc tactgccccc cagcctcagc ctcagtactg gtgcctgggc 34141 aggtgctcac taaacgtcca ctgagtgagt ggatgagtag agggctccca gggaaggggc 34201 tgtgagggct ggtcttgatt ccagatggga agaggaaggg ctgatggcag tcggggccac 34261 ctggaagatc tgtggcccag agacccagag gtgatgaggg acagctgctg tgctctgccc 34321 tagactcacc tgctcaggcc tctgaaccag ggcctggggt acccaccccg agctccccct 34381 gcagcatccc ccagaccctc cctggaggaa atcagacctc agaccttgga gcctcaagcc 34441 ctgtgccagc ctgcagtgct ggtgtggccg ccaagcggac cctcagcctg cacttgcccg 34501 gcctccacac tgaccttctg catggctgca cacaggatgc cgttgccctg gtggcagggc 34561 acctccacgg agcccaggaa ctgcacgtca aagcgctcca cccagcaggg gctccgctta 34621 ctccctgggg agcggggcac agagggccca ggtcaggcag gggcacaggg gaggatctgg 34681 ggtgaaaagc ctcccctcgg ggttgggacc tcacccagca ggtccttggc agggccgggc 34741 accgcatggg cgtagaaggc aggaaacaca ccgcgctccc ccgtgcgcat gttgaagcca 34801 cggaaccaga agtcgtcctc ctcggcctcc accaacacag ggtcatccac atccagctcc 34861 agctcgtctg gatgccgcgg gatgaacctg ccgttggggg agaggtcact ggggaggggg 34921 gcgggattga cggtcgtgtc cactggcgtt ccgtgatggc agggactgag gctggggcca 34981 ccgccagcaa ggagggaggc cgtacctgaa cacagcccgg tgagtctgct ctcgctcctc 35041 gccgttgacc agacaggaga aaaggccaaa ggactcggtg cctgggagac ccggcaggac 35101 agaggcgggg gagacaggga gacagagtca gagtagtgct tagaggcctg gccttgggca 35161 gaggccaagt cccctccaca gtccggtgta ggtgtcctac gagcgggaag gagctgaggc 35221 agagcaggcc acctcatgcc atgtcctggg tgggagaggg cacccagccc tgaccacagc 35281 cagaccctca ggtggccctg ggcactgtgg gggaccctgg gactggggcc agggagtctg 35341 caccttcctc tgggccaccc cccttttccc cacctctcac tcactggagg accgagatgt 35401 gctgttgacg aagacattga ggaacttctt ggagaaagtg aggtcagggc tgtccggcga 35461 ggtgtccccc gagacctgac cgccgcctag cagcgccgcg cccgcctcct cctcgctggc 35521 ctccccgccg ctgtcctctt cgctgtcgtg gcccagccca gcacagcgcc gcaggctcac 35581 cagctccagc tgcgtgtgct catccaccac cagcgtgtac ttgaccgcgt cgtacaccag 35641 cgacgcgtcc cgcggcgccg aggggcccgt gcccctgccc ccgggggacc ccgcactgtc 35701 ctcggcatcc tcctcttcct cgtcgtcctc ttcgtcctcc tcctcggagc aggcggcgga 35761 gcaggctcgt cccgggcggg cggcgcggcc gggcgcggcc cacagcggcg tgatggtgtc 35821 acgcgtgggg ttgctgagga ataggcaggg cccgggctgc gcggggccgg gtcgaggcgc 35881 ggcgggcgcg ggcggcggcg gcgcgcacag cgtctccatg tccaccagct ccacgccgcc 35941 gggccccgcg gccgcctcgg ggtcctggga gtcctgggcg gccccgccgg ccggcgacac 36001 gggctccccg ggcgggcgcg gcgcaggaga caggcactgg ccggggcagc ggatgggcga 36061 ggagccctcg gagatcatgc ggctcaccag gttgctgagc agccagggcg agtccgcgtc 36121 ctcgctgagg tccggctccg actccgaccc cgactcgtac tcgctgttgg tgtcgtcggg 36181 gcccacgggc aggaaggcgg ggcggcgcgg gggttcgcgc gggggctccg gctccgaggc 36241 gggcgacgag gcctcctcga tggagttggt gaggtgcgag gagcggccgc tgctgctgct 36301 tccgccatcg ctgctcagct ccagctccgt ctccgagatg gacgagatca tgcgccccag 36361 gcggcgccgc ccgcgtcctc cgagtcggag ccgggcgagg acagctcctg gctgctgcga 36421 cgtcccccgc ggccgccgct cgagcggctt ctcaggtcag cctcgatgcc gggatctgag 36481 gagggcgaag tcccccctgg cgcagggggt tccgcagccg gttcccttcg cagtcgcaac 36541 ccgggcgcac tggcgactgc gccccgccgg gcccggtgtc cgtggcaggg agggggcccg 36601 ggagctctgc tgggcaggga ggtggggtca ccctctctgc accacccagg cctcttccag 36661 cagccagtcc atcctccctg ccctcctccc ccgcttcccc gcgccccgcg ggcccaccac 36721 ccctcacctc tgagggcctc cggggcgggt gagcatagcg ctgtctcctg ccaggaggcc 36781 ggacgcacca ggtcaaagcc tccgttgttg tttagggagt cctggggtgg agatacatga 36841 cggcggtcct gaggagggcc agctcccagg gggtgagcag ggctggctgc agcccaggtg 36901 ctatgaaccc aggctcagct ccagggagct gtgggtccgg gcactcacct gggcccccag 36961 tgtggtcaga cggagggtgg tgggccggtg cttgtggggc tcctccacgg aaggggaggg 37021 gataaggggc ccgggggcag gtgcctctga gccagggtct cctccctcct ggccttcccc 37081 atctccctcc tcctcctcct cctcctcttc ctcgtcctca tcgtcctcct cttcattgtc 37141 atcgatcatc tcaaactcct ggaagtcatc ctggaaggag cagatggggt gcggctgctc 37201 cgagcgcccc agggagaggc tgtcctgcag gacggaggga gatagcaggg ctgggccaga 37261 gggcagcagt agcccagggg cccaggtgct aggagaaggg aaatcgggga gcatcagtca 37321 aaggagagcc ccttccagct cactctgccc ctctgtctac agggcgatgt tttcaaaatc 37381 tgcctaacac acattgggat cactccagtt cttgggaagg gcctaaactc tacagctgaa 37441 tgacagggat tctcagagca gccccagccc cactttcaga ctcatctaac tgcagaaccc 37501 catcacattt gacaccagca ctgtgttccc ttcgcagcgt tttaacacaa cttacagtta 37561 ctggacatgg tttttggtct gttgcaagcc ccaccagggc agggtcatgt ccatgttatg 37621 cactgttgca cccgccccca agaacaagac ctggggctca gcggatgttc agtaaacatc 37681 aaacagatgc agaaatggag agacagcctc tgcactctga ggattcacca actttgcgcc 37741 aaccccaggg ccattggcac agttattccc catgctggca cttcctctcc tccattccct 37801 cctacctacc ccatctgcct ggctctgcta acctcagggc atggttgaca tttcctctgg 37861 aaattctccc tgattccttc agtgtggatt gggttccctt ccccctcgtg gaacttatgt 37921 ccctgttgga atgcagccct tgacacgtgg gcagccattg tcccttacaa gtctttctcc 37981 cctactaagt tgcagctcca gagagcagct gtcctgttcc actcccagca cccagaagcc 38041 cccagtactt gccacatggc agttgctcag aaatgttctt caataaatga aagcagaaat 38101 gtggagagac agggaggagg agaagcaaga gatggacgag aaaagatttc agcaggggga 38161 gacatgaaac agacagatgg gagctgcaga cgagggcaca gggctgcaga ggggacaagg 38221 ttgtgatggc agctgccagg gattagatct ttctgggagt gatctgagga cagagggcac 38281 ccaaagcaaa ggagcacctc cctgccccgg gctggccacc gatggggtcg gggagaatgc 38341 tggggaaggc cctgcgtgga ggcctgcaat gtgttggaat aagagagagt tgataccaag 38401 gaaacctcaa gggttaagtg gggaggagcc atcagaggag tccccggttc aaggctgagc 38461 tcacacccac cctaaaaccc caagtgaacg ctaagaagaa cctaatgagg ccctcccttg 38521 agctggatct gtgtccccaa ctcttctccc accttctcac agtggtctga gtcgtagctg 38581 aggcccaggc cacagtcatc agtgatctca gacagatctt cgtcgtcaaa ttcttccagg 38641 cttatgtcct ggggaggcct ggaaaggaga ccagggtcag agagccacca accctggact 38701 ccctgcctgg caactggccc tgaggagggg cagagggagg cgtcagctcc tacacccagt 38761 ggtttccaca gcccccagct ctcggaacac atcctgccat catcttgacc cccaagctct 38821 gctcgcctgt ggatcccctc ccccatctga gccaggcctg cccggaggag ccaaccctgc 38881 ccccacccct cccaccccgc ctccctggag agccagcagc ctctcctccc tccactcgca 38941 gctccaggcc cctggggtca gggtcttgca gagcagggga gggggagctg caacgcggca 39001 gcggcagatg gggaggggtc gctcctgctc tgggaagaga gtctggcctg tttgtctccc 39061 cctctccggc ggagtgaggc ctaaacccag aggcccagcg tcctcccaac cccagtccct 39121 tcccgcaggg ctccgggctt gatttccact ctctcccggc cgggccctat cggccaggtc 39181 tggcgtccac tgaaagaggc caaggccacc gggacacgcc gcgccgggag ggtgtgcccg 39241 catcccgccc gcatcctccc ccgcagaggg tcggcccagg gcctccaccc ttgacaggtc 39301 agccgtccag gccgggcgcg gctcagaatc ccggcggggc ccgggggtca ctgaggctgg 39361 ggacgcgcgg tgaagatgga gggagtggga gcggggtggg ggtggggcgc ccgagccgag 39421 gctcagggac cgcagggggc cggggccggg tccgcgccgg cggcggggct ggggcggcgg 39481 ggacgggggt ccgggccggg tcggggtcgg gggggtgcgc agggagcgga gggtccgccc 39541 ggcccgggcg ccgagggggg tacctgcagc ccggcggcga cagcgagtgg aaggtggaga 39601 gagaaaacat ctccgcgcga tccgccatct tctccgggag aggcccgcga ctgcggcggg 39661 gggtgcggga gcctcgccgc gccccgcgcc ggcgctcggc ccgccccgct cacctcgtga 39721 cacgccctgc ggcgtccgcc ccgcgcacct cgcccggact ccgcggcggc gcggggaggg 39781 cggcggggga ngcgggggat ggactcggcc ggggacgctg gttggaggag aggcggcggc 39841 ttcactgccg ggcgctggtt ggggaagggc cgggcgtgcg atgaggattt ttcccgggga 39901 ggggcgggag ggaaggggcg ggcgggtggg cgtcgggccc cgccctggag gacccgcgcg 39961 gggcgggagg ccgaggaggc agcgccgggt ccgcgggcnt cggttcgccg tgggctgttt 40021 cctctgggcg agaggccggg gcggccccgg gagatgcgcc gccgcgccca ggatcccccc 40081 aggaacccta acccgggtgc ggcccggagc ggggtggcgg gcgaggcggg aggcgcggga 40141 ggagctgtcc gccgtcgggt gctgatcgga gctgtcccca cggggtgctg accggggctg 40201 cacggggacg tctgcacgag ggtcgccgag cagggctgga ttcagaatcg agcacaaggt 40261 actcagagga tgctggctgg ggctaaaagt ggactttgag cactcagtag agactgcaca 40321 gccggacgga cgtaccgacg ggcactggga gaagttagca cccacggagg cgacatgcgt 40381 cccactggaa acactttttc ttgggcaccg tgttgctggg ccctgtgtag ggaactgacc 40441 cgctctggct tgggatggag ccctgtgggg agggccatgg ttgaagctga gctcgagacg 40501 gtggctgtgg gaggaggcgg gagatcaggg ttgagggcgc agaactaggg agagactgca 40561 ggacactttt tttctttcta aaatagagac cggtggctgg gcgcggtggc tcacgcctgc 40621 aatcccacca ctttgagagg ccaaggcggg tggatcacga cgtcaagaga tcgagaccag 40681 agaccatcct ggccaacatg gtgaaacccc gtctttacta aaaatacaaa aaaattagcc 40741 gggcgtggtg gcgggcgcct gtagtcccgg cgacttggga ggctgagcca ggagaatcat 40801 ttgaacccgg gaagcggagg ttgcagtgag acgagattgc gccactgcac tccagcctgg 40861 caagagcaag actccgtctc aaaaaaaaaa aaagagagag agagagaccg gctctctctg 40921 tattgcccag gttggtctca aactcctggg ctcagctatc ctctggcctc agcctcccaa 40981 agtgctggga ttacaggtgt gaaccacagc acctggctaa atattggcta aagaaggagg 41041 ctggcatttc aacaaaagcg caatcccttt tcccccttga tgaaagttat gtgccctaag 41101 atttctttat gtcttagttc tgcggaaaat tctttttttt tttttttttt tttttttttt 41161 tttttttttt gagacagaac ttgctctgtt gcccaggctg gagtgtagtg gcacgatctc 41221 agctcactgt aacctccacc tcccgggttc aaccgattct cgtgcctcag cctcccgagt 41281 agctgggatt acaggcaatc accaccatgt ccggctaatt tttgtgtttt tagtagagat 41341 gggtttttgc catgttggcc aggctggtcc caaactcctg acctcaggtg atctgccctc 41401 cttggcctcc caaagtgctg ggattacagg cgtgagccac cgcccctggc ctggaaaatt 41461 attccttagg aaaggacaag gcacttgagt gacaaacacc agaaacgaca tggcagaagg 41521 agctcatgca agtcgggaag gggggtatca ccattccagc tcagagagag gcctgtggcc 41581 agggcctgat tgaacctgca ctctttggtg tcccatcaca atttatattt tatgccaatt 41641 acagtatttt tctaagaatg gcatttactt ggctgggcac agtgacttac acctgtaatc 41701 ccagtacttt gggaggccaa ggtgggtgcg tcacttgagc tcaagagttt gaaaccagcc 41761 tggaaacatg tctctaccaa aaataaaaaa aattggccga gcccaggagg caaaggttgc 41821 agtgagctga gatcacacca ctgcatccaa cctggggggc agagtgagat cccatctcaa 41881 aaaaaaaaaa aaagccattt actagtttat gcttgccttc ttgacttggg ttggctggag 41941 agaagagagg ctgtcatctc tctgtgaagc cctaggctcc aagcatggtg cctggcaaac 42001 agtaggtgct tagcagacat gtgaatgaat gaatgagaga gcttatggtc tctaacccag 42061 tctggggcat caggtctttg ccataacagt cttttccctt ttagaagcac taagaacatt 42121 ccccatttct gagggagtct gggcctatta accactccct gcctttccca aaaaatccag 42181 atcccttccc agatttcaca gagagtttga cctcaacttt ccagattggc ttttgatcct 42241 gagtcttttc accaaactgg ttcatctgga catccacata aatgacccat ccaataccct 42301 ggtcagtcag ttccttgatt tctgatctgt caacagtctt taatactcaa gtgtagtgac 42361 ctctcccaca aagtgcctcc aggatctacc ttcatctgga attgcagcaa gcttaaaatc 42421 tttttttttt tttttttttt tgagacaggg tctcactctg tcacccaggc tggagtgcag 42481 tggtgtgtcc ctagctcact gtagcctcaa actcctgggc tcaaggaatc cccctgcctc 42541 agccttctga gtagctagga ccacaggtgc acactaccat acctggcatt ttgtttgttt 42601 ttgtaaaaac agggtctcac tatgttaccc aggttggtct taaactcctg ggcccaagcc 42661 caggtgatgg cctcccagag tgctgagatg acaggcatga gccacagcat ccaatctgaa 42721 gtcttatagt ctcaaatttc atcttagatc ctaatcccca aaaccttcct caagcctctc 42781 cccttccttc cctccctctt aaaagatggt cactcccagg tcccatgagc gtaaagacca 42841 tgagatgggg gataccgccc gcaattttat ctgtggacct cccatcttgc ccccagagca 42901 cagatggaag atcgtcggcc ttaaagtctg aaatacggga gccactcttc attttgcacc 42961 aacttgttga acgtgggcaa attactcagt tcctcgagat cttggtttct tcctctgagc 43021 aatggtgctg cgaggaccac atgtcgtggt gtaagcccca gacacaggcc ctgttgtttg 43081 ttggtgatca tttaaggctc acctttcccc tgctcctcac accggggaca tgtcctccca 43141 gtcccccagg gctgctctgt gcagctcagg gatacattag tgaaaaggca gcaactcttt 43201 cctgaggggg tttatgctcc ttacatcatg cacattttat caaatttgca ccttccttgt 43261 gtgctcctca gagcatcatt gtatggatga acaaatgggg gtccacaggg agctgaacat 43321 ccctccttga agttcctcct ccagcctctg ggccacaggc actgggtttt ccttcctgac 43381 caccacccgt tgtcagcgtc cttccagact ccttctcctt caatgacaaa tcaagatgtt 43441 gtgacgcgac cattttaacg ctttcttccc tctatggtgc acgaatggca tagacattac 43501 aaagcacaaa attgattgga cttagaaggg agagttggaa tcgcaacagc ttccacagtc 43561 ctgaccacag caattctaaa aagtcagaga acattttctg acaactttct gagagattag 43621 aaaacaaatg caaactatca gtatttattg tactcatatt tcacagaaat cacaaacaca 43681 aaataaatat gccaatggtt aaaactgcaa acagacacat ttaataaaga tgaaaattaa 43741 gagttctacc ttcaccatgg tgaagtccta acgccaccat ttcccgaatc ttcattttta 43801 tgcagccagg tctctgatca ctacctccta cttttcttgg tctcttgctg cattactaca 43861 atacccactt tttttttttt ttgagacagg gtcttgctct gtcacccagg caggaatgta 43921 gtggtgcaat ctcagctcac tgtaacctct gcctcctagg ttcaagcaat tctcccactt 43981 cattcctcag agtagttgga ctatagatgt gtgccaccac acctggctaa tgtttttatt 44041 ttttgtagag acagggtttc atcatgttgc tcagcctggt ctcaaactcc tgagctcaag 44101 caatccactc acctcagcct cccaaagtgc tgggattaca ggcgtgagcc actgtgtcaa 44161 gccttttttt ttttttttaa ctccataaga aatgtgcttc aattactatt actgtgtaac 44221 aaattatcct caaatttagt ggattaaaac aattaagttg ttatttacaa ttgtttgttt 44281 tgagggtgga aattcaggaa ggactccaat gggaagttct tgcctgggtc tcttttgagg 44341 tttggttgca gttggaggta agcctggctc gaggtctgag atggttcatt cagaaggttg 44401 gcagtggatg ctgagtgtga gctgggggct ctgctgcggc tgtcaccagc atatccactg 44461 ggctctcctg catcgtggtc tcagggtagt ttcttttctt ttcttttttt ttttggagac 44521 aagagtctcg ctctgtcacc caggttggag tgcagtggtg tgatcttggc tcactgcaac 44581 ctccacctcc tgggttcaag tgattctcct acctcagcct cccgagtagc tgggattata 44641 ggtgcctgcc agcacgccca gctcattttt tttttgtatt tttagtagag gcgggtcttc 44701 gccatgttgg ccaggctggt ctggaactcc tgacctcagg tgatccgccc acctcggcct 44761 cccaaaatgc taggactaca ggcgtgagcc accgcaccca gccaagggta gagtttctac 44821 atgatggctg gttcccatca gagtaaggat ccttggaggg catgcccttt tctggtctaa 44881 ccttgtaagt cctacagcat cactgctgct gtgactattg gccaaagcag ttgcaagcct 44941 acccagcttc aagagggggg tcacagaccc tacccctcca tgggaagagt gtcaaagaac 45001 ttaggagctg tgtcataaaa ccactgcaaa tctgcagtcc atgtgtcatg aaaccaccac 45061 aaacctccag tccatgtgtc ataaaaccac tgcaaatctc cagtccacgt gtcatgaaac 45121 caccacaaac ctccagtcca ctgaacctcg ctctttcctc aatgtccact caagtttctt 45181 acctgacatt aaccctctca atcgtcaact tgtccttctc ttctggatca tttccatcag 45241 caaacaagca taacctatgt taaagaaaat ggtaattctc acctgatggt acaggacatc 45301 caaaatcatg gcatttacaa cattgttatg ccttaatgaa agtgacagta ggccgggcac 45361 ggtggctcac ccctgtaatc ccagcacttt gggaggccga ggcaggcgga tcacaaggtc 45421 aggagatcaa gcacaatggc taacacggtg aaaccccatc tctactaaaa atacaaaaaa 45481 ttagccaggc gtggcggcag gcgcctgtag tcccagctac tcgggaggct gaggcaggag 45541 aatggcatga atctgggagg cggagcttgc agtgagccga gatcacgcca ctgcactcca 45601 gcctgggcga cagagcaaga ctctgtctca aaaaaaaaaa aaaatccttc tggtttaaga 45661 aagaaatctc acacataatt atacctttca gaaaacagat ttctatcagt gcagaagatg 45721 attataacat gcatcagttg aagtttttaa atagattgct actatcagga attttttttt 45781 tttttttgag acacgatttt gctctgtcgt ccaggctgga gtgcagtggt gcaatcatag 45841 ttcactgcaa caaccacctc ccaggcttaa gccattctcc ctgcttcagc cttccaagta 45901 gctgggacta caggtgtgta ccaccacacc caactcattt ttgtatttta gtagagatgg 45961 ggtctcacca tgttggccag gctggtctcg aactcctgaa ctcaagtgat ccgcctgcct 46021 cggcctccca gagtgctgag attacaagca tgagccacca cgccttgcca ggaaattttt 46081 taaatctctg aaatctgaat gtctcactag actccttcat agaacgtaca tgcccaaaga 46141 cacttttata ataactgaaa aaacaacagg attatacact ttcatgaaag tctttgcact 46201 gcagcagagt catagcccca aggaggacag cactgcagcc tcgacaaatc tctctctgac 46261 atacaaattt tggagagttt tccatttact gcgatgagca caaagggaac aggcccagca 46321 aagcatgcca ggagctatca agctgcagaa tgtctggtga gcgcttccca gatttcccca 46381 gttcctccca aagatgcact aaagttggtg atgacagggc tctgcctgac agggagactc 46441 cctgataggt tgctggagac cctgaacaga gatttaccaa gccttttttg gcggggggac 46501 ggagtctcgc tctgttgccc agaccggagt gcagtggcgc gatctcggct cactgcaagc 46561 tccgcctccc gggttcacac cattctcctg cctcgtcctt ccaagtagct gggactacag 46621 gcgcccgcca ccgcgcctgg ctaatttttt gtatttttta gtagagatgg ggtttcaccg 46681 tgttagccag gatggtctcg atctcctgac cttgtgatcc gcccacctcg gcctcccaaa 46741 gtgctgggat tacaggagtg agccaccgcg cccggccggc tatcaggcct cttacaccca 46801 caaacttgtc atcttggaac tcccagcata attacctaat ttgattaaca aaaactgtct 46861 tttttttttt tttttttttt tttgagatag ggtctcactc tgtcacccag gctgaagtgc 46921 agggatgtga tcttggctca ctgcaacctc cgcctcctaa gctcaagcaa acctctcacc 46981 tcagtctccc aagtagctag gactacaggc acccgccacc atgcctggct aatttttggt 47041 tttttttttt tttttttttt tttttgtaga gacaaggttt caccatgttt tccaggctgg 47101 tctccaactc ctgggctcaa gtgatccacc cgcttcggcc tcccaatgtg ctgggattac 47161 aggcgtgagc caccatgtgc agcctcaaaa gctgtcttca tcctctggat ctaatagcta 47221 agttctaagt acaatcttgc tacacgggcc gcaggataag agaccgggca gctggcagga 47281 tggcacttct tccctcccct gcctcacttt ctcacccacc caccttattt cctgcagctt 47341 caaacaaact ccttgcatgt gaatcctcgt tttagctact tcttctgaag gaatccaatc 47401 taaggcatcc tgctaattgt gacactcaca cttaaattat caaatcgtaa agttaatcaa 47461 tagttttacc cttttcctga acaatacaag gacccgagaa gtatttaact ctgtttactc 47521 cagtcccaac ttaaatttta ttattatgca ttttaatatt ctttattttt cctttttttt 47581 ttttctggtt ttttagacag agtctcactg tcacgcaggc tggagtgcag tggcatgatc 47641 ttggctcact gcagcctcga cctcccgggc tcaagcaatc ctcctgcctc agcctcccaa 47701 agcactggga ttataggtgt gagccactgt gcctggctat tatttttctt ttcaagaccc 47761 ataggaattg ttattgttgc tttaagccat tgatgtttac tagatttacc caaatgttac 47821 ctctttctat atccttcatt ccttcttcct tgtcagatca ccctgtgata ccaattcctt 47881 acacctttta aatgtttctt taatgagaat cttccgtagt aaactcagtt tgtccgaaaa 47941 tgtcttcatt ttgtcctggt tcctgaaaga tatttttaca ggatacagaa ttctgggtca 48001 ttagtttcca ttatcacctc gaaggtctcg tccattgtcc tcctctggct ttccttgtcg 48061 ctagaaaaca tcatctctca gtcactccct tgaaggtcat tttccatgca tctccagctg 48121 catttaggac gcttttgtct ttggtgttct gaagtttcaa tgtggtctat ctaggtgtgg 48181 gtttattttg tatttttttc tgcatgaggt tctcaattat ataaatttat atgaatttct 48241 caattatgtg actttcatta gtttagaaat ttcatagcca ttatctcttc aaatatttcc 48301 tctgctccat tccatctctt ctccttctgg cactctaatt aggctcatgt tggattttcc 48361 tattctagcc tttatgtctc ttaacccttt ttccaggttt tccaaatcta tgtctttcca 48421 ggcagcattc tgtatactgt attcatattt atcttttagc tcatgaagtc tcgctttggt 48481 tatttataat cttctgttaa acatgcatac taagttttta atttagttga aattaaaatt 48541 caactaaaat ttgaactaaa ttaaaacttt ccttttagaa cttccatttg gttcttttca 48601 aactgcttga ctattcttta tagttttgtg ggttttgcac acattttctt tctttttctt 48661 ttctttagag acagggtctc actgtgttgc acaggctgga gtgcagcggc atgatcacgg 48721 ctcactgtag cctcaacctc ccatgctcac gtgattctct cacctcccaa gcacctggga 48781 ctacaggtgc ataccaccgt ttctggctaa ttttttattt ttttgtagag acagggtctc 48841 actatttgcc taggctggtc ttgaatgcct gggctcaaga attcttcagc cttggcttcc 48901 caaagtgctg ggattatagg catgagccac catgcccagc cttaagacat atgcagatgt 48961 aaaacataac aacaccaata cagaggatgg gaggagaaat gaaaacatac tgctgtaagg 49021 aggctcttac attatatgta gatgccataa catcatgtga gggtagacta tgacaagtta 49081 aaaatgtata ttataaatcc tagagaaaac attttaaaaa ctacaacaag acacaattaa 49141 taagctaaga gtggtaataa aacagaatac gaaaacatac ttgatcaatc catcagaaag 49201 cagaaaagaa gtggaaatgg aacaaaggta agatgggaca aataaatagc aagatggtag 49261 atttaaaccc aataatatca atgattacat taaatagaaa tgatctaaac tcagattgaa 49321 aggcagagat tatcagaatg tatttaaaaa gtgagacaca aactataagc tacctcaaga 49381 aatgaaatgt taaatataaa caccaagata ggctaaaagt aatagactgg aaagaaagaa 49441 aaataaagga gccaggtgtg gtggctcatg cctgtaatcc cagcagggaa gccagggtgg 49501 gaggatagca tgagcccaag agttcgagac cagccagggc aacacagtgg gaccccatct 49561 acacacacac acacaaacac acacgctagc caggtggtgg cacacacctg tagtcccagc 49621 tactcaggag gctgaaggat cgcttgagcc caggaggtca aggctgcagt aagctgtgat 49681 agtgtcactg gactccagcc tgggcaacag agtgagactc tgtcagaaag aaagaaaaga 49741 aagaaaaaga gagagaggaa gaggaagagg aagagggagg gacaaagaaa gaaagaaaga 49801 aggaaaagaa agaaagaaag aaagaaagaa agaaagaaag aaagaaagaa agaaggaaaa 49861 gaaagaaaga aagaaagaag gaaaagaaag aaagaaagaa agaaggaaaa gaaagaaaag 49921 aaagaaaggg aaaagtgaat tagaaggaaa gatcagaatc aatggaataa aaaatatata 49981 cacagttgag ggtaatcaat gaaaccaaaa gctggttctt taaaaaactc agtaagattg 50041 ataaacctct agctagacta accaagaaaa aagaagacat aaataacctt atcaggaatg 50101 aaagcaaaaa tatcattata gattcagcag atattaaaag aattataaaa ctatgaacaa 50161 ctttatgcca ataaacttga aaatgtaaat taaatgaaca aatttcttga aagacacaaa 50221 ttaccaaagc tggaacaaaa agaaaaagaa aaatctgaat ggcctcataa ttactaaaga 50281 cattgaatta ccaattaaaa acctttccac aaagaaaact ccagatccag attgccttac 50341 tggtgaattc tagcaaacat ttaaggaaga aataatacta atcctacata aacatttcag 50401 gaaacaggag gcagaaacct tcccaactca tgttatgaag ctagaatgtc actactgaaa 50461 atgtggagaa atcgtatcac caaagaaact acaatctagt ttgtcaaacc ctctgacaat 50521 tcagctccta aagtaatccc tacaaaagcc aggaggcatt tgatcccata gaagaaacta 50581 cgcagcctcc aagaaggttc ctctccagtg tcctcctaga tgtggacacg aagtctaaca 50641 gacaaggaga aacattccag aaaactgagc cggcgctgcc aggaaagggg acttcactat 50701 ttatgctcaa taggatttgg caagtgccac agactagaga tcgggtctaa atatggggtc 50761 tttgtagtat ttacccaatt cttactcttt tgttttgttt tgttttgttt gagacggagt 50821 ttcactcttt ttgcccaggc tggagtgcaa tggcgcgatc tcagctcact gaaacctccg 50881 cccgctgggt tcaagcgatt ctcctgcctc agcctcccga gtagctggga ttacaggcat 50941 gcgccaccac gctggctaat tttgtatttt tagtagagac gaggtttctc catgtgggcc 51001 aggctggtgt ctaactccca acctcaggtg atccgcccac ctcagcctcc caaagtgcag 51061 ggattatagg catgagccac tgcgcccggc ccttaatctt tctttgtatg tttgggacac 51121 ataaaatatg ttttagttta tacgacattg gattaaacta gtgtacactg agtttgttgg 51181 aggatgaatg acaaaccttt tataagtcac tggaccacaa ggatctacct ctggacctga 51241 aggagcaaat tgcatataag tcagagaacc taaattttga gctggatgct gcaactagtt 51301 agaactttta cagtggtctc ccttaaggat aaggtgagtg agttttgtat acaggaagta 51361 gagagggcat ggccaaaggg gcagatggtg ccctaatatc aattttccag ttttcccata 51421 attttagaat ccctagccag cacctacacc tggcctttat ccattcacct gccgacgcac 51481 actcaggttg cttcctaact tggctattgt gaataatgct gcaatgaata agctggtact 51541 gatatcctcc aagatattga tttcatttcc tctggatata catccagaag tgggattgct 51601 ggattgcatg gtagttctgt tttaattttt tactgtattc cataatggtt gtaccaatct 51661 accttttcac caacagtata caagggttct cttttctcca catcctcgcc aatatttgtt 51721 atctcttgcc tttgtttttt ttttttatat agtagccaaa ctaagaggtg tgaggtggat 51781 cgcgttgttg tcatatctcg ttgtggtgtt tgcatctccc tgattagtga tgttgggccc 51841 ctttcatata cctgttggtc atttgcatgt cttctttgga gaaatgtcta ttcaggtcct 51901 ttacacattt taaattgggt tatttggtgt tttgctattg agttttgtaa gttccttata 51961 cattttggat attaactcct tatcagatac atgatttgca aatactttct cccgttccat 52021 gggttgttga tcgtacccct ttactatctt gaaaagaaat ttataaccta tttttttaaa 52081 ttacacaatg gtctaatgaa gggatagaga gaggagagtg gcagccaggt catgtcacat 52141 acatcacaaa tccactccac attctcatct tcccatcaat gtaaaccacc aaggagtcca 52201 ggcttcattc agccacccag gcacggtcat aactactccc aggccttgag ccaaaatacg 52261 gaatcaaaga ttctctttga agtgcactca tgttgataaa tttgcactaa cctacaattc 52321 attcttggct ttgtatcctc acagtccatt cctgtatgta ttcccctcgc cttttttttt 52381 tttttttttt tttttttttg agatggagtt tagctctgtt gctcaggctg gagtgcagtg 52441 gcacaatctc ggcacaggct cagggggccg ccatggtgtc agagaaggcc tggagcagga 52501 agtgcaaaga cacacagcag agatgcagct ctgtgtccac tgcagttaaa ggtgggctga 52561 gggcatcagc caagacacca tctcctgccc ctctcagatc tagtcagcta caatattgag 52621 tctactctgt cacgattctt caaagtaaag gctagttaca actcctatga aagccttaat 52681 acaagaggct tagcagaaca aatacaatcc ccactgccac agcttctccc aagtaacaga 52741 tattcagcag ccacctcctc cttggcctca ccctcaccag cattccagct tgtctccatg 52801 gcttgcctgg tgacgctgcc cagaccttca ccctgaggca tctgagccct taggggctat 52861 ggctgctgaa aagacctttt cattgtcatt acttggcatg gaagcaccaa gacagaccta 52921 gatgaatcct ctaagtgcca gatcaatgcc tgcttgcccc gttacgcagc agccacgcta 52981 cctcctcacc gtttatcagg atcaagtatc ccctcctttg tggcctgctg gtccaccagt 53041 atgaggagct gagagtaacc tggtggtctt tgcagattca ggctcagcag aggtatcctt 53101 cctgcaacca gcaggagtgc cctcgtggaa gcgttcctac ctgggaaaca ggctctaatc 53161 cagcagatcc taaggctgtg gagacagaat gtacaaattc tacgtgtaag cactgggaat 53221 gatggtgaga agtagctaat cccaaccctt gggttcctgg acctgtgtct ttctgcttgg 53281 atccacatga agtatctact gggggcagag caccatatat tagactctga ttcagtgtat 53341 atatcccaac tcctcaggat actgtcctag aaccaggacc tcagctaagc ccttaagaag 53401 tcatcccatc tttccagtag cctagctgct tttacgtaac gcaggatatt agagaagcag 53461 tggatcccat ggtcatccac acactaccat acctctttcg ctataaaatg gctttcttgg 53521 cctgaggcga tgtgattcgt gatcccatgt tggtacacgg agcatgcctg aagctcctgg 53581 atagtgatgc tggcctggga ccctgtggat acagaagaca aacccttatc cagagcaggt 53641 gtcaactcca gcaaggatga agggctgtca ccaacttgcc actaagtaac tagttggggt 53701 gctctaggga tggtatctca gcaggggctc agcactgatc gctttcagca ggagtaatca 53761 tgagacaagc cttgagaaag aaggcctgtg ctggtggacg ctaacctgtg ctccttctgc 53821 atgacgccgc cactcacagt ccactgttca agcacagggg tagccagggg cacagctgac 53881 tgacatccca tccatcctgc tgttcaccaa ctcctctgca gtattaacac aaggcagaga 53941 gatgcacata ttgtgtggcc actcctgtag gtccatcttc ccctcttgct cttttcaggt 54001 ccctgaccag ccagcaggac gtcacaaccc agaagtccat atgcgtccct ccctcaggcc 54061 actactctgt ccactgcctc catacaaagt ggatatccac acgaactgct ggaagctcag 54121 ccccatggag gatttcctat tgtcgtcctt tagggtcacc ttgaatggga ctgtagtgta 54181 cctatggttc attttcagtt catatcaaca catcaaactg acctatatac gaaccagacc 54241 agaatgctta tctgctaagg gagaaatatc agtgcaaagc agatgtgggg ccacactgcc 54301 acctcttcct gtaaataact gttacccact agacacgctt gggtccaagc ccaaggcttc 54361 ctttttcatt gtaccatcaa ttgctactga gcctctgacc tactgacttg gaggatctga 54421 caacacctcg cttgggcaga ccctgtcgca tggtcgctga tgtcccacca gcaggtgctc 54481 ggtctctacc tgggtctggt agtgtgcctg atactgctct cacttggcga gtagttctct 54541 gctgcagaag gcatggtctt gtccaagaac tctaggacag tgtactgcaa ctcttccgtt 54601 ggagtttgcc agagatgcca tatagggtcc ttcctacccc aaataagccc cccggtaacc 54661 cattggatct atgcagcagg gaagcccatc tcccaatcca aatctttagc agagccctct 54721 ttttctctgg gccccactca aagctggcag ttttcccgtt acccaataaa ggggtcagag 54781 aggtatcctc tagcgcggtg catgctgact ccaaaacaca gaggtgcctc aatgctgtgc 54841 ctctttcttc cttgtaagga tgcaggatgc aataacttgt ccctggcttg ccctcaaacg 54901 ctgcacccct gagaatgtca ctgatatggc aggctccaga acttctgagg gcttctcgct 54961 catcctctga catgcacggc cttcttcacc atgtccaact agtatgacat catcaatgtt 55021 atatggaata tagatgataa gctacctaag gactattgtg acagagaaca ggagagttca 55081 gacagcaaga agcaagacca tttacagagc tgtccttccc atataaatgt gaactgcttt 55141 taatcctctt taccactggc aattgaaaag aatgttcacc aagtcaataa ttgcattatc 55201 aagagccaga ggtgcattta tcggttccag taaagatacc acacctagca cagctgtgtg 55261 atcggggaca cgacttggct aagtttgtgc agggcatcat catttacttg aatccaccaa 55321 gatttagcag gagccattca ggttaattga agggatgaaa cggtaaccac tgctttataa 55381 gtctttgagg gcgaaatacc tctgcaatta cacctggaat gcagcattgt ttctgactta 55441 ctatcttgga aaggtttcag tagggtgagg agtttcaaag gtttctactt tgcgcttcct 55501 tacatatggt accaaaggcc tgggatcaat gtatacgctc cgccctctgc taggtaaaaa 55561 catcccaagt atgcattgag accagagaag taaccacagg actcgggttg ggaattctgc 55621 ttatcatctg atttccataa gcctgcaccc taatgagagg gccatgatgc ttcagggctc 55681 ctggaaacag tgtcagctca gatcctgtat cagcccttca aagacctaga cagtcccttt 55741 tccgaagtaa actagttttt ttggtgtgtt tgttgttgct gttcagacag tcttgctgtg 55801 ttgctcaggc tggagtgcag tggcgtgaac acaggtcact gcagccttga cctcctgggc 55861 ttaagtgatt ctctcacctc agcatccaga gtagctggaa ccacaggcac ctgccaccac 55921 gcccagctaa tcttttaatt tttgtaaaga caggttcttg ctatgttgcc caggctggtc 55981 tcaaactcct gggctcaaac gatcctccta cttcggcttc ccaaagtgca gggattaaaa 56041 gcgtgaccca ctgcccccag ccagaatcag ttttcttttt tttttttttt tttttgagac 56101 tgagtctgac tctgtcgctc aggctggagt gcagtggctc gcggcaacaa ggcctgggtt 56161 caagggattg tcacgcctca gcctcccgtg tagctgggat tacaggcaca agccaccaaa 56221 cccggctaat ttttgtattt ttagtagcga cggggtttcg ctatgttggc cagggtggtc 56281 tcaaactcct gacctcaagt gatctgcccg cctcggcctc ccaaaatgct gggagtacag 56341 gcgtgagcca ccacgcccgg accagttact ttttaaagtg gctctaggtc cctttaggga 56401 aggattggag gaatatttct tggtatatgc tcagatgctg gtattgtaag atccttcctg 56461 agaggaacca agcctccttg tcaattaatg ggctctaggt ctcaaaactg actcagattt 56521 aaacagaatg atcatgattt tccactgcag caaaagtgac attagccttg tgatcaccaa 56581 tttttgattt ctttctggtt atcaaacaat actctagtca gatgccccaa agaacaccat 56641 ggttcgacag acattgccac agatctctgt gggtccagcg ttcctggtcg ccgctctagc 56701 ctggggcaga agcagaaatg caggaggcac gcgtttaaat caggacgccc tcaatgggag 56761 gtgcgcaggc cgaggagggg gctgcggcaa gctgcgagtc ttcactttct tcggctcttg 56821 ggtttgactc tgtaattggt tctaggcctg gaggagacga ggaggatagg gggagagggg 56881 aatgaatgaa gacgcgcatc tcaagcgggg cagcaaaaga ctcctcattc agaggtgcgg 56941 ctctaccggc cagggagtcc ccgtgtagac tgggcattta tgcaactgga gtcatcggaa 57001 ctgtcccaag ggtcacattc cttcagcaaa tgtttactgc gtacccaccg ggtgcccgca 57061 tcccgcactc tgcccgcgct gtgcggccct gcctagcgcg cgcgcctcct ccaggcctca 57121 cccacgctca ggacggacgc gcgctggacg gctcttcctt gtcggagcgc cccaggggtc 57181 ggggaagagg gcccggcaag ggagccctcg cgccggagct gcagctgcag ccgccgcccc 57241 gccgccccgc cggctcccac ggggcagaga cgcagctcct ctccggtctt cccgtacgct 57301 accgcgcccg ggcagttcct cgcccgcgca cgcgccgctc cgccaactga ttggcctccg 57361 gcgcctcgga ttggcccagg ccgtccaaca gcagccccgc ccagagagag accattggtc 57421 cttgcccata ggggcggggc ccggcagaga tctgcggaat tcggccttcg gaaagagccc 57481 ccgggccggg gcacggagag agccgagcgc cgcagccgtg agccgaatag agccggagag 57541 acccgagtat gaccggagaa gcccaggccg gccggaagag gagccgagcg cggccggaag 57601 gaaccgagcc cgtccgaagg gagcggagcg cagcctggcc tggggcccgg tcgagcccgc 57661 gccatggcgg ccgaggcgac agctgtggcc ggaagcgggg ctgttgcggc tgcctggcca 57721 aagacggctt gcagcagtct aagtgcccgg acactacccc aaaacggcgg cgcgcctcgt 57781 cgctgtcgcg tgacgccgag cgccgagcct accaatggtg ccgggagtac ttgggcgggg 57841 cctggcgccg agtgcagccc gaggagctga gggtttaccc cgtgaggtgg gaggtcaggg 57901 gtcagcctct ccggtgcgcg gatcggggtc aggggtcagc cgcggggccc tcaggatgct 57961 ccatgttttc gcccccctct tgcgcccgcg cctggggcgg ggcggggccg gcctggccgg 58021 gagggggccg gggccgcggc aggtagggcc ggccgcgggc tgagcgcgcc tggtgtgggt 58081 ctgcagcgga ggcctcagca acctgctctt ccgctgctcg ctcccggacc acctgcccag 58141 cgttggcgag gagccccggg aggtgcttct gcggctgtac ggagccatct tgcaggtgag 58201 gggggtgtga gcgccgcagc accagtggct ttagggcctg tcgcttacgc gatgcgggta 58261 gtattgttcc cgttgcgcag ttgaggacac cgaggttcac ggtctgagta acacctcatt 58321 acaccgaagc ctgggcctgt attcccagag ctttgggagg ctgaggcgag aggatcactt 58381 gagcacagga gttcgagacc agcctggaca acatagtgag acccccatct ctaaataaaa 58441 atagaccaac gctaaagcct gtgctccaga gcctccaggc aattggatca gaagtcgcag 58501 ctctggtggg aggaaggcga gccctcatgt gtgtccctgt gccactttgc cttggcccct 58561 ttgctgtcca tcctttttca gggcgtggac tccctggtgc tagaaagcgt gatgttcgcc 58621 atacttgcgg agcggtcgct ggggccccag ctgtacggag tcttcccaga gggccggctg 58681 gaacagtaca tcccagtacg ggcccagtcc taccctctcc tccccaaggc acctccaccc 58741 cctaacccta ccccagtgcc caatgtctgc ctccacatcc ctcaccccaa gtagggaatt 58801 tcccccaaaa ccctgacttc ccctcattgc cccagccctt cccactcctg ccccagcccc 58861 atcaccaccc tgatagcttc ctgggtgcag agtcggccat tgaaaactca agagcttcga 58921 gagccagtgt tgtcagcagc cattgccacg aagatggcgc aatttcatgg catggagatg 58981 cctttcacca aggagcccca ctggctgttt gggaccatgg agcggtgagt caggagcctc 59041 ctcagggctc ctgtactcct gagctgaacc tccatgccag cggatgtgtc gggggctcta 59101 tccagccctc cacttgagtg gatctgatgt catgggctgt cacagaacag gcttttaaat 59161 gcttgccaga ggccacatgc agtgcctcat gcctgtaatc ccagaacttt gggagccgag 59221 gtgggtgagg tgggaggatc acttgagccc aggaggcgga ggttgcagtg agctgagctc 59281 atgccattgc acttcagcct gggtgagagc aagactctgt ctcaaaacac caaacaaaaa 59341 aagaaaaaag aaatgattgc caaggccagg cacggtggct catgcctgta atcctagcac 59401 ttcgggaggc tgaggtgagc agattgcctg agctcaggag ttcaaaacca gcctggacaa 59461 cacagtgaaa ccctgtctct attaaaatac aaaaaaatta gccaggcatg gcagcatgca 59521 cctgtagtcc cagctactcg ggaggctgag gcaggagaat tgcttgaacc cgggaggcag 59581 aggttgcagt gagccaagat cgcgccactg cactccagcc tgggtgacag agcaagactc 59641 catttcaaaa gaaaaaaaga aatgattgcc ggacaagtca cacagtgata ggagcagggg 59701 tgcagtagtc tggcctgagg ggtagatagg cgaggggtgg gaaagaatag cagagccagt 59761 gtatcctatt ctttgggctt caggtaccta aaacagatcc aggacctgcc cccaactggc 59821 ctccctgaga tgaacctgct ggagatgtac agcctgaagg atgagatggg caacctcagg 59881 tgagggcagg caggacaagg ctaatggtaa tggtgtccgc ccttccagta gttgctgagg 59941 gctggtgtca gggcctggcc tgttagggtg gctctgatcc tcctctagtc actcctgctc 60001 aggacccatg ctcccatacc cctgtaggaa gttactagag tctaccccat cgccagtcgt 60061 cttctgccac aatgacatcc aggaaggtag gagaaggcat ctgagtctcc taacccaaga 60121 tggaagagcc agagggctct ggagtgagca gaacctcacc ccattccccc agggaacatc 60181 ttgctgctct cagagccaga aaatgctgac agcctcatgc tggtggactt cgagtacagc 60241 agttataact ataggtgagg ctggaaagat ggcttcccat agatctgttc ccatagggct 60301 cttgaaaaca ggccagctgc ccagggcatt tggggactga atgtccacct tattctccca 60361 ggggctttga cattgggaac catttttgtg agtgggttta tgattatact cacgaggaat 60421 ggcctttcta caaagcaagg cccacagact accccactca agaacagcag gtatgtgggc 60481 cagaggctgg ggagcaggac ccatcctgtg aggaaggagg gaggtggagt ctggaaggaa 60541 tggccggaaa ggatgttacc tgggaaatac tccacagtct ccccaattcc tgactcttgg 60601 ccattgatcg tagttgcatt ttattcgtca ttacctggca gaggcaaaga aaggtgagac 60661 cctctcccaa gaggagcaga gaaaactgga agaagatttg ctggtagaag tcagtcggtg 60721 aggaaggagg ggcagggtgg ggtagggcag agcagaggaa agagggattg gggaagaggc 60781 agatttatca agctgcaggg aaggtggctg tggagagtgg agttaggaca gtgggggaga 60841 agttaaaact gtagggattg gccaacctgg gtgaggggat ccaagcagtg ctagacatgc 60901 tcttgaatgc ccctcctttt cctgccctcc ccccaggtat gctctggcat cccatttctt 60961 ctggggtctg tggtccatcc tccaggcatc catgtccacc atagaatttg gttacttggt 61021 aagtgaccct ggggatggga atgctagctg gggggctggg gagcagcagc agccacactc 61081 ttccaggagg cctggggagt cccgggtggc tgtgggcagc ctgaggtgga tgtagaatgc 61141 tggtcccacg tcttctcacc actgtgtggg gtgggtttcc ttccctagga ctatgcccag 61201 tctcggttcc agttctactt ccagcagaag gggcagctga ccagtgtcca ctcctcatcc 61261 tgactccacc ctcccactcc ttggatttct cctggagcct ccagggcagg accttggagg 61321 gaggaacaac gagcagaagg ccctggcgac tgggctgagc ccccaagtga aactgaggtt 61381 caggagaccg gcctgttcct gagtttgagt aggtccccat ggctggcagg ccagagcccc 61441 gtgctgtgta tgtaacacaa taaacaagct tcttcttccc accctgtcct ggccctgctg 61501 agcagcagca gaaagtacca aaccgagcag tacacacaaa gggactcttc agtgctctgg 61561 gattgaaagt ggttagcgtt catgctgcca gttggggtcc cccatccctc cccagtcccc 61621 tggctgcagc ttagaataat aaatactagg acttggggag gaggagagtg atgggggtat 61681 gaagacgacc ctgaggtggg gatgccgccc ggagcaccag cgatcccaga acaggcagca 61741 gctgacacat cggtgacctt ttccctacat ttggctattt ttagctctaa agccaccatc 61801 ctcacgagac tctggggccc cccaggctcc cagacctttg agcaaccttc accgcacaga 61861 aacccagccg cgccctgcaa ttcccaccgc ggaaggtggg tgggttctgg ttctcgcccc 61921 acgtttttcc ccgaccccga tttgggagta ggtgtcaggt tcctggtgag ggcggggcgg 61981 gggtggctag gcctgaagga cgtggggaca cgggccagag tggctggccc cacgcacgga 62041 caggagtgaa cccgagctgt gagtaggggc cggcacaggg cggcccgcgg gggtctgggc 62101 cctcagccct gcacaggggc gaggacggcg ctggggcccg cgcgctcggg tggggaaggg 62161 cgggctcccg aaccctgcct ctgtccaggc cggctccact tccaggggcg cctctctccc 62221 ctgcccgcgc cctcgctgac gccccccaac tccaggctgc ccttcgccgt ccttggggct 62281 cctggagctt tcaaggccca aatcccctgc accacagtgg ctgtgcccac ccggaaggct 62341 ggcgcgagga tttggcggcg gttggcctgg cggggggcgc gggccggggg cagccgctag 62401 tcgcggggtg gggggcgcga ggggtcgcgg actggctggg ggcgtctcgg cgcggctggc 62461 ggcggggccg gcctaagcgc gcccgcgcac ccatctgccc ccgtcctagg tgccgaccaa 62521 cccccaggat ggcggaagct caccaggccg tggccttcca gttcacggtg accccagacg 62581 gggtcgactt ccggctcagt cgggaggccc tgaaacacgt ctacctgtct gggatcaact 62641 cctggaagaa acgcctgatc cgcatcaagg tgcgcacagg tgcttctccc agagcgtagg 62701 cagaggccgg ctgtcagctg ttaagcgctt tgttagggtc cctcactgcc tccttggctg 62761 gcacttctgc ccggtacagg ttgtggaagt acagacacca gaggggtgca caggatgtgg 62821 tcggacacag ggagctgtgg gtgtggcgga ggaaggagca cagcagggca tcaggagaga 62881 aagccttcca ggccaagacc aggagccagt tcccaagact tcacaggcag gctaacctcc 62941 cgccttccgg ctccataagg gcgcctgttt ctgcccacag aatggcatcc tcaggggcgt 63001 gtaccctggc agccccacca gctggctggt cgtcatcatg gcaacagtgg gttcctcctt 63061 ctgcaacgtg gacatctcct tggggctggt cagttgcatc cagagatgcc tccctcaggg 63121 gtaaggagtg aaactggaag ggcacaggtg ccaccaggga gggctgggcc cagctcccaa 63181 ggctgaggtt cctgagctgg gcagatacag gacagcagcc attggcagtc acggggcagc 63241 cctcccctat gacaaccatt gtcttagccc tacatccgct catttgatgc agtcagacat 63301 gagtgtgccc agggaggttc ttccccttgg tgtctcccct gagacagttc acagccaccc 63361 gaggctggcc tcaagaggac cccctgcagc ccttgcccct ctccaatagg tgtggcccct 63421 accagacccc gcagacccgg gcacttctca gcatggccat cttctccacg ggcgtctggg 63481 tgacgggcat cttcttcttc cgccaaaccc tgaagctgct tctctgctac catgggtgga 63541 tgtttgagat gcatggcaag accagcaact tgaccaggat ctgggctgtg agcagcagcc 63601 agtggagggg ttcaggcacc tgggttgaga ctctttggac tcctttgggg ttctgagcta 63661 gacgggagag gcagacaggg cactggtgcc tggtgtgtgg tttgtcctgg aggggctggg 63721 atggctctga gggtctcagg gagttgctgg ttggtttcca ttttttccac tggctcccac 63781 cccagcactc tgctctgtac ccccagatgt gtatccgcct tctatccagc cggcacccta 63841 tgctctacag cttccagaca tctctgccca agcttcctgt gcccagggtg tcagccacaa 63901 ttcagcgggt gagggcctcg cttgggcatc ccagtgggca ggggaggttg gattcaggag 63961 atgtttccaa atataaggtt ctgtgcaaag agtggcctta agggcttgag aataatgggg 64021 ctgggtgagg agggagaggt gggaagagga ttaagataga ggcagccctt gccatctggc 64081 cccacggtga tgataactgg ctggacagta cctagagtct gtgcgcccct tgttggatga 64141 tgaggaatat taccgcatgg agttgctggc caaagaattc caggacaaga ctgcccccag 64201 gctgcagaaa tacctggtgc tcaagtcatg gtgggcaagt aactatgtaa gttcctgccc 64261 ctgggctcac tgtcacctgc catgtgtcct ggctgcaccc gccccagctc taaccttcca 64321 cctccccaca ggtgagtgac tggtgggaag agtacatcta ccttcgaggc aggagccctc 64381 tcatggtgaa cagcaactat tatgtcatgg tatgaactag agcccccagg tccccgcacg 64441 tgctcagctc tgtcccagct ccaaggcaag ggatctggag gacagcccag agctctagta 64501 gcagcttccg tgggcaagtg ggggttatgg agtgaggcct gagggaaagg gaagagagag 64561 aggagatcct agaagagtcc agaagcagct taggggcaat ggggatccta aggatgagga 64621 gagtggagac cgccagcctg ccaccgcttc tcagagtccc ggggtcactg cccctgccca 64681 gctcgggctc tgtcacctct ttccttggtt tcctcactgg cctcctggcc atgggttccc 64741 agctgtcctg acaccataac cagggaattg tctagaacgt gtcttgcttt gtgtccctct 64801 gcagagccgg acagcagaat ggaggccagg ctgctgcttt tagagctcag gaagtcagtc 64861 tgcctctgcc ccatgtaact ggcccttctg agtctctggg tctcccagcc acctgggctg 64921 tatggcatgc ctcttctctc ctttcccacg gtccaaagca cacttgactt gccagagcct 64981 actggaattc tcctccataa gctgtccttc caggaactta cacacaactt gcccatagca 65041 gaggttttaa actgcttttt agtggtagaa cccctttatc aaagcaaaag aagtagaata 65101 taagcacata aaatagctta taaaaaggca gctcgggccg ggtgcagtgg ctcacgcctg 65161 taatcccagc actttgggag gccgaggcag gtggatcaca aggtcaggag atcgagagca 65221 tcctaacacg gtgaaacccc gtctctacca aaaatacaaa aaattagccg ggcgtggtgg 65281 tgggcgtctg tagtcccagc tactcgggag gctgaggcag gcaaatggcg tgaacccggg 65341 aggtggagct tgaaatgagc cgagatcgca ccactgcact ccagcctggg tgacaagagc 65401 gagactctat ctcagaaaaa taaataaata aaaatttaaa aataaaaata aataaacaaa 65461 taaaaaggaa gctcagagcc aggcgtggtg gtgcatgact ctaatccctg caactcagga 65521 ggctgcagca ggaggatcac ttagggcgag taatttgagg ctgcagtgag ccatgcttgt 65581 tccgctgcac tccagcctgg gcgacagagc aagaccctaa ctctaaaaaa taaagaataa 65641 agcagctcag gctgaagctg gagcagaggg ttactgcgat cagttgtgca tttctggagg 65701 ttcaccccgg cagcatgtgc aggatgtact ggaagaggga gacagaacac ctcaggcccc 65761 cgaatagatt ggtccttggg tcagcaagac tcaggtgaca cccagacaga ggcccccacc 65821 ccgcgggctc tgttcctctg taggaccttg tgctcatcaa gaatacagac gtgcaggcag 65881 cccgcctggg aaacatcatc cacgccatga tcatgtatcg ccgtaaactg gaccgtgaag 65941 aaatcaagcc tgtgagttgc gtcagggttg aaggtgggat gggaggggag acctgagtct 66001 gagccatgct gggccttccc tcaggtgatg gcactgggca tagtgcctat gtgctcctac 66061 cagatggaga ggatgttcaa caccactcgg atcccgggca aggacacagg taactgagcc 66121 ccctcgctgc tacctgtggg ccatctggct ggctcgctgc cctcctgcct gctcatcacc 66181 aagcgtcccc agtgtctcag ggtcctgaaa ctgtgaacag tagtcaattg tgacagatac 66241 tagacatcca ttgtttacag gcatgatgct ggcgcaggga ccccacaggg cccagagcag 66301 agcccctccc tccggggctc actgcgcatg gtgaagggag tcctgctgca gcccaaagct 66361 taggacagac ccggcgctgc catggagccg gcagaggagg ggaggggcgc acccaggggt 66421 ctggcagagg cagcagcctt ccctgcttct gacactgtat ccttaggcgg tttgcacaac 66481 cctctagtgc gtcgtttcct cttctgtgaa atactttata ggattgttgg tgttgcgtga 66541 gagagagtgg aagcacccag cacagggcct ggtttaggac acacggatcc catccagggg 66601 gcaggaaggc ccaggcagag gccgagcaaa acagggtctg caggggtcac ctagtgcatg 66661 ggaggtgggc ccttcccagg atgtagctgg gggccccgcc tcagcttgcc cgtggcctgt 66721 atcacagatg tgctacagca cctctcagac agccggcacg tggctgtcta ccacaaggga 66781 cgcttcttca agctgtggct ctatgagggc gcccgtctgc tcaagcctca ggatctggag 66841 atgcagttcc agaggatcct ggacgacccc tccccacctc agcctgggga ggagaagctg 66901 gcagccctca ctgcaggagg aaggtattgg cctctaggaa gggactgtcc ccaccctgag 66961 ttcagggctc cgtgaggaga ggagcgtggc cctgcctgcc accctggaac tggaggctgg 67021 aggcacaact aggggagggg cattggtggt catggcagca ggacagccag cataacctac 67081 ctctgacggg tggcagccag gtgaagtgtg cagagggtgg ggacaccctc caaaatagct 67141 tggcaccccc cacctccagg cccagcctgg cacacacacc cccacctcca ggcgcagcct 67201 ggcacgcacc ccaacacctc caggcccagc ctggcacccc cacatctcca ggtccagcgt 67261 ggtacccccc atctccaggt ccagcctggc accccacccc catctccagg tccagccagg 67321 ccctcagagg caccctcatc ccaagtccac gtgcccactg cttaccctgc cccatgcttc 67381 agggtggagt gggcgcaggc acgccaggcc ttctttagct ctggaaagaa taaggctgcc 67441 ttggaggcca tcgagcgtgc cgctttcttc gtggccctgg atgaggaatc ctactcctat 67501 gaccccgaag atgaggccag cctcagcctc tatggcaagg ccctgctaca tggcaactgc 67561 tacaacaggt acggcagccc cagccccaca ggttacagct taaggttaaa agttagggtt 67621 atggttagag gattaaagat aaaagaaggt agggttatga gctgggtgca gtggcacaca 67681 cctgtgatcc tagcactttg ggggccaagg caggtggatc acttgagtgc aggagctcaa 67741 gaccagcctg ggcaacggag cgagacccct tcataaaagt agttaggatt gtgattagtg 67801 gttgggtagg gctagtggct agggttaaaa gctagggttt gggttataga atagggttaa 67861 aagccgggca tggtggcagg cacctgtaat cccagctagt cggaaggctg acgcaggaga 67921 agcccttgaa cccgggaggt tatgggaagc taagatcaca ccactgcact ccagcctggg 67981 caacagagca agattccatc tcaatttaaa aaaaaagtga gagaaaaaga gagagagaat 68041 agggttagta attagggtta aaggttgggg ttgcaggtca ggcacctctg gacattccca 68101 gctttggttc ttcatgtgtc tactcttcct gcaggtggtt tgacaaatcc ttcactctca 68161 tttccttcaa gaatggccag ttgggtctca atgcagagca tgcgtgggca gatgctccca 68221 tcattgggca cctctgggag gtaatagcct tgcagaggga acctgcaggg caggctgtag 68281 gggatgaggc cagcctctca gtctcatcct ctccctgcag tttgtcctgg gcacagacag 68341 cttccacctg ggctacacgg agaccgggca ctgcctgggc aaaccgaacc ctgcgctcgc 68401 acctcctaca cggctgcagt gggacattcc aaaacaggtg ggttggaagc tcccagagca 68461 ggtgtgagac cacaaagcag caggtgggta cagccccgac gaggcctgag cctcctcctc 68521 ccctgctggc ctcactgcct ggcccagccc tcgggaaggc acagggcacg tctcaggata 68581 cctgtagagt ccaaactggc ttcagggagg acagagacca cccaccgccc ctggggccat 68641 ctgtgtttag agacagccat gagatggagg aggcactcac aggcccctgg agcattttca 68701 gcacttccct cttacccaca aagctgagcc cggcctctgg gggctgattc tccctcagac 68761 tgtcttttgc gtaccctcct cctgaagatg tcttggccgg ctgtgccctt tctccaccag 68821 ctaaacgtca tgcctcctag acatgaccca gagtcctgct tggagagccc taccctcagc 68881 tgacccttcc catgtcttgg cagtgccagg cggtcatcga gagttcctac caggtggcca 68941 aggcgttggc agacgacgtg gagttgtact gcttccagtt cctgcccttt ggcaaaggcc 69001 tcatcaagaa gtgccggacc agccctgatg cctttgtgca gatcgcgctg cagctggctc 69061 acttccgggt aggagccccg cctcccgctg ctgagagggc agggtggtac cagggtccac 69121 ctgccagatt cacccctctg tatatcccag gacaggggta agttctgcct gacctatgag 69181 gcctcaatga ccagaatgtt ccgggaggga cggactgaga ctgtgcgttc ctgtaccagc 69241 gagtccacag cctttgtgca ggccatgatg gaggggtccc acacagtaag tgtcctctgc 69301 ccatgtgggg gtcacagtcg tcgggtgagg tgccccctct gcctcctgtc tgcctggagg 69361 gccagggcta ctcttcaccc ctttacttct gccccgcaga aagcagacct gcgagatctc 69421 ttccagaagg ctgctaagaa gcaccagaat atgtaccgcc tggccatgac cggggcaggg 69481 atcgacaggc acctcttctg cctttacttg gtctccaagt acctaggagt cagctctcct 69541 ttccttgctg aggtcagcac cgttgttggg tgtgtccttt gtcccactgc cctcctacac 69601 gcagggcttg ggccatctca tgatggagca cggcctgttt tcctggcttg ctcccctgaa 69661 gctccctgga gtgctgggcc agctttcccg cccacacccc acctgggcct gtggtcctgg 69721 gactgagcag gagcaacatc ctctttgtgg tgttggtgtc cttgtgacag ggaacagaca 69781 aatatatgat ctgtcaggtg gtgacagcac tactgagact agtgaggcta tgtgggggag 69841 aggcagagca gagggagggc tgggcagctc acacaagcct tatgtggcat gatgcacagg 69901 ggaccacggg gtggtggcca gggagctcct gcaccctcag cagaaccgtg tctggctcag 69961 agtaggtcgc agacaccacg tgttggacag gtgctttggg tatgtggcct ctgaccagct 70021 gtggcctcca ttgcaggtgc tctcggaacc ctggcgtctc tccaccagcc agatccccca 70081 atcccagatc cgcatgttcg acccagagca gcaccccaat cacctgggcg ctggaggtgg 70141 ctttggccct gtgagtgctc ctgaaggggg tgggtgggca gcaccagggc cctgagggtt 70201 tcagtggcga gtgcaggccc tgaggtcaga cgagaggcag gagcaccttg cttagaggag 70261 agcataaccc cagacctgca tggaagcaga atgttagcta tggactcagg cagccaagca 70321 cctgggaccc acccatccca aatacctccc tgctgaagca tgaccgtggt ttccggggta 70381 ctgaagcatg actgtggggc tggtttccag ggtactgggc agtgcactgg gtgcttctga 70441 agtctactta tccaagagag tgcagctgtc tcccacagaa accttcacat gggctagctt 70501 gtcagaaggc tagagaaccc tctggagaga gacggagtga tctagagaat aagccccctg 70561 agggaaactg ggaggtccca gatcccttgg cggggtctgg gcagcattct ttggttttca 70621 cagtttcctg ggggtggtcc tcaggaaaaa tgcagtgtct agggctgggg cccgttcctg 70681 caactcttgt gggaagaaca agtacaatat cagcttggct ttcagatctt caaggtttga 70741 tttatgcctt cttcaccctt cttgttgccc tcaggtagca gatgatggct atggagtttc 70801 ctacatgatt gcaggcgaga acacgatctt cttccacatc tccagcaagt tctcaagctc 70861 agagacggtg agtctcctgc cacagctcag gcctgaggaa ggggtgccac ctggggctgc 70921 ccaggaacac aggtgtcttt ggctggggag gcatccttgc ttgtgggaac agaggggtgg 70981 gtacatatct gaaggtgcat ctgaactctt ggctcccaca gaacgcccag cgctttggaa 71041 accacatccg caaagccctg ctggacattg ctgatctttt ccaagttccc aaggcctaca 71101 gctgaaggtt ggagaaatgc cagctgccct ttcgtcccca cactgtggag gaagggacct 71161 gtggcagctc acaggcatga ggggtggccg tgcacaggtg cccaggctcc aaggacagct 71221 ccggcagcag gtcctcgctg ggcagatgct gctccctgag ggcccaggtg gtggaggtgg 71281 ggttggagca ggaagggaat tttgattttt ttttttcttg atagatacta ataaaaataa 71341 ggctgtgtaa ttttctctca gcccttaggt acctgtgttt tgtttgggaa ctcggaggcc 71401 ctccccctcc cccagctcag accacagagg tggcaagaga agggctgaag ctggaagact 71461 gttcatgagg gacttgtgtg acctgctttg aaatgtgtga ctctgctgag tgacgtaggc 71521 tctgagatag ctgtccacgc ccacgtgttt gcttggaata aatacttgcc tcagaacctt 71581 cacctgttcc ctggggccat ttctgtttgt ctgtctgctg gagagtgagc ccacctcctc 71641 ccatgcaggg gcatgtgtga ggcacccctc ttggctgagg cacaaccctg cacgggcccc 71701 gcagcttctt gtcagcatgg aaaggggctg caggcccagg cctgaagtct tgaggcgcca 71761 ggtggtcatg tctgcaagag ggcacgcagg atgacctgta gcaacagaca ctccttcact 71821 gaggttggct cggctgctaa aatcatgtta agagtaaata caaaataatt attctttgtt 71881 tttaaggcgg agtttcgctc ttgttgccca ggctggagtg cagtggcacg atctcgactc 71941 actacaacct ctgcctcctg ggttcaagcg attctcgtac ctcagctcct gagtagctgg 72001 gacaacaggc gcccgccatc acgcccggct aattttttgt atttttagta gaggcagggt 72061 tttaccatgt tggccaggct ggtctcgaat tcctgacctc aggtgatcca cctgcctcag 72121 cctcccaacg tgcagggatt ataggtgtga gccaccgcac ccagccataa aaaataattt 72181 ttatgaaaag tataagtagg ccctcaatta ctgtaattga agtgaattag attcagttct 72241 ggaccttcat ttaggaacca gtgtcttcat ctgtttttgt ttttttcttt tttttcccat 72301 gagatagggt ctcactatgt tgcccaggct ggtctcgaac tcctgagctt aagtgatcct 72361 cccacctcga ctttccaaag tgctaggatt tcaggcgtga gccaccgtgc ccggccagga 72421 accagtgtct gttgaacacc caccatagac ttggccctgt gctggggatt cagtggtaaa 72481 caaggtccta ttcctgccca tgtcgaattc actgctgggt gaggcaggtg tggtcagggg 72541 acagagcagc tggcaggaaa ccaggaggct tttgggataa tccacttacc aagtacaggc 72601 atgggttgct taatgaaggg gatacattct gagaagtgca ttgttaggcg attttgttgt 72661 gcaaagattc tagagtgcat ttacacaaag gtagatggtg cagcctacta cagacctggg 72721 ctatgctgct taccctgttg ctcctaggcc gcatgccttt acaacatgtt accatactga 72781 atactgtagg caactgtaat acaatgggaa gtctttgtgt atctaaactt gtctaaacat 72841 agaaaaggta tagtaaaaat gttatataaa agattaaaaa tggcaccctt atggccaggt 72901 gcagtggctc acgcctgtga tcccagcacc ttgggaggcc gaggcaggtg gatcacgagg 72961 tcagaagatc gagaccatcc tggctaacac ggtgaaacag tgtctctact aaacaaaata 73021 caaaaactag ccaggcatgg tggcgggcgc ccgtagtcct agctgcttgg gaggctgagg 73081 caggaggatg gcgtgaaccc aggaggcgga gcttgcactg agccaagatg gcaccactgc 73141 actccagcct gggcaacaga gcgagactcc gtctcaaaaa aaaaaaaaat ggtaccctta 73201 tataagacac ttataagaca ccatgggctg ggcgcagtgg ctcacgcttg taatctcagc 73261 actttgggag gccaaggcag gtggatcaca aggtcaggag atcaagacca cggtgaaacc 73321 ctgtctctac taaaaataca aaaaattagc caggtgtggt ggtgggtgct tgtagtccca 73381 gctactggga gaggctgagg caggagaatg gcgtgaacct gggaggtgga gcttgcagtg 73441 agccaagatc gcgccactgc actccagcct gggcgacaga gcaagactcc gtctcaaaaa 73501 aaaaaaaaga ggaaaagtaa gtggagagac aaaaggagag gtgggtggca atttcagaca 73561 gtggcaaaag caagccttcc tgattggggt cctttgacca caggcctgaa ggagacgagg 73621 agcaagtcat gtccctcatg tccctcatgt tccgtaccta agggcagagt attcctggct 73681 gaaggaacag caagagccaa agctttgcga gcggtgtctg ctgtgatgaa ggatgtggct 73741 agtgtggagt cagggatggg aagagcaagc ggggcagcta gatcggggta acaagggcct 73801 cgtaggccag attctggatt ttctgaatga cgggggaagc tgttgggctt tgaacagagg 73861 gtggcataat ctgtcctaat tttaagggat cactggctaa ccagggtggc gttagaacaa 73921 agtaggacgc agtaacagtg gaagcaagga gacccgctgg agaagtgctc ctggtggtgg 73981 tggagctgga agaaatgagc ggcttaaact aaggcagtgc cataggaaag tagaattaag 74041 gagcttggaa tgtcaaacgt aggactgaca agtctaatca ggaggaatca aagataactc 74101 atattttcta tattgggtga ttgggtgatt aaattaggtc tttaaatgtt gaatttgact 74161 tgcctataag aaatagctgt gaaaatgtct cagactcttg aaaatataaa tgaaaagcta 74221 aggagagatc taggctcctt ctaacagttt tgaagacagt taaaaccttg aaagtgcatg 74281 agaggattcc cagggcagag gctcctcacc tccttcaagt cttcatgagc tataagagta 74341 tgcctcttgg ggccgggcgc ggtggctcac acctgtaatc ccagcacttt gggaggccga 74401 ggcgggcaga tcacgaggtc aggagatcga gaccatcctg gctaatacag caaaaccctg 74461 cctctactaa aaatacaaaa attagccggg tgtggtggca cgtgcctgta gtcccagcta 74521 ctcaggaggc tgaggcagga gaatcgcttg aacccaggag atggaggttg cagtgagcta 74581 agatggtgcc actgtactcc agcctgggag accgagactc tgtctcaaaa aaaaaaaaaa 74641 aaaaagagta cacctctgcc tgcaaaatgt tacattccat atcagagcgt tcttcagctt 74701 ccaaaagtat ttccagagac ctctacctgt ttgggtgcca gtaagcctcc cctgggaagg 74761 tgcctgcgtt ttggtagtct tctggggaca tcagctcacc ccgtcctgag atagcagtaa 74821 acagagtttc caacaagcta gctttcaatt aacatcaaaa ttcagcatta aagttctttt 74881 atctcatttg ttctgggtta aaatagtgag atgaggaaca acagtggtaa accagggtat 74941 aaacctctgg agactgctgg gcatggtagc tcacgcctgt aatgcctgtg ctttgggagg 75001 tcttgagccc aggagttcga gaccagcctg ggtggtgcag tgagacccca tctctacaaa 75061 attaaaaaaa aaaaattagc caggcgtggt ggcacttgcc tgtggcccca gctactctgg 75121 aggctgagtt gggaggatta cttgagcctg agagattgag gctgcagtga gccatgattg 75181 acccactgca ttccagccta ggtgacagag caacaccctg tcttaaaaaa ataaataaac 75241 ctctgatgac aacccaaacc ccaattagtt ccccatgtcc cataggtctg caggtctgca 75301 ttaataaaag ctagcagcca taacataaat cctgaaacct catggctcaa cccaatagaa 75361 gctggactgt tggtcagata aaatccagtt ggctgagatg agggctctgc tccatactgt 75421 tattcaagga tctatggtga cagaggccct gccttcagca ggtagtactc aagctgcctt 75481 gggcatcaac atcataagta cgtggggaga gaaagagggg gccacctgtg ggaggctttg 75541 atggcctcaa cctcaaagtg atacacagca tttccctcca caccctctgg ctaaagactt 75601 gtatttcaag agaatctgag aagtagagtc tagctgtgtg tccagggcgt aaaggaaaca 75661 tttagtgaca gttggtctca gatacacaag ggcatgggaa gaccagcctg acatcttcct 75721 gggcttcatg gcctggagtt tgcagggtag cccacctcca gctcctgagc agggtactca 75781 agagcctgca cattccagct caactgatct ttcccgagtc ttctctgtcc cttaccaaga 75841 accctgagcc cagaacacac tcgccattcc ccaaaggagg caggctgagc cacaccttcg 75901 cctttgccct agatgggcct tctgcctgga atcctgtgcc ctttctccta ctctctcgga 75961 agctgtcttc cttagccatc agaattattc ctgattcggg ccaagcacag tggctcacac 76021 ctgtaatccc agcactttgg gagtccgagg cgggtggatc acctgaggac aggagttcga 76081 gaccagcctg gccaacatag cgaaacccgg tctctactaa aaatacaaaa actagctggg 76141 cgttgtggca tgtgcttgta atcccagcta ctagggaggc tgaggcagga gaatcacttg 76201 aacctgggag gcggaggttg cagtgagccg agatcgtgcc actgcactcc agcctgggca 76261 acagagcaag actccgtctc aaaaaaaaaa aaaaagaatt attccctatt catgtttctc 76321 tgggctgctg taacagaaat cggctctggg ctgtataagc aaatggaagt atgaagagca 76381 gctcacaggt ggaaagggat ggccagaaaa ccgggttcaa ccggggcagg aacctggcag 76441 ctggcggagg gagtgtggcc caggcagcag cctcctcaat aaacaagcga atgctcactc 76501 cttgcccaga actgtctggc tcaaccccag ccatttttcc tcctctgcaa tcaggctgct 76561 aagaatcaga atccctgaga gaaaattcag ggggtctaag gtttgtcagg attctcaaac 76621 acaaggtgtt caaaatcaaa gttaaccttc caagctagct tctcccactc ttccacatct 76681 ctgccagtgg tatcaccccg gccattagcc ccttgcttct ttctccctcc ctcccacatt 76741 accaagccct gtcaagtcga ccttggaatg gctttgaccc tgtccaccac tcagccaaac 76801 aacagcaacg gccggccccg tcgccacccg cctcggcagc cagagacctt ttaactctgc 76861 cactcgccta cttaccaccc ttctagggtg cccacgtccc tctgaaaaaa aaggctgagc 76921 tcagactggc ctctggtcag cctcctccca gccagagctt tctgtcactg cccttctagg 76981 tgttgagtgg ctctgcttcc agttcttgcc tggagaccag gagcgcccgg gaggccccag 77041 cactttccag cccacataga cctgcgtgca cctctgcctc gtctccctga gcccgggctg 77101 cctcctctgt aagatacatg ccacataatg tccgccacac atgatcgctg tgacagtgga 77161 aatagctggg cgttcagcga ctcattcgac aaatgtttac cagcaccttt ggtaccgggc 77221 gctgtcctgg acactgggga cacagcaggc aaacggggcc ctgcgtataa atccttaagc 77281 tcaagaagga ggtgggggca aaactgtcag gccgggaagc tcttcatctc cgcaggattc 77341 tccctcaacc tcaggacgtc cccagcaccg cacctgccag tcccagactc ggacgccgca 77401 gacctgtggg agcagcgcgg aatgccccgc cccttttccc gaggacaccc aatgagggcg 77461 agaggtcact tccggggagg gttcgctaac ggaaagggcg gggcatccct ctcgaggagc 77521 ggccttgttc ctcaagcggc cgctgggggc gccagagcag gaccggagcg cgggccaagc 77581 tggaggtgag cgaggccgcg cccccctggg ccccggccct cccaacgcgc cctgacggct 77641 gggctccgcc tgaccctgtc ctagcgttcg cggtggcacc aacctcaagc cagacccttg 77701 acttcgaccc aactccactc acccgactcc actcactcca caccccatgc ccctcccccg 77761 acgcccctca ccccgcctca cccttcaccc ctcatcgctc acgtcaccgc tcacccgcct 77821 cctcacccag cctcgcccct cacccctcac cccgcctcac ccctcacccc gcctnacccc 77881 tcacccctca tcccgcctca cccctcaccc ctcatcccgc ctcacccctc acccctcctc 77941 accccgcctc acccgnctcg cccctcaccc ctcacccctc ctcaccccgn ctcacccctc 78001 acccctcann nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnncc 78061 cctcaccccg cctcaccctt cacccctcac cccgcctcac ccttsacccc tcaccccgcc 78121 tcacccttca cccctcaccc cgcctcaccc ctccttaccc cgcctcaccc gcctcgcccc 78181 tcacccctca tcgctcacgt caccgttcac ccgcctcctc acccagcctc acccctcacc 78241 ccgcctcacc cctcaccccg cctcacccct cacccctcct caccccgcct caccccgcct 78301 cacccctccc ccgctcccct cactgcacac tccatacccc actctcccca ccttacatcc 78361 ctcacctcag tccttcaccc ccttaaccta atctccatgt cctcacacca tacactcctg 78421 atcctgaagt ctgcacaccc tgggcttctc attccgtgca gcctgcccct cttaccctca 78481 cacctgccac acctgggacc ccggagtcca cacaccacct cacaccccaa gttcccacac 78541 gcacctctgg cttcctccac ctgcacttca ggacccgccc agactcttca tagcccaact 78601 ccaagacttc tctcccacat cacaggtgca ttcccttcct gtgctgctct catcccgtac 78661 aaaagagcac atctctgcac caagcatccc tcttatcccc tcacattcct tgttctcaga 78721 gggatcccag actctgttag aaactcttcg taccaggcaa gacactcaga ccacaaacca 78781 gactctaggc tctgactcgc tcaacccccg tccttcacac ctaaagtgat acccaagcca 78841 gaccctacac tcccgcccca cgctcccgcc ccacactcct cccgccccac gctcctcccg 78901 ccccacactt ccgccaccca tcctaagctc cgttccattg tgctttggtg cgggctcagg 78961 cctctttctc cacccgttcc cagtcatccc aggcaacctc attaggcaag aaatgccgtg 79021 aattctaatc ctgtggctgc ttcctccaca tcagctctca caccctctcc ctggatctcc 79081 ttttcagatc catccctctt ctgagatcca gtgccactcc ggcccagcct ccatccttct 79141 agcagtctca tgctctatcc ccaaatctac ctgtggccac cccaacgccc cacttataga 79201 aactacatgt cttttctatg cctctgcttc accccaacca tcaaccaccc cttaagccac 79261 tctcatactt gtgctctcag tgtccttacc ttgtccatcc actgcatgta tgtttacgtt 79321 cccaggcctt ctacactcca tccactgtct ctgtgccgtc ttagggtcct gaggacactg 79381 tggccatgtg gcctcagtca catgcctagg tgcccctagg gaggcatgca gctttgataa 79441 agctttatcg ttctcctcaa accaagaatg tctgttgtta caagaagaaa cttttcattg 79501 ccatttatta gggagttaga gaccatcggg cctgaaatcc cccccttgcc atcctacctt 79561 gacctctacc taccttattc tatctctacc tccttacctt cccttacctt catagggtga 79621 aatgacccct tacttgtctg agaccagtcc ctgcaccatc agttgccctt tcttctgatt 79681 cttcaggctt ttctgtccag gtagctcttt ccacacagcc tagaaacgaa gaattcaaaa 79741 aacaaaaaac cttgactctg tatcaccttc tagttacctc ctgttttttt ctccccttcc 79801 catctaaatc tgtacacaca atccattttg tctaatgatc cattaattat ttaattcatg 79861 aaatattttt acttttatgg ctctcctgaa acgtctcacc gagttgacca gtgacaccca 79921 aatcaaatat cttgaattta ctgagccctt tttttcattt tgcagagggg atcttccaga 79981 tcttcctctg tagcctagtc tagagtgctg tggcacaatt gtcccacctc accctcctga 80041 gtaactagga ctggaggtgc gcaccaccac acccagctat ttttaaattt ttttgtagag 80101 actgttgccc gggctggtct gaaactcctg gcttcaagca tttctcctgc tttagcctcc 80161 caaagcactg ggattccagg catgagctac catgcctggg cctgccatgc ccaaatactg 80221 taatctttga cactggtgta atatctccca acttaaaata aaatacccct gtgactctac 80281 attacattcc aggtactggc atgtttctct gcttctcttt gaagtaaaac tcaccagttt 80341 ctacttctcc ttccctctct ttagaaacat ttcgattggg ctcttgttcc agccactcca 80401 tgaaaacagc tcctgtcaag attgccagtg tgacctccag attgtcaaat ccagtggtta 80461 ctttttattt atttttgttt ttgttttttt ggaacagagt gttgctctgt cacccaggct 80521 ggagtgcagt ggcatgatct cggctcactg caacctccac cttctgggtt caagggattc 80581 ttctgcctca gcctcctgag tagctgggac tacaggcgcc cgtcaccatg cccagctaat 80641 ttttgtattt ttagtagaga tggggttttg ccatattggc caggcgggtc tcgaactcct 80701 gaccttgtga tcccccgcct ctgccttcca aagtgctggg attacaggca tgaccgacta 80761 agcctggcgg ctacctccac ttctaacact ctggtccaag cctcatcatt ccttacctgg 80821 agttttgcag ctgcctccta aatgatgtcc ctgttccatg ctttccaccc cttagtctat 80881 tctcaacaca gaagccagag tgatcctttt aagtcattat tcaaatcaca tcacctttgt 80941 cagatttttc cagtgtttcc tcatctcatt cagaataaaa atcgaagttc tggccaggca 81001 tagtggctca cacctgtaat cccaggactt gggaggccga agcaggcgga tcacctgagg 81061 ttcaggagtt tgagaccagc ctggccaata tggcgaaacc ccgtctctac taaaaataca 81121 aaaattatct gggcatggtg gcgggcacct gtaatcccag ctacttggaa ggctgaggca 81181 ggagaattgc ttgaacctgg gaggcggagg ttgtggtgag ccgatatgac gccactgcac 81241 tccagcctgg gcaacagagt gagactccgt ctcaagaaaa aaaaaaaaag agtgtgtagt 81301 gagaagagaa ggggctaaaa ggaccttcct tgacccatta gagccagtaa aggagagagc 81361 caagaaagag cagggaagga actaccacag gtggacagag tttggagaga acaaggttaa 81421 aacatcaaga gaagagaatt ttgagcagga gaaaatggcc aaatgtcagt gctgcagaga 81481 tgttaaataa aagaaagaca gaaagtgccc tgggtgtgcc agttatgact ttggtgagag 81541 cattgtcagt gaatgtcaga gacagatgcc agatgacagt ggagtgagga ggacagacaa 81601 acgtgtgagg cctggctgct aaaggaagga gagggaatag ctataggaga gggttaagga 81661 aggtgttctg gacatttctt cctcccatca cccactatta gccccctgcc ctgccacctg 81721 cctgtacctc cccacttact cttagcaaac tattcctact taccaaaaag caggacggag 81781 tcctgcacgt ggaatggctc catgcccaca gaagtacctc ctttacagat cccgaaagac 81841 catacaccct gaccctgcac cctcattctc agatctgcct gtcaggcctc ctggagggcc 81901 ctttgctcct ccctcagaat ctctaccctt aacagggagg acggcaagag gggaaagata 81961 ataattttct ttcttttttt tttttgagat gaagtttcgc tcttgtcacc cagactggag 82021 tgcaatggca caatctcggc tcactgcaac ctccacctcc cgggttcaag cgattctcct 82081 gcctcagcct ccccagtaat cccagtagct gggattaggc acctgccgcc acacccggct 82141 aatttttata tttttagtag agactgggtt tcaccatgtt ggccaggctg gtcccaaact 82201 cctgacctca ggtgatccac ccacctcggc ctcccaaagt gttgggatta caggcatgag 82261 cctccgcgct tggccacgat catgattttc tttcagaatt agagctttac aatggtcaac 82321 tcctttattc agccaacatt tattaagcac ttgctgtttt cctagctctg agctgaacac 82381 taggaatagt tcattggaaa gatggcctct gctttccagg accccacagt ctagttgggg 82441 aggtagagga atccacaatc acaataccag ctaagaggca ccgtgatgag ggaggcccag 82501 gaagtcacag gagaactgag tggaggccct taactctgaa gtaaagacca ggaaggcttt 82561 ccccacatgt acagtgggaa caataccgca gcatctgtgg cccatccctc cctgttttat 82621 gcttacctgg ctcattcctt caccttcatg aggttacaag ttccctggca gctcttacat 82681 tatttatcta tttctctctt ttcttactag tgccttgtag ataaataggt gagcatcaaa 82741 tattaacaga atgaattaat gtatattaat aaaggtgagg agaggatctg gagtgatgta 82801 attactgttc aacattttaa ataacaacgg atatgattta ttaagtgctc attacggtag 82861 gtactgtgct gaacagtata taatttattt caattaatcc tcacaaaact ccacaagata 82921 taggtaatac taataattcc cgtttaaaaa atacaaaact tggctgggtg tggtggctca 82981 catctgtaat ctcagcactt tgggaggccg aggtggatag atcacctgag gtcaggagtt 83041 tgagaccagc ctggccaaca tggagaaacc ccatctctac taaaaataca aaattagccg 83101 ggcgtggtgg cacttgcctg taatcccagc tactcaggag gctgaggcag gagaatcact 83161 tgaacccagg aagcagaggt tgtggtgagc cgagatcgtg ccactgtgct ccagcctggg 83221 caacaagagc aaaacttcgt ctcaaaaaaa aaaaaaaaac tcaaaaaaat taccaaaaat 83281 ttgtcgagaa gatgtgtaag ttgccaaaag taacactgct gatatgaggt agcgccaaga 83341 tttgaaccca ggcagctcaa ttccagaacc aggacagtct ctatggggca ctgcctctaa 83401 tgcccaggag aattcttctg gaaagttaag ttgattataa ttttttttaa tgtatgaatc 83461 atttcatatt ttaaaattaa acgtacccgc ctccaggctc ccatcgcttg atgtctgtaa 83521 ttattacatt aactgcttcg acatctgtca catcctggtt tgtcatatgg tgtgtgtgac 83581 ttactcccat attgcactag aaattcctta ggtacaggaa ttgtatcttt gaaacacatc 83641 agcacaatag cttgcatctt agtgggtact caatatggtt ttgcaagaaa aaattccatg 83701 tcaccattat gacagacttt tttcattata atgaagatgt tttctgtgtt aatgactcca 83761 gccttcagct acgtttgtat ggcaaagatg gaagaaaggg gaaggacaac ttatatgtgc 83821 cgaatgctat gctggggcct ttagacatgt tcccatttaa ccttcacaac agccctgtgg 83881 agtacgtggt attgtctgaa tctgagagat ggggaaagaa ttccttagag aaatcactga 83941 gttataacca gtggagctgg atttgaaatg taggtctgtc tgactgcagc ttctttctgt 84001 gcattgccta gatggatgat gctgaccctg aggaaagaaa ctatgacaac atgctgaaaa 84061 tgctgtcaga tctgaataag gacttggaaa agctattaga agagatggag aaaatctcag 84121 gtaggatgtg tgaacccaaa agtatctgag acaggtctca atcaatttag aaagtttatt 84181 ttgccaaggt taaagatgca cctgtgacac agcctcagga ggtcctcagg acatatgttc 84241 aaggttggca gggtacagct tgcttttata cattttaggg agagagaata catcagtcag 84301 tacctgtaag atgtacactg gttcggccgg gcgcggtggc tcactcctgt aatcccagca 84361 ctttgggagg ccgagacggg tggatcacga ggtcaggaga tcgagaccat cctggctaac 84421 acagtaaaac ctcgtctcta ctaaaaatac aaaaaattag ctgggtgtgg tggcaggcac 84481 ctgtagtccc agctactcgg gaggctgagg caggagaatg gcatgaaccc gggaggtgga 84541 gcttgcagtg agccgagatg gtgccactgc actccagcct gggcgacaga gtgagactct 84601 gtctcaaaaa aaaaaaaaaa aaagatgtac attggttcga tctggaaagg tgggacaact 84661 caaaggggga tgattccagg ttatagatag atttaaattt tttctgattg gcaattggtt 84721 gaaagagtta tggtctaaag acctggaatc aatagaaagg aatgtctggg ttatgacggt 84781 aaggggttgt ggagaccaga gttttatcat gcagatgaag cctccgggta gcaggcttca 84841 gagggaatag attgtaaaaa gctttcttat caaacttaag gtctgtgttc atgttaatgc 84901 tagtcacctt ttcctgaatt ccaaaagaga ggcaggtata acgaggcata cccaatttct 84961 ccttctcctc acggcttgaa ccaactttgg ccaagcaata cacccacctt ggcctcccaa 85021 agtgttggga ttacaggtat gagccagcgt gcctggccaa atagctttta ttcactagag 85081 cagactttag aaaatttctc cttagaaatg tttgcaattt tttgcctatg tacttttgaa 85141 ctatgaatag ctgagacacg tatcagttaa tttagagagt ttattttgcc aagtttgggg 85201 acacttatct gtgacacagc ctcagggggt cctaatgaca tgtgcccaag gtggtcagag 85261 cacagcttgg ttttatttat ttatttattt attttaagac agagtctcgc tctgtcgccc 85321 aggctggagt acagtggcgt gatctcggct cactgcaagc tccgcctccc gggttcatgc 85381 cattctcctg cctcagcctc ccgagtagct gggactacag gcgcccgcca ccacgcctgg 85441 ctaatttttt gtatttttag tagagacggg gtttcactgt gttagccagg atggtcttga 85501 tctcctgacc tcgtgatccg cccgtctcgg cctcccaaag tgctgggatt acaggcgtaa 85561 accaccgcgc ctggccttgt atttttaata gagacagggt ttcaccatgt tggccaggat 85621 agtcttgatc tcctgacctt gtgatctgcc cgcctcagcc tcccaaagtc ctgggattac 85681 aggtgtgagc caccgcgcct ggcctttttt tttttttttt tttgagacgg agtcttgctc 85741 tgtcgtccag gcttgaatgc aatgatgtga tctcagctca ctgcaacctc cgcctctcgg 85801 gttcaagtga ttcccctgcc tcagcctcct gggtaggtgg gattacaggc gcctgccacc 85861 atgcccagct aatttttgta tttttagtag agacgaagtt tcaccatttt ggccaggctg 85921 gtctcaaact cctaacctca ggtgatccac ctgccttggc ctcctgaagt gctgggatta 85981 cacgcgtgag ccaccacgcc cagccagcaa acagttttat gtagtgaaaa atctatcaat 86041 attttccttt ctgatttatt gccattattt tgacacctag aaagtctttc ctcaatctga 86101 gatcagtgaa atatttagtt gtattttctt ctacttttct ccgtattcct ttttaacatt 86161 taatcctaac ccatctggca tgccaaatga cctacgctgc gtattgcttt tttttttttt 86221 ttgagacaga gtcgcgctct gttgcccagg ctggagtgca gtggcacaat ctcggctcac 86281 tgcaagctct gcctcccggg ttcacgccat tctcctgcct cagcctcccg agtagctggg 86341 actacaggtg cccgccacca tgcatggcta attttttttg tatttttagt agagatggag 86401 tttccccgtg ttagccagga tggtctcaat ctcttgacct cataatctgc ctgccttggc 86461 ctcccgaagt gctgggatta caggaatgag ccaccgcacc cagccgcata ttgcattttt 86521 atataaatct gctacaggct gggcacggtg gctcacacct gtaaacccag cactttggga 86581 agccgaggca ggaggatcac aaggtcagga gtttgagacc agcctgggca atatggtgaa 86641 actccatctc taataaaaat acaaaaatta gctgggcgtg ttggtggatg cctggaatcc 86701 cagctactca ggaggctgag gcaggagaat cacttgaacc caggagacgg aacttgcagt 86761 gagccgagat cataccattg cactgcagcc tgggtgacgg agcgagactc tctcttctgt 86821 gatagtccct gggcatagag ggagggtgtt tgtatcgttt tagcagcagg gcatttgcag 86881 tgtaacagat ccgggcccag tgcgatgcca aatgagggag attcatatct ttggtcatct 86941 gcagaatacc atgattttag tttccttgga agcaaaatga ggggagataa gtttgacaat 87001 tataagagta atttgtgcat tagaacagaa aaaggaacct attccattag ggcaccaacg 87061 aaaaatatga ggaaaagtta caatctggtc ctctctagag gattattgta gccaagaaac 87121 aatgatttaa tttgcactca aaaaaacgtt atggctgggc gcagtggcct gtaatcccag 87181 cactttggga ttttatttat ttatttattt attttgagac agggtcatac tctgtcaccc 87241 aaactggagt gcagtagtgc aatcttggct cactgcaacc tccacctccc aggctcaagg 87301 gatccttttt ttcatctcag cctcctgagt agttgcaact acaggtgcac accaccatgc 87361 ccagctaaat tttgtatttt ctgtaaagac agggtttcaa catgttgccc agtctgatct 87421 caaacttctg agctcaagtg atccacccac ctcagcctcc caaagtgctg ggattatagg 87481 taccagccac cttgcccggc cagccacctt cttgtatcct cacatagagc ctcacatatc 87541 cttgcataga gtgagagtga gctcttgtgt ctcttcttta aggggtgcta atcccattca 87601 ggagggtcta cccttaggac ctaatcatct ccaaaagacc cgacctccta ataccatcac 87661 gtcgggggtt aggatttcaa caggtgaatg tgggagggac acattcagtc cataacatgt 87721 gtgccaggca gtttacatat atcacccatt taattctcac agtaagccat gagattggta 87781 gggtctccaa agatgaggag actgagattt atgtatatat aacctccacc ttctgggttc 87841 aagcgattct cctgcctcag cctctctagt agctgggact acaggcaccc accaccacac 87901 ccggctaatt tttatatttt tagtagagat gggccaggaa cactggccat gttggccagg 87961 atggtctcaa tctcttgatc tcgtgatcta ccccccttgg cctcccaaag tgctgggatt 88021 acaggcgtga gccaccgcac ccggccgaga ttgatacatt gcagacactt gttccaggtc 88081 acagagtcgt caggtgggga gctgggaggc gtactctgct ctggccccag gtccctgctg 88141 cacctgctgt ccaggtagtc tgcggcctcc ctgcaccgga acgatctggg atgcttgtga 88201 ggattccaca aagaggaatt ttaaggggat gtgtttgcta tgtacagttg gttgtcatgc 88261 agaggggaga tgacctttgt tctgtatagt tgcagagggc tgaaccacac cagccggtgg 88321 actatgcaga gaaaaagttg gaggtgggat ttctgcacac acttctttga gttcacacaa 88381 gaagcatgtg ttcacctata ggcccaggga atgctacctt cgagttgctg tcagagggaa 88441 agagcaagag agcccatcag tgcattggta gatggtcaag gacaccttcc caagaggaga 88501 agccccaggc ctgggcctca acgaaagcat agaggtgccc aggtggaggt gggcaggcat 88561 cccaggccac tgaacataag gtgacatgag gtttgggtct tgcaggactg gaaacaatca 88621 actctacctg agtgtgtgag cttgggtgga tgactcctta ggggtcacct gcagccattg 88681 gctgtgaact gcttgtaggt aaggagctgc ggtttgctct aattctcctc acatttcgta 88741 tcagaccagc ctctagtttt ttacatcatg tctggtgtat ccagaagtta tgccttgcat 88801 gagactagga agcgtcacta tggatgtttg tagggagttt ttatgagcct gtaacaagta 88861 gcctaccagc tgtctctccc tagagtaatt gggtgcctca gcaggatctg ggcctccaga 88921 tgctgagccc agctggaggc atctgaccct tctggggcac cccagagctg cagcccctcc 88981 ctgtggcctg accagtcctg tctgcttgtt gcagtgcagg cgacctggat ggcctatgac 89041 atggtggtga tgcgcaccaa ccctacgctg gccgagtcca tgcgtcggct ggaggatgcc 89101 ttcgtcaact gcaaggagga gatggagaag aactggcaag agctgctgca tgagaccaag 89161 caaaggctgt aggccccact ggcccaccac agctgccatg ccaccctctg cccgtatgaa 89221 gaggtcactg ggggatggag ctggcaccca catgaatagc tgtatgcact gtacttgttt 89281 cttaataaac ttatttttaa gcacagcccg aggacctctt ctctcctgtg ccatgatcca 89341 cccagctgca tatcaaacct tgtgaacaaa acacaacaag tgcaagctgc aggtgctttc 89401 ggtgaattaa aatgtatttg gtgccagtgg tgctgaacat agaataaaaa acagaaaaag 89461 ggcccagaca gggacatcca ggctggaggt tggagtggag cacagtcaga tgtattcact 89521 tttctagcat gggtttcaaa gtgcgtaagg ggagaaaacg catcatggat tgtgctgtgc 89581 atcctggaca ccgtgcgagg tgctcatagc tgtgatcttg tctcccacaa ctctcttggg 89641 tgacagacgt ccactgtcca cattttatag acaaggagaa agggaagtca aatgtctcgt 89701 ccaagtctac acagctaaaa aggggcagaa ctagggtgac gctcaggcct catttagaga 89761 tcgggggttg gcgagaagtg gggtgggctt ctggaggggc tgggagagcc ccacaaggct 89821 gcagagggtg gtgagcccgg agtgggcctg gcctggtgtg ggctgggggt atgggcagga 89881 gctgcagaca gcagggctgc accagcggac cagtttcaga ggcaagggtt ctaggccctt 89941 gagaatccac agtgccaaac agacccagat agctacgggg ttggtacctg gggaggcctt 90001 aggacaggca gaaagtccca gaggcgaggg cgttgcctgg ggacgttttt gctccctgtc 90061 ctgctgacag agcataggaa gtgtgaatgt tttctacccc ctcctctctc ggctcagcag 90121 agctccagcg agccaagtcc ttgtctgtgg agacgcatca gtccctggct ctagggaata 90181 gggagtccca cagacagggg ggtgtcagca agctgagagg gtctgtaagt aggtacggaa 90241 ttgagtcagg aaacagtctg ggtgtggagt gaggggcaga aagaggctga gggagtctgg 90301 gcttcaaaat aatcgacaac ctttaaagca gaaggggaaa gttgtccaga aacaagagca 90361 ggaagttccg atcccagccc cttccctgag ccccggcctc tcaggcccat cccaggaggg 90421 tctccctgga gagcagcgaa gcagctttgg ttctctgcct gccactcaga gtgaggtctg 90481 cagccggtcc tcagggggca gagtcaggat gaatggactg aggaccccgg tgctccccaa 90541 ggggaagggc tgcagctcct tggcctggaa ctgggcagtc cccccagaga ccgtgaaggt 90601 ggcagtgacc tgggggttga ggcagtaaat ggtgttgccc agggtggtgc agtgcagtgg 90661 ggcgggggcg ggcaggggca gggaggcagc cctgctccag gagccggtca ctgtgttgta 90721 gcgcatcacg gcggcgccca cgccccgcag caggtcgaag cggtacagga agccccccag 90781 tgccacgatg tcgctggaac gccggtggct ggcactgtat gggcactcgt cccaagcatc 90841 cttcacgggg ctgtacctga gcaggcggta gaagaggtga cccccggtga cgtagatgtc 90901 cccacggcag gccacagcct cgtgggccac agggaaggtg cctgcgggga gtggcgcgcg 90961 tggggtccag gcgtctgttc gcgggtcgta gcactccatg ctgtacaggc attcgccacc 91021 gatggcatag agcagcccgt ccagggccac cagcttgagc tgggctcggg cctgctgcat 91081 gggccgaacc tggctccaga tgttggtcag agggttgtag cagaagacct cgttggagca 91141 gacggccttg gcaccggagc cacggatgcc ccccgccaga aacaggtagt tgtgcatggt 91201 gcagagaccg cagccccgaa gcggggcctc ctcgggcacc tgggtcaggg gccgccaggt 91261 gttctcccgg gggttgaaca catgcaggtg cgcaggtaga ggcagggaca caggggccgc 91321 cgcaggaggc tcctcgccac gagggcccct ggggagccct gagcggcccc cctggtagag 91381 gctgggcagt acgaggacgc ccagcaccgc cggccccggc cggtccgcag gctgaggatg 91441 cgctcgcggt cggccgcgct cagccggcgg tagaggcacg ggtctcccag cactcgcagc 91501 aggttgtcgc tcatcagcgc gtaggtctcc tgcgccaggc cgggctctcc gtgctgctgg 91561 gcaaaggcca gcacgtccag gcaactgccc aggtccagcc gccgccgcag gctgccgccg 91621 tggcgggctc cagggcccgg gagctgggct tccggggccc ctccaccacc ccccaacccc 91681 cgggcctctg cagaaacacc atgagcttcc gggcctcctc ctgcttttcc gtgagcgccc 91741 cgctgcggcc ggcagggctg cccccctccc ccggcgcctc cccttgaggg tcccccgagg 91801 cggccggcct gacctcctgc tccgagctcg gggcgatttc gcacctgctg cgtttccgta 91861 gctggggtac cggctgtgag cgcctcgcag aggaggaggc cgcggggcca ggcggcctct 91921 gcgcgggggg ctgtgcgggg cccctgggca tcgcggcttg gggcagcggg tctagacttt 91981 tctccctggt tccctctcct cttaggcctg ggggagggtc tgggctgggc gatccgttgg 92041 tctcggagct tcttggctgt ggctgacctc cagcctcaac caggccccct gtacctgtcc 92101 ctatgactgt ccccactgaa gaagaggggc cgtgtctgtc ctcagagtgt gtggaagtgc 92161 tgggacccac ggggctcccg ggaaccgccc tcgggacact cccttggggc ctgtcttgcc 92221 tggggaaggg gtggcttcta agaaccatgg caataaggtt tccccagctg gctgagacct 92281 gcccctcctg gtagcgcctg gggagagcca cactctcctg aggctttgac ccgtcagctg 92341 gggcagggga tgcggctggg gttggggttg gggctgaggt gggggctggg gcaggggatg 92401 cggctggggt tggggctgga gttggagctg ggcttagggc tggggttggg actggggtta 92461 gggctgggga tgcggctggg gttagggctg gagttggagc tgggcttagg gctggggttg 92521 ggactggggt tagggctggg gatgggggtg tgagggttgg gactgggact ggggctgggg 92581 ctggagttgc ggctgcagct ggacttgggg ctggggctgg ggttgaagtt ggagctgagg 92641 ttggagctgg ggctggggct gaggtggggg atgaggttga ggtggtggct gaggttggtg 92701 ctgagctggg ggatgaagcc agggctgggg ctaggactga ggctggagct gaagttaaga 92761 cttgggctga ggctgaactt sgggctggcg ccgctctgag attttcgccc acggaaggat 92821 cctgggtaac ctccaggggc aaaaagtctg aaggagaggg gagagtcagg gcttcaggtg 92881 ggaaccccgg gccgggagct gcgggcgggg aggcggagcg cgggcgaggc tcacggctcc 92941 gggagggcgc gcggctgtgg ccggctgcag ccggctttgc ttgctccggg ggccgggtgg 93001 gccccaggtc agcggcagga ctgttatctc gggagctgtc agccccggcc tcgccggcat 93061 cgtaggcccc tgtggcaccc ggcccttggg acgagagtgg gcgaccttga ggcggactct 93121 ctagccaggg gcgcgtggag tctggggctg ggccaaagct ccgggtgccc gggacctccc 93181 tgctgaccca gggccagcct cctccggggg cagggacccc cgaggcctcc ctgctccagc 93241 cacccttggc cccagagcct tccgtctgag acccaggctg cagggcgcgg ggagcgggca 93301 gcgccgggag tgcgggctcc tgggcgagtg ggggtcctgt ggggaggccg cgcgcgtggg 93361 cgtccagggc ggcggcccgt ccaccgcgtc ccacacgctc cccgcggcgc ccaggcggag 93421 cgggtccagt ttccgggggc ggcgggctct gtctcccgag ccagcgtcca tccgccgccg 93481 ccggccccgg cctctcccca tcgagcccga gggccccgcg cccgcctgcc aggggccagt 93541 gtcctgttgc cccgcggggc ctgcggctgg cgagaggacg ccctgacacc acctgtctcc 93601 gcctccgctt cgctgccagg gctccgagga gtgaagtgga tcaggagggg ggcggacatc 93661 ccccctgctt cgctcctgcc caggagggca gtgcctggcc tgggaggctc ctttccttcc 93721 gctcgggggg ccgcgggggt ggcactgccc cgttgctgtc ctaatgcctc tctgcgggcc 93781 tcctcgacag ccgtcttgag tggaagccgg gccatggcgg ccaggccccc gcccggcgct 93841 gcagcaggga cagggctctg ctggccgcgt cctgggggct ctggtttccc ctgccctggg 93901 ctctgcccct cgctgccatc actgacctta gacttcggtt gcagggccag ctcagctcct 93961 cctgcctgat gaggttgagc cccgagggct gtggacaccg gttctgccgc ctcttgatcg 94021 tgcccggagc caaaccagtg taaggccagc gctgtggcag ccaccacagc cagcgccagg 94081 agggccagca cagcactatc ccagtcattg tcactgtccc agtcccagcc ccagagggga 94141 ccatctggct ccaaggtgcc ctggatcatg gttcagggct gcaatagagg cctgatggca 94201 ggctggcagt gcatgcagag cacagaggcc tgggctgagg caccggccag cgcctgcccc 94261 aagggctgcc cagttaaggt ggaccagtcc tccagggcct gggaatggtg gggtctctgc 94321 ttgcactcag gcctgggggt gtcagtaggg cctggctgca gatatcagga gacagcgagg 94381 cttatggcag gataggagcc cagggtctct gcaagcaccc caccagctcc tgccctcttc 94441 tcctcctcgt aggccccagg gtcctcaggc agcgttcctg agctctgtgc tgtcatcaca 94501 gccaacctgc ttctgcctgg tagaccctca cacccctaaa tttattccac accccacagt 94561 ccctcctctc agcctgggcc cagcttcagg gggcttggca gggaagggtt acccacaggg 94621 agatgttatc tctagctcta gcaatgtgat gtggccctga tgaaatcacc agaaaccgga 94681 gaggctgatt ccactgtggg ccagcagggg aggggtgggg gccgtgcacg ggagggtgca 94741 ggctgtgcct gacggtggtc tctcaggctg aagagggaga ccctggacac gggtcctggc 94801 agtgagcatt atcattgcag gggtgctgtg gaggtttgta ctggggcaca gggagggggc 94861 atagagagga gactgaggct cccatagctc ctcctccagc agcttttgga ctccttactc 94921 tgggtttacc agacatggga tagggttatc aagtcaagag aaaaaaaaaa ttaatgggat 94981 aaataaaatc caaacatatt tctctcttta tggccaggct tatttcagcc cacctagacc 95041 ttctaggatc tcctggaccc cctgcgccca tcgagagtgt cttgactgcc cagatgagac 95101 tctcactgtg cccctggaga accctgagcc cagcctgccc cctactccag ttaccgcatc 95161 agccagcacc tggatccctt agaaaccacc cccctctggc cccaggtgct ctaatagcca 95221 gactctgggt gtcttcatct cagtcctcag ccttccctta tctgagctga ggactcgccc 95281 tcgagaccac atcctggctg gtacctccta gccctggctc aggccggcat gtaggtactt 95341 ctgttggatg tgagcacttg cttcctgagg gagaagcctg ggctcctcca cctgctcctt 95401 gcttaatccc atctcaggag tgtctcgggc caggtgccag gtgaggctca gacaagaaaa 95461 agagcctgga gtccctggga caaaggcagg gctccctgga aagtaccgtg tgtatttttc 95521 cttcctctgg gacacagcag gcctggggat ttctccctta ctcaagcagc aaggagatgg 95581 cctaggagtg tccctggcac ccagcctgaa ccctactgag cctcaggtaa ctcacctccc 95641 agctccccct gggactatta gatgacaggg agagggtgag ggtgggcccc atgccttaag 95701 gtggacctca agttgaggga aggctttcta gaccgtgccc ttattctctg tcccagtctc 95761 aggctctgcc tcttggttaa gtttggggtc cagaaacctt cagaggatgg agatgacacc 95821 cataaatatt agagactgag tagggggcgg gaagcggtgg cggcagcgca cccccggggt 95881 gcaactgccc tggaactcca agtggtggct tcaaaactgg attaaaattt atttccgcga 95941 aaccctggag tcacggtccg gcgccgactg gcgttttctg ccgtagactg agaacgcgtc 96001 agggatggct ccgttggcag cgcgcgcgag cgcccccggg ccgcgcctgg agggttgaca 96061 gcgactcgcg ggcgaactgt gaacccgcag cccctgggct ccgagcgcaa acctggcgac 96121 tgggagggcc cggggagcgg gtgagccagc gccagccccg ccggccccag gagcgcgaat 96181 ttctctacat ttccggctgt ggccgcagtt ggaggatggg atcaaaacgg gcaaaaaagt 96241 gcgacctggg ccgaggtgtc ggatcgcctc gaggtgggag gccgaggtgt tggatcgcct 96301 gaggtcagca gttcgagacc agcctggccg acagggagaa accctgtctt taccgaaaat 96361 agaaaaatta gccgggcgtg gtggcgggcg cctgtaatcc cagctactcc agaggctgag 96421 gcaggagaat cgcttgaacc cggggagttg cagtgagccg agatcgcgcc actgcactcc 96481 agcctgggtg acagagccag actccatctc aaaaaaaaga aaaaaagaca ccagcaaagg 96541 gtttttttct gttaataact aggtcccccc aaccccccaa aaaggaggat ggttaaaagc 96601 agaaaaggca aaaaataaag gcttttggcc gggcgtggtg gctcacacct gtaatctcag 96661 cactttggga ggccgaggca ggcagatcac ctgaggtcag gcgttcaaga ccagcctggc 96721 caacatggtg aaacccctgc tctactttaa aaaaaaaaaa aaattagtgg ggcatggtgc 96781 cggacacctg taatactagc cactcagaag gctgaggcag gagaatcgct tgaacccggg 96841 aggcggaggt ttcagtgagc cgagatcgca ccgccacact ccagcctggg cgacagagcg 96901 agactctgtc ttaaaaaata aaataaataa actcttttat atttcaaaat ctgaagacaa 96961 atcactgtag tgcagataaa ataaaacgtt catatcattt tattatttta acgtgtttca 97021 tattatttta acgtgtgaca ttaaacaatt tcagggccgg gcgcggtggc tcaagcctgt 97081 aatcccagca ctgtgggagg ccgagacggg cggatcacga ggtcaggaga tcgagaccat 97141 cctggctaac acggtgaaac cccatctcta ctaagaatac aaaaaaatta gccgggcgcg 97201 gtggcgggca cctgtagtcc cagctactcg ggaggctgag gcaggagaat ggcgtgaacg 97261 cgggaggcgg aggtttcagt gagccgagat cgcgccactg cactccagcc tgggcgacag 97321 agcgagactc tgtctcaaaa aataaaataa ataaaataaa gtctgttata tttcaaaatc 97381 tgaagacaaa tcactgtagt tcagacaaaa ccttcatacc attttattat ttaacgtgtt 97441 tcatattatt ttaacgtgtg acatgaaaca atttcaggaa attgtattag gtattgcgct 97501 cgtgtcttct taggcgtcct catccctttt cacctcggag ctcagcgtct tcctcagcag 97561 cacttccatg tcatctgccc cgtgaaatca gcctaacgcc gtttctcaat gacgtggatc 97621 gccctaggcc accgcaacct tccggaagct ctctcagctc agttcccatt ctcccaccat 97681 ctcttggttc tccttctacc tcaccggttg ctagtcctcc gcttcgcagc tgaaaatgtg 97741 cccggggcct actgtgggcc tagccaggcc tgcttacgca gtgcagtttc ccatgaatga 97801 tgcccagtca ttatcacata acctgtggca agccagcaag atggccctgg tgacagcaaa 97861 agaaactgca ctaggacctg aatgtagatc tcagtcatgt tccttactaa cagcacgttt 97921 tgcaaccatg cgttaaagaa acatctgact cacaacaaaa ttttaaaggg tttatttgag 97981 tgaaaagcaa tttatgaatt ggggaacacc tgactgaaag agcgttagta ttccagagac 98041 aaaacatcaa gtgcaagttt ttattgggaa aatgtagaag cacaataaag aaattatttg 98101 attggttaca gttatagagt tgctttcttt ggcttatcct gttggaaagt cagtagttat 98161 ataatcatga tagttggtta cttatgattg ggtagattaa aatttatctc tgtctaatgg 98221 aagcatttat cagaaacaac tcagccaggc atggtggctc acgcctataa tcccagcact 98281 ttgggtggga ggctgaggtg ggcagatcac ctgaggtcag gagttcaaga ccagcctggt 98341 caacatagtg aaaccccatc tttacaaaaa tacaaaaatt agctgggcat gatggcaagt 98401 gcctgtaatc ccagctactc gggagggtga ggtgggagaa ttgcttgaac ccggtaggca 98461 gaagttgcag tgagccaaga ttgcgccatt gcactccagc ctgggcaaca gagaagactc 98521 tgtctcaaaa aagaaaaaaa agaaatgact catgcctgta atcccagcac tttgggtgtg 98581 aggccgaggc agggcagatg actagtcttt gagaccagcc tgggcaacat agcaagacct 98641 tgtctctact gaaaaacaaa caaacaaaat aatggctcaa gttaagtttc acttatgttt 98701 tcaatttaag caaggctgaa gtcacttacg agacctaact tgttttgtct gcttagggat 98761 tcttcaggcc cggtctccat tttagtttac ttaaacagcc cgggtgcagt ggctcacacc 98821 tgtaatccca gcactttggg aggccaaggc aggtggatca cctgagatca ggagttggag 98881 accagcctgg ccaacatggt gaaaccccgt ctccactaaa aatacaaaaa aattagctgg 98941 gcatggtggt actcacctgt aatcccagct actcgggaag ctgaggcagg agaattgctg 99001 aacccaggag gcagaggttg cagtgagctg agatcgcact actgcactcc agcctgggtg 99061 acagagcgag actccgtcac aaaaaaaaaa aaaaatttac ttcaacaaat ccccactggg 99121 caagctgaga ggatgaccag atggtagagc tttgctctct gttaccactg agtgacgtag 99181 tcactggagt gcagagtcgt gtacaggtga tcaccatcat agacctgggt ggtggggtgt 99241 tgaattctgt aagttgttta atcctgatgg caatcatttg atgtgagagt ggctgctgga 99301 aagtgttgaa acccttgaga ggatacaatg aaggaactgg gtaaacacaa tgactgtaag 99361 aaaaaccacc aaggactcag caactccctg cagctgagat ccccaggagc caagatttaa 99421 cccattaaat caaccaacgg cgtcgtctat ctcatgagag acttacattt gtttcttaag 99481 agtctttaaa ttttggaaaa caatttctaa cgtattgata aggaaatttt cttatcaaga 99541 acacaagctc ttatttgtta agctcctaag gtatcaagag cttctagttt ggaacactgt 99601 tgaggctaag ttgttcattt cagattgcat tgcatgatgg ggaattcaaa gtattgttct 99661 atggaaacaa aataatttaa caaaggttaa tctttggagt aaattatgac cttagccttg 99721 ttcgtttttt ttttttttga ctaccagtga ctatgatctt agtttttgag cccagagatt 99781 gcagcctctt cagatgatgg aatgaagatg tcagttaaaa aaagaccggg cgcggtggct 99841 cacgtctgta atcccagcac tttgggaggc ggaggtcagg cagatcacct gaggtcggga 99901 gttcgagacc ggcctggcga acatggagaa accccatctc tactaaaaaa aacatatata 99961 tatatacaaa attttcttgc tgacaaattt tgtaacagag ataatatgag tttatttgat 100021 taacaaagct aggtagaata tccattttta ctaaatcacc aatattctta tttattaaag 100081 attaatcaag ccacatggac ttgaaaagta tttgggttac tctttttctg agaaaatact 100141 tcatataaac acttgttttt cttatctttt cctttttttt ttttttttgt ttgagacaga 100201 gtcttgctct gtcccccagg ctggagtgca gtggcgtgat ctcggctcac tgcaatgtca 100261 gcctcctggg ctcaagcgat tctcctgcct cagcctcccg agtagctggg attacaggta 100321 cgcgccatca cacccagcta atttttgtat ttttagtaaa gatgggcttt ctccatgttg 100381 gccagtctgg tcttgaactc ctagtaatcc acccacctcg gcctcccaaa gtgctgggat 100441 tacaggtgtg agcctggcca taaacacttt ttttctcagg ctagtgacaa aaatgtcaga 100501 gataagtatt tgtctgtcta tttaagaaaa tcaggaggca cactgaagaa atcacttggt 100561 tacagttatg caacttcctc atttggtctg tctagttggg aagtccctag ttacgcaatt 100621 gagcgttagc tggccgctta ggattggctg gagttgagtt ttacttctgc ctcacataac 100681 catttatcag aaatgactcc agttaagttt cacttacgtt ttccatttag gcaaggttga 100741 ggtcagttag aacgcctaac tggttttgtc tgctcagggg ttcttcaggc ctggtctcta 100801 ttttaatatt tagctcaggt aaggtaatta tgctggggtt tcacagtaat ctattatagt 100861 ctggaaaatt aagtcttcct ggaatacaac aggacctcac cgcagatgtc aaatacgagt 100921 ccaccagctt aagggtccct ggtgaggatg acagaaactt gagaagctac acttatcaaa 100981 catcccacaa cctaactcat gaaagttggc tgagacgcct aagtgctgag cgtcggaggc 101041 tgtcacttgt ttatgatcca catatcagag tgcttgtttc tgccacgtct ccagcccggg 101101 agtactgcag ggccctgggg ctctttaccc acgtgcacac cctcagtcag gggcagcaaa 101161 gtttttgtgc gaagagctgg attgcggctg tttttgcctc tgcaggtcac acagtctctt 101221 gcaactactc aagtctgcac aaaaccaacc ttaaaaaata aaaggatgtg gctgtgttcc 101281 tataacattt tattttttat ttttattttt atttatttct tttttgagat ggagtctggc 101341 tctgttgccc aggctggagt gcagtggtgg gatctcggct cactgcaagc tccgcctccc 101401 gggttcacgt cattctcctg cctcagcctc ccgagtagct gggactacag gtgcccacca 101461 ccacgcccag ctaatttttt gtatttttag tagggatggg gtttcacagt gttagccagg 101521 atggtctcga tctcctgacc tcatgatccg cccgcctcgg cctcccaaag tgctgggatt 101581 acaggcggga gccaccgcgc ctggccccaa taaaatttta tttacagaaa caggaggtgg 101641 ctggacttgg tgacccctgc actaagcggc cccatccatt cccacggctt gtgcacgcct 101701 ctcagttcag catctccgac ctggacctct ccgctacgtc tgccatgaac atcactgggt 101761 ggcagacgac atctcaagcc tgctgtgctc acatcaggac tcttgatttt ccactgtgct 101821 ggatgtggtc cattgtgttc agcttccatt ctcacctcct tttgtttatt ttcctgtatt 101881 ttagaaccta aaatacaaat gcggcaatac tgtgttttac agatcccctt gcagccacag 101941 ttccgggcga ggctgttgga ttcagtgcag ttgtgtaaga tgtgaaaggc agaaatgact 102001 cggaggcctc cgccctgcgc tgactgctgg cagacagggc tgtggacact cagccgtcgc 102061 tgagggaggc ctgggcagca ggggtgcagc atcaggattc ccggcaggtt cctgatctca 102121 gaatcacagc tacccaagtt tcctcttaag ctcggtagtc ccaaaggcca cctctcagct 102181 attataccac caactcatcc agcgattcta taatattccc cataataaat ccatttctac 102241 ccctgcctga tacactcacc gaacctctct catcttcccc tccgaaaatc tctatcacga 102301 cttagttcca aacagaaggg aacagccaca ccaggaatgt gaacaggaaa aacattaaca 102361 tagcgaatcg ttacctagta agatgacgtt aacagcggaa gtcaagaaaa gaggacacaa 102421 agggattcag aagcagcaaa tgcagaaaga tgccgcgatg gctgggcttc aggagcaagg 102481 aaggggccgg gaatgcacag agctgccagg ggtgcgtggc ccagccaacc tggtgcatca 102541 aattaactgt cgcatgcggt taatacctaa gaaccctttt attcttttcc tgttttagaa 102601 tttttttgac aattgaacac atttttgcat atgaacttta aatttgtgtt ttcctatttt 102661 taacattgga aatcttactg gaattaaata tttatattca tttgaggagc tgtttgtttg 102721 tttgtgacag agtctcgctc tgtcccccag gctggagtgc agtggtgcag tcttggctca 102781 ctgcaacctc cacctcccag gttcaagcga ttctcctgcc tcagcctccc taagtagcta 102841 ggattacagg tgcctgacac caagtccggt taattttgta tttttagtag agatgggttt 102901 tcactatgtt ggccaggctg gtttccaact cctgaccgca agtgatctac ctgcctcggc 102961 ctcccaaagt gccgcgatta caggcatgag ccacagcgcc cagtctcgtc ttcccatctt 103021 gttcagagga ttaccggtag ggtaatgaaa atcgtaaagc gttagccatc tgaacaggta 103081 accttaacct gttctgtttt atcaaaaaca taatttaaat ccaactgtcc tttcacacac 103141 ccgtgaattt atatttttat gttttactgt ttttgttttt gcttttgttt ttgtttcgag 103201 acagagtttt gctcttgttg cccaggctga agtgaaatgg tgcaatctcg gctcactgaa 103261 acctccgcct cccggattta agtgattctc ctgcttcagc ctcccgagta gctgggatga 103321 caggcacgca ccaccacatc tggctaattt ttgtattttt agcaaagaca gggtttctcc 103381 atgttggtca ggttggtctc gaactcctga cctcaagtga tctgcccacc tcggcctccc 103441 aaagtgctgg gattacaggc atgagccacc gtgcctggcc tactgtttta taactaaaat 103501 tttcaagtga aaagccacaa gatctttgtg ttgtctatgt ttttgtgtct ttatgtctat 103561 atgtgatttt aatttacaaa gaactctatt taattggcat aaagaaaaaa agcacttaaa 103621 tcaagtaatc taagagcata cagtgtatac tatttttata ctatatttaa gaaatatagg 103681 gacaagcaca gtggctcatg cctataatcc ctgaactttg ggaggccaat gcaggaggat 103741 tgtttgaagt cagtttgata ccagcctggg caacaaagta agacctcttc tctacaacaa 103801 aaacaaaaat agctgagtgt ctctattaaa gtatagatag atagatagat agaagaaaat 103861 acaaaaataa aatcattctg caaactgagt tcatcatatt acttcactct gtgtcattct 103921 tttaaaaccc tgcacctgcc caacctctgc aaaagtacaa ctgctaacaa gtaatgctgg 103981 cgcagcactt tgacatgata gggaacacct gtgagactga tcaaatgaaa ggccgactta 104041 acccaatagc tactccttcc aaatttcctt ccagctgctc aaatgtggtt caaatggttt 104101 tgacactgac ttaaaattgc tggccattct cccaacatgg caccaagtca acctggacag 104161 gaagggataa tcgaaaccta accacaggat aattgatcag tgatgtcttc agagaatttt 104221 tttttttttt ttttggggac tgagtctcac tatcacccag gctggagtgc aatggcacgg 104281 tctcagctta ctgcaaactc cgcctcccgg gatcaagtga ttctcctgcc tcagcctccc 104341 aagtagctgg gattacaggc acttgccacc atgcccagct aatttttgta tttttagagg 104401 aaacgtaaaa attgccaata tcaagatgga gtcacttgtg tcgaacccaa acaaaataag 104461 aggttaagaa ggggtggggg ccttcatgca cacatgccta tgaaaagaac catgacaaaa 104521 ctctctgagt ggctggacgc ggtggctcac gcctataatc ccagcacttt gggaggccga 104581 ggcgggcaga tcacaaggtc aagagatcga gaccatcctg gccaacatgg tgaaacccca 104641 tctctactaa aaatacaaaa attagctggg catggtggcg ggcgcctgta atcccagcta 104701 ttcgggagga taaggcagaa gaattgcttg aatccgggag gtggaggttg cagtgagccg 104761 agattgtgcc actgcactcc agcctggcga cagagcgaga ctctgtctca aaaatataca 104821 tatatagtta ttatatttta tctactactt tgtgttagtt cctgtttcat tcctttatgt 104881 tacttatttt ttttccattt gtaagatata tactaagctt tttgaatcgt ttcttcatta 104941 ataatgtagt cacgaggctg ggtggcttac ttctgtaatc ccagcacttt gggaagccaa 105001 ggccggtgga tctttcgagc ccaggtgttc aaggccggcc tgggcaacat ggtgaaaccc 105061 cacctctccc aaaaatacaa aacaagtagc cgagtgtggt ggtgcatgtc tgtggtccca 105121 gctacttggg aggctgacaa gggaagattg tttgagcctg ggaggcagag gttgcagcga 105181 gctgagatgc tccactgcac cccagcctgg ttgacagagt gagactctgt ctaaaataat 105241 aataatgaag tctttttttt tttttttttg agatggagtc ttgctctgtc gcccaggctg 105301 gagtgcagtg gtgcaatctc cgctcactgc aagctccgcc tcctgggttc acgccattct 105361 cctgcctcag cctcccgagt agctgagact acaggcgccc gccactatga ctggctaatt 105421 tttttttgta tttttagtag agacggggtt ccaccatgtt agccaggata gtctcgatct 105481 cctgacctcg tgatccaccc gcctcggcct cccaaagtgc tgggattaca ggcgtgagcc 105541 accgcaccca gccataatga agtcatctta aaatgactat aaattttcct cagaacatgt 105601 agggtccagc cgtacggggc ttagtgggtg ttctccccgt gtgcggagat gagagattat 105661 aataaataaa gacacaagac aaagagataa agagaaaaca gctgggcccg ggggaccact 105721 accatcaaga ggaggagacc ggtagtggcc ccaaacggct gatatttatt gcatacaaga 105781 caaggggaca gggtaaggag ggtgaatctt ctgagtgatt gacaaggtga agcatgtcac 105841 gtgatcacag gacaggggcc ccttcccttt taggtagctg aagcagagag ggaaggcagc 105901 aaacgtcagc gttttcttct atgcatttct aagaaagatc aaatacttta agactttaac 105961 tatttcttct accgctatct actgtgaact tcaaagagga accaggagta tgggaggagc 106021 atgaaagtgg acaaggagcg tgaccagtga agcaccacag ggagggggtt aggcctccgg 106081 atgactgcgg gcaggcctgg aaaatatcca gcctcccaca agaagctggt ggagcagagt 106141 gttccctgac tccttcaaag aaaggagact gcctttcgca gtctgctaag taacgggtgc 106201 cttcccagac actggcgtta ccgcttgacc aaggagccct caaacggccc ttatatgggc 106261 gtgacagacg gctcacctct tgccttctag gtcacttctc acaatgtccc ttcagcacct 106321 gaccctatac ccgccggtga ttcctaggtt atattaatga tgcaacaaag agtaatatta 106381 aaagttaatg actaatgtct acactaatga ttgataatgt ccatggtcat ctctatatct 106441 aatttgtatg ataactattc ttactgtaac tattttcttt attatactga aacagtttgt 106501 gccttcagtc tcttgcctcg gcacctgggt aatcctccgc ccacaagaac acagctttag 106561 ctgcctcctg taggagttca gtaggaaggg aatctacctt tgtagtgttt gatatgtagt 106621 tttaaggaag gagaccaccc ctcatatata tgtcttatgc ccaatttctg cctccaaaga 106681 aagaaaaagt aaaacctaaa agacagaaat aaaatccaca ggcagacagc ccagtgccgc 106741 gccctgggcc tggtagttaa agagggaccc ctgacctaac cggttatgtt atctgtagat 106801 tccagacatt gcatggaaaa gcactgtgaa aatccctgtc ctgttctgtt ccgttctgat 106861 tacccggtgc acgcagaccc cagtcacgta ccccctgctt gctcaatcga tcatgacctc 106921 tcacgcagac ccccttagag ttgtgagccc ttaaaaggga caggaattgc tcactcaggg 106981 agctcggctc ttgagacagg agtcttgccg atgcccccgg ccgaataaat cccttccttc 107041 tttaacctgg tgtctgagga gttttgtctg cggctcttcc tgctacagtt tgtaaccaga 107101 ttatttctga caagttttga tccagggttt ttttttaaag atttgtcaat ttcaaaatat 107161 ttgtttgctc ctgcttcctg ttctctctct cccttttttt ttcttttttt aagacaaggt 107221 ctcactctgt cacccaggcc ggagtctagt ggtgggatca ctgctcactg caggatccaa 107281 ctcccagact caacagatcc tgccgactca gccaccgacc ccatagctgg gaatacagat 107341 gtacatcatc acacccagct aatttttaaa gttttttgtc gagatgggtc tcactgtatt 107401 gcccaggctg agttttcttc ttcttttttt tacattttgt ttgcctagaa ttgtatttga 107461 ttgtaatcaa atcatgtgtc acatcttttt ttcaatttgt gtgattattt tgtggcccaa 107521 tacatgattt tttttgcgaa tatttcatgg acttaaatag ataagtcctc tctctacaca 107581 taattaaatg gagtgtgtgc tttgcattat tcagttcctt tattcccatc cttgcttttg 107641 tctgcagatt tgtcctattc cagaatgggt atactgaaac ttgggtaagg gtgaggccag 107701 cgttctgggt ggcccaggct actatcccac ccgtgccaca gaatccacac agaggcccag 107761 cggcggcact gggtggtggg gaagggggag gcggacccag gttggggggg caggcagtca 107821 gtgcggttgc caggaagcgg gaaagggcca caatggggtc tgggaggtgg gcggggcgga 107881 gcggggatgt ccagccacgt cgctttgttt tcccacgcta ggagctacca caacaggtca 107941 gtgcacctcg gggcaccggg tcccccacgc aggcccttcc cggggcaccc tttcccaaca 108001 gcacgaaaat ctcctgcccc caccccaccc ccaggtgatt cccttcacca agtctgggtc 108061 tagacttggc ccccgcgagt tcacgtccct gcctaatgcg cctggagtcc gggtgatggc 108121 cctggatccg ctcccggccc caagctccac acagccggga aggaagctcg cgtttgtttt 108181 gctggcgcca gggatacggt ccaggccccg ccccgccccg cgctgatccc agggtagtgg 108241 gaaagtgcgc tccccaagaa tttgcggtgc aggggccgcg gccaccgcct tctgtccgca 108301 ggtgctgcga gccgtaagcg ccccccaccc gcgctatggg ctcggacgcc tgggtgggcc 108361 tttggcggcc acaccggccc cgcggcccca tcgcggcgca ctacggaggc cccgggccca 108421 aatacaagct gccgcccaac accggtagaa ggggcaggcc ccgggattga ggggtgggac 108481 ggcgcctggg ggagggaagg ggaagagacc agggaggcct gcgctcgggg ctgaaccgtg 108541 cctgctagag gggagagggg gtctgtgccg ggggagggtc cgtgttgcag atggggcttg 108601 gggctgggcc cggccagacg ctaactcgga tgctcccagg ctacgccctg catgacccgt 108661 cgcggccgcg cgcccccgcc ttcaccttcg gcgcgcgctt ccccacgcag cagacgacgt 108721 gcggccccgg gccaggccac ctggtgcccg ctcgcatgac cgtgcgcggc accgacggcg 108781 cccccgccta ctccatctac ggccgcccac gccgctcagc gcccttcctc actccgggac 108841 ctggtcagga cccccgggcc cctggccacc ccaacgccga actgcctcca gggaggccca 108901 cctgggaacc cccgacctga accccgagtc cccctcggat accctaacac cgcatattcg 108961 gtacccccat atccggatct caaatcccaa accccgaacc ccacggggct ttgataaatc 109021 gtggctcaga ctccccacta gtcccaggac cccatctcgg gtacccacca ggctcccacg 109081 cagttctagc cccccacacc cttgatccgc cccgcaggca ggtacttccc ggagcgagcg 109141 gggaacgcga cgtaccccag tgcgcctcgg cacaccattg ctccccgaaa ctggggtgtc 109201 caggcggaac agcagagccc aggtgaggtc agaacggccc atcccagaac tgtgggcctt 109261 cccactcgag accggggacc gccctccggg agctgggacc accctgcgcc tgtccgcgga 109321 gacccactac ccccgagccc tgcctcctcc ccaggtcccg cggcctatac ggtgccctcg 109381 ctcttgggtc cgcgcgtcat cggcaaagtc tccgccccaa cttgctccat ctacggccgc 109441 agagcggctg gcagtttctt cgaggacctc agcaaggtgg gggaggggcc ggggcggacg 109501 cagggggtcc ctggtccgcg gcagtggagg cggcagccag caccctctgc cctctcgcag 109561 accccgggcc cctgcgccta tcaggtcgtg agtccagggg tctacaagtc ccgggccccc 109621 cagttcacga ttctggcgcg gacttcgctc ccccaagaca acactcggaa gccagggccc 109681 gcggcctaca acgtggatca ggtggcctgg agcccagggt caagggtcag agtcaggaga 109741 gtggggaggg cctgaggtcg gagtgatggg atcagagtcc ccgggggtcc aggggtcccg 109801 gcgcggagag gatgccggcc ccgcgaggtc agcggtgtct ccgggcccgc agcaccggaa 109861 gccccgcggc tggagtttcg ggatccggca ctcggactac ctggccccgc tggtgaccga 109921 cgcggacaac tgacccgcca ggcgggagcg gccccacacg tgtttgctta aagtctgcga 109981 gtccgcatcg tgtccgcctc tctctctctc tctctgcgcg tcctggcgca aggcctgggg 110041 tggagccacg gctggggccg tgtcccaact ccgaacccag cggggcgggg cccgagcgtc 110101 gggcgaggcc gggaccccag cgctgcgccg cgtccgaacg tcgagacccc accgagggcg 110161 ggagggggac tctcgggagc cacagacgcc cgagacccac gccgggcggg accggccagg 110221 gatcaccccc gccgacggcc ccgggccccg acggcccgga agttccgcgt gtccgggggc 110281 accgggggat tggccggggc gcggcgtgca aggcttcccg ggggcggcga ctgccgagct 110341 ccgccctcca ggcggcccca cccgcctgcc gtcctggggc gccgccgccc cgccgccggc 110401 agtggaccgc tgtgcgcgaa ccctgaaccc tacggtcccg acccgcgggc gaggccgggt 110461 acctgggctg ggatccggag caagcgggcg agggcagcgc cctaagcagg tacgggcggg 110521 gctcaagtcg cgaggcgggg aagcgggagg cagacacgga cgagggcgac acagacacgg 110581 gaccgagggg cggacaccgg agagacacgg gaaaggggtc gggacaggag cacgtggctc 110641 agacaccgac gccgggaggc cgcagacccc ggacgtgtca ggcatccccg caggcccgga 110701 gcgatggcag ccttgatgac cccgggaacc ggggccccac ccgcgcctgg tgacttctcc 110761 ggggaaggga gccagggact tcccgaccct tcgccagagc ccaagcagct cccggagctg 110821 atccgcatga agcgagacgg aggccgcctg agcgaagcgg acatcagggg cttcgtggcc 110881 gctgtggtga atgggagcgc gcagggcgca cagatcggtg cgtggggagg gttgggcgtt 110941 cctgaccccg actgggaggt cagcccgaga gactttgggt ccctgggggt gcgacggtgc 111001 cccactacca gcaccggccc cagggtgccc caccgctgtg ggctgccacc ctcacgcgta 111061 cccccacata ccaggggcca tgctgatggc catccgactt cggggcatgg atctggagga 111121 gacctcggtg ctgacccagg ccctggctca gtcgggacag cagctggagt ggccagaggc 111181 ctggcgccag cagcttgtgg acaagcattc cacagggggt gtgggtgaca aggtcagcct 111241 ggtcctcgca cctgccctgg cggcatgtgg ctgcaaggtt agaaaccacc tcctttccag 111301 acgggagcct ataccgcaca tgcagcaacc agtccatcca caggcagctc ccaacctcaa 111361 gcctggccca aagcctccaa gaccctacca aggcttctcc ccaccctgct ccccagcaca 111421 gttctcccca ccccgttccc cagcacagcg cttggggccc ctctggctcc agaccaggcc 111481 ccttggagca ggaaaaagat ccactgatgg aattcagacc cctttcccct tgggtcccca 111541 gacagctccc ccaagggagg agctgaggac tttcctccct ctgccccaag ccttgtttcc 111601 ccaaggacag gtaccaacct cctcccctac tgacacttct caaccaagaa aacttccttt 111661 ccattccctc accagctggg cacccctata gctgcttaaa tactttccaa atccagctgc 111721 actcctagcc agggaaggtg aagggatgca cagaggtggg ggaggggtac tgtgcagggt 111781 actcagcatc cctgaccacc aggtgccaat gatcagcgga cgtggtctgg ggcacacagg 111841 aggcaccttg gataagctgg agtctattcc tggattcaat gtcatccaga gcccagagca 111901 ggtacggggc gccacggatc agtcattaat ccaggttgat gatggagacc ctggccagaa 111961 tcactaaaag atcactggtg gatcattagg gtcactaatg agaacactgg tcaaggttac 112021 tcatgagtca ctgggcctgg gccgaaatca tcagtggaac tttgattagg atcataaaat 112081 gggaagttgg tcaaaatcac agatggctgg cggggcacgg tggctcacac ctgtagtcct 112141 agcacttggg gaggccgaag agggcagatc ccttgaaccc aggagttcaa aaccagcctg 112201 gataacacgg caaaacccca tctctacaaa atagttcgct gcgtgtggtg gtgcacgcat 112261 gtggttccag ctactcagga ggctgaggca ggaggatcac ttgagcctgg gaggtctagg 112321 ctgcagtgag ccgggacgat gccactgcac tccagcctgg gcaacagagt gagaccctgt 112381 cccagcactc tgggaggcag aggagcccag ttggagatca gcctgggtaa tatagtgaaa 112441 cttgatctct acaaaaaaaa gaagaaaaaa aaaagccgcg tgtggtggtg cgcacctgta 112501 gtcccagcta ctgggaagct gaggtgggag gatcacttaa gcccaggagg cagaggtcac 112561 aatgagccga aattgtgcca actgcactcc agcctgggca acagaggaag actcttcaca 112621 gaaaaaaaaa aaaaaaaaaa aaaagctgct aagtcattta ccataagtca ctgagaacag 112681 gggatgtctg accagatgca agtgctgctg gaccaggcgg gctgctgtat cgtgggtcag 112741 agtgagcagc tggttcctgc ggacggaatc ctatatgcag ccagagatgt gacagccacc 112801 gtggacagcc tgccactcat cacaggtgac ctgactccat ggcctgcttc tgcatgttca 112861 caggctcctg acctccaaac tcaagtcaag ggcctctcgt taggagttac ccgtcacctg 112921 accgtgtgcc cccctacccc catcacaaga tgcctgacca ccaccatgtg gggtggcctg 112981 atactcaacc caccaggtgc tgccaccccc ataataaggg acttgaccct caatgctcag 113041 ggcccctgac cccaaagtcg gcatccccga actctcccaa gaagctccag gttctccatt 113101 gtctccaacc tcctctgcct cccccaaagc ctccattctc agtaagaaac tcgtggaggg 113161 gctgtccgct ctggtggtgg acgttaagtt cggaggggcc gccgtcttcc ccaaccagga 113221 gcaggcccgg gagctggcaa agacgctggt gagcggtgtg gcctttccct gggcaagcgt 113281 cttgatgcgg gcccagccta cccttcaccc ctcccgtccc cactgcctcc ctccactcag 113341 cagtcctgcc taaccccagt cccaccctct tctgcccgaa gtccctccct ccttcacggc 113401 ttcctaacct gctgtgactt tagaggtcaa ggctggcccg gcctggacct ggggaagccc 113461 tctgtggcgt tcctgcccca gaccaagtac aagttcctcc tggccccatg gcgaggtgtc 113521 gcacttcact cgtgtctctt ccccacccca atccttccct gacttcatgc tggggggctg 113581 gcaacccagg gtgcagcagg ggctggagtt cgaccaagaa ccggctgcag aaggccccgc 113641 catggggggt ccacgctgag cctcctctcc gcaggttggc gtgggagcca gcctagggct 113701 tcgggtcgcg gcagcgctga ccgccatgga caagcccctg ggtcgctgcg tgggccacgc 113761 cctggaggtg gaggaggcgc tgctctgcat ggacggcgca ggcccgccag acttaaggga 113821 cctggtcacc acgctcggtg agggggacgg ggtgtagggg agcggaggcg gcggggggtg 113881 cttcccgctg gggccgcccc gacccggccg cgcctaagac ccgtccccgc ccgcaggggg 113941 cgccctgctc tggctcagcg gacacgcggg gactcaggcc cagggcgctg cccgggtggc 114001 cgcggcgctg gacgacggct cggcccttgg ccgcttcgag cggatgctgg cggcgcaggg 114061 cgtggatccc ggtctggccc gagccctgtg ctcgggaagt cccgcagaac gccggcagct 114121 gctgcctcgc gcccgggagc aggaggagct gctggcgccc gcagatggtg agcgtcgggg 114181 gagtccccgt ccttccgcct ccgccatccc cttcccttcc cgaggccccg ccccttcccg 114241 agcccgcgcc tctcagcccc tctccccgca ggcaccgtgg agctggtccg ggcgctgccg 114301 ctggcgctgg tgctgcacga gctcggggcc gggcgcagcc gcgctgggga gccgctccgc 114361 ctgggggtgg gcgcagagct gctggtcgac gtgggtcaga ggctgcgccg tggtgagcgc 114421 cgcccccgcc ctgctggccc cgcacccccg cccagctccg gccgcgcggc ctctaacagc 114481 ccctcgctct gcagggaccc cctggctccg cgtgcaccgg gacaggcccg cgctcagcgg 114541 cccgcaaaac cgcgccctgc aggaggcgct cgtactctcc gaccgcgcgc cattcgccgc 114601 cccctcgccc ttcgcagagc tcgttctgcc gccgcagcaa taaagctcct ttgccgcgaa 114661 accttgtcag tgcttgggcg ggagcggaag gatccagggc tgcggaggcg ggggccgtct 114721 cgatgaacac gtgacccccg gcgggctccg ccttccgcgc acgcgctgag agcctgtcag 114781 cggctgcgcc cgtgtgcgca tgcgcagctc cggggacgcc tgcgccctgc ctgtgagcgt 114841 gtggcgcccg ctttccctga gccggcgggg cagagcgcag ggagctggag gtcggcgctt 114901 cctctcgtgc ttggtccact gacgcgcggc cccgccgcga ggtgcggacg ccggggctgg 114961 gaggggagga ggtagccctg aggactcgct ggactccggg gtagtttccc agctccggct 115021 actgcgcggg gctggcgggg cacaccccag ggcgcgctgg aggccggagc gaggctgggg 115081 cgcccgtggg aggctcccag caggcaccgg tgttctcgcg gccaagcaca gttataacgc 115141 gctcgcgcgg cgcttcgagt ggtcctggaa cctttctggc cacgagggcg ctggccttgc 115201 tggggagggc acaaacccag aaccgcccgg gcgggggtgc agtgagtcct cgggagggtg 115261 ccctcagcag gagggggcag tgaccgggag tcctgagacc tccacctagc aaaccttctg 115321 cgggggcccg tgggaaaggc tcaaaggtca ccaacgcaaa ggcagggcgt cggctgtgag 115381 cccggaggag ctgctgggaa gcctggatgt gaggagggtg gggttttgtg gcgggtggaa 115441 gtgtcgtgcg tctctgccag gagaggttaa acacagccgg cgggcagagt ctgagctccg 115501 ggggtaggtc gtgcaggttt tctgctggga gtgtggagga aggccgcggt tggttgaagt 115561 ggctggaggt aacaggaaag tgttggagga atcggttgct ctcggggatt gcaagccaga 115621 gagttaccca cctcctttta agaaatgggt ttattgcaaa tagataacgt ggttagttca 115681 ggcaaggcat gcacttggaa tgctttccgt cagcaagagg ttcaccttgc tgaggtgcag 115741 gtgcagggca gggtgcggtg acaggctggt gatcccaggt agaggacagg agtgacaggt 115801 gtggttgccc aggtgtggat gtttggtgga ggtggagttc tgagctcagg tgagcagctg 115861 caaatgcctg ttaagcctga acgtgggctg ggtccttcag atgggtggct ggtctcaagt 115921 cgcaggggca gcccaggcac tgtcctgggc cttcccttct ggctcctgac gcctgtgctt 115981 gtttccagga gcatcagatc catgctgctg ctgactcgga gccccacagc ttggcacagg 116041 ctctctcagc tcaagcctcg ggtcctccct gggaccctgg gaggccaggc cctgcatctg 116101 aggtcctggc ttttgtcaag gcagggccct gcagagacag gtgggcaggg ccagccccag 116161 ggccctgggc ttcgaacccg gctgctgatc acaggcctgt tcggggctgg actcggtggg 116221 gcctggctgg ccctgagggc tgagaaggag aggctgcagc agcaaaagcg aacagaagcc 116281 ctgcgccagg cagctgtggg ccagggcgac ttccacctgc tggatcacag aggccgggct 116341 cgctgcaagg ctgacttccg gggccagtgg gtgctgatgt actttggctt cactcactgc 116401 cctgacatct gcccagacga gctggagaag ctggtgcagg tggtgcggca gctggaagca 116461 gagcctggtt tgcctccagt gcagcctgtc ttcatcactg tggaccccga gcgggacgac 116521 gttgaagcca tggcccgcta cgtccaggac ttccacccaa gactgttggg tctgaccggc 116581 tccaccaaac aggttgccca ggctagtcac agttaccgcg tgtactacaa tgcaggcccc 116641 aaggatgagg accaggacta catcgtggac cactccattg ccatctacct gctcaaccct 116701 gacggcctct tcacggatta ctacggccgg agcagatcgg ctgagcagat ctcagacagt 116761 gtgcggcggc acatggcggc tttccgcagt gtcctgtctt gagccactgc agtctgggcc 116821 ccatcattaa acgggctgcg tttaatctgt gtgtgtgtgt gatctgtacc agcggcccag 116881 ggaggccagg cagccctctt ctctgcgcag gtgtacataa aagggttttt agggggaaga 116941 tgagatggca acactgcttt attaggccgg gccagccagg agcagacaca cggctcctca 117001 gtacacattc ccccacccct gcctcggtgc tccccactca gggctgggcc atggaggggg 117061 cagcgtaggt ctggaagcgc ttgtgcgctc gctggtgcgt gagcagtctc agggacatgg 117121 tgtccacggc catctccagc ccgggctgct gggttatctc cactgtgtag tcattggcct 117181 gcaggggtgg cacatcagca gggtctgggg accgtctccc cctcccacgt atcccaggct 117241 actcaccagc tgcagggagg ccagcatgga acgacacacc tcgaaggccg gctggccagc 117301 caccagctcc gcaaagggac accactcatt gagctggggg aaccgtgaga ccagctggtc 117361 cccataggtg tggatgtcaa agggcacatg ctgctcctgc acaggggagg ggcatgttgg 117421 ctgcggagag aggcaaggcc ctggacagac ccgtggcaca cagctctggt tcccagcggc 117481 cccgcctcac ctgctcctgg agcagaggct gcactgtgtc ctcccagtcc ctgatgcgct 117541 ggctcagctc tgtctcctgg acaaacttct gggaggtggc gatgaagagc tcctagggga 117601 ggatcagcag gagctactgc ctcccaagca ggggcctggg caccgctcac ctgtccccct 117661 cccctccccg tctccctcta acccaggcct accacattcc ttcgaaccag ctcctcgtag 117721 ctcagggaca tcggcactgc gtctgggaag gggtagctgt gagctgggca gctaaggggc 117781 cacgctctcc cctgccccca gctccagtgg cccctcagca ctcctagccc gctgcccacc 117841 taccaaggtc agcggcttcc ctggggtctg ctccctcggg ctccatgtac tcctcaggct 117901 ctagaaagtc atctgctggg aggtgggaga gtgaacagca cgcaggcagc tagtcggcag 117961 gtgccaagcc ccaccccacc ccctggcagg cacccacctg ctgcccccag gtcttccagg 118021 gaatcctcca ggtggtcctc ctctgcaggc cgcagccact gctcagccac ctgtccagag 118081 aaagcatctg tgaggggcac tgtcacctcc catcagggcc accacggggt cctagatcac 118141 agctgaccag ctgggacttg cctccctcct ctgcagcttc cggagagttt ccaactgctc 118201 cttcacgtgt gtccagtaca ggacctccat gtctgcagac aaaggccttg tgagggctcc 118261 ctcgtccttc ccaggccccg tgccagtcca cccaaggctt tgggtgacac caggtggggg 118321 aagaggtagg ggaaggagag taaaactgtc ttccccgagg acttcagcct cacctgcaaa 118381 ggacggaccc tttcgccgaa gccgcctgct gtcggcatgg tctgcatctg tgaaagaaca 118441 tgctcagcac cagccaggcc aagagctgcc cacagcattg ggaaaagggt gctgatcctc 118501 ctccactggc cctggcccag cccccagctc ccagtggtcc ggagtgcaca ccacacccac 118561 tcacaggcag ccaggtacca ctggtggaag tcctgcagct tggcagcgcc cttcctcttg 118621 cgcttctgtc ccagagcctc ctccacacag gggggcacag agtaaggcct acctgggggc 118681 aaacagggac tgtcttagag ccgctggggg atcagggtgc aaagcccact cccacccccc 118741 cacctcatgc cccaggaaac ccagccaggg gtacctgagt ggaggtacct tccacccaat 118801 tacctttctt gaagggctta gactccaagg agtcaaaggg gtccaggctc tgccaggggt 118861 ctggagtctc ctgggcacag agcacaagaa ggcgctggag aagtaggggg tgcacagccg 118921 tccctcccaa aggagtaggg gatgcacagc caccctccca aaggaagagt agggggtaca 118981 cagccgcgct cccaaaggag gagtaggggg tgcacagcca ccctcccaaa ggaggagtag 119041 ggggttcaca gccaccctcc caaaggagta gggggtacac agccaccctc ccagaggagt 119101 agggggtgca cagccgcctt cccaaagccc aggcaacaag tgcagtgaca gcctggggca 119161 gagccgacca cactcaaatc ctggatcaca gaggggaaca ctagagagag ggtgggagcc 119221 cagagtcgca tgagggggtc tttaagagag caagtgcaaa gggccacagg ctgggaacac 119281 cacaggccag caggagcaag agcatgatgg gcaggcagcc agtcctcgaa tccgcccatc 119341 cctccctgcc gggggtcagg gccccaacac tcctaccttc acgcaggatg caggctctgg 119401 ggccccctcc cgctcccgca gcatgtacct cctgggcagg gcagcactct gcaaagacag 119461 tcacagggtc cccacttctc ctctttctcc cctcaaggct ctgggtaccc caggaactcc 119521 accttgggtt aacatcaaga accctacaag ggtcgggcgt ggtggctcat gcctgtaatc 119581 ccagcacttt gggaggccaa ggcgggcaga tcacccgagg tcaagagttt gagaccagcc 119641 tggccaacat ggtgaaaccc ccgcctctac taaaaatgca aaaaaaatta gccgggtgtg 119701 gtggcgcaca cctgtaatcc cacctactcg ggaggccgag gcaggagaat cgcttgaacc 119761 caggaggcag aggttgcagt gagccgatat cacaccactg cactccagcc tgggcaaaag 119821 aacaagactc cgtccaaaaa aaaaaaaaaa acctacaggg accggacaca gtggctcacg 119881 cctataatcc cagtgctttg agaggccaag gcgggtagat cacttgagcc caggagttca 119941 agaccagcct gggcaatgca gtgagacccc gtctctacaa aaaatacaaa aattagccgg 120001 gtgtggtggt acacgcctgt agtcccagct actcgggagg ctgaggtgga aggatcacct 120061 gagcctagag agagtcaagg ctgcaatgag ccaagatcag gccactgcac tctagcctgg 120121 gcaacagagc aagaccctgt ctcaaaaata aataaataaa ataagataaa aataaaccct 120181 gctagtactc tctgggggta ttaaaacaaa aaccctaaga aaaactttaa aacacaaaca 120241 gtgggccggg cgtggtggct cacgcctgta atcccagtgc tttgggaggc caagacgggc 120301 agatcacgaa gtcaggagat tgagaccatc ctggccaaca tggtgaaacc ccgtctctac 120361 taaaaataca aaaaattagc caggcatggt ggcgggcgcc tgtagtccca gctacttggg 120421 aggctgaggg aggagaatgg cgtgaacccg ggaggcggag cttgcagtga gccaagatcg 120481 caccactgca ctccagcctg ggggacagag caaaactcca tctcaaaaaa caaacagaaa 120541 acacaaacag tggtcacggg gtactcttct tcaaaactag gaaaaatgaa tagaatatga 120601 caaatatcca tgccatctac caactagagt gaagaaacgt taacatttcc tctgtatgca 120661 tcaggttgat tttgttttaa tgaataaaaa ggaaggattt cttggctcta ggctgggtac 120721 agttgacccc gggggttgag ggaggcagca atggcctcca gacactccct cctaccagcc 120781 cgactgcaga gaggcgcctt ggcaaccggc ggccatccca ggttgccgtg agctcgacag 120841 ggctcaaatc cttggacgcc agcccactac tctgaggagc atcagttctg ctcagagagg 120901 ggaggaacac agaagctgct cttcaaaagg gctggttgca ggacgggctg caagacaggt 120961 gggcttctgc cccagaccag tggtctgggg ccaaggccat accactgtcc tccttgtgca 121021 aggtgactcg gccccagagc cagggttgcc agttcacagc tcaggttctg caggcctcca 121081 tgtgggtccc acctgctgcg ggctcctgga ctccttgggc tccagagcgg ccttgggggc 121141 cgaggcctca ggaagctcta ctgcctcctc tgcatcctcg tcctcgcccc cacccagggg 121201 catcgggcct tctggagagg ggcctaggca gggagcaaga gagtgatcag ctccaaggag 121261 tgccgagacc agacggcacc aaagggaccc tgatggctac ttcctccctc agctttagtc 121321 acggagccca gaggatttgg cgggtgcctt gcaccatcct cgtcgtgctc ctgtgctctg 121381 tgctctgctt ggttcatttt atttcttagc aaatgttcct ctttaaatgc agccatccac 121441 atctgcccag agaggagacc caggagggca gtcagagggc agtcagtgaa ccaagatggg 121501 ctgaggctca ccgggggctg tggctgcttc cccggggcga ggcctcactc ccatgggcca 121561 cagaaggggc atgaggctgg cagctgagag cagccccttg gagccctgtc cactgggtgc 121621 tgcctccctg gtcccctctt tggctgccct gccagtccca ccggggagct ctcttctcac 121681 ctggctcctg ggagaagccg agtgctggga cagggctcct gcacacggaa acttccattg 121741 gctgctcctc agtcctcccg gtgtctggga gagacaggga tcagacaagg tcagcccagc 121801 ccctctccaa gatggtccct gcccggtgtt caacaagcct ctccaagggg ccagcaggag 121861 ggcagagacc agcagactcc aggccttggg ctggggggag acagcagggc cctgagccag 121921 gcaccaggtc tctggcactg agaccatgtt aacttgtaat caaggcacag cagatgggtg 121981 ggatgagcac cagccaacct gtctagacta gagcaggcac ccagagacag ggagctccca 122041 agaccagagg tggcaagcac agggcacccc acaagagtcc aaccagaaga ccctcccttc 122101 cttcccacca agccccccgc atccaagccc tcacccttct gggtccctgg catgggggaa 122161 acgcccgctg gttccatggg ggacatgccc tctggctcca acatgaaggc ccctctgggg 122221 tggggaacgc acgtgttcat cctgaaatcc ttccggctgg ccaggacctc accctgacgg 122281 ctgtgctggc aaaaacacgc ctgctgtaga gacccttccc aggcaggcct ccctccccgt 122341 cgctgggctc agatccctac ctgtacaggg gattgttgtt cttctccatt tcatcagggg 122401 ccaccagggc catgggcagg agggggatga tgaggacctc ctgcatggag gccaagggtg 122461 ggtgagccag gcacccttcc actccagtgg agaacaggac ccacatgggg caatgacacc 122521 ttcccacacc ccatcagctc aacacagccc cagggaaagt gctgacctca ccccagccta 122581 agaggggcca ggccaggact cacactgggc gtctgatcat tcttgagatc cacgttagtc 122641 cgggagtcag ggaagtcatc cagcgacagg aactggaggg agacagagct cgtgggggcg 122701 gagcccaggc aaacacacag tgggcgggga ccacatgcca aagaaactca cctcattctc 122761 tgcctcctgg gggaccccgg agctggcaac cccattggcc ctgtcctcct gcaccgaaga 122821 gagctgcttg gcccgcctga gagagaagca catcccgtgg gcatcccagg agctgcaggg 122881 gcctccctgg ccacatgcag ggcagcacag tgagtggctg cagaactcac ctctttccag 122941 agatgaaatc aagggcctgg tagacgagtg agtagaggta ttccacctgg tgaccagatg 123001 caggcagtga ggacccagaa aggaggggcc cacccccaag gccttcccag ctatccatcg 123061 gtgcccccac gaggcctggc caccaccacc aaacctccag tacaagaaac caccaatttt 123121 ggttccaaaa ataattctca aatcaaacca ggaatctcat gagaattact ttaccagctg 123181 aggccaccca catctctcct ctgccctcta tggtccacgc agccgcgagc gtggccttct 123241 caacgtgtga ccctgacagc actgtggttc ccacagtgtg cctcatcaca cggtccttgg 123301 tggtgaggcc cctcgtgctt ggcacccact ctgccgacgg cctgcagccc tgctgctcct 123361 ccgcccgtgt gctcccgcca gcatcacccc agcccctcct cctgtcctgg ccctgccctg 123421 tcccagctcc tgccgccacg gggcctgcag gacagagccc gtcactcact tcctgtcctt 123481 ctcctccaca gaacagaggc cccaccgtga gggtggtggt tgtggacttt ctaggtgctg 123541 acaatgaggc acccacaagt aggggacggg gaaggggaag gtgggtgcag gtggcaccct 123601 gggctgtgtc caagtgcaca cagcaggtgt cattccccca agacaacaga gcccaacagc 123661 acaaaagacc ctccctgcct ggctgtgcga ttttaaatta aaaattgttc ttggtgttgg 123721 aaggtgctga ggaattggct gagacctgag gagagaggga ggtaggtcac agacgggacg 123781 accactgcac ctgccccaga aggagagccg actgtggtcc cactggccac caaaatgcca 123841 agaccagcgt caagcagggc ccaccttctt actgtagacg caggcagagc cctggatcaa 123901 caacgctgcc tcaatgaagt tcattgtggt cttgccttcg tcaaaagaaa tgcagatctg 123961 atccagctgc aagaaacaac acaaagacag ggggaatcca aggcccagtc tgcactcgct 124021 tcttccaggg cagccccacc catctttggt ctctgcccag cacctcccga gacgggcagc 124081 tctagctatg ctgtgggctc accggttgtc cctggtattt cggggcagag acaggtctct 124141 gtgtttcccc ctacttccta taacaaaaac ctgttttgga agtggaaggc ccccagggca 124201 cactgtctac aaacaaccag ggcagcacag cggctccctc cacaaggcgc cagaggctga 124261 cagactccag caggaagatc cccgacacag agaagaaaag ggaacctcag gggtaggaaa 124321 ggcagagata aaatgctgag gtctccatcc tctgagcaga gaactcagga aacagttaca 124381 ggtgagtcgc catccttcct ctgctcccct cagcctcccg gagccccagc tcctctgtca 124441 ccccttaaac gtcagtgcgc caggcgctcc accccccgcc atgcacattc cttcatccca 124501 tgcccaagct gccaaccata agccaatgag ccccaaaccc aacccgacca gctctctctc 124561 cttagctctg gggctgcatt catagctgtc cacagagtac attcaaatgt ccccaaacac 124621 tttacttcca tactttaccc aacatcttcc ccaacgctgg ctcatctcct gtattccttc 124681 tcagcaaatg acactttaaa aaaaaaaaaa aaaaaaaggc cgggtgcggt ggctcacacc 124741 tgtaatccca gcactttggg aggccgaggc gggcagatca tgaggtcagg agatcgagac 124801 catcctggct aacacggtga aacctcgtct ctactaaaaa tacaaaaaat tagctgggcg 124861 tggtggcggg cgcctgtagt cccagctact cgggaggctg aggcaggaga atggcatgaa 124921 cctgggaggc ggagcttgca gtgagccgag atcgcgccac tgcactccag cctgggcgac 124981 agagcgagac tccgtctcaa aaaaaaaatg tttttgtaga gacagggttt ctctatgttg 125041 cccaggttgg tctcaaactc ctggcctcca gtgatcctcc tgcctcagcc tcccaaagca 125101 ctgggatcac aggcatgcac caccgtgccc agccacaaac gacactttga agcccctagg 125161 tgtctttcca gctcttcaat tttgcatttc catctgcctg gaacaccgac ccccagctgt 125221 ccccacgcaa actctttttc agagcccctc cattcctggg gcctctgtgc cgctccctta 125281 gtgtgatcgt gtcccttagt gaactgtgtg agcagcatta gtgtaattcc agccttctgt 125341 tcagtgcctg acactggtgt ccagtaaata tttgatgaac acatgcatag ataatgtcaa 125401 agctaagcct gaaaaacgca gaggctcacc agacagatga agggaacgtg aaggcagcta 125461 gcagagactg gacacgtctc ttcaggaaaa tgagcatgat ctaaacatgg ccaaagcatg 125521 gcgcgtgctg gggagaggca agattttagg caccatggta gcacgaagga gtggcagagg 125581 ctgggaagcc atccccaagg ggagtcactt ccattagctc ttgaaggata aagggtgttt 125641 gtcagatctg ggaggaaggg agaggaaaca gcagcatggt cgtctacgga gtgtaaggac 125701 ctcgggggtg ttcaggacct gaagagctgg tccagaaaga agggaacatg gaggagatgg 125761 cggcctggaa gctgagctgc agaagtggcc cttctctcca gtgaccctca gcctggctgg 125821 cttgcccact cccaagacac tactgggctt cacttgtctt ctcacctccc caggttggac 125881 ctgccatctg agcccaaacc cacatgttca gctgccttct tcaaacatcg cttggatggc 125941 ttcaccaaga ttcatactga atgcatccac actgaactcc aaatagcccc caacccacct 126001 gcccagaaac ccaggggttc ctctccacag tcacccccat tctggcccca ccacctcccc 126061 tttcacctct gcactgacag ggctcctcct gcccttcact cccctcccgt ccactctcca 126121 cgctctgctc agtgcctctg tttaggattt tgggtttttc ctaacagtgg caggaaaaac 126181 ccaaaatcct aaacggagcc agagggaggc ctccatgctg gccagcctcc ctcgccagct 126241 gcctgccctc ctctgctcta tggtcccaac ttggaacctt ctgttttgtt ttgttttgtt 126301 ttgtttgaga cggagtctca ctctgtcacc caagctggaa tgcaggggcg caatctcagc 126361 tcactgcaac ctccgcctcc tgggttcaag taattctccc tgcctaagct tcctgagtag 126421 ctgggattat aggcgcccgg ctaatttttg taaaaactcc aatccttcca acaccaggtt 126481 tcctccacct ggagctaaac attctttagc tcaccataaa taagtaactg ctttcatctc 126541 aaacaaacta agccccccaa actctccaca ggaccataca ttttctttca aatcccttat 126601 aagtttctgg tgattttttc ggttggttgg ttggttttgg tttttgtttt tgagacagtc 126661 tctcgctctg ttgtccaggc tggagtgcag tggcacaatc tcggctcact gcaacccctg 126721 cctcccaggt tcaagtgatt ttcctgcctc agcctcccaa gtaactggga ctacaggcat 126781 gcaccaccat acctggctaa ttttttttgt acttttagca gagacaggtt ttcaccatgc 126841 tggtcagact catctcgaac tcctgacctc aaatgatcct cctgcctcgg cctcccaaag 126901 tgctagcatt acaggcgtaa gccacctcgg cctccctaag tgctgggatc acaggcgtga 126961 gccacctcgg cctccctaag tgctgggatc acaggcgtga gccacctcgg ccccccaaag 127021 tgctgggatc acaggcgtga gccacctcgg ctccccaaag tgctgggatc acaggcgtga 127081 gccaccttgg ccccccaaac tgctgggatc acaggcgtga gccaccttgg ccccccaaag 127141 tgctgggatc acaggcgtta gccacctcgc ctggccatca gtttctgact acacaaagtc 127201 acacgcaggg gtggaggctg gattctgatc tgccaggaac cactatcatt cagagtacat 127261 ctctaacgcc tctactaaca aggtttttgt gaggactctg cattcatatg tgtaaagcac 127321 ataaaacagg tgcttaacac aagtaaacac caaatgcata tttattacac ccccatgcgg 127381 ttactctctt ggacccgtct tcccatgaca gaggaagtcc taccatagtg gggattatca 127441 tttgctttgc ttggcactat tttttcaaag tttaacatgg ttcttggcac acaattcaca 127501 actaacaaca atgaataaat aaatggataa aagagagtag gatcatgaca gtactagaag 127561 aacattctgg ccagtgcaat ggctcacacc tataatccca gcactctggg aggccaaggc 127621 aggtggatca cttgagcttg ggagtttgag aacagcctgg gcaacacagt gacaccccat 127681 ctctaccaaa aatacaaaaa attagccagg tgcggtggca catgcctgca gtcccagcta 127741 cttgggagat gggaggatca cttgagcctg ggagatcgag gctgcagtga gccatgattg 127801 caccactgca ctctagcctg ggcaacaggg caagaaccta tctaaaaaaa aaaaacaaaa 127861 aaaaaaaaac aaaggacaaa aaaaagaaaa agaacatcat ggtagtgtgt ggaaaatgac 127921 agaaggaaaa agataatact agaaaccagg aaatcaggac ttcaagctgc agaggtttgg 127981 aagagcagac ataagaaagc aggaagggag ctgaccaagc caggaaggag atttaccaag 128041 tgaacaggtg tcctcatgga ccctgcttga tgtgaaaacc tgaatccaca ggagctctgc 128101 ccagcacagt tccccgtatt ttctcagcca actatgcatg agagcagaga agtttggtca 128161 tcagatcaca tgtgcattta acaaacactt taacgtccac acctatgtgc caggcactct 128221 actaagcaga caaaagaggg cataagtatg gtccctgccc tggcagaatt cacattctag 128281 ggtggagaca ggccacaaac aggtaaatga gttgaagaat ggcttgtggc cagtcgcagt 128341 ggctcacacc tgcaatccca gcactttcaa aggatgaggt gggcggatca cgaggtcaag 128401 agttcaagac cagcctggcc aacatggtga aaccccatct ctactaaaaa tacaaaaatt 128461 agctggggcc aggcacggtg gctcacacct gtaatcccag cattttggga ggccgaggcg 128521 ggtggatcac gaggtcagga gaccaagacc atcctggcta acatggtgaa accctgtctc 128581 tactgaaaaa tacaaaaaaa ttagccaggc gtggtggcgg gcgcctgtaa tcccagctac 128641 tgaggaggct gaagtaggag aatggcgcga acccaggaga tggagcttgc agtgagccga 128701 gattgtgcca ctgcactcca acttgggcca cagggcaaga ctctgtctca agaaaaaaaa 128761 aaaaattagc caggcgtggt ggcaggcgcc tgtaatcccg gctactcaag aggctgaggc 128821 aggatcggac gcggtggctc aggcctgtaa tcccagcact ttgggaggcc gaggcgggtg 128881 gatcatgagg tcaggagatc gagaccatcc tggctaacac ggtgaaaccc cgtctctact 128941 aaaaaaatac aaaaaattag ccgggcgtgg tggcgggtgc cagtagtccc agctactcgg 129001 gaggctgagg caggagaatg gtgtgaaccc aggaggtgga gcttgcagtg agccgagatt 129061 gcaccactgc actccagcct gggcgacaga gtgagactcc gtctcaaaaa aaaaaaaaaa 129121 aaaaaaagag gctgaggcag gagaactgct tgaacccaga aggtggaggc tgcagcaagc 129181 tgagattgca ccactgcact ccagcctggg tgacagagtg agactccatc tcaaaaaaaa 129241 aacatataaa taaataaaat tttttttggc tgggcgcggt ggctcatgcc tgtaatccca 129301 gcactttggg acaccgacac gggcggatca ccaggtcagg agatcgagac catcctggct 129361 aacacggtaa aaccccgtct ctactaaaaa tacaaaacat tagccgggtg tggtggcagg 129421 tgcctgtagt cccagctact cgggaagctg aggcaggaaa atggcgtgaa cccagaaggc 129481 agagcttgca gtgagctgag atcatgccac tgcactccag cctgggcgac agagtaagac 129541 tccatctcaa aaaaaaaaaa aatttaaatg gtttatgata aacattctaa agtgtgtaag 129601 gtgctgagat ggagaggact attttagata gggtggtcag ctgacattta cactaagacc 129661 ttaagaatga aaaggactca gtcagccagg cgtggtggct cacgtctgta atcccagcac 129721 tttggaaaga ggcagagctg gctggatcac ctgaggtcag gagtttgaga ccagcctggc 129781 caatgtggcg aaacccatct ctactaaaaa tacaaaaatt aactggacgt ggtggtgcat 129841 gcctgtaatc ccagctactc gggaggccaa ggcaggagaa ttgcttgaac ctgggagtcg 129901 gaggttgcag tgagccaaga ttctgccatt gtactccagc ccgggcaaca gagcaagact 129961 ccatctcaaa aaaagaaaag aaaacaaact cagtcattga aagaatattc ccagccaggt 130021 gtggtggtgc acagctatag tcccagctac ctgagaagct gaggcaggag gactgcttga 130081 gcacagcagt ttgagaccag cctgggcaac acagtaagat cctcctctct gatttttttt 130141 ttctcctttt ccttgccata tgaagtccct ctttatttaa aaacaaaaaa gaggccgggc 130201 gcggtggctc acgcctgtaa tcccagcact ttgggaggcc aaggcaggcg gatcacgagg 130261 tcaggagatc gagaccatcc tggctaacac ggtgaaaccc tgtctctact aaaatacaaa 130321 aaaattagtt gggcgtggag gcgggcgcct gtagtcccag ctactgcaga ggctgaggca 130381 gaatgatgtg aacccaggag gcggagcttg cagtgagctc agatcgcgcc actgcacttc 130441 cgtctcagag tgagactccg tctcaaaata tataaataaa taaataaaaa caaaaaagaa 130501 gattcccaag tgggggcaga aagcatgaag atcttccacc aagaaaaagc ttggaggggc 130561 cataggttga ctggttcaaa agaacaagat acaaaacagc ccaaaatgag gctaaaatgg 130621 agagcagagc caggttcaga gaaggaaatt tgggttttat tctgggcata atggaaagcc 130681 actgaggggt tgattttagg cagaaaacag tgctgtggcc agattttaat ttttaagagt 130741 cacacttgct gctggttgga gcacgaattg gacgggacaa aaacagggat aagggaaccc 130801 aaataggaag gttttacaca aacccaggcc agagacaatg gtggcctaga ccaaataaaa 130861 tggacaacag atgaattcaa gtgatattcc ggtggtataa ctgatggaat tcatggatga 130921 actgagtgtt aagcatgggg cttcactata gccacattta gcttttgctt ttggttgaac 130981 agcggtgtca ttttccgaga tgggaaaagc tggggtttaa gcagaaaaat gaagagttcc 131041 actttaaata tgttaaatga gatgtttgcg tgacacccac tagatgtcaa gagagcagct 131101 ggatatgtaa agtagatttg gaggaaagat ctaggatgga gctgcagatc tggtagttgt 131161 cctaatacag gcaatattta aaatcataga aataaatgac acttcagggt aaatgcatca 131221 gaagatgacc taggccccat cctgaggaac tcaacactta gagactggat agaaaagaaa 131281 aatcaggcaa tgaaataggg aaaaaaaatc ctaccaccat gtttcaaaaa gaaaggagtg 131341 gccagtccaa caaatgctga cttatgtccc ttggatttgg tagaatggag atcactgctg 131401 atcaagacaa gaaacattta agtgtcgtgg acgccagact gagctgggct gaggagtaaa 131461 aatgaagcgc aaatatggag cagacagctt aacacaggtc tagacagggc tgtcaaacat 131521 ggaagaaaca ctaatcgcta acaggataga gcatggcaat agcttgaaat tggtttgaaa 131581 atccaagata aaaaattcct ctgacatcgt gatccagacc tctcaaacac acaagcatgc 131641 tcgcaataac tgcaataaaa gcaataatgc tttctagtct attccactac gttagagaaa 131701 cggctggtca ccacccactc aatcattcac ggctcgctga cgggtagaaa cctgcaatga 131761 caggtccacg ctaatccaac aagggatgat ttggggaggc agagatggag tttgaaaaga 131821 agcagggagc ccgaggacct gcgcggccag agcgccagat agccataggc gtggagacgg 131881 tggggcgccc acagccccgg gcccggagcc cccgccccgc agccccaccc gccggcccac 131941 cccgcgtcac tcccccgccg cccttacctc ctccagatac tcgcccagct gggccgccac 132001 gtccacctcc cagttcttgg tgaggtcgcg gatgggctgc aagaggtggg cgaagcgcgc 132061 ctccacgtcc tccatgtccg ggagggaacg ggcggcaaag ggaccgcagg gctgccttcc 132121 gaggggtacg gcgtcctcag cccgccacta gttctggcgc cattttgctg ttcccgccca 132181 ggaaaatgcg tagcgcggcg tcgcgcgcat gtctatgacg taacgtgggc aaaagccaga 132241 gaggcgaacc ccggcgccgc gcagcgggag ctcaccggaa gagtcgcggg gcggggtcka 132301 acctccggga ggggggggtg ggcggarctt tgggctgcca ccagtttgga cgggagggct 132361 tttggggccg ccstgggtck ggcggggcgg ggtggggcgg ggcctggagc tgccgcgggc 132421 garcgggccg aggaggcggg gcttgggggc tgccacgggt tgggggggcg gggcggtgar 132481 gaagcggggc ctggggcagc cgcgggcggg cggacgggcc gaggaggcgg ggcttggggc 132541 tgccacgggt tgggcgggcg gggcggtgag gaggcggggc ctgggactgc cgcgggcggg 132601 cgggcgggcc gaggaggcgg ggcttggggc tgccacggat tgggcgggcg gggcggtgag 132661 gaggcggggc ctggggctgc cgcgggcggg cggacgcgga gggcgggccc tgctctagcg 132721 ggccgcgtag cggacatggc ggctcccgct cccgcggcag tyttcctcca gggcgtggcg 132781 gccgtcttca tgtttgcttt cgcttccctc tacacgcaaa tcccaggtga aggcacccag 132841 ggtgggacac ccccagcctc ggcgcgggac cctatgtcct cttttctctg tgcccggcac 132901 tcacgcagtc cggggttggg agtgagggcc aggccttggc gtggggtagg gggcggcggt 132961 tcgtgtcctg caggccgctg tgggggtagg aaggtgtgcg gtgatccatt tcccgtaagg 133021 ctctggactc tgacctgggg tgggggacag aactctgaga gaggtgggag agaggcttcc 133081 tcccagacct ggtgggacag gtaggagatc gggctgagac ctggaagagg gcagtgggct 133141 ctgctgggtg ggaggccctg actcaggtag ttgagcaggc ctctagccct gggacggggt 133201 tttgacaggt cccgacctgg gggacgggct ctggagggac tgcagactct gacctaggga 133261 tggaggtatc cgcatgctct gacctaggct gggggatctg caggccctga ctcagaggta 133321 gtctagtctg caagggctgt ccagggcctg gccatgacat gccctcccac aggcctgtat 133381 ggccccgagg gcatcctacc tgcaaggagg acgctgcggc ctcagggcaa ggggcgctgg 133441 cagcagctgt gggagacccc gacgctgctg tgggaagcgc cgagactggg gctggacacg 133501 gcccagggcc tggagctgct gagcctgctg ggtgcactag tggccctggg agccctgctg 133561 ctgagcccac tgcgccaccc tgtcatctac ttgctgcttt gggccgccta cctgtcagcc 133621 tgccaggcaa gtgggacctg tgaccagctc actctatctc caacccccac cacccacccc 133681 tgcctctacc ttacctgagt ccacagacac acccgctccc tgacagcccc tctctccgca 133741 ggtgggccag gtgttccttt atttccagtg gtgagtgacg gctgggcctg gggctgtggg 133801 aggctgggta gggggtgggg gagggctctg cccttcctgg gaggactttc cgagggaagc 133861 agctgtgtgg ggaggggctc ttggtgctat ttgctctccc catagggact ccctgctgct 133921 agagactggc ttcctggccg tgctggtggc cccgctgagg ccagcctccc accgcaagga 133981 ggccccccag ggcaggcagg caggggccct gccccacgaa gacctcccct tctggctggt 134041 gcgatggctg ctgttccgcc tcatgttcgc ctcaggcgtg gtcaagctga ccagccgctg 134101 ccctgcgtgg tgggggctca ctggtgaggg gcccggtctg gacctggggt agtctggggg 134161 ctcggccagc cctgactcct gccctcgcct gcagccctca cctaccacta cgagacccag 134221 tgcctgccca cgcccgccgc ctggttcgca caccacctgc cggtctggct gcacaagctc 134281 agcgtggtgg ccaccttcct aattgagatc gctgtgccgc ccctgttctt cgcccccatt 134341 cgacgcctgc gcttggctgc tttctactcg caggtgggtg agggccgcag ctgtcagaga 134401 gaggcagaca gggctcggcc ttcagcgctg ctcagcgtgc cctgttgggg acagcgtggc 134461 tcccggggac agctggcaac cgctgttggg aaggccttgt agcagaggga tgcaccctgc 134521 agggtggagg tgggtgcctg ccacggggcc aggcgtcagt ctccgggagc actgagagcg 134581 ggccgccccc aggtgctgct gcaggtcctg attatcatca ccggcaacta caacttcttc 134641 aacctgatga cgctggtgct taccactgcg ctgctggacg accagcacct ggctgctgag 134701 cctggccacg gcagccgcaa gaagacggcc acctgtgcgt gtcacttgtc cagccccctg 134761 ccctcccagg ggttctcccc tccaacaccc ccacaggact cccactctag ctgcacccca 134821 gccccctggc tggggaccct ctgctgaccc atgactgtcc tgccccacag cctggcccaa 134881 ggccctgctg gccaccctgt cgctgctgct ggaactagcc gtctacgggc ttctggccta 134941 tggcactgtg cactactttg gcctggaggt tgactggcag cagcgcacca tccactccag 135001 aaccagtgag tgccgacctg ggggggccag agggcagggg cggaccagtc ctcacacccc 135061 cctcctccac cagctttcac cttccaccag ttttctcagt ggctgaagac actgacgctg 135121 cccactgtgt ggctgggtgt ggcctccctg gtctgggagc tgctgagtgc cctgtggagg 135181 tacatggccc aggaagcagg gagggggcac aggtgtgcag ggccggcctg acgaccttgg 135241 gtccttgccc acaggtggac ccaggtgcgg ggctggctac ggaagctcag tgctgtagtc 135301 caactgtccc ttgtgggcac tgcgaccgtg gccttgttcc tgattagcct ggtgggtagc 135361 gctgcagggc tgggggatgg gctgggggca gggcagccct gagaacctcc cacctgtctc 135421 caggtgccgt actcctacgt ggagcccggg acccacgggc gcctctggac cggggcccac 135481 cgcctgtttg gtgccgtgga gcacctacag ctggccaact cctacggcct cttccgccgc 135541 atgactgggc ttggtggacg gcctgaggtg gtgctggagg gcagttacga cggccaccac 135601 tggacggtga gccctcccag ggcgggcagg gtggtggggc ccaggtcagc gggcccagct 135661 gacctcggcc tgcctggcag gagatcgagt tcatgtacaa gcctgggaac ctgagccggc 135721 cgcccccggt tgtggtgccc caccagccac gcctggactg gcagatgtgg tttgcagccc 135781 tgggcccaca cacgcacagc ccgtggttca caagcctggt cttgcgcctg ctgcagggca 135841 aggagccagg tgcggagcgg gcttgggggg tgtggagtgg gctggggtgg agcggggtgg 135901 ggaggggagg ggcagggcgg gtactgtggg ggccacgctg atgcctctcg ctgcagtgat 135961 ccgccttgtc cagagccaag tggccaggta tcccttccac aagcagccgc ccacctacgt 136021 ccgagcccag cgctacaagt actggttctc ccagcctggg gagcaggggt aaggctgggc 136081 accgggtggg gtgtggagcc cggggtgcag gggtaaggcc gggcaccggg ggtgtggagc 136141 ccggggtgca ggggtaaggc cgggcaccgg gggggggtgt ggagcccggg gtgcaggggt 136201 aaggccgggc actggggggg gtgtggagcc cggggtgcag gggtaaggcc gggcaccggg 136261 gtggggtgtg gagcctgggg tgcaggggta aggccgggca ctgggggggg tgtggagccc 136321 ggggngcagg gntaaggccg ggcaccgggt ggggtgtgga gcccggggtg caggggtaag 136381 gccgggcacc agggggggat gtggagcgtg aggagctgag gagcaggggt aaggctgggc 136441 actggcgatg gggtatggag cccggggtgc ctccagcttc tgaacatcgg gtyctgctgc 136501 agccagtggt ggcggcgcca gtgggtggag gagttcttcc catccgtgtc cctgggggac 136561 cccacgctgg agacgctgct caggcagttt ggactacagg taagggggtg tcagccaggc 136621 tgggggaggt ggcaggggtg accccagcct cccggggtaa acccagtaag cacttcccca 136681 acctccctcc ctccctccct cccacaggag aaaagcccac ctcgcacccg cagcgccaac 136741 agcaccctgg cccaggccct ccactggact cgctctcagc tgtctcccct ggaggccccc 136801 gccctgctct gggggctcct catggccgtg ggggctgtca gatttgtgca agccctgcta 136861 gcaccctgtt ctctccggtc ctccccgctg gcaccagtca gcggggagaa gcgcaggcca 136921 gcctcccaga aagactccgg agctgcctcc gaacaggcca ccgcagcccc caacccctgc 136981 tccagtagtt cgaggaccac ccggcgaaag aagtagctgt gttctcccag ctgcacgtcc 137041 tgagagggcc aggtcgccgg gagtgctctg gcctccggca ggacaggacc cagccactgt 137101 gccttagctg accctgcagg gccaggcaca ggttgggggg ctgcccctgg ggtttgcagg 137161 gtgctgcatt gagggctcca ggccccaccc ccacgccagc catgcccctc cccaggactc 137221 ccactattgc ctctgtgatt ggcccaggag gaaaacacga ccaagctcaa gacccttccc 137281 ctgccctggg ctgtgggggt ctgagtctag agcccccaac cctaggcccc gtgccagagg 137341 ggaagaggct gactcccagg ggaagagggg aagcactgtc atcttccacg tcatcttcac 137401 accagcccat cctgcccttt agatctgggc accaataaag gcgtcttttg tgcttggctg 137461 tgtgcctggt gctgctttcc caggaggcat gaggatggga ttgtctcccc tgccccaggc 137521 agtgggccgc ccagagcaca gtcaggctcc gccacggcgt ggctccctgg gcaccctcct 137581 gccccgtcct cagcattctc tctgaagagg cctggcttgg tgttgcccac tgcctgatca 137641 cagaggagcc tcaggcctgg cacagcccgg tggacctgtc agctccacac tccaaggtcg 137701 gtggtggagc cggtccctca cttccctcct gctcccactt cttggcctca ggaaaggacg 137761 gaagggccag agtgggcaag gggagaagga gagatgaggt gggcaggtta gaagaggctc 137821 ctctgcagag tgaggccagg gggagggtag ctggaggccc tgggttggga cggaggaggg 137881 aaggtgcatc tgatgcccct ggcctccatc ctgggaagag aacagcagca gagacagcag 137941 tcagggcacc gtccaggttt ccatttgggg tcaggttatc aatcagaggt cactgcccac 138001 cctggaaact cacagcacag cttcctctgc ctctggcccg tcctcttggt acagcccctg 138061 cctcctggtc tcctgccaga cccctgggca gctcccaatg tacccagctc cctcatgtca 138121 ctgacctcca tctggcccac ccaggcagag ccctgtgctc aaacacagcc tgtggcgtga 138181 actctcagat gtcttcaagg ggctcctata catggaataa acaatcgaga agtcagcctt 138241 ggcttaataa taggctgggc gctgtggctc atggctgtaa tcccagcact ttgagtggcc 138301 gaggtgggtg gatcacctga ggtcaggagt tcgagaccag cctggccaac atggtgaaac 138361 ccaatctcta ctaaaaatac aaaaattagc tgggcatggt gtcgggtgcc tgtaatgcca 138421 gctactccac aggctgaggc aggaaaatcg cttgaacctg ggaggcagag gttgcaggga 138481 gccgagatca tgccactgca cttcagccta gacaacagag tgagactcca tctcaaaaaa 138541 aaaaagtaat atgtttaaaa ctaaaaaacc tgttcctagg cttcttggct ttctagacag 138601 ggtctcactc tgttgcccag gctggagcac agtggtgcag ttgtggctca ctgcagtctc 138661 agactcctga gctcaaatga ttctcctgcc tcaggctgcc atcaccaccc acactcccct 138721 tgttgcccag gctgatgttc ctgagttttt ttttttttga gacggagtct cgctctgtca 138781 cccagtctga agtgcagtgg caggatctcg gctcactgca agctccactt cccgggttcg 138841 cgccattctc ctgcctcagc ctctctgagt agctgggact acaggcgccc gccaccacac 138901 ctggctaatt ttttgtattt tttttttagt agagacgggg tttcaccatg ttagccagga 138961 tggtctcgat ctcctgacct cgtgatccgc ctgcctcggc ctcccaaagt gctgggatta 139021 caggtgtgag ccaccgcgcc cggcctgatg ttcctaagct ttcaagctca cttgatgaat 139081 gcgttctgtg ttttgttttg tgtagcagct ctcttggtaa gccacgtgct acacaattca 139141 cccgttcaag tgcgcaattc agtggctttc agtttattca cagagttgca cgtctatcac 139201 cagagtccat ttatttactt atttagagat ggagtctcac tctgtcaccc aggctggagt 139261 gcagtggcac aatctcggct cactgcaacc tccacctcct gggttcaagc gattctcctg 139321 cctcaacctt gcaagtagct gggattacag gcgtgcacca ccacaccggg ctaatttttg 139381 tagtttatta gtagagatgg gttttcacta tgttggccag gctggtctcg aactcctgac 139441 ctcaggagat ctgcccactt cggcctccca aagtgctggg attataggca tgagccacgg 139501 cgcctggccc ggaggccatt ttagaacctt tcatctccag aagaaacccc aaacccttta 139561 gcctttgccc tctgacccct ccctgccaag ccctaagcaa ctattcatct accttctgtc 139621 tctgtggatt tacctactct ggatatttcc tgtaaattgg atcagtaaca tgggaccttt 139681 gtgtctggct tctttcactc agcacggtgt tttcaaggcc acacccacgt catagcacgt 139741 atgagcgctg tgttccttta gggctgagtc agtccagtgt acggaggggc cacgttttcc 139801 ctgtccctat tcattcgtat atcgatagac gcttgcgtgg cttccgcctt tcctatgtga 139861 atagtgctgc tgtgaacatt ggatatg // LOCUS HUM12LIPOX 2348 bp mRNA PRI 30-OCT-1994 DEFINITION Human arachidonate 12-lipoxygenase mRNA, complete cds. ACCESSION M62982 NID g177106 KEYWORDS 12-lipoxygenase; arachidonate 12-lipoxygenase. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2348) AUTHORS Yoshimoto,T., Yamamoto,Y., Arakawa,T., Suzuki,H., Yamamoto,S., Yokoyama,C., Tanabe,T. and Toh,H. TITLE Molecular cloning and expression of human arachidonate 12-lipoxygenase JOURNAL Biochem. Biophys. Res. Commun. 172 (3), 1230-1235 (1990) MEDLINE 91058562 FEATURES Location/Qualifiers source 1..2348 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Unassigned" gene 54..2045 /gene="ALOX12" CDS 54..2045 /gene="ALOX12" /codon_start=1 /db_xref="GDB:G00-127-547" /product="arachidonate 12-lipoxygenase" /db_xref="PID:g177107" /translation="MGRYRIRVATGAWLFSGSYNRVQLWLVGTRGEAELELQLRPARG EEEEFDHDVAEDLGLLQFVRLRKHHWLVDDAWFCDRITVQGPGACAEVAFPCYRWVQG EDILSLPEGTARLPGDNALDMFQKHREKELKDRQQIYCWATWKEGLPLTIAADRKDDL PPNMRFHEEKRLDFEWTLKAGALEMALKPCLHLLSSWNCLEDFDQIFWGQKSALAEKV RQCWQDDELFSYQFLNGANPMLLRRSTSLPSRLVLPSGMEELQAQLEKELQNGSLFEA DFILLDGIPANVIRGEKQYLAAPLVMLKMEPNGKLQPMVIQIQPPSPSSPTPTLFLPS DPPLAWLLAKCWVRNSDFQLHEIQYHLLNTHLVAEVIAVATMRCLPGLHPIFKFLIPH IRYTMEINTRARTQLISDGGIFDKAVSTGGGGHVQLLRRAAAQLTYCSLCPPDDLADR GLLGLPGALYAHDALRLWEIIARYVEGIVHLFYQRDDIVKGDPELQAWCREITEVGLC QAQDRGFPVSFQSQSQLCHFLTMCVFTCTAQHAAINQGQLDWYAWVPNAPCTMRMPPP TTKEDVTMATVMGSLPDVRQACLQMAISWHLSRRQPDMVPLGHHKEKYFSGPKPKAVL NQFRTDLEKLEKEITARNEQLDWPYEYLKPSCIENSVTI" BASE COUNT 515 a 696 c 641 g 496 t ORIGIN 1 cccgggaatc gcacaggacc cggctcccct cgcctaagct gctggggggc gccatgggcc 61 gctaccgcat ccgcgtggcc accggggcct ggctcttctc cgggtcgtac aaccgcgtgc 121 agctttggct ggtcgggacg cgcggggagg cggagctgga gctgcagctg cggccggcgc 181 ggggcgagga ggaggagttt gatcatgacg ttgcagagga cttggggctc ctgcagttcg 241 tgaggctgcg caagcaccac tggctggtgg acgacgcgtg gttctgcgac cgcatcacgg 301 tgcagggccc tggagcctgc gcggaggtgg ccttcccgtg ctaccgctgg gtgcagggcg 361 aggacatcct gagcctgccc gagggcaccg cccgcctgcc aggagacaat gctttggaca 421 tgttccagaa gcatcgagag aaggaactga aagacagaca gcagatctac tgctgggcca 481 cctggaagga agggttaccc ctgaccatcg ctgcagaccg taaggatgat ctacctccaa 541 atatgagatt ccatgaggag aagaggctgg actttgaatg gacactgaag gcaggggctc 601 tggagatggc cctcaaaccg tgtttacacc tcctgagctc ctggaactgc ctagaagact 661 ttgatcagat cttctggggc cagaagagtg ccctggctga gaaggttcgc cagtgctggc 721 aggatgatga gttgttcagc taccagttcc tcaatggtgc caaccccatg ctgttgagac 781 gctcgacctc tctgccctcc aggctagtgc tgccctcggg gatggaagag cttcaggctc 841 aactggagaa agaacttcag aatggttccc tgtttgaagc tgacttcatc cttctggatg 901 gaattccagc caacgtgatc cgaggagaga agcaatacct ggctgccccc ctcgttatgc 961 tgaagatgga gcccaatggg aagctgcagc ccatggtcat ccagattcag cctcccagcc 1021 ccagctctcc aaccccaaca ctgttcctgc cctcagaccc cccacttgcc tggctcctgg 1081 ccaaatgctg ggtccgaaat tcagatttcc aactgcacga gatccagtat cacttgctga 1141 acactcacct ggtggctgag gtcatcgctg tcgccaccat gcggtgcctc ccaggactgc 1201 accccatctt caagttcctg atcccccata tccgctacac catggaaatc aacacccggg 1261 cccggaccca actcatctca gatggaggaa tttttgataa ggcagtgagc acaggtggag 1321 ggggccatgt acagttgctc cgtcgggcgg cagctcagct gacctactgc tccctctgtc 1381 ctcctgacga cctggctgac cggggcctgc tgggactccc aggtgctctc tatgcccatg 1441 atgctttacg gctctgggag atcattgcca ggtatgtgga ggggatcgtc cacctcttct 1501 accaaaggga tgacatagtg aagggggacc ctgagctgca ggcctggtgt cgggagatca 1561 cggaggtggg gctgtgccag gcccaggacc gaggtttccc tgtctccttc cagtcccaga 1621 gtcaactctg ccatttcctc accatgtgcg tcttcacgtg cactgcccag catgccgcca 1681 tcaaccaggg ccagctggac tggtatgcct gggtccctaa tgctccatgc acaatgcgga 1741 tgcccccacc caccaccaag gaagatgtga cgatggccac agtgatgggg tcactacctg 1801 atgtccggca ggcctgtctt caaatggcca tctcatggca tctgagtcgc cgccagccag 1861 acatggtgcc tctggggcac cacaaagaaa aatatttctc aggccccaag cccaaagctg 1921 tgctaaacca attccgaaca gatttggaaa agctagaaaa ggagattaca gcccggaatg 1981 agcaacttga ctggccctat gaatatctga agcccagctg catagagaac agtgtcacca 2041 tctgagccct agagtgactc tacctgcaag atttcacatc agctttagga ctgacatttc 2101 tatcttgaat ttcatgcttt cctaaagtct ctgctgctaa ggctctattt cctcccccag 2161 ttaaaccccc tacattagta tcccactagc ccaggggagc agtaaacttt ctctgcaaag 2221 actagatcct tttttacgct ttgcagaccg catagtcact gtctcaacta ctcagctctc 2281 ctgctgcagc atgaaggcag ccacagacaa catggaaatg agtgtgacta tgttccaata 2341 aaacttta // LOCUS HUM130LEU 4782 bp mRNA PRI 26-MAY-1995 DEFINITION Human leucine-rich protein mRNA, complete cds. ACCESSION M92439 NID g177109 KEYWORDS glycoprotein 130; leucine-rich protein. SOURCE human cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4782) AUTHORS Hou,J., Wang,F., McKeehan,W.L., Hou,J., Wang,F. and McKeehan,W.L. TITLE Molecular cloning and expression of the gene for a major leucine-rich protein from human hepatoblastoma cells (HepG2) JOURNAL In Vitro Cell. Dev. Biol. Anim. 30 (2), 111-114 (1994) MEDLINE 94282390 REFERENCE 2 (bases 1 to 4782) AUTHORS McKeehan,W.L. TITLE Direct Submission JOURNAL Submitted (07-MAY-1992) Wallace L. McKeehan, Albert B. Alkek Institute of Biosciences and Technology, Texas A&M University, 2121 Holcombe Blvd., Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..4782 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="terminal hepatoma" CDS 46..3867 /standard_name="GP130" /note="130kD" /codon_start=1 /evidence=not_experimental /product="leucine-rich protein" /db_xref="PID:g801893" /translation="MPCFYLRSCGSLLPELKLEERTEFAHRIWDTLQKLGAVYDVSHY NALLKVYLQNEYKFSPTDFLAKMEEANIQPNRVTYQRLIASYCNVGDIEGASKILGFM KTKDLPVTEAVFSALVTGHARAGDMENAENILTVMRDAGIEPGPDTYLALLNAYAEKG DIDHVKQTLEKVEKFELHLMDRDLLQIIFSFSKAGYLSMSQKFWKKFTCERRYIPDAM NLILLLVTEKLEDVALQILLACPVSKEDGPSVFGSFFLQHCVTMNTPVEKLTDYCKKL KEVQMHSFPLQFTLHCALLANKTDLAKALMKAVKEEGFPIRPHYFWPLLVGRRKEKNV QGIIEILKGMQELGVHPDQETYTDYVIPCFDSVNSARAILQENGCLSDSDMFSQAGLR SEAANGNLDFVLSFLKSNTLPISLQSIRSSLLLGFRRSMNINVWSEITELLYKDGRYC QEPRGPTEAVGNFLYNLIDSMSDSEVQAKEEHLRQYFHQLEKMNVKIPENIYRGIRNL LESYHVPELIKDAHLLVERKNLDFQKTVQLTSSELESTLETLKAENQPIRDVLKQLIL VLCSEENMQKALELKAKYESDMVTGGYAALINLCCRHDKVEDALNLKEEFDRLDSSAV LDTGNYLGLVRVLAKHGKLQDAIKILKEMKEKDVLIKDTTALSFFHMLNGAALRGEIE TVKQLHEAIVTLGLAEPSTNISFPLVTVHLEKGDLSTALEVAIDCYEKYKVLPRIHDV LCKLVEKGETDLIQKAMDFVSQEQGEMVMLYDLFFAFLQTGNYKEAKKIIETPGIRAR SARLQWFCDRCVANNQVETLEKLVELTQKLFECDRDQMYYNLLKLYKINGDWQRADAV WNKIQEENVIPREKTLRLLAEILREGNQEVPFDVPELWYEDEKHSLNSSSASTTEPDF QKDILIACRLNQKKGAYDIFLNAKEQNIVFNAETYSNLIKLLMSEDYFTQAMEVKAFA ETHIKGFTLNDAANSRLIITQVRRDYLKEAVTTLKTVLDQQQTPSRLAVTRVIQALAM KGDVENIEVVQKMLNGLEDSIGLSKMVFINNIALAQIKNNNIDAAIENIENMLTSENK VIEPQYFGLAYLFRKVIEEQLEPAVEKISIMAERLANQFAIYKPVTDFFLQLVDAGKV DDARALLQRCGAIAEQTPILLLFLLRNSRKQGKASTVKSVLELIPELNEKEEAYNSLM KSYVSEKDVTSAKALYEHLTAKNTKLDDLFLKRYASLLKYAGEPVPFIEPPESFEFYA QQLRKLRENSS" CDS 244..3867 /standard_name="GP130" /note="130kD" /codon_start=1 /evidence=not_experimental /product="leucine-rich protein" /db_xref="PID:g177110" /translation="MEEANIQPNRVTYQRLIASYCNVGDIEGASKILGFMKTKDLPVT EAVFSALVTGHARAGDMENAENILTVMRDAGIEPGPDTYLALLNAYAEKGDIDHVKQT LEKVEKFELHLMDRDLLQIIFSFSKAGYLSMSQKFWKKFTCERRYIPDAMNLILLLVT EKLEDVALQILLACPVSKEDGPSVFGSFFLQHCVTMNTPVEKLTDYCKKLKEVQMHSF PLQFTLHCALLANKTDLAKALMKAVKEEGFPIRPHYFWPLLVGRRKEKNVQGIIEILK GMQELGVHPDQETYTDYVIPCFDSVNSARAILQENGCLSDSDMFSQAGLRSEAANGNL DFVLSFLKSNTLPISLQSIRSSLLLGFRRSMNINVWSEITELLYKDGRYCQEPRGPTE AVGNFLYNLIDSMSDSEVQAKEEHLRQYFHQLEKMNVKIPENIYRGIRNLLESYHVPE LIKDAHLLVERKNLDFQKTVQLTSSELESTLETLKAENQPIRDVLKQLILVLCSEENM QKALELKAKYESDMVTGGYAALINLCCRHDKVEDALNLKEEFDRLDSSAVLDTGNYLG LVRVLAKHGKLQDAIKILKEMKEKDVLIKDTTALSFFHMLNGAALRGEIETVKQLHEA IVTLGLAEPSTNISFPLVTVHLEKGDLSTALEVAIDCYEKYKVLPRIHDVLCKLVEKG ETDLIQKAMDFVSQEQGEMVMLYDLFFAFLQTGNYKEAKKIIETPGIRARSARLQWFC DRCVANNQVETLEKLVELTQKLFECDRDQMYYNLLKLYKINGDWQRADAVWNKIQEEN VIPREKTLRLLAEILREGNQEVPFDVPELWYEDEKHSLNSSSASTTEPDFQKDILIAC RLNQKKGAYDIFLNAKEQNIVFNAETYSNLIKLLMSEDYFTQAMEVKAFAETHIKGFT LNDAANSRLIITQVRRDYLKEAVTTLKTVLDQQQTPSRLAVTRVIQALAMKGDVENIE VVQKMLNGLEDSIGLSKMVFINNIALAQIKNNNIDAAIENIENMLTSENKVIEPQYFG LAYLFRKVIEEQLEPAVEKISIMAERLANQFAIYKPVTDFFLQLVDAGKVDDARALLQ RCGAIAEQTPILLLFLLRNSRKQGKASTVKSVLELIPELNEKEEAYNSLMKSYVSEKD VTSAKALYEHLTAKNTKLDDLFLKRYASLLKYAGEPVPFIEPPESFEFYAQQLRKLRE NSS" BASE COUNT 1531 a 856 c 1018 g 1377 t ORIGIN 1 aagtttttaa tgatacctgc cgctcaggtg gcctaggtgg tagtcatgcc ttgcttctac 61 ttacgtagtt gtggttctct cttgcctgaa ctaaagcttg aagagagaac agaatttgct 121 cataggatat gggacacact tcagaaatta ggtgctgtgt atgatgtgag tcactataat 181 gctttactta aagtctatct tcaaaatgaa tataaattct caccaactga tttcctggca 241 aaaatggagg aagcaaacat tcaaccaaat cgagtgacat accagagatt gattgcttct 301 tattgtaatg taggagatat tgaaggtgcc agcaagattc ttggatttat gaaaactaag 361 gatctcccag ttacagaggc agtattcagt gcccttgtga cagggcatgc cagagctggt 421 gatatggaga atgcagaaaa cattctcaca gtgatgagag atgccggaat tgagcctggt 481 ccagacacat acctcgcatt attgaatgca tatgctgaga agggcgacat tgaccatgtt 541 aagcagactc tggagaaggt ggagaagttc gagcttcacc ttatggaccg tgatttactg 601 caaattattt ttagcttcag taaagctggg tatctcagta tgtctcagaa attttggaaa 661 aagtttacat gtgaaagaag atatattcca gatgcaatga acctcatttt acttttagtc 721 actgaaaaat tggaagatgt agcgttgcaa attttactag catgccccgt atcaaaggaa 781 gatggcccaa gtgtctttgg cagtttcttt ttacaacact gtgtgactat gaatacgcct 841 gtggagaagc taacagacta ctgtaagaag ttaaaggaag tccagatgca ctcctttcct 901 ctgcagttca ccctccattg tgctttactc gccaataaaa ctgatttggc aaaagcctta 961 atgaaggctg tgaaggagga aggttttcct atcagacctc actatttctg gccattgcta 1021 gttggacgtc ggaaggaaaa aaatgttcaa ggtataattg aaatcctcaa aggaatgcaa 1081 gaattgggag tacatcctga tcaggaaaca tatacagatt atgtgattcc atgctttgat 1141 agtgtaaact cagcacgagc cattttgcag gaaaatggat gtctgtctga tagtgatatg 1201 ttttctcaag ctggattgag aagtgaagca gcaaatggga acttagactt tgtattatca 1261 tttttgaaat caaatacatt gcccatctcg ctgcagtcta taagaagtag cctactgcta 1321 ggcttcagga ggtctatgaa tataaatgtt tggagcgaga taacagaatt attgtacaag 1381 gatggacgtt attgccagga gcctcgagga ccgacggaag ctgttggcaa ttttctttat 1441 aacttgattg acagcatgag tgactcagag gtacaggcca aggaggagca tttgagacaa 1501 tacttccatc agctggagaa gatgaatgta aaaattcctg aaaatatcta cagaggcatt 1561 cgtaatctcc tggaaagcta ccatgttcct gaattgatta aggatgctca cttgttggtt 1621 gagcgtaaga atttagactt tcaaaaaact gtgcaactta catcatctga attggagtca 1681 acacttgaaa cactaaaagc tgaaaatcaa cctataagag atgtcctaaa gcaactcata 1741 ttagtgcttt gttcagaaga gaatatgcaa aaagcccttg aattgaaagc aaaatatgaa 1801 tccgacatgg ttactggtgg ctatgcagct ttaataaatt tatgctgtcg acatgataaa 1861 gtagaagatg ccttgaactt gaaagaagaa tttgaccgct tagattcatc tgctgtcctt 1921 gacaccggca actatctagg ccttgtaaga gtattggcaa agcatggcaa gctccaagat 1981 gctattaaga ttctgaagga gatgaaagag aaggatgttc ttatcaaaga tacaacagcc 2041 ttgtcctttt tccacatgct aaatggcgca gctttaagag gtgaaattga aacagtaaaa 2101 cagttgcatg aagccatcgt gactctaggg ttagcagaac catccaccaa cataagtttc 2161 ccattggtca ctgtacactt ggaaaagggc gacctatcta ctgctcttga ggtcgccatt 2221 gactgctatg aaaagtataa agtattacca aggattcatg atgtcttgtg taaactggta 2281 gagaaaggcg agactgatct aattcagaaa gcaatggact ttgtgagcca agaacaaggt 2341 gaaatggtga tgctctatga tctcttcttt gccttcctac aaacaggaaa ttacaaagag 2401 gccaagaaga tcattgagac tccagggatt agagctcgat ctgcaaggct tcagtggttt 2461 tgtgacagat gtgttgcaaa taatcaggtt gaaactctgg aaaaattagt ggagctgaca 2521 cagaagctat ttgaatgtga tagagaccag atgtactaca atctgctaaa actgtataaa 2581 ataaacggtg actggcaaag agctgatgca gtctggaata aaatccaaga agaaaatgtt 2641 attcctcgtg aaaagacatt aagattatta gcagaaatcc ttagagaggg taaccaggaa 2701 gttccgtttg acgtacctga gttgtggtat gaagatgaaa aacattccct gaattcttcg 2761 tcagcctcaa ccacagaacc tgatttccag aaagatatat tgattgcctg ccgattgaac 2821 caaaaaaaag gggcatatga tattttcctg aatgcaaaag agcaaaacat tgtgtttaat 2881 gctgaaacct acagcaatct cattaaatta ctgatgtcag aagattattt tacacaagca 2941 atggaagtga aagcattcgc ggagacccac atcaagggct tcacactgaa cgatgctgcc 3001 aacagccgcc tcatcataac gcaagttagg cgggattatt tgaaagaggc tgtgacaaca 3061 ctgaaaacag tattggatca gcagcagacc ccttctaggt tagcagtgac ccgtgtcatc 3121 caggcattgg ccatgaaggg tgatgttgaa aacatagaag tagttcagaa gatgttaaat 3181 ggactcgaag actccattgg actttcaaaa atggttttca tcaataacat tgctttggct 3241 caaataaaga ataataacat agatgccgca atagaaaaca ttgaaaatat gcttacttca 3301 gagaataaag tcattgaacc ccaatacttc ggcttggcat acttattcag aaaagtaata 3361 gaggagcagt tggaaccagc agttgaaaag ataagcatca tggcggagag attggccaat 3421 cagtttgcaa tttataaacc tgtcactgat tttttccttc aacttgtgga tgcaggcaag 3481 gtggatgatg ccagagctct cctacagaga tgtggtgcaa ttgctgaaca aaccccgatt 3541 ttgttgttgt tcctccttag gaattctagg aaacaaggaa aggcatcaac tgtgaaatct 3601 gtgttagaat tgattcctga attaaatgaa aaggaagaag catacaattc cctcatgaaa 3661 agctatgtct cagagaaaga tgtcacatct gctaaagcac tgtatgaaca tttgactgca 3721 aagaatacaa aattggatga tctgtttcta aagcgttacg catctttgct gaagtatgct 3781 ggagagcctg tccctttcat tgaaccccct gaaagctttg aattttatgc acagcagcta 3841 agaaaattga gggaaaactc ttcttgaaat aaccaggcga tactttgttt tgtatatatt 3901 tgtgattctg tgtctacatg ttattttgaa gtatatctga gggaaaaata aatgaaaatt 3961 ttctttatgt acttatgtat gtgtgatgca tgttcaaagt cttattgacc ataactctgt 4021 gcacttggtt attggacatt tttggagttt tttctctggg aaaaatcgat agtgttttct 4081 tcaatgctgc tgctgtgtga agccatactt ttcaggattc ttccctaatt ggctctttgg 4141 tttccctgct ctgtttcatt tatttcatta aaatgttatt cctttattta agattcactt 4201 attagtctgc tgtttctctg aaaaatttta gagctaggta tagtgaccgt gaacttctaa 4261 cgcataatat ctgtgataca gccattccgt acatgtgtga gtctgcataa ctttcgaact 4321 ttcgaacttt gttaaatgtt ggcactagga gtcatcagat ctaggattca tcattttcca 4381 gtgagaagca gagacccaaa ggcctgttac ttgtgcttgg tcaggggact gtctgtcatg 4441 cctggaggct cttcggcaca cttccccatc tttcccttct gccactgtgg cttcaagcac 4501 ctctgttcat agagcgtctc tgaaattgag tctcggtcat gacttatccc gaagtagagc 4561 aatgtgtttc ctctcattgt agtttcagga ctttgtcagt acaagctctg ccctaggctt 4621 cttactttat actcatatcc tgaaaagatg tgatttcatc tatgaagggg taaaatattg 4681 gtttgtattt aattgtttga aataaaagtg atccctataa aaaaaaaaaa aaaaaaaaaa 4741 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa // LOCUS HUM14RPA 692 bp mRNA PRI 31-DEC-1994 DEFINITION Homo sapiens replication protein A 14kDa subunit (RPA) mRNA, complete cds. ACCESSION L07493 NID g291582 KEYWORDS replication protein A; replication protein A 14kDa subunit. SOURCE Homo sapiens female cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 692) AUTHORS Umbricht,C.B., Erdile,L.F., Jabs,E.W. and Kelly,T.J. TITLE Cloning, overexpression, and genomic mapping of the 14-kDa subunit of human replication protein A JOURNAL J. Biol. Chem. 268 (9), 6131-6138 (1993) MEDLINE 93203195 FEATURES Location/Qualifiers source 1..692 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /cell_type="epithelial cell" /sex="female" /map="7" gene 31..396 /gene="RPA" CDS 31..396 /gene="RPA" /codon_start=1 /product="replication protein A 14kDa subunit" /db_xref="PID:g291583" /translation="MVDMMDLPRSRINAGMLAQFIDKPVCFVGRLEKIHPTGKMFILS DGEGKNGTIELMEPLDEEISGIVEVVGRVTAKATILCTSYVQFKEDSHPFDLGLYNEA VKIIHDFPQFYPLGIVQHD" BASE COUNT 208 a 128 c 143 g 213 t ORIGIN 1 ttccccgagc cgcagtcttg gaccataatc atggtggaca tgatggactt gcccaggtcg 61 cgcatcaacg ccggcatgct agctcaattc atcgacaagc ctgtctgctt cgtagggagg 121 ctggaaaaga ttcatcccac cggaaaaatg tttattcttt cagatggaga aggaaaaaat 181 ggaaccatcg agttgatgga accccttgat gaagaaatct ctggaattgt ggaagtggtt 241 ggaagagtaa ccgccaaggc caccatcttg tgtacatctt atgtccagtt taaagaagat 301 agccatcctt ttgatcttgg actttacaat gaagctgtga aaattatcca tgacttccct 361 cagttttatc ctttagggat tgtgcaacat gattgatctt gatggatttt catacgattg 421 taaatgagct atattaaagt ctattaaagg aagcccttct tgtttgaggg agagatttct 481 gtgctttctc atatttaatt tgctgttttt aagatattcc aacctagagt ttttgatgga 541 actgatatat tgacagttct caccgaagcc cttttataaa gaattgctac tccaatatat 601 ggtcagatta gatgcaagaa taaagcagtt gtccgagtct aagtttctat tttattaata 661 aaaactaaaa tggtacgtac aaaaaaaaaa cc // LOCUS HUM1D12A 492 bp mRNA PRI 14-MAR-1994 DEFINITION Human pre-T/NK cell associated protein (1D12A2) mRNA, complete cds. ACCESSION L17325 NID g306322 KEYWORDS . SOURCE Homo sapiens fetus liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 492) AUTHORS Ranes-Goldberg,M.G., Hori,T., Mohan-Peterson,S. and Spits,H. TITLE Identification of human pre-T/NK cell-associated genes JOURNAL J. Immunol. 151 (10), 5810-5821 (1993) MEDLINE 94044805 FEATURES Location/Qualifiers source 1..492 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="pre T/NK" /dev_stage="fetus" /germline /tissue_type="liver" gene 5..492 /gene="1D12A" CDS 5..64 /gene="1D12A" /codon_start=1 /db_xref="PID:g306323" /translation="MSQKTCFIETSRKLQTKKT" polyA_site 492 /gene="1D12A" BASE COUNT 151 a 90 c 67 g 184 t ORIGIN 1 cgagatgtct cagaaaacct gttttataga aacttctaga aagttgcaaa ccaaaaaaac 61 atgaatgttc atcatgtcat ctcatttttc tgaaaaatta tgaagttttc agtaattgtg 121 actttttaaa catgtaaaag tatggaagta catccagtaa acaatgccat gtacattccc 181 cctcaatttc caaccccatc cccatttgct gtccagagtg tgaccacagt taacggttaa 241 tgtgcatctt ttatgtactt aacatgtctg taaatatgtc ttttatcttt ttcccctctt 301 ttaaatttta acttgacgtt aaagtttaga acttccatgt tagtatgtgc agctgtaaca 361 cattcttttt ttagtagcca catagtgttt caatgtatgg atgttccaaa atacattacc 421 catttcctgc tgtaacattt taacttttta gaattcaact atgtgtactt ttgtaattaa 481 aaatgtgaat tt // LOCUS HUM215MBP 622 bp DNA PRI 19-JAN-1996 DEFINITION Homo sapiens synthetic myelin basic protein 21.5 kDa isoform gene, complete cds. ACCESSION L41657 NID g1162921 KEYWORDS myelin basic protein; synthetic DNA; synthetic gene. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Kamholz,J., de Ferra,F., Puckett,C. and Lazzarini,R. TITLE Identification of three forms of human myelin basic protein by cDNA cloning JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83 (13), 4962-4966 (1986) MEDLINE 86259714 REFERENCE 2 (bases 1 to 622) AUTHORS Nye,S.H., Pelfrey,C.M., Burkwit,J.J., Voskuhl,R.R., Lenardo,M.J. and Mueller,J.P. TITLE Purification of immunologically active recombinant 21.5 kDa isoform of human myelin basic protein JOURNAL Mol. Immunol. 32 (14-15), 1131-1141 (1995) MEDLINE 96128281 COMMENT Sequence M13577 overlaps this sequence. FEATURES Location/Qualifiers source 1..622 /organism="Homo sapiens" /note="synthetically derived in the laboratory with oligonucleotides" /db_xref="taxon:9606" /map="18q22-qter" mRNA 1..622 gene 4..615 /gene="MBP" CDS 4..615 /gene="MBP" /note="21.5 kDa isoform; bp 595-612: histidine tag" /codon_start=1 /db_xref="GDB:G00-119-379" /product="myelin basic protein" /db_xref="PID:g1162922" /translation="MASQKRPSQRHGSKYLATASTMDHARHGFLPRHRDTGILDSIGR FFGGDRGAPKRGSGKVPWLKPGRSPLPSHARSQPGLCNMYKDSHHPARTAHYGSLPQK SHGRTQDENPVVHFFKNIVTPRTPPPSQGKGRGLSLSRFSWGAEGQRPGFGYGGRASD YKSAHKGFKGVDAQGTLSKIFKLGGRDSRSGSPMARRHHHHHH" BASE COUNT 120 a 220 c 167 g 115 t ORIGIN 1 catatggcgt ctcagaaacg tccgtcccag cgtcacggct ccaaatacct ggccaccgcc 61 agcaccatgg accatgcccg tcatggcttc ctgccgcgtc accgtgacac cggcatcctg 121 gactccatcg gccgcttctt cggcggtgac cgtggtgcgc cgaaacgtgg ctctggcaaa 181 gtgccgtggc tgaaaccggg ccgtagcccg ctgccgtctc atgcccgtag ccagccgggc 241 ctgtgcaaca tgtacaaaga ctcccaccac ccggctcgta ccgcgcacta tggctccctg 301 ccgcagaaat cccacggccg tacccaggat gaaaacccgg tggtgcactt cttcaaaaac 361 attgtgaccc cgcgtacccc gccgccgtct cagggcaaag gccgtggcct gtccctgagc 421 cgtttcagct ggggcgccga aggccagcgt ccgggcttcg gctacggcgg ccgtgcgtcc 481 gactataaat ctgctcacaa aggcttcaaa ggcgtggatg cccagggcac cctgtccaaa 541 attttcaaac tgggcggccg tgatagccgt tctggctctc cgatggctag acgtcatcac 601 catcaccatc actaataagc tt // LOCUS HUM24DCOAR 1120 bp mRNA PRI 26-MAY-1995 DEFINITION Human mitochondrial 2,4-dienoyl-CoA reductase mRNA, complete cds. ACCESSION L26050 NID g602702 KEYWORDS 2,4-dienoyl-CoA reductase. SOURCE Homo sapiens (tissue library: Clontech) adult liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1120) AUTHORS Koivuranta,K.T., Hakkola,E.H. and Hiltunen,J.K. TITLE Isolation and characterization of cDNA for human 120 kDa mitochondrial 2,4-dienoyl-coenzyme A reductase JOURNAL Biochem. J. 304 (Pt 3), 787-792 (1994) MEDLINE 95118295 FEATURES Location/Qualifiers source 1..1120 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="fibroblast" /dev_stage="adult" /tissue_type="liver" /tissue_lib="Clontech" 5'UTR 1..10 /partial /note="putative" CDS 11..1018 /EC_number="1.3.1.34" /note="mitochondrial; putative" /codon_start=1 /product="2,4-dienoyl-CoA reductase" /db_xref="PID:g602703" /translation="MKLPARVFFTLGSRLPCGLAPRRFFSYGTKILYQNTEALQSKFF SPLQKAMLPPNSFQGKVAFITGGGTGLGKGMTTLLSSLGAQCVIASRKMDVLKATAEQ ISSQTGNKVHAIQCDVRDPDMVQNTVSELIKVAGHPNIVINNAAGNFISPTERLSPNA WKTITDIVLNGTAFVTLEIGKQLIKAQKGAAFLSITTIYAETGSGFVVPSASAKAGVE AMSKSLAAEWGKYGMRFNVIQPGPIKTKGAFSRLDPTGTFEKEMIGRIPCGRLGTVEE LANLAAFLCSDYASWINGAVIKFDGGEEVLISGEFNDLRKVTKEQWDTIEELIRKTKG S" sig_peptide 11..85 /note="putative" 3'UTR 1016..1120 /note="putative" polyA_signal 1103..1108 /note="putative" polyA_site 1120 BASE COUNT 341 a 220 c 250 g 309 t ORIGIN 1 gagactcaac atgaagctac cggccagggt tttctttact ctggggtccc ggctgccctg 61 tggcctcgct cctcggaggt ttttcagtta tgggacaaaa atattatatc aaaacactga 121 agctttgcaa tctaaattct tttcacctct tcaaaaagcg atgctaccac ctaatagttt 181 tcaaggaaaa gtggcattca ttactggggg aggtactggc cttggtaaag gaatgacaac 241 tcttctgtcc agcctaggtg ctcagtgcgt gatagccagc cggaagatgg atgttttgaa 301 agctaccgca gaacaaattt cttctcaaac tggaaataag gttcatgcaa ttcagtgtga 361 tgtgagggat cctgatatgg ttcaaaacac tgtgtcagaa ctgatcaaag ttgcaggaca 421 tcctaatatt gtgataaaca atgcagcagg gaattttatt tctcctactg aaagactttc 481 tcctaatgct tggaaaacca taactgacat agttctaaat ggcacagcct tcgtgacact 541 agaaattgga aaacaactaa ttaaagcaca gaaaggagca gcatttcttt ctattactac 601 tatctatgct gagactggtt caggttttgt agtaccaagt gcttctgcca aagcaggtgt 661 ggaagccatg agcaagtctc ttgcagctga atggggtaaa tatggaatgc gattcaatgt 721 gattcaacca gggcctataa aaaccaaagg tgcctttagc cgtctggacc caactggaac 781 atttgagaaa gaaatgattg gcagaattcc ctgtggtcgc ctggggactg tagaagaact 841 cgcaaatctt gctgctttcc tttgtagtga ttatgcttct tggattaatg gagcagtcat 901 taaatttgac ggtggagagg aagtacttat ttcaggggaa ttcaacgacc tgagaaaggt 961 caccaaggag cagtgggaca ccatagaaga actcatcagg aagacaaaag gttcctaaga 1021 ccactttggc cttcatcttg gttacagaaa agggaataga aatgaaacaa attatctctc 1081 atcttttgac tatttcaagt ctaataaatt cttaattaac // LOCUS HUM26SPSIV 1599 bp mRNA PRI 27-SEP-1993 DEFINITION Human 26S protease (S4) regulatory subunit mRNA, complete cds. ACCESSION L02426 NID g403455 KEYWORDS 26S protease (S4) regulatory subunit; ATP-dependant protease; ATPase; ubiquitin-dependent protease. SOURCE Homo sapiens (library: Lambda Zap II) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1599) AUTHORS Dubiel,W., Ferrell,K., Pratt,G. and Rechsteiner,M.C. TITLE Subunit 4 of the 26S protease is a member of a novel eukaryotic ATPase family JOURNAL J. Biol. Chem. 267, 22699-22702 (1992) MEDLINE 93054576 FEATURES Location/Qualifiers source 1..1599 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /cell_type="erythrocyte" /tissue_lib="Lambda Zap II" RBS 55..64 CDS 61..1383 /codon_start=1 /function="degradation of ubiquitinated proteins" /evidence=experimental /product="26S protease (S4) regulatory subunit" /db_xref="PID:g403456" /translation="MGQSQSGGHGPGGGKKDDEDKKKKYEPPVPTRVGKKKKKTKGPD AASKLPLVTPHTQCRLKLLKLERIKDYLLMEEEFIRNQEQMKPLEEKQEEERSKVDDL RGTPMSVGTLEEIIDDNHAIVSTSVGSEHYVSILSFVDKDLLEPGCSVLLNHKVHAVI GVLMDDTDPLVTVMKVEKAPQETYADIGGLDNQIQEIKESVELPLTHPEYYEEMGIKP PKGVILYGPPGTGKTLLAKAVANQTSATFLRVVGSELIQKYLGDGPKLVRELFRVAEE HAPSIVFIDEIDAIGTKRYDSNSGGEREIQRTMLELLNQLDGFDSRGDVKVIMATNRI ETLDPALIRPGRIDRKIEFPLPDEKTKKRIFQIHTSRMTLADDVTLDDLIMAKDDLSG ADIKAICTEAGLMALRERRMKVTNEDFKKSKENVLYKKQEGTPEGLYL" polyA_signal 1577..1582 BASE COUNT 489 a 322 c 423 g 365 t ORIGIN 1 aattccggcg gaagtggtgg aggaacttcc ggcagcggca gctcaagtgg ccaaggcaag 61 atgggtcaaa gtcagagtgg tggtcatggt cctggaggtg gcaagaagga tgacgaggac 121 aagaaaaaga aatatgaacc tcctgtacca actagagtgg ggaaaaagaa gaagaaaaca 181 aagggaccag atgctgccag caaactgcca ctggtgacac ctcacactca gtgccggtta 241 aaattactga agttagagag aattaaagac tatcttctca tggaggaaga attcattaga 301 aatcaggaac aaatgaaacc attagaagaa aagcaagagg aggaaagatc aaaagtggat 361 gatctgaggg ggaccccgat gtcagtagga accttggaag agattattga tgacaatcat 421 gccatcgtgt ctacatctgt gggctcagaa cactacgtca gcattctttc atttgtagac 481 aaggatctgc tggaacctgg ctgctcggtc ctgctcaacc acaaggtgca tgccgtgata 541 ggggtgctga tggatgacac ggatcccctg gtcacagtga tgaaggtaga aaaggccccc 601 caggagacct atgcagatat tggggggttg gacaaccaaa ttcaggaaat taaggaatct 661 gtggagcttc ctctcaccca tcctgaatat tatgaagaga tgggtataaa gcctcctaag 721 ggggtcattc tctatggtcc acctggcaca ggtaaaacct tgttagccaa agcagtagca 781 aaccaaacct cagccacttt cttgagagtg gttggctctg aacttattca gaagtaccta 841 ggtgatgggc ccaaactcgt acgggaattg ttccgagttg ctgaagaaca tgcaccgtcc 901 atcgtgttta ttgatgaaat tgacgccatt gggacaaaaa gatatgactc caattctggt 961 ggtgagagag aaattcagcg aacaatgttg gaactgctga accagttgga tggatttgat 1021 tctaggggag atgtgaaagt tatcatggcc acaaaccgaa tagaaacttt ggatccagca 1081 cttatcagac caggccgcat tgacaggaag attgagttcc ccctgcctga tgaaaagacg 1141 aagaagcgca tctttcagat tcacacaagc aggatgacgc tggctgatga tgtaaccctg 1201 gacgacctga tcatggctaa agatgacctc tctggtgctg acatcaaggc aatctgtaca 1261 gaagctggtc tgatggcctt aagagaacgt agaatgaaag taacaaatga agacttcaaa 1321 aaatctaaag aaaatgttct ttataagaaa caggaaggca cccctgaggg gctgtatctc 1381 taatgaacca tggctgtcat caggaaaatg gttgggagat ttctcaatcc ctgaaaggga 1441 tgaggttggg ggagttgccc agaggaatcc ctgttcccac tgatttttat tagcaaaaca 1501 tcctgtgtct tttggagtac gatgtgtaag tgcccattgg gtggcctgtt ggtcactgtg 1561 cagcagtctg cttcccaata aagcgtgctc tttcacaag // LOCUS HUM2OGDH 4122 bp mRNA PRI 21-MAR-1996 DEFINITION Human mRNA for 2-oxoglutarate dehydrogenase, complete cds. ACCESSION D10523 D90499 NID g531240 KEYWORDS 2-oxoglutarate dehydrogenase. SOURCE Homo sapiens cDNA to mRNA, clone_lib:lamda gt11. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Koike,K., Urata,Y. and Goto,S. TITLE Cloning and nucleotide sequence of the cDNA encoding human 2-oxoglutarate dehydrogenase (lipoamide) JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (5), 1963-1967 (1992) MEDLINE 92179301 REFERENCE 2 (sites) AUTHORS Koike,K. TITLE The gene encoding human 2-oxoglutarate dehydrogenase: structural organization and mapping to chromosome 7p13-p14 JOURNAL Gene 159 (2), 261-266 (1995) MEDLINE 95347609 REFERENCE 3 (bases 1 to 4122) AUTHORS Koike,K. JOURNAL Unpublished (1994) REFERENCE 4 (bases 1 to 4122) AUTHORS Koike,K. TITLE Direct Submission JOURNAL Submitted (11-SEP-1991) to the DDBJ/EMBL/GenBank databases. Kichiko Koike, Atomic Disease Institute, Nagasaki Univ. School of Medicine, Department of Pathological Biochemistry; Sakamoto 1-12-4, Nagasaki, Nagasaki 852, Japan (Tel:0958-47-2111(ex.2347), Fax:0958-45-9790) COMMENT Submitted (11-Sep-1991) to DDBJ by: Kichiko Koike Department of Pathological Biochemistry Atomic Disease Institute Nagasaki University School of Medicine 12-4 Sakamoto-machi Nagasaki-shi 852 Japan Phone: 0958-47-2111 Fax: 0958-47-8514. FEATURES Location/Qualifiers source 1..4122 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lamda gt11" 3'UTR 1..57 old_sequence 39 /note="replace (39, 'g')" /citation=[1] sig_peptide 58..177 CDS 58..3066 /codon_start=1 /product="2-oxoglutarate dehydrogenase precursor" /db_xref="PID:d1001866" /db_xref="PID:g531241" /translation="MFHLRTCAAKLRPLTASQTVKTFSQNRPAAARTFQQIRCYSAPV AAEPFLSGTSSNYVEEMYCAWLENPKSVHKSWDIFFRNTNAGAPPGTAYQSPLPLSRG SLAAVAHAQSLVEAQPNVDKLVEDHLAVQSLIRAYQIRGHHVAQLDPLGILDADLDSS VPADIISSTDKLGFYGLDESDLDKVFHLPTTTFIGGQESALPLREIIRRLEMAYCQHI GVEFMFINDLEQCQWIRQKFETPGIMQFTNEEKRTLLARLVRSTRFEEFLQRKWSSEK RFGLEGCEVLIPALKTIIDKSSENGVDYVIMGMPHRGRLNVLANVIRKELEQIFCQFD SKLEAADEGSGDVKYHLGMYHRRINRVTDRNITLSLVANPSHLEAADPVVMGKTKAEQ FYCGDTEGKKVMSILLHGDAAFAGQGIVYETFHLSDLPSYTTHGTVHVVVNNQIGFTT DPRMARSSPYPTDVARVVNAPIFHVNSDDPEAVMYVCKVAAEWRSTFHKDVVVDLVCY RRNGHNEMDEPMFTQPLMYKQIRKQKPVLQKYAELLVSQGVVNQPEYEEEISKYDKIC EEAFARSKDEKILHIKHWLDSPWPGFFTLDGQPRSMSCPSTGLTEDILTHIGNVASSV PVENFTIHGGLSRILKTRGEMVKNRTVDWALAEYMAFGSLLKEGIHIRLSGQDVERGT FSHRHHVLHDQNVDKRTCIPMNHLWPNQAPYTVCNSSLSEYGVLGFEAGLRMASPNAL VLWEAQFGDFHNTAQCIIDQFICPGQAKWVRQNGIVLLLPHGMEGMGPEHSSARPERF LQMCNDDPDVLPDLKEANFDINQLYDCNWVVVNCSTPGNFFHVLRRQILLPFRKPLII FTPKSLLRHPEARSSFDEMLPGTHFQRVIPEDGPAAQNPENVKRLLFCTGKVYYDLTR ERKARDMVGQVAITRIEQLSPFPFDLLLKEVQKYPNAELAWCQEEHKNQGYYDYVKPR LRTTISRAKPVWYAGRNPAAAPATGNKKTH" mat_peptide 178..3063 /EC_number="1.2.4.2" /product="2-oxoglutarate dehydrogenase" old_sequence 322 /note="replace (322,'')" /citation=[1] old_sequence 325 /note="replace (325,'')" /citation=[1] old_sequence 332 /note="replace (332, '')" /citation=[1] old_sequence 350 /note="repalce (350, '')" /citation=[1] old_sequence 538..540 /note="replace(538..540, 'tgc')" /citation=[1] old_sequence 627 /note="replace (627, 'c')" /citation=[1] old_sequence 1087 /note="replace (1087, 'g')" /citation=[1] old_sequence 1149 /note="replace (1149, 'c')" /citation=[1] old_sequence 2247 /note="replace (2247, 'a')" /citation=[1] old_sequence 2403..2404 /note="replace (2403..2404, 'ta')" /citation=[1] old_sequence 2440 /note="replace (2440,'g')" /citation=[1] old_sequence 2452 /note="replace (2452, 'g')" /citation=[1] old_sequence 2779 /note="replace (2779, 'g')" /citation=[1] old_sequence 2857..2858 /note="replace (2857..2858, 'cg')" /citation=[1] old_sequence 3026 /note="replace (3026, 'g')" /citation=[1] old_sequence 3044..3045 /note="replace (3044..3045, 'cg')" /citation=[1] 5'UTR 3067..4122 old_sequence 3152..3153 /note="replace (3152..3153, '')" /citation=[1] old_sequence 3209 /note="replace (3209, 't')" /citation=[1] old_sequence 3231..3232 /note="replace (3231..3232, 'cc')" /citation=[1] old_sequence 3425 /note="replace (3425, 'c')" /citation=[1] old_sequence 3435 /note="replace (3435, 'a')" /citation=[1] old_sequence 3590..3595 /note="replace (7590..3595, 'ggagtc')" /citation=[1] old_sequence 3614 /note="replace (3614, 'g')" /citation=[1] old_sequence 3620..3623 /note="replace (3620..3623, 'ggag')" /citation=[1] old_sequence 3663 /note="replace (3663, 'a')" /citation=[1] old_sequence 3671 /note="replace (3671, 'g')" /citation=[1] old_sequence 3674 /note="replace (3674, 'g')" /citation=[1] old_sequence 3684 /note="replace (3684, 'c')" /citation=[1] old_sequence 3688..3689 /note="replace (3688..3689, 'tg')" /citation=[1] old_sequence 3710 /note="replace (3710, 'a')" /citation=[1] old_sequence 4066 /note="replace (4066, '')" /citation=[1] old_sequence 4078..4089 /note="replace (4078..4089, '')" /citation=[1] polyA_site 4122 BASE COUNT 895 a 1196 c 1142 g 889 t ORIGIN 1 cgggttcggg tggagctgag ccggagacag gcaattgtga aaaacttcag gacaaaaatg 61 tttcatttaa ggacttgtgc tgctaagttg aggccattga cggcttccca gactgttaag 121 acattttcac aaaacagacc agcagcagct aggacatttc aacagattcg gtgctattct 181 gcacctgttg ctgctgagcc ctttctcagt gggactagtt cgaactatgt ggaggagatg 241 tactgtgctt ggctggaaaa ccccaaaagt gtacataagt catgggacat tttttttcgc 301 aacacgaatg ccggagcccc accgggcact gcctaccaga gtccccttcc cctgagccga 361 ggctccctgg ctgctgtggc ccatgcacag tccctggtag aagcacagcc caacgtggac 421 aagctcgtgg aggaccacct ggcagtgcag tcactcatca gggcatatca gatacgaggg 481 caccatgtag cacagctgga ccccctgggg attttggatg ctgatctgga ctcctccgtg 541 cccgctgaca ttatctcatc cacagacaaa cttgggttct atggcctgga tgagtctgac 601 ctcgacaagg tcttccactt gcccaccacc actttcatcg ggggacagga atcagcactt 661 cctctgcggg agatcatccg tcggctggag atggcctact gccagcatat tggggtggag 721 ttcatgttca tcaatgacct ggagcagtgc cagtggatcc ggcagaagtt tgagacccct 781 gggatcatgc agttcacaaa tgaggagaaa cggaccctgc tggccaggct tgtgcggtcc 841 accaggtttg aggagttcct acagcggaag tggtcctctg agaagcgctt tggtctagaa 901 ggctgcgagg tactgatccc tgccctcaag accatcattg acaagtctag tgagaatggc 961 gtggactacg tgatcatggg catgccacac agagggcggc tgaacgtgct tgcaaatgtc 1021 atcaggaagg agctggaaca gatcttctgt caattcgatt caaagctgga ggcagctgat 1081 gagggctccg gagatgtgaa gtaccacctg ggcatgtatc accgcaggat caatcgtgtc 1141 accgacagga acattacctt gtccttggtg gccaaccctt cccaccttga ggccgctgac 1201 cccgtggtga tgggcaagac caaagccgaa cagttttact gtggcgacac tgaagggaaa 1261 aaggtcatgt ccatcctgtt gcatggggat gctgcatttg ctggccaggg cattgtgtac 1321 gagaccttcc acctcagcga cctgccatcc tacacaactc atggcaccgt gcacgtggtc 1381 gtcaacaacc agatcggctt caccaccgac cctcggatgg cccgctcctc cccctacccc 1441 actgacgtgg cccgagtggt gaatgccccc attttccacg tgaactcaga tgaccccgag 1501 gctgtcatgt acgtgtgcaa agtggcggcc gagtggagga gcaccttcca caaggacgtg 1561 gttgtcgatt tggtgtgtta ccggcgcaac ggccacaacg agatggatga gcccatgttc 1621 acgcagccgc tcatgtacaa gcagatccgc aagcagaagc ctgtgttaca gaagtacgct 1681 gagctgctgg tgtcgcaggg tgtggtcaac cagcctgagt atgaggagga aatttccaag 1741 tatgataaga tctgtgagga agcttttgcc agatctaaag atgagaagat cttgcacatt 1801 aagcactggc tggactctcc ctggcctggc ttcttcaccc tggacgggca gcccaggagc 1861 atgtcctgcc cctccacggg tctgacggag gatattctga cacacatcgg gaatgtggct 1921 agttctgtgc ctgtggaaaa ctttactatt catggagggc tgagccggat cttgaagact 1981 cgtggggaaa tggtgaagaa ccggactgtg gactgggctc tagcggagta catggcgttt 2041 ggctcgctcc tgaaggaggg catccacatt cggctgagcg gccaggacgt ggagcggggc 2101 acattcagcc accgccacca tgtgctccat gaccagaatg tggacaagag aacctgcatc 2161 cccatgaacc atctctggcc caatcaggcc ccctatactg tgtgcaacag ctcactgtct 2221 gagtacggcg tgctgggctt tgaagctggg cttcgcatgg ccagtcctaa tgccctggtc 2281 ctctgggaag cccaatttgg tgacttccac aacacggccc agtgtatcat cgaccagttc 2341 atctgcccgg gacaagccaa gtgggtgcgg cagaatggca tcgtgttgct gctgccccat 2401 ggcatggagg gcatgggtcc agaacattcc tccgcccgcc cagagcggtt cttgcagatg 2461 tgcaacgatg acccagatgt cctgccagac cttaaagaag ccaacttcga catcaatcag 2521 ctatatgact gcaattgggt tgttgtcaac tgctccactc ctggcaactt cttccacgtg 2581 ctacgacgcc agatcctgct gccattccgg aagccgttaa ttatcttcac ccccaaatcc 2641 ctgttgcgcc accccgaggc cagatccagc tttgatgaga tgcttccagg aacccacttc 2701 cagcgggtga tcccagaaga tggccctgca gctcagaacc cagaaaatgt caaaaggctt 2761 ctcttctgca ccggcaaagt gtattatgac ctcacccggg agcgcaaagc acgcgacatg 2821 gtggggcagg tggccatcac aaggattgag cagctgtcgc cattcccctt tgacctcctg 2881 ctgaaggagg tgcagaagta ccccaatgct gagctggcct ggtgccagga ggagcacaag 2941 aaccaaggct actatgacta cgtgaagcca agacttcgga ccaccatcag ccgcgccaag 3001 cccgtctggt atgccggccg gaacccagcg gctgctccag ccaccggcaa caagaagacc 3061 cactgacgga gctgcagcgc ctcctggaca cggccttcga cctggacgtc ttcaagaact 3121 tctcgtagat gctgcctagg gttgcttggg ccactgccct ctccacaccc atgactgccc 3181 cttgcttctc aactaaagaa tagtgcctca gcgctgccca caccaccgtc ctcctcgctg 3241 tgccaccacc cctccctctg ctctcatagg agttaggctg tcgtccccct ccagtgcttg 3301 gctgccccac aggccacacg ctgcccaggc tctgctgact tctgagcagt tttccaggag 3361 gccgggggga gcaggaggag gaaaggtagc ccccgaggga tgtccttggg gaggggtcag 3421 ctctggcccc aatcctcccc accagtctca cccactagga taggaactgg gccttgtgtg 3481 ctggcttccg ctgtcaccca gcaaggcaca ggctcctgta tttgagacta ggatagcttc 3541 atcttgagcc tgagccttag aatctgtaga ggagcctgga gtcggatcta gccatggctg 3601 gcagaggttt ctagggtggg ccccagccgt ggcgtgaact gaggatgacc cggggcagct 3661 ggcaggagag agccttggcc tgacctggca cagaaagggc agcttcagtc tctgcagtgt 3721 ccattatctg ctgttccttc gagggttcca ggctgtgtgt ggggcccaag catgccccac 3781 ccaccctcct gggcccaggc agcacctgga gcccacagag tctgtgtgta gccaggaagc 3841 cccgctcagg tagccaccgc cggggcactg gctgctctgt cttggtcctg ttaaccctcc 3901 acctcctctc ttggactccc tccccacccc aaccactctt tctttctcct ttaacccaat 3961 ggagactttc tgatgcatcg ttttctttgc tgtgccaaag caggtcagaa gagggagagg 4021 aggggctggg ggtgaggggc caggccatgg ccaaggggcc agctgcccct catttatcac 4081 tctgaccttc acagggacag atctgattta tttattttgg tt // LOCUS HUM33DPTP 1182 bp mRNA PRI 15-SEP-1990 DEFINITION Human 33-kDa phototransducing protein mRNA, complete cds. ACCESSION M33478 NID g177186 KEYWORDS phototransducing protein. SOURCE Homo sapiens adult cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1182) AUTHORS Abe,T., Nakabayashi,H., Tamada,H., Takagi,T., Sakuragi,S., Yamaki,K. and Shinohara,T. TITLE Analysis of the human, bovine and rat 33-kDa proteins and cDNA in retina and pineal gland JOURNAL Gene 91, 209-215 (1990) MEDLINE 91007277 COMMENT Draft entry and computer-readable sequence for [Gene (1990) In press] kindly submitted by T.Shinohara, 30-MAR-1990. FEATURES Location/Qualifiers source 1..1182 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="photoreceptor rod cell" /dev_stage="adult" mRNA <1..1181 /note="33-kDa phototransducing protein mRNA" CDS 52..792 /note="33-kDa phototransducing protein" /codon_start=1 /db_xref="PID:g177187" /translation="MEEAKSQSLEEDFEGQATHTGPKGVINDWRKFKLESQDSDSIPP SKKEILRQMSSPQSRNGKDSKERVSRKMSIQEYELIHKEKEDENCLRKYRRQCMQDMH QKLSFGPRYGFVYELETGKQFLETIEKELKITTIVVHIYEDGIKGCDALNSSLTCLAA EYPIVKFCKIKASNTGAGDRFSLDVLPTLLIYKGGELISNFISVAEQFAEEFFAGDVE SFLNEYGLLPEREVHVLEHTKIEEEDVE" BASE COUNT 405 a 187 c 242 g 348 t ORIGIN 1 aggacaccag gcacagagat ccaaactatt atatcaaatc caatccctaa aatggaagaa 61 gccaaaagcc aaagtttgga ggaagacttt gaaggacagg ccacacatac aggacccaaa 121 ggagtaataa atgattggag aaagtttaaa ttagagagtc aagacagtga ttcaattcca 181 cctagcaaga aggagattct caggcaaatg tcttctcctc agagtaggaa tggcaaagat 241 tcaaaggaac gagtcagcag aaagatgagc attcaagaat atgaactaat ccataaagag 301 aaagaggatg aaaactgcct tcgtaaatac cgtagacagt gtatgcagga tatgcaccag 361 aagctgagtt ttgggcctag atatgggttt gtgtatgagc tggaaactgg aaagcaattc 421 ctagaaacaa ttgaaaagga actgaagatc accacaattg ttgttcacat ttatgaagat 481 ggtattaagg gttgtgatgc tctaaacagt agtttaacat gccttgcagc agaataccct 541 atagttaagt tttgtaaaat aaaagcttcg aatacaggtg ctggggaccg cttttcctta 601 gatgtacttc ctacactgct catctataaa ggtggggaac tcataagcaa ttttattagt 661 gttgctgaac agtttgctga agaatttttt gctggggatg tggagtcttt cctaaatgaa 721 tatgggttac tacctgaaag agaggtacat gtcctagagc ataccaaaat agaagaagaa 781 gatgttgaat gaagattcac tatgtcaata tctcatgttt atcctttagg tattggatga 841 tggttttggt agtatctata ttgcttttgt gaacacagag tatgggcacg gctatgctaa 901 cttgacaaaa atgactgatg caacaatcga gttattagca tttcatagta ttagttactc 961 aaattgatac aatgcttgac tacaaaacaa agctgtcttc agcaacatta ttagtagaca 1021 aagaggatgt ggataatatt atgacatttt tcaaaaatcc ctttcaagtt atgttttgtc 1081 ttttttactc cattttccct catcactgtt attatttgga cttttcaaat tacattattc 1141 attataattt tctttgtgta ataaaaatga aatctcatga ag // LOCUS HUM3CL 1911 bp mRNA PRI 13-JUN-1996 DEFINITION Human pre-T/NK cell associated protein (3Cl) mRNA, complete cds. ACCESSION L17328 NID g306331 KEYWORDS . SOURCE human cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1911) AUTHORS Ranes-Goldberg,M.G., Hori,T., Mohan-Peterson,S. and Spits,H. TITLE Identification of human pre-T/NK cell-associated genes JOURNAL J. Immunol. 151 (10), 5810-5821 (1993) MEDLINE 94044805 FEATURES Location/Qualifiers source 1..1911 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="pre-T/NK" /dev_stage="fetus" /germline /tissue_type="liver" repeat_region 504..640 /rpt_family="Alu" gene 504..1911 /gene="3Cl" CDS 1297..1452 /gene="3Cl" /codon_start=1 /db_xref="PID:g1374787" /translation="MICCQETVYSCEYWRTFKRYFSSWRFKIRSWLTYLSWGLLCTSW YCKVKMS" polyA_site 1911 /gene="3Cl" BASE COUNT 501 a 352 c 388 g 670 t ORIGIN 1 attctttctc cagaaaggat aacaaacctt ttgtttgaga gagatttttc actttttgaa 61 atggtctgtt tgtttgaagt tacagaagaa aaataagtgt aatgcacagt tgtatttatt 121 attcatgtga ccctggaaac atttcttccc cgccctgtgt ttcagtttct tcatctgtaa 181 aatgaggtta ataatggttc ccccttctaa tgttgttttt gaagattaca tgagttaata 241 catgtaaatt gcttagaaca gtgcctagca ctaatataat ggaagtgttg cctactgcct 301 attggcagtt ttctgtacag cttcaccttc ccttaggact tcttgtcagc ccagttatgt 361 gatttttcac ctcgtttttg tttaggagac tctactttgt tgatgttttt ctgagaaaca 421 atgtgctgtc tgtttttctc tgtttccttt gacatttgct gttcatcttt cactgaatat 481 ttgtttaaga aagacataca gaatgttaat tttcattgat tgtagctttc ttctttttaa 541 ggatttggaa tgccctgaca gataattatg ggaatgtgat gcctgtagac tggaagtcat 601 cgcatacagg accttgcact tgcttactct gaacctctca gaaaaagggg taagttagga 661 attgaaccaa tagtgtttga ataactgatg ctgataatag aactaaaaca gtagcatttc 721 tgcttaacac agattcactg acctgcctct catttctgaa attatagcag tttgggtggc 781 tctgtcttga tgtcttggct gtagggccat caccttgctg gcacattcct ccagttcttc 841 tccacgacac tcatttttta tgtgtagcta aatttccatc acaatttttt tttttttgac 901 ggagtctcac tctgttgccc aggctggagt acagtggctt actacaacct ctgcctcctg 961 ggttcaagcg attctcccac ctcaaactcc cgagtagctg ggattacagc atgagccacc 1021 agtcccggct aatttttgta ttttcagtag agatagggtt tcgctgtgtt ggccaggctg 1081 gtctcgaact cctgacctca ggtaattcac ctcccagagt gctggattac aggtgtgagc 1141 cactgcacct ggcagtgatg atgcactgaa acactcagta aaattttgat gcactgaaac 1201 cctcagtaaa aatgctagcc tttggtttaa tatactgaga gtattggaat ctcaacacgg 1261 agttaacttt ttaatgagtg tagtgacctg ctagacatga tatgttgtca ggagactgtg 1321 tattcctgtg aatactggag aacctttaaa agatatttta gttcttggcg ttttaaaata 1381 cggtcttggc taacttacct ttcttggggt ctcctgtgta catcctggta ttgtaaagtg 1441 aagatgtctt gactaagaag tgtgagcacg agtggtggga caggtaggga ggcagaaaag 1501 ggttgagaat tacttgaaat ggaaaatcac agtgttaagc ttgtttaaat ctgatgagaa 1561 acatatccaa attatctaag tttcagggcc tggctgttta agttccttat caaataataa 1621 tgtaaactca tgccagatct tcttgcgtgc tctatatttc attaagtgta attatgtgat 1681 tattgagaat atagtgcagt aggcctgtca ctattttaca tagtattgtt aatgttttgt 1741 taaaaggatt ttcttacata tgtgatcaga aaagcctact gttattattt tctagtaaaa 1801 tgtagaattc tggctgtatt tcccagtatt cagtagctat ggaaatcttc ctgactgtgc 1861 cagtcattaa tcccattaag ctttgcagac tgacctggca tatatttaat g // LOCUS HUM49KDA 2201 bp mRNA PRI 06-MAR-1996 DEFINITION Human hnRNP H mRNA, complete cds. ACCESSION L22009 NID g347313 KEYWORDS . SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2201) AUTHORS Honore,B., Rasmussen,H.H., Vorum,H., Dejgaard,K., Liu,X., Gromov,P., Madsen,P., Gesser,B., Tommerup,N. and Celis,J.E. TITLE Heterogeneous nuclear ribonucleoproteins H, H', and F are members of a ubiquitously expressed subfamily of related but distinct proteins encoded by genes mapping to different chromosomes JOURNAL J. Biol. Chem. 270 (48), 28780-28789 (1995) MEDLINE 96081943 FEATURES Location/Qualifiers source 1..2201 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="MRC-5 V2" /cell_type="Fibroblast" /tissue_lib="lambda ZAP II" CDS 73..1422 /note="49 kDa protein; heterogeneous nuclear ribonucleoprotein H" /codon_start=1 /product="hnRNP H" /db_xref="PID:g347314" /translation="MMLGTEGGEGFVVKVRGLPWSCSADEVQRFFSDCKIQNGAQGIR FIYTREGRPSGEAFVELESEDEVKLALKKDRETMGHRYVEVFKSNNVEMDWVLKHTGP NSPDTANDGFVRLRGLPFGCSKEEIVQFFSGLEIVPNGITLPVDFQGRSTGEAFVQFA SQEIAEKALKKHKERIGHRYIEIFKSSRAEVRTHYDPPRKLMAMQRPGPYDRPGAGRG YNSIGRGAGFERMRRGAYGGGYGGYDDYNGYNDGYGFGSDRFGRDLNYCFSGMSDHRY GDGGSTFQSTTGHCVHMRGLPYRATENDIYNFFSPLNPVRVHIEIGPDGRVTGEADVE FATHEDAVAAMSKDKANMQHRYVELFLNSTAGASGGAYEHRYVELFLNSTAGASGGAY GSQMMGGMGLSNQSSYGGPASQQLSGGYGGGYGGQSSMSGYDQVLQENSSDFQSNIA" polyA_signal 2177..2182 polyA_site 2201 BASE COUNT 625 a 371 c 566 g 639 t ORIGIN 1 tttttttttt cgtcttagcc acgcagaagt cgcgtgtcta gtttgtttcg acgccggacc 61 gcgtaagaga cgatgatgtt gggcacggaa ggtggagagg gattcgtggt gaaggtccgg 121 ggcttgccct ggtcttgctc ggccgatgaa gtgcagaggt ttttttctga ctgcaaaatt 181 caaaatgggg ctcaaggtat tcgtttcatc tacaccagag aaggcagacc aagtggcgag 241 gcttttgttg aacttgaatc agaagatgaa gtcaaattgg ccctgaaaaa agacagagaa 301 actatgggac acagatatgt tgaagtattc aagtcaaaca acgttgaaat ggattgggtg 361 ttgaagcata ctggtccaaa tagtcctgac acggccaatg atggctttgt acggcttaga 421 ggacttccct ttggatgtag caaggaagaa attgttcagt tcttctcagg gttggaaatc 481 gtgccaaatg ggataacatt gccggtggac ttccagggga ggagtacggg ggaggccttc 541 gtgcagtttg cttcacagga aatagctgaa aaggctctaa agaaacacaa ggaaagaata 601 gggcacaggt atattgaaat ctttaagagc agtagagctg aagttagaac tcattatgat 661 ccaccacgaa agcttatggc catgcagcgg ccaggtcctt atgacagacc tggggctggt 721 agagggtata acagcattgg cagaggagct ggctttgaga ggatgaggcg tggtgcttat 781 ggtggaggct atggaggcta tgatgattac aatggctata atgatggcta tggatttggg 841 tcagatagat ttggaagaga cctcaattac tgtttttcag gaatgtctga tcacagatac 901 ggggatggtg gctctacttt ccagagcaca acaggacact gtgtacacat gcggggatta 961 ccttacagag ctactgagaa tgacatttat aatttttttt caccgctcaa ccctgtgaga 1021 gtacacattg aaattggtcc tgatggcaga gtaactggtg aagcagatgt cgagttcgca 1081 actcatgaag atgctgtggc agctatgtca aaagacaaag caaatatgca acacagatat 1141 gtagaactct tcttgaattc tacagcagga gcaagcggtg gtgcttacga acacagatat 1201 gtagaactct tcttgaattc tacagcagga gcaagcggtg gtgcttatgg tagccaaatg 1261 atgggaggca tgggcttgtc aaaccagtcc agctacgggg gcccagccag ccagcagctg 1321 agtgggggtt acggaggcgg ctacggtggc cagagcagca tgagtggata cgaccaagtt 1381 ttacaggaaa actccagtga ttttcaatca aacattgcat aggtaaccaa ggagcagtga 1441 acagcagcta ctacagtagt ggaagccgtg catctatggg cgtgaacgga atgggagggt 1501 tgtctagcat gtccagtatg agtggtggat ggggaatgta attgatcgat cctgatcact 1561 gactcttggt caaccttttt tttttttttt ttttctttaa gaaaacttca gtttaacagt 1621 ttctgcaata caagcttgtg atttatgctt actctaagtg gaaatcagga ttgttatgaa 1681 gacttaaggc ccagtatttt tgaatacaat actcatctag gatgtaacag tgaagctgag 1741 taaactataa ctgttaaact taagttccag cttttctcaa gttagttata ggatgtactt 1801 aagcagtaag cgtatttagg taaaagcagt tgaattatgt taaatgttgc cctttgccac 1861 gttaaattga acactgtttt ggatgcatgt tgaaagacat gcttttattt tttttgtaaa 1921 acaatatagg agctgtgtct actattaaaa gtgaaacatt ttggcatgtt tgttaattct 1981 agtttcattt aataacctgt aaggcacgta agtttaagct tttttttttt ttaagttaat 2041 gggaaaaatt tgagacgcaa taccaatact taggattttg gtcttggtgt ttgtatgaaa 2101 ttctgaggcc ttgatttaaa tctttcattg tattgtgatt tccttttagg tatattgcgc 2161 taagtgaaac ttgtcaaata aatcctcctt ttaaaaactg c // LOCUS HUM4AI 1383 bp mRNA PRI 16-JUN-1993 DEFINITION Human mRNA for eukaryotic initiation factor 4AI. ACCESSION D13748 NID g219402 KEYWORDS eukaryotic initiation factor 4AI. SOURCE Homo sapiens lymphoma, cell line U937, cDNA to mRNA, clone HP00180. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Kim,N.S., Kato,T., Abe,N. and Kato,S. TITLE Nucleotide sequence of human cDNA encoding eukaryotic initiation factor 4AI JOURNAL Nucleic Acids Res. 21 (8), 2012 (1993) MEDLINE 93261841 REFERENCE 2 (bases 1 to 1383) AUTHORS Kato,S. JOURNAL Unpublished (1993) COMMENT Submitted (24-NOV-1992) to DDBJ by: Seishi kato Genetic Engineering Section Sagami Chemical Research Center 4-4-1 Nishi-Ohnuma Sagamihara, Kanagawa 229 Japan Phone: 0427-42-4791 Fax: 0427-49-7631. FEATURES Location/Qualifiers source 1..1383 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="U937" /tissue_type="lymphoma" CDS 17..1237 /codon_start=1 /product="eukaryotic initiation factor 4AI" /db_xref="PID:d1003402" /db_xref="PID:g219403" /translation="MSASQDSRSRDNGPDGMEPEGVIESNWNEIVDSFDDMNLSESLL RGIYAYGFEKPSAIQQRAILPCIKGYDVIAQAQSGTGKTATFAISILQQIELDLKATQ ALVLAPTRELAQQIQKVVMALGDYMGASCHACIGGTNVRAEVQKLQMEAPHIIVGTPG RVFDMLNRRYLSPKYIKMFVLDEADEMLSRGFKDQIYDIFQKLNSNTQVVLLSATMPS DVLEVTKKFMRDPIRILVKKEELTLEGIRQFYINVEREEWKLDTLCDLYETLTITQAV IFINTRRKVDWLTEKMHARDFTVSAMHGDMDQKERDVIMREFRSGSSRVLITTDLLAR GIDVQQVSLVINYDLPTNRENYIHRIGRGGRFGRKGVAINMVTEEDKRTLRDIETFYN TSIEEMPLNVADLI" polyA_signal 1353..1358 BASE COUNT 361 a 332 c 369 g 321 t ORIGIN 1 ctagtttcta aggatcatgt ctgcgagcca ggattcccga tccagagaca atggccccga 61 tgggatggag cccgaaggcg tcatcgagag taactggaat gagattgttg acagctttga 121 tgacatgaac ctctcggagt cccttctccg tggcatctac gcctatggtt ttgagaagcc 181 ctctgccatc cagcagcgag ccattctacc ttgtatcaag ggttatgatg tgattgctca 241 agcccaatct gggactggga aaacggccac atttgccata tcgattctgc agcagattga 301 attagatcta aaagccaccc aggccttggt cctagcaccc actcgagaat tggctcagca 361 gatacagaag gtggtcatgg cactaggaga ctacatgggc gcctcctgtc acgcctgtat 421 cgggggcacc aacgtgcgtg ctgaggtgca gaaactgcag atggaagctc cccacatcat 481 cgtgggtacc cctggccgtg tgtttgatat gcttaaccgg agatacctgt cccccaaata 541 catcaagatg tttgtactgg atgaagctga cgaaatgtta agccgtggat tcaaggacca 601 gatctatgac atattccaaa agctcaacag caacacccag gtagttttgc tgtcagccac 661 aatgccttct gatgtgcttg aggtgaccaa gaagttcatg agggacccca ttcggattct 721 tgtcaagaag gaagagttga ccctggaggg tatccgccag ttctacatca acgtggaacg 781 agaggagtgg aagctggaca cactatgtga cttgtatgaa accctgacca tcacccaggc 841 agtcatcttc atcaacaccc ggaggaaggt ggactggctc accgagaaga tgcatgctcg 901 agatttcact gtatccgcca tgcatggaga tatggaccaa aaggaacgag acgtgattat 961 gagggagttt cgttctggct ctagcagagt tttgattacc actgacctgc tggccagagg 1021 cattgatgtg cagcaggttt ctttagtcat caactatgac cttcccacca acagggaaaa 1081 ctatatccac agaatcggtc gaggtggacg gtttggccgt aaaggtgtgg ctattaacat 1141 ggtgacagaa gaagacaaga ggactcttcg agacattgag accttctaca acacctccat 1201 tgaggaaatg cccctcaatg ttgctgacct catctgaggg gctgtcctgc cacccagccc 1261 cagccagggc tcaatctctg ggggctgagg agcagcagga ggggggaggg aagggagcca 1321 agggatggac atcttgtcat tttttttctt tgaataaatg tcactttttg aggcaaaaga 1381 agg // LOCUS HUM4COLA 2334 bp mRNA PRI 30-OCT-1994 DEFINITION Human type IV collagenase mRNA, complete cds. ACCESSION J05070 NID g177204 KEYWORDS collagenase; metalloprotease. SOURCE Human lung fibroblast cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2334) AUTHORS Wilhelm,S.M., Collier,I.E., Marmer,B.L., Eisen,A.Z., Grant,G.A. and Goldberg,G.I. TITLE SV40-transformed human lung fibroblasts secrete a 92-kDa type IV collagenase which is identical to that secreted by normal human macrophages [published erratum appears in J Biol Chem 1990 Dec 25;265(36):22570] JOURNAL J. Biol. Chem. 264 (29), 17213-17221 (1989) MEDLINE 90008879 FEATURES Location/Qualifiers source 1..2334 /organism="Homo sapiens" /db_xref="taxon:9606" /map="16q13-q21" gene 20..2143 /gene="CLG4A" CDS 20..2143 /gene="CLG4A" /note="92 kDa type IV collagenase" /codon_start=1 /db_xref="GDB:G00-120-592" /db_xref="PID:g177205" /translation="MSLWQPLVLVLLVLGCCFAAPRQRQSTLVLFPGDLRTNLTDRQL AEEYLYRYGYTRVAEMRGESKSLGPALLLLQKQLSLPETGELDSATLKAMRTPRCGVP DLGRFQTFEGDLKWHHHNITYWIQNYSEDLPRAVIDDAFARAFALWSAVTPLTFTRVY SRDADIVIQFGVAEHGDGYPFDGKDGLLAHAFPPGPGIQGDAHFDDDELWSLGKGVVV PTRFGNADGAACHFPFIFEGRSYSACTTDGRSDGLPWCSTTANYDTDDRFGFCPSERL YTRDGNADGKPCQFPFIFQGQSYSACTTDGRSDGYRWCATTANYDRDKLFGFCPTRAD STVMGGNSAGELCVFPFTFLGKEYSTCTSEGRGDGRLWCATTSNFDSDKKWGFCPDQG YSLFLVAAHEFGHALGLDHSSVPEALMYPMYRFTEGPPLHKDDVNGIRHLYGPRPEPE PRPPTTTTPQPTAPPTVCPTGPPTVHPSERPTAGPTGPPSAGPTGPPTAGPSTATTVP LSPVDDACNVNIFDAIAEIGNQLYLFKDGKYWRFSEGRGSRPQGPFLIADKWPALPRK LDSVFEEPLSKKLFFFSGRQVWVYTGASVLGPRRLDKLGLGADVAQVTGALRSGRGKM LLFSGRRLWRFDVKAQMVDPRSASEVDRMFPGVPLDTHDVFQYREKAYFCQDRFYWRV SSRSELNQVDQVGYVTYDILQCPED" BASE COUNT 412 a 760 c 697 g 465 t ORIGIN 1 agacacctct gccctcacca tgagcctctg gcagcccctg gtcctggtgc tcctggtgct 61 gggctgctgc tttgctgccc ccagacagcg ccagtccacc cttgtgctct tccctggaga 121 cctgagaacc aatctcaccg acaggcagct ggcagaggaa tacctgtacc gctatggtta 181 cactcgggtg gcagagatgc gtggagagtc gaaatctctg gggcctgcgc tgctgcttct 241 ccagaagcaa ctgtccctgc ccgagaccgg tgagctggat agcgccacgc tgaaggccat 301 gcgaacccca cggtgcgggg tcccagacct gggcagattc caaacctttg agggcgacct 361 caagtggcac caccacaaca tcacctattg gatccaaaac tactcggaag acttgccgcg 421 ggcggtgatt gacgacgcct ttgcccgcgc cttcgcactg tggagcgcgg tgacgccgct 481 caccttcact cgcgtgtaca gccgggacgc agacatcgtc atccagtttg gtgtcgcgga 541 gcacggagac gggtatccct tcgacgggaa ggacgggctc ctggcacacg cctttcctcc 601 tggccccggc attcagggag acgcccattt cgacgatgac gagttgtggt ccctgggcaa 661 gggcgtcgtg gttccaactc ggtttggaaa cgcagatggc gcggcctgcc acttcccctt 721 catcttcgag ggccgctcct actctgcctg caccaccgac ggtcgctccg acggcttgcc 781 ctggtgcagt accacggcca actacgacac cgacgaccgg tttggcttct gccccagcga 841 gagactctac acccgggacg gcaatgctga tgggaaaccc tgccagtttc cattcatctt 901 ccaaggccaa tcctactccg cctgcaccac ggacggtcgc tccgacggct accgctggtg 961 cgccaccacc gccaactacg accgggacaa gctcttcggc ttctgcccga cccgagctga 1021 ctcgacggtg atggggggca actcggcggg ggagctgtgc gtcttcccct tcactttcct 1081 gggtaaggag tactcgacct gtaccagcga gggccgcgga gatgggcgcc tctggtgcgc 1141 taccacctcg aactttgaca gcgacaagaa gtggggcttc tgcccggacc aaggatacag 1201 tttgttcctc gtggcggcgc atgagttcgg ccacgcgctg ggcttagatc attcctcagt 1261 gccggaggcg ctcatgtacc ctatgtaccg cttcactgag gggcccccct tgcataagga 1321 cgacgtgaat ggcatccggc acctctatgg tcctcgccct gaacctgagc cacggcctcc 1381 aaccaccacc acaccgcagc ccacggctcc cccgacggtc tgccccaccg gaccccccac 1441 tgtccacccc tcagagcgcc ccacagctgg ccccacaggt cccccctcag ctggccccac 1501 aggtcccccc actgctggcc cttctacggc cactactgtg cctttgagtc cggtggacga 1561 tgcctgcaac gtgaacatct tcgacgccat cgcggagatt gggaaccagc tgtatttgtt 1621 caaggatggg aagtactggc gattctctga gggcaggggg agccggccgc agggcccctt 1681 ccttatcgcc gacaagtggc ccgcgctgcc ccgcaagctg gactcggtct ttgaggagcc 1741 gctctccaag aagcttttct tcttctctgg gcgccaggtg tgggtgtaca caggcgcgtc 1801 ggtgctgggc ccgaggcgtc tggacaagct gggcctggga gccgacgtgg cccaggtgac 1861 cggggccctc cggagtggca gggggaagat gctgctgttc agcgggcggc gcctctggag 1921 gttcgacgtg aaggcgcaga tggtggatcc ccggagcgcc agcgaggtgg accggatgtt 1981 ccccggggtg cctttggaca cgcacgacgt cttccagtac cgagagaaag cctatttctg 2041 ccaggaccgc ttctactggc gcgtgagttc ccggagtgag ttgaaccagg tggaccaagt 2101 gggctacgtg acctatgaca tcctgcagtg ccctgaggac tagggctccc gtcctgcttt 2161 gcagtgccat gtaaatcccc actgggacca accctgggga aggagccagt ttgccggata 2221 caaactggta ttctgttctg gaggaaaggg aggagtggag gtgggctggg ccctctcttc 2281 tcacctttgt tttttgttgg agtgtttcta ataaacttgg attctctaac cttt // LOCUS HUM4EBP1A 357 bp mRNA PRI 17-FEB-1995 DEFINITION Human 4E-binding protein 1 mRNA, complete cds. ACCESSION L36055 NID g561629 KEYWORDS 4E-binding protein 1; homologue. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 357) AUTHORS Pause,A., Belsham,G.J., Gingras,A.C., Donze,O., Lin,T.A., Lawrence,J.C. Jr. and Sonenberg,N. TITLE Insulin-dependent stimulation of protein synthesis by phosphorylation of a regulator of 5'-cap function [see comments] JOURNAL Nature 371 (6500), 762-767 (1994) MEDLINE 95021760 FEATURES Location/Qualifiers source 1..357 /organism="Homo sapiens" /note="(vector lambda gt11)" /db_xref="taxon:9606" /clone_lib="Dr. M. Park" /tissue_type="placenta" CDS 1..357 /standard_name="4E-BP1" /note="homologue of PHAS-1, phosphorylated heat and acid stable protein regulated by insulin" /codon_start=1 /function="interacts with translation initiation factor eIF-4E and inhibits cap-dependent translation" /product="4E-binding protein 1" /db_xref="PID:g561630" /translation="MSGGSSCSQTPSRAIPATRRVVLGDGVQLPPGDYSTTPGGTLFS TTPGGTRIIYDRKFLMECRNSPVTKTPPRDLPTIPGVTSPSSDEPPMEASQSHLRNSP EDKRAGGEESQFEMDI" BASE COUNT 79 a 123 c 105 g 50 t ORIGIN 1 atgtccgggg gcagcagctg cagccagacc ccaagccggg ccatccccgc cactcgccgg 61 gtggtgctcg gcgacggcgt gcagctcccg cccggggact acagcacgac ccccggcggc 121 acgctcttca gcaccacccc gggaggtacc aggatcatct atgaccggaa attcctgatg 181 gagtgtcgga actcacctgt gaccaaaaca cccccaaggg atctgcccac cattccgggg 241 gtcaccagcc cttccagtga tgagcccccc atggaagcca gccagagcca cctgcgcaat 301 agcccagaag ataagcgggc gggcggtgaa gagtcacagt ttgagatgga catttaa // LOCUS HUM4EBP2A 363 bp mRNA PRI 17-FEB-1995 DEFINITION Human 4E-binding protein 2 mRNA, complete cds. ACCESSION L36056 NID g561631 KEYWORDS 4E-binding protein 2; homologue. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 363) AUTHORS Pause,A., Belsham,G.J., Gingras,A.C., Donze,O., Lin,T.A., Lawrence,J.C. Jr. and Sonenberg,N. TITLE Insulin-dependent stimulation of protein synthesis by phosphorylation of a regulator of 5'-cap function [see comments] JOURNAL Nature 371 (6500), 762-767 (1994) MEDLINE 95021760 FEATURES Location/Qualifiers source 1..363 /organism="Homo sapiens" /note="(vector lambda gt11)" /db_xref="taxon:9606" /clone_lib="Dr. M. Park" /tissue_type="placenta" CDS 1..363 /standard_name="4E-BP2" /note="homologue of 4E-BP1" /codon_start=1 /function="interacts with translation initiation factor eIF-4E and inhibits cap-dependent translation" /product="4E-binding protein 2" /db_xref="PID:g561632" /translation="MSSSAGSGHQPSQSRAIPTRTVAISDAAQLPHDYCTTPGGTLFS TTPGGTRIIYDRKFLLDRRNSPMAQTPPCHLPNIPGVTSPGTLIEDSKVEVNNLNNLN NHDRKHAVGDDAQFEMDI" BASE COUNT 93 a 119 c 83 g 68 t ORIGIN 1 atgtcctcgt cagccggcag cggccaccag cccagccaga gccgcgccat ccccacccgc 61 accgtggcca tcagcgacgc cgcgcagcta cctcatgact attgcaccac gcccgggggg 121 acgctcttct ccaccacacc gggaggaact cgaatcattt atgacagaaa gtttctgttg 181 gatcgtcgca attctcccat ggctcagacc ccaccctgcc atctgcccaa tatcccagga 241 gtcactagcc ctggcacctt aattgaagac tccaaagtag aagtaaacaa tttgaacaac 301 ttgaacaatc acgacaggaa acatgcagtt ggggatgatg ctcagttcga gatggacatc 361 tga // LOCUS HUM4F2A 1822 bp mRNA PRI 30-OCT-1994 DEFINITION Human 4F2 antigen heavy chain mRNA, complete cds. ACCESSION J02769 NID g177206 KEYWORDS 4F2 antigen heavy chain; antigen; cell surface antigen. SOURCE Human SV40 transformed fibroblast cell line GM637, cDNA to mRNA, clone pcD-4F2.A. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1822) AUTHORS Teixeira,S., Di Grandi,S. and Kuhn,L.C. TITLE Primary structure of the human 4F2 antigen heavy chain predicts a transmembrane protein with a cytoplasmic NH2 terminus JOURNAL J. Biol. Chem. 262 (20), 9574-9580 (1987) MEDLINE 87250620 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by L.Kuehn, 11-MAY-1987. FEATURES Location/Qualifiers source 1..1822 /organism="Homo sapiens" /db_xref="taxon:9606" /map="1q21" mRNA <1..1822 /note="4F2 mRNA" gene 54..1643 /gene="H4F2" CDS 54..1643 /gene="H4F2" /note="4F2 antigen heavy chain" /codon_start=1 /db_xref="GDB:G00-120-032" /db_xref="PID:g177207" /translation="MSQDTEVDMKEVELNELEPEKQPMNAASGAAMSLAGAEKNGLVK IKVAEDEAEAAAPAKFTGLSKEELLKVAGSPGWVRTRWALLLLFWLGWLGMLAGAVVI IVRAPRCRELPAQKWWHTGALYRIGDLQAFQGHGAGNLAGLKGRLDYLSSLKVKGLVL GPIHKNQKDDVAQTDLLQIDPNFGSKEDFDSLLQSAKKKSIRVILDLTPNYRGENSWF STQVDTVATKVKDALEFWLQAGVDGFQVRDIENLKDASSFLAEWQNITKGFSEDRLLI AGTNSSDLQQILSLLESNKDLLLTSSYLSDSGSTGEHTKSLVTQYLNATGNRWCSWSL SQARLLTSFLPAQLLRLYQLMLFTLPGTPVFSYGDEIGLDAAALPGQPMEAPVMLWDE SSFPDIPGAVSANMTVKGQSEDPGSLLSLFRRLSDQRSKERSLLHGDFHAFSAGPGLF SYIRHWDQNERFLVVLNFGDVGLSAGLQASDLPASASLPAKADLLLSTQPGREEGSPL ELERLKLEPHEGLLLRFPYAA" BASE COUNT 363 a 521 c 526 g 412 t ORIGIN 47 bp upstream of PstI site; chromosome 11. 1 ttgagccacc atctgaccgc aagctgcgtc gtgtcgccgg ttctgcaggc accatgagcc 61 aggacaccga ggtggatatg aaggaggtgg agctgaatga gttagagccc gagaagcagc 121 cgatgaacgc ggcgtctggg gcggccatgt ccctggcggg agccgagaag aatggtctgg 181 tgaagatcaa ggtggcggaa gacgaggcgg aggcggcagc gccggctaag ttcacgggcc 241 tgtccaagga ggagctgctg aaggtggcag gcagccccgg ctgggtacgc acccgctggg 301 cactgctgct gctcttctgg ctcggctggc tcggcatgct tgctggtgcc gtggtcataa 361 tcgtgcgagc gccgcgttgt cgcgagctac cggcgcagaa gtggtggcac acgggcgccc 421 tctaccgcat cggcgacctt caggccttcc agggccacgg cgcgggcaac ctggcgggtc 481 tgaaggggcg tctcgattac ctgagctctc tgaaggtgaa gggccttgtg ctgggtccaa 541 ttcacaagaa ccagaaggat gatgtcgctc agactgactt gctgcagatc gaccccaatt 601 ttggctccaa ggaagatttt gacagtctct tgcaatcggc taaaaaaaag agcatccgtg 661 tcattctgga ccttactccc aactaccggg gtgagaactc gtggttctcc actcaggttg 721 acactgtggc caccaaggtg aaggatgctc tggagttttg gctgcaagct ggcgtggatg 781 ggttccaggt tcgggacata gagaatctga aggatgcatc ctcattcttg gctgagtggc 841 aaaatatcac caagggcttc agtgaagaca ggctcttgat tgcggggact aactcctccg 901 accttcagca gatcctgagc ctactcgaat ccaacaaaga cttgctgttg actagctcat 961 acctgtctga ttctggttct actggggagc atacaaaatc cctagtcaca cagtatttga 1021 atgccactgg caatcgctgg tgcagctgga gtttgtctca ggcaaggctc ctgacttcct 1081 tcttgccggc tcaacttctc cgactctacc agctgatgct cttcaccctg ccagggaccc 1141 ctgttttcag ctacggggat gagattggcc tggatgcagc tgcccttcct ggacagccta 1201 tggaggctcc agtcatgctg tgggatgagt ccagcttccc tgacatccca ggggctgtaa 1261 gtgccaacat gactgtgaag ggccagagtg aagaccctgg ctccctcctt tccttgttcc 1321 ggcggctgag tgaccagcgg agtaaggagc gctccctact gcatggggac ttccacgcgt 1381 tctccgctgg gcctggactc ttctcctata tccgccactg ggaccagaat gagcgttttc 1441 tggtagtgct taactttggg gatgtgggcc tctcggctgg actgcaggcc tccgacctgc 1501 ctgccagcgc cagcctgcca gccaaggctg acctcctgct cagcacccag ccaggccgtg 1561 aggagggctc ccctcttgag ctggaacgcc tgaaactgga gcctcacgaa gggctgctgc 1621 tccgcttccc ctacgcggcc tgacttcagc ctgacatgga cccactaccc ttctcctttc 1681 cttcccaggc cctttggctt ctgatttttc tcttttttaa aaacaaacaa acaaactgtt 1741 gcagattatg agtgaacccc caaatagggt gttttctgcc ttcaaataaa agtcacccct 1801 gcatggtgaa gtcttccctc tg // LOCUS HUM51C 4657 bp mRNA PRI 10-APR-1996 DEFINITION Human (clone 51C-3) 51C protein mRNA, complete cds. ACCESSION L36818 NID g556191 KEYWORDS 51C protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4657) AUTHORS Hejna,J.A., Saito,H., Merkens,L.S., Tittle,T.V., Jakobs,P.M., Whitney,M.A., Grompe,M., Friedberg,A.S. and Moses,R.E. TITLE Cloning and characterization of a human cDNA (INPPL1) sharing homology with inositol polyphosphate phosphatases JOURNAL Genomics 29 (1), 285-287 (1995) MEDLINE 96079124 FEATURES Location/Qualifiers source 1..4657 /organism="Homo sapiens" /note="(vector pbluescript) (vector lambdaYES)" /db_xref="taxon:9606" /map="11q23" /cell_line="GM 639" /cell_type="immortalized fibroblast" /clone="51C-3" CDS 317..3766 /codon_start=1 /product="51C protein" /db_xref="PID:g556192" /translation="MCTRIAFCLMEKISWLCRPRRVCLCPASRPWVSSSACTPSPTRA LCAPCLFLYRVSESRTHRMTGMPQMGRMRSPRCPRALAPPAFLPPTGPSSPLPAPETP TAPAAESAPNGLSTVSHDYLKGSYGLDLEAVRGGASHLPHLTRTLATSCRRLHSEVDK VLSGLEILSKVFDQQSSPMVTRLLQQQNLPQTGEQELESLVLKLSVLKDFLSGIQKKA LKALQDMSSTAPPAPQPSTRKAKTMPVQAFEVKLDVTLGDLTKIGKSQKFTLSVDVEG GRLVLLRRQRDSQEDWTTFTHDRIRQLIKSQRVQNKLGVVFEKEKDRTQRKDFIFVSA RKREAFCQLLQLMKNKHSKQDEPDMISVFIGTWNMGSVPPPKNVTSWFTSKGLGKTLD EVTVTIPHDIYVFGTQENSVGDREWLDLLRGGLKELTDLDYRPIAMQSLWNIKVAVLV KPEHENRISHVSTSSVKTGIANTLGNKGAVGVSFMFNGTSFGFVNCHLTSGNEKTARR NQNYLDILRLLSLGDRQLNAFDISLRFTHLFWFGDLNYRLDMDIQEILNYISRKEFEP LLRVDQLNLEREKHKVFLRFSEEEISFPPTYRYERGSRDTYAWHKQKPTGVRTNVPSW CDRILWKSYPETHIICNSYGCTDDIVTSDHSPVFGTFEVGVTSQFISKKGLSKTSDQA YIEFESIEAIVKTASRTKFFIEFYSTCLEEYKKSFENDAQSSDNINFLKVQWSSRQLP TLKPILADIEYLQDQHLLLTVKSMDGYESYGECVVALKSMIGSTAQQFLTFLSHRGEE TGNIRGSMKVRVPTERLGTRERLYEWISIDKDEAGAKSKAPSVSRGSQEPRSGSRKPA FTEASCPLSRLFEEPEKPPPTGRPPAPPRAAPREEPLTPRLKPEGAPEPEGVAAPPPK NSFNNPAYYVLEGVPHQLLPPEPPSPARAPVPSATKNKVAITVPAPQLGHHRHPRVGE GSSSDEESGGTLPPPDFPPPPLPDSAIFLPPSLDPLPGPVVRGRGGAEARGPPPPKAH PRPPLPPGPSPASTFLGEVASGDDRSCSVLQMAKTLSEVDYAPAGPAASALLPGPLEL QPPPGTALGLWPAPQLPSTPHPGEHPGRPGRGGSVPAGRAGQRAGRGRHECLAAGHRL GAL" polyA_signal 4633..4638 BASE COUNT 957 a 1394 c 1392 g 914 t ORIGIN 1 tttagacagc tgtggaggcc acaaaagagc tccctgttgc ttctcaggcc ctgcctgtgg 61 tgggtgtgga gtctgtgtag gtctggtggg agacggggtg ttctggtcat tcctgccaat 121 aggcaacacc aggagggtgg aagtggaccg gccacatcat taaccctgtg cagcctgggc 181 aggtggtttt agggaaacca agctgagggg gttggggtgc tgagcttgct ggcaggagga 241 aggggtgctt tgggtcttag accccagcct gggggtgaca gattctggcc ctgccttggc 301 atcaggtatc agaagcatgt gcacacgtat cgcattctgc ctgatggaga agatttcttg 361 gctgtgcaga cctcgcaggg tgtgcctgtg ccccgcttcc agaccctggg tgagctcatc 421 ggcctgtacg cccagcccaa ccagggcctt gtgtgcgccc tgcctcttcc tgtatagggt 481 gagcgagagc cggacccacc ggatgaccgg gatgcctcag atggggagga tgagaagccc 541 ccgctgcccc cgcgctctgg ctccaccagc atttctgccc cccactgggc ccagcagtcc 601 cctgccagct cctgagactc ccacagctcc agctgctgag agtgctccca atgggctgag 661 caccgtctcg cacgactacc tgaaaggcag ctatgggctg gacctggaag ctgtgagggg 721 tggagccagc cacctgcccc acctcacccg taccctcgct acctcatgcc ggaggctgca 781 cagtgaggtg gacaaggtcc tgtcaggcct ggagatcctg tccaaggtgt ttgaccagca 841 gagctcgccc atggtgaccc gccttttgca gcagcagaac ctgccacaga caggggagca 901 ggaactagag agcctggtgc tgaagctgtc agtgctaaag gacttcctgt caggcatcca 961 gaagaaggcc ctgaaggccc tacaggacat gagctccaca gcacccccag ctccgcagcc 1021 atccacacgt aaggccaaga ccatgcccgt gcaggccttt gaggtgaagc tagacgtgac 1081 cctgggtgac ctgaccaaga ttgggaagtc acagaagttc acgctgagcg tggatgtgga 1141 gggtgggcgg ctggtgctgc tgcggagaca gcgggactcc caggaggact ggaccacctt 1201 cacgcacgac cgcatccgcc agctcattaa gtcccagcgt gtccagaaca agctgggtgt 1261 tgtgtttgag aaggagaagg accggactca gcgcaaggac ttcatctttg tcagtgcccg 1321 gaagcgggag gccttctgcc agctgttgca gctcatgaag aacaagcact ccaagcagga 1381 cgagcccgac atgatctcag tcttcatagg cacctggaac atgggaagtg taccacctcc 1441 aaaaaacgtg acatcctggt tcacatcgaa gggtctgggg aagaccctgg acgaggtcac 1501 agtgaccata ccccatgaca tctatgtctt tgggacccag gagaactcag tgggcgaccg 1561 cgagtggctg gacctactgc gcgggggcct caaggagctt acggatctgg attaccgccc 1621 gattgccatg caatcactgt ggaatatcaa ggtggcagtg ctggtcaagc cagagcacga 1681 gaaccgtatc agccatgtca gtacgtccag tgtgaagact ggcatcgcca acaccctggg 1741 gaacaagggg gctgtgggcg tctccttcat gtttaatggc acctcatttg gctttgtgaa 1801 ttgtcacctc acctcgggaa atgagaagac ggctcggagg aaccaaaact acttggacat 1861 cctgcggctg ctctcgctgg gcgaccggca gctcaatgcc tttgacatct ctctgcgttt 1921 cacacacctc ttctggtttg gggacctcaa ctaccgcctg gacatggata tccaggagat 1981 cctgaactac atcagcagga aagagtttga gcccctcctc agggtggacc agctcaacct 2041 ggagcgggag aagcacaagg tcttccttcg attcagtgag gaggagatct ccttcccacc 2101 cacctaccgc tatgagcggg gttcccggga cacatatgcc tggcacaagc agaagccaac 2161 tggggtccgg accaatgtgc cctcatggtg tgaccggatt ctgtggaaat cctaccctga 2221 aactcacatc atctgcaatt cttatggttg cactgatgac atcgtcacca gcgaccattc 2281 ccccgtgttt gggacatttg aggttggagt tacctcccag ttcatctcca agaaagggct 2341 ctcaaagact tcagaccagg cctacattga gtttgagagc atcgaggcca ttgtgaagac 2401 agccagccgc accaagttct tcatcgagtt ctactctacc tgcctggagg aatacaagaa 2461 gagctttgag aatgatgccc agagcagtga caacatcaac ttcctcaaag tgcagtggtc 2521 ttcacgccag ctgcccacgc tcaaaccaat tctggctgat atcgagtacc tgcaggacca 2581 gcacctcctg ctcacagtca agtccatgga tggctatgaa tcctatgggg agtgtgtggt 2641 tgcactcaaa tccatgatcg gcagcacggc ccaacagttc ctgaccttcc tatcccaccg 2701 tggcgaggag acaggcaata tcagaggctc catgaaggtg cgggtgccca cggagcgcct 2761 gggcacccgt gagcggctct acgagtggat cagcattgat aaggatgagg caggagcaaa 2821 gagcaaagcc ccctctgtgt cccgagggag ccaggagccc aggtcaggga gccgcaagcc 2881 agccttcaca gaggcctcct gcccgctctc caggttattt gaagaaccag agaaaccgcc 2941 accaacgggg aggcccccag ccccaccccg agcagctccc cgggaggagc ccttgacccc 3001 caggttgaag ccagagggag ctcctgaacc agaaggggtg gcggcccccc cacccaagaa 3061 cagcttcaat aaccctgcct actacgtcct tgaaggggtc ccgcaccagc tgctgccccc 3121 ggagccaccc tcgcctgcca gggcccctgt cccatctgcc accaagaaca aagtggccat 3181 tacagtgcct gctccacagc ttgggcacca ccggcaccct cgtgtgggag aggggagttc 3241 ttcagatgag gagtctggag gcacactgcc ccctccagac tttccacctc caccactgcc 3301 ggactcagcc atcttcctgc cccccagcct ggatccttta ccagggccag tggtccgggg 3361 ccgtggtggg gctgaggccc gtggcccacc acctcccaag gcccatccaa ggcctccact 3421 gcccccaggc ccctcaccag ccagcacttt cctgggggaa gtggccagtg gggatgaccg 3481 gtcctgctcg gtgctgcaga tggccaagac gctgagcgag gtggactatg cccctgctgg 3541 gcctgcagcc tcagcgctcc tcccaggccc cctggagctg cagccccccc cggggactgc 3601 cctcggacta tggccggccc ctcagcttcc ctccaccccg catccgggag agcatccagg 3661 aagacctggc agaggaggct ccgtgcctgc agggcgggcg ggccagcggg ctgggcgagg 3721 caggcatgag tgcctggctg cgggccatcg gcttggagcg ctatgaggag ggcctggtgc 3781 ataatggctg ggacgacctg gagtttctca gtgacatcac cgaggaggac ttggaggagg 3841 ctggggtgca ggacccggct cacaagcgcc tccttctgga caccctgcag ctcagcaagt 3901 gatagcggag gcaccacgaa gctgtgaact cagagcccct ccctgctacc aaggcccagc 3961 tatggcccca gggttgaaaa gttatgaggg tcagggcagt atctctctgc ctatttattg 4021 gggtgcctat ttattgggga tctgcattcc ccgctgccca atcatttgca atgccctaat 4081 tagggcatcc tgcccctcgc cttttaggct caggacggaa ggtcagttgc catggttacc 4141 gaggaccctg gttactctgg tgctgtcctg ttttactgga ccccgcctcc cagccccagg 4201 ggtgcctgtg ggggtccatt tgggtacgtc tgggccccca ctttcaccag tttctgcggc 4261 cttccaccgg gcctgaacca cagcggagga gctccgctaa gacctcccca cccccgctgg 4321 gggtgggggc gggtgtccgt ccggaaatga aggaatagcc cgaggaccgg gctggggttt 4381 atttaaactg ttctgtgtgg gtctggggag ggagagcacc ttaatattat tggggttggt 4441 tggggtgggg caggatctca gccataaagt gccagtttgc ttagttctca ctgtctcctg 4501 gtctgtgctg ccctgctctg gggatgcacg gcggcagggt gggggaggga ggttcctcgc 4561 aggtctcagc ccgggacagg gtcttgcaag cagcctcctg ggcagtcgta agggttgcgg 4621 cgtgatgtct tcaataaatt aagttttatt tggaaaa // LOCUS HUM56KAUTO 1958 bp mRNA PRI 14-JUL-1994 DEFINITION Homo sapiens 56K autoantigen annexin XI gene mRNA, complete cds. ACCESSION L19605 NID g457128 KEYWORDS 56K autoantigen; annexin XI; autoantigen; calcium-binding protein; phospholipid binding protein. SOURCE Homo sapiens (library: lambda-gt11) teratocarcinoma cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1958) AUTHORS Misaki,Y., Pruijn,G.J., van der Kemp,A.W. and van Venrooij,W.J. TITLE The 56K autoantigen is identical to human annexin XI JOURNAL J. Biol. Chem. 269 (6), 4240-4246 (1994) MEDLINE 94140847 FEATURES Location/Qualifiers source 1..1958 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="teratocarcinoma" /tissue_lib="lambda-gt11" CDS 179..1696 /note="homology with annexin from bp 827 to 1676" /codon_start=1 /product="56K autoantigen" /db_xref="PID:g457129" /translation="MSYPGYPPPPGGYPPAAPGGGPWGGAAYPPPPSMPPIGLDNVAT YAGQFNQDYLSGMAANMSGTFGGANMPNLYPGAPGAGYPPVPPGGFGQPPSAQQPVPP YGMYPPPGGNPPSRMPSYPPYPGAPVPGQPMPPPGQQPPGAYPGQPPVTYPGQPPVPL PGQQQPVPSYPGYPGSGTVTPAVPPTQFGSRGTITDAPGFDPLRDAEVLRKAMKGFGT DEQAIIDCLGSRSNKQRQQILLSFKTAYGKDLIKDLKSELSGNFEKTILALMKTPVLF DIYEIKEAIKGVGTDEACLIEILASRSNEHIRELNRAYKAEFKKTLEEAIRSDTSGHF QRLLISLSQGNRDESTNVDMSLAQRDAQELYAAGENRLGTDESKFNAVLCSRSRAHLV AVFNEYQRMTGRDIEKSICREMSGDLEEGMLAVVKCLKNTPAFFAERLNKAMRGAGTK DRTLIRIMVSRSETDLLDIRSEYKRMYGKSLYHDISGDTSGDYRKILLKICGGND" BASE COUNT 435 a 632 c 536 g 355 t ORIGIN 1 gctgctgcgc ccgcggctcc ccagtgcccc gagtgccccg cgggccccgc gagcgggagt 61 gggacccagc cctaggcaga acccaggcgc cgcgcccggg acgcccgcgg agagagccac 121 tcccgcccac gtcccatttc gcccctcgcg tccggagtcc ccgtggccag atctaaccat 181 gagctaccct ggctatcccc cgcccccagg tggctaccca ccagctgcac caggtggtgg 241 tccctgggga ggtgctgcct accctcctcc gcccagcatg ccccccatcg ggctggataa 301 cgtggccacc tatgcggggc agttcaacca ggactatctc tcgggaatgg cggccaacat 361 gtctgggaca tttggaggag ccaacatgcc caacctgtac cctggggccc ctggggctgg 421 ctacccacca gtgccccctg gcggctttgg gcagcccccc tctgcccagc agcctgttcc 481 tccctatggg atgtatccac ccccaggagg aaacccaccc tccaggatgc cctcatatcc 541 gccataccca ggggcccctg tgccgggcca gcccatgcca ccccccggac agcagccccc 601 aggggcctac cctgggcagc caccagtgac ctaccctggt cagcctccag tgccactccc 661 tgggcagcag cagccagtgc cgagctaccc aggatacccg gggtctggga ctgtcacccc 721 cgctgtgccc ccaacccagt ttggaagccg aggcaccatc actgatgctc ccggctttga 781 ccccctgcga gatgccgagg tcctgcggaa ggccatgaaa ggcttcggga cggatgagca 841 ggccatcatt gactgcctgg ggagtcgctc caacaagcag cggcagcaga tcctactttc 901 cttcaagacg gcttacggca aggatttgat caaagatctg aaatctgaac tgtcaggaaa 961 ctttgagaag acaatcttgg ctctgatgaa gaccccagtc ctctttgaca tttatgagat 1021 aaaggaagcc atcaaggggg ttggcactga tgaagcctgc ctgattgaga tcctcgcttc 1081 ccgcagcaat gagcacatcc gagaattaaa cagagcctac aaagcagaat tcaaaaagac 1141 cctggaagag gccattcgaa gcgacacatc agggcacttc cagcggctcc tcatctctct 1201 ctctcaggga aaccgtgatg aaagcacaaa cgtggacatg tcactcgccc agagagatgc 1261 ccaggagctg tatgcggccg gggagaaccg cctgggaaca gacgagtcca agttcaatgc 1321 ggttctgtgc tcccggagcc gggcccacct ggtagcagtt ttcaatgagt accagagaat 1381 gacaggccgg gacattgaga agagcatctg ccgggagatg tccggggacc tggaggaggg 1441 catgctggcc gtggtgaaat gtctcaagaa taccccagcc ttctttgcgg agaggctcaa 1501 caaggccatg aggggggcag gaacaaagga ccggaccctg attcgcatca tggtgtctcg 1561 cagcgagacc gacctcctgg acatcagatc agagtataag cggatgtacg gcaagtcgct 1621 gtaccacgac atctcgggag atacttcagg ggattaccgg aagattctgc tgaagatctg 1681 tggtggcaat gactgaacag tgactggtgg ctcacttctg cccacctgcc ggcaacacca 1741 gtgccaggaa aaggccaaaa gaatgtctgt ttctaacaaa tccacaaata gccccgagat 1801 tcaccgtcct agagcttagg cctgtcttcc acccctcctg acccgtatag tgtgccacag 1861 gacctgggtc ggtctagaac tctctcagga tgccttttct accccatccc tcacagcctc 1921 ttgctgctaa aatagatgtt tcatttttct gaaaaaaa // LOCUS HUM56KDAPR 1853 bp mRNA PRI 20-APR-1995 DEFINITION Human IEF SSP 9502 mRNA, complete cds. ACCESSION L07758 NID g177764 KEYWORDS . SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1853) AUTHORS Honore,B., Leffers,H., Madsen,P. and Celis,J.E. TITLE Cloning of a cDNA encoding a novel human nuclear phosphoprotein belonging to the WD-40 family JOURNAL Gene 151 (1-2), 291-296 (1994) MEDLINE 95129878 FEATURES Location/Qualifiers source 1..1853 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="MRC-5 V2" /cell_type="fibroblast" /tissue_lib="lambda ZAPII" CDS 88..1593 /note="nuclear phosphoprotein (similarity to Saccharomyces cerevisiae PWP1 protein)" /codon_start=1 /product="IEF SSP 9502" /db_xref="PID:g177765" /translation="MNRSRQVTCVAWVRCGVAKETPDKVELSKEEVKRLIAEAKEKLQ EEGGGSDEEETGSPSEDGMQSARTQARPREPLEDGDPEDDRTLDDDELAEYDLDKYDE EGDPDAETLGESLLGLTVYGSNDQDPYVTLKDTEQYEREDFLIKPSDNLIVCGRAEQD QCNLEVHVYNQEEDSFYVHHDILLSAYPLSVEWLNFDPSPDDSTGNYIAVGNMTPVIE VWDLDIVDSLEPVFTLGSKLSKKKKKKGKKSSSAEGHTDAVLDLSWNKLIRNVLASAS ADNTVILWDMSLGKPAASLAVHTDKVQTLQFHPFEAQTLISGSYDKSVALYDCRSPDE SHRMWRFSGQIERVTWNHFSPCHFLASTDDGFVYNLDARSDKPIFTLNAHNDEISGLD LSSQIKGCLVTASADKYVKIWDILGDRPSLVHSRDMKMGVLFCSSCCPDLPFIYAFGG QKEGLRVWDISTVSSVNEAFGRRERLVLGSARNSSISGPFGSRSSDTPMES" polyA_signal 1817..1822 polyA_site 1853 BASE COUNT 528 a 368 c 461 g 496 t ORIGIN 1 gatccctgag cgtgtggcag cagtgcggtc gtggtccctc cctatgcagc ctggtttcta 61 gcgtgacacg cccttgactt gaggaccatg aaccgcagcc gccaggtgac gtgcgtggcc 121 tgggtccgct gcggcgtggc caaagagaca ccagacaagg tagagctgag taaagaagaa 181 gtaaaacgcc tcattgctga ggcaaaggag aaattgcaag aagaaggtgg tggcagtgat 241 gaagaggaga caggcagtcc ttcagaagat ggcatgcaga gtgcacgcac ccaggcacgc 301 ccaagagagc ccctggagga tggtgaccca gaggatgaca ggacgcttga tgatgatgag 361 ctggctgagt acgacttaga taaatatgat gaggaaggtg acccagatgc tgagactctt 421 ggtgaatctc tcttgggtct tacggtctac gggagtaatg atcaagatcc ttacgttact 481 ctgaaagata cagaacaata tgaacgtgaa gatttcttga ttaagcccag tgataatctt 541 atagtttgtg gccgagctga acaggaccag tgcaatttag aggtgcatgt ttataatcaa 601 gaagaagact ctttttatgt acaccatgat atactcttgt ctgcatatcc tctgagtgtg 661 gaatggctga attttgatcc tagcccagat gattctactg gaaattacat tgctgtagga 721 aacatgaccc ctgttattga agtgtgggac cttgatatag tggactcttt agagccagtc 781 ttcacactcg gaagtaaact ttcaaaaaag aagaaaaaga aaggaaagaa gagttcctca 841 gcagaagggc ataccgatgc tgtccttgac ctttcatgga ataagctaat cagaaatgtt 901 ttagcaagtg catcagctga caacactgta attctgtggg atatgtcctt ggggaaacca 961 gcagctagcc tcgctgtaca cacagacaag gtccaaacac tgcagtttca tccatttgaa 1021 gcacagactc tgatttctgg ctcatatgat aagtcagtgg ctttgtatga ctgccgaagt 1081 ccagatgaaa gccatcgaat gtggcgattc agtgggcaga tagagagagt gacttggaat 1141 cacttttcac cttgtcattt cttggccagt acagatgacg gctttgtata taatttggat 1201 gcacgttcag ataagccaat ttttacactt aatgcacaca atgatgaaat ctctggtctt 1261 gatcttagca gtcaaatcaa gggctgtctc gtgactgctt cagctgacaa atacgtgaag 1321 atctgggaca tcttaggaga taggccaagt ctagttcatt ctagggacat gaaaatggga 1381 gttctcttct gttcttcatg ttgccctgat ttgccattta tttatgcctt tggaggtcaa 1441 aaagaagggc ttcgggtctg ggatataagc acagtctctt cagtaaatga agcatttgga 1501 agacgagaga ggcttgttct tgggagtgca agaaattcat ctattagtgg cccttttggc 1561 agcaggagct cagatacacc catggagtct taatgaagat catctaattt cctgcttacc 1621 ttaactggga attttaaaaa gttggcctaa aaatgttcca tgcgtggcag caaccatgca 1681 gagtgactga aacacaattc atttctgact gacattcctt tctgcaactg cggtggcacc 1741 acaaatatcc ggtctttgtg cttgctcttc agatggatgg tttgtaaggc tcttgttgca 1801 tttcttaaaa aagagtaata aaaaggattt ttaaaaagta attccttaaa cat // LOCUS HUM5AR 2102 bp mRNA PRI 15-SEP-1990 DEFINITION Human steroid 5-alpha-reductase mRNA, complete cds. ACCESSION M32313 NID g177766 KEYWORDS dihydrotestosterone; steroid 5-alpha-reductase. SOURCE Human adult prostate, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2102) AUTHORS Andersson,S. and Russell,D.W. TITLE Structural and biochemical properties of cloned and expressed human and rat steroid 5-alpha-reductases JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3640-3644 (1990) MEDLINE 90251612 COMMENT Draft entry and computer-readable sequence [1] kindly submitted by D.W. Russell, 23-FEB-1990, for release after publication. FEATURES Location/Qualifiers source 1..2102 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..2102 /note="steroid 5-alpha-reductase mRNA" CDS 31..810 /note="steroid 5-alpha-reductase (EC 1.3.99.5)" /codon_start=1 /db_xref="PID:g177767" /translation="MATATGVAEERLLAALAYLQCAVGCAVFARNRQTNSVYGRHALP SHRLRVPARAAWVVQELPSLALPLYQYASESAPRLRSAPNCILLAMFLVHYGHRCLIY PFLMRGGKPMPLLACTMAIMFCTCNGYLQSRYLSHCAVYADDWVTDPRFLIGFGLWLT GMLINIHSDHILRNLRKPGDTGYKIPRGGLFEYVTAANYFGEIMEWCGYALASWSVQG AAFAFFTFCFLSGRAKEHHEWYLRKFEEYPKFRKIIIPFLF" BASE COUNT 470 a 482 c 486 g 664 t ORIGIN 1 gggcatggag cacgctgccc agccctggcg atggcaacgg cgacgggggt ggcggaggag 61 cgcctgctgg ccgcgctcgc ctacctgcag tgcgccgtgg gctgcgcggt cttcgcgcgg 121 aatcgtcaga cgaactcagt gtacggccgc cacgcgctgc ccagccacag gctccgagtg 181 ccggcgcggg ccgcctgggt ggtgcaggag ctgccctcgc tggccctgcc gctctaccag 241 tacgccagcg agtccgcccc gcgtctccgc agcgcgccca actgcatcct cctggccatg 301 ttcctcgtcc actacgggca tcggtgctta atttacccgt ttctgatgcg aggaggaaag 361 cctatgccac tgttggcatg tacaatggcg attatgttct gtacctgtaa cggctatttg 421 caaagcagat acttgagcca ttgtgcagtg tatgctgatg actgggtaac agatccccgt 481 tttctaatag gttttggctt gtggttaaca ggcatgttga taaacatcca ttcagatcat 541 atcctaagga atctcagaaa accaggagat actggataca aaataccaag gggaggctta 601 tttgaatacg taactgcagc caactatttt ggagaaatca tggagtggtg tggctatgcc 661 ctggccagct ggtctgtcca aggcgcggct tttgctttct tcacgttttg ttttttatct 721 ggtagagcaa aagagcatca tgagtggtac ctccggaaat ttgaagagta tccaaagttc 781 agaaaaatta taattccatt tttgttttaa gtgcgttttt catgaaatta tcttcaactt 841 gaagctttcc aatggcgctt ctctatggac tttgtaaata agttatatct ttgtaatttt 901 cctgctactt tatcattttc aagatgtcct ctaggaattt tttttctagt aattttgcaa 961 tctacctaat aagtacctaa atacgctgaa atggaggttg aatatcctac tgtgtaacag 1021 gtcagaattt caagctctgg gtaataactg ctgatatttt ttctaatttc aaatttacct 1081 cttttggcta tgtcttgcca agtgtgtatg agactagact ttacaactgt ctttgatggc 1141 attttcagaa caataaatgt cacaatccct tctatagccc cctacagtga tctcttcaag 1201 gtcaactgca gtgttgcttc cctcccccta tagggctgga atctgtctag gagccctctc 1261 tcggaggcca cagaggctgg gggtagccat tgtgcagtca tggcccgggg gaaacttgcc 1321 aaccttcgtg tcaggtgctg tgtgtaagtg gagaacttgg ggatagagga ggaagctcct 1381 cgtggccctt ccaaggtgag gcaaaggcat ctggacttgt tccagcccag cccaccgggt 1441 gacatcaccg ggcagggagg ggtgctggtg gtggttcata cggagtaagc tgctctgcct 1501 gtgtgagtgg ctcctgggcc ctaaacaggc acctttaggc catgggtcac tcaccgtgag 1561 ccatcaatgt gctctggtct gacatggttt ctctctgtct tctagtctag acctagtttt 1621 tttgttctgt tccccacgta tggatatagt agagattgtt gtctgtgaaa tttctctttt 1681 gtagattttg agttttccct tgtagtgtaa agaatgatca ctttctgtaa caataacaag 1741 accacttttt aagatttatc ctgtttgttc tttgttgatt gaaacataat aattgttaaa 1801 attctctaca gccttctttt tcttccatag ctaatcttcc ttctaatagt ttttgctttc 1861 tgttttgctg ttgttgcttt gcaaagcttt cccctcatag cctgtacctg ttatcaatat 1921 aaaataatct tcctgttgaa tgcttcatga cttgaattct actttgataa aaacattgcc 1981 atactgcttt ttatcttgat gaattcatct ggcattgctt tgccttatca tctcatctgg 2041 agtttttaaa tgccatttgt ttcagttgtc tttaacaaca taataaatag actttgccat 2101 tt // LOCUS HUM5HSR 1984 bp mRNA PRI 21-MAR-1996 DEFINITION Homo sapiens 5-HT6 serotonin receptor mRNA, complete cds. ACCESSION L41147 NID g1162923 KEYWORDS 5-HT6 serotonin receptor. SOURCE Homo sapiens adult Brain striatum cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1984) AUTHORS Kohen,R., Metcalf,M.A., Khan,N., Druck,T., Huebner,K., Lachowicz,J.E., Meltzer,H.Y., Sibley,D.R., Roth,B.L. and Hamblin,M.W. TITLE Cloning, characterization, and chromosomal localization of a human 5-HT6 serotonin receptor JOURNAL J. Neurochem. 66 (1), 47-56 (1996) MEDLINE 96102917 FEATURES Location/Qualifiers source 1..1984 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="Brain striatum" /map="1p34-1pterm" 5'UTR <1..467 mRNA <1..>1984 CDS 468..1790 /codon_start=1 /product="5-HT6 serotonin receptor" /db_xref="PID:g1162924" /translation="MVPEPGPTANSTPAWGAGPPSAPGGSGWVAAALCVVIALTAAAN SLLIALICTQPALRNTSNFFLVSLFTSDLMVGLVVMPPAMLNALYGRWVLARGLCLLW TAFDVMCCSASILNLCLISLDRYLLILSPLRYKLRMTPLRALALVLGAWSLAALASFL PLLLGWHELGHARPPVPGQCRLLASLPFVLVASGLTFFLPSGAICFTYCRILLAARKQ AVQVASLTTGMASQASETLQVPRTPRPGVESADSRRLATKHSRKALKASLTLGILLGM FFVTWLPFFVANIVQAVCDCISPGLFDVLTWLGYCNSTMNPIIYPLFMRDFKRALGRF LPCPRCPRERQASLASPSLRTSHSGPRPGLSLQQVLPLPLPPDSDSDSDAGSGGSSGL RLTAQLLLPGEATQDPPLPTRAAAAVNFFNIDPAEPELRPHPLGIPTN" 3'UTR 1791..>1984 BASE COUNT 265 a 796 c 555 g 368 t ORIGIN 1 cccgagagcg cccattcacc cccctcaccc acctccccgc gttcccactt ccccgcactc 61 tgacccggcc ggacgcccct cccctatctt gccgcccgcc ccctccaggg ggctctgctc 121 ccaccccagg gagcccatcc gacctctgct tgacttcccg ccgcttcctt caggggcctc 181 ggctcatcgg gtgcccctcc ccaaacttcc aacccgtttg ctccaggagt tcctgcccca 241 tccccgaggg cgcccaaata gccacactgt gtcctcctgt agtcgccgcc ccctgaccta 301 gcgcgaccca gcgcccccgc ccatgtcccc ccactcacct cccccggggg gcgtggtgag 361 tcgcggtctg ttctcacgga cggtccccgt ccagcctgcg cttcgccggg gccctcatct 421 gctttcccgc caccctatca ctcccttgcc gtccaccctc ggtcctcatg gtcccagagc 481 cgggcccaac cgccaatagc accccggcct ggggggcagg gccgccgtcg gccccggggg 541 gcagcggctg ggtggcggcc gcgctgtgcg tggtcatcgc gctgacggcg gcggccaact 601 cgctgctgat cgcgctcatc tgcactcagc ccgcgctgcg caacacgtcc aacttcttcc 661 tggtgtcgct cttcacgtct gacctgatgg tggggctggt ggtgatgccg ccggccatgc 721 tgaacgcgct gtacgggcgc tgggtgctgg cgcgcggcct ctgcctgctc tggaccgcct 781 tcgacgtgat gtgctgcagc gcctccatcc tcaacctctg cctcatcagc ctggaccgct 841 acctgctcat cctctcgccg ctgcgctaca agctgcgcat gacgcccctg cgtgccctgg 901 ccctagtcct gggcgcctgg agcctcgccg ctctcgcctc cttcctgccc ctgctgctgg 961 gctggcacga gctgggccac gcacggccac ccgtccctgg ccagtgccgc ctgctggcca 1021 gcctgccttt tgtccttgtg gcgtcgggcc tcaccttctt cctgccctcg ggtgccatat 1081 gcttcaccta ctgcaggatc ctgctagctg cccgcaagca ggccgtgcag gtggcctccc 1141 tcaccaccgg catggccagt caggcctcgg agacgctgca ggtgcccagg accccacgcc 1201 caggggtgga gtctgctgac agcaggcgtc tagccacgaa gcacagcagg aaggccctga 1261 aggccagcct gacgctgggc atcctgctgg gcatgttctt tgtgacctgg ttgcccttct 1321 ttgtggccaa catagtccag gccgtgtgcg actgcatctc cccaggcctc ttcgatgtcc 1381 tcacatggct gggttactgt aacagcacca tgaaccccat catctaccca ctcttcatgc 1441 gggacttcaa gcgggcgctg ggcaggttcc tgccatgtcc acgctgtccc cgggagcgcc 1501 aggccagcct ggcctcgcca tcactgcgca cctctcacag cggcccccgg cccggcctta 1561 gcctacagca ggtgctgccg ctgcccctgc cgccggactc agattcggac tcagacgcag 1621 gctcaggcgg ctcctcgggc ctgcggctca cggcccagct gctgcttcct ggcgaggcca 1681 cccaggaccc cccgctgccc accagggccg ctgccgccgt caatttcttc aacatcgacc 1741 ccgcggagcc cgagctgcgg ccgcatccac ttggcatccc cacgaactga cccgggcttg 1801 gggctggcca atggggagct ggattgagca gaacccagac cctgagtcct tgggccagct 1861 cttggctaag accaggaggc tgcaagtctc ctagaagccc tctgagctcc agaggggtgc 1921 gcagagctga ccccctgctg ccatctccag gccccttacc tgcagggatc atagctgact 1981 caga // LOCUS HUM5HT1DA 1506 bp DNA PRI 23-MAR-1992 DEFINITION Human 5-HT1D-type serotonin receptor gene, complete cds. ACCESSION M89955 NID g177771 KEYWORDS 5-HT1D-type serotonin receptor. SOURCE Homo sapiens (library: lambda FIX II, stratagene #946203) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1506) AUTHORS Hamblin,M.W. and Metcalf,M.A. TITLE Primary structure and functional characterization of a human 5-HT1D-type serotonin receptor JOURNAL Mol. Pharmacol. 40, 143-148 (1991) MEDLINE 91342595 FEATURES Location/Qualifiers source 1..1506 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_lib="lambda FIX II, stratagene #946203" CDS 271..1404 /note="RDC4 homologue; putative" /codon_start=1 /product="5-HT1D-type serotonin receptor" /db_xref="PID:g177772" /translation="MSPLNQSAEGLPQEASNRSLNATETSEAWDPRTLQALKISLAVV LSVITLATVLSNAFVLTTILLTRKLHTPANYLIGSLATTDLLVSILVMPISIAYTITH TWNFGQILCDIWLSSDITCCTASILHLCVIALDRYWAITDALEYSKRRTAGHAATMIA IVWAISICISIPPLFWRQAKAQEEMSDCLVNTSQISYTIYSTCGAFYIPSVLLIILYG RIYRAARNRILNPPSLYGKRFTTAHLITGSAGSSLCSLNSSLHEGHSHSAGSPLFFNH VKIKLADSALERKRISAARERKATKILGIILGAFIICWLPFFVVSLVLPICRDSCWIH PALFDFFTWLGYLNSLINPIIYTVFNEEFRQAFQKIVPFRKAS" BASE COUNT 309 a 457 c 322 g 418 t ORIGIN 1 agaccttaac taccagctgg tagttgtctc agcattcttc aaatagtccg gtcttgttta 61 atattattat tattattgtt atttaatttt attttattgc aactgtactt agagaatagt 121 ctggttcttg agaccttttc actgtggtct gttctggtgt acggctccca ccagtgtgaa 181 gcagaaggat gactttgctc tgttgtcagg acaaccttga aggaaggagc caaatgtgtg 241 gaggtctgtg ggaagagaga gccacctagc atgtccccac tgaaccagtc agcagaaggc 301 cttccccagg aggcctccaa cagatccctg aatgccacag aaacctcaga ggcttgggat 361 cccaggaccc tccaggcgct caagatctcc cttgccgtgg tcctttccgt catcacactg 421 gccacagtcc tctccaatgc ctttgtactc accaccatct tactcaccag gaagctccac 481 acccctgcca actacctgat tggctccctg gccaccaccg acctcttggt ttccatcttg 541 gtaatgccca tcagcatcgc ctataccatc acccacacct ggaactttgg ccaaatcttg 601 tgtgacatct ggctgtcctc tgacatcacg tgctgcacag cctccatcct gcatctctgt 661 gtcattgctc tggacaggta ctgggcaatc acagatgccc tggaatacag taaacgcagg 721 acggctggcc acgcggccac catgatcgcc attgtctggg ccatctccat ctgcatctcc 781 atccccccgc tcttctggcg gcaggccaag gcccaggagg agatgtcgga ctgtctggtg 841 aacacctctc agatctccta caccatctac tccacctgtg gggccttcta cattccctcg 901 gtgttgctca tcatcctata tggccggatc taccgggctg cccggaaccg catcctgaat 961 ccaccctcac tctatgggaa gcgcttcacc acggcccacc tcatcacagg ctctgccggg 1021 tcctcgctct gctcgctcaa ctccagcctc catgaggggc actcgcactc ggctggctcc 1081 cctctctttt tcaaccacgt gaaaatcaag cttgctgaca gtgccctgga acgcaagagg 1141 atttctgctg ctcgagaaag gaaagccact aaaatcctgg gcatcattct gggggccttt 1201 atcatctgct ggctgccctt cttcgtggtg tctctggtcc tccccatctg ccgggactcc 1261 tgctggatcc acccggcgct ctttgacttc ttcacctggc taggctattt aaactccctc 1321 atcaatccaa taatctacac tgtgtttaat gaagagtttc ggcaagcttt tcagaaaatt 1381 gtccctttcc ggaaggcctc ctagtcttat tcggtgatga ctcttgttat cttttgtgtc 1441 ctgtaacctc atcgggattg tctttttttt ttttaattat tttctgagac ttggattaat 1501 tcatgg // LOCUS HUM5HT1E 1930 bp mRNA PRI 31-DEC-1994 DEFINITION Human serotonin receptor (5HT1E) mRNA, complete cds. ACCESSION M91467 NID g177773 KEYWORDS serotonin receptor. SOURCE Homo sapiens female cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1930) AUTHORS McAllister,G., Charlesworth,A., Snodin,C., Beer,M.S., Noble,A.J., Middlemiss,D.N., Iversen,L.L. and Whiting,P. TITLE Molecular cloning of a serotonin receptor from human brain (5HT1E): a fifth 5HT1-like subtype JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (12), 5517-5521 (1992) MEDLINE 92302274 FEATURES Location/Qualifiers source 1..1930 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="female" gene 567..1664 /gene="5HT1E" CDS 567..1664 /gene="5HT1E" /note="putative" /codon_start=1 /product="serotonin receptor" /db_xref="PID:g177774" /translation="MNITNCTTEASMAIRPKTITEKMLICMTLVVITTLTTLLNLAVI MAIGTTKKLHQPANYLICSLAVTDLLVAVLVMPLSIIYIVMDRWKLGYFLCEVWLSVD MTCCTCSILHLCVIALDRYWAITNAIEYARKRTAKRAALMILTVWTISIFISMPPLFW RSHRRLSPPPSQCTIQHDHVIYTIYSTLGAFYIPLTLILILYYRIYHAAKSLYQKRGS SRHLSNRSTDSQNSFASCKLTQTFCVSDFSTSDPTTEFEKFHASIRIPPFDNDLDHPG ERQQISSTRERKAARILGLILGAFILSWLPFFIKELIVGLSIYTVSSEVADFLTWLGY VNSLINPLLYTSFNEDFKLAFKKLIRCREHT" BASE COUNT 481 a 511 c 438 g 500 t ORIGIN 1 atcgaatgtt gagagaagca gtgctctgat ccagctcagg agaaaaagga gcgggttccg 61 agtgagactt ctggagccag ctggacgtgc cggtttgccc agtgcggcgc ggctgcacgc 121 accgtccaca agagtctcag tcgcccaggc tggagtgcag cagcacagtc tcacctcatt 181 gcaacctccg cctcccgggt tcgcgggttc tccgcctcag cttcctagta gctgggattg 241 caggcactca ccaccatgcc cggctaattt tttgaatttt tagtggagac gggatttcac 301 catgttggcc atgctggtct tgaacccccg acctcggatg attcgcccgc ctcggcctcc 361 caaagtgctg gaattacagg cgaaccttca ctcagaagaa atgctgtggc ccttcccttt 421 accaacagaa aatggaacac aagagaccac atagctgaac aaattatagc ctccttacaa 481 gtgagaaacc ttcgaggcta catagttttc agccaaagga aaataaccaa cagcttctcc 541 acagtgtaga ctgaaacaag ggaaacatga acatcacaaa ctgtaccaca gaggccagca 601 tggctataag acccaagacc atcactgaga agatgctcat ttgcatgact ctggtggtca 661 tcaccaccct caccacgttg ctgaacttgg ctgtgatcat ggctattggc accaccaaga 721 agctccacca gcctgccaac tacctaatct gttctctggc cgtgacggac ctcctggtgg 781 cagtgctcgt catgcccctg agcatcatct acattgtcat ggatcgctgg aagcttgggt 841 acttcctctg tgaggtgtgg ctgagtgtgg acatgacctg ctgcacctgc tccatcctcc 901 acctctgtgt cattgccctg gacaggtact gggccatcac caatgctatt gaatacgcca 961 ggaagaggac ggccaagagg gccgcgctga tgatccttac cgtctggacc atctccattt 1021 tcatctccat gccccctctg ttctggagaa gccaccgccg cctaagccct ccccctagtc 1081 agtgcaccat ccagcacgac catgttatct acaccattta ctccacgctg ggtgcgtttt 1141 atatcccctt gactttgata ctgattctct attaccggat ttaccacgcg gccaagagcc 1201 tttaccagaa aaggggatca agtcggcact taagcaacag aagcacagat agccagaatt 1261 cttttgcaag ttgtaaactt acacagactt tctgtgtgtc tgacttctcc acctcagacc 1321 ctaccacaga gtttgaaaag ttccatgcct ccatcaggat cccccccttc gacaatgatc 1381 tagatcaccc aggagaacgt cagcagatct ctagcaccag ggaacggaag gcagcacgca 1441 tcctggggct gattctgggt gcattcattt tatcctggct gccatttttc atcaaagagt 1501 tgattgtggg tctgagcatc tacaccgtgt cctcggaagt ggccgacttt ctgacgtggc 1561 tcggttatgt gaattctctg atcaaccctc tgctctatac gagttttaat gaagacttta 1621 agctggcttt taaaaagctc attagatgcc gagagcatac ttagactgta aaaagctaaa 1681 aggcacgact ttttccagag cctcatgagt ggatgggggt aaggggtgca acttattaat 1741 tcttgaacat acttggttca ggagagtttg taagtatgtg tggtcttgtt tccttgtttg 1801 tttgtttgtt ttgttctgtt ttgtttgagg attgttattt ggcgtgctgt tttctacctc 1861 tggtcttatc tgtgatacat aatttcaaat aaacattatc atacaaaaac aaaaaaaaaa 1921 aaaaaaaaaa // LOCUS HUM5N 3256 bp mRNA PRI 05-AUG-1996 DEFINITION Human mRNA for 5'-nucleotidase. ACCESSION D38524 NID g633070 KEYWORDS 5'-nucleotidase. SOURCE Homo sapiens placenta cDNA to mRNA, clones h14 and h28. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3256) AUTHORS Oka,J., Matsumoto,A., Hosokawa,Y. and Inoue,S. TITLE Molecular cloning of human cytosolic purine 5'-nucleotidase JOURNAL Biochem. Biophys. Res. Commun. 205 (1), 917-922 (1994) MEDLINE 95091838 REFERENCE 2 (bases 1 to 3256) AUTHORS Oka,J. TITLE Direct Submission JOURNAL Submitted (13-OCT-1994) to the DDBJ/EMBL/GenBank databases. Jun Oka, National Institute of Health and Nutrition; 1-23-1 Toyama, Shinjuku-ku, Tokyo 162, Japan (Tel:03-3203-5723(ex.3405), Fax:03-3203-0335) FEATURES Location/Qualifiers source 1..3256 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" CDS 84..1769 /EC_number="3.1.3.5" /note="putative" /codon_start=1 /product="5'-nucleotidase" /db_xref="PID:d1008110" /db_xref="PID:g633071" /translation="MSTSWSDRLQNAADMPANMDKHALKKYRREAYHRVFVNRSLAME KIKCFGFDMDYTLAVYKSPEYESLGFELTVERLVSIGYPQELLSFAYDSTFPTRGLVF DTLYGNLLKVDAYGNLLVCAHGFNFIRGPETREQYPNKFIQRDDTERFYILNTLFNLP ETYLLACLVDFFTNCPRYTSCETGFKDGDLFMSYRSMFQDVRDAVDWVHYKGSLKEKT VENLEKYVVKDGKLPLLLSRMKEVGKVFLATNSDYKYTDKIMTYLFDFPHGPKPGSSH RPWQSYFDLILVDARKPLFFGEGTVLRQVDTKTGKLKIGTYTGPLQHGIVYSGGSSDT ICDLLGAKGKDILYIGDHIFGDILKSKKRQGWRTFLVIPELAQELHVWTDKSSLFEEL QSLDIFLAELYKHLDSSSNERPDISSIQRRIKKVTHDMDMCYGMMGSLFRSGSRQTLF ASQVMRYADLYAASFINLLYYPFSYLFRAAHVLMPHESTVEHTHVDINEMESPLATRN RTSVDFKDTDYKRHQLTRSISEIKPPNLFPLAPQEITHCHDEDDDEEEEEEEE" BASE COUNT 941 a 670 c 752 g 893 t ORIGIN 1 cgcgcgttga ggcggctgca gcagttgcgc gctgggattg ttgcggtgcg ctggagccga 61 atacaaaata cagttaaaat aaaatgtcga cctcctggag tgatcggtta cagaatgcag 121 cagatatgcc tgctaacatg gataagcatg ccctgaaaaa gtatcgtcga gaagcctatc 181 atcgggtgtt tgtgaaccga agtttagcaa tggaaaagat aaagtgtttt ggttttgata 241 tggattatac ccttgctgtg tacaagtccc cagagtatga gtcccttggt tttgagctta 301 ctgtggagag attagtttct attggctatc cccaggagtt gctcagcttt gcttatgatt 361 ctacattccc taccagggga cttgtctttg acacactgta tggaaatctt ttgaaagtcg 421 atgcctatgg aaacctcttg gtctgtgcac atggatttaa ctttataagg ggaccagaaa 481 ctagagaaca gtatccaaat aaatttatcc agcgagatga tactgaaaga ttttacattc 541 tgaacacact attcaaccta ccagagacct acctgttggc ctgcctagta gattttttta 601 ctaattgtcc cagatatacc agttgtgaaa caggatttaa agatggggac ctcttcatgt 661 cctaccggag tatgttccag gatgtaagag atgctgttga ctgggttcat tacaagggct 721 cccttaagga aaagacagtt gaaaatcttg agaagtatgt agtcaaagat ggaaaactgc 781 ctttgcttct gagccggatg aaggaagtag ggaaagtatt tcttgctacc aacagtgact 841 ataaatatac agataaaatt atgacttacc tgtttgactt cccacatggc cccaagcctg 901 ggagctccca tcgaccatgg cagtcctact ttgacttgat cttggtggat gcacggaaac 961 cactcttttt tggagaaggc acagtactgc gtcaggtgga tactaaaact ggcaagctga 1021 aaattggtac ctacacaggg cccctacagc atggtatcgt ctactcagga ggttcttctg 1081 atacgatctg tgacctgttg ggagccaagg gaaaagacat tttgtatatt ggagatcaca 1141 tttttgggga cattttaaaa tcaaagaaac ggcaagggtg gcgaactttt ttggtgattc 1201 ctgaactcgc acaggagcta catgtctgga ctgacaagag ttcacttttc gaagaacttc 1261 agagcttgga tattttcttg gctgaactct acaagcatct tgacagcagt agcaatgagc 1321 gtccagacat cagttccatc cagagacgta ttaagaaagt aactcatgac atggacatgt 1381 gctatgggat gatgggaagc ctgtttcgca gtggctcccg gcagaccctt tttgccagtc 1441 aagtgatgcg ttatgctgac ctctatgcag catctttcat caacctgctg tattaccctt 1501 tcagctacct cttcagggct gcccatgtct tgatgcctca tgaatcaacg gtggagcaca 1561 cacacgtaga tatcaatgag atggagtctc ctcttgccac ccggaaccgc acatcagtgg 1621 atttcaaaga cactgactac aagcggcacc agctgacacg gtcaattagt gagattaaac 1681 ctcccaacct cttcccactg gccccccagg aaattacaca ctgccatgac gaagatgatg 1741 atgaagagga ggaggaggag gaagaataag gaggaaaacc aaaaccccaa gcacccatta 1801 aacaagtcct ggcaggactc acaggaacaa acgaggtccc tgttagggtt ctactcgggg 1861 gagggagggg gctccatgaa aggtacgtct gaaaagtttc tgaagatttt attatcatag 1921 atacttgttt tggttttgtg tatctgtact ctctgcagat ggtccaaaat tgtaatggag 1981 tctgtattag aagaaaataa gggtaaaatc aggctgaact gcatgtatat ggctccactg 2041 tggcttgtga cacttttaaa atcatccgta tgtcagtgta tctggataca cgaggaaaag 2101 gaaagagtct cagagtggaa caaagagtgg gaagaggtga tctgtaatgt tacaaattgt 2161 gctattactc caaggtccaa cttttccagt gcattacatg gtattgtata tcagtggaga 2221 aatgtattat ttccatgatc aaatgtagtc tctgttaagg tcaagttttc ttttataagc 2281 ctttaattca tcctcagtga ctctggcaag gctgcttctc tatcactggc tttgcacaga 2341 agtatgctct acttgcgttg ctttagggca ggattctatt ttgagggaaa agacagtatc 2401 cttattacct tttgtttgtt taatagcaca aatgcttatt tgttatccaa aaacaacctc 2461 cttcttatct gtgataaatc tatagaaaga atttagctgc aagtggacaa aggaacaagc 2521 ccccagaaaa gaaagggaag aactgccttc ttatactaca gaacatgcat tagtgtgggc 2581 tatatagctg tggctcatgc tacccaattc cagatttctt tgtcctctaa gagttgattg 2641 ctgtatatta aaattgaaca tcagaggatg ggaagagggc tctgtaagcc agaaccttac 2701 taaagtagag ggcacaatca gtgtgaataa atccacttca gaatctcaag tcaaggccag 2761 gcacggcggc tcacgcctgt aatcccagca ctttgggagg ccgagacagg cggatcacct 2821 gaggtcggga gttcgagacc agccttacca acatggagaa accccatctc tactaaaaat 2881 acaaaattac ctgggcgtgg tggtgcatgc ctgtaatccc atcatctact caggaggctg 2941 aggcaggaga attgcttgaa cccaggaggc ggaggttgca gtgagccagg attgtgccat 3001 tgcactccag cctgggcaac aagaacaaaa ctccatctca aaaaataaaa atcccaatcc 3061 caagtcgaaa tcacctcttg ttttaaacaa gaatgaatca ttactgtgta tgttagggta 3121 ttaaaactgt ttcaccagta cagtgaaagt tgtttcaaca ttttaaacaa acagtggtta 3181 tagactcttt ctttaaccat tgtatatttt cttccattct tgtcattggt caatagggga 3241 gggtagatta gctgct // LOCUS HUM60RO 1792 bp mRNA PRI 15-SEP-1989 DEFINITION Human 60-kdal ribonucleoprotein (Ro) mRNA, complete cds. ACCESSION J04137 NID g177782 KEYWORDS Ro ribonucleoprotein; Sjogren syndrome type A antigen; autoimmune antigen; ribonucleoprotein. SOURCE Human placenta, cDNA to mRNA, clone Ro-3. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1792) AUTHORS Deutscher,S.L., Harley,J.B. and Keene,J.D. TITLE Molecular analysis of the 60-kDa human Ro ribonucleoprotein JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 9479-9483 (1988) MEDLINE 89071722 COMMENT Draft entry and sequence for [1] kindly submitted by S.L.Deutscher, 24-FEB-1989. FEATURES Location/Qualifiers source 1..1792 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA 1..1792 /note="Ro ribonucleoprotein mRNA" CDS 40..1656 /note="Ro ribonucleoprotein" /codon_start=1 /db_xref="PID:g177783" /translation="MEESVNQMQPLNEKQIANSQDGYVWQVTDMNRLHRFLCFGSEGG TYYIKEQKLGLENAEALIRLIEDGRGCEVIQEIKSFSQEGRTTKQEPMLFALAICSQC SDISTKQAAFKAVSEVCRIPTHLFTFIQFKKDLKESMKCGMWGRALRKAIADWYNEKG GMALALAVTKYKQRNGWSHKDLLRLSHLKPSSEGLAIVTKYITKGWKEVHELYKEKAL SVETEKLLKYLEAVEKVKRTKDELEVIHLIEEHRLVREHLLTNHLKSKEVWKALLQEM PLTALLRNLGKMTANSVLEPGNSEVSLVCEKLCNEKLLKKARIHPFHILIALETYKTG HGLRGKLKWRPDEEILKALDAAFYKTFKTVEPTGKRFLLAVDVSASMNQRVLGSILNA STVAAAMCMVVTRTEKDSYVVAFSDEMVPCPVTTDMTLQQVLMAMSQIPAGGTDCSLP MIWAQKTNTPADVFIVFTDNETFAGGVHPAIALREYRKKMDIPAKLIVCGMTSNGFTI ADPDDRGMLDMCGFDTGALDVIRNFTLDMI" BASE COUNT 590 a 313 c 389 g 500 t ORIGIN Unreported. 1 attttgcctt tttgttaggt ttcctaaaga caaaaaaaaa tggaggaatc tgtaaaccaa 61 atgcagccac tgaatgagaa gcagatagcc aattctcagg atggatatgt atggcaagtc 121 actgacatga atcgactaca ccggttctta tgtttcggtt ctgaaggtgg gacttattat 181 atcaaagaac agaagttggg ccttgaaaat gctgaagctt taattagatt gattgaagat 241 ggcagaggat gtgaagtgat acaagaaata aagtcattta gtcaagaagg cagaaccaca 301 aagcaagagc ctatgctctt tgcacttgcc atttgttccc agtgctccga cataagcaca 361 aaacaagcag catttaaagc tgtttctgaa gtttgtcgca ttcctaccca tctctttact 421 tttatccagt ttaagaaaga tctgaaggaa agcatgaaat gtggcatgtg gggtcgtgcc 481 ctccggaagg ctatagcgga ctggtacaat gagaaaggtg gcatggccct tgctctggca 541 gttacaaaat ataaacagag aaatggctgg tctcacaaag atctattaag attgtcacat 601 cttaaacctt ccagtgaagg acttgcaatt gtgaccaaat atattacaaa gggctggaaa 661 gaagttcatg aattgtataa agaaaaagca ctctctgtgg agactgaaaa attattaaag 721 tatctggagg ctgtagagaa agtgaagcgc acaaaagatg agctagaagt cattcatcta 781 atagaagaac atagattagt tagagaacat cttttaacaa atcacttaaa gtctaaagag 841 gtatggaagg ctttgttaca agaaatgccg cttactgcat tactaaggaa tctaggaaag 901 atgactgcta attcagtact tgaaccagga aattcagaag tatctttagt atgtgaaaaa 961 ctgtgtaatg aaaaactatt aaaaaaggct cgtatacatc catttcatat tttgatcgca 1021 ttagaaactt acaagacagg tcatggtctc agagggaaac tgaagtggcg ccctgatgaa 1081 gaaattttga aagcattgga tgctgctttt tataaaacat ttaagacagt tgaaccaact 1141 ggaaaacgtt tcttactagc tgttgatgtc agtgcttcta tgaaccaaag agttttgggt 1201 agtatactca acgctagtac agttgctgca gcaatgtgca tggttgtcac acgaacagaa 1261 aaagattctt atgtagttgc tttttccgat gaaatggtac catgtccagt gactacagat 1321 atgaccttac aacaggtttt aatggctatg agtcagatcc cagcaggtgg aactgattgc 1381 tctcttccaa tgatctgggc tcagaagaca aacacacctg ctgatgtctt cattgtattc 1441 actgataatg agacctttgc tggaggtgtc catcctgcta ttgctctgag ggagtatcga 1501 aagaaaatgg atattccagc taaattgatt gtttgtggaa tgacatcaaa tggtttcacc 1561 attgcagacc cagatgatag aggcatgttg gatatgtgcg gctttgatac tggagctctg 1621 gatgtaattc gaaatttcac attagatatg atttaaccat aagcagcagc acgatccaga 1681 gatccattgc catcagtgat ctcactaaaa aatatacagc tacttcccag ctaatctcca 1741 cccaatgaat gatgatggta tagtatgtgc ataatggaaa gttaccttac tg // LOCUS HUM6H9A 641 bp mRNA PRI 14-MAR-1994 DEFINITION Human pre-T/NK cell associated protein (6H9A) mRNA, complete cds. ACCESSION L17330 NID g306436 KEYWORDS . SOURCE Homo sapiens fetus liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 641) AUTHORS Ranes-Goldberg,M.G., Hori,T., Mohan-Peterson,S. and Spits,H. TITLE Identification of human pre-T/NK cell associated genes JOURNAL Unpublished (1993) FEATURES Location/Qualifiers source 1..641 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="pre-T/NK" /dev_stage="fetus" /germline /tissue_type="liver" gene 95..641 /gene="6H9A" CDS 95..268 /gene="6H9A" /codon_start=1 /db_xref="PID:g306437" /translation="MRLSCLVIITITAELCVPLMLCAHGEQAQLPRGVCVLGTGTSPA WSPVLLGRLPFPH" polyA_site 641 /gene="6H9A" BASE COUNT 204 a 135 c 144 g 158 t ORIGIN 1 aaaaaaaaaa caatacaaaa aaaagaaagg aagagactgt ggaacggtgc agagccagaa 61 ggagtttgtg acttttcatc ttcaaattgt ccaaatgagg ctctcatgtc tagtgattat 121 taccattact gcagaactct gtgtgccact gatgctctgt gcccacggag aacaggcaca 181 gctgccaagg ggtgtgtgtg tgttggggac cggcacgtct cctgcttggt ctcctgtctt 241 gctcggtagg ctgccattcc cgcattaaca gccaacaatg cctgaagcgt agccttcatg 301 gagagtccac acgtctaggc aggccagaga tttgagttct tgagaataaa gccactgggc 361 caggaactca cctttcaaac cctggtagag cattaaaatt tccttgcaag accagccaat 421 gaataatggt aatcaatcaa atacaaaata actttcaaat agcattgtgc tttatagcac 481 acaaagccct tatttaacct tcaagaaaac cctgaggagt gggttttcaa aattattata 541 ttagtatccc tattatacag gtgaagacac caagagtaga aagattaagt gacaagcctg 601 gggtagcaca tagaaggcac atggctcagt aaatattgat g // LOCUS HUM8ODGTP 643 bp mRNA PRI 09-MAY-1997 DEFINITION Human mRNA for 8-oxo-dGTPase, complete cds. ACCESSION D16581 NID g2077946 KEYWORDS 8-oxo-dGTPase. SOURCE Homo sapiens lymphoma T-cell, cell-line Jurkat, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 643) AUTHORS Sekiguchi,M. TITLE Direct Submission JOURNAL Submitted (07-JUL-1993) to the DDBJ/EMBL/GenBank databases. Mutsuo Sekiguchi, Medical Institute of Bioregulation, Kyushu University, Department of Biochemistry; 3-1-1, Maidasi, Higashi-ku, Fukuoka, Fukuoka 812, Japan (E-mail:f77360a@kyushu-cc.cc.kyushu-u.ac.jp, Tel:092-641-1151(ex.3731), Fax:092-633-6801) REFERENCE 2 (bases 1 to 643) AUTHORS Sakumi,K., Furuichi,M., Tsuzuki,T., Kakuma,T., Kawabata,S., Maki,H. and Sekiguchi,M. TITLE Cloning and expression of cDNA for a human enzyme that hydrolyzes 8-oxo-dGTP, a mutagenic substrate for DNA synthesis JOURNAL J. Biol. Chem. 268 (31), 23524-23530 (1993) MEDLINE 94043152 FEATURES Location/Qualifiers source 1..643 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Jurkat" /cell_type="T-cell" /tissue_type="lymphoma" misc_signal 23..30 /note="similarity to 'Kozak sequence'" CDS 27..497 /codon_start=1 /product="8-oxo-dGTPase" /db_xref="PID:d1004529" /db_xref="PID:g452589" /translation="MGASRLYTLVLVLQPQRVLLGMKKRGFGAGRWNGFGGKVQEGET IEDGARRELQEESGLTVDALHKVGQIVFEFVGEPELMDVHVFCTDSIQGTPVESDEMR PCWFQLDQIPFKDMWPDDSYWFPLLLQKKKFHGYFKFQGQDTILDYTLREVDTV" polyA_signal 622..627 polyA_site 642 /note="one of possible polyA_sites" polyA_site 643 /note="one of possible polyA_sites" BASE COUNT 145 a 166 c 217 g 115 t ORIGIN 1 gagcggcggt gcagaaccca gggaccatgg gcgcctccag gctctatacc ctggtgctgg 61 tcctgcagcc tcagcgagtt ctcctgggca tgaaaaagcg aggcttcggg gccggccggt 121 ggaatggctt tgggggcaaa gtgcaagaag gagagaccat cgaggatggg gctaggaggg 181 agctgcagga ggagagcggt ctgacagtgg acgccctgca caaggtgggc cagatcgtgt 241 ttgagttcgt gggcgagcct gagctcatgg acgtgcatgt cttctgcaca gacagcatcc 301 aggggacccc cgtggagagc gacgaaatgc gcccatgctg gttccagctg gatcagatcc 361 ccttcaagga catgtggccc gacgacagct actggtttcc actcctgctt cagaagaaga 421 aattccacgg gtacttcaag ttccagggtc aggacaccat cctggactac acactccgcg 481 aggtggacac ggtctagcgg gagcccaggg cagcccctgg gcaggagacg tggctgctga 541 acagctgcaa accatcttca cctgggggca ttgagtggcg cagagccggg tttcatctgg 601 aattaactgg atggaaggga aaataaagct atctagcggt gaa // LOCUS HUM9G8SF 971 bp mRNA PRI 27-JUN-1994 DEFINITION Homo sapiens 9G8 splicing factor mRNA, complete cds. ACCESSION L22253 NID g506401 KEYWORDS 9G8 splicing factor; serine/arginine protein. SOURCE Homo sapiens (library: lambda ZAP-II) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 971) AUTHORS Cavaloc,Y., Popielarz,M., Gattoni,R., Fuchs,J.-P. and Stevenin,J. TITLE Characterization and cloning of the human splicing factor 9G8: a novel 35 kDa factor of the serine/arginine protein family JOURNAL EMBO J. 13, 2639-2649 (1994) MEDLINE 94283389 FEATURES Location/Qualifiers source 1..971 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="293" /tissue_lib="lambda ZAP-II" CDS 106..822 /codon_start=1 /evidence=experimental /product="9G8 splicing factor" /db_xref="PID:g506402" /translation="MSRYGRYGGETKVYVGNLGTGAGKGELERAFSYYGPLRTVWIAR NPPGFAFVEFEDPRDAEDAVRGLDGKVICGSRVRVELSTGMPRRSRFDRPPARRPFDP NDRCYECGEKGHYAYDCHRYSRRRRSRSRSRSHSRSRGRRYSRSRSRSRGRRSRSASP RRSRSISLRRSRSASLRRSRSGSIKGSRYFQSPSRSRSRSRSISRPRSSRSKSRSPSP KRSRSPSGSPRRSASPERMD" BASE COUNT 267 a 204 c 260 g 240 t ORIGIN 1 gtagtgccgc cgggactctt ggcgggtgaa ggtgtgtgtc agcttttgcg tcactcgagc 61 cctgggcgct gcttgctaaa gagccgagca cgcgggtctg tcatcatgtc gcgttacggg 121 cggtacggag gagaaaccaa ggtgtatgtt ggtaacctgg gaactggcgc tggcaaagga 181 gagttagaaa gggctttcag ttattatggt cctttaagaa ctgtatggat tgcgagaaat 241 cctccaggat ttgcctttgt ggaattcgaa gatcctagag atgcagaaga tgcagtacga 301 ggactggatg gaaaggtgat ttgtggctcc cgagtgaggg ttgaactatc gacaggcatg 361 cctcggagat cacgttttga tagaccacct gcccgacgtc cctttgatcc aaatgataga 421 tgctatgagt gtggcgaaaa gggacattat gcttatgatt gtcatcgtta cagccggcga 481 agaagaagca ggtcacggtc tagatcacat tctcgatcca gaggaaggcg atactctcgc 541 tcacgcagca ggagcagggg acgaaggtca aggtcagcat ctcctcgacg atcaagatct 601 atctctcttc gtagatcaag atcagcttca ctcagaagat ctaggtctgg ttctataaaa 661 ggatcgaggt atttccaatc cccgtcgagg tcaagatcaa gatccaggtc tatttcacga 721 ccaagaagca gccgatcaaa gtccagatct ccatctccaa aaagaagtcg ttccccatca 781 ggaagtcctc gcagaagtgc aagtcctgaa agaatggact gaagctctca agttcaccct 841 ttagggaaaa gttattttgt ttacattatt ataagggatt tgtgatgtct gtaaagtgta 901 acctaggaaa gataattcaa ccatctaatc aaaatggatc tggattacta tgtaaattca 961 cagcagtaag g // LOCUS HUMA 2676 bp mRNA PRI 21-AUG-1997 DEFINITION Homo sapiens ras interactor (RIN1) mRNA, complete cds. ACCESSION L36463 NID g1695232 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Colicelli,J., Nicolette,C., Birchmeier,C., Rodgers,L., Riggs,M. and Wigler,M. TITLE Expression of three mammalian cDNAs that interfere with RAS function in Saccharomyces cerevisiae JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88 (7), 2913-2917 (1991) MEDLINE 91187901 REFERENCE 2 (bases 1 to 2676) AUTHORS Han,L. and Colicelli,J. TITLE A human protein selected for interference with Ras function interacts directly with Ras and competes with Raf1 JOURNAL Mol. Cell. Biol. 15 (3), 1318-1323 (1995) MEDLINE 95166216 REFERENCE 3 (bases 1 to 2676) AUTHORS Han,L., Wong,D., Dhaka,A., Afar,D., White,M., Xie,W., Herschman,H., Witte,O. and Colicelli,J. TITLE Protein binding and signaling properties of RIN1 suggest a unique effector function JOURNAL Proc. Natl. Acad. Sci. U.S.A. 94 (10), 4954-4959 (1997) MEDLINE 97289700 REFERENCE 4 (bases 1 to 2676) AUTHORS Han,L. TITLE Direct Submission JOURNAL Submitted (15-MAY-1995) UCLA Medical School, Biological Chemistry, 33-257 CHSTTT, Los Angeles, CA 96066, USA FEATURES Location/Qualifiers source 1..2676 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="U118-MG" /cell_type="human glioblastoma" /clone="JC99" /clone_lib="yeast expression library" gene 1..2676 /gene="RIN1" CDS 128..2479 /gene="RIN1" /codon_start=1 /product="ras interactor" /db_xref="PID:g1695233" /translation="MESPGESGAGSPGAPSPSSFTTGHLAREKPAQDPLYDVPNASGG QAGGPQRPGRVVSLRERLLLTRPVWLQLQANAAAALHMLRTEPPGTFLVRKSNTRQCQ ALCMRLPEASGPSFVSSHYILESPGGVSLEGSELMFPDLVQLICAYCHTRDILLLPLQ LPRAIHHAATHKELEAISHLGIEFWSSSLNIKAQRGPAGGPVLPQLKARSPQELDQGT GAALCFFNPLFPGDLGPTKREKFKRSFKVRVSTETSSPLSPPAVPPPPVPVLPGAVPS QTERLPPCQLLRRESSVGYRVPAGSGPSLPPMPSLQEVDCGSPSSSEEEGVPGSRGSP ATSPHLGRRRPLLRSMSAAFCSLLAPERQVGRAAAALMQDRHTAAGQLVQDLLTQVRD GQRPQELEGIRQALSRARAMLSAELGPEKLVSPKRLEHVLEKSLHCSVLKPLRPILAA RLRRRLAADGSLGRLAEGLRLARAQGPGAFGSHLSLPSPVELEQVRQKLLQLVRTYSP SAQVKRLLQACKLLYMALRTQEGEGSGADGFLPLLSLVLAHCDLPELLLEAEYMSELL EPSLLTGEGGYYLTSLSASLALLSGLGQAHTLPLSPVQELRRSLSLWEQRRLPATHCF QHLLRVAYQDPSSGCTSKTLAVPPEASIATLNQLCATKFRVTQPNTFGLFLYKEQGYH RLPPGALAHRLPTTGYLVYRRAEWPETQGAVTEEEGSGQSEARSRGEEQGCQGDGDAG VKASPRDIREQSETTAEGGQGQAQEGPAQPGEPEAEGSRAAEE" misc_feature 332..616 /gene="RIN1" /note="encodes SH2 domain" misc_feature 902..931 /gene="RIN1" /note="encodes proline-rich sequence; SH3 binding domain" old_sequence 1185..1187 /gene="RIN1" /note="Sequence correction: position of additional nucleotide (G) that alters and extends the open reading frame of JC99 (M37190).; gc" /citation=[1] /replace="" misc_feature 1414..1597 /gene="RIN1" /note="sequence absent in alternatively spliced isoform" BASE COUNT 471 a 925 c 853 g 427 t ORIGIN 1 gtggacacac aagagttaac tggcgggtgt gacaggcgga ccgccctcag gaagtgttac 61 tcactgggga tgtgcgtgcc ttgccttggg actggattct cttcctgaag cgaaggggct 121 cccagccatg gaaagccctg gagagtcagg cgcgggctct cctggagccc ccagcccgtc 181 cagcttcact actgggcacc tggcgagaga aaagccagcc caggacccac tgtatgacgt 241 gcccaatgcc agcggcgggc aggcaggcgg gccgcagcgg ccggggcgcg ttgtgagcct 301 gcgggagcgc ctgctgctca cccggcccgt gtggctgcag ctgcaagcca acgcagcggc 361 cgcactgcac atgctgagga ccgagccccc ggggacgttc ctcgtgcgga aatctaacac 421 ccgccagtgc caggccctgt gcatgcggtt gcctgaagcc agtggcccct ccttcgtctc 481 cagccactac atcctggaga gccctggcgg cgtctccttg gagggctcgg agctcatgtt 541 cccagaccta gtccagctca tctgtgccta ctgccacacc cgggacatcc ttctcctccc 601 gctgcagctc cccagagcca tccaccacgc agccactcac aaagagctgg aggccatctc 661 ccatctgggc attgagttct ggagctcctc cctcaacatc aaggctcagc ggggcccggc 721 tggaggccca gtgttgcccc agctgaaggc ccggtcccct caagagctgg accagggcac 781 cggagccgcc ttgtgcttct tcaaccccct gttcccgggg gacctagggc ccaccaagcg 841 ggagaaattc aagagaagct tcaaagtgcg cgtgtccaca gagacctcca gccccctgtc 901 tccacctgcc gtgccacctc cccccgtccc cgtgctgcca ggggcagtcc ccagccagac 961 agagcggctg cccccttgcc agctgctacg gagggagagc tcagtggggt accgcgtgcc 1021 agcaggcagt ggccctagcc ttccgcctat gccctccctc caagaggtgg actgcggctc 1081 ccccagcagc tccgaggagg agggggtgcc agggtcccgg gggagcccag cgacctcacc 1141 ccacctgggc cgccgacgac ctctgcttcg gtccatgagc gccgccttct gctccctact 1201 ggcaccggag cggcaggtgg gccgggctgc ggcagcactg atgcaggacc gacacacagc 1261 cgcgggccag ctggtgcagg acctactgac ccaggtgcgg gatgggcaga ggccccagga 1321 gctcgagggc atccgtcagg cgctgagccg ggcccgggcc atgctgagtg cggagctggg 1381 ccctgagaag ctcgtgtcgc ctaagaggct ggaacatgtc ctggagaagt cattgcattg 1441 ctctgtgctc aagcctctcc ggcccatcct ggcagcccgc ctgcggcgcc ggcttgccgc 1501 agacggctcc ctgggccgcc tagctgaggg cctccgcctg gcccgggccc agggccccgg 1561 agccttcggg tcccacctga gcctgccctc cccagtagag ttggagcaag tgcgccagaa 1621 gctgctgcag ctcgtccgca cctactcacc cagcgcccag gtcaagcggc tcctgcaggc 1681 ctgcaagctg ctctacatgg ccctgaggac ccaggaaggg gagggctcgg gtgccgacgg 1741 gttcctgcct ctgctgagcc tcgtcttggc ccactgtgac cttcctgagc tgctgctgga 1801 ggccgagtac atgtcggagc tgctggagcc cagcctgctt actggagagg gtggctacta 1861 cctgaccagc ctctctgcca gcctggccct gctgagtggc ctgggtcagg cccacaccct 1921 cccactgagc cccgtgcagg agctacggcg ctccctcagc ctctgggagc agcgccgcct 1981 gcctgccacc cactgcttcc agcacctcct ccgagtagcc tatcaggatc ccagcagtgg 2041 ctgcacctcc aagaccctgg ccgtgccccc agaggcctcg attgccaccc tgaaccagct 2101 ctgtgccacc aagttccgag tgacccagcc caacactttt ggcctcttcc tgtacaagga 2161 gcagggctac caccgcctgc cccctggggc cctggcccac aggctgccca ccactggcta 2221 cctcgtctac cgccgggcag agtggcctga gacccagggg gctgtgacag aggaggaggg 2281 cagtgggcag tcagaggcaa gaagcagagg ggaggagcaa gggtgccagg gagatgggga 2341 tgctggggtc aaagccagcc ccagggacat tcgggaacag tctgagacaa ctgctgaagg 2401 gggccagggt caagcccagg aaggccctgc tcagccaggg gaaccagagg cagagggaag 2461 ccgggcagca gaggagtagc ttgaagtggc cagaagggtc attcggggcg ggagaccctg 2521 agcctgctga gaaatccttt tagcgccagc aagccccacc cagggccctg tcctgtgtct 2581 gccaccacct ttgtctgata cttgtttcca gggaagctgg gggaactgcc acatctgagg 2641 aactggaata aagatgaggg gccttcgggg gccaat // LOCUS HUMA15 1743 bp mRNA PRI 13-MAR-1993 DEFINITION Human mRNA for cell surface glycoprotein, complete cds. ACCESSION D10653 NID g285900 KEYWORDS ME491/CD63 superfamily; cell surface glycoprotein. SOURCE Homo sapiens (library: lambda gt10) immature T cell line HPB-ALL cDNA to mRNA, clone A15. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1743) AUTHORS Emi,N., Kitaori,K., Seto,M., Ueda,R., Saito,H. and Takahashi,T. TITLE Isolation of a novel cDNA clone showing marked similarity to ME491/CD63 superfamily JOURNAL Immunogenetics 37 (3), 193-198 (1993) MEDLINE 93131291 REFERENCE 2 (bases 1 to 1743) AUTHORS Emi,N. TITLE Direct Submission JOURNAL Submitted (29-FEB-1992) to the DDBJ/EMBL/GenBank databases. Nobuhiko Emi, Nagoya University School of Medicine, First Dept. of Internal Medicine; 65 Tsurumai, Showa-ku, Nagoya, Aichi 466, Japan (Tel:052-741-2111, Fax:052-741-1612) COMMENT Submitted (29-FEB-1992) to DDBJ by: Nobuhiko Emi First Department of Internal Medicine Nagoya University School of Medicine Tsurumai, Showaku Nagoya 466 Japan Phone: 052-741-2111 Fax: 052-741-1612. FEATURES Location/Qualifiers source 1..1743 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HPB-ALL" /cell_type="lymphocyte" /clone_lib="lambda gt10" /tissue_type="immature T cell" gene 25..759 /gene="A15" CDS 25..759 /gene="A15" /codon_start=1 /product="cell surface glycoprotein" /db_xref="PID:d1001976" /db_xref="PID:g285901" /translation="METKPVITCLKTLLIIYSFVFWITGVILLAVGVWGKLTLGTYIS LIAENSTNAPYVLIGTGTTIVVFGLFGCFATCRGSPWMLKLYAMFLSLVFLAELVAGI SGFVFRHEIKDTFLRTYTDAMQTYNGNDERSRAVDHVQRSLSCCGVQNYTNWSTSPYF LEHGIPPSCCMNETDCNPQDLHNLTVAATKVNQKGCYDLVTSFMETNMGIIAGVAFGI AFSQLIGMLLACCLSRFITANQYEMV" polyA_signal 1722..1727 polyA_site 1743 BASE COUNT 432 a 424 c 384 g 503 t ORIGIN 1 ctaaagagta tggcatcgag gagaatggag accaaacctg tgataacctg tctcaaaacc 61 ctcctcatca tctactcctt cgtcttctgg atcactgggg tgatcctgct ggctgttgga 121 gtctggggca aacttactct gggcacctat atctccctta ttgccgagaa ctccacaaat 181 gctccctatg tgctcatcgg aactggcacc actattgttg tctttggcct gtttggatgc 241 tttgctacat gtcgtggtag cccatggatg ctgaaactgt atgccatgtt tctgtccctg 301 gtgttcctgg ctgagctcgt agctggcatt tcagggtttg tgtttcgtca tgagatcaag 361 gacaccttcc tgaggactta cacggacgct atgcagactt acaatggcaa tgatgagagg 421 agccgggcag tggaccatgt gcagcgcagc ctgagctgct gtggtgtgca gaactacacc 481 aactggagca ccagccccta cttcctggag catggcatcc cccccagctg ctgcatgaac 541 gaaactgatt gtaatcccca ggatctacac aatctgactg tggccgccac caaagttaac 601 cagaagggtt gttatgatct ggtaactagt ttcatggaga ctaacatggg aatcatcgct 661 ggagtggcgt ttggaatcgc attctcccag ttaattggca tgctgctggc ctgctgtctg 721 tcccggttca tcacggccaa tcagtatgag atggtgtaag gagaagtctt tcaagaatga 781 cggaataaga gacctgtttt aaaaaggaac tgcagcaatc tttgaaagac ttccaaagaa 841 tgttagagca cagtacataa tacacttgcc ctgctccctc taccccttac cccacaacgt 901 gcaactgaca ctcccaccca gtctctgctc cacctttcag cccacgtcac gtgtagtgtc 961 cattttgtga agccctgttg tgccacagag tgtagccagg tccccctgca gctagtccta 1021 gtgaacctca ccccgaggcc ctgcatgggc agcccctcca tctgtacttg gtccaactgc 1081 aactcatcat cggtgactgg ttatcacacc atcgctcgct ttgggccctg catgtagtgt 1141 gggaggctcc tgttagctcc tcactgtggt aaatgccaca cacctttaag tagataagca 1201 gacgatagtt atctgttctt ttgacttaat ctcatttggt ttgattttcc ctctactaag 1261 gctttcctac cttcttcagg ctgcctaaga catgtaagcg aaacacttca ataattgtcc 1321 atgaggagaa aaaaagcatt gtcatgcatg aaggaaactg aacttgaggt ggcctccttg 1381 cttgttacat acctgggtat gtgtaggcag tttagtgcat ctttgcctct cagttgaaac 1441 ctgtataacc ctgttacaaa gctgtgttgt tgcttcttgt gaaggccatg atattttgtt 1501 ttttccccaa ttaattgcta ttgtgttatt ttactaactt ctctctgtat tttttcttgc 1561 attgacatta tagacattga ggacctcatc caaacaattt aaaaatgagt gtgaaggggg 1621 aacaagtcaa aatattttta aaagatcttc aaaaataatg cctctgtcta gcatgccaac 1681 aagaatgcat tgatattgtg aacatttgtg atatatgtat taataaatag agcaattaca 1741 agc // LOCUS HUMA1AADR 2002 bp mRNA PRI 04-NOV-1991 DEFINITION Human alpha-A1-adrenergic receptor mRNA, complete cds. ACCESSION M76446 NID g177806 KEYWORDS G-protein linked receptor; alpha-1A-adrenergic receptor; transmembrane protein. SOURCE Homo sapiens female 2 years old brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2002) AUTHORS Bruno,J.F., Whittaker,J., Song,J. and Berelowitz,M. TITLE Molecular cloning and sequencing of a cDNA encoding a human alpha-1A adrenergic receptor JOURNAL Biochem. Biophys. Res. Commun. 179, 1485-1490 (1991) MEDLINE 92028892 FEATURES Location/Qualifiers source 1..2002 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="2 years old" /sex="female" /tissue_type="brain" CDS 56..1561 /codon_start=1 /product="alpha-1A-adrenergic receptor" /db_xref="PID:g177807" /translation="MAAALRSVMMAGYLSEWRTPTYRSTEMVQRLRMEAVQHSTSTAA VGGLVVSAQGVGVGVFLAAFILMAVAGNLLVILSVACNRHLQTVTNYFIVNLAVADLL LSATVLPFSATMEVLGFWAFGRAFCDVWAAVDVLCCTASILSLCTISVDRYVGVRHSL KYPAIMTERKAAAILALLWVVALVVSVGPLLGWKEPVPPDERFCGITEEAGYAVFSSV CSFYLPMAVIVVMYCRVYVVARSTTRSLEAGVKRERGKASEVVLRIHCRGAATGADGA HGMRSAKGHTFRSSLSVRLLKFSREKKAAKTLAIVVGVFVLCWFPFFFVLPLGSLFPQ LKPSEGVFKVIFWLGYFNSCVNPLIYPCSSREFKRAFLRLLRCQCRRRRRRRPLWRVY GHHWRASTSGLRQDCAPSSGDAPPGAPLALTALPDPDPEPPGTPEMQAPVASRRSHPA PSASGGCWGRSGDPRPSCAPKSPACRTRSPPGARSAQRQRAPSAQRWRLCP" BASE COUNT 307 a 697 c 643 g 355 t ORIGIN 1 cccgtgcagg ggccctacgg acaccaccag ggctacgacc cagagcaggg ccaggatggc 61 ggccgccttg cgctcggtca tgatggctgg gtacttgagt gagtggcgca cgcccacgta 121 ccggtccacg gagatggtgc agaggctgag gatggaggcc gtgcagcaca gcacgtccac 181 ggcggccgtc gggggactgg tggtgagcgc gcagggcgtg ggcgtgggcg tcttcctggc 241 agccttcatc cttatggccg tggcaggtaa cctgcttgtc atcctctcag tggcctgcaa 301 ccgccacctg cagaccgtca ccaactattt catcgtgaac ctggccgtgg ccgacctgct 361 gctgagcgcc accgtactgc ccttctcggc caccatggag gttctgggct tctgggcctt 421 tggccgcgcc ttctgcgacg tatgggccgc cgtggacgtg ctgtgctgca cggcctccat 481 cctcagcctc tgcaccatct ccgtggaccg gtacgtgggc gtgcgccact cactcaagta 541 cccagccatc atgaccgagc gcaaggcggc cgccatcctg gccctgctct gggtcgtagc 601 cctggtggtg tccgtagggc ccctgctggg ctggaaggag cccgtgcccc ctgacgagcg 661 cttctgcggt atcaccgagg aggcgggcta cgctgtcttc tcctccgtgt gctccttcta 721 cctgcccatg gcggtcatcg tggtcatgta ctgccgcgtg tacgtggtcg cgcgcagcac 781 cacgcgcagc ctcgaggcag gcgtcaagcg cgagcgaggc aaggcctccg aggtggtgct 841 gcgcatccac tgtcgcggcg cggccacggg cgccgacggg gcgcacggca tgcgcagcgc 901 caagggccac accttccgca gctcgctctc cgtgcgcctg ctcaagttct cccgtgagaa 961 gaaagcggcc aagactctgg ccatcgtcgt gggtgtcttc gtgctctgct ggttcccttt 1021 cttctttgtc ctgccgctcg gctccttgtt cccgcagctg aagccatcgg agggcgtctt 1081 caaggtcatc ttctggctcg gctacttcaa cagctgcgtg aacccgctca tctacccctg 1141 ttccagccgc gagttcaagc gcgccttcct ccgtctcctg cgctgccagt gccgtcgtcg 1201 ccggcgccgc cgccctctct ggcgtgtcta cggccaccac tggcgggcct ccaccagcgg 1261 cctgcgccag gactgcgccc cgagttcggg cgacgcgccc cccggagcgc cgctggccct 1321 caccgcgctc cccgaccccg accccgaacc cccaggcacg cccgagatgc aggctccggt 1381 cgccagccgt cgaagccacc cagcgccttc cgcgagtgga ggctgctggg gccgttccgg 1441 agacccacga cccagctgcg cgccaaagtc tccagcctgt cgcacaagat cgccgccggg 1501 ggcgcgcagc gcgcagaggc agcgtgcgcc cagcgctcag aggtggaggc tgtgtcccta 1561 ggcgtcccac acgaggtggc cgagggcgcc acctgccagg cctacgaatt ggccgactac 1621 agcaacctac gggagaccga tatttaagga ccccagagct aggccgcgga gtgtgctggg 1681 cttgggggta agggggacca gagaggcggg ctggtgttct aagagccccc gtgcaaatcg 1741 gagacccgga aactgatcag ggcagctgct ctgtgacatc cctgaggaac tgggcagagc 1801 ttgaggctgg agcccttgaa aggtgaaaag tagtggggcc ccctgctgga ctcaggtgcc 1861 cagaactctt ttcttagaag ggagaggctg cgggctccgt ggggcctttt gctcccaatc 1921 cctatttgag aaacactgcc ccatcctcca tgccctgaac cctgagtaga cagccccaag 1981 catggccagg aaggcctgcc cc // LOCUS HUMA1ACM 1520 bp mRNA PRI 30-OCT-1994 DEFINITION Human alpha-1-antichymotrypsin (AACT) mRNA, complete cds. ACCESSION K01500 NID g177808 KEYWORDS alpha-1-antichymotrypsin. SOURCE Human liver, cDNA to mRNA, library of Chandra et al, clone phACT235. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1520) AUTHORS Chandra,T., Stackhouse,R., Kidd,V.J., Robson,K.J. and Woo,S.L. TITLE Sequence homology between human alpha 1-antichymotrypsin, alpha 1-antitrypsin, and antithrombin III JOURNAL Biochemistry 22 (22), 5055-5061 (1983) MEDLINE 84080367 COMMENT [1] reports that the deduced amino acid sequence is 42% homologous with alpha-1-antitrypsin (mostly in the N-terminal half). It is only 33% homologous with human antithrombin III. The signal peptide contains two potential start codons [1]; the first is used in the Features table. FEATURES Location/Qualifiers source 1..1520 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /tissue_lib="Chandra et al." /map="14q32.1" mRNA <1..1520 /note="alpha-1-antichymotrypsin mRNA" sig_peptide 12..86 /gene="AACT" /product="alpha-1-antichymotrypsin" gene 12..1313 /gene="AACT" CDS 12..1313 /gene="AACT" /codon_start=1 /db_xref="GDB:G00-118-955" /product="alpha-1-antichymotrypsin" /db_xref="PID:g177809" /translation="MERMLPLLALGLLAAGFCPAVLCHPNSPLDEENLTQENQDRGTH VDLGLASANVDFAFSLYKQLVLKALDKNVIFSPLSISTALAFLSLGAHNTTLTEILKA SSSPHGDLLRQKFTQSFQHLRAPSISSSDELQLSMGNAMFVKEQLSLLDRFTEDAKRL YGSEAFATDFQDSAAAKKLINDYVKNGTRGKITDLIKDPDSQTMMVLVNYIFFKAKWE MPFDPQDTHQSRFYLSKKKWVMVPMMSLHHLTIPYFRDEELSCTVVELKYTGNASALF ILPDQDKMEEVEAMLLPETLKRWRDSLEFREIGELYLPKFSISRDYNLNDILLQLGIE EAFTSKADLSGITGARNLAVSQVVHKVVSDVFEEGTEASAATAVKITLLSALVETRTI VRFNRPFLMIIVPTDTQNIFFMSKVTNPSKPRACIKQWGSQ" mat_peptide 87..1310 /gene="AACT" /note="G00-118-955" /product="alpha-1-antichymotrypsin" BASE COUNT 372 a 418 c 392 g 338 t ORIGIN 146 bp 5' to AvaII site. 1 cagagttgag aatggagaga atgttacctc tcctggctct ggggctcttg gcggctgggt 61 tctgccctgc tgtcctctgc caccctaaca gcccacttga cgaggagaat ctgacccagg 121 agaaccaaga ccgagggaca cacgtggacc tcggattagc ctccgccaac gtggacttcg 181 ctttcagcct gtacaagcag ttagtcctga aggcccttga taagaatgtc atcttctccc 241 cactgagcat ctccaccgcc ttggccttcc tgtctctggg ggcccataat accaccctga 301 cagagattct caaggcctcg agttcacctc acggagactt actgaggcag aaattcactc 361 agagcttcca gcacctccgc gcaccctcaa tcagttccag cgatgagctg cagctgagta 421 tgggaaatgc catgtttgtc aaagagcaac tcagtctgct ggacaggttc acggaggatg 481 ccaagaggct gtatggctcc gaggcctttg ccactgactt tcaggactca gctgcagcta 541 agaagctcat caacgactac gtgaagaatg gaactagggg gaaaatcaca gatctgatca 601 aggaccccga ctcgcagaca atgatggtcc tggtgaatta catcttcttt aaagccaaat 661 gggagatgcc ctttgacccc caagatactc atcagtcaag gttctacttg agcaagaaaa 721 agtgggtaat ggtgcccatg atgagtttgc atcacctgac tataccttac ttccgggacg 781 aggagctgtc ctgcaccgtg gtggagctga agtacacagg caatgccagc gcactcttca 841 tcctccctga tcaagacaag atggaggaag tggaagccat gctgctccca gagaccctga 901 agcggtggag agactctctg gagttcagag agataggtga gctctacctg ccaaagtttt 961 ccatctcgag ggactataac ctgaacgaca tacttctcca gctgggcatt gaggaagcct 1021 tcaccagcaa ggctgacctg tcagggatca caggggccag gaacctagca gtctcccagg 1081 tggtccataa ggtcgtgtct gatgtatttg aggagggcac agaagcatct gctgccacag 1141 cagtcaaaat caccctcctt tctgcattag tggagacaag gaccattgtg cgtttcaaca 1201 ggcccttcct gatgatcatt gtccctacag acacccagaa catcttcttc atgagcaaag 1261 tcaccaatcc cagcaagcct agagcttgca tcaagcagtg gggctctcag taaggaactt 1321 ggaatgcaag ctggatgcct gggtctctgg gcacagctgg cccctgtgca ccgtagtggc 1381 catggcatgt gtggccctgt ctgcttatcc ttggaaggtg acagcgattc cctgtgaagc 1441 tctcacacgc acaggggccc atggactctt cagtctggag ggtcctggcc tcctgacagc 1501 aataaataat ttcgttggcc // LOCUS HUMA1CKII 1677 bp mRNA PRI 30-OCT-1994 DEFINITION Human casein kinase II alpha' subunit mRNA, complete cds. ACCESSION M55268 J02924 NID g177837 KEYWORDS casein kinase II alpha'-subunit. SOURCE Human T-lymphocyte, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1677) AUTHORS Lozeman,F.J., Litchfield,D.W., Piening,C., Takio,K., Walsh,K.A. and Krebs,E.G. TITLE Isolation and characterization of human cDNA clones encoding the alpha and the alpha' subunits of casein kinase II JOURNAL Biochemistry 29 (36), 8436-8447 (1990) MEDLINE 91070071 FEATURES Location/Qualifiers source 1..1677 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T-lymphocyte" /map="Unassigned" mRNA 1..1677 /gene="CSNK2A2" /note="G00-129-561" /product="casein kinase II alpha' subunit" gene 1..1677 /gene="CSNK2A2" CDS 164..1216 /gene="CSNK2A2" /codon_start=1 /db_xref="GDB:G00-129-561" /product="casein kinase II alpha' subunit" /db_xref="PID:g177838" /translation="MPGPAAGSRARVYAEVNSLRSREYWDYEAHVPSWGNQDDYQLVR KLGRGKYSEVFEAINITNNERVVVKILKPVKKKKIKREVKILENLRGGTNIIKLIDTV KDPVSKTPALVFEYINNTDFKQLYQILTDFDIRFYMYELLKALDYCHSKGIMHRDVKP HNVMIDHQQKKLRLIDWGLAEFYHPAQEYNVRVASRYFKGPELLVDYQMYDYSLDMWS LGCMLASMIFRREPFFHGQDNYDQLVRIAKVLGTEELYGYLKKYHIDLDPHFNDILGQ HSRKRWENFIHSENRHLVSPEALDLLDKLLRYDHQQRLTAKEAMEHPYFYPVVKEQSQ PCADNAVLSSGLTAAR" polyA_signal 1660..1665 /gene="CSNK2A2" /note="G00-129-561" BASE COUNT 457 a 429 c 412 g 379 t ORIGIN 1 tgtcacccag gctggagtgc agtggcgcaa tctcagctca ctgcaacctc cacctccctg 61 gttcaagcga ttctcctgcc tcctccgccc gacgccccgc gtcccccgcc gcgccgccgc 121 cgccaccctc tgcgccccgc gccgcccccc ggtcccgccc gccatgcccg gcccggccgc 181 gggcagcagg gcccgggtct acgccgaggt gaacagtctg aggagccgcg agtactggga 241 ctacgaggct cacgtcccga gctggggtaa tcaagatgat taccaactgg ttcgaaaact 301 tggtcgggga aaatatagtg aagtatttga ggccattaat atcaccaaca atgagagagt 361 ggttgtaaaa atcctgaagc cagtgaagaa aaagaagata aaacgagagg ttaagattct 421 ggagaacctt cgtggtggaa caaatatcat taagctgatt gacactgtaa aggaccccgt 481 gtcaaagaca ccagctttgg tatttgaata tatcaataat acagatttta agcaactcta 541 ccagatcctg acagactttg atatccggtt ttatatgtat gaactactta aagctctgga 601 ttactgccac agcaagggaa tcatgcacag ggatgtgaaa cctcacaatg tcatgataga 661 tcaccaacag aaaaagctgc gactgataga ttggggtctg gcagaattct atcatcctgc 721 tcaggagtac aatgttcgtg tagcctcaag gtacttcaag ggaccagagc tcctcgtgga 781 ctatcagatg tatgattata gcttggacat gtggagtttg ggctgtatgt tagcaagcat 841 gatctttcga agggaaccat tcttccatgg acaggacaac tatgaccagc ttgttcgcat 901 tgccaaggtt ctgggtacag aagaactgta tgggtatctg aagaagtatc acatagacct 961 agatccacac ttcaacgata tcctgggaca acattcacgg aaacgctggg aaaactttat 1021 ccatagtgag aacagacacc ttgtcagccc tgaggcccta gatcttctgg acaaacttct 1081 gcgatacgac catcaacaga gactgactgc caaagaggcc atggagcacc catacttcta 1141 ccctgtggtg aaggagcagt cccagccttg tgcagacaat gctgtgcttt ccagtggtct 1201 cacggcagca cgatgaagac tggaaagcga cgggtctgtt gcggttctcc cacttttcca 1261 taagcagaac aagaaccaaa tcaaacgtct taacgcgtat agagagatca cgttccgtga 1321 gcagacacaa aacggtggca ggtttggcga gcacgaacta gaccaagcga agggcagccc 1381 accaccgtat atcaaacctc acttccgaat gtaaaaggct cacttgcctt tggcttcctg 1441 ttgacttctt cccgacccag aaagcatggg gaatgtgaag ggtatgcaga atgttgttgg 1501 ttactgttgc tccccgagcc cctcaactcg tcccgtggcc gcctgttttt ccagcaaacc 1561 acgctaacta gctgaccaca gactccacag tggggggacg ggcgcagtat gtggcatggc 1621 ggcagttaca tattattatt ttaaaagtat atattattga ataaaaggtt ttaaaag // LOCUS HUMA1SBU 1709 bp mRNA PRI 07-OCT-1996 DEFINITION Human replication factor C, 40-kDa subunit (A1) mRNA, complete cds. ACCESSION M87338 NID g1590810 KEYWORDS RFC; Activator 1. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1709) AUTHORS Chen,M., Pan,Z.Q. and Hurwitz,J. TITLE Sequence and expression in Escherichia coli of the 40-kDa subunit of activator 1 (replication factor C) of HeLa cells JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (7), 2516-2520 (1992) MEDLINE 92212860 REFERENCE 2 (bases 1 to 1709) AUTHORS Hurwitz,J. TITLE Direct Submission JOURNAL Submitted (31-DEC-1994) Molecular Biology, Memorial Sloan-Kettering Cancer Center, 1275 York Avenue, New York, NY 10021, USA FEATURES Location/Qualifiers source 1..1709 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" gene 208..1272 /gene="A1" CDS 208..1272 /gene="A1" /function="elongation of primed DNA templates by DNA polymerases delta and epsilon" /note="replicative polymerase accessory protein; activator 1" /codon_start=1 /product="replication factor C, 40-kDa subunit" /db_xref="PID:g1590811" /translation="MEVEAVCGGAGEVEAQDSDPAPAFSKAPGSAGHYELPWVEKYRP VKLNEIVGNEDTVSRLEVFAREGNVPNIIIAGPPGTGKTTSILCLARALLGPALKDAM LELNASNDRGIDVVRNKIKMFAQQKVTLPKGRHKIIILDEADSMTDGAQQALRRTMEI YSKTTRFALACNASDKIIEPIQSRCAVLRYTKLTDAQILTRLMNVIEKERVPYTDDGL EAIIFTAQGDMRQALNNLQSTFSGFGFINSENVFKVCDEPHPLLVKEMIQHCVNANID EAYKILAHLWHLGYSPEDIIGNIFRVCKTFQMAEYLKLEFIKEIGYTHMKIAEGVNSL LQMAGLLARLCQKTMAPVAS" BASE COUNT 468 a 429 c 461 g 351 t ORIGIN 1 caatttgagt ttccatttct cggatttggg aactggtata agcattgtct gtgatgtaaa 61 caaagtcttc aatatttgga gaaaacatct cctcatactt gagagcacaa gaggaagaga 121 gagaccctca ctgctgggga gtccctgcca cactcagtcc cccaccacac tgaatcggaa 181 ttccgagagg gaagaggagg cgcgagaatg gaggtggagg ccgtctgtgg tggcgcgggc 241 gaggtggagg cccaggactc tgaccctgcc cctgccttca gcaaggcccc cggcagcgcc 301 ggccactacg aactgccgtg ggttgaaaaa tataggccag taaagctgaa tgaaattgtc 361 gggaatgaag acaccgtgag caggctagag gtctttgcaa gggaaggaaa tgtgcccaac 421 atcatcattg cgggccctcc aggaaccggc aagaccacaa gcattctgtg cttggcccgg 481 gccctgctgg gcccagcact caaagatgcc atgttggaac tcaatgcttc aaatgacagg 541 ggcattgacg ttgtgaggaa taaaattaaa atgtttgctc aacaaaaagt cactcttccc 601 aaaggccgac ataagatcat cattctggat gaagcagaca gcatgaccga cggagcccag 661 caagccttga ggagaaccat ggaaatctac tctaaaacca ctcgcttcgc ccttgcttgt 721 aatgcttcgg ataagatcat cgagcccatt cagtcccgct gtgcagtcct ccggtacaca 781 aagctgaccg acgcccagat cctcaccagg ctgatgaatg ttatcgagaa ggagagggta 841 ccctacactg atgacggcct agaagccatc atcttcacgg cccagggaga catgaggcag 901 gcgctgaaca acctgcagtc caccttctca ggatttggct tcattaacag tgagaacgtg 961 ttcaaggtct gtgacgagcc ccacccactg ctggtaaagg agatgatcca gcactgtgtg 1021 aatgccaaca ttgacgaagc ctacaagatt cttgctcact tgtggcatct gggctactca 1081 ccagaagata tcattggcaa catctttcga gtgtgtaaaa ctttccaaat ggcagaatac 1141 ctgaaactgg agtttatcaa ggaaattgga tacactcaca tgaaaatagc ggaaggagtg 1201 aactctcttt tgcagatggc aggcctcctg gcaaggctgt gtcagaagac aatggccccg 1261 gtggccagtt agagcagaga cttcactgac tgacttacag gtgccctatt ctgaggtaca 1321 ggagccgcgg ctttctgatg ggggaaaatg cgccttaggc tgagccaaca tgactgtccc 1381 ccaaactcca gtggctggcc aggcgcggta gtcacgcctg taatcccaac actttgggag 1441 gccgaggcag gtggatcacc tgaggtcaga agttcaagac cagcctggcc aacatgggga 1501 aaccctgtct ttactaaaaa tataaaaatt agctgggtgt ggtggcgggc acctgtaatc 1561 ccagctactc gggaggctgt ggcaggcgaa atcgcttgaa cccaggagga ggaggtggag 1621 gttgcagtga gccaagatca caccattgca ctccagcctg ggcgacagag actccatctg 1681 gggaaaaaaa ttaaataaat aaactcccg // LOCUS HUMA20 4426 bp mRNA PRI 30-OCT-1994 DEFINITION Human tumor necrosis factor alpha inducible protein A20 mRNA, complete cds. ACCESSION M59465 J05610 NID g177865 KEYWORDS . SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4426) AUTHORS Opipari,A.W. Jr., Boguski,M.S. and Dixit,V.M. TITLE The A20 cDNA induced by tumor necrosis factor alpha encodes a novel type of zinc finger protein JOURNAL J. Biol. Chem. 265 (25), 14705-14708 (1990) MEDLINE 90368626 FEATURES Location/Qualifiers source 1..4426 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Unassigned" gene 67..2439 /gene="TNFAIP1" CDS 67..2439 /gene="TNFAIP1" /note="tumor necrosis factor alpha inducible protein" /codon_start=1 /db_xref="GDB:G00-127-514" /product="A20" /db_xref="PID:g177866" /translation="MAEQVLPQALYLSNMRKAVKIRERTPEDIFKPTNGIIHHFKTMH RYTLEMFRTCQFCPQFREIIHKALIDRNIQATLESQKKLNWCREVRKLVALKTNGDGN CLMHATSQYMWGVQDTDLVLRKALFSTLKETDTRNFKFRWQLESLKSQEFVETGLCYD TRNWNDEWDNLIKMASTDTPMARSGLQYNSLEEIHIFVLCNILRRPIIVISDKMLRSL ESGSNFAPLKVGGIYLPLHWPAQECYRYPIVLGYDSHHFVPLVTLKDSGPEIRAVPLV NRDRGRFEDLKVHFLTDPENEMKEKLLKEYLMVIEIPVQGWDHGTTHLINAAKLDEAN LPKEINLVDDYFELVQHEYKKWQENSEQGRREGHAQNPMEPSVPQLSLMDVKCETPNC PFFMSVNTQPLCHECSERRQKNQNKLPKLNSKPGPEGLPGMALGASRGEAYEPLAWNP EESTGGPHSAPPTAPSPFLFSETTAMKCRSPGCPFTLNVQHNGFCERCHNARQLHASH APDHTRHLDPGKCQACLQDVTRTFNGICSTCFKRTTAEASSSLSTSLPPSCHQRSKSD PSRLVRSPSPHSCHRAGNDAPAGCLSQAARTPGDRTGTSKCRKAGCVYFGTPENKGFC TLCFIEYRENKHFAAASGKVSPTASRFQNTIPCLGRECGTLGSTMFEGYCQKCFIEAQ NQRFHEAKRTEEQLRSSQRRDVPRTTQSTSRPKCARASCKNILACRSEELCMECQHPN QRMGPGAHRGEPAPEDPPKQRCRAPACDHFGNAKCNGYCNECFQFKQMYG" BASE COUNT 1192 a 1070 c 1055 g 1109 t ORIGIN 1 tgccttgacc aggacttggg actttgcgaa aggatcgcgg ggcccggaga ggtgttggag 61 agcacaatgg ctgaacaagt ccttcctcag gctttgtatt tgagcaatat gcggaaagct 121 gtgaagatac gggagagaac tccagaagac atttttaaac ctactaatgg gatcattcat 181 cattttaaaa ccatgcaccg atacacactg gaaatgttca gaacttgcca gttttgtcct 241 cagtttcggg agatcatcca caaagccctc atcgacagaa acatccaggc caccctggaa 301 agccagaaga aactcaactg gtgtcgagaa gtccggaagc ttgtggcgct gaaaacgaac 361 ggtgacggca attgcctcat gcatgccact tctcagtaca tgtggggcgt tcaggacaca 421 gacttggtac tgaggaaggc gctgttcagc acgctcaagg aaacagacac acgcaacttt 481 aaattccgct ggcaactgga gtctctcaaa tctcaggaat ttgttgaaac ggggctttgc 541 tatgatactc ggaactggaa tgatgaatgg gacaatctta tcaaaatggc ttccacagac 601 acacccatgg cccgaagtgg acttcagtac aactcactgg aagaaataca catatttgtc 661 ctttgcaaca tcctcagaag gccaatcatt gtcatttcag acaaaatgct aagaagtttg 721 gaatcaggtt ccaatttcgc ccctttgaaa gtgggtggaa tttacttgcc tctccactgg 781 cctgcccagg aatgctacag ataccccatt gttctcggct atgacagcca tcattttgta 841 cccttggtga ccctgaagga cagtgggcct gaaatccgag ctgttccact tgttaacaga 901 gaccggggaa gatttgaaga cttaaaagtt cactttttga cagatcctga aaatgagatg 961 aaggagaagc tcttaaaaga gtacttaatg gtgatagaaa tccccgtcca aggctgggac 1021 catggcacaa ctcatctcat caatgccgca aagttggatg aagctaactt accaaaagaa 1081 atcaatctgg tagatgatta ctttgaactt gttcagcatg agtacaagaa atggcaggaa 1141 aacagcgagc aggggaggag agaggggcac gcccagaatc ccatggaacc ttccgtgccc 1201 cagctttctc tcatggatgt aaaatgtgaa acgcccaact gccccttctt catgtctgtg 1261 aacacccagc ctttatgcca tgagtgctca gagaggcggc aaaagaatca aaacaaactc 1321 ccaaagctga actccaagcc gggccctgag gggctccctg gcatggcgct cggggcctct 1381 cggggagaag cctatgagcc cttggcgtgg aaccctgagg agtccactgg ggggcctcat 1441 tcggccccac cgacagcacc cagccctttt ctgttcagtg agaccactgc catgaagtgc 1501 aggagccccg gctgcccctt cacactgaat gtgcagcaca acggattttg tgaacgttgc 1561 cacaacgccc ggcaacttca cgccagccac gccccagacc acacaaggca cttggatccc 1621 gggaagtgcc aagcctgcct ccaggatgtt accaggacat ttaatgggat ctgcagtact 1681 tgcttcaaaa ggactacagc agaggcctcc tccagcctca gcaccagcct ccctccttcc 1741 tgtcaccagc gttccaagtc agatccctcg cggctcgtcc ggagcccctc cccgcattct 1801 tgccacagag ctggaaacga cgcccctgct ggctgcctgt ctcaagctgc acggactcct 1861 ggggacagga cggggacgag caagtgcaga aaagccggct gcgtgtattt tgggactcca 1921 gaaaacaagg gcttttgcac actgtgtttc atcgagtaca gagaaaacaa acattttgct 1981 gctgcctcag ggaaagtcag tcccacagcg tccaggttcc agaacaccat tccgtgcctg 2041 gggagggaat gcggcaccct tggaagcacc atgtttgaag gatactgcca gaagtgtttc 2101 attgaagctc agaatcagag atttcatgag gccaaaagga cagaagagca actgagatcg 2161 agccagcgca gagatgtgcc tcgaaccaca caaagcacct caaggcccaa gtgcgcccgg 2221 gcctcctgca agaacatcct ggcctgccgc agcgaggagc tctgcatgga gtgtcagcat 2281 cccaaccaga ggatgggccc tggggcccac cggggtgagc ctgcccccga agaccccccc 2341 aagcagcgtt gccgggcccc cgcctgtgat cattttggca atgccaagtg caacggctac 2401 tgcaacgaat gctttcagtt caagcagatg tatggctaac cggaaacagg tgggtcacct 2461 cctgcaagaa gtggggcctc gagctgtcag tcatcatggt gctatcctct gaacccctca 2521 gctgccactg caacagtggg cttaagggtg tctgagcagg agaggaaaga taagctcttc 2581 gtggtgccca cgatgctcag gtttggtaac ccgggagtgt tcccaggtgg ccttagaaag 2641 caaagcttgt aactggcaag ggatgatgtc agattcagcc caaggttcct cctctcctac 2701 caagcaggag gccaggaact tctttggact tggaaggtgt gcggggactg gccgaggccc 2761 ctgcaccctg cgcatcagga ctgcttcatc gtcttggctg agaaagggaa aagacacaca 2821 agtcgcgtgg gttggagaag ccagagccat tccacctccc ctcccccagc atctctcaga 2881 gatgtgaagc cagatcctca tggcagcgag gccctctgca agaagctcaa ggaagctcag 2941 ggaaaatgga cgtattcaga gagtgtttgt agttcatggt ttttccctac ctgcccggtt 3001 cctttcctga ggacccggca gaaatgcaga accatccatg gactgtgatt ctgaggctgc 3061 tgagactgaa catgttcaca ttgacagaaa aacaagctgc tctttataat atgcaccttt 3121 taaaaaatta gaatatttta ctgggaagac gtgtaactct ttgggttatt actgtcttta 3181 cttctaaaga agttagcttg aactgaggag taaaagtgtg tacatatata atataccctt 3241 acattatgta tgagggattt ttttaaatta tattgaaatg ctgccctaga agtacaatag 3301 gaaggctaaa taataataac ctgttttctg gttgttgttg gggcatgagc ttgtgtatac 3361 actgcttgca taaactcaac cagctgcctt tttaaaggga gctctagtcc tttttgtgta 3421 attcacttta tttattttat tacaaacttc aagattattt aagtgaagat atttcttcag 3481 ctctggggaa aatgccacag tgttctcctg agagaacatc cttgctttga gtcaggctgt 3541 gggcaagttc ctgaccacag ggagtaaatt ggcctctttg atacactttt gcttgcctcc 3601 ccaggaaaga aggaattgca tccaaggtat acatacatat tcatcgatgt ttcgtgcttc 3661 tccttatgaa actccagcta tgtaataaaa aactatactc tgtgttctgt taatgcctct 3721 gagtgtccta cctccttgga gatgagatag ggaaggagca gggatgagac tggcaatggt 3781 cacagggaaa gatgtggcct tttgtgatgg ttttattttc tgttaacact gtgtcctggg 3841 ggggctggga agtcccctgc atcccatggt accctggtat tgggacagca aaagccagta 3901 accatgagta tgaggaaatc tctttctgtt gctggcttac agtttctctg tgtgctttgt 3961 ggttgctgtc atatttgctc tagaagaaaa aaaaaaaagg aggggaaatg cattttcccc 4021 agagataaag gctgccattt tgggggtctg tacttatggc ctgaaaatat ttgtgatcca 4081 taactctaca cagcctttac tcatactatt aggcacactt tccccttaga gccccctaag 4141 tttttcccag acgaatcttt ataatttcct ttccaaagat accaaataaa cttcagtgtt 4201 ttcatctaat tctcttaaag ttgatatctt aatattttgt gttgatcatt atttccattc 4261 ttaatgtgaa aaaaagtaat tatttatact tattataaaa agtatttgaa atttgcacat 4321 ttaattgtcc ctaatagaaa gccacctatt ctttgttgga tttcttcaag tttttctaaa 4381 taaatgtaac ttttcacaag agtcaacatt aaaaaataaa ttattt // LOCUS HUMA2M 4577 bp mRNA PRI 30-OCT-1994 DEFINITION Human alpha-2-macroglobulin mRNA, complete cds. ACCESSION M11313 NID g177869 KEYWORDS alpha-2-macroglobulin; protease inhibitor. SOURCE Human liver, cDNA to mRNA, clone p-alpha-2-M1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4577) AUTHORS Kan,C.C., Solomon,E., Belt,K.T., Chain,A.C., Hiorns,L.R. and Fey,G. TITLE Nucleotide sequence of cDNA encoding human alpha 2-macroglobulin and assignment of the chromosomal locus JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82 (8), 2282-2286 (1985) MEDLINE 85190481 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by M.R.Gehring, 22-JAN-1987. FEATURES Location/Qualifiers source 1..4577 /organism="Homo sapiens" /db_xref="taxon:9606" /map="12p13.3-p12.3" mRNA <1..4577 /note="a2M mRNA" sig_peptide 44..112 /gene="A2M" /note="alpha-2-macroglobulin signal peptide" gene 44..4468 /gene="A2M" CDS 44..4468 /gene="A2M" /note="alpha-2-macroglobulin precursor" /codon_start=1 /db_xref="GDB:G00-119-639" /db_xref="PID:g177870" /translation="MGKNKLLHPSLVLLLLVLLPTDASVSGKPQYMVLVPSLLHTETT EKGCVLLSYLNETVTVSASLESVRGNRSLFTDLEAENDVLHCVAFAVPKSSSNEEVMF LTVQVKGPTQEFKKRTTVMVKNEDSLVFVQTDKSIYKPGQTVKFRVVSMDENFHPLNE LIPLVYIQDPKGNRIAQWQSFQLEGGLKQFSFPLSSEPFQGSYKVVVQKKSGGRTEHP FTVEEFVLPKFEVQVTVPKIITILEEEMNVSVCGLYTYGKPVPGHVTVSICRKYSDAS DCHGEDSQAFCEKFSGQLNSHGCFYQQVKTKVFQLKRKEYEMKLHTEAQIQEEGTVVE LTGRQSSEITRTITKLSFVKVDSHFRQGIPFFGQVRLVDGKGVPIPNKVIFIRGNEAN YYSNATTDEHGLVQFSINTTNVMGTSLTVRVNYKDRSPCYGYQWVSEEHEEAHHTAYL VFSPSKSFVHLEPMSHELPCGHTQTVQAHYILNGGTLLGLKKLSFYYLIMAKGGIVRT GTHGLLVKQEDMKGHFSISIPVKSDIAPVARLLIYAVLPTGDVIGDSAKYDVENCLAN KVDLSFSPSQSLPASHAHLRVTAAPQSVCALRAVDQSVLLMKPDAELSASSVYNLLPE KDLTGFPGPLNDQDDEDCINRHNVYINGITYTPVSSTNEKDMYSFLEDMGLKAFTNSK IRKPKMCPQLQQYEMHGPEGLRVGFYESDVMGRGHARLVHVEEPHTETVRKYFPETWI WDLVVVNSAGVAEVGVTVPDTITEWKAGAFCLSEDAGLGISSTASLRAFQPFFVELTM PYSVIRGEAFTLKATVLNYLPKCIRVSVQLEASPAFLAVPVEKEQAPHCICANGRQTV SWAVTPKSLGNVNFTVSAEALESQELCGTEVPSVPEHGRKDTVIKPLLVEPEGLEKET TFNSLLCPSGGEVSEELSLKLPPNVVEESARASVSVLGDILGSAMQNTQNLLQMPYGC GEQNMVLFAPNIYVLDYLNETQQLTPEVKSKAIGYLNTGYQRQLNYKHYDGSYSTFGE RYGRNQGNTWLTAFVLKTFAQARAYIFIDEAHITQALIWLSQRQKDNGCFRSSGSLLN NAIKGGVEDEVTLSAYITIALLEIPLTVTHPVVRNALFCLESAWKTAQEGDHGSHVYT KALLAYAFALAGNQDKRKEVLKSLNEEAVKKDNSVHWERPQKPKAPVGHFYEPQAPSA EVEMTSYVLLAYLTAQPAPTSEDLTSATNIVKWITKQQNAQGGFSSTQDTVVALHALS KYGAATFTRTGKAAQVTIQSSGTFSSKFQVDNNNRLLLQQVSLPELPGEYSMKVTGEG CVYLQTSLKYNILPEKEEFPFALGVQTLPQTCDEPKAHTSFQISLSVSYTGSRSASNM AIVDVKMVSGFIPLKPTVKMLERSNHVSRTEVSSNHVLIYLDKVSNQTLSLFFTVLQD VPVRDLKPAIVKVYDYYETDEFAIAEYNAPCSKDLGNA" mat_peptide 113..4465 /gene="A2M" /note="alpha-2-macroglobulin" BASE COUNT 1229 a 1166 c 1086 g 1096 t ORIGIN Chromosome 12; 1120 bp upstream of TaqI site. 1 gctacaatcc atctggtctc ctccagctcc ttctttctgc aacatgggga agaacaaact 61 ccttcatcca agtctggttc ttctcctctt ggtcctcctg cccacagacg cctcagtctc 121 tggaaaaccg cagtatatgg ttctggtccc ctccctgctc cacactgaga ccactgagaa 181 gggctgtgtc cttctgagct acctgaatga gacagtgact gtaagtgctt ccttggagtc 241 tgtcagggga aacaggagcc tcttcactga cctggaggcg gagaatgacg tactccactg 301 tgtcgccttc gctgtcccaa agtcttcatc caatgaggag gtaatgttcc tcactgtcca 361 agtgaaagga ccaacccaag aatttaagaa gcggaccaca gtgatggtta agaacgagga 421 cagtctggtc tttgtccaga cagacaaatc aatctacaaa ccagggcaga cagtgaaatt 481 tcgtgttgtc tccatggatg aaaactttca ccccctgaat gagttgattc cactagtata 541 cattcaggat cccaaaggaa atcgcatcgc acaatggcag agtttccagt tagagggtgg 601 cctcaagcaa ttttcttttc ccctctcatc agagcccttc cagggctcct acaaggtggt 661 ggtacagaag aaatcaggtg gaaggacaga gcaccctttc accgtggagg aatttgttct 721 tcccaagttt gaagtacaag taacagtgcc aaagataatc accatcttgg aagaagagat 781 gaatgtatca gtgtgtggcc tatacacata tgggaagcct gtccctggac atgtgactgt 841 gagcatttgc agaaagtata gtgacgcttc cgactgccac ggtgaagatt cacaggcttt 901 ctgtgagaaa ttcagtggac agctaaacag ccatggctgc ttctatcagc aagtaaaaac 961 caaggtcttc cagctgaaga ggaaggagta tgaaatgaaa cttcacactg aggcccagat 1021 ccaagaagaa ggaacagtgg tggaattgac tggaaggcag tccagtgaaa tcacaagaac 1081 cataaccaaa ctctcatttg tgaaagtgga ctcacacttt cgacagggaa ttcccttctt 1141 tgggcaggtg cgcctagtag atgggaaagg cgtccctata ccaaataaag tcatattcat 1201 cagaggaaat gaagcaaact attactccaa tgctaccacg gatgagcatg gccttgtaca 1261 gttctctatc aacaccacca acgttatggg tacctctctt actgttaggg tcaattacaa 1321 ggatcgtagt ccctgttacg gctaccagtg ggtgtcagaa gaacacgaag aggcacatca 1381 cactgcttat cttgtgttct ccccaagcaa gagctttgtc caccttgagc ccatgtctca 1441 tgaactaccc tgtggccata ctcagacagt ccaggcacat tatattctga atggaggcac 1501 cctgctgggg ctgaagaagc tctcctttta ttatctgata atggcaaagg gaggcattgt 1561 ccgaactggg actcatggac tgcttgtgaa gcaggaagac atgaagggcc atttttccat 1621 ctcaatccct gtgaagtcag acattgctcc tgtcgctcgg ttgctcatct atgctgtttt 1681 acctaccggg gacgtgattg gggattctgc aaaatatgat gttgaaaatt gtctggccaa 1741 caaggtggat ttgagcttca gcccatcaca aagtctccca gcctcacacg cccacctgcg 1801 agtcacagcg gctcctcagt ccgtctgcgc cctccgtgct gtggaccaaa gcgtgctgct 1861 catgaagcct gatgctgagc tctcggcgtc ctcggtttac aacctgctac cagaaaagga 1921 cctcactggc ttccctgggc ctttgaatga ccaggacgat gaagactgca tcaatcgtca 1981 taatgtctat attaatggaa tcacatatac tccagtatca agtacaaatg aaaaggatat 2041 gtacagcttc ctagaggaca tgggcttaaa ggcattcacc aactcaaaga ttcgtaaacc 2101 caaaatgtgt ccacagcttc aacagtatga aatgcatgga cctgaaggtc tacgtgtagg 2161 tttttatgag tcagatgtaa tgggaagagg ccatgcacgc ctggtgcatg ttgaagagcc 2221 tcacacggag accgtacgaa agtacttccc tgagacatgg atctgggatt tggtggtggt 2281 aaactcagca ggggtggctg aggtaggagt aacagtccct gacaccatca ccgagtggaa 2341 ggcaggggcc ttctgcctgt ctgaagatgc tggacttggt atctcttcca ctgcctctct 2401 ccgagccttc cagcccttct ttgtggagct tacaatgcct tactctgtga ttcgtggaga 2461 ggccttcaca ctcaaggcca cggtcctaaa ctaccttccc aaatgcatcc gggtcagtgt 2521 gcagctggaa gcctctcccg ccttccttgc tgtcccagtg gagaaggaac aagcgcctca 2581 ctgcatctgt gcaaacgggc ggcaaactgt gtcctgggca gtaaccccaa agtcattagg 2641 aaatgtgaat ttcactgtga gcgcagaggc actagagtct caagagctgt gtgggactga 2701 ggtgccttca gttcctgaac acggaaggaa agacacagtc atcaagcctc tgttggttga 2761 acctgaagga ctagagaagg aaacaacatt caactcccta ctttgtccat caggtggtga 2821 ggtttctgaa gaattatccc tgaaactgcc accaaatgtg gtagaagaat ctgcccgagc 2881 ttctgtctca gttttgggag acatattagg ctctgccatg caaaacacac aaaatcttct 2941 ccagatgccc tatggctgtg gagagcagaa tatggtcctc tttgctccta acatctatgt 3001 actggattat ctaaatgaaa cacagcagct tactccagag gtcaagtcca aggccattgg 3061 ctatctcaac actggttacc agagacagtt gaactacaaa cactatgatg gctcctacag 3121 cacctttggg gagcgatatg gcaggaacca gggcaacacc tggctcacag cctttgttct 3181 gaagactttt gcccaagctc gagcctacat cttcatcgat gaagcacaca ttacccaagc 3241 cctcatatgg ctctcccaga ggcagaagga caatggctgt ttcaggagct ctgggtcact 3301 gctcaacaat gccataaagg gaggagtaga agatgaagtg accctctccg cctatatcac 3361 catcgccctt ctggagattc ctctcacagt cactcaccct gttgtccgca atgccctgtt 3421 ttgcctggag tcagcctgga agacagcaca agaaggggac catggcagcc atgtatatac 3481 caaagcactg ctggcctatg cttttgccct ggcaggtaac caggacaaga ggaaggaagt 3541 actcaagtca cttaatgagg aagctgtgaa gaaagacaac tctgtccatt gggagcgccc 3601 tcagaaaccc aaggcaccag tggggcattt ttacgaaccc caggctccct ctgctgaggt 3661 ggagatgaca tcctatgtgc tcctcgctta tctcacggcc cagccagccc caacctcgga 3721 ggacctgacc tctgcaacca acatcgtgaa gtggatcacg aagcagcaga atgcccaggg 3781 cggtttctcc tccacccagg acacagtggt ggctctccat gctctgtcca aatatggagc 3841 cgccacattt accaggactg ggaaggctgc acaggtgact atccagtctt cagggacatt 3901 ttccagcaaa ttccaagtgg acaacaacaa tcgcctgtta ctgcagcagg tctcattgcc 3961 agagctgcct ggggaataca gcatgaaagt gacaggagaa ggatgtgtct acctccagac 4021 ctccttgaaa tacaatattc tcccagaaaa ggaagagttc ccctttgctt taggagtgca 4081 gactctgcct caaacttgtg atgaacccaa agcccacacc agcttccaaa tctccctaag 4141 tgtcagttac acagggagcc gctctgcctc caacatggcg atcgttgatg tgaagatggt 4201 ctctggcttc attcccctga agccaacagt gaaaatgctt gaaagatcta accatgtgag 4261 ccggacagaa gtcagcagca accatgtctt gatttacctt gataaggtgt caaatcagac 4321 actgagcttg ttcttcacgg ttctgcaaga tgtcccagta agagatctca aaccagccat 4381 agtgaaagtc tatgattact acgagacgga tgagtttgca atcgctgagt acaatgctcc 4441 ttgcagcaaa gatcttggaa atgcttgaag accacaaggc tgaaaagtgc tttgctggag 4501 tcctgttctc tgagctccac agaagacacg tgtttttgta tctttaaaga cttgatgaat 4561 aaacactttt tctggtc // LOCUS HUMA2MGRAP 1493 bp mRNA PRI 30-OCT-1994 DEFINITION Human alpha-2-macroglobulin receptor-associated protein mRNA, complete cds. ACCESSION M63959 NID g177873 KEYWORDS Heymann nephritis antigen; alpha-2-macroglobulin receptor-associated protein. SOURCE Human placenta, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1493) AUTHORS Striekland,D.K., Ashcom,J.D., Williams,S., Battey,F., Behre,E., McTigue,K., Battey,J.F. and Argraves,W.S. TITLE Primary structure of alpha 2-macroglobulin receptor-associated protein. Human homologue of a Heymann nephritis antigen JOURNAL J. Biol. Chem. 266 (20), 13364-13369 (1991) MEDLINE 91302371 FEATURES Location/Qualifiers source 1..1493 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /tissue_lib="lambda-gt11" /map="12p13.3-p12.3" gene 14..1087 /gene="A2M" CDS 14..1087 /gene="A2M" /codon_start=1 /db_xref="GDB:G00-119-639" /product="alpha-2-macroglobulin receptor-associated protein" /db_xref="PID:g177874" /translation="MAPRRVRSFLRGLPALLLLLLFLGPWPAASHGGKYSREKNQPKP SPKRESGEEFRMEKLNQLWEKAQRLHLPPVRLAELHADLKIQERDELAWKKLKLDGLD EDGEKEARLIRNLNVILAKYGLDGKKDARQVTSNSLSGTQEDGLDDPRLEKLWHKAKT SGKFSGEELDKLWREFLHHKEKVHEYNVLLETLSRTEEIHENVISPSDLSDIKGSVLH SRHTELKEKLRSINQGLDRLRRVSHQGYSTEAEFEEPRVIDLWDLAQSANLTDKELEA FREELKHFEAKIEKHNHYQKQLEIAHEKLRHAESVGDGERVSRSREKHALLEGRTKEL GYTVKKHLQDLSGRISRARHNEL" sig_peptide 14..115 /gene="A2M" mat_peptide 116..1084 /gene="A2M" /product="alpha-2-macroglobulin receptor-associated protein" polyA_signal 1472..1477 BASE COUNT 360 a 412 c 488 g 233 t ORIGIN 1 tgagcggggg atgatggcgc cgcggagggt caggtcgttt ctgcgcgggc tcccggcgct 61 gctactgctg ctgctcttcc tcgggccctg gcccgctgcg agccacggcg gcaagtactc 121 gcgggagaag aaccagccca agccgtcccc gaaacgcgag tccggagagg agttccgcat 181 ggagaagttg aaccagctgt gggagaaggc ccagcgactg catcttcctc ccgtgaggct 241 ggccgagctc cacgctgatc tgaagataca ggagagggac gaactcgcct ggaagaaact 301 aaagcttgac ggcttggacg aagatgggga gaaggaagcg agactcatac gcaacctcaa 361 tgtcatcttg gccaagtatg gtctggacgg aaagaaggac gctcggcagg tgaccagcaa 421 ctccctcagt ggcacccagg aagacgggct ggatgacccc aggctggaaa agctgtggca 481 caaggcgaag acctctggga aattctccgg cgaagaactg gacaagctct ggcgggagtt 541 cctgcatcac aaagagaaag ttcacgagta caacgtcctg ctggagaccc tgagcaggac 601 cgaagaaatc cacgagaacg tcattagccc ctcggacctg agcgacatca agggcagcgt 661 cctgcacagc aggcacacgg agctgaagga gaagctgcgc agcatcaacc agggcctgga 721 ccgcctgcgc agggtcagcc accagggcta cagcactgag gctgagttcg aggagcccag 781 ggtgattgac ctgtgggacc tggcgcagtc cgccaacctc acggacaagg agctggaggc 841 gttccgggag gagctcaagc acttcgaagc caaaatcgag aagcacaacc actaccagaa 901 gcagctggag attgcgcacg agaagctgag gcacgcagag agcgtgggcg acggcgagcg 961 tgtgagccgc agccgcgaga agcacgccct gctggagggg cggaccaagg agctgggcta 1021 cacggtgaag aagcatctgc aggacctgtc cggcaggatc tccagagctc ggcacaacga 1081 actctgaagg cactggggag cccagcccgg cagggaagag gccagcgtga aggacctggg 1141 ctcttggccg tggcatttcc gtggacagcc cgccgtcagg gtggctgggg ctggcacggg 1201 tgtcgaggca ggaaggattg tttctggtga ctgcagccgc tgccgtcgcg acacagggct 1261 tggtggtggt agcatttggg tctgagatcg gcccagctct gactgaaggg gcttggcttc 1321 cactcagcat cagcgtggca gtcaccaccc cagtgaggac ctcgatgtcc agctgctgtc 1381 aggtctgata gtcctctgct aaaacaacac gatttacata aaaaatctta cacatctgcc 1441 accggaaata ccatgcacag agtccttaaa aaatagagtg cagtatttaa acc // LOCUS HUMA2PIA 2287 bp mRNA PRI 13-NOV-1997 DEFINITION Homo sapiens mRNA for alpha-2-plasmin inhibitor, complete cds. ACCESSION D00174 NID g219409 KEYWORDS alpha-2-plasmin inhibitor precursor; alpha-2-plasmin inhibitor; serine protease inhibitor. SOURCE Homo sapiens hepatocyte and normal liver(purchased from Toyobo) cell_line:HepG2 cDNA to mRNA, clone_lib:lambda gt11 clone:lambda-APH34,lambda-APH28,lambda-APL1. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2287) AUTHORS Tone,M., Kikuno,R., Kume-Iwaki,A. and Hashimoto-Gotoh,T. TITLE Structure of human alpha 2-plasmin inhibitor deduced from the cDNA sequence JOURNAL J. Biochem. 102 (5), 1033-1041 (1987) MEDLINE 88139254 COMMENT The published amino-terminal sequence of alpha-2-PI was observed from the 40th amino acid after the first methionine. This indicates that the first 39 amino acid residues may constitute a signal peptide. FEATURES Location/Qualifiers source 1..2287 /organism="Homo sapiens" /note="1 bp upstream of EcoRI site" /db_xref="taxon:9606" /cell_line="HepG2" /clone="lambda-APH34,lambda-APH28,lambda-APL1" /clone_lib="lambda gt11" /tissue_type="hepatocyte and normal liver(purchased from Toyobo)" CDS 25..1500 /codon_start=1 /product="alpha-2-plasmin inhibitor precursor" /db_xref="PID:g219410" /translation="MALLWGLLVLSWSCLQGPCSVFSPVSAMEPLGWQLTSGPNQEQV SPLTLLKLGNQEPGGQTALKSPPGVCSRDPTPEQTHRLARAMMAFTADLFSLVAQTST CPNLILSPLSVALALSHLALGAQNHTLQRLQQVLHAGSGPCLPHLLSRLCQDLGPGAF RLAARMYLQKGFPIKEDFLEQSEQLFGAKPVSLTGKQEDDLANINQWVKEATEGKIQE FLSGLPEDTVLLLLNAIHFQGFWRNKFDPSLTQRDSFHLDEQFTVPVEMMQARTYPLR WFLLEQPEIQVAHFPFKNNMSFVVLVPTHFEWNVSQVLANLSWDTLHPPLVWERPTKV RLPKLYLKHQMDLVATLSQLGLQELFQAPDLRGISEQSLVVSGVQHQSTLELSEVGVE AAAATSIAMSRMSLSSFSVNRPFLFFIFEDTTGLPLFVGSVRNPNPSAPRELKEQQDS PGNKDFLQSLKGFPRGDKLFGPDLKLVPPMEEDYPQFGSPK" sig_peptide 28..141 mat_peptide 142..1497 /note="alpha-2-PI" /product="alpha-2-plasmin inhibitor" polyA_signal 2213..2218 BASE COUNT 472 a 711 c 660 g 444 t ORIGIN 1 gaattccggc tggcagggga gaacatggcg ctgctctggg ggctcctggt gctcagctgg 61 tcctgcctgc aaggcccctg ctccgtgttc tcccctgtga gcgccatgga gcccttgggc 121 tggcagctaa ctagcgggcc gaaccaggag caggtgtccc cacttaccct cctcaagttg 181 ggcaaccagg agcctggtgg ccagactgcc ctgaagagtc ccccaggagt ctgcagcaga 241 gaccccaccc cagagcagac ccacaggctg gcccgggcca tgatggcctt cactgccgac 301 ctgttctccc tggtggctca aacgtccacc tgccccaacc tcatcctgtc acccctgagt 361 gtggccctgg cgctgtctca cctggcacta ggtgctcaga accacacgtt gcagaggctg 421 caacaggtgc tgcacgcagg ctcagggccc tgcctccccc atctgctgag ccgcctctgc 481 caggacctgg gccccggcgc gttccgactg gctgccagga tgtacctgca gaaaggattt 541 cccatcaaag aagatttcct ggaacaatcc gaacagctat ttggggcaaa gcccgtgagc 601 ctgacgggaa agcaggaaga tgacctggca aacatcaacc aatgggtgaa ggaggccacg 661 gaggggaaga ttcaggaatt cctctctggg ctgccggaag acaccgtgtt gcttctcctc 721 aacgccatcc acttccaggg tttctggagg aacaagtttg acccgagcct tacccagaga 781 gactccttcc acctggacga gcagttcacg gtgcccgtgg aaatgatgca ggcccgcacg 841 tacccgctgc gctggttctt gctggagcag cctgagatcc aggtggctca tttccccttt 901 aagaacaaca tgagctttgt ggtccttgta cccacccact ttgaatggaa cgtgtcccag 961 gtactggcca acctgagttg ggacaccctg cacccacctc tggtgtggga gaggcccacc 1021 aaggtccggc tgcctaagct gtatctgaaa caccaaatgg acctggtggc caccctcagc 1081 cagctgggcc tgcaggagtt gttccaggcc ccagacctgc gtgggatctc cgagcagagc 1141 ctggtggtgt ccggcgtgca gcatcagtcc accctggagc tcagcgaggt cggcgtggag 1201 gcggcggcgg ccaccagcat tgccatgtcc cgcatgtccc tgtcctcctt cagcgtgaac 1261 cgccccttcc tcttcttcat cttcgaggac accacaggcc ttcccctctt cgtgggcagc 1321 gtgaggaacc ccaaccccag tgcaccgcgg gagctcaagg aacagcagga ttccccgggc 1381 aacaaggact tcctccagag cctgaaaggc ttcccccgcg gagacaagct tttcggccct 1441 gacttaaaac ttgtgccccc catggaggag gattaccccc agtttggcag ccccaagtga 1501 ggggccgtgg ctgtggcatc cagactccct gcctggacca gcctctccac tcatgtgact 1561 ctttccaacc tgctttgtgg cactggggca ggggccgggg gcagtctgag agaggccatt 1621 ctttcccaac acctcttggg gagtttaggg tggggggggg cgcggctggg aggagggcag 1681 gcatcgggga gccgggagcc tgaccctcat ctttcttcca aacaggctca gagggtgtcc 1741 tgcaccgggg cctgggcagg agggaggtgc ttctagttct gccaggagac aggttagctg 1801 ctccccacgt cagctgggac accccgactt ttgtttacca gagaaaaagg gagggggaga 1861 gggctgcctt tggacttgtc ccgggacacc taggctaggg tggggagaga cgggccctgg 1921 tggtggctcg ggaggcgaag cgttgtcctc agccccgcgt ggaactcgtg tctggcacag 1981 cctggctgtg gcctaacctg ccgagagtcc atcagcctcc atcctacccc ctgtgccttg 2041 tcacgccaga cttcccacgg ctcctcgaga tcccaacact gccagcattt cccttccttc 2101 ctctcctgtc tccctcctct gcccgggagc tcaggaaccg aggcagggaa ggatcccatg 2161 agctccttaa ggctcttttg taaggttttt gtagtgattt ttatgccacc tgaataaaga 2221 atgaatgggc aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2281 ggaattc // LOCUS HUMA2TPI 1493 bp mRNA PRI 11-NOV-1985 DEFINITION Human alpha-2-thiol proteinase inhibitor mRNA, complete coding sequence. ACCESSION K02566 NID g177889 KEYWORDS kininogen; thiol proteinase inhibitor. SOURCE Human liver, cDNA to mRNA, clone lambda-HTPI.1529. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 5 to 1493) AUTHORS Ohkubo,I., Kurachi,K., Takasawa,T., Shiokawa,H. and Sasaki,M. TITLE Isolation of a Human cDNA for alpha-2-thiol proteinase inhibitor and its identity with low molecular weight kininogen JOURNAL Biochemistry 23, 5691-5697 (1984) MEDLINE 85122621 REFERENCE 2 (bases 1 to 12) AUTHORS Ohkubo,I. JOURNAL Unpublished (1985) COMMENT Draft entry kindly provided by I. Ohkubo, March 1985. Alpha-2-thiol proteinase inhibitor and low molecular weight kininogen are identical in AA sequence and biological activity [1]. Upon exposure to kallikrein, low molecular weight kininogen is converted to a heavy chain and a light chain held together by a disulfide bond, and a nonapeptide, bradykinin. The amino terminal end of the light chain is located at base 1217; the amino terminal end of the heavy chain has not yet been identified. A poly-A signal is found at bp 1471-1476. FEATURES Location/Qualifiers source 1..1493 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..1493 /note="a2-tpi mRNA" sig_peptide 50..103 /note="alpha-2-thiol proteinase inhibitor signal peptide" CDS 50..1333 /note="prepro alpha-2-thiol proteinase inhibitor" /codon_start=1 /db_xref="PID:g177890" /translation="MKLITILFLCSRLLLSLTQESQSEEIDCNDKDLFKAVDAALKKY NSQNQSNNQFVLYRITEATKTVGSDTFYSFKYEIKEGDCPVQSGKTWQDCEYKDAAKA ATGECTATVGKRSSTKFSVATQTCQITPAEGPVVTAQYDCLGCVHPISTQSPDLEPIL RHGIQYFNNNTQHSSLFMLNEVKRAQRQVVAGLNFRITYSIVQTNCSKENFLFLTPDC KSLWNGDTGECTDNAYIDIQLRIASFSQNCDIYPGKDFVQPPTKICVGCPRDIPTNSP ELEETLTHTITKLNAENNATFYFKIDNVKKARVQVVAGKKYFIDFVARETTCSKESNE ELTESCETKKLGQSLDCNAEVYVVPWEKKIYPTVNCQPLGMISLMKRPPGFSPFRSSR IGEIKEETTSHLRSCEYKGRPPKAGAEPASEREVS" mat_peptide 1190..1216 /note="bradykinin" BASE COUNT 467 a 347 c 337 g 342 t ORIGIN EcoRI site. 1 aattccggtt gaaaccatcc ctcagctcct agagggagat tgttagatca tgaaactaat 61 taccatcctt ttcctctgct ccaggctact actaagttta acccaggaat cacagtccga 121 ggaaattgac tgcaatgaca aggatttatt taaagctgtg gatgctgctc tgaagaaata 181 taacagtcaa aaccaaagta acaaccagtt tgtattgtac cgcataactg aagccactaa 241 gacggttggc tctgacacgt tttattcctt caagtacgaa atcaaggagg gggattgtcc 301 tgttcaaagt ggcaaaacct ggcaggactg tgagtacaag gatgctgcaa aagcagccac 361 tggagaatgc acggcaaccg tggggaagag gagcagtacg aaattctccg tggctaccca 421 gacctgccag attactccag ccgagggccc tgtggtgaca gcccagtacg actgcctcgg 481 ctgtgtgcat cctatatcaa cgcagagccc agacctggag cccattctga gacacggcat 541 tcagtacttt aacaacaaca ctcaacattc ctccctcttc atgcttaatg aagtaaaacg 601 ggcccaaaga caggtggtgg ctggattgaa ctttcgaatt acctactcaa ttgtgcaaac 661 gaattgttcc aaagagaatt ttctgttctt aactccagac tgcaagtccc tttggaatgg 721 tgataccggt gaatgtacag ataatgcata catcgatatt cagctacgaa ttgcttcctt 781 ctcacagaac tgtgacattt atccagggaa ggattttgta caaccaccta ccaagatttg 841 cgtgggctgc cccagagata tacccaccaa cagcccagag ctggaggaga cactgactca 901 caccatcaca aagcttaatg cagagaataa cgcaactttc tatttcaaga ttgacaatgt 961 gaaaaaagca agagtacagg tggtggctgg caagaaatat tttattgact tcgtggccag 1021 ggaaaccaca tgttccaagg aaagtaatga agagttgacc gaaagctgtg agaccaaaaa 1081 acttggccaa agcctagatt gcaacgctga agtttatgtg gtaccctggg agaaaaaaat 1141 ttaccctact gtcaactgtc aaccactggg aatgatctca ctgatgaaaa ggcctccagg 1201 tttttcacct ttccgatcat cacgaatagg ggaaataaaa gaagaaacaa ctagtcacct 1261 aaggtcctgc gagtacaagg gtcgaccccc aaaggcaggg gcagagccag catctgagag 1321 ggaggtctct tgaccaatgg gcagaatctt cactccaggc acatagcccc aaccacctct 1381 gccagcaacc ttgagaggaa ggacaagaag aaagatggga tagaatttaa atagagaaga 1441 atgccatttt atcactctgc ctctgggtga aataaagatc agtcttgatg ttc // LOCUS HUMA2XXX 2383 bp mRNA PRI 31-DEC-1994 DEFINITION Human adenosine receptor (A2) gene, complete cds. ACCESSION M97370 NID g177891 KEYWORDS adenosine receptor. SOURCE Homo sapiens (tissue library: UniZAPI) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2383) AUTHORS Tiffany,H.L. and Murphy,P.M. JOURNAL Unpublished (1992) FEATURES Location/Qualifiers source 1..2383 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HL60" /cell_type="neutrophil" /tissue_lib="UniZAPI" gene 271..1500 /gene="A2" CDS 271..1500 /gene="A2" /codon_start=1 /product="adenosine receptor" /db_xref="PID:g177892" /translation="MGSSVYITVELAIAVLAILGNVLVCWAVWLNSNLQNVTNYFVVS LAAADIAVGVLAIPFAITISTGFCAACHGCLFIACFVLVLTQSSIFSLLAIAIDRYIA IRIPLRYNGLVTGTRAKGIIAICWVLSFAIGLTPMLGWNNCGQPKEGKNHSQGCGEGQ VACLFEDVVPMNYMVYFNFFACVLVPLLLMLGVYLRIFLAARRQLKQMESQPLPGERA RSTLQKEVHAAKSLAIIVGLFALCWLPLHIINCFTFFCPDCSHAPLWLMYLAIVLSHT NSVVNPFIYAYRIREFRQTFRKIIRSHVLRQQEPFKAAGTSARVLAAHGSDGEQVSLR LNGHPPGVWANGSAPHPERRPNGYALGLVSGGSAQESQGNTGLPDVELLSHELKGVCP EPPGLDDPLAQDGAGVS" BASE COUNT 455 a 712 c 709 g 507 t ORIGIN 1 ggcacgaggc tggctgagcc atgatgctgc tgccagaacc cctgcagagg gcctggtttc 61 aggagactca gagtcctctg tgaaaaagcc cttggagagg cgccccagca gggctgcact 121 tggctcctgt gaggaagggg ctcagggtct gggcccctcc gcctgggccg ggctgggagc 181 caggcgggcg gctgggctgc agcaatggac cgtgagctgg cccagcccgc gtccgtgctg 241 agcctgcctg tcgtctgtgg ccatgccatc atgggctcct cggtgtacat cacggtggag 301 ctggccattg ctgtgctggc catcctgggc aatgtgctgg tgtgctgggc cgtgtggctc 361 aacagcaacc tgcagaacgt caccaactac tttgtggtgt cactggcggc ggccgacatc 421 gcagtgggtg tgctcgccat cccctttgcc atcaccatca gcaccgggtt ctgcgctgcc 481 tgccacggct gcctcttcat tgcctgcttc gtcctggtcc tcacgcagag ctccatcttc 541 agtctcctgg ccatcgccat tgaccgctac attgccatcc gcatcccgct ccggtacaat 601 ggcttggtga ccggcacgag ggctaagggc atcattgcca tctgctgggt gctgtcgttt 661 gccatcggcc tgactcccat gctaggttgg aacaactgcg gtcagccaaa ggagggcaag 721 aaccactccc agggctgcgg ggagggccaa gtggcctgtc tctttgagga tgtggtcccc 781 atgaactaca tggtgtactt caacttcttt gcctgtgtgc tggtgcccct gctgctcatg 841 ctgggtgtct atttgcggat cttcctggcg gcgcgacgac agctgaagca gatggagagc 901 cagcctctgc cgggggagcg ggcacggtcc acactgcaga aggaggtcca tgctgccaag 961 tcactggcca tcattgtggg gctctttgcc ctctgctggc tgcccctaca catcatcaac 1021 tgcttcactt tcttctgccc cgactgcagc cacgcccctc tctggctcat gtacctggcc 1081 atcgtcctct cccacaccaa ttcggttgtg aatcccttca tctacgccta ccgtatccgc 1141 gagttccgcc agaccttccg caagatcatt cgcagccacg tcctgaggca gcaagaacct 1201 ttcaaggcag ctggcaccag tgcccgggtc ttggcagctc atggcagtga cggagagcag 1261 gtcagcctcc gtctcaacgg ccacccgcca ggagtgtggg ccaacggcag tgctccccac 1321 cctgagcgga ggcccaatgg ctatgccctg gggctggtga gtggagggag tgcccaagag 1381 tcccagggga acacgggcct cccagacgtg gagctcctta gccatgagct caagggagtg 1441 tgcccagagc cccctggcct agatgacccc ctggcccagg atggagcagg agtgtcctga 1501 tgattcatgg agtttgcccc ttcctaaggg aaggagatct ttatctttct ggttggcttg 1561 accagtcacg ttgggagaag agagagagtg ccaggagacc ctgagggcag ccggttccta 1621 ctttggactg agagaaggga gccccaggct ggagcagcat gaggcccagc aagaagggct 1681 tgggttctga ggaagcagat gtttcatgct gtgaggcctt gcaccaggtg ggggccacag 1741 caccagcagc atctttgctg ggcagggccc agccctccac tgcagaagca tctggaagca 1801 ccaccttgtc tccacagagc agcttgggca cagcagactg gcctggccct gagactgggg 1861 agtggctcca acagcctcct gccacccaca caccactctc cctagactct cctagggttc 1921 aggagctgct gggcccagag gtgacatttg acttttttcc aggaaaaatg taagtgtgag 1981 gaaacccttt ttattttatt acctttcact ctctggctgc tgggtctgcc gtcggtcctg 2041 ctgctaacct ggcaccagag cctctgccgg ggagcctcag gcagtcctct cctgctgtca 2101 cagctgccat ccacttctca gtcccagggc catctcttgg agtgacaaag ctgggatcaa 2161 ggacagggag ttgtaacaga gcagtgccag agcatgggcc caggtcccag gggagaggtt 2221 ggggctggca ggccactggc atgtgctgag tagcgcagag ctacccagtg agaggccttg 2281 tctaactgcc tttccttcta aagggaatgt ttttttctga gataaaataa aaacgagcca 2341 catcgtgttt taagcttgtc caaatgaaaa aaaaaaaaaa aaa // LOCUS HUMA3ADENR 1767 bp mRNA PRI 23-FEB-1994 DEFINITION Human A3 adenosine receptor mRNA, complete cds. ACCESSION L20463 NID g349448 KEYWORDS A3 adenosine receptor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1767) AUTHORS Sajjadi,F.G. and Firestein,G.S. TITLE cDNA cloning and sequence analysis of the human A3 adenosine receptor JOURNAL Biochim. Biophys. Acta 1179 (1), 105-107 (1993) MEDLINE 94002215 FEATURES Location/Qualifiers source 1..1767 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="22 year old" /sex="male" /tissue_type="heart" /tissue_lib="lambda ZAPII" CDS 292..1248 /codon_start=1 /product="A3 adenosine receptor" /db_xref="PID:g349449" /translation="MPNNSTTLSLANVTYITMEIFIGLCAIVGNVLVICVVKLNPSLQ TTTFYFIVSLALADIAVGVLVMPLAIVVSLGITIHFYSCLFMTCLLLIFTHASIMSLL AIAVDRYLRVKLTVRYKRVTTHRRIWLALGLCWLVSFLVGLTPMFGWNMKLTSEYHRN VTFLSCQFVSVMRMDYMVYFSFLTWIFIPLVVMCAIYLDIFYIIRNKLSLNLSNSKET GAFYGREFKTAKSLFLVLFLFALSWLPLSIINCIIYFNGEVPQLVLYMGILLSHANSM MNPIVYAYKIKKFKETYLLILKACVVCHPSDSLDTSIEKNSE" polyA_signal 1691..1696 polyA_signal 1751..1756 BASE COUNT 410 a 443 c 396 g 518 t ORIGIN 1 tccttctgat tcagtccata tagagctgtc ctacagcatt ctggaaactt gaggatgtgc 61 ggtgcataaa ggggctggaa gtgacccacc tgtgatgagc cctttctaag gagaagggtt 121 tccaagagat caccccacca gaaaagggta ggaatgagca agttgggaat tttagactgt 181 cactgcacat ggacctctgg gaagacgtct ggcgagagct aggcccactg gccctacaga 241 cggatcttgc tggctcacct gtccctgtgg aggttcccct gggaaggcaa gatgcccaac 301 aacagcacta ctctgtcatt ggccaatgtt acctacatca ccatggaaat tttcattgga 361 ctctgcgcca tagtgggcaa cgtgctggtc atctgcgtgg tcaagctgaa ccccagcctg 421 cagaccacca ccttctattt cattgtctct ctagccctgg ctgacattgc tgttggggtg 481 ctggtcatgc ctttggccat tgttgtcagc ctgggcatca caatccactt ctacagctgc 541 ctttttatga cttgcctact gcttatcttt acccacgcct ccatcatgtc cttgctggcc 601 atcgctgtgg accgatactt gcgggtcaag cttaccgtca gatacaagag ggtcaccact 661 cacagaagaa tatggctggc cctgggcctt tgctggctgg tgtcattcct ggtgggattg 721 acccccatgt ttggctggaa catgaaactg acctcagagt accacagaaa tgtcaccttc 781 ctttcatgcc aatttgtttc cgtcatgaga atggactaca tggtatactt cagcttcctc 841 acctggattt tcatccccct ggttgtcatg tgcgccatct atcttgacat cttttacatc 901 attcggaaca aactcagtct gaacttatct aactccaaag agacaggtgc attttatgga 961 cgggagttca agacggctaa gtccttgttt ctggttcttt tcttgtttgc tctgtcatgg 1021 ctgcctttat ctatcatcaa ctgcatcatc tactttaatg gtgaggtacc acagcttgtg 1081 ctgtacatgg gcatcctgct gtcccatgcc aactccatga tgaaccctat cgtctatgcc 1141 tataaaataa agaagttcaa ggaaacctac cttttgatcc tcaaagcctg tgtggtctgc 1201 catccctctg attctttgga cacaagcatt gagaagaatt ctgagtagtt atccatcaga 1261 gatgactctg tctcattgac cttcagattc cccatcaaca aacacttgag ggcctgtatg 1321 cctgggccaa gggattttta catccttgat tacttccact gaggtgggag catctccagt 1381 gctccccaat tatatctccc ccactccact actctcttcc tccacttcat ttttcctttg 1441 tcctttctct ctaattcagt gttttggagg cctgacttgg ggacaacgta ttattgatat 1501 tattgtctgt tttccttctt cccaatagaa gaataagtca tggagcctga agggtgccta 1561 gttgacttac tgacaaaagg ctctagttgg gctgaacatg tgtgtggtgg tgactcattt 1621 ccatgccatt gtggaattga gcagagaacc tgctctcgga ggatgcctag aagatgttgg 1681 gaacagaaga aataaactga gtttaagggg gacttaaact gctgaattca cctgtggatg 1741 tttttgagta aataaaagct aatagcg // LOCUS HUMA4 945 bp mRNA PRI 29-JUN-1993 DEFINITION Homo sapiens differentiation-dependent A4 protein mRNA, complete cds. ACCESSION L09604 NID g177899 KEYWORDS A4 protein. SOURCE Homo sapiens intestine (colonic adenocarcinoma) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 945) AUTHORS Oliva,M.M., Wu,T.-C. and Yang,V.W. TITLE Isolation and characterization of a differentiation-dependent gene in the human colonic cell line HT29-18 JOURNAL Arch. Biochem. Biophys. 302, 183-192 (1993) MEDLINE 93228341 FEATURES Location/Qualifiers source 1..945 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HT29-18" /cell_type="epithelial cell" /tissue_type="intestine (colonic adenocarcinoma)" 5'UTR 1..75 CDS 76..534 /codon_start=1 /product="A4 protein" /db_xref="PID:g177900" /translation="MADSERLSAPGCWAACTNFSRTRKGILLFAEIILCLVILICFSA STPGYSSLSVIEMILAAIFFVVYMCDLHTKIPFINWPWSDFFRTLIAAILYLITSIVV LVERGNHSKIVAGVLGLIATCLFGYDAYVTFPVRQPRHTAAPTDPADGPV" 3'UTR 535..945 polyA_signal 909..914 BASE COUNT 227 a 282 c 203 g 233 t ORIGIN 1 ctgggtgtac agcgtcctcg aaaccacgag caagtgagca gatcctccga ggcaccaggg 61 actccagccc atgccatggc ggattctgag cgcctctcgg ctcctggctg ctgggccgcc 121 tgcaccaact tctcgcgcac tcgaaaggga atcctcctgt ttgctgagat tatattatgc 181 ctggtgatcc tgatctgctt cagtgcctcc acaccaggct actcctccct gtcggtgatt 241 gagatgatcc ttgctgctat tttctttgtt gtctacatgt gtgacctgca caccaagata 301 ccattcatca actggccctg gagtgatttc ttccgaaccc tcatagcggc aatcctctac 361 ctgatcacct ccattgttgt ccttgttgag agaggaaacc actccaaaat cgtcgcaggg 421 gtactgggcc taatcgctac gtgcctcttt ggctatgacg cctatgtcac cttccccgtt 481 cggcagccaa gacatacagc agcccccact gaccccgcag atggcccggt gtaggcgaac 541 ttccctcatt tctctctgca atctgcaaat aactcctcca ttgaaataac tcctccccac 601 cccaacaaca acattcccag cagaccaact cccaccccct ctttgaggta aaagtgcctt 661 tattgggaga cttttgtctt ccagcctgcc aatcaaccct cctgggtgtg gccaccatat 721 gtgtgtgcct aggtcctcct tctgcacgat ccaataggag acaccagttc tgactgaacc 781 atgcccccac ctaagtcaca aaatgaggga agtggggagt tagatttcag agtccaggcc 841 ctaggttggg acccactcca aataatctcc tcggtgtggg tggtggttct atagagggat 901 aaatgaataa taaacattgt taaaatataa aaaaaaaaaa aaaaa // LOCUS HUMA5NICRC 1679 bp mRNA PRI 31-DEC-1994 DEFINITION H.sapiens nicotinic receptor alpha 5 subunit mRNA, complete cds. ACCESSION M83712 NID g177925 KEYWORDS nicotinic receptor alpha 5 subunit. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1679) AUTHORS Chini,B., Clementi,F., Hukovic,N. and Sher,E. TITLE Neuronal-type alpha-bungarotoxin receptors and the alpha 5-nicotinic receptor subunit gene are expressed in neuronal and nonneuronal human cell lines JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (5), 1572-1576 (1992) MEDLINE 92179225 FEATURES Location/Qualifiers source 1..1679 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="IMR32" /cell_type="neuroblastoma cell line" sig_peptide 149..214 /gene="nicotinic receptor alpha 5 subunit" CDS 149..1555 /gene="nicotinic receptor alpha 5 subunit" /codon_start=1 /product="nicotinic receptor alpha 5 subunit" /db_xref="PID:g177926" /translation="MAARGSGPRALRLLLLVQLVAGALRSSRARRAARRGLSEPSSIA KHEDSLLKDLFQDYERWVRPVEHLNDKIKIKFGLAISQLVDVDEKNQLMTTNVWLKQE WIDVKLRWNPDDYGGIKVIRVPSDSSWTPDIVLFDNADGRFEGTSTKTVIRYNGTVTW TPPANYKSSCTIDVTFFPFDLQNCSMKFGSWTYDGSQVDIILEDQDVDKRDFFDNGEW EIVSATGSKGNRTDSCCWYPYVTYSFVIKRLPLFYTLFLIIPCIGLSFLTVLVFYLPS NEGEKICLCTSVLVSLTVFLLVIEEIIPSSSKVIPLIGEYLVFTMIFVTLSIMVTVFA INIHHRSSSTHNAMAPLVRKIFLHTLPKLLSMRSHVDRYFTQKEETESGSGPKSSRNT LEAALDSIRYITTHIMKENDVREVVEDWKFIAQVLDRMFLWTFLFVSIVGSLGLFVPV IYKWANILIPVHIGNANK" gene 149..1555 /gene="nicotinic receptor alpha 5 subunit" mat_peptide 215..1552 /gene="nicotinic receptor alpha 5 subunit" /product="nicotinic receptor alpha 5 subunit" BASE COUNT 444 a 352 c 389 g 494 t ORIGIN 1 gggagctgtg gcgcggagcg gcccctctgc tgcgtctgcc ctcgttttgt ctcacgactc 61 acactcagtg ctccattccc caagagttcg cgttccccgc gcggcggtcg agaggcggct 121 gcccgcggtc ccgcgcgggc gcggggcgat ggcggcgcgg gggtcagggc cccgcgcgct 181 ccgcctgctg ctcttggtcc agctggtcgc gggggcgctg cggtctagcc gggcgcggcg 241 ggcggcgcgc agaggattat ctgaaccttc ttctattgca aaacatgaag atagtttgct 301 taaggattta tttcaagact acgaaagatg ggttcgtcct gtggaacacc tgaatgacaa 361 aataaaaata aaatttggac ttgcaatatc tcaattggtg gatgtggatg agaaaaatca 421 gttaatgaca acaaacgtct ggttgaaaca ggaatggata gatgtaaaat taagatggaa 481 ccctgatgac tatggtggaa taaaagttat acgtgttcct tcagactctt cgtggacacc 541 agacatcgtt ttgtttgata atgcagatgg acgttttgaa gggaccagta cgaaaacagt 601 catcaggtac aatggcactg tcacctggac tccaccggca aactacaaaa gttcctgtac 661 catagatgtc acgtttttcc catttgacct tcagaactgt tccatgaaat ttggttcttg 721 gacttatgat ggatcacagg ttgatataat tctagaggac caagatgtag acaagagaga 781 tttttttgat aatggagaat gggagattgt gagtgcaaca gggagcaaag gaaacagaac 841 cgacagctgt tgctggtatc cgtatgtcac ttactcattt gtaatcaagc gcctgcctct 901 cttttatacc ttgttcctta taataccctg tattgggctc tcatttttaa ctgtacttgt 961 cttctatctt ccttcaaatg aaggtgaaaa gatttgtctc tgcacttcag tacttgtgtc 1021 tttgactgtc ttccttctgg ttattgaaga gatcatacca tcatcttcaa aagtcatacc 1081 tctaattgga gagtatctgg tatttaccat gatttttgtg acactgtcaa ttatggtaac 1141 cgtcttcgct atcaacattc atcatcgttc ttcctcaaca cataatgcca tggcgccttt 1201 ggtccgcaag atatttcttc acacgcttcc caaactgctt tcgatgagaa gtcatgtaga 1261 caggtacttc actcagaaag aggaaactga gagtggtagt ggaccaaaat cttctagaaa 1321 cacattggaa gctgcgctcg attctattcg ctacattaca acacacatca tgaaggaaaa 1381 tgatgtccgt gaggttgttg aagattggaa attcatagcc caggttcttg atcggatgtt 1441 tctgtggact tttcttttcg tttcaattgt tggatctctt gggctttttg ttcctgttat 1501 ttataaatgg gcaaatatat taataccagt tcatattgga aatgcaaata agtgaagcct 1561 cccaagggac tgaagtatac atttagttaa cacacatata tctgatggca cctataaaat 1621 tatgaaaatg taagttatgt gttaaattta gtgcaagctt taacagacta agttgctaa // LOCUS HUMAAE 1458 bp mRNA PRI 17-AUG-1994 DEFINITION Homo sapiens dbpB-like protein mRNA, complete cds. ACCESSION L28809 NID g454151 KEYWORDS . SOURCE Homo sapiens adult bone marrow cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1458) AUTHORS Horwitz,E.M., Maloney,K.A. and Ley,T.J. TITLE A human protein containing a 'cold shock' domain binds specifically to H-DNA upstream from the human gamma-globin genes JOURNAL J. Biol. Chem. 269 (19), 14130-14139 (1994) MEDLINE 94245734 FEATURES Location/Qualifiers source 1..1458 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="bone marrow" /clone="hBP5" CDS 82..1056 /note="similar to dbpB protein" /codon_start=1 /db_xref="PID:g454152" /translation="MSSEAETQQPPAAPPAAPALSAADTKPGTTGSGAGSGGPGGLTS AAPAGGDKKVIATKVLGTVKWFNVRNGYGFINRNDTKEDVFVHQTAIKKNNPRKYLRS VGDGETVEFDVVEGEKGAEAANVTGPGGVPVQGSKYAADRNHYRRYPRRRGPPRNYQQ NYQNSESGEKNEGSESAPEGQAQQRRPYRRRRFPPYYMRRPYGRRPQYSNPPVQGEVM EGADNQGAGEQGRPVRQNMYRGYRPRFRRGPPRQRQPREDGNEEDKENQGDETQGQQP PQRRYRRNFNYRRRRPENPKPQDGKETKAADPPAENSSAPEAEQGGAE" polyA_site 1458 BASE COUNT 427 a 364 c 373 g 294 t ORIGIN 1 gccgccgccg gcctagttac catcacaccc cgggaggagc cgcagctgcc gcagccggcc 61 ccagtcacca tcaccgcaac catgagcagc gaggccgaga cccagcagcc gcccgccgcc 121 ccccccgccg cccccgccct cagcgccgcc gacactaagc ccggcactac gggcagcggc 181 gcagggagcg gtggcccggg cggcctcaca tcggcggcgc ctgccggcgg ggacaagaag 241 gtcatcgcaa cgaaggtttt gggaacagta aaatggttca atgtaaggaa cggatatggt 301 ttcatcaaca ggaatgacac caaggaagat gtatttgtac accagactgc cataaagaag 361 aataacccca ggaagtacct tcgcagtgta ggagatggag agactgtgga gtttgatgtt 421 gttgaaggag aaaagggtgc ggaggcagca aatgttacag gtcctggtgg tgttccagtt 481 caaggcagta aatatgcagc agaccgtaac cattatagac gctatccacg tcgtaggggt 541 cctccacgca attaccagca aaattaccag aatagtgaga gtggggaaaa gaacgaggga 601 tcggagagtg ctcccgaagg ccaggcccaa caacgccggc cctaccgcag gcgaaggttc 661 ccaccttact acatgcggag accctatggg cgtcgaccac agtattccaa ccctcctgtg 721 cagggagaag tgatggaggg tgctgacaac cagggtgcag gagaacaagg tagaccagtg 781 aggcagaata tgtatcgggg atatagacca cgattccgca ggggccctcc tcgccaaaga 841 cagcctagag aggacggcaa tgaagaagat aaagaaaatc aaggagatga gacccaaggt 901 cagcagccac ctcaacgtcg gtaccgccgc aacttcaatt accgacgcag acgcccagaa 961 aaccctaaac cacaagatgg caaagagaca aaagcagccg atccaccagc tgagaattcg 1021 tccgctcccg aggctgagca gggcggggct gagtaaatgc cggcttacca tctctaccat 1081 catccggttt agtcatccaa caagaagaaa tatgaaattc cagcaataag aaatgaacaa 1141 aagattggag ctgaagacct aaagtgcttg ctttttgccc gttgaccaga taaatagaac 1201 tatctgcatt atctatgcag catggggttt ttattatttt tacctaaaga cgtctctttt 1261 tggtaataac aaacgtgttt tttaaaaaag cctggttttt ctcaatacgc ctttaaaggt 1321 ttttaaattg tttcatatct ggtcaagttg agatttttaa gaacttcatt tttaatttgt 1381 aataaaagtt tacaacttga ttttttcaaa aaagtcaaca aactgcaagc acctgttaat 1441 aaataattgt ctttgtgt // LOCUS HUMAAMP1X 1762 bp mRNA PRI 22-JUN-1995 DEFINITION Homo sapiens angio-associated migratory cell protein (AAMP) mRNA, complete cds. ACCESSION M95627 NID g870802 KEYWORDS angio-associated migratory cell protein. SOURCE Homo sapiens brain cDNA to mRNA; and Homo sapiens brain metastasis cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1762) AUTHORS Beckner,M.E., Krutzsch,H.C., Stracke,M.L., Williams,S.T., Gallardo,J.A. and Liotta,L.A. TITLE Identification of a new immunoglobulin superfamily protein expressed in blood vessels with a heparin-binding consensus sequence JOURNAL Cancer Res. 55 (10), 2140-2149 (1995) MEDLINE 95262124 FEATURES Location/Qualifiers source 1..1762 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" mRNA <1..1762 /gene="AAMP" gene 1..1762 /gene="AAMP" CDS 1..1359 /gene="AAMP" /note="amino acid feature: WD40 repeat, aa 181..220; amino acid feature: WD40 repeat, aa 322..363; amino acid feature: immunoglobulin type domain, aa 214..303; amino acid feature: immunoglobulin type domain, aa 94..168; amino acid feature: WD40 repeat, aa 221..261; amino acid feature: WD40 repeat, aa 139..180; amino acid feature: WD40 repeat, aa 96..138; amino acid feature: WD40 repeat, aa 364..404" /codon_start=1 /product="angio-associated migratory cell protein" /db_xref="PID:g870803" /translation="MDSGRRLGPEKWIRRLRRMESESESGAAADTPPLETLSFHGDEE IIEVVELDPGPPDPDDLAQEMEDVDFEEEEEEEGNEEGWVLEPQEGVVGSMEGPDDSE VTFALHSASVFCVSLDPKTNTLAVTGGEDDKAFVWRLSDGELLFECAGHKDSVTCAGF SHDSTLVATGDMSGLLKVWQVDTKEEVWSFEAGDLEWMEWHPRAPVLLAGTADGNTWM WKVPNGDCKTFQGPNCPATCGRVLPDGKRAVVGYEDGTIRIWDLKQGSPIHVLKGTEG HQGPLTCVAANQDGSLILTGSVDCQAKLVSATTGKVVGVFRPETVASQPSLGEGEESE SNSVESLGFCSVMPLAAVGYLDGTLAIYDLATQTLRHQCQHQSGIVQLLWEAGTAVVY TCSLDGIVRLWDARTGRLLTDYRGHTAEILDFALSKDASLVVTTSGDHKAKVFCVQRP DR" 3'UTR 1361..1762 /gene="AAMP" polyA_site 1762 /gene="AAMP" BASE COUNT 362 a 449 c 569 g 382 t ORIGIN 1 atggactctg ggaggcgttt gggcccagag aagtggatcc gccgcttgcg ccgcatggag 61 tccgaatcgg aaagcggggc tgctgctgac acccccccac tggagaccct aagcttccat 121 ggtgatgaag agattatcga ggtggtagaa cttgatcccg gtccgccgga cccagatgac 181 ctggcccagg agatggaaga tgtggacttt gaggaagaag aggaggaaga gggcaacgaa 241 gagggctggg ttctagaacc ccaggaaggg gtggtcggca gcatggaggg ccccgacgat 301 agcgaggtca cctttgcatt gcactcagca tctgtgtttt gtgtgagcct ggaccccaag 361 accaatacct tggcagtgac cgggggtgaa gatgacaaag ccttcgtatg gcggctcagc 421 gatggggagc tgctctttga gtgtgcaggc cataaagact ctgtgacttg tgctggtttc 481 agccatgact ccactctagt ggccacaggg gacatgagtg gcctcttgaa agtgtggcag 541 gtggacacta aggaggaggt ctggtccttt gaagcgggag acctggagtg gatggagtgg 601 catcctcggg cacctgtcct gttggcgggc acagctgacg gcaacacctg gatgtggaaa 661 gtcccgaatg gtgactgcaa gaccttccag ggtcccaact gcccagccac ctgtggccga 721 gtcctccctg atgggaagag agctgtggta ggctatgaag atgggaccat caggatttgg 781 gacctgaagc agggaagccc tatccatgta ctgaaaggga ctgagggtca ccagggccca 841 ctcacctgtg ttgctgccaa ccaggatggc agcttgatcc taactggctc tgtggactgc 901 caggccaagc tggtcagtgc caccaccggc aaggtggtgg gtgtttttag acctgagact 961 gtggcctccc agcccagcct gggagaaggg gaggagagtg agtccaactc ggtggagtcc 1021 ttgggcttct gcagtgtgat gcccctggca gctgttggct acctggatgg gaccttggcc 1081 atctatgacc tggctacgca gactcttagg catcagtgtc agcaccagtc gggcatcgtg 1141 cagctgctgt gggaggcagg cactgccgtg gtatatacct gcagcctgga tggcatcgtg 1201 cgcctctggg acgcccggac cggccgcctg cttactgact accggggcca cacggctgag 1261 atcctggact ttgccctcag caaagatgcc tccctggtgg tgaccacgtc aggagaccac 1321 aaagcgaaag tattttgtgt ccaaaggcct gaccgttaat ggctgcagcc cctgcctgtg 1381 tgtctggtgt tgaggggacg aagggacccc tgcccctgtc tgccagcaga ggcagtaggg 1441 cacagaggga agaggagggt ggggccctgg atgactttcc agcctcttca actgacttgc 1501 tcccctctcc ttttcttctc tttagagacc cagcccaggg ccctcccacc cttgcccaga 1561 cctggtgggc ccttcagagg gaggggtgga cctgtttctc tttcactttc atttgctggt 1621 gtgagccatg gggtgtgtat ttgtatgtgg ggagtaggtg tttgaggttc ccgttctttc 1681 ccttcccaag tctctggggg tggaaaggag gaagagatac tagttaaaga ttttaaaaat 1741 gtaaataaaa tatacttccc ag // LOCUS HUMAARE 2374 bp mRNA PRI 21-OCT-1996 DEFINITION Human mRNA for acylamino acid-releasing enzyme, complete cds. ACCESSION D38441 NID g556513 KEYWORDS acylamino acid-releasing enzyme. SOURCE Homo sapiens liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2374) AUTHORS Mitta,M., Ohnogi,H., Mizutani,S., Sakiyama,F., Kato,I. and Tsunasawa,S. TITLE The nucleotide sequence of human acylamino acid-releasing enzyme JOURNAL DNA Res. 3 (1), 31-35 (1996) MEDLINE 96281126 REFERENCE 2 (bases 1 to 2374) AUTHORS Mitta,M. TITLE Direct Submission JOURNAL Submitted (29-OCT-1994) to the DDBJ/EMBL/GenBank databases. Masanori Mitta, Takara Shuzo Co., Ltd., Biotechnology Research Laboratories; 3-4-1 Seta, Otsu, Shiga 520-21, Japan (Tel:0775-43-7244, Fax:0775-43-2494) FEATURES Location/Qualifiers source 1..2374 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" CDS 20..2218 /EC_number="3.4.19.1" /codon_start=1 /product="acylamino acid-releasing enzyme" /db_xref="PID:d1008056" /db_xref="PID:g556514" /translation="MERQVLLSEPEEAAALYRGLSRQPALSAACLGPEVTTQYGGQYR TVHTEWTQRDLERMENIRFCRQYLVFHDGDSVVFAGPAGNSVETRGELLSRESPSGSM KAVLRKAGGTGPGEEKQFLEVWEKNRKLKSFNLSVLEKHGPVYEDDCFGCLSWSHSET HLLYVAERKRPKAESFFQTKALDVSASDDEIARLKKPDQPIKGDQFVFYEDWGENMVS KSIPVLCVLDVESGNISVLEGVPENVSPGQAFWAPGDAGVVFVGWWHEPFRLGIRFCT NRRSALYYVDLIGGKCELLSDDSLAVSSPRLSPDQCRIVYLQYPSLIPHHQCSQLCLY DWYTKVTSVVVDVVPRQLGENFSGIYCSLLPLGCWSADSQRVVFDSAQRSRQDLFAVD TQVGTVTSLTAGGSGGSWKLLTIDQDLMVAQFSTPSLPPTLKVGFLPSAGKEQSVLWV SLEEAEPIPDIHWGIRVLQPPPEQENVQYAGLDFEAILLQPGSPPDKTQVPMVVMPHG GPHSSFVTAWMLFPAMLCKMGFAVLLVNYRGSTGFGQDSILSLPGNVGHQDVKDVQFA VEQVLQEEHFDASHVALMGGSHGGFISCHLIGQYPETYRACVARNPVINIASMLGSTD IPDWCVVEAGFPFSSDCLPDLSVWAEMLDKSPIRYIPQVKTPLLLMLGQEDRRVPFKQ GMEYYRALKTRNVPVRLLLYPKSTHALSEVEVESDSFMNAVLWLRTHLGS" polyA_signal 2362..2367 BASE COUNT 476 a 648 c 728 g 522 t ORIGIN 1 aggcggcaga gaggagacta tggaacgtca ggtgctgctg agcgagcccg aggaggcggc 61 ggctctgtat cggggcctta gccgccagcc cgcgctgagc gccgcctgcc tgggcccgga 121 ggtcaccaca cagtacggcg gccagtaccg gacggtgcac actgagtgga cccagaggga 181 cctggaacgc atggagaaca ttcgattctg ccgccaatac ctggtgttcc atgacgggga 241 ctcagtggtg tttgccggac ctgcaggcaa cagtgtggag acccgggggg aactgctgag 301 cagagagtct ccttcaggca gcatgaaagc tgtgctgcgc aaggctggag gcacgggccc 361 tggggaagag aagcagttcc tggaggtctg ggagaagaac cggaagctca agagtttcaa 421 cctatcagtg ctggagaaac atgggcctgt ttatgaggat gactgttttg gctgcctgtc 481 ctggtcgcac tcggagacac acttgttgta tgtggcagag aggaagcgcc ccaaggccga 541 gtccttcttt cagaccaaag ccttggacgt cagtgccagc gatgatgaga tagccaggct 601 gaagaagcca gaccaaccca tcaaggggga tcagtttgtg ttttacgaag actggggaga 661 aaacatggtt tccaaaagca tccctgtgct ctgcgtgctg gatgtcgaga gtggcaacat 721 ctctgtgctt gagggggtcc ctgagaatgt gtcccctgga caggcatttt gggcccctgg 781 agatgctggt gtggtgtttg tgggctggtg gcatgagccc ttccggttgg gcatccgctt 841 ttgcaccaat cgcaggtcag ccctgtatta tgtggacctc atcgggggga agtgtgagct 901 cctctcggat gactccctgg ctgtctcttc tccccggctg agcccagacc aatgtcgcat 961 tgtctacctg cagtacccat ctctgatccc ccatcaccaa tgcagccagc tgtgcctgta 1021 tgactggtat accaaggtta cctcagtggt ggtagatgtt gtgcctcggc agctgggaga 1081 gaacttctct gggatctact gcagccttct gcctttggga tgctggtcag ctgacagcca 1141 gagagtggtc tttgactcgg ctcagcgcag ccggcaggac ctgtttgctg tggacaccca 1201 agtgggcact gtgacctccc tcacagctgg agggtcaggt gggagctgga agttgctcac 1261 aattgaccag gacctcatgg tggcacagtt ttccacaccc agcctacctc caaccctgaa 1321 agttgggttc ctgccttctg cagggaagga gcagtcagtg ttgtgggtgt ccctggagga 1381 ggccgagccc attcccgaca tccactgggg catccgggtg ctacagccac ccccagagca 1441 agagaatgtg cagtatgctg gccttgactt tgaagcaatc ctgctgcagc ctggcagccc 1501 tccagataag acccaagtgc ccatggtggt catgccccac ggggggcccc attcatcctt 1561 tgtcactgcc tggatgctgt tcccagccat gctttgcaag atgggctttg cggtactact 1621 agtgaactat cgtggctcca cgggctttgg ccaggacagc atcctctccc tcccaggcaa 1681 tgtgggccac caggatgtga aggatgtcca gtttgcagtg gaacaggtgc tccaggagga 1741 acactttgat gcaagccatg tggcccttat gggtggttcc catggtggct tcatttcctg 1801 ccacttgatt ggtcagtacc cagagaccta cagggcctgc gtggcccgga accccgtgat 1861 caacatcgcc tccatgttgg gctccactga catccctgac tggtgcgtgg tggaggctgg 1921 ctttcctttc agcagtgact gcctgccaga cctcagcgtg tgggctgaga tgctggacaa 1981 atcgcccatc agatacatcc ctcaggtgaa gacaccactg ttactgatgt tgggccagga 2041 ggaccggcgt gtgcccttca agcagggcat ggagtattac cgtgccctca agacccggaa 2101 tgtgcctgtt cggctcctgc tctatcccaa aagcacccac gcattatcag aggtggaggt 2161 ggagtcagac agcttcatga atgctgtgct ctggctacgc acacacttgg gcagctgaag 2221 ccctgccatt ctgcatgagc tgatcagcct gtgccacact tcgctcttga ggagctcaac 2281 ggtctggcag ggcagcagga ggctttctgg gctctggact ccacggatgc gtgggcagag 2341 gaatgtgggc tatgtagtca taataaatta ggac // LOCUS HUMAATP 2284 bp mRNA PRI 08-JUN-1993 DEFINITION Human amino acid transport protein mRNA, complete cds. ACCESSION M95548 M95298 NID g306441 KEYWORDS amino acid transport. SOURCE Homo sapiens kidney cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2284) AUTHORS Wells,R.G, Lee,W.-S., Sabbag,R.V and Hediger,M.A. TITLE Cloning and chromosomal localization of a human kidney cDNA involved in cystine, dibasic, and neutral amino acid transport JOURNAL J. Clin. Invest. 90, 1959-1963 (1993) FEATURES Location/Qualifiers source 1..2284 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" CDS 45..2102 /codon_start=1 /product="amino acid transport protein" /db_xref="PID:g306442" /translation="MAEDKSKRDSIEMSMKGCQTNNGFVHNEDILEQTPDPGSSTDNL KHSTRGILGSQEPDFKGVQPYAGMPKEVLFQFSGQARYRIPREILFWLTVASVLVLIA ATIAIIALSPKCLDWWQEGPMYQIYPRSFKDSNKDGNGDLKGIQDKLDYITALNIKTV WITSFYKSSLKDFRYGVEDFREVDPIFGTMEDFENLVAAIHDKGLKLIIDFIPNHTSD KHIWFQLSRTRTGKYTDYYIWHDCTHENGKTIPPNNWLSVYGNSSWHFDEVRNQCYFH QFMKEQPDLNFRNPDVQEEIKEILRFWLTKGVDGFSLDAVKFLLEAKHLRDEIQVNKT QIPDTVTQYSELYHDFTTTQVGMHDIVRSFRQTMDQYSTEPGRYRFMGTEAYAESIDR TVMYYGLPFIQEADFPFNNYLSMLDTVSGNSVYEVITSWMENMPEGKWPNWMIGGPDS SRLTSRLGNQYVNVMNMLLFTLPGTPITYYGEEIGMGNIVAANLNESYDINTLRSKSP MQWDNSSNAGFSEASNTWLPTNSDYHTVNVDVQKTQPRSALKLYQDLSLLHANELLLN RGWFCHLRNDSHYVVYTRELDGIDRIFIVVLNFGESTLLNLHNMISGLPAKIRIRLST NSADKGSKVDTSGIFLDKGEGLIFEHNTKNLLHRQTAFRDRCFVSNRACYSSVLNILY TSC" BASE COUNT 690 a 485 c 514 g 595 t ORIGIN 1 gccttactgc aggaaggcac tccgaagaca taagtcggtg agacatggct gaagataaaa 61 gcaagagaga ctccatcgag atgagtatga agggatgcca gacaaacaac gggtttgtcc 121 ataatgaaga cattctggag cagaccccgg atccaggcag ctcaacagac aacctgaagc 181 acagcaccag gggcatcctt ggctcccagg agcccgactt caagggcgtc cagccctatg 241 cggggatgcc caaggaggtg ctgttccagt tctctggcca ggcccgctac cgcatacctc 301 gggagatcct cttctggctc acagtggctt ctgtgctggt gctcatcgcg gccaccatag 361 ccatcattgc cctctctcca aagtgcctag actggtggca ggaggggccc atgtaccaga 421 tctacccaag gtctttcaag gacagtaaca aggatgggaa cggagatctg aaaggtattc 481 aagataaact ggactacatc acagctttaa atataaaaac tgtttggatt acttcatttt 541 ataaatcgtc ccttaaagat ttcagatatg gtgttgaaga tttccgggaa gttgatccca 601 tttttggaac gatggaagat tttgagaatc tggttgcagc catacatgat aaaggtttaa 661 aattaatcat cgatttcata ccaaaccaca cgagtgataa acatatttgg tttcaattga 721 gtcggacacg gacaggaaaa tatactgatt attatatctg gcatgactgt acccatgaaa 781 atggcaaaac cattccaccc aacaactggt taagtgtgta tggaaactcc agttggcact 841 ttgacgaagt gcgaaaccaa tgttattttc atcagtttat gaaagagcaa cctgatttaa 901 atttccgcaa tcctgatgtt caagaagaaa taaaagaaat tttacggttc tggctcacaa 961 agggtgttga tggttttagt ttggatgctg ttaaattcct cctagaagca aagcacctga 1021 gagatgagat ccaagtaaat aagacccaaa tcccggacac ggtcacacaa tactcggagc 1081 tgtaccatga cttcaccacc acgcaggtgg gaatgcacga cattgtccgc agcttccggc 1141 agaccatgga ccaatacagc acggagcccg gcagatacag gttcatgggg actgaagcct 1201 atgcagagag tattgacagg accgtgatgt actatggatt gccatttatc caagaagctg 1261 attttccctt caacaattac ctcagcatgc tagacactgt ttctgggaac agcgtgtatg 1321 aggttatcac atcctggatg gaaaacatgc cagaaggaaa atggcctaac tggatgattg 1381 gtggaccaga cagttcacgg ctgacttcgc gtttggggaa tcagtatgtc aacgtgatga 1441 acatgcttct tttcacactc cctggaactc ctataactta ctatggagaa gaaattggaa 1501 tgggaaatat tgtagccgca aatctcaatg aaagctatga tattaatacc cttcgctcaa 1561 agtcaccaat gcagtgggac aatagttcaa atgctggttt ttctgaagct agtaacacct 1621 ggttacctac caattcagat taccacactg tgaatgttga tgtccaaaag actcagccca 1681 gatcggcttt gaagttatat caagatttaa gtctacttca tgccaatgag ctactcctca 1741 acaggggctg gttttgccat ttgaggaatg acagccacta tgttgtgtac acaagagagc 1801 tggatggcat cgacagaatc tttatcgtgg ttctgaattt tggagaatca acactgttaa 1861 atctacataa tatgatttcg ggccttcccg ctaaaataag aataaggtta agtaccaatt 1921 ctgccgacaa aggcagtaaa gttgatacaa gtggcatttt tctggacaag ggagagggac 1981 tcatctttga acacaacacg aagaatctcc ttcatcgcca aacagctttc agagatagat 2041 gctttgtttc caatcgagca tgctattcca gtgtactgaa catactgtat acctcgtgtt 2101 aggcaccttt atgaagagat gaagacactg gcatttcagt gggattgtaa gcatttgtaa 2161 tagcttcatg tacagcatgc tgcttggtga acaatcatta attcttcgat atttctgtag 2221 cttgaatgta accgctttaa gaaaggttct caaatgtttt gaaaaaaata aaatgtttaa 2281 aagt // LOCUS HUMABLA 3840 bp mRNA PRI 30-OCT-1994 DEFINITION Human c-abl gene, complete cds. ACCESSION M14752 NID g177942 KEYWORDS c-myc proto-oncogene; protein kinase. SOURCE Human DNA and cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3840) AUTHORS Shtivelman,E., Lifshitz,B., Gale,R.P., Roe,B.A. and Canaani,E. TITLE Alternative splicing of RNAs transcribed from the human abl gene and from the bcr-abl fused gene JOURNAL Cell 47 (2), 277-284 (1986) MEDLINE 87028219 COMMENT The sequence shown is the 5' promoter region from figure 5 and the sequence from figure 1. FEATURES Location/Qualifiers source 1..3840 /organism="Homo sapiens" /db_xref="taxon:9606" /map="9q34.1" gene 365..3757 /gene="ABL1" CDS 365..3757 /gene="ABL1" /note="abl protein" /codon_start=1 /db_xref="GDB:G00-119-640" /db_xref="PID:g177943" /translation="MLEICLKLVGCKSKKGLSSSSSCYLEEALQRPVASDFEPQGLSE AARWNSKENLLAGPSENDPNLFVALYDFVASGDNTLSITKGEKLRVLGYNHNGEWCEA QTKNGQGWVPSNYITPVNSLEKHSWYHGPVSRNAAEYPLSSGINGSFLVRESESSPSQ RSISLRYEGRVYHYRINTASDGKLYVSSESRFNTLAELVHHHSTVADGLITTLHYPAP KRNKPTVYGVSPNYDKWEMERTDITMKHKLGGGQYGEVYEGVWKKYSLTVAVKTLKED TMEVEEFLKEAAVMKEIKHPNLVQLLGVCTREPPFYIITEFMTYGNLLDYLRECNRQE VNAVVLLYMATQISSAMEYLEKKNFIHRDLAARNCLVGENHLVKVADFGLSRLMTGDT YTAHAGAKFPIKWTAPESLAYNKFSIKSDVWAFGVLLWEIATYGMSPYPGIDRSQVYE LLEKDYRMKRPEGCPEKVYELMRACWQWNPSDRPSFAEIHQAFETMFQESSISDEVEK ELGKQGVRGAVTTLLQAPELPTKTRTSRRAAEHRDTTDVPEMPHSKGQGESDPLDHEP AVSPLLPRKERGPPEGGLNEDERLLPKDKKTNLFSALIKKKKKTAPTPPKRSSSFREM DGQPERRGAGEEEGRDISNGALAFTPLDTADPAKSPKPSNGAGVPNGALRESGGSGFR SPHLWKKSSTLTSSRLATGEEEGGGSSSKRFLRSCSVSCVPHGAKDTEWRSVTLPRDL QSTGRQFDSSTFGGHKSEKPALPRKRAGENRSDQVTRGTVTPPPRLVKKNEEAADEVF KDIMESSPGSSPPNLTPKPLRRQVTVAPASGLPHKEEAWKGSALGTPAAAEPVTPTSK AGSGAPRGTSKGPAEESRVRRHKHSSESPGRDKGKLSKLKPAPPPPPAASAGKAGGKP SQRPGQEAAGEAVLGAKTKATSLVDAVNSDAAKPSQPAEGLKKPVLPATPKPHPAKPS GTPISPAPVPLSTLPSASSALAGDQPSSTAFIPLISTRVSLRKTRQPPERASGAITKG VVLDSTEALCLAISGNSEQMASHSAVLEAGKNLYTFCVSYVDSIQQMRNKFAFREAIN KLENNLRELQICPASAGSGPAATQDFSKLLSSVKEISDIVQR" BASE COUNT 863 a 1157 c 1186 g 634 t ORIGIN 1 ggccttcccc ctgcgaggat cgccgttggc ccgggttggc tttggaaagc ggcggtggct 61 ttgggccggg ctcggcctcg ggaacgccag gggcccctgg gtgcggacgg gcgcggccag 121 gagggggtta aggcgcaggc ggcggcgggg cgggggcggg cctggcgggc gccctctccg 181 ggccctttgt taacaggcgc gtcccggcca gcggagacgc ggccgccctg ggcgggcgcg 241 ggcggcgggc ggcggtgagg gcggcctgcg gggcggcgcc cgggggccgg gccgagccgg 301 gcctgagccg ggcccggacc gagctgggag aggggctccg gcccgatcgt tcgcttggcg 361 caaaatgttg gagatctgcc tgaagctggt gggctgcaaa tccaagaagg ggctgtcctc 421 gtcctccagc tgttatctgg aagaagccct tcagcggcca gtagcatctg actttgagcc 481 tcagggtctg agtgaagccg ctcgttggaa ctccaaggaa aaccttctcg ctggacccag 541 tgaaaatgac cccaaccttt tcgttgcact gtatgatttt gtggccagtg gagataacac 601 tctaagcata actaaaggtg aaaagctccg ggtcttaggc tataatcaca atggggaatg 661 gtgtgaagcc caaaccaaaa atggccaagg ctgggtccca agcaactaca tcacgccagt 721 caacagtctg gagaaacact cctggtacca tgggcctgtg tcccgcaatg ccgctgagta 781 tccgctgagc agcgggatca atggcagctt cttggtgcgt gagagtgaga gcagtcctag 841 ccagaggtcc atctcgctga gatacgaagg gagggtgtac cattacagga tcaacactgc 901 ttctgatggc aagctctacg tctcctccga gagccgcttc aacaccctgg ccgagttggt 961 tcatcatcat tcaacggtgg ccgacgggct catcaccacg ctccattatc cagccccaaa 1021 gcgcaacaag cccactgtct atggtgtgtc ccccaactac gacaagtggg agatggaacg 1081 cacggacatc accatgaagc acaagctggg cgggggccag tacggggagg tgtacgaggg 1141 cgtgtggaag aaatacagcc tgacggtggc cgtgaagacc ttgaaggagg acaccatgga 1201 ggtggaagag ttcttgaaag aagctgcagt catgaaagag atcaaacacc ctaacctagt 1261 gcagctcctt ggggtctgca cccgggagcc cccgttctat atcatcactg agttcatgac 1321 ctacgggaac ctcctggact acctgaggga gtgcaaccgg caggaggtga acgccgtggt 1381 gctgctgtac atggccactc agatctcgtc agccatggag tacctagaga agaaaaactt 1441 catccacaga gatcttgctg cccgaaactg cctggtaggg gagaaccact tggtgaaggt 1501 agctgatttt ggcctgagca ggttgatgac aggggacacc tacacagccc atgctggagc 1561 caagttcccc atcaaatgga ctgcacccga gagcctggcc tacaacaagt tctccatcaa 1621 gtccgacgtc tgggcatttg gagtattgct ttgggaaatt gctacctatg gcatgtcccc 1681 ttacccggga attgaccgtt cccaggtgta tgagctgcta gagaaggact accgcatgaa 1741 gcgcccagaa ggctgcccag agaaggtcta tgaactcatg cgagcatgtt ggcagtggaa 1801 tccctctgac cggccctcct ttgctgaaat ccaccaagcc tttgaaacaa tgttccagga 1861 atccagtatc tcagacgaag tggaaaagga gctggggaaa caaggcgtcc gtggggctgt 1921 gactaccttg ctgcaggccc cagagctgcc caccaagacg aggacctcca ggagagctgc 1981 agagcacaga gacaccactg acgtgcctga gatgcctcac tccaagggcc agggagagag 2041 cgatcctctg gaccatgagc ctgccgtgtc tccattgctc cctcgaaaag agcgaggtcc 2101 cccggagggc ggcctgaatg aagatgagcg ccttctcccc aaagacaaaa agaccaactt 2161 gttcagcgcc ttgatcaaga agaagaagaa gacagcccca acccctccca aacgcagcag 2221 ctccttccgg gagatggacg gccagccgga gcgcagaggg gccggcgagg aagagggccg 2281 agacatcagc aacggggcac tggctttcac ccccttggac acagctgacc cagccaagtc 2341 cccaaagccc agcaatgggg ctggggtccc caatggagcc ctccgggagt ccgggggctc 2401 aggcttccgg tctccccacc tgtggaagaa gtccagcacg ctgaccagca gccgcctagc 2461 caccggcgag gaggagggcg gtggcagctc cagcaagcgc ttcctgcgct cttgctccgt 2521 ctcctgcgtt ccccatgggg ccaaggacac ggagtggagg tcagtcacgc tgcctcggga 2581 cttgcagtcc acgggaagac agtttgactc gtccacattt ggagggcaca aaagtgagaa 2641 gccggctctg cctcggaaga gggcagggga gaacaggtct gaccaggtga cccgaggcac 2701 agtaacgcct ccccccaggc tggtgaaaaa gaatgaggaa gctgctgatg aggtcttcaa 2761 agacatcatg gagtccagcc cgggctccag cccgcccaac ctgactccaa aacccctccg 2821 gcggcaggtc accgtggccc ctgcctcggg cctcccccac aaggaagaag cctggaaagg 2881 cagtgcctta gggacccctg ctgcagctga gccagtgacc cccaccagca aagcaggctc 2941 aggtgcacca aggggcacca gcaagggccc cgccgaggag tccagagtga ggaggcacaa 3001 gcactcctct gagtcgccag ggagggacaa ggggaaattg tccaagctca aacctgcccc 3061 gccgccccca ccagcagcct ctgcagggaa ggctggagga aagccctcgc agaggcccgg 3121 ccaggaggct gccggggagg cagtcttggg cgcaaagaca aaagccacga gtctggttga 3181 tgctgtgaac agtgacgctg ccaagcccag ccagccggca gagggcctca aaaagcccgt 3241 gctcccggcc actccaaagc cacaccccgc caagccgtcg gggaccccca tcagcccagc 3301 ccccgttccc ctttccacgt tgccatcagc atcctcggcc ttggcagggg accagccgtc 3361 ttccactgcc ttcatccctc tcatatcaac ccgagtgtct cttcggaaaa cccgccagcc 3421 tccagagcgg gccagcggcg ccatcaccaa gggcgtggtc ttggacagca ccgaggcgct 3481 gtgcctcgcc atctctggga actccgagca gatggccagc cacagcgcag tgctggaggc 3541 cggcaaaaac ctctacacgt tctgcgtgag ctatgtggat tccatccagc aaatgaggaa 3601 caagtttgcc ttccgagagg ccatcaacaa actggagaat aatctccggg agcttcagat 3661 ctgcccggcg tcagcaggca gtggtccggc ggccactcag gacttcagca agctcctcag 3721 ttcggtgaag gaaatcagtg acatagtgca gaggtagcag cagtcagggg tcaggtgtca 3781 ggcccgtcgg agctgcctgc agcacatgcg ggctcgccca tacccatgac agtggctgag // LOCUS HUMABMP 879 bp mRNA PRI 05-FEB-1996 DEFINITION Homo sapiens apolipoprotein B mRNA editing protein mRNA, complete cds. ACCESSION L25877 NID g1177797 KEYWORDS apolipoprotein B mRNA editing protein. SOURCE human cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 879) AUTHORS Hadjiagapiou,C., Giannoni,F., Funahashi,T., Skarosi,S.F. and Davidson,N.O. TITLE Molecular cloning of a human small intestinal apolipoprotein B mRNA editing protein JOURNAL Nucleic Acids Res. 22 (10), 1874-1879 (1994) MEDLINE 94268910 REFERENCE 2 (bases 1 to 879) AUTHORS Davidson,N.O. TITLE Direct Submission JOURNAL Submitted (04-AUG-1994) Nicholas O. Davidson, Medicine, University of Chicago, Chicago, IL 60637, USA FEATURES Location/Qualifiers source 1..879 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="intestine" CDS 23..733 /codon_start=1 /product="apolipoprotein B mRNA editing protein" /db_xref="PID:g1177798" /translation="MTSEKGPSTGDPTLRRRIEPWEFDVFYDPRELRKEACLLYEIKW GMSRKIWRSSGKNTTNHVEVNFIKKFTSERDFHPSISCSITWFLSWSPCWECSQAIRE FLSRHPGVTLVIYVARLFWHMDQQNRQGLRDLVNSGVTIQIMRASEYYHCWRNFVNYP PGDEAHWPQYPPLWMMLYALELHCIILSLPPCLKISRRWQNHLTFFRLHLQNCHYQTI PPHILLATGLIHPSVAWR" BASE COUNT 253 a 206 c 190 g 230 t ORIGIN 1 tgaattcgtg ggacagagca ccatgacttc tgagaaaggt ccttcaaccg gtgaccccac 61 tctgaggaga agaatcgaac cctgggagtt tgacgtcttc tatgacccca gagaacttcg 121 taaagaggcc tgtctgctct acgaaatcaa gtggggcatg agccggaaga tctggcgaag 181 ctcaggcaaa aacaccacca atcacgtgga agttaatttt ataaaaaaat ttacgtcaga 241 aagagatttt cacccatcca tcagctgctc catcacctgg ttcttgtcct ggagtccctg 301 ctgggaatgc tcccaggcta ttagagagtt tctgagtcgg caccctggtg tgactctagt 361 gatctacgta gctcggcttt tttggcacat ggatcaacaa aatcggcaag gtctcaggga 421 ccttgttaac agtggagtaa ctattcagat tatgagagca tcagagtatt atcactgctg 481 gaggaatttt gtcaactacc cacctgggga tgaagctcac tggccacaat acccacctct 541 gtggatgatg ttgtacgcac tggagctgca ctgcataatt ctaagtcttc caccctgttt 601 aaagatttca agaagatggc aaaatcatct tacatttttc agacttcatc ttcaaaactg 661 ccattaccaa acgattccgc cacacatcct tttagctaca gggctgatac atccttctgt 721 ggcttggaga tgaataggat gattccgtgt gtgtactgat tcaagaacaa gcaatgatga 781 cccactaaag agtgaatgcc atttagaatc tagaaatgtt cacaaggtac cccaaaactc 841 tgtagcttaa accaacaata aatatgtatt acctctggc // LOCUS HUMACADL 2217 bp mRNA PRI 30-OCT-1994 DEFINITION Human long chain acyl-CoA dehydrogenase (ACADL) mRNA, complete cds. ACCESSION M74096 NID g177961 KEYWORDS fatty acid oxidation; long chain acyl-CoA dehydrogenase; mitochondrial matrix enzyme. SOURCE Homo sapiens skin cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2217) AUTHORS Indo,Y., Yang-Feng,T., Glassberg,R. and Tanaka,K. TITLE Molecular cloning and nucleotide sequence of cDNAs encoding human long-chain acyl-CoA dehydrogenase and assignment of the location of its gene (ACADL) to chromosome 2 [published erratum appears in Genomics 1992 Mar;12(3):626] JOURNAL Genomics 11 (3), 609-620 (1991) MEDLINE 92128943 FEATURES Location/Qualifiers source 1..2217 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="fibroblast" /tissue_type="skin" /map="2q34-q35" sig_peptide 6..95 /gene="ACADL" /note="G00-118-745" CDS 6..1298 /gene="ACADL" /EC_number="1.3.99.3" /codon_start=1 /db_xref="GDB:G00-118-745" /product="long chain acyl-CoA dehydrogenase" /db_xref="PID:g177962" /translation="MAARLLRGSLRVLGGHRAPRQLPAARCSHSGGEERLETPSAKKL TDIGIRRIFSPEHDIFRKSVRKFFQEEVIPHHSEWEKAGEVSREVWEKAGKQGLLGVN IAEHLGGIGGDLYSAAIVWEEQAYSNCSGPGFSIHSGIVMSYITNHGSEEQIKHFIPQ MTAGKCIGAIAMTEPGAGSDLQGIKTNAKKDGSDWILNGSKVFISNGSLSDVVIVVAV TNHEAPSPAHGISLFLVENGMKGFIKGRKLHKMGLKAQDTAELFFEDIRLPASALLGE ENKGFYYIMKELPQERLLIADVAISASEFMFEETRNYVKQRKAFGKTVAHLQTVQHKL AELKTHICVTRAFVDNCLQLHEAKRLDSATACMAKYWASELQNSVAYDCVQLHGGWGY MWEYPIAKAYVDARVQPIYGGTNEIMKELIAREIVFDK" gene 6..1298 /gene="ACADL" mat_peptide 96..1295 /gene="ACADL" /EC_number="1.3.99.3" /note="G00-118-745" /product="long chain acyl-CoA dehydrogenase" BASE COUNT 732 a 409 c 468 g 608 t ORIGIN chromosome 2, map position q34-q35. 1 cggacatggc cgcacgcctt ctccgagggt ccctacgcgt cctgggcggc caccgtgcgc 61 cgcgccagct gcccgccgcg cgatgttctc attccggagg ggaagaacgt ctagaaactc 121 cttctgctaa aaaattaaca gatataggaa ttcgaagaat cttttctcca gagcatgaca 181 ttttccggaa aagtgtaagg aagtttttcc aagaagaagt gattcctcat cactcagaat 241 gggagaaagc tggagaagta agtagggagg tttgggaaaa agctggaaaa caaggactgc 301 ttggtgtcaa tattgcagag catcttggtg gaattggagg ggatctgtac tccgcagcta 361 ttgtctggga ggagcaagct tattcaaatt gttcaggccc aggttttagt attcattcag 421 gtattgtcat gtcctatatt acaaaccatg gctcagaaga acagattaag cactttattc 481 cccagatgac tgcaggcaaa tgtattggtg caatagcaat gacagagcct ggagctggaa 541 gtgacttaca gggaataaaa acaaatgcta aaaaggatgg aagtgactgg attctcaatg 601 gaagcaaggt gttcatcagt aatgggtcat taagtgatgt tgtgattgta gttgcggtca 661 caaatcatga agctccctcc cctgcccatg gtattagcct ttttctggtg gaaaatggaa 721 tgaaaggatt tatcaaggga cgaaagctac ataaaatggg attaaaagcc caggataccg 781 cagaactatt ctttgaagat atacggttgc cagctagtgc cctacttgga gaagagaata 841 aaggcttcta ttacatcatg aaagagcttc cacaggaaag gctgttaatt gctgatgtgg 901 caatttcagc tagtgaattc atgtttgaag aaaccaggaa ctatgttaaa caaagaaaag 961 cttttggcaa aacagttgct cacctacaga cagtgcaaca taaattagca gaattaaaaa 1021 cacatatatg tgtaacccga gcatttgtgg acaactgtct ccagctgcat gaagcgaaac 1081 gtttggactc cgccactgct tgcatggcga aatattgggc atctgagtta caaaatagtg 1141 tagcttacga ctgtgtacag ctccatggag gttggggata catgtgggag tacccaattg 1201 caaaagctta tgtggatgcc agagttcagc caatctatgg tggtacaaat gaaataatga 1261 aggagctgat tgcaagagag attgtctttg acaagtagac atctgcccac atcctggagt 1321 cctattacag ctaatctcgt tttaaatctg ctcaagataa aatgtaactt ggaaagcgag 1381 gaaacactaa acatgttttt acctgctctc tctatagaga aggaaataaa atataaatat 1441 aagattaaca cagtggaagg acaaatcttt gaagccaaaa ttctagtttt ccaatataag 1501 gtttaactta cagtttttta tgtagccaaa ggtaaacggt tttctgaatc ttgcctaggt 1561 gtttcattta tctctaaaat tctaaaaagc ataaatcatt caaatcttca aaccaaggca 1621 gaaataattt tatgtcgcta tagtataaaa acattaataa gatagcacat tgacttttaa 1681 agggaaaagt aaatataact tagcatgtaa actcatttcg gctaccattt gctccaaatt 1741 ccctagaaca gtggttttta ccactgtact ccaaccccgt ttttaagcaa tggaactctt 1801 tcttcaaaca aaagcttatg cagaacatct ctgtgaaacg ctgctgagtg agaactgctt 1861 tcattgaagc tggaagccat cataccttac tgccttgaaa cccctaggac tcagctaagt 1921 atttgcctaa ccctgaccag ggaatgcctt ggttctgtca attgctgaca tctgagaaca 1981 cagaataatc catcattttt aatttcaaga tattggtaca ttttataggt atcaaagcaa 2041 tggcttttct tttgcaacag ttaatgtatt tattaactta ataattactt tatgtcttct 2101 ataaaccagg ctgttaatac aatgatgaca aacaaaactg gcaagatcac taaaaaataa 2161 gtgaataaac aaataagtag taaaataagg taagaagtaa atatgtaaaa gagataa // LOCUS HUMACADM 2019 bp mRNA PRI 30-OCT-1994 DEFINITION Human medium-chain acyl-CoA dehydrogenase (ACADM) mRNA, complete cds. ACCESSION M16827 J05355 NID g177963 KEYWORDS acyl-CoA dehydrogenase. SOURCE Human liver (library of S.Woo) and placenta (library of E.Sadler), cDNA to mRNA, clones M-[2,3]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2019) AUTHORS Kelly,D.P., Kim,J.J., Billadello,J.J., Hainline,B.E., Chu,T.W. and Strauss,A.W. TITLE Nucleotide sequence of medium-chain acyl-CoA dehydrogenase mRNA and its expression in enzyme-deficient human tissue JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84 (12), 4068-4072 (1987) MEDLINE 87231952 FEATURES Location/Qualifiers source 1..2019 /organism="Homo sapiens" /db_xref="taxon:9606" /map="1p31" mRNA <1..2019 /note="ACADM mRNA" gene 19..1284 /gene="ACADM" CDS 19..1284 /gene="ACADM" /note="medium-chain acyl-CoA dehydrogenase (EC 1.3.99.3)" /codon_start=1 /db_xref="GDB:G00-118-958" /db_xref="PID:g177964" /translation="MAAGFGRCCRVLRSISRFHWRSQHTKANRQREPGLGFSFEFTEQ QKEFQATARKFAREEIIPVAAEYDKTGEYPVPLIRRAWELGLMNTHIPENCGGLGLGT FDACLISEELAYGCTGVQTAIEGNSLGQMPIIIAGNDQQKKKYLGRMTEEPLMCAYCV TEPGAGSDVAGIKTKAEKKGDEYIINGQKMWITNGGKANWYFLLARSDPDPKAPANKA FTGFIVEADTPGIQIGRKELNMGQRCSDTRGIVFEDVKVPKENVLIGDGAGFKVAMGA FDKTRPVVAAGAVGLAQRALDEATKYALERKTFGKLLVEHQAISFMLAEMAMKVELAR MSYQRAAWEVDSGRRNTYYASIAKAFAGDIANQLATDAVQILGGNGFNTEYPVEKLMR DAKIYQIYEGTSQIQRLIVAREHIDKYKN" BASE COUNT 655 a 311 c 425 g 628 t ORIGIN 46 bp upstream of PstI site; chromosome 1p31. 1 ccggaacgga gagccaacat ggcagcgggg ttcgggcgat gctgcagggt cctgagaagt 61 atttctcgtt ttcattggag atcacagcat acaaaagcca atcgacaacg tgaaccagga 121 ttaggattta gttttgagtt caccgaacag cagaaagaat ttcaagctac tgctcgtaaa 181 tttgccagag aggaaatcat cccagtggct gcagaatatg ataaaactgg tgaatatcca 241 gtccccctaa ttagaagagc ctgggaactt ggtttaatga acacacacat tccagagaac 301 tgtggaggtc ttggacttgg aacttttgat gcttgtttaa ttagtgaaga attggcttat 361 ggatgtacag gggttcagac tgctattgaa ggaaattctt tggggcaaat gcctattatt 421 attgctggaa atgatcaaca aaagaagaag tatttgggga gaatgactga ggagccattg 481 atgtgtgctt attgtgtaac agaacctgga gcaggctctg atgtagctgg tataaagacc 541 aaagcagaaa agaaaggaga tgagtatatt attaatggtc agaagatgtg gataaccaac 601 ggaggaaaag ctaattggta ttttttattg gcacgttctg atccagatcc taaagctcct 661 gctaataaag cctttactgg attcattgtg gaagcagata ccccaggaat tcagattggg 721 agaaaggaat taaacatggg ccagcgatgt tcagatacta gaggaattgt cttcgaagat 781 gtgaaagtgc ctaaagaaaa tgttttaatt ggtgacggag ctggtttcaa agttgcaatg 841 ggagcttttg ataaaaccag acctgtagta gctgctggtg ctgttggatt agcacaaaga 901 gctttggatg aagctaccaa gtatgccctg gaaaggaaaa ctttcggaaa gctacttgta 961 gagcaccaag caatatcatt tatgctggct gaaatggcaa tgaaagttga actagctaga 1021 atgagttacc agagagcagc ttgggaggtt gattctggtc gtcgaaatac ctattatgct 1081 tctattgcaa aggcatttgc tggagatatt gcaaatcagt tagctactga tgctgtgcag 1141 atacttggag gcaatggatt taatacagaa tatcctgtag aaaaactaat gagggatgcc 1201 aaaatctatc agatttatga aggtacttca caaattcaaa gacttattgt agcccgtgaa 1261 cacattgaca agtacaaaaa ttaaaaaaat tactgtagaa atattgaata actagaacac 1321 aagccactgt ttcagctcca gaaaaaagaa agggctttaa cgttttttcc agtgaaaaca 1381 aatcctctta tattaaatct aagcaactgc ttattatagt agtttatact tttgcttaac 1441 tctgttatgt ctcttaagca ggtttggttt ttattaaaat gatgtgtttt ctttagtacc 1501 actttacttg aattacatta acctagaaaa ctacataggt tattttgatc tcttaagatt 1561 aatgtagcag aaatttcttg gaattttatt tttgtaatga cagaaaagtg ggcttagaaa 1621 gtattcaaga tgttacaaaa tttacattta gaaaatattg tagtatttga atactgtcaa 1681 cttgacagta actttgtaga cttaatggta ttattaaagt tctttttatt gcagtttgga 1741 aagcatttgt gaaactttct gtttggcaca gaaacagtca aaattttgac attcatattc 1801 tcctatttta cagctacaag aactttcttg aaaatcttat ttaattctga gcccatattt 1861 cacttacctt atttaaaata aatcaataaa gcttgcctta aattattttt atatgactgt 1921 tggtctctag gtagcctttg gtctattgta cacaatctca tttcatatgt ttgcattttg 1981 gcaaagaact taataaaatt gttcagtgct tattatcat // LOCUS HUMACALX 724 bp mRNA PRI 15-SEP-1990 DEFINITION Human calcitonin mRNA, complete cds. ACCESSION M26095 NID g177965 KEYWORDS calcitonin. SOURCE Human cell-line BEN, cDNA to mRNA, clone hBEN-JR2. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 724) AUTHORS Craig,R.K., Riley,J.H., Edbrooke,M.R., Broad,P.M., Foord,S.M., Al-Kazwini,S.J., Holman,J.J. and Marshall,I. TITLE Expression and function of the human calcitonin/alpha-CGRP gene in health and disease JOURNAL Biochem. Soc. Symp. 52, 91-105 (1986) MEDLINE 87213363 FEATURES Location/Qualifiers source 1..724 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="BEN" CDS 35..460 /note="calcitonin precursor" /codon_start=1 /db_xref="PID:g177966" /translation="MGFQKFSPFLALSILVLLQAGSLHAAPFRSALESSPADPATLSE DEARLLLAALVQDYVQMKASELEQEQEREGSSLDSPRSKRCGNLSTCMLGTYTQDFNK FHTFPQTAIGVGAPGKKRDMSSDLERDHRPHVSMPQNAN" sig_peptide 35..109 /note="calcitonin signal peptide" mat_peptide 287..382 /note="calcitonin" mat_peptide 383..457 /note="flanking peptide" BASE COUNT 163 a 195 c 200 g 166 t ORIGIN 1 ggtgagcccc gagattctgg ctcagagagg tgtcatgggc ttccaaaagt tctccccctt 61 cctggctctc agcatcttgg tcctgttgca ggcaggcagc ctccatgcag caccattcag 121 gtctgccctg gagagcagcc cagcagaccc ggccacgctc agtgaggacg aagcgcgcct 181 cctgctggct gcactggtgc aggactatgt gcagatgaag gccagtgagc tggagcagga 241 gcaagagaga gagggctcca gcctggacag ccccagatct aagcggtgcg gtaatctgag 301 tacttgcatg ctgggcacat acacgcagga cttcaacaag tttcacacgt tcccccaaac 361 tgcaattggg gttggagcac ctggaaagaa aagggatatg tccagcgact tggagagaga 421 ccatcgccct catgttagca tgccccagaa tgccaactaa actcctccct ttccttccta 481 atttcccttc ttgcatcctt cctataactt gatgcatgtg gtttggttcc tctctggtgg 541 ctctttgggc tggtattggt ggctttcctt gtggcagagg atgtctcaaa cttcagatgg 601 gaggaaagag agcaggactc acaggttgga agagaatcac ctgggaaaat accagaaaat 661 gagggccgct ttgagtcccc cagagatgtc atcagagctc ctctgtcctg ctttctgaat 721 gtgc // LOCUS HUMACHRM2 2210 bp DNA PRI 30-OCT-1994 DEFINITION Human m2 muscarinic acetylcholine receptor gene. ACCESSION M16404 NID g177989 KEYWORDS acetylcholine receptor; m2 muscarinic acetylcholine receptor; neurotransmitter. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2210) AUTHORS Bonner,T.I., Buckley,N.J., Young,A.C. and Brann,M.R. TITLE Identification of a family of muscarinic acetylcholine receptor genes [published erratum appears in Science 1987 Sep 25;237(4822):237] JOURNAL Science 237 (4814), 527-532 (1987) MEDLINE 87263421 COMMENT Draft entry and computer-readable copy of sequence in [1] kindly provided by T.I.Bonner, 17-JUL-1987. FEATURES Location/Qualifiers source 1..2210 /organism="Homo sapiens" /db_xref="taxon:9606" /map="7q35-qter" intron <1..148 /note="ACHR-m2 intron A" prim_transcript <1..2171 /note="ACHR-m2 pre-mRNA" gene 195..1595 /gene="CHRM2" CDS 195..1595 /gene="CHRM2" /note="muscarinic acetylcholine receptor m2" /codon_start=1 /db_xref="GDB:G00-125-214" /db_xref="PID:g177990" /translation="MNNSTNSSNNSLALTSPYKTFEVVFIVLVAGSLSLVTIIGNILV MVSIKVNRHLQTVNNYFLFSLACADLIIGVFSMNLYTLYTVIGYWPLGPVVCDLWLAL DYVVSNASVMNLLIISFDRYFCVTKPLTYPVKRTTKMAGMMIAAAWVLSFILWAPAIL FWQFIVGVRTVEDGECYIQFFSNAAVTFGTAIAAFYLPVIIMTVLYWHISRASKSRIK KDKKEPVANQDPVSPSLVQGRIVKPNNNNMPSSDDGLEHNKIQNGKAPRDPVTENCVQ GEEKESSNDSTSVSAVASNMRDDEITQDENTVSTSLGHSKDENSKQTCIRIGTKTPKS DSCTPTNTTVEVVGSSGQNGDEKQNIVARKIVKMTKQPAKKKPPPSREKKVTRTILAI LLAFIITWAPYNVMVLINTFCAPCIPNTVWTIGYWLCYINSTINPACYALCNATFKKT FKHLLMCHYKNIGATR" BASE COUNT 657 a 467 c 474 g 612 t ORIGIN 19 bp upstream of AccI site. 1 acatgggaat taggcaggta gacacagtaa tcatgcaggg gaagggagat ttgggagaaa 61 ataatgtggt ttaaaaggag aaacaacatt atgtatttta aaccaatgtt tatattatgt 121 ttgttaattt tattctattt ccttgcaggt ttaaatgttt atttgctact tggctactga 181 ttagagaacg caaaatgaat aactcaacaa actcctctaa caatagcctg gctcttacaa 241 gtccttataa gacatttgaa gtggtgttta ttgtcctggt ggctggatcc ctcagtttgg 301 tgaccattat cgggaacatc ctagtcatgg tttccattaa agtcaaccgc cacctccaga 361 ccgtcaacaa ttacttttta ttcagcttgg cctgtgctga ccttatcata ggtgttttct 421 ccatgaactt gtacaccctc tacactgtga ttggttactg gcctttggga cctgtggtgt 481 gtgacctttg gctagccctg gactatgtgg tcagcaatgc ctcagttatg aatctgctca 541 tcatcagctt tgacaggtac ttctgtgtca caaaacctct gacctaccca gtcaagcgga 601 ccacaaaaat ggcaggtatg atgattgcag ctgcctgggt cctctctttc atcctctggg 661 ctccagccat tctcttctgg cagttcattg taggggtgag aactgtggag gatggggagt 721 gctacattca gtttttttcc aatgctgctg tcacctttgg tacggctatt gcagccttct 781 atttgccagt gatcatcatg actgtgctat attggcacat atcccgagcc agcaagagca 841 ggataaagaa ggacaagaag gagcctgttg ccaaccaaga ccccgtttct ccaagtctgg 901 tacaaggaag gatagtgaag ccaaacaata acaacatgcc cagcagtgac gatggcctgg 961 agcacaacaa aatccagaat ggcaaagccc ccagggatcc tgtgactgaa aactgtgttc 1021 agggagagga gaaggagagc tccaatgact ccacctcagt cagtgctgtt gcctctaata 1081 tgagagatga tgaaataacc caggatgaaa acacagtttc cacttccctg ggccattcca 1141 aagatgagaa ctctaagcaa acatgcatca gaattggcac caagacccca aaaagtgact 1201 catgtacccc aactaatacc accgtggagg tagtggggtc ttcaggtcag aatggagatg 1261 aaaagcagaa tattgtagcc cgcaagattg tgaagatgac taagcagcct gcaaaaaaga 1321 agcctcctcc ttcccgggaa aagaaagtca ccaggacaat cttggctatt ctgttggctt 1381 tcatcatcac ttgggcccca tacaatgtca tggtgctcat taacaccttt tgtgcacctt 1441 gcatccccaa cactgtgtgg acaattggtt actggctttg ttacatcaac agcactatca 1501 accctgcctg ctatgcactt tgcaatgcca ccttcaagaa gacctttaaa caccttctca 1561 tgtgtcatta taagaacata ggcgctacaa ggtaaaatat ctttgaaaaa gatagaaggt 1621 gggcaagggg agcttgagaa gaataaaagg gataaacgag ctcctagttt taaaatctct 1681 gccattgcac tttatagtct gattacaaaa cgtgcaattc aggagcccag cagtgacaca 1741 cttatcacgc ctaggctcca gtttgcaaaa attgcacctt ataaactgtc agtattagga 1801 gcaatgagac aatgaaagaa acatgttggg atcgtggatt taagaaacta tacactgttt 1861 ctcataatct cttgaagaag ggcttctgat tctacaattt tatcagtctc tgcacaagag 1921 gaataacctt gttccttttt tgttactttt gttgttgttg ttctcatgtg tccttaagag 1981 aaggaatgcc acagttacaa ggtaaacatg gagacttaaa cataaagaaa taggcactat 2041 acaatgggga cataaaaaaa gaaaatgaaa gaaggatgca gaaatttgtc tccggagtgt 2101 taagcatatt ttattctttt gttacggtcc tatttagagg attggaatgt aataaatgct 2161 tattttttgc ctttcttttt cccaccatga agagaaagca aacaaacaga // LOCUS HUMACHRM4 2595 bp DNA PRI 30-OCT-1994 DEFINITION Human m4 muscarinic acetylcholine receptor gene. ACCESSION M16405 NID g177991 KEYWORDS acetylcholine receptor; muscarinic acetylcholine receptor; neurotransmitter. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2595) AUTHORS Bonner,T.I., Buckley,N.J., Young,A.C. and Brann,M.R. TITLE Identification of a family of muscarinic acetylcholine receptor genes [published erratum appears in Science 1987 Sep 25;237(4822):237] JOURNAL Science 237 (4814), 527-532 (1987) MEDLINE 87263421 COMMENT Draft entry and computer-readable copy of sequence in [1] kindly provided by T.I.Bonner, 17-JUL-1987. FEATURES Location/Qualifiers source 1..2595 /organism="Homo sapiens" /db_xref="taxon:9606" /map="11p12-p11.2" intron <1..771 /note="ACHR-m4 intron A" prim_transcript <1..2595 /note="ACHR-m4 pre-mRNA" gene 801..2237 /gene="CHRM4" CDS 801..2237 /gene="CHRM4" /note="muscarinic acetylcholine receptor m4" /codon_start=1 /db_xref="GDB:G00-125-216" /db_xref="PID:g177992" /translation="MANFTPVNGSSGNQSVRLVTSSSHNRYETVEMVFIATVTGSLSL VTVVGNILVMLSIKVNRQLQTVNNYFLFSLACADLIIGAFSMNLYTVYIIKGYWPLGA VVCDLWLALDYVVSNASVMNLLIISFDRYFCVTKPLTYPARRTTKMAGLMIAAAWVLS FVLWAPAILFWQFVVGKRTVPDNHCFIQFLSNPAVTFGTAIAAFYLPVVIMTVLYIHI SLASRSRVHKHRPEGPKEKKAKTLAFLKSPLMKQSVKKPRPGGRPGGLRNGKLEEAPP PALPPPPRPVADKDTSNESSSGSATQNTKERPATELSTTEATTPAMPAPPLQPRALNP ASRWSKIQIVTKQTGNECVTAIEIVPATPAGMRPAANVARKFASIARNQVRKKRQMAA RERKVTRTIFAILLAFILTWTPYNVMVLVNTFCQSCIPDTVWSIGYWLCYVNSTINPA CYALCNATFKKTFRHLLLCQYRNIGTAR" BASE COUNT 528 a 839 c 674 g 552 t 2 others ORIGIN 1 bp upstream of XbaI site. 1 tctagaccac cagcctggac aacataccaa gaccctgtct ctacaaataa atagataaat 61 aaatagacac tttttttaag tgtcaaaagt gcttggcact tagtagacca tcagtgttag 121 gtgctcatac ataccccgat tattgccttg tcccagtgtc ttgtacaggg gttggagagn 181 aggtgttaag aaatgaccga atgggtaaat ggatgaacag aacacctccc tccagagccc 241 acatgctcgt gggcctctgg gaccactctc ctcctcctct tgcttccctg agctccccca 301 gcatggcctc tgtccaggcc ttgcgctgcc tccaggcctt tgctgtggct actgcccctg 361 gagcgccatn tccacagctc ctcctgtggc tggctcctca tcacccagat gacctggtgg 421 gtgaggccac ctagcaagga gtcatgcctg tcctgccttc tgactcactc tctcatcacc 481 ctgccttttt tttcttttgt ggctcacgtg tttgcatgtc tccccccatg aggcaggggg 541 ccatgtgtgt cttattcact tctgtagcca cagcaccctg agcaatgctt gccacatagt 601 aggtgctcaa ttaatgttga atgaatgggc aaaatgcggg atggcgggac agagttctct 661 caaggcattc tgccagagaa tgtccctctg tcaccttgaa tccagtgtac ctccagatga 721 ctcccccatt ccctcctgta gttcatgctt ttctctcccc ttcctcccca gacacggcct 781 acccacccct ggcaaccaac atggccaact tcacacctgt caatggcagc tcgggcaatc 841 agtccgtgcg cctggtcacg tcatcatccc acaatcgcta tgagacggtg gaaatggtct 901 tcattgccac agtgacaggc tccctgagcc tggtgactgt cgtgggcaac atcctggtga 961 tgctgtccat caaggtcaac aggcagctgc agacagtcaa caactacttc ctcttcagcc 1021 tggcgtgtgc tgatctcatc ataggcgcct tctccatgaa cctctacacc gtgtacatca 1081 tcaagggcta ctggcccctg ggcgccgtgg tctgcgacct gtggctggcc ctggactacg 1141 tggtgagcaa cgcctccgtc atgaaccttc tcatcatcag ctttgaccgc tacttctgcg 1201 tcaccaagcc tctcacctac cctgcccggc gcaccaccaa gatggcaggc ctcatgattg 1261 ctgctgcctg ggtactgtcc ttcgtgctct gggcgcctgc catcttgttc tggcagtttg 1321 tggtgggtaa gcggacggtg cccgacaacc actgcttcat ccagttcctg tccaacccag 1381 cagtgacctt tggcacagcc attgctgcct tctacctgcc tgtggtcatc atgacggtgc 1441 tgtacatcca catctccctg gccagtcgca gccgagtcca caagcaccgg cccgagggcc 1501 cgaaggagaa gaaagccaag acgctggcct tcctcaagag cccactaatg aagcagagcg 1561 tcaagaagcc ccgcccggga ggccgcccgg gaggactgcg caatggcaag ctggaggagg 1621 cccccccgcc agcgctgcca ccgccaccgc gccccgtggc tgataaggac acttccaatg 1681 agtccagctc aggcagtgcc acccagaaca ccaaggaacg cccagccaca gagctgtcca 1741 ccacagaggc caccactccc gccatgcccg cccctcccct gcagccgcgg gccctcaacc 1801 cagcctccag atggtccaag atccagattg tgacgaagca gacaggcaat gagtgtgtga 1861 cagccattga gattgtgcct gccacgccgg ctggcatgcg ccctgcggcc aacgtggccc 1921 gcaagttcgc cagcatcgct cgcaaccagg tgcgcaagaa gcggcagatg gcggcccggg 1981 agcgcaaagt gacacgaacg atctttgcca ttctgctagc cttcatcctc acctggacgc 2041 cctacaacgt catggtcctg gtgaacacct tctgccagag ctgcatccct gacacggtgt 2101 ggtccattgg ctactggctc tgctacgtca acagcaccat caaccctgcc tgctatgctc 2161 tgtgcaacgc cacctttaaa aagaccttcc ggcacctgct gctgtgccag tatcggaaca 2221 tcggcactgc caggtaggca ggcaggagtg ccctaggagg tgcggtgtgc gtgcgtgtgc 2281 tgggggacca cacggctcac ttgctgtggg gaagagtgca ggcaccattc tgcgttcacg 2341 tttgctgagg aggaagttca gaagaggctc tgtggctgca ttcagagacc agatctctgc 2401 tcacccgtga ggaggctcac cccagggagt gtctgaactg gggctgcctg gcccacctct 2461 gtggccctgc ttcagcgagc tgcggggcac tggcctgggt gggcacctgc ccactgtgac 2521 caaccatcag cagtgctgga agaatggaga tctggatggg ggccgaagcc cagggccccc 2581 tcaggaagaa caaag // LOCUS HUMACKII 2195 bp mRNA PRI 03-OCT-1990 DEFINITION Human casein kinase II alpha subunit mRNA, complete cds. ACCESSION M55265 J02920 NID g177993 KEYWORDS casein kinase II alpha-subunit. SOURCE Human T-lymphocytes, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2195) AUTHORS Lozeman,F.J., Litchfield,D.W., Piening,C., Takio,K., Walsh,K. and Krebs,E.G. TITLE Isolation and characterization of human cDNA clones encoding the alpha and alpha' subunits of casein kinase II JOURNAL Biochemistry 29, 8436-8447 (1990) MEDLINE 91070071 FEATURES Location/Qualifiers source 1..2195 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T-lymphocyte" mRNA 1..2195 /partial /product="casein kinase II alpha subunit" CDS 149..1324 /codon_start=1 /product="casein kinase II alpha subunit" /db_xref="PID:g177994" /translation="MSGPVPSRARVYTDVNTHRPREYWDYESHVVEWGNQDDYQLVRK LGRGKYSEVFEAINITNNEKVVVKILKPVKKKKIKREIKILENLRGGPNIITLADIVK DPVSRTPALVFEHVNNTDFKQLYQTLTDYDIRFYMYEILKALDYCHSMGIMHRDVKPH NVMIDHEHRKLRLIDWGLAEFYHPGQEYNVRVASRYFKGPELLVDYQMYDYSLDMWSL GCMLASMIFRKEPFFHGHDNYDQLVRIAKVLGTEDLYDYIDKYNIELDPRFNDILGRH SRKRWERFVHSENQHLVSPEALDFLDKLLRYDHQSRLTAREAMEHPYFYTVVKDQARM GSSSMPGGSTPVSSANMMSGISSVPTPSPLGPLAGSPVIAAANPLGMPVPAAAGAQQ" BASE COUNT 604 a 498 c 494 g 599 t ORIGIN 1 aggggagagc ggccgccgcc gctgccgctt ccaccacagt ttgaagaaaa caggtctgaa 61 acaaggtctt acccccagct gcttctgaac acagtgactg ccagatctcc aaacatcaag 121 tccagctttg tccgccaacc tgtctgacat gtcgggaccc gtgccaagca gggccagagt 181 ttacacagat gttaatacac acagacctcg agaatactgg gattacgagt cacatgtggt 241 ggaatgggga aatcaagatg actaccagct ggttcgaaaa ttaggccgag gtaaatacag 301 tgaagtattt gaagccatca acatcacaaa taatgaaaaa gttgttgtta aaattctcaa 361 gccagtaaaa aagaagaaaa ttaagcgtga aataaagatt ttggagaatt tgagaggagg 421 tcccaacatc atcacactgg cagacattgt aaaagaccct gtgtcacgaa cccccgcctt 481 ggtttttgaa cacgtaaaca acacagactt caagcaattg taccagacgt taacagacta 541 tgatattcga ttttacatgt atgagattct gaaggccctg gattattgtc acagcatggg 601 aattatgcac agagatgtca agccccataa tgtcatgatt gatcatgagc acagaaagct 661 acgactaata gactggggtt tggctgagtt ttatcatcct ggccaagaat ataatgtccg 721 agttgcttcc cgatacttca aaggtcctga gctacttgta gactatcaga tgtacgatta 781 tagtttggat atgtggagtt tgggttgtat gctggcaagt atgatctttc ggaaggagcc 841 atttttccat ggacatgaca attatgatca gttggtgagg atagccaagg ttctggggac 901 agaagattta tatgactata ttgacaaata caacattgaa ttagatccac gtttcaatga 961 tatcttgggc agacactctc gaaagcgatg ggaacgcttt gtccacagtg aaaatcagca 1021 ccttgtcagc cctgaggcct tggatttcct ggacaaactg ctgcgatatg accaccagtc 1081 acggcttact gcaagagagg caatggagca cccctatttc tacactgttg tgaaggacca 1141 ggctcgaatg ggttcatcta gcatgccagg gggcagtacg cccgtcagca gcgccaatat 1201 gatgtcaggg atttcttcag tgccaacccc ttcacccctt ggacctctgg caggctcacc 1261 agtgattgct gctgccaacc cccttgggat gcctgttcca gctgccgctg gcgctcagca 1321 gtaacggccc tatctgtctc ctgatgcctg agcagaggtg ggggagtcca ccctctcctt 1381 gatgcagctt gcgcctggcg gggaggggtg aaacacttca gaagcaccgt gtctgaaccg 1441 ttgcttgtgg atttatagta gttcagtcat aaaaaaaaaa ttataatagg ctgattttct 1501 tttttctttt tttttttaac tcgaactttt cataactcag gggattccct gaaaaattac 1561 ctgcaggtgg aatatttcat ggacaaattt ttttttctcc cctcccaaat ttagttcctc 1621 atcacaaaag aacaaagata aaccagcctc aatcccggct gctgcattta ggtggagact 1681 tcttcccatt cccaccattg ttcctccacc gtcccacact ttagggggtt ggtatctcgt 1741 gctcttctcc agagattaca aaaatgtagc ttctcagggg aggcaggaag aaaggaagga 1801 aggaaagaag gaagggagga cccaatctat aggagcagtg gactgcttgc tggtcgctta 1861 catcacttta ctccataagc gcttcagtgg ggttatccta gtggctcttg tggaagtgtg 1921 tcttagttac atcaagatgt tgaaaatcta cccaaaatgc agacagatac taaaaacttc 1981 tgttcagtaa gaatcatgtc ttactgatct aaccctaaat ccaactcatt tatactttta 2041 tttttagttc agtttaaaat gttgatacct tccctcccag gctccttacc ttggtctttt 2101 ccctgttcat ctcccaacat gctgtgctcc atagctggta ggagagggaa ggcaaaatct 2161 ttcttagttt tctttgtctt ggccattttg aattc // LOCUS HUMACT1A 1446 bp mRNA PRI 07-OCT-1996 DEFINITION Human replication factor C, 37-kDa subunit mRNA, complete cds. ACCESSION M87339 NID g1498255 KEYWORDS RFC; Activator 1. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1446) AUTHORS Chen,M., Pan,Z.Q. and Hurwitz,J. TITLE Studies of the cloned 37-kDa subunit of activator 1 (replication factor C) of HeLa cells JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (12), 5211-5215 (1992) MEDLINE 92302215 REFERENCE 2 (bases 1 to 1446) AUTHORS Hurwitz,J. TITLE Direct Submission JOURNAL Submitted (31-DEC-1994) Molecular Biology, Memorial Sloan-Kettering Cancer Center, 1275 York Avenue, New York, NY 10021, USA FEATURES Location/Qualifiers source 1..1446 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa (D98/AH-2)" CDS 284..1375 /note="replicative polymerase accessory protein; activator 1" /codon_start=1 /evidence=experimental /product="replication factor C, 37-kDa subunit" /db_xref="PID:g1498256" /translation="MQAFLKGTSISTKPPLTKDRGVAASAGSSGENKKAKPVPWVEKY RPKCVDEVAFQEEVVAVLKKSLEGADLPNLLFYGPPGTGKTSTILAAARELFGPELFR LRVLELNASDERGIQVVREKVKNFAQLTVSGSRSDGKPCPPFKIVILDEADSMTSAAQ AALRRTMEKESKTTRFCLICNYVSRIIEPLTSRCSKFRFKPLSDKIQQQRLLDIAKKE NVKISDEGIAYLVKVSEGDLRKAITFLQSATRLTGGKEITEKVITDIAGVIPAEKIDG VFAACQSGSFDKLEAVVKDLIDEGHAATQLVNQLHDVVVENNLSDKQKSIITEKLAEV DKCLADGADEHLQLISLCATVMQQLSQNC" BASE COUNT 445 a 276 c 355 g 370 t ORIGIN 1 ctgccattta ggacaagctg gatgatgatg gtttgatagc tccaggggtt cgtgtatagg 61 agatgatgaa tctgcttcat ccagaatcac aatcttaaaa ggcgggaact gaggcgactg 121 tggggacatc agtgatcgta agtctcctgg gcccgttatt ctcagattag gtgacggagc 181 taagacttcg agaccatctc gtcctttttg tatcgcggaa acctgaggaa cgagccggcg 241 gcggtgacct gcacgagaag ccaggctaac tgggtgaagt accatgcaag catttcttaa 301 aggtacatcc atcagtacta aacccccgct gaccaaggat cgaggagtag ctgccagtgc 361 gggaagtagc ggagagaaca agaaagccaa acccgttccc tgggtggaaa aatatcgccc 421 aaaatgtgtg gatgaagttg ctttccagga agaagtggtt gcagtgctga aaaaatcttt 481 agaaggagca gatcttccta atctcttgtt ttacggacca cctggaactg gaaaaacatc 541 cactattttg gcagcagcta gagaactctt tgggcctgaa cttttccgat taagagttct 601 tgagttaaat gcatctgatg aacgtggaat acaagtagtt cgagagaaag tgaaaaattt 661 tgctcaatta actgtgtcag gaagtcgctc agatgggaag ccgtgtccgc cttttaagat 721 tgtgattctg gatgaagcag attctatgac ctcagctgct caggcagctt taagacgtac 781 catggagaag gagtcgaaaa ccacccgatt ctgtcttatc tgtaactatg tcagtcgaat 841 aattgaaccc ctgacctcta gatgttcaaa attccgcttc aagcctctgt cagataaaat 901 tcaacagcag cgattactag acattgccaa gaaggaaaat gtcaaaatta gtgatgaggg 961 aatagcttat cttgttaaag tgtcagaagg agacttaaga aaagccatta catttcttca 1021 aagcgctact cgattaacag gtggaaagga gatcacagag aaagtgatta cagacattgc 1081 tggggtaata ccagctgaga aaattgatgg agtatttgct gcctgtcaga gtggctcttt 1141 tgacaaacta gaagctgtgg tcaaggattt aatagatgag ggtcatgcag caactcagct 1201 cgtcaatcaa ctccatgatg tggttgtaga aaataactta tctgataaac agaagtctat 1261 tatcacagaa aaacttgccg aagttgacaa atgcctagca gatggtgctg atgaacattt 1321 gcaactcatc agcctttgtg caactgtgat gcagcagtta tctcagaatt gttaacgtga 1381 tatatctgga tggggggttt tgtaaataat gaagttgtaa taaaaataaa atgaccaaaa 1441 gcaccg // LOCUS HUMACTIIA 2382 bp mRNA PRI 14-MAY-1992 DEFINITION Human activin type II receptor mRNA, complete cds. ACCESSION M93415 NID g178049 KEYWORDS activin receptor type II. SOURCE Homo sapiens male testis cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2382) AUTHORS Donaldson,C.J., Mathews,L.S. and Vale,W.W. TITLE Molecular cloning and binding properties of the human type II activin receptor JOURNAL Biochem. Biophys. Res. Commun. 184, 310-316 (1992) MEDLINE 92231944 FEATURES Location/Qualifiers source 1..2382 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /tissue_type="testis" sig_peptide 174..230 CDS 174..1715 /codon_start=1 /product="activin type II receptor" /db_xref="PID:g178050" /translation="MGAAAKLAFAVFLISCSSGAILGRSETQECLFFNANWEKDRTNQ TGVEPCYGDKDKRRHCFATWKNISGSIEIVKQGCWLDDINCYDRTDCVEKKDSPEVYF CCCEGNMCNEKFSYFPEMEVTQPTSNPVTPKPPYYNILLYSLVPLMLIAGIVICAFWV YRHHKMAYPPVLVPTQDPGPPPPSPLLGLKPLQLLEVKARGRFGCVWKAQLLNEYVAV KIFPIQDKQSWQNEYEVYSLPGMKHENILQFIGAEKRGTSVDVDLWLITAFHEKGSLS DFLKANVVSWNELCHIAETMARGLAYLHEDIPGLKDGHKPAISHRDIKSKNVLLKNNL TACIADFGLALKFEAGKSAGDTHGQVGTRRYMAPEVLEGAINFQRDAFLRIDMYAMGL VLWELASRCTAADGPVDEYMLPFEEEIGQHPSLEDMQEVVVHKKKRPVLRDYWQKHAG MAMLCETIEECWDHDAEARLSAGCVGERITQMQRLTNIITTEDIVTVVTMVTNVDFPP KESSL" mat_peptide 231..1712 /product="activin type II receptor" BASE COUNT 698 a 464 c 551 g 669 t ORIGIN 1 ggggccgccc cttccccgcg ccgcagccgc ctcgccgcca ccgccgcgag ctcggccgcc 61 agtggtcctc ggactttagg tgtctgggtt gaaggaggtt tgtctccgag gaagacccag 121 ggaactggat atctagcgag aacttcctcc ggattccccg gcgcctcggg aaaatgggag 181 ctgctgcaaa gttggcgttt gccgtctttc ttatctcctg ttcttcaggt gctatacttg 241 gtagatcaga aactcaggag tgtcttttct ttaatgctaa ttgggaaaaa gacagaacca 301 atcaaactgg tgttgaaccg tgttatggtg acaaagataa acggcggcat tgttttgcta 361 cctggaagaa tatttctggt tccattgaaa tagtgaaaca aggttgttgg ctggatgata 421 tcaactgcta tgacaggact gattgtgtag aaaaaaaaga cagccctgaa gtatattttt 481 gttgctgtga gggcaatatg tgtaatgaaa agttttctta ttttccggag atggaagtca 541 cacagcccac ttcaaatcca gttacaccta agccacccta ttacaacatc ctgctctatt 601 ccttggtgcc acttatgtta attgcgggga ttgtcatttg tgcattttgg gtgtacaggc 661 atcacaagat ggcctaccct cctgtacttg ttccaactca agacccagga ccacccccac 721 cttctccatt actaggtttg aaaccactgc agttattaga agtgaaagca aggggaagat 781 ttggttgtgt ctggaaagcc cagttgctta acgaatatgt ggctgtcaaa atatttccaa 841 tacaggacaa acagtcatgg caaaatgaat acgaagtcta cagtttgcct ggaatgaagc 901 atgagaacat attacagttc attggtgcag aaaaacgagg caccagtgtt gatgtggatc 961 tttggctgat cacagcattt catgaaaagg gttcactatc agactttctt aaggctaatg 1021 tggtctcttg gaatgaactg tgtcatattg cagaaaccat ggctagagga ttggcatatt 1081 tacatgagga tatacctggc ctaaaagatg gccacaaacc tgccatatct cacagggaca 1141 tcaaaagtaa aaatgtgctg ttgaaaaaca acctgacagc ttgcattgct gactttgggt 1201 tggccttaaa atttgaggct ggcaagtctg caggcgatac ccatggacag gttggtaccc 1261 ggaggtacat ggctccagag gtattagagg gtgctataaa cttccaaagg gatgcatttt 1321 tgaggataga tatgtatgcc atgggattag tcctatggga actggcttct cgctgtactg 1381 ctgcagatgg acctgtagat gaatacatgt tgccatttga ggaggaaatt ggccagcatc 1441 catctcttga agacatgcag gaagttgttg tgcataaaaa aaagaggcct gttttaagag 1501 attattggca gaaacatgct ggaatggcaa tgctctgtga aaccattgaa gaatgttggg 1561 atcacgacgc agaagccagg ttatcagctg gatgtgtagg tgaaagaatt acccagatgc 1621 agagactaac aaatattatt accacagagg acattgtaac agtggtcaca atggtgacaa 1681 atgttgactt tcctcccaaa gaatctagtc tatgatggtt gcgccatctg tgcacactaa 1741 gaaatgggac tctgaactgg agctgctaag ctaaagaaac tgcttacagt ttattttctg 1801 tgtaaaatga gtaggatgtc tcttggaaat gttaagaaag aagacccttt gttgaaaaat 1861 gttgctctgg gagacttact gcattgccga cagcacagat gtgaaggaca tgagactaag 1921 agaaaccttg caaactctat aaagaaactt ttgaaaaagt gtacatgaag aatgtagccc 1981 tctccaaatc aaggatcttt tggacctggc taatggagtg tttgaaaact gacatcagat 2041 ttcttaatgt ctgtcagaag acactaattc cttaaatgaa ctactgctat tttttttaaa 2101 tcaaaaactt ttcatttcag attttaaaaa gggtaacttg tttttattgc atttgctgtt 2161 gtttctataa atgactattg taatgccaat atgacacagc ttgtgaatgt ttagtgtgct 2221 gctgttctgt gtacataaag tcatcaaagt ggggtacagt aaagaggctt ccaagcatta 2281 ctttaacctc cctcaacaag gtatacctca gttccacggt tgctaaatta taaaattgaa 2341 aacactaaca aaatttgaat aataaatcga tccatgtttc cc // LOCUS HUMACTINBI 2007 bp mRNA PRI 12-AUG-1993 DEFINITION Homo sapiens thyroid autoantigen (truncated actin-binding protein) mRNA, complete cds. ACCESSION M62994 NID g349450 KEYWORDS actin-binding protein; thyroid autoantigen. SOURCE Homo sapiens thyroid cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Leedman,P.J., Faulkner-Jones,B., Cram,D.C., Harrison,P.J., West,J., O'Brien,E.E., Simpson,R., Coppel,R.L. and Harrison,L.C. TITLE Cloning from the thyroid of a protein related to actin binding protein that is recognized by Graves disease immunoglobulins JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90, 5994-5998 (1993) MEDLINE 93317610 FEATURES Location/Qualifiers source 1..2007 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="thyroid" CDS 109..696 /note="truncated actin-binding protein; Membrane-associated domain 631-684, kinase consensus sequence 211-222" /codon_start=1 /product="thyroid autoantigen" /db_xref="PID:g349451" /translation="MYTPMAPGNYLISVKYGGPNHIVGSPFKAKVTGQRLVSPGSANE TSCILVESVTRSSTETCYSAIPKASSDASKVTSKGAGLSKAFVGQKSSFLVDCSKAGS NMLLIGVHGPTTPCEEVSMKHVGNQQYNVTYVVKERGDYVLAVKWGRNTSLAALFMSQ CLKTVFSNPGESSCGCLCCLFVIHFIQSPPACLWG" misc_feature 109..201 /note="beta pleated sheet; putative" protein_bind 202..304 /note="putative" /bound_moiety="glycoprotein" misc_feature 305..580 /note="beta pleated sheet; putative" BASE COUNT 503 a 506 c 521 g 477 t ORIGIN 1 gggctttatt aacaccaccc gagcaggtcc agggacatta tccgtcacca tcgaaggccc 61 atccaaggtt aaaatggatt gcaggaaaca cctgaagggt acaaagtcat gtacaccccc 121 atggctcctg gtaactacct gatcagcgtc aaatacggtg ggcccaacca catcgtgggc 181 agtcccttca aggccaaggt gacaggccag cgtctagtta gccctggctc agccaacgag 241 acctcatgca tcctggtgga gtcagtgacc aggtcgtcta cagagacctg ctatagcgcc 301 attcccaagg catcctcgga cgccagcaag gtgacctcta agggggcagg gctctcaaag 361 gcctttgtgg gccagaagag ttccttcctg gtggactgca gcaaagctgg ctccaacatg 421 ctgctgatcg gggtccatgg gcccaccacc ccctgcgagg aggtctccat gaagcatgta 481 ggcaaccagc aatacaacgt cacatacgtc gtcaaggaga ggggcgatta tgtgctggct 541 gtgaagtggg ggaggaacac atccctggca gcccttttca tgtcacagtg ccttaaaaca 601 gttttctcaa atcctggaga gagttcttgt ggttgccttt gttgcttgtt tgtaattcat 661 tttatacaaa gccctccagc ctgtttgtgg ggctgaaacc ccatccctaa aatattgctg 721 ttgtaaaatg ccttcagaaa taagtcctag actggactct tgagggacat attggagaat 781 cttaagaaat gcaagcttgt tcagggggct gagaagatcc tgagtacact aggtgcaaac 841 cagaactctt ggtggaacag accagccact gcagcagaca gaccaggaac acaatgagac 901 tgacatttca aaaaaacaaa actggctagc ctgagctgct ggttcactct tcagcattta 961 tgaaacaagg ctaggggaag atgggcagag aaaaagggga cacctagttt ggttgtcatt 1021 tggcaaagga gatgacttaa aaatccgctt aatctcttcc agtgtccgtg ttaatgtatt 1081 tggctattag atcactagca ctgctttacc gctcctcatc gccaacaccc ccatgctctg 1141 tggccttctt acacttctca gagggcagag tggcagccgg gcaccctaca gaaactcaga 1201 gggcagagtg gcagccaggc ccacatgtct ctcaagtacc tgtcccctcg ctctggtgat 1261 tatttcttgc agaatcacca cacgagacca tcccggcagt catggttttg ctttagtttt 1321 ccaagtccgt ttcagtccct tccttggtct gaagaaattc tgcagtggcg agcagtttcc 1381 cacttgccaa agatcccttt taaccaacac tagcccttgt ttttaacaca cgctccagcc 1441 cttcatcagc ctgggcagtc ttaccaaaat gtttaaagtg atctcagagg ggcccatgga 1501 ttaacgccct catcccaagg tccgtcccat gacataacac tccacacccg ccccagccaa 1561 cttcatgggt cactttttct ggaaaataat gatctgtaca gacaggacag aatgaaactc 1621 tgcgggtctt tggcctgaaa gttgggaatg gttgggggag agaagggcag cagcttattg 1681 gtggtctttt caccattggc agaaacagtg agagctgtgt ggtgcagaaa tccagaaatg 1741 aggtgtaggg aattttgcct gccttcctgc agacctgagc tggctttgga atgaggttaa 1801 agtgtcaggg acgttgcctg agcccaaatg tgtagtgtgg tctgggcagg cagaccttta 1861 ggttttgctg cttagtcctg aggaagtggc cactcttgtg gcaggtgtag tatctggggc 1921 gagtgttggg ggtaaaagcc caccctacag aaagtggaac agcccggagc ctgatgtgaa 1981 aggaccacgg gtgttgtaag ctggccc // LOCUS HUMACTN1A 3081 bp mRNA PRI 30-OCT-1994 DEFINITION Human non-muscle alpha-actinin mRNA, complete cds. ACCESSION M95178 M31300 NID g178051 KEYWORDS alpha-actinin. SOURCE Homo sapiens (individual_isolate p5aA) umbilical vein cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3081) AUTHORS Youssoufian,H., McAfee,M. and Kwiatkowski,D.J. TITLE Cloning and chromosomal localization of the human cytoskeletal alpha-actinin gene reveals linkage to the beta-spectrin gene JOURNAL Am. J. Hum. Genet. 47 (1), 62-71 (1990) MEDLINE 90274024 FEATURES Location/Qualifiers source 1..3081 /organism="Homo sapiens" /isolate="p5aA" /db_xref="taxon:9606" /cell_type="endothelial cell" /tissue_type="umbilical vein" /map="14q24.1-q24.2" gene 112..2790 /gene="ACTN1" CDS 112..2790 /gene="ACTN1" /codon_start=1 /db_xref="GDB:G00-125-187" /product="alpha-actinin" /db_xref="PID:g178052" /translation="MDHYDSQQTNDYMQPEEDWDRDLLLDPAWEKQQRKTFTAWCNSH LRKAGTQIENIEEDFRDGLKLMLLLEVISGERLAKPERGKMRVHKISNVNKALDFIAS KGVKLVSIGAEEIVDGNVKMTLGMIWTIILRFAIQDISVEETSAKEGLLLWCQRKTAP YKNVNIQNFHISWKDGLGFCALIHRHRPELIDYGKLRKDDPLTNLNTAFDVAEKYLDI PKMLDAEDIVGTARPDEKAIMTYVSSFYHAFSGAQKAETAANRICKVLAVNQENEQLM EDYEKLASDLLEWIRRTIPWLENRVPENTMHAMQQKLEDFRDYRRLHKPPKVQEKCQL EINFNTLQTKLRLSNRPAFMPSEGRMVSDINNAWGCLEQVEKGYEEWLLNEIRRLERL DHLAEKFRQKASIHEAWTDGKEAMLRQKDYETATLSEIKALLKKHEAFESDLAAHQDR VEQIAAIAQELNELDYYDSPSVNARCQKICDQWDNLGALTQKRREALERTEKLLETID QLYLEYAKRAAPFNNWMEGAMEDLQDTFIVHTIEEIQGLTTAHEQFKATLPDADKERL AILGIHNEVSKIVQTYHVNMAGTNPYTTITPQEINGKWDHVRQLVPRRDQALTEEHAR QQHNERLRKQFGAQANVIGPWIQTKMEEIGRISIEMHGTLEDQLSHLRQYEKSIVNYK PKIDQLEGDHQLIQEALIFDNKHTNYTMEHIRVGWEQLLTTIARTINEVENQILTRDA KGISQEQMNEFRASFNHFDRDHSGTLGPEEFKACLISLGYDIGNDPQGEAEFARIMSI VDPNRLGVVTFQAFIDFMSRETADTDTADQVMASFKILAGDKNYITMDELRRELPPDQ AEYCIARMAPYTGPDSVPGALDYMSFSTALYGESDL" polyA_site 3081 /gene="ACTN1" /note="G00-125-187" BASE COUNT 789 a 880 c 857 g 555 t ORIGIN 1 cccgccagcc cagcccagcc caaccctact ccctccccac gccagggcag cagccgttgc 61 tcagagagaa ggtggaggaa gaaatccaga ccctagcacg cgcgcaccat catggaccat 121 tatgattctc agcaaaccaa cgattacatg cagccagaag aggactggga ccgggacctg 181 ctcctggacc cggcctggga gaagcagcag agaaagacat tcacggcatg gtgtaactcc 241 cacctccgga aggcggggac acagatcgag aacatcgaag aggacttccg ggatggcctg 301 aagctcatgc tgctgctgga ggtcatctca ggtgaacgct tggccaagcc agagcgaggc 361 aagatgagag tgcacaagat ctccaacgtc aacaaggccc tggatttcat agccagcaaa 421 ggcgtcaaac tggtgtccat cggagccgaa gaaatcgtgg atgggaatgt gaagatgacc 481 ctgggcatga tctggaccat catcctgcgc tttgccatcc aggacatctc cgtggaagag 541 acttcagcca aggaagggct gctcctgtgg tgtcagagaa agacagcccc ttacaaaaat 601 gtcaacatcc agaacttcca cataagctgg aaggatggcc tcggcttctg tgctttgatc 661 caccgacacc ggcccgagct gattgactac gggaagctgc ggaaggatga tccactcaca 721 aatctgaata cggcttttga cgtggcagag aagtacctgg acatccccaa gatgctggat 781 gccgaagaca tcgttggaac tgcccgaccg gatgagaaag ccatcatgac ttacgtgtct 841 agcttctacc acgccttctc tggagcccag aaggcggaga cagcagccaa tcgcatctgc 901 aaggtgttgg ccgtcaacca ggagaacgag cagcttatgg aagactacga gaagctggcc 961 agtgatctgt tggagtggat ccgccgcaca atcccgtggc tggagaaccg ggtgcccgag 1021 aacaccatgc atgccatgca acagaagctg gaggacttcc gggactaccg gcgcctgcac 1081 aagccgccca aggtgcagga gaagtgccag ctggagatca acttcaacac gctgcagacc 1141 aagctgcggc tcagcaaccg gcctgccttc atgccctctg agggcaggat ggtctcggac 1201 atcaacaatg cctggggctg cctggagcag gtggagaagg gctatgagga gtggttgctg 1261 aatgagatcc ggaggctgga gcgactggac cacctggcag agaagttccg gcagaaggcc 1321 tccatccacg aggcctggac tgacggcaaa gaggccatgc tgcgacagaa ggactatgag 1381 accgccaccc tctcggagat caaggccttg ctcaagaagc atgaggcctt cgagagtgac 1441 ctggctgccc accaggaccg tgtggagcag attgccgcca tcgcacagga gctcaatgag 1501 ctggactatt atgactcacc cagtgtcaac gcccgttgcc aaaagatctg tgaccagtgg 1561 gacaatctgg gggccctaac tcagaagcga agggaagctc tggagcggac cgagaaactg 1621 ctggagacca ttgaccagct gtacttggag tatgccaagc gggctgcacc cttcaacaac 1681 tggatggagg gggccatgga ggacctgcag gacaccttca ttgtgcacac cattgaggag 1741 atccagggac tgaccacagc ccatgagcag ttcaaggcca ccctccctga tgccgacaag 1801 gagcgcctgg ccatcctggg catccacaat gaggtgtcca agattgtcca gacctaccac 1861 gtcaatatgg cgggcaccaa cccctacaca accatcacgc ctcaggagat caatggcaaa 1921 tgggaccacg tgcggcagct ggtgcctcgg agggaccaag ctctgacgga ggagcatgcc 1981 cgacagcagc acaatgagag gctacgcaag cagtttggag cccaggccaa tgtcatcggg 2041 ccctggatcc agaccaagat ggaggagatc gggaggatct ccattgagat gcatgggacc 2101 ctggaggacc agctcagcca cctgcggcag tatgagaaga gcatcgtcaa ctacaagcca 2161 aagattgatc agctggaggg cgaccaccag ctcatccagg aggcgctcat cttcgacaac 2221 aagcacacca actacaccat ggagcacatc cgtgtgggct gggagcagct gctcaccacc 2281 atcgccagga ccatcaatga ggtagagaac cagatcctga cccgggatgc caagggcatc 2341 agccaggagc agatgaatga gttccgggcc tccttcaacc actttgaccg ggatcactcc 2401 ggcacactgg gtcccgagga gttcaaagcc tgcctcatca gcttgggtta tgatattggc 2461 aacgaccccc agggagaagc agaatttgcc cgcatcatga gcattgtgga ccccaaccgc 2521 ctgggggtag tgacattcca ggccttcatt gacttcatgt cccgcgagac agccgacaca 2581 gatacagcag accaagtcat ggcttccttc aagatcctgg ctggggacaa gaactacatt 2641 accatggacg agctgcgccg cgagctgcca cccgaccagg ctgagtactg catcgcgcgg 2701 atggccccct acaccggccc cgactccgtg ccaggtgctc tggactacat gtccttctcc 2761 acggcgctgt acggcgagag tgacctctaa tccaccccgc ccggccgccc tcgtcttgtg 2821 cgccgtgccc acagatgtga aatgaatgta atctaataga agcctaatca gcccaccatg 2881 ttctccactg aaaaatcctc tttctttggg gtttttcttt ctttcttttt tgattttgca 2941 ctggacggtg acgtcagcct gtacaggctc ccaggggtgg cgtcaaatgc tattgaaatt 3001 gcgctgaatc gtatgctttt tccttttgat aaataaacaa tgtaaaaatg tttcaaaaac 3061 ctaataaaat aaataaatac g // LOCUS HUMACTN2A 4181 bp mRNA PRI 30-OCT-1994 DEFINITION Homo sapiens skeletal muscle alpha 2 actinin (ACTN20 mRNA, complete cds. ACCESSION M86406 NID g178053 KEYWORDS skeletal muscle alpha-actinin. SOURCE Homo sapiens (tissue library: lambda-gt10 of M.Koenig) fetus skeletal muscle cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4181) AUTHORS Beggs,A.H., Byers,T.J., Knoll,J.H., Boyce,F.M., Bruns,G.A. and Kunkel,L.M. TITLE Cloning and characterization of two human skeletal muscle alpha-actinin genes located on chromosomes 1 and 11 JOURNAL J. Biol. Chem. 267 (13), 9281-9288 (1992) MEDLINE 92250531 FEATURES Location/Qualifiers source 1..4181 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="skeletal muscle" /tissue_lib="lambda-gt10 of M.Koenig" /map="Unassigned" gene 1..4181 /gene="ACTN2" 5'UTR 1..173 /gene="ACTN2" /note="G00-127-919" CDS 174..2858 /gene="ACTN2" /codon_start=1 /function="'skeletal and cardiac muscle specific'" /db_xref="GDB:G00-127-919" /product="alpha-actinin" /db_xref="PID:g178054" /translation="MNQIEPGVQYNYVYDEDEYMIQEEEWDRDLLLDPAWEKQQRKTF TAWCNSHLRKAGTQIENIEEDFRNGLKLMLLLEVISGERLPKPDRGKMRFHKIANVNK ALDYIASKGVKLVSIGAEEIVDGNVKMTLGMIWTIILRFAIQDISVEETSAKEGLLLW CQRKTAPYRNVNIQNFHTSWKDGLGLCALIHRHRPDLIDYSKLNKDDPIGNINLAMEI AEKHLDIPKMLDAEDIVNTPKPDERAIMTYVSCFYHAFAGAEQAETAANRICKVLAVN QENERLMEEYERLASELLEWIRRTIPWLENRTPEKTMQAMQKKLEDFRDYRRKHKPPK VQEKCQLEINFNTLQTKLRISNRPAFMPSEGKMVSDIAGAWQRLEQAEKGYEEWLLNE IRRLERLEHLAEKFRQKASTHETWAYGKEQILLQKDYESASLTEVRALLRKHEAFESD LAAHQDRVEQIAAIAQELNELDYHDAVNVNDRCQKICDQWDRLGTLTQKRREALERME KLLETIDQLHLEFAKRAAPFNNWMEGAMEDLQDMFIVHSIEEIQSLITAHEQFKATLP EADGERQSIMAIQNEVEKVIQSYNIRISSSNPYSTVTMDELRTKWDKVKQLVPIRDQS LQEELARQHANERLRRQFAAQANAIGPWIQNKMEEIARSSIQITGALEDQMNQLKQYE HNIINYKNNIDKLEGDHQLIQEALVFDNKHTNYTMEHIRVGWELLLTTIARTINEVET QILTRDAKGITQEQMNEFRASFNHFDRRKNGLMDHEDFRACLISMGYDLGEAEFARIM TLVDPNGQGTVTFQSFIDFMTRETADTDTAEQVIASFRILASDKPYILAEELRRELPP DQAQYCIKRMPAYSGPGSVPGALDYAAFSSALYGESDL" 3'UTR 2859..>4181 /gene="ACTN2" /note="G00-127-919" polyA_signal 3249..3254 /gene="ACTN2" /note="G00-127-919" polyA_site 3269 /gene="ACTN2" /note="G00-127-919" polyA_signal 3415..3420 /gene="ACTN2" /note="G00-127-919" polyA_site 3431 /gene="ACTN2" /note="G00-127-919" BASE COUNT 1139 a 977 c 1102 g 963 t ORIGIN Chromosome 1q42-q43. 1 ggaactccgc ttcgcccgag acccagcgcc caggcgtgtc gcccgagagg agccgcgcga 61 aggtcacccc gcgcccgccg cccgccgccc gccgcctccg tgggtccgtt tgccagtcag 121 cccgtgcgtc cgagcccctc gcgccccgcc gcagccccgg ccaaccgagc gccatgaacc 181 agatagagcc cggcgtgcag tacaactacg tgtacgacga ggatgagtac atgatccagg 241 aggaggagtg ggaccgcgac ctgctcctgg acccagcctg ggagaagcag cagaggaaga 301 ccttcactgc ctggtgtaac tcccacctaa ggaaagccgg cacccagatt gagaacatcg 361 aggaagactt caggaatggc cttaagctca tgctgctttt ggaagtcatc tcaggggaaa 421 ggctgcccaa acctgaccgg ggaaaaatgc ggttccacaa aattgctaat gtcaacaaag 481 ctttggatta catagccagc aaaggggtga aactggtgtc catcggcgct gaagaaattg 541 ttgatggcaa tgtgaaaatg accctgggta tgatctggac catcatcctt cgctttgcta 601 ttcaggatat ttcggttgaa gaaacatctg ccaaagaagg tctgctgctt tggtgtcaga 661 ggaaaactgc tccttataga aatgtgaaca ttcagaactt ccatactagc tggaaagatg 721 gccttggact ctgtgccctc atccaccgac accggcctga cctcattgac tactcaaagc 781 ttaacaagga tgaccccata ggaaatatta acctggccat ggaaatcgct gagaagcacc 841 tggatattcc taaaatgttg gatgctgaag acatcgtgaa cacccctaaa cccgatgaaa 901 gagccatcat gacgtacgtc tcttgcttct accacgcttt tgcgggcgcg gagcaggccg 961 agacagcggc taacaggata tgtaaggttc ttgctgtgaa tcaagagaat gagaggctga 1021 tggaagaata tgagaggcta gcgagtgagc ttttggaatg gattcgtcgc acgatcccct 1081 ggctggagaa ccggactccc gagaagacca tgcaagccat gcagaagaag ctggaggact 1141 tccgggatta ccgccggaag cacaagccac ccaaggtgca ggagaaatgc cagctggaga 1201 tcaacttcaa cacgctgcag accaagctgc ggatcagcaa ccgtcctgcc ttcatgccct 1261 ccgagggcaa gatggtgtcg gatattgctg gtgcctggca gaggctggag caggctgaga 1321 agggttacga ggagtggttg ctcaatgaga ttcggagact ggagcgcttg gaacacctgg 1381 ctgagaagtt caggcagaag gcctcaacgc acgagacttg ggcttatggc aaagagcaga 1441 tcttgctgca gaaggattac gagtcggcgt cgctgacaga ggtgcgggct ctgctgcgga 1501 agcacgaggc gttcgagagc gacctggcag cgcaccagga ccgcgtggag cagatcgcag 1561 ccatcgcgca ggagctcaat gaactggact atcacgacgc tgtgaatgtc aatgatcggt 1621 gccagaaaat ttgtgaccag tgggaccgac tgggaacgct tactcagaag aggagagaag 1681 ccctagagag aatggagaaa ttgctagaaa ccattgatca gcttcacctg gagtttgcca 1741 agagggctgc tcctttcaac aattggatgg agggcgctat ggaggatctg caagatatgt 1801 tcattgtcca cagcattgag gagatccaga gtctgatcac tgcgcatgag cagttcaagg 1861 ccacgctgcc cgaggcggac ggagagcggc agtccatcat ggccatccag aacgaggtgg 1921 agaaggtgat tcagagctac aacatcagaa tcagctcaag caacccgtac agcactgtca 1981 ccatggatga gctccggacc aagtgggaca aggtgaagca actcgtgccc atccgcgatc 2041 aatccctgca ggaggagctg gctcgccagc atgctaacga gcgtctgagg cgccagtttg 2101 ctgcccaagc caatgccatt gggccctgga tccagaacaa gatggaggag attgcccgga 2161 gctccatcca gatcacagga gccctggaag accagatgaa ccagctgaag cagtatgagc 2221 acaacatcat caactataag aacaacatcg acaagctgga gggagaccat cagctcatcc 2281 aggaggccct tgtctttgac aacaagcaca cgaactacac gatggagcac attcgtgttg 2341 gatgggagct gctgctgaca accatcgcca gaaccatcaa tgaggtggag actcagatcc 2401 tgacgagaga tgcgaagggc atcacccagg agcagatgaa tgagttcaga gcctccttca 2461 accactttga caggaggaag aatggcctga tggatcatga ggatttcaga gcctgcctga 2521 tttccatggg ttatgacctg ggtgaagccg aatttgcccg cattatgacc ctggtagatc 2581 ccaacgggca aggcaccgtc accttccaat ccttcatcga cttcatgact agagagacgg 2641 ctgacaccga cactgccgag caggtcatcg cctccttccg gatcctggct tctgataagc 2701 catacatcct ggcggaggag ctgcgtcggg agctgccccc ggatcaggcc cagtactgca 2761 tcaagaggat gcccgcctac tcgggcccag gcagtgtgcc tggtgcactg gattacgctg 2821 cgttctcttc cgcactctac ggggagagcg atctgtgatg ctgagcttct gtaatcactc 2881 atcccatcag aatgcaataa aagcggaagt cacagtttgt ttcctggaaa ctttgacaag 2941 ctttattaag ttgagagaga gagaggggga aaaaaaaaaa gcctttcgta gttcagtaat 3001 tgccagcaat ataacacggc taaaatgaag tttttacagt atatgacata gtgcgcttca 3061 taaataggtt tatttctgag tttttagcaa aatgtaatga aatatcaggt tgatttcttt 3121 gattaaacag aacaaattac ttgagtaata ggaaattagg aggatctagg gacagaagga 3181 aagtgaaaaa tgtgaaaata caaaataccc aagatttaag accgggggga aaaaaccaca 3241 aattggtaaa taaaggtttg ctatttgtaa aaaatttcat ttatctctaa tatgcttatg 3301 tgattggccc taggggagta tatttgggat tctaatgttt tattttcatg cttatccaaa 3361 gattactatt gtatcttcaa atgaacttaa tattgtgaga tggaactgcc ggggattaaa 3421 aagactaccc aaaagatttt tggcacttac aatttttaaa atagtttatg tcatctcttc 3481 attatttagg gctggatggt caactcagtc agtgattttt tgatgcttct cttatcctcc 3541 agaatagaga cctaaggaca cgtggaagtc agtttaattg ccagagagaa ggatgcaatc 3601 actaggtgaa atgaggtttt taggattatt tattgattcc aggttcccat gctttttgtt 3661 agagcttatt agtacaggtt ctcaagagat gaccacataa aagtgctctg tttataaata 3721 agcaggtttc tgtagtactg actggttcat cacaaggcaa gtcagaaacc agtatccttc 3781 tagctctcca gtcaggactt ccttatgcct ctagttttat gaccggttaa ggagaagcca 3841 gagttagagt aggagaggac taattctcag cagcagtgga ggtgagttct ttcttttgcg 3901 gaagctttac atatgttttg tgtagtagga ataactagat attttagcta gtgtgcggtg 3961 tgtgttcacc cctgggattg gacagtgtat cctaacaagt cccatgtctg gttctgtgtc 4021 taaaggcctg ctccatgaca caggatgcta catgcactcc tgctagcaca tcttgatctg 4081 ttgaatgttc attctttctt tttgctcata ctgctgtagg ctataattcc cccctgtttt 4141 tccatcttgt tgacagcttg tagagaataa agcaggaatt c // LOCUS HUMACTN3A 2858 bp mRNA PRI 30-OCT-1994 DEFINITION Homo sapiens alpha actinin 3 (ACTN3) mRNA, complete cds. ACCESSION M86407 NID g178057 KEYWORDS ACTN3 gene; skeletal muscle alpha-actinin. SOURCE Homo sapiens (tissue library: lambda-gt10 of M.Koenig) fetus skeletal muscle cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2858) AUTHORS Beggs,A.H., Byers,T.J., Knoll,J.H., Boyce,F.M., Bruns,G.A. and Kunkel,L.M. TITLE Cloning and characterization of two human skeletal muscle alpha-actinin genes located on chromosomes 1 and 11 JOURNAL J. Biol. Chem. 267 (13), 9281-9288 (1992) MEDLINE 92250531 FEATURES Location/Qualifiers source 1..2858 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="skeletal muscle" /tissue_lib="lambda-gt10 of M.Koenig" /map="Unassigned" gene 1..2858 /gene="ACTN3" 5'UTR 1..18 /gene="ACTN3" /note="G00-127-920" CDS 19..2724 /gene="ACTN3" /codon_start=1 /function="'skeletal muscle specific'" /db_xref="GDB:G00-127-920" /product="alpha-actinin" /db_xref="PID:g178058" /translation="MMMVMQPEGLGAGEGRFAGGGGGGEYMEQEEDWDRDLLLDPAWE KQQRKTFTAWCNSHLRKAGTQIENIEEDFRNGLKLMLLLEVISGERLPRPDKGKMRFH KIANVNKALDFIASKGVKLVSIGAEEIVDGNLKMTLGMIWTIILRFAIQDISVEETSA KEGLLLWCQRKTAPYRNVNVQNFHTSWKDGLALCALIHRHRPDLIDYAKLRKDDPIGN LNTAFEVAEKYLDIPKMLDAEDIVNTPKPDEKAIMTYVSCFYHAFAGAEQAETAANRI CKVLAVNQENEKLMEEYEKLASELLEWIRRTVPWLENRVGEPSMSAMQRKLEDFRDYR RLHKPPRIQEKCQLEINFNTLQTKLRLSHRPAFMPSEGKLVSDIANAWRGLEQVEKGY EDWLLSEIRRLQRLQHLAEKFRQKASLHEAWTRGKEEMLSQRDYDSALLQEVRALLRR HEAFESDLAAHQDRVEHIAALAQELNELDYHEAASVNSRCQAICDQWDNLGTLTQKRR DALERMEKLLETIDQLQLEFARRAAPFNNWLDGAVEDLQDVWLVHSVEETQSLLTAHD QFKATLPEADRERGAIMGIQGEIQKICQTYGLRPCSTNPYITLSPQDINTKWDMVRKL VPSRDQTLQEELARQQVNERLRRQFAAQANAIGPWIQAKVEEVGRLAAGLAGSLEEQM AGLRQQEQNIINYKTNIDRLEGDHQLLQESLVFDNKHTVYSMEHIRVGWEQLLTSIAR TINEVENQVLTRDAKGLSQEQLNEFRASFNHFDRKRNGMMEPDDFRACLISMGYDLGE VEFARIMTMVDPNAAGVVTFQAFIDFMTRETAETDTTEQVVASFKILAGDKNYITPEE LRRELPAKQAEYCIRRMVPYKGSGAPAGALDYVAFSSALYGESDL" 3'UTR 2725..2858 /gene="ACTN3" /note="G00-127-920" polyA_signal 2836..2841 /gene="ACTN3" /note="G00-127-920" polyA_site 2858 /gene="ACTN3" /note="G00-127-920" BASE COUNT 667 a 815 c 896 g 480 t ORIGIN Chromosome 11q13-q14. 1 gccaggagcc cgatcgagat gatgatggtt atgcagcccg agggtctggg ggccggggag 61 gggcgctttg cgggcggcgg cgggggcggc gagtacatgg aacaggagga ggactgggac 121 cgcgacctgc tgctggaccc ggcctgggag aagcagcagc ggaaaacctt cactgcctgg 181 tgcaactcac acctgcgcaa ggcaggcacc cagatcgaga acatcgagga agatttccgc 241 aatggcctca aactcatgct gctcctggag gtcatttcag gtgagaggct gcctaggcca 301 gataaaggca agatgcgctt ccacaaaatc gccaacgtta acaaggccct ggacttcatt 361 gccagcaagg gggttaaact ggtgtccatt ggtgctgaag agattgttga cgggaacctg 421 aagatgaccc tgggcatgat ctggaccatc atccttcgct tcgccatcca ggacatctct 481 gtggaagaaa cctcagccaa ggaaggcttg cttctgtggt gccagaggaa gacagcaccg 541 taccgcaacg tcaacgtgca gaacttccac accagctgga aggatggcct ggccctctgt 601 gccctcatcc accgacaccg ccctgacctc atcgactacg ccaaactgcg aaaggatgac 661 cccatcggaa acctgaacac tgcctttgag gtggcagaga aatacctgga catccccaag 721 atgttggatg cagaagacat tgtgaacacc ccaaagccgg atgagaaggc catcatgacc 781 tatgtgtcct gcttctacca tgcctttgcc ggggctgagc aggcagagac agctgccaac 841 aggatctgca aggtgctggc agtgaaccag gaaaacgaga agctgatgga ggagtatgag 901 aagcttgcca gtgagctgct ggagtggatc cgccgcactg tcccatggct ggagaaccgt 961 gtgggtgagc ccagcatgag tgccatgcag cgcaaactag aggactttcg ggactaccgg 1021 cgtctgcaca agccgccccg cattcaggaa aagtgccagc tggagatcaa cttcaacaca 1081 ctgcagacca agttgcggct cagccaccgg cctgccttca tgccctccga gggcaagctg 1141 gtctcggaca tcgccaacgc ctggcggggg ctggagcagg tggaaaaggg ctatgaggac 1201 tggctgctct cggagatccg gcgcctgcag cgactccagc acctggctga gaagttccgg 1261 cagaaggcct ccctgcacga agcctggacc cggggaaagg aggagatgct gagccagcgc 1321 gactacgatt cggctttgct acaggaggtg cgggcgttgc tgcggcgcca cgaggccttt 1381 gagagcgacc tggcggcgca ccaggaccgc gtggagcaca ttgccgcgct ggcccaggag 1441 ctcaatgagc tggactacca cgaggcagcc tcagtgaata gccgctgcca ggccatctgc 1501 gatcagtggg acaacctggg caccctgacc cagaagaggc gggatgcgct agagcggatg 1561 gagaagctcc tggagaccat tgaccagctg caactggagt ttgcccggcg ggccgcgccc 1621 ttcaacaact ggctggatgg tgccgtggag gacctgcagg acgtgtggct ggtacactct 1681 gtggaggaga cccagagcct gctgacagcg cacgatcagt tcaaggcaac actgcccgag 1741 gctgaccgag agcgaggtgc catcatgggc atccagggtg agatccagaa gatctgccag 1801 acgtatgggc tgcggccctg ctccaccaat ccctacatca ccctcagccc gcaggacatc 1861 aacaccaagt gggatatggt ccgaaagctg gtgcccagcc gtgaccagac actgcaggag 1921 gagctggcac ggcagcaggt aaacgagagg ctccggcgac agtttgcggc ccaggccaat 1981 gccattggac cctggatcca ggcgaaggtg gaggaagtgg ggcggctggc agcagggcta 2041 gctggctctc tggaggagca gatggctggg ctacggcagc aggagcagaa cattatcaac 2101 tacaagacta acattgaccg gctggagggt gaccaccagc tgctgcagga gagcctggtg 2161 ttcgacaata agcacaccgt ctacagcatg gagcacatcc gcgtgggctg ggagcagctg 2221 ctcacctcca ttgcccgcac catcaatgaa gtggagaacc aggtactgac ccgagacgcc 2281 aagggactga gccaggagca gctcaacgag ttccgagcat ccttcaacca ctttgacagg 2341 aagcggaatg ggatgatgga gcctgatgac ttccgagctt gcctcatctc catgggctat 2401 gacctggggg aagtggagtt tgctcgcatc atgaccatgg tggaccccaa cgcagctggg 2461 gtggtgacct tccaggcctt catagacttc atgacccgag agacagccga gactgacacg 2521 actgagcaag ttgtagcttc cttcaagatc ttggcaggag acaagaacta catcaccccc 2581 gaggagctgc ggcgcgagct ccctgccaag caggccgagt actgcatccg ccgtatggtg 2641 ccctacaagg gatccggggc cccggctgga gccctggact acgtggcctt ctccagtgcc 2701 ctctatgggg agagcgacct ttgaccccaa ccactgaggt tctctatgca agatggagag 2761 aggatgcacc ctgtggctga tcccatccgt ccctcggagc aagggcctaa gagaaaagcc 2821 agccaagtgc ttctgaataa agatccctct ctgggtca // LOCUS HUMACYLCOA 4011 bp mRNA PRI 24-NOV-1993 DEFINITION Human acyl coenzyme A:cholesterol acyltransferase mRNA, complete cds. ACCESSION L21934 NID g409203 KEYWORDS acyl-CoA; acyl-CoA acyltransferase; acyl-coenzyme A:cholesterol acyltransferase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4011) AUTHORS Chang,C.C., Huh,H.Y., Cadigan,K.M. and Chang,T.Y. TITLE Molecular cloning and functional expression of human acyl-coenzyme A:cholesterol acyltransferase cDNA in mutant Chinese hamster ovary cells JOURNAL J. Biol. Chem. 268 (28), 20747-20755 (1993) MEDLINE 94012607 FEATURES Location/Qualifiers source 1..4011 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="THP-1 (Phorbol ester activated)" /cell_type="macrophage" CDS 1397..3049 /standard_name="ACAT" /codon_start=1 /product="acyl-coenzyme A: cholesterol acyltransferase" /db_xref="PID:g409204" /translation="MVGEEKMSLRNRLSKSRENPEEDEDQRNPAKESLETPSNGRIDI KQLIAKKIKLTAEAEELKPFFMKEVGSHFDDFVTNLIEKSASLDNGGCALTTFSVLEG EKNNHRAKDLRAPPEQGKIFIARRSLLDELLEVDHIRTIYHMFIALLILFILSTLVVD YIDEGRLVLEFSLLSYAFGKFPTVVWTWWIMFLSTFSVPYFLFQHWRTGYSKSSHPLI RSLFHGFLFMIFQIGVLGFGPTYVVLAYTLPPASRFIIIFEQIRFVMKAHSFVRENVP RVLNSAKEKSSTVPIPTVNQYLYFLFAPTLIYRDSYPRNPTVRWGYVAMKFAQVFGCF FYVYYIFERLCAPLFRNIKQEPFSARVLVLCVFNSILPGVLILFLTFFAFLHCWLNAF AEMLRFGDRMFYKDWWNSTSYSNYYRTWNVVVHDWLYYYAYKDFLWFFSKRFKSAAML AVFAVSAVVHEYALAVCLSFFYPVLFVLFMFFGMAFNFIVNDSRKKPIWNVLMWTSLF LGNGVLLCFYSQEWYARRHCPLKNPTFLDYVRPRSWTCRYVF" polyA_signal 3384..3389 polyA_signal 3962..3967 BASE COUNT 1056 a 792 c 839 g 1324 t ORIGIN 1 gggtagagac ggggtttcac cgtgttagcc aggatggtct ggatctcctg acctcgtgat 61 ccacccacct cggcctccta aagtgctggg attacagaca tgagccaccg cgcccagccc 121 tattcatccc ttttcaaaag tcagacccta ggaagctgga gggaggtggg gcatggtttt 181 acagtgaatt tctgatttca ctcagggtga taaatcagac tcttggggaa gcgggtggtg 241 gctctggaca gcagcaggaa tggggatcca gttagcaaca aatccatgga cctatgacag 301 gctgaaagcc accccttctc catctttggg aggttgccaa tgtctgattt aacactatcc 361 aatgaatgat cattgaaagt aaaaaataac tatcaactag cagaaaatat aaatggtaag 421 cattagcaca tatttcacat gtttatattt ggctctcaga ttgacctata aaacaaagtc 481 tgggaaattc tatatgatcc tgaaaaaatg atacgctggt ctggatggta gaataagttg 541 gagaaatgtt taagccaaaa tgcagtctta ccaatgactt tttattttat tttattaatt 601 ttcaggattt ttggtataca ggtggttttt ggttacatgg aaaagttctt tactggtgat 661 ttctgagatt ttagttcacc ccttatcctg agcagtgtac actgttccca atatgtagcc 721 ttttatccct caccccctct aagttcaaga agactatggt cctgcagaaa gctttatatg 781 taattaacat atctttatct ttatctttat aggcagtaga ctcatctttt gaaacagatt 841 ccattaagag tgaatgtgta ccctccctct agcctttatt attactgttt ttgctattac 901 atgtgttagt gtatgtgaat ttaatgctta aaaatgtatc ccattggcta ctatggcaaa 961 aggttgactc ataagagttt agcacgggtt aagatctgaa agttttctcc cagcctctta 1021 tcactggcgc agacttcaca attcatggaa gccaccagtg agatgacatt gcctcaggca 1081 gttactattt ttatattcta taactcgagg agctcagggt ttcggaaatc attaaacttt 1141 ttttgtcctt ttaaagttgg agacagcaat tgtagacagc cttccagtgg gttatctttt 1201 tgtgtctcct tacctgtgga gaagcctatt agctgggata tgtagttaaa tagctatatt 1261 tatatatatc cagggcaccc cgaattcggg agagcttccc ggagtcgacc ttcctgctgg 1321 ctgctctgtg accgcttccc ggctctgccc tcttggccga agtgcccgct gccgggcgcg 1381 ggcctcagac aatacaatgg tgggtgaaga gaagatgtct ctaagaaacc ggctgtcaaa 1441 gtccagggaa aatcctgagg aagatgaaga ccagagaaac cctgcaaagg agtccctaga 1501 gacacctagt aatggtcgaa ttgacataaa acagttgata gcaaagaaga taaagttgac 1561 agcagaggca gaggaattga agccattttt tatgaaggaa gttggcagtc actttgatga 1621 ttttgtgacc aatctcattg aaaagtcagc atcattagat aatggtgggt gcgctctcac 1681 aaccttttct gttcttgaag gagagaaaaa caaccataga gcgaaggatt tgagagcacc 1741 tccagaacaa ggaaagattt ttattgcaag gcgctctctc ttagatgaac tgcttgaagt 1801 ggaccacatc agaacaatat atcacatgtt tattgccctc ctcattctct ttatcctcag 1861 cacacttgta gtagattaca ttgatgaagg aaggctggtg cttgagttca gcctcctgtc 1921 ttatgctttt ggcaaatttc ctaccgttgt ttggacctgg tggatcatgt tcctgtctac 1981 attttcagtt ccctattttc tgtttcaaca ttggcgcact ggctatagca agagttctca 2041 tccgctgatc cgttctctct tccatggctt tcttttcatg atcttccaga ttggagttct 2101 aggttttgga ccaacatatg ttgtgttagc atatacactg ccaccagctt cccggttcat 2161 cattatattc gagcagattc gttttgtaat gaaggcccac tcatttgtca gagagaacgt 2221 gcctcgggta ctaaattcag ctaaggagaa atcaagcact gttccaatac ctacagtcaa 2281 ccagtatttg tacttcttat ttgctcctac ccttatctac cgtgacagct atcccaggaa 2341 tcccactgta agatggggtt atgtcgctat gaagtttgca caggtctttg gttgcttttt 2401 ctatgtgtac tacatctttg aaaggctttg tgcccccttg tttcggaata tcaaacagga 2461 gcccttcagc gctcgtgttc tggtcctatg tgtatttaac tccatcttgc caggtgtgct 2521 gattctcttc cttacttttt ttgccttttt gcactgctgg ctcaatgcct ttgctgagat 2581 gttacgcttt ggtgacagga tgttctataa ggattggtgg aactccacgt catactccaa 2641 ctattataga acctggaatg tggtggtcca tgactggcta tattactatg cttacaagga 2701 ctttctctgg tttttctcca agagattcaa atctgctgcc atgttagctg tctttgctgt 2761 atctgctgta gtacacgaat atgccttggc tgtttgcttg agctttttct atcccgtgct 2821 gttcgtgctc ttcatgttct ttggaatggc tttcaacttc attgtcaatg atagtcggaa 2881 aaagccgatt tggaatgttc tgatgtggac ttctcttttc ttgggcaatg gagtcttact 2941 ctgcttttat tctcaagaat ggtatgcacg tcggcactgt cctctgaaaa atcccacatt 3001 tttggattat gtccggccac gttcctggac ttgtcgttac gtgttttaga agcttggact 3061 ttgtttcctc cttgtcactg aagattgggt agctccctga tttggagcca gctgtttcca 3121 gttgttactg aagttatctg tgttatttgg accactccag gctttacaga tgactcactc 3181 cattcctagg tcacttgaag ccaaactgtt ggaagttcac tggagtcttg tacacttaag 3241 cagagcagaa ctttttttgt ggggctgggt ggggggagaa gaccgactaa cagctgaagt 3301 aatgacagat tgttgctggg tcatatcagc tttatccctt ggtaattata tctgttttgt 3361 ttcttgactc tgtccaatca gagaataaac atcatagttt cttggccact gaattagcca 3421 aaacacttag gaagaaatca cttaaatacc tctggcttag aaattttttc atgcacactg 3481 ttggaatgta tgctaattga acatgcaatt ggggaagaaa aaatgtagaa tgatttttgc 3541 tatttctagt agaaagaaaa tgtctgtttt ccaaagataa tgttatacat cctattttgt 3601 aatttttttg aaaaaagttc aatgttcagt tttccttagt ttttaccttg ttttctctat 3661 aggtcatgat ttctgtgaag caaaaagatg ccttttacca tgaattcttg agtttacatc 3721 aataatattg tatattaagg ggatcagaag taggaaggaa aaaataagag atagcagagg 3781 aaaaagaaaa acatttcctc ttataacttc tgaagtaatt tgtaaaaaag atttgtagag 3841 tcaatcatgt gtttaaatta ttttatcaca aacttaacat ggaagatatt cctttttaac 3901 tttgtggtaa cttctttgaa gttatttaga aatatccttt ggaacaatta ttttattgtc 3961 taataaatat tgacttctct tgaattattt tgcagactag tgagtctgta c // LOCUS HUMACYLHYD 2254 bp mRNA PRI 24-SEP-1991 DEFINITION Human acyloxyacyl hydrolase mRNA, complete cds. ACCESSION M62840 NID g178068 KEYWORDS acyloxyacyl hydrolase. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2254) AUTHORS Hagen,F.S., Grant,F.J., Kuijper,J.L., Slaugher,C.A., Moomaw,C.R., Orth,K., O'Hara,P.J. and Munford,R.S. TITLE Expression and characterization of recombinant human acyloxyacyl hydrolase, a leukocyte enzyme that deacylates bacterial lipopolysaccharides JOURNAL Biochemistry 30, 8415-8423 (1991) MEDLINE 91355197 FEATURES Location/Qualifiers source 1..2254 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA 1..2254 CDS 275..2002 /codon_start=1 /product="acyloxyacyl hydrolase" /db_xref="PID:g178069" /translation="MQSPWKILTVAPLFLLLSLQSSASPANDDQSRPSLSNGHTCVGC VLVVSVIEQLAQVHNSTVQASMERLCSYLPEKLFLKTTCYLVIDKFGSDIIKLLSADM NADVVCHTLEFCKQNTGQPLCHLYPLPKETWKFTLQKARQIVKKSPILKYSRSGSDIC SLPVLAKICQKIKLAMEQSVPFKDVDSDKYSVFPTLRGYHWRGRDCNDSDESVYPGRR PNNWDVHQDSNCNGIWGVDPKDGVPYEKKFCEGSQPRGIILLGDSAGAHFHISPEWIT ASQMSLNSFINLPTALTNELDWPQLSGATGFLDSTVGIKEKSIYLRLWKRNHCNHRDY QNISRNGASSRNLKKFIESLSRNKVLDYPAIVIYAMIGNDVCSGKSDPVPAMTTPEKL YSNVMQTLKHLNSHLPNGSHVILYGLPDGTFLWDNLHNRYHPLGQLNKDMTYAQLYSF LNCLQVSPCHGWMSSNKTLRTLTSERAEQLSNTLKKIAASEKFTNFNLFYMDFAFHEI IQEWQKRGGQPWQLIEPVDGFHPNEVALLLLADHFWKKVQLQWPQILGKENPFNPQIK QVFGDQGGH" BASE COUNT 601 a 553 c 538 g 562 t ORIGIN 1 aaagaaccgc acaccacaga ctccctccag ctctttgtgt gtggctctct cagggtccaa 61 caagagcaag ctgtgggtct gtgagtgttt atgtgtgctt ttattcactt cacacttatt 121 gaaaagtgtg tatgtgagag ggtggggtgt gtgtgtcaaa gagagtgagg aagagaagga 181 gagagagatc aattgattct gcagcctcag ctccagcatc cctcagttgg gagcttccaa 241 agccgggtga tcacttgggg tgcatagctc ggagatgcag tccccctgga aaatccttac 301 ggtggcgcct ctattcttgc tcctgtctct tcagtcctcg gcctctccag ccaacgatga 361 ccagtccagg cccagcctct cgaatgggca cacctgtgta gggtgtgtgc tggtggtgtc 421 tgtaatagaa cagcttgctc aagttcacaa ctcgacggtc caggcctcga tggagagact 481 gtgcagctac ctgcctgaaa aactgttctt gaaaaccacc tgctatttag tcattgacaa 541 gtttggatca gacatcataa aactgcttag cgcagatatg aatgctgatg tggtatgtca 601 cactctggag ttttgtaaac agaacactgg ccaaccattg tgtcatctct accctcttcc 661 caaggagaca tggaaattta cactacagaa ggcaagacaa attgtcaaga agtccccgat 721 tctgaaatat tctagaagtg gttctgacat ttgttcactc ccggttttgg ccaagatctg 781 ccagaaaatt aaattagcta tggaacagtc tgtgccattc aaagatgtgg attcagacaa 841 atacagcgtt ttcccaacac tgcggggcta tcactggcgg gggagagact gtaatgacag 901 cgacgagtca gtgtacccag gtagaaggcc gaacaactgg gatgtccatc aggattcaaa 961 ctgtaatggc atttggggtg tcgatccaaa agatggagtt ccatatgaga agaaattctg 1021 tgaaggttca cagcccaggg gaatcatttt gctgggagac tcagctgggg ctcattttca 1081 catctctcct gaatggatca cagcgtcgca gatgtctttg aactctttca tcaatctacc 1141 aacagccctt accaacgagc ttgactggcc ccaactctct ggtgctacag gatttctgga 1201 ctccactgtt ggaattaaag aaaaatctat ttaccttcgc ttatggaaaa gaaaccactg 1261 taatcacagg gactaccaga atatttcaag aaatggtgca tcttcccgaa acctgaagaa 1321 atttatagaa agcttgtcta gaaacaaggt gttggactat cccgccatcg ttatatatgc 1381 catgattgga aatgatgtct gcagtgggaa gagtgaccca gtcccagcca tgaccactcc 1441 tgagaaactc tactccaacg tcatgcagac tctgaagcat ctaaattccc acctgcccaa 1501 tggcagccat gttattttgt atggcttacc agatggaacc tttctctggg ataatttgca 1561 caacagatat catcctctcg gccagctaaa taaagacatg acctatgcgc agttgtactc 1621 cttcctgaac tgcctccagg tcagcccctg ccacggctgg atgtcttcca acaagacgtt 1681 gcggactctc acttcagaga gagcagagca actctccaac acactgaaaa aaattgcagc 1741 cagtgagaaa tttacaaact tcaatctttt ctacatggat tttgccttcc atgaaatcat 1801 acaggagtgg cagaagagag gcggacagcc ctggcagctc atcgagcccg tggatggatt 1861 ccaccccaac gaggtggctt tgctgttgtt ggcggatcat ttctggaaaa aggtgcagct 1921 ccagtggccc caaatcctgg gaaaggagaa tccgttcaac ccccagatta aacaggtgtt 1981 tggagaccaa ggcgggcact gagcctctca ggagcatgca cccctgggga gcacagggag 2041 gcagaggctt gggtaaactc attccacaaa ccctatgggg gctgccacgt cacaggccca 2101 aaggactctt cttcagcagc atctttgcaa aatgtctttc tctcaatgaa gagcatatct 2161 ggacgactgt gcaatgctgt gtgctcccgg ggatcagtaa cccttccgct gttcctgaaa 2221 taacctttca taaagtgctt tgggtgccat tcca // LOCUS HUMACYPRO 1415 bp mRNA PRI 24-SEP-1993 DEFINITION Human aminoacylase-1 (ACY1) mRNA, complete cds. ACCESSION L07548 NID g178070 KEYWORDS amidohydrolase; amino acid hydrolysis; aminoacylase-1; cross-hybridizing DNA. SOURCE Homo sapiens (library: Lambda gt10; Lamda gt11) kidney, liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Miller,Y.E., Drabkin,H.A., Jones,C. and Fisher,J.H. TITLE Human aminoacylase-1: cloning, regional assignment to distal chromosome 3p21.1, and identification of a cross-hybridizing sequence on chromosome 18 JOURNAL Genomics 8, 149-154 (1990) MEDLINE 91184800 REFERENCE 2 (bases 1 to 1415) AUTHORS Cook,R.M., Burke,B.J., Buchhagen,D.L., Minna,J.D. and Miller,Y.E. TITLE Human aminoacylase-1. Cloning, sequence, and expression analysis of a chromosome 3p21 gene inactivated in small cell lung cancer JOURNAL J. Biol. Chem. 268 (23), 17010-17017 (1993) MEDLINE 93352474 FEATURES Location/Qualifiers source 1..1415 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..1415 /gene="ACY1" 5'UTR 1..61 /gene="ACY1" CDS 62..1288 /gene="ACY1" /standard_name="n-acyl-l-aminoacid amidohydrolase" /EC_number="3.5.1.14" /note="putative" /citation=[2] /codon_start=1 /function="hydrolysis of n-acylated or n-acetylated aminoacids" /product="aminoacylase-1" /db_xref="PID:g178071" /translation="MTSKGPEEEHPSVTLFRQYLRIRTVQPKPDYGAAVAFFEETARQ LGLGCQKVEVAPGYVVTVLTWPGTNPTLSSILLNSHTDVVPVFKEHWSHDPFEAFKDS EGYIYARGAQDMKCVSIQYLEAVRRLKVEGHRFPRTIHMTFVPDEEVGGHQGMELFVQ RPEFHALRAGFALDEGIANPTDAFTVFYSERSPWWVRVTSTGRPGHASRFMEDTAAEK LHKVVNSILAFREKEWQRLQSNPHLKEGSVTSVNLTKLEGGVAYNVIPATMSASFDFR VAPDVDFKAFEEQLQSWCQAAGEGVTLEFAQKWMHPQVTPTDDSNPWWAAFSRVCKDM NLTLEPEIMPAATDNRYIRAVGVPALGFSPMNRTPVLLHDHDERLHEAVFLRGVDIYT RLLPALASVPALPSDS" 3'UTR 1289..1415 /gene="ACY1" BASE COUNT 297 a 416 c 416 g 286 t ORIGIN 1 gggcgctgag aggcgagcgt gagcccagcg acaggagagt gagctcacca cgcgcagcgc 61 catgaccagc aagggtcccg aggaggagca cccatcggtg acgctcttcc gccagtacct 121 gcgtatccgc actgtccagc ccaagcctga ctatggagct gctgtggctt tctttgagga 181 gacagcccgc cagctgggcc tgggctgtca gaaagtagag gtggcacctg gctatgtggt 241 gaccgtgttg acctggccag gcaccaaccc tacactctcc tccatcttgc tcaactccca 301 cacggatgtg gtgcctgtct tcaaggaaca ttggagtcac gacccctttg aggccttcaa 361 ggattctgag ggctacatct atgccagggg tgcccaggac atgaagtgcg tcagcatcca 421 gtacctggaa gctgtgagga ggctgaaggt ggagggccac cggttcccca gaaccatcca 481 catgaccttt gtgcctgatg aggaggttgg gggtcaccaa ggcatggagc tgttcgtgca 541 gcggcctgag ttccacgccc tgagggcagg ctttgccctg gatgagggca tagccaatcc 601 cactgatgcc ttcactgtct tttatagtga gcggagtccc tggtgggtgc gggttaccag 661 cactgggagg ccaggccatg cctcacgctt catggaggac acagcagcag agaagctgca 721 caaggttgta aactccatcc tggcattccg ggagaaggaa tggcagaggc tgcagtcaaa 781 cccccacctg aaagaggggt ccgtgacctc cgtgaacctg actaagctag agggtggcgt 841 ggcctataac gtgatacctg ccaccatgag cgccagcttt gacttccgtg tggcaccgga 901 tgtggacttc aaggcttttg aggagcagct gcagagctgg tgccaggcag ctggcgaggg 961 ggtcacccta gagtttgctc agaagtggat gcacccccaa gtgacaccta ctgatgactc 1021 aaacccttgg tgggcagctt ttagccgggt ctgcaaggat atgaacctca ctctggagcc 1081 tgagatcatg cctgctgcca ctgacaaccg ctatatccgc gcggtggggg tcccagctct 1141 aggcttctca cccatgaacc gcacacctgt gctgctgcac gaccacgatg aacggctgca 1201 tgaggctgtg ttcctccgtg gggtggacat atatacacgc ctgctgcctg cccttgccag 1261 tgtgcctgcc ctgcccagtg acagctgagc cctggaactc ctaaaccttt gcccctgggg 1321 cttccatccc aaccagtgcc aaggacctcc tcttccccct tccaaataat aaagtctatg 1381 gacagggctg tctctgaagt actaacacaa ggaca // LOCUS HUMADCY 2591 bp mRNA PRI 09-MAR-1993 DEFINITION Homo sapiens adenylyl cyclase-associated protein (CAP) mRNA, complete cds. ACCESSION L12168 NID g178083 KEYWORDS adenylyl cyclase-associated protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2591) AUTHORS Kawamukai,M., O'Neill,K., Rodgers,L., Riggs,M., Schaller,H., Chalfie,M., Field,J. and Wigler,M. TITLE Genes from metazoans encoding homologs of yeast adenylyl cyclase-associated proteins JOURNAL Unpublished (1993) FEATURES Location/Qualifiers source 1..2591 /organism="Homo sapiens" /db_xref="taxon:9606" gene 40..1467 /gene="CAP" CDS 40..1467 /gene="CAP" /note="putative" /codon_start=1 /product="adenylyl cyclase-associated protein" /db_xref="PID:g178084" /translation="MADMQNLVERLERAVGRLEAVSHTSDMHRGYADSPSKAGAAPYV QAFDSLLAGPVAEYLKISKEIGGDVQKHAEMVHTGLKLERALLVTASQCQQPAENKLS DLLAPISEQIKEVITFREKNRGSKLFNHLSAVSESIQALGWVAMAPKPGPYVKEMNDA AMFYTNRVLKEYKDVDKKHVDWVKAYLSIWTELQAYIKEFHTTGLAWSKTGPVAKELS GLPSGPSAGSGPPPPPPGPPPPPVSTSSGSDESASRSALFAQINQGESITHALKHVSD DMKTHKNPALKAQSGPVRSGPKPFSAPKPQTSPSPKRATKKEPAVLELEGKKWRVENQ ENVSNLVIEDTELKQVAYIYKCVNTTLQIKGKINSITVDNCKKLGLVFDDVVGIVEII NSKDVKVQVMGKVPTISINKTDGCHAYLSKNSLDCEIVSAKSSEMNVLIPTEGGDFNE FPVPEQFKTLWNGQKLVTTVTEIAG" BASE COUNT 714 a 611 c 610 g 656 t ORIGIN 1 gcggccgcgt gaggcggaac tctgagcagg tggtccatta tggctgacat gcaaaatctg 61 gtagaaagat tggagagggc agtgggccgc ctggaggcag tatctcatac ctctgacatg 121 caccgtgggt atgcagacag tccttcaaaa gcaggagcag ctccatatgt gcaggcattt 181 gactcgctgc ttgctggtcc tgtggcagag tacttgaaga tcagtaaaga gattggggga 241 gacgtgcaga aacatgcgga gatggtccac acaggtttga agttggagcg agctctgttg 301 gttacagctt ctcagtgtca acagccagca gaaaataagc tttccgattt gttggcaccc 361 atctcagagc agatcaaaga agtgataacc tttcgggaga agaaccgagg cagcaagttg 421 tttaatcacc tgtcagctgt cagcgaaagt atccaggccc tgggctgggt ggctatggct 481 cccaagcctg gcccttatgt gaaagaaatg aatgatgccg ccatgtttta tacaaaccga 541 gtcctcaaag agtacaaaga tgtggataag aagcatgtag actgggtcaa agcttattta 601 agtatatgga cagagctgca ggcttacatt aaggagttcc ataccaccgg actggcctgg 661 agcaaaacgg ggcctgtggc aaaagaactg agcggactgc catctggacc ctctgccgga 721 tcaggtcctc ctccccctcc accaggcccc cctcctcccc cagtctctac cagttcaggc 781 tcagatgagt ctgcttcccg ctcagcactg ttcgcgcaga ttaatcaggg ggagagcatt 841 acacatgccc tgaaacatgt atctgatgac atgaagactc acaagaaccc tgccctgaag 901 gctcagagtg gtccagtacg cagtggcccc aaaccattct ctgcacctaa accccaaacc 961 agcccatccc ccaaacgagc cacaaagaag gagccagctg tacttgaact ggagggcaag 1021 aagtggagag tggaaaatca ggaaaatgtt tccaacctgg tgattgagga cacagagctg 1081 aaacaggtgg cttacatata caagtgtgtc aacacgacat tgcaaatcaa gggcaaaatt 1141 aactccatta cagtagataa ctgtaagaaa cttggcctgg tattcgatga cgtggtgggc 1201 attgtggaga taatcaacag taaggatgtc aaagttcagg taatgggtaa agtgccaacc 1261 atatccatca acaaaacaga tggctgccat gcttacctga gcaagaattc cctggattgt 1321 gaaatagtca gtgccaaatc ttccgagatg aatgtcctca ttcctacaga aggcggtgac 1381 tttaatgaat tcccagttcc tgagcagttc aagaccctat ggaacgggca gaagttggtc 1441 accacagtga cagaaattgc tggataagcg aagtgccact gggttctttg ccctcccttc 1501 acaccatggg ataaatctgt atcaagacgg ttcttttcta gatttcctct acctttttgc 1561 tcttaaaact gcttctctgc tctgagaagc acagctacct gccttcactg aaatatacct 1621 caggctgaga tttggggtgg gatagcaggt cagttgatct tctgcaggaa ggtgcagctt 1681 ttccatatca gctcaaccac gccgccagtc cattcttaag gaactgccga ctaggactga 1741 tgatgcattt tagctttgag cttttggggg ttattctacc aacaaacagt ccattggaaa 1801 gaaaacagtc cctggaatta acagatcaga atgttcacac tggttaatct ttttttaaca 1861 atgagcatga aggtagcaga agctggtgtg tttccagatg gttcttctaa ccaaactaat 1921 ttttcactgt tgacaagcga ggcaagggtt gcactggacc aaaggctgag gcttggccat 1981 ctagcattcc atacaaaatt gtttcctata agcattcctt ttattctcta ttctatcctg 2041 ggtctgcctc aaccgtgaga taggagagtc tctggtacta gctgctgtag cagtgccctt 2101 catccagggc agttaatgga gtcttggacc ctttctttct ctgggatccc tgcccagcac 2161 cttcctatag agatgacttt aaaaggaaaa aaaaaaaaaa acccacatga tttcaaggag 2221 tctggcattc ctgaatcctt cttccctgcc aggtgcctgt cacctgtctt cactgcctcc 2281 ttttccctgt catgctcatc agcttatggc ttctgtctaa gcacctgaac agaggactga 2341 aacctccact gcaggctggt tttaggtctt gaattatgta agaatcttgc acagcactgc 2401 taatgtaaat ttcagttgtt tttccctcta ggacaaacac ttaccaaaat atgcaacttt 2461 tttttggtgg gaagagagat tgtcctgtga tttctaccca tttcctgagg cctgtggaaa 2521 taaaccttta tgtacttaaa gttatacaga aaatagaata aagttaatac caaacttgaa 2581 aaagcggccg c // LOCUS HUMADD 1463 bp mRNA PRI 12-APR-1996 DEFINITION Human adrenodoxin mRNA, complete cds. ACCESSION J03548 NID g178085 KEYWORDS adrenodoxin; iron-sulfur protein; cholesterol side-chain cleavage enzyme. SOURCE Homo sapiens (clone: hADx[2,6,7].) adrenal cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1463) AUTHORS Picado-Leonard,J., Voutilainen,R., Kao,L.C., Chung,B.C., Strauss,J.F. III. and Miller,W.L. TITLE Human adrenodoxin: cloning of three cDNAs and cycloheximide enhancement in JEG-3 cells JOURNAL J. Biol. Chem. 263 (7), 3240-3244 (1988) MEDLINE 88139395 REMARK Erratum:[J Biol Chem 1988 Aug 5;263(22):11016] REFERENCE 2 (bases 1 to 1463) AUTHORS Miller,W.L. JOURNAL Unpublished (1988) COMMENT [1] revises [2]. Printed copy of sequence for [2],[1] kindly provided by W.L.Miller, 08-DEC-1987. FEATURES Location/Qualifiers source 1..1463 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hADx[2,6,7]." /tissue_type="adrenal" /map="11q13-qter" mRNA 1..1283 /note="adrenodoxin mRNA (alt.)" mRNA 1..922 /note="adrenodoxin mRNA (alt.)" mRNA 1..1463 /note="adrenodoxin mRNA (alt.)" gene 133..687 /gene="FDX1" CDS 133..687 /gene="FDX1" /note="preproadrenodoxin" /codon_start=1 /product="adrenodoxin" /db_xref="PID:g178086" /db_xref="GDB:G00-119-657" /translation="MAAAGGARLLRAASAVLGGPAGRWLHHAGSRAGSSGLLRNRGPG GSAEASRSLSVSARARSSSEDKITVHFINRDGETLTTKGKVGDSLLDVVVENNLDIDG FGACEGTLACSTCHLIFEDHIYEKLDAITDEENDMLDLAYGLTDRSRLGCQICLTKSM DNMTVRVPETVADARQSIDVGKTS" mat_peptide 313..654 /gene="FDX1" /note="adrenodoxin" variation replace(1099,"a") /note="g in hAdx[2,7]; a in hAdx6" variation replace(1123,"c") /note="t in hAdx[2,7]; c in hAdx6" BASE COUNT 405 a 286 c 338 g 434 t ORIGIN 216 bp upstream of BamHI site. 1 gccactccag ccccgcgccc ctcgccgcgg ccctcggcgt ctgcgccgca gctgccgccc 61 ccgcctcttt ggagtctctc gcggcctcaa agcgcggcct gcgtcgcttc cggcagttcc 121 agaccgcggg cgatggctgc cgctgggggc gcccggctgc tgcgcgccgc ttctgctgtc 181 ctcggcggcc cggccggccg gtggctgcac cacgctgggt cccgcgctgg atccagcggc 241 ctgctgagga accgggggcc gggcgggagc gcggaggcga gccggtcgct gagcgtgtcg 301 gcgcgggccc ggagcagctc agaagataaa ataacagtcc actttataaa ccgagatggt 361 gaaacattaa caaccaaagg aaaagttggt gattctctgc tagatgttgt ggttgaaaat 421 aatctagata ttgatggttt tggtgcatgt gagggaaccc tggcttgttc aacctgtcac 481 ctcatctttg aagatcacat atatgagaag ttagatgcaa tcactgatga ggagaatgac 541 atgctcgatc tggcatatgg actaacagac agatcacggt tgggctgcca aatctgtttg 601 acaaaatcta tggacaatat gactgttcga gtgcctgaaa cagtggctga tgccagacaa 661 tccattgatg tgggcaagac ctcctgaact agaacaaata ggaatatttt catggatttt 721 acctattttt ataattatta tttcttaaag tgattaaatg agaacatgga tgagtggact 781 tcatattatg actagcttta ctattttaat tcaccttgca taactactga attttgtcat 841 tcttgaaagt atgcaatttt tattttggtt atattacaaa aatgtcaatc aaatattaaa 901 aaatagttag tgtgatagaa aaacctacat attttttttc tagtttgttt agcgacttag 961 caaaatgttt tcatatggtc tcatctgttt acctagaaga taggttaagg aaatatagta 1021 ttattcctgt ttgatgtggt tgaaggcaga gatctaacct ggcttgttta gggccatacc 1081 actaattaga aaatctgtgc tagaacctgt gtcttattcc tataagctat gtgttcagac 1141 tgaaactgga gaaattatga ctattttatt tatagtagta gttaaatctg aatgtgtatg 1201 gattaaaaat atttaattgc tgagtaaact gcttaacttc aaagatagtt attgacctta 1261 taaataaata tttcaaaatt ttgattcgga agactaagtc tggagctaga cattataatg 1321 ctatcaaaga agttgatctc tgttttgact aaactagagg aaaaatgatt ggatgagttt 1381 attcttttct aagcagaatg gtttaacttt gtactcaaag aaaaataatg ctgatttata 1441 aatctctgcc tataatagaa tgg // LOCUS HUMADH1CA 1450 bp mRNA PRI 30-OCT-1994 DEFINITION Human class I alcohol dehydrogenase (ADH1) alpha subunit mRNA, complete cds. ACCESSION M12963 NID g178089 KEYWORDS alcohol dehydrogenase; dehydrogenase. SOURCE Human liver, cDNA to mRNA, clones pADH6 and pADH11. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1450) AUTHORS Von Bahr-Lindstroem,H., Hoeoeg,J.-V., Heden,L.-O., Kaiser,R., Fleetwood,L., Larsson,K., Lake,M., Holmquist,B., Holmgren,A., Hempel,J., Vallee,B.L. and Joernvall,H. TITLE cDNA and protein structure for the alpha subunit of human liver alcohol dehydrogenase JOURNAL Biochemistry 25 (9), 2465-2470 (1986) MEDLINE 86243367 COMMENT Draft entry and sequence for [1] kindly provided in computer-readable form by H.von Bahr-Lindstroem, 26-AUG-1986. The other human class I ADH1 alpha subunit sequence is found under accession M12271. FEATURES Location/Qualifiers source 1..1450 /organism="Homo sapiens" /db_xref="taxon:9606" /map="4q21-q23" mRNA <1..1450 /note="aADH mRNA" gene 73..1200 /gene="ADH1" CDS 73..1200 /gene="ADH1" /note="alcohol dehydrogenase alpha subunit (EC 1.1.1.1)" /codon_start=1 /db_xref="GDB:G00-119-650" /db_xref="PID:g178090" /translation="MSTAGKVIKCKAAVLWELKKPFSIEEVEVAPPKAHEVRIKMVAV GICGTDDHVVSGTMVTPLPVILGHEAAGIVESVGEGVTTVKPGDKVIPLAIPQCGKCR ICKNPESNYCLKNDVSNPQGTLQDGTSRFTCRRKPIHHFLGISTFSQYTVVDENAVAK IDAASPLEKVCLIGCGFSTGYGSAVNVAKVTPGSTCAVFGLGGVGLSAIMGCKAAGAA RIIAVDINKDKFAKAKELGATECINPQDYKKPIQEVLKEMTDGGVDFSFEVIGRLDTM MASLLCCHEACGTSVIVGVPPDSQNLSMNPMLLLTGRTWKGAILGGFKSKECVPKLVA DFMAKKFSLDALITHVLPFEKINEGFDLLHSGKSIRTILMF" BASE COUNT 414 a 307 c 354 g 375 t ORIGIN 55 bp upstream from PstI site; chrosome 4q21. 1 gatgcacttg agcagggaag aaatccacaa ggactcacca gtctcctggt ctgcagagaa 61 gacagaatca acatgagcac agcaggaaaa gtaatcaaat gcaaagcagc tgtgctatgg 121 gagttaaaga aacccttttc cattgaggag gtggaggttg cacctcctaa ggcccatgaa 181 gttcgtatta agatggtggc tgtaggaatc tgtggcacag atgaccacgt ggttagtggt 241 accatggtga ccccacttcc tgtgatttta ggccatgagg cagccggcat cgtggagagt 301 gttggagaag gggtgactac agtcaaacca ggtgataaag tcatcccact cgctattcct 361 cagtgtggaa aatgcagaat ttgtaaaaac ccggagagca actactgctt gaaaaacgat 421 gtaagcaatc ctcaggggac cctgcaggat ggcaccagca ggttcacctg caggaggaag 481 cccatccacc acttccttgg catcagcacc ttctcacagt acacagtggt ggatgaaaat 541 gcagtagcca aaattgatgc agcctcgcct ctagagaaag tctgtctcat tggctgtgga 601 ttttcaactg gttatgggtc tgcagtcaat gttgccaagg tcaccccagg ctctacctgt 661 gctgtgtttg gcctgggagg ggtcggccta tctgctatta tgggctgtaa agcagctggg 721 gcagccagaa tcattgcggt ggacatcaac aaggacaaat ttgcaaaggc caaagagttg 781 ggtgccactg aatgcatcaa ccctcaagac tacaagaaac ccatccagga ggtgctaaag 841 gaaatgactg atggaggtgt ggatttttca tttgaagtca tcggtcggct tgacaccatg 901 atggcttccc tgttatgttg tcatgaggca tgtggcacaa gtgtcatcgt aggggtacct 961 cctgattccc aaaacctctc aatgaaccct atgctgctac tgactggacg tacctggaag 1021 ggagctattc ttggtggctt taaaagtaaa gaatgtgtcc caaaacttgt ggctgatttt 1081 atggctaaga agttttcatt ggatgcatta ataacccatg ttttaccttt tgaaaaaata 1141 aatgaaggat ttgacctgct tcactctggg aaaagtatcc gtaccattct gatgttttga 1201 gacaatacag atgttttccc ttgtggcagt cttcagcctc ctctacccta catgatctgg 1261 agcaacagct gggaaatatc attaattctg ctcatcacag attttatcaa taaattacat 1321 ttgggggctt tccaaagaaa tggaaattga tgtaaaatta tttttcaagc aaatgtttaa 1381 aatccaaatg agaactaaat aaagtgttga acatcagctg gggaattgaa gccaataaac 1441 cttccttctt // LOCUS HUMADH4C1 1981 bp mRNA PRI 30-OCT-1994 DEFINITION Human class II alchohol dehydrogenase (ADH4) pi subunit mRNA, complete cds. ACCESSION M15943 NID g178120 KEYWORDS alcohol dehydrogenase. SOURCE Human liver, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1981) AUTHORS Hoog,J.O., von Bahr-Lindstrom,H., Heden,L.O., Holmquist,B., Larsson,K., Hempel,J., Vallee,B.L. and Jornvall,H. TITLE Structure of the class II enzyme of human liver alcohol dehydrogenase: combined cDNA and protein sequence determination of the pi subunit JOURNAL Biochemistry 26 (7), 1926-1932 (1987) MEDLINE 87242382 COMMENT Draft entry and clean copy of sequence [1] kindly provided by J.-O Hoeoeg, 22-JUN-1987. Possible polyadenylation signals are located at bases 1492-1497, 1496-1501, 1825-1830 and 1961-1966. FEATURES Location/Qualifiers source 1..1981 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA 1..1981 /gene="ADH4" /note="pADH mRNA; G00-119-653" gene 1..1981 /gene="ADH4" CDS 62..1240 /gene="ADH4" /note="alcohol dehydrogenase pi subunit" /codon_start=1 /db_xref="GDB:G00-119-653" /db_xref="PID:g178121" /translation="MGTKGKVIKCKAAIAWEAGKPLCIEEVEVAPPKAHEVRIQIIAT SLCHTDASVIDSKFEGLAFPVIVGHEAAGIVESIGPGVTNVKPGDKVIPLYAPLCRKC KFCLSPLTNLCGKISNLKSPASDQQLMEDKTSRFTCKGKPVYHFFGTSTFSQYTVVSD INLAKIDDDANLERVCLLGCGFSTGYGAAINNAKVTPGSTCAVFGLGGVGLSAVMGCK AAGASRIIGIDINSEKFVKAKALGATDCLNPRDLHKPIQEVIIELTKGGVDFALDCAG GSETMKAALDCTTAGWGSCTFIGVAAGSKGLTIFPEELIIGRTINGTFFGGWKSVDSI PKLVTDYKNKKFNLDALVTHTLPFDKISEAFDLMNQGKSVRTILIFGRCQEQFRILSD " BASE COUNT 625 a 353 c 448 g 555 t ORIGIN 12 bp upstream of HindIII site. 1 cgaggagttt gaagctttct taactcagaa agaaacttcc aacacagttt cccaaagaaa 61 aatgggcacc aagggcaaag ttattaaatg caaagcagcc atcgcctggg aagcaggcaa 121 gcccctttgc attgaagagg ttgaagtagc tccccccaag gctcatgaag ttcgcattca 181 gatcattgct acctccctgt gccatactga tgccagtgtt atcgattcta aatttgaggg 241 cctagctttc ccagtgatcg ttggccatga ggctgcaggt attgtggaaa gtattgggcc 301 aggagtgacc aacgtcaaac caggtgacaa agtaattcca ctttatgcac ctctatgtag 361 aaaatgcaag ttttgtctga gtccactcac aaatttgtgt gggaaaatca gtaatctcaa 421 aagtcctgct agtgatcaac aactaatgga agacaaaacc agcaggttta cctgcaaagg 481 aaaaccagtt taccatttct ttggaaccag tacattctct cagtacactg tggtgtcaga 541 tatcaatctt gccaaaatag atgatgatgc aaatttagag agagtttgtc tgcttggatg 601 tgggttttca actggctatg gggctgcaat caacaatgcc aaggtcaccc ctggttcgac 661 ttgtgctgtc tttggcctag gaggtgtggg tctttctgct gtaatgggtt gtaaagcagc 721 aggagcttcc agaatcatag gtattgacat caacagtgag aagtttgtga aggctaaagc 781 cctgggagcc actgactgcc tcaatcctag agacttacat aaaccgatcc aggaagttat 841 cattgaattg accaagggag gtgtggattt tgcccttgac tgtgcaggtg gatctgaaac 901 catgaaagca gccctggact gtacaaccgc aggctgggga tcatgtactt tcattggagt 961 agctgctggt agcaaaggat tgactatttt tccagaggag ctaataatcg gccgtactat 1021 aaatggaaca ttctttggtg gttggaaaag tgtagattct atcccaaagc tggtcactga 1081 ctataagaat aagaaattca atctggatgc attggtgacc cataccctgc cttttgacaa 1141 aatcagtgag gcatttgacc taatgaacca aggaaaaagc gtccgaacaa tcctcatctt 1201 tggaagatgc caggagcaat tcagaatact atctgattga atgtgaacct gcctggttaa 1261 tttattacct gatttgatga accaaggaaa gccatgagtt taaacaaata tttacattta 1321 atatgggaac ataaaagagc tttaaatatt atagactttg tacctgttat atatatgaat 1381 attccctatg ttaaataata ataataacta gtgtttatga atagaatcat atcatcttta 1441 gaaattgttt aaaattagtt ctgggaagtt gaaagtgggg aatgaagaga taataaataa 1501 aactagattg gccatatgtt tataattttt ttagattggg taatgaatac atggagtttc 1561 attatacttt tctctccacg ttttgtctat gttgaaaatt ttctgggagc taaatgatga 1621 gaacacatgg acacatgatg gggaacaaca cacactgggg cctgttgagg gcagggagtc 1681 ggcagagaga gagcatcagg aagaatagct aatggatgct gggcttcata cctgggtgat 1741 gagatgatct gtgcagcaaa gcaccatggt acatgtttac ctatgtaaca aacctgcaca 1801 tcctgcacat gtaccctgga acttaataaa agttggaaat ttttaaaaag aatgaataag 1861 acctggtatt tgatagcaca acagggagac tatagtcaac agcaatttaa ttgtatattt 1921 taatatgact aaaagagtat aatggattgt ttgtaacaca aataaatgct tgaggagatg 1981 c // LOCUS HUMADH5C3 1613 bp mRNA PRI 30-OCT-1994 DEFINITION Human alcohol dehydrogenase class III (ADH5) mRNA, complete cds. ACCESSION M29872 NID g178131 KEYWORDS alcohol dehydrogenase; zinc metalloenzyme. SOURCE Human liver, cDNA to mRNA, clones ADH5-[14.1,30.1]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1613) AUTHORS Giri,P.R., Krug,J.F., Kozak,C., Moretti,T., O'Brien,S.J., Seuanez,H.N. and Goldman,D. TITLE Cloning and comparative mapping of a human class III (chi) alcohol dehydrogenase cDNA JOURNAL Biochem. Biophys. Res. Commun. 164 (1), 453-460 (1989) MEDLINE 90026418 FEATURES Location/Qualifiers source 1..1613 /organism="Homo sapiens" /db_xref="taxon:9606" gene 5..1183 /gene="ADH5" CDS 5..1183 /gene="ADH5" /note="alcohol dehydrogenase class III" /codon_start=1 /db_xref="GDB:G00-118-978" /db_xref="PID:g178132" /translation="MGAATPDVSPPRRPESVNMANEVIKCKAAVAWEAGKPLSIEEIE VAPPKAHEVRIKIIATAVCHTDAYTLSGADPEGCFPVILGHEGAGIVESVGEGVTKLK AGDTVIPLYIPQCGECKFCLNPKTNLCQKIRVTQGKGLMPDGTSRFTCKGKTILHYMG TSTFSEYTVVADISVAKIDPLAPLYKVCLLGCGISTGYGAAVNTAKLEPGSVCAVFGL GGVGLAVIMGCKVAGASRIIGVDINKDKFARAKEFGATECINPQDLSKPIQEVLIEMT DGGVDYSFECIGNVKVMRAALEACHKGWGVSVVVGVAASGEEIATRPFQLVTGRTWKG TAFGGWKSVESVPKLVSEYMSKKIKVDEFVTHNLSFDEINKAFELMHSGKSIRTVVKI " BASE COUNT 448 a 317 c 417 g 431 t ORIGIN 1 gggcatgggc gcggccaccc cggatgtcag ccccccgcgc cgaccagaat ccgtgaacat 61 ggcgaacgag gttatcaagt gcaaggctgc agttgcttgg gaggctggaa agcctctctc 121 catagaggag atagaggtgg cacccccaaa ggctcatgaa gttcgaatca agatcattgc 181 cactgcggtt tgccacaccg atgcctatac cctgagtgga gctgatcctg agggttgttt 241 tccagtgatc ttgggacatg aaggtgctgg aattgtggaa agtgttggtg agggagttac 301 taagctgaag gcgggtgaca ctgtcatccc actttacatc ccacagtgtg gagaatgcaa 361 attttgtcta aatcctaaaa ctaacctttg ccagaagata agagtcactc aagggaaagg 421 attaatgcca gatggtacca gcagatttac ttgcaaagga aagacaattt tgcattacat 481 gggaaccagc acattttctg aatacacagt tgtggctgat atctctgttg ctaaaataga 541 tcctttagca cctttgtata aagtctgcct tctaggttgt ggcatttcaa ccggttatgg 601 tgctgctgtg aacactgcca agttggagcc tggctctgtt tgtgccgtct ttggtctggg 661 aggagtcgga ttggcagtta tcatgggctg taaagtggct ggtgcttccc ggatcattgg 721 tgtggacatc aataaagata aatttgcaag ggccaaagag tttggagcca ctgaatgtat 781 taaccctcag gatttaagta aacccatcca ggaagtgctc attgagatga ccgatggagg 841 agtggactat tcctttgaat gtattggtaa tgtgaaggtc atgagagcag cacttgaggc 901 atgtcacaag ggctggggcg tcagcgtcgt ggttggagta gctgcttcag gtgaagaaat 961 tgccactcgt ccattccagc tggtaacagg tcgcacatgg aaaggcactg cctttggagg 1021 atggaagagt gtagaaagtg tcccaaagtt ggtgtctgaa tatatgtcca aaaagataaa 1081 agttgatgaa tttgtgactc acaatctgtc ttttgatgaa atcaacaaag cctttgaact 1141 gatgcattct ggaaagagca ttcgaactgt tgtaaagatt taattcaaaa gagaaaaata 1201 atgtccatcc tgtcgtgatg tgataggagc agcttaacag gcagggagaa gcgcctccaa 1261 cctcacagcc tcgtagagct tcacagctac tccagaaaat agggttatgt gtgtcattca 1321 tgaatctcta taatcaagga caaggataat tcagtcatga acctgttttc tggatgctcc 1381 tccacataaa taattgctag ttataaggat atttaacata ataaaagtaa ttctacattg 1441 tgtgaattgt cttgtttatg ctgtcatcat tgtcacggtt tgtctgccca ttatcttcat 1501 tctgcaaggg aaagggaaag gaagcagggc agtggtgggt gtctgaaacc tcagaaacat 1561 aacgttgaac ttttaagggt ctcagtcccc gttgattaaa gaacagatcc ccg // LOCUS HUMADORA1 1687 bp mRNA PRI 30-OCT-1994 DEFINITION Human adenosine A2b receptor (ADORA2) mRNA, complete cds. ACCESSION M97759 NID g178149 KEYWORDS adenosine A2b receptor. SOURCE Homo sapiens hippocampus cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1687) AUTHORS Pierce,K.D., Furlong,T.J., Selbie,L.A. and Shine,J. TITLE Molecular cloning and expression of an adenosine A2b receptor from human brain JOURNAL Biochem. Biophys. Res. Commun. 187 (1), 86-93 (1992) MEDLINE 92392387 FEATURES Location/Qualifiers source 1..1687 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="hippocampus" /map="Unassigned" mat_peptide 136..1131 /gene="ADORA2" /note="G00-126-602" /evidence=experimental /product="adenosine A2b receptor" gene 136..1134 /gene="ADORA2" CDS 136..1134 /gene="ADORA2" /codon_start=1 /db_xref="GDB:G00-126-602" /evidence=experimental /product="adenosine A2b receptor" /db_xref="PID:g178150" /translation="MLLETQDALYVALELVIAALSVAGNVLVCAAVGTANTLQTPTNY FLVSLAAADVAVGLFAIPFAITISLGFCTDFYGCLFLACFVLVLTQSSIFSLLAVAVD RYLAICVPLRYKSLVTGTRARGVIAVLWVLAFGIGLTPFLGWNSKDSATNNCTEPWDG TTNESCCLVKCLFENVVPMSYMVYFNFFGCVLPPLLIMLVIYIKIFLVACRQLQRTEL MDHSRTTLQREIHAAKSLAMIVGIFALCWLPVHAVNCVTLFQPAQGKNKPKWAMNMAI LLSHANSVVNPIVYAYRNRDFRYTFHKIISRYLLCQADVKSGNGQAGVQPALGVGL" BASE COUNT 375 a 442 c 443 g 427 t ORIGIN 1 cccagccccg aggctcagaa gcggcaggcg gaggcgcggt ccgggcgcta tggccatgcc 61 cggcgggtct cacgcggctg cccctcgccc ggcgcgcctt cggtaggggg cgcccggggc 121 ccagctggcc cggccatgct gctggagaca caggacgcgc tgtacgtggc gctggagctg 181 gtcatcgccg cgctttcggt ggcgggcaac gtgctggtgt gcgccgcggt gggcacggcg 241 aacactctgc agacgcccac caactacttc ctggtgtccc tggctgcggc cgacgtggcc 301 gtggggctct tcgccatccc ctttgccatc accatcagcc tgggcttctg cactgacttc 361 tacggctgcc tcttcctcgc ctgcttcgtg ctggtgctca cgcagagctc catcttcagc 421 cttctggccg tggcagtcga cagatacctg gccatctgtg tcccgctcag gtataaaagt 481 ttggtcacgg ggacccgagc aagaggggtc attgctgtcc tctgggtcct tgcctttggc 541 atcggattga ctccattcct ggggtggaac agtaaagaca gtgccaccaa caactgcaca 601 gaaccctggg atggaaccac gaatgaaagc tgctgccttg tgaagtgtct ctttgagaat 661 gtggtcccca tgagctacat ggtatatttc aatttctttg ggtgtgttct gcccccactg 721 cttataatgc tggtgatcta cattaagatc ttcctggtgg cctgcaggca gcttcagcgc 781 actgagctga tggaccactc gaggaccacc ctccagcggg agatccatgc agccaagtca 841 ctggccatga ttgtggggat ttttgccctg tgctggttac ctgtgcatgc tgttaactgt 901 gtcactcttt tccagccagc tcagggtaaa aataagccca agtgggcaat gaatatggcc 961 attcttctgt cacatgccaa ttcagttgtc aatcccattg tctatgctta ccggaaccga 1021 gacttccgct acacttttca caaaattatc tccaggtatc ttctctgcca agcagatgtc 1081 aagagtggga atggtcaggc tggggtacag cctgctctcg gtgtgggcct atgatctagg 1141 ctctcgcctc ttccaggaga agatacaaat ccacaagaaa caaagaggac acggctggtt 1201 ttcattgtga aagatagcta cacctcacaa ggaaatggac tgcctctctt gagcacttcc 1261 ctggagctac cacgtatcta gctaatatgt atgtgtcagt agtagcacca aggattgaca 1321 aatatattta tgatctattc agctgctttt actgtgtgga ttatgccaac agcttgaatg 1381 gattctaaca gactcttttg tttttaaaag tctgccttgt ttatggtgga aaattactga 1441 aactatttta ctgtgaaaca gtgtgaacta ttataatgca aatacttttt aacttagagg 1501 caatggaaaa ataaaagttg actgtactaa aaatgtatac ttgttgccag gaaggtgacc 1561 tcaaaaatta aaagtataat tattcggccg ggcatggtgg ctcacacctg taattccagc 1621 actttgggag gccaaggcag gcggatcacg aggtcaggag ttcaaaacca gcctgtccaa 1681 tatagtg // LOCUS HUMADORA1X 2900 bp mRNA PRI 12-APR-1994 DEFINITION Human adenosine A1 receptor (ADORA1) mRNA exons 1-6, complete cds. ACCESSION L22214 NID g347520 KEYWORDS adenosine A1 receptor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2900) AUTHORS Ren,H. and Stiles,G.L. TITLE Characterization of the human A1 adenosine receptor gene. Evidence for alternative splicing JOURNAL J. Biol. Chem. 269 (4), 3104-3110 (1994) MEDLINE 94132091 FEATURES Location/Qualifiers source 1..2900 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="2 year old" /tissue_type="hippocampus" /tissue_lib="Stratagene #936205" gene 1..2900 /gene="ADORA1" exon 1..82 /gene="ADORA1" /number=1 exon 83..99 /gene="ADORA1" /number=2 exon 100..199 /gene="ADORA1" /number=3 exon 200..353 /gene="ADORA1" /number=4 exon 354..751 /gene="ADORA1" /number=5 CDS 411..1391 /gene="ADORA1" /codon_start=1 /product="adenosine A1 receptor" /db_xref="PID:g347521" /translation="MPPSISAFQAAYIGIEVLIALVSVPGNVLVIWAVKVNQALRDAT FCFIVSLAVADVAVGALVIPLAILINIGPQTYFHTCLMVACPVLILTQSSILALLAIA VDRYLRVKIPLRYKMVVTPRRAAVAIAGCWILSFVVGLTPMFGWNNLSAVERAWAANG SMGEPVIKCEFEKVISMEYMVYFNFFVWVLPPLLLMVLIYLEVFYLIRKQLNKKVSAS SGDPQKYYGKELKIAKSLALILFLFALSWLPLHILNCITLFCPSCHKPSILTYIAIFL THGNSAMNPIVYAFRIQKFRVTFLKIWNDHFRCQPAPPIDEDLPEERPDD" exon 752..2900 /gene="ADORA1" /number=6 BASE COUNT 500 a 851 c 934 g 615 t ORIGIN 1 atgagtgtca gaagtgtgaa gggtgcctgt tctgaatccc agagcctcct ctccctctgt 61 gaggctggca ggtgaggaag ggtttaacct cactggaagg aatccctgga gctagcggct 121 gctgaaggcg tcgaggtgtg ggggcacttg gacagaacag tcaggcagcc gggagctctg 181 ccagctttgg tgaccttggg ccgggctggg agcgctgcgg cgggagccgg aggactatga 241 gctgccgcgc gttgtccaga gcccagccca gccctacgcg cgcggcccgg agctctgttc 301 cctggaactt tgggcactgc ctctgggacc cctgccggcc agcaggcagg atggtgcttg 361 cctcgtgccc cttggtgccc gtctgctgat gtgcccagcc tgtgcccgcc atgccgccct 421 ccatctcagc tttccaggcc gcctacatcg gcatcgaggt gctcatcgcc ctggtctctg 481 tgcccgggaa cgtgctggtg atctgggcgg tgaaggtgaa ccaggcgctg cgggatgcca 541 ccttctgctt catcgtgtcg ctggcggtgg ctgatgtggc cgtgggtgcc ctggtcatcc 601 ccctcgccat cctcatcaac attgggccac agacctactt ccacacctgc ctcatggttg 661 cctgtccggt cctcatcctc acccagagct ccatcctggc cctgctggca attgctgtgg 721 accgctacct ccgggtcaag atccctctcc ggtacaagat ggtggtgacc ccccggaggg 781 cggcggtggc catagccggc tgctggatcc tctccttcgt ggtgggactg acccctatgt 841 ttggctggaa caatctgagt gcggtggagc gggcctgggc agccaacggc agcatggggg 901 agcccgtgat caagtgcgag ttcgagaagg tcatcagcat ggagtacatg gtctacttca 961 acttctttgt gtgggtgctg cccccgcttc tcctcatggt cctcatctac ctggaggtct 1021 tctacctaat ccgcaagcag ctcaacaaga aggtgtcggc ctcctccggc gacccgcaga 1081 agtactatgg gaaggagctg aagatcgcca agtcgctggc cctcatcctc ttcctctttg 1141 ccctcagctg gctgcctttg cacatcctca actgcatcac cctcttctgc ccgtcctgcc 1201 acaagcccag catccttacc tacattgcca tcttcctcac gcacggcaac tcggccatga 1261 accccattgt ctatgccttc cgcatccaga agttccgcgt caccttcctt aagatttgga 1321 atgaccattt ccgctgccag cctgcacctc ccattgacga ggatctccca gaagagaggc 1381 ctgatgacta gaccccgcct tccgctccca ccagcccaca tccagtgggg tctcagtcca 1441 gtcctcacat gcccgctgtc ccaggggtct ccctgagcct gccccagctg ggctgttggc 1501 tgggggcatg ggggaggctc tgaagagata cccacagagt gtggtccctc cactaggagt 1561 taactaccct acacctctgg gccctgcagg aggcctggga gggcaagggt cctacggagg 1621 gaccaggtgt ctagaggcaa cagtgttctg agcccccacc tgcctgacca tcccatgagc 1681 agtccagcgc ttcagggctg ggcaggtcct ggggaggctg agactgcaga ggagccacct 1741 gggctgggag aaggtgcttg ggcttctgcg gtgaggcagg ggagtctgct tgtcttagat 1801 gttggtggtg cagccccagg accaagctta aggagaggag agcatctgct ctgagacgga 1861 tggaaggaga gaggttgagg atgcactggc ctgttctgta ggagagactg gccagaggca 1921 gctaaggggc aggaatcaag gagcctccgt tcccacctct gaggactctg gaccccaggc 1981 cataccaggt gctagggtgc ctgctctcct tgccctgggc cagcccagga ttgtacgtgg 2041 gagaggcaga aagggtaggt tcagtaatca tttctgatga tttgctggag tgctggctcc 2101 acgccctggg gagtgagctt ggtgcggtag gtgctggcct caaacagcca cgaggtggta 2161 gctctgagcc ctccttcttg ccctgagctt tccggggagg agcctggagt gtaattacct 2221 gtcatctggg ccaccagctc cactggcccc cgttgccggg cctggactgt cctaggtgac 2281 cccatctctg ctgcttctgg gcctgatgga gaggagaaca ctagacatgc caactcggga 2341 gcattctgcc tgcctgggaa cggggtggac gagggagtgt ctgtaaggac tcagtgttga 2401 ctgtaggcgc ccctggggtg ggtttagcag gctgcagcag gcagaggagg agtacccccc 2461 tgagagcatg tgggggaagg ccttgctgtc atgtgaatcc ctcaataccc ctagtatctg 2521 gctgggtttt caggggcttt ggaagctctg ttgcaggtgt ccgggggtct aggactttag 2581 ggatctggga tctggggaag gaccaaccca tgccctgcca agcctggagc ccctgtgttg 2641 gggggcaagg tgggggagcc tggagcccct gtgtgggagg gcgaggcggg ggagcctgga 2701 gcccctgtgt gggagggcga ggcgggggat cctggagccc ctgtgtcggg gggcgaggga 2761 ggggaggtgg ccgtcggttg accttctgaa catgagtgtc aactccagga cttgcttcca 2821 agcccttccc tctgttggaa attgggtgtg ccctggctcc caagggaggc ccatgtgact 2881 aataaaaaac tgtgaaccct // LOCUS HUMADPRF 1364 bp mRNA PRI 12-DEC-1994 DEFINITION Homo sapiens ADP-ribosylation factor mRNA, complete cds. ACCESSION L38490 NID g601847 KEYWORDS ADP-ribosylation factor. SOURCE Homo sapiens (tissue library: Stratagene #937202) fetal retina cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1364) AUTHORS Smith,S.A., Holik,P.R., Stevens,J., Melis,R., White,R. and Albertsen,H. TITLE Isolation and mapping of a gene encoding a novel human ADP-ribosylation factor to chromosome 17q12-21 JOURNAL Unpublished (1995) FEATURES Location/Qualifiers source 1..1364 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="retina" /tissue_lib="Stratagene #937202" /map="chromosome 17q12-21" 5'UTR 1..156 mRNA 1..1364 CDS 157..762 /codon_start=1 /function="membrane trafficing and protein secretion" /product="ADP-ribosylation factor" /db_xref="PID:g601848" /translation="MGNHLTEMAPTASSFLPHFQALHVVVIGLDSAGKTSLLYRLKFK EFVQSVPTKGFNTEKIRVPLGGSRGITFQVWDVGGQEKLRPLWRSYNRRTDGLVFVVD AAEAERLEEAKVELHRISRASDNQGVPVLVLANKQDQPGALSAAEVEKRLAVRELAAA TLTHVQGCSAVDGLGLQQGLERLYEMILKRKKAARGGKKRR" 3'UTR 763..1364 polyA_signal 1340..1345 polyA_site 1364 BASE COUNT 272 a 359 c 405 g 328 t ORIGIN 1 gcacgagggt gtctgcgggg gtctcgcggg gcggctgcgg tgtttcaccg gggaaaggct 61 cgaggagagc gcggctcacg agagataacc cagctgtgct ccctggaacc ttcaatttca 121 aggcctccct gcctctacta ggcgccttag ctcactatgg ggaaccactt gactgagatg 181 gcgcccactg cctcctcctt cttgccccac ttccaagccc tgcatgtcgt ggtcattggg 241 ctggactctg ctggaaagac ctccctcctt taccgcctca agttcaagga gtttgtccag 301 agtgtcccca ccaaaggctt caacaccgag aagatccggg tgcccctcgg gggatcgcgt 361 ggcatcacct tccaagtgtg ggacgtcggg gggcaggaga agctgcgacc actgtggcgc 421 tcttataacc gccggacaga cggtctagtg tttgtggtgg acgctgcgga ggctgagcgg 481 cttgaggaag ccaaggtgga gttgcaccga atcagccggg cctcggacaa ccagggcgtg 541 ccagtcctgg tgctggccaa caagcaggac cagcccgggg cactgagcgc tgctgaggtg 601 gagaagaggc tggcagtccg agagctagca gccgccactc tcactcatgt gcaaggctgc 661 agcgctgtgg acggtctggg cctgcagcag ggccttgagc gcctctatga gatgatcctc 721 aagaggaaga aggcagctcg gggtggcaag aagagacggt gacccaagcc ccccctccct 781 ttcctcccac ctagtagggg tctgcacact tggacagcag ggtgggacca gcctgtgacc 841 tctcagtcag actggggtgc aggacctgtc cacctcaatg aaggagagag gagcatgggg 901 tgtcccgttt tggtgccaca ctggggtggg gatgggagat gggatgtctt tgcatatctc 961 tctcatcctc tctggagaag tgggcgctgc aggactgtgg agacgtaaat gtaaactgtg 1021 actctacctc gaccctgttt cttatttttc ttctctggct aaaaattttt aattggatgt 1081 gtttggggcc ggggggatgg aagtgacttg gagaatgtgt ttgggatgaa ataactatct 1141 ccccttcctc tgtcccccaa ctggggagtc tccccaggct gcttttctag gaataccagt 1201 cacatagttt ttatttttgt gtctgtgaaa gtgccaagaa cccctcccca catttgtaga 1261 tccatgaccc tttttataag ctgtgtgtgt cctctgtatt attgttatta actatttttt 1321 agcatttgcc tgtaagttat taaagactga taactgtagc tctt // LOCUS HUMADRA 1521 bp DNA PRI 30-OCT-1994 DEFINITION Human platelet alpha-2-adrenergic receptor gene, complete cds. ACCESSION M18415 NID g178191 KEYWORDS alpha-2-adrenergic receptor; alpha-adrenergic receptor. SOURCE Human (lambda-EMBL 3 library) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1521) AUTHORS Kobilka,B.K., Matsui,H., Kobilka,T.S., Yang-Feng,T.L., Francke,U., Caron,M.G., Lefkowitz,R.J. and Regan,J.W. TITLE Cloning, sequencing, and expression of the gene coding for the human platelet alpha 2-adrenergic receptor JOURNAL Science 238 (4827), 650-656 (1987) MEDLINE 88042789 FEATURES Location/Qualifiers source 1..1521 /organism="Homo sapiens" /db_xref="taxon:9606" /map="10q23-q25" gene 59..1411 /gene="ZNF32" CDS 59..1411 /gene="ZNF32" /note="alpha-2-adrenergic receptor old gene name 'ADRA2R'" /codon_start=1 /db_xref="GDB:G00-125-339" /db_xref="PID:g178192" /translation="MGSLQPDAGNASWNGTEAPGGGARATPYSLQVTLTLVCLAGLLM LLTVFGNVLVIIAVFTSRALKAPQNLFLVSLASADILVATLVIPFSLANEVMGYWYFG KTWCEIYLALDVLFCTSSIVHLCAISLDRYWSITQAIEYNLKRTPRRIKAIIITCWVI SAVISFPPLISIEKKGGGGGPQPAEPRCEINDQKWYVISSCIGSFFAPCLIMILVYVR IYQIAKRRTRVPPSRRGPDAVAAPPGGTERRPNGLGPERSAGPGGAEAEPLPTQLNGA PGEPAPAGPRDTDALDLEESSSSDHAERPPGPRRPERGPRGKGKARASQVKPGDSLRG AGRGRRGSGRRLQGRGRSASGLPRRRAGAGGQNLEKRFTFVLAVVIGVFVVCWFPFFF TYTLTAVGCSVPRTLFKFFFWFGYCNSSLNPVIYTIFNHDFRRAFKKILCRGDRKRIV " BASE COUNT 223 a 546 c 499 g 253 t ORIGIN Chromosome 10q23-q25. 1 cccgccttca tcttccgcca ggaggccaag gccgttggcc gagggcagct ttgcgcccat 61 gggctccctg cagccggacg cgggcaacgc gagctggaac gggaccgagg cgccgggggg 121 cggcgcccgg gccacccctt actccctgca ggtgacgctg acgctggtgt gcctggccgg 181 cctgctcatg ctgctcaccg tgttcggcaa cgtgctcgtc atcatcgccg tgttcacgag 241 ccgcgcgctc aaggcgcccc aaaacctctt cctggtgtct ctggcctcgg ccgacatcct 301 ggtggccacg ctcgtcatcc ctttctcgct ggccaacgag gtcatgggct actggtactt 361 cggcaagact tggtgcgaga tctacctggc gctcgacgtg ctcttctgca cgtcgtccat 421 cgtgcacctg tgcgccatca gcctggaccg ctactggtcc atcacacagg ccatcgagta 481 caacctgaag cgcacgccgc gccgcatcaa ggccatcatc atcacctgtt gggtcatctc 541 ggccgtcatc tccttcccgc cgctcatctc catcgagaag aagggcggcg gcggcggccc 601 gcagccggcc gagccgcgct gcgagatcaa cgaccagaag tggtacgtca tctcgtcgtg 661 catcggctcc ttcttcgctc cctgcctcat catgatcctg gtctacgtgc gcatctacca 721 gatcgccaag cgtcgcaccc gcgtgccacc cagccgccgg ggtccggacg ccgtcgccgc 781 gccgccgggg ggcaccgagc gcaggcccaa cggtctgggc cccgagcgca gcgcgggccc 841 ggggggcgca gaggccgaac cgctgcccac ccagctcaac ggcgcccctg gcgagcccgc 901 gccggccggg ccgcgcgaca ccgacgcgct ggacctggag gagagctcgt cttccgacca 961 cgccgagcgg cctccagggc cccgcagacc cgagcgcggt ccccggggca aaggcaaggc 1021 ccgagcgagc caggtgaagc cgggcgacag cctgcgcggc gcgggccggg ggcgacgggg 1081 atcgggacgc cggctgcagg gccgggggag gagcgcgtcg gggctgccaa ggcgtcgcgc 1141 tggcgcgggc gggcagaacc tcgagaagcg cttcacgttc gtgctggccg tggtcatcgg 1201 agtgttcgtg gtgtgctggt tccccttctt cttcacctac acgctcacgg ccgtcgggtg 1261 ctccgtgcca cgcacgctct tcaaattctt cttctggttc ggctactgca acagctcgtt 1321 gaacccggtc atctacacca tcttcaacca cgatttccgc cgcgccttca agaagatcct 1381 ctgtcggggg gacaggaagc ggatcgtgtg aggtttccgc tggcgcccgc gtagactcac 1441 gctgactgca ggcagcgggg ggcatcgagg ggtgcttagc ccgagggcac tcagaaaccc 1501 gggcgctgct gctctgcgtt t // LOCUS HUMADRB1 1723 bp mRNA PRI 30-OCT-1994 DEFINITION Human beta-1-adrenergic receptor mRNA, complete cds. ACCESSION J03019 NID g178199 KEYWORDS beta-1 adrenergic receptor. SOURCE Human placenta, cDNA to mRNA (library of E.Sadler), clone 11. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1723) AUTHORS Frielle,T., Collins,S., Daniel,K.W., Caron,M.G., Lefkowitz,R.J. and Kobilka,B.K. TITLE Cloning of the cDNA for the human beta 1-adrenergic receptor JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84 (22), 7920-7924 (1987) MEDLINE 88068509 COMMENT Draft entry for [1] kindly provided by T.Frielle, 02-FEB-1988. FEATURES Location/Qualifiers source 1..1723 /organism="Homo sapiens" /db_xref="taxon:9606" /map="10q24-q26" gene 87..1520 /gene="ADRB1" CDS 87..1520 /gene="ADRB1" /note="beta-1-adrenergic receptor" /codon_start=1 /db_xref="GDB:G00-119-654" /db_xref="PID:g178200" /translation="MGAGVLVLGASEPGNLSSAAPLPDGAATAARLLVPASPPASLLP PASESPEPLSQQWTAGMGLLMALIVLLIVAGNVLVIVAIAKTPRLQTLTNLFIMSLAS ADLVMGLLVVPFGATIVVWGRWEYGSFFCELWTSVDVLCVTASIETLCVIALDRYLAI TSPFRYQSLLTRARARGLVCTVWAISALVSFLPILMHWWRAESDEARRCYNDPKCCDF VTNRAYAIASSVVSFYVPLCIMAFVYLRVFREAQKQVKKIDSCERRFLGGPARPPSPS PSPVPAPAPPPGPPRPAAAAATAPLANGRAGKRRPSRLVALREQKALKTLGIIMGVFT LCWLPFFLANVVKAFHRELVPDRLFVFFNWLGYANSAFNPIIYCRSPDFRKAFQGLLC CARRAARRRHATHGDRPRASGCLARPGPPPSPGAASDDDDDDVVGATPPARLLEPWAG CNGGAAADSDSSLDEPCRPGFASESKV" BASE COUNT 228 a 652 c 559 g 284 t ORIGIN 14 bp upstream of SmaI site. 1 tgctacccgc gcccgggctt ctggggtgtt ccccaaccac ggcccagccc tgccacaccc 61 cccgcccccg gcctccgcag ctcggcatgg gcgcgggggt gctcgtcctg ggcgcctccg 121 agcccggtaa cctgtcgtcg gccgcaccgc tccccgacgg cgcggccacc gcggcgcggc 181 tgctggtgcc cgcgtcgccg cccgcctcgt tgctgcctcc cgccagcgaa agccccgagc 241 cgctgtctca gcagtggaca gcgggcatgg gtctgctgat ggcgctcatc gtgctgctca 301 tcgtggcggg caatgtgctg gtgatcgtgg ccatcgccaa gacgccgcgg ctgcagacgc 361 tcaccaacct cttcatcatg tccctggcca gcgccgacct ggtcatgggg ctgctggtgg 421 tgccgttcgg ggccaccatc gtggtgtggg gccgctggga gtacggctcc ttcttctgcg 481 agctgtggac ctcagtggac gtgctgtgcg tgacggccag catcgagacc ctgtgtgtca 541 ttgccctgga ccgctacctc gccatcacct cgcccttccg ctaccagagc ctgctgacgc 601 gcgcgcgggc gcggggcctc gtgtgcaccg tgtgggccat ctcggccctg gtgtccttcc 661 tgcccatcct catgcactgg tggcgggcgg agagcgacga ggcgcgccgc tgctacaacg 721 accccaagtg ctgcgacttc gtcaccaacc gggcctacgc catcgcctcg tccgtagtct 781 ccttctacgt gcccctgtgc atcatggcct tcgtgtacct gcgggtgttc cgcgaggccc 841 agaagcaggt gaagaagatc gacagctgcg agcgccgttt cctcggcggc ccagcgcggc 901 cgccctcgcc ctcgccctcg cccgtccccg cgcccgcgcc gccgcccgga cccccgcgcc 961 ccgccgccgc cgccgccacc gccccgctgg ccaacgggcg tgcgggtaag cggcggccct 1021 cgcgcctcgt ggccctacgc gagcagaagg cgctcaagac gctgggcatc atcatgggcg 1081 tcttcacgct ctgctggctg cccttcttcc tggccaacgt ggtgaaggcc ttccaccgcg 1141 agctggtgcc cgaccgcctc ttcgtcttct tcaactggct gggctacgcc aactcggcct 1201 tcaaccccat catctactgc cgcagccccg acttccgcaa ggccttccag ggactgctct 1261 gctgcgcgcg cagggctgcc cgccggcgcc acgcgaccca cggagaccgg ccgcgcgcct 1321 cgggctgtct ggcccggccc ggacccccgc catcgcccgg ggccgcctcg gacgacgacg 1381 acgacgatgt cgtcggggcc acgccgcccg cgcgcctgct ggagccctgg gccggctgca 1441 acggcggggc ggcggcggac agcgactcga gcctggacga gccgtgccgc cccggcttcg 1501 cctcggaatc caaggtgtag ggcccggcgc ggggcgcgga ctccgggcac ggcttcccag 1561 gggaacgagg agatctgtgt ttacttaaga ccgatagcag gtgaactcga agcccacaat 1621 cctcgtctga atcatccgag gcaaagagaa aagccacgga ccgttgcaca aaaaggaaag 1681 tttgggaagg gatgggagag tggcttgctg atgttccttg ttg // LOCUS HUMADRBR 3451 bp mRNA PRI 13-FEB-1996 DEFINITION Human beta-2-adrenergic receptor mRNA, complete cds. ACCESSION M15169 J02728 M16106 NID g178201 KEYWORDS adrenergic receptor. SOURCE Homo sapiens (clone: pTF.) (tissue library: Evan Sadler) placenta cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3451) AUTHORS Kobilka,B.K., Frielle,T., Dohlman,H.G., Bolanowski,M.A., Dixon,R.A., Keller,P., Caron,M.G. and Lefkowitz,R.J. TITLE Delineation of the intronless nature of the genes for the human and hamster beta 2-adrenergic receptor and their putative promoter regions JOURNAL J. Biol. Chem. 262 (15), 7321-7327 (1987) MEDLINE 87222338 REFERENCE 2 (bases 1399 to 1985) AUTHORS Kobilka,B.K., Dixon,R.A., Frielle,T., Dohlman,H.G., Bolanowski,M.A., Sigal,I.S., Yang-Feng,T.L., Francke,U., Caron,M.G. and Lefkowitz,R.J. TITLE cDNA for the human beta 2-adrenergic receptor: a protein with multiple membrane-spanning domains and encoded by a gene whose chromosomal location is shared with that of the receptor for platelet-derived growth factor JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84 (1), 46-50 (1987) MEDLINE 87092393 FEATURES Location/Qualifiers source 1..3451 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pTF." /tissue_type="placenta" /tissue_lib="Evan Sadler" /map="5q31-q32" mRNA 1369..3383 /gene="ADRB2" /note="b-2-adr mRNA (alt.); G00-120-541" gene 1369..3383 /gene="ADRB2" mRNA 1376..3383 /gene="ADRB2" /note="b-2-adr mRNA (alt.); G00-120-541" mRNA 1379..3383 /gene="ADRB2" /note="b-2-adr mRNA (alt.); G00-120-541" mRNA 1388..3383 /gene="ADRB2" /note="b-2-adr mRNA (alt.); G00-120-541" CDS 1487..1546 /gene="ADRB2" /note="putative" /codon_start=1 /db_xref="GDB:G00-120-541" /db_xref="PID:g560761" /translation="MRLPGVRSRPAEPRRGSAR" CDS 1588..2829 /gene="ADRB2" /codon_start=1 /db_xref="GDB:G00-120-541" /product="beta-2 adrenergic receptor" /db_xref="PID:g178202" /translation="MGQPGNGSAFLLAPNRSHAPDHDVTQQRDEVWVVGMGIVMSLIV LAIVFGNVLVITAIAKFERLQTVTNYFITSLACADLVMGLAVVPFGAAHILMKMWTFG NFWCEFWTSIDVLCVTASIETLCVIAVDRYFAITSPFKYQSLLTKNKARVIILMVWIV SGLTSFLPIQMHWYRATHQEAINCYANETCCDFFTNQAYAIASSIVSFYVPLVIMVFV YSRVFQEAKRQLQKIDKSEGRFHVQNLSQVEQDGRTGHGLRRSSKFCLKEHKALKTLG IIMGTFTLCWLPFFIVNIVHVIQDNLIRKEVYILLNWIGYVNSGFNPLIYCRSPDFRI AFQELLCLRRSSLKAYGNGYSSNGNTGEQSGYHVEQEKENKLLCEDLPGTEDFVGHQG TVPSDNIDSQGRNCSTNDSLL" BASE COUNT 790 a 873 c 895 g 893 t ORIGIN 1 cccgggttca agagattctc ctgtctcagc ctcccgagta gctgggacta caggtacgtg 61 ccaccacacc tggctaattt ttgtattttt agtagagaca agagttacac catattggcc 121 aggatctttt gctttctata gcttcaaaat gttcttaatg ttaagacatt cttaatactc 181 tgaaccatat gaatttgcca ttttggtaag tcacagacgc cagatggtgg caatttcaca 241 tggcacaacc cgaaagatta acaaactatc cagcagatga aaggattttt tttagtttca 301 ttgggtttac tgaagaaatt gtttgaattc tcattgcatc tccagttcaa cagataatga 361 gtgagtgatg ccacactctc aagagttaaa aacaaaacaa caaaaaaatt aaaacaaaag 421 cacacaactt tctctctctg tcccaaaata catacttgca tacccccgct ccagataaaa 481 tccaaagggt aaaactgtct tcatgcctgc aaattcctaa ggagggcacc taaagtactt 541 gacagcgagt gtgctgagga aatcggcagc tgttgaagtc acctcctgtg ctcttgccaa 601 atgtttgaaa gggaatacac tgggttaccg ggtgtatgtt gggaggggag cattatcagt 661 gctcgggtga ggcaagttcg gagtacccag atggagacat ccgtgtctgt gtcgctctgg 721 atgcctccaa gccagcgtgt gtttactttc tgtgtgtgtc accatgtctt tgtgcttctg 781 ggtgcttctg tgtttgtttc tggccgcgtt tctgtgttgg acaggggtga ctttgtgccg 841 gatggcttct gtgtgagagc gcgcgcgagt gtgcatgtcg gtgagctggg agggtgtgtc 901 tcagtgtcta tggctgtggt tcggtataag tctgagcatg tctgccaggg tgtatttgtg 961 cctgtatgtg cgtgcctcgg tgggcactct cgtttccttc cgaatgtggg gcagtgccgg 1021 tgtgctgccc tctgccttga gacctcaagc cgcgcaggcg cccagggcag gcaggtagcg 1081 gccacagaag agccaaaagc tcccgggttg gctggtaagg acaccacctc cagctttagc 1141 cctctggggc cagccagggt agccgggaag cagtggtggc ccgccctcca gggagcagtt 1201 gggccccgcc cgggccagcc ccaggagaag gagggcgagg ggaggggagg gaaaggggag 1261 gagtgcctcg ccccttcgcg gctgccggcg tgccattggc cgaaagttcc cgtacgtcac 1321 ggcgagggca gttcccctaa agtcctgtgc acataacggg cagaacgcac tgcgaagcgg 1381 cttcttcaga gcacgggctg gaactggcag gcaccgcgag cccctagcac ccgacaagct 1441 gagtgtgcag gacgagtccc caccacaccc acaccacagc cgctgaatga ggcttccagg 1501 cgtccgctcg cggcccgcag agccccgccg tgggtccgcc cgctgaggcg cccccagcca 1561 gtgcgcttac ctgccagact gcgcgccatg gggcaacccg ggaacggcag cgccttcttg 1621 ctggcaccca atagaagcca tgcgccggac cacgacgtca cgcagcaaag ggacgaggtg 1681 tgggtggtgg gcatgggcat cgtcatgtct ctcatcgtcc tggccatcgt gtttggcaat 1741 gtgctggtca tcacagccat tgccaagttc gagcgtctgc agacggtcac caactacttc 1801 atcacttcac tggcctgtgc tgatctggtc atgggcctgg cagtggtgcc ctttggggcc 1861 gcccatattc ttatgaaaat gtggactttt ggcaacttct ggtgcgagtt ttggacttcc 1921 attgatgtgc tgtgcgtcac ggccagcatt gagaccctgt gcgtgatcgc agtggatcgc 1981 tactttgcca ttacttcacc tttcaagtac cagagcctgc tgaccaagaa taaggcccgg 2041 gtgatcattc tgatggtgtg gattgtgtca ggccttacct ccttcttgcc cattcagatg 2101 cactggtacc gggccaccca ccaggaagcc atcaactgct atgccaatga gacctgctgt 2161 gacttcttca cgaaccaagc ctatgccatt gcctcttcca tcgtgtcctt ctacgttccc 2221 ctggtgatca tggtcttcgt ctactccagg gtctttcagg aggccaaaag gcagctccag 2281 aagattgaca aatctgaggg ccgcttccat gtccagaacc ttagccaggt ggagcaggat 2341 gggcggacgg ggcatggact ccgcagatct tccaagttct gcttgaagga gcacaaagcc 2401 ctcaagacgt taggcatcat catgggcact ttcaccctct gctggctgcc cttcttcatc 2461 gttaacattg tgcatgtgat ccaggataac ctcatccgta aggaagttta catcctccta 2521 aattggatag gctatgtcaa ttctggtttc aatcccctta tctactgccg gagcccagat 2581 ttcaggattg ccttccagga gcttctgtgc ctgcgcaggt cttctttgaa ggcctatggg 2641 aatggctact ccagcaacgg caacacaggg gagcagagtg gatatcacgt ggaacaggag 2701 aaagaaaata aactgctgtg tgaagacctc ccaggcacgg aagactttgt gggccatcaa 2761 ggtactgtgc ctagcgataa cattgattca caagggagga attgtagtac aaatgactca 2821 ctgctgtaaa gcagtttttc tacttttaaa gacccccccc cccccaacag aacactaaac 2881 agactattta acttgagggt aataaactta gaataaaatt gtaaaaattg tatagagata 2941 tgcagaagga agggcatcct tctgcctttt ttattttttt aagctgtaaa aagagagaaa 3001 acttatttga gtgattattt gttatttgta cagttcagtt cctctttgca tggaatttgt 3061 aagtttatgt ctaaagagct ttagtcctag aggacctgag tctgctatat tttcatgact 3121 tttccatgta tctacctcac tattcaagta ttaggggtaa tatattgctg ctggtaattt 3181 gtatctgaag gagattttcc ttcctacacc cttggacttg aggattttga gtatctcgga 3241 cctttcagct gtgaacatgg actcttcccc cactcctctt atttgctcac acggggtatt 3301 ttaggcaggg atttgaggag cagcttcagt tgttttcccg agcaaaggtc taaagtttac 3361 agtaaataaa atgtttgacc atgccttcat tgcacctgtt tgtccaaaac cccttgactg 3421 gagtgctgtt gcctccccca ctggaaaccg c // LOCUS HUMADRBRA 3458 bp DNA PRI 13-FEB-1996 DEFINITION Human beta-2-adrenergic receptor gene, complete cds. ACCESSION J02960 NID g178203 KEYWORDS adrenergic receptor; beta-2 adrenergic receptor. SOURCE Homo sapiens (clone: H-beta-R-[9,10,11].) epidermis DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3458) AUTHORS Emorine,L.J., Marullo,S., Delavier-Klutchko,C., Kaveri,S.V., Durieu-Trautmann,O. and Strosberg,A.D. TITLE Structure of the gene for human beta 2-adrenergic receptor: expression and promoter characterization JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84 (20), 6995-6999 (1987) MEDLINE 88041037 COMMENT Draft entry and computer-readable copy of sequence [1] kindly provided by L.J.Emorine, 25-AUG-1987. FEATURES Location/Qualifiers source 1..3458 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="H-beta-R-[9,10,11]." /cell_line="A431" /tissue_type="epidermis" /map="5q31-q32" CDS 277..1032 /note="ORF; putative" /codon_start=1 /product="unknown protein" /db_xref="PID:g560762" /translation="MFEREYTGLPGVCWEGSIISARVRQVRSTQMETSVSVSLWMPPS QRVFTFCVCHHVFVLLGASVFVSGRVSVLDRGDFVPDGFCVRARASVHVGELGGCVSV SMAVVRYKSEHVCQGVFVPVCACLGGHSRFLPNVGQCRCAALCLETSSRAGAQGRQVA ATEEPKAPGLAGKHTTSSFSPLGPARVAGKQWWPALQGAVGPRPGQPQEKEGEGRGGK GEECLAPSRLPACHWPKVPVRHGEGSSPKVLCT" mRNA 1045..3057 /note="beta-2-adrenergic receptor mRNA (alt.)" mRNA 1055..3057 /note="beta-2-adrenergic receptor mRNA (alt.)" mRNA 1064..3057 /note="beta-2-adrenergic receptor mRNA (alt.)" gene 1264..2505 /gene="ADRB2" CDS 1264..2505 /gene="ADRB2" /codon_start=1 /db_xref="GDB:G00-120-541" /product="beta-2 adrenergic receptor" /db_xref="PID:g178204" /translation="MGQPGNGSAFLLAPNGSHAPDHDVTQQRDEVWVVGMGIVMSLIV LAIVFGNVLVITAIAKFERLQTVTNYFITSLACADLVMGLAVVPFGAAHILMKMWTFG NFWCEFWTSIDVLCVTASIETLCVIAVDRYFAITSPFKYQSLLTKNKARVIILMVWIV SGLTSFLPIQMHWYRATHQEAINCYANETCCDFFTNQAYAIASSIVSFYVPLVIMVFV YSRVFQEAKRQLQKIDKSEGRFHVQNLSQVEQDGRTGHGLRRSSKFCLKEHKALKTLG IIMGTFTLCWLPFFIVNIVHVIQDNLIRKEVYILLNWIGYVNSGFNPLIYCRSPDFRI AFQELLCLRRSSLKAYGNGYSSNGNTGEQSGYHVEQEKENKLLCEDLPGTEDFVGHQG TVPSDNIDSQGRNCSTNDSLL" BASE COUNT 777 a 890 c 886 g 905 t ORIGIN 1 bp upstream of EcoRI site; chromosome 5q31-q32. 1 gaattctcat tgcatctcca gttcaacaga taatgagtga gtgatgccac actctcaaga 61 gttaaaaaca aaacaacaaa aaaattaaaa caaaagcaca caactttctc tctctgtccc 121 aaaatacata cttgcatacc cccgctccag ataaaatcca aagggtaaaa ctgtcttcat 181 gcctgcaaat tcctaaggag ggcacctaaa gtacttgaca gcgagtgtgc tgaggaaatc 241 ggcagctgtt gaagtcacct cctgtgctct tgccaaatgt ttgaaaggga atacactggg 301 ttaccgggtg tatgttggga ggggagcatt atcagtgctc gggtgaggca agttcggagt 361 acccagatgg agacatccgt gtctgtgtcg ctctggatgc ctccaagcca gcgtgtgttt 421 actttctgtg tgtgtcacca tgtctttgtg cttctgggtg cttctgtgtt tgtttctggc 481 cgcgtttctg tgttggacag gggtgacttt gtgccggatg gcttctgtgt gagagcgcgc 541 gcgagtgtgc atgtcggtga gctgggaggg tgtgtctcag tgtctatggc tgtggttcgg 601 tataagtctg agcatgtctg ccagggtgta tttgtgcctg tatgtgcgtg cctcggtggg 661 cactctcgtt tccttccgaa tgtggggcag tgccggtgtg ctgccctctg ccttgagacc 721 tcaagccgcg caggcgccca gggcaggcag gtagcggcca cagaagagcc aaaagctccc 781 gggttggctg gtaagcacac cacctccagc tttagccctc tggggccagc cagggtagcc 841 gggaagcagt ggtggcccgc cctccaggga gcagttgggc cccgcccggg ccagcctcag 901 gagaaggagg gcgaggggag gggagggaaa ggggaggagt gcctcgcccc ttcgcggctg 961 ccggcgtgcc attggccgaa agttcccgta cgtcacggcg agggcagttc ccctaaagtc 1021 ctgtgcacat aacgggcaga acgcactgcg aagcggcttc ttcagagcac gggctggaac 1081 tggcaggcac cgcgagcccc tagcacccga caagctgagt gtgcaggacg agtccccacc 1141 acacccacac cacagccgct gaatgaggct tccaggcgtc cgctcgcggc ccgcagagcc 1201 ccgccgtggg tccgcctgct gaggcgcccc cagccagtgc gcttacctgc cagactgcgc 1261 gccatggggc aacccgggaa cggcagcgcc ttcttgctgg cacccaatgg aagccatgcg 1321 ccggaccacg acgtcacgca gcaaagggac gaggtgtggg tggtgggcat gggcatcgtc 1381 atgtctctca tcgtcctggc catcgtgttt ggcaatgtgc tggtcatcac agccattgcc 1441 aagttcgagc gtctgcagac ggtcaccaac tacttcatca cttcactggc ctgtgctgat 1501 ctggtcatgg gcctagcagt ggtgcccttt ggggccgccc atattcttat gaaaatgtgg 1561 acttttggca acttctggtg cgagttttgg acttccattg atgtgctgtg cgtcacggcc 1621 agcattgaga ccctgtgcgt gatcgcagtg gatcgctact ttgccattac ttcacctttc 1681 aagtaccaga gcctgctgac caagaataag gcccgggtga tcattctgat ggtgtggatt 1741 gtgtcaggcc ttacctcctt cttgcccatt cagatgcact ggtacagggc cacccaccag 1801 gaagccatca actgctatgc caatgagacc tgctgtgact tcttcacgaa ccaagcctat 1861 gccattgcct cttccatcgt gtccttctac gttcccctgg tgatcatggt cttcgtctac 1921 tccagggtct ttcaggaggc caaaaggcag ctccagaaga ttgacaaatc tgagggccgc 1981 ttccatgtcc agaaccttag ccaggtggag caggatgggc ggacggggca tggactccgc 2041 agatcttcca agttctgctt gaaggagcac aaagccctca agacgttagg catcatcatg 2101 ggcactttca ccctctgctg gctgcccttc ttcatcgtta acattgtgca tgtgatccag 2161 gataacctca tccgtaagga agtttacatc ctcctaaatt ggataggcta tgtcaattct 2221 ggtttcaatc cccttatcta ctgccggagc ccagatttca ggattgcctt ccaggagctt 2281 ctgtgcctgc gcaggtcttc tttgaaggcc tatggcaatg gctactccag caacggcaac 2341 acaggggagc agagtggata tcacgtggaa caggagaaag aaaataaact gctgtgtgaa 2401 gacctcccag gcacggaaga ctttgtgggc catcaaggta ctgtgcctag cgataacatt 2461 gattcacaag ggaggaattg tagtacaaat gactcactgc tataaagcag tttttctact 2521 tttaaagacc cccccccgcc caacagaaca ctaaacagac tatttaactt gagggtaata 2581 aacttagaat aaaattgtaa aattgtatag agatatgcag aaggaagggc atccttctgc 2641 cttttttatt tttttaagct gtaaaaagag agaaaactta tttgagtgat tatttgttat 2701 ttgtacagtt cagttcctct ttgcatggaa tttgtaagtt tatgtctaaa gagctttagt 2761 cctagaggac ctgagtctgc tatattttca tgacttttcc atgtatctac ctcactattc 2821 aagtattagg ggtaatatat tgctgctggt aatttgtatc tgaaggagat tttccttcct 2881 acacccttgg acttgaggat tttgagtatc tcggaccttt cagctgtgaa catggactct 2941 tcccccactc ctcttatttg ctcacacggg gtattttagg cagggatttg aggagcagct 3001 tcagttgttt tcccgagcaa agtctaaagt ttacagtaaa taaattgttt gaccatgcct 3061 tcattgcacc tgtttctcca aaaccccttg actggagtgc tgttgcctcc cccactggaa 3121 accgcaggta actacttgta attactgccc atgacttaat gtagaatgat acaagaatga 3181 catgcacaga ttgcttaacc ctttcatttg cctttgagtc tgctgctgca aagctgcatc 3241 tctcctgaca cttgtgcccc aaatcagttc tgcctgctct tagtatagct caactctccc 3301 tatggttatt gttctgtgtt gttacctcag aaacactgac tcacagaagc ggagttaagg 3361 ggatatgttt ttttctctcc acgtgcaccc accacccacc ttccagttct acttgtttca 3421 aaactgttta tatttctgtc ttggccatgt gtttacag // LOCUS HUMADXR 1830 bp mRNA PRI 07-AUG-1995 DEFINITION Human adrenodoxin reductase mRNA, complete cds. ACCESSION J03826 NID g178212 KEYWORDS adrenodoxin reductase; alternative splicing; ferridoxin(adrenodoxin)-NADPH+ oxidoreductase; flavoprotein. SOURCE . ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1830) AUTHORS Solish,S.B., Picado-Leonard,J., Morel,Y., Kuhn,R.W., Mohandas,T.K., Hanukoglu,I. and Miller,W.L. TITLE Human adrenodoxin reductase: two mRNAs encoded by a single gene on chromosome 17cen----q25 are expressed in steroidogenic tissues JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85 (19), 7104-7108 (1988) MEDLINE 89017146 COMMENT Draft entry and sequence for [1] kindly submitted by W.L.Miller, 03-JAN-1989. Adrenodoxin reductase encodes two species of mRNA that apparently arise by alternative processing of the primary transcript and is not tissue specific. FEATURES Location/Qualifiers source 1..1830 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="lambda haAR-1; lambda htAR-1." /tissue_type="adrenal and testis" /map="17cen-q25" exon <1..629 /gene="FDXR" /note="adrenodoxin reductase precursor species 1; G00-119-659" gene 1..1514 /gene="FDXR" CDS 21..1514 /gene="FDXR" /note="adrenodoxin reductase precursor species 2" /codon_start=1 /db_xref="GDB:G00-119-659" /db_xref="PID:g178214" /translation="MASRCWRWWGWSAWPRTRLPPAGSTPSFCHHFSTQEKTPQICVV GSGPAGFYTAQHLLKHPQAHVDIYEKQPVPFGLVRFGVAPDHPEVKNVINTFTQTAHS GRCAFWGNVEVGRDVTVPELQEAYHAVVLSYGAEDHRALEIPGEELPGVCSARAFVGW YNGLPENQELEPDLSCDTAVILGQGNVALDVARILLTPPEHLEALLLCQRTDITKAAL GVLRQSRVKTVWLVGRRGPLQVAFTIKELREMIQLPGARPILDPVDFLGLQDKIKEVP RPRKRLTELLLRTATEKPGPAEAARQASASRAWGLRFFRSPQQVLPSPDGRRAAGVRL AVTRLEGVDEATRAVPTGDMEDLPCGLVLSSIGYKSRPVDPSVPFDSKLGVIPNVEGR VMDVPGLYCSGWVKRGPTGVIATTMTDSFLTGQMLLQDLKAGLLPSGPRPGYAAIQAL LSSRGVRPVSFSDWEKLDAEEVARGQGTGKPREKLVDPQEMLRLLGH" sig_peptide 21..116 /gene="FDXR" /note="adrenodoxin reductase signal peptide species 2" CDS join(21..629,648..1514) /gene="FDXR" /note="adrenodoxin reductase precursor species 1" /codon_start=1 /db_xref="GDB:G00-119-659" /db_xref="PID:g178213" /translation="MASRCWRWWGWSAWPRTRLPPAGSTPSFCHHFSTQEKTPQICVV GSGPAGFYTAQHLLKHPQAHVDIYEKQPVPFGLVRFGVAPDHPEVKNVINTFTQTAHS GRCAFWGNVEVGRDVTVPELQEAYHAVVLSYGAEDHRALEIPGEELPGVCSARAFVGW YNGLPENQELEPDLSCDTAVILGQGNVALDVARILLTPPEHLERTDITKAALGVLRQS RVKTVWLVGRRGPLQVAFTIKELREMIQLPGARPILDPVDFLGLQDKIKEVPRPRKRL TELLLRTATEKPGPAEAARQASASRAWGLRFFRSPQQVLPSPDGRRAAGVRLAVTRLE GVDEATRAVPTGDMEDLPCGLVLSSIGYKSRPVDPSVPFDSKLGVIPNVEGRVMDVPG LYCSGWVKRGPTGVIATTMTDSFLTGQMLLQDLKAGLLPSGPRPGYAAIQALLSSRGV RPVSFSDWEKLDAEEVARGQGTGKPREKLVDPQEMLRLLGH" sig_peptide 21..116 /gene="FDXR" /note="adrenodoxin reductase signal peptide species 1" mat_peptide join(117..629,648..1511) /gene="FDXR" /note="adrenodoxin reductase species 1" mat_peptide 117..1511 /gene="FDXR" /note="adrenodoxin reductase species 2" mat_peptide 117..629 /gene="FDXR" /note="adrenodoxin reductase species 1" variation 388 /gene="FDXR" /note="a in lambda htAR-1; g in lambda haAR-1" /replace="g" exon 648..>1514 /gene="FDXR" /note="adrenodoxin reductase precursor species 1" mat_peptide 648..1511 /gene="FDXR" /note="adrenodoxin reductase species 1" variation 1001 /gene="FDXR" /note="g in lambda htAR-1; a in lambda haAR-1" /replace="a" variation 1828 /note="c in lambda htAR-1; a in lambda haAR-1" /replace="a" BASE COUNT 323 a 556 c 612 g 339 t ORIGIN 139 bp upstream of BglII site. 1 gggggttgct gctcccagcc atggcttcgc gctgctggcg ctggtggggc tggtcggcgt 61 ggcctcggac ccggctgcct cccgccggga gcaccccgag cttctgccac catttctcca 121 cacaggagaa gaccccccag atctgtgtgg tgggcagtgg cccagctggc ttctacacgg 181 cccaacacct gctaaagcac ccccaggccc acgtggacat ctacgagaaa cagcctgtgc 241 cctttggcct ggtgcgcttt ggtgtggcgc ctgatcaccc cgaggtgaag aatgtcatca 301 acacatttac ccagacggcc cattctggcc gctgtgcctt ctggggcaac gtggaggtgg 361 gcagggacgt gacggtgccg gagctgcagg aggcctacca cgctgtggtg ctgagctacg 421 gggcagagga ccatcgggcc ctggaaattc ctggtgagga gctgccaggt gtgtgctccg 481 cccgggcctt cgtgggctgg tacaacgggc ttcctgagaa ccaggagctg gagccagacc 541 tgagctgtga cacagccgtg attctggggc aggggaacgt ggctctggac gtggcccgca 601 tcctactgac cccacctgag cacctggagg ccctcctttt gtgccagaga acggacatca 661 cgaaggcagc cctgggtgta ctgaggcaga gtcgagtgaa gacagtgtgg ctagtgggcc 721 ggcgtggacc cctgcaagtg gccttcacca ttaaggagct tcgggagatg attcagttac 781 cgggagcccg gcccattttg gatcctgtgg atttcttggg tctccaggac aagatcaagg 841 aggtcccccg cccgaggaag cggctgacgg aactgctgct tcgaacggcc acagagaagc 901 cagggccggc ggaagctgcc cgccaggcat cggcctcccg tgcctggggc ctccgctttt 961 tccgaagccc ccagcaggtg ctgccctcac cagatgggcg gcgggcagca ggtgtccgcc 1021 tagcagtcac tagactggag ggtgtcgatg aggccacccg tgcagtgccc acgggagaca 1081 tggaagacct cccttgtggg ctggtgctca gcagcattgg gtataagagc cgccctgtcg 1141 acccaagcgt gccctttgac tccaagcttg gggtcatccc caatgtggag ggccgggtta 1201 tggatgtgcc aggcctctac tgcagcggct gggtgaagag aggacctaca ggtgtcatag 1261 ccacaaccat gactgacagc ttcctcaccg gccagatgct gctgcaggac ctgaaggctg 1321 ggttgctccc ctctggcccc aggcctggct acgcagccat ccaggccctg ctcagcagcc 1381 gaggggtccg gccagtctct ttctcagact gggagaagct ggatgccgag gaggtggccc 1441 ggggccaggg cacggggaag cccagggaga agctggtgga tcctcaggag atgctgcgcc 1501 tcctgggcca ctgagcccag ccccagcccc ggcccccagc agggaaggga tgagtgttgg 1561 gaggggaagg gctgggtccg tctgagtggg actttgcacc tctgctgatc ccggccggcc 1621 ctggcttgga ggcttggctg ctcttccagc gtctctcctc cctcctgggg aaggtcgccc 1681 ttgcgcgcaa ggttttagct ttcagcaact gaggtaacct tagggacagg tggaggtgtg 1741 ggccgatcta accccttacc catctctcta ctgctggact gtggagggtc accaggttgg 1801 gaacatgctg gaaataaaac agctgcaccc // LOCUS HUMAE1 3637 bp mRNA PRI 15-MAR-1990 DEFINITION Human anion exchange protein 1 (AE1, band 3) mRNA, complete cds. ACCESSION M27819 NID g178215 KEYWORDS anion antiporter; anion exchange protein; integral membrane protein; transmembrane protein; transport protein. SOURCE Human fetal liver, cDNA to mRNA, library of B.Forget, clones pHB3-[45 A/B,9,22]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3637) AUTHORS Lux,S.E., John,K.M., Kopito,R.R. and Lodish,H.F. TITLE Cloning and characterization of band 3, the human erythrocyte anion-exchange protein (AE1) JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9089-9093 (1989) MEDLINE 90083213 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.E.Lux, 12-SEP-1989. FEATURES Location/Qualifiers source 1..3637 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 115..2850 /note="anion exchange protein 1" /codon_start=1 /db_xref="PID:g178216" /translation="MEELQDDYEDMMEENLEQEEYEDPDIPESQMEEPAAHDTEATAT DYHTTSHPGTHKVYVELQELVMDEKNQELRWMEAARWVQLEENLGENGAWGRPHLSHL TFWSLLELRRVFTKGTVLLDLQETSLAGVANQLLDRFIFEDQIRPQDREELLRALLLK HSHAGELEALGGVKPAVLTRSGDPSQPLLPQHSSLETQLFCEQGDGGTEGHSPSGILE KIPPDSEATLVLVGRADFLEQPVLGFVRLQEAAELEAVELPVPIRFLFVLLGPEAPHI DYTQLGRAAATLMSERVFRIDAYMAQSRGELLHSLEGFLDCSLVLPPTDAPSEQALLS LVPVQRELLRRRYQSSPAKPDSSFYKGLDLNGGPDDPLQQTGQLFGGLVRDIRRRYPY YLSDITDAFSPQVLAAVIFIYFAALSPAITFGGLLGEKTRNQMGVSELLISTAVQGIL FALLGAQPLLVVGFSGPLLVFEEAFFSFCETNGLEYIVGRVWIGFWLILLVVLVVAFE GSFLVRFISRYTQEIFSFLISLIFIYETFSKLIKIFQDHPLQKTYNYNVLMVPKPQGP LPNTALLSLVLMAGTFFFAMMLRKFKNSSYFPGKLRRVIGDFGVPISILIMVLVDFFI QDTYTQKLSVPDGFKVSNSSARGWVIHPLGLRSEFPIWMMFASALPALLVFILIFLES QITTLIVSKPERKMVKGSGFHLDLLLVVGMGGVAALFGMPWLSATTVRSVTHANALTV MGKASTPGAAAQIQEVKEQRISGLLVAVLVGLSILMEPILSRIPLAVLFGIFLYMGVT SLSGIQLFDRILLLFKPPKYHPDVPYVKRVKTWRMHLFTGIQIICLAVLWVVKSTPAS LALPFVLILTVPLRRVLLPLIFRNVELQCLDADDAKATFDEEEGRDEYDEVAMPV" BASE COUNT 696 a 1101 c 1036 g 804 t ORIGIN 1 cagcggctgc aggacttcac caagggaccc tgaggctcgt gagcagggac ccgcggtgcg 61 ggttatgctg ggggctcaga tcaccgtaga caactggaca ctcaggacca cgccatggag 121 gagctgcagg atgattatga agacatgatg gaggagaatc tggagcagga ggaatatgaa 181 gacccagaca tccccgagtc ccagatggag gagccggcag ctcacgacac cgaggcaaca 241 gccacagact accacaccac atcacacccg ggtacccaca aggtctatgt ggagctgcag 301 gagctggtga tggacgaaaa gaaccaggag ctgagatgga tggaggcggc gcgctgggtg 361 caactggagg agaacctggg ggagaatggg gcctggggcc gcccgcacct ctctcacctc 421 accttctgga gcctcctaga gctgcgtaga gtcttcacca agggtactgt tctcctagac 481 ctgcaagaga cctccctggc tggagtggcc aaccaactgc tagacaggtt tatctttgaa 541 gaccagatcc ggcctcagga ccgagaggag ctgctccggg ccctgctgct taaacacagc 601 cacgctggag agctggaggc cctggggggt gtgaagcctg cagtcctgac acgctctggg 661 gatccttcac agcctctgct cccccaacac tcctcactgg agacacagct cttctgtgag 721 cagggagatg ggggcacaga agggcactca ccatctggaa ttctggaaaa gattcccccg 781 gattcagagg ccacgttggt gctagtgggc cgcgccgact tcctggagca gccggtgctg 841 ggcttcgtga ggctgcagga ggcagcggag ctggaggcgg tggagctgcc ggtgcctata 901 cgcttcctct ttgtgttgct gggacctgag gccccccaca tcgattacac ccagcttggc 961 cgggctgctg ccaccctcat gtcagagagg gtgttccgca tagatgccta catggctcag 1021 agccgagggg agctgctgca ctccctagag ggcttcctgg actgcagcct agtgctgcct 1081 cccaccgatg ccccctccga gcaggcactg ctcagtctgg tgcctgtgca gagggagcta 1141 cttcgaaggc gctatcagtc cagccctgcc aagccagact ccagcttcta caagggccta 1201 gacttaaatg ggggcccaga tgaccctctg cagcagacag gccagctctt cgggggcctg 1261 gtgcgtgata tccggcgccg ctacccctat tacctgagtg acatcacaga tgcattcagc 1321 ccccaggtcc tggctgccgt catcttcatc tactttgctg cactgtcacc cgccatcacc 1381 ttcggcggcc tcctgggaga aaagacccgg aaccagatgg gagtgtcgga gctgctgatc 1441 tccactgcag tgcagggcat tctcttcgcc ctgctggggg ctcagcccct gcttgtggtc 1501 ggcttctcag gacccctgct ggtgtttgag gaagccttct tctcgttctg cgagaccaac 1561 ggtctagagt acatcgtggg ccgcgtgtgg atcggcttct ggctcatcct gctggtggtg 1621 ttggtggtgg ccttcgaggg tagcttcctg gtccgcttca tctcccgcta tacccaggag 1681 atcttctcct tcctcatttc cctcatcttc atctatgaga ctttctccaa gctgatcaag 1741 atcttccagg accacccact acagaagact tataactaca acgtgttgat ggtgcccaaa 1801 cctcagggcc ccctgcccaa cacagccctc ctctcccttg tgctcatggc cggtaccttc 1861 ttctttgcca tgatgctgcg caagttcaag aacagctcct atttccctgg caagctgcgt 1921 cgggtcatcg gggacttcgg ggtccccatc tccatcctga tcatggtcct ggtggatttc 1981 ttcattcagg atacctacac ccagaaactc tcggtgcctg atggcttcaa ggtgtccaac 2041 tcctcagccc ggggctgggt catccaccca ctgggcttgc gttccgagtt tcccatctgg 2101 atgatgtttg cctccgccct gcctgctctg ctggtcttca tcctcatatt cctggagtct 2161 cagatcacca cgctgattgt cagcaaacct gagcgcaaga tggtcaaggg ctccggcttc 2221 cacctggacc tgctgctggt agtaggcatg ggtggggtgg ccgccctctt tgggatgccc 2281 tggctcagtg ccaccaccgt gcgttccgtc acccatgcca acgccctcac tgtcatgggc 2341 aaagccagca ccccaggggc tgcagcccag atccaggagg tcaaagagca gcggatcagt 2401 ggactcctgg tcgctgtgct tgtgggcctg tccatcctca tggagcccat cctgtcccgc 2461 atccccctgg ctgtactgtt tggcatcttc ctctacatgg gggtcacgtc gctcagcggc 2521 atccagctct ttgaccgcat cttgcttctg ttcaagccac ccaagtatca cccagatgtg 2581 ccctacgtca agcgggtgaa gacctggcgc atgcacttat tcacgggcat ccagatcatc 2641 tgcctggcag tgctgtgggt ggtgaagtcc acgccggcct ccctggccct gcccttcgtc 2701 ctcatcctca ctgtgccgct gcggcgcgtc ctgctgccgc tcatcttcag gaacgtggag 2761 cttcagtgtc tggatgctga tgatgccaag gcaacctttg atgaggagga aggtcgggat 2821 gaatacgacg aagtggccat gcctgtgtga ggggcgggcc caggccctag accctccccc 2881 accattccac atccccacct tccaaggaaa agcagaagtt catgggcacc tcatggactc 2941 aggatcctcc tggagcagca gctgaggccc cagggctgtg ggtggggaag gaaggcgtgt 3001 ccaggagacc ttccacaaag ggtagcctgg cttttctggc tggggatggc cgatggggcc 3061 cacattaggg ggtttgttgc acagtccctc ctgttgccac actttcactg gggatcccgt 3121 gctggaagac ttagatctga gccctccctc ttcccagcac aggcaggggt agaagcaaag 3181 gcaggaggtg ggtgagcggg tggggtgctt gctgtgtgac cttgggtaag tcccttgacc 3241 tttccaggcc tatatttcct cttctgtaaa atgggtatat tgatgataat acccacatta 3301 caggatggtt actgaggacc aaagatacat gtaaaatagg gctttgtaaa ctccacaggg 3361 actgttctat agcagtcatc atttgtcttt gaacgtaccc aaggtcacat agctgggatt 3421 tgaactgagc cgtgcagctg ggatttgaac caggccttct gatttcaagg tccgagctct 3481 gtcctctgtc agtcatgcgt ccactttccc ttcccctgtg actcctccct tccccactct 3541 gctcccagcc cctaccttga gaccctcttc tctgggccca gagagaggcg tcctgggtga 3601 aggaaggtac aggcaggatg atccagggat tgggctg // LOCUS HUMAGCGB 2893 bp mRNA PRI 30-MAR-1993 DEFINITION Human chromosome 3p21.1 gene sequence, complete cds. ACCESSION L13434 NID g291843 KEYWORDS . SOURCE Homo sapiens adenogastric carcinoma, gastric mucosa cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2893) AUTHORS Shridhar,V., Kamat,A.K., Golembieski,W., Smith,S.E., Siegfried,J.M., Hunt,J.D., Miller,O.J., Wozniak,A. and Smith,D.I. TITLE Identificaiton of new genes from human chromosomal band 3P21.1 and their levels of expression in lung cancer cell lines JOURNAL Unpublished (1993) FEATURES Location/Qualifiers source 1..2893 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="AGS" /tissue_type="adenogastric carcinoma, gastric mucosa" CDS 1627..2397 /codon_start=1 /db_xref="PID:g291844" /translation="METVVCPRPWEERRKRRSLSSDRGRTTHSPYEERRSRTKGSGQQ SERGSDRTPERSRKENHSSEGTKESSSNSLSNSRHGAEERGHHHHHHEAADSSHGKKA RDSERNHRTTEAEPKPLEEPKHETKKLKNLSEYAQTLQLGWNGLLVLKNSCFPTSMHI LEGDQGVISSLLKDHTSGSKLTQLKIAQRLRLDQPKLDEDTRRIKQGSPNGYAVLLAT QATPSGLGTEGMPTVEPGLQRRLLRNLVSYLKQKQAAG" BASE COUNT 589 a 907 c 885 g 512 t ORIGIN 1 caatggtcgc ccggcgtgtt tggggcgcct cgcgcgtgcc tccggcctcc gcccgcccgc 61 cgcccgccgc gtgttccgga gacgccgcgg ttcgtgggcc aaggtcgccc gggcgctgtt 121 tgcgggccgc cgcggccgcg gatgagccgg gacacgcggg ctcggcccca gcggtggcgg 181 aggcggcggc agaggagacg cgcgggggcg acatgccgga ggcgcccggg acccccgagg 241 ccttgccgcc gcgacccccg ccgccagcgc cggagcccga gggagcgccc gctactcgcg 301 cggtgattcg cgtccccggc gccgcggccg tcgcgatgcc cgccgagcac ctcgaggacc 361 ggctcttcca ccagttccaa gcgcttcggc gagatcagcc tccgcctgtc gcacacgcct 421 gagctgggcc gtgtggctac gtgaatttcc ggcacccaca ggacgcacgc gaggcccgcc 481 agcacgccct ggcccggcag ctgctgctct acgaccgccc gctcaaggta gagcccgtgt 541 acctgcgtgg cggcggcggg agcagtcggc gaagtagcag cagcagcgcc gccgcttcca 601 cgcctccccc agggccgccc gcgcccgccg tcccgctcgg ctacctcccg ctacacggag 661 gctaccagta caagcagcgc tcgctgtccc ccgtcgctgc cccgcccctg cgggagcccc 721 gtgcccgtca cgccgccgca gccttcgcct ggatgccgct gctgccgccg ccgtgggact 781 gtccgggagc gggccctgga ctactacggg ctgtacgacg acggtgggcg cccctatggc 841 tacccagctg tgtgtgagga ggacctgatg cccgaggatg accagcgggc cacgcgcaac 901 ctcttcattg gtaacctgga ccacagcgta tctgaggtgg agctgcgaag ggccttcgag 961 aaatatggca tcatcgagga ggtggtcatc aagaggcctg cccgtggcca gggcggtgcc 1021 tatgccttcc tcaagttcca gaacctggac atggcccata gggctaaggt ggccatgtcg 1081 ggccgagtga ttggtcgcaa ccccattaag ataggctatg gcaaggccaa ccccaccact 1141 cgtctctggg tgggtggcct gggacctaac acgtcactgg cggctctggc ccgaggattt 1201 gaccgctttg ggagcattcg gaccattgat cacgtcaaag gagatagctt tgcccatatt 1261 cagtacgaga gcttggacgc tgcccaggcc gcctgtgcta aaatgagggg ttttcccttg 1321 gtggtccaga ccgcagctcc gcgtggattt tgccaaagca gaggagactc ggtacccccc 1381 agcagtacca gccctcgcca ctccctgtgc attatgaggc tgctcacaga tggatacacc 1441 ggcaccgcaa cctggacgcc gacctggtgc ggacaggacg cccccacacc ttctgtactc 1501 agaccgagac cggacttttt tggaagggga ctggaccagc cccagtaaaa gctctgaccg 1561 gccgaaacag ccttgagggc tacagtcgct cagtgcgcag ccggagtggt gagcgttggg 1621 gggcagatgg agaccgtggt ttgcccaagg ccctgggaag agaggcggaa acggagaagc 1681 ctttccagtg accgtgggag gacaacccat tcaccatatg aggaacggag gagtaggacc 1741 aagggcagtg ggcagcagtc agagcggggc tccgaccgca cccctgagcg cagccgcaag 1801 gagaaccact ccagtgaagg gaccaaggag tccagcagca actccctcag caacagcaga 1861 catggggctg aggaacgggg ccaccaccac caccaccacg aggctgcaga ctcttcccac 1921 gggaagaagg caagagacag cgagcgcaat caccggacca cagaggccga gcccaagcct 1981 ctggaagagc caaaacacga gaccaaaaag ctgaagaatc tttcagagta cgctcagaca 2041 ctacagctgg gttggaatgg gcttctggtg ttgaaaaaca gctgcttccc cacgtctatg 2101 catatcctag agggggacca gggggtgatc agcagtctcc tcaaagacca cacttctggg 2161 agcaagctga cccagctgaa gatcgcccag cgccttcgac tggaccagcc caagcttgac 2221 gaggacacac gacgcatcaa gcaggggagc cccaacggct atgcggtcct cttagccacc 2281 caggcaaccc ccagtgggct tggcactgag gggatgccca cagtagagcc cggtctgcag 2341 aggcggcttc tcaggaacct ggtctcctac ttgaaacaga agcaggccgc agggtgatca 2401 gcttgccagt gggggggtcc aagggcagag acggcacaag gcatgctcta cgccttccca 2461 ccctgcgact tttcccagca gtacctccag tctgcactaa ggacattggg caagctagaa 2521 gaagaacaca tggtgatagt catcgtcaga gacactgcct agcccaagcc tgtctttccc 2581 agcgtcatgt ttgtgtcaca aaagcagtta ttttaaaatc tgatcccctc tctaccctac 2641 cactttggtt tgaattatct cctgggttat tttggttcat ttgggtgggg atcaaagtcc 2701 tgtccaccac caaaactaag ttcttagatt ttgggggatt ttttttttta aacgatgaga 2761 agggaatccg gttatgttga tttctagtgt acaagatact gtctgctgtg gttctgtatt 2821 tttttatttt ttgaccaact gtatggaaag ttgtcagtaa aacctttgac gagagatgga 2881 tttttaaacc tga // LOCUS HUMAGG 4668 bp DNA PRI 30-OCT-1994 DEFINITION Human angiogenin gene, complete cds, and three Alu repetitive sequences. ACCESSION M11567 NID g178249 KEYWORDS Alu repeat; angiogenin; repeat region. SOURCE Human DNA, library of Maniatis et al., clone lambda-HAG1, and liver, cDNA to mRNA, clone pHAG1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4668) AUTHORS Kurachi,K., Davie,E.W., Strydom,D.J., Riordan,J.F. and Vallee,B.L. TITLE Sequence of the cDNA and gene for angiogenin, a human angiogenesis factor JOURNAL Biochemistry 24 (20), 5494-5499 (1985) MEDLINE 86077688 COMMENT Draft entry and sequence in computer readable form for [1] kindly provided by K.Kurachi, 29-MAY-1986. There is only one gene encoding angiogenin in the human genome. The signal peptide could start at position 1815 instead of 1809. FEATURES Location/Qualifiers source 1..4668 /organism="Homo sapiens" /db_xref="taxon:9606" /map="14q11" repeat_region 1082..1398 /note="Alu repeat copy A" mRNA 1697..2425 /note="angiogenin mRNA" CDS 1809..2252 /gene="ANG" /codon_start=1 /db_xref="GDB:G00-119-679" /product="angiogenin" /db_xref="PID:g178250" /translation="MVMGLGVLLLVFVLGLGLTPPTLAQDNSRYTHFLTQHYDAKPQG RDDRYCESIMRRRGLTSPCKDINTFIHGNKRSIKAICENKNGNPHRENLRISKSSFQV TTCKLHGGSPWPPCQYRATAGFRNVVVACENGLPVHLDQSIFRRP" sig_peptide 1809..1880 /gene="ANG" /note="angiogenin signal peptide; G00-119-679" gene 1809..2252 /gene="ANG" mat_peptide 1881..2249 /gene="ANG" /note="angiogenin; G00-119-679" repeat_region 2524..2855 /note="Alu repeat copy B" repeat_region 3351..3663 /note="Alu repeat copy C" BASE COUNT 1247 a 1091 c 982 g 1348 t ORIGIN 132 bp upstream of XbaI site. 1 tgtttgcatt aagttcatag attataattt gtaatggaat caacaccaaa tgcaaattag 61 aaagagagcc cactttgctc acccagtcac gtcttcccat gtaaccatag aacgttgggg 121 tcctgtgtct ttctagatcc acagtcttgc tctcagaaca ggctagccac accacaggcc 181 tagtgccagg acccatggcc tttttttaag ctcagactcc cttctgtgaa cagcaatatc 241 cccacaactt gtacaacatt ggtgcttcct gcaagggcta cagaactatt tgatacgaaa 301 atgttcattg acttacacac aagagaagca caaaataaaa aattaataat taatttaatg 361 tctttgaaaa tgtaccattt atttttacat ttggggtcat aagaattgta ttacacttaa 421 gaatgcaata caatttgaag atcagatttt tctccctttg tgagaatttc tcagtatgtg 481 tgatgactac caagaaatca tagccagtca taaattcagt gagttactca taaacgaaca 541 agaaccacct acttcttggg gaggtaggtc tgcttccctt caactcagga tacaactgct 601 ttcaactgct ttcttcacat tagctgacta attagctaga agcctgtcgt aaacaatttt 661 atggttgact ccttccctgg gctcagggtt ccctagaaca gagaggtccc caaatcccgg 721 tctgtggcct gtccgcctaa gctctgcctc ctgccagatc agcaggcagc attagattct 781 cataggagct ggacgcctat tgtgaactgc gcatgtgcgg gatccagatt gtgcactctt 841 tatgagaatc taactaatgc ttgatgatct atctgaacca gaacaatttc atcctgaaac 901 catcccccac caatccatag aaatactgtc ttccacaaaa atgatccctg gtgccaaaaa 961 tgttagagac cactccccta aaactctctt cttagctctc acctcctgta ttactatctc 1021 atctcagtac attgaagccc ccatcttttc cccatggatg cctcatttcc tattagggag 1081 gcattttttt attttttgtt tttatttttt tccgagacgg agtctcgctc tgtcgccaag 1141 gctggagtgc agtggcgcga tctcggctca ctgcaagctc cgcctcccgg gttcacgcca 1201 ttctcctgcc tcagcctccc aagtagctgg gactacaggc gcccgcacta cgcccggcta 1261 attttttgta tttttagtag agacggggtt tcaccgtggt agccaggatg gtctcgatct 1321 cctgacctcg tgatccgccc gccttggcct cccaaagtgc tgggattaca ggcgtgagac 1381 cgcgcccggc cgtcatttgg tatgtcttaa tgtgcctcag gacctagcac agtccctggt 1441 acccagtaga gacctatgta atgttcgtta ttcaataata aatacatgaa ttaaagagtg 1501 agagtggatt ttgtaatgtt acgactgata gagaaatact cagtgattct aagggatggg 1561 gaagaacggt tggagctaga ggttgtgctc aggaaactat taaatagacg ttccgcagga 1621 agggattgac gaagtgtgag gttaatgagg aagggaaaat agaatataaa atttggtggt 1681 ggaaaagatc tgattcatga tgccgtgtca gagagcaaag ctcctgtcct tttggcctaa 1741 tttggtgatg ctgttcttgg gtctaccaca cctccttttg ccctccgcag gagcctgtgt 1801 tggaagagat ggtgatgggc ctgggcgttt tgttgttggt cttcgtgctg ggtctgggtc 1861 tgaccccacc gaccctggct caggataact ccaggtacac acacttcctg acccagcact 1921 atgatgccaa accacagggc cgggatgaca gatactgtga aagcatcatg aggagacggg 1981 gcctgacctc accctgcaaa gacatcaaca catttattca tggcaacaag cgcagcatca 2041 aggccatctg tgaaaacaag aatggaaacc ctcacagaga aaacctaaga ataagcaagt 2101 cttctttcca ggtcaccact tgcaagctac atggaggttc cccctggcct ccatgccagt 2161 accgagccac agcggggttc agaaacgttg ttgttgcttg tgaaaatggc ttacctgtcc 2221 acttggatca gtcaattttc cgtcgtccgt aaccagcggg cccctggtca agtgctggct 2281 ctgctgtcct tgccttccat ttcccctctg cacccagaac agtggtggca acattcattg 2341 ccaagggccc aaagaaagag ctacctggac cttttgtttt ctgtttgaca acatgtttaa 2401 taaataaaaa tgtcttgata tcagtaagaa tcagagtctt ctcactgatt ctgggcatat 2461 tgatctttcc cccattttct ctacttggct gctccctgag aggactgcat aggatagaaa 2521 tgcctttttc ttttcttttc gttttttttt tttttttttt ttgagatgga gtctcactct 2581 gtcgcccagg cttaagtgca atggcacaat ctcggctcac tgcaacctct ctctcctggg 2641 ttcaagtgat tctcctgcct cagcctccca aatagctgag attacaggca tgcaccacca 2701 cacctggcta atttttgtgt ttttagtaga gacagggttt caccgttttg gccaggttgg 2761 tcttgaactc ctgacctcgg gagatccgcc caccttggcc tctctttgtg ctgggattac 2821 aggcatgagc cactgagccg ggccactttt tccttatcag tcagttttta caagtcatta 2881 gggaggtaga ctttacctct ctgtgaagga aagtatggta tgttgatcta cagagagaga 2941 tggaaaaatt ccagggctcg tagctactaa gcagaatttc caagataggc aaattgtttt 3001 ttctgtcaaa taataagcta atattacttc tacaaatatg agaccttgga gagaagtttc 3061 caaggaccaa gtaccaacat accaacagat tattatagtt tctctcactc ttacacacac 3121 acacacacat atacacatat gtaatccagc atgaatacca aaattcattc agggtagcca 3181 ccttttgtct taatcgagag ataattttga tgtttgaatg gaatgctccc aggatattct 3241 cttgtcatgg ttattttata taaaattcaa aaaccaatta cattatttcc tctgtaatct 3301 tttactttat caactaatgt ctggcaagtg tgatgttttg gggaagttat agaagattcc 3361 ggccaggcgc ttatctcacg cttgtaatcc agcactttgg gaagctgagg cggacagatc 3421 acgaggtcaa gagatcaaga ccatcctgga caacatggtg aaaccttgtc tctactaaaa 3481 atgtgaaaat tagctgggcg tggtggcaca cacctatagt cccagctact cgggaggctg 3541 aggcaggaga atcgcttgaa cctaggaggc ggaggttgca ctgagccgag atcacgccac 3601 tgcactccag cctgggcgac agagcgagac tccatctcaa aaaaaaaaaa aaaagaaaga 3661 tcccagttta tcccagttta tcccttattc ttcctcaatt ctcaagattt gtttttaagt 3721 taacataact taggttaaca cactctttgt aaaatacact gttcaatcta cagactcagt 3781 ggttagcttc ctgttaacta atttctgttg acaggtactt ggatatttta tttagaaagt 3841 ggttgccaat aaattagtta taagtcgcca gtttcactgc cttgtgaaca cataattatt 3901 gtggtctcag tattccctat ggtggcttct cctgctcctg gtattgccct gaaatgggcc 3961 aaaagccgtg gctccccaat gctcaggtta tagaacattg tccaggtacc acctaggaga 4021 gcccagcctc actgaaagta ttcaaattta ggaatgggtt tgagaagtag gtagctggta 4081 tgtgcttagc acaagaatct ctcttccttg ggttagtctg tttcaaaact gaaaacactg 4141 tcattcctta agaaaatagg aaaaagtatt ccaaacctct gtcactagaa aatttgccat 4201 attaccaaat ctcaaaaacc tctcaggaaa tgagaaagtc ccagtttctg gtaaactatt 4261 tgggcccttt tctcaagttc tccttccagt gctatttcct tgaggtgagg caaagttact 4321 caagatcatc gctgccactc aaggccttga tagggcaagt gaaaggcatg gaccattatt 4381 atattgatca cagcataagc tgtgaaaacc cacatcttct ccaaacatct gcttggagca 4441 ttatcatcgc atagtttgct ctggtgttca gggaaatcgc tgtttcatag gaaatcacat 4501 ggcagtggga tgggagtgtt tcctgacctg ccgatggtac tggcacctga gcaagcattc 4561 ctagtccttt ttggtctggg cctcttgttc tatcacaacc acaagctgtt taaaataaaa 4621 acgtcaagtc acaggcaggt cattttatcc tgcgtgaatc aattgaag // LOCUS HUMAGT 2512 bp mRNA PRI 30-NOV-1993 DEFINITION Human beta-1,4 N-acetylgalactosaminyltransferase mRNA, complete cds. ACCESSION M83651 NID g431032 KEYWORDS beta-1,4 N-acetylgalactosaminyltransferase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2512) AUTHORS Nagata,Y., Yamashiro,S., Yodoi,J., Lloyd,K.O., Shiku,H. and Furukawa,K. TITLE Expression cloning of beta-1,4 N-acetylgalactosaminyltransferase cDNAs that determine the expression of the GM2 and GD2 gangliosides JOURNAL J. Biol. Chem. 267, 12082-12089 (1992) MEDLINE 92291088 FEATURES Location/Qualifiers source 1..2512 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="YT" CDS 61..1662 /codon_start=1 /product="beta-1,4 N-acetylgalactosaminyltransferase" /db_xref="PID:g431033" /translation="MWLGRRALCALVLLLACASLGLLYASTRDAPGLRLPLAPWAPPQ SPRRPELPDLAPEPRYAHIPVRIKEQVVGLLAWNNCSCESSGGGLPLPFQKQVRAIDL TKAFDPAELRAASATREQEFQAFLSRSQSPADQLLIAPANSPLQYPLQGVEVQPLRSI LVPGLSLQAASGQEVYQVNLTASLGTWDVAGEVTGVTLTGEGQADLTLVSPGLDQLNR QLQLVTYSSRSYQTNTADTVRFSTEGHEAAFTIRIRHPPNPRLYPPGSLPQGAQYNIS ALVTIATKTFLRYDRLRALITSIRRFYPTVTVVIADDSDKPERVSGPYVEHYLMPFGK GWFAGRNLAVSQVTTKYVLWVDDDFVFTARTRLERLVDVLERTPLDLVGGAVREISGF ATTYRQLLSVEPGAPGLGNCLRQRRGFHHELVGFPGCVVTDGVVNFFLARTDKVREVG FDPRLSRVAHLEFFLDGLGSLRVGSCSDVVVDHASKLKLPWTSRDAGAETYARYRYPG SLDESQMAKHRLLFFKHRLQCMTSQ" BASE COUNT 491 a 802 c 705 g 514 t ORIGIN 1 ggcgagagcc cggcacagcc cggaccgaaa ttttgccgct gccttagagc gttagacagg 61 atgtggctgg gccgccgggc cctgtgcgct ctggtccttc tgctcgcctg cgcctcgctg 121 gggctcctgt acgcgagcac ccgggacgcg cccggcctcc ggctacctct tgcgccgtgg 181 gcgcccccgc aaagcccccg caggcccgag ctgccagatc ttgctcctga gccccgctac 241 gcacacatcc cggtcaggat caaggagcaa gtagtggggc tgctggcttg gaacaactgc 301 agttgtgagt ccagtggggg gggcctcccc ctccccttcc agaaacaagt ccgagctatt 361 gacctcacca aggcctttga ccctgcagag ctgagggctg cctctgccac aagagagcag 421 gagttccagg cctttctgtc gaggagccag tccccagctg accagctgct catagcccct 481 gccaactccc cgctccagta ccccctacag ggtgtggaag ttcagcccct caggagcatc 541 ttggtgccag ggctgagcct tcaggcagct tctggtcagg aggtatacca ggtgaacctg 601 actgcctccc taggcacctg ggacgtggca ggggaagtga ctggagttac tctcactgga 661 gagggtcagg cagatctcac ccttgtcagc ccagggctgg accaactcaa caggcaacta 721 caactggtca cttacagcag ccgaagctac cagaccaaca cagcagacac agtccggttc 781 tccaccgagg gacatgaggc tgctttcact atccgcataa gacacccgcc caaccctcgg 841 ctgtacccac ctgggtctct accccaggga gcccagtaca acatcagcgc tctagtcacg 901 attgccacca agaccttcct ccgttatgat cggctacggg ctctcatcac cagtatccgc 961 cgcttctacc caacggttac cgtggtcatc gctgacgaca gcgacaagcc agagcgcgtt 1021 agtggcccct acgtggaaca ctatctcatg cccttcggca agggctggtt cgcaggccgg 1081 aacctggccg tgtctcaagt aaccaccaag tacgtgctgt gggtggacga cgacttcgtc 1141 ttcacggcgc ggacgcggct ggagaggctt gtggacgtgc tggagcggac gccgctggac 1201 ctggtggggg gcgcggtgcg cgagatctcc ggctttgcca ccacttatcg gcagctgctg 1261 agcgtggagc ccggcgcccc aggcctcggg aactgcctcc ggcaaaggcg cggcttccac 1321 cacgagctcg tcggcttccc aggctgcgtg gtcaccgacg gcgtggttaa cttcttcctg 1381 gcgcggactg acaaggtgcg cgaggtcggt ttcgaccccc gcctcagccg cgtggctcat 1441 ctggaattct tcttggatgg gcttggttcc cttcgggttg gctcctgctc cgacgtcgtg 1501 gtggatcatg catccaaact gaagctgcct tggacatcaa gggatgccgg agcagagact 1561 tacgcccggt accgttaccc aggatcactg gacgagagcc agatggccaa acaccggctg 1621 ctcttcttca aacaccggct gcagtgcatg acctcccagt gatggcccgc tggggatttc 1681 tgactgtcag gctgggcctg cctccttgtc cctgccagga atttccaaca aaccccacca 1741 ccctgtgagc actctactgg ctgtccctga gcctctagtt cctcactctt ccttttcaga 1801 acctgatgcc cagtaggggt tgtcctggtg acacccctcc tttttccagt gcccagaggc 1861 ctggtggagc cataacctct cccacagcca gtgccaagtc ctccccctgc ccattctcat 1921 ggggcaggaa atggggggat cactttccaa gtgccaaaga gcccagaggg actctaagaa 1981 cctaaggtgg aaacactgtc ctctcatctt gggaccgagg gggtggggaa gttccccaac 2041 acataatccc aagactgtgc ccctcatctg catcttcaga tccagtactc tgtgtacctg 2101 ctccagcccc acccccacag agagaacttg tggctctggg gctggggtga gggctggtgg 2161 ttggtgaaag ccattcttag ttgtgtctct gcaatgctgt gggcacaaaa gaaggggcac 2221 cagagtccct gtgcaaacac ctagactcac ttcatggatt ccaaagctct cagcttcatt 2281 ttattagtta cgttaggtaa gggggttcaa gggtcatggt cctcatcaca cacatgtcat 2341 cagggccctc ctgcactcca catgatgagg tcagacccac acggtgcaaa tctttgggtc 2401 agtgagctcc tggagaagag aggagacatg tcaggaatag attaggcacc cctcttcctt 2461 aatgaaatgt ggcagtcctc tcaggggtac cccacctact tagggatctg ag // LOCUS HUMAHCY 2211 bp mRNA PRI 30-OCT-1994 DEFINITION Human S-adenosylhomocysteine hydrolase (AHCY) mRNA, complete cds. ACCESSION M61831 NID g178276 KEYWORDS S-adenosylhomocysteine hydrolase. SOURCE Homo sapiens RNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2211) AUTHORS Coulter-Karis,D.E. and Hershfield,M.S. TITLE Sequence of full length cDNA for human S-adenosylhomocysteine hydrolase JOURNAL Ann. Hum. Genet. 53 (Pt 2), 169-175 (1989) MEDLINE 90087640 COMMENT From EMBL entry HSAHCY; dated 29-MAR-1991. FEATURES Location/Qualifiers source 1..2211 /organism="Homo sapiens" /db_xref="taxon:9606" /map="20cen-q13.1" exon <48..901 /gene="AHCY" /note="G00-118-983" /product="S-adenosylhomocysteine hydrolase" CDS join(48..901,1003..1447) /gene="AHCY" /EC_number="3.3.1.1" /codon_start=1 /db_xref="GDB:G00-118-983" /product="S-adenosylhomocysteine hydrolase" /db_xref="PID:g178277" /translation="MSDKLPYKVADIGLAAWGRKALDIAENEMPGLMRMRERYSASKP LKGARIAGCLHMTVETAVLIETLVTLGAEVQWSSCNIFSTQNHAAAAIAKAGIPVYAW KGETDEEYLWCIEQTLYFKDGPLNMILDDGGDLTNLIHTKYPQLLPGIRGISEETTTG VHNLYKMMANGILKVPAINVNDSVTKSKFDNLYGCRESLIDGIKRATDVMIAGKVAVV AGYGDVGKGCAQALRGFGARVIITEIDPINALQAAMEGYEVTTMDEACQEGNIFVTTT GCIDIILGRHFEQMKDDAIVCNIGHFDVEIDVKWLNENAVEKVNIKPQVDRYRLKNGR RIILLAEGRLVNLGCAMGHPSFVMSNSFTNQVMAQIELWTHPDKYPVGVHFLPKKLDE AVAEAHLGKLNVKLTKLTEKQAQYLGMSCDGPFKPDHYRY" gene join(48..901,1003..1447) /gene="AHCY" exon 1003..>1447 /gene="AHCY" /note="G00-118-983" /product="S-adenosylhomocysteine hydrolase" BASE COUNT 501 a 591 c 638 g 481 t ORIGIN 1 ctgaggccca gcccccttcg cccgtttcca tcacgagtgc cgccagcatg tctgacaaac 61 tgccctacaa agtcgccgac atcggcctgg ctgcctgggg acgcaaggcc ctggacattg 121 ctgagaacga gatgccgggc ctgatgcgta tgcgggagcg gtactcggcc tccaagccac 181 tgaagggcgc ccgcatcgct ggctgcctgc acatgaccgt ggagacggcc gtcctcattg 241 agaccctcgt caccctgggt gctgaggtgc agtggtccag ctgcaacatc ttctccaccc 301 agaaccatgc ggcggctgcc attgccaagg ctggcattcc ggtgtatgcc tggaagggcg 361 aaacggacga ggagtacctg tggtgcattg agcagaccct gtacttcaag gacgggcccc 421 tcaacatgat tctggacgac gggggcgacc tcaccaacct catccacacc aagtacccgc 481 agcttctgcc aggcatccga ggcatctctg aggagaccac gactggggtc cacaacctct 541 acaagatgat ggccaatggg atcctcaagg tgcctgccat caatgtcaat gactccgtca 601 ccaagagcaa gtttgacaac ctctatggct gccgggagtc cctcatagat ggcatcaagc 661 gggccacaga tgtgatgatt gccggcaagg tagcggtggt agcaggctat ggtgatgtgg 721 gcaagggctg tgcccaggcc ctgcggggtt tcggagcccg cgtcatcatc accgagattg 781 accccatcaa cgcactgcag gctgccatgg agggctatga ggtgaccacc atggatgagg 841 cctgtcagga gggcaacatc tttgtcacca ccacaggctg tattgacatc atccttggcc 901 ggtaggtgcc agatgggggg tcccggggag tgagggagga gggcagagtt gggacagctt 961 tctgtccctg acaatctccc acggtcttgg gctgcctgac aggcactttg agcagatgaa 1021 ggatgatgcc attgtgtgta acattggaca ctttgacgtg gagatcgatg tcaagtggct 1081 caacgagaac gccgtggaga aggtgaacat caagccgcag gtggaccggt atcggttgaa 1141 gaatgggcgc cgcatcatcc tgctggccga gggtcggctg gtcaacctgg gttgtgccat 1201 gggccacccc agcttcgtga tgagtaactc cttcaccaac caggtgatgg cgcagatcga 1261 gctgtggacc catccagaca agtaccccgt tggggttcat ttcctgccca agaagctgga 1321 tgaggcagtg gctgaagccc acctgggcaa gctgaatgtg aagttgacca agctaactga 1381 gaagcaagcc cagtacctgg gcatgtcctg tgatggcccc ttcaagccgg atcactaccg 1441 ctactgagag ccaggtctgc gtttcaccct ccagctgctg tccttgccca ggccccacct 1501 ctcctcccta agagctaatg gcaccaactt tgtgattggt ttgtcagtgt cccccatcga 1561 ctctctgggg ctgatcactt agtttttggc ctctgctgca gccgtcatac tgttccaaat 1621 gtggcagcgg gaacagagta ccctcttcaa gccccggtca tgatggaggt cccagccaca 1681 gggaaccatg agctcagtgg tcttggaaca gctcactaag tcagtccttc cttagcctgg 1741 aagtcagtag tggagtcaca aagcccatgt gttttgccat ctaggccttc acctggtctg 1801 tggacttata cctgtgtgct tggtttacag gtccagtggt tcttcagccc atgacagatg 1861 agaaggggct atattgaagg gcaaagagga actgttgttt gaattttcct gagagcctgg 1921 cttagtgctg ggccttctct taaacctcat tacaatgagg ttagtacttt tagtccctgt 1981 tttacagggg ttagaataga ctgttaaggg gcaactgaga aagaacagag aagtgacagc 2041 taggggttga gaggggccag aaaaacatga atgcaggcag atttcgtgaa atctgccacc 2101 actttataac cagatggttc ctttcacaac cctgggtcaa aaagagaata atttggccta 2161 taatgttaaa agaaagcagg aaggtgggta aataaaaatc ttggtgcctg g // LOCUS HUMAHREC 5228 bp mRNA PRI 03-FEB-1994 DEFINITION Human AH-receptor mRNA, complete cds. ACCESSION L19872 NID g416141 KEYWORDS AH-receptor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5228) AUTHORS Dolwick,K.M., Schmidt,J.V., Carver,L.A., Swanson,H.I. and Bradfield,C.A. TITLE Cloning and expression of a human Ah receptor cDNA JOURNAL Mol. Pharmacol. 44 (5), 911-917 (1993) MEDLINE 94067047 FEATURES Location/Qualifiers source 1..5228 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HepG2" CDS 376..2922 /codon_start=1 /evidence=experimental /product="AH-receptor" /db_xref="PID:g416142" /translation="MNSSSANITYASRKRRKPVQKTVKPIPAEGIKSNPSKRHRDRLN TELDRLASLLPFPQDVINKLDKLSVLRLSVSYLRAKSFFDVALKSSPTERNGGQDNCR AANFREGLNLQEGEFLLQALNGFVLVVTTDALVFYASSTIQDYLGFQQSDVIHQSVYE LIHTEDRAEFQRQLHWALNPSQCTESGQGIEEATGLPQTVVCYNPDQIPPENSPLMER CFICRLRCLLDNSSGFLAMNFQGKLKYLHGQKKKGKDGSILPPQLALFAIATPLQPPS ILEIRTKNFIFRTKHKLDFTPIGCDAKGRIVLGYTEAELCTRGSGYQFIHAADMLYCA ESHIRMIKTGESGMIVFRLLTKNNRWTWVQSNARLLYKNGRPDYIIVTQRPLTDEEGT EHLRKRNTKLPFMFTTGEAVLYEATNPFPAIMDPLPLRTKNGTSGKDSATTSTLSKDS LNPSSLLAAMMQQDESIYLYPASSTSSTAPFENNFFNESMNECRNWQDNTAPMGNDTI LKHEQIDQPQDVNSFAGGHPGLFQDSKNSDLYSIMKNLGIDFEDIRHMQNEKFFRNDF SGEVDFRDIDLTDEILTYVQDSLSKSPFIPSDYQQQQSLALNSSCMVQEHLHLEQQQQ HHQKQVVVEPQQQLCQKMKHMQVNGMFENWNSNQFVPFNCPQQDPQQYNVFTDLHGIS QEFPYKSEMDSMPYTQNFISCNQPVLPQHSKCTELDYPMGSFEPSPYPTTSSLEDFVT CLQLPENQKHGLNPQSAIITPQTCYAGAVSMYQCQPEPQHTHVGQMQYNPVLPGQQAF LNKFQNGVLNETYPAELNNINNTQTTTHLQPLHHPSEARPFPDLTSSGFL" polyA_site 5228 BASE COUNT 1596 a 1100 c 975 g 1557 t ORIGIN 1 cacggcccag acccaggatt ctttatagac ggcccaggct cctcctccgc ccgggccgcc 61 tcacctgcgg gcattgcgcg ccgcctccgc cggtgtagac ggcacctgcg ccgccttgct 121 cgcgggtctc cgccctcgcc caccctcact gcgccaggcc caggcagctc acctgtactg 181 gcgcgggctg cggaagctgc gtgacgcgag gcgttgaggc gcggcgccca cgccactgtc 241 ccgagaggac gcaggtggag cgggcgcgac ttcgcgaacc cggcgccggc cgccgcagtg 301 gtcccagcct acaccgggtt ccggggaccc ggccgccagt gcccggggag tagccgccgc 361 cgtcggctgg gcaccatgaa cagcagcagc gccaacatca cctacgccag tcgcaagcgg 421 cggaagccgg tgcagaaaac agtaaagcca atcccagctg aaggaatcaa gtcaaatcct 481 tccaagcggc atagagaccg acttaataca gagttggacc gtttggctag cctgctgcct 541 ttcccacaag atgttattaa taagttggac aaactttcag ttcttaggct cagcgtcagt 601 tacctgagag ccaagagctt ctttgatgtt gcattaaaat cctcccctac tgaaagaaac 661 ggaggccagg ataactgtag agcagcaaat ttcagagaag gcctgaactt acaagaagga 721 gaattcttat tacaggctct gaatggcttt gtattagttg tcactacaga tgctttggtc 781 ttttatgctt cttctactat acaagattat ctagggtttc agcagtctga tgtcatacat 841 cagagtgtat atgaacttat ccataccgaa gaccgagctg aatttcagcg tcagctacac 901 tgggcattaa atccttctca gtgtacagag tctggacaag gaattgaaga agccactggt 961 ctcccccaga cagtagtctg ttataaccca gaccagattc ctccagaaaa ctctccttta 1021 atggagaggt gcttcatatg tcgtctaagg tgtctgctgg ataattcatc tggttttctg 1081 gcaatgaatt tccaagggaa gttaaagtat cttcatggac agaaaaagaa agggaaagat 1141 ggatcaatac ttccacctca gttggctttg tttgcgatag ctactccact tcagccacca 1201 tccatacttg aaatccggac caaaaatttt atctttagaa ccaaacacaa actagacttc 1261 acacctattg gttgtgatgc caaaggaaga attgttttag gatatactga agcagagctg 1321 tgcacgagag gctcaggtta tcagtttatt catgcagctg atatgcttta ttgtgccgag 1381 tcccatatcc gaatgattaa gactggagaa agtggcatga tagttttccg gcttcttaca 1441 aaaaacaacc gatggacttg ggtccagtct aatgcacgcc tgctttataa aaatggaaga 1501 ccagattata tcattgtaac tcagagacca ctaacagatg aggaaggaac agagcattta 1561 cgaaaacgaa atacgaagtt gccttttatg tttaccactg gagaagctgt gttgtatgag 1621 gcaaccaacc cttttcctgc cataatggat cccttaccac taaggactaa aaatggcact 1681 agtggaaaag actctgctac cacatccact ctaagcaagg actctctcaa tcctagttcc 1741 ctcctggctg ccatgatgca acaagatgag tctatttatc tctatcctgc ttcaagtact 1801 tcaagtactg caccttttga aaacaacttt ttcaacgaat ctatgaatga atgcagaaat 1861 tggcaagata atactgcacc gatgggaaat gatactatcc tgaaacatga gcaaattgac 1921 cagcctcagg atgtgaactc atttgctgga ggtcacccag ggctctttca agatagtaaa 1981 aacagtgact tgtacagcat aatgaaaaac ctaggcattg attttgaaga catcagacac 2041 atgcagaatg aaaaattttt cagaaatgat ttttctggtg aggttgactt cagagacatt 2101 gacttaacgg atgaaatcct gacgtatgtc caagattctt taagtaagtc tcccttcata 2161 ccttcagatt atcaacagca acagtccttg gctctgaact caagctgtat ggtacaggaa 2221 cacctacatc tagaacagca acagcaacat caccaaaagc aagtagtagt ggagccacag 2281 caacagctgt gtcagaagat gaagcacatg caagttaatg gcatgtttga aaattggaac 2341 tctaaccaat tcgtgccttt caattgtcca cagcaagacc cacaacaata taatgtcttt 2401 acagacttac atgggatcag tcaagagttc ccctacaaat ctgaaatgga ttctatgcct 2461 tatacacaga actttatttc ctgtaatcag cctgtattac cacaacattc caaatgtaca 2521 gagctggact accctatggg gagttttgaa ccatccccat accccactac ttctagttta 2581 gaagattttg tcacttgttt acaacttcct gaaaaccaaa agcatggatt aaatccacag 2641 tcagccataa taactcctca gacatgttat gctggggccg tgtcgatgta tcagtgccag 2701 ccagaacctc agcacaccca cgtgggtcag atgcagtaca atccagtact gccaggccaa 2761 caggcatttt taaacaagtt tcagaatgga gttttaaatg aaacatatcc agctgaatta 2821 aataacataa ataacactca gactaccaca catcttcagc cacttcatca tccgtcagaa 2881 gccagacctt ttcctgattt gacatccagt ggattcctgt aattccaagc ccaattttga 2941 ccctggtttt tggattaaat tagtttgtga aggattatgg aaaaataaaa ctgtcactgt 3001 tggacgtcag caagttcaca tggaggcatt gatgcatgct attcacaatt attccaaacc 3061 aaattttaat ttttgctttt agaaaaggga gtttaaaaat ggtatcaaaa ttacatatac 3121 tacagtcaag atagaaaggg tgctgccacg gagtggtgag gtaccgtcta catttcacat 3181 tattctgggc accacaaaat atacaaaact ttatcaggga aactaagatt cttttaaatt 3241 agaaaatatt ctctatttga attatttctg tcacagtaaa aataaaatac tttgagtttt 3301 gagctactgg attcttatta gttccccaaa tacaaagtta gagaactaaa ctagtttttc 3361 ctatcatgtt aacctctgct tttatctcag atgttaaaat aaatggtttg gtgcttttta 3421 taaaaagata atctcagtgc tttcctcctt cactgtttca tctaagtgcc tcacattttt 3481 ttctacctat aacactctag gatgtatatt ttatataaag tattcttttt cttttttaaa 3541 ttaatatctt tctgcacaca aatattattt gtgtttccta aatccaacca attttcatta 3601 attcaggcat attttaactc cactgcttac ctactttctt caggtaaaag ggcaaataat 3661 gatcgaaaaa ataattattt attacataat ttagttgttt ctagactata aatgttgcta 3721 tgtgccttat gttgaaaaaa tttaaaagta aaatgtcttt ccaaattatt tcttaattat 3781 tataaaaata ttaagacaat agcacttaaa ttcctcaaca gtgttttcag aagaaataaa 3841 tataccactc tttaccttta ttgatatctc catgatgata gttgaatgtt gcaatgtgaa 3901 aaatctgctg ttaactgcaa ccttgtttat taaattgcaa gaagctttat ttctagcttt 3961 ttaattaagc aaagcaccca tttcaatgtg tataaattgt ctttaaaaac tgttttagac 4021 ctataatcct tgataatata ttgtgttgac tttataaatt tcgcttctta gaacagtgga 4081 aactatgtgt ttttctcata tttgaggagt gttaagattg cagatagcaa ggtttggtgc 4141 aaagtattgt aatgagtgaa ttgaatggtg cattgtatag atataatgaa caaaattatt 4201 tgtaagatat ttgcagtttt tcattttaaa aagtccatac cttatatatg cacttaattt 4261 gttggggctt tacatacttt atcaatgtgt ctttctaaga aatcaagtaa tgaatccaac 4321 tgcttaaagt tggtattaat aaaaagacaa ccacatagtt cgtttacctt caaactttag 4381 gtttttttaa tgatatactg atcttcatta ccaataggca aattaatcac cctaccaact 4441 ttactgtcct aacatggact ttcaaaaaga aaaaatgaca ccatctttta ttcttttttt 4501 tttttttttt ttgagagaga gtcttactct gccgcccaaa ctggagtgca gtggcacaat 4561 cttggctcac tgcaacctct acctcctggg ttcaagtgat tctcttgcct cagcctcccg 4621 agttgctggg attgcgggca tggtggcgtg agcctgtagt cctagctact cgggaggctg 4681 aggcaggaga atagcctgaa cctgggaatc ggaggttgca gggccaagat cgccccactg 4741 cactccagcc tggcaataga ccgagctccg tctccaaaaa aaaaaataca atttttattt 4801 cttttacttt ttttagtaag ttaatgtata taaaaatggc ttcggacaaa atatctctga 4861 gttctgtgta ttttcagtca aaactttaaa cctgtagaat caatttaagt gttgaaaaaa 4921 atttgtctga aacatttcat aatttgtttc cagcatgagg tatctaagga tttagaccag 4981 aggtctagat taatactcta tttttacatt taaacctttt attataagtc ttacataaac 5041 catttttgtt actctcttcc acatgttact ggataaattg tttagtggaa aataggcttt 5101 ttaatcatga atatgatgac aatcagttat acagttataa aattaaaagt ttgaaaagca 5161 atattgtata tttttatcta tataaaataa ctaaaatgta tctaagaata ataaaatcac 5221 gttaaacc // LOCUS HUMAICEB 4020 bp mRNA PRI 30-OCT-1994 DEFINITION Human angiotensin I-converting enzyme mRNA, complete cds. ACCESSION J04144 NID g178285 KEYWORDS angiotensin converting enzyme; dipeptidyl carboxypeptidase. SOURCE Human endothelial cell, cDNA to mRNA, clones lambda-HEC1922, lambda-HEC2111, and lambda-CHDT32. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4020) AUTHORS Soubrier,F., Alhenc-Gelas,F., Hubert,C., Allegrini,J., John,M., Tregear,G. and Corvol,P. TITLE Two putative active centers in human angiotensin I-converting enzyme revealed by molecular cloning JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85 (24), 9386-9390 (1988) MEDLINE 89071703 COMMENT Draft entry and computer-readable sequence [1] kindly submitted by F.Soubrier 04-JAN-1989. FEATURES Location/Qualifiers source 1..4020 /organism="Homo sapiens" /db_xref="taxon:9606" /map="17q23" sig_peptide 1..109 /note="angiotensin I-converting enzyme signal peptide" gene 23..3943 /gene="DCP1" CDS 23..3943 /gene="DCP1" /note="angiotensin I-converting enzyme precursor (EC 3.4.15.1)" /codon_start=1 /db_xref="GDB:G00-119-840" /db_xref="PID:g178286" /translation="MGAASGRRGPGLLLPLPLLLLLPPQPALALDPGLQPGNFSADEA GAQLFAQSYNSSAEQVLFQSVAASWAHDTNITAENARRQEEAALLSQEFAEAWGQKAK ELYEPIWQNFTDPQLRRIIGAVRTLGSANLPLAKRQQYNALLSNMSRIYSTAKVCLPN KTATCWSLDPDLTNILASSRSYAMLLFAWEGWHNAAGIPLKPLYEDFTALSNEAYKQD GFTDTGAYWRSWYNSPTFEDDLEHLYQQLEPLYLNLHAFVRRALHRRYGDRYINLRGP IPAHLLGDMWAQSWENIYDMVVPFPDKPNLDVTSTMLQQGWNATHMFRVAEEFFTSLE LSPMPPEFWEGSMLEKPADGREVVCHASAWDFYNRKDFRIKQCTRVTMDQLSTVHHEM GHIQYYLQYKDLPVSLRRGANPGFHEAIGDVLALSVSTPEHLHKIGLLDRVTNDTESD INYLLKMALEKIAFLPFGYLVDQWRWGVFSGRTPPSRYNFDWWYLRTKYQGICPPVTR NETHFDAGAKFHVPNVTPYIRYFVSFVLQFQFHEALCKEAGYEGPLHQCDIYRSTKAG AKLRKVLQAGSSRPWQEVLKDMVGLDALDAQPLLKYFQPVTQWLQEQNQQNGEVLGWP EYQWHPPLPDNYPEGIDLVTDEAEASKFVEEYDRTSQVVWNEYAEANWNYNTNITTET SKILLQKNMQIANHTLKYGTQARKFDVNQLQNTTIKRIIKKVQDLERAALPAQELEEY NKILLDMETTYSVATVCHPNGSCLQLEPDLTNVMATSRKYEDLLWAWEGWRDKAGRAI LQFYPKYVELINQAARLNGYVDAGDSWRSMYETPSLEQDLERLFQELQPLYLNLHAYV RRALHRHYGAQHINLEGPIPAHLLGNMWAQTWSNIYDLVVPFPSAPSMDTTEAMLKQG WTPRRMFKEADDFFTSLGLLPVPPEFWNKSMLEKPTDGREVVCHASAWDFYNGKDFRI KQCTTVNLEDLVVAHHEMGHIQYFMQYKDLPVALREGANPGFHEAIGDVLALSVSTPK HLHSLNLLSSEGGSDEHDINFLMKMALDKIAFIPFSYLVDQWRWRVFDGSITKENYNQ EWWSLRLKYQGLCPPVPRTQGDFDPGAKFHIPSSVPYIRYFVSFIIQFQFHEALCQAA GHTGPLHKCDIYQSKEAGQRLATAMKLGFSRPWPEAMQLITGQPNMSASAMLSYFKPL LDWLRTENELHGEKLGWPQYNWTPNSARSEGPLPDSGRVSFLGLDLDAQQARVGQWLL LFLGIALLVATLGLSQRLFSIRHRSLHRHSHGPQFGSEVELRHS" mat_peptide 110..3940 /gene="DCP1" /note="angiotensin I-converting enzyme" BASE COUNT 857 a 1261 c 1174 g 728 t ORIGIN 1 gccgagcacc gcgcaccgcg tcatgggggc cgcctcgggc cgccgggggc cggggctgct 61 gctgccgctg ccgctgctgt tgctgctgcc gccgcagccc gccctggcgt tggaccccgg 121 gctgcagccc ggcaactttt ctgctgacga ggccggggcg cagctcttcg cgcagagcta 181 caactccagc gccgaacagg tgctgttcca gagcgtggcc gccagctggg cgcacgacac 241 caacatcacc gcggagaatg caaggcgcca ggaggaagca gccctgctca gccaggagtt 301 tgcggaggcc tggggccaga aggccaagga gctgtatgaa ccgatctggc agaacttcac 361 ggacccgcag ctgcgcagga tcatcggagc tgtgcgaacc ctgggctctg ccaacctgcc 421 cctggctaag cggcagcagt acaacgccct gctaagcaac atgagcagga tctactccac 481 cgccaaggtc tgcctcccca acaagactgc cacctgctgg tccctggacc cagatctcac 541 caacatcctg gcttcctcgc gaagctacgc catgctcctg tttgcctggg agggctggca 601 caacgctgcg ggcatcccgc tgaaaccgct gtacgaggat ttcactgccc tcagcaatga 661 agcctacaag caggacggct tcacagacac gggggcctac tggcgctcct ggtacaactc 721 ccccaccttc gaggacgatc tggaacacct ctaccaacag ctagagcccc tctacctgaa 781 cctccatgcc ttcgtccgcc gcgcactgca tcgccgatac ggagacagat acatcaacct 841 caggggaccc atccctgctc atctgctggg agacatgtgg gcccagagct gggaaaacat 901 ctacgacatg gtggtgcctt tcccagacaa gcccaacctc gatgtcacca gtactatgct 961 gcagcagggc tggaacgcca cgcacatgtt ccgggtggca gaggagttct tcacctccct 1021 ggagctctcc cccatgcctc ccgagttctg ggaagggtcg atgctggaga agccggccga 1081 cgggcgggaa gtggtgtgcc acgcctcggc ttgggacttc tacaacagga aagacttcag 1141 gatcaagcag tgcacacggg tcacgatgga ccagctctcc acagtgcacc atgagatggg 1201 ccatatacag tactacctgc agtacaagga tctgcccgtc tccctgcgtc ggggggccaa 1261 ccccggcttc catgaggcca ttggggacgt gctggcgctc tcggtctcca ctcctgaaca 1321 tctgcacaaa atcggcctgc tggaccgtgt caccaatgac acggaaagtg acatcaatta 1381 cttgctaaaa atggcactgg aaaaaattgc cttcctgccc tttggctact tggtggacca 1441 gtggcgctgg ggggtcttta gtgggcgtac ccccccttcc cgctacaact tcgactggtg 1501 gtatcttcga accaagtatc aggggatctg tcctcctgtt acccgaaacg aaacccactt 1561 tgatgctgga gctaagtttc atgttccaaa tgtgacacca tacatcaggt actttgtgag 1621 ttttgtcctg cagttccagt tccatgaagc cctgtgcaag gaggcaggct atgagggccc 1681 actgcaccag tgtgacatct accggtccac caaggcaggg gccaagctcc ggaaggtgct 1741 gcaggctggc tcctccaggc cctggcagga ggtgctgaag gacatggtcg gcttagatgc 1801 cctggatgcc cagccgctgc tcaagtactt ccagccagtc acccagtggc tgcaggagca 1861 gaaccagcag aacggcgagg tcctgggctg gcccgagtac cagtggcacc cgccgttgcc 1921 tgacaactac ccggagggca tagacctggt gactgatgag gctgaggcca gcaagtttgt 1981 ggaggaatat gaccggacat cccaggtggt gtggaacgag tatgccgagg ccaactggaa 2041 ctacaacacc aacatcacca cagagaccag caagattctg ctgcagaaga acatgcaaat 2101 agccaaccac accctgaagt acggcaccca ggccaggaag tttgatgtga accagttgca 2161 gaacaccact atcaagcgga tcataaagaa ggttcaggac ctagaacggg cagcgctgcc 2221 tgcccaggag ctggaggagt acaacaagat cctgttggat atggaaacca cctacagcgt 2281 ggccactgtg tgccacccga atggcagctg cctgcagctc gagccagatc tgacgaatgt 2341 gatggccaca tcccggaaat atgaagacct gttatgggca tgggagggct ggcgagacaa 2401 ggcggggaga gccatcctcc agttttaccc gaaatacgtg gaactcatca accaggctgc 2461 ccggctcaat ggctatgtag atgcagggga ctcgtggagg tctatgtacg agacaccatc 2521 cctggagcaa gacctggagc ggctcttcca ggagctgcag ccactctacc tcaacctgca 2581 tgcctacgtg cgccgggccc tgcaccgtca ctacggggcc cagcacatca acctggaggg 2641 gcccattcct gctcacctgc tggggaacat gtgggcgcag acctggtcca acatctatga 2701 cttggtggtg cccttccctt cagccccctc gatggacacc acagaggcta tgctaaagca 2761 gggctggacg cccaggagga tgtttaagga ggctgatgat ttcttcacct ccctggggct 2821 gctgcccgtg cctcctgagt tctggaacaa gtcgatgctg gagaagccaa ccgacgggcg 2881 ggaggtggtc tgccacgcct cggcctggga cttctacaac ggcaaggact tccggatcaa 2941 gcagtgcacc accgtgaact tggaggacct ggtggtggcc caccacgaaa tgggccacat 3001 ccagtatttc atgcagtaca aagacttacc tgtggccttg agggagggtg ccaaccccgg 3061 cttccatgag gccattgggg acgtgctagc cctctcagtg tctacgccca agcacctgca 3121 cagtctcaac ctgctgagca gtgagggtgg cagcgacgag catgacatca actttctgat 3181 gaagatggcc cttgacaaga tcgcctttat ccccttcagc tacctcgtcg atcagtggcg 3241 ctggagggta tttgatggaa gcatcaccaa ggagaactat aaccaggagt ggtggagcct 3301 caggctgaag taccagggcc tctgcccccc agtgcccagg actcaaggtg actttgaccc 3361 aggggccaag ttccacattc cttctagcgt gccttacatc aggtactttg tcagcttcat 3421 catccagttc cagttccacg aggcactgtg ccaggcagct ggccacacgg gccccctgca 3481 caagtgtgac atctaccagt ccaaggaggc cgggcagcgc ctggcgaccg ccatgaagct 3541 gggcttcagt aggccgtggc cggaagccat gcagctgatc acgggccagc ccaacatgag 3601 cgcctcggcc atgttgagct acttcaagcc gctgctggac tggctccgca cggagaacga 3661 gctgcatggg gagaagctgg gctggccgca gtacaactgg acgccgaact ccgctcgctc 3721 agaagggccc ctcccagaca gcggccgcgt cagcttcctg ggcctggacc tggatgcgca 3781 gcaggcccgc gtgggccagt ggctgctgct cttcctgggc atcgccctgc tggtagccac 3841 cctgggcctc agccagcggc tcttcagcat ccgccaccgc agcctccacc ggcactccca 3901 cgggccccag ttcggctccg aggtggagct gagacactcc tgaggtgacc cggctgggtc 3961 ggccctgccc aagggcctcc caccagagac tgggatggga acactggtgg gcagctgagg // LOCUS HUMAIR 1816 bp DNA PRI 27-JAN-1996 DEFINITION Homo Sapiens angiotensin II receptor gene, complete cds. ACCESSION L48211 NID g1160612 KEYWORDS angiotensin; angiotensin II; angiotensin II receptor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1816) AUTHORS Razdan,K. and Kroll,M.H. TITLE Molecular cloning of a novel platelet protein showing homology to the angiotensin II receptor C-terminal domain JOURNAL J. Biol. Chem. 271 (4), 2221-2224 (1996) MEDLINE 96147204 FEATURES Location/Qualifiers source 1..1816 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1481..1696 /codon_start=1 /product="angiotensin II receptor" /db_xref="PID:g1160613" /translation="MHSSFCSLGDRAACSVITASELKSHRSPDSRSFLMHSSQSQLRQ YSRLYVLWQREADEHSFREKADGKPVS" BASE COUNT 477 a 415 c 461 g 463 t ORIGIN 1 gggtaaaacc atttgtttaa ttctaaatca aatcactttc acaacagtga aaattagtga 61 ctggttaagg tgtgccactg tacatatcat cattttctga ctggggtcag gacctggtcc 121 tagtccacaa gggtggcagg aggagggtgg aggctaagaa cacagaaaac acacaaaaga 181 aaggaaagct gccttggcag aaggatgagg tggtgagctt gccgagggat ggtgggaagg 241 gggctccctg ttggggccga gccaggagtc ccaagtcagc tctcctgcct tacttagctc 301 ctggcagagg gtgagtgggg acctacgagg ttcaaaatca aatggcattt ggccaggctg 361 gctttactaa caggttccca gagtgcctct gttggctgag ctctcctggg ctactcattt 421 cattgaagag tccaaatgat tcattttcct acccacaact tttcattatt cttctggaaa 481 cccatttctg ttgagtccat ctgacttaag tcctctctcc ctccactagt tggggccact 541 gcactgaggg gggtcccacc aattctctct agagaagaga cactccagag gcccctgcaa 601 ctttgcggat ttccagaagg tgataaaaag agcactcttg agtgggtgcc caggaatgtt 661 taaaatctat caggcacact ataaagctgg tggtttcttc ctaccaagtg gattcggcat 721 atgaaccacc tactcaatac tttatatttt gtctgtttaa acactgaact ctggtgttga 781 caggtacaag gagaagagat ggggactatg aagaggggag ggcttccctc atcttcctca 841 agatctttgt ttccacaaac tatgcagtca taatttgaga aaaagcaata gatggggctt 901 cctaccattt gttggttatt gctggggtta gccaggagca gtgtggatgg caaagtagga 961 gagaggccca gaggaaagcc catctccctc cagctttggg gtctccagaa agaggctgga 1021 tttctgggat gaagcctaga aggcagagca agaactgttc caccaggtga acagtcctac 1081 ctgcttggta ccatagtccc tcaataagat tcagaggaag aagcttatga aactgaaaat 1141 caaatcaagg tattgggaag aataatttcc cctcgattcc acaggaggga agaccacaca 1201 atatcattgt gctggggctc cccaaggccc tgccacctgg ctttacaaat catcaggggt 1261 tgcctgcttg gcagtcacat gcttccctgg ttttagcaca catacaagga gttttcaggg 1321 aactctatca agccatacca aaatcagggt cacatgtggg tttccccttt ccttgcctct 1381 tcataaaaga caacttggct tctgaggatg gtggtctttt gcatgcagtt gggctgacct 1441 gacaaagccc ccagtttcct gtggcaggtt ctgggagagg atgcattcaa gcttctgcag 1501 cctaggggac agggctgctt gttcagttat tactgcctcg gagctcaaat cccaccgaag 1561 tcctgactcc aggtctttcc taatgcacag tagtcagtct cagcttcggc agtattctcg 1621 gctgtatgtt ctctggcaga gagaggcaga tgaacatagt tttagggaga aagctgatgg 1681 gaaacctgtg agttaagcca catgtctcac caggaataat ttatgccagg aaaccaggaa 1741 gtcattcaag ttgttctctg aggccaaaga cactgagcac agcccagagc caataaaaga 1801 tctttgagtc tctggt // LOCUS HUMAKAP79A 2618 bp mRNA PRI 31-DEC-1994 DEFINITION Human cAMP-dpendent protein kinase (AKAP 79) mRNA, complete cds. ACCESSION M90359 NID g178323 KEYWORDS anchoring protein; cAMP dependent; protein kinase. SOURCE Homo sapiens thyroid gland cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2618) AUTHORS Carr,D.W., Stofko-Hahn,R.E., Fraser,I.D., Bishop,S.M., Acott,T.S., Brennan,R.G. and Scott,J.D. TITLE Interaction of the regulatory subunit (RII) of cAMP-dependent protein kinase with RII-anchoring proteins occurs through an amphipathic helix binding motif JOURNAL J. Biol. Chem. 266 (22), 14188-14192 (1991) MEDLINE 91317762 REFERENCE 2 (bases 1 to 2618) AUTHORS Hirsch,A.H., Glantz,S.B., Li,Y., You,Y. and Rubin,C.S. TITLE Cloning and expression of an intron-less gene for AKAP 75, an anchor protein for the regulatory subunit of cAMP-dependent protein kinase II beta JOURNAL J. Biol. Chem. 267 (4), 2131-2134 (1992) MEDLINE 92129278 REFERENCE 3 (sites) AUTHORS Carr,D.W., Hausken,Z.E., Fraser,I.D., Stofko-Hahn,R.E. and Scott,J.D. TITLE Association of the type II cAMP-dependent protein kinase with a human thyroid RII-anchoring protein. Cloning and characterization of the RII-binding domain JOURNAL J. Biol. Chem. 267 (19), 13376-13382 (1992) MEDLINE 92317056 REFERENCE 4 (bases 1 to 2618) AUTHORS Carr,D.W., Stofko-Hahn,R.E., Fraser,I.D., Cone,R.D. and Scott,J.D. TITLE Localization of the cAMP-dependent protein kinase to the postsynaptic densities by A-kinase anchoring proteins. Characterization of AKAP 79 JOURNAL J. Biol. Chem. 267 (24), 16816-16823 (1992) MEDLINE 92380978 FEATURES Location/Qualifiers source 1..2618 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="thyroid gland" 5'UTR 1..1297 /gene="AKAP 79" gene 1..2618 /gene="AKAP 79" CDS 1298..2581 /gene="AKAP 79" /codon_start=1 /product="protein kinase" /db_xref="PID:g178324" /translation="METTISEIHVENKDEKRSAEGSPGAERQKEKASMLCFKRRKKAA KALKPKAGSEAADVARKCPQEAGASDQPEPTRGAWASLKRLVTRRKRSESSKQQKPLE GEMQPAINAEDADLSKKKAKSRLKIPCIKFPRGPKRSNHSKIIEDSDCSIKVQEEAEI LDIQTQTPLNDQATKAKSTQDLSEGISQKDGDEVCESNVSNSITSGEKVISVELGLDN GHSAIQTGTLILEEIETIKEKQDVQPQQASPLETSETDHQQPVLSDVPPLPAIPDQQI VEEASNSTLESAPNGKDYESTEIVAEETKPKDTELSQESDFKENGITEEKSKSEESKR MEPIAIIITDTEISEFDVTKSKNVPKQFLISAENEQVGVFANDNGFEDRTSEQYETLL IETASSLVKNAIQLSIEQLVNEMASDDNKINNLLQ" 3'UTR 2582..2618 /gene="AKAP 79" BASE COUNT 950 a 476 c 504 g 688 t ORIGIN 1 gaattccttt ttttttacta atgaaacact gcagattagt atccccaaac aattcatttt 61 ctgacgaaac tcttaaagcc cagtttcaga acataagctt taagcaacaa gcaatcttgt 121 gttaatctga gctctgttac agctctgccg taatctcagc caaattgctt catctgtacg 181 tggtgatacc accatctttg taatattgtg gtgaggatta gagatgtatg taaaagtgca 241 tgcactgcct aatacccaag aattgctcaa taaatgtcag ctattactac tcttactgac 301 ttccaactat gtgcagatta tggtggaaac atgtgataca gcagatacag tacttaaatg 361 aaaatgtcaa aattgtacaa ctaatttaag agtgggaaat gacagtaatg ctaaattttc 421 ctaagaatat gtatgaaaaa tagcacttcc ttgatggaac atacattttt agaacagaaa 481 ccttatgtta taatggtaaa actagagagt caaagtctct agttctaact ctggctatga 541 ccttatctag ttgcatggct tttgacaaga caactgctcc cctctggagt tcagttttgt 601 catctggctg gatttgagta atctcaaata atctctaaag tcctttgcag ctttaaatta 661 tctattgagt gtgataaaga ttacatcagc ttgcttttca tcctgcactt catttttggt 721 tcttgttgtt tgcttaactt gcactttaga gtcataagca attcacacag acaacacaac 781 taaagaactg ataatatcag acttgcaggg taatccaaaa tgaagtttca acaaggagga 841 taaaacaaaa tagaatcaca ctgaactctt taatagggcc aaagtaaacc ccaatgttta 901 ttctccagtc atttcattaa gttattatat ggaaaatatg cctggtagaa tcttatacat 961 tcactgaaat agtctccaca gaaaataata ggctcatagt gaaataaaat accactttct 1021 tttctctcat caggttgatc tttaaataaa gcaaaaattt atccaaagaa gggctttcat 1081 caagaaatta aagagagaaa ttacaagcca tagagaaact gggcatttct atactagaga 1141 aaccacctaa aacaactgta tggagtaaga tgaaaggtac tgaatatgcc tggaagaaat 1201 tttacctagt gtgacatttt tgctcatctt tttgtcacaa aaggctgcta aggaagagag 1261 aatacagttt tctagagaat aagagtgcag tgtaaaaatg gaaaccacaa tttcagaaat 1321 tcatgtagaa aacaaggatg agaagagatc agcagaaggt agtcctgggg ctgaaaggca 1381 gaaggaaaag gcatccatgc tttgcttcaa gagaagaaag aaagcagcca aagcactgaa 1441 gcccaaagct ggctctgaag ctgctgatgt ggcaaggaag tgtccacaag aagcaggagc 1501 ttctgatcag ccagagccca cacggggggc ctgggcctca ctcaaacgtc ttgtaacacg 1561 caggaaaagg tcagagtctt caaagcagca aaagccattg gagggtgaaa tgcaacctgc 1621 aataaatgct gaggatgctg atctttctaa gaaaaaggca aaatctagac ttaagattcc 1681 ctgcataaaa ttcccaagag ggccaaaaag gagtaatcat tccaaaatta tagaagactc 1741 agactgcagc atcaaagtcc aggaagaagc tgaaattttg gatatacaaa cacagacccc 1801 attgaatgat caggcaacaa aggctaagtc aacccaggat ctaagtgaag gcatctcaca 1861 gaaagatggt gatgaggtct gtgaatcaaa tgtgagcaat agcataactt ctggagagaa 1921 agtgatttca gtagaacttg gattagataa tgggcattct gctattcaaa cgggaactct 1981 aatccttgaa gaaattgaaa cgatcaagga aaaacaagat gttcaacccc agcaagcaag 2041 cccacttgaa acttcagaaa cagaccatca gcagccagta ctttctgatg ttcctccttt 2101 acctgcaatt ccagatcaac aaattgtgga agaagccagt aacagtaccc tagaaagtgc 2161 accaaatgga aaagactatg aaagtacaga gattgtagct gaagaaacta agccaaaaga 2221 tactgaattg agccaagaat cagattttaa agaaaatggg atcactgaag agaaatccaa 2281 atcagaagaa agcaaaagaa tggagccaat tgctattatt attacagaca ctgaaatcag 2341 tgaatttgat gttacaaaat ctaaaaatgt ccctaagcaa ttcttaattt cagctgaaaa 2401 tgagcaagta ggggtttttg ctaatgataa tggttttgag gatagaactt cagaacaata 2461 tgaaacactc ttaattgaaa cagcctcttc tctagtcaag aatgctattc agttgtcaat 2521 agaacagctg gttaatgaaa tggcctctga tgataataaa ataaacaatc ttctacagtg 2581 acttactctc cagagtcacg gcagaaaaaa aggaattc // LOCUS HUMAKT2A 1599 bp mRNA PRI 31-DEC-1994 DEFINITION Human protein-serine/threonine (AKT2) mRNA, complete cds. ACCESSION M95936 NID g178325 KEYWORDS protein serine/threonine kinase. SOURCE Homo sapiens (tissue library: Clontech) thymus cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1599) AUTHORS Cheng,J.Q., Godwin,A.K., Bellacosa,A., Taguchi,T., Franke,T.F., Hamilton,T.C., Tsichlis,P.N. and Testa,J.R. TITLE AKT2, a putative oncogene encoding a member of a subfamily of protein-serine/threonine kinases, is amplified in human ovarian carcinomas JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (19), 9267-9271 (1992) MEDLINE 93028445 FEATURES Location/Qualifiers source 1..1599 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="thymus" /tissue_lib="Clontech" /map="19q13.1-19q13.2" gene 88..1533 /gene="AKT2" CDS 88..1533 /gene="AKT2" /codon_start=1 /product="protein serine/threonine kinase" /db_xref="PID:g178326" /translation="MNEVSVIKEGWLHKRGEYIKTWRPRYFLLKSDGSFIGYKERPEA PDQTLPPLNNFSVAECQLMKTERPRPNTFVIRCLQWTTVIERTFHVDSPDEREEWMRA IQMVANSLKQRAPGEDPMDYKCGSPSDSSTTEEMEVAVSKARAKVTMNDFDYLKLLGK GTFGKVILVREKATGRYYAMKILRKEVIIAKDEVAHTVTESRVLQNTRHPFLTALKYA FQTHDRLCFVMEYANGGELFFHLSRERVFTEERARFYGAEIVSALEYLHSRDVVYRDI KLENLMLDKDGHIKITDFGLCKEGISDGATMKTFCGTPEYLAPEVLEDNDYGRAVDWW GLGVVMYEMMCGRLPFYNQDHERLFELILMEEIRFPRTLSPEAKSLLAGLLKKDPKQR LGGGPSDAKEVMEHRFFLSINWQDVVQKKLLPPFKPQVTSEVDTRYFDDEFTAQSITI TPPDRYDSLGLLELDQRTHFPQFSYSASIRE" BASE COUNT 351 a 487 c 457 g 304 t ORIGIN 1 gagactgtgc cctgtccacg gtgcctcctg catgtcctgc tgccctgagc tgtcccgagc 61 taggtgacag cgtaccacgc tgccaccatg aatgaggtgt ctgtcatcaa agaaggctgg 121 ctccacaagc gtggtgaata catcaagacc tggaggccac ggtacttcct gctgaagagc 181 gacggctcct tcattgggta caaggagagg cccgaggccc ctgatcagac tctacccccc 241 ttaaacaact tctccgtagc agaatgccag ctgatgaaga ccgagaggcc gcgacccaac 301 acctttgtca tacgctgcct gcagtggacc acagtcatcg agaggacctt ccacgtggat 361 tctccagacg agagggagga gtggatgcgg gccatccaga tggtcgccaa cagcctcaag 421 cagcgggccc caggcgagga ccccatggac tacaagtgtg gctcccccag tgactcctcc 481 acgactgagg agatggaagt ggcggtcagc aaggcacggg ctaaagtgac catgaatgac 541 ttcgactatc tcaaactcct tggcaaggga acctttggca aagtcatcct ggtgcgggag 601 aaggccactg gccgctacta cgccatgaag atcctgcgaa aggaagtcat cattgccaag 661 gatgaagtcg ctcacacagt caccgagagc cgggtcctcc agaacaccag gcacccgttc 721 ctcactgcgc tgaagtatgc cttccagacc cacgaccgcc tgtgctttgt gatggagtat 781 gccaacgggg gtgagctgtt cttccacctg tcccgggagc gtgtcttcac agaggagcgg 841 gcccggtttt atggtgcaga gattgtctcg gctcttgagt acttgcactc gcgggacgtg 901 gtataccgcg acatcaagct ggaaaacctc atgctggaca aagatggcca catcaagatc 961 actgactttg gcctctgcaa agagggcatc agtgacgggg ccaccatgaa aaccttctgt 1021 gggaccccgg agtacctggc gcctgaggtg ctggaggaca atgactatgg ccgggccgtg 1081 gactggtggg ggctgggtgt ggtcatgtac gagatgatgt gcggccgcct gcccttctac 1141 aaccaggacc acgagcgcct cttcgagctc atcctcatgg aagagatccg cttcccgcgc 1201 acgctcagcc ccgaggccaa gtccctgctt gctgggctgc ttaagaagga ccccaagcag 1261 aggcttggtg gggggcccag cgatgccaag gaggtcatgg agcacaggtt cttcctcagc 1321 atcaactggc aggacgtggt ccagaagaag ctcctgccac ccttcaaacc tcaggtcacg 1381 tccgaggtcg acacaaggta cttcgatgat gaatttaccg cccagtccat cacaatcaca 1441 ccccctgacc gctatgacag cctgggctta ctggagctgg accagcggac ccacttcccc 1501 cagttctcct actcggccag catccgcgag tgagcagtct gcccacgcag aggacgcacg 1561 ctcgctgcca tcaccgctgg gtggtttttt acccctgcc // LOCUS HUMALBGC 19002 bp DNA PRI 03-MAY-1996 DEFINITION Human serum albumin (ALB) gene, complete cds. ACCESSION M12523 J04457 NID g178343 KEYWORDS albumin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 19002) AUTHORS Minghetti,P.P., Ruffner,D.E., Kuang,W.J., Dennison,O.E., Hawkins,J.W., Beattie,W.G. and Dugaiczyk,A. TITLE Molecular structure of the human albumin gene is revealed by nucleotide sequence within q11-22 of chromosome 4 JOURNAL J. Biol. Chem. 261 (15), 6747-6757 (1986) MEDLINE 86196112 REFERENCE 2 (bases 17688 to 17755; 18526 to 18555) AUTHORS Minchiotti,L., Galliano,M., Iadarola,P., Meloni,M.L., Ferri,G., Porta,F. and Castellani,A.A. TITLE The molecular defect in a COOH-terminal-modified and shortened mutant of human serum albumin JOURNAL J. Biol. Chem. 264 (6), 3385-3389 (1989) MEDLINE 89123466 COMMENT Computer-readable sequence in [1] was kindly provided by A.Dugaiczyk, 01-JUL-1986. Draft entry and printed copy of sequence for [2] kindly provided by L.Minchiotti, 09-DEC-1988. [2] describes a missplicing event in alooalbumin Venezia by which exon 14 is not translated. The protein translation goes from exon 13 to exon 15, which is normally in the 3' flanking region. The 3' end of this incorrectly translated and prematurely ended protein is hydrophilic instead of hydrophobic [2]. FEATURES Location/Qualifiers source 1..19002 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_lib="T. Maniatis" /map="4q11-q13" /chromosome="4" exon 1737..1854 /gene="ALB" /number=1 gene 1737..18688 /gene="ALB" CDS join(1776..1854,2564..2621,4076..4208,6041..6252, 6802..6934,7759..7856,9444..9573,10867..11081, 12481..12613,13702..13799,14977..15115,15534..15757, 16941..17073,17688..17732) /gene="ALB" /note="precursor" /codon_start=1 /product="albumin" /db_xref="PID:g178344" /translation="MKWVTFISLLFLFSSAYSRGVFRRDAHKSEVAHRFKDLGEENFK ALVLIAFAQYLQQCPFEDHVKLVNEVTEFAKTCVADESAENCDKSLHTLFGDKLCTVA TLRETYGEMADCCAKQEPERNECFLQHKDDNPNLPRLVRPEVDVMCTAFHDNEETFLK KYLYEIARRHPYFYAPELLFFAKRYKAAFTECCQAADKAACLLPKLDELRDEGKASSA KQRLKCASLQKFGERAFKAWAVARLSQRFPKAEFAEVSKLVTDLTKVHTECCHGDLLE CADDRADLAKYICENQDSISSKLKECCEKPLLEKSHCIAEVENDEMPADLPSLAADFV ESKDVCKNYAEAKDVFLGMFLYEYARRHPDYSVVLLLRLAKTYETTLEKCCAAADPHE CYAKVFDEFKPLVEEPQNLIKQNCELFEQLGEYKFQNALLVRYTKKVPQVSTPTLVEV SRNLGKVGSKCCKHPEAKRMPCAEDYLSVVLNQLCVLHEKTPVSDRVTKCCTESLVNR RPCFSALEVDETYVPKEFNAETFTFHADICTLSEKERQIKKQTALVELVKHKPKATKE QLKAVMDDFAAFVEKCCKADDKETCFAEEGKKLVAASQAALGL" sig_peptide 1776..1829 /gene="ALB" CDS join(1776..1854,2564..2621,4076..4208,6041..6252, 6802..6934,7759..7856,9444..9573,10867..11081, 12481..12613,13702..13799,14977..15115,15534..15757, 16941..17073,18526..18555) /gene="ALB" /note="precursor" /codon_start=1 /product="alloalbumin Venezia" /db_xref="PID:g178345" /translation="MKWVTFISLLFLFSSAYSRGVFRRDAHKSEVAHRFKDLGEENFK ALVLIAFAQYLQQCPFEDHVKLVNEVTEFAKTCVADESAENCDKSLHTLFGDKLCTVA TLRETYGEMADCCAKQEPERNECFLQHKDDNPNLPRLVRPEVDVMCTAFHDNEETFLK KYLYEIARRHPYFYAPELLFFAKRYKAAFTECCQAADKAACLLPKLDELRDEGKASSA KQRLKCASLQKFGERAFKAWAVARLSQRFPKAEFAEVSKLVTDLTKVHTECCHGDLLE CADDRADLAKYICENQDSISSKLKECCEKPLLEKSHCIAEVENDEMPADLPSLAADFV ESKDVCKNYAEAKDVFLGMFLYEYARRHPDYSVVLLLRLAKTYETTLEKCCAAADPHE CYAKVFDEFKPLVEEPQNLIKQNCELFEQLGEYKFQNALLVRYTKKVPQVSTPTLVEV SRNLGKVGSKCCKHPEAKRMPCAEDYLSVVLNQLCVLHEKTPVSDRVTKCCTESLVNR RPCFSALEVDETYVPKEFNAETFTFHADICTLSEKERQIKKQTALVELVKHKPKATKE QLKAVMDDFAAFVEKCCKADDKETCFAEEPTMRIRERK" mat_peptide join(1848..1854,2564..2621,4076..4208,6041..6252, 6802..6934,7759..7856,9444..9573,10867..11081, 12481..12613,13702..13799,14977..15115,15534..15757, 16941..17073,18526..18552) /gene="ALB" /product="alloalbumin Venezia" mat_peptide join(1848..1854,2564..2621,4076..4208,6041..6252, 6802..6934,7759..7856,9444..9573,10867..11081, 12481..12613,13702..13799,14977..15115,15534..15757, 16941..17073,17688..17729) /gene="ALB" /product="albumin" intron 1855..2563 /gene="ALB" /number=1 exon 2564..2621 /gene="ALB" /number=2 intron 2622..4075 /gene="ALB" /number=2 repeat_region complement(3035..3041) /note="3' insertion target sequence" /rpt_type=direct repeat_region complement(3042..3334) /note="Alu I" /rpt_family="Alu" repeat_region complement(3335..3341) /note="5' insertion target sequence" repeat_region 3421..3427 /note="5' insertion target sequence" repeat_region 3428..3722 /note="Alu 2" /rpt_family="Alu" repeat_region 3725..3731 /note="3' insertion target sequence" exon 4076..4208 /gene="ALB" /number=3 intron 4209..6040 /gene="ALB" /number=3 exon 6041..6252 /gene="ALB" /number=4 intron 6253..6801 /gene="ALB" /number=4 exon 6802..6934 /gene="ALB" /number=5 intron 6935..7758 /gene="ALB" /number=5 exon 7759..7856 /gene="ALB" /number=6 intron 7857..9443 /gene="ALB" /number=6 exon 9444..9573 /gene="ALB" /number=7 intron 9574..10866 /gene="ALB" /number=7 repeat_region complement(9885..9892) /note="3' insertion target sequence" /rpt_type=direct repeat_region complement(9893..10191) /rpt_family="Alu" repeat_region complement(10192..10199) /note="5' insertion target sequence" /rpt_type=direct exon 10867..11081 /gene="ALB" /number=8 intron 11082..12480 /gene="ALB" /number=8 repeat_region complement(11906..11918) /note="3' insertion target sequence" /rpt_type=direct repeat_region complement(11921..12200) /rpt_family="Alu" repeat_region complement(12201..12213) /note="5' insertion target sequence" /rpt_type=direct exon 12481..12613 /gene="ALB" /number=9 intron 12614..13701 /gene="ALB" /number=9 exon 13702..13799 /gene="ALB" /number=10 intron 13800..14976 /gene="ALB" /number=10 exon 14977..15115 /gene="ALB" /number=11 intron 15116..15533 /gene="ALB" /number=11 repeat_region complement(15297..15481) /rpt_family="Alu" exon 15534..15757 /gene="ALB" /number=12 intron 15758..16940 /gene="ALB" /number=12 exon 16941..17073 /gene="ALB" /number=13 intron 17074..17687 /gene="ALB" /number=13 intron 17074..18525 /gene="ALB" /note="alloalbumin Venezia" /number=13 exon 17688..17755 /gene="ALB" /number=14 exon 17688..18499 /gene="ALB" /note="alb mRNA (alt.)" /number=14 intron 17756..18525 /gene="ALB" /number=14 exon 18526..18688 /gene="ALB" /note="alloalbumin Venezia" /number=14 BASE COUNT 5891 a 3215 c 3440 g 6456 t ORIGIN 629 bp upstream of SstI site; chromosome 4q11-22. 1 cctttcccag ggacttctac aaggaaaaag ctagagttgg ttactgactt ctaataaata 61 atgcctacaa tttctaggaa gttaaaagtt gacataattt atccaagaaa gaattatttt 121 cttaacttag aatagtttct tttttctttt cagatgtagg tttttctggc tttagaaaaa 181 atgcttgttt ttcttcaatg gaaaataggc acacttgttt tatgtctgtt catctgtagt 241 cagaaagaca agtctggtat ttcctttcag gactcccttg agtcattaaa aaaaatcttc 301 ctatctatct atgtatctat catccatcta gctttgattt tttcctcttc tgtgctttat 361 tagttaatta gtacccattt ctgaagaaga aataacataa gattatagaa aataatttct 421 ttcattgtaa gactgaatag aaaaaatttt ctttcattat aagactgagt agaaaaaata 481 atactttgtt agtctctgtg cctctatgtg ccatgaggaa atttgactac tggttttgac 541 tgactgagtt atttaattaa gtaaaataac tggcttagta ctaattattg ttctgtagta 601 tcagagaaag ttgttcttcc tactggttga gctcagtagt tcttcatatt ctgagcaaaa 661 gggcagaggt aggatagctt ttctgaggta gagataagaa ccttgggtag ggaaggaaga 721 tttatgaaat atttaaaaaa ttattcttcc ttcgctttgt ttttagacat aatgttaaat 781 ttattttgaa atttaaagca acataaaaga acatgtgatt tttctactta ttgaaagaga 841 gaaaggaaaa aaatatgaaa cagggatgga aagaatccta tgcctggtga aggtcaaggg 901 ttctcataac ctacagagaa tttggggtca gcctgtccta ttgtatatta tggcaaagat 961 aatcatcatc tcatttgggt ccattttcct ctccatctct gcttaactga agatcccatg 1021 agatatactc acactgaatc taaatagcct atctcagggc ttgaatcaca tgtgggccac 1081 agcaggaatg ggaacatgga atttctaagt cctatcttac ttgttattgt tgctatgtct 1141 ttttcttagt ttgcatctga ggcaacatca gctttttcag acagaatggc tttggaatag 1201 taaaaaagac acagaagccc taaaatatgt atgtatgtat atgtgtgtgt gcatgcgtga 1261 gtacttgtgt gtaaattttt cattatctat aggtaaaagc acacttggaa ttagcaatag 1321 atgcaatttg ggacttaact ctttcagtat gtcttatttc taagcaaagt atttagtttg 1381 gttagtaatt actaaacact gagaactaaa ttgcaaacac caagaactaa aatgttcaag 1441 tgggaaatta cagttaaata ccatggtaat gaataaaagg tacaaatcgt ttaaactctt 1501 atgtaaaatt tgataagatg ttttacacaa ctttaataca ttgacaaggt cttgtggaga 1561 aaacagttcc agatggtaaa tatacacaag ggatttagtc aaacaatttt ttggcaagaa 1621 tattatgaat tttgtaatcg gttggcagcc aatgaaatac aaagatgagt ctagttaata 1681 atctacaatt attggttaaa gaagtatatt agtgctaatt tccctccgtt tgtcctagct 1741 tttctcttct gtcaacccca cacgcctttg gcacaatgaa gtgggtaacc tttatttccc 1801 ttctttttct ctttagctcg gcttattcca ggggtgtgtt tcgtcgagat gcacgtaaga 1861 aatccatttt tctattgttc aacttttatt ctattttccc agtaaaataa agttttagta 1921 aactctgcat ctttaaagaa ttattttggc atttatttct aaaatggcat agtattttgt 1981 atttgtgaag tcttacaagg ttatcttatt aataaaattc aaacatccta ggtaaaaaaa 2041 aaaaaaggtc agaattgttt agtgactgta attttctttt gcgcactaag gaaagtgcaa 2101 agtaacttag agtgactgaa acttcacaga atagggttga agattgaatt cataactatc 2161 ccaaagacct atccattgca ctatgcttta tttaaaaacc acaaaacctg tgctgttgat 2221 ctcataaata gaacttgtat ttatatttat tttcatttta gtctgtcttc ttggttgctg 2281 ttgatagaca ctaaaagagt attagatatt atctaagttt gaatataagg ctataaatat 2341 ttaataattt ttaaaatagt attcttggta attgaattat tcttctgttt aaaggcagaa 2401 gaaataattg aacatcatcc tgagtttttc tgtaggaatc agagcccaat attttgaaac 2461 aaatgcataa tctaagtcaa atggaaagaa atataaaaag taacattatt acttcttgtt 2521 ttcttcagta tttaacaatc cttttttttc ttcccttgcc cagacaagag tgaggttgct 2581 catcggttta aagatttggg agaagaaaat ttcaaagcct tgtaagttaa aatattgatg 2641 aatcaaattt aatgtttcta atagtgttgt ttattattct aaagtgctta tatttccttg 2701 tcatcagggt tcagattcta aaacagtgct gcctcgtaga gttttctgcg ttgaggaaga 2761 tattctgtat ctgggctatc caataaggta gtcactggtc acatggctat tgagtacttc 2821 aaatatgaca agtgcaactg agaaacaaaa acttaaattg tatttaattg tagttaattt 2881 gaatgtatat agtcacatgt ggctaatggc tactgtattg gacagtacag ctctggaact 2941 tgcttggtgg aaaggacttt aatataggtt tcctttggtg gcttacccac taaatcttct 3001 ttacatagca agcattcctg tgcttagttg ggaatattta attttttttt ttttttaaga 3061 cagggtctcg ctctgtcgcc caggctggag tgcagtggcg caatctcggc tcactgcaaa 3121 ctccgctccc gggttcacgc cattctcctg cctcagcctc ccgagtagct gggactacag 3181 gcgcccgcca tcacgcccgg ctaatctttt gtatttttag tagagatggg gtttcaccgt 3241 gtgccaggat ggtctcaatc tcctgacatc gtgatctgcc cacctcggcc tcccaaagtg 3301 ctgggattac aggagtgagt caccgcgccc ggcctattta aatgtttttt aatctagtaa 3361 aaaatgagaa aattgttttt ttaaaagtct acctaatcct acaggctaat taaagacgtg 3421 tgtggggatc aggtgcggtg gttcacacct gtaatcccag cactttggaa ggctgatgca 3481 ggaggattgc ttgagcccag gagtacaaga ccagcctggg caagtctctt taaaaaaaac 3541 aaaacaaaca aacaaaaaaa ttaggcatgg tggcacatgc ctgtagtcct agctacttag 3601 gaggctgacg taggaggatc gtttggacct gagaggtcaa ggctacagtg agccatgatt 3661 gtgccactgc actccagcct gggtgacaga gtgagactct gtctcaaaaa agaaaaagga 3721 aatctgtggg gtttgtttta gttttaagta attctaagga ctttaaaaat gcctagtctt 3781 gacaattaga tctatttggc atacaatttg cttgcttaat ctatgtgtgt gcatagatct 3841 actgacacac gcatacatat aaacattagg gaactaccat tctctttgcg taggaagcca 3901 catatgccta tctaggcctc agatcatacc tgatatgaat aggctttctg gataatggtg 3961 aagaagatgt ataaaagata gaacctatac ccatacatga tttgttctct agcgtagcaa 4021 cctgttacat attaaagttt tattatacta catttttcta catcctttgt ttcagggtgt 4081 tgattgcctt tgctcagtat cttcagcagt gtccatttga agatcatgta aaattagtga 4141 atgaagtaac tgaatttgca aaaacatgtg ttgctgatga gtcagctgaa aattgtgaca 4201 aatcacttgt aagtacattc taattgtgga gattctttct tctgtttgaa gtaatcccaa 4261 gcatttcaaa ggaatttttt ttaagttttc tcaattatta ttaagtgtcc tgatttgtaa 4321 gaaacactaa aaagttgctc atagactgat aagccattgt ttcttttgtg atagagatgc 4381 tttagctatg tccacagttt taaaatcatt tctttattga gaccaaacac aacagtcatg 4441 gtgtatttaa atggcaattt gtcatttata aacacctctt tttaaaattt gaggtttggt 4501 ttctttttgt agaggctaat agggatatga tagcatgtat ttatttattt atttatctta 4561 ttttattata gtaagaaccc ttaacatgag atctaccctg ttatattttt aagtgtacaa 4621 tccattattg ttaactacgg gtacactgtt gtatagctta ctcatcttgc tgtattaaaa 4681 ctttgtgccc attgattagt aacccctcgt ttcgtcctcc cccagccact ggcaaccagc 4741 attatactct ttgattctat gagtttgact actttagcta ccttatataa gtggtattat 4801 gtactgttta tctttttatg actgacttat ttcccttagc atagtgcatt caaagtccaa 4861 ccatgttgtt gcctattgca gaatttcctt cttttcaagg ctgaataata ttccagtgca 4921 tgtgtgtacc acattttctt tatccattaa tttgttgatt gatagacatt taggttggtt 4981 ttctacatct tgactatcat gaatagtgtt gcaatgaaca caggagagct actatctctt 5041 agagatgata tcatggtttt tatcatcaga aaacacccac tgatttctat gctaattttg 5101 ttacctgggt ggaataatag tacagctata tattcctcat tttagatatc tttgtatttc 5161 tacatacaat aaaaaagcag agtacttagt catgttgaag aactttaaac ttttagtatt 5221 tccagatcaa tcttcaaaac aaggacaggt ttatctttct ctcaccactc aatctatata 5281 tacctcttgt gggcaaggcc agtttttatc actggagcct ttcccctttt tattatgtac 5341 ctctccctca cagcagagtc aggactttaa ctttacacaa tactatggct ctacatatga 5401 aatcttaaaa atacataaaa attaataaat tctgtctaga gtagtatatt ttccctgggg 5461 ttacgattac tttcataata aaaattagag ataaggaaag gactcattta ttggaaagtg 5521 attttaggta acatttctgg aagaaaaatg tctatatctt aatagtcact taatatatga 5581 tggattgtgt tactcctcag ttttcaatgg catatactaa aacatggccc tctaaaaagg 5641 gggcaaatga aatgagaaac tctctgaatg tttttctccc ctaggtgaat tcacctgctg 5701 cttagaagct tattttctct tgatttctgt tataatgatt gctcttaccc tttagtttta 5761 agtttcaaaa taggagtcat ataactttcc ttaaagctat tgactgtctt tttgtcctgt 5821 tttattcacc atgagttata gtgtgacagt taattcttat gaaaattata tagagatggt 5881 taaatcatca gaaactgtaa acctcgattg ggaggggaag cggattttta aatgatttcc 5941 tgaccaagct taaccagtat attaaatcct ttgtactgtt ctttggctat aaagaaaaaa 6001 ggtactgtcc agcaactgaa acctgctttc ttccatttag catacccttt ttggagacaa 6061 attatgcaca gttgcaactc ttcgtgaaac ctatggtgaa atggctgact gctgtgcaaa 6121 acaagaacct gagagaaatg aatgcttctt gcaacacaaa gatgacaacc caaacctccc 6181 ccgattggtg agaccagagg ttgatgtgat gtgcactgct tttcatgaca atgaagagac 6241 atttttgaaa aagtaagtaa tcagatgttt atagttcaaa attaaaaagc atggagtaac 6301 tccataggcc aacactctat aaaaattacc ataacaaaaa tattttcaac attaagactt 6361 ggaagttttg ttatgatgat tttttaaaga agtagtattt gataccacaa aattctacac 6421 agcaaaaaat atgatcaaag atattttgaa gtttattgaa acaggataca atctttctga 6481 aaaatttaag atagacaaat tatttaatgt attacgaaga tatgtatata tggttgttat 6541 aattgatttc gttttagtca gcaacattat attgccaaaa tttaaccatt tatgcacaca 6601 cacacacaca cacacacact taaccctttt ttccacatac ttaaagaatg acagagacaa 6661 gaccatcatg tgcaaattga gcttaattgg ttaattagat atctttggaa tttggaggtt 6721 ctggggagaa tgtcgattac aattatttct gtaatattgt ctgctataga aaagtgactg 6781 tttttctttt tcaaaattta gatacttata tgaaattgcc agaagacatc cttactttta 6841 tgccccggaa ctccttttct ttgctaaaag gtataaagct gcttttacag aatgttgcca 6901 agctgctgat aaagctgcct gcctgttgcc aaaggtatta tgcaaaagaa tagaaaaaaa 6961 gagttcatta tccaacctga ttttgtccat tttgtggcta gatttaggga acctgagtgt 7021 ctgatacaaa ctttccgaca tggtcaaaaa agccttcctt ttatctgtct tgaaaatctt 7081 tcatctttga aggcctacac tctcgtttct tcttttaaga tttgccaatg atgatctgtc 7141 agaggtaatc actgtgcatg tgtttaaaga tttcaccact ttttatggtg gtgatcacta 7201 tagtgaaata ctgaaacttg tttgtcaaat tgcacagcaa ggggacacag ttcttgttta 7261 tcttttcatg ataattttta gtagggaggg aattcaaagt agagaatttt actgcatcta 7321 gatgcctgag ttcatgcatt cattccataa atatatatta tggaatgctt tattttcttt 7381 tctgaggagt ttactgatgt tggtggagga gagactgaaa tgaattatac acaaaattta 7441 aaaattagca aaattgcagc ccctgggata ttagcgtact ctttctctga cttttctccc 7501 acttttaagg ctctttttcc tggcaatgtt tccagttggt ttctaactac atagggaatt 7561 ccgctgtgac cagaatgatc gaatgatctt tccttttctt agagagcaaa atcattattc 7621 gctaaaggga gtacttggga atttaggcat aaattatgcc ttcaaaattt aatttggcac 7681 agtctcatct gagcttatgg aggggtgttt catgtagaat ttttcttcta attttcatca 7741 aattattcct ttttgtagct cgatgaactt cgggatgaag ggaaggcttc gtctgccaaa 7801 cagagactca agtgtgccag tctccaaaaa tttggagaaa gagctttcaa agcatggtaa 7861 atacttttaa acatagttgg catctttata acgatgtaaa tgataatgct tcagtgacaa 7921 attgtacatt tttatgtatt ttgcaaagtg ctgtcaaata catttctttg gttgtctaac 7981 aggtagaact ctaatagagg taaaaatcag aatatcaatg acaatttgac attattttta 8041 atcttttctt ttctaaatag ttgaataatt tagaggacgc tgtccttttt gtcctaaaaa 8101 aagggacaga tatttaagtt ctatttattt ataaaatctt ggactcttat tctaatggtt 8161 cattattttt atagagctgt aggcatggtt ctttatttaa ttttttaaag ttatttttaa 8221 tttttgtgga tacagagtag gtatacatat ttacggggta tatgagatat tttgatataa 8281 gtatacaaca tatataatcc ctttatttaa ttttatcttc cccccaatga tctaaaacta 8341 tttgcttgtc cttttatgtc ttatagttaa attcagtcac caactaagtt gaagttactt 8401 cttatttttg catagctcca gctctgatct tcatctcatg tttttgcctg agcctctgtt 8461 ttcatattac ttagttggtt ctgggagcat actttaatag ccgagtcaag aaaaatacta 8521 gctgccccgt cacccacact cctcacctgc tagtcaacag caaatcaaca caacaggaaa 8581 taaaatgaaa ataatagaca ttatgcatgc tctctagaaa ctgtcaattg aactgtattt 8641 gctcatcatt cctaccatct acaccaccaa aatcaaccaa atttatgaaa aaaaaacagc 8701 cccaacataa aattatacac agataaacag gctatgattg gttttgggaa agaagtcacc 8761 tttacctgat ttaggcaact gtgaaatgac tagagaatga agaaaattag acgtttacat 8821 cttgtcatag agtttgaaga tagtgctgga tctttctttt tataagtaag atcaataaaa 8881 actccctcat tctgtagaag ttatgatttc ttttctaaga gacctttaga agtcagaaaa 8941 aatgtgtttc aattgagaaa aaagataact ggagtttgtg tagtacttcc cagattataa 9001 aatgcttttg tatgtattat ctaatttaat cctcaaaact tcttcaattt agcatgttgt 9061 catgacactg cagaggctga agctcagaga cgctgagccc tctgctaaca agtcctactg 9121 ctaacaagtg ataaagccag agctggaagt cacatctgga ctccaaacct gatgcttctc 9181 agcctgttgc cccttttaga gttccttttt aatttctgct tttatgactt gctagatttc 9241 tacctaccac acacactctt aaatggataa ttctgcccta aggataagtg attaccattt 9301 ggttcagaac tagaactaat gaattttaaa aattatttct gtatgtccat tttgaatttt 9361 cttatgagaa atagtatttg cctagtgttt tcatataaaa tatcgcatga taataccatt 9421 ttgattggcg attttctttt tagggcagta gctcgcctga gccagagatt tcccaaagct 9481 gagtttgcag aagtttccaa gttagtgaca gatcttacca aagtccacac ggaatgctgc 9541 catggagatc tgcttgaatg tgctgatgac agggtaaaga gtcgtcgata tgctttttgg 9601 tagcttgcat gctcaagttg gtagaatgga tgcgtttggt atcattggtg atagctgaca 9661 gtgggttgag attgtcttct gtgctttcgt ctgtcctatc ttcaatcttt ccctgcctat 9721 ggtggtggta cctttctgtt tttaacctgc tataaattac cagataaacc cattcactga 9781 tttgtaactc ctttcagtca tgctctaact gtaaatgaag gcttaaactg aagtagaaca 9841 gttacaaggt tttacttggc agaacatctt gcaaggtaga tgtctaagaa gatttttttt 9901 tcttttttta agacagagtt tcgctcttgt ttcccaggct ggggtgcaat ggtgtgatct 9961 tggctcagcg caacctctgc ctcctgggtt caagtgattt tcatgcctca gcctcccaag 10021 tagctgggat tacaggcatg cgccaccaca cctggctaat tttgtatttt tagtagaggc 10081 ggggtttcac catattgtcc agactggtct cgaactcctg acctcaggtg atccacccgc 10141 cttggcctcc caaagtgctg ggattacagg catgagccac cttgcccagc ctaagaagat 10201 tttttgaggg aggtaggtgg acttggagaa ggtcactact tgaagagatt tttggaaatg 10261 atgtattttt cttctctata ttccttccct taattaactc tgtttgttag atgtgcaaat 10321 atttggaatg atatctcttt tctcaaaact tataatattt tctttctccc tttcttcaag 10381 attaaactta tgggcaaata ctagaatcct aatctctcat ggcactttct ggaaaattta 10441 aggcggttat tttatatatg taagcagggc ctatgactat gatcttgact catttttcaa 10501 aaatcttcta tattttattt agttatttgg tttcaaaagg cctgcactta attttggggg 10561 attatttgga aaaacagcat tgagttttaa tgaaaaaaac ttaaatgccc taacagtaga 10621 aacataaaat taataaataa ctgagctgag cacctgctac tgattagtct attttaatta 10681 agtgggaatg tttttgtagt cctatctaca tctccaggtt taggagcaaa cagagtatgt 10741 tcatagaagg aatatgtgta tggtcttaga atacaatgaa catgttctgc caacttaata 10801 aaggtctgag gagaaagtgt agcaatgtca attcgtgttg aacaatttcc accaacttac 10861 ttataggcgg accttgccaa gtatatctgt gaaaatcaag attcgatctc cagtaaactg 10921 aaggaatgct gtgaaaaacc tctgttggaa aaatcccact gcattgccga agtggaaaat 10981 gatgagatgc ctgctgactt gccttcatta gctgctgatt ttgttgaaag taaggatgtt 11041 tgcaaaaact atgctgaggc aaaggatgtc ttcctgggca tgtaagtaga taagaaatta 11101 ttcttttata gctttggcat gacctcacaa cttaggagga tagcctaggc ttttctgtgg 11161 agttgctaca atttccctgc tgcccagaat gtttcttcat ccttcccttt cccaggcttt 11221 aacaattttt gaaatagtta attagttgaa tacattgtca taaaataata catgttcacg 11281 gcaaagctca acattcctta ctccttaggg gtatttctga aaatacgtct agaaacattt 11341 tgtgtatata taaattatgt atacttcagt cattcattcc aagtgtattt cttgaacatc 11401 tataatatat gtgtgtgact atgtattgcc tgtctatcta actaatctaa tctaatctag 11461 tctatctatc taatctatgc aatgatagca aagaagtata aaaagaaata tagagtctga 11521 cacaggtgct ttatatttgg tgaaaagacc agaagttcag tataatggca atatggtagg 11581 caactcaatt acaaaataaa tgtttacgta ttgtcagaag ttgtggtgat aaactgcatt 11641 tttgttgttg gattatgata atgcactaaa taatatttcc taaaattatg taccctacaa 11701 gatttcactc atacagagaa gaaagagaat attttaagaa catatctctg cccatctatt 11761 tatcagaatc cttttgagat gtagtttaaa tcaaacaaaa tgttaataaa aataacaagt 11821 atcattcatc aaagacttca tatgtgccaa gcagtgtgtg ctttgtgtag attatgtcat 11881 atagttctca taatccacct tccgagacag atactattta ttttttgaga cagagtttta 11941 ctcttgttgc ccaggctgga gtgcaatggt gccatctcgg ctcaccacaa ccttcgcctc 12001 ccaggttcaa gcgattctcc tgcctcagcc tcctgggatt acaggcatgc accaccatgc 12061 ctggctaatt ttgtattttt agtagagatg gggtttcacc atgttggtca gactggtctc 12121 aaactcctga cctctggtga tatgcctgcc tcagcctcct aaagtgctgg gattacaggc 12181 atgagccact gtgcccagcc gacagatact attattattt ccattctacc gagaaggaga 12241 ctaaggctct gatcatttaa ataagttgcc taaggtgatg cagtgatata agtagcagag 12301 ctaggaattg agccttggta actttaactc tggaccccaa gtccttagct actaagcttt 12361 actgcatggg gtttagtcaa attaagactt ttggaatatg agttactttt gagattagct 12421 ttgtgatatt ttttgtgctc atttgtccaa caaagtctat tttattttca tcttaattag 12481 gtttttgtat gaatatgcaa gaaggcatcc tgattactct gtcgtgctgc tgctgagact 12541 tgccaagaca tatgaaacca ctctagagaa gtgctgtgcc gctgcagatc ctcatgaatg 12601 ctatgccaaa gtggtaggtt tattgttgga aaaaaatgta gttctttgac tgatgattcc 12661 aataatgaga aagaaaaata atgcaagaat gtaaaatgat atacagtgca atttagatct 12721 tttcttgaga tggtttcaat tctggaatct taaacatgaa agaaaaagta gccttagaat 12781 gattaacaaa atttagacta gttagaatag aaagatctga atagagcaat ctctaaaaaa 12841 ttttgatctt tttttctctt tttcacaatc ctgagaacaa aaaaaaatta aatttaaatg 12901 ttaattagaa gatatttaac ttagatgtaa agtgagttaa cctgattcca ggattaatca 12961 agtactagaa ttagtatctt atggcaaatt atagaaccta tccctttaga atattttcaa 13021 atctttttga ggatgtttag gaatagtttt acaagaaatt aagttaggag aggaaatctg 13081 ttctggagga tttttagggt tcccactagc atatgtaatg gtttctgaac tattcagaat 13141 cagagaaaac tcatttttcc tgctttcaag aagctactgt atgccaggca ccatgcacaa 13201 acaatgacca acgtaaaatc tctcattttg gagagcctgg aatctaactg gaaaggtgaa 13261 ctaataataa taatatgtac aatcatagcc atcatttatt aaacttttat tatatgcaag 13321 gcactgttta atttcattag cttacctggt ttacagagca gctctatgag atgagtgcca 13381 tctttgcccc tattttaggg ataaggattc cgaaatgtgg agatggtaag taaaattgca 13441 caactgaaga atgagttaca tgacttggct caaatactgg tcattgaact ccagagcctg 13501 aatattctta accacttaca tgatgcaagc tcaccaaata aatagttcga atgtattgtg 13561 acagagcggc attgatattc atctattcat gtggctttga gtaggaagaa gaaaggatat 13621 cattctgacc agaggggtga aaaacaacct gcatctgatc ctgaggcata atactattaa 13681 cacaattctt ttatgtttca gttcgatgaa tttaaacctc ttgtggaaga gcctcagaat 13741 ttaatcaaac aaaattgtga gctttttgag cagcttggag agtacaaatt ccagaatgcg 13801 taagtaattt ttattgactg atttttttta tcaatttgta attatttaag acttaatata 13861 tgagccacct agcatagaac ttttaagaat gaaaatacat tgcatatttc taatcactct 13921 ttgtcaagaa agataggaga ggagagataa aatagttgat ggggtggaga ggtctatatt 13981 tgaatgtagt ctaaaaattg ttctcttaag attggaagta tgtaggctgg gagggtaaat 14041 accaaatctt ggtatatcag aactgagcat gtcccttgaa ggttaagaaa tagttaatgg 14101 gcaaatagag catggcaata ttttgtagag cagcaagtag taggccttga atagatgtcg 14161 ctcaaaaagt aatatgtaag ctgaacacaa aaatgtaaca aatgaattta gatacatatt 14221 tgaatattaa attcaggttg tttgggagat gcacctagtc tttgatggtt aaacctttcc 14281 ctccatagaa gagacagaga cagaatggct tgctggacta atgtcccaat tcaatagagt 14341 cttatctacg aaggttaaaa acaagaagag acatattata cagtagatat ttattgtgtg 14401 gctcatacac atggtgctct tctgattatg gattttagag ataataacag tgaacaagac 14461 atagtttctt tcctcgagta gattaaagtc atacattgac ttttaatggt gactggcatt 14521 cttaatacat gattattata tattaggtac catgtcagat taattataat actttactat 14581 ttttaattta acccttgaac tatccctatt gagtcagata tatttccttc cattttctac 14641 ttgtatcttt caagtttagc atatgctgat acatatgaag ctctctccag gttttattga 14701 aagaagaaat taataaattt attaatgtca ctgaattagg caactcactt tcccaagatt 14761 atgcaagtgg tacaggtgga actcaaagcc aagtttaact agttgttcag gagaatgttt 14821 tctaccctcc actaacccac tactctgcag atggagataa tatgatgaat ggaacatagc 14881 aacatcttag ttgattccgg ccaagtgttc tctgttttat ctactatgtt agacagtttc 14941 ttgccttgct gaaaacacat gacttctttt tttcaggcta ttagttcgtt acaccaagaa 15001 agtaccccaa gtgtcaactc caactcttgt agaggtctca agaaacctag gaaaagtggg 15061 cagcaaatgt tgtaaacatc ctgaagcaaa aagaatgccc tgtgcagaag actatgtgag 15121 tctttaaaaa aatataataa attaataatg aaaaaatttt acctttagat attgataatg 15181 ctagctttca taagcagaag gaagtaatgt gtgtgtgtgc atgtttgtgt gcatgtgtgt 15241 gtgcatgcac gtgtgtgtat gtgtgatatt ggcagtcaag gccccgagga tgataatttt 15301 tttttttttt ttgagacgga gtctcgcttt gttgtccagg ctggagtgca gtggtgccat 15361 ctcggctcac tgcaacctcc gcctcccaag ttcaagccat tctcctgcct cagcctccca 15421 agtagctggg actacaggtg catgccacca tgcctggcta attttttgta tttttagtag 15481 aaaattttca gcttcacctc ttttgaattt ctgctctcct gcctgttctt tagctatccg 15541 tggtcctgaa ccagttatgt gtgttgcatg agaaaacgcc agtaagtgac agagtcacca 15601 aatgctgcac agaatccttg gtgaacaggc gaccatgctt ttcagctctg gaagtcgatg 15661 aaacatacgt tcccaaagag tttaatgctg aaacattcac cttccatgca gatatatgca 15721 cactttctga gaaggagaga caaatcaaga aacaaacgtg aggagtattt cattactgca 15781 tgtgtttgta gtcttgatag caagaactgt caattcaagc tagcaacttt ttcctgaagt 15841 agtgattata tttcttagag gaaagtattg gagtgttgcc cttattatgc tgataagagt 15901 acccagaata aaatgaataa ctttttaaag acaaaatcct ctgttataat attgctaaaa 15961 ttattcagag taatattgtg gattaaagcc acaatagaat aacatgttag accatattca 16021 gtagaaaaag atgaacaatt aactgataaa tttgtgcaca tggcaaatta gttaatggga 16081 accataggag aatttatttc tagatgtaaa taattatttt aagtttgccc tatggtggcc 16141 ccacacatga gacaaacccc caagatgtga cttttgagaa tgagacttgg ataaaaaaca 16201 tgtagaaatg caagccctga agctcaactc cctattgcta tcacaggggt tataattgca 16261 taaaatttag ctatagaaag ttgctgtcat ctcttgtggg ctgtaatcat cgtctaggct 16321 taagagtaat attgcaaaac ctgtcatgcc cacacaaatc tctccctggc attgttgtct 16381 ttgcagatgt cagtgaaaga gaaccagcag ctcccatgag tttggatagc cttattttct 16441 atagcctccc cactgaaggg agcaaagttt aagaaccaaa tataaagttt ctcatcttta 16501 tagatgagaa aaattttaaa taaagtccaa gataattaaa tttttaagga tcatttttag 16561 ctctttaata gcaataaaac tcaatatgac ataatatggc acttccaaaa tctgaataat 16621 atataattgc aatgacatac ttcttttcag agatttactg aaaagaaatt tgttgacact 16681 acataacgtg atgagtggtt tatactgatt gtttcagttg gtcttcccac caactccatg 16741 aaagtggatt ttattatcct catcatgcag atgagaatat tgagacttat agcggtatgc 16801 ctggcccaag tactcagagt tgcctggctc caagatttat aatcttaaat gatgggacta 16861 ccatccttac tctctccatt tttctatacg tgagtaatgt tttttctgtt tttttttttt 16921 ctttttccat tcaaactcag tgcacttgtt gagctcgtga aacacaagcc caaggcaaca 16981 aaagagcaac tgaaagctgt tatggatgat ttcgcagctt ttgtagagaa gtgctgcaag 17041 gctgacgata aggagacctg ctttgccgag gaggtactac agttctcttc attttaatat 17101 gtccagtatt catttttgca tgtttggtta ggctagggct tagggattta tatatcaaag 17161 gaggctttgt acatgtggga cagggatctt attttacaaa caattgtctt acaaaatgaa 17221 taaaacagca ctttgttttt atctcctgct ctattgtgcc atactgttga atgtttataa 17281 tgcatgttct gtttccaaat ttgtgatgct tatgaatatt aataggaata tttgtaaggc 17341 ctgaaatatt ttgatcatga aatcaaaaca ttaatttatt taaacattta cttgaaatgt 17401 ggtggtttgt gatttagttg attttatagg ctagtgggag aatttacatt caaatgtcta 17461 aatcacttaa aatttccctt tatggcctga cagtaacttt tttttattca tttggggaca 17521 actatgtccg tgagcttcca tccagagatt atagtagtaa attgtaatta aaggatatga 17581 tgcacgtgaa atcactttgc aatcatcaat agcttcataa atgttaattt tgtatcctaa 17641 tagtaatgct aatattttcc taacatctgt catgtctttg tgttcagggt aaaaaacttg 17701 ttgctgcaag tcaagctgcc ttaggcttat aacatcacat ttaaaagcat ctcaggtaac 17761 tatattttga attttttaaa aaagtaacta taatagttat tattaaaata gcaaagattg 17821 accatttcca agagccatat agaccagcac cgaccactat tctaaactat ttatgtatgt 17881 aaatattagc ttttaaaatt ctcaaaatag ttgctgagtt gggaaccact attatttcta 17941 ttttgtagat gagaaaatga agataaacat caaagcatag attaagtaat tttccaaagg 18001 gtcaaaattc aaaattgaaa ccaaggtttc agtgttgccc attgtcctgt tctgacttat 18061 atgatgcggt acacagagcc atccaagtaa gtgatggctc agcagtggaa tactctggga 18121 attaggctga accacatgaa agagtgcttt atagggcaaa aacagttgaa tatcagtgat 18181 ttcacatggt tcaacctaat agttcaactc atcctttcca ttggagaata tgatggatct 18241 accttctgtg aactttatag tgaagaatct gctattacat ttccaatttg tcaacatgct 18301 gagctttaat aggacttatc ttcttatgac aacatttatt ggtgtgtccc cttgcctagc 18361 ccaacagaag aattcagcag ccgtaagtct aggacaggct taaattgttt tcactggtgt 18421 aaattgcaga aagatgatct aagtaatttg gcatttattt taataggttt gaaaaacaca 18481 tgccatttta caaataagac ttatatttgt ccttttgttt ttcagcctac catgagaata 18541 agagaaagaa aatgaagatc aaaagcttat tcatctgttt ttctttttcg ttggtgtaaa 18601 gccaacaccc tgtctaaaaa acataaattt ctttaatcat tttgcctctt ttctctgtgc 18661 ttcaattaat aaaaaatgga aagaatctaa tagagtggta cagcactgtt atttttcaaa 18721 gatgtgttgc tatcctgaaa attctgtagg ttctgtggaa gttccagtgt tctctcttat 18781 tccacttcgg tagaggattt ctagtttctt gtgggctaat taaataaatc attaatactc 18841 ttctaagtta tggattataa acattcaaaa taatattttg acattatgat aattctgaat 18901 aaaagaacaa aaaccatggt ataggtaagg aatataaaac atggctttta ccttagaaaa 18961 aacaattcta aaattcatat ggaatcaaaa aagagcctgc ag // LOCUS HUMALBP 634 bp mRNA PRI 31-OCT-1994 DEFINITION Human adipocyte lipid-binding protein, complete cds. ACCESSION J02874 NID g178346 KEYWORDS adipocyte lipid-binding protein. SOURCE Human adipose, cDNA to mRNA, clone lambda-H-ALBP. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 634) AUTHORS Baxa,C.A., Sha,R.S., Buelt,M.K., Smith,A.J., Matarese,V., Chinander,L.L., Boundy,K.L. and Bernlohr,D.A. TITLE Human adipocyte lipid-binding protein: purification of the protein and cloning of its complementary DNA JOURNAL Biochemistry 28 (22), 8683-8690 (1989) MEDLINE 90105397 COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted R.Sha, 07-AUG-89. FEATURES Location/Qualifiers source 1..634 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Unassigned" gene 63..461 /gene="FABP4" CDS 63..461 /gene="FABP4" /note="adipocyte lipid-binding protein" /codon_start=1 /db_xref="GDB:G00-128-030" /db_xref="PID:g178347" /translation="MCDAFVGTWKLVSSENFDDYMKEVGVGFATRKVAGMAKPNMIIS VNGDVITIKSESTFKNTEISFILGQEFDEVTADDRKVKSTITLDGGVLVHVQKWDGKS TTIKRKREDDKLVVECVMKGVTSTRVYERA" polyA_signal 613..618 BASE COUNT 200 a 108 c 151 g 175 t ORIGIN 2 bp upstream of EcoRI site. 1 ggaattccag gagggtgcag cttccttctc accttgaaga ataatcctag aaaactcaca 61 aaatgtgtga tgcttttgta ggtacctgga aacttgtctc cagtgaaaac tttgatgatt 121 atatgaaaga agtaggagtg ggctttgcca ccaggaaagt ggctggcatg gccaaaccta 181 acatgatcat cagtgtgaat ggggatgtga tcaccattaa atctgaaagt acctttaaaa 241 atactgagat ttccttcata ctgggccagg aatttgacga agtcactgca gatgacagga 301 aagtcaagag caccataacc ttagatgggg gtgtcctggt acatgtgcag aaatgggatg 361 gaaaatcaac caccataaag agaaaacgag aggatgataa actggtggtg gaatgcgtca 421 tgaaaggcgt cacttccacg agagtttatg agagagcata agccaaggga cgttgacctg 481 gactgaagtt cgcattgaac tctacaacat tctgtgggat atattgttca aaaagatatt 541 gttgttttcc ctgatttagc aagcaagtaa ttttctccca agctgatttt attcaatatg 601 gttacgttgg ttaaataact ttttttagat ttag // LOCUS HUMALCAM 2539 bp mRNA PRI 07-AUG-1995 DEFINITION Homo sapiens CD6 ligand (ALCAM) mRNA, complete cds. ACCESSION L38608 NID g886257 KEYWORDS T-cell glycoprotein CD6; alcam CD6 ligand. SOURCE Homo sapiens cDNA to mRNA; and Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2539) AUTHORS Bowen,M.A., Patel,D.D., Li,X., Modrell,B., Malacko,A.R., Wang,W.C., Marquardt,H., Neubauer,M., Pesando,J.M., Francke,U. et,al. TITLE Cloning, mapping, and characterization of activated leukocyte-cell adhesion molecule (ALCAM), a CD6 ligand JOURNAL J. Exp. Med. 181 (6), 2213-2220 (1995) MEDLINE 95279947 FEATURES Location/Qualifiers source 1..2539 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="PHA activated T-cells" 5'UTR 1..63 /partial sig_peptide 64..144 CDS 64..1815 /note="precursor" /codon_start=1 /product="alcam" /db_xref="PID:g886258" /translation="MESKGASSCRLLFCLLISATVFRPGLGWYTVNSAYGDTIIIPCR LDVPQNLMFGKWKYEKPDGSPVFIAFRSSTKKSVQYDDVPEYKDRLNLSENYTLSISN ARISDEKRFVCMLVTEDNVFEAPTIVKVFKQPSKPEIVSKALFLETEQLKKLGDCISE DSYPDGNITWYRNGKVLHPLEGAVVIIFKKEMDPVTQLYTMTSTLEYKTTKADIQMPF TCSVTYYGPSGQKTIHSEQAVFDIYYPTEQVTIQVLPPKNAIKEGDNITLKCLGNGNP PPEEFLFYLPGQPEGIRSSNTYTLMDVRRNATGDYKCSLIDKKSMIASTAITVHYLDL SLNPSGEVTRQIGDALPVSCTISASRNATVVWMKDNIRLRSSPSFSSLHYQDAGNYVC ETALQEVEGLKKRESLTLIVEGKPQIKMTKKTDPSGLSKTIICHVEGFPKPAIQWTIT GSGSVINQTEESPYINGRYYSKIIISPEENVTLTCTAENQLERTVNSLNVSAISIPEH DEADEISDENREKVNDQAKLIVGIVVGLLLAALVAGVVYWLYMKKSKTASKHVNKDLG NMEENKKLEENNHKTEA" mat_peptide 145..1812 /product="alcam" variation 836 /note="g in clone HL60, Asn->Ser" /replace="g" variation 965 /note="c in clone HL60, Met->Thr" /replace="c" 3'UTR 1813..2539 /partial BASE COUNT 842 a 512 c 527 g 658 t ORIGIN 1 cgggacgacg ccccctcctg cggcgtggac tccgtcagtg gcccaccaag aaggaggagg 61 aatatggaat ccaagggggc cagttcctgc cgtctgctct tctgcctctt gatctccgcc 121 accgtcttca ggccaggcct tggatggtat actgtaaatt cagcatatgg agataccatt 181 atcatacctt gccgacttga cgtacctcag aatctcatgt ttggcaaatg gaaatatgaa 241 aagcccgatg gctccccagt atttattgcc ttcagatcct ctacaaagaa aagtgtgcag 301 tacgacgatg taccagaata caaagacaga ttgaacctct cagaaaacta cactttgtct 361 atcagtaatg caaggatcag tgatgaaaag agatttgtgt gcatgctagt aactgaggac 421 aacgtgtttg aggcacctac aatagtcaag gtgttcaagc aaccatctaa acctgaaatt 481 gtaagcaaag cactgtttct cgaaacagag cagctaaaaa agttgggtga ctgcatttca 541 gaagacagtt atccagatgg caatatcaca tggtacagga atggaaaagt gctacatccc 601 cttgaaggag cggtggtcat aatttttaaa aaggaaatgg acccagtgac tcagctctat 661 accatgactt ccaccctgga gtacaagaca accaaggctg acatacaaat gccattcacc 721 tgctcggtga catattatgg accatctggc cagaaaacaa ttcattctga acaggcagta 781 tttgatattt actatcctac agagcaggtg acaatacaag tgctgccacc aaaaaatgcc 841 atcaaagaag gggataacat cactcttaaa tgcttaggga atggcaaccc tcccccagag 901 gaatttttgt tttacttacc aggacagccc gaaggaataa gaagctcaaa tacttacaca 961 ctgatggatg tgaggcgcaa tgcaacagga gactacaagt gttccctgat agacaaaaaa 1021 agcatgattg cttcaacagc catcacagtt cactatttgg atttgtcctt aaacccaagt 1081 ggagaagtga ctagacagat tggtgatgcc ctacccgtgt catgcacaat atctgctagc 1141 aggaatgcaa ctgtggtatg gatgaaagat aacatcaggc ttcgatctag cccgtcattt 1201 tctagtcttc attatcagga tgctggaaac tatgtctgcg aaactgctct gcaggaggtt 1261 gaaggactaa agaaaagaga gtcattgact ctcattgtag aaggcaaacc tcaaataaaa 1321 atgacaaaga aaactgatcc cagtggacta tctaaaacaa taatctgcca tgtggaaggt 1381 tttccaaagc cagccattca gtggacaatt actggcagtg gaagcgtcat aaaccaaaca 1441 gaggaatctc cttatattaa tggcaggtat tatagtaaaa ttatcatttc ccctgaagag 1501 aatgttacat taacttgcac agcagaaaac caactggaga gaacagtaaa ctccttgaat 1561 gtctctgcta taagtattcc agaacacgat gaggcagacg agataagtga tgaaaacaga 1621 gaaaaggtga atgaccaggc aaaactaatt gtgggaatcg ttgttggtct cctccttgct 1681 gcccttgttg ctggtgtcgt ctactggctg tacatgaaga agtcaaagac tgcatcaaaa 1741 catgtaaaca aggacctcgg taatatggaa gaaaacaaaa agttagaaga aaacaatcac 1801 aaaactgaag cctaagagag aaactgtcct agttgtccag agataaaaat catatagacc 1861 aattgaagca tgaacgtgga ttgtatttaa gacataaaca aagacattga cagcaattca 1921 tggttcaagt attaagcagt tcattctacc aagctgtcac aggttttcag agaattatct 1981 caagtaaaac aaatgaaatt taattacaaa caataagaac aagttttggc agccatgata 2041 ataggtcata tgttgtgttt ggttcaattt tttttccgta aatgtctgca ctgaggattt 2101 ctttttggtt tgccttttat gtaaattttt tacgtagcta tttttataca ctgtaagctt 2161 tgttctggga gttgctgtta atctgatgta taatgtaatg tttttatttc aattgtttat 2221 atggataatc tgagcaggta catttctgat tctgattgct atcagcaatg ccccaaactt 2281 tctcataagc acctaaaacc caaaggtggc agcttgtgaa gattggggac actcatattg 2341 ccctaattaa aaactgtgat ttttatcaca agggagggga ggccgagagt cagactgata 2401 gacaccatag gagccgactc tttgatatgc caccagcgaa ctctcagaaa taaatcacag 2461 atgcatatag acacacatac ataatggtac tcccaaactg acaattttac ctattctgaa 2521 aaagacataa aacagaatt // LOCUS HUMALD 1382 bp mRNA PRI 15-MAR-1989 DEFINITION Human fructose 1,6-bisphosphatase mRNA, complete cds. ACCESSION M19922 NID g178348 KEYWORDS fructose-1,6-bisphosphatase. SOURCE Human clonal variant cells of HL-60 origin DNA, clone pD3-137. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1382) AUTHORS Solomon,D.H., Raynal,M.-C., Tejwani,G.A. and Cayre,Y.E. TITLE Activation of the fructose 1,6 bisphosphatase gene by 1, 25-dihydroxyvitamin D3 during monocytic differentiation JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 6904-6908 (1988) MEDLINE 88320544 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by D.Solomon, 21-JUL-1988. FEATURES Location/Qualifiers source 1..1382 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 199..1215 /note="fructose 1,6-bisphosphatase (EC 3.1.3.11)" /codon_start=1 /db_xref="PID:g178349" /translation="MADQAPFDTDVNTLTRFVMEEGRKARGTGELTQLLNSLCTAVKA ISSAVRKAGIAHLYGIAGSTNVTGDQVKKLDVLSNDLVMNMLKSSFATCVLVSEEDKH AIIVEPEKRGKYVVCFDPLDGSSNIDCLVSVGTIFGIYRKKSTDEPSEKDALQPGRNL VAAGYALYGSATMLVLAMDCGVNCFMLDPAIGEFILVDKDVKIKKKGKIYSLNEGYAK DFDPAVTEYIQRKKFPPDNSAPYGARYVGSMVADVHRTLVYGGIFLYPANKKSPNGKL RLLYECNPMAYVMEKAGGMATTGKEAVLDVIPTDIHQRAPVILGSPDDVLEFLKVYEK HSAQ" BASE COUNT 313 a 408 c 368 g 293 t ORIGIN Unreported. 1 agttgcacca cgagcgctgc ggacactcgg gcggccagtc ggtctgtcag tcctcccgcc 61 aggtcccggc ccgcacctgc cgcccgcacc tgcagctcgc acctgcggcc agtgcctact 121 gccctctctt gccgcccgca cctgcagccc cgcacctgcc gcttgcacct gcagccccgc 181 gctctacccg gttcaagcat ggctgaccag gcgcccttcg acacggacgt caacaccctg 241 acccgcttcg tcatggagga gggcaggaag gcccggggca cgggcgagtt gacccagctg 301 ctcaactcgc tctgcacagc agtcaaagcc atctcttcgg cggtgcgcaa ggcgggcatc 361 gcgcacctct atggcattgc tggttctacc aacgtgacag gtgatcaagt taagaagctg 421 gacgtcctct ccaacgacct ggttatgaac atgttaaagt catcctttgc cacgtgtgtt 481 ctcgtgtcag aagaagataa acacgccatc atagtggaac cggagaaaag gggtaaatat 541 gtggtctgtt ttgatcccct tgatggatct tccaacatcg attgccttgt gtccgttgga 601 accatttttg gcatctatag aaagaaatca actgatgagc cttctgagaa ggatgctctg 661 caaccaggcc ggaacctggt ggcagccggc tacgcactgt atggcagtgc caccatgctg 721 gtccttgcca tggactgtgg ggtcaactgc ttcatgctgg acccggccat cggggagttc 781 attttggtgg acaaggatgt gaagataaaa aagaaaggta aaatctacag ccttaacgag 841 ggctacgcca aggactttga ccctgccgtc actgagtaca tccagaggaa gaagttcccc 901 ccagataatt cagctcctta tggggcccgg tatgtgggct ccatggtggc tgatgttcat 961 cgcactctgg tctacggagg gatatttctg taccccgcta acaagaagag ccccaatgga 1021 aagctgagac tgctgtacga atgcaacccc atggcctacg tcatggagaa ggctggggga 1081 atggccacca ctgggaagga ggccgtgtta gacgtcattc ccacagacat tcaccagagg 1141 gcgccggtga tcttgggatc ccccgacgac gtgctcgagt tcctgaaggt gtatgagaag 1201 cactctgccc agtgagcacc tgccctgcct gcatccggag aattgcctct acctggacct 1261 tttgtctcac acagcagtac cctgacctgc tgtgcacctt acattcctag agagcagaaa 1321 taaaaagcat gactatttcc accatcaaat gctgtagaat gcttggcact ccctaaccaa 1381 aa // LOCUS HUMALDHIII 1636 bp mRNA PRI 31-OCT-1994 DEFINITION Human aldehyde dehydrogenase type III (ALDHIII) mRNA, complete cds. ACCESSION M74542 NID g178401 KEYWORDS aldehyde dehydrogenase III. SOURCE Homo sapiens mucosa cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1636) AUTHORS Schuuring,E.M.D., Verhoeven,E., Eckey,R., Vos,H.L. and Michalides,R.J.A.M. TITLE Cloning and complete nucleotide sequence of a cDNA encoding the full-length open reading frame of the human aldehyde dehydrogenase type III gene JOURNAL Unpublished (1991) FEATURES Location/Qualifiers source 1..1636 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="UMSCC2" /cell_type="squamous cell carcinoma" /tissue_type="mucosa" /map="17" gene 43..1610 /gene="ALDH3" CDS 43..1404 /gene="ALDH3" /codon_start=1 /db_xref="GDB:G00-118-992" /product="aldehyde dehydrogenase type III" /db_xref="PID:g178402" /translation="MSKISEAVKRARAAFSSGRTRPLQFRFQQLEALQRLIQEQEQEL VGALAADLHKNEWNAYYEEVVYVLEEIEYMIQKLPEWAADEPVEKTPQTQQDELYIHS EPLGVVLVIGTWNYPFNLTIQPMVGAIAAGNAVVLKPSELSENMASLLATIIPQYLDK DLYPVINGGVPETTELLKERFDHILYTGSTGVGKIIMTAAAKHLTPVTLELGGKSPCY VDKNCDLDVACRRIAWGKFMNSGQTCVAPDYILCDPSIQNQIVEKLKKSLKEFYGEDA KKSRDYGRIISARHFQRVMGLIEGQKVAYGGTGDAATRYIAPTILTDVDPQSPVMQEE IFGPVLPIVCVRSLEEAIQFINQREKPLALYMFSSNDKVIKKMIAETSSGGVAANDVI VHITLHSLPFGGVGNSGMGSYHGKKSFETFSHRRSCLVRPLMNDEGLKVRYPPSPAKM TQH" polyA_signal 1605..1610 /gene="ALDH3" /note="G00-118-992" BASE COUNT 359 a 504 c 483 g 290 t ORIGIN 1 ccaggagccc cagttaccgg gagaggctgt gtcaaaggcg ccatgagcaa gatcagcgag 61 gccgtgaagc gcgcccgcgc cgccttcagc tcgggcagga cccgtccgct gcagttccga 121 ttccagcagc tggaggcgct gcagcgcctg atccaggagc aggagcagga gctggtgggc 181 gcgctggccg cagacctgca caagaatgaa tggaacgcct actatgagga ggtggtgtac 241 gtcctagagg agatcgagta catgatccag aagctccctg agtgggccgc ggatgagccc 301 gtggagaaga cgccccagac tcagcaggac gagctctaca tccactcgga gccactgggc 361 gtggtcctcg tcattggcac ctggaactac cccttcaacc tcaccatcca gcccatggtg 421 ggcgccatcg ctgcagggaa cgcagtggtc ctcaagccct cggagctgag tgagaacatg 481 gcgagcctgc tggctaccat catcccccag tacctggaca aggatctgta cccagtaatc 541 aatgggggtg tccctgagac cacggagctg ctcaaggaga ggttcgacca tatcctgtac 601 acgggcagca cgggggtggg gaagatcatc atgacggctg ctgccaagca cctgacccct 661 gtcacgctgg agctgggagg gaagagtccc tgctacgtgg acaagaactg tgacctggac 721 gtggcctgcc gacgcatcgc ctgggggaaa ttcatgaaca gtggccagac ctgcgtggcc 781 ccagactaca tcctctgtga cccctcgatc cagaaccaaa ttgtggagaa gctcaagaag 841 tcactgaaag agttctacgg ggaagatgct aagaaatccc gggactatgg aagaatcatt 901 agtgcccggc acttccagag ggtgatgggc ctgattgagg gccagaaggt ggcttatggg 961 ggcaccgggg atgccgccac tcgctacata gcccccacca tcctcacgga cgtggacccc 1021 cagtccccgg tgatgcaaga ggagatcttc gggcctgtgc tgcccatcgt gtgcgtgcgc 1081 agcctggagg aggccatcca gttcatcaac cagcgtgaga agcccctggc cctctacatg 1141 ttctccagca acgacaaggt gattaagaag atgattgcag agacatccag tggtggggtg 1201 gcggccaacg atgtcatcgt ccacatcacc ttgcactctc tgcccttcgg gggcgtgggg 1261 aacagcggca tgggatccta ccatggcaag aagagcttcg agactttctc tcaccgccgc 1321 tcttgcctgg tgaggcctct gatgaatgat gaaggcctga aggtcagata ccccccgagc 1381 ccggccaaga tgacccagca ctgaggaggg gttgctccgc ctggcctggc catactgtgt 1441 cccatcggag tgcggaccac cctcactggc tctcctggcc ctggagaatc gctcctgcag 1501 ccccagccca gccccactcc tctgctgacc tgctgacctg tgcacacccc actcccacat 1561 gggcccaggc ctcaccattc caagtctcca cccctttcta gaccaataaa gagacaaata 1621 caattttcta actcgg // LOCUS HUMALFUC 2035 bp mRNA PRI 15-MAR-1990 DEFINITION Human alpha-L-fucosidase, complete cds. ACCESSION M29877 NID g178408 KEYWORDS fucosidase. SOURCE Human liver, cDNA to mRNA, (library of G.Howlett), clones lambda-HF[05,12,27]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2035) AUTHORS Occhiodoro,T., Beckmann,K.R., Morris,C. and Hopwood,J.J. TITLE Human alpha-L-fucosidase: Complete coding sequence from cDNA clones JOURNAL Biochem. Biophys. Res. Commun. 164, 439-445 (1989) MEDLINE 90026416 FEATURES Location/Qualifiers source 1..2035 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..2007 /note="alpha-L-fucosidase mRNA" sig_peptide 19..84 /note="alpha-L-fucosidase signal peptide" CDS 19..1404 /note="alpha-L-fucosidase precursor (EC 3.2.1.5)" /codon_start=1 /db_xref="PID:g178409" /translation="MRSRPAGPALLLLLLFLGAAESVRRAQPPRRYTPDWPSLDSRPL PAWFDEAKFGVFIHWGVFSVPAWGSEWFWWHWQGEGRPQYQRFMRDNYPPGFSYADFG PQFTARFFHPEEWADLFQAAGAKYVVLTTKHHEGFTNWPSPVSWNWNSKDVGPHRDLV GELGTALRKRNIRYGLYHSLLEWFHPLYLLDKKNGFKTQHFVSAKTMPELYDLVNSYK PDLIWSDGEWECPDTYWNSTNFLSWLYNDSPVKDEVVVNDRWGQNSSCHHGGYYNCED KFKPQSLPDHKWEMCTSIDKFSWGYRRDMALSDVTEESEIISELVQTVSLGGNYLLNI GPTKDGLIVPIFQERLLAVGKWLSINGEAIYASKPWRVQWEKNTTSVWYTSKGSAVYA IFLHWPENGVLNLESPITTSTTKITMLGIQGDLKWSTDPDKGLFISLPQLPPSAVPAE FAWTIKLTGVK" mat_peptide 85..1401 /note="alpha-L-fucosidase" BASE COUNT 525 a 489 c 496 g 525 t ORIGIN 1 gaattccggg ctccggggat gaggtcgcgg ccggcgggtc ccgcgctgtt gctgctgctg 61 ctcttcctcg gagcggccga gtcggtgcgt cgggcccagc ctccgcgccg ctacacccca 121 gactggccga gcctggattc tcggccgctg ccggcctggt tcgacgaagc caagttcggg 181 gtgttcatcc actggggcgt gttctcggtg cccgcctggg gcagcgagtg gttctggtgg 241 cactggcagg gcgaggggcg gccgcagtac cagcgcttca tgcgcgacaa ctacccgccc 301 ggcttcagct acgccgactt cggaccgcag ttcactgcgc gcttcttcca cccggaggag 361 tgggccgacc tcttccaggc cgcgggcgcc aagtatgtag ttttgacgac aaagcatcac 421 gaaggcttca caaactggcc gagtcctgtg tcttggaact ggaactccaa agacgtgggg 481 cctcatcggg atttggttgg tgaattggga acagctctcc ggaagaggaa catccgctat 541 ggactatacc actcactctt agagtggttc catccactct atctacttga taagaaaaat 601 ggcttcaaaa cacagcattt tgtcagtgca aaaacaatgc cagagctgta cgaccttgtt 661 aacagctata aacctgatct gatctggtct gatggggagt gggaatgtcc tgatacttac 721 tggaactcca caaattttct ttcatggctc tacaatgaca gccctgtcaa ggatgaggtg 781 gtagtaaatg accgatgggg tcagaactct tcctgtcacc atggaggata ctataactgt 841 gaagataaat tcaagccaca gagcttgcca gatcacaagt gggagatgtg caccagcatt 901 gacaagtttt cctggggcta tcgtcgtgac atggcattgt ctgatgttac agaagaatct 961 gaaatcattt cggaactggt tcagacagta agtttgggag gcaactatct tctgaacatt 1021 ggaccaacta aagatggact gattgttccc atcttccaag aaaggcttct tgctgttggg 1081 aaatggctga gcatcaatgg ggaggctatc tatgcctcca aaccatggcg ggtgcaatgg 1141 gaaaagaaca caacatctgt atggtatacc tcaaagggat cggctgttta tgccattttt 1201 ctgcactggc cagaaaatgg agtcttaaac cttgaatccc ccataactac ctcaactaca 1261 aagataacaa tgctgggaat tcaaggagat ctgaagtggt ccacagatcc agataaaggt 1321 ctcttcatct ctctacccca gttgccaccc tctgctgtcc ccgcagagtt tgcttggact 1381 ataaagctga caggagtgaa gtaatcattt gagtgcaaga agaaagaggc gctgctcact 1441 gttttcctgc ttcagttttt ctcttatagt accatcacta taatcaacga acttctcttc 1501 tccacccaga gatggctttt ccaacacatt ttaattaaag gaactgagta cattaccctg 1561 atgtctaaat ggaccaaaga tctgagatcc attgtgatta tatctgtatc aggtcagcag 1621 aagaaggaac tgagcagttg aactctgagt tcatcaattc taatatttgg aaattatcta 1681 caatggaatc ttccctctgt tctctgataa cctacttgct tactcaatgc ctttaagcca 1741 agtcaccctg ttgcctatgg gaggaggtgg aaggatttgg caagctcaac cacatgctat 1801 ttagttagca tcagttgtca ccaacagtct ttctgcaaag ggcaggagag ctttggggga 1861 aaggaaaagg cttaccaggc tgctatggtc aactcttcag aaattttcag agcaatctaa 1921 aagcgccaaa attcgctatg tttacagtga tactattaag aaaatgaatg tgattctgct 1981 ctgtcttttt aagtatgatc aaataaaaaa tttgtacatc acaatcattt ctacc // LOCUS HUMALK5A 2308 bp mRNA PRI 24-JAN-1994 DEFINITION Human activin receptor-like kinase (ALK-5) mRNA, complete cds. ACCESSION L11695 NID g431034 KEYWORDS activin; activin receptor-like kinase; serine/threonine kinase; transforming growth factor-beta; transmembrane protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2308) AUTHORS Franzen,P., ten Dijke,P., Ichijo,H., Yamashita,H., Schulz,P., Heldin,C.H. and Miyazono,K. TITLE Cloning of a TGF beta type I receptor that forms a heteromeric complex with the TGF beta type II receptor JOURNAL Cell 75 (4), 681-692 (1993) MEDLINE 94061986 FEATURES Location/Qualifiers source 1..2308 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="HEL" /tissue_lib="lambda gt10" gene 77..1582 /gene="HLK-5" gene 77..1588 /gene="ALK-5" sig_peptide 77..148 /gene="HLK-5" CDS 77..1588 /gene="ALK-5" /note="amino acid feature: N-glycolsylation site, aa 45 .. 47; amino acid feature: cysteine-rich region, aa 36 .. 106; amino acid feature: extracellular domain, aa 25 .. 125; amino acid feature: intracellular domain, aa 148 .. 503; amino acid feature: kinase insert-I, aa 321 .. 327; amino acid feature: kinase insert-II, aa 444 .. 462; amino acid feature: serine/threonine kinase domain, aa 207 .. 498; amino acid feature: transmembrane domain, aa 148 .. 503" /codon_start=1 /product="activin receptor-like kinase" /db_xref="PID:g431035" /translation="MEAAVAAPRPRLLLLVLAAAAAAAAALLPGATALQCFCHLCTKD NFTCVTDGLCFVSVTETTDKVIHNSMCIAEIDLIPRDRPFVCAPSSKTGSVTTTYCCN QDHCNKIELPTTVKSSPGLGPVELAAVIAGPVCFVCISLMLMVYICHNRTVIHHRVPN EEDPSLDRPFISEGTTLKDLIYDMTTSGSGSGLPLLVQRTIARTIVLQESIGKGRFGE VWRGKWRGEEVAVKIFSSREERSWFREAEIYQTVMLRHENILGFIAADNKDNGTWTQL WLVSDYHEHGSLFDYLNRYTVTVEGMIKLALSTASGLAHLHMEIVGTQGKPAIAHRDL KSKNILVKKNGTCCIADLGLAVRHDSATDTIDIAPNHRVGTKRYMAPEVLDDSINMKH FESFKRADIYAMGLVFWEIARRCSIGGIHEDYQLPYYDLVPSDPSVEEMRKVVCEQKL RPNIPNRWQSCEALRVMAKIMRECWYANGAARLTALRIKKTLSQLSQQEGIKM" mat_peptide 149..1582 /gene="HLK-5" BASE COUNT 642 a 459 c 537 g 670 t ORIGIN 1 ggcgaggcga ggtttgctgg ggtgaggcag cggcgcggcc gggccgggcc gggccacagg 61 cggtggcggc gggaccatgg aggcggcggt cgctgctccg cgtccccggc tgctcctcct 121 cgtgctggcg gcggcggcgg cggcggcggc ggcgctgctc ccgggggcga cggcgttaca 181 gtgtttctgc cacctctgta caaaagacaa ttttacttgt gtgacagatg ggctctgctt 241 tgtctctgtc acagagacca cagacaaagt tatacacaac agcatgtgta tagctgaaat 301 tgacttaatt cctcgagata ggccgtttgt atgtgcaccc tcttcaaaaa ctgggtctgt 361 gactacaaca tattgctgca atcaggacca ttgcaataaa atagaacttc caactactgt 421 aaagtcatca cctggccttg gtcctgtgga actggcagct gtcattgctg gaccagtgtg 481 cttcgtctgc atctcactca tgttgatggt ctatatctgc cacaaccgca ctgtcattca 541 ccatcgagtg ccaaatgaag aggacccttc attagatcgc ccttttattt cagagggtac 601 tacgttgaaa gacttaattt atgatatgac aacgtcaggt tctggctcag gtttaccatt 661 gcttgttcag agaacaattg cgagaactat tgtgttacaa gaaagcattg gcaaaggtcg 721 atttggagaa gtttggagag gaaagtggcg gggagaagaa gttgctgtta agatattctc 781 ctctagagaa gaacgttcgt ggttccgtga ggcagagatt tatcaaactg taatgttacg 841 tcatgaaaac atcctgggat ttatagcagc agacaataaa gacaatggta cttggactca 901 gctctggttg gtgtcagatt atcatgagca tggatccctt tttgattact taaacagata 961 cacagttact gtggaaggaa tgataaaact tgctctgtcc acggcgagcg gtcttgccca 1021 tcttcacatg gagattgttg gtacccaagg aaagccagcc attgctcata gagatttgaa 1081 atcaaagaat atcttggtaa agaagaatgg aacttgctgt attgcagact taggactggc 1141 agtaagacat gattcagcca cagataccat tgatattgct ccaaaccaca gagtgggaac 1201 aaaaaggtac atggcccctg aagttctcga tgattccata aatatgaaac attttgaatc 1261 cttcaaacgt gctgacatct atgcaatggg cttagtattc tgggaaattg ctcgacgatg 1321 ttccattggt ggaattcatg aagattacca actgccttat tatgatcttg taccttctga 1381 cccatcagtt gaagaaatga gaaaagttgt ttgtgaacag aagttaaggc caaatatccc 1441 aaacagatgg cagagctgtg aagccttgag agtaatggct aaaattatga gagaatgttg 1501 gtatgccaat ggagcagcta ggcttacagc attgcggatt aagaaaacat tatcgcaact 1561 cagtcaacag gaaggcatca aaatgtaatt ctacagcttt gcctgaactc tccttttttc 1621 ttcagatctg ctcctgggtt ttaatttggg aggtcagttg ttctacctca ctgagaggga 1681 acagaaggat attgcttcct tttgcagcag tgtaataaag tcaattaaaa acttcccagg 1741 atttctttgg acccaggaaa cagccatgtg ggtcctttct gtgcactatg aacgcttctt 1801 tcccaggaca gaaaatgtgt agtctacctt tattttttat taacaaaact tgttttttaa 1861 aaagatgatt gctggtctta actttaggta actctgctgt gctggagatc atctttaagg 1921 gcaaaggagt tggattgctg aattacaatg aaacatgtct tattactaaa gaaagtgatt 1981 tactcctggt tagtacattc tcagaggatt ctgaaccact agagtttcct tgattcagac 2041 tttgaatgta ctgttctata gtttttcagg atcttaaaac taacacttat aaaactctta 2101 tcttgagtct aaaaatgacc tcatatagta gtgaggaaca taattcatgc aattgtattt 2161 tgtatactat tattgttctt tcacttattc agaacattac atgccttcaa aatgggattg 2221 tactatacca gtaagtgcca cttctgtgtc tttctaatgg aaatgagtag aattgctgaa 2281 agtctctatg ttaaaaccta tagtgttt // LOCUS HUMALR 1132 bp mRNA PRI 31-OCT-1994 DEFINITION Human aldehyde reductase mRNA, complete cds. ACCESSION J04794 NID g178480 KEYWORDS alcohol:NADP+ oxireductase; aldehyde reductase. SOURCE Human liver, cDNA to mRNA, (library of Prochownick and Michaelson). ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1132) AUTHORS Bohren,K.M., Bullock,B., Wermuth,B. and Gabbay,K.H. TITLE The aldo-keto reductase superfamily. cDNAs and deduced amino acid sequences of human aldehyde and aldose reductases JOURNAL J. Biol. Chem. 264 (16), 9547-9551 (1989) MEDLINE 89255461 COMMENT Draft entry and computer-readable seuence for [1] kindly provided by K.H.Gabbay, 21-JUN-1989. FEATURES Location/Qualifiers source 1..1132 /organism="Homo sapiens" /db_xref="taxon:9606" /map="7q35" mRNA <1..1132 /note="ALR mRNA" gene 61..1038 /gene="ALDR1" CDS 61..1038 /gene="ALDR1" /note="aldehyde reductase (EC 1.1.1.2)" /codon_start=1 /db_xref="GDB:G00-128-041" /db_xref="PID:g178481" /translation="MAASCVLLHTGQKMPLIGLGTWKSEPGQVKAAVKYALSVGYRHI DCAAIYGNEPEIGEALKEDVGPGKAVPREELFVTSKLWNTKHHPEDVEPALRKTLADL QLEYLDLYLMHWPYAFERGDNPFPKNADGTICYDSTHYKETWKALEALVAKGLVQALG LSNFNSRQIDDILSVASVRPAVLQVECHPYLAQNELIAHCQARGLEVTAYSPLGSSDR AWRDPDEPVLLEEPVVLALAEKYGRSPAQILLRWQVQRKVICIPKSITPSRILQNIKV FDFTFSPEEMKQLNALNKNWRYIVPMLTVDGKRVPRDAGHPLYPFNDPY" BASE COUNT 267 a 289 c 324 g 252 t ORIGIN 1 agccagaaat gtgaagtgct agctgaagga tgagcagcag ctagccaggc aaagggggca 61 atggcggctt cctgtgttct actgcacact gggcagaaga tgcctctgat tggtctgggt 121 acctggaaga gtgagcctgg tcaggtaaaa gcagctgtta agtatgccct tagcgtaggc 181 taccgccaca ttgattgtgc tgctatctac ggcaatgagc ctgagattgg ggaggccctg 241 aaggaggacg tgggaccagg caaggcggtg cctcgggagg agctgtttgt gacatccaag 301 ctgtggaaca ccaagcacca ccccgaggat gtggagcctg ccctccggaa gactctggct 361 gacctccagc tggagtatct ggacctgtac ctgatgcact ggccttatgc ctttgagcgg 421 ggagacaacc ccttccccaa gaatgctgat gggactatat gctacgactc cacccactac 481 aaggagactt ggaaggctct ggaggcactg gtggctaagg ggctggtgca ggcgctgggc 541 ctgtccaact tcaacagtcg gcagattgat gacatactca gtgtggcctc cgtgcgtcca 601 gctgtcttgc aggtggaatg ccacccatac ttggctcaaa atgagctaat tgcccactgc 661 caagcacgtg gcttggaggt aactgcttat agccctttgg gctcctctga tcgtgcatgg 721 cgtgatcctg atgagcctgt cctgctggag gaaccagtag tcctggcatt ggctgaaaag 781 tatggccgat ctccagctca gatcttgctc aggtggcagg tccagcggaa agtgatctgc 841 atccccaaaa gtatcactcc ttctcgaatc cttcagaaca tcaaggtgtt tgacttcacc 901 tttagcccag aagagatgaa gcagctaaat gccctgaaca aaaattggag atatattgtg 961 cctatgctta cggtggatgg gaagagagtc ccaagggatg cagggcatcc tctgtacccc 1021 tttaatgacc cgtactgaga ccacagcttc ttggcctccc ttccagctct gcagctaatg 1081 aggtcctgcc acaacggaaa gagggagtta ataaagccat tggagcatcc at // LOCUS HUMAMD 1805 bp mRNA PRI 31-OCT-1994 DEFINITION Human S-adenosylmethionine decarboxylase mRNA, complete cds. ACCESSION M21154 J04048 NID g178517 KEYWORDS AdoMet decarboxylase; S-adenosylmethionine decarboxylase. SOURCE Human fibroblast, cDNA to mRNA, (library of Okayama), clone pSAMh1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1805) AUTHORS Pajunen,A., Crozat,A., Janne,O.A., Ihalainen,R., Laitinen,P.H., Stanley,B., Madhubala,R. and Pegg,A.E. TITLE Structure and regulation of mammalian S-adenosylmethionine decarboxylase JOURNAL J. Biol. Chem. 263 (32), 17040-17049 (1988) MEDLINE 89034205 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by A.Crozat, 27-OCT-1988. FEATURES Location/Qualifiers source 1..1805 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Xq22-q28" mRNA <1..1805 /note="AMD mRNA" gene 249..1253 /gene="AMD2" CDS 249..1253 /gene="AMD2" /note="S-adenosylmethionine decarboxylase proenzyme (EC 4.1.1.50) old gene name 'AMD'" /codon_start=1 /db_xref="GDB:G00-120-743" /db_xref="PID:g178518" /translation="MEAAHFFEGTEKLLEVWFSRQQPDANQGSGDLRTIPRSEWDILL KDVQCSIISVTKTDKQEAYVLSESSMFVSKRRFILKTCGTTLLLKALVPLLKLARDYS GFDSIQSFFYSRKNFMKPSHQGYPHRNFQEEIEFLNAIFPNGAGYCMGRMNSDCWYLY TLDFPESRVISQPDQTLEILMSELDPAVMDQFYMKDGVTAKDVTRESGIRDLIPGSVI DATMFNPCGYSMNGMKSDGTYWTIHITPEPEFSYVSFETNLSQTSYDDLIRKVVEVFK PGKFVTTLFVNQSSKCRTVLASPQKIEGFKRLDCQSAMFNDYNFVFTSFAKKQQQQQS " BASE COUNT 528 a 344 c 385 g 548 t ORIGIN 499 bp upstream of KpnI site; chromosome Xq22-q28. 1 aagagactga actgtatctg cctctatttc caaaagactc acgttcaact ttcgctcaca 61 caaagccggg aaaattttat tagtcctttt tttaaaaaaa gttaatataa aattatagca 121 aaaaaaaaaa ggaacctgaa ctttagtaac acagctggaa caatcgcagc ggcggcggca 181 gcggcgggag aagaggttta atttagttga ttttctgtgg ttgttggttg ttcgctagtc 241 tcacggtgat ggaagctgca cattttttcg aagggaccga gaagctgctg gaggtttggt 301 tctcccggca gcagcccgac gcaaaccaag gatctgggga tcttcgcact atcccaagat 361 ctgagtggga catacttttg aaggatgtgc aatgttcaat cataagtgtg acaaaaactg 421 acaagcagga agcttatgta ctcagtgaga gtagcatgtt tgtctccaag agacgtttca 481 ttttgaagac atgtggtacc accctcttgc tgaaagcact ggttcccctg ttgaagcttg 541 ctagggatta cagtgggttt gactcaattc aaagcttctt ttattctcgt aagaatttca 601 tgaagccttc tcaccaaggg tacccacacc ggaatttcca ggaagaaata gagtttctta 661 atgcaatttt cccaaatgga gcaggatatt gtatgggacg tatgaattct gactgttggt 721 acttatatac tctggatttc ccagagagtc gggtaatcag tcagccagat caaaccttgg 781 aaattctgat gagtgagctt gacccagcag ttatggacca gttctacatg aaagatggtg 841 ttactgcaaa ggatgtcact cgtgagagtg gaattcgtga cctgatacca ggttctgtca 901 ttgatgccac aatgttcaat ccttgtgggt attcgatgaa tggaatgaaa tcggatggaa 961 cttattggac tattcacatc actccagaac cagaattttc ttatgttagc tttgaaacaa 1021 acttaagtca gacctcctat gatgacctga tcaggaaagt tgtagaagtc ttcaagccag 1081 gaaaatttgt gaccaccttg tttgttaatc agagttctaa atgtcgcaca gtgcttgctt 1141 cgccccagaa gattgaaggt tttaagcgtc ttgattgcca gagtgctatg ttcaatgatt 1201 acaattttgt ttttaccagt tttgctaaga agcagcaaca acagcagagt tgattaagaa 1261 aaatgaagaa aaaacgcaaa aagagaacac atgtagaagg tggtggatgc tttctagatg 1321 tcgatgctgg gggcagtgct ttccataacc accactgtgt agttgcagaa agccctagat 1381 gtaatgatag tgtaatcatt ttgaattgta tgcattatta tatcaaggag ttagatatct 1441 tgcatgaatg ctctcttctg tgtttaggta ttctctgcca ctcttgctgt gaaattgaag 1501 tggatgtaga aaaaaccttt tactatatga aactttacaa cacttgtgaa agcaactcaa 1561 tttggtttat gcacagtgta atatttctcc aagtatcatc caaaattccc cacagacaag 1621 gctttcgtcc tcattaggtg ttggcctcag cctaaccctc taggactgtt ctattaaatt 1681 gctgccagaa ttttacatcc agttacctcc actttctaga acatattctt tactaatgtt 1741 attgaaacca atttctactt catactgatg tttttggaaa cagcaattaa agtttttctt 1801 ccatg // LOCUS HUMAMII 4917 bp mRNA PRI 13-MAR-1996 DEFINITION Homo sapiens alpha mannosidase II isozyme mRNA, complete cds. ACCESSION L28821 NID g945214 KEYWORDS alpha mannosidase IIx. SOURCE Homo sapiens (tissue library: ATCC HTB 72) melanoma cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4917) AUTHORS Misago,M., Liao,Y.-F., Kudo,S., Eto,S., Mattei,M.-G., Moremen,K.W. and Fukuda,M.N. TITLE Molecular cloning and expression of cDNAs encoding human alpha-mannosidase II and a previously unrecognized alpha-mannosidase IIx isozyme JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (25), 11766-11770 (1995) MEDLINE 96102195 FEATURES Location/Qualifiers source 1..4917 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="SK-Mel-28" /cell_type="myeloma cell" /tissue_type="melanoma" /tissue_lib="ATCC HTB 72" 5'UTR 1..68 CDS 69..2459 /codon_start=1 /product="alpha mannosidase II isozyme" /db_xref="PID:g945215" /translation="MKLKKQVTVCGAAIFCVAVFSLYLMLDRVQHDPTRHQNGGNFPR SQISVLQNRIEQLEQLLEENHEIISHIKDSVLELTANAEGPPAMLPYYTVNGSWVVPP EPRPSFFSISPQDCQFALGGRGQKPELQMLTVSEELPFDNVDGGVWRQGFDISYDPHD WDAEDLQVFVVPHSHNDPGWIKTFDKYYTEQTQHILNSMVSKLQEDPRRRFLWAEVSF FAKWWDNINVQKRAAVRRLVGNGQLEIATGGWVMPDEANSHYFALIDQLIEGHQWLER NLGATPRSGWAVDPFGYSSTMPYLLRRANLTSMLIQRVHYAIKKHFAATHSLEFMWRQ TWDSDSSTDIFCHMMPFYSYDVPHTCGPDPKICCQFDFKRLPGGRINCPWKVPPRAIT EANVAERAALLLDQYRKKSQLFRSNVLLVPLGDDFRYDKPQEWDAQFFNYQRLFDFFN SRPNLHVQAQFGTLSDYFDALYKRTGVEPGARPPGFPVLSGDFFSYADREDHYWTGYY TSRPFYKSLDRVLEAHLRGAEVLYSLAAAHARRSGLAGRYPLSDFTLLTEARRTLGLF QHHDAITGTAKEAVVVDYGVRLLRSLVNLKQVIIHAAHYLVLGDKETYHFDPEAPFLQ VDDTRLSHDALPERTVIQLDSSPRFVVLFNPLEQERFSMVSLLVNSPRVRVLSEEGQP LAVQISAHWSSATEAVPDVYQVSVPVRLPALGLGVLQLQLGLDGHRTLPSSVRIYLHG RQLSVSRHEAFPLRVIDSGTSDFALSNRYMQVWFSGLTGLLKGSGLCFLAEHPKGG" 3'UTR 2460..4917 polyA_signal 4895..4900 BASE COUNT 1057 a 1413 c 1376 g 1071 t ORIGIN 1 ggcagctcgg ccgactgggc ccggagcggc gcggaggccg ggcgctgacg gtgtgtgtgg 61 aggccagtat gaagctgaaa aagcaggtga cagtgtgtgg ggctgccatc ttctgtgtgg 121 cagtcttctc gctctacctc atgctggacc gagtgcaaca cgatcccacc cgacaccaga 181 atggtgggaa cttcccccgg agccaaattt ctgtgctgca gaaccgcatt gagcagctgg 241 agcagctttt ggaggagaac catgagatta tcagccatat caaggactcc gtgctggagc 301 tgacagccaa cgcagagggc ccgcccgcca tgctgcccta ctacacggtc aatggctcct 361 gggtggtgcc accggagccc cggcccagct tcttctccat ctccccgcag gactgccagt 421 ttgctttggg gggccggggt cagaagccag agctgcagat gctcactgtg tcggaggagc 481 tgccgtttga caacgtggat ggtggtgtgt ggaggcaagg cttcgacatc tcctacgacc 541 cgcacgactg ggatgctgaa gacctgcagg tgtttgtggt gccccactct cacaatgacc 601 caggctggat caagaccttt gacaagtact acacagagca gacccaacac atcctcaata 661 gcatggtgtc taagctgcag gaggaccccc ggcggcgctt cctctgggca gaggtctcct 721 tcttcgccaa gtggtgggac aacatcaatg tccaaaagag agcggcagtc cgaaggctgg 781 tgggaaacgg gcagctggag attgcgacag gaggctgggt gatgccagat gaggccaatt 841 cccactactt tgcattgatt gaccagctca tcgaaggaca ccagtggctg gagagaaatc 901 ttggtgcaac cccccgctct ggctgggcag tggacccctt tggatacagc tccaccatgc 961 cttacctgct gcgccgtgcc aacctcacca gcatgctgat tcagagagtg cactatgcca 1021 tcaagaagca ctttgctgcc acccacagcc tagagttcat gtggaggcag acatgggact 1081 cggactccag cacagacatc ttctgtcaca tgatgccctt ctacagctat gacgtccccc 1141 atacctgtgg cccagatccc aagatctgct gccaatttga tttcaaacgc ctgcctggtg 1201 ggcgcatcaa ctgcccttgg aaggtgccac cccgggccat cacagaggcc aacgtggcag 1261 agagggcagc cctgcttctg gaccaatacc ggaagaagtc ccagctgttc cgaagcaacg 1321 tcctcctggt gcctcttgga gatgacttcc gatatgacaa gccccaggag tgggatgccc 1381 agttcttcaa ctaccaacgg ctctttgact tcttcaacag caggcctaac ctccatgtgc 1441 aggcccagtt tggcactctt tctgactatt ttgatgccct gtacaagagg acaggggtgg 1501 agccaggggc ccggcctcca gggtttcctg tgctgagcgg ggatttcttc tcctatgcgg 1561 accgggagga tcattactgg acaggctatt acacttcccg gcccttctac aagagcttag 1621 accgagtcct ggaagcccac ctgcgggggg cagaggttct gtacagcctg gctgcagctc 1681 acgctcgccg ctctggtctg gctggccggt acccactgtc tgatttcacc ctcctgacgg 1741 aagctcggcg cacattgggg ctcttccagc atcacgatgc catcactggc acggccaagg 1801 aggctgtggt ggtggactat ggggtcaggc ttctgcgctc ccttgtcaac ctgaagcagg 1861 tcatcattca tgcagcccac tatctggtgc tgggggacaa ggagacctac cactttgacc 1921 ctgaggcgcc cttcctccaa gtggatgaca ctcgcttaag tcacgacgcc ctcccagagc 1981 gcacggtgat ccagctggat tcctcgccca ggtttgtggt cctattcaac ccactggaac 2041 aggagcgatt cagcatggtg tccctgctgg tcaactctcc ccgcgtgcgt gtcctttcgg 2101 aggagggtca gcccctggcc gtgcagatca gcgcacactg gagctctgcc accgaggcgg 2161 tccctgacgt ctaccaggtg tctgtgcctg tccgcctgcc agccctgggc ctgggcgtgc 2221 tgcagctaca gctgggcctg gatgggcacc gcacgctgcc ctcctctgtg cgcatctacc 2281 tgcacggccg gcagctgtcc gtcagcaggc acgaagcgtt tcctctccgt gtcattgact 2341 ctggcaccag cgacttcgcc ctcagcaacc gctacatgca ggtctggttc tcaggcctta 2401 ctgggctcct caaggggtca gggctgtgtt ttttggcaga gcatccgaag ggtggatgag 2461 gagcacgagc agcaggtgga catgcaggtc cttgtctatg gcacccgtac gtccaaagac 2521 aagagtggag cctacctctt cctgcccgat ggcgaggcaa gccctacgtc cccaaggagc 2581 cccccgtgct gcgtgtcact gaaggccctt tcttctcaga ggtggttgcg tactatgagc 2641 acattcacca ggcggtccgg ctttacaatc tgccaggggt ggaggggctg tctctggaca 2701 tatcatccct ggtggacatc cgggactacg tcaacaagga gctggccctg cacatccata 2761 cagacatcga cagccagggt gcagccccga cggtatctga agaagctccc cctccaggcc 2821 aacttctacc ccatgccagt catggcctat atccaggacg cacagaagcg cctcacgctg 2881 cacactgccc aggccctggg tgtctctagc ctcaaagatg gccagctgga ggtgatcttg 2941 gaccggcggc tgatgcagga tgacaaccgg ggcctaggcc aagggctcaa ggacaacaag 3001 agaacctgca accgtttccg cctcctgcta gagcggcgaa ccgtgggcag tgaggtccaa 3061 gatagccact ctaccagcta cccatccctc ctcagccacc tgacctccat gtacctgaac 3121 gccccggcgc tcgctctgcc tgtagccagg atgcagctcc caggccctgg tctgcgctca 3181 tttcatcctc tggcttcctc actgccctgt gacttccacc tgctcaacct acgtacgctc 3241 caggctgagg aggacaccct accctcggcg gagaccgcac tcatcttaca ccgcaagggt 3301 tttgactgcg gcctggaggc caagaacttg ggcttcaact gcaccacaag ccaaggcaag 3361 gtagccctgg gcagcctttt ccatggcctg gatgtggtat tccttcagcc aacctccttg 3421 acgttactgt accctctggc ctccccgtcc aacagcactg acgtctattt ggagcccatg 3481 gagattgcta cctttcgcct ccgcttgggt tagggcttct tgtggcctga agagaaagtt 3541 cattcacaga gactgcctct taacatgaag atcattggac aagccacacg ggtatcccat 3601 cccgatctgc ctcccagaac tgtgacacac tgggctctgc cctcattttc tgtttattgc 3661 tgctgctgtg ttttcggcgc aacccacaaa cccagtgatg ggtaaatagg gcagacgcca 3721 gtgagatcag ggagagaagg cccttggtca gagtgggcag tgccaggctc tgctttgggt 3781 tgtgagtgga cacccaactg ggcacaggct caggcaccca tcctttttcc aaacagggat 3841 atagaagtgg tggaagcaga cagaagaggt aagggaggct aagtgggtaa cagcccagca 3901 tcagggtcac tgtggcaaca gcaggctcta ggggaatcct gtggttatgt agagactcca 3961 tgtcctggtg tgatgagcag gatcagagtg actctgggag gacaggggtg gggacccaga 4021 gttagcagtg gggatggagc agtagaagga atcactgttt ctcctaggag tctgaaggcc 4081 tcgctgcttt ctgtgatggc tttgcagtaa gtgccgcctg gcctgcatgc attggctaac 4141 aggctgcaga atggcaggaa ggactcgcta gagattgtca tggccagaga tcataggtca 4201 cttcaggtag caagacccct ggcaaactgg gcacttggcc tatgtactga tttgtgggat 4261 ggtggcaggg gtgtggggtc cttcaccctg cctgaattct ctttggcttc tgtgctctgt 4321 atgctgctgt ccccaagggc tctttcttat tatggcaggg agtggggatt ggtcctactt 4381 tctttctctg gaaaggaaag cctccaagac tccatgtgct tgggcagctt gagaaggcgt 4441 tcagcaccac gcctagcagg cagaccttga agcctcacct ttagtctatc tgcagaggta 4501 ttcagttcct ggcacagggg actaggggca tgtagagtat atgaggaggc agtatggctg 4561 tgcaggagcc ttcatttcag cttcaattaa tagggaagaa tttatgatag ctctatagat 4621 gctgaaaagg tatttcgtaa gatttaaaat ccatccctta ttaaaactct tagtaaatta 4681 agtctggaaa gaaacaccct aatctagata aaggtctgtt tcagaaacca acagtgatgg 4741 cattctaaag agtcagacgc cacaggcatt cccattaaag tcagaaacta gccaagggca 4801 agctattatt cagcagtgtc ccggcactac taacccctgc aacaagccag atgaggaaca 4861 taaggaagaa ttataattgt cattatttgt agacaataaa actgcctacc tgtaaaa // LOCUS HUMAMINOP 3000 bp mRNA PRI 20-SEP-1993 DEFINITION Homo sapiens aminopeptidase A mRNA, complete cds. ACCESSION L14721 NID g291853 KEYWORDS aminopeptidase A; angiotensinase; aspartate aminopeptidase; surface glycoprotein. SOURCE Homo sapiens (library: Clontech GT11) male kidney cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3000) AUTHORS Nanus,D.M., Pfeffer,L.P., Bander,N.H., Bahri,S. and Albino,A.P. TITLE Antiproliferative and antitumor effect of alpha-interferon in renal cell carcinomas: Correlation with the expression of a kidney-associated differentiation glycoprotein JOURNAL Cancer Res. 50, 4190-4194 (1990) MEDLINE 90304763 REFERENCE 2 (bases 1 to 3000) AUTHORS Nanus,D.M., Engelstein,D., Gastl,G.A., Gluck,L., Vidal,M.J., Morrison,M., Finstad,C.L., Bander,N.H. and Albino,A.P. TITLE Molecular cloning of the human kidney differentiation antigen gp160: Human aminopeptidase A JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90, 7069-7073 (1993) MEDLINE 93348214 FEATURES Location/Qualifiers source 1..3000 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /tissue_type="kidney" /tissue_lib="Clontech GT11" mRNA 1..3000 CDS 84..2957 /standard_name="angiotensinase" /EC_number="3.4.11.7" /note="gp160 kidney differentiation surface glycoprotein" /citation=[2] /citation=[1] /codon_start=1 /function="L-a-aspartyl (L-a-glutamyl)-peptide hydrolase" /evidence=experimental /product="aminopeptidase A" /db_xref="PID:g291854" /translation="MNFAEREGSKRYCIQTKHVAILCAVVVGVGLIVGLAVGLTRSCD SSGDGGPGTAPAPSHLPSSTASPSGPPAQDQDICPASEDESGQWKNFRLPDFVNPVHY DLHVKPLLEEDTYTGTVSISINLSAPTRYLWLHLRETRITRLPELKRPSGDQVQVRRC FEYKKQEYVVVEAEEELTPSSGDGLYLLTMEFAGWLNGSLVGFYRTTYTENGRVKSIA ATDHEPTDARKSFPCFDEPNKKATYTISITHPKEYGALSNMPVAKEESVDDKWTRTTF EKSVPMSTYLVCFAVHQFDSVKRISNSGKPLTIYVQPEQKHTAEYAANITKSVFDYFE EYFAMNYSLPKLDKIAIPDFGTGAMENWGLITYRETNLLYDPKESASSNQQRVATVVA HELVHQWFGNIVTMDWWEDLWLNEGFASFFEFLGVNHAETDWQMRDQMLLEDVLPVQE DDSLMSSHPIIVTVTTPDEITSVFDGISYSKGSSILRMLEDWIKPENFQKGCQMYLEK YQFKNAKTSDFWAALEEASRLPVKEVMDTWTRQMGYPVLNVNGVKNITQKRFLLDPRA NPSQPPSDLGYTWNIPVKWTEDNITSSVLFNRSEKEGITLNSSNPSGNAFLKINPDHI GFYRVNYEVATWDSIATALSLNHKTFSSADRASLIDDAFALARAQLLDYKVALNLTKY LKREENFLPWQRVISAVTYIISMFEDDKELYPMIEEYFQGQVKPIADSLGWNDAGDHV TKLLRSSVLGFACKMGDREALNNASSLFEQWLNGTVSLPVNLRLLVYRYGMQNSGNEI SWNYTLEQYQKTSLAQEKEKLLYGLASVKNVTLLSRYLDLLKDTNLIKTQDVFTVIRY ISYNSYGKNMAWNWIQLNWDYLVNRYTLNNRNLGRIVTIAEPFNTELQLWQMESFFAK YPQAGAGEKPREQVLETVKNNIEWLKQHRNTIREWFFNLLESG" misc_binding 390..400 /note="putative" /citation=[2] /bound_moiety="zinc-dependent metallopeptidase" BASE COUNT 897 a 624 c 701 g 778 t ORIGIN 1 tccaattcaa aaagaaagtc tctgacgtta gttagtttaa tttaacatct ttttatgtgt 61 aacacttgac tttggaagca aaaatgaact ttgcggagag agagggctct aagagatact 121 gcattcaaac gaaacatgtg gccattctct gtgcggtggt ggtgggtgta ggattaatag 181 tgggacttgc cgtgggcttg accagatcgt gtgactccag cggggacggc gggccgggca 241 ctgcgccagc tccttcccac ctgccttctt ccacggccag cccctcaggt cctcctgccc 301 aggaccagga catctgcccg gccagtgagg atgagagcgg acagtggaaa aactttcgac 361 tgccggactt cgtcaaccca gtccactacg acctgcacgt gaagcccctg ttggaggagg 421 acacctacac gggcaccgtg agcatctcca tcaacctgag cgctcccacc cggtacctgt 481 ggctgcacct ccgggagacc aggatcaccc ggctcccgga gctgaagagg ccctctgggg 541 accaggtgca agtccggagg tgtttcgagt acaaaaagca ggagtacgtg gtggtcgagg 601 cggaggaaga gcttaccccc agcagtggag atggcctgta tctcctgacc atggagttcg 661 ccggctggct gaacggctcc ctcgtgggat tttatagaac cacctacacg gagaacggac 721 gagtcaagag catagcggcc accgatcatg aaccaacaga tgccaggaaa tcttttcctt 781 gttttgatga gcccaacaaa aaggcaactt atacaatatc tatcacccat cccaaagaat 841 acggagcact ttcaaatatg ccagtggcga aagaagagtc agtggatgat aaatggactc 901 gaacaacttt tgagaagtct gtccccatga gcacgtacct ggtgtgcttt gctgtacatc 961 aatttgactc tgtaaagaga atatcaaata gtggaaaacc tcttacaatt tatgtccagc 1021 cagagcaaaa gcacacagcc gaatatgctg caaacataac taaaagtgtg tttgattatt 1081 ttgaagaata ctttgctatg aattattctc ttcctaaatt agataaaatc gctattccag 1141 attttggcac tggtgccatg gagaactggg gactcatcac gtacagagaa acgaacctgc 1201 tttatgaccc taaggaatca gcctcatcaa accaacagag ggtggccact gtggttgccc 1261 atgaacttgt gcatcagtgg tttggaaata ttgtgaccat ggactggtgg gaagacttgt 1321 ggctaaatga aggatttgct tctttctttg agtttctggg agtaaaccat gcagaaacag 1381 actggcaaat gcgtgaccaa atgttacttg aagatgtatt acctgttcaa gaggatgatt 1441 ctttgatgtc ttcgcatcca attattgtga ctgtgacaac ccctgatgaa ataacatctg 1501 tttttgatgg aatatcctat agcaagggat cttctatttt gagaatgctt gaagactgga 1561 taaaaccaga gaattttcaa aaaggatgtc agatgtactt ggaaaaatac caattcaaga 1621 atgcaaaaac ttctgatttt tgggcagcac tggaagaggc aagtaggcta ccagtgaaag 1681 aagtaatgga cacctggacc agacagatgg gttatcctgt gcttaacgtg aacggtgtca 1741 agaacatcac acagaaacgc tttttgttgg acccaagagc taacccttct cagccccctt 1801 cagatcttgg ttatacatgg aatatcccag ttaaatggac tgaagataat ataacaagca 1861 gtgtgttatt taataggtca gaaaaagaag gaatcacttt gaactcctct aatcctagtg 1921 gaaatgcttt tctcaaaata aacccagatc atattgggtt ttatcgtgta aattatgaag 1981 tagcaacttg ggactcgata gctacagcgc tctccttgaa ccacaagaca ttttcttcag 2041 cagatcgtgc aagtcttatt gatgatgctt ttgccttggc aagagctcaa cttctagatt 2101 ataaggtggc tttgaacttg accaagtatc tcaaaaggga agagaatttt ttaccatggc 2161 agagagtaat ttcagctgta acctacatca ttagcatgtt tgaagatgat aaagagctat 2221 atcctatgat tgaggaatac ttccaaggtc aagtgaagcc tattgcagat tctctgggat 2281 ggaatgatgc tggagaccat gtcacaaagt tactccgttc ctccgtgtta gggtttgcgt 2341 gcaagatggg agacagagaa gccttgaaca atgcttcctc gttatttgag cagtggctaa 2401 atgggactgt aagccttccc gtaaatctca ggcttctggt gtatcggtat gggatgcaga 2461 actctggcaa tgagatttca tggaactaca ctcttgagca ataccagaaa acttcattag 2521 ctcaagaaaa agaaaaactg ctgtatggat tagcatcagt gaagaacgtt actcttttgt 2581 caaggtattt ggatttgctc aaggacacga accttattaa aactcaggat gtgtttacag 2641 tcattcgata tatctcatat aacagctatg ggaagaacat ggcctggaat tggatacaac 2701 tcaactggga ctatctagtc aacagatata cactcaataa cagaaacctt ggccgaattg 2761 tcacaatagc agagccattc aacactgaac tgcaactgtg gcagatggag agcttttttg 2821 caaaatatcc acaagctgga gcaggagaaa aacctaggga acaagtgctg gaaacagtga 2881 aaaacaatat agagtggcta aaacaacata gaaacaccat cagagaatgg ttttttaatt 2941 tacttgagag tggttaatgt attcaaatgt tagagtttaa ttttgtgaat ctattgtttc // LOCUS HUMAMIPEP 3494 bp mRNA PRI 31-OCT-1994 DEFINITION Human aminopeptidase N/CD13 mRNA encoding aminopeptidase N, complete cds. ACCESSION M22324 NID g178535 KEYWORDS aminopeptidase N. SOURCE Human HL-60 cells, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3494) AUTHORS Look,A.T., Ashmun,R.A., Shapiro,L.H. and Peiper,S.C. TITLE Human myeloid plasma membrane glycoprotein CD13 (gp150) is identical to aminopeptidase N JOURNAL J. Clin. Invest. 83 (4), 1299-1307 (1989) MEDLINE 89198086 COMMENT Draft entry and computer-readable sequence [1] kindly submitted by A.Look 27-JAN-1989. FEATURES Location/Qualifiers source 1..3494 /organism="Homo sapiens" /db_xref="taxon:9606" /map="15q25-qter" sig_peptide 121..216 /gene="IGF1R" /note="aminopeptidase N signal peptide" CDS 121..3024 /gene="IGF1R" /note="aminopeptidase N precursor (EC 3.4.11.2)" /codon_start=1 /db_xref="GDB:G00-120-082" /db_xref="PID:g178536" /translation="MAKGFYISKSLGILGILLGVAAVCTIIALSVVYSQEKNKNANSS PVASTTPSASATTNPASATTLDQSKAWNRYRLPNTLKPDSYQVTLRPYLTPNDRGLYV FKGSSTVRFTCKEATDVIIIHSKKLNYTLSQGHRVVLRGVGGSQPPDIDKTELVEPTE YLVVHLKGSLVKDSQYEMDSEFEGELADDLAGFYRSEYMEGNVRKVVATTQMQAADAR KSFPCFDEPAMKAEFNITLIHPKDLTALSNMLPKGPSTPLPEDPNWNVTEFHTTPKMS TYLLAFIVSEFDYVEKQASNGVLIRIWARPSAIAAGHGDYALNVTGPILNFFAGHYDT PYPLPKSDQIGLPDFNAGAMENWGLVTYRENSLLFDPLSSSSSNKERVVTVIAHELAH QWFGNLVTIEWWNDLWLNEGFASYVEYLGADYAEPTWNLKDLMVLNDVYRVMAVDALA SSHPLSTPASEINTPAQISELFDAISYSKGASVLRMLSSFLSEDVFKQGLASYLHTFA YQNTIYLNLWDHLQEAVNNRSIQLPTTVRDIMNRWTLQMGFPVITVDTSTGTLSQEHF LLDPDSNVTRPSEFNYVWIVPITSIRDGRQQQDYWLIDVRAQNDLFSTSGNEWVLLNL NVTGYYRVNYDEENWRKIQTQLQRDHSAIPVINRAQIINDAFNLASAHKVPVTLALNN TLFLIEERQYMPWEAALSSLSYFKLMFDRSEVYGPMKNYLKKQVTPLFIHFRNNTNNW REIPENLMDQYSEVNAISTACSNGVPECEEMVSGLFKQWMENPNNNPIHPNLRSTVYC NAIAQGGEEEWDFAWEQFRNATLVNEADKLRAALACSKELWILNRYLSYTLNPDLIRK QDATSTIISITNNVIGQGLVWDFVQSNWKKLFNDYGGGSFSFSNLIQAVTRRFSTEYE LQQLEQFKKDNEETGFGSGTRALEQALEKTKANIKWVKENKEVVLQWFTENSK" gene 121..3024 /gene="IGF1R" mat_peptide 217..3021 /gene="IGF1R" /note="aminopeptidase N" BASE COUNT 814 a 1097 c 906 g 677 t ORIGIN Chromosome 15q25-26. 1 taatttttgc ccagtctgcc tgttgtgggg ctcctcccct ttggggatat aagcccggcc 61 tggggctgct ccgttctctg cctggcctga ggctccctga gccgcctccc caccatcacc 121 atggccaagg gcttctatat ttccaagtcc ctgggcatcc tggggatcct cctgggcgtg 181 gcagccgtgt gcacaatcat cgcactgtca gtggtgtact cccaggagaa gaacaagaac 241 gccaacagct cccccgtggc ctccaccacc ccgtccgcct cagccaccac caaccccgcc 301 tcggccacca ccttggacca aagtaaagcg tggaatcgtt accgcctccc caacacgctg 361 aaacccgatt cctaccaggt gacgctgaga ccgtacctca cccccaatga caggggcctg 421 tacgttttta agggctccag caccgtccgt ttcacctgca aggaggccac tgacgtcatc 481 atcatccaca gcaagaagct caactacacc ctcagccagg ggcacagggt ggtcctgcgt 541 ggtgtgggag gctcccagcc ccccgacatt gacaagactg agctggtgga gcccaccgag 601 tacctggtgg tgcacctcaa gggctccctg gtgaaggaca gccagtatga gatggacagc 661 gagttcgagg gggagttggc agatgacctg gcgggcttct accgcagcga gtacatggag 721 ggcaatgtca gaaaggtggt ggccactaca cagatgcagg ctgcagatgc ccggaagtcc 781 ttcccatgct tcgatgagcc ggccatgaag gccgagttca acatcacgct tatccacccc 841 aaggacctga cagccctgtc caacatgctt cccaaaggtc ccagcacccc acttccagaa 901 gaccccaact ggaatgtcac tgagttccac accacgccca agatgtccac gtacttgctg 961 gccttcattg tcagtgagtt cgactacgtg gagaagcagg catccaatgg tgtcttgatc 1021 cggatctggg cccggcccag tgccattgcg gcgggccacg gcgattatgc cctgaacgtg 1081 acgggcccca tccttaactt ctttgctggt cattatgaca caccctaccc actcccaaaa 1141 tcagaccaga ttggcctgcc agacttcaac gccggcgcca tggagaactg gggactggtg 1201 acctaccggg agaactccct gctgttcgac cccctgtcct cctccagcag caacaaggag 1261 cgggtggtca ctgtgattgc tcatgagctg gcccaccagt ggttcgggaa cctggtgacc 1321 atagagtggt ggaatgacct gtggctgaac gagggcttcg cctcctacgt ggagtacctg 1381 ggtgctgact atgcggagcc cacctggaac ttgaaagacc tcatggtgct gaatgatgtg 1441 taccgcgtga tggcagtgga tgcactggcc tcctcccacc cgctgtccac acccgcctcg 1501 gagatcaaca cgccggccca gatcagtgag ctgtttgacg ccatctccta cagcaagggc 1561 gcctcagtcc tcaggatgct ctccagcttc ctgtccgagg acgtattcaa gcagggcctg 1621 gcgtcctacc tccacacctt tgcctaccag aacaccatct acctgaacct gtgggaccac 1681 ctgcaggagg ctgtgaacaa ccggtccatc caactcccca ccaccgtgcg ggacatcatg 1741 aaccgctgga ccctgcagat gggcttcccg gtcatcacgg tggataccag cacggggacc 1801 ctttcccagg agcacttcct ccttgacccc gattccaatg ttacccgccc ctcagaattc 1861 aactacgtgt ggattgtgcc catcacatcc atcagagatg gcagacagca gcaggactac 1921 tggctgatag atgtaagagc ccagaacgat ctcttcagca catcaggcaa tgagtgggtc 1981 ctgctgaacc tcaatgtgac gggctattac cgggtgaact acgacgaaga gaactggagg 2041 aagattcaga ctcagctgca gagagaccac tcggccatcc ctgtcatcaa tcgggcacag 2101 atcattaatg acgccttcaa cctggccagt gcccataagg tccctgtcac tctggcgctg 2161 aacaacaccc tcttcctgat tgaagagaga cagtacatgc cctgggaggc cgccctgagc 2221 agcctgagct acttcaagct catgtttgac cgctccgagg tctatggccc catgaagaac 2281 tacctgaaga agcaggtcac acccctcttc attcacttca gaaataatac caacaactgg 2341 agggagatcc cagaaaacct gatggaccag tacagcgagg ttaatgccat cagcaccgcc 2401 tgctccaacg gagttccaga gtgtgaggag atggtctctg gccttttcaa gcagtggatg 2461 gagaacccca ataataaccc gatccacccc aacctgcggt ccaccgtcta ctgcaacgct 2521 atcgcccagg gcggggagga ggagtgggac ttcgcctggg agcagttccg aaatgccaca 2581 ctggtcaatg aggctgacaa gctccgggca gccctggcct gcagcaaaga gttgtggatc 2641 ctgaacaggt acctgagcta caccctgaac ccggacttaa tccggaagca ggacgccacc 2701 tctaccatca tcagcattac caacaacgtc attgggcaag gtctggtctg ggactttgtc 2761 cagagcaact ggaagaagct ttttaacgat tatggtggtg gctcgttctc cttctccaac 2821 ctcatccagg cagtgacacg acgattctcc accgagtatg agctgcagca gctggagcag 2881 ttcaagaagg acaacgagga aacaggcttc ggctcaggca cccgggccct ggagcaagcc 2941 ctggagaaga cgaaagccaa catcaagtgg gtgaaggaga acaaggaggt ggtgctccag 3001 tggttcacag aaaacagcaa atagtcccca gcccttgaag tcacccggcc ccgatgcaag 3061 gtgcccacat gtgtccatcc cagcggctgg tgcagggcct ccattcctgg agcccgaggc 3121 accagtgtcc tcccctcaag gacaaagtct ccagcccacg ttctctctgc ctgtgagcca 3181 gtctagttcc tgatgaccca ggctgcctga gcacctccca gcccctgccc ctcatgccaa 3241 ccccgcccta ggcctggcat ggcacctgtc gcccagtgcc ctggggctga tctcagggaa 3301 gcccagctcc agggccagat gagcagaagc tctcgatgga caatgaacgg ccttgctggg 3361 ggccgccctg taccctcttt cacctttccc taaagaccct aaatctgagg aatcaacagg 3421 gcagcagatc tgtatatttt tttctaagag aaaatgtaaa taaaggattt ctagatgaaa 3481 aaaaaaaaaa aaaa // LOCUS HUMAML1A 1025 bp mRNA PRI 28-OCT-1993 DEFINITION Homo sapiens acute myeloid leukemia associated protein (AML1/EAP) mRNA, complete cds. ACCESSION L21756 NID g400340 KEYWORDS AML1 gene; oncogene. SOURCE Homo sapiens adult peripheral blood mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1025) AUTHORS Nucifora,G., Begy,C.R., Erickson,P., Drabkin,H.A. and Rowley,J.D. TITLE The 3;21 translocation in myelodysplasia results in a fusion transcript between the AML1 gene and the gene for EAP, a highly conserved protein associated with the Epstein-Barr virus small RNA EBER 1 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90 (16), 7784-7788 (1993) MEDLINE 93361531 FEATURES Location/Qualifiers source 1..1025 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="peripheral blood" /map="Unassigned" gene 1..786 /gene="AML1/EAP" CDS 1..786 /gene="AML1/EAP" /codon_start=1 /db_xref="PID:g400341" /translation="MNPSRDVHDASTSRRFTPPSTALSPGKMSEALPLGAPDAGAALA GKLRSGDRSMVEVLADHPGELVRTDSPNFLCSVLPTHWRCNKTLPIAFKVVALGDVPD GTLVTVMAGNDENYSAELRNATAAMKNQVARFNDLRFVGRSGRGKSFTLTITVFTNPP QVATYHRAIKITVDGPREPRRHRQKLDDQTKPGSLSFSERLSELEQLRRTAMRVSPHH PAPTPNPRASLNHSTAFNPQPQSQMQESWMLPILSSFCKKGSK" BASE COUNT 266 a 293 c 280 g 186 t ORIGIN 1 atgaatcctt ctagagacgt ccacgatgcc agcacgagcc gccgcttcac gccgccttcc 61 accgcgctga gcccaggcaa gatgagcgag gcgttgccgc tgggcgcccc ggacgccggc 121 gctgccctgg ccggcaagct gaggagcggc gaccgcagca tggtggaggt gctggccgac 181 cacccgggcg agctggtgcg caccgacagc cccaacttcc tctgctccgt gctgcctacg 241 cactggcgct gcaacaagac cctgcccatc gctttcaagg tggtggccct aggggatgtt 301 ccagatggca ctctggtcac tgtgatggct ggcaatgatg aaaactactc ggctgagctg 361 agaaatgcta ccgcagccat gaagaaccag gttgcaagat ttaatgacct caggtttgtc 421 ggtcgaagtg gaagagggaa aagcttcact ctgaccatca ctgtcttcac aaacccaccg 481 caagtcgcca cctaccacag agccatcaaa atcacagtgg atgggccccg agaacctcga 541 agacatcggc agaaactaga tgatcagacc aagcccggga gcttgtcctt ttccgagcgg 601 ctcagtgaac tggagcagct gcggcgcaca gccatgaggg tcagcccaca ccacccagcc 661 cccacgccca accctcgtgc ctccctgaac cactccactg cctttaaccc tcagcctcag 721 agtcagatgc aggaatcatg gatgctgcca attttgagca gtttttgcaa gaaaggatca 781 aagtgaacgg aaaagctggg aaccttggtg gaggggtggt gaccatcgaa aggagcaaga 841 gcaagatcac cgtgacatcc gaggtgcctt tctccaaaag gtatttgaaa tatctcacca 901 aaaaatattt gaagaagaat aatctacgtg actggttgcg cgtagttgct aacagcaaag 961 agagttacga attacgttac ttccagatta accaggacga agaagaggag gaagacgagg 1021 attaa // LOCUS HUMAMPD1 2341 bp mRNA PRI 16-DEC-1994 DEFINITION Human myoadenylate deaminase (AMPD1) mRNA, complete cds. ACCESSION M60092 NID g178543 KEYWORDS myoadenylate deaminase. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2341) AUTHORS Sabina,R.L., Fishbein,W.N., Pezeshkpour,G., Clarke,P.R. and Holmes,E.W. TITLE Molecular analysis of the myoadenylate deaminase deficiencies JOURNAL Neurology 42 (1), 170-179 (1992) MEDLINE 92131279 FEATURES Location/Qualifiers source 1..2341 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="skeletal muscle" /dev_stage="adult" /map="1p13" mRNA 1..2341 /gene="AMPD1" /note="G00-119-677" gene 1..2341 /gene="AMPD1" CDS 85..2328 /gene="AMPD1" /EC_number="3.5.4.6" /codon_start=1 /db_xref="GDB:G00-119-677" /product="myodenlate deaminase" /db_xref="PID:g178544" /translation="MPLFKLPAEEKQIDDAMRNFAEKVFASEVKDEGGRQEISPFDVD EICPISHHEMQAHIFHLETLSTSTEARRKKRFQGRKTVNLSIPLSETSSTKLSHIDEY ISSSPTYQTVPDFQRVQITGDYASGVTVEDFEIVCKGLYRALCIREKYMQKSFQRFPK TPSKYLRNIDGEAWVANESFYPVFTPPVKKGEDPFRTDNLPENLGYHLKMKDGVVYVY PNEAAVSKDEPKPLPYPNLDTFLDDMNFLLALIAQGPVKTYTHRRLKFLSSKFQVHQM LNEMDELKELKNNPHRDFYNCRKVDTHIHAAACMNQKHLLRFIKKSYQIDADRVVYST KEKNLTLKELFAKLKMHPYDLTVDSLDVHAGRQTFQRFDKFNDKYNPVGASELRDLYL KTDNYINGEYFATIIKEVGADLVEAKYQHAEPRLSIYGRSPDEWSKLSSWFVCNRIHC PNMTWMIQVPRIYDVFRSKNFLPHFGKMLENIFMPVFEATINPQADPELSVFLKHITG FDSVDDESKHSGHMFSSKSPKPQEWTLEKNPSYTYYAYYMYANIMVLNSLRKERGMNT FLFRPHCGEAGALTHLMTAFMIADDISHGLNLKKSPVLQYLFFLAQIPIAMSPLSNNS LFLEYAKNPFLDFLQKGLMISLSTDDPMQFHFTKEPLMEEYAIAAQVFKLSTCDMCEV ARNSVLQCGISHEEKVKFLGDNYLEEGPAGNDIRRTNVAQIRMAYRYETWCYELNLIA EGLKSTE" BASE COUNT 675 a 562 c 489 g 615 t ORIGIN 1 ttttatagtg tcagtcagtc accccacagt ctcctctctc ttcttttcta ctgtgctatc 61 ctagaatcaa ggatttcagc aacaatgcct ctgttcaaac tcccagctga agagaaacaa 121 attgatgatg caatgcgcaa ctttgctgaa aaagtgtttg cctctgaagt caaagatgaa 181 ggaggtcgtc aggagatttc cccctttgat gtggatgaga tctgtccgat ttctcatcat 241 gagatgcaag cacacatatt ccatctggag actctgtcca cctccacaga agccaggaga 301 aaaaagcgtt tccaaggacg gaagactgtt aatttgtcca ttccactaag tgaaacatct 361 tccaccaaac tgtcccacat tgatgaatac atttcctcat ctccaaccta ccagaccgtg 421 cctgattttc agagagtgca gattactggt gactatgcct ctggggttac agttgaagat 481 tttgaaattg tttgcaaagg tctgtatcgg gcactatgca tacgtgagaa atacatgcag 541 aagtcgtttc agaggttccc taaaacccct tccaaatact tgcggaacat tgatggtgag 601 gcttgggtag caaatgagag cttctatcca gtctttactc ctcctgtgaa gaagggagag 661 gaccccttcc gaacagacaa ccttcctgaa aacctgggct atcacctcaa aatgaaggac 721 ggtgtagttt acgtctatcc taatgaagca gcagtcagca aagatgagcc taagccactt 781 ccttacccaa atctggacac cttcttagac gatatgaatt ttttacttgc tttaattgct 841 caaggacctg ttaagaccta tacccaccgg cgcctgaagt tcctctcctc caagttccag 901 gtccatcaga tgcttaacga gatggacgag ttaaaggagc tgaaaaacaa cccccaccga 961 gatttttata actgcaggaa ggtggacacc catatccatg cagccgcttg catgaaccag 1021 aaacatctgc tgcgttttat taagaaatct taccaaattg atgctgacag agtggtctat 1081 agcaccaaag agaagaatct gaccctaaag gaactttttg ctaaattaaa aatgcatcct 1141 tatgacctga ctgttgattc tctggatgtt catgctggac gccagacctt ccagcgtttt 1201 gataagttca atgacaaata taatcctgta ggagcaagtg agctacggga cctctacttg 1261 aagacagaca attacattaa tggggaatat tttgccacta tcatcaagga ggtaggtgcg 1321 gacctggtgg aggccaagta ccagcatgct gagccccgcc tgtccatcta tggccgcagt 1381 cctgatgagt ggagcaaact ctcctcctgg ttcgtctgca atcgcatcca ctgccccaac 1441 atgacatgga tgatccaggt tcccaggatc tatgatgtgt tccgttccaa gaatttcctt 1501 ccacattttg gaaaaatgct ggagaatatt ttcatgccag tgtttgaggc caccatcaac 1561 ccccaggctg acccagaact cagtgtcttc ctcaagcata tcactggctt tgacagtgtg 1621 gatgatgagt ccaaacacag tggccacatg ttctcctcca agagtcccaa gccccaggag 1681 tggacattgg aaaagaatcc atcttacact tactatgcct actacatgta tgcaaacatc 1741 atggtgctca acagcctgag aaaggaacga ggcatgaata cgtttctgtt ccgacctcac 1801 tgtggagaag ctggagccct cacccatctc atgacagcat tcatgatagc agatgatatc 1861 tctcatggcc taaatttaaa aaagagtccc gtgctacagt acttgttttt cttagcccaa 1921 attcccatcg ccatgtcacc actaagtaac aatagcctat ttctagagta tgccaaaaat 1981 ccttttttgg atttccttca gaaagggcta atgatctcac tgtctacaga tgacccaatg 2041 caattccact ttaccaagga gcccctaatg gaagaatatg ctattgctgc acaagtcttc 2101 aagctgagca cctgtgatat gtgcgaagtg gcaaggaaca gtgtcttgca gtgtggaatt 2161 tctcatgagg agaaagtaaa gtttctgggc gacaattacc ttgaggaagg ccctgctgga 2221 aatgatatcc ggaggacaaa tgtagcccaa atccgcatgg cctatcgcta tgaaacctgg 2281 tgttatgaac tcaatttaat tgctgagggt cttaaatcaa cagaataaaa aaaagtaaac 2341 c // LOCUS HUMAMPD2 3386 bp RNA PRI 08-FEB-1995 DEFINITION Human AMP deaminase (AMPD2) mRNA. ACCESSION M91029 NID g644508 KEYWORDS AMP deaminase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3359) AUTHORS Bausch-Jurken,M.T., Mahnke-Zizelman,D.K., Morisaki,T. and Sabina,R.L. TITLE Molecular characterization of AMP deaminase isoform L: cloning, sequence and expression of human AMPD2 cDNA JOURNAL Unpublished REFERENCE 2 (bases 1 to 3386) AUTHORS Van den Bergh,F. and Sabina,R.L. TITLE Exon shuffling at the 5' end of the human AMPD2 gene produces multiple transcripts encoding variable N-terminal extensions of isoform L JOURNAL Unpublished REFERENCE 3 (bases 1 to 3386) AUTHORS Sabina,R.L. TITLE Direct Submission JOURNAL Submitted (06-OCT-1994) Richard L Sabina, Biochemistry, Medical College of Wisconsin, 8701, Watertown Plank Road, Milwaukee, Wisconsin, 53226, USA FEATURES Location/Qualifiers source 1..3386 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T-lymphoblast" /tissue_type="placenta fetal heart" gene join(<1..7,236..2491) /gene="AMPD2" CDS join(<1..7,236..2491) /gene="AMPD2" /EC_number="3.5.4.6" /codon_start=2 /product="AMP deaminase isoform L splicing variant" /db_xref="PID:g644509" /translation="AEELFTRSLAESELRSAPYEFPEESPIEQLEERRQRLERQISQD VKLEPDILLRAKQDFLKTDSDSDLQLYKEQGEGQGDRSLRERDVLEREFQRVTISGEE KCGVPFTDLLDAAKSVVRALFIREKYMALSLQSFCPTTRRYLQQLAEKPLETRTYEQG PDTPVSADAPVHPPALEQHPYEHCEPSTMPGDLGLGLRMVRGVVHVYTRREPDEHCSE VELPYPDLQEFVADVNVLMALIINGPIKSFCYRRLQYLSSKFQMHVLLNEMKELAAQK KVPHRDFYNIRKVDTHIHASSCMNQKHLLRFIKRAMKRHLEEIVHVEQGREQTLREVF ESMNLTAYDLSVDTLDVHADRNTFHRFDKFNAKYNPIGESVLREIFIKTDNRVSGKYF AHIIKEVMSDLEESKYQNAELRLSIYGRSRDEWDKLARWAVMHRVHSPNVRWLVQVPR LFDVYRTKGQLANFQEMLENIFLPLFEATVHPASHPELHLFLEHVDGFDSVDDESKPE NHVFNLESPLPEAWVEEDNPPYAYYLYYTFANMAMLNHLRRQRGFHTFVLRPHCGEAG PIHHLVSAFMLAENISHGLLLRKAPVLQYLYYLAQIGIAMSPLSNNSLFLSYHRNPLP EYLSRGLMVSLSTDDPLQFHFTKEPLMEEYSIATQVWKLSSCDMCELARNSVLMSGFS HKVKSHWLGPNYTKEGPEGNDIRRTNVPDIRVGYRYETLCQELALITQAVQSEMLETI PEEAGITMSPGPQ" exon 1..7 /gene="AMPD2" /label=exon2 intron 8..235 /note="present in alternative pre-spliced transcript" /label=intron2 CDS 209..2491 /gene="AMPD2" /EC_number="3.5.4.6" /codon_start=1 /db_xref="GDB:G00-118-753" /product="AMP deaminase isoform L" /db_xref="PID:g178547" /translation="MLTFLPSPQELFTRSLAESELRSAPYEFPEESPIEQLEERRQRL ERQISQDVKLEPDILLRAKQDFLKTDSDSDLQLYKEQGEGQGDRSLRERDVLEREFQR VTISGEEKCGVPFTDLLDAAKSVVRALFIREKYMALSLQSFCPTTRRYLQQLAEKPLE TRTYEQGPDTPVSADAPVHPPALEQHPYEHCEPSTMPGDLGLGLRMVRGVVHVYTRRE PDEHCSEVELPYPDLQEFVADVNVLMALIINGPIKSFCYRRLQYLSSKFQMHVLLNEM KELAAQKKVPHRDFYNIRKVDTHIHASSCMNQKHLLRFIKRAMKRHLEEIVHVEQGRE QTLREVFESMNLTAYDLSVDTLDVHADRNTFHRFDKFNAKYNPIGESVLREIFIKTDN RVSGKYFAHIIKEVMSDLEESKYQNAELRLSIYGRSRDEWDKLARWAVMHRVHSPNVR WLVQVPRLFDVYRTKGQLANFQEMLENIFLPLFEATVHPASHPELHLFLEHVDGFDSV DDESKPENHVFNLESPLPEAWVEEDNPPYAYYLYYTFANMAMLNHLRRQRGFHTFVLR PHCGEAGPIHHLVSAFMLAENISHGLLLRKAPVLQYLYYLAQIGIAMSPLSNNSLFLS YHRNPLPEYLSRGLMVSLSTDDPLQFHFTKEPLMEEYSIATQVWKLSSCDMCELARNS VLMSGFSHKVKSHWLGPNYTKEGPEGNDIRRTNVPDIRVGYRYETLCQELALITQAVQ SEMLETIPEEAGITMSPGPQ" exon 236..366 /gene="AMPD2" /label=exon3 exon 367..3386 /label=exon4-18 3'UTR 2492..3386 polyA_signal 3369..3374 BASE COUNT 658 a 1046 c 973 g 709 t ORIGIN 1 cgccgaggta tcacctggca ccacccgcac cctcaccccg tgtctccatg ccctgcctct 61 gctccccaca cccctcacct caagctgtcc cctcacctca cgcttggctg tctcctgatc 121 ctcagcctct cccaggtacc cctggtcctg ctgccctcac cccatcccca gactctgtag 181 gagagtgccc gagggcggag ggccagccat gctgaccttc cttccctccc cccaggagct 241 gttcacccgc tcactggctg agagcgagct ccgtagtgcc ccgtatgagt tccccgagga 301 gagccccatt gaacagctgg aggagcggcg gcagcggctg gagcggcaga tcagccagga 361 tgtcaagctg gagccagaca tcctgcttcg ggccaagcaa gatttcctga agacggacag 421 tgactcggac ctacagctct acaaggaaca gggtgagggg cagggtgacc ggagcctgcg 481 ggagcgtgat gtgctggaac gggagtttca gcgggtcacc atctctgggg aggagaagtg 541 tggggtgccg ttcacagacc tgctggatgc agccaagagt gtggtgcggg cgctcttcat 601 ccgggagaag tacatggccc tgtccctgca gagcttctgc cccaccaccc gccgctacct 661 gcagcagctg gctgaaaagc ctctggagac ccggacctat gaacagggcc ccgacacccc 721 tgtgtctgct gatgccccgg tgcacccccc tgcgctggag cagcacccgt atgagcactg 781 tgagccaagc accatgcctg gggacctggg cttgggtctg cgcatggtgc ggggtgtggt 841 gcacgtctac acccgcaggg aacccgacga gcattgctca gaggtggagc tgccataccc 901 tgacctgcag gaatttgtgg ctgacgtcaa tgtgctgatg gccctgatta tcaatggccc 961 cataaagtca ttctgctacc gccggctgca gtacctgagc tccaagttcc agatgcatgt 1021 gctactcaat gagatgaagg agctggccgc ccagaagaaa gtgccacacc gagatttcta 1081 caacatccgc aaggtggaca cccacatcca tgcctcgtcc tgcatgaacc agaagcatct 1141 gctgcgcttc atcaagcggg caatgaagcg gcacctggag gagatcgtgc acgtggagca 1201 gggccgtgaa cagacgctgc gggaggtctt tgagagcatg aatctcacgg cctacgacct 1261 gagtgtggac acgctggatg tgcatgcgga caggaacact ttccatcgct ttgacaagtt 1321 taatgccaaa tacaacccta ttggggagtc cgtcctccga gagatcttca tcaagacgga 1381 caacagggta tctgggaagt actttgctca catcatcaag gaggtgatgt cagacctgga 1441 ggagagcaaa taccagaatg cagagctgcg gctctccatt tacgggcgct cgagggatga 1501 gtgggacaag ctggcgcgct gggccgtcat gcaccgcgtg cactccccca acgtgcgctg 1561 gctggtgcag gtgccccgcc tctttgatgt gtaccgtacc aagggccagc tggccaactt 1621 ccaggagatg ctggagaaca tcttcctgcc actgttcgag gccactgtgc accctgccag 1681 ccacccggaa ctgcatctct tcttagagca cgtggatggt tttgacagcg tggatgatga 1741 gtccaagcct gaaaaccatg tcttcaacct ggagagcccc ctgcctgagg cgtgggtgga 1801 ggaggacaac ccaccctatg cctactacct gtactacacc tttgccaaca tggccatgtt 1861 gaaccacctg cgcaggcaga ggggcttcca cacgtttgtg ctgaggccac actgtgggga 1921 ggctgggccc atccaccacc tggtgtcagc cttcatgctg gctgagaaca tttcccacgg 1981 gctccttctg cgcaaggccc ccgtcctgca gtacctgtac tacctggccc agatcggcat 2041 cgccatgtct ccgctcagca acaacagcct cttcctcagc tatcaccgga atccgctacc 2101 ggagtacctg tcccgcggcc tcatggtctc cctgtccact gatgatccct tgcagttcca 2161 cttcaccaag gagccgctga tggaggagta cagcatcgcc acccaggtgt ggaagctcag 2221 ctcctgcgat atgtgtgagc tggcccgcaa cagcgtgctc atgagcggct tctcgcacaa 2281 ggtaaagagc cactggctgg gacccaacta taccaaggaa ggccctgagg ggaatgacat 2341 ccgccggacc aatgtgccag acatccgcgt gggctaccgc tacgagaccc tgtgccagga 2401 gctggcgctc atcacgcagg cagtccagag tgagatgctg gagaccattc cagaggaggc 2461 gggtatcacc atgagcccag ggcctcaatg agcctggtcc atgaagtgcc caccacatcg 2521 cagcactttt accacgtttt gtcctcagac cccgcccatg ctgtgtggtc tctgcatgtc 2581 tccattcttc tctgtctctg tcttgcatgt ctcctaccat gtcactgtcc ctgggccacc 2641 cagtgaaagc aaagcctggg aatctgctca ttgttgtttg ggctcaggta ttgagcctga 2701 tggcccaggt attgagggcc tcccctgctg gtggccctgt cctgggatcc tcagaagcct 2761 gactgtccta tgggcttctc cagtgtccac aggggcttgg gatggttgtg gggggctggc 2821 ccctctagcc tttccggtcc ttcctgggca aatctaagcc ttggccaggg cgaagtttag 2881 gcccctgtct tgttcatgta gccgaggggc aggcggggga cctctacacc tctgctgtgg 2941 gcacggggct gctgagggtc tgtggaactc cagcagctct gcactgggta gagctgggcc 3001 tagagctcag tcacaggcct gggcttcctg gcctgagtgg gtagacgcag gcggcagagg 3061 tgctggacca catctccgcc aagtcactgc ccagcagcct tctccgtcct gtccccagcc 3121 cacgtgctcc ttgggtgtca gcttcctgtg cctctgtggg agaggcagct gccttgtgtt 3181 atgtctgggg ccacagttgc tgcaaagtcc tggatctgcc actcaacccc gggagtggtg 3241 ttcccagtgt ggctcccaga gctttgacca gattgtgatc ccagctggcc cctatgttgt 3301 gttctggact gaggcctttg ctgtgaactg cagtgtttca tacgaaccat ctttcctagt 3361 gcatgagaaa taaagattat ttaagt // LOCUS HUMAMPD3B 3680 bp mRNA PRI 31-DEC-1994 DEFINITION Human AMP deaminase (AMPD3) mRNA, complete cds. ACCESSION M84721 NID g178550 KEYWORDS AMP deaminase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Mahnke-Zizelman,D.K. and Sabina,R.L. TITLE Cloning of human AMP deaminase isoform E cDNAs. Evidence for a third AMPD gene exhibiting alternatively spliced 5'-exons JOURNAL J. Biol. Chem. 267 (29), 20866-20877 (1992) MEDLINE 93015995 FEATURES Location/Qualifiers source 1..3680 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="vascular endothelial, T-lymphoblast" 5'UTR 1..134 /gene="AMPD3" gene 1..3680 /gene="AMPD3" CDS 135..2438 /gene="AMPD3" /EC_number="3.5.4.6" /codon_start=1 /product="AMP deaminase" /db_xref="PID:g178551" /translation="MPRQFPKLNISEVDEQVRLLAEKVFAKVLREEDSKDALSLFTVP EDCPIGQKEAKERELQKELAEQKSVETAKRKKSFKMIRSQSLSLQMPPQQDWKGPPAA SPAMSPTTPVVTGATSLPTPAPYAMPEFQRVTISGDYCAGITLEDYEQAAKSLAKALM IREKYARLAYHRFPRITSQYLGHPRADTAPPEEGLPDFHPPPLPQEDPYCLDDAPPNL DYLVHMQGGILFVYDNKKMLEHQEPHSLPYPDLETYTVDMSHILALITDGPTKTYCHR RLNFLESKFSLHEMLNEMSEFKELKSNPHRDFYNVRKVDTHIHAAACMNQKHLLRFIK HTYQTEPDRTVAEKRGRKITLRQVFDGLHMDPYDLTVDSLDVHAGRQTFHRFDKFNSK YNPVGASELRDLYLKTENYLGGEYFARMVKEVARELEESKYQYSEPRLSIYGRSPEEW PNLAYWFIQHKVYSPNMRWIIQVPRIYDIFRSKKLLPNFGKMLENIFLPLFKATINPQ DHRELHLFLKYVTGFDSVDDESKHSDHMFSDKSPNPDVWTSEQNPPYSYYLYYMYANI MVLNNLRRERGLSTFLFRPHCGEAGSITHLVSAFLTADNISHGLLLKKSPVLQYLYYL AQIPIAMSPLSNNSLFLEYSKNPLREFLHKGLHVSLSTDDPMQFHYTKEALMEEYAIA AQVWKLSTCDLCEIARNSVLQSGLSHQEKQKFLGQNYYKEGPEGNDIRKTNVAQIRMA FRYETLCNELSFLSDAMKSEEITALTN" 3'UTR 2439..3680 /gene="AMPD3" polyA_signal 3659..3664 /gene="AMPD3" BASE COUNT 922 a 950 c 896 g 912 t ORIGIN 1 tcaaccctgc ttggttttag aggattgctc ccgtgggtca cttgaggcag gctccacctt 61 ccccaggagg agtggcagag tccagccagc gctcggagct ggaggcccac gtgggagcag 121 tgagcggctc tgagatgccg cggcagtttc ccaagctgaa catctctgaa gtggatgagc 181 aagtccggct cctggcggag aaggtgtttg ctaaagtgct ccgagaagag gacagcaaag 241 atgccctgtc cctgttcact gtcccagagg actgccccat cgggcaaaag gaagccaagg 301 agagggagct gcagaaggag ctggcagagc agaagtctgt ggagaccgca aaaagaaaga 361 aaagtttcaa gatgattcgg tcccagtccc tgtctctgca aatgccgcca cagcaagatt 421 ggaagggccc cccggcagcc agtccggcca tgtctcccac aacccctgtg gtcactggag 481 ccacttccct gcccacgcca gcaccctatg ccatgcctga gttccagcgg gtcaccatca 541 gcggagatta ctgtgccggg atcactttgg aggactatga gcaggcagcc aagagtctgg 601 ccaaggccct aatgatccgg gagaagtatg cgcggctcgc ctaccaccgc ttcccgcgga 661 tcacatccca gtacctgggt catccgcggg cggatactgc acctccggaa gagggccttc 721 cagacttcca ccctcctcca ctgccccagg aagaccccta ctgcctggat gatgcacccc 781 ccaacctgga ttacttggtc cacatgcagg ggggcatcct ctttgtgtat gataacaaga 841 agatgctgga gcaccaggag ccgcacagcc taccctaccc cgacctggag acctacacgg 901 tggacatgag ccacatcctg gctctcatca ccgatggccc cacgaaaacc tattgtcacc 961 ggcgactgaa ctttctggaa tccaagttca gccttcatga gatgttaaac gaaatgtccg 1021 agttcaaaga gttgaagagt aacccccacc gggacttcta taacgtgaga aaggtggaca 1081 cacacatcca tgcggccgcc tgcatgaacc aaaagcatct gctgcgcttc atcaagcaca 1141 cataccagac ggagcctgac aggactgtgg cagagaagcg gggccggaag atcaccctgc 1201 ggcaggtgtt tgacggcctg cacatggacc cctacgacct cactgtggac tcactggatg 1261 tccacgcggg ccggcagaca ttccaccgct ttgacaagtt caactccaaa tacaaccctg 1321 tgggggccag tgagctgcgt gacctgtatt tgaaaactga aaactatctg ggaggagagt 1381 actttgctcg gatggtcaag gaggttgccc gggagctgga ggagagcaag taccagtact 1441 cagagccacg gctctccatc tacggccgca gtcctgagga gtggcccaac ctggcctact 1501 ggttcatcca gcacaaggtc tactctccca acatgcgctg gatcatccag gtgccccgga 1561 tttatgacat atttaggtca aagaagctgc tgccaaactt tgggaagatg ctggagaaca 1621 tcttcctgcc ccttttcaag gccactatca acccccaaga tcatcgagag cttcacctct 1681 tccttaaata tgtgacgggg tttgacagcg tggatgatga gtccaagcac agcgaccaca 1741 tgttttccga caagagccca aacccggacg tctggaccag tgagcagaac ccaccctaca 1801 gctactacct gtactacatg tatgccaaca tcatggtgct caacaacctc cgcagggagc 1861 gcggcctgag cacgttcctg ttccggccgc actgtgggga agccggctcc atcacccacc 1921 tggtgtctgc cttcctcact gctgacaaca tttcccacgg gctgctcctc aagaagagtc 1981 cggtattgca gtatctctac taccttgctc agatccccat tgccatgtct cctcttagca 2041 acaacagttt gttcctcgaa tattccaaga accctctgag ggaattccta cacaagggac 2101 tgcatgtttc tctttccacc gatgacccca tgcagttcca ctacacgaag gaagcactta 2161 tggaagaata tgccattgca gctcaagtgt ggaagctgag cacctgcgac ctgtgtgaga 2221 tcgccaggaa cagcgtgctg cagagcggcc tctcgcatca ggaaaagcaa aagtttctgg 2281 gacaaaatta ttataaagaa ggacctgaag gaaatgatat tcgaaagaca aatgtggctc 2341 agatccggat ggcattccga tatgagacct tatgcaatga gctcagcttc ctgtctgatg 2401 ctatgaaatc agaagagatc accgccttga ccaactaggt ccagcatttg acatgcattt 2461 taactttttg gttcaatttc aagtctgctg tggctaatag tggtcaagat tccgaactag 2521 gactttcctc tgtgaagagg atgcctctga agaaatttta aactggtgat tttggttgca 2581 ctgctcactt taagagttaa catgctcact tgttagtatt tctgagtaac aagatggtga 2641 cttctccttg gggatctggg agctgagcac ttgtctatac ttgttcctaa ttttccaagt 2701 atttctcttg aaactgccag tgcctgaact gttggggcca ggattttccc tggtcagatg 2761 ccaagtaaca tgtggttttc tgccatactt ttctccattg gcccaggtag gctaattggt 2821 agttgttcat ttcagcctct ggatggctgg ctggccttaa acacaatcaa tttcaaagct 2881 ccattttcat aaaggggcta ctttgaagga gttaagatgg aagacttcct tcttgacaaa 2941 ttgtgttttt agtgaatttc ttaaaccgtt ttatttagcc ctccttccct ctttctagtt 3001 ggaagccaaa tgtactcatg aaaacagcca ctcctattct gagtcttggt ttcttcacct 3061 agaaagtgag ggtttggact agatgagtgg ctttcagggt gttctgtgaa tctcctcatg 3121 aatactttag ggtgggggag ggaagggagt gagtgatgct caggggctgt caaagtgact 3181 gcgttcatca gttttacact ggggctgcta cataatattt tcatttgaac gaagaacttc 3241 aaaaagcaca ggactagatg atctctgttc cttttggctc taatatgcta caactgtagg 3301 ccaattatca ctttaccaat taagagttag gccagataag tgaaatttaa cttaagggca 3361 cacagctaat aagtaatagg cctaaactgg atttccttat tccaaatcct gtcttttccc 3421 cactattcca ttagacccca caaatgttag ttgtgtgtgt gtgtgtgtgt gtttttaatc 3481 actgtaaccg ggtgcatttt tttaaggcaa aatttctccc ttatctactg tgatgacttc 3541 agaagataca atggtcccag gggccaagta gaaagcattt ttaaagatta atctgaatta 3601 agctttatca gtgtactctt tatctgtgtt actagtgcct ggtatgtagt aggtgctcaa 3661 taaatgcata ttgaataact // LOCUS HUMAMY 1549 bp mRNA PRI 26-JAN-1994 DEFINITION Human AD amyloid mRNA, complete cds. ACCESSION L08850 NID g437364 KEYWORDS AD amyloid; Alzheimer's disease; NACP; amyloid. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1549) AUTHORS Ueda,K., Fukushima,H., Masliah,E., Xia,Y., Iwai,A., Yoshimoto,M., Otero,D.A., Kondo,J., Ihara,Y. and Saitoh,T. TITLE Molecular cloning of cDNA encoding an unrecognized component of amyloid in Alzheimer disease JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90 (23), 11282-11286 (1993) MEDLINE 94068588 FEATURES Location/Qualifiers source 1..1549 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_lib="ATCC 37432 of A. Lazzarini" terminator 35..473 CDS 53..475 /standard_name="NACP" /codon_start=1 /product="AD amyloid" /db_xref="PID:g437365" /translation="MDVFMKGLSKAKEGVVAAAEKTKQGVAEAAGKTKEGVLYVGSKT KEGVVHGVATVAEKTKEQVTNVGGAVVTGVTAVAQKTVEGAGSIAAATGFVKKDQLGK NEEGAPQEGILEDMPVDPDNEAYEMPSEEGYQDYEPEA" polyA_signal 1023..1028 polyA_site 1023 polyA_signal 1079..1084 polyA_signal 1529..1534 polyA_site 1549 BASE COUNT 471 a 248 c 342 g 488 t ORIGIN 1 gctctcggag tggccattcg acgacagtgt ggtgtaaagg aattcattag ccatggatgt 61 attcatgaaa ggactttcaa aggccaagga gggagttgtg gctgctgctg agaaaaccaa 121 acagggtgtg gcagaagcag caggaaagac aaaagagggt gttctctatg taggctccaa 181 aaccaaggag ggagtggtgc atggtgtggc aacagtggct gagaagacca aagagcaagt 241 gacaaatgtt ggaggagcag tggtgacggg tgtgacagca gtagcccaga agacagtgga 301 gggagcaggg agcattgcag cagccactgg ctttgtcaaa aaggaccagt tgggcaagaa 361 tgaagaagga gccccacagg aaggaattct ggaagatatg cctgtggatc ctgacaatga 421 ggcttatgaa atgccttctg aggaagggta tcaagactac gaacctgaag cctaagaaat 481 atctttgctc ccagtttctt gagatctgct gacagatgtt ccatcctgta caagtgctca 541 gttccaatgt gcccagtcat gacatttctc aaagttttta cagtgtatct cgaagtcttc 601 catcagcagt gattgaagta tctgtacctg cccccactca gcatttcggt gcttcccttt 661 cactgaagtg aatacatggt agcagggtct ttgtgtgctg tggattttgt ggcttcaatc 721 tacgatgtta aaacaaatta aaaacaccta agtgactacc acttatttct aaatcctcac 781 tatttttttg ttgctgttgt tcagaagttg ttagtgattt gctatcatat attataagat 841 ttttaggtgt cttttaatga tactgtctaa gaataatgac gtattgtgaa atttgttaat 901 atatataata cttaaaaata tgtgagcatg aaactatgca cctataaata ctaaatatga 961 aattttacca ttttgcgatg tgttttattc acttgtgttt gtatataaat ggtgagaatt 1021 aaaataaaac gttatctcat tgcaaaaata ttttattttt atcccatctc actttaataa 1081 taaaaatcat gcttataagc aacatgaatt aagaactgac acaaaggaca aaaatataaa 1141 gttattaata gccatttgaa gaaggaggaa ttttagaaga ggtagagaaa atggaacatt 1201 aaccctacac tcggaattcc ctgaagcaac actgccagaa gtgtgttttg gtatgcactg 1261 gttccttaag tggctgtgat taattattga aagtggggtg ttgaagaccc caactactat 1321 tgtagagtgg tctatttctc ccttcaatcc tgtcaatgtt tgctttatgt attttgggga 1381 actgttgttt gatgtgtatg tgtttataat tgttatacat ttttaattga gccttttatt 1441 aacatatatt gttatttttg tctcgaaata attttttagt taaaatctat tttgtctgat 1501 attggtgtga atgctgtacc tttctgacaa taaataatat tcgaccatg // LOCUS HUMAMY2AZ 1574 bp mRNA PRI 31-OCT-1994 DEFINITION Human pancreatic amylase (amy2A) mRNA, complete cds. ACCESSION M28443 NID g529396 KEYWORDS amylase. SOURCE Homo sapiens pancreas cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1574) AUTHORS Wise,R.J., Karn,R.C., Larsen,S.H., Hodes,M.E., Gardell,S.J. and Rutter,W.J. TITLE A complementary DNA sequence that predicts a human pancreatic amylase primary structure consistent with the electrophoretic mobility of the common isozyme, Amy2 A JOURNAL Mol. Biol. Med. 2 (5), 307-322 (1984) MEDLINE 86091475 FEATURES Location/Qualifiers source 1..1574 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="pancreas" /map="1p21" gene 14..1549 /gene="AMY2A" CDS 14..1549 /gene="AMY2A" /codon_start=1 /db_xref="GDB:G00-120-547" /product="amylase" /db_xref="PID:g529397" /translation="MKFFLLLFTIGFCWAQYSPNTQQGRTSIVHLFEWRWVDIALECE RYLAPKGFGGVQVSPPNENVAIYNPFRPWWERYQPVSYKLCTRSGNEDEFRNMVTRCN NVGVRIYVDAVINHMCGNAVSAGTSSTCGSYFNPGSRDFPAVPYSGWDFNDGKCKTGS GDIENYNDATQVRDCRLTGLLDLALEKDYVRSKIAEYMNHLIDIGVAGFRLDASKHMW PGDIKAILDKLHNLNSNWFPAGSKPFIYQEVIDLGGEPIKSSDYFGNGRVTEFKYGAK LGTVIRKWNGEKMSYLKNWGEGWGFVPSDRALVFVDNHDNQRGHGAGGASILTFWDAR LYKMAVGFMLAHPYGFTRVMSSYRWPRQFQNGNDVNDWVGPPNNNGVIKEVTINPDTT CGNDWVCEHRWRQIRNMVIFRNVVDGQPFTNWYDNGSNQVAFGRGNRGFIVFNNDDWS FSLTLQTGLPAGTYCDVISGDKINGNCTGIKIYVSDDGKAHFSISNSAEDPFIAIHAE SKL" BASE COUNT 458 a 271 c 378 g 467 t ORIGIN 1 acttcaaagc aaaatgaagt tctttctgtt gcttttcacc attgggttct gctgggctca 61 gtattcccca aatacacaac aaggacggac atctattgtt catctgtttg aatggcgatg 121 ggttgatatt gctcttgaat gtgagcgata tttagctccg aagggatttg gaggggttca 181 ggtctctcca ccaaatgaaa atgttgcaat ttacaaccct ttcagacctt ggtgggaaag 241 ataccaacca gttagctata aattatgcac aagatctgga aatgaagatg aatttagaaa 301 catggtgact agatgtaaca atgttggggt tcgtatttat gtggatgctg taattaatca 361 tatgtgtggt aacgctgtga gtgcaggaac aagcagtacc tgtggaagtt acttcaaccc 421 tggaagtagg gactttccag cagtcccata ttctggatgg gatttcaatg atggtaaatg 481 taaaactgga agtggagata tcgagaatta caatgatgct actcaggtca gagattgtcg 541 tctgactggt cttcttgatc ttgcactgga gaaggattac gtgcgttcta agattgccga 601 atatatgaac catctcattg acattggtgt tgcagggttc agacttgatg cttccaagca 661 catgtggcct ggagacataa aggcaatttt ggacaaactg cataatctaa acagtaactg 721 gttccctgca ggaagtaaac ctttcattta ccaggaggta attgatctgg gtggtgagcc 781 aattaaaagc agtgactact ttggtaatgg ccgggtgaca gaattcaagt atggtgcaaa 841 actcggcaca gttattcgca agtggaatgg agagaagatg tcttacttaa agaactgggg 901 agaaggttgg ggtttcgtac cttctgacag agcgcttgtc tttgtggata accatgacaa 961 tcaacgagga catggggctg gaggagcctc tattcttacc ttctgggatg ctaggctgta 1021 caaaatggca gttggattta tgcttgctca tccttacgga tttacacgag taatgtcaag 1081 ctaccgttgg ccaagacagt ttcaaaatgg aaacgatgtt aacgattggg ttgggccacc 1141 aaataataat ggagtaatta aagaagttac tattaatcca gacactactt gtggcaatga 1201 ctgggtctgt gaacatcgat ggcgccaaat aaggaacatg gttattttcc gcaatgtagt 1261 ggatggccag ccttttacaa attggtatga taatgggagc aaccaagtgg cttttgggag 1321 aggaaacaga ggattcattg ttttcaacaa tgatgactgg tcattttctt taactttgca 1381 aactggtctt cctgctggca catactgtga tgtcatttct ggagataaaa ttaatggcaa 1441 ttgcacaggc attaaaattt acgtttctga tgatggcaaa gctcattttt ctattagtaa 1501 ctctgctgaa gatccattta ttgcaattca tgctgaatct aaattgtaaa atttaaaatt 1561 aaatgcatgt cctc // LOCUS HUMANCDA 1071 bp mRNA PRI 05-AUG-1992 DEFINITION Human adipsin/complement factor D mRNA, complete cds. ACCESSION M84526 NID g178625 KEYWORDS adipsin; complement factor D. SOURCE Homo sapiens glioma cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1071) AUTHORS White,R.T., Damm,D.L., Hancock,N., Rosen,B.S., Lowell,B.L., Usher,P., Flier,J.S. and Spiegelman,B.M. TITLE Human adipsin is identical to complement factor D and is expressed at high levels in adipose tissue JOURNAL J. Biol. Chem. 267, 9210-9213 (1992) MEDLINE 92250520 FEATURES Location/Qualifiers source 1..1071 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="human glioma explant HS 683 (ATCC # HTB 138)" /tissue_type="glioma" sig_peptide 55..171 /product="adipsin/complement factor D" CDS 55..741 /note="putative" /codon_start=1 /product="adipsin/complement factor D" /db_xref="PID:g178626" /translation="MLGGREAEAHARPYMASVQLNGAHLCAGVLVAERWVLSAAHCLE DAADGKVQVLLGAHSLSQPEPSKRLYDVLRAVPHPDSQPDTIDHDLLLLQLSEKATLG PAVRPLPWQRVDRDVAPGTLCDVAGWGIVNHAGRRPDSLQHVLLPVLDRATCNRRTHH DGAITERLMCAESNRRDSCKGDSGGPLVCGGVLEGVVTSGSRVCGNRKKPGIYTRVAS YAAWIDSVLA" mat_peptide 172..216 /standard_name="activation peptide" /note="putative" /product="adipsin/complement factor D" BASE COUNT 185 a 334 c 380 g 172 t ORIGIN 1 gcagttctgg tcctcctagg agcggccgcc tgcgcggcgc ggccccgtgg tcggatgctg 61 ggcggcagag aggccgaggc gcacgcgcgg ccctacatgg cgtcggtgca gctgaacggc 121 gcgcacctgt gcgcaggcgt cctggtggcg gagcggtggg tgctgagcgc ggcgcactgc 181 ctggaggacg cggccgacgg gaaggtgcag gttctcctgg gcgcgcactc cctgtcgcag 241 ccggagccct ccaagcgcct gtacgacgtg ctccgcgcag tgccccaccc ggacagccag 301 cccgacacca tcgaccacga cctcctgctg ctacagctgt cggagaaggc cacactgggc 361 cctgctgtgc gccccctgcc ctggcagcgc gtggaccgcg acgtggcacc gggaactctc 421 tgcgacgtgg ccggctgggg catagtcaac cacgcgggcc gccgcccgga cagcctgcag 481 cacgtgctct tgccagtgct ggaccgcgcc acctgcaacc ggcgcacgca ccacgacggc 541 gccatcaccg agcgcttgat gtgcgcggag agcaatcgcc gggacagctg caagggtgac 601 tccgggggcc cgctggtgtg cgggggcgtg ctcgagggcg tggtcacctc gggctcgcgc 661 gtttgcggca accgcaagaa gcccgggatc tacacccgcg tggcgagcta tgcggcctgg 721 atcgacagcg tcctggccta gggtgccggg gcctgaaggt cagggtcacc caagcaacaa 781 agtcccgagc aatgaagtca tccactcctg catctggttg gtctttattg agcacctact 841 atatgcagaa ggggaggccg aggtgggagg atcattggat ctcaggagtt ggagatcagc 901 atgggccacg tagcgcgact ccatctctac aaataaataa aaattagctg ggcaattggc 961 gggcatggag gtgggtgctt gtagttccag ctactcagga ggctgaggtg ggaggatgac 1021 ttgaacgcag gaggctgagg ctgcagtgag ttgtgattgc accactgccc t // LOCUS HUMANDREC 3569 bp mRNA PRI 31-OCT-1994 DEFINITION Human androgen receptor (AR) mRNA, complete cds. ACCESSION M20132 J03180 NID g178627 KEYWORDS androgen receptor. SOURCE Human, epididymal cDNA to mRNA, clones ARHEL[1-3] and ARHFL1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3569) AUTHORS Lubahn,D.B., Joseph,D.R., Sar,M., Tan,J., Higgs,H.N., Larson,R.E., French,F.S. and Wilson,E.M. TITLE The human androgen receptor: complementary deoxyribonucleic acid cloning, sequence analysis and gene expression in prostate JOURNAL Mol. Endocrinol. 2 (12), 1265-1275 (1988) MEDLINE 89112208 COMMENT Draft entry and computer readable sequence [1] kindly submitted by E.M.Wilson, 18-AUG-1988. FEATURES Location/Qualifiers source 1..3569 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Xq11.2-q12" gene 363..3122 /gene="AR" CDS 363..3122 /gene="AR" /note="androgen receptor" /codon_start=1 /db_xref="GDB:G00-120-556" /db_xref="PID:g178628" /translation="MEVQLGLGRVYPRPPSKTYRGAFQNLFQSVREVIQNPGPRHPEA ASAAPPGASLLLLQQQQQQQQQQQQQQQQQQQQQETSPRQQQQQQGEDGSPQAHRRGP TGYLVLDEEQQPSQPQSALECHPERGCVPEPGAAVAASKGLPQQLPAPPDEDDSAAPS TLSLLGPTFPGLSSCSADLKDILSEASTMQLLQQQQQEAVSEGSSSGRAREASGAPTS SKDNYLGGTSTISDNAKELCKAVSVSMGLGVEALEHLSPGEQLRGDCMYAPLLGVPPA VRPTPCAPLAECKGSLLDDSAGKSTEDTAEYSPFKGGYTKGLEGESLGCSGSAAAGSS GTLELPSTLSLYKSGALDEAAAYQSRDYYNFPLALAGPPPPPPPPHPHARIKLENPLD YGSAWAAAAAQCRYGDLASLHGAGAAGPGSGSPSAAASSSWHTLFTAEEGQLYGPCGG GGGGGGGGGGGGGGGGGGGGGGEAGAVAPYGYTRPPQGLAGQESDFTAPDVWYPGGMV SRVPYPSPTCVKSEMGPWMDSYSGPYGDMRLETARDHVLPIDYYFPPQKTCLICGDEA SGCHYGALTCGSCKVFFKRAAEGKQKYLCASRNDCTIDKFRRKNCPSCRLRKCYEAGM TLGARKLKKLGNLKLQEEGEASSTTSPTEETTQKLTVSHIEGYECQPIFLNVLEAIEP GVVCAGHDNNQPDSFAALLSSLNELGERQLVHVVKWAKALPGFRNLHVDDQMAVIQYS WMGLMVFAMGWRSFTNVNSRMLYFAPDLVFNEYRMHKSRMYSQCVRMRHLSQEFGWLQ ITPQEFLCMKALLLFSIIPVDGLKNQKFFDELRMNYIKELDRIIACKRKNPTSCSRRF YQLTKLLDSVQPIARELHQFTFDLLIKSHMVSVDFPEMMAEIISVQVPKILSGKVKPI YFHTQ" BASE COUNT 796 a 1009 c 974 g 790 t ORIGIN 1 taataactca gttcttattt gcacctactt cagtggacac tgaatttgga aggtggagga 61 ttttgttttt ttcttttaag atctgggcat cttttgaatc tacccttcaa gtattaagag 121 acagactgtg agcctagcag ggcagatctt gtccaccgtg tgtcttcttc tgcacgagac 181 tttgaggctg tcagagcgct ttttgcgtgg ttgctcccgc aagtttcctt ctctggagct 241 tcccgcaggt gggcagctag ctgcagcgac taccgcatca tcacagcctg ttgaactctt 301 ctgagcaaga gaaggggagg cggggtaagg gaagtaggtg gaagattcag ccaagctcaa 361 ggatggaagt gcagttaggg ctgggaaggg tctaccctcg gccgccgtcc aagacctacc 421 gaggagcttt ccagaatctg ttccagagcg tgcgcgaagt gatccagaac ccgggcccca 481 ggcacccaga ggccgcgagc gcagcacctc ccggcgccag tttgctgctg ctgcagcagc 541 agcagcagca gcagcagcag cagcagcagc agcagcagca gcagcagcag cagcaagaga 601 ctagccccag gcagcagcag cagcagcagg gtgaggatgg ttctccccaa gcccatcgta 661 gaggccccac aggctacctg gtcctggatg aggaacagca accttcacag ccgcagtcgg 721 ccctggagtg ccaccccgag agaggttgcg tcccagagcc tggagccgcc gtggccgcca 781 gcaaggggct gccgcagcag ctgccagcac ctccggacga ggatgactca gctgccccat 841 ccacgttgtc cctgctgggc cccactttcc ccggcttaag cagctgctcc gctgacctta 901 aagacatcct gagcgaggcc agcaccatgc aactccttca gcaacagcag caggaagcag 961 tatccgaagg cagcagcagc gggagagcga gggaggcctc gggggctccc acttcctcca 1021 aggacaatta cttagggggc acttcgacca tttctgacaa cgccaaggag ttgtgtaagg 1081 cagtgtcggt gtccatgggc ctgggtgtgg aggcgttgga gcatctgagt ccaggggaac 1141 agcttcgggg ggattgcatg tacgccccac ttttgggagt tccacccgct gtgcgtccca 1201 ctccttgtgc cccattggcc gaatgcaaag gttctctgct agacgacagc gcaggcaaga 1261 gcactgaaga tactgctgag tattcccctt tcaagggagg ttacaccaaa gggctagaag 1321 gcgagagcct aggctgctct ggcagcgctg cagcagggag ctccgggaca cttgaactgc 1381 cgtctaccct gtctctctac aagtccggag cactggacga ggcagctgcg taccagagtc 1441 gcgactacta caactttcca ctggctctgg ccggaccgcc gccccctccg ccgcctcccc 1501 atccccacgc tcgcatcaag ctggagaacc cgctggacta cggcagcgcc tgggcggctg 1561 cggcggcgca gtgccgctat ggggacctgg cgagcctgca tggcgcgggt gcagcgggac 1621 ccggttctgg gtcaccctca gccgccgctt cctcatcctg gcacactctc ttcacagccg 1681 aagaaggcca gttgtatgga ccgtgtggtg gtggtggggg tggtggcggc ggcggcggcg 1741 gcggcggcgg cggcggcggc ggcggcggcg gcggcggcga ggcgggagct gtagccccct 1801 acggctacac tcggccccct caggggctgg cgggccagga aagcgacttc accgcacctg 1861 atgtgtggta ccctggcggc atggtgagca gagtgcccta tcccagtccc acttgtgtca 1921 aaagcgaaat gggcccctgg atggatagct actccggacc ttacggggac atgcgtttgg 1981 agactgccag ggaccatgtt ttgcccattg actattactt tccaccccag aagacctgcc 2041 tgatctgtgg agatgaagct tctgggtgtc actatggagc tctcacatgt ggaagctgca 2101 aggtcttctt caaaagagcc gctgaaggga aacagaagta cctgtgcgcc agcagaaatg 2161 attgcactat tgataaattc cgaaggaaaa attgtccatc ttgtcgtctt cggaaatgtt 2221 atgaagcagg gatgactctg ggagcccgga agctgaagaa acttggtaat ctgaaactac 2281 aggaggaagg agaggcttcc agcaccacca gccccactga ggagacaacc cagaagctga 2341 cagtgtcaca cattgaaggc tatgaatgtc agcccatctt tctgaatgtc ctggaagcca 2401 ttgagccagg tgtagtgtgt gctggacacg acaacaacca gcccgactcc tttgcagcct 2461 tgctctctag cctcaatgaa ctgggagaga gacagcttgt acacgtggtc aagtgggcca 2521 aggccttgcc tggcttccgc aacttacacg tggacgacca gatggctgtc attcagtact 2581 cctggatggg gctcatggtg tttgccatgg gctggcgatc cttcaccaat gtcaactcca 2641 ggatgctcta cttcgcccct gatctggttt tcaatgagta ccgcatgcac aagtcccgga 2701 tgtacagcca gtgtgtccga atgaggcacc tctctcaaga gtttggatgg ctccaaatca 2761 ccccccagga attcctgtgc atgaaagcac tgctactctt cagcattatt ccagtggatg 2821 ggctgaaaaa tcaaaaattc tttgatgaac ttcgaatgaa ctacatcaag gaactcgatc 2881 gtatcattgc atgcaaaaga aaaaatccca catcctgctc aagacgcttc taccagctca 2941 ccaagctcct ggactccgtg cagcctattg cgagagagct gcatcagttc acttttgacc 3001 tgctaatcaa gtcacacatg gtgagcgtgg actttccgga aatgatggca gagatcatct 3061 ctgtgcaagt gcccaagatc ctttctggga aagtcaagcc catctatttc cacacccagt 3121 gaagcattgg aaaccctatt tccccacccc agctcatgcc ccctttcaga tgtcttctgc 3181 ctgttataac tctgcactac tcctctgcag tgccttgggg aatttcctct attgatgtac 3241 agtctgtcat gaacatgttc ctgaattcta tttgctgggc tttttttttc tctttctctc 3301 ctttcttttt cttcttccct ccctatctaa ccctcccatg gcaccttcag actttgcttc 3361 ccattgtggc tcctatctgt gttttgaatg gtgttgtatg cctttaaatc tgtgatgatc 3421 ctcatatggc ccagtgtcaa gttgtgcttg tttacagcac tactctgtgc cagccacaca 3481 aacgtttact tatcttatgc cacgggaagt ttagagagct aagattatct ggggaaatca 3541 aaacaaaaaa caagcaaaca aaaaaaaaa // LOCUS HUMANG 2099 bp mRNA PRI 31-OCT-1994 DEFINITION Human angiotensinogen mRNA, complete CDS. ACCESSION K02215 NID g178639 KEYWORDS angiotensin; angiotensinogen. SOURCE Human liver, cDNA to mRNA, clones pHag3 and pHag11. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2099) AUTHORS Kageyama,R., Ohkubo,H. and Nakanishi,S. TITLE Primary structure of human preangiotensinogen deduced from the cloned cDNA sequence JOURNAL Biochemistry 23 (16), 3603-3609 (1984) MEDLINE 85000455 COMMENT Human preangiotensinogen is encoded by two mRNAs that differ only in the length of the 3'-untranslated region. [1] postulates that the two preangiotensinogens arise from a single gene and utilize two separate poly-A signals (bases 1883-1888 and 2083-2088). There are two 'atg' codons that could possibly be used for translation initiation (bases 40-42 and 67-69). FEATURES Location/Qualifiers source 1..2099 /organism="Homo sapiens" /db_xref="taxon:9606" /map="1q42-q43" mRNA <1..1907 /gene="AGT" /note="PAT mRNA (alt.); G00-118-750" mRNA <1..2099 /gene="AGT" /note="PAT mRNA (alt.); G00-118-750" gene 1..2099 /gene="AGT" sig_peptide 40..138 /gene="AGT" /note="preangiotensinogen signal peptide; G00-118-750" CDS 40..1497 /gene="AGT" /note="preangiotensinogen" /codon_start=1 /db_xref="GDB:G00-118-750" /db_xref="PID:g178640" /translation="MRKRAPQSEMAPAGVSLRATILCLLAWAGLAAGDRVYIHPFHLV IHNESTCEQLAKANAGKPKDPTFIPAPIQAKTSPVDEKALQDQLVLVAAKLDTEDKLR AAMVGMLANFLGFRIYGMHSELWGVVHGATVLSPTAVFGTLASLYLGALDHTADRLQA ILGVPWKDKNCTSRLDAHKVLSALQAVQGLLVAQGRADSQAQLLLSTVVGVFTAPGLH LKQPFVQGLALYTPVVLPRSLDFTELDVAAEKIDRFMQAVTGWKTGCSLMGASVDSTL AFNTYVHFQGKMKGFSLLAEPQEFWVDNSTSVSVPMLSGMGTFQHWSDIQDNFSVTQV PFTESACLLLIQPHYASDLDKVEGLTFQQNSLNWMKKLSPRTIHLTMPQLVLQGSYDL QDLLAQAELPAILHTELNLQKLSNDRIRVGEVLNSIFFELEADEREPTESTQQLNKPE VLEVTLNRPFLFAVYDQSATALHFLGRVANPLSTA" mat_peptide 139..1494 /gene="AGT" /note="angiotensinogen mature peptide; G00-118-750" mat_peptide 139..168 /gene="AGT" /note="angiotensin I mature peptide; G00-118-750" mat_peptide 139..162 /gene="AGT" /note="angiotensin II mature peptide; G00-118-750" BASE COUNT 467 a 590 c 561 g 481 t ORIGIN 23 bp upstream of RsaI site. 1 aagaagctgc cgttgttctg ggtactacag cagaagggta tgcggaagcg agcaccccag 61 tctgagatgg ctcctgccgg tgtgagcctg agggccacca tcctctgcct cctggcctgg 121 gctggcctgg ctgcaggtga ccgggtgtac atacacccct tccacctcgt catccacaat 181 gagagtacct gtgagcagct ggcaaaggcc aatgccggga agcccaaaga ccccaccttc 241 atacctgctc caattcaggc caagacatcc cctgtggatg aaaaggccct acaggaccag 301 ctggtgctag tcgctgcaaa acttgacacc gaagacaagt tgagggccgc aatggtcggg 361 atgctggcca acttcttggg cttccgtata tatggcatgc acagtgagct atggggcgtg 421 gtccatgggg ccaccgtcct ctccccaacg gctgtctttg gcaccctggc ctctctctat 481 ctgggagcct tggaccacac agctgacagg ctacaggcaa tcctgggtgt tccttggaag 541 gacaagaact gcacctcccg gctggatgcg cacaaggtcc tgtctgccct gcaggctgta 601 cagggcctgc tagtggccca gggcagggct gatagccagg cccagctgct gctgtccacg 661 gtggtgggcg tgttcacagc cccaggcctg cacctgaagc agccgtttgt gcagggcctg 721 gctctctata cccctgtggt cctcccacgc tctctggact tcacagaact ggatgttgct 781 gctgagaaga ttgacaggtt catgcaggct gtgacaggat ggaagactgg ctgctccctg 841 atgggagcca gtgtggacag caccctggct ttcaacacct acgtccactt ccaagggaag 901 atgaagggct tctccctgct ggccgagccc caggagttct gggtggacaa cagcacctca 961 gtgtctgttc ccatgctctc tggcatgggc accttccagc actggagtga catccaggac 1021 aacttctcgg tgactcaagt gcccttcact gagagcgcct gcctgctgct gatccagcct 1081 cactatgcct ctgacctgga caaggtggag ggtctcactt tccagcaaaa ctccctcaac 1141 tggatgaaga aactgtctcc ccggaccatc cacctgacca tgccccaact ggtgctgcaa 1201 ggatcttatg acctgcagga cctgctcgcc caggctgagc tgcccgccat tctgcacacc 1261 gagctgaacc tgcaaaaatt gagcaatgac cgcatcaggg tgggggaggt gctgaacagc 1321 attttttttg agcttgaagc ggatgagaga gagcccacag agtctaccca acagcttaac 1381 aagcctgagg tcttggaggt gaccctgaac cgcccattcc tgtttgctgt gtatgatcaa 1441 agcgccactg ccctgcactt cctgggccgc gtggccaacc cgctgagcac agcatgaggc 1501 cagggcccca gaacacagtg cctggcaagg cctctgcccc tggcctttga ggcaaaggcc 1561 agcagcagat aacaaccccg gacaaatcag cgatgtgtca cccccagtct cccacctttt 1621 cttctaatga gtcgactttg agctggaaag cagccgtttc tccttggtct aagtgtgctg 1681 catggagtga gcagtagaag cctgcagcgg cacaaatgca cctcccagtt tgctgggttt 1741 attttagaga atgggggtgg ggaggcaaga accagtgttt agcgcgggac tactgttcca 1801 aaaagaattc caaccgacca gcttgtttgt gaaacaaaaa agtgttccct tttcaagttg 1861 agaacaaaaa ttgggtttta aaattaaagt atacattttt gcattgcctt cggtttgtat 1921 ttagtgtctt gaatgtaaga acatgacctc cgtgtagtgt ctgtaatacc ttagtttttt 1981 ccacagatgc ttgtgatttt tgaacaatac gtgaaagatg caagcacctg aatttctgtt 2041 tgaatgcgga acaatagctg gttatttctc ccttgtgtta gtaataaacg tcttgccac // LOCUS HUMANK 6192 bp mRNA PRI 31-OCT-1994 DEFINITION Human erythroid ankyrin mRNA, complete cds. ACCESSION M28880 NID g178645 KEYWORDS ankyrin; membrane skeleton protein. SOURCE Human peripheral blood reticulocyte, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6192) AUTHORS Lambert,S., Yu,H., Prchal,J.T., Lawler,J., Ruff,P., Speicher,D., Cheung,M.C., Kan,Y.W. and Palek,J. TITLE cDNA sequence for human erythrocyte ankyrin JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87 (5), 1730-1734 (1990) MEDLINE 90175370 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Lambert, 06-OCT-89. Some of the cDNAs isolated lack bases 4622-5107 or bases 5629-5763, suggesting that alternative splicing may occur. FEATURES Location/Qualifiers source 1..6192 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="erythrocyte" /map="8p21.1-p11.2" gene 85..5727 /gene="ANK1" CDS 85..5727 /gene="ANK1" /standard_name="ANK" /note="precursor" /codon_start=1 /db_xref="GDB:G00-118-737" /product="ankyrin" /db_xref="PID:g178646" /translation="MPYSVGFREADAATSFLRAARSGNLDKALDHLRNGVDINTCNQN GLNGLHLASKEGHVKMVVELLHKEIILETTTKKGNTALHIAALAGQDEVVRELVNYGA NVNAQSQKGFTPLYMAAQENHLEVVKFLLENGANQNVATEDGFTPLAVALQQGHENVV AHLINYGTKGKVRLPALHIAARNDDTRTAAVLLQNDPNPDVLSKTGFTPLHIAAHYEN LNVAQLLLNRGSSVNFTPQNGITPLHIASRRGNVIMVRLLLDRGAQIETKTKDELTPL HCAARNGHVRISEILLDHGAPIQAKTKNGLSPIHMAAQGDHLDCVRLLLQYDAEIDDI TLDHLTPLHVAAHCGHHRVAKVLLDKGAKPNSRALNGFTPLHIACKKNHVRVMELLLK TGASIDAVTESGLTPLHVASFMGHLPIVKNLLQRGASPNVSNVKVETPLHMAARAGHT EVAKYLLQNKAKVNAKAKDDQTPLHCAARIGHTNMVKLLLENNANPNLATTAGHTPLH IAAREGHVETVLALLEKEASQACMTKKGFTPLHVAAKYGKVRVAELLLERDAHPNAAG KNGLTPLHVAVHHNNLDIVKLLLPRGGSPHSPAWNGYTPLHIAAKQNQVEVARSLLQY GGSANAESVQGVTPLHLAAQEGHAEMVALLLSKQANGNLGNKSGLTPLHLVAQEGHVP VADVLIKHGVMVDATTRMGYTPLHVASHYGNIKLVKFLLQHQADVNAKTKLGYSPLHQ AAQQGHTDIVTLLLKNGASPNEVSSDGTTPLAIAKRLGYISVTDVLKVVTDETSFVLV SDKHRMSFPETVDEILDVSEDEGEELISFKAERRDSRDVDEEKELLDFVPKLDQVVES PAIPRIPCAMPETVVIRSEEQEQASKEYDEDSLIPSSPATETSDNISPVASPVHTGFL VSFMVDARGGSMRGSRHNGLRVVIPPRTCAAPTRITCRLVKPQKLSTPPPLAEEEGLA SRIIALGPTGAQFLSPVIVEIPHFASHGRGDRELVVLRSENGSVWKEHRSRYGESYLD QILNGMDEELGSLEELEKKRVCRIITTDFPLYFVIMSRLCQDYDIIGPEGGSLKSKLV PLVQATFPENAVTKRVKLALQAQPVPDELVTKLLGNQATFSPIVTVEPRRRKFHRPIG LRIPLPPSWTDNPRDSGEGDTTSLRLLCSVIGGTDQAQWEDITGTTKLVYANECANFT TNVSARFWLSDCPRTAEAVNFATLLYKELTAVPYMAKFVIFAKMNDPREGRLRCYCMT DDKVDKTLEQHENFVEVARSRDIEVLEGMSLFAELSGNLVPVKKAAQQRSFHFQSFRE NRLAMPVKVRDSSREPGGSLSFLRKAMKYEDTQHILCHLNITMPPCAKGSGAEDRRRT PTPLALRYSILSESTPGSLSGTEQAEMKMAVISEHLGLSWAELARELQFSVEDINRIR VENPNSLLEQSVALLNLWVIREGQNANMENLYTALQSIDRGEIVNMLEGSGRQSRNLK PDRRHTDRDYSLSPSQMNGYSSLQDELLSPASLGCALSSPLRADQYWNEVAILDAIPL AATEHDTMLEMSDMQVWSAGLTPSLVTAEDSSLECSKAEDSDATGHEWKLEGALSEEP RGPELGSLELVEDDTVDSDATNGLIDLLEQEEGQRSEEKLPGSKRQDDATGAGQDSEN EVSLVSGHQRGQARITHSPTVSQVTERSQDRLQDWDADGSIVSYLQDAAQGSWQEEVT QGPHSFQGTSTMTEGLEPGGSQEYEKVLVSVSEHTWTEQPEAESSQADRDRRQQGQEE QVQEAKNTFTQVVQGNEFQNIPGEQVTEEQFTDEQGNIVTKKIIRKVVRQIDLSSADA AQEHEEVELRGSGLQPDLIEGRKGAQIVKRASLKRGKQ" mat_peptide 88..5724 /gene="ANK1" /note="erythrocyte ankyrin; G00-118-737" BASE COUNT 1487 a 1794 c 1813 g 1098 t ORIGIN 1 agaggctgcg gtgagtccgc cagccccagc tgctcctcct caagccccca aggcccttcg 61 gcgggcccct gctgaaagac cggcatgccc tattctgtgg gcttccgcga agccgatgct 121 gctaccagct ttctgagagc agcaagatca ggtaacttgg acaaagcttt ggatcacctg 181 cggaatgggg tagatattaa cacctgtaac cagaatgggt tgaatggctt gcatctggct 241 tctaaggaag gccatgtgaa aatggtggtt gaacttctgc acaaagaaat cattctagaa 301 acgacaacca agaaggggaa cacggccctg cacatcgctg ctctagccgg gcaggatgag 361 gtggtccggg agcttgtcaa ctatggagcc aacgtcaatg cccagtcaca gaaaggtttt 421 acacccctgt acatggcagc acaagagaac cacttggaag tggttaagtt tttactggaa 481 aatggagcta accagaatgt agccacagaa gacggcttca cgcctctggc ggtagccctg 541 cagcagggcc atgagaacgt cgtcgcgcac ctcatcaact acggcaccaa ggggaaggtg 601 cgcctcccgg ccctgcacat cgcggcccgc aacgacgaca cgcgcacggc tgcggtgctg 661 ctgcagaacg accccaaccc ggacgtgctt tccaagacgg gattcacgcc cctgcacatt 721 gcggctcact acgagaacct caacgtggcc cagttgctcc tcaacagagg cagcagcgtc 781 aatttcacac cacagaacgg catcacgcca ctgcacatcg cctcccgcag gggcaacgtg 841 atcatggtgc ggctgctgct ggatcgggga gcccagatag aaaccaagac caaggacgaa 901 ttgacacctc tccactgtgc agctcgaaat gggcacgtgc gaatctcaga gatcctgctg 961 gaccacgggg caccaatcca agccaaaacc aagaacggcc tgtccccaat tcacatggcg 1021 gctcagggag accacctcga ctgtgtccgg ctcctgttgc aatacgacgc agagatagac 1081 gacatcaccc tggaccacct gaccccactc cacgtggctg cccactgtgg acaccacagg 1141 gtggctaagg tccttctgga taaaggggcc aaacccaact ccagagccct gaatggcttt 1201 acccccttac acatcgcctg caaaaagaac cacgtccgtg tcatggagct gctgctgaag 1261 acgggagcct cgatcgacgc ggtcaccgag tctggcctga cacctctcca cgtggcctcc 1321 ttcatggggc accttcccat cgtgaagaac ctcctgcagc ggggggcgtc gcccaacgtc 1381 tccaacgtga aagtggagac cccgctacac atggcagcca gagccgggca cacggaagtg 1441 gccaaatatt tactccagaa caaagccaaa gtcaatgcca aggccaagga tgaccagacc 1501 ccacttcact gtgcagctcg catcggccac acaaacatgg tgaagctcct gctggaaaat 1561 aacgccaacc ccaacctggc caccaccgcc gggcacaccc ccctgcacat tgcagcccgt 1621 gagggccatg tggaaacagt cctggccctt ctggaaaagg aagcatccca ggcctgcatg 1681 accaagaaag gatttacccc tctgcacgtg gcggccaagt acgggaaggt gcgggtggca 1741 gagctgctgc tggagcggga cgcacacccg aatgctgccg gaaaaaatgg cctgaccccc 1801 ctgcacgtgg ccgtccatca caacaacctg gacatcgtca agctgctgct tccccggggc 1861 ggctccccgc acagccctgc ctggaatggc tacacccctt tgcacatcgc tgccaagcag 1921 aaccaggtgg aggtggcccg tagtctgctg cagtatgggg gctcagcaaa cgccgagtcg 1981 gtgcaaggtg tgacgcccct tcacctggcc gcccaggagg gccacgcaga gatggtggct 2041 ctgctgctct cgaaacaagc caatggcaac ctggggaaca agagcggact cactcccctc 2101 catctggtag cacaagaagg ccacgttcca gtggcagatg tgctgatcaa acacggcgtc 2161 atggtggacg ccaccacccg gatgggctac actcccctcc atgtggccag tcactatgga 2221 aacatcaagc tggtgaagtt tctgctgcag caccaggcag atgtcaatgc caagaccaag 2281 ctaggataca gccccctgca ccaggcagcc cagcagggac acacagacat cgtgactctg 2341 cttctgaaaa acggtgcttc cccaaacgag gtcagctcgg atggaaccac acctctggcc 2401 atagccaagc gcttgggcta catttctgtc accgacgtgc tcaaggtcgt cacggatgaa 2461 accagtttcg tgttagtcag tgataagcat cgaatgagtt tccctgagac agttgatgag 2521 atcctggatg tctcggaaga tgaaggggaa gaactcatca gcttcaaggc tgagaggcgg 2581 gattccaggg atgttgatga agagaaggag ctgctggatt ttgtgccgaa gctagaccaa 2641 gtggtggaat ctccagccat ccccaggatt ccctgtgcca tgcctgagac agtggtgatc 2701 aggtcagaag agcaggagca ggcatctaaa gagtatgatg aggactccct catccccagc 2761 agcccggcca ccgagacctc agacaacatc agcccggtgg ccagcccggt gcatacaggg 2821 tttctggtga gcttcatggt tgacgcccgg ggtggttcca tgagaggaag tcgccacaac 2881 ggcctgcgag tggtgatccc gccacggacg tgcgcagcgc ccacccgcat cacctgccgc 2941 ctggtcaagc cccagaagct cagcacgccg cccccactgg ccgaggagga gggcctggcc 3001 agcaggatca tagcactggg gcccacgggc gcacagttcc tgagccctgt aatcgtggag 3061 atcccgcact ttgcctccca tggccgtgga gaccgcgagc tcgtggttct gaggagcgaa 3121 aacggctccg tgtggaagga gcacaggagc cgctatggag agagctacct ggatcagatc 3181 ctcaacggga tggacgaaga gctggggagc ctggaggagc tagagaagaa gagggtgtgc 3241 cgaatcatca ccaccgactt cccgctgtac ttcgtgatca tgtcacggct ctgccaggac 3301 tacgacatca tcggtcccga agggggctcc ctgaagagca agctggtgcc cctggtacag 3361 gcaacgttcc cggagaatgc cgtcaccaag agagtgaagc tggctctgca ggcccagcct 3421 gtcccggatg agcttgtcac taagctcctg ggcaaccagg ccacattcag ccccattgtc 3481 accgtggagc cccggcgccg gaagttccac cgccccattg ggcttcggat cccactacct 3541 ccttcctgga ccgacaaccc gagggacagt ggggagggag acaccaccag cctgcgcctg 3601 ctttgcagcg tcattggagg aacagaccaa gcccagtggg aagacataac aggaaccacc 3661 aaacttgtat atgccaacga gtgcgccaac ttcaccacca atgtctctgc caggttttgg 3721 ctgtcggact gtcctcggac tgctgaggct gtgaactttg ccaccctgct gtacaaagag 3781 ctcactgcag tgccctacat ggccaaattc gtcatctttg ccaagatgaa tgacccccga 3841 gaggggcgcc tgcgctgcta ctgcatgaca gatgataaag tggacaagac cctggagcag 3901 catgagaact tcgtggaggt ggcccggagc agggacatag aggtgttgga aggaatgtcc 3961 ctgtttgcag aactctctgg gaacctggtg cctgtgaaga aagctgccca gcagcggagc 4021 ttccacttcc agtcatttcg ggagaaccgt ctggccatgc ctgtaaaggt gagggacagc 4081 agtcgagagc cgggagggtc cctgtcgttt ctgcgcaagg cgatgaagta cgaggacacc 4141 cagcacattc tctgccacct gaacatcacc atgcccccct gcgctaaggg aagtggagcc 4201 gaagatagga gaaggacccc gacgcccctg gccctgcgat acagcattct cagtgagtcc 4261 acaccaggtt ctctcagtgg gacagagcag gcagagatga agatggctgt tatctcagag 4321 cacctcggtc tcagctgggc agagttggcc cgggagctgc agttcagtgt ggaagacatc 4381 aacaggatcc gagtggaaaa tcccaactcc ctgttggagc agagtgtggc cttgctgaac 4441 ctctgggtca tccgtgaagg ccaaaacgca aacatggaga atctgtacac agccctgcag 4501 agcattgacc gtggcgagat cgtgaacatg ctggagggtt ccggccgaca gagccgcaac 4561 ttgaagccag acaggcggca caccgaccgc gactactcgc tgtcaccctc ccagatgaat 4621 ggttactcct cactgcagga cgagctgctg tcccctgcct ccctgggctg tgcactttcc 4681 tctccgctac gtgcagacca gtactggaat gaggtggcca tcctagacgc catccccttg 4741 gcggccacgg agcatgacac catgctggag atgtctgaca tgcaggtgtg gtctgcgggc 4801 ctcacgcctt ctctggtcac tgctgaggac tcctctctgg agtgtagcaa ggctgaggac 4861 tctgatgcca caggtcacga gtggaagttg gagggggcac tctcagagga accgcggggc 4921 cccgagttgg gctctctgga acttgtggag gacgacacag tggattcaga tgccacaaat 4981 ggccttatcg atttgcttga acaggaggaa ggtcagaggt cagaagagaa gctgccaggt 5041 tctaagaggc aggatgacgc gacaggtgca gggcaggact cagagaatga agtgtctctt 5101 gtttcaggcc atcagagggg gcaagcccga atcacacatt cccccaccgt gagtcaggtg 5161 acggagagga gtcaggacag actgcaggac tgggatgcag acggctcgat tgtctcatac 5221 ctgcaagatg ctgcacaagg ttcctggcaa gaggaggtca cgcaaggtcc acactcattc 5281 cagggaacaa gtaccatgac tgaagggcta gagcccggtg gatctcagga gtacgagaag 5341 gtcctggtgt ctgtaagtga gcacacgtgg acagaacagc ccgaggctga gagctcccag 5401 gccgacaggg accggaggca gcaaggccaa gaagagcagg tgcaggaggc caagaacacc 5461 ttcacccaag tggtgcaggg gaatgagttt cagaatattc caggggagca ggtgacagag 5521 gagcaattca cggatgagca gggcaacatt gtcaccaaga agatcattcg caaggtggtt 5581 cgacagatag acttgtccag cgccgatgcc gcccaggagc acgaggaggt ggagctgaga 5641 gggagtggcc tacagccgga cctgatagag ggcaggaagg gggcgcagat agtgaagcgg 5701 gccagcctga aaagggggaa acagtgaccc cgagccgctc tccttggagt agcctctcgg 5761 gaggatcaca cctcgacacc caacccctga accccacaca ctctgccatg cacacaggag 5821 gagagctgga cctgagggcc accgcagcgg tgcacacatt cctctgggct gacggcatga 5881 cctctgtaag ggactcctgc tagtcccctc ttggcatgaa tgactgactg tagacgcatg 5941 acctccaggc ttcaatcctg cctcttgcaa tgacagctga tctgtcggaa ccaggacaca 6001 aaagcagcaa gaagcgggga gagagaggga tagaaaacaa gcgcaggaga gcctgcgaac 6061 gcaaaagtga atgagggctt tttgtggctg gggatgggtt ttggttttgg ggtttttttt 6121 ttaaattgtt ttgacttcgt acagggtact ttttcccggc ctcatctgtc agaaatccat 6181 gtgggcttcc tg // LOCUS HUMANONYMO 2754 bp DNA PRI 31-DEC-1994 DEFINITION Human anonymous gene, complete cds. ACCESSION L18972 NID g388011 KEYWORDS . SOURCE Homo sapiens (tissue library: Stratagene #936206) fetus DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2754) AUTHORS Xie,Y.G., Han,F.Y., Peyrard,M., Ruttledge,M.H., Fransson,I., DeJong,P., Collins,J., Dunham,I., Nordenskjold,M. and Dumanski,J.P. TITLE Cloning of a novel, anonymous gene from a megabase-range YAC and cosmid contig in the neurofibromatosis type 2/meningioma region on human chromosome 22q12 JOURNAL Hum. Mol. Genet. 2 (9), 1361-1368 (1993) MEDLINE 94061029 FEATURES Location/Qualifiers source 1..2754 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_lib="Stratagene #936206" /map="22q12" gene 35..2086 /gene="anonymous" CDS 35..2086 /gene="anonymous" /codon_start=1 /db_xref="PID:g388012" /translation="MSSESSKKRKPKVIRSDGAPAEGKRNRSDTEQEGKYYSEEAEVD LRDPGRDYELYKYTCQELQRLMAEIQDLKSRGGKDVAIEIEERRIQSCVHFMTLKKLN RLAHIRLKKGRDQTHEAKQKVDAYHLQLQNLLYEVMHLQKEITKCLEFKSKHEEIDLV SLEEFYKEAPPDISKAEVTMGDPHQQTLARLDWELEQRKRLAEKYRECLSNKEKILKE IEVKKEYLSSLQPRLNSIMQASLPVQEYLFMPFDQAHKQYETARHLPPPLYVLFVQAT AYGQACDKTLSVAIEGSVDEAKALFKPPEDSQDDESDSDAEEEQTTKRRRPTLGVQLD DKRKEMLKRHPLSVMLDLKCKDDSVLHLTFYYLMNLNIMTVKAKVTTAMELITPISAG DLLSPDSVLSCLYPGDHGKKTPNPANQYQFDKVGILTLSDYVLELGHPYLWVQKLGGL HFPKEQPQQTVIADHSLSASHMETTMKLLKTRVQSRLALHKQFASLEHGIVPVTSDCQ YLFPAKVVSRLVKWVTIAHEDYMELHFTKDIVDAGLAGDTNLYYMALIERGTAKLQAA VVLNPGYSSIPPIFQLCLNWKGEKTNSNDDNIRAMEGEVNVCYKELCGPWPSHQLLTN QLQRLCVLLDVYLETESHDDSVEGPKEFPQEKMCLRLFRGPSRMKPFKYNHPQGFFSH R" BASE COUNT 744 a 694 c 723 g 593 t ORIGIN 1 tgccaggttc ttggagctgt gaggaggaac aaccatgtca tcagaatcga gcaaaaaacg 61 gaagcccaaa gtgatccgaa gcgatggagc cccagctgaa ggaaagcgga atcgatctga 121 caccgagcag gaaggtaaat actacagtga ggaggccgag gtggatctgc gggaccctgg 181 cagagactat gagttataca agtacacctg ccaggagcta cagaggctga tggctgagat 241 ccaagacctg aagagcaggg gtggcaagga tgtggcaata gaaatagaag aacggaggat 301 ccagagctgt gtgcatttca tgactctaaa gaagcttaac cgattagccc acatcaggtt 361 gaagaaagga agagatcaga cccacgaggc taagcagaaa gtagatgcct atcatctgca 421 gctccagaac ctgttgtatg aggtgatgca cctacagaag gagatcacca aatgtttgga 481 gtttaagtca aagcatgaag aaattgatct ggtcagttta gaggagtttt ataaggaggc 541 tccaccagat atcagcaagg ccgaagtcac catgggagac cctcaccagc aaacactggc 601 acgtctggac tgggagctgg agcagcggaa aaggctggca gagaagtacc gagagtgcct 661 atctaacaag gagaagattc tcaaggagat tgaggtgaag aaggagtacc tgagcagcct 721 ccagccccgc ctcaacagca tcatgcaggc ttcccttccg gtgcaggagt acctgtttat 781 gccattcgac caggctcaca agcagtatga gacagccaga cacctgccgc ctcccctcta 841 tgtcctcttt gttcaggcca ctgcgtatgg gcaggcctgt gataagacgt tatctgtggc 901 aatcgaaggc agtgtggatg aagccaaggc tctgttcaaa cctccagagg actcccaaga 961 tgacgaaagt gactcagatg ccgaggagga gcagactacg aagcgccgga gacccacact 1021 gggggttcag ttggacgaca aacgcaagga gatgctgaag aggcacccac tgtctgtcat 1081 gctcgacctg aagtgcaaag atgacagtgt gcttcacctg actttctact acctcatgaa 1141 cctcaacatc atgacagtaa aagccaaagt gacaactgcc atggagctga tcacccccat 1201 cagtgcaggt gacttgctgt ctcctgactc agtcctgagt tgcttgtatc ctggggatca 1261 tggaaagaaa actccgaatc cagccaatca gtatcagttt gataaagttg gcatcctgac 1321 tttgagcgac tatgtacttg agctaggtca cccctatttg tgggtgcaga agctgggtgg 1381 cctccacttc cccaaagagc agccccagca aacagtgatt gctgaccact cgctgagcgc 1441 cagccacatg gagaccacca tgaaacttct gaagaccagg gtgcagtccc gcctggccct 1501 ccacaaacag tttgcatccc tagaacatgg cattgtgcca gttaccagtg attgccagta 1561 cctcttccct gccaaggttg tctctcgcct ggtgaaatgg gtgacaattg cccatgagga 1621 ttacatggag ctgcacttca ccaaagacat tgtggatgcg ggactggctg gggacaccaa 1681 tctctactac atggcgctca tcgaaagggg cacagccaaa ctgcaggccg ctgtggtgtt 1741 gaaccctggc tactcctcca tcccacctat tttccagctc tgtttgaact ggaaagggga 1801 gaaaaccaac agcaacgatg acaacattcg ggccatggag ggcgaagtca atgtgtgcta 1861 caaggagctg tgtggccctt ggcccagcca ccagctgttg accaaccagc tgcagcggct 1921 gtgtgtgctg ctggatgttt acctggagac cgagagccat gacgacagtg tggaggggcc 1981 caaggaattt ccccaggaga agatgtgtct gcggctcttc aggggtccta gcaggatgaa 2041 gccatttaaa tacaaccatc ctcagggatt cttcagccat cgctgatctc ccgcgcagac 2101 cgttgtttcc cccaaggcct caccctgagc actgggcttc tgctttctgc tctggcccac 2161 atgtgactct tgatattctc caaagacacc agccaattaa aaagcgtcac ctgaccagta 2221 gcctttgtct gtggttcctg gcaaggtggc tttgcagtct ggaagggcag gtgggagctg 2281 tgacacagtg tgaaaaagca tttgtagaga gactttttct cagcagccaa taaaagcaga 2341 gtggaaaaag attccaattc tgcagagaga tgctcacctc ttgtctacgc acaccctatt 2401 tgtgctttgc ggggtgaggt cctcatgatc ttgtatttat tatcccaagt tcctgctgtt 2461 aagaggtggt aggagaagcc aaaggcagca gagcacaaaa agcaaaactc ttccctcccc 2521 acccgctctt cccattagtc ctgtcagggt tgccgatgga caaattgtct ctgatcgttg 2581 gatgttataa atgtctgaca gtgcagtgca aacagaagac aaactcagtt gatccttgaa 2641 caactcaggg gttaggggca ccaacacccc ctgccctgca cagttgaaaa atccgtgtat 2701 aacttttgac tccctaaaaa cttaactaat agcctgctgt tgaccagtag tatg // LOCUS HUMANP70 2086 bp mRNA PRI 31-OCT-1994 DEFINITION Human lupus p70 (Ku) autoantigen protein mRNA, complete cds. ACCESSION J04611 NID g178649 KEYWORDS Ku; autoantigen; p70 autoantigen. SOURCE Human hepatoma G2 cell line, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2086) AUTHORS Reeves,W.H. and Sthoeger,Z.M. TITLE Molecular cloning of cDNA encoding the p70 (Ku) lupus autoantigen JOURNAL J. Biol. Chem. 264 (9), 5047-5052 (1989) MEDLINE 89174787 COMMENT Draft entry and computer-readable sequence [1] kindly submitted by W.H.Reeves, 21-FEB-1989. FEATURES Location/Qualifiers source 1..2086 /organism="Homo sapiens" /db_xref="taxon:9606" /map="22q11-q13" gene 34..1863 /gene="G22P1" CDS 34..1863 /gene="G22P1" /note="p70 autoantigen" /codon_start=1 /db_xref="GDB:G00-119-963" /db_xref="PID:g178650" /translation="MSGWESYYKTEGDEEAEEEQEENLEASGDYKYSGRDSLIFLVDA SKAMFESQSEDELTPFDMSIQCIQSVYISKIISSDRDLLAVVFYGTEKDKNSVNFKNI YVLQELDNPGAKRILELDQFKGQQGQKRFQDMMGHGSDYSLSEVLWVCANLFSDVQFK MSHKRIMLFTNEDNPHGNDSAKASRARTKAGDLRDTGIFLDLMHLKKPGGFDISLFYR DIISIAEDEDLRVHFEESSKLEDLLRKVRAKETRKRALSRLKLKLNKDIVISVGIYNL VQKALKPPPIKLYRETNEPVKTKTRTFNTSTGGLLLPSDTKRSQIYGSRQIILEKEET EELKRFDDPGLMLMGFKPLVLLKKHHYLRPSLFVYPEESLVIGSSTLFSALLIKCLEK EVAALCRYTPRRNIPPYFVALVPQEEELDDQKIQVTPPGFQLVFLPFADDKRKMPFTE KIMATPEQVGKMKAIVEKLRFTYRSDSFENPVLQQHFRNLEALALDLMEPEQAVDLTL PKVEAMNKRLGSLVDEFKELVYPPDYNPEGKVTKRKHDNEGSGSKRPKVEYSEEELKT HISKGTLGKFTVPMLKEACRAYGLKSGLKKQELLEALTKHFQD" BASE COUNT 589 a 461 c 550 g 486 t ORIGIN 1 cgcttccctg cgccaaagtg agcagtagcc aacatgtcag ggtgggagtc atattacaaa 61 accgagggcg atgaagaagc agaggaagaa caagaagaga accttgaagc aagtggagac 121 tataaatatt caggaagaga tagtttgatt tttttggttg atgcctccaa ggctatgttt 181 gaatctcaga gtgaagatga gttgacacct tttgacatga gcatccagtg tatccaaagt 241 gtgtacatca gtaagatcat aagcagtgat cgagatctct tggctgtggt gttctatggc 301 accgagaaag acaaaaattc agtgaatttt aaaaatattt acgtcttaca ggagctggat 361 aatccaggtg caaaacgaat tctagagctt gaccagttta aggggcagca gggacaaaaa 421 cgtttccaag acatgatggg ccacggatct gactactcac tcagtgaagt gctgtgggtc 481 tgtgccaacc tctttagtga tgtccaattc aagatgagtc ataagaggat catgctgttc 541 accaatgaag acaaccccca tggcaatgac agtgccaaag ccagccgggc caggaccaaa 601 gccggtgatc tccgagatac aggcatcttc cttgacttga tgcacctgaa gaaacctggg 661 ggctttgaca tatccttgtt ctacagagat atcatcagca tagcagagga tgaggacctc 721 agggttcact ttgaggaatc cagcaagcta gaagacctgt tgcggaaggt tcgcgccaag 781 gagaccagga agcgagcact cagcaggtta aagctgaagc tcaacaaaga tatagtgatc 841 tctgtgggca tttataatct ggtccagaag gctctcaagc ctcctccaat aaagctctat 901 cgggaaacaa atgaaccagt gaaaaccaag acccggacct ttaatacaag tacaggcggt 961 ttgcttctgc ctagcgatac caagaggtct cagatctatg ggagtcgtca gattatactg 1021 gagaaagagg aaacagaaga gctaaaacgg tttgatgatc caggtttgat gctcatgggt 1081 ttcaagccgt tggtactgct gaagaaacac cattacctga ggccctccct gttcgtgtac 1141 ccagaggagt cgctggtgat tgggagctca accctgttca gtgctctgct catcaagtgt 1201 ctggagaagg aggttgcagc attgtgcaga tacacacccc gcaggaacat ccctccttat 1261 tttgtggctt tggtgccaca ggaagaagag ttggatgacc agaaaattca ggtgactcct 1321 ccaggcttcc agctggtctt tttacccttt gctgatgata aaaggaagat gccctttact 1381 gaaaaaatca tggcaactcc agagcaggtg ggcaagatga aggctatcgt tgagaagctt 1441 cgcttcacat acagaagtga cagctttgag aaccccgtgc tgcagcagca cttcaggaac 1501 ctggaggcct tggccttgga tttgatggag ccggaacaag cagtggacct gacattgccc 1561 aaggttgaag caatgaataa aagactgggc tccttggtgg atgagtttaa ggagcttgtt 1621 tacccaccag attacaatcc tgaagggaaa gttaccaaga gaaaacacga taatgaaggt 1681 tctggaagca aaaggcccaa ggtggagtat tcagaagagg agctgaagac ccacatcagc 1741 aagggtacgc tgggcaagtt cactgtgccc atgctgaaag aggcctgccg ggcttacggg 1801 ctgaagagtg gtctgaagaa gcaggagctg ctggaagccc tcaccaagca cttccaggac 1861 tgaccagagg ccgcgcgtcc agctgccctt ccgcagtgtg gccaggctgc ctggccttgt 1921 cctcagccag ttaaaatgtg tttctcctga gctaggaaga gtctacccga cataagtcga 1981 gggactttat gtttttgagg ctttctgttg ccatggtgat ggtgtagccc tcccactttg 2041 ctgttcctta ctttactgcc tgaataaaga gccctaagtt tgtact // LOCUS HUMANPCR 2081 bp mRNA PRI 31-OCT-1994 DEFINITION Human atrial natriuretic peptide clearance receptor (ANP C-receptor) mRNA, complete cds. ACCESSION M59305 NID g178651 KEYWORDS atrial natriuretic peptide clearance receptor. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2081) AUTHORS Porter,J.G., Arfsten,A., Fuller,F., Miller,J.A., Gregory,L.C. and Lewicki,J.A. TITLE Isolation and functional expression of the human atrial natriuretic peptide clearance receptor cDNA JOURNAL Biochem. Biophys. Res. Commun. 171 (2), 796-803 (1990) MEDLINE 90386656 FEATURES Location/Qualifiers source 1..2081 /organism="Homo sapiens" /db_xref="taxon:9606" /map="5p14-p13" gene 363..1988 /gene="ANPRC" sig_peptide 363..497 /gene="ANPRC" /note="2 of 2 possible signal peptides; G00-125-201; putative" CDS 363..1988 /gene="ANPRC" /codon_start=1 /db_xref="GDB:G00-125-201" /product="atrial natriuretic peptide clearance receptor" /db_xref="PID:g178652" /translation="MPSLLVLTFSPCVLLGWALLAGGTGGGGVGGGGGGAGIGGGRQE REALPPQKIEVLVLLPQDDSYLFSLTRVRPAIEYALRSVEGNGTGRRLLPPGTRFQVA YEDSDCGNRALFSLVDRVAAARGAKPDLILGPVCEYAAAPVARLASHWDLPMLSAGAL AAGFQHKDSEYSHLTRVAPAYAKMGEMMLALFRHHHWSRAALVYSDDKLERNCYFTLE GVHEVFQEEGLHTSIYSFDETKDLDLEDIVRNIQASERVVIMCASSDTIRSIMLVAHR HGMTSGDYAFFNIELFNSSSYGDGSWKRGDKHDFEAKQAYSSLQTVTLLRTVKPEFEK FSMEVKSSVEKQGLNMEDYVNMFVEGFHDAILLYVLALHEVLRAGYSKKDGGKIIQQT WNRTFEGIAGQVSIDANGDRYGDFSVIAMTDVEAGTQEVIGDYFGKEGRFEMRPNVKY PWGPLKLRIDENRIVEHTNSSPCKSSGGLEESAVTGIVVGALLGAGLLMAFYFFRKKY RITIERRTQQEESNLGKHRELREDSIRSHFSVA" sig_peptide 363..416 /gene="ANPRC" /note="1 of 2 possible signal peptides; G00-125-201; putative" mat_peptide 417..1985 /gene="ANPRC" /note="1 of 2 possible mature peptides; G00-125-201; putative" /product="atrial natriuretic peptide clearance receptor" mat_peptide 498..1985 /gene="ANPRC" /note="2 of 2 possible mature peptides; G00-125-201; putative" /product="atrial natriuretic peptide clearance receptor" BASE COUNT 502 a 488 c 621 g 470 t ORIGIN 1 gcatggtcga ctacacgccc aaataagaag ccacctctaa gcaaaatagc tatatgtata 61 aacggagggc gaatatatac aagtatatat atatgtatat tacagacgca caggtttaca 121 cccggtgaac tttttctttt tctttttctt tttccttttt ttttaagaaa aactagtgac 181 attggagaga aggacgcttc ctctctatct tttggcgcat tagtgaaggg ggtattctat 241 tttgttaaag cgcccaaggg gaccgggaac cttggagaga agagtgggga ggaaagagga 301 agggtgggtg gggggcagag ggcgagtcgg cggcggcgag ggcaagctct tcttgcggca 361 cgatgccgtc tctgctggtg ctcactttct ccccgtgcgt actactcggc tgggcgttgc 421 tggccggcgg caccggtggc ggtggcgttg gcggcggcgg cggtggcgcg ggcataggcg 481 gcggacgcca ggagagagag gcgctgcctc cacagaagat cgaggtgctg gtgttactgc 541 cccaggatga ctcgtacttg ttttcactca cccgggtgcg gccggccatc gagtatgctc 601 tgcgcagcgt ggagggcaac gggactggga ggcggcttct gccgccgggc actcgcttcc 661 aggtggctta cgaggattca gactgtggga accgtgcgct cttcagcttg gtggaccgcg 721 tggcggcggc gcggggcgcc aagccagacc ttatcctggg gccagtgtgc gagtatgcag 781 cagcgccagt ggcccggctt gcatcgcact gggacctgcc catgctgtcg gctggggcgc 841 tggccgctgg cttccagcac aaggactctg agtactcgca cctcacgcgc gtggcgcccg 901 cctacgccaa gatgggcgag atgatgctcg ccctgttccg ccaccaccac tggagccgcg 961 ctgcactggt ctacagcgac gacaagctgg agcggaactg ctacttcacc ctcgaggggg 1021 tccacgaggt cttccaggag gagggtttgc acacgtccat ctacagtttc gacgagacca 1081 aagacttgga tctggaagac atcgtgcgca atatccaggc cagtgagaga gtggtgatca 1141 tgtgtgcgag cagtgacacc atccggagca tcatgctggt ggcgcacagg catggcatga 1201 ccagtggaga ctacgccttc ttcaacattg agctcttcaa cagctcttcc tatggagatg 1261 gctcatggaa gagaggagac aaacacgact ttgaagctaa gcaagcatac tcgtccctcc 1321 agacagtcac tctactgagg acagtgaaac ctgagtttga gaagttttcc atggaggtga 1381 aaagttcagt tgagaaacaa gggctcaata tggaggatta cgttaacatg tttgttgaag 1441 gattccacga tgccatcctc ctctacgtct tggctctaca tgaagtactc agagctggtt 1501 acagcaaaaa ggatggaggg aaaattatac agcagacttg gaacagaaca tttgaaggta 1561 tcgccgggca ggtgtccata gatgccaacg gagaccgata tggggatttc tctgtgattg 1621 ccatgactga tgtggaggcg ggcacccagg aggttattgg tgattatttt ggaaaagaag 1681 gtcgttttga aatgcggccg aatgtcaaat atccttgggg ccctttaaaa ctgagaatag 1741 atgaaaaccg aattgtagag catacaaaca gctctccctg caaatcatca ggtggcctag 1801 aagaatcggc agtgacagga attgtcgtgg gggctttact aggagctggc ttgctaatgg 1861 ccttctactt tttcaggaag aaatacagaa taaccattga gaggcgaacc cagcaagaag 1921 aaagtaacct tggaaaacat cgggaattac gggaagattc catcagatcc catttttcag 1981 tagcttaaag gaagcccccc actttttttt tttctgcctg agattcttta aggagataga 2041 cgggttgaaa gacatcaatg aacagaaggg gcgttcttga a // LOCUS HUMANTCD2 1504 bp mRNA PRI 31-OCT-1994 DEFINITION Human surface antigen CD2 mRNA, complete cds. ACCESSION M16445 NID g178668 KEYWORDS surface antigen. SOURCE Human HPB-ALL T-cell leukemia cell line, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1504) AUTHORS Seed,B. and Aruffo,A. TITLE Molecular cloning of the CD2 antigen, the T-cell erythrocyte receptor, by a rapid immunoselection procedure JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84 (10), 3365-3369 (1987) MEDLINE 87204137 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by B.Seed, 13-OCT-1987. A polyadenylation signal is located at positions 1490-1495. FEATURES Location/Qualifiers source 1..1504 /organism="Homo sapiens" /db_xref="taxon:9606" /map="1p13" sig_peptide 7..63 /gene="CD2" /note="surface antigen CD2 signal peptide" CDS 7..1062 /gene="CD2" /note="surface antigen CD2 precursor" /codon_start=1 /db_xref="GDB:G00-118-735" /db_xref="PID:g178669" /translation="MSFPCKFVASFLLIFNVSSKGAVSKEITNALETWGALGQDINLD IPSFQMSDDIDDIKWEKTSDKKKIAQFRKEKETFKEKDTYKLFKNGTLKIKHLKTDDQ DIYKVSIYDTKGKNVLEKIFDLKIQERVSKPKISWTCINTTLTCEVMNGTDPELNLYQ DGKHLKLSQRVITHKWTTSLSAKFKCTAGNKVSKESSVEPVSCPEKGLDIYLIIGICG GGSLLMVFVALLVFYITKRKKQRSRRNDEELETRAHRVATEERGRKPQQIPASTPQNP ATSQHPPPPPGHRSQAPSHRPPPPGHRVQHQPQKRPPAPSGTQVHQQKGPPLPRPRVQ PKPPHGAAENSLSPSSN" gene 7..1062 /gene="CD2" mat_peptide 64..1059 /gene="CD2" /note="surface antigen CD2" BASE COUNT 462 a 356 c 328 g 358 t ORIGIN Chromosome 1p13. 1 cctaagatga gctttccatg taaatttgta gccagcttcc ttctgatttt caatgtttct 61 tccaaaggtg cagtctccaa agagattacg aatgccttgg aaacctgggg tgccttgggt 121 caggacatca acttggacat tcctagtttt caaatgagtg atgatattga cgatataaaa 181 tgggaaaaaa cttcagacaa gaaaaagatt gcacaattca gaaaagagaa agagactttc 241 aaggaaaaag atacatataa gctatttaaa aatggaactc tgaaaattaa gcatctgaag 301 accgatgatc aggatatcta caaggtatca atatatgata caaaaggaaa aaatgtgttg 361 gaaaaaatat ttgatttgaa gattcaagag agggtctcaa aaccaaagat ctcctggact 421 tgtatcaaca caaccctgac ctgtgaggta atgaatggaa ctgaccccga attaaacctg 481 tatcaagatg ggaaacatct aaaactttct cagagggtca tcacacacaa gtggaccacc 541 agcctgagtg caaaattcaa gtgcacagca gggaacaaag tcagcaagga atccagtgtc 601 gagcctgtca gctgtccaga gaaaggtctg gacatctatc tcatcattgg catatgtgga 661 ggaggcagcc tcttgatggt ctttgtggca ctgctcgttt tctatatcac caaaaggaaa 721 aaacagagga gtcggagaaa tgatgaggag ctggagacaa gagcccacag agtagctact 781 gaagaaaggg gccggaagcc ccaacaaatt ccagcttcaa cccctcagaa tccagcaact 841 tcccaacatc ctcctccacc acctggtcat cgttcccagg cacctagtca tcgtcccccg 901 cctcctggac accgtgttca gcaccagcct cagaagaggc ctcctgctcc gtcgggcaca 961 caagttcacc agcagaaagg cccgcccctc cccagacctc gagttcagcc aaaacctccc 1021 catggggcag cagaaaactc attgtcccct tcctctaatt aaaaaagata gaaactgtct 1081 ttttcaataa aaagcactgt ggatttctgc cctcctgatg tgcatatccg tacttccatg 1141 aggtgttttc tgtgtgcaga acattgtcac ctcctgaggc tgtgggccac agccacctct 1201 gcatcttcga actcagccat gtggtcaaca tctggagttt ttggtctcct cagagagctc 1261 catcacacca gtaaggagaa gcaatataag tgtgattgca agaatggtag aggaccgagc 1321 acagaaatct tagagatttc ttgtcccctc tcaggtcatg tgtagatgcg ataaatcaag 1381 tgattggtgt gcctgggtct cactacaagc agcctatctg cttaagagac tctggagttt 1441 cttatgtgcc ctggtggaca cttgcccacc atcctgtgag taaaagtgaa ataaaagctt 1501 tgac // LOCUS HUMANTCD36 1870 bp mRNA PRI 15-JUN-1990 DEFINITION Human CD36 antigen mRNA, complete cds. ACCESSION M24795 NID g178670 KEYWORDS cell surface antigen; cell surface receptor; erythrocyte antigen; monocyte antigen; platelet antigen. SOURCE Human placenta, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1870) AUTHORS Oquendo,P., Hundt,E., Lawler,J. and Seed,B. TITLE CD36 directly mediates cytoadherence of Plasmodium falciparum parasitized erythrocytes JOURNAL Cell 58, 95-101 (1989) MEDLINE 89324065 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.Seed, 12-MAY-1989. FEATURES Location/Qualifiers source 1..1870 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 211..1629 /note="CD36 antigen" /codon_start=1 /db_xref="PID:g178671" /translation="MGCDRNCGLIAGAVIGAVLAVFGGILMPVGDLLIQKTIKKQVVL EEGTIAFKNWVKTGTEVYRQFWIFDVQNPQEVMMNSSNIQVKQRGPYTYRVRFLAKEN VTQDAEDNTVSFLQPNGAIFEPSLSVGTEADNFTVLNLAVAAASHIYQNQFVQMILNS LINKSKSSMFQVRTLRELLWGYRDPFLSLVPYPVTTTVGLFYPYNNTADGVYKVFNGK DNISKVAIIDTYKGKRNLSYWESHCDMINGTDAASFPPFVEKSQVLQFFSSDICRSIY AVFESDVNLKGIPVYRFVLPSKAFASPVENPDNYCFCTEKIISKNCTSYGVLDISKCK EGRPVYISLPHFLYASPDVSEPIDGLNPNEEEHRTYLDIEPITGFTLQFAKRLQVNLL VKPSEKIQVLKNLKRNYIVPILWLNETGTIGDEKANMFRSQVTGKINLLGLIEMILLS VGVVMFVAFMISYCACRSKTIK" BASE COUNT 599 a 343 c 356 g 572 t ORIGIN Unreported. 1 gaaaaatcct tcttagccat tttaaagata gctttccaat gattagacga attgattctt 61 tctgtgactc atcagttcct ttcctgtaaa attcatgtct tgctgttgat ttgtgaataa 121 gaaccagagc ttgtagaaac cactttaatc atatccagga gtttgcaaga aacaggtgct 181 taacactaat tcacctcctg aacaagaaaa atgggctgtg accggaactg tgggctcatc 241 gctggggctg tcattggtgc tgtcctggct gtgtttggag gtattctaat gccagttgga 301 gacctgctta tccagaagac aattaaaaag caagttgtcc tcgaagaagg tacaattgct 361 tttaaaaatt gggttaaaac aggcacagaa gtttacagac agttttggat ctttgatgtg 421 caaaatccac aggaagtgat gatgaacagc agcaacattc aagttaagca aagaggtcct 481 tatacgtaca gagttcgttt tctagccaag gaaaatgtaa cccaggacgc tgaggacaac 541 acagtctctt tcctgcagcc caatggtgcc atcttcgaac cttcactatc agttggaaca 601 gaggctgaca acttcacagt tctcaatctg gctgtggcag ctgcatccca tatctatcaa 661 aatcaatttg ttcaaatgat cctcaattca cttattaaca agtcaaaatc ttctatgttc 721 caagtcagaa ctttgagaga actgttatgg ggctataggg atccattttt gagtttggtt 781 ccgtaccctg ttactaccac agttggtctg ttttatcctt acaacaatac tgcagatgga 841 gtttataaag ttttcaatgg aaaagataac ataagtaaag ttgccataat cgacacatat 901 aaaggtaaaa ggaatctgtc ctattgggaa agtcactgcg acatgattaa tggtacagat 961 gcagcctcat ttccaccttt tgttgagaaa agccaggtat tgcagttctt ttcttctgat 1021 atttgcaggt caatctatgc tgtatttgaa tccgacgtta atctgaaagg aatccctgtg 1081 tatagatttg ttcttccatc caaggccttt gcctctccag ttgaaaaccc agacaactat 1141 tgtttctgca cagaaaaaat tatctcaaaa aattgtacat catatggtgt gctagacatc 1201 agcaaatgca aagaagggag acctgtgtac atttcacttc ctcattttct gtatgcaagt 1261 cctgatgttt cagaacctat tgatggatta aacccaaatg aagaagaaca taggacatac 1321 ttggatattg aacctataac tggattcact ttacaatttg caaaacggct gcaggtcaac 1381 ctattggtca agccatcaga aaaaattcaa gtattaaaga atctgaagag gaactatatt 1441 gtgcctattc tttggcttaa tgagactggg accattggtg atgagaaggc aaacatgttc 1501 agaagtcaag taactggaaa aataaacctc cttggcctga tagaaatgat cttactcagt 1561 gttggtgtgg tgatgtttgt tgcttttatg atttcatatt gtgcatgcag atcgaaaaca 1621 ataaaataag tatgtaccaa aaaatattgc ttcaataata ttagcttata tattacttgt 1681 tttcacttta tcaaagagaa gttacatatt aggccatata tatttctaga catgtctagc 1741 cactgatcat ttttaaatat aggtaaataa acctataaat attatcacgc agatcactaa 1801 agtatatctt taattctggg agaaatgaga taaaagatgt acttgtgacc attgtaacaa 1861 tagcacaaat // LOCUS HUMANTIIR 1575 bp mRNA PRI 31-DEC-1994 DEFINITION Human angiotensin II type-1 receptor (AT1) mRNA, complete cds. ACCESSION M93394 NID g178680 KEYWORDS angiotensin II type-1 receptor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1575) AUTHORS Curnow,K.M., Pascoe,L. and White,P.C. TITLE Genetic analysis of the human type-1 angiotensin II receptor JOURNAL Mol. Endocrinol. 6 (7), 1113-1118 (1992) MEDLINE 92375105 FEATURES Location/Qualifiers source 1..1575 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..1575 /gene="AT1" 5'UTR 1..172 /gene="AT1" misc_feature 42..125 /gene="AT1" /note="Unprocessed intron, use of an alternate splice donor site or additional 5' untranslated exon; present in about half of all transcripts in most tissues expressing AT1." CDS 173..1252 /gene="AT1" /codon_start=1 /product="angiotensin II type-1 receptor" /db_xref="PID:g178681" /translation="MILNSSTEDGIKRIQDDCPKAGRHNYIFVMIPTLYSIIFVVGIF GNSLVVIVIYFYMKLKTVASVFLLNLALADLCFLLTLPLWAVYTAMEYRWPFGNYLCK IASASVSFNLYASVFLLTCLSIDRYLAIVHPMKSRLRRTMLVAKVTCIIIWLLAGLAS LPAIIHRNVFFIENTNITVCAFHYESQNSTLPIGLGLTKNILGFLFPFLIILTSYTLI WKALKKAYEIQKNKPRNDDIFKIIMAIVLFFFFSWIPHQIFTFLDVLIQLGIIRDCRI ADIVDTAMPITICIAYFNNCLNPLFYGFLGKKFKRYFLQLLKYIPPKAKSHSNLSTKM STLSYRPSDNVSSSTKKPAPCFEVE" 3'UTR 1253..1575 /gene="AT1" BASE COUNT 440 a 339 c 302 g 494 t ORIGIN 1 gccggccctc ggcgggacgt gacgcagcgc ccggggcgcg ggtttgatat ttgacaaatt 61 gatctaaaat ggctgggttt ttatctgaat aactcactga tgccatccca gaaagtcggc 121 accaggtgta tttgatatag tgtttgcaac aaattcgacc caggtgatca aaatgattct 181 caactcttct actgaagatg gtattaaaag aatccaagat gattgtccca aagctggaag 241 gcataattac atatttgtca tgattcctac tttatacagt atcatctttg tggtgggaat 301 atttggaaac agcttggtgg tgatagtcat ttacttttat atgaagctga agactgtggc 361 cagtgttttt cttttgaatt tagcactggc tgacttatgc tttttactga ctttgccact 421 atgggctgtc tacacagcta tggaataccg ctggcccttt ggcaattacc tatgtaagat 481 tgcttcagcc agcgtcagtt tcaacctgta cgctagtgtg tttctactca cgtgtctcag 541 cattgatcga tacctggcta ttgttcaccc aatgaagtcc cgccttcgac gcacaatgct 601 tgtagccaaa gtcacctgca tcatcatttg gctgctggca ggcttggcca gtttgccagc 661 tataatccat cgaaatgtat ttttcattga gaacaccaat attacagttt gtgctttcca 721 ttatgagtcc caaaattcaa cccttccgat agggctgggc ctgaccaaaa atatactggg 781 tttcctgttt ccttttctga tcattcttac aagttatact cttatttgga aggccctaaa 841 gaaggcttat gaaattcaga agaacaaacc aagaaatgat gatattttta agataattat 901 ggcaattgtg cttttctttt tcttttcctg gattccccac caaatattca cttttctgga 961 tgtattgatt caactaggca tcatacgtga ctgtagaatt gcagatattg tggacacggc 1021 catgcctatc accatttgta tagcttattt taacaattgc ctgaatcctc ttttttatgg 1081 ctttctgggg aaaaaattta aaagatattt tctccagctt ctaaaatata ttcccccaaa 1141 agccaaatcc cactcaaacc tttcaacaaa aatgagcacg ctttcctacc gcccctcaga 1201 taatgtaagc tcatccacca agaagcctgc accatgtttt gaggttgagt gacatgttcg 1261 aaacctgtcc ataaagtaat tttgtgaaag aaggagcaag agaacattcc tctgcagcac 1321 ttcactacca aatgagcatt agctactttt cagaattgaa ggagaaaatg cattatgtgg 1381 actgaaccga cttttctaaa gctctgaaca aaagcttttc tttccttttg caacaagaca 1441 aagcaaagcc acattttgca ttagacagat ggacggctgc tcgaagaaca atgtcagaaa 1501 ctcgatgaat gtgttgattc gagaaatttt actgacagaa atgcaatctc cctagcctgc 1561 ttttgtcctg ttatt // LOCUS HUMANTN 1879 bp mRNA PRI 29-MAR-1991 DEFINITION Human nuclear autoantigen (SP-100) mRNA, complete cds. ACCESSION M60618 M34541 NID g178688 KEYWORDS autoantigen; nuclear antigen. SOURCE Human liver, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1879) AUTHORS Szotecki,C., Guldner,H.H., Netter,H.J. and Will,H. TITLE Molecular cloning of a cDNA encoding for SP-100 autoantigen sequences JOURNAL J. Immunol. 145, 4338-4347 (1990) MEDLINE 91079525 FEATURES Location/Qualifiers source 1..1879 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /tissue_lib="lambda-gt10 (Clontech)" gene 32..1868 /gene="Sp-100" CDS 32..1474 /gene="Sp-100" /codon_start=1 /product="nuclear autoantigen" /db_xref="PID:g178689" /translation="MAGGGGDLSTRRLNECISPVANEMNHLPAHSHDLQRMFTEDQGV DDRLLYDIVFKHFKRNKVEISNAIKKTFPFLEGLRDRDLITNKMFEDSQDSCRNLVPV QRVVYNVLSELEKTFNLPVLEALFSDVNMQEYPDLIHIYKGFENVIHDKLPLQESEEE EREERSGLQLSLEQGTGENSFRSLTWPPSGSPSHAGTTPPENGLSEHPCETEQINAKR KDTTSDKDDSLGSQQTNEQCAQKAEPTESCEQIAVQVNNGDAGREMPCPLPCDEESPE AELHNHGIQINSCSVRLVDIKKEKPFSNSKVECQAQARTHHNQASDIIVISSEDSEGS TDVDEPLEVFISAPRSEPVINNDNPLESNDEKEGQEATCSRPQIVPEPMDFRKLSTFR ESFKKRVIGQDHDFSESSEEEAPAEASSGALRSKHGEKAPMTSRSTSTWRIPSRKRRF SSSDFSDLSNGEELQETCSSSLRRGSGKED" polyA_signal 1701..1706 /gene="Sp-100" polyA_signal 1863..1868 /gene="Sp-100" polyA_site 1879 /gene="Sp-100" BASE COUNT 618 a 378 c 429 g 454 t ORIGIN 1 ctgaggccca cgcagggcct agggtgggaa gatggcaggt gggggcggcg acctgagcac 61 caggaggctg aatgaatgta tttcaccagt agcaaatgag atgaaccatc ttcctgcaca 121 cagccacgat ttgcaaagga tgttcacgga agaccagggt gtagatgaca ggctgctcta 181 tgacattgta ttcaagcact tcaaaagaaa taaggtggag atttcaaatg caataaaaaa 241 gacatttcca ttcctcgagg gcctccgtga tcgtgatctc atcacaaata aaatgtttga 301 agattctcaa gattcttgta gaaacctggt ccctgtacag agagtggtgt acaatgttct 361 tagtgaactg gagaagacat ttaacctgcc agttctggaa gcactgttca gcgatgtcaa 421 catgcaggaa taccccgatt taattcacat ttataaaggc tttgaaaatg taatccatga 481 caaattgcct ctccaagaaa gtgaagaaga agagagggag gagaggtctg gcctccaact 541 aagtcttgaa caaggaactg gtgaaaactc ttttcgaagc ctgacttggc caccttcggg 601 ttccccatct catgctggta caaccccacc tgaaaatgga ctctcagagc acccctgtga 661 aacagaacag ataaatgcaa agagaaaaga tacaaccagt gacaaagatg attcgctagg 721 aagccaacaa acaaatgaac aatgtgctca aaaggctgag ccaacagagt cctgcgaaca 781 aattgctgtc caagtgaata atggggatgc tggaagggag atgccctgcc cgttgccctg 841 tgatgaagaa agcccagagg cagagctaca caaccatgga atccaaatta attcctgttc 901 tgtgcgactg gtggatataa aaaaggaaaa gccattttct aattcaaaag ttgagtgcca 961 agcccaagca agaactcatc ataaccaggc atctgacata atagtcatca gcagtgagga 1021 ctctgaagga tccactgacg ttgatgagcc cttagaagtc ttcatctcag caccgagaag 1081 tgagcctgtg atcaataatg acaacccttt agaatcaaat gatgaaaagg agggccaaga 1141 agccacttgc tcacgacccc agattgtacc agagcccatg gatttcagaa aattatctac 1201 attcagagaa agttttaaga aaagagtgat aggacaagac cacgactttt cagaatccag 1261 tgaggaggag gcgcccgcag aagcctcaag cggggcactg agaagcaagc atggtgagaa 1321 ggctcctatg acttctagaa gtacatctac ttggagaata cccagcagga agagacgttt 1381 cagcagtagt gacttttcag acctgagtaa tggagaagag cttcaggaaa cctgcagctc 1441 atccctaaga agagggtcag gtaaagaaga ttaggatgcc aagacttggc ctgcagaatg 1501 tcaggaatgt gaattaaaag ctgctgtttc cagacgcttt ttattctgag caccttcact 1561 accttgtatc cagttcatct gggaactcct ttttgcattt tagaaaatgg aaagaggcag 1621 gaaattatga taaactcatg tttaacagaa agagtttcac tgactaaatg tatgtaatta 1681 tattttgttg ttgtagaaga aataaatagc aaatttgtgg tattcttttt tttaaacctg 1741 ctctcattcc tattaacact aagatcttag atttttatag tgataaatgg gttgacatca 1801 ttgtcgtttg taattgtaaa gcctcaaaag acaactgttc ctactatgta attatagaca 1861 gaaataaaaa cttcagatc // LOCUS HUMANX3 1268 bp mRNA PRI 07-NOV-1994 DEFINITION Human 1,2-cyclic-inositol-phosphate phosphodiesterase (ANX3) mRNA, complete cds. ACCESSION M63310 NID g178696 KEYWORDS 1,2-cyclic-inositol-phosphate phosphodiesterase; annexin III; placental anticoagulant protein III. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1268) AUTHORS Tait,J.F., Frankenberry,D.A., Miao,C.H., Killary,A.M., Adler,D.A. and Disteche,C.M. TITLE Chromosomal localization of the human annexin III (ANX3) gene JOURNAL Genomics 10 (2), 441-448 (1991) MEDLINE 91301701 REFERENCE 2 (sites) AUTHORS Tait,J.F., Smith,C., Xu,L. and Cookson,B.T. TITLE Structure and polymorphisms of the human annexin III (ANX3) gene JOURNAL Genomics 18 (1), 79-86 (1993) MEDLINE 94102764 FEATURES Location/Qualifiers source 1..1268 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /map="4q13-q22" gene 37..1008 /gene="ANX3" CDS 37..1008 /gene="ANX3" /codon_start=1 /db_xref="GDB:G00-125-900" /product="1,2-cyclic-inositol-phosphate phosphodiesterase" /db_xref="PID:g178697" /translation="MASIWVGHRGTVRDYPDFSPSVDAEAIQKAIRGIGTDEKMLISI LTERSNAQRQLIVKEYQAAYGKELKDDLKGDLSGHFEHLMVALVTPPAVFDAKQLKKS MKGAGTNEDALIEILTTRTSRQMKDISQAYYTVYKKSLGDDISSETSGDFRKALLTLA DGRRDESLKVDEHLAKQDAQILYKAGENRWGTDEDKFTEILCLRSFPQLKLTFDEYRN ISQKDIVDSIKGELSGHFEDLLLAIVNCVRNTPAFLAERLHRALKGIGTDEFTLNRIM VSRSEIDLLDIRTEFKKHYGYSLYSAIKSDTSGDYEITLLKICGGDD" BASE COUNT 422 a 237 c 281 g 328 t ORIGIN Chromosome: 4q21. 1 tagtgtgatc tcagctcaag gcaaaggtgg gatatcatgg catctatctg ggttggacac 61 cgaggaacag taagagatta tccagacttt agcccatcag tggatgctga agctattcag 121 aaagcaatca gaggaattgg aactgatgag aaaatgctca tcagcattct gactgagagg 181 tcaaatgcac agcggcagct gattgttaag gaatatcaag cagcatatgg aaaggagctg 241 aaagatgact tgaagggtga tctctctggc cactttgagc atctcatggt ggccctagtg 301 actccaccag cagtctttga tgcaaagcag ctaaagaaat ccatgaaggg cgcgggaaca 361 aacgaagatg ccttgattga aatcttaact accaggacaa gcaggcaaat gaaggatatc 421 tctcaagcct attatacagt atacaagaag agtcttggag atgacattag ttccgaaaca 481 tctggtgact tccggaaagc tctgttgact ttggcagatg gcagaagaga tgaaagtctg 541 aaagtggatg agcatctggc caaacaagat gcccagattc tctataaagc tggtgagaac 601 agatggggca cggatgaaga caaattcact gagatcctgt gtttaaggag ctttcctcaa 661 ttaaaactaa catttgatga atacagaaat atcagccaaa aggacattgt ggacagcata 721 aaaggagaat tatctgggca ttttgaagac ttactgttgg ccatagttaa ttgtgtgagg 781 aacacgccgg cctttttagc cgaaagactg catcgagcct tgaagggtat tggaactgat 841 gagtttactc tgaaccgaat aatggtgtcc agatcagaaa ttgacctttt ggacattcga 901 acagagttca agaagcatta tggctattcc ctatattcag caattaaatc ggatacttct 961 ggagactatg aaatcacact cttaaaaatc tgtggtggag atgactgaac caagaagata 1021 atctccaaag gtccacgatg ggcttttcca acagctccac cttacttctt ctcatactat 1081 ttaagagaac aagcaaatat aaacagcaac ttgtgttcct aacaggaatt ttcattgttc 1141 tataacaaca acaacaaaag cgattattat tttagagcat ctcatttata atgtagcagc 1201 tcataaatga aattgaaaat ggtattaaag atctgcaact actatccaac ttatatttct 1261 gctttcaa // LOCUS HUMANX4A 1976 bp mRNA PRI 31-OCT-1994 DEFINITION Human annexin IV (ANX4) mRNA, complete cds. ACCESSION M82809 NID g178698 KEYWORDS annexin IV; chromobindin 4; placental anticoagulant protein II. SOURCE Homo sapiens placenta cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1976) AUTHORS Tait,J.F., Smith,C., Frankenberry,D.A., Miao,C.H., Adler,D.A. and Disteche,C.M. TITLE Chromosomal mapping of the human annexin IV (ANX4) gene JOURNAL Genomics 12 (2), 313-318 (1992) MEDLINE 92155721 FEATURES Location/Qualifiers source 1..1976 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /map="Unassigned" gene 74..1039 /gene="ANX4" CDS 74..1039 /gene="ANX4" /standard_name="annexin IV" /codon_start=1 /function="calcium-dependent phospholipid binding protein" /db_xref="GDB:G00-131-395" /evidence=experimental /product="annexin IV (placental anticoagulant protein II)" /db_xref="PID:g178699" /translation="MAMATKGGTVKAASGFNAMEDAQTLRKAMKGLGTDEDAIISVLA YRNTAQRQEIRTAYKSTIGRDLIDDLKSELSGNFEQVIVGMMTPTVLYDVQELRRAMK GAGTDEGCLIEILASRTPEEIRRISQTYQQQYGRSLEDDIRSDTSFMFQRVLVSLSAG GRDEGNYLDDALVRQDAQDLYEAGEKKWGTDEVKFLTVLCSRNRNHLLHVFDEYKRIS QKDIEQSIKSETSGSFEDALLAIVKCMRNKSAYFAEKLYKSMKGLGTDDNTLIRVMVS RAEIDMLDIRAHFKRLYGKSLYSFIKGDTSGDYRKVLLVLCGGDD" BASE COUNT 597 a 379 c 420 g 580 t ORIGIN 1 gcagaggagg agcgcacgcc ggcctcgaag aacttctgct tgggtggctg aactctgatc 61 ttgacctaga gtcatggcca tggcaaccaa aggaggtact gtcaaagctg cttcaggatt 121 caatgccatg gaagatgccc agaccctgag gaaggccatg aaagggctcg gcaccgatga 181 agacgccatt attagcgtcc ttgcctaccg caacaccgcc cagcgccagg agatcaggac 241 agcctacaag agcaccatcg gcagggactt gatagacgac ctgaagtcag aactgagtgg 301 caacttcgag caggtgattg tggggatgat gacgcccacg gtgctgtatg acgtgcaaga 361 gctgcgaagg gccatgaagg gagccggcac tgatgagggc tgcctaattg agatcctggc 421 ctcccggacc cctgaggaga tccggcgcat aagccaaacc taccagcagc aatatggacg 481 gagccttgaa gatgacattc gctctgacac atcgttcatg ttccagcgag tgctggtgtc 541 tctgtcagct ggtgggaggg atgaaggaaa ttatctggac gatgctctcg tgagacagga 601 tgcccaggac ctgtatgagg ctggagagaa gaaatggggg acagatgagg tgaaatttct 661 aactgttctc tgttcccgga accgaaatca cctgttgcat gtgtttgatg aatacaaaag 721 gatatcacag aaggatattg aacagagtat taaatctgaa acatctggta gctttgaaga 781 tgctctgctg gctatagtaa agtgcatgag gaacaaatct gcatattttg ctgaaaagct 841 ctataaatcg atgaagggct tgggcaccga tgataacacc ctcatcagag tgatggtttc 901 tcgagcagaa attgacatgt tggatatccg ggcacacttc aagagactct atggaaagtc 961 tctgtactcg ttcatcaagg gtgacacatc tggagactac aggaaagtac tgcttgttct 1021 ctgtggagga gatgattaaa ataaaaatcc cagaaggaca ggaggattct caacactttg 1081 aattttttta acttcatttt tctacactgc tattatcatt atctcagaat gcttatttcc 1141 aattaaaacg cctacagctg cctcctagaa tatagactgt ctgtattatt attcacctat 1201 aattagtcat tatgatgctt taaagctgta cttgcatttc aaagcttata agatataaat 1261 ggagatttta aagtagaaat aaatatgtat tccatgtttt taaaagatta ctttctactt 1321 tgtgtttcac agacattgaa tatattaaat tattccatat tttcttttca gtgaaaaatt 1381 ttttaaatgg aagactgttc taaaatcact tttttcccta atccaatttt tagagtggct 1441 agtagtttct tcatttgaaa ttgtaagcat ccggtcagta agaatgccca tccagttttc 1501 tatatttcat agtcaaagcc ttgaaagcat ctacaaatct ctttttttag gttttgtcca 1561 tagcatcagt tgatccttac taagtttttc atgggagact tccttcatca catcttatgt 1621 tgaaatcact ttctgtagtc aaagtatacc aaaaccaatt tatctgaact aaattctaaa 1681 gtatggttat acaaaccata tacatctggt taccaaacat aaatgctgaa cattccatat 1741 tattatagtt aatgtcttaa tccagcttgc aagtgaatgg aaaaaaaaat aagcttcaaa 1801 ctaggtattc tgggaatgat gtaatgctct gaatttagta tgatataaag aaaacttttt 1861 tgtgctaaaa atacttttta aaatcaattt tgttgattgt agtaatttct atttgcactg 1921 tgcctttcaa ctccagaaac attctaagat gtacttggat ttaattaaaa agttca // LOCUS HUMAOP1 1542 bp mRNA PRI 29-MAY-1996 DEFINITION Human mRNA for Apo1_Human (MER5(Aop1-Mouse)-like protein), complete cds. ACCESSION D49396 NID g682747 KEYWORDS Aop1_Human; MER5-like protein; Aop1_Mouse-like protein; antioxidant protein. SOURCE Homo sapiens blood erythroblast cell_line:YN-1-0-A cDNA to mRNA, clone:Aop1_Human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1542) AUTHORS Tsuji,K., Copeland,N.G., Jenkins,N.A. and Obinata,M. TITLE Mammalian antioxidant protein complements alkylhydroperoxide reductase (ahpC) mutation in E.coli JOURNAL Biochem. J. (1995) In press REFERENCE 2 (bases 1 to 1542) AUTHORS Obinata,M. TITLE Direct Submission JOURNAL Submitted (17-FEB-1995) to the DDBJ/EMBL/GenBank databases. Masuo Obinata, Institute of Development,Aging and Cancer,Tohoku University, Department of Cell Biology; 4-1 Seiryoumachi Aoba-ku, Sendai, Miyagi 980, Japan (E-mail:d21953@cctu.cc.tohoku.ac.jp, Tel:022-273-9495, Fax:022-272-5081) FEATURES Location/Qualifiers source 1..1542 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="YN-1-0-A" /cell_type="erythroblast" /chromosome="10" /clone="Aop1_Human" /map="10q25-26" /tissue_type="blood" CDS 7..777 /standard_name="Aop1_Human" /codon_start=1 /product="Aop1_Human, MER5(Aop1_Mouse)-like protein" /db_xref="PID:d1008985" /db_xref="PID:g682748" /translation="MAAAVGRLLRASVARHVSAIPWGISATAALRPAACGRTSLTNLL CSGSSQAKLFSTSSSCHAPAVTQHAPYFKGTAVVNGEFKDLSLDDFKGKYLVLFFYPL DFTFVCPTEIVAFSDKANEFHDVNCEVVAVSVDSHFSHLAWINTPRKNGGLGHMNIAL LSDLTKQISRDYGVLLEGSGLALRGLFIIDPNGVIKHLSVNDLPVGRSVEETLRLVKA FQYVETHGEVCPANWTPDSPTIKPSPAASKEYFQKVNQ" BASE COUNT 436 a 303 c 309 g 494 t ORIGIN Chromosome 10q25-26. 1 ctgaagatgg cggctgctgt aggacggttg ctccgagcgt cggttgcccg acatgtgagt 61 gccattcctt ggggcatttc tgccactgca gccctcaggc ctgctgcatg tggaagaacg 121 agcttgacaa atttattgtg ttctggttcc agtcaagcaa aattattcag caccagttcc 181 tcatgccatg cacctgctgt cacccagcat gcaccctatt ttaagggtac agccgttgtc 241 aatggagagt tcaaagacct aagccttgat gactttaagg ggaaatattt ggtgcttttc 301 ttctatcctt tggatttcac ctttgtgtgt cctacagaaa ttgttgcttt tagtgacaaa 361 gctaacgaat ttcacgatgt gaactgtgaa gttgtcgcag tctcagtgga ttcccacttt 421 agccatcttg cctggataaa tacaccaaga aagaatggtg gtttgggcca catgaacatc 481 gcactcttgt cagacttaac taagcagatt tcccgagact acggtgtgct gttagaaggt 541 tctggtcttg cactaagagg tctcttcata attgacccca atggagtcat caagcatttg 601 agcgtcaacg atctcccagt gggccgaagc gtggaagaaa ccctccgctt ggtgaaggcg 661 ttccagtatg tagaaacaca tggagaagtc tgcccagcga actggacacc ggattctcct 721 acgatcaagc caagtccagc tgcttccaaa gagtactttc agaaggtaaa tcagtagatc 781 acccatgtgt atctgcacct tctcaactga gagaagaacc acagttgaaa cctgctttta 841 tcattttcaa gatggttatt tgtagaaggc aaggaaccaa ttatgcttgt attcataagt 901 attactctaa atgttttgtt tttgtaattc tggctaggac cttttaaaca tggttagttg 961 ctagtacagg aatcgtttat tggtaacatc ttggtggctg gctagctagt ttctacagaa 1021 cataatttgc ctctatagaa ggctattctt agatcatgtc tcaatggaaa cactcttctt 1081 tcttagcctt acttgaatct tgcctataat aaagtagagc aacacacatt gaaagcttct 1141 gatcaacggt cctgaaattt tcatcttgaa tgtctttgta ttaaactgaa ttttctttta 1201 agctaacaaa gatcataatt ttcaatgatt agccgtgtaa ctcctgcaat gaatgtttat 1261 gtgattgaag caaatgtgaa tcgtattatt ttaaaaagtg gcagagtgac ttaactgatc 1321 atgcatgatc cctcatccct gaaattgagt ttatgtagtc attttactta ttttattcat 1381 tagctaactt tgtctatgta tatttctaga tattgattag tgtaatcgat tataaaggat 1441 atttatcaaa tccagggatt gcattttgaa attataatta ttttctttgc tgaagtattc 1501 attgtaaaac atacaaataa catatttaaa caaaaaaaaa aa // LOCUS HUMAOX 4957 bp mRNA PRI 10-APR-1996 DEFINITION Human aldehyde oxidase (hAOX) mRNA, complete cds. ACCESSION L11005 NID g438655 KEYWORDS aldehyde oxidase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4957) AUTHORS Wright,R.M., Vaitaitis,G.M., Wilson,C.M., Repine,T.B., Terada,L.S. and Repine,J.E. TITLE cDNA cloning, characterization, and tissue-specific expression of human xanthine dehydrogenase/xanthine oxidase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90 (22), 10690-10694 (1993) MEDLINE 94068467 REFERENCE 2 (bases 1 to 4957) AUTHORS Wright,R.M. TITLE Direct Submission JOURNAL Submitted (16-FEB-1993) Richard M. Wright, Webb-Waring Lung Institute, University of Colorado Health Sciences Center, Denver, CO 80262, USA COMMENT Reference [1] reports nucleotides 1 through 3713. FEATURES Location/Qualifiers source 1..4957 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /clone_lib="Lambda gt11" gene 131..4147 /gene="hAOX" CDS 131..4147 /gene="hAOX" /codon_start=1 /product="aldehyde oxidase" /db_xref="PID:g438656" /translation="MDRASELLFYVNGRKVIEKNVDPETMLLPYLRKKLRLTGTPYGC GGGGCGACTVMISRYNPITKRIRHHPANACLIPICSLYGAAVTTVEGIGSTHTRIHPV QERIAKCHGTQCGFCTPGMVMSIYPLLRNHPEPTLDQLTDALGGNLCRCHGYRPIIDA CKTFCKTSGCCQSKENGVCCLDQGINGLPEFEEGSKTSPKLFAEEEFLPLDPTQELIF PPELMIMADKQSQRTRVFGSERMMWFSPVTLKDLLEFKFKYPQAPVIMGNTSVGPEVK FKGVFHPGYNSPDRIEEPECCKPCIYGLTLGAGLSLAQVKDILADVVQKLPEEKTQMY HALLKHLGTLAGSQIRNMASLGGHIISRHPDSDLNPILAVGNCTLNLLSKEGKRQIPL NEQFLSKCPNADLKPQEILVSVNIPISRKWEFVSAFRQAQRQENALAIVNSGMRVFFG EGDGIIRELCISYGGVGPATICAKNSCQKLIGRHWNEQMLDIACRLILNEVSLLGSAP GGKVEFKRTLIISFLFKFYLEVSQILKKMDPVHYPSLADKYESALEDLHSKHHCSTLK YQNIGPKQHPEDPIGHPIMHLSGVKHATGEAIYCDDMPLVDQELFLTFVTSSRAHAKI VSIDLSEALSMPGVVDIMTAEHLSDVNSFCFFTEAEKFLATDKVFCVGQLVCAVLADS EVQAKRAAKRVKIVYQDLEPLILTIEESIQHNSSFKPERKLEYGNVDEAFKVVDQILE GEIHMGGQEHFYMETQSMLVVPKGEDQEMDVYVSTQFPKYIQDIVASTLKLPANKVMC HVRRVGGAFGGKVLKTGIIAAVTAFAANKHGRAVRCVLERGEDMLITGGRHPYLGKYK AGFMNDGRILALDMEHYSNAGASLDESLFVIEMGLLKMDNAYKFPNLRCRGWACRTNL PSNTAFRGFGFPQAVLITESCITEVAAKCGLSPEKVRIINMYKEIDQTPYKQEINAKN LIQCWRECMAMSSYSLRKVAVEKFNAENYWKKKGLAMVPLKFPVGLASRAAGQAAALV HIYLDGSVLVTHGGIEMGQGVHTKMIQVVSRELRMPMSNVHLRGTSTETVPNANISGG SVVADLNGLAVKDACQTLLKRLEPIISKNPKGTWKDWAQTAFDESINLSAVGYFRGYE SDMNWEKGEGQPFEYFVYGAACSEVEIDCLTGDHKNIRTDIVMDVGCSINPAIDIGQI EGAFIQGMGLYTIEELNYSPQGILHTRGPDQYKIPAICDMPTELHIALLPPSQNSNTL YSSKGLGESGVFLGCSVFFAIHDAVSAARQERGLHGPLTLNSPLTPEKIRMACEDKFT KMIPRDEPGSYVPWNVPI" polyA_site 4957 BASE COUNT 1400 a 1091 c 1167 g 1299 t ORIGIN 1 gaccgaacac gttctttagg ctccagcaaa gcgccccact cggcgggtcg gtgccgccgg 61 gtcccaggtg cccgctactt cccagaactc gcctcccgct ccgggccctc gaaccagcgc 121 ggacaccaca atggaccggg cgtccgagct gctcttctac gtgaacggcc gcaaggtgat 181 agaaaaaaat gtcgatcctg aaacaatgct gttgccttat ttgaggaaga agcttcgact 241 cacaggaact ccgtatggct gtggaggagg aggctgtggt gcttgtacag tgatgatatc 301 acgatacaac cccatcacca agaggataag gcatcaccca gccaatgcct gtctgattcc 361 catctgttct ctgtatggtg ctgccgtcac cacagtagaa ggcataggaa gcacccacac 421 cagaattcat cctgttcagg agaggattgc caagtgtcat ggcacccagt gtggcttctg 481 cacacctggg atggtgatgt ccatctaccc cctgctcagg aaccacccag agcccactct 541 ggatcagtta actgatgccc ttggtggtaa cctgtgccgt tgccatggat acaggcccat 601 aattgatgca tgcaagactt tctgtaaaac ttcgggctgc tgtcaaagta aagaaaatgg 661 ggtttgctgt ttggatcaag gaatcaatgg attgccagaa tttgaggaag gaagtaagac 721 aagtccaaaa ctcttcgcag aagaggagtt tctgccattg gatccaaccc aggaactgat 781 atttcctcct gagctaatga taatggctga taaacagtcg caaaggacca gggtgtttgg 841 cagtgagaga atgatgtggt tttcccccgt gaccctgaag gacctgctgg aatttaaatt 901 caagtatccc caggctcctg ttatcatggg aaacacctct gtggggcctg aagtgaaatt 961 taaaggcgtc tttcacccag gttataattc tcctgataga attgaagaac ctgagtgttg 1021 taaaccatgc atatatggac tcacccttgg tgctggtctc agcctagccc aggtgaagga 1081 cattttggct gatgtagtcc agaagcttcc agaggagaag acacagatgt accatgctct 1141 cctgaagcat ttgggaactc tggctgggtc ccagatcagg aacatggctt ctttaggggg 1201 acacatcatt agcaggcatc cagattcaga tctgaatccc atcctggctg tgggtaactg 1261 taccctcaac ttgctatcaa aagaaggaaa acgacagatt cctttaaatg agcaattcct 1321 cagcaagtgc cctaatgcag atcttaagcc tcaagaaatc ttggtctcag tgaacatccc 1381 catctcaagg aagtgggaat ttgtgtcagc cttccgacaa gcccagcgac aggagaatgc 1441 gctagcgata gtcaattcag gaatgagagt cttttttgga gaaggggatg gcattattag 1501 agagttatgc atctcatatg gaggcgttgg tccagccacc atctgtgcca agaattcctg 1561 ccagaaactc attggaaggc actggaacga acagatgctg gatatagcct gcaggcttat 1621 tctgaatgaa gtctcccttt tgggctcggc gccaggtggg aaagtggagt tcaagaggac 1681 tctcatcatc agcttcctct tcaagttcta cctggaagtg tcacagattt tgaaaaagat 1741 ggatccagtt cactatccta gccttgcaga caagtatgaa agtgctttag aagatcttca 1801 ttccaaacat cactgcagta cattaaagta ccagaatata ggcccaaagc agcatcctga 1861 agacccaatt ggccacccca tcatgcatct gtctggtgtg aagcatgcca cgggggaggc 1921 catctactgt gatgacatgc ctctggtgga ccaggaactt ttcttgactt ttgtgactag 1981 ttcaagagct catgctaaga ttgtgtctat tgatctgtca gaagctctca gcatgcccgg 2041 tgtggtggac atcatgacag cagaacatct tagtgacgtc aactccttct gcttttttac 2101 tgaagctgag aaatttctgg cgacagataa ggtgttctgt gtgggtcagc ttgtctgtgc 2161 tgtgcttgcc gattctgagg ttcaggcaaa gcgagctgct aagcgagtga agattgtcta 2221 tcaagacttg gagccgctga tactaacaat tgaggaaagt atacaacaca actcctcctt 2281 caagccagaa aggaaactgg aatatggaaa tgttgacgaa gcatttaaag tggttgatca 2341 aattcttgaa ggtgaaatac atatgggagg tcaagaacat ttttatatgg aaacccaaag 2401 catgcttgtc gttcccaagg gagaggatca agaaatggat gtctacgtgt ccacacagtt 2461 tcccaaatat atacaggaca ttgttgcctc aaccttgaag ctcccagcta acaaggtcat 2521 gtgccatgta aggcgtgttg gtggagcgtt tggagggaag gtgttaaaaa ccggaatcat 2581 tgcagccgtc actgcatttg ccgcaaacaa acatggccgt gcagttcgct gtgttctgga 2641 acgaggagaa gacatgttaa taactggagg ccgccatcct taccttggaa agtacaaagc 2701 tggattcatg aacgatggca gaatcttggc cctggacatg gagcattaca gcaatgcagg 2761 cgcctccttg gatgaatcat tattcgtgat agaaatggga cttctgaaaa tggacaatgc 2821 ttacaagttt cccaatctcc gctgccgggg ttgggcatgc agaaccaacc ttccatccaa 2881 cacagctttt cgtgggtttg gctttcctca ggcagtgctg atcaccgaat cttgtatcac 2941 ggaagttgca gccaaatgtg gactatcccc tgagaaggtg cgaatcataa acatgtacaa 3001 ggaaattgat caaacaccct acaaacaaga gatcaatgcc aagaacctaa tccagtgttg 3061 gagagaatgt atggccatgt cttcctactc cttgaggaaa gttgctgtgg aaaagttcaa 3121 tgcagagaat tattggaaga agaaaggact ggccatggtc cccctgaagt ttcctgttgg 3181 ccttgcgtca cgtgctgctg gtcaggctgc tgccttggtt cacatttatc ttgatggctc 3241 tgtgctggtc actcacggtg gaattgaaat ggggcagggg gtccacacta aaatgattca 3301 ggtggtcagc cgtgaattaa gaatgccaat gtcgaatgtc cacctgcgtg gaacaagcac 3361 agaaactgtc cctaatgcaa atatctctgg aggttctgtg gtggcagatc tcaacggttt 3421 ggcagtaaag gatgcctgtc aaactcttct aaaacgcctc gaacccatca tcagcaagaa 3481 tcctaaagga acttggaaag actgggcaca gactgctttt gatgaaagca ttaacctttc 3541 agctgttgga tacttcagag gttatgagtc agacatgaac tgggagaaag gcgaaggcca 3601 gcccttcgaa tactttgttt atggagctgc ctgttccgag gttgaaatag actgcctgac 3661 gggggatcat aagaacatca gaacagacat tgtcatggat gttggctgca gtataaatcc 3721 agccattgac ataggccaga ttgaaggtgc atttattcaa ggcatgggac tttatacaat 3781 agaggaactg aattattctc cccagggcat tctgcacact cgtggtccag accaatataa 3841 aatccctgcc atctgtgaca tgcccacgga gttgcacatt gctttgttgc ctccttctca 3901 aaactcaaat actctttatt catctaaggg tctgggagag tcgggggtgt tcctggggtg 3961 ttccgtgttt ttcgctatcc atgacgcagt gagtgcagca cgacaggaga gaggcctgca 4021 tggacccttg acccttaata gtccactgac cccggagaag attaggatgg cctgtgaaga 4081 caagttcaca aaaatgattc cgagagatga acctggatcc tacgttcctt ggaatgtacc 4141 catctgaatc aaatgcaaac ttctggagaa aacagagtgc ctcttcccag atggcaatct 4201 gtcctatctc tgtgctggaa gatgctagat ctgaaagaca gagtttccac agttcagaaa 4261 tcatcccaca gtgttgcttt tctctggagc tgatttaaag tattccattt agatttgata 4321 gatatgctta agcaatctat aaatcatttt caatgttata aacactaatt ggtttcctct 4381 agggtgatat tcgtcattac tctgtctctt caatccatcc agctaaatgg aataggtgat 4441 gacttggatg tgactcctac ttggcttcta tcaccaacag aaattatacc atatagtgaa 4501 aggcaatttt ctaaataatt tcattactaa tatgaactgt gaagttgtca ttttttcatt 4561 tgtccttttc tgctatcacc ttcctcttgt cagaatgaat atagacacat gtatctaagt 4621 gggaccaaag aaaaaatagc gaactttcac caaagttttc atgaaaaccc aaaagcttta 4681 aaagttacta tcaagaaatt gaaaggaaac ccacagaata ggataaaata tttgtaaatc 4741 atatttgata aaagtcttgt aaccagatac ataaagagct cttacaactc aataaaaggc 4801 aagtaattta aaaataggca aaagaattgc tggatggtat ggtagttcta tttttagttt 4861 ttaccctaac tactctgact tgatcattta acattctgtg tatgtaacaa aatatcacat 4921 gcataaatat tatgtgtcaa taaaattttt taatggg // LOCUS HUMAP2 1636 bp mRNA PRI 03-JUN-1991 DEFINITION Human sequence-specific DNA-binding protein (AP-2) mRNA, complete cds. ACCESSION M36711 Y00229 NID g178702 KEYWORDS DNA-binding protein. SOURCE Human HeLa cell, cDNA to mRNA, clones AP2-[9,22,4A]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1636) AUTHORS Williams,T., Admon,A., Luescher,B. and Tjian,R. TITLE Cloning and expression of AP-2, a cell-type-specific transcription factor that activates inducible enhancer elements JOURNAL Genes Dev. 2, 1557-1569 (1988) MEDLINE 89107991 FEATURES Location/Qualifiers source 1..1636 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="HeLa cell" CDS 63..1376 /note="DNA-binding protein (AP-2)" /codon_start=1 /db_xref="PID:g178703" /translation="MLWKLTDNIKYEDCEDRHDGTSNGTARLPQLGTVGQSPYTSAPP LSHTPNADFQPPYFPPPYQPIYPQSQDPYSHVNDPYSLNPLHAQPQPQHPGWPGQRQS QESGLLHTHRGLPHQLSGLDPRRDYRRHEDLLHGPHALSSGLGDLSIHSLPHAIEEVP HVEDPGINIPDQTVIKKGPVSLSKSNSNAVSAIPINKDNLFGGVVNPNEVFCSVPGRL SLLSSTSKYKVTVAEVQRRLSPPECLNASLLGGVLRRAKSKNGGRSLREKLDKIGLNL PAGRRKAANVTLLTSLVEGEAVHLARDFGYVCETEFPAKAVAEFLNRQHSDPNEQVTR KNMLLATKQICKEFTDLLAQDRSPLGNSRPNPILEPGIQSCLTHFNLISHGFGSPAVC AAVTALQNYLTEALKAMDKMYLSNNPNSHTDNNAKSSDKEEKHRK" BASE COUNT 369 a 594 c 402 g 271 t ORIGIN 1 gaattccggc tctctgggtg agagaccgag aggggcatat ccgttcacgc cgatccatga 61 aaatgctttg gaaattgacg gataatatca agtacgagga ctgcgaggac cgtcacgacg 121 gcaccagcaa cgggacggca cggttgcccc agctgggcac tgtaggtcaa tctccctaca 181 cgagcgcccc gccgctgtcc cacaccccca atgccgactt ccagccccca tacttccccc 241 caccctacca gcctatctac ccccagtcgc aagatcctta ctcccacgtc aacgacccct 301 acagcctgaa ccccctgcac gcccagccgc agccgcagca cccaggctgg cccggccaga 361 ggcagagcca ggagtctggg ctcctgcaca cgcaccgggg gctgcctcac cagctgtcgg 421 gcctggatcc tcgcagggac tacaggcggc acgaggacct cctgcacggc ccacacgcgc 481 tcagctcagg actcggagac ctctcgatcc actccttacc tcacgccatc gaggaggtcc 541 cgcatgtaga agacccgggt attaacatcc cagatcaaac tgtaattaag aaaggccccg 601 tgtccctgtc caagtccaac agcaatgccg tctccgccat ccctattaac aaggacaacc 661 tcttcggcgg cgtggtgaac cccaacgaag tcttctgttc agttccgggt cgcctctcgc 721 tcctcagctc cacctcgaag tacaaggtca cggtggcgga agtgcagcgg cggctctcac 781 cacccgagtg tctcaacgcg tcgctgctgg gcggagtgct ccggagggcg aagtctaaaa 841 atggaggaag atctttaaga gaaaaactgg acaaaatagg attaaatctg cctgcaggga 901 gacgtaaagc tgccaacgtt accctgctca catcactagt agagggagaa gctgtccacc 961 tagccaggga ctttgggtac gtgtgcgaaa ccgaatttcc tgccaaagca gtagctgaat 1021 ttctcaaccg acaacattcc gatcccaatg agcaagtgac aagaaaaaac atgctcctgg 1081 ctacaaaaca gatatgcaaa gagttcaccg acctgctggc tcaggaccga tctcccctgg 1141 ggaactcacg gcccaacccc atcctggagc ccggcatcca gagctgcttg acccacttca 1201 acctcatctc ccacggcttc ggcagccccg cggtgtgtgc cgcggtcacg gccctgcaga 1261 actatctcac cgaggccctc aaggccatgg acaaaatgta cctcagcaac aaccccaaca 1321 gccacacgga caacaacgcc aaaagcagtg acaaagagga gaagcacaga aagtgaggct 1381 ctcctcccgc cccgcccctc ccacgcctca ccagcccccc gcgcgcccac cctccggcgg 1441 gtgacagctc cgggatcagc aacccttcct gctgctgctc ctgctgctga tgctgccgcc 1501 gccgccgccg ccgctgccct tgggtccccc cgagtctccg ggactgccct ctcgactgtc 1561 agtggggcag cctctccgac tctgcacccg cctcgacctc cccacccgct cccacacccc 1621 tgtgcccccg gaattc // LOCUS HUMAPNH1A 3989 bp mRNA PRI 06-MAR-1995 DEFINITION Human Na/H antiporter (APNH1) mRNA, complete cds. ACCESSION M81768 J03163 NID g178752 KEYWORDS Na+/H+ antiporter; sodium/proton antiporter. SOURCE Homo sapiens kidney cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3989) AUTHORS Sardet,C., Franchi,A. and Pouyssegur,J. TITLE Molecular cloning, primary structure, and expression of the human growth factor-activatable Na+/H+ antiporter JOURNAL Cell 56 (2), 271-280 (1989) MEDLINE 89106219 REFERENCE 2 (bases 1 to 3989) AUTHORS Sardet,C., Counillon,L., Franchi,A. and Pouyssegur,J. TITLE Growth factors induce phosphorylation of the Na+/H+ antiporter, glycoprotein of 110 kD JOURNAL Science 247 (4943), 723-726 (1990) MEDLINE 90140739 REFERENCE 3 (bases 1 to 3989) AUTHORS Takaichi,K., Wang,D., Balkovetz,D.F. and Warnock,D.G. TITLE Cloning, sequencing and expression of Na+/H+ antiporter cDNAs from human tissues JOURNAL Am. J. Physiol. 262, 1069-1076 (1992) FEATURES Location/Qualifiers source 1..3989 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" /map="1p36.1-p35" gene complement(54..3989) /gene="APNH" CDS 54..2501 /gene="APNH" /note="putative" /citation=[3] /citation=[1] /codon_start=1 /function="electrically-neutral Na/H exchange" /db_xref="GDB:G00-119-683" /product="Na/H antiporter" /db_xref="PID:g178753" /translation="MVLRSGICGLSPHRIFPSLLVVVALVGLLPVLRSHGLQLSPTAS TIRSSEPPRERSIGDVTTAPPEVTPESRPVNHSVTDHGMKPRKAFPVLGIDYTHVRTP FEISLWILLACLMKIGFHVIPTISSIVPESCLLIVVGLLVGGLIKGVGETPPFLQSDV FFLFLLPPIILDAGYFLPLRQFTENLGTILIFAVVGTLWNASFLGGLMYAVCLVGGEQ INNIGLLDNLLFGSIISAVDPVAVLAVFEEIHINELLHILVFGESLLNDAVTVVLYHL FEEFANYEHCGIVDIFLGFLSFFVVALGGVLVGVVYGVIAAFTSRFTSHIRVIEPLFV FLYSYMAYLSAELFHLSGIMALIASGVVMRPYVEANISHKSHTTIKYFLKMWSSVSET LIFIFLGVSTVAGSHHWNWTFVISTLLFCLIARVLGVLGLTWFINKFRIVKLTPKDQF IIAYGGLRGAIAFSLGYLLDKKHFPMCDLFLTAIITVIFFTVFVQGMTIRPLVDLLAV KKKQETKRSINEEIHTQFLDHLLTGIEDICGHYGHHHWKDKLNRFNKKYVKKCLIAGE RSKEPQLIAFYHKMEMKQAIELVESGGMGKIPSAVSTVSMQNIHPKSLPSERILPALS KDNEEEIRKILRNNLQKTRQRLRSYNRHTLVADPYEEAWNQMLLRRQKARQLEQKINN YLTVPAHKLDSPTMSRARIGSDPLAYEPKEDLPVITIDPASPQSPESVDLVNEELKGK VLGLSRDPAKVAEEDEDDDGGIMMRSKETSSPGTDDVFTPAPSDSPSSQRIQRCLSDP GPHPEPGEGEPSFPKGQ" conflict complement(2495) /gene="APNH" /note="confirms corrected sequence reported in reference #3" /citation=[3] /replace="" 3'UTR complement(2499..3989) /partial /gene="APNH" /note="G00-119-683" /citation=[3] /citation=[1] /evidence=experimental polyA_signal complement(3969..3975) /gene="APNH" /note="G00-119-683" /citation=[3] /evidence=experimental polyA_site complement(3989) /gene="APNH" /note="G00-119-683" /citation=[3] /evidence=experimental BASE COUNT 756 a 1318 c 1043 g 872 t ORIGIN 1 cccggtttta ctctcatttg ggtaaggtcg aggctggctc tggaagcagc accatggttc 61 tgcggtctgg catctgtggc ctctctccac atcggatctt cccttcctta ctcgtggtgg 121 ttgctttggt ggggctgctg cctgttctca ggagccatgg cctccagctc agcccaactg 181 ccagcaccat tcgaagctca gagccaccac gagaacgctc gattggggat gtcaccaccg 241 ctccaccgga ggtcacccca gagagccgcc ctgttaatca ttccgtcact gatcatggca 301 tgaagccgcg caaggccttt ccagtcctgg gcatcgacta cacacacgtg cgcaccccct 361 tcgagatctc cctctggatc cttctggcct gcctcatgaa gataggtttc catgtgatcc 421 ccactatctc aagcatcgtc ccggagagct gcctgctgat cgtggtgggg ctgctggtgg 481 ggggcctgat caagggtgta ggcgagacac cccccttcct gcagtccgac gtcttcttcc 541 tcttcctgct gccgcccatc atcctggatg cgggctactt cctgccactg cggcagttca 601 cagaaaacct gggcaccatc ctgatctttg ccgtggtggg cacgctgtgg aacgcctcct 661 tcctgggcgg cctcatgtac gccgtgtgcc tggtgggcgg tgagcagatc aacaacatcg 721 gcctcctgga caacctgctc ttcggcagca tcatctcggc cgtggacccc gtggcggttc 781 tggctgtctt tgaggaaatt cacatcaatg agctgctgca catccttgtt tttggggagt 841 ccttgctcaa tgacgccgtc actgtggtcc tgtatcacct ctttgaggag tttgccaact 901 acgaacactg tggcatcgtg gacatcttcc tcggcttcct gagcttcttc gtggtggccc 961 tgggcggggt gcttgtgggc gtggtctacg gggtcatcgc agccttcacc tcccgattta 1021 cctcccacat ccgggtcatc gagccgctct tcgtcttcct ctacagctac atggcctact 1081 tgtcagccga gctcttccac ctgtcaggca tcatggcgct catagcctca ggagtggtga 1141 tgcgccccta tgtggaggcc aacatctccc acaagtccca caccaccatc aaatacttcc 1201 tgaagatgtg gagcagcgtc agcgagaccc tcatcttcat cttcctcggc gtctccacgg 1261 tggccggctc ccaccactgg aactggacct tcgtcatcag caccctgctc ttctgcctca 1321 tcgcccgcgt gctgggggtg ctgggcctga cctggttcat caacaagttc cgtatcgtga 1381 agctgacccc caaggaccag ttcatcatcg cctatggggg cctgcgaggg gccatcgcct 1441 tctctctggg ctacctcctg gacaagaagc acttccccat gtgtgacctg ttcctcactg 1501 ccatcatcac tgtcatcttc ttcaccgtct ttgtgcaggg catgaccatt cggcccctgg 1561 tagacctgtt ggctgtgaag aaaaagcaag agacgaagcg ctccatcaac gaagagatcc 1621 acacacagtt cctggaccac cttctgacag gcatcgaaga catctgtggc cactacggtc 1681 accaccactg gaaggacaag ctcaaccggt ttaataagaa atatgtgaag aagtgtctga 1741 tagctggcga gcgctccaag gagccccagc tcattgcctt ctaccacaag atggagatga 1801 agcaggccat cgagctggtg gagagcgggg gcatgggcaa gatcccctct gccgtctcca 1861 ccgtctccat gcagaacatc caccccaagt ccctgccttc cgagcgcatc ctgccagcac 1921 tgtccaagga caacgaggag gagatccgca aaatcctgag gaacaacttg cagaagacca 1981 ggcagcggct gcggtcctac aacagacaca cgctggtggc agacccctac gaggaagcct 2041 ggaaccagat gctgctccgg aggcagaagg cccggcagct ggagcagaag atcaacaact 2101 acctgacggt gccagcccac aagctggact cacccaccat gtctcgggcc cgcatcggct 2161 cagacccact ggcctatgag ccgaaggagg acctgcctgt catcaccatc gacccggctt 2221 ccccgcagtc acccgagtct gtggacctgg tgaatgagga gctgaagggc aaagtcttag 2281 ggttgagccg ggatcctgca aaggtggctg aggaggacga ggacgacgat gggggcatca 2341 tgatgcggag caaggagact tcgtccccag gaaccgacga tgtcttcacc cccgcgccca 2401 gtgacagccc cagctcccag aggatacagc gctgcctcag tgacccaggc ccacaccctg 2461 agcctgggga gggagaaccg tccttcccca aggggcagta acgccagggc cagcaggcag 2521 cgcctgtccc ctcacagact cttccaccag agcaggggct gctgggggct ccccttgccc 2581 ttcctgaccc ggattggccc tgcccctccc cctaccgcat ggcagctggg cccacagccc 2641 ccaccccagc acagctcctc ccctgccgcc tcccgggaag catcctcccc accagagctg 2701 cctccccaat ccatttggca gaactgctgg ggctggtgag gccggccctg cccctcccta 2761 gatccaggct tctcccggac ctggactagg gcctcggagg ctcctccctc tgcctcatcc 2821 tcctcctcat tcagaccaat cttagtttct aaccaaagag tctctggctc agctgtggtc 2881 ccacccagga agggagggag ctgaggcctc ccttgagtag gccctgcttt atcaggggac 2941 aaaccagggg taccaggcac atggctgggg gagggactgc tgacccacca aggtctcaca 3001 ctcctcctgc cagctctgtc accctggcca ccacccaacc tgtccttact cagagctgcg 3061 ggctgagggc atctctgagt gtctctgcct ggagcagggg tggtttctac ggtgacagtg 3121 acgtgactca gagcttttcg aactgtgctc ccacggggac cactgggccc ctcaggggaa 3181 gctgctaggg gaaggactgg ccgtggctcc agaatgtgct gcctttttaa gttttgtttg 3241 ttcacactcc tatatatgat tgtttgcaca gagggcgctc ctgtttctaa aacattttga 3301 aaacccctgg ctgaacagtg ctctgcctct aactccctcc tcacactcca gaattaccct 3361 tcctcatctg tgcctgtctg tccaacccct cccccacgtc tctctgcctg ctgggctctt 3421 aactgttgct gaagactgtg acatcagaag taactcccac tcctaatcaa gagtctctcc 3481 agcctcacag atgctggcct cttggcacct gcctagctct tgggcctgac ctccagtcct 3541 gctggcctgc tcttacttcc cccaccctgg gtttggcccc tggaaccttt cccttgtgtg 3601 taccacaccc tgcctgctgt ggagcccatt gtggaggcgg tgggggggag aaggcctccc 3661 ctgaggatcc cctgtcccct ggggctggtg gattgggcag aatcctgggc ccccagagac 3721 ctttgcccac acacactcct tccccttgtc cctggggcac tcccccagga ttgtgcaata 3781 gtcagagtgt ccctttttgc agggactggg ccatggtcct cggcccatct gtccatcctc 3841 ctctccatgc aagtgctgtt tgggcaggag tcaccatgca agggtgacat cgacaaccac 3901 gtaccaagcc accgcagctg cgccactctg ctgcctgtac agaagaaact gaatcttttt 3961 catattctaa taaatcaatg tgagttttt // LOCUS HUMAPOD 809 bp mRNA PRI 08-AUG-1995 DEFINITION Human apolipoprotein D mRNA, complete cds. ACCESSION J02611 NID g178840 KEYWORDS apolipoprotein; apolipoprotein D. SOURCE Homo sapiens (clone: cAPOD.[6,8,16]) adult liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 809) AUTHORS Drayna,D., Fielding,C., McLean,J., Baer,B., Castro,G., Chen,E., Comstock,L., Henzel,W., Kohr,W., Rhee,L., Wion,K. and Lawn,R. TITLE Cloning and expression of human apolipoprotein D cDNA JOURNAL J. Biol. Chem. 261 (35), 16535-16539 (1986) MEDLINE 87057347 COMMENT Draft entry and clean copy sequence for [1] kindly provided by D.T.Drayna, 05-JAN-1987. The variation described in FEATURES starting at position 688 could be due to polymorphisms or, they could be the result of artifacts in cDNA cloning. FEATURES Location/Qualifiers source 1..809 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="cAPOD.[6,8,16]" /dev_stage="adult" /tissue_type="liver" /map="3q26.2-qter" mRNA <1..809 /note="apoD mRNA" variation 8 /note="t in cAPOD.[6,16]; c in cAPOD.8" sig_peptide 62..121 /gene="APOD" /note="apolipoprotein D signal peptide" gene 62..631 /gene="APOD" CDS 62..631 /gene="APOD" /note="apolipoprotein D precursor" /codon_start=1 /db_xref="GDB:G00-119-690" /db_xref="PID:g178841" /translation="MVMLLLLLSALAGLFGAAEGQAFHLGKCPNPPVQENFDVNKYLG RWYEIEKIPTTFENGRCIQANYSLMENGKIKVLNQELRADGTVNQIEGEATPVNLTEP AKLEVKFSWFMPSAPYWILATDYENYALVYSCTCIIQLFHVDFAWILARNPNLPPETV DSLKNILTSNNIDVKKMTVTDQVNCPKLS" mat_peptide 122..628 /gene="APOD" /note="apolipoprotein D" variation 449 /gene="APOD" /note="c in cAPOD.[8,16] and DNA; t in cAPOD.6" /replace="t" variation 688..704 /note="taccccaccccccccca in DNA; taccccccccccca in cAPOD.6" /replace="taccccccccccca" variation 688..704 /note="taccccaccccccccca in DNA; taccccaccccccgcca in cAPOD.16" /replace="taccccaccccccgcca" BASE COUNT 222 a 226 c 169 g 192 t ORIGIN Chromosome 3q. 1 atgcctgtct tcatcttgaa agaaaagctc caggtccctt ctccagccac ccagccccaa 61 gatggtgatg ctgctgctgc tgctttccgc actggctggc ctcttcggtg cggcagaggg 121 acaagcattt catcttggga agtgccccaa tcctccggtg caggagaatt ttgacgtgaa 181 taagtatctc ggaagatggt acgaaattga gaagatccca acaacctttg agaatggacg 241 ctgcatccag gccaactact cactaatgga aaacggaaag atcaaagtgt taaaccagga 301 gttgagagct gatggaactg tgaatcaaat cgaaggtgaa gccaccccag ttaacctcac 361 agagcctgcc aagctggaag ttaagttttc ctggtttatg ccatcggcac cgtactggat 421 cctggccacc gactatgaga actatgccct cgtgtattcc tgtacctgca tcatccaact 481 ttttcacgtg gattttgctt ggatcttggc aagaaaccct aatctccctc cagaaacagt 541 ggactctcta aaaaatatcc tgacttctaa taacattgat gtcaagaaaa tgacggtcac 601 agaccaggtg aactgcccca agctctcgta accaggttct acagggaggc tgcacccact 661 ccatgttact tctgcttcgc tttcccctac cccacccccc cccataaaga caaaccaatc 721 aaccacgaca aaggaagttg acctaaacat gtaaccatgc cctaccctgt taccttgcta 781 gctgcaaaat aaacttgttg ctgacctgc // LOCUS HUMAPOF 1719 bp mRNA PRI 01-MAY-1995 DEFINITION Human apolipoprotein F (APOF) mRNA, complete cds. ACCESSION L27050 NID g435966 KEYWORDS apolipoprotein F. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1719) AUTHORS Day,J.R., Albers,J.J., Gilbert,T.L., Whitmore,T.E., McConathy,W.J. and Wolfbauer,G. TITLE Purification and molecular cloning of human apolipoprotein F JOURNAL Biochem. Biophys. Res. Commun. 203 (2), 1146-1151 (1994) MEDLINE 94380022 FEATURES Location/Qualifiers source 1..1719 /organism="Homo sapiens" /db_xref="taxon:9606" gene 51..1031 /gene="APOF" CDS 51..1031 /gene="APOF" /note="putative" /codon_start=1 /product="apolipoprotein F" /db_xref="PID:g435967" /translation="MTGLCGYSAPDMRGLRLIMIPVELLLCYLLLHPVDATSYGKQTN VLMHFPLSLESQTPSSDPLSCQFLHPKSLPGFSHMAPLPKFLVSLALRNALEEAGCQA DVWALQLQLYRQGGVNATQVLIQHLRGLQKGRSTERNVSVEALASALQLLAREQQSTG RVGRSLPTEDCENEKEQAVHNVVQLLPGVGTFYNLGTALYYATQNCLGKARERGRDGA IDLGYDLLMTMAGMSGGPMGLAISAALKPALRSGVQQLIQYYQDQKDANISQPETTKE GLRAISDVSDLEETTTLASFISEVVSSAPYWGWAIIKSYDLDPGAGSLEI" BASE COUNT 464 a 404 c 444 g 407 t ORIGIN 1 caaacacata caggaagcga tcaaacctac caaggcagtc tcacttctca atgactggac 61 tgtgtgggta ctctgctcca gacatgcgtg gcctcagact catcatgata ccagttgagc 121 tgctactttg ctacctcctg ctgcaccctg tggatgccac ttcatatgga aagcagacaa 181 atgtcttgat gcactttccc ttgtccttgg aatcccagac accctcctca gaccccttgt 241 cctgccaatt tctgcaccca aagtcactgc ctggtttcag ccacatggcc cctctaccca 301 agttcttggt aagcctggct ctaaggaatg ccctggagga agctggttgt caggctgatg 361 tttgggctct acagctacag ctctaccgcc agggtggtgt gaatgctaca caggtcctca 421 tccagcatct tcgagggctc cagaaaggca gaagcacaga gaggaacgtg tcagtggaag 481 ccctggcctc tgctctgcag ctgttagcca gggagcagca aagcacagga agggtcgggc 541 gctccctccc gacagaggac tgtgagaatg agaaggagca agctgtgcac aatgtagtcc 601 agctgctgcc aggagtggga accttctaca acctgggcac agctttgtat tatgctactc 661 aaaactgcct gggcaaggcc agggaacgag gccgagatgg ggccatagat ctgggatatg 721 accttctgat gaccatggct gggatgtcag gggggcctat gggtctagcg atcagtgctg 781 cacttaaacc tgcattaagg tctggggttc agcagttgat ccagtattac caagatcaga 841 aagacgcaaa catctctcag ccggagacca ccaaggaggg tttgagggcc atctcagatg 901 tgagtgactt ggaagaaaca actactctgg cttctttcat atcagaagta gtaagttcag 961 ctccctactg ggggtgggcc ataatcaaga gctatgactt agatcctggg gctgggagtc 1021 ttgagatata aaagaatgtg gtaaccacag aattaataac tgtctaccct gacaagctat 1081 atacatgtct tcaaaatttt aatctgattt atccaggagg aaggctgtac agtaaaacgt 1141 aagaacgtaa atgtttgggt gttgaagtca cagggtttgg tttcgaatct aggctccact 1201 tgttagagcc tcggtgatca ctgaatagta acttctttct tgaactaaga tcagttttga 1261 agtttctaaa ggagatagaa tgattttaac ctcaatgagt tgccctgtaa atttaaaatg 1321 atacaatgaa tctaaaatgc ttatcacagt actttcaata aatagctatt agccaggtgc 1381 ggtggctcac gcctgtaatc ccagcactgt gagaggctga ggcgggatga tcacctgagg 1441 tcaggagttc aagatcagcc tgcgcaacat ggcgaaaccc cgtctctaca ataaatagca 1501 aaaaattatc ctggcggagt tatgcacgct tgtagtccca actacctggg aggctgaggc 1561 gggagaatca cctgagcctg ggaggctgag gcgggagaat cacctgagcc tgggaggtcg 1621 aggctgcagc gagccgagat cgcgccgctg cattccagcc tgggtgacag agcgagacca 1681 tgtctcaaaa aataaaaata aaaaaaaatt gttttcatt // LOCUS HUMAPOLIPH 1185 bp mRNA PRI 31-OCT-1994 DEFINITION Human apolipoprotein H mRNA, complete cds. ACCESSION M62839 NID g178856 KEYWORDS apolipoprotein H; beta-2-glycoprotein I. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1185) AUTHORS Day,J.R., O'Hara,P.J., Grant,F.J., Lofton-Day,C., Berkaw,M.N., Werner,P. and Arnaud,P. TITLE Molecular cloning and sequence analysis of the cDNA encoding human apolipoprotein H (beta 2-glycoprotein I) JOURNAL Int. J. Clin. Lab. Res. 21 (3), 256-263 (1992) MEDLINE 92273779 FEATURES Location/Qualifiers source 1..1185 /organism="Homo sapiens" /db_xref="taxon:9606" /map="17q" mRNA 1..1184 /gene="APOH" /note="G00-118-887" gene 1..1184 /gene="APOH" sig_peptide 40..96 /gene="APOH" /note="G00-118-887" CDS 40..1077 /gene="APOH" /note="a.k.a. beta-2-glycoprotein I" /codon_start=1 /db_xref="GDB:G00-118-887" /product="apolipoprotein H" /db_xref="PID:g178857" /translation="MISPVLILFSSFLCHVAIAGRTCPKPDDLPFSTVVPLKTFYEPG EEITYSCKPGYVSRGGMRKFICPLTGLWPINTLKCTPRVCPFAGILENGAVRYTTFEY PNTISFSCNTGFYLNGADSAKCTEEGKWSPELPVCAPIICPPPSIPTFATLRVYKPSA GNNSLYRDTAVFECLPQHAMFGNDTITCTTHGNWTKLPECREVKCPFPSRPDNGFVNY PAKPTLYYKDKATFGCHDGYSLDGPEEIECTKLGNWSAMPSCKASCKVPVKKATVVYQ GERVKIQEKFKNGMLHGDKVSFFCKNKEKKCSYTEDAQCIDGTIEVPKCFKEHSSLAF WKTDASDVKPC" mat_peptide 97..1074 /gene="APOH" /note="a.k.a. beta-2-glycoprotein I; G00-118-887" /product="apolipoprotein H" BASE COUNT 362 a 256 c 248 g 319 t ORIGIN 1 agaaaaccac tttggtagtg ccagtgtgac tcatccacaa tgatttctcc agtgctcatc 61 ttgttctcga gttttctctg ccatgttgct attgcaggac ggacctgtcc caagccagat 121 gatttaccat tttccacagt ggtcccgtta aaaacattct atgagccagg agaagagatt 181 acgtattcct gcaagccggg ctatgtgtcc cgaggaggga tgagaaagtt tatctgccct 241 ctcacaggac tgtggcccat caacactctg aaatgtacac ccagagtatg tccttttgct 301 ggaatcttag aaaatggagc cgtacgctat acgacttttg aatatcccaa cacgatcagt 361 ttttcttgta acactgggtt ttatctgaat ggcgctgatt ctgccaagtg cactgaggaa 421 ggaaaatgga gcccggagct tcctgtctgt gctcccatca tctgccctcc accatccata 481 cctacgtttg caacacttcg tgtttataag ccatcagctg gaaacaattc cctctatcgg 541 gacacagcag tttttgaatg tttgccacaa catgcgatgt ttggaaatga tacaattacc 601 tgcacgacac atggaaattg gacaaaatta ccagaatgca gggaagtaaa atgcccattc 661 ccatcaagac cagacaatgg atttgtgaac tatcctgcaa aaccaacact ttattacaag 721 gataaagcca catttggctg ccatgatgga tattctctgg atggcccgga agaaatagaa 781 tgtaccaaac tgggaaactg gtctgccatg ccaagttgta aagcatcttg taaagtacct 841 gtgaaaaaag ccactgtggt gtaccaagga gagagagtaa agattcagga aaaatttaag 901 aatggaatgc tacatggtga taaagtttct ttcttctgca aaaataagga aaagaagtgt 961 agctatacag aggatgctca gtgtatagat ggcactatcg aagtccccaa atgcttcaag 1021 gaacacagtt ctctggcttt ttggaaaact gatgcatccg atgtaaagcc atgctaaggt 1081 ggttttcaga ttccacataa aatgtcacac ttgtttcttg ttcatccaag gaacctaatt 1141 gaaatttaaa aataaagcta ctgaatttat tgccgcaaaa aaaaa // LOCUS HUMAPR 1885 bp mRNA PRI 13-JAN-1992 DEFINITION Human ATL-derived PMA-responsive (APR) peptide mRNA. ACCESSION D90070 M57246 NID g219475 KEYWORDS 12-myristate 13-acetate; ATL; PMA; PMA-inducible mRNA. SOURCE Human adult T-cell leukemia cell line IKD, cDNA to mRNA, clone ICP82-23. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1885) AUTHORS Hijikata,M., Kato,N., Sato,T., Kagami,Y. and Shimotohno,K. TITLE Molecular cloning and characterization of a cDNA for a novel phorbol-12-myristate-13-acetate-responsive gene that is highly expressed in an adult T-cell leukemia cell line JOURNAL J. Virol. 64 (10), 4632-4639 (1990) MEDLINE 90376412 COMMENT These data kindly submitted in computer readable form by: Makoto Hijikata Virology Division, National Cancer Center Research Institute Tsukiji 5-1-1, Chuo-ku Tokyo 104 Japan Phone: 03-542-2511 Fax: 03-545-3567. FEATURES Location/Qualifiers source 1..1885 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 174..338 /note="APR peptide" /codon_start=1 /db_xref="PID:d1014813" /db_xref="PID:g219476" /translation="MPGKKARKNAQPSPARAPAELEVECATQLRRFGDKLNFRQKLLN LISKLFCSGT" misc_feature 1551..1558 /note="mRNA destabilizer like sequence" misc_feature 1566..1570 /note="mRNA destabilizer like sequence" misc_feature 1743..1747 /note="mRNA destabilizer like sequence" misc_feature 1801..1805 /note="mRNA destabilizer like sequence" misc_feature 1819..1823 /note="mRNA destabilizer like sequence" polyA_signal 1859..1864 /note="put. polyadenylation signal" BASE COUNT 560 a 303 c 388 g 634 t ORIGIN 1 cgggcactca ccgtgtgtag ttggcatctc cgcgcgtccg gacacccgat cccagcatcc 61 ctgcctgcag gactgttcgt gttcagctcg cgtcctgcag ctgtccgagg tgctccagtt 121 ggaggctgag gttcccgggc tctgtcgctg agtgggcggc ggcaccggcg gagatgcctg 181 ggaagaaggc gcgcaagaac gctcaaccga gccccgcgcg ggctccagca gagctggaag 241 tcgagtgtgc tactcaactc aggagatttg gagacaaact gaacttccgg cagaaacttc 301 tgaatctgat atccaaactc ttctgctcag gaacctgact gcatcaaaaa cttgcatgag 361 gggactcctt caaaagagtt ttctcaggag gtgcacgttt catcaatttg aagaaagact 421 gcattgtaat tgagaggaat gtgaaggtgc attcatgggt gcccttggaa acggaagatg 481 gaatacatca aagtgaattt ctgttcaagt tttcccagat tatcattctt tgggatgaga 541 gaacattata aaaccacttt gtttatttta aagcaagaat ggaagaccct tgaaaataaa 601 gaagtaatta ttgacacatt tcttttttac ttagagaatc gttctagtgt ttttgccgaa 661 gattaccgct ggcctactgt gaaggtagat gacctgtgat tagactgggc ggctggggag 721 aaacagttca gtgcattgtt gttgttgctg tttttggtgt tttgcttttc agtgccaact 781 cagcacattg tatatgattc ggtttataca tattaccttg ttataatgaa aaaactcatt 841 ctgagaacac tgaaatgtta tactcagtgt tgatttcttc ggtcactaca caacgtaaaa 901 tcatttgttt cttttgactc aaattgtatt gcttctgttc agatgatctt tcattcaatg 961 tgttcctgtt gggcgttact agaaactatg gaaaactgga aaataacttt gaaaaaattg 1021 gataaagtat aggagggtta cttggggcca gtaaatcagt agactgaaca ttcaatataa 1081 taaaagaaca tggggatttt gtataaccag ggataataaa aagaaaaaga agttaatttt 1141 taattgatgt ttttgaaact tagtagaaca aatattcaga agtaacttga taagatatga 1201 atgtttctaa agagtttcta aaggttcgaa atgctccttg tcacattagt gtgcatccta 1261 caaaaagtga tctcttaatg taaattaaga atattttcat aattggaata tacttttctt 1321 aaaaaaaagg aacagttagt tctcatctag aatgaaagtt ccatatatgc attggtgaat 1381 atatatgtat acacatactt acatacttat atgggtatct gtatagataa tttgtattag 1441 agtattatat agcttcttag tagggtctca agtaagttca ttttttttat ctgggctata 1501 tacagtcctc aaataaataa tgtcttgatt ttatttcagc aggaataatt ttatttattt 1561 tgcctattta taattaaagt atttttcttt agtttgaaat gtgtattaaa gttacatttt 1621 tgagttacaa gagtcttata actacttgaa tttttagtta aaatgtctta atgtaggttg 1681 tagtcacttt agatggaaaa ttacctcaca tctgttttct tcagtattac ttaagattgt 1741 ttatttagtg gtagagagat tttttttttc agcctagagg cagctatttt accatctggt 1801 atttatggtc taatttgtat ttaaacatat gcacacatat aaaagttgat actgtggcag 1861 taaactatta aaagttttca ctgtt // LOCUS HUMAPRF 2787 bp mRNA PRI 31-DEC-1994 DEFINITION Homo sapiens DNA-binding protein (APRF) mRNA, complete cds. ACCESSION L29277 NID g475788 KEYWORDS APRF DNA-binding protein; binding protein; transcription factor. SOURCE Homo sapiens (tissue library: lambda gtII) placental cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2787) AUTHORS Akira,S., Nishio,Y., Inoue,M., Wang,X.J., Wei,S., Matsusaka,T., Yoshida,K., Sudo,T., Naruto,M. and Kishimoto,T. TITLE Molecular cloning of APRF, a novel IFN-stimulated gene factor 3 p91-related transcription factor involved in the gp130-mediated signaling pathway JOURNAL Cell 77 (1), 63-71 (1994) MEDLINE 94208062 FEATURES Location/Qualifiers source 1..2787 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placental" /tissue_lib="lambda gtII" gene 221..2533 /gene="APRF" CDS 221..2533 /gene="APRF" /standard_name="transcription factor" /codon_start=1 /product="DNA-binding protein" /db_xref="PID:g475789" /translation="MAQWNQLQQLDTRYLEQLHQLYSDSFPMELRQFLAPWIESQDWA YAASKESHATLVFHNLLGEIDQQYSRFLQESNVLYQHNLRRIKQFLQSRYLEKPMEIA RIVARCLWEESRLLQTAATAAQQGGQANHPTAAVVTEKQQMLEQHLQDVRKRVQDLEQ KMKVVENLQDDFDFNYKTLKSQGDMQDLNGNNQSVTRQKMQQLEQMLTALDQMRRSIV SELAGLLSAMEYVQKTLTDEELADWKRRQQIACIGGPPNICLDRLENWITSLAESQLQ TRQQIKKLEELHQKVSYKGDPIVQHRPMLEERIVELFRNLMKSAFVVERQPCMPMHPD RPLVIKTGVQFTTKVRLLVKFPELNYQLKIKVCIDKDSGDVAALRGSRKFNILGTNTK VMNMEESNNGSLSAEFKHLTLREQRCGNGGRANCDASLIVTEELHLITFETEVYHQGL KIDLETHSLSVVVISNICQMPNAWASILWYNMLTNNPKNVNFFTKPPIGTWDQVAEVL SWQFSSTTKRGLSIEQLTTLAEKLLGPGVNYSGCQITWANFCKENMAGKGFSYWVWLD NIIDLVKKYILALWNEGYIMGFISKERERAILSTKPPGTFLLRFSESSKEGGVTFTWV EKDISGKTQIQSVEPYTKQQLNNMSFAEIIMGYKIMDATNILLSPLVYLYPDIPKEEA FGKYCRPESQEHPEADPGSAAPYLKTKFICVTPTTCSNTIDLPMSPRALDSLMQFGNN GEGAEPSAGGQFESLTFDMELTSECATSPM" BASE COUNT 729 a 719 c 753 g 586 t ORIGIN 1 cagctggaat tcggggcggc ggcgcagact gggaggggga gccgggggtt ccgacgtcgc 61 agccgaggga acaagcccca accggatcct ggacaggcac cccggcttgg cgctgtctct 121 ccccctcggc tcggagaggc ccttcggcct gagggagcct cgccgcccgt ccccggcaca 181 cgcgcagccc cggcctctcg gcctctgccg gagaaacagg atggcccaat ggaatcagct 241 acagcagctt gacacacggt acctggagca gctccatcag ctctacagtg acagcttccc 301 aatggagctg cggcagtttc tggccccttg gattgagagt caagattggg catatgcggc 361 cagcaaagaa tcacatgcca ctttggtgtt tcataatctc ctgggagaga ttgaccagca 421 gtatagccgc ttcctgcaag agtcgaatgt tctctatcag cacaatctac gaagaatcaa 481 gcagtttctt cagagcaggt atcttgagaa gccaatggag attgcccgga ttgtggcccg 541 gtgcctgtgg gaagaatcac gccttctaca gactgcagcc actgcggccc agcaaggggg 601 ccaggccaac caccccacag cagccgtggt gacggagaag cagcagatgc tggagcagca 661 ccttcaggat gtccggaaga gagtgcagga tctagaacag aaaatgaaag tggtagagaa 721 tctccaggat gactttgatt tcaactataa aaccctcaag agtcaaggag acatgcaaga 781 tctgaatgga aacaaccagt cagtgaccag gcagaagatg cagcagctgg aacagatgct 841 cactgcgctg gaccagatgc ggagaagcat cgtgagtgag ctggcggggc ttttgtcagc 901 gatggagtac gtgcagaaaa ctctcacgga cgaggagctg gctgactgga agaggcggca 961 acagattgcc tgcattggag gcccgcccaa catctgccta gatcggctag aaaactggat 1021 aacgtcatta gcagaatctc aacttcagac ccgtcaacaa attaagaaac tggaggagtt 1081 gcaccaaaaa gtttcctaca aaggggaccc cattgtacag caccggccga tgctggagga 1141 gaggatcgtg gagctgttca gaaacttaat gaaaagtgcc tttgtggtgg agcggcagcc 1201 ctgcatgccc atgcatcctg accggcccct cgtcatcaag accggcgtcc agttcactac 1261 taaagtcagg ttgctggtca agttccctga gttgaattat cagcttaaaa ttaaagtgtg 1321 cattgacaaa gactctgggg acgttgcagc tctcagagga tcccggaaat ttaacattct 1381 gggcacaaac acaaaagtga tgaacatgga agaatccaac aacggcagcc tctctgcaga 1441 attcaaacac ttgaccctga gggagcagag atgtgggaat gggggccgag ccaattgtga 1501 tgcttccctg attgtgactg aggagctgca cctgatcacc tttgagaccg aggtgtatca 1561 ccaaggtctc aagattgacc tagagaccca ctccttgtca gttgtggtga tctccaacat 1621 ctgtcagatg ccaaatgcct gggcgtccat cctgtggtac aacatgctga ccaacaatcc 1681 caagaatgtg aacttcttca ctaagccgcc aattggaacc tgggaccaag tggccgaggt 1741 gctcagctgg cagttctcgt ccaccaccaa gcgggggctg agcatcgagc agctgacaac 1801 gctggctgag aagctcctag ggcctggtgt gaactactca gggtgtcaga tcacatgggc 1861 taacttctgc aaagaaaaca tggctggcaa gggcttctcc tactgggtct ggctagacaa 1921 tatcatcgac cttgtgaaaa agtatatctt ggccctttgg aatgaagggt acatcatggg 1981 tttcatcagc aaggagcggg agcgggccat cttgagcact aagcccccag gcaccttcct 2041 gctgcgcttc agtgaaagca gcaaagaagg aggcgtcact ttcacttggg tggagaagga 2101 catcagcggt aagacccaga tccagtccgt ggaaccatac acaaagcagc agctgaacaa 2161 catgtcattt gctgaaatca tcatgggcta taagatcatg gatgctacca atatcctgtt 2221 gtctccactt gtctatctct atcctgacat tcccaaggag gaggcattcg ggaagtattg 2281 tcggccagag agccaggagc atcctgaagc tgacccaggt agcgctgccc catacctgaa 2341 gaccaagttt atctgtgtga caccaacgac ctgcagcaat accattgacc tgccgatgtc 2401 cccccgcgct ttagattcat tgatgcagtt tggaaataat ggtgaaggtg ctgaaccctc 2461 agcaggaggg cagtttgagt ccctcacctt tgacatggag ttgacctcgg agtgcgctac 2521 ctcccccatg tgaggagctg agaacggaag ctgcagaaag atacgactga ggcgcctacc 2581 tgcattctgc cacccctcac acagccaaac cccagatcat ctgaaactac taactttgtg 2641 gttccagatt ttttttaatc tcctacttct gctatctttg agcaatctgg gcacttttaa 2701 aaatagagaa atgagtgaat gtgggtgatc tgcttttatc taaatgcaaa taaggatgtg 2761 ttctctgaga cccatgatca ggggatg // LOCUS HUMAR 538 bp mRNA PRI 11-SEP-1996 DEFINITION Human mRNA for chemokine, complete cds. ACCESSION D43767 NID g1536878 KEYWORDS chemokine, thymus and activation-regulated; chemokine. SOURCE Homo sapiens male peripheral blood cDNA to mRNA, clone:D3A. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Imai,T., Yoshida,T., Baba,M., Nishimura,M., Kakizaki,M. and Yoshie,O. TITLE Molecular cloning of a novel T cell-directed CC chemokine expressed in thymus by signal sequence trap using Epstein-Barr virus vector JOURNAL J. Biol. Chem. 271 (35), 21514-21521 (1996) MEDLINE 96355526 REFERENCE 2 (bases 1 to 538) AUTHORS Imai,T. JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 538) AUTHORS Imai,T. TITLE Direct Submission JOURNAL Submitted (07-DEC-1994) to the DDBJ/EMBL/GenBank databases. Toshio Imai, Shionogi Institute for Medical Science; 2-5-1 Mishima, Settsu, Osaka 566, Japan (Tel:06-382-2612, Fax:06-382-2598) FEATURES Location/Qualifiers source 1..538 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="D3A" /sex="male" /tissue_type="peripheral blood" CDS 53..337 /note="thymus and activation regulated" /codon_start=1 /product="chemokine" /db_xref="PID:d1008410" /db_xref="PID:g1536879" /translation="MAPLKMLALVTLLLGASLQHIHAARGTNVGRECCLEYFKGAIPL RKLKTWYQTSEDCSRDAIVFVTVQGRAICSDPNNKRVKNAVKYLQSLERS" BASE COUNT 118 a 168 c 149 g 103 t ORIGIN 1 ccctgagcag agggacctgc acacagagac tccctcctgg gctcctggca ccatggcccc 61 actgaagatg ctggccctgg tcaccctcct cctgggggct tctctgcagc acatccacgc 121 agctcgaggg accaatgtgg gccgggagtg ctgcctggag tacttcaagg gagccattcc 181 ccttagaaag ctgaagacgt ggtaccagac atctgaggac tgctccaggg atgccatcgt 241 ttttgtaact gtgcagggca gggccatctg ttcggacccc aacaacaaga gagtgaagaa 301 tgcagttaaa tacctgcaaa gccttgagag gtcttgaagc ctcctcaccc cagactcctg 361 actgtctccc gggactacct gggacctcca ccgttggtgt tcaccgcccc caccctgagc 421 gcctgggtcc aggggaggcc ttccagggac gaagaagagc cacagtgagg gagatcccat 481 ccccttgtct gaactggagc catgggcaca aagggcccag attaaagtct ttatcctc // LOCUS HUMARDE 1599 bp mRNA PRI 09-SEP-1994 DEFINITION Human arylacetamide deacetylase mRNA, complete cds. ACCESSION L32179 NID g537513 KEYWORDS arylacetamide deacetylase; esterase; lipase. SOURCE Homo sapiens male adult liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1599) AUTHORS Probst,M.R., Beer,M., Beer,D., Jenoe,P., Meyer,U.A. and Gasser,R. TITLE Human liver arylacetamide deacetylase: Molecular cloning of a novel esterase involved in the metabolic activation of arylamine carcinogens with high sequence similarity to hormone sensitive lipase JOURNAL J. Biol. Chem. 34, 21650-21656 (1994) FEATURES Location/Qualifiers source 1..1599 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /sex="male" /tissue_type="liver" 5'UTR 1..85 mRNA 1..1599 CDS 86..1285 /codon_start=1 /product="arylacetamide deacetylase" /db_xref="PID:g537514" /translation="MGRKSLYLLIVGILIAYYIYTPLPDNVEEPWRMMWINAHLKTIQ NLATFVELHGSSIFMDSFKVVGSFDEVPPTSDENVTVTETKFNNILVRVYVPKRKSEA LRRGLFYIHGGGWCVGSAALSGYDLLSRWTADRLDAVVVSTNYRLAPKYHFPIQFEDV YNALRWFLRKKVLAKYGVNPERIGISGDSAGGNLAAAVTQQLLDDPDVKIKLKIQSLI YPALQPLDVDLPSYQENSNFLFLSKSLMVRFWSEYFTTDRSLEKAMLSRQHVPVESSH LFKFINWSSLLPERFIKGHVYNNPNYGSSELAKKYPGFLDVRAAPLLADDNKLRGLPL TYVITCQYDLLRDDGLMYVTRLRNTGVQVTHNHVEDGFHGAFSFLGLKISHRLINQYI EWLKENL" 3'UTR 1286..1599 polyA_site 1599 BASE COUNT 473 a 293 c 318 g 515 t ORIGIN 1 cactgcttat taaagtacac tattcaggca tatcatgtag gtttactttc tgtgtttcta 61 gagaccaaga agcgggacgt tcaccatggg aagaaaatcg ctgtaccttc tgattgtggg 121 gatcctcata gcatattata tttatacgcc tctcccagat aacgttgagg agccatggag 181 aatgatgtgg ataaacgcac atctgaaaac tatacaaaat ttggctacat ttgtggagct 241 ccatgggagt tccattttta tggattcctt taaggttgtc gggagctttg atgaagtccc 301 accaacctca gatgaaaatg tcactgtgac tgagacaaaa ttcaacaaca ttcttgttcg 361 ggtatatgtg ccaaagagaa agtctgaagc actaagaagg gggttgtttt acatccatgg 421 tggaggctgg tgcgtgggaa gtgctgctct aagtggttat gacttgctgt caagatggac 481 agcagacaga cttgatgctg tcgtcgtatc aaccaactac agattagcac ctaagtatca 541 tttcccaatt caatttgaag atgtatataa tgccttaagg tggttcttac gtaaaaaagt 601 tcttgcaaaa tatggtgtga accctgagag aatcggtatt tctggagata gtgcaggagg 661 gaatttagct gcagcagtga ctcaacagct ccttgatgac ccagatgtca agatcaaact 721 caagatccag tctttaattt atcctgccct tcagcctctt gatgtagatt taccgtcata 781 tcaagaaaat tcaaattttc tatttctatc caaatcactc atggtcagat tctggagtga 841 atattttacc actgatagat cacttgaaaa agccatgctt tccagacaac atgtacctgt 901 ggaatcaagt catctcttca aatttattaa ttggagttcc ctgctccctg agaggtttat 961 aaaaggacat gtttataaca atccaaatta tggcagttct gagctggcta aaaaatatcc 1021 agggttccta gatgtgaggg cagccccttt gttggctgat gacaacaaat tacgtggctt 1081 acccctgacc tatgtcatca cctgtcaata tgatctctta agagatgatg gactcatgta 1141 tgtcacccga cttcgcaaca ctggggttca ggtgactcat aaccatgttg aggatggatt 1201 ccatggagca ttttcatttc tgggacttaa aattagtcac agacttataa atcagtatat 1261 tgagtggcta aaggaaaatc tatagtaaaa catgtagcta taacatattt taaaaataaa 1321 atctgaaaac ctcagaaaat ttcgattaga aattggtctt tcttagaatg gtctagttaa 1381 gttccacatg tagcataatt cttaaatagg cacttttctg tttttttttt cttactgtgg 1441 gatttcattt caattttcta cattgtctat ctgctttttc ggagattttc cttcttacac 1501 tgttaatctt attttaaaaa atattacatt cttgtatact ttatttttgt gagttggcta 1561 ctatttacga tgcaagagaa taaatgtgag caaagattg // LOCUS HUMARF4 1529 bp mRNA PRI 10-NOV-1994 DEFINITION Human ADP-ribosylation factor 4 (ARF4) mRNA, complete cds. ACCESSION M36341 M31890 NID g178984 KEYWORDS ADP-ribosylation factor 4. SOURCE Human cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 766) AUTHORS Monaco,L., Murtagh,J.J., Newman,K.B., Tsai,S.C., Moss,J. and Vaughan,M. TITLE Selective amplification of an mRNA and related pseudogene for a human ADP-ribosylation factor, a guanine nucleotide-dependent protein activator of cholera toxin JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87 (6), 2206-2210 (1990) MEDLINE 90192776 REFERENCE 2 (bases 66 to 1529) AUTHORS Kahn,R.A., Kern,F.G., Clark,J., Gelmann,E.P. and Rulka,C. TITLE Human ADP-ribosylation factors. A functionally conserved family of GTP-binding proteins JOURNAL J. Biol. Chem. 266 (4), 2606-2614 (1991) MEDLINE 91115891 REFERENCE 3 (bases 1 to 766) AUTHORS Monaco,L. TITLE Direct Submission JOURNAL Submitted (02-FEB-1990) Lucia Monaco, Laboratory of Cellular Metabolism, National Heart, Lung and Blood Institute, NIH, Bethesda, MD 20892, USA REFERENCE 4 (bases 66 to 1529) AUTHORS Kahn,R.A. TITLE Direct Submission JOURNAL Submitted (10-JUL-1990) Richard A. Kahn, National Cancer Institute, Lab. of Biol. Chem., DCT, Bldg. 37, NIH, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..1529 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="breast cancer cell line MDA-MB 231" gene 130..672 /gene="ARF4" CDS 130..672 /gene="ARF4" /codon_start=1 /product="ADP-ribosylation factor 4" /db_xref="PID:g178985" /translation="MGLTISSLFSRLFGKKQMRILMVGLDAAGKTTILYKLKLGEIVT TIPTIGFNVETVEYKNICFTVWDVGGQDRIRPLWKHYFQNTQGLIFVVDSNDRERIQE VADELQKMLLVDELRDAVLLLFANKQDLPNAMAISEMTDKLGLQSLRNRTWYVQATCA TQGTGLYEGLDWLSNELSKR" BASE COUNT 427 a 294 c 295 g 513 t ORIGIN 1 ctgcctccct ctttcttcct ccgctctttc tcttccctct cgtttagttt gcctggagct 61 tgaaaggaga aagcacgggg tcgccccaaa ccccttctgc ttctgcccat cacaagtgcc 121 actaccgcca tgggcctcac tatctcctcc ctcttctccc gactatttgg caagaagcag 181 atgcgcattt tgatggttgg attggatgct gctggcaaga caaccattct gtataaactg 241 aagttagggg agatagtcac caccattcct accattggtt ttaatgtgga aacagtagaa 301 tataagaaca tttgtttcac agtatgggat gttggtggtc aagatagaat taggcctctc 361 tggaagcatt acttccagaa tacccagggt cttatttttg tggtagatag caacgatcgt 421 gaaagaattc aggaagtagc agatgagctg cagaaaatgc ttctggtaga tgaattgaga 481 gatgcagtgc tgctactttt tgcaaacaaa caggatttgc caaatgctat ggccatcagt 541 gaaatgacag ataaactagg gcttcagtct cttcgtaaca gaacatggta tgttcaagcc 601 acttgtgcaa cacaaggaac tggtctgtat gaaggacttg actggctgtc aaatgagctt 661 tcaaaacgtt aaatgaaatt ggatatctaa ccaaggacat gtttgataaa attggtctag 721 gcttgttaca acaaaattag tttgtatctt ggttattaaa cagtatctgg gactggtttg 781 ggcagaatat taaacttatt ttgttgccaa ttattgttta ccgagtataa tgttgctatt 841 tagcaatgtg cttggtttta aagaaattct ccttgggaaa aaagtatcct cttttaattt 901 tacttcccat aagcgtaaat gcctggacat agctcttgtg aacctttaaa taaattgttt 961 gagtgttttt gagccccaga caaataatgt tttaaagtta tcccttgcta ctttactgat 1021 acctttatca ttcctgagac agtttgctaa tttaaaaatg tagcattcca tttgtattta 1081 tttctctccc ttgccaaaaa gattttctaa tactgcttgt accagccaga gaaagatcca 1141 aaacactact cagctctctt gcactgagga aatttttccc cctacattga ctcctggcct 1201 acatcagcca aacttaacct tggtggggtt tggatttgat agccaattag ttctgtgctg 1261 gttgcaaaga attgatattt agatggtttt taatactcag cagattgtct tcccatattg 1321 tgtctttttt atgttgcatg ttgcttttgt tatcagcctg attttttgct cagtatatga 1381 tagttctgct gatgttttgt ttattgggca gacatatctt cattaagagt ttttggaaaa 1441 ctcatcaaat tcgatgaata cattttcttc ataacccatt tggaattatt cctaataaaa 1501 tgataaaata cgtaaaaaaa aaggaattc // LOCUS HUMARF6A 1194 bp mRNA PRI 28-FEB-1996 DEFINITION Human ADP-ribosylation factor (hARF6) mRNA, complete cds. ACCESSION M57763 NID g178988 KEYWORDS ADP-ribosylation factor. SOURCE Human cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1194) AUTHORS Tsuchiya,M., Price,S.R., Tsai,S.C., Moss,J. and Vaughan,M. TITLE Molecular identification of ADP-ribosylation factor mRNAs and their expression in mammalian cells JOURNAL J. Biol. Chem. 266 (5), 2772-2777 (1991) MEDLINE 91131565 FEATURES Location/Qualifiers source 1..1194 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HL-60" /tissue_type="lambda ZAP library of H.Malech" gene 518..1045 /gene="hARF6" CDS 518..1045 /gene="hARF6" /codon_start=1 /product="ADP-ribosylation factor" /db_xref="PID:g178989" /translation="MGKVLSKIFGNKEMRILMLGLDAAGKTTILYKLKLGQSVTTIPT VGFNVETVTYKNVKFNVWDVGGQDKIRPLWRHYYTGTQGLIFVVDCADRDRIDEARQE LHRIINDREMRDAIILIFANKQDLPDAMKPHEIQEKLGLTRIRDRNWYVQPSCATSGD GLYEGLTWLTSNYKS" BASE COUNT 230 a 352 c 403 g 209 t ORIGIN 1 ggccggaggg agcccgcgct cggggcggcg gctggaggca gcgcaccgag ttcccgcgag 61 gatccatgac ctgacggggc cccggagccg cgctgcctct cgggtgtcct gggtcggtgg 121 ggagcccagt gctcgcaggc cggcgggcgg gccggagggc tgcagtctcc ctcgcggtga 181 gaggaaggcg gaggagcggg aaccgcggcg gcgctcgcgc ggcgcctgcg gggggaaggg 241 cagttccggg ccgggccgcg cctcagcagg gcggcggctc ccagcgcagt ctcagggccc 301 gggtggcggc ggcgactgga gaaatcaagt tgtgcggtcg gtgatgcccg agtgagcggg 361 gggcctgggc ctctgccctt aggaggcaac tcccacgcag gccgcaaagg gctctcgcgg 421 ccgagaggct tcgtttcggt ttcgcggcgg cggcggcgtt gttggctgag gggacccggg 481 acacctgaat gcccccggcc ccggctcctc cgacgcgatg gggaaggtgc tatccaaaat 541 cttcgggaac aaggaaatgc ggatcctcat gttgggcctg gacgcggccg gcaagacaac 601 aatcctgtac aagttgaagc tgggccagtc ggtgaccacc attcccactg tgggtttcaa 661 cgtggagacg gtgacttaca aaaatgtcaa gttcaacgta tgggatgtgg gcggccagga 721 caagatccgg ccgctctggc ggcattacta cactgggacc caaggtctca tcttcgtagt 781 ggactgcgcc gaccgcgacc gcatcgatga ggctcgccag gagctgcacc gcattatcaa 841 tgaccgggag atgagggacg ccataatcct catcttcgcc aacaagcagg acctgcccga 901 tgccatgaaa ccccacgaga tccaggagaa actgggcctg acccggattc gggacaggaa 961 ctggtatgtg cagccctcct gtgccacctc aggggacgga ctctatgagg ggctcacatg 1021 gttaacctct aactacaaat cttaatgagc attctccacc catcccctgg aaggagagaa 1081 atcaaaaacc cattcatagg attatcgcca ccatcacctc tttcaattgc cactttctct 1141 tcttttgaat ttgaactctg gagttactgt tctacagttt ggcggggacg gggc // LOCUS HUMARG 1103 bp mRNA PRI 24-SEP-1996 DEFINITION Human arginine-rich protein (ARP) gene, complete cds. ACCESSION M83751 NID g178990 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1103) AUTHORS Shridhar,V., Rivard,S., Shridhar,R., Mullins,C., Bostick,L., Sakr,W., Grignon,D., Miller,O.J. and Smith,D.I. TITLE A gene from human chromosomal band 3p21.1 encodes a highly conserved arginine-rich protein and is mutated in renal cell carcinomas JOURNAL Oncogene 12 (9), 1931-1939 (1996) MEDLINE 96211400 FEATURES Location/Qualifiers source 1..1103 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="ags" /cell_type="gastric mucosa" /tissue_type="adenogastric carcinoma" /map="3p21.1" /chromosome="3" gene 133..837 /gene="ARP" CDS 133..837 /gene="ARP" /note="putative" /codon_start=1 /product="arginine-rich protein" /db_xref="PID:g178991" /translation="MGKWHVGGRRGSPRQWGATARGRDLEAVRRGGCGSVGRRRQRRR RRRRRMRRMRRMWATQGLAVRVALSVLPGSRALRPGDCEVCISYLGRFYQDLKDRDVT FSPATIENELIKFCREARGKENRLCYYIGATDDAATKIINEVSKPLAHHIPVEKICEK LKKKDSQICELKYDKQIDLSTVDLKKLRVKELKKILDDWGETCKGCAEKSDYIRKINE LMPKYAPKAASAPTDL" BASE COUNT 278 a 255 c 338 g 232 t ORIGIN 1 cttcggtcct gctgtagtgc cttctgcgcc aggcccggtt caatcagcgg ccacaactgt 61 ctagggctca gacaccacca gccaatgagg gagggcacgt ggagccgcgt ctgggctcgc 121 ggctcctgac caatggggaa gtggcatgtg ggagggcgcc ggggttcccc ccgccaatgg 181 ggagctacgg cgcgcggccg ggacttggag gcggtgcggc gcggcgggtg cggttcagtc 241 ggtcggcggc ggcagcggag gaggaggagg aggaggagga tgaggaggat gaggaggatg 301 tgggccacgc aggggctggc ggtgcgcgtg gctctgagcg tgctgccggg cagccgggcg 361 ctgcggccgg gcgactgcga agtttgtatt tcttatctgg gaagatttta ccaggacctc 421 aaagacagag atgtcacatt ctcaccagcc actattgaaa acgaacttat aaagttctgc 481 cgggaagcaa gaggcaaaga gaatcggttg tgctactata tcggggccac agatgatgca 541 gccaccaaaa tcatcaatga ggtatcaaag cctctggccc accacatccc tgtggagaag 601 atctgtgaga agcttaagaa gaaggacagc cagatatgtg agcttaagta tgacaagcag 661 atcgacctga gcacagtgga cctgaagaag ctccgagtta aagagctgaa gaagattctg 721 gatgactggg gggagacatg caaaggctgt gcagaaaagt ctgactacat ccggaagata 781 aatgaactga tgcctaaata tgcccccaag gcagccagtg caccgaccga tttgtagtct 841 gctcaatctc tgttgcacct gagggggaaa aaacagttca actgcttact cccaaaacag 901 cctttttgta atttattttt taagtgggct cctgacaata ctgtatcaga tgtgaagcct 961 ggagctttcc tgatgatgct ggccctacag tacccccatg aggggattcc cttccttctg 1021 ttgctggtgt actctaggac ttcaaagtgt gtctgggatt tttttattaa agaaaaaaaa 1081 tttctagctg tcaaaaaaaa aaa // LOCUS HUMARGCAA 3849 bp mRNA PRI 22-JUN-1990 DEFINITION Human tyrosine kinase arg gene mRNA. ACCESSION M35296 NID g178992 KEYWORDS tyrosine kinase. SOURCE Human cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3849) AUTHORS Kruh,G.D., Perego,R., Miki,T. and Aaronson,S.A. TITLE The complete coding sequence of arg defines the Abelson subfamily of cytoplasmic tyrosine kinases JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5802-5806 (1990) MEDLINE 90332670 COMMENT Authorin copy of sequence for [1] kindly submitted by G.D.Kruh, 19-JUN-1990, for release after publication. FEATURES Location/Qualifiers source 1..3849 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 205..3753 /note="arg tyrosine kinase" /codon_start=1 /db_xref="PID:g178993" /translation="MGQQVGRVGEAPGLQQPQPRGIRGSSAARPSGRRRDPAGRTTET GFNIFTQHDHFASCVEDGFEGDKTGGSSPEALHRPYGCDVEPQALNEAIRWSSKENLL GATESDPNLFVALYDFVASGDNTLSITKGEKLRVLGYNQNGEWSEVRSKNGQGWVPSN YITPVNSLEKHSWYHGPVSRSAAEYLLSSLINGSFLVRESESSPGQLSISLRYEGRVY HYRINTTADGKVYVTAESRFSTLAELVHHHSTVADGLVTTLHYPAPKCNKPTVYGVSP IHDKWEMERTDITMKHKLGGGQYGEVYVGVWKKYSLTVAVKTLKEDTMEVEEFLKEAA VMKEIKHPNLVQLLGVCTLEPPFYIVTEYMPYGNLLDYLRECNREEVTAVVLLYMATQ ISSAMEYLEKKNFIHRDLAARNCLVGENHVVKVADFGLSRLMTGDTYTAHAGAKFPIK WTAPESLAYNTFSIKSDVWAFGVLLWEIATYGMSPYPGIDLSQVYDLLEKGYRMEQPE GCPPKVYELMRACWKWSPADRPSFAETHQAFETMFHDSSISEEVAEELGRAASSSSVV PYLPRLPILPSKTRTLKKQVENKENIEGAQDATENSASSLAPGFIRGAQASSGSPALP RKQRDKSPSSLLEDAKETCFTRDRKGGFFSSFMKKRNAPTPPKRSSSFREMENQPHKK YELTGNFSSVASLQHADGFSFTPAQQEANLVPPKCYGGSFAQRNLCNDDGGGGGGSGT AGGGWSGITGFFTPRLIKKTLGLRAGKPTASDDTSKPFPRSNSTSSMSSGLPEQDRMA MTLPRNCQRSKLQLERTVSTSSQPEENVDRANDMLPKKSEESAAPSRERPKAKLLPRG ATALPLRTPSGDLAITEKDPPGVGVAGVAAAPKGKEKNGGARLGMAGVPEDGEQPGWP SPAKAAPVLPTTHNHKVPVLISPTLKHTPADVQLIGTDSQGNKFKLLSEHQVTSSGDK DRPRRVKPKCAPPPPPVMRLLQHPSICSDPTEEPTALTAGQSTSETQEGGKKAALGAV PISGKAGRPVMPPPQVPLPTSSISPAKMANGTAGTKVALRKTKQAAEKISADKISKEA LLECADLLSSALTEPVPNSQLVDTGHQLLDYCSGYVDCIPQTRNKFAFREAVSKLELS LQELQVSSAAAGVPGTNPVLNNLLSCVQEISDVVQR" misc_feature 1045..1800 /note="tyrosine kinase domain of protein" BASE COUNT 1018 a 983 c 1059 g 789 t ORIGIN 1 aaaagcagaa tctgtgagtc gcctggaggc agcgcggcgg ctgccgtgag gaggccgggt 61 gcggagccgc cggtggccca gccgctcagg gccagggcct gggctgggag ggagagaccg 121 gagcagcgcc aggagcccga ggccggagcc gaggaggaat gtgaccaggg gtcggcgggg 181 gcgcgggagt acgcgagagc agggatgggg cagcaggtgg gccgcgtcgg ggaagctccg 241 gggctccagc agcctcagcc ccgcgggatc cggggcagca gtgcagccag gccctccggc 301 cgcaggcggg acccggcggg gcgcaccaca gagaccggct tcaatatctt cacccagcat 361 gatcactttg ccagctgtgt ggaggatgga tttgagggag acaagactgg aggcagtagt 421 ccagaagctt tgcatcgtcc ctatggttgt gatgttgaac cccaggcact aaatgaggct 481 atcaggtgga gctccaagga gaacttgctc ggagccactg agagtgaccc taatctcttc 541 gttgcacttt atgattttgt agcaagtggt gataacacac tcagcatcac taaaggtgaa 601 aagctacgag tccttggtta caaccagaat ggtgagtgga gtgaagttcg ctctaagaat 661 gggcagggct gggtgccaag caactacatc accccagtga acagcctgga aaaacactcc 721 tggtaccatg gacctgtgtc acgcagtgca gctgagtatc tgctcagcag tctaatcaat 781 ggcagcttcc tggtgcgaga aagtgagagt agccctgggc agctgtccat ctcgctcagg 841 tacgagggac gtgtgtatca ctacaggatc aataccactg cagatggcaa ggtgtatgtg 901 actgctgaga gccgcttcag caccttggca gagcttgtac accatcactc cacagtggct 961 gatgggctgg tgacaacatt acactaccca gcacccaagt gtaataagcc tacagtctat 1021 ggtgtgtccc ccatccacga caaatgggaa atggagcgaa cagatattac catgaagcac 1081 aaacttgggg gcggtcagta tggagaggtt tacgttggcg tctggaagaa atacagcctt 1141 acagttgctg tgaaaacatt gaaggaagat accatggagg tagaagaatt cctgaaagaa 1201 gctgcagtaa tgaaggaaat caagcatcct aatctggtac aacttttagg tgtgtgtact 1261 ttggagccac cattttacat tgtgactgaa tacatgccat acgggaattt gctggattac 1321 ctccgagaat gcaaccgaga agaggtgact gcagttgtgc tgctctacat ggccactcag 1381 atttcttctg caatggagta cttagagaag aagaatttca tccatagaga tcttgcagct 1441 cgtaactgcc tagtgggaga aaaccatgtg gtaaaagtgg ctgactttgg cttaagtaga 1501 ttgatgactg gagacactta tactgctcat gctggagcca aatttcctat taagtggaca 1561 gcaccagaga gtcttgccta caataccttc tcaattaaat ctgacgtctg ggcttttggg 1621 gtattgttgt gggaaattgc tacctatgga atgtcaccat atccaggtat tgacctgtct 1681 caggtctatg acctactaga aaaaggatat cgaatggaac agcctgaggg atgcccccct 1741 aaggtttatg aacttatgag agcatgctgg aagtggagcc ctgccgatag gccctctttt 1801 gctgaaacac accaagcttt tgaaaccatg ttccatgact ccagcatttc tgaagaggta 1861 gctgaggagc ttgggagagc cgcctcctcg tcatctgttg ttccatacct gccccggcta 1921 cctatacttc cttccaagac tcggacactg aagaaacagg tggagaacaa ggagaacatt 1981 gaaggggcac aagatgccac agaaaattct gcttccagtt tagcaccagg gttcatcaga 2041 ggtgcacagg cctctagtgg atccccagca ctgcctcgaa agcaaagaga caagtcaccc 2101 agcagcctct tggaagatgc caaagagaca tgcttcacca gggataggaa ggggggcttc 2161 ttcagctcct tcatgaagaa gagaaatgct cctacacccc ccaaacgcag cagctccttc 2221 cgagaaatgg agaatcagcc ccataagaaa tacgaactca cgggtaactt ctcatctgtt 2281 gcttctctac agcatgctga tgggttctct ttcactcctg cccagcaaga ggcgaatctg 2341 gtgccaccca agtgctatgg ggggagcttt gcacagagga acctctgtaa tgacgacggt 2401 ggtgggggtg ggggcagtgg cactgctggg ggtgggtggt ctggcatcac aggcttcttt 2461 acaccacgct taatcaaaaa gacactgggc ttacgagcag gtaaacccac agccagtgat 2521 gacacttcca agccttttcc aaggtcaaac tctacatctt ccatgtcctc agggcttcca 2581 gagcaggata ggatggcaat gacccttccc aggaactgcc agaggtccaa actccagctg 2641 gaaaggacag tgtccacctc ttctcagcca gaagagaatg tggacagggc caatgacatg 2701 cttccaaaaa aatcagagga aagtgctgct ccaagcaggg agagaccaaa agccaagtta 2761 ttgcccagag gagccacagc tcttcctctc agaacaccct ctggggatct agccattaca 2821 gagaaggacc ctccaggggt gggagtggct ggagtggcag ctgcccccaa gggtaaagag 2881 aagaatggtg gggcacgact tgggatggct ggagttccag aggatggaga gcagccgggc 2941 tggccttctc cagccaaggc tgcccccgtc ctcccaacca ctcacaacca caaagtgcca 3001 gtccttatct cacccactct gaaacacact ccagctgacg tgcagctcat tggcacagac 3061 tctcagggga ataaattcaa gctcttatct gagcatcagg tcacatcctc tggagacaag 3121 gaccgacccc gacgggtaaa accaaagtgt gccccacccc caccaccagt gatgagacta 3181 ctgcagcatc cgtccatctg ctcagaccct acagaagagc caactgccct aactgcagga 3241 cagtccacat cagaaacaca ggaaggagga aagaaggcag ctctgggcgc agtgcccatc 3301 agtgggaaag ctgggaggcc agtgatgcct ccacctcaag tgcctctgcc cacatcttcc 3361 atctcgccag ccaaaatggc caatggcaca gcaggtacta aagtggctct gagaaaaacc 3421 aaacaggccg ctgagaaaat ctcagcagac aaaatcagca aagaggccct gctggaatgt 3481 gctgacctac tgtccagtgc actcacggaa cctgtgccca acagccagct ggtagacact 3541 ggacaccagc tgcttgacta ctgctcaggc tatgtggact gcatccctca aactcgcaac 3601 aaatttgcct tccgagaggc tgtgagcaaa ctggaactca gcctgcagga gctacaggtt 3661 tcttcagcag ctgctggtgt gcccgggaca aaccctgtcc ttaataactt attgtcatgt 3721 gtacaggaaa tcagtgatgt ggtgcagagg tagccactgt tagcctggtg ggaaaatgca 3781 cacatttctg aggggagagg gaaaaggact tgttttcctg tgttcttgtt ttcagaaaat 3841 gaaagactc // LOCUS HUMARGL 1445 bp mRNA PRI 31-OCT-1994 DEFINITION Human liver arginase mRNA, complete cds. ACCESSION M14502 NID g178994 KEYWORDS arginase; liver arginase. SOURCE Human liver, cDNA to mRNA, library of G.A.Ricca, clone lambda-phARG6; primer library of Y.Ebina, clone lambda-hERG109. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1445) AUTHORS Haraguchi,Y., Takiguchi,M., Amaya,Y., Kawamoto,S., Matsuda,I. and Mori,M. TITLE Molecular cloning and nucleotide sequence of cDNA for human liver arginase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84 (2), 412-415 (1987) MEDLINE 87092419 COMMENT Clean copy sequence for [1] kindly provided by M.Mori, 18-FEB-1987. FEATURES Location/Qualifiers source 1..1445 /organism="Homo sapiens" /db_xref="taxon:9606" /map="6q23" mRNA <1..1445 /note="arg mRNA" gene 57..1025 /gene="ARG1" CDS 57..1025 /gene="ARG1" /note="arginase (EC 3.5.3.1)" /codon_start=1 /db_xref="GDB:G00-119-006" /db_xref="PID:g178995" /translation="MSAKSRTIGIIGAPFSKGQPRGGVEEGPTVLRKAGLLEKLKEQE CDVKDYGDLPFADIPNDSPFQIVKNPRSVGKASEQLAGKVAQVKKNGRISLVLGGDHS LAIGSISGHARVHPDLGVIWVDAHTDINTPLTTTSGNLHGQPVSFLLKELKGKIPDVP GFSWVTPCISAKDIVYIGLRDVDPGEHYILKTLGIKYFSMTEVDRLGIGKVMEETLSY LLGRKKRPIHLSFDVDGLDPSFTPATGTPVVGGLTYREGLYITEEIYKTGLLSGLDIM EVNPSLGKTPEEVTRTVNTAVAITLACFGLAREGNHKPIDYLNPPK" BASE COUNT 465 a 287 c 328 g 365 t ORIGIN 12 bp upstream of HincII site; chromosome 6q23. 1 tcactgaggg ttgactgact ggagagctca agtgcagcaa agagaagtgt cagagcatga 61 gcgccaagtc cagaaccata gggattattg gagctccttt ctcaaaggga cagccacgag 121 gaggggtgga agaaggccct acagtattga gaaaggctgg tctgcttgag aaacttaaag 181 aacaagagtg tgatgtgaag gattatgggg acctgccctt tgctgacatc cctaatgaca 241 gtccctttca aattgtgaag aatccaaggt ctgtgggaaa agcaagcgag cagctggctg 301 gcaaggtggc acaagtcaag aagaacggaa gaatcagcct ggtgctgggc ggagaccaca 361 gtttggcaat tggaagcatc tctggccatg ccagggtcca ccctgatctt ggagtcatct 421 gggtggatgc tcacactgat atcaacactc cactgacaac cacaagtgga aacttgcatg 481 gacaacctgt atctttcctc ctgaaggaac taaaaggaaa gattcccgat gtgccaggat 541 tctcctgggt gactccctgt atatctgcca aggatattgt gtatattggc ttgagagacg 601 tggaccctgg ggaacactac attttgaaaa ctctaggcat taaatacttt tcaatgactg 661 aagtggacag actaggaatt ggcaaggtga tggaagaaac actcagctat ctactaggaa 721 gaaagaaaag gccaattcat ctaagttttg atgttgacgg actggaccca tctttcacac 781 cagctactgg cacaccagtc gtgggaggtc tgacatacag agaaggtctc tacatcacag 841 aagaaatcta caaaacaggg ctactctcag gattagatat aatggaagtg aacccatccc 901 tggggaagac accagaagaa gtaactcgaa cagtgaacac agcagttgca ataaccttgg 961 cttgtttcgg acttgctcgg gagggtaatc acaagcctat tgactacctt aacccaccta 1021 agtaaatgtg gaaacatccg atataaatct catagttaat ggcataatta gaaagctaat 1081 cattttctta agcatagagt tatccttcta aagacttgtt ctttcagaaa aatgtttttc 1141 caattagtat aaactctaca aattccctct tggtgtaaaa ttcaagatgt ggaaattcta 1201 acttttttga aatttaaaag cttatatttt ctaacttggc aaaagactta tccttagaaa 1261 gagaagtgta cattgatttc caattaaaaa tttgctggca ttaaaaataa gcacacttac 1321 ataagccccc atacatagag tgggactctt ggaatcagga gacaaagcta ccacatgtgg 1381 aaaggtacta tgtgtccatg tcattcaaaa aatgtgattt tttataataa actctttata 1441 acaag // LOCUS HUMARGNP 2736 bp mRNA PRI 16-OCT-1991 DEFINITION Human arginine-rich nuclear protein mRNA, complete cds. ACCESSION M74002 NID g178996 KEYWORDS U1-snRNP; arginine/serine-rich; spliceosome; transformer protein. SOURCE Homo sapiens (library: lambda gt11 (Mike Mueckler)) liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2736) AUTHORS Chaudhary,N., McMahon,C. and Blobel,G. TITLE Primary structure of a human arginine-rich nuclear protein that colocalizes with splicesome components JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88, 8189-8193 (1991) MEDLINE 91376109 FEATURES Location/Qualifiers source 1..2736 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HepG2 cells" /haplotype="2N" /tissue_type="liver" /tissue_lib="lambda gt11 (Mike Mueckler)" CDS 84..1538 /note="Gene product is 54 kDa but migrates aberrantly on SDS gels as a ~70 kDa protein.; In mammalian cells, the protein colocalizes with many components of the pre-mRNA splicing machinery in the nucleus." /codon_start=1 /product="arginine-rich nuclear protein" /db_xref="PID:g178997" /translation="MSNTTVVPSTAGPGPSGGPGGGGGGGGGGGGTEVIQVTNVSPSA SSEQMRTLFGFLGKIDELRLFPPDDSPLPVSSRVCFVKFHDPDSAVVAQHLTNTVFVD RALIVVPYAEGVIPDEAKALSLLAPANAVAGLLPGGGLLPTPNPLTQIGAVPLAALGA PTLDPALAALGLPGANLNSQSLAADQLLKLMSTVDPKLNHVAAGLVSPSLKSDTSSKE IEEAMKRVREAQSLISAAIEPDKKEEKRRHSRSRSRSRRRRTPSSSRHRRSRSRSRRR SHSKSRSRRRSKSPRRRRSHSRERGRRSRSTSKTRDKKKEDKEKKRSKTPPKSYSTAR RSRSASRERRRRRSRSGTRSPKKPRSPKRKLSRSPSPRRHKKEKKKDKDKERSRDERE RSTSKKKKSKDKEKDRERKSESDKDVKQVTRDYDEEEQGYDSEKEKKEEKKPIETGSP KTKECSVEKGTGDSLRESKVNGDDHHEEDMDMSD" polyA_signal 2718..2723 BASE COUNT 867 a 521 c 608 g 740 t ORIGIN 1 gagagaagcc ttttccgttg ccggtgccgg cctagcgtcc tggaattact tcaatcaaca 61 ggagcgagaa cccgagcagc gccatgagca acactaccgt cgtccccagc actgcaggtc 121 cgggccccag cggcgggccc ggtggcggag gtggtggtgg cggcggaggc ggcggcaccg 181 aggtaatcca ggtgactaat gtctccccga gcgctagctc tgagcagatg cggactctct 241 tcggtttcct aggcaagatc gacgaactgc gcctcttccc gccggatgat tcgcctttgc 301 cagtctcatc tcgtgtctgc tttgttaagt tccatgatcc agactcagca gttgtggcac 361 agcatctgac aaacactgta ttcgttgaca gagctttgat agtcgtacca tatgcagaag 421 gagttattcc tgatgaagct aaagctttgt ctctgttggc accagctaat gcagtggcag 481 gtcttctgcc tggtggtgga ctcctgccta ctcctaaccc acttacccag attggcgctg 541 ttccactggc tgctttgggg gctcctactc ttgatcctgc ccttgctgca cttgggcttc 601 ctggagcaaa cttgaactct cagtctcttg ctgcagatca gttgctgaag cttatgagta 661 ctgttgatcc caagttgaat catgtagctg ctggtctcgt ttcaccaagt ctgaaatcgg 721 atacctctag taaagaaata gaggaagcta tgaaaagagt acgagaagca cagtccctaa 781 tttctgctgc tatagaacca gataagaaag aagaaaaaag aaggcattca agatcaagat 841 cacgttctag gaggaggagg actccctcat cttctagaca caggcggtca agaagcagat 901 cgagacggcg gtcacattct aagtctagga gtcggcgacg atccaaaagc ccaaggcgga 961 gaagatctca ttccagagaa agaggtagaa ggtcaaggag cacatcaaaa acaagagaca 1021 aaaagaaaga agacaaagaa aagaaacgtt ctaaaacacc accaaaaagt tacagcacag 1081 ccagacgttc tagaagtgca agcagagaga gacgacgacg aagaagcagg agtggcacaa 1141 gatctcctaa aaagcctcgg tctcctaaaa gaaaattgtc ccgctcacca tcccctagga 1201 gacataaaaa ggagaagaag aaagataaag acaaagaaag aagtagggat gaaagagaac 1261 gatcaacaag caagaagaag aagagtaaag ataaggaaaa ggaccgggaa agaaaatcag 1321 agagtgataa agatgtaaaa caggttacac gggattatga tgaagaggaa caggggtatg 1381 acagtgagaa agagaaaaaa gaagagaaga aaccaataga aacaggttcc cctaaaacaa 1441 aggaatgttc tgtggaaaag ggaactggtg attcactaag agaatccaaa gtgaatgggg 1501 atgatcatca tgaagaagac atggatatga gtgactgaat attgcctctg agggagtcca 1561 actgtatacc tgcatcagtg tcattccttt gtgtgatttc ttaatgctgt atttgttcat 1621 ctcaaaccta gatgtataca gctctgagtt ataaatggtt ataaagctcc tgttactcat 1681 attagttatt tacatcaaaa agcttttaga aaatggtacg aggtaaccaa ttcttgtcat 1741 ggtgaaatct gattgagtaa ccaagcagtt ttactattct ggtgctgctt cataacaaaa 1801 atgaaaagct gcatgcatct acagcaggca tggattgttt atgtcgtatg atatccttta 1861 ttaagtaagt tcacttatag tatttctata atttgattca ttgccgtaat agagccatgt 1921 aggaaatgca ctgattgcat gttattgtgg caagaatatc ctaaatgtca ttaaaatcct 1981 ccaacatgat ggatctactt atggtcttgt ttgttgacat gacaaattaa cattcttata 2041 gttacatctg gaaatgagca tttgaaatag ataatccttt aagccttgtg gcaaaatttt 2101 tgtggctttt gtttaacttt gaaaggttat tatgcactaa ccttttttgg tggctaatta 2161 gggtttaaat acagaaacaa gatttcaaat aaaactgtct ttgggcagtg agtaaatagc 2221 atattttgaa gtagagttgt atactttttc ataagatgtt tgggaatttt tttcctgaag 2281 taataattta ttcccacatc tacatcagtg aaagctatct acctatcctg agtctatctt 2341 aaaggaaaaa aagaaaaaaa ccttatctct tgcccttatt ttgaattttc cactctttca 2401 ttaatttgtt ttaagctcct gttggaaaaa aaggggtagt gcattttaaa ttgaccttca 2461 tacgctttta aaataagaca aatctacttg ataatgtacc tttatttgat ctcaagttgt 2521 ataaaaccaa taaatttgtg ttactgcagt agtaatctta tgcacacggt gatttcatgt 2581 tatatatgca aagtaggcaa ctgttttctt agttacagaa gtttcaagct tcacttttgt 2641 gcagtagaaa caaaagtagg ctacagtctg tgccatgttg atgtacagtt tctgaaattg 2701 ttttacaaga ctttgataat aaaaccctta aactta // LOCUS HUMARH 1365 bp mRNA PRI 21-SEP-1993 DEFINITION Human ADP-ribosylarginine hydrolase mRNA, complete cds. ACCESSION L13291 NID g402477 KEYWORDS ADP-ribosylarginine hydrolase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1365) AUTHORS Takada,T., Iida,K. and Moss,J. TITLE Cloning and site-directed mutagenesis of human ADP-ribosylarginine hydrolase JOURNAL J. Biol. Chem. 268, 17837-17843 (1993) MEDLINE 93352593 FEATURES Location/Qualifiers source 1..1365 /organism="Homo sapiens" /db_xref="taxon:9606" 5'UTR 1..105 /evidence=experimental CDS 106..1179 /codon_start=1 /evidence=experimental /product="ADP-ribosylarginine hydrolase" /db_xref="PID:g402478" /translation="MEKYVAAMVLSAAGDALGYYNGKWEFLQDGEKIHRQLAQLGGLD ALDVGRWRVSDDTVMHLATAEALVEAGKAPKLTQLYYLLAKHYQDCMEDMDGRAPGGA SVHNAMQLKPGKPNGWRIPFNSHEGGCGAAMRAMCIGLRFPHHSQLDTLIQVSIESGR MTHHHPTGYLGALASALFTAYAVNSRPPLQWGKGLMELLPEAKKYIVQSGYFVEENLQ HWSYFQTKWENYLKLRGILDGESAPTFPESFGVKERDQFYTSLSYSGWGGSSGHDAPM IAYDAVLAAGDSWKELAHRAFFHGGDSDSTAAIAGCWWGVMYGFKGVSPSNYEKLEYR NRLEETARALYSLGSKEDTVISL" 3'UTR 1180..1365 /evidence=experimental BASE COUNT 320 a 329 c 362 g 354 t ORIGIN 1 agagtcgttg ttttttctga ttcttcttgc tcctcatcct atttcctttg tcttaatttc 61 cccacaaaga caccctctcc agagcccagc aattgtgagg gactgatgga gaagtatgtg 121 gctgctatgg tgctgagtgc agctggagat gccctggggt actacaatgg gaagtgggag 181 ttcctccagg atggggagaa gatacaccgg cagttggccc agctgggcgg cttggatgcc 241 ctagacgtgg gaaggtggag agttagtgac gacacagtga tgcacttggc cacagcagaa 301 gctcttgtgg aagctgggaa agcccctaag ttgactcaac tgtattacct ccttgctaag 361 cattaccaag actgcatgga agacatggat gggcgggcac caggtggtgc ctcggtgcac 421 aacgccatgc agctgaagcc gggcaagccc aatggctgga ggattccctt caacagccat 481 gagggcggct gtggggctgc catgcgggcc atgtgcatcg gtctcaggtt cccacaccat 541 agccaactgg acacactgat ccaagtgagc atcgagagtg gtcggatgac ccaccaccac 601 ccaacaggct acctgggggc ccttgcgtct gctcttttta cagcctatgc tgtgaatagc 661 agaccaccct tgcagtgggg aaaaggactg atggagctgc taccagaagc taaaaagtac 721 attgtccaat caggctactt tgtagaggaa aatcttcaac actggtccta cttccaaacc 781 aaatgggaaa attacctaaa acttagaggg attttggatg gagaatcagc ccctaccttc 841 cctgagtctt tcggtgtgaa ggagagggat cagttctaca cctccctgag ctactctggc 901 tggggtggca gcagtgggca cgatgccccc atgattgcct acgatgctgt tcttgctgca 961 ggagactcct ggaaggagct tgcccaccga gcctttttcc atggtggaga cagtgattct 1021 acagctgcca ttgctggctg ctggtgggga gttatgtatg gttttaaagg agtgagtccc 1081 tccaactatg agaaactaga atacagaaac cggctggaag agacagctag ggctttatat 1141 tctctcgggt caaaagaaga cactgtaatt tccctttagg gagacgtgat gttcacttct 1201 gatggattct tcttttgtgt atttcctttt ctgctatttc ttttcagttt ttccaaagtc 1261 aagagtctta accttgtact cagggaattt tgagataaca agtcccttgg gcaccttaag 1321 ctcagttttt tcaggctcat cctgttcttc cagaatctat ccctt // LOCUS HUMARL1A 1008 bp mRNA PRI 13-JAN-1995 DEFINITION Homo sapiens ARL1 mRNA, complete cds. ACCESSION L28997 NID g607027 KEYWORDS ADP-ribosylation factor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1008) AUTHORS Lee,F.-J.S., Stevens,L.A., Kao,Y.L., Moss,J. and Vaughan,M. TITLE Molecular characterization of a human ADP-Ribosylation factor-like gene (hARL1) JOURNAL Unpublished (1994) REFERENCE 2 (bases 1 to 1008) AUTHORS Zhang,G.F., Patton,W.A., Lee,F.J., Liyanage,M., Han,J.S., Rhee,S.G., Moss,J. and Vaughan,M. TITLE Different ARF domains are required for the activation of cholera toxin and phospholipase D JOURNAL J. Biol. Chem. 270 (1), 21-24 (1995) MEDLINE 95113826 FEATURES Location/Qualifiers source 1..1008 /organism="Homo sapiens" /db_xref="taxon:9606" gene 145..690 /gene="ARL1" CDS 145..690 /gene="ARL1" /note="putative" /codon_start=1 /db_xref="PID:g607028" /translation="MGGFFSSIFSSLFGTREMRILILGLDGAGKTTILYRLQVGEVVT TIPTIGFNVETVTYKNLKFQVWDLGGQTSIRPYWRCYYSNTDAVIYVVDSCDRDRIGI SKSELVAMLEEEELRKAILVVFANKQDMEQAMTSSEMANSLGLPALKDRKWQIFKTSA TKGTGLDEAMEWLVETLKSRQ" polyA_site 1008 BASE COUNT 314 a 174 c 237 g 283 t ORIGIN 1 aattcggtac cccgtagcga ccggcgctca gctggaattc gggggaagtt gctggctgac 61 tgggcttgcg aggaaaccgc ctcggagctg cagcgaaggc caaggaatca ctgaagatcg 121 gcgagggagg acagggggtt catcatgggt ggctttttct caagtatatt ttccagtctg 181 tttggaactc gggaaatgag aattttaatt ttgggattag atggagcagg aaaaaccaca 241 attttgtaca gattacaagt gggagaagtt gttactacta tacctaccat tggatttaat 301 gtagagacgg tgacgtacaa aaaccttaaa ttccaagtct gggatttagg aggacagaca 361 agtatcaggc catactggag atgttactat tcaaacacag atgcagtcat ttatgtagta 421 gacagttgtg accgagaccg aattggcatt tccaaatcag agttagttgc catgttggag 481 gaagaagagc tgagaaaagc cattttagtg gtgtttgcaa ataaacagga catggaacag 541 gccatgactt cctcagagat ggcaaattca cttgggttac ctgccttgaa ggaccgaaaa 601 tggcagatat tcaaaacgtc agcaaccaaa ggcaccggcc ttgatgaggc aatggaatgg 661 ttagttgaaa cattaaaaag cagacagtaa ttcagtccat tcttctcccc tgaaatgaag 721 actacatcac ctctctccct ttggaaacag tcaagtgtac ttcacactac tagatgttaa 781 aactatatga ttattggcat atactgactg actgcaatat ttgtagtaaa tagggaaaat 841 aagtatttag ttggagggat aatttgatcg aatcacctga atgttctatg taatgtaaaa 901 tattcttttc ttgctttctt gtgttaaggt atatattcta tttgtatgga attcttattc 961 aaatacagtt gtattaaaga gtatactcct attggatgaa aaaaacct // LOCUS HUMARM 2453 bp mRNA PRI 15-DEC-1988 DEFINITION Human aromatase (Aro1) mRNA, complete cds. ACCESSION M18856 NID g178999 KEYWORDS aromatase. SOURCE Human placenta, cDNA to mRNA, clone JM109. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2453) AUTHORS Chen,S., Besman,M.J., Sparkes,R.S., Zollman,S., Klisak,I.J., Mohandas,T.K., Hall,P.F. and Shively,J.E. TITLE Human aromatase: cDNA cloning, southern blot analysis, and assignment of the gene to chromosome 15 JOURNAL DNA 7, 27-38 (1988) MEDLINE 88166351 COMMENT Draft entry and printed copy of sequence for [1] kindly provided by S.Chen, 25-APR-1988. FEATURES Location/Qualifiers source 1..2453 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1..1260 /note="aromatase" /codon_start=1 /db_xref="PID:g179000" /translation="MRVWISGEETLIISKSSSMFHIMKHNHYSSRFGSKLGLQCIGMH EKGIIFNNNPELWKTTRPFFMKALSGPGLVRMVTVCAESLKTHLDRLEEVTNESGYVD VLTLLRRVMLDTSNTLFLRIPLDESAIVVKIQGYFDAWQALLIKPDIFFKISWLYKKY EKSVKDLKDAIEVLIAEKRRRISTEEKLEECMDFATELILAEKRGDLTRENVNQCILE MLIAAPDTMSVSLFFMLFLIAKHPNVEEAIIKEIQTVIGERDIKIDDIQKLKVMENFI YESMRYQPVVDLVMRKALEDDVIDGYPVKKGTNIILNIGRMHRLEFFPKPNEFTLENF AKNVPYRYFQPFGFGPRGCAGKYIAMVMMKAILVTLLRRFHVKTLQGQCVESIQKIHD LSLHPDETKNMLEMIFTPRNSDRCLEH" BASE COUNT 733 a 518 c 527 g 675 t ORIGIN 362 bp upstream of BamHI site; chromosome 15q21.1. 1 atgcgagtct ggatctctgg agaggaaaca ctcattatca gcaagtcctc aagtatgttc 61 cacataatga agcacaatca ttacagctct cgattcggca gcaaacttgg gctgcagtgc 121 atcggtatgc atgagaaagg catcatattt aacaacaatc cagagctctg gaaaacaact 181 cgacccttct ttatgaaagc tctgtcaggc cccggccttg ttcgtatggt cacagtctgt 241 gctgaatccc tcaaaacaca tctggacagg ttggaggagg tgaccaatga atcgggctat 301 gtggacgtgt tgacccttct gcgtcgtgtc atgctggaca cctctaacac gctcttcttg 361 aggatccctt tggacgaaag tgctatcgtg gttaaaatcc aagggtattt tgatgcatgg 421 caagctctcc tcatcaaacc agacatcttc tttaagattt cttggctata caaaaagtat 481 gagaagtctg tcaaggattt gaaagatgcc atagaagttc tgatagcaga aaaaagacgc 541 aggatttcca cagaagagaa actggaagaa tgtatggact ttgccactga gttgatttta 601 gcagagaaac gtggtgacct gacaagagag aatgtgaacc agtgcatatt ggaaatgctg 661 atcgcagctc ctgacaccat gtctgtctct ttgttcttca tgctatttct cattgcaaag 721 caccctaatg ttgaagaggc aataataaag gaaatccaga ctgttattgg tgagagagac 781 ataaagattg atgatataca aaaattaaaa gtgatggaaa acttcattta tgagagcatg 841 cggtaccagc ctgtcgtgga cttggtcatg cgcaaagcct tagaagatga tgtaatcgat 901 ggctacccag tgaaaaaggg gacaaacatt atcctgaata ttggaaggat gcacagactc 961 gagtttttcc ccaaacccaa tgaatttact cttgaaaatt ttgcaaagaa tgttccttat 1021 aggtactttc agccatttgg ctttgggccc cgtggctgtg caggaaagta catcgccatg 1081 gtgatgatga aagccatcct cgttacactt ctgagacgat tccacgtgaa gacattgcaa 1141 ggacagtgtg ttgagagcat acagaagata cacgacttgt ccttgcaccc agatgagact 1201 aaaaacatgc tggaaatgat ctttacccca agaaactcag acaggtgtct ggaacactag 1261 agaaggctgg tcagtaccta ctctggagca tttctcatca gtagttcaca tacaaatcat 1321 ccatccttgc caatagtgtc atcctcacag tgaacactca gtgcccatgg cattttatag 1381 gcatacctcc tatgggttgt caccaagcta ggtgctattg gtcatctgct cctgttcaca 1441 ccagagaacc aggctacaag agaaaaagca gaggccaaga gtttgaggga gaaatagtcg 1501 gtgaagaaac cgtatccata aagacccgag gccaccaaat tttgtttgag aaggataggc 1561 cttcattaac aaaatgtatg tctggttccc cagtagagct ctactgcctc aacccaaggg 1621 gatttttatg tctggggcag aaacactcaa gttgattaga aagaccaggc caatgtcagg 1681 gtacctgggg ccaaacccac ctgctagtgt gaattaaagt actttaattt tgttttctgt 1741 ggaggtggaa aagcaacatt catagtcttt ggagaaatgc ttacaaattc agcatttggc 1801 ccttgctgtg aattaagccc aattaattcc tgtttgtcta catatgatct gtctgtggca 1861 aaagtttaat cagaggaaat tcttgggcag tctgtcgatt tatgcctcag ccacttgcct 1921 gtgctacgat tcattgtgtt acctgtagat tcaggtaata caaactatat ataatcatca 1981 agtaatacaa actaatttag taatagcctg ggtaagtatt attagggccc tgtgtctgct 2041 gtagaaaaaa aaattcacat gatgcacttc aaattcaaat aaaaatcctt ttggcatgtt 2101 cccatttttg cttagctcaa ttagtgtggc taaccaagag ataactgtaa atgtgacatt 2161 gatttgctct tactacagct tcagtgattg ggggaggaaa agtcccaacc caatgggctc 2221 aaacttctaa ggggtactcc tctcatcccc ttatccttct ccctcgacat tttctccctc 2281 tttcttccca tgaccccaaa gcaagggcaa cagatcagta aagaacgtgg tcagagtaga 2341 acccctgaag tattttttaa tcctacctca aaatttaaca gttacctgag agatttaaca 2401 ttatctagtt cattgaatca ttgtatgtgg tcatggataa attgcacacc ttg // LOCUS HUMARNTA 2616 bp mRNA PRI 31-OCT-1994 DEFINITION Human aryl hydrocarbon receptor nuclear translocator (ARNT) mRNA, complete cds. ACCESSION M69238 NID g179003 KEYWORDS aryl hydrocarbon receptor nuclear translocator; translocation. SOURCE Homo sapiens Liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2616) AUTHORS Hoffman,E.C., Reyes,H., Chu,F.F., Sander,F., Conley,L.H., Brooks,B.A. and Hankinson,O. TITLE Cloning of a factor required for activity of the Ah (dioxin) receptor JOURNAL Science 252 (5008), 954-958 (1991) MEDLINE 91240280 FEATURES Location/Qualifiers source 1..2616 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HepG2" /cell_type="Hepatocyte" /tissue_type="Liver" /map="1p36-q12" gene 57..2426 /gene="ARNT" CDS 57..2426 /gene="ARNT" /codon_start=1 /function="required for Ah receptor function, affects translocation" /db_xref="GDB:G00-119-701" /evidence=experimental /product="Arnt" /db_xref="PID:g179004" /translation="MAATTANPEMTSDVPSLGPAIASGNSGPGIQGGGAIVQRAIKRR PGLDFDDDGEGNSKFLRCDDDQMSNDKERFARSDDEQSSADKERLARENHSEIERRRR NKMTAYITELSDMVPTCSALARKPDKLTILRMAVSHMKSLRGTGNTSTDGSYKPSFLT DQELKHLILEAADGFLFIVSCETGRVVYVSDSVTPVLNQPQSEWFGSTLYDQVHPDDV DKLREQLSTSENALTGRILDLKTGTVKKEGQQSSMRMCMGSRRSFICRMRCGSSSVDP VSVNRLSFVRNRCRNGLGSVKDGEPHFVVVHCTGYIKAWPPAGVSLPDDDPEAGQGSK FCLVAIGRLQVTSSPNCTDMSNVCQPTEFISRHNIEGIFTFVDHRCVATVGYQPQELL GKNIVEFCHPEDQQLLRDSFQQVVKLKGQVLSVMFRFRSKNQEWLWMRTSSFTFQNPY SDEIEYIICTNTNVKNSSQEPRPTLSNTIQRPQLGPTANLPLEMGSGQLAPRQQQQQT ELDMVPGRDGLASYNHSQVVQPVTTTGPEHSKPLEKSDGLFAQDRDPRFSEIYHNINA DQSKGISSSTVPATQQLFSQGNTFPPTPRPAENFRNSGLAPPVTIVQPSASAGQMLAQ ISRHSNPTQGATPTWTPTTRSGFSAQQVATQATAKTRTSQFGVGSFQTPSSFSSMSLP GAPTASPGAAAYPSLTNRGSNFAPETGQTAGQFQTRTAEGVGVWPQWQGQQPHHRSSS SEQHVQQPPAQQPGQPEVFQEMLSMLGDQSNSYNNEEFPDLTMFPPFSE" BASE COUNT 682 a 690 c 643 g 601 t ORIGIN 1 atggcggctc ctcccactgg ggggggggtg gcgcggcggc ggtggcatct gcggccatgg 61 cggcgactac tgccaacccc gaaatgacat cagatgtacc atcactgggt ccagccattg 121 cctctggaaa ctctggacct ggaattcaag gtggaggagc cattgtccag agggctatta 181 agcggcgacc agggctggat tttgatgatg atggagaagg gaacagtaaa tttttgaggt 241 gtgatgatga tcagatgtct aacgataagg agcggtttgc caggtcggat gatgagcaga 301 gctctgcgga taaagagaga cttgccaggg aaaatcacag tgaaattgaa cggcggcgac 361 ggaacaagat gacagcctac atcacagaac tgtcagatat ggtacccacc tgtagtgccc 421 tggctcgaaa accagacaag ctaaccatct tacgcatggc agtttctcac atgaagtcct 481 tgcggggaac tggcaacaca tccactgatg gctcctataa gccgtctttc ctcactgatc 541 aggaactgaa acatttgatc ttggaggcag cagatggctt tctgtttatt gtctcatgtg 601 agacaggcag ggtggtgtat gtgtctgact ccgtgactcc tgttttgaac cagccacagt 661 ctgaatggtt tggcagcaca ctctatgatc aggtgcaccc agatgatgtg gataaacttc 721 gtgagcagct ttccacttca gaaaatgccc tgacagggcg tatcctggat ctaaagactg 781 gaacagtgaa aaaggaaggt cagcagtctt ccatgagaat gtgtatgggc tcaaggagat 841 cgtttatttg ccgaatgagg tgtggcagta gctctgtgga cccagtttct gtgaataggc 901 tgagctttgt gaggaacaga tgcaggaatg gacttggctc tgtaaaggat ggggaacctc 961 acttcgtggt ggtccactgc acaggctaca tcaaggcctg gcccccagca ggtgtttccc 1021 tcccagatga tgacccagag gctggccagg gaagcaagtt ttgcctagtg gccattggca 1081 gattgcaggt aactagttct cccaactgta cagacatgag taatgtttgt caaccaacag 1141 agttcatctc ccgacacaac attgagggta tcttcacttt tgtggatcac cgctgtgtgg 1201 ctactgttgg ctaccagcca caggaactct taggaaagaa tattgtagaa ttctgtcatc 1261 ctgaagacca gcagcttcta agagacagct tccaacaggt agtgaaatta aaaggccaag 1321 tgctgtctgt catgttccgg ttccggtcta agaaccaaga atggctctgg atgagaacca 1381 gctcctttac tttccagaac ccttactcag atgaaattga gtacatcatc tgtaccaaca 1441 ccaatgtgaa gaactctagc caagaaccac ggcctacact ctccaacaca atccagaggc 1501 cacaactagg tcccacagct aatttacccc tggagatggg ctcaggacag ctggcaccca 1561 ggcagcagca acagcaaaca gaattggaca tggtaccagg aagagatgga ctggccagct 1621 acaatcattc ccaggtggtt cagcctgtga caaccacagg accagaacac agcaagcccc 1681 ttgagaagtc agatggttta tttgcccagg atagagatcc aagattttca gaaatctatc 1741 acaacatcaa tgcggatcag agtaaaggca tctcctccag cactgtccct gccacccaac 1801 agctattctc ccagggcaac acattccctc ctaccccccg gccggcagag aatttcagga 1861 atagtggcct agcccctcct gtaaccattg tccagccatc agcttctgca ggacagatgt 1921 tggcccagat ttcccgccac tccaacccca cccaaggagc aaccccaact tggaccccta 1981 ctacccgctc aggcttttct gcccagcagg tggctaccca ggctactgct aagactcgta 2041 cttcccagtt tggtgtgggc agctttcaga ctccatcctc cttcagctcc atgtccctcc 2101 ctggtgcccc aactgcatcg cctggtgctg ctgcctaccc tagtctcacc aatcgtggat 2161 ctaactttgc tcctgagact ggacagactg caggacaatt ccagacacgg acagcagagg 2221 gtgtgggtgt ctggccacag tggcagggcc agcagcctca tcatcgttca agttctagtg 2281 agcaacatgt tcaacaaccg ccagcacagc aacctggcca gcctgaggtc ttccaggaga 2341 tgctgtccat gctgggagat cagagcaaca gctacaacaa tgaagaattc cctgatctaa 2401 ctatgtttcc ccccttttca gaatagaact attggggtga ggataagggg tgggggagaa 2461 aaaatcactg tttgttttta aaaagcaaat ctttctgtaa acagaataaa agttcctctc 2521 ccttcccttc cctcacccct gacatgtacc ccctttccct tctggctgtt cccctgctct 2581 gttgcctcct aaggtaacat ttataaaaaa aaaaaa // LOCUS HUMARPR 1432 bp DNA PRI 31-OCT-1994 DEFINITION Human androgen receptor gene, 5' end and promoter region. ACCESSION M58158 NID g179025 KEYWORDS androgen receptor. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1432) AUTHORS Tilley,W.D., Marcelli,M. and McPhaul,M.J. TITLE Expression of the human androgen receptor gene utilizes a common promoter in diverse human tissues and cell lines JOURNAL J. Biol. Chem. 265 (23), 13776-13781 (1990) MEDLINE 90337992 FEATURES Location/Qualifiers source 1..1432 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Xq11.2-q12" mRNA 315..1432 /partial /gene="AR" /note="G00-120-556" /product="androgen receptor" gene 315..1432 /gene="AR" mRNA 323..1432 /partial /gene="AR" /note="G00-120-556" /product="androgen receptor" CDS 991..1017 /gene="AR" /note="ORF" /codon_start=1 /db_xref="PID:g179026" /translation="MKRQSGLQ" CDS 1430..1432 /partial /gene="AR" /codon_start=1 /db_xref="GDB:G00-120-556" /product="androgen receptor" BASE COUNT 317 a 441 c 402 g 272 t ORIGIN 1 ccaaatttcg tgagtgctgg cctccaggaa atctggagcc ctggcgccta aaccttggtt 61 taggaaagca ggagctattc aggaagcagg ggtcctccag ggctagagct agcctctcct 121 gccctcgccc acgctgcgcc agcacttgtt tctccaaagc cactaggcag gcgttagcgc 181 gcggtgaggg gaggggagaa aaggaaaggg gaggggaggg aaaaggaggt gggaaggcaa 241 ggaggccggc ccggtggggg cgggacccga ctcgcaaact gttgcatttg ctctccacct 301 cccagcgccc cctccgagat cccggggagc cagcttgctg ggagagcggg acggtccgga 361 gcaagcccac aggcagagga ggcgacagag ggaaaaaggg ccgagctagc cgctccagtg 421 ctgtacagga gccgaaggga cgcaccacgc cagccccagc ccggctccag cgacagccaa 481 cgcctcttgc agcgcggcgg cttcgaagcc gccgcccgga gctgcccttt cctcttcggt 541 gaagttttta aaagctgcta aagactcgga ggaagcaagg aaagtgcctg gtaggactga 601 cggctgcctt tgtcctcctc ctctccaccc cgcctccccc caccctgcct tccccccctc 661 ccccgtcttc tctcccgcag ctgcctcagt cggctactct cagccaaccc ccctcaccac 721 ccttctcccc acccgccccc ccgcccccgt cggcccagcg ctgccagccc gagtttgcag 781 agaggtaact ccctttggct gcgagcgggc gagctagctg cacattgcaa agaaggctct 841 taggagccag gcgactgggg agcggcttca gcactgcagc cacgacccgc ctggttaggc 901 tgcacgcgga gagaaccctc tgttttcccc cactctctct ccacctcctc ctgccttccc 961 caccccgagt gcggagccag agatcaaaag atgaaaaggc agtcaggtct tcagtagcca 1021 aaaaacaaaa caaacaaaaa caaaaaagcc gaaataaaag aaaaagataa taactcagtt 1081 cttatttgca cctacttcag tggacactga atttggaagg tggaggattt tgtttttttc 1141 ttttaagatc tgggcatctt ttgaatctac ccttcaagta ttaagagaca gactgtgagc 1201 ctagcagggc agatcttgtc caccgtgtgt cttcttctgc acgagacttt gaggctgtca 1261 gagcgctttt tgcgtggttg ctcccgcaag tttccttctc tggagcttcc cgcaggtggg 1321 cagctagctg cagcgactac cgcatcatca cagcctgttg aactcttctg agcaagagaa 1381 ggggaggcgg ggtaagggaa gtaggtggaa gattcagcca agctcaagga tg // LOCUS HUMARSBX 2802 bp mRNA PRI 31-OCT-1994 DEFINITION Human arylsulfatase B (ASB) mRNA, complete cds. ACCESSION M32373 NID g179029 KEYWORDS arylsulfatase; lysosomal hydrolase. SOURCE Human liver, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2802) AUTHORS Schuchman,E.H., Jackson,C.E. and Desnick,R.J. TITLE Human arylsulfatase B: MOPAC cloning, nucleotide sequence of a full-length cDNA, and regions of amino acid identity with arylsulfatases A and C JOURNAL Genomics 6 (1), 149-158 (1990) MEDLINE 90152677 FEATURES Location/Qualifiers source 1..2802 /organism="Homo sapiens" /db_xref="taxon:9606" /map="5p11-q13" mRNA <1..2802 /note="ASB mRNA" sig_peptide 560..679 /gene="ARSB" /note="arylsulfatase B signal peptide" gene 560..2161 /gene="ARSB" CDS 560..2161 /gene="ARSB" /note="arylsulfatase B precursor" /codon_start=1 /db_xref="GDB:G00-119-008" /db_xref="PID:g179030" /translation="MGPRGAASLPRGPGPRRLLLPVVLPLLLLLLLAPPGSGAGASRP PHLVFLLADDLGWNDVGFHGSRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGR YQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRR GFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKR AIALITNHPPEKPLFLYLALQSVHEPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEA VGNVTAALKSSGLWNNTVFIFSTDNGGQTLAGGNNWPLRGRKWSLWEGGVRGVGFVAS PLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDMWKTISEGSPSPRIELL HNIDPNFVDSSPCPRNSMAPAKDDSSLPEYSAFNTSVHAAIRHGNWKLLTGYPGCGYW FPPPSQYNVSEIPSSDPPTKTLWLFDIDRDPEERHDLSREYPHIVTKLLSRLQFYHKH SVPVYFPAQDPRCDPKATGVWGPWM" mat_peptide 680..2158 /gene="ARSB" /note="arylsulfatase B" BASE COUNT 646 a 767 c 726 g 663 t ORIGIN 1 catggatttc gacattgctg gacctgccac aggctgggct cttgtgctag aaatgacttg 61 ctagctagac atcatggttc aggatctgag tcagaggttt aaccatttat aagctttttt 121 cttatgaaaa attggcacta attataatgt ctaactgtca gagttgttgc aggctttaca 181 ggagacgcgg gctgtgaaga tgctttgtaa attgtgaagc gttattaaag aacacatctt 241 tttttttagg aaaccacggt gcaaatttaa ttgccgggga agataacggg ccttggtgcc 301 ctccaagcgt cagctgagtt tccaagaagc cgggcagcgg gcgcccgcgg gttcgtctct 361 ggctcctcct ccgccacagc agccgggggc ccgggtcgga ggcggcgggg gccgagcgcc 421 cggcctcgca agcccacggc ccgctggggg tgccgtcccg cgccggggcg gagcaggccc 481 cggcagccca gttcctcatt ctatcagcgg tacaaggggc tggtggcgcc acaggcgctg 541 ggaccgcggg cggacaagga tgggtccgcg cggcgcggcg agcttgcccc gaggccccgg 601 acctcggcgg ctgctcctcc ccgtcgtcct cccgctgctg ctgctgctgt tgttggcgcc 661 gccgggctcg ggcgccgggg ccagccggcc gccccacctg gtcttcttgc tggcagacga 721 cctaggctgg aacgacgtcg gcttccacgg ctcccgcatc cgcacgccgc acctggacgc 781 gctggcggcc ggcggggtgc tcctggacaa ctactacacg cagccgctgt gcacgccgtc 841 gcggagccag ctgctcactg gccgctacca gatccgtaca ggtttacagc accaaataat 901 ctggccctgt cagcccagct gtgttcctct ggatgaaaaa ctcctgcccc agctcctaaa 961 agaagcaggt tatactaccc atatggtcgg aaaatggcac ctgggaatgt accggaaaga 1021 atgccttcca acccgccgag gatttgatac ctactttgga tatctcctgg gtagtgaaga 1081 ttattattcc catgaacgct gtacattaat tgacgctctg aatgtcacac gatgtgctct 1141 tgattttcga gatggcgaag aagttgcaac aggatataaa aatatgtatt caacaaacat 1201 attcaccaaa agggctatag ccctcataac taaccatcca ccagagaagc ctctgtttct 1261 ctaccttgct ctccagtctg tgcatgagcc ccttcaggtc cctgaggaat acttgaagcc 1321 atatgacttt atccaagaca agaacaggca tcactatgca ggaatggtgt cccttatgga 1381 tgaagcagta ggaaatgtca ctgcagcttt aaaaagcagt gggctctgga acaacacggt 1441 gttcatcttt tctacagata acggagggca gactttggca gggggtaata actggcccct 1501 tcgaggaaga aaatggagcc tgtgggaagg aggcgtccga ggggtgggct ttgtggcaag 1561 ccccttgctg aagcagaagg gcgtgaagaa ccgggagctc atccacatct ctgactggct 1621 gccaacactc gtgaagctgg ccaggggaca caccaatggc acaaagcctc tggatggctt 1681 cgacatgtgg aaaaccatca gtgaaggaag cccatccccc agaattgagc tgctgcataa 1741 tattgacccg aacttcgtgg actcttcacc gtgtcccagg aacagcatgg ctccagcaaa 1801 ggatgactct tctcttccag aatattcagc ctttaacaca tctgtccatg ctgcaattag 1861 acatggaaat tggaaactcc tcacgggcta cccaggctgt ggttactggt tccctccacc 1921 gtctcaatac aatgtttctg agataccctc atcagaccca ccaaccaaga ccctctggct 1981 ctttgatatt gatcgggacc ctgaagaaag acatgacctg tccagagaat atcctcacat 2041 cgtcacaaag ctcctgtccc gcctacagtt ctaccataaa cactcagtcc ccgtgtactt 2101 ccctgcacag gacccccgct gtgatcccaa ggccactggg gtgtggggcc cttggatgta 2161 ggatttcagg gaggctagaa aacctttcaa ttggaagttg gacctcaggc cttttctcac 2221 gactcttgtc tcatttgtta tcccaacctg ggttcacttg gcccttctct tgctcttaaa 2281 ccacaccgag gtgtctaatt tcaaccccta atgcatttaa gaagctgata aaatctgcaa 2341 cactcctgct gttggctgga gcatgtgtct agaggtggct ggagcatgtg tctagaggtg 2401 ggggtggctg ggtttatccc cctttcctaa gccttgggac agctgggaac ttaacttgaa 2461 ataggaagtt ctcactgaat cctggaggct ggaacagctg gctcttttag actcacaagt 2521 cagacgttcg attcccctct gccaatagcc agttttattg gagtgaatca catttcttac 2581 gcaaatgaag ggagcagaca gtgattaatg gttctgttgg caaggcttct ccctgtcggt 2641 gaaggatcat gttcaggcac tccaagtgaa ccacccctct tggttcaccc cttactcact 2701 tatctcatca cagagcataa ggcccatttt gttgttcagg tcaacagcaa aatgcctgca 2761 ccatgactgt ggcttttaaa ataaagaaat gtgtttttat cg // LOCUS HUMARXC 1230 bp mRNA PRI 31-OCT-1994 DEFINITION Human amphiregulin (AR) mRNA, complete cds, clones lambda-AR1 and lambda-AR2. ACCESSION M30704 NID g179039 KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor. SOURCE Homo sapiens breast carcinoma cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1230) AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G., Disteche,C.M., Todaro,G.J. and Shoyab,M. TITLE The amphiregulin gene encodes a novel epidermal growth factor-related protein with tumor-inhibitory activity JOURNAL Mol. Cell. Biol. 10 (5), 1969-1981 (1990) MEDLINE 90220581 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.D.Plowman, 18-DEC-1989. FEATURES Location/Qualifiers source 1..1230 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="MCF-7" /tissue_type="breast carcinoma" /map="4q13-q21" mRNA 1..1230 /gene="AREG" /note="G00-119-697" gene 1..1230 /gene="AREG" sig_peptide 210..266 /gene="AREG" /note="G00-119-697" CDS 210..968 /gene="AREG" /note="precursor" /codon_start=1 /db_xref="GDB:G00-119-697" /product="amphiregulin" /db_xref="PID:g179040" /translation="MRAPLLPPAPVVLSLLILGSGHYAAGLDLNDTYSGKREPFSGDH SADGFEVTSRSEMSSGSEISPVSEMPSSSEPSSGADYDYSEEYDNEPQIPGYIVDDSV RVEQVVKPPQNKTESENTSDKPKRKKKGGKNGKNRRNRKKKNPCNAEFQNFCIHGECK YIEHLEAVTCKCQQEYFGERCGEKSMKTHSMIDSSLSKIALAAIAAFMSAVILTAVAV ITVQLRRQYVRKYEGEAEERKKLRQENGNVHAIA" mat_peptide 510..962 /gene="AREG" /note="alternative; G00-119-697" /product="amphiregulin" mat_peptide 528..962 /gene="AREG" /note="alternative; G00-119-697" /product="amphiregulin" BASE COUNT 375 a 273 c 285 g 297 t ORIGIN 1 agacgttcgc acacctgggt gccagcgccc cagaggtccc gggacagccc gaggcgccgc 61 gcccgccgcc ccgagctccc caagccttcg agagcggcgc acactcccgg tctccactcg 121 ctcttccaac acccgctcgt tttgcggcag ctcgtgtccc agagaccgag ttgccccaga 181 gaccgagacg ccgccgctgc gaaggaccaa tgagagcccc gctgctaccg ccggcgccgg 241 tggtgctgtc gctcttgata ctcggctcag gccattatgc tgctggattg gacctcaatg 301 acacctactc tgggaagcgt gaaccatttt ctggggacca cagtgctgat ggatttgagg 361 ttacctcaag aagtgagatg tcttcaggga gtgagatttc ccctgtgagt gaaatgcctt 421 ctagtagtga accgtcctcg ggagccgact atgactactc agaagagtat gataacgaac 481 cacaaatacc tggctatatt gtcgatgatt cagtcagagt tgaacaggta gttaagcccc 541 cccaaaacaa gacggaaagt gaaaatactt cagataaacc caaaagaaag aaaaagggag 601 gcaaaaatgg aaaaaataga agaaacagaa agaagaaaaa tccatgtaat gcagaatttc 661 aaaatttctg cattcacgga gaatgcaaat atatagagca cctggaagca gtaacatgca 721 aatgtcagca agaatatttc ggtgaacggt gtggggaaaa gtccatgaaa actcacagca 781 tgattgacag tagtttatca aaaattgcat tagcagccat agctgccttt atgtctgctg 841 tgatcctcac agctgttgct gttattacag tccagcttag aagacaatac gtcaggaaat 901 atgaaggaga agctgaggaa cgaaagaaac ttcgacaaga gaatggaaat gtacatgcta 961 tagcataact gaagataaaa ttacaggata tcacattgga gtcactgcca agtcatagcc 1021 ataaatgatg agtcggtcct ctttccagtg gatcataaga caatggaccc tttttgttat 1081 gatggtttta aactttcaat tgtcactttt tatgctattt ctgtatataa aggtgcacga 1141 aggtaaaaag tattttttca agttgtaaat aatttattta atatttaatg gaagtgtatt 1201 tattttacag ctcattaaac ttttttaacc // LOCUS HUMASAM 1941 bp mRNA PRI 11-JAN-1991 DEFINITION Human cytosolic aspartate aminotransferase mRNA, complete cds. ACCESSION M37400 NID g179066 KEYWORDS aspartate aminotransferase. SOURCE Human adult liver, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1941) AUTHORS Bousquet-Lemercier,B., Pol,S., Pave-Preux,M., Hanoune,J. and Barouki,R. TITLE Properties of human liver cytosolic aspartate aminotransferase mRNAs generated by alternative polyadenylation site selection JOURNAL Biochemistry 29, 5293-5299 (1990) MEDLINE 90344765 FEATURES Location/Qualifiers source 1..1941 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="liver" mRNA <1..1941 /note="aspartate aminotransferase mRNA" CDS 25..1266 /note="aspartate aminotransferase" /codon_start=1 /db_xref="PID:g179067" /translation="MAPPSVFAEVPQAQPVLVFKLTADFREDPDPRKVNLGVGAYRTD DCHPWVLPVVKKVEQKIANDNSLNHEYLPILGLAEFRSCASRLALGDDSPALKEKRVG GVQSLGGTGALRIGADFLARWYNGTNNKNTPVYVSSPTWENHNAVFSAAGFKDIRSYR YWDAEKRGLDLQGFLNDLENAPEFSIVVLHACAHNPTGIDPTPEQWKQIASVMKHRFL FPFFDSAYQGFASGNLERDAWAIRYFVSEGFEFFCAQSFSKNFGLYNERVGNLTVVGK EPESILQVLSQMEKIVRITWSNPPAQGARIVASTLSNPELFEEWTGNVKTMADRILTM RSELRARLEALKTPGTWNHITDQIGMFSFTGLNPKQVEYLVNEKHIYLLPSGRINVSG LTTKNLDYVATSIHEAVTKIQ" BASE COUNT 493 a 469 c 475 g 504 t ORIGIN 1 tctcttgatt cctagtctct cgatatggca cctccgtcag tctttgccga ggttccgcag 61 gcccagcctg tcctggtctt caagctcact gccgacttca gggaggatcc ggacccccgc 121 aaggtcaacc tgggagtggg agcatatcgc acggatgact gccatccctg ggttttgcca 181 gtagtgaaga aagtggagca gaagattgct aatgacaata gcctaaatca cgagtatctg 241 ccaatcctgg gcctggctga gttccggagc tgtgcttctc gtcttgccct tggggatgac 301 agcccagcac tcaaggagaa gcgggtagga ggtgtgcaat ctttgggggg aacaggtgca 361 cttcgaattg gagctgattt cttagcgcgt tggtacaatg gaacaaacaa caagaacaca 421 cctgtctatg tgtcctcacc aacctgggag aatcacaatg ctgtgttttc cgctgctggt 481 tttaaagaca ttcggtccta tcgctactgg gatgcagaga agagaggatt ggacctccag 541 ggcttcctga atgatctgga gaatgctcct gagttctcca ttgttgtcct ccacgcctgt 601 gcacacaacc caactgggat tgacccaact ccggagcagt ggaagcagat tgcttctgtc 661 atgaagcacc ggtttctgtt ccccttcttt gactcagcct atcagggctt cgcatctgga 721 aacctggaga gagatgcctg ggccattcgc tattttgtgt ctgaaggctt cgagttcttc 781 tgtgcccagt ccttctccaa gaacttcggg ctctacaatg agagagtcgg gaatctgact 841 gtggttggaa aagaacctga gagcatcctg caagtccttt cccagatgga gaagatcgtg 901 cggattactt ggtccaatcc ccccgcccag ggagcacgaa ttgtggccag caccctctct 961 aaccctgagc tctttgagga atggacaggt aatgtgaaga caatggctga ccggattctg 1021 accatgagat ctgaactcag ggcacgacta gaagccctca aaacccctgg gacctggaac 1081 cacatcactg atcaaattgg catgttcagc ttcactgggt tgaaccccaa gcaggttgag 1141 tatctggtca atgaaaagca catctacctg ctgccaagtg gtcgaatcaa cgtgagtggc 1201 ttaaccacca aaaatctaga ttacgtggcc acctccatcc atgaagcagt caccaaaatc 1261 cagtgaagaa acaccacccg tccagtacca ccaaagtagt tctctgtcat gtgtgttccc 1321 tgcctgcaca aacctacatg tacataccat ggattagaga cacttgcagg actgaaagct 1381 gctctggtga ggcagcctct gtttaaaccg gccccacatg aagagaacat cccttgagac 1441 gaatttggag actgggatta gagcctttgg aggtcaaagc aaattaagat ttttatttaa 1501 gaataaaaga gtactttgat catgagacat aggtatcttg tccctctcac taaaaaggag 1561 tgttgtgtgt ggcggccacg tgcttctatg tggtgtttga ctctgtacaa attctagtcc 1621 caaagatcaa gttgtctgaa ggagccaaag tgtgaatgtg ggtgtcggct gcggcattaa 1681 attcatcatc tcaacccaga gtgtctggtc tccctgctct ttctgcatgg ttgtgtccct 1741 agtcctaagc tttggttctt tagggtgact gtggtaagaa ggatatttaa tcatgacatg 1801 cacggacacg tacatattta actgaaacaa gttttaccaa acagtattta ctcgtgatgt 1861 gcgtagtgca ttctgatatt tttgagccat tctattgtgt tctacttcac ctaaaaaaat 1921 aaaataaaaa tgttgatcaa g // LOCUS HUMASCT1A 2102 bp mRNA PRI 15-SEP-1993 DEFINITION Human alanine/serine/cysteine/threonine transporter (ASCT1) mRNA, complete cds. ACCESSION L14595 NID g348011 KEYWORDS neutral amino acid transporter; sodium-dependent transporter; transmembrane protein. SOURCE Homo sapiens (library: lambda ZAPII) adult brain motor cortex cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2102) AUTHORS Arriza,J.L., Kavanaugh,M.P., Fairman,W.A., Wu,Y.-N., Murdoch,G.H., North,R.A. and Amara,S.G. TITLE Cloning and expression of the human neutral amino acid transporter with structural similarity to the glutamate transporter gene family JOURNAL J. Biol. Chem. 268, 15329-15329 (1993) MEDLINE 93340119 REFERENCE 2 (bases 1 to 2102) AUTHORS Arriza,J.L. TITLE Direct Submission JOURNAL Submission received (Apr 19 1993) via Electronic mail by Genbank FEATURES Location/Qualifiers source 1..2102 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="brain motor cortex" /tissue_lib="lambda ZAPII" gene 184..1782 /gene="ASCT1" CDS 184..1782 /gene="ASCT1" /standard_name="system ASC, alanine/serine/cysteine/threonine transporter" /note="2 potential N-linked glycosylation sites at residues 201 and 206; in-frame stop codon 24 bp upstream of initiator methionine" /codon_start=1 /product="neutral amino acid transporter" /db_xref="PID:g348012" /translation="MEKSNETNGYLDSAQAGPAAGPGAPGTAAGRARRCARFLRRQAL VLLTVSGVLAGAGLGAALRGLSLSRTQVTYLAFPGEMLLRMLRMIILPLVVCSLVSGA ASLDASCLGRLGGIRVAYFGLTTLSASALAVALAFIIKPGSGAQTLQSSDLGLEDSGP PPVPKETVDSFLDLARNLFPSNLVVAAFRTYATDYKVVTQNSSSGNVTHEKIPIGTEI EGMNILGLVLFALVLGVALKKLGSEGEDLIRFFNSLNEATMVLVSWIMWYVPVGIMFL VGSKIVEMKDIIVLVTSLGKYIFASILGHVIHGGIVLPLIYFVFTRKNPFRFLLGLLA PFATAFATCSSSATLPSMMKCIEENNGVDKRISRFILPIGATVNMDGAAIFQCVAAVF IAQLNNIELNAGQIFTILVTATASSVGAAGVPAGGVLTIAIILEAIGLPTHDLPLILA VDWIVDRTTTVVNVEGDALGAGILHHLNQKATKKGEQELAEVKVEAIPNCKSEEETSP LVTHQNPAGPVASAPELESKESVL" BASE COUNT 410 a 637 c 603 g 452 t ORIGIN 1 cccgcactct gcgcctctcc tcgcctttct cgcacctgct cctgcgccag gcccggagac 61 ccccggggcg gcttcccaga acctgcggag cacaactggc cgaccgaccc attcattggg 121 aacccgtctt ttgccagagc ccacgtcccc tgccacctct agctcggagc ggcgtgtagc 181 gccatggaga agagcaacga gaccaacggc taccttgaca gcgctcaggc ggggcctgcg 241 gccgggcccg gagctccggg gaccgcggcg ggacgcgcac ggcgttgcgc gcgcttcctg 301 cggcgccaag cgctggtgct gctcaccgtg tccggggtgc tggcgggcgc gggcctgggc 361 gcggcgttgc gcgggctcag cctgagccgc acgcaggtca cctacctggc cttccccggc 421 gagatgctgc tccgcatgct gcgcatgatc atcctgccgc tggtggtctg cagcctggtg 481 tcgggcgccg cctcgctcga tgccagctgc ctcgggcgtc tgggcggcat ccgtgtcgcc 541 tactttggcc tcaccacact gagtgcctcg gcgctcgccg tggccttggc gttcatcatc 601 aagccaggat ccggtgcgca gacccttcag tccagcgacc tggggctgga ggactcgggg 661 cctcctcctg tccccaaaga gacggtggac tctttcctcg acctggccag aaacctgttt 721 ccctccaatc ttgtggttgc agctttccgt acgtatgcaa ccgattataa agtcgtgacc 781 cagaacagca gctctggaaa tgtaacccat gaaaagatcc ccataggcac tgagatagaa 841 gggatgaaca ttttaggatt ggtcctgttt gctctggtgt taggagtggc cttaaagaaa 901 ctaggctccg aaggagaaga cctcatccgt ttcttcaatt ccctcaacga ggcgacgatg 961 gtgctggtgt cctggattat gtggtacgta cctgtgggca tcatgttcct tgttggaagc 1021 aagatcgtgg aaatgaaaga catcatcgtg ctggtgacca gcctggggaa atacatcttc 1081 gcatctatat tgggccatgt tattcatgga ggaattgttc tgccacttat ttattttgtt 1141 ttcacacgaa aaaacccatt cagattcctc ctgggcctcc tcgccccatt tgcgacagca 1201 tttgctacct gctccagctc agcgaccctt ccctctatga tgaagtgcat tgaagagaac 1261 aatggtgtgg acaagaggat cagcaggttt attctcccca tcggggccac cgtgaacatg 1321 gacggagcag ccatcttcca gtgtgtggcc gcggtgttca ttgcgcaact caacaacata 1381 gagctcaacg caggacagat tttcaccatt ctagtgactg ccacagcgtc cagtgttgga 1441 gcagcaggcg tgccagctgg aggggtcctc accattgcca ttatcctgga ggccattggg 1501 ctgcctactc atgacctgcc tctgatcctg gctgtggact ggattgtgga ccggaccacc 1561 acggtggtga atgtggaagg ggatgccctg ggtgcaggca ttctccacca cctgaatcag 1621 aaggcaacaa agaaaggcga gcaggaactt gctgaggtga aagtggaagc catccccaac 1681 tgcaagtctg aggaggagac atcgcccctg gtgacacacc agaaccccgc tggccccgtg 1741 gccagtgccc cagaactgga atccaaggag tcggttctgt gatggggctg ggctttgggc 1801 ttgcctgcca gcagtgatgt cccaccctgt tcacccagcc gccagtcatg gacacagggc 1861 actgccttgc caacttttac cctcccaagc aatgctttgg cccagtcgct ggcctgaggc 1921 ttacctctcg gcactggcat tgggctcccc agccggaact ggttaccaag gacaaggaca 1981 ctctgacatt cggcttgatc catgtccagg tgcaactgtg tgtacaccag ggatctgttt 2041 ggaaacaacc ccttgagctg ccaggctcaa gaaatcatgg actcacaggg tcctgtgtgg 2101 tt // LOCUS HUMASF 1717 bp mRNA PRI 05-SEP-1991 DEFINITION Human alternative splicing factor mRNA, complete cds. ACCESSION M72709 NID g179073 KEYWORDS alternative splicing factor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1717) AUTHORS Ge,H., Zuo,P. and Manley,J.L. TITLE Primary structure of the human splicing factor ASF reveals similarities with drosophila regulators JOURNAL Cell 66, 373-382 (1991) MEDLINE 91309149 FEATURES Location/Qualifiers source 1..1717 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" 5'UTR 1..96 exon 1..648 /number=1 mRNA join(1..648,831..1717) /note="alternative" mRNA join(1..648,848..1717) /note="alternative" CDS join(97..648,831..1157) /note="alternative" /codon_start=1 /number=2 /db_xref="PID:g179074" /translation="MSGGGVIRGPAGNNDCRIYVGNLPPDIRTKDIEDVFYKYGAIRD IDLKNRRGGPPFAFVEFEDPRDAEDAVYGRDGYDYDGYRLRVEFPRSGRGTGRGGGGG GGGGAPRGRYGPPSRRSENRVVVSGLPPSGSWQDLKDHMREAGDVCYADVYRDGTGVV EFVRKEDMTYAVRKLDNTKFRSHEFCLSNREKLPTSGLKLMGPEVQVMEDLDLEAVVV AEAVAEATAGVAVTPQGEAEDHHAILPVIADLALVHKMIGDTFCRTHVVYSFPLFSTI FSFFNSNCFVQNGLKC" CDS join(97..648,848..1042) /note="alternative" /codon_start=1 /number=1 /db_xref="PID:g179075" /translation="MSGGGVIRGPAGNNDCRIYVGNLPPDIRTKDIEDVFYKYGAIRD IDLKNRRGGPPFAFVEFEDPRDAEDAVYGRDGYDYDGYRLRVEFPRSGRGTGRGGGGG GGGGAPRGRYGPPSRRSENRVVVSGLPPSGSWQDLKDHMREAGDVCYADVYRDGTGVV EFVRKEDMTYAVRKLDNTKFRSHEGETAYIRVKVDGPRSPSYGRSRSRSRSRSRSRSR SNSRSRSYSPRRSRGSPRYSPRHSRSRSRT" intron 649..847 /number=1 /label=1a intron 649..830 /number=1 /label=1b exon 831..1717 /number=2 /label=2b exon 848..1717 /number=2 /label=2a 3'UTR 1043..1717 BASE COUNT 420 a 343 c 465 g 489 t ORIGIN 1 cgccgcggga gacgtggtgc cgctgcgggc tcgctctgcc gtgcgctagg cttggtggga 61 aggcctgttc tcgagtccgc acttttcgtc accgccatgt cgggaggtgg tgtgattcgt 121 ggccccgcag ggaacaacga ttgccgcatc tacgtgggta acttacctcc agacatccga 181 accaaggaca ttgaggacgt gttctacaaa tacggcgcta tccgcgacat cgacctcaag 241 aatcgccgcg ggggaccgcc cttcgccttc gttgagttcg aggacccgcg agacgcagag 301 gacgcggtgt atggtcgcga cggctatgat tacgatgggt accgtctgcg ggtggagttt 361 cctcgaagcg gccgtggaac aggccgaggc ggcggcgggg gtggaggtgg cggagctccc 421 cgaggtcgct atggcccccc atccaggcgg tctgaaaaca gagtggttgt ctctggactg 481 cctccaagtg gaagttggca ggatttaaag gatcacatgc gtgaagcagg tgatgtatgt 541 tatgctgatg tttaccgaga tggcactggt gtcgtggagt ttgtacggaa agaagatatg 601 acctatgcag ttcgaaaact ggataacact aagtttagat ctcatgaggt aggttataca 661 cgtattcttt tctttgacca gaattggata cagtggtctt aacagtggaa tttcaaggta 721 aggattcagg caaggttgtc aagtaaattg ccagatttct ggttttagtt acattgtatt 781 acttacgcat gtctgaagat agatgaaagc ttagatcttt caatggaaag ttctgtctat 841 ccaataggga gaaactgcct acatccgggt taaagttgat gggcccagaa gtccaagtta 901 tggaagatct cgatctcgaa gccgtagtcg tagcagaagc cgtagcagaa gcaacagcag 961 gagtcgcagt tactccccaa ggagaagcag aggatcacca cgctattctc cccgtcatag 1021 cagatctcgc tctcgtacat aagatgattg gtgacacttt ttgtagaacc catgttgtat 1081 acagttttcc tttattcagt acaatctttt cattttttaa ttcaaactgt tttgttcaga 1141 atgggctaaa gtgttgaatt gcattcttgt aatatcccct tgctcctaac atctacattc 1201 ccttcgtgtc tttgataaat tgtattttaa gtgatgtcat agacaggatt gtttaaattt 1261 agttaactcc atactcttca gactgtgata ttgtgtaaat gtctatctgc cctggtttgt 1321 gtgaactggg atgttggggt gtttgtggtt atcttacctg ggaagttctt agtttatctt 1381 gcttttcatg tgtctttctg tagacatatc tgaagagatg gattaagaat gctttggatt 1441 aaggattgtg gagcacattt caatcatttt aggattgtca aaaggaggat tgaggaggat 1501 cagatcaata atggaggcaa tggtttggat tggagagggc tcactggatc ccaatccttg 1561 gagctggatc attggattca aatcataatg tggataggat agggaggatg aattaccagg 1621 attcatggag cgggatcaga ttaccaggaa cataggagtg gattcctgcc ccaaccaaac 1681 cgcattcgtg tggatttttt tattcaactt aattggc // LOCUS HUMASGPR1 1277 bp mRNA PRI 31-OCT-1994 DEFINITION Human asialoglycoprotein receptor H1 mRNA, complete cds. ACCESSION M10058 NID g179078 KEYWORDS asialoglycoprotein receptor. SOURCE Human hepatoma cell line Hep G2, cDNA to mRNA, clone A21. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1277) AUTHORS Spiess,M., Schwartz,A.L. and Lodish,H.F. TITLE Sequence of human asialoglycoprotein receptor cDNA. An internal signal sequence for membrane insertion JOURNAL J. Biol. Chem. 260 (4), 1979-1982 (1985) MEDLINE 85130911 COMMENT Draft entry and sequence in computer-readable form for [1] kindly provided by M.Spiess, 03-SEP-1985. The asialoglycoprotein receptor is expressed exclusively in hepatic parenchymal cells in mammals. After ligand binding to the receptor, the resulting complex is internalized and transported to a sorting organelle, where receptor and ligand are disassociated. The receptor then returns to the cell membrane surface. The membrane spanning region of the protein is encoded by nucleotides 300 to 356. This region might function as an internal hydrophobic membrane insertion signal. There is no proteolytic processing of the protein and no signal peptide. [1] is not sure of the 5' end of the mRNA. FEATURES Location/Qualifiers source 1..1277 /organism="Homo sapiens" /db_xref="taxon:9606" /map="17pter-p12" mRNA <1..>1277 /note="ASGP-R1 mRNA" gene 173..1048 /gene="ASGR1" CDS 173..1048 /gene="ASGR1" /note="asialoglycoprotein receptor H1" /codon_start=1 /db_xref="GDB:G00-118-754" /db_xref="PID:g179079" /translation="MTKEYQDLQHLDNEESDHHQLRKGPPPPQPLLQRLCSGPRLLLL SLGLSLLLLVVVCVIGSQNSQLQEELRGLRETFSNFTASTEAQVKGLSTQGGNVGRKM KSLESQLEKQQKDLSEDHSSLLLHVKQFVSDLRSLSCQMAALQGNGSERTCCPVNWVE HERSCYWFSRSGKAWADADNYCRLEDAHLVVVTSWEEQKFVQHHIGPVNTWMGLHDQN GPWKWVDGTDYETGFKNWRPEQPDDWYGHGLGGGEDCAHFTDDGRWNDDVCQRPYRWV CETELDKASQEPPLL" BASE COUNT 271 a 394 c 378 g 234 t ORIGIN 70 bp upstream of HindIII site. 1 agggccctcc tatggaccct gcccgctccc ctcccattgt ccacggctgt ccgcccaccc 61 ccattctcca agcttcagcc ccctccttag ttcggcatct gcacagcact gaagaacctg 121 ggaatcagac cctgagaccc tgagcaatcc caggtccagc gccagcccta tcatgaccaa 181 ggagtatcaa gaccttcagc atctggacaa tgaggagagt gaccaccatc agctcagaaa 241 agggccacct cctccccagc ccctcctgca gcgtctctgc tccggacctc gcctcctcct 301 gctctccctg ggcctcagcc tcctgctgct tgtggttgtc tgtgtgatcg gatcccaaaa 361 ctcccagctg caggaggagc tgcggggcct gagagagacg ttcagcaact tcacagcgag 421 cacggaggcc caggtcaagg gcttgagcac ccagggaggc aatgtgggaa gaaagatgaa 481 gtcgctagag tcccagctgg agaaacagca gaaggacctg agtgaagatc actccagcct 541 gctgctccac gtgaagcagt tcgtgtctga cctgcggagc ctgagctgtc agatggcggc 601 gctccagggc aatggctcag aaaggacctg ctgcccggtc aactgggtgg agcacgagcg 661 cagctgctac tggttctctc gctccgggaa ggcctgggct gacgccgaca actactgccg 721 gctggaggac gcgcacctgg tggtggtcac gtcctgggag gagcagaaat ttgtccagca 781 ccacataggc cctgtgaaca cctggatggg cctccacgac caaaacgggc cctggaagtg 841 ggtggacggg acggactacg agacgggctt caagaactgg aggccggagc agccggacga 901 ctggtacggc cacgggctcg gaggaggcga ggactgtgcc cacttcaccg acgacggccg 961 ctggaacgac gacgtctgcc agaggcccta ccgctgggtc tgcgagacag agctggacaa 1021 ggccagccag gagccacctc tcctttaatt tatttcttca atgcctcgac ctgccgcagg 1081 ggtccgggat tgggaatccg cccatctggg gcctcttctg ctttctcggg aattttcatc 1141 taggatttta agggaagggg aaggataggg tgatgttccg aaggtgagga gcttgaaacc 1201 cgtggcgctt tctgcagttc acaatgataa cctgcaaact gcagaaagcg ccacgggttt 1261 caagctcctc accttcg // LOCUS HUMASH1A 1635 bp mRNA PRI 31-DEC-1994 DEFINITION Homo sapiens achaete scute homologous protein (ASH1) mRNA, complete cds. ACCESSION L08424 NID g306459 KEYWORDS achaete-scute protein; bHLH transcription factor; transcription factor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1635) AUTHORS Ball,D.W., Azzoli,C.G., Baylin,S.B., Chi,D., Dou,S., Donis-Keller,H., Cumaraswamy,A., Borges,M. and Nelkin,B.D. TITLE Identification of a human achaete-scute homolog highly expressed in neuroendocrine tumors JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90 (12), 5648-5652 (1993) MEDLINE 93296195 FEATURES Location/Qualifiers source 1..1635 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="TT" /cell_type="medullary thyroid carcinoma" gene 433..1149 /gene="ASH1" CDS 433..1149 /gene="ASH1" /note="alternative start codon at position 451; homologue 1; putative" /codon_start=1 /function="Putative bHLH transcription factor" /product="achaete scute protein" /db_xref="PID:g306460" /translation="MESSAKMESGGAGQQPQPQPQQPFLPPAACFFATAAAAAAAAAA AAAQSAQQQQQQQQQQQQQQAPQLRPAADGQPSGGGHKSAPKQVKRQRSSSPELMRCK RRLNFSGFGYSLPQQQPAAVARRNERERNRVKLVNLGFATLREHVPNGAANKKMSKVE TLRSAVEYIRALQQLLDEHDAVSAAFQAGVLSPTISPNYSNDLNSMAGSPVSSYSSDE GSYDPLSPEEQELLDFTNWF" repeat_region 583..624 /note="polymorphic; putative" /rpt_family="trinucleotide" /rpt_type=tandem /rpt_unit=583..585 BASE COUNT 361 a 515 c 471 g 288 t ORIGIN 1 cccgagaccc ggcgcaagag agcgcagcct tagtaggaga ggaacgcgag acgcggcaga 61 gcgcgttcag cactgacttt tgctgctgct tctgcttttt tttttcttag aaacaagaag 121 gcgccagcgg cagcctcaca cgcgagcgcc acgcgaggct cccgaagcca acccgcgaag 181 ggaggagggg agggaggagg aggcggcgtg cagggaggag aaaaagcatt ttcacctttt 241 ttgctcccac tctaagaagt ctcccgggga ttttgtatat attttttaac ttccgtcagg 301 gctcccgctt catatttcct tttctttccc tctctgttcc tgcacccaag ttctctctgt 361 gtccccctcg cgggccccgc acctcgcgtc ccggatcgct ctgattccgc gactccttgg 421 ccgccgctgc gcatggaaag ctctgccaag atggagagcg gcggcgccgg ccagcagccc 481 cagccgcagc cccagcagcc cttcctgccg cccgcagcct gtttctttgc cacggccgca 541 gccgcggcgg ccgcagccgc cgcagcggca gcgcagagcg cgcagcagca gcagcagcag 601 cagcagcagc agcagcagca gcaggcgccg cagctgagac cggcggccga cggccagccc 661 tcagggggcg gtcacaagtc agcgcccaag caagtcaagc gacagcgctc gtcttcgccc 721 gaactgatgc gctgcaaacg ccggctcaac ttcagcggct ttggctacag cctgccgcag 781 cagcagccgg ccgccgtggc gcgccgcaac gagcgcgagc gcaaccgcgt caagttggtc 841 aacctgggct ttgccaccct tcgggagcac gtccccaacg gcgcggccaa caagaagatg 901 agtaaggtgg agacactgcg ctcggcggtc gagtacatcc gcgcgctgca gcagctgctg 961 gacgagcatg acgcggtgag cgccgccttc caggcaggcg tcctgtcgcc caccatctcc 1021 cccaactact ccaacgactt gaactccatg gccggctcgc cggtctcatc ctactcgtcg 1081 gacgagggct cttacgaccc gctcagcccc gaggagcagg agcttctcga cttcaccaac 1141 tggttctgag gggctcggcc tggtcaggcc ctggtgcgaa tggactttgg aagcagggtg 1201 atcgcacaac ctgcatcttt agtgctttct tgtcagtggc gttgggaggg ggagaaaagg 1261 aaaagaaaaa aaaagaagaa gaagaagaaa agagaagaag aaaaaaacga aaacagtcaa 1321 ccaaccccat cgccaactaa gcgaggcatg cctgagagac atggctttca gaaaacggga 1381 agcgctcaga acagtatctt tgcactccaa tcattcacgg agatatgaag agcaactggg 1441 acctgagtca atgcgcaaaa tgcagcttgt gtgcaaaagc agtgggctcc tggcagaagg 1501 gagcagcaca cgcgttatag taactcccat cacctctaac acgcacagct gaaagttctt 1561 gctcgggtcc cttcacctcc ccgccctttc ttagagtgca gttcttagcc ctctagaaac 1621 gagttggtgt ctttc // LOCUS HUMASL 1549 bp mRNA PRI 31-OCT-1994 DEFINITION Human argininosuccinate lyase mRNA, complete cds. ACCESSION M14218 NID g179082 KEYWORDS argininosuccinate lyase. SOURCE Human liver, cDNA to mRNA, library of S.Woo, clone PAL3. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1549) AUTHORS O'Brien,W.E., McInnes,R., Kalumuck,K. and Adcock,M. TITLE Cloning and sequence analysis of cDNA for human argininosuccinate lyase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83 (19), 7211-7215 (1986) MEDLINE 87016917 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by W.E.O'Brien, 29-JAN-1987. FEATURES Location/Qualifiers source 1..1549 /organism="Homo sapiens" /db_xref="taxon:9606" /map="7pter-q22" mRNA <1..1549 /note="ASL mRNA" gene 115..1506 /gene="ASL" CDS 115..1506 /gene="ASL" /note="argininosuccinate lyase (EC 4.3.2.1)" /codon_start=1 /db_xref="GDB:G00-119-703" /db_xref="PID:g179083" /translation="MASESGKLWGGRFVGAVDPIMEKFNASIAYDRHLWEVDVQGSKA YSRGLEKAGLLTKAEMDQILHGLDKVAEEWAQGTFKLNSNDEDIHTANERRLKELIGA TAGKLHTGRSRNDQVVTDLRLWMRQTCSTLSGLLWELIRTMVDRAEAERDVLFPGYTH LQRAQPIRWSHWILSHAVALTRDSERLLEVRKRINVLPLGSGAIAGNPLGVDRELLRA ELNFGAITLNSMDATSERDFVAEFLFWRSLCMTHLSRMAEDLILYCTKEFSFVQLSDA YSTGSSLMPQKKNPDSLELIRSKAGRVFGRCAGLLMTLKGLPSTYNKDLQEDKEAVFE VSDTMSAVLQVATGVISTLQIHQENMGQALSPDMLATDLAYYLVRKGMPFRQPTRLRE SCVHGRDQGGRPQPAVTAGAADHQPPVLGRRDLRVGLRAQCGAVWCPGRHCALQRRLA DRQVRALLQAQQA" BASE COUNT 325 a 455 c 495 g 274 t ORIGIN 21 bp upstream of AvaI site; chromosome 7pter-q22. 1 agaactcgga gccagcccgg cccgggggac cctgctggcc aaggaggtcg tcagtccggt 61 cttgtcttcc agacccggag accgaagctt ccggacgacg aggaaccgcc caacatggcc 121 tcggagagtg ggaagctttg gggtggccgg tttgtgggtg cagtggaccc catcatggag 181 aagttcaacg cgtccattgc ctacgaccgg cacctttggg aggtggatgt tcaaggcagc 241 aaagcctaca gcaggggcct ggagaaggca gggctcctca ccaaggccga gatggaccag 301 atactccatg gcctagacaa ggtggctgag gagtgggccc agggcacctt caaactgaac 361 tccaatgatg aggacatcca cacagccaat gagcgccgcc tgaaggagct cattggtgca 421 acggcaggga agctgcacac gggacggagc cggaatgacc aggtggtcac agacctcagg 481 ctgtggatgc ggcagacctg ctccacgctc tcgggcctcc tctgggagct cattaggacc 541 atggtggatc gggcagaggc ggaacgtgat gttctcttcc cggggtacac ccatttgcag 601 agggcccagc ccatccgctg gagccactgg attctgagcc acgccgtggc actgacccga 661 gactctgagc ggctgctgga ggtgcggaag cggatcaatg tcctgcccct ggggagtggg 721 gccattgcag gcaatcccct gggtgtggac cgagagctgc tccgagcaga actcaacttt 781 ggggccatca ctctcaacag catggatgcc actagtgagc gggactttgt ggccgagttc 841 ctgttctggc gttcgctgtg catgacccat ctcagcagga tggccgagga cctcatcctc 901 tactgcacca aggaattcag cttcgtgcag ctctcagatg cctacagcac gggaagcagc 961 ctgatgcccc agaagaaaaa ccccgacagt ttggagctga tccggagcaa ggctgggcgt 1021 gtgtttgggc ggtgtgccgg gctcctgatg accctcaagg gacttcccag cacctacaac 1081 aaagacttac aggaggacaa ggaagctgtg tttgaagtgt cagacactat gagtgccgtg 1141 ctccaggtgg ccactggcgt catctctacg ctgcagattc accaagagaa catgggacag 1201 gctctcagcc ccgacatgct ggccactgac cttgcctatt acctggtccg caaagggatg 1261 ccattccgcc agcccacgag gctccgggaa agctgtgttc atggccgaga ccaagggggt 1321 cgccctcaac cagctgtcac tgcaggagct gcagaccatc agccccctgt tctcgggcga 1381 cgtgatctgc gtgtgggact acgggcacag tgtggagcag tatggtgccc tgggcggcac 1441 tgcgcgctcc agcgtcgact ggcagatcgc caggtgcggg cgctactgca ggcacagcag 1501 gcctaggtcc tcccacacct gccccctaat aaagtgggcg cgagaggag // LOCUS HUMASNS 1992 bp mRNA PRI 31-OCT-1994 DEFINITION Human asparagine synthetase mRNA, complete cds. ACCESSION M27396 M16763 NID g179099 KEYWORDS asparagine synthetase. SOURCE Human fibroblast cDNA to mRNA (library of Okayama and Berg), clones pH[57,60,131]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1992) AUTHORS Andrulis,I.L., Chen,J. and Ray,P.N. TITLE Isolation of human cDNAs for asparagine synthetase and expression in Jensen rat sarcoma cells JOURNAL Mol. Cell. Biol. 7 (7), 2435-2443 (1987) MEDLINE 87286877 COMMENT Draft entry and computer-readable copy of sequence in [1] kindly provided by I.L.Andrulis, 25-AUG-1987. FEATURES Location/Qualifiers source 1..1992 /organism="Homo sapiens" /db_xref="taxon:9606" /map="7q21-q31" mRNA 1..1992 /note="asparagine synthetase mRNA" gene 179..1864 /gene="ASNS" CDS 179..1864 /gene="ASNS" /note="asparagine synthetase" /codon_start=1 /db_xref="GDB:G00-119-706" /db_xref="PID:g179100" /translation="MCGIWALFGSDDCLSVQCLSAMKIAHRGPDAFRFENVNGYTNCC FGFHRLAVVDPLFGMQPIRVKKYPYLWLCYNGEIYNHKKMQQHFEFEYQTKVDGEIIL HLYDKGGIEQTICMLDGVFAFVLLDTANKKVFLGRDTYGVRPLFKAMTEDGFLAVCSE AKGLVTLKHSATPFLKVEPFLPGHYEVLDLKPNGKVASVEMVKYHHCRDVPLHALYDN VEKLFPGFEIETVKNNLRILFNNAVKKRLMTDRRIGCLLSGGLDSSLVAATLLKQLKE AQVQYPLQTFAIGMEDSPDLLAARKVADHIGSEHYEVLFNSEEGIQALDEVIFSLETY DITTVRASVGMYLISKYIRKNTDSVVIFSGEGSDELTQGYIYFHKAPSPEKAEEESER LLRELYLFDVLRADRTTAAHGLELRVPFLDHRFFSYYLSLPPEMRIPKNGIEKHLLRE TFEDSNLIPKEILWRPKEAFSDGITSVKNSWFKILQEYVEHQVDDAMMANAAQKFPFN TPKTKEGYYYRQVFERHYPGRADWLSHYWMPKWINATDPSARTLTHYKSAVKA" BASE COUNT 559 a 411 c 459 g 563 t ORIGIN 27 bp upstream of BstNI site; chromosome 7p11-q11. 1 aaacttcccg cacgcgttac aggagccagg tcggtataag cgccacgcct cgccgcccgt 61 caagctgtcc acatccctgg cctcagcccg ccacatcacc ctgacctgct tacgcccaga 121 ttttcttcaa tcacatctga ataaatcact tgaagaaagc ttatagcttc attgcaccat 181 gtgtggcatt tgggcgctgt ttggcagtga tgattgcctt tctgttcagt gtctgagtgc 241 tatgaagatt gcacacagag gtccagatgc attccgtttt gagaatgtca atggatacac 301 caactgctgc tttggatttc accggttggc ggtagttgac ccgctgtttg gaatgcagcc 361 aattcgagtg aagaaatatc cgtatttgtg gctctgttac aatggtgaaa tctacaacca 421 taagaagatg caacagcatt ttgaatttga ataccagacc aaagtggatg gtgagataat 481 ccttcatctt tatgacaaag gaggaattga gcaaacaatt tgtatgttgg atggtgtgtt 541 tgcatttgtt ttactggata ctgccaataa gaaagtgttc ctgggtagag atacatatgg 601 agtcagacct ttgtttaaag caatgacaga agatggattt ttggctgtat gttcagaagc 661 taaaggtctt gttacattga agcactccgc gactcccttt ttaaaagtgg agccttttct 721 tcctggacac tatgaagttt tggatttaaa gccaaatggc aaagttgcat ccgtggaaat 781 ggttaaatat catcactgtc gggatgtacc cctgcacgcc ctctatgaca atgtggagaa 841 actctttcca ggttttgaga tagaaactgt gaagaacaac ctcaggatcc tttttaataa 901 tgctgtaaag aaacgtttga tgacagacag aaggattggc tgccttttat cagggggctt 961 ggactccagc ttggttgctg ccactctgtt gaagcagctg aaagaagccc aagtacagta 1021 tcctctccag acatttgcaa ttggcatgga agacagcccc gatttactgg ctgctagaaa 1081 ggtggcagat catattggaa gtgaacatta tgaagtcctt tttaactctg aggaaggcat 1141 tcaggctctg gatgaagtca tattttcctt ggaaacttat gacattacaa cagttcgtgc 1201 ttcagtaggt atgtatttaa tttccaagta tattcggaag aacacagata gcgtggtgat 1261 cttctctgga gaaggatcag atgaacttac gcagggttac atatattttc acaaggctcc 1321 ttctcctgaa aaagccgagg aggagagtga gaggcttctg agggaactct atttgtttga 1381 tgttctccgc gcagatcgaa ctactgctgc ccatggtctt gaactgagag tcccatttct 1441 agatcatcga tttttttcct attacttgtc tctgccacca gaaatgagaa ttccaaagaa 1501 tgggatagaa aaacatctcc tgagagagac gtttgaggat tccaatctga tacccaaaga 1561 gattctctgg cgaccaaaag aagccttcag tgatggaata acttcagtta agaattcctg 1621 gtttaagatt ttacaggaat acgttgaaca tcaggttgat gatgcaatga tggcaaatgc 1681 agcccagaaa tttcccttca atactcctaa aaccaaagaa ggatattact accgtcaagt 1741 ctttgaacgc cattacccag gccgggctga ctggctgagc cattactgga tgcccaagtg 1801 gatcaatgcc actgaccctt ctgcccgcac gctgacccac tacaagtcag ctgtcaaagc 1861 ttaggtggtc tttatgctgt aatgtgaaag caaatatttc ttcgtgttgg atggggactg 1921 tgggtagata ggggaacaat gagagtcaac tcaggctaac ttgggtttga aaaaaataaa 1981 attcctaaat tt // LOCUS HUMASP 2271 bp mRNA PRI 15-MAR-1990 DEFINITION Human aspartyl-tRNA synthetase alpha-2 subunit mRNA, complete cds. ACCESSION J05032 NID g179101 KEYWORDS transfer RNA-Asp synthetase. SOURCE Human fibroblast HeLa cell line, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2271) AUTHORS Jacobo-Molina,A., Peterson,R. and Yang,D.C.H. TITLE cDNA sequence, predicted primary structure and evolving amphiphilic helix of human aspartyl-tRNA synthetase JOURNAL J. Biol. Chem. 264, 16608-16612 (1989) MEDLINE 89380283 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by D.C.H.Yang, 09-AUG-1989. FEATURES Location/Qualifiers source 1..2271 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 94..1596 /note="aspartyl-tRNA synthetase" /codon_start=1 /db_xref="PID:g179102" /translation="MPSATQRKSQEKPREIMDAAEDYAKERYGISSMIQSQEKPDRVL VRVRDLTIQKADEVVWVRARVHTSRAKGKQCFLVLRQQQFNVQALVAVGDHASKQMVK FAANINKESIVDVEGVVRKVNQKIGSCTQQDVELHVQKIYVISLAEPRLPLQLDDAVR PEQEGEEEGRATVNQDTRLDNRVIDLRTSTSQAVFRLQSGICHLFRETLINKGFVEIQ TPKIISAASEGGANVFTVSYFKNNAYLAQSPQLYKQMCICADFEKVFSIGPVFRAEDS NTHRHLTEFVGLDIEMAFNYHYHEVMEEIADTMVQIFKGLQERFQTEIQTVNKQFPCE PFKFLEPTLRLEYCEALAMLREAGVEMGDEDDLSTPNEKLLGHLVKEKYDTDFYILDK YPLAVRPFYTMPDPRNPKQSKSYDMFMRGEEILSGAQRIHDPQLLTERALHHGNDLEK IKAYIDSFRFGAPPHAGGGIGLERVTMLFLGLHNVRQTSMFPRDPKRLTP" BASE COUNT 755 a 398 c 482 g 636 t ORIGIN 1 tggccggaat tccggggagg gagaagcccc tttggcctgc cttacggaag cctgcgaggg 61 agggtggtgt ccactgccca gttccgtgtc ccgatgccca gcgccacgca gcgcaagagt 121 caggagaagc cgcgggagat catggacgcg gcggaagatt atgctaaaga gagatatgga 181 atatcttcaa tgatacaatc acaagaaaaa ccagatcgag ttttggttcg ggttagagac 241 ttgacaatac aaaaagctga tgaagttgtt tgggtacgtg caagagttca tacaagcaga 301 gctaaaggga aacagtgctt cttagtccta cgtcagcagc agtttaatgt ccaggctctt 361 gtggcggtgg gagaccatgc aagcaagcag atggttaaat ttgctgccaa catcaacaaa 421 gagagcattg tggatgtaga aggtgttgtg agaaaagtga atcagaaaat tggaagctgt 481 acacagcaag acgttgagtt acatgttcag aagatttatg tgatcagttt ggctgaaccc 541 cgtctgcccc tgcagctgga tgatgctgtt cggcctgagc aagaaggaga agaggaagga 601 agagctactg ttaaccagga tacaagatta gacaacagag tcattgatct taggacatca 661 actagtcagg cagtcttccg tctccagtct ggcatctgcc atctcttccg agaaacttta 721 attaacaaag gttttgtgga aatccaaact cctaaaatta tttcagctgc cagtgaagga 781 ggagccaatg tttttactgt gtcatatttt aaaaataatg catacctggc tcagtcccca 841 cagctatata agcaaatgtg catttgtgct gattttgaga aggttttctc tattggacca 901 gtattcagag cggaagactc taatacccat agacatctaa ctgagtttgt tggtttggac 961 attgaaatgg cttttaatta ccattaccac gaagttatgg aagaaattgc tgacaccatg 1021 gtacaaatat tcaaaggact tcaagaaagg tttcagactg aaattcaaac agtgaataaa 1081 cagttcccat gtgagccatt caaatttttg gagccaactc taagactaga atattgtgaa 1141 gcattggcta tgcttaggga agctggagtc gaaatgggag atgaagacga tctgagcaca 1201 ccaaatgaaa agctgttggg tcatttggta aaggaaaagt atgatacaga tttttatatt 1261 cttgataaat atccattggc tgtaagacct ttctatacca tgcctgaccc aagaaatccc 1321 aaacagtcca agtcttacga tatgttcatg agaggagaag aaatattgtc aggagctcaa 1381 agaatacatg atcctcaact gctaacagag agagctttac atcatggaaa tgatttggag 1441 aaaattaagg cttacattga ttccttccgc tttggagccc ctcctcatgc tggtggaggc 1501 attggattgg aacgagttac tatgctgttt ctgggattgc ataatgttcg tcagacctcc 1561 atgttccctc gtgatcccaa acgactcact ccttaaattc acactttgcc acttaactcc 1621 agtgtggatg acagagcgag accctgcctc aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1681 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aagaaagcca cacttattct tttcagtaac 1741 ctgctagtgc acaggctgta ctttaggtac ttaaaatatg cactagaata aatttgcaag 1801 gccctaaaat atcactgtta tttttggagt aattcagtat aggttcgttt aaaagagatt 1861 tttataactt cagacatgca tcagtaggaa ataacttgag aaattcatat ggttatgtta 1921 caaattcata ttctgttact acagtaaacg ttaagagttt taaacagtta agattgtaca 1981 atttttcttc ttttctatat tacaagggcc ccagtgttaa tgtcttagat tttcagtatt 2041 tgaacttatt tttttaaatt ctgtcattga gataagaata attcaggtag catctgaaat 2101 tttaatgaat gtataattgg catatcatgg aaaattaacc agaaagtatc agttcttaaa 2161 agttatgcct agaaattatg taaagctaaa ctactggtta gaaagtattc agtgtaatat 2221 tgtattaatt tgttaaattc taaacttgaa tttcaataaa attttaaagc t // LOCUS HUMASPA 584 bp DNA PRI 27-FEB-1996 DEFINITION Homo sapiens agouti signalling protein (ASP) gene, complete cds. ACCESSION L37019 NID g608647 KEYWORDS agouti signalling protein; homologue. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 584) AUTHORS Wilson,B.D., Ollmann,M.M., Kang,L., Stoffel,M., Bell,G.I. and Barsh,G.S. TITLE Structure and function of ASP, the human homolog of the mouse agouti gene JOURNAL Hum. Mol. Genet. 4 (2), 223-230 (1995) MEDLINE 95276734 FEATURES Location/Qualifiers source 1..584 /organism="Homo sapiens" /db_xref="taxon:9606" exon 1..170 /gene="ASP" gene 1..584 /gene="ASP" CDS 11..409 /gene="ASP" /note="mouse homologue" /codon_start=1 /product="agouti signaling protein" /db_xref="PID:g608648" /translation="MDVTRLLLATLLVFLCFFTANSHLPPEEKLRDDRSLRSNSSVNL LDVPSVSIVALNKKSKPIGRKAAEKKRSSKKEASMKKVVRPRTPLSAPCVATRNSCKP PAPACCDPCASCQCRFFRSACSCRVLSLNC" exon 171..232 /gene="ASP" exon 233..584 /gene="ASP" BASE COUNT 117 a 191 c 169 g 107 t ORIGIN 1 gcctcctggg atggatgtca cccgcttact cctggccacc ctgctggtct tcctctgctt 61 cttcactgcc aacagccacc tgccacctga ggagaagctc cgagatgaca ggagcctgag 121 aagcaactcc tctgtgaacc tactggatgt cccttctgtc tctattgtgg cgctgaacaa 181 gaaatccaaa ccgatcggca gaaaagcagc agaaaagaaa agatcttcta agaaggaggc 241 ttcgatgaag aaagtggtgc ggccccggac ccccctatct gcgccctgcg tggccacccg 301 caacagctgc aagccgccgg cacccgcctg ctgcgacccg tgcgcctcct gccagtgccg 361 cttcttccgc agcgcctgct cctgccgcgt gctcagcctc aactgctgag cgcccccact 421 cccggccgcg agcaggcagg gcttcgggga cgcggggcgc ttctcgggcg ggtgatccct 481 aacagggcgg cttcccaggg ctgcaggcgg gcggaggttc caggagatgg gacttcaggg 541 agacctggct tgggctaaaa tcgaaataca atatatatag gctg // LOCUS HUMASPAT 2339 bp mRNA PRI 15-SEP-1989 DEFINITION Human mitochondrial aspartate aminotransferase mRNA, complete cds. ACCESSION M22632 NID g179103 KEYWORDS aspartate aminotransferase. SOURCE Human liver, cDNA to mRNA, clones M-[1,2]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2339) AUTHORS Pol,S., Bousquet-Lemercier,B., Pave-Preux,M., Pawlak,A., Nalpas,B., Berthelot,P., Hanoune,J. and Barouki,R. TITLE Nucleotide sequence and tissue distribution of the human mitochondrial aspartate aminotransferase mRNA JOURNAL Biochem. Biophys. Res. Commun. 157, 1309-1315 (1988) MEDLINE 89087454 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by P. Stanislas, 17-FEB-1989. FEATURES Location/Qualifiers source 1..2339 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..2339 /note="AspAT mRNA" sig_peptide 9..96 /note="aspartate aminotransferase signal peptide" CDS 9..1301 /note="aspartate aminotransferase precursor (2.6.1.1)" /codon_start=1 /db_xref="PID:g179104" /translation="MALLHSGRVLPGIAAAFHPGLAAAASARASSWWTHVEMGPPDPI LGVTEAFKRDTNSKKMNLGVGAYRDDNGKPYVLPSVRKAEAQIAAKNLDKEYLPIGGL AEFCKASAELALGENSEVLKSGRFVTVQTISGTGALRIGASFLQRFFKFSRDVFLPKP TWGNHTPIFRDAGMQLQGYRYYDPKTCGFDFTGAVEDISKIPEQSVLLLHACAHNPTG VDPRPEQWKEIATVVKKRNLFAFFDMAYQGFASGDGDKDAWAVRHFIEQGINVCLCQS YAKNMGLYGERVGAFTMVCKDADEAKRVESQLKILIRPMYSNPPLNGARIAAAILNTP DLRKQWLQEVKGMADRIIGMRTQLVSNLKKEGSTHNWQHITDQIGMFCFTGLKPEQVE RLIKEFSIYMTKDGRISVAGVTSSNVGYLAHAIHQVTK" mat_peptide 97..1299 /note="aspartate aminotransferase" BASE COUNT 591 a 592 c 568 g 588 t ORIGIN Unreported. 1 ggtccaccat ggccctgctg cactccggcc gcgtcctccc cgggatcgcc gccgccttcc 61 acccgggcct cgccgccgcg gcctctgcca gagccagctc ctggtggacc catgtggaaa 121 tgggacctcc agatcccatt ctgggagtca ctgaagcctt taagagggac accaatagca 181 aaaagatgaa tctgggagtt ggtgcctacc gggatgataa tggaaagcct tacgttctgc 241 ctagcgtccg caaggcagag gcccagattg ccgcaaaaaa tttggacaag gaatacctgc 301 ccattggggg actggctgaa ttttgcaagg catctgcaga actagccctg ggtgagaaca 361 gcgaagtctt gaagagtggc cggtttgtca ctgtgcagac catttctgga actggagcct 421 taaggatcgg agccagtttt ctgcaaagat tttttaagtt cagccgagat gtctttctgc 481 ccaaaccaac ctggggaaac cacacaccca tcttcaggga tgctggcatg cagctacaag 541 gttatcggta ttatgacccc aagacttgcg gttttgactt cacaggcgct gtggaggata 601 tttcaaaaat accagagcag agtgttcttc ttctgcatgc ctgcgcccac aatcccacgg 661 gagtggaccc gcgtccggaa cagtggaagg aaatagcaac agtggtgaag aaaaggaatc 721 tctttgcgtt ctttgacatg gcctaccaag gctttgccag tggtgatggt gataaggatg 781 cctgggctgt gcgccacttc atcgaacagg gcattaatgt ttgcctctgc caatcatatg 841 ccaagaacat gggcttatat ggtgagcgtg taggagcctt cactatggtc tgcaaagatg 901 cggatgaagc caaaagggta gagtcacagt tgaagatctt gatccgtccc atgtattcca 961 accctcccct caatggggcc cggattgctg ctgccattct gaacacccca gatttgcgaa 1021 aacaatggct gcaagaagtg aaaggcatgg ctgaccgcat cattggcatg cggactcaac 1081 tggtctccaa cctcaagaag gagggttcca cccacaattg gcaacacatc accgaccaaa 1141 ttggcatgtt ctgtttcaca gggctaaagc ctgaacaggt ggagcggctg atcaaggagt 1201 tctccatcta catgacaaaa gatggccgca tctctgtggc aggggtcacc tccagcaacg 1261 tgggctacct tgcccatgcc attcaccagg tcaccaagta atgtccctgg gtcgaggaaa 1321 cagagacaac ctttctgtct tcagcctctg ctattgagag cttcacacag acaatgagag 1381 agggtggatg gtggtgagtg gatcatttct ttcagccaca gtgtgtaaca ctcagcattt 1441 gaatgtttct cagaaaagaa catgtagtga cacagggcag aggcatccat ggctggcgtc 1501 tggaatatta aaccaaactc tccccggtcc ttttttctcc aacttttctc aaagagttta 1561 catgtgcaag aaagtcatcg caccaaaaaa cctgtcaatt atgccattgc aatatttcag 1621 aagctttaac tgaagtgtca ggttcctcgt gagaaacaag cacaccttag aggctttgag 1681 agaaggccag ctagttctgt catgagtagt cggcctcgtg tctgtcctcc catcttggaa 1741 caaccttatc aacaggccgc actgcagaaa tgatgtttta tgaaaaccat gagctgctgc 1801 cactccagca agggaaataa tgcagtttcc tgtcttattt aagaaaaaga gaaggctctc 1861 ttttctccct tgtcattgcc gttcttttcc ttacacgcaa aacattttta actattgcag 1921 tttcatccca ttctactgct tgattgacca tcaactccat cctatcgaga ttatttaaga 1981 atgaagaaca taatttttct gctgatgccg taccctcacc cttttcagca aagaatagtg 2041 gagagtagga aactgtactt tatctcggca tcctcttgaa tgatagtgca agtttctcca 2101 gttgggatgt tgtctctgcc cggttggacc tcctcccttt gttgaatgtg gtgtgcagcc 2161 tctcatctca cactgtgagt ccagcggcgc agggtggtac caggaaagag gatattctag 2221 gcttgcgtgc tgctagctgg gttcaggctt cacccactgg aaagaaccac catctgctct 2281 aaccatgtag acttattgcg gcctggtttt ctctgttaca ataaaattac tgtagaccc // LOCUS HUMASPX 7787 bp mRNA PRI 31-OCT-1994 DEFINITION Human nonerythroid alpha-spectrin (SPTAN1) mRNA, complete cds. ACCESSION J05243 NID g179105 KEYWORDS alpha-fodrin; nonerythroid alpha-spectrin. SOURCE Human lung fibroblast cell line WI38, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7787) AUTHORS Moon,R.T. and McMahon,A.P. TITLE Generation of diversity in nonerythroid spectrins. Multiple polypeptides are predicted by sequence analysis of cDNAs encompassing the coding region of human nonerythroid alpha-spectrin JOURNAL J. Biol. Chem. 265 (8), 4427-4433 (1990) MEDLINE 90170948 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.T.Moon, 12-DEC-1989. FEATURES Location/Qualifiers source 1..7787 /organism="Homo sapiens" /db_xref="taxon:9606" /map="9q34.1" gene 103..7521 /gene="SPTAN1" CDS 103..7521 /gene="SPTAN1" /note="nonerythroid alpha-spectrin" /codon_start=1 /db_xref="GDB:G00-120-385" /db_xref="PID:g179106" /translation="MDPSGVKVLETAEDIQERRQQVLDRYHRFKELSTLRRQKLEDSY RFQFFQRDAEELEKWIQEKLQIASDENYKDPTNLQGKLQKHQAFEAEVQANSGAIVKL DETGNLMISEGHFASETIRTRLMELHRQWELLLEKMREKGIKLLQAQNLVQYLRECED VMDWINDKEAIVTSEELGQDLEHVEVLQKKFEEFQTDMAAHEERVNEVNQFAAKLIQE QHPEEELIKTKQDEVNAAWQRLKGLALQRQGKLFGAAEVQRFNRDVDETISWIKEKEQ LMASDDFGRDLASVQALLRKHEGLERDLAALEDKVKALCAEADRLQQSHPLSATQIQV KREELITNWEQIRTLAAERHARLNDSYRLQRFLADFRDLTSWVTEMKALINADELASD VAGAEALLDRHQEHKGEIDAHEDSFKSADESGQALLAAGHYASDEVREKLTVLSEERA ALLELWELRRQQYEQCMDLQLFYRDTEQVDNWMSKQEAFLLNEDLGDFLDSVEALLKK HEDFEKSLSAQEEKITALDEFATKLIQNNHYAMEDVATRRDALLSRRNALHERAMRRR AQLADSFHLQQFFRDSDELKSWVNEKMKTATDEAYKDPSNLQGKVQKHQAFEAELSAN QSRIDALEKAGQKLIDVNHYAKDEVAARMNEVISLWKKLLEATELKGIKLREANQQQQ FNRNVEDIELWLYEVEGHLASDDYGKDLTNVQNLQKKHALLEADVAAHQDRIDGVTIQ ARQFQDAGHFDAENIKKKQEALVARYEALKEPMVARKQKLADSLRLQQLFRDVEDEET WIREKEPIAASTNRGKDLIGVQNLLKKHQALQAEIAGHEPRIKAVTQKGNAMVEEGHF AAEDVKAKLHELNQKWEALKAKASQRRQDLEDSLQAQQYFADANEAESWMREKEPIVG STDYGKDEDSAEALLKKHEALMSDLSAYGSSIQALREQAQSCRQQVAPTDDETGKELV LALYDYQEKSPREVTMKKGDILTLLNSTNKDWWKVEVNDRQGFVPAAYVKKLDPAQSA SRENLLEEQGSIALRQEQIDNQTRITKEAGSVSLRMKQVEELYHSLLELGEKRKGMLE KSCKKFMLFREANELQQWINEKEAALTSEEVGADLEQVEVLQKKFDDFQKDLKANESR LKDINKVAEDLESEGLMAEEVQAVQQQEVYGMMPRDETDSKTASPWKSARLMVHTVAT FNSIKELNERWRSLQQLAEERSQLLGSAHEVQRFHRDADETKEWIEEKNQALNTDNYG HDLASVQALQRKHEGFERDLAALGDKVNSLGETAERLTQSHPESAEDLQEKCTELNQA WSSLGKRADQRKAKLGDSHDLQRFLSDFRDLMSWINGIRGLVSSDELAKDVTGAEALL ERHQEHRTEIDARAGTFQAFEQFGQQLLAHGHYASPEIKQKLDILDQERADLEKAWVQ RRMMLDQCLELQLFHRDCEQAENWMAAREAFLNTEDKGDSLDSVEALIKKHEDFDKAI NVQEEKIAALQAFADQLIAAGHYAKGDISSRRNEVLDRWRRLKAQMIEKRSKLGESQT LQQFSRDVDEIEAWISEKLQTASDESYKDPTNIQSKHQKHQAFEAELHANADRIRGVI DMGNSLIERGACAGNEDAVKARLAALADQWQFLVQKSAEKSQKLKEANKQQNFNTGIK DIAFWLSEVEALLASEDYGKDLASVNNLLKKHQLLEADISAHEDRLKDLNSQADSLMT SSAFDTSQVKDKRDTINGRFQKIKSMAASRRAKLNESHRLHQFFRDMDDEESWIKEKK LLVGSEDYGRDLTGVQNLRKKHKRLEAELAAHEPAIQGVLDTGKKLSDDNTIGKEEIQ QRLAQFVEHWKELKQLAAARGQRLEESLEYQQFVANVEEEEAWINEKMTLVASEDYGD TLAAIQGLLKKHEAFETAFTVHKDRVNDVCTNGQDLIKKNNHHEENISSKMKGLNGKV SDLEKAAAQRKANVDENSAFLQFNWKADVVESWIGEKENSLKTDDYGRDLSSVQTLLT KQETFDAGLQAFQQEGIANITALKDQLLAAKHVQSKAIEARHASLMKRWSQLLANSAA RKKKLLEAQSHFRKVEDLFLTFAKKASAFNSWFENAEEDLTDPVRCNSLEEIKALREA HDAFRSSLSSAQADFNQLAELDRQIKSFRVASNPYTWFTMEALEETWRNLQKIIKERE LELQKEQRRQEENDKLRQEFAQHANAFHQWIQETRTYLLDGSCMVEESGTLESQLEAT KRKHQEIRAMRSQLKKIEDLGAAMEEALILDNKYTEHSTVGLAQQWDQLDQLGMRMQH NLEQQIQARNTTGVTEEALKEFSMMFKHFDKDKSGRLNHQDGKSCLRSLGYDLPMVEE GEPDPEFEAILDTVDPNRDGHVSLQEYMAFMISRETENVKSSEEIESAFRALSSEGKP YVTKEELYQNLTREQADYCVSHMKPIVDGKGRELPTAFDYVEFTRSLFVN" BASE COUNT 2147 a 1926 c 2239 g 1475 t ORIGIN 1 gaattcgggg aacggtgtgg agcggaggcc gcggaggctc ctcggtcctt cagcacccct 61 cggcccgacg cacccacgcc cctcaccccc cgagagccga aaatggaccc aagtggggtc 121 aaagtgctgg aaacagcaga ggacatccag gagaggcggc agcaggtcct agaccgatac 181 caccgcttca aggaactctc aacccttagg cgtcagaagc tggaagattc ctatcgattc 241 cagttctttc aaagagatgc tgaagagctg gagaaatgga tacaggaaaa acttcagatt 301 gcatctgatg agaattataa agacccaacc aacttgcagg gaaagcttca gaagcatcaa 361 gcatttgaag ctgaagtgca ggccaactca ggagccattg ttaagctgga tgaaactgga 421 aacctgatga tctcagaagg gcattttgca tctgaaacca tacggacccg tttgatggag 481 ctgcaccgcc agtgggaatt acttttggag aagatgcgag aaaaaggaat caaattgctg 541 caggcccaga acttggtgca gtacttacga gaatgtgagg acgtgatgga ctggatcaat 601 gacaaggaag caattgttac ttctgaagag ctgggccagg atctggagca tgtagaggtt 661 ttacagaaga aatttgaaga gtttcaaaca gatatggctg ctcatgaaga aagagttaat 721 gaagtgaacc agtttgctgc caaactcata caggagcagc accctgagga ggaactgatc 781 aagactaagc aggatgaagt caatgcagcc tggcagcggc tgaagggcct ggctctgcag 841 aggcagggga agctctttgg ggcagcagaa gttcagcgct ttaacaggga tgtggatgag 901 actatcagtt ggattaagga aaaggagcag ttaatggcct ctgatgattt tggccgagac 961 ctggcaagtg ttcaggctct gcttcggaag cacgagggtc tggagagaga tcttgctgct 1021 ctagaagaca aggtcaaagc cctgtgtgct gaggctgacc gcctgcaaca gtcccaccct 1081 ctgagtgcaa cacagattca agtgaagcga gaggaactga ttacaaactg ggagcagatc 1141 cgcaccttgg cggcagagag acatgcacgg ctcaatgatt catacaggct tcaacgcttc 1201 cttgctgact tccgtgacct caccagctgg gtgactgaga tgaaagccct catcaatgca 1261 gatgagcttg ccagtgatgt ggctggggct gaagccctgc tagatagaca ccaagagcac 1321 aagggtgaaa ttgatgccca tgaagacagc ttcaaatctg cagatgaatc tggacaggca 1381 ctgcttgctg ctggtcacta tgcctcagat gaagtgaggg agaagctgac cgtcctttcc 1441 gaggagagag cggcgctgct ggagctgtgg gagctgcgca ggcagcagta cgagcagtgc 1501 atggacctgc agctcttcta ccgggacact gagcaggtgg acaactggat gagcaagcag 1561 gaggcgttcc tgttgaatga agacttggga gatttcttgg atagtgtgga agcgcttctt 1621 aagaagcacg aagactttga gaaatccctt agtgcccagg aggaaaagat tacagcatta 1681 gatgaatttg caaccaagct aattcagaac aaccactatg caatggaaga tgtggccact 1741 cgccgagacg ctctgttgag ccgccgcaat gcccttcacg agagagccat gcgtcgccgg 1801 gcccagctag ccgattcttt ccatctgcag cagtttttcc gtgattctga tgagctcaag 1861 agttgggtga atgagaagat gaaaactgcc acagatgaag cttataaaga tccatccaac 1921 ctacaaggaa aagtacagaa gcatcaggct tttgaggctg agctctcagc aaaccagagc 1981 cgaattgatg ccttggagaa agctggccaa aagctgattg atgtcaacca ctatgccaag 2041 gatgaagtgg cagctcgtat gaatgaggtg atcagtttgt ggaagaaact gctagaggcc 2101 actgaactga aaggaataaa gcttcgtgaa gccaaccagc aacagcaatt taatcgcaat 2161 gttgaggata ttgaattgtg gctatatgaa gtagaaggtc acttggcttc ggatgattac 2221 ggcaaagatc ttaccaatgt gcagaacctc cagaagaaac atgccctgct agaggcagat 2281 gtggctgctc accaggaccg aattgatggc gtcaccattc aggcccgcca gttccaagat 2341 gctggccatt ttgatgcaga aaacatcaag aagaaacagg aagccctcgt ggctcgctat 2401 gaggcactca aggagcccat ggttgcccgg aagcagaagc tggccgattc tctgcggttg 2461 cagcagctct tccgggatgt tgaggatgag gagacgtgga ttcgagagaa agagcccatt 2521 gccgcatcta ccaacagagg taaggattta attggggtcc agaatctgct aaagaaacat 2581 caagccttac aagcagaaat tgctggacat gaaccacgca tcaaagcagt tacacagaag 2641 gggaatgcca tggtggagga aggccatttt gctgcagagg atgtgaaggc caagcttcac 2701 gagctgaacc aaaagtggga ggcactgaaa gcaaaagctt cccagcgtcg gcaggacctg 2761 gaggactctc tgcaggccca gcagtacttt gctgatgcta acgaggctga atcctggatg 2821 cgggagaagg aacccattgt gggcagcact gactatggca aggacgaaga ctctgctgag 2881 gctctactga agaaacacga agctttgatg tcagatctca gtgcctacgg cagcagcatc 2941 caggctttgc gagaacaagc acagtcctgc cggcaacaag tggcccccac ggatgatgag 3001 actgggaagg agctggtctt ggctctctac gactatcagg agaagagtcc ccgagaggtc 3061 accatgaaga agggagatat ccttacctta ctcaacagca ccaacaagga ttggtggaaa 3121 gtggaagtga acgatcgtca gggttttgtg ccggctgcgt acgtgaagaa attggacccc 3181 gcccagtcag cctcccggga gaatctcctg gaggagcaag gcagcatagc actgcggcag 3241 gagcagattg acaatcagac acgcataact aaggaggccg gcagtgtatc tctgcgtatg 3301 aagcaggtgg aagaactata tcattctctg ctggaactgg gtgagaagcg taaaggcatg 3361 ttggagaaga gttgcaagaa gtttatgttg ttccgtgaag cgaatgaact acagcaatgg 3421 atcaatgaga aggaagccgc tctgacaagt gaggaggtcg gagcagactt ggagcaggtt 3481 gaggtgctcc agaagaagtt tgatgacttc cagaaggacc tgaaggccaa tgagtcacgg 3541 ttgaaggaca ttaacaaggt agctgaagac ctggagtctg aaggtcttat ggcagaggag 3601 gtgcaggctg tgcaacaaca ggaagtgtat ggcatgatgc ccagggatga aactgattcc 3661 aagacagcct ccccgtggaa gtctgctcgt ctgatggttc acaccgtggc cacctttaat 3721 tccatcaagg agctgaatga gcgctggcgg tccctacagc agctggccga ggaacggagc 3781 cagctcttgg gcagcgccca tgaagtacag aggttccaca gagatgctga tgaaaccaaa 3841 gaatggattg aagagaagaa tcaagctcta aacacagaca attatggaca tgatctcgcc 3901 agtgtccagg ccctgcaacg caagcatgag ggcttcgaga gggaccttgc ggctctcggt 3961 gacaaggtaa actcccttgg tgaaacagca gagcgcctga cccagtccca tcccgagtca 4021 gcagaagacc tgcaggaaaa gtgcacagag ttaaaccagg cctggagcag cctggggaaa 4081 cgtgcagatc agcgcaaggc aaagttgggt gactcccacg acctgcagcg cttccttagc 4141 gatttccggg acctcatgtc ttggatcaat ggaatacggg ggttggtgtc ctcagatgag 4201 ctagccaagg atgtcaccgg agctgaggca ttgctggagc gacaccagga acaccggaca 4261 gaaatcgatg ccagggctgg cactttccag gcatttgagc agtttggaca gcagctgttg 4321 gctcacggac actatgccag ccctgagatc aagcagaaac ttgatattct tgaccaggag 4381 cgtgcagacc tggagaaggc ctgggttcag cgcaggatga tgctggatca gtgccttgaa 4441 ctgcagctgt tccatcggga ctgtgagcaa gctgagaact ggatggctgc ccgggaggcc 4501 ttcttgaata ccgaagacaa aggagactca ctggacagcg tagaggctct gatcaaaaaa 4561 catgaagact ttgacaaagc gattaacgtc caggaagaga agattgctgc tctgcaggcc 4621 tttgccgacc agctcatcgc tgccggccat tatgccaagg gagacatttc tagccggcgc 4681 aatgaggtct tggacaggtg gcgacgtctg aaagcccaga tgattgagaa aaggtcaaag 4741 ctaggagaat ctcaaaccct ccaacagttc agccgggatg tggatgagat tgaggcttgg 4801 atcagtgaaa aattgcaaac agcgagtgat gagtcgtaca aggatcccac caacatccag 4861 agcaagcacc agaagcacca ggcttttgaa gcagagctgc atgccaacgc tgaccggatc 4921 cgtggggtta tcgacatggg caactccctc attgaacgtg gagcctgtgc cggcaatgag 4981 gatgctgtca aggcccgcct ggctgcctta gctgaccagt ggcaattctt ggtgcaaaag 5041 tcagcggaaa agagccagaa actgaaagaa gccaacaagc agcagaactt caacacaggg 5101 atcaaggaca ttgcattctg gctgtctgag gtggaggccc tgctggcatc cgaagattat 5161 ggcaaagacc tggcttctgt gaacaacctg ctgaaaaagc atcaactgct ggaagcagat 5221 atatctgccc atgaggatcg cctgaaggac ctgaacagcc aggcagacag cctgatgacc 5281 agcagtgcct tcgacacctc ccaagtaaag gacaagaggg acaccatcaa cgggcgcttc 5341 cagaagatca agagcatggc ggcctcccgg cgagccaagc tgaatgaatc ccatcgcctg 5401 caccagttct tccgggacat ggatgacgag gagtcctgga tcaaggagaa gaagctgctg 5461 gtgggctcag aggactacgg ccgggaccta actggcgtgc agaacctgag gaagaagcac 5521 aagcggctgg aagcagaact ggctgcgcat gagccggcta ttcagggtgt cctggacact 5581 ggcaagaagc tgtccgatga caacaccatc gggaaagagg agatccagca gcggctggcg 5641 cagtttgtgg agcactggaa agagctgaag cagctggcag ctgcccgggg tcagcggctg 5701 gaagagtcct tggaatatca gcagtttgta gccaatgtgg aagaggaaga agcctggatc 5761 aatgagaaaa tgaccctggt ggccagcgaa gattatggcg acactcttgc cgccatccag 5821 ggcttactga agaaacatga agcttttgag acagccttca ccgtccacaa ggatcgcgtg 5881 aatgatgtct gcaccaatgg acaagacctc attaagaaga acaatcacca tgaggagaac 5941 atctcttcaa agatgaaggg cctgaacggg aaagtgtcag acctggagaa agctgcagcc 6001 cagagaaagg cgaacgtgga tgagaactcg gccttccttc agttcaactg gaaggcggac 6061 gtggtggagt cctggatcgg tgaaaaggag aacagcttga agacagatga ttatggccga 6121 gacctgtctt ctgtgcagac gctcctcacc aaacaggaaa cttttgacgc tgggctgcag 6181 gccttccagc aggaaggcat tgccaacatc actgccctca aagatcagct tctcgccgcc 6241 aaacacgttc agtccaaggc catcgaggcc cggcacgcct ccctcatgaa gaggtggagc 6301 cagcttctgg ccaactcagc cgcccgcaag aagaagcttc tggaggctca gagtcacttc 6361 cgcaaggtgg aggacctctt cctgaccttc gccaaaaagg cttctgcctt caacagctgg 6421 tttgaaaatg cagaggagga cttaacagac cccgtgcgct gcaactcctt ggaagaaatc 6481 aaagctttgc gcgaggccca cgacgccttc cgctcctccc tcagctctgc ccaggctgac 6541 ttcaaccagc tggccgagct ggaccgccag atcaagagct tccgcgtagc ctccaacccc 6601 tacacctggt ttaccatgga ggccctggag gagacctgga ggaacctaca gaaaatcatc 6661 aaggagaggg agctggagct gcagaaggaa cagcggcggc aggaggagaa cgacaagctg 6721 cgccaggagt ttgcccagca cgccaacgcc ttccaccagt ggatccaaga gaccaggaca 6781 tacctcctcg atgggtcctg tatggtggaa gagtcgggga ccctcgaatc ccagcttgaa 6841 gctaccaaac gcaagcacca ggaaatccga gccatgagaa gtcagctcaa aaagatcgag 6901 gacctggggg ccgccatgga ggaggccctc atcctggaca acaagtacac ggagcacagc 6961 accgtgggcc tcgcccagca gtgggaccag ctggaccagc tgggcatgcg catgcagcac 7021 aacctggagc agcagatcca ggccaggaac acaacaggtg tgactgagga ggccctcaaa 7081 gaattcagca tgatgtttaa acactttgac aaggacaagt ctggcaggct gaaccatcag 7141 gatggcaaat cttgcctgcg ctccctgggc tatgacctgc ccatggtgga ggaaggggaa 7201 cctgaccctg agttcgaggc aatcctggac acggtggatc cgaacagaga tggccatgtc 7261 tccttgcaag aatacatggc tttcatgatc agccgcgaaa ctgagaacgt caagtccagc 7321 gaggagattg agagcgcctt ccgggccctc agctcagagg gaaagcctta cgtgaccaag 7381 gaggagctct accagaacct gacccgggaa caagccgact actgcgtctc ccacatgaag 7441 cccatcgtgg acggcaaggg ccgcgagctc cccaccgcgt tcgactacgt ggagttcacc 7501 cgctcgcttt tcgtgaactg agccactccc tgggtcaccc acccctcgct gcttgccctg 7561 cgtcgccttg ctgcatgtcc gctcctctgt gtgctctcac tttccactgt aaccttaagc 7621 ctgcttagct tggaataaga cttaggagaa aatggtgctt cactaacccg cttccggtcc 7681 agtcacaatc atcatgtcac tgtgggaccc agatctgtgt cttgaagcag ctgccctcat 7741 tccgacttca gaaaatcgaa gcagctggcg cctccccttc ggaattc // LOCUS HUMATF3X 1914 bp mRNA PRI 04-AUG-1994 DEFINITION Human activating transcription factor 3 (ATF3) mRNA, complete cds. ACCESSION L19871 NID g442421 KEYWORDS DNA-binding protein; activating transcription factor 3; transcription factor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1914) AUTHORS Chen,B.P., Liang,G., Whelan,J. and Hai,T. TITLE ATF3 and ATF3 delta Zip. Transcriptional repression versus activation by alternatively spliced isoforms JOURNAL J. Biol. Chem. 269 (22), 15819-15826 (1994) MEDLINE 94253175 FEATURES Location/Qualifiers source 1..1914 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" gene 165..710 /gene="ATF3" CDS 165..710 /gene="ATF3" /codon_start=1 /function="DNA binding protein, transcription factor" /product="activating transcription factor 3" /db_xref="PID:g442422" /translation="MMLQHPGQVSASEVSASAIVPCLSPPGSLVFEDFANLTPFVKEE LRFAIQNKHLCHRMSSALESVTVSDRPLGVSITKAEVAPEEDERKKRRRERNKIAAAK CRNKKKEKTECLQKESEKLESVNAELKAQIEELKNEKQHLIYMLNLHRPTCIVRAQNG RTPEDERNLFIQQIKEGTLQS" BASE COUNT 492 a 466 c 495 g 461 t ORIGIN 1 gcagccaggc gcgcactgca cagctctctt ctctcgccgc cgcccgagcg cacccttcag 61 cccgcgcgcc ggccgtgagt cctcggtgct cgcccgccgg ccagacaaac agcccgcccg 121 accccgtccc gaccctggcc gccccgagcg gagcctggag caaaatgatg cttcaacacc 181 caggccaggt ctctgcctcg gaagtgagtg cttctgccat cgtcccctgc ctgtcccctc 241 ctgggtcact ggtgtttgag gattttgcta acctgacgcc ctttgtcaag gaagagctga 301 ggtttgccat ccagaacaag cacctctgcc accggatgtc ctctgcgctg gaatcagtca 361 ctgtcagcga cagacccctc ggggtgtcca tcacaaaagc cgaggtagcc cctgaagaag 421 atgaaaggaa aaagaggcga cgagaaagaa ataagattgc agctgcaaag tgccgaaaca 481 agaagaagga gaagacggag tgcctgcaga aagagtcgga gaagctggaa agtgtgaatg 541 ctgaactgaa ggctcagatt gaggagctca agaacgagaa gcagcatttg atatacatgc 601 tcaaccttca tcggcccacg tgtattgtcc gggctcagaa tgggaggact ccagaagatg 661 agagaaacct ctttatccaa cagataaaag aaggaacatt gcagagctaa gcagtcgtgg 721 tatgggggcg actggggagt cctcattgaa tcctcatttt atacccaaaa ccctgaagcc 781 attggagagc tgtcttcctg tgtacctcta gaatcccagc agcagagaac catcaaggcg 841 ggagggcctg cagtgattca gcaggccctt cccattctgc cccagagtgg gtcttggacc 901 agggcaagtg catctttgcc tcaactccag gatttaggcc ttaacacact ggccattctt 961 atgttccaga tggcccccag ctggtgtcct gcccgccttt catctggatt ctacaaaaaa 1021 ccaggatgcc caccgttaga ttcaggcagc agtgtctgta cctcgggtgg gagggatggg 1081 gccatctcct tcaccgtggc taccattgtc actcgtaggg gatgtggagt gagaacagca 1141 tttagtgaag ttgtgcaacg gccagggttg tgctttctag caaatatgct gttatgtcca 1201 gaaattgtgt gtgcaagaaa actaggcaat gtactcttcc gatgtttgtg tcacacaaca 1261 ctgatgtgac ttttatatgc tttttctcag atctggtttc taagagtttt ggggggcggg 1321 gctgtcacca cgtgcagtat ctcaagatat tcaggtggcc agaagagctt gtcagcaaga 1381 ggaggaacag aattctccca gcgttaacac aaaatccatg ggcagcatga tggcaggtcc 1441 tctgttgcaa actcagttcc aaagtcacag gaagaaagca gaaagttcaa cttccaaagg 1501 gttaggactc tccactcaat gtcttaggtc aggagttgtg tctaggctgg aagagccaaa 1561 gaaatattcc attttccttt ccttgtggtt gaaaccacag tcagtggaga gatgtttgga 1621 acacagtcag tggagctggt ggtaccaggt ttagcattat tggatgtcaa aagcattttt 1681 tttgtcatgt agctgtttta agaaatctgg cccagggtgt ttgcagctgt gagaagtcac 1741 tcacactggc cacaaggacg ctggctactg tctattaaaa ttctgatgtt tctgtgaaat 1801 tctcagagtg tttaattgta ctcaatggta tcattacaat tttctgtaag agaaaatatt 1861 acttatttat cctagtattc ctaacctgtc agaataataa atattgtggt aaaa // LOCUS HUMATP2B2X 4979 bp mRNA PRI 27-OCT-1994 DEFINITION Human plasma membrane calcium ATPase isoform 2 (ATP2B2) mRNA, complete cds. ACCESSION L20977 NID g404701 KEYWORDS calcium ATPase; isoform 2; plasma membrane. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4979) AUTHORS Kuzmin,I., Stackhouse,T., Latif,F., Duh,F.M., Geil,L., Gnarra,J., Yao,M., Orcutt,M.L., Li,H., Tory,K., Le Paslier,D., Chumakov,I., Cohen,D., Chinault,A.C., Linehan,W.M., Lerman,M.I. and Zbar,B. TITLE One-megabase yeast artificial chromosome and 400-kilobase cosmid-phage contigs containing the von Hippel-Lindau tumor suppressor and Ca(2+)-transporting adenosine triphosphatase isoform 2 genes JOURNAL Cancer Res. 54 (9), 2486-2491 (1994) MEDLINE 94215191 REFERENCE 2 (bases 1 to 4979) AUTHORS Latif,F., Duh,F.-M., Gnarra,J.R., Tory,K., Kuzmin,I., Yao,M.C., Stackhouse,T., Modi,W., Geil,L., Schmidt,L., Li,H., Orcutt,M., Maher,E., Richards,F.O.Jr.., Ferguson-Smith,M.A., Le Paslier,D., Linehan,M., Zbar,B. and Lerman,M.I. TITLE von Hippel-Lindau syndrome: cloning and identification of the plasma membrane Ca(++)-transporting ATPase isoform 2 gene that resides in the von Hippel-Lindau gene region JOURNAL Cancer Res. 53 (4), 861-867 (1993) MEDLINE 93153786 FEATURES Location/Qualifiers source 1..4979 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="brain" /tissue_lib="lambda gt10" /map="3p25-26" gene 1..4978 /gene="ATP2B2" 5'UTR 1..576 /gene="ATP2B2" CDS 577..4173 /gene="ATP2B2" /EC_number="3.6.1.38" /note="PMCA-2" /codon_start=1 /product="plasma membrane calcium ATPase isoform 2" /db_xref="PID:g404702" /translation="MGDMTNSDFYSKNQRNESSHGGEFGCTMEELRSLMELRGTEAVV KIKETYGDTEAICRRLKTSPVEGLPGTAPDLEKRKQIFGQNFIPPKKPKTFLQLVWEA LQDVTLIILEIAAIISLGLSFYHRPGEGNEGCATAQGGAEDEGEAEAGWIEGAAILLS VICVVLVTAFNDWSKEKQFRGLQSRIEQEQKFTVVRAGQVVQIPVAEIVVGDIAQVKY GDLLPADGLFIQGNDLKIDESSLTGESDQVRKSVDKDPMLLSGTHVMEGSGRMLVTAV GVNSQTGIIFTLLGAGGEEEEKKDKKAKQQDGAAAMEMQPLKSAEGGDADDRKKASMH KKEKSVLQGKLTKLAVQIGKAGLVMSAITVIILVLYFTVDTFVVNKKPWLPECTPVYV QYFVKFFIIGVTVLVVAVPEGLPLAVTISLAYSVKKMMKDNNLVRHLDACETMGNATA ICSDKTGTLTTNRMTVVQAYVDDVHYKEIPDPSSINTKTMELLINAIAINSAYTTKIL PPEKEGALPRQVGNKTECGLLGFVLDLKQDYEPVRSQMPEEKLYKVYTFNSVRKSMST VIKLPDESFRMYSKGASEIVLKKCCKILNGAGEPRVFRPRDRDEMVKKVIEPMACDGL RTICVAYRDFPSSPEPDWDNENDILNELTCICVVGIEDPVRPEVPEAIRKCQRAGITV RMVTGDNINTARAIAIKCGIIHPGEDFLCLEGKEFNRRIRNEKGEIEQERIDKIWPKL RVLARSSPTDKHTLVKGIIDSTHTEQRQVVAVTGDGTNDGPALKKADVGFAMGIAGTD VAKEASDIILTDDNFSSIVKAVMWGRNVYDSISKFLQFQLTVNVVAVIVAFTGACITQ DSPLKAVQMLWVNLIMDTFASLALATEPPTETLLLRKPYGRNKPLISRTMMKNILGHA VYQLALIFTLLFVGEKMFQIDSGRNAPLHSPPSEHYTIIFNTFVMMQLFNEINARKIH GERNVFDGIFRNPIFCTIVLGTFAIQIVIVQFGGKPFSCSPLQLDQWMWCIFIGLGEL VWGQVIATIPTSRLKFLKEAGRLTQKEEIPEEELNEDVEEIDHAERELRRGQILWFRG LNRIQTQIRVVKAFRSSLYEGLEKPESRTSIHNFMAHPEFRIEDSQPHIPLIDDTDLE EDAALKQNSSPPSSLNKNNSAIDSGINLTTDTSKSATSSSPGSPIHSLETSL" 3'UTR 4171..4978 /gene="ATP2B2" polyA_signal 4863..4867 /gene="ATP2B2" polyA_site 4979 /gene="ATP2B2" BASE COUNT 1165 a 1462 c 1384 g 968 t ORIGIN 1 gggcagccgc ggcagaggag ccgcagccgc agcggggccg ggccgggccg ggcgcgcacc 61 agcggcagcg gcagcggcag cggcggcggc agcatctcgc tctcggagcc ggcgcaggtt 121 tccctgtact tagagcgtgg actgtgaacc cccaagcaga cgagctgaac ttcgtcaccc 181 agtccctaga gccagcaaga catgggcccc agtttccaga ccctgacacc tctcatttaa 241 ccagaagacg tggaggggag ccaccacccc tgaccatgta gatgccagtt ccagggagca 301 gcatgggccc cactgaatgg agactcctgg gtctacagcc ctgagcccct ccggcccctg 361 gacctcgtcc cacaccggag gacacctctt ggagctcacc accactgtca ccagcccgcc 421 tcggccaccc ccaccccccg ggacccggag tcggccgcct ggtgccacag ctgaccagtg 481 agggtgtgct gaggacagcc acaagcagcc atcacccggc agcctcttgt ccagcgctga 541 cccttgggcc cagcccgagc aaggaccgca gcaaacatgg gtgacatgac caacagcgac 601 ttttactcca aaaaccaaag aaatgagtcg agccatgggg gcgagttcgg gtgcacaatg 661 gaggagctcc gctccctcat ggagctgcgg ggcactgagg ctgtggtcaa gatcaaggag 721 acttatgggg acaccgaagc catctgccgg cgcctcaaaa cctcacctgt tgaaggtttg 781 ccgggcaccg ctccagacct ggaaaagaga aagcaaattt ttgggcaaaa ctttatacct 841 ccaaagaagc caaaaacctt cctgcagctc gtgtgggagg cgctgcagga cgtgacgctc 901 atcatcctgg agattgccgc catcatctcc ctggggctgt ccttctacca ccgccccggc 961 gagggcaacg aaggatgtgc gacggcccag ggtggggcag aggatgaagg agaggcagag 1021 gcaggttgga tcgagggggc cgccattctc ctctcagtta tctgtgtggt cctggtcacg 1081 gccttcaatg actggagcaa agagaaacag ttccggggcc tgcagagccg catcgagcag 1141 gaacagaaat ttaccgtggt ccgggctggc caggtggtcc agatccctgt ggctgagatc 1201 gtggttgggg acatagccca ggtcaaatat ggtgacctcc tccctgccga cggactcttc 1261 atccagggca atgacctcaa gattgatgaa agctccctaa ctggagagtc tgaccaggtg 1321 cgcaagtccg tggacaagga ccccatgctg ctgtcaggaa cccacgtgat ggagggctca 1381 ggacggatgt tggtgactgc tgtgggtgtg aactctcaga ctggcatcat ctttaccctc 1441 ctgggggctg gtggtgaaga ggaagagaag aaagacaaaa aagccaaaca gcaggacggg 1501 gcagccgcca tggagatgca gcccctcaag agtgccgagg gcggcgacgc tgacgacagg 1561 aagaaggcca gcatgcacaa gaaggagaag tccgtgctgc agggcaagct caccaagctg 1621 gctgtgcaga tcgggaaggc gggcttggtg atgtcagcca tcacggtgat catcctggtg 1681 ctctacttca ctgtggacac cttcgtggtc aacaagaagc cgtggctgcc tgagtgcacg 1741 cccgtctacg tgcagtactt tgtcaagttc ttcatcattg gcgtgacggt gctggtggtc 1801 gccgtgcccg aggggctccc tctggccgtc accatctcgt tggcctattc ggtgaagaaa 1861 atgatgaagg acaacaacct ggtacgccac ctggatgcct gtgagaccat gggcaatgcc 1921 acagccatct gctcagacaa gacaggcacg ctgaccacca atcgcatgac agtggtacag 1981 gcctatgtcg acgacgtcca ctataaagag atccccgacc ccagctccat caacaccaag 2041 actatggagc tgctgatcaa tgccatcgcc atcaacagcg cctacaccac caagattctg 2101 cccccagaga aggagggcgc cctgcctcgg caggtgggca acaagacgga gtgcggcctg 2161 ctgggcttcg tgctggacct gaagcaggac tacgagcccg tgcgcagcca gatgccagag 2221 gagaagttgt acaaagtgta caccttcaac tccgtgcgca agtccatgag cactgtcatc 2281 aagctgcccg acgagagctt ccgcatgtac agcaaggggg cttctgagat cgtgctcaag 2341 aagtgctgca aaatcctcaa tggggcggga gagcctcgtg tcttccggcc ccgcgaccgg 2401 gacgagatgg taaagaaggt gattgagccc atggcttgcg atgggctccg cactatctgc 2461 gtggcctacc gcgacttccc cagcagcccg gagccggact gggacaatga gaatgacatc 2521 ctcaacgaac tcacctgcat ctgcgtggtg ggcatcgagg acccggtgcg gccagaggtc 2581 ccagaagcca tccgcaagtg ccagcgggca ggcatcacgg tccgcatggt cactggcgac 2641 aatatcaaca cggctcgggc catcgccatc aagtgtggca tcatccatcc tggggaggac 2701 tttctgtgcc tcgagggcaa ggagttcaac aggaggatcc gcaacgagaa gggggagatt 2761 gagcaggagc gaattgacaa gatctggcca aagctgcggg tgctggctcg ctcctcccca 2821 acggacaagc ataccctggt taaaggcatc atcgacagca cacacactga gcagcggcag 2881 gtggtggccg tgacggggga cgggaccaac gacgggcctg cactcaagaa ggccgacgtg 2941 ggcttcgcca tgggcatcgc aggcactgac gtggccaagg aggcctcaga catcatcctg 3001 acagacgaca atttcagcag catcgtcaag gcagtgatgt ggggccgcaa cgtctatgac 3061 agcatctcca aattcttgca gttccagctc accgtcaacg tggtggccgt gattgtggcc 3121 ttcacaggcg cctgcatcac gcaggactcc cctctgaagg ccgtgcagat gctctgggtg 3181 aacctcatca tggacacgtt tgcctcgctg gcactggcca ctgagccgcc cacggagacc 3241 ctgctgctga ggaagccgta cggccgcaac aagccgctca tctccaggac catgatgaag 3301 aacatcctgg gccatgctgt ctaccagctt gccctcatct tcaccctgct ctttgttggc 3361 gagaagatgt tccagatcga cagcgggagg aacgcgcccc tgcattcgcc accctcagaa 3421 cattacacca tcatcttcaa caccttcgtc atgatgcagc tcttcaacga gatcaacgcc 3481 cgcaagatcc acggcgagcg caatgtcttt gacggcatct tccggaaccc catcttctgc 3541 accatcgtgc tgggcacctt tgccatccag atagtgatcg tgcagtttgg agggaagcca 3601 ttcagctgct ctccactgca gctggaccag tggatgtggt gcatattcat tgggttagga 3661 gagctcgttt ggggccaggt catcgccacc atcccgacca gcagactcaa gttcctcaag 3721 gaggcaggca ggctcacaca gaaggaggag atcccggagg aggagctcaa cgaggacgtg 3781 gaggagatcg accacgcaga gcgggagctg cggcggggcc agatcctgtg gttccgaggc 3841 ctgaatcgga tccagacaca gatccgcgtc gtgaaggcgt tccgtagctc tctctatgaa 3901 ggtttagaaa agcctgaatc tcgaacctcc atccataact tcatggctca tcctgaattc 3961 cggatcgaag attcccagcc ccacatcccc ctcattgatg acaccgacct ggaagaagat 4021 gccgcgctca agcagaactc gagcccgccg tcatccctca acaagaacaa cagcgccatc 4081 gacagtggga tcaacctgac gaccgacaca agcaaatcag ctacctcttc aagtccaggg 4141 agccccatcc acagcctgga gacgtcgctt tagctgagga ccctgtcgcc tgcccgcccg 4201 ccctcatgga ccccgctgtc acccgctttc cgggcaccca tccatccagg cacccaactc 4261 acccaagcag caacgagcaa caatcggaaa ccaaatactg gagagaaaac caacgtttcc 4321 acccacagac cctttctctg gctgcgatgc tgtttgaact ctttttcact tcaaggcaag 4381 gggcgggatc tccactgggg gcttacggga gtgagcggtt ttcccaaaac aagcccttcc 4441 tggctcccac ccagacatgg accagccatg cacccgccca gtcaccacgt cccccgcatg 4501 aatgtactgt acactttcaa tcctcccctt gtttggtttt tgggggttgg ggaggggttt 4561 ttgtttgttt gtttgttttc ttaggcggga actgcaaaca gactcttttc tgagactatt 4621 tatccaatcc actggtctgt gagtttttga aacgcttgca cagcatggtc tcagttgtat 4681 agattaattt aataactttt tgaaattgca gagcttaact cgcctagtag atttgcacca 4741 atggaaccga agaacttcat agacactcac aaggttatat ccatttcttt gtatctatat 4801 caacgtatac ttttccgaga ctgtatacgt ccatatagat aggtagatat atatatatat 4861 atataaatat atatacatgg atatataaag tttctttgcc ggcatgttgc cttgtttccg 4921 cttaaattgc tctattttaa cttatttatg tcctaaaaga agaatgtaat ttgtttaca // LOCUS HUMATPAVAC 3118 bp mRNA PRI 28-APR-1993 DEFINITION Human vacuolar ATPase (isoform HO68) mRNA, complete cds. ACCESSION L09234 NID g291865 KEYWORDS ATPase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3118) AUTHORS van Hille,B.J.M., Richener,H., Evans,D.B., Green,J.R. and Bilbe,G. TITLE Identification of two isoforms of the vacuolar H-ATPase in human osteoclastom JOURNAL J. Biol. Chem. 268, 7075-7080 (1993) MEDLINE 93216643 FEATURES Location/Qualifiers source 1..3118 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 60..1907 /note="isoform H068" /codon_start=1 /product="ATPase" /db_xref="PID:g291866" /translation="MTSTLIKTSDEDRESKFGFVFAVSGPVVTAERMAGSAMYELVRV GYYELVGEIIRLEGDMATIQVYEDTSGVTVGDPVLRTGKPLSVELGPGIMGSIFDGIQ RPLKDINELSNSIYIPKGVNVPALSRTAQWDFSPVSVKVGSHITGGDLYGLVHENTLV KHKLLLPPRAKGTVTYIAEPGNYTVDDVVLETEFDGERSKFTMLQVWPVRQPRPVTEK LPANYPLLTGQRVLDSLFPCVQGGTTAIPGAFGCGKTVISQSLSKYSNSDVIIYVGCG ERGNEMSEVLRDFPQLSLEIDGVTESIMKRTALVANTSNMPVAAREASIYTGITLSEY FRDMGYNVSMMADSTSRWAEALREISGRLAEMPADSGYPAYLGARLASFYERAGRVKC LGNPDREGSVSIVGAVSPPGGDFSDPVTTATLGIVQVFWGLDKKLAQRKHFPSINWLI SYSKYMRALDDFYDKNFPEFVPLRTKVKEILQEEEDLSEIVQLVGKASLAETDKITLE VAKLLKDDFLQQNSYSPYDRFCPFYKTVGMLKNMIAFYDMSRHAVESTAQSENKITWN VIRDSMGNILYQLSSMKFKDPVKDGEAKIKADFEQLHEDIQQAFRNLED" polyA_signal 3101..3106 BASE COUNT 842 a 649 c 737 g 890 t ORIGIN 1 gaattccggc agctgactag tcttgtgatt ggggtcctgg gctgataaaa tcattccaaa 61 tgacgagcac attgataaag acgtccgatg aggaccggga gtccaaattc ggctttgttt 121 ttgccgtatc tggacctgtg gtgacagctg aacgaatggc cggttctgct atgtacgaac 181 tggtgcgtgt cggttattat gaactggtcg gagagatcat ccggttggag ggtgacatgg 241 caacaatcca agtatacgaa gacacctcag gtgtgacagt aggcgatccc gtgctgcgca 301 caggcaagcc gctgtccgtg gaactgggac ccggaatcat gggcagcatc ttcgacggta 361 tccagcgacc gctgaaggat atcaatgaac tgtcaaatag tatctacatc ccgaaaggtg 421 tcaatgtgcc tgccctgagt cgcactgcac agtgggactt cagtcccgtc agtgtcaagg 481 ttggaagcca cattactggt ggtgacctgt acggtttggt ccacgaaaat actctggtga 541 aacacaagtt gctgctgccg ccccgtgcca agggaactgt cacgtacatt gcagaacctg 601 gaaactacac agttgatgat gttgtcctgg agacagaatt tgacggcgag cgatcaaagt 661 tcaccatgct gcaagtgtgg cctgtacgtc agcccaggcc tgttacagaa aagttgccag 721 ctaactaccc cctccttact ggccagcgtg tgctcgactc cctattcccg tgtgtccagg 781 gtggaacaac agctattcct ggggccttcg gatgtggcaa gactgtaata tcacagtctt 841 tgtcaaaata ctcaaactcc gatgtaatta tctatgtagg ttgtggtgag cgaggtaatg 901 aaatgtcaga agtactcagg gatttcccgc agttgtcgtt ggagattgat ggtgtgactg 961 aatcaatcat gaagagaaca gccctggtcg caaacacatc aaacatgcct gtggctgctc 1021 gagaagcatc tatctacaca ggtattacac tgtcagaata cttcagggac atgggttaca 1081 atgtatccat gatggctgac tcaacttcac gatgggccga agctcttcga gaaatctcag 1141 gtcgattggc tgaaatgcct gccgacagcg gttatcccgc ctacctaggt gcacgacttg 1201 ccagtttcta cgagcgtgcc ggccgtgtga agtgcttggg taacccagac agggagggct 1261 ccgtgagtat agtgggcgcc gtgtcgccgc ccggtggaga cttctcagat cccgtgacga 1321 cggccacact aggtatcgtc caggtgttct ggggtctcga caagaaactt gcccagcgaa 1381 agcacttccc atccatcaac tggctcatct cgtacagtaa atacatgcgt gctctggatg 1441 acttctacga caagaatttc ccagagtttg tcccactgcg tacaaaggtg aaggagattt 1501 tgcaggagga agaagacctg tctgaaattg tgcagttggt cggtaaagct tcattggcag 1561 aaactgacaa gatcacactt gaggttgcca aactattaaa ggatgatttc ctgcaacaga 1621 acagctattc accatatgac cgtttctgcc cattctacaa gacagtagga atgctgaaaa 1681 atatgattgc tttctacgat atgtctcggc atgcagttga atctactgct cagagcgaga 1741 acaagatcac ttggaatgtt attagagatt ctatgggcaa tattctgtat cagctttcct 1801 ccatgaaatt caaggatcca gtcaaggatg gagaagcgaa gatcaaggca gactttgagc 1861 agcttcatga agacattcag caagccttca ggaacctgga ggattaaagt ggtagctgcc 1921 agtggttctc tcggtgcagt tgtcacattt ggcaagctct gtagggttgc cgagtggcat 1981 cggtgctaga cacctgagca ttcctttgcc acataaagac taaagcaggt ggaatttcag 2041 ttgtaaaaag ctggttccat tggtgctaag attatgttgt gcccttttct gcttctcaca 2101 ttccaacaga ggaatttact tccagttttc ttccattttc ctcctcattt taagtgtcgg 2161 tacagaggca ataatctgat aactctgtac cgtcacttac aagcagggag aatttgtaat 2221 tattacaaat cccattatct ctgtgcacca cagccttgta aattcatttg tcccaggact 2281 ccctcttgtg tgtacgtgag attgccgtct gtatgtatgt acacaccgta ctgcagtatt 2341 tgaagtcagt cagaaggtga attacaccac ttactcattg tgtcacgtag caagtgtgca 2401 aactgccatc cattgtccta tttattcaca taactagttt tctttgcatt tccagtgttg 2461 caaattgtgt ttagaaaatt atgccatcga gactggtcga acctcacatt gtaactcagt 2521 atttacacac acgtttactt gctacagaaa tgtagaaaaa ataattgttg tatattgaaa 2581 gtacaagtga caaagttgca tttaaaatgg tgaatgtatt ttatatttct tttgtagaca 2641 caagagttaa tgcattttgc ttaatggaga tgtatgtaaa cctaaaatag cagtttgtgc 2701 acaaattatg tatatgtgaa atggagatgg tttctaattt gctgattgat tgccagtatt 2761 aatttaaaca actgtagttg tgggatgtag tgggaagatt ttttttttcc tataaaattg 2821 gtggatgtat gtgtcggaga ttttgattgt atgtgtaaaa tagtgatccc agtaactgta 2881 aagctttaga atacagttac tgactgtata gttgtacagg tgttgttact tttaagaatt 2941 tattgacaca aaggtgaaag tctattattg tattgtaatg tttaaagcat ttaaggttta 3001 aaaatcctac ttctgtgtat aaatgttacc attcctcata taacataact gtgtagaaat 3061 acagtcaact tcatgttcat tagcatttca ctgttgtcac ataaattatg cccggaat // LOCUS HUMATPAVAD 1924 bp mRNA PRI 12-DEC-1995 DEFINITION Human vacuolar ATPase (isoform VA68) mRNA, complete cds. ACCESSION L09235 NID g291867 KEYWORDS ATPase. SOURCE Homo sapiens cDNA to mRNA; and Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1924) AUTHORS van Hille,B., Richener,H., Evans,D.B., Green,J.R. and Bilbe,G. TITLE Identification of two subunit A isoforms of the vacuolar H(+)-ATPase in human osteoclastoma JOURNAL J. Biol. Chem. 268 (10), 7075-7080 (1993) MEDLINE 93216643 FEATURES Location/Qualifiers source 1..1924 /organism="Homo sapiens" /db_xref="taxon:9606" exon 1..97 /number=1 CDS 16..1869 /codon_start=1 /product="ATPase" /db_xref="PID:g291868" /translation="MDFSKLPKILDEDKESTFGYVHGVSGPVVTACDMAGAAMYELVR VGHSELVGEIIRLEGDMATIQVYEETCGVSVGDPVLRTGKPLSVDVGPGIMGAIFDGI QRPLSDISSQTQSIYIPRGVNVSALSRDIKWDFTPCKNLRVGSHITGGDIYGIVSENS LIKHKIMLPPRNRGTVTYIAPPGNYDTSDVVLELEFEGVKEKFTMVQVWPARQVRPVT EKLPANHPLLTGQRVLDALFPCVQGGTTAIPGAFGCGKTVISQSLSKYSNSDVIIYVG CGERGNEMSEVLRDFPELTMEVDGKVESIMKRTALVANTSNMPVAAREASIYTGITLS EYFRDMGYHVSMMADSTSRWAEALREISGRLAEMPADSGYPAYLGARLASFYERAGRV KCLGNPEREGSVSIVGAVSPPGGDFSDPVTSATLGIVQVFWGLDKKLAQRKHFPSVNW LISYSKYMRALDEYYDKHFTEFVPLRTKAKEILQEEEDLAEIVQLVGKASLAETDKIT LEVAKLIKDDFLQQNGYTPYDRFCPFYKTVGMLSNMIAFYDMARRAVETTAQSDNKIT WSIIREHMGDILYKLSSMKFKDPLKDGEAKIKSDYAQLLEDMQNAFRSLED" exon 98..228 /number=2 exon 229..440 /number=3 BASE COUNT 535 a 399 c 468 g 522 t ORIGIN 1 taggaaacta acattatgga tttttccaag ctacccaaaa tactcgatga agataaagaa 61 agcacatttg gttatgtgca tggggtctca ggacctgtgg ttacagcctg tgacatggcg 121 ggtgcagcca tgtatgagct ggtgagagtg ggccacagcg aattggttgg agagattatt 181 cgattggagg gtgacatggc tactattcag gtgtatgaag aaacttgtgg tgtgtctgtt 241 ggagatcctg tacttcgcac tggtaaaccc ctctctgtag acgttggtcc tggcattatg 301 ggagccattt ttgatggtat tcaaagacct ttgtcggata tcagcagtca gacccaaagc 361 atctacatcc ccagaggagt aaacgtgtct gctcttagca gagatatcaa atgggacttt 421 acaccttgca aaaacctacg ggttggtagt catatcactg gcggagacat ttatggaatt 481 gtcagtgaga actcgcttat caaacacaaa atcatgttac ccccacgaaa cagaggaact 541 gtaacttaca ttgctccacc tgggaattat gatacctctg atgttgtctt ggagcttgaa 601 tttgaaggtg taaaggagaa gttcaccatg gtgcaagtat ggcctgcacg tcaagttcga 661 cctgtcactg agaagctgcc agccaatcat cctctgttga ctggccagag agtccttgat 721 gccctttttc cgtgtgtcca gggaggaact actgctatcc ctggagcctt tggctgtgga 781 aagacagtga tatcacagtc tctatccaag tattctaaca gtgatgtaat catctatgta 841 ggatgtggtg aaagaggaaa tgagatgtct gaagtcctcc gggacttccc agagctcaca 901 atggaggttg atggtaaggt agagtcaatt atgaagagga cagctttggt agccaatacc 961 tccaatatgc ctgttgctgc tagagaagcc tctatttata ctggaatcac actgtcagag 1021 tacttccgtg acatgggcta tcatgtcagt atgatggctg actctacctc tagatgggct 1081 gaggccctta gagaaatctc tggtcgttta gctgaaatgc ctgcagatag tggatatcca 1141 gcctatcttg gtgcccgtct ggcctcgttt tatgaacgag caggcagggt gaaatgtctt 1201 ggaaatcctg aaagagaagg gagtgtcagc attgtaggag cagtttctcc acctggtggt 1261 gatttttctg atccagttac atctgccact cttggtatcg ttcaggtgtt ctggggctta 1321 gataagaaac tagctcaacg taagcatttc ccctctgtca attggctcat cagctacagc 1381 aagtatatgc gtgccttgga tgaatactat gacaaacact tcacagagtt cgttcctctg 1441 aggacgaaag ctaaggaaat tctgcaggaa gaagaagacc tggcagaaat tgtacagctt 1501 gtgggaaagg cttctttggc agaaacagat aaaatcactc tggaggtagc aaaacttatc 1561 aaagatgatt tcctacaaca aaatggatat actccttatg acaggttctg cccattctac 1621 aagacagtag ggatgctgtc caacatgatt gcattttatg atatggctcg tagagctgtt 1681 gaaaccactg cccagagtga caataaaatc acatggtcca ttattcgtga gcacatggga 1741 gacatcctct ataaactttc ctccatgaaa ttcaaggatc cactgaaaga tggtgaggca 1801 aagatcaaaa gcgactatgc acaacttctt gaagacatgc agaatgcatt ccgtagcctt 1861 gaagattaga agccttgaag attacaactg tgatttcctt ttcctcagca agctcctccg 1921 gaat // LOCUS HUMATPBII 2773 bp mRNA PRI 31-OCT-1994 DEFINITION Human sodium/potassium ATPase beta-2 subunit (atpb2) mRNA, complete cds. ACCESSION M81181 NID g291869 KEYWORDS Na,K-ATPase beta 2 subunit; adhesion molecule; sodium/potassium ATPase; transmembrane glycoprotein. SOURCE Homo sapiens adult retina cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Martin-Vasallo,P., Dackowski,W., Emanuel,J.R. and Levenson,R. TITLE Identification of a putative isoform of the Na,K-ATPase beta subunit. Primary structure and tissue-specific expression JOURNAL J. Biol. Chem. 264 (8), 4613-4618 (1989) MEDLINE 89174720 REFERENCE 2 (bases 1 to 2773) AUTHORS Hernando,N., Martin-Vasallo,P., Ghosh,S., Ghosh,P.K., Swaroop,A. and Coca-Prados,M. TITLE Nucleotide sequence of a cDNA for the beta 2 subunit isoform of Na+,K(+)-ATPase from human retina JOURNAL Biochim. Biophys. Acta 1189 (1), 109-111 (1994) MEDLINE 94137737 FEATURES Location/Qualifiers source 1..2773 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="retina" /map="17" gene 408..1280 /gene="ATP1B2" CDS 408..1280 /gene="ATP1B2" /standard_name="sodium/potassium-transporting ATPase" /EC_number="3.6.1.37" /codon_start=1 /db_xref="GDB:G00-118-756" /product="Na/K-ATPase beta 2 subunit" /db_xref="PID:g179245" /translation="MVIQKEKKSCGQVVEEWKEFVWNPRTHQFMGRTGTSWAFILLFY LVFYGFPTAMFTLTMWVMLQTVSDHTPKYQDRLATPGLMIRPKTENLDVIVNVSDTES WDQHVQKLNKFLEPYNDSMQAQKNDVCRPGRYYEQPDNGVLNYPKLACQFNRTQLGNC SGIGDSTHYGYSTGQPCVFIKMNRVINFYAGANQSMNVTCAGKRDEDAENLGNFVMFP ANGNIDLMYFPYYGKKFHVNYTQPLVAVKFLNVTPNVEVNVECRINAANIATDDERDK FAGRVAFKLRINKT" BASE COUNT 527 a 889 c 625 g 732 t ORIGIN 1 ccacctctct ctgccttttt gtaccccgct ttttttctgc gttctgctcg gtttttgtag 61 ccgtctgttt ttgcacccca tttcgttttg tttctagacg gtttggcggg gggtgaagct 121 gcattcatac cccttcctct tgttattctc ccctgctctg acagcacccc ttttcatcgc 181 agttgggggg cctaggatcg gtgcatcttc cgccgcgctg ccagcacccc gcagcgcgtg 241 gtcgtgcacc ccggaatctg cagcagctgc atatctgagg ggggtctcct ttgcccgcgc 301 cgccttcgct ccccgtgctt ttgggtgtgt ggagggcttc agtgcgcggc gcccccgctt 361 ctccgcaacc ccccgccccg cgcccggact cgccccgcgc caccaagatg gtcatccaga 421 aagagaagaa gagctgcggg caggtggttg aggagtggaa ggagttcgtg tggaacccga 481 ggacgcacca gtttatgggc cgcaccggga ccagctgggc ctttatcctc ctcttctacc 541 tcgtttttta tgggttcccc accgccatgt tcaccctcac catgtgggtg atgctgcaga 601 ctgtctccga ccataccccc aagtaccagg accgactggc cacaccgggc ttgatgattc 661 gccccaagac tgagaacctt gatgtcattg tcaatgtcag tgacactgaa agctgggacc 721 agcatgttca gaagctcaac aagttcttgg agccttacaa cgactctatg caagcccaaa 781 agaatgatgt ctgccgccct gggcgctatt acgaacagcc agataatgga gtcctcaact 841 accccaaact ggcctgccaa ttcaaccgga cccagctggg caactgctcc ggcattgggg 901 actccaccca ctatggttac agcactgggc agccctgtgt cttcatcaag atgaaccggg 961 tcatcaactt ctatgcagga gcaaaccaga gcatgaatgt tacctgtgct gggaagcgag 1021 atgaagatgc tgagaatctc ggcaacttcg tcatgttccc cgccaacggc aacatcgacc 1081 tcatgtactt cccctactat ggcaaaaagt tccacgtgaa ctacacacag cccctggtgg 1141 ctgtgaagtt cctgaatgtg acccccaacg tggaggtgaa tgtagaatgt cgcatcaacg 1201 ccgccaacat cgccacagac gatgagcgag acaagttcgc cggccgcgtg gccttcaaac 1261 tccgcatcaa caaaacctga ggccccttcc tcccacccca tctctctcct gtggatgctc 1321 ctggaatgtc cctgaccctg cctgatccct ccctcaccca ccccaaaggt atttttgata 1381 acagagctat gacttgtctg agcctcacat ccttttcctt gacttctcaa cccagcctga 1441 agtccattgc ggttccgtca ctcgcctttc ccaccaactt ctcccaacct cagatcagtc 1501 agacagggag ctgggctaag atggccacag aggagttagg agcctttcta gttctggttt 1561 agctgtgaga gctatccact ctcctgcctg catatcccct gagagttatc ggaagtgccc 1621 actgacccac ccacccacct acaccccccg ccacacacac acacaaacgt gcacacgcag 1681 tctcatttga cccctttgct tccagagatg aatgtggcac tccctccttc cattcctaag 1741 ctctagccac cgtcccttga tctctcatac tttctccctg tctacacagt cgccatcttg 1801 gtgactttga atttatctgg ctcctgggca ggtcttctcc tcctctccat ccctattccc 1861 tcctctgaaa tgcacccctt tgtaattgag gacaaggtgg ttctgtggcc ttttccctct 1921 ttgctggcac gttctgcttc tcaccctctg gtgactctgt gagctgggaa atgcgggact 1981 ggaagtgagg cctgtgttga cccttcctga aaatcctcta gcagcccccg acttcagcag 2041 tttctttctt tgtttttttg agatggagtt tcgctcttgt tgcccaggct ggagtgcaat 2101 ggtgcaatct cagctcactg caacttccgc atcccaggtt caagcgattc tcccgcctca 2161 ggttcccgag tagctgggac tacaggcatg tgccaccatg cccggctaat ttctttcttt 2221 cttttttttt ttttttttgc attttttagt agagatgggg gtttctcctt gttggtcagg 2281 ctggtctcga actcccgacc tcaggtgatc cacctgcctc ggcctcccaa agtgttggga 2341 ttacaggcgt gagccaccgc gcctggcctt cagtttcttc ctaggccgtt ctgtcaccca 2401 aatagctgct acccagaggg gcggggttga cctaggctga atatccactt tgtttttatg 2461 gatggctccc ttcccccatt cgccttccca gaatatcctt caagttccac ttcccaggga 2521 gctctggggg aggggcggcc attctggctc cgtccccagt ggccaccttg gaaacatcgg 2581 ctggctttgg gactattcca cctccttccc ctgagcccag atctgccccc accatccttt 2641 ctctggcttc ttttagcaag ttatcaacta atcactaact ccttcctttt cctctgcatg 2701 ccagcctgaa aattccaaat ctagcctctg aatgtcttgg ctccatctct tcagacccct 2761 ttgcctttaa aaa // LOCUS HUMATPCU 8478 bp mRNA PRI 07-MAY-1993 DEFINITION Human putative Cu++-transporting P-type ATPase mRNA, complete cds. ACCESSION L06133 NID g179252 KEYWORDS Cu++-transporting ATPase; P-type ATPase. SOURCE Homo sapiens (library: cDNA libraries from J. Ellison and J. Edmund) fibroblast cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8478) AUTHORS Vulpe,C.D, Levinson,B., Whitney,S., Packman,S. and Gitschier,J. TITLE Isolation of a candidate gene for Menkes disease and evidence that it encodes a copper-transporting ATPase JOURNAL Nature Genet. 3, 7-13 (1993) MEDLINE 93258410 FEATURES Location/Qualifiers source 1..8478 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="fibroblast" /tissue_type="fibroblast" /tissue_lib="cDNA libraries from J. Ellison and J. Edmund" /map="Xq13.3" CDS 146..4648 /note="putative" /codon_start=1 /product="Cu++-transporting P-type ATPase" /db_xref="PID:g179253" /translation="MDPSMGVNSVTISVEGMTCNSCVWTIEQQIGKVNGVHHIKVSLE EKNATIIYDPKLQTPKTLQEAIDDMGFDAVIHNPDPLPVLTDTLFLTVTASLTLPWDH IQSTLLKTKGVTDIKIYPQKRTVAVTIIPSIVNANQIKELVPELSLDTGTLEKKSGAC EDHSMAQAGEVVLKMKVEGMTCHSCTSTIEGKIGKLQGVQRIKVSLDNQEATIVYQPH LISVEEMKKQIEAMGFPAFVKKQPKYLKLGAIDVERLKNTPVKSSEGSQQRSPSYTND STATFIIDGMHCKSCVSNIESTLSALQYVSSIVVSLENRSAIVKYNASSVTPESLRKA IVAVSPGLYRVSITSEVESTSNSPSSSSLQKIPLNVVSQPLTQETVINIDGMTCNSCV QSIEGVISKKPGVKSIRVSLANSNGTVEYDPLLTSPETLRGAIEDMGFDATLSDTNEP LVVIAQPSSEMPLLTSTNEFYTKGMTPVQDKEEGKNSSKCYIQVTGMTCASCVANIER NLRREEGIYSILVALMAGKAEVRYNPAVIQPPMIAEFIRELGFGATVIENADEGDGVL ELVVRGMTCASCVHKIESSLTKHRGILYCSVALATNKAHIKYDPEIIGPRDIIHTIES LGFEASLVKKDRSASHLDHKREIRQWRRSFLVSLFFCIPVMGLMTYMMVMDHHFATLH HNQNMSKEEMINLHSSMFLERQILPGLSVMNLLSFLLCVPVQFFGGWYFYIQAYKALK HKTANMDVLIVLATTIAFAYSLIILLVAMYERAKVNPITFFDTPPMLFVFIALGRWLE HIAKGKTSEALAKLISLQATEATIVTLDSDNILLSEEQVDVELVQRGDIIKVVPGGKF PVDGRVIEGHSMVDESLITGEAMPVAKKPGSTVIAGSINQNGSLLICATHVGADTTLS QIVKLVEEAQTSKAPIQQFADKLSGYFVPFIVFVSIATLLVWIVIGFLNFEIVETYFP GYNRSISRTETIIRFAFQASITVLCIACPCSLGLATPTAVMVGTGVGAQNGILIKGGE PLEMAHKVKVVVFDKTGTITHGTPVVNQVKVLTESNRISHHKILAIVGTAESNSEHPL GTAITKYCKQELDTETLGTCIDFQVVPGCGISCKVTNIEGLLHKNNWNIEDNNIKNAS LVQIDASNEQSSTSSSMIIDAQISNALNAQQHKVLIGNREWMIRNGLVINNDVNDFMT EHERKGRTAVLVAVDDELCGLIAIADTVKPEAELAIHILKSMGLEVVLMTGDNSKTAR SIASQVGITKVFAEVLPSHKVAKVKQLQEEGKRVAMVGDGINDSPALAMANVGIAIGT GTDVAIEAADVVLIRNDLLDVVASIDLSRKTVKRIRINFVFALIYNLVGIPIAAGVFM PIGLVLQPWMGSAAMAASSVSVVLSSLFLKLYRKPTYESYELPARSQIGQKSPSEISV HVGIDDTSRNSPKLGLLDRIVNYSRASINSLLSDKRSLNSVVTSEPDKHSLLVGDFRE DDDTAL" repeat_region 5655..5951 polyA_site 8478 BASE COUNT 2520 a 1595 c 1701 g 2662 t ORIGIN 1 gccgcagccg cagctactgt gacttctccg attgtgtgag ctttgttgga gcctgcgtac 61 gtggatttat cgctgccacg gtctgcgtag ttccagaggt ttaaccatag gatagagaaa 121 ccaggaatgt aatgaggaaa tcaaaatgga tccaagtatg ggtgtgaatt ctgttaccat 181 ttctgttgag ggtatgactt gcaattcctg tgtttggacc attgagcagc agattggaaa 241 agtgaatggt gtgcatcaca ttaaggtatc actggaagaa aaaaatgcaa ctattattta 301 tgaccctaaa ctacagactc caaagaccct acaggaagct attgatgaca tgggctttga 361 tgctgttatc cataatcctg accctctccc tgttttaact gacaccttgt ttctgactgt 421 tacggcgtca ctgactttgc catgggacca tatccaaagc acattgctga agaccaaggg 481 tgtgacagac attaaaattt accctcagaa aagaactgta gcagtgacaa taatcccttc 541 tatagtgaat gccaatcaga taaaagagct ggttccagaa ctcagtttag atactgggac 601 actggagaaa aagtcaggag cttgtgaaga tcatagtatg gctcaagctg gtgaagtcgt 661 gctgaagatg aaagtggaag ggatgacctg ccattcatgt actagcacta ttgaaggaaa 721 aattgggaaa ctgcaaggtg ttcagcgaat taaagtctcc ctggacaatc aagaagctac 781 tattgtttat caacctcatc ttatctcagt agaggaaatg aaaaagcaga ttgaagctat 841 gggctttcca gcatttgtca aaaagcagcc caagtacctc aaattgggag ctattgatgt 901 agaacgtcta aagaacacac cagttaaatc ctcagaaggg tcacagcaaa ggagtccatc 961 atataccaat gattcaacag ccactttcat cattgatggc atgcattgta aatcatgtgt 1021 gtcaaatatt gaaagtactt tatctgcact ccaatatgta agcagcatag tagtttcttt 1081 agagaatagg tctgccattg tgaagtataa tgcaagctca gtcactccag aatccctgag 1141 aaaagcaata gtggctgtat caccggggct atatagagtt agtatcacaa gtgaagttga 1201 gagtacctca aactctccct ccagctcatc tcttcagaag attcctttga atgtagttag 1261 ccagcctctg acacaagaaa ctgtgataaa cattgatggc atgacttgta attcctgtgt 1321 gcagtctatt gagggtgtca tatcaaaaaa gccaggtgta aaatccatac gagtctccct 1381 tgcaaatagc aatgggactg ttgagtatga tcctctacta acctctccag aaacgttgag 1441 aggagcaata gaagacatgg gatttgatgc taccttgtca gacacgaatg agccgttggt 1501 agtaatagct cagccttcat cggaaatgcc gcttctgact tcaactaatg aattttatac 1561 taaagggatg acaccagttc aagacaagga ggaaggaaag aattcatcta agtgttacat 1621 acaggtcact ggcatgactt gcgcttcctg tgtagcaaac attgaacgga atttaaggcg 1681 ggaagaagga atatattcta tacttgtggc cctgatggct ggcaaggcag aagtaaggta 1741 taatcctgct gttatacaac ccccaatgat agcagagttc atccgagaac ttggatttgg 1801 agccactgtg atagaaaatg ctgatgaagg agatggtgtt ttggaacttg ttgtgagggg 1861 aatgacgtgt gcctcctgcg tacataaaat agagtctagt ctcacaaaac acagagggat 1921 cctatactgc tccgtggccc tggcaaccaa caaagcacat attaaatatg acccagaaat 1981 tattggtcct agagatatta tccatacaat tgaaagctta ggttttgaag cttctttggt 2041 caagaaggat cggtcagcaa gtcacttaga tcataaacga gaaataagac aatggagacg 2101 gtcttttctt gtgagtctgt ttttctgtat tcctgtaatg gggctgatga catatatgat 2161 ggttatggac caccactttg caactcttca ccataatcaa aacatgagta aagaagaaat 2221 gatcaacctt cattcttcta tgttcctgga gcgccagatt cttccaggat tgtctgttat 2281 gaatttgctg tcctttttat tgtgtgtacc tgtacagttt ttcggaggct ggtacttcta 2341 cattcaggct tataaagcac tgaagcataa gacagcaaat atggacgtac tgattgtgct 2401 ggcaaccacc attgcatttg cctactcttt gattattctt ctagttgcaa tgtatgagag 2461 agccaaagtg aaccctatta ctttctttga cacaccccct atgctgtttg tgtttattgc 2521 actaggccga tggctggaac atatagcaaa gggcaaaaca tcagaggctc ttgcaaagtt 2581 aatttcacta caagctacag aagcaactat tgtaactctt gattctgata atatcctcct 2641 cagtgaagaa caagtggatg tggaacttgt acaacgtgga gatatcatta aagtagttcc 2701 aggaggcaaa tttccagtgg atggtcgtgt tattgaagga cattctatgg tagatgagtc 2761 cctcatcaca ggggaggcaa tgcctgtggc taagaaacct ggcagcacag tgattgctgg 2821 ttccattaac cagaacgggt cactgcttat ctgcgcaaca catgttggag cagacacaac 2881 cctttctcaa attgtcaaac ttgtggaaga ggcacaaaca tcaaaggctc ctatccagca 2941 gtttgcagac aaactcagtg gctattttgt tccttttatt gtttttgttt ccattgccac 3001 cctcttggta tggattgtaa ttggatttct gaattttgaa attgtggaaa cctactttcc 3061 tggctacaat agaagtatct cccgaacaga aacgataata cgatttgctt tccaagcctc 3121 tatcacagtt ctgtgtattg catgtccctg ttcactggga ctggccactc caactgctgt 3181 gatggtgggt acaggagtag gtgctcaaaa tggcatacta ataaaaggtg gagagccatt 3241 ggagatggct cataaggtaa aggtagtggt atttgataag actggaacca ttactcacgg 3301 aaccccagtg gtgaatcaag taaaggttct aactgaaagt aacagaatat cacaccataa 3361 aatcttggcc attgtgggaa ctgctgaaag taacagtgaa caccctctag gaacagccat 3421 aaccaaatat tgcaaacagg agctggacac tgaaaccttg ggtacctgca tagatttcca 3481 ggttgtgcca ggctgtggta ttagctgtaa agtcaccaat attgaaggct tgctacataa 3541 gaataactgg aatatagagg acaataatat taaaaatgca tccctggttc aaattgatgc 3601 cagtaatgaa cagtcatcaa cttcgtcttc catgattatt gatgcccaga tctcaaatgc 3661 tcttaatgct cagcagcata aagtcctcat tggtaaccgg gagtggatga ttagaaatgg 3721 tcttgtcatt aataacgatg taaatgattt catgactgaa catgagagaa aaggtcggac 3781 tgctgtatta gtagcagttg atgatgagct gtgtggcttg atagccattg cagacacagt 3841 gaagcctgaa gcagaactgg ctatccatat tctgaaatct atgggcttag aagtagttct 3901 gatgactgga gacaacagta aaacagctag atctattgct tctcaggttg gcattactaa 3961 ggtgtttgct gaagttctac cttctcacaa ggttgctaaa gtgaagcaac ttcaagagga 4021 ggggaaacgg gtagcaatgg tgggagatgg aatcaatgac tccccagctc tggcaatggc 4081 taatgtggga attgctattg gcacaggcac agatgtagcc attgaagcag ctgatgtggt 4141 tttgataagg aatgatcttc tggatgtagt ggcaagtatt gacttatcaa gaaagacagt 4201 caagaggatt cggataaatt ttgtctttgc tctaatttat aatctggttg gaattcccat 4261 agctgctgga gtttttatgc ccattggttt ggttttgcag ccctggatgg gatctgcagc 4321 aatggctgct tcatctgttt ctgtagtact ttcttctctc ttccttaaac tttacaggaa 4381 accaacttac gagagttatg aactgcctgc ccggagccag ataggacaga agagtccttc 4441 agaaatcagc gttcatgttg gaatagatga tacctcaagg aattctccta aactgggttt 4501 gctggaccgg attgttaatt atagcagagc ctctataaac tcactactgt ctgataaacg 4561 ctccctaaac agtgttgtta ccagtgaacc tgacaagcac tcactcctgg tgggagactt 4621 cagggaagat gatgacactg cattataaaa ggccatggag agtgctgcca gtttaacttg 4681 tcatgcactg acacagcatt catgatgtta ccttcacttt tcaaaatatt gtagaaggat 4741 ttttctcatg ctcttatatt agggattcta tttgagttgc gtttatctgt tggcaaaaat 4801 atctttttca aggcatcagc tctgaaccta gctttattta aactgaattt ccagtatatt 4861 tttgtttttt cactaacaac agataaggta gagcagtgag gtttacaaca agccctacaa 4921 ttagagattg ctgaactgct gctaaagtga tttttttttt atttgaccaa aaaaaaaaag 4981 gcccaagaag aagaaaatga aaaatttgaa gatttgagag catgaagata ttcatgcttt 5041 tgaactcaaa atattgaaga tactctcaag cctgtatccc tgcccactgg ggagcaatga 5101 ctttcaaagc actgtgtata aaacatctag ttttagaagg gaaacagttg aaactgttta 5161 aaaatagatg tgccttattt attgcaggct ttctttcccc cattctccct gcatccttgt 5221 ccttgcaggt gcttttttag atgctccaat atgtcttctt ttgttatttt ctttcgagct 5281 aaccagttta gggtggtttt tcattgatta aaaataactg acaactgttc taatattttg 5341 ctccttttta aattttgtag ctcaaaagac cttaaaggtc tgtagggttc cctgcctccc 5401 atctttccac tgttgtaaaa agtatatcaa attattcctt caagtttcct agctctgtgc 5461 tcagtttcag ttcactcctg ccaagttgga ctctaagtta ttcttcatgt agtctgctga 5521 tctcagtctg gaaacttaac attatgagcc ttttctgctc aaaaaatttt caaagattaa 5581 aactattata catatacagg tcatataaaa ttacctggat tcactaaatt tgtttgttgt 5641 tgttgttgtt gttgttgttg ttgttgagac agagtcttgt tttgcagccc aggctggagt 5701 gcagtggcac catcttggct cactgcaacc tctgcctacc ggattcaagg aattctccct 5761 gcctcagcct cctgagtagc taggattaca ggtgcctgcc accacacccg gctaattttc 5821 atatttttca gtagagacgg ggtttcgcca tgttggctag cctggtctta aactcctaac 5881 ctcaggtgat ccacccgcct tggcctccca aagtgctggg attacaggtg tgagcctcca 5941 tgcccagcct aaatttgtat tttttgaatt gagtataaca ctttgctagt atatataatt 6001 taatagattt tatttatctt tttagtgttt cagataacct ctcaaaaaga ctttaaaata 6061 atgctattac acaaaagctg catttaccaa aaaatacaag taaaatcata atacagaaac 6121 taaaatttcc ctaggttatg acgcttttta gctaaatata tactcttctc tagtttaaaa 6181 catttgaact tgcctagtta gtgtggttgg caaatttagg agcttgttcc cattgccaaa 6241 tggatttaga aattcccttg tgagtgcctg gtagctaata cactggtcag agatctggta 6301 cttgtaagac tatttaaatt tctttgttag ttgcaagatg gatttcatat gcagaatatg 6361 taaatgaaga ggactcataa gtaaattcct aacattttgt tcccattacc agaagcaaag 6421 ctgctgctaa cccaacatct ggcacatagg atttgtactc ggtaaatgtt agttcttttc 6481 tccccttgag gtcagtaata aatacaaaaa aaaaatcatt tttctagagc agagtcttaa 6541 aatcaggtgg gggtagggga tggacttctt cctttcctac cccttttctc ttttatcctt 6601 tcatatatac acatgcaaag tttacaacct tattccatgc tgtctttcag atttagaaaa 6661 gatctaattt ctgtctcagc tgtcttaaag agagatccaa gcttttgatg aaggtgctat 6721 taatctagaa aggcaaaccc atttcactga aatatcaatg ggtttgcata tctaggccct 6781 ttttttagac caatgcctat gccatcctcc atgctttcag tttgagtttt attatttatt 6841 attttaattc cagtggccca tcttataata caacttgttt cttctagaag acagagctga 6901 tagggtaaat gttgaaaaaa gaagcatgcc tccttctccc tcccacccac ctcaagcagt 6961 tgaacacagc cagttattct tccattatta tgtgtacctt ggagtcatcc tcttggtctt 7021 gtattcatat tgtgggacag tgggaatagc agcttgtagt attgaaataa tcaaagagat 7081 aatttcagct cttacaacaa gaacagaaaa catgctaatt agaaaaagtc ctgctttaag 7141 taatatgtag ccacatttga atcctctacc acaagtcttt ttggcaatct tgaactttca 7201 ttagcttcaa agagaagctg tctttacagg agaaagaggt gattatggag ggaatcaaaa 7261 ttactgcttt cagctagtag ctagctttta gtcaggagtt agtaatgaga aattttataa 7321 atgtgtattt ctgtgtattc acatacatat atatatacac acatatatat gtacatacac 7381 atacatacat attaccatac tgaagggaaa tggatttata tttgaatttg atatttgaat 7441 atttgaagtt ctctatttat atttcattta tcactcttct gtattcactc agcaaccatg 7501 cccttagtct gaagataaca aggatacttt aatatccagt gccggttcag actcacctat 7561 gtggcacctt aaacttaaat atccaaagag cccttttgaa tttcaaagat taaaacacag 7621 ctagagtagt agtattgtag tctcaagatg tatggtgtgt catttgtgaa aatagaaacg 7681 ttatttttcc tagtttagta tcaacatttt agaatgtcaa atttgatgcc ttgtgaacaa 7741 gtaattttat attgtgcttt aattttttaa aaagtattct ttattcatat gtatagaatg 7801 cttaaaatag cactgtagac aagatgtttc caaaacttta agtgcgcata tcttttctgt 7861 atacaggctt aaaatcaaaa atgtatgtta agagaatttt ttctattact tatggatcat 7921 gctttaaaat gatttttctt gatatttatt ttactccatt ttgttttttt ctctgggtga 7981 ggcatctggt tactggttat tttaaaaaga ataaagacat tcctggaatc actccttttg 8041 ggtttttgtt tgttttgttt cttttctaca tttttaagta ttattttaca aaatagaaaa 8101 aatatagatt catgccaaat tattacctat ttttacacta actttctgca gtccctccat 8161 ctggtatgag taaccagata gagcacaagc atgagttctt gttcttcagt taaaagagct 8221 tctttcacag tgttgataac aaatgccttt tgtagccaaa accaggcgtc tcaaccttac 8281 gtttttagtt aaagaaatgt ttagctaaac gctgttgaac attgattgtt tggtaccgaa 8341 aacagcagtg gacgatgttg tgcaatatcc atctactgta gttaagatat tcagtagttt 8401 gtttttcata agcatgtaat tgatcatatt tctgccaagg atgtgccttc aactttataa 8461 ttatagtgtt gtaaaata // LOCUS HUMATPSGH 1078 bp mRNA PRI 15-FEB-1995 DEFINITION Human mRNA for ATP synthase gamma-subunit (H-type), complete cds. ACCESSION D16563 NID g468447 KEYWORDS ATP synthase gamma-subunit. SOURCE Homo sapiens heart cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1078) AUTHORS Matsuda,C., Endo,H., Ohta,S. and Kagawa,Y. TITLE Gene structure of human mitochondrial ATP synthase gamma-subunit. Tissue specificity produced by alternative RNA splicing JOURNAL J. Biol. Chem. 268 (33), 24950-24958 (1993) MEDLINE 94043360 REFERENCE 2 (bases 1 to 1078) AUTHORS Kagawa,Y. TITLE Direct Submission JOURNAL Submitted (23-JUN-1993) to the DDBJ/EMBL/GenBank databases. Yasuo Kagawa, Jichi Medical School, Department of Biochemistry; 3311-1 Yakushiji, Minamikawachi-machi, Tochigi 329-04, Japan (E-mail:ykagawa@ddbj.nig.ac.jp, Tel:0285-44-2111(ex.3149), Fax:0285-44-1827) COMMENT Submitted (23-JUN-1993) to DDBJ by: Yasuo Kagawa Department of Biochemistry Jichi Medical School Minamikawachi-machi Kawachi-gun Tochigi 329-04 Japan Phone: 0285-44-2111 x3149 Fax: 0285-44-1827. FEATURES Location/Qualifiers source 1..1078 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="heart" 5'UTR 1..31 sig_peptide 32..106 CDS 32..925 /note="H(heart)-type ATP synthase gamma-subunit" /codon_start=1 /product="ATP synthase gamma-subunit" /db_xref="PID:d1004513" /db_xref="PID:g665585" /translation="MFSRAGVAGLSAWTLQPQWIQVRNMATLKDITRRLKSIKNIQKI TKSMKMVAAAKYARAERELKPARIYGLGSLALYEKADIKGPEDKKKHLLIGVSSDRGL CGAIHSSIAKQMKSEVATLTAAGKEVMLVGIGDKIRGILYRTHSDQFLVAFKEVGRKP PTFGDASVIALELLNSGYEFDEGSIIFNKFRSVISYKTEEKPIFSLNTVASADSMSIY DDIDADVLQNYQEYNLANIIYYSLKESTTSEQSARMTAMDNASKNASEMIDKLTLTFN RTRQAVITKELIEIISGAAAL" mat_peptide 107..922 /product="ATP synthase gamma-subunit" 3'UTR 926..1078 polyA_site 1078 BASE COUNT 331 a 223 c 233 g 291 t ORIGIN 1 ctgaccgacc ttcagcaggg ctgtggctac catgttctct cgcgcgggtg tcgctgggct 61 gtcggcctgg accttgcagc cgcaatggat tcaagttcga aatatggcaa ctttgaaaga 121 tatcaccagg agactaaagt ccatcaaaaa catccagaaa attaccaagt ctatgaaaat 181 ggtagcggca gcaaaatatg cccgagctga gagagagctg aaaccagctc gaatatatgg 241 attgggatct ttagctctgt atgaaaaagc tgatatcaag gggcctgaag acaagaagaa 301 acacctcctt attggtgtgt cctcagatcg aggactgtgt ggtgctattc attcctccat 361 tgctaaacag atgaaaagcg aggttgctac actaacagca gctgggaaag aagttatgct 421 tgttggaatt ggtgacaaaa tcagaggcat actttatagg actcattctg accagtttct 481 ggtggcattc aaagaagtgg gaagaaagcc ccccactttt ggagatgcgt cagtcattgc 541 ccttgaatta ctaaattctg gatatgaatt tgatgaaggc tccatcatct ttaataaatt 601 caggtctgtc atctcctata agacagaaga aaagcccatc ttttccctta ataccgttgc 661 aagtgctgac agcatgagta tctatgacga tattgatgct gacgtgctgc aaaattacca 721 agaatacaat ctggccaaca tcatctacta ctctctgaag gagtccacca ctagtgagca 781 gagtgccagg atgacagcca tggacaatgc cagcaagaat gcttctgaga tgattgacaa 841 attgacattg acattcaacc gtacccgcca agctgtcatc acaaaagagt tgattgaaat 901 tatctctggt gctgcagctc tgtaaagaag gaaaattcag ccagttgatt ttgtttttag 961 cttactgctg cctttgtccg aagaaactgt tcctccatta tttgaattac tgaagacagc 1021 aagatatttg taaattatct taaaataaac aacttaaaat aaaatcattg tttttctt // LOCUS HUMATPSY 471 bp mRNA PRI 31-OCT-1994 DEFINITION Human mitochondrial ATPase coupling factor 6 subunit (ATP5A) mRNA, complete cds. ACCESSION M37104 NID g179274 KEYWORDS ATPase; coupling factor 6. SOURCE Homo sapiens fetus muscle cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 471) AUTHORS Javed,A.A., Ogata,K. and Sanadi,D.R. TITLE Human mitochondrial ATP synthase: cloning cDNA for the nuclear-encoded precursor of coupling factor 6 JOURNAL Gene 97 (2), 307-310 (1991) MEDLINE 91153664 COMMENT Draft entry and computer-readable sequence for [Unpublished (1990) Albert Einstein College of Med, Dept Obs/Gyn,Ullman #117, 130] kindly submitted by A.A.Javed, 25-JUL-1990. FEATURES Location/Qualifiers source 1..471 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="muscle" /map="10" gene 6..455 /gene="ATP5A" CDS 6..332 /gene="ATP5A" /EC_number="3.6.1.3" /codon_start=1 /db_xref="GDB:G00-119-025" /product="ATPase coupling factor 6 subunit" /db_xref="PID:g179275" /translation="MILQRLFRFSSVIRSAVSVHLRRNIGVTAVAFNKELDPIQKLFV DKIREYKSKRQTSGGPVDASSEYQQELERELFKLKQMFGNADMNTFPTFKFEDPKFEV IEKPQA" sig_peptide 6..101 /gene="ATP5A" /note="G00-119-025" mat_peptide 102..329 /gene="ATP5A" /note="G00-119-025" /product="ATPase coupling factor 6 subunit" polyA_signal 449..455 /gene="ATP5A" /note="G00-119-025; putative" BASE COUNT 149 a 84 c 97 g 141 t ORIGIN 1 tcagcatgat tcttcagagg ctcttcaggt tctcctctgt cattcggtca gccgtctcag 61 tccatttgcg gaggaacatt ggtgttacag cagtggcatt taataaggaa cttgatccta 121 tacagaaact ctttgtggac aagattagag aatacaaatc taagcgacag acatctggag 181 gacctgttga tgctagttca gagtatcagc aagagctgga gagggagctt tttaagctca 241 agcaaatgtt tggtaatgca gacatgaata catttcccac cttcaaattt gaagatccca 301 aatttgaagt catcgaaaaa ccccaggcct gaagaaataa agtaaaatta atctggtaat 361 ttgtcacgga ttagttgtac aactagttag aagtttcaga ataaacatgc atttcataac 421 tgtcaaatgt tcttttaatt ctgagtccaa ataaattatt tggtgatgtt g // LOCUS HUMATS 3344 bp mRNA PRI 25-NOV-1996 DEFINITION Human mRNA for alanyl-tRNA synthetase, complete cds. ACCESSION D32050 NID g1015320 KEYWORDS alanyl-tRNA synthetase. SOURCE Homo sapiens T-lymophocyte cell-line KUT-2 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Shiba,K., Ripmaster,T., Suzuki,N., Nichols,R., Plotz,P., Noda,T. and Schimmel,P. TITLE Human alanyl-tRNA synthetase: conservation in evolution of catalytic core and microhelix recognition JOURNAL Biochemistry 34 (33), 10340-10349 (1995) MEDLINE 95383296 REFERENCE 2 (bases 1 to 3344) AUTHORS Shiba,K. JOURNAL Unpublished (1995) REFERENCE 3 (bases 1 to 3344) AUTHORS Shiba,K. TITLE Direct Submission JOURNAL Submitted (04-JUL-1994) to the DDBJ/EMBL/GenBank databases. Kiyotaka Shiba, Cancer Institute, Department of Cell Biology; 1-37-1, Kami-Ikebukuro, Toshima-ku, Tokyo 170, Japan (E-mail:shiba@ganvx1.jfcr.or.jp, Tel:03-3918-0111(ex.4232), Fax:03-3917-7564) FEATURES Location/Qualifiers source 1..3344 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KUT-2" /cell_type="T-lymophocyte" CDS 111..3017 /codon_start=1 /product="alanyl-tRNA synthetase" /db_xref="PID:d1007382" /db_xref="PID:g1015321" /translation="MDSTLTASEIRQRFIDFFKRNEHTYVHSSATIPLDDPTLLFANA GMNQFKPIFLNTIDPSHPMAKLSRAANTQKCIRAGGKQNDLDDVGKDVYHHTFFEMLG SWSFGDYFKELACKMALELLTQEFGIPIERLYVTYFGGDEAAGLEADLECKQIWQNLG LDDTKILPGNMKDNFWEMGDTGPCGPCSEIHYDRIGGRDAAHLVNQDDPNVLEIWNLV FIQYNREADGILKPLPKKSIDTGMGLERLVSVLQNKMSNYDTDLFVPYFEAIQKGTGA RPYTGKVGAEDADGIDMAYRVLADHARTITVALADGGRPDNTGRGYVLRRILRRAVRY AHEKLNASRGFFATLVDVVVQSLGDAFPELKKDPDMVKDIINEEEVQFLKTLSRGRRI LDRKIQSLGDSKTIPGDTAWLLYDTYGFPVDLTGLIAEEKGLVVDMDGFEEERKLAQL KSQGKGAGGEDLIMLDIYAIEELRARGLEVTDDSPKYNYHLDSSGSYVFENTVATVMA LRREKMFVEEVSTGQECGVVLDKTCFYAEQGGQIYDEGYLVKVDDSSEDKTEFTVKNA QVRGGYVLHIGTIYGDLKVGDQVWLFIDEPRRRPIMSNHTATHILNFALRSVLGEADQ KGSLVAPDRLRFDFTAKGAMSTQQIKKAEEIANEMIEAAKAVYTQDCPLAAAKAIQGL RAVFDETYPDPVRVVSIGVPVSELLDDPSGPAGSLTSVEFCGGTHLRNSSHAGAFVIV TEEAIAKGIRRIVAVTGAEAQKALRKAESLKKCLSVMEAKVKAQTAPNKDVQREIADL GEALATAVIPQWQKDELRETLKSLKKVMDDLDRASKADVQKRVLEKTKQFIDSNPNQP LVILEMESGASAKALNEALKLFKMHSPQTSAMLFTVDNEAGKITCLCQVPQNAANRGL KASEWVQQVSGLMDGKGGGKDVSAQATGKNVGCLQEALQLATSFAQLRLGDVKN" polyA_signal 3327..3332 BASE COUNT 827 a 823 c 966 g 728 t ORIGIN 1 ggtacagctg cgcgtctgcg ggaataggtg cagcgggccc ttggcggggg actctgaggg 61 aggagctggg gacggcgacc ctaggagagt tctttggggt gactttcaag atggactcta 121 ctctaacagc aagtgaaatc cggcagcgat ttatagattt cttcaagagg aacgagcata 181 cgtatgttca ctcgtctgcc accatcccat tggatgaccc cactttgctc tttgccaatg 241 caggcatgaa ccagtttaaa cccattttcc tgaacacaat tgacccatct caccccatgg 301 caaagctgag cagagctgcc aatacccaga agtgcatccg ggctgggggc aaacaaaatg 361 acctggacga tgtgggcaag gatgtctatc atcacacctt cttcgagatg ctgggctctt 421 ggtcttttgg agattacttt aaggaattgg catgtaagat ggctctggaa ctcctcaccc 481 aagagtttgg cattcccatt gaaagacttt atgttactta ctttggcggg gatgaagcag 541 ctggcttaga agcagatctg gaatgcaaac agatctggca aaatttgggg ctggatgaca 601 ccaaaatcct cccaggcaac atgaaggata acttctggga gatgggtgac acgggcccct 661 gtggtccttg cagtgagatc cactacgacc ggattggtgg tcgggacgcc gcacatcttg 721 tcaaccagga cgaccctaat gtgctggaga tctggaacct tgtgttcatc cagtataaca 781 gggaagctga tggcattctg aaacctcttc ccaagaaaag cattgacaca gggatgggcc 841 tggaacgact ggtatctgtg ctgcagaata agatgtccaa ctatgacact gacctttttg 901 tcccttactt tgaagccatt cagaagggca caggtgcccg accatacact gggaaagttg 961 gtgctgagga tgccgatggg attgacatgg cctaccgggt gctggctgac catgctcgga 1021 ccatcactgt ggcactggct gatggtggcc ggcctgacaa cacagggcgt ggatatgtgt 1081 tgagacggat tctccgccga gctgtccgat acgcccatga aaagctcaat gccagcaggg 1141 gcttctttgc tacgttagtg gatgttgtcg tccagtccct gggagatgca tttcctgagc 1201 tgaagaagga cccagacatg gtgaaggaca tcattaatga agaagaggtg cagtttctca 1261 agactctcag cagagggcgt cgcatcctgg acaggaaaat tcagagcctg ggagacagca 1321 agaccattcc cggagacact gcttggctcc tctatgacac ctatgggttt ccagtggatc 1381 tgactggact gattgctgaa gagaagggcc tggtggtaga catggatggc tttgaagagg 1441 agaggaaact ggcccagctg aaatcacagg gcaagggagc tggtggggaa gacctcatta 1501 tgctggacat ttacgctatc gaagagctcc gggcacgggg tctggaggtc acagatgatt 1561 ccccaaagta caattaccat ttggactcca gtggtagcta tgtatttgag aacacagtgg 1621 ctacggtgat ggctctgcgc agggagaaga tgttcgtgga agaggtgtcc acaggccagg 1681 agtgtggagt ggtgctggac aagacctgtt tctatgctga gcaaggaggc cagatctatg 1741 acgaaggcta cctggtgaag gtggatgaca gcagtgaaga taaaacagag tttacagtga 1801 agaatgctca ggtccgagga gggtatgtgc tacacattgg aaccatctac ggtgacctga 1861 aagtggggga tcaggtctgg ctgtttattg atgagccccg acgaagaccc atcatgagca 1921 accacacagc tacgcacatt ctgaacttcg ccctgcgctc agtgcttggg gaagctgacc 1981 agaaaggctc attggttgct cctgaccgcc tcagatttga ctttactgcc aagggagcca 2041 tgtccaccca acagatcaag aaggctgaag agattgctaa tgagatgatt gaggcagcca 2101 aggccgtcta tacccaggat tgccccctgg cagcagcgaa agccatccag ggcctacggg 2161 ctgtgtttga tgagacctat cctgaccctg tgcgagtcgt ctccattggg gtcccggtgt 2221 ccgagttgct ggatgacccc tctgggcctg ctggctccct gacttctgtt gagttctgtg 2281 ggggaacgca cctgcggaac tcgagtcatg caggagcttt tgtgatcgtg acggaagaag 2341 ccattgccaa gggtatccgg aggattgtgg ctgtcacagg tgccgaggcc cagaaggccc 2401 tcaggaaagc agagagcttg aagaaatgtc tctctgtcat ggaagccaaa gtgaaggctc 2461 agactgctcc aaacaaggat gtgcagaggg agatcgctga ccttggagag gccctggcca 2521 ctgcagtcat cccccagtgg cagaaggatg aattgcggga gactctcaaa tccctaaaga 2581 aggtcatgga tgacttggac cgagccagca aagccgatgt ccagaaacga gtgttagaga 2641 agacgaagca gttcatcgac agcaacccca accagcctct tgtcatcctg gagatggaga 2701 gcggcgcctc agccaaggcc ctgaatgaag ccttgaagct cttcaagatg cactcccctc 2761 agacttctgc catgctcttc acggtggaca atgaggctgg caagatcacg tgcctgtgtc 2821 aagtccccca gaatgcagcc aatcggggct taaaagccag cgagtgggtg cagcaggtgt 2881 caggcttgat ggacggtaaa ggtggtggca aggatgtgtc tgcacaggcc acaggcaaga 2941 acgttggctg cctgcaggag gcgctgcagc tggccacttc cttcgcccag ctgcgcctcg 3001 gggatgtaaa gaactgagtg gggaaggagg aggctcccac tggatccatc cgtccagcca 3061 agagctcttc atctgctaca agaacatttg aatcttggga cctttaaaga gcccctccta 3121 acccagcagt aactggaaca cacttgggag cagtcctatg tctcagtgcc ccttaaattt 3181 ctgccctgag ccctccacgt cagtgccatc ggtctagaac cactaacccc gcattgctgt 3241 tgatcgtcac gctcgcatct atagataacg gctctccaga cctgagcttt ccgcgtcagc 3301 aagtaggaat cgtttttgct gcagagaata aaaggaccac gtgc // LOCUS HUMATXT 3110 bp DNA PRI 24-MAY-1996 DEFINITION Human autotaxin-t (atx-t) gene, complete cds. ACCESSION L46720 NID g1160615 KEYWORDS autotaxin; motility factor; phosphodiesterase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3110) AUTHORS Murata,J., Lee,H.Y., Clair,T., Krutzsch,H.C., Arestad,A.A., Sobel,M.E., Liotta,L.A. and Stracke,M.L. TITLE cDNA cloning of the human tumor motility-stimulating protein, autotaxin, reveals a homology with phosphodiesterases JOURNAL J. Biol. Chem. 269 (48), 30479-30484 (1994) MEDLINE 95074054 REFERENCE 2 (bases 1 to 3110) AUTHORS Lee,H.Y., Murata,J., Clair,T., Polymeropoulos,M.H., Torres,R., Manrow,R.E., Liotta,L.A. and Stracke,M.L. TITLE Cloning, chromosomal localization, and tissue expression of autotaxin from human teratocarcinoma cells JOURNAL Biochem. Biophys. Res. Commun. 218 (3), 714-719 (1996) MEDLINE 96158950 FEATURES Location/Qualifiers source 1..3110 /organism="Homo sapiens" /note="(vector lambda gt10)" /db_xref="taxon:9606" /cell_line="NTera2D1" /cell_type="teratocarcinoma" /dev_stage="adult" /sex="male" /tissue_type="testis" gene 60..2651 /gene="atx-t" CDS 60..2651 /gene="atx-t" /codon_start=1 /product="autotaxin-t" /db_xref="PID:g1160616" /translation="MARRSSFQSCQIISLFTFAVGVNICLGFTAHRIKRAEGWEEGPP TVLSDSPWTNISGSCKGRCFELQEAGPPDCRCDNLCKSYTSCCHDFDELCLKTARAWE CTKDRCGEVRNEENACHCSEDCLARGDCCTNYQVVCKGESHWVDDDCEEIKAAECPAG FVRPPLIIFSVDGFRASYMKKGSKVMPNIEKLRSCGTHSPYMRPVYPTKTFPNLYTLA TGLYPESHGIVGNSMYDPVFDATFHLRGREKFNHRWWGGQPLWITATKQGVKAGTFFW SVVIPHERRILTILQWLTLPDHERPSVYAFYSEQPDFSGHKYGPFGPEMTNPLREIDK IVGQLMDGLKQLKLHRCVNVIFVGDHGMEDVTCDRTEFLSNYLTNVDDITLVPGTLGR IRSKFSNNAKYDPKAIIANLTCKKPDQHFKPYLKQHLPKRLHYANNRRIEDIHLLVER RWHVARKPLDVYKKPSGKCFFQGDHGFDNKVNSMQTVFVGYGPTFKYKTKVPPFENIE LYNVMCDLLGLKPAPNNGTHGSLNHLLRTNTFRPTMPEEVTRPNYPGIMYLQSDFDLG CTCDDKVEPKNKLDELNKRLHTKGSTEERHLLYGRPAVLYRTRYDILYHTDFESGYSE IFLMPLWTSYTVSKQAEVSSVPDHLTSCVRPDVRVSPSFSQNCLAYKNDKQMSYGFLF PPYLSSSPEAKYDAFLVTNMVPMYPAFKRVWNYFQRVLVKKYASERNGVNVISGPIFD YDYDGLHDTEDKIKQYVEGSSIPVPTHYYSIITSCLDFTQPADKCDGPLSVSSFILPH RPDNEESCNSSEDESKWVEELMKMHTARVRDIEHLTSLDFFRKTSRSYPEILTLKTYL HTYESEI" polyA_signal 3065..3070 polyA_site 3094..3110 BASE COUNT 906 a 667 c 674 g 863 t ORIGIN 1 agtgcactcc gtgaaggcaa agagaacacg ctgcaaaagg ctttccaata atcctcgaca 61 tggcaaggag gagctcgttc cagtcgtgtc agataatatc cctgttcact tttgccgttg 121 gagtcaatat ctgcttagga ttcactgcac atcgaattaa gagagcagaa ggatgggagg 181 aaggtcctcc tacagtgcta tcagactccc cctggaccaa catctccgga tcttgcaagg 241 gcaggtgctt tgaacttcaa gaggctggac ctcctgattg tcgctgtgac aacttgtgta 301 agagctatac cagttgctgc catgactttg atgagctgtg tttgaagaca gcccgtgcgt 361 gggagtgtac taaggacaga tgtggggaag tcagaaatga agaaaatgcc tgtcactgct 421 cagaggactg cttggccagg ggagactgct gtaccaatta ccaagtggtt tgcaaaggag 481 agtcgcattg ggttgatgat gactgtgagg aaataaaggc cgcagaatgc cctgcagggt 541 ttgttcgccc tccattaatc atcttctccg tggatggctt ccgtgcatca tacatgaaga 601 aaggcagcaa agtcatgcct aatattgaaa aactaaggtc ttgtggcaca cactctccct 661 acatgaggcc ggtgtaccca actaaaacct ttcctaactt atacactttg gccactgggc 721 tatatccaga atcacatgga attgttggca attcaatgta tgatcctgta tttgatgcca 781 cttttcatct gcgagggcga gagaaattta atcatagatg gtggggaggt caaccgctat 841 ggattacagc caccaagcaa ggggtgaaag ctggaacatt cttttggtct gttgtcatcc 901 ctcacgagcg gagaatatta accatattgc agtggctcac cctgccagat catgagaggc 961 cttcggtcta tgccttctat tctgagcaac ctgatttctc tggacacaaa tatggccctt 1021 tcggccctga gatgacaaat cctctgaggg aaatcgacaa aattgtgggg caattaatgg 1081 atggactgaa acaactaaaa ctgcatcggt gtgtcaacgt catctttgtc ggagaccatg 1141 gaatggaaga tgtcacatgt gatagaactg agttcttgag taattaccta actaatgtgg 1201 atgatattac tttagtgcct ggaactctag gaagaattcg atccaaattt agcaacaatg 1261 ctaaatatga ccccaaagcc attattgcca atctcacgtg taaaaaacca gatcagcact 1321 ttaagcctta cttgaaacag caccttccca aacgtttgca ctatgccaac aacagaagaa 1381 ttgaggatat ccatttattg gtggaacgca gatggcatgt tgcaaggaaa cctttggatg 1441 tttataagaa accatcagga aaatgctttt tccagggaga ccacggattt gataacaagg 1501 tcaacagcat gcagactgtt tttgtaggtt atggcccaac atttaagtac aagactaaag 1561 tgcctccatt tgaaaacatt gaactttaca atgttatgtg tgatctcctg ggattgaagc 1621 cagctcctaa taatgggacc catggaagtt tgaatcatct cctgcgcact aataccttca 1681 ggccaaccat gccagaggaa gttaccagac ccaattatcc agggattatg taccttcagt 1741 ctgattttga cctgggctgc acttgtgatg ataaggtaga gccaaagaac aagttggatg 1801 aactcaacaa acggcttcat acaaaagggt ctacagaaga gagacacctc ctctatgggc 1861 gacctgcagt gctttatcgg actagatatg atatcttata tcacactgac tttgaaagtg 1921 gttatagtga aatattccta atgccactct ggacatcata tactgtttcc aaacaggctg 1981 aggtttccag cgttcctgac catctgacca gttgcgtccg gcctgatgtc cgtgtttctc 2041 cgagtttcag tcagaactgt ttggcctaca aaaatgataa gcagatgtcc tacggattcc 2101 tctttcctcc ttatctgagc tcttcaccag aggctaaata tgatgcattc cttgtaacca 2161 atatggttcc aatgtatcct gctttcaaac gggtctggaa ttatttccaa agggtattgg 2221 tgaagaaata tgcttcggaa agaaatggag ttaacgtgat aagtggacca atcttcgact 2281 atgactatga tggcttacat gacacagaag acaaaataaa acagtacgtg gaaggcagtt 2341 ccattcctgt tccaactcac tactacagca tcatcaccag ctgtctggat ttcactcagc 2401 ctgccgacaa gtgtgacggc cctctctctg tgtcctcctt catcctgcct caccggcctg 2461 acaacgagga gagctgcaat agctcagagg acgaatcaaa atgggtagaa gaactcatga 2521 agatgcacac agctagggtg cgtgacattg aacatctcac cagcctggac ttcttccgaa 2581 agaccagccg cagctaccca gaaatcctga cactcaagac atacctgcat acatatgaga 2641 gcgagattta actttctgag catctgcagt acagtcttat caactggttg tatattttta 2701 tattgttttt gtatttatta atttgaaacc aggacattaa aaatgttagt attttaatcc 2761 tgtaccaaat ctgacatatt atgcctgaat gactccactg tttttctcta atgcttgatt 2821 taggtagcct tgtgttctga gtagagcttg taataaatac tgcagcttga gtttttagtg 2881 gaagcttcta aatggtgctg cagatttgat atttgcattg aggaaatatt aattttccaa 2941 tgcacagttg ccacatttag tcctgtactg tatggaaaca ctgattttgt aaagttgcct 3001 ttatttgctg ttaactgtta actatgacag atatatttaa gccttataaa ccaatcttaa 3061 acataataaa tcacacattc agttttttct ggtaaaaaaa aaaaaaaaaa // LOCUS HUMAUANTIG 2331 bp mRNA PRI 05-MAR-1996 DEFINITION Homo sapiens autoantigen mRNA, complete cds. ACCESSION L05425 NID g179284 KEYWORDS autoantigen. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2331) AUTHORS Racevskis,J., Dill,A., Stockert,R. and Fineberg,S.A. TITLE Cloning of a novel nucleolar guanosine 5'-triphosphate binding protein autoantigen from a breast tumor JOURNAL Cell Growth Differ. 7 (2), 271-280 (1996) MEDLINE 96419438 REFERENCE 2 (bases 1 to 2331) AUTHORS Racevskis,J. TITLE Direct Submission JOURNAL Submitted (02-NOV-1992) Janis Racevskis, Montefiore Medical Center, Oncology, 111 East 210th Street, Bronx, NY 10467, USA FEATURES Location/Qualifiers source 1..2331 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="female" /tissue_type="breast tumor" CDS 80..2275 /codon_start=1 /product="autoantigen" /db_xref="PID:g179285" /translation="MVKPKYKGRSTINPSKASTNPDRVQGAGGQNMRDRATIRRLNMY RQKERRNSRGKIIKPLQYQSTVASGTVARVEPNIKWFGNTRVIKQSSLQKFQEEMDTV MKDPYKVVMKQSKLPMSLLHDRIRPHNLKVHILDTESFETTFGPKSQRKRPNLFASDM QSLIENAEMSTESYDQGKDRDLVTEDTGVRNEAQEEIYKKGQSKRIWGELYKVIDSSD VVVQVLDARDPMGTRSPHIETYLKKEKPWKHLIFVLNKCDLVPTWATKRWVAVLSQDY PTLAFHASLTNPFGKGAFIQLLRQFGKLHTDKKQISVGFIGYPNVGKSSVINTLRSKK VCNVAPIAGETKVWQYITLMRRIFLIDCPGVVYPSEDSETDIVLKGVVQVEKIKSPED HIGAVLERAKPEYISKTYKIDSWENAEDFLEKLAFRTGKLLKGGEPDLQTVGKMVLND WQRGRIPFFVKPPNAEPLVAPQLLPSSSLEVVPEAAQNNPGEEVTETAGEGSESIIKE ETEENSHCDANTEMQQILTRVRQNFGKINVVPQFSGDDLVPVEVSDLEEELESFSDEE EEEQEQQRDDAEESSSEPEEENVGNDTKAVIKALDEKIAKYQKFLDKAKAKKFSAVRI SKGLSEKIFAKPEEQRKTLEEDVDDRAPSKKGKKRKAQREEEQEHSNKAPRALTSKER RRAVRQQRPKKVGVRYYETHNVKNRNRNKKKTNDSEGQKHKRKKFRQKQ" polyA_signal 2292..2297 /note="putative" BASE COUNT 785 a 472 c 571 g 503 t ORIGIN 1 cttcgcggct tggtaacata aaagtttgtt tcaccacgta agccggacct cgcactccgg 61 tcccggtctc gtcgccaaga tggtgaagcc caagtacaaa ggacggagca ccatcaaccc 121 gtccaaggcc agcacaaacc cagatcgagt gcagggagca ggaggccaaa acatgaggga 181 ccgggccacc atccggcgcc tgaatatgta taggcaaaag gagcgcagga acagtcgtgg 241 taaaataatt aaacccctgc aatatcaatc aacggtggct tctggcacag tggcaagagt 301 agagccaaat attaaatggt ttggaaacac acgtgtgatt aagcagtcat cattacaaaa 361 atttcaagag gaaatggata cagttatgaa ggatccatac aaagttgtca tgaagcaaag 421 caagttacca atgtctcttc tccatgatcg aatccggcct cataacttga aggtgcacat 481 tcttgatact gaaagttttg aaactacatt tggccctaag tcacagagga aacgaccaaa 541 cttatttgca agtgatatgc agtctcttat cgaaaatgct gaaatgtcca ctgagagcta 601 tgaccagggc aaggatcgtg atttggtaac tgaagacact ggtgtgagaa atgaagctca 661 agaagagatc tataaaaagg gacagtccaa aagaatatgg ggtgagctct acaaggtgat 721 agattcatca gatgttgtag ttcaagttct tgatgctaga gatccaatgg gtactcgttc 781 ccctcacatt gaaacttacc tgaagaagga aaaaccttgg aaacacctca tttttgtact 841 taacaaatgt gaccttgttc caacctgggc aacaaaacgg tgggttgctg tcctctccca 901 ggattatcca acacttgctt tccatgcaag ccttactaac ccgtttggca agggagcatt 961 cattcagctt ctgcggcagt ttggaaagtt gcacactgac aagaaacaga tcagtgttgg 1021 gttcattggc tatccaaatg ttggcaagag ctctgtgata aatacattgc gttctaagaa 1081 agtttgcaac gtggctccca ttgcaggtga aacaaaggtc tggcagtata ttactttgat 1141 gcgtcggata ttcctgattg actgtccagg tgtggtttac ccctctgagg actccgagac 1201 agacattgtg ctaaaaggag ttgttcaagt agaaaaaatt aagagtcctg aagaccacat 1261 tggtgctgta cttgaacgag caaagccaga atatatcagc aaaacataca agattgattc 1321 ttgggagaat gctgaggact ttcttgagaa gctcgctttc cggactggga agttactaaa 1381 gggtggagag cccgacttgc agactgtggg taagatggtc ctcaatgact ggcagagggg 1441 ccggattcct ttctttgtca agccacccaa tgcagagcca cttgtggccc cccagcttct 1501 accctcctca tctttggaag ttgtcccaga agcagcccag aacaatccag gggaggaagt 1561 cacagaaact gcaggtgaag ggtcagaatc catcattaag gaagaaacag aagagaacag 1621 tcactgtgat gctaacacag agatgcagca gattctcaca cgagttcggc agaactttgg 1681 taaaatcaac gtggtgcctc agttttctgg ggatgacctg gttcctgtgg aggtgtcaga 1741 tcttgaggaa gagcttgaga gcttttctga tgaagaggag gaggaacagg agcaacaaag 1801 agatgatgcg gaagagtctt cctcggagcc tgaggaggaa aatgtgggaa acgacaccaa 1861 agccgttatt aaagcactgg atgagaagat tgccaaatat cagaagtttc tagacaaagc 1921 caaagccaaa aagttttcag cagtcagaat atccaaggga ctgagtgaaa agatatttgc 1981 aaaacctgaa gaacaaagaa aaacactgga agaagatgta gatgacagag caccttccaa 2041 aaagggaaag aagcggaagg cacaaaggga agaggaacag gaacattcaa ataaagctcc 2101 cagggcgctt acatcaaaag aacggaggcg agcagtacga cagcaacggc cgaaaaaagt 2161 tggtgtgcgc tactatgaaa cacacaacgt gaaaaatagg aacaggaaca aaaagaagac 2221 caatgactca gagggacaga aacacaaacg caaaaaattc agacaaaagc agtaatgttt 2281 aaaaggtttt tattaaatta tacaaaaaca ggcaaaaaaa aaaaaaaaaa a // LOCUS HUMAUAP69X 1672 bp mRNA PRI 11-APR-1995 DEFINITION Human autoantigen p69 mRNA, complete cds. ACCESSION L21181 NID g437366 KEYWORDS autoantigen p69; diabetes. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1672) AUTHORS Miyazaki,I., Gaedigk,R., Hui,M.F., Cheung,R.K., Morkowski,J., Rajotte,R.V. and Dosch,H.M. TITLE Cloning of human and rat p69 cDNA, a candidate autoimmune target in type 1 diabetes JOURNAL Biochim. Biophys. Acta 1227 (1-2), 101-104 (1994) MEDLINE 95002197 REFERENCE 2 (bases 1 to 1672) AUTHORS Dosch,H.M. TITLE Direct Submission JOURNAL Submitted (31-DEC-1993) H.M. Dosch, Immunology and Cancer, The Hospital for Sick Children, 555 University Avenue, Toronto, Ontario, Canada, M5G 1X8 FEATURES Location/Qualifiers source 1..1672 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="beta cell" /tissue_type="pancreatic islets" CDS 77..1528 /note="putative" /codon_start=1 /function="target antigen in autoimmune (Type I) Diabetes Mellitus" /product="autoantigen p69" /db_xref="PID:g437367" /translation="MSGHKCSYPWDLQDRYAQDKSVVNKMQQKYWETKQAFIKATGKK EDEHVVASDADLDAKLELFHSIQRTCLDLSKAIVLYQKGICFLSQEENELGKFLRSQG FQDKTRAGKMMQATGKALCFSSQQRLALRNPLCRFHQEVETFRHRAISDTWLTVNRME QCRTEYRGALLWMKDVSQELDPDLYKQMEKFRKVQTQVRLAKKNFDKLKMDVCQKVDL LGASRCNLLSHMLATYQTTLLHFWEKTSHTMAAIHESFKGYQPYEFTTLKSLQDPMKK LVEKEEKKKINQQESTDAAVQEPSQLISLEEENQRKESSSFKTEDGKSILSALDKGST HTACSGPIDELLDMKSEEGACLGPVAGTPEPEGADKDDLLLLSEIFNASSLEEGEFSK EWAAVFGDGQVKEPVPTMALGEPDPKAQTGSGFLPSQLLDQNMKDLQASLQEPAKAAS DLTAWFSLFADLDPLSNPDAVGKTDKEHELLNA" BASE COUNT 526 a 368 c 389 g 389 t ORIGIN 1 ataatataac ttatcctctc atgctttttt cctgcccctt ctccccaaat catcaacaat 61 agaagaagaa gaaaacatgt caggacacaa atgcagttat ccctgggact tacaggatcg 121 atatgctcaa gataagtcag ttgtaaataa gatgcaacag aaatattggg agacgaagca 181 ggcctttatt aaagccacag ggaagaagga agatgaacat gttgttgcct ctgacgcgga 241 cctggatgcc aagctagagc tgtttcattc aattcagaga acctgtctgg acttatcgaa 301 agcaattgta ctctatcaaa aggggatatg tttcttgtct caagaagaaa acgaactggg 361 aaaatttctt cgatcccaag gtttccaaga taaaaccaga gcaggaaaga tgatgcaagc 421 gacaggaaag gccctctgct tttcttccca gcaaaggttg gccttacgaa atcctttgtg 481 tcgatttcac caagaagtgg agacttttcg gcatcgggcc atctcagata cttggctgac 541 ggtgaaccgc atggaacagt gcaggacgga atatagagga gcactattat ggatgaagga 601 cgtgtctcag gagcttgatc cagacctcta caagcaaatg gagaagttca ggaaggtaca 661 aacacaagtg cgccttgcaa aaaaaaactt tgacaaattg aagatggatg tttgtcaaaa 721 agtggatctt cttggagcga gcagatgcaa tctcttgtct cacatgctag caacatacca 781 gaccactctg cttcattttt gggagaaaac ttctcacact atggcagcca tccatgagag 841 tttcaaaggt tatcaaccat atgaatttac tactttaaag agcttacaag accctatgaa 901 aaaattagtt gagaaagaag agaagaagaa aatcaaccag caggaaagta cagatgcagc 961 cgtgcaggag ccgagccaat taatttcatt agaggaagaa aaccagcgca aggaatcctc 1021 tagttttaag actgaagatg gaaaaagtat tttatctgcc ttagacaaag gctctacaca 1081 tactgcatgc tcaggaccca tagatgaact attagacatg aaatctgagg aaggtgcttg 1141 cctgggacca gtggcaggga ccccggaacc tgaaggtgct gacaaagatg acctgctgct 1201 gttgagtgag atcttcaatg cttcctcctt ggaagagggc gagttcagca aagagtgggc 1261 cgctgtgttt ggagacggcc aagtgaagga gccagtgccc actatggccc tgggagagcc 1321 agaccccaag gcccagacag gctcaggttt ccttccttcg cagcttttag accaaaatat 1381 gaaagactta caggcctcgc tacaagaacc tgctaaggct gcctcagacc tgactgcctg 1441 gttcagcctc ttcgctgacc tcgacccact ctcaaatcct gatgctgttg ggaaaaccga 1501 taaagaacac gaattgctca atgcatgaat ctgtaccctt cggagggcac tcacatgccg 1561 cccccagcag ctcccctggg ggctagcaga agtataaagt gatcagtatg ctgttttaat 1621 aattatgtgc cattttaata aaatgaaagg gtcaacggcc ctgttaaaaa aa // LOCUS HUMAUTANT 4659 bp mRNA PRI 30-SEP-1994 DEFINITION Human autoantigen mRNA, complete cds. ACCESSION L26339 NID g533201 KEYWORDS autoantigen. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4659) AUTHORS Bloch,D.B., Rabkina,D., Quertermous,T. and Bloch,K.D. TITLE The immunoreactive region in a novel autoantigen contains a nuclear localization sequence JOURNAL Clin. Immunol. Immunopathol. 72 (3), 380-389 (1994) MEDLINE 94340813 FEATURES Location/Qualifiers source 1..4659 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HepG2" /tissue_lib="Clontech" CDS 137..3784 /codon_start=1 /product="autoantigen" /db_xref="PID:g533202" /translation="MASCASIDIEDATQHLRDILKLDRPAGGPSAESPRPSSAYNGDL NGLLVPDPLCSGDSTSANKTGLRTMPPINLQEKQVICLSGDDSSTCIGILAKEVEIVA SSDSSISSKARGSNKVKIQPVAKYDWEQKYYYGNLIAVSNSFLAYAIRAANNGSAMVR VISVSTSERTLLKGFTGSVADLAFAHLNSPQLACLDEAGNLFVWRLALVNGKIQEEIL VHIRQPEGTPLNHFRRIIWCPFIPEESEDCCEESSPTVALLHEDRAEVWDLDIVRSSH STWPVDVSQIKQGFIVVKGHSTCLSEGALSPDGTVLATASHDGYVKFWQIYIEGQDEP RCLHEWKPHDGRPLSCLLFCDNHKKQDPDVPFWRFLITGADQNRELKMWCTVSWTCLQ TIRFSPDIFSSVSVPPSLKVCLDLSAEYLILSDVQRKVLYVMELLQNQEEGHACFSSI SEFLLTHPVLSFGIQVVSRCRLRHTEVLPAEEENDSLGADGTHGAGAMESAAGVLIKL FCVHTKALQDVQIRFQPQLNPDVVAPLPTHTAHEDFTFGESRPELGSEGLGSAAHGSQ PDLRRIVELPAPADFLSLSSETKPKLMTPDAFMTPSASLQQITASPSSSSSGSSSSSS SSSSSLTAVSAMSSTSAVDPSLTRPPEELTLSPKLQLDGSLTMSSSGSLQASPRGLLP GLLPAPADKLTPKGPGQVPTATSALSLELQEVEPLGLPQASPSRTRSPDVISSASTAL SQDIPEIASEALSRGFGSSAPEGLEPDSMASAASALHLLSPRPRPGPELGPQLGLDGG PGDGDRHNTPSLLEAALTQEASTPDSQVWPTAPDITRETCSTLAESPRNGLQEKHKSL AFHRPPYHLLQQRDSQDASAEQSDHDDEVASLASASGGFGTKVPAPRLPAKDWKTKGS PRTSPKLKRKSKKDDGDAAMGSRLTEHQVAEPPEDWPALIWQQQRELAELRHSQEELL QRLCTQLEGLQSTVTGHVERALETRHEQEQRRLERALAEGQQRGGHWQEQLTQQLSQA LSSAVAGRLERSIRDEIKKTVPPCVSRSLEPMAGQLSNSVATKLTAVEGSMKENISKL LKSKNLTDAIARAAADTLQGPMQAAYREAFQSVVLPAFEKSCQAMFQQINDSFRLGTQ EYLQQLESHMKSRNGREQEAREPVLAQLRGLVSTLQSATEQMQPPWPAVFVLRCSTSC MWLWAACRSPF" BASE COUNT 973 a 1417 c 1366 g 903 t ORIGIN 1 gaattccgaa ctgcgggtgg actgtgtagt gaccggcgtc ccgctgtctc gccccgtggc 61 gggtgagcga gggtgcgtgg tgcgcggcgg cggcggaacg aacgcggtgc gggcggggcg 121 cccgccgcaa gggcccatgg cctcctgcgc gagcatcgac atcgaggacg ccacgcagca 181 cctgcgggac atcctcaagc tggaccggcc cgcgggcggc cccagtgcag agagcccacg 241 gccatccagt gcctacaatg gggacctcaa tggacttctg gtcccagacc cgctctgctc 301 aggtgatagt acctcagcaa acaagactgg tcttcggacc atgccaccca ttaacctgca 361 agagaagcag gtcatctgtc tctcaggaga tgatagctcc acctgcattg ggattttggc 421 caaggaggtg gagattgtgg ctagcagtga ctctagcatt tcaagcaagg cccggggaag 481 caacaaggtg aaaattcagc ctgtcgccaa gtatgactgg gaacagaagt actactatgg 541 caacctgatt gctgtgtcta actccttctt ggcctatgcc attcgggctg ccaacaatgg 601 ctctgccatg gtgcgggtga tcagcgtcag cacttcggag cggaccttgc tcaagggctt 661 cacaggcagt gtggctgatc tggctttcgc gcacctcaac tctccacagc tggcctgcct 721 ggatgaggca ggcaacctgt tcgtgtggcg cttggctctg gttaatggca aaattcaaga 781 agagatcttg gtccatattc ggcagccaga gggcacgcca ctgaaccact ttcgcaggat 841 catctggtgc cccttcatcc ctgaggagag cgaagactgc tgtgaggaga gcagcccaac 901 agtggccctg ctgcatgaag accgggctga ggtgtgggac ctggacatcg tccgctccag 961 ccacagtacc tggcctgtgg atgttagcca gatcaagcag ggcttcattg tggtaaaagg 1021 tcatagcacg tgcctcagtg aaggagccct ctctcctgat gggactgtgc tggctactgc 1081 gagccacgat ggctatgtca agttctggca gatctacatt gaggggcaag atgagccaag 1141 gtgtctgcac gagtggaaac ctcatgatgg gcggcccctc tcctgcctcc tgttctgtga 1201 caaccataag aaacaagacc ctgatgtccc tttctggagg ttccttatta ctggtgctga 1261 ccagaaccga gagttaaaga tgtggtgtac agtatcctgg acctgcctgc agactattcg 1321 cttctcccca gatatcttca gctcagtgag tgtgccccct agcctcaagg tttgcttgga 1381 cctctcagca gaatacctga ttctcagcga tgtgcaacgg aaggtcctct atgtgatgga 1441 gctgctgcaa aaccaggagg agggccacgc ctgcttcagc tccatctcgg agttcctgct 1501 cacccaccct gtgctgagct ttggtatcca ggttgtgagt cgctgccggc tacggcacac 1561 tgaggtgctg cctgccgaag aggaaaatga cagcctgggt gctgatggta cccatggagc 1621 cggtgccatg gagtctgcgg ccggtgtgct catcaagctc ttttgtgtgc atactaaggc 1681 actgcaagat gtgcagatcc gcttccagcc acagctgaac cctgatgtgg tggccccact 1741 gcccacccac actgcccacg aggacttcac atttggagag tctcggcccg aactgggctc 1801 tgagggcctg gggtcagccg ctcacggctc ccagcctgac ctccgacgaa tcgtggagct 1861 gcctgcacct gccgacttcc tcagtctgag cagtgagacc aagcccaagt tgatgacacc 1921 tgacgccttc atgacaccta gcgcctcctt gcagcagatc actgcctctc ccagcagcag 1981 cagcagcggt agcagcagca gcagcagcag tagcagcagc tcccttacag ctgtgtctgc 2041 catgagcagc acctcagctg tggacccctc cttgaccagg ccacctgagg agctgacctt 2101 gagccccaag ctgcagctgg atggcagcct gacaatgagc agcagtggca gccttcaggc 2161 aagcccgcgt ggcctcctgc ctggcctgct cccagcccca gctgacaaac tgactcccaa 2221 ggggccgggc caggtgccta ctgccacctc tgcactgtcc ctggagctgc aggaagtgga 2281 gcccctgggg ctaccccaag cctcccctag ccgcactcgt tcccctgatg tcatctcctc 2341 agcttccact gccctgtccc aggacatccc tgagattgca tctgaggccc tgtcccgtgg 2401 ttttggctcc tctgcaccag agggccttga gccagacagt atggcttcag ccgcctcggc 2461 actgcacctg ctgtccccac ggccccggcc agggcccgag ctcggccccc agctcgggct 2521 tgatggaggc cctggggatg gagatcggca taataccccc tccctcctgg aggcagcctt 2581 gacccaggag gcctcgactc ctgacagtca ggtttggccc acagcacctg acattactcg 2641 tgagacctgc agcaccctgg cagaaagccc caggaatggc cttcaggaaa agcacaagag 2701 cctggccttc caccgaccac catatcacct gctgcagcaa cgtgacagcc aggatgccag 2761 tgctgagcaa agtgaccatg atgatgaggt ggccagcctt gcctctgctt caggaggctt 2821 tggcaccaaa gttcctgctc cacggctgcc tgccaaggac tggaagacca agggatcccc 2881 tcgaacctca cccaagctca agaggaaaag caagaaggat gatggggatg cagccatggg 2941 atcccggctc acagagcacc aggtggcaga gccccctgag gactggccag cactaatttg 3001 gcaacagcag agagagctgg cagagctgcg gcacagccag gaagagctgc tgcagcgtct 3061 gtgtacccaa ctcgaaggcc tgcagagcac agtcacaggc cacgtagaac gtgcccttga 3121 gactcggcac gagcaggaac agcggcggct ggagcgagca ctggctgagg ggcagcagcg 3181 gggagggcac tggcaggagc agctgacaca acagttgtcc caagcactgt cgtcagctgt 3241 agctgggcgg ctagagcgca gcatacggga tgagatcaag aagacagtcc ctccatgtgt 3301 ctcaaggagt ctggagccta tggcaggcca actgagcaac tcagtggcta ccaagctcac 3361 agctgtggag ggcagcatga aagagaacat ctccaagctg ctcaagtcca agaacttgac 3421 tgatgccatc gcccgagcag ctgcagacac attacaaggg ccgatgcagg ctgcctaccg 3481 ggaagccttc cagagtgtgg tgctgccggc ctttgagaag agctgccagg ccatgttcca 3541 gcaaatcaat gatagcttcc ggctggggac acaggaatac ttgcagcagc tagaaagcca 3601 catgaagagc cggaacggac gggaacagga ggccagggag cctgtgctag cccagctgcg 3661 gggcctggtc agcacactgc agagtgccac tgagcagatg cagccaccgt ggccggcagt 3721 gttcgtgctg aggtgcagca ccagctgcat gtggctgtgg gcagcctgca ggagtccatt 3781 ttagcacagg tacagcgcat cgttaagggt gaggtgagtg tggcgctcaa ggagcagcag 3841 gccgccgtca cctccagcat catgcaggcc atgcgctcag ctgctggcac acctgtaccc 3901 tctgcccacc ttgactgcca ggcccagcaa gcccatatcc tgcagctgct gcagcagggc 3961 cacctcaatc aggccttcca gcaggcgctg acagctgctg acctgaacct ggtgctgtat 4021 gtgtgtgaaa ctgtggaccc agcccaggtt tttgggcagc caccctgccc gctctcccag 4081 cctgtgctcc tttccctcat ccagcagctg gcatctgacc ttggcactcg aactgacctc 4141 aagctcagct acctggaaga ggccgtgatg cacctggacc acagtgaccc catcactcgg 4201 gaccacatgg gctccgttat ggcccaggtg cgccaaaagc tttttcagtt cctgcaggct 4261 gagccacaca actcacttgg caaagcagct cggcgtctca gcctcatgct gcatggcctc 4321 gtgaccccca gcctccctta gctgctaagc ctgccttgcc caggggtggg atggcactga 4381 aggccagcag acaggcctag gctggggcag ggtcacggct ggcctttacc tgctcaggcc 4441 ccatctctgg ggtgtttggg ggtcagggag cagggagcac tggccgtggt ctacagcgtg 4501 tggtagtcag aaggtttagc tgggcccagg gcaggtattg cgcctgcttg ggttctgcca 4561 tgcctggagc atgaccctga gatcgtgaca ccacttgagt ggaattttcc atgttccttt 4621 ttacctctaa tttggatctt tttgtttttg aaaaacata // LOCUS HUMB12A 3512 bp mRNA PRI 31-DEC-1994 DEFINITION Human B12 protein mRNA, complete cds. ACCESSION M80783 NID g179303 KEYWORDS B12 protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3512) AUTHORS Wolf,F.W., Marks,R.M., Sarma,V., Byers,M.G., Katz,R.W., Shows,T.B. and Dixit,V.M. TITLE Characterization of a novel tumor necrosis factor-alpha-induced endothelial primary response gene JOURNAL J. Biol. Chem. 267 (2), 1317-1326 (1992) MEDLINE 92112779 FEATURES Location/Qualifiers source 1..3512 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="endothelial" gene 154..1104 /gene="B12" CDS 154..1104 /gene="B12" /codon_start=1 /product="B12 protein" /db_xref="PID:g179304" /translation="MSGDTCLCPASGAKPKLSGFKGGGLGNKYVQLNVGGSLYYTTVR ALTRHDTMLKAMFSGRMEVLTDKEGWILIDRCGKHFGTILNYLRDDTITLPQNRQEIK ELMAEAKYYLIQGLVNMCQSALQDKKDSYQPVCNIPIITSLKEEERLIESSTKPVVKL LYNRSNNKYSYTSNSDDHLLKNIELFDKLSLRFNGRVLFIKDVIGDEICCWSFYGQGR KLAEVCCTSIVYATEKKQTKVEFPEARIYEETLNVLLYETPRVPDNSLLEATSRSRSQ ASPSEDEETFELRDRVRRIHVKRYSTYDDRQLGHQSTHRD" BASE COUNT 840 a 962 c 867 g 843 t ORIGIN 1 cgagccaccc agctgagagg agaggcgccc ccggggacgc actgagatta tgaggctctg 61 gcctccactg gccactcact cgtgaccctt tccaccacgg cggagccttc caagcctacc 121 tcctgccgtg tggtgatcta cctgcagcgg gagatgtcgg gggacacctg cctgtgccca 181 gcctcagggg ccaagcccaa gctcagtggc ttcaagggag gagggttggg caacaagtat 241 gtccagctca acgtgggcgg ctctctgtac tacaccactg tgcgggccct gacccgccac 301 gacaccatgc tcaaggccat gttcagtggg cgcatggagg tgctgaccga caaagaaggc 361 tggatcctca tagaccgttg tggaaagcac tttggcacca ttttgaatta cctccgagat 421 gacaccatca ccctccctca gaaccggcaa gaaatcaagg aattgatggc tgaagcaaag 481 tattacctca tccaggggct ggtgaatatg tgccagagtg ccctgcagga caagaaggac 541 tcctaccagc ctgtgtgcaa catccccatc atcacatccc taaaggagga ggagcggctc 601 atcgaatcct ccaccaagcc cgtggtgaag ctgctgtaca acagaagcaa caacaagtat 661 tcctacacca gcaactctga cgaccacctg ctgaaaaaca tcgagctgtt tgacaagctc 721 tccctgcgct tcaacggccg cgtgctcttc atcaaggatg tcattggtga cgagatctgc 781 tgctggtcct tttatggcca gggccgtaag ctggcagagg tgtgctgtac ctccatcgtg 841 tatgccacgg agaagaagca gaccaaggtg gaattcccag aggcccgaat ctatgaggag 901 acactcaacg tcctactcta tgagactccc cgcgtccccg acaactcctt gttggaggcc 961 acaagccgta gccgcagcca ggcttccccc agtgaagatg aggagacctt tgaactgcgg 1021 gaccgtgtcc gccgcatcca cgtcaagcgc tacagcactt acgatgaccg gcagctcggc 1081 caccagtcta cccatcgcga ctgaccagac cctcagggag tcagggcacg ggaggcccta 1141 tctcccatcc tgtggaaccc gccccattgg ccaccccatg ctgctgctgc ctgggtctct 1201 gctctagcac ccagaggcat gacaggccct gctcagaggt cagagggtct gggcagagga 1261 gggaccacat tcccctgcct tgcccctgag cacttctgga gactgcgtcc tgtcctatct 1321 gctcaccatc acccttcctg cccgacggag ctgcttctgc tccctggggc atatggactg 1381 acccacctcc tgctgagaac cttcccctag gccctgtgca gaagggctac tgccccttag 1441 gcctcagctg ggggaaaggc agttctggtg ctgtagaggc cctggtgcag aaagtgggac 1501 gtcctttttc ctaaggtgtt taagcacagg cttgataagt ttggttttta aaaaataatc 1561 taggaaatga ataattctaa atctagtaat gaggaaactg agcatttctt ttgccctcca 1621 gggtgccaag accctacata tgacagaacc cttggccctt ctccatgcct gtgggatctg 1681 tttctttaaa gcactttgta ctgttattca ggaggttgat aatctccttg acccatgtct 1741 ttctacccta atccccactt ccctgcagaa tcaatctgag ggaggggata aagaggaagc 1801 aataaaaaaa aaacatccga cagagcagct ctggctttgc cagcctggcc agcagctcag 1861 agtgcaccga ggagggaagg atggctaagc tgggaccggc agtcctcaca gggtgcctgt 1921 gagaaaggac attttacccc cacatcatag tcacatcact gactcctagg tctagcacga 1981 ctgctctttg tgatcctctt gagtaccctt ggcttccagc catgctgtcc tcacatacgg 2041 taaagccaaa gagctgtcac atgggccaga aacatgagcc acggcaggaa gaccgtggag 2101 cccgtgggca ctgcatggtg ttggctggca tgcccatcag ctgaggacag caaactccca 2161 gcagccccta cagaggtggc acatgcttgg ccacacatct actccctgcc cacaccatct 2221 atgctcttgg ttggtgctgg ctgggatggc ggttctgccc agtggtgtct ctgagcgcgg 2281 gatgacagga gcaaccgaag caccctgaag gccttcactc cttgttgggt aactcagcca 2341 tggagatgcc aagcactagc caggaggtga gttcctcttt agggctttgg ttttcattcc 2401 tttttgtttg gcttggccaa accagaattc agcttatctg aattattttc caaaggaatg 2461 ctgtcaggga gggactgttc tgccagccta acaaagcaac gtagccacgt atagtaccca 2521 cttctgctct ttggagagaa cacaggttat caagttcatc tctcttgact actcttatga 2581 tagctgatgc cacagagcct atgggcaaat gccagaccca gggttagaca caaggacctg 2641 aagtgacatg acggcgggac aggggaaatg tgactttcta attaggcatt ttatgttagt 2701 cacagtcttg aatgtataaa cagcactaag actctcaggt caggtacctt ggtgatcagc 2761 tactagttct tccagccctc attgaggtaa caagataaag acaaatccac ttctttggcc 2821 aaattcaggc tttggcttta tgactttccc acagagactg gaatgcgtca gcctgagacc 2881 actggcctat tttctcagct gccctcttga ggtcctttaa cactcaaatt cccagctccc 2941 cactgaggtg ttgtgatgct tgccttttga cctccccatc ccctttagtc cctgcttact 3001 actttgacat tcacatcctc agtgtctcag tcttttttgc cgagaaagca cagtagtctg 3061 ggactgggca tttatcttct ctgactgaaa atctctcctt ggtcttaagg aaaatactaa 3121 cattgaactc actgacatga tcttagcttc tttaatcaga ctttgtgact taaaagtttg 3181 ggggttttct ttgaaagttt ccagccctat tcagaaagca actcttggct gtgtgcattt 3241 ttcaactcca agcagcccag gggtaagtaa acaaagtatg gatgaaggtc agattttctt 3301 gtcagtttct gagaaacctg gcagcctgct gttaacaaca caggccagta ttgggtttta 3361 ttgaatttgg tatgtgacca aggtcggcct aaaggatggc gcaggtcctg ggcaggaaag 3421 aatttttcct ttatcacata actgtaatat ttggttgctc agcataagtg atggaagcaa 3481 acactaattt ctaataaaat tgtgttaaac tc // LOCUS HUMB17HSD 1344 bp mRNA PRI 08-DEC-1993 DEFINITION Human 17 beta hydroxysteroid dehydrogenase type 2 mRNA, complete cds. ACCESSION L11708 NID g306461 KEYWORDS 17 beta hydroxysteroid dehydrogenase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1344) AUTHORS Wu,L., Einstein,M., Geissler,W.M., Chan,H.K., Elliston,K.O. and Andersson,S. TITLE Expression cloning and characterization of human 17 beta-hydroxysteroid dehydrogenase type 2, a microsomal enzyme possessing 20 alpha-hydroxysteroid dehydrogenase activity JOURNAL J. Biol. Chem. 268, 12964-12969 (1993) MEDLINE 93286147 REFERENCE 2 (bases 1 to 1344) AUTHORS Andersson,S. TITLE Direct Submission JOURNAL Submitted (16-JUL-1993) S. Andersson, Department of Biochemistry, Merck Research Laboratories, Rahway, NJ 07065 USA FEATURES Location/Qualifiers source 1..1344 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /tissue_type="prostate" CDS 84..1247 /codon_start=1 /function="20 alpha-hydroxysteroid dehydrogenase activity" /product="17 beta hydroxysteroid dehydrogenase type 2" /db_xref="PID:g306462" /translation="MSTFFSDTAWICLAVPTVLCGTVFCKYKKSSGQLWSWMVCLAGL CAVCLLILSPFWGLILFSVSCFLMYTYLSGQELLPVDQKAVLVTGGDCGLGHALCKYL DELGFTVFAGVLNENGPGAEELRRTCSPRLSVLQMDITKPVQIKDAYSKVAAMLQDRG LWAVINNAGVLGFPTDGELLLMTDYKQCMAVNFFGTVEVTKTFLPLLRKSKGRLVNVS SMGGGAPMERLASYGSSKAAVTMFSSVMRLELSKWGIKVASIQPGGFLTNIAGTSDKW EKLEKDILDHLPAEVQEDYGQDYILAQRNFLLLINSLASKDFSPVLRDIQHAILAKSP FAYYTPGKGAYLWICLAHYLPIGIYDYFAKRHFGQDKPMPRALRMPNYKKKAT" polyA_site 1344 BASE COUNT 331 a 323 c 355 g 335 t ORIGIN 1 cagaactcag gctgcctcca gccagccttt gcccgctaga ctcactggcc ctgagcactt 61 gaaggtgcag caagtcactg agaatgagca ctttcttctc ggacacagca tggatctgcc 121 tggctgtccc cacagtacta tgtgggacag tattttgcaa atacaagaag agctcagggc 181 agctgtggag ctggatggtc tgcctggcag gcctctgtgc agtctgcctg ctcatcctgt 241 cccctttttg gggcttgatc ctcttctcgg tgtcatgctt cctcatgtat acttacttat 301 ctggccaaga attgttacct gtggatcaga aggcagtcct ggtgacaggt ggtgattgcg 361 ggcttggcca tgctttgtgc aagtatctgg atgagctggg cttcacggta tttgccggag 421 ttttgaatga aaatggccca ggagctgagg aattgcgaag aacctgctct ccgcgcctct 481 cggtgctcca aatggacatc acgaagccag tgcagataaa agatgcttac agcaaggttg 541 cagcaatgct gcaggacaga ggactgtggg ctgtgatcaa caatgctggg gtgcttggct 601 ttccaactga tggggagctt cttcttatga ctgactacaa acaatgcatg gccgtgaact 661 tctttggaac tgtggaggtc acaaagacgt ttttgcctct tcttagaaaa tccaaaggga 721 ggctggtgaa tgtcagcagc atgggaggag gggccccaat ggaaaggctg gcatcttatg 781 gctcatcaaa ggcggctgtg accatgttct catcagttat gagactggag ctttccaagt 841 ggggaattaa agttgcttcc atccaacctg gaggcttcct aacaaatatc gcaggcacca 901 gtgacaagtg ggaaaagctg gagaaggaca ttctggacca cctccccgct gaggtacagg 961 aagactacgg ccaggactac atcttagcac agcggaattt cctcctattg atcaactcgt 1021 tagccagcaa ggacttctct ccggtgctgc gggacatcca gcatgctatc ttggcgaaga 1081 gcccttttgc ctattacacg ccagggaaag gcgcttactt gtggatctgc cttgctcact 1141 atttgcctat tggcatatat gattactttg ctaaaagaca ttttggccaa gacaagccca 1201 tgcccagagc tctaagaatg cctaactaca agaaaaaggc cacctaggca atggaagccc 1261 tcaaagaagt cggaatgtca tagtcttgaa atgaaaggga aactgggaaa ctgggtttct 1321 cattaaagtt gtttcccact ctga // LOCUS HUMB1LYM 1146 bp DNA PRI 15-JUL-1993 DEFINITION Human B-lymphocyte cell-surface antigen B1 (CD20). ACCESSION M27394 J03574 NID g179307 KEYWORDS B1 antigen; antigen. SOURCE Human tonsillar lymphocyte cDNA to mRNA, clone pB1-21. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1146) AUTHORS Tedder,T.F., Streuli,M., Schlossman,S.F. and Saito,H. TITLE Isolation and structure of a cDNA encoding the B1 (CD20) cell-surface antigen of human B lymphocytes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 208-212 (1988) MEDLINE 88124792 COMMENT Submitted in computer readable form by T.Tedder 23-NOV-1987. FEATURES Location/Qualifiers source 1..1146 /organism="Homo sapiens" /db_xref="taxon:9606" /map="11q12-13" CDS 134..1027 /codon_start=1 /product="cell surface antigen B1" /db_xref="PID:g179308" /translation="MTTPRNSVNGTFPAEPMKGPIAMQSGPKPLFRRMSSLVGPTQSF FMRESKTLGAVQIMNGLFHIALGGLLMIPAGIYAPICVTVWYPLWGGIMYIISGSLLA ATEKNSRKCLVKGKMIMNSLSLFAAISGMILSIMDILNIKISHFLKMESLNFIRAHTP YINIYNCEPANPSEKNSPSTQYCYSIQSLFLGILSVMLIFAFFQELVIAGIVENEWKR TCSRPKSNIVLLSAEEKKEQTIEIKEEVVGLTETSSQPKNEEDIEIIPIQEEEEEETE TNFPEPPQDQESSPIENDSSP" BASE COUNT 349 a 247 c 224 g 326 t ORIGIN Chromosome 11q12-13. 1 cctcaatgac actcatggag gaaatgctga gagaagcatt cagatgcatg acacaaggta 61 agactgccaa aaatcttgtt cttgctctcc tcattttgtt atttgtttta tttttaggag 121 ttttgagagc aaaatgacaa cacccagaaa ttcagtaaat gggactttcc cggcagagcc 181 aatgaaaggc cctattgcta tgcaatctgg tccaaaacca ctcttcagga ggatgtcttc 241 actggtgggc cccacgcaaa gcttcttcat gagggaatct aagactttgg gggctgtcca 301 gattatgaat gggctcttcc acattgccct ggggggtctt ctgatgatcc cagcagggat 361 ctatgcaccc atctgtgtga ctgtgtggta ccctctctgg ggaggcatta tgtatattat 421 ttccggatca ctcttggcag caacggagaa aaactctagg aagtgtttgg tcaaaggaaa 481 aatgataatg aattcattga gcctctttgc tgccatttct ggaatgattc tttcaatcat 541 ggacatactt aatattaaaa tttcccattt tttaaaaatg gagagtctga attttattag 601 agctcacaca ccatatatta acatatacaa ctgtgaacca gctaatccct ctgagaaaaa 661 ctccccatct acccaatact gttacagcat acaatctctg ttcttgggca ttttgtcagt 721 gatgctgatc tttgccttct tccaggaact tgtaatagct ggcatcgttg agaatgaatg 781 gaaaagaacg tgctccagac ccaaatctaa catagttctc ctgtcagcag aagaaaaaaa 841 agaacagact attgaaataa aagaagaagt ggttgggcta actgaaacat cttcccaacc 901 aaagaatgaa gaagacattg aaattattcc aatccaagaa gaggaagaag aagaaacaga 961 gacgaacttt ccagaacctc cccaagatca ggaatcctca ccaatagaaa atgacagctc 1021 tccttaagtg atttcttctg ttttctgttt ccttttttaa acattagtgt tcatagcttc 1081 caagagacat gctgactttc atttcttgag gtactctgca catacgcacc acatctctat 1141 ctggcc // LOCUS HUMB5A 2472 bp mRNA PRI 14-AUG-1996 DEFINITION Human putative transmembrane protein precursor (B5) mRNA, complete cds. ACCESSION L38961 NID g624703 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2472) AUTHORS Lissy,N.A., Bellacosa,A., Sonoda,G., Miller,P.D., Jhanwar,S.C. and Testa,J.R. TITLE Isolation, characterization, and mapping to human chromosome 11q24-25 of a cDNA encoding a highly conserved putative transmembrane protein, TMC JOURNAL Biochim. Biophys. Acta, Gene Struct. Expr. 1306 (2-3), 137-141 (1996) MEDLINE 96221283 FEATURES Location/Qualifiers source 1..2472 /organism="Homo sapiens" /db_xref="taxon:9606" 5'UTR 1..106 /gene="B5" gene 1..2455 /gene="B5" CDS 107..2224 /gene="B5" /note="putative transmembrane protein precursor" /codon_start=1 /db_xref="PID:g624704" /translation="MTKFGFLRLSYEKQDTLLKLLILSMAAVLSFSTRLFAVLRFESV IHEFDPYFNYRTTRFLAEEGFYKFHNWFDDRAWYPLGRIIGGTIYPGLMITSAAIYHV LHFFHITIDIRNVCVFLAPLFSSFTSIVTYLLTKELKDAGAGLLAAAMIAVVPGYISR SVAGSYDNEGIAIFCMLLTYYMWIKAVKTGSICWAAKCALAYFYMVSSWGGYVFLINL IPLHVLVLMLTGRFSHRIYVAYCTVYCLGTILSRQISFVGFQPVLSSEHMAGFGVFGL CQIHAFVDYLRSKLNPQQFEVLFRSVISLVGFVLLTVGALLMLTGKISPWTGRFYSLL DPSYAKNNIPIIASVSEHQPTTWSSYYFDLQLLVFMFPVGLYYCFSNLSDARIFIIMY GVTSMYFSAVMVRLMLVLAPVMSILSGIGVSQVLSTYMKNLDISRPDKKSKKQQDSTY PIKIEVASGMILVMAFFLITYTFHSTWVTSEAYSSPSIVLSARGGDGSRIIFDDFREA YYWLRHNTPEDAKVMSWWDYGYQITAMANRTILVDNNTWNNTHISRVGQAMASTEEKA YEIMRELDVSYVLVIFGGLTGYSSDDINKFLWMVRIGGSTDTGKHIKENDYYTPTGEF RVDREGSPVLLNCLMYKMCYYRFGQVYTEAKRPPGFDRVRNAEIGNKDFELDVLEEGY TTEHWLVRIYKVKDLDNRGLSRT" sig_peptide 107..196 /gene="B5" mat_peptide 197..2221 /gene="B5" /note="putative transmembrane protein" misc_feature 452..499 /gene="B5" /note="encodes putative transmembrane domain" misc_feature 533..580 /gene="B5" /note="encodes putative transmembrane domain" misc_feature 620..667 /gene="B5" /note="encodes putative transmembrane domain" misc_feature 746..793 /gene="B5" /note="encodes putative transmembrane domain" misc_feature 815..862 /gene="B5" /note="encodes putative transmembrane domain" misc_feature 1013..1060 /gene="B5" /note="encodes putative transmembrane domain" misc_feature 1199..1246 /gene="B5" /note="encodes putative transmembrane domain" misc_feature 1310..1357 /gene="B5" /note="encodes putative transmembrane domain" misc_feature 1475..1522 /gene="B5" /note="encodes putative transmembrane domain" 3'UTR 2223..2472 polyA_site 2455 /gene="B5" BASE COUNT 566 a 568 c 583 g 755 t ORIGIN 1 ctgccagggt tgggtgcgcc gctgaacgga tggctgaggg agccccgcgg atcgttagga 61 aagccggcca gctgatcgtc gtgtgttgcc acccattcat gtcaagatga ctaagtttgg 121 atttttgcga ttgtcctatg agaagcagga cacacttttg aagcttctca ttctgtcaat 181 ggctgctgta ttatccttct ccactcgtct gtttgctgtc ctgagatttg aaagtgttat 241 ccatgagttt gatccgtact ttaattatcg gactaccagg ttcctggctg aggaggggtt 301 ttataaattc cataactggt ttgatgaccg agcctggtac cctttgggac gaatcattgg 361 aggaacaatt tacccaggtt taatgatcac ctctgctgca atctaccatg tactccattt 421 tttccacatc accatcgaca ttcggaatgt ctgtgtgttc ctggcccctc tcttctcctc 481 cttcacctcc atcgtcacgt acctccttac caaagagctc aaggatgcag gggctgggct 541 tcttgctgct gccatgattg ctgtagttcc tggatatatc tcccgatctg tggctggctc 601 ctatgataat gaagggattg ccatcttttg catgctactc acctactaca tgtggatcaa 661 ggcagtaaag actggttcca tctgttgggc agctaagtgt gcccttgctt atttctacat 721 ggtctcgtca tggggaggtt atgtgttcct gatcaactta attcctctcc acgtcctcgt 781 gctgatgctc acaggccgtt tctctcaccg gatctatgtg gcctactgta ctgtttactg 841 cctgggtact atactttcta ggcagatctc ctttgtgggt ttccagcctg tcctttcatc 901 agagcacatg gcagggtttg gggtctttgg tctctgccag atccatgcct ttgtggatta 961 cctgcgcagc aagttgaatc cacaacaatt tgaagttctt ttccggagcg tcatctctct 1021 ggtaggcttt gtccttctca ccgtgggagc tctcctcatg ctgacaggaa aaatatctcc 1081 ctggacgggg cgtttctact cactgctgga tccctcttat gctaagaaca acatccccat 1141 cattgcttct gtgtctgagc atcagcccac aacctggtcc tcatactatt ttgacctgca 1201 gctcctcgtc ttcatgtttc cagttggcct ctattactgc tttagcaacc tgtctgatgc 1261 ccggattttt atcatcatgt atggtgtgac cagcatgtac ttttcagctg taatggtgcg 1321 tctaatgcta gtgttggcac ctgttatgag cattctctct ggcattggag tctcccaggt 1381 gctgtccaca tacatgaaga atctggacat aagtcgccca gacaagaaga gcaagaagca 1441 acaggattcc acctacccta ttaagattga agtggcaagt gggatgatac tggtcatggc 1501 tttctttctc atcacctaca cctttcattc aacctgggtg accagtgagg cctactcttc 1561 tccgtccatt gtactatctg cccgtggtgg ggatggcagt aggatcatat ttgatgactt 1621 ccgagaagca tattattggc ttcgtcataa tactccagag gatgcgaagg tcatgtcctg 1681 gtgggattat ggctatcaga ttacagctat ggcaaaccga acaattttag tggacaataa 1741 cacatggaat aatacccata tttctcgagt agggcaggca atggcgtcca cagaggaaaa 1801 agcctatgag atcatgaggg agctcgatgt cagctatgtg ctggtcattt ttggaggcct 1861 cactgggtat tcctctgatg atatcaacaa gtttctttgg atggtccgga ttggagggag 1921 cacagataca ggcaaacata tcaaggagaa tgactattat actccaactg gggagttccg 1981 tgtggaccgt gaaggttctc cagtgctgct caactgcctc atgtacaaga tgtgttacta 2041 tcgctttgga caggtttaca cagaagccaa gcgtcctcca ggctttgacc gtgtccgaaa 2101 tgctgagatt gggaataaag actttgagct tgatgtcctg gaggaaggct ataccacaga 2161 acattggctg gtcaggatat acaaggtaaa ggacctggat aatcgaggct tgtcaaggac 2221 ataaatgtca cgtccagctc tgatatcttc gcactgagca catcacattt aggacgttga 2281 agattttttt tttttttttt tttttaatat gcagtttgta agaacaaaac tggatggcat 2341 ccgaattgtc tggaagtttt gtcttgggca tgatgggctg ggccaaatga aatgattttt 2401 ataattctaa acaggttacc aaatgaaatg tcatggcttt actttggtca attaaagggg 2461 ggaatttttt ta // LOCUS HUMB61 1480 bp mRNA PRI 31-DEC-1994 DEFINITION Human B61 mRNA, complete cds. ACCESSION M57730 M37476 NID g179320 KEYWORDS intermediate-early response gene. SOURCE Human umbilical vein endothelial cell, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1480) AUTHORS Holzman,L.B., Marks,R.M. and Dixit,V.M. TITLE A novel immediate-early response gene of endothelium is induced by cytokines and encodes a secreted protein JOURNAL Mol. Cell. Biol. 10 (11), 5830-5838 (1990) MEDLINE 91042512 FEATURES Location/Qualifiers source 1..1480 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="umbilical vein endothelium" mRNA <1..1480 /gene="B61" gene 1..1480 /gene="B61" sig_peptide 74..127 /gene="B61" /note="putative" /product="B61" CDS 74..691 /gene="B61" /note="putative" /codon_start=1 /product="B61" /db_xref="PID:g179321" /translation="MEFLWAPLLGLCCSLAAADRHTVFWNSSNPKFRNEDYTIHVQLN DYVDIICPHYEDHSVADAAMEQYILYLVEHEEYQLCQPQSKDQVRWQCNRPSAKHGPE KLSEKFQRFTPFTLGKEFKEGHSYYYISKPIHQHEDRCLRLKVTVSGKITHSPQAHVN PQEKRLAADDPEVRVLHSIGHSAAPRLFPLAWTVLLLPLLLLQTP" mat_peptide 128..688 /gene="B61" /note="putative" /product="B61" polyA_signal 1458..1463 /gene="B61" /evidence=experimental BASE COUNT 359 a 422 c 393 g 306 t ORIGIN 1 gcggagaaag ccagtgggaa cccagaccca taggagaccc gcgtccccgc tcggcctggc 61 caggccccgc gctatggagt tcctctgggc ccctctcttg ggtctgtgct gcagtctggc 121 cgctgctgat cgccacaccg tcttctggaa cagttcaaat cccaagttcc ggaatgagga 181 ctacaccata catgtgcagc tgaatgacta cgtggacatc atctgtccgc actatgaaga 241 tcactctgtg gcagacgctg ccatggagca gtacatactg tacctggtgg agcatgagga 301 gtaccagctg tgccagcccc agtccaagga ccaagtccgc tggcagtgca accggcccag 361 tgccaagcat ggcccggaga agctgtctga gaagttccag cgcttcacac ctttcaccct 421 gggcaaggag ttcaaagaag gacacagcta ctactacatc tccaaaccca tccaccagca 481 tgaagaccgc tgcttgaggt tgaaggtgac tgtcagtggc aaaatcactc acagtcctca 541 ggcccatgtc aatccacagg agaagagact tgcagcagat gacccagagg tgcgggttct 601 acatagcatc ggtcacagtg ctgccccacg cctcttccca cttgcctgga ctgtgctgct 661 ccttccactt ctgctgctgc aaaccccgtg aaggtgtatg ccacacctgg ccttaaagag 721 ggacaggctg aagagaggga caggcactcc aaacctgtct tggggccact ttcagagccc 781 ccagccctgg gaaccactcc caccacaggc ataagctatc acctagcagc ctcaaaacgg 841 gtcagtatta aggttttcaa ccggaaggag gccaaccagc ccgacagtgc catccccacc 901 ttcacctcgg agggacggag aaagaagtgg agacagtcct ttcccaccat tcctgccttt 961 aagccaaaga aacaagctgt gcaggcatgg tcccttaagg cacagtggga gctgagctgg 1021 aaggggccac gtggatgggc aaagcttgtc aaagatgccc cctccaggag agagccagga 1081 tgcccagatg aactgactga aggaaaagca agaaacagtt tcttgcttgg aagccaggta 1141 caggagaggc agcatgcttg ggctgaccca gcatctccca gcaagacctc atctgtggag 1201 ctgccacaga gaagtttgta gccaggtact gcattctctc ccatcctggg gcagcactcc 1261 ccagagctgt gccagcaggg gggctgtgcc aacctgttct tagagtgtag ctgtaagggc 1321 agtgcccatg tgtacattct gcctagagtg tagcctaaag ggcagggccc acgtgtatag 1381 tatctgtata taagttgctg tgtgtctgtc ctgatttcta caactggagt ttttttatac 1441 aatgttcttt gtctcaaaat aaagcaatgt gttttttcgg // LOCUS HUMB72A 1112 bp mRNA PRI 31-DEC-1994 DEFINITION Human CTLA4 counter-receptor (B7-2) mRNA, complete cds. ACCESSION L25259 NID g416368 KEYWORDS CTLA4 counter-receptor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1112) AUTHORS Freeman,G.J., Gribben,J.G., Boussiotis,V.A., Ng,J.W., Restivo,V.A. Jr., Lombard,L.A., Gray,G.S. and Nadler,L.M. TITLE Cloning of B7-2: a CTLA-4 counter-receptor that costimulates human T cell proliferation [see comments] JOURNAL Science 262 (5135), 909-911 (1993) MEDLINE 94053735 FEATURES Location/Qualifiers source 1..1112 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="B lymphocyte" gene 107..1096 /gene="B7-2" CDS 107..1096 /gene="B7-2" /codon_start=1 /product="CTLA4 counter-receptor" /db_xref="PID:g416369" /translation="MDPQCTMGLSNILFVMAFLLSGAAPLKIQAYFNETADLPCQFAN SQNQSLSELVVFWQDQENLVLNEVYLGKEKFDSVHSKYMGRTSFDSDSWTLRLHNLQI KDKGLYQCIIHHKKPTGMIRIHQMNSELSVLANFSQPEIVPISNITENVYINLTCSSI HGYPEPKKMSVLLRTKNSTIEYDGIMQKSQDNVTELYDVSISLSVSFPDVTSNMTIFC ILETDKTRLLSSPFSIELEDPQPPPDHIPWITAVLPTVIICVMVFCLILWKWKKKKRP RNSYKCGTNTMEREESEQTKKREKIHIPERSDEAQRVFKSSKTSSCDKSDTCF" sig_peptide 107..175 /gene="B7-2" mat_peptide 176..1093 /gene="B7-2" /product="CTLA4 counter-receptor" polyA_site 1112 BASE COUNT 346 a 237 c 230 g 299 t ORIGIN 1 cacagggtga aagctttgct tctctgctgc tgtaacaggg actagcacag acacacggat 61 gagtggggtc atttccagat attaggtcac agcagaagca gccaaaatgg atccccagtg 121 cactatggga ctgagtaaca ttctctttgt gatggccttc ctgctctctg gtgctgctcc 181 tctgaagatt caagcttatt tcaatgagac tgcagacctg ccatgccaat ttgcaaactc 241 tcaaaaccaa agcctgagtg agctagtagt attttggcag gaccaggaaa acttggttct 301 gaatgaggta tacttaggca aagagaaatt tgacagtgtt cattccaagt atatgggccg 361 cacaagtttt gattcggaca gttggaccct gagacttcac aatcttcaga tcaaggacaa 421 gggcttgtat caatgtatca tccatcacaa aaagcccaca ggaatgattc gcatccacca 481 gatgaattct gaactgtcag tgcttgctaa cttcagtcaa cctgaaatag taccaatttc 541 taatataaca gaaaatgtgt acataaattt gacctgctca tctatacacg gttacccaga 601 acctaagaag atgagtgttt tgctaagaac caagaattca actatcgagt atgatggtat 661 tatgcagaaa tctcaagata atgtcacaga actgtacgac gtttccatca gcttgtctgt 721 ttcattccct gatgttacga gcaatatgac catcttctgt attctggaaa ctgacaagac 781 gcggctttta tcttcacctt tctctataga gcttgaggac cctcagcctc ccccagacca 841 cattccttgg attacagctg tacttccaac agttattata tgtgtgatgg ttttctgtct 901 aattctatgg aaatggaaga agaagaagcg gcctcgcaac tcttataaat gtggaaccaa 961 cacaatggag agggaagaga gtgaacagac caagaaaaga gaaaaaatcc atatacctga 1021 aagatctgat gaagcccagc gtgtttttaa aagttcgaag acatcttcat gcgacaaaag 1081 tgatacatgt ttttaattaa agagtaaagc cc // LOCUS HUMB94 4180 bp mRNA PRI 07-JUN-1993 DEFINITION Homo sapiens B94 protein mRNA, complete cds. ACCESSION M92357 NID g306463 KEYWORDS B94 protein; tumor necrosis factor inducible gene. SOURCE Homo sapiens umbilical vein cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4180) AUTHORS Sarma,V., Wolf,F.W., Marks,R.M., Shows,T.B.Jr.. and Dixit,V.M. TITLE Cloning of a novel tumor necrosis factor-alpha-inducible primary response gene that is differentally expressed in development and capillary tube-like formation in vitro JOURNAL J. Immunol. 148, 3302-3312 (1992) MEDLINE 92251199 FEATURES Location/Qualifiers source 1..4180 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="endothelial" /tissue_type="umbilical vein" CDS 132..2096 /codon_start=1 /product="B94 protein" /db_xref="PID:g179331" /translation="MSEASSEDLVPPLEAGAAPYREEEEAAKKKKEKKKKSKGLANVF CVFTKGKKKKGQPSSAEPEDAAGSRQGLDGPPPTVEELKAALERGQLEAARPLLALER ELAAAAAAGGVSEEELVRRQSKVEALYELLRDQVLGVLRRPLEAPPERLRQALAVVAE QEREDRQAAAAGPGTSGLAATRPRRWLQLWRRGVAEAAEERMGQRPAAGAEVPESVFL HLGRTMKEDLEAVVERLKPLFPAEFGVVAAYAESYHQHFAAHLAAVAQFELCERDTYM LLLWVENLYPNDIINSPKLVGELQGMGLGSLLPPRQIRLLEATFLSSEAANVRELMDR ALELEARRWAEDVPPQRLDGHCHSELAIDIIQITSQAQAKAESITLDLGSQIKRVLLV ELPAFLRSYQRAFNEFLERGKQLTNYRANVIANINNCLSFRMSMEQNWQVPQDTLSLL LGPLGELKSHGFDTLLQNLHEDLKPLFKRFTHTRWAAPVETLENIIATVDTRLPEFSE LQGCFREELMEALHLHLVKEYIIQLSKGRLVLKTAEQQQQLAGYILANADTIQHFCTQ HGSPATWLQPALPTLAEIIRLQDPSAIKIEVATYATCYPDFSKGHLSAILAIKGNLSN SEVKRIRSILDVSMGAQEPSRPLFSLIKVG" BASE COUNT 787 a 1249 c 1285 g 859 t ORIGIN 1 ccagggtgat gctgaagatg atgaccttct tccaaggcct ctagagccat cagcctgtgc 61 caggcaccct cgacttgcct agaggccccc aaaagttgca gtccacatca gaggcagagt 121 cagaggcctc catgtcggag gcctcctctg aggacctggt gccacccctg gaggctgggg 181 cagccccata tagggaggag gaagaggcgg cgaagaagaa gaaggagaag aagaagaagt 241 ccaaaggcct ggccaatgtg ttctgcgtct tcaccaaagg gaagaagaag aagggtcagc 301 ccagctcagc ggagcccgag gacgcagccg ggtccaggca ggggctggat ggcccgcccc 361 ccacagtgga ggagctgaag gcggcgctgg agcgcgggca gctggaggcg gcgcggccgc 421 tgctggcgct ggagcgggag ctggcggcgg cggcggcggc gggcggtgtg agcgaggagg 481 agctggtgcg gcgccagagc aaggtggagg cgctgtacga gctgctgcgc gaccaggtgc 541 tgggcgtgct gcggcggccg ctggaggcgc cgcccgagcg gctgcgccag gcgctggccg 601 tggtggcgga gcaggagcgc gaggaccgcc aggcggcggc ggcggggccg gggacctcgg 661 ggctggcggc cacgcgcccg cggcgctggc tgcagctgtg gcggcgcggc gtggcggagg 721 cggccgagga gcgcatgggc cagcggccgg ccgcgggcgc cgaggtcccc gagagcgtct 781 ttctgcactt gggccgcacc atgaaggagg acctggaggc cgtggtggag cggctgaagc 841 cgctgttccc cgccgagttc ggcgtcgtgg cggcctacgc cgagagctac caccagcact 901 tcgcggccca cctggccgcc gtggcgcagt tcgagctgtg cgagcgcgac acctacatgc 961 tgctgctctg ggtggagaac ctctacccca atgacatcat caacagcccc aagctggtgg 1021 gtgagctgca gggtatgggg ctcgggagcc tcctgccccc caggcagatc cgactgctgg 1081 aggccacatt cctgtccagt gaggcggcca atgtgaggga gttgatggac cgagctctgg 1141 agctagaggc acggcgctgg gctgaggatg tgcctcccca gaggctggac ggccactgcc 1201 acagcgagct ggccatcgac atcatccaga tcacctccca ggcccaggcc aaggccgaga 1261 gcatcacgct ggacttgggc tcacagataa agcgggtgct gctggtggag ctgcctgcgt 1321 tcctgaggag ctaccagcgc gcctttaatg aatttctgga gagaggcaag cagctgacga 1381 attacagggc caatgttatt gccaacatca acaactgcct gtccttccgg atgtccatgg 1441 agcagaattg gcaggtaccc caggacaccc tgagcctcct gctgggcccc ctgggtgagc 1501 tcaagagcca cggctttgac accctgctcc agaacctgca tgaggacctg aagccactgt 1561 tcaagaggtt cacgcacacc cgctgggcgg cccctgtgga gaccctggaa aacatcatcg 1621 ccactgtaga cacgaggctg cctgagttct cagagctgca gggctgtttc cgggaggagc 1681 tcatggaggc cttgcacctg cacctggtga aggagtacat catccaactc agcaaggggc 1741 gcctggtcct caagacggcc gagcagcagc agcagctggc tgggtacatc ctggccaatg 1801 ctgacaccat ccagcacttc tgcacccagc acggctcccc ggcgacctgg ctgcagcctg 1861 ctctccctac gctggccgag atcattcgcc tgcaggaccc cagtgccatc aagattgagg 1921 tggccactta tgccacctgc taccctgact tcagcaaagg ccacctgagc gctatcctgg 1981 ccatcaaggg gaacctatcc aacagtgagg tcaagcgcat ccggagcatc ttggacgtca 2041 gcatgggggc gcaggagccc tcccggcccc tattttccct tataaaggtt ggttagcttt 2101 tcctgtggcc tgacctgcct gtgagtgccc agcaagcctt gggcacaccc cgctgggagc 2161 tgttaagagc agcgctggtt ctcggttcct cccgggtctc ctgtgctctg atgctacttc 2221 tgcctagccc tggcggaggt gcaggccctg tcagctggaa ctggacagac cttggtttgt 2281 ttacatgtcc gatgggggca ggagctccca tcctgggcag ccaaccaggc aacaccaagg 2341 actctttgta aacgatagct gatcgtgtgc acgcaaggaa agaaccagga gggagagtgc 2401 agccaggctc agggatcccc ggacacctct gtccagagcc cctccacagt cggcctcatg 2461 actgtcctcc tcgtgggtgg ggccgagggc cctcttcagc tctctggaga caggggccga 2521 gcctcaccca tctgccctct gcagcccagg gccgccgtga gcgggattca gcaatggtgg 2581 aatggaagac agaactggaa gagaaagaag gaaaagatga gctctcgtct ggcaggggct 2641 tttagggtcc tgtggcgagc tgtgagcacc gccagcgtta gacgtcacat ccaggtggcc 2701 ccacggcccc tacaggctgg ccctgcaatg gggccctgag ccctccctct tcatccccca 2761 aggcctcaac tagagggtgg tcccccgagg gcttggtgtc tactaccgaa gggcccaaga 2821 cctcctgggt cctctcaggc tcccccttcc ccaaggcagg gacaggccct gggggtgcca 2881 ccgtgggccc tgccacccag aagtctggct gaggtctggg caggggcagg gcaagcttga 2941 cctctcactg ttgacccttt ggcctctgta tttgtttcct attgccgtga caggtttcca 3001 caaacttcgt ggatcaaaac gaggtcttcc agttctgcgg gtcagaaggc tgacctgggg 3061 ctcaaatctg ggtgtcggca gtcctgcact ccttctggag gctctagggg agaattcatt 3121 tctggccttt tcatttttag aggctgaccg taattcttga cttcaggctc ctccatcttc 3181 agagccagct gtgggtagtt gaatcttttt cccgtcacct cattgaggcc tcccctctcc 3241 tgcctccctc caccactttt tttttttttt ttttgagaca gggtcttgct gtgttgccca 3301 ggctggagtg cagtggcctg gtcatggcat caaggctcac tgcagcctgg acctcctggt 3361 tcaagtgatc ctcttgtctc agtcccctga gacaatcccc cacgcccagc tacatatttt 3421 tgtggataca gggtctcatt ctgttgccta ggcttgtctg gaactcctgg gctcaaggga 3481 tcttgtagcc ttagcctcct aaagtgctgg gattataggc atgagtcact cgtacccggc 3541 ctgctctacc gcttttaagg acgcttatga tcacattgcg cctacccaga gaacccaggt 3601 cgtctttcta ttttcaggtc agctgattag ccaccttagt tccatctgca actttagttc 3661 ccactggctg tgtaacctaa catagtcaca ggctctgggg actgtcacgt ggacatcttt 3721 gggaggccgt tattctgccc accgcaccct ccgttcatcc cctgccctgc cgggcacctc 3781 gctctacccc aggaaaatgt gagctcgttt tcctgctcgg catgtgctcc ccctaaggct 3841 ctgctcctcc ctgggcctga aagttccttc tcagcctgag agggggccct tcgatctcag 3901 gcatgactca gcccggctga tgcctctgca gtgctgagtc aggatttggg gccggctctc 3961 ttgggtctgt ccccttttcc caggtactgc cttacaaagc tgtggccagg aagtggccgg 4021 tataaaggat gcccaaggtc tttgtacgtg tgtaggagtt agcgtgtttg atattgttaa 4081 tataataata attatttttt agagtactgc ttttgtatgt atgttgaaca ggatccaggt 4141 ttttatagct tgatataaaa cagaattcaa aagtgaaaaa // LOCUS HUMBADPTA 5701 bp mRNA PRI 15-SEP-1990 DEFINITION Human beta adaptin mRNA, complete cds. ACCESSION M34175 J05273 NID g179332 KEYWORDS beta adaptin. SOURCE Human fibroblast, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5701) AUTHORS Ponnambalam,S., Robinson,M.S., Jackson,A.P., Peiperl,L. and Parham,P. TITLE Conservation and diversity in families of coated vesicle adaptins JOURNAL J. Biol. Chem. 265, 4814-4820 (1990) MEDLINE 90202947 FEATURES Location/Qualifiers source 1..5701 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..5701 /note="beta adaptin mRNA" CDS 178..2991 /note="beta adaptin" /codon_start=1 /db_xref="PID:g179333" /translation="MTDSKYFTTNKKGEIFELKAELNNEKKEKRKEAVKKVIAAMTVG KDVSSLFPDVVNCMQTDNLELKKLVYLYLMNYAKSQPDMAIMAVNSFVKDCEDPNPLI RALAVRTMGCIRVDKITEYLCEPLRKCLKDEDPYVRKTAAVCVAKLHDINAQMVEDQG FLDSLRDLIADSNPMVVANAVAALSEISESHPNSNLLDLNPQNINKLLTALNECTEWG QIFILDCLSNYNPKDDREAQSICERVTPRLSHANSAVVLSAVKVLMKFLELLPKDSDY YNMLLKKLAPPLVTLLSGEPEVQYVALRNINLIVQKRPEILKQEIKVFFVKYNDPIYV KLEKLDIMIRLASQANIAQVLAELKEYATEVDVDFVRKAVRAIGRCAIKVEQSAERCV STLLDLIQTKVNYVVQEAIVVIRDIFRKYPNKYESIIATLCENLDSLDEPDARAAMIW IVGEYAERIDNADELLESFLEGFHDESTQVQLTLLTAIVKLFLKKPSETQELVQQVLS LATQDSDNPDLRDRGYIYWRLLSTDPVTAKEVVLSEKPLISEETDLIEPTLLDELICH IGSLASVYHKPPNAFVEGSHGIHRKHLPIHHGSTDAGDSPVGTTTATNLEQPQVIPSQ GDLLGDLLNLDLGPPVNVPQVSSMQMGAVDLLGGGLDSLVGQSFIPSSVPATFAPSPT PAVVSSGLNDLFELSTGIGMAPGGYVAPKAVWLPAVKAKGLEISGTFTHRQGHIYMEM NFTNKALQHMTDFAIQFNKNSFGVIPSTPLAIHTPLMPNQSIDVSLPLNTLGPVMKME PLNNLQVAVKNNIDVFYFSCLIPLNVLFVEDGKMERQVFLATWKDIPNENELQFQIKE CHLNADTVSSKLQNNNVYTIAKRNVEGQDMLYQSLKLTNGIWILAELRIQPGNPNYTL SLKCRAPEVSQYIYQVYDSILKN" polyA_signal 5683..5688 BASE COUNT 1528 a 1373 c 1284 g 1516 t ORIGIN 1 ctgcccacca tctttgtccc tggcaaagtg ggttttgcgc agtggcttag acctagaaaa 61 gaatcgtgac gggcaggaaa ccattacacc accacctggg ctgtgctctc cggctcccgc 121 cgccaccccc gccctcgcct tcgcctccgc tccggtgcac attaaagatc caaagtcatg 181 actgactcca agtatttcac aaccaataaa aaaggagaaa tatttgaact aaaagctgaa 241 ctcaacaatg aaaagaaaga aaagagaaag gaggctgtga agaaagtgat tgctgctatg 301 accgtgggga aggatgttag ttctctcttt ccagacgtag tgaactgtat gcagactgac 361 aatctggaac taaagaagct tgtgtatctc tacttgatga actacgccaa gagtcagcca 421 gacatggcca tcatggctgt aaacagcttt gtgaaggact gtgaagatcc taatcctttg 481 attcgagcct tggcagtcag aaccatgggg tgcatccggg tagacaaaat tacagaatat 541 ctctgtgagc cgctccgcaa gtgcttgaag gatgaggatc cctatgttcg gaaaacagca 601 gcagtctgcg tggcaaaact ccatgatatc aatgcccaaa tggtggaaga tcagggattt 661 ctggattctc tacgggatct catagcagat tcaaatccaa tggtggtggc taatgccgta 721 gcggcattat ctgaaatcag tgagtctcac ccaaacagca acttacttga tctgaaccca 781 cagaacatta ataagctgct gacagccctg aatgaatgca ctgaatgggg ccagattttc 841 atcctggact gcctgtctaa ttacaaccct aaagatgatc gggaggctca gagcatctgt 901 gagcgggtaa ctccccggct atcccatgcc aactcagcag tggtgctttc agcggtaaaa 961 gtcctaatga agtttctaga attgttacct aaggattctg actactacaa tatgctgctg 1021 aagaagttag cccctccact tgtcactttg ctgtctgggg agccagaagt gcagtatgtc 1081 gccctgagga acatcaactt aattgtccag aaaaggcctg aaatcttgaa gcaggaaatc 1141 aaagtcttct ttgtgaagta caatgatccc atctatgtta aactagagaa gttggacatc 1201 atgattcgtt tggcatctca agccaacatt gctcaggttc tggcagaact gaaagaatat 1261 gctacagagg tggatgttga ctttgttcga aaagctgtgc gggccattgg acggtgtgcc 1321 atcaaggtgg agcaatctgc agagcgctgt gtaagcacat tgcttgatct aatccagacc 1381 aaagtgaatt atgtggtcca agaagcaatt gttgtcatca gggacatctt ccgcaaatac 1441 cccaacaagt atgaaagtat catcgccact ctgtgtgaga acttagactc gctggatgag 1501 ccagatgctc gagcagctat gatttggatt gtgggagaat atgctgaaag aattgacaat 1561 gcagatgagt tactagaaag cttcctggag ggttttcacg atgaaagcac ccaggtgcag 1621 ctcactctgc ttactgccat agtgaagctg tttctcaaga aaccatcaga aacacaggag 1681 ctagtccagc aggtcttgag tttggcaaca caggattctg ataatcctga ccttcgagac 1741 cggggctata tttattggcg ccttctctca actgaccctg ttacagctaa agaagtagtc 1801 ttgtctgaga agccactgat ctctgaggag acggacctta ttgagccaac tctgctggat 1861 gagctaatct gccacattgg ttctttggcc tctgtgtatc ataagcctcc caatgctttt 1921 gtggaaggaa gtcatggaat tcatcgtaaa cacttgccaa ttcatcatgg gagcactgat 1981 gcaggtgaca gccctgttgg cactaccact gcaacgaacc tggaacagcc tcaggttatc 2041 ccctctcaag gtgatcttct aggggatctt ttaaaccttg acctcggtcc cccagtcaat 2101 gtgccacagg tgtcctccat gcagatggga gcagtggatc tcctaggagg aggactagat 2161 agtctggtgg gacaatcctt catcccatca tcggtgcctg caacctttgc tccttcacct 2221 acacctgctg tggtcagcag tggactgaat gacctgtttg aactctccac agggataggc 2281 atggcacctg gtggatatgt ggctcctaag gctgtctggc tacctgcagt aaaggctaaa 2341 ggcttggaga tttccggaac atttactcac cgccaagggc acatctatat ggaaatgaac 2401 ttcaccaata aagctctgca gcacatgaca gattttgcaa tccagtttaa caaaaatagc 2461 tttggtgtca tccccagcac tcctctggcc atccatacac cactgatgcc aaaccagagc 2521 attgatgtct ccctgcctct caataccttg ggcccagtca tgaagatgga acctctgaat 2581 aacctccagg tggctgtgaa aaacaatatc gatgtcttct acttcagctg cctcatccca 2641 ctcaatgtgc tttttgtaga agatggcaaa atggagcgcc aggtcttcct tgcaacatgg 2701 aaggatattc ccaatgaaaa tgaacttcag tttcagatta aggaatgtca tttaaatgct 2761 gacactgttt ccagcaagtt gcaaaacaac aatgtttata ctattgccaa gaggaatgtg 2821 gaagggcagg acatgctgta ccaatccctg aagctcacta atggcatttg gattttggcc 2881 gaactacgta tccagccagg aaaccccaat tacacgctgt cactgaagtg tagagctcct 2941 gaagtctctc aatacatcta tcaggtctac gacagcattt tgaaaaacta acaagactgg 3001 tccagtaccc ttcaaccatg ctgtgatcgg tgcaagtcaa gaactcttaa ctggaagaaa 3061 ttgtattgct gcgtagaatc tgaacacact gaggccacct agcaaggtag taactagtct 3121 aacctgtgct aacattaggg cacaacctgt tggatagttt tagcttcctg tgaacatttg 3181 taaccactgc ttcagtcacc tcccacctct tgccacctgc tgctgctatc tgtccttact 3241 tgtgggcttc tccatgctgt gccaatggct ggctttttct acaccctctt ttgagtgtag 3301 tttggtattt tgtaattgag agctcatttc aaaagcagaa aaagacaaca aatattaaag 3361 caaggaaaag tgtaactgaa acactgcact ttactgtttt atacttttgt acatatgaga 3421 aatcaaggga ttagtgcaac cagtagaagg cattgaaatg actgtcatta accacacagt 3481 cctggaggca gagatgcagt tacctaccct agcttttgat gggttctctt acctgtagta 3541 gccttatccc tggtcatttg gattttcagt ttgctttttt ctttttttcc cctccaaact 3601 ccttttcctt ggccaagcct tcatgcttcc ccctttccat attataatct catttgattg 3661 ctctgcagtt gggaacggtg atcttcttga atgatgtttc agtgtgcaaa aactatagag 3721 cctgtcagca ccaaagctga cagaagttat accttactcc tttcctttcc cctgaacaaa 3781 cctgctaatc ccactaattc aggaatttga gtagagatgg ggaacaagaa cccagatgct 3841 gtcccctcac cccctctcct gtatttctca ggtccagttc aaatctaaaa ttctactttt 3901 agagttgaaa cagagtaata acttatctaa ccctcttttc ctacaaagga gaaagataaa 3961 aggcacaaag gttaccgcca aggcccgtca gctgtgtagt ggcaaagccg agaccgagtc 4021 tcctaagtcc ccgtcagtgt ggttttcacc acaggactgt ctcttgtcgt tttcccctaa 4081 tgccttctcc tgccttttct gtgcctagtt tttggctctt cacatattcc atattgattt 4141 tgacgctctg tatattggca tcaggtggca gctgaatatc ttttgaatta ctcgaaggta 4201 aagccagatg ccagaatgaa ggtgtagcca gtgtttccca tatgcccctg gagccccact 4261 tattgaggcc agcagaatag gtgcagagat gaagtgagct tagagatgtt gcaaatgctc 4321 tttatccctt cagctctctg atctgctctt tcttcatgat acttagtctg cagggcatat 4381 taagatcatc ccagaggttc aggcagttcc tgtcatctct gaaaagactg ggggatatga 4441 aatcttcccc ctaccccact taatgcgttg gatatgattt ttcaaagaat gcttcatgcc 4501 caaaatacca gcctgtttag cagtgttaca ctgtttgatc tgcgggcact tgttgcattg 4561 cctggcaccc aatattcagg gtccatgact aagactggtc ttctcagatg ccctgcttaa 4621 atcaggggca cttcaggctc cacaggcgtc atgttggact gagacctaac tcactggact 4681 cagaggagga atcgtggaaa acaagagcaa aactacccca cacccctatt tcatgtctga 4741 aataaccctg tttcatacca gttgcaaagc ttgtggggag cggtcccaca aagcactttc 4801 ttaaaccttg agaatctcca agagaaaaat atttggggaa ggagggagga aatatgtccc 4861 ttgcacacca cccctgaagc acatggcagt aggaaacagc ataggattgt atgtgggagg 4921 tggataggtc ggtgatgtgt ggagcggaaa agcaggttgg taaagttccc ttcttgggac 4981 ttattcctgg agtcagtgga tacaagtagt gcagaaggtt cacactgcaa atagtgttct 5041 catctcaaag caaactatca ttccagaagg aaaagtgtgt cagggcaagc agacaacaca 5101 atttcctatc agaatatgtc cctcaacccc cgaaacaagg cttctctcag cctccccacc 5161 agtgatggat aacagctcct attctcagct gacctgactg agccaaccca tgaactcttc 5221 actccttggg gaagccacct cccatcacac ccctgagcag agttagggag gaattctact 5281 tcccataaaa ggacctctcc tgagaggcaa aacctgttgc ctccaccacg gcttccctct 5341 tggctcattc caagcttggc caaattgggg aagtgggatg gaggttgccc tgcatccccc 5401 ctcctctgcc tgagtgtgtc tttgtaatgt cagctggcat catacaaaga gcaggagaag 5461 caaacaccca gaactctttt gctggtcaga gattccctga gtgtctgtcc tcacccaagc 5521 ctgctctgtg tctgtgttgt gaagcttgag actctggaaa gaaatgggga gggggggcag 5581 gggaaatgtt gccctaagaa tgcttctcat tcctctgttc ttattgggtc ctgtttttcg 5641 ggagggtggg ggttggggga agcttgacct tgtgtcttcg tcaataaact cacatttaca 5701 c // LOCUS HUMBARK1A 2100 bp mRNA PRI 31-DEC-1994 DEFINITION Human beta-adrenergic receptor kinase 1 mRNA, complete cds. ACCESSION M80776 NID g179334 KEYWORDS receptor kinase. SOURCE Homo sapiens blood cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2100) AUTHORS Chuang,T.T., Sallese,M., Ambrosini,G., Parruti,G. and De Blasi,A. TITLE High expression of beta-adrenergic receptor kinase in human peripheral blood leukocytes. Isoproterenol and platelet activating factor can induce kinase translocation JOURNAL J. Biol. Chem. 267 (10), 6886-6892 (1992) MEDLINE 92202245 FEATURES Location/Qualifiers source 1..2100 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="leukocytes" /tissue_type="blood" gene 16..2085 /gene="receptor kinase" CDS 16..2085 /gene="receptor kinase" /codon_start=1 /product="receptor kinase" /db_xref="PID:g179335" /translation="MADLEAVLADVSYLMAMEKSKATPAARASKKILLPEPSIRSVMQ KYLEDRGEVTFEKIFSQKLGYLLFRDFCLNHLEEARPLVEFYEEIKKYEKLETEEERV ARSREIFDSYIMKELLACSHPFSKSATEHVQGHLGKKQVPPDLFQPYIEEICQNLRGD VFQKFIESDKFTRFCQWKNVELNIHLTMNDFSVHRIIGRGGFGEVYGCRKADTGKMYA MKCLDKKRIKMKQGETLALNERIMLSLVSTGDCPFIVCMSYAFHTPDKLSFILDLMNG GDLHYHLSQHGVFSEADMRFYAAEIILGLEHMHNRFVVYRDLKPANILLDEHGHVRIS DLGLACDFSKKKPHASVGTHGYMAPEVLQKGVAYDSSADWFSLGCMLFKLLRGHSPFR QHKTKDKHEIDRMTLTMAVELPDSFSPELRSLLEGLLQRDVNRRLGCLGRGAQEVKES PFFRSLDWQMVFLQKYPPPLIPPRGEVNAADAFDIGSFDEEDTKGIKLLDSDQELYRN FPLTISERWQQEVAETVFDTINAETDRLEARKKAKNKQLGHEEDYALGKDCIMHGYMS KMGNPFLTQWQRRYFYLFPNRLEWRGEGEAPQSLLTMEEIQSVEETQIKERKCLLLKI RGGKQFILQCDSDPELVQWKKELRDAYREAQQLVQRVPKMKNKPRSPVVELSKVPLVQ RGSANGL" BASE COUNT 459 a 619 c 656 g 366 t ORIGIN 1 gccgccgccg ccaagatggc ggacctggag gcggtgctgg ccgacgtgag ctacctgatg 61 gccatggaga agagcaaggc cacgccggcc gcgcgcgcca gcaagaagat actgctgccc 121 gagcccagca tccgcagtgt catgcagaag tacctggagg accggggcga ggtgaccttt 181 gagaagatct tttcccagaa gctggggtac ctgctcttcc gagacttctg cctgaaccac 241 ctggaggagg ccaggccctt ggtggaattc tatgaggaga tcaagaagta cgagaagctg 301 gagacggagg aggagcgtgt ggcccgcagc cgggagatct tcgactcata catcatgaag 361 gagctgctgg cctgctcgca tcccttctcg aagagtgcca ctgagcatgt ccaaggccac 421 ctggggaaga agcaggtgcc tccggatctc ttccagccat acatcgaaga gatttgtcaa 481 aacctccgag gggacgtgtt ccagaaattc attgagagcg ataagttcac acggttttgc 541 cagtggaaga atgtggagct caacatccac ctgaccatga atgacttcag cgtgcatcgc 601 atcattgggc gcgggggctt tggcgaggtc tatgggtgcc ggaaggctga cacaggcaag 661 atgtacgcca tgaagtgcct ggacaaaaag cgcatcaaga tgaagcaggg ggagaccctg 721 gccctgaacg agcgcatcat gctctcgctc gtcagcactg gggactgccc attcattgtc 781 tgcatgtcat acgcgttcca cacgccagac aagctcagct tcatcctgga cctcatgaac 841 ggtggggacc tgcactacca cctctcccag cacggggtct tctcagaggc tgacatgcgc 901 ttctatgcgg ccgagatcat cctgggcctg gagcacatgc acaaccgctt cgtggtctac 961 cgggacctga agccagccaa catccttctg gacgagcatg gccacgtgcg gatctcggac 1021 ctgggcctgg cctgtgactt ctccaagaag aagccccatg ccagcgtggg cacccacggg 1081 tacatggctc cggaggtcct gcagaagggc gtggcctacg acagcagtgc cgactggttc 1141 tctctggggt gcatgctctt caagttgctg cgggggcaca gccccttccg gcagcacaag 1201 accaaagaca agcatgagat cgaccgcatg acgctgacga tggccgtgga gctgcccgac 1261 tccttctccc ctgaactacg ctccctgctg gaggggttgc tgcagaggga tgtcaaccgg 1321 agattgggct gcctgggccg aggggctcag gaggtgaaag agagcccctt tttccgctcc 1381 ctggactggc agatggtctt cttgcagaag taccctcccc cgctgatccc cccacgaggg 1441 gaggtgaacg cggccgacgc cttcgacatt ggctccttcg atgaggagga cacaaaagga 1501 atcaagttac tggacagtga tcaggagctc taccgcaact tccccctcac catctcggag 1561 cggtggcagc aggaggtggc agagactgtc ttcgacacca tcaacgctga gacagaccgg 1621 ctggaggctc gcaagaaagc caagaacaag cagctgggcc atgaggaaga ctacgccctg 1681 ggcaaggact gcatcatgca tggctacatg tccaagatgg gcaacccctt cctgacccag 1741 tggcagcggc ggtacttcta cctgttcccc aaccgcctcg agtggcgggg cgagggcgag 1801 gccccgcaga gcctgctgac catggaggag atccagtcgg tggaggagac gcagatcaag 1861 gagcgcaagt gcctgctcct caagatccgc ggtgggaaac agttcatttt gcagtgcgat 1921 agcgaccctg agctggtgca gtggaagaag gagctgcgcg acgcctaccg cgaggcccag 1981 cagctggtgc agcgggtgcc caagatgaag aacaagccgc gctcgcccgt ggtggagctg 2041 agcaaggtgc cgctggtcca gcgcggcagt gccaacggcc tctgacccgc ccacccgcct // LOCUS HUMBASONU 4890 bp mRNA PRI 25-JAN-1993 DEFINITION Human zinc finger protein basonuclin mRNA, complete cds. ACCESSION L03427 NID g179336 KEYWORDS basonuclin; zinc-finger protein. SOURCE Homo sapiens (individual_isolate FRTS) newborn foreskin cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4890) AUTHORS Tseng,H. and Green,H. TITLE Basonuclin: a keratinocyte protein with multiple paired zinc fingers JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89, 10311-10315 (1992) MEDLINE 93066228 FEATURES Location/Qualifiers source 1..4890 /organism="Homo sapiens" /isolate="FRTS" /db_xref="taxon:9606" /cell_type="keratinocyte" /dev_stage="newborn" /tissue_type="foreskin" CDS 88..3069 /note="ORF; putative" /codon_start=1 /product="basonuclin" /db_xref="PID:g179337" /translation="MRRRPEPGRTRGGRARETRRQPRHRSGRRMAEAISCTLNCSCQS FKPGKINHRQCDQCKHGWVAHALSKLRIPPMYPTSQVEIVQSNVVFDISSLMLYGTQA IPVRLKILLDRLFSVLKQDEVLQILHALDWTLQDYIRGYVLQDASGKVLDHWCIMTSE EEVATLQQFLRFGETKSIVELMAIQEKEEQSIIIPPSTANVDIRAFIESCSHRSSSLP TPVDKGNPSSIHPFENLISNMTFMLPFQFFNPLPPALIGSLPEQYMLEQGHDQSQDPK QEVHGPNPDSSFLTSSSTPFQVEKDQCLNCPDAITKKEDSTHLSDSSSYNIVTKFERT QLSPEAKVKPERNSLGTKKGRVFCTACEKTFYDKGTLKIHYNAVHLKIKHKCTIEGCN MVFSSLRTRNRHSANPNPRLHMPMNRNNRDKDLRNSLNLASSENYKCPGFTVTSPDCR PPPSYPGSGEDSKGQPAFPNIGQNGVLFPNLKTVQPVLPFYRSPATPAEVANTPGILP SLPLLSSSIPEQLISNEMPFDALPKKKSRKSSMPIKIEKEAVEIANEKRHNLSSDEDM PLQVVSEDEQEACSPQSHRVSEEQHVQSGGLGKPFPEGERPCHRESVIESSGAISQTP EQATHNSERETEHTPALIMVPREVEDGGHEHYFTPGMEPQVPFSDYMELQQRLLAGGL FSALSNRGMAFPCLEDSKELEHVGQHALARQIEENRFQCDICKKTFKNACSVKIHHKN MHVKEMHTCTVEGCNATFPSRRSRDRHSSNLNLHQKALSQEALESSEDHFRAAYLLKD VAKEAYQDVAFTQQASQTSVIFKGTSRMGSLVYPITQVHSASLESYNSGPLSEGTILD LSTTSSMKSESSSHSSWDSDGVSEEGTVLMEDSDGNCEGSSLVPGEDEYPICVLMEKA DQSLASLPSGLPITCHLCQKTYSNKGTFRAHYKTVHLRQLHKCKVPGCNTMFSSVRSR NRHSQNPNLHKSLASSPSHLQ" BASE COUNT 1359 a 1185 c 1144 g 1202 t ORIGIN 1 gtggcggggg cggccatcgt gctgcgcagc tgggcgcttg gggagccgcc cacttcgccg 61 ggtcgcgccc cgacggccgg agcgtggatg cggcggcgcc cggagccggg gcggacgcgg 121 ggcggccggg cccgggagac gcgccggcag ccccggcacc gcagcggtcg caggatggcc 181 gaggctatca gctgtactct gaactgtagt tgccaaagtt tcaaacccgg gaaaataaac 241 caccgtcagt gtgaccaatg caagcatgga tgggtggccc acgctctaag taagctaagg 301 atccccccca tgtatccaac aagccaggtg gagattgtcc agtccaatgt agtgtttgat 361 attagcagcc tcatgctcta tgggacccag gccatccccg ttcgcctaaa aatcctactg 421 gaccggctct tcagtgtgtt gaagcaagat gaggttctcc agatcctcca tgccttggac 481 tggacacttc aggattatat ccgtggatac gtactgcagg atgcatcagg aaaggtgttg 541 gatcactggt gcatcatgac cagtgaggaa gaagtggcca ccttgcagca gttccttcgt 601 tttggagaga ccaaatctat agttgaactc atggcaattc aagagaaaga agagcaatcc 661 atcatcatac caccttccac agcaaatgta gatatcaggg ctttcatcga gagctgcagt 721 cacaggagtt ctagcctccc cactcctgtg gacaaaggaa accccagcag tatacacccc 781 tttgagaacc tcataagcaa catgactttc atgctgcctt tccagttctt caaccctctg 841 cctcctgcac tgatagggtc attgcccgaa caatatatgt tggagcaggg tcatgaccaa 901 agtcaggacc ccaaacagga agtccatggg cccaaccctg acagcagctt cttaacttcc 961 agttccacac catttcaggt tgaaaaagat cagtgtttaa actgtccgga tgctattact 1021 aaaaaagaag acagcaccca tttaagtgac tccagctcat acaacattgt cactaagttt 1081 gaaaggacac agttatcccc tgaggccaaa gtgaagcctg agaggaatag ccttggtaca 1141 aagaagggcc gggtgttctg cactgcatgt gagaagacct tctatgacaa aggcaccctc 1201 aaaatccact acaatgccgt ccacttgaag atcaagcata agtgcaccat cgaagggtgt 1261 aacatggtgt tcagctccct aaggacgcgg aatcgccata gcgccaaccc caaccctcgg 1321 ctgcacatgc caatgaacag aaataaccgg gacaaagacc tcaggaacag cctgaacctg 1381 gccagctctg agaactacaa gtgcccaggt ttcacagtga cgtccccaga ctgtaggcct 1441 cctcccagct accctggttc aggagaggat tccaaaggcc aaccagcctt cccaaacatt 1501 gggcaaaatg gtgtgctttt tcccaaccta aagacagtcc agccagtcct tcctttctac 1561 cgcagtccag ccacgcctgc cgaggtagca aacacgcctg ggatactccc ttccctcccg 1621 ctgttgtcct cttcaatccc agaacagctc atttcaaacg aaatgccatt tgatgccctt 1681 cccaagaaga aatccaggaa gtccagtatg cctatcaaaa tagagaaaga agctgtggaa 1741 atagctaatg agaaaagaca caacctcagc tcagatgaag acatgcccct acaggtggtc 1801 agtgaagatg agcaggaggc ctgcagtcct cagtcacaca gagtatctga ggagcagcat 1861 gtacagtcag gaggcttagg gaagcctttc cctgaagggg agaggccctg ccatcgtgaa 1921 tcagtaattg agtccagtgg agccatcagc caaacccctg agcaggccac acacaattca 1981 gagagggaga ctgagcatac accagcattg atcatggtgc caagggaggt cgaggatggt 2041 ggccatgaac actacttcac acctgggatg gaaccccaag ttcctttttc tgactacatg 2101 gaactgcagc agcgcctgct ggctggggga ctcttcagtg ctttgtccaa caggggaatg 2161 gcttttcctt gtcttgaaga ttctaaagaa ctggagcacg tgggtcagca tgcattagca 2221 aggcagatag aagaaaatcg cttccagtgt gacatctgca agaagacctt taaaaatgct 2281 tgtagtgtga aaattcatca caagaatatg catgtcaaag aaatgcacac atgcacagtg 2341 gagggctgta atgctacctt tccctcccgc aggagcagag acagacacag ctcaaaccta 2401 aacctccacc aaaaagcatt gagccaggaa gcattggaga gtagtgaaga tcatttccgt 2461 gcagcttacc ttctgaaaga tgtggctaag gaagcctatc aggatgtggc ttttacacag 2521 caagcctccc agacatctgt catcttcaaa ggaacaagtc gaatgggcag tctggtttac 2581 ccaataacgc aagtccacag tgccagcctg gagagctaca actctggccc cttgagcgag 2641 ggcaccatcc tggatttgag cactacctcg agcatgaagt cagagagtag cagccattct 2701 tcctgggact ctgacggggt gagtgaggaa ggcactgtgc ttatggagga cagtgatggg 2761 aactgtgaag ggtcgagcct tgtccctggg gaagatgagt accccatctg tgtcctgatg 2821 gagaaggctg accagagcct tgctagcctg ccttctgggt tgcccataac ctgtcatctc 2881 tgccaaaaga catacagtaa caaagggacc tttagggccc actacaaaac tgtgcacctc 2941 cggcagctcc acaaatgcaa agtaccaggc tgcaacacca tgttttcgtc tgttcgcagt 3001 cgaaacagac acagccagaa tcccaacctg cacaaaagcc tggcctcatc tccaagtcac 3061 ctccagtaac aagatggcaa accaagtatg ctcagataag cttttttcat aattcaggaa 3121 taaagtagtc catagaaatg tttctgtttc atatcatttg gggcgagtca ggcaaaagta 3181 tttgatttga ctttatagtt ttccacagca caatgagcaa aagacaaacc tcgtgggaag 3241 atgacactgg ggcagccctt cctattattt ttcttagccc aagaggtctt tcactgatac 3301 aaggaaaact tgcagaaatg tgatttttcc cagatttgtt tacatgttcc ctgggacaga 3361 tccaggtctg cagatcgaca ccagtgggcc caggacctgg gggtggcttt aaatgaggct 3421 tgcagtgtta aaggtcttgg ataagaaggg tcctggggaa gaagactctg tggacaagat 3481 accagtcccc aaaacagcat tttcagttcc ttcttcaatt agtttgaaat ccagacctga 3541 gtttggaaga ctgatttttt gagaccatcc ctgtgtttgg agtggataat tgtccctccc 3601 ctcagccctg caccagaggt ctcatatgtt accccaggga gttctcagag gattgggttg 3661 gcctctaaca tgttccttgt taattcttgt tctgtaacat gcattcaaga agctagggga 3721 aaaatatctc atgcacttaa ataatggtct tcaatttaat ttaaaaatat tttgacaata 3781 tttaatttgt gcttatgtgg tgtttggtgt gagtgcagat attgcactgt gtcacctctg 3841 gatctctgct cagaagcaga acaagtgatg acctaaatgt caaaatcact gctcgttttc 3901 atttggtgaa cttcaaactc tgttcttttt ggtcacctgt ggaatgaatg caagcatgat 3961 tttggcagga acatttgtac atattctgcc gtagataatg tggttctgat ggttgttgtg 4021 tattttcagt atcactggat ccctcagtct tcaccgtttt ataaacgtat aagattagga 4081 tgaacttttg aatttacttg gtaggaaaaa aagtaggaca ttattgccat attgtatgtc 4141 ttaatattta acttattcgg aaatatattc cacactgtta catacatttt ccatggtaga 4201 aaggaagttc agtcagtcct gtggaatgaa accatctcct aaaattcagc atttgcagca 4261 ttctaaaagc ctgtgtaggt acaaggacat tgattttgta ttcagaattc aagttaacta 4321 tcttttaaat tcgtggttga tgtaagtaat aaaaaacatt cttaaagttg agggttataa 4381 gagagattat ttctgtggtc taaaggttaa aaagccaaca acctgttacc aattatttca 4441 gctttttttg ttttaataag tgtgacaact taaaacttgt ttctatttaa agtgaaatgt 4501 atctttcaac tgtttagtta cccagctgtt taatattcca gtcttcccaa agtgaaaaga 4561 tttgtataca aatgttttct atgatttaat aaaaatatat ggcacaaaaa accacttcgc 4621 cgggtcgcgc cccgacggcc gggcccggga gacgcgccgg cagccccggc accttgccaa 4681 agtttcaaac ccgggaaaat aaacgtaagc taaggatccc ccccatgtat ccaacctcat 4741 gctctatggg acccaggcca tccccgtgag gttctccaga tcctccatgc cttggacgaa 4801 aggtgttgga tcactggtgc atcatgacac caaatctata gttgaactca tggcaattca 4861 gatatcaggg ctttcatcga gagctgcagt // LOCUS HUMBAT2A 6704 bp mRNA PRI 15-JUN-1990 DEFINITION Human HLA-B-associated transcript 2 (BAT2) mRNA, complete cds. ACCESSION M33509 M31293 NID g179338 KEYWORDS class III gene; major histocompatibility complex; proline-rich protein. SOURCE Human T-cell line HPB-All, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6704) AUTHORS Banerji,J., Sands,J., Strominger,J.L. and Spies,T. TITLE A gene pair from the human major histocompatibility complex encodes large proline-rich proteins with multiple repeated motifs and a single ubiquitin-like domain JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2374-2378 (1990) MEDLINE 90192810 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Banerji, 11-JAN-1990, for release after publication. FEATURES Location/Qualifiers source 1..6704 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA 1..6704 /note="BAT2 mRNA" CDS 102..6530 /note="HLA-B-associated transcript 2 (BAT2)" /codon_start=1 /db_xref="PID:g179339" /translation="MSDRSGPTAKGKDGKKYSSLNLFDTYKGKSLEIQKPACCPSPWP AESRESCHCPAYRPPANLPSLKAENKGNDPNVSLVPKDGTGWASKQEQSDPKSSDAST AQPPESQPLPASQTPASNQPKRPPAAPENTPLVPSGVKSWAQASVTHGAHGDGGRASS LLSRFSREEFPTLQAAGDQDKAAKERESAEQSSGPGPSLRPQNSTTWRDGGGRGPDEL EGPDSKLHHGHDPRGGLQPSGPPQFPPYRGMMPPFMYPPYLPFPPPYGPQGPYRYPTP DGPSRFPRVAGPRGSGPPMRLVEPVGRPSILKEDNLKEFDQLDQENDDGWAGAHEEVD YTEKLKFSDEEDGRDSDEEGAEGHRDSQSASGEERPPEADGKKGNSPNSEPPTPKTAW AETSRPPETEPGPPAPKPPLPPGDYPDRGGPPCKPPAPEDEDEAWRQRRKQSSSEISL AVERARRRREEEERRMQEERRAACAEKLKRLDEKFGAPDKRLKAEPAAPPAAPSTPAP PPAVPKELPAPPAPPPASAPTPETEPEEPAQAPPAQSTPTPGVAAAPTLVSGGGSTSS TSSGSFEASPVEPQLPSKEGPEPPEEVPPPTTPPVPKVEPKGDGIGPTRQPPSQGLGY PKYQKSLPPRFQRQQQEQLLKQQQQHQWQQHQQGSAPPTPVPPSPPQPVTLGAVPAPQ APPPPPKALYPGALGRPPPMPPMNFDPRWMMIPPYVDPRLLQGRPPLEFYPPGVHPSG LVPRERSDSLGLSSEPFDRHAPAMLRERGTPPVDPKLAWVGDVFTATPAEPRPLTSPL RQAADEDDKGMRSETPPVPPPPPYLASYPGFPENGAPGPPISRFPLEEPGPRPLPWPP GSDEVAKIQTPPPKKEPPKEETAQLTGPEAGRKLPASRSGAGPPPPRRESRTETRWGP RPGSSRRGIPPEEPGAPPRRAGPIKKPPPPTKVEELPPKPLEQGDETPKPPKPDPLKI TKGKLGGPKETPPNGNLSPAPRLRRDYSYERVGPTSCRGRGRGEYFARGRGFRGTYGG RGRGGQANSAVTESFEEMMGVEVGQGDQTTLLLPEAAMPARHGARVQSMRKSPSGAGS GAQKQAARPMRVIWLLQTRRLPHPRREHSPRSSRSPTTRSPTLHRAPARFTCPGVGES SLPEGAISPGPRRREAPPQVCPGWSPPAKSLAPKKPPTGPLPPSKEPLKEKLIPGPLS PVARGGSNGGSNVGMEDGERPRRRRHGRAQQQDKPPRFRRLKQERENAARGSEGKPSL TLPASAPGPEEALTTVTVAPAPPRAAAKSPDLSNQNSDQANEEWETASESSDFTSERR GDKEAPPPVLLTPKAVGTPGGGGGGAVPGISAMSRGDLSQRAKDLSKRSFSSQRPGME RQNRRPGPGGKAGSSGSSSGGGGGGPGGRTGPGRGDKRSWPSPKNRSRPPEERPPGLP LPPPPPSSSAVFRLDQVIHSNPAGIQQALAQLSSRQGSVTAPGGHPRHKPGPPQAPQG PSPRPPTRYEPQRVNSGLSSDPHFEEPGPMVRGVGGTPRDSAGVSPFPPKRRERPPRK PELLQEESLPPPHSSGFLGSKPEGPGPQAESRDTGTEALTPHIWNRLHTATSRKSYRP TSMEPWMEPLSPFEDVAGTEMSQSDSGVDLSGDSQVSSGPCSQRSSPDGGLKGAAEGP PKRPGGSSPLNAVPCEGPPGSEPPRRPPPAPHDGDRKELPREQPLPPGPIGTERSQRT DRGTEPGPIRPSHRPGPPVQFGTSDKDSDLRLVVGDSLKAEKELTASVTEAIPVSRDW ELLPSAAASAEPQSKNLDSGHCVPEPSSSGQRLYPEVFYGSAGPSSSQISGGSHGLSI TSKQWRLRPGTPSLHPYRSQPLYLPPGPAPPSALLSGVALKGQFLDFSTMQATELGKL PAGGVLYPPPSFLYSPAFCPSPLPDTSLLQVRQDLPSPSDFYSTPLQPGGQSGFLPSG APAQQMLLPMVDSQLPVVNFGSLPPAPPPAPPPLSLLPVGPALQPPSLAVRPPPAPAT RVLPSPARPFPASLGRAELHPVELKPFQDYQKLSSNLGGPGSSRTPPTGRSFSGLNSR LKATPSTYSGVFRTQRVDLYQQASPPDALRWIPKPWERTGPPPREGPSRRAEEPGSRG DKEPGLPPPR" polyA_signal 6692..6697 BASE COUNT 1435 a 2224 c 1897 g 1148 t ORIGIN Chromosome 6p21.3. 1 cctaggcccg ggtcccggat ccccgcgcac ccggccaggc tctggcacgt tttgggggag 61 gtgcctgcag gacccaacat actcaatgag cttccagcgc aatgtccgat cgctcggggc 121 cgactgccaa gggaaaggat ggaaagaagt attcctcgct caacctgttt gatacgtata 181 agggcaagtc cttagagatc cagaaacccg cctgttgccc ctcgccatgg cctgcagagt 241 ctcgggaaag ttgccattgc ccggcgtatc gacctccagc caaccttcca agcctgaaag 301 ccgagaacaa aggcaatgac cccaatgtct cactagtgcc aaaagacgga acaggatggg 361 caagcaaaca ggagcagtcc gaccccaaga gttccgatgc ctcaaccgct cagccgccgg 421 aatcgcagcc actgccggct tcacagacgc ctgcctccaa ccagccgaaa cgacccccag 481 cagcccccga gaacactcct ttggttccaa gcggggtaaa gtcctgggca caagccagcg 541 tcacccatgg agcacatgga gatggtggaa gggcatcaag cctactgtca cgattctctc 601 gagaggaatt tccgaccctg caggcggctg gcgaccagga caaggctgcc aaggaaaggg 661 agtctgccga acagtcgtct gggcccggac caagcctccg cccccaaaat tctacaactt 721 ggagggacgg aggtgggcgt ggccctgatg agctggaggg cccggactcc aaacttcatc 781 atggtcatga tccccggggt gggctacagc cttcaggccc accccagttc cctccctacc 841 gcggaatgat gccgcctttc atgtatcccc catatctccc gttccctccg ccctatggac 901 cccaggggcc ttaccgatac cccactcctg atgggcccag ccgttttccc cgtgtggcgg 961 gcccccgagg ctcagggcca ccaatgcgct tagtagagcc tgtgggtcgt ccctctattc 1021 tcaaagagga taatctcaaa gagtttgatc agttggatca ggagaatgat gatggttggg 1081 caggggccca tgaagaggtt gactacactg aaaagctcaa gttcagcgat gaggaagatg 1141 ggcgagactc tgatgaggag ggagctgagg gccacaggga ttcccaatca gcttctggtg 1201 aggaacggcc ccctgaagca gatggcaaaa agggcaactc ccccaacagc gaaccgccca 1261 ctcctaagac ggcctgggca gaaacctctc ggcctccaga gacagagccg ggacctcctg 1321 ccccaaagcc tcccctaccc cctggggact acccagatcg tgggggtcct ccctgcaagc 1381 ccccagcacc tgaagatgag gatgaggcat ggcggcagcg acgaaagcag tcgtcatctg 1441 agatttccct ggcagtggag cgggcccggc gacggcgaga agaagaggag cggcgcatgc 1501 aagaagagcg ccgggcagcc tgtgctgaga agctcaagcg actcgatgaa aagtttgggg 1561 cacctgacaa gcggctcaaa gcagagcctg ctgccccacc tgctgcccct tctaccccag 1621 ccccaccacc tgcagtccct aaagaactcc ctgcacctcc agctccacct ccagcatcag 1681 ccccaacacc agagacagaa cctgaagagc cagcacaggc ccctcctgcc caatctactc 1741 ctactccagg tgtggctgcg gctcccactc tggtgagtgg tggtggcagt accagtagca 1801 ccagcagtgg cagcttcgaa gccagcccag tggaaccaca actgccctca aaagagggtc 1861 ctgaaccacc agaagaggtt cctcctccta ccacaccccc agttccaaag gtggaaccca 1921 agggtgatgg gattggtccc acccgccagc cccctagtca gggcttgggc taccccaaat 1981 atcagaagtc gttgcctcct cgtttccagc ggcagcagca ggagcagctc ctgaagcagc 2041 agcagcagca ccagtggcag cagcatcaac agggctctgc ccctcctacc ccagtgcccc 2101 catcaccacc acagcctgtg accctggggg ctgtgccagc tccacaggct ccacccccgc 2161 cccccaaggc cctgtaccca ggtgctctgg gccggccccc acccatgccc ccaatgaact 2221 ttgatccccg atggatgatg attcctcctt atgtggaccc ccggctcctc cagggtcgtc 2281 cccctctaga gttctaccct cctggtgtgc atccctctgg cctagttccc cgagagcgtt 2341 cagacagtct ggggctcagc tcagagccat ttgaccgtca tgcacctgct atgttacggg 2401 aacggggcac tccaccggtg gatccaaagt tggcctgggt aggagatgtc ttcaccgcca 2461 cacccgctga accccgccca cttacctcac ctctgcgcca ggctgcggat gaggatgaca 2521 aggggatgag gagcgagact cctccagtac ctcccccacc accctatctg gccagttatc 2581 caggctttcc tgagaatgga gcccctgggc ccccaatctc tcgctttcct ctggaggaac 2641 cagggccccg tccactcccc tggcccccag gcagtgatga agtggccaag atacaaactc 2701 caccacccaa gaaggagccc cctaaggagg agactgcaca gctgacgggg ccagaagcag 2761 gccgaaagct gcccgcgagt cggagtggag caggcccccc accaccacgc agagagagtc 2821 gcacagagac ccgctggggc cctcgtccag ggagcagtcg tcgtggaatc cctccagagg 2881 agccaggggc cccaccccgc cgggctgggc ctataaagaa acctccacca cctacaaaag 2941 tagaagagct gcctcccaag cccctcgaac agggggatga aacccccaaa cccccaaagc 3001 cagacccact caagataacc aaggggaagc tagggggccc caaggagacc ccacccaatg 3061 gaaatctttc ccctgcccca aggcttcgga gggactattc gtatgaaaga gtgggtccta 3121 cctcttgccg gggtcggggc cgaggcgagt attttgccag agggaggggt tttcggggga 3181 cctatggggg acgagggcgg ggaggccaag cgaattccgc agttaccgag agtttcgagg 3241 agatgatggg cgtggaggtg ggacaggggg accaaaccac cctcctgctc cccgaggccg 3301 ccatgccagc gagacacgga gcgagggttc agagtatgag gaaatcccca agcggtgccg 3361 gcagcggggc tcagaaacag gcagcgagac ccatgagagt gatctggctc cttcagacaa 3421 ggaggctccc acacccaagg agggaacact cacccaggtc ctctcgctcc cccaccacca 3481 ggagccccac ccttcaccga gcgccagccc gcttcacgtg cccgggggtc ggcgagtctt 3541 cactcccaga gggtgccatc tcgccggggc cgaggaggag ggaggcccct cctcaagttt 3601 gcccaggctg gagccctcca gccaagtctc tggctcccaa gaaacctccc acaggccctt 3661 tgccaccaag taaggagcct ttgaaagaga agttgatccc agggcctctg tcccctgtgg 3721 cgcgcggagg cagcaatgga ggtagcaatg tgggcatgga agatggggag cgaccccgaa 3781 ggaggcgaca tgggagggct cagcagcagg ataaaccgcc tcgtttccgg aggctgaagc 3841 aggaacggga gaatgccgca agggggtctg agggcaagcc ctccctaacc cttccagcct 3901 ccgctcctgg acctgaggag gccctcacaa cagtcacagt ggccccagca cctccgcggg 3961 cagctgccaa gtctcctgat ctgtcaaacc agaactcaga ccaagccaat gaggaatggg 4021 agactgcatc agagagcagt gacttcacca gtgagcgccg aggggacaaa gaggcacccc 4081 caccagtact gctgacaccc aaggctgtgg gaactcctgg gggaggtgga ggtggagccg 4141 taccaggtat ttcagccatg tcccgcggag atctgagcca gagagccaag gatttgagta 4201 aacggagctt ctcaagtcag cggccaggca tggaacggca gaatcggcgc cctggcccag 4261 ggggcaaggc tggcagcagt ggcagcagca gtggaggagg cggtgggggt cctggaggaa 4321 ggaccgggcc aggacgaggc gacaagagga gctggccctc tcccaagaac cgaagtcgtc 4381 ctccagagga gcgtcccccg gggcttcccc tgcctccccc acctcccagc agttctgctg 4441 tcttccgcct ggaccaagtt atccacagca accctgctgg catccaacag gctctggccc 4501 agcttagtag ccgtcaaggg agtgtaactg caccaggggg tcatccaagg cacaagcctg 4561 ggcctcccca agcccctcag ggcccctctc ctaggccccc aacccgatac gagccccaga 4621 gggtcaacag cggcctcagt tctgaccccc actttgagga gccggggcca atggtgagag 4681 gggtgggtgg gactcctcgg gactctgccg gggttagtcc ctttccccct aaacgtcggg 4741 agcggcctcc cagaaaacca gagctgctac aggaggaatc tttgccacct cctcatagct 4801 ctggattctt gggctctaag cctgagggcc caggccctca ggcagagtcc agagatacag 4861 gcacagaggc cctgacccct cacatctgga accgtttaca tactgccact agccgaaaga 4921 gttaccggcc cacgtccatg gagccttgga tggagcccct gagtcctttt gaggatgtgg 4981 ctggcacaga aatgagtcag tctgacagtg gggtggacct gagtggggat tctcaggtgt 5041 catcaggtcc ctgcagccag cgaagttccc ctgatggagg actcaagggg gcagcagagg 5101 gaccccccaa gaggcctgga ggctcctcac ccctgaatgc tgttccttgt gagggtccac 5161 ctggctctga acctcctagg agaccaccac ctgcccccca cgatggggac agaaaggagc 5221 tgccccggga gcagcctctg ccccctggcc ccattggcac agaacgatca cagcgtacag 5281 accgaggcac agagcctggc cccattcggc catcccatcg acctggtccc ccagtccagt 5341 ttggcactag tgacaaggac tcagacttac gcctagtggt aggagacagc ttgaaagcag 5401 agaaggagct aacagcatca gtcactgagg ccattcctgt atcacgagac tgggagctgc 5461 ttcccagtgc tgctgcctct gctgagccac aatccaagaa cctggattct gggcactgtg 5521 tcccggagcc cagctcctca ggccagcgcc tgtatcctga ggttttctat ggcagtgctg 5581 ggccttccag ttctcagatc tctgggggga gccatggact ctcaattaca tccaaacagt 5641 ggaggcttcg ccctgggaca ccctcactgc acccttacag atcacagccc ctatacctac 5701 ccccgggccc agcccctccc tcagcactgc tctctggggt agctctcaag ggccagtttc 5761 tggatttctc cacaatgcaa gctacagagc tggggaagtt gccggctgga ggagttctct 5821 accctccacc ttccttcctc tactctccgg ctttctgccc cagtcctttg cctgacacat 5881 cgttgcttca ggtacgccag gatctgccat ccccttcgga tttttattct actcctctgc 5941 agcctggtgg ccaaagtggc tttctccctt caggggctcc tgcccagcag atgcttctac 6001 ccatggtaga ctcacagctg cctgtggtga actttggctc cctgccgcca gcaccacctc 6061 ctgccccacc tcccctttct ctgttacctg tgggccctgc tctgcagccc cccagcctgg 6121 ctgtgcggcc cccacctgct cctgctactc gggtgctgcc ttcacctgcc aggcccttcc 6181 ccgctagctt ggggcgagca gagctgcatc cagtggaact aaagccgttc caggattatc 6241 aaaaactgag cagcaacctt gggggacctg gatcatcacg gactccccca actggaaggt 6301 ccttctctgg cctcaattcc cgtctcaagg ccacgccttc cacctacagt ggagtcttcc 6361 gcacccagcg cgtcgacctt taccagcagg cctccccacc agatgccctg cgctggatac 6421 ctaagccttg ggagcggaca gggccgccac ctcgagaagg gccctcccga cgggcagagg 6481 agcctgggtc ccgaggggac aaggagcctg ggttgccccc accccgctga gggagttcct 6541 cttgccccct acccccgggg cttgtatata gattataaat atataagggg gaaaggggtg 6601 ggcggggagg ggttgtgggg ctggggcctc acttcccctc ctcccccttc ccctggtccc 6661 ctgtccctgg ggctgtttgt taaaaaagag taataaaagg attt // LOCUS HUMBAT3A 3740 bp mRNA PRI 15-JUN-1990 DEFINITION Human HLA-B-associated transcript 3 (BAT3) mRNA, complete cds. ACCESSION M33519 M31294 NID g179346 KEYWORDS class III gene; major histocompatibility complex; proline-rich protein. SOURCE Human T-cell line HPB-All, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3740) AUTHORS Banerji,J., Sands,J., Strominger,J.L. and Spies,T. TITLE A gene pair from the human major histocompatibility complex encodes large proline-rich proteins with multiple repeated motifs and a single ubiquitin-like domain JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2374-2378 (1990) MEDLINE 90192810 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Banerji, 11-JAN-1990, for release after publication. FEATURES Location/Qualifiers source 1..3740 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA 1..3740 /note="BAT3 mRNA" CDS 250..3648 /note="HLA-B-associated transcript 3 (BAT3)" /codon_start=1 /db_xref="PID:g179347" /translation="MEPNDSTSTAVEEPDSLEVLVKTLDSQTRTFIVGAQMNVKEFKE HIRASVSIPSEKQRLIYQGRVLQDDKKLQEYNVGGKVIHLVERAPPQTHLPSGASSGT GSASATHGGGSPPGTRGPGASVHDRNANSYVMVGTFNLPSDGSAVDVHINMEQAPIQS EPRVRLVMAQHMIRDIQTLLSRMETLPYLQCRGGPQPQHSQPPPQPPAVTPEPVALSS QTSEPVESEAPPREPMEAEEVEERAPAQNPELTPGPAPAGPTPAPETNAPNHPSPAEY VEVLQELQRLESRLQPFLQRYYEVLGAAATTDYNNNHEGREEDQRLINLVGESLRLLG NTFVALSDLRCNLACTPPRHLHVVRPMSHYTTPMVLQQAAIPIQINVGTTVTMTGNGT RPPPTPNAEAPPPGPGQASSVAPSSTNVESSAEGAPPPGPAPPPATSHPRVIRISHQS VEPVVMMHMNIQDSGTQPGGVPSAPTGPLGPPGHGQTLGQQVPGFPTAPTRVVIARPT PPQARPSHPGGPPVSGTLQGAGLGTNASLAQMVSGLVGQLLMQPVLVAQGTPGMAPPP APATASASAGTTNTATTAGPAPGGPAQPPPTPQPSMADLQFSQLLGNLLGPAGPGAGG PGVASPTITVAMPGVPAFLQGMTDFLQATQTAPPPPPPPPPPPPAPEQQTMPPPGSPS GGAGSPGGLGLESLSPEFFTSVVQGVLSSLLGSLGARAGSSESIAAFIQRLSGSSNIF EPGADGALGFFGALLSLLCQNFSMVDVVMLLHGHFQPLQRLQPQLRSFFHQHYLGGQE PTPSNIRMATHTLITGLEEYVRESFSLVQVQPGVDIIRTNLEFLQEQFNSIAAHVLHC TDSGFGARLLELCNQGLFECLALNLHCLGGQQMELAAVINGRIRRMSRGVNPSLVSWL TTMMGLRLQVVLEHMPVGPDAILRYVRRVGDPPQPLPEEPMEVQGAERASPEPQRENA SPAPGTTAEEAMSRGPPPAPEGGSRDEQDGASAETEPWAAAVPPEWVPIIQQDIQSQR KVKPQPPLSDAYLSGMPAKRRKTMQGEGPQLLLSEAVSRAAKAAGARPLTSPESLSRD LEAPEVQESYRQQLRSDIQKRLQEDPNYSPQRFPNAQRAFADDP" BASE COUNT 744 a 1182 c 1057 g 757 t ORIGIN Chromosome 6p21.3. 1 ggcgacagcg gtggcggctc ctcggggtgc tcggctccct cccacctagg ccggccccgg 61 cccgactcgc cctcagaaac tcactgtttg gggctgcgga ctttctcgtc gtgccccaca 121 aaagtaaagc ttggggacct ggggggagcc ggaagtatcg cttcgagatc cccaaatact 181 atcggggaaa cggaagtggc cgtcggtggc aggtttgggg gagaccggaa gtgacgagac 241 ctgtcggcca tggagcctaa tgatagtacc agtaccgctg tggaggagcc tgacagcttg 301 gaggtgttgg tgaagacctt ggactctcaa actcgtacct ttattgtggg ggcccagatg 361 aatgtaaaag agtttaagga gcacattcgt gcctctgtca gcatcccatc tgaaaaacaa 421 cggctcattt accagggacg agttctgcaa gatgataaga agcttcagga atacaatgtt 481 gggggaaagg ttatccacct ggtggaacgg gctcctcctc agactcacct cccttctggg 541 gcatcttctg ggacggggtc tgcctcagcc actcatggtg ggggatcccc ccctggtact 601 cgggggcctg gggcctctgt tcatgaccgg aatgccaaca gctatgtcat ggttggaacc 661 ttcaatcttc ctagtgacgg ctctgctgtg gatgttcaca tcaacatgga acaggccccg 721 attcagagtg agccccgggt acggctggtg atggctcagc acatgatcag ggatatacag 781 accttactat cccggatgga gactctcccc taccttcagt gtcgaggagg gccccaaccg 841 cagcacagtc agccgccccc gcagccaccg gctgtgaccc cggagccagt agccttgagc 901 tctcaaacat cagaaccagt tgaaagtgaa gcacctcccc gggagcccat ggaggcagaa 961 gaagtggagg agcgtgcccc agcccagaac ccggagctca ctcctggccc agccccagcg 1021 ggcccaacac ctgccccgga aacaaatgca cccaaccatc cttcccctgc ggagtatgtc 1081 gaggtgctcc aggagctaca gcggctggag agtcgcctcc agcccttctt gcagcgctac 1141 tacgaggttc tgggtgctgc tgccaccacg gactacaata acaatcacga gggccgggag 1201 gaggatcagc ggttgatcaa cttggtaggg gagagcctgc gactgctggg caacaccttt 1261 gttgcactgt ctgacctgcg ctgcaatctg gcctgcacgc ccccacgaca cctgcatgtg 1321 gtccggccta tgtctcacta caccaccccc atggtgctcc agcaggcagc cattcccata 1381 cagatcaatg tgggaaccac tgtgaccatg acaggaaatg ggactcggcc ccccccaact 1441 cccaatgcag aggcacctcc ccctggtcct gggcaggcct catccgtggc tccgtcttct 1501 accaatgtcg agtcctcagc tgagggggct cccccgccag gtccagctcc cccgccagcc 1561 accagccacc cgagggtcat ccggatttcc caccagagtg tggaacccgt ggtcatgatg 1621 cacatgaaca ttcaagattc tggcacacag cctggtggtg ttccgagtgc tcccactggc 1681 cccctgggac cccctggtca tggccaaacc ctgggacagc aggtgccagg cttcccaaca 1741 gctccaaccc gggtggtgat tgcccggccc actcctccac aggctcggcc ttcccatcct 1801 ggagggcccc cagtctctgg gacactgcag ggcgccggtc tgggtaccaa tgcctcgttg 1861 gcccagatgg tgagcggcct tgtggggcag cttcttatgc agccagtcct tgtggctcag 1921 gggaccccag gtatggctcc accgccagcc cctgccactg cttctgccag tgctggcacc 1981 accaacacag ctaccacagc tggccccgct cctggggggc ctgcccagcc tccacccacc 2041 cctcaaccct ccatggctga tcttcagttc tctcagcttc tggggaacct gctagggcct 2101 gcagggccag gggctggagg gcctggtgtg gcttctccca ccatcactgt ggcgatgcct 2161 ggtgtccctg cctttctcca aggcatgact gacttcttgc aggcaacaca gacagcccct 2221 ccaccacccc cacctcctcc acccccacca cctgccccag agcagcagac catgccccca 2281 ccaggctccc cttctggtgg cgcagggagt cctggaggcc tgggtcttga gagcctgtca 2341 ccggagtttt ttacctcagt ggtgcagggt gtgctcagct ccctgctggg ctccctgggg 2401 gctcgggctg gcagcagtga aagtattgct gccttcatac aacgcctcag tggatccagc 2461 aacatctttg agcctggagc tgatggggcc cttggattct ttggggcctt gctttctctt 2521 ctgtgccaga acttctctat ggtggacgta gtgatgcttc tccatgggca tttccagcca 2581 ctacaacggc tccagcccca gctgcgatcc ttcttccacc agcactacct gggtggtcag 2641 gagcccacac ccagtaacat ccggatggca acccacacat tgatcacggg gctagaagag 2701 tatgtgcggg agagtttttc cttggtgcag gttcagccag gtgtggacat catccggaca 2761 aacctggaat ttctccaaga gcagtttaat agcattgctg cgcatgtgct gcattgcaca 2821 gatagtggat ttggggcccg gttgctggag ttgtgtaacc aaggcctgtt tgaatgcctg 2881 gccctaaacc tgcactgctt ggggggacag cagatggagc ttgctgctgt tatcaatggc 2941 cgaattcgtc gtatgtctcg tggggtgaat ccctccttgg tgagctggct gaccactatg 3001 atgggactga ggcttcaggt ggtactggag cacatgcctg taggccctga tgccattctc 3061 agatacgttc gcagggttgg tgatcccccc cagccacttc ctgaggagcc aatggaagtt 3121 cagggagcag aaagagcttc ccctgagcct cagcgggaga atgcttcccc agcccctgga 3181 acaacagcag aagaggccat gtcccgaggt ccacctcctg ctcctgaggg gggctcccgg 3241 gatgaacagg atggagcttc agctgagaca gaaccttggg cagctgcagt ccccccagaa 3301 tgggtcccta ttatccagca ggacattcag agccagcgga aggtgaaacc gcagccccct 3361 ctgagtgatg cctacctcag tggtatgcct gccaagagac gcaagacgat gcagggtgag 3421 ggcccccagc tgcttctctc agaggctgtg agccgggcag ctaaggcagc cggagctcgg 3481 cccctgacga gccccgagag cctgagccgg gacctggagg caccagaggt tcaggagagc 3541 tacaggcagc agctccggtc tgatatacaa aaacgactgc aggaagaccc caactacagt 3601 ccccagcgct tccccaatgc ccagcgggcc tttgctgatg atccttagct ctttgctcta 3661 tggcccttcc tcatcagggg accgtttccc ccctcttcct tcacagtatt taagaaataa 3721 aagtcggatt ttttctggcc // LOCUS HUMBAXG 126 bp mRNA PRI 15-DEC-1993 DEFINITION Human Bax gamma mRNA, complete cds. ACCESSION L22475 NID g388169 KEYWORDS . SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 126) AUTHORS Oltvai,Z.N., Milliman,C.L. and Korsmeyer,S.J. TITLE Bcl-2 heterodimerizes in vivo with a conserved homolog, Bax, that accelerates programmed cell death JOURNAL Cell 74 (4), 609-619 (1993) MEDLINE 93364978 FEATURES Location/Qualifiers source 1..126 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Namalwa and UOC-B1" /cell_type="B cell" gene 1..126 /gene="bax" exon <1..34 /gene="bax" /note="putative" CDS 1..126 /gene="bax" /codon_start=1 /product="Bax gamma" /db_xref="PID:g388170" /translation="MDGSGEQPRGGVSSRIEQGEWGGRHPSWPWTRCLRMRPPRS" exon 35..>126 /gene="bax" /note="putative" BASE COUNT 25 a 34 c 52 g 15 t ORIGIN 1 atggacgggt ccggggagca gcccagaggc ggggtttcat ccaggatcga gcagggcgaa 61 tgggggggga ggcacccgag ctggccctgg acccggtgcc tcaggatgcg tccaccaaga 121 agctga // LOCUS HUMBCKDHA 1339 bp mRNA PRI 31-OCT-1994 DEFINITION Human branched chain alpha-keto acid dehydrogenase (BCKDHB) E1-beta subunit mRNA, complete cds. ACCESSION M55575 NID g179361 KEYWORDS branched chain alpha-keto acid dehydrogenase E1-beta subunit. SOURCE Human placenta, cDNA to mRNA, clone lambda-hBE-1-beta-1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1339) AUTHORS Nobukuni,Y., Mitsubuchi,H., Endo,F., Akaboshi,I., Asaka,J. and Matsuda,I. TITLE Maple syrup urine disease. Complete primary structure of the E1 beta subunit of human branched chain alpha-ketoacid dehydrogenase complex deduced from the nucleotide sequence and a gene analysis of patients with this disease JOURNAL J. Clin. Invest. 86 (1), 242-247 (1990) MEDLINE 90307967 FEATURES Location/Qualifiers source 1..1339 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /map="6p22-p21" mRNA <1..1339 /gene="BCKDHB" /note="G00-118-759" gene 1..1339 /gene="BCKDHB" CDS 5..1183 /gene="BCKDHB" /codon_start=1 /db_xref="GDB:G00-118-759" /product="branched chain alpha-keto acid dehydrogenase E1-beta subunit" /db_xref="PID:g179362" /translation="MAVVAAAAGWLLRLRAAGAEGHWRRLPGAGLARGFLHPAATVED AAQRRQVAHFTFQPDPEPREYGQTQKMNLFQSVTSALDNSLAKDPTAVIFGEDVAFGG VFRCTVGLRDKYGKDRVFNTPLCEQGIVGFGIGIAVTGATAIAEIQFADYIFPAFDQI VNEAAKYRYRSGDLFNCGSLTIRSPWGCVGHGALYHSQSPEAFFAHCPGIKVVIPRSP FQAKGLLLSCIEDKNPCIFFEPKILYRAAAEEVPIEPYNIPLSQAEVIQEGSDVTLVA WGTQVHVIREVASMAKEKLGVSCEVIDLRTIIPWDVDTICKSVIKTGRLLISHEAPLT GGFASEISSTVQEECFLNLEAPISRVCGYDTPFPHIFEPFYIPDKWKCYDALRKMINY " sig_peptide 5..154 /gene="BCKDHB" /note="G00-118-759" mat_peptide 155..1180 /gene="BCKDHB" /note="G00-118-759" /product="branched chain alpha-keto acid dehydrogenase E1-beta subunit" polyA_signal 1327..1332 /gene="BCKDHB" /note="G00-118-759" polyA_site 1339 /gene="BCKDHB" /note="G00-118-759" BASE COUNT 344 a 285 c 344 g 366 t ORIGIN 1 ggggatggcg gttgtagcgg cggctgccgg ctggctactc aggctcaggg cggcaggggc 61 tgaggggcac tggcgtcggc ttcctggcgc ggggctggcg cggggctttt tgcaccccgc 121 cgcgactgtc gaggatgcgg cccagaggcg gcaggtggct cattttactt tccagccaga 181 tccggagccc cgggagtacg ggcaaactca gaaaatgaat cttttccagt ctgtaacaag 241 tgccttggat aactcattgg ccaaagatcc tactgcagta atatttggtg aagatgttgc 301 ctttggtgga gtctttagat gcactgttgg cttgcgagac aaatatggaa aagatagagt 361 ttttaatacc ccattgtgtg aacaaggaat tgttggattt ggaatcggaa ttgcggtcac 421 tggagctact gccattgcgg aaattcagtt tgcagattat attttccctg catttgatca 481 gattgttaat gaagctgcca agtatcgcta tcgctctggg gatcttttta actgtggaag 541 cctcactatc cggtcccctt ggggctgtgt tggtcatggg gctctctatc attctcagag 601 tcctgaagca ttttttgccc attgcccagg aatcaaggtg gttataccca gaagcccttt 661 ccaggccaaa ggacttcttt tgtcatgcat agaggataaa aatccttgta tattttttga 721 acctaaaata ctttacaggg cagcagcgga agaagtccct atagaaccat acaacatccc 781 actgtcccag gccgaagtca tacaggaagg gagtgatgtt actctagttg cctggggcac 841 tcaggttcat gtgatccgag aggtagcttc catggcaaaa gaaaagcttg gagtgtcttg 901 tgaagtcatt gatctgagga ctataatacc ttgggatgtg gacacaattt gtaagtctgt 961 gatcaaaaca gggcgactgc taatcagtca cgaggctccc ttgacaggcg gctttgcatc 1021 ggaaatcagc tctacagttc aggaggaatg tttcttgaac ctagaggctc ctatatcaag 1081 agtatgtggt tatgacacac catttcctca catttttgaa ccattctaca tcccagacaa 1141 atggaagtgt tatgatgccc ttcgaaaaat gatcaactat tgaccatata ggtaggtatg 1201 catcttgaga aagctactat gtgcccctga cattaacgta ctgttaacca agacacagca 1261 atcatcagtg ttttgatggt aacaaacttt gatggtaaag ttgataaaag gcaactttca 1321 gaagaaaata atgtgcttt // LOCUS HUMBCL2A 5086 bp mRNA PRI 31-OCT-1994 DEFINITION Human B-cell leukemia/lymphoma 2 (bcl-2) proto-oncogene mRNA encoding bcl-2-alpha protein, complete cds. ACCESSION M13994 NID g179366 KEYWORDS alternative splicing; bcl-2-alpha protein; proto-oncogene. SOURCE Human pre-B-cell leukemia cell line 380, cDNA to mRNA, clones B[3,4,10]; and DNA, clone lambda-18-27. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5086) AUTHORS Tsujimoto,Y. and Croce,C.M. TITLE Analysis of the structure, transcripts, and protein products of bcl-2, the gene involved in human follicular lymphoma JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83 (14), 5214-5218 (1986) MEDLINE 86259760 COMMENT Clean copy sequence for [1] kindly provided by Y.Tsujimoto, 10-FEB-1987. The bcl-2 gene is transcribed by alternative splicing into three mRNAs of different sizes. It consists of at least two exons and encodes two proteins which only differ at their carboxy-terminal ends, and it is activated by translocation into poximity with the Ig heavy chain locus. Both the normal and rearranged bcl-2 gene products are expressed in the B-cell leukemia/lymphoma 2 cells. Genomic clone lambda-18-27 contained all the DNA sequences on the 5' of the splice site (position 2044). FEATURES Location/Qualifiers source 1..5086 /organism="Homo sapiens" /db_xref="taxon:9606" /map="18q21.3" mRNA 1..5086 /note="bcl2a mRNA" gene 1459..2178 /gene="BCL2" CDS 1459..2178 /gene="BCL2" /note="bcl2-alpha protein" /codon_start=1 /db_xref="GDB:G00-119-031" /db_xref="PID:g179367" /translation="MAHAGRTGYDNREIVMKYIHYKLSQRGYEWDAGDVGAAPPGAAP APGIFSSQPGHTPHPAASRDPVARTSPLQTPAAPGAAAGPALSPVPPVVHLALRQAGD DFSRRYRGDFAEMSSQLHLTPFTARGRFATVVEELFRDGVNWGRIVAFFEFGGVMCVE SVNREMSPLVDNIALWMTEYLNRHLHTWIQDNGGWDAFVELYGPSMRPLFDFSWLSLK TLLSLALVGACITLGAYLSHK" BASE COUNT 1262 a 1224 c 1287 g 1313 t ORIGIN 710 bp upstream of SstI site. 1 gcgcccgccc ctccgcgccg cctgcccgcc cgcccgccgc gctcccgccc gccgctctcc 61 gtggccccgc cgcgctgccg ccgccgccgc tgccagcgaa ggtgccgggg ctccgggccc 121 tccctgccgg cggccgtcag cgctcggagc gaactgcgcg acgggaggtc cgggaggcga 181 ccgtagtcgc gccgccgcgc aggaccagga ggaggagaaa gggtgcgcag cccggaggcg 241 gggtgcgccg gtggggtgca gcggaagagg gggtccaggg gggagaactt cgtagcagtc 301 atccttttta ggaaaagagg gaaaaaataa aaccctcccc caccacctcc ttctccccac 361 ccctcgccgc accacacaca gcgcgggctt ctagcgctcg gcaccggcgg gccaggcgcg 421 tcctgccttc atttatccag cagcttttcg gaaaatgcat ttgctgttcg gagtttaatc 481 agaagacgat tcctgcctcc gtccccggct ccttcatcgt cccatctccc ctgtctctct 541 cctggggagg cgtgaagcgg tcccgtggat agagattcat gcctgtgtcc gcgcgtgtgt 601 gcgcgcgtat aaattgccga gaaggggaaa acatcacagg acttctgcga ataccggact 661 gaaaattgta attcatctgc cgccgccgct gccaaaaaaa aactcgagct cttgagatct 721 ccggttggga ttcctgcgga ttgacatttc tgtgaagcag aagtctggga atcgatctgg 781 aaatcctcct aatttttact ccctctcccc ccgactcctg attcattggg aagtttcaaa 841 tcagctataa ctggagagtg ctgaagattg atgggatcgt tgccttatgc atttgttttg 901 gttttacaaa aaggaaactt gacagaggat catgctgtac ttaaaaaata caagtaagtc 961 tcgcacagga aattggttta atgtaacttt caatggaaac ctttgagatt ttttacttaa 1021 agtgcattcg agtaaattta atttccaggc agcttaatac attgttttta gccgtgttac 1081 ttgtagtgtg tatgccctgc tttcactcag tgtgtacagg gaaacgcacc tgatttttta 1141 cttattagtt tgttttttct ttaacctttc agcatcacag aggaagtaga ctgatattaa 1201 caatacttac taataataac gtgcctcatg aaataaagat ccgaaaggaa ttggaataaa 1261 aatttcctgc gtctcatgcc aagagggaaa caccagaatc aagtgttccg cgtgattgaa 1321 gacaccccct cgtccaagaa tgcaaagcac atccaataaa atagctggat tataactcct 1381 cttctttctc tgggggccgt ggggtgggag ctggggcgag aggtgccgtt ggcccccgtt 1441 gcttttcctc tgggaaggat ggcgcacgct gggagaacgg ggtacgacaa ccgggagata 1501 gtgatgaagt acatccatta taagctgtcg cagaggggct acgagtggga tgcgggagat 1561 gtgggcgccg cgcccccggg ggccgccccc gcaccgggca tcttctcctc ccagcccggg 1621 cacacgcccc atccagccgc atcccgcgac ccggtcgcca ggacctcgcc gctgcagacc 1681 ccggctgccc ccggcgccgc cgcggggcct gcgctcagcc cggtgccacc tgtggtccac 1741 ctggccctcc gccaagccgg cgacgacttc tcccgccgct accgcggcga cttcgccgag 1801 atgtccagcc agctgcacct gacgcccttc accgcgcggg gacgctttgc cacggtggtg 1861 gaggagctct tcagggacgg ggtgaactgg gggaggattg tggccttctt tgagttcggt 1921 ggggtcatgt gtgtggagag cgtcaaccgg gagatgtcgc ccctggtgga caacatcgcc 1981 ctgtggatga ctgagtacct gaaccggcac ctgcacacct ggatccagga taacggaggc 2041 tgggatgcct ttgtggaact gtacggcccc agcatgcggc ctctgtttga tttctcctgg 2101 ctgtctctga agactctgct cagtttggcc ctggtgggag cttgcatcac cctgggtgcc 2161 tatctgagcc acaagtgaag tcaacatgcc tgccccaaac aaatatgcaa aaggttcact 2221 aaagcagtag aaataatatg cattgtcagt gatgtaccat gaaacaaagc tgcaggctgt 2281 ttaagaaaaa ataacacaca tataaacatc acacacacag acagacacac acacacacaa 2341 caattaacag tcttcaggca aaacgtcgaa tcagctattt actgccaaag ggaaatatca 2401 tttatttttt acattattaa gaaaaaagat ttatttattt aagacagtcc catcaaaact 2461 ccgtctttgg aaatccgacc actaattgcc aaacaccgct tcgtgtggct ccacctggat 2521 gttctgtgcc tgtaaacata gattcgcttt ccatgttgtt ggccggatca ccatctgaag 2581 agcagacgga tggaaaaagg acctgatcat tggggaagct ggctttctgg ctgctggagg 2641 ctggggagaa ggtgttcatt cacttgcatt tctttgccct gggggcgtga tattaacaga 2701 gggagggttc ccgtgggggg aagtccatgc ctccctggcc tgaagaagag actctttgca 2761 tatgactcac atgatgcata cctggtggga ggaaaagagt tgggaacttc agatggacct 2821 agtacccact gagatttcca cgccgaagga cagcgatggg aaaaatgccc ttaaatcata 2881 ggaaagtatt tttttaagct accaattgtg ccgagaaaag cattttagca atttatacaa 2941 tatcatccag taccttaaac cctgattgtg tatattcata tattttggat acgcaccccc 3001 caactcccaa tactggctct gtctgagtaa gaaacagaat cctctggaac ttgaggaagt 3061 gaacatttcg gtgacttccg atcaggaagg ctagagttac ccagagcatc aggccgccac 3121 aagtgcctgc ttttaggaga ccgaagtccg cagaacctac ctgtgtccca gcttggaggc 3181 ctggtcctgg aactgagccg ggccctcact ggcctcctcc agggatgatc aacagggtag 3241 tgtggtctcc gaatgtctgg aagctgatgg atggagctca gaattccact gtcaagaaag 3301 agcagtagag gggtgtggct gggcctgtca ccctggggcc ctccaggtag gcccgttttc 3361 acgtggagca taggagccac gacccttctt aagacatgta tcactgtaga gggaaggaac 3421 agaggccctg ggccttccta tcagaaggac atggtgaagg ctgggaacgt gaggagaggc 3481 aatggccacg gcccattttg gctgtagcac atggcacgtt ggctgtgtgg ccttggccac 3541 ctgtgagttt aaagcaaggc tttaaatgac tttggagagg gtcacaaatc ctaaaagaag 3601 cattgaagtg aggtgtcatg gattaattga cccctgtcta tggaattaca tgtaaaacat 3661 tatcttgtca ctgtagtttg gttttatttg aaaacctgac aaaaaaaaag ttccaggtgt 3721 ggaatatggg ggttatctgt acatcctggg gcattaaaaa aaaatcaatg gtggggaact 3781 ataaagaagt aacaaaagaa gtgacatctt cagcaaataa actaggaaat ttttttttct 3841 tccagtttag aatcagcctt gaaacattga tggaataact ctgtggcatt attgcattat 3901 ataccattta tctgtattaa ctttggaatg tactctgttc aatgtttaat gctgtggttg 3961 atatttcgaa agctgcttta aaaaaataca tgcatctcag cgtttttttg tttttaattg 4021 tatttagtta tggcctatac actatttgtg agcaaaggtg atcgttttct gtttgagatt 4081 tttatctctt gattcttcaa aagcattctg agaaggtgag ataagccctg agtctcagct 4141 acctaagaaa aacctggatg tcactggcca ctgaggagct ttgtttcaac caagtcatgt 4201 gcatttccac gtcaacagaa ttgtttattg tgacagttat atctgttgtc cctttgacct 4261 tgtttcttga aggtttcctc gtccctgggc aattccgcat ttaattcatg gtattcagga 4321 ttacatgcat gtttggttaa acccatgaga ttcattcagt taaaaatcca gatggcgaat 4381 gaccagcaga ttcaaatcta tggtggtttg acctttagag agttgcttta cgtggcctgt 4441 ttcaacacag acccacccag agccctcctg ccctccttcc gcgggggctt tctcatggct 4501 gtccttcagg gtcttcctga aatgcagtgg tcgttacgct ccaccaagaa agcaggaaac 4561 ctgtggtatg aagccagacc tccccggcgg gcctcaggga acagaatgat cagacctttg 4621 aatgattcta atttttaagc aaaatattat tttatgaaag gtttacattg tcaaagtgat 4681 gaatatggaa tatccaatcc tgtgctgcta tcctgccaaa atcattttaa tggagtcagt 4741 ttgcagtatg ctccacgtgg taagatcctc caagctgctt tagaagtaac aatgaagaac 4801 gtggacgttt ttaatataaa gcctgttttg tcttttgttg ttgttcaaac gggattcaca 4861 gagtatttga aaaatgtata tatattaaga ggtcacgggg gctaattgct agctggctgc 4921 cttttgctgt ggggttttgt tacctggttt taataacagt aaatgtgccc agcctcttgg 4981 ccccagaact gtacagtatt gtggctgcac ttgctctaag agtagttgat gttgcatttt 5041 ccttattgtt aaaaacatgt tagaagcaat gaatgtatat aaaagc // LOCUS HUMBCL3AA 1813 bp mRNA PRI 31-OCT-1994 DEFINITION Human B-cell lymphoma 3-encoded protein (bcl-3) mRNA, complete cds. ACCESSION M31732 NID g179375 KEYWORDS lymphoma 3-encoded protein. SOURCE Human lymphocyte Louckes cell line (Burkitt's lymphoma), cDNA to mRNA, clone cLK2. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1813) AUTHORS Ohno,H., Takimoto,G. and McKeithan,T.W. TITLE The candidate proto-oncogene bcl-3 is related to genes implicated in cell lineage determination and cell cycle control JOURNAL Cell 60 (6), 991-997 (1990) MEDLINE 90199880 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.McKeithan, 30-JAN-1990. FEATURES Location/Qualifiers source 1..1813 /organism="Homo sapiens" /db_xref="taxon:9606" /map="19q13.1-q13.2" mRNA <1..1813 /note="bcl-3 mRNA" gene 42..1382 /gene="BCL3" CDS 42..1382 /gene="BCL3" /note="lymphoma 3-encoded protein (bcl-3)" /codon_start=1 /db_xref="GDB:G00-120-561" /db_xref="PID:g179376" /translation="MDEGPVDLRTRPKAAGLPGAALPLRKRPLRAPSPEPAAPRGAAG LVVPLDPLRGGCDLPAVPGPPHGLARPEALYYPGALLPLYPTRAMGSPFPLVNLPTPL YPMMCPMEHPLSADIAMATRADEDGDTPLHIAVVQGNLPAVHRLVNLFQQGGRELDIY NNLRQTPLHLAVITTLPSVVRLLVTAGASPMALDRHGQTAAHLACEHRSPTCLRALLD SAAPGTLDLEARNYDGLTALHVAVNTECQETVQLLLERGADIDAVDIKSGRSPLIHAV ENNSLSMVQLLLQHGANVNAQMYSGSSALHSASGRGLLPLVRTLVRSGADSSLKNCHN DTPLMVARSRRVIDILRGKATRPASTSQPDPSPDRSANTSPESSSRLSSNGLLSASPS SSPSQSPPRDPPGFPMAPPNFFLPSPSPPAFLPFAGVLRGPGRPVPPSPAPGGS" BASE COUNT 298 a 714 c 486 g 315 t ORIGIN 1 ccgtccccgg cggccccatg ccccgatgcc ccgcgggggc catggacgag gggcccgtgg 61 acctgcgcac ccggcccaag gccgccggac tcccgggcgc cgcgctgccg ctccgcaagc 121 gcccgctgcg cgcgccctcc ccggagcccg ccgctccccg cggcgctgcg ggccttgtcg 181 tccccctgga ccctctgcgc ggcggctgcg acctgccggc ggtccccggg cccccccacg 241 gcctggcccg gccggaggcg ctttactacc ccggagcctt actgcctttg taccccactc 301 gggccatggg ctccccgttt cctctggtga acctgcctac acccctatac cccatgatgt 361 gccccatgga acaccccctt tctgctgaca tcgccatggc cacccgtgca gatgaggacg 421 gagacacgcc tctccatatt gctgtggtgc agggtaacct gccagctgtg caccggctgg 481 tcaacctctt ccagcagggg ggccgggagc tcgacatcta caacaaccta cggcagacac 541 cgctccacct ggctgtgatc accacattac cgtctgtggt ccggctcctg gtgacagctg 601 gtgccagccc catggcgctg gaccgccatg gccagacggc cgctcacctg gcgtgcgagc 661 accgcagccc gacctgcctg cgagccctgc tggacagcgc agctccgggc acgttggacc 721 tggaggcccg caattatgac gggctcaccg ccctgcacgt ggcagtgaac accgagtgcc 781 aagaaaccgt gcagctcttg ctagagcgcg gtgccgacat cgacgcagtg gacattaaga 841 gcggccgctc cccgctcatc cacgccgtgg aaaacaacag ccttagcatg gtgcagctgc 901 tgctgcagca cggcgccaac gtgaacgcgc aaatgtactc cggcagctcc gccctgcact 961 cagcgtccgg ccgcgggctc ctcccgctgg tgcgcacgct ggtccgcagc ggcgctgaca 1021 gcagcctcaa gaactgccac aacgacacgc cgctcatggt ggcgcgcagc cgcagggtca 1081 tcgacatcct gagggggaag gccacccggc ctgcttccac ctcccagcca gacccctccc 1141 ctgaccggag cgccaacacc tcccccgaga gcagcagccg cctcagctcc aatggtcttc 1201 tctccgcatc accatcctcc tcaccctccc agtctccccc cagggacccc cctggattcc 1261 ccatggctcc tcccaatttc ttccttcctt ccccatctcc acccgccttc ctgccctttg 1321 ctggggtcct ccgaggccct ggccggccgg tgcccccctc cccagctcca ggaggcagct 1381 gagggggatg ggggggcaga tcttggactc atgaggaggg gcccccctgc ccagaggggt 1441 caacccttct ggaaactgtg aagatctgac ttcgcccccc ccccccccca tcttcgggac 1501 caggatttgc acagaagcac atgcacctac ccatacaccc cctcttctga gcgtccctgt 1561 tcccccatct cgctccctcc caggactctg accccagcat tctcaggcac cagtccctgt 1621 ccggaatgcc acccacatct tccatttcca tgtcccctcc cagagctggt ggacccaggg 1681 aacagccact cccctccact ctctaccaga taactgagga ggggagaggt gggccgtaac 1741 gggcacggat cacgatgtaa attattaagc attttggttg gatttctttt gtaataaact 1801 atttttgtac cat // LOCUS HUMBCTHA 1105 bp mRNA PRI 31-OCT-1994 DEFINITION Human brain-type clathrin light-chain a mRNA, complete cds. ACCESSION M20471 J04174 NID g179396 KEYWORDS clathrin; clathrin light chain a. SOURCE Human adult retina (neuronal cell), cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1105) AUTHORS Jackson,A.P. and Parham,P. TITLE Structure of human clathrin light chains. Conservation of light chain polymorphism in three mammalian species JOURNAL J. Biol. Chem. 263 (32), 16688-16695 (1988) MEDLINE 89034155 COMMENT Draft entry and computer-readable sequence [1] kindly submitted by A.Jackson, 02-SEPT-1988. FEATURES Location/Qualifiers source 1..1105 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Unassigned" gene 75..821 /gene="CLTA" CDS 75..821 /gene="CLTA" /note="clathrin light-chain a" /codon_start=1 /db_xref="GDB:G00-128-049" /db_xref="PID:g179397" /translation="MAELDPFGAPAGAPGGPALGNGVAGAGEEDPAAAFLAQQESEIA GIENDEAFAILDGGAPGPQPHGEPPGGPDAVDGVMNGEYYQESNGPTDSYAAISQVDR LQSEPESIRKWREEQMERLEALDANSRKQEAEWKEKAIKELEEWYARQDEQLQKTKAN NRVADEAFYKQPFADVIGYVTNINHPCYSLEQAAEEAFVNDIDESSPGTEWERVARLC DFNPKSSKQAKDVSRMRSVLISLKQAPLVH" BASE COUNT 275 a 272 c 314 g 244 t ORIGIN 9 bp upstream of BstXI site. 1 gccaccgctg tggtgtcggt gggtcggttg gtttttgtct caccgttggt gtccgtgccg 61 ttcagttgcc cgccatggct gagctggatc cgttcggcgc ccctgccggc gcccctggcg 121 gtcccgcgct ggggaacgga gtggccggcg ccggcgaaga agacccggct gcggccttct 181 tggcgcagca agagagcgag attgcgggca tcgagaacga cgaggccttc gccatcctgg 241 acggcggcgc ccccgggccc cagccgcacg gcgagccgcc ggggggtccg gatgctgttg 301 atggagtaat gaatggtgaa tactaccagg aaagtaatgg tccaacagac agttatgcag 361 ctatttcaca agtggatcga ttgcagtcag agcctgaaag tatccgtaaa tggagagaag 421 aacaaatgga acgcttggaa gcccttgatg ccaattctcg gaagcaagaa gcagagtgga 481 aagaaaaggc aataaaggag ctagaagaat ggtatgcaag acaggacgag cagctacaga 541 aaacaaaagc aaacaacagg gtggcagatg aagctttcta caaacaaccc ttcgctgacg 601 tgattggtta tgtcacaaac ataaaccatc cttgctacag cctagaacag gcagcagaag 661 aagcctttgt aaatgacatt gacgagtcgt ccccaggcac tgagtgggaa cgggtggccc 721 ggctgtgtga ctttaacccc aagtctagca agcaggccaa agatgtctcc cgcatgcgct 781 cagtcctcat ctccctcaag caggccccgc tggtgcactg aagagccacc ctgtggaaac 841 actacatctg caatatctta atcctactca gtgaagctct tcacagtcat tggattaatt 901 atgttgagtt cttttggacc aaaccttttt gtctttagag ttgttcattg tttgtgattg 961 catgtttcct tccttcaact gtgttctccc tggcattcag agaggaggga gaggaggaag 1021 aggaagggga gggaagcttc ccaagagtag cctcaacctg tgcttctgtg cattattctg 1081 agaataaatt tctgtttcaa actgt // LOCUS HUMBCTHB 1134 bp mRNA PRI 31-OCT-1994 DEFINITION Human brain-type clathrin light-chain b mRNA, complete cds. ACCESSION M20469 J04174 NID g179398 KEYWORDS clathrin; clathrin light chain b. SOURCE Human adult retina (neuronal cell), cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1134) AUTHORS Jackson,A.P. and Parham,P. TITLE Structure of human clathrin light chains. Conservation of light chain polymorphism in three mammalian species JOURNAL J. Biol. Chem. 263 (32), 16688-16695 (1988) MEDLINE 89034155 COMMENT Draft entry and computer-readable sequence [1] kindly submitted by A.Jackson, 02-SEPT-1988. FEATURES Location/Qualifiers source 1..1134 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Unassigned" gene 173..862 /gene="CLTB" CDS 173..862 /gene="CLTB" /note="clathrin light-chain b" /codon_start=1 /db_xref="GDB:G00-125-861" /db_xref="PID:g179399" /translation="MADDFGFFSSSESGAPEAAEEDPAAAFLAQQESEIAGIENDEGF GAPAGSHAAPAQPGPTSGAGSEDMGTTVNGDVFQEANGPADGYAAIAQADRLTQEPES IRKWREEQRKRLQELDAASKVTEQEWREKAKKDLEEWNQRQSEQVEKNKINNRIADKA FYQQPDADIIGYVASEEAFVKESKEETPGTEWEKVAQLCDFNPKSSKQCKDVSRLRSV LMSLKQTPLSR" BASE COUNT 225 a 338 c 390 g 181 t ORIGIN 8 bp upstream of NlaIV site. 1 cgcggggagc cggcgtcggc ggggacgggc ttggcgcgga ccgcacttcc tctccgccac 61 cgggcccggc tccccgcggc tcgggtgaca gcgtcgcggc cgccggacgc agcgcggggc 121 aggcgcgggc agagccgagc gcagcggagg ctccggcgga ggcgcgggga aaatggctga 181 tgactttggc ttcttctcgt cgtcggagag cggtgccccg gaggcggcgg aggaggaccc 241 ggcggccgcc ttcctggccc agcaggagag cgagattgca ggcatagaga acgacgaggg 301 cttcggggca cctgccggca gccatgcggc ccccgcgcag ccgggcccca cgagtggggc 361 tggttctgag gacatgggga ccacagtcaa tggagatgtg tttcaggagg ccaacggtcc 421 tgctgatggc tacgcagcca ttgcccaggc tgacaggctg acccaggagc ctgagagcat 481 ccgcaagtgg cgagaggagc agaggaaacg gctgcaagag ctggatgctg catctaaggt 541 cacggaacag gaatggcggg agaaggccaa gaaggacctg gaggagtgga accagcgcca 601 gagtgaacaa gtagagaaga acaagatcaa caaccggatc gctgacaaag cattctacca 661 gcagccagat gctgatatca tcggctacgt ggcatccgag gaggctttcg tgaaggaatc 721 caaggaggag accccaggca cagagtggga gaaggtggcc cagctatgtg acttcaaccc 781 caagagcagc aagcagtgca aagatgtgtc ccgcctgcgc tcggtgctca tgtccctgaa 841 gcagacgcca ctgtcccgct aggtgcctgc taggtgcatg gccacagagc atgggctggg 901 cctgggcaca ggaggagcag ctgctttggt cggggtggag actcgcagca gctgctaccc 961 acagcctatt ccactcctcc ccatctccag gcgctgggag gggggccctc accccatcac 1021 gcctcgctcc ctcctggccc tctggtccag cccctcacgc ctcctctcag tctactcaat 1081 tgtgactgtc cctcctgatg tatttttttt cttggcttaa agggtgtgtt gttg // LOCUS HUMBDGALA 2399 bp mRNA PRI 31-OCT-1994 DEFINITION Human beta-D-galactosidase mRNA, complete cds. ACCESSION M27507 J05124 NID g179400 KEYWORDS alternative splicing; beta-D-galactosidase; beta-galactosidase. SOURCE Human testis, cDNA to mRNA, clones H-beta-Ga39 and H-beta-GaL. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2399) AUTHORS Morreau,H., Galjart,N.J., Gillemans,N., Willemsen,R., van der Horst,G.T. and d'Azzo,A. TITLE Alternative splicing of beta-galactosidase mRNA generates the classic lysosomal enzyme and a beta-galactosidase-related protein JOURNAL J. Biol. Chem. 264 (34), 20655-20663 (1989) MEDLINE 90062209 COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by A.d'Azzo, 31-AUG-1989. FEATURES Location/Qualifiers source 1..2399 /organism="Homo sapiens" /db_xref="taxon:9606" /map="3p23-p22" sig_peptide 51..119 /gene="GLB1" /note="beta-D-galactosidase signal peptide" CDS 51..2084 /gene="GLB1" /note="beta-D-galactosidase precursor (EC 3.2.1.23)" /codon_start=1 /db_xref="GDB:G00-119-987" /db_xref="PID:g179401" /translation="MPGFLVRILLLLLVLLLLGPTRGLRNATQRMFEIDYSRDSFLKD GQPFRYISGSIHYSRVPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEPWPGQYQFSEDH DVEYFLRLAHELGLLVILRPGPYICAEWEMGGLPAWLLEKESILLRSSDPDYLAAVDK WLGVLLPKMKPLLYQNGGPVITVQVENEYGSYFACDFDYLRFLQKRFRHHLGDDVVLF TTDGAHKTFLKCGALQGLYTTVDFGTGSNITDAFLSQRKCEPKGPLINSEFYTGWLDH WGQPHSTIKTEAVASSLYDILARGASVNLYMFIGGTNFAYWNGANSPYAAQPTSYDYD APLSEAGDLTEKYFALRNIIQKFEKVPEGPIPPSTPKFAYGKVTLEKLKTVGAALDIL CPSGPIKSLYPLTFIQVKQHYGFVLYRTTLPQDCSNPAPLSSPLNGVHDRAYVAVDGI PQGVLERNNVITLNITGKAGATLDLLVENMGRVNYGAYINDFKGLVSNLTLSSNILTD WTIFPLDTEDAVRSHLGGWGHRDSGHHDEAWAHNSSNYTLPAFYMGNFSIPSGIPDLP QDTFIQFPGWTKGQVWINGFNLGRYWPARGPQLTLFVPQHILMTSAPNTITVLELEWA PCSSDDPELCAVTFVDRPVIGSSVTYDHPSKPVEKRLMPPPPQKNKDSWLDHV" gene 51..2084 /gene="GLB1" mat_peptide 120..2081 /gene="GLB1" /note="beta-D-galactosidase" BASE COUNT 566 a 631 c 608 g 594 t ORIGIN 1 gcgaagcggc cggcctgggc gccgactgca gagccgggag gctggtggtc atgccggggt 61 tcctggttcg catcctcctt ctgctgctgg ttctgctgct tctgggccct acgcgcggct 121 tgcgcaatgc cacccagagg atgtttgaaa ttgactatag ccgggactcc ttcctcaagg 181 atggccagcc atttcgctac atctcaggaa gcattcacta ctcccgtgtg ccccgcttct 241 actggaagga ccggctgctg aagatgaaga tggctgggct gaacgccatc cagacgtatg 301 tgccctggaa ctttcatgag ccctggccag gacagtacca gttttctgag gaccatgatg 361 tggaatattt tcttcggctg gctcatgagc tgggactgct ggttatcctg aggcccgggc 421 cctacatctg tgcagagtgg gaaatgggag gattacctgc ttggctgcta gagaaagagt 481 ctattcttct ccgctcctcc gacccagatt acctggcagc tgtggacaag tggttgggag 541 tccttctgcc caagatgaag cctctcctct atcagaatgg agggccagtt ataacagtgc 601 aggttgaaaa tgaatatggc agctactttg cctgtgattt tgactacctg cgcttcctgc 661 agaagcgctt tcgccaccat ctgggggatg atgtggttct gtttaccact gatggagcac 721 ataaaacatt cctgaaatgt ggggccctgc agggcctcta caccacggtg gactttggaa 781 caggcagcaa catcacagat gctttcctaa gccagaggaa gtgtgagccc aaaggaccct 841 tgatcaattc tgaattctat actggctggc tagatcactg gggccaacct cactccacaa 901 tcaagaccga agcagtggct tcctccctct atgatatact tgcccgtggg gcgagtgtga 961 acttgtacat gtttataggt gggaccaatt ttgcctattg gaatggggcc aactcaccct 1021 atgcagcaca gcccaccagc tacgactatg atgccccact gagtgaggct ggggacctca 1081 ctgagaagta ttttgctctg cgaaacatca tccagaagtt tgaaaaagta ccagaaggtc 1141 ctatccctcc atctacacca aagtttgcat atggaaaggt cactttggaa aagttaaaga 1201 cagtgggagc agctctggac attctgtgtc cctctgggcc catcaaaagc ctttatccct 1261 tgacatttat ccaggtgaaa cagcattatg ggtttgtgct gtaccggaca acacttcctc 1321 aagattgcag caacccagca cctctctctt cacccctcaa tggagtccac gatcgagcat 1381 atgttgctgt ggatgggatc ccccagggag tccttgagcg aaacaatgtg atcactctga 1441 acataacagg gaaagctgga gccactctgg accttctggt agagaacatg ggacgtgtga 1501 actatggtgc atatatcaac gattttaagg gtttggtttc taacctgact ctcagttcca 1561 atatcctcac ggactggacg atctttccac tggacactga ggatgcagtg cgcagccacc 1621 tggggggctg gggacaccgt gacagtggcc accatgatga agcctgggcc cacaactcat 1681 ccaactacac gctcccggcc ttttatatgg ggaacttctc cattcccagt gggatcccag 1741 acttgcccca ggacaccttt atccagtttc ctggatggac caagggccag gtctggatta 1801 atggctttaa ccttggccgc tattggccag cccggggccc tcagttgacc ttgtttgtgc 1861 cccagcacat cctgatgacc tcggccccaa acaccatcac cgtgctggaa ctggagtggg 1921 caccctgcag cagtgatgat ccagaactat gtgctgtgac gttcgtggac aggccagtta 1981 ttggctcatc tgtgacctac gatcatccct ccaaacctgt tgaaaaaaga ctcatgcccc 2041 cacccccgca aaaaaacaaa gattcatggc tggaccatgt atgatgatga aagcctgtgt 2101 ctttgaggga ttctaccctg aacatacctc acagatcctc cctgtcatgc cacatttcac 2161 tgattggaat gtggaaatgg aaaaggaatt taggatgtgc attttcacct gaggtttccc 2221 tgcatccctg cagtgccaaa gccccacctt cagggaccac ctggaatgtg tgagggctga 2281 cagcacagta acgtgcatac atatctgcag ggctggaatg gaagctttaa aggtggtagt 2341 gatttttatt ttggaagaat catgttacct ttttgttaaa taaaatttgt actcaaatg // LOCUS HUMBDNF 918 bp DNA PRI 31-OCT-1994 DEFINITION Human brain-derived neurotrophic factor (BDNF) gene, complete cds. ACCESSION M37762 NID g179402 KEYWORDS neurotrophic factor. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 918) AUTHORS Jones,K.R. and Reichardt,L.F. TITLE Molecular cloning of a human gene that is a member of the nerve growth factor family JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87 (20), 8060-8064 (1990) MEDLINE 91045937 COMMENT Draft entry and computer-readable sequence for [Proc. Natl. Acad. Sci. U.S.A. (1990) In press] kindly submitted by K.R.Jones, 13-AUG-1990. FEATURES Location/Qualifiers source 1..918 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" sig_peptide 76..123 /gene="NTF3" /note="G00-125-917; putative" /product="brain-derived neurotrophic factor" CDS 76..819 /gene="BDNF" /note="putative" /codon_start=1 /db_xref="GDB:G00-125-916" /product="brain-derived neurotrophic factor" /db_xref="PID:g179403" /translation="MTILFLTMVISYFGCMKAAPMKEANIRGQGGLAYPGVRTHGTLE SVNGPKAGSRGLTSLADTFEHVIEELLDEDQKVRPNEENNKDADLYTSRVMLSSQVPL EPPLLFLLEEYKNYLDAANMSMRVRRHSDPARRGELSVCDSISEWVTAADKKTAVDMS GGTVTVLEKVPVSKGQLKQYFYETKCNPMGYTKEGCRGIDKRHWNSQCRTTQSYVRAL TMDSKKRIGWRFIRIDTSCVCTLTIKRGR" gene 76..816 /gene="NTF3" /map="12p13" gene 76..819 /gene="BDNF" /map="11p13" mat_peptide 124..816 /gene="NTF3" /note="G00-125-917; putative" /product="brain-derived neurotrophic factor" BASE COUNT 269 a 192 c 237 g 220 t ORIGIN 1 ggtgaaagaa agccctaacc agttttctgt cttgtttctg ctttctccct acagttccac 61 caggtgagaa gagtgatgac catccttttc cttactatgg ttatttcata ctttggttgc 121 atgaaggctg cccccatgaa agaagcaaac atccgaggac aaggtggctt ggcctaccca 181 ggtgtgcgga cccatgggac tctggagagc gtgaatgggc ccaaggcagg ttcaagaggc 241 ttgacatcat tggctgacac tttcgaacac gtgatagaag agctgttgga tgaggaccag 301 aaagttcggc ccaatgaaga aaacaataag gacgcagact tgtacacgtc cagggtgatg 361 ctcagtagtc aagtgccttt ggagcctcct cttctctttc tgctggagga atacaaaaat 421 tacctagatg ctgcaaacat gtccatgagg gtccggcgcc actctgaccc tgcccgccga 481 ggggagctga gcgtgtgtga cagtattagt gagtgggtaa cggcggcaga caaaaagact 541 gcagtggaca tgtcgggcgg gacggtcaca gtccttgaaa aggtccctgt atcaaaaggc 601 caactgaagc aatacttcta cgagaccaag tgcaatccca tgggttacac aaaagaaggc 661 tgcaggggca tagacaaaag gcattggaac tcccagtgcc gaactaccca gtcgtacgtg 721 cgggccctta ccatggatag caaaaagaga attggctggc gattcataag gatagacact 781 tcttgtgtat gtacattgac cattaaaagg ggaagatagt ggatttatgt tgtatagatt 841 agattatatt gagacaaaaa ttatctattt gtatatatac ataacagggt aaattattca 901 gttaagaaaa aaataatt // LOCUS HUMBDR2 840 bp mRNA PRI 27-MAR-1996 DEFINITION Human BDR-2 mRNA for hippocalcin, complete cds. ACCESSION D16593 NID g493243 KEYWORDS calcium-binding; hippocalcin. SOURCE Homo sapiens 2 years old female brain (hippocampus) (library: lambda ZAP II clontech Co.Ltd) cDNA to mRNA, clone BDR-2. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 840) AUTHORS Takamatsu,K., Kobayashi,M., Saitoh,S., Fujishiro,M. and Noguchi,T. TITLE Molecular cloning of human hippocalcin cDNA and chromosomal mapping of its gene JOURNAL Biochem. Biophys. Res. Comm. 200, 606-611 (1994) REFERENCE 2 (bases 1 to 840) AUTHORS Kobayashi,M. TITLE Direct Submission JOURNAL Submitted (02-JUL-1993) to the DDBJ/EMBL/GenBank databases. Masaaki Kobayashi, Toho University School of Medicine, Department of Physiology; Ohmori-nishi, Ohta-ku, Tokyo 143, Japan (Tel:03-3762-4151(ex.2345), Fax:03-3761-0546) COMMENT Submitted (02-JUL-1993) to DDBJ by: Masaaki Kobayashi Dept. of Physiology Toho University School of Medicine 5-21-16 Ohmori-nishi Ohta-ku, Tokyo 143 Japan Phone: 03-3762-4151 x2345 Fax: 03-3762-8225. FEATURES Location/Qualifiers source 1..840 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="2 years old" /sex="female" /tissue_type="brain ( hippocampus )" gene 11..592 /gene="BDR-2" CDS 11..592 /gene="BDR-2" /codon_start=1 /product="hippocalcin" /db_xref="PID:d1004535" /db_xref="PID:g747652" /translation="MGKQNSKLRPEMLQDLRENTEFSELELQEWYKGFLKDCPTGILN VDEFKKIYANFFPYGDASKFAEHVFRTFDTNSDGTIDFREFIIALSVTSPGRLEQKLM WAFSMYDLDGNGYISREEMLEIVQAIYKMVSSVMKMPEDESTPEKRTEKIFRQMDTNN DGKLSLEEFIRGAKSDPSIVRLLQCDPSSRSQF" misc_feature 227..262 /gene="BDR-2" /note="calcium-binding domain; putative" misc_feature 335..370 /gene="BDR-2" /note="calcium-binding domain; putative" misc_feature 479..514 /gene="BDR-2" /note="calcium-binding domain; putative" BASE COUNT 187 a 252 c 240 g 161 t ORIGIN 1 gaattccgcc atgggcaagc agaacagcaa gctgcggccc gagatgttgc aggacctgcg 61 agagaacaca gagttctcag agctggagct gcaggagtgg tacaagggct tcctcaagga 121 ctgccccaca ggaatcctca atgtggatga gttcaagaag atctacgcca acttctttcc 181 ctatggtgac gcctccaagt ttgccgagca cgtcttccgc acctttgaca ccaacagcga 241 tggcaccata gactttcggg agttcatcat tgcgctgagc gtgacctcgc ccggccgcct 301 ggagcagaag ctcatgtggg ccttcagcat gtatgacctg gacggcaacg gctacatcag 361 ccgggaggag atgctggaga tcgtgcaggc catttacaag atggtttcgt ccgtgatgaa 421 gatgccggag gacgagtcga ccccggaaaa gaggactgag aaaatcttcc gccaaatgga 481 cacaaacaac gacggcaagc tgtccttgga ggagttcatc cgcggggcca aaagcgaccc 541 gtccatcgtg cgtctgctgc agtgcgaccc cagcagtcgc tcccagttct gagaggagcc 601 aggttcccct tcctccctcc cttcaccggc cccctcccgg ctcttagctt ccactccctt 661 gtgtgtattc tggctggggg ccagattggg gaagcccttc tccccgggtc tgcctgtggg 721 gggcttccgg aaaagggaac ctgcggtacc cccaggccaa agcaagtaag cggttagcac 781 cccccaatcc cagaggcaac aatagagaca caggctgggt tggtctgccc ctcggaattc // LOCUS HUMBETAADA 3909 bp mRNA PRI 27-NOV-1995 DEFINITION Human beta adaptin protein mRNA, complete cds. ACCESSION L13939 NID g520827 KEYWORDS beta adaptin. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3909) AUTHORS Peyrard,M., Fransson,I., Xie,Y.G., Han,F.Y., Ruttledge,M.H., Swahn,S., Collins,J.E., Dunham,I., Collins,V.P. and Dumanski,J.P. TITLE Characterization of a new member of the human beta-adaptin gene family from chromosome 22q12, a candidate meningioma gene JOURNAL Hum. Mol. Genet. 3 (8), 1393-1399 (1994) MEDLINE 95078847 FEATURES Location/Qualifiers source 1..3909 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="brain" /tissue_lib="fetal brain cDNA library" CDS 47..2896 /codon_start=1 /product="beta adaptin protein" /db_xref="PID:g520828" /translation="MTDSKYFTTTKKGEIFELKAELNSDKKEKKKEAVKKVIASMTVG KDVSALFPDVVNCMQTDNLELKKLVYLYLMNYAKSQPDMAIMAVNTFVKDCEDPNPLI RALAVRTMGCIRVDKITEYLCEPLRKCLKDEDPYVRKTAAVCVAKLHDINAQLVEDQG FLDTLKDLISDSNPMVVANAVAALSEIAESHPSSNLLDLNPQSINKLLTALNECTEWG QIFILDCLANYMPKDDREAQSICERVTPRLSHANSAVVLSAVKVLMKFMEMLSKDLDY YGTLLKKLAPPLVTLLSAEPELQYVALRNINLIVQKRPEILKHEMKVFFVKYNDPIYV KLEKLDIMIRLASQANIAQVLAELKEYATEVDVDFVRKAVRAIGRCAIKVEQSAERCV STLLDLIQTKVNYVVQEAIVVIKDIFRKYPNKYESVIATLCENLDSLDEPEARAAMIW IVGEYAERIDNADELLESFLEGFHDESTQVQLQLLTAIVKLFLKKPTETQELVQQVLS LATQDSDNPDLRDRGYIYWRLLSTDPVAAKEVVLAEKPLISEETDLIEPTLLDELICY IGTLASVYHKPPSAFVEGGRGVVHKSLPPRTASSESAESPETAPTGAPPGEQPDVIPA QGDLLGDLLNLDLGPPVSGPPLATSSVQMGAVDLLGGGLDSLMGDEPEGIGGTNFVAP PTAAVPANLGAPIGSGLSDLFDLTSGVGTLSGSYVAPKAVWLPAMKAKGLEISGTFTR QVGSISMDLQLTNKALQVMTDFAIQFNRNSFGLAPAAPLQVHAPLSPNQTVEISLPLS TVGSVMKMEPLNNLQVAVKNNIDVFYFSTLYPLHILFVEDGKMDRQMFLATWKDIPNE NEAQFQIRDCPLNAEAASSKLQSSNIFTVAKRNVEGQDMLYQSLKLTNGIWVLAELRI QPGNPSCTDLELSLKCRAPEVSQHVYQAYETILKN" BASE COUNT 841 a 1202 c 1130 g 736 t ORIGIN 1 gagctattgg gacctgcgga agcctggcta cagataaggg accaaaatga ctgactcaaa 61 atatttcacc acgaccaaga aaggggagat cttcgagctg aaggcagagc tcaacagtga 121 caagaaggag aagaagaagg aggcagtgaa gaaagtgatt gcatcgatga ccgtgggcaa 181 agatgtcagt gccctcttcc ccgatgtggt caactgcatg cagacggaca acctggagct 241 gaagaagctg gtatacctct acttgatgaa ttacgccaag agtcagcctg acatggccat 301 tatggccgtc aacacctttg tgaaggactg tgaggacccc aaccccctca tccgagccct 361 ggcagtgcgg accatgggct gcatccgcgt tgacaagatc acagagtacc tgtgcgagcc 421 actccggaag tgcctgaagg acgaggatcc atatgtgcgc aagacagcag ctgtgtgcgt 481 ggccaagctc cacgacatca acgcccagct ggtggaggac cagggcttcc tggacaccct 541 taaagacctc atctccgact ctaaccccat ggtggtggcc aatgcagtgg cagcgctctc 601 agaaattgcc gagtctcacc ccagcagcaa cctgctcgat ctgaacccac agtccatcaa 661 caagctgctg acagccctca atgagtgcac cgagtggggc cagatcttca tcctggactg 721 cctcgccaac tatatgccca aggacgaccg cgaggcccag agcatctgtg agcgggtcac 781 ccccaggctc tcccatgcca actccgctgt ggtgctctct gctgtgaagg tgctgatgaa 841 gttcatggag atgttgtcta aggacttgga ctactacggc acactgctca agaagctggc 901 cccacccctg gtcacactgc tgtcagccga gccagagctg cagtatgtgg ccctgcgcaa 961 catcaatctc atcgtgcaga aaaggcctga gatcctgaag catgagatga aggtgttctt 1021 cgtgaagtac aacgacccta tctacgtgaa gctggagaag ctggacatca tgatccgcct 1081 ggcctctcag gccaacatcg cccaggtgtt ggcagagctg aaagagtacg caacagaagt 1141 ggatgtggac tttgtacgga aggctgtgcg tgctattggc cgctgcgcca tcaaggtgga 1201 gcaatctgcg gagcgctgtg tgagcacgct gctcgacctc atccagacca aggtcaacta 1261 tgtggtccag gaggccatcg tggtcatcaa ggacatcttc cgcaagtacc ccaacaagta 1321 tgagagtgtg attgccacac tgtgtgagaa tctggactcc ctggatgagc ctgaggcccg 1381 ggctgccatg atctggattg tgggcgagta cgcggaacgg atcgacaacg cagatgagct 1441 gctggagagc ttcctcgagg gcttccatga cgagagcaca caggtccagc tgcagctgct 1501 gacagccatt gtgaaactct ttctaaagaa gccaacagag acccaggagc tggtgcagca 1561 ggtcctcagt ttggccactc aggactcaga taacccagac ctgcgggacc gtggctacat 1621 ctactggcgc ctgctgtcca cggacccggt ggcagccaag gaggtggtgt tggctgagaa 1681 gccactcatc tctgaagaga cggacctcat cgagcccaca ctgttagacg agcttatctg 1741 ctacatcggc acgctggctt ccgtctacca taagcctccc agtgcctttg tggagggggg 1801 ccggggcgtc gtgcacaaga gcctgccacc tcgcacggcc tcgagtgaga gcgcagagag 1861 ccctgagaca gcccctactg gagcacctcc tggggagcag ccagatgtca tccccgccca 1921 gggcgacctg ctgggtgacc tcctcaacct ggacctcggc cccccagtga gcggcccacc 1981 cctggccacc tcctcggtgc agatgggagc tgtggacctt cttggcggtg gccttgacag 2041 cctgatgggg gatgagcctg aagggattgg gggcaccaac ttcgtggcac ctccaacagc 2101 agcagtacca gccaatcttg gagcacccat cggcagtggc ctgagtgacc tctttgacct 2161 gaccagtggc gtgggcaccc tgtcaggatc atatgtggcc cccaaagcag tctggctccc 2221 agccatgaag gctaaggggc tggagatctc aggcaccttc acccgccagg tgggctccat 2281 ctccatggac ctgcagctga ccaacaaggc cttgcaggtc atgaccgact ttgccatcca 2341 gttcaaccgc aacagctttg gcctggcccc cgccgccccc ctccaggtcc acgcgccact 2401 cagccccaac cagacagtgg agatctccct gcctctcagc acggtgggct cggtcatgaa 2461 gatggagcct ctgaacaacc tccaggtggc cgtgaagaac aacatcgatg tcttctactt 2521 cagcaccttg tacccactgc acatcctctt tgtggaggac gggaagatgg accggcagat 2581 gttcctggcc acatggaagg atattcccaa tgagaatgag gcccagttcc agatcagaga 2641 ctgccccctc aatgcagagg ctgcgagcag caagctgcag agcagcaaca tcttcactgt 2701 cgccaagagg aacgtggagg gccaggacat gctctaccag tccctgaagc tgaccaacgg 2761 catctgggtg ctggcggagc tgcggatcca gccgggcaac cccagctgca cggacttaga 2821 gctgtccctg aagtgtcgag caccagaggt gtcccagcac gtgtaccagg cctacgagac 2881 catcctcaag aactgagacc ccggccagcg cccaccccag ccttctgccc gccccatcga 2941 ggaggcccct cgggggcagc acatcttcct cctcgcagga gggaccaggc ggggctccag 3001 gccactcagt gggctccctg gtcctgatgg cagaacccac ccgatccctg gggtagggca 3061 ccaccccctc ctggggtgag agcgcagtgc actcccgtgc tctgggacac ccctgctcct 3121 gtggctgtga tgtggggtta agtgaggtgg ggaccaaagg aaacagagcc agagcagcca 3181 cagaagctgt gcctgaaggg tgagtgtgga gcttgcccct ccggctcaca gccccggcag 3241 cccctggctc cttggtctct ccggttggtg ttaaagggcc ctccactgcc acctctcatg 3301 ggatggaccc tgccaacctg gcctgggtgt gcagggaggg gttccccttg gtaccaggag 3361 agctgctcac ttagggcctg gggctcaagg agctgtaggt gccggcagag gggcagagct 3421 aggctgggag gagcccaggg cctgcaccac ccacttgcaa ccaccaggct ggtgccctgc 3481 agctgtgcca gttgggccac agcctcccga gtgctgaccc acatggtcaa gaccaggcag 3541 aagctcccag agcccctgtc tcgggctccc caccgactgg cagctgcact atccccaact 3601 agcccctgtc tcgggctccc caccgactgg cagctgcact atccccaact ccccacttct 3661 gccccaaggg tggtcactgc ctgtgataca ttcctcagtg tcgcctctga gcccaggcct 3721 cctgccacaa agactgggcc gagagatggg gctgggtaag cgggtgcgcc tcctgtttgg 3781 gtttcttcgg ggtttctctg tagtgtctgc tgccctcctc ctggcctccc aggtttcagt 3841 tgtgtctgac agagcattag gttttcctgt tactgctgtg taataataaa gaaagattcg 3901 gcttttggc // LOCUS HUMBFDNA 2825 bp mRNA PRI 15-MAR-1990 DEFINITION Human DNA-binding factor mRNA, complete cds. ACCESSION M29204 NID g179411 KEYWORDS DNA-binding protein. SOURCE Human epidermoid carcinoma cell line A431, cDNA to mRNA, clones lambda-GCF[1,4]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2825) AUTHORS Kageyama,R. and Pastan,I. TITLE Molecular cloning and characterization of a human DNA binding factor that represses transcription JOURNAL Cell 59, 815-825 (1989) MEDLINE 90075226 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by I.Pastan, 19-OCT-1989. FEATURES Location/Qualifiers source 1..2825 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..2825 /note="DNABF mRNA (alt.)" mRNA 40..2825 /note="DNABF mRNA (alt., 5' end approx.)" CDS 224..2578 /note="DNA-binding factor" /codon_start=1 /db_xref="PID:g179412" /translation="MKKRVTNRERHWTHRRRRQRTRKKKKKKKRVLGRRALGPRPWLT GRKGLFGSARLIPATAMAPRSRLLSLGRRGNFRSRVLRRKSRPLEEAARRWRDCPTGF GALVAGAGSGRAPGVPPKRLPARTKAQNPEPLMCPQMKRIKYITPQKVRMIRVCLLTV LALLEKKNFHQQLRSQMQLLFRQPAENVELARAQDDYISLDVQHTSSISVSRNEETSE ESQEDEKQDTWEQQQMRKAVKIIEERDIDLSCGSGSSKVKKFDTSISFPPVNLEIIKK QLNTRLTLLQETHRSHLREYEKYVQDVKSSKSTIQNLESSSNQALNCKFYKSMKIYVE NLIDCLNEKIINIQEIESSMHALLLKQAMTFMKRRQDELKHESTYLQQLSRKDETSTS GNFSVDEKTQWILEEIESRRTKRRQARVLSGNCNHQEGTSSDDELPSAEMIDFQKSQG DILQKQKKVFEEVQDDFCNIQNILLKFQQWREKFPDSYYEAFISLCIPKLLNPLIRVQ LIDWNPLKLESTGLKEMPWFKSVEEFMDSSVEDSKKESSSDKKVLSAIINKTIIPRLT DFVEFLWDPLSTSQTTSLITHCRVILEEHSTCENEVSKSRQDLLKSIVSRMKKAVEDD VFIPLYPKSAVENKTSPHSKFQERQFWSGLKLFRNILLWNGLLTDDTLQELGLGKLLN RYLIIALLNATPGPDVVKKCNQVAACLPEKWFENSAMRTSIPQLENFIQFLLQSAHKL SRSEFRDEVEEIILILVKIKALNQAESFIGEHHLDHLKSLIKED" BASE COUNT 966 a 556 c 623 g 680 t ORIGIN 1931 bp upstream of EcoRI site. 1 accaaccagg aagcagctga gcccaaggag gttccagcgc acagtacaga agtaggtagg 61 gatcacaacg aagaagaggg tgaagaaaca ggattaaggg acgagaaacc aatcaagaca 121 gaagttcctg gttctccagc aggaactgag ggcaactgtc aggaagcgac aggtccaagt 181 acagtagaca ctcaaaatga acccttagat atgaaagagc ccgatgaaga aaagagtgac 241 caacagggag aggcattgga ctcatcgcag aagaagacaa agaacaagaa aaaaaaaaaa 301 aaaaaaaaag cgggttctag ggcgccgggc gctcgggcct cggccatggc tcacaggccg 361 aaaaggactt ttcggcagcg cgcggctgat tccagcgaca gcgatggcgc cgaggagtcg 421 cctgctgagc ctggggcgcc gagggaactt ccggtcccgg gttctgcgga ggaagagccg 481 ccctctggag gaggccgcgc gcaggtggcg ggactgcccc accgggttcg gggccctcgt 541 ggccggggcc gggtctgggc gagctcccgg cgtgccacca aagcggctcc ccgcgcggac 601 gaaggctcag aatccagaac ccttgatgtg tccacagatg aagaggataa aatacatcac 661 tcctcagaaa gtaaggatga tcagggtttg tcttctgaca gttctagctc tcttggagaa 721 aaagaacttt catcaacagt taagatccca gatgcagctt ttattcaggc agcccgcaga 781 aaacgttgaa ttggccaggg cccaagatga ctatatttct ttggatgtac aacatacctc 841 ctccatctct gtaagcagaa atgaagaaac aagtgaagaa agtcaggaag atgaaaagca 901 agatacttgg gaacaacagc aaatgaggaa agcagttaaa atcatagagg aaagagacat 961 agatctttcc tgtggcagtg gatcttcaaa agtgaagaaa tttgatactt ccatttcatt 1021 tccgccagta aatttagaaa ttataaagaa gcaattaaat actagattaa cattactaca 1081 ggaaactcac cgctcacacc tgagggagta tgaaaaatac gtacaagatg tcaaaagctc 1141 aaagagtacc atccagaacc tagagagttc atcaaatcaa gctctaaatt gtaaattcta 1201 taaaagcatg aaaatttatg tggaaaattt aattgactgc cttaatgaaa agattatcaa 1261 catccaagaa atagaatcat ccatgcatgc actcctttta aaacaagcta tgacctttat 1321 gaaacgcagg caagatgaat taaaacatga atcaacgtat ttacaacagt tatcacgcaa 1381 agatgagaca tccacaagtg gaaacttctc agtagatgaa aaaactcagt ggattttaga 1441 agagattgaa tctcgaagga caaaaagaag acaagcaagg gtgctttctg ggaattgtaa 1501 ccatcaggaa ggaacatcta gtgatgatga actgccttca gcagagatga ttgacttcca 1561 aaaaagccaa ggtgacattt tacagaaaca gaagaaagtt tttgaagaag tgcaagatga 1621 tttttgtaac atccagaata ttttgttgaa atttcagcaa tggcgagaaa agtttcctga 1681 ctcctattat gaagctttca ttagtttatg cataccaaag cttttaaatc ccctaatacg 1741 agttcagttg attgattgga atcctcttaa gttggaatcc acaggtttaa aagagatgcc 1801 atggttcaaa tctgtagaag aatttatgga tagcagtgta gaagattcaa agaaggaaag 1861 tagttcagat aaaaaagtct tgtctgcaat catcaacaaa acaattattc cccgacttac 1921 agactttgta gaattccttt gggatccttt gtcaacctca cagacaacaa gtttaataac 1981 acattgcaga gtgattcttg aagaacattc cacttgtgaa aatgaagtta gtaaaagcag 2041 acaggattta cttaaatcca ttgtttcaag aatgaaaaag gcagtagaag atgatgtttt 2101 tattcctctg tatccaaaga gtgctgtaga aaacaaaaca tcacctcatt caaagttcca 2161 agaaagacag ttctggtcag gcctaaagct cttccgcaat attcttcttt ggaatggact 2221 ccttacagat gacaccttgc aagaactagg actagggaag ctgctaaatc gttaccttat 2281 tatagcactt ctcaatgcca cacctgggcc agatgtggtt aaaaagtgca accaggtagc 2341 agcatgtcta ccagaaaaat ggtttgaaaa ttctgccatg aggacatcta ttccacagct 2401 agaaaacttc attcagtttt tattgcagtc tgcacataaa ttatctagaa gtgaattcag 2461 ggatgaagtc gaagaaataa ttcttatttt ggtgaaaata aaagctttga atcaagcaga 2521 atccttcata ggagagcatc acctagacca tcttaaatca ctaattaaag aagattgaat 2581 aaactttatt ggaaaatgct aaaattttaa tatagttaca ctcagttcct ttgtttgaga 2641 agaagctggt gcctctctct tctttattcc ctgtaataga aggtaggatt tgaaaaaaag 2701 caggactcca cctctgtatt cccccgtgct ttaccttctg gcatcatgaa aagctgccat 2761 gattctgtgg tgttctaagg aattaaatgc actggagctt taagagctca acgtgtttcc 2821 ctttg // LOCUS HUMBGLUKIN 2585 bp mRNA PRI 31-OCT-1994 DEFINITION Human pancreatic beta-cell glucokinase mRNA, complete cds. ACCESSION M88011 NID g179426 KEYWORDS glucokinase. SOURCE Homo sapiens adult cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2585) AUTHORS Nishi,S., Stoffel,M., Xiang,K., Shows,T.B., Bell,G.I. and Takeda,J. TITLE Human pancreatic beta-cell glucokinase: cDNA sequence and localization of the polymorphic gene to chromosome 7, band p 13 [published erratum appears in Diabetologia 1992 Nov;35(11):1100] JOURNAL Diabetologia 35 (8), 743-747 (1992) MEDLINE 92380355 FEATURES Location/Qualifiers source 1..2585 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="pancreatic beta-cell" /dev_stage="adult" /map="Unassigned" gene 330..1727 /gene="GCK" CDS 330..1727 /gene="GCK" /codon_start=1 /db_xref="GDB:G00-127-550" /product="glucokinase" /db_xref="PID:g179427" /translation="MLDDRARMEAAKKEKVEQILAEFQLQEEDLKKVMRRMQKEMDRG LRLETHEEASVKMLPTYVRSTPEGSEVGDFLSLDLGGTNFRVMLVKVGEGEEGQWSVK TKHQMYSIPEDAMTGTAEMLFDYISECISDFLDKHQMKHKKLPLGFTFSFPVRHEDID KGILLNWTKGFKASGAEGNNVVGLLRDAIKRRGDFEMDVVAMVNDTVATMISCYYEDH QCEVGMIVGTGCNACYMEEMQNVELVEGDEGRMCVNTEWGAFGDSGELDEFLLEYDRL VDESSANPGQQLYEKLIGGKYMGELVRLVLLRLVDENLLFHGEASEQLRTRGAFETRF VSQVESDTGDRKQIYNILSTLGLRPSTTDCDIVRRACESVSTRAAHMCSAGLAGVINR MRESRSEDVMRITVGVDGSVYKLHPSFKERFHASVRRLTPSCEITFIESEEGSGRGAA LVSAVACKKACMLGQ" BASE COUNT 578 a 740 c 826 g 441 t ORIGIN 1 ggacactaag ccccacagct caacacaacc aggagagaaa gcgctgagga cgccacccaa 61 gcgcccagca atggccctgc ctggagaaca tccaggctca gtgaggaagg gtccagaagg 121 gaatgcttgc cgactcgttg gagaacaatg aaaaggagga aactgtgact gaacctcaaa 181 ccccaaacca gcccgaggag aaccacattc tcccagggac ccagggcggg ccgtgacccc 241 tgcggcggag aagccttgga tatttccact tcagaagcct actggggaag gctgaggggt 301 cccagctccc cacgctggct gctgtgcaga tgctggacga cagagccagg atggaggccg 361 ccaagaagga gaaggtagag cagatcctgg cagagttcca gctgcaggag gaggacctga 421 agaaggtgat gagacggatg cagaaggaga tggaccgcgg cctgaggctg gagacccatg 481 aagaggccag tgtgaagatg ctgcccacct acgtgcgctc caccccagaa ggctcagaag 541 tcggggactt cctctccctg gacctgggtg gcactaactt cagggtgatg ctggtgaagg 601 tgggagaagg tgaggagggg cagtggagcg tgaagaccaa acaccagatg tactccatcc 661 ccgaggacgc catgaccggc actgctgaga tgctcttcga ctacatctct gagtgcatct 721 ccgacttcct ggacaagcat cagatgaaac acaagaagct gcccctgggc ttcaccttct 781 cctttcctgt gaggcacgaa gacatcgata agggcatcct tctcaactgg accaagggct 841 tcaaggcctc aggagcagaa gggaacaatg tcgtggggct tctgcgagac gctatcaaac 901 ggagagggga ctttgaaatg gatgtggtgg caatggtgaa tgacacggtg gccacgatga 961 tctcctgcta ctacgaagac catcagtgcg aggtcggcat gatcgtgggc acgggctgca 1021 atgcctgcta catggaggag atgcagaatg tggagctggt ggagggggac gagggccgca 1081 tgtgcgtcaa taccgagtgg ggcgccttcg gggactccgg cgagctggac gagttcctgc 1141 tggagtatga ccgcctggtg gacgagagct ctgcaaaccc cggtcagcag ctgtatgaga 1201 agctcatagg tggcaagtac atgggcgagc tggtgcggct tgtgctgctc aggctcgtgg 1261 acgaaaacct gctcttccac ggggaggcct ccgagcagct gcgcacacgc ggagccttcg 1321 agacgcgctt cgtgtcgcag gtggagagcg acacgggcga ccgcaagcag atctacaaca 1381 tcctgagcac gctggggctg cgaccctcga ccaccgactg cgacatcgtg cgccgcgcct 1441 gcgagagcgt gtctacgcgc gctgcgcaca tgtgctcggc ggggctggcg ggcgtcatca 1501 accgcatgcg cgagagccgc agcgaggacg taatgcgcat cactgtgggc gtggatggct 1561 ccgtgtacaa gctgcacccc agcttcaagg agcggttcca tgccagcgtg cgcaggctga 1621 cgcccagctg cgagatcacc ttcatcgagt cggaggaggg cagtggccgg ggcgcggccc 1681 tggtctcggc ggtggcctgt aagaaggcct gtatgctggg ccagtgagag cagtggccgc 1741 aagcgcaggg aggatgccac agccccacag cacccaggct ccatggggaa gtgctcccca 1801 cacgtgctcg cagcctggcg gggcaggagg cctggccttg tcaggaccca ggccgcctgc 1861 cataccgctg gggaacagag cgggcctctt ccctcagttt ttcggtggga cagccccagg 1921 gccctaacgg gggtgcggca ggagcaggaa cagagactct ggaagccccc cacctttctc 1981 gctggaatca atttcccaga agggagttgc tcactcagga ctttgatgca tttccacact 2041 gtcagagctg ttggcctcgc ctgggcccag gctctgggaa ggggtgccct ctggatcctg 2101 ctgtggcctc acttccctgg gaactcatcc tgtgtgggga ggcagctcca acagcttgac 2161 cagacctaga cctgggccaa aagggcagcc aggggctgct catcacccag tcctggccat 2221 tttcttgcct gaggctcaag aggcccaggg agcaatggga gggggctcca tggaggaggt 2281 gtcccaagct ttgaataccc ccagagacct tttctctccc ataccatcac tgagtggctt 2341 gtgattctgg gatggaccct cgcagcaggt gcaagagaca gagcccccaa gcctctgccc 2401 caaggggccc acaaagggga gaagggccag ccctacatct tcagctccca tagcgctggc 2461 tcaggaagaa accccaagca gcattcagca caccccaagg gacaacccca tcatatgaca 2521 tgccaccctc tccatgccca acctaagatt gtgtgggttt tttaattaaa aatgttaaaa 2581 gtttt // LOCUS HUMBHEXB 1635 bp mRNA PRI 19-JUN-1995 DEFINITION Homo sapiens beta-hexosaminidase beta chain mRNA, complete cds. ACCESSION M19735 NID g867690 KEYWORDS beta-hexosaminidase; beta-hexosaminidase beta-subunit. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1635) AUTHORS Proia,R.L. TITLE Gene encoding the human beta-hexosaminidase beta chain: extensive homology of intron placement in the alpha- and beta-chain genes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85 (6), 1883-1887 (1988) MEDLINE 88158097 FEATURES Location/Qualifiers source 1..1635 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1..1635 /EC_number="3.2.1.52" /codon_start=1 /product="beta-hexosaminidase beta-subunit" /db_xref="PID:g867691" /translation="MLLALLLATLLAAMLALLTQVALVVQVAEAARAPSVSAKPGPAL WPLPLSVKMTPNLLHLAPENFYISHSPNSTAGPSCTLLEEAFRRYHGYIFGFYKWHHE PAEFQAKTQVQQLLVSITLQSECDAFPNISSDESYTLLVKEPVAVLKANRVWGALRGL ETFSQLVYQDSYGTFTINESTIIDSPRFSHRGILIDTSRHYLPVKIILKTLDAMAFNK FNVLHWHIVDDQSFPYQSITFPELSNKGSYSLSHVYTPNDVRMVIEYARLRGIRVLPE FDTPGHTLSWGKGQKDLLTPCYSRQNKLDSFGPINPTLNTTYSFLTTFFKEISEVFPD QFIHLGGDEVEFKCWESNPKIQDFMRQKGFGTDFKKLESFYIQKVLDIIATINKGSIV WQEVFDDKAKLAPGTIVEVWKDSAYPEELSRVTASGFPVILSAPWYLDLISYGQDWRK YYKVEPLDFGGTQKQKQLFIGGEACLWGEYVDATNLTPRLWPRASAVGERLWSSKDVR DMDDAYDRLTRHRCRMVERGIAAQPLYAGYCNHENM" BASE COUNT 444 a 356 c 381 g 454 t ORIGIN 1 atgctgctgg cgctgctgtt ggcgacactg ctggcggcga tgttggcgct gctgactcag 61 gtggcgctgg tggtgcaggt ggcggaggcg gctcgggccc cgagcgtctc ggccaagccg 121 gggccggcgc tgtggcccct gccgctctcg gtgaagatga ccccgaacct gctgcatctc 181 gccccggaga acttctacat cagccacagc cccaattcca cggcgggccc ctcctgcacc 241 ctgctggagg aagcgtttcg acgatatcat ggctatattt ttggtttcta caagtggcat 301 catgaacctg ctgaattcca ggctaaaacc caggttcagc aacttcttgt ctcaatcacc 361 cttcagtcag agtgtgatgc tttccccaac atatcttcag atgagtctta tactttactt 421 gtgaaagaac cagtggctgt ccttaaggcc aacagagttt ggggagcatt acgaggttta 481 gagaccttta gccagttagt ttatcaagat tcttatggaa ctttcaccat caatgaatcc 541 accattattg attctccaag gttttctcac agaggaattt tgattgatac atccagacat 601 tatctgccag ttaagattat tcttaaaact ctggatgcca tggcttttaa taagtttaat 661 gttcttcact ggcacatagt tgatgaccag tctttcccat atcagagcat cacttttcct 721 gagttaagca ataaaggaag ctattctttg tctcatgttt atacaccaaa tgatgtccgt 781 atggtgattg aatatgccag attacgagga attcgagtcc tgccagaatt tgatacccct 841 gggcatacac tatcttgggg aaaaggtcag aaagacctcc tgactccatg ttacagtaga 901 caaaacaagt tggactcttt tggacctata aaccctactc tgaatacaac atacagcttc 961 cttactacat ttttcaaaga aattagtgag gtgtttccag atcaattcat tcatttggga 1021 ggagatgaag tggaatttaa atgttgggaa tcaaatccaa aaattcaaga tttcatgagg 1081 caaaaaggct ttggcacaga ttttaagaaa ctagaatctt tctacattca aaaggttttg 1141 gatattattg caaccataaa caagggatcc attgtctggc aggaggtttt tgatgataaa 1201 gcaaagcttg cgccgggcac aatagttgaa gtatggaaag acagcgcata tcctgaggaa 1261 ctcagtagag tcacagcatc tggcttccct gtaatccttt ctgctccttg gtacttagat 1321 ttgattagct atggacaaga ttggaggaaa tactataaag tggaacctct tgattttggc 1381 ggtactcaga aacagaaaca acttttcatt ggtggagaag cttgtctatg gggagaatat 1441 gtggatgcaa ctaacctcac tccaagatta tggcctcggg caagtgctgt tggtgagaga 1501 ctctggagtt ccaaagatgt cagagatatg gatgacgcct atgacagact gacaaggcac 1561 cgctgcagga tggtcgaacg tggaatagct gcacaacctc tttatgctgg atattgtaac 1621 catgagaaca tgtaa // LOCUS HUMBINDA 1363 bp mRNA PRI 13-JUN-1995 DEFINITION Homo sapiens DNA binding protein for surfactant protein B mRNA, complete cds. ACCESSION L10403 NID g860726 KEYWORDS DNA-binding protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1363) AUTHORS Luzi,P. and Strayer,D.S. TITLE DNA binding proteins that amplify surfactant protein B gene expression: isolation and characterization JOURNAL Biochem. Biophys. Res. Commun. 208 (1), 153-160 (1995) MEDLINE 95194400 FEATURES Location/Qualifiers source 1..1363 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 586..906 /codon_start=1 /product="DNA-binding protein" /db_xref="PID:g860727" /translation="MPPCSCARSLCALQVLLLTVLGSSTNGQTKRNIGKRKCRDLFLA PVAASAIPVSGRKTGLVLLGLESAIITTNPSGGVTERGHFTFAGHSAGHFTKALQIIF TMAL" BASE COUNT 389 a 312 c 291 g 371 t ORIGIN 1 gaattcagta agctaaaatc caaagtaagc ctctgatgat aattggctct tattttatcc 61 atacggtccc aaagaacatc tgctgtcttt ggcgcagggc catatatttg tggtttcagg 121 tgcccctaaa gtgtctatag gagcctataa acaaagccta taaactgtgt tgtaggaaag 181 acagcacata ttgttacagg ctcatacaaa gaaaatatat gtagtgtttc agtctagttc 241 ttaccttcct aagtagagtc cttacacatg tgtaagggag ataggtattg agaaagggag 301 agtgggaatg tgaagtgatg cataacatgc aacttagtag gaattttgac ctgtgttggg 361 cacagcttga caagcttgtg tgtgtgtatc accacatacc ctcacttccc ccttccctac 421 ctctttctcc ttactgactt caagggagag catataaatg acatcaaggg gtatgaaaag 481 ccacttaact gcagacttgt aggcagcaac tcaccctcaa gaggaagtct tcaggctcta 541 gaaacatctt taacttcggc ttctgcacca taagcctcag actcaatgcc accctgcagc 601 tgtgccagat cactttgtgc cctgcaggtg ctgctgttga ctgttctggg ttcctccacc 661 aatggacaaa ctaagagaaa catagggaaa aggaaatgta gagatctgtt ccttgcacct 721 gttgctgctt ctgctatacc tgtatctggg agaaagactg gcttggtgct cctggggctg 781 gagagtgcca ttataacaac aaatccaagt ggaggggtca cagagagggg gcacttcaca 841 tttgctgggc attctgctgg gcactttact aaagctttac agatcatatt cacaatggct 901 ttatgagaga ggtacaatta ccttcaattt acaattgaga gaactgagaa aaatattcac 961 gaccactaat agatcacttt ttaccccagc tgtaagtgta gacagtgact tgtacactga 1021 actgcgctgc gtgtatgtga agtcaacctt tgtacttcat cccagaaaca tccacaattt 1081 ggagttggtc tcagcaggac cccattgcag caaagacgaa gtaatgtaag ccactgcttc 1141 tgtcgtatcg cctcatcagg gaagccctct acctccatcc ccatctgcat tcatttcctc 1201 cagtctcaca gatcctttct gatattcagg ccaggacacc cacagataat tctattctct 1261 cttgcagagc cactctgtaa gatgggagaa aaaatctgcc tggacccaga tgctcccaga 1321 atcaataaaa ttgtacagaa aatgttgaaa gttgatgaat tcc // LOCUS HUMBINDC 1935 bp mRNA PRI 13-JUN-1995 DEFINITION Homo sapiens DNA binding protein for surfactant protein B mRNA, complete cds. ACCESSION L10405 NID g860729 KEYWORDS DNA-binding protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1935) AUTHORS Luzi,P. and Strayer,D.S. TITLE DNA binding proteins that amplify surfactant protein B gene expression: isolation and characterization JOURNAL Biochem. Biophys. Res. Commun. 208 (1), 153-160 (1995) MEDLINE 95194400 FEATURES Location/Qualifiers source 1..1935 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1577..1930 /codon_start=1 /product="DNA-binding protein" /db_xref="PID:g860730" /translation="MSSAFLPPDSQSTKSAAMVLVCFAKYSGKYGEAVSSAPSTIKMI LTGKVWSFSFKYSATALDAQLNCLYRRQHRGHIAYHPVLTMSKAPSSTGQGVRRVGHH SGYKSQGSWHRVGHW" BASE COUNT 496 a 404 c 457 g 578 t ORIGIN 1 gaattctttt gttaattgtt cctttggttg ggcaaagata tctaacagtg agccacgttc 61 aatgatttca ccattttgca tgaccgcgac tttattggcg atttctttga cggcctgcat 121 ttcgtgggtg atgagcacaa ttgtcaaacc aaattcttgg ttgagctttt tgagcaaggc 181 caaaatttga ttcgttgtct tgggatccag tgccgaagtg gcttcatcgg aaattaaaat 241 ttcaggatca ttggccaaag cgcgggcgat ggcacacgtt gcttttgacc accagacaat 301 tgtgatggat attgttgtgc gcgatcagat aagtcaacca attcaaggag ctcagccact 361 ttttagcttt ttctttttca gatagtgggc tgtgttggag ggcaaacgcc acgttttctg 421 ttaccgtgcg ttcgtttagc aagttaaaat gttggaaaat catgccaatt ttgcgtcgcc 481 catcacggag cttttggcta gaaatacggg taattttacc ggtcttatca tcactttgaa 541 agaaaacttc gccgttgacg accacagaac cactggttgg ttcttgtagg aggttaatga 601 cacgtactaa tgttgattta ccagcacctg aataaccaac aatgccgtaa acatcccccc 661 gttcaacatg gacagagacg ttgtttaccg cagtaatcgt ccgttttttt cgatggaaag 721 cgacagtaat attgtctaat gcaataatgg gttgactcat tcatttactc ctttagttga 781 gatgcttgat aactcttaat taaagctttg acggcatcaa tatgttggtc ataatcaacc 841 aaacgaatat gctcgtttgg tgcatgatcg aggttattgg cataaccaac gccaaggctg 901 gcaattgggg cctgcaaagc cgcataaatc gttgccattg gacccgtgcc aggtgatgtc 961 ggcatgacca cgggtgcgac ttgataataa gctttggcca cagagatgac ccgttgaatt 1021 tctgggtcag acatatcgct gcgataacca gctagcccca gtgttttggt aacggtgata 1081 tccgtaaaac cttgtgcagt caaatggtaa gcaatgcgtt gtaacgtgat gtcaggtgac 1141 atattgggga ctaagcgctt ctaatttagc cgtggcggtc gcgggtaaga cagttttaac 1201 acctttatca ttataccctg agatgatgcc ttggacatta agggttggtt ggaaatacaa 1261 ggtttctttt aaatcttggc cgttacgatc tgtataaagt ggcactttta ggccgtgttg 1321 tgccactaaa tcttctcgtg ttagcggtaa ctggccaatt aaagcttttt cacgtgcatt 1381 tggtgcgatg acttcatcat aaaaatacgg tacggcaata ttgccttgtt ggtcaaataa 1441 tgcggcaatg gcttgtgata agcggatagg agccgagtca acaacagccg acaatgatga 1501 atgtaggtca ctatctgctg tgttagcggt taattcaaag gtcacaatcc ctttattccc 1561 gccataaatt tcgacaatgt cgtccgcatt tttaccacca gattcccaaa gcactaaatc 1621 agccgcgatg gtgttagtat gttttgccaa atattctggt aagtatggtg aagcagtttc 1681 ttcggcacct tcgacgataa agatgatatt aaccggtaag gtttggtcat tttcttttaa 1741 atactcggca actgcgctag acgcgcagtt aaattgcctt tatcgtcgtc aacaccgcgg 1801 ccatatagct taccatcccg tgctgacaat gtccaaggcg ccgtcgtcca caggtcaagg 1861 ggttcggcgg gttggacatc atagtggtta taaatcacaa gggtcttggc atcgggttgg 1921 tcactggtga attcc // LOCUS HUMBIXBR 824 bp mRNA PRI 03-OCT-1996 DEFINITION Human mRNA for biliverdin-IXbeta reductase I. ACCESSION D32143 NID g699602 KEYWORDS biliverdin-IXbeta reductase I. SOURCE Homo sapiens (library: liver rambda gt11 and gt10) cDNA to mRNA, clone pET-3c-BvR. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 824) AUTHORS Komuro,A., Tobe,T., Hashimoto,K., Nakano,Y., Yamaguchi,T., Nakajima,H. and Tomita,M. TITLE Molecular cloning and expression of human liver biliverdin-IX beta reductase JOURNAL Biol. Pharm. Bull. 19 (6), 796-804 (1996) MEDLINE 96392687 REFERENCE 2 (bases 1 to 824) AUTHORS Tobe,T. TITLE Direct Submission JOURNAL Submitted (13-JUL-1994) to the DDBJ/EMBL/GenBank databases. Takashi Tobe, Showa University, School of Pharmaceutical Sciences, Department of Physiological Chemistry; 1-5-8, Hatanodai, Shinagawa-ku, Tokyo 142, Japan (Tel:03-3784-8215, Fax:03-3784-8216) FEATURES Location/Qualifiers source 1..824 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="liver ramda gt11 and gt10" CDS 110..730 /codon_start=1 /product="biliverdin-IXbeta reductase I" /db_xref="PID:d1007449" /db_xref="PID:g1480221" /translation="MAVKKIAIFGATGQTGLTTLAQAVQAGYEVTVLVRDSSRLPSEG PRPAHVVVGDVLQAADVDKTVAGQDAVIVLLGTRNDLSPTTVMSEGARNIVAAMKAHG VDKVVACTSAFLLWDPTKVPPRLQAVTDDHIRMHKVLRESGLKYVAVMPPHIGDQPLT GAYTVTLDGRGPSRVISKHDLGHFMLRCLTTDEYDGHSTYPSHQYQ" BASE COUNT 176 a 255 c 252 g 141 t ORIGIN 1 gaattcgaat gccaccctcc cagatggggg cagagagcac cgcccagcag ccagtgggtt 61 cccgcgcgtg ccgagactct gaggccttgc acccccacga tcccgtacga tggccgtcaa 121 gaagatcgcg atcttcggcg ccactggcca gaccgggctc accaccctgg cgcaggcggt 181 gcaagcaggt tacgaagtga cagtgctggt gcgggactcc tccaggctgc catcagaggg 241 gccccggccg gcccacgtgg tagtgggaga tgttctgcag gcagccgatg tggacaagac 301 cgtggctggc caggacgctg tcatcgtgct gctgggcacc cgcaatgacc tcagtcccac 361 gacagttatg tccgagggcg cccggaacat tgtggcagcc atgaaggctc atggtgtgga 421 caaggtcgtg gcctgcacct cggctttcct gctctgggac cctaccaagg tgcccccacg 481 actgcaggct gtgactgatg accacatccg gatgcacaag gtgctgcggg aatcaggcct 541 gaagtacgtg gctgtgatgc cgccacacat aggagaccag ccactaactg gggcgtacac 601 agtgaccctg gatggacgag ggccctcaag ggtcatctcc aaacatgacc tgggccattt 661 catgctgcgc tgcctcacca ccgatgagta cgacggacac agcacctacc cctcccacca 721 gtaccagtag cactctgtcc ccatctggga gggtggcatt ctggacatga ggagcaaagg 781 aagggggcaa taaatgttga gccaagagct tcaaattact ctag // LOCUS HUMBMI1X 3203 bp mRNA PRI 19-JUL-1994 DEFINITION Human prot-oncogene (BMI-1) mRNA, complete cds. ACCESSION L13689 NID g291872 KEYWORDS prot-oncogene. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3203) AUTHORS Alkema,M.J., Wiegant,J., Raap,A.K., Berns,A. and van Lohuizen,M. TITLE Characterization and chromosomal localization of the human proto-oncogene BMI-1 JOURNAL Hum. Mol. Genet. 2 (10), 1597-1603 (1993) MEDLINE 94093545 FEATURES Location/Qualifiers source 1..3203 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="k562" /germline mRNA 1..3203 /gene="BMI-1" gene 1..3203 /gene="BMI-1" 5'UTR 1..479 /gene="BMI-1" /note="putative" CDS 480..1460 /gene="BMI-1" /note="putative" /codon_start=1 /function="prot-oncogene" /db_xref="PID:g291873" /translation="MHRTTRIKITELNPHLMCVLCGGYFIDATTIIECLHSFCKTCIV RYLETSKYCPICDVQVHKTRPLLNIRSDKTLQDIVYKLVPGLFKNEMKRRRDFYAAHP SADAANGSNEDRGEVADEDKRIITDDEIISLSIEFFDQNRLDRKVNKDKEKSKEEVND KRYLRCPAAMTVMHLRKFLRSKMDIPNTFQIDVMYEEEPLKDYYTLMDIAYIYTWRRN GPLPLKYRVRPTCKRMKISHQRDGLTNAGELESDSGSDKANSPAGGVPSTSSCLPSPS TPVQSPHPQFPHISSTMNGTSNSPSGNHQSSFANRPRKSSVNGSSATSSG" 3'UTR 1461..3203 /gene="BMI-1" /evidence=experimental polyA_site 3203 /gene="BMI-1" BASE COUNT 959 a 591 c 678 g 975 t ORIGIN 1 gagaggcaga gatcggggcg agacaatggg gatgtgggcg cgggagcccc gttccggctt 61 agcagcacct cccagccccg cagaataaaa ccgatcgcgc cccctccgcg cgcgccctcc 121 cccgagtgcg gagcgggagg aggcggcggc ggccgaggag gaggaggagg aggccccgga 181 ggaggaggcg ttggaggtcg aggcggaggc ggaggaggag gaggccgagg cgccggagga 241 ggccgaggcg ccggagcagg aggaggccgg ccggaggcgg catgagacga gcgtggcggc 301 cgcggctgct cggggccgcg ctggttgccc attgacagcg gcgtctgcag ctcgcttcaa 361 gatggccgct tggctcgcat tcattttctg ctgaacgact tttaactttc attgtctttt 421 ccgcccgctt cgatcgcctc gcgccggctg ctctttccgg gattttttat caagcagaaa 481 tgcatcgaac aacgagaatc aagatcactg agctaaatcc ccacctgatg tgtgtgcttt 541 gtggagggta cttcattgat gccacaacca taatagaatg tctacattcc ttctgtaaaa 601 cttgtattgt tcgttacctg gagaccagca agtattgtcc tatttgtgat gtccaagttc 661 acaagaccag accactactg aatataaggt cagataaaac tctccaagat attgtataca 721 aattagttcc agggcttttc aaaaatgaaa tgaagagaag aagggatttt tatgcagctc 781 atccttctgc tgatgctgcc aatggctcta atgaagatag aggagaggtt gcagatgaag 841 ataagagaat tataactgat gatgagataa taagcttatc cattgaattc tttgaccaga 901 acagattgga tcggaaagta aacaaagaca aagagaaatc taaggaggag gtgaatgata 961 aaagatactt acgatgccca gcagcaatga ctgtgatgca cttaagaaag tttctcagaa 1021 gtaaaatgga catacctaat actttccaga ttgatgtcat gtatgaggag gaacctttaa 1081 aggattatta tacactaatg gatattgcct acatttatac ctggagaagg aatggtccac 1141 ttccattgaa atacagagtt cgacctactt gtaaaagaat gaagatcagt caccagagag 1201 atggactgac aaatgctgga gaactggaaa gtgactctgg gagtgacaag gccaacagcc 1261 cagcaggagg agttccctcc acctcttctt gtttgcctag ccccagtact ccagtgcagt 1321 ctcctcatcc acagtttcct cacatttcca gtactatgaa tggaaccagc aacagcccca 1381 gcggtaacca ccaatcttct tttgccaata gacctcgaaa atcatcagta aatgggtcat 1441 cagcaacttc ttctggttga tacctgagac tgttaaggaa aaaaatttta aacccctgat 1501 ttatatagat atcttcagcc attacgactt tctagagcta atacatgtga ctatcgtcca 1561 atttgctttc ttttgtagtg acattaaatt tggctataaa agatggacta catgtgatac 1621 tcctgtccgt cttggttcaa aagaaagatt gttgttataa agaattggtt tcttggaaag 1681 caggcaagac tttttctctg tgttaggaaa gatgggaaat ggtttctgta accattgttt 1741 ggatttggaa gtactctgca gtggacataa gcattgggcc atagtttgtt aatctcaact 1801 aacgcctaca ttacattctc cttgatcgtt cttgttatta cgctgttttg tgaacctgta 1861 gaaaaacaag tgctttttat cttgaaattc aaccaacgga aagaatatgc atagaataat 1921 gcattctatg atgccatgtc actgtgaata acgatttctt gcagctattt agccattttg 1981 attgctgttt gatttatact tctctgttgc tacgcaaaac cgatcaaaga aaagtgaact 2041 tcagttttac aatctgtatg cctaaaagcg ggtactaccg tttattttac tgacttgttt 2101 aaatgattcg cttttgtaag aatcagatgg cattatgctt gttgtacaat gccatattgg 2161 tatatgacat aacaggaaac agtattgtat gatatatcta taaatgctat aaagaaatat 2221 tgtgtttcat gcattcagaa atgattgtta aaattctccc aactggttcg acctttgcag 2281 atacccataa cctatgttga gccttgctta ccagcaaaga atatttttaa tgtggatatc 2341 taattctaaa gtctgttcca ttagaagcaa ttggcacatc tttctatact ttatatactt 2401 ttctccagta atacatgttt actttaaaaa ttgttgcagt gaagaaaaac ctttaactga 2461 gaaatatgga aaccgtctta attttccatt ggctatgatg gaattaatat tgtattttaa 2521 aaatgcatat tgatcactat aattctaaaa caatttttta aataaaccag caggttgcta 2581 aaagaaggca ttttatctaa agttatttta ataggtggta tagcagtaat tttaaattta 2641 agagttgctt ttacagttaa caatggaata tgccttctct gctatgtctg aaaatagaag 2701 ctatttatta tgagcttcta caggtatttt taaatagagc aagcatgttg aatttaaaat 2761 atgaataacc ccacccaaca attttcagtt tattttttgc tttggtcgaa cttggtgtgt 2821 gttcatcagt tatttgtgag ggtgtttatt ctatatgaat attgtttcat gtttgtaggg 2881 aaattgtagc taaacatttc attgtcccca gtctgcaaaa gaagcacaat tctattgctt 2941 tgtcttgctt atagtcatta aatcattact tttacatata ttgctgttac ttctgctttc 3001 tttaaaaata tagtaaagga tgttttatga agtcacaaga tacatatatt tttattttga 3061 cctaaatttg tacagtccca ttgtaagtgt tgtttctaat tatagatgta aaatgaaatt 3121 tcatttgtaa ttggaaaaaa tccaataaaa aggatattca tttagaaaat agctaagatc 3181 tttaataaaa atttgatatg aaa // LOCUS HUMBMP1A 2487 bp mRNA PRI 31-OCT-1994 DEFINITION Human bone morphogenetic protein 1 (BMP-1) mRNA. ACCESSION M22488 NID g179499 KEYWORDS bone morphogenetic protein. SOURCE Human osteosarcoma cell line U-2 OS, cDNA to mRNA, clone lambda-U2OS1-1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Wozney,J.M., Rosen,V., Celeste,A.J., Mitsock,L.M., Whitters,M.J., Kriz,R.W., Hewick,R.M. and Wang,E.A. TITLE Novel regulators of bone formation: molecular clones and activities JOURNAL Science 242 (4885), 1528-1534 (1988) MEDLINE 89072730 REFERENCE 2 (bases 1 to 2487) AUTHORS Wozney,J.M., Rosen,V., Celeste,A.J., Mitsock,L.M., Whitters,M.J., Kriz,R.W., Hewick,R.M. and Wang,E.A. JOURNAL Unpublished (1989) COMMENT [1] sites. Draft entry and computer readable copy of sequence [1] kindly submitted by R.W. Kriz 10-FEB-1989. FEATURES Location/Qualifiers source 1..2487 /organism="Homo sapiens" /db_xref="taxon:9606" /map="8" gene 30..2222 /gene="BMP1" CDS 30..2222 /gene="BMP1" /note="bone morphogenetic protein 1" /codon_start=1 /db_xref="GDB:G00-125-203" /db_xref="PID:g179500" /translation="MPGVARLPLLLGLLLLPRPGRPLDLADYTYDLAEEDDSEPLNYK DPCKAAAFLGDIALDEEDLRAFQVQQAVDLRRHTARKSSIKAAVPGNTSTPSCQSTNG QPQRGACGRWRGRSRSRRAATSRPERVWPDGVIPFVIGGNFTGSQRAVFRQAMRHWEK HTCVTFLERTDEDSYIVFTYRPCGCCSYVGRRGGGPQAISIGKNCDKFGIVVHELGHV VGFWHEHTRPDRDRHVSIVRENIQPGQEYNFLKMEPQEVESLGETYDFDSIMHYARNT FSRGIFLDTIVPKYEVNGVKPPIGQRTRLSKGDIAQARKLYKCPACGETLQDSTGNFS SPEYPNGYSAHMHCVWRISVTPGEKIILNFTSLDLYRSRLCWYDYVEVRDGFWRKAPL RGRFCGSKLPEPIVSTDSRLWVEFRSSSNWVGKGFFAVYEAICGGDVKKDYGHIQSPN YPDDYRPSKVCIWRIQVSEGFHVGLTFQSFEIERHDSCAYDYLEVRDGHSESSTLIGR YCGYEKPDDIKSTSSRLWLKFVSDGSINKAGFAVNFFKEVDECSRPNRGGCEQRCLNT LGSYKCSCDPGYELAPDKRRCEAACGGFLTKLNGSITSPGWPKEYPPNKNCIWQLVAP TQYRISLQFDFFETEGNDVCKYDFVEVRSGLTADSKLHGKFCGSEKPEVITSQYNNMR VEFKSDNTVSKKGFKAHFFSEKRPALQPPRGRPHQLKFRVQKRNRTPQ" BASE COUNT 503 a 804 c 707 g 473 t ORIGIN 1 gccgcttccc tcgccgccgc cccgccagca tgcccggcgt ggcccgcctg ccgctgctgc 61 tcgggctgct gctgctcccg cgtcccggcc ggccgctgga cttggccgac tacacctatg 121 acctggcgga ggaggacgac tcggagcccc tcaactacaa agacccctgc aaggcggctg 181 cctttcttgg ggacattgcc ctggacgaag aggacctgag ggccttccag gtacagcagg 241 ctgtggatct cagacggcac acagctcgta agtcctccat caaagctgca gttccaggaa 301 acacttctac ccccagctgc cagagcacca acgggcagcc tcagagggga gcctgtggga 361 gatggagagg tagatcccgt agccggcggg cggcgacgtc ccgaccagag cgtgtgtggc 421 ccgatggggt catccccttt gtcattgggg gaaacttcac tggtagccag agggcagtct 481 tccggcaggc catgaggcac tgggagaagc acacctgtgt caccttcctg gagcgcactg 541 acgaggacag ctatattgtg ttcacctatc gaccttgcgg gtgctgctcc tacgtgggtc 601 gccgcggcgg gggcccccag gccatctcca tcggcaagaa ctgtgacaag ttcggcattg 661 tggtccacga gctgggccac gtcgtcggct tctggcacga acacactcgg ccagaccggg 721 accgccacgt ttccatcgtt cgtgagaaca tccagccagg gcaggagtat aacttcctga 781 agatggagcc tcaggaggtg gagtccctgg gggagaccta tgacttcgac agcatcatgc 841 attacgctcg gaacacattc tccaggggca tcttcctgga taccattgtc cccaagtatg 901 aggtgaacgg ggtgaaacct cccattggcc aaaggacacg gctcagcaag ggggacattg 961 cccaagcccg caagctttac aagtgcccag cctgtggaga gaccctgcaa gacagcacag 1021 gcaacttctc ctcccctgaa taccccaatg gctactctgc tcacatgcac tgcgtgtggc 1081 gcatctctgt cacacccggg gagaagatca tcctgaactt cacgtccctg gacctgtacc 1141 gcagccgcct gtgctggtac gactatgtgg aggtccgaga tggcttctgg aggaaggcgc 1201 ccctccgagg ccgcttctgc gggtccaaac tccctgagcc tatcgtctcc actgacagcc 1261 gcctctgggt tgaattccgc agcagcagca attgggttgg aaagggcttc tttgcagtct 1321 acgaagccat ctgcgggggt gatgtgaaaa aggactatgg ccacattcaa tcgcccaact 1381 acccagacga ttaccggccc agcaaagtct gcatctggcg gatccaggtg tctgagggct 1441 tccacgtggg cctcacattc cagtcctttg agattgagcg ccacgacagc tgtgcctacg 1501 actatctgga ggtgcgcgac gggcacagtg agagcagcac cctcatcggg cgctactgtg 1561 gctatgagaa gcctgatgac atcaagagca cgtccagccg cctctggctc aagttcgtct 1621 ctgacgggtc cattaacaaa gcgggctttg ccgtcaactt tttcaaagag gtggacgagt 1681 gctctcggcc caaccgcggg ggctgtgagc agcggtgcct caacaccctg ggcagctaca 1741 agtgcagctg tgaccccggg tacgagctgg ccccagacaa gcgccgctgt gaggctgctt 1801 gtggcggatt cctcaccaag ctcaacggct ccatcaccag cccgggctgg cccaaggagt 1861 acccccccaa caagaactgc atctggcagc tggtggcccc cacccagtac cgcatctccc 1921 tgcagtttga cttctttgag acagagggca atgatgtgtg caagtacgac ttcgtggagg 1981 tgcgcagtgg actcacagct gactccaagc tgcatggcaa gttctgtggt tctgagaagc 2041 ccgaggtcat cacctcccag tacaacaaca tgcgcgtgga gttcaagtcc gacaacaccg 2101 tgtccaaaaa gggcttcaag gcccacttct tctcagaaaa gaggccagct ctgcagcccc 2161 ctcggggacg cccccaccag ctcaaattcc gagtgcagaa aagaaaccgg accccccagt 2221 gaggcctgcc aggcctcccg gaccccttgt tactcaggaa cctcaccttg gacggaatgg 2281 gatgggggct tcggtgccca ccaacccccc acctccactc tgccattccg gcccacctcc 2341 ctctggccgg acagaactgg tgctctcttc tccccactgt gcccgtccgc ggaccgggga 2401 cccttccccg tgccctaccc cctcccattt tgatggtgtc tgtgacattt cctgttgtga 2461 agtaaaagag ggacccctgc gtcctgc // LOCUS HUMBMP2A 1547 bp mRNA PRI 31-OCT-1994 DEFINITION Human bone morphogenetic protein 2A (BMP-2A) mRNA. ACCESSION M22489 NID g179501 KEYWORDS bone morphogenetic protein. SOURCE Human osteosarcoma cell line U-2 OS, cDNA to mRNA, clone hHBMP-2A. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Wozney,J.M., Rosen,V., Celeste,A.J., Mitsock,L.M., Whitters,M.J., Kriz,R.W., Hewick,R.M. and Wang,E.A. TITLE Novel regulators of bone formation: molecular clones and activities JOURNAL Science 242 (4885), 1528-1534 (1988) MEDLINE 89072730 REFERENCE 2 (bases 1 to 1547) AUTHORS Wozney,J.M., Rosen,V., Celeste,A.J., Mitsock,L.M., Whitters,M.J., Kriz,R.W., Hewick,R.M. and Wang,E.A. JOURNAL Unpublished (1989) COMMENT [1] sites. Draft entry and computer readable copy of sequence [1] kindly submitted by R.W. Kriz 10-FEB-1989. FEATURES Location/Qualifiers source 1..1547 /organism="Homo sapiens" /db_xref="taxon:9606" /map="20" gene 324..1514 /gene="BMP2" CDS 324..1514 /gene="BMP2" /note="bone morphogenetic protein 2A" /codon_start=1 /db_xref="GDB:G00-125-204" /db_xref="PID:g179502" /translation="MVAGTRCLLALLLPQVLLGGAAGLVPELGRRKFAAASSGRPSSQ PSDEVLSEFELRLLSMFGLKQRPTPSRDAVVPPYMLDLYRRHSGQPGSPAPDHRLERA ASRANTVRSFHHEESLEELPETSGKTTRRFFFNLSSIPTEEFITSAELQVFREQMQDA LGNNSSFHHRINIYEIIKPATANSKFPVTRLLDTRLVNQNASRWESFDVTPAVMRWTA QGHANHGFVVEVAHLEEKQGVSKRHVRISRSLHQDEHSWSQIRPLLVTFGHDGKGHPL HKREKRQAKHKQRKRLKSSCKRHPLYVDFSDVGWNDWIVAPPGYHAFYCHGECPFPLA DHLNSTNHAIVQTLVNSVNSKIPKACCVPTELSAISMLYLDENEKVVLKNYQDMVVEG CGCR" BASE COUNT 377 a 423 c 410 g 337 t ORIGIN 1 ggggacttct tgaacttgca gggagaataa cttgcgcacc ccactttgcg ccggtgcctt 61 tgccccagcg gagcctgctt cgccatctcc gagccccacc gcccctccac tcctcggcct 121 tgcccgacac tgagacgctg ttcccagcgt gaaaagagag actgcgcggc cggcacccgg 181 gagaaggagg aggcaaagaa aaggaacgga cattcggtcc ttgcgccagg tcctttgacc 241 agagtttttc catgtggacg ctctttcaat ggacgtgtcc ccgcgtgctt cttagacgga 301 ctgcggtctc ctaaaggtcg accatggtgg ccgggacccg ctgtcttcta gcgttgctgc 361 ttccccaggt cctcctgggc ggcgcggctg gcctcgttcc ggagctgggc cgcaggaagt 421 tcgcggcggc gtcgtcgggc cgcccctcat cccagccctc tgacgaggtc ctgagcgagt 481 tcgagttgcg gctgctcagc atgttcggcc tgaaacagag acccaccccc agcagggacg 541 ccgtggtgcc cccctacatg ctagacctgt atcgcaggca ctcaggtcag ccgggctcac 601 ccgccccaga ccaccggttg gagagggcag ccagccgagc caacactgtg cgcagcttcc 661 accatgaaga atctttggaa gaactaccag aaacgagtgg gaaaacaacc cggagattct 721 tctttaattt aagttctatc cccacggagg agtttatcac ctcagcagag cttcaggttt 781 tccgagaaca gatgcaagat gctttaggaa acaatagcag tttccatcac cgaattaata 841 tttatgaaat cataaaacct gcaacagcca actcgaaatt ccccgtgacc agacttttgg 901 acaccaggtt ggtgaatcag aatgcaagca ggtgggaaag ttttgatgtc acccccgctg 961 tgatgcggtg gactgcacag ggacacgcca accatggatt cgtggtggaa gtggcccact 1021 tggaggagaa acaaggtgtc tccaagagac atgttaggat aagcaggtct ttgcaccaag 1081 atgaacacag ctggtcacag ataaggccat tgctagtaac ttttggccat gatggaaaag 1141 ggcatcctct ccacaaaaga gaaaaacgtc aagccaaaca caaacagcgg aaacgcctta 1201 agtccagctg taagagacac cctttgtacg tggacttcag tgacgtgggg tggaatgact 1261 ggattgtggc tcccccgggg tatcacgcct tttactgcca cggagaatgc ccttttcctc 1321 tggctgatca tctgaactcc actaatcatg ccattgttca gacgttggtc aactctgtta 1381 actctaagat tcctaaggca tgctgtgtcc cgacagaact cagtgctatc tcgatgctgt 1441 accttgacga gaatgaaaag gttgtattaa agaactatca ggacatggtt gtggagggtt 1501 gtgggtgtcg ctagtacagc aaaattaaat acataaatat atatata // LOCUS HUMBMP2B 1751 bp mRNA PRI 31-OCT-1994 DEFINITION Human bone morphogenetic protein-2B (BMP-2B) mRNA. ACCESSION M22490 NID g179503 KEYWORDS bone morphogenetic protein. SOURCE Human osteosarcoma cell line U-2 OS, cDNA to mRNA, clone hBMP-2B. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Wozney,J.M., Rosen,V., Celeste,A.J., Mitsock,L.M., Whitters,M.J., Kriz,R.W., Hewick,R.M. and Wang,E.A. TITLE Novel regulators of bone formation: molecular clones and activities JOURNAL Science 242 (4885), 1528-1534 (1988) MEDLINE 89072730 REFERENCE 2 (bases 1 to 1751) AUTHORS Wozney,J.M., Rosen,V., Celeste,A.J., Mitsock,L.M., Whitters,M.J., Kriz,R.W., Hewick,R.M. and Wang,E.A. JOURNAL Unpublished (1989) COMMENT [1] sites. Draft entry and computer readable copy of sequence [1] kindly submitted by R.W. Kriz 10-FEB-1989. FEATURES Location/Qualifiers source 1..1751 /organism="Homo sapiens" /db_xref="taxon:9606" /map="20" gene 395..1621 /gene="BMP2" CDS 395..1621 /gene="BMP2" /note="bone morphogenetic protein 2B" /codon_start=1 /db_xref="GDB:G00-125-204" /db_xref="PID:g179504" /translation="MIPGNRMLMVVLLCQVLLGGASHASLIPETGKKKVAEIQGHAGG RRSGQSHELLRDFEATLLQMFGLRRRPQPSKSAVIPDYMRDLYRLQSGEEEEEQIHST GLEYPERPASRANTVRSFHHEEHLENIPGTSENSAFRFLFNLSSIPENEVISSAELRL FREQVDQGPDWERGFHRINIYEVMKPPAEVVPGHLITRLLDTRLVHHNVTRWETFDVS PAVLRWTREKQPNYGLAIEVTHLHQTRTHQGQHVRISRSLPQGSGNWAQLRPLLVTFG HDGRGHALTRRRRAKRSPKHHSQRARKKNKNCRRHSLYVDFSDVGWNDWIVAPPGYQA FYCHGDCPFPLADHLNSTNHAIVQTLVNSVNSSIPKACCVPTELSAISMLYLDEYDKV VLKNYQEMVVEGCGCR" BASE COUNT 394 a 510 c 490 g 357 t ORIGIN 1 ggcagaggag gagggaggga gggaaggagc gcggagcccg gcccggaagc taggtgagtg 61 tggcatccga gctgagggac gcgagcctga gacgccgctg ctgctccggc tgagtatcta 121 gcttgtctcc ccgatgggat tcccgtccaa gctatctcga gcctgcagcg ccacagtccc 181 cggccctcgc ccaggttcac tgcaaccgtt cagaggtccc caggagctgc tgctggcgag 241 cccgctactg cagggaccta tggagccatt ccgtagtgcc atcccgagca acgcactgct 301 gcagcttccc tgagcctttc cagcaagttt gttcaagatt ggctgtcaag aatcatggac 361 tgttattata tgccttgttt tctgtcaaga caccatgatt cctggtaacc gaatgctgat 421 ggtcgtttta ttatgccaag tcctgctagg aggcgcgagc catgctagtt tgatacctga 481 gacggggaag aaaaaagtcg ccgagattca gggccacgcg ggaggacgcc gctcagggca 541 gagccatgag ctcctgcggg acttcgaggc gacacttctg cagatgtttg ggctgcgccg 601 ccgcccgcag cctagcaaga gtgccgtcat tccggactac atgcgggatc tttaccggct 661 tcagtctggg gaggaggagg aagagcagat ccacagcact ggtcttgagt atcctgagcg 721 cccggccagc cgggccaaca ccgtgaggag cttccaccac gaagaacatc tggagaacat 781 cccagggacc agtgaaaact ctgcttttcg tttcctcttt aacctcagca gcatccctga 841 gaacgaggtg atctcctctg cagagcttcg gctcttccgg gagcaggtgg accagggccc 901 tgattgggaa aggggcttcc accgtataaa catttatgag gttatgaagc ccccagcaga 961 agtggtgcct gggcacctca tcacacgact actggacacg agactggtcc accacaatgt 1021 gacacggtgg gaaacttttg atgtgagccc tgcggtcctt cgctggaccc gggagaagca 1081 gccaaactat gggctagcca ttgaggtgac tcacctccat cagactcgga cccaccaggg 1141 ccagcatgtc aggattagcc gatcgttacc tcaagggagt gggaattggg cccagctccg 1201 gcccctcctg gtcacctttg gccatgatgg ccggggccat gccttgaccc gacgccggag 1261 ggccaagcgt agccctaagc atcactcaca gcgggccagg aagaagaata agaactgccg 1321 gcgccactcg ctctatgtgg acttcagcga tgtgggctgg aatgactgga ttgtggcccc 1381 accaggctac caggccttct actgccatgg ggactgcccc tttccactgg ctgaccacct 1441 caactcaacc aaccatgcca ttgtgcagac cctggtcaat tctgtcaatt ccagtatccc 1501 caaagcctgt tgtgtgccca ctgaactgag tgccatctcc atgctgtacc tggatgagta 1561 tgataaggtg gtactgaaaa attatcagga gatggtagta gagggatgtg ggtgccgctg 1621 agatcaggca gtccttgagg atagacagat atacacacca cacacacaca ccacatacac 1681 cacacacaca cgttcccatc cactcaccca cacactacac agactgcttc cttatagctg 1741 gacttttatt t // LOCUS HUMBMP3A 1774 bp mRNA PRI 31-OCT-1994 DEFINITION Human bone morphogenetic protein-3 (BMP-3) mRNA. ACCESSION M22491 NID g179505 KEYWORDS bone morphogenetic protein. SOURCE Human H128 (ATCC HTB120) cDNA to mRNA, clone hBMP-3. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Wozney,J.M., Rosen,V., Celeste,A.J., Mitsock,L.M., Whitters,M.J., Kriz,R.W., Hewick,R.M. and Wang,E.A. TITLE Novel regulators of bone formation: molecular clones and activities JOURNAL Science 242 (4885), 1528-1534 (1988) MEDLINE 89072730 REFERENCE 2 (bases 1 to 1774) AUTHORS Wozney,J.M., Rosen,V., Celeste,A.J., Mitsock,L.M., Whitters,M.J., Kriz,R.W., Hewick,R.M. and Wang,E.A. JOURNAL Unpublished (1989) COMMENT [1] sites. Draft entry and computer readable copy of sequence [1] kindly submitted by R.W. Kriz 10-FEB-1989. FEATURES Location/Qualifiers source 1..1774 /organism="Homo sapiens" /db_xref="taxon:9606" /map="20" gene 321..1739 /gene="BMP2" CDS 321..1739 /gene="BMP2" /note="bone morphogenetic protein 2A" /codon_start=1 /db_xref="GDB:G00-125-204" /db_xref="PID:g179506" /translation="MAGASRLLFLWLGCFCVSLAQGERPKPPFPELRKAVPGDRTAGG GPDSELQPQDKVSEHMLRLYDRYSTVQAARTPGSLEGGSQPWRPRLLREGNTVRSFRA AAAETLERKGLYIFNLTSLTKSENILSATLYFCIGELGNISLSCPVSGGCSHHAQRKH IQIDLSAWTLKFSRNQSQLLGHLSVDMAKSHRDIMSWLSKDITQFLRKAKENEEFLIG FNITSKGRQLPKRRLPFPEPYILVYANDAAISEPESVVSSLQGHRNFPTGTVPKWDSH IRAALSIERRKKRSTGVLLPLQNNELPGAEYQYKKDEVWEERKPYKTLQAQAPEKSKN KKKQRKGPHRKSQTLQFDEQTLKKARRKQWIEPRNCARRYLKVDFADIGWSEWIISPK SFDAYYCSGACQFPMPKSLKPSNHATIQSIVRAVGVVPGIPEPCCVPEKMSSLSILFF DENKNVVLKVYPNMTVESCACR" BASE COUNT 434 a 472 c 487 g 381 t ORIGIN 1 agatcttgaa aacacccggg ccacacacgc cgcgacctac agctctttct cagcgttgga 61 gtggagacgg cgcccgcagc gccctgcgcg ggtgaggtcc gcgcagctgc tggggaagag 121 cccacctgtc aggctgcgct gggtcagcgc agcaagtggg gctggccgct atctcgctgc 181 acccggccgc gtcccgggct ccgtgcgccc tcgccccagc tggtttggag ttcaaccctc 241 ggctccgccg ccggctcctt gcgccttcgg agtgtcccgc agcgacgccg ggagccgacg 301 cgccgcgcgg gtacctagcc atggctgggg cgagcaggct gctctttctg tggctgggct 361 gcttctgcgt gagcctggcg cagggagaga gaccgaagcc acctttcccg gagctccgca 421 aagctgtgcc aggtgaccgc acggcaggtg gtggcccgga ctccgagctg cagccgcaag 481 acaaggtctc tgaacacatg ctgcggctct atgacaggta cagcacggtc caggcggccc 541 ggacaccggg ctccctggag ggaggctcgc agccctggcg ccctcggctc ctgcgcgaag 601 gcaacacggt tcgcagcttt cgggcggcag cagcagaaac tcttgaaaga aaaggactgt 661 atatcttcaa tctgacatcg ctaaccaagt ctgaaaacat tttgtctgcc acactgtatt 721 tctgtattgg agagctagga aacatcagcc tgagttgtcc agtgtctgga ggatgctccc 781 atcatgctca gaggaaacac attcagattg atctttctgc atggaccctc aaattcagca 841 gaaaccaaag tcaactcctt ggccatctgt cagtggatat ggccaaatct catcgagata 901 ttatgtcctg gctgtctaaa gatatcactc aattcttgag gaaggccaaa gaaaatgaag 961 agttcctcat aggatttaac attacgtcca agggacgcca gctgccaaag aggaggttac 1021 cttttccaga gccttatatc ttggtatatg ccaatgatgc cgccatttct gagccagaaa 1081 gtgtggtatc aagcttacag ggacaccgga attttcccac tggaactgtt cccaaatggg 1141 atagccacat cagagctgcc ctttccattg agcggaggaa gaagcgctct actggggtct 1201 tgctgcctct gcagaacaac gagcttcctg gggcagaata ccagtataaa aaggatgagg 1261 tgtgggagga gagaaagcct tacaagaccc ttcaggctca ggcccctgaa aagagtaaga 1321 ataaaaagaa acagagaaag gggcctcatc ggaagagcca gacgctccaa tttgatgagc 1381 agaccctgaa aaaggcaagg agaaagcagt ggattgaacc tcggaattgc gccaggagat 1441 acctcaaggt agactttgca gatattggct ggagtgaatg gattatctcc cccaagtcct 1501 ttgatgccta ttattgctct ggagcatgcc agttccccat gccaaagtct ttgaagccat 1561 caaatcatgc taccatccag agtatagtga gagctgtggg ggtcgttcct gggattcctg 1621 agccttgctg tgtaccagaa aagatgtcct cactcagtat tttattcttt gatgaaaata 1681 agaatgtagt gcttaaagta taccctaaca tgacagtaga gtcttgcgct tgcagataac 1741 ctggcaaaga actcatttga atgcttaatt caat // LOCUS HUMBN51 1881 bp mRNA PRI 31-OCT-1994 DEFINITION Human BN51 mRNA, complete cds. ACCESSION M17754 NID g179512 KEYWORDS . SOURCE Human fibroblast cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1881) AUTHORS Ittmann,M., Greco,A. and Basilico,C. TITLE Isolation of the human gene that complements a temperature-sensitive cell cycle mutation in BHK cells JOURNAL Mol. Cell. Biol. 7 (10), 3386-3393 (1987) MEDLINE 88065472 COMMENT Draft entry and computer-readable copy of sequence in [1] kindly provided by M.Ittmann, 24-FEB-1988. FEATURES Location/Qualifiers source 1..1881 /organism="Homo sapiens" /db_xref="taxon:9606" /map="8pter-q24" gene 52..1239 /gene="BN51T" CDS 52..1239 /gene="BN51T" /note="BN51 protein" /codon_start=1 /db_xref="GDB:G00-119-728" /db_xref="PID:g179513" /translation="MSEGNAAGRPARQGPDLLLTGARGSSGGGGLPSPPAVPSIRSRD LTLGGVKKKTFTPNIISRKIKEEPKEEVTVKKEKRERDRDRQREGHGRGRRRPEVIQS HSIFEQGPAEMMKKKGNWDKTVDVSDMGPSHIINIKKEKRETDEETKQILRMLEKDDF LDDPGLRNDTRNMPVQLPLAHSGWLFKEENDEPDVKPWLAGPKEEDMEVDIPAVKVKE EPRDEEEEAKMKAPPKAARKTPGLPKDVSVAELLRELSLTKEEELLFLQLPDTLPGQP PTQDIKPIKTEVQGEDGQVVLIKQEKDREAKLAENACTLADLTEGQVGKLLIRKSGRV QLLLGKVTLDVTMGTACSFLQELVSVGLGDSRTGEMTVLGHVKHKLVCSPDFESLLDH KHR" BASE COUNT 470 a 492 c 550 g 369 t ORIGIN 118 bp upstream of SmaI site. 1 acttccgccc ggcgcgagac cgaagctggc ggctggtcgg ttgcaggcaa catgtcggaa 61 ggaaacgccg ccggacgccc agcacggcag ggcccggacc ttctcctgac tggggcccgg 121 gggtcatcgg gcggcggcgg cctcccctca cccccggccg ttccctccat ccgttccagg 181 gacctaaccc tcgggggagt caagaagaaa accttcaccc caaatatcat cagtcggaag 241 atcaaggaag agcccaagga agaagtaact gtcaagaagg agaagcgtga aagggacaga 301 gaccgacaac gagaggggca tggacgaggg cgacgccgtc cagaagtgat ccagtctcac 361 tccatctttg agcagggccc agctgaaatg atgaagaaaa aagggaactg ggataagaca 421 gtggatgtgt cagacatggg accttctcat atcatcaaca tcaaaaaaga gaagagagag 481 acagacgaag aaactaaaca gatcttgcgt atgctggaga aggacgattt cctcgatgac 541 cccggcctga ggaacgacac tcgaaatatg cctgtgcagc tgccgctggc tcactcagga 601 tggcttttta aggaagaaaa tgacgaacca gatgttaaac cttggctggc tggccccaag 661 gaagaggaca tggaggtgga catacctgct gtgaaagtga aagaggagcc acgagatgag 721 gaggaagagg ccaagatgaa ggctcctccc aaagcagcca ggaagactcc aggcctcccg 781 aaggatgtat ctgtggcaga gctgctgagg gagctgagcc tcaccaagga agaggaactg 841 ctgtttctgc agctgccaga caccctccct ggccagccac ccacccagga catcaagcct 901 atcaagacag aggtgcaggg cgaggacgga caggtggtgc tcatcaagca ggagaaagac 961 cgagaagcca aattggcaga gaatgcttgt accctggctg acctgacaga gggtcaggtt 1021 ggcaagctac tcatccgcaa gtctggaagg gtgcaactcc tcttgggcaa ggtgactctg 1081 gacgtgacca tgggaactgc ctgctccttc ctgcaggagc tggtgtccgt gggccttgga 1141 gacagtagga caggggagat gacagtcctg ggacacgtga agcacaaact tgtatgttcc 1201 cctgattttg aatccctctt ggatcacaaa caccggtaaa atgagcaggt ggaggaggac 1261 ggcgcctgtg cccacgctgc tgcctgctcc agacattttg ttcttgaatc tgtgagaccc 1321 agaagggccc actgagccca ctcactccac ctttggcaac cattgttcca ggtcccccag 1381 ggcttcctcc cacagcagct gtgaatggca cagtgacctt cctgcagcgt ggagatggca 1441 catccttgct gctggggact tggccctgct atttattttt gtatttatgt cttaatctct 1501 tccactgatg catcctccaa gggtagatgg ggagggtctg tgtgaagggg ccggcttctc 1561 ttggtgcctg ctgggttgca ggggcaggaa gcgtgtggac tgcagcttct gctggtgctc 1621 cccccgtcct cctggaggca gtataggaga gagagcaagg attgagtctg agacttaagc 1681 actcggtccc agcttgccag ttcctggttc tgtgtccttg gaaaactacc taacctttct 1741 gagcctccta tactatccga cacaaatggg gatgatacct acctccaggg ttggcgtgag 1801 gattcatggg ctattataga tgaaaactgc acaaggccag aactagcagg cactcaataa 1861 acgttcatgt cctttttctc t // LOCUS HUMBP1 915 bp mRNA PRI 24-JUL-1996 DEFINITION Human mRNA for MOBP (myelin-associated oligodendrocytic basic protein), complete cds, clone hOPRP1. ACCESSION D28113 NID g662276 KEYWORDS MOBP; myelin-associated oligodendrocytic basic protein. SOURCE Homo sapiens spinal cord cDNA to mRNA, clone:hOPRP1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Yamamoto,Y., Mizuno,R., Nishimura,T., Ogawa,Y., Yoshikawa,H., Fujimura,H., Adachi,E., Kishimoto,T., Yanagihara,T. and Sakoda,S. TITLE Cloning and expression of myelin-associated oligodendrocytic basic protein. A novel basic protein constituting the central nervous system myelin JOURNAL J. Biol. Chem. 269 (50), 31725-31730 (1994) MEDLINE 95081123 REFERENCE 2 (bases 1 to 915) AUTHORS Yamamoto,Y. TITLE Direct Submission JOURNAL Submitted (24-JAN-1994) to the DDBJ/EMBL/GenBank databases. Yoichi Yamamoto, Osaka University Hospital, Department of Neurology; 2-2 Yamadaoka, Suita, Osaka 565, Japan (Tel:06-879-3573, Fax:06-879-3579) FEATURES Location/Qualifiers source 1..915 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hOPRP1" /tissue_type="spinal cord" CDS 9..557 /codon_start=1 /product="MOBP" /db_xref="PID:d1006204" /db_xref="PID:g1408049" /translation="MSQKPAKEGPRLSKNQKYSEHFSIHCCPPFTFLNSKKEIVDRKY SICKSGCFYQKKEEDWICCACQKTRTSRRAKSPQRPKQQPAAPPAVVRAPAKPRSPPR SERQPRSPPRSERQPRSPPRSERQPRSPPRSERQPRPRPEVRPPPAKQRPPQKSKQQP RSSPLRGPGASRGGSPVKASRF" BASE COUNT 219 a 291 c 235 g 170 t ORIGIN 1 acagtgagat gagtcagaaa ccggccaagg agggtcccag actctccaaa aaccagaagt 61 actccgaaca cttcagcata cactgctgcc cgccgttcac cttcctcaat tccaagaagg 121 agatagtgga tcggaaatac agcatctgta agagcggctg cttctaccag aagaaagagg 181 aggactggat ctgctgcgcc tgccagaaga ccagaaccag ccgccgtgcc aagtcccctc 241 agaggcccaa gcaacagcca gctgcgcccc ccgcggtggt cagagcgcca gccaagccac 301 ggtcccctcc gaggtctgag cgtcagccac ggtcccctcc gaggtctgag cgtcagccac 361 ggtcccctcc gaggtctgag cgtcagccac ggtcccctcc gaggtctgag cgtcagccac 421 gtccccgccc agaggtccga ccaccgccag ccaagcagcg tccccctcag aagtccaagc 481 aacagccgcg cagcagcccc ctcagagggc caggcgccag ccgtgggggg tcccccgtca 541 aagcttctag gttctgattg aaaaggagga tcaggccaac cccaaagaag aagtgaccaa 601 ggaggagttt aaactgaatg aacaacctcg gctcctggac tcattgcttc acaacccatc 661 tacccctgga tgaagttatc tggcttcaaa tattatgcag gggcaaacac ctgctgatgt 721 ggcaactgct gatgctcatg gtccccatgg catgggggcc tcagggcagc ctgcctggag 781 tactttgaag atgtcatccc attgtcttct gacctctata attgccactg agagatctgc 841 tgtcagtctg cttatccttc cacggactca agtttcttca atctgaagat acatgtcttt 901 ctccagggac atgtg // LOCUS HUMBPAG1B 8684 bp mRNA PRI 07-NOV-1994 DEFINITION Human bullous 230 kDa pemphigoid antigen (BPAG1) mRNA, complete cds. ACCESSION L11690 NID g402479 KEYWORDS bullous pemphigoid antigen. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Elgart,G.W. and Stanley,J.R. TITLE Cloning of the 5' mRNA for the 230-kD bullous pemphigoid antigen by rapid amplification of cDNA ends JOURNAL J. Invest. Dermatol. 101 (2), 244-246 (1993) MEDLINE 93346806 COMMENT Ref [1] reports bp 1-1822. FEATURES Location/Qualifiers source 1..8684 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="keratinocyte" /map="6p12-p11" gene 103..8052 /gene="BPAG1" CDS 103..8052 /gene="BPAG1" /codon_start=1 /db_xref="GDB:G00-125-207" /product="bullous pemphigoid antigen" /db_xref="PID:g403124" /translation="MHSSSYSYRSSDSVFSNTTSTRTSLDSNENLLLVHCGPTLINSC ISFGSESFDGHRLEMLQQIANRVQRDSVICEDKLILAGNALQSDSKRLESGVQFQNEA EIAGYILECENLLRQHVIDVQILIDGKYYQADQLVQRVAKLRDEIMALRNECSSVYSK GRILTTEQTKLMISGITQSLNSGFAQTLHPSLTSGLTQSLTPSLTSSSMTSGLSSGMT SRLTPSVTPAYTPGFPSGLVPNFSSGVEPNSLQTLKLMQIRKPLLKSSLLDQNLTEEE INMKFVQDLLNWVDEMQVQLDRTEWGSDLPSVESHLENHKNVHRAIEEFESSLKEAKI SEIQMTAPLKLTYAEKLHRLESQYAKLLNTSRNQERHLDTLHNFVSRATNELIWLNEK EEEEVAYDWSERNTNIARKKDYHAELMRELDQKEENIKSVQEIAEQLLLENHPARLTI EAYRAAMQTQWSWILQLCQCVEQHIKENTAYFEFFNDAKEATDYLRNLKDAIQRKYSC DRSSSIHKLEDLVQESMEEKEELLQYKSTIANLMGKAKTIIQLKPRNSDCPLKTSIPI KAICDYRQIEITIYKDDECVLANNSHRAKWKVISPTGNEAMVPSVCFTVPPPNKEAVD LANRIEQQYQNVLTLWHESHINMKSVVSWHYLINEIDRIRASNVASIKTMLPGEHQQV LSNLQSRFEDFLEDSQESQVFSGSDITQLEKEVNVCKQYYQELLKSAEREEQEESVYN LYISEVRNIRLRLENCEDRLIRQIRTPLERDDLHESVFRITEQEKLKKELERLKDDLG TITNKCEEFFSQAAASSSVPTLRSELNVVLQNMNQVYSMSSTYIDKLKTVNLVLKNTQ AAEALVKLYETKLCEEEAVIADKNNIENLISTLKQWRSEVDEKRQVFHALEDELQKAK AISDEMFKTYKERDLDFDWHKEKADQLVERWQNVHVQIDNRLRDLEGIGKSLKYYRDT YHPLDDWIQQVETTQRKIQENQPENSKTLATQLNQQKMLVSEIEMKQSKMDECQKYAE QYSATVKDYELQTMTYRAMVDSQQKSPVKRRRMQSSADLIIQEFMDLRTRYTALVTLM TQYIKFAGDSLKRLEEEEIKRCKETSEHGAYSDLLQRQKATVLENSKLTGKISELERM VAELKKQKSRVEEELPKVREAAENELRKQQRNVEDISLQKIRAESEAKQYRRELETIV REKEAAERELERVRQLTIEAEAKRAAVEENLLNFRNQLEENTFTRRTLEDHLKRKDLS LNDLEQQKNKLMEELRRKRDNEEELLKLIKQMEKDLAFQKQVAEKQLKEKQKIELEAR RKITEIQYTCRENALPVCPITQATSCRAVTGLQQEHDKQKAEELKQQVDELTAANRKA EQDMRELTYELNALQLEKTSSEEKARLLKDKLDETNNTLRCLKLELERKDQAEKGYSQ QLRELGRQLNQTTGKAEEAMQEASDLKKIKRNYQLELESLNHEKGKLQREVDRITRAH AVAEKNIQHLNSQIHSFRDEKELERLQICQRKSDHLKEQFEKSHEQLLQNIKAEKENN DKIQRLNEELEKSNECAEMLKQKVEELTRQNNETKLMMQRIQAESENIVLEKQTIQQR CEALKIQADGFKDQLRSTNEHLHKQTKTEQDFQRKIKCLEEDLAKSQNLVSEFKQKCD QQNIIIQNTKKEVRNLNAELNASKEEKRRGEQKVQLQQAQVQELNNRLKKVQDELHLK TIEEQMTHRKMVLFQEESGKFKQSAEEFRKKMEKLMESKVITENDISGIRLDFVSLQQ ENSRAQENAKLCETNIKELERQLQQYREQMQQGQHMEANHYQKCQKLEDELIAQKREV ENLKQKMDQQIKEHEHQLVLLQCEIQKKSTAKDCTFKPDFEMTVKECQHSGELSSRNT GHLHPTPRSPLLRWTQEPQPLEEKWQHRVVEQIPKEVQFQPPGAPLEKEKSQQCYSEY FSQTSTELQITFDETNPITRLSEIEKIRDQALNNSRPPVRYQDNACEMELVKVLTPLE IAKNKQYDMHTEVTTLKQEKNPVPSAEEWMLEGCRASGGLKKGDFLKKGLEPETFQNF DGDHACSVRDDEFKFQGLRHTVTARQLVEAKLLDMRTIEQLRLGLKTVEEVQKTLNKF LTKATSIAGLYLESTKEKISFASAAERIIIDKMVALAFLEAQAATGFIIDPISGQTYS VEDAVLKGVVDPEFRIRLLEAEKAAVGYSYSSKTLSVFQAMENRMLDRQKGKHILEAQ IASGGVIDPVRGIRVPPEIALQQGLLNNAILQFLHEPSSNTRVFPNPNNKQALYYSEL LRMCVFDVESQCFLFPFGERNISNLNVKKTHRISVVDTKTGSELTVYEAFQRNLIEKS IYLELSGQQYQWKEAMFFESYGHSSHMLTDTKTGLHFNINEAIEQGTIDKALVKKYQE GLITLTELADSLLSRLVPKKDLHSPVAGYWLTASGERISVLKASRRNLVDRITALRCL EAQVSTGGIIDPLTGKKYRVAEALHRGLVDEGFAQQLRQCELVITGIGHPITNKMMSV VEAVNANIINKEMGIRCLEFQYLTGGLIEPQVHSRLSIEEALQVGIIDVLIATKLKDQ KSYVRNIICPQTKRKLTYKEALEKADFDFHTGLKLLEVSEPLMTGISSLYYSS" BASE COUNT 3067 a 1571 c 1918 g 2128 t ORIGIN 1 gagctgccac ttttcaccgt tagaagtaga gctttttcca gacctcctac cttttagtct 61 actttgaaag gtgaaagaaa gaacatcgtt tcaggaataa aaatgcacag tagtagttat 121 agttaccgta gcagtgattc tgtgtttagt aacactacca gcactcgaac cagtcttgat 181 tcaaatgaaa atcttctctt ggttcattgt ggtccaacac tgatcaactc ttgcattagc 241 ttcggtagtg aatcctttga tggacacagg ttagaaatgt tgcaacagat tgccaacaga 301 gttcagaggg acagtgtcat ctgtgaagac aaactgattc ttgctggaaa tgctcttcag 361 tctgattcta aaagattaga atcaggagtg cagtttcaga atgaagcaga aattgctggg 421 tatatacttg aatgtgagaa ccttttacgc cagcatgtaa ttgatgtaca gattcttatt 481 gatggaaaat actaccaggc agatcaattg gtacagaggg ttgcaaaact gcgtgacgaa 541 attatggcct taaggaacga atgttcttct gtgtacagca aaggacgcat actgacaaca 601 gaacagacaa agctcatgat atcaggaatc actcaaagtt taaactcagg atttgcacag 661 accttacacc ctagtctgac ctcagggctg acccagagtt taacaccttc cctaacctct 721 tctagtatga cttctggcct gtcatcaggg atgacttccc gcctgactcc atctgtcact 781 ccagcttata cacctggttt cccatcagga ttagttccaa atttcagttc aggagtagag 841 ccaaattcat tgcaaacttt gaagttgatg cagatccgaa aaccccttct aaagtcttct 901 ttgctggatc aaaatttaac agaagaagaa atcaatatga aatttgttca ggatcttttg 961 aattgggttg atgagatgca ggtacaactg gaccgcactg agtggggctc agatttgcca 1021 agtgttgaaa gccatttaga aaatcataaa aatgttcata gagctattga agaatttgaa 1081 tctagtctca aagaagctaa aatcagtgag attcaaatga cagcacctct taaactgact 1141 tatgcagaaa agttgcacag attagagagt cagtatgcaa aactcttgaa tacatccagg 1201 aatcaagaac ggcaccttga tacactccat aattttgtaa gtcgtgcgac taatgaactt 1261 atttggttga atgaaaaaga agaggaggaa gttgcttatg actggagtga gagaaacacc 1321 aacatagcta ggaaaaaaga ttatcatgct gaattaatga gagaacttga tcaaaaggaa 1381 gaaaatatta aatcagttca ggagatagca gagcagctac ttctagaaaa tcatccagcc 1441 cggttaacta ttgaggccta cagagcggca atgcagacgc agtggagctg gatcttacag 1501 ctctgccagt gtgtggagca gcacataaag gagaacacag cgtatttcga gtttttcaat 1561 gatgccaaag aagctactga ttacttaagg aatctaaaag atgccattca gcggaagtac 1621 agctgtgata gatcaagcag cattcacaag ctagaagacc ttgttcagga atcaatggaa 1681 gagaaagaag aacttctgca gtacaaaagc actatagcaa acctaatggg aaaagcaaaa 1741 acaataattc aactgaagcc aaggaattct gactgtccac tcaaaacttc tattccgatc 1801 aaagctatct gtgactacag acaaattgag ataaccattt acaaagacga tgaatgtgtt 1861 ttggcgaata actctcatcg tgctaaatgg aaggtcatta gtcctactgg gaatgaggct 1921 atggtcccat ctgtgtgctt caccgttcct ccaccaaaca aagaagcggt ggaccttgcc 1981 aacagaattg agcaacagta tcagaatgtc ctgactcttt ggcatgagtc tcacataaac 2041 atgaagagtg tagtatcctg gcattatctc atcaatgaaa ttgatagaat tcgagctagc 2101 aatgtggctt caataaagac aatgctacct ggtgaacatc agcaagttct aagtaatcta 2161 caatctcgtt ttgaagattt tctggaagat agccaggaat cccaagtctt ttcaggctca 2221 gatataacac aactggaaaa ggaggttaat gtatgtaagc agtattatca agaacttctt 2281 aaatctgcag aaagagagga gcaagaggaa tcagtttata atctctacat ctctgaagtt 2341 cgaaacatta gacttcggtt agagaactgt gaagatcggc tgattagaca gattcgaact 2401 cccctggaaa gagatgattt gcatgaaagt gtgttcagaa tcacagaaca ggagaaacta 2461 aagaaagagc tggaacgact taaagatgat ttgggaacaa tcacaaataa gtgtgaggag 2521 tttttcagtc aagcagcagc ctcttcatca gtccctaccc tacgatcaga gcttaatgtg 2581 gtccttcaga acatgaacca agtctattct atgtcttcca cttacataga taagttgaaa 2641 actgttaact tggtgttaaa aaacactcaa gctgcagaag ccctcgtaaa actctatgaa 2701 actaaactgt gtgaagaaga agcagttata gctgacaaga ataatattga gaatctaata 2761 agtactttaa agcaatggag atctgaagta gatgaaaaga gacaggtatt ccatgcctta 2821 gaggatgagt tgcagaaagc taaagccatc agtgatgaaa tgtttaaaac gtataaagaa 2881 cgggaccttg attttgactg gcacaaagaa aaagcagatc aattagttga aaggtggcaa 2941 aatgttcatg tgcagattga caacaggtta cgggacttag agggcattgg caaatcactg 3001 aagtactaca gagacactta ccatccttta gatgattgga tccagcaggt tgaaactact 3061 cagagaaaga ttcaggaaaa tcagcctgaa aatagtaaaa ccctagccac acagttgaat 3121 caacagaaga tgctggtgtc cgaaatagaa atgaaacaga gcaaaatgga cgagtgtcaa 3181 aaatatgcag aacagtactc agctacagtg aaggactatg aattacaaac aatgacctac 3241 cgggccatgg tagattcaca acaaaaatct ccagtgaaac gccgaagaat gcagagttca 3301 gcagatctca ttattcaaga gttcatggac ctaaggactc gatatactgc cctggtcact 3361 ctcatgacac aatatattaa atttgctggt gattcattga agaggctgga agaggaggag 3421 attaaaaggt gtaaggagac ttctgaacat ggggcatatt cagatctgct tcagcgtcag 3481 aaggcaacag tgcttgagaa tagcaaactt acaggaaaga taagtgagtt ggaaagaatg 3541 gtagctgaac taaagaaaca aaagtcccga gtagaggaag aacttccgaa ggtcagggag 3601 gctgcagaaa atgaattgag aaagcagcag agaaatgtag aagatatctc tctgcagaag 3661 ataagggctg aaagtgaagc caagcagtac cgcagggaac ttgaaaccat tgtgagagag 3721 aaggaagccg ctgaaagaga actggagcgg gtgaggcagc tcaccataga ggccgaggct 3781 aaaagagctg ccgtggaaga gaacctcctg aattttcgca atcagttgga ggaaaacacc 3841 tttaccagac gaacactgga agatcatctt aaaagaaaag atttaagtct caatgatttg 3901 gagcaacaaa aaaataaatt aatggaagaa ttaagaagaa agagagacaa tgaggaagaa 3961 ctcttgaagc tgataaagca gatggaaaaa gaccttgcat ttcagaaaca ggtagcagag 4021 aaacagttga aagaaaagca gaaaattgaa ttggaagcaa gaagaaaaat aactgaaatt 4081 cagtatacat gtagagaaaa tgcattgcca gtgtgtccga tcacacaggc tacatcatgc 4141 agggcagtaa cgggtctcca gcaagaacat gacaagcaga aagcagaaga actcaaacag 4201 caggtagatg aactaacagc tgccaataga aaggctgaac aagacatgag agagctgaca 4261 tatgaactta atgccctcca gcttgaaaaa acgtcatctg aggaaaaggc tcgtttgcta 4321 aaagataaac tagatgaaac aaataataca ctcagatgcc ttaagttgga gctggaaagg 4381 aaggatcagg cggagaaagg gtattctcaa caactcagag agcttggtag gcaattgaat 4441 caaaccacag gtaaagctga agaagccatg caagaagcta gtgatctcaa gaaaataaag 4501 cgcaattatc agttagaatt agaatctctt aatcatgaaa aagggaaact acaaagagaa 4561 gtagacagaa tcacaagggc acatgctgta gctgagaaga atattcagca tttaaattca 4621 caaattcatt cttttcgaga tgagaaagaa ttagaaagac tacaaatctg ccagagaaaa 4681 tcagatcatc taaaagaaca atttgagaaa agccatgagc agttgcttca aaatatcaaa 4741 gctgaaaaag aaaataatga taaaatccaa aggctcaatg aagaattgga gaaaagtaat 4801 gagtgtgcag agatgctaaa acaaaaagta gaggagctta ctaggcagaa taatgaaacc 4861 aaattaatga tgcagagaat tcaggcagaa tcagagaata tagttttaga gaaacaaact 4921 atccagcaaa gatgtgaagc actgaaaatt caggcagatg gttttaaaga tcagctacgc 4981 agcacaaatg aacacttgca taaacagaca aaaacagagc aggattttca aagaaaaatt 5041 aaatgcctag aagaagacct ggcgaaaagt caaaatttgg taagtgaatt taagcaaaag 5101 tgtgaccaac agaacattat catccagaat accaagaaag aagttagaaa tctgaatgcg 5161 gaactgaatg cttccaaaga agagaagcga cgcggggagc agaaagttca gctacaacaa 5221 gctcaggtgc aagagttaaa taacaggttg aaaaaagtac aagacgaatt acacttaaag 5281 accatagagg agcagatgac ccacagaaag atggttctgt ttcaggaaga atctggtaaa 5341 ttcaaacaat cagcagagga gtttcggaag aagatggaaa aattaatgga gtccaaagtc 5401 atcactgaaa atgatatttc aggcattagg cttgactttg tgtctcttca acaagaaaac 5461 tctagagccc aagaaaatgc taagctttgt gaaacaaaca ttaaagaact tgaaagacag 5521 cttcaacagt atcgtgaaca aatgcagcaa gggcagcaca tggaagcaaa tcattaccaa 5581 aaatgtcaga aacttgagga tgagctgata gcccagaagc gtgaggttga aaacctgaag 5641 caaaaaatgg accaacagat caaagagcat gaacatcaat tagttttgct ccagtgtgaa 5701 attcaaaaaa agagcacagc caaagactgt accttcaaac cagattttga gatgacagtg 5761 aaggagtgcc agcactctgg agagctgtcc tctagaaaca ctggacacct tcacccaaca 5821 cccagatccc ctctgttgag atggactcaa gaaccacagc cattggaaga gaagtggcag 5881 catcgggttg ttgaacagat acccaaagaa gtccaattcc agccaccagg ggctccactc 5941 gagaaagaga aaagccagca gtgttactct gagtactttt ctcagacaag caccgagtta 6001 cagataactt ttgatgagac aaaccccatt acaagactgt ctgaaattga gaagataaga 6061 gaccaagccc tgaacaattc tagaccacct gttaggtatc aagataacgc atgtgaaatg 6121 gaactggtga aggttttgac acccttagag atagctaaga acaagcagta tgatatgcat 6181 acagaagtca caacattaaa acaagaaaag aacccagttc ccagtgctga agaatggatg 6241 cttgaagggt gcagagcatc tggtggactc aagaaagggg atttccttaa gaagggctta 6301 gaaccagaga ccttccagaa ctttgatggt gatcatgcat gttcagtcag ggatgatgaa 6361 tttaaattcc aagggcttag gcacactgtg actgccaggc agttggtgga agctaagctt 6421 ctggacatga gaacaattga gcagctgcga ctcggtctta agactgttga agaagttcag 6481 aaaactctta acaagtttct gacgaaagcc acctcaattg cagggcttta cctagaatct 6541 acaaaagaaa agatttcatt tgcctcagcg gccgagagaa tcataataga caaaatggtg 6601 gctttggcat ttttagaagc tcaggctgca acaggtttta taattgatcc catttcaggt 6661 cagacatatt ctgttgaaga tgcagttctt aaaggagttg ttgaccccga attcagaatt 6721 aggcttcttg aggcagagaa ggcagctgtg ggatattctt attcttctaa gacattgtca 6781 gtgtttcaag ctatggaaaa tagaatgctt gacagacaaa aaggtaaaca tatcttggaa 6841 gcccagattg ccagtggggg tgtcattgac cctgtgagag gcattcgtgt tcctccagaa 6901 attgctctgc agcaggggtt gttgaataat gccatcttac agtttttaca tgagccatcc 6961 agcaacacaa gagttttccc taatcccaat aacaagcaag ctctgtatta ctcagaatta 7021 ctgcgaatgt gtgtatttga tgtagagtcc caatgctttc tgtttccatt tggggagagg 7081 aacatttcca atctcaatgt caagaaaaca catagaattt ctgtagtaga tactaaaaca 7141 ggatcagaat tgaccgtgta tgaggctttc cagagaaacc tgattgagaa aagtatatat 7201 cttgaacttt cagggcagca atatcagtgg aaggaagcta tgttttttga atcctatggg 7261 cattcttctc atatgctgac tgatactaaa acaggattac acttcaatat taatgaggct 7321 atagagcagg gaacaattga caaagccttg gtcaaaaagt atcaggaagg cctcatcaca 7381 cttacagaac ttgctgattc tttgctgagc cggttagtcc ccaagaaaga tttgcacagt 7441 cctgttgcag ggtattggct gactgctagt ggggaaagga tctctgtact aaaagcctcc 7501 cgtagaaatt tggttgatcg gattactgcc ctccgatgcc ttgaagccca agtcagtaca 7561 gggggcataa ttgatcctct tactggcaaa aagtaccggg tggccgaagc tttgcataga 7621 ggcctggttg atgaggggtt tgcccagcag ctgcgacagt gtgaattagt aatcacaggg 7681 attggccatc ccatcactaa caaaatgatg tcagtggtgg aagctgtgaa tgcaaatatt 7741 ataaataagg aaatgggaat ccgatgtttg gaatttcagt acttgacagg agggttgata 7801 gagccacagg ttcactctcg gttatcaata gaagaggctc tccaagtagg tattatagat 7861 gtcctcattg ccacaaaact caaagatcaa aagtcatatg tcagaaatat aatatgccct 7921 cagacaaaaa gaaagttgac atataaagaa gccttagaaa aagctgattt tgatttccac 7981 acaggactta aactgttaga agtatctgag cccctgatga caggaatttc tagcctctac 8041 tattcttcct aatgggacat gtttaaataa ctgtgcaagg ggtgatgcag gctggttcat 8101 gccacttttt cagagtatga tgatatcggc tacatatgca gtctgtgaat tatgtaacat 8161 actctatttc ttgagggctg caaattgcta agtgctcaaa atagagtaag ttttaaattg 8221 aaaattacat aagatttaat gcccttcaaa tggtttcatt tagccttgag aatggttttt 8281 tgaaacttgg ccacactaaa atgttttttt ttttttacgt agaatgtggg ataaacttga 8341 tgaactccaa gttcacagtg tcatttcttc agaactcccc ttcattgaat agtgatcatt 8401 tattaaatga taaattgcac tcgctgaaag agcacgtcat gaagcaccat ggaatcaaag 8461 agaaagatat aaattcgttc ccacagcctt caagctgcag tgttttagat tgcttcaaaa 8521 aatgaaaaag ttttgccttt ttcgatatag tgaccttctt tgcatattaa aatgtttacc 8581 acaatgtccc atttctagtt aagtcttcgc acttgaaagc taacattatg aatattatgt 8641 gttggaggag gggaaggatt ttcttcattc tgtgtatttt ccgg // LOCUS HUMBPIAA 1813 bp mRNA PRI 31-OCT-1994 DEFINITION Human bactericidal permeability increasing protein (BPI) mRNA, complete cds. ACCESSION J04739 NID g179528 KEYWORDS bactericidal permeability increasing protein; bactericidal protein. SOURCE Human myeloid leukemia cell line HL-60, cDNA to mRNA, clone 4. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1813) AUTHORS Gray,P.W., Flaggs,G., Leong,S.R., Gumina,R.J., Weiss,J., Ooi,C.E. and Elsbach,P. TITLE Cloning of the cDNA of a human neutrophil bactericidal protein. Structural and functional correlations JOURNAL J. Biol. Chem. 264 (16), 9505-9509 (1989) MEDLINE 89255455 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Gray, 21-MAR-1989. FEATURES Location/Qualifiers source 1..1813 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Unassigned" mRNA <1..1813 /note="bactericidal permeability increasing protein" sig_peptide 31..123 /gene="BPI" /note="bactericidal permeability increasing protein signal peptide" gene 31..1494 /gene="BPI" CDS 31..1494 /gene="BPI" /note="bactericidal permeability increasing protein (BPI) precursor" /codon_start=1 /db_xref="GDB:G00-131-572" /db_xref="PID:g179529" /translation="MRENMARGPCNAPRWVSLMVLVAIGTAVTAAVNPGVVVRISQKG LDYASQQGTAALQKELKRIKIPDYSDSFKIKHLGKGHYSFYSMDIREFQLPSSQISMV PNVGLKFSISNANIKISGKWKAQKRFLKMSGNFDLSIEGMSISADLKLGSNPTSGKPT ITCSSCSSHINSVHVHISKSKVGWLIQLFHKKIESALRNKMNSQVCEKVTNSVSSKLQ PYFQTLPVMTKIDSVAGINYGLVAPPATTAETLDVQMKGEFYSENHHNPPPFAPPVME FPAAHDRMVYLGLSDYFFNTAGLVYQEAGVLKMTLRDDMIPKESKFRLTTKFFGTFLP EVAKKFPNMKIQIHVSASTPPHLSVQPTGLTFYPAVDVQAFAVLPNSSLASLFLIGMH TTGSMEVSAESNRLVGELKLDRLLLELKHSNIGPFPVELLQDIMNYIVPILVLPRVNE KLQKGFPLPTPARVQLYNVVLQPHQNFLLFGADVVYK" mat_peptide 124..1491 /gene="BPI" /note="bactericidal permeability increasing protein" BASE COUNT 452 a 500 c 441 g 420 t ORIGIN 1 caggccttga ggttttggca gctctggagg atgagagaga acatggccag gggcccttgc 61 aacgcgccga gatgggtgtc cctgatggtg ctcgtcgcca taggcaccgc cgtgacagcg 121 gccgtcaacc ctggcgtcgt ggtcaggatc tcccagaagg gcctggacta cgccagccag 181 caggggacgg ccgctctgca gaaggagctg aagaggatca agattcctga ctactcagac 241 agctttaaga tcaagcatct tgggaagggg cattatagct tctacagcat ggacatccgt 301 gaattccagc ttcccagttc ccagataagc atggtgccca atgtgggcct taagttctcc 361 atcagcaacg ccaatatcaa gatcagcggg aaatggaagg cacaaaagag attcttaaaa 421 atgagcggca attttgacct gagcatagaa ggcatgtcca tttcggctga tctgaagctg 481 ggcagtaacc ccacgtcagg caagcccacc atcacctgct ccagctgcag cagccacatc 541 aacagtgtcc acgtgcacat ctcaaagagc aaagtcgggt ggctgatcca actcttccac 601 aaaaaaattg agtctgcgct tcgaaacaag atgaacagcc aggtctgcga gaaagtgacc 661 aattctgtat cctccaagct gcaaccttat ttccagactc tgccagtaat gaccaaaata 721 gattctgtgg ctggaatcaa ctatggtctg gtggcacctc cagcaaccac ggctgagacc 781 ctggatgtac agatgaaggg ggagttttac agtgagaacc accacaatcc acctcccttt 841 gctccaccag tgatggagtt tcccgctgcc catgaccgca tggtatacct gggcctctca 901 gactacttct tcaacacagc cgggcttgta taccaagagg ctggggtctt gaagatgacc 961 cttagagatg acatgattcc aaaggagtcc aaatttcgac tgacaaccaa gttctttgga 1021 accttcctac ctgaggtggc caagaagttt cccaacatga agatacagat ccatgtctca 1081 gcctccaccc cgccacacct gtctgtgcag cccaccggcc ttaccttcta ccctgccgtg 1141 gatgtccagg cctttgccgt cctccccaac tcctccctgg cttccctctt cctgattggc 1201 atgcacacaa ctggttccat ggaggtcagc gccgagtcca acaggcttgt tggagagctc 1261 aagctggata ggctgctcct ggaactgaag cactcaaata ttggcccctt cccggttgaa 1321 ttgctgcagg atatcatgaa ctacattgta cccattcttg tgctgcccag ggttaacgag 1381 aaactacaga aaggcttccc tctcccgacg ccggccagag tccagctcta caacgtagtg 1441 cttcagcctc accagaactt cctgctgttc ggtgcagacg ttgtctataa atgaaggcac 1501 caggggtgcc gggggctgtc agccgcacct gttcctgatg ggctgtgggg caccggctgc 1561 ctttccccag ggaatcctct ccagatctta accaagagcc ccttgcaaac ttcttcgact 1621 cagattcaga aatgatctaa acacgaggaa acattattca ttggaaaagt gcatggtgtg 1681 tattttaggg attatgagct tctttcaagg gctaaggctg cagagatatt tcctccagga 1741 atcgtgtttc aattgtaacc aagaaatttc catttgtgct tcatgaaaaa aaacttctgg 1801 tttttttcat gtg // LOCUS HUMBPIGE 914 bp mRNA PRI 07-FEB-1991 DEFINITION Human IgE-binding protein (epsilon-BP) mRNA, complete cds. ACCESSION M57710 J02921 NID g179530 KEYWORDS IgE-binding protein. SOURCE Human lung, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 914) AUTHORS Robertson,M.W., Albrandt,K.A., Keller,D. and Liu,F.-T. TITLE Human IgE-binding protein: A soluble lectin exhibiting a highly conserved interspecies sequence and differential recognition of IgE glycoforms JOURNAL Biochemistry 29, 8093-8100 (1990) MEDLINE 91084480 FEATURES Location/Qualifiers source 1..914 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="lung" gene 19..900 /gene="epsilon-BP" CDS 19..771 /gene="epsilon-BP" /codon_start=1 /product="IgE-binding protein" /db_xref="PID:g179531" /translation="MADNFSLHDALSGSGNPNPQGWPGAWGNQPAGAGGYPGASYPGA YPGQAPPGAYPGQAPPGAYHGAPGAYPGAPAPGVYPGPPSGPGAYPSSGQPSAPGAYP ATGPYGAPAGPLIVPYNLPLPGGVVPRMLITILGTVKPNANRIALDFQRGNDVAFHFN PRFNENNRRVIVCNTKLDNNWGREERQSVFPFESGKPFKIQVLVEPDHFKVAVNDAHL LQYNHRVKKLNEISKLGISGDIDLTSASYTMI" polyA_signal 895..900 /gene="epsilon-BP" BASE COUNT 253 a 233 c 220 g 208 t ORIGIN 1 ccagccaacg agcggaaaat ggcagacaat ttttcgctcc atgatgcgtt atctgggtct 61 ggaaacccaa accctcaagg atggcctggc gcatggggga accagcctgc tggggcaggg 121 ggctacccag gggcttccta tcctggggcc taccccgggc aggcaccccc aggggcttat 181 cctggacagg cacctccagg cgcctaccat ggagcacctg gagcttatcc cggagcacct 241 gcacctggag tctacccagg gccacccagc ggccctgggg cctacccatc ttctggacag 301 ccaagtgccc ccggagccta ccctgccact ggcccctatg gcgcccctgc tgggccactg 361 attgtgcctt ataacctgcc tttgcctggg ggagtggtgc ctcgcatgct gataacaatt 421 ctgggcacgg tgaagcccaa tgcaaacaga attgctttag atttccaaag agggaatgat 481 gttgccttcc actttaaccc acgcttcaat gagaacaaca ggagagtcat tgtttgcaat 541 acaaagctgg ataataactg gggaagggaa gaaagacagt cggttttccc atttgaaagt 601 gggaaaccat tcaaaataca agtactggtt gaacctgacc acttcaaggt tgcagtgaat 661 gatgctcact tgttgcagta caatcatcgg gttaaaaaac tcaatgaaat cagcaaactg 721 ggaatttctg gtgacataga cctcaccagt gcttcatata ccatgatata atctgaaagg 781 ggcagattaa aaaaaaaaaa aaagaatcta aaccttacat gtgtaaaggt ttcatgttca 841 ctgtgagtga aaatttttac attcatcaat atccctcttg taagtcatct acttaataaa 901 tattacagtg aaag // LOCUS HUMBRAF 2510 bp mRNA PRI 12-JUN-1992 DEFINITION Human B-raf mRNA, complete cds. ACCESSION M95712 M95720 X54072 NID g179532 KEYWORDS b-raf oncogene; serine/threonine protein kinase. SOURCE Homo sapiens RNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Sithanandam,G. JOURNAL Unpublished (1990) REFERENCE 2 (bases 285 to 2510) AUTHORS Sithanandam,G., Kolch,W., Duh,F.M. and Rapp,U.R. TITLE Complete coding sequence of a human B-raf cDNA and detection of B-raf potein kinase with isozyme specific antibodies JOURNAL Oncogene 5, 1775-1780 (1990) MEDLINE 91133728 REFERENCE 3 (sites) AUTHORS Stephens,R.M., Sithanandam,G., Copeland,T., Kaplan,D.R., Rapp,U.R. and Morrison,D.K. TITLE 95kDa b-Raf serine/threonine kinase: idendification of the protein and its major autophosphorylation site JOURNAL Unpublished (1992) COMMENT From EMBL 27 entry HSBRAF; dated 11-MAR-1991. FEATURES Location/Qualifiers source 1..2510 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="testes" 5'UTR 1..61 /gene="B-raf" /note="putative" gene 1..2444 /gene="B-raf" CDS 62..2359 /gene="B-raf" /codon_start=1 /product="B-raf protein" /db_xref="PID:g179533" /translation="MAALSGGGGGGAEPGQALFNGDMEPEAGAGRPAASSAADPAIPE EVWNIKQMIKLTQEHIEALLDKFGGEHNPPSIYLEAYEEYTSKLDALQQREQQLLESL GNGTDFSVSSSASMDTVTSSSSSSLSVLPSSLSVFQNPTDVARSNPKSPQKPIVRVFL PNKQRTVVPARCGVTVRDSLKKALMMRGLIPECCAVYRIQDGEKKPIGWDTDISWLTG EELHVEVLENVPLTTHNFVRKTFFTLAFCDFCRKLLFQGFRCQTCGYKFHQRCSTEVP LMCVNYDQLDLLFVSKFFEHHPIPQEEASLAETALTSGSSPSAPASDSIGPQILTSPS PSKSIPIPQPFRPADEDHRNQFGQRDRSSSAPNVHINTIEPVNIDDLIRDQGFRGDGG STTGLSATPPASLPGSLTNVKALQKSPGPQRERKSSSSSEDRNRMKTLGRRDSSDDWE IPDGQITVGQRIGSGSFGTVYKGKWHGDVAVKMLNVTAPTPQQLQAFKNEVGVLRKTR HVNILLFMGYSTKPQLAIVTQWCEGSSLYHHLHIIETKFEMIKLIDIARQTAQGMDYL HAKSIIHRDLKSNNIFLHEDLTVKIGDFGLATVKSRWSGSHQFEQLSGSILWMAPEVI RMQDKNPYSFQSDVYAFGIVLYELMTGQLPYSNINNRDQIIFMVGRGYLSPDLSKVRS NCPKAMKRLMAECLKKKRDERPLFPQILASIELLARSLPKIHRSASEPSLNRAGFQTE DFSLYACASPKTPIQAGGYGAFPVH" polyA_signal 2403..2408 /gene="B-raf" polyA_signal 2439..2444 /gene="B-raf" BASE COUNT 747 a 578 c 564 g 621 t ORIGIN 1 cgcctcccgg ccccctcccc gcccgacagc ggccgctcgg gccccggctc tcggttataa 61 gatggcggcg ctgagcggtg gcggtggtgg cggcgcggag ccgggccagg ctctgttcaa 121 cggggacatg gagcccgagg ccggcgccgg ccggcccgcg gcctcttcgg ctgcggaccc 181 tgccattccg gaggaggtgt ggaatatcaa acaaatgatt aagttgacac aggaacatat 241 agaggcccta ttggacaaat ttggtgggga gcataatcca ccatcaatat atctggaggc 301 ctatgaagaa tacaccagca agctagatgc actccaacaa agagaacaac agttattgga 361 atctctgggg aacggaactg atttttctgt ttctagctct gcatcaatgg ataccgttac 421 atcttcttcc tcttctagcc tttcagtgct accttcatct ctttcagttt ttcaaaatcc 481 cacagatgtg gcacggagca accccaagtc accacaaaaa cctatcgtta gagtcttcct 541 gcccaacaaa cagaggacag tggtacctgc aaggtgtgga gttacagtcc gagacagtct 601 aaagaaagca ctgatgatga gaggtctaat cccagagtgc tgtgctgttt acagaattca 661 ggatggagag aagaaaccaa ttggttggga cactgatatt tcctggctta ctggagaaga 721 attgcatgtg gaagtgttgg agaatgttcc acttacaaca cacaactttg tacgaaaaac 781 gtttttcacc ttagcatttt gtgacttttg tcgaaagctg cttttccagg gtttccgctg 841 tcaaacatgt ggttataaat ttcaccagcg ttgtagtaca gaagttccac tgatgtgtgt 901 taattatgac caacttgatt tgctgtttgt ctccaagttc tttgaacacc acccaatacc 961 acaggaagag gcgtccttag cagagactgc cctaacatct ggatcatccc cttccgcacc 1021 cgcctcggac tctattgggc cccaaattct caccagtccg tctccttcaa aatccattcc 1081 aattccacag cccttccgac cagcagatga agatcatcga aatcaatttg ggcaacgaga 1141 ccgatcctca tcagctccca atgtgcatat aaacacaata gaacctgtca atattgatga 1201 cttgattaga gaccaaggat ttcgtggtga tggaggatca accacaggtt tgtctgctac 1261 cccccctgcc tcattacctg gctcactaac taacgtgaaa gccttacaga aatctccagg 1321 acctcagcga gaaaggaagt catcttcatc ctcagaagac aggaatcgaa tgaaaacact 1381 tggtagacgg gactcgagtg atgattggga gattcctgat gggcagatta cagtgggaca 1441 aagaattgga tctggatcat ttggaacagt ctacaaggga aagtggcatg gtgatgtggc 1501 agtgaaaatg ttgaatgtga cagcacctac acctcagcag ttacaagcct tcaaaaatga 1561 agtaggagta ctcaggaaaa cacgacatgt gaatatccta ctcttcatgg gctattccac 1621 aaagccacaa ctggctattg ttacccagtg gtgtgagggc tccagcttgt atcaccatct 1681 ccatatcatt gagaccaaat ttgagatgat caaacttata gatattgcac gacagactgc 1741 acagggcatg gattacttac acgccaagtc aatcatccac agagacctca agagtaataa 1801 tatatttctt catgaagacc tcacagtaaa aataggtgat tttggtctag ctacagtgaa 1861 atctcgatgg agtgggtccc atcagtttga acagttgtct ggatccattt tgtggatggc 1921 accagaagtc atcagaatgc aagataaaaa tccatacagc tttcagtcag atgtatatgc 1981 atttgggatt gttctgtatg aattgatgac tggacagtta ccttattcaa acatcaacaa 2041 cagggaccag ataattttta tggtgggacg aggatacctg tctccagatc tcagtaaggt 2101 acggagtaac tgtccaaaag ccatgaagag attaatggca gagtgcctca aaaagaaaag 2161 agatgagaga ccactctttc cccaaattct cgcctctatt gagctgctgg cccgctcatt 2221 gccaaaaatt caccgcagtg catcagaacc ctccttgaat cgggctggtt tccaaacaga 2281 ggattttagt ctatatgctt gtgcttctcc aaaaacaccc atccaggcag ggggatatgg 2341 tgcgtttcct gtccactgaa acaaatgagt gagagagttc aggagagtag caacaaaagg 2401 aaaataaatg aacatatgtt tgcttatatg ttaaattgaa taaaatactc tctttttttt 2461 taaggtggaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaccc // LOCUS HUMBRAINPR 1441 bp mRNA PRI 31-DEC-1994 DEFINITION Human brain protein recognized by the sera of patients with paraneoplastic sensory neuronopathy mRNA, complete cds. ACCESSION M62843 NID g179536 KEYWORDS brain protein. SOURCE Homo sapiens (tissue library: Zap expression (Stratagene)) cerebellum cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1441) AUTHORS Szabo,A., Dalmau,J., Manley,G., Rosenfeld,M., Wong,E., Henson,J., Posner,J.B. and Furneaux,H.M. TITLE HuD, a paraneoplastic encephalomyelitis antigen, contains RNA-binding domains and is homologous to Elav and Sex-lethal JOURNAL Cell 67 (2), 325-333 (1991) MEDLINE 92005711 FEATURES Location/Qualifiers source 1..1441 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="cerebellum" /tissue_lib="Zap expression (Stratagene)" gene 95..1237 /gene="brain protein" CDS 95..1237 /gene="brain protein" /codon_start=1 /product="brain protein" /db_xref="PID:g179537" /translation="MVMIISTMEPQVSNGPTSNTSNGPSSNNRNCPSPMQTGATTDDS KTNLIVNYLPQNMTQEEFRSLFGSIGEIESCKLVRDKITGQSLGYGFVNYIDPKDAEK AINTLNGLRLQTKTIKVSYARPSSASIRDANLYVSGLPKTMTQKELEQLFSQYGRIIT SRILVDQVTGVSRGVGFIRFDKRIEAEEAIKGLNGQKPSGATEPITVKFANNPSQKSS QALLSQLYQSPNRRYPGPLHHQAQRFRLDNLLNMAYGVKRLMSGPVPPSACSPRFSPI TIDGMTSLVGMNIPGHTGTGWCIFVYNLSPDSDESVLWQLFGPFGAVNNVKVIRDFNT NKCKGFGFVTMTNYDEAAMAIASLNGYRLGDRVLQVSFKTNKAHKS" polyA_site 1441 /gene="brain protein" BASE COUNT 437 a 366 c 306 g 332 t ORIGIN 1 ccaatagtag tcattttaaa tatatattct gaaatctttg caaattttaa cagaagagtc 61 gaagctctgc gagacccaat atttgccaat aagaatggtt atgataatta gcaccatgga 121 gcctcaggtg tcaaatggtc cgacatccaa tacaagcaat ggaccctcca gcaacaacag 181 aaactgtcct tctcccatgc aaacaggggc aaccacagat gacagcaaaa ccaacctcat 241 cgtcaactat ttaccccaga atatgaccca agaagaattc aggagtctct tcgggagcat 301 tggtgaaata gaatcctgca aacttgtgag agacaaaatt acaggacaga gtttagggta 361 tggatttgtt aactatattg atccaaagga tgcagagaaa gccatcaaca ctttaaatgg 421 actcagactc cagaccaaaa ccataaaggt ctcatatgcc cgtccgagct ctgcctcaat 481 cagggatgct aacctctatg ttagcggcct tcccaaaacc atgacccaga aggaactgga 541 gcaacttttc tcgcaatacg gccgtatcat cacctcacga atcctggttg atcaagtcac 601 aggagtgtcc agaggggtgg gattcatccg ctttgataag aggattgagg cagaagaagc 661 catcaaaggg ctgaatggcc agaagcccag cggtgctacg gaaccgatta ctgtgaagtt 721 tgccaacaac cccagccaga agtccagcca ggccctgctc tcccagctct accagtcccc 781 taaccggcgc tacccaggtc cacttcacca ccaggctcag aggttcaggc tggacaattt 841 gcttaatatg gcctatggcg taaagagact gatgtctgga ccagtccccc cttctgcttg 901 ttcccccagg ttctccccaa ttaccattga tggaatgaca agccttgtgg gaatgaacat 961 ccctggtcac acaggaactg ggtggtgcat ctttgtctac aacctgtccc ccgattccga 1021 tgagagtgtc ctctggcagc tctttggccc ctttggagca gtgaacaacg taaaggtgat 1081 tcgtgacttc aacaccaaca agtgcaaggg attcggcttt gtcaccatga ccaactatga 1141 tgaggcggcc atggccatcg ccagcctcaa cgggtaccgc ctgggagaca gagtgttgca 1201 agtttccttt aaaaccaaca aagcccacaa gtcctgaatt tcccattctt acttactaaa 1261 atatatatag aaatatatac gaacaaaaca cacgcgcgca cacacacaca tacacgaaag 1321 agagagaaac aaacttttca aggcttatat tcaaccatgg actttataag ccagtgttgc 1381 ctaagtatta aaacattgga ttatcctgag gtgtaccagg aaaggatttt ataatgctta 1441 g // LOCUS HUMBRCA1 117143 bp DNA PRI 04-DEC-1996 DEFINITION Human BRCA1, Rho7 and vatI genes, complete cds, and ipf35 gene, partial cds. ACCESSION L78833 NID g1698398 KEYWORDS BRCA1 breast cancer susceptibility gene; ifp35 interferon induced gene; rho7 gene; vatI homologue. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 117143) AUTHORS Smith,T.M., Lee,M.K., Szabo,C.I., Jerome,N., McEuen,M., Taylor,M., Hood,L. and King,M.C. TITLE Complete genomic sequence and analysis of 117 kb of human DNA containing the gene BRCA1 JOURNAL Genome Res. 6 (11), 1029-1049 (1996) MEDLINE 97092865 COMMENT GSDB:S:76287. Characterization of an aberrant BRCA1 cDNA clone in the original report (Miki, 1994) led to the misidentification of an inserted Alu element as exon 4. Not normally found in BRCA1 transcripts, insertion of this Alu would lead to introduction of a STOP codon. Hence, BRCA1 exons and introns are numbered 1a, 1b, 2, 3, 5, 6, etc. FEATURES Location/Qualifiers source 1..117143 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Los Alamos chromosome 17 specific cosmid library" /germline /map="17q21" repeat_region complement(222..2071) /partial /note="putative" /rpt_family="pTR5" repeat_region complement(692..992) /partial /note="putative" /rpt_family="AluY" exon 3344..3464 /gene="BRCA1" /note="exon 1a; putative" /number=1 gene 3344..84436 /gene="BRCA1" intron 3465..4619 /gene="BRCA1" /note="intron 1a; putative" /number=1 exon 3621..3998 /gene="BRCA1" /note="exon 1b; putative" /number=1 /db_xref="GDB:126611" repeat_region 3746..3977 /partial /note="putative" /rpt_family="AluSg" repeat_region 3958..3987 /partial /note="putative" /rpt_unit=(CAAAA)x6 intron 3999..4619 /gene="BRCA1" /note="intron 1b; putative" /number=1 repeat_region 4588..4599 /partial /note="putative" /rpt_unit=(AT)x6 exon 4620..4718 /gene="BRCA1" /note="putative" /number=2 CDS join(4639..4718,12955..13008,22201..22278,23778..23866, 24473..24612,28853..28958,31443..31488,32810..32886, 33872..37297,37700..37788,46156..46327,52118..52244, 54211..54401,57494..57804,61038..61125,64782..64859, 65360..65400,71598..71681,77620..77674,79543..79616, 81034..81094,82936..83060) /gene="BRCA1" /codon_start=1 /evidence=experimental /db_xref="PID:g1698399" /translation="MDLSALRVEEVQNVINAMQKILECPICLELIKEPVSTKCDHIFC KFCMLKLLNQKKGPSQCPLCKNDITKRSLQESTRFSQLVEELLKIICAFQLDTGLEYA NSYNFAKKENNSPEHLKDEVSIIQSMGYRNRAKRLLQSEPENPSLQETSLSVQLSNLG TVRTLRTKQRIQPQKTSVYIELGSDSSEDTVNKATYCSVGDQELLQITPQGTRDEISL DSAKKAACEFSETDVTNTEHHQPSNNDLNTTEKRAAERHPEKYQGSSVSNLHVEPCGT NTHASSLQHENSSLLLTKDRMNVEKAEFCNKSKQPGLARSQHNRWAGSKETCNDRRTP STEKKVDLNADPLCERKEWNKQKLPCSENPRDTEDVPWITLNSSIQKVNEWFSRSDEL LGSDDSHDGESESNAKVADVLDVLNEVDEYSGSSEKIDLLASDPHEALICKSERVHSK SVESNIEDKIFGKTYRKKASLPNLSHVTENLIIGAFVTEPQIIQERPLTNKLKRKRRP TSGLHPEDFIKKADLAVQKTPEMINQGTNQTEQNGQVMNITNSGHENKTKGDSIQNEK NPNPIESLEKESAFKTKAEPISSSISNMELELNIHNSKAPKKNRLRRKSSTRHIHALE LVVSRNLSPPNCTELQIDSCSSSEEIKKKKYNQMPVRHSRNLQLMEGKEPATGAKKSN KPNEQTSKRHDSDTFPELKLTNAPGSFTKCSNTSELKEFVNPSLPREEKEEKLETVKV SNNAEDPKDLMLSGERVLQTERSVESSSISLVPGTDYGTQESISLLEVSTLGKAKTEP NKCVSQCAAFENPKGLIHGCSKDNRNDTEGFKYPLGHEVNHSRETSIEMEESELDAQY LQNTFKVSKRQSFAPFSNPGNAEEECATFSAHSGSLKKQSPKVTFECEQKEENQGKNE SNIKPVQTVNITAGFPVVGQKDKPVDNAKCSIKGGSRFCLSSQFRGNETGLITPNKHG LLQNPYRIPPLFPIKSFVKTKCKKNLLEENFEEHSMSPEREMGNENIPSTVSTISRNN IRENVFKEASSSNINEVGSSTNEVGSSINEIGSSDENIQAELGRNRGPKLNAMLRLGV LQPEVYKQSLPGSNCKHPEIKKQEYEEVVQTVNTDFSPYLISDNLEQPMGSSHASQVC SETPDDLLDDGEIKEDTSFAENDIKESSAVFSKSVQKGELSRSPSPFTHTHLAQGYRR GAKKLESSEENLSSEDEELPCFQHLLFGKVNNIPSQSTRHSTVATECLSKNTEENLLS LKNSLNDCSNQVILAKASQEHHLSEETKCSASLFSSQCSELEDLTANTNTQDPFLIGS SKQMRHQSESQGVGLSDKELVSDDEERGTGLEENNQEEQSMDSNLGEAASGCESETSV SEDCSGLSSQSDILTTQQRDTMQHNLIKLQQEMAELEAVLEQHGSQPSNSYPSIISDS SALEDLRNPEQSTSEKAVLTSQKSSEYPISQNPEGLSADKFEVSADSSTSKNKEPGVE RSSPSKCPSLDDRWYMHSCSGSLQNRNYPSQEELIKVVDVEEQQLEESGPHDLTETSY LPRQDLEGTPYLESGISLFSDDPESDPSEDRAPESARVGNIPSSTSALKVPQLKVAES AQSPAAAHTTDTAGYNAMEESVSREKPELTASTERVNKRMSMVVSGLTPEEFMLVYKF ARKHHITLTNLITEETTHVVMKTDAEFVCERTLKYFLGIAGGKWVVSYFWVTQSIKER KMLNEHDFEVRGDVVNGRNHQGPKRARESQDRKIFRGLEICCYGPFTNMPTDQLEWMV QLCGASVVKELSSFTLGTGVHPIVVVQPDAWTEDNGFHAIGQMCEAPVVTREWVLDSV ALYQCQELDTYLIPQIPHSHY" intron 4719..12954 /gene="BRCA1" /note="putative" /number=2 repeat_region 5088..5380 /partial /note="putative" /rpt_family="AluSc" repeat_region complement(5500..5661) /partial /note="putative" /rpt_family="AluJb" repeat_region complement(5662..5970) /partial /note="putative" /rpt_family="AluY" repeat_region complement(5971..6109) /partial /note="putative" /rpt_family="AluSq" repeat_region 6131..6395 /partial /note="putative" /rpt_family="MIR2" repeat_region 6396..6409 /partial /note="putative" /rpt_unit=(TAAAC)x2 repeat_region complement(7032..7330) /partial /note="putative" /rpt_family="AluSp" repeat_region 7334..7357 /partial /note="putative" /rpt_unit=(TTTTG)x4 repeat_region complement(7341..7641) /partial /note="putative" /rpt_family="AluY" repeat_region complement(7648..7940) /partial /note="putative" /rpt_family="AluSg" repeat_region complement(7951..8251) /partial /note="putative" /rpt_family="AluY" repeat_region complement(9450..9549) /partial /note="putative" /rpt_family="AluFLAM_C" repeat_region 9693..9973 /partial /note="putative" /rpt_family="AluSx" repeat_region 9999..10356 /partial /note="putative" /rpt_family="AluSx" repeat_region 10367..10806 /partial /note="putative" /rpt_family="MIR2" repeat_region 10797..10852 /partial /note="putative" /rpt_family="MIR2" repeat_region complement(10984..11275) /partial /note="putative" /rpt_family="AluSg" repeat_region complement(11276..11575) /partial /note="putative" /rpt_family="AluSx" repeat_region 11650..11732 /partial /note="putative" /rpt_family="MER42c" repeat_region 11917..12211 /partial /note="putative" /rpt_family="AluSq" repeat_region 12223..12370 /partial /note="putative" /rpt_family="AluJo" exon 12955..13008 /gene="BRCA1" /note="putative" /number=3 intron 13009..22200 /gene="BRCA1" /note="putative" /number=3 repeat_region complement(13213..13511) /partial /note="putative" /rpt_family="AluSg" repeat_region 14274..14343 /partial /note="putative" /rpt_family="AluFRAM" repeat_region 14503..14644 /partial /note="putative" /rpt_family="AluJo" repeat_region complement(14645..14932) /partial /note="putative" /rpt_family="AluSq" repeat_region 15068..15368 /partial /note="putative" /rpt_family="AluSc" repeat_region 15353..15371 /partial /note="putative" /rpt_unit=(AAAAG)x3 repeat_region 15632..15932 /partial /note="putative" /rpt_family="AluSp" repeat_region 16085..16096 /partial /note="putative" /rpt_unit=(TG)x6 repeat_region complement(16108..16295) /partial /note="putative" /rpt_family="AluJo" repeat_region 16350..16639 /partial /note="putative" /rpt_family="AluSg" repeat_region 16874..16896 /partial /note="putative" /rpt_unit=(TTTG)x5 repeat_region complement(16894..17193) /partial /note="putative" /rpt_family="AluY" repeat_region 16900..16913 /partial /note="putative" /rpt_unit=(TTTGT)x2 repeat_region complement(17196..17496) /partial /note="putative" /rpt_family="AluSx" repeat_region 17201..17215 /partial /note="putative" /rpt_unit=(TTTTG)x3 repeat_region complement(17618..17915) /partial /note="putative" /rpt_family="AluSx" repeat_region complement(17921..18041) /partial /note="putative" /rpt_family="AluSx" repeat_region complement(18071..18129) /partial /note="putative" /rpt_family="MER33" repeat_region complement(18130..18429) /partial /note="putative" /rpt_family="AluJb" repeat_region 18541..18837 /partial /note="putative" /rpt_family="AluY" repeat_region 19438..19664 /partial /note="putative" /rpt_family="AluJb" repeat_region 19716..19739 /partial /note="putative" /rpt_unit=(TTTTG)x4 repeat_region complement(19721..20001) /partial /note="putative" /rpt_family="AluSp" repeat_region complement(20008..20305) /partial /note="putative" /rpt_family="AluSx" repeat_region complement(20385..20671) /partial /note="putative" /rpt_family="AluSx" repeat_region 20524..20535 /partial /note="putative" /rpt_unit=(CAC)x4 repeat_region 20807..20881 /partial /note="putative" /rpt_family="MIR2" repeat_region complement(20924..21029) /partial /note="putative" /rpt_family="L1" repeat_region 21067..21351 /partial /note="putative" /rpt_family="AluSx" repeat_region 21372..21671 /partial /note="putative" /rpt_family="AluY" repeat_region complement(21690..21817) /partial /note="putative" /rpt_family="L1" repeat_region 22176..22189 /partial /note="putative" /rpt_unit=(TTCT)x3 exon 22201..22278 /gene="BRCA1" /note="putative" /number=5 intron 22279..23777 /gene="BRCA1" /note="putative" /number=5 repeat_region complement(22596..22869) /partial /note="putative" /rpt_family="AluJb" repeat_region 22907..23037 /partial /note="putative" /rpt_family="MIR2" repeat_region 23195..23496 /partial /note="putative" /rpt_family="AluJo" repeat_region 23318..23333 /partial /note="putative" /rpt_unit=(AAAT)x4 exon 23778..23866 /gene="BRCA1" /note="putative" /number=6 intron 23867..24472 /gene="BRCA1" /note="putative" /number=6 exon 24473..24612 /gene="BRCA1" /note="putative" /number=7 intron 24613..28852 /gene="BRCA1" /note="putative" /number=7 repeat_region 24628..24650 /partial /note="putative" /rpt_unit=(TTC)x7 repeat_region complement(24657..24909) /partial /note="putative" /rpt_family="AluJo" repeat_region complement(24709..24909) /partial /note="putative" /rpt_family="AluSx" repeat_region 25115..25221 /partial /note="putative" /rpt_family="AluFLAM_A" repeat_region 25135..25243 /partial /note="putative" /rpt_family="SVA" repeat_region complement(25248..25549) /partial /note="putative" /rpt_family="AluSx" repeat_region 25665..25786 /partial /note="putative" /rpt_family="AluFLAM_C" repeat_region 25790..26078 /partial /note="putative" /rpt_family="AluJo" repeat_region 26081..26377 /partial /note="putative" /rpt_family="AluSc" repeat_region 26381..26664 /partial /note="putative" /rpt_family="AluY" repeat_region 26666..26827 /partial /note="putative" /rpt_family="AluJo" repeat_region 26934..27229 /partial /note="putative" /rpt_family="AluSx" repeat_region 27242..27540 /partial /note="putative" /rpt_family="AluSg" repeat_region 27705..27996 /partial /note="putative" /rpt_family="AluSx" repeat_region complement(28013..28033) /partial /note="putative" /rpt_family="AluY" repeat_region 28033..28075 /partial /note="putative" /rpt_unit=(AT)x21 repeat_region 28089..28099 /partial /note="putative" /rpt_unit=(TA)x5 repeat_region 28126..28174 /partial /note="putative" /rpt_unit=(TA)x24 repeat_region 28175..28188 /partial /note="putative" /rpt_unit=(CA)x7 repeat_region 28197..28208 /partial /note="putative" /rpt_unit=(TA)x6 repeat_region 28209..28222 /partial /note="putative" /rpt_unit=(CA)x7 repeat_region 28216..28394 /partial /note="putative" /rpt_family="AluSx" repeat_region complement(28561..28685) /partial /note="putative" /rpt_family="AluJo" exon 28853..28958 /gene="BRCA1" /note="putative" /number=8 intron 28959..31442 /gene="BRCA1" /note="putative" /number=8 repeat_region 29163..29465 /partial /note="putative" /rpt_family="AluSp" repeat_region 29612..29914 /partial /note="putative" /rpt_family="AluSp" repeat_region 29925..30042 /partial /note="putative" /rpt_family="AluJo" repeat_region 30067..30368 /partial /note="putative" /rpt_family="AluSx" repeat_region 30426..30703 /partial /note="putative" /rpt_family="AluJo" repeat_region complement(30886..31117) /partial /note="putative" /rpt_family="MIR" repeat_region complement(31130..31260) /partial /note="putative" /rpt_family="AluSq" repeat_region 31307..31357 /partial /note="putative" /rpt_family="MIR2" exon 31443..31488 /gene="BRCA1" /note="putative" /number=9 intron 31489..32809 /gene="BRCA1" /note="putative" /number=9 repeat_region complement(31577..31755) /partial /note="putative" /rpt_family="L1PA15" repeat_region complement(32142..32311) /partial /note="putative" /rpt_family="AluSc" repeat_region 32322..32334 /partial /note="putative" /rpt_unit=(TTTC)x3 repeat_region complement(32336..32636) /partial /note="putative" /rpt_family="AluSx" exon 32810..32886 /gene="BRCA1" /note="putative" /number=10 intron 32887..33871 /gene="BRCA1" /note="putative" /number=10 repeat_region 33126..33144 /partial /note="putative" /rpt_unit=(TTTG)x4 repeat_region complement(33131..33432) /partial /note="putative" /rpt_family="AluJb" exon 33872..37297 /gene="BRCA1" /note="putative" /number=11 intron 37298..37699 /gene="BRCA1" /note="putative" /number=11 exon 37700..37788 /gene="BRCA1" /note="putative" /number=12 intron 37789..46155 /gene="BRCA1" /note="putative" /number=12 repeat_region 37798..37810 /partial /note="putative" /rpt_unit=(GT)x6 repeat_region 37903..38058 /partial /note="putative" /rpt_family="MIR2" repeat_region 38083..38359 /partial /note="putative" /rpt_family="AluSx" repeat_region 38343..38357 /partial /note="putative" /rpt_unit=(AAAC)x3 repeat_region complement(38445..38528) /partial /note="putative" /rpt_family="MIR" repeat_region complement(38702..39003) /partial /note="putative" /rpt_family="AluJb" repeat_region 39059..39185 /partial /note="putative" /rpt_family="AluSq" repeat_region complement(39398..39700) /partial /note="putative" /rpt_family="AluSq" repeat_region 40124..40287 /partial /note="putative" /rpt_family="SUB_L1-S" repeat_region 40191..40294 /partial /note="putative" /rpt_family="L1MB4" repeat_region 40297..40347 /partial /note="putative" /rpt_family="MER60B" repeat_region 40357..40578 /partial /note="putative" /rpt_family="L1MB4" repeat_region 40681..40990 /partial /note="putative" /rpt_family="AluJo" repeat_region complement(41050..41350) /partial /note="putative" /rpt_family="AluSp" repeat_region complement(42397..42440) /partial /note="putative" /rpt_family="MIR2" repeat_region 42583..42621 /partial /note="putative" /rpt_unit=(TG)x19 repeat_region 42877..42887 /partial /note="putative" /rpt_unit=(TA)x5 repeat_region 43351..43364 /partial /note="putative" /rpt_unit=(TTTTC)x2 repeat_region complement(43352..43473) /partial /note="putative" /rpt_family="AluSg" repeat_region 44267..44401 /partial /note="putative" /rpt_family="AluSx" repeat_region 44402..44699 /partial /note="putative" /rpt_family="AluY" repeat_region 44700..44868 /partial /note="putative" /rpt_family="AluSx" repeat_region 44854..44868 /partial /note="putative" /rpt_unit=(AAAT)x3 repeat_region 45149..45199 /partial /note="putative" /rpt_family="MIR2" repeat_region 45276..45562 /partial /note="putative" /rpt_family="AluSx" repeat_region 45550..45568 /partial /note="putative" /rpt_unit=(AAAG)x4 exon 46156..46327 /gene="BRCA1" /note="putative" /number=13 intron 46328..52117 /gene="BRCA1" /note="putative" /number=13 repeat_region complement(47828..47912) /partial /note="putative" /rpt_family="AluSq" repeat_region complement(47927..48208) /partial /note="putative" /rpt_family="AluJo" repeat_region 48236..48415 /partial /note="putative" /rpt_family="AluSp" gene complement(48899..49390) /gene="rpL21" CDS complement(48899..49390) /gene="rpL21" /note="Similar to ribosomal protein L21" /codon_start=1 /pseudo /evidence=experimental repeat_region 49528..49551 /partial /note="putative" /rpt_unit=(TTTTG)x4 repeat_region complement(49538..49813) /partial /note="putative" /rpt_family="AluSq" repeat_region 50260..50346 /partial /note="putative" /rpt_family="AluJb" repeat_region 50347..50642 /partial /note="putative" /rpt_family="AluSx" repeat_region 50643..50694 /partial /note="putative" /rpt_family="AluJo" repeat_region 50697..51000 /partial /note="putative" /rpt_family="AluSp" repeat_region 50986..51013 /partial /note="putative" /rpt_unit=(AAAT)x7 repeat_region 51099..51399 /partial /note="putative" /rpt_family="AluSq" exon 52118..52244 /gene="BRCA1" /note="putative" /number=14 intron 52245..54210 /gene="BRCA1" /note="putative" /number=14 repeat_region 52610..52624 /partial /note="putative" /rpt_unit=(AAT)x5 repeat_region 52628..52640 /partial /note="putative" /rpt_unit=(TAT)x4 repeat_region complement(52630..52886) /partial /note="putative" /rpt_family="AluJb" repeat_region 53006..53171 /partial /note="putative" /rpt_family="AluSg" repeat_region 53167..53178 /partial /note="putative" /rpt_unit=(AG)x6 repeat_region 53646..53665 /partial /note="putative" /rpt_unit=(TTCC)x5 repeat_region 53667..53684 /partial /note="putative" /rpt_unit=(TCCTT)x3 repeat_region 53688..53708 /partial /note="putative" /rpt_unit=(CCTT)x5 repeat_region 53755..53768 /partial /note="putative" /rpt_unit=(CTTTT)x2 repeat_region 53769..53807 /partial /note="putative" /rpt_unit=(CTTTC)x7 repeat_region complement(53794..53959) /partial /note="putative" /rpt_family="AluSx" exon 54211..54401 /gene="BRCA1" /note="putative" /number=15 intron 54402..57493 /gene="BRCA1" /note="putative" /number=15 repeat_region 54676..54965 /partial /note="putative" /rpt_family="AluSx" repeat_region 54965..54983 /partial /note="putative" /rpt_unit=(AT)x9 repeat_region complement(55565..55676) /partial /note="putative" /rpt_family="AluJo" repeat_region 55729..55838 /partial /note="putative" /rpt_family="AluJo" repeat_region 55865..56164 /partial /note="putative" /rpt_family="AluSp" repeat_region 56175..56321 /partial /note="putative" /rpt_family="AluJb" repeat_region complement(56334..56626) /partial /note="putative" /rpt_family="AluSp" repeat_region 56688..56720 /partial /note="putative" /rpt_unit=(TTTA)x8 repeat_region complement(56705..57010) /partial /note="putative" /rpt_family="AluY" exon 57494..57804 /gene="BRCA1" /note="putative" /number=16 intron 57805..61037 /gene="BRCA1" /note="putative" /number=16 repeat_region 58500..58798 /partial /note="putative" /rpt_family="AluSp" repeat_region complement(60164..60302) /partial /note="putative" /rpt_family="AluFLAM_C" repeat_region complement(60439..60735) /partial /note="putative" /rpt_family="AluY" repeat_region 60845..60896 /partial /note="putative" /rpt_family="AluY" repeat_region complement(60897..60969) /partial /note="putative" /rpt_family="L1PA9" exon 61038..61125 /gene="BRCA1" /note="putative" /number=17 intron 61126..64781 /gene="BRCA1" /note="putative" /number=17 repeat_region 61180..61477 /partial /note="putative" /rpt_family="AluSc" repeat_region 61616..61918 /partial /note="putative" /rpt_family="AluSq" repeat_region 62026..62323 /partial /note="putative" /rpt_family="AluSg" repeat_region 62303..62314 /partial /note="putative" /rpt_unit=(CAA)x4 repeat_region complement(63044..63346) /partial /note="putative" /rpt_family="AluSp" repeat_region 63605..63904 /partial /note="putative" /rpt_family="AluY" repeat_region 64073..64093 /partial /note="putative" /rpt_unit=(GCT)x7 exon 64782..64859 /gene="BRCA1" /note="putative" /number=18 intron 64860..65359 /gene="BRCA1" /note="putative" /number=18 exon 65360..65400 /gene="BRCA1" /note="putative" /number=19 intron 65401..71597 /gene="BRCA1" /note="putative" /number=19 repeat_region 65550..65838 /partial /note="putative" /rpt_family="AluSp" repeat_region 65840..66130 /partial /note="putative" /rpt_family="AluSx" repeat_region 66341..66634 /partial /note="putative" /rpt_family="AluJo" repeat_region 66635..66645 /partial /note="putative" /rpt_unit=(GT)x5 repeat_region 66849..67147 /partial /note="putative" /rpt_family="AluSx" repeat_region complement(67223..67305) /partial /note="putative" /rpt_family="MIR2" repeat_region complement(67355..67578) /partial /note="putative" /rpt_family="MIR2" repeat_region complement(67588..67889) /partial /note="putative" /rpt_family="AluSx" repeat_region 67927..68244 /partial /note="putative" /rpt_family="AluSx" repeat_region 68249..68389 /partial /note="putative" /rpt_family="AluSx" repeat_region 68392..68660 /partial /note="putative" /rpt_family="AluSp" repeat_region 68661..68829 /partial /note="putative" /rpt_family="AluSx" repeat_region 68813..68834 /partial /note="putative" /rpt_unit=(AAT)x7 repeat_region complement(68971..69103) /partial /note="putative" /rpt_family="MIR2" repeat_region 69218..69263 /partial /note="putative" /rpt_unit=(TTG)x15 repeat_region complement(69242..69551) /partial /note="putative" /rpt_family="AluSq" repeat_region complement(69561..69636) /partial /note="putative" /rpt_family="MIR2" repeat_region 70309..70323 /partial /note="putative" /rpt_unit=(TTTC)x3 repeat_region 70335..70353 /partial /note="putative" /rpt_unit=(TG)x9 repeat_region complement(70353..70654) /partial /note="putative" /rpt_family="AluJo" repeat_region complement(70694..70875) /partial /note="putative" /rpt_family="AluSg" repeat_region 70874..70896 /partial /note="putative" /rpt_unit=(TTTG)x5 repeat_region complement(70878..71179) /partial /note="putative" /rpt_family="AluY" repeat_region complement(71199..71505) /partial /note="putative" /rpt_family="AluSx" exon 71598..71681 /gene="BRCA1" /note="putative" /number=20 intron 71682..77619 /gene="BRCA1" /note="putative" /number=20 repeat_region 72017..72047 /partial /note="putative" /rpt_unit=(TC)x15 repeat_region complement(72041..72225) /partial /note="putative" /rpt_family="AluSg" repeat_region 72255..72557 /partial /note="putative" /rpt_family="AluJb" repeat_region complement(72561..72764) /partial /note="putative" /rpt_family="MIR" repeat_region complement(72774..72954) /partial /note="putative" /rpt_family="AluSx" repeat_region complement(72956..73230) /partial /note="putative" /rpt_family="AluSx" repeat_region complement(73232..73367) /partial /note="putative" /rpt_family="AluSq" repeat_region 73684..73988 /partial /note="putative" /rpt_family="AluSq" repeat_region 73994..74289 /partial /note="putative" /rpt_family="AluJb" repeat_region 74275..74291 /partial /note="putative" /rpt_unit=(CAAAA)x3 repeat_region complement(74745..75039) /partial /note="putative" /rpt_family="AluSp" repeat_region complement(75052..75352) /partial /note="putative" /rpt_family="AluY" repeat_region 75594..75854 /partial /note="putative" /rpt_family="AluSg" repeat_region 75907..75956 /partial /note="putative" /rpt_unit=(GT)x25 repeat_region complement(76229..76277) /partial /note="putative" /rpt_family="MER3" repeat_region 76278..76458 /partial /note="putative" /rpt_family="AluJo" repeat_region complement(76460..76596) /partial /note="putative" /rpt_family="MER3" repeat_region 76803..77098 /partial /note="putative" /rpt_family="AluSg" exon 77620..77674 /gene="BRCA1" /note="putative" /number=21 intron 77675..79542 /gene="BRCA1" /note="putative" /number=21 repeat_region complement(78109..78410) /partial /note="putative" /rpt_family="AluY" exon 79543..79616 /gene="BRCA1" /note="putative" /number=22 intron 79617..81033 /gene="BRCA1" /note="putative" /number=22 repeat_region 79920..80224 /partial /note="putative" /rpt_family="AluSx" repeat_region 80218..80230 /partial /note="putative" /rpt_unit=(AG)x6 repeat_region 80234..80535 /partial /note="putative" /rpt_family="AluSc" repeat_region 80525..80537 /partial /note="putative" /rpt_unit=(AG)x6 repeat_region 80545..80683 /partial /note="putative" /rpt_family="AluJo" repeat_region 80686..80965 /partial /note="putative" /rpt_family="AluJb" exon 81034..81094 /gene="BRCA1" /note="putative" /number=23 intron 81095..82935 /gene="BRCA1" /note="putative" /number=23 repeat_region 81337..81635 /partial /note="putative" /rpt_family="AluJo" repeat_region complement(81964..82087) /partial /note="putative" /rpt_family="AluSx" repeat_region 82089..82339 /partial /note="putative" /rpt_family="AluSx" repeat_region 82340..82509 /partial /note="putative" /rpt_family="AluSq" repeat_region 82515..82815 /partial /note="putative" /rpt_family="AluSx" exon 82936..84436 /gene="BRCA1" /note="putative" /number=24 3'UTR 83061..84436 /gene="BRCA1" repeat_region 83637..83932 /partial /note="putative" /rpt_family="AluSx" polyA_signal 84416..84421 /gene="BRCA1" /note="putative" repeat_region 84495..84558 /partial /note="putative" /rpt_family="MIR" repeat_region complement(84619..84714) /partial /note="putative" /rpt_family="MLT1E" repeat_region complement(84659..85023) /partial /note="putative" /rpt_family="MLT1D" repeat_region complement(85059..85107) /partial /note="putative" /rpt_family="MLT1E" repeat_region 86964..87122 /partial /note="putative" /rpt_family="MER5A" repeat_region complement(87301..87621) /partial /note="putative" /rpt_family="MER44A" repeat_region complement(88150..88261) /partial /note="putative" /rpt_family="MIR" repeat_region 88352..88652 /partial /note="putative" /rpt_family="AluY" repeat_region 88640..88656 /partial /note="putative" /rpt_unit=(AAAAC)x3 repeat_region 88725..88735 /partial /note="putative" /rpt_unit=(TG)x5 repeat_region complement(89740..90041) /partial /note="putative" /rpt_family="AluSq" repeat_region 89744..89759 /partial /note="putative" /rpt_unit=(TG)x8 repeat_region complement(90286..90387) /partial /note="putative" /rpt_family="MIR" repeat_region 90566..90611 /partial /note="putative" /rpt_unit=(GT)x23 repeat_region 91071..91359 /partial /note="putative" /rpt_family="AluSq" repeat_region 91593..91892 /partial /note="putative" /rpt_family="AluY" repeat_region 91924..92031 /partial /note="putative" /rpt_family="L1MB5" repeat_region complement(92617..92882) /partial /note="putative" /rpt_family="AluJb" repeat_region 92891..92909 /partial /note="putative" /rpt_unit=(TTTTG)x3 repeat_region complement(92893..93196) /partial /note="putative" /rpt_family="AluJb" repeat_region complement(93496..93663) /partial /note="putative" /rpt_family="MIR" repeat_region 94406..94418 /partial /note="putative" /rpt_unit=(GGAA)x3 repeat_region 94735..94785 /partial /note="putative" /rpt_family="SVA" repeat_region complement(95014..95137) /partial /note="putative" /rpt_family="MIR" repeat_region 95329..95462 /partial /note="putative" /rpt_family="AluY" repeat_region 95463..95762 /partial /note="putative" /rpt_family="AluY" repeat_region 95763..95917 /partial /note="putative" /rpt_family="AluY" repeat_region 95926..96140 /partial /note="putative" /rpt_family="AluSx" repeat_region 96143..96191 /partial /note="putative" /rpt_family="AluSc" repeat_region 96192..96487 /partial /note="putative" /rpt_family="AluSp" repeat_region 96489..96664 /partial /note="putative" /rpt_family="AluSg" 3'UTR complement(96693..100053) /gene="Rho7" gene complement(96693..103623) /gene="Rho7" exon complement(96693..98031) /gene="Rho7" /note="putative" /number=6 polyA_signal 96704..96710 /gene="Rho7" /evidence=experimental repeat_region complement(97592..97762) /partial /note="putative" /rpt_family="AluFRAM" intron complement(98032..100053) /gene="Rho7" /note="putative; does not fit consensus" /number=5 repeat_region 98289..98340 /partial /note="putative" /rpt_family="MIR2" repeat_region 98328..98342 /partial /note="(TGAA)x3; putative" /rpt_unit=(GGAA)x3 repeat_region complement(98811..99120) /partial /note="putative" /rpt_family="AluY" exon 100054..100302 /gene="Rho7" /note="putative" /number=5 CDS complement(join(100054..100302,100539..100673, 101442..101551,102687..102774,103285..103386)) /gene="Rho7" /codon_start=1 /evidence=experimental /db_xref="PID:g1698400" /translation="MEGQSGRCKIVVVGDAECGKTALLQVFAKDAYPGSYVPTVFENY TASFEIDKRRIELNMWDTSGSSYYDNVRPLAYPDSDAVLICFDISRPETLDSVLKKWQ GETQEFCPNAKVVLVGCKLDMRTDLATLRELSKQRLIPVTHEQGTVLAKQVGAVSYVE CSSRSSERSVRDVFHVATVASLGRGHRQLRRTDSRRGMQRSAQLSGRPDRGNEGEIHK DRAKSCNLM" intron complement(100303..100538) /gene="Rho7" /note="putative" /number=4 exon complement(100539..100673) /gene="Rho7" /note="putative" /number=4 intron complement(100674..101441) /gene="Rho7" /note="putative" /number=3 repeat_region 101128..101141 /partial /note="putative" /rpt_unit=(TCCCA)x2 exon complement(101442..101551) /gene="Rho7" /note="putative" /number=3 intron complement(101552..102686) /gene="Rho7" /note="putative" /number=2 repeat_region complement(102066..102366) /partial /note="putative" /rpt_family="AluSx" exon complement(102687..102774) /gene="Rho7" /note="putative" /number=2 intron complement(102775..103284) /gene="Rho7" /note="putative" /number=1 exon complement(103285..103623) /gene="Rho7" /note="putative" /number=1 5'UTR complement(103387..103623) /gene="Rho7" repeat_region 104267..104517 /partial /note="putative" /rpt_family="AluSc" repeat_region 104589..104958 /partial /note="putative" /rpt_family="AluJo" repeat_region complement(105237..105529) /partial /note="putative" /rpt_family="AluY" repeat_region complement(105530..105826) /partial /note="putative" /rpt_family="AluSx" repeat_region complement(105828..106119) /partial /note="putative" /rpt_family="AluSx" repeat_region 106514..106531 /partial /note="(GCC)x6; putative" gene 106659..114716 /gene="VatI" exon 106659..106789 /gene="VatI" /number=1 /evidence=experimental CDS join(106682..106789,109927..110134,110520..110690, 110796..110885,112774..113015,113178..113261) /gene="VatI" /codon_start=1 /evidence=experimental /db_xref="PID:g1698401" /translation="MARQGLYDRLPPLPVTPGMEGAGVVIAVGEGVSDRKAGDRVMVL NRSGMWQEEVTVPSVQTFLIPEAMTFEEAAALLVNYITAYMVLFDFGNLQPGHSVLVH MAAGGVGMAAVQLCRTVENVTVFGTASASKHEALKENGVTHPIDYHTTDYVDEIKKIS PKGVDIVMDPLGGSDTAKGYNLLKPMGKVVTYGMANLLTGPKRNLMALARTWWNQFSV TALQLLQANRAVCGFHLGYLDGEVELVSGVVARLLALYNQGHIKPHIDSVWPFEKVAD AMKQMQEKKNVGKVLLVPGPEKQN" intron 106790..109926 /gene="VatI" /note="putative" /number=1 repeat_region 106805..106819 /partial /note="putative" /rpt_unit=(AGGGC)x3 repeat_region complement(107830..108133) /partial /note="putative" /rpt_family="AluSp" repeat_region 109337..109355 /partial /note="putative" /rpt_unit=(TAT)x6 repeat_region complement(109358..109558) /partial /note="putative" /rpt_family="MIR" repeat_region 109638..109768 /partial /note="putative" /rpt_family="MIR" repeat_region 109785..109799 /partial /note="putative" /rpt_unit=(TTA)x5 exon 109927..110134 /gene="VatI" /note="putative" /number=2 intron 110135..110519 /gene="VatI" /note="putative" /number=2 exon 110520..110690 /gene="VatI" /note="putative" /number=3 intron 110691..110795 /gene="VatI" /note="putative" /number=3 exon 110796..110885 /gene="VatI" /note="putative" /number=4 intron 110886..112773 /gene="VatI" /note="putative" /number=4 repeat_region complement(111122..111423) /partial /note="putative" /rpt_family="AluSq" repeat_region 111550..111700 /partial /note="putative" /rpt_family="AluJb" repeat_region 111701..111756 /partial /note="putative" /rpt_family="AluJb" repeat_region complement(111781..111937) /partial /note="putative" /rpt_family="AluSq" repeat_region complement(111938..112117) /partial /note="putative" /rpt_family="AluSg" repeat_region 112131..112148 /partial /note="putative" /rpt_unit=(TTTGT)x3 repeat_region complement(112137..112439) /partial /note="putative" /rpt_family="AluSp" repeat_region complement(112441..112586) /partial /note="putative" /rpt_family="AluSx" exon 112774..113015 /gene="VatI" /note="putative" /number=5 intron 113016..113177 /gene="VatI" /note="putative" /number=5 exon 113178..114716 /gene="VatI" /number=6 /evidence=experimental repeat_region 113979..113990 /partial /note="putative" /rpt_unit=(TG)x6 polyA_signal 114696..114702 /gene="VatI" /evidence=experimental gene complement(115033..116275) /gene="ifp35" CDS complement(join(<115033..115214,115440..115546, 115660..115846,115949..116055,116128..>116275)) /gene="ifp35" /note="putative" /codon_start=1 /db_xref="PID:g1698402" /translation="VPFSVPKIPLVFRGHTQQDPEVPKSLVSNLRIHCPLLAGSALIT FDDPKVAEQVLQQKEHTINMEECRLRVQVQPLELPMVTTIQVSSQLSGRRVLVTGFPA SLRLSEEELLDKLEIFFGKTRNGGGDVDVRELLPGSVMLGFARDGVAQRLCQIGQFTV PLGGQQVPLRVSPYVNGEIQKAEIRSQPVPRSVLVLNIPDILDGPELHDVLEIHFQKP TRGGGEVEALTVVPQGQQGLAVFTS" intron complement(115215..115439) /gene="ifp35" /note="putative" /number=1 exon complement(115440..115546) /gene="ifp35" /note="putative" /number=1 intron complement(115547..115659) /gene="ifp35" /note="putative" /number=2 exon complement(115660..115846) /gene="ifp35" /note="putative" /number=2 intron complement(115847..115948) /gene="ifp35" /note="putative" /number=3 exon complement(115949..116055) /gene="ifp35" /note="putative" /number=3 intron complement(116056..116127) /gene="ifp35" /note="putative" /number=4 repeat_region complement(116490..116555) /partial /note="putative" /rpt_family="MIR" repeat_region complement(116569..116863) /partial /note="putative" /rpt_family="AluSx" repeat_region complement(116864..116999) /partial /note="putative" /rpt_family="AluSx" BASE COUNT 31159 a 26426 c 27239 g 32319 t ORIGIN 1 acggggtctc gaaaaaagga gaatgggatg agaaggatat atgggtagtg tcatttttta 61 acttgcagat ttcatcctag tcttccagtt atcgtttcct agcactccat gttcccaaga 121 tagtgtcacc accccaagga ctctctctca ttttctttgc ctgggccctc tttctactga 181 ggagtcgtgg ccttccatca gtagaagccg gatgttcttg tgtccgaaat tggtgggttc 241 ttggtctcac tgacttcaag aatgaagttg cggaccctca cggtgagtgg tacagttctt 301 aaagatgatg tgtccagagt ttgttccttc tgatgttcgg acgtgttcag agttacctcc 361 ttctggtgga ttcgtggtct cgctggcttc aggagtgaag ctgcagacct ttgcggtgag 421 tgttacagct cttaaggcgg catgtctgga gtttgttcgt tcctcccgtc tggagttgtt 481 cattcctcct ggtgggttcg tggtctcgct ggcttcagga gtgaagctgc agacctctgc 541 ggtcggtgtt accagcagat aaatgctatg cggacccaaa gagtgagcag cagcaagatt 601 tattgcaaag agcacaagaa caaagcttcc acagcgtgga aggagaccag agcgggttgc 661 tgctgctggc tcaggcagcc tgcatttttt tttttttttt tttttttttt tgagatggag 721 tctccctctg tcacccaggc tggaatgcag tggtgcaatc tgggctcact gcaagctccg 781 cctcccgggt tcacgccatt ctcctgcctc aacctcccca gtagagggga ttacaggcac 841 ccaccaccgc acccagctaa tattttgtct ttttagtaga gtcggggttt cactgtgtta 901 gccaggatgg tctcgatctc ttgacctcgt gatccacccc tctaggcctc ccaaattgct 961 gggattacag gtgtgagcca ctggcaccca gcggggcagc ctgcttttat tcccttatct 1021 gaccccaccc acatcctgtt gattggtcca ttttacagag agctaattgg tccgttttga 1081 cagggtgctg attggtgcat ttacaatccc tgagctagat atacacagag tgctgattgg 1141 tgcatttaca atcctctagc tagacataaa aattctccaa gtccccacta catttgctag 1201 acacagagca ctgattggtg cgtttacaaa cctttagcta gacacagagt gctgattggt 1261 gcatttgcaa accttgagct agacacagag cactgattgg tgcatttaca atcctttagc 1321 tagacacaga agttctccaa gtgcccacca gattagctag atacagagtg ctgattggtg 1381 catccccaaa ccccaagcta gacacagagt gctgactggt gcatataaaa tcctcaggct 1441 agacataaaa gttttccaag tccccatctg actcaggagc ccagctggct tcacctagtg 1501 gatcctgcgc agggctgtgc cgggcgcctg cactcctctc agcccttggg cagtcgatgg 1561 gaccgggcgc tgaggagcag ggggcggtgc ccgtcgggga ggctcaggcc acgctggagc 1621 tcacaggggt tgggaggggg ctcgggcatg gcgggctgca ggtcctgagc cttgccctgt 1681 gcagggcggc tggggcccgg tgagaattca agcggggtgc aggcgggccg gcagtgctgg 1741 gggacccggc gcaccctctg cagctgctgg cccgggtgct aggcccctga ctgcccgggg 1801 ccgggggtgc ggggcccgct gagcccgcgc ccacctggaa ctcgcgctgg ctggcgagcg 1861 ctgcgcgcag ccccagttcc cacacccgcc tctccctcca cacttccccg caagcagagg 1921 gagccggctc tggcttcggc cagcccagag aggggccccc acagcgcagt ggcgggctga 1981 agggctcctc cagcacggcc agaatggacg ccaaggccga ggaggcgccg agagcgagcg 2041 agggctgcta gcacgttgtc acctcgcatt ctgaaccaca gactctccaa ctctccggcg 2101 cttttcgccc actcggtccc tcagaacacg aagggctctc tcatcctgtc actaaaacga 2161 ttagctgtcc ggagacacgg aaaaagtcgc ccctcttctt tgcaggattc ctcccttgaa 2221 cttctccaaa ccctcttagt gtgacgtgac cccaccccta gctaacccag gctgcttcct 2281 taccagcttc ccgccccctg gggaggcggc aatgcaaaga ccgtccgctg ccagctctgc 2341 cgctatctct gtggggtgaa tctaacatgg cggacaaaga cagtaactag tcccgtttct 2401 ccgcgttttc gccaagaaga ttggctctta ccacttgtcc ctcaaaacga ccaccccatt 2461 gactggtggc gattgcgtcg acggagacgg ggcaaaagca agctgaaccc gaaaaataac 2521 aaacactggg gctgaggggt ggaactacga gtgcgcagac atgggccaga gcgcatttcc 2581 cctgccccag gcaaattcgg cgctcactgc gtccccgcag gccactgacc ttacaagact 2641 acttgcccca gactcctggg gctggatggg aattgtagtc tccctaaaga gttgtacgta 2701 tctttttaag gcctagtttc tgctttcaaa atacgaaaac ataacactcc agtccataac 2761 tgttgacaag tacaagcgcg cacaggtctc caatctatcc actggatttc cgtgagaatt 2821 gtgcccgctc tggtattgga tgttcctctc cataagacta cagtttctaa ggaacactgt 2881 ggcgaagacc tttcattccg caacgcatgc tggaaataat tatttccctc caccccccca 2941 acaatcctta ttacttatat ttaccgaaac tggagacctc cattagggcg gaaagagtgg 3001 gggattggga cctcttctta cgactgcttt ggacaatagg tagcgattct gaccttcgta 3061 cagcaattac tgtgatgcaa taagccgcaa ctggaagagt agaggctaga gggcaggcac 3121 tttatggcaa actcaggtag aattcttcct cttccgtctc tttcctttta cgtcatccgg 3181 gggcagactg ggtggccaat ccagagcccc gagagacgct tggctctttc tgtccctccc 3241 atcctctgat tgtaccttga tttcgtattc tgagaggctg ctgcttagcg gtagcccctt 3301 ggtttccgtg gcaacggaaa agcgcgggaa ttacagataa attaaaactg cgactgcgcg 3361 gcgtgagctc gctgagactt cctggacggg ggacaggctg tggggtttct cagataactg 3421 ggcccctgcg ctcaggaggc cttcaccctc tgctctgggt aaaggtagta gagtcccggg 3481 aaagggacag ggggcccaag tgatgctctg gggtactggc gtgggagagt ggatttccga 3541 agctgacaga tgggtattct ttgacggggg gtaggggcgg aacctgagag gcgtaaggcg 3601 ttgtgaaccc tggggagggg ggcagtttgt aggtcgcgag ggaagcgctg aggatcagga 3661 agggggcact gagtgtccgt gggggaatcc tcgtgatagg aactggaata tgccttgagg 3721 gggacactat gtctttaaaa acgtcggctg gtcatgaggt caggagttcc agaccagcct 3781 gaccaacgtg gtgaaactcc gtctctacta aaaatacaaa aattagccgg gcgtggtgcc 3841 gctccagcta ctcaggaggc tgaggcagga gaatcgctag aacccgggag gcggaggttg 3901 cagtgagccg agatcgcgcc attgcactcc agcctgggcg acagagcgag actgtctcaa 3961 aacaaaacaa aacaaaacaa aacaaaaaac accggctggt atgtatgaga ggatgggacc 4021 ttgtggaaga agaggtgcca ggaatatgtc tgggaagggg aggagacagg attttgtggg 4081 agggagaact taagaactgg atccatttgc gccattgaga aagcgcaaga gggaagtaga 4141 ggagcgtcag tagtaacaga tgctgccggc agggatgtgc ttgaggagga tccagagatg 4201 agagcaggtc actgggaaag gttaggggcg gggaggcctt gattggtgtt ggtttggtcg 4261 ttgttgattt tggttttatg caagaaaaag aaaacaacca gaaacattgg agaaagctaa 4321 ggctaccacc acctacccgg tcagtcactc ctctgtagct ttctctttct tggagaaagg 4381 aaaagaccca aggggttggc agcaatatgt gaaaaaattc agaatttatg ttgtctaatt 4441 acaaaaagca acttctagaa tctttaaaaa taaaggacgt tgtcattagt tctttggttt 4501 gtattattct aaaaccttcc aaatcttaaa tttactttat tttaaaatga taaaatgaag 4561 ttgtcatttt ataaaccttt taaaaagata tatatatatg tttttctaat gtgttaaagt 4621 tcattggaac agaaagaaat ggatttatct gctcttcgcg ttgaagaagt acaaaatgtc 4681 attaatgcta tgcagaaaat cttagagtgt cccatctggt aagtcagcac aagagtgtat 4741 taatttggga ttcctatgat tatctcctat gcaaatgaac agaattgacc ttacatacta 4801 gggaagaaaa gacatgtcta gtaagattag gctattgtaa ttgctgattt ccttaactga 4861 agaactttaa aaatatagaa aatgattcct tgttctccat ccactctgcc tctcccactc 4921 ctctcctttt caacacaaat cctgtggtcc gggaaagaca gggactctgt cttgattggt 4981 tctgcactgg ggcaggaatc tagtttagat taactggcat tttggctttt cttccagctc 5041 taaaacaagc tccatcactt gaaatggcaa aataaaatca tggatgaggc cgagggcggt 5101 ggcttatgcc tgtaatccca gcactttggg aggccaaggt ggtaggatca cgaggtcagg 5161 agatcgagac catcctggcc aacatggtga aaccccctct ccactaaaaa tacaaaaatt 5221 agctgggcgt agtggcatgt gcctgtaatc ccagctactc aggaggctga ggcaggagaa 5281 tcacttgaac caggaggcag atgttgctgt gagccaatat ggcaccactg aactccagcg 5341 acagagctaa actccatccc aaaaaaaaaa aaaaaaaaaa aaaaacatgg atgatcggtg 5401 tcgttgagag gataggtatt tggaagaacc tttgtttgaa actggctctg tacatacaat 5461 gaaattacat acttatttac atacaatgaa atgcagaggt ttttttttta tataggatct 5521 ctgtcgagag gctggagtgc agtggtgcta tcacagctca ctgcagcctc aacctcgtca 5581 ggctcaagca atcctcccac ctcagcctcc agagtagcag ggacgatagg tgtgcaccac 5641 catgcccagc taatttttgt attttttttt ctttttttga gatggagtct tgctctgttg 5701 cccaggctgg agtgcagtgg cgcgatctca gctcactgca aactctgcct cccgggttca 5761 tgccattctt ctgcctgagc ctcctgaata gctgggacta caagcaccca ctaccacgcc 5821 cggctaattt tttgtatttt tttttctttt ttagtagagg cgggatttca ccgtgttagc 5881 caggatagtc ttgatctcct gaccttgtga tccacccgcc tgggcctccc aaagtgctag 5941 gattacaggc ataagccact gcgtccagcc attcttgtat ttttctgttg tagagatagg 6001 gttttgctat gttggccatg ctggtctcaa actcctgacc tcaagtgatc taccctccct 6061 tggcctctca aggtgctggg attacaggcc tgagccattg cacccagcca tggtctaaaa 6121 atcttgattg aaataccacc ttttcatttc cagacacccc tatttaaaat taccacaccc 6181 ccagcacaca ctttatcttc tattcctgct gcttctccat aacactgatt actagctgac 6241 attctatgta atgtatccat tttttatctc tagtcccaca gaatgtaaac tccaggatgg 6301 gatttttgtt ttgtttacat acatctgtat gttcagtagt tagaacggta cttgggacct 6361 agttgccact caataaacat ttgtcaaata aataataaac taaactaaat tagttcttta 6421 atttttttaa atatggtgat ggttagtagt gagtaacatt caaaaaataa gttgaaaagt 6481 tgtaccattg cctcttaccc acaataaaaa agggtaaatt cttttctgct ttatgaaagt 6541 tgtttttcat atttgaagtc aagttaatca gattaaggaa aatgtatgtt gtgttttcag 6601 agcgatacaa gatttataaa taaccatcct ctcccttgcc cttcaacatt atagctaaac 6661 aaaaataaga ggaaaacagg attcacaatt tatcaattta ttgaaaatca gagccagaga 6721 agcaggaaat gacattgtag gaaaaaactg cttttgaaaa agcacaaaac ttactcatga 6781 caatcagtga tcaggaaaat cctcaatagt gtggcatttg gatacattta tgtttcattt 6841 ccatgggaga gagtcataaa aataggatgt tctttctcat tctggcaaat taaaccatca 6901 attaaaaact cagatacata aaaattaaag atgtaagaat gaaaatgcta aattgttatt 6961 ttcaatcaac tattatgttt tctagctttt cattgctttt ttctgtttcc tgttaagatt 7021 aatttctttt tttttttttt tttttttttt tgagacagac tttggctctt gttgcccagg 7081 ctggagtgca gtggcacaat ctcggctcac tacaacctcc acctcccggg ttcaagcaat 7141 tctgctgcct cagcctccgg agtacctggg attgcaggca tgtgccatca caccagctaa 7201 ttttgtattt ttagtagaga cagggtttct ccatattggt caggttggtc tcgaactcct 7261 gacctcaggt gatcctcctg ccttggcctc cgaaagtgct gggattacag gcgtgagcca 7321 ccgctcccag actttttgtt ttgttttgtt ttgttttttt gagacacggt ctcgctctgc 7381 tgcctaggct ggagtgcagt ggcacgatct tggctcactg ccagctccga ctcccgggtt 7441 caggccattc tcctgcctca gcctcccgag tagctgggac tacaggcgcc caccactatg 7501 cccggctaat tttttgtatt tttagtagag acggggtttc accatgttag ccaagatggt 7561 ctcgatctcc tgaccttgtg atccacccgc ctcagccttc caaagtgctg ggattacagt 7621 cctgagccac tgcgcccggc ctggaccttt ttttttcggg gtggggggtt ggagtctggc 7681 tctgtcgccc aggctggagt gcagtggcgc catcttggct cactgcaacc tccgcctgcc 7741 aggttcaagt tcaagcgctt ctcctgcctc agcctcctga gtagctggga ttataggcgc 7801 acgccaccgt ggccggctaa ttttgtattt ttagtagaga tagggtttca tcacgttggt 7861 caggctggtc ttgaagtcct gatctcgtga tccacccgcc tcggccttcc aaagtgctgg 7921 cgtgagccac tgcgcctggc ttaagattaa tttttgtttg ttttgttttt gagacggagt 7981 ctcgctcttt cacccaggcc ggagtgcagt ggcgccatct cggctcactg caagctccgc 8041 ctcccgggtt cacgccattc tcctgcctcg gccccccaag tagctgggac tacaggcgtc 8101 caccaccacg cccggctaat tttttgtatt tttagtagag acggggtttc accgtgttag 8161 ccaggatggt ctccacttcc tgacctcgtg atccgcccac ctcggcctcc caaagtgctg 8221 ggattacagg cgtgagccac cgcgcccggc cttaagatta atttttatgg tgttttacat 8281 tcatttgtat ggaaagttct aggataggga tcatatttca cttcctttta atatagtaca 8341 gtatagcaca atttgcagtt atgtcttaat atgtgatcag gaatgatcat gactggaaac 8401 agtgttattt gtggtagcta tagggtaggt aaggttttca gcctgtttta ggtttcttga 8461 actaaaattc cttctgctgt cttctaagtc aatattggca gctatttctg acaattggta 8521 gttctttgta actttttacc tatgactata acatttttga ctttcagaag aatttgctaa 8581 aatgtgttcc ccggtgggtt gttgtttttc aacctaaacc tagctgcttt ttccagtcac 8641 ttatccgtat tggaagctca aaatgcaaat atacagtagg cctaaaatat tgcctggttt 8701 gaaaagtgtt taaaatattt gaatcatttt tatagtaaac atttactctc atcaggacct 8761 agaaggggaa cattttaatt ttttttcttt tcccttttca cagtcttcct tcaacattca 8821 ttaccttttt acatatcgga gttttcatct gttcaaagtt tgtgtttaca gtgtgtttat 8881 atagtttaga ttataattac catactgaaa tataattgtt tcagaattga gtcagtggtg 8941 agaatgaaag ccatctggta tgataactga atccaatttt tcttttacgg agaatttctt 9001 tgaaatgtag cttatctcag aaatagggat ttagtaacca atcagagttt tctttgtcaa 9061 ggttgttttt ctttttaaag tcacatttgg tcccagtaat aataccaatg ttggtacaag 9121 ttatctcagg ttgtgaagca tttttcccaa gtcatctcag gttgtgaagc attttcccaa 9181 gtagcattta attttattct tgcaatagcc caaggagtct ggcagggtga atggcaagag 9241 aaggaaacag gttcaggtag agtggttagc ccaaggtggc tctgcttata tacacaactg 9301 gtagtagaaa cccagcctcc tgacttagtt cattgttttt cttttcactg ccctgtgcta 9361 tgtcaaaaac cccatgatta caagagttgt attacaaccc ttcacaataa ggttactgtc 9421 cacaagcttt tcttgtgatc cttttctttt ttttttttct ttttttgaga tggattctct 9481 gtcacccagg ctggcccgcc ttggcctccc aaaatgctgg gattacagcg tgagccaccg 9541 cacctggccc ttgtgatcct tttctaaaaa gttaaatatt taaggaaaaa accacattct 9601 tgtcacactg ccaggttagt cgttctttga tatcttgcct ggactttatc caaaaaatcc 9661 gtttcaaaaa ttcacattta gagctaagtg tagtggctca cgcctgtaat cccggtcgag 9721 gcagatggat cacttgaggt caggacttca agaccagcct gggcaatatg gtgaaacccc 9781 ttccctacca aaaatacaaa aaaattagcc gggtgtggca gcacgcgcct gtagtcccag 9841 ctacttggaa tgctgaggca caagaatcac ttcaatccga gaggcagagg ttgcagtgag 9901 ccaagaccac accactgcac tccagcctga gcagcagagt gagtgagact ccatctccaa 9961 aaaaaaaaaa aaaggttcac attcagaaga aagctaaagg ccgggtatag tagctcacac 10021 ctgtaatccc agcactttgg gaagccgaag caggaagatt gcttgatgcc aggcattcaa 10081 gaccagcatg ggcatcatag tgagatcctg tctctacaaa aattaattaa cattaaaaat 10141 taaaaagatg gctggcatgg tggctcactc ctgtaatccc agtactttgg gaggccaagg 10201 catggtggtg catgccttta gtcccagcta ctcgggaggc tgaggcagga gaatcacttg 10261 aattcaggag gcggaggtta cagagagccg agatggtgcc actgcactcc agcctgggcg 10321 acagaacgag actctgtctg aaaaaaaaaa agaaaattaa aaagaccaga ataaagctaa 10381 agatttaaaa tagcctatag gttcctacca gaagttacca gctacctctc tgatagtctt 10441 tccctacaat atcctcctgg attattacat tttagcacct tgacctatct gatgtcctgc 10501 atacacaggc atggtcctgc tcagggtttg ccttctctgc tccctctttc ttggaatgct 10561 cttcccctaa ttgttgcata gtgtgtttct ttacattatt aagctatcct ctagtctcac 10621 ctcagtgaaa cctttcctga ctccccccat gtacatctca cccccacata gatattgaac 10681 tacctgtttc cccttaccct gcttaatttt tctctttaat gcacttattc ccatgtattc 10741 tttaattccg tatcaactgt ctaccacact agaatatgag ctctatgaga gcaggcttta 10801 ttttgtaaac tgctacattt ctatctccta gaatagtact tgaatatagt agtagatact 10861 taataaacac ttgttatatt agtataataa atgaactaat ctcaggaatg ccttggtttt 10921 gtggatagac aggtagggat gggaacttgg gtgatgtatt ttctgaagtt tttattttta 10981 agcttattat tattttgaga tggagtccag ctctgtcgcc caggttggag tacagtggcg 11041 cgatcttggc tcactgcaac gtgcacttcc ccggttcaag cgattctcct gccttagcct 11101 cccaagtagc tgggattaca ggcgcatgcc accatgccca gttagttttg gtatttttag 11161 tagagacagc gtttcactgt gttggccagg ctggtctcga aatcctgacc tcatgatccg 11221 cccgcctcgg cctcccaagt gctgggatta caagcatgag cccccgtgtc tggccttatt 11281 ttcttttttt tgagacagag tcttcctctg tcacctaggc tggagtgcag tggcacgata 11341 ttggctcact ctgcaacctc cacctccagg attcaagtga tccttctacc ttagtctcca 11401 aagtagctga gaccacaggc atgccccacc acgcccggct aatttccgta ttttaagcgt 11461 agacagggtt tcaccatatt gtccaggatg atctggaact cctgagctca ggtgatccac 11521 ccacctcagc ctcccaaagt gctaggatta caggcatgag gcaccatgcc cggccttaag 11581 cttatcattt tctaaatttc ctttagtgag tacttattac actgttttta caaagtaatc 11641 acaaaccaaa catcatgcct cttctgaagt gatctaataa gagtacacag taccatctgt 11701 aaagtgttct tgccagaaag ttgaacctga atgattaagc ctgtaagtct agtttatagg 11761 aaataaggct agaggaacaa gttaaacctc accatagggt tatacaatca gcaaaatcca 11821 gaatggggga aactccacag gtcaaatgac ctaattttaa aaataaatga caagggagaa 11881 aaagtaagag acacctatag atcagaagac acttggggct gggcatggtg gctcacacct 11941 gtaatcccag cactttggga ggccaaggca ggcggatcac ctgaggtcag gagttcaaga 12001 ccagccggcc aacatggtga accccaactc tactaaaatt acgaaaaatc agccgggcgt 12061 ggtggcgcac gcgtgtagtt ccaactacct gggaggctga ggcaggagaa tcacttgaac 12121 ttgggaggca gaggttgcag tgagccgaga tcgcaccatt gcatgccagt ctgggctaca 12181 aaagcaaaac cccatctcaa aaaaaagaag acacttgggt ttgggtgtgt tggctcatgc 12241 ctgtaaaccc cgtgctggga ggattgcttg agcccaggag ttcaaggctg cagtgaggta 12301 tgtttgcacc actgcactcc agcctaggtg acagagtgtg accttatctt aaaagtaata 12361 ataattaaaa taatctgggg taggggtgga tatgggtgaa acagcttggc catgagttga 12421 tggttgttgg accagggtga tggtccatat agttcatttt attattttat ttacttgaaa 12481 ttttgaaata cttgaaattt tccatattaa gttaaaaagg catttacagt aaacaaaaaa 12541 aagttctagg aaggaattca aaagaaatat aagcagaaaa ttttgtcttt atggagctta 12601 aagatgagat gtgcacccac agtgatagtg cagaaaaata tatcactgga aatgaattcg 12661 tacgaactat tatcaactaa tcttttaaat gctgatgata gtatagagta ttgaagggat 12721 caatataatt ctgttttgat atctgaaagc tcactgaagg taaggatcgt attctctgct 12781 gtattctcag ttcctgacac agcagacatt taataaatat tgaacgaact tgaggcctta 12841 tgttgactca gtcataacag ctcaaagttg aacttattca ctaagaatag ctttattttt 12901 aaataaatta ttgagcctca tttattttct ttttctcccc ccctaccctg ctagtctgga 12961 gttgatcaag gaacctgtct ccacaaagtg tgaccacata ttttgcaagt aagtttgaat 13021 gtgttatgtg gctccattat tagcttttgt ttttgtcctt cataacccag gaaacaccta 13081 actttataga agctttactt tcttcaatta agtgagaacg aaaaatccaa ctccatttca 13141 ttctttctca gagagtatat agttatcaaa agttggttgt aatcatagtt cctggtaaag 13201 ttttgacata tattatcttt tttttttttt ttgagacaaa gtctcgctct gtcgcccagg 13261 ctggagtgca gtggcatgat cttggctcac tgcaacctcc gccccccgag ttcaagcgat 13321 tcttctacct cagcctccca ggtagctggg actacaggca cccgccacca tgcttggcta 13381 atttttgtac ttttagtaga gataaggttt caccatattg gccaggctgg tctcgaactc 13441 ctgaccttgt gatccacctg cctcggcctc ccaaagttct gggattacag gcgtgagcca 13501 ccacacccga ctgacatata ttatctatta ggatgtaaca tcattttgaa cagtgttttg 13561 tattttttgt gtccatcagt gaaagcaaac tgcaagcagt tttgaaataa gcacattgtg 13621 tttgagcctt cccagtttct cctttctgtt catttctgca tatccttatg cattccccct 13681 tctaagggtc agtgtttgcc cgctttgtaa tcattgtgaa gacaggaaag gacctgatac 13741 cagtttctat ttaggccaaa attcatttat agcagtgatt caagttatat ttacgtattt 13801 gatgatcttg tcttttgaaa tgaaaatgtt tgtttcttaa taaaagaatt tcagaaaaag 13861 tagagtaggt aatttagtag aacaagtggg ctttctcctt ttctttatgt taagctatgg 13921 ctcacatctt accttaaatg tcaactaatt tgtttttaag tatttatgta cctggtacat 13981 aacctggtac caggtacaaa ctatgtactt ggtaaaaagt ttattagcac aaaaaggtat 14041 atgatgcaaa gtatacttcc ctcttaccct acaacccctg cctccctgtt ccctccccag 14101 acaaccacaa tgatcaattt cttatgtatc ctttgaggaa tttttaaatt ccagagttct 14161 taacttgggg tttatgaata gtctttatga atttcctaga attatattta aattgtattc 14221 aaaactatgg ccatgtacat ttttctggga agatagtcca taattttcat ctgagtgagc 14281 taagatcatg ccactgcact ccagcctggg agacaagagg gagactcaaa aaaaaaaaag 14341 aaaggcccag tatttactac agagagctaa agattaacct ttaaagccct ggggctttca 14401 atttatctgg atgagaatct ttctggaatg aactgtatgt tttatggtca gcttgagtaa 14461 caaatgctga gcatactata ctattattac agggactcag gggcccagtg tggtagctcc 14521 tgcccataac cccagcactt tgggaggcca aggcaggagg atcacttaag gccaggagtt 14581 cgaagctgta gtgagctatg atcacaccac tgcactccag cctagatgac agagtgagac 14641 cctgtctttt tttttttttg agatggtgtt tcactctatt gcccaggctg gagtgcagtg 14701 gtgtgatctc ggctcactgc aacctccacc tcctgggttc aagcgattct cctgcctcag 14761 cctcttgagt agctgggatt acaggcatct gccaccacac ccagctaatt tttgtatttt 14821 tagtcgagac agggttttca ccatgttggc caggctgctc tcaaactcct gacttcagct 14881 accttggcct taaaaagtgt tgggattaca ggtgtaagcc accgcgcctg gctgaccctg 14941 tctcttaaca aaaaaagaga gattaagtta tgaatatagt tgctttgaga acttgtggaa 15001 gaaggaaatt ataggcttat aggcagagat aataatacga gcaaatgtac aaataaaaga 15061 aaatagagga cgggcgcggt ggctcacgcc tataatacca gcactttggg aggtcgaggt 15121 gggcggatca cgaggtcagg aaattaagac catcctggcc aaaatggcga aacactgtct 15181 ctactaaaac acacaaaaaa ctagcctggc atggtggcac gtacctgtag tcccagctac 15241 ttggtaggct gcggcagggg tatcacttga acctgggagg cagaggttgc cctgagccga 15301 gatcatgcca atgcactcca gcctgacaac agagtgagac tctgtctgaa aaaaaagaaa 15361 agaaaagaaa atacatccag gaaaaataag ctaactttgc atatgtgtat aggagttgtg 15421 ttagaaaagg aagaagccct caaagatggg aagccatttg caagaaagag aaggtccaag 15481 aggaggcaga agggattgga aatagaaaaa ggatgtaaga aagagttgat tattactcat 15541 aaacagtaat gaaggaaaag gagagtaatt ctacaggaag atgctgaggt gctttgagcc 15601 cagtgaagtt ggaggtaaag acagctgttg aggccgggca cggtggctca cgcctctaat 15661 cctagcactt ttggagccca aggcaggtgg atcacctgag gtcaggagct caagaccagc 15721 ctgaccaaca tagagaaacc ccatctctac taaaaataca aaattagacg ggcgtggagg 15781 cgcatgcctg taatcccagc tacttgggag gctgaggcag gagaatcact tgaacctggg 15841 aggcggaggt tgcagtgagc cgagattgcg ccattgcact ccagcctggc cgacaagagt 15901 gaaaactgtc tcaaaaaaaa aaaacaacaa aaaacagctg ttgagattga gaggattaga 15961 gttggcaact ggagaagagt gagaagcttg gtttcaagct tgtgatagtc aggattgtga 16021 tagtcaggaa agaaccagtc ataaagatat atgtgtgtgt atacatataa atatgttata 16081 tatatgtgtg tgtgtgacac atatatattt ttgtttgttt ctttgagaca gtgtctccct 16141 ctgacaccca ggctggagtt cagtggtgtg atcatagttc acttttacct tgcaatctgg 16201 gttcaagcaa tctctcatct cagcccctca agtagctagg actacaggta catggcattt 16261 gcccagctaa tttttaagtt tcttgtagag atgggccagc catattttaa attgtgtttt 16321 gaatgttata ttagaattaa aagtccaaag ccgggtgtgg tggctcacgc ctgtaatccc 16381 agcactttgg gaggctgagg tgggcggatc acgaggtcag gagttcgaga ccagcctggc 16441 caatatggta acaccatctc tactaaaaat acaaaaatta gctgggtatg ggggcacatg 16501 cctgtagtcc cagctactca ggaggctgag gcagaggaac ctcttgaacc caggaggcag 16561 aggctgcagt gagttgagat cgtgccactg tactctagcc tgggcgacag agcaagattc 16621 cgtctcaaaa aaaaaaaaag tccagtataa tgcccatgtg atagatcgac tttttcatga 16681 aatctcttct gtaatatcaa tataatctga ataacacttt gatctatatg atgagaaagc 16741 tgggagcctg ggagcgatac ccccatgctt ttgttgtatt aattgtattt tctacggata 16801 aactctaatt gctaaaaata aaacaacttt attgacccaa gcaagcctaa agttctgaaa 16861 tctttttttt atttttgttt gtttgtttgt ttgtttttgt ttgttttgtt ttgagacgga 16921 gtctcgctct gtcgcccagg ctggagtgcg gtggtgcagt ctcggctcac tgcaagctcc 16981 acctcccggg ttcacaccat tctcctgcct cagcctccca agtagctggg actacagacg 17041 cctgccacca cgcccagcta atttttttgt atttttagta gagaaagggt ttcaccgtgt 17101 tagccaggat ggtctcgatc tcctgacctc gtgatctgcc cgccttggcc tccctaagtt 17161 ctgggattac aagtgtgagc caccacgccc ggctgttttt ttttgttttg ttttgagacg 17221 gagtctcact gtgttgccca gactggagtg cagtggcatg atctcagctc actgccacct 17281 ccatctcctg ggttcaagca aatctcctgc ctcagcctcc cgagtagctg ggactacagg 17341 catgtgccac cacacctggc taatttttgt atttttagta gagacggggt ttcactatgt 17401 tggccaggct ggtccaaaac tcctgacctc aggtgatctg ctcgccttgg cctcccacag 17461 tgccaggatt acaggcatga gccaccttgc ccagccagtt ctgaaatctt ttatgaagcc 17521 tataaaaaaa gataataata ccaatctaga aaatatttct taaggcagtc atgcattagt 17581 ttgaactttc caaacaaaaa aatgcaatgt gtaatacttt tttttttttt tttgagatgg 17641 agtcttgttc tgttgcccag gctggagtgc agtggtacaa tctcggctca ctgcagcctc 17701 tgcctctctg gttcaagtga ttctcctgcc tcagcctccc aagtagctgg gattacaggc 17761 gtgcaccacc atgcatggct aatttttgta tttttagtag agacagggtt tcaccatgtt 17821 gacaaggctg atctcgaact cctgacctca ggtgatccgc ccacctcagc ctcccaaagt 17881 gctgagatta caggcattag ccaccacgcc cagcctttta ttttagtaga gaccatgttt 17941 caccatgttg accaagctgg tcttgagctg acctcaagtg atccgcccac ctccacctcc 18001 caaaatggtg ggattatagg catgagccac cgcacccagc ctgtaatact tttttgaaga 18061 tctagaacca cattgttcaa agagatagaa tgtgagcaat aaatgtaact taaatttttc 18121 aacagctact tttttttttt ttttttgaga cagggtctta ctctgttgtc ccagctggag 18181 tacagtggtg cgatcatgag gcttactgtt gccttgacct cctaggctca agcgatccta 18241 tcacctcagt ctcccaagta gctgggactg taagtgcaca ccaccatatc cagctaaatt 18301 ttgtgttttc tgtagagacg gggtttcgcc atgtttccca ggctggtctt gaactttggg 18361 cttaacccgt ctgcccacct aggcatccca aagtgctagg attacaggtg tgagtcatca 18421 tgcctggcca gtattttagt tagctctgtc ttttcaagtc atatacaagt tcattttctt 18481 ttaagtttag ttaacaacct ttatacatgt attctttttc tagcataaag aaagattcga 18541 ggccgggtgc ggtggctcac gcctgtaatc ccagcacttt gggaggctga gatgggcaca 18601 tcacgaggtc aggagatcga gaccatcctg gctaacatgg tgaaaccccg cctctactaa 18661 aattacaaaa agttagccag gcgtggtagc gggcacctgt agtcccagct actcaggagg 18721 ctgaggcagg agaatggcgt gaacccagga ggcagagctt gcagtgagca gagattgtgc 18781 cactgcactc cagcctgaga gacagagcga gactccgtct aaaaaaaaaa aaaaagattc 18841 gaatccttat cttggttgat ttttgcgtat ctagttccac tgaattattt atataattgt 18901 atagactaca gcacgagaca gcttagcttg tcactctact gtactatatt ctgcagtact 18961 atcataaggg aatttcctcc ctacccctgc tctgaattgt tcaattgtac tatttgctgg 19021 agtaatgctt gatgccttct tgatccatta tactagagta tatgtagtat ttgtagattc 19081 tgaaggagtg ggagcctcta ttctgagttt taaaggtact tatgtacagt ggaggtagct 19141 ttttgacagc ctcatcttcc aaactataga gtcattgttt tgttgagtgc aatatggtac 19201 ttgaagcatc tatatcggcg aagaaggacc caagtctcct tgaccttacc tacctacatt 19261 cactttctct ggtaggaaga ttgtgggtgc ctctctccag acttagtttc catgtcaaaa 19321 aagaaaaaag gaagattgtg ggctttgcta caatccaatt ctggatccaa tataaccttc 19381 attgcttaat tactgtgtga tctgggacaa gcctctactc tataaaaatg aagataaggc 19441 caggcttgat ggctcatgcc tgtaatccca gcacgttggg atgccaaggc aggaggatca 19501 cttgaggtca ggagttcgag accagactgg gcaatatagt gaaaccacat ctgtacaaaa 19561 ataaagatag aaagtagccc agcgcaatgg ctcacacctg taatcccagc actttgggag 19621 gctgaagcag gcgatcactt gaggtcggga gttcaagact gtagacagat agataggtag 19681 gtagatagat agagatatag atatagttgg ggtttttttg ttttgttttg ttttgttttt 19741 gagatggagt ttcgctcttg ttgcccaggc tggagtgcaa tggcgcgatc tcagtttact 19801 gcaacctccg cctcccgggt tcaagagatt ctcctgcctc agcctcctga gtagccagga 19861 ttacaggcat atgccaccat gcccggctaa tttttgtatt tttagtagag acagggtttc 19921 tccgtgttgg tcaggctggt cttgaactcc tgacctctcc caaagtgttg ggattacagg 19981 cgtgagccac cgctcctggc cttttttttt tttttttttt tttttttgag acagagtctt 20041 cctctgttgc ccagggtgga gtgcagtggc actcttctca gctcattgca acctctgcca 20101 tcctgggttc cagtgattct catgcctcag cctcccaagt agctgggact caggcgtgtg 20161 cccaccacgc ctggctaatt ttgttgtatt tttagtagag acagggtttc accatgttag 20221 ccaggctggt ctcaaactcc aggcctcaag tgatctgcct gcctcagcct cctgggattg 20281 cagacatgag ccactgcacc cggccaagag agggtaataa atgttaaatt acctggctag 20341 taaaaaatat tctctaagtg tcttttctca caattcccaa tgcctttttt ttttttttgg 20401 cacaatctca ctctgttgcc caggctggaa tgcaatggtg caatattggc tcactgtaac 20461 ccccgcctca caggttcaac ttattctcat gcctcagcct cccgagtaac tgggactaca 20521 gtgcaccacc accacaccca gctaattttt gaatatttag tagagacagg gtttcaccat 20581 gttggccagg ctggtcttga actcctggcc tcaagtgatt cacccacccc gcaagtgctg 20641 ggattacagg tgtggaccac cgtgcacagc cctagtgact ttttttttag ccccttaatc 20701 ttttctttcc tgggtctctt cattgtcagt gtctgctatt tactccctac ctagtcaccc 20761 ccttcaccag tatattatgt cctttatgtt ttattttgca ggatcttatt ttgcttttct 20821 attgaatccc ctccatctag aatagtacta gacatagtaa atattggttg tatgagtgaa 20881 tcgctgcttt taattatcat caccattgct ctctctactt ctggtctatg atccactttg 20941 agttaacttt tgttatttgg tgtgagatag gagtataatt tcattctttt acatgtggtt 21001 atacttttgt ctcaacactg tttgttaaaa acacaaaaag tattattttc ccatttaatc 21061 atctttggcc tgggcacggt ggctcatgcc tgtaatccca gcactctgga aggccaaggc 21121 agatggatca atttgaggcc aggagttcaa gactagccaa catggtgaaa ctaaaaatac 21181 aaaaaattag ctgggtatgg tggtgcatgt ctgtaatccc agctactcgg gaggctgagg 21241 cacgagaatt gcttgagcct aggaggtgga ggttgtagtg agctgagatt gtgtcactac 21301 cctccagcct gggtgataga gtgagtctgt ctcaaaaaaa aaaaaaaaaa attaagaaaa 21361 taaaaatcgt cggccaggca tggtggctca cacctgtaat cccagcactt tgggaggcag 21421 aggcgggcag atcacgaggt caggagatgg agaccatcct ggctaacatg gtgaaacccc 21481 gtctctacta aaaataaaaa aattagccgg gcatggtgct gggcgcctgt agtcccagct 21541 gctcgggagg ctgaggcagg agaatggcgt gaacccagga ggtggagctt gcagtgagcc 21601 gagatcgtgc cactgcactc cagcctggga gacagagcga gactccgtct caaaaaaaaa 21661 aaaaaaaaaa attgtcttgg tatttattat tgttgaaaat cgcttgatca cagatgtatg 21721 tatgagttta tttctgtact gtcaattcca ttttattgat gtatgtgtct attcttatgc 21781 tattaccaca ctttcttgat tactatagct ttgtggtgag gtgttgagat tttaaactaa 21841 ttataagcat cttacatgaa ctacttaccg tttatatttg attatgcagc atgaaataat 21901 tatgaatata tcattaaata tgccatatta acttttatta agttttatgt gatcataaca 21961 gtaagccata tgcatgtaag ttcagttttc atagatcatt gcttatgtag tttaggtttt 22021 tgcttatgca gcatccaaaa acaattagga aactattgct tgtaattcac ctgccattac 22081 tttttaaatg gctcttaagg gcagttgtga gattatcttt tcatggctat ttgccttttg 22141 agtattcttt ctacaaaagg aagtaaatta aattgttctt tctttcttta taatttatag 22201 attttgcatg ctgaaacttc tcaaccagaa gaaagggcct tcacagtgtc ctttatgtaa 22261 gaatgatata accaaaaggt atataatttg gtaatgatgc taggttggaa gcaaccacag 22321 taggaaaaag tagaaattat ttaataacat agcgttccta taaaaccatt catcagaaaa 22381 atttataaaa gagtttttag cacacagtaa attatttcca aagttatttt cctgaaagtt 22441 ttatgggaca tctgccttat acaggtatta gaaacttact gcctttctct aatgcttcta 22501 gtgtaaaaac ttgcagactt atgtaaagta gggctgtatc gccgtgcccc cattgtctgt 22561 taatcttgtt tttatatttt tgattgtgtt tccttttctt tttttttttt tttttaagac 22621 agggtcctgc tctgtcactg aggctggagt gcagtggcgt gatctcggct cactgtagcc 22681 tctgtctccc agcctcttcc tgccttagcc tcccaaatag ctgggactac aggcacacgc 22741 taccatgccc ggccaatttt tgtatttttt gtagagatga ggttttacca tgttgcccag 22801 gctggtaact cctgagctca ggtgatctgc ccacctcggc ctcccaaagt gctggggttc 22861 acaggtgtgt gtttatttct atctaattat ttacacaaac acaatgtatt tatatattgt 22921 gtatctcttc tgctacaatg taaattctat gagagtagta attttgtctg tctcaacact 22981 gtttttccta agtttggtac atagtaggca ctcagatgct taaaggaatg aatgaattgt 23041 gctttaattc cactttacta aacccaaatc tccctttgga cattgttatc tatgtgtttt 23101 caaagaagta taatcataat ttgacagaaa tccttgagag gcagaactaa gtgagggatt 23161 gggcagggtt cagatgttaa gaacagtaag ctcagcaggg tgtgattgct catgcctata 23221 accctagcac tctaggaggc tgaggtggga tgattgcttg aggccaggag tttgaaatca 23281 gcctgggcaa catagtgaga ccccatcact accaacaaaa taaataaata aatgtacatg 23341 gtggcatatg cccatagtcc tagctacttg ggaggctata gtgggaggat agcttgagta 23401 cagaagtctg aggctgcagt gagctatgat tgtggcactg catgctagcc tgggcaatag 23461 agcaagaccc tgtctctaaa ttaaacaaaa aaaaaagtac tctagttttc tatgcaatgc 23521 attatatctg ctgtggattt agggcagtat tatatcagat aattttaggc atttggtagg 23581 cttaaatgaa tgacaaaaag ttactaaatc actgccatca cacggtttat acagatgtca 23641 atgatgtatt gattatagag gttttctact gttgctgcat cttattttta tttgtttaca 23701 tgtcttttct tattttagtg tccttaaaag gttgataatc acttgctgag tgtgtttctc 23761 aaacaattta atttcaggag cctacaagaa agtacgagat ttagtcaact tgttgaagag 23821 ctattgaaaa tcatttgtgc ttttcagctt gacacaggtt tggagtgtaa gtgttgaata 23881 tcccaagaat gcaactcaag tgctgtccat gaaaactcag gaagtttgca caattacttt 23941 ctatgacgtg gtgataagac cttttagtct aggttaattt tagttctgta tctgtaatct 24001 atttttaaaa aattactccc actggtctca caccttattt tatcaatcgt aaggtgcaca 24061 tttttcacat cttaacatct ctgaaattgg gaacatttta ctattgaggg tgtgtcattt 24121 gtttaatttg tgtgctttct ttcttagtga tacacgaaat aatagtgcca cttacattgt 24181 tggtgtctta gctttagtga aatacagtat tgataggcaa atttcttagt gttaaggtag 24241 aaaacaagga ctctaaataa ctttgatggt ctgtgtattt gtttttgttt cctaggagta 24301 aaatttccag ttgatttttt aaaatttgat ttttaaaaaa aatcacaggt aaccttaatg 24361 cattgtctta acacaacaaa gagcatacat agggtttctc ttggtttctt tgattataat 24421 tcatacattt ttctctaact gcaaacataa tgttttccct tgtattttac agatgcaaac 24481 agctataatt ttgcaaaaaa ggaaaataac tctcctgaac atctaaaaga tgaagtttct 24541 atcatccaaa gtatgggcta cagaaaccgt gccaaaagac ttctacagag tgaacccgaa 24601 aatccttcct tggtaaaacc atttgttttc ttcttcttct tcttcttctt ttcttttttt 24661 tttctttttt ttttttgaga tggagtcttg ctctgtggcc caggctagaa gcagtcctcc 24721 tgccttagcc cccttagtag ctgggattac aggcacgcgc caccatgcca ggctaatttt 24781 tgtattttta gtagagacgg ggtttcatca tgttggccag gctggtctcg aactcctaac 24841 ctcaggtgat ccacccacct cggctcccca aattgctggg attacaggtg tgagccactg 24901 tgcccggccg gtaaaaccat tttcatttat tctggcaaca tctctttatt gagcattgtg 24961 aatatgttag tgaatgtgct agatgctcat agatttatat aaaaagttag tgaagaagga 25021 aagatggtat attaagtggt tagacaagtg ttctaatcag ttagagttca gagaaggtca 25081 gggtacctga tataatcaag agagagacct tacagccagg tgaggtgaat gtacctataa 25141 tcccagctac ttaggaggct gaaatgggag gatcacttga gtccaggttt gagaccagcc 25201 caggcaacat agcaagatcc ccatcagata caccaaaaag acagatttct tttttttttt 25261 tttttttgag acagagtctc gctctgtcgc ccaggctgga gcgcagtgac acgatgtcag 25321 ctcactgcaa cctccgcctc ccaggttcaa gtgattctcc tgcctcagcc tcctgagtag 25381 ttgggactac aggggtacga caccagacct ggctaatttt tgtaatttta gtagagtcgg 25441 ggtttcacca tattggtcag gctggtctcg aactcctgac ctcaggtgat ccaccctcct 25501 tggcctccca gagtgctggg attacaggcg tgagccacca agcccggcca aaaaagagag 25561 ctcttatagg cccttccttg ctttggagct ttatctgctc tgtgatgctt atctaaaata 25621 gccataaggt cactgatatt tttaagcatt tggaaattac ttcagctggg tgccatggct 25681 catgcctata atcccaaccc tttgggaggc tgaggtagga ggtcctttga gcccagcttg 25741 ggcaacacag tgagacactg tctctgcaat taaaaaaaaa aaaaaagtag ctgggtgccg 25801 tggctcacgc ctgtaattcc agcactagga ggcttgagga ttgcctgagc tcaggagttc 25861 aagaccagtt tgggcaacat agcaagtcct tgtctatatt aaaagttttt ttaaattatc 25921 tgggcatggt ggtgtgtgcc tgtagtccca gctacttggg aagctgagac agaaggatca 25981 cttgagtcca ggagatgtag actacagtga gctatgatca ctccactgca cttcagcgtg 26041 ggcggcaaag caagatctag ttgcaaaaaa aaaaagaact ggctgggtgc ggcggctaac 26101 acctgcaatc ccagcacctt gggaggctga ggccagtgga tcatgaggtc aggagattga 26161 gaccaccctg gccaacatgg tgaaacccgg tctctactaa aaatacaaaa attagctggg 26221 tgtggtggca cgtgcctgta atcccagcta ctccagaggc tgaggatgga gaatcacttg 26281 aacctgagag tcggaggttg cagtgagccg agattgcgcc actgcactcc agcctggcga 26341 cagagcgaga ctccgtctca aaaaaaaaaa aaaaaaagct tcacgcctgt aatcccagca 26401 ctttgggagg ccgagtcaag tggatcacga ggtgtggaga tcaagactat cctggctcac 26461 atggtgaaag cccgtctcta ctaaaaacac agaaaaatta gctgagcgtg atggcggact 26521 cctgtagtcc cagctactcg ggaggctgag gcaggagaat agcatgaacc cgggaggtgg 26581 agcttgcagt gagccgagat cccgccactg cgatccagcc tgggcgacag agtgagactc 26641 tgtctcaaaa aaaaaacaaa aaaacttagc tgggcgtggt ggtatgcacc tgtggtccta 26701 gctacttggg aggctgaggc tggagcattg ctttaacata gagagtcaag gctgcagttg 26761 agctatgact gtgccactgg actccagcgc aggtgactga gaccctatct tttaaaaaaa 26821 gggaaaatta cttgaactta aaaggtgtaa ttgttaaaga aaatgtagtg atttgctctg 26881 ttgttactta tatgtgcatg aatgatggag atcttaaaaa gtaatcattc tggggctggg 26941 cgtagtagct tgcacctgta atcccagcac ttcgggaggc tgaggcaggc agataatttg 27001 aggtcaggag tttgagacca gcctggccaa catggtgaaa cccatctcta ctaaaaatac 27061 aaaaattagc tgggtgtggt ggcacgtacc tgtaatccca gctactcggg aggcggaggc 27121 acaagaattg cttgaaccta ggacgcggag gttgcagcga gccaagatcg cgccactgca 27181 ctccagcctg ggccgtagag tgagactctg tctcaaaaaa gaaaaaaaag taattgttct 27241 agctgggcgc agtggctctt gcctgtaatc ccagcacttt gggaggccaa ggcgggtgga 27301 tctcgagtcc tagagttcaa gaccagccta ggcaatgtgg tgaaacccca tcgctacaaa 27361 aaatacaaaa attagccagg catggtggcg tgcgcatgta gtcccagctc cttgggaggc 27421 tgaggtggga ggatcacttg aacccaggag acagaggttg cagtgaaccg agatcacgcc 27481 accacgctcc agcctgggca acagaacaag actctgtcta aaaaaataca aataaaataa 27541 aagtagttct cacagtacca gcattcattt ttcaaaagat atagagctaa aaaggaagga 27601 aaaaaaaagt aatgttgggc ttttaaatac tcgttcctat actaaatgtt cttaggagtg 27661 ctggggtttt attgtcatca tttatccttt ttaaaaatgt tattggccag gcacggtggc 27721 tcatggctgt aatcccagca ctttgggagg ccgaggcagg cagatcacct gaggtcagga 27781 gtgtgagacc agcctggcca acatggcgaa acctgtctct actaaaaata caaaaattaa 27841 ctaggcgtgg tggtgtacgc ctgtagtccc agctactcgg gaggctgagg caggagaatc 27901 aactgaacca gggaggtgga ggttgcagtg tgccgagatc acgccactgc actctagcct 27961 ggcaacagag caagattctg tctcaaaaaa aaaaaacata tatacacata tatcccaaag 28021 tgctgggatt acatatatat atatatatat atatcatatc tatatatata tatatgtaat 28081 atatatgtta tatatatatt acatatatat atgttatata tatgttatat atatataata 28141 tatatatgtt atatatatgt tatatatata tatacacaca cacacacata tatatgtata 28201 tatatataca cacacacaca catattagcc aggcatagtt gcacacgctt gtagacccag 28261 ctactcagga ggctgaggca ggagaatctc ttgaacttag gaggcggagg ttgcagtgag 28321 ctgagattgc gccactgcac tccagcctgg gtgacagagc aggactctgt acacccccca 28381 aaacaaaaaa aaaagttatc agatgtgatt ggaatgtata tcaagtatca gcttcaaaat 28441 atgctatatt aatacttcaa aaattacaca aataatacat aatcaggttt gaaaaattta 28501 agacaacaga aaaaaaaatt caaatcacac atatcccaca cattttatta ttactactac 28561 tattattttg tagagactgg gtctcactct gttgcttatg ctggtcttga actcctggcc 28621 tcaagcagtc ctgctccagc ctcccaaagt gctgggatta taggcatgag ctaccgctcc 28681 cagccccaga cattttagtg tgtaaattcc tgggcatttt ttccaggcat catacatgtt 28741 agctgactga tgatggtcaa tttattttgt ccatggtgtc aagtttctct tcaggaggaa 28801 aagcacagaa ctggccaata attgcttgac tgttctttac catactgttt agcaggaaac 28861 cagtctcagt gtccaactct ctaaccttgg aactgtgaga actctgagga caaagcagcg 28921 gatacaacct caaaagacgt ctgtctacat tgaattgggt aagggtctca ggttttttaa 28981 gtatttaata ataattgctg gattccttat cttatagttt tgccaaaaat cttggtcata 29041 atttgtattt gtggtaggca gctttgggaa gtgaatttta tgagccctat ggtgagttat 29101 aaaaaatgta aaagacgcag ttcccacctt gaagaatctt actttaaaaa gggagcaaaa 29161 gaggccaggc atggtggctc acacctgtaa tcccagcact ttgggaggcc aaagtgggtg 29221 gatcacctga ggtcgggagt tcgagaccag cctagccaac atggagaaac tctgtctgta 29281 ccaaaaaata aaaaattagc caggtgtggt ggcacataac tgtaatccca gctactcggg 29341 aggctgaggc aggagaatca cttgaacccg ggaggtggag gttgcggtga accgagatcg 29401 caccattgca ctccagcctg ggcaaaaata gcgaaactcc atctaaaaaa aaaaaagaga 29461 gcaaaagaaa gaatatctgg ttttaaatat gtgtaaatat gttttggaaa gatggagagt 29521 agcaataagg aaaaacatga tggattgcta cagtatttag ttccaagata aattgtacta 29581 gatgaggaag ccttttaaga agagctgaat tgccaggcgc agtgctcacg cctgtaatcc 29641 cagcactttg gaaggccgag gtgggcggat cacctgaggt cgggagttca agaccagcct 29701 gaccaacatg gagaaacccc atctctacta aaaaaaaaaa aaaaaaaatt agccggggtg 29761 gtggcttatg cctgaaatcc cagctactca ggaggctgag gcaggagaat cgcttgaacc 29821 caggaagcag aggttgcagt gagccaagat cgcaccattg cactccagcc taggcaacaa 29881 gagtgaaact ccatctcaaa aaaaaaaaaa aagagctgaa tcttggctgg gcaggatggc 29941 tcgtgcctgt aatcctaacg ctttggaaga ccgaggcaga aggattggtt gagtccacga 30001 gtttaagacc agcctggcca acatagggga accctgtctc tatttttaaa ataataatac 30061 atttttggcc ggtgcggtgg ctcatgcctg taatcccaat actttgggag gctgaggcag 30121 gtagatcacc tgaggtcaga gttcgagacc agcctggata acctggtgaa acccctcttt 30181 actaaaaata caaaaaaaaa aaaaaattag ctgggtgtgg tagcacatgc ttgtaatccc 30241 agctacttgg gaggctgagg caggagaatc gcttgaacca gggaggcgga ggttacaatg 30301 agccaacact acaccactgc actccagcct gggcaataga gtgagactgc atctcaaaaa 30361 aataataatt tttaaaaata ataaattttt ttaagcttat aaaaagaaaa gttgaggcca 30421 gcatagtagc tcacatctgt aatctcagca gtggcagagg attgcttgaa gccaggagtt 30481 tgagaccagc ctgggcaaca tagcaagacc tcatctctac aaaaaaattt cttttttaaa 30541 ttagctgggt gtggtggtgt gcatctgtag tcccagctac tcaggaggca gaggtgagtg 30601 gatacattga acccaggagt ttgaggctgt agtgagctat gatcatgcca ctgcactcca 30661 acctgggtga cagagcaaga cctccaaaaa aaaaaaaaaa agagctgctg agctcagaat 30721 tcaaactggg ctctcaaatt ggattttctt ttagaatata tttataatta aaaaggatag 30781 ccatcttttg agctcccagg caccaccatc tatttatcat aacacttact gttttccccc 30841 cttatgatca taaattccta gacaacaggc attgtaaaaa tagttatagt agttgatatt 30901 taggagcact taactatatt ccaggcacta ttgtgctttt cttgtataac tcattagatg 30961 cttgtcagac ctctgagatt gttcctatta tacttatttt acagatgaga aaattaaggc 31021 acagagaagt tatgaaattt ttccaaggta ttaaacctag taagtggctg agccatgatt 31081 caaacctagg aagttagatg tcagagcctg tgcttttttt ttgtttttgt ttttgttttc 31141 agtagaaacg ggggtctcac tttgttggcc aggctggtct tgaactccta acctcaaata 31201 atccacccat ctcggcctcc tcaagtgctg ggattacagg tgagagccac tgtgcctggc 31261 gaagcccatg cctttaacca cttctctgta ttacatacta gcttaactag cattgtacct 31321 gccacagtag atgctcagta aatatttcta gttgaatatc tgtttttcaa caagtacatt 31381 tttttaaccc ttttaattaa gaaaactttt attgatttat tttttggggg gaaatttttt 31441 aggatctgat tcttctgaag ataccgttaa taaggcaact tattgcaggt gagtcaaaga 31501 gaacctttgt ctatgaagct ggtattttcc tatttagtta atattaagga ttgatgtttc 31561 tctcttttta aaaatatttt aacttttatt ttaggttcag ggatgtatgt gcagtttgtt 31621 atataggtaa acacacgact tgggatttgg tgtatagatt tttttcatca tccgggtact 31681 aagcataccc cacagttttt tgtttgcttt ctttctgaat ttctccctct tcccaccttc 31741 ctccctcaag taggctggtg tttctccaga ctagaatcat ggtattggaa gaaaccttag 31801 agatcatcta gtttagttct ctcattttat agtggaggaa ataccctttt tgtttgttgg 31861 atttagttat tagcactgtc caaaggaatt taggataaca gtagaactct gcacatgctt 31921 gcttctagca gattgttctc taagttcctc atatacagta atattgacac agcagtaatt 31981 gtgactgatg aaaatgttca aggacttcat tttcaactct ttctttcctc tgttccttat 32041 ttccacatat ctctcaagct ttgtctgtat gttatataat aaactacaag caaccccaac 32101 tatgttacct accttcctta ggaattattg cttgacccag gttttttttt tttttttttt 32161 ggagacgggg tcttgccctg ttgccaggat ggagtgtagt ggcgccatct cggctcactg 32221 caatctccaa ctccctggtt caagcgattc tcctgtctca atctcacgag tagctgggac 32281 tacaggtata caccaccacg cccggttaat tgaccattcc atttctttct ttctctcttt 32341 tttttttttt tttttgagac agagtcttgc tctgttgccc aggctggagt acagaggtgt 32401 gatctcacct ctccgcaacg tctgcctccc aggttgaagc catactcctg cctcagcctc 32461 tctagtagct gggactacag gcgcgcgcca ccacacccgg ctaatttttg tatttttagt 32521 agagatgggg tttcaccatg ttggccaggc tggtcttgaa ctcatgacct caagtggtcc 32581 acccgcctca gcctcccaaa gtgctggaat tacaggcttg agccaccgtg cccagcaacc 32641 atttcatttc aactagaagt ttctaaagga gagagcagct ttcactaact aaataagatt 32701 ggtcagcttt ctgtaatcga aagagctaaa atgtttgatc ttggtcattt gacagttctg 32761 catacatgta actagtgttt cttattagga ctctgtcttt tccctatagt gtgggagatc 32821 aagaattgtt acaaatcacc cctcaaggaa ccagggatga aatcagtttg gattctgcaa 32881 aaaagggtaa tggcaaagtt tgccaactta acaggcactg aaaagagagt gggtagatac 32941 agtactgtaa ttagattatt ctgaagacca tttgggacct ttacaaccca caaaatctct 33001 tggcagagtt agagtatcat tctctgtcaa atgtcgtggt atggtctgat agatttaaat 33061 ggtactagac taatgtacct ataataagac cttctgtaac tgattgttgc cctttcgttt 33121 ttttttttgt ttgtttgttt gttttttttt gagatggggt ctcactctgt tgcccaggct 33181 ggagtgcagt gatgcaatct tggctcactg caacctccac ctccaaggct caagctatcc 33241 tcccacttca gcctcctgag tagctgggac tacaggcgca tgccaccaca cccggttaat 33301 tttttgtggt tttatagaga tggggtttca ccatgttacc gaggctggtc tcaaactcct 33361 ggactcaagc agtctgccca cttcagcctc ccaaagtgct gcagttacag gcttgagcca 33421 ctgtgcctgg cctgcccttt acttttaatt ggtgtatttg tgtttcatct tttacctact 33481 ggtttttaaa tatagggagt ggtaagtctg tagatagaac agagtattaa gtagacttaa 33541 tggccagtaa tctttagagt acatcagaac cagttttctg atggccaatc tgcttttaat 33601 tcactcttag acgttagaga aataggtgtg gtttctgcat agggaaaatt ctgaaattaa 33661 aaatttaatg gatcctaagt ggaaataatc taggtaaata ggaattaaat gaaagagtat 33721 gagctacatc ttcagtatac ttggtagttt atgaggttag tttctctaat atagccagtt 33781 ggttgatttc cacctccaag gtgtatgaag tatgtatttt tttaatgaca attcagtttt 33841 tgagtacctt gttatttttg tatattttca gctgcttgtg aattttctga gacggatgta 33901 acaaatactg aacatcatca acccagtaat aatgatttga acaccactga gaagcgtgca 33961 gctgagaggc atccagaaaa gtatcagggt agttctgttt caaacttgca tgtggagcca 34021 tgtggcacaa atactcatgc cagctcatta cagcatgaga acagcagttt attactcact 34081 aaagacagaa tgaatgtaga aaaggctgaa ttctgtaata aaagcaaaca gcctggctta 34141 gcaaggagcc aacataacag atgggctgga agtaaggaaa catgtaatga taggcggact 34201 cccagcacag aaaaaaaggt agatctgaat gctgatcccc tgtgtgagag aaaagaatgg 34261 aataagcaga aactgccatg ctcagagaat cctagagata ctgaagatgt tccttggata 34321 acactaaata gcagcattca gaaagttaat gagtggtttt ccagaagtga tgaactgtta 34381 ggttctgatg actcacatga tggggagtct gaatcaaatg ccaaagtagc tgatgtattg 34441 gacgttctaa atgaggtaga tgaatattct ggttcttcag agaaaataga cttactggcc 34501 agtgatcctc atgaggcttt aatatgtaaa agtgaaagag ttcactccaa atcagtagag 34561 agtaatattg aagacaaaat atttgggaaa acctatcgga agaaggcaag cctccccaac 34621 ttaagccatg taactgaaaa tctaattata ggagcatttg ttactgagcc acagataata 34681 caagagcgtc ccctcacaaa taaattaaag cgtaaaagga gacctacatc aggccttcat 34741 cctgaggatt ttatcaagaa agcagatttg gcagttcaaa agactcctga aatgataaat 34801 cagggaacta accaaacgga gcagaatggt caagtgatga atattactaa tagtggtcat 34861 gagaataaaa caaaaggtga ttctattcag aatgagaaaa atcctaaccc aatagaatca 34921 ctcgaaaaag aatctgcttt caaaacgaaa gctgaaccta taagcagcag tataagcaat 34981 atggaactcg aattaaatat ccacaattca aaagcaccta aaaagaatag gctgaggagg 35041 aagtcttcta ccaggcatat tcatgcgctt gaactagtag tcagtagaaa tctaagccca 35101 cctaattgta ctgaattgca aattgatagt tgttctagca gtgaagagat aaagaaaaaa 35161 aagtacaacc aaatgccagt caggcacagc agaaacctac aactcatgga aggtaaagaa 35221 cctgcaactg gagccaagaa gagtaacaag ccaaatgaac agacaagtaa aagacatgac 35281 agcgatactt tcccagagct gaagttaaca aatgcacctg gttcttttac taagtgttca 35341 aataccagtg aacttaaaga atttgtcaat cctagccttc caagagaaga aaaagaagag 35401 aaactagaaa cagttaaagt gtctaataat gctgaagacc ccaaagatct catgttaagt 35461 ggagaaaggg ttttgcaaac tgaaagatct gtagagagta gcagtatttc attggtacct 35521 ggtactgatt atggcactca ggaaagtatc tcgttactgg aagttagcac tctagggaag 35581 gcaaaaacag aaccaaataa atgtgtgagt cagtgtgcag catttgaaaa ccccaaggga 35641 ctaattcatg gttgttccaa agataataga aatgacacag aaggctttaa gtatccattg 35701 ggacatgaag ttaaccacag tcgggaaaca agcatagaaa tggaagaaag tgaacttgat 35761 gctcagtatt tgcagaatac attcaaggtt tcaaagcgcc agtcatttgc tccgttttca 35821 aatccaggaa atgcagaaga ggaatgtgca acattctctg cccactctgg gtccttaaag 35881 aaacaaagtc caaaagtcac ttttgaatgt gaacaaaagg aagaaaatca aggaaagaat 35941 gagtctaata tcaagcctgt acagacagtt aatatcactg caggctttcc tgtggttggt 36001 cagaaagata agccagttga taatgccaaa tgtagtatca aaggaggctc taggttttgt 36061 ctatcatctc agttcagagg caacgaaact ggactcatta ctccaaataa acatggactt 36121 ttacaaaacc catatcgtat accaccactt tttcccatca agtcatttgt taaaactaaa 36181 tgtaagaaaa atctgctaga ggaaaacttt gaggaacatt caatgtcacc tgaaagagaa 36241 atgggaaatg agaacattcc aagtacagtg agcacaatta gccgtaataa cattagagaa 36301 aatgttttta aagaagccag ctcaagcaat attaatgaag taggttccag tactaatgaa 36361 gtgggctcca gtattaatga aataggttcc agtgatgaaa acattcaagc agaactaggt 36421 agaaacagag ggccaaaatt gaatgctatg cttagattag gggttttgca acctgaggtc 36481 tataaacaaa gtcttcctgg aagtaattgt aagcatcctg aaataaaaaa gcaagaatat 36541 gaagaagtag ttcagactgt taatacagat ttctctccat atctgatttc agataactta 36601 gaacagccta tgggaagtag tcatgcatct caggtttgtt ctgagacacc tgatgacctg 36661 ttagatgatg gtgaaataaa ggaagatact agttttgctg aaaatgacat taaggaaagt 36721 tctgctgttt ttagcaaaag cgtccagaaa ggagagctta gcaggagtcc tagccctttc 36781 acccatacac atttggctca gggttaccga agaggggcca agaaattaga gtcctcagaa 36841 gagaacttat ctagtgagga tgaagagctt ccctgcttcc aacacttgtt atttggtaaa 36901 gtaaacaata taccttctca gtctactagg catagcaccg ttgctaccga gtgtctgtct 36961 aagaacacag aggagaattt attatcattg aagaatagct taaatgactg cagtaaccag 37021 gtaatattgg caaaggcatc tcaggaacat caccttagtg aggaaacaaa atgttctgct 37081 agcttgtttt cttcacagtg cagtgaattg gaagacttga ctgcaaatac aaacacccag 37141 gatcctttct tgattggttc ttccaaacaa atgaggcatc agtctgaaag ccagggagtt 37201 ggtctgagtg acaaggaatt ggtttcagat gatgaagaaa gaggaacggg cttggaagaa 37261 aataatcaag aagagcaaag catggattca aacttaggta ttggaaccag gtttttgtgt 37321 ttgccccagt ctatttatag aagtgagcta aatgtttatg cttttgggga gcacatttta 37381 caaatttcca agtatagtta aaggaactgc ttcttaaact tgaaacatgt tcctcctaag 37441 gtgcttttca tagaaaaaag tccttcacac agctaggacg tcatctttga ctgaatgagc 37501 tttaacatcc taattactgg tggacttact tctggtttca ttttataaaa gcaaatccag 37561 gtgtcccaaa gcaaggaatt taatcatttt gtgtgacatg aaagtaaatc cagtcctgcc 37621 aatgagaaga aaaagacaca gcaagttgca gcgtttatag tctgctttta catctgaacc 37681 tctgtttttg ttatttaagg tgaagcagca tctgggtgtg agagtgaaac aagcgtctct 37741 gaagactgct cagggctatc ctctcagagt gacattttaa ccactcaggt aaaaagcgtg 37801 tgtgtgtgtg cacatgcgtg tgtgtggtgt cctttgcatt cagtagtatg tatcccacat 37861 tcttaggttt gctgacatca tctctttgaa ttaatggcac aattgtttgt ggttcattgt 37921 ctccttaaat tagactgtaa gcaccttgat ggaactcata ctacctttta tttcacacac 37981 acgcacacgc gcacacacag cctacacata cactgcctag ctcattgtag catactaaat 38041 actgatttta atgaataagc taaaccttcg aaacccattt gctaatccca gcactttggg 38101 aggccaaggt gggtggatca cctcaggtca gaagtttgag accagcctgg ccaacatggt 38161 gaaaccccac atctactaaa aatacaaaaa ttagctgggc gtggtggcca atgccttgta 38221 atcccagcta ttctggaggc tgagacagga gaatcgcctg aacctgggag gcggaggttg 38281 cactgagctg ggattgtacc actgcactcc agcctgggtg acaaagtgag actccatctc 38341 aaaaacaaac aaacaaaaac acatcatttc ccctatagca aaaacatgac ggcacttact 38401 gtatcaagag aggtgagaaa aaggagccac agcaggatga ttcaagggac tctgcatagc 38461 tccattttaa gaatatgcct actgcaggtc agagaaggta agcaaactgc ctaaggccac 38521 acagccaggt acagaactct caccaatatt attgccagca atcgcaattt tggtgtttat 38581 tcttggtacc aagttggaga ctatagggtt ctcttcctaa tagagaccat ctagcctttc 38641 actgttttgt ggatacttct ttctcttctt cttttttttt ttccctttta aaatctagtt 38701 atttttttct ttttggtttc tttgacacag ggtctcttac tctgttaccc aggctggaat 38761 ggagtagtgc agtcatggtt cactgtagct ttgacttcct gggctcaagc gatcctccta 38821 cctcagcttc ccgagtagct gggaccacag gcgcccacca acacctccag ctaattttta 38881 agtttttact agagacaaca tctcactatg ttgcccaggc tggtctcaaa atcctgggct 38941 caagtgatcc cacctcagcc ttccaaaatg ctgggattac aggtgtgtgc accacgcctg 39001 gcctattttt tttttaattg ctcataaatc atcttttttc tttaaaaaaa agaaagatgg 39061 gaggctaaag caggagaatc acttgaaccc aggaggcgga ggttgcagtg agctgagatc 39121 atgctgctgc tctccagcct gggcaacaag agtgaaactc catctcaaaa aaaaaaaaaa 39181 agaaagtaca caattttact ttctggacct aatggtcaag gccaataatt tggtcaccta 39241 tgaaataaat aaaagcttta ccatatatat gaccatttga taatgtaata tgaaatgttt 39301 atgtactaaa ggcagaatag tctagaaaaa acattctgta tcacaacgtc taaaaatgaa 39361 tatcatcttc atcatagaac caggctcttt ctcctaattt ttttttttga gatggagttt 39421 tgctctgtca cccaggctgg aatgcagtgg cacaattttg gctcactgca accttcagct 39481 cccaggttca ggatcaagtg attttcgtgc ttcagccttc taagtagctg ggattacagg 39541 tgactgccac cacacccagc tcattttttt gtattttttt agtagagaga gggtttcacc 39601 atgttggcca ggctcgtctc gaactcctga cctcaaataa tccacccgtc tcagcctccc 39661 aaagtgctga gattacaggc gtgagccacc aggcctggcc tcctaatttt tatttgtaga 39721 agtggcacca aaattttcca agttctcatg caaaaattca ggctcatctc agtttatttt 39781 tttcatttat ttatctccca ctaaattgac aacttctaat aattaggttg gttctttgta 39841 ttcccagcac agggttctat gcagaataca cacacagcag ttgctggcaa taatattggt 39901 gagagttctg tactgggcta tgtgatctta gacagtttgc ttatgttctc tgacctgccg 39961 taggcacatt cttaaaatga agctgttcag accccctcga ttcatcctgc tgtggcttct 40021 ttttcccacc taaatcttaa ataccctttt agctgctagt aagtgaatga tgttttttta 40081 tgaactttct gaagtcagat tagatgaagt tgagaaaagc ctgatattct tataaagtta 40141 tatatgtgca tcatagaaaa cttagaaaat acagataaac aaaaatcatc catggacgaa 40201 ccttgaagac attgtgttaa ctgaaataaa ccggacacca aaggacacat gttatatgct 40261 tccacttata tgagatacct agaatagtta catttggtta ctctgggtac attgcctata 40321 gataagcctt gctccacaag gagcagttaa aaaaaaaaaa aagataaatt cataggatgg 40381 aaggtagaat agtggttact agggacttgg ggagggggaa atggggagtt actgtttgat 40441 gagtgcagat ttcagtttgg gatgatgaaa aagttctgga gatagatagt ggcaatggta 40501 acacaacagt gtgaaaataa tgccactgaa ctgtacactt aaaatgatta aaatgataag 40561 ttaattgtaa tttgtgttat ccagaaatgg ttagcaattt attggtgtat attcttttag 40621 tattcctgtg tgtgcacagg ggtgcttgta tatactttat ctttaaaata tatccaggaa 40681 gctaggcaca gtggcttaca cctgtaatcc cagcactttg ggagggtgag gcaggaagat 40741 tgcctgagcc ccggaggtca aggctgcagt gagttgtgat cacgctactg cactctgttc 40801 tgggcaaccc ctgtctggga aaaaaaaaaa aattagtgag gcttagtggt gcacacctgt 40861 agtctcagct acttgagtgg ctggggtagg attgcttgat cccagcaagt tgaggccgtg 40921 gtgagccatg atggtgccac tgcactccat cctgggtgat atggtgagac cctgtctcaa 40981 aaacaagaaa tccagataat tctgtgcatt ataatctagc ttttactgga tcattaaaat 41041 tcttttttct tttttttttt ttttttctga gatggagttt cactcttgtt gcccaggctg 41101 gagtgcagtg gtgtgacctt ggctcaccgc atcctctgcc tcccgggttc atgcgattct 41161 cctgcctcag cctcccgagt agctgggatt acaggcatgt gccaccatgc ccagctaact 41221 ttgtattttt agtagagaca gggtttctcc atgttgacca ggctggtctc aaactcctgg 41281 cctcaagtga tccacccacc tcggcctccc aaagtgctgg ggttacaggc gtgagccacc 41341 gcactcagcc tgggtcgtta aaattcttaa gtgacttcat ttttaattac tatatgggat 41401 tctatctttc cagtgtatca tgatttattt gacctattgc tgaatgttgg aggtttcagg 41461 gtaagaggca cagtttgcta ttatgtacat cactatagtg gcatcctgat agctaaatat 41521 ttgcctacat ccctgattat ttccttagtc taaattactg ggactaggat tttggtgttt 41581 gatacatgtt actaaattgt tttttagaaa gattaaacca gtttatgctc ttccagcccc 41641 tgtggtatat gatagttccc attttcctgt accttgccaa cactgggtga tatccagttt 41701 taaaatctaa atcttgcatt gctatgagaa ctacaattag agaaggctta tcttctactg 41761 cccattctct gtacagagca aatccctcta gacctgaagc cccttggagt tgtcaagaaa 41821 cctttgagat gactccccac tctgtatctg agctgtcacc agtattctcc acttcttcag 41881 gattgccatg gcaactaaat tgatgaaaag atttaggagg ccttttctct ctttgcaatt 41941 cctatgatcc tttttgaatg tgggtttggg actctgtcaa tatacccatc atctaattct 42001 gtccattgtg ttttaaagtt taaggttgca atttctgatt acatctgcct tagccatact 42061 gtattatatt tgacattcaa tatacaatgt ccttgttttt ctgtatttct aatcttattc 42121 ccagagatgt gtctatttgt tcaggattca ttttgcaacg tgtttttact aagcatctac 42181 ccaaaaccgt tgaagtcaga tttcaggctg tcttacgtct aaagtagcac aggcaggaaa 42241 aactattgaa gtgggatttt tttttccctt tttgtactga accgagaaaa agtatataga 42301 tgatagagaa ttcctaattt ggtatcattg atatctgggt ttttgtttgt ttttacagaa 42361 gactgattaa ctatacttat ttattaattt atcttctcat taataaacac ttgctgagtg 42421 cttactgtct gctaggcatt agggagacaa atatgattaa gggaagcttc ctcctatcaa 42481 ggtcatgtgt tccatttggg tatactaatg cattagcaat gtaaatcaag tagtgagaga 42541 tcatctgttc ccgataggag atggattatt ggtggggact tctgtgtgtg tgtgtgtgtg 42601 tgtgtgtgtg tgtgtgtgtg tatgtatgtg ataaaataaa tataggaaat gttaattata 42661 gattctaagt agtagatata taaacactca ttgcaaagtt gcttcaagtt ttctgtatat 42721 ttgaaaatat tcacaacatg tcgacaaaac tagcatgata aagccactat ttgtgctaag 42781 acttcagctt gtatctggat taggcttatt atgtagtagt aggaacatta gaaatagttt 42841 taactcatta aatacacatg ttttatggga aggttttata tatatattta tatgtaatga 42901 atgtgaacaa acaagggtca gatatacact ctgcttccct ccagaccagt tccggctgct 42961 ctgctgcaca tttcaggagt cttattagaa ttagccacat tctgcccact tgcccttact 43021 tctcatattt cacaactcct cctggtgggg acttaaggag acattcaaac taggccttga 43081 aagatgagaa tttttccaag tggaaaaaga ggagtggcag caagtaaggt aaaggtacag 43141 agtcatggaa ttcccaggaa acgtaaagtt gtcatgtgtt ataggaaaac aacttgtgtg 43201 aggggtgttg ggagaaatga gagataatac cagggtataa agggcctttt gaatgctatg 43261 ttgaggaatt ttatcctaat ggcagtaatg actaacaatt atatagtgtt caaaaagtat 43321 aaatcagcag tggtatacca ctaagggttt ttttcttttc tttttttttt tgagacagag 43381 ttttgctctg ttgcccaggc tggagtgccg tggcacgatc tcagctcact gcacttccgc 43441 cacctgggtt caagtgattc ttctgcctca gccagtgttt cactgtgatg gccaggatgg 43501 agcactaagg gtctttatgg aagaaaaaga catgataaac aaggctttta gggaacttct 43561 acagtaatgt agctgtatta aaagtagaga tcagagcagc atagtagaag tagaaggcta 43621 gagctaattg aaggagcact tcagaattag aatcaagaag tcttagaaac ctattggttt 43681 tattctccct aatgtatttg gccacttacc tgctggggaa tttgtctaag ttataaaaaa 43741 taattccttt gggaaaccca aaggaaagtt atctattaat aattacccca ctactttttc 43801 tgatttatgt aatggccacg tagaggttag atgtgatggt tgtgacagta gtgactaata 43861 cagcctgtga agcattttgg tcagatatct atgtgctttc attccaggtt gactgaggca 43921 agactttggc tagggtttga tcagtgatgt aactactcac gagtaccacg tggtggcaat 43981 ggcattgctg cagaccttgg cagcaaagca gtgttagagt agcagtagaa acctttgtga 44041 agctaggaat acattttctg gtcataaaaa cctcctgaaa attgtgaact cagtgtagca 44101 ggagaaagaa gatggcttgt ttttagtaaa gggcaaagtc atttttaagg atcagaagaa 44161 gaaacggaga gtgaaacaat gtgttcctgc cctactcccc cactggactt tttggcaacc 44221 attgctgttc cttctaaaag tgatttttaa acatgtatat tttgaagcca ggcacagtga 44281 ctcacgtctg taatcccagc actttgggag gccgaggcgg gcagatcacc tgaggtcagg 44341 agttcaagac cagcctggcc aacatggtga aaccccgtct ctactaaaaa tacaaaaatt 44401 aggccaggtg tggtggctca cgcctgtaat cccagcactt tgggaggccg aggcgggagg 44461 atcatgtggt caggagatcc agaccatcct ggctaacacg gtgaaacacc atttctacta 44521 aaaatacaaa aaattagctg ggcatggtgg cgggcgcctg taatcccagc tactcaggag 44581 gctgaagcag aagaatggct tgaacctggg aggcggagct tgcagtgaac caagattgcg 44641 ccactgcact ccagcctggg caacaaagtg agactccgtc tcaaaaaaaa aaaaaaaaat 44701 tagtcgggca tggtaacagg tgcctgtaat cccagctact tgagaggctg aggcagggag 44761 aattgcttga accaggtagg cggaggttgc agtgagccaa gatcgcacca ctgcactcca 44821 gcctggggca acagagcaag actgtctcaa aaaaaataaa taaataaaat aaattcttaa 44881 gaaggatatt ttggaaaact ccttacatac ctaaattctt tgtttatcaa atacttggac 44941 ttagcacact cttctttgaa atggaccaat aaacaacagg agcccataag caaaaagaac 45001 tcattatttt aaaaacagta actatcctta caggctttct cagggctctt tctgttggat 45061 ccttccctct cacaggtcct tgctaatgat ctctaggtgg acacattcta gatgagatgt 45121 ccctgtctag aatggcagca ccatgagggc tatatcctca gtactaggac agcgcctggt 45181 gcttaataga tagtaaatag ttgtctaatt aactgagcaa acagatagat tcatgaatta 45241 gctttttgct ttttctgtta gaaactaaag gttcaggtca ggcacaatgg cgcatgtctc 45301 taatcccagc actttgggag gccgaggcgg gctgatcact tgaggtcagg agttcaagac 45361 cagcctggcc aacatagtaa aaccctgttt ctacaaaaat taccaaaatt agccgggcgt 45421 cttggcaagc acctgtaatg ccagctactt gagaggctga ggtgggagaa tcgcttgaac 45481 ctgggaggaa gaggttgcag tgagccgaga tggtgccaac ctgggtgaca gagggagact 45541 taaaaaaaaa aagaaagaaa gaaagaaaag aaactaaagg ttcaaagaat cccagaaaag 45601 gaagagtcct cacaagccag taatctaggc aggattactg atagtatttt tatatttgtt 45661 gtatttttat aaaatgccat agatagaggg cttttttcaa cattacatca gtctaaaaat 45721 cacacatttt tatatgaact aacctaaatg tctgatgaat ctcacaacac caagtctttg 45781 aaatgtgccc atataaataa aatgttaaca gattcatgct aattttaaat atcgatagtg 45841 tttaaatgcc ttaattattt tttcactccc tagctttaaa agaaaataac caacttcaaa 45901 aggacatcac aataacatca agtctatttg ggggaatttg aggatttttt ccctcactaa 45961 catcatttgg aaataatttc atgggcatta attgcatgaa tgtggttaga ttaaaaggtg 46021 ttcagctaga acttgtagtt ccatactagg tgatttcaat tcctgtgcta aaattaattt 46081 gtatgatata ttttcattta atggaaagct tctcaaagta tttcattttc ttggtgccat 46141 ttatcgtttt tgaagcagag ggataccatg caacataacc tgataaagct ccagcaggaa 46201 atggctgaac tagaagctgt gttagaacag catgggagcc agccttctaa cagctaccct 46261 tccatcataa gtgactcttc tgcccttgag gacctgcgaa atccagaaca aagcacatca 46321 gaaaaaggtg tgtattgttg gccaaacact gatatcttaa gcaaaattct ttccttcccc 46381 tttatctcct tctgaagagt aaggacctag ctccaacatt ttatgatcct tgctcagcac 46441 atgggtaatt atggagcctt ggttcttgtc cctgctcaca actaatatac cagtcagagg 46501 gacccaaggc agtcattcat gttgtcatct gagtacctac aacaagtaga tgctatgggg 46561 agcccatgga agatacatgg tatacaacat agctcttgct ctattggaag ctaagtggaa 46621 tgggagaaat tggtgacagg caaccccata atttcagaaa gctatgaaaa agtactcaga 46681 catattcctt ataacactgg tgtcacatca caaagaccta tttaatgtgc ttctgattta 46741 tagggagaga catcctatac ttcaggaact gcactttgat ccacagaaag cctagtgatg 46801 tagagctcct gttagttcaa aaggaaaaga aaagaacaac acagaaagcc taattatgca 46861 atagagtcaa gtgctttata gcaatgttac agttatcaaa aaaaatccag atggacctct 46921 gagaggatgc cattggagta accaggcaga tgcagttgat cagagctgac ttcctataag 46981 aagtgagcac tgagctgagg aataatggca taaatgaagg aaagtgagat ggaaatttga 47041 gtttttaatt ggaaagacaa tacatcaggc agatttttaa ataggggcaa acaaacagac 47101 acataggaga tgctaggcat ggggtcccca ctaggatgct gcttagaaac atgcaggggt 47161 ggtgagtact cccaaagtac acttcattcc tagctcagtg attcttatct gagtgttaaa 47221 gttccttctt cagcaccccg ttccacagtc caactgggaa ctttaagacc tttcttggag 47281 tctttctagg aactcaagtc tgctacttat acagaacagt ggctttggtc cccagttgtg 47341 ccttgcagta tttttgtgtt caggaagaaa cagtagctct tggataaaga agctagctag 47401 aaactctgtt gctatggcag tgcttcaaaa tgtatttcct taaatgcttt ctttgtaact 47461 atcttcattt agttcatctc tcagataatg agagatcaga gtcccatccc cagtataata 47521 ctcttcttta gggtactttc accatcttca gtctaaacac agactagact ttcaattata 47581 atgtgtaaga tttaaaatgt tattattgtg tgactttgaa tatctgtgta aatctactat 47641 ctcctctttg gtatatacgt gtgtttattt ttttctggag atctgtaact gaaatgctta 47701 atttctgaat tgttttggat atcacaactt aataccaaca taagttttga gcctttttct 47761 ccctaaatct ggtgtgagtc taactgaaac tcaaatgaac tttttaaaaa taattttttc 47821 ttttctttaa tttttttttt aagtagagac agggacgcac tgttaactag gctggtcttg 47881 aactcctgat cttgagccat cctccccgac ctgagcctca ccttatagag agggtcttgc 47941 tctgttgccc aagctggagg gcagtggcat aatcacagct cactgcagcc tctcgacctc 48001 ctcaagcgat cctcctgcct tagcctccca agtagctggg actataggcg tccaccacca 48061 tacccagcta attttttttt ttattttttg tagagacaag gtctccctat gttgcccaag 48121 ttggtctcaa actcctggac tcaagcagtc ctctcacctc agggtcccaa agtgctgggg 48181 ttacaggtgt gagccatggc acctggccag aacttctagt aaaaagaata ttgttgccgg 48241 gtacggtggc tcacgcctgt aaccccagca ctttgggagg ccaaggcagg cgaatcacct 48301 gaggtcggga gctcgagacc agcctgacca acatggagaa accacatctc tactaaaact 48361 acaaaaaatt agccgggcgt ggtggcacat gcctgtaatc ccagctactt gggagctacg 48421 gtgcctggcc tagtttatta tttcttaata tctgttgtct tccagtgtct tccttaattc 48481 ttcacaatac cctgtacaat gcttagcaca cagtgggcag tctgtaagtt tattaaatgt 48541 ttggtgtggc ccatacttcc tatccacaaa gaatgtaaca tgttaagaca tctagatgag 48601 ggaatgattt aagaggaact acaataatat tctgaaactt ggactctgga tctctgcatt 48661 tagactttcc taaaccagcc agcaagtaga tcatcatgtc acaaggctta ggttgggctt 48721 gctgttcaga gaatgaatta aggattaagg agaaaaaaaa gcagaaaggt tttgctctgt 48781 ttttcaggtt ctattgagtt gttaacttct aacaagttat cttatttgct tcattgcatg 48841 aggcccattg tagtaagaag aggaatttat atgctaaatg ttctggtgat agaatgactt 48901 ttcttttttt ttacagtcca aaggtctttt tttttttttt ttaacaccta ttatgccatg 48961 aattcatagg gaataggttc cagctgctca ggctccttcc cattggttct cacaaagtgt 49021 gcttctctgg gtggagcagg ctggtgcttc agttgaaccc acgtaccttt ctctttggct 49081 tctttctttt tctgatcatt ttccttcacg cgtttcagga agctgtcttg gctcttagag 49141 tgtttaatgt gctcaatacg cacattaatt ctcttggcaa gaatcttgcc cttaacttgt 49201 ttacagcgat gccaacagca tgctgggtca cgttgtagac ttttccagtt ttgccatggt 49261 aacacttgtg gggcattcct ttttgaacag tacccgttcc cttgatgtct acaatttcac 49321 ctttcttaca gattcgcata tacatggcca aaggaacaac tccatgtttt ctaaaaggcc 49381 tagagaacat atatcaggtg cctctcctct ttccctttgt gttcgtcatt ttggcaaatt 49441 actgaaagat ggtggttctg gccaaaagga ggaatgactt tttaatagct gtgtttgtat 49501 ctgagccttc cctctgcctt tcattttttt tgttttgttt tgttttgttt ttgtttgaga 49561 tgaagtttca cttttgttgc ccaggctgga gtgcaatggt gtgatttcgg ctcattacaa 49621 tgtccgcctc agcctcctgg gtagctggga ttacaggcac ccgccaccac gcccagctaa 49681 tttttgtatt tttagtagag acagggtttc accatgttgg ccgggctggt ctcaaactcc 49741 tgacctcagg tgatctgtcc acctcggcct ctcaaagtgc tgggattgta ggcgtgagcc 49801 acatcacctg gccacttttt taactctttc caatggttaa ttccgtttga tatggttcct 49861 tggaacttgc acattaccct ttatcaatta tcaccctgta ttgggggtgg ggaggatgat 49921 acctctcttc atagttagat cctacttact ttcaacagag ttcttaacaa tcctagaaac 49981 tcacaggtcc agaaaagaca agcataaagg aaactataaa taatgcattt gaagactaac 50041 tcaggaaatc aatgattatt tccccccagg ctacccagtg tcttaaaaaa acagtttaat 50101 taatacaatc ttttgtttca attttctacc tatatttatg gcttttagct tttctaataa 50161 aagctcaaaa tgaattacag tcatcagtga ctttttaatg aatagaagac ttttgcaatt 50221 tttaactatt tgtttttact tattaaatat ttccgccttg gccaggcatg gtggctcacg 50281 cctataatcc cagcactgtg agatgccaag gcaggaggat cacttgagtt taagagttct 50341 agaccaggct gggtatggtg gctcatgcct ataatcccag cactttgtga ggccaaggtt 50401 ggcggatcac ctgaggtcag gagtttaaga ccagcctggc caacatggta aaaccccatc 50461 tctacaaaaa atacaaaaat tagccaaggg gtggtggtgg gcacctataa tcccatcttc 50521 ttgggaggct aaggcaggag aatcgcttga acctggaggc agaggttgca gtgagccgag 50581 atcatgccac tgtattccag cctgggtaac agagcaagac tctgtctcaa aaaaaaaaaa 50641 aagtttgaaa ccagcctggt caacacagca agacacccat ctcgttgaaa aataacggtc 50701 gggcgcagtg gctcacgcct gtaatcccat cactttggga ggccgaggca ggcagatcac 50761 ctgaggtcgg gagttcgaga ccagcgtgac caacatggag aaaccccatc tctactaaaa 50821 atacaaaatt agttgggcga ggtggtgcat acctgtaatc ccaactactt gggaggctga 50881 ggcaggagaa cagcttgaac ctgggaggca gagaggttgt ggtgagccaa gatcatgcca 50941 ttgcactgca gcctgggcaa caagagcaaa actccatctc aaaaaaaata aataaataaa 51001 aataaataaa taagtacttc tgcctttaag ccacttccta gaaggcagtg gcacaaagtg 51061 atacatttgg aggagtaaat atattacaaa atgaattagg ctgggcgcag tggctcatgt 51121 ctgtaatccc cgcactttgg gaggccaagg cgggtggatc acttgaggtc aggagttcga 51181 gactagcctg atcaacaggg taaaatccca tctctactaa aaataccaaa aaaactagct 51241 gggcgtggtg gcaggcacct gtaatgtcag ctactaggaa ggctgaggca ggagaatcgc 51301 ttgaacccag gaggtggagg ttgcagtgag ccaagattgc accattgcac ttcagcttgg 51361 gcaacagagt gagactccgt ctcaaaaaaa aaaaaagaac taacatgcca gaactttgcc 51421 ttcagtatgt tttgtgattt ttcccttctt gtgccatttc atcattagtt ccatgtatta 51481 tttaagattt cttatcaacc agcaccttgg gatttttttg tgtatgtgtt ggtttagggg 51541 gtttatttgt ttttttcttt tttttcggta attgaaaatg tgaagcaaaa tgtcacctgt 51601 tttttctttc atgtctgaca ctcatgtctt gtttaccccc gacatgcaga agctgaaatc 51661 cccatttcat acagtcttca atgtggaggc agtagggatg gagaaaataa tgtactttgt 51721 gctctccggt actctttctt tcctattgtc tgaggggatt tgggcataat ttattttgct 51781 gcagagataa aaatttgtta tatatatttt ttatcattca gggccaagga atatagattt 51841 tttttttcag ccttgtctca gctgggtgtc tttatttact ctgtcttaaa gtgttccttt 51901 tattatcatt attatttttt aatcattgaa ttccatttgg tgctagcatc tgtctgttgc 51961 attgcttgtg tttataaaat tctgcctgat atacttgttt aaaaaccaat ttgtgtatca 52021 tagattgatg cttttgaaaa aaatcagtat tctaacctga attatcacta tcagaacaaa 52081 gcagtaaagt agatttgttt tctcattcca tttaaagcag tattaacttc acagaaaagt 52141 agtgaatacc ctataagcca gaatccagaa ggcctttctg ctgacaagtt tgaggtgtct 52201 gcagatagtt ctaccagtaa aaataaagaa ccaggagtgg aaaggtaaga aacatcaatg 52261 taaagatgct gtggtatctg acatctttat ttatattgaa ctctgattgt taattttttt 52321 caccatactt tctccagttt tttgcataca ggcatttata cacttttatt gctctaggat 52381 acttcttttg tttaatccta tataggtttt ttgaacctat aacataagct acaacatgag 52441 aaatgtgcgg ttagatagat atgtcccttc tgaaggtcag aaaaaaatat aatggaggta 52501 aaacctgaac aagcttggaa actgatggta gacttcttca aggcagccct tgccctaatt 52561 aaaattcttg tctttctaga aaaagtctag ctgttgattt accacagaaa ataataataa 52621 taattactat tattattatt ttttgagaca gggtcgccct gtgtcaccta gattgcagtg 52681 gtgcagtcat ggctcactgc atcctccgtt tttcaggctc aagcaatcct cccaccttag 52741 cctcctgagt agctgggtcc acaagcatgc gccacccaca cccactaagt ttttgtattt 52801 ttggtagaga tggagtttta ccttgttgcc caggctggtc tcaaattcct ggactcaagt 52861 agtccgcccg ccttgccctc ccaaagccag aaaacattta gaatatcttt cagagatgtg 52921 tatttacacc actattaaca cagggctgta tagcagtcca gtactggact atgtagtcca 52981 gtactattct tttccttact ggagggccag gcgtggtggc aggtgcctgt aatcccagct 53041 actcaggagg ctgaggcagg agaattgctt gaacctggga ggcagaggtt gcagtgagct 53101 gggaccgtgc cattgcactc cagcctgggc gacagagcaa gactccgtct caaaacaaaa 53161 aaaaaaagag agagagagca gtaattcagg tctcacccat cttcaatcca gggggcctag 53221 ccttagtatt tgacccatag taagcaccca ataattgttt aaattaatta acctctgagg 53281 ccctttaaat ctgttgataa gtatcttatt ttgcaaagtc ctaagcactt ggaagagcag 53341 aggaactatt tactgggtgt gtatgctttt ctaacaatat tttatagctg gcttttgttt 53401 ttagaatgaa tttgaacatt gaaaaggcag gcaataggga tgattctgtg aattctgcta 53461 aaactgagta gaaagaatga gtgtagagat gtcgacattg atcaactttc tatcttcata 53521 agagatctga ttctaacata tccatttaga ctcaagtaga atattgtgta tagagtgagt 53581 ggcagtgagt aatttggtaa aaatttgctg acctgctttt attctttcct cctttctttc 53641 ttcctttcct tccttccttc cttccgtcct ttcctttcct ttccctccct tccttccttc 53701 tttccttctt tctttccttt ctttcctttc ttcctttctt tccttcctcc cttccttttc 53761 ttttctttct ttcctttcct tttctttcct ttctttcctt tcctttcttt cttgacagag 53821 tcttgctctg tcactcaggc tggagtgcag tggcgtgatc tcggctcact gcaacctctg 53881 tctcccaggt tcaagcaatt ttcctgcctc agcctcccga gtagctgaga ttacaggcgc 53941 cagccaccac acccagctac tgacctgctt ttaaacagct gggagatatg gtgcctcaga 54001 ccaacccaac cccatgttat atgtcaaccc tgacatattg gcaggcaaca tgaatccaga 54061 cttctaggct gtcttgcggg ctcttttttg ccagtcattt ctgatctctc tgacatgagc 54121 tgtttcattt atgctttggc tgcccagcaa gtatgatttg tcctttcaca attggtggcg 54181 atggttttct ccttccattt atctttctag gtcatcccct tctaaatgcc catcattaga 54241 tgataggtgg tacatgcaca gttgctctgg gagtcttcag aatagaaact acccatctca 54301 agaggagctc attaaggttg ttgatgtgga ggagcaacag ctggaagagt ctgggccaca 54361 cgatttgacg gaaacatctt acttgccaag gcaagatcta ggtaatattt catctgctgt 54421 attggaacaa acactttgat tttactctga atcctacata aagatattct ggttaaccaa 54481 cttttagatg tactagtcta tcatggacac ttttgttata cttaattaag cccactttag 54541 aaaaatagct caagtgttaa tcaaggttta cttgaaaatt attgaaactg ttaatccatc 54601 tatattttaa ttaatggttt aactaatgat tttgaggatg agggagtctt ggtgtactct 54661 aaatgtatta tttcaggcca ggcatagtgg ctcacgcctg taatcccagt actccaggag 54721 gccgaggcag gtggatcagc tgaggtcagg agttcaagac ctgtctggcc aacatggtga 54781 aaccctgtct ctactaaaaa tacaaaaaaa ttaactgggt gtgctagtgc atgcccgtaa 54841 tcctagctac tctggaggct gaggcagcag aatcacttga acccgggagg cggaggttgc 54901 ggtgagccaa gatcacacca ctgcactcca gtctgggtga cagagcaaga ctccatctca 54961 aaaaatatat atatatatat atacacacat atattttatt tcaactgtta gacaagagtc 55021 caaaggccaa agaataaagt tttaggccag tcctttatta gaaaatgagt caaatcccaa 55081 agcaagtttt tttatgagtt aatgaatata aatgactaca tattttatgc cttaaaaatc 55141 acttttaatg aatggtgttt tatggcttgt aaatcagagt tttaatcagt aaagaaagtt 55201 tttaatcctc aaaaacacgt tatcataaaa gacactgttt ggcatcaaat gtggtatttg 55261 gccatgttca ttagggtcat tttaggaatc tcatacattc tacttagcta tgcttaattc 55321 ctgataccat ggcattttct gaaatgtttc aaggatgaca tctctgctgt ttttaatttg 55381 gtaatgatat ctgctgattt attaagtgaa aaaagtaatg gtgtcattac cttggatgaa 55441 gaaacaaaaa taaagcattt gccacatttt tcaactttgt tttcctttct tacaaaattg 55501 ctataagctc attgccccca aattggacaa tatagggaat aaaaaagata atttggggtg 55561 gggttagaca cgggtcttgt tatgttgccg aggctggtct ctaactcctg gcctcatgca 55621 atcttcctac cttggcctcc caaagtgctg ggattatagg tgtgagccac ttcaccaagc 55681 tgagatgcca cctcttaaaa gagagaataa ggacagatta cagccactgc tcatgcctgt 55741 aatgtcagta ctttgggagg ccaaggtggg agaattgctc gaggccaaga gttcaagacc 55801 agcctgggca atgtagcgag acctgatctc tatgaaaagg ggggtggggg ggaaaactag 55861 ctggggccag gcgtggtggg tggcttacgc ctgtaatccc agcactttgg gaggccgagg 55921 cgggcagatc acctgagggc aggagttcag gaccaacctg accaatatgg agaaaccctg 55981 tctctactaa aaatacaaaa ttagccaggc ttggtggctt atgcctgtag tcccagctac 56041 tcgggaggct gaggcaggag aatcgcttga acctgggagg cagaggtttc agtgagctga 56101 gatcgcgcca ttgcactcta gcctgggcaa caagaatgaa actccatctc aaaaaaaaaa 56161 aaaatcagct ggaaggtggc aaacacctgt ggtcccagct actcaggagg ctgagacagg 56221 aagatcactt gagtccagga ggtcaaggct gcaggtgagc catgtttgtg ccactgcact 56281 gcagcctgga tgacagaccg agacccttct caaaaaaaaa atttttcccg gtattttttt 56341 ttgggggggg gtttaattct tgttgcccag gctggggtga attggggaat tttgggttaa 56401 gggaaccttc ggcttcctgg gttggggggt ttttcctgtt taggcttccc cagtagctgg 56461 gattacaggc atgcaccacc acgcccggct aattttttgt atttttagta gagacagggt 56521 ttctccatgt tggtcagact ggtctcgacc tcttgacctc aggtgatccg cccaccttgg 56581 cctcccaaag tgttgggatt acaggcctga gccaccgcac ccggcctgta ctcttattct 56641 ttaataataa aatatttctg tgtttcttta gtcattttac ataaactttt atttatttat 56701 ttatttttat ttatttattt ttttgagacg gagtctcgtt ctgttgccca ggctggaatg 56761 caatggctca atctcagctc actgcaagct ctgcctcccg ggtacacgcc attcccctgc 56821 ctcagcctcc ctagtagccg ggactacagg cgcccgccac cacgcccagc taattttttt 56881 ttttgtattt tcagtagaga cagggtttca ctgtgttagc caggatggtc ttgatctcct 56941 gacctcgtga tccacccgtc tcggcctccc aaagtgctgg gattacaggt gtgagccacc 57001 gtgctcggcc cataaacttt tatttttaaa ataatgtcat gataaataat attgcttagg 57061 tgtctttaat atattagtaa catttctgtt ttattgtaca tcaacattta tattcaaatt 57121 aatgggtgaa gagtactcca ttggactagg tatatcgtaa tttaatctcc tattattgga 57181 caactacatt gtttctaaaa ttatactatt cctatgacta aacctttgca tatatctttt 57241 atctccctag gatatatttc taaaactagc attgttgact gaaagtgtaa atacgtgtta 57301 aggtgtttgc tacataatgc catatttcct ttttaggaaa ctaagctact ttggatttcc 57361 accaacactg tattcatgta cccatttttc tcttaaccta actttattgg tctttttaat 57421 tcttaacaga gaccagaact ttgtaattca acattcatcg ttgtgtaaat taaacttctc 57481 ccattccttt cagagggaac cccttacctg gaatctggaa tcagcctctt ctctgatgac 57541 cctgaatctg atccttctga agacagagcc ccagagtcag ctcgtgttgg caacatacca 57601 tcttcaacct ctgcattgaa agttccccaa ttgaaagttg cagaatctgc ccagagtcca 57661 gctgctgctc atactactga tactgctggg tataatgcaa tggaagaaag tgtgagcagg 57721 gagaagccag aattgacagc ttcaacagaa agggtcaaca aaagaatgtc catggtggtg 57781 tctggcctga ccccagaaga atttgtgagt gtatccatat gtatctccct aatgactaag 57841 acttaacaac attctggaaa gagttttatg taggtattgt caattaataa cctagaggaa 57901 gaaatctaga aaacaatcac agttctgtgt aatttaattt cgattactaa tttctgaaaa 57961 tttagatcta gataaagcta tagtgtggat tattttatgt atatttactt gagaaaataa 58021 ttattaaata ttagtggaaa agctatactt tgggtatgat ataggacttt cgaattggaa 58081 ttttcctttc tatctgtaaa agcaagtagg tatagtttta ttccccagaa ggcatctttt 58141 tctccccctt gtctcacatg ggtgaattta ccagcatatt taactaaatt cagactggtt 58201 ccaaatgtac tgccagatag tagcatttct ctagtgtttg ttttcatcct ggcttgtaag 58261 aatgccctgc cacttctgcc ctgcaatatc ccttgctatt aggattttgg catcaccttg 58321 ggtccttaat gccagaaatg ggaattgctt catactgtgg aaaaataccc attaaaatat 58381 taagaccagt aaaacctcgt ttctgcttgg gctatttgtg gatttcagac atcctgagaa 58441 gtttaccacc cctgtaatta attgtcattg tcatcacttc ataataaaaa taattgcatg 58501 gccgggcatg gtggctcaag cctgtaatcc cagcactttg ggaggctgag gtggtcagat 58561 cacctaaggt caggagatca agaccagcct gaccaacatg aagaaacccc atctttacta 58621 aaaatacaca attagccggg cgtggtggcg catgcctata atcccagcta ctcaggaggc 58681 tgaggcagga gaattgcttg aacccgggag gcggaggttg cggtgagccg agattgcacc 58741 attgcactcc agcctgggca acaagagcga aactctgtct caataataag aagaagaatt 58801 gcgtgaatat ttctttaaaa ctatgatgag ataacatacc agattatcaa atggattcag 58861 tagtgggtgt gccatttatt gcacactgag agatgaccaa gtcattctga aatatcttta 58921 ttaatatatc cttcctagga tttttcatcc taacttctcc ataggtagtt acttagcata 58981 acatctctgt ggccagatgt atcccactac taaaagggca aagtaagctg tggctgccct 59041 ggtagataca atgagtaagt gcacagtgat ggctataaat gttttcatct cataatccca 59101 tgtccagacc agcaatttgc tctgaaagct cttacctgtg tctgtttcaa tggctcttga 59161 tcacttgcct gcacgtccag aattccttat ttattcattg aaaattagcg ttctttatcc 59221 ctttgttttg caagttcagc tttttagaga tggctaaaat ggtctaatct ttcttggcaa 59281 aggcaattct gagctgcaga ttagactaca agtggcttgg gtacatgttg tctttaaaca 59341 agcgaagagg aaaactttga gctctattca gacttggtga agtgtggtaa atttatgatg 59401 aaagctactg actgtattac acatgattaa ttctgaagcc catattaaga tgatcttttc 59461 agcagttcag cattgctctt ctaactgaac agtttcaagg ctgggatttc agcaattaat 59521 cagttcagaa ttgctaatga tctggcggag ggtggtagca aaagggggag gatgtcatta 59581 gcttctctag cctgcctttt ttcagtgccc tgtggcagta tggagtgagg caacatgaaa 59641 gaaagatggc ctgaccttca tggcagtatt gtgcaacacg taaatactgg tgtgagtggc 59701 tgtggctatg gctagtaaat gatggccctt ggtaaacaaa gttatttatc agacaatacc 59761 taccagctag gtcaactgtg cccataattg atctggttaa tttcttttgc tgcctattga 59821 tttttatttg gttgatagat aatagctaga ggactctaaa tttctttggg gaagaacatg 59881 aaccccttct aagccttctt acgagagaat tgatcgcttt tgcactgacc tttagtaaca 59941 tcctgatttc agtgttttgt aactatcaga gggttgagtc ttggttttaa gccatgtata 60001 tctgtagcat aactttctgt gtaggctagt tacctctcag cttataaagt gtaggctgat 60061 aaatttatag tacagtagag tgtcactatg caaagaaacg atcttaggga atcgaatgat 60121 atctgctatt aaagcaaaat taatatatat tttttctttt tacttttttt tttttttaaa 60181 gacatgaaat ctcactgtat tgcccaggct ggtcttggtc tcagactctt gagctcaagc 60241 agtcctccca cctcagcttc ccaaagtgct gggattatag gcatgagctg ccgtgtctgg 60301 cccagtatat attttttaag ttttaagttt tgtggtacgt agtaggttta taatattatt 60361 ttgaatcctt agttgtaatt ttatgtctgc tgatgtgtac ataattttta ttaaactatt 60421 tatttgagac ttcaggtatc tttttttttt ttttgagacg gagtctcgca ctctcgccca 60481 ggctagagtg cagtggcgcc atctcggctt actgcaagct ctgcttcctg ggttcacgcc 60541 attctcctgc ctcagcctcc tgagtagctg agactacagg tgcccgccac cacgcctggc 60601 taattttttg tatttttagt agagacaggg tttcaccgtg ttagccagga tggtctcgat 60661 ctcctgacct tgtgatctgc ccgcctcagc ctcccaaagt gctgagatta caggcgtgag 60721 ccaccgcgcc cagccgagac ttcaggtgtc ttagaatttt ttaaatgtac cctttctgag 60781 aaaaacagag acttaaagct aggataactg gtattctatt tttttttttt tttttttttt 60841 ttacctccag cctgggtgac agagcaagac tctgtctaaa aaaaaaaaaa aaaaaattca 60901 ctttaaatag ttccaggaca cgtgtagaac gtgcaggatt gctacatagg taaacatatg 60961 ccatggtgga ataactagta ttctgagctg tgtgctagag gtaactcatg ataatggaat 61021 atttgattta atttcagatg ctcgtgtaca agtttgccag aaaacaccac atcactttaa 61081 ctaatctaat tactgaagag actactcatg ttgttatgaa aacaggtata ccaagaacct 61141 ttacagaata ccttgcatct gctgcataaa accacatgag gcgaggcacg gtggcgcatg 61201 cctgtaatcg cagcactttg ggaggccgag gcgggcagat cacgagatta ggagatcgag 61261 accatcctgg ccagcatggt gaaaccccgt ctctactaaa aaataaaaaa attagctggg 61321 tgtggtcgcg tgcgcctgta gtcccagcta ctcgtgaggc tgaggcagga gaatcacttg 61381 aaccggggag atggaggttg cagtgagccg agatcatgcc actgcattcc agcctggcga 61441 cagagcaagg ctccgtctca aaaaaaaaaa aaaaaaacgt gaaaaaataa gaatatttgt 61501 tgagcatagc atggatgata gtcttctaat agtcaatcaa ttactttatg aaagacaaat 61561 aatagttttg ctgcttcctt acctcctttt gttttgggtt aagatttgga gtgtgggcca 61621 ggcacggtgg ctcacacctg taatctcagc actttgggag gccgaggcgg gtggatcacc 61681 tgaggtcagg agttcgagac cagcctggcc aacgtgttga aaccccgtct ctactaaaaa 61741 tataaaaatt aggtgggcgt ggtggcaggc acctgtaatc ccagctactc aggaggctga 61801 ggcagcagaa tcgcttgaac ccaggaggtg gaggttgcag tgacccaaga tcgcaccatt 61861 gcactccagc ctggggacaa gagcgagatt cttgtctcaa aaaaaaaaaa aaaaaaaaaa 61921 ggtttggagg gtggtgagct gagatagtca actattaact cctatctacc tgctgggact 61981 acactggtga ggtggagcct aagtcctaaa acaacaagtg aggcagctgg acgcggtggc 62041 tcgcatcagt aatcccagca ctttgggagc ctgaggcggg cagatcacaa ggtcaggagt 62101 tcgagaccag cctggccaat atggtaaaac ccagtctcta ctaaaaatac ataaattggc 62161 tgggcgtggt ggtgtgcacc tgtaatccca gctacttggg aggctgacac agaagaattg 62221 cttgaactct ggaggctgag gttgcagtca gctgagatcc tgccactgca ctccagcctg 62281 gcgacagagt gagactctgt ctcaacaaca acaaaagaaa gaacaagtga ggcaaaacct 62341 ggagacccca gcttcatgta acacctagtt tgagtattgt tgagagtttt tcaggaaaaa 62401 agtctgataa cagctccgag atagtcttaa catatgaaaa agcaaaaaag ggaggagaca 62461 gatcatttgt cctatacctt tctcttttaa ggttttaatt ataacttgtg taatacagga 62521 gacctctggg tgtttttagt tgactataaa ctaaatctga gtacacattt cagggctgct 62581 aaaaatgctt atttgaaact gggccgtatt aacacaagca gaggctctgg agcaagtgaa 62641 gtacagatcc agagccccac tgtattctcc aatggagtga ttgcctgaaa gatgatgtca 62701 gttttaagca ccgtgcttgg tttttaacat ggtcactgac aaattggaga gtgtttatcc 62761 agaggtagat ggtaaagata cataaaagta acttgaaata ctgtcttttg aagaagaaat 62821 gagaagattt aaggaaataa gacactgtct tcaagtatct gaagaaccgt tacccggaag 62881 agaactgtta tctggaacag gattaagact cactcatggg gctccagaaa gcagacgagt 62941 gcatggagga cgcagaagat gcagattgtg tggctcaact ctaaaatctt tctaacaaaa 63001 ttagttctct ggatgtgttc cagttcactt gatgatgatt cttttgtttt tgtttttgtt 63061 tttgaggtgt agtttttcac tcttgttgcc caggctgctg gagtgcaatg gcacgatctt 63121 ggcttgctgc aacctccccc tcccgggttc aagcgattct cctgcctcag cctcccgagt 63181 agctgggatt acaggaatgc accaccatac ctaattttgt atttttagta gagacagggt 63241 ttttccatgt cagtcaggct ggtcttggac tcccgacctc aggtgatcca cctacctcgg 63301 cctcccaaag tgctgggatt acaggtgtga gccatcgcgc ctagcctatg atgattcttt 63361 tcacagagat acaggcactt aaggagagga tctaaacccc ttggacacat tgccgttgaa 63421 cttctaagat cttaggtttc cacttactca tgaaaattat accacagggt cagagggtag 63481 tgttcattgg agccaggtgc cagaacaagt tattacaaac tactatttta gagaaaaatg 63541 tcattaaagt ttaagatacc ttaagctata ggtttgcatc aaagttaatg aaaggtaaaa 63601 agatgccaag cgtggtggct caggcctgta atcccagcgc tttggggggc caaggcgggc 63661 agatcacgag gtcaggagat cgagaccatc ctggctaaca cggtgaaacc ccatctctag 63721 taaaaataca aaaaattagc cgggcatggt ggcgggcatc tgtagtccca gctactcagg 63781 aggctgaggc aggagaatgg catgaaccca ggaggcagag cttgccgtga gctgagatcc 63841 agccactgca ctccagcctg gctgacagag caagactgca tctcaaaaaa aaaaaaaaaa 63901 aaaaatgcaa atcaaatcta aagtagttca gtctttaaac tcaaagccaa tacatttgct 63961 ttgaactaca aatgaactga agtttttaag tgtaataaat gttactaaat cggcttttgt 64021 agcagttaaa caaaaaactt caaaaattgt aaggattctg tgagggagca tggctgctgc 64081 tgctgctgct gcttgcagat agcctgctgt gtttaggatt tagttaaata catttctcct 64141 gtttaaaact aaatggtctt tccttagttt gcttagttct tcagaagggc ctttgaaaca 64201 ctgggaaata aacaagtgat tctttagcta ctgctttctg aaatacttat ataaaagctc 64261 tgcactgtat tctcccatcc ctctcagggg aatattagag ggttaggact ccccaggtag 64321 acattctagg ggtgaaaatt tgtcattaca ttgacatttc agatttaggt tttcaacaat 64381 actgttttct tctttcacat attgccatct agtaatatag atgttctccg tccacattaa 64441 tcaaaactat tgacatggat aattcctaat tccttgaaca ctataatgga gatctatagc 64501 tagccttggc gtctagaaga tgggtgttga gaagagggag tggacagata tttcctctgg 64561 tcttaacttc atatcagcct cccctagact tccaaatatc catacctgct ggttataatt 64621 agtggtgttt tcagcctctg attctgtcac caggggtttt agaatcataa atccagattg 64681 atcttgggag tgtaaaaaac tgaggctctt tagcttctta ggacagcact tcctgatttt 64741 gttttcaact tctaatcctt tgagtgtttt tcattctgca gatgctgagt ttgtgtgtga 64801 acggacactg aaatattttc taggaattgc gggaggaaaa tgggtagtta gctatttctg 64861 taagtataat actatttctc ccctcctccc tttaacacct cagaattgca tttttacacc 64921 taacgtttaa cacctaaggt ttttgctgat gctgagtctg agttaccaaa aggtctttaa 64981 ttgtaatact aaactacttt tatctttaat atcactttgt tcagataagc tggtgatgct 65041 gggaaaatgg gtctctttta taactaatag gacctaatct gctcctagca atgttagcat 65101 atgagctagg gatttattta atagtcggca ggaatccatg tgcagcaggc aaacttataa 65161 tgtttaaatt aaacatcaac tctgtctcca gaaggaaact gctgctacaa gccttattaa 65221 agggctgtgg ctttagaggg aaggacctct cctctgtcat tcttcctgtg ctcttttgtg 65281 aatcgctgac ctctctatct ccgtgaaaag agcacgttct tctgctgtat gtaacctgtc 65341 ttttctatga tctctttagg ggtgacccag tctattaaag aaagaaaaat gctgaatgag 65401 gtaagtactt gatgttacaa actaaccaga gatattcatt cagtcatata gttaaaaatg 65461 tatttgcttc cttccatcaa tgcaccactt tccttaacaa tgcacaaatt ttccatgata 65521 atgaggatca tcaagaatta tgcaggcctg cactgtggct catacctata atcccagcgc 65581 tttgggaggc tgaggcgctt ggatcacctg atgtcgggag ttcaagacca gcctgaccaa 65641 catggagaaa ccccgtttct actaaaaata caaaattagc cgggcttggt ggcacttgcc 65701 tgtaattcca gctactcggg aggctgaggc aggagaatca cttgaacctg ggaggcgggg 65761 gttgcagtga gctgagatcg catcattgca ctctaacctg ggcaacaaga gcaaaactcc 65821 atcaaaagaa aaaaaaaatc gggtgcagtg gctcatgcct gtaatcctaa cactgtggga 65881 ggccaagaca ggcagattgc ctgagctcag gagttcgaga tcagcctggg caacatggtg 65941 aaaccctgtc tctactaaaa tacaaaaaat tactcagcgt ggtggcatgc gcctttagtt 66001 ccagctactc aggaggctga ggcaggagaa tctcttgaac ccgggaggtg gaggttgcaa 66061 tgagccaaga tcgtgccact gcactccaac ctggcaacag agcgagactc cgtcttaaaa 66121 aaaaaaaaaa ttttgcagcg caaaccagga tatcctctgt tctcatttgt tctagatttc 66181 aaaagaaaca gtcctttctt tggggaaaag agaaaggaaa aggagtttta taaaaggaaa 66241 gaaaagattc ataagaacaa gaagtgggcc cacttgcata tacctttgta gaaaactgtt 66301 cactgttgtt gaagaaaagc tcttcatatt aatatgcagt ccagatgcag tggctcacac 66361 ttataatctc agccctttgg gaggctgaga caggaagatt acttgaggcc aggagtttga 66421 aaccagcctg ggcaacatag tgagactctg tctccacaaa attttttttt aattagccgg 66481 gcatggcagt gtgcttctgt agtcttagct actgaggaag ctaagccaga agaatcactt 66541 gagcccagga gttcaaggct gcagtgagct atgatcatac cattgcactc ttgcacttgc 66601 acagagcaag accctgtctc ttaaaaaaaa aaaagtgtgt gtgtgcatat gcatatatac 66661 atatatatac atgcaaatgt atctgtttat aattcagatt gcttcaaaaa gatgttgcac 66721 tttatgatac tgagaacagt gagaagtaaa taagatagag tgtaggagga ggaataattt 66781 cagaacagcc atctgagaac ttctgtgaca acagatcagg caaaatgaaa tgtgaaagta 66841 attttatagg ccaggcgtgg tggctcatgc ctataatccc agcactttga gtggccaagg 66901 caggtggatc acttgaggtc aggagttcga gaccagcctg gtcaacatgg tgaaaccttg 66961 tctctactaa aaacacaaaa aaattagtcg agcgtggtgg catgtgcctg taatcctagc 67021 tgctggggag gctgaggcag gagaatcact tgaacccggg aggcggaggt tgcagtgagc 67081 ctagattgca ccactgcact ccagcctgtg agacagaatg agaccctgtc ttaaaaaaaa 67141 aaaaaaagta attttataaa ctattgtgca caattcgatg tattcataat taattaaatg 67201 attatttttg ttggttttaa cttttattca gtggctattt attgggagcc tactgtgttc 67261 tgggcactag gaatgcaaca gtaaataaga ctaactaagt ccctggtagg attcaggttc 67321 tgtcgagggg agatacacaa taaagatgaa tttaagataa caataaatgc tatggagaaa 67381 tatacagaac agtggaatag tattagctgt caaaggttgt tgattacttt cgtttaagga 67441 ggccagggaa agcctttctg aaaaaattga gctgagacct aaataacaag aaataattgt 67501 ccttgaaaaa tgaagggaat gcatcttata ggcagaggaa tagcaaacat aaaggtcttg 67561 aggtaataat gagtgtggtt ttttgatttc tgtattttgg tttttttgag atggtgtctc 67621 cctctatccc ccaggctgga gtgcagtggc acaatcttgg ctcactgcaa actctgtctc 67681 ctgggttcaa gcaattctcc tgccttggcc tcctgagtag ctggtattac aggcacgcgt 67741 gctaccacac ccgactagtt tttattttta gtagagatgg ggttttacca cgttggtcag 67801 gctggtctca aactcctgaa ctcaagtgat ccaaccacct caacctccca aagtgctggg 67861 atcacaggcg tgagccacca tgcccggcca gagcttggtt tattttttaa aagataggcc 67921 aatgttggtc gtgtgtggtg gctcgtgcct ataatcccag cactttggga agccaaggca 67981 ggcaaatcac ttgaggtcag gagttcgaga ccagcctggc caacatggtg aaaccccatc 68041 tctactaaaa atacaaaaaa ctagcatggt gtggtggtgt gtgcctgtaa tcccagtgcc 68101 tgtaatccca gctactccag aggctgaggc aggagaatca cttgaaccga aaggtaggag 68161 ttacagtgag ccaagatcgc atcactgcac tccagcctga acgacagagc aagactcctg 68221 tctcaagaaa taataatgat aaaaggttcg ggcacagtgg ctcacacctg taattccagc 68281 actctaggag gccgaggcag gcagatcccc tgaggtcagg agtttgagac cagcctggcc 68341 aacgtggcaa aaccccatct ctactaaaaa atgcaaaaat tagctgggca cggctgggtg 68401 tggtggctca ttcctgtaat cccagcactt tgggaggtca aggcggacag atcactgagg 68461 tagaaaccct gtctctacta aaaatacaaa aatttgccca gcgtggtggc gcgtgcctct 68521 aatcccagct acacgggagg ctgagacaag agaatcactt catcaacccg ggaggtggag 68581 gttgtggtga gctgagatcg caccattgca ctccagcctg ggcaacaaga gtgaaactcc 68641 atctcaaaaa caaaaaaaaa ttagctggga atggtggcat gtgcctgtaa tcacagctac 68701 ttgggaggct ggggcaggag aatcgcttga acccaggagg cggagattgc agtgagctga 68761 gattgcgcca ctgcactcca ggctgggcga aagagcaaga ctccgtctca aaaataataa 68821 taataataat aataggccag tgtagctgga gtaatttgca aattatgtgt ggaggcagag 68881 attacacaag gaatgggaga aggtcataga tgagggccag atcacatagt atttggtggt 68941 aaggaattca gattttatcc ttgtggtaat tggtggtgtg gagatggtta aaaacaaggt 69001 tggtttggga tgggtttgaa gagaggactt gctaatggat taaatttgga ggataaggta 69061 aagagaaatt gaaggagtga cacttgggtt ttggcttgaa caatagatct tgttagtaat 69121 attaaattag atgaagaagg catggtaggg aatatggggg agtgggaaag gcaggaagca 69181 ggaatggaac caggaactct gttttagatg tgagaatttg ttgttgttgt tgttgttgtt 69241 gttgttgttg ttgttgttgt tgtgacagca tctcgttctg ttgcccaggc tagagtgcat 69301 ggagtgcggt agcacgatct cagctcactc caacctccgc ctcccggttc aagtgatttt 69361 cctgcctcag cctcccgagt agctgggatt acaggcacct gccacaatgc ctggctaata 69421 cttgtatttt tagtagagat ggggttttac catgttggcc aggctggtct taaactcctg 69481 acctcaggta atccacccac ctcggcctcc caaagtgctg ggattgcagg tgtgagccac 69541 tgtgcccggc cagatgcatg aattttgaga tgtatactag acttctggat agagaagtta 69601 agtaggcagt tggacacatt gtatgaagct caggggtaca aggaggacta tgaacatggg 69661 agtcttctga caaatttatc actagactcc tcattcaagt aactaggaaa tgtcagatat 69721 tcttccccta gtaatagcca gtggttatac tcttgccttt agttttcttc acaatactct 69781 tggcaacaca taaggccttc cctacaatct gagtttcagt cagaattgtt tctgagcgtt 69841 cttcctcaaa tttctcccca gtctcattat tctttattct catgtccatg accagtcata 69901 atagtaatta tgaaaaacct ctaactttct ttagtgcatt gaatgtatat tttatcattt 69961 tggttgtgtt aactgtaaat ctctcagtgg aaatctgaaa agcctttatt tccttagatg 70021 ataatataca attgatttag gagataggga atttttcagt tacctttata acagcacagt 70081 attagcagtc taatctaaat gctaagtgaa tgttttgaga ggagatagat gttgaaaatt 70141 aaaatacatt aagtcccagt gaggtgaaaa gccgattgtt aagttctgca cacaaaagat 70201 ttgcttcagt gaattgattt caacagctga gatcctagtc atttcacctg gtctaccaaa 70261 aagaatgatt ttacttgctt ttggtcaaat ctctgcccag caattctttt tctttctttc 70321 ttttttttgt tttatgtgtg tgtgtgtgtg tgtttttttt tagcagagtc tcactttgtc 70381 acccaggcgg gagtgtggtg gtatgatcac agttcactgc agcctccaac tcctgggctc 70441 aagtgatcct ccagcttcag cttttcaaga aattgggact gcaggcacat gcaactatgc 70501 ctggctgagg ttttatgtat cttttttcta gagaaggggt ctcactgtgt tgcccagctg 70561 ggtctccagc tcctggtctc aagctgtcct cctgcctcag cctcccaaag tgccaaagtg 70621 ctagggttat aggtgtgagc cattggtgcc cagctactgc ctgcctggca attctgaatg 70681 ccttaaattt tttttttttt tttttttttt tttgagacag agtttcactc tgtcacccag 70741 gctggagtgc agtggcatga tcgtggctca cagcaacctc tgcctcctgg attccagcaa 70801 ttctcatgcc tcagcttccc gagtagctgg gactacaggt gcatgccacc acgcccagct 70861 aatttttggt ttttttgttt gtttgtttgt ttgttttgag acggagtctc gctcagttgc 70921 ccaggctgga gtgcagtggc gtgatctccg ctcactgcaa gctccgcctc ccgggttcac 70981 gccattctcc tgcctcagcc tcccgagtag ctgggactac aggcgcctgc cactacaccc 71041 ggctaatttt tttgtatttt aagtagagac ggggtttcac cgtgttagcc aggatggtct 71101 cgatctcctg acctcgtgat ccgcctgtct cggcctccca aagtcctggg attacaggcg 71161 tgagccacca cacccggcct aatttttttt tttttaattt tatttttaat tttttgagat 71221 gcgagatgga gtctcgctct gttacccagg ctggagtgca gtggcaccat ctcagctcac 71281 tgcaacctcc acctcctgca ttcaaaagat tctcctgcct cagcctccca agtagctggg 71341 attacaggtg cctgccacca cgcccaacta attttttgta tttttagtag agatgaggtt 71401 tcaccatgtt ggtcagactg gtgtcgaact cctgacctca agtgatctgc ctgcctcagt 71461 ctcccaaagt gctaggatta caggggtgag ccactgcgcc tggcctgaat gccttaaata 71521 tgacgtgtct gctccacttc cattgaagga agcttctctt tctcttatcc tgatgggttg 71581 tgtttggttt ctttcagcat gattttgaag tcagaggaga tgtggtcaat ggaagaaacc 71641 accaaggtcc aaagcgagca agagaatccc aggacagaaa ggtaaagctc cctccctcaa 71701 gttgacaaaa atctcacccc accactctgt attccactcc cctttgcaga gatgggccgc 71761 ttcattttgt aagacttatt acatacatac acagtgctag atactttcac acaggttctt 71821 ttttcactct tccatcccaa ccacataaat aagtattgtc tctactttat gaatgataaa 71881 actaagagat ttagagaggc tgtgtaattt ggattcccgt ctcgggttca gatcttagct 71941 gataagtgga agagctggga ctttaagcag atgagaatct aaagactttg ctcttttcac 72001 ttcactgggg tgtctttctc tctctctctc ttgctctctc tctctctttt tttttttccc 72061 aagacggagt ctcactccat tgcccaggcc agagtgcagt ggtgcgatct cagctcactg 72121 aaaactcatc ttgcccaggc tggtcttgaa cccctgacct tgtgatcctc ccgccttggc 72181 ctccccaagt gctgggatag gcgtgagcca ccgtgcccag ccaataatag ctaaaattta 72241 tataatgttc actgggccag gcacagcggc tcgttcctgt tatcccagca ctttgggaag 72301 ctgaggcagg cagatcgctt gagccaagga gttcgatacc agcctgggca acatggcaaa 72361 accccatctc taccaaaaaa aatatacaaa aattagccag gcgtggtggc atgtacttgt 72421 agttccagct actcggaagg ctgagttgag agtatctctt gagcccaaga agaggggact 72481 acagtgaacg gagattgcgc cactgcactc cagcctagac gacagacaga agatctcaaa 72541 agaaaaaaaa aaaaaaaaga tcactttatg ctgggactgc tctaaaggcc caaccatgtt 72601 ttaactaatt aacaatttta tgacaactct atgagctatg tactgtaatt atgcctatat 72661 tacagatgtg aaaattgagg ctcagagagg ttgaataagt tgctcaaagt cacacaggta 72721 ataagtgatg gaactagaag ttgaactcag gaagtctagc tccaagtcta aattctttgt 72781 taatttattt ttcgggccag agtcttactc tgtcacccag gctggagtgc agtgccacta 72841 tctctgctca ctgcaacctt cacctcccaa gttcaaacct tgttcaattc ttgtgccttg 72901 gcctcccaag tggctaggat tacaggcatg tgccacaaca actagctaat tttttgtctg 72961 attctgttgg ccagtctgga gtgcagtggc gcaatctcag ctcactgcag tctccagctc 73021 ccaggttcaa gtgattctcg tgccttagcc tcccaaatag ctgggattac aggcacgtgc 73081 caccacaccg agatagtttt ttgtattttt aatagaaaca aggtttcaac atgttggcca 73141 ggctggtctc aaattccaga cctcagatca tctgcccgcc tcaggctccc aaagtgctgg 73201 gattacaggc atgagccact gcacccggcc ttaattttta tatttttatt agagatgggg 73261 ttttgccatg ttggccaggc tggccttgat ctcctggcct ccagtgatcc acccgccttg 73321 gcttcccaaa gtgctgggat tacaagcatg agccactgca cccggcctcc aattctaaac 73381 tcttaacaac aatactatag tttcttgaaa agttgttgaa ggcttcacgg agggaaaaaa 73441 aatggagcat tctaacaact ttgcagatga gacccaagaa gactcaatga ctttctcctg 73501 atcatattgt agcagatgac ttagccagaa ctctgacttc ctcacaggga gaaagtctgc 73561 aagatttcac acttacctgt caggcctgag ctggctgctt tctcagctcc ctaagtgcta 73621 tgttcccagt ctgcttttct tcctttttca agtgtgcact accaggcatt tcagaacatc 73681 ccaggctggt cgcggtggct cacacctgtg atcccagcac tttgggagcc caaggcgggt 73741 ggatcacctg aggtcaggag ttcgagacca gcctggccaa catggtgaaa ccccatctct 73801 actaaaaata caaaagttaa ctgggcgtgg tggtaggcac ctgtaatcct agctcaggat 73861 tactcgggag gctgaggcta gagaatcggt tgaacccagg aggcggaggt tgcagtgagc 73921 caagattgcg ccactgcact ctagcctggg gacaagaggg agacttcatc tcaaaaaaaa 73981 aaaaaaaatc ccagctgggc acagcggctc acttctgtaa tcccagcact ttaggaggcc 74041 aaggcaggag gatcacttga gcccaggagt tcaagactag cctgggcaac atagtaagac 74101 cctgtctcta caaaaaaatt taaaaattaa ttgggtgtcg tagcacactc ttgtattccc 74161 agctactcag gaggctgagg tgagaagaat gcttgagtct gggaggtcga ggctgcagtg 74221 agccatgatg gtgctactgc actccagcct ggccaacatt gtgagacctt gtctcaaaac 74281 aaaacaaaac atccttctac tgagcacttt ctgtcccttt atagaaactt aagagggaac 74341 cagtagaggt aatttcctaa ggaaaactgc tttgggacat gatcacaaat gaagcctgga 74401 gttttgaact gctgaggtca gcctgttttt accttctgag cctatcaagt aattgttcca 74461 gatgccaaga aaagctgctg gccttatttc tgcttctgcc tttaccacag gggagcgcca 74521 tgtgagccag tcctctgttt ttcctccact gtatgctagg cagtattagc accagattct 74581 tcccctcttt aaaaagaaat tctagtgctt tggatttttt cctccatgca gaatagcaat 74641 gatggaaagt atgtggtcaa agtaatgaca ttctgaaaat actaaatgtc accatagtat 74701 ttttctctgg aagagaaatg tatatgtaga ggtgaaactt caaatttctt tttttttttt 74761 tttaagacga agctttgctc ttcttgccca ggctgaagta caatggcgtg atcttggctc 74821 accgcaatct ctgcctccag ggttcaagtg attctcctgc ctcagcctcc taagtagcta 74881 ggattacagg catgtgccac cacgcccagc tgattttgta tttttagtag agatggggtt 74941 tctccatgtt ggtcacgctg gtcttgaact cccgacccca agtgatccac ccacctcggc 75001 ctcccaaagt gctaggatta caggccaccg cgcccggcct gaaacttcaa atttcttttt 75061 ttttttgaga cagagtctcg ctatgtcacc caggctggag tgcagtggcg ccgtctcggc 75121 tcactaccag ctccactcca cctcctgggt tcacaccatt ctcctgcctc agcctcccaa 75181 gtagctggga ctacaggtgc ccgccaccat gcccagctaa ttttttgtat ttttagtaga 75241 gacgggtttt cactgtgtta gacgggatgg tctccatctc ttgacctcgt gatccgcctg 75301 cctcagcctc ccaaagtgct gggattacag gcgtgagcca ctacgccaag cccgaaactt 75361 caaatttctt atctcataac taggcatcct tatcactgag tgttagcctg gatataaaca 75421 ttcctaatct tttgtacttt tcatgtcagc atttggctcc acttggctgc ctggggagaa 75481 cttctagcat tatgagcatg caggtcctat caacaggttg ggggtgcggt ttattcatac 75541 aggtagtgag agtggcacag atggatgctg tcccttaaaa caaacagact tgtctttggg 75601 agcctgaggc gggtggatca tgaggtcagg agttcaagac cagcctggcc aacatagtga 75661 aaccccgttt ctactaaaaa tacaaaaaat tagccgggtg tggtggtgtg cacctgtaat 75721 cccagctact agggaggctg aggcaggaga atcacttgaa cccaggaggt ggaggttgca 75781 gtgagccgag atggcaccat tgcactccag cccaggcgac agtgcaagac tgcgtctcaa 75841 aaaaaaaaaa aaaacacaca gacttgtcct actgccattt cttttcactc tggcggtaaa 75901 gtaagagtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtttct 75961 gtccgtctgt ctgtcaaggg ggagggtgac cactttctaa aaggccatcc gtgtattttt 76021 agcttcctga tttttttctc tatcgcagtc tctttgaagc caggtgaatt ttaggccttg 76081 gcaattttct ttttattgca atgggaaggt caagacactg agagtcaccc aaaacatatc 76141 catccaaaat gatacaattt tagggtttat ttttaagtga tacccaagtt atttgctaag 76201 aacctatgcc agtgtgttta tgagaatttg cactgtccca cactgttgcc accagccaca 76261 tgggactgtt taaatttaaa ttttacaaat tagccagtca tggtggtgtg cacttgtagt 76321 cccagctact taggaggctg aggcaagagg attgcttgag cccagaagtt caatactgca 76381 gcaagctatg atcgtgccac tgtactccag cctgagtgac agaatgagac ctcatctctt 76441 caaaaaaaaa gaaaaaaatt aaaatatgaa gtttagttct tcattcaccc taaccacatc 76501 tccagtgctc aataactata tgtgactcat ggctacctta ttagcataga tatagaacat 76561 tgtgactatc acagaaagtt gttttgaaca gtgttgccaa gccctgtaag tggaagaggc 76621 agtgcagtgt gatctgtgtc ttcaggaaac caggtagtca gactagttca atgaggagag 76681 gcagaacctg gcttcacttc tagattaaaa actgcttagg tggcctaaag atacaatggc 76741 cattctcaga gtagtgagaa ggaaggaaca gatgtttagg gggctagaag aaagtcagag 76801 agggccgggc gcagtggctt atgcctgtaa tcccagcact ttggaaggcc aagacaggca 76861 gatcacgagg tcaggagttc gagaccagcc tgaccaacat ggtgaaaccc tgtctctact 76921 aaaaatacaa aaattagccg ggcgtggtgg tgcgcgcctg taatcccagc tactcaggag 76981 gctaaggcag gagaatcgct tgaaccctgg aggcagaggt tgcagtgagc ccagatcgca 77041 ccactacgct ccagcctagg tgacagagag agactccgtc tcaaaaaaaa aaaaaaaagt 77101 cagaggagac aaggagcatg tacacctaaa atcaacatag acccctctgt tgatggggtc 77161 atagtgagta cttgaggtac caagtctgga taaacatcaa acttcagcca ataactttga 77221 gtttctagcc atccaagcct cttattaaac atacagaagg accttttttc ccttgcatct 77281 aacaagttaa agcacctgca gagatcatta gggaggagcc ttggcctgat tggtgacaaa 77341 agtgagatgc tcagtccttg aatgacaaag aatgcctgta gagtgcaggt caactacata 77401 tgcacttcaa gaagatcttc tgaaatccag tagtgttctg gacattggac tgcttgtccc 77461 tgggaagtag cagcagaaat catcaggtgg tgaacagaag aaaaagaaaa gctcttcctt 77521 tttgaaagtc tgttttttga ataaaagcca atattctttt ataactagat tttccttctc 77581 tccattcccc tgtccctctc tcttcctctc ttcttccaga tcttcagggg gctagaaatc 77641 tgttgctatg ggcccttcac caacatgccc acaggtaaga gcctgggaga accccagagt 77701 tccagcacca gcctttgtct tacatagtgg agtattataa gcaagatccc acgatggggg 77761 ttcctcagat tgctgaaatg ttctagaggc tattctattt ctctaccact ctccaaacaa 77821 aacagcacct aaatgttatc ctatggcaaa aaaaaactat accttgtccc ccttctcaag 77881 agcatgaagg tggttaatag ttaggattca gtatgttatg tgttcagatg gcgttgagct 77941 gctgttagtg ccaacatgtt agtgagaaaa tatctttgga taggtaaaaa tcaaggagga 78001 gttctcctct tcctaaacca tcttaattta cttacataga agaaagcaca gcagctggcc 78061 caccacggac gggcccagag caggggaaga ttctcggtga acatttcttt ttttttttct 78121 ttttttttga ggtcgagtct ctgttgccca ggccagagtg caatggcgcg atctcggctc 78181 actgcaacct ccacctcccg ggttcaagtg attctcctgc ctcagcctcc caaatagctg 78241 agactacagg cgtgtgccac cacgcccgac taattttttg tatttttttt ttagtagaga 78301 cggggtttca ccgtgttagc caatatggtc ccgatctcct gacctcgtga tccacccgcc 78361 tcagcctccc aaagtgctag gattacaggc atgagccact gtgcccagcc ctctccatga 78421 acattttcta attaaacttg acacttaata caatgttatg cttaggactg ctataaagct 78481 tacctctgga gttgcgcagc acaaaggcct tggtgtgtgt ataaatttgg tttgttcttt 78541 tcacagcaaa agctacccac ctttgcctcc tgtgcctgct tctgcccagg gacttaggtc 78601 ctcttacacc ttagagaaag gccttagcat ctggtcacag gcagatgagt gacagcaaga 78661 aaacctggct gcaatgtaat tttgtttcca tcctctttat tagttatcaa ttggattttt 78721 atgaaatttc caagttccac tcaaggattt ctcagtgttt ttttactttg gtatagtgga 78781 aaccagggtt gccagaaagt attattttgg gggtgagtta gtcaaccttc gttcagtcag 78841 acagacagga gcacctcagc aattcccaga aacgggctga tgggaaagag caacatacat 78901 gaatgtcttg aagaacacag ccaacagagc ccattgggca gttctgattt tccaggtaca 78961 cagcatctcc acagtctctt ctgattttta ttcccctgag tatatggatt ccagctcagc 79021 atgtagcctt tccctgctga gtctctaacc aggataacat gtattttttt gactggatga 79081 attatcttcc catctcttga catttacagt aattaccacc aagtatggta ttttcagtgg 79141 ccgtgattat cagttaccaa cacagaatta ggatgaaggg aggaagggag ggaaggaagg 79201 tgggtgtttt ttcacacagt gtcttagcca gcaatttagc aaattaatgg aaattagatc 79261 tttgattttt ttttctttca agcattttat ttgagagact atcaaacctt ataccaagtg 79321 gccttatgga gactgataac cagagtacat ggcatatcag tggcaaattg acttaaaatc 79381 cataccccta ctattttaag accattgtcc tttggagcag agagacagac tctcccattg 79441 agaggtcttg ctataagcct tcatccggag agtgtagggt agagggcctg ggttaagtat 79501 gcagattact gcagtgattt tacatctaaa tgtccatttt agatcaactg gaatggatgg 79561 tacagctgtg tggtgcttct gtggtgaagg agctttcatc attcaccctt ggcacagtaa 79621 gtattgggtg ccctgtcaga gagggaggac acaatattct ctcctgtgag caagactggc 79681 acctgtcagt ccctatggat gcccctactg tagcctcaga agtcttctct gcccacatac 79741 ctgtgccaaa agactccatc tgtaagggat gggtaaggat ttgagaactg cacatattaa 79801 atatactgag ggaagacttt ttccctctaa ctctttttcc catatgtccc tccccctcct 79861 ctctgtgact gccccagcat actgtgtttc aacaaatcat caagaaatga tgggctggag 79921 gctgggcatg gtggctcatg tctgtaatcc cagcactttg ggaggccgag gcaggtggat 79981 cacttgtcag gagtttgaga ccagcctggc caacatggtg aaaccccatc tgtactaaaa 80041 aaaaaaaaac aaaaagtagc caggcctggt ggagcatgcc tgtaatgcca gctatttggg 80101 aagttgaggt gtgagcatcg cttgaacgtg ggaggcagag gttgcagtga gccaagattg 80161 caccactgca ctccagactg ggtgacagag tgagactttg tctaaaaaaa aaaaaaaaga 80221 gagagagaga aaagctaggt gcggtggctc acgcctgtaa tcccagcact ttgggaggct 80281 gaggtgggca gatcacgagg tcaagagatc gagaccatcc tggccaacca acatggcgaa 80341 accccgtctc tactaaaaat acaaaaatta gctgggcgta ggggcgcacg cctgtagtcc 80401 cagctacttg aggggctggg gcaggagaat cgcttgaacc ccggaggcgg aggttgcagt 80461 tagccaagat cgcgccactg cactccagcc cgggcgacag agcgagactc cgtctcaaaa 80521 aaaaagagag agagagaaat gatgggctgg gccagtgccc cacccctgta atcacaacac 80581 tgggaggcca aggtgggaga atcgcttgag cctgggagct gaagaccagc ctgggcaata 80641 cagtaggacc tcatgtctac aaaaaaatta ttaaaaatta gccaaggctg ggtgcggtgg 80701 ctcatgccta taatcccggg ggtgaagttg agcccaggag tttgagacca gcctgggcaa 80761 catggcaaaa ccctgtctct accaaaaata caaaaaaatt agccaggggt ggtggtacgt 80821 gtctgtagtt ccagctactt aggaggctga gatggaagga ttgcttgagc ccaggaggca 80881 gaggtggcag tgagctgaga tcacaccact gcactccagc ctgggtgaca gagcaagacc 80941 ctgtctcaaa aacaaacaaa aaaaatgatg aagtgacagt tccagtagtc ctactttgac 81001 actttgaatg ctctttcctt cctggggatc cagggtgtcc acccaattgt ggttgtgcag 81061 ccagatgcct ggacagagga caatggcttc catggtaagg tgcctgcatg tacctgtgct 81121 atatggggtc cttttgcatg ggtttggttt atcactcatt acctggtgct tgagtagcac 81181 agttcttggc acattttaaa tatttgttga atgaatggct aaaatgtctt tttgatgttt 81241 ttattgttat ttgttttata ttgtaaaagt aatacatgaa ctgtttccat ggggtgggag 81301 taagatatga atgttcatca caaaaacata aatcaaggcc gggcatggtg gctcatgcct 81361 ataattccag cactttggga ggtcaagatg gaggtcaagg tgggagccta gaagttcgag 81421 accagcctgg gcaacataag gagacttcat ctgtacaaca aatttaaaaa gtagctgggt 81481 gtggtggcag atgcctgtag tcgcagctac ttgggaagct gaggtgggag gatcacttga 81541 gctcaggagg ttgatgcttc agtgagccac gatcacacca ctgtactcca gcctgggcga 81601 cagagcgaga ccgtgtctca aaaagaaaaa agaaagtata aatttacaca aaaacaataa 81661 aataatccca gtaattccac cacttggaga tgatcaccat aaaactccac caggcatatg 81721 tgcgtatata tacacgtgta ttttataaaa tgtgatcata attacactgt tttgcttttt 81781 tccttaagat attacataca tttttccaca tcgttaaatt acagtgctgt tttcctggtg 81841 gctttccttt aacagattga agttcatgtt aatacagttg ccagaggctg tgggctttca 81901 ctgtcaccag gagtcactcc tagggcctct tcagagcaag gccttatgtc ctgaagcatt 81961 gccttttttt tttttttttg aggtggagtc tcactctgtc acttagcagg ctggagtgca 82021 gtggcccagt cttggctcac tgcaacctcc gcctcctggg tttaaatgat tctcctgcct 82081 cagcctcagg gcggatcacc tgacatcagg agtttgagac cagcctggcc aatatggcga 82141 aaccccatct ctactaaaaa tactaaaaaa aattagccag gcatggtggc acgcacttgt 82201 agtcccagct acttgggaga ctgaggcagg agaatcgctt gaacccagga tgttgaggtt 82261 gcagtgagct gagatcacac catcacaatc cagcctgagt gacagagtga gactccatct 82321 gaaaaaaaag aaaaaacaat tagcctggca tggtggcagg cacctgtaat ccctgctact 82381 tgggaggctg aggcaggaga attgcttgaa cccgggaggt ggaggttgca gtgagctgag 82441 atcgtgccat tgcattccag gctgagcaac aagagcaaga ctccgtctca aaaaaaaaaa 82501 aaaaaaaaaa aaaaggccag gtgcagtggc tcacgcctgt aatcccagca ctttgggagg 82561 ccaaggtggg tggatcacct gaggtcagga gttccagagc agcctggcca acattgtgaa 82621 acccccgtct ctactaaaaa tacaaaaatt agctgggtgt gatggcatgt gcctgtaatt 82681 ccagctactc aggaggcaga gacaggagaa ttgcttgaac ccaggaggcg gaggttgaat 82741 gagccgagat tgcgccatca cactctagcc tcggcgacag agcaagactc cgtctcaaaa 82801 aaaaaaaaaa aaaaattagc ttctacctca ttaatcctaa gaactcatac aaccaggacc 82861 ctggagtcga ttgattagag cctagtccag gagaatgaat tgacactaat ctctgcttgt 82921 gttctctgtc tccagcaatt gggcagatgt gtgaggcacc tgtggtgacc cgagagtggg 82981 tgttggacag tgtagcactc taccagtgcc aggagctgga cacctacctg ataccccaga 83041 tcccccacag ccactactga ctgcagccag ccacaggtac agagccacag gaccccaaga 83101 atgagcttac aaagtggcct ttccaggccc tgggagctcc tctcactctt cagtccttct 83161 actgtcctgg ctactaaata ttttatgtac atcagcctga aaaggacttc tggctatgca 83221 agggtccctt aaagattttc tgcttgaagt ctcccttgga aatctgccat gagcacaaaa 83281 ttatggtaat ttttcacctg agaagatttt aaaaccattt aaacgccacc aattgagcaa 83341 gatgctgatt cattatttat cagccctatt ctttctattc aggctgttgt tggcttaggg 83401 ctggaagcac agagtggctt ggcctcaaga gaatagctgg tttccctaag tttacttctc 83461 taaaaccctg tgttcacaaa ggcagagagt cagacccttc aatggaagga gagtgcttgg 83521 gatcgattat gtgacttaaa gtcagaatag tccttgggca gttctcaaat gttggagtgg 83581 aacattgggg aggaaattct gaggcaggta ttagaaatga aaaggaaact tgaaacctgg 83641 gcatggtggc tcacgcctgt aatcccagca ctttgggagg ccaaggtggg cagatcactg 83701 gaggtcagga gttcgaaacc agcctggcca acatggtgaa accccatctc tactaaaaat 83761 acagaaatta gccggtcatg gtggtggaca cctgtaatcc cagctactca ggtggctaag 83821 gcaggagaat cacttcagcc cgggaggtgg aggttgcagt gagccaagat cataccacgg 83881 cactccagcc tgggtgacag tgagactgtg gctcaaaaaa aaaaaaaaaa aaggaaaatg 83941 aaactaggaa aggtttctta aagtctgaga tatatttgct agatttctaa agaatgtgtt 84001 ctaaaacagc agaagatttt caagaaccgg tttccaaaga cagtcttcta attcctcatt 84061 agtaataagt aaaatgttta ttgttgtagc tctggtatat aatccattcc tcttaaaata 84121 taagacctct ggcatgaata tttcatatct ataaaatgac agatcccacc aggaaggaag 84181 ctgttgcttt ctttgaggtg atttttttcc tttgctccct gttgctgaaa ccatacagct 84241 tcataaataa ttttgcttgc tgaaggaaga aaaagtgttt ttcataaacc cattatccag 84301 gactgtttat agctgttgga aggactaggt cttccctagc ccccccagtg tgcaagggca 84361 gtgaagactt gattgtacaa aatacgtttt gtaaatgttg tgctgttaac actgcaaata 84421 aacttggtag caaacacttc caccatgaat gactgttctt gagacttagg ccagccgact 84481 ttctcagagc cttttcactg tgcttcagtc tcccactctg taaaatgggg gtaatgatag 84541 tatctacctc ctaggattta ttgaggcagc ttaaatacct tttgtatttc ctgttgctgc 84601 caaaacaaat tgttgcaagg tcagaagtct gaggtggctc aactgtttct ttgtttcagg 84661 tttcatgagg ccaaaataaa ggtgttcgca gggcgtgttc ccttctagag gctctgggtc 84721 cttgcagttc taggactaag atccctgttt cccactggct gttggctggg catcattctc 84781 agcttcttga ggctccccac attcctaggc tcctggcctg tctgcctcca tcttcaaaac 84841 cagcaatggg tggtcaagtt tttctcacac tgaatcttgc tgactactgt atctttctaa 84901 ctcctgccag agacatttct ctgtttctaa gggctcaagt gattagattg cacccacttg 84961 gtaatccaaa gtgatcttca tatcttaagg cccatagcct taattatagc tgcaaagtcc 85021 cttcgcagca gtacctagat tactgttgga atgaataacc agaagacagc aatcaaggga 85081 ggacatcttt agaattctgc ctaccacttg tatttaacat gcttaatcca cagatgacac 85141 tctctaccat tatttcctgg tcctcacact gctcagagat tggaatcctt tttaagcaaa 85201 gagaatgaag tcatcacata gttcagtcct gctgtatttg ctggaaacag tgaaggaaga 85261 tagagaaaat ggagctaact gccaatatta ccattttata atcagtcctc aatcatagcc 85321 ctatgaagtg ggtatttgtt acctcattgg aaaaatggga gttgaatctc aagttccttg 85381 tttgtaagat tttactcaga tttgcacagc taaaaatgac tacatggaga cccaaagcca 85441 cctttctgtt cccatcatca gctttccatc tgcctctgtc actgaccccg ggacagaagg 85501 ttcaagcctt aagggaattt ggagagagaa ctagattttg aggggaactc acactcactt 85561 cccttttggg ccacagtagg agacagtaaa agcagcccca tgtcaggcaa agggtcttac 85621 aggagtggat catggctgct gtttccactt ctctctggct tcccagctta tgactgtgta 85681 tcttagttgt caaagccttc cagttcatcc tcacctacag cttgacttcc caagggccca 85741 tgccagctcc ctgtctacct gccagtgagt tgatgagtct cggtgttagt agtaaaggca 85801 ggcgggaagc aagcagaagt gctactgggc cttgagggta agccaggcct cagccttctg 85861 accccatcac taatgggtta ataggaaaag cagtatccat ctagtacagc ctgccttttc 85921 aggaatagtg agtaaaagca aagatgacta aaatacatta aagttttctg taattgtctc 85981 taaggtctcc caacaaacat ataccccatc tgtttcaagc tctgcataac ctttcccaga 86041 agtcaagttc aggccctggc ctcatggtgc ctggcccagg ttaagagtgc tacctgatga 86101 tggagttaat acacgttgct ttgacctctg actttaagat gtcctcccac ttttccaccc 86161 cgcaatctct agccctctct gggcacagca gcaattggga actagttcct gtactgcctt 86221 tatctcattt tacaaaacaa acttctacaa agaagctgga aaggaaggag gagaaaggat 86281 tatcatgcag gcacagggag ggggcctaga gaagagctct ggcagattat gtccctctta 86341 aaaaatgcaa ccagaatcat caacaaagta tcacctcaaa aatatgcagg agaaaaagaa 86401 aggaatcaat gttggggtgg ttggagcaga aggagccaaa ctgcccagaa ggtgtcctct 86461 gaaggctgcg aggagcagac taggtcagcc ccagaggcag atggcacaca cagaatggca 86521 ggattaccgg tagctgcaga tttattgaca gaagccctag agactgggcc tgctgcccag 86581 ggaatgtggg ggctcactta tcaaagacta ctggaaaatg gctgagccgg caaccccttc 86641 cactacagtg atgcctggtt ttcttgcaca gcctgtaact ctgcctgtaa caaggagaaa 86701 attaaagcaa cgaatctggc caaatagaaa attaaggaaa actagaaaca gctcctatgg 86761 agagcaggga ttggggaggt gtagaggggc tgatcctgaa tgtctggagg atcaggaaaa 86821 tgtaagcaat tgtttaaggg actggtagga atcaagatct ggagagagat cctcccgact 86881 ctggtggctg gggaatatga actgtggaga catggtttca aggactcaaa atgatatgac 86941 agcttaacat ttacagttca gtgcagaggc tctcaaagca tgatcccctg ataggagtgt 87001 gagggggcaa caccatcacc taggaatttg ttagatatgc aaattcccag acccactgaa 87061 tcccagacag gtggggccag caacctgtga atctacaaca ggctcaagtt tgagaacaaa 87121 tgacttagtg taaggggctg ctaattgata taaaatgtta cctgtggtct attattttgt 87181 ctgtagtgaa tattgggctt gttaaggata aataaggttt gtgggctggt aagtggttct 87241 tctctacaag gttgcaacag ctggcttcag ttaacttcaa aaaggccctt taagcaaaga 87301 cagtagtccc cccactcatc catggttttg atttcagtta cttatggtca accatggtcc 87361 aaaaatatta aaaggaaaat tccggaaacc agtcactcgt tttaaattgt acaccattct 87421 aagtattgtt tgacttgttc tattttatta ttagttattg ttgttaatct cttactgtgc 87481 ccaatttata aattaaattt tatcataggt atgtatgtat agggaaaaac ataatatata 87541 tagggtttgg tactatccga gggttcaggc atctacttgt ggtcttggaa catatgcccc 87601 gcagataagg ggggactgct gtacaatgca aaggacaaag attaaattat attagcaatc 87661 taggagcaga agggcaagac tgctttttta aaaaacagct aaaggtttag gaggttttat 87721 taatatttaa attgtattga aaccacagct gcagcctttg actccagcat agagatatgc 87781 aaatatggct ttcaaaagaa aggcaatttc agacagccct caaagtaaca ggaacaaata 87841 aaacaaatga ttttgtaatt tatctttatt gactgatgtt gcacaaggca caggccatac 87901 cctgtgagag tcagcaacag ccgagctctc tgaggagaga agagaaagcc aggctggagg 87961 gagaggcagg ccgacccata gacaggtgac aggaaagaca cagagcaggc agatgggaga 88021 agaagacaac taaattaaaa gggaaggaaa ataaaaaccc agccctgggt cctgtagacc 88081 atctgatctt gctggctctc agcagcaaca acaataatca ttaatgacta tcatttgcca 88141 cactactact aagtgccatg cactattcct cacatacaaa tgaggaaaat gaagctttga 88201 gaggtcaagc aacttaccca aggtcacaca acaaaaggaa ggggcagagc ccagattcaa 88261 agatttgtgt gaggctgaag ccctgtgctc tttccagtgc attatgctgg gaaccagtcc 88321 tgggaggcag tgaataacaa taaggttaat gggccgggcg cagtggctca tgcctgtaat 88381 cccagcactt tgggaggcgg aggcgggcaa atcacgaggt caggagatcg agaccatcct 88441 ggctaacatg gtgaaaccct gtctctacta aaaatacaaa aaattagccg ggcgtggtgt 88501 cgggcacctg tagtcccagc tactcaggag gctgaggcag gagaatggca tgaacctggg 88561 aggcagagct tgcggtgagc cacgatcgcg ccactgcact ccaacctggg tgacacagtg 88621 agactccgtc tcaaaagaaa aaacaaaaca aaacaataag gttaatgatt gaggggacac 88681 tttgtgccca gttctgtggt attctgtatg ggcatgcgtg tgtctgtgtg tgtgtatatg 88741 tatgtaactg tggaaaagag ggtgaaaacc tccatttctg accttcaaat tggttactat 88801 ccaatgagta aggcaagaaa agaaagccaa agaaaacttg cagaattctg gtgtaaaagt 88861 tcttttgggg ccgtgtggtg gggccagctc tgcctgttgt ggaagacttc tggtggaggc 88921 atctcagctg gccttggcct tgagtaaaat ttagccagat gaaaaggaaa gctggagatt 88981 acacaggccc aggtgagagc ctccagctgc tagaattgga ggaaggagca cctgattcag 89041 agagatgaga aaaggcaaga gaatcctgaa aggatacata tctctgaccc tttgtcccca 89101 tccaatctcc ccagaccttc catcccaagc ccaaacacaa ccttacctgc tgctcctttt 89161 caggcaccct ggccaccaaa tataggaacc cataaatttt gctcatactc tatgttctac 89221 taggcaagtc ctgatctgtc atctctacag gccccaatcc ttcccgctca cccctacaga 89281 gccttctcca ggttttctag gccagaatct ctccccactt agaatactcc agaagttttg 89341 ctttatttgt gagactttat tcaattgaag ttacttgtgt gcatatgtta tcctctctat 89401 ttgactagaa ggtccttata atcccttatg accataatta ttttatcttt gatataaccc 89461 agctctgtaa ctagcagata ctttgttagg catccagtgg gtttttccta aatgaatgaa 89521 gtaaaggatg aatgaatgga ctcagtgcat tgaagggctt atccaactat tggttccact 89581 ctcaagacct ttggaaaact agccatgttc tggaatgcta attcccttca atgcctttcg 89641 cccatttttc tatgaccctg atttactcca aaaacaatat aagggatcta agtgtccaag 89701 aatgactcct tctaaaccca cacctaagga ttttctctct ttttgtgtgt gtgtgtgtga 89761 gacagagttt cactcttatt gcccaggctg gagtgcaatg gtgcgatctc agctcactgc 89821 aacctccgcc tccagggttc aagtgattct ccctgtctca ccctcccgag tagctgagat 89881 tacaggcgcc tgccatcaca cccagctaat ttttgtattt ttagtagaga cagggttcac 89941 cacgttggcc aggctggtct cgaactcctg tcctcaggtg atccacccac cttggcctcc 90001 caaagtgctg ggattacaag catgagccac cacacccggc ctctttgatt ctcttttgcc 90061 tatcatgaag tctacccctt tgtaattaat tagaccaatg tccacccaga cagaataaca 90121 ttttccccta tccatcagcg aggtcttctc cgtgatggac attcaaggca gacagagaga 90181 ctgctgctgc aataactggg gaaataatta tggtgttcat gatgatttct ttgcaggttc 90241 aaagcactag cccagccatt atctctccca cttcactagg ataaaattgc taaccccact 90301 ttataggtgc taaaacaggt ccagggcctt gtcaaaggtc actcagtgag ctggtggcag 90361 acctggaaat aactagccta ggagtctcga tattcattag gccacagatg gaaatgccct 90421 cattatgctg tctgggctat gtctgagaga gagtcaacta actggactcc agttaaatgg 90481 agatatgcac tggaagataa gtttgtgact acagagtgtt tttctctgca atgctgcagc 90541 agttggcact ggttaattcc agagggtgtg tgtgtgtgtt tgtgtgtgtg tgtgtgtgtg 90601 tgtgtgtgtg tttaaagcat tatcacgcgt cctagatgag ggaagagagg gtgaatccaa 90661 ggtaacacag acacacaggt aagcagatgt ttgccatctt ctcttgaaag tcatataaaa 90721 ccaaatgaca gtgtatatta gcaggagaaa ctcaggaggc tcttcccagc tgttaggcta 90781 tacgactctg gaataagcta gtacaaatta ggtagaaagt ctaggattgt tcctagagcc 90841 tggtggcggg aggtctttcc tggaggcaaa ggactgtggg gctgtctcag ggccttctgc 90901 agctgctaaa gtgagaagcc tgccgacggg atcatcccca agcccacaga agctctgaaa 90961 gctatggaaa ccaagatctg tacaggagcc acttctggtt tctaatgcct gagagattaa 91021 aatggaaaaa aaaattccca tggaaattca agaatgcaag aatgttctgg ggccaggcac 91081 ggtggctcat gcctgtaatc ccagcacttt tggaaggccg atcacctaag atcagaagtt 91141 caagaccagc ctggtcaaca tggtgaaacc ccgtctctat taaaaataga aaattagctg 91201 ggcgtggtgg tgtgcacctg taatcccagc tactcaggag gctgaggcca gagaatcact 91261 tgaaccctgg agccagaggt tgcagcgagc tgagatcatg ccattgcact ccagcccagg 91321 caacaagagc aaaactccat ctcaaaaaaa aaaaaaaaag ttctacaacg tggccacagg 91381 tccgttctgg ctaaggcagt gatgtccccc tcccaccaaa gcccaaacct tctaacatca 91441 tcctaaagtg tgggaatcac ctcttcacct caggccagct ctgggctttt ctcagcctat 91501 tcatcagcct ccattagtcc tcagctctgc tgaggcctca gcagcttccc agtcccactg 91561 aaggctgtgg ggcatagaat gggcagaggg caggccgggc gtggtggctc aagcctgtaa 91621 tcccagcact ttgggaggcc gaggcgggca gatcacgagg tcaggagatc gagaccatcc 91681 tggctaacac ggtgaacccc atctctacta aaaatacaaa aaattagcca ggcgtggtgg 91741 caggtgcctg tagtcccagc tacttggtag gctgaggcag gagaatggca tgaacccggg 91801 aggcggagct tgcagtgagc caagatcctg ccactgcact ccagcctggg cgacagagca 91861 aaactccgtc tcaaaaaaaa aaaaaaaaag aaagaatggg cagagggcat aaaacctgag 91921 tccagaggtg gtggttgcac aacattgtga atgcactaaa tgcccctgaa ttgtacattt 91981 taaaatggct aattgtatgt tatgtgaatt tcaatcgatt tttaaaaaaa ataaaactga 92041 gccacctttg gggtggggag aggagctggg ccaggctctg aggatttgag ggttgaaact 92101 ccttgcaggg agtgaaatga acgacaatgg ggaggccagt ctggccctcc caactcctcc 92161 tccaggacca gatgggaact ggggctaggg agaaaggccc aactggggcg gcgccgggct 92221 ctgggcagaa gagaagcact cagtgaatgt gaggaggctg cagccgtcgg ctcatttgca 92281 tcataagtga ttggttttcc ctgctcgtcc ctcatcagga cacaatggac agttgtttgc 92341 tggcgcagca gatccattta ccaagggaga gaggagacag agcacaagtg accgatgggt 92401 aatagtgttg aagggtgggc agccgcctcc cctcccctgt gctcccaggc cactgggact 92461 cttgttctca cacaatgaga agggacctta gagagcaaat caccgcttca ctttatagaa 92521 gaagagactg aggtgctgag aggaggtgag ccttgctgtg gtcaaacagc aagaacatag 92581 caaaactaag cattttaaac tctaacctct ggatcctttt tctttgagac aaggtctcgc 92641 tctgtcaccc aggctggagt agtacagtgg cacaatctca gctcactgca acctctacct 92701 cccaggctca agtgatcccc ccacctcaac tttctgtggg ccacgacacc tggctaattt 92761 tttgtagaga caaggtctca ctgtgttgcc cacatttttc tcgaactcct gggctcaagt 92821 gatcctccag ccttggcctc ccaaagtgct aggattacag gtgtgagcca cagcactggg 92881 cctgtttggt ttttgttttg ttttgttttt ttgagatagg gttccactct gtcatccagg 92941 ctagagtgca gtggtatgat cactgctcac tacagcctca cactcctggt tcaagtgatc 93001 ctcccacctc agcctcctga gtagctagga ctataagtat gtgccaccac gcttagctaa 93061 tttttatttc ttttattttt tctagaaaca gtgtttccct atgttgccca ggctggtctc 93121 acctgggttc aagtgatcct cccacctcag cctcctgaat agctgggatt acaagtgtga 93181 gccactgcac caggcctgga ttctaaactg tcatttgagg gttcacttcg tttccccata 93241 ataccttctc tgtgccctta ttcccatctc tgtggtctcc tgttcccggg gccttcgttt 93301 ccatcattct gtctcctggg tctatttctt tttctctgtt tctgttcctc tctgtatctt 93361 ttcctcttgg tttccaggca gctaggataa ttactagact tttaattaac ccctgcccta 93421 acaaaggtgg gtctggcatg aggcagcaat ttagcaagtg tcttggtttg ttttggggat 93481 aggtggggag agaaaaatat gtgttggggc ctattataaa ccaggcacta agacagacac 93541 ataatgcact ttatctcatt tcatcctttc aatcttgcaa ggtaggtgtt cttagagaca 93601 aaaagggtga ggctcagaga ggttaagtaa cttgtgcaag ggcacccagc tagcaaattg 93661 tagttacctg actccaaaac ctctgttctt ccatctcaca ccctggcaca gggtctaact 93721 cagggtaagg ggctcattga tttcaggcgc aagggaggca caaagtcact gaaggagacc 93781 actgttttgt tgctgtcctc taggcagcga ctgcgtcccc ccagagcccc ctccttctct 93841 gagccccttc tgcagcgtgg cgaaatctca caaatgcaag cttttgcccc cagggaggtg 93901 gggaggcagt gatcaagaaa gaaacctgac aaacccagac caaccatggg ggtctccctc 93961 ctgttaacac ccctccctaa cagccctcct ggtggttccc tgtctgcccc tcccctttat 94021 gggtcaagcc tgctggcgtc tgtttcattg tgctgtgggg gaaggggaga gtcaggggtg 94081 agtgggtgtc tgtgtgcatg aacataaggc ctccgggttc attctgacac tgaatgaaaa 94141 actcaccaat tattcgtcca gtctcattaa tatgcagaca gacatctgtt atttaggagt 94201 caacagcaga ggcattttct tgtcgggagg ggcactagtg tacagggctc tcttgtctct 94261 ccgctgctag cgggtagcta ctcaggaatc atcccacacc tcccgacctg gagcctcccc 94321 tctctctgac cctcactcac agctttgagg acagcaagta agggattgac cagacagaga 94381 tggagggaga tctgggaacc tggctggaag gaaggaagca ggagaggagc tgccttgtgt 94441 agaacaaact gagaacaaaa cgctaaaccc tttcctgggg aagagaatgt ggagttgggg 94501 gagagagctg tgccaagagt gcctgcccca ctgggaatct cagggacatg acccctcccc 94561 ccacaccttc ctcagcctgc aggacaagtc tgagtgcatc tgaagcaggg agagggtcac 94621 tatggcaaca tgaagtcctc acccagagac tgcagaaaac gtaataagag gagttcagaa 94681 aaatggagac caagggacct caatcttttt tttttttttt caatttattt attttttttt 94741 attgatcatt cttgggtgtt tctcgcagag ggggatttgg cagggaggga cctcaatctt 94801 tggaggagtc actaaagctc tcttcaggcc ccaagatagg ggtgggaaca agacataacc 94861 acctcctgct ttctgtcttc tgtcttcctc cagcctttaa gtcccagcac aaaatacccc 94921 tccaagaagc ctttcccaac tccctcaggc ccagcttaga agcacttaag ccttggtgtc 94981 ttggttgtaa agaataaaag ttgacatcag ctgagcaaca caatgaggta gatgtggtga 95041 tcagccccct tttctaggtg caaaaactga ggttcagaga ggtgctgggc ctcaccaaag 95101 attccccagg gaagaagcag cagagctcaa cccaggccct gggacttctg cctctgaacc 95161 tgaagctctt cccacgacta ccccctggga gggccagagt cacaagggga ggacccttgt 95221 cagctgaagt gtttcaggag tttgattgag tcctctcttc ccatccaccc tgtccttccc 95281 cctcctccct cctaggcagg cggattgcct aggttaagaa acactagtct gggcgaggtg 95341 gctcacgcct gtaatcccag cactttggga ggccaaggca ggtggatcac taggtcagga 95401 aattgagacc atcctggcta acacggtgaa acctcgtctc tactaaaaaa tacaaaaaat 95461 taggccgggc gcggtggctt atgcctgtaa tcccagcact ttgggaggcc aagacaggcg 95521 gatcacgagg tcaggagatc aagaccatcc tgactaacac ggtgaaaccc cgtctctact 95581 aaaaatacaa aaaatgagcc ggacgtggtg gcaggtgcct gtagtcccag ctactcggga 95641 ggctgaggca ggagaatggc gtgaacccag gaggcagagc ttgcagtgag ccgagattgc 95701 gccactgcac tccagcctgg gcgacagagc gagactccgt ctcaaaaaaa aaaaaaaaaa 95761 aattagccag gcgtggtggc gggtgcctgt ggtcccagtt actggggcgg ctgaggcagg 95821 agaatggcgt gaacctgtga ggtggagctt gcagtgagcc aagattgggc cactgcactc 95881 cagcctgggc aagaaagcga gactctcaaa aaaaaaacca ctagtctggg cgcggtggct 95941 cctgcctgta atcccaacac tttgggaggc caaggtgggt ggatcacctg aggtcaggag 96001 ttcgagacca gcctggccaa catggtgaaa ccccatctct actaaaaata caaaaattag 96061 gtcaggcatg gtggctcacg cctgtaatcc cagcactttg ggagactgaa gcaggcggat 96121 catgaggtca agagatggag accatcctgg ccaacatggt gaaaccctgt ctctactaaa 96181 aatacaaaaa tggctgggca cggtggctca tgcctgtaat cccagcactt tgggaggccg 96241 aggtgggtgg atcacgttag gtcgggagtt caagactagc ctgaccaaca tggagaaacc 96301 ccatctctac taaaaataca aaattagctg ggcatggtgg cacatggctg taatcccagc 96361 tacttcagga agctgaggca ggagaatcac ttgaacccag gaggtgaggt tgccgtgagc 96421 cgagatcgcg ccagtgcact ccagcctggg caataagagt gaaactccgt ctcaaaaaaa 96481 aaaaaaacac taaaattagc tgtgcatggt ggcgcgcgcc tgtagtccca gctactcagg 96541 agactgaggc aggagaatcg cttgaaccca ggaggcagag gttgcagtga gccgagatgg 96601 tgccactgca ctccagcctg aatgacagag tgaaactccg tctcaaaaaa aaaaaaaaaa 96661 aaaaaaaaaa aacacagaaa ctctggcagc catcacagtg tgattatttg tttatttcat 96721 taaatgttta acgaggctac attgtttccc aaaccaatgt ctaatttgtg aaggaaacag 96781 cgcagagaag gaagctgggt gactcctgca tctggggtgg ggaagggagt aaggtcccct 96841 ccctccatcc tacagaggcc tttgaggatc agcaacagtc ccattccctc ctcccaccca 96901 ctgagctcct cagcccagag ccctcctccc cagaaataaa acgtctggca acccagacct 96961 gcagaaaggg accaaaaatc cattcctggt ggtattgaaa atgtattaaa ctttgggggg 97021 tcctccagct gattgatttt tctaattatg tttgctttag atggatattt aaatgcattt 97081 gcattccctg agctcacatg gcaggatatg gaggttggag gaaagagggg gcacaaacac 97141 tccacactct gcactttggt ggttgcaggc ttgaacctgc tatacactga gaagtccaaa 97201 gtggaaaaga gaagccactc agctaaaaat cgcaagtcga tttttatggc aggtccttgt 97261 ggggaaaggg tcagtcctca gagacagatg gagatccacc tagctgggcc tggagcccct 97321 gccctctcct gtacccttag ccgaggactc agggtctttg agtcagtccc taaccaggtc 97381 tcagtttgag ggggtggtta tccaagcaca cttagataat ttcaaatgcc attgaagtta 97441 tcctagaatc tttgagactg gctgagatga actagtccca taggagaggt tgggataggg 97501 atatctgatg atccagggag tggtgggtag ggattccttt cctctcaaga ctggaacctg 97561 gcataaggga aaggagaagc tattttttat tttttatttt ttaatttttt ttagagacag 97621 ggtctcactc tgacactcag gctggaagac aatggcatga tcatagctca ctgcagtctc 97681 taactcctgg cctcaagcga tcctcccagc ttggcctccc aaaggaggaa tcttggctgg 97741 gattataggt gtgagccact gccgagaagc tattatttta aatgacacac ctcagagcca 97801 aatctcccag ctccaacacc acatccagat aaccatctat ccaaaaaaca actctgatca 97861 cttcactctc tgcctgaaat tcctggtggc ttcatcctct gaggatgatt tcattcatcc 97921 tcagaatgaa attctgattc ctctgtggac cctgcagtag cctcagtgta ccttcctagc 97981 cttgtcttct attctccctg ccatgggagc cccaacagtg ctatgctcat tctcatctcc 98041 acgtgtttac acatgctgta ccctctgccc agagtgcctt tcctacccct tccctgcccg 98101 gaaaactcct cttcaaccct caggacctgg ctcacaggac tggcttctca ggctgcaagg 98161 agctcccata gcatcccata catgtacaaa tatccctcag ccatggcacc ctcacacctg 98221 aggtacctct cttccacact gggctggcct ccaaaagctt agagactggt ccttgcaatc 98281 tcccaagcca gtgtatgcca cacagttggt gttcagtaca tacttgctga atgaatgaat 98341 gagggaggaa tgggctataa atttgggtgg gatcccagca gatagttggg taaggtcagt 98401 gttctcttcc agtgtgtctg ggagaactgg ctagggctgg gggagggaag ggccagggat 98461 ggttcctggg ggagaatgtc accgaaaaga ggccagtggg accagagcca ggaagggaat 98521 acaggacaat ctgaaaccag actcccgaga aaacagacca gtactgtctt tcctgacaac 98581 aggcgctcag ccgtcctctc caccgtcttt cctttaaggg acagggtagg ggtgactcta 98641 acagctgatg ctcccctgaa agccatcatg aaactcagca tgggaggaga gaaaggtccc 98701 tggcctgagc ctctaaggag accccagaat caaactactg acctcttagg aaacttcacg 98761 ctgtacaggg gtagcttctg tgatgtggag gcttttgatg ccttcttttt tttttttttt 98821 tttttttttg gagacagagt cttgctctgt catccaggct agaatgcagt ggcgcaatct 98881 cggctcactg caagctccgc ctcccaggtt cacccgggtt cacaccattc tcctgcctca 98941 gcctcccgag cagctgggac tacaggcact gccaccgcgc ctggctaatt ttttgtattt 99001 ttagtagaga tggggtttca ccgtgttagc caggatggtc tcgatctcct gaacttgtga 99061 tccgcccacc tcggcctccc aaagtgctgg gattacaggc gtgagccagc gcgcccggcc 99121 tgatgcctcc ttatccccac aacccccagg gtaccagcag gctccaagcc aggggtacag 99181 atggtgagca ggacccctcc cacactagca gcaggcctgg ctgggctgag aaatgctgac 99241 taattatatg tcggtctgct ctaaaacccc ctaatggcct gagaattgcc cacttcatta 99301 actaggatga acagtcccag gatgtctcct tctcccaact ctgactccta aaaggacact 99361 tctgatccaa cctatggctt tgtcctctgc tctatctgtg gacatggaca ggaaagttcc 99421 ccagctgagg tctaactttc cctcttactg ctaaagattg gtagattgat ttttttaaaa 99481 agcaacaaaa ctaaagccac agccgttttc catggaggtg ggctttatta ggtgactgtt 99541 gaggcaaggg aggttctagg gctggtggac tgatgggggg caagggcttc tccttgcttt 99601 tgaatttagt gcatgttgcc tagaggttag atgtgtgaga atagctgcag aagtgagagg 99661 agaggaaaag agaaggtcat agaatggaca ttttccttgg gccagaacca ggatatgggg 99721 actgggggtg gagagggagg gtactcttca cataggagac tccaccaagg tcaccatttg 99781 ataccagctt ccctaacgcc cacctgcccc atcccagttc atgccccaga tgccagccct 99841 gttagctccc tcaacatcca ctggagaaga ggagggagga ggagatgaga aacgatgata 99901 cccttctgcc ctccagctcc cctgcccaga attgctcgct ctgcctgccc taacttccca 99961 gccaggatca ggaggtcagg aagcctgggc gcaggcaggg gaacaattgt gtccctcacc 100021 acccctcttc acactctgcc ctctcctagc ccctcacatg aggttgcagc ttttggctcg 100081 atccttgtgt atctcgccct cattcccccg gtctggccgt cctgacagct gagcggatcg 100141 ctgcattccc cggcgtgagt cagttcggcg cagctgccta tggccacggc caagggaggc 100201 cactgtagcc acatggaaga catccctgac gctgcgctca gaggaccggg aggagcactc 100261 aacataggac acagccccca cctgcttggc cagcacagtg ccctggaaaa ggaaggagtg 100321 catgggagaa agttataggt cttagcctgc agcttagaaa ggggcatgga ggctgtgggc 100381 tggggttgaa tgtcaaggct gggggcagtc aagggatcag gtcagaggtc acagaccaga 100441 acaaaggatg agccaggtaa ggtgttttca aatcagagac tgaaggggca gaggtgacag 100501 gtctaggctg ggatgaggtc agacgtcaag ggtcccacct gctcatgtgt aacagggata 100561 agcctctgct tggacagctc cctcagtgtg gccaggtcag tccgcatgtc cagtttacag 100621 ccaaccagca caaccttggc attggggcag aactcttgag tctctccttg ccactatgta 100681 ggagaagaaa agagaattgt ggaggggtgg agggaggcac agcgtgagcc tttgtgccat 100741 gccagcagtg tgccctctct tcctcaagca cgcagaaacc atgtatccct caggactctc 100801 ccacatgagg aaggaggtga gaatgctagg cttggtttga gagcaggatg gggtgagaga 100861 gaagcagcgg ggaggagagg gctggcgtgg gcctgtggac tcttctccct cagtggctat 100921 gaagggtctg ccagcctgga aacttcccat tcccactcgc ctttcccata ctctctcctg 100981 ggaacaaatt tatcccaccc tcccctcccc cacctgggaa ctggggctgc cctagctgac 101041 ttctgtccat tcacccctgg cttttgtcat ggaaactggg gcctggagag gaggccacag 101101 ggctttcctg agctccagga tcaggcttcc catcccatcc cctccatctg ggactcccac 101161 tgccaaggag ttggaagagc agagaaggaa tcctgtgagt gcttctccct tcactccccc 101221 acctccccta gctgctgtat ccagctctga ggactttcag gaaaggggtg gctgggaggg 101281 atgggggctg gaggcagggg tggcagggat gagagcttca ctcgctaacc agctgatggc 101341 cataccccag ccatcactcc ctcctcctgg cccatgttat gtcaggacca tggtggtctg 101401 gtccccctca gtctagctgc cctatttccc caggctccca ccttcttgag aacactgtcc 101461 agtgtttctg gtcggctaat gtcgaagcag atgagcacag catcagaatc aggataggcc 101521 agaggccgga cattatcata gtaagaggaa cctgaaggaa gacagggcat gaggggcctg 101581 agtggatggg aggcctggaa gggaatggga gaccttacag atcctggtca agggatgctg 101641 ggagcagaga atggaaaaga gcaaccatta atacccatca ttaataccca tcatggagtc 101701 tgggtttccc aagtggggga gcgggcagaa gggaggtcac atggtgggaa acggctctct 101761 gcttccctag tggtttaaaa agtaataatt tttattcttg actgaggttt ggaccccttt 101821 gagaagctca taaaatctat agacattctc ataaaatgtt gcatgtgctt tctgggcttt 101881 ggcagtctaa agccctgaag ggaactcttg taaaaataaa aaccctcctt atgttgggag 101941 gaggaaaaga gcagtggcag ggagagcact ctcccattct cctgggactt gacctcagat 102001 ttatctagtt caatggagag aaacagcctc ctgaccaccc ctcactcaag acattctaaa 102061 tgtcttttct gcttttttat tttttgagat ggagtctcgc tctatcgctc aggctggagt 102121 gcagcggcat gatctcggct cactgcagcc tccgcctccc aggttccagc gattctcttg 102181 cctcagcctc ctggtagctg agattacagg cacgtgccac acgcccgatt aatttttgta 102241 tttttagtag agacggagtt tcaaccatgt tggccaggct ggtctcaaac tcctgacctc 102301 aggtgatccc cccaccccag cctcccaaag tgctaggatt acaggcgtga gccaccgtgc 102361 ctggcctcta aatgtctttt ctaaacccag tcttatctct gaaggaacat gttccaaatt 102421 aaaagccatt ccttcccaac tttcccaggt aggcaggacc cagctgaggg gctcagatcc 102481 taggctttct ccttcaggac atggtttctg tcaggctctg caagttctac ctcagtttcc 102541 ctggatttta gcagaatatt gatttttccc ttcctgttgg aattggatgg atgggctggg 102601 cttacctgga acctaagggt ctaaccaagg gaggggacag agtgggccgc cttggaagtc 102661 agggtgaccc ccagggactt ggctacctga agtgtcccac atgttgagct caatgcggcg 102721 cttgtcgatc tcaaagctcg cagtgtagtt ctcaaacacg gtggggacat aactctgcag 102781 acacggatgg atccagatga gatcatctct ccaagtcctc accccagata aatgccctcc 102841 tcactcccac accccgtcca gcccagcccc tggacacaag ttggtctggg attgggcaaa 102901 ggggagatcc cgggagctcc tgcccccccc cccccccccc cggagaatct ctgctaagct 102961 ccaagacctc ctcctgttcc cttctggtcc cttcagcccc ctagttcttc ccctctccat 103021 tgcccaggac ctccgacatt cccctcgccc tccctcccgg atgctctggt tctcccaaac 103081 cccatctatc tcaccctgct ccctaattcc tcccacccca atcctctgca gtccttcctg 103141 agccctcaag gcctccccgc gtcctccgac cccctgtctc ttgtccctgg ttacccgagt 103201 acccattccc atccgtcgcc agggcccctg tcacccaccc cccagcagcc ttagcgtccc 103261 cctcccaaga cgcaggtccc tcaccccggg ataggcgtcc ttggcgaaca cctgcagcag 103321 cgccgtcttg ccgcactctg cgtctcccac caccacgatc ttgcagcggc cgctctgccc 103381 ctccatggtc ccggctacga gccggtcccc cacgccaccc ctctcccggt cccctccttc 103441 gcctctcccg cccctgcaac tgcagccgct cgggccgcca acaccggcat ctcccgggcc 103501 cgcgccgagc ccccgccccg ggtccgcggc cccccctccg ccccgccccc agccagcgcc 103561 gcgccccccg gccctcctcc cacccccgca gccggggtcg gggccagaag atctggcgga 103621 gccctgggaa cagaggcctc agagccgggg tccagcccgc cggtgtggtc tgaggggccc 103681 ctgccggttt gggacaggcc gaactgggct tatttgactt tctcggatat aagggcaggt 103741 cagagttcaa gcgaagttct taggggtaga atatgagcgg cacaagcgcg aagctcgggc 103801 ctgctgtgta cccacgcgtg cacgcaggtg taccagtgcg gacaagagct ggggcagcca 103861 tccacttcct gaacacggcg ggagagagat gctaagggga aggagggagc ctctttggtt 103921 ttctctcccg cgtccgccta tgtcctggca gggggtcttg gggaaatggg agggtgaacc 103981 ccagcacacc cacccggacg gtggtgacat catagctttc cgtccccatg gcaacgggca 104041 gccgggtctc cggttacatt gacttaaccg ccggcctaga ctagcagaga agcgtggact 104101 gagttcctcc agccaagcac tgggtggaaa gttttggggg agctgcgtcc tctggtggat 104161 gcttggggac aaggagataa ggaagagaaa gaaccaaccg ccagagttgc tctgctggag 104221 ccagagctaa acccaaaagt caggcttgat taagagtctg acaataggcc gggcgcggtg 104281 gctcacgcct gtaataccag cactttggga ggccgaggcg ggcggatcaa gaggtcagga 104341 gatcgagact atcatcctgg ctaacacggt gaaacccgca cggtgggcgc ctgtaatccc 104401 agctactcag gaggctgagg caggagaatg gcgtgaaccc gggaggcgga gcttgcagtg 104461 agccgagatt gcctcactgc actccagctt gtgcaataga gtttcgaaaa aaaaaaagtg 104521 cttttttata tcgaggcaat tcgagtcaat aatatatgct gcaaataatt ctgtaaagat 104581 aactagaagc tgggcgcggt ggctcacgcc tgtaaactca gcactttggg aggccaaggc 104641 ttgcttgcgt gcaggagttt gaggccatcc tgggcaacat tagcgagacc ctctctctag 104701 aaaaaaaaat caaaacttag ctaggtttgg ccactctagg cacgctgcct atggggtagc 104761 cctgctctgc aaagagcagt aaaacataaa gttagccggg cgtggtgaca catgcctgtg 104821 gtcccagcta ttcaggaggc tgaggtggga ggattgcttg aagccgggag tttgaggctg 104881 tagggagctg tgatcgcccc acctcgctca gcctgggtga cagagtgaga ccctgtatca 104941 aaaaaataaa aatataaata taactagagc acgcagcatc atcactatgt tacagaaggg 105001 aaaatgagga acagaacgtt aacaccaaag tcagaaagtt ttaaaggctt ggtctccatg 105061 cttctacttt gccactgcaa gaccacagtg aattaagtct catccctgcc tgggttagat 105121 gtcagagcct gagacacaat gtagttggac tccagtccac aggtggctga ctccaaatct 105181 gatatgagtt aactccaaat ctgatgtaag ttcaagtttt gggactgttc cttaactttt 105241 tttttttttt tttgagacgg agtctcgctc tgtagcccag gctggagtgc agtggcgcga 105301 tctcgactca ctgcaagctc tgcctcctgg gttcatgcca ttctcctgcc tcagcctccc 105361 aggtagctgg gactacaggc gcctgccacc acgcctggct aattttttgt attttttagt 105421 agagacgggg tttcaccgtg ttagccaggt gaatctcctg acctcgtaat ctgcccgcct 105481 cggcctccca aagtgctggg attacaggcg tgagacaccg cgcccggcct tttttttttt 105541 cgagatggag tctccctctg tagcccaggc tggagtgcag cggcatgatc ttggctaact 105601 gcaacctccg cctcctgggt tcaagcaatt ctccagcctc cgcctcccta gtagctggga 105661 ctataggcac ctgccaccat gcctggctaa tttttgtagt tttagtagag ctggggtttc 105721 accatactgg tcaggctggt ctcgaattcc tgacctcagg tgatccaccc acccgcctcg 105781 gcctcccaaa gtgttgggat tacaggcgtg agcacggcgc ccggcccttt tttttttttt 105841 ttttaacagg gtctcactct gttgcccaga ctggagtgta gtggcgcgat ctcggctcac 105901 ctccgcctcc caggctcaag cgattcctct gcctcagcct cccaagtagc tgagattaca 105961 ggcgcgcgcc actaccgccc ggctaacttt tttattttta gtaaagacgg ggtttcacca 106021 tgttggccag gctggtcttg aactcctgac ctcaaatgat ccaccagcct cggcctccca 106081 aagtgctggg attacaggcg tgagccaccg cgccaggcct atcccttaaa atagttttta 106141 atttgaataa ggtttactat gaataaataa atcacagtcg gcttgatccc aagagcacag 106201 acgttcctgg tgcccctttt cgtgctctcc cagcttgcgc cactatggcc ctggcccttt 106261 aaggctgagc gcgaggcccc gcctcgcccg gcgccccgcc cctcccgctg gatcccgcag 106321 ccgcggctct tcccgacgcg ttccgacttc cccagctgtg cactctccat ccagctgtgc 106381 gctctcgtcg ggagtcccag ccatgtccga cgagagagag gtagccgagg cagcgaccgg 106441 ggaagacgcc tcttcgccgc ctccgaaaac cgaggcagcg agcgaccccc agcatcccgc 106501 ggcctccgaa ggggccgccg ccgccgccgc ctcgccgcca ctgctgcgct gcctagtgct 106561 caccggcttt ggaggctacg acaaggtgaa gctgcagagc cggccggcag cggccccggc 106621 ccctgggccc ggccagctga cgctgcgtct gcgggcctgc gggctcaact tcgcagacct 106681 catggctagg caggggctgt acgaccgtct cccgcctctg cctgtcactc cgggcatgga 106741 gggcgcgggt gttgtgatcg cagtgggcga gggagtcagc gaccgcaagg tgagcgggtt 106801 gcgtagggca gggcagggct gcgcaggcca ctgggcagtg gggcacgagt gggcgagcgc 106861 cgggggtgtg gcagggcggg agaaactggc gcggacctgg gtgcacgagc gtggaaagcg 106921 tagccaagga acttgtgttt gggggctcct ggagagcggc atttatgtgg ggaggggaga 106981 cgaaattatc gccccttccc caaccatttt aagttgtggc cgccgcccag aagctgtgct 107041 ggtggggggg aaaacaataa ggtgcccatg cgcatgcgca caaccacact accgtcccca 107101 cccacccccc ccccccccat taaaaccaca cctgtacccc tacccaccaa acactctctg 107161 ggtaattgtg gtctgtgact atgagtgacg gttagtgccc cctttccccg agggagcttg 107221 aggggctatg tcgtcggggt tgggcggggg cacagcggcc gtgccagagt cctggtcaca 107281 tgcagccccg tggtctgtgg gggtgtgagg cggcccctcc caaagcaagg ccaaagagac 107341 gagacacgcc catcacggag gagagagagc ctttgctacc ccaccgccac cagccttaca 107401 ccgccgatct gattttgggg tgggggaggc gggattgggt catccgatct ttgtcttggg 107461 ctctgtgtct cccgtgactg cagtatctcc tcctcctgtg actcagccct cagccttcgg 107521 gccacgaccc ggggctgccc ttgggaatgc ctggggcggg gagtggaagg ggggacccac 107581 ctctgccttc ctcctgcaga ggacccccac ttcagaaacc ccagtgccag gggtttggac 107641 tggaacggag aggtgcggcg ccttgaactg gttggccaag tctgcaggcc tgtttctcct 107701 tctcatttat cattaatctt ggccacaacc ctggacacca gagagctcaa aatgatcagc 107761 tttttgagag acctgggatg aggcctcagc acgccatttg tttagaggtt tctttttttt 107821 ttctttttct tttttttttt tttttttttt agacggagtt tcgctcttgt tgcccaggct 107881 ggagtgcaat ggcgcgatct cggctcaccg caacctctgc ctcccgggtt caagcgattc 107941 tcctgcctca gcctctcgag tagctgggat tacaggcatg cgccaccatg cctggctaat 108001 tttcgtattt tttagtagag acaaggtttc accattttgg gcaggctggt ctcgaactcc 108061 cgacctcagg tgatctgccc acctcggcct ccctaagtgc tggggttaca gacataagcc 108121 actgcgcttg gccaggagtt tcctttttaa atcagacccc tcaatgagag gccccacaga 108181 tgcagcctct tgcagacctg ccagcccaat tctggagcca ggtttgttgg attcatcctg 108241 tatgcaaaca gcttctcctt aaggctttcc tctgaattca gctctggccc caccctcaaa 108301 ctgacttcta aatgatccca ctcttgagca ggcgtctaag aggaatattt tcgggaggta 108361 gttgtagttc atgttactgc tgaaggccac ccacctcacc tcccctccat acactttccg 108421 cctggtaaat acaggatatc ctgtccaggg caagaatctg atgtaagagc ctggattctg 108481 cggggagggc ccttccctct ctctccctcc tcctccctcc tggtgtctgg gttggggagg 108541 ggtcatggcc ctgatttgga tggcctgagg gttagcatga gccagggtaa gtgagacttg 108601 ttctgggtca aatctgggac tggccatgac cctaaatgac caatgcactc ctcgcagctc 108661 tcctgggttg ttctgtatct gctagtcctg agtccctggg tggagggctt ccgttcttgt 108721 tctccagacc tcatctcagg ccagaacttg gaaggaaaga ccccagcatg ccctcagttc 108781 tcgtattcag tggagtgtgg gggcttgagg acatgaaaaa gggcgtaagt ggcagtccca 108841 tcccccttcc ccatggaccc taactcttgt taatatacag aattcccatc attcctggca 108901 gggatcaaga cagacccaga ttgtcccaga acagcaccca cacctccctc ttcatgctct 108961 tcagagagcg cagagaagtc ttttctcctg acgctccctc cttttccctg cccttccctt 109021 gaccccactt gctaagctgg agagaaaggt tctgttatct ttgtcccctt tccctcctgc 109081 acagaggctc tcgtgggggt gggggggaag ccttttactg ctgcgtaggc ctctgtagcc 109141 cttcttgtct gttgcccctc ctgcaccatc tctgagtgaa gatgttttct gggctcccag 109201 tgctggctca aacacacttc tcccgaggtg accacaccct gctgtaagcg ctcagagaac 109261 tttgtctgca ctttggtatg gccctgacca cacagtgccg tcttcttttg gttgtgtgac 109321 ttccttgctg ctattatatt attattatta ttattgttag ctaacattat taagtgctta 109381 ttatgtgcca gacattgtgc taaaattttt tttttccatt tggaaaaact gccctaattg 109441 acagataaga aaactggagc tggaaaagtg gagctcaaaa agattaagta atttttatgc 109501 atccgaagtc acacagtcag taaacagttg agaccgcttt tgtaaccaca gcagtttgat 109561 cttttccaca atacttcatg ctgccctaac tgtaagcttc ttgcattcag ggatcttaca 109621 caatagtgac atgtaagatc tgtaagaaga tgataaggat agtatctgcc tcaaagaact 109681 gatgtaaggg ttaattaagc attatatata aagcacttga aatcgggctc ggccctcagt 109741 cagcactcag taaaggtgag ttgttattgt tgttgatgat gatgttatta ttattattac 109801 catgactatt gctgcttctc ctgctacttc atttggaaga aggtggcctt gtacctaggg 109861 ggaaatcagt aaatagcctg tgggtgaatg aatgagtgac tgaatgatac tcttctgttc 109921 ttcaaggcag gagaccgggt gatggtgttg aaccggtcag ggatgtggca ggaagaggtg 109981 actgtgccct cggtccagac cttcctgatt cctgaggcca tgacctttga ggaagctgct 110041 gccttgctcg tcaattacat tacagcctac atggtcctct ttgacttcgg caacctacag 110101 cctggccaca gcgtcttggt acacatggct gcaggtgaca ggtcccctca ctttatcacc 110161 ccttacccca cccagatttc cttccaggcc ccttccctgc agcctgtctg ggttgttgtc 110221 atggcaacac caggctgcct tggcctgtgg ctcccagagg cctctgctgt gtagttgccg 110281 tggtaacatt caggcaccag gtctagtctg gtgtgctatc cttagcaacg tgccctcacc 110341 ccacaccccc acctctctag ctaccttccc caccacttct cagtcatgga aattagacac 110401 ggccctaaaa tgagcgtagg caaaatgaag gtgacaggct gagtccctgg gaggctagaa 110461 tggagtggtt ggtggccagg gcaactccat atcccctgct tatgggtgtc ttgctgcagg 110521 gggtgtgggt atggctgccg tgcagctgtg ccgtacagtg gagaatgtga cagtgttcgg 110581 aacggcctcg gccagcaagc acgaggcact gaaggagaat ggggtcacac atcccatcga 110641 ctatcacacg actgactacg tggatgagat caagaagatt tcccctaaag gtggggggca 110701 taatatggga gggggtaggg aggcacagga cagggagggg agctccagat ctgtggatcc 110761 taatgttgtt cttgggttcc cctactctat gacaggagtg gacattgtca tggaccctct 110821 gggtgggtca gatactgcca agggctacaa cctcctgaaa cccatgggca aagtcgtcac 110881 ctatggtgag ttagtgggcc agggatggag agagcatgtg agggcaggag ggagggtcta 110941 aggggtggga tatagaggcc agggcttttg aatgaagaag gggtagggac tcaggtgctc 111001 tgtagacgat cagggttagg aatggtcctg tatgctgcat tcagattgct gactcttggg 111061 taccagctct tttcattctc tgtcacaact tttcatatga gtgatagtaa attctacatt 111121 cttttttttg gttttttttt tgagatggag tttcgctctt gtcgcccagg ctggagtgca 111181 atggcatgat ctcggtcagt gcaacctctg cctcctgggt tcaagcgatt ctcctgcctc 111241 accctcccga gtagctggaa ttacaggtgt ctgccaccac gcccaactaa tttttgtatt 111301 tttagtagag gcggtgtttc accatgttgg ccaggctggt cttgaactcc tgatctcagg 111361 tgatccagcc gcctcagcct cccaaagtgc tgggattata gccgtgagcc accacgcctg 111421 gccaaattct acattcttgt ttggggatta ttcttgaaca accagcctgc cttctttctg 111481 tcctacctcc ctgagcatct taggcagggt gcattttcat ttaaaaaagt atttcataca 111541 acaaaataag ccaggcatgg tggctcatgc ctgtaatccc agcactttgg aaggccaggg 111601 caggcagatg gcttgagcct agaaattcga gaccagcctg gtcaacatgg tgtaccttat 111661 ctccacaaaa aatacaaaaa ttagccaggt gtggtggcac tgtactccag cctgggcaac 111721 agagcgagac cctgactcaa acaaatgaac aaacaagcaa atacataaag attagagcaa 111781 tttttttttt tttgagacag aatatcactc tgttacccat cctggagtgc agtggcgcaa 111841 cctcggctct ctgcagcatc cacctccctg gttcaagcag ttctgcctca ggctcctgag 111901 tagctggaat tacaggtgcc catctccacg cctggctttt ttttttttct ttttcttgag 111961 acagagtctg gctgtcaccc aggctggagt gcagtgacac gatctcggct cactgcaacc 112021 tccacctccc ggattcaagc aattcttctg cctcagcctc ccaagtagct gggactgcgg 112081 gcgcacgcca ccatacctgg ctaatttttg tattgttgtg ggtttttttg tttgttttgt 112141 tttgtttttt ttttttgaga cggagtttcg ctctttttgc ccaggctgaa gtgcagtggc 112201 gcgatctcgg ctcactgcaa cctccgcctc ccgggttcaa gcgattctcc tgcctcagac 112261 ttcctgagta gctgggatta caggcatgta ccaccatgcc cagctaattt tgtattttca 112321 gtagagacgg ggtttctcca tgttggtcac gctggtctcg aactcctgac gtccagtgat 112381 ccgcccacct cggcctccca aagtgctggg attataggcg tgagccacca tacccggcca 112441 acacctggct aattttcgta ttttttagta gagacaaggt ttcaccattt tgggcaggct 112501 ggtctcgaac tcctgacctc aagtgatccc cccaccttgg cttcccagag tgctgggatt 112561 atggatgtga gccatagcac ccagccccta gagcaattta aagtcagcca gggttggttt 112621 gggcctggta accagcagtt tgagaattat ccaatcactc cctggcactg gtgtggagaa 112681 ttgcaagggg atcacaggga acagagagca cagatgcaga cacacaggga tgtgcctggg 112741 ggggttcaca ggctatgtca ccttgtcctt caggaatggc caacctgctg acgggcccca 112801 aacggaacct gatggccctg gcccggacat ggtggaatca gttcagcgtg acagctctgc 112861 agctgctgca ggccaaccgg gctgtgtgtg gcttccacct gggctacctg gatggtgagg 112921 tggagctggt cagtggtgtg gtggcccgcc tcctggctct gtacaaccag ggccacatca 112981 agccccacat tgactcagtc tggcccttcg agaaggtgaa tgtgaggact ttgcagggag 113041 ggcttgggta ggactcatga aggctggggt cccaaggggc agattcctgg ggaagaggag 113101 ggctgcctgc atcacactgg cccctgttgg atgagggttg gatagcactg ggagccgcat 113161 ctttccttcc tccccaggtg gctgatgcca tgaaacagat gcaggagaag aagaatgtgg 113221 gcaaggtcct cctggttcca gggccagaga agcagaacta gggcaagtgg ctgtgagacc 113281 ctagagacca gcgaagggag aagttgggaa gctacgttct gttggccacc agacttgcat 113341 ttcagcctct gtcataatgc tctgccctcc ctcccccgaa gttctctgtg gtgatgaccg 113401 ctctcccctg cccctccccg cttcctgacc tctgaagagg ttgggaagtg accatttgga 113461 tgtctgggcc ctgccaaggc gacagggagg gtcagaggga ggccggctgc ttcctgcccc 113521 caccctttcc ccgggcctgc tgtgctgctt ttgtgccaag gttagccagt cccccctgtt 113581 gtgttccatg tgctttcacc tctgcctcat ctttcctccc gtccctgccc cgccacctcc 113641 ccaaagaatt gaaacgtcag ctcaggatat ggggccaatc tctgtgagtc cagcatgtac 113701 ctgtctctcc ctagtgtccc ttcagcctgg gctgaccagt gcccgcctct gggcttgacc 113761 agttcccaat ctcgtcctct gtccccaact tcttaagcac aattgggctt cttccatctc 113821 caggttttct gccattctta accaaggctg cctcttccaa cagggcggga atcagaccta 113881 ctcccctagg tcacaactct gggaaggata cagagccccc acccttcact gagttctctg 113941 gatttgttct cagtgcctta gcaacgaaaa cctgtgcttg tgtgtgtgtg gcggcgggga 114001 gggaggatcc tgtttcccac ctccttctcc tcccctgtac tccccagtgc cttccttgtt 114061 ctggtggagc tggggtttct ctcctcccca gtcccacaac actgccaaaa atctgtgtat 114121 gtgccattgg gtggggcagc cccaagcctc ctggggaggc agggcaaaaa caggtgccct 114181 catcgtggtc tgtgccatgt cccgtctcta tggtggttga ggagaaaggc ggggaagctt 114241 cctcagcctt gcagatatgt gtggcattta ctagccagag ctctgaaagg cagtgctgtc 114301 tgtttcttgt actgggacca aagtaaaaat ccaagcacat tccccttgca gttaggggag 114361 gccctactgc cttctcaaag cagagaggca gcttatcaaa ctcagcccaa aactctgttt 114421 acatgggtgg ggagatggag cagggaagta cagagtggga tggtcaggac ctgggccatt 114481 gcaaccaaaa tggggacttc ctgggtaggg aggtcactcc ctctactcac tgagctagga 114541 ttagggaggg ttattgcccc aaccattgca atgggaggtg gagggacagg ctcagcctcc 114601 tcattgtcta aatgaggcct aaatgtgtga agtgcgattt ctgcttttgt gtaccccacc 114661 accccattac cacagctgcc tttgtgtgtt tgtgtcaata aaaagccaaa ccctgggtcc 114721 tgcttgttgc ctctgagagt ggagggaagg tgagctcctg gaaggctagt gctgccagca 114781 gaagatctgg gctgcttcct gccccctgcc tctttccatg cccaaatcac gtttcctttc 114841 atgagtgaaa tgaggaagaa catcatggca tgcaggctct ttttactgtt ctgggcagtg 114901 ttttgcaatg tgtgacccct ccgcactgtt ggtgaacata cagacctcct atatgggcac 114961 ccaagcccag gccagtgtga gaaccttggc gggggggtgg ggaggatgag aaggggaggc 115021 ccctagcctg actcagaggt gaagactgct aggccctgct gtccttgggg tacgactgtc 115081 agggcctcta cctccccgcc cccgcgggtg ggcttctgga agtggatctc caggacgtca 115141 tgcagctccg ggccatccaa gatatcagga atgttgagca ccagtaccga gcggggaact 115201 ggctgcgacc tgatctggaa aaggagccag gagatggcaa aggctcatgg gagaagggct 115261 gggggctggg ggtagggcag gcagtgcccc atgggggtat gaggtcggga ggcctggaga 115321 ggctgatggt gggtaggtag gtcagtgggg tttggggcct ctgatccttc attgggcaag 115381 ttcctggcag gcagacaggt tacccagccc agcccctgtt cttcacccct cctgcttacc 115441 tcagccttct ggatctcccc attcacatac ggagagactc tcagagggac ttgctgccca 115501 cccagtggca ctgtgaactg gccgatttgg cacagacgct gagccactgt ggggagcaag 115561 gaggggtggg cctttagcat caagccctct cctctccctg caccataatc ttgcacaccc 115621 tataccctct cccctgcagg aggcctgcat agccctcacc tccatcccta gcaaacccca 115681 gcatgacact ccctggcagt agctcccgaa cgtccacatc gccacctccg ttcctagtct 115741 tgccaaagaa gatctctagc ttgtccagca gctcctcctc actcagcctg aggctggcag 115801 gaaatccagt gaccaacacc ctccggccac tcaactggct ggacacctag ggggagacag 115861 caaggggtgg tcccagacac agctccactt ccccatgccc aggtgcatgg catgctttgc 115921 atgccccagg attctgtcat accatcacct ggatggtggt gaccatgggc agctccaagg 115981 gctggacctg cacccgcagc cggcactcct ccatgttgat cgtgtgctcc ttttgttgca 116041 gcacctgctc agccactggg tggtgggaaa cagggtcagt actgggaacc ctccccacct 116101 ccctcctgag gctccccatg agcttacctt tggggtcatc aaaggtgatc agagcagagc 116161 ccgcaagcag agggcagtgg atccgcaaat tggaaactaa agacttaggc acttccgggt 116221 cctgctgggt gtgtcctcgg aataccaggg ggatcttggg cactgaaaat gggacctgga 116281 agtaggggag gggagtagga gtgttcctca ctcccaccta ggaggaaggc atctccgatt 116341 ccattccatt gatgctgagt gccggagata aatacattta ctgaaagaaa agctggataa 116401 atgagtggat gaatgcaggt ctggtccata cagggcaatt tacatacaac atcttaattt 116461 gtcctgggag gtgggtagct ctactttaca gatgttcaag taacataatg gaggtcacac 116521 agcaaggaag tagcagagcc aggaataaaa ccgagaactc cccagctctt ttttgtgttt 116581 ctttgtttta aagggagtct tgctctgtca cccagctgga gtgtagtggc atgatcttgg 116641 ctcactgcaa cctccacttc ccaggttcaa gcaattcgga ctcagcctcc caagtagctg 116701 ggattacagg catgtgccac caaacccagc taatttttgt atttttagta gagacggggt 116761 ttcaccctgt tggccaggct ggtctcaaac tgacctcagg tgatccacca gcctcggcct 116821 cccaaagtgc tgggattaca ggcgtgagcc actgcgcccg gcctaatttt tgtattttta 116881 gtagagatgg gatttcactg tgttggccag gctggtctca aactcctgac ctgaggtgat 116941 caggccccct tggcctacca aagtgctggg tttacaggtg tgagccaccg cacccggcct 117001 cgccagctcg tttttaccaa acaacaccag atcctttagg gggggttatt gcgggcggat 117061 ttttgaaacg gagggctggt gataggcttt ttataggtat gcatgtatat ggttctagcg 117121 tagtggattt tgggaattgc ggt // LOCUS HUMBSAP 3277 bp mRNA PRI 31-DEC-1994 DEFINITION Human B-cell specific transcription factor (BSAP) mRNA, complete cds. ACCESSION M96944 NID g179555 KEYWORDS B-cell specific transcription factor; transcription factor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3277) AUTHORS Adams,B., Dorfler,P., Aguzzi,A., Kozmik,Z., Urbanek,P., Maurer-Fogy,I. and Busslinger,M. TITLE Pax-5 encodes the transcription factor BSAP and is expressed in B lymphocytes, the developing CNS, and adult testis JOURNAL Genes Dev. 6 (9), 1589-1607 (1992) MEDLINE 92387536 FEATURES Location/Qualifiers source 1..3277 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="BJA" /cell_type="B-lymphocyte" gene 76..1251 /gene="BSAP" CDS 76..1251 /gene="BSAP" /standard_name="transcription factor BSAP" /codon_start=1 /product="transcription factor" /db_xref="PID:g179556" /translation="MDLEKNYPTPRTSRTGHGGVNQLGGVFVNGRPLPDVVRQRIVEL AHQGVRPCDISRQLRVSHGCVSKILGRYYETGSIKPGVIGGSKPKVATPKVVEKIAEY KRQNPTMFAWEIRDRLLAERVCDNDTVPSVSSINRIIRTKVQQPPNQPVPASSHSIVS TGSVTQVSSVSTDSAGSSYSISGILGITSPSADTNKRKRDEGIQESPVPNGHSLPGRD FLRKQMRGDLFTQQQLEVLDRVFERQHYSDIFTTTEPIKPEQTTEYSAMASLAGGLDD MKANLASPTPADIGSSVPGPQSYPIVTGRDLASTTLPGYPPHVPPAGQGSYSAPTLTG MVPGSEFSGSPYSHPQYSSYNDSWRFPNPGLLGSPYYYSAAARGAAPPAAATAYDRH" BASE COUNT 743 a 961 c 908 g 665 t ORIGIN 1 aaaaaaaaga aaaaaaaagg cacaaaaaag tggaaacttt tccctgtcca ttccatcaag 61 tcctgaaaaa tcaaaatgga tttagagaaa aattatccga ctcctcggac cagcaggaca 121 ggacatggag gagtgaatca gcttgggggg gtttttgtga atggacggcc actcccggat 181 gtagtccgcc agaggatagt ggaacttgct catcaaggtg tcaggccctg cgacatctcc 241 aggcagcttc gggtcagcca tggttgtgtc agcaaaattc ttggcaggta ttatgagaca 301 ggaagcatca agcctggggt aattggagga tccaaaccaa aggtcgccac acccaaagtg 361 gtggaaaaaa tcgctgaata taaacgccaa aatcccacca tgtttgcctg ggagatcagg 421 gaccggctgc tggcagagcg ggtgtgtgac aatgacaccg tgcctagcgt cagttccatc 481 aacaggatca tccggacaaa agtacagcag ccacccaacc aaccagtccc agcttccagt 541 cacagcatag tgtccactgg ctccgtgacg caggtgtcct cggtgagcac ggattcggcc 601 ggctcgtcgt actccatcag cggcatcctg ggcatcacgt cccccagcgc cgacaccaac 661 aagcgcaaga gagacgaagg tattcaggag tctccggtgc cgaacggcca ctcgcttccg 721 ggcagagact tcctccggaa gcagatgcgg ggagacttgt tcacacagca gcagctggag 781 gtgctggacc gcgtgtttga gaggcagcac tactcagaca tcttcaccac cacagagccc 841 atcaagcccg agcagaccac agagtattca gccatggcct cgctggctgg tgggctggac 901 gacatgaagg ccaatctggc cagccccacc cctgctgaca tcgggagcag tgtgccaggc 961 ccgcagtcct accccattgt gacaggccgt gacttggcga gcacgaccct ccccgggtac 1021 cctccacacg tcccccccgc tggacagggc agctactcag caccgacgct gacagggatg 1081 gtgcctggga gtgagttttc cgggagtccc tacagccacc ctcagtattc ctcgtacaac 1141 gactcctgga ggttccccaa cccggggctg cttggctccc cctactatta tagcgctgcc 1201 gcccgaggag ccgccccacc tgcagccgcc actgcctatg accgtcactg acccttggag 1261 ccaggcgggc accaaacact gatggcacct attgagggtg acagccaccc agccctcctg 1321 aagatagcca gagagcccat gagaccgtcc cccagcatcc cccacttgcc tgaagctccc 1381 ctcttcctct cttcctccag ggactctggg gccctttggt ggggccgttg gacttctgga 1441 tgcttgtcta tttctaaaag ccaatctatg agcttctccc gatggccact gggtctctgc 1501 aaaccaatag actgtcctgc aaataaccgc agccccagcc cagcctgcct gtcctccagc 1561 tgtctgacta tccatccatc ataaccaccc cagcctggga aggagagctt gcttttgttg 1621 cttcagcagc acccatgtaa ataccttctt gcttttctgt gggcctgaag gtccgactga 1681 gaagactgct ccacccatga tgcatctcgc actcttggtg catcaccgga catcttagac 1741 ctatggcaga gcatcctctc tgccctgggt gaccctggca ggtgcgctca gagctgtcct 1801 caagatggag gatgctgccc ttgggcccca gcctcctgct catccctcct tctttagtat 1861 ctttacgagg agtctcactg ggctggttgt gctgcaggct ccccctgagg cccctctcca 1921 agaggagcac actttgggga gatgtcctgg tttcctgcct ccatttctct gggaccgatg 1981 cagtatcagc agctcttttc cagatcaaag aactcaaaga aaactgtctg ggagattcct 2041 cagctacttt tccgaagcag aatgtcatcc gaggtattga ttacattgtg gactttgaat 2101 gtgagggctg gatgggacgc aggagatcat ctgatcccag ccaaggaggg gcctgaggct 2161 ctccctactc cctcagcccc tggaacggtg ttttctgagg catgcccagg ttcaggtcac 2221 ttcggacacc tgccatggac acttcaccca ccctccagga ccccagcaag tggattctgg 2281 gcaagcctgt tccggtgatg tagacaataa ttaacacaga ggactttccc ccacacccag 2341 atcacaaaca gcctacagcc agaacttctg agcatcctct cggggcagac cctccccgtc 2401 ctcgtggagc ttagcaggca gctgggcatg gaggtgctgg ggctggggca gatgcctaat 2461 ttcgcacaat gcatgcccac ctgttgatct aaggggccgc gatggtcagg gccacggcca 2521 agggccacgg gaacttggag agggagcttg gagaactcac tgtgggctag ggtggtcaga 2581 ggaagccagc agggaagatc tgggggacag aggaaggcct cctgagggag gggcaggaga 2641 gcagtgagga gctgctgtgt gacctgggag tgattttgac atgggggtgc caggtgccat 2701 catctcttta cctggggcct taattccttg catagtctct cttgtcaagt cagaacagcc 2761 aggtagagcc cttgtccaaa cctgggctga atgacagtga tgagaggggg cttggccttc 2821 ttaggtgaca atgtccccca tatctgtatg tcaccaggat ggcagagagc cagggcagag 2881 agagactgga cttgggatca gcaggccagg caggtcttgt cctggtcctg gccacatgtc 2941 tttgctgtgg gacctcagac aaaaccctgc acctctttga gccttggctg ccttggtgca 3001 gcagggtcat ctgtagggcc accccacagc tctttccttc ccctcctctc tccagggagc 3061 cggggctgtg agaggatcat ctggggcagg ccctccactt ccaagcaagc agatgggggt 3121 gggcacctga ggcccaataa tatttggacc aagtgggaaa caagaacact cggaggggcg 3181 ggaatcagaa gagcctggaa aaagacctag cccaacttcc cttgtgggaa actgaggccc 3241 agcttgggga aggccaggac catgcaggga gaaaaag // LOCUS HUMBST2 996 bp mRNA PRI 12-APR-1996 DEFINITION Human mRNA for BST-2, complete cds. ACCESSION D28137 NID g457563 KEYWORDS BST-2. SOURCE Homo sapiens synovial cell cDNA to mRNA, clone RS38. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 996) AUTHORS Ishikawa,J., Kaisho,T., Tomizawa,H., Lee,B.O., Kobune,Y., Inazawa,J., Oritani,K., Itoh,M., Ochi,T., Ishihara,K. and Hirano,T. TITLE Molecular cloning and chromosomal mapping of a bone marrow stromal cell surface gene, BST2, that may be involved in pre-B-cell growth JOURNAL Genomics 26 (3), 527-534 (1995) MEDLINE 95331788 REFERENCE 2 (bases 1 to 996) AUTHORS Hirano,T. TITLE Direct Submission JOURNAL Submitted (31-JAN-1994) to the DDBJ/EMBL/GenBank databases. Toshio Hirano, Osaka Univ. Med. Sch., Division of Molecular Oncology; 2-2, Yamadaoka, Suita, Osaka 565, Japan (Tel:06-879-3880, Fax:06-879-3889) COMMENT Submitted (31-Jan-1994) to DDBJ by: Toshio Hirano Division of Molecular Oncology Osaka University School of Medicine 2-2 Yamadaoka, Suita Osaka 565 Japan Phone: 06-879-3880 Fax: 06-879-3889. FEATURES Location/Qualifiers source 1..996 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="synovial cell" CDS 10..552 /codon_start=1 /product="BST-2" /db_xref="PID:d1006224" /db_xref="PID:g506861" /translation="MASTSYDYCRVPMEDGDKRCKLLLGIGILVLLIIVILGVPLIIF TIKANSEACRDGLRAVMECRNVTHLLQQELTEAQKGFQDVEAQAATCNHTVMALMASL DAEKAQGQKKVEELEGEITTLNHKLQDASAEVERLRRENQVLSVRIADKKYYPSSQDS SSAAAPQLLIVLLGLSALLQ" BASE COUNT 236 a 246 c 304 g 210 t ORIGIN 1 gtggaattca tggcatctac ttcgtatgac tattgcagag tgcccatgga agacggggat 61 aagcgctgta agcttctgct ggggatagga attctggtgc tcctgatcat cgtgattctg 121 ggggtgccct tgattatctt caccatcaag gccaacagcg aggcctgccg ggacggcctt 181 cgggcagtga tggagtgtcg caatgtcacc catctcctgc aacaagagct gaccgaggcc 241 cagaagggct ttcaggatgt ggaggcccag gccgccacct gcaaccacac tgtgatggcc 301 ctaatggctt ccctggatgc agagaaggcc caaggacaaa agaaagtgga ggagcttgag 361 ggagagatca ctacattaaa ccataagctt caggacgcgt ctgcagaggt ggagcgactg 421 agaagagaaa accaggtctt aagcgtgaga atcgcggaca agaagtacta ccccagctcc 481 caggactcca gctccgctgc ggcgccccag ctgctgattg tgctgctggg cctcagcgct 541 ctgctgcagt gagatcccag gaagctggca catcttggaa ggtccgtcct gctcggcttt 601 tcgcttgaac attcccttga tctcatcagt tctgagcggg tcatggggca acacggttag 661 cggggagagc acggggtagc cggagaaggg cctctggagc aggtctggag gggccatggg 721 gcagtcctgg gtgtggggac acagtcgggt tgacccaggg ctgtctccct ccagagcctc 781 cctccggaca atgagtcccc cctcttgtct cccaccctga gattgggcat ggggtgcggt 841 gtggggggca tgtgctgcct gttgttatgg gttttttttg cggggggggt tgcttttttc 901 tggggtcttt gagctccaaa aaataaacac ttcctttgag ggagagcaaa aaaaaaaaaa 961 aaaaaaaaaa aaaaaaaaaa aaagaattcc accaca // LOCUS HUMBTEB 4859 bp mRNA PRI 10-NOV-1995 DEFINITION Human mRNA for GC box bindig protein, complete cds. ACCESSION D31716 NID g505081 KEYWORDS GC box binding protein; zinc finger. SOURCE Homo sapiens germline cDNA to mRNA, clone_lib:placenta. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4859) AUTHORS Ohe,N., Yamasaki,Y., Sogawa,K., Inazawa,J., Ariyama,T., Oshimura,M. and Fujii-Kuriyama,Y. TITLE Chromosomal localization and cDNA sequence of human BTEB, a GC box binding protein JOURNAL Somat. Cell Mol. Genet. 19 (5), 499-503 (1993) MEDLINE 94120483 REFERENCE 2 (bases 1 to 4859) AUTHORS Fujii-Kuriyama,Y. TITLE Direct Submission JOURNAL Submitted (31-MAY-1994) to the DDBJ/EMBL/GenBank databases. Yoshiaki Fujii-Kuriyama, Faculty of Science, Tohoku University, Department of Chemistry; Aramaki Aoba-ku, Sendai, Miyagi 980-77, Japan (Tel:022-222-1800(ex.3380), Fax:022-262-6609) COMMENT Submitted (31-May-1994) to DDBJ by: Yoshiaki Fujii-Kuriyama Tohoku University Faculty of Science Department of Chemistry Aramaki Aoba-ku Sendai, Miyagi 980-77 Japan Phone: 022-222-1800 x3380 Fax: 022-262-6609. FEATURES Location/Qualifiers source 1..4859 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="placenta" /germline gene 1265..1999 /gene="BTEB" CDS 1265..1999 /gene="BTEB" /note="three-times repeated zinc finger motif" /codon_start=1 /product="GC box binding protein" /db_xref="PID:d1007095" /db_xref="PID:g1060891" /translation="MSAAAYMDFVAAQCLVSISNRAAVPEHGVAPDAERLRLPEREVT KEHGDPGDTWKDYCTLVTIAKSLLDLNKYRPIQTPSVCSDSLESPDEDMGSDSDVTTE SGSSPSHSPEERQDPGSAPSPLSLLHPGVAAKGKHASEKRHKCPYSGCGKVYGKSSHL KAHYRVHTGERPFPCTWPDCLKKFSRSDELTRHYRTHTGEKQFRCPLCEKRFMRSDHL TKHARRHTEFHPSMIKRSKKALANAL" BASE COUNT 1285 a 1111 c 1193 g 1270 t ORIGIN Chromosome 9, q13. 1 cacgttgggt gacataatgg ggttttttta attatagatt cacactgcat ttattcatca 61 cccctgtcct ctcatccata actcaaattt actaccagca acacaaaata caaagatgtg 121 tccagtttca ctacagctct tcgcgtttac aagtgtcgag cgcttgcttt cggaacgccc 181 ttgtgattgg ccgagccaat gccagtgaca tcaaccaact tacttttgat tggaaggctg 241 gttgctggga ctgtagcgtt tgcaggaagt cacttaactg tttgggagct ggaaaaccga 301 agctgaagtt ctcttttgcc ataggaacga gcgcaactga ctaggaaaga tgtgtcccaa 361 agctccgcaa gctggaacgt gagccaggag gcccggaccg gccacgggac cgcgaggcac 421 tccgaaagtg tgcggctgcc ccttccctgc ctcccagctg ttaccctttt aaatgtcagt 481 gttcgaggct gtaggggtag cacgaggcag cgaaacggaa cagtcggatt ggccgcacgc 541 ctcagttcta gacgcacctc tccaccgaag ccgttctgac tggcaggggg agaaagtaaa 601 cagagttgaa tcaccctccc cactggccaa ttggaggggg tttggtttgt gacgtgatgg 661 gattctgcga aattgttact gagcaagaga atgccggaac gtgcggaccg gccggagcag 721 gggttcagaa gccgtcagtg gactcgggaa aaagtgtctc ttagacctgg cgctcggcgg 781 ggccctcgcc acccgcgtcg gggtgatcgg gtgaatgtcc tggggctttg gctcgacggc 841 gaggcggccg agggcgtgca cctctcttgc agtttcctct cccagcgcct cgggggcgtt 901 ttcagtcgaa taaacttgcg accgccacgt gtggcatctt tccaagggag ccggctcaga 961 ggggccggcg cgcccgtcgg gggatcgcgg ccggcgcggg gcaggggcgg cggctagagg 1021 cggcggcgcg gcggagcccg gggccgtgga tgctgcgtgc ggaggcgctg ccggttacgt 1081 aaagatgagg ggctgaggtc gcctcggcgc tcctgcgagt cggaagcgcc ccgcgccccc 1141 gcccccttgg ccgccgcgcc gtgccgggcg ggcgggtcgt cgtccgaggc cagggagggc 1201 gagccgaacc tccgcagcca ccgccaagtt tgtccgcgcc gcctgggctg ccgtcgcccg 1261 caccatgtcc gcggccgcct acatggactt cgtggctgcc cagtgtctgg tttccatttc 1321 gaaccgcgct gcggtgccgg agcatggggt cgctccggac gccgagcggc tgcgactacc 1381 tgagcgcgag gtgaccaagg agcacggtga cccgggggac acctggaagg attactgcac 1441 actggtcacc atcgccaaga gcttgttgga cctgaacaag taccgaccca tccagacccc 1501 ctccgtgtgc agcgacagtc tggaaagtcc agatgaggat atgggatccg acagcgacgt 1561 gaccaccgaa tctgggtcga gtccttccca cagcccggag gagagacagg atcctggcag 1621 cgcgcccagc ccgctctccc tcctccatcc tggagtggct gcgaagggga aacacgcctc 1681 cgaaaagagg cacaagtgcc cctacagtgg ctgtgggaaa gtctatggaa aatcctccca 1741 tctcaaagcc cattacagag tgcatacagg tgaacggccc ttcccctgca cgtggccaga 1801 ctgccttaaa aagttctccc gctcagacga gctgacccgc cactaccgga cccacactgg 1861 ggaaaagcag ttccgctgtc cgctgtgtga gaagcgcttc atgaggagtg accacctcac 1921 aaagcacgcc cggcggcaca ccgagttcca ccccagcatg atcaagcgat cgaaaaaggc 1981 gctggccaac gctttgtgag gtgctgcccg tggaagccag ggagggatgg accccgaaag 2041 gacaaaagta ctcccaggaa acagacgcgt gaaaactgag ccccagaaga ggcacacttg 2101 acggcacagg aagtcactgc tctttggtca atattctgat tttcctctcc ctgcattgtt 2161 tttaaaaagc acattgtagc ctaagatcaa agtcaacaac actcggtccc cttgaagagg 2221 caactctctg aacccgtctc tgactgttgg agggaaggca aatgcttttg ggttttttgg 2281 tttttgtttt tgtttttttt tctcctttta tttttttgcg ggggagggta gggagtgggt 2341 gggggggagg gggtaaggcc aagactgggt agattttaaa gattcaacac tggtgtacat 2401 atgtccgctg ggtgagttga cctgtggcct cgcacagtga ttctaggccc tttatgcttg 2461 ctgtctctca gaattgtttt cttacctttt aatgtaatga cgagtgtgct tcagtttgtt 2521 tagcaaaacc actctcttga atcacgttaa cttttgagat taaaaaaaaa aacgccatag 2581 cacagctgtc tttatgcaag caagagcaca tctactccag catgatctgt catctaaaga 2641 cttgaaaaca aaaaacagtt acttatagtc aatgggtaag cagagtctga atttatacta 2701 atcaagacaa acctttgaaa ggttacacta agtacagaac ttttaaacct tgctttgtat 2761 gagttgtact ttttgaacat aagctgcact tttattttct aatgcagagg atgaataagt 2821 taaatacatg ctttgaggat agaagcagat gttctgtttg gcaccacgtt ataatctgct 2881 tattttacaa tatacacgtt tccctaagaa atcatgcgca gagatgtgag ggcagaatat 2941 acacaacaga tgctgaagga gaaggagggt agtgttttgc aaaagaaaaa gaaaagaacc 3001 aacagaattt taactctatt aacttttcca aattttccta tgcttttagt taacatcatt 3061 attgtatcct aatgccacta ggggagagag cttttgactc tgttgggttt tatttgaatg 3121 tgtgcataac agtaatgaga tctggaaaca cctatttttt ggggaaaaag gtttgttggt 3181 ctccttcctg tgttcctaca aaactcccac tctcaggtgc aagagttatg tagaaggaaa 3241 gggagctgaa ataggaacag aaaaatcaac ccctataact agtgaacacc aagggaaaat 3301 accacaatga tttcagagga gactctgcaa aatcgtccct tgtggagaat gcaggcaaca 3361 tggaatacta cgaatgaaat cacatcactg tatcttttac atcaatagcc tcaccactaa 3421 tatatcttgt atctaggtgt ctataatggc tgaaaccact acatccatct atgccattta 3481 cctgaaaact taactgtggc ctttatgagg ccagaaaagt gaactgagtt ttgtagttaa 3541 gacctcaaat gaggggagtc agcagtgatc atgggggaaa tgtttacatt ttttttttct 3601 tcagaagtaa cgctttctga tgattttatc tgatatttaa aacagggagc tatggtgcac 3661 tctagtttat acttgcgctc tgaaatgtgt aaacataggg tgcctaccta tttcacctga 3721 cccatactcg tttctgattc agaatcagtg tgggctcctg cagtgggcgc gggtcacggc 3781 tgactccaac ttccaataca acagccatca ctagcacagt gtttttttgt ttaaccaacg 3841 tagtgttatt agtagttcta taaagagaac tgcttttaac attagggact gggagcagtc 3901 catgggataa aaaggaaagt gttttctcac gagaaaacat gtcaggaaaa ataaagaaca 3961 ctttctacct ctgtttcaga tttttgaaac acttatttta aaccaaattt taatttctgt 4021 gtccaaaata agttttaagg acatctgttc ttccatacga aataggttag gctgcctatt 4081 tctcactgag ctcatggaat ggttctgctt atgatactct gcacgctgcc ttttagtgag 4141 tgaggagttt ggggttgcct agcacttgct aacttgtaaa aagtcatctt tccctcacag 4201 aaagaaacga aagaaagcaa agcaaagtca gtgaaagaca atctttatag tttcaggagt 4261 aaatctaaat gtggcttttg tcaagcactt agatggatat aaatgcagca acttgtttta 4321 aaaaaatgca catttacttc ccaaaaaagt tgttacttgc cttttcaagt gtgacaaact 4381 cacatttgat attctcttat atgttatagt aatgtaacgt ataaactcaa gcctttttat 4441 tctttgtgat taaatcctgt tttaaaatgt cacaaaacag gaaccagcat tctaattaga 4501 tttactatat caagatatgg ttcaaatagg actactagag ttcattgaac actaaaacta 4561 tgaaacaatt actttttata ttaaaaagac catggattta acttatgaaa atccaaatgc 4621 aggatagtaa tttttgttta cttttttaac caaactgaat ttttgaaaga ctattgcagg 4681 tgtttaaaaa gaaagaaaag ttgttttatc taatactgta agtagttgtc atattctgga 4741 aaatttaata gttttagagt taagatatct cctctctttg gttagggaag aagaaagccc 4801 ttcaccattg tggaatgatg ccctggcttt aaggtttagc tccacatcat gcttctctt // LOCUS HUMBTEB2 1301 bp mRNA PRI 04-JUL-1993 DEFINITION Human mRNA for GC-Box binding protein BTEB2, complete cds. ACCESSION D14520 NID g303596 KEYWORDS BTEB2; DNA-binding; GC-Box binding protein; zinc-finger. SOURCE Homo sapiens placenta cDNA to mRNA, clone_lib:lambda gt11. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1301) AUTHORS Sogawa,K., Imataka,H., Yamasaki,Y., Kusume,H., Abe,H. and Fujii-Kuriyama,Y. TITLE cDNA cloning and transcriptional properties of a novel GC box-binding protein, BTEB2 JOURNAL Nucleic Acids Res. 21 (7), 1527-1532 (1993) MEDLINE 93241930 REFERENCE 2 (bases 1 to 1301) AUTHORS Sogawa,K. TITLE Direct Submission JOURNAL Submitted (26-FEB-1993) to the DDBJ/EMBL/GenBank databases. Kazuhiro Sogawa, Tohoku University, Faculty of Science, Department of Chemistry; Aramaki aza aoba,Aoba-ku, Sendai, Miyagi 980, Japan (E-mail:ychujyo@ddbj.nig.ac.jp, Tel:022-222-1800(ex.3382), Fax:022-263-9207) COMMENT Submitted (26-Feb-1993) to DDBJ by: Kazuhiro Sogawa Department of Chemistry Tohoku University, Faculty of Science Aramaki aza Aoba Aoba-ku Sendai Miyagi 980 Japan Phone: 022-222-1800 x3382 Fax: 022-263-9207 E-mail: ychujyo@ddbj.nig.ac.jp. FEATURES Location/Qualifiers source 1..1301 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda gt11" /tissue_type="placenta" CDS 559..1218 /codon_start=1 /product="BTEB2" /db_xref="PID:d1003903" /db_xref="PID:g303597" /translation="MPSSTNQTAAMDTLNVSMSAAMAGLNTHTSAVPQTAVKQFQGMP PCTYTMPSQFLPQQATYFPPSPPSSEPGSPDRQAEMLQNLTPPPSYAATIASKLAIHN PNLPTTLPVNSQNIQPVRYNRRSNPDLEKRRIHYCDYPGCTKVYTKSSHLKAHLRTHT GEKPYKCTWEGCDWRFARSDELTRHYRKHTGAKPFQCGVCNRSFSRSDHLALHMKRHQ N" BASE COUNT 352 a 431 c 260 g 258 t ORIGIN 1 gggcacgcgc accaccgccc gcagcgcagc ccgcgcccgc gcaggccccg cagccggccc 61 agcccgccgc caccggccgc ggctgcctcc agaggacctg gtccagacaa gatgtgaaat 121 ggagaagtat ctgacacctc agcttcctcc agttcctata attccagagc ataaaaagta 181 tagacgagac agtgcctcag tcgtagacca gttcttcact gacactgaag ggttacctta 241 cagtatcaac atgaacgtct tcctccctga catcactcac ctgagaactg gcctctacaa 301 atcccagaga ccgtgcgtaa cacacatcaa gacagaacct gttgccattt tcagccacca 361 gagtgaaacg actgcccctc tccggccccg acccaggccc tccctgagtt caccagtata 421 ttcagctcac accagaccgc agctccagag gtgaacaata ttttcatcaa acaagaactt 481 cctacaccag atcttcatct ttctgtccct acccagcagg gccacctgta ccagctactg 541 aatacaccgg atctagatat gcccagttct acaaatcaga cagcagcaat ggacactctt 601 aatgtttcta tgtcagctgc catggcaggc cttaacacac acacctctgc tgttccgcag 661 actgcagtga aacaattcca gggcatgccc ccttgcacat acacaatgcc aagtcagttt 721 cttccacaac aggccactta ctttcccccg tcaccaccaa gctcagagcc tggaagtcca 781 gatagacaag cagagatgct ccagaattta accccacctc catcctatgc tgctacaatt 841 gcttctaaac tggcaattca caatccaaat ttacccacca ccctgccagt taactcacaa 901 aacatccaac ctgtcagata caatagaagg agtaaccccg atttggagaa acgacgcatc 961 cactactgcg attaccctgg ttgcacaaaa gtttatacca agtcttctca tttaaaagct 1021 cacctgagga ctcacactgg tgaaaagcca tacaagtgta cctgggaagg ctgcgactgg 1081 aggttcgcgc gatcggatga gctgacccgc cactaccgga agcacacagg cgccaagccc 1141 ttccagtgcg gggtgtgcaa ccgcagcttc tcgcgctctg accacctggc cctgcatatg 1201 aagaggcacc agaactgagc actgcccgtg tgacccgttc caggtcccct gggctccctc 1261 aaatgacaga cctaactatt cctgtgtaaa aacaacaacc c // LOCUS HUMBTF2A 1893 bp mRNA PRI 31-DEC-1994 DEFINITION Human basic transcription factor 62kD subunit (BTF2), complete cds. ACCESSION M95809 NID g179568 KEYWORDS basic transcription factor 62kD subunit. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1893) AUTHORS Fischer,L., Gerard,M., Chalut,C., Lutz,Y., Humbert,S., Kanno,M., Chambon,P. and Egly,J.M. TITLE Cloning of the 62 kd component of the basic transcription factor BTF2 JOURNAL Unpublished (1992) FEATURES Location/Qualifiers source 1..1893 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Hela cells" gene 55..1701 /gene="BTF2" CDS 55..1701 /gene="BTF2" /codon_start=1 /product="basic transcription factor 62kD subunit" /db_xref="PID:g179569" /translation="MATSSEEVLLIVKKVRQKKQDGALYLMAERIAWAPEGKDRFTIS HMYADIKCQKISPEGKAKIQLQLVLHAGDTTNFHFSNESTAVKERDAVKDLLQQLLPK FKRKANKELEEKNRMLQEDPVLFQLYKDLVVSQVISAEEFWANRLNVNATDSSSTSNH KQDVGISAAFLADVRPQTDGCNGLRYNLTSDIIESIFRTYPAVKMKYAENVPHNMTEK EFWTRFFQSHYFHRDRLNTGSKDLFAECAKIDEKGLKTMVSLGVKNPLLDLTALEDKP LDEGYGISSVPSASNSKSIKENSNAAIIKRFNHHSAMVLAAGLRKQEAQNEQTSEPSN MDGNSGDADCFQPAVKRAKLQESIEYEDLGKNNSVKTIALNLKKSDRYYHGPTPIQSL QYATSQDIINSFQSIRQEMEAYTPKLTQVLSSSAASSTITALSPGGALMQGGTQQAIN QMVPNDIQSELKHLYVAVGELLRHFWSCFPVNTPFLEEKVVKMKSNLERFQVTKLCPF QEKIRRQYLSTNLVSHIEEMLQTAYNKLHTWQSRRLMKKT" BASE COUNT 639 a 386 c 407 g 461 t ORIGIN 1 agttagttac ttcctgtcta gagttgtagc ttccacctgc accttctagc caccatggca 61 acctcatctg aagaagtttt gctgattgta aagaaagtgc gtcaaaagaa gcaggatgga 121 gctctgtacc tcatggcaga aagaattgct tgggcacctg aaggcaaaga tagatttaca 181 atcagccata tgtatgcaga tattaaatgc cagaaaatta gtccagaagg aaaagctaaa 241 attcagcttc agctggtcct acatgcaggg gacacaacta acttccattt ttccaatgaa 301 agcacagcag tgaaagagcg agatgcagta aaagaccttc ttcagcagct gctgcccaaa 361 ttcaagagga aagcaaataa agaactggaa gagaagaaca gaatgctgca agaagatcct 421 gttttgtttc agctttataa agaccttgtt gtgagtcaag tgatcagtgc tgaggaattc 481 tgggccaatc gtttaaatgt gaatgcaaca gatagttctt ccacatccaa tcataagcag 541 gatgttggca tttctgctgc atttctggct gatgtccggc cccaaactga tggctgtaac 601 ggtctaagat ataatttaac ttctgatatc attgagtcca tatttaggac ctatccagca 661 gtaaaaatga aatatgcaga aaatgttccc cacaacatga cagagaagga attctggaca 721 cgttttttcc agtcccatta ttttcacagg gatcggctga atacagggtc aaaggatctc 781 tttgcagaat gtgccaaaat agatgaaaaa ggcctaaaaa caatggtttc attaggagtg 841 aaaaacccac tactagattt aacagctttg gaagataaac cattagatga gggctatggc 901 atttcctctg tgccatctgc ttccaattct aaatccataa aagagaatag taatgctgcc 961 atcatcaaga gatttaacca tcacagtgcc atggtcctgg cagctggact cagaaaacaa 1021 gaagcacaaa atgaacaaac tagtgagccc agcaacatgg atggaaattc cggagatgca 1081 gactgctttc agccagcagt caaaagggcg aaattacaag agtccattga atatgaagac 1141 ttggggaaaa ataattctgt aaaaacgatt gcactaaacc tcaagaagtc agataggtat 1201 tatcatggtc caactccaat ccagtcacta cagtatgcaa caagtcagga cattattaat 1261 tcttttcaaa gtattagaca agaaatggaa gcttatacac ccaagttaac tcaggttctc 1321 tcaagtagtg ctgccagtag taccatcaca gcactgtcac ctggaggggc acttatgcag 1381 ggaggaacac agcaagccat aaaccagatg gtgccaaatg atattcaatc tgaattgaaa 1441 cacttatatg tagctgttgg agaacttcta cgacatttct ggtcctgctt tcctgttaat 1501 acgccattcc tagaagaaaa ggtagtgaaa atgaaaagta atttggaacg attccaagtt 1561 acgaagctct gtccattcca agaaaagatt cggagacagt atttaagcac aaatttggta 1621 agtcacatag aagagatgct ccagacagcc tacaacaagc tccacacatg gcagtcacgg 1681 cgtctgatga agaaaacgtg aggtggccat gatgcttaca ggttttgtga gattgagaga 1741 actatgacct gcagcaactc tggaaacctg gcctgacaga caagcagatg acctcacagg 1801 agtgataaga aacatctgct ccacgccaac tcccagagct gatgctattg tacttgcaca 1861 ttggagactg aaaggaaaga agggactaaa tgc // LOCUS HUMBTFD 1350 bp DNA PRI 31-DEC-1994 DEFINITION Human BTF3 protein homologue gene, complete cds. ACCESSION M90356 NID g179575 KEYWORDS BTF3 protein homologue. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1350) AUTHORS Kanno,M., Chalut,C. and Egly,J.M. TITLE Genomic structure of the putative BTF3 transcription factor JOURNAL Gene 117 (2), 219-228 (1992) MEDLINE 92347696 FEATURES Location/Qualifiers source 1..1350 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="leukocyte" gene 517..1161 /gene="BTF3 homologue" CDS 517..1161 /gene="BTF3 homologue" /codon_start=1 /product="BTF3 homologue" /db_xref="PID:g179576" /translation="MVLSRGSLLNTRPRITSGSGSTGRPRSGSTCRSTDHEPGKLAKL QAQVRIGGKGTAHRKKKVFHRTATADDKKLQFSLKKLQVNNISGIEKVNMFTNQGTVI HFNNPKFQASLAVNTFTITGHAEAKQVTEMLPSVLSQLGADSLTSLRRLAEVLPKQPV DGKAPLATGGDDDDGVPELWRILMRLPGMRQAELSRLLKKIKLEEVTGSCYFLL" BASE COUNT 423 a 272 c 279 g 376 t ORIGIN 1 agaaaattaa gggaaaatcg attcctctta tctagttact tagatattgg ccttggcttt 61 atctcaatat tatatggatc atagctggca actaattcag tccagtaaat atcctcaata 121 gggaataata tatgcttccc attccatcgg gaaaaagttt tgttcaacac accaagctca 181 atcaactcac taatgtatgg gaattgtttt gatgtaacca catacttcct gccttcatta 241 agggctgcgc acaaaaccat agattgctct tctgtaaggt tttgaattac tgatcgcact 301 ttatcgtttt gcatcttaat gcgttttctt agcttaaatc gcttatatct ggcgctggca 361 atagctgata atcgatgcac attaattgct agcgaaaatg caagagcaaa gacgaaaaca 421 tgccacacat gaggaatacc gattctctca ttaacatatt caggccagtt atctgggctt 481 aaaagcagaa gtccaaccca gataacgatc atatacatgg ttctctccag aggttcatta 541 ctgaacactc gtccgagaat aacgagtgga tctgggtcga ccggtcgacc cagatctggg 601 tcgacctgca ggtcaacgga tcatgaacca ggaaaactcg ccaaactgca ggcacaagtg 661 cgcattggtg ggaaaggaac tgctcacaga aagaagaagg tcttccatag aacagccaca 721 gccgatgata aaaagcttca attctcctta aagaagttac aggtaaacaa tatctctggt 781 attgaaaagg tgaatatgtt tacaaaccaa ggaacagtga tccactttaa caaccctaaa 841 tttcaggcat cgctggcagt gaacactttc accataacag gtcatgctga ggcaaagcag 901 gtgacagaaa tgctacccag tgtcttaagc cagcttggtg cagacagtct gactagttta 961 aggagactgg ctgaagttct gcccaaacaa cctgtggatg gaaaagcacc acttgctact 1021 ggaggggatg atgatgatgg agttccagaa ttgtggagaa ttttgatgag gcttccagga 1081 atgaggcaag ctgaattgag tcgacttctg aagaagataa aacttgaaga agttactggg 1141 agctgctatt ttctattatg actgcttttt aagaaatttt ttgttcatgg atctgataaa 1201 atctagatct ctatacttct aagcccaagc cccttggaca ctgtagcact ttttagtttt 1261 cgcttataca taatcattct ttttagctaa ttaagctgca gaacgtggga aataaagttc 1321 gaaacaaagg ttaataaagt tctttgcctt // LOCUS HUMBYSTIN 1262 bp mRNA PRI 10-JAN-1996 DEFINITION Homo sapiens bystin mRNA, complete cds. ACCESSION L36720 NID g1160618 KEYWORDS bystin. SOURCE Homo sapiens (clone: HT-H 53) male embryo carcinoma cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1262) AUTHORS Fukuda,M.N. and Zara,J. TITLE Bystin JOURNAL Unpublished (1995) FEATURES Location/Qualifiers source 1..1262 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HT-H 53" /cell_line="HT-H" /cell_type="germ cell" /dev_stage="embryo" /sex="male" /tissue_type="carcinoma" mRNA 1..1262 5'UTR 1..64 CDS 65..985 /codon_start=1 /function="assist trophinin mediated cell adhesion" /product="bystin" /db_xref="PID:g1160619" /translation="MEKLTEKQTEVETVMSEVSGFPMPQLDPRVLEVYRGVREVLSKY RSGKLPKAFKIIPALSNWEQILYVTEPEAWTAAAMYQATRIFASNLKERMAQRFYNLV LLPRVRDDVGEYKRLNFHLYMALKKALFKPGAWFKGILIPLCESGTCTLREAIIVGSI ITKCSIPVLHSSAAMLKIAEMEYSGANSIFLRLLLDKKYALPYRVLDALVFHFLGFRT EKRELPVLWHQCLLTLVQRYKADLATDQKEALLELLRLQPHPQLSPEIRRELQSAAPA CGRCSHHRGVRKTVSLSWPKGFGRTPRPRW" 3'UTR 986..1261 polyA_signal 1236..1241 polyA_site 1262 BASE COUNT 294 a 348 c 336 g 284 t ORIGIN 1 gcgtgccata gagatgttca tgaacaagaa ccctcctgcc aggcgcaccc tggctgacat 61 catcatggag aagctgactg agaagcagac agaggttgag acagtcatgt cagaggtgtc 121 gggcttccct atgccccagc tggacccccg ggtcctagaa gtgtacaggg gggtccggga 181 ggtattatct aagtaccgca gtggaaaact gcccaaggca tttaagatca tccctgcact 241 ctccaactgg gagcaaatcc tctacgtcac agagccggag gcctggactg cagctgccat 301 gtaccaggcc accaggattt ttgcctctaa cctgaaggaa cgcatggccc agcgcttcta 361 caaccttgtc ctgctccctc gagtacgaga tgacgttggt gaatacaaac gactcaactt 421 ccatctctac atggctctca agaaggccct tttcaaacct ggagcctggt tcaaagggat 481 cctgattcca ctgtgcgagt ctggcacttg taccctccgg gaagccatca ttgtgggtag 541 catcatcacc aagtgctcca tccctgtgtt gcactccagt gcggccatgc tgaaaattgc 601 tgagatggaa tacagcggtg ccaacagcat cttcctgcga ctgctgctgg ataagaagta 661 tgcactgcct taccgggtgc tggatgccct agtcttccac ttcctggggt tccggacaga 721 gaagcgtgaa ctgcctgtgc tgtggcacca gtgcctcctg actttggtcc agcgctacaa 781 ggccgacttg gccacagacc agaaagaggc cctcttagaa ctgctccggc tgcagcccca 841 tccacagcta tcgcccgaaa tcaggcgtga gcttcagagt gcagcccccg catgtggaag 901 atgttcccat caccgtggag tgaggaaaac agtcagcttg tcctggccaa aggggtttgg 961 aaggacacca agaccccgtt ggtgactgaa gatgacactg agctttaatg gctgaagacc 1021 cagatcaggg cagtgaccag atcacaggga catctgtggc tcccagtcca ggacaggaag 1081 gactgagggt ctggctggtt ccctcttcca ttctaggccc ttatccctgt ttagttctga 1141 gagccaactt gagataccat atgctagcat tcccagtccc cagctggggc ttggtgtgag 1201 tactttttct atggctattg tgtcaggtca ctgtggataa aggcaaagac agatatttat 1261 tg // LOCUS HUMBZIPA 1496 bp mRNA PRI 28-JUN-1994 DEFINITION Human leucine zipper mRNA, complete cds. ACCESSION L13974 NID g506817 KEYWORDS NF-E2 complex; leucine zipper protein. SOURCE Homo sapiens (library: K562 lambda gt11 cDNA library of Stuart Orkin) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1496) AUTHORS Ney,P.A., Andrews,N.C., Jane,S.M., Safer,B., Purucker,M.E., Weremowicz,S., Goff,S.C., Orkin,S.H. and Nienhuis,A.W. TITLE Purification of the human NF-E2 complex: cDNA cloning of the hematopoietic cell-specific subunit and evidence for an associated partner JOURNAL Mol. Cell. Biol. 13, 5604-5612 (1993) MEDLINE 93360994 FEATURES Location/Qualifiers source 1..1496 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="K562 human erythroleukemia" /tissue_lib="K562 lambda gt11 cDNA library of Stuart Orkin" CDS 257..1378 /standard_name="bzip; p45 NF-E2" /note="basic region" /codon_start=1 /product="leucine zipper protein" /db_xref="PID:g506818" /translation="MSPCPPQQSRNRVIQLSTSELGEMELTWQEIMSITELQGLNAPS EPSFEPQAPAPYLGPPPPTTYCPCSIHPDSGFPLPPPPYELPASTSHVPDPPYSYGNM AIPVSKPLSLSGLLSEPLQDPLALLDIGLPAGPPKPQEDPESDSGLSLNYSDAESLEL EGTEAGRRRSEYVEMYPVEYPYSLMPNSLAHSNYTLPAAETPLALEPSSGPVRAKPTA RGEAGSRDERRALAMKIPFPTDKIVNLPVDDFNELLARYPLTESQLALVRDIRRRGKN KVAAQNCRKRKLETIVQLERELERLTNERERLLRARGEADRTLEVMRQQLTELYRDIL EHLRDESGNSYSPEEYALQQAADGTIFLVPRGTKMEATD" BASE COUNT 335 a 476 c 401 g 284 t ORIGIN 1 tgcttggggc tcctgtgctc agctcagcct gagcttccac actcagcgct cagcaatggc 61 ccgggggcgg ggcgcggtcc tcgcagattc tcaaaggtag ccgggatcct cgtccagcag 121 tgtcagctca ggctcagcct ccccagagac aacaccggga ccctcatctc tctcctcacc 181 ctgctgtgac tccaccacag gtttctagag ccatctgggc tttccgggaa cctggaccag 241 actctggccc agtaggatgt ccccgtgtcc tccccagcag agcaggaaca gggtgataca 301 gctgtccact tcagagctag gagagatgga actgacttgg caggagatca tgtccatcac 361 cgagctgcag ggtctgaatg ctccaagtga gccatcattt gagccccaag ccccagctcc 421 ataccttgga cctccaccac ccacaactta ctgcccctgc tcaatccacc cagattctgg 481 cttcccactt cctccaccac cttatgagct cccagcatcc acatcccatg tcccagatcc 541 cccatactcc tatggcaaca tggccatacc agtctccaag ccactgagcc tctcaggcct 601 gctcagtgag ccgctccaag accccttagc cctcctggac attgggctgc cagcagggcc 661 acctaagccc caagaagacc cagaatccga ctcaggatta tccctcaact atagcgatgc 721 tgaatctctt gagctggagg ggacagaggc tggtcggcgg cgcagcgaat atgtagagat 781 gtacccagtg gagtacccct actcactcat gcccaactcc ttggcccact ccaactatac 841 cttgccagct gctgagaccc ccttggcctt agagccctcc tcaggccctg tgcgggctaa 901 gcccactgca cggggggagg cagggagtcg ggatgaacgt cgggccttgg ccatgaagat 961 tccttttcct acggacaaga ttgtcaactt gccggtagat gactttaatg agctattggc 1021 aaggtacccg ctgacagaga gccagctagc gctagtccgg gacatccgac gacggggcaa 1081 aaacaaggtg gcagcccaga actgccgcaa gaggaagctg gaaaccattg tgcagctgga 1141 gcgggagctg gagcggctga ccaatgaacg ggagcggctt ctcagggccc gcggggaggc 1201 agaccggacc ctggaggtca tgcgccaaca gctgacagag ctgtaccgtg acattttgga 1261 gcaccttcgg gatgaatcag gcaacagcta ctctcctgaa gagtacgcgc tgcaacaggc 1321 tgccgatggg accatcttcc ttgtgccccg ggggaccaag atggaggcca cagactgagc 1381 tggcccagag ggtggaactg ctgatgggat tttccttcat tcccttctga taaaggtact 1441 ccccaaccct gagtcccaga aggagctgag ttctctagac cagaagagga tgacaa // LOCUS HUMC1PHTYR 1521 bp mRNA PRI 29-JUL-1993 DEFINITION Homo sapiens cytoplasmic phosphotyrosyl protein phosphatase (clone type 1) complete cds. ACCESSION M83653 NID g179635 KEYWORDS cytoplasmic phosphotyrosyl protein phosphatase; red cell acid phosphatase. SOURCE Homo sapiens (library: Lambda gt11) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1521) AUTHORS Wo,Y.-Y.P., McCormack,A.L., Shabonowitz,J., Hunt,D.F., Davis,J.P., Mitchell,G.L. and Van Etten,R.L. TITLE Sequencing, cloning and expression of Human red cell-type acid phosphatase, a Cytoplasmic Phosphotyrosyl protein phosphatase JOURNAL J. Biol. Chem. 267, 10856-10865 (1992) MEDLINE 92268143 REFERENCE 2 (bases 1 to 1521) AUTHORS Zhou,M.-M., Davis,J.P. and Van Etten,R.L. TITLE Identification and pKa determination of the histidine residues of human low-molecular-weight phosphotyrosyl protein phosphatases: A convenient approach using a MLEV-17 spectral editing scheme JOURNAL Unpublished (1993) FEATURES Location/Qualifiers source 1..1521 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="placenta cDNA" /tissue_lib="Lambda gt11" CDS 45..521 /codon_start=1 /product="cytoplasmic phosphotyrosyl protein phosphatase" /db_xref="PID:g179636" /translation="MAEQATKSVLFVCLGNICRSPIAEAVFRKLVTDQNISENWRVDS AATSGYEIGNPPDYRGQSCMKRHGIPMSHVARQITKEDFATFDYILCMDESNLRDLNR KSNQVKTCKAKIELLGSYDPQKQLIIEDPYYGNDSDFETVYQQCVRCCRAFLEKAH" mat_peptide 48..518 /product="cytoplasmic phosphotyrosyl protein phosphatase" mutation 516..517 /standard_name="H157A mutant" /note="created by site directed mutagenesis" /citation=[2] /replace="gc" BASE COUNT 466 a 293 c 340 g 422 t ORIGIN 1 gggggcgtgc ggaacggggt gtctcggcgc ctctgcgcgg gaagatggcg gaacaggcta 61 ccaagtccgt gctgtttgtg tgtctgggta acatttgtcg atcacccatt gcagaagcag 121 ttttcaggaa acttgtaacc gatcaaaaca tctcagagaa ttggagggta gacagcgcgg 181 caacttccgg gtatgagata gggaaccccc ctgactaccg agggcagagc tgcatgaaga 241 ggcacggcat tcccatgagc cacgttgccc ggcagattac caaagaagat tttgccacat 301 ttgattatat actatgtatg gatgaaagca atctgagaga tttgaataga aaaagtaatc 361 aagttaaaac ctgcaaagct aaaattgaac tacttgggag ctatgatcca caaaaacaac 421 ttattattga agatccctat tatgggaatg actctgactt tgagacggtg taccagcagt 481 gtgtcaggtg ctgcagagcg ttcttggaga aggcccactg aggcaggttc gtgccctgct 541 gcggccagcc tgactagacc ccaccctgag gtcctgcatt tctcagtcgg tgtgtaatca 601 cgttccaggg cccaaagcca gctctttgtt cagttgactt actgtttctt accttaaaaa 661 gtaattgtag atggaaatca gttgtgtttg gcaggagaat caataaaaat gtttgattca 721 gacagcttat ggggtatttt aagcattctt agactagttg aacatctcac tttgccccag 781 ttacaaaaat agtagaacaa gcaacataaa acaatgaagg aaaacctcac ttgaaggccc 841 aggtcaacat ctaagcctgt tgagacttag ataatcgagt ctacctcttc agtaggtttg 901 tgtggatggc ctggaggcag gtgcttgctc cccagtgcta cctctctctt ccctagggcc 961 ttttgtggat tgacagtagt cccctccgta gagctcacag tctagattag aagtgtttta 1021 atttctacac acccatagtg cacacttgta tattgaaaag atagggaaga gagaaacatt 1081 tatggaatca gtcgttggca ccttcaatac ttcatgattt ttgtcgagtt tacttcatga 1141 ggaggtcagc ccattggctc ccatctgaac cactttgcct ctgaaactta attacatcca 1201 gaaagaagga cacttgtatg ctagtctatg gtcagttgag gaatatgact gtttttatat 1261 gcacatgtaa cccaaatgtc caatataaat tggcttattt tttaaaataa ttttaaaagt 1321 tgggaaaagt gttattattt ggcatgctta aatattgaat aagtattctt catcagcatt 1381 taataaatgt ataggcagat gtaaggtaat ttctgtgtat tttgagataa tgtcaaaatc 1441 atgaatattt caaaataaac tggggagtta taaaaataca actagagata taaaaaaaaa 1501 aaaaaaaaaa aaaaaaaacc c // LOCUS HUMC1R 2493 bp mRNA PRI 31-OCT-1994 DEFINITION Human complement C1r mRNA, complete cds. ACCESSION M14058 NID g179643 KEYWORDS . SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2493) AUTHORS Leytus,S.P., Kurachi,K., Sakariassen,K.S. and Davie,E.W. TITLE Nucleotide sequence of the cDNA coding for human complement C1r JOURNAL Biochemistry 25 (17), 4855-4863 (1986) MEDLINE 87026566 FEATURES Location/Qualifiers source 1..2493 /organism="Homo sapiens" /db_xref="taxon:9606" /map="12p13" mat_peptide 64..1452 /gene="C1R" /note="C1r A chain" CDS 64..2181 /gene="C1R" /note="human complement C1r" /codon_start=1 /db_xref="GDB:G00-119-729" /db_xref="PID:g179644" /translation="MWLLYLLVPALFCRAGGSIPIPQKLFGEVTSPLFPKPYPNNFET TTVITVPTGYRVKLVFQQFDLEPSEGCFYDYVKISADKKSLGRFCGQLGSPLGNPPGK KEFMSQGNKMLLTFHTDFSNEENGTIMFYKGFLAYYQAVDLDECASRSKSGEEDPQPQ CQHLCHNYVGGYFCSCRPGYELQEDRHSCQAECSSELYTEASGYISSLEYPRSYPPDL RCNYSIRVERGLTLHLKFLEPFDIDDHQQVHCPYDQLQIYANGKNIGEFCGKQRPPDL DTSSNAVDLLFFTDESGDSRGWKLRYTTEIIKCPQPKTLDEFTIIQNLQPQYQFRDYF IATCKQGYQLIEGNQVLHSFTAVCQDDGTWHRAMPRCKIKDCGQPRNLPNGDFRYTTT MGVNTYKARIQYYCHEPYYKMQTRAGSRESEQGVYTCTAQGIWKNEQKGEKIPRCLPV CGKPVNPVEQRQRIIGGQKAKMGNFPWQVFTNIHGRGGGALLGDRWILTAAHTLYPKE HEAQSNASLDVFLGHTNVEELMKLGNHPIRRVSVHPDYRQDESYNFEGDIALLELENS VTLGPNLLPICLPDNDTFYDLGLMGYVSGFGVMEEKIAHDLRFVRLPVANPQACENWL RGKNRMDVFSQNMFCAGHPSLKQDACQGDSGGVFAVRDPNTDRWVATGIVSWGIGCSR GYGFYTKVLNYVDWIKKEMEEED" gene 64..2181 /gene="C1R" mat_peptide 1453..2178 /gene="C1R" /note="C1r B chain serine protease" BASE COUNT 619 a 680 c 662 g 532 t ORIGIN Chromosome 12p13. 1 ggatcgattt gagtaagagc atagctgtcg ggagagccca ggattcaaca cgggccttga 61 gaaatgtggc tcttgtacct cctggtgccg gccctgttct gcagggcagg aggctccatt 121 cccatccctc agaagttatt tggggaggtg acttcccctc tgttccccaa gccttacccc 181 aacaactttg aaacaaccac tgtgatcaca gtccccacgg gatacagggt gaagctcgtc 241 ttccagcagt ttgacctgga gccttctgaa ggctgcttct atgattatgt caagatctct 301 gctgataaga aaagcctggg gaggttctgt gggcaactgg gttctccact gggcaacccc 361 ccgggaaaga aggaatttat gtcccaaggg aacaagatgc tgctgacctt ccacacagac 421 ttctccaacg aggagaatgg gaccatcatg ttctacaagg gcttcctggc ctactaccaa 481 gctgtggacc ttgatgaatg tgcttcccgg agcaaatcag gggaggagga tccccagccc 541 cagtgccagc acctgtgtca caactacgtt ggaggctact tctgttcctg ccgtccaggc 601 tatgagcttc aggaagacag gcattcctgc caggctgagt gcagcagcga gctgtacacg 661 gaggcatcag gctacatctc cagcctggag taccctcggt cctacccccc tgacctgcgc 721 tgcaactaca gcatccgggt ggagcggggc ctcaccctgc acctcaagtt cctggagcct 781 tttgatattg atgaccacca gcaagtacac tgcccctatg accagctaca gatctatgcc 841 aacgggaaga acattggcga gttctgtggg aagcaaaggc cccccgacct cgacaccagc 901 agcaatgctg tggatctgct gttcttcaca gatgagtcgg gggacagccg gggctggaag 961 ctgcgctaca ccaccgagat catcaagtgc ccccagccca agaccctaga cgagttcacc 1021 atcatccaga acctgcagcc tcagtaccag ttccgtgact acttcattgc tacctgcaag 1081 caaggctacc agctcataga ggggaaccag gtgctgcatt ccttcacagc tgtctgccag 1141 gatgatggca cgtggcatcg tgccatgccc agatgcaaga tcaaggactg tgggcagccc 1201 cgaaacctgc ctaatggtga cttccgttac accaccacaa tgggagtgaa cacctacaag 1261 gcccgtatcc agtactactg ccatgagcca tattacaaga tgcagaccag agctggcagc 1321 agggagtctg agcaaggggt gtacacctgc acagcacagg gcatttggaa gaatgaacag 1381 aagggagaga agattcctcg gtgcttgcca gtgtgtggga agcccgtgaa ccccgtggaa 1441 cagaggcagc gcataatcgg agggcaaaaa gccaagatgg gcaacttccc ctggcaggtg 1501 ttcaccaaca tccacgggcg cgggggcggg gccctgctgg gcgaccgctg gatcctcaca 1561 gctgcccaca ccctgtatcc caaggaacac gaagcgcaaa gcaacgcctc tttggatgtg 1621 ttcctgggcc acacaaatgt ggaagagctc atgaagctag gaaatcaccc catccgcagg 1681 gtcagcgtcc acccggacta ccgtcaggat gagtcctaca attttgaggg ggacatcgcc 1741 ctgctggagc tggaaaatag tgtcaccctg ggtcccaacc tcctccccat ctgcctccct 1801 gacaacgata ccttctacga cctgggcttg atgggctatg tcagtggctt cggggtcatg 1861 gaggagaaga ttgctcatga cctcaggttt gtccgtctgc ccgtagctaa tccacaggcc 1921 tgtgagaact ggctccgggg aaagaatagg atggatgtgt tctctcaaaa catgttctgt 1981 gctggacacc catctctaaa gcaggacgcc tgccaggggg atagtggggg cgtttttgca 2041 gtaagggacc cgaacactga tcgctgggtg gccacgggca tcgtgtcctg gggcatcggg 2101 tgcagcaggg gctatggctt ctacaccaaa gtgctcaact acgtggactg gatcaagaaa 2161 gagatggagg aggaggactg agcccagaat tcactaggtt cgaatccaga gagcagtgtg 2221 gaaaaaaaaa aaacaaaaaa caactgacca gttgttgata accactaaga gtctctatta 2281 aaattactga tgcagaaaga ccgtgtgtga aattctcttt cctgtagtcc cattgatgta 2341 ctttacctga aacaacccaa aggccccttt ctttcttctg aggattgcag aggatatagt 2401 tatcaatctc tagttgtcac tttcctcttc cactttgata ccattgggtc attgaatata 2461 actttttcca aataaagttt tatgagaaat gcc // LOCUS HUMC2CNT 2204 bp DNA PRI 10-APR-1996 DEFINITION Homo sapiens core 2 beta-1,6-N-acetylglucosaminyltransferase (core 2 GnT) gene, complete cds. ACCESSION L41415 NID g886272 KEYWORDS beta-1,6-N-acetylglucosaminyltransferase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2204) AUTHORS Bierhuizen,M.F., Maemura,K., Kudo,S. and Fukuda,M. TITLE Genomic organization of core 2 and I branching beta-1,6-N-acetylglucosaminyltransferases. Implication for evolution of the beta-1,6-N-acetylglucosaminyltransferase gene family JOURNAL Glycobiology 5 (4), 417-425 (1995) MEDLINE 96078409 FEATURES Location/Qualifiers source 1..2204 /organism="Homo sapiens" /note="(vector lambda EMBL3)" /db_xref="taxon:9606" /map="chromosome 9" /tissue_type="placenta" intron <1..100 /number=1 exon 101..2204 /number=2 gene 244..1530 /gene="core 2 GnT" CDS 244..1530 /gene="core 2 GnT" /EC_number="2.4.1.102" /note="core 2" /codon_start=1 /product="beta-1,6-N-acetylglucosaminyltransferase" /db_xref="PID:g886273" /translation="MLRTLLRRRLFSYPTKYYFMVLVLSLITFSVLRIHQKPEFVSVR HLELAGENPSSDINCTKVLQGDVNEIQKVKLEILTVKFKKRPRWTPDDYINMTSDCSS FIKRRKYIVEPLSKEEAEFPIAYSIVVHHKIEMLDRLLRAIYMPQNFYCVHVDTKSED SYLAAVMGIASCFSNVFVASRLESVVYASWSRVQADLNCMKDLYAMSANWKYLINLCG MDFPIKTNLEIVRKLKLLMGENNLETERMPSHKEERWKKRYEVVNGKLTNTGTVKMLP PLETPLFSGSAYFVVSREYVGYVLQNEKIQKLMEWAQDTYSPDEYLWATIQRIPEVPG SLPASHKYDLSDMQAVARFVKWQYFEGDVSKGAPYPPCDGVHVRSVCIFGAGDLNWML RKHHLFANKFDVDVDLFAIQCLDEHLRHKALETLKH" BASE COUNT 641 a 414 c 498 g 651 t ORIGIN 1 gatttattgt gaaaaactct ctctctctct ctctctctgt atatatatat atatatatat 61 atatatttat ttatatttat aattgcttct tttatttcag tgctgctctt catttcaaga 121 tgccgttgca gctctgataa atgcaaactg acaaccttca aggccacgac ggagggaaaa 181 tcattggtgc ttggagcata gaagactgcc cttcacaaag gaaatccctg attattgttt 241 gaaatgctga ggacgttgct gcgaaggaga cttttttctt atcccaccaa atactacttt 301 atggttcttg ttttatccct aatcaccttc tccgttttaa ggattcatca aaagcctgaa 361 tttgtaagtg tcagacactt ggagcttgct ggggagaatc ctagtagtga tattaattgc 421 accaaagttt tacagggtga tgtaaatgaa atccaaaagg taaagcttga gatcctaaca 481 gtgaaattta aaaagcgccc tcggtggaca cctgacgact atataaacat gaccagtgac 541 tgttcttctt tcatcaagag acgcaaatat attgtagaac cccttagtaa agaagaggcg 601 gagtttccaa tagcatattc tatagtggtt catcacaaga ttgaaatgct tgacaggctg 661 ctgagggcca tctatatgcc tcagaatttc tattgcgttc atgtggacac aaaatccgag 721 gattcctatt tagctgcagt gatgggcatc gcttcctgtt ttagtaatgt ctttgtggcc 781 agccgattgg agagtgtggt ttatgcatcg tggagccggg ttcaggctga cctcaactgc 841 atgaaggatc tctatgcaat gagtgcaaac tggaagtact tgataaatct ttgtggtatg 901 gattttccca ttaaaaccaa cctagaaatt gtcaggaagc tcaagttgtt aatgggagaa 961 aacaacctgg aaacggagag gatgccatcc cataaagaag aaaggtggaa gaagcggtat 1021 gaggtcgtta atggaaagct gacaaacaca gggactgtca aaatgcttcc tccactcgaa 1081 acacctctct tttctggcag tgcctacttc gtggtcagta gggagtatgt ggggtatgta 1141 ctacagaatg aaaaaatcca aaagttgatg gagtgggcac aagacacata cagccctgat 1201 gagtatctct gggccaccat ccaaaggatt cctgaagtcc cgggctcact ccctgccagc 1261 cataagtatg atctatctga catgcaagca gttgccaggt ttgtcaagtg gcagtacttt 1321 gagggtgatg tttccaaggg tgctccctac ccgccctgcg atggagtcca tgtgcgctca 1381 gtgtgcattt tcggagctgg tgacttgaac tggatgctgc gcaaacacca cttgtttgcc 1441 aataagtttg acgtggatgt tgacctcttt gccatccagt gtttggatga gcatttgaga 1501 cacaaagctt tggagacatt aaaacactga ccattacggg caattttatg aacaagaaga 1561 aggatacaca aaacgtaccc ttatctgttt ccccttcctt gtcagcatcg ggaagatggt 1621 atgaagtcct ctttggggca gggactctag tagatcttct tgtcagagaa gctgcatggt 1681 ttctgcagag cacagttagc tagaaaggtg atagcattaa atgttcatct agagttaata 1741 gtgggaggag taaaggtagc cttgaggcca gagcaggtag caaggcattg tggaaagagg 1801 ggaccagggt ggctggggaa gaggccgatg cataaagtca gcctgttcaa agtgctcagg 1861 gacttagcaa aatgagaaga tgtgacctgt gccaaaacta ttttgagaat tttaaatgtg 1921 accatttttc tggtatgaat aaacttacag caacaaataa tcaaagatac aattaatctg 1981 atattatatt tgttgaaata gaaatttgat tgtactataa atgatttttg taaataattt 2041 atattctgct ctaatactgt actgtgtagt gtgtctccgt atgtcatctc agggagctta 2101 aaatgggctt gatttaacat tgtttttgtg ttatttttgc ttgaaacaac gcacacattt 2161 tcaacaacca aaaaatgaca atttctagtt tagttaattt ctac // LOCUS HUMC3 5067 bp mRNA PRI 10-JAN-1996 DEFINITION Human complement component C3 mRNA, alpha and beta subunits, complete cds. ACCESSION K02765 NID g179664 KEYWORDS complement component 3. SOURCE Homo sapiens (clone: pC3.[11,49,59].) liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5067) AUTHORS de Bruijn,M.H. and Fey,G.H. TITLE Human complement component C3: cDNA coding sequence and derived primary structure JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82 (3), 708-712 (1985) MEDLINE 85140166 FEATURES Location/Qualifiers source 1..5067 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pC3.[11,49,59]." /tissue_type="liver" /map="19p13.3" gene 61..5052 /gene="C3" CDS 61..5052 /gene="C3" /note="preprocomplement component C3" /codon_start=1 /db_xref="GDB:G00-119-044" /product="complement component C3" /db_xref="PID:g179665" /translation="MGPTSGPSLLLLLLTHLPLALGSPMYSIITPNILRLESEETMVL EAHDAQGDVPVTVTVHDFPGKKLVLSSEKTVLTPATNHMGNVTFTIPANREFKSEKGR NKFVTVQATFGTQVVEKVVLVSLQSGYLFIQTDKTIYTPGSTVLYRIFTVNHKLLPVG RTVMVNIENPEGIPVKQDSLSSQNQLGVLPLSWDIPELVNMGQWKIRAYYENSPQQVF STEFEVKEYVLPSFEVIVEPTEKFYYIYNEKGLEVTITARFLYGKKVEGTAFVIFGIQ DGEQRISLPESLKRIPIEDGSGEVVLSRKVLLDGVQNLRAEDLVGKSLYVSATVILHS GSDMVQAERSGIPIVTSPYQIHFTKTPKYFKPGMPFDLMVFVTNPDGSPAYRVPVAVQ GEDTVQSLTQGDGVAKLSINTHPSQKPLSITVRTKKQELSEAEQATRTMQALPYSTVG NSNNYLHLSVLRTELRPGETLNVNFLLRMDRAHEAKIRYYTYLIMNKGRLLKAGRQVR EPGQDLVVLPLSITTDFIPSFRLVAYYTLIGASGQREVVADSVWVDVKDSCVGSLVVK SGQSEDRQPVPGQQMTLKIEGDHGARVVLVAVDKGVFVLNKKNKLTQSKIWDVVEKAD IGCTPGSGKDYAGVFSDAGLTFTSSSGQQTAQRAELQCPQPAARRRRSVQLTEKRMDK VGKYPKELRKCCEDGMRENPMRFSCQRRTRFISLGEACKKVFLDCCNYITELRRQHAR ASHLGLARSNLDEDIIAEENIVSRSEFPESWLWNVEDLKEPPKNGISTKLMNIFLKDS ITTWEILAVSMSDKKGICVADPFEVTVMQDFFIDLRLPYSVVRNEQVEIRAVLYNYRQ NQELKVRVELLHNPAFCSLATTKRRHQQTVTIPPKSSLSVPYVIVPLKTGLQEVEVKA AVYHHFISDGVRKSLKVVPEGIRMNKTVAVRTLDPERLGREGVQKEDIPPADLSDQVP DTESETRILLQGTPVAQMTEDAVDAERLKHLIVTPSGCGEQNMIGMTPTVIAVHYLDE TEQWEKFGLEKRQGALELIKKGYTQQLAFRQPSSAFAAFVKRAPSTWLTAYVVKVFSL AVNLIAIDSQVLCGAVKWLILEKQKPDGVFQEDAPVIHQEMIGGLRNNNEKDMALTAF VLISLQEAKDICEEQVNSLPGSITKAGDFLEANYMNLQRSYTVAIAGYALAQMGRLKG PLLNKFLTTAKDKNRWEDPGKQLYNVEATSYALLALLQLKDFDFVPPVVRWLNEQRYY GGGYGSTQATFMVFQALAQYQKDAPDHQELNLDVSLQLPSRSSKITHRIHWESASLLR SEETKENEGFTVTAEGKGQGTLSVVTMYHAKAKDQLTCNKFDLKVTIKPAPETEKRPQ DAKNTMILEICTRYRGDQDATMSILDISMMTGFAPDTDDLKQLANGVDRYISKYELDK AFSDRNTLIIYLDKVSHSEDDCLAFKVHQYFNVELIQPGAVKVYAYYNLEESCTRFYH PEKEDGKLNKLCRDELCRCAEENCFIQKSDDKVTLEERLDKACEPGVDYVYKTRLVKV QLSNDFDEYIMAIEQTIKSGSDEVQVGQQRTFISPIKCREALKLEEKKHYLMWGLSSD FWGEKPNLSYIIGKDTWVEHWPEEDECQDEENQKQCQDLGAFTESMVVFGCPN" sig_peptide 61..126 /gene="C3" /note="complement component C3 signal peptide" mat_peptide 127..2061 /gene="C3" /note="complement component C3 beta chain mature peptide" mat_peptide 2074..5049 /gene="C3" /note="complement component C3 alpha chain mature peptide" BASE COUNT 1245 a 1470 c 1394 g 958 t ORIGIN 816 bp upstream of BstEII site. 1 ctcctcccca tcctctccct ctgtccctct gtccctctga ccctgcactg tcccagcacc 61 atgggaccca cctcaggtcc cagcctgctg ctcctgctac taacccacct ccccctggct 121 ctggggagtc ccatgtactc tatcatcacc cccaacatct tgcggctgga gagcgaggag 181 accatggtgc tggaggccca cgacgcgcaa ggggatgttc cagtcactgt tactgtccac 241 gacttcccag gcaaaaaact agtgctgtcc agtgagaaga ctgtgctgac ccctgccacc 301 aaccacatgg gcaacgtcac cttcacgatc ccagccaaca gggagttcaa gtcagaaaag 361 gggcgcaaca agttcgtgac cgtgcaggcc accttcggga cccaagtggt ggagaaggtg 421 gtgctggtca gcctgcagag cgggtacctc ttcatccaga cagacaagac catctacacc 481 cctggctcca cagttctcta tcggatcttc accgtcaacc acaagctgct acccgtgggc 541 cggacggtca tggtcaacat tgagaacccg gaaggcatcc cggtcaagca ggactccttg 601 tcttctcaga accagcttgg cgtcttgccc ttgtcttggg acattccgga actcgtcaac 661 atgggccagt ggaagatccg agcctactat gaaaactcac cacagcaggt cttctccact 721 gagtttgagg tgaaggagta cgtgctgccc agtttcgagg tcatagtgga gcctacagag 781 aaattctact acatctataa cgagaagggc ctggaggtca ccatcaccgc caggttcctc 841 tacgggaaga aagtggaggg aactgccttt gtcatcttcg ggatccagga tggcgaacag 901 aggatttccc tgcctgaatc cctcaagcgc attccgattg aggatggctc gggggaggtt 961 gtgctgagcc ggaaggtact gctggacggg gtgcagaacc tccgagcaga agacctggtg 1021 gggaagtctt tgtacgtgtc tgccaccgtc atcttgcact caggcagtga catggtgcag 1081 gcagagcgca gcgggatccc catcgtgacc tctccctacc agatccactt caccaagaca 1141 cccaagtact tcaaaccagg aatgcccttt gacctcatgg tgttcgtgac gaaccctgat 1201 ggctctccag cctaccgagt ccccgtggca gtccagggcg aggacactgt gcagtctcta 1261 acccagggag atggcgtggc caaactcagc atcaacacac accccagcca gaagcccttg 1321 agcatcacgg tgcgcacgaa gaagcaggag ctctcggagg cagagcaggc taccaggacc 1381 atgcaggctc tgccctacag caccgtgggc aactccaaca attacctgca tctctcagtg 1441 ctacgtacag agctcagacc cggggagacc ctcaacgtca acttcctcct gcgaatggac 1501 cgcgcccacg aggccaagat ccgctactac acctacctga tcatgaacaa gggcaggctg 1561 ttgaaggcgg gacgccaggt gcgagagccc ggccaggacc tggtggtgct gcccctgtcc 1621 atcaccaccg acttcatccc ttccttccgc ctggtggcgt actacacgct gatcggtgcc 1681 agcggccaga gggaggtggt ggccgactcc gtgtgggtgg acgtcaagga ctcctgcgtg 1741 ggctcgctgg tggtaaaaag cggccagtca gaagaccggc agcctgtacc tgggcagcag 1801 atgaccctga agatagaggg tgaccacggg gcccgggtgg tactggtggc cgtggacaag 1861 ggcgtgttcg tgctgaataa gaagaacaaa ctgacgcaga gtaagatctg ggacgtggtg 1921 gagaaggcag acatcggctg caccccgggc agtgggaagg attacgccgg tgtcttctcc 1981 gacgcagggc tgaccttcac gagcagcagt ggccagcaga ccgcccagag ggcagaactt 2041 cagtgcccgc agccagccgc ccgccgacgc cgttccgtgc agctcacgga gaagcgaatg 2101 gacaaagtcg gcaagtaccc caaggagctg cgcaagtgct gcgaggacgg catgcgggag 2161 aaccccatga ggttctcgtg ccagcgccgg acccgtttca tctccctggg cgaggcgtgc 2221 aagaaggtct tcctggactg ctgcaactac atcacagagc tgcggcggca gcacgcgcgg 2281 gccagccacc tgggcctggc caggagtaac ctggatgagg acatcattgc agaagagaac 2341 atcgtttccc gaagtgagtt cccagagagc tggctgtgga acgttgagga cttgaaagag 2401 ccaccgaaaa atggaatctc tacgaagctc atgaatatat ttttgaaaga ctccatcacc 2461 acgtgggaga ttctggctgt cagcatgtcg gacaagaaag ggatctgtgt ggcagacccc 2521 ttcgaggtca cagtaatgca ggacttcttc atcgacctgc ggctacccta ctctgttgtt 2581 cgaaacgagc aggtggaaat ccgagccgtt ctctacaatt accggcagaa ccaagagctc 2641 aaggtgaggg tggaactact ccacaatcca gccttctgca gcctggccac caccaagagg 2701 cgtcaccagc agaccgtaac catccccccc aagtcctcgt tgtccgttcc atatgtcatc 2761 gtgccgctaa agaccggcct gcaggaagtg gaagtcaagg ctgccgtcta ccatcatttc 2821 atcagtgacg gtgtcaggaa gtccctgaag gtcgtgccgg aaggaatcag aatgaacaaa 2881 actgtggctg ttcgcaccct ggatccagaa cgcctgggcc gtgaaggagt gcagaaagag 2941 gacatcccac ctgcagacct cagtgaccaa gtcccggaca ccgagtctga gaccagaatt 3001 ctcctgcaag ggaccccagt ggcccagatg acagaggatg ccgtcgacgc ggaacggctg 3061 aagcacctca ttgtgacccc ctcgggctgc ggggaacaga acatgatcgg catgacgccc 3121 acggtcatcg ctgtgcatta cctggatgaa acggagcagt gggagaagtt cggcctagag 3181 aagcggcagg gggccttgga gctcatcaag aaggggtaca cccagcagct ggccttcaga 3241 caacccagct ctgcctttgc ggccttcgtg aaacgggcac ccagcacctg gctgaccgcc 3301 tacgtggtca aggtcttctc tctggctgtc aacctcatcg ccatcgactc ccaagtcctc 3361 tgcggggctg ttaaatggct gatcctggag aagcagaagc ccgacggggt cttccaggag 3421 gatgcgcccg tgatacacca agaaatgatt ggtggattac ggaacaacaa cgagaaagac 3481 atggccctca cggcctttgt tctcatctcg ctgcaggagg ctaaagatat ttgcgaggag 3541 caggtcaaca gcctgccagg cagcatcact aaagcaggag acttccttga agccaactac 3601 atgaacctac agagatccta cactgtggcc attgctggct atgctctggc ccagatgggc 3661 aggctgaagg ggcctcttct taacaaattt ctgaccacag ccaaagataa gaaccgctgg 3721 gaggaccctg gtaagcagct ctacaacgtg gaggccacat cctatgccct cttggcccta 3781 ctgcagctaa aagactttga ctttgtgcct cccgtcgtgc gttggctcaa tgaacagaga 3841 tactacggtg gtggctatgg ctctacccag gccaccttca tggtgttcca agccttggct 3901 caataccaaa aggacgcccc tgaccaccag gaactgaacc ttgatgtgtc cctccaactg 3961 cccagccgca gctccaagat cacccaccgt atccactggg aatctgccag cctcctgcga 4021 tcagaagaga ccaaggaaaa tgagggtttc acagtcacag ctgaaggaaa aggccaaggc 4081 accttgtcgg tggtgacaat gtaccatgct aaggccaaag atcaactcac ctgtaataaa 4141 ttcgacctca aggtcaccat aaaaccagca ccggaaacag aaaagaggcc tcaggatgcc 4201 aagaacacta tgatccttga gatctgtacc aggtaccggg gagaccagga tgccactatg 4261 tctatattgg acatatccat gatgactggc tttgctccag acacagatga cctgaagcag 4321 ctggccaatg gtgttgacag atacatctcc aagtatgagc tggacaaagc cttctccgat 4381 aggaacaccc tcatcatcta cctggacaag gtctcacact ctgaggatga ctgtctagct 4441 ttcaaagttc accaatactt taatgtagag cttatccagc ctggagcagt caaggtctac 4501 gcctattaca acctggagga aagctgtacc cggttctacc atccggaaaa ggaggatgga 4561 aagctgaaca agctctgccg tgatgaactg tgccgctgtg ctgaggagaa ttgcttcata 4621 caaaagtcgg atgacaaggt caccctggaa gaacggctgg acaaggcctg tgagccagga 4681 gtggactatg tgtacaagac ccgactggtc aaggttcagc tgtccaatga ctttgacgag 4741 tacatcatgg ccattgagca gaccatcaag tcaggctcgg atgaggtgca ggttggacag 4801 cagcgcacgt tcatcagccc catcaagtgc agagaagccc tgaagctgga ggagaagaaa 4861 cactacctca tgtggggtct ctcctccgat ttctggggag agaagcccaa cctcagctac 4921 atcatcggga aggacacttg ggtggagcac tggcctgagg aggacgaatg ccaagacgaa 4981 gagaaccaga aacaatgcca ggacctcggc gccttcaccg agagcatggt tgtctttggg 5041 tgccccaact gaccacaccc ccattcc // LOCUS HUMC3GP 4070 bp mRNA PRI 05-AUG-1996 DEFINITION Human mRNA for C3G protein, complete cds. ACCESSION D21239 NID g474981 KEYWORDS C3G protein; CRK SH3-binding protein; GNRP; ras guanine nucleotide releasing factor. SOURCE Homo sapiens spleen and placenta cDNA to mRNA, clone pC3G2. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Tanaka,S., Morishita,T., Hashimoto,Y., Hattori,S., Nakamura,S., Shibuya,M., Matuoka,K., Takenawa,T., Kurata,T., Nagashima,K. and Matsuda,M. TITLE C3G, a guanine nucleotide-releasing protein expressed ubiquitously, binds to the Src homology 3 domains of CRK and GRB2/ASH proteins JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91 (8), 3443-3447 (1994) MEDLINE 94211880 REFERENCE 2 (sites) AUTHORS Knudsen,B.S., Feller,S.M. and Hanafusa,H. TITLE Four proline-rich sequences of the guanine-nucleotide exchange factor C3G bind with unique specificity to the first Src homology 3 domain of Crk JOURNAL J. Biol. Chem. 269 (52), 32781-32787 (1994) MEDLINE 95105157 REFERENCE 3 (bases 1 to 4070) AUTHORS Tanaka,S. TITLE domain of Crk JOURNAL Unpublished (1994) COMMENT Submitted (20-Oct-1993) to DDBJ by: Michiyuki Matsuda 1-23-1 Toyama, Shinjuku-ku Tokyo 162 Japan Phone: 03-5285-1111 x2625 Fax: 03-5285-1150. FEATURES Location/Qualifiers source 1..4070 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="spleen and placenta" CDS 131..3364 /function="'ras guanine nucleotide releasing factor'" /note="1897-2047 bp: binding site for SH3 domains of CRK and GRB2/ASH protein.; 2630-3322 bp: guanine nucleotide releasing factor; homologous region to CDC25" /codon_start=1 /product="'C3G protein'" /db_xref="PID:d1005302" /db_xref="PID:g474982" /translation="MDTDSQRSHLSSFTMKLMDKFHSPKIKRTPSKKGKPAEVSVKIP EKPVNKEATDRFLPEGYPLPLDLEQQAVEFMSTSAVASRSQRQKNLSWLEEKEKEVVS ALRYFKTIVDKMAIDKKVLEMLPGSASKVLEAILPLVQNDPRIQHSSALSSCYSRVYQ SLANLIRWSDQVMLEGVNSEDKEMVTTVKGVIKAVLDGVKELVRLTIEKQGRPSPTSP VKPSSPASKPDGPAELPLTDREVEILNKTTGMSQSTELLPDATDEEVAPPKPPLPGIR VVDNSPPPALTPKKRQSAPSPTRVAVVAPMSRATSGSSLPVGINRQDFDVDCYAQRRL SGGSHSYGGESPRLSPCSSIDKLSKSDEQLSSLDRDSGQCSRNTSCETLDHYDPDYEF LQQDLSNADQIPQQTAWNLSPLPESLGESGSPFLGPPFQLPLGGHPQPDGPLAPGQQT DTPPALPEKKRRSAASQTADGSGCRVSYERHPSQYDNISGEDLQSTAPIPSVPYAPFA AILPFQHGGSSAPVEFVGDFTAPESTGDPEKPPPLPEKKNKHMLAYMQLLEDYSEPQP SMFYQTPQNEHIYQQKNKLLMEVYGFSDSFSGVDSVQELAPPPALPPKQRQLEPPAGK DGHPRDPSAVSGVPGKDSRDGSERAPKSPDALESAQSEEEVDELSLIDHNEIMSRLTL KQEGDDGPDVRGGSGDILLVHATETDRKDLVLYCEAFLTTYRTFISPEELIKKLQYRY EKFSPFADTFKKRVSKNTFFVLVRVVDELCLVELTEEILKLLMELVFRLVCNGELSLA RVLRKNILDKVDQKKLLRCATSSQPLAARGVAARPGTLHDFHSHEIAEQLTLLDAELF YKIEIPEVLLWAKEQNEEKSPNLTQFTEHFNNMSYWVRSIIMLQEKAQDRERLLLKFI KIMKHLRKLNNFNSYLAILSALDSAPIRRLEWQKQTSEGLAEYCTLIDSSSSFRAYRA ALSEVEPPCIPYLGLILQDLTFVHLGNPDYIDGKVNFSKRWQQFNILDSMRCFQQAHY DMRRNDDIINFFNDFSDHLAEEALWELSLKIKPRNITRRKTDREEKT" BASE COUNT 939 a 1197 c 1148 g 786 t ORIGIN 1 gaattccctt ttctgggcac cgccttctgc tagggggttg tagatgaaag tgcctgctcc 61 cagagaagct tgtctaacct agcacagttt ctaagctacc caggctgcca gaccgagcga 121 ccgtgctgcc atggacacag actctcagcg ttctcatctc tcttccttca ccatgaagct 181 gatggacaaa ttccactcac ccaaaatcaa gagaacgcca tcaaagaagg gaaaaccagc 241 tgaggtgtcc gtaaagattc cagagaagcc tgtgaacaaa gaggcaacag acagatttct 301 accagagggc taccctctcc ccttggatct ggagcagcag gcagtagaat ttatgtccac 361 cagtgctgtg gcttccaggt ctcaaaggca gaagaacctg agctggctgg aggagaaaga 421 gaaggaagtt gtcagtgccc tgcgctactt taagaccatt gtggacaaaa tggcaattga 481 taagaaggta ctggagatgc ttccagggtc agccagcaag gtgctggagg ccatcttacc 541 cctggtgcag aacgatcctc gaattcagca cagctcagcc ctctcttcct gctatagccg 601 agtgtaccaa agcctcgcca acctcattcg ctggtctgac caagtgatgc tggaaggcgt 661 gaactcagaa gacaaggaga tggtgacgac tgtgaagggg gtcatcaagg ctgtgctgga 721 tggagtgaag gagctggtca ggctcaccat cgagaagcag ggacgtccgt ctccgacgag 781 ccccgtgaag cccagttccc ctgccagcaa gcctgatggc ccagcagagc tccccctgac 841 agaccgcgag gtagagatcc taaacaagac gactgggatg tcacagtcaa ctgagctcct 901 cccagatgcc acggatgaag aggtcgcgcc ccccaagcct cctctgcctg gcattcgggt 961 ggttgataat agtcctccac cagcattgac acccaagaaa agacagtcgg cgccgtcccc 1021 tacccgagtg gctgtggtgg cccccatgag ccgagccacc agtggctcca gtttgcctgt 1081 tggaatcaat aggcaggatt ttgatgttga ctgttacgca cagaggcgac tgtcaggagg 1141 cagccactca tatggtggag agtcgccccg cctctccccc tgcagcagca tagacaagct 1201 cagcaagtca gacgagcagc tgtcctctct ggacagggac agtgggcagt gctcccggaa 1261 cacaagctgt gaaacactag accactatga tcccgactat gaattcctcc agcaagacct 1321 ctctaacgca gaccagatac ctcagcagac ggcctggaac cttagcccgt tgccagagtc 1381 tttgggggag tctgggtctc catttcttgg ccctcctttc cagctgcctc ttggcggcca 1441 tccccagcca gacggacctc tggccccagg gcagcagaca gatacgccac ctgctctccc 1501 cgagaagaag cgcaggagcg cagcctccca gacggcggac ggctctggct gcagggtgtc 1561 ctacgagcgg catccctcgc agtatgacaa catctctggg gaggacctgc agagcacagc 1621 cccgatccca tccgtcccct acgcgccctt tgctgctatt ctgccctttc agcatggagg 1681 ttcctcagcc cctgtcgaat ttgtgggtga ttttactgct cctgagtcaa ccggtgaccc 1741 agaaaaacca cctcctctac cagagaagaa aaacaaacac atgctggcct acatgcagtt 1801 gctggaggac tactcggagc cgcagccctc tatgttctac cagacgccac agaacgagca 1861 catctaccag cagaagaaca agctcctcat ggaggtatac ggcttcagcg actccttcag 1921 tggggtggac tccgtgcagg agctggcccc gccgcccgcc ctacccccca agcagcggca 1981 gctggagcca ccggctggga aagacggaca tcccagagat ccctcagcgg tcagcggcgt 2041 ccctgggaag gacagcagag acggcagtga gagggcccca aagtcaccag atgctctgga 2101 gtcggctcag tcggaggagg aagtggacga gctgtccctc attgaccaca acgaaattat 2161 gtccaggctg acgctcaagc aggagggtga tgacgggccg gacgtccgcg gaggatctgg 2221 ggacatctta ctggtccatg ctactgagac tgacaggaaa gatttggtgt tgtactgcga 2281 ggcattcctg accacctaca ggaccttcat ctccccagag gagctcatca agaagctgca 2341 gtacagatat gagaaattct ctccctttgc cgacacattc aagaagcgcg tcagcaagaa 2401 cacgttcttc gtgctggtac gggtggtgga tgagctctgc ctggtggagt tgacagaaga 2461 gatcctgaag ctgctgatgg aactggtctt ccgcctggtg tgcaatgggg agctgagcct 2521 ggcccgtgtg ctccggaaga acatcctgga caaggtggac cagaagaagc tactcaggtg 2581 tgccacctcc agccagcccc tggcagcccg gggggtagca gccaggccgg ggaccttgca 2641 cgactttcac agccatgaga tagcggagca gctaacgctg ctggatgctg agctcttcta 2701 taaaatagag attcctgagg ttttgctttg ggcaaaagag cagaatgagg agaagagccc 2761 caacttgacc cagttcacgg agcacttcaa caacatgtcc tactgggtcc ggtccataat 2821 catgttacag gaaaaggccc aggacaggga acggctgctc ttgaagttca tcaagatcat 2881 gaagcacttg cggaagctga ataacttcaa ctcctacttg gccatcctct ctgccctgga 2941 ctcggcgccc atccgcaggc tggagtggca gaagcagact tcagagggcc tggccgagta 3001 ctgcacactg atcgacagct cgtcctcctt ccgagcctac cgggccgccc tctcggaggt 3061 ggaaccgccg tgcatcccgt acctggggct gatcctgcag gacctgacct tcgttcacct 3121 gggaaaccca gactacatcg acgggaaagt gaacttctcc aagcggtggc agcagttcaa 3181 catcctcgac agcatgcgct gcttccagca ggcgcactat gacatgcgga ggaacgacga 3241 cattataaac ttcttcaatg acttcagtga ccacctggct gaggaggccc tatgggaact 3301 gtctctgaaa attaaaccca ggaacataac aaggagaaaa acagaccggg aagagaagac 3361 ctaggagcag acgccgggat ccaggagaat gctcgagggg cgcagagggc agctcccaga 3421 ccggagagga ccttggacct gttaggcgca tggcaggagt cccggcctcg gagccatgag 3481 gctggccagc cctcagcggg gccgggcggg agctggagcc tgccagccgc ttcctgcctc 3541 cttcctctgt gggagcagac ccgtgggcct cagggcagcc agcaggcagg tcttgttgcc 3601 aatttacaaa ccggtggttt tctggtttgg ttttgttttc tgcttttact tccatctctc 3661 ccctcttgac cttccaccca ctcccctcca gggagagagc agcagagacc tcatcagcag 3721 accaaggaag tggtgggtgc tccccctccc taagctccag ggtccctgaa tcttctgaaa 3781 tctcaaatga gtggaggcct cctggggtgg cctgtcctgc aggggccctg gaatgggggc 3841 aagcagctgg gtgggcagaa tgcagagtag actcggggga ggatcctttc actttccgct 3901 tccccttctg atgcatggag gatggtgtga gcttttcagc aggcccggaa aggtacgcag 3961 gtgacgcctt agcagccccg cagctggtgc tctgccccgc ggtactggcg ccatcagggc 4021 ctcccttgcc cgcctgagag cagcagcagt ctctgtcatc ccgtcgcccc // LOCUS HUMC4BINDA 1157 bp mRNA PRI 24-AUG-1993 DEFINITION Human (clone A12) C4b-binding protein beta-chain mRNA, complete cds. ACCESSION L11244 NID g179682 KEYWORDS C4b-binding protein; beta-chain. SOURCE Homo sapiens (library: lambda gt11) liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1157) AUTHORS Hillarp,A., Pardo-Manuel,F., Ruiz,R., Rodriguez de Cordoba,S. and Dahlback,B. TITLE The human C4b-binding protein beta-chain gene JOURNAL J. Biol. Chem. 268, 15017-15023 (1993) MEDLINE 93315479 FEATURES Location/Qualifiers source 1..1157 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /tissue_lib="lambda gt11" mRNA 1..1157 sig_peptide 335..385 /note="putative" CDS 335..1093 /note="putative" /codon_start=1 /product="C4b-binding protein beta-chain" /db_xref="PID:g387660" /translation="MFFWCACCLMVAWRVSASDAEHCPELPPVDNSIFVAKEVEGQIL GTYVCIKGYHLVGKKTLFCNASKEWDNTTTECRLGHCPDPVLVNGEFSSSGPVNVSDK ITFMCNDHYILKGSNRSQCLEDHTWAPPFPICKSRDCDPPGNPVHGYFEGNNFTLGST ISYYCEDRYYLVGVQEQQCVDGEWSSALPVCKLIQEAPKPECEKALLAFQESKNLCEA MENFMQQLKESGMTMEELKYSLELKKAELKAKLL" repeat_region 386..565 /note="short consensus repeat (SCR) 1" mat_peptide 386..1090 /note="putative" /product="C4b-binding protein beta-chain" repeat_region 566..739 /note="short consensus repeat (SCR) 2" repeat_region 740..910 /note="short consensus repeat (SCR) 3" polyA_signal 1120..1125 polyA_signal 1132..1137 BASE COUNT 341 a 236 c 299 g 281 t ORIGIN 1 attctgtctt tcacatacat tgagaccaaa aagaccaagt acctataaga ggaccaaccc 61 agacgggctg tgacaattac gctgttgctt ctgagtgaga agttacaggc ccaagaaagg 121 gtaatgacag ccttagagat acataaaaga gacaagcaat ttccaaaaca aaaagcaaag 181 gcaaaaagaa aaataaaaaa gcaggccttt ggagctctca gctttggagt cagttaagac 241 cagttccttg ctgggaagcc ctaactctgg agggacagag acaggtgtct gagctgggtg 301 aattccagcc tggggagagg actttgatca ccagatgttt ttttggtgtg cgtgctgtct 361 tatggttgcg tggcgagttt ctgcttcaga tgcagagcac tgtccagagc ttcctccagt 421 ggacaatagc atatttgtcg caaaggaggt ggaaggacag attctgggga cttacgtttg 481 tatcaagggc taccacctgg taggaaagaa gacccttttt tgcaatgcct ctaaggagtg 541 ggataacacc actactgagt gccgcttggg ccactgtcct gatcctgtgc tggtgaatgg 601 agagttcagt tcttcagggc ctgtgaatgt aagtgacaaa atcacgttta tgtgcaatga 661 ccactacatc ctcaagggca gcaatcggag ccagtgtcta gaggaccaca cctgggcacc 721 tccctttccc atctgcaaaa gtagggactg tgaccctcct gggaatccag ttcatggcta 781 ttttgaagga aataacttca ccttaggatc caccattagt tattactgtg aagacaggta 841 ctacttagtg ggcgtgcagg agcagcaatg cgttgatggg gagtggagca gtgcacttcc 901 agtctgcaag ttgatccagg aagctcccaa accagagtgt gagaaggcac ttcttgcctt 961 tcaggagagt aagaacctct gcgaagccat ggagaacttt atgcaacaat taaaggaaag 1021 tggcatgaca atggaggagc taaaatattc tctggagctg aagaaagctg agttgaaggc 1081 aaaattgttg taacactaca gctgagcaga tgtaatagaa ataaacctat gaataaattt 1141 tcttcttggt tctgaaa // LOCUS HUMC5AAR 2328 bp mRNA PRI 06-MAR-1995 DEFINITION Human C5a anaphylatoxin receptor mRNA, complete cds. ACCESSION M62505 J05327 NID g179699 KEYWORDS C5a anaphylatoxin receptor. SOURCE Human peripheral blood promyelocytic leukemia cell line HL-60 (ATCC CCL 240), cDNA to mRNA, clone C5a-receptor. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2328) AUTHORS Boulay,F., Mery,L., Tardif,M., Brouchon,L. and Vignais,P. TITLE Expression cloning of a receptor for C5a anaphylatoxin on differentiated HL-60 cells JOURNAL Biochemistry 30 (12), 2993-2999 (1991) MEDLINE 91175748 FEATURES Location/Qualifiers source 1..2328 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="C5a-Receptor" /cell_line="HL-60" /cell_type="promyelocyte" /dev_stage="leukemia paptient" /haplotype="pseudodiploid" /tissue_type="peripheral blood" /tissue_lib="ATCC CCL 240" gene 25..2299 /gene="C5a anaphylatoxin receptor" CDS 25..1077 /gene="C5a anaphylatoxin receptor" /note="potential translated region; putative" /codon_start=1 /product="C5a anaphylatoxin receptor" /db_xref="PID:g179700" /translation="MNSFNYTTPDYGHYDDKDTLDLNTPVDKTSNTLRVPDILALVIF AVVFLVGVLGNALVVWVTAFEAKRTINAIWFLNLAVADFLSCLALPILFTSIVQHHHW PFGGAACSILPSLILLNMYASILLLATISADRFLLVFKPIWCQNFRGAGLAWIACAVA WGLALLLTIPSFLYRVVREEYFPPKVLCGVDYSHDKRRERAVAIVRLVLGFLWPLLTL TICYTFILLRTWSRRATRSTKTLKVVVAVVASFFIFWLPYQVTGIMMSFLEPSSPTFL LLNKLDSLCVSFAYINCCINPIIYVVAGQGFQGRLRKSLPSLLRNVLTEESVVRESKS FTRSTVDTMAQKTQAV" polyA_signal 2294..2299 /gene="C5a anaphylatoxin receptor" BASE COUNT 560 a 645 c 540 g 583 t ORIGIN 1 agggggagcc caggagacca gaacatgaac tccttcaatt ataccacccc tgattatggg 61 cactatgatg acaaggatac cctggacctc aacacccctg tggataaaac ttctaacacg 121 ctgcgtgttc cagacatcct ggccttggtc atctttgcag tcgtcttcct ggtgggagtg 181 ctgggcaatg ccctggtggt ctgggtgacg gcattcgagg ccaagcggac catcaatgcc 241 atctggttcc tcaacttggc ggtagccgac ttcctctcct gcctggcgct gcccatcttg 301 ttcacgtcca ttgtacagca tcaccactgg ccctttggcg gggccgcctg cagcatcctg 361 ccctccctca tcctgctcaa catgtacgcc agcatcctgc tcctggccac catcagcgcc 421 gaccgctttc tgctggtgtt taaacccatc tggtgccaga acttccgagg ggccggcttg 481 gcctggatcg cctgtgccgt ggcttggggt ttagccctgc tgctgaccat accctccttc 541 ctgtaccggg tggtccggga ggagtacttt ccaccaaagg tgttgtgtgg cgtggactac 601 agccacgaca aacggcggga gcgagccgtg gccatcgtcc ggctggtcct gggcttcctg 661 tggcctctac tcacgctcac gatttgttac actttcatcc tgctccggac gtggagccgc 721 agggccacgc ggtccaccaa gacactcaag gtggtggtgg cagtggtggc cagtttcttt 781 atcttctggt tgccctacca ggtgacgggg ataatgatgt ccttcctgga gccatcgtca 841 cccaccttcc tgctgctgaa taagctggac tccctgtgtg tctcctttgc ctacatcaac 901 tgctgcatca accccatcat ctacgtggtg gccggccagg gcttccaggg ccgactgcgg 961 aaatccctcc ccagcctcct ccggaacgtg ttgactgaag agtccgtggt tagggagagc 1021 aagtcattca cgcgctccac agtggacact atggcccaga agacccaggc agtgtaggcg 1081 acagcctcat gggccactgt ggcccgatgt ccccttcctt cccggccatt ctccctcttg 1141 ttttcacttc acttttcgtg ggatggtgtt accttagcta actaactctc ctccatgttg 1201 cctgtctttc ccagacttgt ccctcctttt ccagcgggac tcttctcatc cttcctcatt 1261 tgcaaggtga acacttcctt ctagggagca ccctcccacc ccccaccccc ccccacacac 1321 catctttcca tcccaggctt ttgaaaaaca aacagaaacc cgtgtatctg ggatatttcc 1381 atatggcaat aggtgtgaac agggaactca gaatacagac aagtagaaag attctcgctt 1441 aaaaaaatgt atttatttta tggcaagttg gaaaatatgt aactggaatc tcaaaagttc 1501 tttgggacaa aacagaagtc catggagtta tctaagctct tgtaagtgag ttaatttaaa 1561 aaagaaaatt aggctgagag cagtggctca cgcctgtaat cccagaactt tgggaggcta 1621 aggtgggtgg atcacctgag gtcaagagtt ccagaccagg ctggccagca tggtgaaacc 1681 ccgtctgtac taaaaataca aaaaattaac tgggcatggt agtgggtgcc tgtaatccca 1741 gctacttggg aggctgaggt gggagaattg ctcgaacctt ggaggtggag gttgtggtga 1801 gccatgatcg caccactgca ctctagcctg ggtgaccgag ggaggctctg tctcaaaagc 1861 aaagcaaaaa caaaaacaaa aacacctaaa aaacctgcag ttttgtttgt actttgtttt 1921 taaattatgc tttctatttt gagatcattg caaactcaac acaattgtaa gtaatgatac 1981 agagggatct tgtgtaccct tcacccagcc tcccccaatg gcaacatctt gcaaaactac 2041 aatgtagtct cataaccagg atattgacat tgatacagtg aagatacagg acattctcat 2101 caccacaggg atccccagga tgcccacttc cctccacccc cacaccccag ccgtgtccct 2161 aacccctggc aaccaggaat ccactctcca tttctataat gttgtcattt caagaatgtt 2221 attcaatgga atcatatagt atgtaacctg ttttgagctt aaaaaaaaaa gtatacatga 2281 ctttaatgag gaaaataaaa atgaatattg aaaaaaaaaa ctttagag // LOCUS HUMC6A 3303 bp mRNA PRI 31-OCT-1994 DEFINITION Human complement component C6 mRNA, complete cds. ACCESSION J05064 NID g179703 KEYWORDS complement component C6. SOURCE Human (adult) liver cDNA to mRNA, clones 11 and 1A. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3303) AUTHORS Haefliger,J.A., Tschopp,J., Vial,N. and Jenne,D.E. TITLE Complete primary structure and functional characterization of the sixth component of the human complement system. Identification of the C5b-binding domain in complement C6 JOURNAL J. Biol. Chem. 264 (30), 18041-18051 (1989) MEDLINE 90036879 COMMENT Draft entry and computer readable copy of sequence [1] kindly provided by D.E.Jenne, 25-JUL-1989. Authors found an 891-bp intron in clone 11 occurring between base pairs 1839 and 1840. The sequence of the intron is given in GenBank entry J05063. FEATURES Location/Qualifiers source 1..3303 /organism="Homo sapiens" /db_xref="taxon:9606" /map="5p14-p12" gene 156..2960 /gene="C6" CDS 156..2960 /gene="C6" /note="complement component C6 precursor peptide" /codon_start=1 /db_xref="GDB:G00-119-045" /db_xref="PID:g179704" /translation="MARRSVLYFILLNALINKGQACFCDHYAWTQWTSCSKTCNSGTQ SRHRQIVVDKYYQENFCEQICSKQETRECNWQRCPINCLLGDFGPWSDCDPCIEKQSK VRSVLRPSQFGGQPCTAPLVAFQPCIPSKLCKIEEADCKNKFRCDSGRCIARKLECNG ENDCGDNSDERDCGRTKAVCTRKYNPIPSVQLMGNGFHFLAGEPRGEVLDNSFTGGIC KTVKSSRTSNPYRVPANLENVGFEVQTAEDDLKTDFYKDLTSLGHNENQQGSFSSQGG SSFSVPIFYSSKRSENINHNSAFKQAIQASHKKDSSFIRIHKVMKVLNFTTKAKDLHL SDVFLKALNHLPLEYNSALYSRIFDDFGTHYFTSGSLGGVYDLLYQFSSEELKNSGLT EEEAKHCVRIETKKRVLFAKKTKVEHRCTTNKLSEKHEGSFIQGAEKSISLIRGGRSE YGAALAWEKGSSGLEEKTFSEWLESVKENPAVIDFELAPIVDLVRNIPCAVTKRNNLR KALQEYAAKFDPCQCAPCPNNGRPTLSGTECLCVCQSGTYGENCEKQSPDYKSNAVDG QWGCWSSWSTCDATYKRSRTRECNNPAPQRGGKRCEGEKRQEEDCTFSIMENNGQPCI NDDEEMKEVDLPEIEADSGCPQPVPPENGFIRNEKQLYLVGEDVEISCLTGFETVGYQ YFRCLPDGTWRQGDVECQRTECIKPVVQEVLTITPFQRLYRIGESIELTCPKGFVVAG PSRYTCQGNSWTPPISNSLTCEKDTLTKLKGHCQLGQKQSGSECICMSPEEDCSHHSE DLCVFDTDSNDYFTSPACKFLAEKCLNNQQLHFLHIGSCQDGRQLEWGLERTRLSSNS TKKESCGYDTCYDWEKCSASTSKCVCLLPPQCFKGGNQLYCVKMGSSTSEKTLNICEV GTIRCANRKMEILHPGKCLA" sig_peptide 156..218 /gene="C6" /note="complement component C6 signal peptide" mat_peptide 219..2960 /gene="C6" /note="complement component C6 mature peptide" misc_feature 2049..2051 /gene="C6" /note="V8 protease cleavage site" BASE COUNT 1010 a 711 c 745 g 837 t ORIGIN 1 ttgccttgtg ttagctagca ataagaaaag aagctttgtt tggattaaca tatataccct 61 cttcattctg catacctatt ttttccccaa taatttgcag cttaggtccg aggacaccac 121 aaactctgct taaagggcct ggaggctctc aaggcatggc cagacgctct gtcttgtact 181 tcatcctgct gaatgctctg atcaacaagg gccaagcctg cttctgtgat cactatgcat 241 ggactcagtg gaccagctgc tcaaaaactt gcaattctgg aacccagagc agacacagac 301 aaatagtagt agataagtac taccaggaaa acttttgtga acagatttgc agcaagcagg 361 agactagaga atgtaactgg caaagatgcc ccatcaactg cctcctggga gattttggac 421 catggtcaga ctgtgaccct tgtattgaaa aacagtctaa agttagatct gtcttgcgtc 481 ccagtcagtt tgggggacag ccatgcactg cgcctctggt agcctttcaa ccatgcattc 541 catctaagct ctgcaaaatt gaagaggctg actgcaagaa taaatttcgc tgtgacagtg 601 gccgctgcat tgccagaaag ttagaatgca atggagaaaa tgactgtgga gacaattcag 661 atgaaaggga ctgtgggagg acaaaggcag tatgcacacg gaagtataat cccatcccta 721 gtgtacagtt gatgggcaat gggtttcatt ttctggcagg agagcccaga ggagaagtcc 781 ttgataactc tttcactgga ggaatatgta aaactgtcaa aagcagtagg acaagtaatc 841 cataccgtgt tccggccaat ctggaaaatg tcggctttga ggtacaaact gcagaagatg 901 acttgaaaac agatttctac aaggatttaa cttctcttgg acacaatgaa aatcaacaag 961 gctcattctc aagtcagggg gggagctctt tcagtgtacc aattttttat tcctcaaaga 1021 gaagtgaaaa tatcaaccat aattctgcct tcaaacaagc cattcaagcc tctcacaaaa 1081 aggattctag ttttattagg atccataaag tgatgaaagt cttaaacttc acaacgaaag 1141 ctaaagatct gcacctttct gatgtctttt tgaaagcact taaccatctg cctctagaat 1201 acaactctgc tttgtacagc cgaatattcg atgactttgg gactcattac ttcacctctg 1261 gctccctggg aggcgtgtat gaccttctct atcagtttag cagtgaggaa ctaaagaact 1321 caggtttaac cgaggaagaa gccaaacact gtgtcaggat tgaaacaaag aaacgcgttt 1381 tatttgctaa gaaaacaaaa gtggaacata ggtgcaccac caacaagctg tcagagaaac 1441 atgaaggttc atttatacag ggagcagaga aatccatatc cctgattcga ggtggaagga 1501 gtgaatatgg agcagctttg gcatgggaga aagggagctc tggtctggag gagaagacat 1561 tttctgagtg gttagaatca gtgaaggaaa atcctgctgt gattgacttt gagcttgccc 1621 ccatcgtgga cttggtaaga aacatcccct gtgcagtgac aaaacggaac aacctcagga 1681 aagctttgca agagtatgca gccaagttcg atccttgcca gtgtgctcca tgccctaata 1741 atggccgacc caccctctca gggactgaat gtctgtgtgt gtgtcagagt ggcacctatg 1801 gtgagaactg tgagaaacag tctccagatt ataaatccaa tgcagtagac ggacagtggg 1861 gttgttggtc ttcctggagt acctgtgatg ctacttataa gagatcgaga acccgagaat 1921 gcaataatcc tgccccccaa cgaggaggga aacgctgtga gggggagaag cgacaagagg 1981 aagactgcac attttcaatc atggaaaaca atggacaacc atgtatcaat gatgatgaag 2041 aaatgaaaga ggtcgatctt cctgagatag aagcagattc cgggtgtcct cagccagttc 2101 ctccagaaaa tggatttatc cggaatgaaa agcaactata cttggttgga gaagatgttg 2161 aaatttcatg ccttactggc tttgaaactg ttggatacca gtacttcaga tgcttaccag 2221 acgggacctg gagacaaggg gatgtggaat gccaacggac ggagtgcatc aagccagttg 2281 tgcaggaagt cctgacaatt acaccatttc agagattgta tagaattggt gaatccattg 2341 agctaacttg ccccaaaggc tttgttgttg ctgggccatc aaggtacaca tgccagggga 2401 attcctggac accacccatt tcaaactctc tcacctgtga aaaagatact ctaacaaaat 2461 taaaaggcca ttgtcagctg ggacagaaac aatcaggatc tgaatgcatt tgtatgtctc 2521 cagaagaaga ctgtagccat cattcagaag atctctgtgt gtttgacaca gactccaacg 2581 attactttac ttcacccgct tgtaagtttt tggctgagaa atgtttaaat aatcagcaac 2641 tccattttct acatattggt tcctgccaag acggccgcca gttagaatgg ggtcttgaaa 2701 ggacaagact ttcatccaac agcacaaaga aagaatcctg tggctatgac acctgctatg 2761 actgggaaaa atgttcagcc tccacttcca aatgtgtctg cctattgccc ccacagtgct 2821 tcaagggtgg aaaccaactc tactgtgtca aaatgggatc atcaacaagt gagaaaacat 2881 tgaacatctg tgaagtggga actataagat gtgcaaacag gaagatggaa atactgcatc 2941 ctggaaagtg tttggcctag cacaattact gctaggccca gcacaatgaa cagatttacc 3001 atcccgaaga accaactcct acaaatgaga attcttgcac aaacagcaga ctggcatgct 3061 caaagttact gacaaaaatt attttctgtt agtttgagat cattattctc ccctgactct 3121 cctgtttggg catgtcttat tcagttccag ctcatgacgc cctgtagcat acccctaggt 3181 accaacttcc acagcagtct cgtaaattct cctgttcaca ttgtacaaaa ataatgtgac 3241 ttctgaggcc cttatgtagc ctgtgacatt aagcattctc gcaattagaa ataagaataa 3301 aac // LOCUS HUMC7A 3890 bp mRNA PRI 31-OCT-1994 DEFINITION Human complement protein component C7 mRNA, complete cds. ACCESSION J03507 NID g179715 KEYWORDS complement component C2; complement protein C7. SOURCE Human liver, cDNA to mRNA, clone lambda-GT11. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3890) AUTHORS DiScipio,R.G., Chakravarti,D.N., Muller-Eberhard,H.J. and Fey,G.H. TITLE The structure of human complement component C7 and the C5b-7 complex JOURNAL J. Biol. Chem. 263 (1), 549-560 (1988) MEDLINE 88087145 COMMENT Draft entry and computer-readable copy of sequence [1] kindly provided by R.G.DiScipio 06-JAN-1988. FEATURES Location/Qualifiers source 1..3890 /organism="Homo sapiens" /db_xref="taxon:9606" /map="5p14-p12" sig_peptide 1..66 /gene="C7" /note="complement protein C7 signal peptide" CDS 1..2532 /gene="C7" /note="complement protein C7 precursor" /codon_start=1 /db_xref="GDB:G00-119-046" /db_xref="PID:g179716" /translation="MKVISLFILVGFIGEFQSFSSASSPVNCQWDFYAPWSECNGCTK TQTRRRSVAVYGQYGGQPCVGNAFETQSCEPTRGCPTEEGCGERFRCFSGQCISKSLV CNGDSDCDEDSADEDRCEDSERRPSCDIDKPPPNIELTGNGYNELTGQFRNRVINTKS FGGQCRKVFSGDGKDFYRLSGNVLSYTFQVKINNDFNYEFYNSTWSYVKHTSTEHTSS SRKRSFFRSSSSSSRSYTSHTNEIHKGKSYQLLVVENTVEVAQFINNNPEFLQLAEPF WKELSHLPSLYDYSAYRRLIDQYGTHYLQSGSLGGEYRVLFYVDSEKLKQNDFNSVEE KKCKSSGWHFVVKFSSHGCKELENALKAASGTQNNVLRGEPFIRGGGAGFISGLSYLE LDNPAGNKRRYSAWAESVTNLPQVIKQKLTPLYELVKEVPCASVKKLYLKWALEEYLD EFDPCHCRPCQNGGLATVEGTHCLCHCKPYTFGAACEQGVLVGNQAGGVDGGWSCWSS WSPCVQGKKTRSRECNNPPPSGGGRSCVGETTESTQCEDEELEHLRLLEPHCFPLSLV PTEFCPSPPALKDGFVQDEGPMFPVGKNVVYTCNEGYSLIGNPVARCGEDLRWLVGEM HCQKIACVLPVLMDGIQSHPQKPFYTVGEKVTVSCSGGMSLEGPSAFLCGSSLKWSPE MKNARCVQKENPLTQAVPKCQRWEKLQNSRCVCKMPYECGPSLDVCAQDERSKRILPL TVCKMHVLHCQGRNYTLTGRDSCTLPASAEKACGACPLWGKCDAESSKCVCREASECE EEGFSICVEVNGKEQTMSECEAGALRCRGQSISVTSIRPCAAETQ" gene 1..2532 /gene="C7" mat_peptide 67..2529 /gene="C7" /note="complement protein C7" BASE COUNT 1073 a 799 c 931 g 1087 t ORIGIN 1707 bp upstream of EcoRI site; Linkage group 2. 1 atgaaggtga taagcttatt cattttggtg ggatttatag gagagttcca aagtttttca 61 agtgcctcct ctccagtcaa ctgccagtgg gacttctatg ccccttggtc agaatgcaat 121 ggctgtacca agactcagac tcgcaggcgg tcagttgctg tgtatgggca gtatggaggc 181 cagccttgtg ttggaaatgc ttttgaaaca cagtcctgtg aacctacaag aggatgtcca 241 acagaggagg gatgtggaga gcgtttcagg tgcttttcag gtcagtgcat cagcaaatca 301 ttggtttgca atggggattc tgactgtgat gaagacagtg ctgatgaaga cagatgtgag 361 gactcagaaa ggagaccttc ctgtgatatc gataaacctc ctcctaacat agaacttact 421 ggaaatggtt acaatgaact cactggccag tttaggaaca gagtcatcaa taccaaaagt 481 tttggtggtc aatgtagaaa ggtgtttagt ggggatggaa aagatttcta caggctgagt 541 ggaaatgtcc tgtcctatac attccaggtg aaaataaata atgattttaa ttatgaattt 601 tacaatagta cttggtctta tgtaaaacat acgtcgacag aacacacatc atctagtcgg 661 aagcgctcct tttttagatc ttcatcatct tcttcacgca gttatacttc acataccaat 721 gaaatccata aaggaaagag ttaccaactg ctggttgttg agaacactgt tgaagtggct 781 cagttcatta ataacaatcc agaattttta caacttgctg agccattctg gaaggagctt 841 tcccacctcc cctctctgta tgactacagt gcctaccgaa gattaatcga ccagtacggg 901 acacattatc tgcaatctgg gtcgttagga ggagaataca gagttctatt ttatgtggac 961 tcagaaaaat taaaacaaaa tgattttaat tcagtcgaag aaaagaaatg taaatcctca 1021 ggttggcatt ttgtcgttaa attttcaagt catggatgca aggaactgga aaacgcttta 1081 aaagctgctt caggaaccca gaacaatgta ttgcgaggag aaccgttcat cagaggggga 1141 ggtgcaggct tcatatctgg ccttagttac ctagagctgg acaatcctgc tggaaacaaa 1201 aggcgatatt ctgcctgggc agaatctgtg actaatcttc ctcaagtcat aaaacaaaag 1261 ctgacacctt tatatgagct ggtaaaggaa gtaccttgtg cctctgtgaa aaaactatac 1321 ctgaaatggg ctcttgaaga gtatctggat gaatttgacc cctgtcattg ccggccttgt 1381 caaaatggtg gtttggctac tgttgagggg acccattgtc tgtgccattg caaaccgtac 1441 acatttggtg cggcgtgtga gcaaggagtc ctcgtaggga atcaagcagg aggggttgat 1501 ggaggttgga gttgctggtc ctcttggagc ccctgtgtcc aagggaagaa aacaagaagc 1561 cgtgaatgca ataacccacc tcccagtggg ggtgggagat cctgcgttgg agaaacgaca 1621 gaaagcacac aatgcgaaga tgaggagctg gagcacttga ggttgcttga accacattgc 1681 tttcctttgt ctttggttcc aacagaattc tgtccatcac ctcctgcctt gaaagatgga 1741 tttgttcaag atgaaggtcc aatgtttcct gtggggaaaa atgtagtgta cacttgcaat 1801 gaaggatact ctcttattgg aaacccagtg gccagatgtg gagaagattt acggtggctt 1861 gttggggaaa tgcattgtca gaaaattgcc tgtgttctac ctgtactgat ggatggcata 1921 cagagtcacc cccaaaaacc tttctacaca gttggtgaga aggtgactgt ttcctgttca 1981 ggtggcatgt ccttagaagg tccttcagca tttctctgtg gctccagcct taagtggagt 2041 cctgagatga agaatgcccg ctgtgtacaa aaagaaaatc cgttaacaca ggcagtgcct 2101 aaatgtcagc gctgggagaa actgcagaat tcaagatgtg tttgtaaaat gccctacgaa 2161 tgtggacctt ccttggatgt atgtgctcaa gatgagagaa gcaaaaggat actgcctctg 2221 acagtttgca agatgcatgt tctccactgt cagggtagaa attacaccct tactggtagg 2281 gacagctgta ctctgcctgc ctcagctgag aaagcttgtg gtgcctgccc actgtgggga 2341 aaatgtgatg ctgagagcag caaatgtgtc tgccgagaag catcggagtg cgaggaagaa 2401 gggtttagca tttgtgtgga agtgaacggc aaggagcaga cgatgtctga gtgtgaggcg 2461 ggcgctctga gatgcagagg gcagagcatc tctgtcacca gcataaggcc ttgtgctgcg 2521 gaaacccagt aggctcctgg aggccatggt cagcttgctt ggaatccagc aggcagctgg 2581 ggctgagtga aaacatctgc acaactgggc actggacagc ttttccttct tctccagtgt 2641 ctaccttcct cctcaactcc cagccatctg tataaacaca atcctttgtt ctcccaaatc 2701 tgaatcgaat tactcttttg cctccttttt aatgtcagta aggatatgag cctttgcaca 2761 ggctggctgc gtgttcttga aataggtgtt accttctctg ggccttggtt ttttaaaatc 2821 tgtaaaatta gaggattgca ctagagaaac ttgaatgctc cattcaggcc tatcatttta 2881 ttaagtatga ttgacacagc ccatgggcca gaacacactc tacaaaatga ctaggataac 2941 agaaagaacg tgatctcctg attagagagg gtggttttcc tcaatggaac caaatataaa 3001 gaggacttga acaaaaatga cagatacaaa ctatttctat cctgagtagt aatctcacac 3061 ttcatcctat agagtcaacc accacagata ggaattcctt attctttttt taattttttt 3121 aagacagagt ctcactttgt tgcccaggct ggagcgcagt ggggtgatct catctccctg 3181 caacctccgc ctcctgggtt gaagcgattc ttgtgcctca gcttcccaag cagctgggat 3241 tacaggtgcc cgccaccacg cccagctaat ttttgcattt ttagtagaga tgggtttcac 3301 catgttggcc atgctcgtct ccaactcctg acctcaggta atccgtctgc cttggcctcc 3361 caaatgctgg gattacagac atgaaccacc acgcctggct ggaatactta ctcttgtcgg 3421 gagattgaac cactaaaatg ttagagcaga attcattatg ctgtggtcac aggggtgtct 3481 tgtctgagaa caaatacaat tcagtcttct ctttggggtt ttagtatgtg tcaaacatag 3541 gactggaagt ttgcccctgt tcttttttct tttgaaagaa catcagttca tgcctgaggc 3601 atgagtgact gtgcatttga gatagttttc cctattctgt ggatacagtc ccagagtttt 3661 cagggagtac acaggtagat tagtttgaag cattgacctt ttatttattc cttatttctc 3721 tttcatcaaa acaaaacagc agctgtggga ggagaaatga gagggcttaa atgaaattta 3781 aaataagcta tattatacaa atactatctc tgtattgttc tgaccctggt aaatatattt 3841 caaaacttca gatgacaagg attagaacac tcattaagat gctattcttc // LOCUS HUMC8AS 2397 bp mRNA PRI 07-NOV-1994 DEFINITION Human complement protein C8 alpha subunit mRNA, complete cds. ACCESSION M16974 NID g179717 KEYWORDS complement C8; complement C8-alpha; serum protein. SOURCE Human cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2397) AUTHORS Rao,A.G., Howard,O.M., Ng,S.C., Whitehead,A.S., Colten,H.R. and Sodetz,J.M. TITLE Complementary DNA and derived amino acid sequence of the alpha subunit of human complement protein C8: evidence for the existence of a separate alpha subunit messenger RNA JOURNAL Biochemistry 26 (12), 3556-3564 (1987) MEDLINE 88000560 REFERENCE 2 (bases 1 to 2394) AUTHORS Sodetz,J.M. TITLE Direct Submission JOURNAL Submitted (20-JUN-1988) J.M. Sodetz, Department of Chemistry and Biochemistry, University of South Columbia, SC 29208, USA REFERENCE 3 (bases 1 to 2394) AUTHORS Sodetz,J.M. TITLE Direct Submission JOURNAL Submitted (02-FEB-1993) J.M. Sodetz, Department of Chemistry and Biochemistry, University of South Columbia, SC 29208, USA COMMENT [1] [2] are revised by [3]. FEATURES Location/Qualifiers source 1..2397 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="A1 and B1" /tissue_type="liver" /map="360 bp upstream of PstI site; chromosome 1p36.2-p22.1" /map="1p32" mRNA 1..2397 /note="C8 mRNA" gene 138..1892 /gene="C8A" sig_peptide 138..227 /gene="C8A" /note="complement protein C8 alpha subunit signal peptide" CDS 138..1892 /gene="C8A" /note="complement protein C8 alpha subunit precursor" /codon_start=1 /db_xref="GDB:G00-119-735" /db_xref="PID:g179718" /translation="MFAVVFFILSLMTCQPGVTAQEKVNQRVRRAATPAAVTCQLSNW SEWTDCFPCQDKKYRHRSLLQPNKFGGTICSGDIWDQASCSSSTTCVRQAQCGQDFQC KETGRCLKRHLVCNGDQDCLDGSDEDDCEDVRAIDEDCSQYEPIPGSQKAALGYNILT QEDAQSVYDASYYGGQCETVYNGEWRELRYDSTCERLYYGDDEKYFRKPYNFLKYHFE ALADTGISSEFYDNANDLLSKVKKDKSDSFGVTIGIGPAGSPLLVGVGVSHSQDTSFL NELNKYNEKKFIFTRIFTKVQTAHFKMRKDDIMLDEGMLQSLMELPDQYNYGMYAKFI NDYGTHYITSGSMGGIYEYILVIDKAKMESLGITSRDITTCFGGSLGIQYEDKINVGG GLSGDHCKKFGGGKTERARKAMAVEDIISRVRGGSSGWSGGLAQNRSTITYRSWGRSL KYNPVVIDFEMQPIHEVLRHTSLGPLEAKRQNLRRALDQYLMEFNACRCGPCFNNGVP ILEGTSCRCQCRLGSLGAACEQTQTEGAKADGSWSCWSSWSVCRAGIQERRRECDNPA PQNGGASCPGRKVQTQAC" mat_peptide 228..1889 /gene="C8A" /note="complement protein C8 alpha subunit" BASE COUNT 684 a 504 c 605 g 604 t ORIGIN 1 tttttttttt catcctactt tgttttattg ggcgttgatt gttacaggtc ccagcctgta 61 gacatctttt actccaattt cctgaataga tagctttatt ccttcaaggt aatatagtgc 121 ggtggcttct ggctgagatg tttgctgttg ttttcttcat cttgtctttg atgacttgtc 181 agcctggggt aactgcacag gagaaggtga accagagagt aagacgggca gctacacccg 241 cagcagttac ctgccagctg agcaactggt cagagtggac agattgcttt ccgtgccagg 301 acaaaaagta ccgacaccgg agcctcttgc agccaaacaa gtttggggga accatctgca 361 gtggtgacat ctgggatcaa gccagctgct ccagttctac aacttgtgta aggcaagcac 421 agtgtggaca ggatttccag tgtaaggaga caggtcgctg cctgaaacgc caccttgtgt 481 gtaatggaga ccaggactgc cttgatggct ctgatgagga cgactgtgaa gatgtcaggg 541 ccattgacga agactgcagc cagtatgaac caattccagg atcacagaag gcagccttgg 601 ggtacaatat cctgacccag gaagatgctc agagtgtgta cgatgccagt tattatgggg 661 gccagtgtga gacggtatac aatggggaat ggagggagct tcgatatgac tccacctgtg 721 aacgtctcta ctatggagat gatgagaaat actttcggaa accctacaac tttctgaagt 781 accactttga agccctggca gatactggaa tctcctcaga gttttatgat aatgcaaatg 841 accttctttc caaagttaaa aaagacaagt ctgactcatt tggagtgacc atcggcatag 901 gcccagccgg cagcccttta ttggtgggtg taggtgtatc ccactcacaa gacacttcat 961 tcttgaacga attaaacaag tataatgaga agaaattcat tttcacaaga atcttcacaa 1021 aggtgcagac tgcacatttt aagatgagga aggatgacat tatgctggat gaaggaatgc 1081 tgcagtcatt aatggagctt ccagatcagt acaattatgg catgtatgcc aagttcatca 1141 atgactatgg cacccattac atcacatctg gatccatggg tggcatttat gaatatatcc 1201 tggtgattga caaagcaaaa atggaatccc ttggtattac cagcagagat atcacgacat 1261 gttttggagg ctccttgggc attcaatatg aagacaaaat aaatgttggt ggaggtttat 1321 caggagacca ttgtaaaaaa tttggaggtg gcaaaactga aagggccagg aaggccatgg 1381 ctgtggaaga cattatttct cgggtgcgag gtggcagttc tggctggagc ggtggcttgg 1441 cacagaacag gagcaccatt acataccgtt cctgggggag gtcattaaag tataatcctg 1501 ttgttatcga ttttgagatg cagcctatcc acgaggtgct gcggcacaca agcctggggc 1561 ctctggaggc caagcgccag aacctgcgcc gcgccttgga ccagtatctg atggaattca 1621 atgcctgccg atgtgggcct tgcttcaaca atggggtgcc catcctcgag ggcaccagct 1681 gcaggtgcca gtgccgcctg ggtagcttgg gtgctgcctg tgagcaaaca cagacagaag 1741 gagccaaagc agatgggagc tggagttgct ggagctcctg gtctgtatgc agagcaggca 1801 tccaggaaag gagaagagag tgtgacaatc cagcacctca gaatggaggg gcctcgtgtc 1861 cagggcggaa agtacagacg caggcttgct gagggcctct ggacacaggc tggaccagat 1921 gctgtggatg tcgacccctg cactgactat tggataaaga cttctttcaa ctaagagaag 1981 atgcaaatca gcacactttt ttctttgttc tgccagcttc caggcctaag actaggtttt 2041 gctgtctaca gccaactatt ctattagtta caaaactcaa tcattttatt cagcaactgg 2101 atgttgactg ttaactagaa gctctgtcct acttacagca ctttggatca tcaaaaaaat 2161 aaagtaaaat agaaaactga gaaaactcaa tccatgacca gggagaactt acaggatgtt 2221 agagacaaaa caagcagaca cctgaaacaa tcaacgccca ataaaacaaa gtaggatgaa 2281 aattctctta gttctttgat aacaatttgt tcactcatag aaacattatt aattggtagg 2341 gtaagcagac actctgaaac aatgagaaaa atactaaaaa ttgacttgag ttatttc // LOCUS HUMC8BS 1995 bp mRNA PRI 31-OCT-1994 DEFINITION Human complement protein C8 beta subunit mRNA, complete cds. ACCESSION M16973 NID g179719 KEYWORDS complement C8; complement C8 beta; serum protein. SOURCE Human liver, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 28 to 1995) AUTHORS Howard,O.M., Rao,A.G. and Sodetz,J.M. TITLE Complementary DNA and derived amino acid sequence of the beta subunit of human complement protein C8: identification of a close structural and ancestral relationship to the alpha subunit and C9 JOURNAL Biochemistry 26 (12), 3565-3570 (1987) MEDLINE 88000561 REFERENCE 2 (bases 1 to 1995) AUTHORS Sodetz,J.M. JOURNAL Unpublished (1988) COMMENT [2] revises [1]. Draft entry and computer-readable sequence for [2] kindly provided by J.M.Sodetz, 14-JUN-1988. FEATURES Location/Qualifiers source 1..1995 /organism="Homo sapiens" /db_xref="taxon:9606" /map="1p32" mRNA <1..1995 /note="C8 mRNA" sig_peptide 28..189 /gene="C8B" /note="complement protein C8 beta subunit signal peptide" gene 28..1803 /gene="C8B" CDS 28..1803 /gene="C8B" /note="complement protein C8 beta subunit precursor" /codon_start=1 /db_xref="GDB:G00-119-736" /db_xref="PID:g179720" /translation="MKNSRTWAWRAPVELFLLCAALGCLSLPGSRGERPHSFGSNAVN KSFAKSRQMRSVDVTLMPIDCELSSWSSWTTCDPCQKKRYRYAYLLQPSQFHGEPCNF SDKEVEDCVTNRPCGSQVRCEGFVCAQTGRCVNRRLLCNGDNDCGDQSDEANCRRIYK KCQHEMDQYWGIGSLASGINLFTNSFEGPVLDHRYYAGGCSPHYILNTRFRKPYNVES YTPQTQGKYEFILKEYESYSDFERNVTEKMASKSGFSFGFKIPGIFELGISSQSDRGK HYIRRTKRFSHTKSVFLHARSDLEVAHYKLKPRSLMLHYEFLQRVKRLPLEYSYGEYR DLFRDFGTHYITEAVLGGIYEYTLVMNKEAMERGDYTLNNVHACAKNDFKIGGAIEEV YVSLGVSVGKCRGILNEIKDRNKRDTMVEDLVVLVRGGASEHITTLAYQELPTADLMQ EWGDAVQYNPAIIKVKVEPLYELVTATDFAYSSTVRQNMKQALEEFQKEVSSCHCAPC QGNGVPVLKGSRCDCICPVGSQGLACEVSYRKNTPIDGKWNCWSNWSSCSGRRKTRQR QCNNPPPQNGGSPCSGPASETLDCS" mat_peptide 190..1800 /gene="C8B" /note="complement protein C8" BASE COUNT 567 a 441 c 516 g 471 t ORIGIN 33 bp upstream of EcoRI site; chromosome 1p36.2-p22.1. 1 ctgtggcatc tcctgtcaca ttgggaaatg aagaattcca ggacatgggc ttggagggcg 61 ccggtggagc tatttcttct ctgtgctgcc ctgggctgtc tcagtttgcc tggctccaga 121 ggtgaaaggc cacattcctt tgggtcaaat gcagtcaaca agagctttgc taagagcaga 181 cagatgcgga gtgtggatgt taccctgatg cccattgatt gtgagctgtc tagttggtcc 241 tcttggacca catgtgaccc ctgtcagaag aaaaggtaca ggtatgccta cttgctccag 301 ccctctcagt tccatgggga accgtgcaac ttctctgaca aggaagtcga agactgtgtt 361 accaacagac catgcggaag tcaagtgcga tgtgaaggct ttgtgtgtgc acagacagga 421 aggtgtgtaa accgcagact tctttgcaat ggggacaatg actgtggaga ccagtcagat 481 gaagcaaact gtagaaggat ttataaaaaa tgtcagcatg aaatggacca atactgggga 541 attggcagtc tggccagtgg gataaatttg ttcacaaaca gttttgaggg cccagttctt 601 gatcacaggt attatgcagg tggatgctcc ccgcattaca tcctgaacac gaggtttagg 661 aagccctaca atgtggaaag ctacacgcca cagacccaag gcaaatacga attcatatta 721 aaagagtatg aatcatactc agattttgaa cgcaatgtca cagagaaaat ggcaagcaag 781 tctggtttca gttttggttt taaaatacct ggaatatttg aacttggcat cagtagtcaa 841 agtgatcgag gcaaacacta tattaggaga accaaacgat tctctcatac taaaagcgta 901 tttctgcatg cacgctctga ccttgaagta gcacattaca agctgaaacc cagaagcctc 961 atgctccatt acgagttcct tcagagagtt aagcggctgc ccctggagta cagctacggg 1021 gaatacagag atctcttccg tgattttggg acccactaca tcacagaggc tgtgcttggg 1081 ggcatttatg aatacaccct cgttatgaac aaagaggcca tggagagagg agattatact 1141 cttaacaacg tccatgcctg tgccaaaaat gattttaaaa ttggtggtgc cattgaagag 1201 gtctacgtca gtctgggtgt gtctgtaggc aaatgcagag gtattctgaa tgaaataaaa 1261 gacagaaaca agagggacac catggtggag gacttggtgg tcctggtacg aggaggggca 1321 agtgagcaca tcaccaccct ggcataccag gagctgccga cggcggacct gatgcaggag 1381 tggggagacg ctgtgcagta caacccagcc atcatcaaag ttaaggtgga gcctctgtat 1441 gaactagtga cagccacaga ttttgcctat tccagcacag tgaggcagaa catgaagcag 1501 gcactggagg agttccagaa ggaagttagt tcctgccact gtgctccctg ccaaggaaat 1561 ggagtccctg tcctgaaagg atcacgctgt gactgcatct gtcctgttgg atcccaaggc 1621 ctagcctgtg aggtctccta tcggaagaat acccccattg atgggaagtg gaattgctgg 1681 tcaaattggt cttcatgctc tggaagacgt aagacaagac aaaggcagtg taacaatcca 1741 cctcctcaaa atgggggtag cccctgttca ggccctgctt cagaaacact tgactgctcc 1801 tagcagatga tacagcagtg ggctacatac aatgagagcc ctgagccctc aagaactcac 1861 gccagctcag ccctacacca gtttccacct ggagttcatg caagggcaaa aggcagtgcc 1921 atgcaagctg tttaaaataa agatgttacc ttgtaaaatg caagttgatt taaataaata 1981 ctgagttaaa ggctt // LOCUS HUMCA11A 2625 bp mRNA PRI 27-JUN-1994 DEFINITION Homo sapiens cadherin-11 mRNA, complete cds. ACCESSION L34056 NID g506403 KEYWORDS cadherin-11. SOURCE Homo sapiens fetus brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2625) AUTHORS Suzuki,S., Sano,K. and Tanihara,H. TITLE Diversity of the cadherin family: evidence for eight new cadherins in nervous tissue JOURNAL Cell Regul. 2, 261-270 (1991) MEDLINE 91283540 REFERENCE 2 (bases 1 to 2625) AUTHORS Tanihara,H., Sano,K., Heimark,R.L., St.John,T. and Suzuki,S. TITLE Cloning of five cadherins clarifies characteristic features of cadherin extracellular domain and provides further evidence for two structurally different types of cadherin JOURNAL Cell Adhesion Commun. 2, 15-26 (1994) FEATURES Location/Qualifiers source 1..2625 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="brain" CDS 156..2546 /note="putative" /citation=[1] /citation=[2] /codon_start=1 /function="cell adhesion" /product="cadherin-11" /db_xref="PID:g506404" /translation="MKENYCLQAALVCLGMLCHSHAFAPERRGHLRPSFHGHHEKGKE GQVLQRSKRGWVWNQFFVIEEYTGPDPVLVGRLHSDIDSGDGNIKYILSGEGAGTIFV IDDKSGNIHATKTLDREERAQYTLMAQAVDRDTNRPLEPPSEFIVKVQDINDNPPEFL HETYHANVPERSNVGTSVIQVTASDADDPTYGNSAKLVYSILEGQPYFSVEAQTGIIR TALPNMDREAKEEYHVVIQAKDMGGHMGGLSGTTKVTITLTDVNDNPPKFPQRLYQMS VSEAAVPGEEVGRVKAKDPDIGENGLVTYNIVDGDGMESFEITTDYETQEGVIKLKKP VDFETERAYSLKVEAANVHIDPKFISNGPFKDTVTVKISVEDADEPPMFLAPSYIHEV QENAAAGTVVGRVHAKDPDAANSPIRYSIDRHTDLDRFFTINPEDGFIKTTKPLDREE TAWLNITVFAAEIHNRHQEAQVPVAIRVLDVNDNAPKFAAPYEGFICESDQTKPLSNQ PIVTISADDKDDTANGPRFIFSLPPEIIHNPNFTVRDNRDNTAGVYARRGGFSRQKQD LYLLPIVISDGGIPPMSSTNTLTIKVCGCDVNGALLSCNAEAYILNAGLSTGALIAIL ACIVILLVIVVLFVTLRRQKKEPLIVFEEEDVRENIITYDDEGGGEEDTEAFDIATLQ NPDGINGFIPRKDIKPEYQYMPRPGLRPAPNSVDVDDFINTRIQEADNDPTAPPYDSI QIYGYEGRGSVAGSLSSLESATTDSDLDYDYLQNWGPRFKKLADLYGSKDTFDDDS" BASE COUNT 730 a 661 c 675 g 559 t ORIGIN 1 cggcagccct gacgtgatga gctcaaccag cagagacatt ccatcccaag agaggtctgc 61 gtgacgcgtc cgggaggcca ccctcagcaa gaccaccgta cagttggtgg aaggggtgac 121 agctgcattc tcctgtgcct accacgtaac caaaaatgaa ggagaactac tgtttacaag 181 ccgccctggt gtgcctgggc atgctgtgcc acagccatgc ctttgcccca gagcggcggg 241 ggcacctgcg gccctccttc catgggcacc atgagaaggg caaggagggg caggtgctac 301 agcgctccaa gcgtggctgg gtctggaacc agttcttcgt gatagaggag tacaccgggc 361 ctgaccccgt gcttgtgggc aggcttcatt cagatattga ctctggtgat gggaacatta 421 aatacattct ctcaggggaa ggagctggaa ccatttttgt gattgatgac aaatcaggga 481 acattcatgc caccaagacg ttggatcgag aagagagagc ccagtacacg ttgatggctc 541 aggcggtgga cagggacacc aatcggccac tggagccacc gtcggaattc attgtcaagg 601 tccaggacat taatgacaac cctccggagt tcctgcacga gacctatcat gccaacgtgc 661 ctgagaggtc caatgtggga acgtcagtaa tccaggtgac agcttcagat gcagatgacc 721 ccacttatgg aaatagcgcc aagttagtgt acagtatcct cgaaggacaa ccctattttt 781 cggtggaagc acagacaggt atcatcagaa cagccctacc caacatggac agggaggcca 841 aggaggagta ccacgtggtg atccaggcca aggacatggg tggacatatg ggcggactct 901 cagggacaac caaagtgacg atcacactga ccgatgtcaa tgacaaccca ccaaagtttc 961 cgcagaggct ataccagatg tctgtgtcag aagcagccgt ccctggggag gaagtaggaa 1021 gagtgaaagc taaagatcca gacattggag aaaatggctt agtcacatac aatattgttg 1081 atggagatgg tatggaatcg tttgaaatca caacggacta tgaaacacag gagggggtga 1141 taaagctgaa aaagcctgta gattttgaaa ccgaaagagc ctatagcttg aaggtagagg 1201 cagccaacgt gcacatcgac ccgaagttta tcagcaatgg ccctttcaag gacactgtga 1261 ccgtcaagat ctcagtagaa gatgctgatg agccccctat gttcttggcc ccaagttaca 1321 tccacgaagt ccaagaaaat gcagctgctg gcaccgtggt tgggagagtg catgccaaag 1381 accctgatgc tgccaacagc ccgataaggt attccatcga tcgtcacact gacctcgaca 1441 gatttttcac tattaatcca gaggatggtt ttattaaaac tacaaaacct ctggatagag 1501 aggaaacagc ctggctcaac atcactgtct ttgcagcaga aatccacaat cggcatcagg 1561 aagcccaagt cccagtggcc attagggtcc ttgatgtcaa cgataatgct cccaagtttg 1621 ctgcccctta tgaaggtttc atctgtgaga gtgatcagac caagccactt tccaaccagc 1681 caattgttac aattagtgca gatgacaagg atgacacggc caatggacca agatttatct 1741 tcagcctacc ccctgaaatc attcacaatc caaatttcac agtcagagac aaccgagata 1801 acacagcagg cgtgtacgcc cggcgtggag ggttcagtcg gcagaagcag gacttgtacc 1861 ttctgcccat agtgatcagc gatggcggca tcccgcccat gagtagcacc aacaccctca 1921 ccatcaaagt ctgcgggtgc gacgtgaacg gggcactgct ctcctgcaac gcagaggcct 1981 acattctgaa cgccggcctg agcacaggcg ccctgatcgc catcctcgcc tgcatcgtca 2041 ttctcctggt cattgtagta ttgtttgtga ccctgagaag gcaaaagaaa gaaccactca 2101 ttgtctttga ggaagaagat gtccgtgaga acatcattac ttatgatgat gaagggggtg 2161 gggaagaaga cacagaagcc tttgatattg ccaccctcca gaatcctgat ggtatcaatg 2221 gatttatccc ccgcaaagac atcaaacctg agtatcagta catgcctaga cctgggctcc 2281 ggccagcgcc caacagcgtg gatgtcgatg acttcatcaa cacgagaata caggaggcag 2341 acaatgaccc cacggctcct ccttatgact ccattcaaat ctacggttat gaaggcaggg 2401 gctcagtggc cgggtccctg agctccctag agtcggccac cacagattca gacttggact 2461 atgattatct acagaactgg ggacctcgtt ttaagaaact agcagatttg tatggttcca 2521 aagacacttt tgatgacgat tcttaacaat aacgatacaa atttggcctt aagaactgtg 2581 tctggcgttc tcaagaatct agaagatgtg taacaggtat ttttt // LOCUS HUMCA12A 2521 bp mRNA PRI 28-JUN-1994 DEFINITION Homo sapiens cadherin-12 mRNA, complete cds. ACCESSION L34057 NID g506405 KEYWORDS cadherin-12. SOURCE Homo sapiens fetus brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2521) AUTHORS Suzuki,S., Sano,K. and Tanihara,H. TITLE Diversity of the cadherin family: evidence for eight new cadherins in nervous tissue JOURNAL Cell Regul. 2, 261-270 (1991) MEDLINE 91283540 REFERENCE 2 (bases 1 to 2521) AUTHORS Tanihara,H., Sano,K., Heimark,R.L., St.John,T. and Suzuki,S. TITLE Cloning of five cadherins clarifies characteristic features of cadherin extracellular domain and provides further evidence for two structurally different types of cadherin JOURNAL Cell Adhesion Commun. 2, 15-26 (1994) FEATURES Location/Qualifiers source 1..2521 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="brain" CDS 94..2478 /note="putative" /citation=[1] /citation=[2] /codon_start=1 /function="cell adhesion" /product="cadherin-12" /db_xref="PID:g506406" /translation="MLTRNCLSLLLWVLFDGGLLTPLQPQPQQTLATEPRENVIHLPG QRSHFQRVKRGWVWNQFFVLEEYVGSEPQYVGKLHSDLDKGEGTVKYTLSGDGAGTVF TIDETTGDIHAIRSLDREEKPFYTLRAQAVDIETRKPLEPESEFIIKVQDINDNEPKF LDGPYVATVPEMSPVGAYVLQVKATDADDPTYGNSARVVYSILQGQPYFSIDPKTGVI RTALPNMDREVKEQYQVLIQAKDMGGQLGGLAGTTIVNITLTDVNDNPPRFPKSIFHL KVPESSPIGSAIGRIRAVDPDFGQNAEIEYNIVPGDGGNLFDIVTDEDTQEGVIKLKK PLDFETKKAYTFKVEASNLHLDHRFHSAGPFKDTATVKISVLDVDEPPVFSKPLYTME VYEDTPVGTIIGAVTAQDLDVGSGAVRYFIDWKSDGDSYFTIDGNEGTIATNELLDRE STAQYNFSIIASKVSNPLLTSKVNILINVLDVNEFPPEISVPYETAVCENAKPGQIIQ IVSAADRDLSPAGQQFSFRLSPEAAIKPNFTVRDFRNNTAGIETRRNGYSRRQQELYF LPVVIEDSSYPVQSSTNTMTIRVCRCDSDGTILSCNVEAIFLPVGLSTGALIAILLCI VILLAIVVLYVALRRQKKKHTLMTSKEDIRDNVIHYDDEGGGEEDTQAFDIGALRNPK VIEENKIRRDIKPDSLCLPRQRPPMEDNTDIRDFIHQRLQENDVDPTAPPIDSLATYA YEGSGSVAESLSSIDSLTTEADQDYDYLTDWGPRFKVLADMFGEEESYNPDKVT" BASE COUNT 765 a 561 c 583 g 612 t ORIGIN 1 cggtggaggc cacagacacc tcaaacctgg attccacaat tctacgttaa gtgttggagt 61 ttttattact ctgctgtagg aaagcctttg ccaatgctta caaggaactg tttatccctg 121 cttctctggg ttctgtttga tggaggtctc ctaacaccac tacaaccaca gccacagcag 181 actttagcca cagagccaag agaaaatgtt atccatctgc caggacaacg gtcacatttc 241 caacgtgtta aacgtggctg ggtatggaat caattttttg tgctggaaga atacgtgggc 301 tccgagcctc agtatgtggg aaagctccat tccgacttag acaagggaga gggcactgtg 361 aaatacaccc tctcaggaga tggcgctggc accgttttta ccattgatga aaccacaggg 421 gacattcatg caataaggag cctagataga gaagagaaac ctttctacac tcttcgtgct 481 caggctgtgg acatagaaac cagaaagccc ctggagcctg aatcagaatt catcatcaaa 541 gtgcaggata ttaatgataa tgagccaaag tttttggatg gaccttatgt tgctactgtt 601 ccagaaatgt ctcctgtggg tgcatatgta ctccaggtca aggccacaga tgcagatgac 661 ccgacctatg gaaacagtgc cagagtcgtt tacagcattc ttcagggaca accttatttc 721 tctattgatc ccaagacagg tgttattaga acagctttgc caaacatgga cagagaagtc 781 aaagaacaat atcaagtact catccaagcc aaggatatgg gaggacagct tggaggatta 841 gccggaacaa caatagtcaa catcactctc accgatgtca atgacaatcc acctcgattc 901 cccaaaagca tcttccactt gaaagttcct gagtcttccc ctattggttc agctattgga 961 agaataagag ctgtggatcc tgattttgga caaaatgcag aaattgaata caatattgtt 1021 ccaggagatg ggggaaattt gtttgacatc gtcacagatg aggatacaca agagggagtc 1081 atcaaattga aaaagccttt agattttgaa acaaagaagg catacacttt caaagttgag 1141 gcttccaacc ttcaccttga ccaccggttt cactcggcgg gccctttcaa agacacagct 1201 acggtgaaga tcagcgtgct ggacgtagat gagccaccgg ttttcagcaa gccgctctac 1261 accatggagg tttatgaaga cactccggta gggaccatca ttggcgctgt cactgctcaa 1321 gacctggatg taggcagcgg tgctgttagg tacttcatag attggaagag tgatggggac 1381 agctacttta caatagatgg aaatgaagga accatcgcca ctaatgaatt actagacaga 1441 gaaagcactg cgcagtataa tttctccata attgcgagta aagttagtaa ccctttattg 1501 accagcaaag tcaatatact gattaatgtc ttagatgtaa atgaatttcc tccagaaata 1561 tctgtgccat atgagacagc cgtgtgtgaa aatgccaagc caggacagat aattcagata 1621 gtcagtgctg cagaccgaga tctttcacct gctgggcaac aattctcctt tagattatca 1681 cctgaggctg ctatcaaacc aaattttaca gttcgtgact tcagaaacaa cacagcgggg 1741 attgaaaccc gaagaaatgg atacagccgc aggcagcaag agttgtattt cctccctgtt 1801 gtaatagaag acagcagcta ccctgtccag agcagcacaa acacaatgac tattcgagtc 1861 tgtagatgtg actctgatgg caccatcctg tcttgtaatg tggaagcaat ttttctacct 1921 gtaggactta gcactggggc gttgattgca attctactat gcattgttat actcttagcc 1981 atagttgtac tgtatgtagc actgcgaagg cagaagaaaa agcacaccct gatgacctct 2041 aaagaagaca tcagagacaa cgtcatccat tacgatgatg aaggaggtgg ggaggaagat 2101 acccaggctt tcgacatcgg ggctctgaga aacccaaaag tgattgagga gaacaaaatt 2161 cgcagggata taaaaccaga ctctctctgt ttacctcgtc agagaccacc catggaagat 2221 aacacagaca taagggattt cattcatcaa aggctacagg aaaatgatgt agatccaact 2281 gccccaccaa tcgattcact ggccacatat gcctacgaag ggagtgggtc cgtggcagag 2341 tccctcagct ctatagactc tctcaccaca gaagccgacc aggactatga ctatctgaca 2401 gactggggac cccgctttaa agtcttggca gacatgtttg gcgaagaaga gagttataac 2461 cctgataaag tcacttaagg gagtcgtgga ggctaaaata caaccgagag gggagatttt 2521 t // LOCUS HUMCA13A 2690 bp mRNA PRI 28-JUN-1994 DEFINITION Homo sapiens cadherin-13 mRNA, complete cds. ACCESSION L34058 NID g506407 KEYWORDS cadherin-13. SOURCE Homo sapiens fetus brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2690) AUTHORS Suzuki,S., Sano,K. and Tanihara,H. TITLE Diversity of the cadherin family: evidence for eight new cadherins in nervous tissue JOURNAL Cell Regul. 2, 261-270 (1991) MEDLINE 91283540 REFERENCE 2 (bases 1 to 2690) AUTHORS Tanihara,H., Sano,K., Heimark,R.L., St.John,T. and Suzuki,S. TITLE Cloning of five cadherins clarifies characteristic features of cadherin extracellular domain and provides further evidence for two structurally different types of cadherin JOURNAL Cell Adhesion Commun. 2, 15-26 (1994) FEATURES Location/Qualifiers source 1..2690 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="brain" CDS 445..2586 /note="putative" /citation=[1] /citation=[2] /codon_start=1 /function="cell adhesion" /product="cadherin-13" /db_xref="PID:g506408" /translation="MQPRTPLVLCVLLSQVLLLTSAEDLDCTPGFQQKVFHINQPAEF IEDQSILNLTFSDCKGNDKLRYEVSSPYFKVNSDGGLVALRNITAVGKTLFVHARTPH AEDMAELVIVGGKDIQGSLQDIFKFARTSPVPRQKRSIVVSPILIPENQRQPFPRDVG KVVDSDRPERSKFRLTGKGVDQEPKGIFRINENTGSVSVTRTLDREVIAVYQLFVETT DVNGKTLEGPVPLEVIVIDQNDNRPIFREGPYIGHVMEGSPTGTTVMRMTAFDADDPA TDNALLRYNIRQQTPDKPSPNMFYIDPEKGDIVTVVSPALLDRETLENPKYELIIEAQ DMAGLDVGLTGTATATIMIDDKNDHSPKFTKKEFQATVEEGAVGVIVNLTVEDKDDPT TGAWRAAYTIINGNPGQSFEIHTNPQTNEGMLSVVKPLDYEISAFHTLLIKVENEDPL VPDVSYGPSSTATVHITVLDVNEGPVFYPDPMMVTRQEDLSVGSVLLTVNATDPDSLQ HQTIRYSVYKDPAGWLNINPINGTVDTTAVLDRESPFVDNSVYTALFLAIDSGNPPAT GTGTLLITLEDVNDNAPFIYPTVAEVCDDAKNLSVVILGASDKDLHPNTDPFKFEIHK QAVPDKVWKISKINNTHALVSLLQNLNKANYNLPIMVTDSGKPPMTNITDLRVQVCSC RNSKVDCNAAGALRFSLPSVLLLSLFSLACL" BASE COUNT 729 a 692 c 669 g 600 t ORIGIN 1 cttcaaggtt ttgctgactc agtctggtag tcagagtctg caggagaaga cagttcaagg 61 cagggcctgg aggattggat cagtttaggg acaggtcaaa ggctggctta gagaccttag 121 aggcaggttg cttgggtcgt tgaatgctag tctggtcctg agagcccttt tctctggcaa 181 ctgtggactc agagctaacc aattgtagtt ggcagtgggg gtgaagggtg atccagaggc 241 ctgagctgca gagggcacaa gagagaaaag atgtcttaga aagagctttg agaacatgcc 301 ttggctgctg gcagggacct tggatggggt agtctacacc cggaagtgcc tgcctgccat 361 cctctagtgg ctgccttgca aaatatgctc agtgcagccg cgtgcatgaa tgaaaacgcc 421 gccgggcgct tctagtcgga caaaatgcag ccgagaactc cgctcgttct gtgcgttctc 481 ctgtcccagg tgctgctgct aacatctgca gaagatttgg actgcactcc tggatttcag 541 cagaaagtgt tccatatcaa tcagccagct gaattcattg aggaccagtc aattctaaac 601 ttgaccttca gtgactgtaa gggaaacgac aagctacgct atgaggtctc gagcccatac 661 ttcaaggtga acagcgatgg cggcttagtt gctctgagaa acataactgc agtgggcaaa 721 actctgttcg tccatgcacg gaccccccat gcggaagata tggcagaact cgtgattgtc 781 ggggggaaag acatccaggg ctccttgcag gatatattta aatttgcaag aacttctcct 841 gtcccaagac aaaagaggtc cattgtggta tctcccattt taattccaga gaatcagaga 901 cagcctttcc caagagatgt tggcaaggta gtcgatagtg acaggccaga aaggtccaag 961 ttccggctca ctggaaaggg agtggatcaa gagcctaaag gaattttcag aatcaatgag 1021 aacacaggga gcgtctccgt gacacggacc ttggacagag aagtaatcgc tgtttatcaa 1081 ctatttgtgg agaccactga tgtcaatggc aaaactctcg aggggccggt gcctctggaa 1141 gtcattgtga ttgatcagaa tgacaaccga ccgatctttc gggaaggccc ctacatcggc 1201 cacgtcatgg aagggtcacc cacaggcacc acagtgatgc ggatgacagc ctttgatgca 1261 gatgacccag ccaccgataa tgccctcctg cggtataata tccgtcaaca gacgcctgac 1321 aagccatctc ccaacatgtt ctacatcgat cctgagaaag gagacattgt cactgttgtg 1381 tcacctgcgc tgctggaccg agagactctg gaaaatccca agtatgaact gatcatcgag 1441 gctcaagata tggctggact ggatgttgga ttaacaggca cggccacagc cacgatcatg 1501 atcgatgaca aaaatgatca ctcaccaaaa ttcaccaaga aagagtttca agccacagtc 1561 gaggaaggag ctgtgggagt tattgtcaat ttgacagttg aagataagga tgaccccacc 1621 acaggtgcat ggagggctgc ctacaccatc atcaacggaa accccgggca gagctttgaa 1681 atccacacca accctcaaac caacgaaggg atgctttctg ttgtcaaacc attggactat 1741 gaaatttctg ccttccacac cctgctgatc aaagtggaaa atgaagaccc actcgtaccc 1801 gacgtctcct acggccccag ctccacagcc accgtccaca tcactgtcct ggatgtcaac 1861 gagggcccag tcttctaccc agaccccatg atggtgacca ggcaggagga cctctctgtg 1921 ggcagcgtgc tgctgacagt gaatgccacg gaccccgact ccctgcagca tcaaaccatc 1981 aggtattctg tttacaagga cccagcaggt tggctgaata ttaaccccat caatgggact 2041 gttgacacca cagctgtgct ggaccgtgag tccccatttg tcgacaacag cgtgtacact 2101 gctctcttcc tggcaattga cagtggcaac cctcccgcta cgggcactgg gactttgctg 2161 ataaccctgg aggacgtgaa tgacaatgcc ccgttcattt accccacagt agctgaagtc 2221 tgtgatgatg ccaaaaacct cagtgtagtc attttgggag catcagataa ggatcttcac 2281 ccgaatacag atcctttcaa atttgaaatc cacaaacaag ctgttcctga taaagtctgg 2341 aagatctcca agatcaacaa tacacacgcc ctggtaagcc ttcttcaaaa tctgaacaaa 2401 gcaaactaca acctgcccat catggtgaca gattcaggga aaccacccat gacgaatatc 2461 acagatctca gggtacaagt gtgctcctgc aggaattcca aagtggactg caacgcggcg 2521 ggggccctgc gcttcagcct gccctcagtc ctgctcctca gcctcttcag cttagcttgt 2581 ctgtgagaac tcctgacgtc tgaagcttga ctcccaagtt tccatagcaa caggaaaaaa 2641 aaaaaatcta tccaaatctg aagattgcgg tttacagcta tcgaacttcg // LOCUS HUMCA6 1316 bp mRNA PRI 31-OCT-1994 DEFINITION Human carbonic anhydrase isozyme VI (CA6) mRNA, complete cds. ACCESSION M57892 J05305 NID g179731 KEYWORDS carbonate dehydratase; carbonic anhydrase isozyme VI; secreted protein. SOURCE Human salivary gland, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1316) AUTHORS Aldred,P., Fu,P., Barrett,G., Penschow,J.D., Wright,R.D., Coghlan,J.P. and Fernley,R.T. TITLE Human secreted carbonic anhydrase: cDNA cloning, nucleotide sequence, and hybridization histochemistry JOURNAL Biochemistry 30 (2), 569-575 (1991) MEDLINE 91105141 FEATURES Location/Qualifiers source 1..1316 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="salivary gland" /map="1p36" mRNA <1..1316 /gene="CA6" /note="G00-125-350; putative" gene 1..1316 /gene="CA6" sig_peptide 7..57 /gene="CA6" /note="G00-125-350; putative" CDS 7..933 /gene="CA6" /EC_number="4.2.1.1" /note="putative" /codon_start=1 /db_xref="GDB:G00-125-350" /product="carbonic anhydrase isozyme VI" /db_xref="PID:g179732" /translation="MRALVLLLSLFLLGGQAQHVSDWTYSEGALDEAHWPQHYPACGG QRQSPINLQRTKVRYNPSLKGLNMTGYETQAGEFPMVNNGHTVQIGLPSTMRMTVADG IVYIAQQMHFHWGGASSEISGSEHTVDGIRHVIEIHIVHYNSKYKTYDIAQDAPDGLA VLAAFVEVKNYPENTYYSNFISHLANIKYPGQRTTLTGLDVQDMLPRNLQHYYTYHGS LTTPPCTENVHWFVLADFVKLSRTQVWKLENSLLDHRNKTIHNDYRRTQPLKHRVVES NFPNQEYTLGSEFQFYLHKIEEILDYLRRALN" mat_peptide 58..930 /gene="CA6" /EC_number="4.2.1.1" /note="G00-125-350; putative" /product="carbonic anhydrase isozyme VI" BASE COUNT 362 a 343 c 313 g 298 t ORIGIN Chromosome 1. 1 aacaccatga gggccctggt gcttctgctg tccctgttcc tgctgggtgg ccaggcccag 61 catgtgtctg actggaccta ctcagaaggg gcactggacg aagcgcactg gccacagcac 121 taccccgcct gtgggggcca gagacagtcg cctatcaacc tacagaggac gaaggtgcgg 181 tacaacccct ccttgaaggg gctcaatatg acaggctatg agacccaggc aggggagttc 241 cccatggtca acaatggcca cacagtgcag atcggcctgc cctccaccat gcgcatgaca 301 gtggctgacg gcattgtata catagcccag cagatgcact ttcactgggg aggtgcgtcc 361 tcggagatca gcggctctga gcacaccgtg gacgggatca gacatgtgat cgagattcac 421 attgttcact acaattctaa atacaagacg tatgatatag cccaagatgc gccggatggt 481 ttggctgtac tggcagcctt cgttgaggtg aagaattacc ctgaaaacac ttattacagc 541 aacttcattt ctcatctggc caacatcaag tacccaggac aaagaacaac cctgactggc 601 cttgacgttc aggacatgct gcccaggaac ctccagcact actacaccta ccatggctca 661 ctcaccacgc ctccctgcac tgagaacgtc cactggtttg tgctggcaga ttttgtcaag 721 ctctccagga cacaggtttg gaagctggag aattccttac tggatcaccg caacaagacc 781 atccacaacg attaccgcag gacccagccc ctgaaacaca gagtggtgga atccaacttc 841 ccgaatcagg aatacactct aggctctgaa ttccagtttt acctacataa gattgaggaa 901 attcttgact acttaagaag agcattgaac tgaggaaagc taagaggaag attcaataat 961 attaactagc ttgaagcctg acctagccag aagtgcctgt ccgctgcagc cgcaccctac 1021 cttgtctaag aaaccatgtg tgtctggaac acgctgctcc cctgggcagc tgttgggatt 1081 ctgattaaag aggggaaacg atcatcctgg acaggaagtg agatggcttc agttcatgag 1141 acgggatctg agttagacat caccagtgga aattgattgg aatagaaact taaaggaaat 1201 ggaaccctaa ctattctccc atcaaatcat atatgttgac ctgtctgaat tataaaccag 1261 cctgaccttt cctttagcat tagatgtaat aaaataactt tggaaatttg tcattt // LOCUS HUMCA6A 4315 bp mRNA PRI 07-JUL-1997 DEFINITION Homo sapiens mRNA for cadherin-6, complete cds. ACCESSION D31784 NID g974184 KEYWORDS cadherin-6; precursor protein. SOURCE Homo sapiens Hepatocellular carcinoma cell cell_line:C-Li21 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4315) AUTHORS Shimoyama,Y. TITLE Direct Submission JOURNAL Submitted (08-JUN-1994) to the DDBJ/EMBL/GenBank databases. Yutaka Shimoyama, National Cancer Center, Pathology Division; Tsukiji 5-1-1, Chuo-ku, Tokyo 104, Japan (Tel:03-3542-2511(ex.4208), Fax:03-3248-2737) REFERENCE 2 (bases 1 to 4315) AUTHORS Shimoyama,Y., Gotoh,M., Terasaki,T., Kitajima,M. and Hirohashi,S. TITLE Isolation and sequence analysis of human cadherin-6 complementary DNA for the full coding sequence and its expression in human carcinoma cells JOURNAL Cancer Res. 55 (10), 2206-2211 (1995) MEDLINE 95262134 COMMENT Submitted (08-Jun-1994) to DDBJ by: Yutaka Shimoyama National Cencer Center Pathology Division Tsukiji 5-1-1 Chuo-ku, Tokyo 104 Japan Phone: 03-3542-2511 x4208 Fax: 03-3248-2737. FEATURES Location/Qualifiers source 1..4315 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="C-Li21" /cell_type="Hepatocellular carcinoma cell" CDS 121..2493 /note="precursor protein" /codon_start=1 /product="cadherin-6" /db_xref="PID:d1007133" /db_xref="PID:g974185" /translation="MRTYRYFLLLFWVGQPYPTLSTPLSKRTSGFPAKKRALELSGNS KNELNRSKRSWMWNQFFLLEEYTGSDYQYVGKLHSDQDRGDGSLKYILSGDGAGDLFI INENTGDIQATKRLDREEKPVYILRAQAINRRTGRPVEPESEFIIKIHDINDNEPIFT KEVYTATVPEMSDVGTFVVQVTATDADDPTYGNSAKVVYSILQGQPYFSVESETGIIK TALLNMDRENREQYQVVIQAKDMGGQMGGLSGTTTVNITLTDVNDNPPRFPQSTYQFK TPESSPPGTPIGRIKASDADVGENAEIEYSITDGEGLDMFDVITDQETQEGIITVKKL LDFEKKKVYTLKVEASNPYVEPRFLYLGPFKDSATVRIVVEDVDEPPVFSKLAYILQI REDAQINTTIGSVTAQDPDAARNPVKYSVDRHTDMDRIFNIDSGNGSIFTSKLLDRET LLWHNITVIATEINNPKQSSRVPLYIKVLDVNDNAPEFAEFYETFVCEKAKADQLIQT LHAVDKDDPYSGHQFSFSLAPEAASGSNFTIQDNKDNTAGILTRKNGYNRHEMSTYLL PVVISDNDYPVQSSTGTVTVRVCACDHHGNMQSCHAEALIHPTGLSTGALVAILLCIV ILLVTVVLFAALRRQRKKEPLIISKEDIRDNIVSYNDEGGGEEDTQAFDIGTLRNPEA IEDNKLRRDIVPEALFLPRRTPTARDNTDVRDFINQRLKENDTDPTAPPYDSLATYAY EGTGSVADSLSSLESVTTDADQDYDYLSDWGPRFKKLADMYGGVDSDKDS" polyA_signal 3101..3106 polyA_site 3129 polyA_site 4315 BASE COUNT 1345 a 932 c 888 g 1150 t ORIGIN 1 tgagagccaa gcaaagaaca ttaaggaagg aaggaggaat gaggctggat acggtgcagt 61 gaaaaaggca cttccaagag tggggcactc actacgcaca gactcgacgg tgccatcagc 121 atgagaactt accgctactt cttgctgctc ttttgggtgg gccagcccta cccaactctc 181 tcaactccac tatcaaagag gactagtggt ttcccagcaa agaaaagggc cctggagctc 241 tctggaaaca gcaaaaatga gctgaaccgt tcaaaaagga gctggatgtg gaatcagttc 301 tttctcctgg aggaatacac aggatccgat tatcagtatg tgggcaagtt acattcagac 361 caggatagag gagatggatc acttaaatat atcctttcag gagatggagc aggagatctc 421 ttcattatta atgaaaacac aggcgacata caggccacca agaggctgga cagggaagaa 481 aaacccgttt acatccttcg agctcaagct ataaacagaa ggacagggag acccgtggag 541 cccgagtctg aattcatcat caagatccat gacatcaatg acaatgaacc aatattcacc 601 aaggaggttt acacagccac tgtccctgaa atgtctgatg tcggtacatt tgttgtccaa 661 gtcactgcga cggatgcaga tgatccaaca tatgggaaca gtgctaaagt tgtctacagt 721 attctacagg gacagcccta tttttcagtt gaatcagaaa caggtattat caagacagct 781 ttgctcaaca tggatcgaga aaacagggag cagtaccaag tggtgattca agccaaggat 841 atgggcggcc agatgggagg attatctggg accaccaccg tgaacatcac actgactgat 901 gtcaacgaca accctccccg attcccccag agtacatacc agtttaaaac tcctgaatct 961 tctccaccgg ggacaccaat tggcagaatc aaagccagcg acgctgatgt gggagaaaat 1021 gctgaaattg agtacagcat cacagacggt gaggggctgg atatgtttga tgtcatcacc 1081 gaccaggaaa cccaggaagg gattataact gtcaaaaagc tcttggactt tgaaaagaag 1141 aaagtgtata cccttaaagt ggaagcctcc aatccttatg ttgagccacg atttctctac 1201 ttggggcctt tcaaagattc agccacggtt agaattgtgg tggaggatgt agatgagcca 1261 cctgtcttca gcaaactggc ctacatctta caaataagag aagatgctca gataaacacc 1321 acaataggct ccgtcacagc ccaagatcca gatgctgcca ggaatcctgt caagtactct 1381 gtagatcgac acacagatat ggacagaata ttcaacattg attctggaaa tggttcgatt 1441 tttacatcga aacttcttga ccgagaaaca ctgctatggc acaacattac agtgatagca 1501 acagagatca ataatccaaa gcaaagtagt cgagtacctc tatatattaa agttctagat 1561 gtcaatgaca acgccccaga atttgctgag ttctatgaaa cttttgtctg tgaaaaagca 1621 aaggcagatc agttgattca gaccctgcat gctgttgaca aggatgaccc ttatagtgga 1681 caccaatttt cgttttcctt ggcccctgaa gcagccagtg gctcaaactt taccattcaa 1741 gacaacaaag acaacacggc gggaatctta actcggaaaa atggctataa tagacacgag 1801 atgagcacct atctcttgcc tgtggtcatt tcagacaacg actacccagt tcaaagcagc 1861 actgggacag tgactgtccg ggtctgtgca tgtgaccacc acgggaacat gcaatcctgc 1921 catgcggagg cgctcatcca ccccacggga ctgagcacgg gggctctggt tgccatcctt 1981 ctgtgcatcg tgatcctact agtgacagtg gtgctgtttg cagctctgag gcggcagcga 2041 aaaaaagagc ctttgatcat ttccaaagag gacatcagag ataacattgt cagttacaac 2101 gacgaaggtg gtggagagga ggacacccag gcttttgata tcggcaccct gaggaatcct 2161 gaagccatag aggacaacaa attacgaagg gacattgtgc ccgaagccct tttcctaccc 2221 cgacggactc caacagctcg cgacaacacc gatgtcagag atttcattaa ccaaaggtta 2281 aaggaaaatg acacggaccc cactgccccg ccatacgact ccttggccac ttacgcctat 2341 gaaggcactg gctccgtggc ggattccctg agctcgctgg agtcagtgac cacggatgca 2401 gatcaagact atgattacct tagtgactgg ggacctcgat tcaaaaagct tgcagatatg 2461 tatggaggag tggacagtga caaagactcc taatctgttg cctttttcat tttccaatac 2521 gacactgaaa tatgtgaagt ggctatttct ttatatttat ccactactcc gtgaaggctt 2581 ctctgttcta cccgttccaa aagccaatgg ctgcagtccg tgtggatcca atgttagaga 2641 cttttttcta gtacactttt atgagcttcc aaggggcaaa tttttatttt ttagtgcatc 2701 cagttaacca agtcagccca acaggcaggt gccggagggg aggacaggga acagtatttc 2761 cacttgttct cagggcagcg tgcccgcttc cgctgtcctg gtgttttact acactccatg 2821 tcaggtcagc caactgccct aactgtacat ttcacaggct aatgggataa aggactgtgc 2881 tttaaagata aaaatatcat catagtaaaa gaaatgaggg catatcggct cacaaagaga 2941 taaactacat aggggtgttt atttgtgtca caaagaattt aaaataacac ttgcccatgc 3001 tatttgttct tcaagaactt tctctgccat caactactat tcaaaacctc aaatccaccc 3061 atatgttaaa attctcatta ctcttaagga atagaagcaa attaaacggt aacatccaaa 3121 agcaaccaca aacctagtac gacttcattc cttccactaa ctcatagttt gttatatcct 3181 agactagaca tgcgaaagtt tgcctttgta ccatataaag ggggagggaa atagctaata 3241 atgttaacca aggaaatata ttttaccata catttaaagt tttggccacc acatgtatca 3301 cgggtcactt gaaattcttt cagctatcag taggctaatg tcaaaattgt ttaaaaattc 3361 ttgaaagaat tttcctgaga caaattttaa cttcttgtct atagttgtca gtattattct 3421 actatactgt acatgaaagt agcagtgtga agtacaataa ttcatattct tcatatcctt 3481 cttacacgac taagttgaat tagtaaagtt agattaaata aaacttaaat ctcactctag 3541 gagttcagtg gagaggttag agccagccac acttgaacct aataccctgc ccttgacatc 3601 tggaaacctc tacatattta tataacgtga tacatttgga taaacaacat tgagattatg 3661 atgaaaacct acatattcca tgtttggaag acccttggaa gaggaaaatt ggattccctt 3721 aaacaaaagt gtttaagatt gtaattaaaa tgatagttga ttttcaaaag cattaatttt 3781 ttttcattgt ttttaacttt gctttcatga ccatcctgcc atccttgact ttgaactaat 3841 gataaagtaa tgatctcaaa ctatgacaga aaagtaatgt aaaatccatc caatctatta 3901 tttctctaat tatgcaatta gcctcatagt tattatccag aggacccaac tgaactgaac 3961 taatccttct ggcagattca aatcgtttat ttcacacgct gttctaatgg cacttatcat 4021 tagaatctta ccttgtgcag tcatcagaaa ttccagcgta ctataatgaa aacatccttg 4081 ttttgaaaac ctaaaagaca ggctctgtat atatatatac ttaagaatat gctgacttca 4141 cttattagtc ttagggattt attttcaatt aatattaatt ttctacaaat aattttagtg 4201 tcatttccat ttggggatat tgtcatatca gcacatattt tctgtttgga aacacactgt 4261 tgtttagtta agttttaaat aggtgtatta cccaagaagt aaagatggaa acgtt // LOCUS HUMCACH1A 7193 bp mRNA PRI 27-JAN-1992 DEFINITION Human neuroendocrine/beta-cell-type calcium channel alpha-1 subunit mRNA, complete cds. ACCESSION M83566 NID g179751 KEYWORDS calcium channel alpha-1 subunit. SOURCE Homo sapiens pancreatic islets of Langerhans cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7193) AUTHORS Seino,S., Chen,L.-C., Seino,M., Blondel,O., Takeda,J., Johnson,J.H. and Bell,G.I. TITLE Cloning of the alpha-1 subunit of a voltage-dependent calcium channel expressed in pancreatic beta cells JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89, 584-588 (1992) MEDLINE 92115705 FEATURES Location/Qualifiers source 1..7193 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="pancreatic beta-cells" /tissue_type="pancreatic islets of Langerhans" CDS 119..6664 /codon_start=1 /product="calcium channel alpha-1 subunit" /db_xref="PID:g179752" /translation="MMMMMMMKKMQHQRQQQADHANEANYARGTRLPLSGEGPTSQPN SSKQTVLSWQAAIDAARQAKAAQTMSTSAPPPVGSLSQRKRQQYAKSKKQGNSSNSRP ARALFCLSLNNPIRRACISIVEWKPFDIFILLAIFANCVALAIYIPFPEDDSNSTNHN LEKVEYAFLIIFTVETFLKIIAYGLLLHPNAYVRNGWNLLDFVIVIVGLFSVILEQLT KETEGGNHSSGKSGGFDVKALRAFRVLRPLRLVSGVPSLQVVLNSIIKAMVPLLHIAL LVLFVIIIYAIIGLELFIGKMHKTCFFADSDIVAEEDPAPCAFSGNGRQCTANGTECR SGWVGPNGGITNFDNFAFAMLTVFQCITMEGWTDVLYWVNDAIGWEWPWVYFVSLIIL GSFFVLNLVLGVLSGEFSKEREKAKARGDFQKLREKQQLEEDLKGYLDWITQAEDIDP ENEEEGGEEGKRNTSMPTSETESVNTENVSGEGENRGCCGSLWCWWRRRGAAKAGPSG CRRWGQAISKSKLSRRWRRWNRFNRRRCRAAVKSVTFYWLVIVLVFLNTLTISSEHYN QPDWLTQIQDIANKVLLALFTCEMLVKMYSLGLQAYFVSLFNRFDCFVVCGGITETIL VELEIMSPLGISVFRCVRLLRIFKVTRHWTSLSNLVASLLNSMKSIASLLLLLFLFII IFSLLGMQLFGGKFNFDETQTKRSTFDNFPQALLTVFQILTGEDWNAVMYDGIMAYGG PSSSGMIVCIYFIILFICGNYILLNVFLAIAVDNLADAESLNTAQKEEAEEKERKKIA RKESLENKKNNKPEVNQIANSDNKVTIDDYREEDEDKDPYPPCDVPVGEEEEEEEEDE PEVPAGPRPRRISELNMKEKIAPIPEGSAFFILSKTNPIRVGCHKLINHHIFTNLILV FIMLSSAALAAEDPIRSHSFRNTILGYFDYAFTAIFTVEILLKMTTFGAFLHKGAFCR NYFNLLDMLVVGVSLVSFGIQSSAISVVKILRVLRVLRPLRAINRAKGLKHVVQCVFV AIRTIGNIMIVTTLLQFMFACIGVQLFKGKFYRCTDEAKSNPEECRGLFILYKDGDVD SPVVRERIWQNSDFNFDNVLSAMMALFTVSTFEGWPALLYKAIDSNGENIGPIYNHRV EISIFFIIYIIIVAFFMMNIFVGFVIVTFQEQGEKEYKNCELDKNQRQCVEYALKARP LRRYIPKNPYQYKFWYVVNSSPFEYMMFVLIMLNTLCLAMQHYEQSKMFNDAMDILNM VFTGVFTVEMVLKVIAFKPKGYFSDAWNTFDSLIVIGSIIDVALSEADPTESENVPVP TATPGNSEESNRISITFFRLFRVMRLVKLLSRGEGIRTLLWTFIKSFQALPYVALLIA MLFFIYAVIGMQMFGKVAMRDNNQINRNNNFQTFPQAVLLLFRCATGEAWQEIMLACL PGKLCDPESDYNPGEEYTCGSNFAIVYFISFYMLCAFLIINLFVAVIMDNFDYLTRDW SILGPHHLDEFKRIWSEYDPEAKGRIKHLDVVTLLRRIQPPLGFGKLCPHRVACKRLV AMNMPLNSDGTVMFNATLFALVRTALKIKTEGNLEQANEELRAVIKKIWKKTSMKLLD QVVPPAGDDEVTVGKFYATFLIQDYFRKFKKRKEQGLVGKYPAKNTTIALQAGLRTLH DIGPEIRRAISCDLQDDEPEETKREEEDDVFKRNGALLGNHVNHVNSDRRDSLQQTNT THRPLHVQRPSIPPASDTEKPLFPPAGNSVCHNHHNHNSIGKQVPTSTNANLNNANMS KAAHGKRPSIGNLEHVSENGHHSSHKHDREPQRRSSVKRTRYYETYIRSDSGDEQLPT ICREDPEIHGYFRDPHCLGEQEYFSSEECYEDDSSPTWSRQNYGYYSRYPGRNIDSER PRGYHHPQGFLEDDDSPVCYDSRRSPRRRLLPPTPASHRRSSFNFECLRRQSSQEEVP SSPIFPHRTALPLHLMQQQIMAVAGLDSSKAQKYSPSHSTRSWATPPATPPYRDWTPC YTPLIQVEQSEALDQVNGSLPSLHRSSWYTDEPDISYRTFTPASLTVPSSFRNKNSDK QRSADSLVEAVLISEGLGRYARDPKFVSATKHEIADACDLTIDEMESAASTLLNGNVR PRANGDVGPLSHRQDYELQDFGPGYSDEEPDPGRDEEDLADEMICITTL" BASE COUNT 1879 a 1810 c 1788 g 1716 t ORIGIN 1 agaataaggg cagggaccgc ggctcctatc tcttggtgat ccccttcccc attccgcccc 61 cgcctcaacg cccagcacag tgccctgcac acagtagtcg ctcaataaat gttcgtggat 121 gatgatgatg atgatgatga aaaaaatgca gcatcaacgg cagcagcaag cggaccacgc 181 gaacgaggca aactatgcaa gaggcaccag acttcctctt tctggtgaag gaccaacttc 241 tcagccgaat agctccaagc aaactgtcct gtcttggcaa gctgcaatcg atgctgctag 301 acaggccaag gctgcccaaa ctatgagcac ctctgcaccc ccacctgtag gatctctctc 361 ccaaagaaaa cgtcagcaat acgccaagag caaaaaacag ggtaactcgt ccaacagccg 421 acctgcccgc gcccttttct gtttatcact caataacccc atccgaagag cctgcattag 481 tatagtggaa tggaaaccat ttgacatatt tatattattg gctatttttg ccaattgtgt 541 ggccttagct atttacatcc cattccctga agatgattct aattcaacaa atcataactt 601 ggaaaaagta gaatatgcct tcctgattat ttttacagtc gagacatttt tgaagattat 661 agcgtatgga ttattgctac atcctaatgc ttatgttagg aatggatgga atttactgga 721 ttttgttata gtaatagtag gattgtttag tgtaattttg gaacaattaa ccaaagaaac 781 agaaggcggg aaccactcaa gcggcaaatc tggaggcttt gatgtcaaag ccctccgtgc 841 ctttcgagtg ttgcgaccac ttcgactagt gtcaggggtg cccagtttac aagttgtcct 901 gaactccatt ataaaagcca tggttcccct ccttcacata gcccttttgg tattatttgt 961 aatcataatc tatgctatta taggattgga actttttatt ggaaaaatgc acaaaacatg 1021 tttttttgct gactcagata tcgtagctga agaggaccca gctccatgtg cgttctcagg 1081 gaatggacgc cagtgtactg ccaatggcac ggaatgtagg agtggctggg ttggcccgaa 1141 cggaggcatc accaactttg ataactttgc ctttgccatg cttactgtgt ttcagtgcat 1201 caccatggag ggctggacag acgtgctcta ctgggtaaat gatgcgatag gatgggaatg 1261 gccatgggtg tattttgtta gtctgatcat ccttggctca tttttcgtcc ttaacctggt 1321 tcttggtgtc cttagtggag aattctcaaa ggaaagagag aaggcaaaag cacggggaga 1381 tttccagaag ctccgggaga agcagcagct ggaggaggat ctaaagggct acttggattg 1441 gatcacccaa gctgaggaca tcgatccgga gaatgaggaa gaaggaggag aggaaggcaa 1501 acgaaatact agcatgccca ccagcgagac tgagtctgtg aacacagaga acgtcagcgg 1561 tgaaggcgag aaccgaggct gctgtggaag tctctggtgc tggtggagac ggagaggcgc 1621 ggccaaggcg gggccctctg ggtgtcggcg gtggggtcaa gccatctcaa aatccaaact 1681 cagccgacgc tggcgtcgct ggaaccgatt caatcgcaga agatgtaggg ccgccgtgaa 1741 gtctgtcacg ttttactggc tggttatcgt cctggtgttt ctgaacacct taaccatttc 1801 ctctgagcac tacaatcagc cagattggtt gacacagatt caagatattg ccaacaaagt 1861 cctcttggct ctgttcacct gcgagatgct ggtaaaaatg tacagcttgg gcctccaagc 1921 atatttcgtc tctcttttca accggtttga ttgcttcgtg gtgtgtggtg gaatcactga 1981 gacgatcctg gtggaactgg aaatcatgtc tcccctgggg atctctgtgt ttcggtgtgt 2041 gcgcctctta agaatcttca aagtgaccag gcactggact tccctgagca acttagtggc 2101 atccttatta aactccatga agtccatcgc ttcgctgttg cttctgcttt ttctcttcat 2161 tatcatcttt tccttgcttg ggatgcagct gtttggcggc aagtttaatt ttgatgaaac 2221 gcaaaccaag cggagcacct ttgacaattt ccctcaagca cttctcacag tgttccagat 2281 cctgacaggc gaagactgga atgctgtgat gtacgatggc atcatggctt acgggggccc 2341 atcctcttca ggaatgatcg tctgcatcta cttcatcatc ctcttcattt gtggtaacta 2401 tattctactg aatgtcttct tggccatcgc tgtagacaat ttggctgatg ctgaaagtct 2461 gaacactgct cagaaagaag aagcggaaga aaaggagagg aaaaagattg ccagaaaaga 2521 gagcctagaa aataaaaaga acaacaaacc agaagtcaac cagatagcca acagtgacaa 2581 caaggttaca attgatgact atagagaaga ggatgaagac aaggacccct atccgccttg 2641 cgatgtgcca gtaggggaag aggaagagga agaggaggag gatgaacctg aggttcctgc 2701 cggaccccgt cctcgaagga tctcggagtt gaacatgaag gaaaaaattg cccccatccc 2761 tgaagggagc gctttcttca ttcttagcaa gaccaacccg atccgcgtag gctgccacaa 2821 gctcatcaac caccacatct tcaccaacct catccttgtc ttcatcatgc tgagcagcgc 2881 tgccctggcc gcagaggacc ccatccgcag ccactccttc cggaacacga tactgggtta 2941 ctttgactat gccttcacag ccatctttac tgttgagatc ctgttgaaga tgacaacttt 3001 tggagctttc ctccacaaag gggccttctg caggaactac ttcaatttgc tggatatgct 3061 ggtggttggg gtgtctctgg tgtcatttgg gattcaatcc agtgccatct ccgttgtgaa 3121 gattctgagg gtcttaaggg tcctgcgtcc cctcagggcc atcaacagag caaaaggact 3181 taagcacgtg gtccagtgcg tcttcgtggc catccggacc atcggcaaca tcatgatcgt 3241 cactaccctc ctgcagttca tgtttgcctg tatcggggtc cagttgttca aggggaagtt 3301 ctatcgctgt acggatgaag ccaaaagtaa ccctgaagaa tgcaggggac ttttcatcct 3361 ctacaaggat ggggatgttg acagtcctgt ggtccgtgaa cggatctggc aaaacagtga 3421 tttcaacttc gacaacgtcc tctctgctat gatggcgctc ttcacagtct ccacgtttga 3481 gggctggcct gcgttgctgt ataaagccat cgactcgaat ggagagaaca tcggcccaat 3541 ctacaaccac cgcgtggaga tctccatctt cttcatcatc tacatcatca ttgtagcttt 3601 cttcatgatg aacatctttg tgggctttgt catcgttaca tttcaggaac aaggagaaaa 3661 agagtataag aactgtgagc tggacaaaaa tcagcgtcag tgtgttgaat acgccttgaa 3721 agcacgtccc ttgcggagat acatccccaa aaacccctac cagtacaagt tctggtacgt 3781 ggtgaactct tcgcctttcg aatacatgat gtttgtcctc atcatgctca acacactctg 3841 cttggccatg cagcactacg agcagtccaa gatgttcaat gatgccatgg acattctgaa 3901 catggtcttc accggggtgt tcaccgtcga gatggttttg aaagtcatcg catttaagcc 3961 taaggggtat tttagtgacg cctggaacac gtttgactcc ctcatcgtaa tcggcagcat 4021 tatagacgtg gccctcagcg aagcggaccc aactgaaagt gaaaatgtcc ctgtcccaac 4081 tgctacacct gggaactctg aagagagcaa tagaatctcc atcacctttt tccgtctttt 4141 ccgagtgatg cgattggtga agcttctcag caggggggaa ggcatccgga cattgctgtg 4201 gacttttatt aagtcctttc aggcgctccc gtatgtggcc ctcctcatag ccatgctgtt 4261 cttcatctat gcggtcattg gcatgcagat gtttgggaaa gttgccatga gagataacaa 4321 ccagatcaat aggaacaata acttccagac gtttccccag gcggtgctgc tgctcttcag 4381 gtgtgcaaca ggtgaggcct ggcaggagat catgctggcc tgtctcccag ggaagctctg 4441 tgaccctgag tcagattaca accccgggga ggagtataca tgtgggagca actttgccat 4501 tgtctatttc atcagttttt acatgctctg tgcatttctg atcatcaatc tgtttgtggc 4561 tgtcatcatg gataatttcg actatctgac ccgggactgg tctattttgg ggcctcacca 4621 tttagatgaa ttcaaaagaa tatggtcaga atatgaccct gaggcaaagg gaaggataaa 4681 acaccttgat gtggtcactc tgcttcgacg catccagcct cccctggggt ttgggaagtt 4741 atgtccacac agggtagcgt gcaagagatt agttgccatg aacatgcctc tcaacagtga 4801 cgggacagtc atgtttaatg caaccctgtt tgctttggtt cgaacggctc ttaagatcaa 4861 gaccgaaggg aacctggagc aagctaatga agaacttcgg gctgtgataa agaaaatttg 4921 gaagaaaacc agcatgaaat tacttgacca agttgtccct ccagctggtg atgatgaggt 4981 aaccgtgggg aagttctatg ccactttcct gatacaggac tactttagga aattcaagaa 5041 acggaaagaa caaggactgg tgggaaagta ccctgcgaag aacaccacaa ttgccctaca 5101 ggcgggatta aggacactgc atgacattgg gccagaaatc cggcgtgcta tatcgtgtga 5161 tttgcaagat gacgagcctg aggaaacaaa acgagaagaa gaagatgatg tgttcaaaag 5221 aaatggtgcc ctgcttggaa accatgtcaa tcatgttaat agtgatagga gagattccct 5281 tcagcagacc aataccaccc accgtcccct gcatgtccaa aggccttcaa ttccacctgc 5341 aagtgatact gagaaaccgc tgtttcctcc agcaggaaat tcggtgtgtc ataaccatca 5401 taaccataat tccataggaa agcaagttcc cacctcaaca aatgccaatc tcaataatgc 5461 caatatgtcc aaagctgccc atggaaagcg gcccagcatt gggaaccttg agcatgtgtc 5521 tgaaaatggg catcattctt cccacaagca tgaccgggag cctcagagaa ggtccagtgt 5581 gaaaagaacc cgctattatg aaacttacat taggtccgac tcaggagatg aacagctccc 5641 aactatttgc cgggaagacc cagagataca tggctatttc agggaccccc actgcttggg 5701 ggagcaggag tatttcagta gtgaggaatg ctacgaggat gacagctcgc ccacctggag 5761 caggcaaaac tatggctact acagcagata cccaggcaga aacatcgact ctgagaggcc 5821 ccgaggctac catcatcccc aaggattctt ggaggacgat gactcgcccg tttgctatga 5881 ttcacggaga tctccaagga gacgcctact acctcccacc ccagcatccc accggagatc 5941 ctccttcaac tttgagtgcc tgcgccggca gagcagccag gaagaggtcc cgtcgtctcc 6001 catcttcccc catcgcacgg ccctgcctct gcatctaatg cagcaacaga tcatggcagt 6061 tgccggccta gattcaagta aagcccagaa gtactcaccg agtcactcga cccggtcgtg 6121 ggccacccct ccagcaaccc ctccctaccg ggactggaca ccgtgctaca cccccctgat 6181 ccaagtggag cagtcagagg ccctggacca ggtgaacggc agcctgccgt ccctgcaccg 6241 cagctcctgg tacacagacg agcccgacat ctcctaccgg actttcacac cagccagcct 6301 gactgtcccc agcagcttcc ggaacaaaaa cagcgacaag cagaggagtg cggacagctt 6361 ggtggaggca gtcctgatat ccgaaggctt gggacgctat gcaagggacc caaaatttgt 6421 gtcagcaaca aaacacgaaa tcgctgatgc ctgtgacctc accatcgacg agatggagag 6481 tgcagccagc accctgctta atgggaacgt gcgtccccga gccaacgggg atgtgggccc 6541 cctctcacac cggcaggact atgagctaca ggactttggt cctggctaca gcgacgaaga 6601 gccagaccct gggagggatg aggaggacct ggcggatgaa atgatatgca tcaccacctt 6661 gtagccccca gcgaggggca gactggctct ggcctcaggt ggggcgcagg agagccaggg 6721 gaaaagtgcc tcatagttag gaaagtttag gcactagttg ggagtaatat tcaattaatt 6781 agacttttgt ataagagatg tcatgcctca agaaagccat aaacctggta ggaacaggtc 6841 ccaagcggtt gagcctggca gagtaccatg cgctcggccc cagctgcagg aaacagcagg 6901 ccccgccctc tcacagagga tgggtgagga ggccagacct gccctgcccc attgtccaga 6961 tgggcactgc tgtggagtct gcttctccca tgtaccaggg caccaggccc acccaactga 7021 aggcatggcg gcggggtgca ggggaaagtt aaaggtgatg acgatcatca cacctcgtgt 7081 cgttacctca gccatcggtc tagcatatca gtcactgggc ccaacatatc catttttaaa 7141 ccctttcccc caaatacact gcgtcctggt tcctgtttag ctgttctgaa ata // LOCUS HUMCACHNT 7364 bp mRNA PRI 31-OCT-1994 DEFINITION Human N-type calcium channel alpha-1 subunit mRNA, complete cds. ACCESSION M94172 NID g179757 KEYWORDS N-type calcium channel alpha-1 subunit. SOURCE Homo sapiens CNS cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7364) AUTHORS Williams,M.E., Brust,P.F., Feldman,D.H., Patthi,S., Simerson,S., Maroufi,A., McCue,A.F., Velicelebi,G., Ellis,S.B. and Harpold,M.M. TITLE Structure and functional expression of an omega-conotoxin-sensitive human N-type calcium channel JOURNAL Science 257 (5068), 389-395 (1992) MEDLINE 92335886 FEATURES Location/Qualifiers source 1..7364 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="IMR32" /cell_type="neuroblastoma" /tissue_type="CNS" /map="Unassigned" gene 146..7165 /gene="CCHL1A2" CDS 146..7165 /gene="CCHL1A2" /codon_start=1 /function="calcium influx" /db_xref="GDB:G00-128-872" /evidence=experimental /product="N-type calcium channel alpha-1 subunit" /db_xref="PID:g179758" /translation="MVRFGDELGGRYGGPGGGERARGGGAGGAGGPGPGGLQPGQRVL YKQSIAQRARTMALYNPIPVKQNCFTVNRSLFVFSEDNVVRKYAKRITEWPPFEYMIL ATIIANCIVLALEQHLPDGDKTPMSERLDDTEPYFIGIFCFEAGIKIIALGFVFHKGS YLRNGWNVMDFVVVLTGILATAGTDFDLRTLRAVRVLRPLKLVSGIPSLQVVLKSIMK AMVPLLQIGLLLFFAILMFAIIGLEFYMGKFHKACFPNSTDAEPVGDFPCGKEAPARL CEGDTECREYWPGPNFGITNFDNILFAILTVFQCITMEGWTDILYNTNDAAGNTWNWL YFIPLIIIGSFFMLNLVLGVLSGEFAKERERVENRRAFLKLRRQQQIERELNGYLEWI FKAEEVMLAEEDRNAEEKSPLDVLKRAATKKSRNDLIHAEEGEDRFADLCAVGSPFAR ASLKSGKTESSSYFRRKEKMFRFFIRRMVKAQSFYWVVLCVVALNTLCVAMVHYNQPR RLTTTLYFAEFVFLGLFLTEMSLKMYGLGPRSYFRSSFNCFDFGVIVGSVFEVVWAAI KPGSSFGISVLRALRLLRIFKVTKYWSSLRNLVVSLLNSMKSIISLLFLLFLFIVVFA LLGMQLFGGQFNFQDETPTTNFDTFPAAILTVFQILTGEDWNAVMYHGIESQGGVSKG MFSSFYFIVLTLFGNYTLLNVFLAIAVDNLANAQELTKDEEEMEEAANQKLALQKAKE VAEVSPMSAANISIAARQQNSAKARSVWEQRASQLRLQNLRASCEALYSEMDPEERLR FATTRHLRPDMKTHLDRPLVVELGRDGARGPVGGKARPEAAEAPEGVDPPRRHHRHRD KDKTPAAGDQDRAEAPKAESGEPGAREERPRPHRSHSKEAAGPPEARSERGRGPGPEG GRRHHRRGSPEEAAEREPRRHRAHRHQDPSKECAGAKGERRARHRGGPRAGPREAESG EEPARRHRARHKAQPAHEAVEKETTEKEATEKEAEIVEADKEKELRNHQPREPHCDLE TSGTVTVGPMHTLPSTCLQKVEEQPEDADNQRNVTRMGSQPPDPNTIVHIPVMLTGPL GEATVVPSGNVDLESQAEGKKEVEADDVMRSGPRPIVPYSSMFCLSPTNLLRRFCHYI VTMRYFEVVILVVIALSSIALAAEDPVRTDSPRNNALKYLDYIFTGVFTFEMVIKMID LGLLLHPGAYFRDLWNILDFIVVSGALVAFAFSGSKGKDINTIKSLRVLRVLRPLKTI KRLPKLKAVFDCVVNSLKNVLNILIVYMLFMFIFAVIAVQLFKGKFFYCTDESKELER DCRGQYLDYEKEEVEAQPRQWKKYDFHYDNVLWALLTLFTVSTGEGWPMVLKHSVDAT YEEQGPSPGYRMELSIFYVVYFVVFPFFFVNIFVALIIITFQEQGDKVMSECSLEKNE RACIDFAISAKPLTRYMPQNRQSFQYKTWTFVVSPPFEYFIMAMIALNTVVLMMKFYD APYEYELMLKCLNIVFTSMFSMECVLKIIAFGVLNYFRDAWNVFDFVTVLGSITDILV TEIAETNNFINLSFLRLFRAARLIKLLRQGYTIRILLWTFVQSFKALPYVCLLIAMLF FIYAIIGMQVFGNIALDDDTSINRHNNFRTFLQALMLLFRSATGEAWHEIMLSCLSNQ ACDEQANATECGSDFAYFYFVSFIFLCSFLMLNLFVAVIMDNFEYLTRDSSILGPHHL DEFIRVWAEYDPAACGRISYNDMFEMLKHMSPPLGLGKKCPARVAYKRLVRMNMPISN EDMTVHFTSTLMALIRTALEIKLAPAGTKQHQCDAELRKEISVVWANLPQKTLDLLVP PHKPDEMTVGKVYAALMIFDFYKQNKTTRDQMQQAPGGLSQMGPVSLFHPLKATLEQT QPAVLRGARVFLRQKSSTSLSNGGAIQNQESGIKESVSWGTQRTQDAPHEARPPLERG HSTEIPVGRSGALAVDVQMQSITRRGPDGEPQPGLESQGRAASMPRLAAETQPVTDAS PMKRSISTLAQRPRGTHLCSTTPDRPPPSQASSHHHHHRCHRRRDRKQRSLEKGPSLS ADMDGAPSSAVGPGLPPGEGPTGCRRERERRQERGRSQERRQPSSSSSEKQRFYSCDR FGGREPPKPKPSLSSHPTSPTAGQEPGPHPQGSGSVNGSPLLSTSGASTPGRGGRRQL PQTPLTPRPSITYKTANSSPIHFAGAQTSLPAFSPGRLSRGLSEHNALLQRDPLSQPL APGSRIGSDPYLGQRLDSEASVHALPEDTLTFEEAVATNSGRSSRTSYVSSLTSQSHP LRRVPNGYHCTLGLSSGGRARHSYHHPDQDHWC" BASE COUNT 1444 a 2278 c 2216 g 1426 t ORIGIN 1 gcggcggcgg ctgcggcggt ggggccgggc gaggtccgct gcggtcccgg cggctccgtg 61 gctgctccgc tctgagcgcc tggcgcgccc cgcgccctcc ctgccggggc cgctgggccg 121 gggatgcacg cggggcccgg gagccatggt ccgcttcggg gacgagctgg gcggccgcta 181 tggaggcccc ggcggcggag agcgggcccg gggcggcggg gccggcgggg cggggggccc 241 gggtcccggg gggctgcagc ccggccagcg ggtcctctac aagcaatcga tcgcgcagcg 301 cgcgcggacc atggcgctgt acaaccccat cccggtcaag cagaactgct tcaccgtcaa 361 ccgctcgctc ttcgtcttca gcgaggacaa cgtcgtccgc aaatacgcga agcgcatcac 421 cgagtggcct ccattcgagt atatgatcct ggccaccatc atcgccaact gcatcgtgct 481 ggccctggag cagcacctcc ctgatgggga caaaacgccc atgtccgagc ggctggacga 541 cacggagccc tatttcatcg ggatcttttg cttcgaggca gggatcaaaa tcatcgctct 601 gggctttgtc ttccacaagg gctcttacct gcggaacggc tggaacgtca tggacttcgt 661 ggtcgtcctc acagggatcc ttgccacggc tggaactgac ttcgacctgc gaacactgag 721 ggctgtgcgt gtgctgaggc ccctgaagct ggtgtctggg attccaagtt tgcaggtggt 781 gctcaagtcc atcatgaagg ccatggttcc actcctgcag attgggctgc ttctcttctt 841 tgccatcctc atgtttgcca tcattggcct ggagttctac atgggcaagt tccacaaggc 901 ctgtttcccc aacagcacag atgcggagcc cgtgggtgac ttcccctgtg gcaaggaggc 961 cccagcccgg ctgtgcgagg gcgacactga gtgccgggag tactggccag gacccaactt 1021 tggcatcacc aactttgaca atatcctgtt tgccatcttg acggtgttcc agtgcatcac 1081 catggagggc tggactgaca tcctctataa tacaaacgat gcggccggca acacctggaa 1141 ctggctctac ttcatccctc tcatcatcat cggctccttc ttcatgctca acctggtgct 1201 gggcgtgctc tcgggggagt ttgccaagga gcgagagagg gtggagaacc gccgcgcctt 1261 cctgaagctg cgccggcagc agcagatcga gcgagagctc aacgggtacc tggagtggat 1321 cttcaaggcg gaggaagtca tgctggccga ggaggacagg aatgcagagg agaagtcccc 1381 tttggacgtg ctgaagagag cggccaccaa gaagagcaga aatgacctga tccacgcaga 1441 ggagggagag gaccggtttg cagatctctg tgctgttgga tcccccttcg cccgcgccag 1501 cctcaagagc gggaagacag agagctcgtc atacttccgg aggaaggaga agatgttccg 1561 gttttttatc cggcgcatgg tgaaggctca gagcttctac tgggtggtgc tgtgcgtggt 1621 ggccctgaac acactgtgtg tggccatggt gcattacaac cagccgcggc ggcttaccac 1681 gaccctgtat tttgcagagt ttgttttcct gggtctcttc ctcacagaga tgtccctgaa 1741 gatgtatggc ctggggccca gaagctactt ccggtcctcc ttcaactgct tcgactttgg 1801 ggtcatcgtg gggagcgtct ttgaagtggt ctgggcggcc atcaagccgg gaagctcctt 1861 tgggatcagt gtgctgcggg ccctccgcct gctgaggatc ttcaaagtca cgaagtactg 1921 gagctccctg cggaacctgg tggtgtccct gctgaactcc atgaagtcca tcatcagcct 1981 gctcttcttg ctcttcctgt tcattgtggt cttcgccctg ctggggatgc agctgtttgg 2041 gggacagttc aacttccagg atgagactcc cacaaccaac ttcgacacct tccctgccgc 2101 catcctcact gtcttccaga tcctgacggg agaggactgg aatgcagtga tgtatcacgg 2161 gatcgaatcg caaggcggcg tcagcaaagg catgttctcg tccttttact tcattgtcct 2221 gacactgttc ggaaactaca ctctgctgaa tgtctttctg gccatcgctg tggacaacct 2281 ggccaacgcc caagagctga ccaaggatga agaggagatg gaagaagcag ccaatcagaa 2341 gcttgctctg caaaaggcca aagaagtggc tgaagtcagc cccatgtctg ccgcgaacat 2401 ctccatcgcc gccaggcagc agaactcggc caaggcgcgc tcggtgtggg agcagcgggc 2461 cagccagcta cggctgcaga acctgcgggc cagctgcgag gcgctgtaca gcgagatgga 2521 ccccgaggag cggctgcgct tcgccactac gcgccacctg cggcccgaca tgaagacgca 2581 cctggaccgg ccgctggtgg tggagctggg ccgcgacggc gcgcgggggc ccgtgggagg 2641 caaagcccga cctgaggctg cggaggcccc cgagggcgtc gaccctccgc gcaggcacca 2701 ccggcaccgc gacaaggaca agacccccgc ggcgggggac caggaccgag cagaggcccc 2761 gaaggcggag agcggggagc ccggtgcccg ggaggagcgg ccgcggccgc accgcagcca 2821 cagcaaggag gccgcggggc ccccggaggc gcggagcgag cgcggccgag gcccaggccc 2881 cgagggcggc cggcggcacc accggcgcgg ctccccggag gaggcggccg agcgggagcc 2941 ccgacgccac cgcgcgcacc ggcaccagga tccgagcaag gagtgcgccg gcgccaaggg 3001 cgagcggcgc gcgcggcacc gcggcggccc ccgagcgggg ccccgggagg cggagagcgg 3061 ggaggagccg gcgcggcggc accgggcccg gcacaaggcg cagcctgctc acgaggctgt 3121 ggagaaggag accacggaga aggaggccac ggagaaggag gctgagatag tggaagccga 3181 caaggaaaag gagctccgga accaccagcc ccgggagcca cactgtgacc tggagaccag 3241 tgggactgtg actgtgggtc ccatgcacac actgcccagc acctgtctcc agaaggtgga 3301 ggaacagcca gaggatgcag acaatcagcg gaacgtcact cgcatgggca gtcagccccc 3361 agacccgaac actattgtac atatcccagt gatgctgacg ggccctcttg gggaagccac 3421 ggtcgttccc agtggtaacg tggacctgga aagccaagca gaggggaaga aggaggtgga 3481 agcggatgac gtgatgagga gcggcccccg gcctatcgtc ccatacagct ccatgttctg 3541 tttaagcccc accaacctgc tccgccgctt ctgccactac atcgtgacca tgaggtactt 3601 cgaggtggtc attctcgtgg tcatcgcctt gagcagcatc gccctggctg ctgaggaccc 3661 agtgcgcaca gactcgccca ggaacaacgc tctgaaatac ctggattaca ttttcactgg 3721 tgtctttacc tttgagatgg tgataaagat gatcgacttg ggactgctgc ttcaccctgg 3781 agcctatttc cgggacttgt ggaacattct ggacttcatt gtggtcagtg gcgccctggt 3841 ggcgtttgct ttctcaggat ccaaagggaa agacatcaat accatcaagt ctctgagagt 3901 ccttcgtgtc ctgcggcccc tcaagaccat caaacggctg cccaagctca aggctgtgtt 3961 tgactgtgtg gtgaactccc tgaagaatgt cctcaacatc ttgattgtct acatgctctt 4021 catgttcata tttgccgtca ttgcggtgca gctcttcaaa gggaagtttt tctactgcac 4081 agatgaatcc aaggagctgg agagggactg caggggtcag tatttggatt atgagaagga 4141 ggaagtggaa gctcagccca ggcagtggaa gaaatacgac tttcactacg acaatgtgct 4201 ctgggctctg ctgacgctgt tcacagtgtc cacgggagaa ggctggccca tggtgctgaa 4261 acactccgtg gatgccacct atgaggagca gggtccaagc cctgggtacc gcatggagct 4321 gtccatcttc tacgtggtct actttgtggt ctttcccttc ttcttcgtca acatctttgt 4381 ggctttgatc atcatcacct tccaggagca gggggacaag gtgatgtctg aatgcagcct 4441 ggagaagaac gagagggctt gcattgactt cgccatcagc gccaaacccc tgacacggta 4501 catgccccaa aaccggcagt cgttccagta taagacgtgg acatttgtgg tctccccgcc 4561 ctttgaatac ttcatcatgg ccatgatagc cctcaacact gtggtgctga tgatgaagtt 4621 ctatgatgca ccctatgagt acgagctgat gctgaaatgc ctgaacatcg tgttcacatc 4681 catgttctcc atggaatgcg tgctgaagat catcgccttt ggggtgctga actatttcag 4741 agatgcctgg aatgtctttg actttgtcac tgtgttggga agtattactg atattttagt 4801 aacagagatt gcggaaacga acaatttcat caacctcagc ttcctccgcc tctttcgagc 4861 tgcgcggctg atcaagctgc tccgccaggg ctacaccatc cgcatcctgc tgtggacctt 4921 tgtccagtcc ttcaaggccc tgccctacgt gtgtctgctc attgccatgc tgttcttcat 4981 ctacgccatc atcggcatgc aggtgtttgg gaatattgcc ctggatgatg acaccagcat 5041 caaccgccac aacaacttcc ggacgttttt gcaagccctg atgctgctgt tcaggagcgc 5101 cacgggggag gcctggcacg agatcatgct gtcctgcctg agcaaccagg cctgtgatga 5161 gcaggccaat gccaccgagt gtggaagtga ctttgcctac ttctacttcg tctccttcat 5221 cttcctgtgc tcctttctga tgttgaacct ctttgtggct gtgatcatgg acaattttga 5281 gtacctcacg cgggactctt ccatcctagg tcctcaccac ttggatgagt tcatccgggt 5341 ctgggctgaa tacgacccgg ctgcgtgtgg gcgcatcagt tacaatgaca tgtttgagat 5401 gctgaaacac atgtccccgc ctctggggct ggggaagaaa tgccctgctc gagttgctta 5461 caagcgcctg gttcgcatga acatgcccat ctccaacgag gacatgactg ttcacttcac 5521 gtccacgctg atggccctca tccggacggc actggagatc aagctggccc cagctgggac 5581 aaagcagcat cagtgtgacg cggagttgag gaaggagatt tccgttgtgt gggccaatct 5641 gccccagaag actttggact tgctggtacc accccataag cctgatgaga tgacagtggg 5701 gaaggtttat gcagctctga tgatatttga cttctacaag cagaacaaaa ccaccagaga 5761 ccagatgcag caggctcctg gaggcctctc ccagatgggt cctgtgtccc tgttccaccc 5821 tctgaaggcc accctggagc agacacagcc ggctgtgctc cgaggagccc gggttttcct 5881 tcgacagaag agttccacct ccctcagcaa tggcggggcc atacaaaacc aagagagtgg 5941 catcaaagag tctgtctcct ggggcactca aaggacccag gatgcacccc atgaggccag 6001 gccacccctg gagcgtggcc actccacaga gatccctgtg gggcggtcag gagcactggc 6061 tgtggacgtt cagatgcaga gcataacccg gaggggccct gatggggagc cccagcctgg 6121 gctggagagc cagggtcgag cggcctccat gccccgcctt gcggccgaga ctcagcccgt 6181 cacagatgcc agccccatga agcgctccat ctccacgctg gcccagcggc cccgtgggac 6241 tcatctttgc agcaccaccc cggaccgccc accccctagc caggcgtcgt cgcaccacca 6301 ccaccaccgc tgccaccgcc gcagggacag gaagcagagg tccctggaga aggggcccag 6361 cctgtctgcc gatatggatg gcgcaccaag cagtgctgtg gggccggggc tgcccccggg 6421 agaggggcct acaggctgcc ggcgggaacg agagcgccgg caggagcggg gccggtccca 6481 ggagcggagg cagccctcat cctcctcctc ggagaagcag cgcttctact cctgcgaccg 6541 ctttgggggc cgtgagcccc cgaagcccaa gccctccctc agcagccacc caacgtcgcc 6601 aacagctggc caggagccgg gaccccaccc acagggcagt ggttccgtga atgggagccc 6661 cttgctgtca acatctggtg ctagcacccc cggccgcggt gggcggaggc agctccccca 6721 gacgcccctg actccccgcc ccagcatcac ctacaagacg gccaactcct cacccatcca 6781 cttcgccggg gctcagacca gcctccctgc cttctcccca ggccggctca gccgtgggct 6841 ttccgaacac aacgccctgc tgcagagaga ccccctcagc cagcccctgg cccctggctc 6901 tcgaattggc tctgaccctt acctggggca gcgtctggac agtgaggcct ctgtccacgc 6961 cctgcctgag gacacgctca ctttcgagga ggctgtggcc accaactcgg gccgctcctc 7021 caggacttcc tacgtgtcct ccctgacctc ccagtctcac cctctccgcc gcgtgcccaa 7081 cggttaccac tgcaccctgg gactcagctc gggtggccga gcacggcaca gctaccacca 7141 ccctgaccaa gaccactggt gctagctgca ccgtgaccgc tcagacgcct gcatgcagca 7201 ggcgtgtgtt ccagtggatg agttttatca tccacacggg gcagtcggcc ctcgggggag 7261 gccttgccca ccttggtgag gctcctgtgg cccctccctc cccctcctcc cctcttttac 7321 tctagacgac gaataaagcc ctgttgcttg agtgtacgta ccgc // LOCUS HUMCACNLA 6160 bp mRNA PRI 31-OCT-1994 DEFINITION Human dihydropyridine-sensitive L-type calcium channel alpha-1 subunit (CACNL1A3) mRNA, complete cds. ACCESSION L33798 NID g563322 KEYWORDS calcium channel; dihydropyridine-sensitive L-type calcium channel alpha-1 subunit. SOURCE Homo sapiens female adult skeletal muscle cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6160) AUTHORS Hogan,K., Powers,P.A. and Gregg,R.G. TITLE Cloning of the human skeletal muscle alpha1 subunit of the dihydropyridine-sensitive caclium channels (CACNL1A3) JOURNAL Genomics (1994) In press FEATURES Location/Qualifiers source 1..6160 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /sex="female" /tissue_type="skeletal muscle" /map="1q31-32" 5'UTR 1..225 /gene="CACNL1A3" /note="G00-126-431; putative" gene 1..6159 /gene="CACNL1A3" CDS 226..5847 /gene="CACNL1A3" /note="putative" /codon_start=1 /db_xref="GDB:G00-126-431" /product="dihydropyridine-sensitive L-type calcium channel alpha-1 subunit" /db_xref="PID:g563323" /translation="MEPSSPQDEGLRKKQPKKPVPEILPRPPRALFCLTLENPLRKAC ISIVEWKPFETIILLTIFANCVALAVYLPMPEDDNNSLNLGLEKLEYFFLIVFSIEAA MKIIAYGFLFHQDAYLRSGWNVLDFTIVFLGVFTVILEQVNVIQSHTAPMSSKGAGLD VKALRAFRVLRPLRLVSGVPSLQVVLNSIFKAMLPLFHIALLVLFMVIIYAIIGLELF KGKMHKTCYFIGTDIVATVENEEPSPCARTGSGRRCTINGSECRGGCPGPNHGITHFD NFGFSMLTVYQCITMEGWTDVLYWVNDAIGNEWPWIYFVTLILLGSFFILNLVLGVLS GEFTKEREKAKSRGTFQKLREKQQLDEDLRGYMSWITQGEVMDVEDFREGKLSLDEGG SDTESLYEIAGLNKIIQFIRHWRQWNRIFRWKCHDIVKSKVFYWLVILIVALNTLSIA SEHHNQPHWLTRLQDIANRVLLSLFTTEMLMKMYGLGLRQYFMSIFNRFDCFVVCSGI LEILLVESGAMTPLGISVLRCIRLLRIFKITKYWTSLSNLVASLLNSIRSIASLLLLL FLFIVIFRLLGMQLFGGRYDFEDTEVRRSNFDNFPQALISVFQVLTGEDWTSMMYNGI MASSGPSYPGMLVCIYFIILFVCGNYILLNVFLAIAVDNLAEAESLTSAQKAKAEEKK RRKMSKGLPDKSEEEKSTMAKKLEQKPKGEGIPTTAKLKIDEFESNVNEVKDPYPSAD FPGDDEEDEPEIPLSPRPRPLAELQLKEKAVPIPEASSFFIFSPTNKIRVLCHRIVNA TWFTNFILLFILLSSAALAAEDPIRADSMRNQILKHFDIGFTSVFTVEIVLKMTTYGA FLHKGSFCRNYFNMLDLLVVAVSLISMGLESSAISVVKILRVLRVLRPLRAINRAKGL KHVARCMFVAISTIGNIVLVTTLLQFMFACIGVQLFKGKFFRCTDLSKMTEEECRGYY YVYKDGDPMQIELRHREWVHSDFHFDNVLSAMMSLFTVSTFEGWPQLLYKAIDSNAED VGPIYNNRVEMAIFFIIYIILIAFFMMNIFVGFVIVTFQEQGETEYKNCELDKNQRQC VQYALKARPLRCYIPKNPYQYQVWYIVTSSYFEYLMFALIMLNTICLGMQHYNQSEQM NHISDILNVAFTIIFTLEMILKLMAFKARGYFGNPWNVFDFLIVIGSIIDVILSEIDT FLASSGGLYCLGGGCGNVDPDESARISSAFFRLFRVMRLIKLLSRAEGVRTLLWTFIK SFQALPYVALLIVMLFFIYAVIGMQMFGKIALVDGTQINRNNNFQTFPQAVLLLFRCA TGEAWQEILLACSYGKLCDPESDYAPGEEYTCGTNFAYYYFISFYMLCAFLVINLFVA VIMDNFDYLTRDWSILGPHHLDEFKAIWAEYDPEAKGRIKHLDVVTLLRRIQPPLGFG KFCPHRVACKRLVGMNMPLNSDGTVTFNATLFALVRTALKIKTEGNFEQANEELRAII KKIWKRTSMKLLDQVIPPIGDDEVTVGKFYATFLIQEHFRKFMKRQEEYYGYRPKKDI VQIQAGLRTIEEEAAPEICRTVSGDLAAEEELERAMVEAAMEEGIFRRTGGLFGQVDN FLERTNSLPPVMANQRPLQFAEIEMEEMESPVFLEDFPQDPRTNPLARANTNNANANV AYANSNHSNSHVFSSVHYEREFPEETETPATRGRALGQPCRSLGPHSKPCVEMLKGLL TQRAMPRGQAPPAPCQCPRVESSMPEDRKSSTPGSLHEETPHSRSTRENTSRCSAPAT ALLIQKALVRGGLGTLAADANFIMATGQALGDACQMEPEEVEIMATELLKGREAPDGM ASSLGCLNLGSSLGSLDQHQGSQETLIPPRL" 3'UTR 5848..6159 /gene="CACNL1A3" /note="G00-126-431" polyA_site 6160 /gene="CACNL1A3" /note="G00-126-431" BASE COUNT 1389 a 1796 c 1672 g 1303 t ORIGIN 1 tcaggccggc agcggggagc cgagtggagg ctaattttac ttgctgggag cgaggagagt 61 aatcctcctg ccccccactc ctgccccgcc ccctggctgg ctcagcaggg cagctcagcc 121 gacagcctca gccagcctag tccccaaggc gggggcattg gggacacagg gaagggaaag 181 cactggggtg ggggagcagg agaaagccag attcccaggg aagccatgga gccatcctca 241 ccccaggatg aaggcctgag gaagaaacag cccaagaagc cagttcctga gattctgcca 301 aggccacccc gggctttgtt ctgcctgacc ctggagaacc ccctgaggaa ggcctgcatc 361 agcattgtag aatggaagcc cttcgagacg atcatcttgc tcaccatctt tgccaattgt 421 gtggccctgg ccgtgtacct gcccatgccg gaagatgaca acaactctct gaacctcggc 481 ctggagaagc tggagtattt cttcctcatt gtcttctcga ttgaagccgc catgaagatc 541 attgcctacg gcttcttatt ccaccaggac gcttacctgc gcagtggctg gaatgtgctg 601 gacttcacca ttgtcttcct gggggtcttc accgtgattc tggaacaggt taacgtcatc 661 caaagccaca cagccccaat gagcagcaaa ggagccggct tggatgtcaa ggccctcaga 721 gccttccgag tgctcagacc cctccggctg gtgtcggggg tgcctagcct gcaggtggtc 781 ctgaactcca tcttcaaggc catgctcccc ctctttcaca tcgccctgct ggtcctcttt 841 atggtcatca tctatgccat catcgggctg gagctcttca agggcaagat gcacaagacc 901 tgctacttca ttggtacaga tatcgtggcc acggtggaga atgaagagcc atcgccctgc 961 gccaggacgg gctcagggcg ccggtgcacc atcaatggca gtgagtgccg gggcggctgc 1021 ccagggccca accatggcat cacccacttc gacaacttcg gcttctccat gctcaccgtg 1081 taccagtgca ttaccatgga gggatggact gacgtccttt actgggtcaa tgatgccatc 1141 gggaatgagt ggccctggat ctattttgtc accctcattt tgctgggatc cttcttcatc 1201 ctcaacctgg tgctgggtgt cctgagtggg gaattcacca aggagcggga gaaggccaag 1261 tccaggggaa ccttccagaa gctccgggag aagcagcaac tagatgagga ccttcggggc 1321 tacatgagct ggatcacgca gggcgaggtc atggatgttg aggacttcag agaaggaaaa 1381 ctgtctttgg atgaaggtgg ctctgacaca gagagcctgt atgaaattgc aggcttgaac 1441 aaaatcatcc agttcatccg acattggagg cagtggaacc gcatctttcg ctggaagtgc 1501 catgacatcg tgaagtccaa ggtcttctat tggctggtga ttctcatcgt tgccctcaac 1561 accctgtcta tcgcctcaga gcaccacaac cagccgcact ggctgacccg tttgcaagac 1621 attgccaacc gggtgctgct gtccctcttc accactgaga tgctgatgaa gatgtacggg 1681 ctgggcctgc gccagtactt catgtctatc ttcaaccgct tcgactgctt cgtggtgtgc 1741 agcggtatcc tggagatcct gctggtggag tcgggcgcca tgacacccct gggcatctcc 1801 gtgctccgct gcatccgcct cctgaggatc ttcaagatca ccaaatattg gacgtcgctg 1861 agcaacctgg tggcatccct gctcaactcc atccgctcca tcgcctccct gctgctgctg 1921 ctcttcctct tcatcgtcat cttccgcctc ctgggcatgc agctctttgg ggggaggtat 1981 gactttgaag acacagaagt acggcgcagc aactttgaca actttcccca agccctcatc 2041 agcgtcttcc aggtactgac aggggaagac tggacctcaa tgatgtacaa tgggatcatg 2101 gcctcgagcg ggccgtccta ccctggcatg cttgtgtgca tttacttcat catccttttc 2161 gtctgtggca actacatcct gctcaatgtc ttcctggcca ttgccgtgga caacctggcc 2221 gaggcggaga gcctgacttc tgcccagaag gccaaggctg aggagaaaaa acgcaggaag 2281 atgtccaagg gtctcccaga caagtcagaa gaggagaagt caacgatggc caagaagctg 2341 gagcagaaac ccaagggtga gggcatcccc accactgcca agctgaaaat cgatgagttt 2401 gaatctaatg tcaatgaggt gaaggatccc tacccctcag ccgacttccc aggggatgac 2461 gaggaagatg agcctgagat cccgctgagc ccccgaccac gtcccctggc tgagctgcag 2521 ctgaaagaga aggccgtgcc cattccagaa gccagctcct tcttcatctt cagccccacc 2581 aataagatcc gtgtcctgtg tcaccgcatc gtcaatgcca cctggttcac caacttcatc 2641 ctgctcttca tcctgctcag cagcgctgca ctggctgcgg aagaccccat ccgggctgat 2701 tccatgagaa atcagatcct taaacacttt gacatcgggt tcacctctgt cttcactgtg 2761 gagattgtcc tcaagatgac gacctacgga gccttcctgc acaagggttc cttctgccgc 2821 aattacttca acatgctgga cctgctggtg gtggccgtgt ccctcatctc catgggactt 2881 gagtccagtg ccatctccgt ggtgaagatc ctgagggtgc tgagggtgct ccgaccactc 2941 agagccatca acagagccaa ggggttgaag cacgtggcta ggtgcatgtt cgtggccatc 3001 agcaccatcg ggaacatcgt gctggtcact accctcctac agttcatgtt tgcctgcatc 3061 ggcgtccagc tcttcaaggg gaagttcttc aggtgcaccg acttgtccaa gatgacagag 3121 gaggagtgca ggggctacta ctacgtgtac aaggacgggg accccatgca gatagagctg 3181 cgtcaccgcg agtgggtaca cagcgacttc cacttcgaca atgtgctctc agccatgatg 3241 tccctcttca cggtctccac cttcgaggga tggcctcagc tgctgtacaa ggccatagac 3301 tccaatgcgg aggacgtggg tcccatctac aacaaccgtg tggagatggc catcttcttc 3361 atcatctaca tcatcctcat tgccttcttc atgatgaaca tctttgtggg cttcgtcatt 3421 gtcaccttcc aggagcaggg agagactgag tacaagaact gtgagctgga caagaaccag 3481 cgccaatgtg tacagtatgc cctgaaggcc cgcccactga ggtgctacat tcccaaaaac 3541 ccataccagt accaggtgtg gtacattgtc acctcctcct actttgaata cctgatgttt 3601 gccctcatca tgctcaacac catctgcctc ggcatgcagc actacaacca gtcggagcag 3661 atgaaccaca tctcagacat cctcaatgtg gccttcacta tcatcttcac cctggagatg 3721 atcctcaagc tcatggcctt caaggccagg ggctactttg gaaacccctg gaatgtgttt 3781 gacttcctga ttgtcattgg cagcatcatt gatgtcatcc tcagtgagat cgacactttc 3841 ctggcctcca gcgggggact gtattgcctg ggtggaggct gcgggaacgt tgacccagat 3901 gagagtgccc gcatctccag cgccttcttc cgcctgttcc gtgtcatgag gctgatcaag 3961 ctgctgagcc gggcagaagg agtgcgaacc ctcctgtgga cgttcatcaa gtccttccag 4021 gccctaccct acgtggctct gctcatcgtc atgctcttct tcatctacgc tgtcatcggc 4081 atgcagatgt ttgggaagat cgccttggtg gatgggaccc aaataaaccg gaacaacaac 4141 ttccagacct tcccacaagc tgtgctactg ctcttcaggt gtgcaacagg tgaggcctgg 4201 caggagatcc tactggcctg cagctatggg aagctgtgtg acccagagtc ggactatgcc 4261 ccaggggagg agtacacatg tggcaccaac tttgcatact actacttcat cagcttctac 4321 atgctctgtg ccttcctggt catcaacctc tttgtggctg tcatcatgga caattttgac 4381 tacctcaccc gggactggtc catcctgggc cctcatcacc tggatgagtt caaggccatc 4441 tgggcagagt atgacccaga ggctaagggg aggatcaaac acctggacgt ggtgaccctg 4501 ctgagaagga ttcagccccc tctgggcttt gggaagttct gcccacatcg ggtagcttgt 4561 aagcggctgg tgggcatgaa catgcccctg aacagcgacg gcacagtcac cttcaatgcc 4621 acactctttg ccctggtccg cacggcactc aagatcaaga cggaaggtaa ctttgagcag 4681 gccaacgagg agctgagggc catcatcaag aagatctgga agagaaccag catgaagctc 4741 ttggaccagg tcatccctcc aataggagat gatgaggtga cagtggggaa gttctacgcc 4801 acattcctca tccaggagca cttccggaag ttcatgaaac gccaagagga gtattatggc 4861 tatcggccca agaaggacat tgtacagatc caggcagggc tgcggaccat tgaggaagag 4921 gcagcccccg agatctgtcg cacggtctca ggagacctgg ctgctgagga ggagctggag 4981 agagccatgg tggaggctgc gatggaggag gggatattcc ggaggactgg aggcctgttt 5041 ggccaggtgg acaacttcct ggaaaggacc aactccctgc cccctgtcat ggccaatcag 5101 agacccctcc agtttgctga gatagagatg gaagagatgg agtcacctgt cttcttggag 5161 gacttcccac aagatccacg caccaacccc ctggctcgtg ccaataccaa caatgccaac 5221 gccaatgtcg cctatgcgaa cagcaaccat agcaacagcc atgtgttttc cagtgtccac 5281 tatgaaaggg agttcccaga agagacagag acgcctgcta ccagaggacg agcccttggc 5341 caaccctgca ggtccctggg accccacagc aaaccctgtg tggagatgct gaagggactg 5401 ctgacccaga gggcaatgcc cagaggccag gcacctcctg ccccctgcca gtgccccagg 5461 gtggagtcct ccatgcctga ggacagaaag agctccacac cagggtctct tcatgaggag 5521 acaccccaca gcaggagcac cagggagaat acttccaggt gctcagcacc agctacagcc 5581 ctgctgatcc aaaaggctct ggttcgaggg ggcctgggca ccttggcagc tgatgcaaac 5641 ttcatcatgg caacaggcca ggccctcgga gatgcctgcc aaatggaacc agaggaagtg 5701 gagatcatgg caacagagct actgaaagga cgagaggccc cagacggcat ggccagctcc 5761 ctgggatgcc tgaacctcgg gtcctccctg ggcagcctcg accaacacca gggctcccag 5821 gagaccctta ttcctccaag gctgtgatgc ccacacagca tcagcatggg cttagagctg 5881 gcatgaccaa tgggggtggg gaagttgctg gggtggagaa gggctagccc accgcagcag 5941 cctccctccc tctcagcagc tagatgcatg cctgaggcag ggtggtcagg aaccacctca 6001 aaaagtgcgg aggaagtagc tggacaggcc ctgcccctca ccagcaagag gcatgattgg 6061 atggagcttc taatgtcatt caaaaaggcc tggtcagtgc ctgtccctag ggccactccc 6121 acctgcagga cattaaaatc tccaggcctg tgacactggc // LOCUS HUMCACNLB 3600 bp mRNA PRI 31-OCT-1994 DEFINITION Human neuronal DHP-sensitive, voltage-dependent, calcium channel alpha-2b subunit mRNA, complete cds. ACCESSION M76559 NID g179761 KEYWORDS neuronal DHP-sensitive calcium channel; transmembrane protein; voltage-dependent calcium channel alpha-2b subunit; voltage-gated calcium channel. SOURCE Homo sapiens (tissue library: lambda gt11 cDNA) central nervous system cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3600) AUTHORS Williams,M.E., Feldman,D.H., McCue,A.F., Brenner,R., Velicelebi,G., Ellis,S.B. and Harpold,M.M. TITLE Structure and functional expression of alpha 1, alpha 2, and beta subunits of a novel human neuronal calcium channel subtype JOURNAL Neuron 8 (1), 71-84 (1992) MEDLINE 92110010 FEATURES Location/Qualifiers source 1..3600 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="central nervous system" /tissue_lib="lambda gt11 cDNA" /map="Unassigned" gene 35..3310 /gene="CCHL1A2" CDS 35..3310 /gene="CCHL1A2" /codon_start=1 /db_xref="GDB:G00-128-872" /product="calcium channel alpha-2b subunit" /db_xref="PID:g179762" /translation="MAAGCLLALTLTLFQSLLIGPSSEEPFPSAVTIKSWVDKMQEDL VTLAKTASGVNQLVDIYEKYQDLYTVEPNNARQLVEIAARDIEKLLSNRSKALVSLAL EAEKVQAAHQWREDFASNEVVYYNAKDDLDPEKNDSEPGSQRIKPVFIEDANFGRQIS YQHAAVHIPTDIYEGSTIVLNELNWTSALDEVFKKNREEDPSLLWQVFGSATGLARYY PASPWVDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSV SEMLETLSDDDFVNVASFNSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGITDYK KGFSFAFEQLLNYNVSRANCNKIIMLFTDGGEERAQEIFNKYNKDKKVRVFRFSVGQH NYERGPIQWMACENKGYYYEIPSIGAIRINTQEYLDVLGRPMVLAGDKAKQVQWTNVY LDALELGLVITGTLPVFNITGQFENKTNLKNQLILGVMGVDVSLEDIKRLTPRFTLCP NGYYFAIDPNGYVLLHPNLQPKNPKSQEPVTLDFLDAELENDIKVEIRNKMIDGESGE KTFRTLVKSQDERYIDKGNRTYTWTPVNGTDYSLALVLPTYSFYYIKAKLEETITQAR SKKGKMKDSETLKPDNFEESGYTFIAPRDYCNDLKISDNNTEFLLNFNEFIDRKTPNN PSCNADLINRVLLDAGFTNELVQNYWSKQKNIKGVKARFVVTDGGITRVYPKEAGENW QENPETYEDSFYKRSLDNDNYVFTAPYFNKSGPGAYESGIMVSKAVEIYIQGKLLKPA VVGIKIDVNSWIENFTKTSIRDPCAGPVCDCKRNSDVMDCVILDDGGFLLMANHDDYT NQIGRFFGEIDPSLMRHLVNISVYAFNKSYDYQSVCEPGAAPKQGAGHRSAYVPSVAD ILQIGWWATAAAWSILQQFLLSLTFPRLLEAVEMEDDDFTASLSKQSCITEQTQYFFD NDSKSFSGVLDCGNCSRIFHGEKLMNTNLIFIMVESKGTCPCDTRLLIQAEQTSDGPN PCDMVKQPRYRKGPDVCFDNNVLEDYTDCGGVSGLNPSLWYIIGIQFLLLWLVSGSTH RLL" BASE COUNT 1139 a 680 c 805 g 976 t ORIGIN 1 gcgggggagg gggcattgat cttcgatcgc gaagatggct gctggctgcc tgctggcctt 61 gactctgaca cttttccaat ctttgctcat cggcccctcg tcggaggagc cgttcccttc 121 ggccgtcact atcaaatcat gggtggataa gatgcaagaa gaccttgtca cactggcaaa 181 aacagcaagt ggagtcaatc agcttgttga tatttatgag aaatatcaag atttgtatac 241 tgtggaacca aataatgcac gccagctggt agaaattgca gccagggata ttgagaaact 301 tctgagcaac agatctaaag ccctggtgag cctggcattg gaagcggaga aagttcaagc 361 agctcaccag tggagagaag attttgcaag caatgaagtt gtctactaca atgcaaagga 421 tgatctcgat cctgagaaaa atgacagtga gccaggcagc cagaggataa aacctgtttt 481 cattgaagat gctaattttg gacgacaaat atcttatcag cacgcagcag tccatattcc 541 tactgacatc tatgagggct caacaattgt gttaaatgaa ctcaactgga caagtgcctt 601 agatgaagtt ttcaaaaaga atcgcgagga agacccttca ttattgtggc aggtttttgg 661 cagtgccact ggcctagctc gatattatcc agcttcacca tgggttgata atagtagaac 721 tccaaataag attgaccttt atgatgtacg cagaagacca tggtacatcc aaggagctgc 781 atctcctaaa gacatgctta ttctggtgga tgtgagtgga agtgttagtg gattgacact 841 taaactgatc cgaacatctg tctccgaaat gttagaaacc ctctcagatg atgatttcgt 901 gaatgtagct tcatttaaca gcaatgctca ggatgtaagc tgttttcagc accttgtcca 961 agcaaatgta agaaataaaa aagtgttgaa agacgcggtg aataatatca cagccaaagg 1021 aattacagat tataagaagg gctttagttt tgcttttgaa cagctgctta attataatgt 1081 ttccagagca aactgcaata agattattat gctattcacg gatggaggag aagagagagc 1141 ccaggagata tttaacaaat acaataaaga taaaaaagta cgtgtattca ggttttcagt 1201 tggtcaacac aattatgaga gaggacctat tcagtggatg gcctgtgaaa acaaaggtta 1261 ttattatgaa attccttcca ttggtgcaat aagaatcaat actcaggaat atttggatgt 1321 tttgggaaga ccaatggttt tagcaggaga caaagctaag caagtccaat ggacaaatgt 1381 gtacctggat gcattggaac tgggacttgt cattactgga actcttccgg tcttcaacat 1441 aaccggccaa tttgaaaata agacaaactt aaagaaccag ctgattcttg gtgtgatggg 1501 agtagatgtg tctttggaag atattaaaag actgacacca cgttttacac tgtgccccaa 1561 tgggtattac tttgcaatcg atcctaatgg ttatgtttta ttacatccaa atcttcagcc 1621 aaagaacccc aaatctcagg agccagtaac attggatttc cttgatgcag agttagagaa 1681 tgatattaaa gtggagattc gaaataagat gattgatggg gaaagtggag aaaaaacatt 1741 cagaactctg gttaaatctc aagatgagag atatattgac aaaggaaaca ggacatacac 1801 atggacacct gtcaatggca cagattacag tttggccttg gtattaccaa cctacagttt 1861 ttactatata aaagccaaac tagaagagac aataactcag gccagatcaa aaaagggcaa 1921 aatgaaggat tcggaaaccc tgaagccaga taattttgaa gaatctggct atacattcat 1981 agcaccaaga gattactgca atgacctgaa aatatcggat aataacactg aatttctttt 2041 aaatttcaac gagtttattg atagaaaaac tccaaacaac ccatcatgta acgcggattt 2101 gattaataga gtcttgcttg atgcaggctt tacaaatgaa cttgtccaaa attactggag 2161 taagcagaaa aatatcaagg gagtgaaagc acgatttgtt gtgactgatg gtgggattac 2221 cagagtttat cccaaagagg ctggagaaaa ttggcaagaa aacccagaga catatgagga 2281 cagcttctat aaaaggagcc tagataatga taactatgtt ttcactgctc cctactttaa 2341 caaaagtgga cctggtgcct atgaatcggg cattatggta agcaaagctg tagaaatata 2401 tattcaaggg aaacttctta aacctgcagt tgttggaatt aaaattgatg taaattcctg 2461 gatagagaat ttcaccaaaa cctcaatcag agatccgtgt gctggtccag tttgtgactg 2521 caaaagaaac agtgacgtaa tggattgtgt gattctggat gatggtgggt ttcttctgat 2581 ggcaaatcat gatgattata ctaatcagat tggaagattt tttggagaga ttgatcccag 2641 cttgatgaga cacctggtta atatatcagt ttatgctttt aacaaatctt atgattatca 2701 gtcagtatgt gagcccggtg ctgcaccaaa acaaggagca ggacatcgct cagcatatgt 2761 gccatcagta gcagacatat tacaaattgg ctggtgggcc actgctgctg cctggtctat 2821 tctacagcag tttctcttga gtttgacctt tccacgactc cttgaggcag ttgagatgga 2881 ggatgatgac ttcacggcct ccctgtccaa gcagagctgc attactgaac aaacccagta 2941 tttcttcgat aacgacagta aatcattcag tggtgtatta gactgtggaa actgttccag 3001 aatctttcat ggagaaaagc ttatgaacac caacttaata ttcataatgg ttgagagcaa 3061 agggacatgt ccatgtgaca cacgactgct catacaagcg gagcagactt ctgacggtcc 3121 aaatccttgt gacatggtta agcaacctag ataccgaaaa gggcctgatg tctgctttga 3181 taacaatgtc ttggaggatt atactgactg tggtggtgtt tctggattaa atccctccct 3241 gtggtatatc attggaatcc agtttctact actttggctg gtatctggca gcacacaccg 3301 gctgttatga ccttctaaaa accaaatctg catagttaaa ctccagaccc tgccaaaaca 3361 tgagccctgc cctcaattac agtaacgtag ggtcagctat aaaatcagac aaacattagc 3421 tgggcctgtt ccatggcata acactaaggc gcagactcct aaggcaccca ctggctgcat 3481 gtcagggtgt cagatcctta aacgtgtgtg aatgctgcat catctatgtg taacatcaaa 3541 gcaaaatcct atacgtgtcc tctattggaa aatttgggcg tttgttgttg cattgttggt // LOCUS HUMCACNLG 1258 bp mRNA PRI 31-OCT-1994 DEFINITION Homo sapiens DHP-sensitive calcium channel gamma subunit (CACNLG) mRNA, complete cds. ACCESSION L07738 NID g306472 KEYWORDS DHP-sensitive calcium channel gamma subunit. SOURCE Homo sapiens fetal skeletal muscle cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1258) AUTHORS Powers,P.A., Liu,S., Hogan,K. and Gregg,R.G. TITLE Molecular characterization of the gene encoding the gamma subunit of the human skeletal muscle 1,4-dihydropyridine-sensitive Ca2+ channel (CACNLG), cDNA sequence, gene structure, and chromosomal location JOURNAL J. Biol. Chem. 268 (13), 9275-9279 (1993) MEDLINE 93252787 FEATURES Location/Qualifiers source 1..1258 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="skeletal muscle" gene 72..740 /gene="CACNLG" CDS 72..740 /gene="CACNLG" /note="DHP-sensitive" /codon_start=1 /db_xref="GDB:G00-132-015" /product="calcium channel gamma subunit" /db_xref="PID:g306473" /translation="MSQTKMLKVRVTLFCILAGIVLAMTAVVTDHWAVLSPHMEHHNT TCEAAHFGLWRICTKRIPMDDSKTCGPITLPGEKNCSYFRHFNPGESSEIFEFTTQKE YSISAAAIAIFSLGFIILGSLCVLLSLGKKRDYLLRPASMFYAFAGLCILVSVEVMRQ SVKRMIDSEDTVWIEYYYSWSFACACAAFILLFLGGLALLLFSLPRMPRNPWESCMDA EPEH" polyA_site 1258 BASE COUNT 231 a 417 c 338 g 272 t ORIGIN 1 cggcttgtca cctgccctag gagacgcagc cgccggaccc tgcccagggc acccacgcct 61 cggcgaccac catgtcccag accaaaatgc tgaaggtccg cgtgaccctc ttctgcatcc 121 tggcaggcat cgtgctggcc atgacagccg tggtaaccga ccactgggct gtgctgagcc 181 cccacatgga gcaccacaac actacctgcg aggcggccca cttcggcctc tggcggattt 241 gtaccaagcg catccccatg gacgacagca agacctgcgg gcccatcacc ctgcccgggg 301 agaagaactg ttcctacttc aggcatttta accccggcga gagctcggag atcttcgaat 361 tcaccactca gaaggagtac agcatctcgg cagccgccat cgccatcttc agccttggct 421 tcatcatcct gggcagcctc tgtgtcctcc tgtccctcgg gaagaagagg gactatctgc 481 tgcgacccgc gtccatgttc tatgcctttg caggtctctg catcctcgtc tcggtggagg 541 tcatgcggca gtcggtgaag cgcatgattg acagtgagga caccgtctgg atcgagtact 601 attactcctg gtcctttgcc tgcgcctgtg ccgccttcat cctcctcttt ctcggcggtc 661 tcgccctcct gctgttctcc ctgcctcgaa tgccccggaa cccatgggag tcctgcatgg 721 atgctgagcc cgagcactaa ccctctgcgg ccctagcgac cctcaggctt cttccccagg 781 aagcggggtc ttggcctgga accttccaga gaggaggcgg gagcaatttt agccccaccc 841 tgctcccatc tgcccccctg caacagctgc aggctgcttc ctctctctga gttcctctgg 901 gctgcgcagg ctcccctggg aatagagcaa gacgtgagtc ctaacctggc cacagttggg 961 ggagcagagc cagcaggtgg acaggtgttt gcaggggccc aacttcccct ggagctcaga 1021 ggggggccca ctgtaccagc ctctgataag ctgcctccag ttgtccttta tgaacattgc 1081 agggacaacc tgtgtttgcc agctgggtgt tccgtgtaaa tagccagcct gtctctttct 1141 cggtgataaa acacacgcgc tcctggagcc caggcctccc cctccttggc ttccaggagc 1201 ctggaagcat ttttaactgg gtagaatctg actgtggctt gaaataaaaa gctctcag // LOCUS HUMCAIIA 1551 bp mRNA PRI 31-OCT-1994 DEFINITION Human carbonic anhydrase II mRNA, complete cds. ACCESSION J03037 NID g179771 KEYWORDS carbonic anhydrase. SOURCE Human kidney, cDNA to mRNA, (library of G.I.Bell), clone lambda-HM3. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1551) AUTHORS Murakami,H., Marelich,G.P., Grubb,J.H., Kyle,J.W. and Sly,W.S. TITLE Cloning, expression, and sequence homologies of cDNA for human carbonic anhydrase II JOURNAL Genomics 1 (2), 159-166 (1987) MEDLINE 88085190 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by R.D.Miller, 24-SEP-1987. FEATURES Location/Qualifiers source 1..1551 /organism="Homo sapiens" /db_xref="taxon:9606" /map="8q13-q22" mRNA <1..1551 /note="CA II mRNA" gene 66..848 /gene="CA2" CDS 66..848 /gene="CA2" /note="carbonic anhydrase II" /codon_start=1 /db_xref="GDB:G00-119-739" /db_xref="PID:g179772" /translation="MSHHWGYGKHNGPEHWHKDFPIAKGERQSPVDIDTHTAKYDPSL KPLSVSYDQATSLRILNNGHAFNVEFDDSQDKAVLKGGPLDGTYRLIQFHFHWGSLDG QGSEHTVDKKKYAAELHLVHWNTKYGDFGKAVQQPDGLAVLGIFLKVGSAKPGLQKVV DVLDSIKTKGKSADFTNFDPRGLLPESLDYWTYPGSLTTPPLLECVTWIVLKEPISVS SEQVLKFRKLNFNGEGEPEELMVDNWRPAQPLKNRQIKASFK" BASE COUNT 460 a 322 c 329 g 440 t ORIGIN 238 bp upstream of BamHI site; chromosome 8q22. 1 ggcgcccaag ccgccgccgc cagatcggtg ccgattcctg ccctgccccg accgccagcg 61 cgaccatgtc ccatcactgg gggtacggca aacacaacgg acctgagcac tggcataagg 121 acttccccat tgccaaggga gagcgccagt cccctgttga catcgacact catacagcca 181 agtatgaccc ttccctgaag cccctgtctg tttcctatga tcaagcaact tccctgagga 241 tcctcaacaa tggtcatgct ttcaacgtgg agtttgatga ctctcaggac aaagcagtgc 301 tcaagggagg acccctggat ggcacttaca gattgattca gtttcacttt cactggggtt 361 cacttgatgg acaaggttca gagcatactg tggataaaaa gaaatatgct gcagaacttc 421 acttggttca ctggaacacc aaatatgggg attttgggaa agctgtgcag caacctgatg 481 gactggccgt tctaggtatt tttttgaagg ttggcagcgc taaaccgggc cttcagaaag 541 ttgttgatgt gctggattcc attaaaacaa agggcaagag tgctgacttc actaacttcg 601 atcctcgtgg cctccttcct gaatccctgg attactggac ctacccaggc tcactgacca 661 cccctcctct tctggaatgt gtgacctgga ttgtgctcaa ggaacccatc agcgtcagca 721 gcgagcaggt gttgaaattc cgtaaactta acttcaatgg ggagggtgaa cccgaagaac 781 tgatggtgga caactggcgc ccagctcagc cactgaagaa caggcaaatc aaagcttcct 841 tcaaataaga tggtcccata gtctgtatcc aaataatgaa tcttcgggtg tttcccttta 901 gctaagcaca gatctacctt ggtgatttgg accctggttg ctttgtgtct agttttctag 961 acccttcatc tcttacttga tagacttact aataaaatgt gaagactaga ccaattgtca 1021 tgcttgacac aactgctgtg gctggttggt gctttgttta tggtagtagt ttttctgtaa 1081 cacagaatat aggataagaa ataagaataa agtaccttga ctttgttcac agcatgtagg 1141 gtgatgagca ctcacaattg ttgactaaaa tgctgctttt aaaacatagg aaagtagaat 1201 ggttgagtgc aaatccatag cacaagataa attgagctag ttaaggcaaa tcaggtaaaa 1261 tagtcatgat tctatgtaat gtaaaccaga aaaaataaat gttcatgatt tcaagatgtt 1321 atattaaaga aaaactttaa aaattattat atatttatag caaagttatc ttaaatatga 1381 attctgttgt aatttaatga cttttgaatt acagagatat aaatgaagta ttatctgtaa 1441 aaattgttat aattagagtt gtgatacaga gtatatttcc attcagacaa tatatcataa 1501 cttaataaat attgtatttt agatatattc tctaataaaa ttcagaattc t // LOCUS HUMCAIVA 1105 bp mRNA PRI 11-MAR-1992 DEFINITION Human carbonic anhydrase IV mRNA, complete cds. ACCESSION M83670 NID g179790 KEYWORDS carbonic anhydrase IV. SOURCE Homo sapiens kidney cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1105) AUTHORS Okuyama,T., Sato,S., Zhu,X.L., Waheed,A. and Sly,W.S. TITLE Human carbonic anhydrase IV: cDNA cloning, sequence comparison, and expression in COS cell membranes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89, 1315-1319 (1992) MEDLINE 92159040 FEATURES Location/Qualifiers source 1..1105 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" mRNA 1..1105 sig_peptide 48..101 CDS 48..986 /codon_start=1 /product="carbonic anhydrase IV" /db_xref="PID:g179791" /translation="MRMLLALLALSAARPSASAESHWCYEVQAESSNYPCLVPVKWGG NCQKDRQSPINIVTTKAKVDKKLGRFFFSGYDKKQTWTVQNNGHSVMMLLENKASISG GGLPAPYQAKQLHLHWSDLPYKGSEHSLDGEHFAMEMHIVHEKEKGTSRNVKEAQDPE DEIAVLAFLVEAGTQVNEGFQPLVEALSNIPKPEMSTTMAESSLLDLLPKEEKLRHYF RYLGSLTTPTCDEKVVWTVFREPIQLHREQILAFSQKLYYDKEQTVSMKDNVRPLQQL GQRTVIKSGAPGRPLPWALPALLGPMLACLLAGFLR" mat_peptide 102..983 /product="carbonic anhydrase IV" polyA_signal 1080..1085 BASE COUNT 248 a 321 c 321 g 215 t ORIGIN 1 gctcggtgcg cgaccccggc tcagaggact ctttgctgtc ccgcaagatg cggatgctgc 61 tggcgctcct ggccctctcc gcggcgcggc catcggccag tgcagagtca cactggtgct 121 acgaggttca agccgagtcc tccaactacc cctgcttggt gccagtcaag tggggtggaa 181 actgccagaa ggaccgccag tcccccatca acatcgtcac caccaaggca aaggtggaca 241 aaaaactggg acgcttcttc ttctctggct acgataagaa gcaaacgtgg actgtccaaa 301 ataacgggca ctcagtgatg atgttgctgg agaacaaggc cagcatttct ggaggaggac 361 tgcctgcccc ataccaggcc aaacagttgc acctgcactg gtccgacttg ccatataagg 421 gctcggagca cagcctcgat ggggagcact ttgccatgga gatgcacata gtacatgaga 481 aagagaaggg gacatcgagg aatgtgaaag aggcccagga ccctgaagac gaaattgcgg 541 tgctggcctt tctggtggag gctggaaccc aggtgaacga gggcttccag ccactggtgg 601 aggcactgtc taatatcccc aaacctgaga tgagcactac gatggcagag agcagcctgt 661 tggacctgct ccccaaggag gagaaactga ggcactactt ccgctacctg ggctcactca 721 ccacaccgac ctgcgatgag aaggtcgtct ggactgtgtt ccgggagccc attcagcttc 781 acagagaaca gatcctggca ttctctcaga agctgtacta cgacaaggaa cagacagtga 841 gcatgaagga caatgtcagg cccctgcagc agctggggca gcgcacggtg ataaagtccg 901 gggccccggg tcggccgctg ccctgggccc tgcctgccct gctgggcccc atgctggcct 961 gcctgctggc cggcttcctg cgatgatggc tcacttctgc acgcagcctc tctgttgcct 1021 cagctctcca agttccaggc ttccggtcct tagccttccc aggtgggact ttaggcatga 1081 ttaaaatatg gacatatttt tggag // LOCUS HUMCAIX 2785 bp mRNA PRI 31-OCT-1994 DEFINITION Human carbonic anhydrase I (CAI) mRNA, complete cds. ACCESSION M33987 NID g179792 KEYWORDS carbonic anhydrase I. SOURCE Human EBV transformed SH B cell line DNA, and cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2785) AUTHORS Lowe,N., Brady,H.J., Barlow,J.H., Sowden,J.C., Edwards,M. and Butterworth,P.H. TITLE Structure and methylation patterns of the gene encoding human carbonic anhydrase I JOURNAL Gene 93 (2), 277-283 (1990) MEDLINE 91033039 COMMENT Since no intron sequences were provided this entry is treated as if originating from an mRNA. Draft entry and computer-readable sequence for [Unpublished (1990)] kindly submitted by N.Lowe, 09-MAY-1990. Author address: N.Lowe Department of Biochemistry University College London Gower Street, London WC1E 6BT, U.K. E-mail:UCBCMAR%EUCLID.UCL.AC.UK@CUNYVM.CUNY.EDU. FEATURES Location/Qualifiers source 1..2785 /organism="Homo sapiens" /db_xref="taxon:9606" /map="8q13-q22" TATA_signal 874..879 mRNA 902..1939 /note="carbonic anhydrase I mRNA (alt.)" mRNA 902..2165 /note="carbonic anhydrase I mRNA (alt.)" gene 1048..1833 /gene="CA1" CDS 1048..1833 /gene="CA1" /note="carbonic anhydrase I (EC 4.2.1.1)" /codon_start=1 /db_xref="GDB:G00-119-047" /db_xref="PID:g179793" /translation="MASPDWGYDDKNGPEQWSKLYPIANGNNQSPVDIKTSETKHDTS LKPISVSYNPATAKEIINVGHSFHVNFEDNDNRSVLKGGPFSDSYRLFQFHFHWGSTN EHGSEHTVDGVKYSAELHVAHWNSAKYSSLAEAASKADGLAVIGVLMKVGEANPKLQK VLDALQAIKTKGKRAPFTNFDPSTLLPSSLDFWTYPGSLTHPPLYESVTWIICKESIS VSSEQLAQFRSLLSNVEGDNAVPMQHNNRPTQPLKGRTVRASF" BASE COUNT 840 a 603 c 492 g 850 t ORIGIN 1 ctttagccca acagtcaaaa ataattgatg ctaccctaca aatgtccaaa actctagtat 61 atcatatttc taagttacag caaatattag tcctgctaaa ccagggagct ttggcaaaaa 121 tgttttttga cagtaaattt gtccttgatt atatattaac tagtcaaaga ggtgtttgta 181 acattattag agcttcttgt tgtaggtggg ttaacaccac caatcaagag gtcattctaa 241 cagaaagcct ggatcagaaa accatcaccc taaaaaaaca tgccttacat atttaacaca 301 ctctgaaatc cagtcaaaat atgactaaag gcccttgcca tgactgatgt attctcctgg 361 ccaacgccaa acaaatggga gcctggttac gagtcagcct tcagggactt gtcacatttc 421 tacttggttt cttccttgtt attgtcataa taaaatgttt tctatgctgt ttagtgcaac 481 ttaggcccta ttctgtagaa gtctcctcta ctattcaggc cactcaaaca ccccaaataa 541 ttgagttcaa aatcgacatc aagatataaa ggaatcagtg actaaatata tttcatatat 601 ggtattttta ttgattattg tgctgtcttg acctagtatg gaggccttgg ctagaggctg 661 gtcagtttcc tctcttgagc agctgattaa atccacaccc caaccacttc ccttatcagg 721 ttctcacact ctggggccac tatgtaccca ctctaatcac cacagggcca gacatcagac 781 aattaaggac agcgcccatg ccccaaagcc cgccaaaatt atgcaaatta ttcaaaatta 841 ttcaacctag ctaaccccac cctttttgct gtacataagc tgcccattcc ccctccagcc 901 tgtggtaccc agtcctcagg tgcaaccccc tgcgtggtcc tctgtggcag ccttctctca 961 ttcagagctg ttttccacag aggtagtgaa aagaactgga ttttcaagtt cactttgcaa 1021 gagaaaaaga aaactcagta gaagataatg gcaagtccag actggggata tgatgacaaa 1081 aatggtcctg aacaatggag caagctgtat cccattgcca atggaaataa ccaatcccct 1141 gttgatatta aaaccagtga aaccaaacat gacacctctc tgaaacctat tagtgtctcc 1201 tacaacccag ccacagccaa agaaattatc aatgtggggc attctttcca tgtaaatttt 1261 gaggacaacg ataaccgatc agtgctgaaa ggtggtcctt tctctgacag ctacaggctc 1321 tttcagtttc attttcactg gggcagtaca aatgagcatg gttcagaaca tacagtggat 1381 ggagtcaaat attctgccga gcttcacgta gctcactgga attctgcaaa gtactccagc 1441 cttgctgaag ctgcctcaaa ggctgatggt ttggcagtta ttggtgtttt gatgaaggtt 1501 ggtgaggcca acccaaagct gcagaaagta cttgatgccc tccaagcaat taaaaccaag 1561 ggcaaacgag ccccattcac aaattttgac ccctctactc tccttccttc atccctggat 1621 ttctggacct accctggctc tctgactcat cctcctcttt atgagagtgt aacttggatc 1681 atctgtaagg agagcatcag tgtcagctca gagcagctgg cacaattccg cagccttcta 1741 tcaaatgttg aaggtgataa cgctgtcccc atgcagcaca acaaccgccc aacccaacct 1801 ctgaagggca gaacagtgag agcttcattt tgatgattct gagaagaaac ttgtccttcc 1861 tcaagaacac agccctgctt ctgacataat ccagttaaaa taataatttt taagaaataa 1921 atttatttca atattagcaa gacagcatgc cttcaaatca atctgtaaaa ctaagaaact 1981 taaattttag ttcttactgc ttaattcaaa taataattag taagctagca aatagtaatc 2041 tgtaagcata agcttatctt aaattcaagt ttagtttgag gaattcttta aaattacaac 2101 taagtgattt gtatgtctat ttttttcagt ttatttgaac caataaaata attttatctc 2161 tttctttctg ttgtgcattc agtttctaaa accattaagt ttctactcca tttacattca 2221 aaaatcttaa atactttact tgcaagagta ttttgcttca aatacaacaa cctaagagca 2281 gctggagatg aaatattggg aaattcattt gcttactcct gaagacaaaa atatagctga 2341 gatgaccact ggatttaata tcgttatgct ggcccaacat tgctaccatt tgtgttgtct 2401 gtgatcaaaa tgattatctt ttatatagga agatgacgct tctggatatt gctttcactt 2461 cttctcccca cgttagcaag gacaatgctt ctctgccatt attacaacta gttagtttgc 2521 atggagaatc tttactttaa aattggaaga aaagtcacaa gtgaatggtt tataaaaatg 2581 ctaaagaagt cattcttgct tagaatcata tagaaacatc atgcaatctt ttagtcagat 2641 gtgcgcttca ccttatgcta tttttatctt taattgacac acaataattg tacatgttta 2701 tggagtatag tgtggtgttt tctgtttgtt tgtttgtttt ttgagacaag gtctcactct 2761 gccagtcagg gtggagtgcg atggt // LOCUS HUMCALAA 2276 bp mRNA PRI 24-AUG-1993 DEFINITION Human calmodulin-dependent protein phosphatase catalytic subunit (PPP3CA) mRNA, complete cds and alternative exon. ACCESSION L14778 NID g306476 KEYWORDS calcineurin A-alpha; calmodulin-dependent protein phosphatase; phosphoprotein phosphatase. SOURCE Homo sapiens (library: lambda ZAPII (Stratagene)) female 2 year old brain (hippocampus) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2276) AUTHORS Kincaid,R.L., Giri,P.R., Higuchi,S., Tamura,J., Dixon,S.C., Marietta,C.A., Amorese,D.A. and Martin,B.M. TITLE Cloning and characterization of molecular isoforms of the catalytic subunit of calcineurin using nonisotopic methods JOURNAL J. Biol. Chem. 265, 11312-11319 (1990) MEDLINE 90293081 REFERENCE 2 (bases 1 to 2276) AUTHORS Muramatsu,T. and Kincaid,R.L. TITLE Molecular cloning of a full-length cDNA encoding the catalytic subunit of human calmodulin-dependent protein phosphatase (calcineurin A alpha) JOURNAL Biochim. Biophys. Acta 1178 (1), 117-120 (1993) MEDLINE 93320118 FEATURES Location/Qualifiers source 1..2276 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="2 year old" /sex="female" /tissue_type="brain (hippocampus)" /tissue_lib="lambda ZAPII (Stratagene)" gene 1..2276 /gene="PPP3CA" 5'UTR 1..147 /gene="PPP3CA" CDS 148..1713 /gene="PPP3CA" /standard_name="calcineurin A alpha" /EC_number="3.1.3.16" /note="Partial coding sequence and 3' UTR was reported for the human 'type 1', or 'A alpha' cDNAs in ref. 1" /citation=[2] /citation=[1] /codon_start=1 /function="phosphoprotein phosphatase" /product="calmodulin-dependent phosphatase catalytic subunit" /db_xref="PID:g306477" /translation="MSEPKAIDPKLSTTDRVVKAVPFPPSHRLTAKEVFDNDGKPRVD ILKAHLMKEGRLEESVALRIITEGASILRQEKNLLDIDAPVTVCGDIHGQFFDLMKLF EVGGSPANTRYLFLGDYVDRGYFSIECVLYLWALKILYPKTLFLLRGNHECRHLTEYF TFKQECKIKYSERVYDACMDAFDCLPLAALMNQQFLCVHGGLSPEINTLDDIRKLDRF KEPPAYGPMCDILWSDPLEDFGNEKTQEHFTHNTVRGCSYFYSYPAVCEFLQHNNLLS ILRAHEAQDAGYRMYRKSQTTGFPSLITIFSAPNYLDVYNNKAAVLKYENNVMNIRQF NCSPHPYWLPNFMDVFTWSLPFVGEKVTEMLVNVLNICSDDELGSEEDGFDGATAAAR KEVIRNKIRAIGKMARVFSVLREESESVLTLKGLTPTGMLPSGVLSGGKQTLQSATVE AIEADEAIKGFSPQHKITSFEEAKGLDRINERMPPRRDAMPSDANLNSINKALTSETN GTDSNGSNSSNIQ" exon 1486..1515 /gene="PPP3CA" /note="This alternative exon was first described in citation 1 and transcripts containing this exon are the predominant ones in brain.; alternative inserted exon" /citation=[2] /citation=[1] /function="unknown" 3'UTR 1714..2276 /gene="PPP3CA" polyA_signal 2245..2250 /gene="PPP3CA" BASE COUNT 653 a 458 c 538 g 627 t ORIGIN 1 ccagctcaga gcctagacct ccagccgagc ggtttgcagc cgcgggcggc ggcggcggcg 61 gcggcgttga gtgtctggcc cgccggtccg gtcggggtgt gcagtcggac ggacgagcag 121 cgcgtcgctg tcctccggca gctggagatg tccgagccca aggcaattga tcccaagttg 181 tcgacgaccg acagggtggt gaaagctgtt ccatttcctc caagtcaccg gcttacagca 241 aaagaagtgt ttgataatga tggaaaacct cgtgtggata tcttaaaggc gcatcttatg 301 aaggagggaa ggctggaaga gagtgttgca ttgagaataa taacagaggg tgcatcaatt 361 cttcgacagg aaaaaaattt gctggatatt gatgcgccag tcactgtttg tggggacatt 421 catggacaat tctttgattt gatgaagctc tttgaagtcg ggggatctcc tgccaacact 481 cgctacctct tcttagggga ctatgttgac agagggtact tcagtattga atgtgtgctg 541 tatttgtggg ccttgaaaat tctctacccc aaaacactgt ttttacttcg tggaaatcat 601 gaatgtagac atctaacaga gtatttcaca tttaaacaag aatgtaaaat aaagtattca 661 gaacgagtat atgatgcctg tatggatgcc tttgactgcc ttcccctggc tgccctgatg 721 aaccaacagt tcctgtgtgt gcatggtggt ttgtctccag agattaacac tttagatgat 781 atcagaaaat tagaccgatt caaagaacca cctgcatatg gacctatgtg tgatatcctg 841 tggtcagacc ccctggaaga ttttggaaat gagaagactc aggaacattt cactcacaac 901 acagtcaggg ggtgttcata cttctacagt tacccggctg tatgtgaatt cttacagcac 961 aataacttgt tatctatact ccgagcccac gaagcccaag atgcagggta ccgcatgtac 1021 aggaaaagcc aaacaacagg cttcccttct ctaattacaa ttttttcagc accaaattac 1081 ttagatgtat acaataacaa agctgcagta ttgaagtatg agaacaatgt tatgaatatc 1141 aggcaattca actgttctcc tcatccatac tggcttccaa atttcatgga tgtttttact 1201 tggtcccttc catttgttgg ggaaaaagtg actgagatgc tggtaaatgt cctcaacatc 1261 tgctcagatg atgaactagg gtcagaagaa gatggatttg atggtgcaac agctgcagcc 1321 cggaaagagg tgataaggaa caagatccga gcaataggca aaatggccag agtgttctca 1381 gtgctcagag aagagagtga gagtgtgctg acgctgaaag gcttgacccc aactggcatg 1441 ctccccagcg gagtactttc tggagggaag caaaccctgc aaagcgctac tgttgaggct 1501 attgaggctg atgaagctat caaaggattt tcaccacaac ataagatcac tagcttcgag 1561 gaagccaagg gcttagaccg aattaatgag aggatgccgc ctcgcagaga tgccatgccc 1621 tctgacgcca accttaactc catcaacaag gctctcacct cagagactaa cggcacggac 1681 agcaatggca gtaatagcag caatattcag tgaccacttc ctgttcacat tttttttttt 1741 tttttttttt tttttttttt tgagctgcgg ggcatgatgg ggattgctgc atatcagcag 1801 ttggatgttc ttgcctctga cagtagctta tttgctctgg gggccaggaa ttggattcag 1861 tttacactat cattaaaaaa gagggagaga gataataaac tatattttgg tggggatggt 1921 gattaaacac ctcttttggg tatgcctttt aaaaatgctt atagagaaaa aaaattttaa 1981 aaaaagaaag ctaatgctag tatatactgc aatgttaggg gaatgaacat gttttcctac 2041 tgcattgggg acttctagat aggttaatga aaggcctttt attctgttac tggacatgaa 2101 aactttgtct aatttcttac tctattgtac gtttacagtc gcagcactaa aaatggatga 2161 catcaaacat ttttaacaaa atgatgatgt acaaactaag gactatttat tgataatgtt 2221 ttgctactct tgtcagacaa tggctataaa ctgaattagg cagtcttaaa aaaaaa // LOCUS HUMCALCBP 799 bp mRNA PRI 15-DEC-1989 DEFINITION Human calmodulin mRNA, complete cds. ACCESSION M27319 NID g179809 KEYWORDS calcium-binding protein; calmodulin. SOURCE Homo sapiens adult liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 799) AUTHORS Wawrzynczak,E.J. and Perham,R.N. TITLE Isolation and nucleotide sequence of a cDNA encoding human calmodulin JOURNAL Biochem. Int. 9, 177-185 (1984) MEDLINE 85022688 FEATURES Location/Qualifiers source 1..799 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="liver" CDS 88..537 /codon_start=1 /product="calmodulin" /db_xref="PID:g179810" /translation="MADQLTEEQIAEFKEAFSLFDKDGDGTITTKELGTVMRSLGQNP TEAELQDMINEVDADGNGTIDFPEFLTMMARKMKDTDSEEEIREAFRVFDKDGNGYIS AAELRHVMTNLGEKLTDEEVDEMIREADIDGDGQVNYEEFVQMMTAK" BASE COUNT 263 a 178 c 160 g 198 t ORIGIN 1 cagcatcgga ggtacccccg ccgtcgcagc ccccgcgctg gtgcagccac cctcgctccc 61 tctgctcttc ctcccttcac tcgcaccatg gctgatcagc tgaccgaaga acagattgct 121 gaattcaagg aagccttctc cctatttgat aaagatggcg atggcaccat cacaacaaag 181 gaacttggaa ctgtcatgag gtcactgggt cagaacccaa cagaagctga attgcaggat 241 atgatcaatg aagtggatgc tgatggtaat ggcaccattg acttccccga atttttgact 301 atgatggcta gaaaaatgaa agatacagat agtgaagaag aaatccgtga ggcattccga 361 gtctttgaca aggatggcaa tggttatatc agtgcagcag aactacgtca cgtcatgaca 421 aacttaggag aaaaactaac agatgaagaa gtagatgaaa tgatcagaga agcagatatt 481 gatggagacg gacaagtcaa ctatgaagaa ttcgtacaga tgatgactgc aaaatgaaga 541 cctactttca actccttttt cccccctcta gaagaatcaa attgaatctt ttacttacct 601 cttgcaaaaa aaagaaaaaa gaaaaaagtt catttattca ttctgtttct atatagcaaa 661 actgaatgtc aaaagtacct tctgtccaca cacacaaaat ctgcatgtat tggttggtgg 721 tcctgtcccc taaagatcaa gctacacatc agttttacaa tataaatact tgtactacct 781 taatgataag gactcctta // LOCUS HUMCALD 2975 bp mRNA PRI 04-OCT-1991 DEFINITION Human caldesmon mRNA, complete cds. ACCESSION M64110 NID g179829 KEYWORDS F-actin binding protein; caldesmon; calmodulin; myosin; tropomyosin. SOURCE Homo sapiens (library: lambda gt11) fetus lung cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2975) AUTHORS Novy,R.E., Lin,J.L.-C. and Lin,J.J. TITLE Characterization of cDNA clones encoding a human fibroblast caldesmon isoform and analysis of caldesmon expression in normal and transformed cells JOURNAL J. Biol. Chem. 266, 16917-16924 (1991) MEDLINE 91358497 FEATURES Location/Qualifiers source 1..2975 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="WI-38" /cell_type="fibroblast" /dev_stage="fetus" /tissue_type="lung" /tissue_lib="lambda gt11" CDS 112..1728 /codon_start=1 /product="caldesmon" /db_xref="PID:g179830" /translation="MDDFERRRELRRQKREEMRLEAERIAYQRNDDDEEEAARERRRR ARQERLRQKQEEESLGQVTDQVEVNAQNSVPDEEAKTTTTNTQVEGDDEAAFLERLAR REERRQKRLQEALERQKEFDPTITDASLSLPSRRMQNDTAENETTEKEEKSESRQERY EIEETETVTKSYQKNDWRDAEENKKEDKEKEEEEEEKPKRGSIGENQIKDEKIKKDKE PKEEVKSFMDRKKGFTEVKSQNGEFMTHKLKHTENTFSRPGGRASVDTKEAEGAPQME AGKRLEELRRRRGETESEEFEKLKQKQQEAALELEELKKKREERRKVLEEEEQRRKQE EADRKLREEEEKRRLKEEIERRRAEAAEKRQKMPEDGLSDDKKPFKCFTPKGSSLKIE ERAEFLNKSVQKSSGVKSTHQAAIVSKIDSRLEQYTSAIEGTKSAKPTKPAASDLPVP AEGVRNIKSMWEKGNVFSSPTAAGTPNKETAGLKVGVSSRINEWLTKTPDGNKSPAPK PSDLRPGDVSSKRNLWEKQSVDKVTSPTKV" BASE COUNT 1036 a 544 c 725 g 670 t ORIGIN 1 cagatcatca aatcaaattc cacagggatt ggtgaccaac cagaaggctc agacatctga 61 ttgctgacct gtccagacat catctggtct ccctgaacct gaaatcacac catggatgat 121 tttgagcgtc gcagagaact tagaaggcaa aagagggagg agatgcgact cgaagcagaa 181 agaatcgcct accagaggaa tgacgatgat gaagaggagg cagcccggga acggcgccgc 241 cgagcccgac aggaacggct gcggcagaag caggaggaag aatccttggg acaggtgacc 301 gaccaggtgg aggtgaatgc ccagaacagt gtgcctgacg aggaggccaa gacaaccacc 361 acaaacactc aagtggaagg ggatgatgag gccgcattcc tggagcgcct ggctcggcgt 421 gaggaaagac gccaaaaacg ccttcaggag gctctggagc ggcagaagga gttcgaccca 481 acaataacag atgcaagtct gtcgctccca agcagaagaa tgcaaaatga cacagcagaa 541 aatgaaacta ccgagaagga agaaaaaagt gaaagtcgcc aagaaagata cgagatagag 601 gaaacagaaa cagtcaccaa gtcctaccag aagaatgatt ggagggatgc tgaagaaaac 661 aagaaagaag acaaggaaaa ggaggaggag gaagaggaga agccaaagcg agggagcatt 721 ggagaaaatc agatcaaaga tgaaaagatt aaaaaggaca aagaacccaa agaagaagtt 781 aagagcttca tggatcgaaa gaagggattt acagaagtta agtcgcagaa tggagaattc 841 atgacccaca aacttaaaca tactgagaat actttcagcc gccctggagg gagggccagc 901 gtggacacca aggaggctga gggcgccccc cagatggaag ccggcaaaag gctggaggag 961 cttcgtcgtc gtcgcgggga gaccgagagc gaagagttcg agaagctcaa acagaagcag 1021 caggaggcgg ctttggagct ggaggaactc aagaaaaaga gggaggagag aaggaaggtc 1081 ctggaggagg aagagcagag gaggaagcag gaggaagccg atcgaaaact cagagaggag 1141 gaagagaaga ggaggctaaa ggaagagatt gaaaggcgaa gagcagaagc tgctgagaaa 1201 cgccagaaga tgccagaaga tggcttgtca gatgacaaga aaccattcaa gtgtttcact 1261 cctaaaggtt catctctcaa gatagaagag cgagcagaat ttttgaataa gtctgtgcag 1321 aaaagcagtg gtgtcaaatc gacccatcaa gcagcaatag tctccaagat tgacagcaga 1381 ctggagcagt ataccagtgc aattgaggga acaaaaagcg caaaacctac aaagccggca 1441 gcctcggatc ttcctgttcc tgctgaaggt gtacgcaaca tcaagagtat gtgggagaaa 1501 gggaatgtgt tttcatcccc cactgcagca ggcacaccaa ataaggaaac tgctggcttg 1561 aaggtagggg tttctagccg catcaatgaa tggctaacta aaaccccaga tggaaacaag 1621 tcacctgctc ccaaaccttc tgacttgaga ccaggagacg tatccagcaa gcggaacctc 1681 tgggaaaagc aatctgtgga taaggtcact tcccccacta aggtttgaga cagttccaga 1741 aagaacccaa gctcaagacg caggacgagc tcagttgtag agggctaatt cgctctgttt 1801 tgtatttatg ttgatttact aaattgggtt cattatcttt tatttttcaa tatcccagta 1861 aacccatgta tattatcact atatttaata atcacagtct agagatgttc atggtaaaag 1921 tactgccttt gcacaggagc ctgtttctaa agaaacccat gctgtgaaat agagactttt 1981 ctactgatca tcataactct gtatctgagc agtgatacca accacatctg aagtcaacag 2041 aagatccaag tttaaaattg cctgcggaat gtgtgcagta tctagaaaaa tgaaccgtag 2101 tttttgtttt tttaaataca gaagtcatgt tgtttctgca ctttataata aagcatggaa 2161 gaaattatct tagtaggcaa ttgtaacact ttttgaaagt aacccatttc agatttgaaa 2221 tactgcaata atggttgtct ttaaaaaaaa aaaaaagaaa cgtactgtta aggtattact 2281 ttttttcatg ctgatgattc atatctaaat tacattatta tgttagctga cagtggtact 2341 gattttttag gttggttgtt ttgtggattt ctttagtagt gatagtagcc tgaaccacat 2401 tttagataac tcaattatgt atgtatgtgc atacacatat acaaacacac taatggtaga 2461 atgctttttt atgtgctaga ctattatatt tagtagtatg tcattgtaac tagccaatat 2521 cacagctttt gaaaaattaa aaaatcacac tatattaata tttcatattt gccaacagaa 2581 acatggcaga taggtatcaa tatgttttca atgcctgatg acctataaga agaaagtatt 2641 gaaaagaaga gagattagaa ctgttagaag gagttgaaat tttctaaaag acatagtatt 2701 tagtttataa ttaaatgcat tcttgaagtc cagtgtgaat tttattaatg ctatcatctc 2761 gaccaagctc aaagcctact tattagaaac aatgaagttc acaataggtc ataaggtctc 2821 ttccttttct aaaattgaaa gacaagaaat ttagtgccaa tattgtacag acagaaattc 2881 catgtatgag tctcaacaaa gactaccttt ggctaaatgt ctagaagcag agaagtaaag 2941 tgagcaaaat ccagtgttga ggagtcatgg aattc // LOCUS HUMCALD9KA 456 bp mRNA PRI 25-MAR-1993 DEFINITION Homo sapiens calbindin D-9k mRNA, complete cds. ACCESSION L13220 NID g291883 KEYWORDS calbindin D-9k. SOURCE Homo sapiens female adult duodenal mucosa cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 456) AUTHORS Jeung,E.-B., Krisinger,J., Dann,J.L. and Leung,P.C. K.. TITLE Molecular cloning of the full-length cDNA encoding the human calbindin-D9k JOURNAL FEBS Lett. 307, 224-228 (1992) MEDLINE 92354716 FEATURES Location/Qualifiers source 1..456 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /sex="female" /tissue_type="duodenal mucosa" mRNA 1..456 CDS 58..297 /codon_start=1 /function="calcium binding protein" /evidence=experimental /product="calbindin D-9k" /db_xref="PID:g291884" /translation="MSTKKSPEELKRIFEKYAAKEGDPDQLSKDELKLLIQAEFPSLL KGPNTLDDLFQELDKNGDGEVSFEEFQVLVKKISQ" polyA_signal 436..441 polyA_site 456 BASE COUNT 157 a 81 c 95 g 123 t ORIGIN 1 aaaaaactcc tctttgattc ttctagctgt ttcactattg ggcaaccaga caccagaatg 61 agtactaaaa agtctcctga ggaactgaag aggatttttg aaaaatatgc agccaaagaa 121 ggtgatccag accagttgtc aaaggatgaa ctgaagctat tgattcaggc tgaattcccc 181 agtttactca aaggtccaaa caccctagat gatctctttc aagaactgga caagaatgga 241 gatggagaag ttagttttga agaattccaa gtattagtaa aaaagatatc ccagtgaagg 301 agaaaacaaa atagaaccct gagcactgga ggaagagcgc ctgtgctgtg gtcttatcct 361 atgtggaatc ccccaaagtc tctggtttaa ttctttgcaa ttataataac ctggctgtga 421 ggttcagtta ttattaataa agaaattatt agacat // LOCUS HUMCALIEF 3881 bp mRNA PRI 28-SEP-1994 DEFINITION Human calnexin mRNA, complete cds. ACCESSION M94859 NID g179831 KEYWORDS calcium-binding protein; calnexin. SOURCE Homo sapiens (library: lambda ZAPII) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3881) AUTHORS Honore,B., Rasmussen,H.H., Celis,A., Leffers,H., Madsen,P. and Celis,J.E. TITLE The molecular chaperones HSP28, GRP78, endoplasmin, and calnexin exhibit strikingly different levels in quiescent keratinocytes as compared to their proliferating normal and transformed counterparts: cDNA cloning and expression of calnexin JOURNAL Electrophoresis 15 (3-4), 482-490 (1994) MEDLINE 94333293 FEATURES Location/Qualifiers source 1..3881 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 96..1874 /codon_start=1 /product="calnexin" /db_xref="PID:g179832" /translation="MEGKWLLCMLLVLGTAIVEAHDGHDDDVIDIEDDLDDVIEEVED SKPDTTAPPSSPKVTYKAPVPTGEVYFADSFDRGTLSGWILSKAKKDDTDDEIAKYDG KWEVEEMKESKLPGDKGLVLMSRAKHHAISAKLNKPFLFDTKPLIVQYEVNFQNGIEC GGAYVKLLSKTPELNLDQLHDKTPYTIMFGPDKCGEDYKLHFIFRHKNPKTGIYEEKH AKRPDADLKTYFTDKKTHLYTLILNPDNSFEILVDQSVVNSGNLLNDMTPPVNPSREI EDPEDRKPEDWDERPKIPDPEAVKPDDWDEDAPAKIPDEEATKPEGWLDDEPEYVPDP DAEKPEDWDEDMDGEWEAPQIANPRCESAPGCGVWQRPVIDNPNYKGKWKPPMIDNPS YQGIWKPRKIPNPDFFEDLEPFRMTPFSAIGLELWSMTSDIFFDNFIICADRRIVDDW ANDGWGLKKAADGAAEPGVVGQMIEAAEERPWLWVVYILTVALPVFLVILFCCSGKKQ TSGMEYKKTDAPQPDVKEEEEEKEEEKDKGDEEEEGEEKLEEKQKSDAEEDGGTVSQE EEDRKPKAEEDEILNRSPRNRKPRRE" BASE COUNT 1134 a 712 c 881 g 1154 t ORIGIN 1 gccgcctccg cctctctctt tactgcggcg cggggcaagg tgtgcgggcg ggaaggggca 61 cgggcacccc cgcggtcccc gggaggctag agatcatgga agggaagtgg ttgctgtgta 121 tgttactggt gcttggaact gctattgttg aggctcatga tggacatgat gatgatgtga 181 ttgatattga ggatgacctt gacgatgtca ttgaagaggt agaagactca aaaccagata 241 ccactgctcc tccttcatct cccaaggtta cttacaaagc tccagttcca acaggggaag 301 tatattttgc tgattctttt gacagaggaa ctctgtcagg gtggatttta tccaaagcca 361 agaaagacga taccgatgat gaaattgcca aatatgatgg aaagtgggag gtagaggaaa 421 tgaaggagtc aaagcttcca ggtgataaag gacttgtgtt gatgtctcgg gccaagcatc 481 atgccatctc tgctaaactg aacaagccct tcctgtttga caccaagcct ctcattgttc 541 agtatgaggt taatttccaa aatggaatag aatgtggtgg tgcctatgtg aaactgcttt 601 ctaaaacacc agaactcaac ctggatcagc tccatgacaa gaccccttat acgattatgt 661 ttggtccaga taaatgtgga gaggactata aactgcactt catcttccga cacaaaaacc 721 ccaaaacggg tatctatgaa gaaaaacatg ctaagaggcc agatgcagat ctgaagacct 781 attttactga taagaaaaca catctttaca cactaatctt gaatccagat aatagttttg 841 aaatactggt tgaccaatct gtggtgaata gtggaaatct gctcaatgac atgactcctc 901 ctgtaaatcc ttcacgtgaa attgaggacc cagaagaccg gaagcccgag gattgggatg 961 aaagaccaaa aatcccagat ccagaagctg tcaagccaga tgactgggat gaagatgccc 1021 ctgctaagat tccagatgaa gaggccacaa aacccgaagg ctggttagat gatgagcctg 1081 agtacgtacc tgatccagac gcagagaaac ctgaggattg ggatgaagac atggatggag 1141 aatgggaggc tcctcagatt gccaacccta gatgtgagtc agctcctgga tgtggtgtct 1201 ggcagcgacc tgtgattgac aaccccaatt ataaaggcaa atggaagcct cctatgattg 1261 acaatcccag ttaccaggga atctggaaac ccaggaaaat accaaatcca gatttctttg 1321 aagatctgga acctttcaga atgactcctt ttagtgctat tggtttggag ctgtggtcca 1381 tgacctctga catttttttt gacaacttta tcatttgtgc tgatcgaaga atagttgatg 1441 attgggccaa tgatggatgg ggcctgaaga aagctgctga tggggctgct gagccaggcg 1501 ttgtggggca gatgatcgag gcagctgaag agcgcccgtg gctgtgggta gtctatattc 1561 taactgtagc ccttcctgtg ttcctggtta tcctcttctg ctgttctgga aagaaacaga 1621 ccagtggtat ggagtataag aaaactgatg cacctcaacc ggatgtgaag gaagaggaag 1681 aagagaagga agaggaaaag gacaagggag atgaggagga ggaaggagaa gagaaacttg 1741 aagagaaaca gaaaagtgat gctgaagaag atggtggcac tgtcagtcaa gaggaggaag 1801 acagaaaacc taaagcagag gaggatgaaa ttttgaacag atcaccaaga aacagaaagc 1861 cacgaagaga gtgaaacaat cttaagagct tgatctgtga tttcttctcc ctcctcccct 1921 gcaagagtgg tcctaggaga ggacctggca caccttaggt tgacattcag aaaacttcaa 1981 gacatcacca tcagcaggct ccagttgaac actagtctgt gtaactttaa acatctagca 2041 gtaaatactt gcagttgtga tataaaggac cctgtttctg tagaaaagaa aacatttaac 2101 ataatggttg tgaaatgtaa catgaagcaa actaactttt ttttttttaa catctttgtt 2161 tttaaaatag aatgatagaa ctttgccagt ctttaagatc ttggcttaat ttaatgtatt 2221 aatctgtttg tgcaaacata ataccaccat ttaaaaatgt tagggagatg agttgcagtt 2281 tttataatag atttttttta aagtttggta ttgtaaaaca ttcacacctc tgtccctcaa 2341 aattgataat tacgtttaaa gtgcagtcat ttgtggttag aatcttgttt tgtttgcttc 2401 cattattgag ttcctcctaa ggaaattgag gagagggact gaatagaagc ccaaattcat 2461 ataaaagttg cgtttaagtt gtattaaaaa tagatatata agaaaaaatt ctttcacttg 2521 atgtttgtta gaccagaaag tgtgtgtgtt ctgtagctca gttcccagac agctttttag 2581 gtagtggagg aggtggcttc atgtggcact tgggcattta tattccactt gggagggtca 2641 ggctgtggcc ttctggagca ggtggcttgt taaggaacgc tagcagggca tggcacgtga 2701 gctccggaat agatgtcttc atcacttctt ccactgtgtg ttgacactgt tttccttacc 2761 tatttcctca gatccccagc tttctcctct gctatgcatt ttcttcacag tgcagcttgc 2821 agtccgttgc tgaaaatgat tataagccct gcataatgtt aagctttatt gtgattacgt 2881 gtatgtttct tctttctttt aagcagaccc atacctttcc agggtcaaag tacagaatag 2941 aatacattga tacaaagtac agaaaaatac tttgattttt atccatttct tttactctgt 3001 gtaaagactt gagaagtcta attcacaggc aaaccaatac agaattgact gcagttgaac 3061 agactagaag tatttgtggg aggagtgaca tgaagcatga gttatctgat tttttttgta 3121 gctgctatat attttaagcc ttcatttgca attcatgtaa cagttgtgtc ataaattaca 3181 caataaagca gtcctgttca aatttttttt taacgtggct tgtagaattt ttaaaaaagt 3241 gatcttaggt ttgttttttc atgcgggatg cagatgggtg ctatcagagc ctctcccaca 3301 ccactatagt gtaataatgt tattattact ctacactgaa acgtattcag agttagatat 3361 tattttagct tcagttgttc tttagaggct ttcaaatgta ccgatgatac tgtttcttgc 3421 actgaatata taaacactcc acagtgttta tattgggaag atattgggaa ggaaatatat 3481 ttgtaaaaga tgaaggctgt atctattttt ttttcttttt aaagtttgtt cacttaaatt 3541 cttttgagga tgggatgtat ttttcttgct gttcagtgct ttttcctttt catctgttgt 3601 tctgtggtca cagtgacctt agctacatag cagactttcc caaatgtatt gattacaaat 3661 aaacagttgt tacttagcaa gacctgaaaa tatgtctgca ggtttctcct tgaagcaaat 3721 gtgtgggatc attgcatttc cagaaatctg cctccttcac cctccgttga cagtatatgt 3781 catgcctcac tttcttctag ctgagcttta aatcattaga gcttaaattg tcagatcgtt 3841 cattgccttt ccagggttat ttagtaaagt ttgttgaaaa c // LOCUS HUMCALMODU 1605 bp mRNA PRI 02-AUG-1994 DEFINITION Homo sapiens calcium/calmodulin-dependent protein kinase mRNA, complete cds. ACCESSION L17000 NID g306478 KEYWORDS calcium/calmodulin-dependent protein kinase. SOURCE Homo sapiens cerebellum and thymus cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1605) AUTHORS Bland,M.M., Monroe,R.S. and Ohmstede,C.-A. TITLE The cDNA sequence and characterization of the Ca2+/calmodulin-dependent protein kinase-Gr from human brain and thymus JOURNAL Gene 142, 191-197 (1994) MEDLINE 94252566 FEATURES Location/Qualifiers source 1..1605 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="cerebellum and thymus" 5'UTR 1..41 CDS 42..1463 /codon_start=1 /product="calcium/calmodulin-dependent protein kinase" /db_xref="PID:g306479" /translation="MLKVTVPSCSASSCSSVTASAAPGTASLVPDYWIDGSNRDALSD FFEVESELGRGATSIVYRCKQKGTQKPYALKVLKKTVDKKIVRTEIGVLLRLSHPNII KLKEIFETPTEISLVLELVTGGELFDRIVEKGYYSERDAADAVKQILEAVAYLHENGI VHRDLKPENLLYATPAPDAPLKIADFGLSKIVEHQVLMKTVCGTPGYCAPEILRGCAY GPEVDMWSVGIITYILLCGFEPFYDERGDQFMFRRILNCEYYFISPWWDEVSLNAKDL VRKLIVLDPKKRLTTFQALQHPWVTGKAANFVHMDTAQKKLQEFNARRKLKAAVKAVV ASSRLGSASSSHGSIQESHKASRDPSPIQDGNEDMKAIPEGEKIQGDGAQAAVKGAQA ELMKVQALEKVKGADINAEEAPKMVPKAVEDGIKVADLELEEGLAEEKLKTVEEAAAP REGQGSSAVGFEVPQQDVILPEY" 3'UTR 1464..1605 BASE COUNT 456 a 355 c 431 g 363 t ORIGIN 1 agcggcgggg cggcggcggc ttccggagtc ccgctgcgaa gatgctcaaa gtcacggtgc 61 cctcctgctc cgcctcgtcc tgctcttcgg tcaccgccag tgcggccccg gggaccgcga 121 gcctcgtccc ggattactgg atcgacggct ccaacaggga tgcgctgagc gatttcttcg 181 aggtggagtc ggagctggga cggggtgcta catccattgt gtacagatgc aaacagaagg 241 ggacgcagaa gccttatgct ctcaaagtgt taaagaaaac agtggacaaa aaaatcgtaa 301 gaactgagat aggagttctt cttcgcctct cacatccaaa cattataaaa cttaaagaga 361 tatttgaaac ccctacagaa atcagtctgg tcctagaact cgtcacagga ggagaactgt 421 ttgataggat tgtggaaaag ggatattaca gtgagcgaga tgctgcagat gccgttaaac 481 aaatcctgga ggcagttgct tatctacatg aaaatgggat tgtccatcgt gatctcaaac 541 cagagaatct tctttatgca actccagccc cagatgcacc actcaaaatc gctgattttg 601 gactctctaa aattgtggaa catcaagtgc tcatgaagac agtatgtgga accccagggt 661 actgcgcacc tgaaattctt agaggttgtg cctatggacc tgaggtggac atgtggtctg 721 taggaataat cacctacatc ttactttgtg gatttgaacc attctatgat gaaagaggcg 781 atcagttcat gttcaggaga attctgaatt gtgaatatta ctttatctcc ccctggtggg 841 atgaagtatc tctaaatgcc aaggacttgg tcagaaaatt aattgttttg gatccaaaga 901 aacggctgac tacatttcaa gctctccagc atccgtgggt cacaggtaaa gcagccaatt 961 ttgtacacat ggataccgct caaaagaagc tccaagaatt caatgcccgg cgtaagctta 1021 aggcagcggt gaaggctgtg gtggcctctt cccgcctggg aagtgccagc agcagccatg 1081 gcagcatcca ggagagccac aaggctagcc gagacccttc tccaatccaa gatggcaacg 1141 aggacatgaa agctattcca gaaggagaga aaattcaagg cgatggggcc caagccgcag 1201 ttaagggggc acaggctgag ctgatgaagg tgcaagcctt agagaaagtt aaaggtgcag 1261 atataaatgc tgaagaggcc cccaaaatgg tgcccaaggc agtggaggat gggataaagg 1321 tggctgacct ggaactagag gagggcctag cagaggagaa gctgaagact gtggaggagg 1381 cagcagctcc cagagaaggg caaggaagct ctgctgtggg ttttgaagtt ccacagcaag 1441 atgtgatcct gccagagtac taaacagctt ccttcagatc tggaagccaa acaccggcat 1501 tttatgtact ttgtccttca gcaagaaagg tgtggaacca tgatatgtac tatagtgatt 1561 cgtttttgag gtcaaaaaca tacatatata ccagttggta attct // LOCUS HUMCALPA1L 609 bp mRNA PRI 31-DEC-1994 DEFINITION Human calpactin 1 light chain mRNA, complete cds. ACCESSION M81457 NID g179874 KEYWORDS calpactin I light chain. SOURCE Homo sapiens (tissue library: lambda gt11) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 609) AUTHORS Dooley,T.P., Weiland,K.L. and Simon,M. TITLE cDNA sequence of human p11 calpactin I light chain JOURNAL Genomics 13 (3), 866-868 (1992) MEDLINE 92347895 FEATURES Location/Qualifiers source 1..609 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="primary keratinocyte" /tissue_lib="lambda gt11" /map="presumed at 1q21" gene 72..593 /gene="calpactin 1 light chain" CDS 72..365 /gene="calpactin 1 light chain" /codon_start=1 /product="calpactin I light chain" /db_xref="PID:g179875" /translation="MPSQMEHAMETMMFTFHKFAGDKGYLTKEDLRVLMEKEFPGFLE NQKDPLAVDKIMKDLDQCRDGKVGFQSFFSLIAGLTIACNDYFVVHMKQKGKK" polyA_signal 584..593 /gene="calpactin 1 light chain" BASE COUNT 188 a 143 c 132 g 146 t ORIGIN 1 ccgcgtccag ctcgcccagc tcgcccagcg tccgccgcgc ctcggccaag gcttcaacgg 61 accacaccaa aatgccatct caaatggaac acgccatgga aaccatgatg tttacatttc 121 acaaattcgc tggggataaa ggctacttaa caaaggagga cctgagagta ctcatggaaa 181 aggagttccc tggatttttg gaaaatcaaa aagaccctct ggctgtggac aaaataatga 241 aggacctgga ccagtgtaga gatggcaaag tgggcttcca gagcttcttt tccctaattg 301 cgggcctcac cattgcatgc aatgactatt ttgtagtaca catgaagcag aagggaaaga 361 agtaggcaga aatgagcagt tcgctcctcc ctgataagag ttgtccaaag ggtcgcttaa 421 ggaatctgcc ccacagcttc ccccatagaa ggatttcatg agcagatcag gacacttagc 481 aaatgtaaaa ataaaatcta actctcattt gacaagcaga gaaagaaaag ttaaatacca 541 gataagcttt tgatttttgt attgtttgca tccccttgcc ctcaataaat aaagttcttt 601 tttagttcc // LOCUS HUMCALPS 2739 bp mRNA PRI 03-JUN-1991 DEFINITION Human calmodulin-like processed pseudogene, complete cds. ACCESSION M36707 NID g179878 KEYWORDS calmodulin; pseudogene. SOURCE Human leukocyte, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2739) AUTHORS Koller,M. and Strehler,E.E. TITLE Characterization of an intronless human calmodulin-like pseudogene JOURNAL FEBS Lett. 239, 121-128 (1988) MEDLINE 89031205 FEATURES Location/Qualifiers source 1..2739 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="leukocyte" CDS 1066..1515 /note="calmodulin-like pseudogene" /pseudo /codon_start=1 BASE COUNT 591 a 790 c 802 g 556 t ORIGIN 1 gagctcctgg ggcaatgtgg accagagaat gatagtggga cttcctggct gagccttcaa 61 ggtctccttt gtctctcctc cttgtcccct ccctcacagg cctgcagccc actgatcatc 121 ttccagccct tggccctgcc atggccactc cggcctagac agctgcttct cagcaggagg 181 tctgtccttg accaggctga aaggtgacct cggggtcctg tccatcacca ccttgtatgt 241 ccatcagcat ttatcaataa tccctagtca ccttgtttgc attcctcgat cagtttacct 301 gcaaactagc tgcttgctag ttaacctcac gaggactggg ggtggcttgc ggtcccactg 361 tgttcccagc ttccagagtg acactgcacc tagtaggcac tcaaatatct gtgaaagggt 421 ggatgagtga atgggttgga ggaaacccct ccctattcat gtccattctg aagataagaa 481 aactcgctgc ttctacctga agggaaaacc ttcctcctcc aaaaaccctc ctgttctata 541 tgaggcttag aaaccaaagg caaggacgtt tctttcatct gcctttcatc ctgcaaaaag 601 ccctccactc aaggcaagaa agggcactga agtttattga gtgggagaca gcggcaggtg 661 ggagaccggg gagggaggag agaaagggaa attcaggagg aaggagtcca gcgtggattg 721 ctccaaagct cacccaccac gccctgactg caggtgtgat tcggggcccc cgtggctctg 781 ctgggtccag gtgcaagcag gcaagaggtg tggcgtcagc tcgattcgca ggccctggac 841 tactgtctaa acaggacagg cccgggcaag cagggcaggg gcgtctgcaa tgatggggga 901 ggactctgct gcttcttaag ctccagcgtc tcaagccagg gcgagacagc ccgcggccgc 961 ccggatctcc acctgccacc ccagagctgg gacagagccg ggctgcggca ctgggaggga 1021 gaccccacag tggcctcttc tgccacccac gcccccaccc ctggcatggc cgaccagctg 1081 actgaggagc aggtcacaga attcaaggag gccttctccc tgtttgacaa ggatggggac 1141 ggctgcatca ccacccgcga gctgggcacg gtcatgcggt ccctgggcca gaaccccacg 1201 gaggccgagc tgcgggacat gatgagtgag atcgaccggg acggcaacgg caccgtggac 1261 ttccccgagt tcctgggcat gatggccagg aagatgaagg acacggacaa cgaggaggag 1321 atccgcgagg ccttccgcgt gttcgacaag gacggcaacg gcttcgtcag cgccgccgag 1381 ctgcgacacg tcatgacccg gctgggggag aagctgagtg acgaggaggt ggacgagatg 1441 atccgggccg cggacacgga cggagacgga caggtgaact acgaggagtt tgtccgtgtg 1501 ctggtgtcca agtgaggccg gcgcccacca tgctcctggg cgcccacgcg gcccacaggg 1561 caagaacccg gggcctcccg cctcctcccc catccccctg cctcccctgg gcactgtggc 1621 ttcctcctgc gcctggttga ttcagcccac ctctctgcat cccgcttccc gcgtctcttc 1681 tctgcactcc tgccgacctt cccacctgct catctgaatg acacggaacg ctcccactgc 1741 aggcaaaccg tgacgccctc cccactcggg agaagcagag ctgaccttag gaccgagcac 1801 cagggcaggt tgcgctgact ctgcggccct ccaggacgga caccgggtga cccttaggca 1861 caggcaagat ccctaacagg caccaatgcc aggcaggggg ctgcagccct cagcccccgc 1921 caagattccc gcaggctcct ggactggaag ctccctccgc ggtcggattc tggagggtgg 1981 gaggcatctt ggcctgcagt aagcggtgct gacggggact ctggccacag aggtcaggcc 2041 tcctgaaaac agcactgcct tccgcgctgc cccagcttgc cccattcctt gtccgccaac 2101 ccaccgtgat tcatcttctg aagctgggag tgaaactggg tcagctgtaa cctgttccta 2161 ttcatctgga aggagggagg cttggatgag caggggatga gagctgcagg gaaataaatg 2221 agatattcgt ccttatttca tttccttttt ttttttggtt tccatggaaa tgatccttgt 2281 taaattcagg gttgaaacga ggcaggaatc tccatttttg tgctttttga aaatgcaatg 2341 aattcctata cgggggagcg ggaaaggtgc ctcagagaga gacaagtctg gatgagggaa 2401 atattgaata ttctcaatca aatggatacg ctggcagcaa agagtggtta aagtccatca 2461 ggacttgaaa gacctgagtc cattacgttg agaagggacc tgctgattgc tttgattccc 2521 cctggcaagt gctccctggt tgtgaatgcc aggcactaga gatggtgagg ggttgggggg 2581 cagttgggca cacacagtgt aagagcaatt cagagccgtt agtcctgcac tagccctcaa 2641 gctgctggca acacctagag aaggtcgaag ggccctgcca gagatccctt caattctaaa 2701 gggaggtatg ttttgcggga tgggcactag acggagctc // LOCUS HUMCAMPPK 3036 bp mRNA PRI 28-MAR-1997 DEFINITION Human cAMP-dependent protein kinase type I-alpha subunit (PRKAR1A) mRNA, complete cds. ACCESSION M33336 NID g1526989 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3036) AUTHORS Sandberg,M., Skalhegg,B. and Jahnsen,T. TITLE The two mRNA forms for the type I alpha regulatory subunit of cAMP-dependent protein kinase from human testis are due to the use of different polyadenylation site signals JOURNAL Biochem. Biophys. Res. Commun. 167 (1), 323-330 (1990) MEDLINE 90179769 REFERENCE 2 (bases 1 to 3036) AUTHORS Solberg,R., Sandberg,M., Natarajan,V., Torjesen,P.A., Hansson,V., Jahnsen,T. and Tasken,K. TITLE The human gene for the regulatory subunit RI alpha of cyclic adenosine 3', 5'-monophosphate-dependent protein kinase: two distinct promoters provide differential regulation of alternately spliced messenger ribonucleic acids JOURNAL Endocrinology 138 (1), 169-181 (1997) MEDLINE 97131944 REFERENCE 3 (bases 1 to 3036) AUTHORS Sandburg,M. TITLE Direct Submission JOURNAL Submitted (26-APR-1988) Institute of Medical Biochemistry, University of Oslo, P.O. Box 1112, Blindern, Oslo N-0317, Norway FEATURES Location/Qualifiers source 1..3036 /organism="Homo sapiens" /db_xref="taxon:9606" /map="7p13-qter" /tissue_type="testis" exon 1..81 /note="exon 1b; alternate spliced promoter sequence, exon 1a in GenBank Accession Number Y07642" /number=1 exon 82..264 /number=2 gene 88..1233 /gene="PRKAR1A" CDS 88..1233 /gene="PRKAR1A" /EC_number="2.7.1.37" /codon_start=1 /product="cAMP-dependent protein kinase type I-alpha subunit" /db_xref="PID:g179895" /db_xref="GDB:G00-120-313" /translation="MESGSTAASEEARSLRECELYVQKHNIQALLKDSIVQLCTARPE RPMAFLREYFERLEKEEAKQIQNLQKAGTRTDSREDEISPPPPNPVVKGRRRRGAISA EVYTEEDAASYVRKVIPKDYKTMAALAKAIEKNVLFSHLDDNERSDIFDAMFSVSFIA GETVIQQGDEGDNFYVIDQGETDVYVNNEWATSVGEGGSFGELALIYGTPRAATVKAK TNVKLWGIDRDSYRRILMGSTLRKRKMYEEFLSKVSILESLDKWERLTVADALEPVQF EDGQKIVVQGEPGDEFFIILEGSAAVLQRRSENEEFVEVGRLGPSDYFGEIALLMNRP RAATVVARGPLKCVKLDRPRFERVLGPCSDILKRNIQQYNSFVSLSV" exon 265..435 /gene="PRKAR1A" /number=3 exon 436..589 /gene="PRKAR1A" /number=4 exon 590..636 /gene="PRKAR1A" /number=5 exon 637..795 /gene="PRKAR1A" /number=6 exon 796..856 /gene="PRKAR1A" /number=7 exon 857..978 /gene="PRKAR1A" /number=8 exon 979..1060 /gene="PRKAR1A" /number=9 exon 1061..3036 /number=10 BASE COUNT 820 a 552 c 683 g 981 t ORIGIN 1 gctgggagca aagcgctgag ggagctcggt acgccgccgc ctcgcacccg cagcctcgcg 61 cccgccgccg cccgtcccca gagaaccatg gagtctggca gtaccgccgc cagtgaggag 121 gcacgcagcc ttcgagaatg tgagctctac gtccagaagc ataacattca agcgctgctc 181 aaagattcta ttgtgcagtt gtgcactgct cgacctgaga gacccatggc attcctcagg 241 gaatactttg agaggttgga gaaggaggag gcaaaacaga ttcagaatct gcagaaagca 301 ggcactcgta cagactcaag ggaggatgag atttctcctc ctccacccaa cccagtggtt 361 aaaggtagga ggcgacgagg tgctatcagc gctgaggtct acacggagga agatgcggca 421 tcctatgtta gaaaggttat accaaaagat tacaagacaa tggccgcttt agccaaagcc 481 attgaaaaga atgtgctgtt ttcacatctt gatgataatg agagaagtga tatttttgat 541 gccatgtttt cggtctcctt tatcgcagga gagactgtga ttcagcaagg tgatgaaggg 601 gataacttct atgtgattga tcaaggagag acggatgtct atgttaacaa tgaatgggca 661 accagtgttg gggaaggagg gagctttgga gaacttgctt tgatttatgg aacaccgaga 721 gcagccactg tcaaagcaaa gacaaatgtg aaattgtggg gcatcgaccg agacagctat 781 agaagaatcc tcatgggaag cacactgaga aagcggaaga tgtatgagga attccttagt 841 aaagtctcta ttttagagtc tctggacaag tgggaacgtc ttacggtagc tgatgcattg 901 gaaccagtgc agtttgaaga tgggcagaag attgtggtgc agggagaacc aggggatgag 961 ttcttcatta ttttagaggg gtcagctgct gtgctacaac gtcggtcaga aaatgaagag 1021 tttgttgaag tgggaagatt ggggccttct gattattttg gtgaaattgc actactgatg 1081 aatcgtcctc gtgctgccac agttgttgct cgtggcccct tgaagtgcgt taagctggac 1141 cgacctagat ttgaacgtgt tcttggccca tgctcagaca tcctcaaacg aaacatccag 1201 cagtacaaca gttttgtgtc actgtctgtc tgaaatctgc ctcctgtgcc tcccttttct 1261 cctctcccca atccatgctt cactcatgca aactgcttta ttttccctac ttgcagcgcc 1321 aagtggccac tggcatcgca gcttcctgtc tgtttatata ttgaaagttg cttttattgc 1381 accattttca atttggagca ttaactaaat gctcatacac agttaaataa atagaaagag 1441 ttctatggag actttgctgt tactgcttct ctttgtgcag tgttagtatt caccctgggc 1501 agtgagtgcc atgctttttg gtgagggcag atccagcacc tattgaatta ccatagagta 1561 atgatgtaac agtgcaagat tttttttttt aagtgacata attgtccagt tataagcgta 1621 tttagactgt ggccatatat gctgtatttc tttgtagaat aaatggtttc tcattaaact 1681 ctaaagatta gggaaatgga tatagaaaat cttagtatag tagaaagaca tctgcctgta 1741 attaaactag tttaagggtg gaaaaatgaa aatttttgct aattatcaat gggatatgat 1801 tggttcagtt ttttttttcc agagttgttg tttgccaagc taatctgcct ggtttattta 1861 tatcttgtta ttaatgtttc ttctccaatt ctgaaatact tttgagtatg gctatctata 1921 cctgcctttt aagtttgaaa ctaactcata gatgcaaata ttggttagta tttaactaca 1981 tctgcctcgg ctcacaaatt ccgattagac ctttatccag ctagtgccaa ataattgatc 2041 agatgctgaa ttgagaataa gaatttgagg tctacattct tggttgttaa tttagagcgt 2101 ttggttaaag tatgtccttc agctgactcc agtataatct cctctgctca ttaaactgat 2161 tccaggagat tggatttgct gtgactagat acagatggag caaatgtcct aacagagaaa 2221 tagaggtgat gctgctaaag ggagaaatgc caggcggaca aagttcagtg tcgggaattt 2281 tccccgtgac attcactggg gcatgagatt ttggaagaag ttttttactt tggtttagtc 2341 tttttttcct cctttttatt cagctagaat ttctggtggg ttgatggtag ggtataatgt 2401 gtctgtgttg cttcaaattg gtctgaaagg ctatcctgct gaaagtcctg ctttcctatc 2461 tagcatttat tcctctggca aacttttctt tcttttcttt tttaaagtaa acttgtgtat 2521 tgagtcttaa ctgtatttca gtattttcca gccttatgtg ttacattatt ccaatgatac 2581 ccaacagttt atttttatta tttttttaaa caaaatttca cagttctgta atgtaggcac 2641 ttttattttc attgtgattt atatataagg taatgtaggg ttatatttgg gagtgactgc 2701 aagcattttt ccatctgtgt gcaactaact gactctgtta ttgatccctt ctcctgccct 2761 ttcccaggta atttaaattg gtcatggtag atttttttca tagatttgaa aaacttttag 2821 gttgttacca agtatgaagt ataaatctgg ggaagaggtt ttatttacat tttagggtgg 2881 gtaagaaagc caccttgtta caaatttttt aatttccaaa ataatctata ttaaatgagg 2941 gtttctgatc tgtactttgt gtttagctac ctttttatat ttaaaaaatt aaaaatgaaa 3001 attatgttct tacaagctta aagcttgatt tgatct // LOCUS HUMCANP 3213 bp mRNA PRI 19-JUL-1994 DEFINITION Human Ca2-activated neutral protease large subunit (CANP) mRNA, complete cds. ACCESSION M23254 NID g511636 KEYWORDS Ca2-activated neutral protease. SOURCE Homo sapiens skeletal muscle cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3213) AUTHORS Imajoh,S., Aoki,K., Ohno,S., Emori,Y., Kawasaki,H., Sugihara,H. and Suzuki,K. TITLE Molecular cloning of the cDNA for the large subunit of the high-Ca-2+-requiring form of human Ca-2+-activated neutral protease JOURNAL Biochemistry 27, 8122-8128 (1988) MEDLINE 89166474 FEATURES Location/Qualifiers source 1..3213 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="skeletal muscle" gene 131..3199 /gene="CANP" CDS 131..2233 /gene="CANP" /note="Ca2-activated" /codon_start=1 /product="neutral protease large subunit" /db_xref="PID:g511637" /translation="MAGIAAKLAKDREAAEGLGSHERAIKYLNQDYEALRNECLEAGT LFQDPSFPAIPSALGFKELGPYSSKTRGMRWKRPTEICADPQFIIGGATRTDICQGAL GDCWLLAAIASLTLNEEILARVVPLNQSFQENYAGIFHFQFWQYGEWVEVVVDDRLPT KDGELLFVHSAEGSEFWSALLEKAYAKINGCYEALSGGATTEGFEDFTGGIAEWYELK KPPPNLFKIIQKALQKGSLLGCSIDITSAADSEAITFQKLVKGHAYSVTGAEEVESNG SLQKLIRIRNPWGEVEWTGRWNDNCPSWNTIDPEERERLTRRHEDGEFWMSFSDFLRH YSRLEICNLTPDTLTSDTYKKWKLTKMDGNWRRGSTAGGCRNYPNTFWMNPQYLIKLE EEDEDEEDGESGCTFLVGLIQKHRRRQRKMGEDMHTIGFGIYEVPEELSGQTNIHLSK NFFLTNRARERSDTFINLREVLNRFKLPPGEYILVPSTFEPNKDGDFCIRVFSEKKAD YQAVDDEIEANLEEFDISEDDIDDGVRRLFAQLAGEDAEISAFELQTILRRVLAKRQD IKSDGFSIETCKIMVDMLDSDGSGKLGLKEFYILWTKIQKYQKIYREIDVDRSGTMNS YEMRKALEEAGFKMPCQLHQVIVARFADDQLIIDFDNFVRCLVRLETLFKIFKQLDPE NTGTIELDLISWLCFSVL" mat_peptide 134..2230 /gene="CANP" /note="Ca2-activated" /product="neutral protease large subunit" polyA_signal 3188..3199 /gene="CANP" BASE COUNT 885 a 785 c 836 g 707 t ORIGIN 1 aatcatcgct cgcagcggcg gcgcccgcag tggccgcagc agcgcgccgg gccctggccg 61 cgccccagcc gagcgcagcg cggagtcgcc ccgacctttc tctgcgcagt acggccgccg 121 ggaccgcagc atggcgggca tcgcggccaa gctggcgaag gaccgggagg cggccgaggg 181 gctgggctcc cacgagaggg ccatcaagta cctcaaccag gactacgagg cgctgcggaa 241 cgagtgcctg gaggccggga cgctcttcca ggacccgtcc ttcccggcca tcccctcggc 301 cctgggcttc aaggagttgg ggccctactc cagcaaaacc cggggcatga gatggaagcg 361 ccccacggag atctgcgctg acccccagtt tatcattgga ggagccaccc gcacagacat 421 ctgccaagga gccctaggtg actgctggct gctggcagcc attgcctccc tcaccttgaa 481 tgaagaaatc ctggctcgag tcgtccccct aaaccagagc ttccaggaaa actatgcagg 541 gatctttcac ttccagttct ggcaatacgg cgagtgggtg gaggtggtgg tggatgacag 601 gctgcccacc aaggacgggg agctgctctt tgtgcattca gccgaaggga gcgagttctg 661 gagcgccctg ctggagaagg catacgccaa gatcaacgga tgctatgaag ctctatcagg 721 gggtgccacc actgagggct tcgaagactt caccggaggc attgctgagt ggtatgagtt 781 gaagaagccc cctcccaacc tgttcaagat catccagaaa gctctgcaaa aaggctctct 841 ccttggctgc tccatcgaca tcaccagcgc cgcggactcg gaggccatca cgtttcagaa 901 gctggtgaag gggcacgcgt actcggtcac cggagccgag gaggttgaaa gtaacggaag 961 cctacagaaa ctgatccgca tccgaaatcc ctggggagaa gtggagtgga cagggcggtg 1021 gaatgacaac tgcccaagct ggaacactat agacccagag gagagggaaa ggctgaccag 1081 acggcatgaa gatggagaat tctggatgtc tttcagtgac ttcctgaggc actattcccg 1141 cctggagatc tgtaacctga ccccagacac tctcaccagc gatacctaca agaagtggaa 1201 actcaccaaa atggatggga actggaggcg gggctccacc gcgggaggtt gcaggaacta 1261 cccgaacaca ttctggatga accctcagta cctgatcaag ctggaggagg aggatgagga 1321 cgaggaggat ggggagagcg gctgcacctt cctggtgggg ctcattcaga agcaccgacg 1381 gcggcagagg aagatgggcg aggacatgca caccatcggc tttggcatct atgaggttcc 1441 agaggagtta agtgggcaga ccaacatcca cctcagcaaa aacttcttcc tgacgaatcg 1501 cgccagggag cgctcagaca ccttcatcaa cctccgggag gtgctcaacc gcttcaagct 1561 gccgccagga gagtacattc tcgtgccttc caccttcgaa cccaacaagg atggggattt 1621 ctgcatccgg gtcttttctg aaaagaaagc tgactaccaa gctgtcgatg atgaaatcga 1681 ggccaatctt gaagagttcg acatcagcga ggatgacatt gatgatggag tcaggagact 1741 gtttgcccag ttggcaggag aggatgcgga gatctctgcc tttgagctgc agaccatcct 1801 gagaagggtt ctagcaaagc gccaagatat caagtcagat ggcttcagca tcgagacatg 1861 caaaattatg gttgacatgc tagattcgga cgggagtggc aagctggggc tgaaggagtt 1921 ctacattctc tggacgaaga ttcaaaaata ccaaaaaatt taccgagaaa tcgacgttga 1981 caggtctggt accatgaatt cctatgaaat gcggaaggca ttagaagaag caggtttcaa 2041 gatgccctgt caactccacc aagtcatcgt tgctcggttt gcagatgacc agctcatcat 2101 cgattttgat aattttgttc ggtgtttggt tcggctggaa acgctattca agatatttaa 2161 gcagctggat cccgagaata ctggaacaat agagctcgac cttatctctt ggctctgttt 2221 ctcagtactt tgaagttata actaatctgc ctgaagactt ctcatgatgg aaaatcagcc 2281 aaggactaag cttccataga aatacacttt gtatctggac ctcaaaatta tgggaacatt 2341 tacttaaacg gatgatcata gctgaaaata atgatactgt caatttgaga tagcagaagt 2401 ttcacacatc aaagtaaaag atttgcatat cattatacta aatgcaaatg agtcgcttaa 2461 cccttgacaa ggtcaaagaa agctttaaat ctgtaaatag tatacacttt ttacttttac 2521 acactttcct gttcatagca atattaaatc aggaaaaaaa aatgcaggga ggtatttaac 2581 agctgagcaa aaacattgag tcgctctcaa aggacacgag gcccttggca gggaatattt 2641 aaagcaactt caagtttaaa atgcagctgt tgattctacc aaacaacagt ccaagattac 2701 catttcccat gagccaactg ggaaacatgg tatatcatga agtaatcttg tcaaggcatc 2761 tggagagtcc aggagaggag actcacctct gtcgcttggg ttaaacaaga gacaggtttt 2821 gtagaatatt gattggtaat agtaaatcgt tctccttaca atcaagttct tgaccctatt 2881 cggccttata catctggtct tacaaagacc aaagggatcc tgcgcttgat caactgaacc 2941 agtatgccaa aaccaggcat ccaatttgta aaccaattat gataaaggac aaaataagct 3001 gtttgccacc tcaaaacttt atgaacttca ccaccactag tgtctgtcca tggagttaga 3061 ggggacatca cttagaagtt cttatagaaa ggacacaagt ttgtttcctg gctttacctt 3121 gggaaaatgc tagcaacatt atagaaattt tgccttgttg ccttatcttc ttccaaatgt 3181 actgttaaat aaaaataaag ggttacccca tcg // LOCUS HUMCAP 3984 bp mRNA PRI 23-JUL-1991 DEFINITION Human protein tyrosine phosphatase mRNA, complete cds. ACCESSION M64572 NID g179912 KEYWORDS tyrosine phosphatase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3984) AUTHORS Yang,Q.C. and Tonks,N.K. TITLE Isolation of a cDNA clone encoding a human protein-tyrosine phosphatase with homology to the cytoskeletal-associated proteins band 4.1, exrin, and talin JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88, 5949-5953 (1991) MEDLINE 91296738 FEATURES Location/Qualifiers source 1..3984 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="HELA CELLS" CDS 24..2765 /note="putative" /codon_start=1 /product="protein-tyrosine phosphatase" /db_xref="PID:g179913" /translation="MTSRLRALGGRINNIRTSELPKEKTRSEVICSIHFLDGVVQTFK VTKQDTGQVLLDMVHNHLGVTEKEYFGLQHDDDSVDSPRWLEASKPIRKQLKGGFPCT LHFRVRFFIPDPNTLQQEQTRHLYFLQLKMDICEGRLTCPLNSAVVLASYAVQSHFGD YNSSIHHPGYLSDSHFIPDQNEDFLTKVESLHEQHSGLKQSEAESCYINIARTLDFYG VELHSGRDLHNLDLMIGIASAGVAVYRKYICTSFYPWVNILKISFKRKKFFIHQRQKQ AESREHIVAFNMLNYRSCKNLWKSCVEHHTFFQAKKLLPQEKNVLSQYWTMGSRNTKK SVNNQYCKKVIGGMVWNPAMRRSLSVEHLETKSLPSRSPPITPNWRSPRLRHEIRKPR HSSADNLANEMTYITETEDVFYTYKGSLAPQDSDSEVSQNRSPHQESLSENNPAQSYL TQKSSSSVSPSSNAPGSCSPDGVDQQLLDDFHRVTKGGSTEDASQYYCDKNDNGDSYL VLIRITPDEDGKFGFNLKGGVDQKMPLVVSRINPESPADTCIPKLNEGDQIVLINGRD ISEHTHDQVVMFIKASRESHSRELALVIRRRAVRSFADFKSEDELNQLFPEAIFPMCP EGGDTLEGSMAQLKKGLESGTVLIQFEQLYRKKPGLAITFAKLPQNLDKNRYKDVLPY DTTRVLLQGNEDYINASYVNMEIPAANLVNKYIATQGPLPHTCAQFWQVVWDQKLSLI VMLTTLTERGRTKCHQYWPDPPDVMNHGGFHIQCQSEDCTIAYVSREMLVTNTQTGEE HTVTHLQYVAWPDHGIPDDSSDFLEFVNYVRSLRVDSEPVLVHCSAGIGRTGVLVTME TAMCLTERNLPIYPLDIVRKMRDQRAMMVQTSSQYKFVCEAILRVYEEGLVQMLDPS" BASE COUNT 1086 a 954 c 930 g 1014 t ORIGIN 1 ctgcaggtta ttcagcgata gttatgacct cccggttacg tgcgttgggt ggaagaatta 61 ataatatacg cacctcggag ttacccaaag agaaaactcg atcagaagtc atttgcagca 121 tccacttttt agatggcgtg gtacagacct ttaaagttac taaacaagac actggccagg 181 ttcttctgga tatggtgcac aaccacctgg gtgtgactga aaaggaatat tttggtttac 241 agcatgatga cgactccgtg gactctccta gatggctgga agcaagcaaa cccatcagga 301 agcagttaaa aggaggtttc ccctgtaccc tgcattttcg agtaagattt tttatacctg 361 atcccaacac actgcagcaa gaacaaacca ggcacttgta tttcttacaa ctgaagatgg 421 atatttgcga aggaaggtta acctgccctc ttaactcagc agtggttcta gcgtcctatg 481 ccgtacaatc tcattttgga gactataatt cttccataca tcatccaggc tatctttccg 541 atagtcactt tatacccgat caaaatgagg actttttaac aaaagtcgaa tctctgcatg 601 agcagcacag tgggctaaaa caatcagaag cagaatcctg ctatatcaac atagcgcgga 661 ccctcgactt ctatggagta gaactgcaca gtggtaggga tctgcacaat ttagacctaa 721 tgattggaat tgcttccgcg ggtgttgctg tgtaccgaaa atacatttgc acaagtttct 781 atccttgggt gaacattctc aaaatttctt tcaaaaggaa aaagttcttc atacatcagc 841 gacagaaaca ggctgaatcc agggaacata ttgtggcctt caacatgctg aattaccgat 901 cttgcaaaaa cttgtggaaa tcctgtgttg agcaccatac gttctttcag gcaaagaagc 961 tactacctca ggaaaagaat gttctgtctc agtactggac tatgggctct cggaacacca 1021 aaaagtcggt aaataaccaa tattgcaaaa aggtgattgg cgggatggtg tggaacccag 1081 ccatgcggag atccttatca gtggagcact tagaaaccaa gagtctgcct tctcgttccc 1141 ctcccattac tcccaactgg cgaagtcctc ggctccggca cgaaatccga aagccacgcc 1201 actcttctgc agataacctt gcaaatgaaa tgacctacat cacggaaacg gaagatgtat 1261 tttacacgta caagggctct ctggcccctc aagacagcga ttctgaagtt tctcagaacc 1321 gaagcccgca ccaagagagt ttatccgaga acaatccggc acaaagctac ctgacccaga 1381 agtcatccag ttctgtgtct ccatcttcaa atgctccagg ctcctgctca cctgacggcg 1441 ttgatcagca gctcttagat gacttccaca gggtgaccaa agggggctcc accgaggacg 1501 ccagccagta ctactgtgac aagaatgata atggtgacag ctacttagtc ttgatccgta 1561 tcacaccaga tgaagatgga aaatttggat ttaatcttaa gggaggagtg gatcaaaaga 1621 tgcctcttgt ggtatcaagg ataaacccag agtcacctgc ggacacctgc attcctaagc 1681 tgaacgaagg ggatcaaatc gtgttaatca atggccggga catctcagaa cacacgcatg 1741 accaagtggt gatgttcatc aaagccagcc gggagtccca ctcacgggag ctggccctgg 1801 tgatcaggag gagagctgtc cgctcatttg ctgacttcaa gtctgaagat gaactgaacc 1861 agcttttccc cgaagccatt ttccccatgt gtccggaggg tggggacact ttggagggat 1921 ccatggcaca gctaaagaag ggcctcgaaa gcgggacggt gctgatccag tttgagcaac 1981 tctacagaaa aaagccaggt ttggccatca cgtttgcaaa gctgcctcaa aatttggaca 2041 aaaaccgata taaagatgtg ctgccttatg acaccacccg ggtattattg cagggaaatg 2101 aagattatat taatgcaagt tacgtgaaca tggaaattcc tgctgctaac cttgtgaaca 2161 agtacatcgc cactcagggg cccctgccgc atacctgtgc acagttttgg caggttgtct 2221 gggatcagaa gttgtcactc attgtcatgt tgacgactct cacagaacga gggcggacca 2281 aatgtcacca gtactggcca gatccccccg acgtcatgaa ccacggcggc tttcacatcc 2341 agtgtcagtc agaggactgc accatcgcct atgtgtcccg agaaatgctg gtcacaaaca 2401 cccagaccgg ggaagaacac acagtgacac atctccagta cgtcgcatgg cctgaccacg 2461 gtatacccga tgactcctcc gactttctgg aatttgtaaa ctatgtgagg tctctgagag 2521 tggacagcga gcctgtccta gttcactgca gtgctggaat aggtcgaacc ggtgtgttgg 2581 tcactatgga aacagccatg tgcctaactg agaggaacct gcccatttac ccactggata 2641 ttgtccgaaa aatgcgagac cagcgcgcca tgatggtgca gacatcaagc cagtacaagt 2701 ttgtgtgtga agcgattctt cgtgtgtatg aagaaggttt agtccaaatg ctggatccta 2761 gttaagacaa ctgtgaaaaa gttcattcct ctttcccaag ggcatcctcc ttgaaagagg 2821 aggacagacc tctctggaag cagcaagagg aaccagtagc tgtgggaaag gaatgggcac 2881 ctctgaaccc aggcacttta aacttctata gaaaagatat cgtgtacata ggaactggtg 2941 tagataagca tgcaattatg gcatcattta ggcctgtatt tctatggaaa gatacaaaaa 3001 ggatctcagt ttggggcctg tcctaatgcc ttcttcccta acatcaccac acacacccct 3061 gtcggcatcc tggagcaatt gagaccggac acccacagag ctgttgtcct cccagcaaca 3121 agatggtgtg gttatcttgg gtcatttgga tgttttgttt gtttctgtgt gtcagactgt 3181 aagggctgag ctttctgtgc ttctaggtgg agctggaaca attcagattc acccgccctg 3241 atgctaagga aaccctgacg tatgtactag atggcagggc actgggggtc aggctgaagg 3301 ctgagcaaca cctctctgcc ctccctccct ttgtcccatc tcccagcgac ttccaatatt 3361 catgtttctg agaattgtgt ccctcttcag ttccctcttg gtgcctaacc tggattagta 3421 atgtgcattc aggtgaattt tcagctgagg ctctgagaac tggtactctc agtgtgttct 3481 ggtcatcttg tggcttagtt gtagaagcag gtgtgtctct tgcctctgct tgcctcctac 3541 tgcacactca gcacccagga ctggaatcac cgactactga atctcctaca tgtattgctg 3601 ctacttcaag ctcctccact tgaaacctta tgattttcca aggggagatg ggacagtgtc 3661 atctaaatat tccgaatgtt tggccttctg agaaaagagc ttctagtaat tgaaccatgg 3721 gtttcccagc ttctggaggg ttggccgtgg gctgtgtaca tgtgtgtgcc caggggtgag 3781 tgtttctcag gattcctaac gattcaaatt accgttgagt atatataaag aatcgagtct 3841 ctgtatggaa gaacaaatgt gtgcattcac ccccagtcac aatggtctcc attgcatttc 3901 aaaggagagg atcagactat ctgaatataa acacaatctg atgttaattt attctaagaa 3961 caccatcatt ttgattgtcc taaa // LOCUS HUMCAPA 2430 bp mRNA PRI 01-APR-1995 DEFINITION Homo sapiens mitochondrial carnitine palmitoyltransferase I mRNA, complete cds. ACCESSION L39211 NID g755645 KEYWORDS carnitine palmitoyltransferase I; nuclear gene. SOURCE Homo sapiens Liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2430) AUTHORS Britton,C.H., Schultz,R.A., Zhang,B., Esser,V., Foster,D.W. and McGarry,J.D. TITLE Human liver mitochondrial carnitine palmitoyltransferase I: characterization of its cDNA and chromosomal localization and partial analysis of the gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (6), 1984-1988 (1995) MEDLINE 95199277 FEATURES Location/Qualifiers source 1..2430 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="Liver" /map="11q" 5'UTR <1..76 /note="putative" CDS 77..2398 /note="putative" /codon_start=1 /product="carnitine palmitoyltransferase I" /db_xref="PID:g755646" /translation="MAEAHQAVAFQFTVTPDGIDLRLSHEALRQIYLSGLHSWKKKFI RFKNGIITGVYPASPSSWLIVVVGVMTTMYAKIDPSLGIIAKINRTLETANCMSSQTK NVVSGVLFGTGLWVALIVTMRYSLKVLLSYHGWMFTEHGKMSRATKIWMGMVKIFSGR KPMLYSFQTSLPRLPVPAVKDTVNRYLQSVRPLMKEEDFKRMTALAQDFAVGLGPRLQ WYLKLKSWWATNYVSDWWEEYIYLRGRGPLMVNSNYYAMDLLYILPTHIQAARAGNAI HAILLYRRKLDREEIKPIRLLGSTIPLCSAQWERMFNTSRIPGEETDTIQHMRDSKHI VVYHRGRYFKVWLYHDGRLLKPREMEQQMQRILDNTSEPQPGEARLAALTAGDRVPWA RCRQAYFGRGKNKQSLDAVEKAAFFVTLDETEEGYRSEDPDTSMDSYAKSLLHGRCYD RWFDKSFTFVVFKNGKMGLNAEHSWADAQIVAHLWEYVMSIDSLQLGYAEDGHCKGDI NPNIPYPTRLQWDIPGECQEVIETSLNTANLLANDVDFHSFPFVAFGKGIIKKCRTSP DTFVQLALQLAHYKDMGKFCLTYEASMTRLFREGRTETVRSCTTESCDFVRAMVDPAQ TVEQRLKLFKLASEKHQHMYRLAMTGSGIDRHLFCLYVVSKYLAVESPFLKEVLSEPW RLSTSQTPQQQVELFDLENNPEYVSSGGGFGPVADDGYGVSYILVGENLINFHISSKF SCPETDSHRFGRHLKEAMTDIITLFGLSSNSKK" 3'UTR 2399..>2430 /note="putative" BASE COUNT 580 a 652 c 659 g 539 t ORIGIN 1 ctccaccgcc gccgccgccg ccgccgctgc cgctgccgct gccgcacctc cgtagctgac 61 tcggtactct ctgaagatgg cagaagctca ccaagctgtg gcctttcagt tcacggtcac 121 tccggacggg attgacctgc ggctgagcca tgaagctctt agacaaatct atctctctgg 181 acttcattcc tggaaaaaga agttcatcag attcaagaac ggcatcatca ctggcgtgta 241 cccggcaagc ccctccagtt ggcttatcgt ggtggtgggc gtgatgacaa cgatgtacgc 301 caagatcgac ccctcgttag gaataattgc aaaaatcaat cggactctgg aaacggccaa 361 ctgcatgtcc agccagacga agaacgtggt cagcggcgtg ctgtttggca ccggcctgtg 421 ggtggccctc atcgtcacca tgcgctactc cctgaaagtg ctgctctcct accacgggtg 481 gatgttcact gagcacggca agatgagtcg tgccaccaag atctggatgg gtatggtcaa 541 gatcttttca ggccgaaaac ccatgttgta cagcttccag acatcgctgc ctcgcctgcc 601 ggtcccggct gtcaaagaca ctgtgaacag gtatctacag tcggtgaggc ctcttatgaa 661 ggaagaagac ttcaaacgga tgacagcact tgctcaagat tttgctgtcg gtcttggacc 721 aagattacag tggtatttga agttaaaatc ctggtgggct acaaattacg tgagcgactg 781 gtgggaggag tacatctacc tccgaggacg agggccgctc atggtgaaca gcaactatta 841 tgccatggat ctgctgtata tccttccaac tcacattcag gcagcaagag ccggcaacgc 901 catccatgcc atcctgcttt acaggcgcaa actggaccgg gaggaaatca aaccaattcg 961 tcttttggga tccacgattc cactctgctc cgctcagtgg gagcggatgt ttaatacttc 1021 ccggatccca ggagaggaga cagacaccat ccagcacatg agagacagca agcacatcgt 1081 cgtgtaccat cgaggacgct acttcaaggt ctggctctac catgatgggc ggctgctgaa 1141 gccccgggag atggagcagc agatgcagag gatcctggac aatacctcgg agcctcagcc 1201 cggggaggcc aggctggcag ccctcaccgc aggagacaga gttccctggg ccaggtgtcg 1261 tcaggcctat tttggacgtg ggaaaaataa gcagtctctt gatgctgtgg agaaagcagc 1321 gttcttcgtg acgttagatg aaactgaaga aggatacaga agtgaagacc cggatacgtc 1381 aatggacagc tacgccaaat ctctactaca cggccgatgt tacgacaggt ggtttgacaa 1441 gtcgttcacg tttgttgtct tcaaaaacgg gaagatgggc ctcaacgctg aacactcctg 1501 ggcagatgcg cagatcgtgg cccacctttg ggagtacgtc atgtccattg acagcctcca 1561 gctgggctat gcggaggatg ggcactgcaa aggcgacatc aatccgaaca ttccgtaccc 1621 caccaggctg cagtgggaca tcccggggga atgtcaagag gttatagaga cctccctgaa 1681 caccgcaaat cttctggcaa acgacgtgga tttccattcc ttcccattcg tagcctttgg 1741 taaaggaatc atcaagaaat gtcgcacgag cccagacacc tttgtgcagc tggccctcca 1801 gctggcgcac tacaaggaca tgggcaagtt ttgcctcaca tacgaggcct ccatgacccg 1861 gctcttccga gaggggagga cggagaccgt gcgctcctgc accactgagt catgcgactt 1921 cgtgcgggcc atggtggacc cggcccagac ggtggaacag aggctgaagt tgttcaagtt 1981 ggcgtctgag aagcatcagc atatgtatcg cctcgccatg accggctctg ggatcgatcg 2041 tcacctcttc tgcctttacg tggtgtctaa atatctcgct gtggagtccc ctttccttaa 2101 ggaagtttta tctgagcctt ggagattatc aacaagccag acccctcagc agcaagtgga 2161 gctgtttgac ttggagaata acccagagta cgtgtccagc ggagggggct ttggaccggt 2221 tgctgatgac ggctatggtg tgtcgtacat ccttgtggga gagaacctca tcaatttcca 2281 catttcttcc aagttctctt gccctgagac ggattctcat cgctttggaa ggcacctgaa 2341 agaagcaatg actgacatca tcactttgtt tggtctcagt tctaattcca aaaagtaatt 2401 ccactggagc tgctgggaag gaaaacgagg // LOCUS HUMCAPR 3710 bp mRNA PRI 31-DEC-1994 DEFINITION Human cadherin-associated protein-related (cap-r) mRNA, complete cds. ACCESSION M94151 NID g179923 KEYWORDS alpha-catenin; cadherin-associated protein. SOURCE Homo sapiens (tissue library: Stratagene #936206) female 17-18 week foetus brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3710) AUTHORS Claverie,J.M., Hardelin,J.P., Legouis,R., Levilliers,J., Bougueleret,L., Mattei,M.G. and Petit,C. TITLE Characterization and chromosomal assignment of a human cDNA encoding a protein related to the murine 102-kDa cadherin-associated protein (alpha-catenin) JOURNAL Genomics 15 (1), 13-20 (1993) MEDLINE 93162640 FEATURES Location/Qualifiers source 1..3710 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="17-18 week foetus" /sex="female" /tissue_type="brain" /tissue_lib="Stratagene #936206" /map="2p11.1-2p12" CDS 1..2868 /partial /note="ORF1" /codon_start=1 /db_xref="PID:g179924" /translation="GLQEFPHRGSMTSATSPIILKWDPKSLEIRTLTVERLLEPLVTQ VTTLVNTSNKGPSGKKKGRSKKAHVLAASVEQATQNFLEKGEQIAKESQDLKEELVAA VEDVRKQGETMRIASSEFADDPCSSVKRGTMVRAARALLSAVTRLLILADMADVMRLL SHLKIVEEALEAVKNATNEQDLANRFKEFGKKMVKLNYVAARRQQELKDPHCRDEMAA ARGALKKNATMLYTASQAFLRHPDVAATRANRDYVFKQVQEAIAGISNAAQATSPTDE AKGHTGIGELAAALNEFDNKIILDPMTFSEARFRPSLEERLESIISGAALMADSSCTR DDRRERIVAECNAVRQALQDLLSEYMNNTGRKEKGDPLNIAIDKMTKKTRDLRRQLRK AVMDHISDSFLETNVPLLVLIEAAKSGNEKEVKEYAQVFREHANKLVEVANLACSISN NEEGVKLVRMAATQIDSLCPQVINAALTLAARPQSKVAQDNMDVFKDQWEKQVRVLTE AVDDITSVDDFLSVSENHILEDVNKCVIALQEGDVDTLDRTAGAIRGRAARVIHIINA EMENYEAGVYTEKVLEATKLLSETVMPRFAEQVEVAIEALSANVPQPFEENEFIDASR LVYDGVRDIRKAVLMIRTPEELEDDSDFEQEDYDVRRGTSVQTEDDQLIAGQSARAIM AQLPQEEKAKIAEQVEIFHQEKSKLDAEVAKWDDSGNDIIVLAKQMCMIMMEMTDFTR GKGPLKNTSDVINAAKKIAEAGSRMDKLARAVADQCPDSACKQDLLAYLQRIALYCHQ LNICSKVKAEVQNLGGELIVSGTGVQSTFTTFYEVDCDVIDGGRASQLSTHLPTCAEG APIGSGSSDSSMLDSATSLIQAAKNLMNAVVLTVKASYVASTKYQKVYGTAAVNSPVV SWKMKAPEKKPLVKREKPEEFQTRVRRGSQKKHIFACTGFK" gene 31..2868 /gene="cap-r" CDS 31..2868 /gene="cap-r" /codon_start=1 /product="cadherin-associated protein-related" /db_xref="PID:g179925" /translation="MTSATSPIILKWDPKSLEIRTLTVERLLEPLVTQVTTLVNTSNK GPSGKKKGRSKKAHVLAASVEQATQNFLEKGEQIAKESQDLKEELVAAVEDVRKQGET MRIASSEFADDPCSSVKRGTMVRAARALLSAVTRLLILADMADVMRLLSHLKIVEEAL EAVKNATNEQDLANRFKEFGKKMVKLNYVAARRQQELKDPHCRDEMAAARGALKKNAT MLYTASQAFLRHPDVAATRANRDYVFKQVQEAIAGISNAAQATSPTDEAKGHTGIGEL AAALNEFDNKIILDPMTFSEARFRPSLEERLESIISGAALMADSSCTRDDRRERIVAE CNAVRQALQDLLSEYMNNTGRKEKGDPLNIAIDKMTKKTRDLRRQLRKAVMDHISDSF LETNVPLLVLIEAAKSGNEKEVKEYAQVFREHANKLVEVANLACSISNNEEGVKLVRM AATQIDSLCPQVINAALTLAARPQSKVAQDNMDVFKDQWEKQVRVLTEAVDDITSVDD FLSVSENHILEDVNKCVIALQEGDVDTLDRTAGAIRGRAARVIHIINAEMENYEAGVY TEKVLEATKLLSETVMPRFAEQVEVAIEALSANVPQPFEENEFIDASRLVYDGVRDIR KAVLMIRTPEELEDDSDFEQEDYDVRRGTSVQTEDDQLIAGQSARAIMAQLPQEEKAK IAEQVEIFHQEKSKLDAEVAKWDDSGNDIIVLAKQMCMIMMEMTDFTRGKGPLKNTSD VINAAKKIAEAGSRMDKLARAVADQCPDSACKQDLLAYLQRIALYCHQLNICSKVKAE VQNLGGELIVSGTGVQSTFTTFYEVDCDVIDGGRASQLSTHLPTCAEGAPIGSGSSDS SMLDSATSLIQAAKNLMNAVVLTVKASYVASTKYQKVYGTAAVNSPVVSWKMKAPEKK PLVKREKPEEFQTRVRRGSQKKHIFACTGFK" BASE COUNT 1059 a 802 c 971 g 878 t ORIGIN chromosome 2p11.1-2p12. 1 gggctgcagg aattccccca cagagggagc atgacttcgg caacttcacc tatcattctg 61 aaatgggacc ccaaaagttt ggaaatccgg acgctaacag tggaaaggct gttggagcca 121 cttgttacac aggtgactac acttgtcaac acaagcaaca aaggcccatc tggtaaaaag 181 aaagggaggt caaagaaagc ccatgtacta gctgcctctg tagagcaagc cactcagaat 241 ttcctggaaa agggtgaaca gatcgctaag gagagtcaag atctcaaaga agagttggtg 301 gctgctgtag aggatgtgcg caaacaaggt gagacgatgc ggatcgcctc ctccgagttt 361 gcagatgacc cttgctcgtc ggtaaagcgc ggcaccatgg tacgggcggc aagggctttg 421 ctctccgcgg tgacacgctt actcatcctg gcggacatgg cagatgtcat gagactttta 481 tcccatctga aaattgtgga agaggccctg gaagctgtca aaaatgctac aaatgagcaa 541 gaccttgcaa accgttttaa agagtttggg aaaaagatgg tgaaacttaa ctatgtagca 601 gcaagaagac aacaggagct gaaggatcct cactgtcggg atgagatggc agccgcccga 661 ggggctctga agaagaatgc cacaatgctg tacacggcct ctcaagcatt tctccgccac 721 ccagatgtcg ccgctacgag agccaaccga gattatgtgt tcaaacaagt ccaggaggcc 781 atcgccggca tctccaatgc tgctcaagct acctcgccca ctgacgaagc caagggccac 841 acgggcatcg gcgagctggc tgcggctctt aatgagtttg acaataagat tatcctggac 901 cccatgacgt tcagcgaggc caggttccgg ccgtccctgg aggagaggct ggagagcatc 961 atcagcggcg cagcgctgat ggccgactcc tcctgcacgc gagacgaccg gcgcgagagg 1021 atcgtggcgg agtgcaacgc cgtgcggcag gcgctccagg acctgctcag cgagtacatg 1081 aataatactg gaaggaaaga aaaaggagat cctctcaaca ttgcgattga taagatgact 1141 aagaaaacaa gagatctaag gagacagctt cggaaagcag tgatggatca catatctgac 1201 tctttcctgg aaaccaatgt tcctttgcta gttctcattg aggctgcaaa gagcggaaat 1261 gaaaaggaag tgaaagaata tgcccaagtt ttccgtgagc atgccaacaa actggtagag 1321 gttgccaatt tggcctgttc catctccaac aatgaagaag gggtgaaatt agttcggatg 1381 gcagccaccc agattgacag cctgtgtccc caggtcatca atgccgctct gacactggct 1441 gcccggccac agagcaaagt tgctcaggat aacatggacg tcttcaaaga ccagtgggag 1501 aagcaggtcc gagtgttgac agaggccgtg gatgacatca cctcagtgga tgacttcctc 1561 tctgtctcag aaaatcacat cttggaggat gtgaacaagt gtgtgatagc cctccaagag 1621 ggcgatgtgg acactctgga ccggactgca ggggccatca ggggccgggc agctcgagtc 1681 atacacatca tcaatgctga gatggagaac tatgaagctg gggtttatac tgagaaggtg 1741 ttggaagcta caaaattgct ttctgaaaca gtgatgccac gcttcgctga acaagtagag 1801 gttgccattg aagccctgag tgccaacgtt cctcaaccgt ttgaggagaa tgagttcatc 1861 gatgcctctc gcctggtgta tgatggcgtt cgggacatca gaaaggctgt gctgatgatc 1921 aggaccccag aagaactaga ggatgattct gactttgagc aggaagatta tgatgtgcgt 1981 agagggacaa gtgttcagac tgaggatgac cagctcattg cagggcagag cgcacgggcc 2041 atcatggcgc aactaccgca ggaggagaag gcaaaaatag ctgagcaggt ggagatattc 2101 catcaagaga aaagcaagct ggatgcagaa gtggccaaat gggacgacag cggcaatgat 2161 atcattgtac tggccaagca gatgtgtatg atcatgatgg aaatgacaga cttcacaaga 2221 ggcaaaggcc cattgaaaaa tacatctgat gtcattaatg ctgccaagaa aattgccgaa 2281 gcaggttctc gaatggacaa attagctcgt gctgtggctg atcagtgtcc tgattcagca 2341 tgtaagcagg atttattagc ctaccttcaa cgaattgcct tgtattgcca tcagcttaat 2401 atctgcagca aggtgaaggc agaagtgcag aatctgggag gagagctcat tgtgtcaggg 2461 acaggagttc agagcacttt cactaccttt tatgaggtag attgtgatgt catagatggg 2521 ggcagggcta gtcaactttc tacccacctc ccaacctgtg ctgagggagc tccgatcggg 2581 agtggaagca gtgattcctc catgctggac agtgccacat cgcttatcca ggcagctaaa 2641 aacctgatga atgctgttgt cctcacggtg aaagcatcct atgtggcctc aaccaaatac 2701 cagaaggtct atgggacagc agctgtcaac tcacctgttg tgtcttggaa gatgaaggct 2761 ccagagaaga agccccttgt gaagagagaa aagcctgaag aattccagac acgagttcga 2821 cgaggttctc agaagaaaca cattttcgcc tgtacaggct ttaagtgaat tcaaagcaat 2881 ggattccttc taggacgata ggttttaaca agaaagcttt ttctttcttt tctttctttc 2941 tttttctttt taattccatt tttgtatgca tacctgccag ctcgtatgcc tctggcatgg 3001 ggaaattaag ggaacagtgt ctgtttgcat gtaagatgag atgagatcaa tactactgat 3061 ccatctgtag cctgggaagg agacaggaca ttcctgtact aaggtggcac agagctgtcc 3121 tttgcaacat tctcataaaa ttgggcacag agttcgcatt ggcgcaatat ttatgggagt 3181 gggagggatg gggaaaataa acttaactct acaaaagcaa actctaatgc atgcaagaat 3241 cattaggttg gcaggtatat gcataagtga aaaatctgga agtgtaatgg tagaacataa 3301 aacttgtatt gcttctgttt cagtgcaaaa atgtactagc caatacgctt aagtgtgtgg 3361 cccatgaatt gaacaattta accttgaagt ctatatccgt gatattatgt cgatttttaa 3421 ctgaggggaa attaactagt ccagcctaaa atgcttcttt taatctgcat tctgtttcct 3481 cttctagttg tgccattact agtgatcatg tttttttccc ccctttaatg aaaacaataa 3541 acatctattt gagacaatta aaatccttct gggggcactg gaagcacaat acggtgacca 3601 atcttgcttt catttttttt tctttttaat ttgaaccatg attttgctag aaatagaagg 3661 cccagtggtg gaatattaga gggaaggaaa ctgacaacgt gtgaaagtta // LOCUS HUMCARAA 1766 bp mRNA PRI 23-OCT-1991 DEFINITION Human carboxylesterase mRNA, complete cds. ACCESSION M73499 NID g179927 KEYWORDS carboxylesterase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1766) AUTHORS Munger,J.S., Shi,G.-P., Mark,E.A., Chin,D.T., Gerard,C. and Chapman,H.A. TITLE A serine esterase released by human alveolar macrophages is closely related to liver microsomal carboxylesterases JOURNAL J. Biol. Chem. 266, 18832-18838 (1991) MEDLINE 92011649 FEATURES Location/Qualifiers source 1..1766 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="alveolar macrophage" CDS 30..1733 /codon_start=1 /product="carboxylesterase" /db_xref="PID:g179928" /translation="MWLRAFILATLSASAAWGHPSSPPVVDTVHGKVLGKFVSLEGFA QPVAIFLGIPFAKPPLGPLRFTPPQPAEPWSFVKNATSYPPMCTQDPKAGQLLSELFT NRKENIPLKLSEDCLYLNIYTPADLTKKNRLPVMVWIHGGGLMVGAASTYDGLALAAH ENVVVVTIQYRLGIWGFFSTGDEHSRGNWGHLDQVAALRWVQDNIASFGGNPGSVTIF GESAGGESVSVLVLSPLAKNLFHRAISESGVALTSVLVKKGDVKPLAEQIAITAGCKT TTSAVMVHCLRQKTEEELLETTLKMKFLSLDLQGDPRESQPLLGTVIDGMLLLKTPEE LQAERNFHTVPYMVGINKQEFGWLIPMQLMSYPLSEGQLDQKTAMSLLWKSYPLVCIA KELIPEATEKYLGGTDDTVKKKDLFLDLIADVMFGVPSVIVARNHRDAGAPTYMYEFQ YRPSFSSDMKPKTVIGDHGDELFSVFGAPFLKEGASEEEIRLSKMVMKFWANFARNGN PNGEGLPHWPEYNQKEGYLQIGANTQAAQKLKDKEVAFWTNLFAKKAVEKPPQTEHIE L" BASE COUNT 431 a 452 c 484 g 399 t ORIGIN 1 ctctaaagcg agaactgtcg cccttcacga tgtggctccg tgcctttatc ctggccactc 61 tctctgcttc cgcggcttgg gggcatccgt cctcgccacc tgtggtggac accgtgcatg 121 gcaaagtgct ggggaagttc gtcagcttag aaggatttgc acagcctgtg gccattttcc 181 tgggaatccc ttttgccaag ccgcctcttg gacccctgag gtttactcca ccgcagcctg 241 cagaaccatg gagctttgtg aagaatgcca cctcgtaccc tcctatgtgc acccaagatc 301 ccaaggcggg gcagttactc tcagagctat ttacaaaccg aaaggagaac attcctctca 361 agctttctga agactgtctt tacctcaata tttacactcc tgctgacttg accaagaaaa 421 acaggctgcc ggtgatggtg tggatccacg gaggggggct gatggtgggt gcggcatcaa 481 cctatgatgg gctggccctt gctgcccatg aaaacgtggt ggtggtgacc attcaatatc 541 gcctgggcat ctggggattc ttcagcacag gggatgaaca cagccggggg aactggggtc 601 acctggacca ggtggctgcc ctgcgctggg tccaggacaa cattgccagc tttggaggga 661 acccaggctc tgtgaccatc tttggagagt cagcgggagg agaaagtgtc tctgttcttg 721 ttttgtctcc attggccaag aacctcttcc accgggccat ttctgagagt ggcgtggccc 781 tcacttctgt tctggtgaag aaaggtgatg tcaagccctt ggctgagcaa attgctatca 841 ctgctgggtg caaaaccacc acctctgctg tcatggttca ctgcctgcga cagaagacgg 901 aagaggagct cttggagacg acattgaaaa tgaaattctt atctctggac ttacagggag 961 accccagaga gagtcaaccc cttctgggca ctgtgattga tgggatgctg ctgctgaaaa 1021 cacctgaaga gcttcaagct gaaaggaatt tccacactgt cccctacatg gtcggaatta 1081 acaagcagga gtttggctgg ttgattccaa tgcagttgat gagctatcca ctctccgaag 1141 ggcaactgga ccagaagaca gccatgtcac tcctgtggaa gtcctatccc cttgtttgca 1201 ttgctaagga actgattcca gaagccactg agaaatactt aggaggaaca gacgacactg 1261 tcaaaaagaa agacctgttc ctggacttga tagcagatgt gatgtttggt gtcccatctg 1321 tgattgtggc ccggaaccac agagatgctg gagcacccac ctacatgtat gagtttcagt 1381 accgtccaag cttctcatca gacatgaaac ccaagacggt gataggagac cacggggatg 1441 agctcttctc cgtctttggg gccccatttt taaaagaggg tgcctcagaa gaggagatca 1501 gacttagcaa gatggtgatg aaattctggg ccaactttgc tcgcaatgga aaccccaatg 1561 gggaagggct gccccactgg ccagagtaca accagaagga agggtatctg cagattggtg 1621 ccaacaccca ggcggcccag aagctgaagg acaaagaagt agctttctgg accaacctct 1681 ttgccaagaa ggcagtggag aagccacccc agacagaaca catagagctg tgaatgaaga 1741 tccagccggc cttgggagcc tggagg // LOCUS HUMCARBANH 1083 bp mRNA PRI 01-OCT-1993 DEFINITION Human nuclear-encoded mitochondrial carbonic anhydrase (CA5) mRNA, complete cds. ACCESSION L19297 NID g306482 KEYWORDS carbonic anhydrase; nuclear-encoded protein. SOURCE Homo sapiens (library: Uni-ZAP XR) liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1083) AUTHORS Nagao,Y., Platero,J.S., Waheed,A. and Sly,W.S. TITLE Human mitochondrial carbonic anhydrase: cDNA cloning, expression, subcellular localization, and mapping to chromosome 16 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90 (16), 7623-7627 (1993) MEDLINE 93361499 FEATURES Location/Qualifiers source 1..1083 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /tissue_lib="Uni-ZAP XR" gene 56..1083 /gene="CA5" sig_peptide 56..169 /gene="CA5" CDS 56..973 /gene="CA5" /EC_number="4.2.1.1" /note="mitochondrial" /codon_start=1 /product="carbonic anhydrase V" /db_xref="PID:g306483" /translation="MLGRNTWKTSAFSFLVEQMWAPLWSRSMRPGRWCSQRSCAWQTS NNTLHPLWTVPVSVPGGTRQSPINIQWRDSVYDPQLKPLRVSYEAASCLYIWNTGYLF QVEFDDATEASGISGGPLENHYRLKQFHFHWGAVNEGGSEHTVDGHAYPAELHLVHWN SVKYQNYKEAVVGENGLAVIGVFLKLGAHHQTLQRLVDILPEIKHKDARAAMRPFDPS TLLPTCWDYWTYAGSLTTPPLTESVTWIIQKEPVEVAPSQLSAFRTLLFSALGEEEKM MVNNYRPLQPLMNRKVWASFQATNEGTRS" mat_peptide 170..970 /gene="CA5" /EC_number="4.2.1.1" /product="carbonic anhydrase V" polyA_site 1083 /gene="CA5" BASE COUNT 260 a 291 c 298 g 234 t ORIGIN 1 agacagcagg gaacatcacc ctcttcagac tggagtcagt gggaacagac ccaagatgtt 61 ggggaggaac acttggaaga cctcagcttt ctccttcttg gttgagcaga tgtgggcccc 121 tctctggagt cgttcgatga ggccagggcg atggtgttct cagcgttcct gtgcatggca 181 aaccagcaat aacactttgc acccactctg gacggtcccg gtctccgtgc cagggggcac 241 ccggcagtct cctattaaca tccagtggag ggacagcgtc tatgaccccc agctgaagcc 301 actcagggtc tcctatgaag cggcatcctg cctgtacatc tggaacactg gctacctctt 361 ccaggtggaa tttgacgatg ccaccgaggc atcaggaatt agtggtgggc ccttggaaaa 421 ccactacaga ctgaagcaat ttcacttcca ctggggagca gtgaacgagg ggggctcaga 481 gcacacagtg gacggccacg cgtaccccgc agagctgcat ttagttcact ggaattctgt 541 gaaataccaa aattacaagg aagctgtcgt gggagagaat ggtttggctg tgataggcgt 601 gtttttaaag ctcggggccc atcatcagac gctgcagagg ctggtggaca tcttgccgga 661 aataaaacat aaggacgcgc gggcggccat gcgccccttc gacccctcca ctctgctgcc 721 cacctgctgg gattactgga cctacgcggg ctcgctcacc accccgccgc tgaccgagtc 781 ggtcacctgg atcatccaga aggagcccgt tgaagtggcc ccaagccagc tctctgcatt 841 tcgtactctc ctgttttctg cacttggtga agaggagaag atgatggtga acaactatcg 901 cccacttcaa cccttgatga accggaaggt ctgggcgtcc ttccaggcca ctaatgaggg 961 cacaaggtcc tagagacatt aggtccacat gaatagcaga actgactttg aaggaaggaa 1021 gcgttgtttc ccaagtttca caatgtgatt gtacatgact tctgaaatta aaaagagagc 1081 atg // LOCUS HUMCARMC 1622 bp mRNA PRI 15-MAR-1990 DEFINITION Human mast cell carboxypeptidase A mRNA, complete cds. ACCESSION M27717 NID g179933 KEYWORDS carboxypeptidase. SOURCE Human lung, cDNA to mRNA, (Clontech library HL1066b). ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1622) AUTHORS Reynolds,D.S., Gurley,D.S., Stevens,R.L., Sugarbaker,D.J., Austen,K.F. and Serafin,W.E. TITLE Cloning of cDNAs that encode human mast cell carboxypeptidase A, and comparison of the protein with mouse mast cell carboxypeptidase A and rat pancreatic carboxypeptidases JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9480-9484 (1989) MEDLINE 90083291 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.S.Reynolds, 07-SEP-1989. FEATURES Location/Qualifiers source 1..1622 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..1622 /note="CARA mRNA" sig_peptide 1..45 /note="mast cell carboxypeptidase A signal peptide" CDS 1..1254 /note="mast cell carboxypeptidase A precursor" /codon_start=1 /db_xref="PID:g179934" /translation="MRLILPVGLIATTLAIAPVRFDREKVFRVKPQDEKQADIIKDLA KTNELDFWYPGATHHVAANMMVDFRVSEKESQAIQSALDQNKMHYEILIHDLQEEIEK QFDVKEDIPGRHSYAKYNNWEKIVAWTEKMMDKYPEMVSRIKIGSTVEDNPLYVLKIG EKNERRKAIFMDCGIHAREWVSPAFCQWFVYQATKTYGRNKIMTKLLDRMNFYILPVF NVDGYIWSWTKNRMWRKNRSKNQNSKCIGTDLNRNFNASWNSIPNTNDPCADNYRGSA PESEKETKAVTNFIRSHLNEIKVYITFHSYSQMLLFPYGYTSKLPPNHEDLAKVAKIG TDVLSTRYETRYIYGPIESTIYPISGSSLDWAYDLGIKHTFAFELRDKGKFGFLLPES RIKPTCRETMLAVKFIAKYILKHTS" mat_peptide 46..327 /note="mast cell carboxypeptidase A activation peptide" mat_peptide 328..1251 /note="mast cell carboxypeptidase A" polyA_signal 1604..1609 polyA_signal 1609..1614 BASE COUNT 503 a 354 c 313 g 452 t ORIGIN 1 atgaggctca tcctgcctgt gggtttgatt gctaccactc ttgcaattgc tcctgtccgc 61 tttgacaggg agaaggtgtt ccgcgtgaag ccccaggatg aaaaacaagc agacatcata 121 aaggacttgg ccaaaaccaa tgagcttgac ttctggtatc caggtgccac ccaccacgta 181 gctgctaata tgatggtgga tttccgagtt agtgagaagg aatcccaagc catccagtct 241 gccttggatc aaaataaaat gcactatgaa atcttgattc atgatctaca agaagagatt 301 gagaaacagt ttgatgttaa agaagatatc ccaggcaggc acagctacgc aaaatacaat 361 aattgggaaa agattgtggc ttggactgaa aagatgatgg ataagtatcc tgaaatggtc 421 tctcgtatta aaattggatc tactgttgaa gataatccac tatatgttct gaagattggg 481 gaaaagaatg aaagaagaaa ggctattttt atggattgtg gcattcacgc acgagaatgg 541 gtctccccag cattctgcca gtggtttgtc tatcaggcaa ccaaaactta tgggagaaac 601 aaaattatga ccaaactctt ggaccgaatg aatttttaca ttcttcctgt gttcaatgtt 661 gatggatata tttggtcatg gacaaagaac cgcatgtgga gaaaaaatcg ttccaagaac 721 caaaactcca aatgcatcgg cactgacctc aacaggaatt ttaatgcttc atggaactcc 781 attcctaaca ccaatgaccc atgtgcagat aactatcggg gctctgcacc agagtccgag 841 aaagagacga aagctgtcac taatttcatt agaagccacc tgaatgaaat caaggtttac 901 atcaccttcc attcctactc ccagatgcta ttgtttccct atggatatac atcaaaactg 961 ccacctaacc atgaggactt ggccaaagtt gcaaagattg gcactgatgt tctatcaact 1021 cgatatgaaa cccgctacat ctatggccca atagaatcaa caatttaccc gatatcaggt 1081 tcttctttag actgggctta tgacctgggc atcaaacaca catttgcctt tgagctccga 1141 gataaaggca aatttggttt tctccttcca gaatcccgga taaagccaac gtgcagagag 1201 accatgctag ctgtcaaatt tattgccaag tatatcctca agcatacttc ctaaagaact 1261 gccctctgtt tggaataagc caattaatcc ttttttgtgc ctttcatcag aaagtcaatc 1321 ttcagttatc cccaaatgca gcttctattt cacctgaatc cttctcttgc tcatttaagt 1381 cccatgttac tgctgtttgc ttttacttac tttcagtagc accataacga agtagcttta 1441 agtgaaacct tttaactacc tttctttgct ccaagtgaag tttggaccca gcagaaagca 1501 ttattttgaa aggtgatata cagtggggca cagaaaacaa atgaaaaccc tcagtttctc 1561 acagattttc accatgtggc ttcatcaatt tatgtgctaa tacaataaaa taaaatgcac 1621 tt // LOCUS HUMCARP 941 bp mRNA PRI 13-JUL-1993 DEFINITION Homo sapiens carbonic anhydrase related protein (CARP) mRNA, complete cds. ACCESSION L04656 NID g179937 KEYWORDS carbonic anhydrase-related protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 941) AUTHORS Skaggs,L.A., Bergenhem,N.C.H., Venta,P.J. and Tashian,R.E. TITLE The deduced amino acid sequence of human carbonic anhydrase-related protein (CARP) is 98% identical to the mouse homologue JOURNAL Gene 126, 291-292 (1993) MEDLINE 93246262 FEATURES Location/Qualifiers source 1..941 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 410..871 /codon_start=1 /label=CARP_gene /product="carbonic anhydrase-related protein" /db_xref="PID:g179938" /translation="MELHLIHWNSTLFGSIDEAVGKPHGIAIIALFVQIGKEHVGLKA VTEILQDIQYKGKSKTIPCFNPNTLLPDPLLRDYWVYEGSLTIPPCSEGVTWILFRYP LTISQLQIEEFRRLRTHVKGAELVEGCDGILGDNFRPTQPLSDRVIRAAFQ" BASE COUNT 250 a 200 c 256 g 235 t ORIGIN 1 ggcggacctg agcttcatcg aagataccgt cgccttcccc gagaaggaag aggatgagga 61 ggaagaagag gagggtgtgg agtggggcta cgaggaaggt gttgagtggg gtctggtgtt 121 tcctgatgct aatggggaat accagtctcc tattaaccta aactcaagag aggctaggta 181 tgacccctcg ctgctggatg tccgcctctc cccaaattat gtggtgtgcc gagactgtga 241 agtcaccaat gatggacata ccattcaggt tatcctgaag tcaaaatcag ttctttcggg 301 aggaccattg cctcaagggc atgagtttga actgtacgaa gtgagatttc actggggaag 361 agaaaaccag cgtggttctg agcacacggt taatttcaaa gcttttccca tggagctcca 421 tctgatccac tggaactcca ctctgtttgg cagcattgat gaggctgtgg ggaagccgca 481 cggaatcgcc atcattgctc tgtttgttca gataggaaag gaacatgttg gcttgaaggc 541 tgtgactgaa atcctccaag atattcagta taaggggaag tccaaaacaa taccttgctt 601 taatcctaac actttattac cagaccctct gctgcgggat tactgggtgt atgaaggctc 661 tctcaccatc ccaccttgca gtgaaggtgt cacctggata ttattccgat accctttaac 721 tatatcccag ctacagatag aagaatttcg aaggctgagg acacatgtta agggggcaga 781 acttgtggaa ggctgtgatg ggattttggg agacaacttt cggcccactc agcctcttag 841 tgacagagtc attagagctg catttcagta gccaaagagg acaggaacaa gtctgtcttc 901 atgagggagg aagacaatgg tctataatgc ccttggataa g // LOCUS HUMCASP 2855 bp mRNA PRI 03-MAR-1994 DEFINITION Human alternatively spliced CUTL1 mRNA, complete cds. ACCESSION L12579 NID g457516 KEYWORDS alternative splicing. SOURCE Homo sapiens (library: lambda gt11) umbilical vein cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2855) AUTHORS Lievens,P.M.J. and Neufeld,E.J. TITLE Alternative splicing of CCAAT displacement protein mRNA lacks a homeodomain and cut repeats JOURNAL Unpublished (1993) FEATURES Location/Qualifiers source 1..2855 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="endothelium" /germline /tissue_type="umbilical vein" /tissue_lib="lambda gt11" 5'UTR 1..19 CDS 20..2056 /standard_name="CASP" /note="alternatively spliced" /codon_start=1 /number=2 /function="Unknown" /evidence=experimental /db_xref="PID:g457517" /translation="MAANVGSMFQYWKRFDLQQLQRELDATATVLANRQDESEQSRKR LIEQSREFKKNTPEDLRKQVAPLLKSFQGEIDALSKRSKEAEAAFLNVYKRLIDVPDP VPALDLGQQLQLKVQRLHDIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREY EQTLKNQAETIALEKEQKLQNDFAEKERKLQETQMSTTSKLEEAEHKVQSLQTALEKT RTELFDLKTKYDEETTAKADEIEMIMTDLERANQRAEVAQREAETLREQLSSANHSLQ LASQIQKAPDVEQAIEVLTRSSLEVELAAKEREIAQLVEDVQRLQASLTKLRENSASQ ISQLEQQLSAKNSTLKQLEEKLKGQADYEEVKKELNILKSMEFAPSEGAGTQDAAKPL EVLLLEKNRSLQSENAALRISNSDLSGRCAELQVRITEAVATATEQRELIARLEQDLS IIQSIQRPDAEGAAEHRLEKIPEPIKEATALFYGPAAPASGALPEGQVDSLLSIISSQ RERFRARNQELEAENRLAQHTLQALQSELDSLRADNIKLFEKIKFLQSYPGRGSGSDD TELRYSSQYEERLDPFSSFSKRERQRKYLSLSPWDKATLSMGRLVLSNKMARTIGFFY TLFLHCLVFLVLYKLAWSESMERDCATFCAKKLADHLHKFHENDNGAAAGDLWQ" misc_feature 41 /standard_name="start codon, potential, #2" /note="alternate start codon; putative" /function="second in-frame methionine" misc_feature 1276 /standard_name="splice junction" /note="Sequences for CDP and CASP diverge at this site; slice junction" /function="mRNA splice site" /evidence=experimental 3'UTR 2054..2855 BASE COUNT 734 a 838 c 781 g 502 t ORIGIN 1 cgtctcaata tgtctcaaga tggccgccaa tgtgggatcg atgtttcaat attggaagcg 61 ctttgattta cagcagctgc agagagaact cgatgccacc gcaacggtat tggcgaaccg 121 gcaggatgaa agtgagcagt ccagaaagcg gcttatcgaa cagagccggg agttcaagaa 181 gaacactcca gaggatttgc gcaagcaggt agcgccgctg ctgaagagtt tccaaggaga 241 gattgatgca ctgagtaaaa gaagcaagga agctgaagca gctttcttga atgtctacaa 301 aagattgatt gacgtcccag atcccgtacc agctttggat ctcggacagc aactccagct 361 caaagtgcag cgcctgcacg atattgaaac agagaaccag aaacttaggg aaactctgga 421 agaatacaac aaggaatttg ctgaagtgaa aaatcaagag gttacgataa aagcacttaa 481 agagaaaatc cgagaatatg aacagacact gaagaaccaa gccgaaacca tagctcttga 541 gaaggaacag aagttacaga atgactttgc agaaaaggag agaaagctgc aggagacaca 601 gatgtccacc acctcaaagc tggaggaagc tgagcataag gttcagagcc tacaaacagc 661 cctggaaaaa actcgaacag aattatttga cctgaaaacc aaatacgatg aagaaactac 721 tgcaaaggcc gacgagattg aaatgatcat gacggacctt gaaagggcaa accagagggc 781 agaggtggct cagagagagg cggagacctt aagggaacag ctctcatcgg ccaatcactc 841 cctccagctg gcctcacaga tccagaaggc accagacgtg gagcaggcca tagaggtgct 901 gacccgctcc agcctagaag ttgagttggc cgccaaggag cgggagatcg cacagctggt 961 ggaggacgtg cagagactcc aggccagcct caccaagctg cgggagaatt cggccagcca 1021 gatctcacag cttgagcagc agctgagcgc caaaaacagc acactcaaac aactggaaga 1081 aaaactcaaa ggccaggctg actatgaaga ggtgaagaaa gagctgaaca ttctgaagtc 1141 catggagttt gcaccgtccg agggcgctgg gacacaggat gcggccaagc ccctggaggt 1201 gctgttgctg gagaagaacc gctcgctgca gtccgagaac gccgcgctgc gcatctccaa 1261 cagcgacctg agcggacgct gtgcagagct gcaagtccgt atcactgagg ctgtggccac 1321 agccactgag cagagagagc tgatcgcccg cctggagcag gacctgagca tcattcagtc 1381 catccagcgg cccgatgccg agggtgccgc tgagcaccgc ctggagaaga tcccagagcc 1441 catcaaagag gccactgccc tattctacgg acctgcagca ccagccagcg gtgccctccc 1501 agagggccag gtggattcac tgctttccat catctccagc cagagggagc gcttccgtgc 1561 ccggaaccag gagcttgagg ccgagaaccg cctggcccag cacaccctcc aggccctgca 1621 gagtgagctg gacagcctgc gcgccgacaa catcaagctc tttgagaaga tcaagttcct 1681 gcagagctac cctggccggg gcagcggcag tgatgacacg gagctgcggt actcgtccca 1741 gtacgaggag cgcctggacc ccttctcctc cttcagcaag cgggagcggc agaggaagta 1801 cctgagcttg agtccctggg acaaggccac cctcagcatg gggcgtctgg ttctctccaa 1861 caagatggcg cgcaccatcg gcttcttcta cacactgttc ctgcactgcc tggtcttcct 1921 ggtgctctac aagctggcat ggagcgagag catggagagg gactgtgcca ccttctgcgc 1981 caagaagctc gctgaccacc tgcacaagtt ccacgagaat gacaacgggg ctgcggctgg 2041 tgacttgtgg cagtgatacc ccggggcctc ccccgtgaca gtgacggctg cgcctccacc 2101 ccgactgctc agtgcatcta atcacttaga ctcccctgaa gaatccccca tggaaactgc 2161 ccttatccgc tgtccagcag ctgccagagg ccccaggtca cctcgggtcc ccttgaaaga 2221 atgtctcggt cacatcaggc ccgctaggtc cagagagcga gcccccaatg cccggccagg 2281 ctaagccgca gagaccctct cagcccccac ctcaggttag ggctctgccc gcagcctgac 2341 ctctagccct ggtggcagag gtccctcagc tgcgaggcta attgggtgac caccgattcc 2401 agctgcggtt aatccagctt gggcctgtct gcactgcgat cctcttgggc tctcctagga 2461 tccccccatg ccccgtaaga ggtggaagac gcttccttcc aggacagcag gctttggagt 2521 ccgacacccc cagcctgcct ttgccaccag ccccaaccct gcagagatat gaggcttgac 2581 agagtctgcc ccctccccca ctgcacccca agagagagag ccccagccag cggaacagtt 2641 tctattaccc cctccctgcc cccagaccca tgtgatttct gctttcttct ttagcaagat 2701 attctggttt ctagataagg aagagtctct aatgagcccc cgagccccag tctcttcaga 2761 ctcatggatt ggtctgaggg gtctgaacgt ctcctagcca atcagaactg gctgtggacc 2821 accctagcac ggccacctct cagggcactg gcagg // LOCUS HUMCATHB 1996 bp mRNA PRI 07-APR-1994 DEFINITION Homo sapiens cathepsin B mRNA, complete cds. ACCESSION L16510 NID g291887 KEYWORDS cathepsin B; lysosomal cysteine protease. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1996) AUTHORS Cao,L., Taggart,R.T., Berquin,I.M., Moin,K., Fong,D. and Sloane,B.F. TITLE Human gastric adenocarcinoma cathepsin B: isolation and sequencing of full-length cDNAs and polymorphisms of the gene JOURNAL Gene 139 (2), 163-169 (1994) MEDLINE 94156194 FEATURES Location/Qualifiers source 1..1996 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="carcinoma" /cell_type="gastric adenocarcinoma cell" /cell_line="AGS 1-6-30-1" gene 178..1197 /gene="CTSB" CDS 178..1197 /gene="CTSB" /codon_start=1 /product="cathepsin B" /db_xref="PID:g291888" /translation="MWQLWASLCCLLVLANARSRPSFHPLSDELVNYVNKRNTTWQAG HNFYNVDMSYLKRLCGTFLGGPKPPQRVMFTEDLKLPASFDAREQWPQCPTIKEIRDQ GSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWN FWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPT YKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMG GHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQ YWEKI" BASE COUNT 461 a 549 c 547 g 439 t ORIGIN 1 tccggcaacg ccaaccgctc cgctgcgcgc aggctgggct gcaggctctc ggctgcagcg 61 ctgggctggt gtgcagtggt gcgaccacgg ctcacggcag cctcagccac ccagatgtaa 121 gcgatctggt tcccacctca gcctcccgag tagtggatct aggatccggc ttccaacatg 181 tggcagctct gggcctccct ctgctgcctg ctggtgttgg ccaatgcccg gagcaggccc 241 tctttccatc ccctgtcgga tgagctggtc aactatgtca acaaacggaa taccacgtgg 301 caggccgggc acaacttcta caacgtggac atgagctact tgaagaggct atgtggtacc 361 ttcctgggtg ggcccaagcc accccagaga gttatgttta ccgaggacct gaagctgcct 421 gcaagcttcg atgcacggga acaatggcca cagtgtccca ccatcaaaga gatcagagac 481 cagggctcct gtggctcctg ctgggccttc ggggctgtgg aagccatctc tgaccggatc 541 tgcatccaca ccaatgcgca cgtcagcgtg gaggtgtcgg cggaggacct gctcacatgc 601 tgtggcagca tgtgtgggga cggctgtaat ggtggctatc ctgctgaagc ttggaacttc 661 tggacaagaa aaggcctggt ttctggtggc ctctatgaat cccatgtagg gtgcagaccg 721 tactccatcc ctccctgtga gcaccacgtc aacggctccc ggcccccatg cacgggggag 781 ggagataccc ccaagtgtag caagatctgt gagcctggct acagcccgac ctacaaacag 841 gacaagcact acggatacaa ttcctacagc gtctccaata gcgagaagga catcatggcc 901 gagatctaca aaaacggccc cgtggaggga gctttctctg tgtattcgga cttcctgctc 961 tacaagtcag gagtgtacca acacgtcacc ggagagatga tgggtggcca tgccatccgc 1021 atcctgggct ggggagtgga gaatggcaca ccctactggc tggttgccaa ctcctggaac 1081 actgactggg gtgacaatgg cttctttaaa atactcagag gacaggatca ctgtggaatc 1141 gaatcagaag tggtggctgg aattccacgc accgatcagt actgggaaaa gatctaatct 1201 gccgtgggcc tgtcgtgcca gtcctggggg cgagatcggg gtagaaatgc attttattct 1261 ttaagttcac gtaagataca agtttcaggc agggtctgaa ggactggatt ggccaaacat 1321 cagacctgtc ttccaaggag accaagtcct ggctacatcc cagcctgtgg ttacagtgca 1381 gacaggccat gtgagccacc gctgccagca cagagcgtcc ttccccctgt agactagtgc 1441 cgtgggagta cctgctgccc agctgctgtg gccccctccg tgatccatcc atctccaggg 1501 agcaagacag agacgcagga tggaaagcgg agttcctaac aggatgaaag ttcccccatc 1561 agttccccca gtacctccaa gcaagtagct ttccacattt gtcacagaaa tcagaggaga 1621 gatggtgttg ggagcccttt ggagaacgcc agtctccagg tccccctgca tctatcgagt 1681 ttgcaatgtc acaacctctc tgatcttgtg ctcagcatga ttctttaata gaagttttat 1741 ttttcgtgca ctctgctaat catgtgggtg agccagtgga acagcgggag cctgtgctgg 1801 tttgcagatt gcctcctaat gacgcggctc aaaaggaaac caagtggtca ggagttgttt 1861 ctgacccact gatctctact accacaagga aaatagttta ggagaaacca gcttttactg 1921 tttttgaaaa attacagctt caccctgtca agttaacaag gaatgcctgt gccaataaaa 1981 ggtttctcca acttga // LOCUS HUMCATSS 1255 bp mRNA PRI 25-FEB-1992 DEFINITION Human cathepsin S mRNA, complete cds. ACCESSION M86553 NID g179958 KEYWORDS cathepsin; cathepsin S; cysteine protease. SOURCE Homo sapiens mature lung cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1255) AUTHORS Chapman,H.A. TITLE Molecular cloning and expression of human alveolar macrophage cathepsin S, an elastinolytic cysteine protease JOURNAL Unpublished (1992) FEATURES Location/Qualifiers source 1..1255 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="alveolar macrophage cells" /dev_stage="mature" /germline /tissue_type="lung" CDS 7..1002 /standard_name="cathepsin S" /codon_start=1 /function="cysteine protease" /product="cathepsin" /db_xref="PID:g179959" /translation="MKRLVCVLLVCSSAVAQLHKDPTLDHHWHLWKKTYGKQYKEKNE EAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLTSSLRVPSQWQ RNITYKSNPNRILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLV TLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSK YRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNV NHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYPEI" BASE COUNT 369 a 253 c 305 g 328 t ORIGIN 1 ctaaagatga aacggctggt ttgtgtgctc ttggtgtgct cctctgcagt ggcacagttg 61 cataaagatc ctaccctgga tcaccactgg catctctgga agaaaaccta tggcaaacaa 121 tacaaggaaa agaatgaaga agcagtacga cgtctcatct gggaaaagaa tctaaagttt 181 gtgatgcttc acaacctgga gcattcaatg ggaatgcact catacgatct gggcatgaac 241 cacctgggag acatgaccag tgaagaagtg atgtctttga cgagttccct gagagttccc 301 agccagtggc agagaaatat cacatataag tcaaacccta atcggatatt gcctgattct 361 gtggactgga gagagaaagg gtgtgttact gaagtgaaat atcaaggttc ttgtggtgct 421 tgctgggctt tcagtgctgt gggggccctg gaagcacagc tgaagctgaa aacaggaaag 481 ctggtgactc tcagtgccca gaacctggtg gattgctcaa ctgaaaaata tggaaacaaa 541 ggctgcaatg gtggcttcat gacaacggct ttccagtaca tcattgataa caagggcatc 601 gactcagacg cttcctatcc ctacaaagcc atggatcaga aatgtcaata tgactcaaaa 661 tatcgtgctg ccacatgttc aaagtacact gaacttcctt atgggagaga agatgtcctg 721 aaagaagctg tggccaataa aggcccagtg tctgttggtg tagatgcgcg tcatccttct 781 ttcttcctct acagaagtgg tgtctactat gaaccatcct gtactcagaa tgtgaatcat 841 ggtgtacttg tggttggcta tggtgatctt aatgggaaag aatactggct tgtgaaaaac 901 agctggggcc acaactttgg tgaagaagga tatattcgga tggcaagaaa taaaggaaat 961 cattgtggga ttgctagctt tccctcttac ccagaaatct agaggatctc tcctttttat 1021 aacaaatcaa gaaatatgaa gcactttctc ttaacttaat ttttcctgct gtatccagaa 1081 gaaataattg tgtcatgatt aatgtgtatt tactgtacta atagaaaata tagtttgagg 1141 ccgggcactg tctggctcac gcctgtaatc ccagtacttg ggaggccaag gaggcatatc 1201 aacttgaggc caggagttaa agagcagcct ggctaactgt gaaaccctct ctact // LOCUS HUMCBF 3216 bp mRNA PRI 31-OCT-1994 DEFINITION Human CCAAT-box-binding factor (CBF) mRNA, complete cds. ACCESSION M37197 NID g179968 KEYWORDS CCAAT-box-binding factor; transcription factor. SOURCE Human W138 and U2-OS cell lines, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3216) AUTHORS Lum,L.S., Sultzman,L.A., Kaufman,R.J., Linzer,D.I. and Wu,B.J. TITLE A cloned human CCAAT-box-binding factor stimulates transcription from the human hsp70 promoter JOURNAL Mol. Cell. Biol. 10 (12), 6709-6717 (1990) MEDLINE 91061780 COMMENT Draft entry and computer-readable sequence for [Unpublished (1990)] kindly submitted by B.Wu, 01-AUG-1990. Author address: B.Wu Dept. of Biochemistry Northwestern University 2153 Sheridan Rd. Evanston, IL 60208. FEATURES Location/Qualifiers source 1..3216 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="W38 and U2-OS" /tissue_lib="lambda-gt11 and lambda-ZAP" /map="Unassigned" mRNA 1..3216 /partial /gene="CBF" /product="CCAAT-box-binding factor" gene 1..3216 /gene="CBF" gene 12..3008 /gene="CEBP" CDS 12..3008 /gene="CEBP" /codon_start=1 /db_xref="GDB:G00-128-839" /product="CCAAT-box-binding factor" /db_xref="PID:g179969" /translation="MAAVKEPLEFHAKRPWRPEEAVEDPDEEDEDNTSEAENGFSLEE VLRLGGTKQDYLMLATLDENEEVIDGGKKGAIDDLQQGELEAFIQNLNLAKYTKASLI EEDEPAEKENSSKKEVKIPKINNKNTAESQRTSVNKVKNKNRPEPHSDENGSTTPKVK KDKQNIFEFFERQTLLLRPGGKWYDLEYSNEYSLKPQPQDVVSKYKTLAQKLYQHEIN LFKSKTNSQKGASSTWMKAIVSSGTLGDRMAAMILLIQDDAVHTLQFVETLVNLVKKK GSKQQCLMALDTFKELLITDLLPDNRKLRIFSQRPFDKLEQLSSGNKDSRDRRLILWY FEHQLKHLVAEFVQVLETLSHDTLVTTKTRALTVAHELLCNKPEEEKALLVQVVNKLG DPQNRIATKASHLLETLLCKHPNMKGVVSGEVERLLFRSNISSKAQYYAICFLNQMAL SHEESELANKLITVYFCFFRTCVKKKDVESKMLSALLTGVNRAYPYSQTGDDKVREQI DTLFKVLHIVNFNTSVQALMLLFQVMNSQQTISDRYYTALYRKMLDPGLMTCSKQAMF LNLVYKSLKADIVLRRVKAFVKGLLQVTCQQMPPFICGALYLVSEILKAKPGLRSQLD DHPESDDEENFIDANDDEDMEKFTDADKETEIVKKLETEETVPETDVETKKPEVASWV HFDNLKGGKQLNKYDPFSRNPLFCGAENTSLWELKKLSVHFHPSVALFAKTILQGNYI QYSGDPLQDFTLMRFLDRFVYRNPKPHKGKENTDSVVMQPKRKHFIKDIRHLPVNSKE FLAKEESQIPVDEVFFHRYYKKVAVKEKQKRDADEESIEDVDDEEFEELIDTFEDDNC FSSGKDDMDFAGNVKKRTKGAKDNTLDEDSEGSDDELGNLDDDEVSLGSMDDEEFAEV DEDGGTFMDVLDDESESVPELEVHSKVSTKKSKRKGTDDFDFAGSFQGPRKKKRNLND SSLFVSAEEFGHLLDENMGSKFDNIA" BASE COUNT 1123 a 539 c 722 g 832 t ORIGIN 1 gctttgccgc catggccgca gtcaaggagc ctttggagtt ccatgccaag cggccttggc 61 gccccgagga ggcagtagaa gatccggacg aggaggatga ggataatact agtgaagccg 121 agaatgggtt ctccctggag gaagtgttac ggctcggagg caccaagcaa gattacctta 181 tgctggctac tttggatgag aatgaggaag tgatagatgg aggcaaaaaa ggagcaatcg 241 atgaccttca gcaaggtgaa ttggaagcat ttattcaaaa tcttaatttg gcgaagtata 301 caaaagcttc cttaattgaa gaagatgaac cagctgaaaa agaaaattcc agcaaaaaag 361 aagtaaaaat acctaaaata aataataaaa atacagcaga aagtcaaagg acatcagtta 421 ataaggtgaa aaataagaat aggccagaac cacattctga tgagaatggc agtaccacac 481 cgaaagtaaa gaaagataaa cagaacatct ttgaattttt tgagagacag actttgttac 541 ttaggcctgg aggcaaatgg tatgatctgg agtacagcaa tgaatattct ttgaaacccc 601 agcctcagga tgttgtatct aagtacaaaa cccttgctca gaagctgtat cagcatgaaa 661 tcaacttatt caaaagtaag acgaatagtc aaaagggagc ctcttctacc tggatgaagg 721 caattgtgtc atcggggaca ctaggtgaca ggatggcagc catgattctt cttattcagg 781 atgatgccgt tcacacactt cagtttgtag aaactcttgt gaaccttgtt aaaaagaagg 841 gcagcaaaca gcagtgcctt atggccttgg atactttcaa agagttgctt atcacagacc 901 ttttgccaga caatcggaag ctgaggattt tcagccagcg tccttttgac aaactggaac 961 agttgtccag tggcaacaag gactcaagag atagaagact gatattatgg tattttgaac 1021 accagctgaa acacttagtg gctgaatttg tgcaggtctt agaaacttta agtcatgata 1081 cattagtaac cactaaaact cgagccctta ccgtggctca tgagctgctt tgtaacaagc 1141 ctgaggaaga aaaggctctt cttgtgcaag tggtaaataa actgggagat cctcagaaca 1201 gaattgccac aaaagcatcc catctgttag agacattact ttgtaaacat cccaatatga 1261 aaggagttgt gtctggtgaa gtagaaaggc tactcttccg ctcaaatatc agctccaaag 1321 ctcaatatta tgcaatttgc tttttaaatc aaatggctct gtcccatgaa gaaagtgaat 1381 tggctaacaa attaataact gtttactttt gcttttttcg gacttgtgtc aaaaaaaaag 1441 atgttgaatc aaaaatgctt agcgcccttt taacaggtgt gaatagggca tacccttatt 1501 cccagactgg tgatgacaaa gtaagggagc agattgacac actgtttaaa gtgttgcata 1561 ttgtgaattt taataccagt gtccaggctt taatgttgct tttccaagta atgaattctc 1621 agcagacaat atcggatcga tattacacag cattatacag gaagatgttg gatccagggt 1681 tgatgacgtg ttccaagcaa gctatgtttc ttaaccttgt ctacaaatct ctgaaagctg 1741 acattgtgtt gcgccgggtg aaggcttttg tgaaggggtt acttcaagtt acttgtcaac 1801 agatgccacc atttatatgt ggagctttat atcttgtgtc tgagatcctt aaagcaaaac 1861 caggtttaag aagccaacta gatgatcatc cggagtctga tgatgaagaa aattttattg 1921 atgcaaatga tgatgaagac atggaaaaat tcactgatgc agacaaagaa acagagatag 1981 tgaaaaaact tgagacagag gaaacagttc ctgaaactga tgtagaaacc aaaaaaccag 2041 aggttgcttc ctgggtgcac tttgataatt tgaaaggtgg gaaacagtta aataaatacg 2101 atccattcag tagaaaccct ctgttctgtg gagctgaaaa tacaagtctt tgggaactca 2161 aaaagttatc tgtgcatttt catccctccg tggccctttt tgcaaagacc atccttcagg 2221 gaaactatat tcagtattca ggggacccac tgcaggattt cactctaatg agatttttgg 2281 atcgatttgt ataccgaaat ccaaagcccc ataaaggcaa agaaaacaca gatagtgttg 2341 tgatgcagcc gaaaagaaaa cattttatta aggatattcg tcatcttcct gtgaacagta 2401 aggagttcct tgcaaaagaa gaaagccaaa taccagtgga tgaagtgttt ttccacaggt 2461 attataaaaa agttgctgtt aaagagaaac aaaaacggga tgcagatgaa gaaagtatag 2521 aagacgtgga tgatgaagaa tttgaagagc tgattgacac atttgaagat gataactgtt 2581 tcagctctgg aaaggatgat atggattttg ctggaaacgt gaaaaagaga acaaaaggag 2641 ctaaggataa cacattagat gaagattcag aaggtagtga tgatgaactt ggtaacctgg 2701 atgacgatga agtttcttta ggaagtatgg atgatgaaga atttgctgaa gttgatgaag 2761 atggaggaac attcatggat gtgttagatg atgaaagtga gagcgttcca gaacttgaag 2821 tccactccaa agtcagtact aagaaaagca agagaaaagg tacagatgat tttgactttg 2881 ctggctcatt tcaagggcca agaaaaaaga aaagaaactt aaatgattcc agcctatttg 2941 tatctgctga agagtttggc catctattgg atgaaaatat gggatccaag tttgataaca 3001 ttgcatgaat gccatggcta acaaagataa tgcaagtctc aaacagctta gatgggaggc 3061 tgaacgtgat gactggctac acaacagaga tgcaaaaagt atcatcaaga aaaagaaaca 3121 ttttaaaaag aagaggatta aaaccactca aaaaactaaa aaacaaagga aatgagttat 3181 taatgtaaat tatagattaa aattctactt acatct // LOCUS HUMCBG 1422 bp mRNA PRI 08-AUG-1995 DEFINITION Human corticosteroid binding globulin mRNA, complete cds. ACCESSION J02943 NID g179970 KEYWORDS corticosteroid binding globulin; plasma transport protein; transcortin. SOURCE Homo sapiens liver and lung cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1422) AUTHORS Hammond,G.L., Smith,C.L., Goping,I.S., Underhill,D.A., Harley,M.J., Reventos,J., Musto,N.A., Gunsalus,G.L. and Bardin,C.W. TITLE Primary structure of human corticosteroid binding globulin, deduced from hepatic and pulmonary cDNAs, exhibits homology with serine protease inhibitors JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84 (15), 5153-5157 (1987) MEDLINE 87260947 COMMENT Draft entry and printed copy of sequence [1] kindly provided by G.L.Hammond, 01-JUL-1987. FEATURES Location/Qualifiers source 1..1422 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver and lung" /map="14q31-q32.1" mRNA <1..1422 /note="CBG mRNA" gene 36..1253 /gene="CBG" sig_peptide 36..101 /gene="CBG" /note="corticosteroid binding globulin signal peptide" CDS 36..1253 /gene="CBG" /note="corticosteroid binding globulin precursor" /codon_start=1 /db_xref="GDB:G00-127-865" /db_xref="PID:g179971" /translation="MPLLLYTCLLWLPTSGLWTVQAMDPNAAYVNMSNHHRGLASANV DFAFSLYKHLVALSPKKNIFISPVSISMALAMLSLGTCGHTRAQLLQGLGFNLTERSE TEIHQGFQHLHQLFAKSDTSLEMTMGNALFLDGSLELLESFSADIKHYYESEVLAMNF QDWATASRQINSYVKNKTQGKIVDLFSGLDSPAILVLVNYIFFKGTWTQPFDLASTRE ENFYVDETTVVKVPMMLQSSTISYLHDSELPCQLVQMNYVGNGTVFFILPDKGKMNTV IAALSRDTINRWSAGLTSSQVDLYIPKVTISGVYDLGDVLEEMGIADLFTNQANFSRI TQDAQLKSSKVVHKAVLQLNEEGVDTAGSTGVTLNLTSKPIILRFNQPFIIMIFDHFT WSSLFLARVMNPV" variation 54 /gene="CBG" /note="a in liver; g in lung" /replace="g" mat_peptide 102..1250 /gene="CBG" /note="corticosteroid binding globulin" BASE COUNT 340 a 406 c 344 g 332 t ORIGIN 18 bp upstream of HaeIII site. 1 cagcctaccg cagactggcc tggctatact ggacaatgcc actcctcctg tacacctgtc 61 ttctctggct gcccaccagc ggcctctgga ccgtccaggc catggatcct aacgctgctt 121 atgtgaacat gagtaaccat caccggggcc tggcttcagc caacgttgac tttgccttca 181 gcctgtataa gcacctagtg gccttgagtc ccaaaaagaa cattttcatc tcccctgtga 241 gcatctccat ggccttagct atgctgtccc tgggcacctg tggccacaca cgggcccagc 301 ttctccaggg cctgggtttc aacctcactg agaggtctga gactgagatc caccagggtt 361 tccagcacct gcaccaactc tttgcaaagt cagacaccag cttagaaatg actatgggca 421 atgccttgtt tcttgatggc agcctggagt tgctggagtc attctcagca gacatcaagc 481 actactatga gtcagaggtc ttggctatga atttccagga ctgggcaaca gccagcagac 541 agatcaacag ctatgtcaag aataagacac aggggaaaat tgtcgacttg ttttcagggc 601 tggatagccc agccatcctc gtcctggtca actatatctt cttcaaaggc acatggacac 661 agccctttga cctggcaagc accagggagg agaacttcta tgtggacgag acaactgtgg 721 tgaaggtgcc catgatgttg cagtcgagca ccatcagtta ccttcatgac tcagagctcc 781 cctgccagct ggtgcagatg aactacgtgg gcaatgggac tgtcttcttc atccttccgg 841 acaaggggaa gatgaacaca gtcatcgctg cactgagccg ggacacgatt aacaggtggt 901 ccgcaggcct gaccagcagc caggtggacc tgtacattcc aaaggtcacc atctctggag 961 tctatgacct tggagatgtg ctggaggaaa tgggcattgc agacttgttc accaaccagg 1021 caaatttctc acgcatcacc caggacgccc agctgaagtc atcaaaggtg gtccataaag 1081 ctgtgctgca actcaatgag gagggtgtgg acacagctgg ctccactggg gtcaccctaa 1141 acctgacgtc caagcctatc atcttgcgtt tcaaccagcc cttcatcatc atgatcttcg 1201 accacttcac ctggagcagc cttttcctgg cgagggttat gaacccagtg taagagacca 1261 cccacccaga gcctcagcac tgtctgactt tgggaaccag ggatcccaca gaaatgtttt 1321 ggagagcggg aggtttcccc caatctcctc caagttcttc tccctccaac cagagttgtg 1381 tctaacttta ggcatctttt aataaatgtc attgcgactc tg // LOCUS HUMCBSA 2500 bp mRNA PRI 29-APR-1996 DEFINITION Homo sapiens cystathionine beta-synthase (CBS) mRNA, complete cds. ACCESSION L14577 NID g1289361 KEYWORDS CBS gene; cystathionine beta-synthase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2500) AUTHORS Kruger,W.D. and Cox,D.R. TITLE A yeast system for expression of human cystathionine beta-synthase: structural and functional conservation of the human and yeast genes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91 (14), 6614-6618 (1994) MEDLINE 94294429 REFERENCE 2 (bases 1 to 2500) AUTHORS Kruger,W.D. TITLE Direct Submission JOURNAL Submitted (18-NOV-1993) Warren D. Kruger, Population Science, Fox Chase Cancer Center, Philadelphia, PA 19111, USA REFERENCE 3 (bases 1 to 2500) AUTHORS Kruger,W.D. TITLE Direct Submission JOURNAL Submitted (01-MAR-1996) Warren D. Kruger, Population Science, Fox Chase Cancer Center, Philadelphia, PA 19111, USA FEATURES Location/Qualifiers source 1..2500 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="hep G2" /map="21q22.3" /chromosome="21" 5'UTR 1..158 gene 159..1814 /gene="CBS" CDS 159..1814 /gene="CBS" /codon_start=1 /product="cystathionine beta-synthase" /db_xref="PID:g1289362" /translation="MPSETPQAEVGPTGCPHRSGPHSAKGSLEKGSPEDKEAKEPLWI RPDAPSRCTWQLGRPASESPHHHTAPAKSPKILPDILKKIGDTPMVRINKIGKKFGLK CELLAKCEFFNAGGSVKDRISLRMIEDAERDGTLKPGDTIIEPTSGNTGIGLALAAAV RGYRCIIVMPEKMSSEKVDVLRALGAEIVRTPTNARFDSPESHVGVAWRLKNEIPNSH ILDQYRNASNPLAHYDTTADEILQQCDGKLDMLVASVGTGGTITGIARKLKEKCPGCR IIGVDPEGSILAEPEELNQTEQTTYEVEGIGYDFIPTVLDRTVVDKWFKSNDEEAFTF ARMLIAQEGLLCGGSAGSTVAVAVKAAQELQEGQRCVVILPDSVRNYMTKFLSDRWML QKGFLKEEDLTEKKPWWWHLRVQELGLSAPLTVLPTITCGHTIEILREKGFDQAPVVD EAGVILGMVTLGNMLSSLLAGKVQPSDQVGKVIYKQFKQIRLTDTLGRLSHILEMDHF ALVVHEQIQYHSTGKSSQRQMVFGVVTAIDLLNFVAAQERDQK" 3'UTR 1815..2500 BASE COUNT 545 a 695 c 772 g 488 t ORIGIN 1 tgaatcgccc ggggtcgccg tctccgcctc gccgcagtcg gggcagccgc tgccctcttt 61 tccatgtatc gtccaggatc ccatgacaga ttctgttgtc acgtctcctt acagagtttg 121 agcggtgctg aactgtcagc acatctgtcc ggtccagcat gccttctgag accccccagg 181 cagaagtggg gcccacaggc tgcccccacc gctcagggcc acactcggcg aaggggagcc 241 tggagaaggg gtccccagag gataaggaag ccaaggagcc cctgtggatc cggcccgatg 301 ctccgagcag gtgcacctgg cagctgggcc ggcctgcctc cgagtcccca catcaccaca 361 ctgccccggc aaaatctcca aaaatcttgc cagatattct gaagaaaatc ggggacaccc 421 ctatggtcag aatcaacaag attgggaaga agttcggcct gaagtgtgag ctcttggcca 481 agtgtgagtt cttcaacgcg ggcgggagcg tgaaggaccg catcagcctg cggatgattg 541 aggatgctga gcgcgacggg acgctgaagc ccggggacac gattatcgag ccgacatccg 601 ggaacaccgg gatcgggctg gccctggctg cggcagtgag gggctatcgc tgcatcatcg 661 tgatgccaga gaagatgagc tccgagaagg tggacgtgct gcgggcactg ggggctgaga 721 ttgtgaggac gcccaccaat gccaggttcg actccccgga gtcacacgtg ggggtggcct 781 ggcggctgaa gaacgaaatc cccaattctc acatcctaga ccagtaccgc aacgccagca 841 accccctggc tcactacgac accaccgctg atgagatcct gcagcagtgt gatgggaagc 901 tggacatgct ggtggcttca gtgggcacgg gcggcaccat cacgggcatt gccaggaagc 961 tgaaggagaa gtgtcctgga tgcaggatca ttggggtgga tcccgaaggg tccatcctcg 1021 cagagccgga ggagctgaac cagacggagc agacaaccta cgaggtggaa gggatcggct 1081 acgacttcat ccccacggtg ctggacagga cggtggtgga caagtggttc aagagcaacg 1141 atgaggaggc gttcaccttt gcccgcatgc tgatcgcgca agaggggctg ctgtgcggtg 1201 gcagtgctgg cagcacggtg gcggtggccg tgaaggctgc gcaggagctg caggagggcc 1261 agcgctgcgt ggtcattctg cccgactcag tgcggaacta catgaccaag ttcctgagcg 1321 acaggtggat gctgcagaag ggctttctga aggaggagga cctcacggag aagaagccct 1381 ggtggtggca cctccgtgtt caggagctgg gcctgtcagc cccgctgacc gtgctcccga 1441 ccatcacctg tgggcacacc atcgagatcc tccgggagaa gggcttcgac caggcgcccg 1501 tggtggatga ggcgggggta atcctgggaa tggtgacgct tgggaacatg ctctcgtccc 1561 tgcttgccgg gaaggtgcag ccgtcagacc aagttggcaa agtcatctac aagcagttca 1621 aacagatccg cctcacggac acgctgggca ggctctcgca catcctggag atggaccact 1681 tcgccctggt ggtgcacgag cagatccagt accacagcac cgggaagtcc agtcagcggc 1741 agatggtgtt cggggtggtc accgccattg acttgctgaa cttcgtggcc gcccaggagc 1801 gggaccagaa gtgaagtccg gagcgctggg cggtgcggag cgggcccgcc acccttgccc 1861 acttctcctt cgctttcctg agccctaaac acacgcgtga ttggtaactg cctggcctgg 1921 caccgttatc cctgcagacg gcacagagca tccgtctccc ctcgttaaca catggcttcc 1981 taaatggccc tgtttacggc ctatgagatg aaatatgtga ttttctctaa tgtaacttcc 2041 tcttaggatg tttcaccaag gaaatattga gagagaagtc ggccaggtag gatgaacaca 2101 ggcaatgact gcgcagagtg gattaaaggc aaaagagaga agagtccagg aaggggcggg 2161 gagaagcctg ggtggctcag catcctccac gggctgcgcg tctgctcggg gctgagctgg 2221 cgggagcagt ttgcgtgttt gggtttttta attgagatga aattcaaata acctaaaaat 2281 caatcacttg aaagtgaaca atcagcggca tttagtacat ccagaaagtt gtgtaggcac 2341 cacctctgtc acgttctgga acattctgtc atcaccccgt gaagcaatca tttcccctcc 2401 cgtcttcctc ctcccctggc aactgctgat cgactttgtg tctctgttgt ctaaaatagg 2461 ttttccctgt tctggacatt tcatataaat ggaatcacac // LOCUS HUMCCC5 5444 bp mRNA PRI 31-OCT-1994 DEFINITION Human complement component C5 mRNA, complete cds. ACCESSION M57729 NID g179982 KEYWORDS complement component C5. SOURCE Human hepatocyte, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5444) AUTHORS Haviland,D.L., Haviland,J.C., Fleischer,D.T., Hunt,A. and Wetsel,R.A. TITLE Complete cDNA sequence of human complement pro-C5. Evidence of truncated transcripts derived from a single copy gene JOURNAL J. Immunol. 146 (1), 362-368 (1991) MEDLINE 91079575 FEATURES Location/Qualifiers source 1..5444 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="hepatocyte" /tissue_type="liver" /map="9q33" gene 13..5428 /gene="C5" sig_peptide 13..66 /gene="C5" /note="G00-119-734" CDS 13..5043 /gene="C5" /codon_start=1 /db_xref="GDB:G00-119-734" /product="complement component C5" /db_xref="PID:g179983" /translation="MGLLGILCFLIFLGKTWGQEQTYVISAPKIFRVGASENIVIQVY GYTEAFDATISIKSYPDKKFSYSSGHVHLSSENKFQNSAILTIQPKQLPGGQNPVSYV YLEVVSKHFSKSKRMPITYDNGFLFIHTDKPVYTPDQSVKVRVYSLNDDLKPAKRETV LTFIDPEGSEVDMVEEIDHIGIISFPDFKIPSNPRYGMWTIKAKYKEDFSTTGTAYFE VKEYVLPHFSVSIEPEYNFIGYKNFKNFEITIKARYFYNKVVTEADVYITFGIREDLK DDQKEMMQTAMQNTMLINGIAQVTFDSETAVKELSYYSLEDLNNKYLYIAVTVIESTG GFSEEAEIPGIKYVLSPYKLNLVATPLFLKPGIPYPIKVQVKDSLDQLVGGVPVILNA QTIDVNQETSDLDPSKSVTRVDDGVASFVLNLPSGVTVLEFNVKTDAPDLPEENQARE GYRAIAYSSLSQSYLYIDWTDNHKALLVGEHLNIIVTPKSPYIDKITHYNYLILSKGK IIHFGTREKFSDASYQSINIPVTQNMVPSSRLLVYYIVTGEQTAELVSDSVWLNIEEK CGNQLQVHLSPDADAYSPGQTVSLNMATGMDSWVALAAVDSAVYGVQRGAKKPLERVF QFLEKSDLGCGAGGGLNNANVFHLAGLTFLTNANADDSQENDEPCKEILRPRRTLQKK IEEIAAKYKHSVVKKCCYDGACVNNDETCEQRAARISLGPRCIKAFTECCVVASQLRA NISHKDMQLGRLHMKTLLPVSKPEIRSYFPESWLWEVHLVPRRKQLQFALPDSLTTWE IQGIGISNTGICVADTVKAKVFKDVFLEMNIPYSVVRGEQIQLKGTVYNYRTSGMQFC VKMSAVEGICTSESPVIDHQGTKSSKCVRQKVEGSSSHLVTFTVLPLEIGLHNINFSL ETWFGKEILVKTLRVVPEGVKRESYSGVTLDPRGIYGTISRRKEFPYRIPLDLVPKTE IKRILSVKGLLVGEILSAVLSQEGINILTHLPKGSAEAELMSVVPVFYVFHYLETGNH WNIFHSDPLIEKQKLKKKLKEGMLSIMSYRNADYSYSVWKGGSASTWLTAFALRVLGQ VNKYVEQNQNSICNSLLWLVENYQLDNGSFKENSQYQPIKLQGTLPVEARENSLYLTA FTVIGIRKAFDICPLVKIDTALIKADNFLLENTLPAQSTFTLAISAYALSLGDKTHPQ FRSIVSALKREALVKGNPPIYRFWKDNLQHKDSSVPNTGTARMVETTAYALLTSLNLK DINYVNPVIKWLSEEQRYGGGFYSTQDTINAIEGLTEYSLLVKQLRLSMDIDVSYKHK GALHNYKMTDKNFLGRPVEVLLNDDLIVSTGFGSGLATVHVTTVVHKTSTSEEVCSFY LKIDTQDIEASHYRGYGNSDYKRIVACASYKPSREESSSGSSHAVMDISLPTGISANE EDLKALVEGVDQLFTDYQIKDGHVILQLNSIPSSDFLCVRFRIFELFEVGFLSPATFT VYEYHRPDKQCTMFYSTSNIKIQKVCEGAACKCVEADCGQMQEELDLTISAETRKQTA CKPEIAYAYKVSITSITVENVFVKYKATLLDIYKTGEAVAEKDSEITFIKKVTCTNAE LVKGRQYLIMGKEALQIKYNFSFRYIYPLDSLTWIEYWPRDTTCSSCQAFLANLDEFA EDIFLNGC" mat_peptide 67..2031 /gene="C5" /note="beta-chain; G00-119-734" /product="complement component C5" mat_peptide 2044..5040 /gene="C5" /note="alpha-chain; G00-119-734" /product="complement component C5" TATA_signal 5423..5428 /gene="C5" /note="G00-119-734" BASE COUNT 1735 a 1053 c 1119 g 1537 t ORIGIN 1 ctacctccaa ccatgggcct tttgggaata ctttgttttt taatcttcct ggggaaaacc 61 tggggacagg agcaaacata tgtcatttca gcaccaaaaa tattccgtgt tggagcatct 121 gaaaatattg tgattcaagt ttatggatac actgaagcat ttgatgcaac aatctctatt 181 aaaagttatc ctgataaaaa atttagttac tcctcaggcc atgttcattt atcctcagag 241 aataaattcc aaaactctgc aatcttaaca atacaaccaa aacaattgcc tggaggacaa 301 aacccagttt cttatgtgta tttggaagtt gtatcaaagc atttttcaaa atcaaaaaga 361 atgccaataa cctatgacaa tggatttctc ttcattcata cagacaaacc tgtttatact 421 ccagaccagt cagtaaaagt tagagtttat tcgttgaatg acgacttgaa gccagccaaa 481 agagaaactg tcttaacctt catagatcct gaaggatcag aagttgacat ggtagaagaa 541 attgatcata ttggaattat ctcttttcct gacttcaaga ttccgtctaa tcctagatat 601 ggtatgtgga cgatcaaggc taaatataaa gaggactttt caacaactgg aaccgcatat 661 tttgaagtta aagaatatgt cttgccacat ttttctgtct caatcgagcc agaatataat 721 ttcattggtt acaagaactt taagaatttt gaaattacta taaaagcaag atatttttat 781 aataaagtag tcactgaggc tgacgtttat atcacatttg gaataagaga agacttaaaa 841 gatgatcaaa aagaaatgat gcaaacagca atgcaaaaca caatgttgat aaatggaatt 901 gctcaagtca catttgattc tgaaacagca gtcaaagaac tgtcatacta cagtttagaa 961 gatttaaaca acaagtacct ttatattgct gtaacagtca tagagtctac aggtggattt 1021 tctgaagagg cagaaatacc tggcatcaaa tatgtcctct ctccctacaa actgaatttg 1081 gttgctactc ctcttttcct gaagcctggg attccatatc ccatcaaggt gcaggttaaa 1141 gattcgcttg accagttggt aggaggagtc ccagtaatac tgaatgcaca aacaattgat 1201 gtaaaccaag agacatctga cttggatcca agcaaaagtg taacacgtgt tgatgatgga 1261 gtagcttcct ttgtgcttaa tctcccatct ggagtgacgg tgctggagtt taatgtcaaa 1321 actgatgctc cagatcttcc agaagaaaat caggccaggg aaggttaccg agcaatagca 1381 tactcatctc tcagccaaag ttacctttat attgattgga ctgataacca taaggctttg 1441 ctagtgggag aacatctgaa tattattgtt acccccaaaa gcccatatat tgacaaaata 1501 actcactata attacttgat tttatccaag ggcaaaatta tccattttgg cacgagggag 1561 aaattttcag atgcatctta tcaaagtata aacattccag taacacagaa catggttcct 1621 tcatcccgac ttctggtcta ttatatcgtc acaggagaac agacagcaga attagtgtct 1681 gattcagtct ggttaaatat tgaagaaaaa tgtggcaacc agctccaggt tcatctgtct 1741 cctgatgcag atgcatattc tccaggccaa actgtgtctc ttaatatggc aactggaatg 1801 gattcctggg tggcattagc agcagtggac agtgctgtgt atggagtcca aagaggagcc 1861 aaaaagccct tggaaagagt atttcaattc ttagagaaga gtgatctggg ctgtggggca 1921 ggtggtggcc tcaacaatgc caatgtgttc cacctagctg gacttacctt cctcactaat 1981 gcaaatgcag atgactccca agaaaatgat gaaccttgta aagaaattct caggccaaga 2041 agaacgctgc aaaagaagat agaagaaata gctgctaaat ataaacattc agtagtgaag 2101 aaatgttgtt acgatggagc ctgcgttaat aatgatgaaa cctgtgagca gcgagctgca 2161 cggattagtt tagggccaag atgcatcaaa gctttcactg aatgttgtgt cgtcgcaagc 2221 cagctccgtg ctaatatctc tcataaagac atgcaattgg gaaggctaca catgaagacc 2281 ctgttaccag taagcaagcc agaaattcgg agttattttc cagaaagctg gttgtgggaa 2341 gttcatcttg ttcccagaag aaaacagttg cagtttgccc tacctgattc tctaaccacc 2401 tgggaaattc aaggcattgg catttcaaac actggtatat gtgttgctga tactgtcaag 2461 gcaaaggtgt tcaaagatgt cttcctggaa atgaatatac catattctgt tgtacgagga 2521 gaacagatcc aattgaaagg aactgtttac aactatagga cttctgggat gcagttctgt 2581 gttaaaatgt ctgctgtgga gggaatctgc acttcggaaa gcccagtcat tgatcatcag 2641 ggcacaaagt cctccaaatg tgtgcgccag aaagtagagg gctcctccag tcacttggtg 2701 acattcactg tgcttcctct ggaaattggc cttcacaaca tcaatttttc actggagact 2761 tggtttggaa aagaaatctt agtaaaaaca ttacgagtgg tgccagaagg tgtcaaaagg 2821 gaaagctatt ctggtgttac tttggatcct aggggtattt atggtaccat tagcagacga 2881 aaggagttcc catacaggat acccttagat ttggtcccca aaacagaaat caaaaggatt 2941 ttgagtgtaa aaggactgct tgtaggtgag atcttgtctg cagttctaag tcaggaaggc 3001 atcaatatcc taacccacct ccccaaaggg agtgcagagg cggagctgat gagcgttgtc 3061 ccagtattct atgtttttca ctacctggaa acaggaaatc attggaacat ttttcattct 3121 gacccattaa ttgaaaagca gaaactgaag aaaaaattaa aagaagggat gttgagcatt 3181 atgtcctaca gaaatgctga ctactcttac agtgtgtgga agggtggaag tgctagcact 3241 tggttaacag cttttgcttt aagagtactt ggacaagtaa ataaatacgt agagcagaac 3301 caaaattcaa tttgtaattc tttattgtgg ctagttgaga attatcaatt agataatgga 3361 tctttcaagg aaaattcaca gtatcaacca ataaaattac agggtacctt gcctgttgaa 3421 gcccgagaga acagcttata tcttacagcc tttactgtga ttggaattag aaaggctttc 3481 gatatatgcc ccctggtgaa aatcgacaca gctctaatta aagctgacaa ctttctgctt 3541 gaaaatacac tgccagccca gagcaccttt acattggcca tttctgcgta tgctctttcc 3601 ctgggagata aaactcaccc acagtttcgt tcaattgttt cagctttgaa gagagaagct 3661 ttggttaaag gtaatccacc catttatcgt ttttggaaag acaatcttca gcataaagac 3721 agctctgtac ctaacactgg tacggcacgt atggtagaaa caactgccta tgctttactc 3781 accagtctga acttgaaaga tataaattat gttaacccag tcatcaaatg gctatcagaa 3841 gagcagaggt atggaggtgg cttttattca acccaggaca ccatcaatgc cattgagggc 3901 ctgacggaat attcactcct ggttaaacaa ctccgcttga gtatggacat cgatgtttct 3961 tacaagcata aaggtgcctt acataattat aaaatgacag acaagaattt ccttgggagg 4021 ccagtagagg tgcttctcaa tgatgacctc attgtcagta caggatttgg cagtggcttg 4081 gctacagtac atgtaacaac tgtagttcac aaaaccagta cctctgagga agtttgcagc 4141 ttttatttga aaatcgatac tcaggatatt gaagcatccc actacagagg ctacggaaac 4201 tctgattaca aacgcatagt agcatgtgcc agctacaagc ccagcaggga agaatcatca 4261 tctggatcct ctcatgcggt gatggacatc tccttgccta ctggaatcag tgcaaatgaa 4321 gaagacttaa aagcccttgt ggaaggggtg gatcaactat tcactgatta ccaaatcaaa 4381 gatggacatg ttattctgca actgaattcg attccctcca gtgatttcct ttgtgtacga 4441 ttccggatat ttgaactctt tgaagttggg tttctcagtc ctgccacttt cacagtttac 4501 gaataccaca gaccagataa acagtgtacc atgttttata gcacttccaa tatcaaaatt 4561 cagaaagtct gtgaaggagc cgcgtgcaag tgtgtagaag ctgattgtgg gcaaatgcag 4621 gaagaattgg atctgacaat ctctgcagag acaagaaaac aaacagcatg taaaccagag 4681 attgcatatg cttataaagt tagcatcaca tccatcactg tagaaaatgt ttttgtcaag 4741 tacaaggcaa cccttctgga tatctacaaa actggggaag ctgttgctga gaaagactct 4801 gagattacct tcattaaaaa ggtaacctgt actaacgctg agctggtaaa aggaagacag 4861 tacttaatta tgggtaaaga agccctccag ataaaataca atttcagttt caggtacatc 4921 taccctttag attccttgac ctggattgaa tactggccta gagacacaac atgttcatcg 4981 tgtcaagcat ttttagctaa tttagatgaa tttgccgaag atatcttttt aaatggatgc 5041 taaaattcct gaagttcagc tgcatacagt ttgcacttat ggactcctgt tgttgaagtt 5101 cgtttttttg ttttcttctt tttttaaaca ttcatagctg gtcttatttg taaagctcac 5161 tttacttaga attagtggca cttgctttta ttagagaatg atttcaaatg ctgtaacttt 5221 ctgaaataac atggccttgg agggcatgaa gacagatact cctccaaggt tattggacac 5281 cggaaacaat aaattggaac acctcctcaa acctaccact caggaatgtt tgctggggcc 5341 gaaagaacag tccattgaaa gggagtatta caaaaacatg gcctttgctt gaaagaaaat 5401 accaaggaac aggaaactga tcattaaagc ctgagtttgc tttc // LOCUS HUMCCCKR1A 1495 bp mRNA PRI 31-DEC-1994 DEFINITION Human C-C chemokine receptor type 1 (C-C CKR-1) mRNA, complete cds. ACCESSION L09230 NID g179984 KEYWORDS C-C chemokine receptor type 1. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1495) AUTHORS Neote,K., DiGregorio,D., Mak,J.Y., Horuk,R. and Schall,T.J. TITLE Molecular cloning, functional expression, and signaling characteristics of a C-C chemokine receptor JOURNAL Cell 72 (3), 415-425 (1993) MEDLINE 93161416 FEATURES Location/Qualifiers source 1..1495 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="differentiated HL60" gene 1..1068 /gene="CMKR-1" CDS 1..1068 /gene="CMKR-1" /codon_start=1 /product="C-C chemokine receptor type 1" /db_xref="PID:g179985" /translation="METPNTTEDYDTTTEFDYGDATPCQKVNERAFGAQLLPPLYSLV FVIGLVGNILVVLVLVQYKRLKNMTSIYLLNLAISDLLFLFTLPFWIDYKLKDDWVFG DAMCKILSGFYYTGLYSEIFFIILLTIDRYLAIVHAVFALRARTVTFGVITSIIIWAL AILASMPGLYFSKTQWEFTHHTCSLHFPHESLREWKLFQALKLNLFGLVLPLLVMIIC YTGIIKILLRRPNEKKSKAVRLIFVIMIIFFLFWTPYNLTILISVFQDFLFTHECEQS RHLDLAVQVTEVIAYTHCCVNPVIYAFVGERFRKYLRQLFHRRVAVHLVKWLPFLSVD RLERVSSTSPSTGEHELSAGF" BASE COUNT 348 a 389 c 361 g 397 t ORIGIN 1 atggaaactc caaacaccac agaggactat gacacgacca cagagtttga ctatggggat 61 gcaactccgt gccagaaggt gaacgagagg gcctttgggg cccaactgct gccccctctg 121 tactccttgg tatttgtcat tggcctggtt ggaaacatcc tggtggtcct ggtccttgtg 181 caatacaaga ggctaaaaaa catgaccagc atctacctcc tgaacctggc catttctgac 241 ctgctcttcc tgttcacgct tcccttctgg atcgactaca agttgaagga tgactgggtt 301 tttggtgatg ccatgtgtaa gatcctctct gggttttatt acacaggctt gtacagcgag 361 atctttttca tcatcctgct gacgattgac aggtacctgg ccatcgtcca cgccgtgttt 421 gccttgcggg cacggaccgt cacttttggt gtcatcacca gcatcatcat ttgggccctg 481 gccatcttgg cttccatgcc aggcttatac ttttccaaga cccaatggga attcactcac 541 cacacctgca gccttcactt tcctcacgaa agcctacgag agtggaagct gtttcaggct 601 ctgaaactga acctctttgg gctggtattg cctttgttgg tcatgatcat ctgctacaca 661 gggattataa agattctgct aagacgacca aatgagaaga aatccaaagc tgtccgtttg 721 atttttgtca tcatgatcat cttttttctc ttttggaccc cctacaattt gactatactt 781 atttctgttt tccaagactt cctgttcacc catgagtgtg agcagagcag acatttggac 841 ctggctgtgc aagtgacgga ggtgatcgcc tacacgcact gctgtgtcaa cccagtgatc 901 tacgccttcg ttggtgagag gttccggaag tacctgcggc agttgttcca caggcgtgtg 961 gctgtgcacc tggttaaatg gctccccttc ctctccgtgg acaggctgga gagggtcagc 1021 tccacatctc cctccacagg ggagcatgaa ctctctgctg ggttctgact cagaccatag 1081 gaggccaacc caaaataagc aggcgtgacc tgccaggcac actgaccagc agcctggctc 1141 tcccagccag gttctgactc ttggcacagc atggagtccg cctcttggat agagaggaat 1201 gtaatggtgg cctggggctt ctgaggcttc tgggcttgag tcttttccat gaacttctcc 1261 cctggtagaa aagaagatga atgagcaaaa ccaaatattc cagagactgg gactaagtgt 1321 accagagaag ggcttggact caagcaagat ttcagatttg tgaccattag catttgtcaa 1381 caaagtcacc cacttcccac tattgcttgc acaaaccaat taaacccagt agtggtgact 1441 gtgggctcca ttcaaagtga gctcctaagc catgggagac actgatgtat gagga // LOCUS HUMCCKAR 1393 bp mRNA PRI 15-JUL-1993 DEFINITION Human cholecystokinin A receptor mRNA, complete cds. ACCESSION L13605 NID g306490 KEYWORDS cholecystokinin A receptor. SOURCE Homo sapiens (library: pcDNA1) gallbladder cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1393) AUTHORS Ulrich,C.D., Ferber,I., Holicky,E., Hadac,E., Buell,G. and Miller,L.J. TITLE Molecular cloning and functional expression of the human gallbladder cholecystokinin A receptor JOURNAL Biochem. Biophys. Res. Commun. 193, 204-211 (1993) MEDLINE 93277552 FEATURES Location/Qualifiers source 1..1393 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="muscularis" /tissue_type="gallbladder" /tissue_lib="pcDNA1" CDS 72..1358 /standard_name="CCK A receptor" /note="putative" /codon_start=1 /product="cholecystokinin A receptor" /db_xref="PID:g306491" /translation="MDVVDSLLVNGSNITPPCELGLENETLFCLDQPRPSKEWQPAVQ ILLYSLIFLLSVLGNTLVITVLIRNKRMRTVTNIFLLSLAVSDLMLCLFCMPFNLIPN LLKDFIFGSAVCKTTTYFMGTSVSVSTFNLVAISLERYGAICKPLQSRVWQTKSHALK VIAATWCLSFTIMTPYPIYSNLVPFTKNNNQTANMCRFLLPNDVMQQSWHTFLLLILF LIPGIVMMVAYGLISLELYQGIKFEASQKKSAKERKPSTTSSGKYEDSDGCYLQKTRP PRKLELRQLSTGSSSRANRIRSNSSAANLMAKKRVIRMLIVIVVLFFLCWMPIFSANA WRAYDTASAERRLSGTPISFILLLSYTSSCVNPIIYCFMNKRFRLGFMATFPCCPNPG PPGARGEVGEEEEGGTTGASLSRFSYSHMSASVPPQ" BASE COUNT 312 a 422 c 336 g 323 t ORIGIN 1 cattagagga atgagccggg agtgagcaat tcaccagctc tccagcactt ggtggaaagc 61 agcaggcaag gatggatgtg gttgacagcc ttcttgtgaa tggaagcaac atcactcctc 121 cctgtgaact cgggctcgaa aatgagacgc ttttctgctt ggatcagccc cgtccttcca 181 aagagtggca gccagcggtg cagattctct tgtactcctt gatattcctg ctcagcgtgc 241 tgggaaacac gctggtcatc accgtgctga ttcggaacaa gcggatgcgg acggtcacca 301 acatcttcct cctctccctg gctgtcagcg acctcatgct ctgtctcttc tgcatgccgt 361 tcaacctcat ccccaatctg ctcaaggatt tcatcttcgg gagcgccgtt tgcaagacca 421 ccacctactt catgggcacc tctgtgagtg tatctacctt taatctggta gccatatctc 481 tagagagata tggtgcgatt tgcaaaccct tacagtcccg ggtctggcag acaaaatccc 541 atgctttgaa ggtgattgct gctacctggt gcctttcctt taccatcatg actccgtacc 601 ccatttatag caacttggtg ccttttacca aaaataacaa ccagaccgcg aatatgtgcc 661 gctttctact gccaaatgat gttatgcagc agtcctggca cacattcctg ttactcatcc 721 tctttcttat tcctggaatt gtgatgatgg tggcatatgg attaatctct ttggaactct 781 accagggaat aaaatttgag gctagccaga agaagtctgc taaagaaagg aaacctagca 841 ccaccagcag cggcaaatat gaggacagcg atgggtgtta cctgcaaaag accaggcccc 901 cgaggaagct ggagctccgg cagctgtcca ccggcagcag cagcagggcc aaccgcatcc 961 ggagtaacag ctccgcagcc aacctgatgg ccaagaaaag ggtgatccgc atgctcatcg 1021 tcatcgtggt cctcttcttc ttgtgctgga tgcccatctt cagcgccaac gcctggcggg 1081 cctacgacac cgcctccgca gagcgccgcc tctcaggaac ccccatttcc ttcatcctcc 1141 tcctgtccta cacctcctcc tgcgtcaacc ccatcatcta ctgcttcatg aacaaacgct 1201 tccgcctcgg cttcatggcc accttcccct gctgccccaa tcctggtccc ccaggggcga 1261 ggggagaggt gggggaggag gaggaaggcg ggaccacagg agcctctctg tccaggttct 1321 cgtacagcca tatgagtgcc tcggtgccac cccagtgaga tgtcccctga ccctccaccg 1381 cagaaggaag gca // LOCUS HUMCCND3A 1962 bp mRNA PRI 31-OCT-1994 DEFINITION Human D3-type cyclin (CCND3) mRNA, complete cds. ACCESSION M90814 NID g180002 KEYWORDS D3-type cyclin. SOURCE Homo sapiens (tissue library: lambda D3-H34) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1962) AUTHORS Xiong,Y., Menninger,J., Beach,D. and Ward,D.C. TITLE Molecular cloning and chromosomal mapping of CCND genes encoding human D-type cyclins JOURNAL Genomics 13 (3), 575-584 (1992) MEDLINE 92347851 FEATURES Location/Qualifiers source 1..1962 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /tissue_lib="lambda D3-H34" /map="12p13" gene 101..979 /gene="CCND3" CDS 101..979 /gene="CCND3" /codon_start=1 /db_xref="GDB:G00-128-969" /product="D3-type cyclin" /db_xref="PID:g180003" /translation="MELLCCEGTRHAPRAGPDPRLLGDQRVLQSLLRLEERYVPRASY FQCVQREIKPHMRKMLAYWMLEVCEEQRCEEEVFPLAMNYLDRYLSCVPTRKAQLQLL GAVCMLLASKLRETTPLTIEKLCIYTDHAVSPRQLRDWEVLVLGKLKWDLAAVIAHDF LAFILHRLSLPRDRQALVKKHAQTFLALCATDYTFAMYPPSMIATGSIGAAVQGLGAC SMSGDELTELLAGITGTEVDCLRACQEQIEAALRESLREAAQTSSSPAPKAPRGSSSQ GPSQTSTPTDVTAIHL" BASE COUNT 369 a 613 c 550 g 430 t ORIGIN 1 gaattccgat ccccagcccg cccgcccgcg ctctccggcc cgtcgcctgc cttgggactc 61 gcgagcccgc actcccgccc tgcctgttcg ctgcccgagt atggagctgc tgtgttgcga 121 aggcacccgg cacgcgcccc gggccgggcc ggacccgcgg ctgctggggg accagcgtgt 181 cctgcagagc ctgctccgcc tggaggagcg ctacgtaccc cgcgcctcct acttccagtg 241 cgtgcagcgg gagatcaagc cgcacatgcg gaagatgctg gcttactgga tgctggaggt 301 atgtgaggag cagcgctgtg aggaggaagt cttccccctg gccatgaact acctggatcg 361 ctacctgtct tgcgtcccca cccgaaaggc gcagttgcag ctcctgggtg cggtctgcat 421 gctgctggcc tccaagctgc gcgagaccac gcccctgacc atcgaaaaac tgtgcatcta 481 caccgaccac gctgtctctc cccgccagtt gcgggactgg gaggtgctgg tcctagggaa 541 gctcaagtgg gacctggctg ctgtgattgc acatgatttc ctggccttca ttctgcaccg 601 gctctctctg ccccgtgacc gacaggcctt ggtcaaaaag catgcccaga cctttttggc 661 cctctgtgct acagattata cctttgccat gtacccgcca tccatgatcg ccacgggcag 721 cattggggct gcagtgcaag gcctgggtgc ctgctccatg tccggggatg agctcacaga 781 gctgctggca gggatcactg gcactgaagt ggactgcctg cgggcctgtc aggagcagat 841 cgaagctgca ctcagggaga gcctcaggga agccgctcag accagctcca gcccagcgcc 901 caaagccccc cggggctcca gcagccaagg gcccagccag accagcactc ctacagatgt 961 cacagccata cacctgtagc cctggagagg ccctctggag tggccactaa gcagaggagg 1021 ggccgctgca cccacctccc tgcctccagg aaccacacca catctaagcc tgaaggggcg 1081 tctgttcccc cttcacaaag cccaagggat ctggtcctac ccatccccgc agtgtgcact 1141 aaggggcccg gccagccatg tctgcatttc ggtggctagt caagctcctc ctccctgcat 1201 ctgaccagca gcgcctttcc caactctagc tgggggtggg ccaggctgat gggacagaat 1261 tggatacata caccagcatt ccttttgaac gccccccccc acccctgggg gctctcatgt 1321 tttcaactgc caaaatgctc tagtgccttc taaaggtgtt gtcccttcta gggttattgc 1381 atttggattg gggtccctct aaaatttaat gcatgataga cacatatgag ggggaatagt 1441 ctagatggct cctctcagta ctttggaggc ccctatgtag tccgtgctga cagctgctcc 1501 tagagggagg ggcctaggct cagccagaga agctataaat tcctctttgc tttgctttct 1561 gctcagcttc tcctgtgtga ttgacagctt tgctgctgaa ggctcatttt aatttattaa 1621 ttgctttgag cacaacttta agaggacgta atggggtcct ggccatccca caagtggtgg 1681 taaccctggt ggttgctgtt ttcctccctt ctgctactgg caaaaggatc tttgtggcca 1741 aggagctgct atagcctggg gtggggtcat gccctcctct cccattgtcc ctctgcccca 1801 tcctccagca gggaaaatgc agcagggatg ccctggaggt gctgagcccc tgtctagaga 1861 gggaggcaag cctgttgaca caggtctttc ctaaggctgc aaggtttagg ctggtggccc 1921 aggaccatca tcctactgta ataaagatga ttgtgggaat tc // LOCUS HUMCCPGS 726 bp mRNA PRI 09-JUN-1997 DEFINITION Human mRNA for cone-specific cGMP phosphodiesterase gamma subunit, complete cds. ACCESSION D45399 NID g1311543 KEYWORDS cone-specific cGMP phosphodiesterase gamma subunut; cGMP PDE gamma subunit. SOURCE Homo sapiens adult neural retina cDNA to mRNA, clone:Aki967K. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 726) AUTHORS Shimizu,A. TITLE Direct Submission JOURNAL Submitted (30-JAN-1995) to the DDBJ/EMBL/GenBank databases. Akiyo Shimizu, Institute for Molecular and Cellular Biology, Osaka Univ.; Yamada-oka 1-3, Suita, Osaka 565, Japan (E-mail:kousaku@imcb.osaka-u.ac.jp., Tel:06-877-5111(ex.3910), Fax:06-877-1922) REFERENCE 2 (bases 1 to 726) AUTHORS Shimizu-Matsumoto,A., Itoh,K., Inazawa,J., Nishida,K., Matsumoto,Y., Kinoshita,S., Matsubara,K. and Okubo,K. TITLE Isolation and chromosomal localization of the human cone cGMP phosphodiesterase gamma cDNA (PDE6H) JOURNAL Genomics 32 (1), 121-124 (1996) MEDLINE 96230332 FEATURES Location/Qualifiers source 1..726 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="12" /clone="Aki967K" /dev_stage="adult" /map="12p13" /tissue_type="neural retina" mRNA 1..726 /evidence=experimental CDS 85..336 /note="cone-specific cGMP phosphodiesterase gamma subunit" /codon_start=1 /product="cGMP phosphodiesterase gamma subunit" /db_xref="PID:d1008836" /db_xref="PID:g1311544" /translation="MSDNTTLPAPASNQGPTTPRKGPPKFKQRQTRQFKSKPPKKGVK GFGDDIPGMEGLGTDITVICPWEAFSHLELHELAQFGII" polyA_signal 702..707 BASE COUNT 203 a 163 c 153 g 207 t ORIGIN 1 aaagaaggca gctgggcagc tgataggaac aacattcagc tccgaggctg aaagggaaac 61 atcagccgcc cggggggagt taaaatgagt gacaacacta ctctgcctgc tccagcttca 121 aaccagggtc ctaccacccc acgcaaaggc cctcccaagt tcaagcagag gcagactcgc 181 caattcaaga gtaaacctcc aaagaaaggt gtgaaaggat ttggagatga cattccagga 241 atggaggggc taggaacaga tatcacagtg atttgtccat gggaggcatt cagccacctg 301 gaattgcatg agctcgctca gtttgggatt atctgaagtg ccagaggttc tgccactctc 361 aatgacatct gctgtaattt tggttgcttt tgccctgttg atctgccgga gtcttgaaat 421 tcagctgatt ggaagtggtt tctttgactt tcaaggtcat tccctatcag ttactactaa 481 gagttctctt actgtcagaa tctctcttgc agtacagctc aaaatagtgg atgcatttca 541 aacttgacca ccttttccta ccagctagtt agaagtcatc aatatttctc tacattttgt 601 ttatctgtaa gtcctttaaa tctatttttg ctaggcattc attatgatta atagtagact 661 tttaagacaa catttatgtc actccccacc tcctcatatt taataaagga gatatttacc 721 ttgaaa // LOCUS HUMCD1A 2072 bp mRNA PRI 01-NOV-1994 DEFINITION Human thymocyte antigen CD1a mRNA, complete cds. ACCESSION M28825 NID g180035 KEYWORDS cell surface antigen. SOURCE Human, cDNA to mRNA (HPB-ALL library). ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2072) AUTHORS Aruffo,A. and Seed,B. TITLE Expression of cDNA clones encoding the thymocyte antigens CD1a, b, c demonstrates a hierarchy of exclusion in fibroblasts JOURNAL J. Immunol. 143 (5), 1723-1730 (1989) MEDLINE 89341413 FEATURES Location/Qualifiers source 1..2072 /organism="Homo sapiens" /db_xref="taxon:9606" /map="1q22-q23" sig_peptide 534..581 /gene="CD1A" /note="CD1a signal peptide; G00-120-575" CDS 534..1517 /gene="CD1A" /note="CD1a antigen precursor" /codon_start=1 /db_xref="GDB:G00-120-575" /db_xref="PID:g180036" /translation="MLFLLLPLLAVLPGDGNADGLKEPLSFHVIWIASFYNHSWKQNL VSGWLSDLQTHTWDSNSSTIVFLWPWSRGNFSNEEWKELETLFRIRTIRSFEGIRRYA HELQFEYPFEIQVTGGCELHSGKVSGSFLQLAYQGSDFVSFQNNSWLPYPVAGNMAKH FCKVLNQNQHENDITHNLLSDTCPRFILGLLDAGKAHLQRQVKPEAWLSHGPSPGPGH LQLVCHVSGFYPKPVWVMWMRGEQEQQGTQRGDILPSADGTWYLRATLEVAAGEAADL SCRVKHSSLEGQDIVLYWEHHSSVGFIILAVIVPLLLLIGLALWFRKRCFC" gene 534..1517 /gene="CD1A" mat_peptide 582..1514 /gene="CD1A" /note="CD1a antigen; G00-120-575" BASE COUNT 530 a 444 c 522 g 576 t ORIGIN 1 gggcagtcgt aggagactct gaaaaagcaa ataaatcaat gttaaatcag aaatgtgaat 61 gtagtaaggg gctgaagaga caggggaaga gaatacatgg gaaaatattg aaaaggacag 121 agtgatcaaa aaaagcaggg acatgggagc attgggcagc acactgggag ccatttactt 181 tatgctctta ttgtatgatt gagaaaaaaa atgtccttag tggttaagtg gcttttcaat 241 gccacatcag acttgttcca tagcagttga attaggggaa ggtgaataag ttggaggttg 301 gtgacaagga gagaagctgg aacagagagg agagtcagaa ccagagggaa atgagagact 361 gagtaggcat ctcagggttt ttgaaggagt ggattttctt tgttgcagtc aggggaggtt 421 tgtctgttgg ctgcagaaag aagtcagaat agagatatcg tggggtaggt ttgtttggaa 481 cagaaatcaa agaccaattt ttctgagaga aggaaataac atctgcaaat gatatgctgt 541 ttttgctact tccattgtta gctgttctcc caggtgatgg caatgcagac gggctcaagg 601 agcctctctc cttccatgtc atctggatcg catcctttta caaccattcc tggaaacaaa 661 atctggtctc aggttggctg agtgatttgc agactcatac ctgggacagc aattccagca 721 ccatcgtttt cctgtggccc tggtccaggg gaaacttcag caatgaggag tggaaggaac 781 tggaaacatt attccgtata cgcaccattc ggtcatttga gggaattcgt agatacgccc 841 atgaattgca gtttgaatat ccttttgaga tacaggtgac aggaggctgt gagctgcact 901 ctggaaaggt ctcaggaagc ttcttgcagt tagcttatca aggatcagac tttgtgagct 961 tccagaacaa ttcatggttg ccatatccag tggctgggaa tatggccaag catttctgca 1021 aagtgctcaa tcagaatcag catgaaaatg acataacaca caatcttctc agtgacacct 1081 gcccacgttt catcttgggt cttcttgatg caggaaaggc acatctccag cggcaagtga 1141 agcccgaggc ctggctgtcc catggcccca gtcctggccc tggccatctg cagcttgtgt 1201 gccatgtctc aggattctac ccaaagcccg tgtgggtgat gtggatgcgg ggtgagcagg 1261 agcagcaggg cactcagcga ggggacatct tgcccagtgc tgatgggaca tggtatctcc 1321 gcgcaaccct ggaggtggcc gctggggagg cagctgacct gtcctgtcgg gtgaagcaca 1381 gcagtctaga gggccaggac atcgtcctct actgggagca tcacagttcc gtgggcttca 1441 tcatcttggc ggtgatagtg cctttacttc ttctgatagg tcttgcgctt tggttcagga 1501 aacgctgttt ctgttaagac acaccatgag cctcctcgtc acccttctcc ttttggggtg 1561 agagaccagc agcccaaggg ctccagacac acctgaacac atcgtgatga tgacgtcctc 1621 tcaactctct ttgtaaaaat tttgttattt ttgcttgttt ctgattaatg attgtttgtc 1681 aatataagct caatttaatt ttgcaggatt tgttgttctg acctgggttc tgggactttt 1741 aaattcaaat tttatctcca gatggaatgg ggtcctagca acctccacat gttcacctat 1801 taatggatca tcaggcctgt tttagatatc ccttactcca gagggccttc cctgacttac 1861 aagtgggaag cagtctcttc ctggtctgaa ctcccgccac attttagccg tactttgcta 1921 actgtgctcc tcacttcctc ttcttcattg cagttattta gatcccccct ttccttctaa 1981 tttttcagct ccttcaatgc aaagtacatg tatttttaat atatgcatcc ctggtgaagg 2041 atcttgcctg catgaaacat gttctcaata aa // LOCUS HUMCD1B 1295 bp mRNA PRI 01-NOV-1994 DEFINITION Human thymocyte antigen CD1b mRNA, complete cds. ACCESSION M28826 NID g180055 KEYWORDS cell surface antigen. SOURCE Human, cDNA to mRNA (HPB-ALL library). ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1295) AUTHORS Aruffo,A. and Seed,B. TITLE Expression of cDNA clones encoding the thymocyte antigens CD1a, b, c demonstrates a hierarchy of exclusion in fibroblasts JOURNAL J. Immunol. 143 (5), 1723-1730 (1989) MEDLINE 89341413 FEATURES Location/Qualifiers source 1..1295 /organism="Homo sapiens" /db_xref="taxon:9606" /map="1q22-q23" sig_peptide 46..96 /gene="CD1B" /note="CD1b antigen signal peptide; G00-120-576" CDS 46..1047 /gene="CD1B" /note="CD1b antigen precursor" /codon_start=1 /db_xref="GDB:G00-120-576" /db_xref="PID:g180056" /translation="MLLLPFQLLAVLFPGGNSEHAFQGPTSFHVIQTSSFTNSTWAQT QGSGWLDDLQIHGWDSDSGTAIFLKPWSKGNFSDKEVAELEEIFRVYIFGFAREVQDF AGDFQMKYPFEIQGIAGCELHSGGAIVSFLRGALGGLDFLSVKNASCVPSPEGGSRAQ KFCALIIQYQGIMETVRILLYETCPRYLLGVLNAGKADLQRQVKPEAWLSSGPSPGPG RLQLVCHVSGFYPKPVWVMWMRGEQEQQGTQLGDILPNANWTWYLRATLDVADGEAAG LSCRVKHSSLEGQDIILYWRNPTSIGSIVLAIIVPSLLLLLCLALWYMRRRSYQNIP" gene 46..1047 /gene="CD1B" mat_peptide 97..1044 /gene="CD1B" /note="CD1b antigen; G00-120-576" BASE COUNT 343 a 299 c 304 g 349 t ORIGIN 1 atcaaatacc agctctgcca gtaagaagtt gcatctccca gtgaaatgct gctgctgcca 61 tttcaactgt tagctgttct ctttcctggt ggtaacagtg aacatgcctt ccaggggccg 121 acctcctttc atgttatcca gacctcgtcc tttaccaata gtacctgggc acaaactcaa 181 ggctcaggct ggttggatga tttgcagatt catggctggg atagcgactc aggcactgcc 241 atattcctga agccttggtc taaaggtaac tttagtgata aggaggttgc tgagttagag 301 gagatattcc gagtctacat ctttggattc gctcgagaag tacaagactt tgccggtgat 361 ttccagatga aatacccctt tgagatccag ggcatagcag gctgtgagct acattctgga 421 ggtgccatag taagcttcct gaggggagct ctaggaggat tggatttcct gagtgtcaag 481 aatgcttcat gtgtgccttc cccagaaggt ggcagcaggg cacagaaatt ctgtgcacta 541 atcatacaat atcaaggtat catggaaact gtgagaattc tcctctatga aacctgcccc 601 cgatatctct tgggcgtcct caatgcagga aaagcagatc tgcaaagaca agtgaagcct 661 gaggcctggc tgtccagtgg ccccagtcct ggacctggcc gtctgcagct tgtgtgccat 721 gtctcaggat tctacccaaa gcccgtgtgg gtgatgtgga tgcggggtga gcaggagcag 781 cagggcactc agctagggga catcctgccc aatgctaact ggacatggta tctccgagca 841 accctggatg tggcagatgg ggaggcggct ggcctgtcct gtcgggtgaa gcacagcagt 901 ttagagggcc aggacatcat cctctactgg agaaacccca cctccattgg ctcaattgtt 961 ttggcaataa tagtgccttc cttgctcctt ttgctatgcc ttgcattatg gtatatgagg 1021 cgccggtcat atcagaatat cccatgagcc atcatcatgt ctcctctccc attcgcaata 1081 agctaccaag aagcccaaga tatcagccca aaaatcaatc ttatcatatt tcaaatgatt 1141 ttcaaatttg atgaaatcag agttttcatg tattttaaaa taaattatta tttaaacatc 1201 agcaaaaaag tacttaaaac tgtaaattta ttatgagact gtactaacag tgtgattcac 1261 cctgatttta cacacattaa aatgttagaa aaaat // LOCUS HUMCD1C 1207 bp mRNA PRI 01-NOV-1994 DEFINITION Human thymocyte antigen CD1c mRNA, complete cds. ACCESSION M28827 NID g180065 KEYWORDS cell surface antigen. SOURCE Human, cDNA to mRNA (HPB-ALL library). ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1207) AUTHORS Aruffo,A. and Seed,B. TITLE Expression of cDNA clones encoding the thymocyte antigens CD1a, b, c demonstrates a hierarchy of exclusion in fibroblasts JOURNAL J. Immunol. 143 (5), 1723-1730 (1989) MEDLINE 89341413 FEATURES Location/Qualifiers source 1..1207 /organism="Homo sapiens" /db_xref="taxon:9606" /map="1q22-q23" sig_peptide 52..102 /gene="CD1C" /note="CD1c antigen signal peptide; G00-120-577" CDS 52..1053 /gene="CD1C" /note="CD1c antigen precursor" /codon_start=1 /db_xref="GDB:G00-120-577" /db_xref="PID:g180066" /translation="MLFLQFLLLALLLPGGDNADASQEHVSFHVIQIFSFVNQSWARG QGSGWLDELQTHGWDSESGTIIFLHNWSKGNFSNEELSDLELLFRFYLFGLTREIQDH ASQDYSKYPFEVQVKAGCELHSGKSPEGFFQVAFNGLDLLSFQNTTWVPSPGCGSLAQ SVCHLLNHQYEGVTETVYNLIRSTCPRFLLGLLDAGKMYVHRQVRPEAWLSSRPSLGS GQLLLVCHASGFYPKPVWVTWMRNEQEQLGTKHGDILPNADGTWYLQVILEVASEEPA GLSCRVRHSSLGGQDIILYWGHHSSMNWIALVVIVPLVILIVLVLWFKKHCSYQDIL" gene 52..1053 /gene="CD1C" mat_peptide 103..1050 /gene="CD1C" /note="CD1c antigen; G00-120-577" BASE COUNT 314 a 283 c 287 g 323 t ORIGIN 1 acagagatca gcaaacagct tttctgagag aaagaaacat ctgcaaatga catgctgttt 61 ctgcagtttc tgctgctagc tcttcttctc ccaggtggtg acaatgcaga cgcatcccag 121 gaacacgtct ccttccatgt catccagatc ttctcatttg tcaaccaatc ctgggcacga 181 ggtcagggct caggatggct ggacgagttg cagactcatg gctgggacag tgaatcaggc 241 acaataattt tcctgcataa ctggtccaag ggcaacttca gcaatgaaga gttgtcagac 301 ctagagttgt tatttcgttt ctacctcttt ggattaactc gggagattca agaccatgca 361 agtcaagatt actcgaaata tccctttgaa gtacaggtga aagcgggctg tgagctgcat 421 tctggaaaga gcccagaagg cttctttcag gtagctttca acggattaga tttactgagt 481 ttccagaata caacatgggt gccatctcca ggctgtggaa gtttggccca aagtgtctgt 541 catctactca atcatcagta tgaaggcgtc acagaaacag tgtataatct cataagaagc 601 acttgccccc gatttctctt gggtctcctg gatgcaggga agatgtatgt acacaggcaa 661 gtgaggccag aagcctggct gtccagtcgc cccagccttg ggtctggcca gctgttgctg 721 gtttgtcatg cctccggctt ctacccaaag cctgtttggg tgacatggat gcggaatgaa 781 caggagcaac tgggcactaa acatggtgat attcttccta atgctgatgg gacatggtat 841 cttcaggtga tcctggaggt ggcatctgag gagcctgctg gcctgtcttg tcgagtgaga 901 cacagcagtc taggaggcca ggacatcatc ctctactggg gacaccactc ttccatgaat 961 tggattgcct tggtagtgat agtgcccttg gtgattctaa tagtccttgt gttatggttt 1021 aagaagcact gctcatatca ggacatcctg tgagactctt ccccctgact cccccattgt 1081 gttaagaacc cagcaaccca ggagcctagt acaatatagt gatgccatcc cgtcgactct 1141 ccatttaaat tgtttctctt tctgcataat aaacatttgt taataaaaac caaaaaaaaa 1201 aaaaaaa // LOCUS HUMCD27A 1204 bp mRNA PRI 31-DEC-1994 DEFINITION Homo sapiens T cell activation antigen (CD27) mRNA, complete cds. ACCESSION M63928 NID g180084 KEYWORDS T-cell activation antigen CD27. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Camerini,D., Walz,G., Loenen,W.A., Borst,J. and Seed,B. TITLE The T cell activation antigen CD27 is a member of the nerve growth factor/tumor necrosis factor receptor gene family JOURNAL J. Immunol. 147 (9), 3165-3169 (1991) MEDLINE 92013149 FEATURES Location/Qualifiers source 1..1204 /organism="Homo sapiens" /db_xref="taxon:9606" gene 101..883 /gene="CD27" CDS 101..883 /gene="CD27" /codon_start=1 /product="T-cell activation antigen" /db_xref="PID:g180085" /translation="MARPHPWWLCVLGTLVGLSATPAPKSCPERHYWAQGKLCCQMCE PGTFLVKDCDQHRKAAQCDPCIPGVSFSPDHHTRPHCESCRHCNSGLLVRNCTITANA ECACRNGWQCRDKECTECDPLPNPSLTARSSQALSPHPQPTHLPYVSEMLEARTAGHM QTLADFRQLPARTLSTHWPPQRSLCSSDFIRILVIFSGMFLVFTLAGALFLHQRRKYR SNKGESPVEPAEPCRYSCPREEEGSTIPIQEDYRKPEPACSP" BASE COUNT 263 a 376 c 338 g 227 t ORIGIN 1 ggggtgcaaa gaagagacag cagcgcccag cttggaggtg ctaactccag aggccagcat 61 cagcaactgg gcacagaaag gagccgcctg ggcagggacc atggcacggc cacatccctg 121 gtggctgtgc gttctgggga ccctggtggg gctctcagct actccagccc ccaagagctg 181 cccagagagg cactactggg ctcagggaaa gctgtgctgc cagatgtgtg agccaggaac 241 attcctcgtg aaggactgtg accagcatag aaaggctgct cagtgtgatc cttgcatacc 301 gggggtctcc ttctctcctg accaccacac ccggccccac tgtgagagct gtcggcactg 361 taactctggt cttctcgttc gcaactgcac catcactgcc aatgctgagt gtgcctgtcg 421 caatggctgg cagtgcaggg acaaggagtg caccgagtgt gatcctcttc caaacccttc 481 gctgaccgct cggtcgtctc aggccctgag cccacaccct cagcccaccc acttacctta 541 tgtcagtgag atgctggagg ccaggacagc tgggcacatg cagactctgg ctgacttcag 601 gcagctgcct gcccggactc tctctaccca ctggccaccc caaagatccc tgtgcagctc 661 cgattttatt cgcatccttg tgatcttctc tggaatgttc cttgttttca ccctggccgg 721 ggccctgttc ctccatcaac gaaggaaata tagatcaaac aaaggagaaa gtcctgtgga 781 gcctgcagag ccttgtcgtt acagctgccc cagggaggag gagggcagca ccatccccat 841 ccaggaggat taccgaaaac cggagcctgc ctgctccccc tgagccagca cctgcggtag 901 ctgcactaca gccctggcct ccacccccac cccgccgacc atccaaggga gagtgagacc 961 tggcagccac aactgcagtc ccatcctctt gtcagggccc tttcctgtgt acacgtgaca 1021 gagtgccttt tcgagactgg cagggacgag gacaaatatg gatgaggtgg agagtgggaa 1081 gcaggagccc agccagctgc gcctgcgctg caggagggcg ggggctctgg ttgtaaaaca 1141 cacttcctgc tgcgaaagac ccacatgcta caagacgggc aaaataaagt gacagatgac 1201 cacc // LOCUS HUMCD30 1906 bp mRNA PRI 23-AUG-1995 DEFINITION Homo sapiens CD30 ligand mRNA, complete cds. ACCESSION L09753 NID g349277 KEYWORDS CD30 ligand; transmembrane protein type II. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1906) AUTHORS Smith,C.A., Gruss,H.J., Davis,T., Anderson,D., Farrah,T., Baker,E., Sutherland,G.R., Brannan,C.I., Copeland,N.G., Jenkins,N.A., Grabstein,K.H., Gliniak,B., McAlister,I.B., Fanslow,W., Alderson,M., Falk,B., Gimpel,S., Gillis,S., Din,W.S., Goodwin, R.G. and Armitage,R.J. TITLE CD30 antigen, a marker for Hodgkin's lymphoma, is a receptor whose ligand defines an emerging family of cytokines with homology to TNF JOURNAL Cell 73 (7), 1349-1360 (1993) MEDLINE 93313964 FEATURES Location/Qualifiers source 1..1906 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="con A induced PBL" /map="9q32" 5'UTR 1..114 /gene="CD30" /note="G00-131-547" gene 1..1906 /gene="CD30" CDS 115..819 /gene="CD30" /note="homology to TNF and CD40 ligand; potential glycosylation sites (81,109,153,189,201)" /codon_start=1 /db_xref="GDB:G00-131-547" /product="CD30 ligand" /db_xref="PID:g349278" /translation="MDPGLQQALNGMAPPGDTAMHVPAGSVASHLGTTSRSYFYLTTA TLALCLVFTVATIMVLVVQRTDSIPNSPDNVPLKGGNCSEDLLCILKRAPFKKSWAYL QVAKHLNKTKLSWNKDGILHGVRYQDGNLVIQFPGLYFIICQLQFLVQCPNNSVDLKL ELLINKHIKKQALVTVCESGMQTKHVYQNLSQFLLDYLQVNTTISVNVDTFQYIDTST FPLENVLSIFLYSNSD" 3'UTR 817..1906 /gene="CD30" /note="G00-131-547" BASE COUNT 559 a 447 c 438 g 462 t ORIGIN 1 ccaagtcaca tgattcagga ttcaggggga gaatccttct tggaacagag atgggcccag 61 aactgaatca gatgaagaga gataaggtgt gatgtgggga agactatata aagaatggac 121 ccagggctgc agcaagcact caacggaatg gcccctcctg gagacacagc catgcatgtg 181 ccggcgggct ccgtggccag ccacctgggg accacgagcc gcagctattt ctatttgacc 241 acagccactc tggctctgtg ccttgtcttc acggtggcca ctattatggt gttggtcgtt 301 cagaggacgg actccattcc caactcacct gacaacgtcc ccctcaaagg aggaaattgc 361 tcagaagacc tcttatgtat cctgaaaaga gctccattca agaagtcatg ggcctacctc 421 caagtggcaa agcatctaaa caaaaccaag ttgtcttgga acaaagatgg cattctccat 481 ggagtcagat atcaggatgg gaatctggtg atccaattcc ctggtttgta cttcatcatt 541 tgccaactgc agtttcttgt acaatgccca aataattctg tcgatctgaa gttggagctt 601 ctcatcaaca agcatatcaa aaaacaggcc ctggtgacag tgtgtgagtc tggaatgcaa 661 acgaaacacg tataccagaa tctctctcaa ttcttgctgg attacctgca ggtcaacacc 721 accatatcag tcaatgtgga tacattccag tacatagata caagcacctt tcctcttgag 781 aatgtgttgt ccatcttctt atacagtaat tcagactgaa cagtttctct tggccttcag 841 gaagaaagcg cctctctacc atacagtatt tcatccctcc aaacacttgg gcaaaaagaa 901 aactttagac caagacaaac tacacagggt attaaatagt atacttctcc ttctgtctct 961 tggaaagata cagctccagg gttaaaaaga gagtttttag tgaagtatct ttcagatagc 1021 aggcagggaa gcaatgtagt gtggtgggca gagccccaca cagaatcaga agggatgaat 1081 ggatgtccca gcccaaccac taattcactg tatggtcttg atctatttct tctgttttga 1141 gagcctccag ttaaaatggg gcttcagtac cagagcagct agcaactctg ccctaatggg 1201 aaatgaaggg gagctgggtg tgagtgttta cactgtgccc ttcacgggat acttctttta 1261 tctgcagatg gcctaatgct tagttgtcca agtcgcgatc aaggactctc tcacacagga 1321 aacttcccta tactggcaga tacacttgtg actgaaccat gcccagttta tgcctgtctg 1381 actgtcactc tggcactagg aggctgatct tgtactccat atgaccccac ccctaggaac 1441 ccccagggaa aaccaggctc ggacagcccc ctgttcctga gatggaaagc acaaatttaa 1501 tacaccacca caatggaaaa caagttcaaa gacttttact tacagatcct ggacagaaag 1561 ggcataatga gtctgaaggg cagtcctcct tctccaggtt acatgaggca ggaataagaa 1621 gtcagacaga gacagcaaga cagttaacaa cgtaggtaaa gaaatagggt gtggtcactc 1681 tcaattcact ggcaaatgcc tgaatggtct gtctgaagga agcaacagag aagtggggaa 1741 tccagtctgc taggcaggaa agatgcctct aagttcttgt ctctggccag aggtgtggta 1801 tagaaccaga aacccatatc aagggtgact aagcccggct tccggtatga gaaattaaac 1861 ttgtatacaa aatggttgcc aaggcaacat aaaattataa gaattc // LOCUS HUMCD30A 3630 bp mRNA PRI 01-NOV-1994 DEFINITION H.sapiens lymphocyte activation antigen CD30 mRNA, complete cds. ACCESSION M83554 NID g180095 KEYWORDS Hodgkin's disease; lymphocyte activation antigen; nerve growth factor receptor related. SOURCE Homo sapiens Lymphoma cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3630) AUTHORS Durkop,H., Latza,U., Hummel,M., Eitelbach,F., Seed,B. and Stein,H. TITLE Molecular cloning and expression of a new member of the nerve growth factor receptor family that is characteristic for Hodgkin's disease JOURNAL Cell 68 (3), 421-427 (1992) MEDLINE 92154659 FEATURES Location/Qualifiers source 1..3630 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HUT-102" /cell_type="T-cell" /tissue_type="Lymphoma" 5'UTR 1..222 gene 223..2010 /gene="CD30" sig_peptide 223..276 /gene="CD30" /note="G00-131-547" CDS 223..2010 /gene="CD30" /codon_start=1 /db_xref="GDB:G00-131-547" /product="CD30 antigen" /db_xref="PID:g180096" /translation="MRVLLAALGLLFLGALRAFPQDRPFEDTCHGNPSHYYDKAVRRC CYRCPMGLFPTQQCPQRPTDCRKQCEPDYYLDEADRCTACVTCSRDDLVEKTPCAWNS SRVCECRPGMFCSTSAVNSCARCFFHSVCPAGMIVKFPGTAQKNTVCEPASPGVSPAC ASPENCKEPSSGTIPQAKPTPVSPATSSASTMPVRGGTRLAQEAASKLTRAPDSPSSV GRPSSDPGLSPTQPCPEGSGDCRKQCEPDYYLDEAGRCTACVSCSRDDLVEKTPCAWN SSRTCECRPGMICATSATNSCARCVPYPICAAETVTKPQDMAEKDTTFEAPPLGTQPD CNPTPENGEAPASTSPTQSLLVDSQASKTLPIPTSAPVALSSTGKPVLDAGPVLFWVI LVLVVVVGSSAFLLCHRRACRKRIRQKLHLCYPVQTSQPKLELVDSRPRRSSTQLRSG ASVTEPVAEERGLMSQPLMETCHSVGAAYLESLPLQDASPAGGPSSPRDLPEPRVSTE HTNNKIEKIYIMKADTVIVGTVKAELPEGRGLAGPAEPELEEELEADHTPHYPEQETE PPLGSCSDVMLSVEEEGKEDPLPTAASGK" mat_peptide 277..2007 /gene="CD30" /note="G00-131-547" /product="CD30 antigen" 3'UTR 2011..3622 polyA_site 2400 /gene="CD30" /note="G00-131-547" polyA_site 3623 /gene="CD30" /note="G00-131-547" BASE COUNT 709 a 1137 c 1109 g 675 t ORIGIN 1 atacgggaga actaaggctg aaacctcgga ggaacaacca cttttgaagt gacttcgcgg 61 cgtgcgttgg gtgcggacta ggtggccccg gcgggagtgt gctggagcct gaagtccacg 121 cgcgcggctg agaaccgccg ggaccgcacg tgggcgccgc gcgcttcccc cgcttcccag 181 gtgggcgccg gccgccaggc cacctcacgt ccggccccgg ggatgcgcgt cctcctcgcc 241 gcgctgggac tgctgttcct gggggcgcta cgagccttcc cacaggatcg acccttcgag 301 gacacctgtc atggaaaccc cagccactac tatgacaagg ctgtcaggag gtgctgttac 361 cgctgcccca tggggctgtt cccgacacag cagtgcccac agaggcctac tgactgcagg 421 aagcagtgtg agcctgacta ctacctggat gaggccgacc gctgtacagc ctgcgtgact 481 tgttctcgag atgacctcgt ggagaagacg ccgtgtgcat ggaactcctc ccgtgtctgc 541 gaatgtcgac ccggcatgtt ctgttccacg tctgccgtca actcctgtgc ccgctgcttc 601 ttccattctg tctgtccggc agggatgatt gtcaagttcc caggcacggc gcagaagaac 661 acggtctgtg agccggcttc cccaggggtc agccctgcct gtgccagccc agagaactgc 721 aaggaaccct ccagtggcac catcccccag gccaagccca ccccggtgtc cccagcaacc 781 tccagtgcca gcaccatgcc tgtaagaggg ggcacccgcc tcgcccagga agctgcttct 841 aaactgacga gggctcccga ctctccctcc tctgtgggaa ggcctagttc agatccaggt 901 ctgtccccaa cacagccatg cccagagggg tctggtgatt gcagaaagca gtgtgagccc 961 gactactacc tggacgaggc cggccgctgc acagcctgcg tgagctgttc tcgagatgac 1021 cttgtggaga agacgccatg tgcatggaac tcctcccgca cctgcgaatg tcgacctggc 1081 atgatctgtg ccacatcagc caccaactcc tgtgcccgct gtgtccccta cccaatctgt 1141 gcagcagaga cggtcaccaa gccccaggat atggctgaga aggacaccac ctttgaggcg 1201 ccacccctgg ggacccagcc ggactgcaac cccaccccag agaatggcga ggcgcctgcc 1261 agcaccagcc ccactcagag cttgctggtg gactcccagg ccagtaagac gctgcccatc 1321 ccaaccagcg ctcccgtcgc tctctcctcc acggggaagc ccgttctgga tgcagggcca 1381 gtgctcttct gggtgatcct ggtgttggtt gtggtggtcg gctccagcgc cttcctcctg 1441 tgccaccgga gggcctgcag gaagcgaatt cggcagaagc tccacctgtg ctacccggtc 1501 cagacctccc agcccaagct agagcttgtg gattccagac ccaggaggag ctcaacgcag 1561 ctgaggagtg gtgcgtcggt gacagaaccc gtcgcggaag agcgagggtt aatgagccag 1621 ccactgatgg agacctgcca cagcgtgggg gcagcctacc tggagagcct gccgctgcag 1681 gatgccagcc cggccggggg cccctcgtcc cccagggacc ttcctgagcc ccgggtgtcc 1741 acggagcaca ccaataacaa gattgagaaa atctacatca tgaaggctga caccgtgatc 1801 gtggggaccg tgaaggctga gctgccggag ggccggggcc tggcggggcc agcagagccc 1861 gagttggagg aggagctgga ggcggaccat accccccact accccgagca ggagacagaa 1921 ccgcctctgg gcagctgcag cgatgtcatg ctctcagtgg aagaggaagg gaaagaagac 1981 cccttgccca cagctgcctc tggaaagtga ggcctgggct gggctggggc taggagggca 2041 gcagggtggc ctctgggagg ccaggatggc actgttggca ccgaggttgg gggcagaggc 2101 ccatctggcc tgaactgagg ctccagcatc tagtggtgga ccggccggtc actgcagggg 2161 tctggtggtc tctgcttgca tccccaactt agctgtcccc tgacccagag cctaggggat 2221 ccggggcttg tacagaagag acagtccaag gggactggat cccagcagtg atgttggttg 2281 aggcagcaaa cagatggcag gatgggcact gccgagaaca gcattggtcc cagagccctg 2341 ggcatcagac cttaaccacc aggcccacag cccagcgagg gagaggtcgt gaggccagct 2401 cccggggccc ctgtaaccct actctcctct ctccctggac ctcagaggtg acacccattg 2461 ggcccttccg gcatgccccc agttactgta aatgtggccc ccagtgggca tggagccagt 2521 gcctgtggtt gtttctccag agtcaaaagg gaagtcgagg gatggggcgt cgtcagctgg 2581 cactgtctct gctgcagcgg ccacactgta ctctgcactg gtgtgagggc ccctgcctgg 2641 actgtgggac cctcctggtg ctgcccacct tccctgtcct gtagccccct cggtgggccc 2701 agggcctagg ggcccaggat caagtcactc atctcagaat gtccccacca atccccgcca 2761 cagcaggcgc ctcgggtccc agatgtctgc agccctcagc agctgcagac cgcccctcac 2821 caacccagag aacctgcttt actttgccca gggacttcct ccccatgtga acatggggaa 2881 cttcgggccc tgcctggagt ccttgaccgc tctctgtggg ccccacccac tctgtcctgg 2941 gaaatgaaga agcatcttcc ttaggtctgc cctgcttgca aatccactag caccgacccc 3001 accacctggt tccggctctg cacgctttgg ggtgtggatg tcgagaggca ccacggcctc 3061 acccaggcat ctgctttact ctggaccata ggaaacaaga ccgtttggag gtttcatcag 3121 gattttgggt ttttcacatt tcacgctaag gagtagtggc cctgacttcc ggtcggctgg 3181 ccagctgact ccctagggcc ttcagacgtg tatgcaaatg agtgatggat aaggatgagt 3241 cttggagttg cgggcagcct ggagactcgt ggacttaccg cctggaggca ggcccgggaa 3301 ggctgctgtt tactcatcgg gcagccacgt gctctctgga ggaagtgata gtttctgaaa 3361 ccgctcagat gttttgggga aagttggaga agccgtggcc ttgcgagagg tggttacacc 3421 agaacctgga cattggccag aagaagctta agtgggcaga cactgtttgc ccagtgtttg 3481 tgcaaggatg gagtgggtgt ctctgcatca cccacagccg cagctgtaag gcacgctgga 3541 aggcacacgc ctgccaggca gggcagtctg gcgcccatga tgggagggat tgacatgttt 3601 caacaaaata atgcacttcc ttaaaaaaaa // LOCUS HUMCD33DA 1437 bp mRNA PRI 01-NOV-1994 DEFINITION Human differentiation antigen (CD33) mRNA, complete cds. ACCESSION M23197 NID g180097 KEYWORDS differentiation antigen CD33. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1437) AUTHORS Simmons,D. and Seed,B. TITLE Isolation of a cDNA encoding CD33, a differentiation antigen of myeloid progenitor cells JOURNAL J. Immunol. 141 (8), 2797-2800 (1988) MEDLINE 89009814 FEATURES Location/Qualifiers source 1..1437 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="U937" /lab_host="COS cell" /map="19q13.3" gene 13..1416 /gene="CD33" CDS 13..1107 /gene="CD33" /codon_start=1 /db_xref="GDB:G00-119-762" /product="differentiation antigen" /db_xref="PID:g180098" /translation="MPLLLLLPLLWAGALAMDPNFWLQVQESVTVQEGLCVLVPCTFF HPIPYYDKNSPVHGYWFREGAIISGDSPVATNKLDQEVQEETQGRFRLLGDPSRNNCS LSIVDARRRDNGSYFFRMERGSTKYSYKSPQLSVHVTDLTHRPKILIPGTLEPGHSKN LTCSVSWACEQGTPPIFSWLSAAPTSLGPRTTHSSVLIITPRPQDHGTNLTCQVKFAG AGVTTERTIQLNVTYVPQNPTTGIFPGDGSGKQETRAGLVHGAIGGAGVTALLALCLC LIFFIVKTHRRKAARTAVGSNDTHPTTGSASPKHQKNSKLHGPTETSSCSGAAPTVEM DEELHYASLNFHGMNPSKDTSTEYSEVRTQ" polyA_signal 1411..1416 /gene="CD33" /note="G00-119-762" polyA_site 1437 /gene="CD33" /note="G00-119-762" BASE COUNT 360 a 412 c 343 g 322 t ORIGIN Chromosome 19q13.3. 1 gcttcctcag acatgccgct gctgctactg ctgcccctgc tgtgggcagg ggccctggct 61 atggatccaa atttctggct gcaagtgcag gagtcagtga cggtacagga gggtttgtgc 121 gtcctcgtgc cctgcacttt cttccatccc ataccctact acgacaagaa ctccccagtt 181 catggttact ggttccggga aggagccatt atatccgggg actctccagt ggccacaaac 241 aagctagatc aagaagtaca ggaggagact cagggcagat tccgcctcct tggggatccc 301 agtaggaaca actgctccct gagcatcgta gacgccagga ggagggataa tggttcatac 361 ttctttcgga tggagagagg aagtaccaaa tacagttaca aatctcccca gctctctgtg 421 catgtgacag acttgaccca caggcccaaa atcctcatcc ctggcactct agaacccggc 481 cactccaaaa accttacctg ctctgtgtcc tgggcctgtg agcagggaac acccccgatc 541 ttctcctggt tgtcagctgc ccccacctcc ctgggcccca ggactactca ctcctcggtg 601 ctcataatca ccccacggcc ccaggaccac ggcaccaacc tgacctgtca ggtgaagttc 661 gctggagctg gtgtgactac ggagagaacc atccagctca acgtcaccta tgttccacag 721 aacccaacaa ctggtatctt tccaggagat ggctcaggga aacaagagac cagagcagga 781 ctggttcatg gggccattgg aggagctggt gttacagccc tgctcgctct ttgtctctgc 841 ctcatcttct tcatagtgaa gacccacagg aggaaagcag ccaggacagc agtgggcagc 901 aatgacaccc accctaccac agggtcagcc tccccgaaac accagaagaa ctccaagtta 961 catggcccca ctgaaacctc aagctgttca ggtgccgccc ctactgtgga gatggatgag 1021 gagctgcatt atgcttccct caactttcat gggatgaatc cttccaagga cacctccacc 1081 gaatactcag aggtcaggac ccagtgagga accctcaaga gcatcaggct cagctagaag 1141 atccacatcc tctacaggtc ggggaccaaa ggctgattct tggagattta actccccaca 1201 ggcaatgggt ttatagacat tatgtgagtt tcctgctata ttaacatcat cttgagactt 1261 tgcaagcaga gagtcgtgga atcaaatctg tgctctttca tttgctaagt gtatgatgtc 1321 acacaagctc cttaaccttc catgtctcca ttttcttctc tgtgaagtag gtataagaag 1381 tcctatctca tagggatgct gtgagcatta aataaaggta cacatggaaa acaccag // LOCUS HUMCD34HS 2615 bp mRNA PRI 02-NOV-1993 DEFINITION Human CD34 mRNA, complete cds. ACCESSION M81104 X60172 NID g180108 KEYWORDS CD34; hematopoietic stem cell surface antigen; sialomucin. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2615) AUTHORS Simmons,D.L., Satterthwaite,A.B., Tenen,D.G. and Seed,B. TITLE Molecular cloning of a cDNA encoding CD34, a sialomucin of human hematopoietic stem cells JOURNAL J. Immunol. 148, 267-271 (1992) MEDLINE 92091783 FEATURES Location/Qualifiers source 1..2615 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 294..1415 /codon_start=1 /product="CD34" /db_xref="PID:g180109" /translation="MPRGWTALCLLSLLPSGFMSLDNNGTATPELPTQGTFSNVSTNV SYQETTTPSTLGSTSLHPVSQHGNEATTNITETTVKFTSTSVITSVYGNTNSSVQSQT SVISTVFTTPANVSTPETTLKPSLSPGNVSDLSTTSTSLATSPTKPYTSSSPILSDIK AEIKCSGIREVKLTQGICLEQNKTSSCAEFKKDRGEGLARVLCGEEQADADAGAQVCS LLLAQSEVRPQCLLLVLANRTEISSKLQLMKKHQSDLKKLGILDFTEQDVASHQSYSQ KTLIALVTSGALLAVLGITGYFLMNRRSWSPTGERLGEDPYYTENGGGQGYSSGPGTS PEAQGKASVNRGAQKNGTGQATSRNGHSARQHVVADTEL" BASE COUNT 616 a 763 c 627 g 609 t ORIGIN 1 ccttttttgg cctcgacggc ggcaacccag cctccctcct aacgccctcc gcctttggga 61 ccaaccaggg gagctcaagt tagtagcagc caaggagagg cgctgccttg ccaagactaa 121 aaagggaggg gagaagagag gaaaaaagca agaatccccc acccctctcc cgggcggagg 181 gggcgggaag agcgcgtcct ggccaagccg agtagtgtct tccactcggt gcgtctctct 241 aggagccgcg cgggaaggat gctggtccgc aggggcgcgc gcgagggccc aggatgccgc 301 ggggctggac cgcgctttgc ttgctgagtt tgctgccttc tgggttcatg agtcttgaca 361 acaacggtac tgctacccca gagttaccta cccagggaac attttcaaat gtttctacaa 421 atgtatccta ccaagaaact acaacaccta gtacccttgg aagtaccagc ctgcaccctg 481 tgtctcaaca tggcaatgag gccacaacaa acatcacaga aacgacagtc aaattcacat 541 ctacctctgt gataacctca gtttatggaa acacaaactc ttctgtccag tcacagacct 601 ctgtaatcag cacagtgttc accaccccag ccaacgtttc aactccagag acaaccttga 661 agcctagcct gtcacctgga aatgtttcag acctttcaac cactagcact agccttgcaa 721 catctcccac taaaccctat acatcatctt ctcctatcct aagtgacatc aaggcagaaa 781 tcaaatgttc aggcatcaga gaagtgaaat tgactcaggg catctgcctg gagcaaaata 841 agacctccag ctgtgcggag tttaagaagg acaggggaga gggcctggcc cgagtgctgt 901 gtggggagga gcaggctgat gctgatgctg gggcccaggt atgctccctg ctccttgccc 961 agtctgaggt gaggcctcag tgtctactgc tggtcttggc caacagaaca gaaatttcca 1021 gcaaactcca acttatgaaa aagcaccaat ctgacctgaa aaagctgggg atcctagatt 1081 tcactgagca agatgttgca agccaccaga gctattccca aaagaccctg attgcactgg 1141 tcacctcggg agccctgctg gctgtcttgg gcatcactgg ctatttcctg atgaatcgcc 1201 gcagctggag ccccacagga gaaaggctgg gcgaagaccc ttattacacg gaaaacggtg 1261 gaggccaggg ctatagctca ggacctggga cctcccctga ggctcaggga aaggccagtg 1321 tgaaccgagg ggctcagaaa aacgggaccg gccaggccac ctccagaaac ggccattcag 1381 caagacaaca cgtggtggct gataccgaat tgtgactcgg ctaggtgggg caaggctggg 1441 cagtgtccga gagagcaccc ctctctgcat ctgaccacgt gctaccccca tgctggaggt 1501 gacatctctt acgcccaacc cttccccact gcacacacct cagaggctgt tcttggggcc 1561 ctacaccttg aggagggggc aggtaaactc ctgtccttta cacattcggc tccctggagc 1621 cagactctgg tcttctttgg gtaaacgtgt gacgggggaa agccaaggtc tggagaagct 1681 cccaggaaca atcgatggcc ttgcagcact cacacaggac ccccttcccc taccccctcc 1741 tctctgccgc aatacaggaa cccccagggg aaagatgagc ttttctaggc tacaattttc 1801 tcccaggaag ctttgatttt taccgtttct tccctgtatt ttctttctct actttgagga 1861 aaccaaagta accttttgca cctgctctct tgtaatgata tagccagaaa aacgtgttgc 1921 cttgaaccac ttccctcatc tctcctccaa gacactgtgg acttggtcac cagctcctcc 1981 cttgttctct aagttccact gagctccatg tgccccctct accatttgca gagtcctgca 2041 cagttttctg gctggagcct agaacaggcc tcccaagttt taggacaaac agctcagttc 2101 tagtctctct ggggccacac agaaactctt tttgggctcc tttttctccc tctggatcaa 2161 agtaggcagg accatgggac caggtcttgg agctgagcct ctcacctgta ctcttccgaa 2221 aaatcctctt cctctgaggc tggatcctag ccttatcctc tgatctccat ggcttcctcc 2281 tccctcctgc cgactcctgg gttgagctgt tgcctcagtc ccccaacaga tgcttttctg 2341 tctctgcctc cctcaccctg agccccttcc ttgctctgca cccccatatg gtcatagccc 2401 agatcagctc ctaaccctta tcaccagctg cctcttctgt gggtgaccca ggtccttgtt 2461 tgctgttgat ttctttccag aggggttgag cagggatcct ggtttcaatg acggttggaa 2521 atagaaattt ccagagaaga gagtattggg tagatatttt ttctgaatac aaagtgatgt 2581 gtttaaatac tgcaattaaa gtgatactga aacac // LOCUS HUMCD38 1233 bp mRNA PRI 19-JUN-1995 DEFINITION Human lymphocyte differentiation antigen CD38 mRNA, complete cds. ACCESSION M34461 NID g862620 KEYWORDS cell surface glycoprotein; lymphocyte differentiation antigen CD38; membrane glycoprotein. SOURCE Human PHA-treated peripheral blood cell, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1233) AUTHORS Jackson,D.G. and Bell,J.I. TITLE Isolation of a cDNA encoding the human CD38 (T10) molecule, a cell surface glycoprotein with an unusual discontinuous pattern of expression during lymphocyte differentiation JOURNAL J. Immunol. 144 (7), 2811-2815 (1990) MEDLINE 90203621 FEATURES Location/Qualifiers source 1..1233 /organism="Homo sapiens" /db_xref="taxon:9606" /map="4" gene 70..972 /gene="CD38" CDS 70..972 /gene="CD38" /note="lymphocyte differentiation antigen CD38" /codon_start=1 /db_xref="GDB:G00-119-763" /db_xref="PID:g180119" /translation="MANCEFSPVSGDKPCCRLSRRAQLCLGVSILVLILVVVLAVVVP RWRQTWSGPGTTKRFPETVLARCVKYTEIHPEMRHVDCQSVWDAFKGAFISKHPCNIT EEDYQPLMKLGTQTVPCNKILLWSRIKDLAHQFTQVQRDMFTLEDTLLGYLADDLTWC GEFNTSKINYQSCPDWRKDCSNNPVSVFWKTVSRRFAEAACDVVHVMLNGSRSKIFDK NSTFGSVEVHNLQPEKVQTLEAWVIHGGREDSRDLCQDPTIKELESIISKRNIQFSCK NIYRPDKFLQCVKNPEDSSCTSEI" BASE COUNT 329 a 298 c 288 g 318 t ORIGIN 1 ctaaagctct cttgctgcct agcctcctgc cggcctcatc ttcgcccagc caaccccgcc 61 tggagcccta tggccaactg cgagttcagc ccggtgtccg gggacaaacc ctgctgccgg 121 ctctctagga gagcccaact ctgtcttggc gtcagtatcc tggtcctgat cctcgtcgtg 181 gtgctcgcgg tggtcgtccc gaggtggcgc cagacgtgga gcggtccggg caccaccaag 241 cgctttcccg agaccgtcct ggcgcgatgc gtcaagtaca ctgaaattca tcctgagatg 301 agacatgtag actgccaaag tgtatgggat gctttcaagg gtgcatttat ttcaaaacat 361 ccttgcaaca ttactgaaga agactatcag ccactaatga agttgggaac tcagaccgta 421 ccttgcaaca agattcttct ttggagcaga ataaaagatc tggcccatca gttcacacag 481 gtccagcggg acatgttcac cctggaggac acgctgctag gctaccttgc tgatgacctc 541 acatggtgtg gtgaattcaa cacttccaaa ataaactatc aatcttgccc agactggaga 601 aaggactgca gcaacaaccc tgtttcagta ttctggaaaa cggtttcccg caggtttgca 661 gaagctgcct gtgatgtggt ccatgtgatg ctcaatggat cccgcagtaa aatctttgac 721 aaaaacagca cttttgggag tgtggaagtc cataatttgc aaccagagaa ggttcagaca 781 ctagaggcct gggtgataca tggtggaaga gaagattcca gagacttatg ccaggatccc 841 accataaaag agctggaatc gattataagc aaaaggaata ttcaattttc ctgcaagaat 901 atctacagac ctgacaagtt tcttcagtgt gtgaaaaatc ctgaggattc atcttgcaca 961 tctgagatct gagccagtcg ctgtggttgt tttagctcct tgactccttg tggtttatgt 1021 catcatacat gactcagcat acctgctggt gcagagctga agattttgga gggtcctcca 1081 caataaggtc aatgccagag acggaagcct ttttccccaa agtcttaaaa taacttatat 1141 catcagcata cctttattgt gatctatcaa tagtcaagaa aaattattgt ataagattag 1201 aatgaaaatt gtatgttaag ttacttcctt tag // LOCUS HUMCD43 5050 bp DNA PRI 01-NOV-1994 DEFINITION Human leukosialin (CD43) gene, complete cds. ACCESSION M61827 NID g180125 KEYWORDS leukosialin; sialoglycoprotein. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5050) AUTHORS Kudo,S. and Fukuda,M. TITLE A short, novel promoter sequence confers the expression of human leukosialin, a major sialoglycoprotein on leukocytes JOURNAL J. Biol. Chem. 266 (13), 8483-8489 (1991) MEDLINE 91217090 COMMENT From EMBL entry HSCD43; dated 21-JUL-1991. FEATURES Location/Qualifiers source 1..5050 /organism="Homo sapiens" /db_xref="taxon:9606" /map="16p11.2" mRNA join(1799..1868,2247..4068) /gene="SPN" /note="G00-120-384" /product="leukosialin" gene join(1799..1868,2247..4068) /gene="SPN" exon 1799..1868 /gene="SPN" /number=1 /product="leukosialin" gene 1799..4068 /gene="CD43" exon 2247..4068 /gene="SPN" /number=2 /product="leukosialin" CDS 2281..3483 /gene="SPN" /codon_start=1 /db_xref="GDB:G00-120-384" /product="leukosialin" /db_xref="PID:g180126" /translation="MATLLLLLGVLVVSPDALGSTTAVQTPTSGEPLVSTSEPLSSKM YTTSITSDPKADSTGDQTSALPPSTSINEGSPLWTSIGASTGSPLPEPTTYQEVSIKM SSVPQETPHATSHPAVPITANSLGSHTVTGGTITTNSPETSSRTSGAPVTTAASSLET SRGTSGPPLTMATVSLETSKGTSGPPVTMATDSLETSTGTTGPPVTMTTGSLEPSSGA SGPQVSSVKLSTMMSPTTSTNASTVPFRNPDENSRGMLPVAVLVALLAVIVLVALLLL WRRRQKRRTGALVLSRGGKRNGVVDAWAGPAQVPEEGAVTVTVGGSGGDKGSGFPDGE GSSRRPTLTTFFGRRKSRQGSLAMEELKSGSGPSLKGEEEPLVASEDGAVDAPAPDEP EGGDGAAP" BASE COUNT 1087 a 1496 c 1353 g 1114 t ORIGIN 1 cccccctgca gaatgggcac cccgttacct ttctgagcca ctgtgcgcag aaaagagagc 61 atgttggcca ggctggtctc gaactcctga cctcaagtga tcagcctgcc ttacctccca 121 aagtcctggg attacaggcg tgaaccacca cgctcagcct ctgaatactt tgtactcaag 181 ccatttttca gtgctgtgtt tgcagtgagc acacccgagg gatgaagaca cgtctccctg 241 tgggaacctg ggcttaccag ggcccctaga ggaggggaat ctctcaagct cagagctcta 301 tggctgcggt gcaggcccac tgtgtgcatg gtgtcagtct gggcccttcc atgttgcccc 361 cgtgggactt ggggtaaggg gaactgatgc aaacatcacg ctgctgttgc ttggtgtgag 421 caattaattc ctgtggctct cacccaggag tctcatgtct ttgggtcaga caaactcatc 481 agcttgtaga aatggcacag tcccacgggc ctgttagaat cttctattgt gcacatgttg 541 ctcttaaaat atacaaatca gttttgattt taaaaaatta tttatttttt tagtgatagg 601 agttttgcta cgttgcccag gctggtttca aactcttggg ctcaggaggt cctcccactt 661 tggcctggac tgccagcata atgtatcacc acacccggga ctgattttcg tttttcaaga 721 acaaaaacca aaaacataca caaaccgaga gtcaaagctt gctaattaga ggaaagtcag 781 gaaatgggaa ccattcaaag aagaaaatac ccccacctcc tactctcacc tatccaaaga 841 caattaggtg aatcccttag tagatatctt tccagacggt tttccatata gattcccata 901 tctggccagg cgcggtggct cacacctgta atcctagcgc ttggggaggc tgaggcggat 961 ggaccacctg aggtcaggag ttcgagacca gcctgaccaa catggagaaa cctcgtctct 1021 acgaaaaata caaaattagc cgggcacagt ggtgcaagcc tgtaatccca gctactcagg 1081 aggccgaggc aggagaattg cttgaaccta ggaggcagac attgtgctga gccgagccaa 1141 gatcatgcca ttgcactaaa ctccgcctta aaaaaaaaaa aaaagattcc cacatcttta 1201 ctagtttgca gaaataagat cctagcatat gcagtgtgta ggaaccacct tggtttagcc 1261 acgtctctgt gactgggggc cactgtggtg acccccagct ccccggacag agtcaagagc 1321 tcaccagcct gcaaaggttt tcacggcccc cagccagact cgggggcttc ctcttgccct 1381 gctacttcct gggagctctg agggcaggaa atggcgccac tcagctcctg gcctaacagc 1441 ttggggacca caaatgcaaa ggaaaccacc ctcccctccc acctcctcct ctgcaccctt 1501 gagttctcag gctcacattc ccaccaccca cctctgagcc cagccctccc tagcatcacc 1561 acttccatcc cattcctcag ccaagagcca ggaatcctga ttccagatcc cacgcttccc 1621 tgcctccctc aggtgagccc cagaccccca ggcaccccgc tggcccctga aggagcaggt 1681 gatggtgctg tcttcgccca gcagctgtgg gagcaggcgg gtggggcagg atggaggggt 1741 gggtggggtg ggtggagcca gggcccactt cctttcccct tggggccctg tccttcccag 1801 tcttgcccca gcctcgggag gtggtggagt gacctggccc cagtgctgcg tccttatcag 1861 ccgagccggt aagagggtga gacttggtgg ggtaggggcc tcagtgggcc tgggaatgtg 1921 cctgtggctt gaaaagactc tgacaggtta tgatgggaag agattgggag ccattgggct 1981 gcacagggtc agggaaggcc aggaggggct ggtcactgct ggaatctaag ctgctgaggc 2041 tggagggagc ctcaggatgg ggctgatggg ggagctgcca gcatctgttc ctctgtcatt 2101 tctgataaca gtaaaagcca gcatggaaaa aaccgttaaa ccgcaggttg ggcctggccg 2161 ttggcaggga agtgggcaga ggggaggccc ggccaggtcc tccggcaact cccgcgtgtt 2221 ctgcttctcc ggctgcccac ctgcaggtcc cagctcttgc tcctgcctgt ttgcctggaa 2281 atggccacgc ttctccttct ccttggggtg ctggtggtaa gcccagacgc tctggggagc 2341 acaacagcag tgcagacacc cacctccgga gagcctttgg tctctactag cgagcccctg 2401 agctcaaaga tgtacaccac ttcaataaca agtgacccta aggccgacag cactggggac 2461 cagacctcag ccctacctcc ctcaacttcc atcaatgagg gatcccctct ttggacttcc 2521 attggtgcca gcactggttc ccctttacct gagccaacaa cctaccagga agtttccatc 2581 aagatgtcat cagtgcccca ggaaacccct catgcaacca gtcatcctgc tgttcccata 2641 acagcaaact ctctaggatc ccacaccgtg acaggtggaa ccataacaac gaactctcca 2701 gaaacctcca gtaggaccag tggagcccct gttaccacgg cagctagctc tctggagacc 2761 tccagaggca cctctggacc ccctcttacc atggcaactg tctctctgga gacttccaaa 2821 ggcacctctg gaccccctgt taccatggca actgactctc tggagacctc cactgggacc 2881 actggacccc ctgttaccat gacaactggc tctctggagc cctccagcgg ggccagtgga 2941 ccccaggtct ctagcgtaaa actatctaca atgatgtctc caacgacctc caccaacgca 3001 agcactgtgc ccttccggaa cccagatgag aactcacgag gcatgctgcc agtggctgtg 3061 cttgtggccc tgctggcggt catagtcctc gtggctctgc tcctgctgtg gcgccggcgg 3121 cagaagcggc ggactggggc cctcgtgctg agcagaggcg gcaagcgtaa cggggtggtg 3181 gacgcctggg ctgggccagc ccaggtccct gaggaggggg ccgtgacagt gaccgtggga 3241 gggtccgggg gcgacaaggg ctctgggttc cccgatgggg aggggtctag ccgtcggccc 3301 acgctcacca ctttctttgg cagacggaag tctcgccagg gctccctggc gatggaggag 3361 ctgaagtctg ggtcaggccc cagcctcaaa ggggaggagg agccactggt ggccagtgag 3421 gatggggctg tggacgcccc agctcctgat gagcccgaag ggggagacgg ggctgcccct 3481 taagtgtcgg tgaatagtga ggctggaggc cgcaatctca gccagcctcc agcaccttcc 3541 ctctcaccat cccactgccc cctcgctccc atgtttccac ccggcaccct gatcctcacc 3601 cgaatctcct tttttttttt cttttgagac agagtttcgc tttgtcgccc aggctggagt 3661 gcaatgcacg atctcagttc actgcaacct ctgcctccta agttcaggcg attctcctgc 3721 ctcagcttcc cgagtaactg agattacagg cacccaccac catgcccagc tgcttttttg 3781 tatttttggt agagatgggg tttcaccatg ttggctaggc tggtctcaaa ctcctgacct 3841 caggtgatct acctgcctca gcctcccaaa gtgctgagat tacagacatg agcctccgcg 3901 ccttgcctcc tcacccacct cttcactctg aatcctcatg aggcttctca gccctggatt 3961 tcctgctgcc atcctcaccc agcacccaca actagcgcct gggcagggca gggctggcac 4021 ctctcaacgt ctgtggactg aatgaataaa ccctcctcat ccacccctat ttatctccat 4081 caccatttcc ccctctttct tgttcctgga aacggctgct gagtctccat cggccaaact 4141 tatctgccct gtgatttctt tgacaattct ccttttcccc cagaacccac cctgggttga 4201 ccagagtctg ggaagaagga caagagaacc cggcaaactc cctcctagga ttaactttgt 4261 aaagcaccct tgccctgtag ctgcaagggc tgtggaacct gggcagcccg caaccacctt 4321 tagctctggg ccccccaggc cagcctggag catggctggg tggggccacc agcccatgct 4381 ctcaggcggg cctgtgatct ttcccagggc acatggactg taggctggcc ctggcccaca 4441 ccaccacact ctccccagcc atggacagag gcagccagag gcctcacggt ttctcctccg 4501 agtttctggc tgggtgtagt tctcagaaac cccagtgcct gcgtgtgtcc actcgtgggt 4561 gtggtttgtg tgcaagagct gaggatttgg cgatgcttgg gaggggtagt tgtgggtaca 4621 gacggtgtgg gggtgggaag tggtgcagag actgaagagg gtcaacctgg gcatggggga 4681 cacagggact gctgagaacg tgcgtgtcat ctttgctctg atggggtgga catagcagaa 4741 aatctaactc tgtctgtagc cccatacaga atgccagggt gagcacagtg gctggtgcct 4801 ttaatcccag cactttggaa agttgaggca ggaggatcgc ttgagcccag gagttcgagt 4861 ctgaagtgag ctgtgattgc accactgcac ttcagcctgg gcaacagagt gagcccctgt 4921 ctcaaaaaag aaaagaaaaa gaaagccagg cttcatggaa agatcgtatg tgtgacccaa 4981 tatgagttct tcagctcagc catggtaatc ccttccttga agtctccatt tctgcagtac 5041 acatgcatgt // LOCUS HUMCD48 1048 bp mRNA PRI 06-MAR-1995 DEFINITION Human pan-leukocyte antigen (CD48) mRNA, complete cds. ACCESSION M59904 NID g180138 KEYWORDS activation antigen; cell surface antigen; pan-leukocyte antigen. SOURCE Human cell line JY + DAUDI, cDNA to mRNA, clone pHULYM3. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1048) AUTHORS Vaughan,H.A., Henning,M.M., Purcell,D.F., McKenzie,I.F. and Sandrin,M.S. TITLE The isolation of cDNA clones for CD48 JOURNAL Immunogenetics 33 (2), 113-117 (1991) MEDLINE 91153858 FEATURES Location/Qualifiers source 1..1048 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pHULYM3" /cell_line="JY + DAUDI" /map="1q21.3-q22" gene 18..749 /gene="CD48" CDS 18..749 /gene="CD48" /codon_start=1 /db_xref="GDB:G00-119-725" /product="pan-leukocyte antigen" /db_xref="PID:g180139" /translation="MWSRGWDSCLALELLLLPLSLLVTSIQGHLVHMTVVSGSNVTLN ISESLPENYKQLTWFYTFDQKIVEWDSRKSKYFESKFKGRVRLDPQSGALYISKVQKE DNSTYIMRVLKKTGNEQEWKIKLQVLDPVPKPVIKIEKIEDMDDNCYLKLSCVIPGES VNYTWYGDKRPFPKELQNSVLETTLMPHNYSRCYTCQVSNSVSSKNGTVCLSPPCTLA RSFGVEWIASWLVVTVPTILGLLLT" misc_feature 214 /gene="CD48" /note="site of polymorphism" misc_feature 641 /gene="CD48" /note="site of polymorphism" BASE COUNT 310 a 220 c 233 g 285 t ORIGIN 1 ctgtgaaaga aggaagcatg tggtccagag gttgggattc gtgtctggct ctggaattgc 61 tactgctgcc tctgtcactc ctggtgacca gcattcaagg tcacttggta catatgaccg 121 tggtctccgg cagcaacgtg actctgaaca tctctgagag cctgcctgag aactacaaac 181 aactaacctg gttttatact ttcgaccaga agattgtaga atgggattcc agaaaatcta 241 agtactttga atccaaattt aaaggcaggg tcagacttga tcctcagagt ggcgcactgt 301 acatctctaa ggtccagaaa gaggacaaca gcacctacat catgagggtg ttgaaaaaga 361 ctgggaatga gcaagaatgg aagatcaagc tgcaagtgct tgaccctgta cccaagcctg 421 tcatcaaaat tgagaagata gaagacatgg atgacaactg ttatctgaaa ctgtcatgtg 481 tgatacctgg cgagtctgta aactacacct ggtatgggga caaaaggccc ttcccaaagg 541 agctccagaa cagtgtgctt gaaaccaccc ttatgccaca taattactcc aggtgttata 601 cttgccaagt cagcaattct gtgagcagca agaatggcac cgtctgcctc agtccaccct 661 gtaccctggc ccggtccttt ggagtagaat ggattgcaag ttggctagtg gtcacggtgc 721 ccaccattct tggcctgtta cttacctgag atgagctctt ttaactcaag cgaaacttca 781 aggccagaag atcttgcctg ttggtgatca tgctcctcag caggacagag actgtatagg 841 ctgaccagaa gcatgctgct gaattatcaa cgaggatttt caagttaact tttaaatact 901 ggttattatt taattttata tccctttgtt gttttgtagt acacagagat tatagagata 961 cacatgcttt tttcccaaaa ttgtgacaac attatgtgga atcttttatt atttttaaaa 1021 taaaaagata taattataaa aaaaaaaa // LOCUS HUMCD53 1452 bp mRNA PRI 01-NOV-1994 DEFINITION Human cell surface antigen (CD53) mRNA, complete cds. ACCESSION M60871 NID g180140 KEYWORDS cell surface antigen; type III integral membrane protein. SOURCE Human promyelocytic tumor cell line HL60, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1452) AUTHORS Amiot,M. TITLE Identification and analysis of cDNA clones encoding CD53. A pan-leukocyte antigen related to membrane transport proteins JOURNAL J. Immunol. 145 (12), 4322-4325 (1990) MEDLINE 91079522 FEATURES Location/Qualifiers source 1..1452 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="promyelocytic tumor cell line HL60" /map="Unassigned" mRNA 1..1452 /gene="CD53" /note="G00-127-521" gene 1..1452 /gene="CD53" CDS 74..733 /gene="CD53" /codon_start=1 /db_xref="GDB:G00-127-521" /product="cell surface antigen" /db_xref="PID:g180141" /translation="MGMSSLKLLKYVLFFFNLLFWICGCCILGFGIYLLIHNNFGVLF HNLPSLTLGNVFVIVGSIIMVVAFLGCMGSIKENKCLLMSFFILLLIILLAEVTLAIL LFVYEQKLNEYVAKGLTDSIHRYHSDNSTKAAWDSIQSFLQCCGINGTSDWTSGPPAS CPSDRKVEGCYAKARLWFHSNFLYIGIITICVCVIEVLGMSFALTLNCQIDKTSQTIG L" BASE COUNT 373 a 342 c 314 g 423 t ORIGIN 1 ctcaaggata atcactaaat tctgccgaaa ggactgagga acggtgcctg gaaaagggca 61 agaatatcac ggcatgggca tgagtagctt gaaactgctg aagtatgtcc tgtttttctt 121 caacttgctc ttttggatct gtggctgctg cattttgggc tttgggatct acctgctgat 181 ccacaacaac ttcggagtgc tcttccataa cctcccctcc ctcacgctgg gcaatgtgtt 241 tgtcatcgtg ggctctatta tcatggtagt tgccttcctg ggctgcatgg gctctatcaa 301 ggaaaacaag tgtctgctta tgtcgttctt catcctgctg ctgattatcc tccttgctga 361 ggtgaccttg gccatcctgc tctttgtata tgaacagaag ctgaatgagt atgtggctaa 421 gggtctgacc gacagcatcc accgttacca ctcagacaat agcaccaagg cagcgtggga 481 ctccatccag tcatttctgc agtgttgtgg tataaatggc acgagtgatt ggaccagtgg 541 cccaccagca tcttgcccct cagatcgaaa agtggagggt tgctatgcga aagcaagact 601 gtggtttcat tccaatttcc tgtatatcgg aatcatcacc atctgtgtat gtgtgattga 661 ggtgttgggg atgtcctttg cactgaccct gaactgccag attgacaaaa ccagccagac 721 catagggcta tgatctgcag tagttctgtg gtgaagagac ttgtttcatc tccggaaatg 781 caaaaccatt tatagcatga agccctacat gatcactgca ggatgatcct cctcccatcc 841 tttccctttt taggtccctg tcttatacaa ccagagaagt gggtgttggc caggcacatc 901 ccatctcagg cagcaagaca atctttcact cactgacggc agcagccatg tctctcaaag 961 tggtgaaact aatatctgag catcttttag acaagagagg caaagacaaa ctggatttaa 1021 tggcccaaca tcaaagggtg aacccaggat atgaattttt gcatcttccc attgtcgaat 1081 tagtctccag cctctaaata atgcccagtc ttctccccaa agtcaagcaa gagactagtt 1141 gaagggagtt ctggggccag gctcactgga ccattgtcac aaccctctgt ttctctttga 1201 ctaagtgccc tggctacagg aattacacag ttctctttct ccaaagggca agatctcatt 1261 tcaatttctt tattagaggg ccttattgat gtgttctaag tctttccaga aaaaaactat 1321 ccagtgattt atatcctgat ttcaaccagt cacttagctg ataatcacag taagaagact 1381 tctggtatta tctctctatc agataagatt ttgttaatgt actattttac tcttcaataa 1441 ataaaacagt tt // LOCUS HUMCD59A 1671 bp mRNA PRI 01-NOV-1994 DEFINITION Human lymphocytic antigen CD59/MEM43 mRNA, complete cds. ACCESSION M34671 X15861 NID g180152 KEYWORDS CD59 antigen; cell surface antigen; integral membrane protein. SOURCE Human peripheral blood monocyte, cDNA to mRNA, clone R18.. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 108 to 443) AUTHORS Sawada,R., Ohashi,K., Okano,K., Hattori,M., Minato,N. and Naruto,M. TITLE Complementary DNA sequence and deduced peptide sequence for CD59/MEM-43 antigen, the human homologue of murine lymphocyte antigen Ly-6C JOURNAL Nucleic Acids Res. 17 (16), 6728 (1989) MEDLINE 89386002 REFERENCE 2 (bases 1 to 1671) AUTHORS Sawada,R., Ohashi,K., Anaguchi,H., Okazaki,H., Hattori,M., Minato,N. and Naruto,M. TITLE Isolation and expression of the full-length cDNA encoding CD59 antigen of human lymphocytes JOURNAL DNA Cell Biol. 9 (3), 213-220 (1990) MEDLINE 90253615 COMMENT Draft entry and computer readable copy for sequence [1] kindly provided by Naruto,M., 17-JUL-1989. [1] Author address: Naruto,M. Basic Research Laboratories Toray Industries Inc 1111 Tebiro Kamakura 248, Japan. FEATURES Location/Qualifiers source 1..1671 /organism="Homo sapiens" /db_xref="taxon:9606" /map="11p14-p13" mRNA <1..1671 /note="CD59 mRNA" sig_peptide 30..104 /gene="CD59" /note="CD59 signal peptide" gene 30..416 /gene="CD59" CDS 30..416 /gene="CD59" /note="antigen CD59 precursor (CD59)" /codon_start=1 /db_xref="GDB:G00-119-769" /db_xref="PID:g180153" /translation="MGIQGGSVLFGLLLVLAVFCHSGHSLQCYNCPNPTADCKTAVNC SSDFDACLITKAGLQVYNKCWKFEHCNFNDVTTRLRENELTYYCCKKDLCNFNEQLEN GGTSLSEKTVLLLVTPFLAAAWSLHP" mat_peptide 105..413 /gene="CD59" /note="CD59 protein" polyA_signal 527..532 BASE COUNT 434 a 347 c 390 g 500 t ORIGIN 1 ggcgccgcca ggttctgtgg acaatcacaa tgggaatcca aggagggtct gtcctgttcg 61 ggctgctgct cgtcctggct gtcttctgcc attcaggtca tagcctgcag tgctacaact 121 gtcctaaccc aactgctgac tgcaaaacag ccgtcaattg ttcatctgat tttgatgcgt 181 gtctcattac caaagctggg ttacaagtgt ataacaagtg ttggaagttt gagcattgca 241 atttcaacga cgtcacaacc cgcttgaggg aaaatgagct aacgtactac tgctgcaaga 301 aggacctgtg taactttaac gaacagcttg aaaatggtgg gacatcctta tcagagaaaa 361 cagttcttct gctggtgact ccatttctgg cagcagcctg gagccttcat ccctaagtca 421 acaccaggag agcttctccc aaactccccg ttcctgcgta gtccgctttc tcttgctgcc 481 acattctaaa ggcttgatat tttccaaatg gatcctgttg ggaaagaata aaattagctt 541 gagcaacctg gctaagatag aggggctctg ggagactttg aagaccagtc ctgtttgcag 601 ggaagcccca cttgaaggaa gaagtctaag agtgaagtag gtgtgacttg aactagattg 661 catgcttcct cctttgctct tgggaagacc agctttgcag tgacagcttg agtgggttct 721 ctgcagccct cagattattt ttcctctggc tccttggatg tagtcagtta gcatcattag 781 tacatctttg gagggtgggg caggagtata tgagcatcct ctctcacatg gaacgctttc 841 ataaacttca gggatcccgt gttgccatgg aggcatgcca aatgttccat atgtgggtgt 901 cagtcaggga caacaagatc cttaatgcag agctagagga cttctggcag ggaagtgggg 961 aagtgttcca gatagcaggg catgaaaact tagagaggta caagtggctg aaaatcgagt 1021 ttttcctctg tctttaaatt ttatatgggc tttgttatct tccactggaa aagtgtaata 1081 gcatacatca atggtgtgtt aaagctattt ccttgccttt ttttattgga atggtaggat 1141 atcttggctt tgccacacac agttacagag tgaacactct actacatgtg actggcagta 1201 ttaagtgtgc ttattttaaa tgttactggt agaaaggcag ttcaggtatg tgtgtatata 1261 gtatgaatgc agtggggaca ccctttgtgg ttacagtttg agacttccaa aggtcatcct 1321 taataacaac agatctgcag gggtatgttt taccatctgc atccagcctc ctgctaactc 1381 ctagctgact cagcatagat tgtataaaat acctttgtaa cggctcttag cacactcaca 1441 gatgtttgag gctttcagaa gctcttctaa aaaatgatac acacctttca caagggcaaa 1501 ctttttcctt ttccctgtgt attctagtga atgaatctca agattcagta gacctaatga 1561 catttgtatt ttatgatctt ggctgtattt aatggcatag gctgactttt gcagatggag 1621 gaatttcttg attaatgttg aaaaaaaacc cttgattata ctctgttgga c // LOCUS HUMCDA24A 1811 bp mRNA PRI 22-JUN-1994 DEFINITION Homo sapiens CD24 signal transducer mRNA, complete cds. ACCESSION M58664 NID g180167 KEYWORDS signal transducer CD24; signal transduction. SOURCE Human cell line K562, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1811) AUTHORS Kay,R., Rosten,P.M. and Humphries,R.K. TITLE CD24, a signal transducer modulating B cell activation responses, is a very short peptide with a glycosyl phosphatidylinositol membrane anchor JOURNAL J. Immunol. 147, 1412-1416 (1991) MEDLINE 91332458 FEATURES Location/Qualifiers source 1..1811 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="K562" sig_peptide 57..134 CDS 57..299 /codon_start=1 /product="signal transducer CD24" /db_xref="PID:g180168" /translation="MGRAMVARLGLGLLLLALLLPTQIYSSETTTGTSSNSSQSTSNS GLAPNPTNATTKAAGGALQSTASLFVVSLSLLHLYS" mat_peptide 135..296 /product="signal transducer CD24" BASE COUNT 524 a 399 c 355 g 533 t ORIGIN 1 cggttctcca agcacccagc atcctgctag acgcgccgcg caccgacgga ggggacatgg 61 gcagagcaat ggtggccagg ctcgggctgg ggctgctgct gctggcactg ctcctaccca 121 cgcagattta ttccagtgaa acaacaactg gaacttcaag taactcctcc cagagtactt 181 ccaactctgg gttggcccca aatccaacta atgccaccac caaggcggct ggtggtgccc 241 tgcagtcaac agccagtctc ttcgtggtct cactctctct tctgcatctc tactcttaag 301 agactcaggc caagaaacgt cttctaaatt tccccatctt ctaaacccaa tccaaatggc 361 gtctggaagt ccaatgtggc aaggaaaaac aggtcttcat cgaatctact aattccacac 421 cttttattga cacagaaaat gttgagaatc ccaaatttga ttgatttgaa gaacatgtga 481 gaggtttgac tagatgatgg atgccaatat taaatctgct ggagtttcat gtacaagatg 541 aaggagaggc aacatccaaa atagttaaga catgatttcc ttgaatgtgg cttgagaaat 601 atggacactt aatactacct tgaaaataag aatagaaata aaggatggga ttgtggaatg 661 gagattcagt tttcatttgg tgcttaattc tataagcgta taaacaggta atataaaaag 721 cttccatgat tctatttata tgtacatgag aaggaacttc caggtgttac tgtaattcct 781 caacgtattg tttcgacggc actaatttaa tgccgatata ctctagatga agttttacat 841 tgttgagcta ttgctgttct cttgggaact gaactcactt tcctcctgag gctttggatt 901 tgacattgca tttgaccttt tatgtagtaa ttgacatgtg ccagggcaat gatgaatgag 961 aatctacccc agatccaagc atcctgagca actcttgatt atccatattg agtcaaatgg 1021 taggcatttc ctatcacctg tttccattca acaagagcac tacattcatt tagctaaacg 1081 gattccaaag agtagaattg cattgaccac gactaatttc aaaatgcttt ttattattat 1141 tattttttag acagtctcac tttgtcgccc aggccggagt gcagtggtgc gatctcagat 1201 cagtgtacca tttgcctccc gggctcaagc gattctcctg cctcagcctc ccaagtagct 1261 gggattacag gcacctgcca ccatgcccgg ctaatttttg taattttagt agagacaggg 1321 tttcaccatg ttgcccaggc tggtttcgaa ctcctgacct caggtgatcc acccgcctcg 1381 gcctcccaaa gtgctgggat tacaggcttg agcccccgcg cccagccatc aaaatgcttt 1441 ttatttctgc atatgtttga atacttttta caatttaaaa aaatgatctg ttttgaaggc 1501 aaaattgcaa atcttgaaat taagaaggca aaatgtaaag gagtcaaact ataaatcaag 1561 tatttgggaa gtgaagactg gaagctaatt tgcataaatt cacaaacttt tatactcttt 1621 ctgtatatac attttttttc tttaaaaaac aactatggat cagaatagcc acatttagaa 1681 cactttttgt tatcagtcaa tatttttaga tagttagaac ctggtcctaa gcctaaaagt 1741 gggcttgatt ctgcagtaaa tcttttacaa ctgcctcgac acacataaac ctttttaaaa 1801 atagacactc c // LOCUS HUMCDC25A 2419 bp mRNA PRI 31-DEC-1994 DEFINITION Human cdc25A mRNA, complete cds. ACCESSION M81933 NID g180170 KEYWORDS B-type cyclin; mitotic cyclin; tyrosine phosphatase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2419) AUTHORS Galaktionov,K. and Beach,D. TITLE Specific activation of cdc25 tyrosine phosphatases by B-type cyclins: evidence for multiple roles of mitotic cyclins JOURNAL Cell 67 (6), 1181-1194 (1991) MEDLINE 92103683 FEATURES Location/Qualifiers source 1..2419 /organism="Homo sapiens" /db_xref="taxon:9606" gene 460..2031 /gene="cdc25A" CDS 460..2031 /gene="cdc25A" /note="putative" /codon_start=1 /db_xref="PID:g180171" /translation="MELGPSPAPRRLLFACSPPPASQPVVKALFGASAAGGLSPVTNL TVTMDQLQGLGSDYEQPLEVKNNSNLQRMGSSESTDSGFCLDSPGPLDSKENLENPMR RIHSLPQKLLGCSPALKRSHSDSLDHDIFQLIDPDENKENEAFEFKKPVRPVSRGCLH SHGLQEGKDLFTQRQNSAQLGMLSSNERDSSEPGNFIPLFTPQSPVTATLSDEDDGFV DLLDGENLKNEEETPSCMASLWTAPLVMRTTNLDNRCKLFDSPSLCSSSTRSVLKRPE RSQEESPPGSTKRRKSMSGASPKESTNPEKAHETLHQSLSLASSPKGTIENILDNDPR DLIGDFSKGYLFHTVAGKHQDLKYISPEIMASVLNGKFANLIKEFVIIDCRYPYEYEG GHIKGAVNLHMEEEVEDFLLKKPIVPTDGKRVIVVFHCEFSSERGPRMCRYVRERDRL GNEYPKLHYPELYVLKGGYKEFFMKCQSYCEPPSYRPMHHEDFKEDLKKFRTKSRTWA GEKSKREMYSRLKKL" BASE COUNT 563 a 653 c 686 g 517 t ORIGIN 1 cgaaaggccg gccttggctg cgacagcctg ggtaagaggt gtaggtcggc ttggttttct 61 gctacccgga gctgggcaag cgggttggga gaacagcgaa gacagcgtga gcctgggccg 121 ttgcctcgag gctctcgccc ggcttctctt gccgacccgc cacgtttgtt tggatttaat 181 cttacagctg gttgccggcg cccgcccgcc cgctggcctc gcggtgtgag agggaagcac 241 ccgtgcctgt ggctggtggc tggcgcctgg agggtccgca cacccgcccg gccgcgccgc 301 tttgcccgcg gcagccgcgt ccctgaaccg cggagtcgtg tttgtgtttg acccgcgggc 361 gccggtggcg cgcggccgag gccggtgtcg gcggggcggg gcggtcgcgg cggaggcaga 421 ggaagaggga gcgggagctc tgcgaggccg ggcgccgcca tggaactggg cccgagcccc 481 gcaccgcgcc gcctgctctt cgcctgcagc ccccctcccg cgtcgcagcc cgtcgtgaag 541 gcgctatttg gcgcttcagc cgccggggga ctgtcgcctg tcaccaacct gaccgtcact 601 atggaccagc tgcagggtct gggcagtgat tatgagcaac cactggaggt gaagaacaac 661 agtaatctgc agagaatggg ctcctccgag tcaacagatt caggtttctg tctagattct 721 cctgggccat tggacagtaa agaaaacctt gaaaatccta tgagaagaat acattcccta 781 cctcaaaagc tgttgggatg tagtccagct ctgaagagga gccattctga ttctcttgac 841 catgacatct ttcagctcat cgacccagat gagaacaagg aaaatgaagc ctttgagttt 901 aagaagccag taagacctgt atctcgtggc tgcctgcact ctcatggact ccaggagggt 961 aaagatctct tcacacagag gcagaactct gcccagctcg gaatgctttc ctcaaatgaa 1021 agagatagca gtgaaccagg gaatttcatt cctcttttta caccccagtc acctgtgaca 1081 gccactttgt ctgatgagga tgatggcttc gtggaccttc tcgatggaga gaatctgaag 1141 aatgaggagg agaccccctc gtgcatggca agcctctgga cagctcctct cgtcatgaga 1201 actacaaacc ttgacaaccg atgcaagctg tttgactccc cttccctgtg tagctccagc 1261 actcggtcag tgttgaagag accagaacgt tctcaagagg agtctccacc tggaagtaca 1321 aagaggagga agagcatgtc tggggccagc cccaaagagt caactaatcc agagaaggcc 1381 catgagactc ttcatcagtc tttatccctg gcatcttccc ccaaaggaac cattgagaac 1441 attttggaca atgacccaag ggaccttata ggagacttct ccaagggtta tctctttcat 1501 acagttgctg ggaaacatca ggatttaaaa tacatctctc cagaaattat ggcatctgtt 1561 ttgaatggca agtttgccaa cctcattaaa gagtttgtta tcatcgactg tcgataccca 1621 tatgaatacg agggaggcca catcaagggt gcagtgaact tgcacatgga agaagaggtt 1681 gaagacttct tattgaagaa gcccattgta cctactgatg gcaagcgtgt cattgttgtg 1741 tttcactgcg agttttcttc tgagagaggt ccccgcatgt gccggtatgt gagagagaga 1801 gatcgcctgg gtaatgaata ccccaaactc cactaccctg agctgtatgt cctgaagggg 1861 ggatacaagg agttctttat gaaatgccag tcttactgtg agccccctag ctaccggccc 1921 atgcaccacg aggactttaa agaagacctg aagaagttcc gcaccaagag ccggacctgg 1981 gcaggggaga agagcaagag ggagatgtac agtcgtctga agaagctctg agggcggcag 2041 gaccagccag cagcagccca agcttccctc catccccctt taccctcttt cctgcagaga 2101 aacttaagca aaggggacag ctgtgtgaca tttggagagg gggcctggga cttccatgcc 2161 ttaaacctac ctcccacact cccaaggttg gagcccaggg catcttgctg gctacgcctc 2221 ttctgtccct gttagacgtc ctccgtccat atcagaactg tgccacaatg cagttctgag 2281 caccgtgtca agctgctctg agccacagtg ggatgaacca gccggggcct tatcgggctc 2341 cagcatctca tgaggggaga ggagacggag gggagtagag aagtttacac agaaatgctg 2401 ctggccaaat agcaaagag // LOCUS HUMCDC25B 2940 bp mRNA PRI 31-DEC-1994 DEFINITION Human cdc25B mRNA, complete cds. ACCESSION M81934 NID g180172 KEYWORDS B-type cyclin; mitotic cyclin; tyrosine phosphatase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2940) AUTHORS Galaktionov,K. and Beach,D. TITLE Specific activation of cdc25 tyrosine phosphatases by B-type cyclins: evidence for multiple roles of mitotic cyclins JOURNAL Cell 67 (6), 1181-1194 (1991) MEDLINE 92103683 FEATURES Location/Qualifiers source 1..2940 /organism="Homo sapiens" /db_xref="taxon:9606" gene 73..1773 /gene="cdc25B" CDS 73..1773 /gene="cdc25B" /note="putative" /codon_start=1 /db_xref="PID:g180173" /translation="MEVPQPEPAPGSALSPAGVCGGAQRPGHLPGLLLGSHGLLGSPV RAAASSPVTTLTQTMHDLAGLGSRSRLTHLSLSRRASESSLSSESSESSDAGLCMDSP SPMDPHMAEQTFEQAIQAASRIIRNEQFAIRRFQSMPVRLLGHSPVLRNITNSQAPDG RRKSEAGSGAASSSGEDKENDGFVFKMPWKPTHPSSTHALAEWASRREAFAQRPSSAP DLMCLSPDRKMEVEELSPLALGRFSLTPAEGDTEEDDGFVDILESDLKDDDAVPPGME SLISAPLVKTLEKEEEKDLVMYSKCQRLFRSPSMPCSVIRPILKRLERPQDRDTPVQN KRRRSVTPPEEQQEAEEPKARVLRSKSLCHDEIENLLDSDHRELIGDYSKAFLLQTVD GKHQDLKYISPETMVALLTGKFSNIVDKFVIVDCRYPYEYEGGHIKTAVNLPLERDAE SFLLKSPIAPCSLDKRVILIFHCEFSSERGPRMCRFIRERDRAVNDYPSLYYPEMYIL KGGYKEFFPQHPNFCEPQDYRPMNHEAFKDELKTFRLKTRSWAGERSRRELCSRLQDQ " BASE COUNT 606 a 874 c 825 g 635 t ORIGIN 1 gccagctgtg ccggcgtttg ttggctgccc tgcgcccggc cctccagcca gccttctgcc 61 ggccccgccg cgatggaggt gccccagccg gagcccgcgc caggctcggc tctcagtcca 121 gcaggcgtgt gcggtggcgc ccagcgtccg ggccacctcc cgggcctcct gctgggatct 181 catggcctcc tggggtcccc ggtgcgggcg gccgcttcct cgccggtcac caccctcacc 241 cagaccatgc acgacctcgc cgggctcggc agccgcagcc gcctgacgca cctatccctg 301 tctcgacggg catccgaatc ctccctgtcg tctgaatcct ccgaatcttc tgatgcaggt 361 ctctgcatgg attcccccag ccctatggac ccccacatgg cggagcagac gtttgaacag 421 gccatccagg cagccagccg gatcattcga aacgagcagt ttgccatcag acgcttccag 481 tctatgccgg tgaggctgct gggccacagc cccgtgcttc ggaacatcac caactcccag 541 gcgcccgacg gccggaggaa gagcgaggcg ggcagtggag ctgccagcag ctctggggaa 601 gacaaggaga atgatggatt tgtcttcaag atgccatgga agcccacaca tcccagctcc 661 acccatgctc tggcagagtg ggccagccgc agggaagcct ttgcccagag acccagctcg 721 gcccccgacc tgatgtgtct cagtcctgac cggaagatgg aagtggagga gctcagcccc 781 ctggccctag gtcgcttctc tctgacccct gcagaggggg atactgagga agatgatgga 841 tttgtggaca tcctagagag tgacttaaag gatgatgatg cagttccccc aggcatggag 901 agtctcatta gtgccccact ggtcaagacc ttggaaaagg aagaggaaaa ggacctcgtc 961 atgtacagca agtgccagcg gctcttccgc tctccgtcca tgccctgcag cgtgatccgg 1021 cccatcctca agaggctgga gcggccccag gacagggaca cgcccgtgca gaataagcgg 1081 aggcggagcg tgacccctcc tgaggagcag caggaggctg aggaacctaa agcccgcgtc 1141 ctccgctcaa aatcactgtg tcacgatgag atcgagaacc tcctggacag tgaccaccga 1201 gagctgattg gagattactc taaggccttc ctcctacaga cagtagacgg aaagcaccaa 1261 gacctcaagt acatctcacc agaaacgatg gtggccctat tgacgggcaa gttcagcaac 1321 atcgtggata agtttgtgat tgtagactgc agatacccct atgaatatga aggcgggcac 1381 atcaagactg cggtgaactt gcccctggaa cgcgacgccg agagcttcct actgaagagc 1441 cccatcgcgc cctgtagcct ggacaagaga gtcatcctca ttttccactg tgaattctca 1501 tctgagcgtg ggccccgcat gtgccgtttc atcagggaac gagaccgtgc tgtcaacgac 1561 taccccagcc tctactaccc tgagatgtat atcctgaaag gcggctacaa ggagttcttc 1621 cctcagcacc cgaacttctg tgaaccccag gactaccggc ccatgaacca cgaggccttc 1681 aaggatgagc taaagacctt ccgcctcaag actcgcagct gggctgggga gcggagccgg 1741 cgggagctct gtagccggct gcaggaccag tgaggggcct gcgccagtcc tgctacctcc 1801 cttgcctttc gaggcctgaa gccagctgcc ctatgggcct gccgggctga gggcctgctg 1861 gaggcctcag gtgctgtcca tgggaaagat ggtgtggtgt cctgcctgtc tgccccagcc 1921 cagattcccc tgtgtcatcc catcattttc catatcctgg tgccccccac ccctggaaga 1981 gcccagtctg ttgagttagt taagttgggt taataccagc ttaaaggcag tattttgtgt 2041 cctccaggag cttcttgttt ccttgttagg gttaaccctt catcttcctg tgtcctgaaa 2101 cgctcctttg tgtgtgtgtc agctgaggct ggggagagcc gtggtccctg aggatgggtc 2161 agagctaaac tccttcctgg cctgagagtc agctctctgc cctgtgtact tcccgggcca 2221 gggctgcccc taatctctgt aggaaccgtg gtatgtctgc catgttgccc ctttctcttt 2281 tcccctttcc tgtcccacca tacgagcacc tccagcctga acagaagctc ttactctttc 2341 ctatttcagt gttacctgtg tgcttggtct gtttgacttt acgcccatct caggacactt 2401 ccgtagactg tttaggttcc cctgtcaaat atcagttacc cactcggtcc cagttttgtt 2461 gccccagaaa gggatgttat tatccttggg ggctcccagg gcaagggtta aggcctgaat 2521 catgagcctg ctggaagccc agcccctact gctgtgaacc ctggggcctg actgctcaga 2581 acttgctgct gtcttgttgc ggatggatgg aaggttggat ggatgggtgg atggccgtgg 2641 atggccgtgg atgcgcagtg ccttgcatac ccaaaccagg tgggagcgtt ttgttgagca 2701 tgacacctgc agcaggaata tatgtgtgcc tatttgtgtg gacaaaaata tttacactta 2761 gggtttggag ctattcaaga ggaaatgtca cagaagcagc taaaccaagg actgagcacc 2821 ctctggattc tgaatctcaa gatgggggca gggctgtgct tgaaggccct gctgagtcat 2881 ctgttagggc cttggttcaa taaagcactg agcaagttga gaaaaaaaaa aaaaaaaaaa // LOCUS HUMCDC25C 3972 bp mRNA PRI 31-DEC-1994 DEFINITION Human (CDC25) mRNA, complete cds. ACCESSION L26584 NID g433719 KEYWORDS . SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3972) AUTHORS Wei,W. and Broek,D. TITLE Cloning and analysis of the full length human cdc25 cDNA, a ras-specific nucleotide exchange factor JOURNAL Unpublished (1993) FEATURES Location/Qualifiers source 1..3972 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..3828 /gene="CDC25" CDS 1..3828 /gene="CDC25" /codon_start=1 /db_xref="PID:g433720" /translation="MQKAIRLNDGHVAPLGLLARKDGTRKGYLSKRSSDNTKWQTKWF ALLQNLLFYFESDSSSRPSGLYLLEGCVCDRAPSPKPALSAKEPLEKQHYFTVNFSHE NQKALELRTEDAKDCDEWVAAIAHASYRTLATEHEALMQKYLHLLQIVETEKTVAKQL RQQIEDGEIEIERLKAEITSLLKDNERIQSTQTVAPNDEDSDIKKIKKVQSFLRGWLC RRKWKTIIQDYIRSPHADSMRKRNQVVFSMLEAEAEYVQQLHILVNNFLRPLRMAASS KKPPITHDDVSSIFLNSETIMFLHQIFYQGLKARISSWPTLVLADLLDILLPMLNIYQ EFVRNHQYSLQILAHCKQNRDFDKLLKHYEAKPDCEERTLETFLTYPMFQIPRYILTL HDVLAHTPHEHVERNSLDYAKSKLEELSRIMHDEVSETENIRKNLAIERMIIEGCEIL LDTSQTFVRQGSLIQVPMSEKGKITRGRLGSLSLEKEGERQCFLFSKHLIICTRGSGG KLHLTKNGVISLIDCTLLEEPESTEEEAKGSGQDIDHLDFKIGVEPKDSPPFTVILVA SSRQEKAAWTSDISQCVDNIRCNGLMMNAFEENSKVTVPQMIKRTREGTREAEMSRSD ASLYCDDVDIRFSKTMNSCKVLQIRYASVERLLERLTDLRFLSIDFLNTFLHSYRVFT TAIVVLDKLITIYKKPISAIPARWLRSLELLFASGQNNKLLYGEPPKSPRATRKFSSP PPLSITKTSSPSRRRKLISLNIPIITGGKALDLAGSLSCNSNGYTSMYSAMSPFSKAT LDTSKLYVSSSFTNKIPDEGDTTPEKPEDPSALSKQSSEVSMREESDIDQNQSDDGDT ETSPTKSPTTPKSVKNKNSSEFPLFSYNNGVVMTSCRELDNNRSALSAASAFAIATAG ANEGTPNKEKYRRMSLASAGFPPDQRNGDKEFVIRRAATNRVLNVLRHWVSKHSQDFE TNDELKCKVIGFLEEVMHDPELLTQERKAAANIIRTLTQEDPGDNQITLEEITQMAEG VKAEPFENHSALEIAEQLTLLDHLVFKKIPYEEFFGQGWMKLEKNERTPYIMKTTKHF NDISNLIASEIIRNEDINARVSAIEKWVAVADICRCLHNYNAVLEITSSMNRSAIFRL KKTWLKVSKQTKALIDKLQKLVSSEGRFKNLREALKNCDPPCVPYLGMYLTDLAFIEE GTPNYTEDGLVNFSKMRMISHIIREIRQFQQTAYKIEHQAKVTQYLLDQSFVMDEESL YESSLRIEPKLPT" polyA_site 3972 BASE COUNT 1052 a 1143 c 1008 g 769 t ORIGIN 1 atgcagaaag ccatccgact taatgatggc cacgtcgcgc ccctgggact gctggcgcgc 61 aaggacggca cgcgcaaagg ctacctgagc aagcggagtt cggacaacac aaaatggcaa 121 accaagtggt tcgcgctgct gcagaacctg ctcttctact tcgagagcga ctcgagctcg 181 cggccctcgg ggctttacct gctggagggc tgcgtctgcg accgcgcgcc ctcccccaag 241 ccggcgctgt cggccaagga gccgctggag aaacagcatt acttcacggt gaacttcagc 301 catgagaacc agaaagcctt ggagctgagg acagaggacg caaaagattg tgacgaatgg 361 gtggcagcca ttgcacatgc cagctacagg accctcgcca cagagcatga ggcattaatg 421 cagaaatacc tgcacctgct gcagatcgtg gagacagaga agaccgtggc caagcagctt 481 cggcagcaga tcgaggatgg ggagatcgag atcgagcggc tgaaggcaga gatcacatcc 541 ctgctcaagg acaatgagcg catccagtcc acccagactg tcgcccccaa cgatgaagac 601 agcgacatca agaaaattaa gaaggtgcag agcttcctgc ggggctggct gtgccggcgg 661 aagtggaaga ccatcatcca ggactacatc cggtcacccc atgctgacag catgcgcaag 721 aggaaccagg tggtgttcag catgctggag gctgaggctg agtacgtgca gcagctgcac 781 atccttgtca acaatttcct gcgcccgctg cggatggccg ccagctccaa gaagcctccc 841 atcacacacg acgacgtcag cagcatcttc ctgaacagcg aaaccatcat gtttttacat 901 cagatctttt accaaggcct gaaggcccgc atctccagct ggcccacgct ggtcctggct 961 gacctacttg acatcctgct gcccatgctc aacatctacc aagagttcgt ccgcaaccac 1021 cagtacagcc tgcagatcct ggcccactgc aagcagaacc gtgacttcga caagctgctg 1081 aagcactacg aggccaagcc tgactgcgag gagaggacgt tggagacctt cctcacctac 1141 cccatgttcc agatccccag gtacatcctg accctccatg acgtcctggc ccacacgcct 1201 catgagcacg ttgagcgcaa cagcctggac tacgccaagt ccaaactgga ggagctgtcc 1261 agaataatgc acgatgaagt aagtgagacg gagaacatcc ggaaaaacct ggccatcgag 1321 cgcatgatca tcgaaggctg tgagatcctc ctggacacca gccagacctt tgtgagacaa 1381 ggttccctca ttcaggtgcc catgtctgaa aagggcaaga tcaccagggg gcgcctgggg 1441 tctctctccc tagagaaaga gggcgagcga cagtgcttcc tgttttctaa gcatctgatt 1501 atctgtacca gaggctctgg agggaagctt cacttgacca agaatggagt catatccctc 1561 attgactgca ctttattgga ggagccagaa agcacggagg aggaagccaa aggatccggc 1621 caagacatag atcacttgga tttcaaaatc ggggtggagc caaaggattc cccgcccttt 1681 acagtcatcc tagtggcctc gtccagacag gagaaggcag cgtggaccag tgacatcagc 1741 cagtgtgtgg ataacatccg atgcaatggg ctcatgatga acgcatttga agaaaattcc 1801 aaggtcactg tgccgcagat gatcaagagg accagggagg ggaccaggga agcagaaatg 1861 agcaggtccg acgcctcctt atattgtgat gatgttgaca ttcgcttcag caaaaccatg 1921 aactcctgca aagtgctgca gatccgctac gccagtgtgg agcggctgct ggagaggctg 1981 acggacctgc gcttcctgag catcgacttc ctcaacacct tcctgcactc ctaccgcgtc 2041 ttcaccaccg ccatcgtggt cctggacaag ctcattacca tctacaagaa gcctatcagt 2101 gccattcctg ccaggtggct gaggtcgctg gagctcctgt ttgccagtgg ccagaacaat 2161 aagctcctgt acggtgaacc ccccaagtcc ccgcgcgcca cccgcaagtt ctcctcgccg 2221 ccacctctgt ccatcaccaa gacatcgtca ccgagccgcc ggcggaagct gatctccctg 2281 aacatcccca tcatcactgg cggcaaggcc ctggacctgg ccggatccct cagctgcaac 2341 tccaatggct acaccagcat gtactcggcc atgtcaccct tcagcaaggc cacgctggac 2401 accagcaagc tctatgtgtc cagcagcttc accaacaaga ttccagatga gggcgatacg 2461 acccctgaga agcccgaaga cccttcagcg ctcagcaagc agagctcaga agtctccatg 2521 agagaggagt cagatattga tcaaaaccag agtgatgatg gtgatactga aacatcacca 2581 actaaatctc caacaacacc caaatcagtc aaaaacaaaa attcttcaga gttcccactc 2641 ttttcctata acaatggagt cgtcatgacc tcctgtcgtg aactggacaa taaccgcagt 2701 gccttgtcgg ccgcctctgc ctttgccata gcaaccgccg gggccaacga gggcacccca 2761 aacaaggaga agtaccggag gatgtcctta gccagtgcag ggtttccccc agaccagagg 2821 aatggagaca aggagtttgt gatccgcaga gcagccacca atcgtgtctt gaacgtgctc 2881 cgccactggg tgtccaagca ctctcaggac tttgagacca acgatgagct caaatgcaag 2941 gtgatcggct tcctggaaga agtcatgcac gacccggagc tcctgaccca ggagcggaaa 3001 gctgcagcca acatcatcag gactctgacc caggaggacc caggtgacaa ccagatcacg 3061 ctggaggaga tcacgcagat ggctgaaggc gtgaaggctg agccctttga aaaccactca 3121 gccctggaga tcgcggagca gctgaccctg ctggatcacc tcgtcttcaa gaagattcct 3181 tatgaggagt tcttcggaca aggatggatg aaactggaaa agaatgaaag gaccccttat 3241 atcatgaaaa ccactaagca cttcaatgac atcagtaact tgattgcttc agaaatcatc 3301 cgcaatgagg acatcaacgc cagggtgagc gccatcgaga agtgggtggc cgtagctgac 3361 atatgccgct gcctccacaa ctacaatgcc gtactggaga tcacctcgtc catgaaccgc 3421 agtgcaatct tccggctcaa aaagacgtgg ctcaaagtct ctaagcagac taaagctttg 3481 attgataagc tccaaaagct tgtgtcatca gagggcagat ttaagaatct cagagaagct 3541 ctgaaaaatt gtgacccacc ctgtgtccct tacctgggga tgtacctcac cgacctggcc 3601 ttcatcgagg aggggacgcc caattacacg gaagacggcc tggtcaactt ctccaagatg 3661 aggatgatat cccatattat ccgagagatt cgccagtttc aacaaactgc ctacaaaata 3721 gagcaccaag caaaggtaac gcaatattta ctggaccaat cttttgtaat ggatgaagaa 3781 agcctctacg agtcttctct ccgaatagaa ccaaaactcc ccacctgaag ctgtgcccag 3841 acccagacca gctgctcccg gggacatgtg ctagatgata ctgtacatat tcgtttggtt 3901 tcactggatt ttacttcttc agtatgtgct tctccaagaa tacaaatcgt ccttgttctt 3961 agattcctgt ag // LOCUS HUMCDC25HS 2055 bp mRNA PRI 15-SEP-1990 DEFINITION Human cdc25Hs mRNA, complete cds. ACCESSION M34065 NID g180175 KEYWORDS mitotic inducer. SOURCE Human HeLa cell line D98/AH-2, cDNA to mRNA, clone BSK1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2055) AUTHORS Sadhu,K., Reed,B.I., Richardson,H. and Russell,P. TITLE Human homolog of fission yeast cdc25 mitotic inducer is predominantly expressed in G-2 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5139-5143 (1990) MEDLINE 90311358 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Russell, 08-MAY-1990, for release after publication. FEATURES Location/Qualifiers source 1..2055 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..2055 /note="cdc25Hs mRNA" CDS 211..1632 /note="CDC25Hs ORF" /codon_start=1 /db_xref="PID:g180176" /translation="MSTELFSSTREEGSSGSGPSFRSNQRKMLNLLLERDTSFTVCPD VPRTPVGKFLGDSANLSILSGGTPKCCLDLSNLSSGEITATQLTTSADLDETGHLDSS GLQEVHLAGMNHDQHLMKCSPAQLLCSTPNGLDRGHRKRDAMCSSSANKENDNGNLVD SEMKYLGSPITTVPKLDKNPNLGEDQAEEISDELMEFSLKDQEAKVSRSGLYRSPSMP ENLNRPRLKQVEKFKDNTIPDKVKKKYFSGQGKLRKGLCLKKTVSLCDITITQMLEED SNQGHLIGDFSKVCALPTVSGKHQDLKYVNPETVAALLSGKFQGLIEKFYVIDCRYPY EYLGGHIQGALNLYSQEELFNFFLKKPIVPLDTQKRIIIVFHCEFSSERGPRMCRCLR EEDRSLNQYPALYYPELYILKGGYRDFFPEYMELCEPQSYCPMHHQDHKTELLRCRSQ SKVQEGERQLREQIALLVKDMSP" BASE COUNT 576 a 472 c 477 g 530 t ORIGIN 1 caggaagact ctgagtccga cgttggccta cccagtcgga aggcagagct gcaatctagt 61 taactacctc ctttccccta gatttccttt cattctgctc aagtcttcgc ctgtgtccga 121 tccctatcta ctttctctcc tcttgtagca agcctcagac tccaggcttg agctaggttt 181 tgtttttctc ctggtgagaa ttcgaagacc atgtctacgg aactcttctc atccacaaga 241 gaggaaggaa gctctggctc aggacccagt tttaggtcta atcaaaggaa aatgttaaac 301 ctgctcctgg agagagacac ttcctttacc gtctgtccag atgtccctag aactccagtg 361 ggcaaatttc ttggtgattc tgcaaaccta agcattttgt ctggaggaac cccaaaatgt 421 tgcctcgatc tttcgaatct tagcagtggg gagataactg ccactcagct taccacttct 481 gcagaccttg atgaaactgg tcacctggat tcttcaggac ttcaggaagt gcatttagct 541 gggatgaatc atgaccagca cctaatgaaa tgtagcccag cacagcttct ttgtagcact 601 ccgaatggtt tggaccgtgg ccatagaaag agagatgcaa tgtgtagttc atctgcaaat 661 aaagaaaatg acaatggaaa cttggtggac agtgaaatga aatatttggg cagtcccatt 721 actactgttc caaaattgga taaaaatcca aacctaggag aagaccaggc agaagagatt 781 tcagatgaat taatggagtt ttccctgaaa gatcaagaag caaaggtgag cagaagtggc 841 ctatatcgct ccccgtcgat gccagagaac ttgaacaggc caagactgaa gcaggtggaa 901 aaattcaagg acaacacaat accagataaa gttaaaaaaa agtatttttc tggccaagga 961 aagctcagga agggcttatg tttaaagaag acagtctctc tgtgtgacat tactatcact 1021 cagatgctgg aggaagattc taaccagggg cacctgattg gtgatttttc caaggtatgt 1081 gcgctgccaa ccgtgtcagg gaaacaccaa gatctgaagt atgtcaaccc agaaacagtg 1141 gctgccttac tgtcggggaa gttccagggt ctgattgaga agttttatgt cattgattgt 1201 cgctatccat atgagtatct gggaggacac atccagggag ccttaaactt atatagtcag 1261 gaagaactgt ttaacttctt tctgaagaag cccatcgtcc ctttggacac ccagaagaga 1321 ataatcatcg tgttccactg tgaattctcc tcagagaggg gcccccgaat gtgccgctgt 1381 ctgcgtgaag aggacaggtc tctgaaccag tatcctgcat tgtactaccc agagctatat 1441 atccttaaag gcggctacag agacttcttt ccagaatata tggaactgtg tgaaccacag 1501 agctactgcc ctatgcatca tcaggaccac aagactgagt tgctgaggtg tcgaagccag 1561 agcaaagtgc aggaagggga gcggcagctg cgggagcaga ttgcccttct ggtgaaggac 1621 atgagcccat gataacattc cagccactgg ctgctaacaa gtcaccaaaa agacactgca 1681 gaaaccctga gcagaaagag gccttctgga tggccaaacc caagattatt aaaagatgtc 1741 tctgcaaacc aacaggctac caacttgtat ccaggcctgg gaatggatta ggtttcagca 1801 gagctgaaag ctggtggcag agtcctggag ctggctctat aaggcagcct tgagttgcat 1861 agagatttgt attggttcag ggaactctgg cattcctttt cccaactcct catgtcttct 1921 cacaagccag ccaactcttt ctctctgggc ttcgggctat gcaagagcgt tgtctacctt 1981 ctttctttgt attttccttc tttgtttccc cctctttctt ttttaaaaat ggaaaaataa 2041 acactacaga atgag // LOCUS HUMCDR34 2412 bp DNA PRI 01-NOV-1994 DEFINITION Human cerebellar-degeneration-related antigen (CDR34) gene, complete cds. ACCESSION M31423 NID g180188 KEYWORDS cerebellar-degeneration-related antigen. SOURCE Human neuroblastoma BE(2)-88n cell line DNA, clone lambda CDR34. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2412) AUTHORS Chen,Y.T., Rettig,W.J., Yenamandra,A.K., Kozak,C.A., Chaganti,R.S., Posner,J.B. and Old,L.J. TITLE Cerebellar degeneration-related antigen: a highly conserved neuroectodermal marker mapped to chromosomes X in human and mouse JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87 (8), 3077-3081 (1990) MEDLINE 90222173 COMMENT Draft entry and printed sequence for [1] kindly submitted by Y.-T.Chen, 17-JAN-1990. FEATURES Location/Qualifiers source 1..2412 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Xq27.1-q27.2" gene 503..1174 /gene="CDR1" CDS 503..1174 /gene="CDR1" /note="cerebellar-degeneration-related antigen (CDR34)" /codon_start=1 /db_xref="GDB:G00-119-053" /db_xref="PID:g180189" /translation="MAWLEDVDFLEDVPLLEDIPLLEDVPLLEDVPLLEDTSRLEDIN LMEDMALLEDVDLLEDTDFLEDLDFSEAMDLREDKDFLEDMDSLEDMALLEDVDLLED TDFLEDPDFLEAIDLREDKDFLEDMDSLEDLRPLEDVDFLEDMAFLEDVDFQEDPNYP EDLDCWEDVDFLEDWRLLEDMDFLEDMDFLEDVDLQEDIYWLEDLDFFRKMWIDWKTW IWWKT" BASE COUNT 743 a 334 c 669 g 666 t ORIGIN 1 atgttggttc ataagatctg gtctataagg aggaatgtcc cattaaatgt ttttgaagct 61 aattcaacta gaagcagaaa tagttgagtt ggaagatttt ctgtagagtg attttaacat 121 gggaaggctc agacagggga agcctagatt tgaaaaggcc tggacctggg gaaaggctgg 181 caagatctgg actatagaac atgttagaat actgatattc gcagacacct ggaagactga 241 atgtcagaag atcagcacac tggagacgtt ggaagacatg gatattgagc cagttgatgg 301 aagactgggt agttgttgga agacatcaag gtgctggaag acacagcagc atgctggaag 361 acctggagat gttggaagac gagcagactc ctggaagccc tggagatgct gcaagacctg 421 gagatatagg aagacactgg actttgttgc gagcttagtt ggaagacata tatttttgga 481 agacgtggat tttctggaag acatggcttg gttggaagac gtggattttc tggaagacgt 541 acctttgttg gaagacatac ctttgttgga agacgtacct ttgttggaag acgtaccttt 601 gttggaagac acaagtaggc tggaagacat taatttgatg gaagacatgg ctttgttgga 661 agacgtggat ttgctggaag acacggattt cctggaagac ctggattttt cggaagctat 721 ggatttgagg gaagacaagg attttctgga agacatggat agtctggaag acatggcttt 781 gttggaagac gtggacttgc tggaagacac ggatttcctg gaagacccgg attttttgga 841 agctatagat ttaagggaag acaaggattt tctggaagac atggatagtc tggaagacct 901 gaggccattg gaagatgtgg attttctgga agacatggct tttttggaag acgtagattt 961 tcaggaagac ccaaattatc cggaagactt ggattgttgg gaagacgtgg attttctgga 1021 agactggagg ttactggaag acatggattt tctggaagac atggattttc tggaagacgt 1081 ggatcttcag gaagacatat attggctgga agacctggat tttttccgga agatgtggat 1141 tgactggaag acctggattt ggtggaagac gtagattttc tggaagacac tgactgactg 1201 gaagacactg attgactgga agacctggat ttctttctgg aagacactga ttgactggaa 1261 gatctagatt tttctggaag aactagattt actggaagac ttggatttgg tggaagacac 1321 agatttttct ggaagacatg gattagctgg aagatctgta tttgatggaa gaccttgaaa 1381 ttattggaag acatggattt cctggaagac gtggattttc ctggaagatc tggatttggt 1441 ggaagaccag taattgctgg aagactggat ttgctggaag acttgattta ctggaagact 1501 tggagcttct tggaagacat ggattgtccg gaagacatgg attgtctgga agatgtggat 1561 tttctggaag ctcaggatta tctggaagac cttgagatta ttggaacact tgaagtcgct 1621 ggaagacccg agttgttgga agaccttgta cacaggtgcc atcggaactc ctgacattga 1681 aacattgtaa gcacaggata ttgagacatt gcaagccttg attttaagac atggtactct 1741 ggacattgat atttctgagg ccctgaacat tgggatatta atattggaag tcatagacac 1801 tgaaatctct ggaaattaga gatattgtaa gtcctgtacc ttggaactcc taaatactgg 1861 cagatataaa caacagcaga tgtagacatt tataaatcct aaaatgagaa gccctggata 1921 ttgggagaca ttggtaagca tggatacttg acatatttat gtcaaaaaga cagtttggaa 1981 gaattaaatt ttaaagatgc tccatgtcaa gaatactggc agcctggaca atatgagacc 2041 aggatattaa gaggtctatt cattcagaca ttgaggatat tgatgtacct gaaagttctt 2101 gcaggtattt aaagacttga gcattggagg aattggcgat aaaaatacac tgtaaaacta 2161 gaaagtagga gacatttaaa aatgtaaaaa ctgaatgatg taagtgctgg aagacattga 2221 agaatctaga agacctgtat ataggagaca ttggaggatt aggaccatgg ccgacttgta 2281 atttagaact ctggattctg aaagacaaga cctggacttt gaagaagggt tgttggagat 2341 attagaagac ctaaattttt aatgacttga atactgggag tttagaaaac aagggcattt 2401 gagatgctgc ag // LOCUS HUMCDRPCA 408 bp mRNA PRI 31-DEC-1994 DEFINITION Homo sapiens cardiac delayed rectifier potassium channel protein mRNA, complete cds. ACCESSION L28168 NID g452493 KEYWORDS cardiac delayed rectifier potassium channel protein. SOURCE Homo sapiens adult cardiac muscle cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 408) AUTHORS Folander,K., Williams,J.B., Strauss,H.C., Lazarides,E. and Swanson,R. TITLE The Human IsK Potassium Channel Gene: expression in fetal and adult heart, assignment to human chromosome 21q22, and an RFLP that distinguiishes two alleles JOURNAL Unpublished (1994) FEATURES Location/Qualifiers source 1..408 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="myocyte" /dev_stage="adult" /tissue_type="cardiac muscle" /map="21q22" gene 19..408 /gene="IsK" CDS 19..408 /gene="IsK" /note="putative" /codon_start=1 /function="cardiac delayed rectifier potassium channel" /product="cardiac delayed rectifier potassium channel protein" /db_xref="PID:g452494" /translation="MILSNTTAVTPFLTKLWQETVQQGGNMSGLARRSPRSGDGKLEA LYVLMVLGFFGFFTLGIMLSYIRSKKLEHSNDPFNVYIESDAWQEKDKAYVQARVLES YRSCYVVENHLAIEQPNTHLPETKPSP" BASE COUNT 94 a 128 c 108 g 78 t ORIGIN 1 ggaaccttaa tgcccaggat gatcctgtct aacaccacag cggtgacgcc ctttctgacc 61 aagctgtggc aggagacagt tcagcagggt ggcaacatgt cgggcctggc ccgcaggtcc 121 ccccgcagcg gtgacggcaa gctggaggcc ctctacgtcc tcatggtact gggattcttc 181 ggcttcttca ccctgggcat catgctgagc tacatccgct ccaagaagct ggagcactcg 241 aacgacccat tcaacgtcta catcgagtcc gatgcctggc aagagaagga caaggcctat 301 gtccaggccc gggtcctgga gagctacagg tcgtgctatg tcgttgaaaa ccatctggcc 361 atagaacaac ccaacacaca ccttcctgag acgaagcctt ccccatga // LOCUS HUMCEAPX 494 bp mRNA PRI 15-SEP-1990 DEFINITION Human cell adhesion protein (SQM1) mRNA, complete cds. ACCESSION M33374 NID g180232 KEYWORDS cell adhesion molecule. SOURCE Human squamous carcinoma cell line SCC25, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 494) AUTHORS Wong,Y.-C., Tsao,S.-W., Kakefuda,M. and Bernal,S.D. TITLE cDNA cloning of a novel cell adhesion protein expressed in human squamous carcinoma cells JOURNAL Biochem. Biophys. Res. Commun. 166, 984-992 (1990) MEDLINE 90147818 FEATURES Location/Qualifiers source 1..494 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..494 /note="SQM1 mRNA" CDS 36..443 /note="cell adhesion protein (SQM1)" /codon_start=1 /db_xref="PID:g180233" /translation="MGAHLVRRYLGDASVEPDPLQMPTFPPDYGFPERKEREMVATQQ EMMDASEAQLRDYCAHHLIRLLKCKRDSFPSCWPASRKRHDSGLLRTASYVMRMKEFE RDEGCSSGRSGGRRRRQICKGQGPGEVDPKVAL" BASE COUNT 107 a 154 c 163 g 70 t ORIGIN 1 ccctcggtgc tgcagggatc tgcaggactg cagccatggg ggcgcacctg gtccggcgct 61 acctgggcga tgcttcggtg gagcccgacc ccctgcagat gccaaccttc ccgccagact 121 acggcttccc cgaacgcaag gagcgcgaga tggtggccac acagcaggag atgatggacg 181 cgagtgaggc tcagctgcgg gactactgcg cccaccacct catccggctg ctcaagtgca 241 agcgtgacag cttcccaagt tgctggcctg caagcaggaa gcggcacgac tcgggactac 301 tgcgcaccgc aagctatgtg atgcgcatga aggagtttga gcgggacgag ggctgctcca 361 gcggaagaag cggcgggaga agaaggcggc aaatctgcaa aggccaggga cccggggaag 421 tggaccccaa ggtggccctg taggggtgca ccccccaccc tatggaccag tcaaataaaa 481 ccttcaggcc cctc // LOCUS HUMCELGROR 2244 bp mRNA PRI 09-JUN-1994 DEFINITION Human cellular growth-regulating protein mRNA, complete cds. ACCESSION L10844 NID g474898 KEYWORDS growth regulation protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2244) AUTHORS Moats-Staats,B.M., Jarvis,H.W., D'Ercole,A.J. and Stiles,A.D. TITLE Cloning and characterization of a novel RNA involved in cellular growth regulation JOURNAL Mol. Cell. Biol. 14 (5), 2936-2945 (1994) MEDLINE 94217691 FEATURES Location/Qualifiers source 1..2244 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="WI-38" /cell_type="fibroblast" /dev_stage="fetal" /tissue_type="lung" /tissue_lib="Stratagene" CDS 613..786 /codon_start=1 /product="growth-regulating protein" /db_xref="PID:g498177" /translation="MLSIDLQLSSICVPRMQLKTCYVEEIRGVVLEDRHLWNDSHPLK LGGWRPASLRSWG" polyA_site 2244 BASE COUNT 668 a 408 c 459 g 709 t ORIGIN 1 gttgtttaaa agcaaggcat gcttgtggat gactctgtaa cagactaatt ggaattgttg 61 aagctgctcc ctggttccac tctggagagt aatctgggac atcttagtgt tttgttttgt 121 ttttttccct cctctttttt tgggggggag tgtgtgtggg gtttgttttt tagtcttgtt 181 tttttaattc attaaccagt ggttagcctt aaggggagga ggacggattg attccacatt 241 ccacttccta gatctagttt agaaaacatg ttccccatct ggtgctctta ggaaggagta 301 tagtaaatgc ctcatttaat aacatactcc tttttgaaag ttgccttttc tctccaccct 361 tgagtagatc cagtatttga tgaaactcat gaaagtgggt ggagcccatc ttccccctcc 421 tcttttctag gacgcactat atgtgactgt gactttaagg acatttgttt gccatttgct 481 gatttttttg ggaagttaat ttctaacttc tttcactgat aaatgaagaa aagtattgca 541 cctttgaaat gcaccaaatg aattgagttt gtaattaaaa aaattttttt tccctttcag 601 tcattgtctt atatgcttag catagatttg cagctcagta gtatatgtgt tcctagaatg 661 cagctgaaga cctgttatgt agaggaaata cgaggggtgg tgctagaaga cagacatctg 721 tggaatgatt cacatcctct caagttagga ggatggaggc ctgcttcatt aagaagctgg 781 gggtagggtg ggggtgggga gaacacttaa caacatgggg accagtcagg ggaatcccct 841 tatttctgtt ttgcatatga ggaaccctag agcagccagg tgaggctctc tagtttaata 901 aaaatcatgg aaagactctt aatgcagact cttcttaagt gttaataggg attttttcag 961 cttattttgg ttgcagtttc caatttttaa aaatgttgag gtaatctttc ccaccttccc 1021 aaacctaatt cttgtagatg cattagtgtt gaaccaatgc ttctcatgtc tcaatcttgt 1081 atatcatctt ttcagatgta ttaacaaaca aaaccttaaa aagagtagat gaattgccaa 1141 acacaattcc taccaataat aaatcgatca actctatcta ttcaggaaag caggaagcat 1201 ttggaccaca gtgcatgaaa acttcaacat tctgttatta gataatgaat caaccaaatg 1261 aacaatccag agaaaagaaa attgcaataa taaaaggtaa attaacagaa agataatata 1321 agcaagatag taatagttga ccattctgaa aagcttataa catcactcat catccagcat 1381 cctttctgaa aacaaaggat ttttaaatca ctttatgcac atatacaaca taggaggttg 1441 gcaaaataat gcactatttc ttaacagcca tgtctcttgt agaacttcaa gttaatctac 1501 aaatgaccat tgtgtcttaa tttagattat gaataccaca ttagtcaggt atttgcacta 1561 acccttaata gtatatacag tttctatgga aaattcagtg gtccaaaaat ttccgtagaa 1621 tttgagagga cgttggtggg ctgaagatag ctccttgagg gtcactgatg taggctgcaa 1681 tgggggttca caaggccctg acaccgtatt tatagtctaa cctttttatg aaaatctgac 1741 tacagctatt taaggagtag tcttaatagc tgaaaatgaa gatagagaaa gacaccaaga 1801 atatgacaca gtttacattc tagtgaggga cacaacaaaa tcaaatttaa aaaagagtgt 1861 aatagatgct gataaatact gtagataaag cacataagaa aatagaaata aaggctgtca 1921 atggagaagt catgattttt attttattta tttatttatt tatttgagac agagtcaggc 1981 tctgtgcagg ctggagtgca atggtgtgat ctcgctcact acaacctctg ctcctggctc 2041 aagctatcct cccacctcag ctctcaagta gctgggatca caggtgcgtg ctaccatgcc 2101 cggctaattt tttgtagaga tgaggttttg ccatgttgcc caggctggtc tcgaactcct 2161 ggactcaact gaccccacct cggcctctca aagtgctgag attataggcg tgcagccggc 2221 agctggccat tgtttatgtt ctgc // LOCUS HUMCENPRO 3132 bp mRNA PRI 01-NOV-1994 DEFINITION H.sapiens centromere autoantigen C (CENPC) mRNA, complete cds. ACCESSION M95724 NID g180246 KEYWORDS centromere; centromere autoantigen C; inner kinetochore plate; scleroderma autoantigen. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3132) AUTHORS Saitoh,H., Tomkiel,J., Cooke,C.A., Ratrie,H. III., Maurer,M., Rothfield,N.F. and Earnshaw,W.C. TITLE CENP-C, an autoantigen in scleroderma, is a component of the human inner kinetochore plate JOURNAL Cell 70 (1), 115-125 (1992) MEDLINE 92323541 FEATURES Location/Qualifiers source 1..3132 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Unassigned" gene 157..2988 /gene="CENPC" CDS 157..2988 /gene="CENPC" /codon_start=1 /db_xref="GDB:G00-118-769" /evidence=experimental /product="centromere autoantigen C" /db_xref="PID:g180247" /translation="MAASGLDHLKNGYRRRFCRPSRARDINTEQGQNVLEILQDCFEE KSLANDFSTNSTKSVPNSTRKIKDTCIQSPSKECQKSHPKSVPVSSKKKEASLQFVVE PSEATNRSVQAHEVHQKILATDVSSKNTPDSKKISSRNINDHHSEADEEFYLSVGSPS VLLDAKTSVSQNVIPSSAKKRETYTFENSVNMLPSSTEVSVKTKKRLNFDDKVMLKKI EIDNKVSDEEDKTSEGQERKPSGSSQNRIRDSEYEIQRQAKKSFSTLFLETVKRKSES SPIVRHAATAPPHSCPPDDTKLIEDEFIIDESDQSFASRSWITIPRKAGSLKQRTISP AESTALFQGRKSREKHHNILPKTLANDKHSHKPHPVETSQPSDKTVLDTSYALIDETV NNYRSTKYEMYSKNAEKPSRSKRTIKQKQRRKFMAKPAEEQLDVGQSKDENIHTSHIT QDEFQRNSDRNMEEHEEMGNDCVSKKQMPPVGSKKSSTRKDKEESKKKRFSSESKNKL VPEEVTSTVTKSRRISRRPSDWWVVKSEESPVYSNSSVRNELPMHHNSSRKSTKKTNQ SSKNIRKKTIPLKRQKTATKGNQRVQKFLNAEGSGGIVGHDEISRCSLSEPLESDEAD LAKKKNLDCSRSTRSSKNEDNIMTAQNVPLKPQTSGYTCNIPTESNLDSGEHKTSVLE ESGPSRLNNNYLMSGKNDVDDEEVHGSSDDSKQSKVIPKNRIHHKLVLPSNTPNVRRT KRTRLKPLEYWRGERIDYQGRPSGGFVISGVLSPDTISSKRKAKENIGKVNKKSNKKR ICLDNDERKTNLMVNLGIPLGDPLQPTRVKDPETREIILMDLVRPQDTYQFFVKHGEL KVYKTLDTPFFSTGKLILGPQEEKGKQHVGQDILVFYVNFGDLLCTLHETPYILSTGD SFYVPSGNYYNIKNLRNEESVLLFTQIKR" BASE COUNT 1164 a 542 c 630 g 796 t ORIGIN 1 cggatcgcag ctctcgcggc agtcgcctga gacttaaggt tattgcttgg ccgcggcctg 61 gtattccggc gattcgtttc ttgctcggct tcctggagct gtggtccgtg tgggcttcca 121 cctcagacag ttgcgctggc tcagcggggc cggaacatgg ctgcgtccgg tctggatcat 181 ctcaaaaatg gctacagaag aagattttgt cgaccttcca gggcacgtga cattaacaca 241 gagcaaggcc agaatgttct ggaaatctta caagactgtt ttgaagaaaa aagtcttgcc 301 aatgatttta gtacaaattc tacaaaatca gtgcctaatt caacacgcaa aataaaagac 361 acttgtattc agtcaccaag caaagagtgc cagaaatcac atccaaagtc agttccagtt 421 tcttcaaaga agaaagaagc ctctctacag tttgttgtag aaccaagtga agccacaaac 481 agatcagttc aggcccatga agttcatcag aaaattctgg caactgatgt tagttccaaa 541 aatacacctg actcgaaaaa aatatcaagt agaaacataa atgatcatca cagtgaagct 601 gatgaagaat tttacttatc cgttggctca ccttctgttc ttttggatgc aaaaacatct 661 gtatcacaaa atgttattcc atctagtgcc aaaaagagag agacttacac ttttgaaaat 721 tcagtaaata tgctgccttc aagtacagag gtttcagtta aaaccaaaaa aaggttaaac 781 tttgatgata aagttatgtt aaagaaaata gaaatagata ataaagtatc agatgaagag 841 gataaaacat cggaaggaca agaaagaaaa ccatcaggat catctcagaa tagaatacga 901 gattcagaat atgaaattca acgacaagct aaaaaaagtt tttcaacatt gtttttagaa 961 acagtaaaac gaaaaagtga atccagtccc attgttaggc atgcggcaac tgctccacct 1021 cattcgtgtc ctcccgatga tacgaagttg atagaggatg aatttataat tgatgagtcg 1081 gatcaaagtt ttgccagtag atcttggatt acaataccaa gaaaggcagg gtctctgaaa 1141 caacgcacaa tatccccggc tgagagcact gcactctttc aaggtagaaa gtcaagagaa 1201 aagcatcata atatattacc taagactttg gcaaatgaca aacattccca taaacctcac 1261 ccagtagaga catctcagcc ctctgataaa acagtactgg atacaagtta tgctttgata 1321 gatgaaacag taaataatta tagatctaca aaatatgaaa tgtattccaa gaatgcagaa 1381 aaaccatcta gaagcaaaag gactataaaa caaaaacaga gaagaaaatt catggctaaa 1441 ccagctgaag aacagcttga tgtgggacag tctaaagatg aaaacataca tacatcacat 1501 attacccaag acgaatttca aagaaattca gacagaaata tggaagagca tgaagagatg 1561 ggaaatgatt gtgtttccaa aaaacagatg ccacctgtgg gaagcaagaa aagtagcact 1621 agaaaagata aggaagaatc taaaaagaag cgcttttcca gtgagtccaa gaacaaactt 1681 gtacctgaag aagtgacttc aactgtcacg aaaagtcgaa gaatttccag gcgtccatct 1741 gattggtggg tggtaaaatc agaggagagt cctgtttata gcaattcttc agtaagaaat 1801 gaattaccaa tgcatcacaa tagtagccga aaatctacta agaaaacaaa tcagtcatct 1861 aagaatatta ggaaaaaaac tattccactt aaaaggcaga agacagcaac taaaggcaac 1921 caaagagtac agaagttttt aaatgctgaa ggttctggag gtatcgttgg tcatgatgaa 1981 atttccagat gttcactgag tgagccattg gaaagtgatg aggcagactt ggctaagaag 2041 aaaaatcttg attgttctag atctacaaga agctcaaaga atgaagataa cattatgact 2101 gcacagaatg ttcccctaaa gcctcagacc agtggatata catgtaatat accaacagag 2161 tcaaacttgg attctggaga gcataagact tcagttttag aggaaagtgg accttccagg 2221 ctcaataata attatttaat gtctggaaag aatgatgtgg atgatgagga agttcatgga 2281 agttcagatg actcaaaaca atctaaagtg ataccaaaga acagaatcca tcacaaacta 2341 gtattgccct ccaacacacc aaatgttcgc aggaccaaga gaacacgttt gaaacctttg 2401 gagtactggc gaggagagcg aatagattat caaggaaggc catcaggagg attcgtgatt 2461 agtggagtac tatctccaga cacaatatcg tctaaaagga aggcaaaaga aaatattgga 2521 aaagtcaaca aaaaatctaa taagaaaagg atctgtcttg ataacgatga aagaaagact 2581 aacttaatgg taaatctagg tatacctctt ggagatcctt tgcagccaac gagggtaaag 2641 gacccagaaa caagagagat tattctcatg gatcttgtaa ggccacaaga tacatatcaa 2701 ttttttgtta agcatggtga gttgaaggta tacaagacat tggatacacc ctttttttct 2761 actgggaaat tgatattagg accacaagaa gaaaagggaa agcagcatgt tggccaggat 2821 atattggttt tttatgttaa ctttggtgac cttttgtgta ctttacatga aacaccttat 2881 atattaagta ctggggattc gttctatgtt ccttcaggta actattataa catcaaaaat 2941 ctccggaatg aggaaagtgt tcttcttttt actcagataa aaagatgaaa gatcaaccaa 3001 ccttaaatat atgtatgtat atatgtatat gtaaaaacag tttgtatagt tggaatattt 3061 gtctttgtaa ttacttgtga tgttttaaaa taaaaatttt attcagtttt gtgtaaaaaa 3121 aaaaaaaaaa aa // LOCUS HUMCERA 1524 bp mRNA PRI 27-FEB-1991 DEFINITION Human precerebellin and cerebellin mRNA, complete cds. ACCESSION M58583 NID g180250 KEYWORDS cerebellin; precerebellin. SOURCE Human cerebellum, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1524) AUTHORS Urade,Y., Oberdick,J., Molinar-Rode,R. and Morgan,J.I. TITLE Precerebellin is a cerebellum-specific protein with similarity to the globular domain of complement C1q B chain JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88, 1069-1073 (1991) MEDLINE 91126057 FEATURES Location/Qualifiers source 1..1524 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="cerebellum" CDS 315..896 /codon_start=1 /product="precerebellin" /db_xref="PID:g180251" /translation="MLGVLELLLLGAAWLAGPARGQNETEPIVLEGKCLVVCDSNPTS DPTGTALGISVRSGSAKVAFSAIRSTNHEPSEMSNRTMIIYFDQVLVNIGNNFDSERS TFIAPRKGIYSFNFHVVKVYNRQTIQVSLMLNGWPVISAFAGDQDVTREAASNGVLIQ MEKGDRAYLKLERGNLMGGWKYSTFSGFLVFPL" mat_peptide 483..530 /product="cerebellin" BASE COUNT 344 a 379 c 472 g 329 t ORIGIN 1 cgcggaaccg cgggtgcggg caggaggcgg cggcagcagc gggaccgagc agcagcggct 61 atgcatccaa gtgcggctgg gcagccgcgg cacccctgag gcccgggagg ggctccggga 121 acacagcgcg gaggggacgc tagtcgcgga ggggacgcta gtcccggagt gcgaggaaga 181 ggtgtagtgt ccagagcggc ggcggcggac gtgcgcacgg agctgaggag ggggcttcgg 241 agcgggactg ggacgggggg ggcggctggc gcgagcagcc ctgggggtgg ggggcggggt 301 gggccggcgg cgcgatgctg ggcgtcctgg agctgctgct gctgggggct gcgtggctgg 361 cgggcccggc ccgcgggcag aatgagacgg agcccatcgt gctggagggc aagtgcctgg 421 tggtgtgcga ctccaacccc acgtccgacc ccacgggcac tgccctgggc atctctgtgc 481 gctctggcag cgccaaggtg gctttctctg ccatcaggag caccaaccac gagccgtccg 541 agatgagtaa tcgcaccatg atcatctact tcgaccaggt actagtgaac attgggaaca 601 actttgattc agaacgcagc actttcatcg ccccgcgcaa agggatctac agttttaact 661 tccacgtggt aaaagtctac aacagacaaa ccatacaggt gagcctcatg ctaaacgggt 721 ggccggtgat ttcagccttc gctggtgacc aggacgtgac ccgggaggcc gccagcaacg 781 gagtcctaat ccaaatggag aaaggcgacc gagcatacct caagctggag cggggaaact 841 tgatgggggg ctggaagtac tcgaccttct ccggattcct ggtgtttcct ctctgactgg 901 ctcgtagccg gaagggaggc agggagaggg cgaaggcagg aaggggagtg agaaagaggc 961 tgaaattaag agggcgagaa agcagcacga cttgaaactt cctacatgtt ctctaactgt 1021 atctgggtaa aaaggtgcgc gccagctgtg ggacaacttt gtccatttcc ttattaggag 1081 aaataaattt cgcttagctc tgcgcactcc catttccaaa aataaactcg cctcccccat 1141 tccagttgca gtaatcaaga aagagactgc cttgtcattg tttcttatcc cccaacttca 1201 tgttccctgc aatttattta aagaaacttt gtatttcact acataatctg aaatctttct 1261 ccctagcccc ctctggaatc cttctgccta ctgaaatctg atatattaca caccccccaa 1321 cctttttttt tcgagtttgg aaaggggtaa agttttgttg ttgttgttgt tttgttttgg 1381 atggggaagg tagttttaat ttggcaagtg ttgctcttct taaaacactg ctcaaattaa 1441 ttagctgaag atacttagta tcctcggctg cttgttagca gagaaaggac tcaccttagt 1501 ggtgactggt ttaaaaaaaa aaaa // LOCUS HUMCERP 3321 bp mRNA PRI 01-NOV-1994 DEFINITION Human ceruloplasmin (ferroxidase) mRNA, complete cds. ACCESSION M13699 NID g180255 KEYWORDS amine oxidase; ceruloplasmin; copper-binding protein; ferroxidase; superoxide dismutase. SOURCE Human liver, cDNA to mRNA, clones lambda-hCP1 and phCP1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3321) AUTHORS Koschinsky,M.L., Funk,W.D., van Oost,B.A. and MacGillivray,R.T. TITLE Complete cDNA sequence of human preceruloplasmin JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83 (14), 5086-5090 (1986) MEDLINE 86259737 COMMENT Draft entry and sequence in computer readable form for [1] kindly provided by R.T.A.MacGillivray, 06-NOV-1986. FEATURES Location/Qualifiers source 1..3321 /organism="Homo sapiens" /db_xref="taxon:9606" /map="3q23-q25" mRNA <1..3321 /note="ceruloplasmin mRNA" sig_peptide 1..57 /gene="CP" /note="ceruloplasmin signal peptide" gene 1..3198 /gene="CP" CDS 1..3198 /gene="CP" /note="preceruloplasmin (EC 1.16.3.1)" /codon_start=1 /db_xref="GDB:G00-119-069" /db_xref="PID:g180256" /translation="MKILILGIFLFLCSTPAWAKEKHYYIGIIETTWDYASDHGEKKL ISVDTEHSNIYLQNGPDRIGRLYKKALYLQYTDETFRTTIEKPVWLGFLGPIIKAETG DKVYVHLKNLASRPYTFHSHGITYYKEHEGAIYPDNTTDFQRADDKVYPGEQYTYMLL ATEEQSPGEGDGNCVTRIYHSHIDAPKDIASGLIGPLIICKKDSLDKEKEKHIDREFV VMFSVVDENFSWYLEDNIKTYCSEPEKVDKDNEDFQESNRMYSVNGYTFGSLPGLSMC AEDRVKWYLFGMGNEVDVHAAFFHGQALTNKNYRIDTINLFPATLFDAYMVAQNPGEW MLSCQNLNHLKAGLQAFFQVQECNKSSSKDNIRGKHVRHYYIAAEEIIWNYAPSGIDI FTKENLTAPGSDSAVFFEQGTTRIGGSYKKLVYREYTDASFTNRKERGPEEEHLGILG PVIWAEVGDTIRVTFHNKGAYPLSIEPIGVRFNKNNEGTYYSPNYNPQSRSVPPSASH VAPTETFTYEWTVPKEVGPTNADPVCLAKMYYSAVDPTKDIFTGLIGPMKICKKGSLH ANGRQKDVDKEFYLFPTVFDENESLLLEDNIRMFTTAPDQVDKEDEDFQESNKMHSMN GFMYGNQPGLTMCKGDSVVWYLFSAGNEADVHGIYFSGNTYLWRGERRDTANLFPQTS LTLHMWPDTEGTFNVECLTTDHYTGGMKQKYTVNQCRRQSEDSTFYLGERTYYIAAVE VEWDYSPQREWEKELHHLQEQNVSNAFLDKGEFYIGSKYKKVVYRQYTDSTFRVPVER KAEEEHLGILGPQLHADVGDKVKIIFKNMATRPYSIHAHGVQTESSTVTPTLPGETLT YVWKIPERSGAGTEDSACIPWAYYSTVDQVKDLYSGLIGPLIVCRRPYLKVFNPRRKL EFALLFLVFDENESWYLDDNIKTYSDHPEKVNKDDEEFIESNKMHAINGRMFGNLQGL TMHVGDEVNWYLMGMGNEIDLHTVHFHGHSFQYKHRGVYSSDVFDIFPGTYQTLEMFP RTPGIWLLHCHVTDHIHAGMETTYTVLQNEDTKSG" mat_peptide 58..3195 /gene="CP" /note="ceruloplasmin" BASE COUNT 1088 a 651 c 729 g 853 t ORIGIN 612 bp upstream of XbaI site; chromosome 3q31. 1 atgaagattt tgatacttgg tatttttctg tttttatgta gtaccccagc ctgggcgaaa 61 gaaaagcatt attacattgg aattattgaa acgacttggg attatgcctc tgaccatggg 121 gaaaagaaac ttatttctgt tgacacggaa cattccaata tctatcttca aaatggccca 181 gatagaattg ggagactata taagaaggcc ctttatcttc agtacacaga tgaaaccttt 241 aggacaacta tagaaaaacc ggtctggctt gggtttttag gccctattat caaagctgaa 301 actggagata aagtttatgt acacttaaaa aaccttgcct ctaggcccta cacctttcat 361 tcacatggaa taacttacta taaggaacat gagggggcca tctaccctga taacaccaca 421 gattttcaaa gagcagatga caaagtatat ccaggagagc agtatacata catgttgctt 481 gccactgaag aacaaagtcc tggggaagga gatggcaatt gtgtgactag gatttaccat 541 tcccacattg atgctccaaa agatattgcc tcaggactca tcggaccttt aataatctgt 601 aaaaaagatt ctctagataa agaaaaagaa aaacatattg accgagaatt tgtggtgatg 661 ttttctgtgg tggatgaaaa tttcagctgg tacctagaag acaacattaa aacctactgc 721 tcagaaccag agaaagttga caaagacaac gaagacttcc aggagagtaa cagaatgtat 781 tctgtgaatg gatacacttt tggaagtctc ccaggactct ccatgtgtgc tgaagacaga 841 gtaaaatggt acctttttgg tatgggtaat gaagttgatg tgcacgcagc tttctttcac 901 gggcaagcac tgactaacaa gaactaccgt attgacacaa tcaacctctt tcctgctacc 961 ctgtttgatg cttatatggt ggcccagaac cctggagaat ggatgctcag ctgtcagaat 1021 ctaaaccatc tgaaagccgg tttgcaagcc tttttccagg tccaggagtg taacaagtct 1081 tcatcaaagg ataatatccg tgggaagcat gttagacact actacattgc cgctgaggaa 1141 atcatctgga actatgctcc ctctggtata gacatcttca ctaaagaaaa cttaacagca 1201 cctggaagtg actcagcggt gttttttgaa caaggtacca caagaattgg aggctcttat 1261 aaaaagctgg tttatcgtga gtacacagat gcctccttca caaatcgaaa ggagagaggc 1321 cctgaagaag agcatcttgg catcctgggt cctgtcattt gggcagaggt gggagacacc 1381 atcagagtaa ccttccataa caaaggagca tatcccctca gtattgagcc gattggggtg 1441 agattcaata agaacaacga gggcacatac tattccccaa attacaaccc ccagagcaga 1501 agtgtgcctc cttcagcctc ccatgtggca cccacagaaa cattcaccta tgaatggact 1561 gtccccaaag aagtaggacc cactaatgca gatcctgtgt gtctagctaa gatgtattat 1621 tctgctgtgg atcccactaa agatatattc actgggctta ttgggccaat gaaaatatgc 1681 aagaaaggaa gtttacatgc aaatgggaga cagaaagatg tagacaagga attctatttg 1741 tttcctacag tatttgatga gaatgagagt ttactcctgg aagataatat tagaatgttt 1801 acaactgcac ctgatcaggt ggataaggaa gatgaagact ttcaggaatc taataaaatg 1861 cactccatga atggattcat gtatgggaat cagccgggtc tcactatgtg caaaggagat 1921 tcggtcgtgt ggtacttatt cagcgccgga aatgaggccg atgtacatgg aatatacttt 1981 tcaggaaaca catatctgtg gagaggagaa cggagagaca cagcaaacct cttccctcaa 2041 acaagtctta cgctccacat gtggcctgac acagagggga cttttaatgt tgaatgcctt 2101 acaactgatc attacacagg cggcatgaag caaaaatata ctgtgaacca atgcaggcgg 2161 cagtctgagg attccacctt ctacctggga gagaggacat actatatcgc agcagtggag 2221 gtggaatggg attattcccc acaaagggag tgggaaaagg agctgcatca tttacaagag 2281 cagaatgttt caaatgcatt tttagataag ggagagtttt acataggctc aaagtacaag 2341 aaagttgtgt atcggcagta tactgatagc acattccgtg ttccagtgga gagaaaagct 2401 gaagaagaac atctgggaat tctaggtcca caacttcatg cagatgttgg agacaaagtc 2461 aaaattatct ttaaaaacat ggccacaagg ccctactcaa tacatgccca tggggtacaa 2521 acagagagtt ctacagttac tccaacatta ccaggtgaaa ctctcactta cgtatggaaa 2581 atcccagaaa gatctggagc tggaacagag gattctgctt gtattccatg ggcttattat 2641 tcaactgtgg atcaagttaa ggacctctac agtggattaa ttggccccct gattgtttgt 2701 cgaagacctt acttgaaagt attcaatccc agaaggaagc tggaatttgc ccttctgttt 2761 ctagtttttg atgagaatga atcttggtac ttagatgaca acatcaaaac atactctgat 2821 caccccgaga aagtaaacaa agatgatgag gaattcatag aaagcaataa aatgcatgct 2881 attaatggaa gaatgtttgg aaacctacaa ggcctcacaa tgcacgtggg agatgaagtc 2941 aactggtatc tgatgggaat gggcaatgaa atagacttac acactgtaca ttttcacggc 3001 catagcttcc aatacaagca caggggagtt tatagttctg atgtctttga cattttccct 3061 ggaacatacc aaaccctaga aatgtttcca agaacacctg gaatttggtt actccactgc 3121 catgtgaccg accacattca tgctggaatg gaaaccactt acaccgttct acaaaatgaa 3181 gacaccaaat ctggctgaat gaaataaatt ggtgataagt ggaaaaaaga gaaaaaccaa 3241 tgattcataa caatgtatgt gaaagtgtaa aatagaatgt tactttggaa tgactataaa 3301 cattaaaaga gactggagca t // LOCUS HUMCETP 1787 bp mRNA PRI 01-NOV-1994 DEFINITION Human cholesteryl ester transfer protein mRNA, complete cds. ACCESSION M30185 NID g180259 KEYWORDS cholesteryl ester transfer protein; transfer protein. SOURCE Human adult liver, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1787) AUTHORS Drayna,D., Jarnagin,A.S., McLean,J., Henzel,W., Kohr,W., Fielding,C. and Lawn,R. TITLE Cloning and sequencing of human cholesteryl ester transfer protein cDNA JOURNAL Nature 327 (6123), 632-634 (1987) MEDLINE 87258172 FEATURES Location/Qualifiers source 1..1787 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="liver" mRNA <1..1787 /note="CETP mRNA" sig_peptide 131..181 /gene="CETP" /note="cholesteryl ester transfer protein signal peptide" gene 131..1612 /gene="CETP" CDS 131..1612 /gene="CETP" /note="cholesteryl ester transfer protein precursor" /codon_start=1 /db_xref="GDB:G00-119-773" /db_xref="PID:g180260" /translation="MLAATVLTLALLGNAHACSKGTSHEAGIVCRITKPALLVLNHET AKVIQTAFQRASYPDITGEKAMMLLGQVKYGLHNIQISHLSIASSQVELVEAKSIDVS IQNVSVVFKGTLKYGYTTAWWLGIDQSIDFEIDSAIDLQINTQLTCDSGRVRTDAPDC YLSFHKLLLHLQGEREPGWIKQLFTNFISFTLKLVLKGQICKEINVISNIMADFVQTR AASILSDGDIGVDISLTGDPVITASYLESHHKGHFIYKNVSEDLPLPTFSPTLLGDSR MLYFWFSERVFHSLAKVAFQDGRLMLSLMGDEFKAVLETWGFNTNQEIFQEVVGGFPS QAQVTVHCLKMPKISCQNKGVVVNSSVMVKFLFPRPDQQHSVAYTFEEDIVTTVQASY SKKKLFLSLLDFQITPKTVSNLTESSSESIQSFLQSMITAVGIPEVMSRLEVVFTALM NSKGVSLFDIINPEIITRDGFLLLQMDFGFPEHLLVDFLQSLS" mat_peptide 182..1609 /gene="CETP" /note="cholesteryl ester transfer protein" BASE COUNT 397 a 531 c 456 g 403 t ORIGIN 1 gtgaatctct ggggccagga agaccctgct gcccggaaga gcctcatgtt ccgtgggggc 61 tgggcggaca tacatatacg ggctccaggc tgaacggctc gggccactta cacaccactg 121 cctgataacc atgctggctg ccacagtcct gaccctggcc ctgctgggca atgcccatgc 181 ctgctccaaa ggcacctcgc acgaggcagg catcgtgtgc cgcatcacca agcctgccct 241 cctggtgttg aaccacgaga ctgccaaggt gatccagacc gccttccagc gagccagcta 301 cccagatatc acgggcgaga aggccatgat gctccttggc caagtcaagt atgggttgca 361 caacatccag atcagccact tgtccatcgc cagcagccag gtggagctgg tggaagccaa 421 gtccattgat gtctccattc agaacgtgtc tgtggtcttc aaggggaccc tgaagtatgg 481 ctacaccact gcctggtggc tgggtattga tcagtccatt gacttcgaga tcgactctgc 541 cattgacctc cagatcaaca cacagctgac ctgtgactct ggtagagtgc ggaccgatgc 601 ccctgactgc tacctgtctt tccataagct gctcctgcat ctccaagggg agcgagagcc 661 tgggtggatc aagcagctgt tcacaaattt catctccttc accctgaagc tggtcctgaa 721 gggacagatc tgcaaagaga tcaacgtcat ctctaacatc atggccgatt ttgtccagac 781 aagggctgcc agcatccttt cagatggaga cattggggtg gacatttccc tgacaggtga 841 tcccgtcatc acagcctcct acctggagtc ccatcacaag ggtcatttca tctacaagaa 901 tgtctcagag gacctccccc tccccacctt ctcgcccaca ctgctggggg actcccgcat 961 gctgtacttc tggttctctg agcgagtctt ccactcgctg gccaaggtag ctttccagga 1021 tggccgcctc atgctcagcc tgatgggaga cgagttcaag gcagtgctgg agacctgggg 1081 cttcaacacc aaccaggaaa tcttccaaga ggttgtcggc ggcttcccca gccaggccca 1141 agtcaccgtc cactgcctca agatgcccaa gatctcctgc caaaacaagg gagtcgtggt 1201 caattcttca gtgatggtga aattcctctt tccacgccca gaccagcaac attctgtagc 1261 ttacacattt gaagaggata tcgtgactac cgtccaggcc tcctattcta agaaaaagct 1321 cttcttaagc ctcttggatt tccagattac accaaagact gtttccaact tgactgagag 1381 cagctccgag tccatccaga gcttcctgca gtcaatgatc accgctgtgg gcatccctga 1441 ggtcatgtct cggctcgagg tagtgtttac agccctcatg aacagcaaag gcgtgagcct 1501 cttcgacatc atcaaccctg agattatcac tcgagatggc ttcctgctgc tgcagatgga 1561 ctttggcttc cctgagcacc tgctggtgga tttcctccag agcttgagct agaagtctcc 1621 aaggaggtcg ggatggggct tgtagcagaa ggcaagcacc aggctcacag ctggaaccct 1681 ggtgtctcct ccagcgtggt ggaagttggg ttaggagtac ggagatggag attggctccc 1741 aactcctccc tatcctaaag gcccactggc attaaagtgc tgtatcc // LOCUS HUMCFTRM 6129 bp mRNA PRI 15-DEC-1989 DEFINITION Human cystic fibrosis mRNA, encoding a presumed transmembrane conductance regulator (CFTR). ACCESSION M28668 NID g180331 KEYWORDS cystic fibrosis; transmembrane conductance regulator. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6129) AUTHORS Riordan,J.R., Rommens,J.M., Kerem,B., Alon,N., Rozmahel,R., Grzelczak,Z., Zielenski,J., Lok,S., Plavsic,N., Chou,J.-L., Drumm,M.L., Iannuzzi,M.C., Collins,F.S. and Tsui,L.-C. TITLE Identification of the cystic fibrosis gene: Cloning and characterization of complementary DNA JOURNAL Science 245, 1066-1073 (1989) MEDLINE 89368940 COMMENT A three base-pair deletion spanning positions 1654-1656 is observed in cDNAs from cystic fibrosis patients. FEATURES Location/Qualifiers source 1..6129 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 133..4575 /note="cystic fibrosis transmembrane conductance regulator" /codon_start=1 /db_xref="PID:g180332" /translation="MQRSPLEKASVVSKLFFSWTRPILRKGYRQRLELSDIYQIPSVD SADNLSEKLEREWDRELASKKNPKLINALRRCFFWRFMFYGIFLYLGEVTKAVQPLLL GRIIASYDPDNKEERSIAIYLGIGLCLLFIVRTLLLHPAIFGLHHIGMQMRIAMFSLI YKKTLKLSSRVLDKISIGQLVSLLSNNLNKFDEGLALAHFVWIAPLQVALLMGLIWEL LQASAFCGLGFLIVLALFQAGLGRMMMKYRDQRAGKISERLVITSEMIENIQSVKAYC WEEAMEKMIENLRQTELKLTRKAAYVRYFNSSAFFFSGFFVVFLSVLPYALIKGIILR KIFTTISFCIVLRMAVTRQFPWAVQTWYDSLGAINKIQDFLQKQEYKTLEYNLTTTEV VMENVTAFWEEGFGELFEKAKQNNNNRKTSNGDDSLFFSNFSLLGTPVLKDINFKIER GQLLAVAGSTGAGKTSLLMMIMGELEPSEGKIKHSGRISFCSQFSWIMPGTIKENIIF GVSYDEYRYRSVIKACQLEEDISKFAEKDNIVLGEGGITLSGGQRARISLARAVYKDA DLYLLDSPFGYLDVLTEKEIFESCVCKLMANKTRILVTSKMEHLKKADKILILNEGSS YFYGTFSELQNLQPDFSSKLMGCDSFDQFSAERRNSILTETLHRFSLEGDAPVSWTET KKQSFKQTGEFGEKRKNSILNPINSIRKFSIVQKTPLQMNGIEEDSDEPLERRLSLVP DSEQGEAILPRISVISTGPTLQARRRQSVLNLMTHSVNQGQNIHRKTTASTRKVSLAP QANLTELDIYSRRLSQETGLEISEEINEEDLKECLFDDMESIPAVTTWNTYLRYITVH KSLIFVLIWCLVIFLAEVAASLVVLWLLGNTPLQDKGNSTHSRNNSYAVIITSTSSYY VFYIYVGVADTLLAMGFFRGLPLVHTLITVSKILHHKMLHSVLQAPMSTLNTLKAGGI LNRFSKDIAILDDLLPLTIFDFIQLLLIVIGAIAVVAVLQPYIFVATVPVIVAFIMLR AYFLQTSQQLKQLESEGRSPIFTHLVTSLKGLWTLRAFGRQPYFETLFHKALNLHTAN WFLYLSTLRWFQMRIEMIFVIFFIAVTFISILTTGEGEGRVGIILTLAMNIMSTLQWA VNSSIDVDSLMRSVSRVFKFIDMPTEGKPTKSTKPYKNGQLSKVMIIENSHVKKDDIW PSGGQMTVKDLTAKYTEGGNAILENISFSISPGQRVGLLGRTGSGKSTLLSAFLRLLN TEGEIQIDGVSWDSITLQQWRKAFGVIPQKVFIFSGTFRKNLDPYEQWSDQEIWKVAD EVGLRSVIEQFPGKLDFVLVDGGCVLSHGHKQLMCLARSVLSKAKILLLDEPSAHLDP VTYQIIRRTLKQAFADCTVILCEHRIEAMLECQQFLVIEENKVRQYDSIQKLLNERSL FRQAISPSDRVKLFPHRNSSKCKSKPQIAALKEETEEEVQDTRL" BASE COUNT 1886 a 1181 c 1330 g 1732 t ORIGIN 1 aattggaagc aaatgacatc acagcaggtc agagaaaaag ggttgagcgg caggcaccca 61 gagtagtagg tctttggcat taggagcttg agcccagacg gccctagcag ggaccccagc 121 gcccgagaga ccatgcagag gtcgcctctg gaaaaggcca gcgttgtctc caaacttttt 181 ttcagctgga ccagaccaat tttgaggaaa ggatacagac agcgcctgga attgtcagac 241 atataccaaa tcccttctgt tgattctgct gacaatctat ctgaaaaatt ggaaagagaa 301 tgggatagag agctggcttc aaagaaaaat cctaaactca ttaatgccct tcggcgatgt 361 tttttctgga gatttatgtt ctatggaatc tttttatatt taggggaagt caccaaagca 421 gtacagcctc tcttactggg aagaatcata gcttcctatg acccggataa caaggaggaa 481 cgctctatcg cgatttatct aggcataggc ttatgccttc tctttattgt gaggacactg 541 ctcctacacc cagccatttt tggccttcat cacattggaa tgcagatgag aatagctatg 601 tttagtttga tttataagaa gactttaaag ctgtcaagcc gtgttctaga taaaataagt 661 attggacaac ttgttagtct cctttccaac aacctgaaca aatttgatga aggacttgca 721 ttggcacatt tcgtgtggat cgctcctttg caagtggcac tcctcatggg gctaatctgg 781 gagttgttac aggcgtctgc cttctgtgga cttggtttcc tgatagtcct tgcccttttt 841 caggctgggc tagggagaat gatgatgaag tacagagatc agagagctgg gaagatcagt 901 gaaagacttg tgattacctc agaaatgatt gaaaatatcc aatctgttaa ggcatactgc 961 tgggaagaag caatggaaaa aatgattgaa aacttaagac aaacagaact gaaactgact 1021 cggaaggcag cctatgtgag atacttcaat agctcagcct tcttcttctc agggttcttt 1081 gtggtgtttt tatctgtgct tccctatgca ctaatcaaag gaatcatcct ccggaaaata 1141 ttcaccacca tctcattctg cattgttctg cgcatggcgg tcactcggca atttccctgg 1201 gctgtacaaa catggtatga ctctcttgga gcaataaaca aaatacagga tttcttacaa 1261 aagcaagaat ataagacatt ggaatataac ttaacgacta cagaagtagt gatggagaat 1321 gtaacagcct tctgggagga gggatttggg gaattatttg agaaagcaaa acaaaacaat 1381 aacaatagaa aaacttctaa tggtgatgac agcctcttct tcagtaattt ctcacttctt 1441 ggtactcctg tcctgaaaga tattaatttc aagatagaaa gaggacagtt gttggcggtt 1501 gctggatcca ctggagcagg caagacttca cttctaatga tgattatggg agaactggag 1561 ccttcagagg gtaaaattaa gcacagtgga agaatttcat tctgttctca gttttcctgg 1621 attatgcctg gcaccattaa agaaaatatc atctttggtg tttcctatga tgaatataga 1681 tacagaagcg tcatcaaagc atgccaacta gaagaggaca tctccaagtt tgcagagaaa 1741 gacaatatag ttcttggaga aggtggaatc acactgagtg gaggtcaacg agcaagaatt 1801 tctttagcaa gagcagtata caaagatgct gatttgtatt tattagactc tccttttgga 1861 tacctagatg ttttaacaga aaaagaaata tttgaaagct gtgtctgtaa actgatggct 1921 aacaaaacta ggattttggt cacttctaaa atggaacatt taaagaaagc tgacaaaata 1981 ttaattttga atgaaggtag cagctatttt tatgggacat tttcagaact ccaaaatcta 2041 cagccagact ttagctcaaa actcatggga tgtgattctt tcgaccaatt tagtgcagaa 2101 agaagaaatt caatcctaac tgagacctta caccgtttct cattagaagg agatgctcct 2161 gtctcctgga cagaaacaaa aaaacaatct tttaaacaga ctggagagtt tggggaaaaa 2221 aggaagaatt ctattctcaa tccaatcaac tctatacgaa aattttccat tgtgcaaaag 2281 actcccttac aaatgaatgg catcgaagag gattctgatg agcctttaga gagaaggctg 2341 tccttagtac cagattctga gcagggagag gcgatactgc ctcgcatcag cgtgatcagc 2401 actggcccca cgcttcaggc acgaaggagg cagtctgtcc tgaacctgat gacacactca 2461 gttaaccaag gtcagaacat tcaccgaaag acaacagcat ccacacgaaa agtgtcactg 2521 gcccctcagg caaacttgac tgaactggat atatattcaa gaaggttatc tcaagaaact 2581 ggcttggaaa taagtgaaga aattaacgaa gaagacttaa aggagtgcct ttttgatgat 2641 atggagagca taccagcagt gactacatgg aacacatacc ttcgatatat tactgtccac 2701 aagagcttaa tttttgtgct aatttggtgc ttagtaattt ttctggcaga ggtggctgct 2761 tctttggttg tgctgtggct ccttggaaac actcctcttc aagacaaagg gaatagtact 2821 catagtagaa ataacagcta tgcagtgatt atcaccagca ccagttcgta ttatgtgttt 2881 tacatttacg tgggagtagc cgacactttg cttgctatgg gattcttcag aggtctacca 2941 ctggtgcata ctctaatcac agtgtcgaaa attttacacc acaaaatgtt acattctgtt 3001 cttcaagcac ctatgtcaac cctcaacacg ttgaaagcag gtgggattct taatagattc 3061 tccaaagata tagcaatttt ggatgacctt ctgcctctta ccatatttga cttcatccag 3121 ttgttattaa ttgtgattgg agctatagca gttgtcgcag ttttacaacc ctacatcttt 3181 gttgcaacag tgccagtgat agtggctttt attatgttga gagcatattt cctccaaacc 3241 tcacagcaac tcaaacaact ggaatctgaa ggcaggagtc caattttcac tcatcttgtt 3301 acaagcttaa aaggactatg gacacttcgt gccttcggac ggcagcctta ctttgaaact 3361 ctgttccaca aagctctgaa tttacatact gccaactggt tcttgtacct gtcaacactg 3421 cgctggttcc aaatgagaat agaaatgatt tttgtcatct tcttcattgc tgttaccttc 3481 atttccattt taacaacagg agaaggagaa ggaagagttg gtattatcct gactttagcc 3541 atgaatatca tgagtacatt gcagtgggct gtaaactcca gcatagatgt ggatagcttg 3601 atgcgatctg tgagccgagt ctttaagttc attgacatgc caacagaagg taaacctacc 3661 aagtcaacca aaccatacaa gaatggccaa ctctcgaaag ttatgattat tgagaattca 3721 cacgtgaaga aagatgacat ctggccctca gggggccaaa tgactgtcaa agatctcaca 3781 gcaaaataca cagaaggtgg aaatgccata ttagagaaca tttccttctc aataagtcct 3841 ggccagaggg tgggcctctt gggaagaact ggatcaggga agagtacttt gttatcagct 3901 tttttgagac tactgaacac tgaaggagaa atccagatcg atggtgtgtc ttgggattca 3961 ataactttgc aacagtggag gaaagccttt ggagtgatac cacagaaagt atttattttt 4021 tctggaacat ttagaaaaaa cttggatccc tatgaacagt ggagtgatca agaaatatgg 4081 aaagttgcag atgaggttgg gctcagatct gtgatagaac agtttcctgg gaagcttgac 4141 tttgtccttg tggatggggg ctgtgtccta agccatggcc acaagcagtt gatgtgcttg 4201 gctagatctg ttctcagtaa ggcgaagatc ttgctgcttg atgaacccag tgctcatttg 4261 gatccagtaa cataccaaat aattagaaga actctaaaac aagcatttgc tgattgcaca 4321 gtaattctct gtgaacacag gatagaagca atgctggaat gccaacaatt tttggtcata 4381 gaagagaaca aagtgcggca gtacgattcc atccagaaac tgctgaacga gaggagcctc 4441 ttccggcaag ccatcagccc ctccgacagg gtgaagctct ttccccaccg gaactcaagc 4501 aagtgcaagt ctaagcccca gattgctgct ctgaaagagg agacagaaga agaggtgcaa 4561 gatacaaggc tttagagagc agcataaatg ttgacatggg acatttgctc atggaattgg 4621 agctcgtggg acagtcacct catggaattg gagctcgtgg aacagttacc tctgcctcag 4681 aaaacaagga tgaattaagt ttttttttaa aaaagaaaca tttggtaagg ggaattgagg 4741 acactgatat gggtcttgat aaatggcttc ctggcaatag tcaaattgtg tgaaaggtac 4801 ttcaaatcct tgaagattta ccacttgtgt tttgcaagcc agattttcct gaaaaccctt 4861 gccatgtgct agtaattgga aaggcagctc taaatgtcaa tcagcctagt tgatcagctt 4921 attgtctagt gaaactcgtt aatttgtagt gttggagaag aactgaaatc atacttctta 4981 gggttatgat taagtaatga taactggaaa cttcagcggt ttatataagc ttgtattcct 5041 ttttctctcc tctccccatg atgtttagaa acacaactat attgtttgct aagcattcca 5101 actatctcat ttccaagcaa gtattagaat accacaggaa ccacaagact gcacatcaaa 5161 atatgcccca ttcaacatct agtgagcagt caggaaagag aacttccaga tcctggaaat 5221 cagggttagt attgtccagg tctaccaaaa atctcaatat ttcagataat cacaatacat 5281 cccttacctg ggaaagggct gttataatct ttcacagggg acaggatggt tcccttgatg 5341 aagaagttga tatgcctttt cccaactcca gaaagtgaca agctcacaga cctttgaact 5401 agagtttagc tggaaaagta tgttagtgca aattgtcaca ggacagccct tctttccaca 5461 gaagctccag gtagagggtg tgtaagtaga taggccatgg gcactgtggg tagacacaca 5521 tgaagtccaa gcatttagat gtataggttg atggtggtat gttttcaggc tagatgtatg 5581 tacttcatgc tgtctacact aagagagaat gagagacaca ctgaagaagc accaatcatg 5641 aattagtttt atatgcttct gttttataat tttgtgaagc aaaatttttt ctctaggaaa 5701 tatttatttt aataatgttt caaacatata ttacaatgct gtattttaaa agaatgatta 5761 tgaattacat ttgtataaaa taatttttat atttgaaata ttgacttttt atggcactag 5821 tatttttatg aaatattatg ttaaaactgg gacaggggag aacctagggt gatattaacc 5881 aggggccatg aatcaccttt tggtctggag ggaagccttg gggctgatcg agttgttgcc 5941 cacagctgta tgattcccag ccagacacag cctcttagat gcagttctga agaagatggt 6001 accaccagtc tgactgtttc catcaagggt acactgcctt ctcaactcca aactgactct 6061 taagaagact gcattatatt tattactgta agaaaatatc acttgtcaat aaaatccata 6121 catttgtgt // LOCUS HUMCGA 1637 bp mRNA PRI 27-MAR-1997 DEFINITION Human mRNA for ceramide glucosyltransferase, complete cds. ACCESSION D50840 NID g1350551 KEYWORDS ceramide glucosyltransferase; glucosyl ceramide synthase. SOURCE Homo sapiens cell_line:melanoma SK-Mel-28 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1637) AUTHORS Ichikawa,S., Sakiyama,H., Suzuki,G., Hidari,K.I. and Hirabayashi,Y. TITLE Expression cloning of a cDNA for human ceramide glucosyltransferase that catalyzes the first glycosylation step of glycosphingolipid synthesis JOURNAL Proc. Natl. Acad. Sci. U.S.A. 93 (10), 4638-4643 (1996) MEDLINE 96209784 REMARK Erratum:[Proc Natl Acad Sci U S A 1996 Oct 29;93(22):12654] REFERENCE 2 (bases 1 to 1637) AUTHORS Ichikawa,S. TITLE Direct Submission JOURNAL Submitted (01-JUN-1995) to the DDBJ/EMBL/GenBank databases. Shinichi Ichikawa, The Institute of Physical and Chemical Research(RIKEN), Glyco-Cell Biology; 2-1 Hirosawa, Wako-shi, Saitama 351-01, Japan (Tel:+81-48-462-1111(ex.6237), Fax:+81-48-462-4690) FEATURES Location/Qualifiers source 1..1637 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="melanoma SK-Mel-28" CDS 291..1475 /EC_number="2.4.1.80" /note="glucosyl ceramide synthase" /codon_start=1 /product="ceramide glucosyltransferase" /db_xref="PID:d1010093" /db_xref="PID:g1325917" /translation="MALLDLALEGMAVFGFVLFLVLWLMHFMAIIYTRLHLNKKATDK QPYSKLPGVSLLKPLKGVDPNLINNLETFFELDYPKYEVLLCVQDHDDPAIDVCKKLL GKYPNVDARLFIGGKKVGINPKINNLMPGYEVAKYDLIWICDSGIRVIPDTLTDMVNQ MTEKVGLVHGLPYVADRQGFAATLEQVYFGTSHPRYYISANVTGFKCVTGMSCLMRKD VLDQAGGLIAFAQYIAEDYFMAKAIADRGWRFAMSTQVAMQNSGSYSISQFQSRMIRW TKLRINMLPATIICEPISECFVASLIIGWAAHHVFRWDIMVFFMCHCLAWFIFDYIQL RGVQGGTLCFSKLDYAVAWFIRESMTIYIFLSALWDPTISWRTGRYRLRCGGTAEEIL DV" BASE COUNT 416 a 375 c 394 g 452 t ORIGIN 1 gaggcgaacc ggagcgcggg gccgcggtcg ccccgaccag agccgggaga ccgcagcacc 61 cgcagccgcc cgcgagcgcg ccgaagacag cgcgcaggcg agagcgcgcg ggcgggggcg 121 cgcaggccct gcccgcccct tccgtcccca cccccctccg ccctttcctc tccccacctt 181 cctctcgcct cccgcgcccc cgcaccgggc gcccaccctg tcctcctcct gcgggagcgt 241 tgtccgtgtt ggcggccgca gcgggccggg ccggtccggc gggccggggg atggcgctgc 301 tggacctggc cttggaggga atggccgtct tcgggttcgt cctcttcttg gtgctgtggc 361 tgatgcattt catggctatc atctacaccc gattacacct caacaagaag gcaactgaca 421 aacagcctta tagcaagctc ccaggtgtct ctcttctgaa accactgaaa ggggtagatc 481 ctaacttaat caacaacctg gaaacattct ttgaattgga ttatcccaaa tatgaagtgc 541 tcctttgtgt acaagatcat gatgatccag ccattgatgt atgtaagaag cttcttggaa 601 aatatccaaa tgttgatgct agattgttta taggtggtaa aaaagttggc attaatccta 661 aaattaataa tttaatgcca ggatatgaag ttgcaaagta tgatcttata tggatttgtg 721 atagtggaat aagagtaatt ccagatacgc ttactgacat ggtgaatcaa atgacagaaa 781 aagtaggctt ggttcacggg ctgccttacg tagcagacag acagggcttt gctgccacct 841 tagagcaggt atattttgga acttcacatc caagatacta tatctctgcc aatgtaactg 901 gtttcaaatg tgtgacagga atgtcttgtt taatgagaaa agatgtgttg gatcaagcag 961 gaggacttat agcttttgct cagtacattg ccgaagatta ctttatggcc aaagcgatag 1021 ctgaccgagg ttggaggttt gcaatgtcca ctcaagttgc aatgcaaaac tctggctcat 1081 attcaatttc tcagtttcaa tccagaatga tcaggtggac caaactacga attaacatgc 1141 ttcctgctac aataatttgt gagccaattt cagaatgctt tgttgccagt ttaattattg 1201 gatgggcagc ccaccatgtg ttcagatggg atattatggt atttttcatg tgtcattgcc 1261 tggcatggtt tatatttgac tacattcaac tcaggggtgt ccagggtggc acactgtgtt 1321 tttcaaaact tgattatgca gtcgcctggt tcatccgcga atccatgaca atatacattt 1381 ttttgtctgc attatgggac ccaactataa gctggagaac tggtcgctac agattacgct 1441 gtgggggtac agcagaggaa atcctagatg tataactaca gctttgtgac tgtatataaa 1501 ggaaaaaaga gaagtattat aaattatgtt tatataaatg cttttaaaaa tctaccttct 1561 gtagttttat cacatgtatg ttttggtatc tgttctttaa tttatttttg catggcactt 1621 gcatctgtga aaaaaaa // LOCUS HUMCGM7 1190 bp mRNA PRI 22-MAY-1991 DEFINITION Human CGM7 gene for nonspecific cross-reacting antigen (NCA). ACCESSION D90276 NID g219538 KEYWORDS CEA gene family; NCA; transmembrane protein. SOURCE Human peripheral leukocytes, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1190) AUTHORS Kuroki,M., Arakawa,F., Matsuo,Y., Oikawa,S., Misumi,Y., Nakazato,H. and Matsuoka,Y. TITLE Molecular cloning of nonspecific cross-reacting antigens in human granulocytes JOURNAL J. Biol. Chem. 266 (18), 11810-11817 (1991) MEDLINE 91268052 COMMENT These data kindly submitted in computer readable form by: Motomu Kuroki Department of Biochemistry School of Medicine, Fukuoka University 7-45-1 Nanakuma, Jonan-ku Fukuoka 814-01 Japan Phone: 092-801-1011 x2892 Fax: 092-801-3600. FEATURES Location/Qualifiers source 1..1190 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 98..832 /note="CGM7" /codon_start=1 /db_xref="PID:d1015026" /db_xref="PID:g219539" /translation="MGPPSAAPRGGHRPWQGLLITASLLTFWDPPTTVQFTIEALPSS AAEGKDVLLLACNISETIQAYYWHKGKTAEGSPLIAGYITDIQANIPGAAYSGREQVY PNGSLLFQNITLEDAGSYTLRTINASYDSDQATGQLHVHQNNVPGLPVGAVAGIVTGV LVGVALVAALVCFLLLSRTGRASIQRDLREQPPPASTPGHGPSHRSTFSAPLPSPRTA TPIYVEFLYSDANIYCQIDHKADVVS" BASE COUNT 269 a 371 c 318 g 232 t ORIGIN 1 gtcagcagcc ccgacagccg acagtcacag cagctctgac aagagcgttc ctggagccca 61 gctcctctcc acagaccaca agcacccagc agagaccatg ggccccccct cagccgctcc 121 ccgtggaggg cacaggccct ggcaggggct cctgatcaca gcctcacttt taaccttctg 181 ggacccgccc accactgtcc agttcactat tgaagccctg ccatccagtg ctgcagaggg 241 aaaggatgtt cttctactgg cctgcaatat ttcagaaact attcaagcct attattggca 301 caaggggaaa acggcagaag ggagccctct cattgctggt tatataacag acattcaagc 361 aaatatccca ggggccgcat acagtggtcg agagcaagta taccccaatg gatccctgct 421 gttccaaaac atcaccctgg aggacgcagg atcctacacc ctacgaacca taaatgccag 481 ttacgactct gaccaagcaa ctggccagct ccacgtacac caaaacaacg tcccaggcct 541 tcctgtgggg gccgtcgctg gcatcgtgac tggggtcctg gttggggtgg ctctggtggc 601 cgccctggtg tgttttctgc ttctctccag gactggaagg gccagcatcc agcgtgacct 661 cagggagcag ccgcccccag cctccacccc tggccatggt ccctctcaca gatccacctt 721 ctcggcccct ctacccagcc ccagaacagc cactcccatc tatgtggaat ttctatactc 781 tgatgcaaac atttactgcc agatcgacca caaagcagat gtggtctctt aggttcctct 841 gggagctgct cttgtgggtt gatggagcgt cctcgaagct ccagccctgg ggacggggaa 901 ggacatggag cctgagccag agaaccagct ctgagtcctg aggagacaca ggcctgggga 961 cagggaggga tgggagtccc tgctgaatat ctggagaccc tgacaggttg ccctgggctc 1021 cgggtgggcc gggacaaagg cctctcatca ccacaggaag cgggggcttg caaggaaagt 1081 gaatgggcct gtggcccacc cggggtcacc aggaaaggat ctgaataaag aggacccttc 1141 ctctcattgg ctctttttct gctcacggga acttagcaga aactcacctg // LOCUS HUMCGMP 2500 bp mRNA PRI 01-NOV-1994 DEFINITION Human cGMP-gated cation channel protein mRNA, complete cds. ACCESSION M84741 NID g180461 KEYWORDS cGMP-gated cation channel. SOURCE Homo sapiens adult retina cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2500) AUTHORS Pittler,S.J., Lee,A.K., Altherr,M.R., Howard,T.A., Seldin,M.F., Hurwitz,R.L., Wasmuth,J.J. and Baehr,W. TITLE Primary structure and chromosomal localization of human and mouse rod photoreceptor cGMP-gated cation channel JOURNAL J. Biol. Chem. 267 (9), 6257-6262 (1992) MEDLINE 92210603 FEATURES Location/Qualifiers source 1..2500 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="retina" /map="4" gene 25..2097 /gene="CNCG" CDS 25..2097 /gene="CNCG" /codon_start=1 /db_xref="GDB:G00-127-557" /product="cGMP-gated cation channel protein" /db_xref="PID:g180462" /translation="MKLSMKNNIINTQQSFVTMPNVIVPDIEKEIRRMENGACSSFSE DDDSAYTSEESENENPHARGSFSYKSLRKGGPSQREQYLPGAIAIFNVNNSSNKDQEP EEKKKKKKEKKSKSDDKNENKNDPEKKKKKKDKEKKKKEEKSKDKKEHHKKEVVVIDP SGNTYYNWLFCITLPVMYNWTMVIARACFDELQSDYLEYWLILDYVSDIVYLIDMFVR TRTGYLEQGLLVKEELKLINKYKSNLQFKLDVLSLIPTDLLYFKLGWNYPEIRLNRLL RFSRMFEFFQRTETRTNYPNIFRISNLVMYIVIIIHWNACVFYSISKAIGFGNDTWVY PDINDPEFGRLARKYVYSLYWSTLTLTTIGETPPPVRDSEYVFVVVDFLIGVLIFATI VGNIGSMISNMNAARAEFQARIDAIKQYMHFRNVSKDMEKRVIKWFDYLWTNKKTVDE KEVLKYLPDKLRAEIAINVHLDTLKKVRIFADCEAGLLVELVLKLQPQVYSPGDYICK KGDIGREMYIIKEGKLAVVADDGVTQFVVLSDGSTFGEISILNIKGSKAGNRRTANIK SIGYSDLFCLSKDDLMEALTEYPDAKTMLEEKGKQILMKDGLLDLNIANAGSDPKDLE EKVTRMEGSVDLLQTRFARILAEYESMQQKLKQRLTKVEKFLKPLIDTEFSSIEGPWS ESGPIDST" BASE COUNT 861 a 431 c 525 g 683 t ORIGIN 1 tatattactt aaacaaccaa agatatgaaa ctatccatga agaacaatat tatcaataca 61 cagcagtctt ttgtaaccat gcccaatgtg attgtaccag atattgaaaa ggaaatacga 121 aggatggaaa atggagcatg cagctccttt tctgaggatg atgacagtgc ctatacatct 181 gaagaatcag agaatgaaaa ccctcatgca aggggttcct ttagttataa gtcactcaga 241 aagggaggac catcacagag ggagcagtac ctgcctggtg ccattgccat ttttaatgtg 301 aacaacagca gcaataagga ccaggaacca gaggaaaaaa agaaaaagaa aaaagaaaag 361 aagagcaagt cagatgataa aaacgaaaat aaaaacgacc cagagaagaa aaagaagaaa 421 aaggacaaag agaagaaaaa gaaagaggag aaaagcaaag ataagaaaga acaccacaag 481 aaagaagttg tggttattga tccctcggga aacacatatt acaactggct gttttgcatc 541 acattacctg ttatgtacaa ctggacaatg gttattgcca gagcatgttt tgatgaactt 601 caatctgatt acctagaata ttggctcatt ttggattacg tatcagacat agtctattta 661 atcgatatgt ttgtacgaac aaggacaggt tacctagaac aaggactgct ggtaaaggaa 721 gaacttaaac tcataaataa atataaatcc aacttgcaat ttaaacttga tgttctgtca 781 ctgataccaa ctgatttgct gtattttaag ttagggtgga actatccaga aattagatta 841 aacaggttgt tacggttctc tcgtatgttt gagttcttcc agagaacaga aacaaggaca 901 aactatccaa acatcttcag gatttccaac cttgttatgt atatcgtcat cattatccac 961 tggaatgcat gtgtgttcta ctctatttct aaagctattg gatttggaaa tgatacatgg 1021 gtctaccctg atattaatga tcctgaattt ggccgtttgg ctagaaaata cgtatacagc 1081 ctttactggt ctacactgac tttgactacc attggtgaaa caccccctcc cgtgagggat 1141 tctgagtatg tctttgtggt ggttgatttc ctaattggag tgttaatttt tgctaccatc 1201 gttggtaaca taggttctat gatttccaac atgaatgcag ccagagcaga atttcaagca 1261 agaattgatg ctatcaagca atatatgcat tttcgaaatg taagcaaaga tatggaaaag 1321 agggttatta aatggtttga ctacctgtgg accaacaaaa aaacagttga tgagaaagaa 1381 gtcttaaagt atctacctga taaactaaga gcagaaattg ccatcaacgt tcacttagac 1441 acattaaaaa aggtacgcat ttttgctgat tgtgaagctg gtctgttggt ggagttggtc 1501 ttgaaattgc aaccccaagt ctacagtcct ggagattata tttgcaagaa aggggatatc 1561 ggacgagaga tgtacattat caaggaaggc aaactcgctg tggtggcaga tgatggagtc 1621 actcagtttg tggtattgag cgatggcagc accttcggtg agatcagcat tcttaacatt 1681 aaagggagca aagctggcaa tcgaagaacg gccaatatta aaagtattgg ctactcagac 1741 ctgttctgtc tctcaaaaga tgacctcatg gaagctctaa ctgagtaccc agatgccaaa 1801 actatgctag aagaaaaagg gaagcaaatt ttaatgaaag atggtctact ggatctaaac 1861 attgcaaatg ctggcagtga tcctaaagat cttgaagaga aggttactcg aatggagggg 1921 tcagtagacc tcctgcaaac caggtttgcc cgaatcttgg ctgagtatga gtccatgcag 1981 cagaaactga aacaaagatt aaccaaggtt gagaaatttc tgaaaccgct tattgacaca 2041 gaattttcaa gtattgaggg accttggagc gaaagtgggc ccatcgactc tacatagaac 2101 cgaaaagctg gtcattaaca gggacatgcc tcatgatcct tttgatccta tgactgacat 2161 caactaaaat ttaaaagaag aggaagactc agttgggaaa tttttccatg aggaaaatgt 2221 gctttggtgc aaggtacagc ccacacctct ctgagagata ctatgattaa aaaagcttta 2281 tatctgggat ttttcacaac tgataatgtg caaagatata aactgattaa cttgtcagtg 2341 tctgtatttt ctgatttttt cacatacgct cattttatgt aatattcttc ataaaaatga 2401 ataagtagcc ctcactttca tgccatttcc attgttgagt gaagcgtatt tgaagtaact 2461 gagaattacc atgtacatca tatttgggat aacattttta // LOCUS HUMCGPRA 2994 bp mRNA PRI 08-SEP-1997 DEFINITION Human cGMP phosphodiesterase alpha subunit (CGPR-A) mRNA, complete cds. ACCESSION M26061 NID g2366986 KEYWORDS cGMP phosphodiesterase; photoreceptor. SOURCE Human adult retina, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2994) AUTHORS Pittler,S.J., Baehr,W., Wasmuth,J.J., McConnell,D.G., Champagne,M.S., vanTuinen,P., Ledbetter,D. and Davis,R.L. TITLE Molecular characterization of human and bovine rod photoreceptor cGMP phosphodiesterase alpha-subunit and chromosomal localization of the human gene JOURNAL Genomics 6 (2), 272-283 (1990) MEDLINE 90169986 COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by S.J.Pittler, 10-JUL-1989. FEATURES Location/Qualifiers source 1..2994 /organism="Homo sapiens" /db_xref="taxon:9606" /map="5q31.2-q34" mRNA 1..>2994 /gene="PDEA" /note="CGPR-A mRNA; G00-120-265" gene <1..>2994 /gene="PDEA" 5'UTR 1..120 /gene="PDEA" CDS 121..2703 /gene="PDEA" /EC_number="3.1.4.35" /note="cGMP phosphodiesterase" /codon_start=1 /db_xref="GDB:G00-120-265" /db_xref="PID:g2366987" /translation="MGEVTAEEVEKFLDSNIGFAKQYYNLHYRAKLISDLLGAKEAAV DFSNYHSPSSMEESEIIFDLLRDFQENLQTEKCIFNVMKKLCFLLQADRMSLFMYRTR NGIAELATRLFNVHKDAVLEDCLVMPDQEIVFPLDMGIVGHVAHSKKIANVPNTEEDE HFCDFVDILTEYKTKNILASPIMNGKDVVAIIMAVNKVDGSHFTKRDEEILLKYLNFA NLIMKWYHLSYLHNCETRRGQILLWSGSKVFEELTDIERQFHKALYTVRAFLNCDRYS VGLLDMTKQKEFFDVWPVLMGEVPPYSGPRTPDGREINFYKVIDYILHGKEDIKVIPN PPPDHWALVSGLPAYVAQNGLICNIMNAPAEDFFAFQKEPLDESGWMIKNVLSMPIVN KKEEIVGVATFYNRKDGKPFDEMDETLMESLTQFLGWSVLNPDTYESMNKLENRKDIF QDIVKYHVKCDNEEIQKILKTREVYGKEPWECEEEELAEILQAELPDADKYEINKFHF SDLPLTELELVKCGIQMYYELKVVDKFHIPQEALVRFMYSLSKGYRKITYHNWRHGFN VGQTMFSLLVTGKLKRYFTDLEALAMVTAAFCHDIDHRGTNNLYQMKSQNPLAKLHGS SILERHHLEFGKTLLRDESLNIFQNLNRRQHEHAIHMMDIAIIATDLALYFKKRTMFQ KIVDQSKTYESEQEWTQYMMLEQTRKEIVMAMMMTACDLSAITKPWEVQSQVALLVAA EFWEQGDLERTVLQQNPIPMMDRNKADELPKLQVGFIDFVCTFVYKEFSRFHEEITPM LDGITNNRKEWKALADEYDAKMKVQEEKKQKQQSAKSAAAGNQPGGNPSPGGATTSKS CCIQ" 3'UTR 2704..>2994 /gene="PDEA" BASE COUNT 859 a 700 c 752 g 683 t ORIGIN Chromosome 5q31.2-q34. 1 agtatgtttt gcagacaaga cccagagaag tccagactgg acttgttgca gactgcaaaa 61 ctgccattgg aaggcctccg tcccagtcct tctacagagt agccagtggg actcccagcc 121 atgggcgagg tgacagcaga ggaggtggag aagttcctgg actcgaatat tggctttgcc 181 aaacagtact acaacctcca ctaccgggcc aagctcatct ccgacctcct tggggccaag 241 gaggctgccg tggacttcag caactaccac tccccgagca gcatggagga gagcgaaatc 301 atctttgatc tcctgcggga ctttcaggag aatttacaga cagagaaatg catcttcaat 361 gtcatgaaga agctgtgctt cctcctgcag gcagaccgca tgagcctgtt catgtaccgg 421 acccgcaatg gcatcgcaga gctggccacc aggcttttca atgtccacaa ggatgctgtc 481 ctcgaggact gcctggtgat gcccgaccaa gagatcgtct tccctttgga catgggcatc 541 gtgggccatg tcgcacactc taagaagatt gctaacgtcc ccaacacaga ggaggatgag 601 catttctgtg actttgtgga catcctcaca gagtacaaga ccaagaacat cttggcttcc 661 cccataatga atgggaagga tgtggtggcc ataatcatgg ctgtgaataa agtggatgga 721 tcccacttca ccaagagaga tgaagagatt cttctcaagt acctcaattt tgcaaatcta 781 atcatgaagt ggtaccacct gagttacctg cacaactgtg aaactcgacg tggccagata 841 ctgctgtggt ctgggagcaa agtctttgaa gaacttacgg acatcgaacg acagttccac 901 aaagccctgt acacagtccg tgctttcctc aactgtgaca gatactctgt gggtctctta 961 gacatgacca agcagaagga attttttgat gtgtggccgg ttctgatggg tgaagttcca 1021 ccttactctg gtcccaggac tccggatgga agagaaatta acttttacaa ggtcattgac 1081 tacatcctgc atggcaaaga ggacatcaaa gtcatcccga atccacctcc tgaccattgg 1141 gctttagtaa gcggtctccc cgcttatgtt gcccagaatg gcctgatttg caacatcatg 1201 aacgcgcctg cggaggactt ttttgcattt cagaaagaac ctctggatga gtctggatgg 1261 atgattaaaa atgtgctttc aatgccgatt gtgaacaaga aggaagaaat tgttggagtg 1321 gccacatttt acaatcgtaa agatgggaag ccctttgatg aaatggatga gacgctcatg 1381 gagtctttga ctcaatttct gggctggtct gtcttaaatc ctgacaccta tgagtcaatg 1441 aataaacttg aaaataggaa ggatattttc caggacatag taaaatatca tgtgaagtgt 1501 gacaatgaag aaattcagaa aatcttgaaa accagagagg tgtatgggaa ggagccatgg 1561 gagtgtgagg aagaggagct ggctgagatc ctgcaagcgg agctgccaga tgcagataaa 1621 tacgaaatta ataaatttca cttcagtgac ttacccctaa cagaactgga gctggtaaaa 1681 tgtggaatac agatgtatta tgagctcaaa gtggtggata aatttcacat tccacaagag 1741 gccctggtgc ggttcatgta ctccctgagt aagggctacc gcaagatcac ctaccacaac 1801 tggcggcacg gcttcaacgt ggggcagacc atgttctccc tgctggtgac gggaaagctg 1861 aagcgctact tcacggacct agaggccttg gccatggtca ctgctgcttt ctgccatgac 1921 attgaccaca gaggcaccaa taacctctac cagatgaaat cccagaaccc actggccaag 1981 ctccatgggt cctctatctt ggaaagacac cacttggagt ttggcaaaac actgctcaga 2041 gacgagagcc tgaatatctt tcaaaacctc aatcgtcgac agcatgagca tgccatccac 2101 atgatggaca ttgcaatcat tgccacagac ctcgccctgt atttcaagaa gaggacgatg 2161 ttccaaaaga tcgtggatca gtctaagaca tatgagagtg aacaggagtg gacacagtac 2221 atgatgctgg agcagacacg gaaggaaatc gttatggcca tgatgatgac cgcctgtgat 2281 ctctcagcca tcaccaaacc ctgggaggtg cagagccagg tagctctgct ggtggctgct 2341 gaattctggg aacaaggtga cctggagcgc acggtgctgc aacagaatcc cattcccatg 2401 atggacagaa acaaagcaga tgaactccct aagcttcaag tcggcttcat tgactttgtt 2461 tgcaccttcg tctacaagga attctcccgt ttccacgagg agatcacccc aatgttggac 2521 gggatcacca acaatcgcaa ggagtggaag gcgcttgctg atgagtacga tgccaagatg 2581 aaggtgcagg aggagaagaa gcagaaacag cagtcggcca agtcagcagc cgcaggaaat 2641 cagccggggg gaaaccccag cccagggggt gcaactacat ccaagtcctg ctgcatccag 2701 taacaccact ggggatgtgc tggctggacg gcaccaccct ttcctgggaa gagatgactc 2761 aagccagtgg aagaccacac accttgagaa gtagaagagt cataggattt gaaagctgtt 2821 agagaattta gcttccagga ctgttcaatc ttttggcttc cctgggccac attggaagaa 2881 ttgtcttggg tcacacataa aatacagtaa cactaatgat agctgttgaa cttaaaaaaa 2941 aaaatcgcca aaaaaaaaat ttcataatgt tttaagaaag tttatgaatt tgtg // LOCUS HUMCHEB 2416 bp mRNA PRI 24-APR-1996 DEFINITION Human butyrylcholinesterase, mRNA, complete cds. ACCESSION M16541 J02964 NID g1280204 KEYWORDS butyrylcholinesterase; cholinesterase. SOURCE Human 1-day old female basal gangaglia, cDNA to mRNA, clones Oh57,z35, z3, z2 and z13. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2416) AUTHORS McTiernan,C., Adkins,S., Chatonnet,A., Vaughan,T.A., Bartels,C.F., Kott,M., Rosenberry,T.L., La Du,B.N. and Lockridge,O. TITLE Brain cDNA clone for human cholinesterase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84 (19), 6682-6686 (1987) MEDLINE 88016155 COMMENT Draft entry and computer-readable copy of sequence in [1] kindly provided by S.Adkins, 29-JUL-1987. ATG codons were found at nucleotides 7-9, 73-75, 130-132 and 190-192 but the triplet at bases 130-132 was confirmed as the initiation codon. FEATURES Location/Qualifiers source 1..2416 /organism="Homo sapiens" /db_xref="taxon:9606" /map="3q26" mRNA 1..2416 /note="BCHE mRNA" sig_peptide 130..213 /gene="BCHE" /note="cholinesterase signal peptide" gene 130..1938 /gene="BCHE" CDS 130..1938 /gene="BCHE" /note="cholinesterase (EC 3.1.1.8)" /codon_start=1 /db_xref="GDB:G00-120-558" /db_xref="PID:g180484" /translation="MHSKVTIICIRFLFWFLLLCMLIGKSHTEDDIIIATKNGKVRGM NLTVFGGTVTAFLGIPYAQPPLGRLRFKKPQSLTKWSDIWNATKYANSCCQNIDQSFP GFHGSEMWNPNTDLSEDCLYLNVWIPAPKPKNATVLIWIYGGGFQTGTSSLHVYDGKF LARVERVIVVSMNYRVGALGFLALPGNPEAPGNMGLFDQQLALQWVQKNIAAFGGNPK SVTLFGESAGAASVSLHLLSPGSHSLFTRAILQSGSFNAPWAVTSLYEARNRTLNLAK LTGCSRENETEIIKCLRNKDPQEILLNEAFVVPYGTPLSVNFGPTVDGDFLTDMPDIL LELGQFKKTQILVGVNKDEGTAFLVYGAPGFSKDNNSIITRKEFQEGLKIFFPGVSEF GKESILFHYTDWVDDQRPENYREALGDVVGDYNFICPALEFTKKFSEWGNNAFFYYFE HRSSKLPWPEWMGVMHGYEIEFVFGLPLERRDNYTKAEEILSRSIVKRWANFAKYGNP NETQNNSTSWPVFKSTEQKYLTLNTESTRIMTKLRAQQCRFWTSFFPKVLEMTGNIDE AEWEWKAGFHRWNNYMMDWKNQFNDYTSKKESCVGL" mat_peptide 214..1935 /gene="BCHE" /note="cholinesterase mature peptide" BASE COUNT 774 a 431 c 478 g 733 t ORIGIN 303 bp upstream of EcoRI site; HGML map location 3q21-qter. 1 tactgaatgt cagtgcagtc caatttacag gctggagcag cagctgcatc ctgcatttcc 61 ccgaagtatt acatgatttt cactccttgc aaactttacc atctttgttg cagagaatcg 121 gaaatcaata tgcatagcaa agtcacaatc atatgcatca gatttctctt ttggtttctt 181 ttgctctgca tgcttattgg gaagtcacat actgaagatg acatcataat tgcaacaaag 241 aatggaaaag tcagagggat gaacttgaca gtttttggtg gcacggtaac agcctttctt 301 ggaattccct atgcacagcc acctcttggt agacttcgat tcaaaaagcc acagtctctg 361 accaagtggt ctgatatttg gaatgccaca aaatatgcaa attcttgctg tcagaacata 421 gatcaaagtt ttccaggctt ccatggatca gagatgtgga acccaaacac tgacctcagt 481 gaagactgtt tatatctaaa tgtatggatt ccagcaccta aaccaaaaaa tgccactgta 541 ttgatatgga tttatggtgg tggttttcaa actggaacat catctttaca tgtttatgat 601 ggcaagtttc tggctcgggt tgaaagagtt attgtagtgt caatgaacta tagggtgggt 661 gccctaggat tcttagcttt gccaggaaat cctgaggctc cagggaacat gggtttattt 721 gatcaacagt tggctcttca gtgggttcaa aaaaatatag cagcctttgg tggaaatcct 781 aaaagtgtaa ctctctttgg agaaagtgca ggagcagctt cagttagcct gcatttgctt 841 tctcctggaa gccattcatt gttcaccaga gccattctgc aaagtggatc ctttaatgct 901 ccttgggcgg taacatctct ttatgaagct aggaacagaa cgttgaactt agctaaattg 961 actggttgct ctagagagaa tgagactgaa ataatcaagt gtcttagaaa taaagatccc 1021 caagaaattc ttctgaatga agcatttgtt gtcccctatg ggactccttt gtcagtaaac 1081 tttggtccga ccgtggatgg tgattttctc actgacatgc cagacatatt acttgaactt 1141 ggacaattta aaaaaaccca gattttggtg ggtgttaata aagatgaagg gacagctttt 1201 ttagtctatg gtgctcctgg cttcagcaaa gataacaata gtatcataac tagaaaagaa 1261 tttcaggaag gtttaaaaat attttttcca ggagtgagtg agtttggaaa ggaatccatc 1321 ctttttcatt acacagactg ggtagatgat cagagacctg aaaactaccg tgaggccttg 1381 ggtgatgttg ttggggatta taatttcata tgccctgcct tggagttcac caagaagttc 1441 tcagaatggg gaaataatgc ctttttctac tattttgaac accgatcctc caaacttccg 1501 tggccagaat ggatgggagt gatgcatggc tatgaaattg aatttgtctt tggtttacct 1561 ctggaaagaa gagataatta cacaaaagcc gaggaaattt tgagtagatc catagtgaaa 1621 cggtgggcaa attttgcaaa atatgggaat ccaaatgaga ctcagaacaa tagcacaagc 1681 tggcctgtct tcaaaagcac tgaacaaaaa tatctaacct tgaatacaga gtcaacaaga 1741 ataatgacga aactacgtgc tcaacaatgt cgattctgga catcattttt tccaaaagtc 1801 ttggaaatga caggaaatat tgatgaagca gaatgggagt ggaaagcagg attccatcgc 1861 tggaacaatt acatgatgga ctggaaaaat caatttaacg attacactag caagaaagaa 1921 agttgtgtgg gtctctaatt aatagattta ccctttatag aacatatttt cctttagatc 1981 aaggcaaaaa tatcaggagc ttttttacac acctactaaa aaagttatta tgtagctgaa 2041 acaaaaatgc cagaaggata atattgattc ctcacatctt taacttagta ttttacctag 2101 catttcaaaa cccaaatggc tagaacatgt ttaattaaat ttcacaatat aaagttctac 2161 agttaattat gtgcatatta aaacaatggc ctggttcaat ttctttcttt ccttaataaa 2221 tttaagtttt ttccccccaa aattatcagt gctctgcttt tagtcacgtg tattttcatt 2281 accactcgta aaaaggtatc ttttttaaat gaattaaata ttgaaacact gtacaccata 2341 gtttacaata ttatgtttcc taattaaaat aagaattgaa tgtcaatatg agatattaaa 2401 ataagcacag aaaatc // LOCUS HUMCHED 2219 bp mRNA PRI 31-DEC-1994 DEFINITION Human cdc2-related protein kinase (CHED) mRNA, complete cds. ACCESSION M80629 NID g180491 KEYWORDS cdc2-related protein kinase. SOURCE Homo sapiens (tissue library: lambda gt10) brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2219) AUTHORS Lapidot-Lifson,Y., Patinkin,D., Prody,C.A., Ehrlich,G., Seidman,S., Ben-Aziz,R., Benseler,F., Eckstein,F., Zakut,H. and Soreq,H. TITLE Cloning and antisense oligodeoxynucleotide inhibition of a human homolog of cdc2 required in hematopoiesis JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (2), 579-583 (1992) MEDLINE 92115704 FEATURES Location/Qualifiers source 1..2219 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="glioblastoma multiform" /tissue_type="brain" /tissue_lib="lambda gt10" gene 772..2028 /gene="CHED" CDS 772..2028 /gene="CHED" /codon_start=1 /product="cdc2-related protein kinase" /db_xref="PID:g180492" /translation="MLPEDKEADSLRGNISVKAVKKEVEKKLRCLLADLPLPPELPGG DDLSKSPEEKKTTTQLHSKRRPKICGPRYGETKEKDIDWGKLCVDKFDIIGIIGEGTY GQVYKARDKDTGEMVALKKVRLDNEKEGFPITAIREIKILRQLTHQSIINMKEIVTDK EDALDFKKDKGAFYLVFEYMDHDLMGLLESGLVHFYENHIKSFMRQLMEGLDYCHKKN FLHRDIKCSNILLNNRGQIKLADFGLARLYSSEESRPYTNKVITLWYRPPELLLGEER YTPAIDVWSCGCILGELFTKKPIFQANQELAQLELISRICGSPCPAVWPDVIKLPYFN TMKPKKQYRRKLREEFVFIPAAALDLFDYMLALDPSKRCTAEQALQCEFLRDVEPSKC LHQISLYGKIVMSYGVKSEEDRSRWA" BASE COUNT 736 a 481 c 499 g 503 t ORIGIN 1 attcgcggcc cctacagtcg ccgccgctcc cccagctaca gccgccacag ctcctacgag 61 cggggcggcg acgtgtcccc tagtccctac agcagcagca gctggcgccg ctctcgcagt 121 ccctacagcc ctgtgctcag acggtctgga aaatcccgaa gcagaagccc gtattcatct 181 aggcattcaa gatctcgtag caggcacaga ttgtctagat ccagaagtcg tcattctagt 241 atttctccta gcacactaac tctgaagagt agcctggcag ctgaattgaa caagaataaa 301 aaagcacgag cagcagaggc agcaagagcc gcagaagcag cgaaagctgc agaagcaact 361 aaggctgctg aggctgctgc caaggctgca aaagcttcaa acacttctac acctaccaag 421 gggaacacgg aaactagtgc cagtgcatca caaacaaacc atgtgaagga tgtgaagaaa 481 attaaaattg aacatgcacc ttctccctca agtggtggaa ctttaaaaaa tgacaaagca 541 aaaacaaagc cacctcttca ggtaacgaag gtggaaaata atttgattgt agataaagcc 601 accaagaaag cagtcatagt tggaaaggag agtaaatctg ctgctacaaa ggaggaatca 661 gtatctctta aagagaaaac caaaccactt acaccaagca taggagccaa ggagaaggag 721 caacatgtag ctttagtcac ctctacatta ccaccgttac ctttgcctcc catgctgcct 781 gaagataaag aagctgatag cttacgagga aatatttcag taaaagcagt taaaaaagaa 841 gtagaaaaga aactccgatg tcttcttgct gatttaccgc tgccccctga gctaccagga 901 ggagatgatc tttcaaagag tccagaggaa aagaaaacaa caacacagtt acatagtaaa 961 aggaggccta aaatatgtgg gcctcgctat ggtgaaacca aagaaaaaga tattgactgg 1021 ggaaaactct gcgtggataa atttgatatc atcggaatta ttggagaagg tacttacgga 1081 caagtttaca aagccaggga taaagacact ggagaaatgg tagccttaaa aaaagtacgt 1141 ctggataatg aaaaggaagg ctttccaatt acagcaattc gagaaattaa aattctccgg 1201 cagcttaccc atcagagtat tatcaatatg aaggaaatag tgactgataa agaagatgct 1261 ttggatttca agaaggacaa aggtgcattt tatctggtgt ttgaatatat ggaccatgat 1321 ctgatgggac tactggaatc aggcttggtt catttttatg aaaatcacat aaagtcattt 1381 atgagacagc tcatggaggg tctggattat tgtcataaga agaacttttt gcatagagat 1441 attaaatgtt ccaatatcct tctaaataat agagggcaga taaaacttgc agactttgga 1501 cttgctcgat tgtatagctc agaagaaagt cggccgtata ctaacaaggt aattacttta 1561 tggtaccgtc cacctgaact gctactggga gaagaacgat acacaccagc cattgatgta 1621 tggagctgtg gctgtatcct tggcgaactc ttcactaaaa aacctatatt tcaagcaaat 1681 caggaacttg cacaactaga attaataagc cgaatatgtg ggagtccatg tcctgcagtg 1741 tggcctgatg taatcaaact accatatttc aacaccatga aaccaaagaa gcaatatcgt 1801 cgaaagttaa gagaagaatt tgtttttatt cctgcagctg cgctagactt atttgattac 1861 atgcttgcct tggatcctag taagcgctgc actgctgaac aggctcttca gtgcgagttc 1921 ctccgagatg tggaaccctc aaaatgcctc caccagatct ccctttatgg caagattgtc 1981 atgagttatg gagtaaaaag cgaagaagac agaagcagat gggcatgact gatgtttcca 2041 ccattaaagc ccccaggaag gacttgtctc tgggcttgga tgacagcaga accaacacac 2101 cccagggtgt gctgccatct tcacagctga aatctcaggg cagctcaaat gtggcacctg 2161 gtgaaaaaca gacagatcca tcaacaccac aacaggagtc ttcgaaaccg ttgggagga // LOCUS HUMCHIP28A 1340 bp mRNA PRI 31-DEC-1994 DEFINITION Human channel-like integral membrane protein (CHIP28) mRNA, complete cds. ACCESSION M77829 NID g180500 KEYWORDS channel-like integral membrane protein. SOURCE Homo sapiens male adult bone marrow cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1340) AUTHORS Preston,G.M. and Agre,P. TITLE Isolation of the cDNA for erythrocyte integral membrane protein of 28 kilodaltons: member of an ancient channel family JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88 (24), 11110-11114 (1991) MEDLINE 92107900 FEATURES Location/Qualifiers source 1..1340 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /sex="male" /tissue_type="bone marrow" 5'UTR 1..38 /gene="CHIP28" gene 1..1340 /gene="CHIP28" CDS 39..848 /gene="CHIP28" /codon_start=1 /product="channel-like integral membrane protein" /db_xref="PID:g180501" /translation="MASEFKKKLFWRAVVAEFLATTLFVFISIGSALGFKYPVGNNQT AVQDNVKVSLAFGLSIATLAQSVGHISGAHLNPAVTLGLLLSCQISIFRALMYIIAQC VGAIVATAILSGITSSLTGNSLGRNDLADGVNSGQGLGIEIIGTLQLVLCVLATTDRR RRDLGGSAPLAIGLSVALGHLLAIDYTGCGINPARSFGSAVITHNFSNHWIFWVGPFI GGALAVLIYDFILAPRSSDLTDRVKVWTSGQVEEYDLDADDINSRVEMKPK" 3'UTR 846..1340 /gene="CHIP28" BASE COUNT 250 a 411 c 397 g 282 t ORIGIN 1 gcacccggca gcggtctcag gccaagcccc ctgccagcat ggccagcgag ttcaagaaga 61 agctcttctg gagggcagtg gtggccgagt tcctggccac gaccctcttt gtcttcatca 121 gcatcggttc tgccctgggc ttcaaatacc cggtggggaa caaccagacg gcggtccagg 181 acaacgtgaa ggtgtcgctg gccttcgggc tgagcatcgc cacgctggcg cagagtgtgg 241 gccacatcag cggcgcccac ctcaacccgg ctgtcacact ggggctgctg ctcagctgcc 301 agatcagcat cttccgtgcc ctcatgtaca tcatcgccca gtgcgtgggg gccatcgtcg 361 ccaccgccat cctctcaggc atcacctcct ccctgactgg gaactcgctt ggccgcaatg 421 acctggctga tggtgtgaac tcgggccagg gcctgggcat cgagatcatc gggaccctcc 481 agctggtgct atgcgtgctg gctactaccg accggaggcg ccgtgacctt ggtggctcag 541 ccccccttgc catcggcctc tctgtagccc ttggacacct cctggctatt gactacactg 601 gctgtgggat taaccctgct cggtcctttg gctccgcggt gatcacacac aacttcagca 661 accactggat tttctgggtg gggccattca tcgggggagc cctggctgta ctcatctacg 721 acttcatcct ggccccacgc agcagtgacc tcacagaccg cgtgaaggtg tggaccagcg 781 gccaggtgga ggagtatgac ctggatgccg acgacatcaa ctccagggtg gagatgaagc 841 ccaaatagaa ggggtctggc ccgggcatcc acgtaggggg caggggcagg ggcgggcgga 901 gggaggggag gggtgaaatc catactgtag acactctgac aagctggcca aagtcacttc 961 cccaagatct gccagacctg catggtcaag cctcttatgg gggtgtttct atctctttct 1021 ttctctttct gtttcctggc ctcagagctt cctggggacc aagatttacc aattcaccca 1081 ctcccttgaa gttgtggagg aggtgaaaga aagggaccca cctgctagtc gcccctcaga 1141 gcatgatggg aggtgtgcca gaaagtcccc cctcgcccca aagttgctca ccgactcacc 1201 tgcgcaagtg cctgggattc taccgtaatt gctttgtgcc tttgggcacg gccctccttc 1261 ttttcctaac atgcaccttg ctcccaatgg tgcttggagg gggaagagat cccaggaggt 1321 gcagtggagg gggcaagctt // LOCUS HUMCHITO 1618 bp mRNA PRI 10-DEC-1992 DEFINITION Homo sapiens di-N-acetylchitobiase mRNA, complete cds. ACCESSION M95767 NID g180502 KEYWORDS chitobiase; di-N-acetylchitobiase. SOURCE Homo sapiens (library: lambda gt11) placenta cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1618) AUTHORS Fisher,K.J. and Aronson,N.N.Jr. TITLE Cloning and expression of the cDNA sequence encoding the lysosomal glycosidase di-N-acetylchitobiase JOURNAL J. Biol. Chem. 267, 19607-19616 (1992) MEDLINE 92406917 FEATURES Location/Qualifiers source 1..1618 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /tissue_lib="lambda gt11" sig_peptide 1..114 /note="putative" CDS 1..1158 /note="putative" /codon_start=1 /product="di-N-acetylchitobiase" /db_xref="PID:g180503" /translation="MSRPQLRRWRLVSSPPSGVPGLALLALLALLALRLAAGTDCPCP EPELCRPIRHHPDFEVFVFDVGQKTWKSYDWSQITTVATFGKYDSELMCYAHSKGARV VLKGDVSLKDIIDPAFRASWIAQKLNLAKTQYMDGINIDIEQEVNCLSPEYDALTALV KETTDSFHREIEGSQVTFDVAWSPKNIDRRCYNYTGIADACDFLFVMSYDEQSQIWSE CIAAANAPYNQTLTGYNDYIKMSINPKKLVMGVPWYGYDYTCLNLSEDHVCTIAKVPF RGAPCSDAAGRQVPYKTIMKQINSSISGNLWDKDQRAPYYNYKDPAGHFHQVWYDNPQ SISLKATYIQNYRLRGIGMWNANCLDYSGDAVAKQQTEEMWEVLKPKLLQR" mat_peptide 115..1155 /standard_name="chitobiase" /note="putative" /product="di-N-acetylchitobiase" 3'UTR 1159..1618 /note="putative" polyA_signal 1274..1279 /note="putative" polyA_signal 1278..1283 /note="putative" polyA_signal 1596..1601 /note="putative" BASE COUNT 494 a 317 c 323 g 484 t ORIGIN 1 atgtcccggc cgcagcttcg acgctggcgc ctcgtctcta gcccgccgag cggcgtcccg 61 ggtctagcgc tgctggcgct gctggcgctg ctggcgctgc ggctcgcggc cgggaccgac 121 tgcccatgcc cggagcctga gctctgccgc ccgattcgcc accatccaga tttcgaggtc 181 tttgtgtttg atgttggaca gaaaacttgg aaatcttatg attggtcaca gattacaact 241 gtggcaacat ttggaaaata tgactcagaa cttatgtgct acgctcattc aaaaggagcc 301 agagtagtac ttaaaggaga tgtatcctta aaggatatca ttgatcctgc tttcagagca 361 tcctggatag ctcaaaaact taatttggcc aaaacacaat atatggatgg aattaatata 421 gatatagagc aagaagttaa ttgtttatca cctgaatatg atgcattaac tgctttagtc 481 aaagaaacta cagactcttt ccatcgtgaa attgagggat cacaggtaac ctttgatgta 541 gcttggtctc caaagaacat agacagaaga tgctataatt atactggaat cgcagatgct 601 tgtgacttcc tctttgtgat gtcttatgat gaacaaagtc agatctggtc agaatgtatt 661 gcagcagcca atgctcccta taatcagaca ttaactggat ataatgacta catcaagatg 721 agcattaatc ctaagaaact tgtaatgggt gttccttggt atggttatga ttatacctgc 781 ctgaatctgt ctgaggatca tgtttgtacc attgcaaaag tccctttccg gggggctcct 841 tgtagtgacg ctgcaggacg tcaggtgccc tacaaaacga tcatgaagca aataaatagt 901 tctatttctg gaaacctatg ggataaagat cagcgggctc cttattataa ctataaagat 961 cctgctggcc actttcatca agtatggtat gataaccctc agagtatttc tttaaaggca 1021 acatatatac aaaactatcg cttacggggc attggcatgt ggaatgcaaa ctgtcttgac 1081 tactctggag atgctgtagc caaacagcaa actgaagaaa tgtgggaagt cttaaagcca 1141 aagctgttac agagatgaac atcttttgtc aaaccattaa gagttagaaa gatgatctgt 1201 atcaacagat ctagtttctt gcatttttat tatgttgcta tatacttttg ttatccgtat 1261 actaaaaaaa aagaataaat aaatgttttg attgtttgaa tttgaaaaat acacacgaat 1321 gtcctcagta tccaggaaca taaaggcaag aagcaagtca acttacctat taaatattcc 1381 tctattagat gtttcaacac tataatttaa ttgggaaaaa ttgctttcag aattttatta 1441 tgccatattt cccttcatta tagtaaaata tatgctcacg aatcaatgct gatttttaaa 1501 atatgtataa tctgaagtgg aaattgtttg cttagagttt ttaaaaacct agtctttgaa 1561 aagcagtttg tgctatactt ttcccccaac cctccaataa atcttaaatt taaaacct // LOCUS HUMCHRA 1863 bp mRNA PRI 01-NOV-1994 DEFINITION Human chromogranin A mRNA, complete cds. ACCESSION J03483 NID g180526 KEYWORDS chromogranin A. SOURCE Human pheochromocytoma. cDNA to mRNA, (library of A.Lamouroux and J.Mallet), clone hCgA/42. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1863) AUTHORS Konecki,D.S., Benedum,U.M., Gerdes,H.H. and Huttner,W.B. TITLE The primary structure of human chromogranin A and pancreastatin JOURNAL J. Biol. Chem. 262 (35), 17026-17030 (1987) MEDLINE 88059106 COMMENT Printed copy of sequence for [1] kindly provided by Konecki,D.S., 12/15/87. FEATURES Location/Qualifiers source 1..1863 /organism="Homo sapiens" /db_xref="taxon:9606" /map="14q32" mRNA <1..1863 /note="chromogranin A mRNA" sig_peptide 83..136 /gene="CHGA" /note="chromogranin A signal peptide" gene 83..1456 /gene="CHGA" CDS 83..1456 /gene="CHGA" /note="chromogranin A precursor" /codon_start=1 /db_xref="GDB:G00-119-777" /db_xref="PID:g180527" /translation="MRSAAVLALLLCAGQVTALPVNSPMNKGDTEVMKCIVEVISDTL SKPSPMPVSQECFETLRGDERILSILRHQNLLKELQDLALQGAKERAHQQKKHSGFED ELSEVLENQSSQAELKEAVEEPSSKDVMEKREDSKEAEKSGEATDGARPQALPEPMQE SKAEGNNQAPGEEEEEEEEATNTHPPASLPSQKYPGPQAEGDSEGLSQGLVDREKGLS AEPGWQAKREEEEEEEEEAEAGEEAVPEEEGPTVVLNPHPSLGYKEIRKGESRSEALA VDGAGKPGAEEAQDPEGKGEQEHSQQKEEEEEMAVVPQGLFRGGKSGELEQEEERLSK EWEDSKRWSKMDQLAKELTAEKRLEGQEEEEDNRDSSMKLSFRARAYGFRGPGPQLRR GWRPSSREDSLEAGLPLQVRGYPEEKKEEEGSANRRPEDQELESLSAIEAELEKVAHQ LQALRRG" mat_peptide 137..1456 /gene="CHGA" /note="chromogranin A" BASE COUNT 437 a 535 c 608 g 283 t ORIGIN Unreported. 1 ccggccgcca gtccagccgc ccctcgcccg gtgcctaggt gcccggcccc acaccgccag 61 ctgctcggcg cccgggtccg ccatgcgctc cgccgctgtc ctggctcttc tgctctgcgc 121 cgggcaagtc actgcgctcc ctgtgaacag ccctatgaat aaaggggata ccgaggtgat 181 gaaatgcatc gttgaggtca tctccgacac actttccaag cccagcccca tgcctgtcag 241 ccaggaatgt tttgagacac tccgaggaga tgaacggatc ctttccattc tgagacatca 301 gaatttactg aaggagctcc aagacctcgc tctccaaggc gccaaggaga gggcacatca 361 gcagaagaaa cacagcggtt ttgaagatga actctcagag gttcttgaga accagagcag 421 ccaggccgag ctgaaagagg cggtggaaga gccatcatcc aaggatgtta tggagaaaag 481 agaggattcc aaggaggcag agaaaagtgg tgaagccaca gacggagcca ggccccaggc 541 cctcccggag cccatgcagg agtccaaggc tgaggggaac aatcaggccc ctggggagga 601 agaggaggag gaggaggagg ccaccaacac ccaccctcca gccagcctcc ccagccagaa 661 atacccaggc ccacaggccg agggggacag tgagggcctc tctcagggtc tggtggacag 721 agagaagggc ctgagtgcag agccagggtg gcaggcaaag agagaagagg aggaggagga 781 ggaggaggag gctgaggctg gagaggaggc tgtccccgag gaagaaggcc ccactgtagt 841 gctgaacccc cacccgagcc ttggctacaa ggagatccgg aaaggcgaga gtcggtcgga 901 ggctctggct gtggatggag ctgggaagcc tggggctgag gaggctcagg accccgaagg 961 gaagggagaa caggagcact cccagcagaa agaggaggag gaggagatgg cagtggtccc 1021 gcaaggcctc ttccggggtg ggaagagcgg agagctggag caggaggagg agcggctctc 1081 caaggagtgg gaggactcca aacgctggag caagatggac cagctggcca aggagctgac 1141 ggctgagaag cggctggagg ggcaggagga ggaggaggac aaccgggaca gttccatgaa 1201 gctctccttc cgggcccggg cctacggctt caggggccct gggccgcagc tgcgacgagg 1261 ctggaggcca tcctcccggg aggacagcct tgaggcgggc ctgcccctcc aggtccgagg 1321 ctaccccgag gagaagaaag aggaggaggg cagcgcaaac cgcagaccag aggaccagga 1381 gctggagagc ctgtcggcca ttgaagcaga gctggagaaa gtggcccacc agctgcaggc 1441 actacggcgg ggctgagaca ccggctggca gggctggccc cagggcaccc tgtggccctg 1501 gctctgctgt ccccttggca ggtcctggcc agatggcccg gacgctgctt ccggtaggga 1561 ggcagcctcc agcctgccca agcccaggcc accctatcgc cccctacgcg ccttgtctcc 1621 tactcctgac tcctacctgc cctggaacat cctttgcagg gcagccccac aactttaaac 1681 attgacgatt ccttctctga acacaggcag ctttctagaa gtttcccttc ctccatccta 1741 tccactgggc acaactgcaa taacttctga ccttttggtg aaagctgaga actcctgact 1801 gtaacatatt ctgtatgaac tttatctaaa gaaaaataaa tctgttctgg gctctttcct 1861 ctg // LOCUS HUMCILA 1431 bp mRNA PRI 01-NOV-1994 DEFINITION Human lipoprotein-associated coagulation inhibitor mRNA, complete cds. ACCESSION J03225 NID g180545 KEYWORDS lipoprotein-associated coagulation inhibitor. SOURCE Human placenta, cDNA to mRNA, clone lambda-P9. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1431) AUTHORS Wun,T.C., Kretzmer,K.K., Girard,T.J., Miletich,J.P. and Broze,G.J. Jr. TITLE Cloning and characterization of a cDNA coding for the lipoprotein-associated coagulation inhibitor shows that it consists of three tandem Kunitz-type inhibitory domains JOURNAL J. Biol. Chem. 263 (13), 6001-6004 (1988) MEDLINE 88198127 COMMENT Draft entry and printed copy of sequence for [1] kindly provided by T.-C.Wun, 19-MAR-1988. FEATURES Location/Qualifiers source 1..1431 /organism="Homo sapiens" /db_xref="taxon:9606" /map="2q31-q32.1" sig_peptide 133..216 /gene="TFPI" /note="lipoprotein-associated coagulation inhibitor signal peptide" CDS 133..1047 /gene="TFPI" /note="lipoprotein-associated coagulation inhibitor precursor" /codon_start=1 /db_xref="GDB:G00-127-364" /db_xref="PID:g180546" /translation="MIYTMKKVHALWASVCLLLNLAPAPLNADSEEDEEHTIITDTEL PPLKLMHSFCAFKADDGPCKAIMKRFFFNIFTRQCEEFIYGGCEGNQNRFESLEECKK MCTRDNANRIIKTTLQQEKPDFCFLEEDPGICRGYITRYFYNNQTKQCERFKYGGCLG NMNNFETLEECKNICEDGPNGFQVDNYGTQLNAVNNSLTPQSTKVPSLFEFHGPSWCL TPADRGLCRANENRFYYNSVIGKCRPFKYSGCGGNENNFTSKQECLRACKKGFIQRIS KGGLIKTKRKRKKQRVKIAYEEIFVKNM" gene 133..1047 /gene="TFPI" mat_peptide 217..1044 /gene="TFPI" /note="lipoprotein-associated coagulation inhibitor" BASE COUNT 479 a 244 c 267 g 441 t ORIGIN 351 bp upstream of SspI site. 1 ggcgggtctg cttctaaaag aagaagtaga gaagataaat cctgtcttca atacctggaa 61 ggaaaaacaa aataacctca actccgtttt gaaaaaaaca ttccaagaac tttcatcaga 121 gattttactt agatgattta cacaatgaag aaagtacatg cactttgggc ttctgtatgc 181 ctgctgctta atcttgcccc tgcccctctt aatgctgatt ctgaggaaga tgaagaacac 241 acaattatca cagatacgga gttgccacca ctgaaactta tgcattcatt ttgtgcattc 301 aaggcggatg atggcccatg taaagcaatc atgaaaagat ttttcttcaa tattttcact 361 cgacagtgcg aagaatttat atatggggga tgtgaaggaa atcagaatcg atttgaaagt 421 ctggaagagt gcaaaaaaat gtgtacaaga gataatgcaa acaggattat aaagacaaca 481 ttgcaacaag aaaagccaga tttctgcttt ttggaagaag atcctggaat atgtcgaggt 541 tatattacca ggtattttta taacaatcag acaaaacagt gtgaacgttt caagtatggt 601 ggatgcctgg gcaatatgaa caattttgag acactggaag aatgcaagaa catttgtgaa 661 gatggtccga atggtttcca ggtggataat tatggaaccc agctcaatgc tgtgaataac 721 tccctgactc cgcaatcaac caaggttccc agcctttttg aatttcacgg tccctcatgg 781 tgtctcactc cagcagacag aggattgtgt cgtgccaatg agaacagatt ctactacaat 841 tcagtcattg ggaaatgccg cccatttaag tacagtggat gtgggggaaa tgaaaacaat 901 tttacttcca aacaagaatg tctgagggca tgtaaaaaag gtttcatcca aagaatatca 961 aaaggaggcc taattaaaac caaaagaaaa agaaagaagc agagagtgaa aatagcatat 1021 gaagaaattt ttgttaaaaa tatgtgaatt tgttatagca atgtaacatt aattctacta 1081 aatattttat atgaaatgtt tcactatgat tttctatttt tcttctaaaa tcgttttaat 1141 taatatgttc attaaatttt ctatgcttat tgtacttgtt atcaacacgt ttgtatcaga 1201 gttgcttttc taatcttgtt aaattgctta ttctaggtct gtaatttatt aactggctac 1261 tgggaaatta cttattttct ggatctatct gtattttcat ttaactacaa attatcatac 1321 taccggctac atcaaatcag tcctttgatt ccatttggtg accatctgtt tgagaatatg 1381 atcatgtaaa tgattatctc ctttatagcc tgtaaccaga ttaagccccc c // LOCUS HUMCK1 2408 bp mRNA PRI 23-JUL-1992 DEFINITION Human mRNA for choline kinase. ACCESSION D10704 NID g219540 KEYWORDS choline kinase. SOURCE Homo sapiens cell_line:glioblastoma cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2408) AUTHORS Hosaka,K., Tanaka,S., Nikawa,J. and Yamashita,S. TITLE Cloning of a human choline kinase cDNA by complementation of the yeast cki mutation JOURNAL FEBS Lett. 304 (2-3), 229-232 (1992) MEDLINE 92316236 REFERENCE 2 (bases 1 to 2408) AUTHORS Kodaki,T. TITLE Direct Submission JOURNAL Submitted (14-MAR-1992) to the DDBJ/EMBL/GenBank databases. Tsutomu Kodaki, Gunma University School of Medicine, Department of Biochemistry; Maebashi, Gunma 371, Japan (E-mail:tkodaki@ddbj.nig.ac.jp, Tel:0272-31-7221(ex.2558), Fax:0272-32-9278) COMMENT Submitted (14-MAR-1992) to DDBJ by: Tsutomu Kodaki Gunma University School of Medicine Department of Biochemistry Maebashi, Gunma 371 Japan Phone: 0272-31-7221 x2558 Email: tkodaki@ddbj.nig.ac.jp Fax: 0272-32-9278. FEATURES Location/Qualifiers source 1..2408 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="glioblastoma" gene 28..1398 /gene="hgck1" CDS 28..1398 /gene="hgck1" /EC_number="2.7.1.32" /codon_start=1 /product="choline kinase" /db_xref="PID:d1002022" /db_xref="PID:g219541" /translation="MKTKFCTGGEAEPSPLGLLLSCGSGSAAPAPGVGQQRDAASDLE SKQLAPTAALALPPPPPLPLPLPLPQPPPPQPPADEQPEPRARRRAYLWCKEFLPGAW RGLREDEFHISVIRGGLSNMLFQCSLPDTTATLGDEPRKVLLRLYGAILQMRSCNKEG SEQAQKENEFQGAEAMVLESVMFAILAERSLGPKLYGIFPQGRLEQFIPSRRLDTEEL SLPDISAEIAEKMATFHGMKMPFNKEPKWLFGTMEKYLKEVLRIKFTEESRIKKLHKL LSYNLPLELENLRSLLESTPSPVVFCHNDCQEGNILLLEGRENSEKQKLMLIDFEYSS YNYRGFDIGNHFCEWMYDYSYEKYPFFRANIRKYPTKKQQLHFISSYLPAFQNDFENL STEEKSIIKEEMLLEVNRFALASHFLWGLWSIVQAKISSIEFGYMDYAQARFDAYFHQ KRKLGV" BASE COUNT 568 a 620 c 643 g 577 t ORIGIN 1 ccgcgcctcc tcggccgcct gtcgggcatg aaaaccaaat tctgcaccgg gggcgaggcg 61 gagccctcgc cgctcgggct gctgctgagc tgcggtagcg gcagcgcggc cccggcgccc 121 ggcgtggggc agcagcgcga cgccgccagc gacctcgagt ccaagcagct ggcgccaaca 181 gccgcgctcg cgctgccccc tccgccgccg ctgccgctgc cgctgccgct gccccagccc 241 ccgccgccgc agccgcccgc agacgagcag ccggagcccc gggcgcggcg cagggcctat 301 ctgtggtgca aggagttcct gcccggcgcc tggcggggcc tccgcgagga cgagttccac 361 atcagtgtca tcagaggcgg ccttagcaac atgctgttcc agtgctccct acctgacacc 421 acagccaccc ttggtgatga gcctcggaaa gtgctcctgc ggctgtatgg agcgattttg 481 cagatgaggt cctgtaataa agagggatcc gaacaagctc agaaagaaaa tgaatttcaa 541 ggggctgagg ccatggttct ggagagcgtt atgtttgcca ttctcgcaga gaggtcactt 601 gggccaaaac tctatggcat ctttccccaa ggccgactgg agcagttcat cccgagccgg 661 cgattagata ctgaagaatt aagtttgcca gatatttctg cagaaatcgc cgagaaaatg 721 gctacatttc atggtatgaa aatgccattc aataaggaac caaaatggct ttttggcaca 781 atggaaaagt atctaaagga agtgctgaga attaaattta ctgaggaatc cagaattaaa 841 aagctccaca aattgctcag ttacaatctg cccttggaac tggaaaacct gagatcattg 901 cttgaatcta ctccatctcc agttgtattt tgtcataatg actgtcaaga aggtaatatc 961 ttgttgctgg aaggccgaga gaattctgaa aaacagaaac tgatgctcat tgatttcgaa 1021 tacagcagtt acaattacag gggattcgac attggaaatc acttctgtga gtggatgtat 1081 gattatagct atgaaaaata cccttttttc agagcaaaca tccggaagta tcccaccaag 1141 aaacaacagc tccattttat ttccagttac ttgcctgcat tccaaaatga ctttgaaaac 1201 ctcagtactg aagaaaaatc cattataaaa gaagaaatgt tgcttgaagt taataggttt 1261 gcccttgcat ctcatttcct ctggggactg tggtccattg tacaagccaa gatttcatct 1321 attgaatttg ggtacatgga ctacgcccaa gcaaggtttg atgcctattt ccaccagaag 1381 aggaagcttg gggtgtgact gtggggagga ctccatccac ctcatcactg gactgcatgg 1441 ggaggcagca gagcgcggtc ccctctgtgc ttcgactact gctcctgtgg caggaggctt 1501 tgggtggctc actactgaac acatgtgtat gatactaaag acggtattaa aatggagcga 1561 cgtttatttc atctcttgtt tacgatttca ctaggactca gaaacgagat cgggaagacg 1621 aaatatagtg caatagtgca acatctctga atccttttaa tctagagaag gcatttcata 1681 tttgggggct aaggtttcca gtcagatgag gcaaacagca agagtaagca gtgttacttg 1741 caggtacttt ggttaatgtt gactttaaat tttcatgaat gtgctggtga acactgtgac 1801 caggcttttg tagatggcga ctgtgttata gacggtgctc actcccaagg gacagcaagt 1861 gagcagagat gtactgcaaa gtcgccagtc actgcgtgca aggtggcctc tgcctggggc 1921 cgtccagaag ctgctccttt accctcttgg tcccatggct gaagcggagc agctggattg 1981 ctctggagca gccaaggccg ccactgtgga gacagagctc tcccctcctg ctgggcgtgt 2041 gtgacactgt agagtttcac tgtactcgat gtgacttctc ccctgccctt cctcctgatg 2101 gagtgtgcag acagccatgc gtggccacgg gggcagtgtg aggacctccc tgtctcccgc 2161 tcccctccca gggagcagct gcttgaccta gctctttggg cctctcctgc cctctgctct 2221 gcctggagtg tcggatcctg tgagtaggct gggcctcccc tgggcagggt tctccaaggc 2281 cggtttcccg gcccttacca aacctgatgc ccctgacatc atcattcttg tgggagacag 2341 cagcctgtat gtggtgtggg gcgtggatcg agtgtagctg tgaaatccat atatatgaaa 2401 tgtccaat // LOCUS HUMCKI 1480 bp mRNA PRI 14-MAY-1996 DEFINITION Homo sapiens cam kinase I mRNA, complete cds. ACCESSION L41816 NID g790789 KEYWORDS auto inhibitory domain; calcium/calmodulin; cam kinase I; protein kinase. SOURCE Homo sapiens. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1480) AUTHORS Haribabu,B., Hook,S.S., Selbert,M.A., Goldstein,E.G., Tomhave,E.D., Edelman,A.M., Snyderman,R. and Means,A.R. TITLE Human calcium-calmodulin dependent protein kinase I: cDNA cloning, domain structure and activation by phosphorylation at threonine-177 by calcium-calmodulin dependent protein kinase I kinase JOURNAL EMBO J. 14 (15), 3679-3686 (1995) MEDLINE 95369239 FEATURES Location/Qualifiers source 1..1480 /organism="Homo sapiens" /note="(vector lambda gt10)" /db_xref="taxon:9606" /cell_line="HL-60" 5'UTR 1..147 mRNA 1..1480 CDS 148..1260 /codon_start=1 /product="cam kinase I" /db_xref="PID:g790790" /translation="MLGAVEGPRWKQAEDIRDIYDFRDVLGTGAFSEVILAEDKRTQK LVAIKCIAKEALEGKEGSMENEIAVLHKIKHPNIVALDDIYESGGHLYLIMQLVSGGE LFDRIVEKGFYTERDASRLIFQVLDAVKYLHDLGIVHRDLKPENLLYYSLDEDSKIMI SDFGLSKMEDPGSVLSTACGTPGYVAPEVLAQKPYSKAVDCWSIGVIAYILLCGYPPF YDENDAKLFEQILKAEYEFDSPYWDDISDSAKDFIRHLMEKDPEKRFTCEQALQHPWI AGDTALDKNIHQSVSEQIKKNFAKSKWKQAFNATAVVRHMRKLQLGTSQEGQGQTASH GELLTPVAGGPAAGCCCRDCCVEPGTELSPTLPHQL" misc_feature 1005..1110 /function="auto-inhibitory and calmodulin binding domains" 3'UTR 1261..1480 BASE COUNT 341 a 410 c 436 g 289 t 4 others ORIGIN 1 gaattccgag caagagcgcg ggcgggtggc ccaggcacgc agcgggtgag gaccgcgccc 61 acagctcggc gccaaccacc gcgggcctcc cagccagccc cgcnnngagc cgcaggancc 121 ctggctgtgg tcggggggca gtgggccatg ctgggggcag tggaaggccc caggtggaag 181 caggcggagg acattagaga catctacgac ttccgagatg ttctgggcac gggggccttc 241 tcggaggtga tcctggcaga agataagagg acgcagaagc tggtggccat caaatgcatt 301 gccaaggagg ccctggaggg caaggaaggc agcatggaga atgagattgc tgtcctgcac 361 aagatcaagc accccaacat tgtagccctg gatgacatct atgagagtgg gggccacctc 421 tacctcatca tgcagctggt gtcgggtggg gagctctttg accgtattgt ggaaaaaggc 481 ttctacacgg agcgggacgc cagccgcctc atcttccagg tgctggatgc tgtgaaatac 541 ctgcatgacc tgggcattgt acaccgggat ctcaagccag agaatctgct gtactacagc 601 ctggatgaag actccaaaat catgatctcc gactttggcc tctccaagat ggaggacccg 661 ggcagtgtgc tctccaccgc ctgtggaact ccgggatacg tggcccctga agtcctggcc 721 cagaagccct acagcaaggc tgtggattgc tggtccatag gtgtcatcgc ctacatcttg 781 ctctgcggtt accctccctt ctatgacgag aatgatgcca aactctttga acagattttg 841 aaggccgagt acgagtttga ctctccttac tgggacgaca tctctgactc tgccaaagat 901 ttcatccggc acttgatgga gaaggaccca gagaaaagat tcacctgtga gcaggccttg 961 cagcacccat ggattgcagg agatacagct ctagataaga atatccacca gtcggtgagt 1021 gagcagatca agaagaactt tgccaagagc aagtggaagc aagccttcaa tgccacggct 1081 gtggtgcggc acatgaggaa actgcagctg ggcaccagcc aggaggggca ggggcagacg 1141 gcgagccatg gggagctgct gacaccagtg gctggggggc cggcagctgg ctgttgctgt 1201 cgagactgct gcgtggagcc gggcacagaa ctgtccccca cactgcccca ccagctctag 1261 ggccctggac ctcgggtcat gatcctctgc gtgggagggc ttgggggcca gcctgctccc 1321 cttccctccc tgaaccggga gtttctctgc cctgtcccct cctcacctgc ttccctacca 1381 ctcctcactg cattttccat acaaatgttt ctattttatt gttccttctt gtaataaagg 1441 gaagataaaa ccaaaaaaaa aaaaaaaaaa acggaattcc // LOCUS HUMCKMA 1562 bp mRNA PRI 01-NOV-1994 DEFINITION Human creatine kinase M mRNA, complete cds. ACCESSION M14780 NID g180575 KEYWORDS creatine kinase. SOURCE Human skeletal muscle, cDNA to mRNA (library of Gunning et al.), clone pHMCK1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1562) AUTHORS Perryman,M.B., Kerner,S.A., Bohlmeyer,T.J. and Roberts,R. TITLE Isolation and sequence analysis of a full-length cDNA for human M creatine kinase JOURNAL Biochem. Biophys. Res. Commun. 140 (3), 981-989 (1986) MEDLINE 87048887 FEATURES Location/Qualifiers source 1..1562 /organism="Homo sapiens" /db_xref="taxon:9606" /map="19q13.3" mRNA 1..1562 /note="CK mRNA" gene 79..1224 /gene="CKM" CDS 79..1224 /gene="CKM" /EC_number="2.7.3.2" /note="precursor" /codon_start=1 /db_xref="GDB:G00-120-591" /product="creatine kinase M" /db_xref="PID:g180576" /translation="MPFGNTHNKFKLNYKPEEEYPDLSKHNNHMAKVLTLELYKKLRD KEIPSGFTVDDVIQTGVDNPGHPFIMTVGCVAGDEESYEVFKELFDPIISDRHGGYKP TDKHKTDLNHENLKGGDDLDPNYVLSSPVRTGRSIKGYTLPPHCSRGERRAVEKLSVE ALNSLTGEFKGKYYPLKSMTEKEQQQLIDDHFQFDKPVSPLLLASGMARHWPDAPGIW HNDNKSFLVWVNEEDHLRVISMEKGGNMKEVFRRFCVGLQKIEEIFKKAGHPFMWNQH LGYVLTCPSNLGTGLRGGVHVKLAHLSKHPKFEEILTRLRLQKRGTGAVDTAAVGSVF DVSNADRLGSSEVEQVQLVVDGVKLMVEMEKKLEKGQSIDDMIPAQK" mat_peptide 82..1221 /gene="CKM" /note="creatine kinase M" BASE COUNT 359 a 489 c 425 g 289 t ORIGIN 136 bp upstream of RsaI site; chromosmoe 19q13. 1 gtgggtcagc atgtcacctc caggatacag acagcccccc ttcagcccag cccagccagg 61 tctccttaca ccgccaccat gccattcggt aacacccaca acaagttcaa gctgaattac 121 aagcctgagg aggagtaccc cgacctcagc aaacataaca accacatggc caaggtactg 181 acccttgaac tctacaagaa gctgcgggac aaggagatcc catctggctt cactgtagac 241 gatgtcatcc agacaggagt ggacaaccca ggtcacccct tcatcatgac cgtgggctgc 301 gtggctggtg atgaggagtc ctacgaagtt ttcaaggaac tctttgaccc catcatctcg 361 gatcgccacg ggggctacaa acccactgac aagcacaaga ctgacctcaa ccatgaaaac 421 ctcaagggtg gagacgacct ggaccccaac tacgtgctca gcagcccggt ccgcactggc 481 cgcagcatca agggctacac gttgccccca cactgctccc gtggcgagcg ccgggcggtg 541 gagaagctct ctgtggaagc tctcaacagc ctgacgggcg agttcaaagg gaagtactac 601 cctctgaaga gcatgacgga gaaggagcag cagcagctca tcgatgacca cttccagttc 661 gacaagcccg tgtccccgct gctgctggcc tcaggcatgg cccgccactg gcccgacgcc 721 cctggcatct ggcacaatga caacaagagc ttcctggtgt gggtgaacga ggaggatcac 781 ctccgggtca tctccatgga gaaggggggc aacatgaagg aggttttccg ccgcttctgc 841 gtagggctgc agaagattga ggagatcttt aagaaagctg gccacccctt catgtggaac 901 cagcacctgg gctacgtgct cacctgccca tccaacctgg gcactgggct gcgtggaggc 961 gtgcatgtga agctggcgca cctgagcaag caccccaagt tcgaggagat cctcacccgc 1021 ctgcgtctgc agaagagggg tacaggtgcg gtggacacag ctgccgtggg ctcagtattt 1081 gacgtgtcca acgctgatcg gctgggctcg tccgaagtag aacaggtgca gctggtggtg 1141 gatggtgtga agctcatggt ggaaatggag aagaagttgg agaaaggcca gtccatcgac 1201 gacatgatcc ccgcccagaa gtaggcgcct gcccacctgc caccgactgc tggaacccca 1261 gccagtggga gggcctggcc caccagagtc ctgctccctc actcctcgcc ccgccccctg 1321 tcccagagtc cacctggggg ctctctccac ccttctcaga gttccagttt caaccagagt 1381 tccaaccaat gggctccatc ctctggattc tggccaatga aatatctccc tggcagggtc 1441 ctcttctttt cccagagctc ctccccaacc aggagctcta gttaatggag agctcccagc 1501 acactcggac gcttgtgctt ttgtctccac gcaaacggat aaataaaagc attggtggcc 1561 tt // LOCUS HUMCLA 3461 bp mRNA PRI 17-SEP-1996 DEFINITION Human mRNA for clathrin-like protein, complete cds. ACCESSION D38293 NID g807814 KEYWORDS clathrin-like protein. SOURCE Homo spaiens fetal brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3461) AUTHORS Koyama,K., Sudo,K. and Nakamura,Y. TITLE Isolation of 115 human chromosome 8-specific expressed-sequence tags by exon amplification JOURNAL Genomics 26 (2), 245-253 (1995) MEDLINE 95324915 REFERENCE 2 (bases 1 to 3461) AUTHORS Nakamura,Y. TITLE Direct Submission JOURNAL Submitted (13-SEP-1994) to the DDBJ/EMBL/GenBank databases. Yusuke Nakamura, Cancer Institute, Department of Biochemistry; 1-37-1 Kami-Ikebukuro, Toshima-ku, Tokyo 170, Japan (E-mail:nakamura@ganvx1.jfcr.or.jp, Tel:03-3918-0111(ex.4501), Fax:03-3918-0342) FEATURES Location/Qualifiers source 1..3461 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="8" /clone="CLA20" /dev_stage="fetal" /map="8p11.2" /tissue_type="brain" CDS 91..1347 /codon_start=1 /usedin=D38703,D38713:ESTs /product="clathrin-like protein" /db_xref="PID:d1007994" /db_xref="PID:g807815" /translation="MIHSLFLINSSGDIFLEKHWKSVVSRSVCDYFFEAQERATEAEN VPPVIPTPHHYLLSVYRHKIFFVAVIQTEVPPLFVIEFLHRVVDTFQDYFGVCSEPVI KDNVVVVYEVLEEMLDNGFPLATESNILKELIKPPTILRTVVNTITGSTNVGDQLPTG QLSVVPWRRTGVKYTNNEAYFDVIEEIDAIIDKSGSTITAEIQGVIDACVKLTGMPDL TLSFMNPRLLDDVSFHPCVRFKRWESERILSFIPPDGNFRLLSYHVSAQNLVAIPVYV KHNISFRDSSSLGRFEITVGPKQTMGKTIEGVTVTSQMPKGVLNMSLTPSQGTHTFDP VTKMLSWDVGKINPQKLPSLKGTMSLQAGASKPDENPTINLQFKIQQLAISGLKVNRL DMYGEKYKPFKGIKYMTKAGKFQVRT" polyA_signal 3438..3443 polyA_site 3461 BASE COUNT 938 a 745 c 792 g 986 t ORIGIN 1 gctgagagca gcaaggccag agttgaaaac ttacagagtc ctgaggcttc agactgaaaa 61 aggcttcttc tgtcactgac aatcgccacc atgatccata gtcttttctt gatcaactcc 121 tctggagaca ttttcctgga gaaacattgg aaaagtgtgg tcagccgttc tgtttgtgat 181 tacttttttg aggcgcaaga gagagctact gaggcagaaa atgtgcctcc ggttatccct 241 acccctcacc actatctctt aagtgtttac cgccacaaga tcttttttgt ggccgtgatc 301 cagacggagg tcccccctct gtttgtcatt gagtttcttc accgagtggt ggacacattt 361 caggattatt ttggagtctg ttcagagcca gtgatcaaag acaatgtagt tgtggtttat 421 gaggtattgg aagagatgct tgacaatggt tttccattgg ctaccgagtc gaacattctt 481 aaagaactca taaagcctcc taccatcctt cgaacggttg tcaacaccat cacaggaagc 541 acgaatgtgg gtgaccagct tcccactggg cagctgtcag tggtgccttg gcgacggact 601 ggggtgaaat ataccaacaa tgaggcctat tttgatgtga ttgaagagat tgatgcaatt 661 attgataaat caggctccac aattactgct gagatccagg gggtgattga tgcctgtgtc 721 aagctgactg gcatgccaga ccttacactt tccttcatga accctaggtt gttggatgat 781 gtcagcttcc atccttgtgt tcgtttcaaa cgctgggaat ctgagcgcat cctctccttc 841 atccctcctg atggaaactt ccgcctgctg tcttaccatg tcagtgcaca gaatctggtt 901 gcaatcccag tgtatgtcaa acataacatc agtttccggg acagtagttc ccttggacgc 961 tttgaaataa cggtgggacc caagcagacg atggggaaga ccattgaggg agtgactgtc 1021 accagccaga tgcccaaggg ggtcctgaac atgagcctta ctccatcaca ggggacacac 1081 acattcgacc cagtcacaaa gatgctgtct tgggatgtag gaaaaataaa tccacaaaag 1141 ctaccaagtt tgaaggggac catgagtctt caggctggag cttccaaacc agatgaaaac 1201 cccacaatta acctgcagtt taagatccag cagctggcca tttctggact caaggtgaat 1261 cgtctggata tgtatggaga aaagtacaaa ccctttaagg gcataaaata catgaccaaa 1321 gctgggaagt tccaagttcg aacctgaagg gagcatttgc tgagggaata gtcttgcaca 1381 ttttttcatt tcttacttgt ctaaaagtaa aaaaaaatat cagcctgtct cctaggtcag 1441 tcccctcctg gacccacccg ctcccttttt tccttagcct tcagtgccat ggaactaatc 1501 aagggaggaa aaggtcacca gggagaactg gacagaactg aaacacagca acaccagttc 1561 tcaaggacaa ggtgtgtgat gggggtagga agcttggtgc ttatgtaacc attttaaacg 1621 tggtttctat aggaaagacc aacatttgtt tagcttgctt ggctttaatt atctaaagcc 1681 aatgaaagac ttctttgttg attttttaag atagaaagat taaaaagaaa gaaagatggg 1741 aagaaaatga atgtcagtca aggaaaggcc acttaaggat ctgctctatg aaggagaaga 1801 aagaggaaaa agaagtaaag gagttaggag gagacactta aatgaaagat tggcaagaaa 1861 agcagtcgga ctctacctta aggagggatg ggaaggaaag agtgtgttgg tttctttctg 1921 attcctctac tcatgtagaa aacacttgta cttctggaaa tggactggag actttttaaa 1981 tttgagtcca ctattgacat ggaaacccca gtggaatcag attttccctc aaagaccatg 2041 atggtatcgg actagttttc agacactgcc tgttgctgtc catcagcact tggtctctta 2101 tcttcagtga gaaggtgacc cgccttcttc ccatggtggc tgcctaaagt gcttcttttc 2161 taacccaaaa cagttctact cacttccttt tacagaattc accggccatt ttcctgttac 2221 ctgatccttc tacaggattt ttaaaaagta agagagtttc agagaagccg atcccataat 2281 ccccagtgca gccagggccc gtgtgtcatc gtactggagt agagggccga ctcttcccat 2341 gaaggtgagc acagctgtga gtgagtgagc tcatatctcc atttgtcagt gctggactgg 2401 taccagatcg taaccttccc gttggtcgga aacttttcca tctgtcaccc tagaaaaaga 2461 gaaagcttac catcgagggt gtggttgatc cttgaagctg cttggtaaag ttatcatttc 2521 tcagtctttt cttttgtact cctatcatgt attcattaat atctaccagt cccttttcat 2581 tctaagacaa aacatttctc ctaaatctct gaataaaatc agtgctgtag gaagatggac 2641 tgtgttgatc atgggtgtaa gcaacccagt ttaagaaaca tggcaactaa agggatacct 2701 caggctttct ttcccagtgg gtcatttttg tcctagttca gtgtgtctgt tactatttaa 2761 atatttatac aaaagggttt ttgtttatag cttaaggaat gatactgtgc tctgcttggt 2821 gcatggagaa aaaggaagac ccgtactctc cacaccctag agctttctct aaatattgtg 2881 caaagttttt gctagtttta tcttctgact ttggactagt ttttgcctgc agttgtgttg 2941 ctttgtgatt ttcctctggg atagtgtact gtacacaacc agatgtgttc cacactccgt 3001 gactccgcag tttgtcctgg agtgacatac acatccacca tggaaaggga agcatctctg 3061 ccgagattct aagtacttga agcacctctc ctgaggaacc tacaggattc taaggtttcc 3121 taggtcactg aacaactaat cttggtccct gaatatttct aggtttgtaa gtgcagcagt 3181 tttatttcct ctagaactca tcctgtttca agggaagtac ctaagaagat atagagtgtt 3241 tctagggtaa gggacctgca ggtgtaagca tagatgaaat aactgtcctg tcacatgtgc 3301 agcaggccat ggagtgtagc gggcatcgct gccgccattc ctgcagcatc accataagca 3361 gtgcagggtg tctccatcga gctgtttggt tccatgtgtg tttaacatgt gcagaagtag 3421 cttctctgtt taagtttaat aaagttgagt ttcaaccagt c // LOCUS HUMCLEAVE 1801 bp mRNA PRI 02-FEB-1993 DEFINITION Homo sapiens (clone pZ50-19) cleavage stimulation factor 50kDa subunit, complete cds. ACCESSION L02547 NID g180598 KEYWORDS cleavage stimulation factor; polyadenylation factor. SOURCE Homo sapiens (library: Stratagene lambda-ZAPII) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1801) AUTHORS Takagaki,Y. and Manley,J.L. TITLE A human polyadenylation factor is a G protein beta-subunit homologue JOURNAL J. Biol. Chem. 267, 23471-23474 (1992) MEDLINE 93054692 FEATURES Location/Qualifiers source 1..1801 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa cell subclone D98/AH2" /cell_type="uterine cervical carcinoma" /tissue_lib="Stratagene lambda-ZAPII" 5'UTR 1..181 CDS 182..1477 /note="50kDa subunit" /codon_start=1 /product="cleavage stimulation factor" /db_xref="PID:g180599" /translation="MYRTKVGLKDRQQLYKLIISQLLYDGYISIANGLINEIKPQSVC APSEQLLHLIKLGMENDDTAVQYAIGRSDTVAPGTGIDLEFDADVQTMSPEASEYETC YVTSHKGPCRVATYSRDGQLIATGSADASIKILDTERMLAKSAMPIEVMMNETAQQNM ENHPVIRTLYDHVDEVTCLAFHPTEQILASGSRDYTLKLFDYSKPSAKRAFKYIQEAE MLRSISFHPSGDFILVGTQHPTLRLYDINTFQCFVSCNPQDQHTDAICSVNYNSSANM YVTGSKDGCIKLWDGVSNRCITTFEKAHDGAEVCSAIFSKNSKYILSSGKDSVAKLWE ISTGRTLVRYTGAGLSGRQVHRTQAVFNHTEDYVLLPDERTISLCCWDSRTAERRNLL SLGHNNIVRCIVHSPTNPGFMTCSDDFRARFWYRRSTTD" 3'UTR 1478..1801 polyA_signal 1779..1784 polyA_site 1801 BASE COUNT 471 a 431 c 422 g 477 t ORIGIN 1 agaggagtgg gaccgatcga tagcgcagcg gtcgcttggc gccctttcag cgtgcgcagt 61 gaacgtgcgc tcggagcggt agattgggca ggattcgcgc ctccattttt ccaggagaga 121 gcgggatacc aagagaaccg gaccagctgc tggcagggaa actgtcttcc ttttctccaa 181 gatgtacaga accaaagtgg gcttgaagga ccgccagcag ctctacaagc tgatcattag 241 ccagctgcta tatgacggct acatcagcat cgccaatggc ctcatcaatg aaatcaagcc 301 tcagtctgtg tgtgcaccct cggagcagct cctgcatctc atcaaactcg gaatggaaaa 361 cgatgacacc gcagttcagt atgcaattgg tcgttcagat actgttgccc ctggcacagg 421 gattgacctg gaatttgatg cagatgttca gactatgtcc ccagaggctt ctgagtacga 481 aacatgctat gtcacatcac ataaaggacc atgccgtgta gctacctata gtagagatgg 541 acagttaata gctactgggt ctgctgatgc ttcgataaag atacttgaca cagagaggat 601 gttggccaaa agtgccatgc caatagaggt catgatgaat gagaccgcac aacaaaatat 661 ggaaaaccac ccagtgattc gaactcttta tgaccatgtg gatgaagtca cgtgccttgc 721 tttccaccca acagaacaga tcctggcttc tggttcaagg gattatactc ttaaattatt 781 tgattattcc aaaccatcag caaaaagagc cttcaaatac attcaggaag ctgaaatgtt 841 acgttccatc tcttttcatc cttctggaga ctttatactt gtcggaactc agcatcctac 901 tcttcgcctt tatgatatca acacctttca atgttttgtc tcttgcaatc ctcaagatca 961 acacaccgat gctatatgtt ccgttaatta caattctagt gccaatatgt acgtaactgg 1021 aagcaaggac ggctgcatca aattatggga tggtgtttca aatcgatgca tcacaacttt 1081 tgagaaagca catgacggtg ctgaagtttg ttctgccatt ttttccaaaa attctaaata 1141 cattctctca agtggaaaag actctgtagc taaactttgg gaaatatcaa cgggacgaac 1201 actggtcaga tacacgggcg cgggtttaag tggacgccag gtgcaccgga cacaggctgt 1261 gtttaaccac accgaggact atgtgttgct gcccgacgag aggacgatca gtctttgctg 1321 ctgggactcg aggacagccg agcggagaaa cctgctgtcg ttggggcaca acaatattgt 1381 acgctgcata gtgcactccc ccaccaaccc cgggttcatg acgtgcagcg atgacttcag 1441 agcgcggttt tggtaccgga gatcgaccac tgactgagcc accctctccg tagggttctt 1501 tctcgaggac tctaccctcc tcccccacgt cctgtctcag ctgcagtcgt aagtccgtgc 1561 accatccttg acgttttgct gccacctctg tccacattct tcttggattt gtataaaaga 1621 atcttttttt accttgatgt agaatcatgg tggaaaaagt tggaaacaca gatctgtgca 1681 gttctacatt cactgattat tacagtgtga ttttcatcgg ttttgtaagt acaggacttg 1741 ccgtttcttt tgatctcttg attgaaggag gatagggcat taaagtgctt ttgacatgag 1801 g // LOCUS HUMCLK1B 1834 bp mRNA PRI 23-JAN-1995 DEFINITION Homo sapiens clk1 mRNA, complete cds. ACCESSION L29219 NID g632963 KEYWORDS cdc-like kinase; cell division cycle protein; protein kinase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1834) AUTHORS Johnson,K.W. and Smith,K.A. TITLE Molecular cloning of a novel human cdc2/CDC28-like protein kinase JOURNAL J. Biol. Chem. 266 (6), 3402-3407 (1991) MEDLINE 91139618 REFERENCE 2 (bases 1 to 1834) AUTHORS Hanes,J.J., von der Kammer,H., Klaudiny,J.J. and Scheit,K.H. TITLE Characterization by cDNA cloning of two new human protein kinases: Evidence by sequence comparison for a new family of mammalian protein kinases JOURNAL J. Mol. Biol. 244, 665-672 (1994) MEDLINE 95082033 FEATURES Location/Qualifiers source 1..1834 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA 1..1834 /gene="clk1" gene 1..1834 /gene="clk1" CDS 156..1610 /gene="clk1" /note="clk1; putative" /codon_start=1 /db_xref="PID:g632964" /translation="MRHSKRTYCPDWDDKDWDYGKWRSSSSHKRRKRSHSSAQENKRC KYNHSKMCDSHYLESRSINEKDYHSRRYIDEYRNDYTQGCEPGHRQRDHESRYQNHSS KSSGRSGRSSYKSKHRIHHSTSHRRSHGKSHRRKRTRSVEDDEEGHLICQSGDVLSAR YEIVDTLGEGAFGKVVECIDHKAGGRHVAVKIVKNVDRYCEAARSEIQVLEHLNTTDP NSTFRCVQMLEWFEHHGHICIVFELLGLSTYDFIKENGFLPFRLDHIRKMAYQICKSV NFLHSNKLTHTDLKPENILFVQSDYTEAYNPKIKRDERTLINPDIKVVDFGSATYDDE HHSTLVSTRHYRAPEVILALGWSQPCDVWSIGCILIEYYLGFTVFPTHDSKEHLAMME RILGPLPKHMIQKTRKRKYFHHDRLDWDEHSSAGRYVSRACKPLKEFMLSQDVEHERL FDLIQKMLEYDPAKRITLREALKHPFFDLLKKSI" polyA_signal 1813..1818 /gene="clk1" polyA_site 1834 /gene="clk1" BASE COUNT 607 a 313 c 389 g 525 t ORIGIN 1 atttttagat aatcattaaa gaccacagaa aatgtaacag atcctactct tcaaaataat 61 tgctattcag tattaaaacg agcagtcagc tgcgtgattc ccgtgattgc gttacaagct 121 ttgtctcctt cgacttggag tctttgtcca ggacgatgag acactcaaag agaacttact 181 gtcctgattg ggatgacaag gattgggatt atggaaaatg gaggagcagc agcagtcata 241 aaagaaggaa gagatcacat agcagtgccc aggagaacaa gcgctgcaaa tacaatcact 301 ctaaaatgtg tgatagccat tatttggaaa gcaggtctat aaatgagaaa gattatcata 361 gtcgacgcta cattgatgag tacagaaatg actacactca aggatgtgaa cctggacatc 421 gccaaagaga ccatgaaagc cggtatcaga accatagtag caagtcttct ggtagaagtg 481 gaagaagtag ttataaaagc aaacacagga ttcaccacag tacttcacat cgtcgttcac 541 atgggaagag tcaccgaagg aaaagaacca ggagtgtaga ggatgatgag gagggtcacc 601 tgatctgtca gagtggagac gtactaagtg caagatatga aattgttgat actttaggtg 661 aaggagcttt tggaaaagtt gtggagtgca tcgatcataa agcgggaggt agacatgtag 721 cagtaaaaat agttaaaaat gtggatagat actgtgaagc tgctcgctca gaaatacaag 781 ttctggaaca tctgaataca acagacccca acagtacttt ccgctgtgtc cagatgttgg 841 aatggtttga gcatcatggt cacatttgca ttgtttttga actattggga cttagtactt 901 acgacttcat taaagaaaat ggttttctac catttcgact ggatcatatc agaaagatgg 961 catatcagat atgcaagtct gtgaattttt tgcacagtaa taagttgact cacacagact 1021 taaagcctga aaacatctta tttgtgcagt ctgactacac agaggcgtat aatcccaaaa 1081 taaaacgtga tgaacgcacc ttaataaatc cagatattaa agttgtagac tttggtagtg 1141 caacatatga tgacgaacat cacagtacat tggtatctac aagacattat agagcacctg 1201 aagttatttt agccctaggg tggtcccaac catgtgatgt ctggagcata ggatgcattc 1261 ttattgaata ctatcttggg tttaccgtat ttccaacaca cgatagtaag gagcatttag 1321 caatgatgga aaggattctt ggacctctac caaaacatat gatacagaaa accaggaaac 1381 gtaaatattt tcaccacgat cgattagact gggatgaaca cagttctgcc ggcagatatg 1441 tttcaagagc ctgtaaacct ctgaaggaat ttatgctttc tcaagatgtt gaacatgagc 1501 gtctctttga cctcattcag aaaatgttgg agtatgatcc agccaaaaga attactctca 1561 gagaagcctt aaagcatcct ttctttgacc ttctgaagaa aagtatatag atctgtaatt 1621 ggacagctct ctcgaagaga tcttacagac tgtatcagtc taatttttaa attttaagtt 1681 attttgtaca gctttgtaaa ttcttaacat ttttatattg ccatgtttat tttgtttggg 1741 taatttggtt cattaagtac atagctaagg taatgaacat ctttttcagt aattgtaaag 1801 tgatttattc agaataaatt ttttgtgctt atga // LOCUS HUMCLK3A 1665 bp mRNA PRI 23-JAN-1995 DEFINITION Homo sapiens clk3 mRNA, complete cds. ACCESSION L29220 NID g632969 KEYWORDS cdc-like kinase; cell division cycle protein; protein kinase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1665) AUTHORS Hanes,J.J., von der Kammer,H., Klaudiny,J.J. and Scheit,K.H. TITLE Characterization by cDNA cloning of two new human protein kinases: Evidence by sequence comparison for a new family of mammalian protein kinases JOURNAL J. Mol. Biol. 244, 665-672 (1994) MEDLINE 95082033 FEATURES Location/Qualifiers source 1..1665 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA 1..1665 /gene="clk3" /note="L29217:clk3 mRNA with 97 bp deletion; clk3-152" gene 1..1665 /gene="clk3" CDS 57..515 /gene="clk3" /note="clk3-152; putative" /codon_start=1 /db_xref="PID:g632970" /translation="MHHCKRYRSPEPDPYLSYRWKRRRSYSREHEGRLRYPSRREPPP RRSRSRSHDRLPYQRRYRERRDSDTYRCEERSPSFGEDYYGPSRSRHRRRSRERGPYR TRKHAHHCHKRRTRSCSSASSMRLWGTWVKAPLARWWSAWTMPEGSLRLP" polyA_signal 1641..1646 /gene="clk3" /note="clk3-152" polyA_site 1665 /gene="clk3" /note="clk3-152" BASE COUNT 417 a 469 c 455 g 324 t ORIGIN 1 tggggcactg gtacctccag gacctggagt gtactggaag aaatggtgca gtccagatgc 61 atcactgtaa gcgataccgc tcccctgaac cagacccgta cctgagctac cgatggaaga 121 ggaggaggtc ctacagtcgg gaacatgaag ggagactgcg atacccgtcc cgaagggagc 181 ctcccccacg aagatctcgg tccagaagcc atgaccgcct gccctaccag aggaggtacc 241 gggagcgccg tgacagcgat acataccggt gtgaagagcg gagcccatcc tttggagagg 301 actactatgg accttcacgt tctcgtcatc gtcggcgatc gcgggagagg gggccatacc 361 ggacccgcaa gcatgcccac cactgccaca aacgccgcac caggtcttgt agcagcgcct 421 cctcgatgag attgtgggga acctgggtga aggcaccttt ggcaaggtgg tggagtgctt 481 ggaccatgcc agagggaagt ctcaggttgc cctgaagatc atccgcaacg tgggcaagta 541 ccgggaggct gcccggctag aaatcaacgt gctcaaaaaa atcaaggaga aggacaaaga 601 aaacaagttc ctgtgtgtct tgatgtctga ctggttcaac ttccacggtc acatgtgcat 661 cgcctttgag ctcctgggca agaacacctt tgagttcctg aaggagaata acttccagcc 721 ttacccccta ccacatgtcc ggcacatggc ctaccagctc tgccacgccc ttagatttct 781 gcatgagaat cagctgaccc atacagactt gaaacctgag aacatcctgt ttgtgaattc 841 tgagtttgaa accctctaca atgagcacaa gagctgtgag gagaagtcag tgaagaacac 901 cagcatccga gtggctgact ttggcagtgc cacatttgac catgagcacc acaccaccat 961 tgtggccacc cgtcactatc gcccgcctga ggtgatcctt gagctgggct gggcacagcc 1021 ctgtgacgtc tggagcattg gctgcattct ctttgagtac taccggggct tcacactctt 1081 ccagacccac gaaaaccgag agcacctggt gatgatggag aagatcctag ggcccatccc 1141 atcacacatg atccaccgta ccaggaagca gaaatatttc tacaaagggg gcctagtttg 1201 ggatgagaac agctctgacg gccggtatgt gaaggagaac tgcaaacctc tgaagagtta 1261 catgctccaa gactccctgg agcacgtgca gctgtttgac ctgatgagga ggatgttaga 1321 atttgaccct gcccagcgca tcacactggc cgaggccctg ctgcacccct tctttgctgg 1381 cctgacccct gaggagcggt ccttccacac cagccgcaac ccaagcagat gacaggcaca 1441 ggccaccgca tgaggagatg gagggcggga ctgggccgcc cagccccttg actccagcct 1501 cgaccgccag ccccaggcca gagccaccca atgaacagtg caatgtgaag gaaggcagga 1561 gcctgcaggg gagcagactt ggtgcccagc tgccagaaag cacagatttg acccaagcta 1621 tttatatgtt ataaagttat aataaagtgt ttcttactgt ttgta // LOCUS HUMCLMF35 1026 bp mRNA PRI 17-MAY-1991 DEFINITION Human cytotoxic lymphocyte maturation factor 35 kDa subunit mRNA, complete cds. ACCESSION M65271 M38443 NID g180623 KEYWORDS cytotoxic lymphocyte maturation factor 35 kDa subunit; interleukin. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1026) AUTHORS Gubler,U., Chua,A.O., Schoenhaut,D.S., Dwyer,C.M., McComas,W., Motyka,R., Nabavi,N., Wolitzky,A.G., Quinn,P.M., Familletti,P.C. and Gately,M.K. TITLE Coexpression of two distinct genes is required to generate secreted bioactive cytotoxic lymphocyte maturation factor JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88, 4143-4147 (1991) MEDLINE 91239523 FEATURES Location/Qualifiers source 1..1026 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 170..829 /codon_start=1 /product="cytotoxic lymphocyte maturation factor 35 kDa subunit" /db_xref="PID:g180624" /translation="MCPARSLLLVATLVLLDHLSLARNLPVATPDPGMFPCLHHSQNL LRAVSNMLQKARQTLEFYPCTSEEIDHEDITKDKTSTVEACLPLELTKNESCLNSRET SFITNGSCLASRKTSFMMALCLSSIYEDLKMYQVEFKTMNAKLLMDPKRQIFLDQNML AVIDELMQALNFNSETVPQKSSLEEPDFYKTKIKLCILLHAFRIRAVTIDRVTSYLNA S" BASE COUNT 288 a 255 c 228 g 255 t ORIGIN 1 gaattcccag aaagcaagag accagagtcc cgggaaagtc ctgccgcgcc tcgggacaat 61 tataaaaatg tggccccctg ggtcagcctc ccagccaccg ccctcacctg ccgcggccac 121 aggtctgcat ccagcggctc gccctgtgtc cctgcagtgc cggctcagca tgtgtccagc 181 gcgcagcctc ctccttgtgg ctaccctggt cctcctggac cacctcagtt tggccagaaa 241 cctccccgtg gccactccag acccaggaat gttcccatgc cttcaccact cccaaaacct 301 gctgagggcc gtcagcaaca tgctccagaa ggccagacaa actctagaat tttacccttg 361 cacttctgaa gagattgatc atgaagatat cacaaaagat aaaaccagca cagtggaggc 421 ctgtttacca ttggaattaa ccaagaatga gagttgccta aattccagag agacctcttt 481 cataactaat gggagttgcc tggcctccag aaagacctct tttatgatgg ccctgtgcct 541 tagtagtatt tatgaagact tgaagatgta ccaggtggag ttcaagacca tgaatgcaaa 601 gcttctgatg gatcctaaga ggcagatctt tctagatcaa aacatgctgg cagttattga 661 tgagctgatg caggccctga atttcaacag tgagactgtg ccacaaaaat cctcccttga 721 agaaccggat ttttataaaa ctaaaatcaa gctctgcata cttcttcatg ctttcagaat 781 tcgggcagtg actattgaca gagtgacgag ctatctgaat gcttcctaaa aagcgaggtc 841 cctccaaacc gttgtcattt ttataaaact ttgaaatgag gaaactttga taggatgtgg 901 attaagaact agggaggggg aaagaaggat gggactatta catccacatg atacctctga 961 tcaagtattt ttgacattta ctgtggataa attgttttta agttttcatg aatgaattgc 1021 taagaa // LOCUS HUMCLMF40 1399 bp mRNA PRI 17-MAY-1991 DEFINITION Human cytotoxic lymphocyte maturation factor 40 kDa subunit mRNA, complete cds. ACCESSION M65272 M38443 M38444 NID g180625 KEYWORDS cytotoxic lymphocyte maturation factor 40 kDa subunit; interleukin. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1399) AUTHORS Gubler,U., Chua,A.O., Schoenhaut,D.S., Dwyer,C.M., McComas,W., Motyka,R., Nabavi,N., Wolitzky,A.G., Quinn,P.M., Familletti,P.C. and Gately,M.K. TITLE Coexpression of two distinct genes is required to generate secreted bioactive cytotoxic lymphocyte maturation factor JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88, 4143-4147 (1991) MEDLINE 91239523 FEATURES Location/Qualifiers source 1..1399 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 43..1029 /codon_start=1 /product="cytotoxic lymphocyte maturation factor 40 kDa subunit" /db_xref="PID:g180626" /translation="MCHQQLVISWFSLVFLASPLVAIWELKKDVYVVELDWYPDAPGE MVVLTCDTPEEDGITWTLDQSSEVLGSGKTLTIQVKEFGDAGQYTCHKGGEVLSHSLL LLHKKEDGIWSTDILKDQKEPKNKTFLRCEAKNYSGRFTCWWLTTISTDLTFSVKSSR GSSDPQGVTCGAATLSAERVRGDNKEYEYSVECQEDSACPAAEESLPIEVMVDAVHKL KYENYTSSFFIRDIIKPDPPKNLQLKPLKNSRQVEVSWEYPDTWSTPHSYFSLTFCVQ VQGKSKREKKDRVFTDKTSATVICRKNASISVRAQDRYYSSSWSEWASVPCS" BASE COUNT 390 a 310 c 333 g 366 t ORIGIN 1 ctgtttcagg gccattggac tctccgtcct gcccagagca agatgtgtca ccagcagttg 61 gtcatctctt ggttttccct ggtttttctg gcatctcccc tcgtggccat atgggaactg 121 aagaaagatg tttatgtcgt agaattggat tggtatccgg atgcccctgg agaaatggtg 181 gtcctcacct gtgacacccc tgaagaagat ggtatcacct ggaccttgga ccagagcagt 241 gaggtcttag gctctggcaa aaccctgacc atccaagtca aagagtttgg agatgctggc 301 cagtacacct gtcacaaagg aggcgaggtt ctaagccatt cgctcctgct gcttcacaaa 361 aaggaagatg gaatttggtc cactgatatt ttaaaggacc agaaagaacc caaaaataag 421 acctttctaa gatgcgaggc caagaattat tctggacgtt tcacctgctg gtggctgacg 481 acaatcagta ctgatttgac attcagtgtc aaaagcagca gaggctcttc tgacccccaa 541 ggggtgacgt gcggagctgc tacactctct gcagagagag tcagagggga caacaaggag 601 tatgagtact cagtggagtg ccaggaggac agtgcctgcc cagctgctga ggagagtctg 661 cccattgagg tcatggtgga tgccgttcac aagctcaagt atgaaaacta caccagcagc 721 ttcttcatca gggacatcat caaacctgac ccacccaaga acttgcagct gaagccatta 781 aagaattctc ggcaggtgga ggtcagctgg gagtaccctg acacctggag tactccacat 841 tcctacttct ccctgacatt ctgcgttcag gtccagggca agagcaagag agaaaagaaa 901 gatagagtct tcacggacaa gacctcagcc acggtcatct gccgcaaaaa tgccagcatt 961 agcgtgcggg cccaggaccg ctactatagc tcatcttgga gcgaatgggc atctgtgccc 1021 tgcagttagg ttctgatcca ggatgaaaat ttggaggaaa agtggaagat attaagcaaa 1081 atgtttaaag acacaacgga atagacccaa aaagataatt tctatctgat ttgctttaaa 1141 acgttttttt aggatcacaa tgatatcttt gctgtatttg tatagttaga tgctaaatgc 1201 tcattgaaac aatcagctaa tttatgtata gattttccag ctctcaagtt gccatgggcc 1261 ttcatgctat ttaaatattt aagtaattta tgtatttatt agtatattac tgttatttaa 1321 cgtttgtctg ccaggatgta tggaatgttt catactctta tgacctgatc catcaggatc 1381 agtccctatt atgcaaaat // LOCUS HUMCMIII 1167 bp mRNA PRI 28-FEB-1996 DEFINITION Human L-isoaspartyl/D-aspartyl protein carboxyl methyltransferase isozyme II mRNA, complete cds. ACCESSION M93008 NID g180636 KEYWORDS carboxyl methyltransferase isozyme II. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1167) AUTHORS MacLaren,D.C., Kagan,R.M. and Clarke,S. TITLE Alternative splicing of the human isoaspartyl protein carboxyl methyltransferase RNA leads to the generation of a C-terminal -RDEL sequence in isozyme II JOURNAL Biochem. Biophys. Res. Commun. 185 (1), 277-283 (1992) MEDLINE 92287106 REFERENCE 2 (bases 1 to 1167) AUTHORS Kagan,R.M. TITLE Direct Submission JOURNAL Submitted (07-MAY-1992) Ron M. Kagan, Chemistry and Biochemistry, UCLA, Los Angeles, CA 90095-1569, USA FEATURES Location/Qualifiers source 1..1167 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="2 year old" /sex="female" /tissue_type="cerebral cortex" 5'UTR 1..37 CDS 38..724 /note="isozyme II" /codon_start=1 /product="L-isoaspartyl/D-aspartyl protein carboxyl methyltransferase" /db_xref="PID:g180637" /translation="MAWKSGGASHSELIHNLRKNGIIKTDKVFEVMLATDRSHYAKCN PYMDSPQSIGFQATISAPHMHAYALELLFDQLHEGAKALDVGSGSGILTACFARMVGC TGKVIGIDHIKELVDDSVNNVRKDDPTLLSSGRVQLVVGDGRMGYAEEAPYDAIHVGA AAPVVPQALIDQLKPGGRLILPVGPAGGNQMLEQYDKLQDGSIKMKPLMGVIYVPLTD KEKQWSRDEL" 3'UTR 725..1167 BASE COUNT 352 a 212 c 267 g 336 t ORIGIN 1 ccaggtggtt ctgtacctgc tccgagtgtg cttagcgatg gcctggaaat ccggcggcgc 61 cagccactcg gagctaatcc acaatctccg caaaaatgga atcatcaaga cagataaagt 121 atttgaagtg atgctggcta cagaccgctc ccactatgca aaatgtaacc catacatgga 181 ttctccacaa tcaataggtt tccaagcaac aatcagtgct ccacacatgc atgcatatgc 241 gctagaactt ctatttgatc agttgcatga aggagctaaa gctcttgatg taggatctgg 301 aagtggaatc cttactgcat gttttgcacg tatggttgga tgtactggaa aagtcatagg 361 aattgatcac attaaagagc tagtagatga ctcagtaaat aatgtcagga aggacgatcc 421 aacacttctg tcttcaggga gagtacagct tgttgtgggg gatggaagaa tgggatatgc 481 tgaagaagcc ccttatgatg ccattcatgt gggagctgca gcccctgttg taccccaggc 541 gctaatagat cagttaaagc ccggaggaag attgatattg cctgttggtc ctgcaggcgg 601 aaaccaaatg ttggagcagt atgacaagct acaagatggc agcatcaaaa tgaagcctct 661 gatgggggtg atatacgtgc ctttaacaga taaagaaaag cagtggtcca gggatgaatt 721 gtaaaagcaa catcagcttg accagtataa aattacagtg gattgctcat ctcagtcctc 781 aaagcttttt gaaaaccaac accatcacag cttgttttgg actttgttac actgttattt 841 tcagcatgaa aatgtgtgtt tttttagggt ttctgattct tcaaagaggc acagagccaa 901 attggtagag gaaggatgca aagtataaat ttgtgtaata ttactttaac atgcccatat 961 tttacttgga aatattaaaa gaaagggttc tgtaaaatgg aaaacttagt ttgtgaattg 1021 attttgagga gtggtttttc ttttcttgga cacttaattc tgttctgata ttaatttatc 1081 agattgcttt tgtgcattgg ataacaccac cattcacaag ttaagattct tggtatttgg 1141 atatctgtta gatgctacta aaaaaaa // LOCUS HUMCMOS 1303 bp DNA PRI 01-NOV-1994 DEFINITION human humos gene homologous to transforming gene of mmsv. ACCESSION J00119 NID g180640 KEYWORDS c-myc proto-oncogene; mos oncogene. SOURCE human placental dna, clone lambdahm1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1303) AUTHORS Watson,R., Oskarsson,M. and Vande Woude,G.F. TITLE Human DNA sequence homologous to the transforming gene (mos) of Moloney murine sarcoma virus JOURNAL Proc. Natl. Acad. Sci. U.S.A. 79 (13), 4078-4082 (1982) MEDLINE 82275068 COMMENT human c-mos (humos) was aligned with mouse c-mos (mumos) dna, both homologs to v-mos of moloney murine sarcoma virus. extensive similarity was found. however, humos dna fragments were unable to transform mouse nih 3t3 cells. FEATURES Location/Qualifiers source 1..1303 /organism="Homo sapiens" /db_xref="taxon:9606" /map="8q11" gene 241..1281 /gene="MOS" CDS 241..1281 /gene="MOS" /note="c-mos transforming protein" /codon_start=1 /db_xref="GDB:G00-119-396" /db_xref="PID:g180641" /translation="MPSPLALRPYLRSEFSPSVDARPCSSPSELPAKLLLGATLPRAP RLPRRLAWCSIDWEQVCLLQRLGAGGFGSVYKATYRGVPVAIKQVNKCTKNRLASRRS FWAELNVARLRHDNIVRVVAASTRTPAGSNSLGTIIMEFGGNVTLHQVIYGAAGHPEG DAGEPHCRTGGQLSLGKCLKYSLDVVNGLLFLHSQSIVHLDLKPANILISEQDVCKIS DFGCSEKLEDLLCFQTPSYPLGGTYTHRAPELLKGEGVTPKADIYSFAITLWQMTTKQ APYSGERQHILYAVVAYDLRPSLSAAVFEDSLPGQRLGDVIQRCWRPSAAQRPSARLL LVDLTSLKAELG" BASE COUNT 265 a 382 c 381 g 275 t ORIGIN 5' to the ecor-i site at 5.4kb on the lambda hm1 fragment. 1 cggaagggaa atgctttcat ctgaaaggga tagctgtgct tcattccggt ttctccctcc 61 atctgataaa aactcttgct gagtgacagc acagatgtag ctcatttgga acaagtgaag 121 gaaaaggaga aaagggatga ggtggagcga aggagtagtc agtcatgttt ccaaagtccc 181 gcggtttccc ctagtctctt cattcactcc agcggccctg gtgtccccct gcaaagtgcg 241 atgccctcgc ccctggccct acgcccctac ctccggagcg agttttcccc atcggtggac 301 gcgcggccct gcagcagtcc ctcagagcta cctgcgaagc tgcttctggg ggccactctt 361 cctcgggccc cgcggctgcc gcgccggctg gcctggtgct ccattgactg ggagcaggtg 421 tgcttgctgc agaggctggg agctggaggg tttggctcgg tgtacaaggc gacttaccgc 481 ggtgttcctg tggccataaa gcaagtgaac aagtgcacca agaaccgact agcatctcgg 541 cggagtttct gggctgagct caacgtagca aggctgcgcc acgataacat cgtgcgcgtg 601 gtggctgcca gcacgcgcac gcccgcaggg tccaatagcc tagggaccat catcatggag 661 ttcggtggca acgtcacttt acaccaagtc atctatggcg ccgccggcca ccctgagggg 721 gacgcagggg agcctcactg ccgcactgga ggacagttaa gtttgggaaa gtgtctcaag 781 tactcactag atgttgtgaa cggcctgctc ttcctccact cgcaaagcat tgtgcacttg 841 gacctgaagc ccgcgaacat cttgatcagt gagcaggatg tctgtaaaat tagtgacttc 901 ggttgctctg agaagttgga agatctgctg tgcttccaga caccctctta ccctctagga 961 ggcacataca cccaccgcgc cccggagctc ctgaaaggag agggcgtgac gcctaaagcc 1021 gacatttatt cctttgccat cactctctgg caaatgacta ccaagcaggc gccgtattcg 1081 ggggagcggc agcacatact gtacgcggtg gtggcctacg acctgcgccc gtccctctcc 1141 gctgccgtct tcgaggactc gctccccggg cagcgccttg gggacgtcat ccagcgctgc 1201 tggagaccca gcgcggcgca gaggccgagc gcgcggctgc ttttggtgga tctcacctct 1261 ttgaaagctg aactcggctg actgaaaact tggtcaagat aag // LOCUS HUMCNC 3250 bp mRNA PRI 27-MAY-1992 DEFINITION Human Na+/Ca+ exchanger (CNC) mRNA, complete cds. ACCESSION M91368 NID g180672 KEYWORDS Na+/Ca+ exchanger. SOURCE Homo sapiens adult heart cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3250) AUTHORS Izumo,S., Philipson,K.D., Wenninger,K.E. and Komuro,I. TITLE Molecular cloning and characterization of the human cardiac Na+/Ca2+ exchanger cDNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89, 4769-4773 (1992) MEDLINE 92262521 FEATURES Location/Qualifiers source 1..3250 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="heart" sig_peptide 81..161 /gene="CNC" gene 81..3002 /gene="CNC" CDS 81..3002 /gene="CNC" /codon_start=1 /product="Na+/Ca+ exchanger" /db_xref="PID:g180673" /translation="MYNMRRLSLSPTFSMGFHLLVTVSLLFSHVDHVIAETEMEGEGN ETGECTGSYYCKKGVILPIWEPQDPSFGDKIARATVYFVAMVYMFLGVSIIADRFMSS IEVITSQEKEITIKKPNGETTKTTVRIWNETVSNLTLMALGSSAPEILLSVIEVCGHN FTAGDLGPSTIVGSAAFNMFIIIALCVYVVPDGETRKIKHLRVFFVTAAWSIFAYTWL YIILSVISPGVVEVWEGLLTFFFFPICVVFAWVADRRLLFYKYVYKRYRAGKQRGMII EHEGDRPSSKTEIEMDGKVVNSHVENFLDGALVLEVDERDQDDEEARREMARILKELK QKHPDKEIEQLIELANYQVLSQQQKSRAFYRIQATRLMTGAGNILKRHAADQARKAVS MHEVNTEVTENDPVSKIFFEQGTYQCLENCGTVALTIIRRGGDLTNTVFVDFRTEDGT ANAGSDYEFTEGTVVFKPGDTQKEIRVGIIDDDIFEEDENFLVHLSNVKVSSEASEDG ILEANHVSTLACLGSPSTATVTIFDDDHAGIFTFEEPVTHVSESIGIMEVKVLRTSGA RGNVIVPYKTIEGTARGGGEDFEDTCGELEFQNDEIVKTISVKVIDDEEYEKNKTFFL EIGEPRLVEMSEKKALLLNELGGFTITGKYLFGQPVFRKVHAREHPILSTVITIADEY DDKQPLTSKEEEERRIAEMGRPILGEHTKLEVIIEESYEFKSTVDKLIKKTNLALVVG TNSWREQFIEAITVSAGEDDDDDECGEEKLPSCFDYVMHFLTVFWKVLFAFVPPTEYW NGWACFIVSILMIGLLTAFIGDLASHFGCTIGLKDSVTAVVFVALGTSVPDTFASKVA ATQDQYADASIGNVTGSNAVNVFLGIGVAWSIAAIYHAANGEQFKVSPGTLAFSVTLF TIFAFINVGVLLYRRRPEIGGELGGPRTAKLLTSCLFVLLWLLYIFFSSLEAYCHIKG F" BASE COUNT 884 a 670 c 816 g 880 t ORIGIN 1 ccggaggatt ttgaggacac ttgtggagag ctcgaattcc agaatgatga aattgttagg 61 ttgtgacagt tggaagtgtc atgtacaaca tgcggcgatt aagtctttca cccacctttt 121 caatgggatt tcatctgtta gttactgtga gtctcttatt ttcccatgtg gaccatgtaa 181 ttgctgagac agaaatggaa ggagaaggaa atgaaactgg tgaatgtact ggatcatatt 241 actgtaagaa aggggtgatt ttgcccattt gggaacccca agacccttct tttggggaca 301 aaattgctag agctactgtg tattttgtgg ccatggtcta catgtttctt ggagtctcta 361 tcatagctga tcggttcatg tcctctatag aagtcatcac atctcaagaa aaagaaataa 421 ccataaagaa acccaatgga gagaccacca agacaactgt gaggatctgg aatgaaacag 481 tttctaacct gaccttgatg gccctgggat cttctgctcc tgagattctc ctttcagtaa 541 ttgaagtgtg tggccataac ttcactgcag gagacctcgg tcctagcacc atcgtgggaa 601 gtgctgcatt caatatgttc atcattattg cactctgtgt ttatgtggtg cctgacggag 661 agacaaggaa gattaagcat ttgcgtgtct tctttgtgac agcagcctgg agcatctttg 721 cctacacctg gctttacatt attttgtctg tcatatctcc tggtgttgtg gaggtctggg 781 aaggtttgct tactttcttc ttctttccca tctgtgttgt gttcgcttgg gtagcggata 841 ggagacttct gttttacaag tatgtctaca agaggtatcg agctggcaag cagaggggga 901 tgattattga acatgaagga gacaggccat cttctaagac tgaaattgaa atggacggga 961 aagtggtcaa ttctcatgtt gaaaatttct tagatggtgc tctggttctg gaggtggatg 1021 agagggacca agatgatgaa gaagctaggc gagaaatggc taggattctg aaggaactta 1081 agcagaagca tccagataaa gaaatagagc aattaataga attagctaac taccaagtcc 1141 taagtcagca gcaaaaaagt agagcatttt atcgcattca agctactcgc ctcatgactg 1201 gagctggcaa cattttaaag aggcatgcag ctgaccaagc aaggaaggct gtcagcatgc 1261 acgaggtcaa cactgaagtg actgaaaatg accctgttag taagatcttc tttgaacaag 1321 ggacatatca gtgtctggag aactgtggta ctgtggccct taccattatc cgcagaggtg 1381 gtgatttgac taacactgtg tttgttgact tcagaacaga ggatggcaca gcaaatgctg 1441 ggtctgatta tgaatttact gaaggaactg tggtgtttaa gcctggtgat acccagaagg 1501 aaatcagagt gggtatcata gatgatgata tctttgagga ggatgaaaat ttccttgtgc 1561 atctcagcaa tgtcaaagta tcttctgaag cttcagaaga tggcatactg gaagccaatc 1621 atgtttctac acttgcttgc ctcggatctc cctccactgc cactgtaact atttttgatg 1681 atgaccacgc aggcattttt acttttgagg aacctgtgac tcatgtgagt gagagcattg 1741 gcatcatgga ggtgaaagta ttgagaacat ctggagctcg aggaaatgtt atcgttccat 1801 ataaaaccat cgaagggact gccagaggtg gaggggagga ttttgaggac acttgtggag 1861 agctcgaatt ccagaatgat gaaattgtca aaacaatatc agtcaaggta attgatgatg 1921 aggagtatga gaaaaacaag accttcttcc ttgagattgg agagccccgc ctggtggaga 1981 tgagtgagaa gaaagccctg ttattgaatg agcttggtgg cttcacaata acaggaaaat 2041 acctgtttgg ccaacctgtc ttcaggaagg ttcatgctag agaacatccg attctctcta 2101 ctgtaatcac cattgcagac gaatatgatg acaagcagcc actgaccagc aaagaggaag 2161 aggagaggcg cattgcagaa atggggcgcc ccatcctggg agagcacacc aagttggaag 2221 tgatcattga agaatcctat gaattcaaga gtactgtgga caaactcatt aagaagacaa 2281 acctggccct tgtggttggg actaacagct ggagagaaca gttcattgaa gctatcactg 2341 tcagtgctgg ggaagatgat gacgacgatg aatgtgggga agagaagctg ccctcctgtt 2401 tcgattacgt gatgcacttt ctgactgtgt tctggaaggt cctgtttgcc ttcgtccccc 2461 ctactgaata ctggaatggc tgggcgtgtt tcattgtctc catcctcatg attggcctac 2521 tgacagcttt cattggagac ctggcttccc actttggctg caccattggc ctgaaagatt 2581 ctgtgactgc agtcgtgttc gtcgcacttg gaacatcagt gccagacaca tttgccagca 2641 aagtggcagc cacccaggac cagtatgcag acgcctccat aggtaacgtc acgggcagca 2701 acgcggtgaa tgtcttcctg ggaatcggtg tggcctggtc catcgctgcc atctaccacg 2761 cagccaatgg ggaacagttc aaagtgtccc ctggcacact agctttctct gtcactctct 2821 tcaccatttt tgctttcatc aatgtggggg tgctgctgta tcggcggagg ccagaaatcg 2881 gaggtgagct gggtgggccc cggactgcca agctcctcac atcctgcctc tttgtgctcc 2941 tatggctctt gtacattttc ttctcctccc tggaggccta ctgccacata aaaggcttct 3001 aaaggaacta tcagatatag taaatttata tatatacata tatatacata aaaattatgt 3061 ataatggaca gaggaaactg acatttgtca tgttcactta cctgctgatg gaatccagct 3121 tcaagagcat actctgtact agggccgaag taaaaaacca tcacctccca ttcccagggg 3181 catcatcatg ttcaacaagg catggaggca gggccatctt tgcagctcag tctagaaggg 3241 ctgcactctc // LOCUS HUMCNGCCA 3408 bp DNA PRI 01-MAY-1995 DEFINITION Homo sapiens clone hRCNC2b retinal rod cyclic nucleotide-gated cation channel gene, complete cds. ACCESSION L15296 NID g291913 KEYWORDS cyclic nucleotide-gated cation channel; retinal protein. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3408) AUTHORS Chen,T.Y., Peng,Y.W., Dhallan,R.S., Ahamed,B., Reed,R.R. and Yau,K.W. TITLE A new subunit of the cyclic nucleotide-gated cation channel in retinal rods JOURNAL Nature 362 (6422), 764-767 (1993) MEDLINE 93226050 REFERENCE 2 (bases 1 to 3408) AUTHORS Ahamed,B. TITLE Direct Submission JOURNAL Submitted (17-MAY-1993) Basheer Ahamed, Biomedical Engineering, Johns Hopkins School of Medicine, Baltimore, MD 21205, USA FEATURES Location/Qualifiers source 1..3408 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hRCNC2b" /tissue_type="retinal" CDS 105..2834 /codon_start=1 /product="cyclic nucleotide-gated cation channel" /db_xref="PID:g790511" /translation="MPRELSRIEEEKEDEEEEEEEEEEEEEEEVTEVLLDSCVVSQVG VGQSEEDGTRPQSTSDQKLWEEVGEEAKKEAEEKAKEEAEEVAEEEAEKEPQDWAETK EEPEAEAEAASSGVPATKQHPEVQVEDTDADSCPLMAEENPPSTVLPPPSPAKSDTLI VPSSASGTHRKKLPSEDDEAEELKALSPAESPVVAWSDPTTPKDTDGQDRAASTASTN SAIINDRLQELVKLFKERTEKVKEKLIDPDVTSDEESPKPSPAKKAPEPAPDTKPAEA EPVEEEHYCDMLCCKFKHRPWKKYQFPQSIDPLTNLMYVLWLFFVVMAWNWNCWLIPV RWAFPYQTPDNIHHWLLMDYLCDLIYFLDITVFQTRLQFVRGGDIITDKKDMRNNYLK SRRFKMDLLSLLPLDFLYLKVGVNPLLRLPRCLKYMAFFEFNSRLESILSKAYVYRVI RTTAYLLYSLHLNSCLYYWASAYQGLGSTHWVYDGVGNSYIRCYYFAVKTLITIGGLP DPKTLFEIVFQLLNYFTGVFAFSVMIGQMRDVVGAATAGQTYYRSCMDSTVKYMNFYK IPKSVQNRVKTWYEYTWHSQGMLDESELMVQLPDKMRLDLAIDVNYNIVSKVALFQGC DRQMIFDMLKRLRSVVYLPNDYVCKKGEIGREMYIIQAGQVQVLGGPDGKSVLVTLKA GSVFGEISLLAVGGGNRRTANVVAHGFTNLFILDKKDLNEILVHYPESQKLLRKKARR MLRSNNKPKEEKSVLILPPRAGTPKLFNAALAMTGKMGGKGAKGGKLAHLRARLKELA ALEAAAKHEELVEQAKSSQDVKGEEGSAAPDQHTHPKEAATDPPAPRTPPEPPGSPPS SPPPASLGSCEGEEEGPAEPEEHSVRICMSPGPEPGEQILSVKMPEEREEKAE" BASE COUNT 807 a 948 c 1008 g 645 t ORIGIN 1 gagattctgc tccagccaca gaagcagccg cagcccaggc ctagtgtctt ggcctcaagg 61 gcctcattgg gttcatctgt gcaccctgtg tccccagtgc tgccatgccc agagagctgt 121 cccggattga agaggagaaa gaagatgagg aggaggaaga ggaagaggag gaggaggagg 181 aagaggagga ggtgactgag gtgctgctgg atagctgtgt ggtgtcgcag gtgggcgtgg 241 gccagagtga agaagacggg acccggcccc agagcacttc agatcagaag ctgtgggagg 301 aagttgggga ggaggccaag aaggaggctg aagagaaggc caaggaggag gccgaggagg 361 tggctgaaga ggaggctgaa aaggagcccc aggactgggc ggagaccaag gaggagcctg 421 aggctgaggc cgaggctgcc agttcaggag tgcctgccac gaaacagcac ccagaagtgc 481 aggtggaaga tactgatgct gatagctgcc ccctcatggc agaagagaat ccaccctcaa 541 ccgtgttgcc gccaccgtct cctgccaaat cagacaccct tatagtccca agctcagcct 601 cggggacaca caggaagaag ctgccctctg aggatgatga ggctgaagag ctcaaggcgt 661 tgtcaccagc agagtcccca gtggttgcct ggtctgaccc caccaccccg aaggacactg 721 atggccagga ccgtgcggcc tccacggcca gcacaaatag cgccatcatc aacgaccggc 781 tccaggagct ggtgaagctc ttcaaggagc ggacagagaa agtgaaggag aaactcattg 841 accctgacgt cacctctgat gaggagagcc ccaagccctc cccagccaag aaagccccag 901 agccagctcc agacacaaag cccgctgaag ccgagccagt ggaagaggag cactattgcg 961 acatgctctg ctgcaagttc aaacaccgcc cctggaagaa gtaccagttt ccccagagca 1021 ttgacccgct gaccaacctg atgtatgtcc tatggctgtt cttcgtggtg atggcctgga 1081 attggaactg ttggctgatt cccgtgcgct gggccttccc ctaccagacc ccggacaaca 1141 tccaccactg gctgctgatg gattacctat gcgacctcat ctacttcctg gacatcaccg 1201 tgttccagac acgcctgcag tttgtcagag gcggggacat cattacggac aaaaaggaca 1261 tgcgaaataa ctacctgaag tctcgccgct tcaagatgga cctgctcagc ctcctgccct 1321 tggattttct ctatttgaaa gtcggtgtga accccctcct ccgcctgccc cgctgtttaa 1381 agtacatggc cttcttcgag tttaacagcc gcctggaatc catcctcagc aaagcctacg 1441 tgtacagggt catcaggacc acagcctacc ttctctacag cctgcatttg aattcctgtc 1501 tttattactg ggcatcggcc tatcagggcc tcggctccac tcactgggtt tacgatggcg 1561 tgggaaacag ttatattcgc tgttactact ttgctgtgaa gaccctcatc accatcgggg 1621 ggctgcctga ccccaagaca ctctttgaaa ttgtcttcca gctgctgaat tatttcacgg 1681 gcgtctttgc tttctctgtg atgatcggac agatgagaga tgtggtaggg gccgccaccg 1741 cgggacagac ctactaccgc agctgcatgg acagcacggt gaagtacatg aatttctaca 1801 agatccccaa gtccgtgcag aaccgcgtca agacctggta cgagtacacc tggcactcgc 1861 aaggcatgct ggatgagtca gagctgatgg tgcagcttcc agacaagatg cggctggacc 1921 tcgccatcga cgtgaactac aacatcgtta gcaaagtcgc actctttcag ggctgtgacc 1981 ggcagatgat ctttgacatg ctgaagaggc ttcgctctgt tgtctacctg cccaacgact 2041 atgtgtgcaa gaagggggag atcggccgtg agatgtacat catccaggca gggcaagtgc 2101 aggtcttggg cggccctgat gggaaatctg tgctggtgac gctgaaagct ggatctgtgt 2161 ttggagaaat aagcttgctg gctgttgggg gcgggaaccg gcgcacggcc aacgtggtgg 2221 cgcacgggtt taccaacctc ttcatcctgg ataagaagga cctgaatgag attttggtgc 2281 attatcctga gtctcagaag ttactccgga agaaagccag gcgcatgctg agaagcaaca 2341 ataagcccaa ggaggagaag agcgtgctga tccttccacc ccgggcgggc accccaaagc 2401 tcttcaacgc tgccctcgct atgacaggaa agatgggtgg caagggggca aaaggcggca 2461 aacttgctca cctccgggcc cggctcaaag aactggccgc gctggaggcg gctgcaaagc 2521 acgaagagtt ggtggaacag gccaagagct cgcaagacgt caagggagag gaaggctccg 2581 ccgccccaga ccagcacacg cacccaaagg aggccgccac cgacccaccc gcgccccgga 2641 cgccccccga gcccccgggg tctccaccga gctctccacc gcctgcctcc cttgggagct 2701 gcgagggaga ggaggagggg ccggccgagc ccgaagagca ctcggtgagg atctgcatga 2761 gcccgggccc ggagccggga gagcagatcc tgtcggtgaa gatgccggag gaaagggagg 2821 agaaggcgga gtaaggtggg gtgaggcgga tcccgcgcgc agttccagca ggtgtgtccc 2881 cagcgcccgc tgcgcccctc gccccagcgc cccaccttcc cccacggctc aagagaagat 2941 gcttttccgt agtcgtgacc tcagtggctg cagctctgac cgtcccgcca gcacgccagc 3001 cccgactcag ctcctcgcgg ggctgggcct gagctcgaca agttgcatca agtgttcgag 3061 tccctgagct ctcactatca tttgagagcc ctaccttttc ctgactgttc ctctttttaa 3121 gaacaaaatg attttccact tttaaagttc atgggtgagg gataaaatga ggctccaata 3181 catgggaagc tttgctgaaa aagtaaagtg ttattgaatg tgtggggttt cccctcagga 3241 atttgtctaa cacatttcaa ggatagaaaa tacttcactg ccgggcatgg tggctcatgc 3301 ctataatccc agtgctttgg gaagccgaag caggaggatc actggaggcc aagagttgga 3361 gagcagactg ggcaacatag cgagacctca tctcaaaccg gaattcgg // LOCUS HUMCNPDEA 1762 bp mRNA PRI 15-MAR-1989 DEFINITION Human 2',3'-cyclic nucleotide 3'-phosphodiesterase mRNA, complete cds. ACCESSION M19650 NID g180686 KEYWORDS 2',3'-cyclic-nucleotide 3'-phosphodiesterase; phosphodiesterase. SOURCE Human glioma (cell line U-251MG), cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1762) AUTHORS Kurihara,T., Takahashi,Y., Nishiyama,A. and Kumanishi,T. TITLE cDNA cloning and amino acid sequence of human brain 2', 3-cyclic-nucleotide 3'-phosphodiesterase JOURNAL Biochem. Biophys. Res. Commun. 152, 837-842 (1988) MEDLINE 88209067 COMMENT Draft entry and computer readable copy of sequence [1] kindly provided by T.Kurihara 06-JUL-1988. FEATURES Location/Qualifiers source 1..1762 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 14..1219 /note="2',3'-cyclic-nucleotide 3'-phosphodiesterase (EC 3.1.4.37)" /codon_start=1 /db_xref="PID:g180687" /translation="MSSSGAKDKPELQFPFLQDEDTVATLLECKTLFILRGLPGSGKS TLARVIVDKYRDGTKMVSADAYKITPGARGAFSEEYKRLDEDLAAYCRRRDIRILVLD DTNHERERLEQLFEMADQYQYQVVLVEPKTAWRLDCAQLKEKNQWQLSADDLKKLKPG LEKDFLPLYFGWFLTKKSSETLRKAGQVFLEELGNHKAFKKELRQFVPGDEPREKMDL VTYFGKRPPGVLHCTTKFCDYGKAPGAEEYAQQDVLKKSYSKAFTLTISALFVTPKTT GARVELSEQQLQLWPSDVDKLSPTDNLPRGSRAHITLGCAADVEAVQTGLDLLEILRQ EKGGSRGEEVGELSRGKLYSLGNGRWMLTLAKNMEVRAIFTGYYGKGKPVPTQGSRKG GALQSCTII" BASE COUNT 389 a 489 c 525 g 359 t ORIGIN 51 bp upstream of PstI site. 1 cttcttccgc aagatgtcat cctcaggggc caaggacaag cctgagctgc agtttccctt 61 ccttcaggat gaggacacag tggccacgct gctagagtgc aagacgctct tcatcttgcg 121 cggcctgcca ggaagcggca agtccacgct ggcacgggtc atcgtggaca agtaccgtga 181 tggcaccaag atggtgtcgg ctgacgctta caagatcacc cccggcgctc gaggagcctt 241 ctccgaggag tacaagcggc tcgatgagga cctggctgcc tactgccgcc gccgggacat 301 cagaattctt gtgcttgatg acaccaacca cgaacgggaa cggctggagc agctctttga 361 aatggccgac cagtaccagt accaggtggt gctggtggag cccaagacgg cgtggcggct 421 ggactgtgcc cagctcaagg agaagaacca gtggcagctg tcggctgatg acctgaagaa 481 gctgaagcct gggctggaga aggacttcct gccgctctac ttcggctggt tcctgaccaa 541 gaagagctct gagaccctcc gcaaagccgg ccaggtcttc ctggaagagc tggggaacca 601 caaggccttc aagaaggagc tgcgacaatt cgtccctggg gatgagccca gggagaagat 661 ggacttggtc acctactttg gaaagagacc cccaggcgtg ctgcattgca caaccaagtt 721 ttgtgactac gggaaggctc ccggggcaga ggagtacgct caacaagatg tgttaaagaa 781 atcttactcc aaggccttca cgctgaccat ctctgccctc tttgtgacac ccaagacgac 841 tggggcccgg gtggagttaa gcgagcagca actgcagttg tggccgagtg atgtggacaa 901 gctgtcaccc actgacaacc tgccgcgggg gagccgcgcc cacatcaccc tcggctgtgc 961 agctgacgta gaggccgtgc agacgggcct tgacctctta gagattctgc ggcaggagaa 1021 ggggggcagc cgaggcgagg aggtgggcga gctaagccgg ggcaagctct attccttggg 1081 caatgggcgc tggatgctga ccctggccaa gaacatggag gtcagggcca tcttcacggg 1141 gtactacggg aaaggcaaac ctgtgcccac gcaaggtagc cggaaggggg gcgccttgca 1201 gtcctgcacc atcatatgag tgttctcacc accacttatg cccctagaag ggaaggggag 1261 agggaaacgt gccctctgtt tgatccttgt tttgtgacat tttttttttt tttttttttt 1321 actcaaagtt aacctacctg taacttttta aaaacttgta aaataactga ccctcccttc 1381 ctgtccgccc tcttcccctc taatgctcac gctcccaaca caaggtgggc agggaggcac 1441 cattcaggaa cctggaccaa agctgacgag gctgggccaa gccagggatg gggccacagc 1501 cagaaccccg agccctactt ccaggttctg gttagctcag ccccagccca gcccagctgc 1561 tctgcccaga gctgggtgag tggggagaca cctcagagcc ccgcaaaacc cactgaccgg 1621 aggcaaaagg cagtggggct gggggtagtt ttccatggtc acagagaact agtggtggct 1681 ctgagaaggg gaggacctct gggctttgat tccatctcct tgtctttttt ctttgttttt 1741 agagacaggg tcctgctatt tc // LOCUS HUMCNR 2548 bp mRNA PRI 23-SEP-1996 DEFINITION Human calcineurin B mRNA, complete cds. ACCESSION M30773 NID g180704 KEYWORDS Ca2+/calmodulin-stimulated protein phosphatase; calcineurin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2548) AUTHORS Guerini,D., Krinks,M.H., Sikela,J.M., Hahn,W.E. and Klee,C.B. TITLE Isolation and sequence of a cDNA clone for human calcineurin B, the Ca2+-binding subunit of the Ca2+/calmodulin-stimulated protein phosphatase JOURNAL DNA 8 (9), 675-682 (1989) MEDLINE 90126237 FEATURES Location/Qualifiers source 1..2548 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain stem and basal ganglia" /clone_lib="library of R.Lazzarini" /clone="Hg2 and Hs1" gene 759..2532 /gene="CALNB" CDS 759..1271 /gene="CALNB" /codon_start=1 /product="calcineurin B" /db_xref="PID:g180705" /db_xref="GDB:13940" /translation="MGNEASYPLEMCSHFDADEIKRLGKRFKKLDLDNSGSLSVEEFM SLPELQQNPLVQRVIDIFDTDGNGEVDFKEFIEGVSQFSVKGDKEQKLRFAFRIYDMD KDGYISNGELFQVLKMMVGNNLKDTQLQQIVDKTIINADKDGDGRISFEEFCAVVGGL DIHKKMVVDV" polyA_signal 2527..2532 /gene="CALNB" BASE COUNT 696 a 561 c 553 g 738 t ORIGIN 1 aggctggggg acaaccagag gccagggaga aagaggagac agaggaagca ccgagggtga 61 ctacgttgtc ttccctagat caattttctt ctggatggct cgtgctgagt ggtagatgag 121 cgaatcgatg agtccagcca ctgtgaacat gcccccaatg atggcgcaca cacctgtcag 181 gaagtgggtg aaggacctgt gcttctccgt cagcttcacc atcatgggcg agagctcata 241 gaggacgaag actccgggaa ggccttggtc gcccaacagc ccattggcaa ccttctcatg 301 tctggtcaca gagaactgat ttgtcctcag tacctctccg tccaccttca tgtacacagt 361 gggcaccacc ttcacaaagt gctgtaacac ctgtgaaggc agcggctccg gcgcgagcgc 421 gaggctgcag cccccgagtt tcccggccgt cttcgccccc tctccccctc ctttcttctt 481 ctctgcctct cctgcctctc gccgctgctc ctcccgcgct ctccggctct gaatgtcgac 541 cttaatttat ttccccctac cctgcccgct ccctcgcgtg cccaatcgcc cggccggcgc 601 gggccccgcg cgccgcctcc ccctccccac gcgcgccccc tccccgccgg cgacccgagg 661 gccgcagctg ggccgccgcc gccgtttcct gcgagccagc ctgagcgcaa cacttctccg 721 agccagcgag ccagcgagcc gccgacccgc cgagcaaaat gggaaatgag gcaagttatc 781 ctttggaaat gtgctcacac tttgatgcgg atgaaattaa aaggctagga aagagattta 841 agaagcttga tttggacaat tctggttctt tgagtgtgga agagttcatg tctctgcctg 901 agttacaaca gaatccttta gtacagcgag taatagatat attcgacaca gatgggaatg 961 gagaagtaga ctttaaagaa ttcattgagg gcgtctctca gttcagtgtc aaaggagata 1021 aggagcagaa attgaggttt gctttccgta tctatgacat ggataaagat ggctatattt 1081 ccaatgggga actcttccag gtattgaaga tgatggtggg gaacaatctg aaagatacac 1141 agttacagca aattgtagac aaaaccataa taaatgcaga taaggatgga gatggaagaa 1201 tatcctttga agaattctgt gctgttgtag gtggcctaga tatccacaaa aagatggtgg 1261 tagatgtgtg actcttatca gagagtacca cccaacactt ttgctttctt ctccatctct 1321 gaagatctgc tcaagacgtc cagcaatgct ctctgtgtat ttaaatggaa gtatttttct 1381 ctgtgaagcc acattttcca acatgagcct catgaagcca actaagtgtt attgaactgt 1441 aattctctca ataactcagt gtagcacttt aaagtctgaa ggacagcaac atgaaaagag 1501 catatcaatg tggtggagaa agggaagggg ttggcttttt aatttatttt tcttcatctt 1561 ttataacaag aaagtatcta tatatacata tgtaaatatt tatatataga tatatgtagc 1621 tttctatata tgtagtaggt tggctttaat ttaatatcct tgattcagaa acaaaacaat 1681 agagtacaaa agtgccaagc agaacataaa acatccttac ttttatttca cacagtttta 1741 tatatagata gaagactgta caatttgagc cgggtgttag gccagctatt ttccttttcc 1801 tgtgctttct cctttgagag attgacaaag catttgttaa cgtcctatta tttaccttaa 1861 ttacattttt gtaacaaagg agtctgtaac tttatttata cttatgaata tatccaggga 1921 ctacttctca ttgctgagca gcttttaata cacctctgct tgaggagaaa gtctagttca 1981 ttgctactgc caagagctag ttcttgtgtt catatagtaa ctgcacaggg cttatagctg 2041 cttcattctg ctactttgta actaggagcc attgcattta ttaaatgtcc ctcagtaacg 2101 ttaagtgcta gttgtgattt tatacataaa ggccagaagc tgtctgaggc aatcatgatt 2161 gattgtatgt atcacttact gaagaatacc tgaagtgatc atgtaactac ttataaggga 2221 tatccatttg tttgattaca tgggtaaata atttgtcatt aaacttgtgt ttgaatcatg 2281 aattcccttg tttcaaaaga cttgcagcta atctaaaaaa ctggtgatat ttaatatgca 2341 tgtatgtatc taaacaccca catatatttg tggtttaagt gtgagaaatc ttgctaatct 2401 atatgccaca gaagagcaaa attgtatcca aatttatgcc acttaaattt ctttaccacg 2461 agggatagag catgcatact ggtttttttt tcttgatttg cccatataat tggtaatgga 2521 taacttaata aatttgtgtg atataaaa // LOCUS HUMCNRA1 2450 bp mRNA PRI 15-JUN-1990 DEFINITION Human calcineurin A1 mRNA, complete cds. ACCESSION M29550 NID g180706 KEYWORDS calcineurin A1. SOURCE Human brain stem basal ganglia, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2450) AUTHORS Guerini,D. and Klee,C.B. TITLE Cloning of human calcineurin A: Evidence for two isozymes identification of a polyproline structural domain JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9183-9187 (1989) MEDLINE 90083232 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.B.Klee 29-OCT-1989. FEATURES Location/Qualifiers source 1..2450 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 108..1652 /note="calcineurin A1" /codon_start=1 /db_xref="PID:g180707" /translation="MAAPEPARAAPPPPPPPPPPPGADRVVKAVPFPPTHRLTSEEVF DLDGIPRVDVLKNHLVKEGRVDEEIALRIINEGAAILRREKTMIEVEAPITVCGDIHG QFFDLMKLFEVGGSPANTRYLFLGDYVDRGYFSIEHVLGTEDISINPHNNINECVLYL WVLKILYPSTLFLLRGNHECRHLTEYFTFKQECKIKYSERVYEACMEAFDSLPLAALL NQQFLCVHGGLSPEIHTLDDIRRLDRFKEPPAFGPMCDLLWSDPSEDFGNEKSQEHFS HNTVRGCSYFYNYPAVCEFLQNNNLLSIIRAHEAQDAGYRMYRKSQTTGFPSLITIFS APNYLDVYNNKAAVLKYENNVMNIRQFNCSPHPYWLPNFMDVFTWSLPFVGEKVTEML VNVLSICSDDELMTEGEDQFDGSAAARKEIIRNKIRAIGKMARVFSVLREESESVLTL KGLTPTGMLPSGVLAGGRQTLQSGNDVMQLAVPQMDWGTPHSFANNSHNACREFLLFF SSCLSS" BASE COUNT 677 a 481 c 522 g 770 t ORIGIN 1 agagggtccg ccatgttccc cggcggcgcc gccgcttggc tctggtagcc gccgcccccg 61 cccccaaccc cgcccggccc agagcctagc cgagccccgg gcccagcatg gccgccccgg 121 agccggcccg ggctgcaccg cccccacccc cgcccccgcc gccccctccc ggggctgacc 181 gcgtcgtcaa agctgtccct ttccccccaa cacatcgctt gacatctgaa gaagtatttg 241 atttggatgg gatacccagg gttgatgttc tgaagaacca cttggtgaaa gaaggtcgag 301 tagatgaaga aattgcgctt agaattatca atgagggtgc tgccatcctt cggagagaga 361 aaaccatgat agaagtagaa gctccaatca cagtgtgtgg tgacatccat ggccaatttt 421 ttgatctgat gaaacttttt gaagtaggag gatcacctgc taatacacga tacctttttc 481 ttggcgatta tgtggacaga ggttatttta gtatagagca tgttctaggc actgaagaca 541 tatcgattaa tcctcacaat aatattaatg agtgtgtctt atatttatgg gttctgaaga 601 ttctataccc aagcacatta tttcttctga gaggcaacca tgaatgcaga caccttactg 661 aatattttac ctttaagcag gaatgtaaaa ttaagtattc ggaaagagtc tatgaagctt 721 gtatggaagc ttttgatagt ttgcctcttg ctgcactttt aaaccaacag tttctttgtg 781 ttcatggtgg actttcacca gaaatacaca cactggatga tattaggaga ttagatagat 841 tcaaagagcc acctgcattt ggaccaatgt gtgacttgtt atggtccgat ccttctgaag 901 attttggaaa tgaaaaatca caggaacatt ttagtcacaa tacagttcga ggatgttctt 961 atttttataa ctatccagca gtgtgtgaat ttttgcaaaa caataatttg ttatcgatta 1021 ttagagctca tgaagctcaa gatgcaggct atagaatgta cagaaaaagt caaactacag 1081 ggttcccttc attaataaca attttttcgg cacctaatta cttagatgtc tacaataata 1141 aagctgctgt attaaagtat gaaaataatg tgatgaatat tcgacagttt aactgttctc 1201 cacatcctta ctggttgcct aattttatgg atgtcttcac gtggtcttta ccgtttgttg 1261 gagaaaaagt gacagaaatg ttggtaaatg ttctgagtat ttgctctgat gatgaactaa 1321 tgactgaagg tgaagaccag tttgatggtt cagctgcagc ccggaaagaa atcataagaa 1381 acaaaattcg agcaattggc aagatggcaa gagtcttctc tgttctcagg gaggagagtg 1441 aaagtgtgct gacactcaag ggcctgactc ccacagggat gttgcctagt ggagtgttag 1501 ctggaggacg gcagaccctg caaagtggta atgatgttat gcaacttgct gtgcctcaga 1561 tggactgggg cacacctcac tcttttgcta acaattcaca taatgcatgc agggaattcc 1621 ttctgttttt tagttcctgt ctcagcagct gacctagaca gggtagtgta ttagctagtg 1681 tctcattaat acgtgatcag ggcagaaaac tgatagaatg ggtattcctt tcaattgaaa 1741 ataatggtca gttcctcagc ttttcatgaa atgatatggg agcagctcat atcataatgt 1801 ctgaaatatt tatttattca tctgtctaat tcaccctttt cttttaaaag ccccagtttc 1861 agaatgtgaa tcagggatat tcctgttact aaaatggaaa tgtaattcca agtttctttt 1921 ttaatttttt aaatttatgt cattgtattg gactatgctt atatttaaaa ctacttaatt 1981 tagagttaac tacctgctta ggccccagaa cattacttat gcccttcagt taccaaaaga 2041 tttgtgcaag gttttgtacc ctggtaaatg atgccaaagt ttgttttctg tggtgtttgt 2101 caaatgttct atgtataatt aactgtctgt aacatgctgt ttccttcctc tgcagatgta 2161 gctgctttcc taaatctgtc tgtctttctt taggttagct gtatgtctgt aaaagtatgt 2221 tcaattaaat tactccatca gacacttgtc tgtcttgcaa tgtagaagca gctttgtagc 2281 accttgtttt gaggtttgct gcatttgttg ctgcactttg tgcattctga acatgaatgt 2341 aacattagat attaagtcat tgttataagg ggttgaattt aaatcctgta agtcaaaatt 2401 gaaagggtgt tattaagtgt gcctttattt tgcatgaaaa taaaaagaat // LOCUS HUMCNTFR 1566 bp mRNA PRI 10-JUL-1992 DEFINITION Human ciliary neurotrophic factor receptor (CNTFR) mRNA, complete cds. ACCESSION M73238 NID g180710 KEYWORDS ciliary neurotrophic factor receptor. SOURCE Homo sapiens (library: pCMX expression) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1566) AUTHORS Davis,S., Aldrich,T.H., Valenzuela,D.M., Wong,V., Furth,M.E., Squinto,S.P. and Yancopoulos,G.D. TITLE The receptor for ciliary neurotrophic factor JOURNAL Science 253, 59-63 (1991) MEDLINE 91289158 FEATURES Location/Qualifiers source 1..1566 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="SH-SY5Y" /cell_type="neuroblastoma" /tissue_lib="pCMX expression" CDS 264..1382 /standard_name="CNTFR gene" /codon_start=1 /product="ciliary neurotrophic factor receptor" /db_xref="PID:g180711" /translation="MAAPVPWACCAVLAAAAAVVYAQRHSPQEAPHVQYERLGSDVTL PCGTANWDAAVTWRVNGTDLAPDLLNGSQLVLHGLELGHSGLYACFHRDSWHLRHQVL LHVGLPPREPVLSCRSNTYPKGFYCSWHLPTPTYIPNTFNVTVLHGSKIMVCEKDPAL KNRCHIRYMHLFSTIKYKVSISVSNALGHNATAITFDEFTIVKPDPPENVVARPVPSN PRRLEVTWQTPSTWPDPESFPLKFFLRYRPLILDQWQHVELSDGTAHTITDAYAGKEY IIQVAAKDNEIGTWSDWSVAAHATPWTEEPRHLTTEAQAAETTTSTTSSLAPPPTTKI CDPGELGSGGGPSAPFLVSVPITLALAAAAATASSLLI" BASE COUNT 283 a 530 c 467 g 286 t ORIGIN 1 gcggcggcag cggaggcggc ggctccagcc ggcgcggcgc gaggctcggc ggtgggatcc 61 ggcgggcggt gctagctccg cgctccctgc ctcgctcgct gccgggggcg gtcggaaggc 121 gcggcgcgaa gcccgggtgg cccgagggcg cgactctagc cttgtcacct catcttgccc 181 ccttggtttt ggaagtcctg aagagttggt ctggaggagg aggaggacat tgatgtgctt 241 ggtgtgtggc cagtggtgaa gagatggctg ctcctgtccc gtgggcctgc tgtgctgtgc 301 ttgccgccgc cgccgcagtt gtctacgccc agagacacag tccacaggag gcaccccatg 361 tgcagtacga gcgcctgggc tctgacgtga cactgccatg tgggacagca aactgggatg 421 ctgcggtgac gtggcgggta aatgggacag acctggcccc tgacctgctc aacggctctc 481 agctggtgct ccatggcctg gaactgggcc acagtggcct ctacgcctgc ttccaccgtg 541 actcctggca cctgcgccac caagtcctgc tgcatgtggg cttgccgccg cgggagcctg 601 tgctcagctg ccgctccaac acttacccca agggcttcta ctgcagctgg catctgccca 661 cccccaccta cattcccaac accttcaatg tgactgtgct gcatggctcc aaaattatgg 721 tctgtgagaa ggacccagcc ctcaagaacc gctgccacat tcgctacatg cacctgttct 781 ccaccatcaa gtacaaggtc tccataagtg tcagcaatgc cctgggccac aatgccacag 841 ctatcacctt tgacgagttc accattgtga agcctgatcc tccagaaaat gtggtagccc 901 ggccagtgcc cagcaaccct cgccggctgg aggtgacgtg gcagaccccc tcgacctggc 961 ctgaccctga gtcttttcct ctcaagttct ttctgcgcta ccgacccctc atcctggacc 1021 agtggcagca tgtggagctg tccgacggca cagcacacac catcacagat gcctacgccg 1081 ggaaggagta cattatccag gtggcagcca aggacaatga gattgggaca tggagtgact 1141 ggagcgtagc cgcccacgct acgccctgga ctgaggaacc gcgacacctc accacggagg 1201 cccaggctgc ggagaccacg accagcacca ccagctccct ggcaccccca cctaccacga 1261 agatctgtga ccctggggag ctgggcagcg gcgggggacc ctcggcaccc ttcttggtca 1321 gcgtccccat cactctggcc ctggctgccg ctgccgccac tgccagcagt ctcttgatct 1381 gagcccggca ccccatgagg acatgcagag cacctgcaga ggagcaggag gccggagctg 1441 agcctgcaga ccccggtttc tattttgcac acgggcagga ggaccttttg cattctcttc 1501 agacacaatt tgtggagacc ccggcgggcc cgggcctgcc gccccccagc cctgccgcac 1561 caagct // LOCUS HUMCO1 2333 bp mRNA PRI 18-JAN-1996 DEFINITION Human mRNA for coproporphyrinogen oxidase, complete cds. ACCESSION D16611 NID g469488 KEYWORDS coproporphyrinogen oxidase. SOURCE Homo sapiens placenta cDNA to mRNA, clone_lib:lambda gt11. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2333) AUTHORS Taketani,S., Kohno,H., Furukawa,T., Yoshinaga,T. and Tokunaga,R. TITLE Molecular cloning, sequencing and expression of cDNA encoding human coproporphyrinogen oxidase JOURNAL Biochim. Biophys. Acta 1183 (3), 547-549 (1994) MEDLINE 94114558 REFERENCE 2 (bases 1 to 2333) AUTHORS Taketani,S. TITLE Direct Submission JOURNAL Submitted (06-JUL-1993) to the DDBJ/EMBL/GenBank databases. Shigeru Taketani, Kansai Medical University, Dept. of Hygiene; 10-15 Fumizono-cho, Moriguchi, Osaka 570, Japan (Tel:06-992-1001(ex.2504), Fax:06-992-3522) COMMENT Submitted (06-JUL-1993) to DDBJ by: Shigeru Taketani Department of Hygiene Medical University Mriguchi Osaka 570 Japan Phone: 06-992-1001 x2504 Fax: 06-992-3522. FEATURES Location/Qualifiers source 1..2333 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda gt11" /tissue_type="placenta" sig_peptide 94..186 /evidence=experimental CDS 94..1158 /EC_number="1.3.3.3" /codon_start=1 /product="coproporphyrinogen oxidase" /db_xref="PID:d1004551" /db_xref="PID:g840693" /translation="MLPKTSGTRATSLGRPEEEEDELAHRCSSFMAPPVTDLGELRRR PGDMKTKMELLILETQAQVCQALAQVDGGANFSVDRWERKEGGGGISCVLQDGCVFEK AGVSISVVHGNLSEEAAKQMRSRGKVLKTKDGKLPFCAMGVSSVIHPKNPHAPTIHFN YRYFEVEEADGNKQWWFGGGCDLTPTYLNQEDAVHFHRTLKEACDQHGPDLYPKFKKW CDDYFFIAHRGERRGIGGIFFDDLDSPSKEEVFRFVQSCARAVVPSYIPLVKKHCDDS FTPQEKLWQQLRRGRYVEFNLLYDRGTKFGLFTPGSRIESILMSLPLTARWEYMHSPS ENSKEAEILEVLRHPRDWVR" mat_peptide 187..1155 /evidence=experimental /product="coproporphyrinogen oxidase" BASE COUNT 608 a 436 c 576 g 713 t ORIGIN 1 gaattcgggt gggtggggac agggctggcc gcggcgctgg cggggttggt ggggctggcc 61 accgccgcct tcgggcatgt gcagcgggcg gagatgttgc ctaagacctc ggggacgcgg 121 gccacttcgc tggggaggcc ggaggaggag gaggatgagc tggcccaccg ctgcagcagc 181 ttcatggccc cgcctgtgac cgacctgggc gagctgcgaa ggaggccggg cgacatgaag 241 accaagatgg agctgctgat tctggagacc caggcccagg tgtgccaggc tctggcacag 301 gtagacgggg gcgccaactt ttctgtggac cggtgggaga ggaaggaagg aggtggcggc 361 atcagctgtg tacttcaaga tgggtgtgtt ttcgaaaagg ctggggtgag catttctgtt 421 gttcatggaa atctttcaga ggaagctgca aaacaaatga gaagcagagg aaaagttctg 481 aagactaaag atggtaaatt gccattttgt gctatgggcg tgagctctgt tatccacccc 541 aagaatcctc atgctcctac tatccatttc aactacagat actttgaagt agaagaagct 601 gatggcaaca agcagtggtg gtttggtggt ggatgtgacc tcactccaac atacttgaat 661 caagaagacg ctgtccattt tcacagaact ctgaaggagg cttgtgacca gcatggtcca 721 gatctctacc ccaaatttaa aaaatggtgt gatgattact tctttatagc ccatcgtgga 781 gagcggcggg gcattggtgg tatctttttt gatgatcttg actctccgtc caaggaggag 841 gtgtttcgct ttgtacagag ctgtgccagg gctgtagttc cttcttacat tccccttgtg 901 aaaaagcact gtgatgactc attcaccccc caggagaagc tgtggcagca gctcagaaga 961 ggacggtatg tagaatttaa tctgctgtat gatcggggca caaagtttgg cctcttcact 1021 ccaggatcca gaattgaaag tatcttgatg tctttacctc taactgcccg atgggagtac 1081 atgcattcac cctcagagaa ttccaaagaa gctgaaattc tggaagttct acgccatcca 1141 agggactggg tgcgttgatg caggcagaat ggctgtgcag gggtttggag ggcacacgat 1201 gtgtcgtccc catgccactg gtcggcactt tgccactgtg tcgagttacc cgtgccttag 1261 tcttctccac tctgcaccct acctcgtggc cagatgataa catgttttgg atgctgtccg 1321 tgatgaatgg tgagatgcga gattgtcaga gtcaattgat taaacctcat ttataccttc 1381 tagtgtcatt ttatatgact agtttacaaa ataggacatt gagtttccaa gtattgagat 1441 aagggaatat aaatagtatt atatgtatca ggaaatctct catcttgttt ttgtttcatg 1501 tattttttaa agttttcatt tgtgccacaa aaatctgtcg tggaatatat tttattttca 1561 ttaattcagt gaagttgaga cttcatagta atttagatgc aacttgaagg taaaaatttt 1621 actttgtcaa tactgaagtc tctgctgtaa tccttatata tctttctcca gagacataat 1681 attgtcaaat agatacacat ttttctaata ggtatttaga agcacttgaa atattcttaa 1741 tctctgcatg tgttacaatt cagttatttc tgtagtttgt aaactctaaa gtgacattac 1801 tattatttta gagatgtcta agttgtaatt ttgatttttg tggaaccatt gtttgttaat 1861 gttgggattc tctgcacttt tgaatgtgaa agcttatatc cctgaattct gatacttaag 1921 agttttctat ttcagacatc tctgtgtgga agttgagact aagaataatc ctagggatgt 1981 catgaattta ggcaatgttt ctcatttgga aaatgaaatg agaaataatt tccttcttta 2041 aagcaagtat atatagtatg agaaacttgg aggcatttca tacacacatt tcttaggaaa 2101 atggacacat tgaaaatgtc ctctttttta tattagagat tctgcagctc tttgctctta 2161 agagcaaatc acaacaggat tcttaatgta tgatttcttt gttcatattt atgaatgtat 2221 tattttattt gcttcgtaat aaagtttata aggaagagca tctcatacat atcattatcg 2281 tggaacacgt tgaacgtttg tgattctgtg tggccttttt ggggctggaa aaa // LOCUS HUMCOANACY 1650 bp mRNA PRI 26-AUG-1994 DEFINITION Human bile acid CoA: Amino acid N-acyltransferase mRNA, complete cds. ACCESSION L34081 NID g506819 KEYWORDS . SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1650) AUTHORS Falany,C.N., Johnson,M.R., Barnes,S. and Diasio,R.B. TITLE Glycine and taurine conjugation of bile acids by a single enzyme. Molecular cloning and expression of human liver bile acid CoA:amino acid N-acyltransferase JOURNAL J. Biol. Chem. 269 (30), 19375-19379 (1994) MEDLINE 94308218 FEATURES Location/Qualifiers source 1..1650 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="liver" /tissue_lib="lambda ZAP XR" mRNA 1..1650 CDS 185..1441 /note="BAT" /codon_start=1 /product="bile acid CoA: Amino acid N-acyltransferase" /db_xref="PID:g532505" /translation="MIQLTATPVSALVDEPVHIRATGLIPFQMVSFQASLEDENGDMF YSQAHYRANEFGEVDLNHASSLGGDYMGVHPMGLFWSLKPEKLLTRLLKRDVMNRPFQ VQVKLYDLELIVNNKVASAPKASLTLERWYVAPGVTRIKVREGRLRGALFLPPGEGLF PGVIDLFGGLGGLLEFRASLLASRGFASLALAYHNYEDLPRKPEVTDLEYFEEAANFL LRHPKVFGSGVGVVSVCQGVQIGLSMAIYLKQVTATVLINGTNFPFGIPQVYHGQIHQ PLPHSAQLISTNALGLLELYRTFETTQVGASQYLFPIEEAQGQFLFIVGEGDKTINSK AHAEQAIGQLKRHGKNNWTLLSYPGAGHLIEPPYSPLCCASTTHDLRLHWGGEVIPHA AAQEHAWKEIQRFLRKHLIPDVTSQL" BASE COUNT 447 a 387 c 376 g 440 t ORIGIN 1 catcaccccc ggagcccagc tgtaaattcc tctctttgta ctctttctct ttatttctca 61 gaccagccga cacttaggga aaatagaacc tacgctgaaa ttttgggggc aggttctctt 121 gctaggtttt gaggttttgc tgaagatatt cctgaagaat catcccaggt gccacactaa 181 aaaaatgatc cagttgacag ctacccctgt gagtgcactt gttgatgagc cagtgcatat 241 ccgagctaca ggcctgattc cctttcagat ggtgagtttt caggcatcac tggaagatga 301 aaacggagac atgttttatt ctcaagccca ctatagggcc aatgaattcg gtgaggtgga 361 cctgaatcat gcttcttcac ttggagggga ttatatggga gtccacccca tgggtctctt 421 ctggtctctg aaacctgaaa agctattaac aagactgttg aaaagagatg tgatgaatag 481 gcctttccag gtccaagtaa aactttatga cttagagtta atagtgaaca ataaagttgc 541 cagtgctcca aaggccagcc tgactttgga gaggtggtat gtggcacctg gtgtcacacg 601 aattaaggtt cgagaaggcc gccttcgagg agctctcttt ctccctccag gagagggtct 661 cttcccaggg gtaattgatt tgtttggtgg tttgggtggg ctgcttgaat ttcgggccag 721 cctcctagcc agtcgtggct tcgcctcctt ggccttggct taccataact atgaagacct 781 gccccgcaaa ccagaagtaa cagatttgga atattttgag gaggctgcca actttctcct 841 gagacatcca aaggtctttg gctcaggcgt tggggtagtc tctgtatgtc aaggagtaca 901 gattggacta tctatggcta tttacctaaa gcaagtcaca gccacggtac ttattaatgg 961 gaccaacttt ccttttggca ttccacaggt atatcatggt cagatccatc agccccttcc 1021 ccattctgca caattaatat ccaccaatgc cttggggtta ctagagctct atcgcacttt 1081 tgagacaact caagttgggg ccagtcaata tttgtttcct attgaagagg cccaggggca 1141 attcctcttc attgtaggag aaggtgataa gactatcaac agcaaagcac acgctgaaca 1201 agccatagga cagctgaaga gacatgggaa gaacaactgg accctgctat cttaccctgg 1261 ggcaggccac ctgatagaac ctccctattc tcctctgtgc tgtgcctcaa cgacccacga 1321 tttgaggtta cactggggag gagaggtgat cccacacgca gctgcacagg aacatgcttg 1381 gaaggagatc cagagatttc tcaggaagca cctcattcca gatgtgacca gtcaactcta 1441 agaagactag atattcctag aaaataaaga agcaaatctc tttaccggga ccttctctca 1501 attttcatgg cggaatgtct cccccagctg ccacacacac attttgtatt aattttaatg 1561 attaaaaaga gcatagggta aaaacctgat atttcactta tactgcataa caagctaacc 1621 aaactggaca tggtgaataa atcataaaac // LOCUS HUMCOLIP 523 bp mRNA PRI 01-NOV-1994 DEFINITION Human colipase mRNA, complete cds. ACCESSION J02883 NID g180885 KEYWORDS cofactor; colipase; triglyceride lipase. SOURCE Human adult pancreas, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 523) AUTHORS Lowe,M.E., Rosenblum,J.L., McEwen,P. and Strauss,A.W. TITLE Cloning and characterization of the human colipase cDNA JOURNAL Biochemistry 29 (3), 823-828 (1990) MEDLINE 90248429 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.E.Lowe, 17-NOV-1989. FEATURES Location/Qualifiers source 1..523 /organism="Homo sapiens" /db_xref="taxon:9606" /map="6pter-p21.1" mRNA <1..523 /note="CLP mRNA" gene 22..360 /gene="CLPS" CDS 22..360 /gene="CLPS" /note="colipase precursor" /codon_start=1 /db_xref="GDB:G00-127-277" /db_xref="PID:g180886" /translation="MEKILILLLVALSVAYAAPGPRGIIINLENGELCMNSAQCKSNC CQHSSALGLARCTSMASENSECSVKTLYGIYYKCPCERGLTCEGDKTIVGSITNTNFG ICHDAGRSKQ" sig_peptide 23..73 /gene="CLPS" /note="colipase signal peptide" mat_peptide 74..357 /gene="CLPS" /note="colipase" BASE COUNT 109 a 173 c 128 g 113 t ORIGIN 1 acaccagctg tcccactcac catggagaag atcctgatcc tcctgcttgt cgccctctct 61 gtggcctatg cagctcctgg cccccggggg atcattatca acctggagaa cggtgagctc 121 tgcatgaata gtgcccagtg taagagcaat tgctgccagc attcaagtgc gctgggcctg 181 gcccgctgca catccatggc cagcgagaac agcgagtgct ctgtcaagac gctctatggg 241 atttactaca agtgtccctg tgagcgtggc ctgacctgtg agggagacaa gaccatcgtg 301 ggctccatca ccaacaccaa ctttggcatc tgccatgacg ctggacgctc caagcagtga 361 gactgcccac ccactcccac acctagccca gaatgctgta ggccactagg cgcaggggca 421 tctctcccct gctccagcgc atctcccggg ctggccacct ccttgaccag catatctgtt 481 ttctgattgc gctcttcaca attaaaggcc tcctgcaaac ctt // LOCUS HUMCOMP 2439 bp mRNA PRI 15-DEC-1994 DEFINITION Human germline oligomeric matrix protein (COMP) mRNA, complete cds. ACCESSION L32137 NID g602449 KEYWORDS germline; matrix protein. SOURCE Homo sapiens cartilage cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2439) AUTHORS Newton,G., Weremowicz,S., Morton,C.C., Copeland,N.G., Gilbert,D.J., Jenkins,N.A. and Lawler,J. TITLE Characterization of human and mouse cartilage oligomeric matrix protein JOURNAL Unpublished (1994) FEATURES Location/Qualifiers source 1..2439 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="chondrocyte" /germline /tissue_type="cartilage" /map="19p13.1" gene 26..2425 /gene="COMP" sig_peptide 26..85 /gene="COMP" CDS 26..2299 /gene="COMP" /standard_name="cartilage oligomeric matrix protein" /note="putative" /codon_start=1 /product="matrix protein" /db_xref="PID:g602450" /translation="MVPDTACVLLLTLAALGASGQGQSPLGSDLGPQMLRELQETNAA LQDVRDWLRQQVREITFLKNTVMECDACGMQQSVRTGLPSVRPLLHCAPGFCFPGVAC IQTESGGRCGPCPAGFTGNGSHCTDVNECNAHPCFPRVRCINTSPGFRCEACPPGYSG PTHQGVGLAFAKANKQVCTDINECETGQHNCVPNSVCINTRGSFQCGPCQPGFVGDQA SGCQRGAQRFCPDGSPSECHEHADCVLERDGSRSCVCRVGWAGNGILCGRDTDLDGFP DEKLRCPEPQCRKDNCVTVPNSGQEDVDRDGIGDACDPDADGDGVPNEKDNCPLVRNP DQRNTDEDKWGDACDNCRSQKNDDQKDTDQDGRGDACDDDIDGDRIRNQADNCPRVPN SDQKDSDGDGIGDACDNCPQKSNPDQADVDHDFVGDACDSDQDQDGDGHQDSRDNCPT VPNSAQEDSDHDGQGDACDDDDDNDGVPDSRDNCRLVPNPGQEDADRDGVGDVCQDDF DADKVVDKIDVCPENAEVTLTDFRAFQTVVLDPEGDAQIDPNWVVLNQGREIVQTMNS DPGLAVGYTAFNGVDFEGTFHVNTVTDDDYAGFIFGYQDSSSFYVVMWKQMEQTYWQA NPFRAVAEPGIQLKAVKSSTGPGEQLRNALWHTGDTESQVRLLWKDPRNVGWKDKKSY RWFLQHRPQVGYIRVRFYEGPELVADSNVVLDTTMRGGRLGVFCFSQENIIWANLRYR CNDTIPEDYETHQLRQA" repeat_region 290..892 /gene="COMP" /note="putative" /rpt_family="thrombospondin type 2" /rpt_type=tandem repeat_region 893..1577 /gene="COMP" /note="putative" /rpt_family="thrombospondin type 3" /rpt_type=tandem polyA_signal 2420..2425 /gene="COMP" polyA_site 2439 /gene="COMP" BASE COUNT 503 a 758 c 809 g 369 t ORIGIN 1 cagcacccag ctccccgcca ccgccatggt ccccgacacc gcctgcgttc ttctgctcac 61 cctggctgcc ctcggcgcgt ccggacaggg ccagagcccg ttgggctcag acctgggccc 121 gcagatgctt cgggaactgc aggaaaccaa cgcggcgctg caggacgtgc gggactggct 181 gcggcagcag gtcagggaga tcacgttcct gaaaaacacg gtgatggagt gtgacgcgtg 241 cgggatgcag cagtcagtac gcaccggcct acccagcgtg cggcccctgc tccactgcgc 301 gcccggcttc tgcttccccg gcgtggcctg catccagacg gagagcggcg gccgctgcgg 361 cccctgcccc gcgggcttca cgggcaacgg ctcgcactgc accgacgtca acgagtgcaa 421 cgcccacccc tgcttccccc gagtccgctg tatcaacacc agcccggggt tccgctgcga 481 ggcttgcccg ccggggtaca gcggccccac ccaccagggc gtggggctgg ctttcgccaa 541 ggccaacaag caggtttgca cggacatcaa cgagtgtgag accgggcaac ataactgcgt 601 ccccaactcc gtgtgcatca acacccgggg ctccttccag tgcggcccgt gccagcccgg 661 cttcgtgggc gaccaggcgt ccggctgcca gcgcggcgca cagcgcttct gccccgacgg 721 ctcgcccagc gagtgccacg agcatgcaga ctgcgtccta gagcgcgatg gctcgcggtc 781 gtgcgtgtgt cgcgttggct gggccggcaa cgggatcctc tgtggtcgcg acactgacct 841 agacggcttc ccggacgaga agctgcgctg cccggagccg cagtgccgta aggacaactg 901 cgtgactgtg cccaactcag ggcaggagga tgtggaccgc gatggcatcg gagacgcctg 961 cgatccggat gccgacgggg acggggtccc caatgaaaag gacaactgcc cgctggtgcg 1021 gaacccagac cagcgcaaca cggacgagga caagtggggc gatgcgtgcg acaactgccg 1081 gtcccagaag aacgacgacc aaaaggacac agaccaggac ggccggggcg atgcgtgcga 1141 cgacgacatc gacggcgacc ggatccgcaa ccaggccgac aactgcccta gggtacccaa 1201 ctcagaccag aaggacagtg atggcgatgg tataggggat gcctgtgaca actgtcccca 1261 gaagagcaac ccggatcagg cggatgtgga ccacgacttt gtgggagatg cttgtgacag 1321 cgatcaagac caggatggag acggacatca ggactctcgg gacaactgtc ccacggtgcc 1381 taacagtgcc caggaggact cagaccacga tggccagggt gatgcctgcg acgacgacga 1441 cgacaatgac ggagtccctg acagtcggga caactgccgc ctggtgccta accccggcca 1501 ggaggacgcg gacagggacg gcgtgggcga cgtgtgccag gacgactttg atgcagacaa 1561 ggtggtagac aagatcgacg tgtgtccgga gaacgctgaa gtcacgctca ccgacttcag 1621 ggccttccag acagtcgtgc tggacccgga gggtgacgcg cagattgacc ccaactgggt 1681 ggtgctcaac cagggaaggg agatcgtgca gacaatgaac agcgacccag gcctggctgt 1741 gggttacact gccttcaatg gcgtggactt cgagggcacg ttccatgtga acacggtcac 1801 ggatgacgac tatgcgggct tcatctttgg ctaccaggac agctccagct tctacgtggt 1861 catgtggaag cagatggagc aaacgtattg gcaggcgaac cccttccgtg ctgtggccga 1921 gcctggcatc caactcaagg ctgtgaagtc ttccacaggc cccggggaac agctgcggaa 1981 cgctctgtgg catacaggag acacagagtc ccaggtgcgg ctgctgtgga aggacccgcg 2041 aaacgtgggt tggaaggaca agaagtccta tcgttggttc ctgcagcacc ggccccaagt 2101 gggctacatc agggtgcgat tctatgaggg ccctgagctg gtggccgaca gcaacgtggt 2161 cttggacaca accatgcggg gtggccgcct gggggtcttc tgcttctccc aggagaacat 2221 catctgggcc aacctgcgtt accgctgcaa tgacaccatc ccagaggact atgagaccca 2281 tcagctgcgg caagcctagg gaccagggtg aggacccgcc ggatgacagc caccctcacc 2341 gcggctggat gggggctctg cacccagccc aaggggtggc cgtcctgagg gggaagtgag 2401 aagggctcag agaggacaaa ataaagtgtg tgtgcaggg // LOCUS HUMCONGRO 2075 bp mRNA PRI 05-MAR-1996 DEFINITION Human connective tissue growth factor, complete cds. ACCESSION M92934 M36965 S56201 NID g180923 KEYWORDS growth factor; mitogen. SOURCE Homo sapiens (library: lambda gt11) connective cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2075) AUTHORS Bradham,D.M., Igarashi,A., Potter,R.L. and Grotendorst,G.R. TITLE Connective tissue growth factor: a cysteine-rich mitogen secreted by human vascular endothelial cells is related to the SRC-induced immediate early gene product CEF-10 JOURNAL J. Cell Biol. 114 (6), 1285-1294 (1991) MEDLINE 91373462 REFERENCE 2 (bases 1 to 2075) AUTHORS Grotendorst,G.R. TITLE Direct Submission JOURNAL Submitted (01-JUL-1990) Gary R. Grotendorst, Department of Cell Biology and Anatomy, University of Miami School of Medicine, Miami, FL, 33136, USA FEATURES Location/Qualifiers source 1..2075 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="endothelial" /tissue_type="connective" /tissue_lib="lambda gt11" mRNA 1..2075 sig_peptide 130..192 CDS 130..1179 /codon_start=1 /product="connective tissue growth factor" /db_xref="PID:g180924" /translation="MTAASMGPVRVAFVVLLALCSRPAVGQNCSGPCRCPDEPAPRCP AGVSLVLDGCGCCRVCAKQLGELCTERDPCDPHKGLFCDFGSPANRKIGVCTAKDGAP CIFGGTVYRSGESFQSSCKYQCTCLDGAVGCMPLCSMDVRLPSPDCPFPRRVKLPGKC CEEWVCDEPKDQTVVGPALAAYRLEDTFGPDPTMIRANCLVQTTEWSACSKTCGMGIS TRVTNDNASCRLEKQSRLCMVRPCEADLEENIKKGKKCIRTPKISKPIKFELSGCTSM KTYRAKFCGVCTDGRCCTPHRTTTLPVEFKCPDGEVMKKNMMFIKTCACHYNCPGDND IFESLYYRKMYGDMA" mat_peptide 193..1176 /product="connective tissue growth factor" BASE COUNT 491 a 558 c 546 g 480 t ORIGIN 1 cccggccgac agccccgaga cgacagcccg gcgcgtcccg gtccccacct ccgaccaccg 61 ccagcgctcc aggccccgcg ctccccgctc gccgccaccg cgccctccgc tccgcccgca 121 gtgccaacca tgaccgccgc cagtatgggc cccgtccgcg tcgccttcgt ggtcctcctc 181 gccctctgca gccggccggc cgtcggccag aactgcagcg ggccgtgccg gtgcccggac 241 gagccggcgc cgcgctgccc ggcgggcgtg agcctcgtgc tggacggctg cggctgctgc 301 cgcgtctgcg ccaagcagct gggcgagctg tgcaccgagc gcgacccctg cgacccgcac 361 aagggcctct tctgtgactt cggctccccg gccaaccgca agatcggcgt gtgcaccgcc 421 aaagatggtg ctccctgcat cttcggtggt acggtgtacc gcagcggaga gtccttccag 481 agcagctgca agtaccagtg cacgtgcctg gacggggcgg tgggctgcat gcccctgtgc 541 agcatggacg ttcgtctgcc cagccctgac tgccccttcc cgaggagggt caagctgccc 601 gggaaatgct gcgaggagtg ggtgtgtgac gagcccaagg accaaaccgt ggttgggcct 661 gccctcgcgg cttaccgact ggaagacacg tttggcccag acccaactat gattagagcc 721 aactgcctgg tccagaccac agagtggagc gcctgttcca agacctgtgg gatgggcatc 781 tccacccggg ttaccaatga caacgcctcc tgcaggctag agaagcagag ccgcctgtgc 841 atggtcaggc cttgcgaagc tgacctggaa gagaacatta agaagggcaa aaagtgcatc 901 cgtactccca aaatctccaa gcctatcaag tttgagcttt ctggctgcac cagcatgaag 961 acataccgag ctaaattctg tggagtatgt accgacggcc gatgctgcac cccccacaga 1021 accaccaccc tgccggtgga gttcaagtgc cctgacggcg aggtcatgaa gaagaacatg 1081 atgttcatca agacctgtgc ctgccattac aactgtcccg gagacaatga catctttgaa 1141 tcgctgtact acaggaagat gtacggagac atggcatgaa gccagagagt gagagacatt 1201 aactcattag actggaactt gaactgattc acatctcatt tttccgtaaa aatgatttca 1261 gtagcacaag ttatttaaat ctgtttttct aactggggga aaagattccc acccaattca 1321 aaacattgtg ccatgtcaaa caaatagtct atcttcccca gacactggtt tgaagaatgt 1381 taagacttga cagtggaact acattagtac acagcaccag aatgtatatt aaggtgtggc 1441 tttaggagca gtgggagggt accggcccgg ttagtatcat cagatcgact cttatacgag 1501 taatatgcct gctatttgaa gtgtaattga gaaggaaaat tttagcgtgc tcactgacct 1561 gcctgtagcc ccagtgacag ctaggatgtg cattctccag ccatcaagag actgagtcaa 1621 gttgttcctt aagtcagaac agcagactca gctctgacat tctgattcga atgacactgt 1681 tcaggaatcg gaatcctgtc gattagactg gacagcttgt ggcaagtgaa tttgcctgta 1741 acaagccaga ttttttaaaa tttatattgt aaatattgtg tgtgtgtgtg tgtgtgtata 1801 tatatatata tatgtacagt tatctaagtt aatttaaagt tgtttgtgcc tttttatttt 1861 tgtttttaat gctttgatat ttcaatgtta gcctcaattt ctgaacacca taggtagaat 1921 gtaaagcttg tctgatcgtt caaagcatga aatggatact tatatggaaa ttctgctcag 1981 atagaatgac agtccgtcaa aacagattgt ttgcaaaggg gaggcatcag tgtcttggca 2041 ggctgatttc taggtaggaa atgtggtagc tcacg // LOCUS HUMCOOTAA 1104 bp mRNA PRI 26-JUN-1990 DEFINITION Human CO-029. ACCESSION M35252 NID g180925 KEYWORDS tumor-associated antigen. SOURCE Human SW 948 cell line (colorectal carcinoma cell line) cDNA to mRNA, clone CO-029-5. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1104) AUTHORS Szala,S., Kasai,Y., Steplewski,Z., Rodeck,U., Koprowski,H. and Linnenbach,A.J. TITLE Molecular cloning of cDNA for the human tumor-associated antigen CO-029 and identification of related transmembrane antigens JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 6833-6837 (1990) COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.J.Linnenbach, 19-JUN-1990, for release after publication. Author address: A.J.Linnenbach Wistar Institute, Rm 472 3601 Spruce Street Philadelphia, PA 19104. FEATURES Location/Qualifiers source 1..1104 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 138..851 /note="tumor-associated antigen" /codon_start=1 /db_xref="PID:g180926" /translation="MAGVSACIKYSMFTFNFLFWLCGILILALAIWVRVSNDSQAIFG SEDVGSSSYVAVDILIAVGAIIMILGFLGCCGAIKESRCMLLLFFIGLLLILLLQVAT GILGAVFKSKSDRIVNETLYENTKLLSATGESEKQFQEAIIVFQEEFKCCGLVNGAAD WGNNFQHYPELCACLDKQRPCQSYNGKQVYKETCISFIKDFLAKNLIIVIGISFGLAV IEILGLVFSMVLYCQIGNK" BASE COUNT 332 a 188 c 238 g 346 t ORIGIN 1 ctctagagtc gagatccatt gtgctctaaa gtggatacag aaatctctgc aggcaagttg 61 ctccagagca tattgcagga caagcctgta acgaatagtt aaattcacgg catctggatt 121 cctaatcctt ttccgaaatg gcaggtgtga gtgcctgtat aaaatattct atgtttacct 181 tcaacttctt gttctggcta tgtggtatct tgatcctagc attagcaata tgggtacgag 241 taagcaatga ctctcaagca atttttggtt ctgaagatgt aggctctagc tcctacgttg 301 ctgtggacat attgattgct gtaggtgcca tcatcatgat tctgggcttc ctgggatgct 361 gcggtgctat aaaagaaagt cgctgcatgc ttctgttgtt tttcataggc ttgcttctga 421 tcctgctcct gcaggtggcg acaggtatcc taggagctgt tttcaaatct aagtctgatc 481 gcattgtgaa tgaaactctc tatgaaaaca caaagctttt gagcgccaca ggggaaagtg 541 aaaaacaatt ccaggaagcc ataattgtgt ttcaagaaga gtttaaatgc tgcggtttgg 601 tcaatggagc tgctgattgg ggaaataatt ttcaacacta tcctgaatta tgtgcctgtc 661 tagataagca gagaccatgc caaagctata atggaaaaca agtttacaaa gagacctgta 721 tttctttcat aaaagacttc ttggcaaaaa atttgattat agttattgga atatcatttg 781 gactggcagt tattgagata ctgggtttgg tgttttctat ggtcctgtat tgccagatcg 841 ggaacaaatg aatctgtgga tgcatcaacc tatcgtcagt caaacccctt taaaatgttg 901 ctttggcttt gtaaatttaa atatgtaagt gctatataag tcaggagcag ctgtcttttt 961 aaaatgtctc ggctagctag accacagata tcttctagac atattgaaca catttaagat 1021 ttgagggata taagggaaaa tgatatgaat gtgtattttt actcaaaata aaagtaactg 1081 tttaaaaaaa aaaaaaaaaa aaaa // LOCUS HUMCOR2M 1588 bp mRNA PRI 15-SEP-1990 DEFINITION Human cytochrome bc-1 complex core protein II mRNA, complete cds. ACCESSION J04973 NID g180927 KEYWORDS core protein II; mitochondrial cytochrome bc-1 complex. SOURCE Human fibroblast, cDNA to mRNA, clone PHCII, library of Okayama-Berg. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1588) AUTHORS Hosokawa,Y., Suzuki,H., Toda,H., Nishikimi,M. and Ozawa,T. TITLE Complementary DNA encoding core protein II of human mitochondrial cytochrome bc-1 complex: Substantial diversity in deduced primary structure from its yeast counterpart JOURNAL J. Biol. Chem. 264, 13483-13488 (1989) MEDLINE 89340421 COMMENT Draft entry and printed sequence for [1] kindly submitted by T.Ozawa, 03-JUN-1989. FEATURES Location/Qualifiers source 1..1588 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..1588 /note="core protein II mRNA" sig_peptide 54..95 /note="core protein II signal peptide" CDS 54..1415 /note="core protein II precursor" /codon_start=1 /db_xref="PID:g180928" /translation="MKLLTRAGSFSRFYSLKVAPKVKATAAPAGAPPQPQDLEFTKLP NGLVIASLENYSPVSRIGLFIKAGSRYEDFSNLGTTHLLRLTSSLTTKGASSFKITRG IEAVGGKLSVTATRENMAYTVECLRGDVDILMEFLLNVTTAPEFRRWEVADLQPQLKI DKAVAFQNPQTHVIENLHAAAYQNALANPLYCPDYRIGKVTSEELHYFVQNHFTSARM ALIGLGVSHPVLKQVAEQFLNMRGGLGLSGAKANYRGGEIREQNGDSLVHAAFVAESA VAGSAEANAFSVLQHVLGAGPHVKRGSNTTSHLHQAVAKATQQPFDVSAFNASYSDSG LFGIYTISQATAAGDVIKAAYNQVKRIAQGNLSNTDVQAAKNKLKAGYLMSVESSECF LEEVGSQALVAGSYMPPSTVLQQIDSVANADIINAAKKFVSGQKSMAASGNLGHTPFV DEL" mat_peptide 96..1412 /note="core protein II" BASE COUNT 442 a 348 c 366 g 432 t ORIGIN 1 atcttgcttt cctttaatcc ggcagtgacc gtgtgtcaga acaatcttga atcatgaagc 61 tactaaccag agccggctct ttctcgagat tttattccct caaagttgcc cccaaagtta 121 aagccacagc tgcgcctgca ggagcaccgc cacaacctca ggaccttgag tttaccaagt 181 taccaaatgg cttggtgatt gcttctttgg aaaactattc tcctgtatca agaattggtt 241 tgttcattaa agcaggcagt agatatgagg acttcagcaa tttaggaacc acccatttgc 301 tgcgtcttac atccagtctg acgacaaaag gagcttcatc tttcaagata acccgtggaa 361 ttgaagcagt tggtggcaaa ttaagtgtga ccgcaacaag ggaaaacatg gcttatactg 421 tggaatgcct gcggggtgat gttgatattc taatggagtt cctgctcaat gtcaccacag 481 caccagaatt tcgtcgttgg gaagtagctg accttcagcc tcagctaaag attgacaaag 541 ctgtggcctt tcagaatccg cagactcatg tcattgaaaa tttgcatgca gcagcttacc 601 agaatgcctt ggctaatccc ttgtattgtc ctgactatag gattggaaaa gtgacatcag 661 aggagttaca ttacttcgtt cagaaccatt tcacaagtgc aagaatggct ttgattggac 721 ttggtgtgag tcatcctgtt ctaaagcaag ttgctgaaca gtttctcaac atgaggggtg 781 ggcttggttt atctggtgca aaggccaact accgtggagg tgaaatccga gaacagaatg 841 gagacagtct tgtccatgct gcttttgtag cagaaagtgc tgtcgcggga agtgcagagg 901 caaatgcatt tagtgttctt cagcatgtcc tcggtgctgg gccacatgtc aagaggggca 961 gcaacaccac cagccatctg caccaggctg ttgccaaggc aactcagcag ccatttgatg 1021 tttctgcatt taatgccagt tactcagatt ctggactctt tgggatttat actatctccc 1081 aggccacagc tgctggagat gttatcaagg ctgcctataa tcaagtaaaa agaatagctc 1141 aaggaaacct ttccaacaca gatgtccaag ctgccaagaa caagctgaaa gctggatacc 1201 taatgtcagt ggagtcttct gagtgtttcc tggaagaagt cgggtcccag gctctagttg 1261 ctggttctta catgccacca tccacagtcc ttcagcagat tgattcagtg gctaatgctg 1321 atatcataaa tgcggcaaag aagtttgttt ctggccagaa gtcaatggca gcaagtggaa 1381 atttgggaca tacacctttt gttgatgagt tgtaatactg atgcacacat tacaggagag 1441 agctgaacgt tctctcaccc agagcagcaa acacatgaaa gtcagaagtc tctaatatat 1501 catttgtctt ttttccagtg aggtaaaata aggcataaat gcaggtaatt attcccagct 1561 gacctaaagt caataaaaca ttctgttt // LOCUS HUMCOX17R 423 bp mRNA PRI 21-APR-1996 DEFINITION Homo sapiens COX17 mRNA, complete cds. ACCESSION L77701 NID g1280205 KEYWORDS COX17 gene. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 423) AUTHORS Amaravadi,R., Glerum,D.M. and Tzagoloff,A. TITLE Isolation of a human cDNA for COX17, a yeast gene involved in copper metabolism JOURNAL Unpublished (1996) FEATURES Location/Qualifiers source 1..423 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" 5'UTR 1..86 /gene="COX17" gene 1..423 /gene="COX17" mRNA 1..423 /gene="COX17" CDS 87..278 /gene="COX17" /codon_start=1 /db_xref="PID:g1280206" /translation="MPGLVDSNPAPPESQEKKPLKPCCACPETKKARDACIIEKGEEH CGHLIEAHKECMRALGFKI" 3'UTR 279..423 /gene="COX17" polyA_site 423 /gene="COX17" BASE COUNT 122 a 79 c 117 g 105 t ORIGIN 1 ccggaagtga ctgcggacga atcggcgttt gccgaggctg gcatagattt ggctgtctcc 61 gctcatagct gcttttggcg cgaaagatgc cgggtctggt tgactcaaac cctgccccgc 121 ctgagtctca ggagaagaag ccgctgaagc cctgctgcgc ttgcccggag accaagaagg 181 cgcgcgatgc gtgtatcatc gagaaaggag aagaacactg tggacatcta attgaggccc 241 acaaggaatg catgagagcc ctaggattta aaatatgaaa tggtggtctg ctgtgtgaat 301 aaataattcc tgaagaatga agaagattaa ttttgggagt tctttgacga actttgatat 361 gtggaaaaag tatttataat ttattgtaag aagaaagtaa aatattacta gtggaagatc 421 ttc // LOCUS HUMCOX8A 472 bp ss-RNA PRI 10-MAY-1996 DEFINITION Human cytochrome c oxidase subunit VIII (COX8) mRNA, complete cds. ACCESSION J04823 NID g1311703 KEYWORDS cytochrome c oxidase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 472) AUTHORS Rizzuto,R., Nakase,H., Darras,B., Francke,U., Fabrizi,G.M., Mengel,T., Walsh,F., Kadenbach,B., DiMauro,S. and Schon,E.A. TITLE A gene specifying subunit VIII of human cytochrome c oxidase is localized to chromosome 11 and is expressed in both muscle and non-muscle tissues JOURNAL J. Biol. Chem. 264 (18), 10595-10600 (1989) MEDLINE 89278125 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.A.Shon, 04-APR-1989. FEATURES Location/Qualifiers source 1..472 /organism="Homo sapiens" /db_xref="taxon:9606" /map="11q12-q13" /clone="lambda-hcox8.31." /tissue_type="liver" /dev_stage="adult" mRNA <1..472 sig_peptide 42..116 /gene="COX8" gene 42..251 /gene="COX8" CDS 42..251 /gene="COX8" /EC_number="1.9.3.1" /codon_start=1 /product="cytochrome c oxidase subunit VIII precursor" /db_xref="PID:g180939" /db_xref="GDB:G00-119-796" /translation="MSVLTPLLLRGLTGSARRLPVPRAKIHSLPPEGKLGIMELAVGL TSCFVTFLLPAGWILSHLETYRRPE" mat_peptide 116..248 /gene="COX8" BASE COUNT 72 a 150 c 127 g 123 t ORIGIN 141 bp upstream of HindIII site. 1 ggctacggct gaccgttttt tgtggtgtac tccgtgccat catgtccgtc ctgacgccgc 61 tgctgctgcg gggcttgaca ggctcggccc ggcggctccc agtgccgcgc gccaagatcc 121 attcgttgcc gccggagggg aagcttggga tcatggaatt ggccgttggg cttacctcct 181 gcttcgtgac cttcctcctg ccagcgggct ggatcctgtc acacctggag acctacagga 241 ggccagagtg aaggggtccg ttctgtccct cacactgtga cctgaccagc cccaccggcc 301 catcctggtc atgttactgc atttgtggcc ggcctcccct ggatcatgtc attcaattcc 361 agtcacctct tctgcaatca tgacctcttg atgtctccat ggtgacctcc ttgggggtca 421 ctgaccctgc ttggtggggt cccccttgta acaataaatc tatttaaact tt // LOCUS HUMCOXNE 647 bp mRNA PRI 09-MAY-1996 DEFINITION Homo sapiens nuclear-encoded mitochondrial cytochrome c oxidase Va subunit mRNA, complete cds. ACCESSION M22760 NID g695359 KEYWORDS cytochrome c oxidase; nuclear-encoded mitochondrial protein. SOURCE Homo sapiens (tissue library: F.S.Walsh) adult endothelium cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 647) AUTHORS Rizzuto,R., Nakase,H., Zeviani,M., DiMauro,S. and Schon,E.A. TITLE Subunit Va of human and bovine cytochrome c oxidase is highly conserved JOURNAL Gene 69 (2), 245-256 (1988) MEDLINE 89172069 FEATURES Location/Qualifiers source 1..647 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="endothelium" /tissue_lib="F.S.Walsh" transit_peptide 20..142 /product="cytochrome c oxidase subunit Va" CDS 20..472 /EC_number="1.9.3.1" /note="nuclear-encoded mitochondrial protein" /codon_start=1 /product="cytochrome c oxidase subunit Va" /db_xref="PID:g695360" /translation="MLGAALRRCAVAATTRADPRGLLHSARTPGPAVAIQSVRCYSHG SQETDEEFDARWVTYFNKPDIDAWELRKGINTLVTYDMVPEPKIIDAALRACRRLNDF ASLVRILEVVKDKAGPHKEIYPYVIQELRPTLNELGISTPEELGLDKV" mat_peptide 143..469 /EC_number="1.9.3.1" /note="nuclear-encoded mitochondrial protein" /product="cytochrome c oxidase subunit Va" polyA_signal 622..627 polyA_site 647 BASE COUNT 167 a 162 c 158 g 160 t ORIGIN 1 gggcgccgcc atcgccgtca tgctgggcgc cgctctccgc cgctgcgctg tggccgcaac 61 cacccgggcc gaccctcgag gcctcctgca ctccgcccgg acccccggcc ccgccgtggc 121 tatccagtca gttcgctgct attcccatgg gtcacaggag acagatgagg agtttgatgc 181 tcgctgggta acatacttca acaagccaga tatagatgcc tgggaattgc gtaaagggat 241 aaacacactt gttacctatg atatggttcc agagcccaaa atcattgatg ctgctttgcg 301 ggcatgcaga cggttaaatg attttgctag tctagttcga atcctagagg ttgttaagga 361 caaagcagga cctcataagg aaatctaccc ctatgtcatc caggaactta gaccaacttt 421 aaatgaactg ggaatctcca ctccggagga actgggcctt gacaaagtgt aaaccgcatg 481 gatgggcttc cccaaggatt tattgacatt gctacttgag tgtgaacagt tacctggaaa 541 tactgatgat aacatattac cttattttga acaagtttcc ctttattgag taccaagcca 601 tgtaatggta acttggactt taataaaagg gaaatgagtt tgaactg // LOCUS HUMCPC 1286 bp mRNA PRI 26-JUL-1995 DEFINITION Homo sapiens CTP:phosphocholine cytidyltransferase mRNA, complete cds. ACCESSION L28957 NID g575485 KEYWORDS CTP:phosphocholine cytidylyltransferase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1286) AUTHORS Kalmar,G.B., Kay,R.J., LaChance,A.C. and Cornell,R.B. TITLE Primary structure and expression of a human CTP:phosphocholine cytidylyltransferase JOURNAL Biochim. Biophys. Acta 1219 (2), 328-334 (1994) MEDLINE 95002145 FEATURES Location/Qualifiers source 1..1286 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="erythroleukemic K562" CDS 46..1149 /codon_start=1 /product="CTP:phosphocholine cytidylyltransferase" /db_xref="PID:g575486" /translation="MDAQCSAKVNARKRRKEAPGPNGATEEDGVPSKVQRCAVGLRQP APFSDEIEVDFSKPYVRVTMEEASRGTPCERPVRVYADGIFDLFHSGHARALMQAKNL FPNTYLIVGVCSDELTHNFKGFTVMNENERYDAVQHCRYVDEVVRNAPWTLTPEFLAE HRIDFVAHDDIPYSSAGSDDVYKHIKEAGMFAPTQRTEGISTSDIITRIVRDYDVYAR RNLQRGYTAKELNVSFINEKKYHLQERVDKVKEKVKDVEEKSKEFVQKVEEKSIDLIQ KWEEKSREFIGSFLEMFGPEGALKHMLKEGKGRMLQAISPKQSPSSSPTRERSPSPSF RWPFSGKTSPPCSPANLSRHKAAAYDISEDEED" BASE COUNT 360 a 304 c 346 g 276 t ORIGIN 1 cgaccggacc gggctcgggg gagcgtgagt tgcagttaaa agaagatgga tgcacagtgt 61 tcagccaagg tcaatgcaag gaagaggaga aaagaggcgc ccggacccaa cggggcaaca 121 gaagaagatg gggttccttc caaagtgcag cgctgtgcag tgggcttacg gcaaccagct 181 cctttttctg atgaaattga agttgacttt agtaagccct atgtcagggt aactatggaa 241 gaagccagca gaggaactcc ttgtgagcga cctgtgagag tttatgccga tggaatattt 301 gacttatttc actctggtca cgcccgagct ctgatgcaag cgaagaacct tttccctaat 361 acgtacctca ttgtgggagt ttgcagtgat gagctcacac acaacttcaa aggcttcacg 421 gtgatgaacg agaatgagcg ctatgacgca gtccagcact gccgctacgt ggatgaggtg 481 gtgaggaatg cgccctggac gctgacaccc gagttcctgg ccgaacaccg gattgatttt 541 gtagcccatg atgatattcc ttattcatct gctggcagtg atgatgttta taagcacatc 601 aaggaggcag gcatgtttgc tccaacacag aggacagaag gtatctccac atcagacatc 661 atcacccgaa ttgtgcggga ttatgatgtg tatgcgaggc ggaacctgca gaggggctac 721 acagcaaagg agctcaatgt cagctttatc aacgagaaga aataccactt gcaggagagg 781 gttgacaaag taaaggagaa agtgaaagat gtggaggaaa agtcaaaaga atttgttcag 841 aaggtggagg aaaaaagcat tgacctcatt cagaagtggg aggagaagtc ccgagaattc 901 attggaagtt ttctggaaat gtttggtccg gaaggagcac tgaaacatat gctgaaagag 961 gggaagggcc ggatgctgca ggccatcagc ccgaagcaga gccccagcag cagccctact 1021 cgcgagcgct ccccctcccc ctctttccga tggcccttct ccggcaagac ttccccacct 1081 tgctccccag caaatctctc caggcacaag gctgcagcct atgatatcag tgaggatgaa 1141 gaagactaat gtttcctccc tcctttcctg tcctcccttt ctgtcccatt accttcagaa 1201 gctctctgtt gaattccgaa ttgtgacccc aacactaaac ctaaggacag ctacaaagga 1261 aagacaactg gggaaagaag acctag // LOCUS HUMCPD2 2999 bp mRNA PRI 05-JUN-1996 DEFINITION Human brain mRNA for photolyase homolog, complete cds. ACCESSION D83702 NID g1304106 KEYWORDS photolyase homolog. SOURCE Homo sapiens brain cDNA to mRNA, clone:pHH64PR. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Todo,T., Ryo,H., Yamamoto,K., Toh,H., Inui,T., Ayaki,H., Nomura,T. and Ikenaga,M. TITLE Similarity among the Drosophila (6-4)photolyase, a human photolyase homolog, and the DNA photolyase-blue-light photoreceptor family JOURNAL Science 272 (5258), 109-112 (1996) MEDLINE 96178677 REFERENCE 2 (bases 1 to 2999) AUTHORS Todo,T. JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 2999) AUTHORS Todo,T. TITLE Direct Submission JOURNAL Submitted (25-FEB-1996) to the DDBJ/EMBL/GenBank databases. Takeshi Todo, Radiation Biology Center, Kyoto University; Yoshidakonoe-cho, Sakyo-ku,, Kyoto, Kyoto 606-01, Japan (E-mail:todo@radbio.med.osaka-u.ac.jp, Tel:075-753-7554, Fax:075-753-7564) FEATURES Location/Qualifiers source 1..2999 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pHH64PR" /tissue_type="brain" CDS 587..2347 /codon_start=1 /product="photolyase homolog" /db_xref="PID:d1012740" /db_xref="PID:g1304107" /translation="MGVNAVHWFRKGLRLHDNPALKECIQGADTIRCVYILDPWFAGS SNVGINRWRFLLQCLEDLDANLRKLNSRLFVIRGQPADVFPRLFKEWNITKLSIEYDS EPFGKERDAAIKKLATEAGVEVIVRISHTLYDLDKIIELNGGQPPLTYKRFQTLISKM EPLEIPVETITSEVIEKCTTPLSDDHDEKYGVPSLEELGFDTDGLSSAVWPGGETEAL TRLERHLERKAWVANFERPRMNANSLLASPTGLSPYLRFGCLSCRLFYFKLTDLYKKV KKNSSPPLSLYGQLLWREFFYTAATNNPRFDKMEGNPICVQIPWDKNPEALAKWAEGR TGFPWIDAIMTQLRQEGWIHHLARHAVACFLTRGDLWISWEEGMKVFEELLLDADWSI NAGSWMWLSCSSFFQQFFHCYCPVGFGRRTDPNGDYIRRYLPVLRGFPAKYIYDPWNA PEGIQKVAKCLIGVNYPKPMVNHAEASRLNIERMKQIYQQLSRYRGLGLLASVPSNPN GNGGFMGYSAENIPGCSSSGSCSQGSGILHYAHGDSQQTHLLKQGRSSMGTGLSGGKR PSQEEDTQSIGPKVQRQSTN" BASE COUNT 796 a 662 c 709 g 832 t ORIGIN 1 agttacccgg gcagcctcgg gaccggtcac cggccggcaa ccgtccagcg gcctcgacca 61 ccgcctctag cctccgttcc cggtcctttc tcccgggccg agagacagcg tcgccgacag 121 gggctcattc ccctccggtt ctcctcggtg actcacctcg ggcgggccgt tttgtcttta 181 ggggccgcct tggtggggcg aggtttccgt gacgaatctc ctggggccgt ccgtgccggc 241 tcgggccgtc gtggcgactc gagctcctgg aacttgctca ggctccggag gtccgaggcc 301 ctcgaagtta tgcgtcgcct ccaggcggtt gcggcgggcg cgggctccta aagggcgtca 361 cacccggact ccgccgacta ggcaacctcc attcatcttt ccactgcgcc tccagcgccc 421 ccgccttctc cggtcccctc ctcggagtca ttttttcctg ttccccctct gccgcccttt 481 cctcacgccc cgggtgaggc aattctcttg gaagcgaagg tgtcggctat gagccggagc 541 ctccttcctt gaatttctcc gtggaggacc cgccgcgccc cccggcatgg gggtgaacgc 601 cgtgcactgg ttccgaaagg ggctccggct ccacgacaac cccgccctga aggagtgcat 661 tcagggcgcc gacaccatcc gctgcgtcta catcctggac ccctggttcg ccggctcctc 721 caatgtgggc atcaacaggt ggcgattttt gcttcagtgt cttgaggatc ttgatgccaa 781 tctacgaaaa ttaaactccc gtctgtttgt gattcgtgga caaccagcag atgtgtttcc 841 caggcttttc aaggaatgga acattactaa actttcaatt gagtatgatt ctgagccctt 901 tggaaaggaa cgagacgcag ctattaagaa actggcgact gaagctggag tagaagtcat 961 tgtaagaatt tcacatacat tatatgacct agacaagatc atagaactca atggtggaca 1021 accgcctcta acttataaaa gattccagac tctcatcagc aaaatggaac cactagagat 1081 accagtagag acaattactt cagaagtgat agaaaagtgc acaactcctc tgtctgatga 1141 ccatgatgag aaatatggag tcccttcact ggaagagcta ggttttgata cagatggctt 1201 atcctctgca gtgtggccag gcggagaaac tgaagcactt actcgtttgg aaaggcattt 1261 ggaaagaaaa gcttgggtgg caaattttga aagacctcga atgaatgcga attctctgct 1321 tgcaagccct actggactta gtccttatct ccgatttggt tgtttgtcat gtcgactgtt 1381 ttacttcaaa ctaacagatc tctacaaaaa ggtaaagaag aacagttccc ctcccctttc 1441 cctttatggg caactgttat ggcgtgaatt tttctataca gcagcaacaa ataatccacg 1501 ctttgataaa atggaaggaa accctatctg tgttcagatt ccttgggata aaaatcctga 1561 ggctttagcc aaatgggcgg aaggccggac aggctttcca tggattgatg ccatcatgac 1621 acagcttcgt caggagggtt ggattcatca tctagccagg catgcagttg cttgcttcct 1681 gacacgaggg gacctgtgga ttagttggga agaaggaatg aaggtatttg aagaattatt 1741 gcttgatgca gattggagca taaatgctgg aagttggatg tggctgtctt gtagttcctt 1801 ttttcaacag ttttttcact gctattgccc tgttggtttt ggtaggagaa cagatcccaa 1861 tggagactat atcaggcgtt atttgcctgt cctaagaggc ttccctgcaa aatatatcta 1921 tgatccctgg aatgcaccag aaggtatcca aaaggtagcc aaatgtttga taggagttaa 1981 ttatcctaaa ccaatggtga accatgctga ggcaagccgt ttgaatatcg aaaggatgaa 2041 acagatctat cagcagcttt cacgatatag aggactaggt cttctggcat cagtaccttc 2101 taatcctaat gggaatggag gcttcatggg atattctgca gaaaatatcc caggttgtag 2161 cagcagtgga agttgctctc aagggagtgg tattttacac tatgctcatg gcgacagtca 2221 gcaaactcac ctgttgaagc aaggaagaag ctccatgggc actggtctca gtggtgggaa 2281 acgtcctagt caggaagagg acacacagag tattggtcct aaagtccaga gacagagcac 2341 taattagaaa acattcagga ggaatactgt tgcagctgaa attggtgggg agttcaatag 2401 cttttcaatt aagttattta aaaatattct tcattgatgg aaagcagtta catattgaaa 2461 tatgttgttt ctaatgacat ttctgtggtt tttaactttt taatgaattt cacagaggac 2521 aattggtaat ttgtatataa agaacttggc aagagaattt gcttaatgta aatataaaca 2581 gtcacaatta gtatagaccc atcgatatat ttttgataat ttttcatgta tggtaaagtt 2641 aaaatgacaa attgatattc tgatataaaa ctcaaagttt gaagtcagtg ggaaaaaagg 2701 aggtttttag actttcttaa aagacgttaa aattttagga cagaattttc ttgatgttgt 2761 ttaatctaac tttgcactct ttgataataa tgttttagat aatgtcgtaa tccaaattgg 2821 tattgtagcc tctgttaaca cagacagtat atgttttaaa ctttgatgta aaccttttta 2881 gacccaaact tgtggaagta tcatgtgtta agttctctgt ctctgtttct ttgttcattt 2941 attactaaaa tgaacttgtt attaaagtat atgcaaatat gaaaaaaaaa aaaaaaaaa // LOCUS HUMCPSI 5215 bp mRNA PRI 17-JUL-1992 DEFINITION Human carbamyl phosphate synthetase I (EC 6.3.4.16) mRNA. ACCESSION D90282 NID g219552 KEYWORDS carbamyl phosphate synthetase I; urea cycle. SOURCE Human liver, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5215) AUTHORS Haraguchi,Y., Uchino,T., Takiguchi,M., Endo,F., Mori,M. and Matsuda,I. TITLE Cloning and sequence of a cDNA encoding human carbamyl phosphate synthetase I: molecular analysis of hyperammonemia JOURNAL Gene 107 (2), 335-340 (1991) MEDLINE 92084128 COMMENT Submitted (18-JAN-1991) to DDBJ by: Yougo Haraguchi Department of Pediatrics Kumamoto University Medical School 1-1-1 Honjo Kumamoto 860 Japan Phone: 096-344-2111 x5654 Fax: 096-366-3471. FEATURES Location/Qualifiers source 1..5215 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 119..4621 /note="carbamyl phosphate synthetase I (EC 6.3.4.16)" /codon_start=1 /db_xref="PID:d1015034" /db_xref="PID:g219553" /translation="MTRILTAFKVVRTLKTGFGFTNVTAHQKWKFSRPGIRLLSVKAQ TAHIVLEDGTKMKGYSFGHPSSVAGEVVFNTGLGGYPEAITDPAYKGQILTMANPIIG NGGAPDTTSLDELGLSKYLESNGIKVSGLLVLDYSKDYNHWLATKSLGQWLQEEKVPA IYGVDTRMLTKIIRDKGTMLGKIEFEGQPVDFVDPNKQNLIAEVSTKDVKVYGKGNPT KVVAVDCGIKNNVIRLLVKRGAEVHLVPWNHDFTKMEYDGILIAGGPGNPALAEPLIQ NVQKILESDRKEPLFGISTGNLITGLAAGAKTYKMSMANRGQNQPVLNITNKQAFITA QNHCYALDNTLPAGWKPLFVNVNDQTNEGIMHESKPFFAVQFHPEVTPGPIDTEYLFD SFFSLIKKGKATTITSVLPKPALVASRVEVSKVLILGSGGLSIGQAGEFDYSGSQAVK AMKEENVKTVLMNPNIASVQTNEVGLKQADTVYFLPITPQFVTEVIKAEQPDGLILGM GGQTALNCGVELFKRGVLKEYGVKVLGTSVESIMATEDRQLFSDKLNEINEKIAPSFA VESIEDALKAADTIGYPVMIRSAYALGGLGSGICPNRETLMDLSTKAFAMTNQILVEK SVTGWKEIEYEVVRDADDNCVTVCNMENVDAMGVHTGDSVVVAPAQTLSNAEFQMLRR TSINVVRHLGIVGECNIQFALHPTSMEYCIIEVNAKMSPNSALASKTTGYPLAFIAAK IALGIPLPGIKNVVSGKTSACFEPSLDYMVTKIPRWDLDRFHGTSSRIGSSMKSVGEV MAIGRTFEESFQKALRMCHPSIEGFTPRLPMNKEWPSNLDLRKELSEPSSTRIYAIAK AIDDNMSLDEIEKLTYIDKWFLYKMRDILNMEKTLKGLNSESMTEETLKRAKEIGFSD KQISKCLGLTEAQTRELRLKKNIHPWVKQIDTLAAEYPSVTNYLYVTYNGQEHDVNFD DHGMMVLGCGPYHIGSSVEFDWCAVSSIRTLRQLGKKTVVVNCNPETVSTDFDECDKL YFEELSLERILDIYHQEACGGCIISVGGQIPNNLAVPLYKNGVKIMGTSPLQIDRAED RSIFSAVLDELKVAQAPWKAVNTLNEALEFAKSVDYPCLLRPSYVLSGSAMNVVFSED EMKKFLEEATRVSQATPVVLTKFVEGAREVEMDAVGKDGRVISHAISEHVEDAGVHSE NATLMLPTQTISQGAIEKVKDATRKIAKAFAISGPFNVQFLVKGNDVLVNECNLRASR SFPSVSKTLGVDFIDVATKVLIGENVDEKHLPTLDHPIIPVDYVAIKAPMFSWPRLRD ADPILRCEMASTGEVACFGEGIHTAFLKAMLSTGFKIPQKGILIGIQQSFRPRFLGVA EQLHNEGFKLFATEATSDWLNANNVPANPVAWPSQEGQNPSLSSIRKLIRDGSIDLVI NLPNNNTKFVHDNYVIRRTAVDSGIPLLTNFQVTKLFAEAVQKSRKVDSKSLFHYRQY SAGKAA" BASE COUNT 1484 a 1102 c 1207 g 1422 t ORIGIN short arm of chromosome 2. 1 aagcaacctt aaaatgactg caccctccca gatttctttt acattaacta aaaagtctta 61 tcacacaatc tcataaaatt tatgtaattt catttaattt tagccacaaa tcatcaaaat 121 gacgaggatt ttgacagctt tcaaagtggt gaggacactg aagactggtt ttggctttac 181 caatgtgact gcacaccaaa aatggaaatt ttcaagacct ggcatcaggc tcctttctgt 241 caaggcacag acagcacaca ttgtcctgga agatggaact aagatgaaag gttactcctt 301 tggccatcca tcctctgttg ctggtgaagt ggtttttaat actggcctgg gagggtaccc 361 agaagctatt actgaccctg cctacaaagg acagattctc acaatggcca accctattat 421 tgggaatggt ggagctcctg atactacttc tctggatgaa ctgggactta gcaaatattt 481 ggagtctaat ggaatcaagg tttcaggttt gctggtgctg gattatagta aagactacaa 541 ccactggctg gctaccaaga gtttagggca atggctacag gaagaaaagg ttcctgcaat 601 ttatggagtg gacacaagaa tgctgactaa aataattcgg gataagggta ccatgcttgg 661 gaagattgaa tttgaaggtc agcctgtgga ttttgtggat ccaaataaac agaatttgat 721 tgctgaggtt tcaaccaagg atgtcaaagt gtacggcaaa ggaaacccca caaaagtggt 781 agctgtagac tgtgggatta aaaacaatgt aatccgcctg ctagtaaagc gaggagctga 841 agtgcactta gttccctgga accatgattt caccaagatg gagtatgatg ggattttgat 901 cgcgggagga ccggggaacc cagctcttgc agaaccacta attcagaatg ttcagaagat 961 tttggagagt gatcgcaagg agccattgtt tggaatcagt acaggaaact taataacagg 1021 attggctgct ggtgccaaaa cctacaagat gtccatggcc aacagagggc agaatcagcc 1081 tgttttgaat atcacaaaca aacaggcttt cattactgct cagaatcatt gctatgcctt 1141 ggacaacacc ctccctgctg gctggaaacc actttttgtg aatgtcaacg atcaaacaaa 1201 tgaggggatt atgcatgaga gcaaaccctt cttcgctgtg cagttccacc cagaggtcac 1261 cccggggcca atagacactg agtacctgtt tgattccttt ttctcactga taaagaaagg 1321 aaaagctacc accattacat cagtcttacc gaagccagca ctagttgcat ctcgggttga 1381 ggtttccaaa gtccttattc taggatcagg aggtctgtcc attggtcagg ctggagaatt 1441 tgattactca ggatctcaag ctgtaaaagc catgaaggaa gaaaatgtca aaactgttct 1501 gatgaaccca aacattgcat cagtccagac caatgaggtg ggcttaaagc aagcggatac 1561 tgtctacttt cttcccatca cccctcagtt tgtcacagag gtcatcaagg cagaacagcc 1621 agatgggtta attctgggca tgggtggcca gacagctctg aactgtggag tagaactatt 1681 caagagaggt gtgctcaagg aatatggtgt gaaagtcctg ggaacttcag ttgagtccat 1741 tatggctacg gaagacaggc agctgttttc agataaacta aatgagatca atgaaaagat 1801 tgctccaagt tttgcagtgg aatcgattga ggatgcactg aaggcagcag acaccattgg 1861 ctacccagtg atgatccgtt ccgcctatgc actgggtggg ttaggctcag gcatctgtcc 1921 caacagagag actttgatgg acctcagcac aaaggccttt gctatgacca accaaattct 1981 ggtggagaag tcagtgacag gttggaaaga aatagaatat gaagtggttc gagatgctga 2041 tgacaattgt gtcactgtct gtaacatgga aaatgttgat gccatgggtg ttcacacagg 2101 tgactcagtt gttgtggctc ctgcccagac actctccaat gccgagtttc agatgttgag 2161 acgtacttca atcaatgttg ttcgccactt gggcattgtg ggtgaatgca acattcagtt 2221 tgcccttcat cctacctcaa tggaatactg catcattgaa gtgaatgcca agatgtcccc 2281 gaactctgct ctggcctcca aaacgactgg ctacccattg gcattcattg ctgcaaagat 2341 tgccctagga atcccacttc caggaattaa gaacgtcgta tccgggaaga catcagcctg 2401 ttttgaacct agcctggatt acatggtcac caagattccc cgctgggatc ttgaccgttt 2461 tcatggaaca tctagccgaa ttggtagctc tatgaaaagt gtaggagagg tcatggctat 2521 tggtcgtacc tttgaggaga gtttccagaa agctttacgg atgtgccacc catctataga 2581 gggtttcact ccccgtctcc caatgaacaa agaatggcca tcgaatttag atcttagaaa 2641 agagttgtct gaaccaagca gcacgcgtat ctatgccatt gccaaggcca ttgatgacaa 2701 catgtccctt gatgagattg agaagctcac atacattgac aagtggtttt tgtataagat 2761 gcgtgatatt ttaaacatgg aaaagacact gaaaggcctc aacagtgagt ccatgacaga 2821 agaaaccctg aaaagggcaa aggagattgg gttctcagat aagcagattt caaaatgcct 2881 tgggctcact gaggcccaga caagggagct gaggttaaag aaaaacatcc acccttgggt 2941 taaacagatt gatacactgg ctgcagaata cccatcagta acaaactatc tctatgttac 3001 ctacaatggt caggagcatg atgtcaattt tgatgaccat ggaatgatgg tgctaggctg 3061 tggtccatat cacattggca gcagtgtgga atttgattgg tgtgctgtct ctagtatccg 3121 cacactgcgt caacttggca agaagacggt ggtggtgaat tgcaatcctg agactgtgag 3181 cacagacttt gatgagtgtg acaaactgta ctttgaagag ttgtccttgg agagaatcct 3241 agacatctac catcaggagg catgtggtgg ctgcatcata tcagttggag gccagattcc 3301 aaacaacctg gcagttcctc tatacaagaa tggtgtcaag atcatgggca caagccccct 3361 gcagatcgac agggctgagg atcgctccat cttctcagct gtcttggatg agctgaaggt 3421 ggctcaggca ccttggaaag ctgttaatac tttgaatgaa gcactggaat ttgcaaagtc 3481 tgtggactac ccctgcttgt tgaggccttc ctatgttttg agtgggtctg ctatgaatgt 3541 ggtattctct gaggatgaga tgaaaaaatt cctagaagag gcgactagag tttctcaggc 3601 cacgccagtg gtgctgacaa aatttgttga aggggcccga gaagtagaaa tggacgctgt 3661 tggcaaagat ggaagggtta tctctcatgc catctctgaa catgttgaag atgcaggtgt 3721 ccactcggag aatgccactc tgatgctgcc cacacaaacc atcagccaag gggccattga 3781 aaaggtgaag gatgctaccc ggaagattgc aaaggctttt gccatctctg gtccattcaa 3841 cgtccaattt cttgtcaaag gaaatgatgt cttggtgaat gagtgtaact tgagagcttc 3901 tcgatccttc ccctctgttt ccaagactct tggggttgac ttcattgatg tggccaccaa 3961 ggtgttgatt ggagagaatg ttgatgagaa acatcttcca acattggacc atcccataat 4021 tcctgttgac tatgttgcaa ttaaggctcc catgttttcc tggccccggt tgagggatgc 4081 tgaccccatt ctgagatgtg agatggcttc cactggagag gtggcttgct ttggtgaagg 4141 tattcataca gccttcctaa aggcaatgct ttccacagga tttaagatac cccagaaagg 4201 catcctgata ggcatccagc aatcattccg gccaagattc cttggtgtgg ctgaacaatt 4261 acacaatgaa ggtttcaagc tgtttgccac ggaagccaca tcagactggc tcaacgccaa 4321 caatgtccct gccaacccag tggcatggcc gtctcaagaa ggacagaatc ccagcctctc 4381 ttccatcaga aaattgatta gagatggcag cattgaccta gtgattaacc ttcccaacaa 4441 caacactaaa tttgtccatg ataattatgt gattcggagg acagctgttg atagtggaat 4501 ccctctcctc actaattttc aggtgaccaa actttttgct gaagctgtgc agaaatctcg 4561 caaggtggac tccaagagtc ttttccacta caggcagtac agtgctggaa aagcagcata 4621 gagatgcaga caccccagcc ccattattaa atcaacctga gccacatgtt atataaagga 4681 actgattcac aactttctca gagatgaata ttgataacta aacttcattt cagtttactt 4741 tgttatgcct taatattctg tgtcttttgc aattaaattg tcagtcactt cttcaaaacc 4801 ttacagtcct tcctaaggtt actcttcatg agattcatcc atttactaat actgtatttt 4861 tggtggacta ggcttgccta tgtgcttatg tgtagctttt tactttttat ggtgtgatta 4921 atggtgatca aggtaggaaa agttgtgttc tattttcttg aactccttct atactttaag 4981 atactctatt tttaaaacac tatctgcaaa ctcaggacac tttaacaggg cagaatactc 5041 taaaaacttg ataaaattaa atatagattt aatttatgaa ccttccatca tgtgtttgtg 5101 tattgcttct ttttggatcc tcattctcac ccatttggct aatccaggaa tattgttatc 5161 ccttcccatt atattgaagt tgagaaatgt gacagagcat ttagagtatg aattc // LOCUS HUMCRABP 924 bp mRNA PRI 01-NOV-1994 DEFINITION Human cellular retinoic acid-binding protein II (CRABP) mRNA, complete cds. ACCESSION M68867 NID g181025 KEYWORDS retinoic acid-binding protein II. SOURCE Homo sapiens (tissue library: lambda f1.1) male young skin cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 924) AUTHORS Astrom,A., Tavakkol,A., Pettersson,U., Cromie,M., Elder,J.T. and Voorhees,J.J. TITLE Molecular cloning of two human cellular retinoic acid-binding proteins (CRABP). Retinoic acid-induced expression of CRABP-II but not CRABP-I in adult human skin in vivo and in skin fibroblasts in vitro JOURNAL J. Biol. Chem. 266 (26), 17662-17666 (1991) MEDLINE 91373396 FEATURES Location/Qualifiers source 1..924 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="fibroblast" /dev_stage="young" /sex="male" /tissue_type="skin" /tissue_lib="lambda f1.1" /map="3p11-qter" gene 99..515 /gene="RBP2" CDS 99..515 /gene="RBP2" /codon_start=1 /db_xref="GDB:G00-119-548" /product="retinoic acid binding protein II" /db_xref="PID:g181026" /translation="MPNFSGNWKIIRSENFEELLKVLGVNVMLRKIAVAAASKPAVEI KQEGDTFYIKTSTTVRTTEINFKVGEEFEEQTVDGRPCKSLVKWESENKMVCEQKLLK GEGPKTSWTRELTNDGELILTMTADDVVCTRVYVRE" BASE COUNT 218 a 268 c 244 g 194 t ORIGIN 1 cctgacgacc cggcgacggc gacgtctctt ttgactaaaa gacagtgtcc agtgctccag 61 cctaggagtc tacggggacc gcctcccgcg ccgccaccat gcccaacttc tctggcaact 121 ggaaaatcat ccgatcggaa aacttcgagg aattgctcaa agtgctgggg gtgaatgtga 181 tgctgaggaa gattgctgtg gctgcagcgt ccaagccagc agtggagatc aaacaggagg 241 gagacacttt ctacatcaaa acctccacca ccgtgcgcac cacagagatt aacttcaagg 301 ttggggagga gtttgaggag cagactgtgg atgggaggcc ctgtaagagc ctggtgaaat 361 gggagagtga gaataaaatg gtctgtgagc agaagctcct gaagggagag ggccccaaga 421 cctcgtggac cagagaactg accaacgatg gggaactgat cctgaccatg acggcggatg 481 acgttgtgtg caccagggtc tacgtccgag agtgagtggc cacaggtaga accgcggccg 541 aagcccacca ctggccatgc tcaccgccct gcttcactgc cccctccgtc ccaccccctc 601 cttctaggat agcgctcccc ttaccccagt cacttctggg ggtcactggg atgcctcttg 661 cagggtcttg ctttctttga cctcttctct cctcccctac accaacaaag aggaatggct 721 gcaagagccc agatcaccca ttccgggttc actccccgcc tccccaagtc agcagtccta 781 gccccaaacc agcccagagc agggtctctc taaaggggac ttgagggcct gagcaggaaa 841 gactggccct ctagcttcta ccctttgtcc ctgtagccta tacagtttag aatatttatt 901 tgttaatttt attaaaatgc ttta // LOCUS HUMCRCMUT 4181 bp mRNA PRI 01-NOV-1994 DEFINITION Human colorectal mutant cancer protein mRNA, complete cds. ACCESSION M62397 NID g181034 KEYWORDS colorectal cancer protein. SOURCE Human brain, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4181) AUTHORS Kinzler,K.W., Nilbert,M.C., Vogelstein,B., Bryan,T.M., Levy,D.B., Smith,K.J., Preisinger,A.C., Hamilton,S.R., Nishisho,I., Miki,Y., Miyoshi,Y. and Nakamura,Y. TITLE Identification of a gene located at chromosome 5q21 that is mutated in colorectal cancers [see comments] JOURNAL Science 251 (4999), 1366-1370 (1991) MEDLINE 91164855 FEATURES Location/Qualifiers source 1..4181 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /map="5q21-q22" gene 221..2710 /gene="MCC" CDS 221..2710 /gene="MCC" /codon_start=1 /db_xref="GDB:G00-128-163" /product="colorectal mutant cancer protein" /db_xref="PID:g181035" /translation="MNSGVAMKYGNDSSAELSELHSAALASLKGDIVELNKRLQQTER ERDLLEKKLAKAQCEQSHLMREHEDVQERTTLRYEERITELHSVIAELNKKIDRLQGT TIREEDEYSELRSELSQSQHEVNEDSRSMDQDQTSVSIPENQSTMVTADMDNCSDLNS ELQRVLTGLENVVCGRKKSSCSLSVAEVDRHIEQLTTASEHCDLAIKTVEEIEGVLGR DLYPNLAEERSRWEKELAGLREENESLTAMLCSKEEELNRTKATMNAIREERDRLRRR VRELQTRLQSVQATGPSSPGRLTSTNRPINPSTGELSTSSSSNDIPIAKIAERVKLSK TRSESSSSDRPVLGSEISSIGVSSSVAEHLAHSLQDCSNIQEIFQTLYSHGSAISESK IREFEVETERLNSRIEHLKSQNDLLTITLEECKSNAERMSMLVGKYESNATALRLALQ YSEQCIEAYELLLALAESEQSLILGQFRAAGVGSSPGDQSGDENITQMLKRAHDCRKT AENAAKALLMKLDGSCGGAFAVAGCSVQPWESLSSNSHTSTTSSTASSCDTEFTKEDE QRLKDYIQQLKNDRAAVKLTMLELESIHIDPLSYDVKPRGDSQRLDLENAVLMQELMA MKEEMAELKAQLYLLEKEKKALELKLSTREAQEQAYLVHIEHLKSEVEEQKEQRMRSL SSTSSGSKDKPGKECADAASPALSLAELRTTCSENELAAEFTNAIRREKKLKARVQEL VSALERLTKSSEIRHQQSAEFVNDLKRANSNLVAAYEKAKKKHQNKLKKLESQMMAMV ERHETQVRMLKQRIALLEEENSRPHTNETSL" BASE COUNT 1127 a 1039 c 1130 g 885 t ORIGIN Chromosome 5q21. 1 cctcctgcag caatggctcg tccgtgaaac gcgagccacg gctgctcttt ttaagagtgc 61 ctgcatcctc cgtttgcgct tcgcaactgt cctgggtgaa aatggctgtc tagactaaaa 121 tgtggcagaa gggaccaagc agtggatatt gagcctgtga agtccaactc ttaagctccg 181 agacctgggg gactgagagc ccagctctga aaagtgcatc atgaattccg gagttgccat 241 gaaatatgga aacgactcct cggccgagct gagtgagctc cattcagcag ccctggcatc 301 actaaaggga gatatagtgg aacttaataa acgtctccag caaacagaga gggaacggga 361 ccttctggaa aagaaattgg ccaaggcaca gtgcgagcag tcccacctca tgagagagca 421 tgaggatgtc caggagcgaa cgacgcttcg ctatgaggaa cgcatcacag agctccacag 481 cgtcattgcg gagctcaaca agaagataga ccgtctgcaa ggcaccacca tcagggagga 541 agatgagtac tcagaactgc gatcagaact cagccagagc caacacgagg tcaacgagga 601 ctctcgaagc atggaccaag accagacctc tgtctctatc cccgaaaacc agtctaccat 661 ggttactgct gacatggaca actgcagtga cctgaactca gaactgcaga gggtgctgac 721 agggctggag aatgttgtct gcggcaggaa gaagagcagc tgcagcctct ccgtggccga 781 ggtggacagg cacattgagc agctcaccac agccagcgag cactgtgacc tggctattaa 841 gacagtcgag gagattgagg gggtgcttgg ccgggacctg tatcccaacc tggctgaaga 901 gaggtctcgg tgggagaagg agctggctgg gctgagggaa gagaatgaga gcctgactgc 961 catgctgtgc agcaaagagg aagaactgaa ccggactaag gccaccatga atgccatccg 1021 ggaagagcgg gaccggctcc ggaggcgggt cagagagctt caaactcgac tacagagcgt 1081 gcaggccaca ggtccctcca gccctggccg cctcacttcc accaaccgcc cgattaaccc 1141 cagcactggg gagctgagca caagcagcag cagcaatgac attcccatcg ccaagattgc 1201 tgagagggtg aagctatcaa agacaaggtc cgaatcgtca tcatctgatc ggccagtcct 1261 gggctcagaa atcagtagca taggggtatc cagcagtgtg gctgaacacc tggcccactc 1321 acttcaggac tgctccaata tccaagagat tttccaaaca ctctactcac acggatctgc 1381 catctcagaa agcaagatta gagagtttga ggtggaaaca gaacggctga atagccggat 1441 tgagcacctc aaatcccaaa atgacctcct gaccataacc ttggaggaat gtaaaagcaa 1501 tgctgagagg atgagcatgc tggtgggaaa atacgaatcc aatgccacag cgctgaggct 1561 ggccttgcag tacagcgagc agtgcatcga agcctacgaa ctcctcctgg cgctggcaga 1621 gagtgagcag agcctcatcc tggggcagtt ccgagcggcg ggcgtggggt cctcccctgg 1681 agaccagtcg ggggatgaaa acatcactca gatgctcaag cgagctcatg actgccggaa 1741 gacagctgag aacgctgcca aggccctgct catgaagctg gacggcagct gtgggggagc 1801 ctttgccgtg gccggctgca gcgtgcagcc ctgggagagc ctttcctcca acagccacac 1861 cagcacaacc agctccacag ccagtagttg cgacaccgag ttcactaaag aagacgagca 1921 gaggctgaag gattatatcc agcagctcaa gaatgacagg gctgcggtca agctgaccat 1981 gctggagctg gaaagcatcc acatcgatcc tctcagctat gacgtcaagc ctcggggaga 2041 cagccagagg ctggatctgg aaaacgcagt gcttatgcag gagctcatgg ccatgaagga 2101 ggagatggcc gagttgaagg cccagctcta cctactggag aaagagaaga aggccctgga 2161 gctgaagctg agcacgcggg aggcccagga gcaggcctac ctggtgcaca ttgagcacct 2221 gaagtccgag gtggaggagc agaaggagca gcggatgcga tccctcagct ccaccagcag 2281 cggcagcaaa gataaacctg gcaaggagtg tgctgatgct gcctccccag ctctgtccct 2341 agctgaactc aggacaacgt gcagcgagaa tgagctggct gcggagttca ccaacgccat 2401 tcgtcgagaa aagaagttga aggccagagt tcaagagctg gtgagtgcct tggagagact 2461 caccaagagc agtgaaatcc gacatcagca atctgcagag ttcgtgaatg atctaaagcg 2521 ggccaacagc aacctggtgg ctgcctatga gaaagcaaag aaaaagcatc aaaacaaact 2581 gaagaagtta gagtcgcaga tgatggccat ggtggagaga catgagaccc aagtgaggat 2641 gctcaagcaa agaatagctc tgctagagga ggagaactcc aggccacaca ccaatgaaac 2701 ttcgctttaa tcagcactca cgcaccggag ttctgcccat gggaagtaaa ctgcagcagg 2761 ccactgggga cagaagggcc catgtacttg ttgggaggag gaggaaaggg aaggctggca 2821 ggtaggtcgg cacttggaca atggagtgcc ccaactcaac ccttggggtg actggccatg 2881 gtgacattgt ggactgtatc cagaggtgcc cgctcttccc tcctgggccc acaacagcgt 2941 gtaaacacat gttctgtgcc tgctcagcag agcctcgttt ctgctttcag cactcactct 3001 ccccctcctc ttctggtctg gcggctgtgc atcagtggga tcccagacat ttgtttctgt 3061 aagattttcc attgtatcct ctttttggta gatgctgggc tcatcttcta gaatctcgtt 3121 tctcctcttt cctcctgctt catgggaaaa cagacctgtg tgtgcctcca gcatttaaaa 3181 ggactgctga tttgtttact acagcaaggc tttggtttcc aagtcccggg tctcaacttt 3241 aagatagagg cggccataag aggtgatctc tgggagttat aggtcatggg aagagcgtag 3301 acaggtgtta cttacagtcc cagatacact aaagttacaa acagaccacc accaggactg 3361 tgcctgaaca attttgtatt gagagaataa aaacttcctt caatcttcat tttggaggca 3421 gggctgggaa gggagcgctc tcttgattct gggatttctc cctctcagtg gagccttatt 3481 aatatccaag acttagagct gggaatcttt ttgatacctg tagtggaact aaaattctgt 3541 caggggtttc ttcaagagct gagaaacatt attagcactt cccgccccag ggcactacat 3601 aattgctgtt ctgctgaatc aaatctcttc cacatgggtg catttgtagc tctggacctg 3661 tctctaccta aggacaagac actgaggaga tactgaacat tttgcaaaac ttatcacgcc 3721 tacttaagag tgctgtgtaa cccccagttc aagacttagc tcctgttgtc atgacgggga 3781 cagagtgagg gaatggtagt taaggcttct tttttgcccc cagatacatg gtgatggtta 3841 gcatatggtg cttaaaaggt taaatttcaa gcaaaatgct tacagggcta ggcagtacca 3901 aagtaactga attatttcag gaaggtcttc aatcttaaaa caaattcatt attctttttc 3961 agttttacct cttctctctc agttctacac tgatacactt gaaggaccat ttactgtttt 4021 tttctgtagc accagagaat ccatccaaag ttccctatga aaaatgtgtt ccattgccat 4081 agctgactac aaattaaagt tgaggaggtt tctgcataga gtctttatgt ccataagcta 4141 cgggtaggtc tattttcaga gcatgataca aattccacag g // LOCUS HUMCREB2A 1241 bp mRNA PRI 01-NOV-1994 DEFINITION Human cAMP response element regulatory protein (CREB2) mRNA, complete cds. ACCESSION M86842 NID g181040 KEYWORDS cAMP responsive element regulatory protein. SOURCE Homo sapiens (tissue library: of J.Leiden) adult cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1241) AUTHORS Karpinski,B.A., Morle,G.D., Huggenvik,J., Uhler,M.D. and Leiden,J.M. TITLE Molecular cloning of human CREB-2: an ATF/CREB transcription factor that can negatively regulate transcription from the cAMP response element JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (11), 4820-4824 (1992) MEDLINE 92279218 FEATURES Location/Qualifiers source 1..1241 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Jurkat" /cell_type="T lymphocyte" /dev_stage="adult" /tissue_lib="of J.Leiden" /map="2q32" gene 108..1163 /gene="CREB2" CDS 108..1163 /gene="CREB2" /codon_start=1 /db_xref="GDB:G00-128-011" /product="cAMP response element regulatory protein" /db_xref="PID:g181041" /translation="MTEMSFLSSEVLVGDLMSPFDPSGLGAEESLGLLDDYLEVAKHF KPHGFSSDKAKAGSSEWLAVDGLVSPSNNSKEDAFSGTDWMLEKMDLKEFDLDALLGI DDLETMPDDLLTTLDDTCDLFAPLVQETNKQPPQTVNPIGHLPESLTKPDQVAPFTFL QPLPLSPGVLSSTPDHSFSLELGSEVDITEGDRKPDYTAYVAMIPQCIKEEDTPSDND SGICMSPESYLGSPQHSPSTRGSPNRSLPSPGVLCGSARPKPYDPPGEKMVAAKVKGE KLDKKLKKMEQNKTAATRYRQKKRAEQEALTGECKELEKKNEALKERADSLAKEIQYL KDLIEEVRKARGKKRVP" polyA_site 1241 /gene="CREB2" /note="G00-128-011" BASE COUNT 328 a 310 c 336 g 267 t ORIGIN Chromosome 2q32. 1 gaattcgcgg ccgccgcttc tcacggcatt cagcagcagc gttgctgtaa ccgacaaaga 61 caccttcgaa ttaagcacat tcctcgattc cagcaaagca ccgcaacatg accgaaatga 121 gcttcctgag cagcgaggtg ttggtggggg acttgatgtc ccccttcgac ccgtcgggtt 181 tgggggctga agaaagccta ggtctcttag atgattacct ggaggtggcc aagcacttca 241 aacctcatgg gttctccagc gacaaggcta aggcgggctc ctccgaatgg ctggctgtgg 301 atgggttggt cagtccctcc aacaacagca aggaggatgc cttctccggg acagattgga 361 tgttggagaa aatggatttg aaggagttcg acttggatgc cctgttgggt atagatgacc 421 tggaaaccat gccagatgac cttctgacca cgttggatga cacttgtgat ctctttgccc 481 ccctagtcca ggagactaat aagcagcccc cccagacggt gaacccaatt ggccatctcc 541 cagaaagttt aacaaaaccc gaccaggttg cccccttcac cttcttacaa cctcttcccc 601 tttccccagg ggtcctgtcc tccactccag atcattcctt tagtttagag ctgggcagtg 661 aagtggatat cactgaagga gataggaagc cagactacac tgcttacgtt gccatgatcc 721 ctcagtgcat aaaggaggaa gacacccctt cagataatga tagtggcatc tgtatgagcc 781 cagagtccta tctggggtct cctcagcaca gcccctctac caggggctct ccaaatagga 841 gcctcccatc tccaggtgtt ctctgtgggt ctgcccgtcc caaaccttac gatcctcctg 901 gagagaagat ggtagcagca aaagtaaagg gtgagaaact ggataagaag ctgaaaaaaa 961 tggagcaaaa caagacagca gccactaggt accgccagaa gaagagggcg gagcaggagg 1021 ctcttactgg tgagtgcaaa gagctggaaa agaagaacga ggctctaaaa gagagggcgg 1081 attccctggc caaggagatc cagtacctga aagatttgat agaagaggtc cgcaaggcaa 1141 gggggaagaa aagggtcccc tagttgagga tagtcaggag cgtcaatgtg cttgtacata 1201 gagtgctgta gctgtgtgtt ccaataaatt attttgtagg g // LOCUS HUMCREBPA 2637 bp mRNA PRI 01-NOV-1994 DEFINITION Homo sapiens cAMP response element-binding protein (CRE-BP1) mRNA, complete cds. ACCESSION L05515 NID g181049 KEYWORDS cAMP response element-binding protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2637) AUTHORS Nomura,N., Zu,Y.L., Maekawa,T., Tabata,S., Akiyama,T. and Ishii,S. TITLE Isolation and characterization of a novel member of the gene family encoding the cAMP response element-binding protein CRE-BP1 JOURNAL J. Biol. Chem. 268 (6), 4259-4266 (1993) MEDLINE 93179432 FEATURES Location/Qualifiers source 1..2637 /organism="Homo sapiens" /db_xref="taxon:9606" /map="2q32-q34" gene 391..1917 /gene="CREB1" CDS 391..1917 /gene="CREB1" /standard_name="'CRE-BPa protein'" /codon_start=1 /db_xref="GDB:G00-119-803" /product="cAMP response element-binding protein" /db_xref="PID:g181050" /translation="MIYEESKMNLEQERPSVCSAPGCSQRFPTEDHLMIHRHKHEMTL KFPSIKTDNMLSDQTPTPTRFLKNCEEVGLFSELDCSLEHEFRKAQEEESSKRNISMH NAVGGAMTGPGTHQLSSARLPNHDTNVVIQQAMPSPQSSSVITQAPSTNRQIGPVPGS LSSLLHLHNRQRQPMPASMPGTLPNPTMPGSSAVLMPMERQMSVNSSIMGIEGPNLSN PCASPQVQPMHSEAKMRLKAALTHHPAAMSNGNMNTMGHMMEMMGSRQDQTPHHHMHS HPHQHQTLPPHHPYPHQHQHPAHHPHPQPHHQQNHPHHHSHSHLHAHPAHHQTSPHPP LHTGNQAQVSPATQQMQPTQTIQPPQPTGGRRRRVVDEDPDERRRKFLERNRAAATRC RQKRKVWVMSLEKKAEELTQTNMQLQNEVSMLKNEVAQLKQLLLTHKDCPITAMQKES QGYLSPESSPPASPVPACSQQQVIQHNTITTSSSVSEVVGSSTLSQLTTHRTDLNPIL " BASE COUNT 753 a 704 c 534 g 646 t ORIGIN 1 aacatttaca acaaagttga ttctgtgtag ggttggaggc tagacagttc cacaaatttt 61 tagtcacatt ttccatgtca gttaaatcta gggagttcaa gactactgga aaaattagtc 121 tcattactaa aagaaactta gagaccgagg gaggtaccag agtctaggag gtacctctgg 181 gttgcagaag taattgtaaa ataccagacc tgttcttttt actaaaagct agtttcacta 241 tcttctggtc tgaaatactg aggcaaatac tcaagactta ttttcttcct aatcttgctg 301 gtgaaacaga agttactaga aagaaaggaa gaaaaaactt gatttggtga ctgcaggaag 361 caacacgttg ctgcttttat tctacagata atgatttatg aggaatccaa gatgaatttg 421 gagcaggaga ggccgtctgt ctgcagtgcc ccaggctgct cccagcgctt cccaacagag 481 gaccatctga tgattcatag gcacaaacat gaaatgactt tgaagtttcc ttcaataaag 541 acagacaata tgttatcaga tcaaactccg accccaacga gattcctgaa gaactgcgag 601 gaggtgggcc tcttcagcga gctggactgc tccctggagc acgagttcag gaaggctcag 661 gaagaggaga gcagcaagcg gaatatctcg atgcataatg cagttggtgg ggccatgacg 721 gggcccggaa ctcaccagct tagcagcgct cggctgccca accatgacac caacgttgtg 781 attcagcaag ccatgccgtc gcctcagtcc agctctgtca tcactcaggc accttccacc 841 aaccgccaga tcgggcctgt cccaggctct ctatcttctc tgctacatct ccacaacaga 901 cagagacagc ccatgccagc ctccatgcct gggaccctgc ccaaccctac aatgccagga 961 tcttccgccg tcttgatgcc aatggagcga caaatgtcag tgaactccag catcatgggg 1021 atcgaaggtc caaatctcag caacccctgt gcttctcccc aggtccagcc aatgcattca 1081 gaagccaaaa tgaggttgaa ggctgcattg actcaccacc ctgctgccat gtcaaatggg 1141 aacatgaaca ccatgggaca catgatggag atgatgggct cccggcagga ccagacgcca 1201 caccatcaca tgcactcgca cccgcatcag caccagacac tgccacccca tcacccttac 1261 ccacaccagc accagcaccc agcacaccat cctcaccctc aaccccatca ccagcagaac 1321 catccacatc accactccca ttcccacctt catgcacacc cagcacatca ccagacctcg 1381 ccacatccgc ccctgcacac cggcaaccaa gcacaggttt caccagcaac acaacagatg 1441 cagccaaccc agacaataca gccaccccag cccacagggg ggcgccggcg aagggtggta 1501 gacgaggatc cggacgagag gcggcggaaa tttctggaac ggaaccgggc agctgccacc 1561 cgctgcagac agaagaggaa ggtctgggtg atgtcattgg aaaagaaagc agaagaactc 1621 acccagacaa acatgcagct tcagaatgaa gtgtctatgt tgaaaaatga ggtggcccag 1681 ctgaaacagt tgttgttaac acataaagac tgcccaataa cagccatgca gaaagaatca 1741 caaggatatc taagtccaga gagtagccct cctgctagtc ctgtcccagc ttgctcccag 1801 caacaagtca tccagcataa taccatcact acttcctcat cggtcagcga ggtggtagga 1861 agctccaccc tcagccagct caccactcac agaacagacc tgaatccgat tctttaaaat 1921 gcaccatcag acctggcctc caagaagagc tgtagcgtac catgcgtcct ttcttttaag 1981 ggcattttta gaattaactc agacctggaa gactcctcag ttcttcaaag actggctttc 2041 atttttatag ttattatgga aatgttgtct tttatactta gttatataag aaaaagggag 2101 ttatgcaatt aatatctatc agcttgggaa acgctttggt gcttttctcc agttttctgg 2161 taccagttac ttgtttataa actgaacctt ttctgtatat agccatggtt tcattcttat 2221 cagtccaacc ctttgcctga aacattgaat cttgttaaac cacagctttt agctaaaatg 2281 aggtatacct agatgtcaag taagacagat ccaaggtaac tgggtaggaa atcttttgac 2341 atcttaactc atgttgagtt tgtgctgtgg tgtcaccaga attccagata aacacacagc 2401 ctttcccata cctttttttt tcttactata aaatattata agatccattg atgtccaaat 2461 aataccaccg agcatctctt cacctctcct cctcttggtc cacttgctaa tgcccagttt 2521 tcttctccat ttccactttt tcttaggctc cctatttact attcattttg acttccttct 2581 gttttatttt tttcccttta gcattgcatg tgaataagaa aataatgttt aaagaaa // LOCUS HUMCRK 1553 bp mRNA PRI 19-NOV-1992 DEFINITION Human mRNA for CRK-II, complete cds. ACCESSION D10656 NID g219554 KEYWORDS CRK-II; SH2 domain; SH3 domain; proto-oncogene. SOURCE Homo sapiens placenta cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1553) AUTHORS Matsuda,M., Tanaka,S., Nagata,S., Kojima,A., Kurata,T. and Shibuya,M. TITLE Two species of human CRK cDNA encode proteins with distinct biological activities JOURNAL Mol. Cell. Biol. 12 (8), 3482-3489 (1992) MEDLINE 92334347 REFERENCE 2 (bases 1 to 1553) AUTHORS Matsuda,M. TITLE Direct Submission JOURNAL Submitted (04-MAR-1992) to the DDBJ/EMBL/GenBank databases. Michiyuki Matsuda, National Institute of Health, Department of Pathology; 2-10-35 Kamiosaki, Shinagawa-ku, Tokyo 141, Japan (Tel:03-3444-2181(ex.461), Fax:03-3446-6286) COMMENT Submitted (04-MAR-1992) to DDBJ by: Michiyuki Matsuda Department of Pathology National Institute of Health 2-10-35 Kamiosaki Shinagawa-ku, Tokyo 141 Japan Phone: 03-3444-2181 x461 Fax: 03-3446-6286. FEATURES Location/Qualifiers source 1..1553 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" CDS 106..1020 /codon_start=1 /product="CRK-II" /db_xref="PID:d1001980" /db_xref="PID:g219555" /translation="MAGNFDSEERSSWYWGRLSRQEAVALLQGQRHGVFLVRDSSTSP GDYVLSVSENSRVSHYIINSSGPRPPVPPSPAQPPPGVSPSRLRIGDQEFDSLPALLE FYKIHYWDTTTLIEPVSRSRQGSGVILRQEEAEYVRALFDFNGNDEEDLPFKKGDILR IRDKPEEQWWNAEDSEGKRGMIPVPYVEKYRPASASVSALIGGNQEGSHPQPLGPPEP GPYAQPSVNTPLPNLQNGPIYARVIQKRVPNAYDKTALALEVGELVKVTKINVSGQWE GGCNGKRGHFPFTHVRLLDQQNPDEDFS" misc_feature 136..417 /note="SH2 domain (regulatory domain of crk oncogene product)" misc_feature 502..657 /note="SH3 domain (regulatory domain of crk oncogene product)" misc_feature 817..975 /note="SH3 domain" BASE COUNT 359 a 387 c 444 g 363 t ORIGIN 1 gaattccgaa gctgaaaccg gagccggtcc gctgggcggc gggcgccggg ggccggaggg 61 gcgcgcgcgg cggcggcacc ccagcgttta ggcgcggagg cagccatggc gggcaacttc 121 gactcggagg agcggagtag ctggtactgg gggcggttga gtcggcagga ggcggtggcg 181 ctgctgcagg gccagcggca cggggtgttc ctggtgcggg actcgagcac cagccccggg 241 gactatgtgc tcagcgtctc agagaactcg cgcgtctccc actacatcat caacagcagc 301 ggcccgcgcc cgccggtgcc accgtcgccc gcccagcctc cgcccggggt gagcccctcc 361 agactccgaa taggagatca agagtttgat tcattgcctg ctttactgga attctacaaa 421 atacactatt gggacactac aacgttgata gaaccagttt ccagatccag gcagggtagt 481 ggagtgattc tcaggcagga ggaggcggag tatgtgcgag ccctctttga ctttaatggg 541 aatgatgagg aagatcttcc ctttaagaaa ggagacatct tgagaatccg ggacaagcct 601 gaagagcagt ggtggaatgc ggaggacagc gaaggcaaga gagggatgat tccagtccct 661 tacgtcgaga agtatagacc tgcctccgcc tcagtatcgg ctctgattgg aggtaaccag 721 gagggttccc acccacagcc actgggcccc ccggagcctg gcccctatgc ccaacccagc 781 gtcaacactc cgctccctaa cctccagaat gggcccatat atgccagggt tatccagaag 841 cgagtcccca atgcctacga caagacagcc ttggctttgg aggtcggtga gctggtaaag 901 gttacgaaga ttaatgtgag tggtcagtgg gaaggggggt gtaatggcaa acgaggtcac 961 ttcccattca cacatgtccg tctgctggat caacagaatc ccgatgagga cttcagctga 1021 gtatagttca acagttttgc tgacagatgg gaacaatctt tttttttttt ttttccaact 1081 gccatctata caattttctt acagatgtca aaagcagtct agtttatata agcattctgt 1141 tacctgtgat attttttaga ctgaactgct ccattcctag tcttaattac catattcagg 1201 gtacgaactg gagggcttgt gtgttagctt ctgaattggc aattggaggc ggtagtggtc 1261 gtgcctgtgt gtatcagaag ggataggtat cttgcctcct ttctctcagg cagtgcaaat 1321 caccctgtgg aaaaccgatg gacaggaagg agtgttacac actgcttacc ctgatttatt 1381 cagtggtttt gttttcattc tggaaccata ctatcaaatg gcgacagact gttccgttcc 1441 acccccgtga agagtaatca tgcaccgtgt gaatagtatc aagcaggatt gctttcattg 1501 tatggagcgt gaccagggta tgactcattc tgacattcag atcctaagaa ttc // LOCUS HUMCRPR 1797 bp mRNA PRI 15-SEP-1990 DEFINITION Human cysteine-rich peptide mRNA, complete cds. ACCESSION M33146 NID g181070 KEYWORDS cysteine-rich protein. SOURCE Human normal term placenta, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1797) AUTHORS Liebhaber,S.A., Emery,J.G., Urbanek,M., Wang,X. and Cooke,N.E. TITLE Characterization of a human cDNA encoding a widely expressed and highly conserved cysteine-rich protein with an unusual zinc-finger motif JOURNAL Nucleic Acids Res. 18, 3871-3879 (1990) MEDLINE 90326508 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.A.Liebhaber, 22-MAR-1990, for release after publication. FEATURES Location/Qualifiers source 1..1797 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 36..617 /note="cysteine-rich protein" /codon_start=1 /db_xref="PID:g181071" /translation="MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNL DSTTVAVHGEEIYCKSCYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGHRPTT NPNASKFAQKIGGSERCPRCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGLESTTLAD KDGEIYCKGCYAKNFGPKGFGFGQGAGALVHSE" BASE COUNT 405 a 507 c 493 g 392 t ORIGIN 1 cctgccgccc ctgcgccgcc gagccagctg ccagaatgcc gaactgggga ggaggcaaga 61 aatgtggggt gtgtcagaag acggtttact ttgccgaaga ggttcagtgc gaaggcaaca 121 gcttccataa atcctgcttc ctgtgcatgg tctgcaagaa gaatctggac agtaccactg 181 tggccgtgca tggtgaggag atttactgca agtcctgcta cggcaagaag tatgggccca 241 aaggctatgg ctacgggcag ggcgcaggca ccctcagcac tgacaagggg gagtcgctgg 301 gtatcaagca cgaggaagcc cctggccaca ggcccaccac caaccccaat gcatccaaat 361 ttgcccagaa gattggtggc tccgagcgct gcccccgatg cagccaggca gtctatgctg 421 cggagaaggt gattggtgct gggaagtcct ggcataaggc ctgctttcga tgtgccaagt 481 gtggcaaagg ccttgagtca accaccctgg cagacaagga tggcgagatt tactgcaaag 541 gatgttatgc taaaaacttc gggcccaagg gctttggttt tgggcaagga gctggggcct 601 tggtccactc tgagtgaggc caccatcacc caccacaccc tgcccactcc tgcgcttttc 661 atcgccattc cattcccagc agctttggag acctccagga ttatttctct gtcagccctg 721 ccacatatca ctaatgactt gaacttgggc atctggctcc ctttggtttg ggggtctgcc 781 tgaggtccca ccccactaaa gggctcccca ggcctgggat ctgacaccat caccagtagg 841 agacctcagt gttttgggtc taggtgagag caggcccctc tccccacacc tcgccccaca 901 gagctctgtt cttagcctcc tgtgctgcgt gtccatcatc agctgaccaa gacacctgag 961 gacacatctt ggcacccaga ggagcagcag caacaggctg gagggagagg gaagcaagac 1021 caagatgagg aggggggaag gctgggtttt ttggatctca gagattctcc tctgtgggaa 1081 agaggttgag cttcctggtg tccctcagag taagcctgag gagtcccagc ttagggagtc 1141 actattggag gcagagaggc atgcaggcgg ggtcctagga gcccctgctt ctccaggcct 1201 cttgcctttg agtctttgtg gaatggatag cctcccacta ggactgggag gagaataacc 1261 caggtcttaa ggaccccaaa gtcaggatgt tgtttgatct tctcaaacat ctagttccct 1321 gcttgatggg aggatcctaa tgaaatacct gaaacatata ttggcattta tcaatggctc 1381 aaatcttcat ttatctctgg ccttaaccct ggctcctgag gctgcggcca gcagagccca 1441 ggccagggct ctgttcttgc cacacctgct tgatcctcag atgtggaggg aggtaggcac 1501 tgcctcagtc ttcatccaaa cacctttccc tttgccctga gacctcagaa tcttcccttt 1561 aacccaagac cctgcctctt ccactccacc cttctccagg gacccttaga tcatcactcc 1621 acccctgcca ggccccaggt taggaatagt ggtgggagga aggggaaagg gctgggcctc 1681 accgctccca gcaactgaaa ggacaacact atctggagcc acccactgaa agggctgcag 1741 gcatgggctg tacccaagct gatttctcat ctggtcaata aagctgttta gaccaga // LOCUS HUMCRYB2B 721 bp mRNA PRI 18-MAR-1994 DEFINITION Human crystallin beta-B2 mRNA, complete cds. ACCESSION L10035 NID g401760 KEYWORDS crystallin beta-B2; structural protein. SOURCE Homo sapiens (human). ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 721) AUTHORS Chambers,C. and Russell,P. TITLE Sequence of the human lens beta B2-crystallin-encoding cDNA JOURNAL Gene 133 (2), 295-299 (1993) MEDLINE 94040827 FEATURES Location/Qualifiers source 1..721 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="lens" /tissue_lib="lambda gt11" /map="22q11.2-q12.1" gene 1..721 /gene="CRYB2B" 5'UTR 1..12 /gene="CRYB2B" CDS 13..630 /gene="CRYB2B" /codon_start=1 /function="structural protein" /product="crystallin beta-B2" /db_xref="PID:g401761" /translation="MASDHQTQAGKPQSLNPKIIIFEQENFQGHSHELNGPCPNLKET GVEKAGSVLVQAGPWVGYEQANCKGEQFVFEKGEYPRWDSWTSSRRTDSLSSLRPIKV DSQEHKIILYENPNFTGKKMEIIDDDVPSFHAHGYQEKVSSVRVQSGTWVGYQYPGYR GLQYLLEKGDYKDSSDFGAPHPQVQSVRRIRDMQWHQRGAFHPSN" 3'UTR 631..721 /gene="CRYB2B" polyA_site 704..709 /gene="CRYB2B" BASE COUNT 174 a 225 c 199 g 123 t ORIGIN 1 ggacagtcca ccatggcctc agatcaccag acccaggcgg gcaagccaca gtccctcaac 61 cccaagatca tcatctttga gcaggaaaac tttcaaggcc actcgcatga gctcaatggg 121 ccctgcccca acctgaagga aactggcgtg gagaaggcag gttctgtcct agtgcaggct 181 ggaccctggg tgggctatga acaggccaac tgcaagggcg agcagtttgt gtttgagaag 241 ggtgagtacc cccgctggga ctcatggacc agcagccgaa ggacggactc cctcagctcc 301 ctgaggccca tcaaagtgga cagccaagag cacaagatca tcctctatga aaaccccaac 361 ttcaccggga agaagatgga aatcatagat gacgatgtac ccagcttcca cgcccatggc 421 taccaggaga aggtgtcatc tgtgcgggtg cagagtggca cgtgggttgg ctaccagtac 481 cccggctacc gtggactgca gtacctgctg gagaagggag actacaagga cagcagcgac 541 tttggggccc ctcaccccca ggtgcagtcc gtgcgccgta tccgcgacat gcagtggcac 601 caacgtggtg ccttccaccc ctccaactag tgccctcccc accatgcctc cttcccagga 661 cccaggtctg ctgcccagga accctccaga cctccaagtg aagaataaag tgtggctcgt 721 g // LOCUS HUMCRYGBC 22775 bp DNA PRI 01-NOV-1994 DEFINITION Human gamma-B-crystallin (gamma 1-2) and gamma-C-crystallin (gamma 2-1) genes, complete cds. ACCESSION M19364 M19354 M19365 NID g181098 KEYWORDS . SOURCE Human DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Den Dunnen,J.T., van Neck,J.W., Cremers,F.P.M., Lubsen,N.H. and Schoenmakers,J.G.G. JOURNAL Unpublished (1988) REFERENCE 2 (bases 1 to 22775) AUTHORS den Dunnen,J.T., van Neck,J.W., Cremers,F.P., Lubsen,N.H. and Schoenmakers,J.G. TITLE Nucleotide sequence of the rat gamma-crystallin gene region and comparison with an orthologous human region JOURNAL Gene 78 (2), 201-213 (1989) MEDLINE 89378747 REFERENCE 3 (sites) AUTHORS Hearne,C.M. and Todd,J.A. TITLE Tetranucleotide repeat polymorphism at the HPRT locus JOURNAL Nucleic Acids Res. 19 (19), 5450 (1991) MEDLINE 92020260 COMMENT [Gene 87, 225-232 (1990)] sites. [1] for [Gene 87, 225-232 (1990)]. Submitted in computer readable form by J.G.G. Schoenmakers 23-MAY-1988. FEATURES Location/Qualifiers source 1..22775 /organism="Homo sapiens" /db_xref="taxon:9606" /map="2q33-q35" prim_transcript 2116..5653 /gene="CRYG2" /note="CRY-gamma-B mRNA+introns; G00-119-077" gene 2116..5653 /gene="CRYG2" exon <2152..2160 /gene="CRYG2" /note="gamma-B-crystallin" /number=1 CDS join(2152..2160,2255..2497,5313..5588) /gene="CRYG2" /note="gamma-B-crystallin" /codon_start=1 /db_xref="PID:g181099" /translation="MGKITFYEDRAFQGRSYECTTDCPNLQPYFSRCNSIRVESGCWM IYERPNYQGHQYFLRRGEYPDYQQWMGLSDSIRSCCLIPPHSGAYRMKIYDRDELRGQ MSELTDDCLSVQDRFHLTEIHSLNVLEGSWILYEMPNYRGRQYLLRPGEYRRFLDWGA PNAKVGSLRRVMDLY" intron 2161..2254 /gene="CRYG2" /note="CRY-gamma-B intron A" exon 2255..2497 /gene="CRYG2" /number=2 intron 2498..5312 /gene="CRYG2" /note="CRY-gamma-B intron B" exon 5313..>5588 /gene="CRYG2" /note="gamma-B-crystallin" /number=3 prim_transcript 18374..20060 /gene="CRYG3" /note="CRY-gamma-C mRNA+introns; G00-119-078" gene 18374..20060 /gene="CRYG3" exon <18410..18418 /gene="CRYG3" /note="gamma-C-crystallin" /number=1 CDS join(18410..18418,18519..18761,19723..19995) /gene="CRYG3" /note="gamma-C-crystallin" /codon_start=1 /db_xref="PID:g181100" /translation="MGKITFYEDRAFQGRSYETTTDCPNLQPYFSRCNSIRVESGCWM LYERPNYQGQQYLLRRGEYPDYQQWMGLSDSIRSCCLIPQTVSHRLRLYEREDHKGLM MELSEDCPSIQDRFHLSEIRSLHVLEGCWVLYELPNYRGRQYLLRPQEYRRCQDWGAM DAKAGSLRRVVDLY" intron 18419..18518 /gene="CRYG3" /note="CRY-gamma-C intron A" exon 18519..18761 /gene="CRYG3" /number=2 intron 18762..19722 /gene="CRYG3" /note="CRY-gamma-C intron B" exon 19723..>19995 /gene="CRYG3" /note="gamma-C-crystallin" /number=3 BASE COUNT 6513 a 4984 c 5205 g 5948 t 125 others ORIGIN 1 ctgcagttat ctcagcttta ttttcttctt gtagtggacg ctctggtgct gcctggactt 61 ccactgtgga ctgaagcact ccttctccca gatagctgac ngcttaaagc tgagtttgtc 121 tccaagaatt ttcttctgct ggaggaagct gccttgccca aagttacact tctccatngc 181 agcagcccac atcaaattac ggggcaatgc agaggcacag attcccggtc cacttgccta 241 agttcctgac aactctagcc ctagccttnt gcgggatggg atgaagtcac cattattgca 301 ataggctctc tctcagctca cacttacaac tgcctgcagg ggtcccttct gaccagctga 361 tagagtgggg gaaggctgag cttgatacac taataaattg gcatggcatt tgggtacaag 421 cccaaaatag aatgctgctg cagaatagct cactcaggag tcaccctaaa gacagcagtg 481 agggaaatcc tcccaaaagg caaccttgag gtagtgcacc tggctgctca ctcggtgcag 541 aaagaaaagt agcccaaggt caagcatata tggaattatt gagcagtaat gtgtggcttg 601 gctggctggt ccagggcctg gaaggagaaa gattggaaga ttagggacaa gaaggtctgg 661 ggataggaat gtgcagggag atatgggatt gggcacaaag ggcgaagata gttgtatccc 721 atgtgaaggc ccagtgtagg gcatccacca ttgaagaggc actaaacaac caagtagaaa 781 aaatgatttg gtcatttgat tccagtcagc gtctgccatc agccacccta ctgctggcag 841 tgagctcact aagagagcgt tcatagtggt atggatgaag gctatgcatt aacttctact 901 tattaaggct aatccagcta tttccactgt agaatgtcca atctgccagc aatagggacc 961 aactctgagc ccttttggtc ccatgccttg agaagaccaa tcagccactt agtggtaaca 1021 tttgaggcag agcaggaaca ccaaacagca ggcaggactc tggacagcca ccagcagaag 1081 gagggattag gcccttagga agatctgaaa gggggaagaa agccacaact ctctgttttt 1141 gctcaggtca gagccaatta ttgaggacct gcaatgtgcc aaccctctgc tctagggctc 1201 agggatagac atgaacaaga aataataccc agcccccaga agcctacaaa ttccaggcaa 1261 gtatcagttt tattcaagac aagcccccaa acaccaacaa tgatgatcac agaatttttt 1321 tttttttttt ttttgcgaca gggtcttgct ctgttgccca ggctggagtg cagtggtgca 1381 atcatggctc actgcaacct cgacctcctg agcttaagcg atcctctcac ctcagcctcc 1441 caaatagctg ggactacagg tatgcgccac catgccgggc taatttttgt atttgttgta 1501 gagatggggt ctcaccatgt tacctaggct ggtctctaac tcctgagctc aagcaatcca 1561 cctgcctcag cctcccaaag tcctgggatt ataggtgtgc accacctcgc ctggctgatc 1621 acagaatttt gaatttgaac taagctttat gggttttaac acactttcac agattacttc 1681 acttgattcc ctcttcttta agattcctta ctatttaaat tctggctttt agaactaaaa 1741 cacacacaca cacacacaaa aacaccacca gtccctgagc agaattcatt acttagttta 1801 atgacttttg agaagtcaat taagcagagt aaactataaa ataaaagcac agagacagaa 1861 tttcgcccgt ccttttgctc aactaaaagt gtactctatt cattttgatg gcaacaacag 1921 ctttagtgaa ttagaacaac ccgaacctac cagaaagaat gagagtaaac cattcagcca 1981 acccagtgac ctgggcacgg agagcagcga gcttgcaaat ccccttactc accaaaatgg 2041 gcccttttgt gtgatttcct gtggaggcag cagtcagggc tgctatacat acagtgacgt 2101 tcctgcagtt cccacacagc aaccagaaaa catctgctca cttccttcaa aatgggaaag 2161 gtaagtcctg ggtaccggat gctcagcctt ggccctaatg cagtggcctc agtgggggca 2221 atcactccat gctcccacat cttccatttt tcagatcacc ttctacgagg acagggcctt 2281 ccagggccgc agctacgaat gcaccactga ctgccccaac ctacaaccct atttcagccg 2341 ctgcaactcc atcagggtgg agagcggctg ctggatgatc tatgagcgcc ccaactacca 2401 gggccaccag tacttcctgc ggcgtgggga gtaccccgac taccagcaat ggatgggcct 2461 cagcgactcc atccgctcct gctgcctcat ccccccggtg agtgtggctc tgtctttgcc 2521 ttccatcttt ttggaaataa aagctatttc atgatattct tttttttttt tttttttttt 2581 ttgaggcgga gtctcgctct gtccccaggg tgagtgcagt ggcatgatct cggctcactg 2641 caacttccgc tcccgggttc aagcaattcc cctgcctcag cctcctgagt agttgggatt 2701 acaggcatgc accaccatgc ccggccaatt tttgtattca tagtagagac gaggttgcac 2761 catgttgccc aggctgatct cgaactcctg gcctcaagtg atccgcccgc ctcggcctcc 2821 caaagtgctg ggattacagg tgtgaactac ggcatccggc ctatttaatc acattattct 2881 taaatcccag ctactcgggc ggctggggca ggagaattgc ttgcttgaag ccgggagatg 2941 gaggttgcag tgagccgaga tcgtgccact gcactccagc ctggcgacag agtgagactc 3001 catctcaaaa cgacaacaat aacaacaaca acaacaacaa caacaacaaa cttttgcttg 3061 ctttgtctta cttttctatt tgattattcc tcctgagaag ataggatcct gtcttctgat 3121 ttctattcta atttaatggt cagtacctat tagctcactt agaggtataa attagagaaa 3181 aggccaaatc tgaaactaaa gtttgagatt ccatattttc tgggcagtac tgcagaattc 3241 ttagaattgt ggtgtgtggg aggcatggga tacagtgtca taaaaacaga agttaatgaa 3301 ggtgatactt cctgctgagg gaaataaata agttgagagc aggaggctag aaaggaagct 3361 taaaagtcag ataggttgaa atttgaactg aaggctgggt gcggtggctc acgcctgtaa 3421 tcccagcact ttgggaggcc gaggtcagga ggtcaggagt tcaagaccac cctggccaat 3481 atggtgaaac cccctctcta ctaaaaatac aaaaattagc tgggcttggt ggcacgcgcc 3541 tgtagtctca gctacttggg aggctgaggc aggagaatcg cttgaacccg ggaaggtagg 3601 ttgcaatgag ctgagatcat gccactgcac tccagtctgg gtgacagagc aagattccgt 3661 ctcaaaaaaa aaaaaaaaag aaaagaaatt tgaactgaaa attagccaga tgtggtggtg 3721 catgcctgta gttccagcta ctcaggaggc tgaggtggga gggtcacttg agcccagaag 3781 gttgaggggg ctgtgaccca tgatcatgcc attgcactcc agcctaggtg acagagggag 3841 accctgtctc aagaaacaaa tacatatatg tgtgtgtgtg tgtgtgcgtg tgcatataat 3901 agtgaacccc aaatatctga gacaggtctc agtcaattta gaaagtttat tttgccaagg 3961 ctgggcacag tggctcatgc ctgtaatccc agcacttttg ggaggccaaa gcaggcagat 4021 cacctgaggt ccggagtttg agaccagcct gaccaacatg gagaaacctc tactaaaaat 4081 acaaaattag ctgggcgtgg tggcgcgtgc ctgtaatccc agctacttgg gaggctgagg 4141 caggagaatt gcttgaactt gggtggcaga ggttgtggtg agccgagatc acgccattgc 4201 actccagcct gggcaacaag agtgaaactc catctcaaaa aaaaaaaaaa aaaaacccag 4261 aaagaaggtt tattttgcca aggttgagga atgcacccat gacacagcct cagaaagtcc 4321 tgagacatgt gcccaaagtg gtcaggggta cagtttgctt ttatacattt tagggagaca 4381 tgagacatca atcaatacgt ataagttgta aattggttca gtctggtaag acaggaagta 4441 ggggcttcct ggttagacat ggataagaaa cagagctggg cacgtggctc acccctgtga 4501 tcccagcact ttgggaggct agaggtgggt ggatcgcctc aactcctgac agagtttaag 4561 accagcctgg ccaacatagt caaagcccgt ctctactaaa aatacaaaat attagctaat 4621 cgtggtggtg ggcgcctgta atcccgctac ccgggaggaa tcacttgaac ccagaaggtg 4681 gaggttgcag tgagctgaga ttgcgccatt aaaaaaacag tagataagag acaaaaggtt 4741 ctttgagccc ttgatcagct ttccactgaa tacacaattt agtctggctc gatgactctg 4801 catctttaca taaacaatag gggagaggaa gcaatcagag atgcatttgt ctcaggtgag 4861 cctcagaggg atgactttga atagaatggg agacaggttt gccctaagca gttcctggct 4921 tgacttttcc ctttagctta gtgattttga ggtcccaaga tttattttcc tttcactgta 4981 tatacataat atgacattta actttccaat gactctcagt tttctccttt gagaaaggct 5041 ttctcaaata acaggcacta aagtgatgtg atattttatt gtaagcaaat cttaattccc 5101 aaagccctag ctctgcccat gaatcattag actcagcggt gggtgactgg ggagtgtgac 5161 agccccttaa ctttgcaaca gggttgttct agaagcaaac ttggcctggg agaacttctg 5221 ggcaggtgtg gccagggagg tgtagggact ggagctttaa tttccatctg tttttgtttg 5281 tttactcttg cgttttctgt ctgccactcc agcactctgg cgcttacaga atgaagatct 5341 acgacagaga tgaattgagg ggacaaatgt cagagctcac agacgactgt ctctctgttc 5401 aggaccgctt ccacctcact gaaattcact ccctcaatgt gctggagggc agctggatcc 5461 tctatgagat gcccaactac agggggaggc agtatctgct gaggccgggg gagtacagga 5521 ggtttcttga ttggggggct ccaaatgcca aagttggctc tcttagacga gtcatggatt 5581 tgtactgaag tatttacgtt ttccactttt ctcctttaaa atctaataaa atatttagct 5641 tgtgtttctg gcactagtag agccctgtct ttctttcaat ctattaagca tttataagtg 5701 ataatggcac tcagccaaac ataataacat gttttcatga tgggaagcaa tcttttataa 5761 ggggaataat gcagagatat tatttccagc actcttgtaa tgactaaaac actgtaggct 5821 caaataaaag ctcagttcac ttggactcag gccaattgta gttagtctca ccaggctaga 5881 tttgcagcta cggtggtgaa ttggtacttt ttgaaacacc tccaagttat ccctccaggt 5941 tatccctatt gtcttcccta atagtccagg ccttagctga tgaggaaagt tgttctcagc 6001 ctctagactc ctacctcagc tccataaaca aacaaataaa tgaactggtc cactcttacc 6061 tattgcactc ctgtcctcct aacagtcacc atccctttcc tctcacttcc actgccaata 6121 tctcattcca ctatgagatt tccagtccaa atagaattca ctgtgttttc gctatatgtg 6181 ctttgtggaa taatttcagg acatttacaa aaaccacatg cgcacgggaa gctcagcctc 6241 atgtcatttc agtagagagt aaaatattat gattacacat aataatacta agtggccggg 6301 cgtggtggct cacacctgta atcccagccc tgtaaacctg cgtctccagg gctcaagcga 6361 tcctcccacc tcagcctccc aaatagctgg gattacaggc gcacaccacc atgctcagct 6421 aatttttttt atttctagta gagacagggt cttgccatgt tgcccaggct ggtctcaaac 6481 tcctgagctc aagcgatcct cttgtctggg cctcccaaag tgccagaatt acaggtgtga 6541 gccaccacac ccagtcataa actcggtcat ttgttttgat tttgggtttt tttttttttt 6601 tttttttgag atggagtctc actctgtcac ccaggcggga gttcagtggt gcaatctcag 6661 ctcactgcaa cctctgcctc ccagttcaag cagttctcct gcttcagcct cccaagtagc 6721 cagaactaca ggcgtgtgcc accacgcccg gctaatgttt gtatttttag tagagacaga 6781 gtttcaccat gttggccagg ctggtctcaa actcctgacc tcaggtgatc cacccacctc 6841 ggcctcccaa agtgctggga ttacaggagt gagccaccac gcctggccta cactgggtca 6901 tttttgagag tgaaagtata tactattaat aattacactg ggacaagaga catgaaccag 6961 gactgttcta ggcaaacttg gatgcaagtt taccctactg tgtgtgtgtg tgtgtgtgct 7021 aaatattact acaaaagtgt caaaaaggtt taaaatgtta aaaagtttat gttacaaagt 7081 tacagtatgc taatgttagt ttattattga agaaagcaag cattttaaaa aaatgtgtag 7141 cctaagttta gcctactgta tgtatatgta tattagccta ctgtatgtgt gtatatatag 7201 agaacacaaa tacatataaa gatttatttc ctgaaaaaca gtcagaatct aggattattt 7261 agtgttaact cttcctaaat cttgggtctt taacctttaa actcgcaaca atagtgatgg 7321 cttcctggag ctattaatca gagtcctctg tttgtcaaca ataatatgtc aaacataatt 7381 cagaaaacca aaagcttctt aacgtgctga atttttacac tatttcctag aagacataaa 7441 atcacatatt ttggctgggt gcagtggctc acgcctgtaa tcccaacatt ttggaaggcc 7501 aagacaggcg gatcacttga ggccaggagt ttgagaccag cctgaccaac atgccgaaac 7561 cccgtctcta ctaaatttaa atacaaaaat tagctgcacg tggtgacaca tgcctgtaat 7621 cccagctact acagaagctg aagcacgaga atcgcttgaa cctgggaggc ggaggttgca 7681 gtgaggtgag atcacgtcac tgcactccag cctgggcaac agagagagac tctatctcta 7741 aaaataaaaa taaaagtaaa tcaccgtctt taggcatgaa aattattcta aaacatctga 7801 actgaaaggg attagagtcc ccgaatctct ttgatcaagt cattattctt agaaattttt 7861 tgctcctatc cctgggatac aggcaaagaa aaagatgtcc tgtcctcctc ctaaatccaa 7921 cataaagaga aagtcaatac tcaggtggtc cgcctgcctc ggcctcccaa agtgctggga 7981 ttatagtgtg agccaccgtg cccagcttgc tatactatac tttttatcat cactttagag 8041 tattctcctt ctacttatac ttttttaaaa aaaaaaaagt taactgtaaa cangcttcag 8101 cagagccttc aggaggtact acagaaggaa gcatcatcat cataggtgac agttccatgt 8161 gtgtttttgc ccgtgaagat cttccagtgg tacaacatgt ggaagtggta gacagtgata 8221 ttgatgatcc tgaccccgtg taggcctaaa ctaatgtgtg tgtttgtgtc ttagtttttc 8281 acaaaatttt aaaaagtaaa agaataggcc aggcgtagtg gctcacgcct gtaatcccaa 8341 cacgttggga ggtcgaggca ggcagatcac ttgagcccag cagtgtaagg ttcttgtatc 8401 agttccaacc ccaagagcgc gtccacagac aacacgagga ggtgtggagc aataagctgt 8461 tttaaggagc gcctgggtgc gcctgagtgc aggaaggccg aggcttaaaa tggcgtcagc 8521 accaagtgag gacggggcaa aggttttaca gtctcctgta aacaggaagt gtcctagtct 8581 gacgtaactg ctacgttgta cccggatggc ctctttctcg atcttcgggg gtacgtgtct 8641 tccagccggc tctcttcctg cttctgctat cctgctggcg cacactgctg acacaagtga 8701 ccttgcgcct tgggactggg cctgggaagg gaggggttac tcatcccctt aagctttcag 8761 actctgggga gaatcataca aggagtttga gaccaccctg ggcaacatag caaaacgcca 8821 tctctattaa aaaaaaaaaa aagaaaaaag tttatatagt aaggataaaa agaaaatatt 8881 tttgtacagc tatacaatgt ttgtgtttta agctaccaca aaggtgtcaa aaaggcttaa 8941 aatgttaaaa agtttataaa gttacaaagt tacagtatgc taatgttagt ttattattga 9001 agaaagaaaa gcattttaaa aaaatgttgt gtaccctaag tgcacagtgt gtataaagtc 9061 tacagtggtg tacagtaatg tcctaggcct tcacattcat tcaccctcct cactgactca 9121 cccagagcaa cttccagtcc tgcaagctcc attcaatggt aagtgtccta tacaggtgta 9181 caatcttttt tttttttttt ttgagacagn gtatcactct gtcacccagg ctggagngca 9241 gtggtgtgat cgcggctaac cgcaagctct gcctcctggg ttcatgccat tctcctgcct 9301 cagcctcccg agtggctggg actacaggca cccaccacca cgcccagcta atttttttgt 9361 atttttagta gagacggggt ttcaccgtgt tagccagaat ggtctctatc tcctgacctc 9421 gtgatccgcc ttgccttggc ctcccaaagt gctgagatta cangcgtcag ccactgtgcc 9481 tggccaggtg tacagttttt taatcattta taccatattt cttgttggga acaggccccc 9541 ccaaatctgc cataaactgg ccccaaaact agccataaac aaaatctctg cagcactgtg 9601 acatgttcat gagggccata acgcccacgc tggaaggttg tgggtttact ggaatgagga 9661 caaggaacac ctggcccacc caggtcggaa aaccgcttaa aggcgttctt aaaccatgaa 9721 caatagcatg agcaatctgt gccttaaggg catgttcctg ctgcagataa ctagccagac 9781 ccaccccttc atttcggccc atcccttctt ttcccataag ggatactttt agttaatcga 9841 gtatctatag aaacaatgct aatgactggc ttgctgttaa taaatacatg ggtaatctct 9901 gtttggggct ctcagctctg aaggctgtga ggccctgatt tcccacttta cacctctata 9961 tttctgtgtg tgtgtcttta attcctctag tgccactggg ttaggctgtc cccagtcgag 10021 ctggtctcgc atttctactg taccttttct atgtttagat ttgtttagat acacaaatac 10081 tagtgtagta ggctgatcca tctaggtttg tgttagtaca ctctatgatg ttcacacaac 10141 aatgaaattt ctcaaaatgt atcccattga taagagatgc atgggggaag aggaggctag 10201 agaatgtagg catgggccgg gcgtagtggc ttactcctgt aatcccagca atttgggagc 10261 ccgaggcggg aggattgctt gagcccagga gctcgagacc agcctgggca acatagtgag 10321 acctcatctc tacaaaaaat aaacaaaatt agccgggcat ggtagcaggc acctgtggtc 10381 ccaaattctt gggaagctga agtgggagaa tcccctgagc cagagagata gaggctgcag 10441 tgatccaaga tcagccactg tgctccagcc tgggcaacag agcgagacct tatctcaaaa 10501 aaataaaaat aaaaagagct agagaatgga ggtgcaagtc agtagcggtg gtgcaggcga 10561 gcttgcagcc caagaccctg cctctgccct cagcaaggcc cctggcaggc aacgactgca 10621 agctgcatgg tgagggccca ggtcataagg aatgggcaca tctggatgtg agagtggggt 10681 tgaaaaatat aggccagtaa agctgaatga aaatgacagg aatgaataca ctgtgggcag 10741 gctggagatc tttgcaagag aagggaatat gcctcacatc atctttgctg gcccctggaa 10801 ctgacaaagt tataagcatc ctgggcccag cactgaagga tgccatgctg gaactcagtg 10861 cttcaaatgg caagggcatt gacgttgcga ggaataaaat caaaatgttt gcttgacaaa 10921 aagtcatcct tcccaaaggt caacataaga tcatcttcct ggatgaagca gacagcataa 10981 ctgatggagt ccagcaagcc ttcaggggaa ccattgaaat ctgctctaaa acgactcact 11041 gtgttcttgc ttttaatgct tcagataaga tcaccggaag gcccatgtga cagcagcata 11101 gatgaaggat cagagagagg ccaaattggg gtttggacat catgaagaag ctgttgtagt 11161 agtctaagga gaagaaaaag gagagtttta gaattctaca tttgtctgac tggtggttct 11221 tagctccagc tgcaggcagc acctccagtg gagctttaaa gaaactgtga gtgtcctgtt 11281 cctaccccca gccagaccga attgtcatct gcccagaagn nnccaacgcc cagcatacat 11341 tgtgattcca gagtcttcct gggcctgaat tgactcactg agggcttgtg gcacatggat 11401 gtctgccttg gaggatcatt tctccatctg aaaagccctg gccagccaag gccattgctg 11461 gggcagtgct ggctgaggcc tctgtctgat ttgcacttca actgggtgat ttggggaagg 11521 atcttaaaaa gctggagttc actctagatt agatgttagc agaaagcagg gttaattttg 11581 tgattagtta tctcaaggaa tcttacccat ggggagggga gactagaaca cggataaacc 11641 tgtaatccat aaagaagtag cagttaatca ttttggctga gagagagagg gcttggaatt 11701 ttgagagtgg agcattgact ttgtttttgt ctgcgcttag acaaagttaa gaagtggctt 11761 tattttatct catttcatcc tggtctcgga gtagccttgt ctgaagttgt tattctgtga 11821 gattgtttat gtctaatagg ggagtaacat gtcctcactg tgagtgccag gccagcatct 11881 gaatgtcaga gcctgctctt ttttctcact ttttttctca ttaatcagac tacagtccat 11941 agagataact catgctctgt gtggttatgt aatttgccat ggagctaatc agtgggcctt 12001 ctgagtgtag gcggcctgag tccaaagctc acactcttgc ccgtgaagct ttatagcctc 12061 cctcatgctt ggaaaaggaa aagcaattta taaagcaata ctttatactt tggttaatac 12121 tttgacattt agccttgttt ttactgcatt tctcttaagc acctgctgcc aaagaggagt 12181 cgcaccacca gcatggctgg gctttgtgga ttatgcagct gcatcctcgt aatcaaagca 12241 ctaatagggt gatgccagcc tctgagggaa acctcactgc cttcccgtct ctgttcagag 12301 gccactggat cccagtgcat agtcctccgc tacacaaagc tgaccgatgc ccggagcagt 12361 gtgaggccga tgaatgctag agagaagggt acagttctgt acactgactt tggcctagaa 12421 gccatcatcc tcatgggtca gggagacatg agacaggccc tgaacaactt gcagtccatc 12481 ttcccaggat ctggcttcag ttacagcgag aagtatgttc agggtctgtg acgagcccca 12541 ccccctgctc atgaaggaga tgatccagca ctgtgtgaat gcccatgtca aggaaaccta 12601 caagattcct gctcacctat ggcatctggg ctactcacca gaagatgtca ttggcaacat 12661 cttccaagtg tgtaaaactt tccaaatggc agaataactg ataaagtgga gtttgtnaag 12721 gaaattggat acactggtat gaaagcagag gaggagtgaa ctcccttctg cagacaacgg 12781 ggctcctggc caggctatgt cagaagacaa tggccccagt ggcatagagc agaggcttta 12841 ttgattgagt tacaagagcc ctaatccctg taatacagga ggtgcagcct tctgaagtgg 12901 agggggaagc gtgggtgggg aatgccacct taagctggtg ccagcataca ccatacttta 12961 aaccctcgtg gttttcacgt tgcttctagc tgatctctgc tccatgagtg tttgcattca 13021 acctcagact cactgacgag tgatggagca gggcagaaag gctcagagaa gctcagggca 13081 ggcacctgat ctgtgtgtga gttgacattt agctcataaa gccttgcagt gtttgttgta 13141 aggtgaccac atgaggcctc aaggaaaacc agcttcctgt ctctgccctt gcttgttcct 13201 cccctttcta cttgtggccc ctagcagcct gcaagtagga agatgactag agagtagatt 13261 ggacaagcta atctcaattc ttctagaagc atgaagggac cagttccctg gggtgagggc 13321 agagtttgct gtaatttatt tagatagact tctcaaaacc aggcaataag ccttcaggaa 13381 gggcttgctg agggggtcga ctggcaaagc accacacaaa aaccccagca tgatgccttt 13441 cccatgtcct cgagggaact cttggccttc gctttagaat tccccagtga attttatatt 13501 aatttgtaga attgcctttt tatttgcaag ggtactattt tttcccattt tttaaaatta 13561 aaattgcaat catataaaaa agagatgcat gactgcatag gataccaccg tatattaatt 13621 gctaagaagc aggaaggtgg gtggtaaaat caactgctgg ctggacctca gacaccagca 13681 aaactctaga attggctacc gctgttgtaa cattaatttc tcatctcaga agaaataaaa 13741 attagaatta ctaaggagtg ggaggactta gggcacaaca cagcattcct ggtttttccc 13801 ttttgcttat tatctccact gttataaagg cccattatgg ctgggtgtgg tggctcacac 13861 ctgtaattcc agcatttggg aggccgaggt gggcagatcc cctgaggtca ggagttcaag 13921 accagcctgg ccaacatggt gaaaccctga ctctactaaa ggtacaaaaa ttagctgggc 13981 gtggtggcag gcgcctgtaa tcccagctac tcgggaggct gaggcaggag aatctcttga 14041 acccaggagg cagaggttgc agtgagccaa gatcgcacca ctgcactcca gcctgggtga 14101 caagagcaag atttcatctc aaaaaaaaaa aaagaaggcc tggcacagtg gctcagtctt 14161 gtaatcccag cactttggga ggccgaggcg ggcggatcac aaggtcagga gatcgagacc 14221 atggtgaaac ctcgtctcta ctaaaaatac aaaaaattag ccgggcgtgg tggcgggcgc 14281 ctgtagtccc agctactcgg agaggctgag gcaggagaat ggcgtgaacc ctggaggcgg 14341 agcttgcagt gaactgagat tgcgccactg cactccagcc tgggcgacac agcgagactc 14401 cgcctcaaaa aaaaaaaaaa agccccatta aatcgatctt gtgcagggtg cctgtgtcat 14461 cctacacacc gaaagttgca catcctgaac ttcaatgtga gtgtgttatg cctgtgtata 14521 ttgaacattt gaaacaggct ccatagacca ttataataat aaatttcaga caaggaataa 14581 caatcacaca tccccccccg ggcatttatt gagtaacctt tatgtgccag tcctctaagc 14641 caattagtga gaacacaaaa ataaagatgg aatagactct gcctttatgg ggctctccca 14701 tctattttga gggactgatg tctaaacaaa ataattagat aagtgatagt tacccttata 14761 gtggtatgaa catgaggagc cacggaggca gtatttaatt cagcccagaa gattaggaag 14821 ggttcaggaa aatcttttac agacaaactg acgcttgagc tgagacttga agaaagggta 14881 aaatttctct ggttgaaaaa taaactggtg tctcattgag aaggatcaat atgttgaaaa 14941 acaagaacaa catgaaataa cattgtgcat ttgggaaatg acaagcggtt cggcgtgatc 15001 aatcagaaag cgtggggaca gacagcggag gccaggtcta ggatgccgcg ctaagggatc 15061 aatattggat cctgagacaa tggagtccag taaattttca agcaaatgaa cagcatgatt 15121 tgatgtgttt tagttctcat cactccagca tctgtctgag tgcagcttgg aggaagacaa 15181 cactgggcag agtcaggagg ctgttgaaag tccagctgag atatgataag accctatgta 15241 aggcagctac aatggcaggg gtttgggggt gggaggtggt tggggnagag gagagaagga 15301 gaaagatttg aaaaaaggtt agagaaatac aagactacgc cattaggaac aagagcgatg 15361 ttctgtaata tgcactttct ggccccttta ttgcttttcc tgtcttctta ccagtgctct 15421 aatttaagtg ctggtatctc tgctccgatt ctctcactgt gtacatatga aactttatag 15481 aacaagaaag agtgctggcc aggcgcggtg actcatgcct gtaatcctag cactttggga 15541 ggccgaggtg ggcagatcac gaggtcagga gatcgagacc atcctggcta acacggtgaa 15601 accctgtccc tactaaaaat acaaaaaaaa aaaaaaaatt aaccaggcgt ggtggtgggc 15661 gcctgcagtc ccagctactc aggaggctga ggcaggagaa ctgcttgaac ccaggaggcg 15721 gaggttgcag tgagctgaga tcacgccatg gcactccagc ctgggcgaca gagcgagact 15781 ccgtctcaaa aaaaaaaaaa aagagtgcta caaaacagag agcacatctg cctttaagtc 15841 taacgagaat gttaggcagg gcacagtgac tcacgcctgt catcccagca ctttgggagg 15901 cagaggaaaa tggattgctt gaggagatca agaccagact gggcaacata gtgagccctg 15961 tctctacata aagaaaatac aaaaaattag ctgggatgtg gatgacctgc agtcccagct 16021 actcaagaga ctgaggtggg agcttgagct tgagcgtggg aggcagaaac tgcagtgagc 16081 tgagattgtg tccctgcact ctagcctgga caacagagca gaactctatc tcaaaaaata 16141 aataaataga attaagagaa tgttagtaca attcctaata aggcattgcc tagggatggc 16201 taaaatacat gatttccttg tcaaaacatg aagggataaa ccacagcagt gttttggccc 16261 aagaactatt tggtgcttat taaaaatgta ggttctgggc tgggcgtggt ggctcatgcc 16321 tgtaatccca gcactttggg agaccgaggt gagtggatca cctgaggtca gaaattcgag 16381 accatcatgg ccaacatagc aaaaccctgt ctccaataaa aatacaaaaa ttagccgggc 16441 atggtgaccc gggcctataa tctcagctac tcaggaagct gagacagaag aatcacttga 16501 acccgggagg cagaggttgc agtgagctga gatcgcacca ctgcactcca gcctgggtaa 16561 cagagcaagg ctccatctca aaaaataaaa taaaataaat aaaataaaaa ataaaaatgt 16621 aggttctgag gtcttctcct gatctaatga tgcaggattt agccaaaaga tcaagatggt 16681 aatttgcatt tttaataagc aaggtgattc ttaggcacac tggagtttga aaaccacccg 16741 atggtttcga aagtcccttc caagcctaat ttnnctttaa tcacaaaatg gcttcataga 16801 ggataaattt tttcaggaat ctttgagagg attaaatatc tttgagaggg ataaacagga 16861 tgaagcaagc cttaacaatg tcactctatt actaataaca acatttattc agtgttcatt 16921 tgtgccaagt atgtgccaaa ttcgcttata cacataatcc cacttattca tgtcaatgtc 16981 cttnagctaa agaccaccag ggacacacct gtctttaaac aagttgagct tattacttgt 17041 tgcagcaagg gaggacatgc tccttgggga agcatggact gtctcagtaa cagagcttag 17101 aaagaaattg ttataggatt tgcacttcag ttgtgcgatt tggggaagga gctaaaaaag 17161 ttggagttca ctccagatta gatgttagca gaaagcaggg ttcaattttg tgatttagtt 17221 atctcaagaa atcttanccg tagggagggg gagactagaa cacggataac ctgtaatcca 17281 taaagaagtg gcagttaatc attttggctg agagagtgag ggcttgggat tttgagagtg 17341 gagcattgac tttgtttttg tctgcgctta ggcaaagtta agaagtggat ttattttatc 17401 tcatttcatc ctggtctcgg agtagccttg tctgaagttg ttattctgtg agactgttta 17461 tgtctaatag cggagtaaca tgtcctcact gcggagtgcc aggccagcat ctgaatgtca 17521 gaggctactc ttttttctca cttttgttct cattaatcag actatagtcc atgagatact 17581 cctgctctga gtggttatat aatttgccat ggagctaatc agtggccttc tgagtacagg 17641 cggcctgagt ccaaagctca cactcttgcc cgtgaagctt tatggcctcc ctcacgcttg 17701 aaaaagggaa agcaatctat aaagtagtat acggtacttc tcagtcttgg gcccagtgac 17761 catagaaacc tttcccttgc ttgttatcgt aagttcacta ccagggactg atctctagtt 17821 agacgacgtg cccacgaaag gcatagttaa aagctgtctt aaagcattgg gttaacgaag 17881 tcaaaactag ccaagataag aatctagatt tgtgtccggc cctggactgg aaaattgtgt 17941 aaagacaaaa atgtaaggta gagcagtctc tgccatccag ggaaatgagt cagtgctttt 18001 ggcattctac ctcagcaacc tctctggaat cctggagcaa ttctataatg gcatagatct 18061 ttcccagaaa attcacgggg aataattcaa tcatatagac agagccacga atttcaggtc 18121 aaaagttcag actgactgca cagataaatt tagaaaacac taacaatcca aataaaagca 18181 acacagagca gtatgtacag gcagcgttag aatataccag agaacaagaa cacaatctac 18241 aatcatttcc agtgaatgca ggatgttaaa gagatgcata aaatcccctt accgctgagg 18301 gccccttttg tgttgttctt gccaacgcag cagccatcct gctatataga ctggctgtgc 18361 agccgcaggc cccatcacac tgaactcgca tcatccgtgt caaccagcca tggggaaggt 18421 gagcagaaca caaattaaat aaaatgaaaa aaaagttgcc tttatataat gcctgttagg 18481 agcaaataat gtaattcgga acgattcttc ctttgcagat caccttctat gaggacaggg 18541 ccttccaggg ccgcagctac gaaaccacca ctgactgccc caacctgcag ccgtatttca 18601 gccgctgcaa ctccatccgg gtggagagcg gctgctggat gctctatgag cgtcccaact 18661 accaaggtca acaatacttg ctgcggcgag gggagtaccc cgactaccag caatggatgg 18721 gcctcagcga ctccatccgc tcctgttgtc tcatccccca agtgagtttt ctagatttcc 18781 atcatgccgc cagagtcccc actgtattgt caatgtgggt tacagggagg gaggtttgca 18841 tttaaaaaca cctaagggtt agatggacat cagagacctg aataagccag atagataaca 18901 tgcacaaaaa cagcagagaa tgtaaggaag aacccactta gaggaagaat ccgtttttac 18961 ttgaaatttt tgcatccaat aagtcaatta aaagaggtct gtaacaaatg taatatcata 19021 atgggatata aaacaaaact tgcaaatggg aagctggttg ggcataaaaa gacggtattt 19081 ttaacaaatt attttgtcta tagtaaaaga atttctaatg aatgaattag atagattcca 19141 ccatggacta gtaaacctca aagctgtatt gttaatatag gctattcaaa ggttgtgtcc 19201 aactttggaa aaggtaattt ttcttaattt ggaaaatcag ttaaattctc attccaaatg 19261 actatttaac tttacactga agtacaagat tttttaaatc ttactttagt ggtaatttat 19321 atgtttgtaa tttgggagat aataaccctt actaaagaaa tactagtgaa ataaattatt 19381 tctatttttc cccccaaatg tagtttgaat taacacaggt aattaggatc agatatctca 19441 gttcttgatg gattctaact cagagtttga acctcttaaa aattccatct cctaaataag 19501 aatgaatttc aggtcaaata gaaaatgagc gtgccagtat ttaggtgcta gtggaagaca 19561 gatccatgcg cagcaaccac agtaatctac attttacact gtctaaattt tacaatgaca 19621 attccatgcc acaacctacc aagttcatct gttctttggt tggacaaatt ctggaagaga 19681 ctcatttgct tttttccatc cttctttctg tggaccgagt agacagtctc ccacaggctg 19741 cggctgtacg agagggaaga ccacaaaggc ctcatgatgg agctgagtga agactgcccc 19801 agcatccagg accgcttcca cctcagcgag atccgttccc tccacgtgct ggagggctgc 19861 tgggtcctct acgagctgcc caactaccgg gggcggcaat acctgctgag gccccaagag 19921 tacaggcggt gccaggactg gggggccatg gatgctaagg caggctcttt gcggagagtg 19981 gtggatttgt attaaaatag cttaacacta ccaatttccc attttggaac ctaataaata 20041 tttagtctgc attgctggca attgctgact tctgtcattc tttcattgtg caaatcagtt 20101 cacctttaaa tgctccttgt caaaagttaa gaagtgaatg gggtggggga cggggtctat 20161 tggagaagag cttcaaggta gagtgagttt gaacaagcct cagacgttgg ggtggaggca 20221 gggtgtagcc tgnccgagaa agaaaacgca gggcaatgcc agagacctgg gagaggctga 20281 ctggtgagag gggaggcaat tcaactangc accttgagga caatgattgg gaagtttcct 20341 tctaaccaaa atgggttagg acnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 20401 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 20461 ggccaccgca gccattgccg tacctttgaa caaagttaaa gactattcat cttcttttga 20521 aaatggctcc tgtccagaag tgtagctctt ggtcagtact acccctggcc caaatgctgt 20581 gtcaattata gaaactgcac atcagtttca tggtgaccag ctattgcaaa taaattaata 20641 ttaagagata cacatcaatc ttttaaatag attttcatgg tatagtgtaa cccaaacata 20701 aaaaataact acaaaaaaca tcgttctggc ataaggacgc atgtatgcat atgtttatca 20761 cagcactatt catgacagca aagacatgga atcaacctaa atgcccatca atggtaggct 20821 ggttttacaa atgtggtaca tatatacaat ggaatactat gcagccataa aaaagaatga 20881 gaccctgtcc tttgcagcaa catagatgga gctggaggcc attatcctaa gcaaactaac 20941 acaggaacag aaaacaaaat acctcatatt ctcgcttcta agtgggacct aaactttgag 21001 tacatgtgga caagaagaaa acaagagaca ctggggcctg cttgaggttg cagggtgggc 21061 ggttggagag gatcaaaaaa ctacctgtca ggtactatac ttatcacctg ggtgataaaa 21121 taatctgtac accaaacctt cacaacatgg caatttacct gtattacaaa cctggatacg 21181 tgcccctaaa tctaaaataa aagtttaaat aaacaaatac gtaagtcctg taattacaaa 21241 atgggctcaa atttaaaggt gctttaaaga agccataatt tctggagcct tttattaaac 21301 taaaatctgc ttgctcataa gcaaagccct aaaaagtaaa aataaaaaaa aaaaaaagaa 21361 agcaattacg agtcttttca tcagcagaga atttatttat tttaacttaa aaatctccct 21421 tcaaagtaaa aggcctccca taatcttact actgtacaaa ttctataaag aagttgaggg 21481 gagtccttct gacgcccctc attgctaact cttgtttata gttggtgggc acccttctgg 21541 gattttctcc ttgtttatat agctctctta taaacacatg cttcttttct ttacaaaaat 21601 ggaatcatgc tatacatatt agtctgcaac ttgcttttaa aatgtattat aatatggaca 21661 tatatctagg tcagtatctt attctgtttg atggttgaat tgcatcccat tgtttggagg 21721 tcccttagac catttcacca tttgtccact taagtgactg ggcttttcca atgttgcttt 21781 taccaacaat gctgcaaaga ataacattgt acatatacgc taatcctgac atgccataat 21841 tttaaaaaga tagcgaccta gaagtgagac taccggctca tagtgcaagt ttattttatt 21901 ttatttattt attttttgag acatggtcac actctgttgt ccaggatgga tgcagtggca 21961 caaacccagc tcactgcagc ctcgacctca caggctcaag cgatcctccc atctcagcct 22021 cctaagtagc tggggctaca ggcgcgcacc accatgcctg actaattttt tcactttttt 22081 gtagagaagg ggctcactat gttgccaggg ctggtctcga actcctgggc tcaagtgacc 22141 ctcccacctt ggcctccgaa agtgctggga ttacaggtgt gagccaccac acccagccac 22201 aagtttattt tccatttcaa tagcctccaa ttattagcca ctagagactc ttaacttcct 22261 tgagttagga tccttaccaa tttttgagca cctaccatac gcctgctata tgtagcaatg 22321 gggatgaaga aaacaatcaa aacaaaacaa aagaaaaagg tatttcccta ccaccgtaga 22381 gctctatata gaaagaatac aaaaaaaaaa aagcagccag aaaataagaa ttaaaataat 22441 ttgtattgca catagttggt gtttaattca taaataccac ttacaaagat ttatgactca 22501 aagacaagac aaagtacttc catgggttga cacagggctg taagcaattc tttaaacatg 22561 aaagaagctg gacacatagg tcatattcca ctccagtact cttcccagtc catcaaactg 22621 ctctcttcca ggccaataag tactttctga agtcttcctc gagtccaact tctagannnn 22681 nnnnnagctc ccctttgtgt tcatgcatta atttcgtctg agtcaaaggc cttgtgtact 22741 ctcgctcgtt caggggtaaa ggtgtattct agatc // LOCUS HUMCS1PA 1828 bp mRNA PRI 23-MAY-1996 DEFINITION Human cleavage signal 1 protein mRNA, complete cds. ACCESSION M61199 NID g181122 KEYWORDS cleavage signal 1 protein; sperm surface antigen. SOURCE Homo sapiens adult testis cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1828) AUTHORS Javed,A.A. and Naz,R.K. TITLE Human cleavage signal-1 protein; cDNA cloning, transcription and immunological analysis JOURNAL Gene 112 (2), 205-211 (1992) MEDLINE 92210000 FEATURES Location/Qualifiers source 1..1828 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="testis" 5'UTR 1..97 /note="cleavage signal 1 protein" CDS 98..847 /codon_start=1 /product="cleavage signal 1 protein" /db_xref="PID:g181123" /translation="MELQDLELQLEERLLGLEEQLRAVRMPSPFRSSALMGMCGSRST DNLSCPSPLNVMEPVTELMQEQSYLKSELGLGLGEMGFEIPPGESSESVFSKQRSESS SICSGPSHANRRTGVPSTASVGKSKTPLVARKKVFRASVALTPTAPSRTGSVQTPPDL ESSEEVDAAEGAPEVVGPKSESEVEEGHGKLPSMPAAEEMHKNVEQDELQQVIREIKE SIVGEIRREIVSGLLAAVSSSKASNSKQDYH" 3'UTR 848..1828 /note="cleavage signal 1 protein" polyA_signal 1803..1808 BASE COUNT 544 a 328 c 377 g 579 t ORIGIN 1 ggggctgacg cagcattgcc aattctaaat ccatcatttg actgaggagg agaggtttga 61 agttgatcag ctccagggtt tgagaaattc agtccgaatg gaacttcagg acctggaact 121 gcagctggag gagcgcctgc tgggcctgga ggagcagctt cgtgctgtgc gcatgccttc 181 acccttccgc tcctccgcac tcatgggaat gtgtggcagt agaagcactg ataacttgtc 241 atgcccttct ccattgaatg taatggaacc agtcactgaa ctgatgcagg agcagtcata 301 cctgaagtct gaattgggcc tgggacttgg agaaatggga tttgaaattc ctcctggaga 361 aagctcagaa tctgtttttt ccaagcaacg atcagaatca tcttctatat gttctggtcc 421 ctctcatgct aacagaagaa ctggagtacc ttctactgcc tcagtgggca aatccaaaac 481 cccattagtg gcaaggaaga aagtgttccg agcatcggtg gctctaacgc caacagctcc 541 ttctagaaca ggctctgtgc agacacctcc agatttggaa agttctgagg aagttgatgc 601 agctgaagga gccccagaag ttgtaggacc taaatctgaa tctgaagtgg aagaagggca 661 tggaaaactc ccatcaatgc cagctgctga ggaaatgcat aaaaatgtgg agcaagatga 721 gttgcagcaa gtcatacggg agattaaaga gtctattgtt ggggaaatca gacgggaaat 781 tgtaagtgga cttttggcag cagtatcttc aagtaaagcg tctaattcta agcaagatta 841 tcattaaaca gaaattatag gttggcatgg atcctattag ctgtgtaata ctggaattat 901 caatgatatg cactggtgga ggtgttattt gtgctttaga agatacttgc tgttgagctg 961 ggctactgta tacagtgtac aatgtgtatt tcttcaacca tatattttaa aaagacgtac 1021 atagaaactt aggcactttg ctatttcttt tctaaactat caaaaactct agcagtttga 1081 aaagcctaat atttatttgt atgtcaatat ttttcatttg attccctatt agaattaatt 1141 ttaaaacttg aagacttcca gacttatcca acttataaat aacatatttc ttcagactaa 1201 catcttaaaa cactgacctc tatgaggtat ttactgtgca ataactgatt catttttttc 1261 agagcttgaa gcatccaatg atttttccct ccactgctgt taattaatgt cacttccaag 1321 aagaaaaact gttctgttgt aaaaaatata attgctctta attcttgggg aggttactaa 1381 tagcagtagg atagaatttt atgaggttac ctacaactac ttaatgtact tacactgtaa 1441 gccttgttgc tttacccaag acaaatgtaa ttttatcatt gcttatgtag tatttttctt 1501 ttggaaatgt gccttatgtt aaacactatg tacttttact ttttgcattg tccagacttc 1561 tttattagat ggagatgttt ctttttctgt cttctagact aaatagagta tcatccaaat 1621 aatggggcct atgacttgaa tgaatagaaa tgaataagct ggtgtttgtt ttttcaaaat 1681 ggaagtaatt tagatttgtt ctcctcatac ataaaatgat tttagttcag ttttaaccag 1741 tgaaaacttt gtttttatga aaaaaaagga aaatggtttc ccatttggtt ttatatgtgt 1801 taaataaatg tgtaaagtaa ccaccccc // LOCUS HUMCSF 1978 bp mRNA PRI 11-MAR-1992 DEFINITION Human cleavage stimulation factor, complete cds. ACCESSION M85085 NID g181138 KEYWORDS cleavage stimulation factor; polyadenylation factor. SOURCE Homo sapiens (library: Lambda ZAP II) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1978) AUTHORS Takagaki,Y., MacDonald,C.C., Shenk,T. and Manley,J.L. TITLE The human 64-kDa polyadenylylation factor contains a ribonucleoprotein-type RNA binding domain and unusual auxiliary motifs JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89, 1403-1407 (1992) MEDLINE 92159058 FEATURES Location/Qualifiers source 1..1978 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /tissue_lib="Lambda ZAP II" CDS 23..1756 /note="64kDa" /codon_start=1 /product="cleavage stimulation factor" /db_xref="PID:g181139" /translation="MAGLTVRDPAVDRSLRSVFVGNIPYEATEEQLKDIFSEVGPVVS FRLVYDRETGKPKGYGFCEYQDQETALSAMRNLNGREFSGRALRVDNAASEKNKEELK SLGTGAPVIESPYGETISPEDAPESISKAVASLPPEQMFELMKQMKLCVQNSPQEARN MLLQNPQLAYALLQAQVVMRIVDPEIALKILHRQTNIPTLIAGNPQPVHGAGPGSGSN VSMNQQNPQAPQAQSLGGMHVNGAPPLMQASMQGGVPAPGQMPAAVTGPGPGSLAPGG GMQAQVGMPGSGPVSMERGQVPMQDPRAAMQRGSLPANVPTPRGLLGDAPNDPRGGTL LSVTGEVEPRGYLGPPHQGPPMHHVPGHESRGPPPHELRGGPLPEPRPLMAEPRGPML DQRGPPLDGRGGRDPRGIDARGMEARAMEARGLDARGLEARAMEARAMEARAMEARAM EARAMEVRGMEARGMDTRGPVPGPRGPIPSGMQGPSPINMGAVVPQGSRQVPVMQGTG MQGASIQGGSQPGGFSPGQNQVTPQDHEKAALIMQVLQLTADQIAMLPPEQRQSILIL KEQIQKSTGAP" repeat_region 1250..1429 polyA_signal 1947..1952 polyA_site 1978 BASE COUNT 527 a 476 c 556 g 419 t ORIGIN 1 cggaagccga ctcaacagag ctatggcggg tttgactgtg agagacccag cggtggatcg 61 ttctctacgt tctgtgttcg tggggaacat tccttatgaa gctactgaag agcagttgaa 121 ggacatcttt tctgaggttg gacctgttgt tagtttcaga ttggtatacg atagagagac 181 aggaaagcca aagggttatg gcttctgtga ataccaagac caagagacag cacttagtgc 241 catgcggaac ctgaatgggc gcgaattcag tgggagagca cttcgagtgg acaatgctgc 301 cagtgaaaag aacaaagaag agctgaagag ccttggcact ggtgcccctg tcattgagtc 361 accttatgga gagaccatca gtcctgagga tgcccctgag tccattagca aagcagttgc 421 cagccttcca ccagagcaga tgtttgagct gatgaaacaa atgaagctct gtgtccagaa 481 tagtccccag gaggcacgga acatgttact tcagaaccct caactggctt atgctttgct 541 gcaagcacag gtagtgatga gaattgtgga tccggaaatt gccctgaaaa ttctgcatcg 601 ccagacaaat atcccaacgc tgattgcagg caaccctcag ccagtccatg gtgctgggcc 661 tggctcagga tccaatgtgt caatgaacca gcagaatcct caggcccctc aggcccagtc 721 tttgggtgga atgcatgtca atggcgcacc tcctctgatg caagcttcta tgcagggtgg 781 agttccagca ccagggcaaa tgccagctgc tgtcacagga cctggccctg gttccttagc 841 tcctggagga ggaatgcagg ctcaggttgg aatgccagga agtggaccag tgtccatgga 901 acgggggcaa gtgccgatgc aagaccccag agcagctatg cagcggggat ccttgcctgc 961 gaatgtccca acccctcgag gcttgttagg agatgctccg aatgatccac ggggaggcac 1021 tttactttct gtaactggag aggtagagcc tagaggttac ttgggaccac ctcatcaggg 1081 tccacccatg caccatgtcc ctggccatga gagccgagga ccacccccac atgaactgag 1141 gggagggcca ttacccgagc ccagacctct aatggcagaa ccaagaggac ccatgctaga 1201 tcagaggggt ccacccttgg atggcagagg tggaagggat ccccgaggaa tagatgcacg 1261 agggatggag gcccgagcca tggaggcaag agggttagat gccagaggat tagaggcccg 1321 tgcaatggag gcccgtgcga tggaagctcg tgcaatggag gcccgagcga tggaggcccg 1381 tgcaatggaa gtccgaggga tggaggccag aggcatggat accagaggcc cagtgcctgg 1441 ccccagagga cctataccta gtggaatgca gggtcccagt ccaattaaca tgggggcggt 1501 tgtcccccag ggatccagac aggtcccagt catgcaggga acaggaatgc aaggagcaag 1561 tatacagggt ggaagccagc ctggcggctt tagtcccggg cagaaccaag tcactccaca 1621 ggatcatgag aaggctgctt tgattatgca ggttctacaa ctgactgcag accagattgc 1681 catgttgcct cctgagcaaa ggcagagtat cctgatttta aaggaacaaa tacagaaatc 1741 cactggagca ccttgatagg ttttcaaaaa tacctggcaa gaaatctgga aattctataa 1801 ttttgttgaa atattgaaaa aagatgacct gcatcctaac ccttgaatga ctcaaatcag 1861 tgccaggtgg aggactccca tcaccttctc tcagaacaaa atcacttcat tttattgtct 1921 tagtttgtat attttctgtg acttgaaata aactttgaac acaattttag tacactgc // LOCUS HUMCSFM 910 bp mRNA PRI 15-JUN-1988 DEFINITION Human multilineage-colony-stimulating factor mRNA, complete cds. ACCESSION M17115 NID g181151 KEYWORDS colony stimulating factor; hemopoietic growth factor. SOURCE Human peripheral blood lymphocyte, cDNA to mRNA, clone pLB4/lambda-D11. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 910) AUTHORS Dorssers,L., Burger,H., Bot,F., Delwel,R., Geurts van Kessel,A.H.M., Loewenberg,B. and Wagemaker,G. TITLE Characterization of a human multilineage-colony-stimulating factor cDNA clone identified by a conserved noncoding sequence in mouse interleukin-3 JOURNAL Gene 55, 115-124 (1987) MEDLINE 87305582 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by L.Dorssers, 29-SEP-1987. FEATURES Location/Qualifiers source 1..910 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 39..95 /note="multilineage-colony stimulating factor signal peptide" CDS 39..497 /note="multilineage-colony stimulating factor precursor" /codon_start=1 /db_xref="PID:g181152" /translation="MSRLPVLLLLQLLVRPGLQAPMTQTTPLKTSWVNCSNMIDEIIT HLKQPPLPLLDFNNLNGEDQDILMENNLRRPNLEAFNRAVKSLQNASAIESILKNLLP CLPLATAAPTRHPIHIKDGDWNEFRRKLTFYLKTLENAQAQQTTLSLAIF" mat_peptide 96..494 /note="multilineage-colony stimulating factor" BASE COUNT 241 a 238 c 193 g 238 t ORIGIN 411 bp upstream of EcoRI site; chromosome 1. 1 gaccagaaca agacagagtg cctcctgccg atccaaacat gagccgcctg cccgtcctgc 61 tcctgctcca actcctggtc cgccccggac tccaagctcc catgacccag acaacgccct 121 tgaagacaag ctgggttaac tgctctaaca tgatcgatga aattataaca cacttaaagc 181 agccaccttt gcctttgctg gacttcaaca acctcaatgg ggaagaccaa gacattctga 241 tggaaaataa ccttcgaagg ccaaacctgg aggcattcaa cagggctgtc aagagtttac 301 agaacgcatc agcaattgag agcattctta aaaatctcct gccatgtctg cccctggcca 361 cggccgcacc cacgcgacat ccaatccata tcaaggacgg tgactggaat gaattccgga 421 ggaaactgac gttctatctg aaaacccttg agaatgcgca ggctcaacag acgactttga 481 gcctcgcgat cttttgagtc caacgtccag ctcgttctct gggccttctc accacagagc 541 ctcgggacat caaaaacagc agaacttctg aaacctctgg gtcatctctc acacattcca 601 ggaccagaag catttcacct tttcctgcgg catcagatga attgttaatt atctaatttc 661 tgaaatgtgc agctcccatt tggccttgtg cggttgtgtt ctcattttta tcccattgag 721 actatttatt tatgtatgta tgtatttatt tatttattgc ctggagtgtg aactgtattt 781 attttagcag aggagccatg tcctgctgct tctgcaaaaa actcagagtg gggtggggag 841 catgttcatt tgtacctcga gttttaaact ggttcctagg gatgtgtgag aataaactag 901 actctgaaca // LOCUS HUMCSNK1E 1331 bp mRNA PRI 29-JUN-1995 DEFINITION Homo sapiens casein kinase I epsilon mRNA, complete cds. ACCESSION L37043 NID g852056 KEYWORDS casein kinase; casein kinase I epsilon. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Fish,K.J., Cegielska,A., Getman,M.E., Landes,G.M. and Virshup,D.M. TITLE Isolation and characterization of human casein kinase I epsilon (CKI), a novel member of the CKI gene family JOURNAL J. Biol. Chem. 270 (25), 14875-14883 (1995) MEDLINE 95318039 FEATURES Location/Qualifiers source 1..1331 /organism="Homo sapiens" /db_xref="taxon:9606" 5'UTR 1..22 mRNA 1..1331 CDS 23..1273 /codon_start=1 /product="casein kinase I-epsilon" /db_xref="PID:g852057" /translation="MELRVGNKYRLGRKIGSGSFGDIYLGANIASGEEVAIKLECVKT KHPQLHIESKFYKMMQGGVGIPSIKWCGAEGDYNVMVMELLGPSLEDLFNFCSRKFSL KTVLLLADQMISRIEYIHSKNFIHRDVKPDNFLMGLGKKGNLVYIIDFGLAKKYRDAR THQHIPYRENKNLTGTARYASINTHLGIEQSRRDDLESLGYVLMYFNLGSLPWQGLKA ATKRQKYERISEKKMSTPIEVLCKGYPSEFSTYLNFCRSLRFDDKPDYSYLRQLFRNL FHRQGFSYDYVFDWNMLKFGAARNPEDVDRERREHEREERMGQLRGSATRALPPGPPT GATANRLRSAAEPVASTPASRIQPAGNTSPRAISRVDRERKVSMRLHRGAPANVSSSD LTGRQEVSRIPASQTSVPFDHLGK" 3'UTR 1274..1331 BASE COUNT 294 a 415 c 381 g 241 t ORIGIN 1 gaattccggg caagagtgag ccatggagct acgtgtgggg aacaagtacc gcctgggacg 61 gaagatcggg agcgggtcct tcggagatat ctacctgggt gccaacatcg cctctggtga 121 ggaagtcgcc atcaagctgg agtgtgtgaa gacaaagcac ccccagctgc acatcgagag 181 caagttctac aagatgatgc agggtggcgt ggggatcccg tccatcaagt ggtgcggagc 241 tgagggcgac tacaacgtga tggtcatgga gctgctgggg cctagcctcg aggacctgtt 301 caacttctgt tcccgcaaat tcagcctcaa gacggtgctg ctcttggccg accagatgat 361 cagccgcatc gagtatatcc actccaagaa cttcatccac cgggacgtca agcccgacaa 421 cttcctcatg gggctgggga agaagggcaa cctggtctac atcatcgact tcggcctggc 481 caagaagtac cgggacgccc gcacccacca gcacattccc taccgggaaa acaagaacct 541 gaccggcacg gcccgctacg cttccatcaa cacgcacctg ggcattgagc aaagccgtcg 601 agatgacctg gagagcctgg gctacgtgct catgtacttc aacctgggct ccctgccctg 661 gcaggggctc aaagcagcca ccaagcgcca gaagtatgaa cggatcagcg agaagaagat 721 gtcaacgccc atcgaggtcc tctgcaaagg ctatccctcc gaattctcaa catacctcaa 781 cttctgccgc tccctgcggt ttgacgacaa gcccgactac tcttacctac gtcagctctt 841 ccgcaacctc ttccaccggc agggcttctc ctatgactac gtctttgact ggaacatgct 901 gaaattcggt gcagcccgga atcccgagga tgtggaccgg gagcggcgag aacacgaacg 961 cgaggagagg atggggcagc tacgggggtc cgcgacccga gccctgcccc ctggcccacc 1021 cacgggggcc actgccaacc ggctccgcag tgccgccgag cccgtggctt ccacgccagc 1081 ctcccgcatc cagccggctg gcaatacttc tcccagagcg atctcgcggg tcgaccggga 1141 gaggaaggtg agtatgaggc tgcacagggg tgcgcccgcc aacgtctcct cctcagacct 1201 cactgggcgg caagaggtct cccggatccc agcctcacag acaagtgtgc catttgacca 1261 tctcgggaag tgaggagagc ccccattgga ccagtgtttg cttagtgtct tcactgtatt 1321 ttctggaatt c // LOCUS HUMCSPG1A 1778 bp mRNA PRI 23-MAY-1996 DEFINITION Human chondroitin/dermatan sulfate proteoglycan (PG40) core protein mRNA, complete cds. ACCESSION M14219 NID g181169 KEYWORDS matrix protein; proteoglycan core protein. SOURCE Homo sapiens (clone: 5E.) embryo cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1778) AUTHORS Krusius,T. and Ruoslahti,E. TITLE Primary structure of an extracellular matrix proteoglycan core protein deduced from cloned cDNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83 (20), 7683-7687 (1986) MEDLINE 87017013 FEATURES Location/Qualifiers source 1..1778 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="5E." /cell_line="IMR-90" /cell_type="fibroblast" /dev_stage="embryo" /map="15" sig_peptide 82..171 /gene="CSPG1" /note="398" CDS 82..1161 /gene="CSPG1" /codon_start=1 /db_xref="GDB:398" /product="proteoglycan core protein" /db_xref="PID:g181170" /translation="MKATIILLLLAQVSWAGPFQQRGLFDFMLEDEASGIGPEVPDDR DFEPSLGPVCPFRCQCHLRVVQCSDLGLDKVPKDLPPDTTLLDLQNNKITEIKDGDFK NLKNLHALILVNNKISKVSPGAFTPLVKLERLYLSKNQLKELPEKMPKTLQELRAHEN EITKVRKVTFNGLNQMIVIELGTNPLKSSGIENGAFQGMKKLSYIRIADTNITSIPQG LPPSLTELHLDGNKISRVDAASLKGLNNLAKLGLSFNSISAVDNGSLANTPHLRELHL DNNKLTRVPGGLAEHKYIQVVYLHNNNISVVGSSDFCPPGHNTKKASYSGVSLFSNPV QYWEIQPSTFRCVYVRSAIQLGNYK" gene 82..1161 /gene="CSPG1" mat_peptide 172..1158 /gene="CSPG1" /note="398" /product="proteoglycan core protein" BASE COUNT 525 a 399 c 360 g 494 t ORIGIN Chromosome 15; 1 bp upstream of EcoRI site. 1 gaattccggt tacgtctgcc ccccggtggc aaattcccgg attaaaaggt tccctggttg 61 tgaaaataca tgagataaat catgaaggcc actatcatcc tccttctgct tgcacaagtt 121 tcctgggctg gaccgtttca acagagaggc ttatttgact ttatgctaga agatgaggct 181 tctgggatag gcccagaagt tcctgatgac cgcgacttcg agccctccct aggcccagtg 241 tgccccttcc gctgtcaatg ccatcttcga gtggtccagt gttctgattt gggtctggac 301 aaagtgccaa aggatcttcc ccctgacaca actctgctag acctgcaaaa caacaaaata 361 accgaaatca aagatggaga ctttaagaac ctgaagaacc ttcacgcatt gattcttgtc 421 aacaataaaa ttagcaaagt tagtcctgga gcatttacac ctttggtgaa gttggaacga 481 ctttatctgt ccaagaatca gctgaaggaa ttgccagaaa aaatgcccaa aactcttcag 541 gagctgcgtg cccatgagaa tgagatcacc aaagtgcgaa aagttacttt caatggactg 601 aaccagatga ttgtcataga actgggcacc aatccgctga agagctcagg aattgaaaat 661 ggggctttcc agggaatgaa gaagctctcc tacatccgca ttgctgatac caatatcacc 721 agcattcctc aaggtcttcc tccttccctt acggaattac atcttgatgg caacaaaatc 781 agcagagttg atgcagctag cctgaaagga ctgaataatt tggctaagtt gggattgagt 841 ttcaacagca tctctgctgt tgacaatggc tctctggcca acacgcctca tctgagggag 901 cttcacttgg acaacaacaa gcttaccaga gtacctggtg ggctggcaga gcataagtac 961 atccaggttg tctaccttca taacaacaat atctctgtag ttggatcaag tgacttctgc 1021 ccacctggac acaacaccaa aaaggcttct tattcgggtg tgagtctttt cagcaacccg 1081 gtccagtact gggagataca gccatccacc ttcagatgtg tctacgtgcg ctctgccatt 1141 caactcggaa actataagta attctcaaga aagccctcat ttttataacc tggcaaaatc 1201 ttgttaatgt cattgctaaa aaataaataa aagctagata ctggaaacct aactgcaatg 1261 tggatgtttt acccacatga cttattatgc atgttatgat cagtagttga ttttgagaaa 1321 gctctatgag ctctaagtaa ctgcatggtt ttttgtttaa tgtaatatag gagacccttc 1381 acattcccaa ggaatatatt ccaaaacatt tttgtgaata tctaagtttg tgaaactact 1441 agggcatgat acagtaaggt gtaattacag aatttacgaa atgtaaatga cctctacaga 1501 gttttatgga atacctggta ctaacgtagg cagctgcaaa accacactga gttacagctg 1561 tcagccctcc tcattcctaa ataacttgcc ttacatatca gccctcccac ttctgaagtt 1621 caaattagtg cctcggaaat gtagaattta ttatttgtca tttttttttt tttagcatag 1681 attgagaaca gttgaactct taaatcctca gatgccaggg gtctgctcta gcatcagtaa 1741 gtatttagca gaaactaact ccgtaatgaa tggaattc // LOCUS HUMCSYNA 2647 bp DNA PRI 30-SEP-1988 DEFINITION Human c-syn protooncogene, complete cds. ACCESSION M14333 NID g181171 KEYWORDS c-myc proto-oncogene. SOURCE Human (placental) DNA, clone lambda-SN-2. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2647) AUTHORS Semba,K., Nishizawa,M., Miyajima,N., Yoshida,M.C., Sukegawa,J., Yamanashi,Y., Sasaki,M., Yamamoto,T. and Toyoshima,K. TITLE yes-related protooncogene, syn, belongs to the protein-tyrosine kinase family JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 5459-5463 (1986) MEDLINE 86287278 COMMENT syn belongs to the protein-tyrosine kinase family of retroviral oncogenes. FEATURES Location/Qualifiers source 1..2647 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 580..2193 /note="c-syn" /codon_start=1 /db_xref="PID:g181172" /translation="MGCVQCKDKEATKLTEERDGSLNQSSGYRYGTDPTPQHYPSFGV TSIPNYNNFHAAGGQGLTVFGGVNSSSHTGTLRTRGGTGVTLFVALYDYEARTEDDLS FHKGEKFQILNSSEGDWWEARSLTTGETGYIPSNYVAPVDSIQAEEWYFGKLGRKDAE RQLLSFGNPRGTFLIRESETTKGAYSLSIRDWDDMKGDHVKHYKIRKLDNGGYYITTR AQFETLQQLVQHYSERAAGLCCRLVVPCHKGMPRLTDLSVKTKDVWEIPRESLQLIKR LGNGQFGEVWMGTWNGNTKVAIKTLKPGTMSPESFLEEAQIMKKLKHDKLVQLYAVVS EEPIYIVTEYMNKGSLLDFLKDGEGRALKLPNLVDMAAQVAAGMAYIERMNYIHRDLR SANILVGNGLICKIADFGLARLIEDNEYTARQGAKFPIKWTAPEAALYGRFTIKSDVW SFGILLTELVTKGRVPYPGMNNREVLEQVERGYRMPCPQDCPISLHELMIHCWKKDPE ERPTFEYLQSFLEDYFTATEPQYQPGENL" BASE COUNT 683 a 695 c 716 g 553 t ORIGIN 1 gccgcgctgg tggcggcggc gcgtcgttgc agttgcgcca tctgtcagga gcggagccgg 61 cgaggagggg gctgccgcgg gcgaggagga ggggtcgccg cgagccgaag gccttcgaga 121 cccgcccgcc gcccggcggc gagagtagag gcgaggttgt tgtgcgagcg gcgcgtcctc 181 tcccgcccgg gcgcgccgcg cttctcccag cgcaccgagg accgcccggg cgcacacaaa 241 gccgccgccc gcgccgcacc gcccggcggc cgccgcccgc gccagggagg gattcggccg 301 ccgggccggg gacaccccgg cgccgccccc tcggtgctct cggaaggccc accggctccc 361 gggcccgccg gggacccccc ggagccgcct cggccgcgcc ggaggagggc ggggagagga 421 ccatgtgagt gggctccgga gcctcagcgc cgcgcagttt ttttgaagaa gcaggatgct 481 gatctaaacg tggaaaaaga ccagtcctgc ctctgttgta gaagacatgt ggtgtatata 541 aagtttgtga tcgttggcgg aaattttgga atttagataa tgggctgtgt gcaatgtaag 601 gataaagaag caacaaaact gacggaggag agggacggca gcctgaacca gagctctggg 661 taccgctatg gcacagaccc cacccctcag cactacccca gcttcggtgt gacctccatc 721 cccaactaca acaacttcca cgcagccggg ggccaaggac tcaccgtctt tggaggtgtg 781 aactcttcgt ctcatacggg gaccttgcgt acgagaggag gaacaggagt gacactcttt 841 gtggcccttt atgactatga agcacggaca gaagatgacc tgagttttca caaaggagaa 901 aaatttcaaa tattgaacag ctcggaagga gattggtggg aagcccgctc cttgacaact 961 ggagagacag gttacattcc cagcaattat gtggctccag ttgactctat ccaggcagaa 1021 gagtggtact ttggaaaact tggccgaaaa gatgctgagc gacagctatt gtcctttgga 1081 aacccaagag gtacctttct tatccgcgag agtgaaacca ccaaaggtgc ctattcactt 1141 tctatccgtg attgggatga tatgaaagga gaccatgtca aacattataa aattcgcaaa 1201 cttgacaatg gtggatacta cattaccacc cgggcccagt ttgaaacact tcagcagctt 1261 gtacaacatt actcagagag agctgcaggt ctctgctgcc gcctagtagt tccctgtcac 1321 aaagggatgc caaggcttac cgatctgtct gtcaaaacca aagatgtctg ggaaatccct 1381 cgagaatccc tgcagttgat caagagactg ggaaatgggc agtttgggga agtatggatg 1441 ggtacctgga atggaaacac aaaagtagcc ataaagactc ttaaaccagg cacaatgtcc 1501 cccgaatcat tccttgagga agcgcagatc atgaagaagc tgaagcacga caagctggtc 1561 cagctctatg cagtggtgtc tgaggagccc atctacatcg tcaccgagta tatgaacaaa 1621 ggaagtttac tggatttctt aaaagatgga gaaggaagag ctctgaaatt accaaatctt 1681 gtggacatgg cagcacaggt ggctgcagga atggcttaca tcgagcgcat gaattatatc 1741 catagagatc tgcgatcagc aaacattcta gtggggaatg gactcatatg caagattgct 1801 gacttcggat tggcccgatt gatagaagac aatgagtaca cagcaagaca aggtgcaaag 1861 ttccccatca agtggacggc ccccgaggca gccctgtacg ggaggttcac aatcaagtct 1921 gacgtgtggt cttttggaat cttactcaca gagctggtca ccaaaggaag agtgccatac 1981 ccaggcatga acaaccggga ggtgctggag caggtggagc gaggctacag gatgccctgc 2041 ccgcaggact gccccatctc tctgcatgag ctcatgatcc actgctggaa aaaggaccct 2101 gaagaacgcc ccacttttga gtacttgcag agcttcctgg aagactactt taccgcgaca 2161 gagccccagt accaacctgg tgaaaacctg taaggcccgg gtctgcggag agaggccttg 2221 tcccagaggc tgccccaccc ctccccatta gctttcaatt ccgtagccag ctgctcccca 2281 gcagcggaac cgcccaggat cagattgcat gtgactctga agctgacgaa cttccatggc 2341 cctcattaat gacacttgtc cccaaatccg aacctcctct gtgaagcatt cgagacagaa 2401 ccttgttatt tctcagactt tggaaaatgc attgtatcga tgttatgtaa aaggccaaac 2461 ctctgttcag tgtaaatagt tactccagtg ccaacaatcc tagtgctttc cttttttaaa 2521 aatgcaaatc ctatgtgatt ttaactctgt cttcacctga ttcaactaaa aaaaaaaagt 2581 attattttcc aaaagtggcc tctttgtcta aaacaataaa attttttttc atgttttaac 2641 aaaaacc // LOCUS HUMCTI 2997 bp mRNA PRI 31-DEC-1994 DEFINITION Homo sapiens erythrocyte membrane protein mRNA, complete cds. ACCESSION M81635 NID g181183 KEYWORDS cation transport inhibitor; erythrocyte membrane protein; stomatin peptide. SOURCE Homo sapiens m adult bone marrow cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2997) AUTHORS Stewart,G.W., Hepworth-Jones,B.J., Keen,J.N., Dash,B.J.C., Argent,A.C. and Casimir,C.M. JOURNAL Unpublished (1991) FEATURES Location/Qualifiers source 1..2997 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /sex="m" /tissue_type="bone marrow" mRNA 1..2997 /gene="stomatin peptide" /note="putative" gene 1..2997 /gene="stomatin peptide" CDS 22..888 /gene="stomatin peptide" /note="The gene codes for an erythrocyte membrane protein which is deficient in patients with hereditary stomatocytosis. These membranes show very high permeability to Na+ and K+.; putative" /codon_start=1 /function="Possible cation transport inhibitor" /product="stomatin peptide" /db_xref="PID:g181184" /translation="MAEKRDTRDSEAQRLPDSFKDSPSKGLGPCGWILVAFSFLFTVI TFPISIWMCIKIIKEYERAIIFRLGRILQGGAKGPGLFFILPCTDSFIKVDMRTISFD IPPQEILTKDSVTISVDGVVYYRVQNATLAVANITNADSATRLLAQTTLRNVLGTKNL SQILSDREEIAHNMQSTLDDATDAWGIKVERVEIKDVKLPVQLQRAMAAEAEASREAR AKVIAAEGEMNASRALKEASMVITESPAALQLRYLQTLTTIAAEKNSTIVFPLPIDML QGIIGAKHSHLG" repeat_region 1338..1613 /note="putative" /rpt_family="Alu" polyA_site 2997 /gene="stomatin peptide" /note="putative" BASE COUNT 818 a 661 c 606 g 912 t ORIGIN 1 ggactgcggt ctcgggcagc aatggccgag aagcgcgaca cacgggactc cgaagcccag 61 cggctccccg actccttcaa ggacagcccc agtaagggcc ttggaccttg cggatggatt 121 ttggtggcgt tctcattctt attcaccgtt ataactttcc caatctcaat atggatgtgc 181 ataaagatta taaaagagta tgaaagagcc atcatcttta gattgggtcg cattttacaa 241 ggaggagcca aaggacctgg tttgtttttt attctgccat gcactgacag cttcatcaaa 301 gtggacatga gaactatttc atttgatatt cctcctcagg agatcctgac aaaggattca 361 gtgacaatta gcgtggatgg tgtggtctat taccgcgttc agaatgcaac cctggctgtg 421 gcaaatatca ccaacgctga ctcagcaacc cgtcttttgg cacaaactac tctgaggaat 481 gttctgggca ccaagaatct ttctcagatc ctctctgaca gagaagaaat tgcacacaac 541 atgcagtcta ctctggatga tgccactgat gcctggggaa taaaggtgga gcgtgtggaa 601 attaaggatg tgaaactacc tgtgcagctc cagagagcta tggctgcaga agcagaagcg 661 tcccgcgagg cccgcgccaa ggttattgca gccgaaggag aaatgaatgc atccagggct 721 ctgaaagaag cctccatggt catcactgaa tctcctgcag cccttcagct ccgatacctg 781 cagacactga ccaccattgc tgctgagaaa aactcaacaa ttgtcttccc tctgcccata 841 gatatgctgc aaggaatcat aggggcaaaa cacagccatc taggctagtg tagagatgag 901 cgctagcctt ccaagcatga agtcggggac caaattagcc tttaactcat aaagagaggg 961 tagggctttt ctttttccat atgtcaattg tggtgttccc agaatgtata gcagttataa 1021 aaataggtga aagaattgtt agcttgtaaa tactgagaga ttggtgattt atataaggta 1081 atctgttagt cttaaaatag ttaaaagttt gtatttttag attattatgt agtaggttag 1141 atccctcttg ttttgacttc cactgactca ttctgaaccc cctaagcacc caggccacag 1201 gcaagaacct gggctgtaac tgccacctga caccgctgac tggctaaatg ctttgcagaa 1261 agtgatgacc ttacaccaca accagcttct ccaggtcata tgtgccttac ctccagaagt 1321 cttttttttt ttttttttct gagatggagt ttcactcttg ttgcccaggc tggagtgcaa 1381 tagcatgatc tcggctcact gcaacctccg cctcctgggt tcaagagatt ctcctgcctc 1441 agcctcccca gtagctggga ttacaggctc atgccaccat gcccagctaa tttttgtatt 1501 attattattg ttttttagta gagacggggt ttcaccatgt tggccaggct agtcacgaac 1561 tcctaacctc aggtgatcca cccacctctg cctccaaagt gctggattac aggctgagct 1621 accaccctgg tttggagagt cttaattaat tgaaatttcc ctaatgttca tttattttct 1681 aaatccagcc gtgtttcaga ataatcctta cttgagagta gccattttct tgtgtacttg 1741 tcagaactag aggaaatagc caagactaat gaaaaacatt actctaaccc ttaaaagact 1801 tttaaattca ctactagagt ggtcatttta aaaatacatc catgttttaa cttattttga 1861 gcctttcttt tatgagtaaa tgattcctcc ttgttctgtc tttcaaacca gctaaatatt 1921 tgtcacaaaa gtgacttttt tctcactgtt gcctattttc atatatcagg ttttaaatag 1981 ttttaatttt ttaataaaat ttttctctac gttctatatg caattgttat atatctattt 2041 gaatagctga aggactaaaa tactttttta agagataact tcaggaaacc attatatttt 2101 actatctgca tgctgttaac tgtggtacac tgtgaaatat gttgattaca aacccattca 2161 ttacatagta taaggaattc acagtatatt gactatatag tgtctaatga ctgggcagat 2221 actgtcaact tacaatatct atatagagag gctttaaact taccttactc attctctatg 2281 atgtatgact tgatgctgaa agaggaagct ggtcagctcc tcatggacaa caaattctta 2341 gtctataata ttaggagaca tctctagttt tgcaaatgtc tgtgaatctg agcaacctgg 2401 acttctgctt actggccaga aagctggcgg gtgacatttg taacatttcc tctttgagac 2461 tctgagttca cctagagaag tctaagcata acagctttct ttcccagcac gagcctttat 2521 agctctcttt agctcaacca ctctgtccat ccagccaatg gatgtccttc cctgtaccca 2581 attcaagctt attttaggga agccttgaaa ctaccatgta tctggctcta gctgagttat 2641 tgaggattga gccagtgcaa cgttaaactc agtgcactta catttgattt aaatgatggt 2701 tttatctgtt gtgtgaagtg gttcaccctt gaggaccagg agcctccata tcctgactga 2761 aaaccttttc tgagacttag agtaacagta cttttggttc cttgagttct cctgtctcca 2821 gatacctaaa tgaccttgac ttttctgcct tgtgaattcg tagtccaatc agctgaaatt 2881 aaatcacttg ggagggacgc atagaaggag ctctaggaac acagtgccag tgcagaagtt 2941 tctccaggtg gcctcccttt ccaacaatgt acataataaa gtgtatgcac tttcact // LOCUS HUMCTLA4B 672 bp mRNA PRI 01-NOV-1994 DEFINITION Homo sapiens Ig superfamily CTLA-4 mRNA, complete cds. ACCESSION L15006 NID g291928 KEYWORDS CTLA-4 gene; Ig superfamily; cytotoxic T-lymphocyte-associated protein 4. SOURCE Homo sapiens (tissue library: HW-ZM23) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 672) AUTHORS Dariavach,P., Mattei,M.G., Golstein,P. and Lefranc,M.P. TITLE Human Ig superfamily CTLA-4 gene: chromosomal localization and identity of protein sequence between murine and human CTLA-4 cytoplasmic domains JOURNAL Eur. J. Immunol. 18 (12), 1901-1905 (1988) MEDLINE 89120925 REFERENCE 2 (bases 1 to 672) AUTHORS Gorman,S.D. TITLE Human CTLA-4 cDNA sequence JOURNAL Unpublished (1993) REFERENCE 3 (bases 1 to 672) AUTHORS Harper,K., Balzano,C., Rouvier,E., Mattei,M.G., Luciani,M.F. and Golstein,P. TITLE CTLA-4 and CD28 activated lymphocyte molecules are closely related in both mouse and human as to sequence, message expression, gene structure, and chromosomal location JOURNAL J. Immunol. 147 (3), 1037-1044 (1991) MEDLINE 91318145 FEATURES Location/Qualifiers source 1..672 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HPB-ALL" /tissue_lib="HW-ZM23" /map="2q33" mRNA <1..>672 /gene="CTLA4" /note="G00-119-818; putative" gene 1..672 /gene="CTLA4" primer_bind complement(1..18) /gene="CTLA4" /note="cDNA isolated by PCR using 5' primer corresponding to thisregion.; putative" CDS 1..672 /gene="CTLA4" /note="putative" /codon_start=1 /db_xref="GDB:G00-119-818" /product="cytotoxic T-lymphocyte-associated protein 4" /db_xref="PID:g291929" /translation="MACLGFQRHKAQLNLAARTWPCTLLFFLLFIPVFCKAMHVAQPA VVLASSRGIASFVCEYASPGKATEVRVTVLRQADSQVTEVCAATYMTGNELTFLDDSI CTGTSSGNQVNLTIQGLRAMDTGLYICKVELMYPPPYYLGIGNGTQIYVIDPEPCPDS DFLLWILAAVSSGLFFYSFLLTAVSLSKMLKKRSPLTTGVYVKMPPTEPECEKQFQPY FIPIN" conflict 49 /gene="CTLA4" /note="Conflict could be due to allelic difference. 'G' found in two independent PCR-isolated clones from the same cDNA library." /citation=[2] /citation=[3] /replace="a" conflict 272 /gene="CTLA4" /note="Conflict could be due to allelic difference. 'C' found in two independent PCR-isolated clones from the same cDNA library." /citation=[2] /citation=[1] /replace="t" conflict 439 /gene="CTLA4" /note="Conflict could be due to allelic difference. 'A' found in two independent PCR-isolated clones from the same cDNA library." /citation=[2] /citation=[1] /replace="g" primer_bind 656..672 /gene="CTLA4" /note="cDNA isolated by PCR using 3' primer corresponding to this region.; putative" BASE COUNT 157 a 182 c 163 g 170 t ORIGIN 1 atggcttgcc ttggatttca gcggcacaag gctcagctga acctggctgc caggacctgg 61 ccctgcactc tcctgttttt tcttctcttc atccctgtct tctgcaaagc aatgcacgtg 121 gcccagcctg ctgtggtact ggccagcagc cgaggcatcg ccagctttgt gtgtgagtat 181 gcatctccag gcaaagccac tgaggtccgg gtgacagtgc ttcggcaggc tgacagccag 241 gtgactgaag tctgtgcggc aacctacatg acggggaatg agttgacctt cctagatgat 301 tccatctgca cgggcacctc cagtggaaat caagtgaacc tcactatcca aggactgagg 361 gccatggaca cgggactcta catctgcaag gtggagctca tgtacccacc gccatactac 421 ctgggcatag gcaacggaac ccagatttat gtaattgatc cagaaccgtg cccagattct 481 gacttcctcc tctggatcct tgcagcagtt agttcggggt tgttttttta tagctttctc 541 ctcacagctg tttctttgag caaaatgcta aagaaaagaa gccctcttac aacaggggtc 601 tatgtgaaaa tgcccccaac agagccagaa tgtgaaaagc aatttcagcc ttattttatt 661 cccatcaatt ga // LOCUS HUMCTSE 2158 bp mRNA PRI 01-NOV-1994 DEFINITION Human cathepsin E mRNA, complete cds. ACCESSION J05036 NID g181193 KEYWORDS aspartic proteinase; cathepsin. SOURCE Human stomach, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2158) AUTHORS Azuma,T., Pals,G., Mohandas,T.K., Couvreur,J.M. and Taggart,R.T. TITLE Human gastric cathepsin E. Predicted sequence, localization to chromosome 1, and sequence homology with other aspartic proteinases JOURNAL J. Biol. Chem. 264 (28), 16748-16753 (1989) MEDLINE 89380302 FEATURES Location/Qualifiers source 1..2158 /organism="Homo sapiens" /db_xref="taxon:9606" /map="1q31" mRNA <1..2158 /note="CTSE mRNA" sig_peptide 50..100 /gene="CTSE" /note="cathepsin E signal peptide" gene 50..1240 /gene="CTSE" CDS 50..1240 /gene="CTSE" /note="cathepsin E precursor" /codon_start=1 /db_xref="GDB:G00-119-821" /db_xref="PID:g181194" /translation="MKTLLLLLLVLLELGEAQGSLHRVPLRRHPSLKKKLRARSQLSE FWKSHNLDMIQFTESCSMDQSAKEPLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNL WVPSVYCTSPACKTHSRFQPSQSSTYSQPGQSFSIQYGTGSLSGIIGADQVSVEGLTV VGQQFGESVTEPGQTFVDAEFDGILGLGYPSLAVGGVTPVFDNMMAQNLVDLPMFSVY MSSNPEGGAGSELIFGGYDHSHFSGSLNWVPVTKQAYWQIALDNIQVGGTVMFCSEGC QAIVDTGTSLITGPSDKIKQLQNAIGAAPVDGEYAVECANLNVMPDVTFTINGVPYTL SPTAYTLLDFVDGMQFCSSGFQGLDIHPPAGPLWILGDVFIRQFYSVFDRGNNRVGLA PAVP" mat_peptide 101..1237 /gene="CTSE" /note="cathepsin E" BASE COUNT 530 a 552 c 500 g 576 t ORIGIN 1 ggagagaaga aaggaggggg caagggagaa gctgctggtc ggactcacaa tgaaaacgct 61 ccttcttttg ctgctggtgc tcctggagct gggagaggcc caaggatccc ttcacagggt 121 gcccctcagg aggcatccgt ccctcaagaa gaagctgcgg gcacggagcc agctctctga 181 gttctggaaa tcccataatt tggacatgat ccagttcacc gagtcctgct caatggacca 241 gagtgccaag gaacccctca tcaactactt ggatatggaa tacttcggca ctatctccat 301 tggctcccca ccacagaact tcactgtcat cttcgacact ggctcctcca acctctgggt 361 cccctctgtg tactgcacta gcccagcctg caagacgcac agcaggttcc agccttccca 421 gtccagcaca tacagccagc caggtcaatc tttctccatt cagtatggaa ccgggagctt 481 gtccgggatc attggagccg accaagtctc tgtggaagga ctaaccgtgg ttggccagca 541 gtttggagaa agtgtcacag agccaggcca gacctttgtg gatgcagagt ttgatggaat 601 tctgggcctg ggatacccct ccttggctgt gggaggagtg actccagtat ttgacaacat 661 gatggctcag aacctggtgg acttgccgat gttttctgtc tacatgagca gtaacccaga 721 aggtggtgcg gggagcgagc tgatttttgg aggctacgac cactcccatt tctctgggag 781 cctgaattgg gtcccagtca ccaagcaagc ttactggcag attgcactgg ataacatcca 841 ggtgggaggc actgttatgt tctgctccga gggctgccag gccattgtgg acacagggac 901 ttccctcatc actggccctt ccgacaagat taagcagctg caaaacgcca ttggggcagc 961 ccccgtggat ggagaatatg ctgtggagtg tgccaacctt aacgtcatgc cggatgtcac 1021 cttcaccatt aacggagtcc cctataccct cagcccaact gcctacaccc tactggactt 1081 cgtggatgga atgcagttct gcagcagtgg ctttcaagga cttgacatcc accctccagc 1141 tgggcccctc tggatcctgg gggatgtctt cattcgacag ttttactcag tctttgaccg 1201 tgggaataac cgtgtgggac tggccccagc agtcccctaa ggaggggcct tgtgtctgtg 1261 cctgcctgtc tgacagacct tgaatatgtt aggctggggc attctttaca cctacaaaaa 1321 gttattttcc agagaatgta gctgtttcca gggttgcaac ttgaattaag accaaacaga 1381 acatgagaat acacacacac acacacatat acacacacac acacttcaca catacacacc 1441 actcccacca ccgtcatgat ggaggaatta cgttatacat tcatattttg tattgatttt 1501 tgattatgaa aatcaaaaat tttcacattt gattatgaaa atctccaaac atatgcacaa 1561 gcagagatca tggtataata aatccctttg caactccact cagccctgac aacccatcca 1621 cacacggcca ggcctgttta tctacactgc tgcccactcc tctctccagc tccacatgct 1681 gtacctggat cattctgaag caaattccga gcattacatc attttgtcca taaatatttc 1741 taacatcctt aaatatacaa tcggaattca agcatctccc attgtcccac aaatgtttgg 1801 ctgtttttgt agttggattg tttgtattag gattcaagca aggcccatat attgcattta 1861 tttgaaatgt ctgtaagtct ctttccatct acagagttta gcacatttga acgttgctgg 1921 ttgaaatccc gaggtgtcat ttgacatggt tctctgaact tatctttcct ataaaatggt 1981 agttagatct ggaggtctga ttttgtggca aaaatacttc ctaggtggtg ctgggtactt 2041 cttgttgcat cctgtcagga ggcagataat gctggtgcct ctctattggt aatgttaaga 2101 ctgctgggtg ggtttggagt tcttggcttt aatcattcat tacaaagttc agcatttt // LOCUS HUMCYB5 743 bp mRNA PRI 15-JUN-1989 DEFINITION Human cytochrome b5 mRNA, complete cds. ACCESSION M22865 NID g181226 KEYWORDS cytochrome. SOURCE Human liver, cDNA to mRNA, (library of Clontech and S.L.C.Woo), clones pH[1,2]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 743) AUTHORS Yoo,M. and Steggles,A.W. TITLE The complete nucleotide sequence of human liver cytochrome b-5 mRNA JOURNAL Biochem. Biophys. Res. Commun. 156, 576-580 (1988) MEDLINE 89025904 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by A.W.Steggles, 24-APR-1989. FEATURES Location/Qualifiers source 1..743 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..743 /note="CYB5 mRNA" CDS 53..457 /note="cytochrome b5" /codon_start=1 /db_xref="PID:g181227" /translation="MAEQSDEAVKYYTLEEIQKHNHSKSTWLILHHKVYDLTKFLEEH PGGEEVLREQAGGDATENFEDVGHSTDAREMSKTFIIGELHPDDRPKLNKPPETLITT IDSSSSWWTNWVIPAISAVAVALMYRLYMAED" BASE COUNT 208 a 169 c 170 g 196 t ORIGIN 412 bp upstream of PstI site. 1 cagccagctc gacggggctg tgtgtgctgg gcctggctcg cggcgaaccg agatggcaga 61 gcagtcggac gaggccgtga agtactacac cctagaggag attcagaagc acaaccacag 121 caagagcacc tggctgatcc tgcaccacaa ggtgtacgat ttgaccaaat ttctggaaga 181 gcatcctggt ggggaagaag ttttaaggga acaagctgga ggtgacgcta ctgagaactt 241 tgaggatgtc gggcactcta cagatgccag ggaaatgtcc aaaacattca tcattgggga 301 gctccatcca gatgacagac caaagttaaa caagcctccg gaaactctta tcactactat 361 tgattctagt tccagttggt ggaccaactg ggtgatccct gccatctctg cagtggccgt 421 cgccttgatg tatcgcctat acatggcaga ggactgaaca cctcctcaga agtcagcgca 481 ggccgagcct gctttggaca cgggagaaaa gaagccattg ctaactactt caactgacag 541 aaaccttcac ttgaaaacaa tgattttaat atatctcttt ctttttcttc cgacattaga 601 aacaaaacaa aaagaactgt cctttctgcg ctcaaatttt tcgagtgtgc ctttttattc 661 atctacttta ttttgatgtt tccttaatgt gtaatttact tattataagc atgatctttt 721 aaaaatatat ttggctttta aag // LOCUS HUMCYCG1R 1602 bp mRNA PRI 25-MAR-1996 DEFINITION Homo sapiens cyclin G1 mRNA, complete cds. ACCESSION L49504 NID g1236232 KEYWORDS cyclin; cyclin G1. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Horne,M.C., Goolsby,G.L., Donaldson,K.L., Tran,D., Neubauer,M. and Wahl,A.F. TITLE Cyclin G1 and cyclin G2 comprise a new family of cyclins with contrasting tissue-specific and cell cycle-regulated expression JOURNAL J. Biol. Chem. 271 (11), 6050-6061 (1996) MEDLINE 96198057 FEATURES Location/Qualifiers source 1..1602 /organism="Homo sapiens" /note="(vector lambda zap)" /db_xref="taxon:9606" /cell_line="Jujkat" /cell_type="T cell" 5'UTR 1..211 mRNA 1..1602 CDS 212..1099 /codon_start=1 /product="cyclin G1" /db_xref="PID:g1236233" /translation="MIEVLTTTDSQKLLHQLNALLEQESRCQPKVCGLRLIESAHDNG LRMTARLRDFEVKDLLSLTQFFGFDTETFSLAVNLLDRFLSKMKVQPKHLGCVGLSCF YLAVKSIEEERNVPLATDLIRISQYRFTVSDLMRMEKIVLEKVCWKVKATTAFQFLQL YYSLLQENLPLERRNSINFERLEAQLKACHCRIIFSKAKPSVLALSIIALEIQAQKCV ELTEGIECLQKHSKINGRDLTFWQELVSKCLTEYSSNKCSKPNVQKLKWIVSGRTARQ LKHSYYRITHLPTIPEMVP" 3'UTR 1100..1602 BASE COUNT 482 a 319 c 339 g 462 t ORIGIN 1 tcgataagct tgatatcgaa ttccgatcag ggccgagttg tctcggcggc gctgccgagg 61 cctccaccca ggacagtccc cctccccggg cctctctcct cttgcctacg agtcccctct 121 cctcgtaggc ctctcggatc tgatatcgtg gggtgaggtg agcaggcccg gggagggtgg 181 ttaccgctga ggagctgcag tctctgtcaa gatgatagag gtactgacaa caactgactc 241 tcagaaactg ctacaccagc tgaatgccct gttggaacag gagtctagat gtcagccaaa 301 ggtctgtggt ttgagactaa ttgagtctgc acacgataat ggcctcagaa tgactgcaag 361 actaagggac tttgaagtaa aagatcttct tagtctaact cagttctttg gctttgacac 421 agagacattt tctctagctg tgaatttact ggacagattc ctgtctaaaa tgaaggtaca 481 gcccaagcac cttgggtgtg ttggactgag ctgcttttat ttggctgtaa aatcaataga 541 agaggaaagg aatgtcccat tggcaactga cttgatccga ataagtcaat ataggtttac 601 ggtttcagac ttgatgagaa tggaaaagat tgtattggag aaggtgtgtt ggaaagtcaa 661 agctactact gcctttcaat ttctgcaact gtattattca ctccttcaag agaacttgcc 721 acttgaaagg agaaatagca ttaattttga aagactagaa gctcaactga aggcatgtca 781 ttgcaggatc atattttcta aagcaaagcc ttctgtgttg gcattgtcta tcattgcatt 841 agagatccaa gcacagaagt gtgtagagtt aacagaagga atagaatgtc ttcagaaaca 901 ttccaagata aatggcagag atctgacctt ctggcaagag cttgtatcca aatgtttaac 961 tgaatattca tcaaataagt gttccaaacc aaatgttcag aagttgaaat ggattgtttc 1021 tgggcgtact gcacggcaat tgaagcatag ctactacaga ataactcacc ttccaacaat 1081 tcctgaaatg gtcccttaac tggattatta cagcaccaaa aaacttctct gaagcctttc 1141 tccacaacct tgttctatgg attccataat gttacaatgg atttaagcta tgaagcctca 1201 aaacatcacg agataagcat gatggtctca gacttgggaa aactgcctaa tattatgctg 1261 tagtggaatt atgtttatga tttgaattca tctgtgaagg cattcaaatc aaagctaaaa 1321 gcctaaatgt gaaatgctaa tgacaagcct gagaaggtaa actatgaatc ttcatttcta 1381 tcattgatct aactttagat attggatcaa tatatttagg tggtattgaa aatgctattg 1441 gaggagtcac actaatacta tcaactatca gtcttcccac agcttcaatc actgtcatta 1501 ttctaatcct actcctactt aaattttaag ttatgaggtt tatgtcgaaa gcaacatttc 1561 acaaatgtac ttttaaggca taataagggt taacattcta gg // LOCUS HUMCYCG2R 1410 bp mRNA PRI 25-MAR-1996 DEFINITION Homo sapiens cyclin G2 mRNA, complete cds. ACCESSION L49506 NID g1236234 KEYWORDS cyclin; cyclin G2. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Horne,M.C., Goolsby,G.L., Donaldson,K.L., Tran,D., Neubauer,M. and Wahl,A.F. TITLE Cyclin G1 and cyclin G2 comprise a new family of cyclins with contrasting tissue-specific and cell cycle-regulated expression JOURNAL J. Biol. Chem. 271 (11), 6050-6061 (1996) MEDLINE 96198057 FEATURES Location/Qualifiers source 1..1410 /organism="Homo sapiens" /note="(vector lambda zap)" /db_xref="taxon:9606" /cell_line="Jurkat" /cell_type="T cell" 5'UTR 1..236 mRNA 1..1410 CDS 237..1271 /codon_start=1 /product="cyclin G2" /db_xref="PID:g1236235" /translation="MKDLGAEHLAGHEGVQLLGLLNVYLEQEERFQPREKGLSLIEAT PENDNTLCPGLRNAKVEDLRSLANFFGSCTETFVLAVNILDRFLALMKVKPKHLSCIG VCSFLLAARIVEEDCNIPSTHDVIRISQCKCTASDIKRMEKIISEKLHYELEATTALN FLHLYHTIILCHTSERKEILSLDKLEAQLKACNCRLIFSKAKPSVLALCLLNLEVETL KSVELLEILLLVKKHSKINDTEFFYWRELVSKCLAEYSSPECCKPDLKKLVWIVSRRT AQNLHNSYYSVPELPTIPEGGCFDESESEDSCEDMSCGEESLSSSPPSDQECTFFFNF KVAQTLCFPS" 3'UTR 1272..1410 BASE COUNT 364 a 308 c 321 g 417 t ORIGIN 1 ggtcttaccc agcgctggcc ggcggacctg accacggctc ctccccctgc cacacctccc 61 tggccgtggt ggggattgcc ctggggctct ggcggacgtg actggtccct tcacccgctc 121 cttgtcgggg tgtgctgggg cgacccttgc tggaggtact ggcctcagcc ctttctcccg 181 cttccccacc cctcttaccc ccagattaca ttctctgtgt ggtgtcttta ctgcagatga 241 aggatttggg ggcagagcac ttggcaggtc atgaaggggt ccaacttctc gggttgttga 301 acgtctacct ggaacaagaa gagagattcc aacctcgaga aaaagggctg agtttgattg 361 aggctacccc ggagaatgat aacactttgt gtccaggatt gagaaatgcc aaagttgaag 421 atttaaggag tttagccaac ttttttggat cttgcactga aacttttgtc ctggctgtca 481 atattttgga caggttcttg gctcttatga aggtgaaacc taaacatttg tcttgcattg 541 gagtctgttc ttttttgctg gctgctagaa tagttgaaga agactgcaat attccatcca 601 ctcatgatgt gatccggatt agtcagtgta aatgtactgc ttctgacata aaacggatgg 661 aaaaaataat ttcagaaaaa ttgcactatg aattggaagc tactactgcc ttaaactttt 721 tgcacttata ccatactatt atactttgtc atacttcaga aaggaaagaa atactgagcc 781 ttgataaact agaagctcag ctgaaagctt gcaactgccg actcatcttt tcaaaagcaa 841 aaccatctgt attagccttg tgccttctca atttggaagt ggaaactttg aaatctgttg 901 aattactgga aattctcttg ctagttaaaa aacattccaa gattaatgac actgagttct 961 tctactggag agagttggtt tctaaatgcc tagccgagta ttcttctcct gaatgttgca 1021 aaccagatct taagaagttg gtttggatcg tttcaaggcg cacagcccag aacctccaca 1081 acagctacta tagtgttcct gagctgccaa cgatacctga ggggggttgt tttgatgaaa 1141 gtgaaagtga ggactcttgt gaagatatga gttgtggaga ggagagtctc agcagctctc 1201 ctcccagtga tcaagagtgc accttctttt tcaacttcaa agtggcacaa acactgtgct 1261 ttccatctta gaaatctgat tgttctgtca gaatttatat ttacaggttt caaagcaata 1321 aatgggggaa taggtagttt cctggtttag cccccatcta gtcaggaatt aatatactgg 1381 aatacctacc ttctatttgt tattcagatc // LOCUS HUMCYCLORA 4272 bp mRNA PRI 07-MAY-1993 DEFINITION Human cyclophilin-related protein mRNA, complete cds. ACCESSION L04288 NID g181251 KEYWORDS cyclophilin-related protein; peptidyl-prolyl cis-trans isomerase; transmembrane protein. SOURCE Homo sapiens (library: Lambda gt11, Lambda gt10) adult blood cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4272) AUTHORS Anderson,S.K., Gallinger,S., Roder,J., Frey,J., Young,H. and Ortaldo,J. TITLE A cyclophilin-related protein involved in the function of natural killer cells JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90, 542-546 (1993) MEDLINE 93133824 FEATURES Location/Qualifiers source 1..4272 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="NK" /dev_stage="adult" /tissue_type="blood" /tissue_lib="Lambda gt11, Lambda gt10" /map="3p21-24" CDS 52..4263 /codon_start=1 /function="tumor recognition" /product="cyclophilin-related protein" /db_xref="PID:g181252" /translation="MGENSVALGGPAWGRRRRSVSGVGVWLQWQCFLFCSRGPAQAGG QPALAATSVAMGAQDRPQCHFDIEINREPVGRIMFQLFSDICPKTCKNFLCLCSGEKG LGKTTGKKLCYKGSTFHRVVKNFMIQGGDFSEGNGKGGESIYGGYFKDENFILKHDRA FLLSMANRGKHTNGSQFFITTKPAPHLDGVHVVFGLVISGFEVIEQIENLKTDAASRP YADVRVIDCGVLATKSIKDVFEKKRKKPTHSEGSDSSSNSSSSSESSSESELEHERSR RRKHKRRPKVKRSKKRRKEASSSEEPRNKHAMNPKGHSERSDTNEKRSVDSSAKREKP VVRPEEIPPVPENRFLLRRDMPVVTAEPEPKIPDVAPIVSDQKPSVSKSGRKIKGRGT IRYHTPPRSRSCSESDDDDSSETPPHWKEEMQRLRAYRPPSGEKWSKGDKLSDPCSSR WDERSLSQRSRSWSYNGYYSDLSTARHSGHHKKRRKEKKVKHKKKGKKQKHCRRHKQT KKRRILIPSDIESSKSSTRRMKSSCDRERSSRSSSLSSHHSSKRDWSKSDKDVQSSLT HSSRDSYRSKSHSQSYSRGSSRSRTASKSSSHSRSRSKSRSSSKSGHRKRASKSPRKT ASQLSENKPVKTEPLRATMAQNENVVVQPVVAENIPVIPLSDSPPPSRWKPGEKPWKP SYERIQEMKAKTTHLLPIQSTYSLANIKETGSSSSYHKREKNSESDQSTYSKYSDRSS ESSPRSRSRSSRSRSYSRSYTRSRSLASSHSRSRSPSSRSHSRNKYSDHSQCSRSSSY TSISSDDGRRAKRRLRSSGKKNSVSHKKHSSSSEKTLHSKYVKGRDRSSCVRKYSESR SSLDYSSDSEQSSVQATQSAQEKEKQGQMERTHNKQEKNRGEEKSKSERECPHSKKRT LKENLSDHLRNGSKPKRKNYAGSKWDSESNSERDVTKNSKNDSHPSSDKEEGEATSDS ESEVSEIHIKVKPTTKSSTNTSLPDDNGAWKSSKQRTSTSDSEGSCSNSENNRGKPQK HKHGSKENLKREHTKKVKEKLKGKKDKKHKAPKRKQAFHWQPPLEFGEEEEEEIDDKQ VTQESKEKKVSENNETIKDNILKTEKSSEEDLSGKHDTVTVSSDLDQFTKDDSKLSIS PTALNTEENVACLQNIQHVEESVPNGVEDVLQTDDNMEICTPDRSSPAKVEETSPLGN ARLDTPDINIVLKQDMATEHPQAEVVKQESSMSESKVLGEVGKQDSSSASLASAGEST GKKEVAEKSQINLIDKKWKPLQGVGNLAAPNAATSSAVEVKVLTTVPEMKPQGLRIEI KSKNKVRPGSLFDEVRKTARLNRRPRNQESSSDEQTPSRDDDSQSRSPSRSRSKSETK SRHRTRSVSYSHSRSRSRSSTSSYR" BASE COUNT 1504 a 851 c 980 g 937 t ORIGIN 1 tggtactctc gcggtgtctg tcccgactct cgcggcgaga gtgcagtcta gatgggggag 61 aatagcgtgg cattgggagg cccggcctgg gggaggagac ggcgttccgt tagcggcgtt 121 ggggtttggc tgcagtggca gtgctttctc ttctgctcac ggggacccgc tcaggctgga 181 ggccagccag ctcttgccgc cacctcggtc gcgatggggg cgcaggaccg gccgcagtgc 241 cacttcgaca tcgagatcaa ccgggagccg gttggtcgca ttatgtttca gctcttctca 301 gacatatgtc caaaaacatg caaaaacttc ctttgcttgt gctcaggaga gaaaggcctt 361 gggaaaacaa ctgggaagaa gttatgttat aaaggttcta cgttccatcg tgtggttaaa 421 aactttatga ttcagggtgg ggacttcagt gaaggtaatg gaaaaggtgg agaatcaatt 481 tatggtggat attttaaaga tgaaaacttt attctcaaac atgacagagc gttcctttta 541 tcaatggcaa atcgagggaa acataccaat ggttcccagt ttttcattac cacaaagcct 601 gctccacacc tggatggggt gcatgtagtc tttggactgg ttatttctgg ttttgaagta 661 atcgaacaaa ttgaaaatct gaagaccgat gctgcaagca gaccatatgc agatgtgcga 721 gttattgact gtggagtact tgccacaaaa tcaataaaag atgtttttga gaaaaaaagg 781 aagaaaccaa ctcattcaga aggctcggat tcctcttcca attcctcctc ttcttcagaa 841 tcatcttcag aaagtgaact tgaacatgag agaagcagaa ggaggaaaca taagaggagg 901 ccaaaagtta aacgttctaa aaagaggcga aaggaagcaa gcagttcaga agagccaagg 961 aataaacatg caatgaaccc aaaaggtcac tctgagagga gtgataccaa tgaaaaaagg 1021 tcagttgatt ccagtgctaa aagggaaaaa cctgtggtcc gcccagaaga gattcctcca 1081 gtgcctgaga accgattttt actgagaaga gatatgcctg ttgttactgc agaacctgaa 1141 ccgaagattc ctgatgttgc acccattgta agtgatcaga aaccatctgt atcaaagtct 1201 ggacggaaga ttaaaggaag gggcacaatt cgctatcaca cacctccaag atcaagatcc 1261 tgttctgagt cagatgatga tgacagcagt gaaactcctc ctcactggaa agaggaaatg 1321 cagagattaa gagcatatag accacctagt ggagaaaaat ggagtaaagg agataagtta 1381 agtgacccct gttcaagccg atgggatgaa agaagcttgt ctcagagatc cagatcatgg 1441 tcctataatg gatattattc agaccttagt acagcaagac actctggcca ccataaaaaa 1501 cgcagaaaag aaaaaaaggt taagcataaa aagaaaggga aaaagcagaa acactgcaga 1561 agacacaaac aaacaaagaa gagaaggatt cttataccgt ctgacataga atcctcaaaa 1621 tcttccactc gaagaatgaa atcctcttgt gatagagaaa ggagttctcg ttcttcctca 1681 ttgtcatctc atcactcatc aaagagagac tggtctaaat ctgataagga tgtccagagc 1741 tctttaaccc attccagcag agactcatac agatcaaaat ctcactcaca gtcttattct 1801 agaggaagct caagatcaag gactgcgtca aagtcctcat cacattctcg aagtcgatca 1861 aagtccagat ctagttccaa gtctgggcac cgaaagagag catcaaaatc accaagaaaa 1921 acagcttctc agttaagtga aaataaacca gttaaaacag aacctttaag agcaaccatg 1981 gcacaaaatg aaaatgtagt agtacaacca gttgtagcag aaaatattcc tgtaatacca 2041 ctgagtgaca gtcccccccc ttcaagatgg aagcctggag agaaaccttg gaagccctct 2101 tatgagcgaa ttcaggaaat gaaagctaaa acaacccatt tgctacccat ccaaagcact 2161 tacagtttag caaatattaa agagactggt agctcatcat cctaccataa aagagaaaaa 2221 aattcggaaa gtgatcagag cacttattca aaatacagtg atagaagttc agaaagctca 2281 ccaaggtcaa ggagcagatc ttctaggagt agatcttatt ccagatcata tacaagatca 2341 cgtagtctag ctagttcaca ttcaaggtct aggtctccat catctagatc tcattcacga 2401 aataaataca gtgatcattc acagtgtagt agatcatctt catatacttc tattagcagt 2461 gatgatggaa ggcgagctaa gaggagactt agatccagtg ggaaaaaaaa tagcgtttca 2521 cataaaaagc atagcagcag ctctgaaaag acacttcaca gtaaatatgt caaaggtaga 2581 gacaggtctt catgtgtgag aaagtatagc gagagcagat catctttaga ttattcttca 2641 gacagtgagc agtcaagtgt tcaggccaca cagtcagccc aggaaaaaga gaagcagggc 2701 caaatggaaa gaacacataa taaacaagaa aaaaacagag gtgaagaaaa atccaagtct 2761 gaacgggaat gccctcattc aaaaaaaaga actttgaaag agaatctttc tgatcacctt 2821 agaaatggca gtaagcccaa aaggaagaat tatgctggta gtaaatggga ctctgagtca 2881 aattcagaac gagatgtcac taaaaacagt aaaaatgact cccatccatc ctctgacaag 2941 gaagaaggtg aggccacatc cgattctgaa tcagaggtta gtgaaattca catcaaagtc 3001 aaacccacaa ccaagtcgtc cacaaatact tcactgcctg atgataatgg tgcttggaaa 3061 tcaagcaaac agcgcacatc aacttctgac tctgaggggt cctgttccaa ttcggaaaac 3121 aataggggaa agcctcaaaa gcacaaacat gggtcaaagg aaaatcttaa aagagaacac 3181 accaaaaaag tgaaagagaa attgaaaggg aaaaaagaca aaaagcataa ggctccaaaa 3241 cgaaagcaag catttcactg gcagcctcca ctagaatttg gtgaagagga ggaggaggag 3301 attgatgaca agcaagttac tcaggaatca aaagagaaaa aagtttctga aaacaatgaa 3361 accataaaag ataatattct aaaaactgag aaatccagtg aagaggacct ttcaggtaaa 3421 catgatacag tgactgtttc atcagatctt gatcagttta ctaaagatga tagtaaactc 3481 agtatttctc ccacagcttt aaatactgag gaaaatgtgg cctgtttaca aaacattcag 3541 cacgttgaag aaagtgttcc caatggagtg gaagatgtgc ttcaaacaga tgacaacatg 3601 gagatctgca ctcctgatag gagttcccca gcaaaagtag aggagacttc ccctctagga 3661 aatgcacggc ttgatacccc agatataaac attgttttga agcaggatat ggcaacggaa 3721 catcctcaag cagaggtagt aaaacaggaa agcagcatgt ccgaaagtaa agtgttgggt 3781 gaagtgggga aacaggacag cagctctgct agcttggcta gtgctggaga aagtaccggg 3841 aagaaggagg tggctgagaa gagccagatc aacctcattg ataagaaatg gaagcccctg 3901 caaggtgtgg ggaacctggc agcacctaat gctgccacat ccagtgctgt ggaagttaag 3961 gtgttgacca ctgtgcctga aatgaaacca caaggcttga gaatagaaat taaaagcaaa 4021 aataaagttc ggcctgggtc tctctttgat gaagtaagaa agacagcacg cttaaaccgt 4081 agaccaagaa atcaggagag ttcaagtgat gagcagacgc ctagtcggga tgatgatagc 4141 cagtccagga gtccaagtag atctcgaagt aaatctgaaa ccaaatcaag acacagaaca 4201 aggtctgtct cctatagtca ctcaagaagt cgatcgagaa gttccacatc atcttatcgg 4261 tgagcaatat tc // LOCUS HUMCYDE 892 bp mRNA PRI 15-DEC-1994 DEFINITION Homo sapiens cytidine deaminase (CDA) mRNA, complete cds. ACCESSION L27943 NID g602451 KEYWORDS cytidine deaminase. SOURCE Homo sapiens female liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 892) AUTHORS Laliberte,J. and Momparler,R.L. TITLE Human cytidine deaminase: Purification of enzyme, cloning, and expression of its complementary DNA JOURNAL Cancer Res. 54, 5401-5407 (1994) MEDLINE 95007561 FEATURES Location/Qualifiers source 1..892 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="female" /tissue_type="liver" /map="1p36.2-p35" 5'UTR 1..117 mRNA 1..892 mat_peptide 118..555 /gene="CDA" /note="G00-137-169" /product="cytidine deaminase" gene 118..558 /gene="CDA" CDS 118..558 /gene="CDA" /EC_number="3.5.4.5" /codon_start=1 /db_xref="GDB:G00-137-169" /product="cytidine deaminase" /db_xref="PID:g598149" /translation="MAQKRPACTLKPECVQQLLVCSQEAKKSAYCPYSHFPVGAALLT QEGRIFKGCNIENACYPLGICAERTAIQKAVSEGYKDFRAIAIASDMQDDFISPCGAC RQVMREFGTNWPVYMTKPDGTYIVMTVQELLPSSFGPEDLQKTQ" 3'UTR 559..892 polyA_site 892 BASE COUNT 190 a 278 c 231 g 193 t ORIGIN 1 gcgcgccagt ttcaggatgc agggtctagg agaggagccg caatcgtgtc tggggcccca 61 gccaggctgg ccggagctcc tgtttccgct gctctgctgc ctgcccgggg taccaacatg 121 gcccagaagc gtcctgcctg caccctgaag cctgagtgtg tccagcagct gctggtttgc 181 tcccaggagg ccaagaagtc agcctactgc ccctacagtc actttcctgt gggggctgcc 241 ctgctcaccc aggaggggag aatcttcaaa gggtgcaaca tagaaaatgc ctgctacccg 301 ctgggcatct gtgctgaacg gaccgctatc cagaaggccg tctcagaagg gtacaaggat 361 ttcagggcaa ttgctatcgc cagtgacatg caagatgatt ttatctctcc atgtggggcc 421 tgcaggcaag tcatgagaga gtttggcacc aactggcccg tgtacatgac caagccggat 481 ggtacgtata ttgtcatgac ggtccaggag ctgctgccct cctcctttgg gcctgaggac 541 ctgcagaaga ctcagtgaca gccagagaat gcccactgcc tgtaacagcc acctggagaa 601 cttcataaag atgtctcaca gccctgggga cacctgccca gtggccccag cctacaggga 661 ctgggcaaag atgatgtttc cagattacac tccagcctga gtcagcaccc ctcctagcaa 721 cctgccttgg gacttagaac accgccgccc ccctgcccca cctttccttt ccttcctgtg 781 ggccctcttt caaagtccag cctagtctgg actgcttccc catcagcctt cccaaggttc 841 tatcctgttc cgagcaactt ttctaattat aaacatcaca gaacatcctg ga // LOCUS HUMCYES1 4517 bp mRNA PRI 31-DEC-1987 DEFINITION Human c-yes-1 mRNA. ACCESSION M15990 NID g181267 KEYWORDS protein kinase; tyrosine kinase. SOURCE Human cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4517) AUTHORS Sukegawa,J., Semba,K., Yamanashi,Y., Nishizawa,M., Miyajima,N., Yamamoto,T. and Toyoshima,K. TITLE Characterization of cDNA clones for the human c-yes gene JOURNAL Mol. Cell. Biol. 7, 41-47 (1987) MEDLINE 87172733 FEATURES Location/Qualifiers source 1..4517 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 208..1839 /note="cellular yes-1 protein" /codon_start=1 /db_xref="PID:g181268" /translation="MGCIKSKENKSPAIKYRPENTPEPVSTSVSHYGAEPTTVSPCPS SSAKGTAVNFSSLSMTPFGGSSGVTPFGGASSSFSVVPSSYPAGLTGGVTIFVALYDY EARTTEDLSFKKGERFQIINNTEGDWWEARSIATGKNGYIPSNYVAPADSIQAEEWYF GKMGRKDAERLLLNPGNQRGIFLVRESETTKGAYSLSIRDWDEIRGDNVKHYKIRKLD NGGYYITTRAQFDTLQKLVKHYTEHADGLCHKLTTVCPTVKPQTQGLAKDAWEIPRES LRLEVKLGQGCFGEVWMGTWNGTTKVAIKTLKPGTMMPEAFLQEAQIMKKLRHDKLVP LYAVVSEEPIYIVTEFMSKGSLLDFLKEGDGKYLKLPQLVDMAAQIADGMAYIERMNY IHRDLRAANILVGENLVCKIADFGLARLIEDNEYTARQGAKFPIKWTAPEAALYGRFT IKSDVWSFGILQTELVTKGRVPYPGMVNREVLEQVERGYRMPCPQGCPESLHELMNLC WKKDPDERPTFEYIQSFLEDYFTATEPQYQPGENL" BASE COUNT 1437 a 784 c 955 g 1341 t ORIGIN 1 gcggagccaa ggcacacggg tctgaccctt gggccggccc ggagcaagtg acacggaccg 61 gtcgcctatc ctgaccacag caaagcggcc cggagcccgc ggaggggacc tgacgggggc 121 gtaggcgccg gaaggctggg ggccccggag ccgggccggc gtggcccgag ttccggtgag 181 cggacggcgg cgcgcgcaga tttgataatg ggctgcatta aaagtaaaga aaacaaaagt 241 ccagccatta aatacagacc tgaaaatact ccagagcctg tcagtacaag tgtgagccat 301 tatggagcag aacccactac agtgtcacca tgtccgtcat cttcagcaaa gggaacagca 361 gttaatttca gcagtctttc catgacacca tttggaggat cctcaggggt aacgcctttt 421 ggaggtgcat cttcctcatt ttcagtggtg ccaagttcat atcctgctgg tttaacaggt 481 ggtgttacta tatttgtggc cttatatgat tatgaagcta gaactacaga agacctttca 541 tttaagaagg gtgaaagatt tcaaataatt aacaatacgg aaggagattg gtgggaagca 601 agatcaatcg ctacaggaaa gaatggttat atcccgagca attatgtagc gcctgcagat 661 tccattcagg cagaagaatg gtattttggc aaaatgggga gaaaagatgc tgaaagatta 721 cttttgaatc ctggaaatca acgaggtatt ttcttagtaa gagagagtga aacaactaaa 781 ggtgcttatt ccctttctat tcgtgattgg gatgagataa ggggtgacaa tgtgaaacac 841 tacaaaatta ggaaacttga caatggtgga tactatatca caaccagagc acaatttgat 901 actctgcaga aattggtgaa acactacaca gaacatgctg atggtttatg ccacaagttg 961 acaactgtgt gtccaactgt gaaacctcag actcaaggtc tagcaaaaga tgcttgggaa 1021 atccctcgag aatctttgcg actagaggtt aaactaggac aaggatgttt cggcgaagtg 1081 tggatgggaa catggaatgg aaccacgaaa gtagcaatca aaacactaaa accaggtaca 1141 atgatgccag aagctttcct tcaagaagct cagataatga aaaaattaag acatgataaa 1201 cttgttccac tatatgctgt tgtttctgaa gaaccaattt acattgtcac tgaatttatg 1261 tcaaaaggaa gcttattaga tttccttaag gaaggagatg gaaagtattt gaagcttcca 1321 cagctggttg atatggctgc tcagattgct gatggtatgg catatattga aagaatgaac 1381 tatattcacc gagatcttcg ggctgctaat attcttgtag gagaaaatct tgtgtgcaaa 1441 atagcagact ttggtttagc aaggttaatt gaagacaatg aatacacagc aagacaaggt 1501 gcaaaatttc caatcaaatg gacagctcct gaagctgcac tgtatggtcg gtttacaata 1561 aagtctgatg tctggtcatt tggaattctg caaacagaac tagtaacaaa gggccgagtg 1621 ccatatccag gtatggtgaa ccgtgaagta ctagaacaag tggagcgagg atacaggatg 1681 ccgtgccctc agggctgtcc agaatccctc catgaattga tgaatctgtg ttggaagaag 1741 gaccctgatg aaagaccaac atttgaatat attcagtcct tcttggaaga ctacttcact 1801 gctacagagc cacagtacca gccaggagaa aatttataat tcaagtagcc tattttatat 1861 gcacaaatct gccaaaatat aaagaacttg tgtagatttt ctacaggaat caaaagaaga 1921 aaatcttctt tactctgcat gtttttaatg gtaaactgga atcccagata tggttgcaca 1981 aaaccacttt tttttcccca agtattaaac tctaatgtac caatgatgaa tttatcagcg 2041 tatttcaggg tccaaacaaa atagagctaa gatactgatg acagtgtggg tgacagcatg 2101 gtaatgaagg acagtgaggc tcctgcttat ttataaatca tttcctttct ttttttcccc 2161 aaagtcagaa ttgctcaaag aaaattattt attgttacag ataaaacttg agagataaaa 2221 agctatacca taataaaatc taaaattaag gaatatcatg ggaccaaata attccattcc 2281 agttttttaa agtttcttgc atttattatt ctcaaaagtt ttttctaagt taaacagtca 2341 gtatgcaatc ttaatatatg ctttcttttg catggacatg ggccaggttt ttcaaaagga 2401 atataaacag gatctcaaac ttgattaaat gttagaccac agaagtggaa tttgaaagta 2461 taatgcagta cattaatatt catgttcatg gaactgaaag aataagaact ttttcacttc 2521 agtccttttc tgaagagttt gacttagaat aatgaaggta actagaaagt gagttaatct 2581 tgtatgaggt tgcattgatt ttttaaggca atatataatt gaaactactg tccaatcaaa 2641 ggggaaatgt tttgatcttt agatagcatg caaagtaaga cccagcattt taaaagccct 2701 tttttaaaaa ctagacttcg tactgtgagt attgcttata tgtccttatg gggatgggtg 2761 ccacaaatag aaaatatgac cagatcaggg acttgaatgc acttttgctc atggtgaata 2821 tagatgaaca gagaggaaaa tgtatttaaa agaaatacga gaaaagaaaa tgtgaaagtt 2881 ttacaagtta gagggatgga aggtaatgtt taatgttgat gtcatggagt gacagaatgg 2941 ctttgctggc actcagagct cctcacttag ctatattctg agactttgaa gagttataaa 3001 gtataactat aaaactaatt tttcttacac actaaatggg tatttgttca aaataatgaa 3061 gttatggctt cacattcatt gcagtgggat atggttttta tgtaaaacat ttttagaact 3121 ccagttttca aatcatgttt gaatctacat tcactttttt ttgttttctt ttttgagacg 3181 gagtctcgct ctgccgccca ggctggagtg cagtggcgcg atctcggctc actgcaagct 3241 ctgcctccca ggttcacacc attctcctgc ctcagcctcc cgagtagctg ggactacagg 3301 tgcccaccac cacgcctggc tagttttttg tatttttagt agagacgcag tttcaccgtg 3361 ttagccagga tggtctcgat ctcctgacct tgtgatctgc ccgcctcggc ctcccaaagt 3421 gctgggatta caggtgtgag ccaccgcgcc cagcctacat tcacttctaa agtctatgta 3481 atggtggtca ttttttccct tttagaatac attaaatggt tgatttgggg aggaaaactt 3541 attctgaata ttaacggtgg tgaaaagggg acagttttta ccctaaagtg caaaagtgaa 3601 acatacaaaa taagactaat ttttaagagt aactcagtaa tttcaaaata cagatttgaa 3661 tagcagcatt agtggtttga gtgtctagca aaggaaaaat tgatgaataa aatgaaggtc 3721 tggtgtatat gttttaaaat actctcatat agtcacactt taaattaagc cttatattag 3781 gcccctctat tttcaggata taattcttaa ctatcattat ttacctgatt ttaatcatca 3841 gattcgaaat tctgtgccat ggcgtatatg ttcaaattca aaccattttt aaaatgtgaa 3901 gatggacttc atgcaagttg gcagtggttc tggtactaaa aattgtggtt gttttttctg 3961 tttacgtaac ctgcttagta ttgacactct ctaccaagag ggtcttccta agaagagtgc 4021 tgtcattatt tcctcttatc aacaacttgt gacatgagat tttttaaggg ctttatgtga 4081 actatgatat tgtaattttt ctaagcatat tcaaaagggt gacaaaatta cgtttatgta 4141 ctaaatctaa tcaggaaagt aaggcaggaa aagttgatgg tattcattag gttttaactg 4201 aatggagcag ttccttatat aataacaatt gtatagtagg gataaaacac taacaatgtg 4261 tattcatttt aaattgttct gtatttttaa attgccaaga aaaacaactt tgtaaatttg 4321 gagatatttt ccaacagctt ttcgtcttca gtgtcttaat gtggaagtta acccttacca 4381 aaaaaggaag ttggcaaaaa cagccttcta gcacactttt ttaaatgaat aatggtagcc 4441 taaacttaat atttttataa agtattgtaa tattgttttg tggataattg aaataaaaag 4501 ttctcattga atgcacc // LOCUS HUMCYI 1260 bp mRNA PRI 10-FEB-1997 DEFINITION Human mRNA for cyclin I, complete cds. ACCESSION D50310 NID g1183161 KEYWORDS cyclin I. SOURCE Homo sapiens Brain cDNA to mRNA, clone:FC6. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Nakamura,T., Sanokawa,R., Sasaki,Y.F., Ayusawa,D., Oishi,M. and Mori,N. TITLE Cyclin I: a new cyclin encoded by a gene isolated from human brain JOURNAL Exp. Cell Res. 221 (2), 534-542 (1995) MEDLINE 96086776 REFERENCE 2 (bases 1 to 1260) AUTHORS Nakamura,T. JOURNAL Unpublished (1995) REFERENCE 3 (bases 1 to 1260) AUTHORS Nakamura,T. TITLE Direct Submission JOURNAL Submitted (17-APR-1995) to the DDBJ/EMBL/GenBank databases. Takeshi Nakamura, Sumitomo Electric Industries, Biomedical R&D Department; 1, Taya-cho, Sakae-ku, Yokohama, Kanagawa 244, Japan (E-mail:tnakamr@opele.sumiden.co.jp, Tel:045-853-7275, Fax:045-853-3528) FEATURES Location/Qualifiers source 1..1260 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="FC6" /tissue_type="Brain" CDS 1..1134 /codon_start=1 /product="cyclin I" /db_xref="PID:d1009482" /db_xref="PID:g1183162" /translation="MKFPGPLENQRLSFLLEKAITREAQMWKVNVRKMPSNQNVSPSQ RDEVIQWLAKLKYQFNLYPETFALASSLLDRFLATVKAHPKYLSCIAISCFFLAAKTV EEDERIPVLKVLARDSFCGCSSSEILRMERIILDKLNWDLHTATPLDFLHIFHAIAVS TRPQLLFSLPKLSPSQHLAVLTKQLLHCMACNQLLQFRGSMLALAMVSLEMEKLIPDW LSLTIELLQKAQMDSSQLIHCRELVAHHLSTLQSSLPLNSVYVYRPLKHTLVTCDKGV FRLHPSSVPGPDFSKDNSKPEVPVRGTAAFYHHLPAASGCKQTSTKRKVEEMEVDDFY DGIKRLYNEDNVSENVGSVCGTDLSRQEGHASPCPPLQPVSVM" BASE COUNT 352 a 283 c 274 g 351 t ORIGIN 1 atgaagtttc cagggccttt ggaaaaccag agattgtctt tcctgttgga aaaggcaatc 61 actagggaag cacagatgtg gaaagtgaat gtgcggaaaa tgccttcaaa tcagaatgtt 121 tctccatccc agagagatga agtaattcaa tggctggcca aactcaagta ccaattcaac 181 ctttacccag aaacatttgc tctggctagc agtcttttgg ataggttttt agctaccgta 241 aaggctcatc caaaatactt gagttgtatt gcaatcagct gttttttcct agctgccaag 301 actgttgagg aagatgagag aattccagta ctaaaggtat tggcaagaga cagtttctgt 361 ggatgttcct catctgaaat tttgagaatg gagagaatta ttctggataa gttgaattgg 421 gatcttcaca cagccacacc attggatttt cttcatattt tccatgccat tgcagtgtca 481 actaggcctc agttactttt cagtttgccc aaattgagcc catctcaaca tttggcagtc 541 cttaccaagc aactacttca ctgtatggcc tgcaaccaac ttctgcaatt cagaggatcc 601 atgcttgctc tggccatggt tagtctggaa atggagaaac tcattcctga ttggctttct 661 cttacaattg aactgcttca gaaagcacag atggatagct cccagttgat ccattgtcgg 721 gagcttgtgg cacatcacct ttctactctg cagtcttccc tgcctctgaa ttccgtttat 781 gtctaccgtc ccctcaagca caccctggtg acctgtgaca aaggagtgtt cagattacat 841 ccctcctctg tcccaggccc agacttctcc aaggacaaca gcaagccaga agtgccagtc 901 agaggtacag cagcctttta ccatcatctc ccagctgcca gtgggtgcaa gcagacctct 961 actaaacgca aagtagagga aatggaagtg gatgacttct atgatggaat caaacggctc 1021 tataatgaag ataatgtctc agaaaatgtg ggttctgtgt gtggcactga tttatcaaga 1081 caagagggac atgcttcccc ttgtccacct ttgcagcctg tttctgtcat gtagtttcaa 1141 caagtgctac ctttgagtgt aaactaaggt agactacttt gggaatgaga acatccaaaa 1201 tcaggaaagg ctgtagaagg aaatatacct taacaggctg atttggagtg acccagaaaa // LOCUS HUMCYP 865 bp mRNA PRI 31-DEC-1994 DEFINITION H.sapiens cyclophilin isoform (hCyP3) mRNA, complete cds. ACCESSION M80254 NID g181273 KEYWORDS cyclophilin; cytosolic protein; peptidyl-prolyl cis-trans isomerase. SOURCE Homo sapiens liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 865) AUTHORS Bergsma,D.J., Eder,C., Gross,M., Kersten,H., Sylvester,D., Appelbaum,E., Cusimano,D., Livi,G.P., McLauglin,M.M., Kasyan,K., Porter,T.G., Silverman,C., Dunnington,D., Hand,A., Prichett,W.P., Bossard,M.J., Brandt,M. and Levy,M.A. TITLE The cyclophilin multigene family of peptidyl-prolyl isomerases. Characterization of three separate human isoforms JOURNAL J. Biol. Chem. 266 (34), 23204-23214 (1991) MEDLINE 92078192 FEATURES Location/Qualifiers source 1..865 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" gene 84..707 /gene="CyP3" CDS 84..707 /gene="CyP3" /codon_start=1 /product="cyclophilin 3 protein" /db_xref="PID:g181274" /translation="MLALRCGSRWLGLLSVPRSVPLRLPAARACSKGSGDPSSSSSSG NPLVYLDVDANGKPLGRVVLELKADVVPKTAENFRALCTGEKGFGYKGSTFHRVIPSF MCQAGDFTNHNGTGGKSIYGSRFPDENFTLKHVGPGVLSMANAGPNTNGSQFFICTIK TDWLDGKHVVFGHVKEGMDVVKKIESFGSKSGRTSKKIVITDCGQLS" BASE COUNT 162 a 268 c 260 g 175 t ORIGIN 1 gaattccgga gttccgggcg cgcgcgacgt cagtttgagt tctgtgttct ccccgcccgt 61 gtcccgcccg acccgcgccc gcgatgctgg cgctgcgctg cggctcccgc tggctcggcc 121 tgctctccgt cccgcgctcc gtgccgctgc gcctccccgc ggcccgcgcc tgcagcaagg 181 gctccggcga cccgtcctct tcctcctcct ccgggaaccc gctcgtgtac ctggacgtgg 241 acgccaacgg gaagccgctc ggccgcgtgg tgctggagct gaaggcagat gtcgtcccaa 301 agacagctga gaacttcaga gccctgtgca ctggtgagaa gggcttcggc tacaaaggct 361 ccaccttcca cagggtgatc ccttccttca tgtgccaggc gggcgacttc accaaccaca 421 atggcacagg cgggaagtcc atctacggaa gccgctttcc tgacgagaac tttacactga 481 agcacgtggg gccaggtgtc ctgtccatgg ctaatgctgg tcctaacacc aacggctccc 541 agttcttcat ctgcaccata aagacagact ggttggatgg caagcatgtt gtgttcggtc 601 acgtcaaaga gggcatggac gtcgtgaaga aaatagaatc tttcggctct aagagtggga 661 ggacatccaa gaagattgtc atcacagact gtggccagtt gagctaatct gtggccaggg 721 tgctggcatg gtggcagctg caaatgtcca tgcacccagg tggccgcgtt gggctgtcag 781 ccaaggtgcc tgaaacgata cgtgtgccca ctccactgtc acagtgtgcc tgaggaaggc 841 tgctagggat gttagacgga attcc // LOCUS HUMCYP2BA 2907 bp mRNA PRI 02-NOV-1994 DEFINITION Human cytochrome P450-IIB (hIIB3) mRNA, complete cds. ACCESSION M29873 J02864 NID g181293 KEYWORDS cytochrome P450; cytochrome P450 IIB. SOURCE Human liver, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2907) AUTHORS Yamano,S., Nhamburo,P.T., Aoyama,T., Meyer,U.A., Inaba,T., Kalow,W., Gelboin,H.V., McBride,O.W. and Gonzalez,F.J. TITLE cDNA cloning and sequence and cDNA-directed expression of human P450 IIB1: identification of a normal and two variant cDNAs derived from the CYP2B locus on chromosome 19 and differential expression of the IIB mRNAs in human liver JOURNAL Biochemistry 28 (18), 7340-7348 (1989) MEDLINE 90057429 FEATURES Location/Qualifiers source 1..2907 /organism="Homo sapiens" /db_xref="taxon:9606" /map="19q13.2" mRNA <1..2907 /note="CYP450-IIB mRNA" gene 7..1140 /gene="CYP2B" CDS 7..1140 /gene="CYP2B" /note="cytochrome P450-IIB" /codon_start=1 /db_xref="GDB:G00-120-752" /db_xref="PID:g181294" /translation="MELSVLLFLALLTGLLLLLVQRHPNSHGTLPPGPRPLPLLGNLL QMDRRGLLKSFLRFREKYGDVFTVHLGPRPVVMLCGVEAIREALVDNAEAFSGRGKIV IMDPVYQGYGMLFANGNRWKVLRRFSVTTMRDFGMGKRSVEERIQDEAQCLIEELRKS KGALVDPTFLFHSITANIICSIIFGKRFHYQDQEFLKTLNLFCQSFLLISSISSQLFE LFSGFLKYFPGAHRQVYKNLQEINAYIGHSVEKHRETLDPSAPRDLIDTYLLHMEKEK SNPHSEFSHQNLIINTLSLFFAGTETTSTTLRYGFLLMLKYPHVAERVYKEIEQVVGP HRPPALDDRAKMPYTEAVIREIQRFADLLPMGVPHIVTQHTSF" repeat_region 1704..1985 /note="Alu repeat 1" BASE COUNT 677 a 825 c 632 g 773 t ORIGIN 1 ggaaccatgg agctcagcgt cctcctcttc cttgcactcc tcacaggcct cttgctactc 61 ctggttcagc gtcaccctaa ctcccatggc accctcccac cagggccccg ccctctgccc 121 cttttgggga accttctgca gatggacaga agaggcctac tcaaatcctt tctgaggttc 181 cgagagaaat atggggacgt cttcacggta cacctgggac cgaggcccgt ggtcatgctg 241 tgtggagtag aggccatacg ggaggccctg gtggacaacg ctgaggcctt ctctggccgg 301 ggaaaaatcg tcatcatgga cccagtctac cagggatatg gcatgctctt tgccaatgga 361 aaccgctgga aggtgcttcg gcgattctct gtgaccacca tgagggactt cgggatggga 421 aagcggagtg tggaggagcg gattcaggac gaggctcagt gtctgataga ggaacttcgg 481 aaatccaagg gagccctcgt ggaccccacc ttcctcttcc attccattac cgccaacatc 541 atctgctcca tcatctttgg aaaacgcttc cactaccaag atcaagagtt cctgaagacg 601 ctgaacttgt tctgccagag tttcttactc atcagctcta tatccagcca gctgtttgag 661 ctcttctctg gcttcttgaa atactttcct ggggcacaca ggcaagttta caaaaaccta 721 caggaaatca atgcttacat tggccacagt gtggagaagc accgtgaaac cctggacccc 781 agcgccccca gggacctcat cgacacctac ctgctccaca tggaaaaaga gaaatccaac 841 ccacacagtg aattcagcca ccagaacctc atcatcaaca cgctctcgct cttctttgct 901 ggcactgaga ccaccagcac cactctccgc tacggcttcc tgctcatgct caaataccct 961 catgtcgcag agagagtcta caaggagatt gaacaggtgg ttggcccaca tcgccctcca 1021 gcgcttgatg accgagccaa aatgccatac acagaggcag tcatccgtga gattcagaga 1081 tttgctgacc ttctccccat gggtgtgccc cacattgtca cccaacacac cagcttctga 1141 gggtacacca tccccaagga cacggaagta tttctcatcc tgagcactgc tctccgtgac 1201 ccacactact ttgaaaaacc agacgccttc aatcctgacc actttctgga tgccaatggg 1261 gcactgaaaa agaatgaagc ttttatcccc ttctccttag ggaagcggat ttgtcttggt 1321 gaaggcattg cccgtgcgga attgttcctc ttcttcacca ccatcctcca gaacttctcc 1381 gtggccagcc ccgtggctcc tgaagacatc gatctgacac cccaggagtg tggtgtgggc 1441 aaaatacccc caacatacca gatctgcttc ctgccccgct gaaggggctg agggaagggg 1501 gtcaaaggat tccagggtca ttcagtgtcc ccacctctgt agataatggc tctgactccc 1561 tgcaacttcc tgcctctgag agacctgctg caagccagct tccttccctt ccatggcacc 1621 agttgtctga ggtcgcagtg caaatgagtg gaggagtgag attattgaaa attataatat 1681 acaaaattat atatatatat tttgagacag agtctcactc agttgcccag gctggagtgc 1741 agtggcgtga tctcggctca ctgcaacctc cacccccggg gttcaagaaa ttctcctgcc 1801 tcagcctccc tagtagctgg gattacaggt gtgtgctacc atgcctggct aatttttgta 1861 tttttagtag agatggggtt tcaccgtgtt ggccaggctg atctcaaact cctgaactca 1921 agtgattcac ccaccttagc ctcccaaagt gctgggatta caggtgtgag tcaccatgcc 1981 cggccatgta tatatataat tttaaaaatt aagatgaaat tcacataaaa taaaattagc 2041 cattttaaag tgtacaattt agtggtgtgt ggttcattca caaagctgta caaccaccac 2101 catctagttc caaacatttt ctttttttct gagacggagt ctcactctgt cacccaggtt 2161 cgagttcagt ggtcttgaac tcctgatgtc aggtgattct cctagttcca aatgttttca 2221 ttatctctcc cccaacaaaa cccataccta tcaagctgtc actccccata ccccattctc 2281 tttttcatct cagcccctgt caatctggtt tttgtcctta tggacttacc aattctgaat 2341 atttcctata aacagaatca cacaatattt gatttttttt ttaaaactaa gccttgctct 2401 gtctcccagg ctggagtgct gtggcgtgat tttggttcac tgcaacctcc gccttccaag 2461 ttcaagagat tctcctgcct cagcttccaa gtagctggga ttacaggcat gtggtaccac 2521 gcctggctaa ttttcttgta tttttagtag ggacatgttg gccaggctgg ttgtgagctc 2581 ctggcctcag gtgatccaca cgcctcagtg tcccagagtg ctgatattac aggcgtaata 2641 tgtgatcttt tgtgtctggt tcctttcacg ttgaacgcta tttttgaggt tcgtgcctgt 2701 tgtagaccac agtcacacac tgctgtagtc ttcccccatc ctcattccca gctgcctcct 2761 cctactgttt ccctctatca aaaagcctcc ttggcgcagg ttccctgagc tgtgggattc 2821 tgcactggtg ctttggattc cctgatatgt tccttcaaat ccactgagaa ttaaataaac 2881 atcgctaaag cctgacctcc ccacgtc // LOCUS HUMCYP7 2901 bp mRNA PRI 31-DEC-1994 DEFINITION Human cholesterol 7-alpha hydroxylase (CYP7) mRNA, complete cds. ACCESSION M93133 NID g181318 KEYWORDS cholesterol 7 alpha-hyroxylase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Noshiro,M. and Okuda,K. TITLE Molecular cloning and sequence analysis of cDNA encoding human cholesterol 7 alpha-hydroxylase JOURNAL FEBS Lett. 268 (1), 137-140 (1990) MEDLINE 90346120 REFERENCE 2 (bases 1 to 2901) AUTHORS Karam,W.G. and Chiang,J.Y. TITLE Polymorphisms of human cholesterol 7 alpha-hydroxylase JOURNAL Biochem. Biophys. Res. Commun. 185 (2), 588-595 (1992) MEDLINE 92304280 FEATURES Location/Qualifiers source 1..2901 /organism="Homo sapiens" /db_xref="taxon:9606" /map="8q11.12" gene 60..1574 /gene="CYP7" CDS 60..1574 /gene="CYP7" /EC_number="1.14.13.17" /codon_start=1 /product="cholesterol 7-alpha hydroxylase" /db_xref="PID:g181319" /translation="MMTTSLIWGIAIAACCCLWLILGIRRRQTGEPPLENGLIPYLGC ALQFGANPLEFLRANQRKHGHVFTCKLMGKYVHFITNPLSYHKVLCHGKYFDWKKFHF ATSAKAFGHRSIDPMDGNTTENINDTFIKTLQGHALNSLTESMMENLQRIMRPPVSSN SKTAAWVTEGMYSFCYRVMFEAGYLTIFGRDLTRRDTQKAHILNNLDNFKQFDKVFPA LVAGLPIHMFRTAHNAREKLAESLRHENLQKRESISELISLRMFLNDTLSTFDDLEKA KTHLVVLWASQANTIPATFWSLFQMIRNPEAMKAATEEVKRTLENAGQKVSLEGNPIC LSQAELNDLPVLDSIIKESLRLSSASLNIRTAKEDFTLHLEDGSYNIRKDDIIALYPQ LMHLDPEIYPDPLTFKYDRYLDENGKTKTTFYCNGLKLKYYYMPFGSGATICPGRLFA IHEIKQFLILMLSYFELELIEGQAKCPPLDQSRAGLGILPPLNDIEFKYKFKHL" allele 358 /gene="CYP7" /note="replace (358; `T')" BASE COUNT 911 a 537 c 567 g 886 t ORIGIN 1 ggcacgagct ttctaatcag agattttctt cctcagagat tttggcctag atttgcaaaa 61 tgatgaccac atctttgatt tgggggattg ctatagcagc atgctgttgt ctatggctta 121 ttcttggaat taggagaagg caaacgggtg aaccacctct agagaatgga ttaattccat 181 acctgggctg tgctctgcaa tttggtgcca atcctcttga gttcctcaga gcaaatcaaa 241 ggaaacatgg tcatgttttt acctgcaaac taatgggaaa atatgtccat ttcatcacaa 301 atcccttgtc ataccataag gtgttgtgcc acggaaaata ttttgattgg aaaaaatttc 361 actttgctac ttctgcgaag gcatttgggc acagaagcat tgacccgatg gatggaaata 421 ccactgaaaa cataaacgac actttcatca aaaccctgca gggccatgcc ttgaattccc 481 tcacggaaag catgatggaa aacctccaac gtatcatgag acctccagtc tcctctaact 541 caaagaccgc tgcctgggtg acagaaggga tgtattcttt ctgctaccga gtgatgtttg 601 aagctgggta tttaactatc tttggcagag atcttacaag gcgggacaca cagaaagcac 661 atattctaaa caatcttgac aacttcaagc aattcgacaa agtctttcca gccctggtag 721 caggcctccc cattcacatg ttcaggactg cgcacaatgc ccgggagaaa ctggcagaga 781 gcttgaggca cgagaacctc caaaagaggg aaagcatctc agaactgatc agcctgcgca 841 tgtttctcaa tgacactttg tccacctttg atgatctgga gaaggccaag acacacctcg 901 tggtcctctg ggcatcgcaa gcaaacacca ttccagcgac tttctggagt ttatttcaaa 961 tgattaggaa cccagaagca atgaaagcag ctactgaaga agtgaaaaga acattagaga 1021 atgctggtca aaaagtcagc ttggaaggca atcctatttg tttgagtcaa gcagaactga 1081 atgacctgcc agtattagat agtataatca aggaatcgct gaggctttcc agtgcctccc 1141 tcaacatccg gacagctaag gaggatttca ctttgcacct tgaggacggt tcctacaaca 1201 tccgaaaaga tgacatcata gctctttacc cacagttaat gcacttagat ccagaaatct 1261 acccagaccc tttgactttt aaatatgata ggtatcttga tgaaaacggg aagacaaaga 1321 ctaccttcta ttgtaatgga ctcaagttaa agtattacta catgcccttt ggatcgggag 1381 ctacaatatg tcctggaaga ttgttcgcta tccacgaaat caagcaattt ttgattctga 1441 tgctttctta ttttgaattg gagcttatag agggccaagc taaatgtcca cctttggacc 1501 agtcccgggc aggcttgggc attttgccgc cattgaatga tattgaattt aaatataaat 1561 tcaagcattt gtgaatacat ggctggaata agaggacact agatgatatt acaggactgc 1621 agaacaccct caccacacag tccctttgga caaatgcatt tagtggtggt agaaatgatt 1681 caccaggtcc aatgttgttc accagtgctt gcttgtgaat cttaacattt tggtgacagt 1741 ttccagatgc tatcacagac tctgctagtg aaaagaacta gtttctagga gcacaataat 1801 ttgttttcat ttgtataagt ccatgaatgt tcatatagcc agggattgaa gtttattatt 1861 ttcaaaggaa aacaccttta ttttattttt ttttcaaaat gaagatacac attacagcca 1921 ggtgtggtag caggcacctg tagtcttagc tactcgagag gccaaagaag gaggatggct 1981 tgagcccagg agttcaagac cagcctggac agcttagtga gatcccgtct ccgaagaaaa 2041 gatatgtatt ctaattggca gattgttttt tcctaaggaa actgctttat ttttataaaa 2101 ctgcctgaca attatgaaaa aatgttcaaa ttcacgttct agtgaaactg cattatttgt 2161 tgactagatg gtggggttct tcgggtgtga tcatatatca taaaggatat ttcaaatgat 2221 tatgattagt tatgtctttt aataaaaagg aaatattttt caacttcttc tatatccaaa 2281 attcagggct ttaaacatga ttatcttgat tcccaaaaac actaaaggtg gttttatttt 2341 cccttcatgt tttaacttat tgttgctgaa aactctatgt ccggctttaa ctatcttctc 2401 tatattttta tttcattcac attaatgaga agagttttct cagagattaa aaaaggtagt 2461 ttttctgtca ttgttaaata cacattatca ctgaaaaaat gtagctttta tgatgtatgt 2521 tttaaagtta aaactggatg gaaatagcca tttggaagct ttggttatga aacatgtgga 2581 gtgtattaag tgcagcttga cattatgttt tatttaaatg ctttttatcg ctaaatgact 2641 tgcagatgaa aaaaactaag gtgactcgag tgtttaaatg cctgtgtaca acaatgcttt 2701 gataaaatat tttaaggtat gagttatcag ctctatgtca attgtatttc tggtagtatt 2761 tatatttaaa ttatatttcc tttttgctta ttttacaaat attaagaaaa tattctaaca 2821 tttgataatt ttgaaatgat tcatctttca gaaataaaag tatgaatcta aaaaaaaaaa 2881 aaaaaaaaaa aaaaaaaaaa a // LOCUS HUMCYPBA 851 bp mRNA PRI 02-NOV-1994 DEFINITION Human cyclophilin B (hCyPB) mRNA, complete cds. ACCESSION M60857 NID g181334 KEYWORDS cyclophilin B; cyclosporin A-binding protein; peptidylprolyl isomerase. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 851) AUTHORS Price,E.R., Zydowsky,L.D., Jin,M.J., Baker,C.H., McKeon,F.D. and Walsh,C.T. TITLE Human cyclophilin B: a second cyclophilin gene encodes a peptidyl-prolyl isomerase with a signal sequence JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88 (5), 1903-1907 (1991) MEDLINE 91156714 FEATURES Location/Qualifiers source 1..851 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_lib="lambda-GT10" /map="Unassigned" sig_peptide 13..87 /gene="PPIB" /note="G00-127-610" CDS 13..639 /gene="PPIB" /codon_start=1 /db_xref="GDB:G00-127-610" /product="cyclophilin B" /db_xref="PID:g181335" /translation="MKVLLAAALIAGSVFFLLLPGPSAADEKKKGPKVTVKVYFDLRI GDEDVGRVIFGLFGKTVPKTVDNFVALATGEKGFGYKNSKFHRVIKDFMIQGGDFTRG DGTGGKSIYGERFPDENFKLKHYGPGWVSMANAGKDTNGSQFFITTVKTAWLDGKHVV FGKVLEGMEVVRKVESTKTDSRDKPLKDVIIADCGKIEVEKPFAIAKE" gene 13..639 /gene="PPIB" mat_peptide 88..636 /gene="PPIB" /note="G00-127-610" /product="cyclophilin B" BASE COUNT 222 a 210 c 240 g 179 t ORIGIN 1 cgggaacgca acatgaaggt gctccttgcc gccgccctca tcgcggggtc cgtcttcttc 61 ctgctgctgc cgggaccttc tgcggccgat gagaagaaga aggggcccaa agtcaccgtc 121 aaggtgtatt ttgacctacg aattggagat gaagatgtag gccgggtgat ctttggtctc 181 ttcggaaaga ctgttccaaa aacagtggat aattttgtgg ccttagctac aggagagaaa 241 ggatttggct acaaaaacag caaattccat cgtgtaatca aggacttcat gatccagggc 301 ggagacttca ccaggggaga tggcacagga ggaaagagca tctacggtga gcgcttcccc 361 gatgagaact tcaaactgaa gcactacggg cctggctggg tcagcatggc caacgcaggc 421 aaagacacca acggctccca gttcttcatc acgacagtca agacagcctg gctagatggc 481 aagcatgtgg tgtttggcaa agttctagag ggcatggagg tggtgcggaa ggtggagagc 541 accaagacag acagccggga taaacccctg aaggatgtga tcatcgcaga ctgcggcaag 601 atcgaggtgg agaagccctt tgccatcgcc aaggagtagg gcacagggac atctttcttt 661 gagtgaccgt ctgtgcaggc cctgtagtcc gccacagggc tctgagctgc actggccccg 721 gtgctggcat ctggtggagc ggacccactc ccctcacatt ccacaggccc atggactcac 781 ttttgtaaca aactcctacc aacactgacc aataaaaaaa aatgtgggtt tttttttttt 841 ttaataaaaa a // LOCUS HUMCYPIIF 1825 bp mRNA PRI 02-NOV-1994 DEFINITION Human cytochrome P450IIF1 protein (CYP2F) mRNA, complete cds. ACCESSION J02906 NID g181357 KEYWORDS cytochrome P450. SOURCE Human lung, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1825) AUTHORS Nhamburo,P.T., Kimura,S., McBride,O.W., Kozak,C.A., Gelboin,H.V. and Gonzalez,F.J. TITLE The human CYP2F gene subfamily: identification of a cDNA encoding a new cytochrome P450, cDNA-directed expression, and chromosome mapping JOURNAL Biochemistry 29 (23), 5491-5499 (1990) MEDLINE 90352299 COMMENT Draft entry and computer-readable sequence for [Biochemistry (1990) In press] kindly submitted by P.T.Nhamburo, 18-MAY-1990. FEATURES Location/Qualifiers source 1..1825 /organism="Homo sapiens" /db_xref="taxon:9606" /map="19q13.2" mRNA <1..1813 /note="CYP2F mRNA" gene 56..1531 /gene="CYP2F1" CDS 56..1531 /gene="CYP2F1" /note="cytochrome P450IIF1" /codon_start=1 /db_xref="GDB:G00-119-834" /db_xref="PID:g181358" /translation="MDSISTAILLLLLALVCLLLTLSSRDKGKLPPGPRPLSILGNLL LLCSQDMLTSLTKLSKEYGSMYTVHLGPRRVVVLSGYQAVKEALVDQGEEFSGRGDYP AFFNFTKGNGIAFSSGDRWKVLRQFSIQILRNFGMGKRSIEERILEEGSFLLADVRKT EGEPFDPTFVLSRSVSNIICSVLFGSRFDYDDERLLTIIRLINDNFQIMSSPWGELYD ILDPRFPSLLDWVPGPHQRIFQNFKCLRDLIAHSVHDHQASSPRDFIQCFLTKMAEEK EDPLSHFHMDTLLMTTHNLLFGGTKTVSTTLHHAFLALMKYPKVQARVQEEIDLVVGR ARLPALKDRAAMPYTDAVIHEVQRFADIIPMNLPHRVTRDTAFRGFLIPKGTDVITLL NTVHYDPSQFLTPQEFNPEHFLDANQSFKKSPAFMPFSAGRRLCLGELLARMELFLYL TAILQSFSLQPLGAPEDIDLTPLSSGLGNLPRPFQLCLRPR" BASE COUNT 356 a 596 c 470 g 403 t ORIGIN 1 gcaggctcag cgcatcccag ccagtgtctc ctgcagctca gcagctgcct tcaccatgga 61 cagcataagc acagccatct tactcctgct cctggctctc gtctgtctgc tcctgaccct 121 aagctcaaga gataagggaa agctgcctcc gggacccaga cccctctcaa tcctgggaaa 181 cctgctgctg ctttgctccc aagacatgct gacttctctc actaagctga gcaaggagta 241 tggctccatg tacacagtgc acctgggacc caggcgggtg gtggtcctca gcgggtacca 301 agctgtgaag gaggccctgg tggaccaggg agaggagttt agtggccgcg gtgactaccc 361 tgcctttttc aactttacca agggcaatgg catcgccttc tccagtgggg atcgatggaa 421 ggtcctgaga cagttctcta tccagattct acggaatttc gggatgggga agagaagcat 481 tgaggagcga atcctagagg agggcagctt cctgctggcg gacgtgcgga aaactgaagg 541 cgagcccttt gaccccacgt ttgtgctgag tcgctcagtg tccaacatta tctgttccgt 601 gctcttcggc agccgcttcg actatgatga tgagcgtctg ctcaccatta tccgccttat 661 caatgacaac ttccaaatca tgagcagccc ctggggcgag ttgtacgaca tcctagaccc 721 cagattcccg agcctcctgg actgggtgcc tgggccgcac caacgcatct tccagaactt 781 caagtgcctg agagacctca tcgcccacag cgtccacgac caccaggcct cgtctccccg 841 ggacttcatc cagtgcttcc tcaccaagat ggcagaggag aaggaggacc cactgagcca 901 cttccacatg gataccctgc tgatgaccac acataacctg ctctttggcg gcaccaagac 961 ggtgagcacc acgctgcacc acgccttcct ggcactcatg aagtacccaa aagttcaagc 1021 ccgcgtgcag gaggagatcg acctcgtggt gggacgcgcg cggctgccgg cgctgaagga 1081 ccgcgcggcc atgccttaca cagacgcggt gatccacgag gtgcagcgct ttgcagacat 1141 catccccatg aacttgccgc accgcgtcac tagggacacg gcctttcgcg gcttcctgat 1201 acccaagggc accgatgtca tcaccctcct taacaccgtc cactacgacc ccagccagtt 1261 cctgacgccc caggagttca accccgagca ttttttggat gccaatcagt ccttcaagaa 1321 gagtccagcc ttcatgccct tctcagctgg gcgccgtctg tgcctgggag agctgctggc 1381 gcgcatggag ctctttctgt acctcaccgc catcctgcag agcttttcgc tgcagccgct 1441 gggtgcgccc gaggacatcg acctgacccc actcagctca ggtcttggca atttgccgcg 1501 gcctttccag ctgtgcctgc gcccgcgcta acgccccggc ccttccagat tcgcctgtga 1561 gcgatgaggc ccacccatgt gggttgctac gtccccttct tggtccacag tctgccctca 1621 tccctctggc agtcacgctg tcttccctgc atgctgtgcc tgccgcgtgc ccttccccca 1681 tccctccaat ctgtgccccg tctgcagggc agaggcagat gtggcatgtc tttttgtacc 1741 cacagagctt gttctatggc acgccctttt ctaggctttt tgtatcattt cttagtacat 1801 tgtaatagat tcaaaccagt cttgg // LOCUS HUMCYPSCC 1821 bp mRNA PRI 02-NOV-1994 DEFINITION Human cholesterol side-chain cleavage enzyme P450scc mRNA, complete cds. ACCESSION M14565 NID g181375 KEYWORDS cholesterol side-chain cleavage enzyme; cytochrome; cytochrome P450; cytochrome P450-scc. SOURCE Human testis (library of K.Fong) and adrenal, cDNA to mRNA, clones lambda-haSCC-71 and lambda-htSCC-2, respectively. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1821) AUTHORS Chung,B.C., Matteson,K.J., Voutilainen,R., Mohandas,T.K. and Miller,W.L. TITLE Human cholesterol side-chain cleavage enzyme, P450scc: cDNA cloning, assignment of the gene to chromosome 15, and expression in the placenta JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83 (23), 8962-8966 (1986) MEDLINE 87067434 COMMENT Clean copy sequence for [1] kindly provided by W.L.Miller, 15-FEB-1987. The human P450scc gene is expressed in the placenta in early and midgestation and accumulates in response to cyclic AMP. FEATURES Location/Qualifiers source 1..1821 /organism="Homo sapiens" /db_xref="taxon:9606" /map="15" mRNA <1..1821 /note="P450scc mRNA" gene 45..1610 /gene="CYP11A" CDS 45..1610 /gene="CYP11A" /note="cholesterol side-chain cleavage enzyme P450scc (EC 1.14.15.67)" /codon_start=1 /db_xref="GDB:G00-119-828" /db_xref="PID:g181376" /translation="MLAKGLPPRSVLVKGYQTFLSAPREGLGRLRVPTGEGAGISTRS PRPFNEIPSPGDNGWLNLYHFWRETGTHKVHLHHVQNFQKYGPIYREKLGNVESVYVI DPEDVALLFKSEGPNPERFLIPPWVAYHQYYQRPIGVLLKKSAAWKKDRVALNQEVMA PEATKNFLPLLDAVSRDFVSVLHRRIKKAGSGNYSGDISDDLFRFAFESITNVIFGER QGMLEEVVNPEAQRFIDAIYQMFHTSVPMLNLPPDLFRLFRTKTWKDHVAAWDVIFSK ADIYTQNFYWELRQKGSVHHDYRGMLYRLLGDSKMSFEDIKANVTEMLAGGVDTTSMT LQWHLYEMARNLKVQDMLRAEVLAARHQAQGDMATMLQLVPLLKASIKETLRLHPISV TLQRYLVNDLVLRDYMIPAKTLVQVAIYALGREPTFFFDPENFDPTRWLSKDKNITYF RNLGFGWGVRQCLGRRIAELEMTIFLINMLENFRVEIQHLSDVGTTFNLILMPEKPIS FTFWPFNQEATQQ" BASE COUNT 417 a 527 c 493 g 384 t ORIGIN 1051 bp upstream of PstI site; chromosome 15. 1 gggcgctgaa gtggagcagg tacagtcaca gctgtgggga cagcatgctg gccaagggtc 61 ttcccccacg ctcagtcctg gtcaaaggct accagacctt tctgagtgcc cccagggagg 121 ggctggggcg tctcagggtg cccactggcg agggagctgg catctccacc cgcagtcctc 181 gccccttcaa tgagatcccc tctcctggtg acaatggctg gctaaacctg taccatttct 241 ggagggagac gggcacacac aaagtccacc ttcaccatgt ccagaatttc cagaagtatg 301 gcccgattta cagggagaag ctcggcaacg tggagtcggt ttatgtcatc gaccctgaag 361 atgtggccct tctctttaag tccgagggcc ccaacccaga acgattcctc atcccgccct 421 gggtcgccta tcaccagtat taccagagac ccataggagt cctgttgaag aagtcggcag 481 cctggaagaa agaccgggtg gccctgaacc aggaggtgat ggctccagag gccaccaaga 541 actttttgcc cctgttggat gcagtgtctc gggacttcgt cagtgtcctg cacaggcgca 601 tcaagaaggc gggctccgga aattactcgg gggacatcag tgatgacctg ttccgctttg 661 cctttgagtc catcactaac gtcatttttg gggagcgcca ggggatgctg gaggaagtag 721 tgaaccccga ggcccagcga ttcattgatg ccatctacca gatgttccac accagcgtcc 781 ccatgctcaa ccttccccca gacctgttcc gtctgttcag gaccaagacc tggaaggacc 841 atgtggctgc atgggacgtg attttcagta aagctgacat atacacccag aacttctact 901 gggaattgag acagaaagga agtgttcacc acgattaccg tggcatgctc tacagactcc 961 tgggagacag caagatgtcc ttcgaggaca tcaaggccaa cgtcacagag atgctggcag 1021 gaggggtgga cacgacgtcc atgaccctgc agtggcactt gtatgagatg gcacgcaacc 1081 tgaaggtgca ggatatgctg cgggcagagg tcttggctgc gcggcaccag gcccagggag 1141 acatggccac gatgctacag ctggtccccc tcctcaaagc cagcatcaag gagacactaa 1201 gacttcaccc catctccgtg accctgcaga gatatcttgt aaatgacttg gttcttcgag 1261 attacatgat tcctgccaag acactggtgc aagtggccat ctatgctctg ggccgagagc 1321 ccaccttctt cttcgacccg gaaaattttg acccaacccg atggctgagc aaagacaaga 1381 acatcaccta cttccggaac ttgggctttg gctggggtgt gcggcagtgt ctgggacggc 1441 ggatcgctga gctagagatg accatcttcc tcatcaatat gctggagaac ttcagagttg 1501 aaatccaaca cctcagcgat gtgggcacca cattcaacct cattctgatg cctgaaaagc 1561 ccatctcctt caccttctgg ccctttaacc aggaagcaac ccagcagtga tcagagagga 1621 tggcctgcag ccacatggga ggaaggccca ggggtggggc ccatggggtc tctgcatctt 1681 cagtcgtctg tcccaagtcc tgctcctttc tgcccagcct gctcagcagg ttgaatgggt 1741 tctcagtggt caccttcctc agctcagctg ggccactcct cttcacccac cccatggaga 1801 caataaacag ctgaaccatc g // LOCUS HUMCYTEPOX 2100 bp mRNA PRI 10-SEP-1993 DEFINITION Human cytosolic epoxide hydrolase mRNA, complete cds. ACCESSION L05779 NID g181394 KEYWORDS cytosolic epoxide hydrolase; cytosolic protein; epoxide hydrolase; hydrolase. SOURCE Homo sapiens male adult liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2100) AUTHORS Beetham,J.K. and Hammock,B.D. TITLE cDNA cloning and expression of a soluble epoxide hydrolase from human liver JOURNAL Arch. Biochem. Biophys. 305 (1), 197-201 (1993) MEDLINE 93343630 FEATURES Location/Qualifiers source 1..2100 /organism="Homo sapiens" /db_xref="taxon:9606" 5'UTR 1..41 CDS 42..1706 /codon_start=1 /product="cytosolic epoxide hydrolase" /db_xref="PID:g181395" /translation="MTLRGAVFDLDGVLALPAVFGVLGRTEEALALPRGLLNDAFQKG GPEGATTRLMKGEITLSQWIPLMEENCRKCSETAKVCLPKNFSIKEIFDKAISARKIN RPMLQAALMLRKKGFTTAILTNTWLDDRAERDGLAQLMCELKMHFDFLIESCQVGMVK PEPQIYKFLLDTLKASPSEVVFLDDIGANLKPARDLGMVTILVQDTDTALKELEKVTG IQLLNTPAPLPTSCNPSDMSHGYVTVKPRVRLHFVELGWPAVCLCHGFPESWYSWRYQ IPALAQAGYRVLAMDMKGYGESSAPPEIEEYCMEVLCKEMVTFLDKLGLSQAVFIGHD WGGMLVWYMALFYPERVRAVASLNTPFIPANPNMSPLESIKANPVFDYQLYFQEPGVA EAELEQNLSRTFKSLFRASDESVLSMHKVCEAGGLFVNSPEEPSLSRMVTEEEIQFYV QQFKKSGFRGPLNWYRNMERNWKWACKSLGRKILIPALMVTAEKDFVLVPQMSQHMED WIPHLKRGHIEDCGHWTQMDKPTEVNQILIKWLDSDARNPPVVSKM" 3'UTR 1707..2100 polyA_signal 2084..2089 BASE COUNT 522 a 523 c 567 g 488 t ORIGIN 1 ggcacgagct ctctctctct ctctctctct ctctcgccgc catgacgctg cgcggcgccg 61 tcttcgacct tgacggggtg ctggcgctgc cagcggtgtt cggcgtcctc ggccgcacgg 121 aggaggccct ggcgctgccc agaggacttc tgaatgatgc tttccagaaa gggggaccag 181 agggtgccac tacccggctt atgaaaggag agatcacact ttcccagtgg ataccactca 241 tggaagaaaa ctgcaggaag tgctccgaga ccgctaaagt ctgcctcccc aagaatttct 301 ccataaaaga aatctttgac aaggcgattt cagccagaaa gatcaaccgc cccatgctcc 361 aggcagctct catgctcagg aagaaaggat tcactactgc catcctcacc aacacctggc 421 tggacgaccg tgctgagaga gatggcctgg cccagctgat gtgtgagctg aagatgcact 481 ttgacttcct gatagagtcg tgtcaggtgg gaatggtcaa acctgaacct cagatctaca 541 agtttctgct ggacaccctg aaggccagcc ccagtgaggt cgtttttttg gatgacatcg 601 gggctaatct gaagccagcc cgtgacttgg gaatggtcac catcctggtc caggacactg 661 acacggccct gaaagaactg gagaaagtga ccggaatcca gcttctcaat accccggccc 721 ctctgccgac ctcttgcaat ccaagtgaca tgagccatgg gtacgtgaca gtaaagccca 781 gggtccgtct gcattttgtg gagctgggct ggcctgctgt gtgcctctgc catggatttc 841 ccgagagttg gtattcttgg aggtaccaga tccctgctct ggcccaggca ggttaccggg 901 tcctagctat ggacatgaaa ggctatggag agtcatctgc tcctcccgaa atagaagaat 961 attgcatgga agtgttatgt aaggagatgg taaccttcct ggataaactg ggcctctctc 1021 aagcagtgtt cattggccat gactggggtg gcatgctggt gtggtacatg gctctcttct 1081 accccgagag agtgagggcg gtggccagtt tgaatactcc cttcatacca gcaaatccca 1141 acatgtcccc tttggagagt atcaaagcca acccagtatt tgattaccag ctctacttcc 1201 aagaaccagg agtggctgag gctgaactgg aacagaacct gagtcggact ttcaaaagcc 1261 tcttcagagc aagcgatgag agtgttttat ccatgcataa agtctgtgaa gcgggaggac 1321 tttttgtaaa tagcccagaa gagcccagcc tcagcaggat ggtcactgag gaggaaatcc 1381 agttctatgt gcagcagttc aagaagtctg gtttcagagg tcctctaaac tggtaccgaa 1441 acatggaaag gaactggaag tgggcttgca aaagcttggg acggaagatc ctgattccgg 1501 ccctgatggt cacggcggag aaggacttcg tgctcgttcc tcagatgtcc cagcacatgg 1561 aggactggat tccccacctg aaaaggggac acattgagga ctgtgggcac tggacacaga 1621 tggacaagcc aaccgaggtg aatcagatcc tcattaagtg gctggattct gatgcccgga 1681 acccaccggt ggtctcaaag atgtagaacg cagcgtagtg cccacgctca gcaggtgtgc 1741 catccttcca cctgctgggg caccattctt agtatacaga ggtggcctta cacacatctt 1801 gcatggatgg cagcattgtt ctgaaggggt ttgcagaaaa aaaagatttt ctttacataa 1861 agtgaatcaa atttgacatt attttagatc ccagagaaat caggtgtgat tagttctcca 1921 ggcatgaatg catcgtccct ttatctgtaa gaacccttag tgtcctgtag ggggacagaa 1981 tggggtggcc aggtggtgat ttctctttga ccaatgcata gtttggcaga aaaatcagcc 2041 gttcatttag aagaatctta gcagagattg ggatgcctta ctcaataaag ctaagatgac // LOCUS HUMCYTFAOH 2576 bp mRNA PRI 31-DEC-1994 DEFINITION Human cytochrome p-450 4A (CYP4A) mRNA, complete cds. ACCESSION L04751 NID g181396 KEYWORDS cytochrome P450; cytochrome P450 4A; fatty acid omega-hydroxylase. SOURCE Homo sapiens adult kidney cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2576) AUTHORS Palmer,C.N., Richardson,T.H., Griffin,K.J., Hsu,M.H., Muerhoff,A.S., Clark,J.E. and Johnson,E.F. TITLE Characterization of a cDNA encoding a human kidney, cytochrome P-450 4A fatty acid omega-hydroxylase and the cognate enzyme expressed in Escherichia coli JOURNAL Biochim. Biophys. Acta 1172 (1-2), 161-166 (1993) MEDLINE 93176801 FEATURES Location/Qualifiers source 1..2576 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="kidney" gene 33..1592 /gene="CYP4A" CDS 33..1592 /gene="CYP4A" /standard_name="CYP4A" /codon_start=1 /function="fatty acid omega-hydroxylase" /evidence=experimental /product="cytochrome P450" /db_xref="PID:g181397" /translation="MSVSVLSPSRLLGDVSGILQAASLLILLLLLIKAVQLYLHRQWL LKALQQFPCPPSHWLFGHIQELQQDQELQRIQKWVETFPSACPHWLWGGKVRVQLYDP DYMKVILGRSDPKSHGSYRFLAPWIGYGLLLLNGQTWFQHRRMLTPAFHYDILKPYVG LMADSVRVMLDKWEELLGQDSPLEVFQHVSLMTLDTIMKCAFSHQGSIQVDRNSQSYI QAISDLNNLVFSRVRNAFHQNDTIYSLTSAGRWTHRACQLAHQHTDQVIQLRKAQLQK EGELEKIKRKRHLDFLDILLLAKMENGSILSDKDLRAEVDTFMFEGHDTTASGISWIL YALATHPKHQERCREEIHSLLGDGASITWNHLDQMPYTTMCIKEALRLYPPVPGIGRE LSTPVTFPDGRSLPKGIMVLLSIYGLHHNPKVWPNPEVFDPFRFAPGSAQHSHAFLPF SGGSRNCIGKQFAMNELKVATALTLLRFELLPDPTRIPIPIARLVLKSKNGIHLRLRR LPNPCEDKDQL" BASE COUNT 572 a 793 c 583 g 628 t ORIGIN 1 gaattccgca gagatccagc aggtgctgca ccatgagtgt ctctgtgctg agccccagca 61 gactcctggg tgatgtctct ggaatcctcc aagcggcctc cctgctcatt ctgcttctgc 121 tgctgatcaa ggcagttcag ctctacctgc acaggcagtg gctgctcaaa gccctccagc 181 agttcccgtg ccctccctcc cactggctct tcgggcacat ccaggagctc caacaggacc 241 aggagctaca acggattcag aaatgggtgg agacattccc aagtgcctgt cctcattggc 301 tatggggagg caaagttcgt gtccagctct atgaccctga ctatatgaag gtgattctgg 361 ggagatcaga cccgaaatcc catggttcct acagattcct ggctccatgg attgggtacg 421 gcttgctcct gttgaatggg cagacatggt tccagcatcg acggatgctg accccagcct 481 tccactatga catcctgaag ccctatgtgg ggctcatggc agactctgta cgagtgatgc 541 tggacaaatg ggaagagctc cttggccagg attcccctct ggaggtcttt cagcacgtct 601 ccttgatgac cctggacacc atcatgaagt gtgccttcag ccatcagggc agcatccagg 661 tggacaggaa ttctcagtcc tacatacagg ccattagtga cctgaacaac ctggtttttt 721 cccgtgtgag gaatgccttt caccagaatg acaccatcta cagcctgacc tctgctggcc 781 gctggacaca ccgcgcctgc cagctggccc atcagcacac agaccaagtg atccaactga 841 ggaaggctca actacagaag gagggggagc tggagaagat caagaggaag aggcatttgg 901 attttctgga tatcctcctc ttggccaaaa tggagaatgg gagcatcttg tcagacaagg 961 acctccgtgc tgaggtggac acgttcatgt ttgagggcca cgacaccaca gccagtggga 1021 tctcctggat cctctatgct ctggccacac accccaagca tcaggagagg tgccgggagg 1081 agatccacag cctcctgggt gatggagcct ccatcacctg gaaccacctg gaccagatgc 1141 cctacaccac catgtgcatt aaggaggcac tgaggctcta cccaccggtg ccaggcattg 1201 gcagagagct cagcactccc gtcaccttcc ctgatgggcg ctccttgccc aaaggtatca 1261 tggtcctcct ctccatttat ggccttcacc acaacccaaa agtgtggccc aacccagagg 1321 tgtttgaccc tttccgtttt gcaccgggtt ctgctcaaca cagccacgct ttcctgccct 1381 tctcaggagg atcaaggaac tgcattggga aacaatttgc catgaacgag ctgaaggtgg 1441 ccacggccct gaccctgctc cgctttgagc tgctgcctga tcccaccagg atccccatcc 1501 ccattgcacg acttgtgttg aaatccaaaa atggaatcca cctgcgtctc aggaggctcc 1561 ctaacccttg tgaagacaag gaccagcttt gagggcctcc acctgccgtc ctgtcttcct 1621 gacccccgct tctgtcccct tcctgtctgc ccatatcctg ttttctgtct gcccaccttc 1681 ccttcttccc acctgcctgc tgtcccccag tctgcctgcc cttctctctc tcacctttct 1741 ccaggctccc tacctgcttg tctacctgtc tcctacccac ctgtatctct tgttgggaga 1801 aaagctgagt gttgggagaa gctgaggccg agcttgcatg tctgacataa tgtaaaagag 1861 tcttgaatca tgtccaggat ccagggtcta aaaccccttg tggcctttgg aacaccaagc 1921 tctgtgctga agggtggaag gctaccctga cgcaccataa tctaagcccg gggcataaaa 1981 cccctcgtgg cttggataga atccagggct cgtggctctg gaatgtgtct ggacttgctg 2041 gctccttgct ccttgctctc ccaggatcaa ttgtatcttg agttaaaaga acctgctctc 2101 cattatctca agtaacagag cagatgctaa accgtcacag ctgtaaattg tgtgcttaat 2161 gcaacatgcc ctttcgaccc accccccatt ctcaccacct gtttctttgt ttgatcacca 2221 ataaataatc tgcacttcca gagctcgggg ccttcacagc ctccatcctt agctttggcg 2281 ccctggaccc actttctctc tcaaactgtc ttttctcact gctttgactc tgccggactt 2341 tgtcaccccc acgacctggt gttgggtctg aacaccccaa catccctgaa tctccaccca 2401 cctcccaaac tcctgcctgc cctccagact gtctgcccat acacctgtct ccttcttcct 2461 gcctggcttg tctgttccta tattagtttc ctattactgc tgtaataaac tatcacaatc 2521 tcagtgattt taaataaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaacg gaattc // LOCUS HUMCYTOKER 2427 bp mRNA PRI 17-AUG-1992 DEFINITION Human epidermal cytokeratin 2 mRNA, complete cds. ACCESSION M99061 NID g181401 KEYWORDS cytokeratin; cytoskeletal protein. SOURCE Homo sapiens adult epidermal cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2427) AUTHORS Collin,C., Moll,R., Kubicka,S., Ouhayoun,J.-P. and Franke,W.W. TITLE Characterization of Human Cytokeratin 2, an Epidermal Cytoskeletal Protein Synthesized Late During Differentiation JOURNAL Exp. Cell Res. (1992) In press FEATURES Location/Qualifiers source 1..2427 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="epidermal" CDS 34..1971 /codon_start=1 /product="epidermal cytokeratin 2" /db_xref="PID:g181402" /translation="MSCQISCKSRGRGGGGGGFRGFSSGSAVVSGGSRRSTSSFSCLS RHGGGGGGFGGGGFGSRSLVGLGGTKSISISVAGGGGGFGAAGGFGGRGGGFGGGSGF GGGSGFGGGSGFSGGGFGGGGFGGGRFGGFGGPGGVGGLGGPGGFGPGGYPGGIHEVS VNQSLLQPLNVKVDPEIQNVKAQEREQIKTLNNKFASFIDKVRFLEQQNQVLQTKWEL LQQMNVGTRPINLEPIFQGYIDSLKRYLDGLTAERTSQNSELNNMQDLVEDYKKKYED EINKRTAAENDFVTLKKDVDNAYMIKVELQSKVDLLNQEIEFLKVLYDAEISQIHQSV TDTNVILSMDNSRNLDLDSIIAEVKAQYEEIAQRSKEEAEALYHSKYEELQVTVGRHG DSLKEIKIEISELNRVIQRLQGEIAHVKKQCKNVQDAIADAEQRGEHALKDARNKLND LEEALQQAKEDLARLLRDYQELMNVKLALDVEIATYRKLLEGEECRMSGDLSSNVTVS VTSSTISSNVASKAAFGGSGGRGSSSGGGYSSGSSSYGSGGRQSGSRGGSGGGGSISG GGYGSGGGSGGRYGSGGGSKGGSISGGGYGSGGGKHSSGGGSRGGSSSGGGYGSGGGG SSSVKGSSGEAFGSSVTFSFR" BASE COUNT 562 a 541 c 768 g 556 t ORIGIN 1 agcctgtgac tttcctccct ggacaaaggc atcatgagtt gtcagatctc ttgcaaatct 61 cgaggaagag gaggaggtgg aggaggattc cggggcttca gcagcggctc agctgtggtg 121 tctggtggaa gccggagatc aacttccagc ttctcctgct tgagccgcca tggtggtggt 181 ggcgggggct tcggtggagg cggctttggc agtcggagtc ttgttggcct tggagggacc 241 aagagcatct ccattagtgt ggctggagga ggtggtggct ttggcgccgc tggtggattt 301 ggtggcagag gaggtggttt tggaggcggc agcggctttg gaggcggcag cggctttgga 361 ggtggcagcg gcttcagtgg tggtggtttc ggtggaggcg gctttggtgg aggccgcttt 421 ggaggttttg ggggccctgg tggtgttgga ggtttagggg gtcctggtgg ctttgggcct 481 ggaggatacc ctggtggcat ccacgaagtc tctgtcaacc agagcctcct gcagcctctc 541 aacgtgaaag ttgacccaga gatccagaat gtgaaggccc aagagcgtga gcagatcaaa 601 actctcaaca acaaatttgc ctccttcatt gacaaggtgc ggttcttgga gcagcagaac 661 caggtgttac agaccaaatg ggagctgcta caacaaatga atgttggcac ccgccccatc 721 aacctggagc ccatcttcca ggggtatatc gacagcctca agagatatct ggatgggctc 781 actgcagaaa gaacatcaca gaattcagag ctgaataaca tgcaggatct tgtggaggat 841 tataagaaga agtatgagga tgaaatcaat aagcgcacag ctgctgagaa tgattttgtg 901 acgcttaaaa aggacgtgga caatgcctac atgataaagg tggagttgca gtccaaggtg 961 gacctgctga accaggaaat tgagtttctg aaagttctct atgatgcgga gatatcccag 1021 atacatcaga gtgtcactga caccaacgtc atcctctcca tggacaacag ccgcaacctg 1081 gacttggata gcatcatcgc cgaggtcaag gcccagtatg aggagatcgc ccagaggagc 1141 aaggaagaag cggaggccct gtaccacagc aagtatgagg agctccaggt gactgtcggg 1201 agacatggag acagcctgaa agagatcaag atagagatca gcgagctgaa ccgcgtgatc 1261 cagaggctgc agggggagat cgcacatgtg aagaagcagt gtaagaatgt gcaagatgcc 1321 atcgcagatg ccgagcagcg tggggagcat gccctcaagg atgccaggaa caagttgaat 1381 gacctggagg aggccctgca gcaggccaag gaggacttgg cgcggctgct gcgtgactac 1441 caggagctga tgaacgtgaa gctggcccta gatgtggaga tcgccaccta ccgcaaactg 1501 ctggagggcg aggagtgcag gatgtctgga gacctcagca gcaatgtgac tgtgtctgtg 1561 acaagcagca ccatttcatc aaatgtggca tccaaggctg cctttggagg ttctggaggt 1621 agagggtcca gttccggagg aggatacagc tctggaagca gcagttatgg ctctggaggc 1681 cgacagtctg gctccagagg cggtagtgga ggaggaggtt ctatctctgg aggaggatat 1741 ggctctggcg gtggttctgg aggaagatac ggatctggtg gtggctctaa gggagggtcc 1801 atctctggag gaggatatgg ctctggaggt ggaaaacaca gctctggagg tggctctaga 1861 ggaggctcca gctctggagg aggatatggc tctggaggtg ggggttctag ctctgtaaag 1921 ggtagctcag gtgaagcttt tggttccagc gtgaccttct cttttagata aagatgagcc 1981 cccaccacca ccgactctcc caacccagac tctcccactc cagaatgtag aagcctgtct 2041 ctgtacctct aactggcagc aagttaaatt tttgtcattt atctctgatg gcactttgag 2101 ggaaaagaat gtccacatac agtttttgaa agatcttctc tccaaaccag ttagttagag 2161 ccagtgacgc ctctgtgttc tggggcggaa tctgtgctgt ctaggtttgt gcttctagcc 2221 atgcccattc ccgcccccac catgcctctt tgcattgccc attttccaga tgtgtattct 2281 gttgaggacc caggcccatc cagggatttc atctctaagc ctggcagtgc tggggggaaa 2341 tgtgtttctg tgtatatagc tcctcttgtc cactctgctt tcggaagtgc tgtggtctgg 2401 gggtcttcat aataaacctc atttgca // LOCUS HUMD123A 1542 bp mRNA PRI 17-JUL-1996 DEFINITION Human mRNA for protein D123, complete cds. ACCESSION D14878 NID g1435036 KEYWORDS protein D123; D123; temperature-sensitive G1-phase arrest; cell cycle mutation. SOURCE Homo sapiens foreskin fibroblast cDNA to mRNA, clone_lib:pCD2 Basinger. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1542) AUTHORS Okuda,A. and Kimura,G. TITLE An amino acid change in novel protein D123 is responsible for temperature-sensitive G1-phase arrest in a mutant of rat fibroblast line 3Y1 JOURNAL Exp. Cell Res. 223 (2), 242-249 (1996) MEDLINE 96177290 REFERENCE 2 (bases 1 to 1542) AUTHORS Okuda,A. TITLE Direct Submission JOURNAL Submitted (05-APR-1993) to the DDBJ/EMBL/GenBank databases. Atsuyuki Okuda, Kyushu University, Medical Institute of Bioregulation; 3-1-1 Maidashi, Higashi-ku, Fukuoka 812, Japan (Tel:092-641-1151(ex.3742), Fax:092-641-1315) FEATURES Location/Qualifiers source 1..1542 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="fibroblast" /clone_lib="pCD2 Basinger" /tissue_type="foreskin" gene 281..1291 /gene="D123" CDS 281..1291 /gene="D123" /function="complementation of temperature-sensitive cell cycle mutaion in 3Y1tsD123" /codon_start=1 /product="protein D123" /db_xref="PID:d1004105" /db_xref="PID:g1435037" /translation="MKKEHVLHCQFSAWYPFFRGVTIKSVILPLPQNVKDYLLDDGTL VVSGRDDPPTHSQPDSDDEAEEIQWSDDENTATLTAPEFPEFATKVQEPINSLGGSVF PKLNWSAPRDAYWIAMNSSLKCKTLSDIFLLFKSSDFITRDFTQPFIHCTDDSPDPCI EYELVLRKWCELIPGAEFRCFVKENKLIGISQRDYTQYYDHISKQKEEIRRCIQDFFK KHIQYKFLDEDFVFDIYRDSRGKVWLIDFNPFGEVTDSLLFTWEELISENNLNGDFSE VDAQEQDSPAFRCTNSEVTVQPSPYLSYRLPKDFVDLSTGRDAHKLIDFLKLKRNQQE DD" polyA_site 1542 BASE COUNT 422 a 347 c 367 g 406 t ORIGIN 1 tgcgtttagg gcgaagacgg agttgtaaac ttcttaaaat tcctctctcg acacttcggt 61 aattcctctt tcgagactaa agctcttttt gtatgcgtgt gtgtcaagcg tatgccccgg 121 gattctcctc cgcttccttt tctcggtctt ccttcttgct ttagggaccg gaagagtcct 181 tgaaccaaaa tagctcggcg ggcacttccg gggccggcgc ccagagttcc gggagggtgc 241 aggcaggaga gggaaaggca gcagcggcgg cagctggagg atgaagaagg agcatgtgct 301 tcactgccag ttctccgcgt ggtacccgtt cttccgaggc gttaccatca agagtgtcat 361 tcttccactt cctcagaatg tgaaggatta tttactcgat gatggaactc tggtggtttc 421 aggaagggat gatccaccaa cacattctca gccagacagt gatgatgaag cagaagaaat 481 acagtggtct gatgatgaga acacagccac gcttacggca ccagaatttc ctgagtttgc 541 cactaaagtc caggaaccta tcaattccct cgggggcagt gtctttccta agcttaattg 601 gagtgcccca agggatgcgt attggatagc aatgaatagt tctctgaaat gtaaaaccct 661 cagcgacatc tttctgcttt tcaagagttc cgatttcatc actcgtgact tcactcagcc 721 gtttattcat tgtactgatg attctccaga tccatgtata gaatatgagc tcgttctccg 781 aaaatggtgt gaattgattc ctggggctga gtttcgatgt tttgtcaagg aaaacaagct 841 tattggtatt tctcaaagag actacacaca atactatgat catatttcta aacaaaagga 901 agaaattcgc agatgcatac aagacttttt caagaaacac atacagtaca aattcttaga 961 tgaagacttt gtgttcgata tatacagaga cagtaggggg aaggtgtggc tcattgactt 1021 taatccattt ggtgaagtca cagattcact gctgttcacc tgggaagaac tgatatctga 1081 gaacaactta aacggcgatt ttagtgaagt tgacgctcaa gagcaggatt ccccagcttt 1141 ccgttgcaca aacagtgaag tgacagtcca gcccagcccc tatttgagtt accggctacc 1201 caaggacttt gtagacctct ctactgggag ggacgctcac aagctaatag acttccttaa 1261 gctgaagaga aatcagcagg aggacgactg atgagcgtac tgtaactgga gaagaggagg 1321 ccccgcccca ccgctccggg agctgctcat cagccgcaac ttcctgccga ccctgatgcg 1381 ggtgggccga gcagtgtgga catcagccac tttttatatt catgtacatt cacctgggga 1441 aaaaaacgga gggactttgc tacttgtaaa aataacataa taaatagatc ttaaacatag 1501 gaaaaccata ctgttctgat aataaaatgc tttctatgaa at // LOCUS HUMD2A 1756 bp mRNA PRI 13-FEB-1996 DEFINITION Human dopamine D2 receptor, mRNA, complete cds. ACCESSION M30625 NID g181431 KEYWORDS dopamine D2 receptor. SOURCE Homo sapiens (clone: 15-3 and 14(-1,-2).) foetus brain and pituitary cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1756) AUTHORS Selbie,L.A., Hayes,G. and Shine,J. TITLE The major dopamine D2 receptor: molecular analysis of the human D2A subtype JOURNAL DNA 8 (9), 683-689 (1989) MEDLINE 90126238 FEATURES Location/Qualifiers source 1..1756 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="15-3 and 14(-1,-2)." /dev_stage="foetus" /tissue_type="brain and pituitary" /map="11q22-q23" CDS 240..305 /note="ORF; putative" /codon_start=1 /product="unknown protein" /db_xref="PID:g563756" /translation="MRRELEASSSRRRLCPRGPMA" gene 337..1668 /gene="DRD2" CDS 337..1668 /gene="DRD2" /codon_start=1 /db_xref="GDB:G00-119-852" /product="dopamine D2 receptor" /db_xref="PID:g181432" /translation="MDPLNLSWYDDDLERQNWSRPFNGSDGKADRPHYNYYATLLTLL IAVIVFGNVLVCMAVSREKALQTTTNYLIVSLAVADLLVATLVMPWVVYLEVVGEWKF SRIHCDIFVTLDVMMCTASILNLCAISIDRYTAVAMPMLYNTRYSSKRRVTVMISIVW VLSFTISCPLLFGLNNADQNECIIANPAFVVYSSIVSFYVPFIVTLLVYIKIYIVLRR RRKRVNTKRSSRAFRAHLRAPLKGNCTHPEDMKLCTVIMKSNGSFPVNRRRVEAARRA QELEMEMLSSTSPPERTRYSPIPPSHHQLTLPDPSHHGLHSTPDSPAKPEKNGHAKDH PKIAKIFEIQTMPNGKTRTSLKTMSRRKLSQQKEKKATQMLAIVLGVFIICWLPFFIT HILNIHCDCNIPPVLYSAFTWLGYVNSAVNPIIYTTFNIEFRKAFLKILHC" BASE COUNT 351 a 636 c 456 g 313 t ORIGIN 1 ccggcagcct cacgcgcgca ccgcgcctcc gccccgtccc cgcgctccct cctgcccgcc 61 cgccccggcc cggccccgcc ccgccccgcg ccgccgcggc ccgtccactg ctccccgcgg 121 gccagagccg gccgagctgc tgcccgccgg ggctctgaag ggccggcggg gcggtagagc 181 caggaccggc cgaggagagt gcgcgcccgg acggctgccg gaggggcggc cgcgcgtgga 241 tgcggcggga gctggaagcc tcaagcagcc ggcgccgtct ctgcccccgg ggccctatgg 301 cttgaagagc ctggacccag tggctccacc gccctgatgg atccactgaa tctgtcctgg 361 tatgatgatg atctggagag gcagaactgg agccggccct tcaacgggtc agacgggaag 421 gcggacagac cccactacaa ctactatgcc acactgctca ccctgctcat cgctgtcatc 481 gtcttcggca acgtgctggt gtgcatggct gtgtcccgcg agaaggcgct gcagaccacc 541 accaactacc tgatcgtcag cctcgcagtg gccgacctcc tcgtcgccac actggtcatg 601 ccatgggttg tctacctgga ggtggtaggt gagtggaaat tcagcaggat tcactgtgac 661 atcttcgtca ctctggacgt catgatgtgc acggcgagca tcctgaactt gtgtgccatc 721 agcatcgaca ggtacacagc tgtggccatg cccatgctgt acaatacgcg ctacagctcc 781 aagcgccggg tcaccgtcat gatctccatc gtctgggtcc tgtccttcac catctcctgc 841 ccactcctct tcggactcaa taacgcagac cagaacgagt gcatcattgc caacccggcc 901 ttcgtggtct actcctccat cgtctccttc tacgtgccct tcattgtcac cctgctggtc 961 tacatcaaga tctacattgt cctccgcaga cgccgcaagc gagtcaacac caaacgcagc 1021 agccgagctt tcagggccca cctgagggct ccactaaagg gcaactgtac tcaccccgag 1081 gacatgaaac tctgcaccgt tatcatgaag tctaatggga gtttcccagt gaacaggcgg 1141 agagtggagg ctgcccggcg agcccaggag ctggagatgg agatgctctc cagcaccagc 1201 ccacccgaga ggacccggta cagccccatc ccacccagcc accaccagct gactctcccc 1261 gacccgtccc accacggtct ccacagcact cctgacagcc ccgccaaacc agagaagaat 1321 gggcatgcca aagaccaccc caagattgcc aagatctttg agatccagac catgcccaat 1381 ggcaaaaccc ggacctccct caagaccatg agccgtagaa agctctccca gcagaaggag 1441 aagaaagcca ctcagatgct cgccattgtt ctcggcgtgt tcatcatctg ctggctgccc 1501 ttcttcatca cacacatcct gaacatacac tgtgactgca acatcccgcc tgtcctgtac 1561 agcgccttca cgtggctggg ctatgtcaac agcgccgtga accccatcat ctacaccacc 1621 ttcaacattg agttccgcaa ggccttcctg aagatccttc actgctgact ctgctgcctg 1681 cccgcacagc agctgttccc acctccctgc ccaggccagc cagcctcacc cttgcgaacc 1741 gtgagctagg aaggca // LOCUS HUMD4C 1504 bp mRNA PRI 02-NOV-1994 DEFINITION Homo sapiens dopamine D4 receptor (DRD4) mRNA (D4.7) sequence. ACCESSION L12398 X58497 NID g291945 KEYWORDS dopamine D4 receptor; dopamine receptor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1504) AUTHORS Van Tol,H.H., Bunzow,J.R., Guan,H.C., Sunahara,R.K., Seeman,P., Niznik,H.B. and Civelli,O. TITLE Cloning of the gene for a human dopamine D4 receptor with high affinity for the antipsychotic clozapine JOURNAL Nature 350 (6319), 610-614 (1991) MEDLINE 91204054 REFERENCE 2 (bases 1 to 1504) AUTHORS Van Tol,H.H., Wu,C.M., Guan,H.C., Ohara,K., Bunzow,J.R., Civelli,O., Kennedy,J., Seeman,P., Niznik,H.B. and Jovanovic,V. TITLE Multiple dopamine D4 receptor variants in the human population [see comments] JOURNAL Nature 358 (6382), 149-152 (1992) MEDLINE 92310588 FEATURES Location/Qualifiers source 1..1504 /organism="Homo sapiens" /db_xref="taxon:9606" /map="11p15.5" gene 1..1404 /gene="DRD4" CDS 1..1404 /gene="DRD4" /codon_start=1 /db_xref="GDB:G00-127-782" /product="dopamine receptor D4" /db_xref="PID:g291946" /translation="MGNRSTADADGLLAGRGPAAGASAGASAGLAGQGAAALVGGVLL IGAVLAGNSLVCVSVATERALQTPTNSFIVSLAAADLLLALLVLPLFVYSEVQGGAWL LSPRLCDALMAMDVMLCTASIFNLCAISVDRFVAVAVPLRYNRQGGSRRQLLLIGATW LLSAAVAAPVLCGLNDVRGRDPAVCRLEDRDYVVYSSVCSFFLPCPLMLLLYWATFRG LQRWEVARRAKLHGRAPRRPSGPGPPSPTPPAPRLPQDPCGPDCAPPAPGLPRGPCGP DCAPAAPGLPPDPCGPDCAPPAPGLPQDPCGPDCAPPAPGLPRGPCGPDCAPPAPGLP QDPCGPDCAPPAPGLPPDPCGSNCAPPDAVRAAALPPQTPPQTRRRRRAKITGRERKA MRVLPVVVGAFLLCWTPFFVVHITQALCPACSVPPRLVSAVTWLGYVNSALNPVIYTV FNAEFRNVFRKALRACC" repeat_region 744..1080 /gene="DRD4" /note="the 48bp direct repeat unit is polymorphic with respect to the number of repeated units (7 units for D4.7) (-8 &10 units; cit. #2; feature c ) and seq. of the individual units (feature e, unpubl.); G00-127-782" /rpt_type=direct /rpt_unit=744..792 variation 791..1033 /gene="DRD4" /note="this sequence was deleted in the D4 variant with a 2-fold repeat (D4.2); this polymorphic variant was isolated as a partial cDNA from the neuroepithelioma SK-N-MC (see cit #3); G00-127-782" /phenotype="dopamine D4.2 receptor" /replace="cc" variation 839..985 /gene="DRD4" /note="this group of nucleotides is deleted in the polymorphic variant of the D4 receptor with a 4-fold repeat (D4.4); isolated as a partial cDNA from human pituitary and substantia nigra cDNA library; G00-127-782" /phenotype="dopamine D4.4 receptor" /replace="cc" variation 985 /gene="DRD4" /note="G00-127-782" /phenotype="dopamine D4.4 receptor" /replace="g" variation 994 /gene="DRD4" /note="G00-127-782" /phenotype="dopamine D4.4 receptor" /replace="a" polyA_site 1504 /gene="DRD4" /note="four independently isolated cDNA clones contain a poly A tail after this nucleotide; G00-127-782" /evidence=experimental BASE COUNT 153 a 633 c 487 g 231 t ORIGIN 1 atggggaacc gcagcaccgc ggacgcggac gggctgctgg ctgggcgcgg gccggccgcg 61 ggggcatctg cgggggcatc tgcggggctg gctgggcagg gcgcggcggc gctggtgggg 121 ggcgtgctgc tcatcggcgc ggtgctcgcg gggaactcgc tcgtgtgcgt gagcgtggcc 181 accgagcgcg ccctgcagac gcccaccaac tccttcatcg tgagcctggc ggccgccgac 241 ctcctcctcg ctctcctggt gctgccgctc ttcgtctact ccgaggtcca gggtggcgcg 301 tggctgctga gcccccgcct gtgcgacgcc ctcatggcca tggacgtcat gctgtgcacc 361 gcctccatct tcaacctgtg cgccatcagc gtggacaggt tcgtggccgt ggccgtgccg 421 ctgcgctaca accggcaggg tgggagccgc cggcagctgc tgctcatcgg cgccacgtgg 481 ctgctgtccg cggcggtggc ggcgcccgta ctgtgcggcc tcaacgacgt gcgcggccgc 541 gaccccgccg tgtgccgcct ggaggaccgc gactacgtgg tctactcgtc cgtgtgctcc 601 ttcttcctac cctgcccgct catgctgctg ctctactggg ccacgttccg cggcctgcag 661 cgctgggagg tggcacgtcg cgccaagctg cacggccgcg cgccccgccg acccagcggc 721 cctggcccgc cttcccccac gccacccgcg ccccgcctcc cccaggaccc ctgcggcccc 781 gactgtgcgc cccccgcgcc cggccttccc cggggtccct gcggccccga ctgtgcgccc 841 gccgcgcccg gcctcccccc ggacccctgc ggccccgact gtgcgccccc cgcgcccggc 901 ctcccccagg acccctgcgg ccccgactgt gcgccccccg cgcccggcct tccccggggt 961 ccctgcggcc ccgactgtgc gccccccgcg cccggcctcc cccaggaccc ctgcggcccc 1021 gactgtgcgc cccccgcgcc cggcctcccc ccggacccct gcggctccaa ctgtgctccc 1081 cccgacgccg tcagagccgc cgcgctccca ccccagactc caccgcagac ccgcaggagg 1141 cggcgtgcca agatcaccgg ccgggagcgc aaggccatga gggtcctgcc ggtggtggtc 1201 ggggccttcc tgctgtgctg gacgcccttc ttcgtggtgc acatcacgca ggcgctgtgt 1261 cctgcctgct ccgtgccccc gcggctggtc agcgccgtca cctggctggg ctacgtcaac 1321 agcgccctca accccgtcat ctacactgtc ttcaacgccg agttccgcaa cgtcttccgc 1381 aaggccctgc gtgcctgctg ctgagccggg cacccccgga cgccccccgg cctgatggcc 1441 aggcctcagg gaccaaggag atggggaggg cgcttttgta cgttaattaa acaaattcct 1501 tccc // LOCUS HUMDAD1A 699 bp mRNA PRI 27-MAR-1996 DEFINITION Human mRNA for DAD-1, complete cds. ACCESSION D15057 NID g493244 KEYWORDS DAD-1; apotosis; membrane protein; temperature sensitive mutation. SOURCE Homo sapiens cell-line Raji cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 699) AUTHORS Nakashima,T., Sekiguchi,T., Kuraoka,A., Fukushima,K., Shibata,Y., Komiyama,S. and Nishimoto,T. TITLE Molecular cloning of a human cDNA encoding a novel protein, DAD1, whose defect causes apoptotic cell death in hamster BHK21 cells JOURNAL Mol. Cell. Biol. 13 (10), 6367-6374 (1993) MEDLINE 94019310 REFERENCE 2 (bases 1 to 699) AUTHORS Nakashima,T. TITLE Direct Submission JOURNAL Submitted (17-APR-1993) to the DDBJ/EMBL/GenBank databases. Torahiko Nakashima, Kyushu University, Graduate school of Medicine, Department of Molecular Biology; 6-10-4 Hakozaki, Higashiku, Fukuoka, Fukuoka 812, Japan (Tel:092-641-1151, Fax:092-632-2373) COMMENT Submitted (17-APR-1993) to DDBJ by: Torahiko Nakashima Department of Molecular Biology Graduate School of Medical Science Kyushu University Maidashi 3-1-1 Higashi-ku Fukuoka 812 Japan Phone: 092-641-1151 Fax: 092-632-2373. FEATURES Location/Qualifiers source 1..699 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Raji" CDS 67..408 /codon_start=1 /product="DAD-1" /db_xref="PID:d1004164" /db_xref="PID:g914935" /translation="MSASVVSVISRFLEEYLSSTPQRLKLLDAYLLYILLTGALQFGY CLLVGTFPFNSFLSGFISCVGSFILAVCLRIQINPQNKADFQGISPERAFADFLFAST ILHLVVMNFVG" BASE COUNT 162 a 169 c 156 g 212 t ORIGIN Chromosome X. 1 catccggtgt ggtcgacggg tcctccaaga gtttggggcg cggaccggag taccttgcgt 61 gcagttatgt cggcgtcggt agtgtctgtc atttcgcggt tcttagaaga gtacttgagc 121 tccactccgc agcgtctgaa gttgctggac gcgtacctgc tgtatatact gctgaccggg 181 gcgctgcagt tcggttactg tctcctcgtg gggaccttcc ccttcaactc ttttctctcg 241 ggcttcatct cttgtgtggg gagtttcatc ctagcggttt gcctgagaat acagatcaac 301 ccacagaaca aagcggattt ccaaggcatc tccccagagc gagcctttgc tgattttctc 361 tttgccagca ccatcctgca ccttgttgtc atgaactttg ttggctgaat cattctcatt 421 tacttaattg aggagtagga gactaaaaga atgttcactc tttgaatttc ctggataaga 481 gttctggaga tggcagctta ttggacacat ggattttctt cagatttgac acttactgct 541 agctctgctt tttatgacag gagaaaagcc cagagttcac tgtgtgtcag aacaactttc 601 taacaaacat ttattaatcc agcctctgcc tttcattaaa tgtaaccttt tgctttccaa 661 attaaagaac tccatgccac tcctcaaaaa aaaaaaaaa // LOCUS HUMDAFA 2220 bp mRNA PRI 02-NOV-1994 DEFINITION Human decay-accelerating factor mRNA, complete cds. ACCESSION M30142 NID g181464 KEYWORDS Alu repeat; alternative splicing; decay-accelerating factor; membrane glycoprotein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2220) AUTHORS Caras,I.W., Davitz,M.A., Rhee,L., Weddell,G., Martin,D.W. Jr. and Nussenzweig,V. TITLE Cloning of decay-accelerating factor suggests novel use of splicing to generate two proteins JOURNAL Nature 325 (6104), 545-549 (1987) MEDLINE 87115845 COMMENT The gene for decay accelerating factor produces two proteins by alternative splicing. The spliced out region is from position 1147-1265. The stop codon in this case is located at position 1327-1329. Though mRNAs do not have introns, the alternative coding region is indicated in the features table. FEATURES Location/Qualifiers source 1..2220 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /map="1q32" mRNA <1..2220 /gene="DAF" /note="G00-119-088" exon <66..1146 /gene="DAF" /note="decay-accelerating factor precursor A" /number=1 gene join(66..1146,1265..1329) /gene="DAF" CDS join(66..1146,1265..1329) /gene="DAF" /codon_start=1 /db_xref="GDB:G00-119-088" /product="decay-accelerating factor A" /db_xref="PID:g181465" /translation="MTVARPSVPAALPLLGELPRLLLLVLLCLPAVWGDCGLPPDVPN AQPALEGRTSFPEDTVITYKCEESFVKIPGEKDSVICLKGSQWSDIEEFCNRSCEVPT RLNSASLKQPYITQNYFPVGTVVEYECRPGYRREPSLSPKLTCLQNLKWSTAVEFCKK KSCPNPGEIRNGQIDVPGGILFGATISFSCNTGYKLFGSTSSFCLISGSSVQWSDPLP ECREIYCPAPPQIDNGIIQGERDHYGYRQSVTYACNKGFTMIGEHSIYCTVNNDEGEW SGPPPECRGKSLTSKVPPTVQKPTTVNVPTTEVSPTSQKTTTKTTTPNAQATRSTPVS RTTKHFHETTPNKGSGTTSGTTRLLSGHTCFTLTGLLGTLVTMGLLT" sig_peptide 66..167 /gene="DAF" /note="G00-119-088" mat_peptide 168..1385 /gene="DAF" /note="G00-119-088" /product="decay-accelerating factor A" mat_peptide join(168..1146,1265..1326) /gene="DAF" /note="G00-119-088" /product="decay-accelerating factor A" intron 1147..1264 /gene="DAF" /note="G00-119-088" repeat_region 1150..1264 /note="Alu repeat (partial)" exon 1265..>1329 /gene="DAF" /note="G00-119-088" /number=2 polyA_signal 1410..1415 /gene="DAF" /note="G00-119-088; putative" polyA_signal 1693..1698 /gene="DAF" /note="G00-119-088; putative" polyA_signal 1731..1736 /gene="DAF" /note="G00-119-088; putative" polyA_signal 2198..2203 /gene="DAF" /note="G00-119-088; putative" polyA_site 2220 /gene="DAF" /note="G00-119-088" BASE COUNT 681 a 455 c 475 g 609 t ORIGIN 1 ccgctgggcg tagctgcgac tcggcggagt cccggcggcg cgtccttgtt ctaacccggc 61 gcgccatgac cgtcgcgcgg ccgagcgtgc ccgcggcgct gcccctcctc ggggagctgc 121 cccggctgct gctgctggtg ctgttgtgcc tgccggccgt gtggggtgac tgtggccttc 181 ccccagatgt acctaatgcc cagccagctt tggaaggccg tacaagtttt cccgaggata 241 ctgtaataac gtacaaatgt gaagaaagct ttgtgaaaat tcctggcgag aaggactcag 301 tgatctgcct taagggcagt caatggtcag atattgaaga gttctgcaat cgtagctgcg 361 aggtgccaac aaggctaaat tctgcatccc tcaaacagcc ttatatcact cagaattatt 421 ttccagtcgg tactgttgtg gaatatgagt gccgtccagg ttacagaaga gaaccttctc 481 tatcaccaaa actaacttgc cttcagaatt taaaatggtc cacagcagtc gaattttgta 541 aaaagaaatc atgccctaat ccgggagaaa tacgaaatgg tcagattgat gtaccaggtg 601 gcatattatt tggtgcaacc atctccttct catgtaacac agggtacaaa ttatttggct 661 cgacttctag tttttgtctt atttcaggca gctctgtcca gtggagtgac ccgttgccag 721 agtgcagaga aatttattgt ccagcaccac cacaaattga caatggaata attcaagggg 781 aacgtgacca ttatggatat agacagtctg taacgtatgc atgtaataaa ggattcacca 841 tgattggaga gcactctatt tattgtactg tgaataatga tgaaggagag tggagtggcc 901 caccacctga atgcagagga aaatctctaa cttccaaggt cccaccaaca gttcagaaac 961 ctaccacagt aaatgttcca actacagaag tctcaccaac ttctcagaaa accaccacaa 1021 aaaccaccac accaaatgct caagcaacac ggagtacacc tgtttccagg acaaccaagc 1081 attttcatga aacaacccca aataaaggaa gtggaaccac ttcaggtact acccgtcttc 1141 tatctggttc tcgtcctgtc acccaggctg gtatgcggtg gtgtgatcgt agctcactgc 1201 agtctcgaac tcctgggttc aagcgatcct tccacttcag cctcccaagt agctggtact 1261 acagggcaca cgtgtttcac gttgacaggt ttgcttggga cgctagtaac catgggcttg 1321 ctgacttagc caaagaagag ttaagaagaa aatacacaca agtatacaga ctgttcctag 1381 tttcttagac ttatctgcat attggataaa ataaatgcaa ttgtgctctt catttaggat 1441 gctttcattg tctttaagat gtgttaggaa tgtcaacaga gcaaggagaa aaaaggcagt 1501 cctggaatca cattcttagc acacctacac ctcttgaaaa tagaacaact tgcagaattg 1561 agagtgattc ctttcctaaa agtgtaagaa agcatagaga tttgttcgta tttagaatgg 1621 gatcacgagg aaaagagaag gaaagtgatt tttttccaca agatctgtaa tgttatttcc 1681 acttataaag gaaataaaaa atgaaaaaca ttatttggat atcaaaagca aataaaaacc 1741 caattcagtc tcttctaagc aaaattgcta aagagagatg aaccacatta taaagtaatc 1801 tttggctgta aggcattttc atctttcctt cgggttggca aaatatttta aaggtaaaac 1861 atgctggtga accaggggtg ttgatggtga taagggagga atatagaatg aaagactgaa 1921 tcttcctttg ttgcacaaat agagtttgga aaaagcctgt gaaaggtgtc ttctttgact 1981 taatgtcttt aaaagtatcc agagatacta caatattaac ataagaaaag attatatatt 2041 atttctgaat cgagatgtcc atagtcaaat ttgtaaatct tattcttttg taatatttat 2101 ttatatttat ttatgacagt gaacattctg attttacatg taaaacaaga aaagttgaag 2161 aagatatgtg aagaaaaatg tatttttcct aaatagaaat aaatgatccc attttttggt // LOCUS HUMDAGI 5493 bp mRNA PRI 27-NOV-1995 DEFINITION Human dystroglycan (DAG1) mRNA, complete cds. ACCESSION L19711 NID g398025 KEYWORDS dystroglycan. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5493) AUTHORS Ibraghimov-Beskrovnaya,O., Milatovich,A., Ozcelik,T., Yang,B., Koepnick,K., Francke,U. and Campbell,K.P. TITLE Human dystroglycan: skeletal muscle cDNA, genomic structure, origin of tissue specific isoforms and chromosomal localization JOURNAL Hum. Mol. Genet. 2 (10), 1651-1657 (1993) MEDLINE 94093553 FEATURES Location/Qualifiers source 1..5493 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal and adult" /tissue_type="skeletal muscle" /tissue_lib="lambda-gt10 and gt11" /map="3p21" gene 395..3082 /gene="DAG1" CDS 395..3082 /gene="DAG1" /note="aa 751 to 764 is transmembrane domain" /codon_start=1 /product="dystroglycan" /db_xref="PID:g398026" /translation="MRMSVGLSLLLPLWGRTFLLLLSVVMAQSHWPSEPSEAVRDWEN QLEASMHSVLSDLHEAVPTVVGIPDGTAVVGRSFRVTIPTDLIASSGDIIKVSAAGKE ALPSWLHWDSQSHTLEGLPLDTDKGVHYISVSATRLGANGSHIPQTSSVFSIEVYPED HSDLQSVRTASPDPGEVVSSACAADEPVTVLTVILDADLTKMTPKQRIDLLHRMRSFS EVELHNMKLVPVVNNRLFDMSAFMAGPGNPKKVVENGALLSWKLGCSLNQNSVPDIHG VEAPAREGAMSAQLGYPVVGWHIANKKPPLPKRVRRQIHATPTPVTAIGPPTTAIQEP PSRIVPTPTSPAIAPPTETMAPPVRDPVPGKPTVTIRTRGAIIQTPTLGPIQPTRVSE AGTTVPGQIRPTMTIPGYVEPTAVATPPTTTTKKPRVSTPKPATPSTDSTTTTTRRPT KKPRTPRPVPRVTTKVSITRLETASPPTRIRTTTSGVPRGGEPNQRPELKNHIDRVDA WVGTYFEVKIPSDTFYDHEDTTTDKLKLTLKLREQQLVGEKSWVQFNSNSQLMYGLPD SSHVGKHEYFMHATDKGGLSAVDAFEIHVHRRPQGDRAPARFKAKFVGDPALVLNDIH KKIALVKKLAFAFGDRNCSTITLQNITRGSIVVEWTNNTLPLEPCPKEQIAGLSRRIA EDDGKPRPAFSNALEPDFKATSITVTGSGSCRHLQFIPVVPPRRVPSEAPPTEVPDRD PEKSSEDDVYLHTVIPAVVVAAILLIAGIIAMICYRKKRKGKLTLEDQATFIKKGVPI IFADELDDSKPPPSSSMPLILQEEKAPLPPPEYPNQSVPETTPLNQDTMGEYTPLRDE DPNAPPYQPPPPFTVPMEGKGSRPKNMTPYRSPPPYVPP" sig_peptide 395..475 /gene="DAG1" mat_peptide 476..3079 /gene="DAG1" /product="dystroglycan" polyA_site 5493 BASE COUNT 1199 a 1592 c 1437 g 1265 t ORIGIN 1 gggccagtcg gcgccgcgcg gagctggccg ctggattggc tgcaacactc gcgtgtcagg 61 cggttgctag gctccggccg cgcgccccgc ccttgcgctc agcgccctct caccgcccgg 121 tacgtgctcg cgcgaaggct gcggcgcggc gctcgcgcct cttaggcttg gcggtggcgg 181 cggcggcagc ttcgcgccga atccccgggg agcggcggtg gcggcgtcct ggggccagga 241 ggagcgaaca cctgccgcgg tcctcccgcc ggcgctgggc tctgtgtgct ccgggatgga 301 gcaggtgtgc agagggtgag aacccagctc tgggaccaag tcacttgctt ccttacttag 361 caagactatc gacttgagca aacttggacc tgggatgagg atgtctgtgg gcctctcgct 421 gctgctgccc ctctggggga ggacctttct cctcctgctc tctgtggtta tggctcagtc 481 ccactggccc agtgaaccct cagaggctgt cagggactgg gaaaaccagc ttgaggcatc 541 catgcactca gtgctctcag acctccacga ggctgttccc acagtggttg gcattcctga 601 tggcacggct gtcgtcgggc gctcatttcg agtgaccatt ccaacagatt tgattgcctc 661 cagtggagat atcatcaagg tatcagcggc agggaaggag gctttgccat cttggctgca 721 ctgggactca cagagccaca ccctggaggg cctccccctt gacactgata agggtgtgca 781 ttacatttca gtgagcgcta cacggctggg ggccaacggg agccacatcc cccagacctc 841 cagtgtgttc tccatcgagg tctaccctga agaccacagt gatctgcagt cggtgaggac 901 agcctcccca gaccctggtg aggtggtatc atctgcctgt gctgcggatg aacctgtgac 961 tgttttgacg gtgattttgg atgccgacct caccaagatg accccaaagc aaaggattga 1021 cctcctgcac aggatgcgga gcttctcaga agtagagctt cacaacatga aattagtgcc 1081 ggtggtgaat aacagactat ttgacatgtc ggccttcatg gctggcccgg gaaatccaaa 1141 aaaggtggtg gagaatgggg cccttctctc ctggaagctg ggctgctccc tgaaccagaa 1201 cagtgtgcct gacattcatg gtgtagaggc ccctgccagg gagggcgcaa tgtctgctca 1261 gcttggctac cctgtggtgg gttggcacat cgccaataag aagccccctc ttcccaaacg 1321 cgtccggagg cagatccatg ctacacccac acctgtcact gccattgggc ccccaaccac 1381 ggctatccag gagcccccat ccaggatcgt gccaaccccc acatctccag ccattgctcc 1441 tccaacagag accatggctc ctccagtcag ggatcctgtt cctgggaaac ccacggtcac 1501 catccggact cgaggcgcca ttattcaaac cccaacccta ggccccatcc agcctactcg 1561 ggtgtcagaa gctggcacca cagttcctgg ccagattcgc ccaacgatga ccattcctgg 1621 ctatgtggag cctactgcag ttgctacccc tcccacaacc accaccaaga agccacgagt 1681 atccacacca aaaccagcaa cgccttcaac tgactccacc accaccacga ctcgcaggcc 1741 aaccaagaaa ccacggacac cccggccagt gccccgggtc accaccaaag tttccatcac 1801 cagattggaa actgcctcac cgcctactcg tattcgcacc accaccagtg gagtgccccg 1861 tggcggagaa cccaaccagc gcccagagct caagaaccat attgacaggg tagatgcctg 1921 ggttggcacc tactttgagg tgaagatccc gtcagacact ttctatgacc atgaggacac 1981 caccactgac aagctgaagc tgaccctgaa actgcgggag cagcagctgg tgggcgagaa 2041 gtcctgggta cagttcaaca gcaacagcca gctcatgtat ggccttcccg acagcagcca 2101 cgtgggcaaa cacgagtatt tcatgcatgc cacagacaag gggggcctgt cggctgtgga 2161 tgccttcgag atccacgtcc acaggcgccc ccaaggggat agggctcctg caaggttcaa 2221 ggccaagttt gtgggtgacc cggcactggt gttgaatgac atccacaaga agattgcctt 2281 ggtaaagaaa ctggccttcg cctttggaga ccgaaactgt agcaccatca ccctgcagaa 2341 tatcacccgg ggctccatcg tggtggaatg gaccaacaac acactgccct tggagccctg 2401 ccccaaggag cagatcgctg ggctgagccg ccggatcgct gaggatgatg gaaaacctcg 2461 gcctgccttc tccaacgccc tagagcctga ctttaaggcc acaagcatca ctgtgacggg 2521 ctctggcagt tgtcggcacc tacagtttat ccctgtggta ccacccagga gagtgccctc 2581 agaggcgccg cccacagaag tgcctgacag ggaccctgag aagagcagtg aggatgatgt 2641 ctacctgcac acagtcattc cggccgtggt ggtcgcagcc atcctgctca ttgctggcat 2701 cattgccatg atctgctacc gcaagaagcg gaagggcaag cttacccttg aggaccaggc 2761 caccttcatc aagaaggggg tgcctatcat ctttgcagac gaactggacg actccaagcc 2821 cccaccctcc tccagcatgc cactcattct gcaggaggag aaggctcccc taccccctcc 2881 tgagtacccc aaccagagtg tgcccgagac cactcctctg aaccaggaca ccatgggaga 2941 gtacacgccc ctgcgggatg aggatcccaa tgcgcctccc taccagcccc caccgccctt 3001 cacagtaccc atggagggca agggctcccg tcccaagaac atgaccccat accggtcacc 3061 tcctccctat gtcccacctt aacccgcaag cgcctgggtg gaggcagggt agggcagggc 3121 cctggagacg acatggtgtt gtctgtggag accggtggcc tgcagaccat tgcccaccgg 3181 gagccgacac ctgacctagc acacactgac acaggggcct ggacaagccc gccctctctg 3241 gtcctcccaa accccaaagc agctggagag actttgggga cttttttatt tttatttttt 3301 gcctaacagc ttttggtttg ttcatagaga actcttcgct tcatttttga tggctggctc 3361 tgaaagcacc atgtggagtg gaggtggagg gaccgaggaa ccatgaatga actcgcaggc 3421 agtgccgggc ggccccctgg ctctctgcgt tttgccttta acactaactg tactgttttt 3481 tctattcacg tgtgtctagc tgcaggatgt aacatggaaa acagtaacta aagattaaat 3541 tcaaaggact ttcagaagtt aaggttaagt ttttacgttt aatctgctgt ttacctaaac 3601 ttgtatgtat aatttttggg tgggtatggg gaattgcttt gctaaaaata agctcccagg 3661 gtgtttcaaa cttagagaag accaagggac agtatttttt atcaaaggaa tactattttt 3721 tcacactacg tcaacttggt tgctctgata ccccagagcc tgattggggg cctcccggcc 3781 ctggctcacg ccaagtccct ggtgctgggt ttgctctccc gctgttgcca ggggctggaa 3841 gctggagggg tctcttgggc catggacatc cccacttcca gcccatgtac actagtggcc 3901 cacgaccaag gggtcttcat ttccatgaaa aagggactcc aagaggcagt ggtggctgtg 3961 gcccccaact ttggtgctcc agggtgggcc aactgcttgt gggggcacct gggaggtcaa 4021 aggtctccac cacatcaacc tattttgttt tacccttttt ctgtgcattg tttttttttt 4081 tcctcctaaa aggaatatca cggttttttg aaacactcag tgggggacat tttggtgaag 4141 atgcaatatt tttatgtcat gtgatgctct ttcctcactt gaccttggcc gctttgtcct 4201 aacagtccac agtcctgccc cgacccaccc catccctttt ctctggcact ccagtccagc 4261 ttgggcctga actactggaa aaggtctggc ggctggggag gagtgccagc aatagttcat 4321 aataaaaatc tgttagctct caaagctaat tttttactaa agtttttata cagcctcaaa 4381 ttgttttatt aaaaaaaaga tttaaaatgg tgatgcttac agcagtttgt acgagctctt 4441 aagtgttgat tccatggaac tgacggcttt gcttgttttg attcttttcc ccctactttt 4501 cctaatggtt taaattctgg aattacactg gggttctttt gcctttttta gcagaacatc 4561 cgtccgtcca tctgcatctc tgtcccatga ctcaggggcg cccactctgc ttcgattctc 4621 ctcctgtgga agaaaccatt ttgagcatga cttttcttga tgtctgaagc gttattttgg 4681 gtacttttta gggaggaatg cctttcgcaa taatgtatcc attcccctga ttgagggtgg 4741 gtgggtggac ccaggctccc tttgcacaca gagcagctac ttctaagcca tatcgactgt 4801 tttgcagagg atttgtgtgt cctccctcag gaggggaggc ctggtaggag ggggggagag 4861 ttctctgtcc tactgctctc aagagggcat ttccccttgc gccttctccc acagggccca 4921 gcccctctcc cctgcccaag tccccagggg gtactctgga gtgagcagtc cccctgtggg 4981 ggagcctgta aatgcgggct cagtggacca ctggtgactg ggctcatgcc tccaagtcag 5041 agtttcccct ggtgccccag agacaggagc acaagtggga tctgacctgg tgagattatt 5101 tctgatgacc tcatcaaaaa ataaacaatt cccaatgttc caggtgaggg ctttgaaagg 5161 ccttccaaac agctccgtcg cccctagcaa ctccaccatt gggcactgcc atgcagagac 5221 gtggctggcc cagaatggcc tgttgccata gcaactggag gcgatggggc agtgaacaga 5281 ataacaacag caacaatgcc tttgcaggca gcctgctccc ctgagcgctg ggctggtgat 5341 ggccgttgga ctctgtgaga tggagagcca atctcacatt caagtgttca ccaaccactg 5401 atgtgttttt atttccttct atatgatttt aagatgtgtt ttctgcattc tgtaaagaaa 5461 catatcaaac taaataaaag cagtgtcttt att // LOCUS HUMDAGK 3000 bp mRNA PRI 02-MAY-1996 DEFINITION Human diacylglycerol kinase (DAGK) mRNA, complete cds. ACCESSION L38707 NID g606756 KEYWORDS diacylglycerol kinase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3000) AUTHORS Pilz,A., Schaap,D., Hunt,D. and Fitzgibbon,J. TITLE Chromosomal localization of three mouse diacylglycerol kinase (DAGK) genes: genes sharing sequence homology to the Drosophila retinal degeneration A (rdgA) gene JOURNAL Genomics 26 (3), 599-601 (1995) MEDLINE 95331799 FEATURES Location/Qualifiers source 1..3000 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Unassigned" mRNA 1..3000 /gene="DAGK" /note="G00-126-733" gene 1..3000 /gene="DAGK" CDS 39..2867 /gene="DAGK" /EC_number="2.7.1.107" /codon_start=1 /db_xref="GDB:G00-126-733" /product="diacylglycerol kinase" /db_xref="PID:g606757" /translation="MAAAAEPGARAWLGGGSPRPGSPACSPVLGSGGRARPGPGPGPG RDRAGGVRARARAAPGHSFRKVTLTKPTFCHLCSDFIWGLAGFLCDVCNFMSHEKCLK HVRIPCTSVAPSLVRVPVAHCFGPRGLHKRKFCAVCRKVLEAPALHCEVCELHLHPDC VPFACSDCRQCHQDGHQDHDTHHHHWREGNLPSGARCEVCRKTCGSSDVLAGVRCEWC GVQAHSLCSAALAPECGFGRLRSLVLPPACVRLLPGGFSKTQSFRIVEAAEPGEGGDG ADGSAAVGPGRETQATPESGKQTLKIFDGDDAVRRSQFRLVTVSRLAGAEEVLEAALR AHHIPEDPGHLELCRLPPSSQACDAWAGGKAGSAVISEEGRSPGSGEATPEAWVIRAL PRAQEVLKIYPGWLKVGVAYVSVRVTPKSTARSVVLEVLPLLGRQAESPESFQLVEVA MGCRHVQRTMLMDEQPLLDRLQDIRQMSVRQVSQTRFYVAESRDVAPHVSLFVGGLPP GLSPEEYSSLLHEAGATKATVVSVSHIYSSQGAVVLDVACFAEAERLYMLLKDMAVRG RLLTALVLPDLLHAKLPPDSCPLLVFVNPKSGGLKGRDLLCSFRKLLNPHQVFDLTNG GPLPGLHLFSQVPCFRVLVCGGDGTVGWVLGALEETRYRLACPEPSVAILPLGTGNDL GRVLRWGAGYSGEDPFSVLLSVDEADAVLMDRWTILLDAHEAGSAENDTADAEPPKIV QMSNYCGIGIDAELSLDFHQAREEEPGKFTSRLHNKGVYVRVGLQKISHSRSLHKQIR LQVERQEVELPSIEGLIFINIPSWGSGADLWGSDSDTRFEKPRMDDGLLEVVGVTGVV HMGQVQGGLRSGIRIAQGSYFRVTLLKATPVQVDGEPWVQAPGHMIISAAGPKVHMLR KAKQKPRRAGTTRDARADRAPAPESDPR" BASE COUNT 472 a 965 c 1059 g 503 t 1 others ORIGIN 1 gggcggacct aaaggggctc gggccgctcg ggccgggaat ggcggcggcg gccgagcccg 61 gggcccgcgc ctggctgggc ggcggctccc cgcgccccgg cagcccggcc tgcagccccg 121 tgctgggctc aggaggccgc gcgcgcccgg ggccggggcc ggggccggga cgngaccgag 181 cgggcggcgt cagagcccgg gcccgtgccg cgccgggaca cagcttccgg aaggtgacgc 241 tcaccaagcc caccttctgc cacctctgct ccgacttcat ctgggggctg gccggcttcc 301 tgtgcgacgt ctgcaatttc atgtctcatg agaagtgcct gaagcacgtg aggatcccgt 361 gcacgagtgt ggcacccagc ctggtccggg ttcctgtagc ccactgcttc ggcccccggg 421 ggctccacaa gcgcaagttc tgtgctgtct gccgcaaggt cctggaggca ccggcgctcc 481 actgcgaagt gtgtgagctg cacctccacc cagactgtgt gcccttcgcc tgcagtgact 541 gccgccagtg ccaccaggat gggcaccagg atcacgacac ccatcaccac cactggcggg 601 aggggaacct gccctcggga gcgcgctgcg aggtctgcag gaagacgtgc ggctcctctg 661 acgtgctggc cggcgtgcgc tgcgagtggt gcggggtcca ggcgcactcc ctctgctccg 721 cggcactggc tcccgagtgt ggcttcgggc gtctgcgctc cctggtcctg cctcccgcgt 781 gcgtgcgcct tctgcccggc ggcttcagca agacgcagag cttccgcatc gtggaggccg 841 cggagccggg cgaggggggc gacggcgccg acgggagcgc tgccgtgggt ccaggcagag 901 agacacaggc aactccggag tccgggaagc aaacgctgaa gatctttgat ggcgacgacg 961 cggtgagaag aagccagttc cgcctcgtca cggtgtcccg cctggccggt gccgaggagg 1021 tgctggaggc cgcactgcgg gcccaccaca tccccgagga ccctggccac ctggagctgt 1081 gccggctgcc cccttcctct caggcctgtg acgcctgggc tgggggcaag gctgggagtg 1141 ctgtgatctc ggaggagggc agaagccccg ggtccggcga ggccacgcca gaggcctggg 1201 tcatccgggc tctgccgcgg gcccaggagg tcctgaagat ctaccctggc tggctcaagg 1261 tgggcgtggc ctacgtgtcc gtgcgagtga cccctaagag cacggctcgc tctgtggtgc 1321 tggaggtcct gccgctgctc ggccgccagg ccgagagtcc cgagagcttc cagctggtgg 1381 aggtggcgat gggctgcagg cacgtccagc ggacgatgct gatggacgaa cagcccctgc 1441 tggaccggct acaggacatc cggcagatgt ctgtgcggca ggtgagccag acgcggttct 1501 acgtggcaga gagcagggat gtagccccgc acgtctccct gtttgttggc ggcctgcctc 1561 ccggcctgtc tcccgaggag tacagcagcc tgctgcatga ggccggggct accaaagcca 1621 ccgtggtgtc cgtgagtcac atctactcct cccaaggcgc ggtagtgttg gacgttgcct 1681 gctttgcgga ggccgagcgg ctgtacatgc tgctgaagga catggctgtg cggggccggc 1741 tgctcactgc cctggtgctc cccgacctgc tgcacgcgaa gctgccccca gacagctgtc 1801 ccctccttgt gttcgtgaac cccaagagtg gaggcctcaa gggccgagac ctgctctgca 1861 gcttccggaa gctactgaac cctcatcagg tcttcgacct gaccaacgga ggtcctcttc 1921 ccgggctcca cctgttctcc caggtgccct gcttccgggt gctggtgtgt ggtggcgatg 1981 gcactgtggg ctgggtgctt ggcgccctgg aggagacacg gtaccgactg gcctgcccgg 2041 agccttctgt ggccatcctg cccctgggca cagggaatga ccttggtcga gtcctccgct 2101 ggggggcggg ctacagcggc gaggacccgt tctccgtact gctgtctgtg gacgaggccg 2161 acgccgtgct catggaccgc tggaccatcc tgctggatgc ccacgaagct ggcagtgcag 2221 agaacgacac ggcagacgca gagcccccca agatcgtgca gatgagtaac tactgtggca 2281 ttggcatcga cgcggagctg agcctggact tccaccaggc acgggaagag gagcctggca 2341 agttcacaag caggctgcac aacaagggtg tgtacgtgcg ggtggggctg cagaagatca 2401 gtcactctcg gagcctgcac aagcagatcc ggctgcaggt ggagcggcag gaggtggagc 2461 tgcccagtat tgaaggcctc atcttcatca acatccccag ctggggctcg ggggccgacc 2521 tgtggggctc cgacagcgac accaggtttg agaagccacg catggacgac gggctgctgg 2581 aggttgtggg cgtgacgggc gtcgtgcaca tgggccaggt ccagggtggg ctgcgctccg 2641 gaatccggat tgcccagggt tcctacttcc gagtcacgct cctcaaggcc accccggtgc 2701 aggtggacgg ggagccctgg gtccaggccc cggggcacat gatcatctca gctgctggcc 2761 ctaaggtgca catgctgagg aaggccaagc agaagccgag gagggccggg accaccaggg 2821 atgcccgggc ggatcgtgcg cctgcccctg agagcgatcc taggtagggg tggctggggc 2881 agcccaaggg ctcgagccat ctctgctccc gccagccttg ttttcaggtg gtctggaggc 2941 agctccacgt cacacagtgg ctgtcatata ttgaagttac cttcccactg gaaaaaaaat // LOCUS HUMDB1 2306 bp mRNA PRI 27-FEB-1996 DEFINITION Human mRNA for DB1, complete cds. ACCESSION D28118 NID g529640 KEYWORDS DB1; zinc finger. SOURCE Homo sapiens (library: lambda gt11 (T.Yokota)) adult lymphoma cDNA to mRNA, clone DB15. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2306) AUTHORS Koyano-Nakagawa,N., Nishida,J., Baldwin,D., Arai,K. and Yokota,T. TITLE Molecular cloning of a novel human cDNA encoding a zinc finger protein that binds to the interleukin-3 promoter JOURNAL Mol. Cell. Biol. 14 (8), 5099-5107 (1994) MEDLINE 94309629 REFERENCE 2 (bases 1 to 2306) AUTHORS Koyano-Nakagawa,N. TITLE Direct Submission JOURNAL Submitted (27-JAN-1994) to the DDBJ/EMBL/GenBank databases. Naoko Koyano-Nakagawa, The University of Tokyo, Inst. of Medical Science, Dept. of Mol. and Developmental Biology; 4-6-1 Shirokanedai, Minato-ku, Tokyo 108, Japan (E-mail:koyano@ims.u-tokyo.ac.jp, Tel:03-5449-5664, Fax:03-5449-5424) COMMENT Submitted (27-Jan-1994) to DDBJ by: Naoko Koyano-Nakagawa Molecular and Develpmental Biology The Institute of Medical Science The University of Tokyo 4-6-1 Shirokanedai, Minato-ku Tokyo 108 Japan Phone: 03-3443-8111 x664 Email: koyano@ims.u-tokyo.ac.jp Fax: 03-3443-5320. FEATURES Location/Qualifiers source 1..2306 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="jurkat" /cell_type="T-cell" /clone_lib="lambda gt11 (T.Yokota)" /dev_stage="adult" /tissue_type="lymphoma" CDS 42..1592 /function="A putative transcription factor. Contains six C2/H2-type zinc finger motifs." /codon_start=1 /label=DB1 /evidence=experimental /product="DB1" /db_xref="PID:d1006208" /db_xref="PID:g529641" /translation="MEANWTAFLFQAHEASHHQQQAAQNSLLPLLSSAVEPPDQKPLL PIPITQKPQGAPETLKDAIGIKKEKPKTSFVCTYCSKAFRDSYHLRRHESCHTGIKLV SRPKKTPTTVVPLISTIAGDSSRTSLVSTIAGILSTVTTSSSGTNPSSSASTTAMPVT QSVKKPSKPVKKNHACEMCGKAFRDVYHLNRHKLSHSDEKPFECPICNQRFKRKDRMT YHVRSHEGGITKPYTCSVCGKGFSRPDHLSCHVKHVHSTERPFKCQTCTAAFATKDRL RTHMVRHEGKVSCNICGKLLSAAYITSHLKTHGQSQSINCNTCKQGISKTCMSEETSN QKQQQQQQQQQQQQQHVTSWPGKQVETLRLWEEAVKARKKEAANLCQTSTAATTPVTL TTPFSITSSVSSETMSNPVTVAAAMSMRSPVNVSSAVNITSPMNIGHPVTITSPLSMT SPLTLTTPVNLPTPVTAPVNIAHPVTITSPMNLPTPMTLAAPLNIAMRPVESMPFLPQ ALPTSPPW" polyA_site 2306 BASE COUNT 719 a 561 c 469 g 557 t ORIGIN 1 agcgggggga gtggggagga ggggggtcgg ccgccgcagc catggaggcc aactggaccg 61 cgttcctgtt ccaggcccat gaagcttccc atcaccaaca gcaggcagca cagaacagct 121 tgctgcccct cctgagctct gccgtggagc cccctgatca gaaaccattg cttccaatac 181 caataactca gaaacctcag ggtgcaccag aaacattaaa ggatgccatt gggattaaaa 241 aagaaaaacc caaaacttca tttgtgtgca cttactgcag taaagctttc agggacagct 301 atcacctgag gcgccacgaa tcctgccaca cagggatcaa gttggtgtcc cggccaaaga 361 aaacccccac cacggtggtt ccccttatct ctaccatcgc tggggacagc agccgaactt 421 cgttggtctc gaccattgca ggcatcttgt caacagtcac tacatcttcc tcgggcacca 481 accccagtag cagtgccagc accacagcta tgccagtgac ccagtctgtc aagaaaccca 541 gtaagcctgt caagaagaac catgcttgtg agatgtgtgg gaaggccttc cgagatgtgt 601 accatctcaa tcgacacaag ctctcccatt cagatgagaa accctttgag tgtcctattt 661 gtaatcagcg cttcaagagg aaggaccgga tgacttacca tgtgaggtct catgaaggag 721 gcatcaccaa accctatact tgcagtgttt gtgggaaagg cttctcaagg cctgaccact 781 taagctgtca tgtaaaacat gtccattcaa cagaaagacc cttcaaatgc caaacgtgca 841 ctgctgcctt tgccaccaaa gacagactgc ggacacacat ggtgcgccat gaaggcaagg 901 tatcatgtaa catctgtggg aagctcctga gtgcagcata catcaccagc cacttaaaga 961 ctcatgggca gagccaaagt atcaactgta atacatgtaa acaaggcatc agtaaaacat 1021 gcatgagtga agagaccagt aaccaaaagc agcagcagca gcagcagcag caacaacaac 1081 aacaacaaca tgtgacaagc tggccaggga agcaagtaga aacactcaga ctgtgggaag 1141 aagctgttaa agcaaggaag aaagaagctg ctaacctgtg ccaaacctcc acggctgcta 1201 cgacacctgt gactctcact actccattca gtataacatc ctctgtgtcg tctgagacta 1261 tgtcaaaccc agtcacagtg gcagctgcaa tgagcatgag aagtccagta aatgtttcaa 1321 gtgcagttaa cataaccagc ccaatgaaca tagggcatcc tgtaactata accagtccat 1381 tatccatgac ctctccttta acactcacta ccccagtcaa cctccccacc cccgtcactg 1441 ccccagtgaa tatagcacac cctgtcacca tcacatctcc aatgaatcta cccacaccta 1501 tgacattagc cgcccctctc aatatagcaa tgagacctgt agagagcatg cctttcttgc 1561 cccaagcttt gcctacatca ccgccttggt aaacagtatt ataaaatcaa aatatgggta 1621 aaagtaaata tttaccagca acttaacttt tagttgatta aagcaaaaag taaaccatga 1681 aattgggaga ttttattaca ttagttaata agagtgtggt agcatttttc tccaatttgg 1741 ctgggattat tcaaagtagg gtgtgtatgt aacttatcac tggaccactt tagtttaatc 1801 agaaattcct tttagctgac aacattgctt aaacaggata gtagttggca agatgaaatg 1861 ccagaattaa aaccaatcat aagtagaacc cacttcaaaa taaaaaaaca gcattactat 1921 ttctaatccc aaggaatcac tttattgtaa acactagcag aactcttctc cctatacaag 1981 gtggatggct gattttaacc tgaaatttta aatccacaga ttgagagcta gtgtagaatt 2041 gtctgtgttt attgttttta tgagtaaata catgcattgt cataataaaa tgcatttcag 2101 agaatatgca ttttaccttt gggaatatgt taatttcagg cagcattccc tatgggaaag 2161 gtgataccag ctctgatatg caaagcatat gataatttat cattctaact tcaacgtata 2221 atagggattg tgacctgata tttggagatg taaatattgc tcagcatatt aatcccgatg 2281 gaatatagca ttgtagttga cttttt // LOCUS HUMDBI 556 bp mRNA PRI 02-NOV-1994 DEFINITION Human diazepam binding inhibitor (DBI) mRNA, complete cds. ACCESSION M14200 NID g181477 KEYWORDS . SOURCE Human hypothalamus, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 556) AUTHORS Gray,P.W., Glaister,D., Seeburg,P.H., Guidotti,A. and Costa,E. TITLE Cloning and expression of cDNA for human diazepam binding inhibitor, a natural ligand of an allosteric regulatory site of the gamma-aminobutyric acid type A receptor JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83 (19), 7547-7551 (1986) MEDLINE 87016986 FEATURES Location/Qualifiers source 1..556 /organism="Homo sapiens" /db_xref="taxon:9606" /map="2q12-q21" mRNA 20..556 /note="DBI mRNA" gene 20..334 /gene="DBI" CDS 20..334 /gene="DBI" /note="diazepam binding inhibitor" /codon_start=1 /db_xref="GDB:G00-119-837" /db_xref="PID:g181478" /translation="MWGDLWLLPPASANPGTGTEAEFEKAAEEVRHLKTKPSDEEMLF IYGHYKQATVGDINTERPGMLDFTGKAKWDAWNELKGTSKEDAMKAYINKVEELKKKY GI" BASE COUNT 161 a 115 c 144 g 136 t ORIGIN 1 ccttgtctga gaccgagcta tgtggggcga cctctggctc ctcccgcctg cctctgccaa 61 tccgggcact gggacagagg ctgagtttga gaaagctgca gaggaggtta ggcaccttaa 121 gaccaagcca tcggatgagg agatgctgtt catctatggc cactacaaac aagcaactgt 181 gggcgacata aatacagaac ggcccgggat gttggacttc acgggcaagg ccaagtggga 241 tgcctggaat gagctgaaag ggacttccaa ggaagatgcc atgaaagctt acatcaacaa 301 agtagaagag ctaaagaaaa aatacgggat atgagagact ggatttggtt actgtgccat 361 gtgtttatcc taaactgaga caatgccttg tttttttcta ataccgtgga tggtgggaat 421 tcgggaaaat aaccagttaa accagctact caaggctgct caccatacgg ctctaacaga 481 ttaggggcta aaacgattac tgactttcct tgagtagttt ttatctgaaa tcaattaaaa 541 gtgtatttgt tacttt // LOCUS HUMDBP 1653 bp mRNA PRI 02-NOV-1994 DEFINITION Human serum vitamin D-binding protein (hDBP) mRNA, complete cds. ACCESSION M12654 NID g181481 KEYWORDS vitamin D-binding protein. SOURCE Human liver, library of S.Woo, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1653) AUTHORS Cooke,N.E. and David,E.V. TITLE Serum vitamin D-binding protein is a third member of the albumin and alpha fetoprotein gene family JOURNAL J. Clin. Invest. 76 (6), 2420-2424 (1985) MEDLINE 86086396 COMMENT Draft entry and sequence in computer readable form for [1] kindly provided by N.E.Cooke, 24-JUN-1986. FEATURES Location/Qualifiers source 1..1653 /organism="Homo sapiens" /db_xref="taxon:9606" /map="4q12-q13" mRNA <1..1653 /note="hDBP mRNA" sig_peptide 29..76 /gene="GC" /note="serum vitamin D-binding protein signal peptide" gene 29..1453 /gene="GC" CDS 29..1453 /gene="GC" /note="serum vitamin D-binding protein precursor" /codon_start=1 /db_xref="GDB:G00-119-263" /db_xref="PID:g181482" /translation="MKRVLVLLLAVAFGHALERGRDYEKNKVCKEFSHLGKEDFTSLS LVLYSRKFPSGTFEQVSQLVKEVVSLTEACCAEGADPDCYDTRTSALSAKSCESNSPF PVHPGTAECCTKEGLERKLCMAALKHQPQEFPTYVEPTNDEICEAFRKDPKEYANQFM WEYSTNYEQAPLSLLVSYTKSYLSMVGSCCTSASPTVCFLKERLQLKHLSLLTTLSNR VCSQYAAYGEKKSRLSNLIKLAQKVPTADLEDVLPLAEDITNILSKCCESASEDCMAK ELPEHTVKLCDNLSTKNSKFEDCCQEKTAMDVFVCTYFMPAAQLPELPDVRLPTNKDV CDPGNTKVMDKYTFELSRRTHLPEVFLSKVLEPTLKSLGECCDVEDSTTCFNAKGPLL KKELSSFIDKGQELCADYSENTFTEYKKKLAERLKAKLPEATPTELAKLVNKRSDFAS NCCSINSPPLYCDSEIDAELKNIL" mat_peptide 77..1450 /gene="GC" /note="serum vitamin D-binding protein" BASE COUNT 504 a 378 c 346 g 425 t ORIGIN 6 bp upstream of Fnu4HI site; chromosome 4q11-q13. 1 cggtgctgca agactctctg gtagaaaaat gaagagggtc ctggtactac tgcttgctgt 61 ggcatttgga catgctttag agagaggccg ggattatgaa aagaataaag tctgcaagga 121 attctcccat ctgggaaagg aggacttcac atctctgtca ctagtcctgt acagtagaaa 181 atttcccagt ggcacgtttg aacaggtcag ccaacttgtg aaggaagttg tctccttgac 241 cgaagcctgc tgtgcggaag gggctgaccc tgactgctat gacaccagga cctcagcact 301 gtctgccaag tcctgtgaaa gtaattctcc attccccgtt cacccaggca ctgctgagtg 361 ctgcaccaaa gagggcctgg aacgaaagct ctgcatggct gctctgaaac accagccaca 421 ggaattcccc acctacgtgg aacccacaaa tgatgaaatc tgtgaggcgt tcaggaaaga 481 tccaaaggaa tatgctaatc aatttatgtg ggaatattcc actaattacg aacaagctcc 541 tctgtcactt ttagtcagtt acaccaagag ttatctttct atggtagggt cctgctgtac 601 ctctgcaagc ccaactgtat gctttttgaa agagagactc cagcttaaac atttatcact 661 tctcaccact ctgtcaaata gagtctgctc acaatatgct gcttatgggg agaagaaatc 721 aaggctcagc aatctcataa agttagccca aaaagtgcct actgctgatc tggaggatgt 781 tttgccacta gctgaagata ttactaacat cctctccaaa tgctgtgagt ctgcctctga 841 agattgcatg gccaaagagc tgcctgaaca cacagtaaaa ctctgtgaca atttatccac 901 aaagaattct aagtttgaag actgttgtca agaaaaaaca gccatggacg tttttgtgtg 961 cacttacttc atgccagctg cccaactccc cgagcttcca gatgtgagat tgcccacaaa 1021 caaagatgtg tgtgatccag gaaacaccaa agtcatggat aagtatacat ttgaactaag 1081 cagaaggact catcttccgg aagtattcct cagtaaggta cttgagccaa ccctaaaaag 1141 ccttggtgaa tgctgtgatg ttgaagactc aactacctgt tttaatgcta agggccctct 1201 actaaagaag gaactatctt ctttcattga caagggacaa gaactatgtg cagattattc 1261 agaaaataca tttactgagt acaagaaaaa actggcagag cgactaaaag caaaattgcc 1321 tgaggccaca cccacggaac tggcaaagct ggttaacaag cgctcagact ttgcctccaa 1381 ctgctgttcc ataaactcac ctcctcttta ctgtgattca gagattgatg ctgaattgaa 1441 gaatatcctg tagtcctgaa gcatgtttat taactttgac cagagttgga gccacccaag 1501 ggaatgatct ctgatgacct aacctaagca aaaccactga gcttctggga agacaactag 1561 gatactttct actttttcta gctacaatat cttcatacaa tgacaagtat gatgatttgc 1621 tatcaaaata aattgaaata taatgcaaac cat // LOCUS HUMDCKATPB 2460 bp mRNA PRI 26-FEB-1991 DEFINITION Human deoxycytidine kinase mRNA, complete cds. ACCESSION M60527 NID g181509 KEYWORDS ATP-binding protein; deoxycytidine kinase. SOURCE Human cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2460) AUTHORS Chottiner,E.G., Shewach,D.S., Datta,N.S., Ashcraft,E., Gribbin,D., Ginsburg,D., Fox,I.H. and Mitchell,B.S. TITLE Cloning and expression of human deoxycytidine kinase cDNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88, 1531-1535 (1991) MEDLINE 91142207 FEATURES Location/Qualifiers source 1..2460 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..2460 /product="deoxycytidine kinase" CDS 160..942 /EC_number="2.7.1.74" /codon_start=1 /product="deoxycytidine kinase" /db_xref="PID:g181510" /translation="MATPPKRSCPSFSASSEGTRIKKISIEGNIAAGKSTFVNILKQL CEDWEVVPEPVARWCNVQSTQDEFEELTMSQKNGGNVLQMMYEKPERWSFTFQTYACL SRIRAQLASLNGKLKDAEKPVLFFERSVYSDRYIFASNLYESECMNETEWTIYQDWHD WMNNQFGQSLELDGIIYLQATPETCLHRIYLRGRNEEQGIPLEYLEKLHYKHESWLLH RTLKTNFDYLQEVPILTLDVNEDFKDKYESLVEKVKEFLSTL" BASE COUNT 752 a 418 c 443 g 847 t ORIGIN 1 aaagtcaaac cccgacaccg cggcgggccg gtgagctcac tagctgaccc ggcaggtcag 61 gatctggctt agcggcgccg cgagctccag tgcgcgcacc cgtggccgcc tcccagccct 121 ctttgccgga cgagctctgg gccgccacaa gactaaggaa tggccacccc gcccaagaga 181 agctgcccgt ctttctcagc cagctctgag gggacccgca tcaagaaaat ctccatcgaa 241 gggaacatcg ctgcagggaa gtcaacattt gtgaatatcc ttaaacaatt gtgtgaagat 301 tgggaagtgg ttcctgaacc tgttgccaga tggtgcaatg ttcaaagtac tcaagatgaa 361 tttgaggaac ttacaatgtc tcagaaaaat ggtgggaatg ttcttcagat gatgtatgag 421 aaacctgaac gatggtcttt taccttccaa acatatgcct gtctcagtcg aataagagct 481 cagcttgcct ctctgaatgg caagctcaaa gatgcagaga aacctgtatt attttttgaa 541 cgatctgtgt atagtgacag gtatattttt gcatctaatt tgtatgaatc tgaatgcatg 601 aatgagacag agtggacaat ttatcaagac tggcatgact ggatgaataa ccaatttggc 661 caaagccttg aattggatgg aatcatttat cttcaagcca ctccagagac atgcttacat 721 agaatatatt tacggggaag aaatgaagag caaggcattc ctcttgaata tttagagaag 781 cttcattata aacatgaaag ctggctcctg cataggacac tgaaaaccaa cttcgattat 841 cttcaagagg tgcctatctt aacactggat gttaatgaag actttaaaga caaatatgaa 901 agtctggttg aaaaggtcaa agagtttttg agtactttgt gatcttgctg aagactacag 961 gcagccaaat ggttccagat acttcagctt tgtgtatctt cgtaacttca tattaatata 1021 agtttcttta gaaaacccaa gtttttaatc gtttttgttt taaggaaaaa agatttttaa 1081 aatgaatctt atgcaaaact ttttgatcag tttcttttct tttgtttttt ttttaaaaaa 1141 gacatttaaa gacaaagaca ttatttctca tagcaggaaa tgtagaggta gatggttcca 1201 gtatcagcat agtgactaaa ctacattata aaagatccag cttccttctg tcattcccct 1261 cttttgtctt cctcagcagg ttggcttttt tccctggtgc ctctcacttc gttggtgacc 1321 agtttcttaa actgaaagct ttaatgttac atagtaaatg gtagtgtgtc ctgtgtaaat 1381 tagtgtacct attaaaagtt gcaaagtgga attaaaggaa tccctagaat aaggattctg 1441 aagttttatt ttaaattatt atcttcttaa cagtttagtc ccacctctta cttcctgcct 1501 cagtctgctt tctctactgt ctggattaat taggcagcct gctataaagt taaagtcaca 1561 catttctatt ttgcaaacac tgtgattact ctttgctttg tagtttgctt tgctttgtag 1621 ggttctgctt ttaagttttt ctctttttca gacaaattac tgataaaaat gatattgctc 1681 tatatgtaat atatcctgaa agcattattt tttgttgaat aggaaataaa attaatgaag 1741 acagaggcta gaaagcatcc attaattaat gagacacact taactactta tctctaaacc 1801 atctatgtga atatttgtaa aaataatgaa tggactcatc ttagttctgt atataaatat 1861 attttctttc tagtttgttt agttaaggtg tgcagtgttt ttcctgtgta ttaaaccttt 1921 ccattttacg ttttagaaaa ttttatgtat tttaaaataa ggggaagagt cattttcacc 1981 tttaaactac tatttttctt tccaagtcat ttttgttttt ggtttcttat tcaaagatga 2041 taatttagtg gattaaccag tccagacgca ctgatctttg caaaggagac ttaatttcaa 2101 atctgtaatt accatacata aactgtctca ttatacgtat gcattttttt agtttgtttt 2161 tgtttggtat aaattaattt gttaattaaa tatttcttaa gtataaacct tatgaactac 2221 agtggagcta cactcattga aatgtaattt cagttctaaa aagatgtaat aatcatttta 2281 gaattaaaat ttattctact tttaaataaa ttatgaatat taaaggtgaa aattgtataa 2341 attactttga ttccatttta agtggagaca tatttcagtg atttttagta acctttaaaa 2401 atgtataatg acttttaaaa tttgtagaat tgaaaagacg ctaataaaaa tttattattt // LOCUS HUMDDC 1930 bp mRNA PRI 31-DEC-1994 DEFINITION Human aromatic amino acid decarboxylase (ddc) mRNA, complete cds. ACCESSION M76180 M30772 NID g181520 KEYWORDS DOPA decarboxylase; aromatic amino acid decarboxylase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1930) AUTHORS Ichinose,H., Kurosawa,Y., Titani,K., Fujita,K. and Nagatsu,T. TITLE Isolation and characterization of a cDNA clone encoding human aromatic L-amino acid decarboxylase JOURNAL Biochem. Biophys. Res. Commun. 164 (3), 1024-1030 (1989) MEDLINE 90073624 FEATURES Location/Qualifiers source 1..1930 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="pheochromocytoma" gene 70..1512 /gene="ddc" CDS 70..1512 /gene="ddc" /standard_name="DOPA decarboxylase" /EC_number="4.1.1.28" /note="The amino acid sequence Asn-Phe-Asn-Pro-His-Lys-Trp around a possible cofactor (pyridoxal phosphate) binding site is identical among species." /codon_start=1 /function="Decarboxylation of aromatic amino acids" /evidence=experimental /label=hAADC /product="aromatic amino acid decarboxylase" /db_xref="PID:g181521" /translation="MNASEFRRRGKEMVDYVANYMEGIEGRQVYPDVEPGYLRPLIPA AAPQEPDTFEDIINDVEKIIMPGVTHWHSPYFFAYFPTASSYPAMLADMLCGAIGCIG FSWAASPACTELETVMMDWLGKMLELPKAFLNEKAGEGGGVIQGSASEATLVALLAAR TKVIHRLQAASPELTQAAIMEKLVAYSSDQAHSSVERAGLIGGVKLKAIPSDGNFAMR ASALQEALERDKAAGLIPFFMVATLGTTTCCSFDNLLEVGPICNKEDIWLHVDAAYAG SAFICPEFRHLLNGVEFADSFNFNPHKWLLVNFDCSAMWVKKRTDLTGAFRLDPTYLK HSHQDSGLITDYRHWQIPLGRRFRSLKMWFVFRMYGVKGLQAYIRKHVQLSHEFESLV RQDPRFEICVEVILGLVCFRLKGSNKVNEALLQRINSAKKIHLVPCHLRDKFVLRFAI CSRTVESAHVQRAWEHIKELAADVLRAERE" BASE COUNT 492 a 461 c 505 g 472 t ORIGIN 1 ggagagagag aggacagaga gcaagtcact cccggctgcc tttttcacct ctgacagagc 61 ccagacacca tgaacgcaag tgaattccga aggagaggga aggagatggt ggattacgtg 121 gccaactaca tggaaggcat tgagggacgc caggtctacc ctgacgtgga gcccgggtac 181 ctgcggccgc tgatccctgc cgctgcccct caggagccag acacgtttga ggacatcatc 241 aacgacgttg agaagataat catgcctggg gtgacgcact ggcacagccc ctacttcttc 301 gcctacttcc ccactgccag ctcgtacccg gccatgcttg cggacatgct gtgcggggcc 361 attggctgca tcggcttctc ctgggcggca agcccagcat gcacagagct ggagactgtg 421 atgatggact ggctcgggaa gatgctggaa ctaccaaagg catttttgaa tgagaaagct 481 ggagaagggg gaggagtgat ccagggaagt gccagtgaag ccaccctggt ggccctgctg 541 gccgctcgga ccaaagtgat ccatcggctg caggcagcgt ccccagagct cacacaggcc 601 gctatcatgg agaagctggt ggcttactca tccgatcagg cacactcctc agtggaaaga 661 gctgggttaa ttggtggagt gaaattaaaa gccatcccct cagatggcaa cttcgccatg 721 cgtgcgtctg ccctgcagga agccctggag agagacaaag cggctggcct gattcctttc 781 tttatggttg ccaccctggg gaccacaaca tgctgctcct ttgacaatct cttagaagtc 841 ggtcctatct gcaacaagga agacatatgg ctgcacgttg atgcagccta cgcaggcagt 901 gcattcatct gccctgagtt ccggcacctt ctgaatggag tggagtttgc agattcattc 961 aactttaatc cccacaaatg gctattggtg aattttgact gttctgccat gtgggtgaaa 1021 aagagaacag acttaacggg agcctttaga ctggacccca cttacctgaa gcacagccat 1081 caggattcag ggcttatcac tgactaccgg cattggcaga taccactggg cagaagattt 1141 cgctctttga aaatgtggtt tgtatttagg atgtatggag tcaaaggact gcaggcttat 1201 atccgcaagc atgtccagct gtcccatgag tttgagtcac tggtgcgcca ggatccccgc 1261 tttgaaatct gtgtggaagt cattctgggg cttgtctgct ttcggctaaa gggttccaac 1321 aaagtgaatg aagctcttct gcaaagaata aacagtgcca aaaaaatcca cttggttcca 1381 tgtcacctca gggacaagtt tgtcctgcgc tttgccatct gttctcgcac ggtggaatct 1441 gcccatgtgc agcgggcctg ggaacacatc aaagagctgg cggccgacgt gctgcgagca 1501 gagagggagt aggagtgaag ccagctgcag gaatcaaaaa ttgaagagag atatatctga 1561 aaactggaat aagaagcaaa taaatatcat cctgccttca tggaactcag ctgtctgtgg 1621 cttcccatgt ctttctccaa agccatccag agggttgtga ttttgtctgc ttagtatctc 1681 atcaacaaag aaatattatt tgctaattaa aaagttaatc ttcatggcca tagcttttat 1741 tcattagctg tgatttttgt tgattaaaac attatagatt ttcatgttct tgcagtcatc 1801 agaagtggta ggaaagcctc actgatatat tttccagggc aatcaatgtt cacgcaactt 1861 gaaattatat ctgtggtctt caaattgtct tttgtcatgt ggctaaatgc ctaataaaca 1921 attcaagtga // LOCUS HUMDFSNSIX 452 bp mRNA PRI 19-JAN-1993 DEFINITION Homo sapiens defensin 6 mRNA, complete cds. ACCESSION M98331 NID g181546 KEYWORDS defensin 6. SOURCE Homo sapiens small intestine cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 452) AUTHORS Jones,D.E. and Bevins,C.L. TITLE Defensin-6 mRNA in human paneth cells: implications for antimicrobial peptides in host defence of the human bowel JOURNAL FEBS Lett. 315, 187-192 (1993) MEDLINE 93114459 FEATURES Location/Qualifiers source 1..452 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="small intestine" 5'UTR 1..18 CDS 19..321 /note="prepropeptide" /codon_start=1 /product="defensin 6" /db_xref="PID:g181547" /translation="MRTLTILTAVLLVALQAKAEPLQAEDDPLQAKAYEADAQEQRGA NDQDFAVSFAEDASSSLRALGSTRAFTCHCRRSCYSTEYSYGTCTVMGINHRFCCL" 3'UTR 319..440 allele 390 /replace="t" polyA_signal 424..429 BASE COUNT 124 a 118 c 98 g 112 t ORIGIN 1 cctccagcga ccctagccat gagaaccctc accatcctca ctgctgttct cctcgtggcc 61 ctccaggcca aggctgagcc actccaagct gaggatgatc cactgcaggc aaaagcttat 121 gaggctgatg cccaggagca gcgtggggca aatgaccagg actttgccgt ctcctttgca 181 gaggatgcaa gctcaagtct tagagctttg ggctcaacaa gggctttcac ttgccattgc 241 agaaggtcct gttattcaac agaatattcc tatgggacct gcactgtcat gggtattaac 301 cacagattct gctgcctctg agggatgaga acagagagaa atatattcat aatttacttt 361 atgacctaga aggaaactgt cgtgtgtccc atacattgcc atcaactttg tttcctcatc 421 tcaaataaag tcctttcagc aaaaaaaaaa aa // LOCUS HUMDGK 3758 bp mRNA PRI 15-APR-1996 DEFINITION Human mRNA for diacylglycerol kinase gamma, complete cds. ACCESSION D26135 NID g516757 KEYWORDS diacylglycerol kinase gamma. SOURCE Homo sapiens cell-line HepG2 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3758) AUTHORS Kai,M., Sakane,F., Imai,S., Wada,I. and Kanoh,H. TITLE Molecular cloning of a diacylglycerol kinase isozyme predominantly expressed in human retina with a truncated and inactive enzyme expression in most other human cells JOURNAL J. Biol. Chem. 269 (28), 18492-18498 (1994) MEDLINE 94308084 REFERENCE 2 (bases 1 to 3758) AUTHORS Kanoh,H. TITLE Direct Submission JOURNAL Submitted (13-DEC-1993) to the DDBJ/EMBL/GenBank databases. Hideo Kanoh, Sapporo Medical University School of Medicine, Department of Biochemistry; West-17, South-1, Sapporo, Hokkaido 060, Japan (E-mail:kanoh@serpent.cc.sapmed.ac.jp, Tel:011-611-2111(ex.2290), Fax:011-612-5861) COMMENT Submitted (13-Dec-1993) to DDBJ by: Hideo Kanoh Department of Biochemistry Sapporo Medical University School of Medicine Weat 17 South 1 Chuo-ku, Sapporo Hokkaido 060 Japan Phone: 011-611-2111 x2290 Fax: 011-612-5861. FEATURES Location/Qualifiers source 1..3758 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HepG2" CDS 517..2892 /EC_number="2.7.1.107" /citation=[1] /codon_start=1 /evidence=experimental /product="diacylglycerol kinase gamma" /db_xref="PID:d1005674" /db_xref="PID:g516758" /translation="MGEERWVSLTPEEFDQLQKYSEYSSKKIKDALTEFNEGGSLKQY DPHEPISYDVFKLFMRAYLEVDLPQPLSTHLFLAFSQKPRHETSDHPTEGASNSEANS ADTNIQNADNATKADEACAPDTESNMAEKQAPAEDQVAATPLEPPVPRSSSSESPVVY LKDVVCYLSLLETGRPQDKLEFMFRLYDSDENGLLDQAEMDCIVNQMLHIAQYLEWDP TELRPILKEMLQGMDYDRDGFVSLQEWVHGGMTTIPLLVLLGMDDSGSKGDGGHAWTM KHFKKPTYCNFCHIMLMGVRKQGLCCTYCKYTVHERCVSKNIPGCVKTYSKAKRSGEV MQHAWVEGNSSVKCDRCHKSIKCYQSVTARHCVWCRMTFHRKCELSTLCDGGELRDHI LLPTSICPITRDRPGEKSDGCVSAKGELVMQYKIIPTPGTHPLLVLVNPKSGGRQGER ILRKFHYLLNPKQVFNLDNGGPTPGLNFFRDTPDFRVLACGGDGTVGWILDCIDKANF AKHPPVAVLPLGTGNDLARCLRWGGGYEGGSLTKILKDIEQSPLVMLDRWHLEVIPRE EVENGDQVPYSIMNNYFSIGVDASIAHRFHVMREKHPEKFNSRMKNKLWYFEFGTSET FAATCKKLHDHIELECDGVGVDLSNIFLEGIAILNIPSMYGGTNLWGENKKNRAVIRE SRKGVTDPKELKFCVQDLSDQLLEVVGLEGAMEMGQIYTGLKSAGRRLAQCASVTIRT NKLLPMQVDGEPWMQPCCTIKITHKNQAPMMMGPPQKSSFFSLRRKSRSKD" polyA_site 3758 BASE COUNT 983 a 980 c 991 g 804 t ORIGIN 1 cacggagata gacagctttg gagctgctga actccgagca cagggtgaag accccggcgc 61 taccaaccac agcctggcag cctggtctcc gcggcaccca ctggggctgc atccccctcc 121 cccgagaggg ctgcgcaggc gggaagacgc cagaggccag cttcggtccc ccttctgtct 181 ctcggttcct ctttcctccc aagtaaggga ataaaccgcg aagaaggagc gccccgggcc 241 accgcgcaac caagtgttgc ctggtgagga agagccagga cttctgaatt taccttgaat 301 acagacagga ggatgttgcc taaggaatag cagagatctt gtctcatctt ctgagaggtg 361 cctgctgctg ctgtatacac ttgagtgctc ccagaagtct cctgaaaggc ttacatcgca 421 aacctgcaat gagccaggcc ctgggctggg cctccacttc agcctagtga acaaaactcc 481 atcactgccc tttagccact cacataaagt ttaaaaatgg gtgaagaacg gtgggtctcc 541 ctcactccag aagaatttga ccaactccag aaatattcag aatattcctc caagaagata 601 aaagatgcct tgactgaatt taatgagggt gggagcctca aacaatatga cccacatgag 661 ccgattagct atgatgtctt caagctgttc atgagggcgt acctggaggt ggaccttccc 721 cagccactga gcactcacct cttcctggcc ttcagccaga agcccagaca cgagacctct 781 gaccacccga cggagggagc cagcaacagt gaggccaaca gcgcagatac taatatacag 841 aatgcagata atgccaccaa agcagacgag gcctgtgccc ctgatactga atcaaatatg 901 gctgagaagc aagcaccagc tgaagaccaa gtggctgcga cccccctgga accccccgtc 961 cctcggtctt caagctcgga atccccagtg gtgtacctga aggatgttgt gtgctacctg 1021 tccctgctgg agacggggag gcctcaggat aagctggagt tcatgtttcg cctctatgat 1081 tcagatgaga acggtctcct ggaccaagcg gagatggatt gcattgtcaa ccaaatgctg 1141 catattgccc agtacctgga gtgggatccc acagagctga ggcctatatt gaaggagatg 1201 ctgcaaggga tggactacga ccgggacggc tttgtgtctc tacaggaatg ggtccatgga 1261 gggatgacca ccatcccatt gctggtgctc ctggggatgg atgactctgg ctccaagggg 1321 gatggggggc acgcctggac catgaagcac ttcaagaaac caacctactg caacttctgc 1381 catatcatgc tcatgggcgt ccgcaagcaa ggcctgtgct gcacttactg taaatacact 1441 gtccacgaac gctgtgtgtc caaaaacatt cctggttgtg tcaaaacgta ctcaaaagcc 1501 aaaaggagtg gtgaggtgat gcagcacgca tgggtggaag ggaactcctc cgtcaagtgt 1561 gaccggtgcc acaaaagtat caagtgctac cagagtgtca ccgcgcggca ctgcgtgtgg 1621 tgccggatga cgtttcaccg caaatgtgaa ttatcaacgt tgtgtgacgg tggggaactc 1681 agagaccaca tcttactgcc cacctccata tgccccatca cccgggacag gccaggtgag 1741 aagtctgatg gctgcgtgtc cgccaagggc gaacttgtca tgcagtataa gatcatcccc 1801 accccgggta cccaccccct gctggtcttg gtgaacccca agagtggagg gagacaagga 1861 gaaagaattc ttcggaaatt ccactatctg ctcaacccca aacaagtttt caacctggac 1921 aatggggggc ctactccagg gttgaacttt ttccgtgata ctccagactt ccgtgttttg 1981 gcctgtggtg gagatgggac agttggctgg attttggatt gcattgataa ggccaacttt 2041 gcaaagcatc caccagtggc tgtcctgcct cttggaacag gaaatgacct tgcccgttgt 2101 ctccgctggg gaggaggtta tgaagggggc agcttgacaa aaatcctgaa agacattgag 2161 cagagcccct tggtgatgct ggaccgctgg catctggaag tcatccccag agaggaagtg 2221 gaaaacgggg accaggtccc atacagcatc atgaacaact atttctccat tggtgtggac 2281 gcttccattg cacacagatt ccatgtgatg agagagaaac atcctgaaaa attcaacagc 2341 aggatgaaga acaagctgtg gtactttgaa tttggcacct cggagacttt tgcagcgacc 2401 tgcaagaaac tccacgacca cattgagttg gagtgtgatg gggttggggt ggacctgagc 2461 aacatcttcc tggaaggcat tgccattctc aacattccca gcatgtacgg aggcaccaat 2521 ctctggggag aaaacaagaa gaaccgggct gtgatccggg aaagcaggaa gggtgtcact 2581 gaccccaaag aactgaaatt ctgcgttcaa gacctcagtg accagctcct tgaagtggtg 2641 gggctagaag gagccatgga gatggggcag atctacaccg gcctgaagag tgcaggcagg 2701 aggctggccc agtgcgcctc tgtcaccatc aggacaaaca agctgctgcc aatgcaagtg 2761 gatggagaac cctggatgca gccatgttgc acgattaaaa ttactcacaa gaaccaagcg 2821 cccatgatga tggggcctcc ccagaagagc agcttcttct cgttgagaag gaagagccgt 2881 tcaaaagact aaacagtgtg ccaaacacca gctaaaccaa gagagaaagc aagaaactat 2941 aatgcacact cacacacaat ttatgtgcac actcacacat gcacacacac acacacatac 3001 acactcttct ctaaccagtg gaagcaaagc cacccttcgg gaagaaaacg tcaccttgcc 3061 atacattctg tttcaacagt gggtacaccc ctaacagagc cagtgccaac aaaacatttt 3121 gaatggactt agggcccatg aggttgtggc tggcttaggc agcaacctcc acattcccac 3181 aggccttgag cagaattttc tgagactgaa gggaaatccc cctttctttc taccagccct 3241 gcaagtttcc tcatggacgc tcgcgaggag caggctgcag gtttcctgcc tatggtgaga 3301 tcagatgtgg ccaagggaag gagctctggt tccagagaat ttgcacaaag ttccctctgt 3361 acagagacaa aacggcctcc ggctctcaga gcataatcct tggcagggct cagcaggcgc 3421 acgttggttt cttggtcgtc ctttgagtga caacttctcc gtgaacctgc tgaagaggca 3481 gaaaggctgt ggaaagctgt atttccattc ttgggtttct gcgccgtcgg tgggcacttg 3541 ttattttcca ggaaccttct cctggtgtct acatgtttgc ttagaggcgg ctccaagagc 3601 cccagagctg cctgcatagc acaccttaga tgtggtattt attttcttag ttctgtgaac 3661 acctgggagg gagagcggag aaactgggat ttatttttca aattggtgtc ataatattgt 3721 gtaaaaaggg aaggaaaaaa aaaaccaccc ccagcttc // LOCUS HUMDIPEPA 3709 bp mRNA PRI 22-JUL-1993 DEFINITION Human dipeptidyl aminopeptidase like protein mRNA, complete cds. ACCESSION M96859 NID g306705 KEYWORDS aminopeptidase; dipeptidyl aminopeptidase; dipeptidyl aminopeptidase IV. SOURCE Homo sapiens (library: lambda FIX (from Stratagene)) female 2 year old hippocampus cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3709) AUTHORS Yokotani,N., Doi,K., Wenthold,R.J. and Wada,K. TITLE Non-conservation of a catalytic residue in a dipeptidyl aminopeptidase IV-related protein encoded by a gene on human chromosome 7 JOURNAL Hum. Mol. Genet. 2, 1037-1039 (1993) MEDLINE 93372805 FEATURES Location/Qualifiers source 1..3709 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="2 year old" /sex="female" /tissue_type="hippocampus" /tissue_lib="lambda FIX (from Stratagene)" /map="7" CDS 145..2742 /codon_start=1 /product="dipeptidyl aminopeptidase like protein" /db_xref="PID:g306706" /translation="MASLYQRFTGKINTSRSFPAPPEASHLLGGQGPEEDGGAGAKPL GPRAQAAAPRERGGGGGGAGGRPRFQYQGRSDGDEEDELVGSNPPQRNWKGIAIALLV ILVICSLIVTSVILLTPAEDNSLSQKKKVTVEDLFSEDFKIHDPEAKWISDTEFIYRE QKGTVRLWNVETNTSTVLIEGKKIESLRAIRYEISPDREYALFSYNVEPIYQHSYTGY YVLSKIPHGDPQSLDPPEVSNAKLQYAGWGPKGQQLIFIFENNIYYCAHVGKQAIRVV STGKEGVIYNGLSDWLYEEEILKTHIAHWWSPDGTRLAYAAINDSRVPIMELPTYTGS IYPTVKPYHYPKAGSENPSISLHVIGLNGPTHDLEMMPPDDPRMREYYITMVKWATST KVAVTWLNRAQNVSILTLCDATTGVCTKKHEDESEAWLHRQNEEPVFSKDGRKFFFIR AIPQGGRGKFYHITVSSSQPNSSNDNIQSITSGDWDVTKILAYDEKGNKIYFLSTEDL PRRRQLYSANTEGNFNRQCLSCDLVENCTYFSASFSHSMDFFLLKCEGPGVPMVTVHN TTDKKKMFDLETNEHVKKAINDRQMPKVEYRDIEIDDYNLPMQILKPATFTDTTHYPL LLVVDGTPGSQSVAEKFEVSWETVMVSSHGAVVVKCDGRGSGFQGTKLLHEVRRRLGL LEEKDQMEAVRTMLKEQYIDRTRVAVFGKDYGGYLSTYILPAKGENQGQTFTCGSALS PITDFKLYASAFSERYLGLHGLDNRAYEMTKVAHRVSALEEQQFLIIHPTADEKIHFQ HTAELITQLIRGKANYSLQIYPDESHYFTSSSLKQHLYRSIINFFVECFRIQDKLPTV TAKEDEEED" polyA_signal 3577..3582 polyA_site 3709 BASE COUNT 938 a 1013 c 977 g 781 t ORIGIN 1 gacgctgggc ttgcttgctg ctgctgctgc tgcctcccca ccgccttttt ttttttaatc 61 tggagccggg gtggggagtg ggaaccggag agaaagcaaa atattaaaaa gccccaaaga 121 cagccagcag gagcgcggtg cccgatggct tcgctgtacc agaggttcac tggcaagatc 181 aacacctcga ggtccttccc cgcgcccccg gaggcgagtc acctcctggg cggccagggg 241 cccgaggagg acggcggtgc aggagccaag cccctcggcc cgcgggcgca ggcggcggcg 301 ccccgggagc gcggcggcgg cggcggcggc gcgggtggcc ggccccggtt ccagtaccag 361 ggccggagcg atggtgacga ggaggacgag ctggtgggga gtaaccctcc gcagaggaat 421 tggaaaggaa tagcaattgc actgcttgtc attctggtca tctgctcctt gatcgtcacc 481 tcggtcatac ttctgacacc agcggaagat aatagtctgt ctcaaaagaa gaaggtcact 541 gtagaagatc tcttcagtga agacttcaaa attcatgacc ccgaggctaa gtggataagt 601 gatacagaat tcatctacag agaacagaaa ggaacagtga gactgtggaa tgttgaaaca 661 aatacttcta ctgtcttaat agaaggcaaa aaaattgaat cattaagagc catcagatat 721 gaaatatctc cagatagaga gtatgcactt ttttcataca atgtggaacc catatatcaa 781 cactcgtata ctggatatta tgtcctgagc aaaattcctc atggggatcc tcaaagtctg 841 gacccaccag aagtcagcaa tgcaaagctt cagtatgcag gatggggccc taaaggccaa 901 cagctgatat ttatttttga aaacaatatc tactactgtg cacatgtcgg gaaacaggcc 961 atccgtgtgg tctccactgg caaggaaggt gtgatttaca atggcctcag tgactggctg 1021 tatgaagagg agattttgaa gacacacatc gcacactggt ggtctccgga tggcacgaga 1081 ctcgcctacg ccgccatcaa tgattcccgt gtccccatca tggagctccc aacttacacc 1141 ggctccatct accccaccgt gaagccctac cactatccca aggctggaag tgagaacccc 1201 agcatttccc tacacgttat tggcttaaat ggacccaccc atgatctgga gatgatgccg 1261 cctgatgatc cacggatgag ggagtactac atcaccatgg tgaagtgggc caccagcacc 1321 aaggtcgccg tgacctggct gaaccgggcg cagaacgtgt ccatcctcac cctctgcgac 1381 gccaccacgg gggtctgcac gaagaaacac gaggatgaaa gtgaggcctg gctccacaga 1441 cagaatgaag aacctgtgtt ctccaaggat ggccgaaagt ttttcttcat cagagccatc 1501 ccccagggag gacgagggaa attctatcac atcacggtgt cctcgtccca gcccaacagc 1561 agcaacgaca acatccagtc catcacctcc ggggactggg acgtgaccaa gatcctagcc 1621 tacgatgaga aggggaataa gatctacttc ctgagcacgg aggacctgcc tcggagacga 1681 caactctaca gtgccaacac ggagggcaac ttcaacaggc agtgcctctc ctgtgacctg 1741 gttgagaact gcacctactt cagcgcttcc ttcagccata gcatggactt cttcctgctc 1801 aagtgcgaag gtcctggtgt tcctatggtg acggtgcaca acacaacaga taagaaaaaa 1861 atgtttgacc tagaaacaaa tgaacatgtc aagaaggcca taaatgaccg acagatgcct 1921 aaagtggaat acagggacat tgagattgat gattacaacc tgcccatgca gatactgaag 1981 ccagcaacct tcaccgacac cacccactac cctctgctcc tggtggtgga tggcaccccg 2041 ggcagccaga gtgtagctga gaagttcgag gtgagctggg agacggtgat ggtgagcagc 2101 cacggcgcgg tggtggtaaa gtgtgacggc cgtggcagcg gcttccaagg gaccaagctc 2161 ctgcacgaag tgaggcggcg gctgggcttg ctggaggaga aggaccagat ggaggccgtg 2221 cggacgatgc tgaaggagca gtacattgac aggacgcgcg tggccgtgtt cgggaaggat 2281 tacggtggct acctgagcac ctacatcctc ccagcaaagg gagaaaatca aggccagaca 2341 ttcacctgcg gctctgctct ctctccaata acagacttca aactctatgc ctctgcgttt 2401 tccgagaggt acttgggcct ccatggactt gacaacagag catacgagat gaccaaggta 2461 gcccatcgag tctccgcgct ggaagaacag cagttcctga tcattcatcc cactgccgat 2521 gaaaaaattc atttccagca cacagcagaa ctcattacac aactaattag gggaaaggct 2581 aattacagct tacagattta cccggacgaa agccattact ttaccagctc cagcctcaaa 2641 cagcatctgt accggtccat catcaacttc ttcgtggaat gcttcaggat ccaggacaaa 2701 ctgccgacag tcacagcgaa agaggacgag gaggaggact aagctcaggt cgctctaagc 2761 acaaacgtgg ctctttctac aaccagatgc aaccgaggga tttccctgcc ctccctcttc 2821 cctcggaggg gcggggcggg gcggggccgg gcgtaccata gcatgtgtgt ctcggatgcg 2881 gaaggcagtt ttgcttggga aacaagctcc ttcccggggt catcactcac ggcctccatg 2941 gcaccaggga caacgctgtc cccgcagcag cgcctcctcc cggcgcccga gagaccggca 3001 cgccacggcc cctcccccaa ggaacagagc aaaggatggt gccgcaggcc ccagcccaca 3061 ggacaccggc ccctagattc cagccaccaa tgcgaagatg agactcgccc acactagcct 3121 ctgtgttacc gttagcatca caccctgtct cacgtcgcag tgccatggac gcagcagtta 3181 cagcaccatt gttttagcag tgcgtgttca tatatgggct tgctacttcc tgtaatgagg 3241 acgttcaaca tggtgagggg ctacaagaaa acgcttttct gtacagagtc ttactgtagc 3301 tacgctaatg gttaacctga tagaattaac tcgtattttt ctatggtttt aacctgatgc 3361 tccactgtct ccgtcagggg ttgttttgct gtttggggtt gggccttgtt tcccttggct 3421 ttctccagtc cacgtgtaga ctttgcgctt gtttggatga agaagcagat cggaagtaac 3481 tgctccctcc tcaaggttgt cttcagacgt cttggagacg ttcctaaaca ctgagggggg 3541 aagacagcca atagcaccca ttaaaagaaa tacctaaata aaacctctct cccactcagc 3601 tatgctaggg cttggctgta ggtgtgcact gtctatttac atccgtcctt acaaccatcc 3661 ttgtcctcct tggtaccgta ttcaagctct ttcccatgac atttggttg // LOCUS HUMDIPM 1489 bp mRNA PRI 04-SEP-1993 DEFINITION Human mRNA for dipeptidase. ACCESSION D13138 NID g219584 KEYWORDS dipeptidase. SOURCE Homo sapiens (haplotype:diploid) kidney cDNA to mRNA, clone_lib:lambda gt10. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1489) AUTHORS Satoh,S., Keida,Y., Konta,Y., Maeda,M., Matsumoto,Y., Niwa,M. and Kohsaka,M. TITLE Purification and molecular cloning of mouse renal dipeptidase JOURNAL Biochim. Biophys. Acta 1163 (3), 234-242 (1993) MEDLINE 93283418 REFERENCE 2 (bases 1 to 1489) AUTHORS Satoh,S. TITLE Direct Submission JOURNAL Submitted (01-SEP-1992) to the DDBJ/EMBL/GenBank databases. Susumu Satoh, Fujisawa Pharmaceutical Co., Ltd., Product Development Laboratories; 1-6-2 Kashima, Yodogawa-ku, Osaka 532, Japan (Tel:06-390-1148, Fax:06-304-1192) COMMENT Submitted (01-SEP-1992) to DDBJ by: Susumu Satoh Product development Laboratories Fujisawa Pharmaceutical Co., Ltd. 1-6 2chome Kashima Osaka 532 Japan Phone: 06-390-1148 Fax: 06-304-1192. FEATURES Location/Qualifiers source 1..1489 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda gt10" /haplotype="diploid" /tissue_type="kidney" CDS 44..1279 /codon_start=1 /product="dipeptidase precursor" /db_xref="PID:d1002931" /db_xref="PID:g219585" /translation="MWSGWWLWPLVAVCTADFFRDEAERIMRDSPVIDGHNDLPWQLL DMFNNRLQDERANLTTLAGTHTNIPKLRAGFVGGQFWSVYTPCDTQNKDAVRRTLEQM DVVHRMCRMYPETFLYVTSSAGIRQAFREGKVASLIGVEGGHPIDSSLGVLRALYQLG MRYLTLTHSCNTPWSDNWLVDTGDSEPQSQGLSPFGQRVVKELNRLGVLIDLAHVSVA TMKATLQLSRAPVIFSHSSAYSVCASRRNVPDDVLRLVKQTDSLVMVNFYNNYISCTN KANLSQVADHLDHIKEVAGARAVGFGGDFDGVPRVPEGLEDVSKYPDLIAELLRRNWT EAEVKGALADNLLRVFEAVEQASNLTQAPEEEPIPLDQLGGSCRTHYGYSSGASSLHR HWGLLLASLAPLVLCLSLL" sig_peptide 44..91 mat_peptide 92..1276 /EC_number="3.4.13.11" /product="dipeptidase" polyA_site 1489 BASE COUNT 297 a 468 c 462 g 262 t ORIGIN 1 ggcaccaggg cagcagtgca cacaggtccc cggggacccc accatgtgga gcggatggtg 61 gctgtggccc cttgtggccg tctgcactgc agacttcttt cgggacgagg cagagaggat 121 catgagggac tcccctgtca ttgatgggca caatgacctc ccctggcagc tgctggatat 181 gttcaacaac cggctgcagg acgagagggc caacctgacc accttggccg gcacacacac 241 caacatcccc aagctgaggg ccggctttgt gggaggccag ttctggtccg tgtacacgcc 301 ctgcgacacc cagaacaaag acgccgtgcg gaggacgctg gagcagatgg acgtggtcca 361 ccgcatgtgc cggatgtacc cggagacctt cctgtatgtc accagcagtg caggcattcg 421 gcaggccttc cgggaaggga aggtggccag cctgatcggc gtggagggcg gccaccccat 481 tgacagcagt ttgggcgtcc tgcgggcact ctatcagctg ggcatgcggt acctgaccct 541 cacccacagc tgcaacacgc cctggtctga caactggctg gtggacacgg gagacagcga 601 gccccagagc caaggcttgt caccctttgg gcagcgtgtg gtgaaggagc tgaaccgtct 661 gggggtcctc atcgacttgg ctcacgtgtc tgtggccacc atgaaggcca ccctgcagct 721 gtccagagcc ccggtcatct tcagccactc ctcggcctac agcgtgtgcg caagccggcg 781 caacgtgcct gacgacgtcc tgaggctggt gaaacagaca gacagcctgg tgatggtgaa 841 cttctacaac aattacattt cctgcaccaa caaggccaac ctgtcccaag tggccgacca 901 tctggatcac atcaaggagg tggcaggagc cagagccgtg ggttttggtg gggactttga 961 tggtgttcca agggtccctg aggggctgga ggacgtctcc aagtatccag acctgatcgc 1021 tgagctgctc aggaggaact ggacggaggc ggaggtcaag ggcgcactgg ctgacaacct 1081 gctgagggtc ttcgaggctg tggaacaggc cagcaacctc acacaggctc ccgaggagga 1141 gcccatcccg ctggaccagc tgggtggctc ctgcaggacc cattacggct actcctctgg 1201 ggcttccagc ctccatcgcc actgggggct cctgctggcc tccctcgctc ccctggtcct 1261 ctgtctgtct ctcctgtgaa acctgggaga ccagagtccc ctttagggtt cccggagctc 1321 cgggaagacc cgcccatccc aggactccag atgccaggag ccctgctgcc cacatgcaag 1381 gaccagcatc tcctgagagg acgcctgggc ttacctgggg ggcaggatgc ctggggacag 1441 ttcaggacac acacacagta ggcccgcaat aaaagcaaca ccccttcac // LOCUS HUMDK 3018 bp mRNA PRI 28-JUL-1994 DEFINITION Homo sapiens ataxia-telangiectasia group D-associated protein mRNA, complete cds. ACCESSION L24203 NID g401762 KEYWORDS ataxia-telangiectasia group D-associated protein. SOURCE Homo sapiens (library: Stratagene) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3018) AUTHORS Leonhardt,E.A., Kapp,L.N., Young,B.R. and Murnane,J.P. TITLE Nucleotide sequence analysis of a candidate gene for ataxia-telangiectasia group D (ATDC) JOURNAL Genomics 19, 130-136 (1994) MEDLINE 94245147 FEATURES Location/Qualifiers source 1..3018 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /tissue_lib="Stratagene" exon <1..928 /number=1 CDS 125..1891 /codon_start=1 /product="ataxia-telangiectasia group D-associated protein" /db_xref="PID:g401763" /translation="MEAADASRSNGSSPEARDARSPSGPSGSLENGTKADGKDAKTTN GHGGEAAEGKSLGSALKPGEGRSALFAGNEWRRPIIQFVESGDDKNSNYFSMDSMEGK RSPYAGLQLGAAKKPPVTFAEKGDVRKSIFSESRKPTVSIMEPGETRRNSYPRADTGL FSRSKSGSEEVLCDSCIGNKQKAVKSCLVCQASFCELHLKPHLEGAAFRDHQLLEPIR DFEARKCPVHGKTMELFCQTDQTCICYLCMFQEHKNHSTVTVEEAKAEKETELSLQKE QLQLKIIEIEDEAEKWQKEKDRIKSFTTNEKAILEQNFRDLVRDLEKQKEEVRAALEQ REQDAVDQVKVIMDALDERAKVLHEDKQTREQLHSISDSVLFLQEFGALMSNYSLPPP LPTYHVLLEGEGLGQSLGNFKDDLLNVCMRHVEKMCKADLSRNFIERNHMENGGDHRY VNNYTNSFGGEWSAPDTMKRYSMYLTPKGGVRTSYQPSSPGRFTKETTQKNFNNLYGT KGNYTSRVWEYSSSIQNSDNDLPVVQGSSSFSLKGYPSLMRSQSPKAQPQTWKSGKQT MLSHYRPFYVNKGNGIGSNEAP" exon 929..1024 /number=2 exon 1025..1258 /number=3 exon 1259..1457 /number=4 exon 1458..1559 /number=5 exon 1560..1652 /number=6 exon 1653..1751 /number=7 exon 1752..1828 /number=8 exon 1829..>3018 /number=9 polyA_signal 3000..3005 polyA_site 3018 BASE COUNT 665 a 937 c 837 g 579 t ORIGIN 1 ctcctcacag gtgtgtctct agtcctcgtg gttgcctgcc ccactccctg ccgagacgcc 61 tgccagaaag gtcacctatc ctgaacccca gcaagcctga aacagctcag ccaagcaccc 121 tgcgatggaa gctgcagatg cctccaggag caacgggtcg agcccagaag ccagggatgc 181 ccggagcccg tcgggcccca gtggcagcct ggagaatggc accaaggctg acggcaagga 241 tgccaagacc accaacgggc acggcgggga ggcagctgag ggcaagagcc tgggcagcgc 301 cctgaagcca ggggaaggta ggagcgccct gttcgcgggc aatgagtggc ggcgacccat 361 catccagttt gtcgagtccg gggacgacaa gaactccaac tacttcagca tggactctat 421 ggaaggcaag aggtcgccgt acgcagggct ccagctgggg gctgccaaga agccacccgt 481 tacctttgcc gaaaagggcg acgtgcgcaa gtccattttc tcggagtccc ggaagcccac 541 ggtgtccatc atggagcccg gggagacccg gcggaacagc tacccccggg ccgacacggg 601 ccttttttca cggtccaagt ccggctccga ggaggtgctg tgcgactcct gcatcggcaa 661 caagcagaag gcggtcaagt cctgcctggt gtgccaggcc tccttctgcg agctgcatct 721 caagccccac ctggagggcg ccgccttccg agaccaccag ctgctcgagc ccatccggga 781 ctttgaggcc cgcaagtgtc ccgtgcatgg caagacgatg gagctcttct gccagaccga 841 ccagacctgc atctgctacc tttgcatgtt ccaggagcac aagaatcata gcaccgtgac 901 agtggaggag gccaaggccg agaaggagac ggagctgtca ctgcaaaagg agcagctgca 961 gctcaagatc attgagattg aggatgaagc tgagaagtgg cagaaggaga aggaccgcat 1021 caagagcttc accaccaatg agaaggccat cctggagcag aacttccggg acctggtgcg 1081 ggacctggag aagcaaaagg aggaagtgag ggctgcgctg gagcagcggg agcaggatgc 1141 tgtggaccaa gtgaaggtga tcatggatgc tctggatgag agagccaagg tgctgcatga 1201 ggacaagcag acccgggagc agctgcatag catcagcgac tctgtgttgt ttctgcagga 1261 atttggtgca ttgatgagca attactctct ccccccaccc ctgcccacct atcatgtcct 1321 gctggagggg gagggcctgg gacagtcact aggcaacttc aaggacgacc tgctcaatgt 1381 atgcatgcgc cacgttgaga agatgtgcaa ggcggacctg agccgtaact tcattgagag 1441 gaaccacatg gagaacggtg gtgaccatcg ctatgtgaac aactacacga acagcttcgg 1501 gggtgagtgg agtgcaccgg acaccatgaa gagatactcc atgtacctga cacccaaagg 1561 tggggtccgg acatcatacc agccctcgtc tcctggccgc ttcaccaagg agaccaccca 1621 gaagaatttc aacaatctct atggcaccaa aggtaactac acctcccggg tctgggagta 1681 ctcctccagc attcagaact ctgacaatga cctgcccgtc gtccaaggca gctcctcctt 1741 ctccctgaaa ggctatccct ccctcatgcg gagccaaagc cccaaggccc agccccagac 1801 ttggaaatct ggcaagcaga ctatgctgtc tcactaccgg ccattctacg tcaacaaagg 1861 caacgggatt gggtccaacg aagccccatg agctcctggc ggaaggaacg aggcgccaca 1921 cccctgctct tcctcctgac cctgctgctc ttgccttcta agctactgtg cttgtctggg 1981 tgggagggag cctggtcctg cacctgccct ctgcagccct ctgccagcct cttgggggca 2041 gttccggcct ctccgacttc cccactggcc acactccatt cagactcctt tcctgccttg 2101 tgacctcaga tggtcaccat cattcctgtg ctcagaggcc aacccatcac aggggtgaga 2161 taggttgggg cctgccctaa cccgccagcc tcctcctctc gggctggatc tgggggctag 2221 cagtgagtac ccgcatggta tcagcctgcc tctcccgccc acgccctgct gtctccaggc 2281 ctatagacgt ttctctccaa ggccctatcc cccaatgttg tcagcagatg cctggacagc 2341 acagccaccc atctcccatt cacatggccc acctcctgct tcccagagga ctggccctac 2401 gtgctctctc tcgtcctacc tatcaatgcc cagcatggca gaacctgcag tggccaaggg 2461 ctgcagatgg aaacctctca gtgtcttgac atcaccctac ccaggcggtg ggtctccacc 2521 acagccactt tgagtctgtg gtccctggag ggtggcttct cctgactggc aggatgacct 2581 tagccaagat attcctctgt tccctctgct gagataaaga attcccttaa catgatataa 2641 tccacccatg caaatagcta ctggcccagc taccatttac catttgccta cagaatttca 2701 ttcagtctac actttggcat tctctctggc gatggagtgt ggctgggctg accgcaaaag 2761 gtgccttaca cactgccccc accctcagcc gttgccccat cagaggctgc ctcctccttc 2821 tgattacccc ccatgttgca tatcagggtg ctcaaggatt ggagaggaga caaaaccagg 2881 agcagcacag tggggacatc tcccgtctca acagccccag gcctatgggg gctctggaag 2941 gatgggccag cttgcagggg ttggggaggg agacatccag cttgggcttt cccctttgga 3001 ataaaccatt ggtctgtc // LOCUS HUMDLDH 2064 bp mRNA PRI 15-MAR-1990 DEFINITION Human dihydrolipoamide dehydrogenase mRNA, complete cds. ACCESSION J03620 NID g181574 KEYWORDS dihydrolipoamide dehydrogenase; dihydrolipoamide:NAD+ oxireductase. SOURCE Human liver, cDNA to mRNA, clone lambda-E-3-1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2064) AUTHORS Pons,G., Raefsky-Estrin,C., Carothers,D.J., Pepin,R.A., Javed,A.A., Jesse,B.W., Ganapathi,M.K., Samols,D. and Patel,M.S. TITLE Cloning and cDNA sequence of the dihydrolipoamide dehydrogenase component of human alpha-ketoacid dehydrogenase complexes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 1422-1426 (1988) MEDLINE 88144449 FEATURES Location/Qualifiers source 1..2064 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..2064 /note="DLDH mRNA" sig_peptide 51..155 /note="dihydrolipoamide dehydrogenase precursor (EC" CDS 51..1580 /note="dihydrolipoamide dehydrogenase precursor" /codon_start=1 /db_xref="PID:g181575" /translation="MQSWSRVYCSLAKRGHFNRISHGLQGLSAVPLRTYADQPIDADV TVIGSGPGGYVAAIKAAQLGFKTVCIEKNETLGGTCLNVGCIPSKALLNNSHYYHMAH GKDFASRGIEMSEVRLNLDKMMEQKSTAVKALTGGIAHLFKQNKVVHVNGYRKITGKN QVTATKADGGTQVIDTKNILIATGSEVTPFPGITIDEDTIVSSTGALSLKKVPEKMVV IGAGVIGVELGSVWQRLGADVTAVEFLGHVGGVGIDMEISKNFQRILQKQGFKFKLNT KVTGATKKSDGKIDVSIEAASGGKAEVITCDVLLVCIGRRPFTKNLGLEELGIELDPR GRIPVNTRFQTKIPNIYAIGDVVAGPMLAHKAEDEGIICVEGMAGGAVHIDYNCVPSV IYTHPEVAWVGKSEEQLKEEGIEYKVGKFPFAANSRAKTNADTDGMVKILGQKSTDRV LGAHILGPGAGEMVNEAALALEYGASCEDIARVCHAHPTLSEAFREANLAASFGKSIN F" mat_peptide 156..1577 /note="dihydrolipoamide dehydrogenase precursor (EC" BASE COUNT 638 a 334 c 474 g 618 t ORIGIN 62 bp upstream of HinfI site. 1 gctcccagcg gaggtgaaag tattggcgga aaggaaaata cagcggaaaa atgcagagct 61 ggagtcgtgt gtactgctcc ttggccaaga gaggccattt caatcgaata tctcatggcc 121 tacagggact ttctgcagtg cctctgagaa cttacgcaga tcagccgatt gatgctgatg 181 taacagttat aggttctggt cctggaggat atgttgctgc tattaaagct gcccagttag 241 gcttcaagac agtctgcatt gagaaaaatg aaacacttgg tggaacatgc ttgaatgttg 301 gttgtattcc ttctaaggct ttattgaaca actctcatta ttaccatatg gcccatggaa 361 aagattttgc atctagagga attgaaatgt ccgaagttcg cttgaattta gacaagatga 421 tggagcagaa gagtactgca gtaaaagctt taacaggtgg aattgcccac ttattcaaac 481 agaataaggt tgttcatgtc aatggatata gaaagataac tggcaaaaat caagtcactg 541 ctacgaaagc tgatggcggc actcaggtta ttgatacaaa gaacattctt atagccacgg 601 gttcagaagt tactcctttt cctggaatca cgatagatga agatacaata gtgtcatcta 661 caggtgcttt atctttaaaa aaagttccag aaaagatggt tgttattggt gcaggagtaa 721 taggtgtaga attgggttca gtttggcaaa gacttggtgc agatgtgaca gcagttgaat 781 ttttaggtca tgtaggtgga gttggaattg atatggagat atctaaaaac tttcaacgca 841 tccttcaaaa acaggggttt aaatttaaat tgaatacaaa ggttactggt gctaccaaga 901 agtcagatgg aaaaattgat gtttctattg aagctgcttc tggtggtaaa gctgaagtta 961 tcacttgtga tgtactcttg gtttgcattg gccgacgacc ctttactaag aatttgggac 1021 tagaagagct gggaattgaa ctagatccca gaggtagaat tccagtcaat accagatttc 1081 aaactaaaat tccaaatatc tatgccattg gtgatgtagt tgctggtcca atgctggctc 1141 acaaagcaga ggatgaaggc attatctgtg ttgaaggaat ggctggtggt gctgtgcaca 1201 ttgactacaa ttgtgtgcca tcagtgattt acacacaccc tgaagttgct tgggttggca 1261 aatcagaaga gcagttgaaa gaagagggta ttgagtacaa agttgggaaa ttcccatttg 1321 ctgctaacag cagagctaag acaaatgctg acacagatgg catggtgaag atccttgggc 1381 agaaatcgac agacagagta ctgggagcac atattcttgg accaggtgct ggagaaatgg 1441 taaatgaagc tgctcttgct ttggaatatg gagcatcctg tgaagatata gctagagtct 1501 gtcatgcaca tccgacctta tcagaagctt ttagagaagc aaatcttgct gcgtcatttg 1561 gcaaatcaat caacttttga attagaagat tatatatttt tttttctgaa atttcctggg 1621 agcttttgta gaagtcacat tcctgaacag gatattctca cagctccaag aatttctagg 1681 actgaattat gaaacttttg gaaggtattt aataggtttg gacaaaatgg aatactctta 1741 tatctatatt ttacataaat ttagtatttt tgtttcagtg cactaatatg taagacaaaa 1801 agctacttat tgtagcatcc tggaatatct ccgtcaactc atattttcat gctgttcatg 1861 aaagattcaa tgcccctgaa tttaaatagc ttttttctct gatacagaaa agttgaattt 1921 tacatggctg gagctagaat ttgatatgtg aacagttgtg tttgaagcac agtgatcaag 1981 ttatttttaa tttggttttc acattggaaa caagtcagtc attcagatat gattcaaatg 2041 tctataaacc gaactgatgt aagt // LOCUS HUMDMC1B 2150 bp mRNA PRI 21-FEB-1997 DEFINITION Human mRNA for DMC1 homologue, complete cds. ACCESSION D64108 NID g987656 KEYWORDS DMC1 homologue; meiosis-specific homologous recombination gene. SOURCE Homo sapiens testis cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2150) AUTHORS Habu,T., Taki,T., West,A., Nishimune,Y. and Morita,T. TITLE The mouse and human homologs of DMC1, the yeast meiosis-specific homologous recombination gene, have a common unique form of exon-skipped transcript in meiosis JOURNAL Nucleic Acids Res. 24 (3), 470-477 (1996) MEDLINE 96173646 REFERENCE 2 (bases 1 to 2150) AUTHORS Morita,T. JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 2150) AUTHORS Morita,T.T. TITLE Direct Submission JOURNAL Submitted (09-SEP-1995) to the DDBJ/EMBL/GenBank databases. Takashi T.M. Morita, Research Institute for Microbial Diseases,Osaka University, Department of Molecular Embryology; 3-1 Yamadaoka, Suita, Osaka 565, Japan (E-mail:tmorita@biken.osaka-u.ac.jp, Tel:06-879-8309, Fax:06-875-3268) FEATURES Location/Qualifiers source 1..2150 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="testis" CDS 54..1076 /codon_start=1 /product="DMC1 homologue" /db_xref="PID:d1011626" /db_xref="PID:g1321636" /translation="MKEDQVVAEEPGFQDEEESLFQDIDLLQKHGINVADNKKLKSVG ICTIKGIQMTTRRALCNVKGLSEAKVDKIKEAANKLIEPGFLTAFEYSEKRKMVFHIT TGSQEFDKLLGGGIESMAITEAFGEFRTGKTQLSHTLCVTAQLPGAGGYPGGKIIFID TENTFRPDRLRDIADRFNVDHDAVLDNVLYARAYTSEHQMELLDYVAAKFHEEAGIFK LLIIDSIMALFRVDFSGRGELAERQQKLAQMLSRLQKISEEYNVAVFVTNQMTADPGA TMTFQADPKKPIGGHILAHASTTRISLRKGRGELRIAKIYDSPEMPENEATFAITAGG IGDAKE" BASE COUNT 709 a 385 c 467 g 589 t ORIGIN 1 agactgtggg tacgaggggg aagtgattat ttttctgttg cccacttttc aatatgaagg 61 aggatcaagt tgtggcggaa gaaccaggat tccaagatga agaggaatct ttgtttcaag 121 atattgacct gttacagaaa catggaatta acgtggctga caataagaaa ctgaaatcag 181 taggaatctg taccatcaaa ggtatacaga tgacaacaag aagagctcta tgcaatgtca 241 aaggactctc agaagccaaa gtagacaaga ttaaagaggc agcgaacaaa ctaattgaac 301 caggattctt gactgcattt gagtatagtg aaaagaggaa aatggttttc catatcacca 361 ccgggagcca ggaatttgat aagttactag gaggtggaat tgaaagtatg gcaattacag 421 aagcttttgg agaatttcgt actggaaaaa cccagctttc tcataccctc tgtgtgacag 481 ctcaacttcc aggagctggt ggctacccag gaggaaagat tatcttcatt gatacagaaa 541 atactttccg tccagatcgc cttagggaca ttgctgatcg ctttaatgta gaccatgatg 601 cagtactgga caacgtactt tatgcacgtg catatactag tgaacatcag atggagctac 661 ttgattatgt agcagcaaag ttccatgaag aagctggcat cttcaagcta ttgattatcg 721 attcaataat ggcacttttt cgagtggatt tcagtggccg tggggagttg gccgaacggc 781 agcaaaaatt ggcccagatg ttgtcacgac tccaaaaaat ctcagaagaa tataacgtgg 841 ctgtttttgt gaccaatcaa atgactgccg atccaggagc aactatgacc tttcaggcag 901 atcccaaaaa acccattggg ggacacattc tggctcatgc ttcaacaaca agaataagct 961 tgcgaaaggg aagaggagag ctcagaattg ccaagattta tgacagtcct gagatgcctg 1021 aaaatgaagc caccttcgca ataactgctg gaggaattgg ggatgccaag gagtaggtgg 1081 tgaattgatg caaattgctt cttagtgctt attaggagct gaagaaatgg aaaagcagtc 1141 tccaatttca catcttgaaa tagaggtttt ttcccacatg ttactaaaga aaagtcagca 1201 aagagattta aatcttatat ttatcttaaa agtccctgat ttatgataac tatacattgt 1261 atgtaaattc agggatatgt atatgtatgt ttgtgtgtgt gtgtgtttgt gtgtgtatac 1321 ctgacatata tagtagatat atatacctat tgaaattctc tcactctgtg tatgtatata 1381 tacctatgga aaactagttg ttgggaaagg agtacgtgat tccctccctt tcacttttca 1441 aagtgagcca ctaaacagaa agtctaagag ttcagaaatg tcccattctc ctaaggactt 1501 tcccttcacc atttttctag ggtatagtgc cactgacatt acatagctag atgtttttcc 1561 acaagaggat ttaagggagg aatgtttata ggacacacac acaaaagctc tttctattta 1621 taatatcaaa ctgcatattc acctttatag caacagaaaa cagacattaa attaagccaa 1681 attttaaatc atttgacttt gacaagcaga aattactttt aaaaaatgta tttatggctg 1741 ggcgtggtgg ctcacacctg taatcccagc actttgggag gccgaggcag gtggatcact 1801 tgaggtcagg aggtcaagac cagcctggcc aacatggaga aaccccatct ctactaaaaa 1861 aaaaaaaatt agccaagtgc agtggcatgt gcttgtaatc ccagctaccc tggaggctga 1921 ggcaggagaa ttgcttgaac ccaggaggca gagattgcgg tgagccaaga tggcaccact 1981 gcactccagc ctaggtgaca gagtgagact ccgtctcaaa aaaaaaaaaa aaaaactttt 2041 ttaaaatgtt aaaattaatt taaatgtaaa atttctgaat actaatctta gaaatgtgta 2101 aaagatatta ttttgattgt taaacaaata aaatcctgtt ttcaaaatgc // LOCUS HUMDMKIN 13747 bp DNA PRI 12-MAY-1994 DEFINITION Human myotonic dystrophy kinase (DM kinase) gene, complete cds. ACCESSION L08835 NID g181601 KEYWORDS alternative splicing; kinase; myotonic dystrophy kinase. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 13747) AUTHORS Mahadevan,M.S., Amemiya,C.T., Jansen,G., Sabourin,L., Baird,S., Neville,C.E., Wormskamp,N., Segers,B., Lamerdin,J., de Jong,P., Wieringa,B. and Korneluk,R.G. TITLE Structure and genomic sequence of the myotonic dystrophy (DM kinase) gene JOURNAL Hum. Mol. Genet. 2, 299-304 (1993) MEDLINE 93271990 FEATURES Location/Qualifiers source 1..13747 /organism="Homo sapiens" /db_xref="taxon:9606" /map="19" gene <234..281 /gene="DMR-N9" CDS <234..281 /gene="DMR-N9" /note="putative" /codon_start=1 /db_xref="PID:g181602" /translation="GISSQPGNSPSGTVV" exon 234..(1516.1540) /gene="DMR-N9" /note="putative" polyA_signal one-of(720..725,767..773) /gene="DMR-N9" /note="two weak, non-consensus signals; putative" mRNA join(1394..2329,4661..4752,5008..5091,5171..5266, 5889..6037,6311..6404,6658..6864,6941..7204,8527..8612, 10739..10850,11566..11723,11829..11926,12854..13747) /gene="DM kinase" /note="alternative transcipt in heart, lacks exons 13 and 14, utilizes the longer exon 8b" exon 1394..2329 /gene="DM kinase" /note="putative" /number=1 mRNA join(1394..2329,4661..4752,5008..5091,5171..5266, 5889..6037,6311..6404,6658..6864,6941..7204,8527..8612, 10739..10850,11566..11723,11829..11926,12098..12144, 12434..12523,12854..13747) /gene="DM kinase" /note="transcript utilizes longer exon 8b" 5'UTR 1394..2169 /gene="DM kinase" /note="putative" mRNA join(1394..2329,4661..4752,5008..5091,5171..5266, 5889..6037,6311..6404,6658..6864,6941..7189,8527..8612, 10739..10850,11566..11723,11829..11926,12098..12144, 12434..12523,12854..13747) /gene="DM kinase" /note="transcript using shorter exon 8a" mRNA join(1394..2329,4661..4752,5008..5091,5171..5266, 5889..6037,6311..6404,6658..6864,6941..7189,8527..8612, 10739..10850,11566..11723,11829..11926,12854..13747) /gene="DM kinase" /note="alternative transcript in heart, lacks exons 13 and 14, utilizes the shorter exon 8a" gene 1394..13747 /gene="DM kinase" CDS join(2170..2329,4661..4752,5008..5091,5171..5266, 5889..6037,6311..6404,6658..6864,6941..7204,8527..8612, 10739..10850,11566..11723,11829..11926,12854..12861) /gene="DM kinase" /note="found in heart cDNA clones, contains exon 8a, lacking exons 13 and 14 resulting in frameshift and early translation" /codon_start=1 /product="myotonic dystrophy kinase" /db_xref="PID:g181606" /translation="MSAEVRLRRLQQLVLDPGFLGLEPLLDLLLGVHQELGASELAQD KYVADFLQWAEPIVVRLKEVRLQRDDFEILKVIGRGAFSEVAVVKMKQTGQVYAMKIM NKWDMLKRGEVSCFREERDVLVNGDRRWITQLHFAFQDENYLYLVMEYYVGGDLLTLL SKFGERIPAEMARFYLAEIVMAIDSVHRLGYVHRDIKPDNILLDRCGHIRLADFGSCL KLRADGTVRSLVAVGTPDYLSPEILQAVGGGPGTGSYGPECDWWALGVFAYEMFYGQT PFYADSTAETYGKIVHYKEHLSLPLVDEGVPEEARDFIQRLLCPPETRLGRGGAGDFR THPFFFGLDWDGLRDSVPPFTPDFEGATDTCNFDLVEDGLTAMVSGGGETLSDIREGA PLGVHLPFVGYSYSCMALRDSEVPGPTPMEVEAEQLLEPHVQAPSLEPSVSPQDETAE VAVPAAVPAAEAEAEVTLRELQEALEEEVLTRQSLSREMEAIRTDNQNFASQLREAEA RNRDLEAHVRQLQERMELLQAEGATGP" CDS join(2170..2329,4661..4752,5008..5091,5171..5266, 5889..6037,6311..6404,6658..6864,6941..7189,8527..8612, 10739..10850,11566..11723,11829..11926,12098..12144, 12434..12523,12854..13006) /gene="DM kinase" /note="CDS using shorter exon 8a" /codon_start=1 /product="myotonic dystrophy kinase" /db_xref="PID:g181603" /translation="MSAEVRLRRLQQLVLDPGFLGLEPLLDLLLGVHQELGASELAQD KYVADFLQWAEPIVVRLKEVRLQRDDFEILKVIGRGAFSEVAVVKMKQTGQVYAMKIM NKWDMLKRGEVSCFREERDVLVNGDRRWITQLHFAFQDENYLYLVMEYYVGGDLLTLL SKFGERIPAEMARFYLAEIVMAIDSVHRLGYVHRDIKPDNILLDRCGHIRLADFGSCL KLRADGTVRSLVAVGTPDYLSPEILQAVGGGPGTGSYGPECDWWALGVFAYEMFYGQT PFYADSTAETYGKIVHYKEHLSLPLVDEGVPEEARDFIQRLLCPPETRLGRGGAGDFR THPFFFGLDWDGLRDSVPPFTPDFEGATDTCNFDLVEDGLTAMETLSDIREGAPLGVH LPFVGYSYSCMALRDSEVPGPTPMEVEAEQLLEPHVQAPSLEPSVSPQDETAEVAVPA AVPAAEAEAEVTLRELQEALEEEVLTRQSLSREMEAIRTDNQNFASQLREAEARNRDL EAHVRQLQERMELLQAEGATAVTGVPSPRATDPPSHLDGPPAVAVGQCPLVGPGPMHR RHLLLPARVPRPGLSEALSLLLFAVVLSRAAALGCIGLVAHAGQLTAVWRRPGAARAP " CDS join(2170..2329,4661..4752,5008..5091,5171..5266, 5889..6037,6311..6404,6658..6864,6941..7204,8527..8612, 10739..10850,11566..11723,11829..11926,12098..12144, 12434..12523,12854..13006) /gene="DM kinase" /note="CDS using longer exon 8b" /codon_start=1 /product="myotonic dystrophy kinase" /db_xref="PID:g181604" /translation="MSAEVRLRRLQQLVLDPGFLGLEPLLDLLLGVHQELGASELAQD KYVADFLQWAEPIVVRLKEVRLQRDDFEILKVIGRGAFSEVAVVKMKQTGQVYAMKIM NKWDMLKRGEVSCFREERDVLVNGDRRWITQLHFAFQDENYLYLVMEYYVGGDLLTLL SKFGERIPAEMARFYLAEIVMAIDSVHRLGYVHRDIKPDNILLDRCGHIRLADFGSCL KLRADGTVRSLVAVGTPDYLSPEILQAVGGGPGTGSYGPECDWWALGVFAYEMFYGQT PFYADSTAETYGKIVHYKEHLSLPLVDEGVPEEARDFIQRLLCPPETRLGRGGAGDFR THPFFFGLDWDGLRDSVPPFTPDFEGATDTCNFDLVEDGLTAMVSGGGETLSDIREGA PLGVHLPFVGYSYSCMALRDSEVPGPTPMEVEAEQLLEPHVQAPSLEPSVSPQDETAE VAVPAAVPAAEAEAEVTLRELQEALEEEVLTRQSLSREMEAIRTDNQNFASQLREAEA RNRDLEAHVRQLQERMELLQAEGATAVTGVPSPRATDPPSHLDGPPAVAVGQCPLVGP GPMHRRHLLLPARVPRPGLSEALSLLLFAVVLSRAAALGCIGLVAHAGQLTAVWRRPG AARAP" CDS join(2170..2329,4661..4752,5008..5091,5171..5266, 5889..6037,6311..6404,6658..6864,6941..7189,8527..8612, 10739..10850,11566..11723,11829..11926,12854..12861) /gene="DM kinase" /note="found in heart cDNA clones, contains exon 8a, lacking exons 13 and 14 resulting in frameshift and early translation termination" /codon_start=1 /product="myotonic dystrophy kinase" /db_xref="PID:g181605" /translation="MSAEVRLRRLQQLVLDPGFLGLEPLLDLLLGVHQELGASELAQD KYVADFLQWAEPIVVRLKEVRLQRDDFEILKVIGRGAFSEVAVVKMKQTGQVYAMKIM NKWDMLKRGEVSCFREERDVLVNGDRRWITQLHFAFQDENYLYLVMEYYVGGDLLTLL SKFGERIPAEMARFYLAEIVMAIDSVHRLGYVHRDIKPDNILLDRCGHIRLADFGSCL KLRADGTVRSLVAVGTPDYLSPEILQAVGGGPGTGSYGPECDWWALGVFAYEMFYGQT PFYADSTAETYGKIVHYKEHLSLPLVDEGVPEEARDFIQRLLCPPETRLGRGGAGDFR THPFFFGLDWDGLRDSVPPFTPDFEGATDTCNFDLVEDGLTAMETLSDIREGAPLGVH LPFVGYSYSCMALRDSEVPGPTPMEVEAEQLLEPHVQAPSLEPSVSPQDETAEVAVPA AVPAAEAEAEVTLRELQEALEEEVLTRQSLSREMEAIRTDNQNFASQLREAEARNRDL EAHVRQLQERMELLQAEGATGP" intron 2330..4660 /gene="DM kinase" /number=1 exon 4661..4752 /gene="DM kinase" /note="putative" /number=2 intron 4753..5007 /gene="DM kinase" /number=2 exon 5008..5091 /gene="DM kinase" /note="putative" /number=3 intron 5092..5170 /gene="DM kinase" /number=3 exon 5171..5266 /gene="DM kinase" /note="putative" /number=4 intron 5267..5888 /gene="DM kinase" /number=4 allele replace(5285,"t") /gene="DM kinase" /note="a G to T polymorphism in intron 4 resulting in a polymorphic HphI site" /frequency="0.87" exon 5889..6037 /gene="DM kinase" /note="putative" /number=5 intron 6038..6310 /gene="DM kinase" /number=5 allele replace(6043,"c") /gene="DM kinase" /note="a C to T change in intron 5 results in a polymorphic HhaI site" /frequency="0.45" exon 6311..6404 /gene="DM kinase" /note="putative" /number=6 intron 6405..6657 /gene="DM kinase" /number=6 exon 6658..6864 /gene="DM kinase" /note="putative" /number=7 intron 6865..6940 /gene="DM kinase" /number=7 exon 6941..7204 /gene="DM kinase" /note="longer version of exon 8, including last 15 bp (ie 5 codons); putative" exon 6941..7189 /gene="DM kinase" /note="shorter version of exon 8, with last 15 bp deleted (ie. 5 codons); putative" intron one-of(7205..8526,7190..8526) /gene="DM kinase" allele 8157 /gene="DM kinase" /note="submitted sequence is smaller of two alleles (freq. 0.47)resulting from a 1 kb deletion of 3 Alu elements from larger allele. DM is in total linkage disequilibrium with this larger allele" /frequency="0.53" exon 8527..8612 /gene="DM kinase" /note="putative" /number=9 intron 8613..10738 /gene="DM kinase" /number=9 allele replace(10693,"t") /gene="DM kinase" /note="a T to G change in intron 9 results in a polymorphic HinfI site" /frequency="0.47" exon 10739..10850 /gene="DM kinase" /note="putative" /number=10 intron 10851..11565 /gene="DM kinase" /number=10 exon 11566..11723 /gene="DM kinase" /note="putative" /number=11 intron 11724..11828 /gene="DM kinase" /number=11 allele replace(11780,"g") /gene="DM kinase" /note="a G to T change results in a polymorphic Fnu4HI site" /frequency="0.55" exon 11829..11926 /gene="DM kinase" /note="putative" /number=12 intron 11927..12853 /gene="DM kinase" /note="found in alternate transcripts in heart tissue" intron 11927..12097 /gene="DM kinase" /number=12 exon 12098..12144 /gene="DM kinase" /note="exon 13 along with exon 14 is deleted in some cDNA clones derived from a human heart cDNA library, resulting in a frameshift and earlier translational stop codon.; putative" /number=13 intron 12143..12433 /gene="DM kinase" /number=13 exon 12434..12523 /gene="DM kinase" /note="exon 14 along with exon 13 is deleted from some cDNA clones derived from a human heart cDNA library. This results in a frameshift and an earlier translational stop codon.; putative" /number=14 intron 12524..12853 /gene="DM kinase" /number=14 allele replace(12581,"t") /gene="DM kinase" /note="a G to T sequence change in intron 14" /frequency="0.45" exon 12854..13747 /gene="DM kinase" /note="putative" /number=15 mutation replace(13230,"ctg") /gene="DM kinase" /note="Polymorphic trinucleotiderepeat ranging from 5 to 30 in normal population. In DM individuals, repeat is unstable with copy number ranging from 50 to more than 2000" polyA_signal 13725..13730 /gene="DM kinase" /note="putative" BASE COUNT 2681 a 4012 c 4207 g 2847 t ORIGIN 1 ggatccgcca aggactttga ttattgcgtg aaagtgctga ctgccaggac aggaagctag 61 ctaagatgca agttcccagc ctagagcagt ggcctctggg gggtctaggg cggacccaag 121 ggcaaggcca gggtggcagc agcttgggga ctctggctgg ctccctcccc tgacactggc 181 tgaagcccag gtggtctcta acccctccca tctctccctc tcatcttccc cagggcatct 241 cctcccaacc aggcaactcc ccgagtggca cagtggtgtg aagccatgga tatcgggccc 301 ccccaacccc atgcccccag cctcctagcc ataaccctcc ctgctgacct cacagatcaa 361 cgtattaaca agactaacca tgatggatgg actgctccag tccccccacc tgcacaaaat 421 ttgggggccc cccagactgg cccggacacg ggcgatgtaa tagcccttgt ggcctcagcc 481 ttgtccccca cccactgcca agtacaatga cctcttcctc tgaaacatca gtgttaccct 541 catccctgtc cccagcatgt gactggtcac tcctggggag acactccccg cccctgccac 601 aagagcccca ggtctgcagt gtgcccctca gttgagtggg cagggccggg ggtggtccag 661 ccctcgcccg gcccccaccc cagctgccct tgctattgtc tgtgcttttg aagagtgtta 721 aattatggaa gcccctcagg ttcctccctg tcccgcagga cctcttattt atactaaagt 781 tccctgtttt ctcagcgggt ctgtcccctt cggaggagat gatgtagagg acctgtgtgt 841 gtactctgtg gttctaggca gtccgctttc cccagaggag gagtgcaggc ctgctcccag 901 cccagcgcct cccacccctt ttcatagcag gaaaagccgg agcccaggga gggaacggac 961 ctgcgagtca cacaactggt gacccacacc agcggctgga gcaggaccct cttggggaga 1021 agagcatcct gcccgcagcc agggcccctc atcaaagtcc tcggtgtttt ttaaattatc 1081 agaactgccc aggaccacgt ttcccaggcc ctgcccagct gggactcctc ggtccttgcc 1141 tcctagtttc tcaggcctgg ccctctcaag gcccaggcac cccaggccgg ttggaggccc 1201 cgacttccac tctggagaac cgtccaccct ggaaagaaga gctcagattc ctcttggctc 1261 tcggagccgc agggagtgtg tcttcccgcg ccaccctcca ccccccgaaa tgtttctgtt 1321 tctaatccca gcctgggcag gaatgtggct ccccggccag gggccaagga gctattttgg 1381 ggtctcgttt gcccagggag ggcttggctc caccactttc ctcccccagc ctttgggcag 1441 caggtcaccc ctgttcaggc tctgagggtg ccccctcctg gtcctgtcct caccacccct 1501 tccccacctc ctgggaaaaa aaaaaaaaaa aaaaaaaaag ctggtttaaa gcagagagcc 1561 tgagggctaa atttaactgt ccgagtcgga atccatctct gagtcaccca agaagctgcc 1621 ctggcctccc gtccccttcc caggcctcaa cccctttctc ccacccagcc ccaaccccca 1681 gccctcaccc cctagccccc agttctggag cttgtcggga gcaagggggt ggttgctact 1741 gggtcactca gcctcaattg gccctgttca gcaatgggca ggttcttctt gaaattcatc 1801 acacctgtgg cttcctctgt gctctacctt tttattgggg tgacagtgtg acagctgaga 1861 ttctccatgc attcccccta ctctagcact gaagggttct gaagggccct ggaaggaggg 1921 agcttggggg gctggcttgt gaggggttaa ggctgggagg cgggaggggg gctggaccaa 1981 ggggtgggga gaaggggagg aggcctcggc cggccgcaga gagaagtggc cagagaggcc 2041 caggggacag ccagggacag gcagacatgc agccagggct ccagggcctg gacaggggct 2101 gccaggccct gtgacaggag gaccccgagc ccccggcccg gggaggggcc atggtgctgc 2161 ctgtccaaca tgtcagccga ggtgcggctg aggcggctcc agcagctggt gttggacccg 2221 ggcttcctgg ggctggagcc cctgctcgac cttctcctgg gcgtccacca ggagctgggc 2281 gcctccgaac tggcccagga caagtacgtg gccgacttct tgcagtgggg tgagtgccta 2341 ccctcggggc tcctgcagat ggggtggggg tggggcagca gacagctctg ggcacagagg 2401 cctggctgtt gggggggggc agcatggcag gatgggcatg gggagatcct cccatcctgg 2461 ggctcagagt gtggacctgg gccctggggc aacatttctc tgtcctatgc caccactctg 2521 gaggggcaga gtaaggtcag cagaggctag ggtggctgtg actcagagcc atggcttagg 2581 agtcacagca ggctaggctg ccaacagcct cccatggcct ctctgcaccc cgcctcaggg 2641 tcagggtcag ggtcatgctg ggagctccct ctcctaggac cctcccccca aaagtgggct 2701 ctatggccct ctcccctggt ttcctgtggc ctggggcaag ccaggagggc cagcatgggg 2761 cagctgccag gggcgcagcc gacaggcagg tgttcggcgc cagcctctcc agctgcccca 2821 acaggtgccc aggcgctggg agggcggtga ctcacgcggg ccctgtggga gaaccagctt 2881 tgcagacagg cgccaccagt gccccctcct ctgcgatcca ggagggacaa ctttgggttc 2941 ttctgggtgt gtctccttct ttagtaggtt ctgcacccac ccccaccccc agccccaaag 3001 tctcggttcc tatgagccgt gtgggtcaga caccattccc gccaccccgg gtccctgcgt 3061 cctttagttc tcctggccca gggcctccaa ccttccagct gtcccacaaa accccttctt 3121 gcaagggctt tccagggcct ggggccaggg ctggaaggag gatgcttccg cttctgccag 3181 ctgccttgtc tgcccaacct cctccccaag cccaggactc gggctcactg gtcactggtt 3241 tctttcattc ccagcaccct gctcctctgg ccctcatatg tctggccctc agtgactggt 3301 gtttggtttt tgggctgtgt gtaacaaact gtgtgtgaca cttgtttcct gtttctccgc 3361 cttcccctgc ttcctcttgt gtccatctct ttctgaccca ggcctggttc ctttccctcc 3421 tcctcccatt tcacagatgg gaaggtggcg gccaagaagg gccaggccat tcagcctctg 3481 gaaaaacctt ctcccaacct cccacagccc ctaatgactc tcctggcctc cctttagtag 3541 aggatgaagt tgggttggca gggtaaactg agaccgggtg gggtaggggt ctggcgctcc 3601 cgggaggagc actccttttg tggcccgagc tgcatctcgc ggcccctccc ctgccaggcc 3661 tggggcgggg gagggggcca gggttcctgc tgccttaaaa gggctcaatg tcttggctct 3721 ctcctccctc ccccgtcctc agccctggct ggttcgtccc tgctggccca ctctcccgga 3781 accccccgga acccctctct ttcctccaga acccactgtc tcctctcctt ccctcccctc 3841 ccatacccaa ccctctctcc atcctgtcct ccacttcttc cacccccggg agagccaggc 3901 ctcccctgtg ccccacagtg ccctgaggcc acaagcctcc accccagctg gtccccaccc 3961 aggctgccca gtttaacatt cctagtcata ggaccttgac ttctgagagg cctgattgtc 4021 atctgtaaat aaggggtagg actaaagcac tcctcctgga ggactgagag atgggctgga 4081 ccggagcact tgagtctggg atatgtgacc atgctacctt tgtctccctg tcctgttcct 4141 tcccccagcc ccaaatccag ggttttccaa agtgtggttc aagaaccacc tgcatctgaa 4201 tctagaggta ctggatacaa ccccacgtct gggccgttac ccaggacatt ctacatgaga 4261 acgtgggggt ggggccctgg ctgcacctga actgtcacct ggagtcaggg tggaaggtgg 4321 aagaactggg tcttatttcc ttctcccctt gttctttagg gtctgtcctt ctgcagactc 4381 cgttacccca ccctaaccat cctgcacacc cttggagccc tctgggccaa tgccctgtcc 4441 cgcaaagggc ttctcaggca tctcacctct atgggagggc atttttggcc cccagaacct 4501 tacacggtgt ttatgtgggg aagcccctgg gaagcagaca gtcctagggt gaagctgaga 4561 ggcagagaga aggggagaca gacagagggt ggggctttcc cccttgtctc cagtgccctt 4621 tctggtgacc ctcggttctt ttcccccacc acccccccag cggagcccat cgtggtgagg 4681 cttaaggagg tccgactgca gagggacgac ttcgagattc tgaaggtgat cggacgcggg 4741 gcgttcagcg aggtaagccg aaccgggcgg gagcctgact tgactcgtgg tgggcggggc 4801 ataggggttg gggcgggccc ttagaaattg atgaatgacc gagccttaga acctagggct 4861 gggctggagg cggggcttgg gaccaatggg cgtggtgtgg caggtggggc ggggccacgg 4921 ctgggtgcag aagcgggtgg agttgggtct gggcgagccc ttttgttttc ccgccgtctc 4981 cactctgtct cactatctcg acctcaggta gcggtagtga agatgaagca gacgggccag 5041 gtgtatgcca tgaagatcat gaacaagtgg gacatgctga agaggggcga ggtgaggggc 5101 tgggcggacg tggggggctt tgaggatccg cgccccgtct ccggctgcag ctcctccggg 5161 tgccctgcag gtgtcgtgct tccgtgagga gagggacgtg ttggtgaatg gggaccggcg 5221 gtggatcacg cagctgcact tcgccttcca ggatgagaac tacctggtga gctccgggcc 5281 ggggggacta ggaagaggga caagagcccg tgctgtcact ggacgaggag gtggggagag 5341 gaagctctag gattgggggt gctgcccgga aacgtctgtg ggaaagtctg tgtgcggtaa 5401 gagggtgtgt caggtggatg aggggccttc cctatctgag acggggatgg tgtccttcac 5461 tgcccgtttc tggggtgatc tgggggactc ttataaagat gtctctgttg cggggggtct 5521 cttacctgga atgggatagg tcttcaggaa ttctaacggg gccactgcct agggaaggag 5581 tgtctgggac ctattctctg ggtgttgggt ggcctctggg ttctctttcc cagaacatct 5641 cagggggagt gaatctgccc agtgacatcc caggaaagtt tttttgtttg tgtttttttt 5701 tgaggggcgg gggcgggggc cgcaggtggt ctctgatttg gcccggcaga tctctatggt 5761 tatctctggg ctggggctgc aggtctctgc ccaaggatgg ggtgtctctg ggaggggttg 5821 tcccagccat ccgtgatgga tcagggcctc aggggactac caaccaccca tgacgaaccc 5881 cttctcagta cctggtcatg gagtattacg tgggcgggga cctgctgaca ctgctgagca 5941 agtttgggga gcggattccg gccgagatgg cgcgcttcta cctggcggag attgtcatgg 6001 ccatagactc ggtgcaccgg cttggctacg tgcacaggtg ggtgcagcat ggccgagggg 6061 atagcaagct tgttccctgg ccgggttctt ggaaggtcag agcccagaga ggccagggcc 6121 tggagaggga ccttcttggt tggggcccac cggggggtgc ctgggagtag gggtcagaac 6181 tgtagaagcc ctacaggggc ggaacccgag gaagtggggt cccaggtggc actgcccgga 6241 ggggcggagc ctggtgggac cacagaaggg aggttcattt atcccaccct tctcttttcc 6301 tcccgtgcag ggacatcaaa cccgacaaca tcctgctgga ccgctgtggc cacatccgcc 6361 tggccgactt cggctcttgc ctcaagctgc gggcagatgg aacggtgagc cagtgccctg 6421 gccacagagc aactggggct gctgatgagg gatggaaggc acagagtgtg ggagcgggac 6481 tggatttgga ggggaaaaga ggtggtgtga cccaggctta agtgtgcatc tgtgtggcgg 6541 agtattagac caggcagagg gaggggctaa gcatttgggg agtggttgga aggagggccc 6601 agagctggtg ggcccagagg ggtgggccca agcctcgctc tgctcctttt ggtccaggtg 6661 cggtcgctgg tggctgtggg caccccagac tacctgtccc ccgagatcct gcaggctgtg 6721 ggcggtgggc ctgggacagg cagctacggg cccgagtgtg actggtgggc gctgggtgta 6781 ttcgcctatg aaatgttcta tgggcagacg cccttctacg cggattccac ggcggagacc 6841 tatggcaaga tcgtccacta caaggtgagc acggccgcag ggagacctgg cctctcccgg 6901 taggcgctcc caggctatcg cctcctctcc ctctgagcag gagcacctct ctctgccgct 6961 ggtggacgaa ggggtccctg aggaggctcg agacttcatt cagcggttgc tgtgtccccc 7021 ggagacacgg ctgggccggg gtggagcagg cgacttccgg acacatccct tcttctttgg 7081 cctcgactgg gatggtctcc gggacagcgt gccccccttt acaccggatt tcgaaggtgc 7141 caccgacaca tgcaacttcg acttggtgga ggacgggctc actgccatgg tgagcggggg 7201 cggggtaggt acctgtggcc cctgctcggc tgcgggaacc tccccatgct ccctccataa 7261 agttggagta aggacagtgc ctaccttctg gggtcctgaa tcactcattc cccagagcac 7321 ctgctctgtg cccatctact actgaggacc cagcagtgac ctagacttac agtccagtgg 7381 gggaacacag agcagtcttc agacagtaag gccccagagt gatcagggct gagacaatgg 7441 agtgcagggg gtgggggact cctgactcag caaggaaggt cctggagggc tttctggagt 7501 ggggagctat ctgagctgag acttggaggg atgagaagca ggagaggact cctcctccct 7561 taggccgtct ctcttcaccg tgtaacaagc tgtcatggca tgcttgctcg gctctgggtg 7621 cccttttgct gaacaatact ggggatccag cacggaccag atgagctctg gtccctgccc 7681 tcatccagtt gcagtctaga gaattagaga attatggaga gtgtggcagg tgccctgaag 7741 ggaagcaaca ggatacaaga aaaaatgatg ggcggcaggc aacgggtggg ctcacgcctg 7801 taacccccag caatttggca ggccgaagtg ggtggattgc ttgagcccag gagttcgaga 7861 ccagcctggg caatgtggtg agacccccgt ctctacaaaa atgttttaaa aattggttgg 7921 gcgtggtggc gcatgcctgt atactcagct actagggtgg ccgacgtggg cttgagccca 7981 ggaggtcaag gctgcagtga gctgtgattg tgccactgca ctccagcctg ggcaacggag 8041 agagactctg tctcaaaaat aagataaact gaaattaaaa aataggctgg gctggccggg 8101 cgtggtggct cacgcctgta atctcagcac tttgggaggc cgaggcgggt ggatcacgag 8161 gtcagaagat ggagaccagc ctggccagcg tggcgaaacc ccgtctctac ccaaaaatat 8221 aaaaaattag ccaggcgtgg tagagggcgc ctgtaatctc agctactcag gacgctgagg 8281 caggagaatc gcctgaacct gggaggcgga ggttgcagtg agctgagatt gcaccactgc 8341 actccagcct gggtaacaga gcgagactcc gtatcaaaga aaaagaaaaa agaaaaaatg 8401 ctggaggggc cactttagat aacccctgag ttggggctgg tttgggggga acatgtaagc 8461 caagatccaa aagcagtgag gggcccgccc tgacgactgc tgctcacatc tgtgtgtctt 8521 gcgcaggaga cactgtcgga cattcgggaa ggtgcgccgc taggggtcca cctgcctttt 8581 gtgggctact cctactcctg catggccctc aggtaagcac tgccctggac ggcctccagg 8641 ggacacgagg ctgcttgagc ttcctgggtc ctgctccttg gcagccaatg gagttgcagg 8701 atcagtcttg gaacctcact gtttggggcc cacagactcc taagaggcca gagttggagg 8761 accttaaatt tctcagatct atgtacttca aatgttagat tgaattttaa aacctcagag 8821 tcacagactg ggcttcccag aatcttgtaa ccattaactt ttacgtctgt agtacacaga 8881 gccacaggac ttcagaactt ggcaaatatg aagtttagac ttttacaatc agttgtaaaa 8941 gaatgcaaat tctttgaatc agccatataa caataaggcc atttaaaagt attaatttag 9001 gcgggccgcg gtggctcacg cctgtaatcc tagcactttg ggaggccaag gcaggtggat 9061 catgaggtca ggagatcgag accatcctgg ctaacacggt gaaaccccgt ctctactaaa 9121 aatacaaaaa aattagccgg gcatggtggc gggcgcttgc ggtcccagct acttgggagg 9181 cgaggcagga gaatggcatg aacccgggag gcggagcttg cagtgagccg agatcatgcc 9241 actgcactcc agcctgggcg acagagcaag actccgtctc aaaaaaaaaa aaaaaaaagt 9301 ttttatttag gccgggtgtg gcggctcacg cctgtaatcc agtgctttgg gaggatgagg 9361 tgggtggatc actgaggtca ggagttcgag accagcctga ccacgtggag aaacctcatc 9421 tctactaaaa aacaaaatta gccaggcgtg gtggcatata cctgtaatcc cagctactca 9481 ggaggctgag gcaggagaat cagaacccag gagggggagg ttgtggtgag ctgagatcgt 9541 gccattgcat tccagcctgg gcaacaagag tgaaacttca tctccaaaaa aaaaaaaaaa 9601 aagtactaaa tttacaggct gggcatggtg gctcacgctt ggaatcccag cactttggga 9661 ggctgaagtg gacggattgc ttcagcccag gagttcaaga ccagcctgag caacataatg 9721 agaccctgtc tctaccaaaa attgaaaaaa tcgtgccagg catggtggtc tgtgcctgca 9781 gtcctagcta ctcaggagtc tgaagtagga gaatcacttg agcctggagt ttgaggcttc 9841 agtgagccat gatagattcc agcctaggca acaaagtgag acctggtctc aacaaaagta 9901 ttaattacac aaataatgca ttgcttatca caagtaaatt agaaaataca gataaggaaa 9961 aggaagttga tatctcgtga gctcaccaga tgggcagtgg tccctggctc acacgtgtac 10021 tgacacatgt ttaaatagtg gagaacaggt gtttttttgg tttgtttttt tccccttcct 10081 catgctactt tgtctaagag aacagttggt tttctagtca gcttttatta ctgggcaaca 10141 ttacacatac tataccttat cattaatgaa ctccagcttg attctgaacc gctgcggggc 10201 ctgaacggtg ggtcaggatt gaacccatcc tctattagaa cccaggcgca tgtccaggat 10261 agctaggtcc tgagccgtgt tcccacagga gggactgctg ggttggaggg gacagccact 10321 tcatacccca gggaggagct gtccccttcc cacagctgag tggggtgtgc tgacctcaag 10381 ttgccatctt ggggtcccat gcccagtctt aggaccacat ctgtggaggt ggccagagcc 10441 aagcagtctc cccatcaggt cggcctccct gtcctgaggc cctgagaaga ggggtctgca 10501 gaaggtttag aaagagcagc tcccaggggc ccaaggccag gagaggggca gggcttttcc 10561 taagcagagg aggggctatt ggcctacctg ggactctgtt ctcttcgctc tgctgctccc 10621 cttcctcaaa tcaggaggtc ttggaagcag ctgcccctac ccacaggcca gaagttctgg 10681 ttctccacca gagaatcagc attctgtctc cctccccact ccctcctcct ctccccaggg 10741 acagtgaggt cccaggcccc acacccatgg aagtggaggc cgagcagctg cttgagccac 10801 acgtgcaagc gcccagcctg gagccctcgg tgtccccaca ggatgaaaca gtaagttggt 10861 ggaggggagg gggtccgtca gggacaattg ggagagaaaa ggtgagggct tcccgggtgg 10921 cgtgcactgt agagccctct agggacttcc tcgaacagaa gcagacagaa accacggaga 10981 gacgaggtta cttcagacat gggacggtct ctgtagttac agtggcgcat taagtaaggg 11041 tgtgtgtgtt gctggcgatc tgagaagtcg atctttgagc tgagcgctgg tgaaggagaa 11101 acaagccatg gaaggaaagg tgccaagtgg tcaggcgaga gcctccaggg caaaggcctt 11161 gggcaggtgg gaatcctgat ttgttcctga aaggtagttt gtctgagtca ctacctgaga 11221 aggctggaga ggccagcagg aaacacaacc cagcacggcc tgttgtcgtg tgggcactag 11281 ggagctggag ggattttgag caccagaggg acatagggtg tgttagtgtg tgagcaccag 11341 ccctctggtg ccctgtgtag atttagagga ccagactcag ggatgggtct gagggaggta 11401 gagaagggag ggggcttgga tcattgcagg agctatgggg attccagaaa tgttgagggg 11461 gcggaggagt aggggataaa caaggattcc tagcctggaa ccagtgtcca agtcctgagt 11521 cttccaggag ccacaggcag ccttaagcct ggtccccaca cacaggctga agtggcagtt 11581 ccagcggctg tccctgcggc agaggctgag gccgaggtga cgctgcggga gctccaggaa 11641 gccctggagg aggaggtgct cacccggcag agcctgagcc gggagatgga ggccatccgc 11701 acggacaacc agaacttcgc caggtcggga tcggggccgg ggccggggcc gggatgcggg 11761 ccggtggcaa cccttggcat cccctctcgt ccggcccgga cggactcacc gtccttacct 11821 ccccacagtc aactacgcga ggcagaggct cggaaccggg acctagaggc acacgtccgg 11881 cagttgcagg agcggatgga gttgctgcag gcagagggag ccacaggtga gtccctcatg 11941 tgtccccttc cccggaggac cgggaggagg tgggccgtct gctccgcggg gcgtgtatag 12001 acacctggag gagggaaggg acccacgctg gggcacgccg cgccaccgcc ctccttcgcc 12061 cctccacgcg ccctatgcct ctttcttctc cttccagctg tcacgggggt ccccagtccc 12121 cgggccacgg atccaccttc ccatgtaaga cccctctctt tcccctgcct cagacctgct 12181 gcccattctg cagatcccct ccctggctcc tggtctcccc gtccagatat agggctcacc 12241 ctacgtcttt gcgactttag agggcagaag ccctttattc agccccagat ctccctccgt 12301 tcaggcctca ccagattccc tccgggatct ccctagataa cctccccaac ctcgattccc 12361 ctcgctgtct ctcgccccac cgctgagggc tgggctgggc tccgatcggg tcacctgtcc 12421 cttctctctc cagctagatg gccccccggc cgtggctgtg ggccagtgcc cgctggtggg 12481 gccaggcccc atgcaccgcc gccacctgct gctccctgcc agggtacgtc cggctgccca 12541 cgcccccctc cgccgtcgcg ccccgcgctc cacccgcccc gtgccacccg cttagctgcg 12601 catttgcggg gctgggccca cggcaggagg gcggatcttc gggcagccaa tcaacacagg 12661 ccgctaggaa gcagccaatg acgagttcgg acgggattcg aggcgtgcga gtggactaac 12721 aacagctgta ggctgttggg gcgggggcgg ggcgcaggga agagtgcggg cccacctatg 12781 ggcgtaggcg gggcgagtcc caggagccaa tcagaggccc atgccgggtg ttgacctcgc 12841 cctctccccg caggtcccta ggcctggcct atcggaggcg ctttccctgc tcctgttcgc 12901 cgttgttctg tctcgtgccg ccgccctggg ctgcattggg ttggtggccc acgccggcca 12961 actcaccgca gtctggcgcc gcccaggagc cgcccgcgct ccctgaaccc tagaactgtc 13021 ttcgactccg gggccccgtt ggaagactga gtgcccgggg cacggcacag aagccgcgcc 13081 caccgcctgc cagttcacaa ccgctccgag cgtgggtctc cgcccagctc cagtcctgtg 13141 taccgggccc gccccctagc ggccggggag ggaggggccg ggtccgcggc cggcgaacgg 13201 ggctcgaagg gtccttgtag ccgggaatgc tgctgctgct gctgctgctg ctgctgctgc 13261 tggggggatc acagaccatt tctttctttc ggccaggctg aggccctgac gtggatgggc 13321 aaactgcagg cctgggaagg cagcaagccg ggccgtccgt gttccatcct ccacgcaccc 13381 ccacctatcg ttggttcgca aagtgcaaag ctttcttgtg catgacgccc tgctctgggg 13441 agcgtctggc gcgatctctg cctgcttact cgggaaattt gcttttgcca aacccgcttt 13501 ttcggggatc ccgcgccccc ctcctcactt gcgctgctct cggagcccca gccggctccg 13561 cccgcttcgg cggtttggat atttattgac ctcgtcctcc gactcgctga caggctacag 13621 gacccccaac aaccccaatc cacgttttgg atgcactgag accccgacat tcctcggtat 13681 ttattgtctg tccccaccta ggacccccac ccccgaccct cgcgaataaa aggccctcca 13741 tctgccc // LOCUS HUMDNABP 1538 bp mRNA PRI 23-JUL-1997 DEFINITION Homo sapiens DNA-binding protein mRNA, complete cds. ACCESSION M91196 NID g2275152 KEYWORDS DNA-binding protein; ICSBP; interferon consensus sequence binding protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1538) AUTHORS Weisz,A., Marx,P., Sharf,R., Appella,E., Driggers,P.H., Ozato,K. and Levi,B.Z. TITLE Human interferon consensus sequence binding protein is a negative regulator of enhancer elements common to interferon-inducible genes JOURNAL J. Biol. Chem. 267 (35), 25589-25596 (1992) MEDLINE 93094284 REFERENCE 2 (bases 1 to 1538) AUTHORS Levi,B.-Z. TITLE Direct Submission JOURNAL Submitted (27-APR-1993) Dept. of Food Engineering & Biotechnology, Technion, Haifa 32,000, Israel REFERENCE 3 (bases 1 to 1538) AUTHORS Schmidt,M. TITLE Direct Submission JOURNAL Submitted (23-JUL-1997) Innere Medizin mS. Haematologie/Onkologie, Virchow Klinikum der HU Berlin, Forschungshaus, Hs 37, R. 2.0314, Augustenburger Platz 1, Berlin 13353, Germany REMARK Sequence update FEATURES Location/Qualifiers source 1..1538 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 48..1328 /codon_start=1 /function="involved in the transcription regulation of interferon-inducible genes" /product="DNA-binding protein" /db_xref="PID:g2275153" /translation="MCDRNGGRRLRQWLIEQIDSSMYPGLIWENEEKSMFRIPWKHAG KQDYNQEVDASIFKAWAVFKGKFKEGDKAEPATWKTRLRCALNKSPDFEEVTDRSQLD ISEPYKVYRIVPEEEQKCKLGVATAGCVNEVTEMECGRSEIDELIKEPSVDDYMGMIK RSPSPPEACRSQLLPDWWAQQPSTGVPLVTGYTTYDAHHSAFSQMVISFYYGGKLVGQ ATTTCPEGCRLSLSQPGLPGTKLYGPEGLELVRFPPADAIPSERQRQVTRKLFGHLER GVLLHSSRQGVFVKRLCQGRVFCSGNAVVCKGRPNKLERDEVVQVFDTSQFFRELQQF YNSQGRLPDGRVVLCFGEEFPDMAPLRSKLILVQIEQLYVRQLAEEAGKSCGAGSVMQ APEEPPPDQVFRMFPDICASHQRSFFRENQQITV" BASE COUNT 333 a 402 c 495 g 308 t ORIGIN 1 atggatgggg gaaccgggcg gcgagacggc ggcaggacgg cggcaggatg tgtgaccgga 61 atggtggtcg gcggcttcga cagtggctga tcgagcagat tgacagtagc atgtatccag 121 gactgatttg ggagaatgag gagaagagca tgttccggat cccttggaaa cacgctggca 181 agcaagatta taatcaggaa gtggatgcct ccatttttaa ggcctgggca gtttttaaag 241 ggaagtttaa agaaggggac aaagctgaac cagccacttg gaagacgagg ttacgctgtg 301 ctttgaataa gagcccagat tttgaggaag tgacggaccg gtcccaactg gacatttccg 361 agccatacaa agtttaccga attgttcctg aggaagagca aaaatgcaaa ctaggcgtgg 421 caactgctgg ctgcgtgaat gaagttacag agatggagtg cggtcgctct gaaatcgacg 481 agctgatcaa ggagccttct gtggacgatt acatggggat gatcaaaagg agcccttccc 541 cgccggaggc ctgtcggagt cagctccttc cagactggtg ggcgcagcag cccagcacag 601 gcgtgccgct ggtgacgggg tacaccacct acgacgcgca ccattcagca ttctcccaga 661 tggtgatcag cttctactat gggggcaagc tggtgggcca ggccaccacc acctgccccg 721 agggctgccg cctgtccctg agccagcctg ggctgcccgg caccaagctg tatgggcccg 781 agggcctgga gctggtgcgc ttcccgccgg ccgacgccat ccccagcgag cgacagaggc 841 aggtgacgcg gaagctgttc gggcacctgg agcgcggggt gctgctgcac agcagccggc 901 agggcgtgtt cgtcaagcgg ctgtgccagg gccgcgtgtt ctgcagcggc aacgccgtgg 961 tgtgcaaagg caggcccaac aagctggagc gtgatgaggt ggtccaggtc ttcgacacca 1021 gccagttctt ccgagagctg cagcagttct ataacagcca gggccggctt cctgacggca 1081 gggtggtgct gtgctttggg gaagagtttc cggatatggc ccccttgcgc tccaaactca 1141 ttctcgtgca gattgagcag ctgtatgtcc ggcaactggc agaagaggct gggaagagct 1201 gtggagccgg ctctgtgatg caggcccccg aggagccgcc gccagaccag gtcttccgga 1261 tgtttccaga tatttgtgcc tcacaccaga gatcattttt cagagaaaac caacagatca 1321 ccgtctaagt gcgtcgcttg ggcgccccac cccgtctgcg tcctgcatcc atctccctgt 1381 tacagtggcc cgcatcatga ttaaagaatg tggatccctc tgtctggggt gggatgcctt 1441 actttgcact taatttaata agggcattct cggaggagta gacgtttaat acgaagtggc 1501 gcatagccct gccgagatgt cggtgatggc ctgatgcg // LOCUS HUMDNAHEL 3888 bp DNA PRI 26-JUL-1995 DEFINITION Human DNA helicase gene, complete cds. ACCESSION L24544 NID g908916 KEYWORDS helicase. SOURCE human DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3888) AUTHORS Zhang,Q. and Montalvo,E.A. TITLE A putative DNA helicase binds the EBV BZLF1 promoter JOURNAL Unpublished (1993) REFERENCE 2 (bases 675 to 1850) AUTHORS Montalvo,E.A. TITLE Direct Submission JOURNAL Submitted (01-APR-1994) Eduardo A. Montalvo, Institute of Biotechnology, Center for Molecular Medicine, 15355 Lambda Drive, San Antonio, TX 78245, USA REFERENCE 3 (bases 1 to 3888) AUTHORS Montalvo,E.A. TITLE Direct Submission JOURNAL Submitted (10-MAY-1995) Eduardo A. Montalvo, Institute of Biotechnology, Center for Molecular Medicine, 15355 Lambda Drive, San Antonio, TX 78245, USA FEATURES Location/Qualifiers source 1..3888 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /tissue_lib="lambda gt11" CDS 36..3017 /codon_start=1 /product="DNA helicase" /db_xref="PID:g908917" /translation="MASAAVESFVTKQLDLLELERDAEVEERRSWQENISLKELQSRG VCLLKLQVSSQRTGLYGRLLVTFEPRRYGSAAALPSNSFTSGDIVGLYDAANEGSQLA TGILTRVTQKSVTVAFDESHDFQLSLDRENSYRLLKLANDVTYRRLKKALIALKKYHS GPASSLIEVLFGRSAPSPASEIHPLTFFNTCLDTSQKEAVLFALSQKELAIIHGPPGT GKTTTVVEIILQAVKQGLKVLCCAPSNIAVDNLVERLALCKQRILRLGHPARLLESIQ QHSLDAVLARSDSAQIVADIRKDIDQVFVKNKKTQDKREKSNFRNEIKLLRKELKERE EAAMLESLTSANVVLATNTGASADGPLKLLPESYFDVVVIDECAQALEASCWIPLLKA RKCILAGDHKQLPPTTVSHKAALAGLSLSLMERLAEEYGARVVRTLTVQYRMHQAIMR WASDTMYLGQLTAHSSVARHLLRDLPGVAATEETGVPLLLVDTAGCGLFELEEEDEQS KGNPGEVRLVSLHIQALVDAGVPARDIAVVSPYNLQVDLLRQSLVHRHPELEIKSVDG FQGREKEAVILSFVRSNRKGEVGFLAEDRRINVAVTRARRHVAVICDSRTVNNHAFLK TLVEYFTQHGEVRTAFEYLDDIVPENYSHENSQGSSHAATKPQGPATSTRTGSQRQEG GQEAAAPARQGRKKPAGKSLASEAPSQPSLNGGSPEGVESQDGVDHFRAMIVEFMASK KMQLEFPPSLNSHDRLRVHQIAEEHGLRHDSSGEGKRRFITVSKRAPRPRAALGPPAG TGGPAPLQPVPPTPAQTEQPPREQRGPDQPDLRTLHLERLQRVRSAQGQPASKEQQAS GQQKLPEKKKKKAKGHPATDLPTEEDFEALVSAAVKADNTCGFAKCTAGVTTLGQFCQ LCSRRYCLSHHLPEIHGCGERARAHARQRISREGVLYAGSGTKNGSLDPAKRAQLQRR LDKKLSELSNQRTSRRKERGT" BASE COUNT 857 a 1132 c 1167 g 732 t ORIGIN 1 acgtcggctt ctaggggccc aggccggcgg cggcgatggc ctcggcagct gtggagagct 61 tcgtgaccaa gcaactggac ctgctggagc ttgagagaga cgcggaggtg gaggagcgca 121 ggtcctggca ggagaacatc tctctgaaag agctccagag ccgaggcgtg tgtttgctga 181 agctgcaggt atccagccag cgcactgggc tgtacggacg gctgctggtc acctttgagc 241 ccaggcgata cgggtccgcg gcagctcttc ccagtaacag ctttacttct ggtgatatcg 301 tgggcctgta cgatgctgct aatgagggca gtcagctggc cactgggatc ttgacccggg 361 tcacccagaa gtcggtcacg gtggcctttg atgagtccca cgatttccag ttgagcttgg 421 accgagagaa ttcctacaga ctgttaaaac ttgccaatga tgtcacttac aggcgactga 481 aaaaagccct gattgctcta aagaagtatc attctggccc agcctcctca ctcatagaag 541 tgctctttgg cagatctgct cccagtcctg ccagtgaaat acacccgctg acattcttca 601 acacctgcct ggacacctcc cagaaagaag cggttttatt tgcgctgtct cagaaagaac 661 ttgccatcat ccatggacct cctggcactg ggaaaaccac gactgtggtt gagatcattc 721 ttcaagctgt gaaacaaggc ttaaaggttc tgtgctgcgc cccctccaac atcgccgtgg 781 acaatctggt ggagcgcctg gctctgtgta agcagcggat tctgcgcctg ggacaccctg 841 cccgcctcct ggagtccatt cagcagcact ccctggatgc ggttttagcg cggagcgaca 901 gtgcccagat tgttgcagat atcaggaagg acatcgacca ggtctttgtg aaaaacaaaa 961 agacccagga taagagagag aaaagtaatt ttcgaaatga aattaagctg ttaagaaaag 1021 aactgaagga gagggaagaa gcagctatgc tcgagagcct cacttcggca aacgtggtcc 1081 ttgcaacaaa cacaggtgcg tctgccgatg gccccctgaa gttgctgccc gagagctact 1141 tcgacgtggt ggtcattgac gagtgtgccc aggccctcga ggcgagctgc tggatccccc 1201 tgctgaaggc cagaaagtgc atcctggcgg gcgatcacaa gcagctgccc cccaccacag 1261 tctctcacaa ggctgcgctg gcaggactgt cactcagcct gatggaacgc ctggctgagg 1321 agtacggcgc gagggtggtg cggacactga cggtgcagta ccgcatgcac caggctatca 1381 tgcgctgggc ctcagacacc atgtaccttg ggcagctcac agcccactct tccgtggcaa 1441 ggcacctcct gagggacctc ccaggtgtgg ctgccacaga agagacgggt gtgcccctgc 1501 tcttggtgga caccgccggc tgcgggctgt ttgagctgga ggaggaggac gaacagtcga 1561 aagggaaccc tggcgaagtc cgcctcgtca gtttgcacat ccaggctctg gtggacgctg 1621 gtgttccagc ccgtgacatt gctgtggtct cgccatacaa cctccaggtg gacctgctca 1681 gacagagcct tgtgcacagg caccctgagc ttgaaatcaa gtctgtcgat ggcttccaag 1741 gccgagagaa ggaggccgtg atactgtcct tcgtcagatc caacaggaaa ggtgaagttg 1801 gttttcttgc tgaggaccgg aggatcaacg tggctgtcac ccgtgcccga cgccacgtgg 1861 cggtcatctg tgactcccgt actgtcaaca accatgcatt tttgaagacc ctggtggagt 1921 atttcacaca gcatggggaa gtacgcacgg cctttgagta tcttgacgat attgtcccag 1981 aaaactattc ccatgagaac tcccagggtt ccagccacgc tgccaccaag ccccagggac 2041 ctgctacgtc caccaggacc ggaagccagc ggcaggaggg aggccaggag gctgcagcac 2101 ctgccagaca gggccggaag aagccggctg ggaagtctct ggcctctgaa gctccatctc 2161 agcccagcct caacggaggc agcccagagg gagtggagag ccaagatggc gtggaccact 2221 tccgggccat gatagtggag ttcatggcca gcaagaagat gcagttggag tttcctcctt 2281 ccctcaattc ccacgacagg ctgcgggtcc accaaatagc cgaggagcac gggctgaggc 2341 acgacagttc cggggaaggg aagaggaggt tcatcactgt gagcaagagg gccccgcgac 2401 cccgagcagc cctgggaccc ccagcaggga ccggtggccc agcccctctc cagccagtgc 2461 cccctacccc tgcgcagaca gagcagcctc ccagggagca gcgtggccca gaccagcctg 2521 atctgaggac gctgcacctg gagagactgc agagggtcag gagcgcgcag gggcagcccg 2581 ccagcaagga gcagcaggcc tcagggcagc agaaacttcc agaaaagaaa aagaaaaaag 2641 ccaaaggaca tccggccaca gatctgccca cggaggagga ctttgaggcc ctggtttctg 2701 ccgccgttaa ggctgataac acctgcggct ttgccaagtg cacagccggc gtcacaaccc 2761 tgggccagtt ctgccagctc tgcagccgcc gctactgcct cagccaccac ctgcccgaga 2821 tccatggctg cggtgagagg gctcgcgccc atgcccggca gagaatcagc cgggaagggg 2881 tcctctatgc cggcagcggg accaagaacg gatccctgga cccagccaag agggcccagc 2941 tgcagaggag gctggataag aagctgagtg agctcagcaa ccagaggacc agccggagga 3001 aggagagggg gacgtgaccg gccgcatcct tgcacgcccc gcggagctct ctccatggta 3061 gcccagggcg ctggcagacc atgctccgcc tccaccaggg ccacagagga gcggaggggc 3121 ctatggggga ggagcggagg gccctgttgg ggaaggttgg gtttttggac cccagggata 3181 agcttttccg atgtcacaat gtggaggaaa gcacctgggg gacaacagtg ctcgtgcagg 3241 tggggcttgg gaaatgcacg tcccttcccc tcactccccg ccaaaaccca catcccagcc 3301 tctggatcct ggggaaggtt ccagtccctg gagaataccc agggcctcaa acttgaagtc 3361 actcctccaa tgtctgggac ttgccagctc agcccgttag gatgagggtg ctgagaggaa 3421 acaggaaaca agactgcgaa tggtgctcag gcagggagca gggagtggcg tttggcttgc 3481 acgttcccat gtggccagat gctggggcca ctttccttct gtctgctggt gactgcagtg 3541 ttccccctcc tcctcaccac ggggctcctg tgagtctggg gggcacctct ttctggcctg 3601 tgcacctctc tctggcttat aaaggtgcct ggcctgtgcc agcccctcct tgttgcgcct 3661 caccgtgggg accaggtgag ccggctctcc cacgtggttg tcccgggaaa gctgccccac 3721 agcctcagca tcttcagcac ttaccgatcc agagcctccc ggccttctcc ggtgtcctgt 3781 accaactctt ctatttaaga gaacctcaga tgatgtacct gagcctcagg gttttgtttc 3841 agagggatat aaattattta aaaattaaat gaaaacgttg cacactgc // LOCUS HUMDNAJHOM 1438 bp mRNA PRI 25-AUG-1993 DEFINITION Human heat shock protein, E. coli DnaJ homologue mRNA, complete cds. ACCESSION L08069 NID g306713 KEYWORDS DnaJ; heat shock protein; heat shock related protein; homologue. SOURCE Homo sapiens (library: lambda gt11 HUVE) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1438) AUTHORS Chellaiah,A., Davis,A. and Mohanakumar,T. TITLE Cloning of a unique human homologue of the Escherichia coli DNAJ heat shock protein JOURNAL Biochim. Biophys. Acta 1174 (1), 111-113 (1993) MEDLINE 93326629 FEATURES Location/Qualifiers source 1..1438 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_lib="lambda gt11 HUVE" CDS 83..1276 /standard_name="HDJ-2 protein" /note="putative N-linked glycosylation sites: aa. 16, 281" /codon_start=1 /product="DNAJ homologue-2" /db_xref="PID:g306714" /translation="MVKETTYYDVLGVKPNATQEELKKAYRKLALKYHPDKNPNEGEK FKQISQAYEVLSDAKKRELYDKGGEQAIKEGGAGGGFGSPMDIFDMFFGGGGRMQRER RGKNVVHQLSVTLEDLYNGATRKLALQKNVICDKCEGRGGKKGAVECCPNCRGTGMQI RIHQIGPGMVQQIQSVCMECQGHGERISPKDRCKSCNGRKIVREKKILEVHIDKGMKD GQKITFHGEGDQEPGLEPGDIIIVLDQKDHAVFTRRGEDLFMCMDIQLVEALCGFQKP ISTLDNRTIVITSHPGQIVKHGDIKCVLNEGMPIYRRPYEKGRLIIEFKVNFPENGFL SPDKLSLLEKLLPERKEVEETDEMDQVELVDFDPNQERRRHYNGEAYEDDEHHPRGGV QCQTS" polyA_signal 1380..1385 polyA_signal 1417..1422 polyA_site 1438 BASE COUNT 472 a 265 c 361 g 340 t ORIGIN 1 cggtaactac cccggctgcg cacagctcgg cgctccttcc cgctccctca cacaccgcct 61 cagcccgcac cggcagtaga agatggtgaa agaaacaact tactacgatg ttttgggggt 121 caaacccaat gctactcagg aagaattgaa aaaggcttat aggaaactgg ccttgaagta 181 ccatcctgat aagaacccaa atgaaggaga gaagtttaaa cagatttctc aagcttacga 241 agttctctct gatgcaaaga aaagggaatt atatgacaaa ggaggagaac aggcaattaa 301 agagggtgga gcaggtggcg gttttggctc ccccatggac atctttgata tgttttttgg 361 aggaggagga aggatgcaga gagaaaggag aggtaaaaat gttgtacatc agctctcagt 421 aaccctagaa gacttatata atggtgcaac aagaaaactg gctctgcaaa agaatgtgat 481 ttgtgacaaa tgtgaaggta gaggaggtaa gaaaggagca gtagagtgct gtcccaattg 541 ccgaggtact ggaatgcaaa taagaattca tcagatagga cctggaatgg ttcagcaaat 601 tcagtctgtg tgcatggagt gccagggcca tggggagcgg atcagtccta aagatagatg 661 taaaagctgc aacggaagga agatagttcg agagaagaaa attttagaag ttcatattga 721 caaaggcatg aaagatggcc agaagataac attccatggt gaaggagacc aagaaccagg 781 actggagcca ggcgatatta tcattgtgtt agatcagaag gaccatgctg tttttactcg 841 acgaggagaa gaccttttca tgtgtatgga catacagctc gttgaagcac tgtgtggctt 901 ccagaagcca atatctactc ttgacaaccg aaccatcgtc atcacctctc atccaggtca 961 gattgtcaag catggagata tcaagtgtgt actaaatgaa ggcatgccaa tttatcgtag 1021 accatatgaa aagggtcgcc taatcatcga atttaaggta aactttcctg agaatggctt 1081 tctctctcct gataaactgt ctttgctgga aaaactccta cccgagagga aggaagtgga 1141 agagactgat gagatggacc aagtagaact ggtggacttt gatccaaatc aggaaagacg 1201 gcgccactac aatggagaag catatgagga tgatgaacat catcccagag gtggtgttca 1261 gtgtcagacc tcttaatggc cagtgaataa cactcactgc tggcatttaa tgtgcagtag 1321 tgaatgagtg aaggactgta atcataatat gctcactact tgctcttgtt tttgttttaa 1381 taaactatag tagtgttata aaaagttaaa tgaagaataa acgcaaatat aaaagctc // LOCUS HUMDNAPOLC 3443 bp mRNA PRI 31-DEC-1994 DEFINITION Human DNA polymerase delta catalytic subunit mRNA, complete cds. ACCESSION M80397 NID g181619 KEYWORDS DNA polymerase-delta catalytic subunit. SOURCE Homo sapiens liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3443) AUTHORS Chung,D.W., Zhang,J.A., Tan,C.K., Davie,E.W., So,A.G. and Downey,K.M. TITLE Primary structure of the catalytic subunit of human DNA polymerase delta and chromosomal location of the gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88 (24), 11197-11201 (1991) MEDLINE 92107916 FEATURES Location/Qualifiers source 1..3443 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HepG2" /cell_type="hepatocyte" /tissue_type="liver" gene 54..3377 /gene="DNA polymerase delta catalytic subunit" CDS 54..3377 /gene="DNA polymerase delta catalytic subunit" /codon_start=1 /product="DNA polymerase-delta catalytic-subunit" /db_xref="PID:g181620" /translation="MDGKRRPGPGPGVPPKRARGGLWDDDDAPWPSQFEEDLALMEEM EAEHRLQEQEEEELQSVLEGVADGQVPPSAIDPRWLRPTPPALDPQTEPLIFQQLEID HYVGPAQPVPGGPPPSRGSVPVLRAFGVTDEGFSVCCHIHGFAPYFYTPAPPGFGPEH MGDLQRELNLAISRDSRGGRELTGPAVLAVELCSRESMFGYHGHGPSPFLRITVALPR LVAPARRLLEQGIRVAGLGTPSFAPYEANVDFEIRFMVDTDIVGCNWLELPAGKYALR LKEKATQCQLEADVLWSDVVSHPPEGPWQRIAPLRVLSFDIECAGRKGIFPEPERDPV IQICSLGLRWGEPEPFLRLALTLRPCAPILGAKVQSYEKEEDLLQAWSTFIRIMDPDV ITGYNIQNFDLPYLISRAQTLKVQTFPFLGRVAGLCSNIRDSSFQSKQTGRRDTKVVS MVGRVQMDMLQVLLREYKLRSHTLNAVSFHFLGEQKEDVQHSIITDLQNGNDQTRRRL AVYCLKDAYLPLRLLERLMVLVNAVEMARVTGVPLSYLLSRGQQVKVVSQLLRQAMHE GLLMPVVKSEGGEDYTGATVIEPLKGYYDVPIATLDFSSLYPSIMMAHNLCYTTLLRP GTAQKLGLTEDQFIRTPTGDEFVKTSVRKGLLPQILENLLSARKRAKAELAKETDPLR RQVLDGRQLALKVSANSVYGFTGAQVGKLPCLEISQSVTGFGRQMIEKTKQLVESKYT VENGYSTSAKVVYGDTDSVMCRFGVSSVAEAMALGREAADWVSGHFPSPIRLEFEKVY FPYLLISKKRYAGLLFSSRPDAHDRMDCKGLEAVRRDNCPLVANLVTASLRRLLIDRD PEGAVAHAQDVISDLLCNRIDISQLVITKELTRAASDYAGKQAHVELAERMRKRDPGS APSLGDRVPYVIISAAKGVAAYMKSEDPLFVLEHSLPIDTQYYLEQQLAKPLLRIFEP ILGEGRAEAVLLRGDHTRCKTVLTGKVGGLLAFAKRRNCCIGCRTVLSHQGAVCEFCQ PRESELYQKEVSHLNALEERFSRLWTQCQRCQGSLHEDVICTSRDCPIFYMRKKVRKD LEDQEQLLRRFGPPGPEAW" BASE COUNT 621 a 1109 c 1120 g 593 t ORIGIN 1 agtcaggggt cacggcggcg taggctgtgg cgggaaacgc tgtttgaagc gggatggatg 61 gcaagcggcg gccaggccca gggcccgggg tgcccccaaa gcgggcccgt gggggcctct 121 gggatgatga tgatgcacct tggccatccc aattcgagga ggacctggca ctgatggagg 181 agatggaggc agaacacagg ctgcaggagc aggaggagga ggagctgcag tcagtcctgg 241 agggggttgc agacgggcag gtcccaccat cagccataga tcctcgctgg cttcggccca 301 caccaccagc gctggacccc cagacagagc ccctcatctt ccaacagttg gagattgacc 361 attatgtggg cccagcgcag cctgtgcctg gggggccccc accatcccgc ggctccgtgc 421 ctgtgctccg cgccttcggg gtcaccgatg aggggttctc tgtctgctgc cacatccacg 481 gcttcgctcc ctacttctac accccagcgc cccctggttt cgggcccgag cacatgggtg 541 acctgcaacg ggagctgaac ttggccatca gccgggacag tcgcgggggg agggagctga 601 ctgggccggc cgtgctggct gtggaactgt gctcccgaga gagcatgttt gggtaccacg 661 ggcacggccc ctccccgttc ctgcgcatca ccgtggcgct gccgcgcctc gtggccccgg 721 cccgccgtct cctggaacag ggcatccgtg tggcaggcct gggcacgccc agcttcgcgc 781 cctacgaggc caacgtcgac tttgagatcc ggttcatggt ggacacggac atcgtcggct 841 gcaactggct ggagctccca gctgggaaat acgccctgag gctgaaggag aaggctacgc 901 agtgccagct ggaggcggac gtgctgtggt ctgacgtggt cagtcaccca ccggaagggc 961 catggcagcg cattgcgccc ttgcgcgtgc tcagcttcga tatcgagtgc gccggccgca 1021 aaggcatctt ccctgagcct gagcgggacc ctgtcatcca gatctgctcg ctgggcctgc 1081 gctgggggga gccggagccc ttcctacgcc tggcgctcac cctgcggccc tgtgccccca 1141 tcctgggtgc caaggtgcag agctacgaga aggaggagga cctgctgcag gcctggtcca 1201 ccttcatccg tatcatggac cccgacgtga tcaccggtta caacatccag aacttcgacc 1261 ttccgtacct catctctcgg gcccagaccc tcaaggtaca aacattccct ttcctgggcc 1321 gtgtggccgg cctttgctcc aacatccggg actcttcatt ccagtccaag cagacgggcc 1381 ggcgggacac caaggttgtc agcatggtgg gccgcgtgca gatggacatg ctgcaggtgc 1441 tgctgcggga gtacaagctc cgctcccaca cgctcaatgc cgtgagcttc cacttcctgg 1501 gcgagcagaa ggaggacgtg cagcacagca tcatcaccga cctgcagaat gggaacgacc 1561 agacccgccg ccgcctggct gtgtactgcc tgaaggatgc ctacctgcca ctgcggctgc 1621 tggagcggct catggtgctg gtgaacgccg tggagatggc gagggtcact ggcgtgcccc 1681 tcagctacct gctcagtcgt ggccagcagg tcaaagtcgt atcccagctg ttgcggcagg 1741 ccatgcacga ggggctgctg atgcccgtgg tgaagtcaga gggcggcgag gactacacgg 1801 gagccactgt catcgagccc ctcaaagggt actacgacgt ccccatcgcc accctggact 1861 tctcctcgct gtacccgtcc atcatgatgg cccacaacct gtgttacacc acgctccttc 1921 ggcccgggac tgcacagaaa ctgggcctga ctgaggatca gttcatcagg acccccaccg 1981 gggacgagtt tgtgaagacc tcagtgcgga aggggctgct gccccagatc ctggagaacc 2041 tgctcagtgc ccggaagagg gccaaggccg agctggccaa ggagacagac cccctccggc 2101 gccaggtcct ggatggacgg cagctggcgc tgaaggtgag cgccaactcc gtatacggct 2161 tcactggcgc ccaggtgggc aagttgccgt gcctggagat ctcacagagc gtcacggggt 2221 tcggacgtca gatgatcgag aaaaccaagc agctggtgga gtctaagtac acagtggaga 2281 atggctacag caccagtgcc aaggtggtgt atggtgacac tgactccgtc atgtgccgat 2341 tcggcgtgtc ctcggtggct gaggcgatgg ccctggggcg ggaggccgcg gactgggtgt 2401 caggtcactt cccgtcgccc atccggctgg agtttgagaa ggtctacttc ccatacctgc 2461 ttatcagcaa gaagcgctac gcgggcctgc tcttctcctc ccggcccgac gcccacgacc 2521 gcatggactg caagggcctg gaggccgtgc gcagggacaa ctgccccctc gtggccaacc 2581 tggtcactgc ctcactgcgc cgcctgctca tcgaccgaga ccctgagggc gcggtggctc 2641 acgcacagga cgtcatctcg gacctgctgt gcaaccgcat cgatatctcc cagctggtca 2701 tcaccaagga gctgacccgc gcggcctccg actatgccgg caagcaggcc cacgtggagc 2761 tggccgagag gatgaggaag cgggaccccg ggagtgcgcc cagcctgggc gaccgcgtcc 2821 cctacgtgat catcagtgcc gccaagggtg tggccgccta catgaagtcg gaggacccgc 2881 tgttcgtgct ggagcacagc ctgcccattg acacgcagta ctacctggag cagcagctgg 2941 ccaagcccct cctgcgcatc ttcgagccca tcctgggcga gggccgtgcc gaggctgtgc 3001 tactgcgggg ggaccacacg cgctgcaaga cggtgctcac gggcaaggtg ggcggcctcc 3061 tggccttcgc caaacgccgc aactgctgca ttggctgccg cacagtgctc agccaccagg 3121 gagccgtgtg tgagttctgc cagccccggg agtctgagct gtatcagaag gaggtatccc 3181 atctgaatgc cctggaggag cgcttctcgc gcctctggac gcagtgccag cgctgccagg 3241 gcagcctgca cgaggacgtc atctgcacca gccgggactg ccccatcttc tacatgcgca 3301 agaaggtgcg gaaggacctg gaagaccagg agcagctcct gcggcgcttc ggaccccctg 3361 gacctgaggc ctggtgacct tgcaagcatc ccatggggcg ggggcgggac cagggagaat 3421 taataaagtt ctggactttt gct // LOCUS HUMDNM 3601 bp mRNA PRI 13-FEB-1996 DEFINITION Homo sapiens dynamin (DNM) mRNA, complete cds. ACCESSION L36983 NID g1196422 KEYWORDS DNM gene; dynamin. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3601) AUTHORS Diatloff-Zito,C., Gordon,A.J., Duchaud,E. and Merlin,G. TITLE Isolation of an ubiquitously expressed cDNA encoding human dynamin II, a member of the large GTP-binding protein family JOURNAL Gene 163 (2), 301-306 (1995) MEDLINE 96011652 FEATURES Location/Qualifiers source 1..3601 /organism="Homo sapiens" /db_xref="taxon:9606" 5'UTR 1..144 mRNA 1..3601 gene 145..2745 /gene="DNM" CDS 145..2745 /gene="DNM" /codon_start=1 /db_xref="GDB:G00-132-857" /product="dynamin" /db_xref="PID:g1196423" /translation="MGNRGMEELIPLVNKLQDAFSSIGQSCHLDLPQIAVVGGQSAGK SSVLENFVGRDFLPRGSGIVTRRPLILQLIFSKTEHAEFLHCKSKKFTDFDEVRQEIE AETDRVTGTNKGISPVPINLRVYSPHVLNLTLIDLPGITKVPVGDQPPDIEYRVKDMI LQFISRESSLILAVTPANMDLANSDALKLAKEVDPQGLRTIGVITKLDLMDEGTDARD VLENKLLPLRRGYIGVVNRSQKDIEGKKDIRAALAAERKFFLSHPAYRHMADRMGTPH LQKTLNQQLTNHIRESLPALRSKLQSQLLSLEKEVEEYKIFRPDDPTPKTKALLQMVQ QFGVDFEKRIEGSGDQVDTLELSGGARINRIFHERFPFELVKMEFDEKDLRREISYAI KNIHGVRTGLFTPDLAFEAIVKKQVVKLKEPCLKCVDLVIQELINTVRQCTSKLSSYP RLREETERIVTTYIREREGRTKDQILLLIDIEQSYINTNHEDFIGFANAQQRSTQLNK KRAIPNQVIRRGWLTINNISLMKGGSKEYWFVLTAESLSWYKDEEEKEKKYMLPLDNL KIRDVEKGFMSNKHVFAIFNTEQRNVYKDLRQIELACDSQEDVDSWKASFLRAGVYPE KDQAENEDGAQENTFSMDPQLERQVETIRNLVDSYVAIINKSIRDLMPKTIMHLMINN TKAFIHHELLAYLYSSADQSSLMEESADQAQRRDDMLRMYHALKEALNIIGDISTSTV STPVPPPVDDTWLQSASSHSPTPQRRPVSSIHPPGRPPAVRGPTPGPPLIPVPVGAAA SFSAPPIPSRPGPQSVFANSDLFPAPPQIPSRPVRIPPGIPPGVPSRRPPAAPSRPTI IRPAEPSLLD" 3'UTR 2746..3601 polyA_site 3601 BASE COUNT 766 a 1133 c 1075 g 627 t ORIGIN 1 gaccgtgagg ccgagccggg agcgggcgtc ttgccgaggc ccgggcgggc gggagcaacg 61 gctacagacg ccgcggggcc aggtcgttga gggtcggcgg cgggcgagga gcgcagggcg 121 ctcgggccgg gggccgccgg cgccatgggc aaccgcggga tggaagagct gatcccgctg 181 gtcaacaaac tgcaggacgc cttcagctcc atcggccaga gctgccacct ggacctgccg 241 cagatcgctg tagtgggcgg ccagagcgcc ggcaagagct cggtgctgga gaacttcgtg 301 ggccgggact tccttccccg cggttcagga atcgtcaccc ggcggcctct cattctgcag 361 ctcatcttct caaaaacaga acatgccgag tttttgcact gcaagtccaa aaagtttaca 421 gactttgatg aagtccggca ggagattgaa gcagagaccg acagggtcac ggggaccaac 481 aaaggcatct ccccagtgcc catcaacctt cgagtctact cgccacacgt gttgaacttg 541 accctcatcg acctcccggg tatcaccaag gtgcctgtgg gcgaccagcc tccagacatc 601 gagtaccgag tcaaggacat gatcctgcag ttcatcagcc gggagagcag cctcattctg 661 gctgtcacgc ccgccaacat ggacctggcc aactccgacg ccctcaagct ggccaaggaa 721 gtcgatcccc aaggcctacg gaccatcggt gtcatcacca agcttgacct gatggacgag 781 ggcaccgacg ccagggacgt cttggagaac aagttgctcc cgttgagaag aggctacatt 841 ggcgtggtga accgcagcca gaaggatatt gagggcaaga aggacatccg tgcagcactg 901 gcagctgaga ggaagttctt cctctcccac ccggcctacc ggcacatggc cgaccgcatg 961 ggcacgccac atctgcagaa gacgctgaat cagcaactga ccaaccacat ccgggagtcg 1021 ctgccggccc tacgtagcaa actacagagc cagctgctgt ccctggagaa ggaggtggag 1081 gagtacaaga tctttcggcc cgacgacccc acccctaaaa ccaaagccct gctgcagatg 1141 gtccagcagt ttggggtgga ttttgagaag aggatcgagg gctcaggaga tcaggtggac 1201 actctggagc tctccggggg cgcccgaatc aatcgcatct tccacgagcg gttcccattt 1261 gagctggtga agatggagtt tgacgagaag gacttacgac gggagatcag ctatgccatt 1321 aagaacatcc atggagtcag gaccgggctt ttcaccccgg acttggcatt cgaggccatt 1381 gtgaaaaagc aggtcgtcaa gctgaaagag ccctgtctga aatgtgtcga cctggttatc 1441 caggagctaa tcaatacagt taggcagtgt accagtaagc tcagttccta cccccggttg 1501 cgagaggaga cagagcgaat cgtcaccact tacatccggg aacgggaggg gagaacgaag 1561 gaccagattc ttctgctgat cgacattgag cagtcctaca tcaacacgaa ccatgaggac 1621 ttcatcgggt ttgccaatgc ccagcagagg agcacgcagc tgaacaagaa gagagccatc 1681 cccaatcagg tgatccgcag gggctggctg accatcaaca acatcagcct gatgaaaggc 1741 ggctccaagg agtactggtt tgtgctgact gccgagtcac tgtcctggta caaggatgag 1801 gaggagaaag agaagaagta catgctgcct ctggacaacc tcaagatccg tgatgtggag 1861 aagggcttca tgtccaacaa gcacgtcttc gccatcttca acacggagca gagaaacgtc 1921 tacaaggacc tgcggcagat cgagctggcc tgtgactccc aggaagacgt ggacagctgg 1981 aaggcctcgt tcctccgagc tggcgtctac cccgagaagg accaggcaga aaacgaggat 2041 ggggcccagg agaacacctt ctccatggac ccccaactgg agcggcaggt ggagaccatt 2101 cgcaacctgg tggactcata cgtggccatc atcaacaagt ccatccgcga cctcatgcca 2161 aagaccatca tgcacctcat gatcaacaat acgaaggcct tcatccacca cgagctgctg 2221 gcctacctat actcctcggc agaccagagc agcctcatgg aggagtcggc tgaccaggca 2281 cagcggcggg acgacatgct gcgcatgtac catgccctca aggaggcgct caacatcatc 2341 ggtgacatca gcaccagcac tgtgtccacg cctgtacccc cgcctgtcga tgacacctgg 2401 ctccagagcg ccagcagcca cagccccact ccacagcgcc gaccggtgtc cagcatacac 2461 ccccctggcc ggcccccagc agtgaggggc cccactccag ggccccccct gattcctgtt 2521 cccgtggggg cagcagcctc cttctcggcg cccccaatcc catcccggcc tggaccccag 2581 agcgtgtttg ccaacagtga cctcttccca gccccgcctc agatcccatc tcggccagtt 2641 cggatccccc cagggattcc cccaggagtg cccagcagaa gaccccctgc tgcgcccagc 2701 cggcccacca ttatccgccc agccgagcca tccctgctcg actaggcctc gaggggggcg 2761 tgctctcggg ggggcctcac gcacccgcgg cgcaggagct tcagtggtct ggggccctcc 2821 gccgccccta tgctgggacc aggctcccag tgggcagccc tggcctcttg gttaacgctg 2881 gccccggtcc agggccggcc cctgtgcctg gctggacacc gcactgcgca aaggggccct 2941 ggagctccag gcagggggcg ctggggtgtt gcactttggg ggatggagtc tcagggtggc 3001 agagggggga ccagaaccct tgacaccatc ctgaatgagg ggtccagcct gggggggact 3061 ctaccaaggt cttcttgggc tgggaaagcc catgtagggc aggccttcta taagtgcggg 3121 caccaagggc gcctacatcc ccaggccttg ctggggtgca ggggtatatc aacttcccat 3181 tagcaggagc tccccagcgg caagcctggc ccagtgggct cggtagtgcc cagctggcag 3241 gcctgaggtg tacatagtcc ttcccggcca tattaaccac acagcctgag cctggcccag 3301 cctcggctgc cagaggtgcc tttgctaggc ccggagccgt tgcccggcct tgcccttgcc 3361 ctattcctct cctcctcctc ctcctgggtc ccccagggtg gctgggcttg ggctatgtgg 3421 gtggtggtgg cggggggtct tgggggcctc tcagctcccg cccatgcctc cctgatgggt 3481 gggcccaggg cggcctctct ctgaggagac ctcacccact cctcgctcag tttgaccact 3541 gtaagtgcct gcactctgta ttctattaat aaactaaaat aaagggaaga gcgtgctggt 3601 g // LOCUS HUMDNSPOLA 2187 bp mRNA PRI 07-MAR-1994 DEFINITION Homo sapiens DNA polymerase alpha mRNA, complete cds. ACCESSION L24559 NID g439600 KEYWORDS DNA polymerase alpha. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2187) AUTHORS Collins,K.L., Russo,A.A., Tseng,B.Y. and Kelly,T.J. TITLE The role of the 70 kDa subunit of human DNA polymerase alpha in DNA replication JOURNAL EMBO J. 12 (12), 4555-4566 (1993) MEDLINE 94038939 FEATURES Location/Qualifiers source 1..2187 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="F" /tissue_type="cervical epithelium" 5'UTR 1..64 CDS 65..1861 /codon_start=1 /product="DNA polymerase alpha" /db_xref="PID:g439601" /translation="MSASAQQLAEELQIFGLDCEEALIEKLVELCVQYGQNEEGMVGE LIAFCTSTHKVGLTSEILNSFEHEFLSKRLSKARHSTCKDSGHAGARDIVSIQELIEV EEEEEILLNSYTTPSKGSQKRAISTPETPLTKRSVSTRSPHQLLSPSSFSPSATPSQK YNSRSNRGEVVTSFGLAQGVSWSGRGGAGNISLKVLGCPEALTGSYKSMFQKLPDIRE VLTCKIEELGSELKEHYKIEAFTPLLAPAQEPVTLLGQIGCDSNGKLNNKSVILEGDR EHSSGAQIPVDLSELKEYSLFPGQVVIMEGINTTGRKLVATKLYEGVPLPFYQPTEED ADFEQSMVLVACGPYTTSDSITYDPLLDLIAVINHDRPDVCILFGPFLESKHEQVENC LLTSPFEDIFKQCLRTIIEGTRSSGSHLVFVPSLRDVHHEPVYPQPPFSYSDLSREDK KQVQFVSEPCSLSINGVIFGLTSTDLLFHLGAEEISSSSGTSDRFSRILKHILTQRSY YPLYPPQEDMAIDYESFYVYAQLPVTPDVLIIPSELRYFVKDVLGCVCVNPGRLTKGQ VGGTFARLYLRRPAADGAERQSPCIAVQVVRI" 3'UTR 1862..2187 BASE COUNT 546 a 570 c 574 g 497 t ORIGIN 1 gaattccggc ttgggcgcag gtcggagctg ggtgggccgg ctccccggcc tggcttgggc 61 gaccatgtcc gcatccgccc agcagctggc ggaggagctg cagatcttcg gcctagactg 121 cgaggaggct ctaattgaga aattggtaga gctttgtgtt cagtatggac agaatgagga 181 gggaatggta ggcgagctta tagccttctg caccagcaca cataaagttg gccttacctc 241 agagatcctg aactcttttg agcatgagtt tctgagcaaa agattatcga aagccaggca 301 tagtacctgc aaggacagtg gccatgcagg agctagagac attgtttcca ttcaagagct 361 aattgaagtg gaagaagaag aggaaatcct cttgaactct tacaccacac cttcaaaggg 421 ttctcagaag cgagctatct ctaccccaga aaccccccta acaaaaagga gtgtgtcaac 481 tcgtagcccc catcagctac tctcaccgtc aagtttctct ccaagtgcta ctccctccca 541 gaaatacaac tcacgaagta accgaggaga agtggttacc tccttcggct tagcacaggg 601 agtatcttgg tctgggagag gaggagctgg aaacatcagc ctgaaggtct tgggatgtcc 661 agaggcacta actgggagct acaaatccat gtttcagaag ctcccagaca ttcgagaagt 721 tctgacctgt aagatagaag aacttggcag cgaactcaag gaacattaca agattgaagc 781 tttcactcct ttgctagccc cagcacagga gcctgtcact ctgctgggcc agattggctg 841 tgatagcaac gggaagctga acaacaagtc agtgattctc gagggagacc gggaacattc 901 ctcgggtgct caaattccag tggatttatc tgagcttaag gaatattctc tgtttcctgg 961 acaggttgta attatggaag gaatcaacac cactggtagg aaacttgttg ccaccaaact 1021 ctacgagggt gtgccacttc cattttatca gcccactgaa gaggatgcag actttgagca 1081 aagcatggtc ctggttgcct gtggaccata caccacatct gacagcatca cgtatgaccc 1141 cctgcttgac ctgattgctg tcatcaacca tgaccggcca gatgtctgca tcctgtttgg 1201 ccctttcctg gagtctaagc atgaacaggt ggagaattgt ctactgacaa gtccatttga 1261 agacattttc aagcagtgtc tacgaacaat tattgaaggc acaagaagct ccggctccca 1321 ccttgtcttt gtcccgtcat tgagagatgt gcaccatgag cctgtgtacc cccagccgcc 1381 tttcagctac tccgatctgt ctcgagagga caaaaagcaa gtacagtttg tgtccgagcc 1441 ctgcagcctc tccataaacg gagtgatctt cggcttgaca tccacagatc tgcttttcca 1501 cctgggggcc gaggagatca gtagttcttc cggaacttca gacagattca gccgaatact 1561 caagcacatc ttgacccaga ggagctacta cccactctac ccgccccaag aagacatggc 1621 cattgactat gagtcgttct atgtttacgc acagctgcct gtcaccccag atgtcctcat 1681 catcccgtca gagctgaggt acttcgtgaa ggatgtcctc ggctgtgtct gtgtgaaccc 1741 tgggcgcctt accaaagggc aggtgggagg caccttcgcc cgactctacc ttaggaggcc 1801 ggcagcggac ggggcagaga ggcagagccc atgcattgct gtgcaggtcg tcaggatctg 1861 aggcttctgt cctctgctgt tctctgctgt gtgggccctt aaagtcttag ccaagagcca 1921 agacatagcc ctgtgacaag gtgaacagtt gggtgggaaa ggagagagga gccagccagg 1981 gaggggcagc tgcagtgacc aggcccagca ggaggacttg tgcagccggg cctgcctgtg 2041 agtggtgcct ctcctggaag aagctcttgc ttctcagtcc atgctccgtg tccagaagta 2101 agccagctgt ggatcccgcc cactcagaaa aggcgagaag gctttgtgat tttctacatg 2161 aatcaaacac agaaacaccg gaattcc // LOCUS HUMDOC2 1621 bp mRNA PRI 24-SEP-1996 DEFINITION Human mRNA for Doc2 (Double C2), complete cds. ACCESSION D31897 NID g695403 KEYWORDS Doc2. SOURCE Homo sapiens brain (library: H.Suzuki) cDNA to mRNA, clone Doc2. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 50 to 1369) AUTHORS Orita,S., Sasaki,T., Naito,A., Komuro,R., Ohtsuka,T., Maeda,M., Suzuki,H., Igarashi,H. and Takai,Y. TITLE Doc2: a novel brain protein having two repeated C2-like domains JOURNAL Biochem. Biophys. Res. Commun. 206 (2), 439-448 (1995) MEDLINE 95126937 REFERENCE 2 (bases 1 to 1621) AUTHORS Orita,S. JOURNAL Unpublished (1994) REFERENCE 3 (bases 1 to 1621) AUTHORS Orita,S. TITLE Direct Submission JOURNAL Submitted (22-JUN-1994) to the DDBJ/EMBL/GenBank databases. Satoshi Orita, Shionogi Institute for Medical Science; 2-5-1 Mishima, Settsu, Osaka 566, Japan (E-mail:tatanaka@ddbj.nig.ac.jp, Tel:06-382-2612, Fax:06-382-8336) COMMENT Sequence updated(22-Nov-1994)by: Satoshi Orita. FEATURES Location/Qualifiers source 1..1621 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="H.Suzuki" /tissue_type="brain" CDS 113..1315 /codon_start=1 /product="Doc2" /db_xref="PID:d1007267" /db_xref="PID:g1438116" /translation="MRGRRGDRMTINIQEHMAINVCPGPIRPIRQISDYFPRGPGPEG GGGSGGEAPAHLVPLALAPPAALLGATTPEDGAEVDSYDSDDATALGKLEFDLLYDRA SCTLHVCILRAKGLKPMDFNGLADPYVKLHLLPGACKANKLKTKTQRNTLNPVWNEDL TYSGITDDDITHKVLRIAVCDEDKLSHNEFIGEIRVPLRRLKPSQKKHFNICLERQVP LASPSSMSAALRGISCYLKDLEQAEQGQGLLEERGRILLSLSYSSRRRGLLVGILRCA HLAAMDVNGYSDPYVKTYLRPDVDKKSKHKTCVKKKTLNPEFNEEFFYEIELSTLATK TLEVTVWDYDIGKSNDFIGGVSLGPGARGEARKHWSDCLQQRDAALERWHTLTSELPP AAGALSSA" BASE COUNT 322 a 522 c 479 g 298 t ORIGIN 1 tcctgacggc gctggagctg aggggcagtg cggatgcccc aggaaggctc ctaggaagag 61 gggacccacg gtgacttcct aaggaagcgc ggttcccagc caggggtgct gcatgagggg 121 ccgcaggggc gatcgcatga ccatcaacat ccaggagcac atggccatca acgtgtgccc 181 cgggcccatc cggcccatcc gccagatctc tgactacttc ccccggggac caggacctga 241 agggggcggc gggagcggcg gggaggcccc cgcccatctg gtccccctgg ctctggcccc 301 ccctgcagcc ctccttgggg ccaccacgcc tgaggatggt gcggaggtgg acagctatga 361 ctcggatgat gccaccgccc taggcaagct ggagtttgac cttctctacg accgggcctc 421 ctgcactctg cacgtatgca tcctcagggc caagggcctc aagcccatgg atttcaatgg 481 cctcgccgac ccctacgtca agctgcactt gctgcctgga gcctgtaagg ccaataagct 541 aaaaacgaag actcagagga acacactgaa tcccgtgtgg aatgaggacc tgacttacag 601 cgggatcaca gatgacgaca tcacgcacaa ggtgctcagg atcgccgtct gtgatgagga 661 caagctgagt cacaatgagt ttattgggga gatccgcgtg cccctccgcc gcctcaagcc 721 ttcgcagaag aagcatttta acatctgcct cgagcgccaa gtcccgctgg cgtccccctc 781 ttccatgtca gcggcgctga ggggcatctc ctgttatctg aaggacttgg agcaggcgga 841 gcaggggcag gggctgctgg aggagcgtgg ccgcatcctg ctgagtctca gctacagctc 901 gcggcgccgg ggactgctgg taggcatctt gcgctgcgcc catctggctg ccatggacgt 961 caacggttac tcggacccct acgtcaagac gtacctgagg cccgatgtgg acaagaaatc 1021 caagcataag acgtgtgtga agaagaagac tctcaaccca gaatttaacg aggagttttt 1081 ctacgagata gagctctcca ctctggccac caagaccctg gaagtcaccg tctgggacta 1141 tgacattggc aaatccaatg acttcattgg tggcgtgtcc ctggggccag gtgcccgagg 1201 cgaggctcgg aagcactgga gtgactgcct gcagcagcgg gacgcagccc tggagcgctg 1261 gcacaccctg accagtgagc tgccccctgc ggccggggct ctgtcctcag cctgagtgga 1321 cagcagtgtc ccggcacagg cccatcgagc cgggtccagt acccaacctt cgcacgagtg 1381 tgttgcacgt ttacacaggt gggctgcccc accctgcact acctattttg tgagtctcgt 1441 gacccgggtc tgtctgctca tgaggggctg cggagttcta tattcacata tgcaaacctc 1501 ctgcctgact cgctagtccc tgcaaatatg caaacccccc tactactgca cacccgggca 1561 gtgctcagag ccgcccaggc cccgcgctcc tcactcctgc ctctccacgc tgccccgtcc 1621 c // LOCUS HUMDOCK180 6519 bp mRNA PRI 02-APR-1997 DEFINITION Human DOCK180 protein mRNA, complete cds. ACCESSION D50857 NID g1339909 KEYWORDS DOCK180 protein; CRK-binding; ATP-binding. SOURCE Homo sapiens (strain:Japanese) female placenta cDNA to mRNA, clone:C3X. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6519) AUTHORS Matsuda,M. TITLE Direct Submission JOURNAL Submitted (03-JUN-1995) to the DDBJ/EMBL/GenBank databases. Michiyuki Matsuda, National Institute of Health, Department of Pathology; 1-23-1 Toyama, Shinjuku-ku, Tokyo 162, Japan (Tel:03-5285-1111(ex.2625), Fax:03-5285-1150) REFERENCE 2 (bases 1 to 6519) AUTHORS Matsuoka,M. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Hasegawa,H., Kiyokawa,E., Tanaka,S., Nagashima,K., Gotoh,N., Shibuya,M., Kurata,T. and Matsuda,M. TITLE DOCK180, a major CRK-binding protein, alters cell morphology upon translocation to the cell membrane JOURNAL Mol. Cell. Biol. 16 (4), 1770-1776 (1996) MEDLINE 96239533 FEATURES Location/Qualifiers source 1..6519 /organism="Homo sapiens" /strain="Japanese" /db_xref="taxon:9606" /clone="C3X" /sex="female" /tissue_type="placenta" CDS 24..5621 /note="180-kDa protein downstream of CRK; CRK SH3-binding protein" /codon_start=1 /product="DOCK180 protein" /db_xref="PID:d1010096" /db_xref="PID:g1339910" /translation="MTRWVPTKREEKYGVAFYNYDARGADELSLQIGDTVHILETYEG WYRGYTLRKKSKKGIFPASYIHLKEAIVEGKGQHETVIPGDLPLIQEVTTTLREWSTI WRQLYVQDNREMFRSVRHMIYDLIEWRSQILSGTLPQDELKELKKKVTAKIDYGNRIL DLDLVVRDEDGNILDPELTSTISLFRAHEIASKQVEERLQEEKSQKQNIDINRQAKFA ATPSLALFVNLKNVVCKIGEDAEVLMSLYDPVESKFISENYLVRWSSSGLPKDIDRLH NLRAVFTDLGSKDLKREKISFVCQIVRVGRMELRDNNTRKLTSGLRRPFGVAVMDVTD IINGKVDDEDKQHFIPFQPVAGENDFLQTVINKVIAAKEVNHKGQGLWVTLKLLPGDI HQIRKEFPHLVDRTTAVARKTGFPEIIMPGDVRNDIYVTLVQGDFDKGSKTTAKNVEV TVSVYDEDGKRLEHVIFPGAGDEAISEYKSVIYYQVKQPRWFETVKVAIPIEDVNRSH LRFTFRHRSSQDSKDKSEKIFALAFVKLMRYDGTTLRDGEHDLIVYKAEAKKLEDAAT YLSLPSTKAELEEKGHSATGKSMQSLGSCTISKDSFQISTLVCSTKLTQNVDLLGLLK WRSNTSLLQQNLRQLMKVDGGEVVKFLQDTLDALFNIMMENSESETFDTLVFDALVFI IGLIADRKFQHFNPVLETYIKKHFSATLAYTKLTKVLKNYVDGAEKPGVNEQLYKAMK ALESIFKFIVRSRILFNQLYENKGEADFVESLLQLFRSINDMMSSMSDQTVRVKGAAL KYLPTIVNDVKLVFDPKELSKMFTEFILNVPMGLLTIQKLYCLIEIVHSDLFTQHDCR EILLPMMTDQLKYHLERQEDLEACCQLLSHILEVLYRKDVGPTQRHVQIIMEKLLRTV NRTVISMGRDSELIGNFVACMTAILRQMEDYHYAHLIKTFGKMRTDVVDFLMETFIMF KNLIGKNVYPFDWVIMNMVQNKVFLRAINQYADMLNKKFLDQANFELQLWNNYFHLAV AFLTQESLQLENFSSAKRAKILNKYGDMRRQIGFEIRDMWYNLGQHKIKFIPEMVGPI LEMTLIPETELRKATIPIFFDMMQCEFHSTRSFQMFENEIITKLDHEVEGGRGDEQYK VLFDKILLEHCRKHKYLAKTGETFVKLVVRLMERLLDYRTIMHDENKENRMSCTVNVL NFYKEIEREEMYIRYLYKLCDLHKECDNYTEAAYTLLLHAKLLKWSEDVCVAHLTQRD GYQATTQGQLKEQLYQEIIHYFDKGKMWEEAIALGKELAEQYENEMFDYEQLSELLKK QAQFYENIVKVIRPKPDYFAVGYYGQGFPTFLRGKVFIYRGKEYERREDFEARLLTQF PNAEKMKTTSPPGDDIKNSPGQYIQCFTVKPKLDLPPKFHRPVSEQIVSFYRVNEVQR FEYSRPIRKGEKNPDNEFANMWIERTIYTTAYKLPGILRWFEVKSVFMVEISPLENAI ETMQLTNDKINSMVQQHLDDPSLPINPLSMLLNGIVDPAVMGGFANYEKAFFTDRYLQ EHPEAHEKIEKLKDLIAWQIPFLAEGIRIHGDKVTEALRPFHERMEACFKQLKEKVEK EYGVRIMPSSLDDRRGSRPRSMVRSFTMPSSSRPLSVASVSSLSSDSTPSRPGSDGFA LEPLLPKKMHSRSQDKLDKDDLEKEKKDKKKEKRNSKHQEIFEKEFKPTDISLQQSEA VILSETISPLRPQRPKSQVMNVIGSERRFSVSPSSPSSQQTPPPVTPRAKLSFSMQSS LELNGMTGADVADVPPPLPLKGSVADYGNLMENQDLLGSPTPPPPPPHQRHLPPPLPS KTPPPPPPKTTRKQTSVDSGIVQ" BASE COUNT 1856 a 1498 c 1610 g 1555 t ORIGIN 1 gcacgagcgg ctccggcggc gccatgacgc gctgggtgcc caccaagcgc gaggagaagt 61 acggcgtggc tttttataac tatgatgcca gaggagcgga tgaactttct ttacagatcg 121 gagacactgt gcacatctta gaaacatatg aagggtggta ccgaggttac acgttacgaa 181 aaaagtctaa gaagggtata tttcctgctt catatattca tcttaaagaa gcgatagttg 241 aaggaaaagg gcaacatgaa acagtcatcc cgggtgacct ccccctcatc caggaagtca 301 ccacgacact ccgagagtgg tccaccatct ggaggcagct ctacgtgcaa gataacaggg 361 agatgtttcg aagtgtgcgg cacatgatct atgaccttat tgaatggcga tcacaaattc 421 tttctggaac tctgcctcag gatgaactca aagaactgaa gaagaaggtc acagccaaaa 481 ttgattatgg aaacagaatt ctagatttgg acctggtggt tagagatgaa gatgggaata 541 ttttggatcc agaattaact agcacgatta gtctcttcag agctcatgaa atagcttcta 601 aacaagtgga ggaaaggtta caagaggaaa aatctcaaaa gcagaacata gatattaaca 661 gacaagccaa gtttgctgca accccttctc tggccttgtt tgtgaacctc aaaaatgtgg 721 tttgtaaaat aggagaagat gctgaagtcc tcatgtctct atatgaccct gtggagtcca 781 aattcatcag tgagaactac ctggttcgct ggtccagttc aggattacct aaagacatag 841 acagattaca taatttgcga gccgtgttta ctgacctcgg aagcaaagac ctgaaaaggg 901 agaaaatcag ttttgtctgt cagattgttc gcgtgggtcg catggagctg agggacaaca 961 acaccaggaa actgacctcg gggttgcggc gaccttttgg agtggctgtg atggatgtaa 1021 cagatataat aaatggaaaa gtagatgatg aagataagca gcatttcatt ccctttcagc 1081 cggtggcagg ggagaatgac ttccttcaga ctgttataaa caaagtcatc gctgccaaag 1141 aagtcaacca caaggggcag ggtttgtggg taacattgaa attacttcct ggagatatcc 1201 atcagatccg aaaagagttt ccgcatttag tggacaggac cacagctgtg gctcgaaaaa 1261 cagggtttcc ggagataatc atgcctggtg atgttcgaaa tgatatctat gtaacattag 1321 ttcaaggaga ttttgataaa ggaagcaaaa caacagcgaa gaacgtggag gtcacggtgt 1381 ctgtgtacga tgaggatggg aaacgattag agcatgtgat tttcccgggt gctggtgatg 1441 aagcgatttc agagtacaaa tctgtgattt actaccaagt aaagcagcca cgctggtttg 1501 agactgttaa ggtggccatt cccatcgagg acgttaaccg cagtcacctt cggtttacct 1561 tccgccacag gtcatcacag gactctaagg ataaatctga gaaaatattt gcactagcat 1621 ttgtcaagct gatgagatac gatggtacca ccctgcgaga cggagagcac gatcttatcg 1681 tctataaggc cgaagcgaag aagctggaag atgctgccac gtacttgagt ctgccctcca 1741 cgaaggcaga gttggaagaa aagggccact cggccaccgg caagagcatg cagagccttg 1801 ggagctgcac cattagcaag gactccttcc agatctccac gctcgtgtgc tccaccaaac 1861 tgactcagaa cgtggacctt ctggggctct tgaaatggcg ctccaacacc agcctgctgc 1921 agcagaactt gaggcagctg atgaaagtcg atggtggtga agtagtgaag tttcttcagg 1981 acacgttgga tgccctcttc aacatcatga tggagaactc agagagtgag acttttgaca 2041 cgttagtctt tgatgctctg gtatttatca ttggactgat tgctgataga aaatttcagc 2101 attttaatcc tgttttggaa acttacatta agaaacactt tagtgcaacg ttagcctaca 2161 cgaagttgac aaaagtgttg aagaactacg tggacggtgc tgagaagccg ggagtaaatg 2221 agcagctgta caaagccatg aaagcgctag aatccatctt caagttcatc gtgcgctcca 2281 ggatcctgtt caatcaactg tatgaaaaca agggagaggc tgacttcgtg gaatctttgc 2341 tgcagctctt caggtccatc aatgacatga tgagcagcat gtcagaccag accgtccggg 2401 tgaagggggc agcactgaaa tacttaccaa cgatcgtcaa cgatgtgaaa ttggtgtttg 2461 atcccaaaga gctcagcaaa atgtttactg aattcatcct caatgttccc atgggcttgc 2521 tgaccatcca gaaactctac tgcttgatcg aaatcgtcca cagtgacctc ttcacacagc 2581 atgactgcag agagatcctg cttcccatga tgaccgatca gctcaagtac catctggaga 2641 gacaggagga cctggaggcc tgctgtcagc tgctcagcca catcctggag gtgctgtaca 2701 ggaaggacgt ggggccaacc cagaggcacg tccagattat catggagaaa cttctccgga 2761 ccgtgaaccg aaccgtcatt tccatgggac gagattctga actcattgga aacttcgtgg 2821 cttgcatgac agctatttta cgacaaatgg aagattacca ttatgcccac ttgatcaaga 2881 cttttgggaa aatgaggact gatgtggtag atttcctaat ggaaacattc atcatgttta 2941 agaacctcat tggaaagaac gtttacccct tcgactgggt gatcatgaac atggtgcaaa 3001 ataaagtctt cctgcgagca attaatcagt atgcagatat gctgaacaaa aaatttctgg 3061 atcaagccaa ctttgagcta cagctgtgga acaactactt tcacctggct gttgctttcc 3121 ttactcaaga gtccctgcaa ctggagaatt tttcaagtgc caagagagcc aaaatcctta 3181 acaagtacgg agatatgagg agacagattg gctttgaaat cagagacatg tggtacaacc 3241 ttggtcaaca caagataaag ttcattccag aaatggtggg cccaatatta gaaatgacat 3301 taattcccga gacggagctg cgcaaagcca ccatccccat cttctttgat atgatgcagt 3361 gtgaattcca ttcgacccga agcttccaaa tgtttgaaaa tgagatcatc accaagctgg 3421 atcatgaagt cgaaggaggc agaggagacg aacagtacaa agtgttattt gataaaatcc 3481 ttctggaaca ctgcaggaag cacaaatacc tcgccaaaac aggagaaact tttgtaaaac 3541 tcgttgtgcg cttaatggaa aggcttttgg attatagaac catcatgcac gacgagaaca 3601 aagaaaaccg catgagctgc accgtcaatg tgctgaattt ctacaaagaa attgaaagag 3661 aagaaatgta tataaggtat ttgtacaagc tctgtgacct gcacaaggag tgtgataact 3721 acaccgaagc ggcttacacc ttgcttctcc atgcaaagct tcttaagtgg tcggaggatg 3781 tgtgtgtggc ccacctcacc cagcgggacg ggtaccaggc caccacgcag ggacagctga 3841 aggagcagct ctaccaggaa atcatccact acttcgacaa aggcaagatg tgggaggagg 3901 ccattgcctt gggcaaggag ctagccgagc agtatgagaa cgaaatgttt gattatgagc 3961 aactcagcga attgctgaaa aaacaggctc agttttatga aaacatcgtc aaagtgatca 4021 ggcccaagcc tgactatttt gctgttggct actacggaca agggttcccc acattcctgc 4081 ggggaaaagt tttcatttac cgagggaaag agtatgagcg ccgggaagat tttgaggctc 4141 ggctcttaac tcagtttcca aacgccgaga aaatgaagac aacatctcca ccaggcgacg 4201 atattaaaaa ctctcctggc cagtatattc agtgcttcac agtgaagccc aaactcgatc 4261 tgcctcctaa gtttcacagg ccagtgtcag agcagattgt aagtttttac agggtgaacg 4321 aggtccagcg atttgaatat tctcggccaa tccggaaggg agagaaaaac ccagacaatg 4381 aatttgcgaa tatgtggatc gagagaacca tatatacaac tgcatataaa ttacctggaa 4441 ttttaaggtg gtttgaggtc aagtctgttt tcatggtgga aatcagcccc ctggagaatg 4501 ccatcgagac catgcagctg acgaacgaca agatcaacag catggtgcag cagcacctgg 4561 atgaccccag cctgcccatc aacccgctct ccatgctcct gaacggcatc gtggacccag 4621 ctgtcatggg gggcttcgca aactacgaaa aggccttctt tacagaccgg tacctgcagg 4681 agcaccctga ggcccatgaa aagatcgaga agctcaagga cctgattgct tggcagattc 4741 cttttctggc cgaagggatc agaatccatg gagacaaagt cacggaggca ctgaggccgt 4801 tccacgagag gatggaggcc tgtttcaaac agctgaagga aaaggtggag aaagagtacg 4861 gcgtccgaat catgccctca agtctggatg atagaagagg cagccgcccc cggtccatgg 4921 tgcggtcctt cacgatgcct tcctcatccc gccctctgtc tgtggcctct gtctcttccc 4981 tctcatcgga cagcaccccc tccagaccag gctccgacgg gtttgccctg gagcctctcc 5041 tgccaaagaa aatgcactcc aggtcccagg acaagctgga caaggatgac ctggagaagg 5101 agaagaagga caagaagaag gaaaaaagga acagcaaaca tcaagagata tttgagaaag 5161 aatttaaacc caccgacatt tccctgcagc agtctgaggc tgtgatcctt tcggaaacga 5221 taagtcccct gcggccccag agaccgaaga gccaggtgat gaacgtcatt ggaagcgaaa 5281 ggcgcttctc ggtgtccccc tcgtcaccgt cctcccagca aacaccccct ccagttacac 5341 caagagccaa gctcagcttc agcatgcagt cgagcttgga gctgaacggc atgacggggg 5401 cggacgtggc cgatgtccca ccccctctgc ctctcaaagg cagcgtggca gattacggga 5461 atttgatgga aaaccaggac ttgctgggct cgccaacacc tccacctccc cctccacacc 5521 agaggcatct gccacctcca ctgcccagca aaactccgcc tcctccccct ccaaagacaa 5581 ctcgcaagca gacatcggtg gactctggga tcgtgcagtg acatcgcaag gctctctgga 5641 aagagtgtgc tgcccctccc catctccatg ccctctcctt ctgtgtcccc tgagtctgct 5701 gtttacctca ttgggcctgt gatgttaaca tttcgtgcga ctgctttttc ttcaaaggag 5761 ttcagttctc accatggagt gagtggcctt tagcgtcatg gagcaaggtg ggtctgggag 5821 gtagatatgg gtccgggatg tgccatcgta gttaccagag ttgggggcct ctgagtgtgt 5881 ctggctctga gagagtctga gtcttgccca aacattcttt ctttttgtgc caaatgactt 5941 gcatttgcaa agagctcaat tgctctgagc tcagccaagt aggagaggct aggccatcac 6001 tcttgggaag ctgtgtagtg atgatgtata agaatcctcc tcactgtcat gggatgttgt 6061 atccagcccc tccttgttcc agccggtggt gtgacttcgt tggttgaggt gtgtctccaa 6121 cctacatcag accatgaagt tcaacccctc cagggaagct cctgatttcc cctgcataat 6181 tgaaaatagg atattctcag ctattgaaca gttactaatt tatggggtgg aaacagcatt 6241 aagaatactg aatcaaatgg aaaaacaaat gaatacagga agataagtgt tcgttctttt 6301 ctgaaaaaag agtatgtgta ccacaagagc tggttttaat tgggtgaatt gtttttgtcc 6361 tcattctgta cagaaatttg tatatatgat ggttcttaga acttgtttta atttttgtgg 6421 tccttctgtt tattataata ggcgtccacc aatgattatc catatgtgtt cttaattttt 6481 aactgctgga agtgttaaaa cacacacacc ccggaattc // LOCUS HUMDOTR 2010 bp mRNA PRI 05-JUL-1994 DEFINITION Homo sapiens dopamine transporter mRNA, complete cds. ACCESSION L24178 NID g401764 KEYWORDS dopamine transporter. SOURCE Homo sapiens substantia nigra cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2010) AUTHORS Pristupa,Z.B., Wilson,J.M., Hoffman,B.J., Kish,S.J. and Niznik,H.B. TITLE Pharmacological heterogeneity of the cloned and native human dopamine transporter: disassociation of [3H]WIN 35,428 and [3H]GBR 12,935 binding JOURNAL Mol. Pharmacol. 45, 125-135 (1994) MEDLINE 94134051 FEATURES Location/Qualifiers source 1..2010 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="substantia nigra" CDS 20..1882 /codon_start=1 /product="dopamine transporter" /db_xref="PID:g401765" /translation="MSKSKCSVGLMSSVVAPAKEPNAVGPKEVELILVKEQNGVQLTS STLTNPRQSPVEAQDRETWGKKIDFLLSVIGFAVDLANVWRFPYLCYKNGGGAFLVPY LLFMVIAGMPLFYMELALGQFNREGAAGVWKICPILKGVGFTVILISLYVGFFYNVII AWALHYLFSSFTTELPWIHCNNSWNSPNCSDAHPGDSSGDSSGLNDTFGTTPAAEYFE RGVLHLHQSHGIDDLGPPRWQLTACLVLVIVLLYFSLWKGVKTSGKVVWITATMPYVV LTALLLRGVTLPGAIDGIRAYLSVDFYRLCEASVWIDAATQVCFSLGVGFGVLIAFSS YNKFTNNCYRDAIVTTSINSLTSFSSGFVVFSFLGYMAQKHSVPIGDVAKDGPGLIFI IYPEAIATLPLSSAWAVVFFIMLLTLGIDSAMGGMESVITGLIDEFQLLHRHRELFTL FIVLATFLLSLFCVTNGGIYVFTLLDHFAAGTSILFGVLIEAIGVAWFYGVGQFSDDI QQMTGQRPSLYWRLCWKLVSPCFLLFVVVVSIVTFRPPHYGAYIFPDWANALGWVIAT SSMAMVPIYAAYKFCSLPGSFREKLAYAIAPEKDRELVDRGEVRQFTLRHWLKV" BASE COUNT 372 a 646 c 553 g 439 t ORIGIN 1 ctcaactccc agtgtgccca tgagtaagag caaatgctcc gtgggactca tgtcttccgt 61 ggtggccccg gctaaggagc ccaatgccgt gggcccgaag gaggtggagc tcatccttgt 121 caaggagcag aacggagtgc agctcaccag ctccaccctc accaacccgc ggcagagccc 181 cgtggaggcc caggatcggg agacctgggg caagaagatc gactttctcc tgtccgtcat 241 tggctttgct gtggacctgg ccaacgtctg gcggttcccc tacctgtgct acaaaaatgg 301 tggcggtgcc ttcctggtcc cctacctgct cttcatggtc attgctggga tgccactttt 361 ctacatggag ctggccctcg gccagttcaa cagggaaggg gccgctggtg tctggaagat 421 ctgccccata ctgaaaggtg tgggcttcac ggtcatcctc atctcactgt atgtcggctt 481 cttctacaac gtcatcatcg cctgggcgct gcactatctc ttctcctcct tcaccacgga 541 gctcccctgg atccactgca acaactcctg gaacagcccc aactgctcgg atgcccatcc 601 tggtgactcc agtggagaca gctcgggcct caacgacact tttgggacca cacctgctgc 661 cgagtacttt gaacgtggcg tgctgcacct ccaccagagc catggcatcg acgacctggg 721 gcctccgcgg tggcagctca cagcctgcct ggtgctggtc atcgtgctgc tctacttcag 781 cctctggaag ggcgtgaaga cctcagggaa ggtggtatgg atcacagcca ccatgccata 841 cgtggtcctc actgccctgc tcctgcgtgg ggtcaccctc cctggagcca tagacggcat 901 cagagcatac ctgagcgttg acttctaccg gctctgcgag gcgtctgttt ggattgacgc 961 ggccacccag gtgtgcttct ccctgggcgt ggggttcggg gtgctgatcg ccttctccag 1021 ctacaacaag ttcaccaaca actgctacag ggacgcgatt gtcaccacct ccatcaactc 1081 cctgacgagc ttctcctccg gcttcgtcgt cttctccttc ctggggtaca tggcacagaa 1141 gcacagtgtg cccatcgggg acgtggccaa ggacgggcca gggctgatct tcatcatcta 1201 cccggaagcc atcgccacgc tccctctgtc ctcggcctgg gccgtggtct tcttcatcat 1261 gctgctcacc ctgggtatcg acagcgccat gggtggtatg gagtcagtga tcaccgggct 1321 catcgatgag ttccagctgc tgcacagaca ccgtgagctc ttcacgctct tcatcgtcct 1381 ggcgaccttc ctcctgtccc tgttctgcgt caccaacggt ggcatctacg tcttcacgct 1441 cctggaccat tttgcagccg gcacgtccat cctctttgga gtgctcatcg aagccatcgg 1501 agtggcctgg ttctatggtg ttgggcagtt cagcgacgac atccagcaga tgaccgggca 1561 gcggcccagc ctgtactggc ggctgtgctg gaagctggtc agcccctgct ttctcctgtt 1621 cgtggtcgtg gtcagcattg tgaccttcag acccccccac tacggagcct acatcttccc 1681 cgactgggcc aacgcgctgg gctgggtcat cgccacatcc tccatggcca tggtgcccat 1741 ctatgcggcc tacaagttct gcagcctgcc tgggtccttt cgagagaaac tggcctacgc 1801 cattgcaccc gagaaggacc gtgagctggt ggacagaggg gaggtgcgcc agttcacgct 1861 ccgccactgg ctcaaggtgt agagggagca gagacgaaga ccccaggaag tcatcccgca 1921 atgggagaga cacgaacaaa ccaaggaaat ctaagtttcg agagaaagga gggcaacttc 1981 tactcttcaa cctctactga aaacaccccc // LOCUS HUMDP1A 1440 bp mRNA PRI 31-DEC-1994 DEFINITION Homo sapiens E2F-related transcription factor (DP-1) mRNA, complete cds. ACCESSION L23959 NID g414316 KEYWORDS E2F-related transcription factor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1440) AUTHORS Helin,K., Wu,C.L., Fattaey,A.R., Lees,J.A., Dynlacht,B.D., Ngwu,C. and Harlow,E. TITLE Heterodimerization of the transcription factors E2F-1 and DP-1 leads to cooperative trans-activation JOURNAL Genes Dev. 7 (10), 1850-1861 (1993) MEDLINE 94010284 FEATURES Location/Qualifiers source 1..1440 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="Nalm-6 pre-B" gene 38..1270 /gene="DP1" CDS 38..1270 /gene="DP1" /codon_start=1 /product="E2F-related transcription factor" /db_xref="PID:g414317" /translation="MAKDAGLIEANGELKVFIDQNLSPGKGVVSLVAVHPSTVNPLGK QLLPKTFGQSNVNIAQQVVIGTPQRPAASNTLVVGSPHTPSTHFASQNQPSDSSPWSA GKRNRKGEKNGKGLRHFSMKVCEKVQRKGTTSYNEVADELVAEFSAADNHILPNESAY DQKNIRRRVYDALNVLMAMNIISKEKKEIKWIGLPTNSAQECQNLEVERQRRLERIKQ KQSQLQELILQQIAFKNLVQRNRHAEQQASRPPPPNSVIHLPFIIVNTSKKTVIDCSI SNDKFEYLFNFDNTFEIHDDIEVLKRMGMACGLESGSCSAEDLKMARSLVPKALEPYV TEMAQGTVGGVFITTAGSTSNGTRFSASDLTNGADGMLATSSNGSQYSGSRVETPVSY VGEDDEEDDDFNENDEDD" BASE COUNT 399 a 369 c 374 g 298 t ORIGIN 1 ggaattccgt agctattgat ttcccggatc tggtaacatg gcaaaagatg ccggtctaat 61 tgaagccaac ggagaactca aggtcttcat agaccagaac cttagtcccg ggaaaggcgt 121 ggtgtccctc gtggccgttc acccctccac cgtcaacccg ctcgggaagc agctcttgcc 181 aaaaaccttt ggacagtcca atgtcaacat tgcccagcaa gtggtaattg gtacgcctca 241 gagaccggca gcgtcaaaca ccctggtggt aggaagccca cacaccccca gcactcactt 301 tgcctctcag aaccagcctt ccgactcctc accttggtct gccgggaagc gcaacaggaa 361 aggagagaag aatggcaagg gcctacggca tttctccatg aaggtctgcg agaaggtgca 421 gaggaaaggg accacttcct acaacgaagt ggcagacgag ctggttgcgg agttcagtgc 481 tgccgacaac cacatcttac caaacgagtc agcttatgac cagaaaaaca taagacggcg 541 cgtctacgat gccttaaacg tgctaatggc catgaacatc atctccaagg agaagaagga 601 gatcaagtgg attggtctgc ccaccaactc ggctcaggaa tgtcagaact tagaggtgga 661 aagacagagg agacttgaaa gaataaaaca gaaacagtct caacttcaag aacttattct 721 acagcaaatt gccttcaaga acctggtgca gagaaaccgg catgcggagc agcaggccag 781 ccggccaccg ccacccaact cagtcatcca cctgcccttc atcatcgtca acaccagcaa 841 gaagacggtc atcgactgca gcatctccaa tgacaaattt gagtatctgt ttaattttga 901 caacacattt gaaatccacg atgacataga agtgctgaag cggatgggca tggcttgcgg 961 gctggagtcg gggagctgct ctgccgaaga ccttaaaatg gccagaagtc tggtccccaa 1021 ggctctggag ccatacgtga cagaaatggc tcagggaact gttggaggcg tgttcatcac 1081 gacggcaggt tccacgtcta acggcacaag gttctctgcc agtgacctga ccaacggtgc 1141 agatgggatg ctggccacaa gctccaatgg gtctcagtac agcggctcca gggtggagac 1201 tccggtgtcc tacgtcgggg aggacgacga ggaggacgat gacttcaacg agaatgacga 1261 ggacgactga cgtcctcccc acttcagatt cggcttcagg aaaacgttta gcgaaaagaa 1321 actttttttt taatgtgggt tttctgtttc cttttggcct agtcccaaga agatattggt 1381 aagctattga atttagatat gcacctctga taagcaagga ttgtttcccg tagattagga // LOCUS HUMDP2M 1266 bp mRNA PRI 30-JUN-1995 DEFINITION Human DP-2 mRNA, complete cds. ACCESSION L40386 NID g703084 KEYWORDS DP-2 gene. SOURCE Human cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1266) AUTHORS Wu,C.L., Zukerberg,L.R., Ngwu,C., Harlow,E. and Lees,J.A. TITLE In vivo association of E2F and DP family proteins JOURNAL Mol. Cell. Biol. 15 (5), 2536-2546 (1995) MEDLINE 95257935 FEATURES Location/Qualifiers source 1..1266 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="Nalm-6" mRNA 1..1266 /gene="DP-2" gene 1..1266 /gene="DP-2" 5'UTR 1..32 /gene="DP-2" CDS 33..1190 /gene="DP-2" /codon_start=1 /db_xref="PID:g703085" /translation="MIISTPQRLTSSGSVLIGSPYTPAPAMVTQTHIAEATGWVPGDR KRARKFIDSDFSESKRSKKGDKNGKGLRHFSMKVCEKVQRKGTTSYNEVADELVSEFT NSNNHLAADSAYDQKNIRRRVYDALNVLMAMNIISKEKKEIKWIGLPTNSAQECQNLE IEKQRRIERIKQKRAQLQELLLQQIAFKNLVQRNRQNEQQNQGPPALNSTIQLPFIII NTSRKTVIDCSISSDKFEYLFNFDNTFEIHDDIEVLKRMGMSFGLESGKCSLEDLKLA KSLVPKALEGYITDISTGPSWLNQGLLLNSTQSVSNLDLTTGATLPQSSVNQGLCLDA EVALATGQFLAPNSHQSSSAASHCSESRGETPCSFNDEDEEDDEEDSSSPE" 3'UTR 1191..1266 /gene="DP-2" BASE COUNT 445 a 263 c 279 g 279 t ORIGIN 1 gaattccaat aaatgtgaat gttggacccc aaatgattat aagcacacca cagagactaa 61 ccagttcagg aagtgttctg attgggagtc catatacccc tgcaccagca atggttactc 121 agacacacat agcagaagct actggctggg tccctggtga tagaaaacgg gctagaaaat 181 ttatagactc tgatttttca gaaagtaaac gaagcaaaaa aggagataaa aatgggaaag 241 gcttgagaca cttttcaatg aaagtgtgtg agaaagttca acgaaaaggt acaacatcgt 301 acaatgaagt cgctgatgag ctggtgtcag agttcaccaa ttcaaataac catttggctg 361 ctgattcggc ttatgatcag aagaacatta ggcgaagagt ttatgatgct ttaaatgtgc 421 taatggcaat gaacataatt tcaaaggaaa aaaaagaaat caagtggatt ggcctgccta 481 ccaattctgc tcaggaatgt cagaatctgg agatagagaa gcagaggcgg atagaacgga 541 taaagcagaa gcgggcccag ctgcaagaac ttctcctaca gcaaatcgct ttcaaaaacc 601 tggtacagag aaatcgacaa aatgagcagc aaaaccaggg cccgccggct ctgaactcta 661 ccattcagct gccattcata atcatcaata caagcagaaa aacagtcata gattgcagca 721 tctccagtga caagtttgag tatcttttca attttgacaa cacctttgag atccatgatg 781 acatagaagt actaaagcgg atgggaatgt cgtttggcct ggagtcaggc aaatgctctc 841 tggaggatct gaaacttgcg aaatccctgg tgccaaaggc tttagaaggt tatatcacag 901 atatctccac aggaccttct tggttaaatc agggactact tctgaactct acccaatcag 961 tttcaaattt agacctgacc actggtgcca ccttacccca gtcaagtgta aaccaagggt 1021 tatgcttgga tgcagaagtg gccttagcaa ctgggcagtt cctggcccca aacagtcacc 1081 agtccagcag tgcggcctct cactgctccg agtcccgagg cgagaccccc tgttcgttca 1141 atgatgaaga tgaggaagat gatgaggagg attcctcctc cccagaataa agacaagaga 1201 aagcctaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaatctaggg 1261 aattcc // LOCUS HUMDR1TATA 1375 bp mRNA PRI 31-DEC-1994 DEFINITION Human TATA binding protein-associated phosphoprotein (DR1) mRNA, complete cds. ACCESSION M97388 NID g181756 KEYWORDS TATA binding protein-associated phosphoprotein; transcription inhibitor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1375) AUTHORS Inostroza,J.A., Mermelstein,F.H., Ha,I., Lane,W.S. and Reinberg,D. TITLE Dr1, a TATA-binding protein-associated phosphoprotein and inhibitor of class II gene transcription JOURNAL Cell 70 (3), 477-489 (1992) MEDLINE 92354065 FEATURES Location/Qualifiers source 1..1375 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Hela" gene 548..1078 /gene="DR1" CDS 548..1078 /gene="DR1" /codon_start=1 /function="inhibits class II gene transcription" /product="TATA binding protein-associated phosphoprotein" /db_xref="PID:g181757" /translation="MASSSGNDDDLTIPRAAINKMIKETLPNVRVANDARELVVNCCT EFIHLISSEANEICNKSEKKTISPEHVIQALESLGFGSYISEVKEVLQECKTVALKRR KASSRLENLGIPEEELLRQQQELFAKARQQQAELAQQEWLQMQQAAQQAQLAAASASA SNQAGSSQDEEDDDDI" BASE COUNT 359 a 356 c 311 g 349 t ORIGIN 1 tcgaattccg gaagccgctc ccgacaccct ttgcctggct ctgtccatat tagttcccag 61 gcggccgtcg cgttccagca gcggcacgca gcgcaggcgg agcggcagcg gggcctcggc 121 tctatagagc cgagccgctg gtacccgccc ggtaccgcgc gagccagtgc ccctggatct 181 tgcctctgct ccgacgccgt tccccaccag ttagcgacag cgcccgcccc tctgaggaga 241 cacgaaggtg gttccccagc cgctcaaatt tccggaccac cgcgctttcc cctcctcagc 301 ctgggctgtg ctctctctag aatcctcggg cccccacttt cttcccaaac tcatcctaaa 361 tctctcacac acgcgagtgt tcccagccct caagccagct gctcctcctc cgttcatttt 421 ctgcccctct tcgcaaagca cccccgggat catcctccga gggcgacttt ttgagaaatc 481 tcggtggagt agtggaccag agcaggggag tttttaaaag ccggggcgcg agaaacagga 541 aggtactatg gcttcctcgt ctggcaacga tgatgatctc actatcccca gagctgctat 601 caataaaatg atcaaagaga ctcttcctaa tgtccgggtg gccaacgatg ctcgagagct 661 ggtggtgaac tgctgcactg aattcattca ccttatatct tctgaagcca atgagatttg 721 taacaaatcg gaaaagaaga ccatctcacc agagcatgtc atacaagcac tagaaagttt 781 gggatttggc tcttacatca gtgaagtaaa agaagtcttg caagagtgta aaacagtagc 841 attaaaaaga agaaaggcca gttctcgttt ggaaaacctt ggcattcctg aagaagagtt 901 attgagacag caacaagaat tatttgcaaa agctagacag caacaagcag aattggccca 961 acaggaatgg cttcaaatgc agcaagctgc ccaacaagcc cagcttgctg ctgcctcagc 1021 cagtgcatct aatcaggcgg gatcttctca ggatgaagaa gatgatgatg atatctgaaa 1081 ttcaccagct gagtttctat ttcttctata aatgtttttc cctgcacaac aaaaacagtg 1141 aaagaaatgc ttatctgtaa ttttgtatgc atcttggtgg acttgtcatt ggtattctag 1201 agatgtctgc tataagtttc atctgttgtg tgctatacat gtaaaaactg tctctttgaa 1261 ctattgaaaa tttaaggttc agtataatat caattttgaa tttttcctgg tgtttatgaa 1321 attttagata gcagcaagtc ttcgtttgat cataaacagt gtacagataa ctcaa // LOCUS HUMDRA 2881 bp mRNA PRI 31-DEC-1994 DEFINITION Homo sapiens colon mucosa-associated (DRA) mRNA, complete cds. ACCESSION L02785 NID g291963 KEYWORDS colon mucosa-associated protein. SOURCE Homo sapiens colon cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2881) AUTHORS Schweinfest,C.W., Henderson,K.W., Suster,S., Kondoh,N. and Papas,T.S. TITLE Identification of a colon mucosa gene that is down-regulated in colon adenomas and adenocarcinomas JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90 (9), 4166-4170 (1993) MEDLINE 93248250 FEATURES Location/Qualifiers source 1..2881 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="epithelial cell" /tissue_type="colon" gene 185..2479 /gene="DRA" CDS 185..2479 /gene="DRA" /note="Nuclear localization signal at AA 569-573, 576-580, 579-583; acidic transcr. activ. domain 620-640,; homeobox motif 653-676" /codon_start=1 /db_xref="PID:g291964" /translation="MIEPFGNQYIVARPVYSTNAFEENHKKTGRHHKTFLDHLKVCCS CSPQKAKRIVLSLFPIASWLPAYRLKEWLLSDIVSGISTGIVAVLQGLAFALLVDIPP VYGLYASFFPAIIYLFFGTSRHISVGPFPILSMMVGLAVSGAVSKAVPDRNATTLGLP NNSNNSSLLDDERVRVAAAASVTVLSGIIQLAFGILRIGFVVIYLSESLISGFTTAAA VHVLVSQLKFIFQLTVPSHTDPVSIFKVLYSVFSQIEKTNIADLVTALIVLLVVSIVK EINQRFKDKLPVPIPIEFIMTVIAAGVSYGCDFKNRFKVAVVGDMNPGFQPPITPDVE TFQNTVGDCFGIAMVAFAVAFSVASVYSLKYDYPLDGNQELIALGLGNIVCGVFRGFA GSTALSRSAVQESTGGKTQIAGLIGAIIVLIVVLAIGFLLAPLQKSVLAALALGNLKG MLMQFAEIGRLWRKDKYDCLIWIMTFIFTIVLGLGLGLAASVAFQLLTIVFRTQFPKC STLANIGRTNIYKNKKDYYDMYEPEGVKIFRCPSPIYFANIGFFRRKLIDAVGFSPLR ILRKRNKALRKIRKLQKQGLLQVTPKGFICTVDTIKDSDEELDNNQIEVLDQPINTTD LPFHIDWNDDLPLNIEVPKISLHSLILDFSAVSFLDVSSVRGLKSILQEFIRIKVDVY IVGTDDDFIEKLNRYEFFDGEVKSSIFFLTIHDAVLHILMKKDYSTSKFNPSQEKDGK IDFTINTNGGLRNRVYEVPVETKF" BASE COUNT 839 a 578 c 596 g 868 t ORIGIN 1 atccactcag gtctacaggc tcttagaact agaacttaga actttatctt gaaaatgtac 61 cactgttgca gaagctcctc acagagtatg tgtcaggcat ttttaacctg ctaaaggcaa 121 gaagaagtgt tcaccacata gttgcaaagg tcttcaactt gccacagcca acagaaaaat 181 caaaatgatt gaaccctttg ggaatcagta tattgtggcc aggccagtgt attctacaaa 241 tgcttttgag gaaaatcata aaaagacagg aagacatcat aagacatttc tggatcatct 301 caaagtgtgt tgtagctgtt ccccacaaaa ggccaagaga attgtcctct ctttgttccc 361 catagcatct tggttgccag cataccggct taaagaatgg ttgctcagtg atattgtttc 421 tggtatcagc acagggattg tggccgtact acaaggttta gcatttgctc tgctggtcga 481 cattccccca gtctatgggt tgtatgcatc ctttttccca gccataatct accttttctt 541 cggcacttcc agacacatat ccgtgggtcc gtttccgatt ctgagtatga tggtgggact 601 agcagtttca ggagcagttt caaaagcagt cccagatcgc aatgcaacta ctttgggatt 661 gcctaacaac tcgaataatt cttcactact ggatgacgag agggtgaggg tggcggcggc 721 ggcatcagtc acagtgcttt ctggaatcat ccagttggct tttgggattc tgcggattgg 781 atttgtagtg atatacctgt ctgagtccct catcagtggc ttcactactg ctgctgctgt 841 tcatgttttg gtttcccaac tcaaattcat ttttcagttg acagtcccgt cacacactga 901 tccagtttca attttcaaag tactatactc tgtattctca caaatagaga agactaatat 961 tgcagacctg gtgacagctc tgattgtcct tttggttgta tccattgtta aagaaataaa 1021 tcagcgcttc aaagacaaac ttccagtgcc cattccaatc gaattcatta tgaccgtgat 1081 tgcagcaggt gtatcctacg gctgtgactt taaaaacagg tttaaagtgg ctgtggttgg 1141 ggacatgaat cctggatttc agccccctat tacacctgac gtggagactt tccaaaacac 1201 cgtaggagat tgcttcggca tcgcaatggt tgcatttgca gtggcctttt cagttgccag 1261 cgtctattcc ctcaaatacg attatccact tgatggcaat caggagttaa tagccttggg 1321 actgggtaac atagtctgtg gagtattcag aggatttgct gggagtactg ccctctccag 1381 atcagcagtt caggagagca caggaggcaa aacacagatt gctgggctta ttggtgccat 1441 catcgtgctg attgtcgttc tagccattgg atttctcctg gcgcctctac aaaagtccgt 1501 cctggcagct ttagcattgg gaaacttaaa gggaatgctg atgcagtttg ctgaaatagg 1561 cagattgtgg cgaaaggaca aatatgattg tttaatttgg atcatgacct tcatcttcac 1621 cattgtcctg ggactcgggt taggcctggc agctagtgtg gcatttcaac tgctaaccat 1681 cgtgttcagg acccaatttc caaaatgcag cacgctggct aatattggaa gaaccaacat 1741 ctataagaat aaaaaagatt attatgatat gtatgagcca gaaggagtga aaattttcag 1801 atgtccatct cctatctact ttgcaaacat tggtttcttt aggcggaaac ttatcgatgc 1861 tgttggcttt agtccacttc gaattctacg caagcgcaac aaagctttga ggaaaatccg 1921 aaaactgcag aagcaaggct tgctacaagt gacaccaaaa ggatttatat gtactgttga 1981 caccataaaa gattctgacg aagagctgga caacaatcag atagaagtac tggaccagcc 2041 aatcaatacc acagacctgc ctttccacat tgactggaat gatgatcttc ctctcaacat 2101 tgaggtcccc aaaatcagcc tccacagcct cattctcgac ttttcagcag tgtcctttct 2161 tgatgtttct tcagtgaggg gccttaaatc gattttgcaa gaatttatca ggatcaaggt 2221 agatgtgtat atcgttggaa ctgatgatga cttcattgag aagcttaacc ggtatgaatt 2281 ttttgatggt gaagtgaaaa gctcaatatt tttcttaaca atccatgatg ctgttttgca 2341 tattttgatg aagaaagatt acagtacttc aaagtttaat cccagtcagg aaaaagatgg 2401 aaaaattgat tttaccataa atacaaatgg aggattacgt aatcgggtat atgaggtgcc 2461 agttgaaaca aaattctaat caacatataa ttcagaagga tcttcatctg actatgacat 2521 aaaaacaact ttatacccag aaagttattg ataagttcat acattgtacg aagagtattt 2581 ttgacagaat atgtttcaaa ctttggaaca agatggttct agcatggcat atttttcaca 2641 tatctagtat gaaattatat aagtattcta aattttatat cttgtagctt tatcaaaggg 2701 tgaaaattat tttgttcata catatttttg tagcactgac agatttccat cctagtcact 2761 accttcatgc ataggtttag cagtatagtg gcgccactgt tttgaatctc ataatttata 2821 caggtcatat taatatattt ccattaaaaa atcagttgta cagtgaaaaa aaaaaagaaa 2881 a // LOCUS HUMDRAL 1433 bp mRNA PRI 11-JAN-1996 DEFINITION Homo sapiens (clone 35.3) DRAL mRNA, complete cds. ACCESSION L42176 NID g1160931 KEYWORDS LIM domain. SOURCE Homo sapiens (clone: 35.3) neonatal skeletal muscle cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1433) AUTHORS Genini,M., Schwalbe,P., Mattei,M.-G. and Schafer,B.W. TITLE Subtractive cloning of DRAL, a novel LIM-domain protein down-regulated in rhabdomyosarcoma JOURNAL Unpublished (1995) FEATURES Location/Qualifiers source 1..1433 /organism="Homo sapiens" /note="(vector lambda Express)" /db_xref="taxon:9606" /clone="35.3" /cell_type="primary myoblast" /dev_stage="neonatal" /tissue_type="skeletal muscle" gene 139..978 /gene="DRAL" CDS 139..978 /gene="DRAL" /codon_start=1 /db_xref="PID:g1160932" /translation="MTERFDCHHCNESLFGKKYILREESPYCVVCFETLFANTCEECG KPIGCDCKDLSYKDRHWHEACFHCSQCRNSLVDKPFAAKEDQLLCTDCYSNEYSSKCQ ECKKTIMPGTRKMEYKGSSWHETCFICHRCQQPIGTKSFIPKDNQNFCVPCYEKQHAM QCVQCKKPITTGGVTYREQPWHKECFVCTACRKQLSGQRFTARDDFAYCLNCFCDLYA KKCAGCTNPISGLGGTKYISFEERQWHNDCFNCKKCSLSLVGRGFLTERDDILCPDCG KDI" misc_feature 157..231 /gene="DRAL" /note="LIM domain" misc_feature 256..414 /gene="DRAL" /note="LIM domain" misc_feature 439..597 /gene="DRAL" /note="LIM domain" misc_feature 622..774 /gene="DRAL" /note="LIM domain" misc_feature 801..963 /gene="DRAL" /note="c-terminal half of LIM domain" polyA_signal 1414..1419 polyA_site 1433 BASE COUNT 351 a 376 c 351 g 355 t ORIGIN 1 cgcagccacc agccgcccgc gccctccagc cccgtccggg agtccccggc ccgctgcggt 61 gcctggctga gaactgtgtc ttcctggaga ctaggctggc attttgactt tggggttgct 121 gaaaagccag gagtcaaaat gactgagcgc tttgactgcc accattgcaa cgaatctctc 181 tttggcaaga agtacatcct gcgggaggag agcccctact gcgtggtgtg ctttgagacc 241 ctgttcgcca acacctgcga ggagtgtggg aagcccatcg gctgtgactg caaggacttg 301 tcttacaagg accggcactg gcatgaagcc tgtttccact gctcgcagtg cagaaactca 361 ctggtggaca agccctttgc tgccaaggag gaccagctgc tctgtacaga ctgctattcc 421 aacgagtact catccaagtg ccaggaatgc aagaagacca tcatgccagg tacccgcaag 481 atggagtaca agggcagcag ctggcatgag acctgcttca tctgccaccg ctgccagcag 541 ccaattggaa ccaagagttt catccccaaa gacaatcaga atttctgtgt gccctgctat 601 gagaaacaac atgccatgca gtgcgttcag tgcaaaaagc ccatcaccac gggaggggtc 661 acttaccggg agcagccctg gcacaaggag tgcttcgtgt gcaccgcctg caggaagcag 721 ctgtctgggc agcgcttcac agctcgcgat gactttgcct actgcctgaa ctgcttctgt 781 gacttgtatg ccaagaagtg tgctgggtgc accaacccca tcagcggact tggtggcaca 841 aaatacatct cctttgagga acggcagtgg cataacgact gctttaactg taagaagtgc 901 tccctctcac tggtggggcg tggcttcctc acagagaggg acgacatcct gtgccccgac 961 tgtgggaaag acatctgaat tcaacacaga gaagttgctg cttgtgatct cacacacaga 1021 tttttatgtt ttctttctca cccaggcaat cttgccttct ggtttcttcc agccacattg 1081 agactttctt ctagtgcttt tcagtgatac tcacgtttgc ttaaaccctt tagtgctttg 1141 tgatagttca gtcccaggga aagagaaaac tcgccctagg ccctaggtgg gaagatggtt 1201 tgaaattttt gtaatcgagt aaggcacacc caaatgtaaa aatccttttg aatgatgcct 1261 ttataaatct ttctctcact gtctatttaa gtgcaattaa catatgtcac gaacttgaaa 1321 gttttctaaa ctcaataagg taatgaccag ttgttattta cagctctgta acctcccgtt 1381 gcgtcaagtc taaaccaaga ttatgtgact tgcaataaag ttattcagaa cag // LOCUS HUMDRD5A 1673 bp DNA PRI 07-NOV-1994 DEFINITION Human D5 dopamine receptor (DRD5) gene, complete cds. ACCESSION M67439 NID g181830 KEYWORDS G-protein coupled receptor; dopamine D5 receptor; transmembrane protein. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1673) AUTHORS Grandy,D.K., Zhang,Y.A., Bouvier,C., Zhou,Q.Y., Johnson,R.A., Allen,L., Buck,K., Bunzow,J.R., Salon,J. and Civelli,O. TITLE Multiple human D5 dopamine receptor genes: a functional receptor and two pseudogenes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88 (20), 9175-9179 (1991) MEDLINE 92021013 FEATURES Location/Qualifiers source 1..1673 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Unassigned" gene 148..1581 /gene="DRD5" CDS 148..1581 /gene="DRD5" /codon_start=1 /db_xref="GDB:G00-127-548" /product="dopamine receptor D5" /db_xref="PID:g181831" /translation="MLPPGSNGTAYPGQFALYQQLAQGNAVGGSAGAPPLGPSQVVTA CLLTLLIIWTLLGNVLVCAAIVRSRHLRANMTNVFIVSLAVSDLFVALLVMPWKAVAE VAGYWPFGAFCDVWVAFDIMCSTASILNLCVISVDRYWAISRPFRYKRKMTQRMALVM VGLAWTLSILISFIPVQLNWHRDQAASWGGLDLPNNLANWTPWEEDFWEPDVNAENCD SSLNRTYAISSSLISFYIPVAIMIVTYTRIYRIAQVQIRRISSLERAAEHAQSCRSSA ACAPDTSLRASIKKETKVLKTLSVIMGVFVCCWLPFFILNCMVPFCSGHPEGPPAGFP CVSETTFDVFVWFGWANSSLNPVIYAFNADFQKVFAQLLGCSHFCSRTPVETVNISNE LISYNQDIVFHKEIAAAYIHMMPNAVTPGNREVDNDEEEGPFDRMFQIYQTSPDGDPV AESVWELDCEGEISLDKITPFTPNGFH" BASE COUNT 311 a 551 c 471 g 340 t ORIGIN 1 cccggcgcag ctcatggtga gcgcctctgg ggctcgaggg tcccttggct gagggggcgc 61 atcctcgggg tgcccgatgg ggctgcctgg gggtcgcagg gctgaagttg ggatcgcgca 121 caaaccgacc ctgcagtcca gcccgaaatg ctgccgccag gcagcaacgg caccgcgtac 181 ccggggcagt tcgctctata ccagcagctg gcgcagggga acgccgtggg gggctcggcg 241 ggggcaccgc cactggggcc ctcacaggtg gtcaccgcct gcctgctgac cctactcatc 301 atctggaccc tgctgggcaa cgtgctggtg tgcgcagcca tcgtgcggag ccgccacctg 361 cgcgccaaca tgaccaacgt cttcatcgtg tctctggccg tgtctgacct tttcgtggcg 421 ctgctggtca tgccctggaa ggcagtcgcc gaggtggccg gttactggcc ctttggagcg 481 ttctgcgacg tctgggtggc cttcgacatc atgtgctcca ctgcctccat cctgaacctg 541 tgcgtcatca gcgtggaccg ctactgggcc atctccaggc ccttccgcta caagcgcaag 601 atgactcagc gcatggcctt ggtcatggtc ggcctggcat ggaccttgtc catcctcatc 661 tccttcattc cggtccagct caactggcac agggaccagg cggcctcttg gggcgggctg 721 gacctgccaa acaacctggc caactggacg ccctgggagg aggacttttg ggagcccgac 781 gtgaatgcag agaactgtga ctccagcctg aatcgaacct acgccatctc ttcctcgctc 841 atcagcttct acatccccgt tgccatcatg atcgtgacct acacgcgcat ctaccgcatc 901 gcccaggtgc agatccgcag gatttcctcc ctggagaggg ccgcagagca cgcgcagagc 961 tgccggagca gcgcagcctg cgcgcccgac accagcctgc gcgcttccat caagaaggag 1021 accaaggttc tcaagaccct gtcggtgatc atgggggtct tcgtgtgttg ctggctgccc 1081 ttcttcatcc ttaactgcat ggtccctttc tgcagtggac accctgaagg ccctccggcc 1141 ggcttcccct gcgtcagtga gaccaccttc gacgtcttcg tctggttcgg ctgggctaac 1201 tcctcactca accccgtcat ctatgccttc aacgccgact ttcagaaggt gtttgcccag 1261 ctgctggggt gcagccactt ctgctcccgc acgccggtgg agacggtgaa catcagcaat 1321 gagctcatct cctacaacca agacatcgtc ttccacaagg aaatcgcagc tgcctacatc 1381 cacatgatgc ccaacgccgt tacccccggc aaccgggagg tggacaacga cgaggaggag 1441 ggtcctttcg atcgcatgtt ccagatctat cagacgtccc cagatggtga ccctgttgct 1501 gagtctgtct gggagctgga ctgcgagggg gagatttctt tagacaaaat aacacctttc 1561 accccgaatg gattccatta aactgcatta agaaaccccc tcatggatct gcataaccgc 1621 acagacactg acaagcacgc acacacacgc aaatacatgc ctttccagta ctg // LOCUS HUMDSAEC 1584 bp mRNA PRI 13-FEB-1996 DEFINITION Human mRNA for mitochondrial 3-oxoacyl-CoA thiolase, complete cds. ACCESSION D16294 NID g452590 KEYWORDS mitochondrial 3-oxoacyl-CoA thiolase; mitochondrial fatty acid oxidation. SOURCE Homo sapiens liver cDNA to mRNA, clones hT1-[1, 2, 3]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1584) AUTHORS Abe,H., Ohtake,A., Yamamoto,S., Satoh,Y., Takayanagi,M., Amaya,Y., Takiguchi,M., Sakuraba,H., Suzuki,Y., Mori,M. and Niimi,H. TITLE Cloning and sequence analysis of a full length cDNA encoding human mitochondrial 3-oxoacyl-CoA thiolase JOURNAL Biochim. Biophys. Acta 1216 (2), 304-306 (1993) MEDLINE 94060106 REFERENCE 2 (bases 1 to 1584) AUTHORS Ohtake,A. TITLE Direct Submission JOURNAL Submitted (19-MAY-1993) to the DDBJ/EMBL/GenBank databases. Akira Ohtake, The Tokyo Metropolitan institute of Medical Science, Department of Clinical Genetics; 3-18-22 Honkomagome, Bunkyo-ku, Tokyo 113, Japan (E-mail:ohtake@rinshoken.or.jp, Tel:03-3823-2101, Fax:03-3823-6008) COMMENT Submitted (19-May-1993) to DDBJ by: Akira Ohtake Department of Clinical Genetics The Tokyo Metropolitan Institute of Medical Science 3-18-22 Honkomagome Bunkyo-ku Tokyo 113 Janpan Phone: 03-3823-2101 Fax: 03-3823-6008 E-mail: ohtake@rinshoken.or.jp. FEATURES Location/Qualifiers source 1..1584 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" CDS 49..1242 /EC_number="2.3.1.16" /codon_start=1 /product="mitochondrial 3-oxoacyl-CoA thiolase" /db_xref="PID:d1004316" /db_xref="PID:g509676" /translation="MRLLRGVFVVAAKRTPFGAYGGLLKDFTATDLSEFAAKAALSAG KVSPETVDSVIMGNVLQSSSDAIYLARHVGLRVGIPKETPALTINRLCGSGFQSIVNG CQEICVKEAEVVLCGGTESMSQAPYCVRNVRFGTKLGSDIKLEDSLWVSLTDQHVQLP MAMTAENLTVKHKISREECDKYALQSQQRWKAANDAGYFNDEMAPIEVKTKKGKQTMQ VDEHARPQTTLEQLQKLPPVFKKDGTVTAGNASGVADGAGAVIIASEDAVKKHNFTPL ARIVGYFVSGCDPSIMGIGPVPAISGALKKAGLSLKDMDLVEVNEAFAPQYLAVERSL DLDISKTNVNGGAIALGHPLGGSGSRITAHLVHELRRRGGKYAVGSACIGGGQGIAVI IQSTA" mat_peptide 49..1239 /note="Sequence from bp49 to bp96 is a non-cleavable mitochondrial targeting signal." /product="mitochondrial 3-oxoacyl-CoA thiolase" polyA_signal 1564..1569 polyA_site 1584 BASE COUNT 455 a 336 c 389 g 404 t ORIGIN 1 gcgtccccca caccacagac ccgcgccgcc gacgacccag cagccgccat gcgtctgctc 61 cgaggtgtgt ttgtagttgc tgctaagcga acgccctttg gagcttacgg aggccttctg 121 aaagacttca ctgctactga cttgtctgaa tttgctgcca aggctgcctt gtctgctggc 181 aaagtctcac ctgaaacagt tgacagtgtg attatgggca atgtcctgca gagttcttca 241 gatgctatat atttggcaag gcatgttggt ttgcgtgtgg gaatcccaaa ggagacccca 301 gctctcacga ttaataggct ctgtggttct ggttttcagt ccattgtgaa tggatgtcag 361 gaaatttgtg ttaaagaagc tgaagttgtt ttatgtggag gaaccgaaag catgagccaa 421 gctccctact gtgtcagaaa tgtgcgtttt ggaaccaagc ttggatcaga tatcaagctg 481 gaagattctt tatgggtatc attaacagat cagcatgtcc agctccccat ggcaatgact 541 gcagagaatc ttactgtaaa acacaaaata agcagagaag aatgtgacaa atatgccctg 601 cagtcacagc agagatggaa agctgctaat gatgctggct actttaatga tgaaatggca 661 ccaattgaag tgaagacaaa gaaaggaaaa cagacaatgc aggtagacga gcatgctcgg 721 ccccaaacca ccctggaaca gttacagaaa cttcctccag tattcaagaa agatggaact 781 gttactgcag ggaatgcatc gggtgtagct gatggtgctg gagctgttat catagctagt 841 gaagatgctg ttaagaaaca taacttcaca ccactggcaa gaattgtggg ctactttgta 901 tctggatgtg atccctctat catgggtatt ggtcctgtcc ctgctatcag tggggcactg 961 aagaaagcag gactgagtct taaggacatg gatttggtag aggtgaatga agcttttgct 1021 ccccagtact tggctgttga gaggagtttg gatcttgaca taagtaaaac caatgtgaat 1081 ggaggagcca ttgctttggg tcacccactg ggaggatctg gatcaagaat tactgcacac 1141 ctggttcacg aattaaggcg tcgaggtgga aaatatgccg ttggatcagc ttgcattgga 1201 ggtggccaag gtattgctgt catcattcag agcacagcct gaagagacca gtgagctcac 1261 tgtgacccat ccttactcta cttggccagg ccacagtaaa acaagtgacc ttcagagcag 1321 ctgccacaac tggccatgcc ctgccattga aacagtgatt aagtttgatc aagccatggt 1381 gacacaaaaa tgcattgatc atgaatagga gcccatgcta gaagtacatt ctctcagatt 1441 tgaaccagtg aaatatgatg tatttctgag ctaaaactca actatagaag acattaaaag 1501 aaatcgtatt cttgccaagt aaccaccact tctgccttag ataatatgat tataaggaaa 1561 tcaaataaat gttgccttaa cttc // LOCUS HUMDSPHS 861 bp mRNA PRI 11-FEB-1993 DEFINITION Human dual specificity phosphatase tyrosine/serine mRNA, complete cds. ACCESSION L05147 NID g181839 KEYWORDS phosphatase; phosphatase tyrosine/serine. SOURCE Homo sapiens lung cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 861) AUTHORS Ishibashi,T., Bottaro,D.P., Chan,A., Miki,T. and Aaronson,S.A. TITLE Expression cloning of a human dual-specificity phosphatase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89, 12170-12174 (1992) MEDLINE 93101689 FEATURES Location/Qualifiers source 1..861 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="M426" /cell_type="fibroblast" /tissue_type="lung" CDS 29..586 /standard_name="dual specificity phosphatase (tyrosine/serine)" /codon_start=1 /product="phosphatase tyrosine/serine" /db_xref="PID:g181840" /translation="MSGSFELSVQDLNDLLSDGSGCYSLPSQPCNEVTPRIYVGNASV AQDIPKLQKLGITHVLNAAEGRSFMHVNTNANFYKDSGITYLGIKANDTQEFNLSAYF ERAADFIDQALAQKNGRVLVHCREGYSRSPTLVIAYLMMRQKMDVKSALSIVRQNREI GPNDGFLAQLCQLNDRLAKEGKLKP" BASE COUNT 183 a 270 c 244 g 164 t ORIGIN 1 gccgggcgtg cagggccccg ccgccgccat gtcgggctcg ttcgagctct cggtgcagga 61 tctcaacgac ctgctctcgg acggcagcgg ctgctacagc ctcccgagcc agccctgcaa 121 cgaggtcacc ccgcggatct acgtgggcaa cgcgtctgtg gctcaggaca tccccaagct 181 gcagaaacta ggcatcaccc atgtgctgaa cgcggctgag ggcaggtcct tcatgcacgt 241 caacaccaat gccaacttct acaaggactc cggcatcaca tacctgggca tcaaggccaa 301 cgacacacag gagttcaacc tcagcgctta ctttgaaagg gctgccgact tcattgacca 361 ggctttggct caaaagaatg gccgggtgct cgtccactgc cgggaaggtt atagccgctc 421 cccaacgcta gttatcgcct acctcatgat gcggcagaag atggacgtca agtctgccct 481 gagcatcgtg aggcagaacc gtgagatcgg ccccaacgat ggcttcctgg cccagctctg 541 ccagctcaat gacagactag ccaaggaggg gaagttgaaa ccctagggca cccccaccgc 601 ctctgctcga gaggtccgtg ggggaggccg tgggaaaggt gtccgagctg ccatgtttag 661 gaaacacact gtaccctgct cccagcatca caaggcactt gtctacaagt gtgtcccaac 721 acagtcctgg gccactttcc ccaccctggg gagcacataa agaagcttgc caaggggggc 781 gtccttgctc cccagttgtc ctgtttctgt aacttatgat gtcttttccc tgagatgggg 841 gctcagaggg ggaaggcctg t // LOCUS HUMDV 1128 bp mRNA PRI 26-AUG-1993 DEFINITION Human Gal beta1,3(4)GlcNAc alpha2,3-sialyltransferase mRNA, complete cds. ACCESSION L23768 NID g388014 KEYWORDS Gal beta1,3(4)GlcNAc alpha 2,3-sialyltransferase; sialyltransferase. SOURCE Homo sapiens female adult placenta cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1128) AUTHORS Kitagawa,H. and Paulson,J.C. TITLE Cloning and expression of human Gal beta1,3(4)GlcNAc alpha 2, 3-sialyltransferase JOURNAL Biochem. Biophys. Res. Commun. 194, 375-382 (1993) MEDLINE 93326146 FEATURES Location/Qualifiers source 1..1128 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /sex="female" /tissue_type="placenta" CDS 1..1128 /codon_start=1 /product="Gal beta 1,3 (4)GlcNAc alpha 2, 3-sialyltranferase" /db_xref="PID:g388015" /translation="MGLLVFVRNLLLALCLFLVLGFLYYSAWKLHLLQWEEDSNSVVL SFDSAGQTLGSEYDRLGFLLNLDSKLPAELATKYANFSEGACKPGYASALMTAIFPRF SKPAPMFLDDSFRKWARIREFVPPFGIKGQDNLIKAILSVTKEYRLTPALDSLRCRRC IIVGNGGVLANKSLGSRIDDYDIVVRLNSAPVKGFEKDVGSKTTLRITYPEGAMQRPE QYERDSLFVLAGFKWQDFKWLKYIVYKERVSASDGFWKSVATRVPKEPPEIRILNPYF IQEAAFTLIGLPFNNGLMGRGNIPTLGSVAVTMALHGCDEVAVAGFGYDMSTPNAPLH YYETVRMAAIKESWTHNIQREKEFLRKLVKARVITDLSSGI" BASE COUNT 253 a 313 c 296 g 266 t ORIGIN 1 atgggactct tggtatttgt gcgcaatctg ctgctagccc tctgcctctt tctggtactg 61 ggatttttgt attattctgc gtggaagcta cacttactcc agtgggagga ggactccaat 121 tcagtggttc tttcctttga ctccgctgga caaacactag gctcagagta tgatcggttg 181 ggcttcctcc tgaatctgga ctctaaactg cctgctgaat tagccaccaa gtacgcaaac 241 ttttcagagg gagcttgcaa gcctggctat gcttcagcct tgatgacggc catcttcccc 301 cggttctcca agccagcacc catgttcctg gatgactcct ttcgcaagtg ggctagaatc 361 cgggagttcg tgccgccttt tgggatcaaa ggtcaagaca atctgatcaa agccatcttg 421 tcagtcacca aagagtaccg cctgacccct gccttggaca gcctccgctg ccgccgctgc 481 atcatcgtgg gcaatggagg cgttcttgcc aacaagtctc tggggtcacg aattgacgac 541 tatgacattg tggtgagact gaattcagca ccagtgaaag gctttgagaa ggacgtgggc 601 agcaaaacga cactgcgcat cacctacccc gagggcgcca tgcagcggcc tgagcagtac 661 gagcgcgatt ctctctttgt cctcgccggc ttcaagtggc aggactttaa gtggttgaaa 721 tacatcgtct acaaggagag agtgagtgca tcggatggct tctggaaatc tgtggccact 781 cgagtgccca aggagccccc tgagattcga atcctcaacc catatttcat ccaggaggcc 841 gccttcaccc tcattggcct gcccttcaac aatggcctca tgggccgggg gaacatccct 901 acccttggca gtgtggcagt gaccatggca ctacacggct gtgacgaggt ggcagtcgca 961 ggatttggct atgacatgag cacacccaac gcacccctgc actactatga gaccgttcgc 1021 atggcagcca tcaaagagtc ctggacgcac aatatccagc gagagaaaga gtttctgcgg 1081 aagctggtga aagctcgcgt catcactgat ctaagcagtg gcatctga // LOCUS HUME12A 2954 bp mRNA PRI 07-NOV-1994 DEFINITION Human e12 protein (E2A) mRNA, complete cds. ACCESSION M31222 NID g181905 KEYWORDS . SOURCE Human lymphoid B cell, SU-DHL-4 cell line, cDNA to mRNA, clone D13-1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2954) AUTHORS Nourse,J., Mellentin,J.D., Galili,N., Wilkinson,J., Stanbridge,E., Smith,S.D. and Cleary,M.L. TITLE Chromosomal translocation t(1;19) results in synthesis of a homeobox fusion mRNA that codes for a potential chimeric transcription factor JOURNAL Cell 60 (4), 535-545 (1990) MEDLINE 90150281 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.L.Nourse, 18-JAN-1990. FEATURES Location/Qualifiers source 1..2954 /organism="Homo sapiens" /db_xref="taxon:9606" /map="19p13.3" gene 88..2034 /gene="TCF3" CDS 88..2034 /gene="TCF3" /note="e12 protein" /codon_start=1 /db_xref="GDB:G00-118-881" /db_xref="PID:g181906" /translation="MAPVGTDKELSDLLDFSMMFPLPVTNGKGRPASLAGAQFGGSGL EDRPSSGSWGSGDQSSSSFDPSRTFSEGTHFTESHSSLSSSTFLGPGLGGKSGERGAY ASFGRDAGVGGLTQAGFLSGELALNSPGPLSPSGMKGTSQYYPSYSGSSRRRAADGSL DTQPKKVRKVPPGLPSSVYPPSSGEDYGRDATAYPSAKTPSSTYPAPFYVADGSLHPS AELWSPPGQAGFGPMLGGGSSPLPLPPGSGPVGSSGSSSTFGGLHQHERMGYQLHGAE VNGGLPSASSFSSAPGATYGGVSSHTPPVSGADSLLGSRGTTAGSSGDALGKALASIY SPDHSSNNFSSSPSTPVGSPQGLAGTSQWPRAGAPGALSPSYDGGLHGLQSKIEDHLD EAIHVLRSHAVGTAGDMHTLLPGHGALASGFTGPMSLGGRHAGLVGGSHPEDGLAGST SLMHNHAALPSQPGTLPDLSRPPDSYSGLGRAGATAAASEIKREEKEDEENTSAADHS EEEKKELKAPRARTSPDEDEDDLLPPEQKAEREKERRVANNARERLRVRDINEAFKEL GRMCQLHLNSEKPQTKLLILHQAVSVILNLEQQVRERNLNPKAACLKRREEEKVSGVV GDPQMVLSAPHPGLSEAHNPAGHM" BASE COUNT 622 a 965 c 853 g 501 t 13 others ORIGIN Map position 19p13. 1 acgcgccgcg tgcccggccg cgcccagcag ggtttccagg cctgaggtgc ccgccctggc 61 cccaggagaa tgaaccagcc gcagaggatg gcgcctgtgg gcacagacaa ggagctcagt 121 gacctcctgg acttcagcat gatgttcccg ctgcctgtca ccaacgggaa gggccggccc 181 gcctccctgg ccggggcgca gttcggaggt tcaggtcttg aggaccggcc cagctcaggc 241 tcctggggca gcggcgacca gagcagctcc tcctttgacc ccagccggac cttcagcgag 301 ggcacccact tcactgagtc gcacagcagc ctctcttcat ccacattcct gggaccggga 361 ctcggaggca agagcggtga gcggggcgcc tatgcctcct tcgggagaga cgcaggcgtg 421 ggcggcctga ctcaggctgg cttcctgtca ggcgagctgg ccctcaacag ccccgggccc 481 ctgtcccctt cgggcatgaa ggggacctcc cagtactacc cctcctactc cggcagctcc 541 cggcggagag cggcagacgg cagcctagac acgcagccca agaaggtccg gaaggtcccg 601 ccgggtcttc catcctcggt gtacccaccc agctcaggtg aggactacgg cagggatgcc 661 accgcctacc cgtccgccaa gacccccagc agcacctatc ccgccccctt ctacgtggca 721 gatggcagcc tgcacccctc agccgagctc tggagtcccc cgggccaggc gggcttcggg 781 cccatgctgg gtgggggctc atccccgctg cccctcccgc ccggtagcgg cccggtgggc 841 agcagtggaa gcagcagcac gtttggtggc ctgcaccagc acgagcgtat gggctaccag 901 ctgcatggag cagaggtgaa cggtgggctc ccatctgcat cctccttctc ctcagccccc 961 ggagccacgt acggcggcgt ctccagccac acgccgcctg tcagcggggc cgacagcctc 1021 ctgggctccc gagggaccac agctggcagc tccggggatg ccctcggcaa agcactggcc 1081 tcgatctact ccccggatca ctcaagcaat aacttctcgt ccagcccttc tacccccgtg 1141 ggctcccccc agggcctggc aggaacgtca cagtggcctc gagcaggagc ccccggtgcc 1201 ttatcgccca gctacgacgg gggtctccac ggcctgcaga gtaagataga agaccacctg 1261 gacgaggcca tccacgtgct ccgcagccac gccgtgggca cagccggcga catgcacacg 1321 ctgctgcctg gccacggggc gctggcctca ggtttcaccg gccccatgtc actgggcggg 1381 cggcacgcag gcctggttgg aggcagccac cccgaggacg gcctcgcagg cagcaccagc 1441 ctcatgcaca accacgcggc cctccccagc cagccaggca ccctccctga cctgtctcgg 1501 cctcccgact cctacagtgg gctagggcga gcaggtgcca cggcggccgc cagcgagatc 1561 aagcgggagg agaaggagga cgaggagaac acgtcagcgg ctgaccactc ggaggaggag 1621 aagaaggagc tgaaggcccc ccgggcccgg accagcccag acgaggacga ggacgacctt 1681 ctccccccag agcagaaggc cgagcgggag aaggagcgcc gggtggccaa taacgcccgg 1741 gagcggctgc gggtccgtga catcaacgag gcctttaagg agctggggcg catgtgccaa 1801 ctgcacctca acagcgagaa gccccagacc aaactgctca tcctgcacca ggctgtctcg 1861 gtcatcctga acttggagca gcaagtgcga gagcggaacc tgaatcccaa agcagcctgt 1921 ttgaaacggc gagaagagga aaaggtgtca ggtgtggttg gagaccccca gatggtgctt 1981 tcagctcccc acccaggcct gagcgaagcc cacaaccccg ccgggcacat gtgaaaggta 2041 tgcctccgtg ggacgagcca ccccgtttca gccctgtgct ctggccccag aacggccact 2101 cgagaccccg ggattcatcc acatccacac ctcacacacc tgttgtcagc atcgagccaa 2161 caccaacctg acaaggttcg gagtgatggg ggcggccaag gtgacactgg gtccaggagc 2221 tccctgggcc ctggcctacc actcactggc ctcgctcccc ctgtccccga atctcagcca 2281 ccgttgcact ctgtgacctg tcccatggat ccrrrractr ratcttggcc ctgttgcctg 2341 ggctgacagg agrrrrrrrt ttttttccag taaacaaaac ctgaaagcaa gcaacaaaac 2401 atacactttg tcagagaaga aaaaatgcct taactataaa aagcggagaa atggaaacat 2461 atcactcaag ggggatgctg tggaaacctg gcttattctt ctaaagccac cagcaaattg 2521 tgcctaagcg aaatattttt tttaaggaaa ataaaaacat tagttacaag attttttttc 2581 ttaatgtgag atgaaaatta gcaaggatgc tgcctttggt ctctggtttt tttaagcttt 2641 ttttgcatat gttttgtaag caacaaattt ttttgtataa aagtcccgtg tctctcgcta 2701 tttctgctgc tgttcctaga ctgagcattg catttcttga tcaaccagat gattaaacgt 2761 tgtattaaaa aaaaaaagga gaaggagcgc cgggtggcca ataacgcccg ggagcggctg 2821 cgggtccgtg acatcaacga ggccttaagg agctggggcg catgtgccaa ctgcacctca 2881 acagcgagaa gccccagacc aaactgctca tcctgcacca ggctgtctcg gtcatcctga 2941 acttggagca gcaa // LOCUS HUME16GEN 3984 bp mRNA PRI 02-SEP-1992 DEFINITION Human E16 mRNA, complete cds. ACCESSION M80244 NID g181907 KEYWORDS E16 protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3984) AUTHORS Gaugitsch,H.W., Prieschl,E.E., Kalthoff,F., Huber,N.E. and Baumruker,T. TITLE A novel transiently expressed, integral membrane protein linked to cell activation: Molecular cloning via the rapid degradation signal AUUUA JOURNAL J. Biol. Chem. 267, 11267-11273 (1992) MEDLINE 92283834 FEATURES Location/Qualifiers source 1..3984 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="Jurkat T cell" gene 311..1036 /gene="E16" CDS 311..1036 /gene="E16" /codon_start=1 /db_xref="PID:g181908" /translation="MINPYRNLPLAIIISLPIVTLVYVLTNLAYFTTLSTEQMLSSEA VAVDFGNYHLGVMSWIIPVFVGLSCFGSVNGSLFTSSRLFFVGSREGHLPSILSMIHP QLLTPVPSLVFTCVMTLLYAFSKDIFSVINFFSFFNWLCVALAIIGMIWLRHRKPELE RPIKVNLALPVFFILACLFLIAVSFWKTPVECGIGFTIILSGLPVYFFGVWWKNKPKW LLQGIFSTTVLCQKLMQVVPQET" BASE COUNT 692 a 1268 c 1095 g 929 t ORIGIN 1 gtcctttcac gcgtgtcttc gtgttggtgc gcttttcact ggtcataaag tgctgctcac 61 ggccgtgaac tgctacagcg tgaaggccgc cacccgggtc caggatgctt ttgccgccgc 121 caagctcctg gccctggccc tgatcatcct gctgggcttc gtccagatcg ggaagggtga 181 tgtgtccaat ctagatccca agttctcatt tgaaggcacc aaactggatg tggggaacat 241 tgtgctggca ttatacagcg gcctctttgc ctatggagga tggaattact tgaatttcgt 301 cacagaggaa atgatcaacc cctacagaaa cctgcccctg gccatcatca tctccctgcc 361 catcgtgacg ctggtgtacg tgctgaccaa cctggcctac ttcaccaccc tgtccaccga 421 gcagatgctg tcgtccgagg ccgtggccgt ggacttcggg aactatcacc tgggcgtcat 481 gtcctggatc atccccgtct tcgtgggcct gtcctgcttt ggctccgtca atgggtccct 541 gttcacatcc tccaggctct tcttcgtggg gtcccgggaa ggccacctgc cctccatcct 601 ctccatgatc cacccacagc tcctcacccc cgtgccgtcc ctcgtgttca cgtgtgtgat 661 gacgctgctc tacgccttct ccaaggacat cttctccgtc atcaacttct tcagcttctt 721 caactggctc tgcgtggccc tggccatcat cggcatgatc tggctgcgcc acagaaagcc 781 tgagcttgag cggcccatca aggtgaacct ggccctgcct gtgttcttca tcctggcctg 841 cctcttcctg atcgccgtct ccttctggaa gacacccgtg gagtgtggca tcggcttcac 901 catcatcctc agcgggctgc ccgtctactt cttcggggtc tggtggaaaa acaagcccaa 961 gtggctcctc cagggcatct tctccacgac cgtcctgtgt cagaagctca tgcaggtggt 1021 cccccaggag acatagccag gaggccgagt ggctgccgga ggagcatgcg cagaggccag 1081 ttaaagtaga tcacctcctc gaacccactc cggttccccg caacccacag ctcagctgcc 1141 catcccagtc ctcgccgtcc ctcccaggtc gggcagtgga ggctgctgtg aaaactctgg 1201 tacgaatctc atccctcaac tgagggccag ggacccaggt gtgcctgtgc tcctgcccag 1261 gagcagcttt tggtctcctt gggccctttt tcccttccct cctttgttta cttatatata 1321 tatttttttt aaacttaaat tttgggtcaa cttgacacca ctaagatgat tttttaagga 1381 gctgggggaa ggcaggagcc ttcctttctc ctgccccaag ggcccagacc ctgggcaaac 1441 agagctactg agacttggaa cctcattgct accacagact tgcactgaag ccagacagct 1501 gcccagacac atgggcttgt gacattcgtg aaaaccaacc ctgtgggctt atgtctctgc 1561 cttagggttt gcagagtgga aactcagccg tagggtggca ctgggagggg gtgggggatc 1621 tgggcaaggt gggtgattcc tcccaggagg tgcttgaggc cccgatggac tcctgaccat 1681 aatcctagcc ccgagacacc atcctgagcc agggaacagc cccagggttg gggggtgccg 1741 gcatctcccc tagctcacca ggcctggcct ctgggcagtg tggcctcttg gctatttctg 1801 ttccagtttt ggaggctgag ttctggttca tgcagacaaa gccctgtcct tcagtcttct 1861 agaaacagag acaagaaagg cagacacacc gcggccaggc acccatgtgg gcgcccaccc 1921 tgggctccac acagcagtgt cccctgcccc agaggtcgca gctaccctca gcctccaatg 1981 cattggcctc tgtaccgccc ggcagcccct tctggccggt gctgggttcc cactcccggc 2041 ctaggcacct ccccgctctc cctgtcacgc tcatgtcctg tcctggtcct gatgcccgtt 2101 gtctaggaga cagagccaag cactgctcac gtctctgccg cctgcgtttg gaggcccctg 2161 ggctctcacc cagtccccac ccgcctgcag agagggaact agggcacccc ttgtttctgt 2221 tgttcccgtg aatttttttc gctatgggag gcagccgagg cctggccaat gcggcccact 2281 ttcctgagct gtcgctgcct ccatggcagc agccaaggac ccccagaaca agaagacccc 2341 cccgcaggat ccctcctgag ctcggggggc tctgccttct caggccccgg gcttcccttc 2401 tccccagcca gaggtggagc caagtggtcc agcgtcactc cagtgctcag ctgtggctgg 2461 aggagctggc ctgtggcaca gccctgagtg tcccaagccg ggagccaacg aagccggaca 2521 cggcttcact gaccagcggc tgctcaagcc gcaagctctc agcaagtgcc cagtggagcc 2581 tgccgccccc acctgggcac cgggaccccc tcaccatcca gtgggcccgg agaaacctga 2641 tgaacagttt ggggactcag gaccagatgt ccgtctctct tgcttgagga atgaagacct 2701 ttattcaccc ctgccccgtt gcttcccgct gcacatggac agacttcaca gcgtctgctc 2761 ataggacctg catccttcct ggggacgaat tccactcgtc caagggacag cccacggtct 2821 ggaggccgag gaccaccagc aggcaggtgg actgactgtg ttgggcaaga cctcttccct 2881 ctgggcctgt tctcttggct gcaaataagg acagcagctg gtgccccacc tgcctggtgc 2941 attgctgtgt gaatccagga ggcagtggac atcgtaggca gccacggccc caggtccagg 3001 agaagtgctc cctggaggca cggaccactg cttcccactg gggccggcgg ggcccacgca 3061 cgacgtcagc ctcttacctt cccgcctcgg ctaggggtcc tcgggatgcc gttctgttcc 3121 aacctcctgt tctgggaggt ggacatgcct caaggataca gggagccggc ggcctctcga 3181 cggcacgcac ttcctgttgg ctgctgcggc tgtgggcgag catgggggct gccagcgtct 3241 gttgtggaaa gtagctgcta gtgaaatggc tggggccgct ggggtccgtc ttcacactgc 3301 gcaggtctct tctgggcgtc tgagctgggg tgggagctcc tccgcagaag gttggtgggg 3361 ggtccagtct gtgatccttg gtgctgtgtg ccccactcca gcctggggac cccacttcag 3421 aaggtagggg ccgtgtcccg cggtgctgac tgaggcctgc ttccccctcc ccctcctgct 3481 gtgctggaat tccacaggga ccagggccac cgcaggggac tgtctcagaa gacttgattt 3541 ttccgtccct ttttctccac actccactga caaacgtccc cagcggtttc cacttgtggg 3601 cttcaggtgt tttcaagcac aacccaccac aacaagcaag tgcattttca gtcgttgtgc 3661 ttttttgttt tgtgctaacg tcttactaat ttaaagatgc tgtcggcacc atgtttattt 3721 atttccagtg gtcatgctca gccttgctgc tctgcgtggc gcaggtgcca tgcctgctcc 3781 ctgtctgtgt cccagccacg cagggccatc cactgtgacg tcggccgacc aggctggaca 3841 ccctctgccg agtaatgacg tgtgtggctg ggaccttctt tattctgtgt taatggctaa 3901 cctgttacac tgggctgggt tgggtagggt gttctggctt ttttgtgggg tttttatttt 3961 taaagaaaca ctcaatcatc ctag // LOCUS HUME1URP 3301 bp mRNA PRI 14-SEP-1995 DEFINITION Homo sapiens ubiquitin-activating enzyme E1 related protein mRNA, complete cds. ACCESSION L13852 NID g520832 KEYWORDS homologue; ubiquitin activating enzyme; ubiquitin activating enzyme E1. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3301) AUTHORS Kok,K., Hofstra,R., Pilz,A., van den Berg,A., Terpstra,P., Buys,C.H. and Carritt,B. TITLE A gene in the chromosomal region 3p21 with greatly reduced expression in lung cancer is similar to the gene for ubiquitin-activating enzyme JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90 (13), 6071-6075 (1993) MEDLINE 93317626 FEATURES Location/Qualifiers source 1..3301 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="B-cell precursor" /map="3p21" 5'UTR 1..164 /gene="UBE1L" gene 1..3301 /gene="UBE1L" CDS 165..3200 /gene="UBE1L" /codon_start=1 /product="ubiquitin-activating enzyme E1-related protein" /db_xref="PID:g986881" /translation="MDALDASKLLDEELYSRQLYVLGSPAMQRIQGARVLVSGLQGLG AEVAKNLVLMGVGSLTLHDPHPTCWSDLAAQFLLSEQDLERSRAEASQELLAQLNRAV QVVVHTGDITEDLLLDFQVVVLTAAKLEEQLKVGTLCHKHGVCFLAADTRGLVGQLFC DFGEDFTVQDPTEAEPLTAAIQHISQGSPGILTLRKGANTHYFRDGDLVTFSGIEGMV ELNDCDPRSIHVREDGSLEIGDTTTFSRYLRGGAITEVKRPKTVRHKSLDTALLQPHV VAQSSQEVHHAHCLHQAFCALHKFQHLHGRPPQPWDPVDAETVVGLARDLEPLKRTEE EPLEEPLDEALVRTVALSSARCLEPMVACWVSSCPGSAEGNLQKFMPLDQWLYFDALD CLPEDGELLPSPEDCALRGSRYDGQIAVFGAGFQEKLRRQHYLLVGAGAIGCELLKVF ALVGLGAGNSGGLTVVDMDHIERSNLSRQFLFRSQDVGRPKAEVAAAAARGLNPDLQV IPLTYPLDPTTEHIYGDNFFSRVDGVAAALDSFQARRYVAARCTHYLKPLLEAGTSGT WGSATVFMPHVTEAYRAPASAAASEDAPYPVCTVRYFPSTAEHTLQWARHEFEELFRL SAETINHHQQAHTSLADMDEPQTLTLLKPVLGVLRVRPQNWQDCVAWALGHWKLCFHY GIKQLLRHFPPNKVLEDGTPFWSGPKQCPQPLEFDTNQDTHLLYVLAAANLYAQMHGL PGSQDWTALRELLKLLPQPDPQQMAPIFASNLELASASAEFGPEQQKELNKALEVWSV GPPLKPLMFEKDDDSNFHVDFVVAAASLRCQNYGIPPVNRAQSKRIVGQIIPAIATTT AAVAGLLGLELYKVVSGPRPRSAFRHSYLHLAENYLIRYMPFAPAIQTFHHLKWTSWD RLKVPAGQPERTLESLLAHLQEQHGLRVRILLHGSALLYAAGWSPEKQAQHLPLRVTE LVQQLTGQAPAPGQRVLVLELSCEGDDEDTAFPPLHYEL" 3'UTR 3201..3301 /gene="UBE1L" polyA_signal 3279..3284 /gene="UBE1L" BASE COUNT 679 a 963 c 960 g 699 t ORIGIN 1 aggagccagg aagagagctg tgaccagcag cgtcccttat tcgcttggcc ttggttcctg 61 tttgcactgg ctacagcagg gcactggccc ctactgtcac cgccacctac acaaagaccc 121 tatctctgag cgctgcagcc tactgttcag ccccaggttt gaggatggat gccctggacg 181 cttcgaagct actggatgag gagctgtatt caagacagct gtatgtgctg ggctcacctg 241 ccatgcagag gattcaggga gccagggtcc tggtgtcagg cctgcagggc ctgggggccg 301 aggtggccaa gaacttggtt ctgatgggtg tgggcagcct cactctgcat gatccccacc 361 ccacctgctg gtccgacctg gctgcccagt ttctcctctc agagcaggac ttggaaagga 421 gcagagccga ggcctctcaa gagctcttgg ctcagctcaa cagagctgtc caggtcgtcg 481 tgcacacggg tgacatcact gaggacctgc tgttggactt ccaggtggtg gtgctgactg 541 ctgcaaagct ggaggagcag ctgaaggtgg gcaccttgtg tcataagcat ggagtttgct 601 ttctggcggc tgacacccgg ggcctcgtgg ggcagttgtt ctgtgacttt ggtgaggact 661 tcactgtgca ggaccccaca gaggcagaac ccctgacagc tgccatccag cacatctccc 721 agggctcccc tggcattctc actctgagga aaggggccaa tacccactac ttccgtgatg 781 gagacttggt gactttctcg ggaattgagg gaatggttga gctcaacgac tgtgatcccc 841 ggtctatcca cgtgcgggag gatgggtccc tggagattgg agacacaaca actttctctc 901 ggtacttgcg tggtggggct atcactgaag tcaagagacc caagactgtg agacataagt 961 ccctggacac agccctgctc cagccccatg tggtggccca gagctcccag gaagttcacc 1021 atgcccactg cctgcatcag gccttctgtg cactgcacaa gttccagcac ctccatggcc 1081 ggccacccca gccctgggat cctgttgatg cagagactgt ggtgggcctg gcccgggacc 1141 tggaaccact gaagcggaca gaggaagagc cactggaaga gccactggat gaggccctag 1201 tgcggacagt cgccctaagc agtgcaaggt gtcttgagcc tatggtggca tgctgggtca 1261 gtagctgccc aggaagtgct gaaggcaatc tccagaagtt catgcctctg gaccagtggc 1321 tttactttga tgccctcgat tgtcttccgg aagatgggga gctccttccc agtcctgagg 1381 actgtgccct gagaggcagc cgctatgatg ggcaaattgc agtgtttggg gctggttttc 1441 aggagaaact gagacgccag cactacctcc tggtgggcgc tggtgccatt ggttgtgagc 1501 tgctcaaagt ctttgcccta gtgggactgg gggccgggaa cagcgggggc ttgactgttg 1561 ttgacatgga ccacatagag cgctccaatc tcagccgtca gttcctcttc aggtcccagg 1621 acgttggtag acccaaggca gaggtggctg cagcagctgc ccggggcctg aacccagact 1681 tacaggtgat cccgctcacc tacccactgg atcccaccac agagcacatc tatggggata 1741 actttttctc ccgtgtggat ggtgtggctg ctgccctgga cagtttccag gcccggcgct 1801 atgtggctgc tcgttgcacc cactatctga agccactgct ggaggcaggc acatcgggca 1861 cctggggcag tgctacagta ttcatgccac atgtgactga ggcctacaga gcccctgcct 1921 cagctgcagc ttctgaggat gccccctacc ctgtctgtac cgtgcggtac ttccctagca 1981 cagccgagca caccctgcag tgggcccggc atgagtttga agaactcttc cgactgtctg 2041 cagagaccat caaccaccac caacaggcac acacctccct ggcagacatg gatgagccac 2101 agacactcac cttactgaag ccagtgcttg gggtcctgag agtgcgtcca cagaactggc 2161 aagactgtgt ggcgtgggct cttggccact ggaaactctg ctttcattat ggcatcaaac 2221 agctgctgag gcacttccca cctaataaag tgcttgagga tggaactccc ttctggtcag 2281 gtcccaaaca gtgtccccag cccttggagt ttgacaccaa ccaagacaca cacctcctct 2341 acgtactggc agctgccaac ctgtatgccc agatgcatgg gctgcctggc tcacaggact 2401 ggactgcact cagggagctg ctgaagctgc tgccacagcc tgacccccaa cagatggccc 2461 ccatctttgc tagtaatcta gagctggctt cggcttctgc tgagtttggc cctgagcagc 2521 agaaggaact gaacaaagcc ctggaagtct ggagtgtggg ccctcccctg aagcctctga 2581 tgtttgagaa ggatgatgac agcaacttcc atgtggactt tgtggtagcg gcagctagcc 2641 tgagatgtca gaactacggg attccaccgg tcaaccgtgc ccagagcaag cgaattgtgg 2701 gccagattat cccagccatt gccaccacta cagcagctgt ggcaggcctg ttgggcctgg 2761 agctgtataa ggtggtgagt gggccacggc ctcgtagtgc ctttcgccac agctacctac 2821 atctggctga aaactacctc atccgctata tgccttttgc cccagccatc cagacgttcc 2881 atcacctgaa gtggacctct tgggaccgtc tgaaggtacc agctgggcag cctgagagga 2941 ccctggagtc gctgctggct catcttcagg agcagcacgg gttgagggtg aggatcctgc 3001 tgcacggctc agccctgctc tatgcggccg gatggtcacc tgaaaagcag gcccagcacc 3061 tgcccctcag ggtgacagaa ctggttcagc agctgacagg ccaggcacct gctcctgggc 3121 agcgggtgtt ggtgctagag ctgagctgtg agggtgacga cgaggacact gccttcccac 3181 ctctgcacta tgagctgtga caaggcagcc accctgtcac ctagctcaat ggagccccgg 3241 atcccaagcc ctgcattgta agcccacagt aggcactcaa taattgcttg ttaaaggaag 3301 g // LOCUS HUME2EPI 890 bp mRNA PRI 31-DEC-1994 DEFINITION Human ubiquitin carrier protein (E2-EPF) mRNA, complete cds. ACCESSION M91670 NID g181915 KEYWORDS ubiquitin carrier protein. SOURCE Homo sapiens (tissue library: lambda gt11, Clontech) neonatal foreskin cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 890) AUTHORS Liu,Z., Diaz,L.A., Haas,A.L. and Giudice,G.J. TITLE cDNA cloning of a novel human ubiquitin carrier protein. An antigenic domain specifically recognized by endemic pemphigus foliaceus autoantibodies is encoded in a secondary reading frame of this human epidermal transcript JOURNAL J. Biol. Chem. 267 (22), 15829-15835 (1992) MEDLINE 92348449 FEATURES Location/Qualifiers source 1..890 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="keratinocyte" /dev_stage="neonatal" /tissue_type="foreskin" /tissue_lib="lambda gt11, Clontech" gene 60..878 /gene="E2-EPF" CDS 60..737 /gene="E2-EPF" /codon_start=1 /product="ubiquitin carrier protein" /db_xref="PID:g181916" /translation="MNSNVENLPPHIIRLVYKEVTTLTADPPDGIKVFPNEEDLTDLQ VTIEGPEGTPYAGGLFRMKLLLGKDFPASPPKGYFLTKIFHPNVGANGEICVNVLKRD WTAELGIRHVLLTIKCLLIHPNPESALNEEAGRLLLENYEEYAARARLLTEIHGGAGG PSGRAEAGRALASGTEASSTDPGAPGGPGGAEGPMAKKHAGERDKKLAAKKKTDKKRA LRALRRL" polyA_signal 873..878 /gene="E2-EPF" BASE COUNT 182 a 284 c 281 g 143 t ORIGIN 1 ggcggaccga agaacgcagg aagggggccg gggggacccg cccccggccg gccgcagcca 61 tgaactccaa cgtggagaac ctacccccgc acatcatccg cctggtgtac aaggaggtga 121 cgacactgac cgcagaccca cccgatggca tcaaggtctt tcccaacgag gaggacctca 181 ccgacctcca ggtcaccatc gagggccctg aggggacccc atatgctgga ggtctgttcc 241 gcatgaaact cctgctgggg aaggacttcc ctgcctcccc acccaagggc tacttcctga 301 ccaagatctt ccacccgaac gtgggcgcca atggcgagat ctgcgtcaac gtgctcaaga 361 gggactggac ggctgagctg ggcatccgac acgtactgct gaccatcaag tgcctgctga 421 tccaccctaa ccccgagtct gcactcaacg aggaggcggg ccgcctgctc ttggagaact 481 acgaggagta tgcggctcgg gcccgtctgc tcacagagat ccacgggggc gccggcgggc 541 ccagcggcag ggccgaagcc ggtcgggccc tggccagtgg cactgaagct tcctccaccg 601 accctggggc cccagggggc ccgggagggg ctgagggtcc catggccaag aagcatgctg 661 gcgagcgcga taagaagctg gcggccaaga aaaagacgga caagaagcgg gcgctgcggg 721 cgctgcggcg gctgtagtgg gctctcttcc tccttccacc gtgaccccaa cctctcctgt 781 cccctccctc caactctgtc tctaagttat ttaaattatg gctggggtcg gggagggtac 841 agggggcact gggacctgga tttgtttttc taaataaagt tggaaaagca // LOCUS HUME2F 2517 bp mRNA PRI 10-AUG-1992 DEFINITION Homo sapiens (E2F-1) pRB-binding protein mRNA, complete cds. ACCESSION M96577 NID g181917 KEYWORDS DNA-binding protein; pRB-binding protein; transcription factor E2F. SOURCE Homo sapiens fetal brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2517) AUTHORS Helin,K., Lees,J.A., Vidal,M., Dyson,N.J., Harlow,E. and Fattaey,A. TITLE A cDNA encoding a pRB-binding protein with properties of the transcription factor E2F JOURNAL Cell 70, 337-350 (1992) MEDLINE 92346720 FEATURES Location/Qualifiers source 1..2517 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Nalm 6" /cell_type="pre B-cells" /dev_stage="fetal" /tissue_type="brain" CDS 136..1449 /note="pRB-binding protein" /codon_start=1 /product="E2F-1" /db_xref="PID:g181918" /translation="MALAGAPAGGPCAPALEALLGAGALRLLDSSQIVIISAAQDASA PPAPTGPAAPAAGPCDPDLLLFATPQAPRPTPSAPRPALGRPPVKRRLDLETDHQYLA ESSGPARGRGRHPGKGVKSPGEKSRYETSLNLTTKRFLELLSHSADGVVDLNWAAEVL KVQKRRIYDITNVLEGIQLIAKKSKNHIQWLGSHTTVGVGGRLEGLTQDLRQLQESEQ QLDHLMNICTTQLRLLSEDTDSQRLAYVTCQDLRSIADPAEQMVMVIKAPPETQLQAV DSSENFQISLKSKQGPIDVFLCPEETVGGISPGKTPSQEVTSEEENRATDSATIVSPP PSSPPSSLTTDPSQSLLSLEQEPLLSRMGSLRAPVDEDRLSPLVAADSLLEHVREDFS GLLPEEFISLSPPHEALDYHFGLEEGEGIRDLFDCDFGDLTPLDF" BASE COUNT 454 a 784 c 781 g 498 t ORIGIN 1 ggaattccgt ggccgggact ttgcaggcag cggcggccgg gggcggagcg ggatcgagcc 61 ctcgccgagg cctgccgcca tgggcccgcg ccgccgccgc cgcctgtcac ccgggccgcg 121 cgggccgtga gcgtcatggc cttggccggg gcccctgcgg gcggcccatg cgcgccggcg 181 ctggaggccc tgctcggggc cggcgcgctg cggctgctcg actcctcgca gatcgtcatc 241 atctccgccg cgcaggacgc cagcgccccg ccggctccca ccggccccgc ggcgcccgcc 301 gccggcccct gcgaccctga cctgctgctc ttcgccacac cgcaggcgcc ccggcccaca 361 cccagtgcgc cgcggcccgc gctcggccgc ccgccggtga agcggaggct ggacctggaa 421 actgaccatc agtacctggc cgagagcagt gggccagctc ggggcagagg ccgccatcca 481 ggaaaaggtg tgaaatcccc gggggagaag tcacgctatg agacctcact gaatctgacc 541 accaagcgct tcctggagct gctgagccac tcggctgacg gtgtcgtcga cctgaactgg 601 gctgccgagg tgctgaaggt gcagaagcgg cgcatctatg acatcaccaa cgtccttgag 661 ggcatccagc tcattgccaa gaagtccaag aaccacatcc agtggctggg cagccacacc 721 acagtgggcg tcggcggacg gcttgagggg ttgacccagg acctccgaca gctgcaggag 781 agcgagcagc agctggacca cctgatgaat atctgtacta cgcagctgcg cctgctctcc 841 gaggacactg acagccagcg cctggcctac gtgacgtgtc aggaccttcg tagcattgca 901 gaccctgcag agcagatggt tatggtgatc aaagcccctc ctgagaccca gctccaagcc 961 gtggactctt cggagaactt tcagatctcc cttaagagca aacaaggccc gatcgatgtt 1021 ttcctgtgcc ctgaggagac cgtaggtggg atcagccctg ggaagacccc atcccaggag 1081 gtcacttctg aggaggagaa cagggccact gactctgcca ccatagtgtc accaccacca 1141 tcatctcccc cctcatccct caccacagat cccagccagt ctctactcag cctggagcaa 1201 gaaccgctgt tgtcccggat gggcagcctg cgggctcccg tggacgagga ccgcctgtcc 1261 ccgctggtgg cggccgactc gctcctggag catgtgcggg aggacttctc cggcctcctc 1321 cctgaggagt tcatcagcct ttccccaccc cacgaggccc tcgactacca cttcggcctc 1381 gaggagggcg agggcatcag agacctcttc gactgtgact ttggggacct cacccccctg 1441 gatttctgac agggcttgga gggaccaggg tttccagagt agctcacctt gtctctgcag 1501 ccctggagcc ccctgtccct ggccgtcctc ccagcctgtt tggaaacatt taatttatac 1561 ccctctcctc tgtctccaga agcttctagc tctggggtct ggctaccgct aggaggctga 1621 gcaagccagg aagggaagga gtctgtgtgg tgtgtatgtg catgcagcct acacccacac 1681 gtgtgtaccg ggggtgaatg tgtgtgagca tgtgtgtgtg catgtaccgg ggaatgaagg 1741 tgaacataca cctctgtgtg tgcactgcag acacgcccca gtgtgtccac atgtgtgtgc 1801 atgagtccat ctctgcgcgt gggggggctc taactgcact ttcggccctt ttgctcgtgg 1861 ggtcccacaa ggcccagggc agtgcctgct cccagaatct ggtgctctga ccaggccagg 1921 tggggaggct ttggctggct gggcgtgtag gacggtgaga gcacttctgt cttaaaggtt 1981 ttttctgatt gaagctttaa tggagcgtta tttatttatc gaggcctctt tggtgagcct 2041 ggggaatcag caaaagggga ggaggggtgt ggggttgata ccccaactcc ctctaccctt 2101 gagcaagggc aggggtccct gagctgttct tctgccccat actgaaggaa ctgaggcctg 2161 ggtgatttat ttattgggaa agtgagggag ggagacagac tgactgacag ccatgggtgg 2221 tcagatggtg gggtgggccc tctccagggg gccagttcag ggcccagctg ccccccagga 2281 tggatatgag atgggagagg tgagtggggg accttcactg atgtgggcag gaggggtggt 2341 gaaggcctcc cccagcccag accctgtggt ccctcctgca gtgtctgaag cgcctgcctc 2401 cccactgctc tgccccaccc tccaatctgc actttgattt gcttcctaac agctctgttc 2461 cctcctgctt tggttttaat aaatattttg atgacgttaa aaaaaggaat tcgatat // LOCUS HUMEB2CR2 4094 bp mRNA PRI 15-JUN-1989 DEFINITION Human Epstein-Barr virus complement receptor type II(cr2). ACCESSION J03565 NID g181919 KEYWORDS cell surface glycoprotein; membrane protein; signal peptide. SOURCE Human Raji B lymphoblastiod cell, cDNA to mRNA, clone lambda-E41. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4094) AUTHORS Moore,M., Cooper,N., Tack,B. and Nemerow,G. TITLE Molecular cloning of the cDNA encoding the epstein-barr virus.C3d receptor (complement receptor type 2) of human b lymphocytes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 9194-9198 (1987) MEDLINE 88097454 COMMENT Submitted in computer-readable form by M.Moore 25-NOV-1987. FEATURES Location/Qualifiers source 1..4094 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 64..123 /gene="CR2" /note="siginal peptide" CDS 64..3327 /gene="CR2" /note="CR2 precursor" /codon_start=1 /db_xref="PID:g181920" /translation="MGAAGLLGVFLALVAPGVLGISCGSPPPILNGRISYYSTPIAVG TVIRYSCSGTFRLIGEKSLLCITKDKVDGTWDKPAPKCEYFNKYSSCPEPIVPGGYKI RGSTPYRHGDSVTFACKTNFSMNGNKSVWCQANNMWGPTRLPTCVSVFPLECPALPMI HNGHHTSENVGSIAPGLSVTYSCESGYLLVGEKIINCLSSGKWSAVPPTCEEARCKSL GRFPNGKVKEPPILRVGVTANFFCDEGYRLQGPPSSRCVIAGQGVAWTKMPVCEEIFC PSPPPILNGRHIGNSLANVSYGSIVTYTCDPDPEEGVNFILIGESTLRCTVDSQKTGT WSGPAPRCELSTSAVQCPHPQILRGRMVSGQKDRYTYNDTVIFACMFGFTLKGSKQIR CNAQGTWEPSAPVCEKECQAPPNILNGQKEDRHMVRFDPGTSIKYSCNPGYVLVGEES IQCTSEGVWTPPVPQCKVAACEATGRQLLTKPQHQFVRPDVNSSCGEGYKLSGSVYQE CQGTIPWFMEIRLCKEITCPPPPVIYNGAHTGSSLEDFPYGTTVTYTCNPGPERGVEF SLIGESTIRCTSNDQERGTWSGPAPLCKLSLLAVQCSHVHIANGYKISGKEAPYFYND TVTFKCYSGFTLKGSSQIRCKRDNTWDPEIPVCEKGCQPPPGLHHGRHTGGNTVFFVS GMTVDYTCDPGYLLVGNKSIHCMPSGNWSPSAPRCEETCQHVRQSLQELPAGSRVELV NTSCQDGYQLTGHAYQMCQDAENGIWFKKIPLCKVIHCHPPPVIVNGKHTGMMAENFL YGNEVSYECDQGFYLLGEKNCSAEVILKAWILERAFPQCLRSLCPNPEVKHGYKLNKT HSAYSHNDIVYVDCNPGFIMNGSRVIRCHTDNTWVPGVPTCIKKAFIGCPPPPKTPNG NHTGGNIARFSPGMSILYSCDQGYLVVGEPLLLCTHEGTWSQPAPHCKEVNCSSPADM DGIQKGLEPRKMYQYGAVVTLECEDGYMLEGSPQSQCQSDHQWNPPLAVCRSRSLAPV LCGIAAGLILLTFLIVITLYVISKHRERNYYTDTSQKEAFHLEAREVYSVDPYNPAS" gene 64..3327 /gene="CR2" mat_peptide 124..3327 /gene="CR2" /note="CR2 protein" BASE COUNT 1105 a 917 c 931 g 1141 t ORIGIN 3363 bp upstream of EcoRI site; chromosome 1q32. 1 ccagagctgc cggacgctcg cgggtctcgg aacgcatccc gccgcggggg cttcggccgt 61 ggcatgggcg ccgcgggcct gctcggggtt ttcttggctc tcgtcgcacc gggggtcctc 121 gggatttctt gtggctctcc tccgcctatc ctaaatggcc ggattagtta ttattctacc 181 cccattgctg ttggtaccgt gataaggtac agttgttcag gtaccttccg cctcattgga 241 gaaaaaagtc tattatgcat aactaaagac aaagtggatg gaacctggga taaacctgct 301 cctaaatgtg aatatttcaa taaatattct tcttgccctg agcccatagt accaggagga 361 tacaaaatta gaggctctac accctacaga catggtgatt ctgtgacatt tgcctgtaaa 421 accaacttct ccatgaacgg aaacaagtct gtttggtgtc aagcaaataa tatgtggggg 481 ccgacacgac taccaacctg tgtaagtgtt ttccctctcg agtgtccagc acttcctatg 541 atccacaatg gacatcacac aagtgagaat gttggctcca ttgctccagg attgtctgtg 601 acttacagct gtgaatctgg ttacttgctt gttggagaaa agatcattaa ctgtttgtct 661 tcgggaaaat ggagtgctgt cccccccaca tgtgaagagg cacgctgtaa atctctagga 721 cgatttccca atgggaaggt aaaggagcct ccaattctcc gggttggtgt aactgcaaac 781 tttttctgtg atgaagggta tcgactgcaa ggcccacctt ctagtcggtg tgtaattgct 841 ggacagggag ttgcttggac caaaatgcca gtatgtgaag aaattttttg cccatcacct 901 ccccctattc tcaatggaag acatataggc aactcactag caaatgtctc atatggaagc 961 atagtcactt acacttgtga cccggaccca gaggaaggag tgaacttcat ccttattgga 1021 gagagcactc tccgttgtac agttgatagt cagaagactg ggacctggag tggccctgcc 1081 ccacgctgtg aactttctac ttctgcggtt cagtgtccac atccccagat cctaagaggc 1141 cgaatggtat ctgggcagaa agatcgatat acctataacg acactgtgat atttgcttgc 1201 atgtttggct tcaccttgaa gggcagcaag caaatccgat gcaatgccca aggcacatgg 1261 gagccatctg caccagtctg tgaaaaggaa tgccaggccc ctcctaacat cctcaatggg 1321 caaaaggaag atagacacat ggtccgcttt gaccctggaa catctataaa atatagctgt 1381 aaccctggct atgtgctggt gggagaagaa tccatacagt gtacctctga gggggtgtgg 1441 acaccccctg taccccaatg caaagtggca gcgtgtgaag ctacaggaag gcaactcttg 1501 acaaaacccc agcaccaatt tgttagacca gatgtcaact cttcttgtgg tgaagggtac 1561 aagttaagtg ggagtgttta tcaggagtgt caaggcacaa ttccttggtt tatggagatt 1621 cgtctttgta aagaaatcac ctgcccacca ccccctgtta tctacaatgg ggcacacacc 1681 gggagttcct tagaagattt tccatatgga accacggtca cttacacatg taaccctggg 1741 ccagaaagag gagtggaatt cagcctcatt ggagagagca ccatccgttg tacaagcaat 1801 gatcaagaaa gaggcacctg gagtggccct gctcccctat gtaaactttc cctccttgct 1861 gtccagtgct cacatgtcca tattgcaaat ggatacaaga tatctggcaa ggaagcccca 1921 tatttctaca atgacactgt gacattcaag tgttatagtg gatttacttt gaagggcagt 1981 agtcagattc gttgcaaacg tgataacacc tgggatcctg aaataccagt ttgtgaaaaa 2041 ggctgccagc cacctcctgg gctccaccat ggtcgtcata caggtggaaa tacggtcttc 2101 tttgtctctg ggatgactgt agactacact tgtgaccctg gctatttgct tgtgggaaac 2161 aaatccattc actgtatgcc ttcaggaaat tggagtcctt ctgccccacg gtgtgaagaa 2221 acatgccagc atgtgagaca gagtcttcaa gaacttccag ctggttcacg tgtggagcta 2281 gttaatacgt cctgccaaga tgggtaccag ttgactggac atgcttatca gatgtgtcaa 2341 gatgctgaaa atggaatttg gttcaaaaag attccacttt gtaaagttat tcactgtcac 2401 cctccaccag tgattgtcaa tgggaagcac acaggcatga tggcagaaaa ctttctatat 2461 ggaaatgaag tctcttatga atgtgaccaa ggattctatc tcctgggaga gaaaaattgc 2521 agtgcagaag tgattctaaa ggcatggatc ttggagcgag ccttcccaca gtgcttacga 2581 tctctgtgcc ctaatccaga agtcaaacat gggtacaagc tcaataaaac acattctgca 2641 tattcccaca atgacatagt gtatgttgac tgcaatcctg gcttcatcat gaatggtagt 2701 cgcgtgatta ggtgtcatac tgataacaca tgggtgccag gtgtgccaac ttgtatcaaa 2761 aaagccttca tagggtgtcc acctccgcct aagaccccta acgggaacca tactggtgga 2821 aacatagctc gattttctcc tggaatgtca atcctgtaca gctgtgacca aggctacctg 2881 gtggtgggag agccactcct tctttgcaca catgagggaa cctggagcca acctgcccct 2941 cattgtaaag aggtaaactg tagctcacca gcagatatgg atggaatcca gaaagggctg 3001 gaaccaagga aaatgtatca gtatggagct gttgtaactc tggagtgtga agatgggtat 3061 atgctggaag gcagtcccca gagccagtgc caatcggatc accaatggaa ccctcccctg 3121 gcggtttgca gatcccgttc acttgctcct gtcctttgtg gtattgctgc aggtttgata 3181 cttcttacct tcttgattgt cattacctta tacgtgatat caaaacacag agaacgcaat 3241 tattatacag atacaagcca gaaagaagct tttcatttag aagcacgaga agtatattct 3301 gttgatccat acaacccagc cagctgatca gaagacaaaa ctggtgtgtg cctcattgct 3361 tggaattcag cggaatattg attagaaaga aactgctcta atatcagcaa gtctctttat 3421 atggcctcaa gatcaatgaa atgatgtcat aagcgatcac ttcctatatg cacttattct 3481 caagaagaac atctttatgg taaagatggg agcccagttt cactgccata tactcttcaa 3541 ggactttctg aagcctcact tatgagatgc ctgaagccag gccatggcta taaacattac 3601 atggctctaa aagttttgcc ctttttaagg aggcactaaa aagagctgtc ctggtatcta 3661 gacccatctt ctttttgaaa tcacatactc atgttactat ctgcttttgg ttataatgtg 3721 tttttaatta tctaaagtat gaagcatttt ctggggttat gatggcctta cttttattag 3781 gaagtatggt tttattttga tagtagcttc cttcctcggt ggtgttaatc atttcgtttt 3841 taccctttac cttcggattt gagtttctct cacattactg tatatacttt gccttccata 3901 atcactcagt gattgcaatt tgcacaagtt tttttaaatt atgggaatca agatttaatc 3961 ctagagattt ggtgtacaat tcaggctttg gatgtttctt tagcagtttt gtgataagtt 4021 ctagttgctt gtaaaatttc acttaataat gtgtacatta gtcattcaat aaattgtaat 4081 tgtaaagaaa acat // LOCUS HUMEBI1CDN 2139 bp mRNA PRI 10-AUG-1995 DEFINITION Human G protein-coupled receptor (EBI 1) mRNA, complete cds. ACCESSION L31581 NID g468319 KEYWORDS G protein-coupled receptor. SOURCE Homo sapiens Blood cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2139) AUTHORS Schweickart,V.L., Raport,C.J., Godiska,R., Byers,M.G., Eddy,R.L. Jr., Shows,T.B. and Gray,P.W. TITLE Cloning of human and mouse EBI1, a lymphoid-specific G-protein-coupled receptor encoded on human chromosome 17q12-q21.2 JOURNAL Genomics 23 (3), 643-650 (1994) MEDLINE 95154835 FEATURES Location/Qualifiers source 1..2139 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="Blood" /map="17q12-21.2" misc_feature 1..18 /gene="EBI 1" /note="This sequence was obtained by RACE-PCR, appended to cDNA clone." 5'UTR 1..66 /gene="EBI 1" gene 1..2139 /gene="EBI 1" CDS 67..1203 /gene="EBI 1" /codon_start=1 /product="G protein-coupled receptor" /db_xref="PID:g468320" /translation="MDLGKPMKSVLVVALLVIFQVCLCQDEVTDDYIGDNTTVDYTLF ESLCSKKDVRNFKAWFLPIMYSIICFVGLLGNGLVVLTYIYFKRLKTMTDTYLLNLAV ADILFLLTLPFWAYSAAKSWVFGVHFCKLIFAIYKMSFFSGMLLLLCISIDRYVAIVQ AVSAHRHRARVLLISKLSCVGIWILATVLSIPELLYSDLQRSSSEQAMRCSLITEHVE AFITIQVAQMVIGFLVPLLAMSFCYLVIIRTLLQARNFERNKAIKVIIAVVVVFIVFQ LPYNGVVLAQTVANFNITSSTCELSKQLNIAYDVTYSLACVRCCVNPFLYAFIGVKFR NDLFKLFKDLGCLSQEQLRQWSSCRHIRRSSMSVEAETTTTFSP" 3'UTR 1201..2139 /gene="EBI 1" polyA_site 2139 /gene="EBI 1" BASE COUNT 472 a 643 c 546 g 478 t ORIGIN 1 gtgagacagg ggtagtgcga ggccgggcac agccttcctg tgtggtttta ccgcccagag 61 agcgtcatgg acctggggaa accaatgaaa agcgtgctgg tggtggctct ccttgtcatt 121 ttccaggtat gcctgtgtca agatgaggtc acggacgatt acatcggaga caacaccaca 181 gtggactaca ctttgttcga gtctttgtgc tccaagaagg acgtgcggaa ctttaaagcc 241 tggttcctcc ctatcatgta ctccatcatt tgtttcgtgg gcctactggg caatgggctg 301 gtcgtgttga cctatatcta tttcaagagg ctcaagacca tgaccgatac ctacctgctc 361 aacctggcgg tggcagacat cctcttcctc ctgacccttc ccttctgggc ctacagcgcg 421 gccaagtcct gggtcttcgg tgtccacttt tgcaagctca tctttgccat ctacaagatg 481 agcttcttca gtggcatgct cctacttctt tgcatcagca ttgaccgcta cgtggccatc 541 gtccaggctg tctcagctca ccgccaccgt gcccgcgtcc ttctcatcag caagctgtcc 601 tgtgtgggca tctggatact agccacagtg ctctccatcc cagagctcct gtacagtgac 661 ctccagagga gcagcagtga gcaagcgatg cgatgctctc tcatcacaga gcatgtggag 721 gcctttatca ccatccaggt ggcccagatg gtgatcggct ttctggtccc cctgctggcc 781 atgagcttct gttaccttgt catcatccgc accctgctcc aggcacgcaa ctttgagcgc 841 aacaaggcca tcaaggtgat catcgctgtg gtcgtggtct tcatagtctt ccagctgccc 901 tacaatgggg tggtcctggc ccagacggtg gccaacttca acatcaccag tagcacctgt 961 gagctcagta agcaactcaa catcgcctac gacgtcacct acagcctggc ctgcgtccgc 1021 tgctgcgtca accctttctt gtacgccttc atcggcgtca agttccgcaa cgatctcttc 1081 aagctcttca aggacctggg ctgcctcagc caggagcagc tccggcagtg gtcttcctgt 1141 cggcacatcc ggcgctcctc catgagtgtg gaggccgaga ccaccaccac cttctcccca 1201 taggcgactc ttctgcctgg actagaggga cctctcccag ggtccctggg gtggggatag 1261 ggagcagatg caatgactca ggacatcccc ccgccaaaag ctgctcaggg aaaagcagct 1321 ctcccctcag agtgcaagcc ctgctccaga agttagcttc accccaatcc cagctacctc 1381 aaccaatgcc gaaaaagaca gggctgataa gctaacacca gacagacaac actgggaaac 1441 agaggctatt gtcccctaaa ccaaaaactg aaagtgaaag tccagaaact gttcccacct 1501 gctggagtga aggggccaag gagggtgagt gcaaggggcg tgggagtggc ctgaagagtc 1561 ctctgaatga accttctggc ctcccacaga ctcaaatgct cagaccagct cttccgaaaa 1621 ccaggcctta tctccaagac cagagatagt ggggagactt cttggcttgg tgaggaaaag 1681 cggacatcag ctggtcaaac aaactctctg aacccctccc tccatcgttt tcttcactgt 1741 cctccaagcc agcgggaatg gcagctgcca cgccgcccta aaagcacact catcccctca 1801 cttgccgcgt cgccctccca ggctctcaac aggggagagt gtggtgtttc ctgcaggcca 1861 ggccagctgc ctccgcgtga tcaaagccac actctgggct ccagagtggg gatgacatgc 1921 actcagctct tggctccact gggatgggag gagaggacaa gggaaatgtc aggggcgggg 1981 agggtgacag tggccgccca aggccacgag cttgttcttt gttctttgtc acagggactg 2041 aaaacctctc ctcatgttct gctttcgatt cgttaagaga gcaacatttt acccacacac 2101 agataaagtt ttcccttgag gaaacaacag ctttaaaag // LOCUS HUMEBI3X 1161 bp mRNA PRI 01-APR-1996 DEFINITION Human cytokine receptor (EBI3) mRNA, complete cds. ACCESSION L08187 NID g632973 KEYWORDS cytokine receptor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1161) AUTHORS Devergne,O., Hummel,M., Koeppen,H., Le Beau,M.M., Nathanson,E.C., Kieff,E. and Birkenbach,M. TITLE A novel interleukin-12 p40-related protein induced by latent Epstein-Barr virus infection in B lymphocytes JOURNAL J. Virol. 70 (2), 1143-1153 (1996) MEDLINE 96135230 FEATURES Location/Qualifiers source 1..1161 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="BL41/B95-8" /cell_type="B lymphocyte, EBV-converted Burkitt lymphoma" sig_peptide 14..73 /gene="EBI3" CDS 14..703 /gene="EBI3" /codon_start=1 /product="cytokine receptor" /db_xref="PID:g632974" /translation="MTPQLLLALVLWASCPPCSGRKGPPAALTLPRVQCRASRYPIAV DCSWTLPPAPNSTSPVSFIATYRLGMAARGHSWPCLQQTPTSTSCTITDVQLFSMAPY VLNVTAVHPWGSSSSFVPFITEHIIKPDPPEGVRLSPLAERHVQVQWEPPGSWPFPEI FSLKYWIRYKRQGAARFHRVGPIEATSFILRAVRPRARYYVQVAAQDLTDYGELSDWS LPATATMSLGK" gene 14..703 /gene="EBI3" mat_peptide 74..700 /gene="EBI3" /product="cytokine receptor" BASE COUNT 239 a 378 c 304 g 240 t ORIGIN 1 gaattccgca gccatgaccc cgcagcttct cctggccctt gtcctctggg ccagctgccc 61 gccctgcagt ggaaggaaag ggcccccagc agctctgaca ctgccccggg tgcaatgccg 121 agcctctcgg tacccgatcg ccgtggattg ctcctggacc ctgccgcctg ctccaaactc 181 caccagcccc gtgtccttca ttgccacgta caggctcggc atggctgccc ggggccacag 241 ctggccctgc ctgcagcaga cgccaacgtc caccagctgc accatcacgg atgtccagct 301 gttctccatg gctccctacg tgctcaatgt caccgccgtc cacccctggg gctccagcag 361 cagcttcgtg cctttcataa cagagcacat catcaagccc gaccctccag aaggcgtgcg 421 cctaagcccc ctcgctgagc gccacgtaca ggtgcagtgg gagcctcccg ggtcctggcc 481 cttcccagag atcttctcac tgaagtactg gatccgttac aagcgtcagg gagctgcgcg 541 cttccaccgg gtggggccca ttgaagccac gtccttcatc ctcagggctg tgcggccccg 601 agccaggtac tacgtccaag tggcggctca ggacctcaca gactacgggg aactgagtga 661 ctggagtctc cccgccactg ccacaatgag cctgggcaag tagcaagggc ttcccgctgc 721 ctccagacag cacctgggtc ctcgccaccc taagccccgg gacacctgtt ggagggcgga 781 tgggatctgc ctagcctggg ctggagtcct tgctttgctg ctgctgagct gccgggcaac 841 ctcagatgac cgacttttcc ctttgagcct cagtttctct agctgagaaa tggagatgta 901 ctactctctc ctttaccttt acctttacca cagtgcaggg ctgactgaac tgtcactgtg 961 agatattttt tattgtttaa ttagaaaaga attgttgttg ggctgggcgc agtggatcgc 1021 acctgtaatc ccagtcactg ggaagccgac gtgggtgggt agcttgaggc caggagctcg 1081 aaaccagtcc gggccacaca gcaagacccc atctctaaaa aattaatata aatataaaat 1141 aaaaaaaaaa aaaaggaatt c // LOCUS HUMECHD 3779 bp mRNA PRI 18-NOV-1994 DEFINITION Human enyol-CoA: hydratase 3-hydroxyacyl-CoA dehydrogenase (EHHADH) mRNA, complete cds with repeats. ACCESSION L07077 NID g452044 KEYWORDS 3-hydroxyacyl-CoA dehydrogenase; EHHADH gene; enyol-CoA: hydratase. SOURCE Homo sapiens adult liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3779) AUTHORS Hoefler,G., Forstner,M., McGuiness,M.C., Hulla,W., Hiden,M., Krisper,P., Kenner,L., Lengauer,C., Ried,T., Zechner,R., Chen,G.L. and Moser,H.W. TITLE cDNA cloning of the human peroxisomal enoyl-CoA hydratase: 3-hydroxyacyl-CoA dehydrogenase bifunctional enzyme and localization to chromosome 3q26.3-3q28: a free left Alu Arm is inserted in the 3' noncoding region JOURNAL Genomics 19 (1), 60-67 (1994) MEDLINE 94245181 FEATURES Location/Qualifiers source 1..3779 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="liver" /map="3q26.3-q28" 5'UTR 1..7 /partial /gene="EHHADH" /note="G00-141-631; putative" gene 1..3779 /gene="EHHADH" CDS 8..2179 /gene="EHHADH" /note="putative" /codon_start=1 /db_xref="GDB:G00-141-631" /product="enyol-CoA: hydratase/3-hydroxyacyl-CoA dehydrogenase" /db_xref="PID:g452045" /translation="MAEYTRLHNALALIRLRNPPVNAISTTLLRDIKEGLQKAGRDHT IKAIVICGAEGKFSAGADIRGFSAPRTFGLILGHVVDEIQRNEKPVVAAIQGMAFGGG LELALGCHYRIAHADAQVGLPEVTLGLLPGARGTQLLPRLTGVPAALDLITSGRRILA DEALKLGILDKVVNSDPVEEAIRFAQRVSDQPLESRRLCNKPIQSLPNMDSIFSEALL KMRRQHPGCLAQEACVRAVQAAVQYPYEVGIKKEEELFLYLLQSGQARALQYAFFAER KANKWSTPSGASWKTASARPVSSVGVVGLGTMGRGIVISFARARIPVIGVDSDKNQLA TANKMITSVLEKEASKMQQSGHPWSGPKPRLTSSVKELGGVDLVIEAVFEEMSLKKQV FAELSAVCKPEAFLCTNTSALDVDEIASSTDRPHLVIGTHFFSPAHVMKLLEVIPSQY SSPTTIATVMNLSKKIKKIGVVVGNCFGFVGNRMLNPYYNQAYFLLEEGSKPEEVDQV LEEFGFKMGPFRVSDLAGLDVGWKSRKGQGLTGPTLLPGTPARKRGNRRYCPIPDVLC ELGRFGQKTGKGWYQYDKPLGRIHKPDPWLSTFLSRYRKPHHIEPRTISQDEILERCL YSLINEAFRILGEGIAASPEHIDVVYLHGYGCARHKGGPMFYASTVGLPTVLEKLQKY YRQNPDIPQLEPSDYLKKLASQGNPPLKEWQSLAGSPSSKL" 3'UTR 2180..3779 /gene="EHHADH" /note="G00-141-631" /evidence=experimental repeat_region 3148..3264 /gene="EHHADH" /standard_name="free left Alu arm" /note="G00-141-631; putative" /rpt_family="Alu" BASE COUNT 1152 a 732 c 845 g 1050 t ORIGIN 1 gggaaacatg gccgagtata cgcggctgca caacgccttg gcgctaatcc gcctccgaaa 61 cccgccggtc aacgcgatca gtacgacttt actccgtgat ataaaagaag gactacagaa 121 agctggaaga gaccatacaa taaaagccat tgtgatttgt ggagcagagg gcaaattttc 181 tgcaggtgct gatattcgtg gcttcagtgc tcctaggaca tttggcctta tactgggaca 241 tgtagtagat gaaatacaga gaaatgagaa gcccgtggtg gcagcaatcc aaggcatggc 301 tttcggaggg ggactagagc tggccctggg ctgtcactat aggattgccc acgcagacgc 361 tcaagttggc ttaccagaag ttacacttgg acttctccct ggtgcaagag gaacccagct 421 tctccccaga ctcactggag ttcctgctgc acttgactta attacctcag gaagacgtat 481 tttagcagat gaagcactca agctgggcat tctagataaa gttgtaaact cagacccggt 541 tgaagaagca atcagatttg ctcagagagt ttcagatcaa cctctagaat cccgtagact 601 ctgcaacaag ccaattcaga gcttgcccaa catggacagc atttttagtg aggccctctt 661 gaagatgcgg aggcagcacc ctgggtgtct tgcacaggag gcttgtgtcc gtgcagtcca 721 ggctgctgtg cagtatccct atgaagtggg catcaagaag gaggaggagc tgtttctata 781 tcttttgcaa tcagggcagg ctagagccct gcaatatgct ttcttcgctg aaaggaaagc 841 aaataagtgg tcaactccct ccggagcatc gtggaaaaca gcatcagcgc ggcctgtctc 901 ctcagttggt gttgttggct tgggaacaat gggccgaggc attgtcattt cttttgcaag 961 ggccaggatt cctgtgattg gtgtagactc ggacaaaaac cagctagcaa ctgcaaacaa 1021 gatgataacc tctgtcttgg aaaaagaagc ctccaaaatg caacagagcg gccacccttg 1081 gtcaggacca aaacccaggt taacttcatc tgtgaaggag cttggtggtg tagatttagt 1141 cattgaagca gtatttgagg aaatgagcct gaagaagcag gtctttgctg aactctcagc 1201 tgtgtgcaaa ccagaagcat ttttgtgcac taatacttca gccctggatg ttgatgagat 1261 tgcttcttcc actgatcgtc ctcacttggt cattggcacc cacttctttt cgccagctca 1321 tgtcatgaag ttgttagagg ttattcccag ccaatactct tcccccacta ccattgccac 1381 tgttatgaac ttatcaaaaa agattaaaaa gattggagtc gttgtaggca actgttttgg 1441 atttgtgggg aatcgaatgt tgaatcctta ctacaatcag gcatatttct tgttagaaga 1501 aggcagcaaa ccagaggagg tagatcaggt gctggaagag tttggtttta aaatgggacc 1561 ttttagagtg tctgatcttg ctgggttgga tgtgggctgg aaatctagaa aggggcaagg 1621 tcttactgga cctacattgc ttccaggaac tcctgcccga aaaaggggta ataggaggta 1681 ctgcccaatt cctgatgtgc tctgtgaatt aggacgattt ggccagaaga caggtaaggg 1741 ttggtatcaa tatgacaagc cattgggtag gattcacaaa cctgatccct ggctttccac 1801 attcctatca cggtatagaa aaccccatca cattgaacca cgtaccatta gccaggatga 1861 gatccttgaa cgctgcttat attcacttat caatgaagca ttccgtatct tgggagaagg 1921 gatagctgct agcccagagc acattgatgt tgtctattta catggatatg gatgcgcaag 1981 gcacaagggc gggcccatgt tctatgcttc cacagttggg ttgcccacag ttctagagaa 2041 attgcagaaa tattacaggc agaaccctga tattccccaa ctggagccaa gtgactatct 2101 aaaaaaactg gcttctcagg gaaaccctcc cctgaaagaa tggcaaagct tggcaggctc 2161 ccctagcagt aaattgtgat tcagtcttcc agattatgcc tcacatgcta gcatcaggta 2221 atgctgactg aatttcagtg aaattaaatc aaaaatccaa agtaagattg ttctgaaata 2281 caaagcaaaa taaataatca ttagaatctt ctgtgtaacg actctaatgg tcaaatcttt 2341 aggaatgtgc ttcctatgcc tctgaatctg tccttatcag ataaattcaa tgcatgaact 2401 tgtgtgaata taataccata atagctaatg aaagaggctc aggcataagt tgagattctc 2461 aaatgctttt atcattggat aaatgtgtca tcaattaata aatgataaat gcagctaagt 2521 catacattca ttttgactcc tttcaatgtc acacacatag tattgatcag aaatcttatg 2581 aatcatacat acactcaaca aacattaaag ttgtaggaaa aagacagttg gaaattggta 2641 agggaactga gtacttcaaa ccagcacagg gaacttaggt tagtgtggca agcctttcct 2701 cttctggtct ttcctcttct gtttatggag aaataataga aagtagtaag tcgttaactt 2761 agtgtaagaa gggtcttaga gaacatctaa ccttctagga tttcccaatt ctgtgataga 2821 gtaatgacac cagttttcct gtcatgacaa gcctctgtga tgttacatat ggaaatggtt 2881 gaatcttgaa aaatctaaaa ttgttgcaaa acatattttg tatgattttg ttgtaagagt 2941 tcttctcttt ttactttttg ccttgtgtag ttaaaaatta aggggctggt caatacaaaa 3001 acttgtacac aaatatttat agcagaatta ttcataatgg ccaaaagctg aatacaaccc 3061 aaatgtctat gaactaatga atagataaac caaatctggt atatccatac aatggactat 3121 attattcagc cataaaaaga aataaagggc caggcacagt ggctcacacc tgtaatccca 3181 gcactttggg aggctgaggc aggaggattg tttgaggcca gaaatttgag accagcctgg 3241 gcaacatagc aaaaccctgt ctctacaaaa aatactttcc gtacattagt ggttacctaa 3301 ggctagaatt gggggtgatg aatggggagt acaggattca tctaatgggt acaggatttc 3361 tttatggttc atgaaaatgg tctaaactta ttgtggtaat ggtcacataa caatatatta 3421 gaaaaccatt gagttgtata ttttaagtgg gtgaattata tggtatgtga attatatctc 3481 aataaaggtg ttagtaaata aatgggctga agatgttatc cttcattgtg gtgtaaatga 3541 actttcacaa tattttcacc tgtgaaccca aataaaatga ttaaagttct gatggaaaaa 3601 tcttgaatgg agtataagtt ttccgttgtt aaaaagcaaa caaaaaacca acaaaaaatc 3661 caagtgtgcc ttgaattgta cagagcacaa ttattatgtt tgaaatgtgt actacttaat 3721 tttatataat ttggtttgtg aaattaaaga catcaataaa aatgattcct gaaagtagt // LOCUS HUMECPC 1284 bp mRNA PRI 13-MAR-1995 DEFINITION Homo sapiens endothelial cell protein C/APC receptor (EPCR) mRNA, complete cds. ACCESSION L35545 NID g565267 KEYWORDS endothelial cell protein C/APC receptor. SOURCE Homo sapiens endothelial cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1284) AUTHORS Fukudome,K. and Esmon,C.T. TITLE Identification, cloning, and regulation of a novel endothelial cell protein C/activated protein C receptor JOURNAL J. Biol. Chem. 269 (42), 26486-26491 (1994) MEDLINE 95014491 FEATURES Location/Qualifiers source 1..1284 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="endothelial" mRNA 1..1284 sig_peptide 25..69 /gene="EPCR" /note="putative" gene 25..741 /gene="EPCR" CDS 25..741 /gene="EPCR" /note="amino acid feature: transmembrane domain, bp 655 .. 729" /codon_start=1 /product="endothelial cell protein C/APC receptor" /db_xref="PID:g565268" /translation="MLTTLLPILLLSGWAFCSQDASDGLQRLHMLQISYFRDPYHVWY QGNASLGGHLTHVLEGPDTNTTIIQLQPLQEPESWARTQSGLQSYLLQFHGLVRLVHQ ERTLAFPLTIRCFLGCELPPEGSRAHVFFEVAVNGSSFVSFRPERALWQADTQVTSGV VTFTLQQLNAYNRTRYELREFLEDTCVQYVQKHISAENTKGSQTSRSYTSLVLGVLVG GFIIAGVAVGIFLCTGGRRC" mat_peptide 70..738 /gene="EPCR" /product="endothelial cell protein C/APC receptor" polyA_site 1275 BASE COUNT 322 a 325 c 341 g 296 t ORIGIN 1 caggtccgga gcctcaactt caggatgttg acaacattgc tgccgatact gctgctgtct 61 ggctgggcct tttgtagcca agacgcctca gatggcctcc aaagacttca tatgctccag 121 atctcctact tccgcgaccc ctatcacgtg tggtaccagg gcaacgcgtc gctgggggga 181 cacctaacgc acgtgctgga aggcccagac accaacacca cgatcattca gctgcagccc 241 ttgcaggagc ccgagagctg ggcgcgcacg cagagtggcc tgcagtccta cctgctccag 301 ttccacggcc tcgtgcgcct ggtgcaccag gagcggacct tggcctttcc tctgaccatc 361 cgctgcttcc tgggctgtga gctgcctccc gagggctcta gagcccatgt cttcttcgaa 421 gtggctgtga atgggagctc ctttgtgagt ttccggccgg agagagcctt gtggcaggca 481 gacacccagg tcacctccgg agtggtcacc ttcaccctgc agcagctcaa tgcctacaac 541 cgcactcggt atgaactgcg ggaattcctg gaggacacct gtgtgcagta tgtgcagaaa 601 catatttccg cggaaaacac gaaagggagc caaacaagcc gctcctacac ttcgctggtc 661 ctgggcgtcc tggtgggcgg tttcatcatt gctggtgtgg ctgtaggcat cttcctgtgc 721 acaggtggac ggcgatgtta attactctcc agccccgtca gaaggggctg gattgatgga 781 ggctggcaag ggaaagtttc agctcactgt gaagccagac tccccaactg aaacaccaga 841 aggtttggag tgacagctcc tttcttctcc cacatctgcc cactgaagat ttgagggagg 901 ggagatggag aggagaggtg gacaaagtac ttggtttgct aagaacctaa gaacgtgtat 961 gctttgctga attagtctga taagtgaatg tttatctatc tttgtggaaa acagataatg 1021 gagttggggc aggaagccta tgcgccatcc tccaaagaca gacagaatca cctgaggcgt 1081 tcaaaagata taaccaaata aacaagtcat ccacaatcaa aatacaacat tcaatacttc 1141 caggtgtgtc agacttggga tgggacgctg atataatagg gtagaaagaa gtaacacgaa 1201 gaagtggtgg aaatgtaaaa tccaagtcat atggcagtga tcaattatta atcaattaat 1261 aatattaata aatttcttat attt // LOCUS HUMEDF 1840 bp mRNA PRI 30-SEP-1988 DEFINITION Human erythroid differentiation protein mRNA (EDF), complete cds. ACCESSION J03634 NID g181946 KEYWORDS erythroid differentiation protein. SOURCE Human acute monocytic leukemia cell line THP-1, cDNA to mRNA, clone pSD(X)/EDF. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1840) AUTHORS Murata,M., Eto,Y., Shibai,H., Sakai,M. and Muramatsu,M. TITLE Erythroid differentiation factor is encoded by the same mRNA as that of the inhibin beta-A chain JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 2434-2438 (1988) MEDLINE 88190086 COMMENT Draft entry and printed copy of sequence for [1] kindly provided by M.Murata, 01-MAR-1988. FEATURES Location/Qualifiers source 1..1840 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 86..169 /note="erythroid differentiation protein signal peptide" CDS 86..1366 /note="erythroid differentiation protein precursor" /codon_start=1 /db_xref="PID:g181947" /translation="MPLLWLRGFLLASCWIIVRSSPTPGSEGHSAAPDCPSCALAALP KDVPNSQPEMVEAVKKHILNMLHLKKRPDVTQPVPKAALLNAIRKLHVGKVGENGYVE IEDDIGRRAEMNELMEQTSEIITFAESGTARKTLHFEISKEGSDLSVVERAEVWLFLK VPKANRTRTKVTIRLFQQQKHPQGSLDTGEEAEEVGLKGERSELLLSEKVVDARKSTW HVFPVSSSIQRLLDQGKSSLDVRIACEQCQESGASLVLLGKKKKKEEEGEGKKKGGGE GGAGADEEKEQSHRPFLMLQARQSEDHPHRRRRRGLECDGKVNICCKKQFFVSFKDIG WNDWIIAPSGYHANYCEGECPSHIAGTSGSSLSFHSTVINHYRMRGHSPFANLKSCCV PTKLRPMSMLYYDDGQNIIKKDIQNMIVEECGCS" mat_peptide 170..1363 /note="erythroid differentiation protein" BASE COUNT 537 a 427 c 521 g 355 t ORIGIN 158 bp upstream of BamHI site. 1 tccacacaca caaaaaacct gcgcgtgagg ggggaggaaa agcagggcct ttaaaaaggc 61 aatcacaaca acttttgctg ccaggatgcc cttgctttgg ctgagaggat ttctgttggc 121 aagttgctgg attatagtga ggagttcccc caccccagga tccgaggggc acagcgcggc 181 ccccgactgt ccgtcctgtg cgctggccgc cctcccaaag gatgtaccca actctcagcc 241 agagatggtg gaggccgtca agaagcacat tttaaacatg ctgcacttga agaagagacc 301 cgatgtcacc cagccggtac ccaaggcggc gcttctgaac gcgatcagaa agcttcatgt 361 gggcaaagtc ggggagaacg ggtatgtgga gatagaggat gacattggaa ggagggcaga 421 aatgaatgaa cttatggagc agacctcgga gatcatcacg tttgccgagt caggaacagc 481 caggaagacg ctgcacttcg agatttccaa ggaaggcagt gacctgtcag tggtggagcg 541 tgcagaagtc tggctcttcc taaaagtccc caaggccaac aggaccagga ccaaagtcac 601 catccgcctc ttccagcagc agaagcaccc gcagggcagc ttggacacag gggaagaggc 661 cgaggaagtg ggcttaaagg gggagaggag tgaactgttg ctctctgaaa aagtagtaga 721 cgctcggaag agcacctggc atgtcttccc tgtctccagc agcatccagc ggttgctgga 781 ccagggcaag agctccctgg acgttcggat tgcctgtgag cagtgccagg agagtggcgc 841 cagcttggtt ctcctgggca agaagaagaa gaaagaagag gagggggaag ggaaaaagaa 901 gggcggaggt gaaggtgggg caggagcaga tgaggaaaag gagcagtcgc acagaccttt 961 cctcatgctg caggcccggc agtctgaaga ccaccctcat cgccggcgtc ggcggggctt 1021 ggagtgtgat ggcaaggtca acatctgctg taagaaacag ttctttgtca gtttcaagga 1081 catcggctgg aatgactgga tcattgctcc ctctggctat catgccaact actgcgaggg 1141 tgagtgcccg agccatatag caggcacgtc cgggtcctca ctgtccttcc actcaacagt 1201 catcaaccac taccgcatgc ggggccatag cccctttgcc aacctcaaat cgtgctgtgt 1261 gcccaccaag ctgagaccca tgtccatgtt gtactatgat gatggtcaaa acatcatcaa 1321 aaaggacatt cagaacatga tcgtggagga gtgtgggtgc tcatagagtt gcccagccca 1381 gggggaaagg gagcaagagt tgtccagaga agacagtggc aaaatgaaga aatttttaag 1441 gtttctgagt taaccagaaa aatagaaatt aaaaacaaaa caaaacaaaa aaaaaaacaa 1501 aaaaaaacaa aagtaaatta aaaacaaacc tgatgaaaca gatgaaacag atgaaggaag 1561 atgtggaaat cttagcctgc cttagccagg gctcagagat gaagcagtga agagacagat 1621 tgggagggaa agggagaatg gtgtaccctt tatttcttct gaaatcacac tgatgacatc 1681 agttgtttaa acggggtatt gtcctttccc cccttgaggt tcccttgtga gcttgaatca 1741 accaatctga tctgcagtag tgtggactag aacaacccaa atagcatcta gaaagccatg 1801 agtttgaaag ggcccatcac aggcactttc ctagcctaat // LOCUS HUMEDG 2757 bp mRNA PRI 07-NOV-1994 DEFINITION Human endothelial differentiation protein (edg-1) gene mRNA, complete cds. ACCESSION M31210 NID g181948 KEYWORDS endothelial differentiation protein. SOURCE Human umbilical vein endothelial cell, cDNA to mRNA, clone p4. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2757) AUTHORS Hla,T. and Maciag,T. TITLE An abundant transcript induced in differentiating human endothelial cells encodes a polypeptide with structural similarities to G-protein-coupled receptors JOURNAL J. Biol. Chem. 265 (16), 9308-9313 (1990) MEDLINE 90264425 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Hla, 11-JAN-1990, for release after publication. FEATURES Location/Qualifiers source 1..2757 /organism="Homo sapiens" /db_xref="taxon:9606" /map="22q13" misc_feature 246..254 /note="consensus KOZAK sequence; putative" gene 251..1396 /gene="ECGF1" CDS 251..1396 /gene="ECGF1" /note="endothelial differentiation protein (edg-1)" /codon_start=1 /db_xref="GDB:G00-127-754" /db_xref="PID:g181949" /translation="MGPTSVPLVKAHRSSVSDYVNYDIIVRHYNYTGKLNISADKENS IKLTSVVFILICCFIILENIFVLLTIWKTKKFHRPMYYFIGNLALSDLLAGVAYTANL LLSGATTYKLTPAQWFLREGSMFVALSASVFSLLAIAIERYITMLKMKLHNGSNNFRL FLLISACWVISLILGGLPIMGWNCISALSSCSTVLPLYHKHYILFCTTVFTLLLLSIV ILYCRIYSLVRTRSRRLTFRKNISKASRSSENVALLKTVIIVLSVFIACWAPLFILLL LDVGCKVKTCDILFRAEYFLVLAVLNSGTNPIIYTLTNKEMRRAFIRIMSCCKCPSGD SAGKFKRPIIAGMEFSRSKSDNSSHPQKDEGDNPETIMSSGNVNSSS" polyA_signal 2591..2596 polyA_signal 2737..2742 BASE COUNT 671 a 706 c 619 g 761 t ORIGIN 1 tctaaaggtc gggggcagca gcaagatgcg aagcgagccg tacagatccc gggctctccg 61 aacgcaactt cgccctgctt gagcgaggct gcggtttccg aggccctctc cagccaagga 121 aaagctacac aaaaagcctg gatcactcat cgaaccaccc ctgaagccag tgaaggctct 181 ctcgcctcgc cctctagcgt tcgtctggag tagcgccacc ccggcttcct ggggacacag 241 ggttggcacc atggggccca ccagcgtccc gctggtcaag gcccaccgca gctcggtctc 301 tgactacgtc aactatgata tcatcgtccg gcattacaac tacacgggaa agctgaatat 361 cagcgcggac aaggagaaca gcattaaact gacctcggtg gtgttcattc tcatctgctg 421 ctttatcatc ctggagaaca tctttgtctt gctgaccatt tggaaaacca agaaattcca 481 ccgacccatg tactatttta ttggcaatct ggccctctca gacctgttgg caggagtagc 541 ctacacagct aacctgctct tgtctggggc caccacctac aagctcactc ccgcccagtg 601 gtttctgcgg gaagggagta tgtttgtggc cctgtcagcc tccgtgttca gtctcctcgc 661 catcgccatt gagcgctata tcacaatgct gaaaatgaaa ctccacaacg ggagcaataa 721 cttccgcctc ttcctgctaa tcagcgcctg ctgggtcatc tccctcatcc tgggtggcct 781 gcctatcatg ggctggaact gcatcagtgc gctgtccagc tgctccaccg tgctgccgct 841 ctaccacaag cactatatcc tcttctgcac cacggtcttc actctgcttc tgctctccat 901 cgtcattctg tactgcagaa tctactcctt ggtcaggact cggagccgcc gcctgacgtt 961 ccgcaagaac atttccaagg ccagccgcag ctctgagaat gtggcgctgc tcaagaccgt 1021 aattatcgtc ctgagcgtct tcatcgcctg ctgggcaccg ctcttcatcc tgctcctgct 1081 ggatgtgggc tgcaaggtga agacctgtga catcctcttc agagcggagt acttcctggt 1141 gttagctgtg ctcaactccg gcaccaaccc catcatttac actctgacca acaaggagat 1201 gcgtcgggcc ttcatccgga tcatgtcctg ctgcaagtgc ccgagcggag actctgctgg 1261 caaattcaag cgacccatca tcgccggcat ggaattcagc cgcagcaaat cggacaattc 1321 ctcccacccc cagaaagacg aaggggacaa cccagagacc attatgtctt ctggaaacgt 1381 caactcttct tcctagaact ggaagctgtc cacccaccgg aagcgctctt tacttggtcg 1441 ctggccaccc cagtgtttgg aaaaaaatct ctgggcttcg actgctgcca gggaggagct 1501 gctgcaagcc agagggagga agggggagaa tacgaacagc ctggtggtgt cgggtgttgg 1561 tgggtagagt tagttcctgt gaacaatgca ctgggaaggg tggagatcag gtcccggcct 1621 ggaatatata ttctaccccc ctggagcttt gattttgcac tgagccaaag gtctagcatt 1681 gtcaagctcc taaagggttc atttggcccc tcctcaaaga ctaatgtccc catgtgaaag 1741 cgtctctttg tctggagctt tgaggagatg ttttccttca ctttagtttc aaacccaagt 1801 gagtgtgtgc acttctgctt ctttagggat gccctgtaca tcccacaccc caccctccct 1861 tcccttcata cccctcctca acgttctttt actttatact ttaactacct gagagttatc 1921 agagctgggg ttgtggaatg atcgatcatc tatagcaaat aggctatgtt gagtacgtag 1981 gctgtgggaa gatgaagatg gtttggaggt gtaaaacaat gtccttcgct gaggccaaag 2041 tttccatgta agcgggatcc gttttttgga atttggttga agtcactttg atttctttaa 2101 aaaacatctt ttcaatgaaa tgtgttacca tttcatatcc attgaagccg aaatctgcat 2161 aaggaagccc actttatcta aatgatatta gccaggatcc ttggtgtcct aggagaaaca 2221 gacaagcaaa acaaagtgaa aaccgaatgg attaactttt gcaaaccaag ggagatttct 2281 tagcaaatga gtctaacaaa tatgacatcc gtctttccca cttttgttga tgtttatttc 2341 agaatcttgt gtgattcatt tcaagcaaca acatgttgta ttttgttgtg ttaaaagtac 2401 ttttcttgat ttttgaatgt atttgtttca ggaagaagtc attttatgga tttttctaac 2461 ccgtgttaac ttttctagaa tccaccctct tgtgccctta agcattactt taactggtag 2521 ggaacgccag aacttttaag tccagctatt cattagatag taattgaaga tatgtataaa 2581 tattacaaag aataaaaata tattactgtc tctttagtat ggttttcagt gcaattaaac 2641 cgagagatgt cttgtttttt taaaaagaat agtatttaat aggtttctga cttttgtgga 2701 tcattttgca catagcttta tcaactttta aacattaata aactgatttt tttaaag // LOCUS HUMEDNRB 1719 bp mRNA PRI 07-NOV-1994 DEFINITION Homo sapiens endothelin receptor type B (EDNRB) mRNA, complete cds. ACCESSION L06623 NID g181958 KEYWORDS endothelin receptor; endothelin receptor type B. SOURCE Homo sapiens (tissue library: lambda ZAPII) lung cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1719) AUTHORS Elshourbagy,N.A., Korman,D.R., Wu,H.L., Sylvester,D.R., Lee,J.A., Nuthalaganti,P., Bergsma,D.J., Kumar,C.S. and Nambi,P. TITLE Molecular characterization and regulation of the human endothelin receptors JOURNAL J. Biol. Chem. 268 (6), 3873-3879 (1993) MEDLINE 93179382 FEATURES Location/Qualifiers source 1..1719 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="lung" /tissue_lib="lambda ZAPII" /map="Unassigned" gene 234..1562 /gene="EDNRB" CDS 234..1562 /gene="EDNRB" /codon_start=1 /db_xref="GDB:G00-129-075" /product="endothelin receptor type B" /db_xref="PID:g181959" /translation="MQPPPSLCGRALVALVLACGLSRIWGEERGFPPDRATPLLQTAE IMTPPTKTLWPKGSNASLARSLAPAEVPKGDRTAGSPPRTISPPPCQGPIEIKETFKY INTVVSCLVFVLGIIGNSTLLRIIYKNKCMRNGPNILIASLALGDLLHIVIDIPINVY KLLAEDWPFGAEMCKLVPFIQKASVGITVLSLCALSIDRYRAVASWSRIKGIGVPKWT AVEIVLIWVVSVVLAVPEAIGFDIITMDYKGSYLRICLLHPVQKTAFMQFYKTAKDWW LFSFYFCLPLAITAFFYTLMTCEMLRKKSGMQIALNDHLKQRREVAKTVFCLVLVFAL CWLPLHLSRILKLTLYNQNDPNRCELLSFLLVLDYIGINMASLNSCINPIALYLVSKR FKNCFKSCLCCWCQSFEEKQSLEEKQSCLKFKANDHGYDNFRSSNKYSSS" BASE COUNT 444 a 400 c 412 g 463 t ORIGIN 1 gggctgcagg tttcgacccg cgctggcgag tcatgagcgc caagtttccc actggcgcgc 61 aaacttgagt tacttttgag cgtggatact ggcgaagagg ctgcgggcgg tattagcgtt 121 tgcagcgact tggctcgggc agctgacccc aaagtgtctg tcttccttcc tctgcttgtc 181 tctaggctct gaaactgcgg cggccaccgg acgcttctgg agcaggtagc agcatgcagc 241 cgcctccaag tctgtgcgga cgcgccctgg ttgcgctggt tcttgcctgc ggcctgtcgc 301 ggatctgggg agaggagaga ggcttcccgc ccgacagggc cactccgctt ttgcaaaccg 361 cagagataat gacgccaccc actaagacct tatggcccaa gggttccaac gccagtctgg 421 cgcggtcgtt ggcacctgcg gaggtgccta aaggagacag gacggcagga tctccgccac 481 gcaccatctc ccctcccccg tgccaaggac ccatcgagat caaggagact ttcaaataca 541 tcaacacggt tgtgtcctgc cttgtgttcg tgctggggat catcgggaac tccacacttc 601 tgagaattat ctacaagaac aagtgcatgc gaaacggtcc caatatcttg atcgccagct 661 tggctctggg agacctgctg cacatcgtca ttgacatccc tatcaatgtc tacaagctgc 721 tggcagagga ctggccattt ggagctgaga tgtgtaagct ggtgcctttc atacagaaag 781 cctccgtggg aatcactgtg ctgagtctat gtgctctgag tattgacaga tatcgagctg 841 ttgcttcttg gagtagaatt aaaggaattg gggttccaaa atggacagca gtagaaattg 901 ttttgatttg ggtggtctct gtggttctgg ctgtccctga agccataggt tttgatataa 961 ttacgatgga ctacaaagga agttatctgc gaatctgctt gcttcatccc gttcagaaga 1021 cagctttcat gcagttttac aagacagcaa aagattggtg gctgttcagt ttctatttct 1081 gcttgccatt ggccatcact gcattttttt atacactaat gacctgtgaa atgttgagaa 1141 agaaaagtgg catgcagatt gctttaaatg atcacctaaa gcagagacgg gaagtggcca 1201 aaaccgtctt ttgcctggtc cttgtctttg ccctctgctg gcttcccctt cacctcagca 1261 ggattctgaa gctcactctt tataatcaga atgatcccaa tagatgtgaa cttttgagct 1321 ttctgttggt attggactat attggtatca acatggcttc actgaattcc tgcattaacc 1381 caattgctct gtatttggtg agcaaaagat tcaaaaactg ctttaagtca tgcttatgct 1441 gctggtgcca gtcatttgaa gaaaaacagt ccttggagga aaagcagtcg tgcttaaagt 1501 tcaaagctaa tgatcacgga tatgacaact tccgttccag taataaatac agctcatctt 1561 gaaagaagaa ctattcactg tatttcattt tctttatatt ggaccgaagt cattaaaaca 1621 aaatgaaaca tttgccaaaa caaaacaaaa aactatgtat ttgcacagca cactattaaa 1681 atattaagtg taattatttt aaaaaaaaaa aaaaaaaaa // LOCUS HUMEGFAA 990 bp mRNA PRI 28-FEB-1991 DEFINITION Human heparin-binding vascular endothelial growth factor (VEGF) mRNA, complete cds. ACCESSION M32977 NID g181970 KEYWORDS angiogenic mitogen; vascular endothelial growth factor. SOURCE Human promyelocytic leukemia cell line HL60, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 990) AUTHORS Leung,D.W., Cachianes,G., Kuang,W.-J., Goeddel,D.V. and Ferrara,N. TITLE Vascular endothelial growth factor is a secreted angiogenic mitogen JOURNAL Science 246, 1306-1309 (1989) MEDLINE 90069608 FEATURES Location/Qualifiers source 1..990 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..990 /note="VEGF mRNA" sig_peptide 57..134 /gene="VEGF" gene 57..632 /gene="VEGF" CDS 57..632 /gene="VEGF" /codon_start=1 /product="vascular endothelial growth factor" /db_xref="PID:g181971" /translation="MNFLLSWVHWSLALLLYLHHAKWSQAAPMAEGGGQNHHEVVKFM DVYQRSYCHPIETLVDIFQEYPDEIEYIFKPSCVPLMRCGGCCNDEGLECVPTEESNI TMQIMRIKPHQGQHIGEMSFLQHNKCECRPKKDRARQENPCGPCSERRKHLFVQDPQT CKCSCKNTDSRCKARQLELNERTCRCDKPRR" mat_peptide 135..629 /gene="VEGF" /product="vascular endothelial growth factor" BASE COUNT 255 a 269 c 276 g 190 t ORIGIN 1 cagtgtgctg gcggcccggc gcgagccggc ccggccccgg tcgggcctcc gaaaccatga 61 actttctgct gtcttgggtg cattggagcc tcgccttgct gctctacctc caccatgcca 121 agtggtccca ggctgcaccc atggcagaag gaggagggca gaatcatcac gaagtggtga 181 agttcatgga tgtctatcag cgcagctact gccatccaat cgagaccctg gtggacatct 241 tccaggagta ccctgatgag atcgagtaca tcttcaagcc atcctgtgtg cccctgatgc 301 gatgcggggg ctgctgcaat gacgagggcc tggagtgtgt gcccactgag gagtccaaca 361 tcaccatgca gattatgcgg atcaaacctc accaaggcca gcacatagga gagatgagct 421 tcctacagca caacaaatgt gaatgcagac caaagaaaga tagagcaaga caagaaaatc 481 cctgtgggcc ttgctcagag cggagaaagc atttgtttgt acaagatccg cagacgtgta 541 aatgttcctg caaaaacaca gactcgcgtt gcaaggcgag gcagcttgag ttaaacgaac 601 gtacttgcag atgtgacaag ccgaggcggt gagccgggca ggaggaagga gcctccctca 661 gggtttcggg aaccagatct ctcaccagga aagactgata cagaacgatc gatacagaaa 721 ccacgctgcc gccaccacac catcaccatc gacagaacag tccttaatcc agaaacctga 781 aatgaaggaa gaggagactc tgcgcagagc actttgggtc cggagggcga gactccggcg 841 gaagcattcc cgggcgggtg acccagcacg gtccctcttg gaattggatt cgccatttta 901 tttttcttgc tgctaaatca ccgagcccgg aagattagag agttttattt ctgggattcc 961 tgtagacaca ccgcggccgc cagcacactg // LOCUS HUMEGFGRBA 1109 bp mRNA PRI 31-DEC-1994 DEFINITION Homo sapiens epidermal growth factor receptor-binding protein GRB2 (EGFRBP-GRB2) mRNA sequence. ACCESSION M96995 NID g181975 KEYWORDS epidermal growth factor receptor-binding protein GRB2. SOURCE Homo sapiens (tissue library: gt11 human brainstem library) brainstem cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1109) AUTHORS Lowenstein,E.J., Daly,R.J., Batzer,A.G., Li,W., Margolis,B., Lammers,R., Ullrich,A., Skolnik,E.Y., Bar-Sagi,D. and Schlessinger,J. TITLE The SH2 and SH3 domain-containing protein GRB2 links receptor tyrosine kinases to ras signaling JOURNAL Cell 70 (3), 431-442 (1992) MEDLINE 92354060 FEATURES Location/Qualifiers source 1..1109 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brainstem" /tissue_lib="gt11 human brainstem library" gene 79..732 /gene="EGFRBP-GRB2" CDS 79..732 /gene="EGFRBP-GRB2" /codon_start=1 /product="epidermal growth factor receptor-binding protein GRB2" /db_xref="PID:g181976" /translation="MEAIAKYDFKATADDELSFKRGDILKVLNEECDQNWYKAELNGK DGFIPKNYIEMKPHPWFFGKIPRAKAEEMLSKQRHDGAFLIRESESAPGDFSLSVKFG NDVQHFKVLRDGAGKYFLWVVKFNSLNELVDYHRSTSVSRNQQIFLRDIEQVPQQPTY VQALFDFDPQEDGELGFRRGDFIHVMDNSDPNWWKGACHGQTGMFPRNYVTPVNRNV" BASE COUNT 313 a 273 c 262 g 261 t ORIGIN 1 gccagtgaat tcgggggctc agccctcctc cctcccttcc ccctgcttca ggctgctgag 61 cactgagcag cgctcagaat ggaagccatc gccaaatatg acttcaaagc tactgcagac 121 gacgagctga gcttcaaaag gggggacatc ctcaaggttt tgaacgaaga atgtgatcag 181 aactggtaca aggcagagct taatggaaaa gacggcttca ttcccaagaa ctacatagaa 241 atgaaaccac atccgtggtt ttttggcaaa atccccagag ccaaggcaga agaaatgctt 301 agcaaacagc ggcacgatgg ggcctttctt atccgagaga gtgagagcgc tcctggggac 361 ttctccctct ctgtcaagtt tggaaacgat gtgcagcact tcaaggtgct ccgagatgga 421 gccgggaagt acttcctctg ggtggtgaag ttcaattctt tgaatgagct ggtggattat 481 cacagatcta catctgtctc cagaaaccag cagatattcc tgcgggacat agaacaggtg 541 ccacagcagc cgacatacgt ccaggccctc tttgactttg atccccagga ggatggagag 601 ctgggcttcc gccggggaga ttttatccat gtcatggata actcagaccc caactggtgg 661 aaaggagctt gccacgggca gaccggcatg tttccccgca attatgtcac ccccgtgaac 721 cggaacgtct aagagtcaag aagcaattat ttaaagaaag tgaaaaatgt aaaacacata 781 caaaagaatt aaacccacaa gctgcctctg acagcagcct gtgagggagt gcagaacacc 841 tggccgggtc accctgtgac cctctcactt tggttggaac tttagggggt gggagggggc 901 gttggattta aaaatgccaa aacttaccta taaattaaga agagttttta ttacaaattt 961 tcactgctgc tcctctttcc cctcctttgt cttttttttc atcctttttt ctcttctgtc 1021 catcagtgca tgacgtttaa ggccacgtat agtcctagct gacgccaata ataaaaaaca 1081 agaaaccaaa aaaaaaaaac ccgaattca // LOCUS HUMEGFRBB3 4879 bp mRNA PRI 15-MAR-1990 DEFINITION Human epidermal growth factor receptor (ERBB3) mRNA, complete cds. ACCESSION M29366 NID g181979 KEYWORDS epidermal growth factor receptor. SOURCE Human placenta, cDNA to mRNA, (library of Clontech). ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4879) AUTHORS Kraus,M.H., Issing,W., Miki,T., Popescu,N.C. and Aaronson,S.A. TITLE Isolation and characterization of ERBB3, a third member of the ERBB/epidermal growth factor receptor family: Evidence for overexpression in a subset of human mammary tumors JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9193-9197 (1989) MEDLINE 90083234 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.H.Kraus 31-OCT-1989. FEATURES Location/Qualifiers source 1..4879 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..4879 /note="ERBB3 mRNA" sig_peptide 100..156 /note="epidermal growth factor signal peptide" CDS 100..4128 /note="epidermal growth factor receptor precursor" /codon_start=1 /db_xref="PID:g181980" /translation="MRANDALQVLGLLFSLARGSEVGNSQAVCPGTLNGLSVTGDAEN QYQTLYKLYERCEVVMGNLEIVLTGHNADLSFLQWIREVTGYVLVAMNEFSTLPLPNL RVVRGTQVYDGKFAIFVMLNYNTNSSHALRQLRLTQLTEILSGGVYIEKNDKLCHMDT IDWRDIVRDRDAEIVVKDNGRSCPPCHEVCKGRCWGPGSEDCQTLTKTICAPQCNGHC FGPNPNQCCHDECAGGCSGPQDTDCFACRHFNDSGACVPRCPQPLVYNKLTFQLEPNP HTKYQYGGVCVASCPHNFVVDQTSCVRACPPDKMEVDKNGLKMCEPCGGLCPKACEGT GSGSRFQTVDSSNIDGFVNCTKILGNLDFLITGLNGDPWHKIPALDPEKLNVFRTVRE ITGYLNIQSWPPHMHNFSVFSNLTTIGGRSLYNRGFSLLIMKNLNVTSLGFRSLKEIS AGRIYISANRQLCYHHSLNWTKVLRGPTEERLDIKHNRPRRDCVAEGKVCDPLCSSGG CWGPGPGQCLSCRNYSRGGVCVTHCNFLNGEPREFAHEAECFSCHPECQPMEGTATCN GSGSDTCAQCAHFRDGPHCVSSCPHGVLGAKGPIYKYPDVQNECRPCHENCTQGCKGP ELQDCLGQTLVLIGKTHLTMALTVIAGLVVIFMMLGGTFLYWRGRRIQNKRAMRRYLE RGESIEPLDPSEKANKVLARIFKETELRKLKVLGSGVFGTVHKGVWIPEGESIKIPVC IKVIEDKSGRQSFQAVTDHMLAIGSLDHAHIVRLLGLCPGSSLQLVTQYLPLGSLLDH VRQHRGALGPQLLLNWGVQIAKGMYYLEEHGMVHRNLAARNVLLKSPSQVQVADFGVA DLLPPDDKQLLYSEAKTPIKWMALESIHFGKYTHQSDVWSYGVTVWELMTFGAEPYAG LRLAEVPDLLEKGERLAQPQICTIDVYMVMVKCWMIDENIRPTFKELANEFTRMARDP PRYLVIKRESGPGIAPGPEPHGLTNKKLEEVELEPELDLDLDLEAEEDNLATTTLGSA LSLPVGTLNRPRGSQSLLSPSSGYMPMNQGNLGESCQESAVSGSSERCPRPVSLHPMP RGCLASESSEGHVTGSEAELQEKVSMCRSRSRSRSPRPRGDSAYHSQRHSLLTPVTPL SPPGLEEEDVNGYVMPDTHLKGTPSSREGTLSSVGLSSVLGTEEEDEDEEYEYMNRRR RHSPPHPPRPSSLEELGYEYMDVGSDLSASLGSTQSCPLHPVPIMPTAGTTPDEDYEY MNRQRDGGGPGGDYAAMGACPASEQGYEEMRAFQGPGHQAPHVHYARLKTLRSLEATD SAFDNPDYWHSRLFPKANAQRT" mat_peptide 157..4825 /note="epidermal growth factor receptor" BASE COUNT 1182 a 1258 c 1308 g 1131 t ORIGIN 376 bp upstream of EcoRI site. 1 accaattcgc cagcggttca ggtggctctt gcctcgatgt cctagcctag gggcccccgg 61 gccggacttg gctgggctcc cttcaccctc tgcggagtca tgagggcgaa cgacgctctg 121 caggtgctgg gcttgctttt cagcctggcc cggggctccg aggtgggcaa ctctcaggca 181 gtgtgtcctg ggactctgaa tggcctgagt gtgaccggcg atgctgagaa ccaataccag 241 acactgtaca agctctacga gaggtgtgag gtggtgatgg ggaaccttga gattgtgctc 301 acgggacaca atgccgacct ctccttcctg cagtggattc gagaagtgac aggctatgtc 361 ctcgtggcca tgaatgaatt ctctactcta ccattgccca acctccgcgt ggtgcgaggg 421 acccaggtct acgatgggaa gtttgccatc ttcgtcatgt tgaactataa caccaactcc 481 agccacgctc tgcgccagct ccgcttgact cagctcaccg agattctgtc agggggtgtt 541 tatattgaga agaacgataa gctttgtcac atggacacaa ttgactggag ggacatcgtg 601 agggaccgag atgctgagat agtggtgaag gacaatggca gaagctgtcc cccctgtcat 661 gaggtttgca aggggcgatg ctggggtcct ggatcagaag actgccagac attgaccaag 721 accatctgtg ctcctcagtg taatggtcac tgctttgggc ccaaccccaa ccagtgctgc 781 catgatgagt gtgccggggg ctgctcaggc cctcaggaca cagactgctt tgcctgccgg 841 cacttcaatg acagtggagc ctgtgtacct cgctgtccac agcctcttgt ctacaacaag 901 ctaactttcc agctggaacc caatccccac accaagtatc agtatggagg agtttgtgta 961 gccagctgtc cccataactt tgtggtggat caaacatcct gtgtcagggc ctgtcctcct 1021 gacaagatgg aagtagataa aaatgggctc aagatgtgtg agccttgtgg gggactatgt 1081 cccaaagcct gtgagggaac aggctctggg agccgcttcc agactgtgga ctcgagcaac 1141 attgatggat ttgtgaactg caccaagatc ctgggcaacc tggactttct gatcaccggc 1201 ctcaatggag acccctggca caagatccct gccctggacc cagagaagct caatgtcttc 1261 cggacagtac gggagatcac aggttacctg aacatccagt cctggccgcc ccacatgcac 1321 aacttcagtg ttttttccaa tttgacaacc attggaggca gaagcctcta caaccggggc 1381 ttctcattgt tgatcatgaa gaacttgaat gtcacatctc tgggcttccg atccctgaag 1441 gaaattagtg ctgggcgtat ctatataagt gccaataggc agctctgcta ccaccactct 1501 ttgaactgga ccaaggtgct tcgggggcct acggaagagc gactagacat caagcataat 1561 cggccgcgca gagactgcgt ggcagagggc aaagtgtgtg acccactgtg ctcctctggg 1621 ggatgctggg gcccaggccc tggtcagtgc ttgtcctgtc gaaattatag ccgaggaggt 1681 gtctgtgtga cccactgcaa ctttctgaat ggggagcctc gagaatttgc ccatgaggcc 1741 gaatgcttct cctgccaccc ggaatgccaa cccatggagg gcactgccac atgcaatggc 1801 tcgggctctg atacttgtgc tcaatgtgcc cattttcgag atgggcccca ctgtgtgagc 1861 agctgccccc atggagtcct aggtgccaag ggcccaatct acaagtaccc agatgttcag 1921 aatgaatgtc ggccctgcca tgagaactgc acccaggggt gtaaaggacc agagcttcaa 1981 gactgtttag gacaaacact ggtgctgatc ggcaaaaccc atctgacaat ggctttgaca 2041 gtgatagcag gattggtagt gattttcatg atgctgggcg gcacttttct ctactggcgt 2101 gggcgccgga ttcagaataa aagggctatg aggcgatact tggaacgggg tgagagcata 2161 gagcctctgg accccagtga gaaggctaac aaagtcttgg ccagaatctt caaagagaca 2221 gagctaagga agcttaaagt gcttggctcg ggtgtctttg gaactgtgca caaaggagtg 2281 tggatccctg agggtgaatc aatcaagatt ccagtctgca ttaaagtcat tgaggacaag 2341 agtggacggc agagttttca agctgtgaca gatcatatgc tggccattgg cagcctggac 2401 catgcccaca ttgtaaggct gctgggacta tgcccagggt catctctgca gcttgtcact 2461 caatatttgc ctctgggttc tctgctggat catgtgagac aacaccgggg ggcactgggg 2521 ccacagctgc tgctcaactg gggagtacaa attgccaagg gaatgtacta ccttgaggaa 2581 catggtatgg tgcatagaaa cctggctgcc cgaaacgtgc tactcaagtc acccagtcag 2641 gttcaggtgg cagattttgg tgtggctgac ctgctgcctc ctgatgataa gcagctgcta 2701 tacagtgagg ccaagactcc aattaagtgg atggcccttg agagtatcca ctttgggaaa 2761 tacacacacc agagtgatgt ctggagctat ggtgtgacag tttgggagtt gatgaccttc 2821 ggggcagagc cctatgcagg gctacgattg gctgaagtac cagacctgct agagaagggg 2881 gagcggttgg cacagcccca gatctgcaca attgatgtct acatggtgat ggtcaagtgt 2941 tggatgattg atgagaacat tcgcccaacc tttaaagaac tagccaatga gttcaccagg 3001 atggcccgag acccaccacg gtatctggtc ataaagagag agagtgggcc tggaatagcc 3061 cctgggccag agccccatgg tctgacaaac aagaagctag aggaagtaga gctggagcca 3121 gaactagacc tagacctaga cttggaagca gaggaggaca acctggcaac caccacactg 3181 ggctccgccc tcagcctacc agttggaaca cttaatcggc cacgtgggag ccagagcctt 3241 ttaagtccat catctggata catgcccatg aaccagggta atcttgggga gtcttgccag 3301 gagtctgcag tttctgggag cagtgaacgg tgcccccgtc cagtctctct acacccaatg 3361 ccacggggat gcctggcatc agagtcatca gaggggcatg taacaggctc tgaggctgag 3421 ctccaggaga aagtgtcaat gtgtagaagc cggagcagga gccggagccc acggccacgc 3481 ggagatagcg cctaccattc ccagcgccac agtctgctga ctcctgttac cccactctcc 3541 ccacccgggt tagaggaaga ggatgtcaac ggttatgtca tgccagatac acacctcaaa 3601 ggtactccct cctcccggga aggcaccctt tcttcagtgg gtcttagttc tgtcctgggt 3661 actgaagaag aagatgaaga tgaggagtat gaatacatga accggaggag aaggcacagt 3721 ccacctcatc cccctaggcc aagttccctt gaggagctgg gttatgagta catggatgtg 3781 gggtcagacc tcagtgcctc tctgggcagc acacagagtt gcccactcca ccctgtaccc 3841 atcatgccca ctgcaggcac aactccagat gaagactatg aatatatgaa tcggcaacga 3901 gatggaggtg gtcctggggg tgattatgca gccatggggg cctgcccagc atctgagcaa 3961 gggtatgaag agatgagagc ttttcagggg cctggacatc aggcccccca tgtccattat 4021 gcccgcctaa aaactctacg tagcttagag gctacagact ctgcctttga taaccctgat 4081 tactggcata gcaggctttt ccccaaggct aatgcccaga gaacgtaact cctgctccct 4141 gtggcactca gggagcattt aatggcagct agtgccttta gagggtaccg tcttctccct 4201 attccctctc tctcccaggt cccagcccct tttccccagt cccagacaat tccattcaat 4261 ctttggaggc ttttaaacat tttgacacaa aattcttatg gtatgtagcc agctgtgcac 4321 tttcttctct ttcccaaccc caggaaaggt tttccttatt ttgtgtgctt tcccagtccc 4381 attcctcagc ttcttcacag gcactcctgg agatatgaag gattactctc catatccctt 4441 cctctcaggc tcttgactac ttggaactag gctcttatgt gtgcctttgt ttcccatcag 4501 actgtcaaga agaggaaagg gaggaaacct agcagaggaa agtgtaattt tggtttatga 4561 ctcttaaccc cctagaaaga cagaagctta aaatctgtga agaaagaggt taggagtaga 4621 tattgattac tatcataatt cagcacttaa ctatgagcca ggcatcatac taaacttcac 4681 ctacattatc tcacttagtc ctttatcatc cttaaaacaa ttctgtgaca tacatattat 4741 ctcattttac acaaagggaa gtcgggcatg gtggctcatg cctgtaatct cagcactttg 4801 ggaggctgag gcagaaggat tacctgaggc aaggagtttg agaccagctt agccaacata 4861 gtaagacccc catctcttt // LOCUS HUMEHS2A 2299 bp DNA PRI 31-DEC-1994 DEFINITION Homo sapiens endogenous HIV-1 related sequence (EHS-2) gene, partial cds. ACCESSION M86246 NID g181990 KEYWORDS HIV-1 related sequence. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2299) AUTHORS Horwitz,M.S., Boyce-Jacino,M.T. and Faras,A.J. TITLE Novel human endogenous sequences related to human immunodeficiency virus type 1 JOURNAL J. Virol. 66 (4), 2170-2179 (1992) MEDLINE 92194452 FEATURES Location/Qualifiers source 1..2299 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1179..1509 /gene="EHS-2" CDS 1179..1509 /partial /gene="EHS-2" /note="ends of coding region undetermined." /codon_start=1 /db_xref="PID:g553273" /translation="NLIDCSQGISQDWVLIRDSSRTGSAFKLPQVAGRIHFVVAAGFV AACFCTSRNGWRERQRQRGREIQQESCYNLMQSLRLISSLCNCELCCIKHAHAGVFFY TVTCIPLG" CDS 1229..1391 /partial /gene="EHS-2" /codon_start=1 /db_xref="PID:g553274" /translation="RLISNRICFQAPSGCWQDSFCCSCRIRGCLLLHIQKWVERETET ERERDPAREL" CDS 1336..1419 /gene="EHS-2" /note="endogenous HIV-1 related sequence" /codon_start=1 /db_xref="PID:g181993" /translation="MGGERDRDREGERSSKRVAIILCNHLG" BASE COUNT 698 a 404 c 537 g 660 t ORIGIN 1 aagcttaaac taaacctttt ggaaaagaaa aacaaaaccc acagcagctg tccagggctg 61 cgggagttag aaaatgcaat tcatgtgact tggtggaatg ccagtgaagc taccattaga 121 ttattctgtt tactgttaag gtacgtggag acagacaggc tctggagatg tcgacctgca 181 tttggatgac aggtattgac cacctccttg atgccaggca tggtgctatg cagaggcata 241 taacagcaca cacaggacac agaccatccc tcaaggtttt tagtctcgtg ggagtcatca 301 acacaagatt taaataatgt gtcatgtgtg ctagggagag gagtttgggg ttctttggga 361 ggatatacct tgagaagggc caggtaggga aagtccatag agaagggttt gtttcctctg 421 agatactagg ccaagaaaga gttagggaag tgaagacaag aggggccagt tttgtcaaca 481 tcagctataa tactcaaaag cagctaaaat aatgatatat ctgcaaaact ggaagcagtt 541 cagtaggttg aagaaaagat agtaaagtga gggctggcat gggagggcga gagaggaggc 601 aggttggaag actagctaag cattttgggc tttgtaacaa gggcattgac acatgttaaa 661 ctcggaaaaa ctgatcgttt cattgtacga acagcctagc ttataaagtt atttaattag 721 aaaattttca gaatactttt gattcggtgg taatgtgtca acaaccctaa aatattatgt 781 gtttttctgt aaagtatgtg gatgtgaagg gaagcaacat gggggcatga ttattaagag 841 tacagcttct ggaatcagag atcatgtaag ttggaattct accttatcca attacttaat 901 gtgtgagttt aggaaaatat ctattaactt accaccaatt acttaatgtg tgagtttagg 961 aaaatctatt aacttaccac aatctatagt acctcatctg aaaaatggga ataattaatt 1021 aatactgagc tcatagagct tttggaagaa tcaggttagg tttttacgcc tgcatagcac 1081 attaccacct gcagaagctt aacacaacac tcttttgtta tctaacagtt tctgtgtatc 1141 agaaatttga gcatggctta gctgggttct ctgcttaaaa tctaatagac tgcagtcagg 1201 gtatcagcca ggactgggtt ctcattagag actcatctcg aacaggatct gctttcaagc 1261 tccctcaggt tgctggcagg attcattttg ttgtagctgc aggattcgtg gctgcttgct 1321 tctgcacatc cagaaatggg tggagagaga gacagagaca gagagggaga gagatccagc 1381 aagagagttg ctataatctt atgcaatcac ttaggttgat ttcctctctt tgcaattgtg 1441 aattgtgctg cattaagcat gcacatgcag gtgtgttttt ttatacagtg acttgtattc 1501 ctttgggtag ataaccagta gtgggattaa ctggatcaat ggtagatcta tttttagctc 1561 tttgagaaat ctgcatactg ttgcatactg tttgccatag aggttgtgct aatttacatt 1621 cccaccagca gcatataaga gtccccattt caccacatcc atgctaacat ctattgttgt 1681 ttcacttttt aacagtggtc attctggctg ggtaaggtgg tagctcatta tggttttaat 1741 ttgcatttcc ctgatgatta gtgataagca tttttttcat gtttcttggc catttgttgg 1801 ctattctcat cttctgagaa atgcctgttc aagtcagaga ctcagaagga gggaggcaag 1861 gaggagggta aaggatggaa aattacctat cgggtacaag gtatgcaatt cagttgacag 1921 gtacactcaa agccaagcct tccccactat acaattcatc catgtaagca aaagccacct 1981 gtgcccccaa agctactgaa attaaaataa accttatata atcacataca tgtaatcaca 2041 cttatcttat cacctgtgtc atattccatt ggttagaaac aagttgtggg tcctgctcac 2101 attcaaggag ggattagtaa aggcgtgaat gttaggatgt ggggatcatg gtgagaagaa 2161 tcctcaagga tgcacaccac aagaatttgg taaaattgta cacataagga cagaatagaa 2221 gcatttagcc ttgtacgacc tggcttacca gggcgcgaaa gagagatgga gcatatacaa 2281 atagagcaca tagagactt // LOCUS HUMEIF2A 1393 bp mRNA PRI 07-NOV-1994 DEFINITION Human translational initiation factor (eIF-2), alpha subunit mRNA, complete cds. ACCESSION J02645 NID g181994 KEYWORDS translational initiation factor. SOURCE Human fibroblast, cDNA to mRNA (library of B.Wold), clone pHh2a-1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1393) AUTHORS Ernst,H., Duncan,R.F. and Hershey,J.W. TITLE Cloning and sequencing of complementary DNAs encoding the alpha-subunit of translational initiation factor eIF-2. Characterization of the protein and its messenger RNA JOURNAL J. Biol. Chem. 262 (3), 1206-1212 (1987) MEDLINE 87109235 COMMENT Draft entry and clean copy sequence for [1] kindly provided by J.W.B.Hershey, 08-DEC-1986. FEATURES Location/Qualifiers source 1..1393 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Unassigned" mRNA <1..>1393 /note="eIF2 mRNA" gene 28..975 /gene="EIF2" CDS 28..975 /gene="EIF2" /note="translational initiation factor eIF-2, alpha subunit" /codon_start=1 /db_xref="GDB:G00-126-359" /db_xref="PID:g181995" /translation="MPGLSCRFYQHKFPEVEDVVMVNVRSIAEMGAYVSLLEYNNIEG MILLSELSRRRIRSINKLIRIGRNECVVVIRVDKEKGYIDLSKRRVSPEEAIKCEDKF TKSKTVYSILRHVAEVLEYTKDEQLESLFQRTAWVFDDKYKRPGYGAYDAFKHAVSDP SILDSLDLNEDEREVLINNINRRLTPQAVKIRADIEVACYGYEGIDAVKEALRAGLNC STENMPIKINLIAPPRYVMTTTTLERTEGLSVLSQAMAVIKEKIEEKRGVFNVQMEPK VVTDTDETELARQMERLERENAEVDGDDDAEEMEAKAED" BASE COUNT 449 a 244 c 317 g 383 t ORIGIN Unreported. 1 gtgcgggaat cacacacata cctcagaatg ccgggtctaa gttgtagatt ttatcaacac 61 aaatttcctg aggtggaaga tgtagtgatg gtgaatgtca gatccattgc tgaaatgggg 121 gcttatgtca gcttgctgga atacaacaac attgaaggca tgattcttct tagtgaatta 181 tccagaaggc gtatccgttc tatcaacaaa ctcatccgaa ttggcaggaa tgagtgtgtg 241 gttgtcatta gggtggacaa agaaaaagga tatattgatt tgtcaaaaag aagagtttct 301 ccagaggaag caatcaaatg tgaagacaaa ttcacaaaat ccaaaactgt ttatagcatt 361 cttcgtcatg ttgctgaggt gttagaatac accaaggatg agcagctgga aagcctattc 421 cagaggactg cctgggtctt tgatgacaag tacaagagac ctggatatgg tgcctatgat 481 gcatttaagc atgcagtctc agacccatct attttggata gtttagattt gaatgaagat 541 gaacgggaag tactcattaa taatattaat aggcgcttga ccccacaggc tgtcaaaatt 601 cgagcagata ttgaagtggc ttgttatggt tatgaaggca ttgatgctgt aaaagaagcc 661 ctaagagcag gtttgaattg ttctacagaa aacatgccca ttaagattaa tctaatagct 721 cctcctcggt atgtaatgac tacgacaacc ctggagagaa cagaaggcct ttctgtcctc 781 agtcaagcta tggctgttat caaagagaag attgaggaaa agaggggtgt gttcaatgtt 841 caaatggagc ccaaagtggt cacagataca gatgagactg aacttgcgag gcagatggag 901 aggcttgaaa gagaaaatgc cgaagtggat ggagatgatg atgcagaaga aatggaagcc 961 aaagctgaag attaactttg tgggaaacag agtccaattt aaggaacaca gagcagcgct 1021 tcctggctgt aaatcctaga cttgaaagtt ttccagtatt gaaaacttca aagctgaata 1081 ttttttattt ctaagtattt aaatgttcta acagatcaga acatgaaatg ccctcctaaa 1141 tgtcagctgt tgtcacacag tagctccaac actttgagca tttttaaggg agtggcctca 1201 tttcactaga gacaaatctt taagaatagt tctaaaattg ggcttgtgat ttccatttct 1261 gatgtctcca gattggcacc cctttctagt tcaatgcctc acgagatttg ccaggggcat 1321 ccaaggcaaa caatcccaat ctttctatat aaaatgtatt caagcaaaca tcaaataaat 1381 ttctgggata ttt // LOCUS HUMEIF4C 1202 bp mRNA PRI 18-JUL-1994 DEFINITION Human protein synthesis factor (eIF-4C) mRNA, complete cds. ACCESSION L18960 NID g306724 KEYWORDS protein synthesis factor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1202) AUTHORS Dever,T.E., Wei,C.L., Benkowski,L.A., Browning,K., Merrick,W.C. and Hershey,J.W. TITLE Determination of the amino acid sequence of rabbit, human, and wheat germ protein synthesis factor eIF-4C by cloning and chemical sequencing JOURNAL J. Biol. Chem. 269 (5), 3212-3218 (1994) MEDLINE 94148809 FEATURES Location/Qualifiers source 1..1202 /organism="Homo sapiens" /note="subject has leukemia" /db_xref="taxon:9606" 5'UTR 1..207 CDS 208..642 /codon_start=1 /product="protein synthesis factor" /db_xref="PID:g306725" /translation="MPKNKGKGGKNRRRGKNENESEKRELVFKEDGQEYAQVIKMLGN GRLEAMCFDGVKRLCHIRGKLRKKVWINTSDIILVGLRDYQDNKADVILKYNADEARS LKAYGELPEHAKINETDTFGPGDDDEIQFDDIGDDDEDIDDI" 3'UTR 643..1202 BASE COUNT 366 a 221 c 278 g 337 t ORIGIN 1 ggcacgaggc gccatttgct gccgccgagc gtggacgcag gcggatctct gaagagctgg 61 gtcgccagcc tctcccgcgc acgttgcctg gcctccagca cctacttggt cccgcgcgct 121 ccctcgtgtc gcccctcgga gcagcagccg ccgcggtcgc cgctacccgg aaagaagtca 181 gagacgccgc gagtcgccgc caccgccatg cccaagaata aaggtaaagg aggtaaaaac 241 agacgcaggg gtaagaatga gaatgaatct gaaaaaagag aactggtatt caaagaggat 301 gggcaggagt atgctcaggt aatcaaaatg ttgggaaatg gacggctaga agcaatgtgt 361 ttcgatggtg taaagaggtt atgtcacatc agaggaaaat tgagaaaaaa ggtttggata 421 aatacctcgg acattatttt ggttggtctc cgagactacc aggataacaa agctgatgta 481 attttaaaat acaatgcaga cgaagctaga agtctgaagg catacggcga gcttccagag 541 catgctaaaa tcaatgaaac tgatacattt ggtcctggag atgatgatga aattcagttt 601 gatgacattg gagatgatga tgaagatatt gatgacatct aaattgaact caacatttta 661 cattccatct tttctgaaga ttgtcctaca atttggattt tgatcatgac aaagaagatt 721 aaaatttcat tagcatgaat gcaatttgtt aaagcagact gatttgtttc taagatattt 781 ttggtttttt taaaactgat aataatgctg aattatctta agtgagatgt taagcccact 841 ttgttctttt aatgtaatgg agcttatggg tagaagacca tgtctactaa ttacaaaaaa 901 aaaaaaaaac catgattgct gcttttccta ccacttccag taagaaaatg ggtgttttga 961 agaaatcatt tgccttgtct cacggaatct gattaagccc tggcctcttg atgtatagag 1021 tcatggatat tccagttacc tagatattcc cttgagattt tgatacaatt tgagggaggc 1081 agaagtctgc agttgaagaa aaaaaataag tctgtttgtc atatttaagt agcctgtgcg 1141 tatttttata ctgattttga tatcatgttc ttttcatagt cgtattttgc caccgtaaac 1201 at // LOCUS HUMEL4REC 999 bp DNA PRI 04-AUG-1993 DEFINITION Human melanocortin 4 receptor gene, complete cds. ACCESSION L08603 NID g291977 KEYWORDS melanocortin 4 receptor. SOURCE Homo sapiens (library: lambda EMBL3) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 999) AUTHORS Gantz,I., Miwa,H., Konda,Y., Shimoto,Y., Tashiro,T., Waston,S.J. and DelValle,J. TITLE Molecular cloning, expression, and gene localization of a fourth melanocortin receptor JOURNAL J. Biol. Chem. 268, 15174-15179 (1993) MEDLINE 93315499 FEATURES Location/Qualifiers source 1..999 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_lib="lambda EMBL3" CDS 1..999 /codon_start=1 /product="melanocortin 4 receptor" /db_xref="PID:g291978" /translation="MVNSTHRGMHTSLHLWNRSSYRLHSNASESLGKGYSDGGCYEQL FVSPEVFVTLGVISLLENILVIVAIAKNKNLHSPMYFFICSLAVADMLVSVSNGSETI IITLLNSTDTDAQSFTVNIDNVIDSVICSSLLASICSLLSIAVDRYFTIFYALQYHNI MTVKRVGIIISCIWAACTVSGILFIIYSDSSAVIICLITMFFTMLALMASLYVHMFLM ARLHIKRIAVLPGTGAIRQGANMKGAITLTILIGVFVVCWAPFFLHLIFYISCPQNPY CVCFMSHFNLYLILIMCNSIIDPLIYALRSQELRKTFKEIICCYPLGGLCDLSSRY" BASE COUNT 229 a 243 c 213 g 314 t ORIGIN 1 atggtgaact ccacccaccg tgggatgcac acttctctgc acctctggaa ccgcagcagt 61 tacagactgc acagcaatgc cagtgagtcc cttggaaaag gctactctga tggagggtgc 121 tacgagcaac tttttgtctc tcctgaggtg tttgtgactc tgggtgtcat cagcttgttg 181 gagaatatct tagtgattgt ggcaatagcc aagaacaaga atctgcattc acccatgtac 241 tttttcatct gcagcttggc tgtggctgat atgctggtga gcgtttcaaa tggatcagaa 301 accattatca tcaccctatt aaacagtaca gatacggatg cacagagttt cacagtgaat 361 attgataatg tcattgactc ggtgatctgt agctccttgc ttgcatccat ttgcagcctg 421 ctttcaattg cagtggacag gtactttact atcttctatg ctctccagta ccataacatt 481 atgacagtta agcgggttgg gatcatcata agttgtatct gggcagcttg cacggtttca 541 ggcattttgt tcatcattta ctcagatagt agtgctgtca tcatctgcct catcaccatg 601 ttcttcacca tgctggctct catggcttct ctctatgtcc acatgttcct gatggccagg 661 cttcacatta agaggattgc tgtcctcccc ggcactggtg ccatccgcca aggtgccaat 721 atgaagggag cgattacctt gaccatcctg attggcgtct ttgttgtctg ctgggcccca 781 ttcttcctcc acttaatatt ctacatctct tgtcctcaga atccatattg tgtgtgcttc 841 atgtctcact ttaacttgta tctcatactg atcatgtgta attcaatcat cgatcctctg 901 atttatgcac tccggagtca agaactgagg aaaaccttca aagagatcat ctgttgctat 961 cccctgggag gcctttgtga cttgtctagc agatattaa // LOCUS HUMELA2 906 bp mRNA PRI 07-NOV-1994 DEFINITION Human elastase 2 mRNA, complete cds. ACCESSION M16631 NID g182022 KEYWORDS elastase. SOURCE Human pancreas, cDNA to mRNA (library of R.Weiss), clones hpe2-lambda-[4,10]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 906) AUTHORS Fletcher,T.S., Shen,W.F. and Largman,C. TITLE Primary structure of human pancreatic elastase 2 determined by sequence analysis of the cloned mRNA JOURNAL Biochemistry 26 (23), 7256-7261 (1987) MEDLINE 88107669 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by T.S.Fletcher, 05-AUG-1987. A poly-adenylation signal is located at positions 891-896. FEATURES Location/Qualifiers source 1..906 /organism="Homo sapiens" /db_xref="taxon:9606" /map="12" sig_peptide 22..69 /gene="ELA1" /note="elastase 2 signal peptide" CDS 22..831 /gene="ELA1" /note="elastase 2 precursor" /codon_start=1 /db_xref="GDB:G00-119-866" /db_xref="PID:g182023" /translation="MIRTLLLSTLVAGALSCGDPTYPPYVTRVVGGEEARPNSWPWQV SLQYSSNGKWYHTCGGSLIANSWVLTAAHCISSSRTYRVGLGRHNLYVAESGSLAVSV SKIVVHKDWNSNQISKGNDIALLKLANPVSLTDKIQLACLPPAGTILPNNYPCYVTGW GRLQTNGAVPDVLQQGRLLVVDYATCSSSAWWGSSVKTSMICAGGDGVISSCNGDSGG PLNCQASDGRWQVHGIVSFGSRLGCNYYHKPSVFTRVSNYIDWINSVIANN" gene 22..831 /gene="ELA1" mat_peptide 70..828 /gene="ELA1" /note="elastase 2" BASE COUNT 197 a 274 c 253 g 182 t ORIGIN 161 bp upstream of PstI site. 1 aaacagtccc agggacacac catgataagg acgctgctgc tgtccacttt ggtggctgga 61 gccctcagtt gtggggaccc cacttaccca ccttatgtga ctagggtggt tggcggtgaa 121 gaagcgaggc ccaacagctg gccctggcag gtctccctgc agtacagctc caatggcaag 181 tggtaccaca cctgcggagg gtccctgata gccaacagct gggtcctgac ggctgcccac 241 tgcatcagct cctccaggac ctaccgcgtg gggctgggcc ggcacaacct ctacgttgcg 301 gagtccggct cgctggcagt cagtgtctct aagattgtgg tgcacaagga ctggaactcc 361 aaccaaatct ccaaagggaa cgacattgcc ctgctcaaac tggctaaccc cgtctccctc 421 accgacaaga tccagctggc ctgcctccct cctgccggca ccattctacc caacaactac 481 ccctgctacg tcacgggctg gggaaggctg cagaccaacg gggctgttcc tgatgtcctg 541 cagcagggcc ggttgctggt tgtggactat gccacctgct ccagctctgc ctggtggggc 601 agcagcgtga aaaccagtat gatctgtgct gggggtgatg gcgtgatctc cagctgcaac 661 ggagactctg gcgggccact gaactgtcag gcgtctgacg gccggtggca ggtgcacggc 721 atcgtcagct tcgggtctcg cctcggctgc aactactacc acaagccctc cgtcttcacg 781 cgggtctcca attacatcga ctggatcaat tcggtgattg caaataacta accaaaagaa 841 gtccctggga ctgtttcaga cttggaaagg tcacagaagg aaaataatat aataaagtga 901 caactc // LOCUS HUMELA3A 896 bp mRNA PRI 31-DEC-1994 DEFINITION Human elastase III B mRNA, complete cds, clone pCL1E3. ACCESSION M18692 J03516 NID g607029 KEYWORDS elastase. SOURCE Homo sapiens pancreas cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 896) AUTHORS Tani,T., Ohsumi,J., Mita,K. and Takiguchi,Y. TITLE Identification of a novel class of elastase isozyme, human pancreatic elastase III, by cDNA and genomic gene cloning JOURNAL J. Biol. Chem. 263 (3), 1231-1239 (1988) MEDLINE 88087253 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by T.Tani, 11-DEC-1987. FEATURES Location/Qualifiers source 1..896 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="pancreas" mRNA 1..896 /gene="EL III" gene 1..896 /gene="EL III" CDS 19..831 /gene="EL III" /note="precursor" /codon_start=1 /product="elastase III B" /db_xref="PID:g182035" /translation="MMLRLLSSLLLVAVASGYGPPSSRPSSRVVNGEDAVPYSWPWQV SLQYEKSGSFYHTCGGSLIAPDWVVTAGHCISSSRTYQVVLGEYDRAVKEGPEQVIPI NSGDLFVHPLWNRSCVACGNDIALIKLSRSAQLGDAVQLASLPPAGDILPNETPCYIT GWGRLYTNGPLPDKLQEALLPVVDYEHCSRWNWWGSSVKKTMVCAGGDIRSGCNGDSG GPLNCPTEDGGWQVHGVTSFVSAFGCNTRRKPTVFTRVSAFIDWIEETIASH" sig_peptide 19..66 /gene="EL III" mat_peptide 103..828 /gene="EL III" /product="elastase III B" polyA_signal 874..879 /gene="EL III" polyA_site 896 /gene="EL III" BASE COUNT 169 a 291 c 248 g 187 t 1 others ORIGIN 280 bp upstream of RsaI site. 1 cctatcatcg caaaactcat gatgctccgg ctgctcagtt ccctcctcct tgtggccgtt 61 gcctcaggct atggcccacc ttcctctcgc ccttccagcc gcgttgtcaa tggtgaggat 121 gcggtcccct acagctggcc ctggcaggtt tccctgcagt atgagaaaag cggaagcttc 181 taccacacct gtggcggtag cctcatcgcc cccgactggg ttgtgactgc cggccactgc 241 atctcgagct cccggaccta ccaggtggtg ttgggcgagt acgaccgtgc tgtgaaggag 301 ggccccgagc aggtgatccc catcaactct ggggacctct ttgtgcatcc actctggaac 361 cgctcgtgtg tggcctgtgg caatgacatc gccctcatca agctctcacg cagcgcccag 421 ctgggagacg ccgtccagct cgcctcactc cctccggctg gtgacatcct tcccaacgag 481 acaccctgct acatcaccgg ctggggccgt ctctatacca acgggccact cccagacaag 541 ctgcaggagg ccctgctgcc ggtggtggac tatgaacact gctccaggtg gaactggtgg 601 ggttcctccg tgaagaagac catggtgtgt gctggagggg acatccgctc cggctgcaat 661 ggtgactctg gaggacccct caactgcccc acagaggatg gtggctggca ggtccatggc 721 gtgaccagct ttgtttctgc ctttggctgc aacacccgca ggaagcccac ggtgttcact 781 cgagtctccg ccttcattga ctggattgag gagaccatag caagccacta gaaccaaggc 841 ccagctggca gtgctgatcg atcccacatc ctgaataaag aadaaagatc tctcag // LOCUS HUMELAM1A 3834 bp mRNA PRI 07-NOV-1994 DEFINITION Human endothelial leukocyte adhesion molecule 1 (ELAM-1) mRNA, complete cds. ACCESSION M24736 NID g537523 KEYWORDS endothelial leukocyte adhesion molecule 1. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3834) AUTHORS Bevilacqua,M.P., Stengelin,S., Gimbrone,M.A. Jr. and Seed,B. TITLE Endothelial leukocyte adhesion molecule 1: an inducible receptor for neutrophils related to complement regulatory proteins and lectins JOURNAL Science 243 (4895), 1160-1165 (1989) MEDLINE 89162047 FEATURES Location/Qualifiers source 1..3834 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HEC" /map="1q22-q25" gene 117..3821 /gene="ELAM1" sig_peptide 117..179 /gene="ELAM1" /note="G00-120-612" CDS 117..1949 /gene="ELAM1" /codon_start=1 /db_xref="GDB:G00-120-612" /product="endothelial leukocyte adhesion molecule 1" /db_xref="PID:g537524" /translation="MIASQFLSALTLVLLIKESGAWSYNTSTEAMTYDEASAYCQQRY THLVAIQNKEEIEYLNSILSYSPSYYWIGIRKVNNVWVWVGTQKPLTEEAKNWAPGEP NNRQKDEDCVEIYIKREKDVGMWNDERCSKKKLALCYTAACTNTSCSGHGECVETINN YTCKCDPGFSGLKCEQIVNCTALESPEHGSLVCSHPLGNFSYNSSCSISCDRGYLPSS METMQCMSSGEWSAPIPACNVVECDAVTNPANGFVECFQNPGSFPWNTTCTFDCEEGF ELMGAQSLQCTSSGNWDNEKPTCKAVTCRAVRQPQNGSVRCSHSPAGEFTFKSSCNFT CEEGFMLQGPAQVECTTQGQWTQQIPVCEAFQCTALSNPERGYMNCLPSASGSFRYGS SCEFSCEQGFVLKGSKRLQCGPTGEWDNEKPTCEAVRCDAVHQPPKGLVRCAHSPIGE FTYKSSCAFSCEEGFELYGSTQLECTSQGQWTEEVPSCQVVKCSSLAVPGKINMSCSG EPVFGTVCKFACPEGWTLNGSAARTCGATGHWSGLLPTCEAPTESNIPLVAGLSAAGL SLLTLAPFLLWLRKCLRKAKKFVPASSCQSLESDGSYQKPSYIL" mat_peptide 180..1946 /gene="ELAM1" /note="G00-120-612" /product="endothelial leukocyte adhesion molecule 1" polyA_signal 3816..3821 /gene="ELAM1" /note="G00-120-612" polyA_site 3834 /gene="ELAM1" /note="G00-120-612" BASE COUNT 1140 a 769 c 854 g 1071 t ORIGIN 1 cctgagacag aggcagcagt gatacccacc tgagagatcc tgtgtttgaa caactgcttc 61 ccaaaacgga aagtatttca agcctaaacc tttgggtgaa aagaactctt gaagtcatga 121 ttgcttcaca gtttctctca gctctcactt tggtgcttct cattaaagag agtggagcct 181 ggtcttacaa cacctccacg gaagctatga cttatgatga ggccagtgct tattgtcagc 241 aaaggtacac acacctggtt gcaattcaaa acaaagaaga gattgagtac ctaaactcca 301 tattgagcta ttcaccaagt tattactgga ttggaatcag aaaagtcaac aatgtgtggg 361 tctgggtagg aacccagaaa cctctgacag aagaagccaa gaactgggct ccaggtgaac 421 ccaacaatag gcaaaaagat gaggactgcg tggagatcta catcaagaga gaaaaagatg 481 tgggcatgtg gaatgatgag aggtgcagca agaagaagct tgccctatgc tacacagctg 541 cctgtaccaa tacatcctgc agtggccacg gtgaatgtgt agagaccatc aataattaca 601 cttgcaagtg tgaccctggc ttcagtggac tcaagtgtga gcaaattgtg aactgtacag 661 ccctggaatc ccctgagcat ggaagcctgg tttgcagtca cccactggga aacttcagct 721 acaattcttc ctgctctatc agctgtgata ggggttacct gccaagcagc atggagacca 781 tgcagtgtat gtcctctgga gaatggagtg ctcctattcc agcctgcaat gtggttgagt 841 gtgatgctgt gacaaatcca gccaatgggt tcgtggaatg tttccaaaac cctggaagct 901 tcccatggaa cacaacctgt acatttgact gtgaagaagg atttgaacta atgggagccc 961 agagccttca gtgtacctca tctgggaatt gggacaacga gaagccaacg tgtaaagctg 1021 tgacatgcag ggccgtccgc cagcctcaga atggctctgt gaggtgcagc cattcccctg 1081 ctggagagtt caccttcaaa tcatcctgca acttcacctg tgaggaaggc ttcatgttgc 1141 agggaccagc ccaggttgaa tgcaccactc aagggcagtg gacacagcaa atcccagttt 1201 gtgaagcttt ccagtgcaca gccttgtcca accccgagcg aggctacatg aattgtcttc 1261 ctagtgcttc tggcagtttc cgttatgggt ccagctgtga gttctcctgt gagcagggtt 1321 ttgtgttgaa gggatccaaa aggctccaat gtggccccac aggggagtgg gacaacgaga 1381 agcccacatg tgaagctgtg agatgcgatg ctgtccacca gcccccgaag ggtttggtga 1441 ggtgtgctca ttcccctatt ggagaattca cctacaagtc ctcttgtgcc ttcagctgtg 1501 aggagggatt tgaattatat ggatcaactc aacttgagtg cacatctcag ggacaatgga 1561 cagaagaggt tccttcctgc caagtggtaa aatgttcaag cctggcagtt ccgggaaaga 1621 tcaacatgag ctgcagtggg gagcccgtgt ttggcactgt gtgcaagttc gcctgtcctg 1681 aaggatggac gctcaatggc tctgcagctc ggacatgtgg agccacagga cactggtctg 1741 gcctgctacc tacctgtgaa gctcccactg agtccaacat tcccttggta gctggacttt 1801 ctgctgctgg actctccctc ctgacattag caccatttct cctctggctt cggaaatgct 1861 tacggaaagc aaagaaattt gttcctgcca gcagctgcca aagccttgaa tcagacggaa 1921 gctaccaaaa gccttcttac atcctttaag ttcaaaagaa tcagaaacag gtgcatctgg 1981 ggaactagag ggatacactg aagttaacag agacagataa ctctcctcgg gtctctggcc 2041 cttcttgcct actatgccag atgcctttat ggctgaaacc gcaacaccca tcaccacttc 2101 aatagatcaa agtccagcag gcaaggacgg ccttcaactg aaaagactca gtgttccctt 2161 tcctactctc aggatcaaga aagtgttggc taatgaaggg aaaggatatt ttcttccaag 2221 caaaggtgaa gagaccaaga ctctgaaatc tcagaattcc ttttctaact ctcccttgct 2281 cgctgtaaaa tcttggcaca gaaacacaat attttgtggc tttctttctt ttgcccttca 2341 cagtgtttcg acagctgatt acacagttgc tgtcataaga atgaataata attatccaga 2401 gtttagagga aaaaaatgac taaaaatatt ataacttaaa aaaatgacag atgttgaatg 2461 cccacaggca aatgcatgga gggttgttaa tggtgcaaat cctactgaat gctctgtgcg 2521 agggttacta tgcacaattt aatcactttc atccctatgg gattcagtgc ttcttaaaga 2581 gttcttaagg attgtgatat ttttacttgc attgaatata ttataatctt ccatacttct 2641 tcattcaata caagtgtggt agggacttaa aaaacttgta aatgctgtca actatgatat 2701 ggtaaaagtt acttattcta gattaccccc tcattgttta ttaacaaatt atgttacatc 2761 tgttttaaat ttatttcaaa aagggaaact attgtcccct agcaaggcat gatgttaacc 2821 agaataaagt tctgagtgtt tttactacag ttgttttttg aaaacatggt agaattggag 2881 agtaaaaact gaatggaagg tttgtatatt gtcagatatt ttttcagaaa tatgtggttt 2941 ccacgatgaa aaacttccat gaggccaaac gttttgaact aataaaagca taaatgcaaa 3001 cacacaaagg tataatttta tgaatgtctt tgttggaaaa gaatacagaa agatggatgt 3061 gctttgcatt cctacaaaga tgtttgtcag atgtgatatg taaacataat tcttgtatat 3121 tatggaagat tttaaattca caatagaaac tcaccatgta aaagagtcat ctggtagatt 3181 tttaacgaat gaagatgtct aatagttatt ccctatttgt tttcttctgt atgttagggt 3241 gctctggaag agaggaatgc ctgtgtgagc aagcatttat gtttatttat aagcagattt 3301 aacaattcca aaggaatctc cagttttcag ttgatcactg gcaatgaaaa attctcagtc 3361 agtaattgcc aaagctgctc tagccttgag gagtgtgaga atcaaaactc tcctacactt 3421 ccattaactt agcatgtgtt gaaaaaaaaa gtttcagaga agttctggct gaacactggc 3481 aacgacaaag ccaacagtca aaacagagat gtgataagga tcagaacagc agaggttctt 3541 ttaaaggggc agaaaaactc tgggaaataa gagagaacaa ctactgtgat caggctatgt 3601 atggaataca gtgttatttt ctttgaaatt gtttaagtgt tgtaaatatt tatgtaaact 3661 gcattagaaa ttagctgtgt gaaataccag tgtggtttgt gtttgagttt tattgagaat 3721 tttaaattat aacttaaaat attttataat ttttaaagta tatatttatt taagcttatg 3781 tcagacctat ttgacataac actataaagg ttgacaataa atgtgcttat gttt // LOCUS HUMELASF 2242 bp mRNA PRI 07-NOV-1994 DEFINITION Human elastin mRNA, complete cds. ACCESSION M36860 NID g182061 KEYWORDS elastin. SOURCE Human skin fibroblast, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2242) AUTHORS Fazio,M.J., Olsen,D.R., Kauh,E.A., Baldwin,C.T., Indik,Z., Ornstein-Goldstein,N., Yeh,H., Rosenbloom,J. and Uitto,J. TITLE Cloning of full-length elastin cDNAs from a human skin fibroblast recombinant cDNA library: further elucidation of alternative splicing utilizing exon-specific oligonucleotides JOURNAL J. Invest. Dermatol. 91 (5), 458-464 (1988) MEDLINE 89009960 FEATURES Location/Qualifiers source 1..2242 /organism="Homo sapiens" /db_xref="taxon:9606" /map="7cen-q21.1" sig_peptide 50..127 /gene="ELN" /note="elastin signal peptide" CDS 50..2242 /gene="ELN" /note="elastin precursor" /codon_start=1 /db_xref="GDB:G00-119-107" /db_xref="PID:g182062" /translation="MAGLTAAAPRPGVLLLLLSILHPSRPGGVPGAIPGGVPGGVFYP GAGLGALGGGALGPGGKPLKPVPGGLAGAGLGAGLGAFPAVTFPGALVPGGVADAAAA YKAAKAGAGLGGVPGVGGLGVSAGAVVPQPGAGVKPGKVPGVGLPGVYPGGVLPGARF PGVGVLPGVPTGAGVKPKAPGVGGAFAGIPGVGPFGGPQPGVPLGYPIKAPKLPGGYG LPYTTGKLPYGYGPGGVAGAAGKAGYPTGTGVGPQAAAAAAAKAAAKFGAGAAGVLPG VGGAGVPGVPGAIPGIGGIAGVGTPAAAAAAAAAAKAAKYGAAAGLVPGGPGFGPGVV GVPGAGVPGVGVPGAGIPVVPGAGIPGAAVPGVVSPEAAAKAAAKAAKYGARPGVGVG GIPTYGVGAGGFPGFGVGVGGIPGVAGVPSVGGVPGVGGVPGVGISPEAQAAAAAKAA KYGVGTPAAAAAKAAAKAAQFALLNLAGLVPGVGVAPGVGVAPGVGVAPGVGLAPGVG VAPGVGVAPGVGVAPGIGPGGVAAAAKSAAKVAAKAQLRAAAGLGAGIPGLGVGVGVP GLGVGAGVPGLGVGAGVPGFGAVPGALAAAKAAKYGAAVPGVLGGLGALGGVGIPGGV VGAGPAAAAAAAKAAAKAAQFGLVGAAGLGGLGVGGLGVPGVGGLGGIPPAAAAKAAK YGAAGLGGVLGGAGQFPLGGVAARPGFGLSPIFPGGACLGKACGRKRK" gene 50..2242 /gene="ELN" mat_peptide 128..2239 /gene="ELN" /note="elastin" BASE COUNT 333 a 598 c 840 g 471 t ORIGIN 1 ccgggataaa acgaggtgcg gagagcgggc tggggcattt ctccccgaga tggcgggtct 61 gacggcggcg gccccgcggc ccggagtcct cctgctcctg ctgtccatcc tccacccctc 121 tcggcctgga ggggtccctg gggccattcc tggtggagtt cctggaggag tcttttatcc 181 aggggctggt ctcggagccc ttggaggagg agcgctgggg cctggaggca aacctcttaa 241 gccagttccc ggagggcttg cgggtgctgg ccttggggca gggctcggcg ccttccccgc 301 agttaccttt ccgggggctc tggtgcctgg tggagtggct gacgctgctg cagcctataa 361 agctgctaag gctggcgctg ggcttggtgg tgtcccagga gttggtggct taggagtgtc 421 tgcaggtgcg gtggttcctc agcctggagc cggagtgaag cctgggaaag tgccgggtgt 481 ggggctgcca ggtgtatacc caggtggcgt gctcccagga gctcggttcc ccggtgtggg 541 ggtgctccct ggagttccca ctggagcagg agttaagccc aaggctccag gtgtaggtgg 601 agcttttgct ggaatcccag gagttggacc ctttggggga ccgcaacctg gagtcccact 661 ggggtatccc atcaaggccc ccaagctgcc tggtggctat ggactgccct acaccacagg 721 gaaactgccc tatggctatg ggcccggagg agtggctggt gcagcgggca aggctggtta 781 cccaacaggg acaggggttg gcccccaggc agcagcagca gcggcagcta aagcagcagc 841 aaagttcggt gctggagcag ccggagtcct ccctggtgtt ggaggggctg gtgttcctgg 901 cgtgcctggg gcaattcctg gaattggagg catcgcaggc gttgggactc cagctgcagc 961 tgcagctgca gcagcagccg ctaaggcagc caagtatgga gctgctgcag gcttagtgcc 1021 tggtgggcca ggctttggcc cgggagtagt tggtgtccca ggagctggcg ttccaggtgt 1081 tggtgtccca ggagctggga ttccagttgt cccaggtgct gggatcccag gtgctgcggt 1141 tccaggggtt gtgtcaccag aagcagctgc taaggcagct gcaaaggcag ccaaatacgg 1201 ggccaggccc ggagtcggag ttggaggcat tcctacttac ggggttggag ctgggggctt 1261 tcccggcttt ggtgtcggag tcggaggtat ccctggagtc gcaggtgtcc ctagtgtcgg 1321 aggtgttccc ggagtcggag gtgtcccggg agttggcatt tcccccgaag ctcaggcagc 1381 agctgccgcc aaggctgcca agtacggagt ggggacccca gcagctgcag ctgctaaagc 1441 agccgccaaa gccgcccagt ttgctcttct caatcttgca gggttagttc ctggtgtcgg 1501 cgtggctcct ggagttggcg tggctcctgg tgtcggtgtg gctcctggag ttggcttggc 1561 tcctggagtt ggcgtggctc ctggagttgg tgtggctcct ggcgttggcg tggctcccgg 1621 cattggccct ggtggagttg cagctgcagc aaaatccgct gccaaggtgg ctgccaaagc 1681 ccagctccga gctgcagctg ggcttggtgc tggcatccct ggacttggag ttggtgtcgg 1741 cgtccctgga cttggagttg gtgctggtgt tcctggactt ggagttggtg ctggtgttcc 1801 tggcttcggg gcagtacctg gagccctggc tgccgctaaa gcagccaaat atggagcagc 1861 agtgcctggg gtccttggag ggctcggggc tctcggtgga gtaggcatcc caggcggtgt 1921 ggtgggagcc ggacccgccg ccgccgctgc cgcagccaaa gctgctgcca aagccgccca 1981 gtttggccta gtgggagccg ctgggctcgg aggactcgga gtcggagggc ttggagttcc 2041 aggtgttggg ggccttggag gtatacctcc agctgcagcc gctaaagcag ctaaatacgg 2101 tgctgctggc cttggaggtg tcctaggggg tgccgggcag ttcccacttg gaggagtggc 2161 agcaagacct ggcttcggat tgtctcccat tttcccaggt ggggcctgcc tggggaaagc 2221 ttgtggccgg aagagaaaat ga // LOCUS HUMELF2 1416 bp mRNA PRI 07-NOV-1994 DEFINITION Human translational initiation factor 2 beta subunit (elF-2-beta) mRNA, complete cds. ACCESSION M29536 NID g182066 KEYWORDS translational initiation factor. SOURCE Human liver, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1416) AUTHORS Pathak,V.K., Nielsen,P.J., Trachsel,H. and Hershey,J.W. TITLE Structure of the beta subunit of translational initiation factor eIF-2 JOURNAL Cell 54 (5), 633-639 (1988) MEDLINE 88311064 FEATURES Location/Qualifiers source 1..1416 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Unassigned" gene 117..1118 /gene="EIF2" CDS 117..1118 /gene="EIF2" /note="translational initiation factor beta subunit" /codon_start=1 /db_xref="GDB:G00-126-359" /db_xref="PID:g182067" /translation="MSGDEMIFDPTMSKKKKKKKKPFMLDEEGDTQTEETQPSETKEV EPEPTEDKDLEADEEDTRKKDASDDLDDLNFFNQKKKKKKTKKIFDIDEAEEGVKDLK IESDVQEPTEPEDDLDIMLGNKKKKKKNVKFPDEDEILEKDEALEDEDNKKDDGISFS NQTGPAWAGSERDYTYEELLNRVFNIMREKNPDMVAGEKRKFVMKPPQVVRVGTKKTS FVNFTDICKLLHRQPKHLLAFLLAELGTSGSIDGNNQLVIKGRFQQKQIENVLRRYIK EYVTCHTCRSPDTILQKDIRLYFLQCETCHSRCSVASIKTGFQAVTGKRAQLRAKAN" BASE COUNT 478 a 256 c 342 g 340 t ORIGIN 1 cgttgaagac attgtcgggt ggtgggagag gtatcggcag gggcagcgct gccgccgggg 61 cctggggctg acccgtctga cttcccgtcc gtgccgagcc cactcgagcc gcagccatgt 121 ctggggacga gatgattttt gatcctacta tgagcaagaa gaaaaagaag aagaagaagc 181 cttttatgtt agatgaggaa ggggataccc aaacagagga aacccagcct tcagaaacaa 241 aagaagtgga gccagagcca actgaggaca aggatttgga agctgatgaa gaggacacta 301 ggaaaaaaga tgcttctgat gatctagatg acttgaactt ctttaatcaa aagaaaaaga 361 agaaaaaaac taaaaagata tttgatattg atgaagctga agaaggtgta aaggatctta 421 agattgaaag tgatgttcaa gaaccaactg aaccagagga tgaccttgac attatgcttg 481 gcaataaaaa gaagaaaaag aagaatgtta agttcccaga tgaggatgaa atactagaga 541 aagatgaagc tctagaagat gaagacaaca aaaaagatga tggtatctca ttcagtaatc 601 agacaggccc tgcttgggca ggctcagaaa gagactacac atacgaggag ctgctgaatc 661 gagtgttcaa catcatgagg gaaaagaatc cagatatggt tgctggggag aaaaggaaat 721 ttgtcatgaa acctccacaa gtcgtccgag taggaaccaa gaaaacttct tttgtcaact 781 ttacagatat ctgtaaacta ttacatcgtc agcccaaaca tctccttgca tttttgttgg 841 ctgaattggg tacaagtggt tctatagatg gtaataacca acttgtaatc aaaggaagat 901 tccaacagaa acagatagaa aatgtcttga gaagatatat caaggaatat gtcacttgtc 961 acacatgccg atcaccggac acaatcctgc agaaggacat acgactctat ttcctacagt 1021 gcgaaacttg tcattctaga tgttctgttg ccagtatcaa aaccggcttc caggctgtca 1081 cgggcaagcg agcacagctc cgtgccaaag ctaactaatt tgctaatcac tgattttgca 1141 aagcttgttg tggagatgtg gctggacagg tttgccatca gagtggatat accgttgtat 1201 taaaaacaag ataaaaaagc tgccaagatt tttggcgagt ggttggtctg aagtccttgc 1261 aagacgctga tgctcaagct gttgacatac tcattgccta ctttaacacc tgtcagagaa 1321 acgtgatatg gggtaaggag gtgctttttt aaaatcgttc atagacttct gtaaaatgca 1381 agataaatta aagttattat aacagtgatt ctttca // LOCUS HUMELF4AII 1864 bp mRNA PRI 30-JUL-1997 DEFINITION Homo sapiens mRNA for eukaryotic initiation factor 4AII, complete cds. ACCESSION D30655 NID g485387 KEYWORDS eIF-4II; eukaryotic initiation factor 4AII. SOURCE Homo sapiens fetal lung cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1864) AUTHORS Nakamura,Y. TITLE Direct Submission JOURNAL Submitted (01-MAY-1994) to the DDBJ/EMBL/GenBank databases. Yusuke Nakamura, Cancer Institute, Department of Biochemistry; 1-37-1 Kami-Ikebukuro, Toshima-ku, Tokyo 170, Japan (E-mail:nakamura@ganvx1.jfcr.or.jp, Tel:03-3918-0111(ex.4501), Fax:03-3918-0342) REFERENCE 2 (bases 1 to 1864) AUTHORS Sudo,K. and Nakamura,Y. TITLE Isolation and mapping of human cDNA homologus to murine protein synthedsis initiation factor 4A-II (eIF4A-II) JOURNAL Unpublished (1994) FEATURES Location/Qualifiers source 1..1864 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="lung" gene 16..1239 /gene="eIF-4II" CDS 16..1239 /gene="eIF-4II" /note="mouse eIF4 homology" /codon_start=1 /product="eukaryotic initiation factor 4AII" /db_xref="PID:d1006902" /db_xref="PID:g485388" /translation="MSGGSADYNREHGGPEGMDPDGVIESNWNEIVDNFDDMNLKESL LRGIYAYGFEKPSAIQQRAIIPCIKGYDVIAQAQSGTGKTATFAISILQQLEIEFKET QALVLAPTRELAQQIQKVILALGDYMGATCHACIGGTNVRNEMQKLQAEAPHIVVGTP GRVFDMLNRRYLSPKWIKMFVLDEADEMLSRGFKDQIYEIFQKLNTSIQVVFASATMP TDVLEVTKKFMRDPIRILVKKEELTLEGIKQFYINVEREEWKLDTLCDLYETLTITQA VIFLNTRRKVDWLTEKMHARDFTVSALHGDMDQKERDVIMREFRSGSSRVLITTDLLA RGIDVQQVSLVINYDLPTNRENYIHRIGRGGRFGRKGVAINFVTEEDKRILRDIETFY NTTVEEMPMNVADLI" polyA_signal 1844..1849 polyA_site 1864 BASE COUNT 538 a 309 c 442 g 575 t ORIGIN 1 gtggtttttc ggatcatgtc tggtggctcc gcggattata acagagaaca tggcggccca 61 gagggaatgg accccgatgg tgtcatcgag agcaactgga atgagattgt tgataacttt 121 gatgatatga atttaaagga gtctctcctt cgtggcatct atgcttacgg ttttgagaag 181 ccttccgcta ttcagcagag agctattatt ccctgtatta aagggtatga tgtgattgct 241 caagctcagt caggtactgg caagacagcc acatttgcta tttccatcct gcaacagttg 301 gagattgagt tcaaggagac ccaagcacta gtattggccc ccaccagaga actggctcaa 361 cagatccaaa aggtaattct ggcacttgga gactatatgg gagccacttg tcatgcctgc 421 attggtggaa caaatgttcg aaatgaaatg caaaaactgc aggctgaagc accacatatt 481 gttgttggta cacccgggag agtgtttgat atgttaaaca gaagatacct ttctccaaaa 541 tggatcaaaa tgtttgtttt ggatgaagca gatgaaatgt tgagccgtgg ttttaaggat 601 caaatctatg agattttcca aaaactaaac acaagtattc aggttgtgtt tgcttctgcc 661 acaatgccaa ctgatgtgtt ggaagtgacc aaaaaattca tgagagatcc aattcgaatt 721 ctggtgaaaa aggaagaatt gacccttgaa ggaatcaaac agttttatat taatgttgag 781 agagaggaat ggaagttgga tacactttgt gacttgtacg agacactgac cattacacag 841 gctgttattt ttctcaatac gaggcgcaag gtggactggc tgactgagaa gatgcatgcc 901 agagacttca cagtttctgc tctgcatggt gacatggacc agaaggagag agatgttatc 961 atgagggaat tccggtcagg gtcaagtcgt gttctgatca ctactgactt gttggctcgc 1021 gggattgatg tgcaacaagt gtctttggtt ataaattatg atctacctac caatcgtgaa 1081 aactatattc acagaattgg cagagggggt cgatttggga ggaaaggtgt ggctataaac 1141 tttgttactg aagaagacaa gaggattctt cgtgacattg agactttcta caatactaca 1201 gtggaggaga tgcccatgaa tgtggctgac cttatttaat tcctgggatg agagttttgg 1261 atgcagtgct cgctgttgct gaataggcga tcacaacgtg cattgtgctt ctttctttgg 1321 gaatatttga atcttgtctc aatgctcata acggatcaga aatacagatt ttgatagcaa 1381 agcgacgtta gtcgtgagct cttgtgagga aagtcattgg ctttatcctc tttagagtta 1441 gactgttggg gtgggtataa aagatggggt ctgtaaaatc tttctttctt agaaatttat 1501 ttcctagttc tgtagaaatg gttgtattag atgttctcta tcatttaata atatacttgt 1561 ggactaaaag atataagtgc tgtataaaat cagccaatta tgttaaacta gcatatctgc 1621 ctttattgtg tttgtcatta gcctgagtag aaaggccttt aaaatttttt tagaaagcat 1681 ttgaatgcat tttgtttggt attgtattta ttcaataaag tatttaatta gtgctaagtg 1741 tgaactggac cctgttgcta agccccagca agcaatccta ggtagggttt aatccccagt 1801 aaaattgcca tattgcacat gtcttaatga agtttgaatg ttaaataaat tgtatattca 1861 cttt // LOCUS HUMELFT 2159 bp mRNA PRI 06-MAR-1995 DEFINITION Human ELAM-1 ligand fucosyltransferase (ELFT) mRNA, complete cds. ACCESSION M58596 NID g182068 KEYWORDS ELAM-1 ligand fucosyltransferase; alpha-(1,3) fucosyltransferase; endothelial-leukocyte adhesion molecule. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2159) AUTHORS Goelz,S.E., Hession,C., Goff,D., Griffiths,B., Tizard,R., Newman,B., Chi-Rosso,G. and Lobb,R. TITLE ELFT: a gene that directs the expression of an ELAM-1 ligand JOURNAL Cell 63 (6), 1349-1356 (1990) MEDLINE 91084863 FEATURES Location/Qualifiers source 1..2159 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="ELFT" /cell_line="HL60" /cell_type="promyelocytic" CDS 58..1275 /standard_name="ELFT" /codon_start=1 /product="ELAM-1 ligand fucosyltransferase" /db_xref="PID:g182069" /translation="MGAPWGSPTAAAGGRRGWRRGRGLPWTVCVLAAAGLTCTALITY ACWGQLPPLPWASPTPSRPVGVLLWWEPFGGRDSAPRPPPDCRLRFNISGCRLLTDRA SYGEAQAVLFHHRDLVKGPPDWPPPWGIQAHTAEEVDLRVLDYEEAAAAAEALATSSP RPPGQRWVWMNFESPSHSPGLRSLASNLFNWTLSYRADSDVFVPYGYLYPRSHPGDPP SGLAPPLSRKQGLVAWVVSHWDERQARVRYYHQLSQHVTVDVFGRGGPGQPVPEIGLL HTVARYKFYLAFENSQHLDYITEKLWRNALLAGAVPVVLGPDRANYERFVPRGAFIHV DDFPSASSLASYLLFLDRNPAVYRRYFHWRRSYAVHITSFWDEPWCRVCQAVQRAGDR PKSIRNLASWFER" BASE COUNT 403 a 636 c 643 g 477 t ORIGIN 1 cgctcctcca cgcctgcgga cgcgtggcga gcggaggcag cgctgcctgt tcgcgccatg 61 ggggcaccgt ggggctcgcc gacggcggcg gcgggcgggc ggcgcgggtg gcgccgaggc 121 cgggggctgc catggaccgt ctgtgtgctg gcggccgccg gcttgacgtg tacggcgctg 181 atcacctacg cttgctgggg gcagctgccg ccgctgccct gggcgtcgcc aaccccgtcg 241 cgaccggtgg gcgtgctgct gtggtgggag cccttcgggg ggcgcgatag cgccccgagg 301 ccgccccctg actgccggct gcgcttcaac atcagcggct gccgcctgct caccgaccgc 361 gcgtcctacg gagaggctca ggccgtgctt ttccaccacc gcgacctcgt gaaggggccc 421 cccgactggc ccccgccctg gggcatccag gcgcacactg ccgaggaggt ggatctgcgc 481 gtgttggact acgaggaggc agcggcggcg gcagaagccc tggcgacctc cagccccagg 541 cccccgggcc agcgctgggt ttggatgaac ttcgagtcgc cctcgcactc cccggggctg 601 cgaagcctgg caagtaacct cttcaactgg acgctctcct accgggcgga ctcggacgtc 661 tttgtgcctt atggctacct ctaccccaga agccaccccg gcgacccgcc ctcaggcctg 721 gccccgccac tgtccaggaa acaggggctg gtggcatggg tggtgagcca ctgggacgag 781 cgccaggccc gggtccgcta ctaccaccaa ctgagccaac atgtgaccgt ggacgtgttc 841 ggccggggcg ggccggggca gccggtgccc gaaattgggc tcctgcacac agtggcccgc 901 tacaagttct acctggcttt cgagaactcg cagcacctgg attatatcac cgagaagctc 961 tggcgcaacg cgttgctcgc tggggcggtg ccggtggtgc tgggcccaga ccgtgccaac 1021 tacgagcgct ttgtgccccg cggcgccttc atccacgtgg acgacttccc aagtgcctcc 1081 tccctggcct cgtacctgct tttcctcgac cgcaaccccg cggtctatcg ccgctacttc 1141 cactggcgcc ggagctacgc tgtccacatc acctccttct gggacgagcc ttggtgccgg 1201 gtgtgccagg ctgtacagag ggctggggac cggcccaaga gcatacggaa cttggccagc 1261 tggttcgagc ggtgaagccg cgctcccctg gaagcgaccc aggggaggcc aagttgtcag 1321 ctttttgatc ctctactgtg catctccttg actgccgcat catgggagta agttcttcaa 1381 acacccattt ttgctctatg ggaaaaaaac gatttaccaa ttaatattac tcagcacaga 1441 gatgggggcc cggtttccat attttttgca cagctagcaa ttgggctccc tttgctgctg 1501 atgggcatca ttgtttaggg gtgaaggagg gggttcttcc tcaccttgta accagtgcag 1561 aaatgaaata gcttagcggc aagaagccgt tgaggcggtt tcctgaattt ccccatctgc 1621 cacaggccat atttgtggcc cgtgcagctt ccaaatctca tacacaactg ttcccgattc 1681 acgtttttct ggaccaaggt gaagcaaatt tgtggttgta gaaggagcct tgttggtgga 1741 gagtggaagg actgtggctg caggtgggac tttgttgttt ggattcctca cagccttggc 1801 tcctgagaaa ggtgaggagg gcagtccaag aggggccgct gacttctttc acaagtacta 1861 tctgttcccc tgtcctgtga atggaagcaa agtgctggat tgtccttgga ggaaacttaa 1921 gatgaataca tgcgtgtacc tcactttaca taagaaatgt attcctgaaa agctgcattt 1981 aaatcaagtc ccaaattcat tgacttaggg gagttcagta tttaatgaaa ccctatggag 2041 aatttatccc tttacaatgt gaatagtcat ctcctaattt gtttcttctg tctttatgtt 2101 tttctataac ctggattttt taaatcatat taaaattaca gatgtgaaaa taaaaaaaa // LOCUS HUMELI 2677 bp mRNA PRI 30-SEP-1988 DEFINITION Human erythroid isoform protein 4.1 mRNA, complete cds. ACCESSION J03796 NID g182072 KEYWORDS erythroid protein 4.1. SOURCE Human T-cell leukemia line MOLT-4, cDNA to mRNA, clones pTM-[1,2]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2677) AUTHORS Tang,T.K., Leto,T.L., Correas,I., Alonso,M.A., Marchesi,V.T. and Benz,E.J.Jr.. TITLE Selective expression of an erythroid-specific isoform of protein 4.1 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 3713-3717 (1988) MEDLINE 88234496 COMMENT Draft entry and printed copy of sequence for [1] kindly provided by T.K.Tang, 26-APR-1988. FEATURES Location/Qualifiers source 1..2677 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 47..2374 /note="erythroid protein 4.1 isoform A" /codon_start=1 /db_xref="PID:g182073" /translation="MTTEKSLVTEAENSQHQQKEEGEEAINSGQQEPQQEESCQTAAE GDNWCEQKLKASNGDTPTHEDLTKNKERTSESRGLSRLFSSFLKRPKSQVSEEEGKEV ESDKEKGEGGQKEIEFGTSLDEEIILKAPIAAPEPELKTDPSLDLHSLSSAETQPAQE ELREDPDFEIKEGEGLEECSKIEVKEESPQSKAETELKASQKPIRKHRNMHCKVSLLD DTVYECVVETWLDSAKEIKKQVRGVPWNFTFNVKFYPPDPAQLTEDITRYYLCLQLRQ DIVAGRLPCSFATLALLGSYTIQSELGDYDPELHGVDYVSDFKLAPNQTKELEEKVME LHKSYRSMTPAQADLEFLENAKKLSMYGVDLHKAKDLEGVDIILGVCSSGLLVYKDKL RINRFPWPKVLKISYKRSSFFIKIRPGEQEQYESTIGFKLPSYRAAKKLWKVCVEHHT FFRLTSTDTIPKSKFLALGSKFRYSGRTQAQTRQASALIDRPAPHFERTASKRASRSL DGAAAVDSADRSPRPTSAPAITQGQVAEGGVLDASAKKTVVPKAQKETVKAEVKKEDE PPEQAEPEPTEAWKDLDKSQEEIKKHHASISELKKNFMESVPEPRPSEWDKRLSTHSP FRTLNINGQIPTGEGPPLVKTQTVTISDNANAVKSEIPTKDVPIVHTETKTITYEAAQ TDDNSGDLDPGVLLTAQTITSETPSSTTTTKITKTVKGGISETRIEKRIVITGDADID HDQVLVQAIKEAKEQHPDMSVTKVVVHQETEIADE" CDS 674..2374 /note="erythroid protein 4.1 isoform B" /codon_start=1 /db_xref="PID:g182074" /translation="MHCKVSLLDDTVYECVVETWLDSAKEIKKQVRGVPWNFTFNVKF YPPDPAQLTEDITRYYLCLQLRQDIVAGRLPCSFATLALLGSYTIQSELGDYDPELHG VDYVSDFKLAPNQTKELEEKVMELHKSYRSMTPAQADLEFLENAKKLSMYGVDLHKAK DLEGVDIILGVCSSGLLVYKDKLRINRFPWPKVLKISYKRSSFFIKIRPGEQEQYEST IGFKLPSYRAAKKLWKVCVEHHTFFRLTSTDTIPKSKFLALGSKFRYSGRTQAQTRQA SALIDRPAPHFERTASKRASRSLDGAAAVDSADRSPRPTSAPAITQGQVAEGGVLDAS AKKTVVPKAQKETVKAEVKKEDEPPEQAEPEPTEAWKDLDKSQEEIKKHHASISELKK NFMESVPEPRPSEWDKRLSTHSPFRTLNINGQIPTGEGPPLVKTQTVTISDNANAVKS EIPTKDVPIVHTETKTITYEAAQTDDNSGDLDPGVLLTAQTITSETPSSTTTTKITKT VKGGISETRIEKRIVITGDADIDHDQVLVQAIKEAKEQHPDMSVTKVVVHQETEIADE " BASE COUNT 873 a 604 c 603 g 597 t ORIGIN 828 bp upstream of HincII site; chromosome 1p32-1pter. 1 agaacgcggt cggcccggtc cccgccgcac ccagcccagc aacatcatga caacagagaa 61 gagtttagtg actgaggccg aaaattcaca gcaccaacag aaggaagagg gtgaggaagc 121 cataaactca ggccaacaag aacctcagca ggaggaatct tgtcaaacag cagctgaagg 181 agataattgg tgtgaacaga agctgaaagc ttctaatgga gacactccta cacatgaaga 241 cttgaccaag aacaaggagc ggacatcaga aagcagagga ctttcacgac tattctcctc 301 gtttctcaaa aggcccaaat ctcaggtgtc cgaggaagaa ggcaaagaag tagagtcaga 361 taaagaaaaa ggtgaaggag gtcagaaaga gatagaattt ggaaccagtc ttgatgaaga 421 gatcatttta aaggccccaa ttgcagctcc tgaaccggaa ctcaaaacag acccatcttt 481 ggatcttcat tcattaagca gtgcagaaac acagcctgct caggaagaac tcagagaaga 541 tccagatttt gaaattaagg aaggagaagg acttgaagag tgctccaaaa tagaagtaaa 601 agaagaaagc cctcaatcaa aagcagaaac agaattaaaa gcttcccaaa aaccaatcag 661 aaaacacagg aacatgcact gcaaggtttc tttgttggat gacacagttt atgaatgtgt 721 tgtggagaca tggctggatt ccgccaaaga aataaaaaag caggttcgtg gtgtcccttg 781 gaattttaca tttaatgtaa agttttatcc acctgaccca gcacagttaa cagaagacat 841 aacaagatat tatttatgtc ttcagcttcg gcaggacata gttgcaggac gtctgccctg 901 ttcctttgca accttagcat tattaggttc ttacaccatc cagtctgaac tgggagacta 961 cgacccagaa ctccatggcg tggattatgt tagtgatttt aaactggccc cgaatcagac 1021 caaggaactt gaagagaagg tcatggaact gcataagtca tacaggtcca tgactccagc 1081 tcaggctgac ttggagtttc ttgagaatgc caaaaagttg tctatgtatg gagttgatct 1141 tcataaagca aaggacttgg aaggagtaga tatcatccta ggtgtctgct ctagtggcct 1201 tctggtttac aaagataagc tgagaattaa ccgcttccct tggcccaaag tgctgaagat 1261 ttcttataaa cgtagtagct ttttcatcaa gattcggcct ggagagcaag agcagtatga 1321 aagtaccatc ggattcaaac ttcccagtta ccgagcagct aagaaattat ggaaagtctg 1381 tgtagaacat cacacgtttt tcagattgac atctacagac accattccca aaagcaaatt 1441 tcttgcgcta ggatccaaat ttcgatacag tggccggact caagctcaga ccaggcaagc 1501 tagtgctcta attgacaggc ctgccccaca cttcgagcgt acagcaagta aacgggcgtc 1561 ccggagcctc gatggagcag cagctgtcga ttcggcagac cgaagtcctc ggcccacttc 1621 tgcacctgcc attactcagg gtcaggttgc agaaggtggc gtcctagatg cctctgctaa 1681 aaaaacagtg gtccctaaag cacagaagga aacagtgaag gctgaagtga aaaaggaaga 1741 cgagccacct gagcaagctg agccagagcc cacagaagca tggaaggatt tagacaagag 1801 tcaagaggag atcaaaaaac atcatgccag catcagtgag ctgaaaaaga acttcatgga 1861 gtctgtacca gaaccacggc ctagtgaatg ggataaacgc ttatccactc actcaccctt 1921 ccgaactctt aacatcaatg ggcaaatccc cacaggagaa ggacctcccc tggtgaagac 1981 acaaactgtc accatctcag ataatgccaa tgctgtgaaa agtgaaatcc caaccaaaga 2041 cgtccctatt gtccacactg agaccaagac catcacttat gaggctgccc agactgacga 2101 caacagtgga gacttggacc caggagtctt gctgacagct caaactatca catctgagac 2161 cccaagcagc accaccacaa ctaaaattac caagactgta aaaggtggga tttcagagac 2221 acgtattgaa aagagaattg tgatcacagg agatgctgat attgaccatg atcaggtcct 2281 tgtacaagcc atcaaggagg caaaggagca gcacccagac atgtcagtga ccaaggtggt 2341 cgtccaccag gagaccgaga ttgctgatga gtgagctcag gaactaacct accccaactc 2401 tgcccttctc ccatccaaga gaaaccagca aaatgataaa gaagctaacc tgccatagtc 2461 agacttcaga ctttcaagat tattctaaat caccagaaaa ttaatttcag tttctattgg 2521 gagtttatac caagagattc ttctagatct cattgatcct tttgaagagc tttttctata 2581 ttaggatatc agaattgttc aacttttcac tctatagact gttttaagag ttttggggtt 2641 ttttttaatt gggtggtttg taaccccttc agcctag // LOCUS HUMELK1A 2266 bp mRNA PRI 07-NOV-1994 DEFINITION Homo sapiens tyrosine kinase (ELK1) oncogene mRNA, complete cds. ACCESSION M25269 NID g538208 KEYWORDS ETS1 gene; oncogene; tyrosine kinase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2266) AUTHORS Rao,V.N., Huebner,K., Isobe,M., ar-Rushdi,A., Croce,C.M. and Reddy,E.S. TITLE elk, tissue-specific ets-related genes on chromosomes X and 14 near translocation breakpoints JOURNAL Science 244 (4900), 66-70 (1989) MEDLINE 89203250 FEATURES Location/Qualifiers source 1..2266 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="COLO 320" /clone="lambda-11" /map="Xp22.1-p11" gene 316..1602 /gene="ELK1" CDS 316..1602 /gene="ELK1" /codon_start=1 /db_xref="GDB:G00-119-867" /product="tyrosine kinase" /db_xref="PID:g538209" /translation="MDPSVTLWQFLLQLLREQGNGHIISWTSRDGGEFKLVDAEEVAR LWGLRKNKTNMNYDKLSRALRYYYDKNIIRKVSGQKFVYKFVSYPEVAGCSTEDCPPQ PEVSVTSTMPNVAPAAIHAAPGDTVSGKPGTPKGAGMAGPGGLARSSRNEYMRSGLYS TFTIQSLQPQPPPHPRPAVVLPNAAPAGAAAPPSGSRSTSPSPLEACLEAEEAGLPLQ VILTPPEAPNLKSEELNVEPGLGRALPPEVKVEGPKEELEVAGERGFVPETTKAEPEV PPQEGVPARLPAVVMDTAGQAGGHAASSPEISQPQKGRKPRDLELPLSPSLLGGPGPE RTPGSGSGSGLQAPGPALTPSLLPTHTLTPVLLTPSSLPPSIHFWSTLSPIAPRSPAK LSFQFPSSGSAQVHIPSISVDGLSTPVVLSPGPQKP" BASE COUNT 457 a 731 c 605 g 473 t ORIGIN 1 aattccgagc tgtagggaaa cgcaggggcg gcttctaggt gctgccgccg ccaccgccac 61 caccacctcc accgccgcct cggaacccag gcctgggggg cggtggggcc gcgtatggag 121 cccccgcccc ccggagctgc caacattgcc aacgccaccg ccacgctaca cacagcctca 181 actttcagga gacccgtccg tggccttatt tattccaccc ttcctgtaca tcgtagcgaa 241 tcaatccgtg gcgccgcact cctccgcatc cctctttaac agtacccctg ggatggcgtg 301 agcactcccc cagcgatgga cccatctgtg acgctgtggc agtttctgct gcagctgctg 361 agagagcaag gcaatggcca catcatctcc tggacttcac gggatggtgg tgaattcaag 421 ctggtggatg cagaggaggt ggcccggctg tggggactac gcaagaacaa gaccaacatg 481 aattacgaca agctcagccg ggccttgcgg tactactatg acaagaacat catccgcaag 541 gtgagcggcc agaagttcgt ctacaagttt gtgtcctacc ctgaggtcgc agggtgctcc 601 actgaggact gcccgcccca gccagaggtg tctgttacct ccaccatgcc aaatgtggcc 661 cctgctgcta tacatgccgc cccaggggac actgtctctg gaaagccagg cacacccaag 721 ggtgcaggaa tggcaggccc aggcggtttg gcacgcagca gccggaacga gtacatgcgc 781 tcgggcctct attccacctt caccatccag tctctgcagc cgcagccacc ccctcatcct 841 cggcctgctg tggtgctccc caatgcagct cctgcagggg cagcagcgcc cccctcgggg 901 agcaggagca ccagtccaag ccccttggag gcctgtctgg aggctgaaga ggccggcttg 961 cctctgcagg tcatcctgac cccgcccgag gccccaaacc tgaaatcgga agagcttaat 1021 gtggagccgg gtttgggccg ggctttgccc ccagaagtga aagtagaagg gcccaaggaa 1081 gagttggaag ttgcggggga gagagggttt gtgccagaaa ccaccaaggc cgagccagaa 1141 gtccctccac aggagggcgt gccagcccgg ctgcccgcgg ttgttatgga caccgcaggg 1201 caggcgggcg gccatgcggc ttccagccct gagatctccc agccgcagaa gggccggaag 1261 ccccgggacc tagagcttcc actcagcccg agcctgctag gtgggccggg acccgaacgg 1321 accccaggat cgggaagtgg ctccggcctc caggctccgg ggccggcgct gaccccatcc 1381 ctgcttccta cgcatacatt gaccccggtg ctgctgacac ccagctcgct gcctcctagc 1441 attcacttct ggagcaccct gagtcccatt gcgccccgta gcccggccaa gctctccttc 1501 cagtttccat ccagtggcag cgcccaggtg cacatccctt ctatcagcgt ggatggcctc 1561 tcgacccccg tggtgctctc cccagggccc cagaagccat gactactacc accaccacca 1621 ccaccccttc tggggtcact ccatccatgc tctctccagc cagccatctc aaggagaaac 1681 atagttcaac tgaaagactc atgctctgat tgtggtgggg tggggatcct tgggaagaat 1741 tactcccaag agtaactctc attatctcct ccacagaaaa cacacagctt ccacaacttc 1801 tctgttttct gtcagtcccc cagtggccgc ccttacacgt ctcctacttc aatggtaggg 1861 gcggtttatt tatttatttt ttgaaggcca ctgggatgag cctgacctaa ccttttaggg 1921 tggttaggac atctccccca cctccccact tttttcccca agacaagaca atcgaggtct 1981 ggcttgagaa cgacctttct ttctttattt ctcagcctgc ccttggggag atgagggagc 2041 cctgtctgcg tttttggatg tgagtagaag agttagtttg ttttgtttta ttattcctgg 2101 ccatactcag gggtccagga agaatttgta ccatttaatg ggttgggagt cttggccaag 2161 gaagaatcac acccttggaa tagaaatttc cacctccccc aacctttctc tcagacagct 2221 tatccttttt caaccaactt tttggccagg gaggaatgtc cctttt // LOCUS HUMELONA 2676 bp mRNA PRI 18-SEP-1995 DEFINITION Homo sapiens elongin A mRNA, complete cds. ACCESSION L47345 NID g992562 KEYWORDS RNA polymerase; RNA polymerase II; RNA polymerase II elongation factor; elongin A. SOURCE Homo sapiens (clone: pSPORT 1) umbilical vein cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2676) AUTHORS Aso,T., Haque,D., Fukudome,K., Brower,C.S., Conaway,J.W. and Conaway,R.C. TITLE A human cDNA encoding the 110-kDa subunit of RNA polymerase II transcription factor Elongin (SIII) JOURNAL Gene (1995) In press FEATURES Location/Qualifiers source 1..2676 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pSPORT 1" /cell_type="epithelial cell" /tissue_type="umbilical vein" CDS 33..2351 /standard_name="SIII p110" /codon_start=1 /function="RNA polymerase II elongation factor active subunit" /evidence=experimental /product="elongin A" /db_xref="PID:g992563" /translation="MAAESALQVVEKLQARLAANPDPKKLLKYLKKLSTLPITVDILA ETGVGKTVNSLRKHEHVGSFARDLVAQWKKLVPVERNAEPDEQDFEKSNSRKRPRDAL QKEEEMEGDYQETWKATGSRSYSPDHRQKKHRKLSELERPHKVSHGHERRDERKRCHR MSPTYSSDPESSDYGHVQSPPSCTSPHQMYVDHYRSLEEDQEPIVSHQKPGKGHSNAF QDRLGASQERHLGEPHGKGVVSQNKEHKSSHKDKRPVDAKSDEKASVVSREKSHKALS KEENRRPPSGDNAREKPPSSGVKKEKDREGSSLKKKCLPPSEAASDNHLKKPKHRDPE KAKLDKSKQGLDSFDTGKGAGDLLPKVKEKGSNNLKTPEGKVKTNLDRKSLGSLPKVE ETDMEDEFEQPTMSFESYLSYDQPRKKKKKIVKTSATALGDKGLKKNDSKSTGKNLDS VQKLPKVNKTKSEKPAGADLAKLRKVPDVLPVLPDLPLPAIQANYRPLPSLELISSFQ PKRKAFSSPQEEEEAGFTGRRMNSKMQVYSGSKCAYLPKMMTLHQQCIRVLKNNIDSI FEVGGVPYSVLEPVLERCTPDQLYRIEEYNHVLIEETDQLWKVHCHRDFKEERPEEYE SWREMYLRLQDAREQRLRVLTKNIQFAHANKPKGRQAKMAFVNSVAKPPRDVRRRQEK FGTGGAAVPEKIKIKPAPYPMGSSHASASSISFNPSPEEPAYDGPSTSSAHLAPVVSS TVSYDPRKPTVKKIAPMMAKTIKAFKNRFSRR" BASE COUNT 807 a 646 c 700 g 523 t ORIGIN 1 gttccggcga ggaggccgcg ccagtgacag cgatggcggc ggagtcggcg ctccaagttg 61 tggagaagct gcaggcgcgc ctggccgcga acccggaccc taagaagcta ttgaaatatt 121 tgaagaaact ctccaccctg cctattacag tagacattct tgcggagact ggggttggga 181 aaacagtaaa tagcttgcga aaacacgagc atgttggaag ctttgccagg gacctagtgg 241 cccagtggaa gaagctggtt cctgtggaac gaaatgctga gcctgatgaa caggactttg 301 agaagagcaa ttcccgaaag cgccctcggg atgccctgca gaaggaggag gagatggagg 361 gggactacca agaaacctgg aaagccacgg ggagccgatc ctatagccct gaccacaggc 421 agaagaaaca taggaaactc tcggagctcg agagacctca caaagtgtct cacggtcatg 481 agaggagaga tgagagaaag aggtgtcaca gaatgtcacc aacttactct tcagaccctg 541 agtcttctga ttatggccat gttcaatccc ctccatcttg taccagtcct catcagatgt 601 acgtcgacca ctacagatcc ctggaggagg accaggagcc cattgtttca caccagaagc 661 ctgggaaagg ccacagcaat gcctttcagg acagactcgg ggccagccaa gaacgacacc 721 tgggtgaacc ccatgggaaa ggggttgtga gtcaaaacaa ggagcacaaa tcttcccaca 781 aggacaaacg ccccgtggat gccaagagtg atgagaaggc ctctgtggtg agcagagaga 841 aatcacacaa ggccctctcc aaagaggaga accgaaggcc accctcaggg gacaatgcaa 901 gggagaaacc gccctctagt ggcgtaaaga aagagaagga cagagagggc agcagcctga 961 agaagaagtg tttgcctccc tcagaggccg cttcagacaa ccacctgaaa aagccaaagc 1021 acagagaccc agagaaagcc aaattggaca aaagcaagca aggtctggac agctttgaca 1081 caggaaaagg agcaggagac ctgttgccca aggtaaaaga gaagggttct aacaacctaa 1141 agactccaga agggaaagtc aaaactaatt tggatagaaa gtcactgggc tccctcccta 1201 aagttgagga gacagatatg gaggatgaat tcgagcagcc aaccatgtct tttgaatcct 1261 acctcagcta tgaccagccc cggaagaaaa agaaaaagat tgtgaaaact tcagccacgg 1321 cacttggaga taaaggactt aaaaaaaatg actctaaaag cactggtaaa aacttggact 1381 cagttcagaa attacccaag gtgaacaaaa ccaagtcaga gaagccggct ggagctgatt 1441 tagccaagct gagaaaggtg cctgatgtgt tgccagtgtt gccagacctc ccgttacccg 1501 cgatacaggc caattaccgt ccactgcctt ccctcgagct gatatcctcc ttccagccaa 1561 agcgaaaagc gttctcttca ccccaggaag aagaagaagc tggatttact gggcgcagaa 1621 tgaattccaa gatgcaggtg tattctggtt ccaagtgtgc ctatctccct aaaatgatga 1681 ccttgcacca gcaatgcatc cgagtactta aaaacaacat cgattcaatc tttgaagtgg 1741 gaggagtccc atactctgtt cttgaacccg ttttggagag gtgtacacct gatcagctgt 1801 atcgcataga ggaatacaat catgtattaa ttgaagaaac agatcaatta tggaaagttc 1861 attgtcaccg agactttaag gaagaaagac ccgaagagta tgagtcgtgg cgagagatgt 1921 acctgcggct tcaggacgcc cgagagcagc ggctacgagt actaacaaag aatatccagt 1981 tcgcacatgc caataagccc aaaggccgac aagcaaagat ggcctttgtc aactctgtgg 2041 ccaagccacc tcgtgacgtc cggaggaggc aggaaaagtt tggaacggga ggagcagctg 2101 tccctgagaa aatcaagatc aagccagccc cgtaccccat gggaagcagc catgcttccg 2161 ccagtagcat cagctttaac cccagccctg aggagccggc ctatgatggc ccaagcacca 2221 gcagtgccca cttggcacca gtggtcagca gcactgtttc ctatgatcct aggaaaccca 2281 ctgtgaagaa aattgcccca atgatggcca agacaattaa agctttcaag aacagattct 2341 cccgacgata aactgaggac ttgccttgga aatggaatct ggggaggcag gaatacaagg 2401 acagtggggg ttggggaatg gaattctaca ggagactgga gtcttgcttt gtggatcctt 2461 ttggtctccg agtctgcagt ctgcaggtgc tgcccctggg aacctgcgtg ccacagcccc 2521 gcctccctgc ctggagcaca ctttagaatt ctgaagatgt gaagcctctg tctcactgag 2581 gattttaaag gtcaattata cttttgttgt tcattagcat ctttgtaaac tataagacgt 2641 agttttaatt aataaatatt gcccccagat gttaaa // LOCUS HUMEMP42 2335 bp mRNA PRI 15-JUN-1990 DEFINITION Human erythrocyte membrane protein band 4.2 mRNA, complete cds. ACCESSION M29399 NID g182083 KEYWORDS erythrocyte membrane band 4.2 protein. SOURCE Human peripheral blood reticulocyte, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2335) AUTHORS Korsgren,C., Lawler,J., Lambert,S., Speicher,D. and Cohen,C.M. TITLE Complete amino acid sequence and homologies of human erythrocyte membrane protein band 4.2 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 613-617 (1990) MEDLINE 90138879 COMMENT Draft entry and printed sequence for [1] kindly submitted by C.Korsgren 25-OCT-1989. FEATURES Location/Qualifiers source 1..2335 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 188..2263 /note="erythrocyte membrane protein band 4.2" /codon_start=1 /db_xref="PID:g182084" /translation="MGQALGIKSCDFQAARNNEEHHTKALSSRRLFVRRGQPFTIILY FRAPVRAFLPALKKVALTAQTGEQPSKINRTQATFPISSLGDRKWWSAVVEERDAQSW TISVTTPADAVIGHYSLLLQVSGRKQLLLGQFTLLFNPWNREDAVFLKNEAQRMEYLL NQNGLIYLGTADCIQAESWDFGQFEGDVIDLSLRLLSKDKQVEKWSQPVHVARVLGAL LHFLKEQRVLPTPQTQATQEGALLNKRRGSVPILRQWLTGRGRPVYDGQAWVLAAVAC TVLRCLGIPARVVTTFASAQGTGGRLLIDEYYNEEGLQNGEGQRGRIWIFQTSTECWM TRPALPQGYDGWQILDPSAPNGGGVLGSCDLVPVRAVKEGTVGLTPAVSDLFAAINAS CVVWKCCEDGTLELTDSNTKYVGNNISTKGVGSDRCEDITQNYKYPEGSLQEKEVLER VEKEKMEREKDNGIRPPSLETASPLYLLLKAPSSLPLRGDAQISVTLVNHSEQEKAVQ LAIGVQAVHYNGVLAAKLWRKKLHLTLSANLEKIITIGLFFSNFERNPPENTFLRLTA MATHSESNLSCFAQEDIAICRPHLAIKMPEKAEQYQPLTASVSLQNSLDAPMEDCVIS ILGRGLIHRERSYRFRSVWPENTMCAKFQFTPTHVGLQRLTVEVDCNMFQNLTNYKSV TVVAPELSA" BASE COUNT 567 a 635 c 652 g 481 t ORIGIN 1 cgaagaaaca tgtcagggtg ctcacaggag tagtgggggg aggttttgct atttccagat 61 tcttaagcca acaaaagtgc cttcatattt tctgtctgga agacagaaag cccagaagga 121 gcccagaagc aacagtttga gagaggcgct tctgcggcca agtggataag aggagcggcc 181 tgcaaccatg ggacaggccc tgggtatcaa gagctgtgac tttcaggcag caagaaacaa 241 tgaggagcac cacaccaagg ccctcagctc ccggcgcctc tttgtgagga gggggcagcc 301 cttcaccatc atcctgtact tccgcgctcc agtccgtgca tttctgcctg ccctgaagaa 361 ggtggccctc actgcacaaa ctggagagca gccttccaag atcaacagga cccaagccac 421 attcccaatt tccagtctgg gggaccgaaa gtggtggagt gcagtggtgg aggagagaga 481 tgcccagtcc tggaccatct ctgtgaccac acctgcggac gctgtcattg gccactactc 541 gcttctgctg caggtctcag gcaggaagca actcctcttg ggtcagttca cactgctttt 601 taacccctgg aatagagagg atgctgtttt cctgaagaat gaggctcagc gcatggagta 661 cttgttgaac cagaatggtc tcatctacct gggtacagct gactgcatcc aggcagagtc 721 ctgggacttt ggccagttcg agggggatgt cattgacctc agcctgcgct tgctgagcaa 781 ggacaagcag gtagagaagt ggagccagcc ggtgcacgtg gcccgtgtgt tgggtgcctt 841 gctgcatttt ctcaaggagc agagggtcct gcccaccccg cagacccagg ccacccagga 901 aggggccttg ctgaacaagc gccggggcag cgtgcccatc ctgcggcagt ggctcaccgg 961 ccgaggccga cctgtgtatg atggccaggc ctgggtgttg gctgctgttg cttgcacagt 1021 gctgcgatgc ctgggaatcc ctgcccgcgt ggtgaccacg tttgcctcag cacagggcac 1081 cggtgggcgt cttctcatag atgaatacta taatgaggag ggacttcaga acggagaagg 1141 ccagagaggc agaatctgga tcttccagac ttccacagag tgctggatga cgcggcctgc 1201 cttgccccag ggttatgatg gatggcagat tctcgaccca agtgctccta atggaggtgg 1261 agtcctgggg tcctgtgatc tggtgccggt cagagcagtc aaggagggga ccgtggggct 1321 gaccccagca gtgtcagacc tttttgctgc cataaatgcc tcatgtgtgg tctggaagtg 1381 ctgtgaggat gggacactgg agttgactga ctccaacaca aagtatgttg gcaacaacat 1441 cagcaccaag ggtgtgggca gtgaccgctg cgaggacatc actcagaact acaagtatcc 1501 tgaagggtct cttcaggaaa aagaggtgct ggagagagtc gagaaagaga aaatggaacg 1561 tgagaaagac aacggcatcc gtcctcccag tctcgagact gccagtcctc tgtacctgct 1621 cttgaaagca cccagctccc tacccctgag aggggatgcc cagatctcag tgacgctggt 1681 taatcacagt gagcaggaga aggcagtgca gctggcaatt ggggtccagg ctgtacacta 1741 caacggtgtc cttgctgcca agctctggag gaagaagctg cacctcacgc tcagtgccaa 1801 cctggaaaag ataataacca tcggcctgtt cttctccaat tttgagcgaa acccacccga 1861 gaacaccttc cttagactca ccgccatggc aacacactct gaatccaacc ttagctgctt 1921 tgctcaggaa gacattgcca tttgtagacc acaccttgcc atcaagatgc cagagaaagc 1981 agagcagtat caacccctca cagcctcagt cagcctccag aactccctag atgcccccat 2041 ggaggactgt gtgatctcca tcctgggaag ggggctcatt cacagagaga ggagctacag 2101 attccgttca gtgtggcctg aaaacaccat gtgtgccaag ttccagttca cgccaacaca 2161 tgtggggctc cagagactca ctgtggaagt ggactgcaac atgttccaga acctaaccaa 2221 ctataaaagc gtcaccgtgg tagcccctga actatcagct taaacttcca gctctatcac 2281 cactctcctg ccaacccttg ttctacaatc taaaccaaac atgtgctagg aagag // LOCUS HUMEMS 3248 bp mRNA PRI 31-DEC-1994 DEFINITION Homo sapiens amplaxin (EMS1) mRNA, complete cds. ACCESSION M98343 NID g182086 KEYWORDS amplaxin. SOURCE Homo sapiens (tissue library: lambda gt11) female breast cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3248) AUTHORS Schuuring,E.M.D., Verhoeven,E., Litvinov,S., de Boer,C. and Michalides,R.A.M. TITLE The product of the EMS1 gene, amplaxin, amplified and overexpressed in human carcinomas with a chromosome 11q13 amplification, is located in the cell-to-substrate contact sites JOURNAL Unpublished (1992) FEATURES Location/Qualifiers source 1..3248 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="T47D" /cell_type="carcinoma" /sex="female" /tissue_type="breast" /tissue_lib="lambda gt11" /map="11q1.3" gene 169..3230 /gene="EMS1" CDS 169..1821 /gene="EMS1" /codon_start=1 /product="amplaxin" /db_xref="PID:g182087" /translation="MWKASAGHAVSIAQDDAGADDWETDPDFVNDVSEKEQRWGAKTV QGSGHQEHINIHKLRENVFQEHQTLKEKELETGPKASHGYGGKFGVEQDRMDKSAVGH EYQSKLSKHCSQVDSVRGFGGKFGVQMDRVDQSAVGFEYQGKTEKHASQKDYSSGFGG KYGVQADRVDKSAVGFDYQGKTEKHESQRDYSKGFGGKYGIDKDKVDKSAVGFEYQGK TEKHESQKDYVKGFGGKFGVQTDRQDKCALGWDHQEKLQLHESQKDYKTGFGGKFGVQ SERQDSAAVGFDYKEKLAKHESQQDYSKGFGGKYGVQKDRMDKNASTFEDVTQVSSAY QKTVPVEAVTSKTSNIRANFENLAKEKEQEDRRKAEAERAQRMAKERQEQEEARRKLE EQARAKTQTPPVSPAPQPTEERLPSSPVYEDAASFKAELSYRGPVSGTEPEPVYSMEA ADYREASSQQGLAYATEAVYESAEAPGHYPAEDSTYDEYENDLGYTAVALYDYQAAGD DEISFDPDDIITNIEMIDDGWWRGVCKGRYGLFPANYVELRQ" polyA_signal 3226..3230 /gene="EMS1" BASE COUNT 763 a 787 c 964 g 734 t ORIGIN 1 gcctggtgcc tgggagcggc tggcgcggcg gaatccaggg ccgacccggg ccggaccgac 61 cccaggcggc gacggaatca gtccccaatg cctggaaatt cctcattgga ttactgtgtt 121 ttaaacagaa tttcgtgaac agccttttat ctccaagcgg aaagaaagat gtggaaagct 181 tcagcaggcc acgctgtgtc catcgcccag gatgacgcgg gggccgatga ctgggagacc 241 gaccctgatt ttgtgaatga tgtgagtgag aaggagcaaa gatggggtgc caagacggtg 301 cagggctccg ggcaccagga gcatatcaac atacacaagc tgagggagaa tgtctttcaa 361 gagcatcaga cccttaagga gaaggaactt gaaacaggac caaaagcttc ccatggctat 421 ggagggaaat ttggtgtgga acaagaccga atggataagt cagctgtcgg ccacgaatat 481 cagtcgaaac tttccaagca ctgctcgcag gtggactcgg tccgtggctt cggaggcaag 541 tttggtgtcc agatggacag agttgatcag tctgctgtag gctttgaata ccaggggaag 601 actgagaagc atgcctccca gaaagactac tccagtggtt ttggcggcaa gtatggcgtg 661 caggccgacc gagtagacaa gagcgcggtg ggcttcgact accagggcaa gacggagaag 721 cacgagtcac agagagatta ctccaaaggt ttcggcggca aatacggtat cgacaaggac 781 aaagtggata agagcgccgt tggctttgag tatcaaggca aaacggagaa gcacgagtcc 841 cagaaagact atgtgaaagg gtttggagga aaatttggtg tgcagacaga cagacaagac 901 aaatgtgccc ttggctggga tcaccaggag aaattgcagc tgcatgaatc ccaaaaagat 961 tataagactg gttttggagg caaattcggt gttcagtcgg agaggcagga ctccgctgct 1021 gtggggtttg attacaagga gaagctggcc aagcacgagt cccagcaaga ctactccaaa 1081 ggattcggcg ggaagtatgg ggtgcagaag gatcggatgg ataagaatgc gtcaaccttt 1141 gaggatgtca cccaggtgtc ctctgcctac cagaagacag tacctgtcga agctgtgacc 1201 agcaaaacaa gtaacatcag agctaacttt gaaaacctcg ctaaggagaa agagcaggag 1261 gacaggcgga aggcggaggc ggagagagcc cagcggatgg ccaaggagcg gcaggagcag 1321 gaagaggcca ggaggaagct ggaggagcaa gccagagcca aaacgcaaac gccccctgtg 1381 tcgcccgcac ctcagccaac cgaggagagg ctgccctcga gccccgtcta tgaggatgcg 1441 gcttccttca aggcagagct gagctacaga ggccctgtga gtgggacgga gccggagccc 1501 gtgtacagca tggaggccgc tgactaccga gaggccagca gccagcaggg cctggcctat 1561 gccacagagg ctgtctatga aagcgcagag gccccgggcc actatcccgc agaggacagc 1621 acctacgatg agtacgagaa cgatctgggg tacacagccg tcgccctgta cgactaccag 1681 gctgcgggcg atgatgagat ctcatttgac cctgatgaca tcatcaccaa catcgagatg 1741 attgacgacg gctggtggcg cggggtgtgc aagggccggt acgggctctt cccagccaac 1801 tatgtggagc tgcggcagta gggcccccag cccccccccg gagctggcgc cctggatcct 1861 cacactacag atcaggcctt ctttggttct tgggtggttt tgggtttttt ctgttttttt 1921 tttttttttt tttttttttt tttgaaggtg gggaggggaa tatacacatt gcttttatat 1981 ttaatacttt tgctgatgct tttgaaaatg tttatgccac agaatttgct aatatattgt 2041 aatcacattc cttaggagga ctttggtaat tggttttatg cattgatggt tttttttttc 2101 ttttttgcca aattgactgt cacgcggcag cttcagggag ctcgcattct cttgtgttcg 2161 tgttgccctc gtgcccatca agtgcagtcg ggacctccca ggacaagcac gagcctcagg 2221 tcggccctgt ggcgggtagg caggaaggac tgtcccagac gaggggcttc ctctagagtc 2281 tcactgctgg ggaggagagg actgggcctg atggaagtta acccggagct aagtcaccca 2341 gagcacagga gctgccatgt cagatgggaa atctgcctat gtcataccgt gacagcccgc 2401 aggatcaggt gacttctagc agagaccctg gtttttttcc tgtgcccact ccggcttgtc 2461 ctcatctcta cccatcccct gatgcccagg tcaccgggag ggctgctggg agcctctcct 2521 gtccccgccg gcagtgtcac tgagtccttg aaatcctccc ctgcccgcgg gtctctggat 2581 tgggacgcac agtgcagttg aggtctgcgt cgggcttggc ttttcacaaa ggctgatgtc 2641 ttaactgtca cccatatggt ccctgggcca ccgggcagcc tggggcggtg tgtgtgccat 2701 gtcacagcat ggcctctcgg ccttgggaag gaaggcagtg tgcctgctct gctgtgagcc 2761 gccaggaacc ctcctcctgt caatgggggt gtagtatttt tgccaaaata tcatgttcaa 2821 tttcagtagt ttgatcagtt gaaggctaga agtgtgaagt gcagatgagt gtgtgttctt 2881 ccccaaggtc cccccacagc tccaggacac cgctgtcctg gcatttgtgg ccactcactt 2941 tgtaggaaac tcatctcctt cctgaggagc cgggaggctg gaccagtccc gtcgtgcagt 3001 caggtgggcg gtgtgtcttt ccagaaggtc acgtggaaat gtctcgggac ttgggtcccg 3061 gagtgcccgt gaagcgtgtt tttgctcctg aggtgcattt tctcatcatc cttgctttac 3121 cacaatgagc aatgaggtcg ggttttatat gcaacttatt gtatctgaat tcctgtaggc 3181 acaccctcca tagggtatga ttttttttaa attaaagaat tcagaataaa cattttttga 3241 tccaaaaa // LOCUS HUMENIGMA 1725 bp DNA PRI 10-NOV-1994 DEFINITION Human enigma gene, complete cds. ACCESSION L35240 NID g561636 KEYWORDS enigma protein. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1725) AUTHORS Wu,R.Y. and Gill,G.N. TITLE LIM domain recognition of a tyrosine-containing tight turn JOURNAL J. Biol. Chem. 269 (40), 25085-25090 (1994) MEDLINE 95014287 FEATURES Location/Qualifiers source 1..1725 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="SKNMC" /cell_type="neuroblastoma" CDS 85..1452 /note="bp 1099-1251: LIM domain, LIM1; bp 1276-1434: LIM domain, LIM2" /codon_start=1 /product="enigma protein" /db_xref="PID:g561637" /translation="MDSFKVVLEGPAPWGFRLQGGKDFNVPLSISRLTPGGKAAQAGV AVGDWVLSIDGENAGSLTHIEAQNKIRACGERLSLGLSRAQPVQSKPQKASAPAADPP RTPLHPASPSTRRPNPLGPPAPDSPPQQNGQPLRPLVPDASKQRLMENTEDWRPRPGQ ASRVPSASLPTSQAPSSCKTPDEEHLKKSSQVPDRSPSPSLIYTPGALAWPYRPQPYQ PPALGCGPCVCRALCPGQNEHSADPHSQPATPTPLQSRTSIVQAAAGGVPGGGSNNGK TPVCHQCHKVIRGRYLVALGHAYHPEEFVCSQCGKVLEEGGFFEEKGAIFCPPCYDVR YAPSCAKCKKKITGEIMHALKMTWHVHCFTCAACKTPIRNRAFYMEEGVPYCERDYEK MFGTKCHGCDFKIDAGDRFLEALGFSWHDTCFVCAICQINLEGKTFYSKKDRPLCKSH AFSHV" BASE COUNT 355 a 593 c 499 g 278 t ORIGIN 1 gaattcccgt tgctgtcgcc caacgaggct ccctggagcc gacgcagagc agcgccctgg 61 ccgggccaag caggagccgg catcatggat tccttcaaag tagtgctgga ggggccagca 121 ccttgggggt tccggctgca agggggcaag gacttcaatg tgcccctctc catttcccgg 181 ctcactcctg ggggcaaagc ggcgcaggcc ggagtggccg tgggtgactg ggtgctgagc 241 atcgatggcg agaatgcggg tagcctcaca cacatcgaag ctcagaacaa gatccgggcc 301 tgcggggagc gcctcagcct gggcctcagc agggcccagc cggttcagag caaaccgcag 361 aaggcctccg cccccgccgc ggaccctccg cgtacacctt tgcacccagc gtctccctca 421 acaagacggc ccaacccttt gggccccccc gcccctgaca gccccccgca gcagaatgga 481 cagccgctcc gaccgctggt cccagatgcc agcaagcagc ggctgatgga gaacacagag 541 gactggcggc cgcggccggg acaggccagt cgcgttcctt ccgcatcctt gcccacctca 601 caggcaccga gttcatgcaa gaccccggat gaggagcacc tgaagaaatc aagccaggtg 661 ccagacagaa gccccagccc cagcctcatc tacaccccag gagccctggc ctggccctac 721 cgcccccagc cctaccagcc gcccgccctg ggctgtggac cctgcgtttg ccgagcgcta 781 tgccccggac aaaacgagca cagtgctgac ccacacagcc agccagccac gcccacgccg 841 ctgcagagcc gcacctccat tgtgcaggca gctgccggag gggtgccagg agggggcagc 901 aacaacggca agactcccgt gtgtcaccag tgccacaagg tcatccgggg ccgctacctg 961 gtggcgctgg gccacgcgta ccacccggag gagtttgtgt gtagccagtg tgggaaggtc 1021 ctggaagagg gtggcttctt tgaggagaag ggcgccatct tctgcccacc atgctatgac 1081 gtgcgctatg cacccagctg tgccaagtgc aagaagaaga ttacaggcga gatcatgcac 1141 gccctgaaga tgacctggca cgtgcactgc tttacctgtg ctgcctgcaa gacgcccatc 1201 cggaacaggg ccttctacat ggaggagggc gtgccctatt gcgagcgaga ctatgagaag 1261 atgtttggca cgaaatgcca tggctgtgac ttcaagatcg acgctgggga ccgcttcctg 1321 gaggccctgg gcttcagctg gcatgacacc tgcttcgtct gtgcgatatg tcagatcaac 1381 ctggaaggaa agaccttcta ctccaagaag gacaggcctc tctgcaagag ccatgccttc 1441 tctcatgtgt gagccccttc tgcccacagc tgccgcggtg gcccctagcc tgaggggcct 1501 ggagtcgtgg ccctgcattt ctgggtaggg ctggcaatgg ttgccttaac cctggctcct 1561 ggcccgagcc tgggctccct ggccctgccc cacccacctt atcctcccac cccactccct 1621 ccaccaccac agcacaccgg tgctggccac accagccccc tttcacctcc agtgccacaa 1681 taaacctgta cccagctgaa aaaaaaaaaa aaaaaaaaac tcgag // LOCUS HUMENL 1680 bp mRNA PRI 31-DEC-1994 DEFINITION Human germline ENL mRNA, complete cds. ACCESSION L04285 NID g182109 KEYWORDS ENL; acute leukemia; chromosomal translocation; homologue; trithorax. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1680) AUTHORS Tkachuk,D.C., Kohler,S. and Cleary,M.L. TITLE Involvement of a homolog of Drosophila trithorax by 11q23 chromosomal translocations in acute leukemias JOURNAL Cell 71 (4), 691-700 (1992) MEDLINE 93046667 FEATURES Location/Qualifiers source 1..1680 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /germline /map="19q" gene 1..1680 /gene="ENL" CDS 1..1680 /gene="ENL" /standard_name="ENL" /note="translocated to HRX in t(11;19) leukemia" /codon_start=1 /db_xref="PID:g182110" /translation="MDNQCTVQVRLELGHRAQLRKKPTTEGFTHDWMVFVRGPEQCDI QHFVEKVVFWLHDSFPKPRRVCKEPPYKVEESGYAGFIMPIEVHFKNKEEPRKVCFTY DLFLNLEGNPPVNHLRCEKLTFNNPTTEFRYKLLRAGGVMVMPEGADTVSRPSPDYPM LPTIPLSAFSDPKKTKPSHGSKDANKESSKTSKPHKVTKEHRERPRKDSESKSSSKEL EREQAKSSKDTSRKLGEGRLPKEEKAPPPKAAFKEPKMALKETKLESTSPNPGPPPPP PPPPRASSKRPATADSPKPSAKKQKKSSSKGSRSAPGTSPRTSSSSSFSDKKPAKDKS STRGEKVKAESEPREAKKALEVEESNSEDEASFKSESAQSSPSNSSSSSDSSSDSDFE PSQNHSQGPLRSMVEDLQSEESDEDDSSSGEEAAGKTNPGRDSRLSFSDSESDNSADS SLPSREPPPPQKPPPPNSKVSGRRSPESCSKPEKILKKGTYDKAYTDELVELHRRLMA LRERNVLQQIVNLIEETGHFNVTNTTFDFDLFSLDETTVRKLQSCLEAVAT" BASE COUNT 408 a 576 c 480 g 216 t ORIGIN 1 atggacaatc agtgcaccgt ccaggtgagg ttagagctgg ggcatcgcgc ccaactgcgc 61 aagaagccca ccacggaggg gttcactcac gactggatgg tgtttgtccg cggccccgag 121 caatgtgaca tccagcactt cgtggagaag gtggtcttct ggctgcacga cagcttcccc 181 aagcccagac gcgtgtgcaa ggagcccccc tacaaagtag aggagtcggg gtacgctggc 241 ttcatcatgc ccatcgaggt gcacttcaaa aacaaggagg agccgaggaa ggtctgcttc 301 acctacgacc tgttcctgaa cctggaaggc aacccgcccg tgaaccacct gcgctgcgag 361 aagctcacct tcaacaaccc caccacggag ttccggtaca agctcctgcg ggccggcggg 421 gtgatggtaa tgcccgaagg agcagacacg gtgtccaggc ccagtcccga ctaccccatg 481 ttacccacaa ttccactctc tgccttctct gaccccaaga agaccaaacc atcccacggc 541 tccaaggacg ccaacaagga gagcagcaag acctccaagc cacacaaggt gaccaaggag 601 caccgggagc gcccccgcaa agactccgag agcaagagct cctccaagga gctggagcgt 661 gagcaggcca aaagctccaa ggacacctcg cggaagctgg gcgagggccg gctgcccaag 721 gaggagaagg cgccaccgcc caaggctgcc ttcaaggaac ccaagatggc cctgaaagag 781 accaagctgg aaagcacgtc ccccaaccct gggcccccac ccccaccccc acccccaccc 841 cgggcttcca gcaagcggcc ggccaccgcc gactcgccaa agcccagcgc caagaagcag 901 aagaagagca gctcgaaggg gtcccggagt gctccaggca cctcgccccg cacctcctcc 961 tcctcctcct tctcggacaa gaagccggcc aaggacaaga gcagcaccag aggggagaag 1021 gtgaaggccg agagtgagcc ccgggaggcc aaaaaggccc tggaggtgga ggagtccaac 1081 tcagaggacg aggcctcctt caagtccgag tctgcccagt caagcccgtc caactccagc 1141 tccagctcag actccagctc agactcagac ttcgagccat cccagaacca cagccaagga 1201 cccctgcgct ccatggtgga ggacctgcag tccgaggagt ccgacgagga cgactcttcg 1261 tcaggcgagg aggctgccgg caagaccaac ccggggaggg actccaggtt gagcttcagc 1321 gacagcgaga gtgacaacag cgccgactcc tccctgccca gccgtgagcc cccacccccc 1381 cagaagccac ccccgcccaa cagcaaggtg tcaggccgga ggagccccga gtcctgcagc 1441 aagcctgaga agatcctcaa gaagggcacc tacgacaagg cctacacgga tgagctggtg 1501 gagctacacc ggaggctgat ggcgctgcgg gagcgcaacg tgctgcagca gattgtgaat 1561 ctgatcgagg agactggcca cttcaatgtc accaacacca ccttcgactt cgacctcttc 1621 tccctggacg agaccaccgt gcgcaaactg cagagctgcc tggaggccgt ggccacatga // LOCUS HUMENOA 1755 bp mRNA PRI 07-NOV-1994 DEFINITION Human alpha enolase mRNA, complete cds. ACCESSION M14328 NID g182113 KEYWORDS enolase; glycolytic enzyme. SOURCE Human T-cell line Jurkat, cDNA to mRNA, clone pH48. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1755) AUTHORS Giallongo,A., Feo,S., Moore,R., Croce,C.M. and Showe,L.C. TITLE Molecular cloning and nucleotide sequence of a full-length cDNA for human alpha enolase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83 (18), 6741-6745 (1986) MEDLINE 86313654 FEATURES Location/Qualifiers source 1..1755 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA 1..1755 /note="enol mRNA" gene 95..1399 /gene="ENO1" CDS 95..1399 /gene="ENO1" /note="alpha enolase (EC 4.2.1.11)" /codon_start=1 /db_xref="PID:g182114" /translation="MSILKIHAREIFDSRGNPTVEVDLFTSKGLFRAAVPSGASTGIY EALELRDNDKTRYMGKGVSKAVEHINKTIAPALVSKKLNVTEQEKIDKLMIEMDGTEN KSKFGANAILGVSLAVCKAGAVEKGVPLYRHIADLAGNSEVILPVPAFNVINGGSHAG NKLAMQEFMILPVGAANFREAMRIGAEVYHNLKNVIKEKYGKDATNVGDEGGFAPNIL ENKEGLELLKTAIGKAGYTDKVVIGMDVAASEFFRSGKYDLDFKSPDDPSRYISPDQL ADLYKSFIKDYPVVSIEDPFDQDDWGAWQKFTASAGIQVVGDDLTVTNPKRIAKAVNE KSCNCLLLKVNQIGSVTESLQACKLAQANGWGVMVSHRSGETEDTFIADLVVGLCTGQ IKTGAPCRSERLAKYNQLLRIEEELGSKAKFAGRNFRNPLAK" BASE COUNT 406 a 472 c 484 g 393 t ORIGIN 5 bp upstream of BglII site. 1 acggagatct cgccggcttt acgttcacct cggtgtctgc agcaccctcc gcttcctctc 61 ctaggcgacg agacccagtg gctagaagtt caccatgtct attctcaaga tccatgccag 121 ggagatcttt gactctcgcg ggaatcccac tgttgaggtt gatctcttca cctcaaaagg 181 tctcttcaga gctgctgtgc ccagtggtgc ttcaactggt atctatgagg ccctagagct 241 ccgggacaat gataagactc gctatatggg gaagggtgtc tcaaaggctg ttgagcacat 301 caataaaact attgcgcctg ccctggttag caagaaactg aacgtcacag aacaagagaa 361 gattgacaaa ctgatgatcg agatggatgg aacagaaaat aaatctaagt ttggtgcgaa 421 cgccattctg ggggtgtccc ttgccgtctg caaagctggt gccgttgaga agggggtccc 481 cctgtaccgc cacatcgctg acttggctgg caactctgaa gtcatcctgc cagtcccggc 541 gttcaatgtc atcaatggcg gttctcatgc tggcaacaag ctggccatgc aggagttcat 601 gatcctccca gtcggtgcag caaacttcag ggaagccatg cgcattggag cagaggttta 661 ccacaacctg aagaatgtca tcaaggagaa atatgggaaa gatgccacca atgtggggga 721 tgaaggcggg tttgctccca acatcctgga gaataaagaa ggcctggagc tgctgaagac 781 tgctattggg aaagctggct acactgataa ggtggtcatc ggcatggacg tagcggcctc 841 cgagttcttc aggtctggga agtatgacct ggacttcaag tctcccgatg accccagcag 901 gtacatctcg cctgaccagc tggctgacct gtacaagtcc ttcatcaagg actacccagt 961 ggtgtctatc gaagatccct ttgaccagga tgactgggga gcttggcaga agttcacagc 1021 cagtgcagga atccaggtag tgggggatga tctcacagtg accaacccaa agaggatcgc 1081 caaggccgtg aacgagaagt cctgcaactg cctcctgctc aaagtcaacc agattggctc 1141 cgtgaccgag tctcttcagg cgtgcaagct ggcccaggcc aatggttggg gcgtcatggt 1201 gtctcatcgt tcgggggaga ctgaagatac cttcatcgct gacctggttg tggggctgtg 1261 cactgggcag atcaagactg gtgccccttg ccgatctgag cgcttggcca agtacaacca 1321 gctcctcaga attgaagagg agctgggcag caaggctaag tttgccggca ggaacttcag 1381 aaaccccttg gccaagtaag ctgtgggcag gcaagccttc ggtcacctgt tggctacaca 1441 gacccctccc ctcgtgtcag ctcaggcagc tcgaggcccc cgaccaacac ttgcaggggt 1501 ccctgctagt tagcgcccca ccgccgtgga gttcgtaccg cttccttaga acttctacag 1561 aagccaagct ccctggagcc ctgttggcag ctctagcttt tgcagtcgtg taatgggccc 1621 aagtcattgt ttttctcgcc tcactttcca ccaagtgtct agagtcatgt gagcctcgtg 1681 tcatctccgg ggtggccaca ggctagatcc ccggtggttt tgtgctcaaa ataaaaagcc 1741 tcagtgaccc atgag // LOCUS HUMEP3IV 48 bp mRNA PRI 12-JUN-1995 DEFINITION Human prostaglandin E2 receptor EP3 subtype isoform IV mRNA, complete cds. ACCESSION L32662 NID g484163 KEYWORDS prostaglandin E2 receptor. SOURCE Homo sapiens female uterus cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 48) AUTHORS An,S., Yang,J., So,S.W., Zeng,L. and Goetzl,E.J. TITLE Isoforms of the EP3 subtype of human prostaglandin E2 receptor transduce both intracellular calcium and cAMP signals JOURNAL Biochemistry 33 (48), 14496-14502 (1994) MEDLINE 95072021 FEATURES Location/Qualifiers source 1..48 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="female" /tissue_type="uterus" CDS 1..48 /note="EP3 subtype isoform IV" /codon_start=1 /evidence=experimental /product="prostaglandin E2 receptor" /db_xref="PID:g484164" /translation="MRKRRLREQEEFWGN" BASE COUNT 23 a 3 c 14 g 8 t ORIGIN 1 atgagaaaaa gaagactcag agagcaagag gaattttggg gaaattaa // LOCUS HUMEPI 1109 bp mRNA PRI 10-OCT-1995 DEFINITION Human mRNA for epimorphin. ACCESSION D14582 NID g285915 KEYWORDS epimorphin. SOURCE Homo sapiens placenta cDNA to mRNA, clones No.[2,8,A]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1109) AUTHORS Hirai,Y. TITLE Molecular cloning of human epimorphin: identification of isoforms and their unique properties JOURNAL Biochemical and Biophysical Research Communication 191, 1332-1337 (1993) COMMENT Submitted (08-MAR-1993) to DDBJ by: Yohie Hirai Biomaterial Research Institute 1 Taya-cho, Sakae-ku Yokohama 244 Japan Phone: 045-851-9272 Fax: 045-851-9270. FEATURES Location/Qualifiers source 1..1109 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" CDS 96..995 /codon_start=1 /product="epimorphin" /db_xref="PID:d1003947" /db_xref="PID:g303605" /translation="MRDRLPDLTACRKNDDGDTVVVVEKDHFMDDFFHQVEEIRNSID KITQYVEEVKKNHSIILSAPNPEGKIKEELEDLNKEIKKTANKIAAKLKAIEQSFDQD ESGNRTSVDLRIRRTQHSVLSRKFVEAMAEYNEAQTLFRERSKGRIQRQLEITGRTTT DDELEEMLESGKPSIFTSDIISDSQITRQALNEIESRHKDIMKLETSIRELHEMFMDM AMFVETQGEMINNIERNVMNATDYVEHAKEETKKAIKYQSKARRKKWIIIAVSVVLVV YRLFGLSLEYVVRSAASLPGWGN" BASE COUNT 345 a 219 c 302 g 243 t ORIGIN 1 gcggggcctg aggcggagac cggagagccc gcggcccggc cggaggcagc tcgggacagg 61 cttgagcggc ggggcgcgct gcccggccgg cggggatgcg ggaccggctg ccagacctga 121 cggcgtgtag gaagaatgat gatggagaca cagttgttgt ggttgagaaa gatcatttca 181 tggatgattt cttccatcag gtggaggaga ttagaaacag tattgataaa ataactcaat 241 atgttgaaga agtaaagaaa aaccacagca tcattctttc tgcaccaaac ccggaaggaa 301 aaataaaaga agagcttgaa gatctgaaca aagaaatcaa gaaaactgcg aataaaattg 361 cagccaagtt aaaggctatt gaacaaagtt ttgatcagga tgagagtggg aaccggactt 421 cagtggatct tcggatacga agaacccagc attcggtgct gtctcggaag tttgtggaag 481 ccatggcgga gtacaatgag gcacagactc tgtttcggga gcggagcaaa ggccgcatcc 541 agcgccagct ggagataact gggagaacca ccacagacga cgagctagaa gagatgctgg 601 agagcgggaa gccatccatc ttcacttccg acattatatc agattcacaa attactagac 661 aagctctcaa tgaaatcgag tcacgtcaca aggacatcat gaagctggag accagcatcc 721 gagagttgca tgagatgttc atggacatgg ctatgtttgt ggagactcag ggtgaaatga 781 tcaacaacat agaaagaaat gttatgaatg ccacagacta tgtagaacac gctaaagaag 841 aaacaaaaaa agctatcaaa tatcagagca aggcaagaag gaaaaagtgg ataattattg 901 ctgtgtcagt ggttctggtt gtctatcgtc tatttggctt gtcgttggaa tatgttgtac 961 gcagtgctgc ctctctgcca gggtggggaa attgatgttc attatattga agtttgttta 1021 ttgattctca cacatcaaac caccaagatt cctgctgcaa tgaaccaaat cagcatcctg 1081 tcatttcgtg aatgaatctc agacgctgt // LOCUS HUMEPSURAN 2488 bp mRNA PRI 03-MAY-1995 DEFINITION Human surface antigen mRNA, complete cds. ACCESSION M60922 NID g793909 KEYWORDS surface antigen. SOURCE Homo sapiens (library: lambda gt11) epidermis cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2488) AUTHORS Schroeder,W.T., Stewart-Galetka,S., Mandavilli,S., Parry,D.A., Goldsmith,L. and Duvic,M. TITLE Cloning and characterization of a novel epidermal cell surface antigen (ESA) JOURNAL J. Biol. Chem. 269 (31), 19983-19991 (1994) MEDLINE 94327549 FEATURES Location/Qualifiers source 1..2488 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="keratinocyte" /tissue_type="epidermis" /tissue_lib="lambda gt11" /map="17q11-12" CDS 127..1266 /codon_start=1 /product="surface antigen" /db_xref="PID:g793910" /translation="MTLQPRCEDVETAEGVALTVTGVAQVKIMTEKELLAVACEQFLG KNVQDIKNVVLQTLEGHLRSILGTLTVEQIYQDRDQFAKLVREVAAPDVGRMGIEILS FTIKDVYDKVDYLSSLGKTQTAVVQRDADIGVAEAERDAGIREAECKKEMLDVKFMAD TKIADSKRAFELQKSAFSEEVNIKTAEAQLAYELQGAREQQKIRQEEIEIEVVQRKKQ IAVEAQEILRTDKELIATVRRPAEAEAHRIQQIAEGEKVKQVLLAQAEAEKIRKIGEA EAAVIEAMGKAEAERMKLKAEAYQKYGDAAKMALVLEALPQIAAKIAAPLTKVDEIVV LSGDNSKVTSEVNRLLAELPASVHALTGVDLSKIPLIKKATGVQV" polyA_signal 2466..2471 BASE COUNT 499 a 712 c 723 g 554 t ORIGIN 1 cggcccaacg aggcgctggt ggtttcaggg ggctgttgtg gttccgacta taaacagtac 61 gtgtttggcg gctgggcctg ggcctggtgg tgtatctccg acactcagag gatttcccta 121 gagattatga cgttgcagcc ccgctgcgag gacgtagaga cggccgaggg ggtagcttta 181 actgtgacgg gtgtcgccca ggtgaagatc atgacggaga aggaactcct ggccgtggct 241 tgtgagcagt ttctgggtaa gaatgtgcag gacatcaaaa acgtcgtcct gcagaccctg 301 gagggacatc tgcgctccat cctcgggacc ctgacagtgg agcagattta tcaggaccgg 361 gaccagtttg ccaagctggt gcgggaggtg gcagcccctg atgttggccg catgggcatt 421 gagatcctca gcttcaccat caaggacgtg tatgacaaag tggactatct gagctccctg 481 ggcaagacgc agactgccgt ggtgcagaga gatgctgaca ttggcgtggc cgaggctgaa 541 cgggacgcag gcatccggga agctgagtgc aagaaggaga tgctggatgt gaagttcatg 601 gcagacacca agattgctga ctctaagcga gccttcgagc tgcaaaagtc agccttcagt 661 gaggaggtta acatcaagac agctgaggcc cagttggcct atgagctgca gggggcccgt 721 gaacagcaga agatccggca ggaagagatt gagattgagg ttgtgcagcg caagaaacag 781 attgccgtgg aggcacagga gatcctgcgt acggacaagg agctcatcgc tacagtgcgc 841 cggcctgccg aggccgaggc ccaccgcatc cagcagattg ccgagggtga aaaggtgaag 901 caggtcctct tggcacaggc agaggctgag aagatccgca aaatcgggga ggcggaagcg 961 gcagtcatcg aggcgatggg caaggcagag gctgagcgga tgaagctcaa ggcagaagcc 1021 taccagaaat acggggatgc agccaagatg gccttggtgc tagaggccct gccccagatt 1081 gctgccaaaa tcgctgcccc acttaccaag gtcgatgaga ttgtggtcct cagtggagac 1141 aacagtaagg tcacatcaga agtgaaccga ctgctggccg agctgcctgc ctctgtgcat 1201 gccctcacag gcgtggacct gtctaagata cccctgatca agaaggccac tggtgtgcag 1261 gtgtgaggct cctacaggcc cactctcttc agcagccacc cggccctccc tccagcaccc 1321 gttttaatcc cacagaacaa cgggaacgtt actgactctg gtgccttatc tcgaagggac 1381 cagaagtgct gcgtgttcag gccatctctg gctgtcttcc tgtctctcct gtctgtccac 1441 ctcctcctct tcctctcctt taccccactt tcactgccac tttcatcagg tttgtgtctc 1501 atctccctgc gtgtcttttc ctttgtctgt ctttttcttt cccccatgca catcatgtag 1561 attaagctga agatgtttat tacaatcact ctctgtgggg ggtggccctg ctgctcctca 1621 gaatcctggt gccttgaagt tctctgtgca tctgtccatc ctccctatgg ccctggccag 1681 agctcagcat gggcaggggt tctgggtagg acggtcactg tcctctctcc tggactggtc 1741 ttcccagccc taaaccctgc cccaggaagc ccacagcctc acctgctgct gcccctctag 1801 gtctgggcag ccatgacctg cagggcccag agacactgtc cttcccctca tccacccaag 1861 gccccagcca gcgctcatac cctgtccttt ctccctgacc ccaagggcac agaggcaagg 1921 cctcctgtct acagcagctt cctcagtttc ctactgcctt aggaggcccc tgcttgtgct 1981 cagggaaggc ctcttcatgg gcatgttcct gctggggcgg tgcggtttgg tcccaactct 2041 gctaagtttt ctgagatgag ggtctagccc tgttggggac agaaaagtgt gtagaccttc 2101 ttcctgctag ggctgcactg tcctgggtgt tgggcccttc tggtggacaa ggctgtgcca 2161 accctgtaca gaatcgagtg ctgtagcctg gccagacccc agagcccttg tgccatcttt 2221 cttcctggcc agagtgatgg ggttccagcc atggggaagc aacccaatcc tctgtctcct 2281 tgctccaatg gaggcagaag agcccaggac ccaagcgtct tggcaggggt gctgtgaatg 2341 tccagtggtc ccagctcccc accctggccc tgccccagcc tgtgtagctc ttcctgcatg 2401 tggatgctgc atgtctggtc tggggcttgg atgttgcact gccccactgc ctgtcccttc 2461 tggtaaaata aagaactctt aatgcccg // LOCUS HUMERCC1 1097 bp mRNA PRI 07-NOV-1994 DEFINITION Human excision repair protein (ERCC1) mRNA, complete cds, clone pcDE. ACCESSION M13194 NID g567007 KEYWORDS alternative splicing; excision; excision repair protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1097) AUTHORS van Duin,M., de Wit,J., Odijk,H., Westerveld,A., Yasui,A., Koken,H.M., Hoeijmakers,J.H. and Bootsma,D. TITLE Molecular characterization of the human excision repair gene ERCC-1: cDNA cloning and amino acid homology with the yeast DNA repair gene RAD10 JOURNAL Cell 44 (6), 913-923 (1986) MEDLINE 86161680 COMMENT Draft entry and computer readable sequence [1] kindly submitted by M. van Duin. FEATURES Location/Qualifiers source 1..1097 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="SV40 transformed fibroblast" /map="19q13.3" mRNA 1..1097 /gene="ERCC1" /note="G00-119-111" gene 1..1097 /gene="ERCC1" CDS 143..1036 /gene="ERCC1" /codon_start=1 /db_xref="GDB:G00-119-111" /product="excision repair protein" /db_xref="PID:g182174" /translation="MDPGKDKEGVPQPSGPPARKKFVIPLDEDEVPPGVAKPLFRSTQ SLPTVDTSAQAAPQTYAEYAISQPLEGAGATCPTGSEPLAGETPNQALKPGAKSNSII VSPRQRGNPVLKFVRNVPWEFGDVIPDYVLGQSTCALFLSLRYHNLHPDYIHGRLQSL GKNFALRVLLVQVDVKDPQQALKELAKMCILADCTLILAWSPEEAGRYLETYKAYEQK PADLLMEKLEQDFVSRVTECLTTVKSVNKTDSQTLLTTFGSLEQLIAASREDLALCPG LGPQKARRLFDVLHEPFLKVP" polyA_signal 1071..1076 /gene="ERCC1" /note="G00-119-111" polyA_site 1097 /gene="ERCC1" /note="G00-119-111" BASE COUNT 242 a 352 c 312 g 191 t ORIGIN 64 bp upstream of PstI site; chromosome 19q1.3. 1 aagtgctgcg agccctgggc cacgctggcc gtgctggcag tgggccgcct cgatccctct 61 gcagtctttc ccttgaggct ccaagaccag caggtgaggc ctcgcggcgc tgaaaccgtg 121 aggcccggac cacaggctcc agatggaccc tgggaaggac aaagaggggg tgccccagcc 181 ctcagggccg ccagcaagga agaaatttgt gatacccctc gacgaggatg aggtccctcc 241 tggagtggcc aagcccttat tccgatctac acagagcctt cccactgtgg acacctcggc 301 ccaggcggcc cctcagacct acgccgaata tgccatctca cagcctctgg aaggggctgg 361 ggccacgtgc cccacagggt cagagcccct ggcaggagag acgcccaacc aggccctgaa 421 acccggggca aaatccaaca gcatcattgt gagccctcgg cagaggggca atcccgtact 481 gaagttcgtg cgcaacgtgc cctgggaatt tggcgacgta attcccgact atgtgctggg 541 ccagagcacc tgtgccctgt tcctcagcct ccgctaccac aacctgcacc cagactacat 601 ccatgggcgg ctgcagagcc tggggaagaa cttcgccttg cgggtcctgc ttgtccaggt 661 ggatgtgaaa gatccccagc aggccctcaa ggagctggct aagatgtgta tcctggccga 721 ctgcacattg atcctcgcct ggagccccga ggaagctggg cggtacctgg agacctacaa 781 ggcctatgag cagaaaccag cggacctcct gatggagaag ctagagcagg acttcgtctc 841 ccgggtgact gaatgtctga ccaccgtgaa gtcagtcaac aaaacggaca gtcagaccct 901 cctgaccaca tttggatctc tggaacagct catcgccgca tcaagagaag atctggcctt 961 atgcccaggc ctgggccctc agaaagcccg gaggctgttt gatgtcctgc acgagccctt 1021 cttgaaagta ccctgatgac cccagctgcc aaggaaaccc ccagtgtaat aataaatcgt 1081 cctcccaggc caggctc // LOCUS HUMERCC3A 2751 bp mRNA PRI 07-NOV-1994 DEFINITION Human DNA repair helicase (ERCC3) mRNA, complete cds. ACCESSION M31899 NID g182178 KEYWORDS Cockayne's syndrome; DNA repair protein; excision repair protein; helicase. SOURCE Human lymphoid cell line K562 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2751) AUTHORS Weeda,G., van Ham,R.C., Vermeulen,W., Bootsma,D., van der Eb,A.J. and Hoeijmakers,J.H. TITLE A presumed DNA helicase encoded by ERCC-3 is involved in the human repair disorders xeroderma pigmentosum and Cockayne's syndrome JOURNAL Cell 62 (4), 777-791 (1990) MEDLINE 90352711 REFERENCE 2 (sites) AUTHORS Weeda,G., van Ham,R.C., Masurel,R., Westerveld,A., Odijk,H., de Wit,J., Bootsma,D., van der Eb,A.J. and Hoeijmakers,J.H. TITLE Molecular cloning and biological characterization of the human excision repair gene ERCC-3 JOURNAL Mol. Cell. Biol. 10 (6), 2570-2581 (1990) MEDLINE 90258842 COMMENT [2] sites. Draft entry and computer-readable sequence for [2] kindly submitted by G. Weeda, 07-FEB-1990, for release after publication. FEATURES Location/Qualifiers source 1..2751 /organism="Homo sapiens" /db_xref="taxon:9606" /map="2q21" gene 96..2444 /gene="ERCC3" CDS 96..2444 /gene="ERCC3" /note="DNA repair helicase" /codon_start=1 /db_xref="GDB:G00-119-881" /db_xref="PID:g182179" /translation="MGKRDRADRDKKKSRKRHYEDEEDDEEDAPGNDPQEAVPSAAGK QVDESGTKVDEYGAKDYRLQMPLKDDHTSRPLWVAPDGHIFLEAFSPVYKYAQDFLVA IAEPVCRPTHVHEYKLTAYSLYAAVSVGLQTSDITEYLRKLSKTGVPDGIMQFIKLCT VSYGKVKLVLKHNRYFVESCHPDVIQHLLQDPVIRECRLRNSEGEATELITETFTSKS AISKTAESSGGPSTSRVTDPQGKSDIPMDLFDFYEQMDKDEEEEEETQTVSFEVKQEM IEELQKRCIHLEYPLLAEYDFRNDSVNPDINIDLKPTAVLRPYQEKSLRKMFGNGRAR SGVIVLPCGAGKSLVGVTAACTVRKRCLVLGNSAVSVEQWKAQFKMWSTIDDSQICRF TSDAKDKPIGCSVAISTYSMLGHTTKRSWEAERVMEWLKTQEWGLMILDEVHTIPAKM FRRVLTIVQAHCKLGLTATLVREDDKIVDLNFLIGPKLYEANWMELQNNGYIAKVQCA EVWCPMSPEFYREYVAIKTKKRILLYTMNPNKFRACQFLIKFHERRNDKIIVFADNVF ALKEYAIRLNKPYIYGPTSQGERMQILQNFKHNPKINTIFISKVGDTSFDLPEANVLI QISSHGGSRRQEAQRLGRVLRAKKGMVAEEYNAFFYSLVSQDTQEMAYSTKRQRFLVD QGYSFKVITKLAGMEEEDLAFSTKEEQQQLLQKVLAATDLDAEEEVVAGEFGSRSSQA SRRFGTMSSMSGADDTVYMEYHSSRSKAPSKHVHPLFKRFRK" BASE COUNT 727 a 668 c 726 g 630 t ORIGIN 1 gggagcttcc ggattgagcc ggaagtcccc ccagagcgga tgccgcggcg ggcctgtggg 61 agcggggtca tcttctctct gctgctgtag ctgccatggg caaaagagac cgagcggacc 121 gcgacaagaa gaaatccagg aagcggcact atgaggatga agaggatgat gaagaggacg 181 ccccggggaa cgaccctcag gaagcggttc cctcggcggc ggggaagcag gtggatgagt 241 caggcaccaa agtggatgaa tatggagcca aggactacag gctgcaaatg ccgctgaagg 301 acgaccacac ctccaggccc ctctgggtgg ctcccgatgg ccatatcttc ttggaagcct 361 tctctccagt ttacaaatat gcccaagact tcttggtggc tattgcagag ccagtgtgcc 421 gaccaaccca tgtgcatgag tacaaactaa ctgcctactc cttgtatgca gctgtcagcg 481 ttgggctgca aaccagtgac atcaccgagt acctcaggaa gctcagcaag actggagtcc 541 ctgatggaat tatgcagttt attaagttgt gtactgtcag ctatggaaaa gtcaagctgg 601 tcttgaagca caacagatac ttcgttgaaa gttgccaccc tgatgtaatc cagcatcttc 661 tccaggaccc cgtgatccga gaatgccgct taagaaactc tgaaggggag gccactgagc 721 tcatcacaga gactttcaca agcaaatctg ccatttctaa gactgctgaa agcagtggtg 781 ggccctccac ttcccgagtg acagatccac agggtaaatc tgacatcccc atggacctgt 841 ttgacttcta tgagcaaatg gacaaggatg aagaagaaga agaagagaca cagacagtgt 901 cttttgaagt caagcaggaa atgattgagg aactccagaa acgttgcatc cacctggagt 961 accctctgtt ggcagaatat gacttccgga atgattctgt caaccctgat atcaacattg 1021 acctaaagcc cacagctgtc ctcagaccct atcaggagaa gagcttgcga aagatgtttg 1081 gaaacgggcg tgcacgttcg ggggtcattg ttcttccctg cggtgctgga aagtccctgg 1141 ttggtgtgac tgctgcatgc actgtcagaa aacgctgtct ggtgctgggc aactcagctg 1201 tttctgtgga gcagtggaaa gcccagttca agatgtggtc caccattgac gacagccaga 1261 tctgccggtt cacctccgat gccaaggaca agcccatcgg ctgctccgtt gccattagca 1321 cctactccat gctgggccac accaccaaaa ggtcctggga ggccgagcga gtcatggagt 1381 ggctcaagac ccaggagtgg ggcctcatga tcctggatga agtgcacacc ataccagcca 1441 agatgttccg aagggtgctc accatcgtgc aggcccactg taagctgggt ttgactgcga 1501 ccctcgtccg cgaagatgac aaaattgtgg atttaaattt tctgattggg cctaagctct 1561 acgaagccaa ctggatggag ctgcagaata atggctacat cgccaaagtc cagtgtgctg 1621 aggtctggtg ccctatgtct cctgaatttt accgggaata tgtggcaatc aaaaccaaga 1681 aacgaatctt gctgtacacc atgaacccca acaaatttag agcttgccag tttctgatca 1741 agtttcatga aaggaggaat gacaagatta ttgtctttgc tgacaatgtg tttgccctaa 1801 aggaatatgc cattcgactg aacaaaccct atatctacgg acctacgtct cagggggaaa 1861 ggatgcaaat tctccagaat ttcaagcaca accccaaaat taacaccatc ttcatatcca 1921 aggtaggtga cacttcgttt gatctgccgg aagcaaatgt cctcattcag atctcatccc 1981 atggtggctc caggcgtcag gaagcccaaa ggctagggcg ggtgcttcga gctaaaaaag 2041 ggatggttgc agaagagtac aatgcctttt tctactcact ggtatcccag gacacacagg 2101 aaatggctta ctcaaccaag cggcagagat tcttggtaga tcaaggttat agcttcaagg 2161 tgatcacgaa actcgctggc atggaggagg aagacttggc gttttcgaca aaagaagagc 2221 aacagcagct cttacagaaa gtcctggcag ccactgacct ggatgccgag gaggaggtgg 2281 tggctgggga atttggctcc agatccagcc aggcatctcg gcgctttggc accatgagtt 2341 ctatgtctgg ggccgacgac actgtgtaca tggagtacca ctcatcgcgg agcaaggcgc 2401 ccagcaaaca tgtacacccg ctcttcaagc gctttaggaa atgatgctta ggcagggtac 2461 ttcgttcaag accggcgctt ggcacccttg ttggaaaggg attttcagca taacattttc 2521 cttccacctc tttgaccttc cctccagcgt tggccaaatt gtgctgagga agatgcatca 2581 agggcttggc tgtgccttca taggtcatct agggttttat aaaggaggag gagacaatat 2641 tttttcaaac tttttgggga gtggggtcat ttctgtatat aaaaaatgtt aatatttaag 2701 gtgtatttat gttaccgttc tgaataaaca gaatggacca ttgaaccagt a // LOCUS HUMERCC6A 4714 bp mRNA PRI 08-NOV-1994 DEFINITION Human excision repair protein ERCC6 mRNA, complete cds. ACCESSION L04791 NID g182180 KEYWORDS active gene repair; helicase. SOURCE Homo sapiens testis, placenta mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4714) AUTHORS Troelstra,C., van Gool,A., de Wit,J., Vermeulen,W., Bootsma,D. and Hoeijmakers,J.H. TITLE ERCC6, a member of a subfamily of putative helicases, is involved in Cockayne's syndrome and preferential repair of active genes JOURNAL Cell 71 (6), 939-953 (1992) MEDLINE 93092214 FEATURES Location/Qualifiers source 1..4714 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="testis, placenta" /map="10q11-21" gene 80..4561 /gene="ERCC6" CDS 80..4561 /gene="ERCC6" /standard_name="excision repair cross complementing rodent repair deficiency, complementation group 6" /note="putative casein kinase II phosphorylation site bp 1526..1549 and 3281..3301; putative helicase domains I-VI bp 1658..2929; putative nucleotide binding fold bp 3479..3493; homology to helicase subfamily bp 1658..3100" /codon_start=1 /db_xref="GDB:G00-119-882" /product="excision repair protein" /db_xref="PID:g182181" /translation="MPNEGIPHSSQTQEQDCLQSQPVSNNEEMAIKQESGGDGEVEEY LSFRSVGDGLSTSAVGCASAAPRRGPALLHIDRHQIQAVEPSAQALELQGLGVDVYDQ DVLEQGVLQQVDNAIHEASRASQLVDVEKEYRSVLDDLTSCTTSLRQINKIIEQLSPQ AATSRDINRKLDSVKRQKYNKEQQLKKITAKQKHLQAILGGAEVKIELDHASLEEDAE PGPSSLGSMLMPVQETAWEELIRTGQMTPFGTQIPQKQEKKPRKIMLNEASGFEKYLA DQAKLSFERKKQGCNKRAARKAPAPVTPPAPVQNKNKPNKKARVLSKKEERLKKHIKK LQKRALQFQGKVGLPKARRPWESDMRPEAEGDSEGEESEYFPTEEEEEEEDDEVEGAE ADLSGDGTDYELKPLPKGGKRQKKVPVQEIDDDFFPSSGEEAEAASVGEGGGGGRKVG RYRDDGDEDYYKQRLRRWNKLRLQDKEKRLKLEDDSEESDAEFDEGFKVPGFLFKKLF KYQQTGVRWLWELHCQQAGGILGDEMGLGKTIQIIAFLAGLSYSKIRTRGSNYRFEGL GPTVIVCPTTVMHQWVKEFHTWWPPFRVAILHETGSYTHKKEKLIRDVAHCHGILITS YSYIRLMQDDISRYDWHYVILDEGHKIRNPNAAVTLACKQFRTPHRIILSGSPMQNNL RELWSLFDFIFPGKLGTLPVFMEQFSVPITMGGYSNASPVQVKTAYKCACVLRDTINP YLLRRMKSDVKMSLSLPDKNEQVLFCRLTDEQHKVYQNFVDSKEVYRILNGEMQIFSG LIALRKICNHPDLFSGGPKNLKGLPDDELEEDQFGYWKRSGKMIVVESLLKIWHKQGQ RVLLFSQSRQMLDILEVFLRAQKYTYLKMDGTTTIASRQPLITRYNEDTSIFVFLLTT RVGGLGVNLTGANRVVIYDPDWNPSTDTQARERAWRIGQKKQVTVYRLLTAGTIEEKI YHRQIFKQFLTNRVLKDPKQRRFFKSNDLYELFTLTSPDASQSTETSAIFAGTGSDVQ TPKCHLKRRIQPAFGADHDVPKRKKFPASNISVNDATSSEEKSEAKGAEVNAVTSNRS DPLKDDPHMSSNVTSNDRLGEETNAVSGPEELSVISGNGECSNSSGTGKTSMPSGDES IDEKLGLSYKRERPSQAQTEAFWENKQMENNFYKHKSKTKHHSVAEEETLEKHLRPKQ KPKNSKHCRDAKFEGTRIPHLVKKRRYQKQDSENKSEAKEQSNDDYVLEKLFKKSVGV HSVMKHDAIMDGASPDYVLVEAEANRVAQDALKALRLSRQRCLGAVSGVPTWTGHRGI SGAPAGKKSRFGKKRNSNFSVQHPSSTSPTEKCQDGIMKKEGKDNVPEHFSGRAEDAD SSSGPLASSSLLAKMRARNHLILPERLESESGHLQEASALLPTTEHDDLLVEMRNFIA FQAHTDGQASTREILQEFESKLSASQSCVFRELLRNLCTFHRTSGGEGIWKLKPEYC" misc_signal 1475..1522 /gene="ERCC6" /standard_name="bipartite nuclear location signal" /note="G00-119-882; putative" misc_signal 3191..3244 /gene="ERCC6" /standard_name="bipartite nuclear location signal" /note="G00-119-882; putative" BASE COUNT 1433 a 993 c 1220 g 1068 t ORIGIN 1 tgggttccaa ggcggctggc ggcggtagcg tctctgtttc cttgtgggcg ctcgcgcggc 61 cctgggtagt ctgtagagaa tgccaaatga gggaatcccc cactcaagtc aaactcagga 121 gcaagactgt ttacagagtc aacctgtcag taataatgaa gaaatggcaa tcaagcaaga 181 aagtggtggt gatggggagg tggaggagta cctgtccttt cgttctgtgg gtgacgggct 241 gtccacctct gctgtggggt gcgcatcagc agctccgagg agagggccag ccctgctgca 301 catcgaccga catcagatcc aggcagtaga gcctagcgcc caggcccttg agctgcaggg 361 tttgggtgtg gacgtctatg accaggacgt gctggaacag ggagtgcttc agcaggtgga 421 caatgccatc catgaggcca gccgtgcctc ccagctcgtt gacgtggaga aggagtatcg 481 gtcggtcctg gatgacctca cgtcatgtac gacatcccta aggcaaatca ataaaattat 541 tgaacagctt agccctcaag ctgccaccag cagagacatc aacaggaaac tagattctgt 601 aaaacgacag aagtataata aggaacaaca gctaaaaaag atcactgcaa aacaaaagca 661 tctccaggcc atccttggag gagcagaggt gaaaattgaa ctagatcacg ccagtctgga 721 ggaggatgca gagccggggc catccagtct tggcagcatg ctcatgcctg tccaggagac 781 tgcctgggaa gagctcatcc gcactggcca gatgacacct tttggtaccc agatccctca 841 gaaacaggag aaaaagccca gaaaaatcat gcttaatgaa gcatcaggct tcgaaaagta 901 tttggcagat caagcaaaac tgtcttttga aaggaagaag caaggttgta ataaaagagc 961 agctagaaaa gctccagccc cagtcacgcc tccagcccca gtgcaaaata aaaacaaacc 1021 aaacaagaaa gccagagttc tgtccaaaaa agaggagcgt ttgaaaaagc acatcaagaa 1081 actccagaag agggctttgc agttccaggg gaaagtggga ttgccaaagg caaggagacc 1141 ttgggagtca gacatgaggc cagaggcaga gggagactct gagggtgaag agtctgagta 1201 tttccccaca gaggaggagg aagaggagga agatgacgag gtggaggggg cagaggcgga 1261 cctgtctgga gatggtactg actatgagct gaagcctctg cccaagggcg ggaaacggca 1321 gaagaaagtg ccagtgcagg agattgatga tgactttttc ccaagttctg gggaagaagc 1381 tgaagctgct tctgtaggag aaggaggagg aggaggtcgg aaagtgggaa gataccgaga 1441 tgatggagat gaagattatt ataagcagcg gttaaggaga tggaataaac tgagactgca 1501 ggacaaagag aaacgtctga agctggagga cgattctgag gaaagtgatg ctgaatttga 1561 cgaaggtttt aaagtgccag gttttctgtt caaaaagctt tttaagtacc agcagacagg 1621 tgttaggtgg ctgtgggaat tgcactgcca gcaggcagga ggaattctgg gagatgaaat 1681 gggattgggc aagaccatcc agataattgc cttcttggca ggtctgagct acagcaagat 1741 caggactcgt ggttcaaatt acaggtttga ggggttgggt ccaactgtaa ttgtctgtcc 1801 aacaacagtg atgcatcagt gggtgaagga atttcacacg tggtggcctc cgttcagagt 1861 ggcaattcta catgaaaccg gttcctatac ccacaaaaag gagaaactaa ttcgagatgt 1921 tgctcattgt catggaattt tgatcacatc ttactcctac attcgattga tgcaggatga 1981 cattagcagg tatgactggc actatgtgat cttggacgaa ggacacaaaa ttcgaaatcc 2041 aaatgctgct gtcacccttg cttgcaaaca gtttcgcacc cctcatcgga tcattctgtc 2101 tggctcaccg atgcaaaata acctccgaga gctgtggtcg ctctttgact tcatcttccc 2161 gggaaagtta ggcacgttgc ctgtgtttat ggagcagttc tccgtcccca tcaccatggg 2221 gggatattca aatgcttccc cagtacaggt caaaactgct tacaagtgtg catgtgtctt 2281 acgagatacc ataaatccat acctactgcg gagaatgaag tcagatgtca agatgagcct 2341 ttctttgcca gataaaaatg aacaggtctt attttgccgt cttacagatg agcagcataa 2401 agtctaccaa aatttcgttg attccaaaga agtttacagg attctcaatg gagagatgca 2461 gattttctcc ggacttatag ccctaagaaa aatttgcaac caccctgatc tcttttctgg 2521 aggtcccaag aatctcaaag gtcttcctga tgatgaacta gaagaagatc agtttgggta 2581 ctggaaacgt tctgggaaaa tgattgttgt tgagtctttg ttgaaaatat ggcacaagca 2641 gggtcagcga gtattgctgt tttctcagtc aaggcagatg ctggacatac ttgaagtatt 2701 ccttagagcc caaaagtata cctatctcaa gatggatggt accactacaa tagcttcaag 2761 acagccactg attacgagat acaatgagga cacatccata tttgtgtttc ttctgaccac 2821 gcgggtgggc ggcttaggtg tcaacctgac gggggcaaac agagttgtca tctatgaccc 2881 agactggaac ccaagcacgg acacgcaggc ccgggagcga gcatggagaa taggccagaa 2941 gaagcaagtg actgtgtaca ggctcctgac tgcgggcacc attgaagaaa agatctacca 3001 ccgacaaatc ttcaagcagt ttttgacaaa tagagtgcta aaagacccaa aacaaaggcg 3061 gtttttcaaa tccaatgatc tctatgagct atttactctg actagtcctg atgcatccca 3121 gagcactgaa acaagtgcaa tttttgcagg aactggatca gatgttcaga cacccaaatg 3181 ccatctaaaa agaaggattc aaccagcctt tggagcagac catgatgttc caaaacgcaa 3241 gaagttccct gcttctaaca tatctgtaaa tgatgccaca tcatctgaag agaaatctga 3301 ggctaaagga gctgaagtaa atgcagtaac ttctaatcga agtgatcctt tgaaagatga 3361 ccctcacatg agtagtaatg taactagcaa tgataggctt ggagaagaga caaatgcagt 3421 atctggacca gaagagttgt cagtgattag tggaaatggg gaatgttcaa attcttcagg 3481 aacaggcaaa acttctatgc catctggtga tgaaagcatt gatgaaaagt taggtctttc 3541 ttacaaaaga gaaagaccca gccaggctca aacagaagct ttttgggaga ataaacaaat 3601 ggaaaataat ttttataagc acaagtcaaa aacaaaacat catagtgtgg cagaagaaga 3661 gaccctggag aaacatctga gaccaaagca aaagcctaag aactctaagc attgcagaga 3721 cgccaagttt gaaggaactc gaattccaca cctggtgaag aaaaggcgtt accagaagca 3781 agacagtgaa aacaagagtg aggccaagga acagagcaat gacgattatg ttttggaaaa 3841 gcttttcaaa aaatcagttg gcgtgcacag tgtcatgaag cacgatgcca tcatggatgg 3901 agccagccca gattatgtac tggtggaggc agaagccaac cgagtggccc aggatgccct 3961 gaaagcactg aggctctctc gtcagcggtg tctgggagca gtgtctggtg ttcccacctg 4021 gactggccac agggggattt ctggtgcacc agcaggaaaa aagagtagat ttggtaagaa 4081 aaggaattct aacttctctg tgcagcatcc ttcatcaaca tctccaacag agaagtgcca 4141 ggatggcatc atgaaaaagg agggaaaaga taatgtccct gagcatttta gtggaagagc 4201 agaagatgca gactcttcat ccgggcccct cgcttcctcc tcactcttgg ctaaaatgag 4261 agctagaaac cacctgattc tgccagagcg tttagaaagt gaaagcgggc acctgcagga 4321 agcttctgcc ctgctgccca ccacagaaca cgatgacctt ctggtggaga tgagaaactt 4381 catcgctttc caggcccaca ctgatggcca ggccagcacc agggagatac tgcaggagtt 4441 tgaatccaag ttatctgcat cacagtcttg tgtcttccga gaactattga gaaatctgtg 4501 cactttccat agaacttctg gtggtgaagg aatttggaaa ctcaagccag aatactgcta 4561 aacaacattg cttcctaaac tttcaagtcc ctttttctaa cgggcatttc tgattattaa 4621 tttattatta ataatcatgt ttgtcaatgg aagttggctg cacttgatgt ttgtttgcat 4681 gatgtctacc tcagaattaa aactttaagg aagg // LOCUS HUMERFP 2304 bp mRNA PRI 13-DEC-1994 DEFINITION Human mRNA for estrogen responsive finger protein, complete cds. ACCESSION D21205 NID g458725 KEYWORDS estrogen responsive finger protein. SOURCE Homo sapiens placenta cDNA to mRNA, clone lambda C3. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2304) AUTHORS Inoue,S., Orimo,A., Hosoi,T., Kondo,S., Toyoshima,H., Kondo,T., Ikegami,A., Ouchi,Y., Orimo,H. and Muramatsu,M. TITLE Genomic binding-site cloning reveals an estrogen-responsive gene that encodes a RING finger protein JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90 (23), 11117-11121 (1993) MEDLINE 94068555 REFERENCE 2 (bases 1 to 2304) AUTHORS Inoue,S. TITLE Direct Submission JOURNAL Submitted (18-OCT-1993) to the DDBJ/EMBL/GenBank databases. Satoshi Inoue, Faculty of Medicine, University of Tokyo, Department of Geriatrics; 7-3-1 Hongo, Bunkyo-ku, Tokyo 113, Japan (E-mail:U10AINO@JPNUMIN.BITNET, Tel:03-3815-5411(ex.8344), Fax:03-5689-2483) COMMENT Submitted (18-Oct-1993) to DDBJ by: Satoshi Inoue Department of Geriatrics Faculty of Medicine University of Tokyo 7-3-1 Hongo, Bunkyo-ku Tokyo 113 Japan Phone: 03-3815-5411 x8344 Fax: 03-5689-2483 Email: U10AINO@JPNUMIN.BITNET. FEATURES Location/Qualifiers source 1..2304 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" CDS 40..1932 /codon_start=1 /product="estrogen responsive finger protein (efp)" /db_xref="PID:d1005279" /db_xref="PID:g458726" /translation="MAELCPLAEELSCSICLEPFKEPVTTPCGHNFCGSCLNETWAVQ GSPYLCPQCRAVYQARPQLHKNTVLCNVVEQFLQADLAREPPADVWTPPARASAPSPN AQVACDHCLKEAAVKTCLVCMASFCQEHLQPHFDSPAFQDHPLQPPVRDLLRRKCSQH NRLREFFCPEHSECICHICLVEHKTCSPASLSQASADLEATLRHKLTVMYSQINGASR ALDDVRNRQQDVRMTANRKVEQLQQEYTEMKALLDASETTSTRKIKEEEKRVNSKFDT IYQILLKKKSEIQTLKEEIEQSLTKRDEFEFLEKASKLRGISTKPVYIPEVELNHKLI KGIHQSTIDLKNELKQCIGRLQELTPSSGDPGEHDPASTHKSTRPVKKVSKEEKKSKK PPPVPALPSKLPTFGAPEQLVDLKQAGLEAAAKATSSHPNSTSLKAKVLETFLAKSRP ELLEYYIKVILDYNTAHNKVALSECYTVASVAEMPQNYRPHPQRFTYCSQVLGLHCYK KGIHYWEVELQKNNFCGVGICYGSMNRQGPESRLGRNSASWCVEWFNTKISAWHNNVE KTLPSTKATRVGVLLNCDHGFVIFFAVADKVHLMYKFRVDFTEALYPAFWVFSAGATL SICSPK" BASE COUNT 544 a 673 c 659 g 428 t ORIGIN 1 cgcgggtgca gcagttgtgt cccgacccct gggagcgcca tggcagagct gtgccccctg 61 gccgaggagc tgtcgtgctc catctgcctg gagcccttca aggagccggt caccactccg 121 tgcggccaca acttctgcgg gtcgtgcctg aatgagacgt gggcagtcca gggctcgcca 181 tacctgtgcc cgcagtgccg cgccgtctac caggcgcgac cgcagctgca caagaacacg 241 gtgctgtgca acgtggtgga gcagttcctg caggccgacc tggcccggga gccacccgcc 301 gacgtctgga cgccgcccgc ccgcgcctct gcacccagcc cgaatgccca ggtggcctgc 361 gaccactgcc tgaaggaggc cgccgtgaag acgtgcttgg tgtgcatggc ctccttctgt 421 caggagcacc tgcagccgca cttcgacagc cccgccttcc aggaccaccc gctgcagccg 481 cccgttcgcg acctgttgcg ccgcaaatgt tcccagcaca atcggctgcg ggaatttttc 541 tgccccgagc acagcgagtg catctgccac atctgcctgg tggagcataa gacctgctct 601 cccgcgtccc tgagccaggc cagcgccgac ctggaggcca ccctgaggca caaactaact 661 gtcatgtaca gtcagatcaa cggggcgtcg agagcactgg atgatgtgag aaacaggcag 721 caggatgtgc ggatgactgc aaacagaaag gtggagcagc tacaacaaga atacacggaa 781 atgaaggctc tcttggacgc ctcagagacc acctcgacaa ggaagataaa ggaagaggag 841 aagagggtca acagcaagtt tgacaccatt tatcagattc tcctcaagaa gaagagtgag 901 atccagacct tgaaggagga gattgaacag agcctgacca agagggatga gttcgagttt 961 ctggagaaag catcaaaact gcgaggaatc tcaacaaagc cagtctacat ccccgaggtg 1021 gaactgaacc acaagctgat aaaaggcatc caccagagca ccatagacct caaaaacgag 1081 ctgaagcagt gcatcgggcg gctccaggag ctcaccccca gttcaggtga ccctggagag 1141 catgacccag cgtccacaca caaatccaca cgccctgtga agaaggtctc caaagaggaa 1201 aagaaatcca agaaacctcc ccctgtccct gccttaccca gcaagcttcc cacgtttgga 1261 gccccggaac agttagtgga tttaaaacaa gctggcttgg aggctgcagc caaagccacc 1321 agctcacatc cgaactcaac atctctcaag gccaaggtgc tggagacctt cctggccaag 1381 tccagacctg agctcctgga gtattacatt aaagtcatcc tggactacaa caccgcccac 1441 aacaaagtgg ctctgtcaga gtgctataca gtagcttctg tggctgagat gcctcagaac 1501 taccggccgc atccccagag gttcacatac tgctctcagg tgctgggcct gcactgctac 1561 aagaagggga tccactactg ggaggtggag ctgcagaaga acaacttctg tggggtaggc 1621 atctgctacg gaagcatgaa ccggcagggc ccagaaagca ggctcggccg caacagcgcc 1681 tcctggtgcg tggagtggtt caacaccaag atctctgcct ggcacaataa cgtggagaaa 1741 accctgccct ccaccaaggc cacgcgggtg ggcgtgcttc tcaactgtga ccacggcttt 1801 gtcatcttct tcgctgttgc cgacaaggtc cacctgatgt ataagttcag ggtggacttt 1861 actgaggctt tgtacccggc tttctgggta ttttctgctg gtgccacact ctccatctgc 1921 tcccccaagt aggcaggctg taggcacttg ggctgactgc ctgcagaagt cccaagaccc 1981 tagtgaaaat acagcaggca gaactctcct tggataattc ccccaagagg tccccaagga 2041 ttgggagcat gggaggggag ctggcgggag ggtgggaggt gggatttagc caggaaaggg 2101 gtgagagtga ttgtgttgtg ggcgaggagg cgtttccacc ccctggtgcc tatcagggca 2161 gggtgaccta ctccccattg ttctggaaat ctccaggctg ctgggcagct gggcagctgg 2221 gcagagctct gggaagtgaa gtcatgagtg cccgattcct cttagagaaa atccatagcc 2281 ttcagatctt ggtgttttga attc // LOCUS HUMERG11 3126 bp mRNA PRI 15-MAR-1989 DEFINITION Human erg protein (ets-related gene) mRNA, complete cds. ACCESSION M21535 M17390 NID g182182 KEYWORDS erg protein. SEGMENT 1 of 2 SOURCE Human, cell line COLO 320, cDNA to mRNA, lambda-7. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3126) AUTHORS Reddy,E.S.P., Rao,V.N. and Papas,T.S. TITLE The erg gene: A human gene related to the ets oncogene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 6131-6135 (1987) MEDLINE 87317608 FEATURES Location/Qualifiers source 1..3126 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 195..1286 /note="erg1 protein" /codon_start=1 /db_xref="PID:g182185" /translation="MVGSPDTVGMNYGSYMEEKHMPPPNMTTNERRVIVPADPTLWST DHVRQWLEWAVKEYGLPDVNILLFQNIDGKELCKMTKDDFQRLTPSYNADILLSHLHY LRETPLPHLTSDDVDKALQNSPRLMHARNTDLPYEPPRRSAWTGHGHPTPQSKAAQPS PSTVPKTEDQRPQLDPYQILGPTSSRLANPGSGQIQLWQFLLELLSDSSNSSCITWEG TNGEFKMTDPDEVARRWGERKSKPNMNYDKLSRALRYYYDKNIMTKVHGKRYAYKFDF HGIAQALQPHPPESSLYKYPSDLPYMGSYHAHPQKMNFVAPHPPALPVTSSSFFAAPN PYWNSPTGGIYPNTRLPTSHMPSHLGTYY" BASE COUNT 928 a 732 c 725 g 741 t ORIGIN 1 bp upstream from EcoRI site. 1 gaattccctc caaagcaaga caaatgactc acagagaaaa aagatggcag aaccaagggc 61 aactaaagcc gtcaggttct gaacagctgg tagatgggct ggcttactga aggacatgat 121 tcagactgtc ccggacccag cagctcatat caaggaactc tcctgatgaa tgcagtgtgg 181 ccaaaggcgg gaagatggtg ggcagcccag acaccgttgg gatgaactac ggcagctaca 241 tggaggagaa gcacatgcca cccccaaaca tgaccacgaa cgagcgcaga gttatcgtgc 301 cagcagatcc tacgctatgg agtacagacc atgtgcggca gtggctggag tgggcggtga 361 aagaatatgg ccttccagac gtcaacatct tgttattcca gaacatcgat gggaaggaac 421 tgtgcaagat gaccaaggac gacttccaga ggctcacccc cagctacaac gccgacatcc 481 ttctctcaca tctccactac ctcagagaga ctcctcttcc acatttgact tcagatgatg 541 ttgataaagc cttacaaaac tctccacggt taatgcatgc tagaaacaca gatttaccat 601 atgagccccc caggagatca gcctggaccg gtcacggcca ccccacgccc cagtcgaaag 661 ctgctcaacc atctccttcc acagtgccca aaactgaaga ccagcgtcct cagttagatc 721 cttatcagat tcttggacca acaagtagcc gccttgcaaa tccaggcagt ggccagatcc 781 agctttggca gttcctcctg gagctcctgt cggacagctc caactccagc tgcatcacct 841 gggaaggcac caacggggag ttcaagatga cggatcccga cgaggtggcc cggcgctggg 901 gagagcggaa gagcaaaccc aacatgaact acgataagct cagccgcgcc ctccgttact 961 actatgacaa gaacatcatg accaaggtcc atgggaagcg ctacgcctac aagttcgact 1021 tccacgggat cgcccaggcc ctccagcccc accccccgga gtcatctctg tacaagtacc 1081 cctcagacct cccgtacatg ggctcctatc acgcccaccc acagaagatg aactttgtgg 1141 cgccccaccc tccagccctc cccgtgacat cttccagttt ttttgctgcc ccaaacccat 1201 actggaattc accaactggg ggtatatacc ccaacactag gctccccacc agccatatgc 1261 cttctcatct gggcacttac tactaaagac ctggcggagg cttttcccat cagcgtgcat 1321 tcaccagccc atcgccacaa actctatcgg agaacatgaa tcaaaagtgc ctcaagagga 1381 atgaaaaaag ctttactggg gctggggaag gaagccgggg aagagatcca aagactcttg 1441 ggagggagtt actgaagtct tactgaagtc ttactacaga aatgaggagg atgctaaaaa 1501 tgtcacgaat atggacatat catctgtgga ctgaccttgt aaaagacagt gtatgtagaa 1561 gcatgaagtc ttaaggacaa agtgccaaag aaagtggtct taagaaatgt ataaacttta 1621 gagtagagtt tgaatcccac taatgcaaac tgggatgaaa ctaaagcaat agaaacaaca 1681 cagttttgac ctaacatacc gtttataatg ccattttaag gaaaactacc tgtatttaaa 1741 aatagtttca tatcaaaaac aagagaaaag acacgagaga gactgtggcc catcaacaga 1801 cgttgatatg caactgcatg gcatgtgctg ttttggttga aatcaaatac attccgtttg 1861 atggacagct gtcagctttc tcaaactgtg aagatgaccc aaagtttcca actcctttac 1921 agtattaccg ggactatgaa ctaaaaggtg ggactgagga tgtgtataga gtgagcgtgt 1981 gattgtagac agaggggtga agaaggagga ggaagaggca gagaaggagg agaccaggct 2041 gggaaagaaa cttctcaagc aatgaagact ggactcagga catttgggga ctgtgtacaa 2101 tgagttatgg agactcgagg gttcatgcag tcagtgttat accaaaccca gtgttaggag 2161 aaaggacaca gcgtaatgga gaaagggaag tagtagaatt cagaaacaaa aatgcgcatc 2221 tctttctttg tttgtcaaat gaaaatttta actggaattg tctgatattt aagagaaaca 2281 ttcaggacct catcattatg tgggggcttt gttctccaca gggtcaggta agagatggcc 2341 ttcttggctg ccacaatcag aaatcacgca ggcattttgg gtaggcggcc tccagttttc 2401 ctttgagtcg cgaacgctgt gcgtttgtca gaatgaagta tacaagtcaa tgtttttccc 2461 cctttttata taataattat ataacttatg catttataca ctacgagttg atctcggcca 2521 gccaaagaca cacgacaaaa gagacaatcg atataatgtg gccttgaatt ttaactctgt 2581 atgcttaatg tttacaatat gaagttatta gttcttagaa tgcagaatgt atgtaataaa 2641 ataagcttgg cctagcatgg caaatcagat ttatacagga gtctgcattt gcactttttt 2701 tagtgactaa agttgcttaa tgaaaacatg tgctgaatgt tgtggatttt gtgttataat 2761 ttactttgtc caggaacttg tgcaagggag agccaaggaa ataggatgtt tggcacccaa 2821 atggcgtcag cctctccagg tccttcttgc ctcccctcct gtcttttatt tctagcccct 2881 tttggaacag gaaggacccc ggggtttcaa ttggagcctc catatttatg cctggaagga 2941 aagaggccta tgaagctggg gttgtcattg agaaattcta gttcagcacc tggtcacaaa 3001 tcacccttaa ttctgctatg attaaaatac atttgttgaa cagtgaacaa gctaccactc 3061 gtaaggcaaa ctgtattatt actggcaaat aaagcgtcat ggatagctgc aatttctcac 3121 tttaca // LOCUS HUMERGPE 8342 bp mRNA PRI 10-JUN-1994 DEFINITION Human endogenous retrovirus type C oncovirus sequence. ACCESSION M74509 NID g325464 KEYWORDS endogenous retrovirus; oncogene. SOURCE Homo sapiens Female carcinoma cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8342) AUTHORS Yeh,K.-H., Huang,H.-C., Yen,C.-P., Liu,J.-C., Feng,Y.-N., Wu,F.Y.-H., Yang,W.-K. and Wu,C.-W. TITLE Human endogenous retrovirus and cancer JOURNAL Unpublished (1991) FEATURES Location/Qualifiers source 1..8342 /organism="Homo sapiens" /proviral /db_xref="taxon:9606" /cell_line="cervical cancer cell line cc-7T" /cell_type="epithelial cell" /germline /sex="Female" /tissue_type="carcinoma" LTR 1..113 /note="putative" 5'clip 119..136 /note="putative" primer_bind 119..136 CDS 2312..3499 /codon_start=1 /db_xref="PID:g325465" /translation="MTVGGKDIDFLVDTSAEHSVVTASVAPLSKKTIDIIGAMGVSAK QAFCLPQTCTIGGHKVIHQFLYMPDCPLPLLGRDLLSKLRATISFTEHGSLLLKLPGT GVIMTLMLPREEEWRLFLTEPGQEIRPALAKRWPRVWAEDNPPGLAVNQAPVLIEVKP GVQPVRQKQYPVLREALEGIQVHLKCLRTFRIIVPCQSPWNTPLLPVPKPGTKDYRPV QDLRLVNQATVTLHPTVPNLYTLLGLLPAEDSWFTCLDLKDAFFSIRLAPERQKLFAF QWEDPESGVTTQYTWTQLPQRFKNSPTIFGEALARDLQKFPTRDLGCVLLQYVDDLLL GHPTAVGCAKGTDALLRHLEDCGYKVSKKKSSDLPTAGMLLGIYYPTGGAQPRIRKKA GHL" 3'clip 5487..5495 LTR 7935..8326 /note="putative" polyA_signal 8326..8342 BASE COUNT 2419 a 1841 c 2014 g 2068 t ORIGIN 1 ggcgagctcg gctcttaaga tacgagtctg ccaatgctcc cggccaaata aaaaacctct 61 tccttcttta atctggtgtc tgaggagttt tgtctgtgac tcgtcctgct acatttcttg 121 gttccctggc caggaagcaa ggtaattgaa ggacagtcga ggcagcccct taggtggctt 181 aggcctgccc tgtggagcat ccctgcaggg gactctggcc agcttgagtg acgcggatcc 241 tgagagcgct cccaggtagg caattacccc ggtggaaagc ctcgtcagag cagtgcgtgg 301 caggcccctg tggaggatca atgcagtggc tgaacactgg gaaggaacag gcacttggag 361 tccagacatt tgaaacttgg taagactggt cttcggaact tgcccactcc atttgagtgg 421 aagcgtggcc tgatcaacca cggcatgcct gtactggcac tttggttttt gtttttgact 481 tgacttgaat tgcttgatac tttggttttg gtttgacctg gcttggattt ctggatactc 541 tgattttggt tttgattctg gtttggtgaa aactgaaaaa gtgtgtgtgt gcacttttta 601 cccattcttt gttttgtggt gtgcatgtgg tgtgagcttg gtgttttgtc ttgaggaaac 661 atggatcaga cacaaaataa gcctactcct ctaggaacta tattgaaaaa ttttaagaag 721 ggatttaatc gagactatgg ggttactatg acaccaggga aacttagaac tttgtgtgaa 781 atagattggc caacattaga agtgggttgg ccatcagaag gaagcctgga caggtccctt 841 gtttctaagg tatggcacaa ggtaactagt aagtcaggac actcagacca gtttccatac 901 atagacactt ggttacagct ggtgctagac cccccacagt ggctaagagg gcaggcagca 961 gcagtgctag tagcaaaggg acagatagtc aaggaaggat tctgctccac ccgctgagga 1021 aatcaactcc tgaagttctg ttcgaccaaa catcagaaga tccattgcag gagatggcac 1081 cagtgatccc agtgttgccc tccccttatc agggagagag gctccccact tttgagtcca 1141 cagtgcttgc gcctctgcca gacaaatgta tccctaggcc actcagagta gacaagagag 1201 gaggtgaagc ctcgggagaa acccctccct tggcagctca tttaagaccc aaaacaggga 1261 tacaaatgcc cctgagagag cagcagtata ctggaataga tgaggatggg cacatggtgg 1321 agagtcgtgt ttttgtgtac cagcccttca cctctgccga ccttctcaac tggaaaaaca 1381 ataccccgtc ctatactgaa aagccgcaag ctctaattga tttgctccaa actattatcc 1441 agacccataa ccccacttgg gctgattgcc accagttgct catgttcctc tttaaaacag 1501 atgaaaggtg aagggtgctt caagcagcaa ctaagtggct agaggaacat gcactggctg 1561 attaccaaaa cccccaagag tatgtaagga cacagttacc aggaaccgac ccccagtggg 1621 acccaaatta aagagaggat atgcaaaggc taaaccgata caggaaagct ctcttagaag 1681 gatttaaaga ggagagccca gaaggccaca aacattaaca aggtctctga ggtcattcag 1741 ggaaaagaag aaagtccagc aaaattccac gagagactgt gtgaggctta ttgtatgtat 1801 actccctttg atcccgatag ccctgaaaat caacgcatga ttaacatggc tttagttagt 1861 caaagcacag aagacattag aagaaaactg cagaaaaagg ctgggtttgc agggatgaac 1921 acatcacagt tattagaaat agccaaccag gtgtttgtaa acagggatgc agcaagccgt 1981 aaggaaaacc acatagagaa tgaacgtcag gcccggcgaa acgcacctgt tagctgcagc 2041 aattagaggg gtccccccaa aagaggcaag ggaaaagggg ggccctggga aagaaactca 2101 gcctggctgt cagagcttgc agtgtaatca gtgtgcttat cgtaaagaaa atggatattg 2161 gaagaacaaa tgccctcagc taaaaggaaa acaaggtgac tcggagcagg aggctccaga 2221 caaggaggaa ggggccctgc tcaacctggc agaagggtta ttggactgag ggggactggg 2281 ctcaaggacc tccaaagagc ctatggtcag gatgacagtt gggggtaaag acattgattt 2341 tcttgtagat accagtgctg aacattcggt agtaactgcc tcagtcgccc ccttatccaa 2401 aaagactatt gacatcatcg gagccatggg agtttcagca aaacaagctt tctgcttgcc 2461 ccagacttgt actataggag gacataaagt gattcatcag tttttgtaca tgcctgattg 2521 tcccttgccc ttgttgggaa gagacttgct tagcaaactg agagccacta tctcttttac 2581 agagcacggc tctttgctgc taaagttacc cggaacagga gtcattatga cccttatgct 2641 cccccgagag gaggaatgga gacttttctt aacagagccg ggccaagaga taagaccagc 2701 tctggctaag cggtggccaa gagtgtgggc ggaagacaac cctccagggt tggcagtcaa 2761 ccaagccccc gtgcttatag aagttaagcc tggggtccag ccggttaggc aaaaacagta 2821 cccggtcctc agagaagctc ttgaaggtat ccaggtccat ctcaagtgcc taagaacctt 2881 tagaattata gttccttgtc agtctccatg gaacactccc ctcctgcctg ttcccaagcc 2941 tgggaccaag gactacaggc cggtacagga tttgcgcttg gttaatcagg ctacagtgac 3001 tttacatcca acagtaccta acctgtacac attgctgggg ttgctgccag ctgaggacag 3061 ctggttcacc tgcttggacc tgaaagatgc tttctttagc atcagattag cccctgagag 3121 acagaagctg tttgcctttc agtgggaaga tccagagtca ggtgtcacta ctcaatacac 3181 ttggacccag cttccccaaa ggttcaagaa ctcccccacc atctttgggg aggcgttggc 3241 tcgagacctc cagaagtttc ccaccagaga cctaggctgc gtgttgctcc agtacgttga 3301 tgaccttttg ctgggacacc ccacggcagt cgggtgcgcc aagggaacag atgctctact 3361 ccggcacctg gaggactgtg ggtataaggt gtccaagaaa aaaagctcag atctgccgac 3421 agcaggtatg ttacttggga tttactatcc aacaggggga gcacagccta ggatcagaaa 3481 gaaagcaggt catttgtaat ctaccggagc ctaagaccag aaggcaggtg agagaattct 3541 taggggctgt gggtttttgc agactgtgga tcccaaactt tgcagtatta gctaagcctt 3601 tgtatgaggt cacaaaggcg ggggaccagg aaccttttga atggggatcc cagcaacagc 3661 aagcctttca tgagttaaag gaaagactta tgtcagtccc agccctgggg ctacctgatc 3721 tgacaaagcc ttttacattg tatgtgtcag agagtgaaaa gatggcagtt ggagttttaa 3781 cccaaactgt ggggccctgg ctgaggccgg tggcctacct ctctaaacaa ctagacgggg 3841 tttctaaagg atggcccccg tgtttgaggg ccttggcagc aactgccctg ctagtacaag 3901 aagcagataa gctgattctt gggcaaaacc tgaacataaa ggacccccat gctgtggtga 3961 ctttaatgaa tactagagga catcattggc taacgaatgc tagacttact aagtaccaaa 4021 gtttgctttg tgaaaatccc catataacca ttgaagtttg taacaccctg aaccccgcta 4081 ccttgctccc agtattagag atccctgtcg agcatgactg tgtagaagtg ttggactcag 4141 tttactctgg gcatcagtag actgggaact atacgtggat gggagcagct ttgtcaaccc 4201 acaagaagag agatgtgcag ggtatgcggt ggtaactctg gacactgttg ctgaagccag 4261 atcgtttccc cagggcactt caactcagaa agctgaactc attgctttaa ttcgggcctt 4321 agaactcagt gaaggtaaga ctgtaaacat ttacactgac tcttgatatg tctttttaac 4381 ccttcaagtg catggagcat tatgtaaaga aaagggccta ttgaactctg ggggaaaaga 4441 cataaaatat caacaagaaa tcttgcaatt attagaagca gtatggaaac cccacaaggt 4501 ggctgttata cattgcggag gacaccagtg agcttccacc ttggtgggtt tggggaattc 4561 ctgcactgac ttagaggctc aaaaagcagc atctgccctt ccgggcatca gtgacagccc 4621 ccctgctccc tcaagcacct gatcttgtac ctacttattc taaagaagaa aaggactttc 4681 tccaggcaga gggaggacaa gtgatggagg aaggatggat ttggttacca gatgggagag 4741 tagctgtgcc acagctgcta ggagctgcag ttgtactggc tgtgcataaa accacccatc 4801 taggtcagga atcacttgaa aagttgttag gctggtattt ctacatctcg catttgtcag 4861 cccttgccaa aacagtgacg cagcggtgtg ttacctgccg acagcataat gcgagacaag 4921 gtccagctgt tccccctggc atacaagctt atggagcagc cccctttgaa gatctccagg 4981 tggacttcac agagatgcca aagtgtggag gtaacaagta tttactagtt cttgtgtgta 5041 cctactctgg gcaggtggag gcttatccaa cacgaactga gaaagctcat gaagtaactc 5101 gtgtgcttct tcgagatctt attcctagat ttggactgcc cttacggatt ggctcagata 5161 atgggctggt gtttgtggct gacttggtac agaagacggc aaaggtattg gggatcacat 5221 ggaaactgca tgctgcctac cagcctcaga gttccggaaa ggtagagcgg atgaatcgga 5281 ctatcaaaaa tagtttaggg aaagtatgtc aagaaacagg attaaaatgg atacaggctc 5341 ttcctatggt attatttaaa attagatgta ccccttctaa aagaacagga tattcccctt 5401 atgaaatatt atatcatagg ccccctccta tattgcgggg acttccaggc actccccgag 5461 agttaggtga aattgagtta cagcgatagc tacaggcttc aggaaaaatt acacaaacaa 5521 tctcggcctg ggtaaatgag agatgccctg ttaacttatt ctccccagtt caccctttct 5581 ccccaggtga tctagtgtgg atcaaggact gaaacgtagc ctgtttgtgt ccacggtgga 5641 aaggacccca gactgtcatc ctgagcactc ccaccgctgt gaaggtagag ggaatcccaa 5701 cctggatcca ccacagccgt gtaaaacctg cagtgcctga aacctgggag gcaagaccaa 5761 gcccagaaaa cccctgcaga gtgaccccga agaagacaac aagccctgct ccagtcacac 5821 ccggaagctg actggtccac gcacggccga agcatgcaga agctcatcat gggattcatt 5881 tttcttaaat tttggactta tacagtaagg gcttcaactg atcttactca aactggggac 5941 tgttcccagt gtattcatca ggtcaccgag gtaggacagc aaattaaaac aatgtttctg 6001 ttctatagtt attataaatg tataggaaca ttaaaagaaa cttgtttgta taatgctact 6061 cagtacaatg tatgtagccc aggaaatgac cgacctgatg tgtgttataa cccatctgag 6121 cctcctgcaa ccaccatttt tgaaataaga ataagaactg gccttttcct aggtgataca 6181 agtaaaataa taactagaac agaagaaaaa gaaatcccca aacaaataac tttaagattt 6241 gatgcttgtg cagccattaa tagtaaaaag ctaggaatag gatgtgattc tcttaactgg 6301 gaaaggagct acagaataaa aaataaatat gtttgtcatg agtcaggggc ctttgtgaaa 6361 attgtgccta ttggccatgt gttatttggg ctacttggaa aaagaacaaa aaggacccgg 6421 tttatcttca gaagggggaa gccaacccct cctgtgctgc tggtcactgt aacccactag 6481 aactaataat taccaatccc ctagatcccc attggaaaaa gggagaacgt gtaaccctgg 6541 ggattgatgg gacagggtta aacccccaag ttgccatttt aattagaggg gaggtccaca 6601 agtgctctcc caaaccagta tttcaaacct tttataagga gctgaatctg ccagcaccag 6661 aatttccaaa aaagacaaaa aatttgtttc tccaattagc agaaaatgta gctcattccc 6721 ttaatgttac ttcttgttat gtatgcgggg gaaccactat cggagaccga tggccttggg 6781 aagcccgaga gttggtgcct actgatccag ctcctgatat aattccagtt cagaaaaccc 6841 aagctagcaa cttctgggtc ctaaaaacct caattattgg acaatactgt atagctagag 6901 aagggaaaga ctttatcatc cctgtaggaa agcttaattg tataggacag aagttgtata 6961 acagtacaac aaagacaatt acttggtggg gcataaacca cactgaaaag aatccattta 7021 gtaaattttc aaaattaaaa actgcttggg ctcatccaga atctcatcag gactggatgg 7081 ctcccgctgg actatactgg atatgtgggc acagagccta cattcggtta cctaataaat 7141 aggcaggcag ttgtgttatt ggcactatta agtcgtcctt tttcttatta cccataaaaa 7201 caggtgagac cctaggtttc cctgtctatg cctcccgaga aaagagaggc atagttatag 7261 gaaactggaa agataatgag tggcgccctg aaaggatcat acagtattat gggcctgcca 7321 catgggcaca agacggctca tggggatacc gaacccccat ttacatgctc aatcggatca 7381 tacggttgca ggccatctta gaaataatta ctaatgaaac tggcagagct ttgactgttt 7441 tagctcggca ggaaacccaa acgaggaatg ctatctatca gaatagactg gccttggact 7501 acttgctagc agctgaagga ggagtttgtg gaaaatttaa cttaaccaat tactgcctac 7561 aaatagatga tcaaggacag gtggttgaaa acatagtcag ggacatgaca aaggtggcac 7621 atgtgcctgt acaggtttgg cacaagttta atcctgagtc tttatttgga aaatggtttc 7681 cagctatagg aggatttaaa accctcattg taggtgtatt gctagtgata ggaacttgct 7741 tgctgctccc ctgtgtatta cccttgcttt ttcaaatgat aaaatatttt gttgttactt 7801 tagttcatca gaaaacttca gcacatgtgt attatacaaa tcactatcgc tctatctcac 7861 aaagagacta aaaaagtgag gacgagagta agaactccca ctaaaagtga aaattctcaa 7921 agggggggaa atatggtatg aggtcgccac ttctcctgtt gtccttctca gtttctcccc 7981 aacctcccct tttccctagt ttataagaca ggagaaaagg gagaaagcaa aaagttgaaa 8041 agaaacagaa gtaagataaa tagctagacg accttggcac caccacctgg ccctggtggc 8101 taaaataata ataatattat taacccctga ccaaaactat tggtgttatc tgtaaattcc 8161 agacactgta tgagaaaata ctgtaaaact ttttgttctg ttagctgatg tatgtagccc 8221 ccagtcatgt ttctcacgct tacttgatct attatgactt tttcatgtag accccttaga 8281 gttgtaaccc ttaaaagggc taagaatttc tttttcgggg agctcgaaaa aaaaaaaaaa 8341 aa // LOCUS HUMERP72H 2865 bp mRNA PRI 31-DEC-1994 DEFINITION Human (clone pA3) protein disulfide isomerase related protein (ERp72) mRNA, complete cds. ACCESSION J05016 NID g181507 KEYWORDS protein disulfide isomerase-related protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2865) AUTHORS Huang,S.H., Tomich,J.M., Wu,H., Jong,A. and Holcenberg,J. TITLE Human deoxycytidine kinase. Sequence of cDNA clones and analysis of expression in cell lines with and without enzyme activity JOURNAL J. Biol. Chem. 264 (25), 14762-14768 (1989) MEDLINE 89359272 REFERENCE 2 (sites) AUTHORS Huang,S.H., Tomich,J.M., Wu,H., Jong,A. and Holcenberg,J. TITLE Human deoxycytidine kinase. Sequence of cDNA clones and analysis of expression in cell lines with and without enzyme activity JOURNAL J. Biol. Chem. 266 (8), 5353 (1991) MEDLINE 91161636 COMMENT Originally described as deoxycytidine kinase and later determined experimentally to be a homologue of murine ERp72 (acc# J05186) Draft entry and computer-readable sequence for [1] kindly submitted by J.S.Holcenberg, 27-JUL-1989. FEATURES Location/Qualifiers source 1..2865 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Molt 4" /cell_type="lymphoblast" /clone="pA3" mRNA <1..2864 /gene="ERp72" gene 1..2864 /gene="ERp72" CDS 46..1983 /gene="ERp72" /codon_start=1 /product="protein disulfide isomerase-related protein" /db_xref="PID:g181508" /translation="MRPRKAFLLLLLLGLVQLLAVAGAEGPDEDSSNRENAIEDEEEE EEEDDDEEEDDLEVKEENGVLVLNDANFDNFVADKDTVLLEFYAPWCGHCKQFAPEYE KIANILKDKDPPIPVAKIDATSASVLASRFDVSGYPTIKILKKGQAVDYEGSRTQEEI VAKVREVSQPDWTPPPEVTLVLTKENFDEVVNDADIILVEFYAPWCGHCKKLAPEYEK AAKELSKRSPPIPLAKVDATAETDLAKRFDVSGYPTLKIFRKGRPYDYNGPREKYGIV DYMIEQSGPPSKEILTLKQVQEFLKDGDDVIIIGVFKGESDPAYQQYQDAANNLREDY KFHHTFSTEIAKFLKVSQGQLVVMQPEKFQSKYEPRSHMMDVQGSTQDSAIKDFVLKY ALPLVGHRKVSNDAKRYTRRPLVVVYYSVDFSFDYRAATQFWRSKVLEVAKDFPEYTF AIADEEDYAGEVKDLGLSESGEDVNAAILDESGKKFAMEPEEFDSDTLREFVTAFKKG KLKPVIKSQPVPKNNKGPVKVVVGKTFDSIVMDPKKDVLIEFYAPWCGHCKQLEPVYN SLAKKYKGQKGLVIAKMDATANDVPSDRYKVEGFPTIYFAPSGDKKNPVKFEGGDRDL EHLSKFIEEHATKLSRTKEEL" BASE COUNT 786 a 643 c 729 g 707 t ORIGIN 1 ccagcggccg ccgacgctag gaggccgcgc tccgcccccg ctaccatgag gccccggaaa 61 gccttcctgc tcctgctgct cttggggctg gtgcagctgc tggccgtggc gggtgccgag 121 ggcccggacg aggattcttc taacagagaa aatgccattg aggatgaaga ggaggaggag 181 gaggaagatg atgatgagga agaagacgac ttggaagtta aggaagaaaa tggagtcttg 241 gtcctaaatg atgcaaactt tgataatttt gtggctgaca aagacacagt gctgctggag 301 ttttatgctc catggtgtgg acattgcaag cagtttgctc cggaatatga aaaaattgcc 361 aacatattaa aggataaaga tcctcccatt cctgttgcca agatcgatgc aacctcagcg 421 tctgtgctgg ccagcaggtt tgatgtgagt ggctacccca ccatcaagat ccttaagaag 481 gggcaggctg tagactacga gggctccaga acccaggaag aaattgttgc caaggtcaga 541 gaagtctccc agcccgactg gacgcctcca ccagaagtca cgcttgtgtt gaccaaagag 601 aactttgatg aagttgtgaa tgatgcagat atcattctgg tggagtttta tgccccatgg 661 tgtggacact gcaagaaact tgcccccgag tatgagaagg ccgccaagga gctcagcaag 721 cgttctcctc caattcccct ggcaaaggtc gacgccaccg cagaaacaga cctggccaag 781 aggtttgatg tctctggcta tcccaccctg aaaattttcc gcaaaggaag gccttatgac 841 tacaacggcc cacgagaaaa atatggaatc gttgattaca tgatcgagca gtccgggcct 901 ccctccaagg agattctgac cctgaagcag gtccaggagt tcctgaagga tggagacgat 961 gtcatcatca tcggggtctt taagggggag agtgacccag cctaccagca ataccaggat 1021 gccgctaaca acctgagaga agattacaaa tttcaccaca ctttcagcac agaaatagca 1081 aagttcttga aagtctccca ggggcagttg gttgtaatgc agcctgagaa attccagtcc 1141 aagtatgagc cccggagcca catgatggac gtccagggct ccacccagga ctcggccatc 1201 aaggacttcg tgctgaagta cgccctgccc ctggttggcc accgcaaggt gtcaaacgat 1261 gctaagcgct acaccaggcg ccccctggtg gtcgtctact acagtgtgga cttcagcttt 1321 gattacagag ctgcaactca gttttggcgg agcaaagtcc tagaggtggc caaggacttc 1381 cctgagtaca cctttgccat tgcggacgaa gaggactatg ctggggaggt gaaggacctg 1441 gggctcagcg agagtgggga ggatgtcaat gccgccatcc tggacgagag tgggaagaag 1501 ttcgccatgg agccagagga gtttgactct gacaccctcc gcgagtttgt cactgctttc 1561 aaaaaaggaa aactgaagcc agtcatcaaa tcccagccag tgcccaagaa caacaaggga 1621 cccgtcaagg tcgtggtggg aaagaccttt gactccattg tgatggaccc caagaaggac 1681 gtcctcatcg agttctacgc gccatggtgc gggcactgca agcagctaga gcccgtgtac 1741 aacagcctgg ccaagaagta caagggccaa aagggcctgg tcatcgccaa gatggacgcc 1801 actgccaacg acgtccccag cgaccgctat aaggtggagg gcttccccac catctacttc 1861 gcccccagtg gggacaaaaa gaacccagtt aaatttgagg gtggagacag agatctggag 1921 catttgagca agtttataga agaacatgcc acaaaactga gcaggaccaa ggaagagctt 1981 tgaaggcctg aggtctgcgg aaggtgggag gaggcagacg ccctgcgtgg cccatggtcg 2041 gggcgtccac cggaggccgg caacaaacga cagtatctcg gattcctttt tttttttttt 2101 taatttttta tactttgttg tttcacttca tgctctgaat actgaataac catgaatgac 2161 tgaatagttt agtccagatt tttacagagg atacatctat ttttatcatt atttggggtt 2221 tgaaaaattt ttttttacac cttctaattt ctttatttct caaagcagat aattcttctg 2281 tgtgaaaatg ttttcttttt ttaatttaag gtttaaaatt ccttttccaa atcatgttga 2341 ttttgctctt taaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaga agggctggga 2401 ccaaccgggt gagatccaca agtctctgga tgtggctgaa ggcaaataca caattgaagt 2461 actttctgtt ttgaagtgct ttcccttttg aatctggttt gaaacatgca gcttctgtct 2521 ctagcccaag gaaagaccaa aacataggga aataaaagca tttatctttg tcttggaagt 2581 aattgttgaa gttgtgcagt tgatcagtgc acagttagct gcaatgttta tagaaattga 2641 ttgttaaacc aaatttacac tggcatgtgt ggtgtagttt ctaaaaggca cttcacattt 2701 gaaatttttc ttaccttaga aagtttctag tgatctaaat gtctagtttt gtattctttt 2761 gtgtgtgttc actgtttctc agtattacca cttgaataat tctctgtaca ggggggtttg 2821 tgctatacac tgggatgtct aattgcagca ataaagcctt tcttt // LOCUS HUMERVA34A 3387 bp DNA PRI 13-FEB-1996 DEFINITION Human endogenous retrovirus ERV3, pol-env-3'LTR region. ACCESSION M12140 NID g182215 KEYWORDS endogenous retrovirus; env protein; long terminal repeat; pol protein. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3387) AUTHORS Cohen,M., Powers,M., O'Connell,C. and Kato,N. TITLE The nucleotide sequence of the env gene from the human provirus ERV3 and isolation and characterization of an ERV3-specific cDNA JOURNAL Virology 147 (2), 449-458 (1985) MEDLINE 86072098 FEATURES Location/Qualifiers source 1..3387 /organism="Homo sapiens" /proviral /db_xref="taxon:9606" CDS <1..494 /note="pol gene protein; Xxx" /codon_start=3 /db_xref="PID:g1196424" /transl_except=(pos:225..227,aa:OTHER) /transl_except=(pos:129..131,aa:OTHER) /translation="GYSPYERLFGKPSPIISQIKGNLRELGELTLRRQMQALGIAMXS VHGWVQERMPISLIDPIHPFKPRDSLWVKKXNPTTLGPIWDGLHTVILSIPTVVKVAG IVPWIHPSSQLKPAAQDKWTSQQDLDHATQLILRWNQGASEMTTALLWSLRKLTSPRT AEA" mat_peptide 808..2100 /note="putative" /product="envelope protein" CDS 808..2502 /note="putative" /codon_start=1 /product="envelope protein" /db_xref="PID:g1196425" /translation="MTKTLLYHTYYECAGTCLGTCTHNQTTYSVCDPGRGQPYVCYDP KSSPGIWFEIHVGSKEGDLLNQTKVFPSGKDVVSLYFDVCQIVSMGSLFPVIFSSMEY YSSCHKNRYAHPACSTDSPVTTCWDCTTWSTNQQSLGPIMLTKIPLEPDCKTSTCNSV NLTILEPDQPIWTTGLKTPLGARVSDEEIGPGAYVYLYIIKKTRTRSTQQLRVFESFY EHVNQKLPEPPPLASNLFAQLAENIASSLHVASCYVCGGMNMGDQWPWEARELMPQDN FTLTASSLEPAPSSQSIWFLKTSIIGKFCIARWGKAFTDPVGELTCLGQQYYNETLGK TLWRGKSNNSESPHPSPFSRFPSLNHSWYQLEAPNTWQAPSGLYWICGPQAYRQLPAK WSGACVLGTIRPSFFLMPLKQGEALGYPIYDETKRKSKRGITIGDWKDSEWPPERIIQ YYGPATWAEDGMWGYRTPVYMLNRIIRLQAVLEIITNETAGALNLLAQQATKMRNVIY QNRLALDYLLAQEEGVCGKFSLTNCCLELDDEGKVIKEITAKIQKLAHIPVQTWKG" repeat_region 2800..3387 /note="3' LTR" BASE COUNT 979 a 835 c 712 g 861 t ORIGIN 1 ctgggtattc gccctatgaa agactgttcg gcaagccatc cccaatcata agtcaaatta 61 agggtaatct tcgtgaacta ggggaattaa ctttaaggag gcaaatgcag gctttaggga 121 tagccatgtg aagtgtccat ggctgggtac aggaaagaat gcccataagc ctgatagatc 181 caatacaccc ctttaaaccc agggactctc tttgggtcaa aaaataaaac ccaaccactc 241 tgggacccat atgggatggg ctccatactg taatcttgtc tattcccact gttgttaaag 301 ttgcaggaat tgtgccttgg atccatccat ccagtcagct gaaaccagca gcccaggaca 361 agtggaccag ccaacaggac ctagaccatg caacccagct gatcctacga tggaaccaag 421 gtgccagtga gatgacaaca gccctgctct ggtcactccg gaagctgacc agtccacgca 481 cggctgaagc ttgaggagac aacagccctg ctctagtcac cccagaagct gactagtcta 541 tgcacggccg aagcttgagt catcatcagg gaagtaaatg tggttagaaa tctgaagtcc 601 agtaattttc cttgtcatat taattacttt gctattaagc tgtcactttg cttagccttc 661 tccccccagg aaaaagcctt ttctgtccat gctgggtatg aacatgctac tcatcacttt 721 gttcttgcta ctccccttat ccatgttaaa aggagaaccc tgggagggat gcctccactg 781 cacccacact acgtgtcggg gaacatcatg actaaaaccc tgttgtatca cacttattat 841 gagtgtgctg ggacctgcct aggaacttgt actcacaacc agacaaccta ctcagtctgt 901 gacccaggaa ggggccagcc ttatgtgtgt tatgacccta agtcttcacc tgggatctgg 961 tttgaaattc atgtcgggtc aaaggaaggg gatcttctaa accaaaccaa ggtatttccc 1021 tctggcaagg atgtcgtatc cttatacttt gatgtttgcc agatagtatc catgggctca 1081 ctctttcccg taatcttcag ttccatggag tactatagta gctgccataa aaataggtat 1141 gcacaccctg cttgttccac cgattcccca gtaacaactt gctgggactg cacaacgtgg 1201 tccactaacc aacaatcact agggccaatt atgcttacca aaataccatt agaaccagat 1261 tgtaaaacaa gcacttgcaa ttctgtaaat cttaccatct tagagccaga tcagcccata 1321 tggacaacag gtttaaagac accgctaggg gcacgagtca gcgatgaaga aattggccca 1381 ggagcctatg tctatctata tatcataaag aaaactcgga cccgctcaac ccaacagttg 1441 cgagtttttg agtcattcta tgagcatgtt aaccagaaat tgcctgagcc ccctcccttg 1501 gccagtaatt tattcgccca actggctgaa aacatagcca gcagcctgca cgttgcttca 1561 tgttatgtct gtgggggaat gaacatggga gaccaatggc catgggaagc aagggaacta 1621 atgccccaag ataatttcac actaaccgcc tcttccctcg aacctgcacc atcaagtcag 1681 agcatctggt tcttaaaaac ctccattatt ggaaaattct gtattgctcg ctggggaaag 1741 gcctttacag acccagtagg agagttaact tgcctaggac aacaatatta caacgagaca 1801 ctaggaaaga ctttatggag gggcaaaagc aataattctg aatcaccaca cccaagccca 1861 ttctctcgtt tcccatcttt aaaccattct tggtaccaac ttgaagctcc aaatacctgg 1921 caggcaccct ctggcctcta ctggatctgt gggccacaag catatcgaca actgccagct 1981 aaatggtcag gggcctgtgt actggggaca attaggccgt ccttcttcct aatgccccta 2041 aaacagggag aagccttagg ataccccatc tatgatgaaa ctaaaaggaa aagcaaaaga 2101 ggcataacta taggagattg gaaggacagt gaatggcctc ctgaaagaat aattcaatat 2161 tatggcccag ccacctgggc agaagatgga atgtggggat accgcacccc agtttacatg 2221 cttaaccgca ttataagatt gcaggcagta ctagaaatca ttaccaatga aactgcaggg 2281 gccttgaatc tgcttgccca gcaagccaca aaaatgagaa atgtcattta tcaaaataga 2341 ctggccttag actacctcct agcccaggaa gagggagtat gcggaaagtt cagccttact 2401 aactgctgcc tggaacttga tgacgaagga aaggttatca aagaaataac tgctaaaatc 2461 caaaagttag ctcacatccc agttcagact tggaaaggat agtctccaga ttcccttttc 2521 agaggttggt tcttatccct tggaggattt aaaaccttag tacaaatagt cctagccata 2581 ttgggagttt gccttatact cccttgtctc ttacccctca ttgtcaaaaa tatccaaaca 2641 gccatagagg ctcttgtgga cagacggact accacacgac taatggccct aactaagtat 2701 taacccctgc caagaaaaga gctacttcct cttgaagtaa atgaagatag tgatgctttc 2761 tcttaaactt tacttataaa aagcatcaaa ggggggaatg aagcaggaaa tataaaagga 2821 aaaacaagta aagggaaaac aagtcctttc ctgaccagtc tgactcactc caaagtcctg 2881 ctggagctat gataattatc tgcaaggcca ggcaggggct ccgaaggagg gctccaggag 2941 cagggatgag aaaaacaagt tctccttatc agtttccctg tttgaaattc tctccccata 3001 acattattct ttgttctgct ctcacaacta tttttgtaac tatttctgca agtctgtaaa 3061 gattttgtaa gttcttgttt ttctttctgt agcatggcaa ggtcacaaga catgtttaag 3121 taaggtaggc tcatgttgca aatcctgttg taaaacctgt cacggtatga ttaactgcct 3181 ttgttctgct tctgtaagac tgctttctca cctcgcaggt tttgcgccaa aaacccgact 3241 tgcccctgcc tgatgcatgt ataaaagtca agcccgtctt tgttccgggc tcagcctttg 3301 gatgttaatc cgctgggcca gtggccacct aaataaaacc ttcctgttgc acccagtgat 3361 ctctccggcc tcctgatacc cacaaca // LOCUS HUMERVBCB 2315 bp mRNA PRI 31-DEC-1994 DEFINITION Human estrogen receptor-related protein (variant ER from breast cancer) mRNA, complete cds. ACCESSION M69297 NID g182218 KEYWORDS LINE 1 repetitive sequence; estrogen receptor-related protein. SOURCE Homo sapiens (tissue library: lambda ZAP-II) breast cancer cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2315) AUTHORS Dotzlaw,H., Alkhalaf,M. and Murphy,L.C. TITLE Characterization of estrogen receptor variant mRNAs from human breast cancers JOURNAL Mol. Endocrinol. 6 (5), 773-785 (1992) MEDLINE 92293154 FEATURES Location/Qualifiers source 1..2315 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="breast cancer" /tissue_lib="lambda ZAP-II" misc_feature 1..883 /note="estrogen receptor-related sequence" gene 241..903 /gene="estrogen receptor-related protein" CDS 241..903 /gene="estrogen receptor-related protein" /codon_start=1 /db_xref="PID:g182219" /translation="MTMTLHTKASGMALLHQIQGNELEPLNRPQLKIPLERPLGEVYL DSSKPAVYNYPEGAAYEFNAAAAANAQVYGQTGLPYGPGSEAAAFGSNGLGGFPPLNS VSPSPLMLLHPPPQLSPFLQPHGQQVPYYLENEPSGYTVREAGPPAFYRPNSDNRRQG GRERLASTNDKGSMAMESAKETRYCAVCNDYASGYHYGVWSCEGCKAFFKRSIQELPT LC" repeat_region 901..2315 /rpt_family="LINE I related" CDS 1216..1494 /note="ORF 2" /codon_start=1 /db_xref="PID:g182220" /translation="MGKDFMSKTPKAMATKAKIDKWDLIKLKSFCSLLGYLKTENTAK ETTIRLNRQPTEWEKIFAIYSSDKGLISRIYKELKQIYKKKGTPSTSG" CDS 1500..1931 /note="ORF 3" /codon_start=1 /db_xref="PID:g182221" /translation="MNRHFSKEDIYAANRHMKKCSSSLAIREMQIKTTMRYHLTPVRM VIIRKSGNDRCWRGCGEIGTLLHCWWDCKLVQPLWKTVWRFLRDLQLEIPFDPAIPLL GIYPKDYKSCCYKDTCTRMFIAALFTIAKTWNQPKYPTTIG" polyA_signal 2302..2307 polyA_site 2315 BASE COUNT 716 a 578 c 559 g 462 t ORIGIN 1 gagcccagga gctggcggag ggcgttcgtc ctgggactgc acttgctccc gtcgggtcgc 61 ccggcttcac cggacccgca ggctcccggg gcagggccgg ggccagagct cgcgtgtcgg 121 cgggacatgc gctgcgtcgc ctctaacctc gggctgtgct ctttttccag gtggcccgcc 181 ggtttctgag ccttctgccc tgcggggaca cggtctgcac cctgcccgcg gccacggacc 241 atgaccatga ccctccacac caaagcatcc gggatggccc tactgcatca gatccaaggg 301 aacgagctgg agcccctgaa ccgtccgcag ctcaagatcc ccctggagcg gcccctgggc 361 gaggtgtacc tggacagcag caagcccgcc gtgtacaact accccgaggg cgccgcctac 421 gagttcaacg ccgcggccgc cgccaacgcg caggtctacg gtcagaccgg cctcccctac 481 ggccccgggt ctgaggctgc ggcgttcggc tccaacggcc tggggggttt ccccccactc 541 aacagcgtgt ctccgagccc gctgatgcta ctgcacccgc cgccgcagct gtcgcctttc 601 ctgcagcccc acggccagca ggtgccctac tacctggaga acgagcccag cggctacacg 661 gtgcgcgagg ccggcccgcc ggcattctac aggccaaatt cagataatcg acgccagggt 721 ggcagagaaa gattggccag taccaatgac aagggaagta tggctatgga atctgccaag 781 gagactcgct actgtgcagt gtgcaatgac tatgcttcag gctaccatta tggagtctgg 841 tcctgtgagg gctgcaaggc cttcttcaag agaagtattc aagaacttcc aacactatgt 901 tgaataggat ggtactggta ccaaaacaga gatatagacc aatggaacag aacagagccc 961 tcagaaataa taccacacat ctacaaccat ctgatctttg acaaacgtga caaaaacaag 1021 aaatggggaa aggattccct atttaataaa tagtgctggg aaaactggct agccatatgt 1081 agaaagctga aactggatca cttccttaca ccttatacaa aaattaattc aagatggatt 1141 aaagacttaa atgttagaac taaaaccata aaaaccctag aagaaaacct aggcaatacc 1201 attcaggaca taggcatggg caaggacttc atgtctaaaa caccaaaagc aatggcaaca 1261 aaagccaaaa ttgacaaatg ggatctaatt aaactaaaga gcttctgttc tttgctgggg 1321 tatctgaaga ctgaaaacac agcaaaagaa actaccatca gactgaacag gcaacctaca 1381 gaatgggaga aaatttttgc aatctactca tctgacaaag ggctaatatc cagaatctac 1441 aaagaactca aacaaattta caagaaaaaa ggaaccccat caacaagtgg gtgaaggata 1501 tgaacagaca cttctcaaaa gaagacattt atgcagccaa cagacacatg aaaaaatgct 1561 catcatcatt ggccatcaga gaaatgcaaa tcaaaaccac aatgagatac catctcacac 1621 cagttagaat ggtgatcatt agaaagtcag gaaacgacag gtgctggaga ggatgtggag 1681 aaataggaac acttttacac tgttggtggg actgtaaact ggttcaacca ttgtggaaga 1741 cagtgtggcg attcctcagg gatctacaac tagaaatacc atttgaccca gccatcccat 1801 tactgggtat atacccaaag gattataaat catgctgcta taaagacaca tgcacacgta 1861 tgtttattgc ggcactattc acaatagcaa agacttggaa ccaacctaaa tatccaacaa 1921 caataggcta gattaagaaa atgtggcaca tatacaccat ggaatactat gcagccataa 1981 aaaaggatga gttcatatac ttgtagggac atggatgaag ctggaaacca tcattctcag 2041 caaactattg caaggacaaa aaaccaaaca tgcatgttct cactcatagg tgggaattga 2101 acaataagaa cacttggaca cagggtgggg aacattacac actggggcct gttgtggggt 2161 ggggggaggg gggagggata gcattaggag atataactaa tgtaaatgat gagttaatgg 2221 gtgcagcaca ccaacatggc acatgtatac atatgtaaca aacctgcaca ttgtgcacat 2281 gtaccctaga acttaaagta taataaaaaa tattt // LOCUS HUMERVKA 9179 bp DNA PRI 13-FEB-1996 DEFINITION Human endogenous retrovirus HERV-K10. ACCESSION M14123 NID g182227 KEYWORDS endogenous retrovirus; env protein; gag protein; glucocorticoid response element; glycoprotein; long terminal repeat; pol polyprotein; protease. SOURCE Homo sapiens foetus liver DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 9179) AUTHORS Ono,M., Yasunaga,T., Miyata,T. and Ushikubo,H. TITLE Nucleotide sequence of human endogenous retrovirus genome related to the mouse mammary tumor virus genome JOURNAL J. Virol. 60 (2), 589-598 (1986) MEDLINE 87036922 COMMENT Draft entry and clean copy sequence for [1] kindly provided by M.Ono, 22-JAN-1987. The human K10 and K18 endogenous retrovirus clones have a deletion of 290 base pairs (between positions 6500 and 6501) with respect to clones K8 and K22 (see separate entries). This deletion fuses the pol and env reading frames, eliminating the 3' end of pol and the 5' end of env. Within the pol/env ORF an in frame stop codon is located at postitions 6920-6922. Polyadenylation signals are located at positions 793-798 and 9294-9299. TATAA promoters are located at 531-537 and 9032-9038. FEATURES Location/Qualifiers source 1..9179 /organism="Homo sapiens" /proviral /db_xref="taxon:9606" /dev_stage="foetus" /tissue_type="liver" LTR 1..968 /note="5' LTR" protein_bind 75..80 /bound_moiety="glucocorticoid responsive element" enhancer 81..88 /note="enhancer core" misc_binding 971..988 /bound_moiety="Lys-tRNA primer" CDS 1112..1963 /note="putative" /codon_start=1 /product="gag 1 protein" /db_xref="PID:g488473" /translation="MGQTKSKIKSKYASYLSFIKILLKRGGVKVSTKNLIKLFQIIEQ FCPWFPEQGTSDLKDWKRIGKELKQAGRKGNIIPLTVWNDWAIIKAALEPFQTEEDSI SVSDAPGSCLIDCNENTRKKSQKETESLHCEYVAEPVMAQSTQNVDYNQLQEVIYPET LKLEGKGPELMGPSESKPRGTSPLPAGQVLVRLQPQKQVKENKTQPQVAYQYCRWLNF SIGHPQKVSMDIQECPQHHRAGRHTISRPLGDLILWHHLVDRVVNYMKLLINQERKEI LRHGNSQ" CDS 1807..3111 /note="ORF (bases 1720-3111) first start codon at 1807.; putative" /codon_start=1 /product="gag 2 protein" /db_xref="PID:g1196427" /translation="MPPAPQGRAPYHQPPTRRLNPMAPPSRQGSELHEIIDKSRKEGD TEAWQFPVTLEPMPPGEGAQEGEPPTVEARYKSFSIKMLKDMKEGVKQYGPNSPYMRT LLDSIAYGHRLIPYDWEILAKSSLSPSQFLQFKTWWIDGVQEQVRRNRAANPPVNIDA DQLLGIGQNWSTISQQALMQNEAIEQVRAICLRAWEKIQDPGSTCPSFNTVRQGSKEP YPDFVARLQDVAQKSIADEKAGKVIVELMAYENANPECQSAIKPLKGKVPAGSDVISE YVKACDGIGGAMHKAMLMAQAITGVVLGGQVRTFGGKCYNCGQIGHLKKNCPVLNKQN ITIQATTTGREPPDLCPRCKKGKHWASQCRSKFDKNGQPLSGNEQRGQPQAPQQTGAF PIQPFVPQGFQGQQPPLSQVFQGISQLPQYNNCPSPQAAVQQ" CDS 3639..3917 /note="ORF (bases 2913-3917) first start codon at 3639; putative" /codon_start=1 /product="neutral protease large subunit" /db_xref="PID:g1196428" /translation="MEILHCLGPDNQESTVQPMITSIPLNLWGRDLLQQWGAEITMPA PLYSPTSQKIMTKMGYIPGKGLGKNEDGIKVPVEAKINQEREGIGYPF" CDS 4172..8257 /note="pol/env ORF (bases 3878-8257) first start codon at 4172; Xxx; putative" /codon_start=1 /db_xref="PID:g1196429" /transl_except=(pos:6920..6922,aa:OTHER) /translation="MGPLQPGLPSPAMIPKDWPLIIIDLKDCFFTIPLAEQDCEKFAF TIPAINNKEPATRFQWKVLPQGMLNSPTICQTFVGRALQPVREKFSDCYIIHYIDDIL CAAETKDKLIDCYTFLQAEVANAGLAIASDKIQTSTPFHYLGMQIENRKIKPQKIEIR KDTLKTLNDFQKLLGDINWIRPTLGIPTYAMSNLFSILRGDSDLNSQRILTPEATKEI KLVEEKIQSAQINRIDPLAPLQLLIFATAHSPTGIIIQNTDLVEWSFLPHSTVKTFTL YLDQIATLIGQTRLRITKLCGNDPDKIVVPLTKEQVRQAFINSGAWQIGLANFVGLID NHYPKTKIFQFLKLTTWILPKITRREPLENALTVFTDGSSNGKAAYTGPKERVIKTPY QSAQRDELVAVITVLQDFDQPINIISDSAYVVQATRDVETALIKYSMDDQLNQLFNLL QQTVRKRNFPFYITYIRAHTNLPGPLTKANEQADLLVSSALIKAQELHALTHVNAAGL KNKFDVTWKQAKDIVQHCTQCQVLHLPTQEAGVNPRGLCPNALWQMDVTHVPSFGRLS YVHVTVDTYSHFIWATCQTGESTSHVKKHLLSCFAVMGVPEKIKTDNGPGYCSKAFQK FLSQWKISHTTGIPYNSQGQAIVERTNRTLKTQLVKQKEGGDSKECTTPQMQLNLALY TLNFLNIYRNQTTTSAEQHLTGKKNSPHEGKLIWWKDNKNKTWEIGKVITWGRGFACV SPGENQLPVWLPTRHLKFYNEPIGDAKKRASTEMVTPVTWMDNPIEVYVNDSIWVPGP IDDRCPAKPEEEGMMINISIGYRYPPICLGRAPGCLMPAVQNWLVEVPTVSPISRFTY HMVSGMSLRPRVNYLQDFSYQRSLKFRPKGKPCPKEIPKESKNTEVLVWEECVANSAV ILXNNEFGTIIDWAPRGQFYHNCSGQTQSCPSAQVSPAVDSDLTESLDKHKHKKLQSF YPWEWGEKGISTPRPKIVSPVSGPEHPELWRLTVASHHIRIWSGNQTLETRDCKPFYT VDLNSSLTVPLQSCVKPPYMLVVGNIVIKPDSQTITCENCRLLTCIDSTFNWQHRILL VRAREGVWIPVSMDRPWEASPSVHILTEVLKGVLNRSKRFIFTLIAVIMGLIAVTATA AVAGVALHSSVQSVNFVNDWQKNSTRLWNSQSSIDQKLANQINDLRQTVIWMGDRLMS LEHRFQLQCDWNTSDFCITPQIYNESEHHWDMVRRHLQGREDNLTLDISKLKEQIFEA SKAHLNLVPGTEAIAGVADGLANLNPVTWVKTIGSTSIINLILILVCLFCLLLVCRCT QQLRRDSDHRERAMMTMAVLSKRKGGNVGKSKRDQIVTVSV" LTR 8212..9179 /note="3' LTR" protein_bind 8286..8291 /bound_moiety="glucocorticoid responsive element" enhancer 8292..8299 /note="enhancer core" BASE COUNT 2959 a 1866 c 1952 g 2402 t ORIGIN 237 bp upstream of SphI site. 1 tgtggggaaa agcaagagag atcaaattgt tactgtgtct gtgtagaaag aagtagacat 61 aggagactcc attttgttat gtgctaagaa aaattcttct gccttgagat tctgttaatc 121 tatgacctta cccccaaccc cgtgctctct gaaacgtgtg ctgtgtcaac tcagggttga 181 atggattaag ggcggtgcag gatgtgcttt gttaaacaga tgcttgaagg cagcatgctc 241 cttaagagtc atcaccactc cctaatctca agtacccagg gacacaaaaa ctgcggaagg 301 ccgcagggac ctctgcctag gaaagccagg tattgtccaa ggtttctccc catgtgatag 361 tctgaaatat ggcctcgtgg gaagggaaag acctgaccgt cccccagccc gacacctgta 421 aagggtctgt gctgaggagg attagtaaaa gaggaaggaa tgcctcttgc agttgagaca 481 agaggaaggc atctgtctcc tgcctgtccc tgggcaatgg aatgtctcgg tataaaaccc 541 gattgtatgc tccatctact gagataggga aaaaccgcct tagggctgga ggtgggacct 601 gcgggcagca atactgcttt gtaaagcatt gagatgttta tgtgtatgca tatccaaaag 661 cacagcactt aatcctttac attgtctatg atgccaagac ctttgttcac gtgtttgtct 721 gctgaccctc tccccacaat tgtcttgtga ccctgacaca tccccctctt tgagaaacac 781 ccacagatga tcaataaata ctaagggaac tcagaggctg gcgggatcct ccatatgctg 841 aacgctggtt ccccgggtcc ccttatttct ttctctatac tttgtctctg tgtctttttc 901 ttttccaaat ctctcgtccc accttacgag aaacacccac aggtgtgtag gggcaaccca 961 cccctacatc tggtgcccaa cgtggaggct tttctctagg gtgaaggtac gctcgagcgt 1021 aatcattgag gacaagtcga cgagagatcc cgagtacatc tacagtcagc cttacggtaa 1081 gcttgcgcgc tcggaagaag ctagggtgat aatggggcaa actaaaagta aaattaaaag 1141 taaatatgcc tcttatctca gctttattaa aattctttta aaaagagggg gagttaaagt 1201 atctacaaaa aatctaatca agctatttca aataatagaa caattttgcc catggtttcc 1261 agaacaagga acttcagatc taaaagattg gaaaagaatt ggtaaggaac taaaacaagc 1321 aggtaggaag ggtaatatca ttccacttac agtatggaat gattgggcca ttattaaagc 1381 agctttagaa ccatttcaaa cagaagaaga tagcatttca gtttctgatg cccctggaag 1441 ctgtttaata gattgtaatg aaaacacaag gaaaaaatcc cagaaagaaa ccgaaagttt 1501 acattgcgaa tatgtagcag agccggtaat ggctcagtca acgcaaaatg ttgactataa 1561 tcaattacag gaggtgatat atcctgaaac gttaaaatta gaaggaaaag gtccagaatt 1621 aatggggcca tcagagtcta aaccacgagg cacaagtcct cttccagcag gtcaggtgct 1681 cgtaagatta caacctcaaa agcaggttaa agaaaataag acccaaccgc aagtagccta 1741 tcaatactgc cgctggctga acttcagtat cggccacccc cagaaagtca gtatggatat 1801 ccaggaatgc ccccagcacc acagggcagg gcgccatacc atcagccgcc cactaggaga 1861 cttaatccta tggcaccacc tagtagacag ggtagtgaat tacatgaaat tattgataaa 1921 tcaagaaagg aaggagatac tgaggcatgg caattcccag taacgttaga accgatgcca 1981 cctggagaag gagcccaaga gggagagcct cccacagttg aggccagata caagtctttt 2041 tcgataaaaa tgctaaaaga tatgaaagag ggagtaaaac agtatggacc caactcccct 2101 tatatgagga cattattaga ttccattgct tatggacata gactcattcc ttatgattgg 2161 gagattctgg caaaatcgtc tctctcaccc tctcaatttt tacaatttaa gacttggtgg 2221 attgatgggg tacaagaaca ggtccgaaga aatagggctg ccaatcctcc agttaacata 2281 gatgcagatc aactattagg aataggtcaa aattggagta ctattagtca acaagcatta 2341 atgcaaaatg aggccattga gcaagttaga gctatctgcc ttagagcttg ggaaaaaatc 2401 caagacccag gaagtacctg cccctcattt aatacagtaa gacaaggttc aaaagagccc 2461 taccctgatt ttgtggcaag gctccaagat gttgctcaaa agtcaattgc cgatgaaaaa 2521 gccggtaagg tcatagtgga gttgatggca tatgaaaacg ccaatcctga gtgtcaatca 2581 gccattaagc cattaaaagg aaaggttcct gcaggatcag atgtaatctc agaatatgta 2641 aaagcctgtg atggaatcgg aggagctatg cataaagcta tgcttatggc tcaagcaata 2701 acaggagttg ttttaggagg acaagttaga acatttggag gaaaatgtta taattgtggt 2761 caaattggtc acttaaaaaa gaattgccca gtcttaaaca aacagaatat aactattcaa 2821 gcaactacaa caggtagaga gccacctgac ttatgtccaa gatgtaaaaa aggaaaacat 2881 tgggctagtc aatgtcgttc taaatttgat aaaaatgggc aaccattgtc gggaaacgag 2941 caaaggggcc agcctcaggc cccacaacaa actggggcat tcccaattca gccatttgtt 3001 cctcagggtt ttcagggaca acaaccccca ctgtcccaag tgtttcaggg aataagccag 3061 ttaccacaat acaacaattg tccctcacca caagcggcag tgcagcagta gatttatgta 3121 ctatacaagc agtctctctg cttccagggg agcccccaca aaaaatccct acaggggtat 3181 atggcccact gcctgagggg actgtaggac taatcttggg aagatcaagt ctaaatctaa 3241 aaggagttca aattcatact agtgtggttg attcagacta taaaggcgaa attcaattgg 3301 ttattagctc ttcaattcct tggagtgcca gtccaagaga caggattgct caattattac 3361 tcctgccata tattaagggt ggaaatagtg aaataaaaag aataggaggg cttgtaagca 3421 ctgatccaac aggaaaggct gcatattggg caagtcaggt ctcagagaac agacctgtgt 3481 gtaaggccat tattcaagga aaacagtttg aagggttggt agacactgga gcagatgtct 3541 ctattattgc tttaaatcag tggccaaaaa actggcctaa acaaaaggct gttacaggac 3601 ttgtcggcat aggcacagcc tcagaagtgt atcaaagtat ggagatttta cattgcttag 3661 ggccagataa tcaagaaagt actgttcagc caatgattac ttcaattcct cttaatctgt 3721 ggggtcgaga tttattacaa caatggggtg cggaaatcac catgcccgct ccattatata 3781 gccccacgag tcaaaaaatc atgaccaaga tgggatatat accaggaaag ggactaggga 3841 aaaatgaaga tggcattaaa gttccagttg aggctaaaat aaatcaagaa agagaaggaa 3901 tagggtatcc tttttagggg cggtcactgt agagcctcct aaacccatac cactaacttg 3961 gaaaacagaa aaaccggtgt gggtaaatca gtggccgcta ccaaaacaaa aactggaggc 4021 tttacattta ttagcaaatg aacagttaga aaagggtcac attgagcctt cgttctcacc 4081 ttggaattct cctgtgtttg taattcagaa gaaatcaggc aaatggcata cgttaactga 4141 cttaagggct gtaaacgccg taattcaacc catggggcct ctccaacccg ggttgccctc 4201 tccggccatg atcccaaaag attggccttt aattataatt gatctaaagg attgcttttt 4261 taccatccct ctggcagagc aggattgtga aaaatttgcc tttactatac cagccataaa 4321 taataaagaa ccagccacca ggtttcagtg gaaagtgtta cctcagggaa tgcttaatag 4381 tccaactatt tgtcagactt ttgtaggtcg agctcttcaa ccagtgagag aaaagttttc 4441 agactgttat attattcatt atattgatga tattttatgt gctgcagaaa cgaaagataa 4501 attaattgac tgttatacat ttctgcaagc agaggttgcc aatgctggac tggcaatagc 4561 atccgataag atccaaacct ctactccttt tcattattta gggatgcaga tagaaaatag 4621 aaaaattaag ccacaaaaaa tagaaataag aaaagacaca ttaaaaacac taaatgattt 4681 tcaaaaatta ctaggagata ttaattggat tcggccaact ctaggcattc ctacttatgc 4741 catgtcaaat ttgttctcta tcttaagagg agactcagac ttaaatagtc aaagaatatt 4801 aaccccagag gcaacaaaag aaattaaatt agtggaagaa aaaattcagt cagcgcaaat 4861 aaatagaata gatcccttag ccccactcca acttttgatt tttgccactg cacattctcc 4921 aacaggcatc attattcaaa atactgatct tgtggagtgg tcattccttc ctcacagtac 4981 agttaagact tttacattgt acttggatca aatagctaca ttaatcggtc agacaagatt 5041 acgaataaca aaattatgtg gaaatgaccc agacaaaata gttgtccctt taaccaagga 5101 acaagttaga caagccttta tcaattctgg tgcatggcag attggtcttg ctaattttgt 5161 gggacttatt gataatcatt acccaaaaac aaagatcttc cagttcttaa aattgactac 5221 ttggattcta cctaaaatta ccagacgtga acctttagaa aatgctctaa cagtatttac 5281 tgatggttcc agcaatggaa aagcagctta cacagggccg aaagaacgag taatcaaaac 5341 tccatatcaa tcggctcaaa gagacgagtt ggttgcagtc attacagtgt tacaagattt 5401 tgaccaacct atcaatatta tatcagattc tgcatatgta gtacaggcta caagggatgt 5461 tgagacagct ctaattaaat atagcatgga tgatcagtta aaccagctat tcaatttatt 5521 acaacaaact gtaagaaaaa gaaatttccc attttatatt acttatattc gagcacacac 5581 taatttacca gggcctttga ctaaagcaaa tgaacaagct gacttactgg tatcatctgc 5641 actcataaaa gcacaagaac ttcatgcttt gactcatgta aatgcagcag gattaaaaaa 5701 caaatttgat gtcacatgga aacaggcaaa agatattgta caacattgca cccagtgtca 5761 agtcttacac ctgcccactc aagaggcagg agttaatccc agaggtctgt gtcctaatgc 5821 attatggcaa atggatgtca cgcatgtacc ttcatttgga agattatcat atgttcatgt 5881 aacagttgat acttattcac atttcatatg ggcaacttgc caaacaggag aaagtacttc 5941 ccatgttaaa aaacatttat tgtcttgttt tgctgtaatg ggagttccag aaaaaatcaa 6001 aactgacaat ggaccaggat attgtagtaa agctttccaa aaattcttaa gtcagtggaa 6061 aatttcacat acaacaggaa ttccttataa ttcccaagga caggccatag ttgaaagaac 6121 taatagaaca ctcaaaactc aattagttaa acaaaaagaa gggggagaca gtaaggagtg 6181 taccactcct cagatgcaac ttaatctagc actctatact ttaaattttt taaacattta 6241 tagaaatcag actactactt ctgcagaaca acatcttact ggtaaaaaga acagcccaca 6301 tgaaggaaaa ctaatttggt ggaaagataa taaaaataag acatgggaaa tagggaaggt 6361 gataacgtgg gggagaggtt ttgcttgtgt ttcaccagga gaaaatcagc ttcctgtttg 6421 gttacccact agacatttga agttctacaa tgaacccatc ggagatgcaa agaaaagggc 6481 ctccacggag atggtaacac cagtcacatg gatggataat cctatagaag tatatgttaa 6541 tgatagtata tgggtacctg gccccataga tgatcgctgc cctgccaaac ctgaggaaga 6601 agggatgatg ataaatattt ccattgggta tcgttatcct cctatttgcc tagggagagc 6661 accaggatgt ttaatgcctg cagtccaaaa ttggttggta gaagtaccta ctgtcagtcc 6721 catcagtaga ttcacttatc acatggtaag cgggatgtca ctcaggccac gggtaaatta 6781 tttacaagac ttttcttatc aaagatcatt aaaatttaga cctaaaggga aaccttgccc 6841 caaggaaatt cccaaagaat caaaaaatac agaagtttta gtttgggaag aatgtgtggc 6901 caatagtgcg gtgatattat aaaacaatga atttggaact attatagatt gggcacctcg 6961 aggtcaattc taccacaatt gctcaggaca aactcagtcg tgtccaagtg cacaagtgag 7021 tccagctgtt gatagcgact taacagaaag tttagacaaa cataagcata aaaaattgca 7081 gtctttctac ccttgggaat ggggagaaaa aggaatctct accccaagac caaaaatagt 7141 aagtcctgtt tctggtcctg aacatccaga attatggagg cttactgtgg cctcacacca 7201 cattagaatt tggtctggaa atcaaacttt agaaacaaga gattgtaagc cattttatac 7261 tgtcgaccta aattccagtc taacagttcc tttacaaagt tgcgtaaagc ccccttatat 7321 gctagttgta ggaaatatag ttattaaacc agactcccag actataacct gtgaaaattg 7381 tagattgctt acttgcattg attcaacttt taattggcaa caccgtattc tgctggtgag 7441 agcaagagag ggcgtgtgga tccctgtgtc catggaccga ccgtgggagg cctcaccatc 7501 cgtccatatt ttgactgaag tattaaaagg tgttttaaat agatccaaaa gattcatttt 7561 tactttaatt gcagtgatta tgggattaat tgcagtcaca gctacggctg ctgtagcagg 7621 agttgcattg cactcttctg ttcagtcagt aaactttgtt aatgattggc aaaagaattc 7681 tacaagattg tggaattcac aatctagtat tgatcaaaaa ttggcaaatc aaattaatga 7741 tcttagacaa actgtcattt ggatgggaga cagactcatg agcttagaac atcgtttcca 7801 gttacaatgt gactggaata cgtcagattt ttgtattaca ccccaaattt ataatgagtc 7861 tgagcatcac tgggacatgg ttagacgcca tctacaggga agagaagata atctcacttt 7921 agacatttcc aaattaaaag aacaaatttt cgaagcatca aaagcccatt taaatttggt 7981 gccaggaact gaggcaattg caggagttgc tgatggcctc gcaaatctta accctgtcac 8041 ttgggttaag accattggaa gtacatcgat tataaatctc atattaatcc ttgtgtgcct 8101 gttttgtctg ttgttagtct gcaggtgtac ccaacagctc cgaagagaca gcgaccatcg 8161 agaacgggcc atgatgacga tggcggtttt gtcgaaaaga aaagggggaa atgtggggaa 8221 aagcaagaga gatcaaattg ttactgtgtc tgtgtagaaa gaagtagaca taggagactc 8281 cattttgtta tgtgctaaga aaaattcttc tgccttgaga ttctgttaat ctatgacctt 8341 acccccaacc ccgtgctctc tgaaacatgt gctgtgtcaa ctcagggttg aatggattaa 8401 gggcggtgca ggatgtgctt tgttaaacag atgcttgaag gcagcatgct ccttaagagt 8461 catcaccact ccctaatctc aagtacccag ggacacaaaa actgcagaag gccgcaggga 8521 cctctgccta ggaaagccag gtattgtcca aggtttctcc ccatgtgata gtctgaaata 8581 tggcctcgtg ggaagggaaa gacctgaccg tcccccagcc cgacacctgt aaagggtctg 8641 tgctgaggag gattagtaaa agaggaagga atgcctcttg cagttgagac aagaggaagg 8701 catctgtctc ctgcctgtcc ctgggcaatg gaatgtctcg gtataaaacc cgattgtatg 8761 ctccatctac tgagataggg aaaaaccgcc ttagggctgg aggtgggacc tgcgggcagc 8821 aatactgctt tgtaaagcat tgagatgttt atgtgtatgc atatccaaaa gcacagcact 8881 taatccttta cattgtctat gatgccaaga cctttgttca cgtgtttgtc tgctgaccct 8941 ctccccacaa ttgtcttgtg accctgacac atccccctct ttgagaaaca cccacagatg 9001 atcaataaat actaagggaa ctcagaggct ggcgggatcc tccatatgct gaacgctggt 9061 tccccgggtc cccttatttc tttctctata ctttgtctct gtgtcttttt cttttccaaa 9121 tctctcgtcc caccttacga gaaacaccca caggtgtgta ggggcaaccc acccctaca // LOCUS HUMETFA 1266 bp mRNA PRI 08-NOV-1994 DEFINITION Human electron transfer flavoprotein alpha-subunit mRNA, complete cds. ACCESSION J04058 NID g182250 KEYWORDS electron transfer flavoprotein. SOURCE Human liver, cDNA to mRNA, clone pE5b. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1266) AUTHORS Finocchiaro,G., Ito,M., Ikeda,Y. and Tanaka,K. TITLE Molecular cloning and nucleotide sequence of cDNAs encoding the alpha-subunit of human electron transfer flavoprotein JOURNAL J. Biol. Chem. 263 (30), 15773-15780 (1988) MEDLINE 89008492 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tanaka, 08-SEP-1988. FEATURES Location/Qualifiers source 1..1266 /organism="Homo sapiens" /db_xref="taxon:9606" /map="15q23-q25" gene 1..1002 /gene="ETFA" CDS 1..1002 /gene="ETFA" /note="electron transport flavoprotein" /codon_start=1 /db_xref="GDB:G00-119-121" /db_xref="PID:g182251" /translation="MFRAAAPGQLRRAASLLRFQSTLVIAEHANDSLAPITLNTITAA TRLGGEVSCLVAGTKCDKVAQDLCKVAGIAKVLVAQHDVYKGLLPEELTPLILATQKQ FNYTHICAGASAFGKNLLPRVAAKLEVAPISDIIAIKSPDTFVRTIYAGNALCTVKCD EKVKVFSVRGTSFDAAATSGGSASSEKASSTSPVEISEWLDQKLTKSDRPELTGAKVV VSGGRGLKSGENFKLLYDLADQLHAAVGASRAAVDAGFVPNDMQVGQTGKIVAPELYI AVGISGAIQHLAGMKDSKTIVAINKDPEAPIFQVADYGIVADLFKVVPEMTEILKKK" BASE COUNT 381 a 239 c 293 g 353 t ORIGIN 63 bp upstream of RsaI site; chromosome 15q23-q25. 1 atgttccgag cggcggctcc ggggcagctc cggcgggcgg cctcattgct acgatttcag 61 agtaccctgg taatagctga gcatgcaaat gattccctag cacccattac tttaaatacc 121 attactgcag ccacacgcct tggaggtgaa gtgtcctgct tagtagctgg aaccaaatgt 181 gacaaggtgg cacaagatct ctgtaaagta gcaggcatag caaaagttct ggtggctcag 241 catgatgtgt acaaaggcct acttccagag gaactgacac cattgatttt ggcaactcag 301 aagcagttca attacacaca catctgtgct ggagcatctg ccttcggaaa gaaccttttg 361 cccagagtag cagccaaact tgaggttgcc ccgatttctg acatcattgc aatcaagtca 421 cctgacacat ttgtgagaac tatttatgca ggaaatgctc tatgtacagt gaagtgtgat 481 gagaaagtga aagtgttttc tgtccgtgga acatcctttg atgctgcagc aacaagtggc 541 ggtagtgcca gttcagaaaa ggcatcaagt acttcaccag tggaaatatc agagtggctt 601 gaccagaaat taacaaaaag tgatcgacca gagctaacag gtgccaaagt ggtggtatct 661 ggtggtcgag gcttgaagag tggagagaac tttaagttgt tatatgactt ggcagatcaa 721 ctacatgctg cagttggtgc ttcccgtgct gctgttgatg ctggctttgt tcccaatgac 781 atgcaagttg gacagacggg aaaaatagta gcaccagaac tttatattgc tgttggaata 841 tctggagcca tccaacattt agctgggatg aaagacagca agacaattgt ggcaattaat 901 aaagacccag aagctccaat tttccaagtg gcagattatg gaatagttgc agatttattt 961 aaggtagttc ctgaaatgac tgagatattg aagaaaaaat gaatcaggat catgccttaa 1021 aaagaaaact tttgttaaag tattccactg aaatcacaga tatttgtggg tattataaca 1081 atcattggaa agcatggaga gctacatttc ataatttgag ggaaaatttc taacagatgc 1141 cagaatgctt gtttatggga ttgctgtgtt tccttttaat tatttgtggt tccaaacaat 1201 tattgtttga acttttttaa ttctgtacta aaatctataa taaagctttt ccacagcttt 1261 aaaact // LOCUS HUMETR101 1811 bp mRNA PRI 18-JUL-1991 DEFINITION Human transcription factor ETR101 mRNA, complete cds. ACCESSION M62831 NID g182260 KEYWORDS transcription factor. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1811) AUTHORS Shimizu,N., Ohta,M., Fujiwara,C., Sagara,J., Mochizuki,N., Oda,T. and Utlyama,H. TITLE Expression of a novel immediate early gene during 12-O-tetradecanoylphorbol-13-acetate-induced macrophagic differentiaiton of HL-60 cells JOURNAL J. Biol. Chem. 266, 12157-12161 (1991) MEDLINE 91286224 FEATURES Location/Qualifiers source 1..1811 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HL-60" /cell_type="leukemic" /tissue_type="promyelocytic" /tissue_lib="HL-60 TPA 30 min lambda gt10" mRNA <1..1811 /gene="ETR101" gene 1..1811 /gene="ETR101" CDS 101..772 /gene="ETR101" /codon_start=1 /db_xref="PID:g182261" /translation="MEVQKEAQRIMTLSVWKMYHSRMQRGGLRLHRSLQLSLVMRSAR ELYLSAKVEALEPEVSLPAALPSDPRLHPPREAESTAETATPDGEHPFPEPMDTQEAP TAEETSACCAPRPAKVSRKRRSSSLSDGGDVGLVPSKKARLEEKEEEEGASSEVADRL QPPPGQAEGAFPNLARVLQRRFSGLLNCSPAAPPTAPPACEAKPACRPADSMLNVLVR AVVAF" polyA_signal 1794..1799 /gene="ETR101" BASE COUNT 305 a 550 c 592 g 364 t ORIGIN 1 ggtttgtgta gagaggcgtg cagagcccgt tgtccggagt gcacctgctg cctgttctgt 61 ccctcccggg agcccccgcc gctgtcgccg tcgagtcgcc atggaagtgc agaaagaggc 121 acagcgcatc atgaccctgt cggtgtggaa gatgtatcac tcccgcatgc agcgcggtgg 181 cctgcggctg caccggagtc tgcagctgtc gctggtcatg cgcagcgccc gggagctcta 241 cctctcggcc aaggtggagg ccctcgagcc cgaggtgtcg ttgccggccg ccctcccctc 301 tgaccctcgc ctgcacccgc cccgagaagc cgagtccacg gccgagacag cgacccccga 361 cggtgagcac ccgtttccgg agccaatgga cacgcaggag gcgccgacag ccgaggagac 421 ctccgcctgc tgtgccccgc gccccgccaa agtcagccgc aaacgacgca gcagcagcct 481 gagcgacggc ggggacgttg gactggtccc gagcaagaaa gcccgtctgg aagaaaagga 541 agaagaggag ggagcgtcat ccgaagtcgc cgatcgcctg cagccccctc cgggccaagc 601 ggagggcgcc tttcccaacc tggcccgcgt cctgcagagg cgcttctccg gcctcctgaa 661 ctgcagcccc gcggcccctc cgacggcgcc gcccgcgtgc gaggcaaagc ccgcttgccg 721 cccggcggac agcatgctca acgtgctcgt gcgggccgtg gtggccttct gaggaccccg 781 agcggcgctg ccggagccca gagcgcgcgt cgaaccgtcg gcccgagggc gcagacctga 841 ggcgaggcca cccccctcca tcctggggga agcgcccgcg aaaaccgtgg agagaagccg 901 ccgcccgggc tgctgagagg cccggagagg actctgtccc cggggagcca tcgccttcag 961 tgtgcaggga cggcaccgag gagtctgagc cgggcgcggg cgccttccgc agagacctgc 1021 gcccacaggt gctgtcttag tggactggga cgtgaacctt tcgctctcct tctggactgg 1081 gagaagggag gcttgggtgt tgtgtttttt gttttgtttg tttgtttgtt tttaaagatc 1141 tcctcagggt cggacttcat tttgtactgt gggctgtgct ggccctttca aggtttttca 1201 agagttggtt ttgcgtttcc aacctcggag aattccaggc actccccttc cccctccgct 1261 gacatacttg tataagcggt catcgttgcg tcatggggca ggcgtgggga gcttcctgtc 1321 gccttggctg ggtgtgggcc tggaggaagg tcctggggcg tgcactcgcc tgggcagtgg 1381 ggaggagagt ggcctgagtt acttcacccc cgcgtgctgc tggttaatgt cccgcgtctc 1441 tgcaccttcg ggtgggagcg gggactgatc tactttcaca ttctcaagtt tttctcatct 1501 gcattagagg tccccagtag gttcccaggt tccagcgtgc ccctccctca gacacacgga 1561 cacaatcagc cgagaagttc ctggtctgaa tcacgagaat gtggaggggt ggggggtgtc 1621 agtggaaagg cataaggctg agctgagacc agttgctggt gaaactgggc caatctgggg 1681 aggggaacat ccttgccagg gagtttctga gggtctgctt tgtttacctt tcgtgcggtg 1741 gattcttttt aactccgtct acctggcgtt ttgttagaaa tgtcagatag gaaaataaaa 1801 accatttgag t // LOCUS HUMEVI22 1798 bp DNA PRI 08-NOV-1994 DEFINITION Human EV12 protein gene, exon 1. ACCESSION M55267 NID g182278 KEYWORDS EVI2 protein. SEGMENT 2 of 2 SOURCE Human lymphoblastoid cell DNA, and cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1798) AUTHORS Cawthon,R.M., O'Connell,P., Buchberg,A.M., Viskochil,D., Weiss,R.B., Culver,M., Stevens,J., Jenkins,N.A., Copeland,N.G. and White,R. TITLE Identification and characterization of transcripts from the neurofibromatosis 1 region: the sequence and genomic structure of EVI2 and mapping of other transcripts JOURNAL Genomics 7 (4), 555-565 (1990) MEDLINE 90353953 FEATURES Location/Qualifiers source 1..1798 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lymphoblast" /map="17q11.2" mRNA join(M55266:187..383,198..1563) /partial /gene="EVI2A" /note="G00-125-191" /product="EVI2 protein" gene join(M55266:187..604,1..1563) /gene="EVI2A" intron order(M55266:384..604,1..197) /gene="EVI2A" /note="G00-125-191" CDS 220..918 /gene="EVI2A" /codon_start=1 /number=1 /db_xref="GDB:G00-125-191" /product="EVI2 protein" /db_xref="PID:g182280" /translation="MEHTGHYLHLAFLMTTVFSLSPGTKANYTRLWANSTSSWDSVIQ NKTGRNQNENINTNPITPEVDYKGNSTNMPETSHIVALTSKSEQELYIPSVVSNSPST VQSIENTSKSHGEIFKKDVCAENNNNMAMLICLIIIAVLFLICTFLFLSTVVLANKVS SLRRSKQVGKRQPRSNGDFLASGLWPAESDTWKRTKQLTGPNLVMQSTGVLTATRERK DEEGTEKLTNKQIG" sig_peptide 220..297 /partial /gene="EVI2A" /note="G00-125-191" mat_peptide 298..915 /partial /gene="EVI2A" /note="G00-125-191" /product="EVI2 protein" BASE COUNT 658 a 310 c 303 g 527 t ORIGIN 1 taatagaaat taaaatgctt cttcatacat agctgaatag aaaagaattt gttgagaagg 61 aattcagggt agcgaatatt aggcataagc ttgtagttta cttgtaacat ctcaacacta 121 tcttttaact acaattacca aaaactagga tccattattc tttcacaaac taacaaatta 181 tattgctatc ccaacagatt gccaagtatg cccacggaca tggaacacac aggacattac 241 ctacatcttg cctttctgat gacaacagtt ttttctttgt ctcctggaac aaaagcaaac 301 tatacccgtc tgtgggctaa cagtacttct tcctgggatt cagttattca aaacaagaca 361 ggcagaaacc aaaatgaaaa cattaacaca aaccctataa ctcctgaagt agattataaa 421 ggtaattcta caaacatgcc tgaaacatct cacatcgtag ctttaacttc taaatctgaa 481 caggagcttt atataccttc tgtcgtcagc aacagtcctt caacagtaca gagcattgaa 541 aacacaagca aaagtcatgg tgaaattttc aaaaaggatg tctgtgcgga aaacaacaac 601 aacatggcta tgctaatttg cttaattata attgcagtgc tttttcttat ctgtaccttt 661 ctatttctat caactgtggt tttggcaaac aaagtctctt ctctcagacg atcaaaacaa 721 gtaggcaagc gtcagcctag aagcaatggc gattttctgg caagcggtct atggcccgct 781 gaatcagaca cttggaaaag aacaaaacag ctcacaggac ccaacctagt gatgcaatct 841 actggagtgc tcacagctac aagggaaaga aaagatgaag aaggaactga aaaacttact 901 aacaaacaga taggttagtg aagaaaaatg caaagtagca atgagaaggc ttatggagta 961 aaaatgaagt cagttggtat ttaatcccaa agtgttgttc tgattatcta aaatttgaca 1021 tggtagacct tgcaatttag aatcaagcag gtgagacagg gagaagtatg cctgcttaat 1081 tatttaaact gtgtactttt gttttgacac tgaatatttt aaaaagcaaa taataaaata 1141 actaagcatt tgaggaaaat tttaaggata aattgaggaa actgattaat agagatagca 1201 agggataatt aaataaatat tccctatgta gcaacagtgg ttagatgatc tttgtctgaa 1261 tgtaataaaa ctttgaatag ttttagtgtg tccttaaagc caagtatatg ctttaacatc 1321 aaatggaagt caaattccta atgcatagat agagagagct aaactgtgta atttaatggt 1381 atcttccttg ctggatgtgg cagaatccac accagcttat caaccaacac agctaatttt 1441 agaataggtc ctttatcttt ccatatggca cacgtaagaa agtgtttttc tactattaat 1501 attaaattaa aacctttact tttgtataat aaattaaaac tcagaataaa cctgtgacca 1561 cgtatatttg cattcacttt attactttag agaacacatt gtaaagatca ataagaaata 1621 gagcacaact aaaataaata agatttatag ccacaccaat aggctagtgt aaacgaaagt 1681 atgtttcact gtttatgatt aataatattc atcttttcta taaatactac ttactggaac 1741 attaacaaca agtccaaagg ttgattaatt ttgactcagg agcagagcta tgattata // LOCUS HUMEVI2B3P 2158 bp DNA PRI 22-MAR-1991 DEFINITION Human EVI2B3P gene, exon and complete cds. ACCESSION M60830 NID g182282 KEYWORDS . SOURCE Human DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2158) AUTHORS Cawthon,R.M., Andersen,L.B., Buchberg,A.M., Xu,G.F., O'Connell,P. and Viskochil,D. TITLE cDNA sequence and genomic structure of EV12B, a gene lying within an intron of the neurofibromatosis type 1 gene JOURNAL Genomics 9, 446-460 (1991) MEDLINE 91236164 FEATURES Location/Qualifiers source 1..2158 /organism="Homo sapiens" /db_xref="taxon:9606" exon 81..2158 /note="in EVI2B cDNAs this sequence is immediately followed by a polyA tail; putative" CDS 102..1448 /note="open reading frame" /codon_start=1 /db_xref="PID:g182283" /translation="MDPKYFILILFCGHLNNTFFSKTETITTEKQSQPTLYTSSMSQV LANSQNTTGNPLGQPTQFSDTFSGQSISPAKVTAGQPTPAVYTSSEKPEAHTSAGQPL AYNTKQPTPIANTSSQQAVFTSARQLPSARTSTTQPPKSFVYTFTQQSSSVQIPSRKQ ITVHNPSTQPTSTVKNSPRSTPGFILDTTSNKQTPQKNNYNSIAAILIGVLLTSMLVA IIIIVLWKCLRKPVLNDQNWAGRSPFADGETPDICMDNIRENEISTKRTSIISLTPWK PSKSTLLADDLEIKLFESSENIEDSNNPKTEKIKDQVNGTSEDSADGSTVGTAVSSSD DADLPPPPPLLDLEGQESNQSDKPTMTIVSPLPNDSTSLPPSLDCLNQDCGDHKSEII QSFPPLDSLNLPLPPVDFMKNQEDSNLEIQCQEFSIPPNSDQDLNESLPPPPAELL" BASE COUNT 755 a 466 c 307 g 630 t ORIGIN 1 aatataatga aaagtcaaag ttttaactag acaccaatga cgcctaactg tctttctctt 61 tcattataaa cccgctatag ataacgagga aatattctga aatggatccc aaatatttca 121 tcttaatttt gttttgtgga cacctgaaca atacattttt ttcaaagaca gagacaatta 181 caacagagaa gcagtcacag cctaccttat acacatcatc aatgtcacag gtattggcta 241 attctcaaaa cacaacaggg aatcctttgg gtcaaccaac acaattcagc gacacttttt 301 ctggacaatc aatatcacct gccaaagtca ctgctggaca accaacacca gctgtctata 361 cctcttctga aaaaccagaa gcacatactt ctgctggaca accacttgcc tacaacacca 421 aacaaccaac accaatagcc aacacctcct cccagcaagc cgtgttcacc tctgccagac 481 aactaccatc tgcccgtact tctaccacac aaccaccaaa gtcatttgtc tatactttta 541 ctcaacaatc atcatctgtc cagatccctt ctagaaaaca aataactgtt cataatccat 601 ccacacaacc aacatcaact gtcaaaaatt cacctaggag tacaccagga tttatcttag 661 atactaccag taacaaacaa accccacaaa aaaacaatta taattcaata gctgccatac 721 taattggtgt acttctgact tctatgttgg tagctataat catcattgta ctttggaaat 781 gcttaaggaa accagtttta aatgatcaaa attgggcagg tagatctcca tttgctgatg 841 gagaaacccc tgacatttgt atggataaca tcagagaaaa tgaaatatcc acaaaacgta 901 catcaatcat ttcacttaca ccctggaaac caagcaaaag cacactttta gcagatgact 961 tagaaattaa gttgtttgaa tcaagtgaaa acattgaaga ctccaacaac cccaaaacag 1021 agaaaataaa agatcaagta aatggtacat cagaagatag tgctgatggt tcaacagttg 1081 gaactgctgt ttcttcttca gatgatgcag atctgcctcc accacctccc cttctggatt 1141 tggaaggaca ggaaagtaac caatctgaca aacccacaat gacaattgta tctcctcttc 1201 caaatgattc tactagtctc cctccatctc tggactgtct caatcaagac tgtggagatc 1261 ataaatctga gataatacaa tcatttccac cgcttgactc acttaacttg cccctgccac 1321 cagtagattt tatgaaaaac caagaagatt ccaaccttga gatccagtgt caggagttct 1381 ctattcctcc caactctgat caagatctta atgaatccct gccacctcca cctgcagaac 1441 tgttataaat attacaactt gctttttagc tgatcttcca tcctcaaatg actctttttt 1501 ctttatatgt taacatatat aaaatggcaa ctgatagtca attttgattt ttattcagga 1561 actatctgaa atctgctcag agcctatgtg catagatgaa actttttttt aaaaaaagtt 1621 atttaacagt aatctattta ctaattatag tacctatctt taaagtatag tacattttac 1681 atatgtaaat ggtatgtttc aataatttaa gaactctgaa acaatctaca tatacttatt 1741 acccagtaca gttttttttc ccctgaaaag ctgtgtataa aattatggtg aataaacttt 1801 tatgtttcca tttcaaagac cagggtggag aggaataaga gactaagtat atgcttcaag 1861 ttttaaatta atacctcaag tattaaataa atattccaag tttgtgggaa tgggagatta 1921 aaatgcatgt ttgagaatag agaaattttc ttcttggttt cattgcaaag agtaaaacaa 1981 acatgttaaa acatcaactg aagggttggg ttaggaacat ttaccctgaa aaaaatatga 2041 ggatgcatca taaaatgtaa atattttcct accatgttgg gggggcacaa attttaaaac 2101 tggcatcttt acaagtttct tctttataaa cacccaaaca aaatcaagtt ttataaag // LOCUS HUMFABPHA 662 bp mRNA PRI 31-DEC-1994 DEFINITION Human fatty acid binding protein homologue (PA-FABP) mRNA, complete cds. ACCESSION M94856 NID g182353 KEYWORDS fatty acid binding protein homologue. SOURCE Homo sapiens (tissue library: lambda gt11) adult epidermis cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 662) AUTHORS Madsen,P., Rasmussen,H.H., Leffers,H., Honore,B. and Celis,J.E. TITLE Molecular cloning and expression of a novel keratinocyte protein (psoriasis-associated fatty acid-binding protein [PA-FABP]) that is highly up-regulated in psoriatic skin and that shares similarity to fatty acid-binding proteins JOURNAL J. Invest. Dermatol. 99 (3), 299-305 (1992) MEDLINE 92381332 FEATURES Location/Qualifiers source 1..662 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="unfractionated non-cultured keratinocyte" /cell_type="keratinocyte" /dev_stage="adult" /tissue_type="epidermis" /tissue_lib="lambda gt11" /map="17" gene 49..650 /gene="PA-FABP" CDS 49..456 /gene="PA-FABP" /codon_start=1 /product="fatty acid binding protein homologue" /db_xref="PID:g182354" /translation="MATVQQLEGRWRLVDSKGFDEYMKELGVGIALRKMGAMAKPDCI ITCDGKNLTIKTESTLKTTQFSCTLGEKFEETTADGRKTQTVCNFTDGALVQHQEWDG KESTITRKLKDGKLVVECVMNNVTCTRIYEKVE" polyA_signal 645..650 /gene="PA-FABP" BASE COUNT 210 a 128 c 150 g 174 t ORIGIN chromosome 17. 1 accgccgacg cagacccctc tctgcacgcc agcccgcccg cacccaccat ggccacagtt 61 cagcagctgg aaggaagatg gcgcctggtg gacagcaaag gctttgatga atacatgaag 121 gagctaggag tgggaatagc tttgcgaaaa atgggcgcaa tggccaagcc agattgtatc 181 atcacttgtg atggtaaaaa cctcaccata aaaactgaga gcactttgaa aacaacacag 241 ttttcttgta ccctgggaga gaagtttgaa gaaaccacag ctgatggcag aaaaactcag 301 actgtctgca actttacaga tggtgcattg gttcagcatc aggagtggga tgggaaggaa 361 agcacaataa caagaaaatt gaaagatggg aaattagtgg tggagtgtgt catgaacaat 421 gtcacctgta ctcggatcta tgaaaaagta gaataaaaat tccatcatca ctttggacag 481 gagttaatta agagaatgac caagctcagt tcaatgagca aatctccata ctgtttcttt 541 cttttttttt tcattactgt gttcaattat ctttatcata aacattttac atgcagctat 601 ttcaaagtgt gttggattaa ttaggatcat ccctttggtt aataaataaa tgtgtttgtg 661 ct // LOCUS HUMFABPL 489 bp mRNA PRI 08-NOV-1994 DEFINITION Human liver fatty acid binding protein (FABP) mRNA, complete cds. ACCESSION M10050 NID g182355 KEYWORDS fatty acid binding protein; protein Z; sterol carrier protein. SOURCE Human liver, cDNA to mRNA, clone pHF658. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 489) AUTHORS Lowe,J.B., Boguski,M.S., Sweetser,D.A., Elshourbagy,N.A., Taylor,J.M. and Gordon,J.I. TITLE Human liver fatty acid binding protein. Isolation of a full length cDNA and comparative sequence analyses of orthologous and paralogous proteins JOURNAL J. Biol. Chem. 260 (6), 3413-3417 (1985) MEDLINE 85131136 COMMENT Draft entry and sequence in computer readable form kindly provided by J.Lowe, 16-AUG-1985. FEATURES Location/Qualifiers source 1..489 /organism="Homo sapiens" /db_xref="taxon:9606" /map="4q28-q31" mRNA 1..489 /note="FABP mRNA" gene 43..426 /gene="FABP2" CDS 43..426 /gene="FABP2" /note="fatty acid binding protein" /codon_start=1 /db_xref="GDB:G00-119-127" /db_xref="PID:g182356" /translation="MSFSGKYQLQSQENFEAFMKAIGLPEELIQKGKDIKGVSEIVQN GKHFKFTITAGSKVIQNEFTVGEECELETMTGEKVKTVVQLEGDNKLVTAFKNIKSVT ELNGDIITNTMTLGDIVFKRISKRI" BASE COUNT 158 a 92 c 129 g 110 t ORIGIN 31 bp upstream of SacI site. 1 agagccgcag gtcagtcgtg aagagggagc tctattgcca ccatgagttt ctccggcaag 61 taccaactgc agagccagga aaactttgaa gccttcatga aggcaatcgg tctgccggaa 121 gagctcatcc agaaggggaa ggatatcaag ggggtgtcgg aaatcgtgca gaatgggaag 181 cacttcaagt tcaccatcac cgctgggtcc aaagtgatcc aaaacgaatt cacggtgggg 241 gaggaatgtg agctggagac aatgacaggg gagaaagtca agacagtggt tcagttggaa 301 ggtgacaata aactggtgac agctttcaaa aacatcaagt ctgtgaccga actcaacggc 361 gacataatca ccaataccat gacattgggt gacattgtct tcaagagaat cagcaagaga 421 atttaaacaa gtctgcattt catattattt tagtgtgtaa aattaatgta ataaagtgaa 481 ctttgtttt // LOCUS HUMFACAL 3188 bp mRNA PRI 28-MAY-1996 DEFINITION Human long-chain acyl-coenzyme A synthetase (FACL1) mRNA, complete cds. ACCESSION L09229 NID g182384 KEYWORDS acyl-activating enzyme; long-chain acyl-CoA synthetase; long-chain fatty acid-coenzyme A ligase; palmitoyl-CoA ligase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3188) AUTHORS Ghosh,B., Barbosa,E. and Singh,I. TITLE Molecular cloning and sequencing of human palmitoyl-CoA ligase and its tissue specific expression JOURNAL Mol. Cell. Biochem. 151 (1), 77-81 (1995) MEDLINE 96147073 FEATURES Location/Qualifiers source 1..3188 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /chromosome="4" gene 74..2173 /gene="FACL1" CDS 74..2173 /gene="FACL1" /standard_name="palmitoyl-CoA ligase" /EC_number="6.2.1.3" /function="activates long chain fatty acids" /note="ATP-binding domain (bp. 1447..1846)" /codon_start=1 /product="long-chain acyl-CoA synthetase" /db_xref="PID:g182385" /db_xref="GDB:G00-127-357" /translation="MQAHELFRYFRMPELVDFRQCVTLPTNTLMGFGAFSRRLTTFWR PRHPKPLKPPWHLSMQSVEVAGSGGARRSALLDSDEPLVYFYDDVTTLYEGFQRGIQV SNNGPCLGSRKPDQPYEWLSYKQVAELSECIGSALIQKGFKTAPDQFIGIFAQNRPEW VIIEQGCFAYSMVIVPLYDTLGNEAITYIVNKAELSLVFVDKPEKAKLLLEGVENKLI PGLKIIVVMDSYGSELVERGQRCGVEVTSMKAMEDLGRANRRKPKPPAPEDLAVICFT SGTTGNPKGAMVTHRNIVSDCSAFVKATENTVNPCPDDTLISFLPLAHMFERVVECVM LCHGAKIGFFQGDIRLLMDDLKVLQPTVFPVVPRLLNRMFDRIFGQANTTVKRWLLDF ASKRKEADVRSGIIRNNSLWDRLIFHKVQSSLGGRVRLMVTGAAPVSATVLTFLRAAL GCQFYEGYGQTECTAGCCLTMPGDWTTGHVGAPMPCNLIKLGWQLEEMNYMASEGEGE VCVKGPNVFQGYLKDPAKTAEALDKDGWLHTGDIGKWLPNGTLKIIDRKKHIFKLAQG EYIAPEKIENIYMRSEPVAQVFVHGESLQAFLIAIVVPDVETLCSWAQKRGFEGSFEE LCRNKDVKKAILEDMVRLGKDSGLKPFEQVKGITLHPELFSIDNGLLTPTMKAKRPEL RNYFRSQIDDLYSIIKV" BASE COUNT 870 a 716 c 790 g 812 t ORIGIN 1 cgggcagtga cagccggcgc ggatcgcgcg tccacggagg agaatcagct tagagaacta 61 tcaacacagg acaatgcaag cccatgagct gttccggtat tttcgaatgc cagagctggt 121 tgacttccga cagtgcgtga ctcttccgac caacacgctt atgggcttcg gagctttttc 181 cagacgactc accaccttct ggcggccacg ccacccaaaa cccctgaagc cgccatggca 241 cctctccatg cagtcagtgg aagtggcggg tagtggtggt gcacgaagat ccgcactact 301 tgacagcgac gagcccttgg tgtatttcta tgatgatgtt acaacattat acgaaggttt 361 ccagagaggg atacaggtgt caaataatgg cccttgttta ggctctcgga aaccagacca 421 accctatgaa tggctttcat ataaacaggt tgcagaattg tcggagtgca taggctcagc 481 actgatccag aagggcttca agactgcccc agatcagttc attggcatct ttgctcaaaa 541 tagacctgag tgggtgatta ttgaacaagg atgctttgct tattcgatgg tgatcgttcc 601 actttatgat acccttggaa atgaagccat cacgtacata gtcaacaaag ctgaactctc 661 tctggttttt gttgacaagc cagagaaggc caaactctta ttagagggtg tagaaaataa 721 gttaatacca ggccttaaaa tcatagttgt catggactcg tacggcagtg aactggtgga 781 acgaggccag aggtgtgggg tggaagtcac cagcatgaag gcgatggagg acctgggaag 841 agccaacaga cggaagccca agcctccagc acctgaagat cttgcagtaa tttgtttcac 901 aagtggaact acaggcaacc ccaaaggagc aatggtcact caccgaaaca tagtgagcga 961 ttgttcagct tttgtgaaag caacagagaa tacagtcaat ccttgcccag atgatacttt 1021 gatatctttc ttgcctctcg cccatatgtt tgagagagtt gtagagtgtg taatgctgtg 1081 tcatggagct aaaatcggat ttttccaagg agatatcagg ctgctcatgg atgacctcaa 1141 ggtgcttcaa cccactgtct tccccgtggt tccaagactg ctgaaccgga tgtttgaccg 1201 aattttcgga caagcaaaca ccaccgtgaa gcgatggctc ttggactttg cctccaagag 1261 gaaagaagca gacgttcgca gcggcatcat cagaaacaac agcctgtggg accggctgat 1321 cttccacaaa gtacagtcga gcctgggcgg aagagtccgg ctgatggtga caggagccgc 1381 cccggtgtct gccactgtgc tgacgttcct cagagcagcc ctgggctgtc agttttatga 1441 aggatacgga cagacagagt gcactgccgg gtgctgccta accatgcctg gagactggac 1501 cacaggccat gttggggccc cgatgccgtg caatttgata aaacttggtt ggcagttgga 1561 agaaatgaat tacatggcgt ccgagggcga gggcgaggtg tgtgtgaaag ggccaaatgt 1621 atttcagggc tacttgaagg acccagcgaa aacagcagaa gctttggaca aagacggctg 1681 gttacacaca ggggacatcg gaaaatggtt accaaatggc accttgaaaa ttatcgaccg 1741 gaaaaagcac atatttaagc tggcacaagg agaatacata gcccctgaaa agattgaaaa 1801 tatctacatg cgaagtgagc ctgttgctca ggtgtttgtc cacggagaaa gcctgcaggc 1861 atttctcatt gcaattgtgg taccagatgt tgagacatta tgttcctggg cccaaaagag 1921 aggatttgaa gggtcgtttg aggaactgtg cagaaataag gatgtcaaaa aagctatcct 1981 cgaagatatg gtgagacttg ggaaggattc tggtctgaaa ccatttgaac aggtcaaagg 2041 catcacattg caccctgaat tattttctat cgacaatggc cttctgactc caacaatgaa 2101 ggcgaaaagg ccagagctgc ggaactattt caggtcgcag atagatgacc tctattccat 2161 catcaaggtt tagtgtgaag aagaaagctc agaggaaatg gcacagttcc acaatctctt 2221 ctcctgctga tggccttcat gttgttaatt ttgaatacag caagtgtagg gaaggaagcg 2281 ttctgtgttt gacttgtcca ttcggggttc ttctcatagg aatgctagag gaaacagaac 2341 actgccttac agtcacctca gtgttcagac catgtttatg gtaatacaca cttccaaaag 2401 tagccttaaa aattgtaaag ggatactata aatgtgctaa ttatttgaga cttcctcagt 2461 ttaaaaagtg ggttttaaat cttctgtctc cctgtttttc taatcaaggg gttaggactt 2521 tgctatctct gagatgtctg ctacttcgtc gaaattctgc agctgtctgc tgctctaaag 2581 agtacagtgc tctagaggga agtgttccct ttaaaaataa gaacaactgt cctggctgga 2641 gatctcacaa gcggaccaga gatcttttta aatccctgct actgtccctt ctcacaggca 2701 ttcacagaac ccttctgatt cgaagggtta cgaaactcat gttcttctcc agtcccctgt 2761 ggtttctgtt ggagcataag gtttccagta agcgggaggg cagatccaac tcagaaccat 2821 gcagataagg agcctctggc aaatgggtgc tgcatcagaa cgcgtggatt ctctttcatg 2881 gcagatgctc ttggactcgg ttctccaggc ctgattcccc gactccatcc tttttcaggg 2941 ttatttaaaa atctgcctta gattctatag tgaagacaag catttcaaga aagagttacc 3001 tggatcagcc atgctcagct gtgacgcctg ataactgtct actttatctt cactgaacca 3061 ctcactctgt gtaaaggcca acggattttt aatgtggttt tcatatcaaa agatcatgtt 3121 gggattaact tgcctttttc cccaaaaaat aaactctcag gcaaggcatt tcttttaaag 3181 ctattccg // LOCUS HUMFACX 1507 bp mRNA PRI 08-NOV-1994 DEFINITION Human coagulation factor X (F10) mRNA, complete cds. ACCESSION M57285 NID g182389 KEYWORDS coagulation factor X. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1507) AUTHORS Messier,T.L., Pittman,D.D., Long,G.L., Kaufman,R.J. and Church,W.R. TITLE Cloning and expression in COS-1 cells of a full-length cDNA encoding human coagulation factor X JOURNAL Gene 99 (2), 291-294 (1991) MEDLINE 91216473 FEATURES Location/Qualifiers source 1..1507 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /map="13q34" gene 1..1467 /gene="F10" CDS 1..1467 /gene="F10" /EC_number="3.4.21.6" /codon_start=1 /db_xref="GDB:G00-119-890" /product="coagulation factor X" /db_xref="PID:g182390" /translation="MGRPLHLVLLSASLAGLLLLGESLFIRREQANNILARVTRANSF LEEMKKGHLERECMEETCSYEEAREVFEDSDKTNEFWNKYKDGDQCETSPCQNQGKCK DGLGEYTCTCLEGFEGKNCELFTRKLCSLDNGDCDQFCHEEQNSVVCSCARGYTLADN GKACIPTGPYPCGKQTLERRKRSVAQATSSSGEAPDSITWKPYDAADLDPTENPFDLL DFNQTQPERGDNNLTRIVGGQECKDGECPWQALLINEENEGFCGGTILSEFYILTAAH CLYQAKRFKVRVGDRNTEQEEGGEAVHEVEVVIKHNRFTKETYDFDIAVLRLKTPITF RMNVAPACLPERDWAESTLMTQKTGIVSGFGRTHEKGRQSTRLKMLEVPYVDRNSCKL SSSFIITQNMFCAGYDTKQEDACQGDSGGPHVTRFKDTYFVTGIVSWGEGCARKGKYG IYTKVTAFLKWIDRSMKTRGLPKAKSHAPEVITSSPLK" misc_feature 1004..1060 /gene="F10" /note="putative VECTOR sequence Bacteriophage lambda (J02459); putative" BASE COUNT 394 a 429 c 446 g 238 t ORIGIN 1 atggggcgcc cactgcacct cgtcctgctc agtgcctccc tggctggcct cctgctgctc 61 ggggaaagtc tgttcatccg cagggagcag gccaacaaca tcctggcgag ggtcacgagg 121 gccaattcct ttcttgaaga gatgaagaaa ggacacctcg aaagagagtg catggaagag 181 acctgctcat acgaagaggc ccgcgaggtc tttgaggaca gcgacaagac gaatgaattc 241 tggaataaat acaaagatgg cgaccagtgt gagaccagtc cttgccagaa ccagggcaaa 301 tgtaaagacg gcctcgggga atacacctgc acctgtttag aaggattcga aggcaaaaac 361 tgtgaattat tcacacggaa gctctgcagc ctggacaacg gggactgtga ccagttctgc 421 cacgaggaac agaactctgt ggtgtgctcc tgcgcccgcg ggtacaccct ggctgacaac 481 ggcaaggcct gcattcccac agggccctac ccctgtggga aacagaccct ggaacgcagg 541 aagaggtcag tggcccaggc caccagcagc agcggggagg cccctgacag catcacatgg 601 aagccatatg atgcagccga cctggacccc accgagaacc ccttcgacct gcttgacttc 661 aaccagacgc agcctgagag gggcgacaac aacctcacca ggatcgtggg aggccaggaa 721 tgcaaggacg gggagtgtcc ctggcaggcc ctgctcatca atgaggaaaa cgagggtttc 781 tgtggtggaa ctattctgag cgagttctac atcctaacgg cagcccactg tctctaccaa 841 gccaagagat tcaaggtgag ggtaggggac cggaacacgg agcaggagga gggcggtgag 901 gcggtgcacg aggtggaggt ggtcatcaag cacaaccggt tcacaaagga gacctatgac 961 ttcgacatcg ccgtgctccg gctcaagacc cccatcacct tccgcatgaa cgtggcgcct 1021 gcctgcctcc ccgagcgtga ctgggccgag tccacgctga tgacgcagaa gacggggatt 1081 gtgagcggct tcgggcgcac ccacgagaag ggccggcagt ccaccaggct caagatgctg 1141 gaggtgccct acgtggaccg caacagctgc aagctgtcca gcagcttcat catcacccag 1201 aacatgttct gtgccggcta cgacaccaag caggaggatg cctgccaggg ggacagcggg 1261 ggcccgcacg tcacccgctt caaggacacc tacttcgtga caggcatcgt cagctgggga 1321 gagggctgtg cccgtaaggg gaagtacggg atctacacca aggtcaccgc cttcctcaag 1381 tggatcgaca ggtccatgaa aaccaggggc ttgcccaagg ccaagagcca tgccccggag 1441 gtcataacgt cctctccatt aaagtgagat cccactcaaa aaaaaaaaaa aaaaaaaaaa 1501 aaaaaaa // LOCUS HUMFAK 3052 bp mRNA PRI 25-FEB-1993 DEFINITION Homo sapiens focal adhesion kinase mRNA, complete cds. ACCESSION L05186 NID g182394 KEYWORDS focal adhesion kinase; protein-tyrosine kinase. SOURCE Homo sapiens (library: lambda gt10) fetal brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3052) AUTHORS Andri,E. and Becker-Andri,M. TITLE Expression of an N-terminally truncated form of human focal adhesion kinase in brain JOURNAL Biochem. Biophys. Res. Commun. 190, 140-147 (1993) MEDLINE 93135758 FEATURES Location/Qualifiers source 1..3052 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="brain" /tissue_lib="lambda gt10" CDS 15..2654 /EC_number="2.7.1.1" /codon_start=1 /function="protein/tyrosine kinase" /product="focal adhesion kinase" /db_xref="PID:g182395" /translation="MSDYWVVGKKSNYEVLEKDVGLKRFFPKSLLDSVKAKTLRKLIQ QTFRQFANLNREESILKFFEILSPVYRFDKECFKCALGSSWIISVELAIGPEEGISYL TDKGCNPTHLADFTQVQTIQYSNSEDKDRKGMLQLKIAGAPEPLTVTAPSLTIAENMA DLIDGYCRLVNGTSQSFIIRPQKEGERALPSIPKLANSEKQGMRTHAVSVSETDDYAE IIDEEDTYTMPSTRDYEIQRERIELGRCIGEGQFGDVHQGIYMSPENPALAVAIKTCK NCTSDSVREKFLQEACHYTSLHWNWCRYISDPNVDACPDPRNAELTMRQFDHPHIVKL IGVITENPVWIIMELCTLGELRSFLQVRKYSLDLASLILYAYQLSTALAYLESKRFVH RDIAARNVLVSSNDCVKLGDFGLSRYMEDSTYYKASKGKLPIKWMAPESINFRRFTSA SDVWMFGVCMWEILMHGVKPFQGVKNNDVIGRIENGERLPMPPNCPPTLYSLMTKCWA YDPSRRPRFTELKAQLSTILEEEKAQQEERMRMESRRQATVSWDSGGSDEAPPKPSRP GYPSPRSSEGFYPSPQHMVQTNHYQVSGYPGSHGITAMAGSIYPGQASLLDQTDSWNH RSQEIAMWQPNVEDSTVLDLRGIGQVLPTHLMEERLIRQQQEMEEDQRWLEKEERFLI GNQHIYQPVGKPDPAAPPKKPPRPGAPGHLGSLASLSSPADSYNEGVKLQPQEISPPP TANLDRSNDKVYENVTGLVKAVIEMSSKIQPAPPEEYVPMVKEVGLALRTLLATVDET IPLLPASTHREIEMAQKLLNSDLGELINKMKLAQQYVMTSLQQEYKKQMLTAAHALAV DAKNLLDVIDQARLKMLGQTRPH" BASE COUNT 910 a 680 c 724 g 738 t ORIGIN 1 ccggtgtgaa ggccatgagt gattactggg ttgttggaaa gaagtctaac tatgaagtat 61 tagaaaaaga tgttggttta aagcgatttt ttcctaagag tttactggat tctgtcaagg 121 ccaaaacact aagaaaactg atccaacaaa catttagaca atttgccaac cttaatagag 181 aagaaagtat tctgaaattc tttgagatcc tgtctccagt ctacagattt gataaggaat 241 gcttcaagtg tgctcttggt tcaagctgga ttatttcagt ggaactggca atcggcccag 301 aagaaggaat cagttaccta acggacaagg gctgcaatcc cacacatctt gctgacttca 361 ctcaagtgca aaccattcag tattcaaaca gtgaagacaa ggacagaaaa ggaatgctac 421 aactaaaaat agcaggtgca cccgagcctc tgacagtgac ggcaccatcc ctaaccattg 481 cggagaatat ggctgaccta atagatgggt actgccggct ggtgaatgga acctcgcagt 541 catttatcat cagacctcag aaagaaggtg aacgggcttt gccatcaata ccaaagttgg 601 ccaacagcga aaagcaaggc atgcggacac acgccgtctc tgtgtcagaa acagatgatt 661 atgctgagat tatagatgaa gaagatactt acaccatgcc ctcaaccagg gattatgaga 721 ttcaaagaga aagaatagaa cttggacgat gtattggaga aggccaattt ggagatgtac 781 atcaaggcat ttatatgagt ccagagaatc cagctttggc ggttgcaatt aaaacatgta 841 aaaactgtac ttcggacagc gtgagagaga aatttcttca agaagcctgc cattacacat 901 ctttgcactg gaattggtgc agatatataa gtgatcctaa tgttgatgcc tgcccagacc 961 ccaggaatgc agagttaaca atgcgtcagt ttgaccatcc tcatattgtg aagctgattg 1021 gagtcatcac agagaatcct gtctggataa tcatggagct gtgcacactt ggagagctga 1081 ggtcattttt gcaagtaagg aaatacagtt tggatctagc atctttgatc ctgtatgcct 1141 atcagcttag tacagctctt gcatatctag agagcaaaag atttgtacac agggacattg 1201 ctgctcggaa tgttctggtg tcctcaaatg attgtgtaaa attaggagac tttggattat 1261 cccgatatat ggaagatagt acttactaca aagcttccaa aggaaaattg cctattaaat 1321 ggatggctcc agagtcaatc aattttcgac gttttacctc agctagtgac gtatggatgt 1381 ttggtgtgtg tatgtgggag atactgatgc atggtgtgaa gccttttcaa ggagtgaaga 1441 acaatgatgt aatcggtcga attgaaaatg gggaaagatt accaatgcct ccaaattgtc 1501 ctcctaccct ctacagcctt atgacgaaat gctgggccta tgaccccagc aggcggccca 1561 ggtttactga acttaaagct cagctcagca caatcctgga ggaagagaag gctcagcaag 1621 aagagcgcat gaggatggag tccagaagac aggccacagt gtcctgggac tccggagggt 1681 ctgatgaagc accgcccaag cccagcagac cgggttatcc cagtccgagg tccagcgaag 1741 gattttatcc cagcccacag cacatggtac aaaccaatca ttaccaggtt tctggctacc 1801 ctggttcaca tggaatcaca gccatggctg gcagcatcta tccaggtcag gcatctcttt 1861 tggaccaaac agattcatgg aatcatagat ctcaggagat agcaatgtgg cagcccaatg 1921 tggaggactc tacagtattg gacctgcgag ggattgggca agtgttgcca acccatctga 1981 tggaagagcg tctaatccga cagcaacagg aaatggaaga agatcagcgc tggctggaaa 2041 aagaggaaag atttctgatt ggaaaccaac atatatatca gcctgtgggt aaaccagatc 2101 ctgcagctcc accaaagaaa ccgcctcgcc ctggagctcc cggtcatctg ggaagccttg 2161 ccagcctcag cagccctgct gacagctaca acgagggtgt caagcttcag ccccaggaaa 2221 tcagcccccc tcctactgcc aacctggacc ggtcgaatga taaggtgtac gagaatgtga 2281 cgggcctggt gaaagctgtc atcgagatgt ccagtaaaat ccagccagcc ccaccagagg 2341 agtatgtccc tatggtgaag gaagtcggct tggccctgag gacattattg gccactgtgg 2401 atgagaccat tcccctccta ccagccagca cccaccgaga gattgagatg gcacagaagc 2461 tattgaactc tgacctgggt gagctcatca acaagatgaa actggcccag cagtatgtca 2521 tgaccagcct ccagcaagag tacaaaaagc aaatgctgac tgccgctcac gccctggctg 2581 tggatgccaa aaacttactc gatgtcattg accaagcaag actgaaaatg cttgggcaga 2641 cgagaccaca ctgagcctcc cctaggagca cgtcttgcta ccctcttttg aagatgttct 2701 ctagccttcc accagcagcg aggaattaac cctgtgtcct cagtcgccag cactcacagc 2761 tccaactttt ttgaatgacc atctggttga aaaatctttc tcatataagt ttaaccacac 2821 tttgatttgg gttcattttt tgttttgttt ttttcaatca tgatattcag aaaaatccag 2881 gatccaaaat gtggcgtttt tctaagaatg aaaattatat gtaagctttt aagcatcatg 2941 aagaacaatt tatgttcaca ttaagatacg ttctaaaggg ggatggccaa ggggtgacat 3001 cttaattcct aaactacctt agctgcatag tggaagagga gagccggaat tc // LOCUS HUMFALD 1791 bp mRNA PRI 29-MAY-1996 DEFINITION Human fatty aldehyde dehydrogenase (FALDH) mRNA, complete cds. ACCESSION L47162 NID g1082035 KEYWORDS Sjogren-Larsson syndrome; aldehyde dehydrogenase; fatty aldehyde dehydrogenase; microsomal. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1791) AUTHORS De Laurenzi,V., Rogers,G.R., Hamrock,D.J., Marekov,L.N., Steinert,P.M., Compton,J.G., Markova,N. and Rizzo,W.B. TITLE Sjogren-Larsson syndrome is caused by mutations in the fatty aldehyde dehydrogenase gene JOURNAL Nature Genet. 12 (1), 52-57 (1996) MEDLINE 96122039 FEATURES Location/Qualifiers source 1..1791 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="keratinocyte" mRNA <1..1777 /gene="FALDH" gene 1..1777 /gene="FALDH" CDS 164..1621 /gene="FALDH" /codon_start=1 /product="fatty aldehyde dehydrogenase" /db_xref="PID:g1082036" /translation="MELEVRRVRQAFLSGRSRPLRFRLQQLEALRRMVQEREKDILTA IAADLCKSEFNVYSQEVITVLGEIDFMLENLPEWVTAKPVKKNVLTMLDEAYIQPQPL GVVLIIGAWNYPFVLTIQPLIGAIAAGNAVIIKPSELSENTAKILAKLLPQYLDQDLY IVINGGVEETTELLKQRFDHIFYTGNTAVGKIVMEAAAKHLTPVTLELGGKSPCYIDK DCDLDIVCRRITWGKYMNCGQTCIAPDYILCEASLQNQIVWKIKETVKEFYGENIKES PDYERIINLRHFKRILSLLEGQKIAFGGETDEATRYIAPTVLTDVDPKTKVMQEEIFG PILPIVPVKNVDEAINFINEREKPLALYVFSHNHKLIKRMIDETSSGGVTGNDVIMHF TLNSFPFGGVGSSGMGAYHGKHSFDTFSHQRPCLLKSLKREGANKLRYPPNSQSKVDW GKFFLLKRFNKEKLGLLLLTFLGIVAAVLVKAEYY" polyA_site 1791 BASE COUNT 501 a 389 c 435 g 466 t ORIGIN 1 ggcggacgga gcgagccctg ggcgagtgaa ttgtggctgt gggttgacgg tggagacacc 61 ccccggagag gcggagggaa gggaggcgag gctgcacctg catgcttccc gcctcccact 121 ccccagcgcc cccggaccgt gcagttctct gcaggaccag gccatggagc tcgaagtccg 181 gcgggtccga caggcgttcc tgtccggccg gtcgcgacct ctgcggtttc ggctgcagca 241 gctggaggcc ctgcggagga tggtgcagga gcgcgagaag gatatcctga cggccatcgc 301 cgccgacctg tgcaagagtg aattcaatgt gtacagtcag gaagtcatta ctgtccttgg 361 ggaaattgat tttatgcttg agaatcttcc tgaatgggtt actgctaaac cagttaagaa 421 gaacgtgctc accatgctgg atgaggccta tattcagcca cagcctctgg gagtggtgct 481 gataatcgga gcttggaatt accccttcgt tctcaccatt cagccactga taggagccat 541 cgctgcagga aatgctgtga ttataaagcc ttctgaactg agtgaaaata cagccaagat 601 cttggcaaag cttctccctc agtatttaga ccaggatctc tatattgtta ttaatggtgg 661 tgttgaggaa accacggagc tcctgaagca gcgatttgac cacattttct atacgggaaa 721 cactgcggtt ggcaaaattg tcatggaagc tgctgccaag catctgaccc ctgtgactct 781 tgaactggga gggaaaagtc catgttatat tgataaagat tgtgacctgg acattgtttg 841 cagacgcata acctggggaa aatacatgaa ttgtggccaa acctgcattg cacccgacta 901 tattctctgt gaagcatccc tccaaaatca aattgtatgg aagattaagg aaacagtgaa 961 ggaattttat ggagaaaata taaaagagtc tcctgattat gaaaggatca tcaatcttcg 1021 tcattttaag aggatactaa gtttgcttga aggacaaaag atagcttttg gtggggagac 1081 tgatgaggcc acacgctaca tagccccaac agtacttacc gatgttgatc ctaaaaccaa 1141 ggtgatgcaa gaagaaattt ttggaccaat tcttccaata gtgcctgtga aaaatgtaga 1201 tgaggccata aatttcataa atgaacgtga aaagcctctg gctctttatg tattttcgca 1261 taaccataag ctcatcaaac ggatgattga tgagacatcc agtggaggtg tcacaggcaa 1321 tgacgtcatt atgcacttca cgctcaactc tttcccattt ggaggagtgg gttccagtgg 1381 gatgggagct tatcacggaa aacatagttt tgatactttt tctcatcagc gtccctgttt 1441 attaaaaagt ttaaagagag aaggtgctaa caaactcaga tatcctccca acagccagtc 1501 aaaggtggat tgggggaaat tttttctctt gaaacggttc aacaaagaaa aactcggtct 1561 cctgttgctc actttcctgg gtattgtagc cgctgtgctt gtcaaggcag aatattactg 1621 aagaatgatc ctgttcaacc tcctagtgcc tctactgaat tattcctctt ttaaatggtt 1681 aatgaaccaa taatttttaa atcataccaa aaatagtaag aaaatatgca aacactctgt 1741 gatcaaactt aaaagtcatt gccattcatc attaataaaa gttgccattt c // LOCUS HUMFAPAPC 8972 bp mRNA PRI 09-DEC-1993 DEFINITION Human APC gene mRNA, complete cds. ACCESSION M74088 NID g182396 KEYWORDS . SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8972) AUTHORS Kinzler,K.W., Nilbert,M.C., Su,L.K., Vogelstein,B., Bryan,T.M., Levy,D.B., Smith,K.J., Preisinger,A.C., Hedge,P., McKechnie,D., Finniear,R., Markham,A., Groffen,J., Boguski,M.S., Altschul,S.F., Horii,A.K., Ando,H., Miyoshi,Y., Miki,Y., Nishisho,I. and Nakamura,Y. TITLE Identification of FAP locus genes from chromosome 5q21 JOURNAL Science 253, 661-665 (1991) MEDLINE 91335210 REFERENCE 2 (sites) AUTHORS Nishisho,I., Nakamura,Y., Miyoshi,Y., Miki,Y., Ando,H., Horii,A., Koyama,K., Utsunomiya,J., Baba,S. and Hedge,P. TITLE Mutations of chromosome 5q21 genes in FAP and colorectal cancer patients JOURNAL Science 253, 665-669 (1991) MEDLINE 91335211 REFERENCE 3 (bases 1 to 8972) AUTHORS Kinzler,K.W. TITLE Direct Submission JOURNAL Submitted (31-JUL-1991) K.W. Kinzler, Molecular Genetics Laboratory, Johns Hopkins University School of Medicine, Baltimore, MD 21231 USA FEATURES Location/Qualifiers source 1..8972 /organism="Homo sapiens" /db_xref="taxon:9606" /map="5q21" gene 19..8550 /gene="APC" CDS 19..8550 /gene="APC" /codon_start=1 /db_xref="PID:g182397" /translation="MAAASYDQLLKQVEALKMENSNLRQELEDNSNHLTKLETEASNM KEVLKQLQGSIEDEAMASSGQIDLLERLKELNLDSSNFPGVKLRSKMSLRSYGSREGS VSSRSGECSPVPMGSFPRRGFVNGSRESTGYLEELEKERSLLLADLDKEEKEKDWYYA QLQNLTKRIDSLPLTENFSLQTDMTRRQLEYEARQIRVAMEEQLGTCQDMEKRAQRRI ARIQQIEKDILRIRQLLQSQATEAERSSQNKHETGSHDAERQNEGQGVGEINMATSGN GQGSTTRMDHETASVLSSSSTHSAPRRLTSHLGTKVEMVYSLLSMLGTHDKDDMSRTL LAMSSSQDSCISMRQSGCLPLLIQLLHGNDKDSVLLGNSRGSKEARARASAALHNIIH SQPDDKRGRREIRVLHLLEQIRAYCETCWEWQEAHEPGMDQDKNPMPAPVEHQICPAV CVLMKLSFDEEHRHAMNELGGLQAIAELLQVDCEMYGLTNDHYSITLRRYAGMALTNL TFGDVANKATLCSMKGCMRALVAQLKSESEDLQQVIASVLRNLSWRADVNSKKTLREV GSVKALMECALEVKKESTLKSVLSALWNLSAHCTENKADICAVDGALAFLVGTLTYRS QTNTLAIIESGGGILRNVSSLIATNEDHRQILRENNCLQTLLQHLKSHSLTIVSNACG TLWNLSARNPKDQEALWDMGAVSMLKNLIHSKHKMIAMGSAAALRNLMANRPAKYKDA NIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRSKQRHKQSLYGD YVFDTNRHDDNRSDNFNTGNMTVLSPYLNTTVLPSSSSSRGSLDSSRSEKDRSLERER GIGLGNYHPATENPGTSSKRGLQISTTAAQIAKVMEEVSAIHTSQEDRSSGSTTELHC VTDERNALRRSSAAHTHSNTYNFTKSENSNRTCSMPYAKLEYKRSSNDSLNSVSSSDG YGKRGQMKPSIESYSEDDESKFCSYGQYPADLAHKIHSANHMDDNDGELDTPINYSLK YSDEQLNSGRQSPSQNERWARPKHIIEDEIKQSEQRQSRNQSTTYPVYTESTDDKHLK FQPHFGQQECVSPYRSRGANGSETNRVGSNHGINQNVSQSLCQEDDYEDDKPTNYSER YSEEEQHEEEERPTNYSIKYNEEKRHVDQPIDYSLKYATDIPSSQKQSFSFSKSSSGQ SSKTEHMSSSSENTSTPSSNAKRQNQLHPSSAQSRSGQPQKAATCKVSSINQETIQTY CVEDTPICFSRCSSLSSLSSAEDEIGCNQTTQEADSANTLQIAEIKEKIGTRSAEDPV SEVPAVSQHPRTKSSRLQGSSLSSESARHKAVEFSSGAKSPSKSGAQTPKSPPEHYVQ ETPLMFSRCTSVSSLDSFESRSIASSVQSEPCSGMVSGIISPSDLPDSPGQTMPPSRS KTPPPPPQTAQTKREVPKNKAPTAEKRESGPKQAAVNAAVQRVQVLPDADTLLHFATE STPDGFSCSSSLSALSLDEPFIQKDVELRIMPPVQENDNGNETESEQPKESNENQEKE AEKTIDSEKDLLDDSDDDDIEILEECIISAMPTKSSRKAKKPAQTASKLPPPVARKPS QLPVYKLLPSQNRLQPQKHVSFTPGDDMPRVYCVEGTPINFSTATSLSDLTIESPPNE LAAGEGVRGGAQSGEFEKRDTIPTEGRSTDEAQGGKTSSVTIPELDDNKAEEGDILAE CINSAMPKGKSHKPFRVKKIMDQVQQASASSSAPNKNQLDGKKKKPTSPVKPIPQNTE YRTRVRKNADSKNNLNAERVFSDNKDSKKQNLKNNSKDFNDKLPNNEDRVRGSFAFDS PHHYTPIEGTPYCFSRNDSLSSLDFDDDDVDLSREKAELRKAKENKESEAKVTSHTEL TSNQQSANKTQAIAKQPINRGQPKPILQKQSTFPQSSKDIPDRGAATDEKLQNFAIEN TPVCFSHNSSLSSLSDIDQENNNKENEPIKETEPPDSQGEPSKPQASGYAPKSFHVED TPVCFSRNSSLSSLSIDSEDDLLQECISSAMPKKKKPSRLKGDNEKHSPRNMGGILGE DLTLDLKDIQRPDSEHGLSPDSENFDWKAIQEGANSIVSSLHQAAAAACLSRQASSDS DSILSLKSGISLGSPFHLTPDQEEKPFTSNKGPRILKPGEKSTLETKKIESESKGIKG GKKVYKSLITGKVRSNSEISGQMKQPLQANMPSISRGRTMIHIPGVRNSSSSTSPVSK KGPPLKTPASKSPSEGQTATTSPRGAKPSVKSELSPVARQTSQIGGSSKAPSRSGSRD STPSRPAQQPLSRPIQSPGRNSISPGRNGISPPNKLSQLPRTSSPSTASTKSSGSGKM SYTSPGRQMSQQNLTKQTGLSKNASSIPRSESASKGLNQMNNGNGANKKVELSRMSST KSSGSESDRSERPVLVRQSTFIKEAPSPTLRRKLEESASFESLSPSSRPASPTRSQAQ TPVLSPSLPDMSLSTHSSVQAGGWRKLPPNLSPTIEYNDGRPAKRHDIARSHSESPSR LPINRSGTWKREHSKHSSSLPRVSTWRRTGSSSSILSASSESSEKAKSEDEKHVNSIS GTKQSKENQVSAKGTWRKIKENEFSPTNSTSQTVSSGATNGAESKTLIYQMAPAVSKT EDVWVRIEDCPINNPRSGRSPTGNTPPVIDSVSEKANPNIKDSKDNQAKQNVGNGSVP MRTVGLENRLNSFIQVDAPDQKGTEIKPGQNNPVPVSETNESSIVERTPFSSSSSSKH SSPSGTVAARVTPFNYNPSPRKSSADSTSARPSQIPTPVNNNTKKRDSKTDSTESSGT QSPKRHSGSYLVTSV" BASE COUNT 3095 a 1827 c 1821 g 2229 t ORIGIN 1 gtccaagggt agccaaggat ggctgcagct tcatatgatc agttgttaaa gcaagttgag 61 gcactgaaga tggagaactc aaatcttcga caagagctag aagataattc caatcatctt 121 acaaaactgg aaactgaggc atctaatatg aaggaagtac ttaaacaact acaaggaagt 181 attgaagatg aagctatggc ttcttctgga cagattgatt tattagagcg tcttaaagag 241 cttaacttag atagcagtaa tttccctgga gtaaaactgc ggtcaaaaat gtccctccgt 301 tcttatggaa gccgggaagg atctgtatca agccgttctg gagagtgcag tcctgttcct 361 atgggttcat ttccaagaag agggtttgta aatggaagca gagaaagtac tggatattta 421 gaagaacttg agaaagagag gtcattgctt cttgctgatc ttgacaaaga agaaaaggaa 481 aaagactggt attacgctca acttcagaat ctcactaaaa gaatagatag tcttccttta 541 actgaaaatt tttccttaca aacagatatg accagaaggc aattggaata tgaagcaagg 601 caaatcagag ttgcgatgga agaacaacta ggtacctgcc aggatatgga aaaacgagca 661 cagcgaagaa tagccagaat tcagcaaatc gaaaaggaca tacttcgtat acgacagctt 721 ttacagtccc aagcaacaga agcagagagg tcatctcaga acaagcatga aaccggctca 781 catgatgctg agcggcagaa tgaaggtcaa ggagtgggag aaatcaacat ggcaacttct 841 ggtaatggtc agggttcaac tacacgaatg gaccatgaaa cagccagtgt tttgagttct 901 agtagcacac actctgcacc tcgaaggctg acaagtcatc tgggaaccaa ggtggaaatg 961 gtgtattcat tgttgtcaat gcttggtact catgataagg atgatatgtc gcgaactttg 1021 ctagctatgt ctagctccca agacagctgt atatccatgc gacagtctgg atgtcttcct 1081 ctcctcatcc agcttttaca tggcaatgac aaagactctg tattgttggg aaattcccgg 1141 ggcagtaaag aggctcgggc cagggccagt gcagcactcc acaacatcat tcactcacag 1201 cctgatgaca agagaggcag gcgtgaaatc cgagtccttc atcttttgga acagatacgc 1261 gcttactgtg aaacctgttg ggagtggcag gaagctcatg aaccaggcat ggaccaggac 1321 aaaaatccaa tgccagctcc tgttgaacat cagatctgtc ctgctgtgtg tgttctaatg 1381 aaactttcat ttgatgaaga gcatagacat gcaatgaatg aactaggggg actacaggcc 1441 attgcagaat tattgcaagt ggactgtgaa atgtacgggc ttactaatga ccactacagt 1501 attacactaa gacgatatgc tggaatggct ttgacaaact tgacttttgg agatgtagcc 1561 aacaaggcta cgctatgctc tatgaaaggc tgcatgagag cacttgtggc ccaactaaaa 1621 tctgaaagtg aagacttaca gcaggttatt gcaagtgttt tgaggaattt gtcttggcga 1681 gcagatgtaa atagtaaaaa gacgttgcga gaagttggaa gtgtgaaagc attgatggaa 1741 tgtgctttag aagttaaaaa ggaatcaacc ctcaaaagcg tattgagtgc cttatggaat 1801 ttgtcagcac attgcactga gaataaagct gatatatgtg ctgtagatgg tgcacttgca 1861 tttttggttg gcactcttac ttaccggagc cagacaaaca ctttagccat tattgaaagt 1921 ggaggtggga tattacggaa tgtgtccagc ttgatagcta caaatgagga ccacaggcaa 1981 atcctaagag agaacaactg tctacaaact ttattacaac acttaaaatc tcatagtttg 2041 acaatagtca gtaatgcatg tggaactttg tggaatctct cagcaagaaa tcctaaagac 2101 caggaagcat tatgggacat gggggcagtt agcatgctca agaacctcat tcattcaaag 2161 cacaaaatga ttgctatggg aagtgctgca gctttaagga atctcatggc aaataggcct 2221 gcgaagtaca aggatgccaa tattatgtct cctggctcaa gcttgccatc tcttcatgtt 2281 aggaaacaaa aagccctaga agcagaatta gatgctcagc acttatcaga aacttttgac 2341 aatatagaca atttaagtcc caaggcatct catcgtagta agcagagaca caagcaaagt 2401 ctctatggtg attatgtttt tgacaccaat cgacatgatg ataataggtc agacaatttt 2461 aatactggca acatgactgt cctttcacca tatttgaata ctacagtgtt acccagctcc 2521 tcttcatcaa gaggaagctt agatagttct cgttctgaaa aagatagaag tttggagaga 2581 gaacgcggaa ttggtctagg caactaccat ccagcaacag aaaatccagg aacttcttca 2641 aagcgaggtt tgcagatctc caccactgca gcccagattg ccaaagtcat ggaagaagtg 2701 tcagccattc atacctctca ggaagacaga agttctgggt ctaccactga attacattgt 2761 gtgacagatg agagaaatgc acttagaaga agctctgctg cccatacaca ttcaaacact 2821 tacaatttca ctaagtcgga aaattcaaat aggacatgtt ctatgcctta tgccaaatta 2881 gaatacaaga gatcttcaaa tgatagttta aatagtgtca gtagtagtga tggttatggt 2941 aaaagaggtc aaatgaaacc ctcgattgaa tcctattctg aagatgatga aagtaagttt 3001 tgcagttatg gtcaataccc agccgaccta gcccataaaa tacatagtgc aaatcatatg 3061 gatgataatg atggagaact agatacacca ataaattata gtcttaaata ttcagatgag 3121 cagttgaact ctggaaggca aagtccttca cagaatgaaa gatgggcaag acccaaacac 3181 ataatagaag atgaaataaa acaaagtgag caaagacaat caaggaatca aagtacaact 3241 tatcctgttt atactgagag cactgatgat aaacacctca agttccaacc acattttgga 3301 cagcaggaat gtgtttctcc atacaggtca cggggagcca atggttcaga aacaaatcga 3361 gtgggttcta atcatggaat taatcaaaat gtaagccagt ctttgtgtca agaagatgac 3421 tatgaagatg ataagcctac caattatagt gaacgttact ctgaagaaga acagcatgaa 3481 gaagaagaga gaccaacaaa ttatagcata aaatataatg aagagaaacg tcatgtggat 3541 cagcctattg attatagttt aaaatatgcc acagatattc cttcatcaca gaaacagtca 3601 ttttcattct caaagagttc atctggacaa agcagtaaaa ccgaacatat gtcttcaagc 3661 agtgagaata cgtccacacc ttcatctaat gccaagaggc agaatcagct ccatccaagt 3721 tctgcacaga gtagaagtgg tcagcctcaa aaggctgcca cttgcaaagt ttcttctatt 3781 aaccaagaaa caatacagac ttattgtgta gaagatactc caatatgttt ttcaagatgt 3841 agttcattat catctttgtc atcagctgaa gatgaaatag gatgtaatca gacgacacag 3901 gaagcagatt ctgctaatac cctgcaaata gcagaaataa aagaaaagat tggaactagg 3961 tcagctgaag atcctgtgag cgaagttcca gcagtgtcac agcaccctag aaccaaatcc 4021 agcagactgc agggttctag tttatcttca gaatcagcca ggcacaaagc tgttgaattt 4081 tcttcaggag cgaaatctcc ctccaaaagt ggtgctcaga cacccaaaag tccacctgaa 4141 cactatgttc aggagacccc actcatgttt agcagatgta cttctgtcag ttcacttgat 4201 agttttgaga gtcgttcgat tgccagctcc gttcagagtg aaccatgcag tggaatggta 4261 agtggcatta taagccccag tgatcttcca gatagccctg gacaaaccat gccaccaagc 4321 agaagtaaaa cacctccacc acctcctcaa acagctcaaa ccaagcgaga agtacctaaa 4381 aataaagcac ctactgctga aaagagagag agtggaccta agcaagctgc agtaaatgct 4441 gcagttcaga gggtccaggt tcttccagat gctgatactt tattacattt tgccacggaa 4501 agtactccag atggattttc ttgttcatcc agcctgagtg ctctgagcct cgatgagcca 4561 tttatacaga aagatgtgga attaagaata atgcctccag ttcaggaaaa tgacaatggg 4621 aatgaaacag aatcagagca gcctaaagaa tcaaatgaaa accaagagaa agaggcagaa 4681 aaaactattg attctgaaaa ggacctatta gatgattcag atgatgatga tattgaaata 4741 ctagaagaat gtattatttc tgccatgcca acaaagtcat cacgtaaagc aaaaaagcca 4801 gcccagactg cttcaaaatt acctccacct gtggcaagga aaccaagtca gctgcctgtg 4861 tacaaacttc taccatcaca aaacaggttg caaccccaaa agcatgttag ttttacaccg 4921 ggggatgata tgccacgggt gtattgtgtt gaagggacac ctataaactt ttccacagct 4981 acatctctaa gtgatctaac aatcgaatcc cctccaaatg agttagctgc tggagaagga 5041 gttagaggag gagcacagtc aggtgaattt gaaaaacgag ataccattcc tacagaaggc 5101 agaagtacag atgaggctca aggaggaaaa acctcatctg taaccatacc tgaattggat 5161 gacaataaag cagaggaagg tgatattctt gcagaatgca ttaattctgc tatgcccaaa 5221 gggaaaagtc acaagccttt ccgtgtgaaa aagataatgg accaggtcca gcaagcatct 5281 gcgtcgtctt ctgcacccaa caaaaatcag ttagatggta agaaaaagaa accaacttca 5341 ccagtaaaac ctataccaca aaatactgaa tataggacac gtgtaagaaa aaatgcagac 5401 tcaaaaaata atttaaatgc tgagagagtt ttctcagaca acaaagattc aaagaaacag 5461 aatttgaaaa ataattccaa ggacttcaat gataagctcc caaataatga agatagagtc 5521 agaggaagtt ttgcttttga ttcacctcat cattacacgc ctattgaagg aactccttac 5581 tgtttttcac gaaatgattc tttgagttct ctagattttg atgatgatga tgttgacctt 5641 tccagggaaa aggctgaatt aagaaaggca aaagaaaata aggaatcaga ggctaaagtt 5701 accagccaca cagaactaac ctccaaccaa caatcagcta ataagacaca agctattgca 5761 aagcagccaa taaatcgagg tcagcctaaa cccatacttc agaaacaatc cacttttccc 5821 cagtcatcca aagacatacc agacagaggg gcagcaactg atgaaaagtt acagaatttt 5881 gctattgaaa atactccagt ttgcttttct cataattcct ctctgagttc tctcagtgac 5941 attgaccaag aaaacaacaa taaagaaaat gaacctatca aagagactga gccccctgac 6001 tcacagggag aaccaagtaa acctcaagca tcaggctatg ctcctaaatc atttcatgtt 6061 gaagataccc cagtttgttt ctcaagaaac agttctctca gttctcttag tattgactct 6121 gaagatgacc tgttgcagga atgtataagc tccgcaatgc caaaaaagaa aaagccttca 6181 agactcaagg gtgataatga aaaacatagt cccagaaata tgggtggcat attaggtgaa 6241 gatctgacac ttgatttgaa agatatacag agaccagatt cagaacatgg tctatcccct 6301 gattcagaaa attttgattg gaaagctatt caggaaggtg caaattccat agtaagtagt 6361 ttacatcaag ctgctgctgc tgcatgttta tctagacaag cttcgtctga ttcagattcc 6421 atcctttccc tgaaatcagg aatctctctg ggatcaccat ttcatcttac acctgatcaa 6481 gaagaaaaac cctttacaag taataaaggc ccacgaattc taaaaccagg ggagaaaagt 6541 acattggaaa ctaaaaagat agaatctgaa agtaaaggaa tcaaaggagg aaaaaaagtt 6601 tataaaagtt tgattactgg aaaagttcga tctaattcag aaatttcagg ccaaatgaaa 6661 cagccccttc aagcaaacat gccttcaatc tctcgaggca ggacaatgat tcatattcca 6721 ggagttcgaa atagctcctc aagtacaagt cctgtttcta aaaaaggccc accccttaag 6781 actccagcct ccaaaagccc tagtgaaggt caaacagcca ccacttctcc tagaggagcc 6841 aagccatctg tgaaatcaga attaagccct gttgccaggc agacatccca aataggtggg 6901 tcaagtaaag caccttctag atcaggatct agagattcga ccccttcaag acctgcccag 6961 caaccattaa gtagacctat acagtctcct ggccgaaact caatttcccc tggtagaaat 7021 ggaataagtc ctcctaacaa attatctcaa cttccaagga catcatcccc tagtactgct 7081 tcaactaagt cctcaggttc tggaaaaatg tcatatacat ctccaggtag acagatgagc 7141 caacagaacc ttaccaaaca aacaggttta tccaagaatg ccagtagtat tccaagaagt 7201 gagtctgcct ccaaaggact aaatcagatg aataatggta atggagccaa taaaaaggta 7261 gaactttcta gaatgtcttc aactaaatca agtggaagtg aatctgatag atcagaaaga 7321 cctgtattag tacgccagtc aactttcatc aaagaagctc caagcccaac cttaagaaga 7381 aaattggagg aatctgcttc atttgaatct ctttctccat catctagacc agcttctccc 7441 actaggtccc aggcacaaac tccagtttta agtccttccc ttcctgatat gtctctatcc 7501 acacattcgt ctgttcaggc tggtggatgg cgaaaactcc cacctaatct cagtcccact 7561 atagagtata atgatggaag accagcaaag cgccatgata ttgcacggtc tcattctgaa 7621 agtccttcta gacttccaat caataggtca ggaacctgga aacgtgagca cagcaaacat 7681 tcatcatccc ttcctcgagt aagcacttgg agaagaactg gaagttcatc ttcaattctt 7741 tctgcttcat cagaatccag tgaaaaagca aaaagtgagg atgaaaaaca tgtgaactct 7801 atttcaggaa ccaaacaaag taaagaaaac caagtatccg caaaaggaac atggagaaaa 7861 ataaaagaaa atgaattttc tcccacaaat agtacttctc agaccgtttc ctcaggtgct 7921 acaaatggtg ctgaatcaaa gactctaatt tatcaaatgg cacctgctgt ttctaaaaca 7981 gaggatgttt gggtgagaat tgaggactgt cccattaaca atcctagatc tggaagatct 8041 cccacaggta atactccccc ggtgattgac agtgtttcag aaaaggcaaa tccaaacatt 8101 aaagattcaa aagataatca ggcaaaacaa aatgtgggta atggcagtgt tcccatgcgt 8161 accgtgggtt tggaaaatcg cctgaactcc tttattcagg tggatgcccc tgaccaaaaa 8221 ggaactgaga taaaaccagg acaaaataat cctgtccctg tatcagagac taatgaaagt 8281 tctatagtgg aacgtacccc attcagttct agcagctcaa gcaaacacag ttcacctagt 8341 gggactgttg ctgccagagt gactcctttt aattacaacc caagccctag gaaaagcagc 8401 gcagatagca cttcagctcg gccatctcag atcccaactc cagtgaataa caacacaaag 8461 aagcgagatt ccaaaactga cagcacagaa tccagtggaa cccaaagtcc taagcgccat 8521 tctgggtctt accttgtgac atctgtttaa aagagaggaa gaatgaaact aagaaaattc 8581 tatgttaatt acaactgcta tatagacatt ttgtttcaaa tgaaacttta aaagactgaa 8641 aaattttgta aataggtttg attcttgtta gagggttttt gttctggaag ccatatttga 8701 tagtatactt tgtcttcact ggtcttattt tgggaggcac tcttgatggt taggaaaaaa 8761 atagtaaagc caagtatgtt tgtacagtat gttttacatg tatttaaagt agcacccatc 8821 ccaacttcct ttaattattg cttgtcttaa aataatgaac actacagata gaaaatatga 8881 tatattgctg ttatcaatca tttctagatt ataaactgac taaacttaca tcagggaaaa 8941 attggtattt atgcaaaaaa aaatgttttt gt // LOCUS HUMFAPS 1148 bp mRNA PRI 08-NOV-1994 DEFINITION Human farnesyl pyrophosphate synthetase mRNA, complete cds. ACCESSION J05262 NID g182398 KEYWORDS farnesyl pyrophosphate synthetase. SOURCE Human hepatoma cell line HepG2, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1139 to 1148) AUTHORS Wilkin,D.J. JOURNAL Unpublished (1990) REFERENCE 2 (bases 1 to 1138) AUTHORS Wilkin,D.J., Kutsunai,S.Y. and Edwards,P.A. TITLE Isolation and sequence of the human farnesyl pyrophosphate synthetase cDNA. Coordinate regulation of the mRNAs for farnesyl pyrophosphate synthetase, 3-hydroxy-3-methylglutaryl coenzyme A reductase, and 3-hydroxy-3-methylglutaryl coenzyme A synthase by phorbol ester JOURNAL J. Biol. Chem. 265 (8), 4607-4614 (1990) MEDLINE 90170972 COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by D.J.Wilkin, 12-JAN-1990, for release after publication. FEATURES Location/Qualifiers source 1..1148 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Unassigned" gene 7..1068 /gene="FDPS" CDS 7..1068 /gene="FDPS" /note="farnesyl pyrophosphate synthetase (EC 2.5.1.1)" /codon_start=1 /db_xref="GDB:G00-128-629" /db_xref="PID:g182399" /translation="MNGDQNSDVYAQEKQDFVQHFSQIVRVLTEDEMGHPEIGDAIAR LKEVLEYNAIGGKYNRGLTVVVAFRELVEPRKQDADSLQRAWTVGWCVELLQAFFLVA DDIMDSSLTRRGQTCWYQKPGVGLDAINDANLLEACIYRLLKLYCREQPYYLNLIELF LQSSYQTEIGQTLDLLTAPQGNVDLVRFTEKRYKSIVKYKTAFYSFYLPIAAAMYMAG IDGEKEHANAKKILLEMGEFFQIQDDYLDLFGDPSVTGKIGTDIQDNKCSWLVVQCLQ RATPEQYQILKENYGQKEAEKVARVKALYEELDLPAVFLQYEEDSYSHIMALIEQYAA PLPPAVFLGLARKIYKRRK" BASE COUNT 302 a 272 c 324 g 250 t ORIGIN 1 cacagaatga acggagacca gaattcagat gtttatgccc aagaaaagca ggatttcgtt 61 cagcacttct cccagatcgt tagggtgctg actgaggatg agatggggca cccagagata 121 ggagatgcta ttgcccggct caaggaggtc ctggagtaca atgccattgg aggcaagtat 181 aaccggggtt tgacggtggt agtagcattc cgggagctgg tggagccaag gaaacaggat 241 gctgatagtc tccagcgggc ctggactgtg ggctggtgtg tggaactgct gcaagctttc 301 ttcctggtgg cagatgacat catggattca tcccttaccc gccggggaca gacctgctgg 361 tatcagaagc cgggcgtggg tttggatgcc atcaatgatg ctaacctcct ggaagcatgt 421 atctaccgcc tgctgaagct ctattgccgg gagcagccct attacctgaa cctgatcgag 481 ctcttcctgc agagttccta tcagactgag attgggcaga ccctggacct cctcacagcc 541 ccccagggca atgtggatct tgtcagattc actgaaaaga ggtacaaatc tattgtcaag 601 tacaagacag ctttctactc cttctacctt cctatagctg cagccatgta catggcagga 661 attgatggcg agaaggagca cgccaatgcc aagaagatcc tgctggagat gggggagttc 721 tttcagattc aggatgatta ccttgacctc tttggggacc ccagtgtgac cggcaaaatt 781 ggcactgaca tccaggacaa caaatgcagc tggctggtgg ttcagtgtct gcaacgggcc 841 actccagaac agtaccagat cctgaaggaa aattacgggc agaaggaggc tgagaaagtg 901 gcccgggtga aggcgctata tgaggagctg gatctgccag cagtgttctt gcaatatgag 961 gaagacagtt acagccacat tatggctctc attgaacagt acgcagcacc cctgccccca 1021 gccgtctttc tggggcttgc gcgcaaaatc tacaagcgga gaaagtgacc tagagattgc 1081 aagggcgggg agaggaggct ctcaataaat aatcgtgtaa ccttaaaaaa aaaaaaaacc 1141 tcgacgat // LOCUS HUMFASANT 2534 bp mRNA PRI 06-MAR-1995 DEFINITION Human Fas antigen (fas) mRNA, complete cds. ACCESSION M67454 NID g182409 KEYWORDS Fas antigen; cell surface antigen; transmembrane protein. SOURCE Homo sapiens (clone pF58) (tissue library: pCEV4) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2534) AUTHORS Itoh,N., Yonehara,S., Ishii,A., Yonehara,M., Mizushima,S., Sameshima,M., Hase,A., Seto,Y. and Nagata,S. TITLE The polypeptide encoded by the cDNA for human cell surface antigen Fas can mediate apoptosis JOURNAL Cell 66 (2), 233-243 (1991) MEDLINE 91309137 FEATURES Location/Qualifiers source 1..2534 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pF58" /cell_line="KT3" /tissue_lib="pCEV4" gene 195..2523 /gene="fas" sig_peptide 195..242 /gene="fas" CDS 195..1202 /gene="fas" /codon_start=1 /product="Fas antigen" /db_xref="PID:g182410" /translation="MLGIWTLLPLVLTSVARLSSKSVNAQVTDINSKGLELRKTVTTV ETQNLEGLHHDGQFCHKPCPPGERKARDCTVNGDEPDCVPCQEGKEYTDKAHFSSKCR RCRLCDEGHGLEVEINCTRTQNTKCRCKPNFFCNSTVCEHCDPCTKCEHGIIKECTLT SNTKCKEEGSRSNLGWLCLLLLPIPLIVWVKRKEVQKTCRKHRKENQGSHESPTLNPE TVAINLSDVDLSKYITTIAGVMTLSQVKGFVRKNGVNEAKIDEIKNDNVQDTAEQKVQ LLRNWHQLHGKKEAYDTLIKDLKKANLCTLAEKIQTIILKDITSDSENSNFRNEIQSL V" mat_peptide 243..1199 /gene="fas" /product="Fas antigen" polyA_signal 1831..1836 /gene="fas" polyA_signal 2352..2357 /gene="fas" polyA_signal 2518..2523 /gene="fas" BASE COUNT 817 a 487 c 503 g 727 t ORIGIN 1 gacgcttctg gggagtgagg gaagcggttt acgagtgact tggctggagc ctcaggggcg 61 ggcactggca cggaacacac cctgaggcca gccctggctg cccaggcgga gctgcctctt 121 ctcccgcggg ttggtggacc cgctcagtac ggagttgggg aagctctttc acttcggagg 181 attgctcaac aaccatgctg ggcatctgga ccctcctacc tctggttctt acgtctgttg 241 ctagattatc gtccaaaagt gttaatgccc aagtgactga catcaactcc aagggattgg 301 aattgaggaa gactgttact acagttgaga ctcagaactt ggaaggcctg catcatgatg 361 gccaattctg ccataagccc tgtcctccag gtgaaaggaa agctagggac tgcacagtca 421 atggggatga accagactgc gtgccctgcc aagaagggaa ggagtacaca gacaaagccc 481 atttttcttc caaatgcaga agatgtagat tgtgtgatga aggacatggc ttagaagtgg 541 aaataaactg cacccggacc cagaatacca agtgcagatg taaaccaaac tttttttgta 601 actctactgt atgtgaacac tgtgaccctt gcaccaaatg tgaacatgga atcatcaagg 661 aatgcacact caccagcaac accaagtgca aagaggaagg atccagatct aacttggggt 721 ggctttgtct tcttcttttg ccaattccac taattgtttg ggtgaagaga aaggaagtac 781 agaaaacatg cagaaagcac agaaaggaaa accaaggttc tcatgaatct ccaaccttaa 841 atcctgaaac agtggcaata aatttatctg atgttgactt gagtaaatat atcaccacta 901 ttgctggagt catgacacta agtcaagtta aaggctttgt tcgaaagaat ggtgtcaatg 961 aagccaaaat agatgagatc aagaatgaca atgtccaaga cacagcagaa cagaaagttc 1021 aactgcttcg taattggcat caacttcatg gaaagaaaga agcgtatgac acattgatta 1081 aagatctcaa aaaagccaat ctttgtactc ttgcagagaa aattcagact atcatcctca 1141 aggacattac tagtgactca gaaaattcaa acttcagaaa tgaaatccaa agcttggtct 1201 agagtgaaaa acaacaaatt cagttctgag tatatgcaat tagtgtttga aaagattctt 1261 aatagctggc tgtaaatact gcttggtttt ttactgggta cattttatca tttattagcg 1321 ctgaagagcc aacatatttg tagattttta atatctcatg attctgcctc caaggatgtt 1381 taaaatctag ttgggaaaac aaacttcatc aagagtaaat gcagtggcat gctaagtacc 1441 caaataggag tgtatgcaga ggatgaaaga ttaagattat gctctggcat ctaacatatg 1501 attctgtagt atgaatgtaa tcagtgtatg ttagtacaaa tgtctatcca caggctaacc 1561 ccactctatg aatcaataga agaagctatg accttttgct gaaatatcag ttactgaaca 1621 ggcaggccac tttgcctcta aattacctct gataattcta gagattttac catatttcta 1681 aactttgttt ataactctga gaagatcata tttatgtaaa gtatatgtat ttgagtgcag 1741 aatttaaata aggctctacc tcaaagacct ttgcacagtt tattggtgtc atattataca 1801 atatttcaat tgtgaattca catagaaaac attaaattat aatgtttgac tattatatat 1861 gtgtatgcat tttactggct caaaactacc tacttctttc tcaggcatca aaagcatttt 1921 gagcaggaga gtattactag agctttgcca cctctccatt tttgccttgg tgctcatctt 1981 aatggcctaa tgcaccccca aacatggaaa tatcaccaaa aaatacttaa tagtccacca 2041 aaaggcaaga ctgcccttag aaattctagc ctggtttgga gatactaact gctctcagag 2101 aaagtagctt tgtgacatgt catgaaccca tgtttgcaat caaagatgat aaaatagatt 2161 cttatttttc ccccaccccc gaaaatgttc aataatgtcc catgtaaaac ctgctacaaa 2221 tggcagctta tacatagcaa tggtaaaatc atcatctgga tttaggaatt gctcttgtca 2281 taccctcaag tttctaagat ttaagattct ccttactact atcctacgtt taaatatctt 2341 tgaaagtttg tattaaatgt gaattttaag aaataatatt tatatttctg taaatgtaaa 2401 ctgtgaagat agttataaac tgaagcagat acctggaacc acctaaagaa cttccattta 2461 tggaggattt ttttgcccct tgtgtttgga attataaaat ataggtaaaa gtacgtaatt 2521 aaataatgtt tttg // LOCUS HUMFAV 6909 bp mRNA PRI 08-NOV-1994 DEFINITION Human coagulation factor V mRNA, complete cds. ACCESSION M16967 NID g182411 KEYWORDS blood coagulation factor; factor V. SOURCE Human fetal liver, cDNA to mRNA, clones V401 and V402. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6909) AUTHORS Jenny,R.J., Pittman,D.D., Toole,J.J., Kriz,R.W., Aldape,R.A., Hewick,R.M., Kaufman,R.J. and Mann,K.G. TITLE Complete cDNA and derived amino acid sequence of human factor V JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84 (14), 4846-4850 (1987) MEDLINE 87260886 FEATURES Location/Qualifiers source 1..6909 /organism="Homo sapiens" /db_xref="taxon:9606" /map="1q21-q25" gene 91..6765 /gene="F5" CDS 91..6765 /gene="F5" /note="coagulation factor V precursor" /codon_start=1 /db_xref="GDB:G00-119-896" /db_xref="PID:g182412" /translation="MFPGCPRLWVLVVLGTSWVGWGSQGTEAAQLRQFYVAAQGISWS YRPEPTNSSLNLSVTSFKKIVYREYEPYFKKEKPQSTISGLLGPTLYAEVGDIIKVHF KNKADKPLSIHPQGIRYSKLSEGASYLDHTFPAEKMDDAVAPGREYTYEWSISEDSGP THDDPPCLTHIYYSHENLIEDFNSGLIGPLLICKKGTLTEGGTQKTFDKQIVLLFAVF DESKSWSQSSSLMYTVNGYVNGTMPDITVCAHDHISWHLLGMSSGPELFSIHFNGQVL EQNHHKVSAITLVSATSTTANMTVGPEGKWIISSLTPKHLQAGMQAYIDIKNCPKKTR NLKKITREQRRHMKRWEYFIAAEEVIWDYAPVIPANMDKKYRSQHLDNFSNQIGKHYK KVMYTQYEDESFTKHTVNPNMKEDGILGPIIRAQVRDTLKIVFKNMASRPYSIYPHGV TFSPYEDEVNSSFTSGRNNTMIRAVQPGETYTYKWNILEFDEPTENDAQCLTRPYYSD VDIMRDIASGLIGLLLICKSRSLDRRGIQRAADIEQQAVFAVFDENKSWYLEDNINKF CENPDEVKRDDPKFYESNIMSTINGYVPESITTLGFCFDDTVQWHFCSVGTQNEILTI HFTGHSFIYGKRHEDTLTLFPMRGESVTVTMDNVGTWMLTSMNSSPRSKKLRLKFRDV KCIPDDDEDSYEIFEPPESTVMATRKMHDRLEPEDEESDADYDYQNRLAAALGIRSFR NSSLNQEEEEFNLTALALENGTEFVSSNTDIIVGSNYSSPSNISKFTVNNLAEPQKAP SHQQATTAGSPLRHLIGKNSVLNSSTAEHSSPYSEDPIEDPLQPDVTGIRLLSLGAGE FRSQEHAKRKGPKVERDQAAKHRFSWMKLLAHKVGRHLSQDTGSPSGMRPWEDLPSQD TGSPSRMRPWEDPPSDLLLLKQSNSSKILVGRWHLASEKGSYEIIQDTDEDTAVNNWL ISPQNASRAWGESTPLANKPGKQSGHPKFPRVRHKSLQVRQDGGKSRLKKSQFLIKTR KKKKEKHTHHAPLSPRTFHPLRSEAYNTFSERRLKHSLVLHKSNETSLPTDLNQTLPS MDFGWIASLPDHNQNSSNDTGQASCPPGLYQTVPPEEHYQTFPIQDPDQMHSTSDPSH RSSSPELSEMLEYDRSHKSFPTDISQMSPSSEHEVWQTVISPDLSQVTLSPELSQTNL SPDLSHTTLSPELIQRNLSPALGQMPISPDLSHTTLSPDLSHTTLSLDLSQTNLSPEL SQTNLSPALGQMPLSPDLSHTTLSLDFSQTNLSPELSHMTLSPELSQTNLSPALGQMP ISPDLSHTTLSLDFSQTNLSPELSQTNLSPALGQMPLSPDPSHTTLSLDLSQTNLSPE LSQTNLSPDLSEMPLFADLSQIPLTPDLDQMTLSPDLGETDLSPNFGQMSLSPDLSQV TLSPDISDTTLLPDLSQISPPPDLDQIFYPSESSQSLLLQEFNESFPYPDLGQMPSPS SPTLNDTFLSKEFNPLVIVGLSKDGTDYIEIIPKEEVQSSEDDYAEIDYVPYDDPYKT DVRTNINSSRDPDNIAAWYLRSNNGNRRNYYIAAEEISWDYSEFVQRETDIEDSDDIP EDTTYKKVVFRKYLDSTFTKRDPRGEYEEHLGILGPIIRAEVDDVIQVRFKNLASRPY SLHAHGLSYEKSSEGKTYEDDSPEWFKEDNAVQPNSSYTYVWHATERSGPESPGSACR AWAYYSAVNPEKDIHSGLIGPLLICQKGILHKDSNMPVDMREFVLLFMTFDEKKSWYY EKKSRSSWRLTSSEMKKSHEFHAINGMIYSLPGLKMYEQEWVRLHLLNIGGSQDIHVV HFHGQTLLENGNKQHQLGVWPLLPGSFKTLEMKASKPGWWLLNTEVGENQRAGMQTPF LIMDRDCRMPMGLSTGIISDSQIKASEFLGYWEPRLARLNNGGSYNAWSVEKLAAEFA SKPWIQVDMQKEVIITGIQTQGAKHYLKSCYTTEFYVAYSSNQINWQIFKGNSTRNVM YFNGNSDASTIKENQFDPPIVARYIRISPTRAYNRPTLRLELQGCEVNGCSTPLGMEN GKIENKQITASSFKKSWWGDYWEPFRARLNAQGRVNAWQAKANNNKQWLEIDLLKIKK ITAIITQGCKSLSSEMYVKSYTIHYSEQGVEWKPYRLKSSMVDKIFEGNTNTKGHVKN FFNPPIISRFIRVIPKTWNQSITLRLELFGCDIY" sig_peptide 91..174 /gene="F5" /note="coagulation factor V signal peptide" mat_peptide 2302..4809 /gene="F5" /note="coagulation factor Va (alt.)" mat_peptide 3229..4809 /gene="F5" /note="coagulation factor Va' (alt.)" BASE COUNT 2096 a 1701 c 1430 g 1682 t ORIGIN 1 bp upstream of EcoRI site; chromosome 1q21-q25. 1 gaattccgca gcccggagtg tggttagcag ctcggcaagc gctgcccagg tcctggggtg 61 gtggcagcca gcgggagcag gaaaggaagc atgttcccag gctgcccacg cctctgggtc 121 ctggtggtct tgggcaccag ctgggtaggc tgggggagcc aagggacaga agcggcacag 181 ctaaggcagt tctacgtggc tgctcagggc atcagttgga gctaccgacc tgagcccaca 241 aactcaagtt tgaatctttc tgtaacttcc tttaagaaaa ttgtctacag agagtatgaa 301 ccatatttta agaaagaaaa accacaatct accatttcag gacttcttgg gcctacttta 361 tatgctgaag tcggagacat cataaaagtt cactttaaaa ataaggcaga taagcccttg 421 agcatccatc ctcaaggaat taggtacagt aaattatcag aaggtgcttc ttaccttgac 481 cacacattcc ctgcggagaa gatggacgac gctgtggctc caggccgaga atacacctat 541 gaatggagta tcagtgagga cagtggaccc acccatgatg accctccatg cctcacacac 601 atctattact cccatgaaaa tctgatcgag gatttcaact cggggctgat tgggcccctg 661 cttatctgta aaaaagggac cctaactgag ggtgggacac agaagacgtt tgacaagcaa 721 atcgtgctac tatttgctgt gtttgatgaa agcaagagct ggagccagtc atcatcccta 781 atgtacacag tcaatggata tgtgaatggg acaatgccag atataacagt ttgtgcccat 841 gaccacatca gctggcatct gctgggaatg agctcggggc cagaattatt ctccattcat 901 ttcaacggcc aggtcctgga gcagaaccat cataaggtct cagccatcac ccttgtcagt 961 gctacatcca ctaccgcaaa tatgactgtg ggcccagagg gaaagtggat catatcttct 1021 ctcaccccaa aacatttgca agctgggatg caggcttaca ttgacattaa aaactgccca 1081 aagaaaacca ggaatcttaa gaaaataact cgtgagcaga ggcggcacat gaagaggtgg 1141 gaatacttca ttgctgcaga ggaagtcatt tgggactatg cacctgtaat accagcgaat 1201 atggacaaaa aatacaggtc tcagcatttg gataatttct caaaccaaat tggaaaacat 1261 tataagaaag ttatgtacac acagtacgaa gatgagtcct tcaccaaaca tacagtgaat 1321 cccaatatga aagaagatgg gattttgggt cctattatca gagcccaggt cagagacaca 1381 ctcaaaatcg tgttcaaaaa tatggccagc cgcccctata gcatttaccc tcatggagtg 1441 accttctcgc cttatgaaga tgaagtcaac tcttctttca cctcaggcag gaacaacacc 1501 atgatcagag cagttcaacc aggggaaacc tatacttata agtggaacat cttagagttt 1561 gatgaaccca cagaaaatga tgcccagtgc ttaacaagac catactacag tgacgtggac 1621 atcatgagag acatcgcctc tgggctaata ggactacttc taatctgtaa gagcagatcc 1681 ctggacaggc gaggaataca gagggcagca gacatcgaac agcaggctgt gtttgctgtg 1741 tttgatgaga acaaaagctg gtaccttgag gacaacatca acaagttttg tgaaaatcct 1801 gatgaggtga aacgtgatga ccccaagttt tatgaatcaa acatcatgag cactatcaat 1861 ggctatgtgc ctgagagcat aactactctt ggattctgct ttgatgacac tgtccagtgg 1921 cacttctgta gtgtggggac ccagaatgaa attttgacca tccacttcac tgggcactca 1981 ttcatctatg gaaagaggca tgaggacacc ttgaccctct tccccatgcg tggagaatct 2041 gtgacggtca caatggataa tgttggaact tggatgttaa cttccatgaa ttctagtcca 2101 agaagcaaaa agctgaggct gaaattcagg gatgttaaat gtatcccaga tgatgatgaa 2161 gactcatatg agatttttga acctccagaa tctacagtca tggctacacg gaaaatgcat 2221 gatcgtttag aacctgaaga tgaagagagt gatgctgact atgattacca gaacagactg 2281 gctgcagcat taggaattag gtcattccga aactcatcat tgaaccagga agaagaagag 2341 ttcaatctta ctgccctagc tctggagaat ggcactgaat tcgtttcttc gaacacagat 2401 ataattgttg gttcaaatta ttcttcccca agtaatatta gtaagttcac tgtcaataac 2461 cttgcagaac ctcagaaagc cccttctcac caacaagcca ccacagctgg ttccccactg 2521 agacacctca ttggcaagaa ctcagttctc aattcttcca cagcagagca ttccagccca 2581 tattctgaag accctataga ggatcctcta cagccagatg tcacagggat acgtctactt 2641 tcacttggtg ctggagaatt cagaagtcaa gaacatgcta agcgtaaggg acccaaggta 2701 gaaagagatc aagcagcaaa gcacaggttc tcctggatga aattactagc acataaagtt 2761 gggagacacc taagccaaga cactggttct ccttccggaa tgaggccctg ggaggacctt 2821 cctagccaag acactggttc tccttccaga atgaggccct gggaggaccc tcctagtgat 2881 ctgttactct taaaacaaag taactcatct aagattttgg ttgggagatg gcatttggct 2941 tctgagaaag gtagctatga aataatccaa gatactgatg aagacacagc tgttaacaat 3001 tggctgatca gcccccagaa tgcctcacgt gcttggggag aaagcacccc tcttgccaac 3061 aagcctggaa agcagagtgg ccacccaaag tttcctagag ttagacataa atctctacaa 3121 gtaagacagg atggaggaaa gagtagactg aagaaaagcc agtttctcat taagacacga 3181 aaaaagaaaa aagagaagca cacacaccat gctcctttat ctccgaggac ctttcaccct 3241 ctaagaagtg aagcctacaa cacattttca gaaagaagac ttaagcattc gttggtgctt 3301 cataaatcca atgaaacatc tcttcccaca gacctcaatc agacattgcc ctctatggat 3361 tttggctgga tagcctcact tcctgaccat aatcagaatt cctcaaatga cactggtcag 3421 gcaagctgtc ctccaggtct ttatcagaca gtgcccccag aggaacacta tcaaacattc 3481 cccattcaag accctgatca aatgcactct acttcagacc ccagtcacag atcctcttct 3541 ccagagctca gtgaaatgct tgagtatgac cgaagtcaca agtccttccc cacagatata 3601 agtcaaatgt ccccttcctc agaacatgaa gtctggcaga cagtcatctc tccagacctc 3661 agccaggtga ccctctctcc agaactcagc cagacaaacc tctctccaga cctcagccac 3721 acgactctct ctccagaact cattcagaga aacctttccc cagccctcgg tcagatgccc 3781 atttctccag acctcagcca tacaaccctt tctccagacc tcagccatac aaccctttct 3841 ttagacctca gccagacaaa cctctctcca gaactcagtc agacaaacct ttccccagcc 3901 ctcggtcaga tgcccctttc tccagacctc agccatacaa ccctttctct agacttcagc 3961 cagacaaacc tctctccaga actcagccat atgactctct ctccagaact cagtcagaca 4021 aacctttccc cagcccttgg tcagatgccc atttctccag acctcagcca tacaaccctt 4081 tctctagact tcagccagac aaacctctct ccagaactca gtcaaacaaa cctttcccca 4141 gccctcggtc agatgcccct ttctccagac cccagccata caaccctttc tctagacctc 4201 agccagacaa acctctctcc agaactcagt cagacaaacc tttccccaga cctcagtgag 4261 atgcccctct ttgcagatct cagtcaaatt ccccttaccc cagacctcga ccagatgaca 4321 ctttctccag accttggtga gacagatctt tccccaaact ttggtcagat gtccctttcc 4381 ccagacctca gccaggtgac tctctctcca gacatcagtg acaccaccct tctcccggat 4441 ctcagccaga tatcacctcc tccagacctt gatcagatat tctacccttc tgaatctagt 4501 cagtcattgc ttcttcaaga atttaatgag tcttttcctt atccagacct tggtcagatg 4561 ccatctcctt catctcctac tctcaatgat acttttctat caaaggaatt taatccactg 4621 gttatagtgg gcctcagtaa agatggtaca gattacattg agatcattcc aaaggaagag 4681 gtccagagca gtgaagatga ctatgctgaa attgattatg tgccctatga tgacccctac 4741 aaaactgatg ttaggacaaa catcaactcc tccagagatc ctgacaacat tgcagcatgg 4801 tacctccgca gcaacaatgg aaacagaaga aattattaca ttgctgctga agaaatatcc 4861 tgggattatt cagaatttgt acaaagggaa acagatattg aagactctga tgatattcca 4921 gaagatacca catataagaa agtagttttt cgaaagtacc tcgacagcac ttttaccaaa 4981 cgtgatcctc gaggggagta tgaagagcat ctcggaattc ttggtcctat tatcagagct 5041 gaagtggatg atgttatcca agttcgtttt aaaaatttag catccagacc gtattctcta 5101 catgcccatg gactttccta tgaaaaatca tcagagggaa agacttatga agatgactct 5161 cctgaatggt ttaaggaaga taatgctgtt cagccaaata gcagttatac ctacgtatgg 5221 catgccactg agcgatcagg gccagaaagt cctggctctg cctgtcgggc ttgggcctac 5281 tactcagctg tgaacccaga aaaagatatt cactcaggct tgataggtcc cctcctaatc 5341 tgccaaaaag gaatactaca taaggacagc aacatgcctg tggacatgag agaatttgtc 5401 ttactattta tgacctttga tgaaaagaag agctggtact atgaaaagaa gtcccgaagt 5461 tcttggagac tcacatcctc agaaatgaaa aaatcccatg agtttcacgc cattaatggg 5521 atgatctaca gcttgcctgg cctgaaaatg tatgagcaag agtgggtgag gttacacctg 5581 ctgaacatag gcggctccca agacattcac gtggttcact ttcacggcca gaccttgctg 5641 gaaaatggca ataaacagca ccagttaggg gtctggcccc ttctgcctgg ttcatttaaa 5701 actcttgaaa tgaaggcatc aaaacctggc tggtggctcc taaacacaga ggttggagaa 5761 aaccagagag cagggatgca aacgccattt cttatcatgg acagagactg taggatgcca 5821 atgggactaa gcactggtat catatctgat tcacagatca aggcttcaga gtttctgggt 5881 tactgggagc ccagattagc aagattaaac aatggtggat cttataatgc ttggagtgta 5941 gaaaaacttg cagcagaatt tgcctctaaa ccttggatcc aggtggacat gcaaaaggaa 6001 gtcataatca cagggatcca gacccaaggt gccaaacact acctgaagtc ctgctatacc 6061 acagagttct atgtagctta cagttccaac cagatcaact ggcagatctt caaagggaac 6121 agcacaagga atgtgatgta ttttaatggc aattcagatg cctctacaat aaaagagaat 6181 cagtttgacc cacctattgt ggctagatat attaggatct ctccaactcg agcctataac 6241 agacctaccc ttcgattgga actgcaaggt tgtgaggtaa atggatgttc cacacccctg 6301 ggtatggaaa atggaaagat agaaaacaag caaatcacag cttcttcgtt taagaaatct 6361 tggtggggag attactggga acccttccgt gcccgtctga atgcccaggg acgtgtgaat 6421 gcctggcaag ccaaggcaaa caacaataag cagtggctag aaattgatct actcaagatc 6481 aagaagataa cggcaattat aacacagggc tgcaagtctc tgtcctctga aatgtatgta 6541 aagagctata ccatccacta cagtgagcag ggagtggaat ggaaaccata caggctgaaa 6601 tcctccatgg tggacaagat ttttgaagga aatactaata ccaaaggaca tgtgaagaac 6661 tttttcaacc ccccaatcat ttccaggttt atccgtgtca ttcctaaaac atggaatcaa 6721 agtattacac ttcgcctgga actctttggc tgtgatattt actagaattg aacattcaaa 6781 aacccctgga agagactctt taagacctca aaccatttag aatgggcaat gtattttacg 6841 ctgtgttaaa tgttaacagt tttccactat ttctctttct tttctattag tgaataaaat 6901 tttatacaa // LOCUS HUMFBRA 2182 bp mRNA PRI 08-NOV-1994 DEFINITION Human fibrinogen alpha-chain mRNA, complete cds. ACCESSION J00127 NID g182423 KEYWORDS alpha-fibrinogen; fibrin; fibrinogen. SOURCE Human liver, cDNA to mRNA (library of Chandra and Woo), clone pHI-alpha-3. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2182) AUTHORS Rixon,M.W., Chan,W.Y., Davie,E.W. and Chung,D.W. TITLE Characterization of a complementary deoxyribonucleic acid coding for the alpha chain of human fibrinogen JOURNAL Biochemistry 22 (13), 3237-3244 (1983) MEDLINE 83283432 COMMENT The initiation codon 'atg' at positions 40-42 could also initiate the translation of alpha-fibrinogen. FEATURES Location/Qualifiers source 1..2182 /organism="Homo sapiens" /db_xref="taxon:9606" /map="4q28" mRNA <1..2182 /note="a-fibrinogen mRNA" sig_peptide 31..87 /gene="FGA" /note="alpha-fibrinogen signal peptide" gene 31..1965 /gene="FGA" CDS 31..1965 /gene="FGA" /note="alpha-fibrinogen precursor" /codon_start=1 /db_xref="GDB:G00-119-129" /db_xref="PID:g182424" /translation="MFSMRIVCLVLSVVGTAWTADSGEGDFLAEGGGVRGPRVVERHQ SACKDSDWPFCSDEDWNYKCPSGCRMKGLIDEVNQDFTNRINKLKNSLFEYQKNNKDS HSLTTNIMEILRGDFSSANNRDNTYNRVSEDLRSRIEVLKRKVIEKVQHIQLLQKNVR AQLVDMKRLEVDIDIKIRSCRGSWSRALAREVDLKDYEDQQKQLEQVIAKDLLPSRDR QHLPLIKMKPVPDLVPGNFKSQLQKVPPEWKALTDMPQMRMELERPGGNEITRGGSTS YGTGSETESPRNPSSAGSWNSGSSGPGSTGNRNPGSSGTGGTATWKPGSSGPGSAGSW NSGSSGTGSTGNQNPGSPRPGSTGTWNPGSSERGSAGHWTSESSVSGSTGQWHSESGS FRPDSPGSGNARPNNPDWGTFEEVSGNVSPGTRREYHTEKLVTSKGDKELRTGKEKVT SGSTTTTRRSCSKTVTKTVIGPDGHKEVTKEVVTSEDGSDCPEAMDLGTLSGIGTLDG FRHRHPDEAAFFDTASTGKTFPGFFSPMLGEFVSETESRGSESGIFTNTKESSSHHPG IAEFPSRGKSSSYSKQFTSSTSYNRGDSTFESKSYKMADEAGSEADHEGTHSTKRGHA KSRPVRGIHTSPLGKPSLSP" mat_peptide 88..1962 /gene="FGA" /note="alpha-fibrinogen" BASE COUNT 638 a 476 c 552 g 516 t ORIGIN 90 bp upstream of PstI site; chromosome 4q31. 1 gtctaggagc cagccccacc cttagaaaag atgttttcca tgaggatcgt ctgcctagtt 61 ctaagtgtgg tgggcacagc atggactgca gatagtggtg aaggtgactt tctagctgaa 121 ggaggaggcg tgcgtggccc aagggttgtg gaaagacatc aatctgcctg caaagattca 181 gactggccct tctgctctga tgaagactgg aactacaaat gcccttctgg ctgcaggatg 241 aaagggttga ttgatgaagt caatcaagat tttacaaaca gaataaataa gctcaaaaat 301 tcactatttg aatatcagaa gaacaataag gattctcatt cgttgaccac taatataatg 361 gaaattttga gaggcgattt ttcctcagcc aataaccgtg ataataccta caaccgagtg 421 tcagaggatc tgagaagcag aattgaagtc ctgaagcgca aagtcataga aaaagtacag 481 catatccagc ttctgcagaa aaatgttaga gctcagttgg ttgatatgaa acgactggag 541 gtggacattg atattaagat ccgatcttgt cgagggtcat ggagtagggc tttagctcgt 601 gaagtagatc tgaaggacta tgaagatcag cagaagcaac ttgaacaggt cattgccaaa 661 gacttacttc cctctagaga taggcaacac ttaccactga taaaaatgaa accagttcca 721 gacttggttc ccggaaattt taagagccag cttcagaagg tacccccaga gtggaaggca 781 ttaacagaca tgccgcagat gagaatggag ttagagagac ctggtggaaa tgagattact 841 cgaggaggct ccacctctta tggaaccgga tcagagacgg aaagccccag gaaccctagc 901 agtgctggaa gctggaactc tgggagctct ggacctggaa gtactggaaa ccgaaaccct 961 gggagctctg ggactggagg gactgcaacc tggaaacctg ggagctctgg acctggaagt 1021 gctggaagct ggaactctgg gagctctgga actggaagta ctggaaacca aaaccctgga 1081 agtcctagac ctggtagtac cggaacctgg aatcctggca gctctgaacg cggaagtgct 1141 gggcactgga cctctgagag ctctgtatct ggtagtactg gacaatggca ctctgaatct 1201 ggaagtttta ggccagatag cccaggctct gggaacgcga ggcctaacaa cccagactgg 1261 ggcacatttg aagaggtgtc aggaaatgta agtccaggga caaggagaga gtaccacaca 1321 gaaaaactgg tcacttctaa aggagataaa gagctcagga ctggtaaaga gaaggtcacc 1381 tctggtagca caaccaccac gcgtcgttca tgctctaaaa ccgttactaa gactgttatt 1441 ggtcctgatg gtcacaaaga agttaccaaa gaagtggtga cctccgaaga tggttctgac 1501 tgtcccgagg caatggattt aggcacattg tctggcatag gtactctgga tgggttccgt 1561 cataggcacc ctgatgaagc tgccttcttc gacactgcct caactggaaa aacattccca 1621 ggtttcttct cacctatgtt aggagagttt gtcagtgaga ctgagtctag gggctcagaa 1681 tctggcatct tcacaaatac aaaggaatcc agttctcatc accctgggat agctgaattc 1741 ccttcccgtg gtaaatcttc aagttacagc aaacaattta ctagtagcac gagttacaac 1801 agaggagact ccacatttga aagcaagagc tataaaatgg cagatgaggc cggaagtgaa 1861 gccgatcatg aaggaacaca tagcaccaag agagggcatg ctaaatctcg ccctgtcaga 1921 ggtatccaca cttctccttt ggggaagcct tccctgtccc cctagactaa gttaaatatt 1981 tctgcacagt gttcccatgg ccccttgcat ttccttctta actctctgtt acacgtcatt 2041 gaaactacac ttttttggtc tgtttttgtg ctagactgta agttccttgg gggcagggcc 2101 tttgtctgtc tcatctctgt attcccaaat gcctaacagt acagagccat gactcaataa 2161 atacatgtta aatggatgaa tg // LOCUS HUMFCREA 591 bp mRNA PRI 02-OCT-1992 DEFINITION Human Fc-epsilon-receptor gamma-chain mRNA, complete cds. ACCESSION M33195 J05285 NID g182487 KEYWORDS Fc-epsilon-receptor gamma-chain protein. SOURCE Human basophil-enriched leukocyte DNA, clone 1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 591) AUTHORS Kuester,H., Thompson,H. and Kinet,J.-P. TITLE Characterization and expression of the gene for the human receptor gamma subunit: Definition of a new gene family JOURNAL J. Biol. Chem. 265, 6448-6452 (1990) MEDLINE 90202928 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Kuester, 26-MAR-1990. FEATURES Location/Qualifiers source 1..591 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 26..78 /note="Fc-epsilon-receptor gamma-chain protein signal peptide" mat_peptide 26..283 /note="Fc-epsilon-receptor gamma-chain protein" CDS 26..286 /note="Fc-epsilon-receptor gamma-chain protein precursor" /codon_start=1 /db_xref="PID:g182488" /translation="MIPAVVLLLLLLVEQAAALGEPQLCYILDAILFLYGIVLTLLYC RLKIQVRKAAITSYEKSDGVYTGLSTRNQETYETLKHEKPPQ" BASE COUNT 146 a 167 c 118 g 160 t ORIGIN 1 cagaacggcc gatctccagc ccaagatgat tccagcagtg gtcttgctct tactcctttt 61 ggttgaacaa gcagcggccc tgggagagcc tcagctctgc tatatcctgg atgccatcct 121 gtttctgtat ggaattgtcc tcaccctcct ctactgtcga ctgaagatcc aagtgcgaaa 181 ggcagctata accagctatg agaaatcaga tggtgtttac acgggcctga gcaccaggaa 241 ccaggagact tacgagactc tgaagcatga gaaaccacca cagtagcttt agaatagatg 301 cggtcatatt cttctttggc ttctggttct tccagccctc atggttggca tcacatatgc 361 ctgcatgcca ttaacaccag ctggccctac ccctataatg atcctgtgtc ctaaattaat 421 atacaccagt ggttcctcct ccctgttaaa gactaatgct cagatgctgt ttacggatat 481 ttatattcta gtctcactct cttgtcccac ccttcttctc ttccccattc ccaactccag 541 ctaaaatatg ggaagggaga acccccaata aaactgccat ggactggact c // LOCUS HUMFE65 2654 bp mRNA PRI 31-DEC-1997 DEFINITION Homo sapiens stat-like protein (Fe65) mRNA, complete cds. ACCESSION L77864 NID g2734082 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2654) AUTHORS Bressler,S.L., Gray,M.D., Sopher,B.L., Hu,Q., Hearn,M.G., Pham,D.G., Dinulos,M.B., Fukuchi,K., Sisodia,S.S., Miller,M.A., Disteche,C.M. and Martin,G.M. TITLE cDNA cloning and chromosome mapping of the human Fe65 gene: interaction of the conserved cytoplasmic domains of the human beta-amyloid precursor protein and its homologues with the mouse Fe65 protein JOURNAL Hum. Mol. Genet. 5 (10), 1589-1598 (1996) MEDLINE 97049965 REFERENCE 2 (bases 1 to 2654) AUTHORS Hu,Q., Kukull,W.A., Bressler,S.L., Gray,M.D., Cam,J.A., Larson,E., Martin,G.M. and Deeb,S.S. TITLE The human FE65 gene: genomic structure and intronic polymorphisms associated with sporadic dementia of the alzheimer type JOURNAL Unpublished REFERENCE 3 (bases 1 to 2654) AUTHORS Hu,Q. TITLE Direct Submission JOURNAL Submitted (31-DEC-1997) Pathology, University of Washington, 1959 N.E. Pacific Ave., Seattle, WA 98195, USA REMARK Sequence update by submitter COMMENT GSDB:S:75831. FEATURES Location/Qualifiers source 1..2654 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /clone_lib="Stratagene #936206" /tissue_type="brain" 5'UTR <1..100 /gene="Fe65" gene <1..2654 /gene="Fe65" CDS 101..2233 /gene="Fe65" /codon_start=1 /product="stat-like protein" /db_xref="PID:g2734083" /translation="MSVPSSLSQSAINANSHGGPALSLPLPLHAAHNQLLNAKLQATA VGPKDLRSAMGEGGGPEPGPANAKWLKEGQNQLRRAATAHRDQNRNVTLTLAEEASQE PEMAPLGPKGLIHLYSELELSAHNAANRGLRGPGLIISTQEQGPDEGEEKAAGEAEEE EEDDDDEEEEEDLSSPPGLPEPLESVEAPPRPQALTDGPREHSKSASLLFGMRNSAAS DEDSSWATLSQGSPSYGSPEDTDSFWNPNAFETDSDLPAGWMRVQDTSGTYYWHIPTG TTQWEPPGRASPSQGSSPQEESQLTWTGFAHGEGFEDGEFWKDEPSDEAPMELGLKEP EEGTLTFPAQSLSPEPLPQEEEKLPPRNTNPGIKCFAVRSLGWVEMTEEELAPGRSSV AVNNCIRQLSYHKNNLHDPMSGGWGEGKDLLLQLEDETLKLVEPQSQALLHAQPIISI RVWGVGRDSGRERDFAYVARDKLTQMLKCHVFRCEAPAKNIATSLHEICSKIMAERRN ARCLVNGLSLDHSKLVDVPFQVEFPAPKNELVQKFQVYYLGNVPVAKPVGVDVINGAL ESVLSSSSREQWTPSHVSVAPATLTILHQQTEAVLGECRVRFLSFLAVGRDVHTFAFI MAAGPASFCCHMFWCEPNAASLSEAVQAACMLRYQKCLDARSQASTSCLPAPPAESVA RRVGWTVRRGVQSLWGSLKPKRLGAHTP" 3'UTR 2234..2643 /gene="Fe65" polyA_site 2643 /gene="Fe65" BASE COUNT 580 a 801 c 797 g 476 t ORIGIN 1 atgttgtgat ggagaagccg cggcggagcc cgaaccccgc agcctgagcc acctccgtca 61 tctgggcccg gggcctcacc gcgcaggagc tgccaaggcc atgtctgttc catcatcact 121 gagccagtcg gccattaatg ccaacagcca cggaggcccc gcactgagcc tacccctgcc 181 tctgcacgct gcccacaacc agctgctcaa cgccaagctg caggccacag ctgtgggacc 241 caaggacctg cgcagcgcca tgggggaggg tggtgggcct gagccaggcc ctgccaatgc 301 caagtggcta aaagagggcc agaaccagct ccggcgggcc gccacggccc accgtgacca 361 gaatcgcaat gtgaccttga ccttggcgga ggaggccagc caggagcctg agatggcacc 421 cttgggcccc aaaggcctga tacacctgta ctctgagctg gagctctcag ctcacaacgc 481 agccaaccga ggcctacgag gacctggcct gatcatcagc actcaagagc aggggccaga 541 tgagggagag gagaaggcgg ccggggaggc cgaggaggag gaggaggatg atgatgatga 601 agaggaggag gaggacttat cttctccccc agggctgcct gagcccctgg agagtgtgga 661 ggcccccccc aggccccaag cccttacaga tggcccccgg gaacacagca agagtgccag 721 cctcctgttt ggcatgcgga acagtgcagc cagtgatgag gactcaagct gggctacctt 781 atctcagggc agcccctcct atggctcccc agaggacaca gattccttct ggaaccccaa 841 cgccttcgag acggattccg acctgccggc tggatggatg agggtccagg acacctcagg 901 gacctattac tggcacatcc caacagggac cacccagtgg gaaccccccg gccgggcctc 961 cccctcacag gggagcagcc cccaagagga gtcccagctc acctggacag gttttgctca 1021 tggagaaggc tttgaggatg gagaattttg gaaggatgaa cccagtgatg aggccccaat 1081 ggagctggga ctgaaggaac ctgaggaggg gacgttgacc ttcccagctc agagcctcag 1141 cccagagccg ttgccccaag aggaggagaa gcttccccca cggaatacca acccagggat 1201 caagtgtttc gccgtgcgct ccctaggctg ggtagagatg accgaggagg agctggcccc 1261 tggacgcagc agtgtggcag tcaacaattg catccgtcag ctctcttacc acaaaaacaa 1321 cctgcatgac cccatgtctg ggggctgggg ggaaggaaag gatctgctac tgcagctgga 1381 ggatgagaca ctaaagctag tggagccaca gagccaggca ctgctgcacg cccaacccat 1441 catcagcatc cgcgtgtggg gcgtcgggcg ggacagtgga agagagaggg actttgccta 1501 cgtagctcgt gataagctga cccagatgct caagtgccac gtgtttcgct gtgaggcacc 1561 tgccaagaac atcgccacca gcctgcatga gatctgctct aagatcatgg ccgaacggcg 1621 taatgcccgc tgcttggtaa atggactctc cctggaccac tctaaacttg tggatgtccc 1681 tttccaagtg gaattcccag cgcctaagaa tgagttggtc cagaagttcc aagtctatta 1741 cctggggaat gtacctgttg ctaaacctgt tggggtagat gtgattaatg gggccctcga 1801 gtcagtcctg tcctccagca gccgtgaaca atggacccca agtcatgtca gtgtggcccc 1861 tgctaccctc accatcttgc accagcagac agaggcagtg ctgggagagt gtcgggtgcg 1921 tttcctctcc ttcctggccg tgggcagaga tgtccacacg tttgcattca tcatggctgc 1981 cggcccagcc tccttctgct gccacatgtt ctggtgcgag cccaatgctg ccagcctctc 2041 agaggctgtg caggctgcgt gcatgcttcg ctaccagaag tgtctggatg cccgttccca 2101 ggcctccacc tcctgcctcc cagcaccccc tgctgagtct gtggcacggc gtgtagggtg 2161 gactgtccgc aggggtgttc agtcgctgtg gggctccctg aagcccaaac ggctgggggc 2221 ccatacccca tgaagaagcc cccaccttcc cttccacctg attgtgttgg gccccaggga 2281 actaaagggt gtgggtcagg gaggggtcta gaggctattc ctaggcctca ggcctcccaa 2341 atatgcccct ccccagtagg tacggttccc tgcctaggag ctggggaggg agagatctaa 2401 tcccttcaag gaagtgataa cactggagtg gtaacaagag gagcaggaag caaggccagc 2461 cctggttctc catccccatg tgtttcaggt ggaacaggag gaactggtcc aggccaggcc 2521 tcatcctcct ggacccagca ggggcagaag gaggaaggga ctggtccagg catgggtccc 2581 ttccccctgc tccatgggca cctctgctgt attgatatca ctaataaagt ctgtctgcac 2641 tgcaaaaaaa aaaa // LOCUS HUMFEN1A 1144 bp mRNA PRI 06-MAR-1996 DEFINITION Homo sapiens endonuclease (FEN-1) mRNA, complete cds. ACCESSION L37374 NID g642089 KEYWORDS endonuclease. SOURCE Homo sapiens (clone library: Clontech) mature T cell lymphocyte cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1144) AUTHORS Hiraoka,L.R., Harrington,J.J., Gerhard,D.S., Lieber,M.R. and Hsieh,C.L. TITLE Sequence of human FEN-1, a structure-specific endonuclease, and chromosomal localization of the gene (FEN1) in mouse and human JOURNAL Genomics 25 (1), 220-225 (1995) MEDLINE 95293376 FEATURES Location/Qualifiers source 1..1144 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T cell" /dev_stage="mature T cell" /clone_lib="Clontech" /tissue_type="lymphocyte" /map="11q12-13 or 1p22.2" /chromosome="11 or 1" CDS 1..1143 /codon_start=1 /function="structure-specific endonuclease" /product="endonuclease" /db_xref="PID:g642090" /translation="MGIQGLAKLIADVAPSAIRENDIKSYFGRKVAIDASMSIYQFLI AVRQGGDVLQNEEGETTSHLMGMFYRTIRMMENGIKPVYVFDGKPPQLKSGELAKRSE RRAEAEKQLQQAQAAGAEQEVEKFTKRLVKVTKQHNDECKHLLSLMGIPYLDAPSEAE ASCAALVKAGKVYAAATEDMDCLTFGSPVLMRHLTASEAKKLPIQEFHLSRILQELGL NQEQFVDLCILLGSDYCESIRGIGPKRAVDLIQKHKSIEEIVRRLDPNKYPVPENWLH KEAHQLFLEPEVLDPESVELKWSEPNEEELIKFMCGEKQFSEERIRSGVKRLSKSRQG STQGRLDDFFKVTGSLSSAKRKEPEPKGSTKKKAKTGAAGKFKRGK" BASE COUNT 292 a 283 c 353 g 216 t ORIGIN 1 atgggaattc aaggcctggc caaactaatt gctgatgtgg cccccagtgc catccgggag 61 aatgacatca agagctactt tggccgtaag gtggccattg atgcctctat gagcatttat 121 cagttcctga ttgctgttcg ccagggtggg gatgtgctgc agaatgagga gggtgagacc 181 accagccacc tgatgggcat gttctaccgc accattcgca tgatggagaa cggcatcaag 241 cccgtgtatg tctttgatgg caagccgcca cagctcaagt caggcgagct ggccaaacgc 301 agtgagcggc gggctgaggc agagaagcag ctgcagcagg ctcaggctgc tggggccgag 361 caggaggtgg aaaaattcac taagcggctg gtgaaggtca ctaagcagca caatgatgag 421 tgcaaacatc tgctgagcct catgggcatc ccttatcttg atgcacccag tgaggcagag 481 gccagctgtg ctgccctggt gaaggctggc aaagtctatg ctgcggctac cgaggacatg 541 gactgcctca ccttcggcag ccctgtgcta atgcgacacc tgactgccag tgaagccaaa 601 aagctgccaa tccaggaatt ccacctgagc cggattctgc aggagctggg cctgaaccag 661 gaacagtttg tggatctgtg catcctgcta ggcagtgact actgtgagag tatccggggt 721 attgggccca agcgggctgt ggacctcatc cagaagcaca agagcatcga ggagatcgtg 781 cggcgacttg accccaacaa gtaccctgtg ccagaaaatt ggctccacaa ggaggctcac 841 cagctcttct tggaacctga ggtgctggac ccagagtctg tggagctgaa gtggagcgag 901 ccaaatgaag aagagctgat caagttcatg tgtggtgaaa agcagttctc tgaggagcga 961 atccgcagtg gggtcaagag gctgagtaag agccgccaag gcagcaccca gggccgcctg 1021 gatgatttct tcaaggtgac cggctcactc tcttcagcta agcgcaagga gccagaaccc 1081 aagggatcca ctaagaagaa ggcaaagact ggggcagcag ggaagtttaa aaggggaaaa 1141 taaa // LOCUS HUMFERC 2443 bp mRNA PRI 16-JAN-1992 DEFINITION Human mRNA for ferrochelatase (EC 4.99.1.1). ACCESSION D00726 NID g219655 KEYWORDS ferrochelatase. SOURCE Human placenta, cDNA to mRNA, clones lambda HF[1-2 and 2-1]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2443) AUTHORS Nakahashi,Y., Taketani,S., Okuda,M., Inoue,K. and Tokunaga,R. TITLE Molecular cloning and sequence analysis of cDNA encoding human ferrochelatase JOURNAL Biochemical and Biophysical Research Communication 173, 748-755 (1990) COMMENT These data kindly submitted in computer readable form by: Shigeru Taketani Department of Hygiene Kansai Medical University 1 Fumizonocho, Moriguchi Osaka 570 Japan Phone: 06-992-1001 x2504 Fax: 06-992-0609 Northern blot analysis showed two mRNAs for ferrochelatase. FEATURES Location/Qualifiers source 1..2443 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 30..191 /note="ferrochelatase signal peptide" CDS 30..1301 /note="ferrochelatase precursor" /codon_start=1 /db_xref="PID:d1001086" /db_xref="PID:g219656" /translation="MRSLGANMAAALRAAGVLLRDPLASSSWRVCQPWRWKSGAAAAA VTTETAQHAQGAKPQVQPQKRKPKTGILMLNMGGPETLGDVHDFLLRLFLDQDLMTLP IQNKLAPFIAKRRTPKIQEQYRRIGGGSPIKIWTSKQGEGMVKLLDELSPNTAPHKYY IGFRYVHPLTEEAIEEMERDGLERAIAFTQYPQYSCSTTGSSLNAIYRYYNQVGRKPT MKWSTIDRWPTHHLLIQCFADHILKELDHFPLEKRSEVVILFSAHSLPMSVVNRGDPY PQEVSATVQKVMERLEYCNPYRLVWQSKVGPMPWLGPQTDESIKGLCERGRKNILLVP IAFTSDHIETLYELDIEYSQVLAKECGVENIRRAESLNGNPLFSKALADLVHSHIQSN ELCSKQLTLSCPLCVNPVCRETKSFFTSQQL" mat_peptide 192..1298 /note="ferrochelatase" polyA_signal 1559..1564 /note="polyadenylation signal" polyA_site 1571 /note="polyadenylation site" polyA_signal 1967..1972 /note="Polyadenylation signal (alt.)" polyA_site 2443 /note="polyadenylation site (alt.)" BASE COUNT 652 a 557 c 581 g 653 t ORIGIN 1 gggcgggtcg ggccgaggct gcccaggcaa tgcgttcact cggcgcaaac atggctgcgg 61 ccctgcgcgc cgcgggcgtc ctgctccgcg atccgctggc atccagcagc tggagggtct 121 gtcagccatg gaggtggaag tcaggtgcag ctgcagcggc cgtcaccaca gaaacagccc 181 agcatgccca gggtgcaaaa cctcaagttc aaccgcagaa gaggaagccg aaaactggaa 241 tattaatgct aaacatggga ggccctgaaa ctcttggaga tgttcacgac ttccttctga 301 gactcttctt ggaccaagac ctcatgacac ttcctattca gaataagctg gcaccattca 361 tcgccaaacg ccgaaccccc aagattcaag agcagtaccg caggattgga ggcggatccc 421 ccatcaagat atggacttcc aagcagggag agggcatggt gaagctgctg gatgaattgt 481 cccccaacac agcccctcac aaatactata ttggatttcg gtacgtccat cctttaacag 541 aagaagcaat tgaagagatg gagagagatg gcctagaaag ggctattgct ttcacacagt 601 atccacagta cagctgctcc accacaggca gcagcttaaa tgccatttac agatactata 661 atcaagtggg acggaagccc acgatgaagt ggagcactat tgacaggtgg cccacacatc 721 acctcctcat ccagtgcttt gcagatcata ttctaaagga actggaccat tttccacttg 781 agaagagaag cgaggtggtc attctgtttt ctgctcactc actgccgatg tctgtggtca 841 acagaggcga cccatatcct caggaggtaa gcgccactgt ccaaaaagtc atggaaaggc 901 tggagtactg caacccctac cgactggtgt ggcaatccaa ggttggtccg atgccctggt 961 tgggtcctca aacagacgaa tctatcaaag ggctttgtga gagggggagg aagaatatcc 1021 tcttggttcc gatagcattt accagtgacc atattgaaac gctgtatgag ctggacatcg 1081 agtactctca agttttagcc aaggagtgtg gagttgaaaa catcagaaga gctgagtctc 1141 ttaatggaaa tccattgttc tctaaggccc tggccgactt ggtgcattca cacatccagt 1201 caaacgagct gtgttccaag cagctgaccc tgagctgtcc gctctgtgtc aatcctgtct 1261 gcagggagac taaatccttc ttcaccagcc agcagctgtg acccccgccg gtggaccccg 1321 tggcgttagg caaatgccca acctccagat acctccgatg tggagagggt gttatttaga 1381 gatcaaggaa ggaagtcatc cttccttgat atatatacag cctttgggta caaattgtgt 1441 ggtttcttga ggattggact cttgatggat ttctattttt atataactat acagtaagca 1501 tttgtatttt ctctctctag gtataagtta ctagtttgga atgtccatca ggacctttaa 1561 taaatgaggc taaaaatttg tcttatgaga cacacctatt taagcacaga ttttggcttt 1621 attgcccaaa accctcccga aagggtacgg agagtcccct ctgtgggctg gcagtgtgaa 1681 tgagatctgt ttagtctcgt gcatatagtt gctgtttttt aaatgaacac agttgagtat 1741 ttgaagtgaa tttgaaaaag aaatgttact taatctttcc ctaagcccat gggttacaga 1801 atgctaggga ggcaatttgg ttacctgcaa tggctgcttt tgccagcgag gccaccattc 1861 attggtcatc ttggtatttg tgctgtgaat ctcactttcc tcaatgtaaa aaggaatcaa 1921 gtatggattt cagaggtgct cttagattcc ccatacaccc aagggtaata aacgtgtaca 1981 agtacagtgt tcatgatacg tgccttggtg ggagtccgtg gtgccacagg gaaggggctc 2041 ccactgcttc tggtctccag ggacagtgct gctggaaagg ctagtgatga gcttcaccct 2101 ggagctcctc ccgggacctt gcaagcctct ccatccagca tcttctctat cttagttgaa 2161 tgccttcttt ctgaacattt gttttaagaa ttattttata aagtcaacaa tactttgctt 2221 gaattctttc ttaatttacg attttttatt ataaaaaagt atagtgatac aatgggacat 2281 gtgaagaata cagaaaagta accactttaa tgcaataact gttatcataa tattgtattt 2341 cgtggtagtc cttgcctgta gatattttta atgccattta atgccattgt caccttggat 2401 ttatgagtga aaagtgtttc taaaaatata gaaataatgt cag // LOCUS HUMFERH 790 bp mRNA PRI 08-NOV-1994 DEFINITION Human ferritin H chain mRNA, complete cds. ACCESSION M11146 NID g182504 KEYWORDS ferritin. SOURCE Human liver, cDNA to mRNA, clone pHF16. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 790) AUTHORS Boyd,D., Vecoli,C., Belcher,D.M., Jain,S.K. and Drysdale,J.W. TITLE Structural and functional relationships of human ferritin H and L chains deduced from cDNA clones JOURNAL J. Biol. Chem. 260 (21), 11755-11761 (1985) MEDLINE 86008223 COMMENT Draft entry, computer-readable and printed copy of sequences in [1] kindly provided by J.Drysdale, 29-JAN-1986. FEATURES Location/Qualifiers source 1..790 /organism="Homo sapiens" /db_xref="taxon:9606" /map="11q13" mRNA <1..790 /note="ferritin H chain mRNA" gene 78..629 /gene="FTH1" CDS 78..629 /gene="FTH1" /note="ferritin heavy chain" /codon_start=1 /db_xref="GDB:G00-120-617" /db_xref="PID:g182505" /translation="MTTASTSQVRQNYHQDSEAAINRQINLELYASYVYLSMSYYFDR DDVALKNFAKYFLHQSHEEREHAEKLMKLQNQRGGRIFLQDIKKPDCDDWESGLNAME CALHLEKNVNQSLLELHKLATDKNDPHLCDFIETHYLNEQVKAIKELGDHVTNLRKMG APESGLAEYLFDKHTLGDSDNES" mat_peptide 81..626 /gene="FTH1" /note="ferritin heavy chain mature peptide; G00-120-617" BASE COUNT 207 a 221 c 184 g 178 t ORIGIN 148 bp upstream of Sau3A site. 1 actgccccaa ggcccccgcc gccgctccag cgccgcgcag ccaccgccgc cgccgccgcc 61 tctccttagt cgccgccatg acgaccgcgt ccacctcgca ggtgcgccag aactaccacc 121 aggactcaga ggccgccatc aaccgccaga tcaacctgga gctctacgcc tcctacgttt 181 acctgtccat gtcttactac tttgaccgcg atgatgtggc tttgaagaac tttgccaaat 241 actttcttca ccaatctcat gaggagaggg aacatgctga gaaactgatg aagctgcaga 301 accaacgagg tggccgaatc ttccttcagg atatcaagaa accagactgt gatgactggg 361 agagcgggct gaatgcaatg gagtgtgcat tacatttgga aaaaaatgtg aatcagtcac 421 tactggaact gcacaaactg gccactgaca aaaatgaccc ccatttgtgt gacttcattg 481 agacacatta cctgaatgag caggtgaaag ccatcaaaga attgggtgac cacgtgacca 541 acttgcgcaa gatgggagcg cccgaatctg gcttggcgga atatctcttt gacaagcaca 601 ccctgggaga cagtgataat gaaagctaag cctcgggcta atttccccat agccgtgggg 661 tgacttccct ggtcaccaag gcagtgcatg catgttgggg tttcctttac cttttctata 721 agttgtacca aaacatccac ttaagttctt tgatttgtac cattccttca aataaagaaa 781 tttggtaccc // LOCUS HUMFERL 822 bp mRNA PRI 08-NOV-1994 DEFINITION Human ferritin L chain mRNA, complete cds. ACCESSION M11147 NID g182513 KEYWORDS ferritin. SOURCE Human liver, cDNA to mRNA, clone pLF108. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 822) AUTHORS Boyd,D., Vecoli,C., Belcher,D.M., Jain,S.K. and Drysdale,J.W. TITLE Structural and functional relationships of human ferritin H and L chains deduced from cDNA clones JOURNAL J. Biol. Chem. 260 (21), 11755-11761 (1985) MEDLINE 86008223 COMMENT Draft entry, computer-readable and printed copy of sequences in [1] kindly provided by J.Drysdale, 29-JAN-1986. FEATURES Location/Qualifiers source 1..822 /organism="Homo sapiens" /db_xref="taxon:9606" /map="19q13.3-q13.4" mRNA <1..822 /note="ferritin L chain mRNA" gene 152..679 /gene="FTL" CDS 152..679 /gene="FTL" /note="ferritin light chain" /codon_start=1 /db_xref="GDB:G00-119-234" /db_xref="PID:g182514" /translation="MSSQIRQNYSTDVEAAVNSLVNLYLQASYTYLSLGFYFDRDDVA LEGVSHFFRELAEEKREGYERLLKMQNQRGGRALFQDIKKPAEDEWGKTPDAMKAAMA LEKKLNQALLDLHALGSARTDPHLCDFLETHFLDEEVKLIKKMGDHLTNLHRLGGPEA GLGEYLFERLTLKHD" mat_peptide 155..676 /gene="FTL" /note="ferritin light chain mature peptide; G00-119-234" BASE COUNT 180 a 256 c 204 g 182 t ORIGIN 8 bp upstream of Sau3A site. 1 acggaacaga tccggggact ctcttccagc ctccgaccgc cctccgattt cctctccgct 61 tgcaacctcc gggaccatct tctcggccat ctcctgcttc tgggacctgc cagcaccgtt 121 tttgtggtta gctccttctt gccaaccaac catgagctcc cagattcgtc agaattattc 181 caccgacgtg gaggcagccg tcaacagcct ggtcaatttg tacctgcagg cctcctacac 241 ctacctctct ctgggcttct atttcgaccg cgatgatgtg gctctggaag gcgtgagcca 301 cttcttccgc gaactggccg aggagaagcg cgagggctac gagcgtctcc tgaagatgca 361 aaaccagcgt ggcggccgcg ctctcttcca ggacatcaag aagccagctg aagatgagtg 421 gggtaaaacc ccagacgcca tgaaagctgc catggccctg gagaaaaagc tgaaccaggc 481 ccttttggat cttcatgccc tgggttctgc ccgcacggac ccccatctct gtgacttcct 541 ggagactcac ttcctagatg aggaagtgaa gcttatcaag aagatgggtg accacctgac 601 caacctccac aggctgggtg gcccggaggc tgggctgggc gagtatctct tcgaaaggct 661 cactctcaag cacgactaag agccttctga gcccagcgac ttctgaaggg ccccttgcaa 721 agtaataggg cttctgccta agcctctccc tccagccaat aggcagcttt cttaactatc 781 ctaacaagcc ttggaccaaa tggaaataaa gctttttgat gc // LOCUS HUMFGF 1420 bp mRNA PRI 09-SEP-1993 DEFINITION Human mRNA for FGF-9, complete cds. ACCESSION D14838 NID g391718 KEYWORDS fibroblast growth factor; human FGF-9. SOURCE Homo sapiens (strain NMC-G1) astrocytoma cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1420) AUTHORS Miyamoto,M., Naruo,K., Seko,C., Matsumoto,S., Kondo,T. and Kurokawa,T. TITLE Molecular cloning of a novel cytokine cDNA encoding the ninth member of the fibroblast growth factor family, which has a unique secretion property JOURNAL Mol. Cell. Biol. 13 (7), 4251-4259 (1993) MEDLINE 93309459 REFERENCE 2 (bases 1 to 1420) AUTHORS Miyamoto,M. TITLE Direct Submission JOURNAL Submitted (29-MAR-1993) to the DDBJ/EMBL/GenBank databases. Masaaki Miyamoto, Kansai Medical University, Department of Microbiology; 10-15 Fumizono-cho, Moriguchi, Osaka 570, Japan (Tel:075-712-5406, Fax:075-712-5492) COMMENT Submitted (29-MAR-1993) to DDBJ by: Masaaki Miyamoto Okayama Cell Switching Project 103-5 Pasteur Building Tanaka-Monzen-cho Sakyo-ku, Kyoto 606 Japan Phone: 075-712-5406 Fax: 075-712-5492. FEATURES Location/Qualifiers source 1..1420 /organism="Homo sapiens" /strain="NMC-G1" /db_xref="taxon:9606" /cell_type="Astrocytoma" CDS 179..805 /codon_start=1 /product="human FGF-9" /db_xref="PID:d1004083" /db_xref="PID:g391719" /translation="MAPLGEVGNYFGVQDAVPFGNVPVLPVDSPVLLSDHLGQSEAGG LPRGPAVTDLDHLKGILRRRQLYCRTGFHLEIFPNGTIQGTRKDHSRFGILEFISIAV GLVSIRGVDSGLYLGMNEKGELYGSEKLTQECVFREQFEENWYNTYSSNLYKHVDTGR RYYVALNKDGTPREGTRTKRHQKFTHFLPRPVDPDKVPELYKDILSQS" polyA_signal 1360..1365 polyA_signal 1370..1375 polyA_signal 1377..1382 polyA_signal 1383..1388 BASE COUNT 402 a 275 c 361 g 382 t ORIGIN 1 tgaaacagca gattactttt atttatgcat ttaatggatt gaagaaaaga accttttttt 61 ttctctctct ctctgcaact gcagtaaggg aggggagttg gatatacctc gcctaatatc 121 tcctgggttg acaccatcat tattgtttat tcttgtgctc caaaagccga gtcctctgat 181 ggctccctta ggtgaagttg ggaactattt cggtgtgcag gatgcggtac cgtttgggaa 241 tgtgcccgtg ttgccggtgg acagcccggt tttgttaagt gaccacctgg gtcagtccga 301 agcagggggg ctccccaggg gacccgcagt cacggacttg gatcatttaa aggggattct 361 caggcggagg cagctatact gcaggactgg atttcactta gaaatcttcc ccaatggtac 421 tatccaggga accaggaaag accacagccg atttggcatt ctggaattta tcagtatagc 481 agtgggcctg gtcagcattc gaggcgtgga cagtggactc tacctcggga tgaatgagaa 541 gggggagctg tatggatcag aaaaactaac ccaagagtgt gtattcagag aacagttcga 601 agaaaactgg tataatacgt actcgtcaaa cctatataag cacgtggaca ctggaaggcg 661 atactatgtt gcattaaata aagatgggac cccgagagaa gggactagga ctaaacggca 721 ccagaaattc acacattttt tacctagacc agtggacccc gacaaagtac ctgaactgta 781 taaggatatt ctaagccaaa gttgacaaag acaatttctt cacttgagcc cttaaaaaag 841 taaccactat aaaggtttca cgcggtgggt tcttattgat tcgctgtgtc atcacatcag 901 ctccactgtt gccaaacttt gtcgcatgca taatgtatga tggaggcttg gatgggaata 961 tgctgatttt gttctgcact taaaggcttc tcctcctgga gggctgccta gggccacttg 1021 cttgatttat catgagagaa gaggagagag agagagactg agcgctagga gtgtgtgtat 1081 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt atgtgtgtag cgggagatgt gggcggagcg 1141 agagcaaaag gactgcggcc tgatgcatgc tggaaaaaag acacgctttt catttctgat 1201 cagttgtact tcatcctata tcagcacagc tgccatactt cgacttatca ggattctggc 1261 tggtggcctg cgcgagggtg cagtcttact taaaagactt tcagttaatt ctcactggta 1321 tcatcgcagt gaacttaaag caaagacctc ttagtaaaaa ataaaaaaaa ataaaaaata 1381 aaaataaaaa aagttaaatt tatttataga aattccaaaa // LOCUS HUMFGFB 3877 bp mRNA PRI 08-NOV-1994 DEFINITION Human basic fibroblast growth factor (FGF) mRNA, complete cds. ACCESSION M27968 NID g182562 KEYWORDS basic fibroblast growth factor. SOURCE Human foreskin fibroblast, cDNA to mRNA, clone pTB627. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3877) AUTHORS Kurokawa,T., Sasada,R., Iwane,M. and Igarashi,K. TITLE Cloning and expression of cDNA encoding human basic fibroblast growth factor JOURNAL FEBS Lett. 213 (1), 189-194 (1987) MEDLINE 87162468 FEATURES Location/Qualifiers source 1..3877 /organism="Homo sapiens" /db_xref="taxon:9606" /map="4q25-q27" mRNA <1..3877 /note="FGFB mRNA" gene 340..807 /gene="FGF2" CDS 340..807 /gene="FGF2" /note="basic fibroblast growth factor" /codon_start=1 /db_xref="GDB:G00-119-910" /db_xref="PID:g182563" /translation="MAAGSITTLPALPEDGGSGAFPPGHFKDPKRLYCKNGGFFLRIH PDGRVDGVREKSDPHIKLQLQAEERGVVSIKGVCANRYLAMKEDGRLLASKCVTDECF FFERLESNNYNTYRSRKYTSWYVALKRTGQYKLGSKTGPGQKAILFLPMSAKS" BASE COUNT 1117 a 762 c 817 g 1181 t ORIGIN 36 bp upstream of BamHI site. 1 gccagattag cggacgcgtg cccgcggttg caacgggatc ccgggcgctg cagcttggga 61 ggcggctctc cccaggcggc gtccgcggag acaaccatcc gtgaacccca ggtcccggcg 121 cgccggctcg ccgcgcacca ggggccggcg gacagaagag cggccgagcg gctcgaggct 181 gggggacccg gcgcggccgc gcgctgccgg gcgggaggct ggggggccgg ggcggggccg 241 tgccccggag cgggtcggag gccggggccg gggccggggg acggcggctc cccgcgcggc 301 tccagcggct cggggatccc ggccgggccc cgcaggacca tggcagccgg gagcatcacc 361 acgctgcccg ccttgcccga ggatggcggc agcggcgcct tcccgcccgg ccacttcaag 421 gaccccaagc ggctgtactg caaaaacggg ggcttcttcc tgcgcatcca ccccgacggc 481 cgagttgacg gggtccggga gaagagcgac cctcacatca agctacaact tcaagcagaa 541 gagagaggag ttgtgtctat caaaggagtg tgtgctaacc gttacctggc tatgaaggaa 601 gatggaagat tactggcttc taaatgtgtt acggatgagt gtttcttttt tgaacgattg 661 gaatctaata actacaatac ttaccggtca aggaaataca ccagttggta tgtggcactg 721 aaacgaactg ggcagtataa acttggatcc aaaacaggac ctgggcagaa agctatactt 781 tttcttccaa tgtctgctaa gagctgattt taatggccac atctaatctc atttcacatg 841 aaagaagaag tatattttag aaatttgtta atgagagtaa aagaaaataa atgtgtaaag 901 ctcagtttgg ataattggtc aaacaatttt ttatccagta gtaaaatatg taaccattgt 961 cccagtaaag aaaaataaca aaagttgtaa aatgtatatt ctccctttta tattgcatct 1021 gctgttaccc agtgaagctt acctagagca atgatctttt tcacgcattt gctttattcg 1081 aaaagaggct tttaaaatgt gcatgtttag aaacaaaatt tcttcatgga aatcatcata 1141 tacattagaa aatcacagtc agatgtttaa tcaatccaaa atgtccacta tttcttatgt 1201 cattcgttag tctacatgtt tctaaacata taaatgtgaa tttaatcaat tcctttcata 1261 gttttataat tctctggcag ttccttatga tagagtttat aaaacagtcc tgtgtaaact 1321 gctggaagtt cttccacagt caggtcaatt ttgtcaaacc cttctctgta cccatacagc 1381 agcagcctag caactctgct ggtgatggga gttgtatttt cagtcttcgc caggtcattg 1441 agatccatcc actcacatct taagcattct tcctggcaaa aatttatggt gaatgaatat 1501 ggctttaggc ggcagatgat atacatatct gacttcccaa aagctccagg atttgtgtgc 1561 tgttgccgaa tactcaggac ggacctgaat tctgatttta taccagtctc ttcaaaacct 1621 tctcgaaccg ctgtgtctcc tacgtaaaaa aagagatgta caaatcaata ataattacac 1681 ttttagaaac tgtatcatca aagattttca gttaaagtag cattatgtaa aggctcaaaa 1741 cattacccta acaaagtaaa gttttcaata caaattcttt gccttgtgga tatcaagaaa 1801 tcccaaaata ttttcttacc actgtaaatt caagaagctt ttgaaatgct gaatatttct 1861 ttggctgcta cttggaggct tatctacctg tacatttttg gggtcagctc tttttaactt 1921 cttgctgctg tttttcccaa aaggtaaaaa tatagattga aaagttaaaa cattttgcat 1981 ggctgcagtt cctttgtttc ttgagataag attccaaaga acttagattt atttcttcaa 2041 caccgaaatg ctggaggtgt ttgatcagtt ttcaagaaac ttggaatata aataatttta 2101 taattcaaca aaggttttca cattttataa ggttgatttt tcaattaaat gcaaatttat 2161 gtggcaggat ttttattgcc attaacatat ttttgtggct gctttttcta cacatccaga 2221 tggtccctct aactgggctt tctctaattt tgtgatgttc tgtcattgtc tcccaaagta 2281 tttaggagaa gccctttaaa aagctgcctt cctctaccac tttgctgaaa gcttcacaat 2341 tgtcacagac aaagattttt gttccaatac tcgttttgcc tctattttac ttgtttgtca 2401 aatagtaaat gatatttgcc cttgcagtaa ttctactggt gaaaaacatg caaagaagag 2461 gaagtcacag aaacatgtct caattcccat gtgctgtgac tgtagactgt cttaccatag 2521 actgtcttac ccatcccctg gatatgctct tgttttttcc ctctaatagc tatggaaaga 2581 tgcatagaaa gagtataatg ttttaaaaca taaggcattc gtctgccatt tttcaattac 2641 atgctgactt cccttacaat tgagatttgc ccataggtta aacatggtta gaaacaactg 2701 aaagcataaa agaaaaatct aggccgggtg cagtggctca tgcccatatt ccctgcactt 2761 tgggaggcca aagcaggagg atcgcttgag cccaggagtt caagaccaac ctggtgaaac 2821 cccgtctcta caaaaaaaca caaaaaatag ccaggcatgg tggcgtgtac atgtggtctc 2881 agatacttgg gaggctgagg tgggagggtt gatcacttga ggctgagagg tcaaggttac 2941 agtgagccat aatcgtgcca ctgcagtcca gcctaggcaa cagagtgaga ctttgtctca 3001 aaaaaagaga aattttcctt aataagaaaa gtaattttta ctctgatgtg caatacattt 3061 gttattaaat ttattattta agatggtagc actagtctta aattgtataa aatatcccct 3121 aacatgttta aatgtccatt tttattcatt atgctttgaa aaataattat ggggaaatac 3181 atgtttgtta ttaaatttat tattaaagat agtagcacta gtcttaaatt tgatataaca 3241 tctcctaact tgtttaaatg tccattttta ttctttatgt ttgaaaataa attatgggga 3301 tcctatttag ctcttagtac cactaatcaa aagttcggca tgtagctcat gatctatgct 3361 gtttctatgt cgtggaagca ccggatgggg gtagtgagca aatctgccct gctcagcagt 3421 caccatagca gctgactgaa aatcagcact gcctgagtag ttttgatcag tttaacttga 3481 atcactaact gactgaaaat tgaatgggca aataagtgct tttgtctcca gagtatgcgg 3541 gagacccttc cacctcaaga tggatatttc ttccccaagg atttcaagat gaattgaaat 3601 ttttaatcaa gatagtgtgc tttattctgt tgtatttttt attattttaa tatactgtaa 3661 gccaaactga aataacattt gctgttttat aggtttgaag acataggaaa aactaagagg 3721 ttttattttt gtttttgctg atgaagagat atgtttaaat actgttgtat tgttttgttt 3781 agttacagga caataatgaa atggagttta tatttgttat ttctattttg ttatatttaa 3841 taatagaatt agattgaaat aaaatataat gggaaat // LOCUS HUMFGFR3 2520 bp mRNA PRI 08-NOV-1994 DEFINITION Human fibroblast growth factor receptor (FGFR3) mRNA, complete cds. ACCESSION M58051 NID g182568 KEYWORDS fibroblast growth factor receptor. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2520) AUTHORS Keegan,K., Johnson,D.E., Williams,L.T. and Hayman,M.J. TITLE Isolation of an additional member of the fibroblast growth factor receptor family, FGFR-3 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88 (4), 1095-1099 (1991) MEDLINE 91142118 FEATURES Location/Qualifiers source 1..2520 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="CML K562" /map="4p16.3" sig_peptide 40..105 /gene="FGFR3" /note="G00-127-526" /product="fibroblast growth factor receptor" CDS 40..2460 /gene="FGFR3" /codon_start=1 /db_xref="GDB:G00-127-526" /product="fibroblast growth factor receptor" /db_xref="PID:g182569" /translation="MGAPACALALCVAVAIVAGASSESLGTEQRVVGRAAEVPGPEPG QQEQLVFGSGDAVELSCPPPGGGPMGPTVWVKDGTGLVPSERVLVGPQRLQVLNASHE DSGAYSCRQRLTQRVLCHFSVRVTDAPSSGDDEDGEDEAEDTGVDTGAPYWTRPERMD KKLLAVPAANTVRFRCPAAGNPTPSISWLKNGREFRGEHRIGGIKLRHQQWSLVMESV VPSDRGNYTCVVENKFGSIRQTYTLDVLERSPHRPILQAGLPANQTAVLGSDVEFHCK VYSDAQPHIQWLKHVEVNGSKVGPDGTPYVTVLKTAGANTTDKELEVLSLHNVTFEDA GEYTCLAGNSIGFSHHSAWLVVLPAEEELVEADEAGSVYAGILSYGVGFFLFILVVAA VTLCRLRSPPKKGLGSPTVHKISRFPLKRQVSLESNASMSSNTPLVRIARLSSGEGPT LANVSELELPADPKWELSRARLTLGKPLGEGCFGQVVMAEAIGIDKDRAAKPVTVAVK MLKDDATDKDLSDLVSEMEMMKMIGKHKNIINLLGACTQGGPLYVLVEYAAKGNLREF LRARRPPGLDYSFDTCKPPEEQLTFKDLVSCAYQVARGMEYLASQKCIHRDLAARNVL VTEDNVMKIADFGLARDVHNLDYYKKTTNGRLPVKWMAPEALFDRVYTHQSDVWSFGV LLWEIFTLGGSPYPGIPVEELFKLLKEGHRMDKPANCTHDLYMIMRECWHAAPSQRPT FKQLVEDLDRVLTVTSTDEYLDLSAPFEQYSPGGQDTPSSSSSGDDSVFAHDLLPPAP PSSGGSRT" gene 40..2460 /gene="FGFR3" mat_peptide 106..2457 /gene="FGFR3" /note="G00-127-526" /product="fibroblast growth factor receptor" BASE COUNT 441 a 827 c 840 g 412 t ORIGIN 1 cgcgcgctgc ctgaggacgc cgcggccccc gcccccgcca tgggcgcccc tgcctgcgcc 61 ctcgcgctct gcgtggccgt ggccatcgtg gccggcgcct cctcggagtc cttggggacg 121 gagcagcgcg tcgtggggcg agcggcagaa gtcccgggcc cagagcccgg ccagcaggag 181 cagttggtct tcggcagcgg ggatgctgtg gagctgagct gtcccccgcc cgggggtggt 241 cccatggggc ccactgtctg ggtcaaggat ggcacagggc tggtgccctc ggagcgtgtc 301 ctggtggggc cccagcggct gcaggtgctg aatgcctccc acgaggactc cggggcctac 361 agctgccggc agcggctcac gcagcgcgta ctgtgccact tcagtgtgcg ggtgacagac 421 gctccatcct cgggagatga cgaagacggg gaggacgagg ctgaggacac aggtgtggac 481 acaggggccc cttactggac acggcccgag cggatggaca agaagctgct ggccgtgccg 541 gccgccaaca ccgtccgctt ccgctgccca gccgctggca accccactcc ctccatctcc 601 tggctgaaga acggcaggga gttccgcggc gagcaccgca ttggaggcat caagctgcgg 661 catcagcagt ggagcctggt catggaaagc gtggtgccct cggaccgcgg caactacacc 721 tgcgtcgtgg agaacaagtt tggcagcatc cggcagacgt acacgctgga cgtgctggag 781 cgctccccgc accggcccat cctgcaggcg gggctgccgg ccaaccagac ggcggtgctg 841 ggcagcgacg tggagttcca ctgcaaggtg tacagtgacg cacagcccca catccagtgg 901 ctcaagcacg tggaggtgaa cggcagcaag gtgggcccgg acggcacacc ctacgttacc 961 gtgctcaaga cggcgggcgc taacaccacc gacaaggagc tagaggttct ctccttgcac 1021 aacgtcacct ttgaggacgc cggggagtac acctgcctgg cgggcaattc tattgggttt 1081 tctcatcact ctgcgtggct ggtggtgctg ccagccgagg aggagctggt ggaggctgac 1141 gaggcgggca gtgtgtatgc aggcatcctc agctacgggg tgggcttctt cctgttcatc 1201 ctggtggtgg cggctgtgac gctctgccgc ctgcgcagcc cccccaagaa aggcctgggc 1261 tcccccaccg tgcacaagat ctcccgcttc ccgctcaagc gacaggtgtc cctggagtcc 1321 aacgcgtcca tgagctccaa cacaccactg gtgcgcatcg caaggctgtc ctcaggggag 1381 ggccccacgc tggccaatgt ctccgagctc gagctgcctg ccgaccccaa atgggagctg 1441 tctcgggccc ggctgaccct gggcaagccc cttggggagg gctgcttcgg ccaggtggtc 1501 atggcggagg ccatcggcat tgacaaggac cgggccgcca agcctgtcac cgtagccgtg 1561 aagatgctga aagacgatgc cactgacaag gacctgtcgg acctggtgtc tgagatggag 1621 atgatgaaga tgatcgggaa acacaaaaac atcatcaacc tgctgggcgc ctgcacgcag 1681 ggcgggcccc tgtacgtgct ggtggagtac gcggccaagg gtaacctgcg ggagtttctg 1741 cgggcgcggc ggcccccggg cctggactac tccttcgaca cctgcaagcc gcccgaggag 1801 cagctcacct tcaaggacct ggtgtcctgt gcctaccagg tggcccgggg catggagtac 1861 ttggcctccc agaagtgcat ccacagggac ctggctgccc gcaatgtgct ggtgaccgag 1921 gacaacgtga tgaagatcgc agacttcggg ctggcccggg acgtgcacaa cctcgactac 1981 tacaagaaga caaccaacgg ccggctgccc gtgaagtgga tggcgcctga ggccttgttt 2041 gaccgagtct acactcacca gagtgacgtc tggtcctttg gggtcctgct ctgggagatc 2101 ttcacgctgg ggggctcccc gtaccccggc atccctgtgg aggagctctt caagctgctg 2161 aaggagggcc accgcatgga caagcccgcc aactgcacac acgacctgta catgatcatg 2221 cgggagtgct ggcatgccgc gccctcccag aggcccacct tcaagcagct ggtggaggac 2281 ctggaccgtg tccttaccgt gacgtccacc gacgagtacc tggacctgtc ggcgcctttc 2341 gagcagtact ccccgggtgg ccaggacacc cccagctcca gctcctcagg ggacgactcc 2401 gtgtttgccc acgacctgct gcccccggcc ccacccagca gtgggggctc gcggacgtga 2461 agggccactg gtccccaaca atgtgagggg tccctagcag ccctccctgc tgctggtgca // LOCUS HUMFISP 1963 bp mRNA PRI 08-NOV-1994 DEFINITION Human factor I (C3b/C4b inactivator) mRNA, complete cds. ACCESSION J02770 NID g182606 KEYWORDS C3b/C34 inactivator; complement cascade; factor I; serine protease. SOURCE Human hepatoma cell line HepG2 and normal liver, cDNA to mRNA, clones lambda-G2-HI1971 and lambda-gt10. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1963) AUTHORS Goldberger,G., Bruns,G.A., Rits,M., Edge,M.D. and Kwiatkowski,D.J. TITLE Human complement factor I: analysis of cDNA-derived primary structure and assignment of its gene to chromosome 4 JOURNAL J. Biol. Chem. 262 (21), 10065-10071 (1987) MEDLINE 87280021 COMMENT Draft entry and printed copy of sequence [1] kindly provided by G.Goldberger, 11-MAY-1987. There were no sequence differences between the sequences from the hepatoma cell line and the normal liver. FEATURES Location/Qualifiers source 1..1963 /organism="Homo sapiens" /db_xref="taxon:9606" /map="4q24-q25" mRNA <1..1963 /note="SPRC mRNA" sig_peptide 15..68 /gene="IF" /note="C3b/C4b inactivator signal peptide" gene 15..1766 /gene="IF" CDS 15..1766 /gene="IF" /note="prepro-C3b/C4B inactivator" /codon_start=1 /db_xref="GDB:G00-120-077" /db_xref="PID:g182607" /translation="MKLLHVFLLFLCFHLRFCKVTYTSQEDLVEKKCLAKKYTHLSCD KVFCQPWQRCIEGTCVCKLPYQCPKNGTAVCATNRRSFPTYCQQKSLECLHPGTKFLN NGTCTAEGKFSVSLKHGNTDSEGIVEVKLVDQDKTMFICKSSWSMREANVACLDLGFQ QGADTQRRFKLSDLSINSTECLHVHCRGLETSLAECTFTKRRTMGYQDFADVVCYTQK ADSPMDDFFQCVNGKYISQMKACDGINDCGDQSDELCCKACQGKGFHCKSGVCIPSQY QCNGEVDCITGEDEVGCAGFASVAQEETEILTADMDAERRRIKSLLPKLSCGVKNRMH IRRKRIVGGKRAQLGDLPWQVAIKDASGITCGGIYIGGCWILTAAHCLRASKTHRYQI WTTVVDWIHPDLKRIVIEYVDRIIFHENYNAGTYQNDIALIEMKKDGNKKDCELPRSI PACVPWSPYLFQPNDTCIVSGWGREKDNERVFSLQWGEVKLISNCSKFYGNRFYEKEM ECAGTYDGSIDACKGDSGGPLVCMDANNVTYVWGVVSWGENCGKPEFPGFYTKVANYF DWISYHVGRPFISQYNV" mat_peptide 69..1019 /gene="IF" /note="C3b/C4b inactivator heavy chain" mat_peptide 1032..1763 /gene="IF" /note="C3b/C4b inactivator light chain" BASE COUNT 605 a 353 c 454 g 551 t ORIGIN Chromosome 4. 1 cgaacacctc caacatgaag cttcttcatg ttttcctgtt atttctgtgc ttccacttaa 61 ggttttgcaa ggtcacttat acatctcaag aggatctggt ggagaaaaag tgcttagcaa 121 aaaaatatac tcacctctcc tgcgataaag tcttctgcca gccatggcag agatgcattg 181 agggcacctg tgtttgtaaa ctaccgtatc agtgcccaaa gaatggcact gcagtgtgtg 241 caactaacag gagaagcttc ccaacatact gtcaacaaaa gagtttggaa tgtcttcatc 301 cagggacaaa gtttttaaat aacggaacat gcacagccga aggaaagttt agtgtttcct 361 tgaagcatgg aaatacagat tcagagggaa tagttgaagt aaaacttgtg gaccaagata 421 agacaatgtt catatgcaaa agcagctgga gcatgaggga agccaacgtg gcctgccttg 481 accttgggtt tcaacaaggt gctgatactc aaagaaggtt taagttgtct gatctctcta 541 taaattccac tgaatgtcta catgtgcatt gccgaggatt agagaccagt ttggctgaat 601 gtacttttac taagagaaga actatgggtt accaggattt cgctgatgtg gtttgttata 661 cacagaaagc agattctcca atggatgact tctttcagtg tgtgaatggg aaatacattt 721 ctcagatgaa agcctgtgat ggtatcaatg attgtggaga ccaaagtgat gaactgtgtt 781 gtaaagcatg ccaaggcaaa ggcttccatt gcaaatcggg tgtttgcatt ccaagccagt 841 atcaatgcaa tggtgaggtg gactgcatta caggggaaga tgaagttggc tgtgcaggct 901 ttgcatctgt ggctcaagaa gaaacagaaa ttttgactgc tgacatggat gcagaaagaa 961 gacggataaa atcattatta cctaaactat cttgtggagt taaaaacaga atgcacattc 1021 gaaggaaacg aattgtggga ggaaagcgag cacaactggg agacctccca tggcaggtgg 1081 caattaagga tgccagtgga atcacctgtg ggggaattta tattggtggc tgttggattc 1141 tgactgctgc acattgtctc agagccagta aaactcatcg ttaccaaata tggacaacag 1201 tagtagactg gatacacccc gaccttaaac gtatagtaat tgaatacgtg gatagaatta 1261 ttttccatga aaactacaat gcaggcactt accaaaatga catcgctttg attgaaatga 1321 aaaaagacgg aaacaaaaaa gattgtgagc tgcctcgttc catccctgcc tgtgtcccct 1381 ggtctcctta cctattccaa cctaatgata catgcatcgt ttctggctgg ggacgagaaa 1441 aagataacga aagagtcttt tcacttcagt ggggtgaagt taaactaata agcaactgct 1501 ctaagtttta cggaaatcgt ttctatgaaa aagaaatgga atgtgcaggt acatatgatg 1561 gttccatcga tgcctgtaaa ggggactctg gaggcccctt agtctgtatg gatgccaaca 1621 atgtgactta tgtctggggt gttgtgagtt ggggggaaaa ctgtggaaaa ccagagttcc 1681 caggttttta caccaaagtg gccaattatt ttgactggat tagctaccat gtaggaaggc 1741 cttttatttc tcagtacaat gtataaaatt gtgatctctc tcttcattct attctttttc 1801 tctcaagagt tccatttaat ggaaataaaa cggtataatt aataattctc taggggggaa 1861 aaatgaagca aatctcattg gatattttta aaggtctcca cagagtttat gccatattgg 1921 aattttgttg tataattctc aaataaatat tttggtgaag cat // LOCUS HUMFK506A 964 bp mRNA PRI 31-DEC-1994 DEFINITION Human rapamycin binding protein (FK506) mRNA, complete cds. ACCESSION M96256 NID g182625 KEYWORDS rapamycin binding protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 964) AUTHORS Wiederrecht,G., Martin,M.M., Sigal,N.H. and Siekierka,J.J. TITLE Isolation of a human cDNA encoding a 25 kDa FK-506 and rapamycin binding protein JOURNAL Biochem. Biophys. Res. Commun. 185 (1), 298-303 (1992) MEDLINE 92287110 FEATURES Location/Qualifiers source 1..964 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Jurkat" /cell_type="T-cell" gene 24..698 /gene="FK506" CDS 24..698 /gene="FK506" /codon_start=1 /product="rapamycin binding protein" /db_xref="PID:g182626" /translation="MAAAVPQRAWTVEQLRSEQLPKKDIIKFLQEHGSDSFLAEHKLL GNIKNVAKTANKDHLVTAYNHLFETKRFKGTESISKVSEQVKNVKLNEDKPKETKSEE TLDEGPPKYTKSVLKKGDKTNFPKKGDVVHCWYTGTLQDGTVFDTNIQTSAKKKKNAK PLSFKVGVGKVIRGWDEALLTMSKGEKARLEIEPEWAYGKKGQPDAKIPPNAKLTFEV ELVDID" BASE COUNT 338 a 163 c 223 g 240 t ORIGIN 1 gaaagcggag gcagcggggg aagatggcgg cggccgttcc acagcgggcg tggaccgtgg 61 agcagctgcg cagtgagcag ctgcccaaga aggatattat caagtttctg caggaacacg 121 gttcagattc gtttcttgca gaacataaat tattaggaaa cattaaaaat gtggccaaga 181 cagctaacaa ggaccacttg gttacagcct ataaccatct ttttgaaact aagcgtttta 241 agggtactga aagtataagt aaagtgtctg agcaagtaaa aaatgtgaag cttaatgaag 301 ataaacccaa agaaaccaag tctgaagaga ccctggatga gggtccacca aaatatacta 361 aatctgttct gaaaaaggga gataaaacca actttcccaa aaagggagat gttgttcact 421 gctggtatac aggaacacta caagatggga ctgtttttga tactaatatt caaacaagtg 481 caaagaagaa gaaaaatgcc aagcctttaa gttttaaggt cggagtaggc aaagttatca 541 gaggatggga tgaagctctc ttgactatga gtaaaggaga aaaggctcga ctggagattg 601 aaccagaatg ggcttacgga aagaaaggac agcctgatgc caaaattcca ccaaatgcaa 661 aactcacttt tgaagtggaa ttagtggata ttgattgaaa taggcagtgc ttcagctcta 721 aggatattag caacaatgat aaaacttggc cttgaagaaa tttacacaac tagttagaac 781 ttgttactat tgtaaaggaa gagtcaactg gaaaattcaa ggagttaata aaatttgttt 841 acttggtccc agcttttgag agataaatcc cttatgaatc cctggtctaa aatactttcc 901 tacagctgtg taaaatactg gtcaaggaga actttttcct tttacctcat gttgtaaact 961 taag // LOCUS HUMFK506B 880 bp mRNA PRI 02-NOV-1995 DEFINITION Homo sapiens FK-506 binding protein (fkbp12.6) gene, complete cds. ACCESSION L37086 NID g965467 KEYWORDS FK506 binding protein; calcineurin. SOURCE Homo sapiens (tissue library: stratagene) brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 880) AUTHORS Lam,E., Martin,M., Timerman,M., Fleischer,S., Sabers,C., Fleischer,S., Lukas,T.J., Abram,R.T., OKeefe,S.J., ONeill,E.A. and Wiederrecht,G.J. TITLE A novel FK506 binding protein can mediate the immunosuppressive effects of FK506 and is associated with the cardiac ryanodine receptor JOURNAL J. Biol. Chem. 270 (44), 26511-26522 (1995) MEDLINE 96064732 FEATURES Location/Qualifiers source 1..880 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /tissue_lib="stratagene" gene 67..393 /gene="FKBP12.6" CDS 67..393 /gene="FKBP12.6" /note="FK-506 binding protein" /codon_start=1 /product="calcineurin" /db_xref="PID:g965468" /translation="MGVEIETISPGDGRTFPKKGQTCVVHYTGMLQNGKKFDSSRDRN KPFKFRIGKQEVIKGFEEGAAQMSLGQRAKLTCTPDVAYGATGHPGVIPPNATLIFDV ELLNLE" BASE COUNT 212 a 226 c 230 g 212 t ORIGIN 1 ggccggagcc gagccggggt cgggcagcag cagggacccc ccagaggcgg ggcctgtggg 61 accgctatgg gcgtggagat cgagaccatc tcccccggag acggaaggac attccccaag 121 aagggccaaa cgtgtgtggt gcactacaca ggaatgctcc aaaatgggaa gaagtttgat 181 tcatccagag acagaaacaa acctttcaag ttcagaattg gcaaacagga agtcatcaaa 241 ggttttgaag agggtgcagc ccagatgagc ttggggcaga gggcgaagct gacctgcacc 301 cctgatgtgg catatggagc cacgggccac cccggtgtca tccctcccaa tgccaccctc 361 atctttgacg tggagctgct caacttagag tgaaggcagg aaggaactca aggtggctgg 421 agatggctgc tgctcaccct cctagcctgc tctgccactg ggacggctcc tgcttttggg 481 gctcttgatc agtgtgctaa cctcactgcc tcatggcatc atccattctc tctgcccaag 541 ttgctctgta tgtgttcgtc agtgttcatg cgaattcttg cttgaggaaa cttcggttgc 601 agattgaagc atttcaggtt gtgcattttg tgtgatgcat gtagtagcct ttcctgatga 661 cagaacacag atctcttgtt cgcacaatct acactgcctt accttcactt aaaccacaca 721 cacaaggtgc tcagacatga aatgtacatg gcgtaccgta cacagaggga cttgagccag 781 ttacctttgc tgtcactttc tctcttataa attctgttag ctgctcactt aaacaatgtc 841 ctctttgaga aaatgtaaaa taaaggctct gtgcttgaca // LOCUS HUMFKBP13 550 bp mRNA PRI 31-DEC-1994 DEFINITION Human rapamycin-binding protein (FKBp-13) mRNA, complete cds. ACCESSION M65128 NID g182639 KEYWORDS rapamycin-binding protein. SOURCE Homo sapiens (tissue library: lambda-gt11 cDNA) colon carcinoma cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 550) AUTHORS Jin,Y.J., Albers,M.W., Lane,W.S., Bierer,B.E., Schreiber,S.L. and Burakoff,S.J. TITLE Molecular cloning of a membrane-associated human FK506- and rapamycin-binding protein, FKBP-13 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88 (15), 6677-6681 (1991) MEDLINE 91319747 FEATURES Location/Qualifiers source 1..550 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="colon carcinoma" /tissue_lib="lambda-gt11 cDNA" sig_peptide 38..100 /gene="FKBP13" CDS 38..463 /gene="FKBP13" /codon_start=1 /product="rapamycin-binding protein" /db_xref="PID:g182640" /translation="MRLSWFRVLTVLSICLSAVASTGTEGKRKLQIGVKKRVDHCPIK SRKGDVLHMHYTGKLEDGTEFDSSLPQNQPFVFSLGTGQVIKGWDQGLLGMYEGEKRK LVIPSELGYGERGAPPKIPGGATLVFEVELLKIERRTEL" gene 38..463 /gene="FKBP13" mat_peptide 101..460 /gene="FKBP13" /product="rapamycin-binding protein" BASE COUNT 146 a 137 c 185 g 82 t ORIGIN 1 ggccggggtt gactccgggg gcgcggcgag gagagacatg aggctgagct ggttccgggt 61 cctgacagta ctgtccatct gcctgagcgc cgtggccagc acggggaccg agggcaaaag 121 gaagctgcag atcggggtca agaagcgggt ggaccactgt cccatcaaat cgcgcaaagg 181 ggatgtcctg cacatgcact acacggggaa gctggaagat gggacagagt ttgacagcag 241 cctgccccag aaccagccct ttgtcttctc ccttggcaca ggccaggtca tcaagggctg 301 ggaccagggg ctgctgggga tgtatgaggg ggaaaagcgc aagctggtga tcccatccga 361 gctagggtat ggagagcggg gagctccccc aaagattcca ggcggtgcaa ccctggtgtt 421 cgaggtggag ctgctcaaaa tagagcgacg aactgagctg taaccagact gggaggggca 481 ggggagaggc ccccatcagg accagactgt tccaaaaaaa aaaaaacaaa aaacaaacaa 541 aaaaacactt // LOCUS HUMFKBPA 1641 bp mRNA PRI 16-MAY-1996 DEFINITION Human FK-506 binding protein homologue (FKBP38) mRNA, complete cds. ACCESSION L37033 NID g965469 KEYWORDS binding protein; leucine zipper; tetratricopeptide family. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1641) AUTHORS Lam,E., Martin,M. and Wiederrecht,G. TITLE Isolation of a cDNA encoding a novel human FK506-binding protein homolog containing leucine zipper and tetratricopeptide repeat motifs JOURNAL Gene 160 (2), 297-302 (1995) MEDLINE 95369708 COMMENT Related entries: acc# T09208 T08452 M79056 M62106. FEATURES Location/Qualifiers source 1..1641 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Jurkat library (Stratagene)" gene 141..1208 /gene="FKBP38" CDS 141..1208 /gene="FKBP38" /codon_start=1 /product="FK-506 binding protein homologue" /db_xref="PID:g965470" /translation="MGQPPAEEAEQPGALAREFLAAMEPEPAPAPAPEEWLDILGNGL LRKKTLVPGPPGSSRPVKGQVVTVHLQTSLENGTRVQEEPELVFTLGDCDVIQALDLS VPLMDVGETAMVTADSKYCYGPQGRSPYIPPHAALCLEVTLKTAVDGPDLEMLTGQER VALANRKRECGNAHYQRADFVLAANSYDLAIKAITSSAKVDMTFEEEAQLLQLKVKCL NNLAASQLKLDHYRAALRSCSLVLEHQPDNIKALFRKGKVLAQQGEYSEAIPILRAAL KLEPSNKTIHAELSKLVKKHAAQRSTETALYRKMLGNPSRLPAKCPGKGAWSIPWKWL FGATAVALGGVALSVVIAARN" BASE COUNT 324 a 550 c 507 g 260 t ORIGIN 1 gattcccctc accctctgca tcctcaaccc catccagtac ctcgaagtcc tcgagcggtg 61 ggaccccggc gggtgaggag gaagaggagg aggaagagga ggaagaggat gacctgagtg 121 agctgccacc gctggaggac atgggacaac ccccggcgga ggaggctgag cagcctgggg 181 ccctggcccg agagttcctt gctgccatgg agcccgagcc cgccccagcc ccggccccag 241 aagagtggct ggacattctg gggaacgggc tgttgaggaa gaagacgctg gtcccagggc 301 cgccaggttc gagccgcccg gtcaagggcc aggtggtcac cgtacatctg cagacgtcgc 361 tggagaatgg cacacgggtg caggaggagc cggagctggt gttcactctg ggtgactgtg 421 acgtcatcca ggccctggat ctcagtgtcc cactcatgga cgtgggggag acggccatgg 481 tcactgctga ctccaagtac tgctacggcc cccaaggcag gagcccatac atccccccgc 541 acgcggccct gtgcctggag gtgaccctga agacggctgt ggacgggcct gacctggaga 601 tgctcacggg gcaggagcgc gtggccctgg ccaaccggaa gcgggagtgc ggcaacgccc 661 actaccagcg ggcggacttc gtcctggccg ccaactccta cgacctcgcc atcaaggcta 721 tcacctccag cgccaaagtg gacatgacgt tcgaggagga ggcacagctc ctgcagttga 781 aggtgaagtg tctgaacaac ctggcggcct cgcagctgaa gctcgaccac taccgcgcag 841 ccctgcgctc ctgcagcctt gtgctggagc accagccaga caacatcaag gctctcttcc 901 gcaagggcaa ggtgctggcc cagcaggggg agtacagtga ggccatcccc atcctgaggg 961 cagccctgaa gctggaacct tccaacaaga cgatccacgc agagctctca aagctggtga 1021 agaagcatgc ggcgcagcgg agcacggaga ccgccttgta ccggaaaatg ctgggcaacc 1081 ccagccggct gcctgctaag tgccctggca agggtgcctg gtccatccca tggaagtggc 1141 tgtttggggc gactgctgtt gccttggggg gtgtggcact ctctgtggtc atcgctgcca 1201 ggaactgacc acctaggtgg ctgccacccc ctctgcacac catggaccct gccctgcgct 1261 ccccaactcc cccaggctcc ctgtccactg ccctccctgg tctggccccc tcctccgggt 1321 taggggagca aggattgggg gtcgtgcagc ccagccagca ggagggactg aggccctcta 1381 ggaggaaagc ccagagggag ggggccctca ttccttcaga cccagttttc ccccaccctc 1441 cttaccccgc tgggctaggt ctccgccagg gctggcctca gtttctcctc aacaggcctg 1501 ggggcagccc ttcccctgcc tagtccccgc ctgagtgcca gccccccacc ccgcctgccg 1561 ccccctgtcc aggttccctc cccgccacag tgaaataaag catcccaccc tgcaaaaaaa 1621 aaaaaaaaaa aaaaggaatt c // LOCUS HUMFKHH 1044 bp mRNA PRI 31-DEC-1994 DEFINITION Human fork head-related protein (FKH H3) mRNA, complete cds. ACCESSION L12141 NID g506820 KEYWORDS fork head-related protein. SOURCE Homo sapiens (tissue library: lambda gt11) liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1044) AUTHORS Hromas,R., Moore,J., Johnston,T., Socha,C. and Klemsz,M. TITLE Drosophila forkhead homologues are expressed in a lineage-restricted manner in human hematopoietic cells JOURNAL Blood 81 (11), 2854-2859 (1993) MEDLINE 93271467 FEATURES Location/Qualifiers source 1..1044 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HEPG2" /tissue_type="liver" /tissue_lib="lambda gt11" gene 1..1044 /gene="FKH H3" CDS 1..1044 /gene="FKH H3" /note="HNF-3 gamma homologue" /codon_start=1 /product="fork head-related protein" /db_xref="PID:g506821" /translation="MLGSVKMEAHDLAEWSYYPEAGEVYSPVTPVPTMAPLNSYMTLN PLSSPYPGGLPASPLPSGPLAPPAPAAPLGPTFPGLGLSGGSSSSGYGAPGPGLVHGK EMPKGYRAPAHAKPPYSYISLITMAIQQAPGKVLTLSEIYQWIMDLFPYYRDNQQRWQ NSIRHSLSFNDCFVKVARSPDKPGKGSYWALHPSSGNMFENGCYLRRQKRFKLEEKVK KGGSGASTTRNGTGSAASTTTPAATVTSPPQPPPPAPEPEAQGGEDVGALDCGSPASS TPYFTGLELPGDLKLDAPYNFNHPFSINNLMSEQTPAPPKLDVGFGGYGAEGGEPGVY YQGLYSRSLLNAS" BASE COUNT 202 a 368 c 290 g 184 t ORIGIN 1 atgctgggct cagtgaagat ggaggcccat gacctggccg agtggagcta ctacccggag 61 gcgggcgagg tctactcgcc ggtgacccca gtgcccacca tggcccccct caactcctac 121 atgaccctga atcctctaag ctctccctat cctggggggc tccctgcctc cccactgccc 181 tcaggacccc tggcaccccc agcacctgca gcccccctgg ggcccacttt cccaggcctg 241 ggtctgagcg gtggcagcag cagctccggg tacggggccc cgggtcctgg gctggtgcac 301 gggaaggaga tgccgaaggg gtatcgcgcc cctgcacacg ccaagccacc gtattcctat 361 atctcactca tcaccatggc tatccagcag gcgccgggca aggtgctgac cttgagtgaa 421 atctaccagt ggatcatgga cctcttccct tactaccggg acaatcagca gcgctggcag 481 aactccattc gccactcgct gtctttcaac gactgcttcg tcaaggtggc gcgttcccca 541 gacaagcctg gcaagggctc ctactgggcc ctacacccca gctcagggaa catgtttgag 601 aatggctgct acctgcgccg ccagaaacgc ttcaagctgg aggagaaggt gaaaaaaggg 661 ggcagcgggg catcgaccac caggaacggg acagggtctg ctgcctcgac caccaccccc 721 gccgccacag tcacctcccc gccccagccc ccgcctccag cccctgagcc tgaggcccag 781 ggcggggaag atgtgggggc tctggactgt ggctcacccg cttcctccac accctatttc 841 actggcctgg agctcccagg ggacctgaag ctggacgcgc cctacaactt caaccaccct 901 ttctccatca acaacctaat gtcagaacag acaccagcac ctcccaaact ggacgtgggg 961 tttgggggct acggggctga aggtggggag cctggagtct actaccaggg cctctattcc 1021 cgctctttgc ttaatgcatc ctag // LOCUS HUMFLNG6PD 219447 bp DNA PRI 26-FEB-1996 DEFINITION Homo sapiens chromosome X region from filamin (FLN) gene to glucose-6-phosphate dehydrogenase (G6PD) gene, complete cds's. ACCESSION L44140 NID g1203968 KEYWORDS 1A gene; 2_19 gene; ABP-280 gene; DNL1L gene; DNase I-like protein gene; EMD gene; FLN gene; G4.5 gene; G4.8 gene; G6PD gene; GDI gene; GdX gene; P3 gene; QM gene; STA gene; XAP-1 gene; XAP-2 gene; XAP-4 gene; XAP-5 gene; XAP-7 gene; actin-binding protein; emerin; emery-dreyfuss syndrome; filamin; glucose-6-phosphate dehydrogenase. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 219447) AUTHORS Chen,E.Y., Zollo,M., Mazzarella,R.A., Ciccodicola,A., Chen,C.-N., Zuo,L., Heiner,C., Burough,F.W., Ripetto,M., Schlessinger,D. and D'Urso,M. TITLE Thirteen known and six candidate genes in 219.4kb of high GC DNA between the human RCP/GCP and G6PD loci JOURNAL Unpublished (1996) COMMENT Submitted by: Ellson Chen, Advanced Center for Genetic Technology, Applied Biosystems Division of Perlin Elmer Corp., 850 Lincoln Center Drive, Foster City, CA 94404 USA and David Schlessinger, Department of Molecular Microbiology and Center for Genetics in Medicine Washington University School of Medicine, St. Louis MO 63110 USA e-mail: ellson@genseq.apldbio.com and davids@genetics.wustl.edu Note: Gene predictions were accomplished with runs of Grail versions 1.1 and 1.2, coupled with fasta and blastx comparisons to genbank & non-redundant peptide libraries. Repeat analysis was accomplished via censor. FEATURES Location/Qualifiers source 1..219447 /organism="Homo sapiens" /db_xref="taxon:9606" /map="X" /chromosome="X" repeat_unit 4..223 /rpt_family="Alu-Sp or Alu-Sq" /evidence=experimental repeat_unit 226..503 /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit 641..931 /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit 1023..1313 /rpt_family="Alu-Sx" /evidence=experimental repeat_unit complement(1446..1536) /rpt_family="Alu-J" /evidence=experimental repeat_region 3095..3426 /rpt_family="CpG Island" /evidence=experimental repeat_unit complement(3663..3952) /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental repeat_unit 4912..5046 /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit 5047..5333 /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental repeat_unit 5346..5628 /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit 5632..5805 /rpt_family="Alu-Sq" /evidence=experimental repeat_unit 6363..17698 /rpt_family="11Kb repeat1" /evidence=experimental repeat_unit 6369..6633 /rpt_family="Alu-Sx" /evidence=experimental repeat_unit complement(7121..7409) /rpt_family="Alu-Sp" /evidence=experimental repeat_unit complement(7358..7432) /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental repeat_unit complement(7426..7701) /rpt_family="Alu-J" /evidence=experimental repeat_unit 8306..8592 /rpt_family="Alu-Sx" /evidence=experimental repeat_unit complement(8863..9029) /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit complement(9031..9317) /rpt_family="Alu-Sq" /evidence=experimental repeat_unit complement(9318..9442) /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit 9486..9773 /rpt_family="Alu-J" /evidence=experimental repeat_unit 9996..10132 /rpt_family="Alu-Sp" /evidence=experimental repeat_unit 10159..10255 /rpt_family="Alu-Sq" /evidence=experimental repeat_unit 10271..10367 /rpt_family="Alu-J" /evidence=experimental repeat_unit complement(10455..10739) /rpt_family="Alu-Sx" /evidence=experimental repeat_unit complement(10740..10860) /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit complement(10866..11156) /rpt_family="Alu-Sx" /evidence=experimental repeat_unit 11223..11501 /rpt_family="Alu-J" /evidence=experimental repeat_region 11638..13564 /rpt_family="CpG Island" /evidence=experimental repeat_unit complement(11884..12163) /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental repeat_unit complement(12195..12482) /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit complement(12668..12945) /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit complement(18360..18648) /rpt_family="Alu-Sb0" /evidence=experimental exon complement(18984..19488) /gene="FLN" /number=48 /evidence=experimental gene complement(18984..45006) /gene="FLN" CDS complement(join(19301..19488,19814..20017,20101..20319, 20483..20659,21361..21493,22033..22148,22336..22473, 22633..22899,23005..23127,23224..23376,23453..23656, 23748..23909,24006..24179,24367..24495,24603..24743, 24833..24935,25068..25163,25277..25524,27701..27724, 27884..28073,28649..28805,28895..29018,29434..29604, 29696..29856,29934..30096,30182..30355,30440..31037, 31758..32020,32120..32237,32429..32598,32692..32782, 32868..33028,33111..33234,34472..34615,34709..34822, 34976..35169,35271..35407,35586..35709,35798..35935, 36473..36673,36757..36919,37011..37088,37181..37299, 37846..37993,38090..38187,38291..38539,41319..41691)) /gene="FLN" /codon_start=1 /function="binds actin" /evidence=experimental /product="filamin" /db_xref="PID:g1203969" /translation="MSSSHSRAGQSAAGAAPGGGVDTRDAEMPATEKDLAEDAPWKKI QQNTFTRWCNEHLKCVSKRIANLQTDLSDGLRLIALLEVLSQKKMHRKHNQRPTFRQM QLENVSVALEFLDRESIKLVSIDSKAIVDGNLKLILGLIWTLILHYSISMPMWDEEED EEAKKQTPKQRLLGWIQNKLPQLPITNFSRDWQSGRALGALVDSCAPGLCPDWDSWDA SKPVTNAREAMQQADDWLGIPQVITPEEIVDPNVDEHSVMTYLSQFPKAKLKPGAPLR PKLNPKKARAYGPGIEPTGNMVKKRAEFTVETRSAGQGEVLVYVEDPAGHQEEAKVTA NNDKNRTFSVWYVPEVTGTHKVTVLFAGQHIAKSPFEVYVDKSQGDASKVTAQGPGLE PSGNIANKTTYFEIFTAGAGTGEVEVVIQDPMGQKGTVEPQLEARGDSTYRCSYQPTM EGVHTVHVTFAGVPIPRSPYTVTVGQACNPSACRAVGRGLQPKGVRVKETADFKVYTK GAGSGELKVTVKGPKGEERVKQKDLGDGVYGFEYYPMVPGTYIVTITWGGQNIGRSPF EVKVGTECGNQKVRAWGPGLEGGVVGKSADFVVEAIGDDVGTLGFSVEGPSQAKIECD DKGDGSCDVRYWPQEAGEYAVHVLCNSEDIRLSPFMADIRDAPQDFHPDRVKARGPGL EKTGVAVNKPAEFTVDAKHGGKAPLRVQVQDNEGCPVEALVKDNGNGTYSCSYVPRKP VKHTAMVSWGGVSIPNSPFRVNVGAGSHPNKVKVYGPGVAKTGLKAHEPTYFTVDCAE AGQGDVSIGIKCAPGVVGPAEADIDFDIIRNDNDTFTVKYTPRGAGSYTIMVLFADQA TPTSPIRVKVEPSHDASKVKAEGPGLSRTGVELGKPTHFTVNAKAAGKGKLDVQFSGL TKGDAVRDVDIIDHHDNTYTVKYTPVQQGPVGVNVTYGGDPIPKSPFSVAVSPSLDLS KIKVSGLGEKVDVGKDQEFTVKSKGAGGQGKVASKIVGPSGAAVPCKVEPGLGADNSV VRFLPREEGPYEVEVTYDGVPVPGSPFPLEAVAPTKPSKVKAFGPGLQGGSAGSPARF TIDTKGAGTGGLGLTVEGPCEAQLECLDNGDGTCSVSYVPTEPGDYNINILFADTHIP GSPFKAHVVPCFDASKVKCSGPGLERATAGEVGQFQVDCSSAGSAELTIEICSEAGLP AEVYIQDHGDGTHTITYIPLCPGAYTVTIKYGGQPVPNFPSKLQVEPAVDTSGVQCYG PGIEGQGVFREATTEFSVDARALTQTGGPHVKARVANPSGNLTETYVQDRGDGMYKVE YTPYEEGLHSVDVTYDGSPVPSSPFQVPVTEGCDPSRVRVHGPGIQSGTTNKPNKFTV ETRGAGTGGLGLAVEGPSEAKMSCMDNKDGSCSVEYIPYEAGTYSLNVTYGGHQVPGS PFKVPVHDVTDASKVKCSGPGLSPGMVRANLPQSFQVDTSKAGVAPLQVKVQGPKGLV EPVDVVDNADGTQTVNYVPSREGPYSISVLYGDEEVPRSPFKVKVLPTHDASKVKASG PGLNTTGVPASLPVEFTIDAKDAGEGLLAVQITDPEGKPKKTHIQDNHDGTYTVAYVP DVTGRYTILIKYGGDEIPFSPYRVRAVPTGDASKCTVTVSIGGHGLGAGIGPTIQIGE ETVITVDTKAAGKGKVTCTVCTPDGSEVDVDVVENEDGTFDIFYTAPQPGKYVICVRF GGEHVPNSPFQVTALAGDQPSVQPPLRSQQLAPQYTYAQGGQQTWAPERPLVGVNGLD VTSLRPFDLVIPFTIKKGEITGEVRMPSGKVAQPTITDNKDGTVTVRYAPSEAGLHEM DIRYDNMHIPGSPLQFYVDYVNCGHVTAYGPGLTHGVVNKPATFTVNTKDAGEGGLSL AIEGPSKAEISCTDNQDGTCSVSYLPVLPGDYSILVKYNEQHVPGSPFTARVTGDDSM RMSHLKVGSAADIPINISETDLSLLTATVVPPSGREEPCLLKRLRNGHVGISFVPKET GEHLVHVKKNGQHVASSPIPVVISQSEIGDASRVRVSGQGLHEGHTFEPAEFIIDTRD AGYGGLSLSIEGPSKVDINTEDLEDGTCRVTYCPTEPGNYIINIKFADQHVPGSPFSV KVTGEGRVKESITRRRRAPSVANVGSHCDLSLKIPEISIQDMTAQVTSPSGKTHEAEI VEGENHTYCIRFVPAEMGTHTVSVKYKGQHVPGSPFQFTVGPLGEGGAHKVRAGGPGL ERAEAGVPAEFSIWTREAGAGGLAIAVEGPSKAEISFEDRKDGSCGVAYVVQEPGDYE VSVKFNEEHIPDSPFVVPVASPSGDARRLTVSSLQESGLKVNQPASFAVSLNGAKGAI DAKVHSPSGALEECYVTEIDQDKYAVRFIPRENGVYLIDVKFNGTHIPGSPFKIRVGE PGHGGDPGLVSAYGAGLEGGVTGNPAEFVVNTSNAGAGALSVTIDGPSKVKMDCQECP EGYRVTYTPMAPGSYLISIKYGGPYHIGGSPFKAKVTGPRLVSNHSLHETSSVFVDSL TKATCAPQHGAPGPGPADASKVVAKGLGLSKAYVGQKSSFTVDCSKAGNNMLLVGVHG PRTPCEEILVKHVGSRLYSVSYLLKDKGEYTLVVKWGDEHIPGSPYRVVVP" intron complement(19489..19813) /gene="FLN" /evidence=experimental repeat_unit 19490..19709 /gene="FLN" /rpt_family="Alu-Sq" /evidence=experimental exon complement(19814..20017) /gene="FLN" /number=47 intron complement(20018..20100) /gene="FLN" /evidence=experimental exon complement(20101..20319) /gene="FLN" /number=46 intron complement(20320..20482) /gene="FLN" /evidence=experimental repeat_unit complement(20336..20624) /gene="FLN" /rpt_family="Alu-Sq" /evidence=experimental exon complement(20483..20659) /gene="FLN" /number=45 intron complement(20660..21360) /gene="FLN" /evidence=experimental repeat_unit complement(20663..20940) /gene="FLN" /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit complement(21031..21280) /gene="FLN" /rpt_family="Alu-J or Alu-S" /evidence=experimental exon complement(21361..21493) /gene="FLN" /number=44 intron complement(21494..22032) /gene="FLN" /evidence=experimental exon complement(22033..22148) /gene="FLN" /number=43 intron complement(22149..22335) /gene="FLN" /evidence=experimental repeat_unit 22176..22286 /gene="FLN" /rpt_family="Alu-J or Alu-S" /evidence=experimental exon complement(22336..22473) /gene="FLN" /number=42 intron complement(22474..22632) /gene="FLN" /evidence=experimental exon complement(22633..22899) /gene="FLN" /number=41 intron complement(22900..23004) /gene="FLN" /evidence=experimental exon complement(23005..23127) /gene="FLN" /number=40 repeat_unit 23010..23294 /gene="FLN" /rpt_family="Alu-Sx" /evidence=experimental intron complement(23128..23223) /gene="FLN" /evidence=experimental exon complement(23224..23376) /gene="FLN" /number=39 intron complement(23377..23452) /gene="FLN" /evidence=experimental exon complement(23453..23656) /gene="FLN" /number=38 repeat_unit 23583..23869 /gene="FLN" /rpt_family="Alu-Sq" /evidence=experimental intron complement(23657..23747) /gene="FLN" /evidence=experimental exon complement(23748..23909) /gene="FLN" /number=37 intron complement(23910..24005) /gene="FLN" /evidence=experimental exon complement(24006..24179) /gene="FLN" /number=36 repeat_unit complement(24137..24256) /gene="FLN" /evidence=experimental intron complement(24180..24366) /gene="FLN" /evidence=experimental repeat_unit complement(24267..24548) /gene="FLN" /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental exon complement(24367..24495) /gene="FLN" /number=35 intron complement(24496..24602) /gene="FLN" /evidence=experimental repeat_unit complement(24580..24671) /gene="FLN" /rpt_family="Alu-Sp or Alu-Sq" /evidence=experimental exon complement(24603..24743) /gene="FLN" /number=34 intron complement(24744..24832) /gene="FLN" /evidence=experimental repeat_unit complement(24803..25082) /gene="FLN" /rpt_family="Alu-Sq" /evidence=experimental exon complement(24833..24935) /gene="FLN" /number=33 intron complement(24936..25067) /gene="FLN" /evidence=experimental exon complement(25068..25163) /gene="FLN" /number=32 intron complement(25164..25276) /gene="FLN" /evidence=experimental exon complement(25277..25524) /gene="FLN" /number=31 repeat_unit 25346..25635 /gene="FLN" /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental intron complement(25525..27700) /gene="FLN" /evidence=experimental repeat_unit 25816..26038 /gene="FLN" /rpt_family="Alu-Sx" /evidence=experimental repeat_unit complement(26751..26926) /gene="FLN" /rpt_family="Alu-Sq" /evidence=experimental repeat_unit complement(26984..27166) /gene="FLN" /evidence=experimental exon complement(27701..27724) /gene="FLN" /number=30 intron complement(27725..27883) /gene="FLN" /evidence=experimental exon complement(27884..28073) /gene="FLN" /number=29 repeat_unit complement(28007..28054) /gene="FLN" /rpt_family="Alu-J or Alu-S" /evidence=experimental intron complement(28074..28648) /gene="FLN" /evidence=experimental repeat_unit complement(28083..28372) /gene="FLN" /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental repeat_unit complement(28382..28670) /gene="FLN" /rpt_family="Alu-Sp or Alu-Sq" /evidence=experimental exon complement(28649..28805) /gene="FLN" /number=28 repeat_unit 28754..29042 /gene="FLN" /rpt_family="Alu-Sb2" /evidence=experimental intron complement(28806..28894) /gene="FLN" /evidence=experimental exon complement(28895..29018) /gene="FLN" /number=27 intron complement(29019..29433) /gene="FLN" /evidence=experimental repeat_unit complement(29221..29314) /gene="FLN" /rpt_family="Alu-J or Alu-S" /evidence=experimental exon complement(29434..29604) /gene="FLN" /number=26 intron complement(29605..29695) /gene="FLN" /evidence=experimental repeat_unit 29680..29960 /gene="FLN" /rpt_family="Alu-J or Alu-S" /evidence=experimental exon complement(29696..29856) /gene="FLN" /number=25 intron complement(29857..29933) /gene="FLN" /evidence=experimental exon complement(29934..30096) /gene="FLN" /number=24 repeat_unit 30063..30341 /gene="FLN" /rpt_family="Alu-J or Alu-S" /evidence=experimental intron complement(30097..30181) /gene="FLN" /evidence=experimental exon complement(30182..30355) /gene="FLN" /number=23 intron complement(30356..30439) /gene="FLN" /evidence=experimental exon complement(30440..31037) /gene="FLN" /number=22 intron complement(31038..31757) /gene="FLN" /evidence=experimental repeat_unit 31215..31504 /gene="FLN" /rpt_family="Alu-Sx" /evidence=experimental exon complement(31758..32020) /gene="FLN" /number=21 intron complement(32021..32119) /gene="FLN" /evidence=experimental exon complement(32120..32237) /gene="FLN" /number=20 intron complement(32238..32428) /gene="FLN" /evidence=experimental exon complement(32429..32598) /gene="FLN" /number=19 intron complement(32599..32691) /gene="FLN" /evidence=experimental exon complement(32692..32782) /gene="FLN" /number=18 intron complement(32783..32867) /gene="FLN" /evidence=experimental exon complement(32868..33028) /gene="FLN" /number=17 intron complement(33029..33110) /gene="FLN" /evidence=experimental exon complement(33111..33234) /gene="FLN" /number=16 intron complement(33235..34471) /gene="FLN" /evidence=experimental repeat_unit 33585..33875 /gene="FLN" /rpt_family="Alu-Sp" /evidence=experimental repeat_unit 33887..34169 /gene="FLN" /rpt_family="Alu-Sb0" /evidence=experimental repeat_unit 34122..34193 /gene="FLN" /rpt_family="Alu-Sb0" /evidence=experimental repeat_unit 34206..34346 /gene="FLN" /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental repeat_unit 34347..34636 /gene="FLN" /rpt_family="Alu-J or Alu-S" /evidence=experimental exon complement(34472..34615) /gene="FLN" /number=15 intron complement(34616..34708) /gene="FLN" /evidence=experimental repeat_unit 34637..34794 /gene="FLN" /rpt_family="Alu-Sq" /evidence=experimental exon complement(34709..34822) /gene="FLN" /number=14 intron complement(34823..34975) /gene="FLN" /evidence=experimental exon complement(34976..35169) /gene="FLN" /number=13 intron complement(35170..35270) /gene="FLN" /evidence=experimental exon complement(35271..35407) /gene="FLN" /number=12 intron complement(35408..35585) /gene="FLN" /evidence=experimental repeat_unit complement(35414..35700) /gene="FLN" /rpt_family="Alu-Sq" /evidence=experimental exon complement(35586..35709) /gene="FLN" /number=11 intron complement(35710..35797) /gene="FLN" /evidence=experimental exon complement(35798..35935) /gene="FLN" /number=10 intron complement(35936..36472) /gene="FLN" /evidence=experimental exon complement(36473..36673) /gene="FLN" /number=9 repeat_unit complement(36605..36758) /gene="FLN" /rpt_family="Alu-Sp or Alu-Sq" /evidence=experimental intron complement(36674..36756) /gene="FLN" /evidence=experimental exon complement(36757..36919) /gene="FLN" /number=8 repeat_unit complement(36805..36906) /gene="FLN" /rpt_family="Alu-Sq" /evidence=experimental intron complement(36920..37010) /gene="FLN" /evidence=experimental exon complement(37011..37088) /gene="FLN" /number=7 intron complement(37089..37180) /gene="FLN" /evidence=experimental exon complement(37181..37299) /gene="FLN" /number=6 intron complement(37300..37845) /gene="FLN" /evidence=experimental exon complement(37846..37993) /gene="FLN" /number=5 intron complement(37994..38089) /gene="FLN" /evidence=experimental exon complement(38090..38187) /gene="FLN" /number=4 repeat_unit complement(38103..38380) /gene="FLN" /rpt_family="Alu-Sc" /evidence=experimental intron complement(38188..38290) /gene="FLN" /evidence=experimental exon complement(38291..38539) /gene="FLN" /number=3 intron complement(38540..41318) /gene="FLN" /evidence=experimental repeat_unit complement(38751..38936) /gene="FLN" /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit 39377..39503 /gene="FLN" /evidence=experimental repeat_unit complement(39853..39920) /gene="FLN" /evidence=experimental repeat_region 40983..42150 /rpt_family="CpG Island" /evidence=experimental exon complement(41319..41807) /gene="FLN" /number=2 /evidence=experimental intron complement(41808..44951) /gene="FLN" /evidence=experimental repeat_unit 44072..44362 /gene="FLN" /rpt_family="Alu-Sq" /evidence=experimental repeat_unit 44363..44525 /gene="FLN" /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental repeat_region 44799..45386 /rpt_family="CpG Island" /evidence=experimental repeat_unit 44922..45071 /rpt_family="Alu-J or Alu-S" /evidence=experimental exon complement(44952..45006) /gene="FLN" /number=1 /evidence=experimental repeat_unit 45088..45374 /rpt_family="Alu-J" /evidence=experimental repeat_unit 45397..45690 /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit 45745..46002 /rpt_family="Alu-J" /evidence=experimental repeat_unit 46041..46324 /rpt_family="Alu-Sx" /evidence=experimental repeat_unit 46370..46615 /rpt_family="Alu-Sx" /evidence=experimental repeat_unit complement(46815..47098) /rpt_family="Alu-Sp or Alu-Sq" /evidence=experimental repeat_unit complement(47115..47400) /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental repeat_unit 48025..48328 /rpt_family="Alu-Sx" /evidence=experimental repeat_unit complement(48329..48527) /rpt_family="Alu-Sq" /evidence=experimental repeat_region 48446..50350 /rpt_family="CpG Island" /evidence=experimental repeat_unit complement(48550..48829) /rpt_family="Alu-Sx" /evidence=experimental repeat_unit 48908..49076 /rpt_family="Alu-J or Alu-S" /evidence=experimental exon 49883..50022 /gene="EMD" /note="G00-119-108" /number=1 /evidence=experimental gene 49883..51977 /gene="EMD" CDS join(49941..50022,50146..50250,50398..50475,50690..50823, 51209..51258,51338..51653) /gene="EMD" /codon_start=1 /db_xref="GDB:G00-119-108" /evidence=experimental /product="emerin" /db_xref="PID:g1203970" /translation="MDNYADLSDTELTTLLRRYNIPHGPVVGSTRRLYEKKIFEYETQ RRRLSPPSSSAASSYSFSDLNSTRGDADMYDLPKKEDALLYQSKGYNDDYYEESYFTT RTYGEPESAGPSRAVRQSVTSFPDADAFHHQVHDDDLLSSSEEECKDRERPMYGRDSA YQSITHYRPVSASRSSLDLSYYPTSSSTSFMSSSSSSSSWLTRRAIRPENRAPGAGLG QDRQVPLWGQLLLFLVFVIVLFFIYHFMQAEEGNPF" intron 50023..50145 /gene="EMD" /note="G00-119-108" /evidence=experimental repeat_unit 50106..50240 /gene="EMD" /rpt_family="Alu-J or Alu-S" /evidence=experimental exon 50146..50250 /gene="EMD" /note="G00-119-108" /number=2 intron 50251..50397 /gene="EMD" /note="G00-119-108" /evidence=experimental repeat_unit 50294..50501 /gene="EMD" /rpt_family="Alu-J" /evidence=experimental exon 50398..50475 /gene="EMD" /note="G00-119-108" /number=3 intron 50476..50689 /gene="EMD" /note="G00-119-108" /evidence=experimental repeat_unit 50516..50647 /gene="EMD" /rpt_family="Alu-Sb0" /evidence=experimental exon 50690..50823 /gene="EMD" /note="G00-119-108" /number=4 repeat_unit complement(50709..50800) /gene="EMD" /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental intron 50824..51208 /gene="EMD" /note="G00-119-108" /evidence=experimental repeat_unit 51129..51419 /gene="EMD" /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental exon 51209..51258 /gene="EMD" /note="G00-119-108" /number=5 intron 51259..51337 /gene="EMD" /note="G00-119-108" /evidence=experimental exon 51338..51977 /gene="EMD" /note="G00-119-108" /number=6 /evidence=experimental polyA_signal 51956..51961 /gene="EMD" /note="G00-119-108" /evidence=experimental polyA_site 51977 /gene="EMD" /note="G00-119-108" /evidence=experimental repeat_unit 52250..52528 /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit complement(52533..52821) /rpt_family="Alu-Sx" /evidence=experimental repeat_unit complement(52854..53133) /rpt_family="Alu-Sc" /evidence=experimental repeat_unit 53523..53808 /rpt_family="Alu-Sq" /evidence=experimental repeat_unit 53874..54139 /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit complement(54647..54748) /rpt_family="Alu-Sb0 or lu-Sb1" /evidence=experimental repeat_unit 54775..55064 /rpt_family="Alu-Sq" /evidence=experimental repeat_unit complement(55324..66665) /rpt_family="11 kb repeat2" /evidence=experimental repeat_unit complement(55348..55612) /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental repeat_unit complement(56375..56667) /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit 56894..57181 /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit 57589..57814 /rpt_family="Alu-J" /evidence=experimental repeat_unit complement(57802..57862) /evidence=experimental repeat_unit 57832..57890 /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental repeat_region 59461..61404 /rpt_family="CpG Island" /evidence=experimental repeat_unit complement(61475..61739) /rpt_family="Alu-J" /evidence=experimental repeat_unit 61866..62156 /rpt_family="Alu-Sx" /evidence=experimental repeat_unit 62282..62565 /rpt_family="Alu-Sx" /evidence=experimental repeat_unit complement(62653..62749) /rpt_family="Alu-Sx" /evidence=experimental repeat_unit complement(62765..62861) /rpt_family="Alu-Sp" /evidence=experimental repeat_unit complement(62888..63024) /rpt_family="Alu-Sp" /evidence=experimental repeat_unit complement(63764..64037) /rpt_family="Alu-J" /evidence=experimental repeat_unit complement(64428..64716) /rpt_family="Alu-Sx" /evidence=experimental repeat_unit 65320..65595 /rpt_family="Alu-J" /evidence=experimental repeat_unit 65612..65899 /rpt_family="Alu-Sp" /evidence=experimental repeat_unit complement(66389..66666) /rpt_family="Alu-Sx" /evidence=experimental repeat_unit complement(66702..66974) /rpt_family="Alu-Sq" /evidence=experimental repeat_unit complement(66984..67090) /rpt_family="Alu-Sx" /evidence=experimental repeat_unit complement(67166..67439) /rpt_family="Alu-Sx" /evidence=experimental repeat_unit complement(67463..67578) /rpt_family="Alu-J" /evidence=experimental repeat_unit complement(67738..68029) /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental repeat_region 67900..69462 /rpt_family="CpG Island" /evidence=experimental repeat_unit complement(68360..68648) /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit complement(68853..69099) /rpt_family="Alu-J or Alu-S" /evidence=experimental exon 68941..68977 /gene="QM" /number=1 /evidence=experimental gene 68941..71372 /gene="QM" CDS join(68955..68977,69774..69832,69923..70030,70239..70377, 70899..71061,71137..71289) /gene="QM" /codon_start=1 /evidence=experimental /db_xref="PID:g1203971" /translation="MGRRPARCYRYCKNKPYPKSRFCRGVPDAKIRIFDLGRKKAKVD EFPLCGHMVSDEYEQLSSEALEAARICANKYMVKSCGKDGFHIRVRLHPFHVIRINKM LSCAGADRLQTGMRGAFGKPQGTVARVHIGQVIMSIRTKLQNKEHVIEALRRAKFKFP GRQKIHISKKWGFTKFNADEFEDMVAEKRLIPDGCGVKYIPSRGPLDKWRALHS" intron 68978..69773 /gene="QM" /evidence=experimental repeat_unit complement(69154..69276) /gene="QM" /rpt_family="Alu-J" /evidence=experimental exon 69774..69832 /gene="QM" /number=2 intron 69833..69922 /gene="QM" /evidence=experimental exon 69923..70030 /gene="QM" /number=3 intron 70031..70238 /gene="QM" /evidence=experimental exon 70239..70377 /gene="QM" /number=4 intron 70378..70898 /gene="QM" /evidence=experimental exon 70899..71061 /gene="QM" /number=5 intron 71062..71136 /gene="QM" /evidence=experimental exon 71137..71372 /gene="QM" /number=6 /evidence=experimental polyA_signal 71327..71332 /gene="QM" /evidence=experimental polyA_site 71372 /gene="QM" /evidence=experimental gene 72196..82482 /gene="DNL1L" exon complement(72196..73277) /gene="DNL1L" /number=9 /evidence=experimental polyA_site 72197 /gene="DNL1L" /evidence=experimental polyA_signal complement(72210..72215) /gene="DNL1L" /evidence=experimental repeat_unit complement(72565..72855) /gene="DNL1L" /rpt_family="Alu-J or Alu-S" /evidence=experimental CDS complement(join(73143..73277,73378..73626,73705..73817, 73958..74058,75263..75349,75430..75518,75869..76003)) /gene="DNL1L" /note="DNase I-like protein" /codon_start=1 /evidence=experimental /db_xref="PID:g1203972" /translation="MHYPTALLFLILANGAQAFRICAFNAQRLTLAKVAREQVMDTLV RILARCDIMVLQEVVDSSGSAIPLLLRELNRFDGSGPYSTLSSPQLGRSTYMETYVYF YRSHKTQVLSSYVYNDEDDVFAREPFVAQFSLPSNVLPSLVLVPLHTTPKAVEKELNA LYDVFLEVSQHWQSKDVILLGDFNADCASLTKKRLDKLELRTEPGFHWVIADGEDTTV RASTHCTYARVVLHGERCRSLLHTAAAFDFPTSFQLTEEEALNISDHYPVEVELKLSQ AHSVQPLSLTVLLLLSLLSPQLCPAA" intron complement(73278..73377) /gene="DNL1L" /evidence=experimental exon complement(73378..73626) /gene="DNL1L" /number=8 intron complement(73627..73704) /gene="DNL1L" /evidence=experimental repeat_unit 73642..73920 /gene="DNL1L" /rpt_family="Alu-J or Alu-S" /evidence=experimental exon complement(73705..73817) /gene="DNL1L" /number=7 intron complement(73818..73957) /gene="DNL1L" /evidence=experimental repeat_unit 73945..74034 /gene="DNL1L" /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental exon complement(73958..74058) /gene="DNL1L" /number=6 intron complement(74059..75262) /gene="DNL1L" /evidence=experimental repeat_unit 74063..74230 /gene="DNL1L" /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit 74262..74540 /gene="DNL1L" /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit 74581..74868 /gene="DNL1L" /rpt_family="Alu-J or Alu-S" /evidence=experimental exon complement(75263..75349) /gene="DNL1L" /number=5 intron complement(75350..75429) /gene="DNL1L" /evidence=experimental exon complement(75430..75518) /gene="DNL1L" /number=4 intron complement(75519..75868) /gene="DNL1L" /evidence=experimental exon complement(75869..76090) /gene="DNL1L" /number=3 /evidence=experimental intron complement(76091..79544) /gene="DNL1L" /evidence=experimental repeat_unit 76154..76214 /gene="DNL1L" /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental repeat_unit 76255..76315 /gene="DNL1L" /rpt_family="Alu-Sb0" /evidence=experimental repeat_unit complement(76316..76411) /gene="DNL1L" /rpt_family="L1MB3" /evidence=experimental repeat_unit complement(76417..76701) /gene="DNL1L" /rpt_family="Alu-Sb0" /evidence=experimental repeat_unit complement(76812..77092) /gene="DNL1L" /rpt_family="Alu-Sp" /evidence=experimental repeat_unit complement(77110..77229) /gene="DNL1L" /rpt_family="Alu-J" /evidence=experimental repeat_unit complement(77252..77326) /gene="DNL1L" /rpt_family="Alu-Sc" /evidence=experimental repeat_unit complement(77332..77469) /gene="DNL1L" /rpt_family="L1MA10" /evidence=experimental repeat_unit complement(77497..77571) /gene="DNL1L" /rpt_family="Alu-J" /evidence=experimental repeat_unit complement(77580..77653) /gene="DNL1L" /rpt_family="Alu-J" /evidence=experimental repeat_unit complement(77681..77970) /gene="DNL1L" /rpt_family="Alu-Sx" /evidence=experimental repeat_unit complement(78039..78191) /gene="DNL1L" /rpt_family="Alu-Sc" /evidence=experimental repeat_unit complement(78223..78376) /gene="DNL1L" /rpt_family="Alu-J" /evidence=experimental repeat_unit complement(78377..78664) /gene="DNL1L" /rpt_family="Alu-Sx" /evidence=experimental exon complement(79545..79629) /gene="DNL1L" /number=2 /evidence=experimental intron complement(79630..82321) /gene="DNL1L" /evidence=experimental repeat_unit 80393..80685 /gene="DNL1L" /rpt_family="Alu-Sx" /evidence=experimental repeat_unit 80739..80990 /gene="DNL1L" /rpt_family="Alu-Sx" /evidence=experimental repeat_unit 81022..81304 /gene="DNL1L" /rpt_family="Alu-Sx" /evidence=experimental repeat_unit 81594..81656 /gene="DNL1L" /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit 81661..81777 /gene="DNL1L" /rpt_family="Alu-J" /evidence=experimental repeat_region 81836..82633 /rpt_family="CpG Island" /evidence=experimental exon complement(82322..82482) /gene="DNL1L" /number=1 /evidence=experimental gene complement(83920..90160) /gene="XAP-2" exon complement(83920..83999) /gene="XAP-2" /number=1 /evidence=experimental intron complement(84000..89972) /gene="XAP-2" /note="does not fit consensus" /cons_splice=(5'site:no,3'site:no) /evidence=experimental repeat_unit 84641..84758 /gene="XAP-2" /rpt_family="Alu-J" /evidence=experimental repeat_unit 84842..85129 /gene="XAP-2" /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental repeat_unit 85141..85428 /gene="XAP-2" /rpt_family="Alu-Sp" /evidence=experimental repeat_unit complement(85558..85805) /gene="XAP-2" /rpt_family="Alu-J" /evidence=experimental repeat_unit 86042..86318 /gene="XAP-2" /rpt_family="Alu-Sx" /evidence=experimental repeat_unit 86408..86547 /gene="XAP-2" /rpt_family="Alu-Sp" /evidence=experimental repeat_unit 86548..86765 /gene="XAP-2" /rpt_family="Alu-Sb2" /evidence=experimental repeat_unit 86766..86871 /gene="XAP-2" /rpt_family="Alu-Sb0" /evidence=experimental repeat_unit 87284..87572 /gene="XAP-2" /rpt_family="Alu-Sb2" /evidence=experimental exon complement(89973..90160) /gene="XAP-2" /number=2 /evidence=experimental repeat_unit complement(91317..91468) /rpt_family="Alu-Sb2" /evidence=experimental repeat_unit complement(91475..91590) /rpt_family="L1MB3" /evidence=experimental repeat_unit complement(91648..91897) /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental repeat_unit 91906..92097 /rpt_family="Alu-Sq" /evidence=experimental repeat_unit complement(92113..92402) /rpt_family="Alu-J" /evidence=experimental repeat_unit 92403..92502 /rpt_family="Alu-Sp or Alu-Sq" /evidence=experimental repeat_unit 92549..92672 /evidence=experimental repeat_unit 92676..92960 /rpt_family="Alu-Sq" /evidence=experimental repeat_unit 92974..93263 /rpt_family="Alu-Sp" /evidence=experimental repeat_unit 93275..93561 /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental repeat_unit 93579..93737 /rpt_family="Alu-Sx" /evidence=experimental repeat_unit complement(93649..93926) /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit 93779..94065 /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental repeat_unit 94148..94418 /rpt_family="L1MB3" /evidence=experimental repeat_unit 94419..94713 /rpt_family="Alu-Sq" /evidence=experimental repeat_unit complement(94773..94878) /evidence=experimental repeat_unit complement(94879..95156) /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental repeat_unit 95255..95539 /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental repeat_unit 95579..95786 /rpt_family="L1MB7" /evidence=experimental repeat_unit 95787..96061 /rpt_family="Alu-Sp" /evidence=experimental repeat_unit complement(96374..96655) /rpt_family="Alu-Sq" /evidence=experimental repeat_unit complement(96463..96719) /rpt_family="Alu-Sx" /evidence=experimental repeat_unit complement(96720..97008) /rpt_family="Alu-Sb0" /evidence=experimental repeat_unit complement(97022..97310) /rpt_family="Alu-Sq" /evidence=experimental repeat_unit 97311..97365 /rpt_family="Alu-J" /evidence=experimental repeat_unit complement(98088..98375) /rpt_family="Alu-Sc" /evidence=experimental repeat_unit 98635..98751 /rpt_family="Alu-J" /evidence=experimental repeat_region 98783..99468 /rpt_family="CpG Island" /evidence=experimental repeat_unit complement(100546..100848) /rpt_family="Alu-Sx" /evidence=experimental repeat_unit 100658..100943 /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental repeat_unit 101179..101237 /rpt_family="Alu-Sp" /evidence=experimental repeat_unit 101243..101374 /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_region 106826..108158 /rpt_family="CpG Island" /evidence=experimental gene 107607..113280 /gene="GDI" exon 107607..107728 /gene="GDI" /number=1 /evidence=experimental CDS join(107684..107728,108952..109059,109194..109293, 109435..109569,110370..110568,110804..110935, 111525..111624,112052..112223,112540..112684, 112803..112857,112949..113101) /gene="GDI" /codon_start=1 /function="GDP-dissociation inhibitor" /evidence=experimental /db_xref="PID:g1203973" /translation="MDEEYDVIVLGTGLTECILSGIMSVNGKKVLHMDRNPYYGGESS SITPLEELYKRFQLLEGPPESMGRGRDWNVDLIPKFLMANGQLVKMLLYTEVTRYLDF KVVEGSFVYKGGKIYKVPSTETEALASNLMGMFEKRRFRKFLVFVANFDENDPKTFEG VDPQTTSMRDVYRKFDLGQDVIDFTGHALALYRTDDYLDQPCLETVNRIKLYSESLAR YGKSPYLYPLYGLGELPQGFARLSAIYGGTYMLNKPVDDIIMENGKVVGVKSEGEVAR CKQLICDPSYIPDRVRKAGQVIRIICILSHPIKNTNDANSCQIIIPQNQVNRKSDIYV CMISYAHNVAAQGKYIAIASTTVETTDPEKEVEPALELLEPIDQKFVAISDLYEPIDD GCESQVFCSCSYDATTHFETTCNDIKDIYKRMAGTAFDFENMKRKQNDVFGEAEQ" intron 107729..108951 /gene="GDI" /evidence=experimental exon 108952..109059 /gene="GDI" /number=2 intron 109060..109193 /gene="GDI" /evidence=experimental exon 109194..109293 /gene="GDI" /number=3 intron 109294..109434 /gene="GDI" /evidence=experimental repeat_unit complement(109338..109626) /gene="GDI" /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental exon 109435..109569 /gene="GDI" /number=4 intron 109570..110369 /gene="GDI" /evidence=experimental repeat_unit 109825..110113 /gene="GDI" /rpt_family="Alu-Sx" /evidence=experimental exon 110370..110568 /gene="GDI" /number=5 intron 110569..110803 /gene="GDI" /evidence=experimental exon 110804..110935 /gene="GDI" /number=6 intron 110936..111524 /gene="GDI" /evidence=experimental exon 111525..111624 /gene="GDI" /number=7 intron 111625..112051 /gene="GDI" /evidence=experimental exon 112052..112223 /gene="GDI" /number=8 intron 112224..112539 /gene="GDI" /evidence=experimental exon 112540..112684 /gene="GDI" /number=9 intron 112685..112802 /gene="GDI" /evidence=experimental exon 112803..112857 /gene="GDI" /number=10 intron 112858..112948 /gene="GDI" /evidence=experimental exon 112949..113280 /gene="GDI" /number=11 /evidence=experimental repeat_region 114316..114957 /rpt_family="CpG Island" /evidence=experimental gene 116240..121077 /gene="XAP-5" CDS join(<116240..116340,116844..116989,118913..118989, 119119..119185,119322..119383,119650..119726, 120110..120164,120312..120360,120458..120528, 120638..120748,120853..120861) /gene="XAP-5" /codon_start=1 /evidence=experimental /db_xref="PID:g1203974" /translation="GLVTLNDMKAKQEALVKEREKQLAKKEQSKELQMKLEKLREKER KKEAKRKISSLSFTLEEEEEGGEEEEEAAMYEEEMEREEITTKKRKLGKNPDVDTSFL PDRDREEEENRLREELRQEWEAKQEKIKSEEIEITFSYWDGSGHRRTVKMRKGNTMQQ FLQKALEILRKDFSELRSAGVEQLMYIKEDLIIPHHHSFYDFIVTKARGKSGPLFNFD VHDDVRLLSDATVEKDESHAGKVVLRSWYEKNKHIFPASRWEPYDPEKKWDKYTIR" exon 116240..116340 /gene="XAP-5" /number=1 intron 116341..116843 /gene="XAP-5" /evidence=experimental exon 116844..116989 /gene="XAP-5" /number=2 intron 116990..118912 /gene="XAP-5" /evidence=experimental repeat_unit complement(118077..118208) /gene="XAP-5" /rpt_family="Alu-J" /evidence=experimental repeat_unit complement(118214..118272) /gene="XAP-5" /rpt_family="Alu-J" /evidence=experimental repeat_unit complement(118508..118793) /gene="XAP-5" /rpt_family="Alu-J" /evidence=experimental repeat_unit 118603..118905 /gene="XAP-5" /rpt_family="Alu-J or Alu-S" /evidence=experimental exon 118913..118989 /gene="XAP-5" /number=3 intron 118990..119118 /gene="XAP-5" /evidence=experimental exon 119119..119185 /gene="XAP-5" /number=4 intron 119186..119321 /gene="XAP-5" /evidence=experimental exon 119322..119383 /gene="XAP-5" /number=5 intron 119384..119649 /gene="XAP-5" /evidence=experimental exon 119650..119726 /gene="XAP-5" /number=6 intron 119727..120109 /gene="XAP-5" /evidence=experimental exon 120110..120164 /gene="XAP-5" /number=7 intron 120165..120311 /gene="XAP-5" /evidence=experimental exon 120312..120360 /gene="XAP-5" /number=8 intron 120361..120457 /gene="XAP-5" /evidence=experimental exon 120458..120528 /gene="XAP-5" /number=9 intron 120529..120637 /gene="XAP-5" /evidence=experimental exon 120638..120748 /gene="XAP-5" /number=10 repeat_unit complement(120700..120816) /gene="XAP-5" /rpt_family="Alu-Sc" /evidence=experimental intron 120749..120852 /gene="XAP-5" /evidence=experimental exon 120853..121077 /gene="XAP-5" /number=11 /evidence=experimental repeat_unit 121076..121363 /rpt_family="Alu-Sq" /evidence=experimental repeat_unit complement(122086..122246) /rpt_family="Alu-J" /evidence=experimental repeat_unit 122141..122429 /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit 122443..122731 /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit 122698..122795 /rpt_family="Alu-J" /evidence=experimental repeat_unit 122796..123077 /rpt_family="Alu-Sp" /evidence=experimental repeat_unit complement(123390..123664) /rpt_family="Alu-Sp" /evidence=experimental repeat_unit complement(123665..123872) /rpt_family="L1MB7" /evidence=experimental repeat_unit complement(123912..124196) /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental repeat_unit 124295..124572 /rpt_family="Alu-Sq" /evidence=experimental repeat_unit complement(124738..125032) /rpt_family="Alu-Sc" /evidence=experimental repeat_unit complement(125386..125524) /rpt_family="Alu-J" /evidence=experimental repeat_unit 125525..125802 /rpt_family="Alu-J" /evidence=experimental repeat_unit complement(125803..125872) /rpt_family="Alu-J" /evidence=experimental repeat_unit complement(125890..126176) /rpt_family="Alu-Sq" /evidence=experimental repeat_unit complement(126188..126477) /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit complement(126491..126775) /rpt_family="Alu-Sx" /evidence=experimental repeat_unit complement(126779..126902) /evidence=experimental repeat_unit complement(126949..127048) /rpt_family="Alu-Sx" /evidence=experimental repeat_unit 127049..127338 /rpt_family="Alu-Sb1" /evidence=experimental repeat_unit complement(127354..127545) /rpt_family="Alu-Sq" /evidence=experimental repeat_unit 127554..127803 /rpt_family="Alu-J" /evidence=experimental repeat_unit 127861..127976 /rpt_family="L1ME3a" /evidence=experimental repeat_unit 127983..128134 /rpt_family="Alu-J" /evidence=experimental repeat_region 128187..129236 /rpt_family="CpG Island" /evidence=experimental repeat_unit complement(131879..132167) /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental repeat_unit complement(132580..132685) /rpt_family="Alu-Sp or Alu-Sq" /evidence=experimental repeat_unit complement(132686..132903) /rpt_family="Alu-Sq" /evidence=experimental repeat_unit complement(132904..133043) /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental repeat_unit complement(133133..133409) /rpt_family="Alu-Sq" /evidence=experimental repeat_unit 133646..133893 /rpt_family="Alu-Sp" /evidence=experimental repeat_unit complement(134023..134310) /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit complement(134322..134609) /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental repeat_unit complement(134694..134810) /rpt_family="Alu-Sb0" /evidence=experimental repeat_unit complement(137674..137788) /rpt_family="Alu-Sx" /evidence=experimental repeat_unit complement(137795..137855) /rpt_family="Alu-J" /evidence=experimental repeat_unit complement(138147..138427) /rpt_family="Alu-Sx" /evidence=experimental repeat_unit complement(138461..138710) /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit complement(138766..139056) /rpt_family="Alu-J" /evidence=experimental repeat_unit 140789..141073 /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit 141077..141227 /rpt_family="Alu-Sb2" /evidence=experimental repeat_unit 141262..141411 /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit 141483..141769 /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental repeat_unit 141800..141870 /rpt_family="Alu-Sq" /evidence=experimental repeat_unit 141882..141953 /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental repeat_unit 141984..142118 /rpt_family="L1MA10" /evidence=experimental repeat_unit 142127..142198 /rpt_family="Alu-J" /evidence=experimental repeat_unit 142224..142340 /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit 142361..142638 /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental repeat_unit 142752..143033 /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit complement(143138..143195) /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit complement(143239..143296) /rpt_family="Alu-Sp" /evidence=experimental repeat_unit complement(144586..144870) /rpt_family="Alu-J" /evidence=experimental repeat_unit complement(144914..145189) /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental repeat_unit complement(145224..145388) /rpt_family="Alu-J" /evidence=experimental repeat_unit complement(145420..145506) /rpt_family="Alu-J" /evidence=experimental repeat_unit complement(145534..145809) /rpt_family="Alu-Sp" /evidence=experimental repeat_unit 146598..146885 /rpt_family="Alu-Sp" /evidence=experimental repeat_region 148990..149796 /rpt_family="CpG Island" /evidence=experimental repeat_unit 150177..150296 /rpt_family="Alu-J" /evidence=experimental repeat_unit 150354..150597 /rpt_family="Alu-J" /evidence=experimental repeat_unit 150805..151086 /rpt_family="Alu-Sx" /evidence=experimental repeat_unit 151421..151708 /rpt_family="Alu-Sp" /evidence=experimental repeat_unit 151871..151983 /rpt_family="Alu-J" /evidence=experimental repeat_unit 152010..152280 /rpt_family="Alu-Sx" /evidence=experimental repeat_unit 152359..152462 /rpt_family="Alu-J" /evidence=experimental repeat_unit 152475..152744 /rpt_family="Alu-J" /evidence=experimental repeat_unit 152783..153057 /rpt_family="Alu-J" /evidence=experimental repeat_unit complement(153550..153834) /rpt_family="Alu-J" /evidence=experimental repeat_unit complement(153854..154126) /rpt_family="Alu-J or Alu-S" /evidence=experimental polyA_site 154142 /gene="GdX" /evidence=experimental gene 154142..157038 /gene="GdX" exon complement(154142..156069) /gene="GdX" /number=4 /evidence=experimental polyA_signal complement(154159..154164) /gene="GdX" /evidence=experimental polyA_site 154405 /gene="GdX" /evidence=experimental polyA_signal complement(154421..154426) /gene="GdX" /evidence=experimental repeat_unit 154733..155016 /gene="GdX" /rpt_family="Alu-Sx" /evidence=experimental repeat_unit 155412..155682 /gene="GdX" /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental CDS complement(join(155959..156069,156191..156399, 156647..156752,156956..157003)) /gene="GdX" /note="ubiquitin-like protein" /codon_start=1 /evidence=experimental /db_xref="PID:g1203975" /translation="MQLTVKALQGRECSLQVPEDELVSTLKQLVSEKLNVPVRQQRLL FKGKALADGKRLSDYSIGPNSKLNLVVKPLEKVLLEEGEAQRLADSPPPQVWQLISKV LARHFSAADASRVLEQLQRDYERSLSRLTLDDIERLASRFLHPEVTETMEKGFSK" intron complement(156070..156190) /gene="GdX" /evidence=experimental exon complement(156191..156399) /gene="GdX" /number=3 repeat_region 156388..157495 /rpt_family="CpG Island" /evidence=experimental intron complement(156400..156646) /gene="GdX" /evidence=experimental repeat_unit 156425..156558 /gene="GdX" /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental repeat_unit 156588..156681 /gene="GdX" /rpt_family="Alu-J or Alu-S" /evidence=experimental exon complement(156647..156752) /gene="GdX" /number=2 repeat_unit 156700..156793 /gene="GdX" /rpt_family="Alu-J or Alu-S" /evidence=experimental intron complement(156753..156955) /gene="GdX" /evidence=experimental repeat_unit complement(156884..157164) /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental exon complement(156956..157038) /gene="GdX" /number=1 /evidence=experimental repeat_unit complement(157293..157580) /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit 157710..157971 /rpt_family="Alu-Sx" /evidence=experimental exon complement(157728..159501) /gene="P3" /number=2 /evidence=experimental gene complement(157728..161077) /gene="P3" CDS complement(157926..159359) /gene="P3" /codon_start=1 /evidence=experimental /db_xref="PID:g1203976" /translation="MVLMQDKGSSQQWPGLGGEGGGTGPLSMLRAALLLISLPWGAQG TASTSLSTAGGHTVPPTGGRYLSIGDGSVMEFEFPEDSEGIIVISSQYPGQANRTAPG PMLRVTSLDTEVLTIKNVSAITWGGGGGFVVSIHSGLAGLAPLHIQLVDAHEAPPTLI EERRDFCIKVSPAEDTPATLSADLAHFSENPILYLLLPLIFVNKCSFGCKVELEVLKG LMQSPQPMLLGLLGQFLVMPLYAFLMAKVFMLPKALALGLIITCSSPGGGGSYLFSLL LGGDVTLAISMTFLSTVAATGFLPLSSAIYSRLLSIHETLHVPISKILGTLLFIAIPI AVGVLIKSKLPKFSQLLLQVVKPFSFVLLLGGLFLAYRMGVFILAGIRLPIVLVGITV PLVGLLVGYCLATCLKLPVAQRRTVSIEVGVQNSLLALAMLQLSLRRLQADYASQAPF IVALSGTSEMLALVIGHFIYSSLFPVP" intron complement(159502..160725) /gene="P3" /evidence=experimental repeat_region 160697..161402 /rpt_family="CpG Island" /evidence=experimental exon complement(160726..161077) /gene="P3" /number=1 /evidence=experimental repeat_unit complement(161559..161614) /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental repeat_unit 161587..161644 /evidence=experimental repeat_unit complement(161635..161857) /rpt_family="Alu-Sx" /evidence=experimental repeat_unit complement(162268..162552) /rpt_family="Alu-Sx" /evidence=experimental repeat_unit 162782..163071 /rpt_family="Alu-Sc" /evidence=experimental repeat_unit 163837..164098 /rpt_family="Alu-J" /evidence=experimental repeat_unit complement(164385..164671) /rpt_family="Alu-Sx" /evidence=experimental repeat_unit 164701..164799 /rpt_family="Alu-J" /evidence=experimental repeat_unit complement(165310..165572) /rpt_family="Alu-J" /evidence=experimental repeat_unit complement(165641..165923) /rpt_family="Alu-Sp" /evidence=experimental repeat_unit 166316..166592 /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental repeat_unit 166628..166913 /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental repeat_unit complement(166921..167196) /rpt_family="Alu-Sx" /evidence=experimental repeat_unit complement(168030..168317) /rpt_family="Alu-Sx" /evidence=experimental repeat_unit 168649..168737 /rpt_family="Alu-Sb1" /evidence=experimental repeat_unit complement(168802..168930) /rpt_family="Alu-Sx" /evidence=experimental repeat_unit complement(168948..169152) /rpt_family="Alu-J" /evidence=experimental repeat_unit complement(169209..169340) /rpt_family="Alu-Sq" /evidence=experimental repeat_unit complement(170373..170538) /rpt_family="Alu-J" /evidence=experimental repeat_unit 170620..170896 /rpt_family="Alu-J" /evidence=experimental repeat_unit 170922..171117 /rpt_family="Alu-Sc" /evidence=experimental repeat_unit complement(171121..171421) /rpt_family="Alu-J" /evidence=experimental repeat_unit 172049..172331 /rpt_family="Alu-Sx" /evidence=experimental repeat_unit 172351..172631 /rpt_family="Alu-Sq" /evidence=experimental repeat_unit complement(172834..173076) /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit complement(173125..173405) /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental repeat_unit complement(173447..173701) /rpt_family="Alu-Sq" /evidence=experimental repeat_unit complement(173759..174049) /rpt_family="Alu-Sc" /evidence=experimental repeat_unit complement(174075..174358) /rpt_family="Alu-Sx" /evidence=experimental repeat_unit complement(174378..174524) /rpt_family="Alu-J" /evidence=experimental repeat_unit complement(174924..175083) /rpt_family="Alu-Sq" /evidence=experimental repeat_unit complement(175087..175374) /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental polyA_site 176586 /gene="2_19" /evidence=experimental gene 176586..186448 /gene="2_19" exon complement(176586..177317) /gene="2_19" /number=9 /evidence=experimental polyA_signal complement(176599..176604) /gene="2_19" /evidence=experimental CDS complement(join(177222..177317,177615..177741, 177818..177902,178223..178273,178701..178759, 178886..179009,182259..182282,183222..183335, 186159..186171)) /gene="2_19" /codon_start=1 /evidence=experimental /db_xref="PID:g1203977" /translation="MRLAGPLRIVVLVVSVGVTWIVVSILLGGPGSGFPRIQQLFTSP ESSVTAAPRARKYKCGLPQPCPEEHLAFRVVSGAANVIGPKICLEDKMLMSSVKDNVG RGLNIALVNGVSGELIEARAFDMWAGDVNDLLKFIRPLHEGTLVFVASYDDPATKMNE ETRKLFSELGSRNAKELAFRDSWVFVGAKGVQNKSPFEQHVKNSKHSNKYEGCPEALE MEGCIPRRSTAS" intron complement(177318..177614) /gene="2_19" /evidence=experimental exon complement(177615..177741) /gene="2_19" /number=8 intron complement(177742..177817) /gene="2_19" /evidence=experimental exon complement(177818..177902) /gene="2_19" /number=7 intron complement(177903..178222) /gene="2_19" /evidence=experimental exon complement(178223..178273) /gene="2_19" /number=6 intron complement(178274..178700) /gene="2_19" /evidence=experimental exon complement(178701..178759) /gene="2_19" /number=5 intron complement(178760..178885) /gene="2_19" /evidence=experimental exon complement(178886..179009) /gene="2_19" /number=4 intron complement(179010..182258) /gene="2_19" /evidence=experimental repeat_unit 179529..179593 /gene="2_19" /evidence=experimental repeat_unit 179750..179827 /gene="2_19" /evidence=experimental repeat_unit complement(179946..180069) /gene="2_19" /evidence=experimental repeat_unit complement(180177..180241) /gene="2_19" /evidence=experimental repeat_unit 180402..180488 /gene="2_19" /evidence=experimental repeat_unit 180513..180695 /gene="2_19" /rpt_family="Alu-J" /evidence=experimental repeat_unit 181069..181343 /gene="2_19" /rpt_family="Alu-Sx" /evidence=experimental exon complement(182259..182282) /gene="2_19" /number=3 intron complement(182283..183221) /gene="2_19" /evidence=experimental repeat_unit 182543..182641 /gene="2_19" /rpt_family="Alu-J" /evidence=experimental repeat_unit 182691..182841 /gene="2_19" /rpt_family="Alu-Sc" /evidence=experimental exon complement(183222..183335) /gene="2_19" /number=2 intron complement(183336..186158) /gene="2_19" /evidence=experimental repeat_unit 183749..184032 /gene="2_19" /rpt_family="Alu-Sx" /evidence=experimental repeat_unit complement(184655..184809) /gene="2_19" /rpt_family="Alu-Sx" /evidence=experimental repeat_unit complement(184813..185099) /gene="2_19" /rpt_family="Alu-Sb1" /evidence=experimental repeat_unit complement(185103..185240) /gene="2_19" /rpt_family="Alu-Sx" /evidence=experimental repeat_unit complement(185256..185324) /gene="2_19" /rpt_family="Alu-Sb2" /evidence=experimental repeat_unit complement(185280..185559) /gene="2_19" /rpt_family="Alu-Sx" /evidence=experimental repeat_unit complement(185574..185861) /gene="2_19" /rpt_family="Alu-Sq" /evidence=experimental exon complement(186159..186448) /gene="2_19" /number=1 /evidence=experimental repeat_region 186412..186922 /rpt_family="CpG Island" /evidence=experimental repeat_unit complement(187945..188231) /rpt_family="Alu-Sp" /evidence=experimental repeat_unit complement(189107..189382) /rpt_family="Alu-Sx" /evidence=experimental repeat_unit complement(189488..189765) /rpt_family="Alu-Sx" /evidence=experimental repeat_unit 190134..190224 /rpt_family="Alu-J" /evidence=experimental repeat_unit complement(190406..190691) /rpt_family="Alu-Sx" /evidence=experimental repeat_unit 190778..191063 /rpt_family="Alu-J" /evidence=experimental repeat_unit 191076..191362 /rpt_family="Alu-Sx" /evidence=experimental repeat_unit 191394..191438 /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit 191622..192089 /evidence=experimental repeat_unit 192282..192461 /evidence=experimental repeat_unit 192522..192694 /rpt_family="Alu-Sq" /evidence=experimental repeat_unit complement(193410..193629) /rpt_family="Alu-Sx" /evidence=experimental repeat_unit complement(193813..194099) /rpt_family="Alu-Sq" /evidence=experimental repeat_unit 194366..194642 /rpt_family="Alu-Sx" /evidence=experimental repeat_unit 194669..194736 /evidence=experimental repeat_unit 194777..194865 /rpt_family="Alu-J" /evidence=experimental repeat_unit 194900..195178 /rpt_family="Alu-Sx" /evidence=experimental repeat_unit 195192..195308 /evidence=experimental repeat_unit complement(195579..195862) /rpt_family="Alu-Sx" /evidence=experimental repeat_unit complement(196154..196435) /rpt_family="Alu-J" /evidence=experimental repeat_unit complement(197162..197269) /rpt_family="Alu-J" /evidence=experimental repeat_unit 198168..198414 /rpt_family="Alu-Sc" /evidence=experimental repeat_unit 198508..198782 /rpt_family="Alu-Sx" /evidence=experimental repeat_unit 198824..199109 /rpt_family="Alu-Sq" /evidence=experimental repeat_unit complement(199739..199955) /rpt_family="Alu-Sx" /evidence=experimental repeat_unit 200800..201085 /rpt_family="Alu-J or Alu-S" /evidence=experimental polyA_site 201336 /gene="G6PD" /note="G00-120-621" /evidence=experimental gene 201336..217196 /gene="G6PD" exon complement(201336..202035) /gene="G6PD" /note="G00-120-621" /number=13 /evidence=experimental polyA_signal complement(201349..201354) /gene="G6PD" /note="G00-120-621" /evidence=experimental CDS complement(join(201945..202035,202133..202225, 202331..202407,202512..202747,202887..203073, 203521..203614,203980..204105,204283..204441, 205112..205329,205879..205987,206083..206120, 215978..216097)) /gene="G6PD" /codon_start=1 /db_xref="GDB:G00-120-621" /evidence=experimental /db_xref="PID:g1203978" /translation="MAEQVALSRTQVCGILREELFQGDAFHQSDTHIFIIMGASGDLA KKKIYPTIWWLFRDGLLPENTFIMGYARSRLTVADIRKQSEPFFKATPEEKLKLEDFF ARNSYVAGQYDDAASYQRLNSHMDALHLGSQANRLFYLALPPTVYEAVTKNIHESCMS QIGWNRIIVEKPFGRDLQSSDRLSNHISSLFREDQIYRIDHYLGKEMVQNLMVLRFAN RIFGPIWNRDNIACVILTFKEPFGTEGRGGYFDEFGIIRDVMQNHLLQMLCLVAMEKP ASTNSDDVRDEKVKVLKCISEVQANNVVLGQYVGNPDGEGEATKGYLDDPTVPRGSTT ATFAAVVLYVENERWDGVPFILRCGKALNERKAEVRLQFHDVAGDIFHQQCKRNELVI RVQPNEAVYTKMMTKKPGMFFNPEESELDLTYGNRYKNVKLPDAYERLILDVFCGSQM HFVRSDELREAWRIFTPLLHQIELEKPKPIPYIYGSRGPTEADELMKRVGFQYEGTYK WVNPHKL" intron complement(202036..202132) /gene="G6PD" /note="G00-120-621" /evidence=experimental exon complement(202133..202225) /gene="G6PD" /note="G00-120-621" /number=12 intron complement(202226..202330) /gene="G6PD" /note="G00-120-621" /evidence=experimental exon complement(202331..202407) /gene="G6PD" /note="G00-120-621" /number=11 intron complement(202408..202511) /gene="G6PD" /note="G00-120-621" /evidence=experimental exon complement(202512..202747) /gene="G6PD" /note="G00-120-621" /number=10 intron complement(202748..202886) /gene="G6PD" /note="G00-120-621" /evidence=experimental exon complement(202887..203073) /gene="G6PD" /note="G00-120-621" /number=9 intron complement(203074..203520) /gene="G6PD" /note="G00-120-621" /evidence=experimental exon complement(203521..203614) /gene="G6PD" /note="G00-120-621" /number=8 intron complement(203615..203979) /gene="G6PD" /note="G00-120-621" /evidence=experimental exon complement(203980..204105) /gene="G6PD" /note="G00-120-621" /number=7 intron complement(204106..204282) /gene="G6PD" /note="G00-120-621" /evidence=experimental exon complement(204283..204441) /gene="G6PD" /note="G00-120-621" /number=6 intron complement(204442..205111) /gene="G6PD" /note="G00-120-621" /evidence=experimental exon complement(205112..205329) /gene="G6PD" /note="G00-120-621" /number=5 intron complement(205330..205878) /gene="G6PD" /note="G00-120-621" /evidence=experimental exon complement(205879..205987) /gene="G6PD" /note="G00-120-621" /number=4 intron complement(205988..206082) /gene="G6PD" /note="G00-120-621" /evidence=experimental exon complement(206083..206120) /gene="G6PD" /note="G00-120-621" /number=3 intron complement(206121..215977) /gene="G6PD" /note="G00-120-621" /evidence=experimental repeat_unit 206503..206777 /gene="G6PD" /rpt_family="Alu-Sx" /evidence=experimental repeat_unit 206966..207250 /gene="G6PD" /rpt_family="Alu-Sc" /evidence=experimental repeat_unit 207285..207561 /gene="G6PD" /rpt_family="Alu-Sp" /evidence=experimental repeat_unit complement(207947..208222) /gene="G6PD" /rpt_family="Alu-J" /evidence=experimental repeat_unit 208292..208579 /gene="G6PD" /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit 208588..208705 /gene="G6PD" /rpt_family="Alu-Sp" /evidence=experimental repeat_unit 208709..208990 /gene="G6PD" /rpt_family="Alu-Sx" /evidence=experimental repeat_unit complement(209081..209174) /gene="G6PD" /rpt_family="Alu-J" /evidence=experimental repeat_unit complement(209193..209286) /gene="G6PD" /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit complement(209316..209449) /gene="G6PD" /rpt_family="Alu-Sb0 or Alu-Sb1" /evidence=experimental repeat_unit complement(209675..209959) /gene="G6PD" /rpt_family="Alu-Sx" /evidence=experimental repeat_unit 210006..210127 /gene="G6PD" /rpt_family="Alu-Sq" /evidence=experimental repeat_unit 210131..210414 /gene="G6PD" /rpt_family="Alu-Sc" /evidence=experimental repeat_unit 210419..210560 /gene="G6PD" /rpt_family="Alu-Sx" /evidence=experimental repeat_unit complement(210854..211139) /gene="G6PD" /rpt_family="Alu-Sx" /evidence=experimental repeat_unit 211747..212019 /gene="G6PD" /rpt_family="Alu-Sq" /evidence=experimental repeat_unit 212016..212087 /gene="G6PD" /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit 212039..212324 /gene="G6PD" /rpt_family="Alu-J or Alu-S" /evidence=experimental repeat_unit complement(212815..213076) /gene="G6PD" /rpt_family="Alu-J" /evidence=experimental repeat_unit complement(213643..213813) /gene="G6PD" /rpt_family="Alu-Sx" /evidence=experimental repeat_unit complement(213820..214099) /gene="G6PD" /rpt_family="Alu-Sb2" /evidence=experimental repeat_unit complement(214115..214398) /gene="G6PD" /rpt_family="Alu-Sc" /evidence=experimental repeat_unit complement(214402..214533) /gene="G6PD" /rpt_family="Alu-Sq" /evidence=experimental repeat_unit 215496..215782 /gene="G6PD" /rpt_family="Alu-Sq" /evidence=experimental exon complement(215978..216105) /gene="G6PD" /note="G00-120-621" /number=2 /evidence=experimental intron complement(216106..216730) /gene="G6PD" /note="G00-120-621" /evidence=experimental repeat_region 216617..217876 /rpt_family="CpG Island" /evidence=experimental exon complement(216731..217196) /gene="G6PD" /note="G00-120-621" /number=1 /evidence=experimental repeat_unit complement(218517..218804) /rpt_family="Alu-Sx" /evidence=experimental repeat_unit complement(218945..219219) /rpt_family="Alu-Sc" /evidence=experimental repeat_unit complement(219225..219390) /rpt_family="Alu-Sx" /evidence=experimental BASE COUNT 46881 a 61395 c 62847 g 48324 t ORIGIN 1 aggttcattc gctggcagtg tcgggagtgc cccagagtgg gaagtccgag gaattgctgg 61 atatgtatgg aattagtgcc agacatatca tagtggccgt gaaatgcatg ttgctgaact 121 aaaatagctg ttagctttgg tcttttggcc tctttaccct gtgtttatgt ttgttccaaa 181 accatcattt aaatctctac tgtcacattt tgtttcttaa aagcaaagcc agctaacacc 241 ttcattcatc cctagttcgg aaattcaagc taactactta ccctttaaac tgtcactgca 301 tatgcaagta ccgctctaat ttttggatca ttaaagggag ttacacaact tttaagtgaa 361 aaaaataggt aacaaaacaa ccacctgata gtaagttttc tgataagact atagataagt 421 ggtagaggta atcaattctt ccgaagtgtt tccttcgtga ataactggta gaggtaatag 481 ttttttcaat gtatttcctt catgagtaaa gaaaatgtgg attgaagtat agattccagt 541 agcctagttt ccacagcacg ataacaccat gacgcctact gctgttccca ccttgggatt 601 ctgtgtgctg ccatcccacc tgcagctgcc ctggaattcc cttcgctgtt tgccttcatc 661 tccctccacg tttgagaggc tgtcaggcag cagcgaaagc ttgttaggat gtcctgtgct 721 gcttgtgatg agagcctcca cactgtactg ttcaagtcaa tgttaataaa gcatttcaaa 781 accagctgct ttattcagca cgtgccttct gtgtgaatca gtttctcagt ggtcagttcc 841 taaataagga aagagctttt atccaaacca gaaagagtga gccatggccc ccaccagtaa 901 acctgggaaa gaataatggg ctgcagagga agcactcgct cgaggacagc acctggcctg 961 aaaagctgtc cctgagctga cgccttcctg tcagctataa gcacataaag tatgtggcta 1021 gaggccaggc gttgtggctc acgcttgtaa tcccagcact ttgggaggcc aaggcaggtg 1081 gatcacctga ggtcaggagt tcaagaccag cctggacaac atggtgaaac cccctctcta 1141 ctaaaaaaga caaaaattag ccagacatgg tggtgcacac ttacaatccc agctactcgg 1201 gaggctgagg caggagaatt gtttgaaccc aggaagcgga ggttgcagtg agccgagatc 1261 ctgccattgt acgccagcct aggagacaga gcgagactct acctccaaaa aaaaatttaa 1321 gaagtatgta tctagaatca aagtgtcaac cagaaaaaga tgaacctgag cttcaaaact 1381 agatttaaaa ttctactaga tgtgtctgct gtgtcacatt cctgttttgg ctttggtttt 1441 gttttgagac agggtctcac tatgttgccc aggctggtct caaactcctg gcctcaagtg 1501 atcctcacgc ctggaccttc caaggtgctg gggttagaat tacaggcgtg agccactata 1561 cccagcccat tccttatttt tattgcccaa attaagtggt cagagatctt tggagtcatg 1621 accagtgtga ctgaggaatt gttctagaat gtgcgggata gagatggaag cactggcctg 1681 gggcttggag tcctcaggac ctcctcctac tcctgccctt gtaagtcact gagctggttt 1741 cctggcctgt aaaatgagca tgatgatggc tctgcttatt tcataagatt tcaaaattta 1801 ttttgtaaaa atagtttcta tagggtgatt gcagaaaatt tagaaaatgc aaagaggcaa 1861 aaatgtttta cctcacttat ctcactactt cgaggtaatc acagcttctc acactccaga 1921 caattttact catttaaggt atacaattca gtggttttca ttctttttat ggccacatac 1981 tattctattg tatggagata acacatttta ttcttccatt cagttgatga atgtggatta 2041 gttccacttt tgtctcttgt gaatagcgtt ctgacaattc acatattttg tgaagatgta 2101 ggtttcacaa aatatgtgaa agattttgtg tagatgtaga ttttcccttc ttatggacat 2161 gtatactagg agtggaattg aattgctgtg tcgtaggata attctgtgta ccctttcaag 2221 agtgtggtgg ggggtggtgt ttaaataccg ttttgtgagg agtagatgca gtgacactga 2281 aggtggctgt aaagcattac tttagaggaa ggcattctaa cagacaagcc acgcttgctt 2341 gtattcaggt gcaaaatgtc aggtttggtg agaccacagg acacataagc gcacggtggc 2401 aacaaatgga gcaacagatt ccatgattaa aatgtgcaat ttctaaacac aggaaagagc 2461 ccccaaagcc tcacttcagt cccctagaag gttctaggca gaaacagtag ggcagatcaa 2521 aggaatgaga gcataggagc catgatcacc cagcgagggg agtgagccct gttgccagct 2581 tggcctggct gtggcttctg agtaaacacg tgtgtttgtt aactgttaag aacctggtaa 2641 tgaatactcc ctggaagatg ccatggaccc tggcctctgg ccagtcccct cgcccttgag 2701 tcctcctaac atcttattgc tctgattact tcccatggag aagccagcca gcagggactc 2761 gggaaccatg tgtcccaaaa cctggcttga atccccaaat ctgcttcttg actgtggggc 2821 cttgggtagc tgtagaaggg gatgatggtg ctgatgacgg ggacagcatg tatcccacaa 2881 agctgtggtc aggatgttag tgccatcttt tcgaaacacc catgcaggcc caccctgcat 2941 acctggcagc ttcctactcc ccagcacccc cagcacgccc tcggctggcc agccccatcg 3001 gcccctcagc acgatccttc cccgccttgc tgggattatg ttggggctgg cagcaaccaa 3061 cccgcctgcc cgcttctggg ccaggcagcc ctgtgcaacc cgagctgctc ccctgtcccc 3121 gctgtcccac cgccctccct ggacgcctgc tacccaccat caaacaccgc ctcgtgccgg 3181 ccccgcgcac cctcccgccc cgtcggcccc acgcttcagg cgctacctgg agggcatggc 3241 tcctacccgt gggcactcgc cttcaccctt ggagcccggc cttccccggg cagccgggag 3301 aggttctacc acttcacgag gcctcctcgc cctgccaggc cgaggtcagg gttagagaag 3361 ctgggtcgtt agattcggga gagctggaat gatctgatgt gtcattccgt acttgggtga 3421 ggaagccgag cacgcagagt taccgctcag ggcaggacct gggctgcacc tggtatcagg 3481 gcccacgtga gagataccct ccctccctaa gtgtgtacct gacgctagtt ctgggtctaa 3541 cggcaacatt ctagttctag gtctagttct aggtctagtt gtagagggtc tcctggatga 3601 tacaagtgag tgtccactgg caaggctcct ccccaggtaa acttgtgaca gagttctcca 3661 gcaaagaaac accagtcccc cagacacggg aggtcgagcc gccgccacta ccaagggagg 3721 tgcagaggga ctgggggctt ctcgagtctc agcttcaacc aaggccaacc agatggcccc 3781 attccgggtt tcagggtccc attcttccca catcactgcc cacgattcac gtgtgaaatg 3841 tggcgaggct ggaaattcat cactttggga atctgtaact ggcacaataa aatgtgtttg 3901 attgtcagaa ctgtcagccc tgcaagtaca aggagtaagg gaactttgag gcctttcctg 3961 gaagcaccct ggttctgtta ctgtggcttg agctgagagt tagggagcct gagcccattg 4021 tgtcttcatg gtacatagtc agggccacta aggatcatcc cataccacag ccctcatagc 4081 catccgtgac caccatgaca agtcaggtgc agcaaggact tgagctccct ggtgcttgtt 4141 tcagttggcc catcatccca agcaaccgaa gtgacagcta gattaattgt gaagccactg 4201 caggccatgg atggctcatg ccccacttcc cactgggaaa atgctcaacc ctgcactcca 4261 gcccaaaggc aggaccagac caatcgccag attccatgtg cataggcctt ttcttccaga 4321 accttaccca gggccagttc tggattagtc aggaaccagt taggagacaa aaatacatac 4381 ctagtatctc aagaaaggga agtgaaagcg tatttacttg gggtttgagg gctgaaagtg 4441 caaaagggat actgaagtaa cagagttggt gaaggaagaa gctacaaccc ccagagcagg 4501 gaggcaaaaa taaagatgtt ggggtgacca ggagtctagt actgagcctg ggagcatgga 4561 aggcctaggg ctctggacct cgaggagggg ctgctgcagt gcgtcacaga aggaggaagc 4621 cacatcatct gctaagaacg gggcaggagg aagtgtcttt ggcatcgtct tccaggtcac 4681 tcattctgtg ttcaggggca tcaatttgct gttgagtcca ttggatgaga ttttcattgc 4741 aatgactagg cttatgtacg tgctgctttt caaatctccc tggatttcta actcttctct 4801 tttgtatctc tttctttcta cccttaggtt gttgtgtcag agagactatt agagatctga 4861 gaagtcagta gctttacatc acaataatgc aaacatctat gagaactgct ctgtgatctg 4921 ctgagaattc taagctggtc agtctgtgct tgcagagttc agccgatcag gcccgagagg 4981 agaacagcgg ttctcatttt accctcctct gtttttcgcg tgatgacccg aggcctggga 5041 gatactgagc cagacttgcc tccgactcct tccttccaca tgcatcacac tccacacacc 5101 acacaccaca caccacgggg aatctccaaa gggcaggagt gggagggagt gcagcaaagc 5161 cgagaaaggg ctcaggctct ggagcccaac ttcatgagtc acatttccct cctgtttttg 5221 tctgaaaatg ggggtaacac tggcaccgac atcctagggc tgctgtgagg agggggtgga 5281 tgctcccagc cattgctgac aatgatgtga ttggggatct ttgaggattc caccccggaa 5341 ccctgacagg tccagcctta ctctttgacc tgcccactga ctctactctc catgacggtg 5401 gccagtgctg tcatcgtgag gctgtgggcc ccacttcagc tttacctcca aaagtcctgt 5461 ggcagaatcc tggggtgacg tgtcccctct atggaaggta ggttgtctct gccaggatcc 5521 cagcagtatg accgcactaa ggaggcaagg tgatctatca gccagggctc acagagacct 5581 gagccttttc cccgtgcctg tgggcaggca ggccgtgctg gagggtgctc agggcacctg 5641 ctgggaggca ggcagattcc ttctcacctg ctcacgcaga cccaggctga catggacctg 5701 ctatccctga gctccctgtg ctgaggcagc gctcacctca tctttccttt atcccggaag 5761 taaccttgca aaaacactgg gttgagctga ggggttccgt tcggcacact cacctgaagg 5821 cacaaggcct cctgacccag aggccatcaa gaccaagtgc agcaagctgc ttctgcacgg 5881 ccccagcgtc aggtcctgct taggcctgtt tttttttggt tgtttctatt tttttctctg 5941 tatttagaga tggggtctca cctcgttgcc caggctggtc tccagtgatc ctcctgcctc 6001 agcctccaaa aattctggga ttacaggtgt gatccaccga gtcccgaccc ctctgtgttt 6061 tcatttgctc tttgtttcca cccagggacg gccctcactc acctgtggcc caggtatgct 6121 tgaaatgaca caactacaag ttgtatcatt gctcctttcc ttgccttctc ccaaactcca 6181 ctaaggatgc atggaggaca ggcgatgggg tctaacaggt gtgatgctca tcccagccta 6241 gttagcactg gctcatgtaa cctagaccac aggccatagc tggacagggg tggcggggcg 6301 acctgcccag cttgggtcaa tcaggaattt gtaacgggca ctcagaaatc tcacctacag 6361 ctgggtgagg tggctcacgc ctgtaatccc aagactttgg gaggccaagg tgggcagatc 6421 acgaggtcag gagttcgaga ccagcctggc caacatggtg aaaccacgtc tactaaaaat 6481 acagaaatta gccgggcatg gtggaacgtg cctgtaatct cagctaccca gggggctgag 6541 gcaggagaat ctggcttgaa cctgggaggt ggaggtttga gtgagccaag atcacaccac 6601 tgcactccag cctgggacca cagagcaaga ctcttctgtc tcagaaaaaa gaaaaacaac 6661 aaagaaattg cacctacagt ctcctcactc tctagtgggc attaaaatga actttctgta 6721 gccagtttgg gcagagggag gatacagcct aatcctgggt gccctagaag gtcgtgcagg 6781 ctgttctcgg cacaaggaca ccttcctggg gagccagagg gctgaaatcc ctcccgtgtc 6841 ctcgcctcca gccgagtacc tgccccgaga gcagagtgcc tcttgctgat ccgcacataa 6901 gcctgcaggg gtgggtgggg gccctggagc ccagacaggc accaaggaat cttgcagccc 6961 tacaaggtga ggacggggga actgtcggag ttccacctgt tggctcctct ccctcctgac 7021 cttcccacgg gaatccttcc aaccgttggc tcaaagaaat tgaagtgggt ttctccattg 7081 ctaccctcgc catctaggac tccactgtcc tttttttttt ttttttctga gatggagttt 7141 cgctcttatt gccgaggctg gagtgcaatg gcacaatctt ggcccaccgc aacctctgcc 7201 tcccgggttt gagcgattct cctgcctcag cctcccgagt agctgggatc acaggcatgt 7261 gccaccatgc ctggcttatt ttgtattttt agtagagaca gggtttctcc atgtgggtca 7321 ggctggtctc aaactcctga cttcaggtga tctgcccgcc tcagcctccc aatgtgctgg 7381 aattacaggt gtgagccacc acgcccggct tttttttttt tttaattgag acaggttctt 7441 gctctgttgc ccagactgga atgcagtggc atgatcaaca ctcactgcag cctcgaactc 7501 ctgggctcca gagatgctct tgcctcagct tcccaatcag ctgggactac aggcacacgc 7561 caccatgcct ggataacttt ttttttcttt tgtagagatg gggccttgtt aggttgccca 7621 gactggtctt aaactgctgg agccaagtga tcctcctgcc ctaaccctaa cccaaagtgc 7681 tgggattaca ggcaggagcc aacgtgcttg gccagactcc atggtctctt acctggacca 7741 ggacgatggc cccacccagg ccctcagaac tgtgggatga ggtgtttgct tctggctcac 7801 ccacgagaaa gaaactcccg gaaggaaggg accgtgtctg gagcgctcca actccaggcc 7861 ccagactgca ggtatccctt gccaacagag ctgaggccag caggggaaac cattcctgcc 7921 acctggacta agaatgagat cctcttgaaa aaaaacaatg ttaagaagtg ggctcttccc 7981 atcacgccac cagagtccca ggaagccagg gatgtgggtg tctaatggtc cgggacaaga 8041 tgatgggaaa cttcacatgg agaagaaaat gcccaagaat gactgaaact gtctgcaagg 8101 tcctgctaag ggagctggca ggtgcaccag gtatggcgag atgcaaagca aggagcccac 8161 tgcatcgttc cctggaagga caagtgcaga ggagaccaca gatgcctgcg catttagcag 8221 gagtatggca gtgccatcat gggggaaagg ctgtgtgaca aatggtatgg aggagactga 8281 ctgtccctta gaaaatggga aaggagccag gcgcggtggc tcacgcctgt catcccagca 8341 ctgtgggagg ccgaggcggg gggatcactt gaggtcagga gttcaggacc agcttggcca 8401 acacggtgaa accccgtctc cactaaaaat acaaaaagta gccaggcatg ctggcgtgcg 8461 cctttagtcc cagctactcg gtaggctgag gtgggagaat tgcttaaacc caggaggtga 8521 atgttgcagt gagccgagat cgcgccacta cactccagcc tgggtgacag agtgagactc 8581 tttcaaaaaa aaaaaaaaaa aaagaacaga acaaacataa gaaaatgaaa catcagtgtt 8641 cttgaaccca gcaatcctac cccttgataa cctcagcaag agcaattctc ccttctcgct 8701 aaggccacat tccgtgccat ggccttcgtg gcatggtttg tggcacacca ggagctgaag 8761 ccctggggtt gcatcactca gggtgggaat gatcaaatgt gttgcatcca agctctgaaa 8821 tcctctgcag gagcacagga gtcaactgta cgtctccagg gccacagaca aagagctgag 8881 aaaccctgtt gaatgacgga agcagtcagc agaataggaa tgcagcacag ggacagttac 8941 ggtgggcaaa accatacaca caaaaaatcc acaggtgcag ggcagtggct cacgcctgtc 9001 atccaagcac tttgggaggc cgagacgggc agatcgcttg aagccagaaa taggagacca 9061 acctgggcaa cagggtgaga cctcgtctct acaaaaaaat aaaattagct ggcgtggtgg 9121 cacatgcctg tagtcctagt tactccagaa gctgaggtgg gaggatcact tgagtctggg 9181 acatcaaggc tgccgtgagc tatgatcaca ccactgcact ccagcctggg tgacaaagcg 9241 aggccctgtc tcagaaaact aaaatacaat tgaacaaaca cattccaaat gacaccacat 9301 tttaaatgac ccctttggga gggaggggaa caggactagg aaacagggaa gaaggaaaaa 9361 gcgaaaagct acagttctcc catctaattt tcagctacct aaaaagctgt tagcccaaaa 9421 ggtgctggtg aggctgtggg taaacgctga gggtacaatg cgagggacgg acagtatcaa 9481 aggtagggtc tttaggggag gcaattccct agtgtctctt acaattttaa atgcaggcca 9541 cacaaggtgg gacacatctg taaaaccagc actttgagag gccaaggagg gaggacttga 9601 gcccaggggt tcaagaccag cctgggcaac acagtgagac cctgtctgta caaaaagaag 9661 ttttctaaaa aagccaggga tggtggcaca cacttgtggt cccagcagct caggaggctg 9721 aaacaggaag atcacttgag cccaggaggt tgaggcggtg gtgagctgtg atcgagccac 9781 tccactccag cctgagcgac agactgagat cctgtctcaa aaaaaaaaaa tttttttttt 9841 taatgtacat atatctcgac ccagcaattc cacttacagg taattattct agagagagat 9901 ggatgcacgt tcaaggctgt gttcatagta aatgcaaggt agaaataata atgctttaaa 9961 agtacatcaa atggaggccc ggtgcagtgg cacatgcctg taatcccagg tactcgggag 10021 gctgaggcag gagaatcact tgagcccagg aggtggaggt tgtggtgagc tgagatcacg 10081 ccattgcact ccagcctggg caacatgagt gaaactctct gtctcaaaaa aagaaaaaaa 10141 aaagtttaaa gtacatcagg ccgggcgcgg tggctcatgc ctgtaatccc agcactttgg 10201 gaggccgagg ctggtgcatc actgaggtca ggaatttgag accagcctga ccaacccagg 10261 tgagaggtga gaggatccct tcagcctggg aggcaaaggt tgcaatgagc cgagatggtg 10321 ccattgcact cagcctgggc gacaagagtg agactctatc tcaaaaacaa aaaagttgaa 10381 attataaaaa agctaaaaca aacaaaacaa aatcactaca aaacgtgtag cataacggtt 10441 tttttttgtt tttgtttttc agacggagtc tcactgtgtc acccaggctg gagtgcagtg 10501 gtgcgatctc ggcttgctgc aaccttcacc tcccgggttc aagcgattct tctgcctcag 10561 cctcccgagt agctgggact acaggcacac gccaccacgc ccagctaatt tttgtatttt 10621 tagtagaaac gaggtttcac catattagcc aggctggtct cgaactcctg acctcgtgat 10681 ccacgcgcct cagcctccca aagtgctggg attacaggcg tgagccaccg cgcccggccg 10741 cataatgggt cttaacttgg ctaattatgt gggtgattgc acagaaaaat agtataaata 10801 acttcatgct tttttataca tgggataact tgcttcaagt tgtttcaaaa acgttttttt 10861 tttttttttt tttgagacag aatcttgctc tgtcacccag gctggagtgc cgtggtgcga 10921 tctcggctca gtgcaacctc tgcctcctgg gttcaagcga ttctcctgcc tcagcctcct 10981 gagtagttgg gattataggc gagggcgcca ccacgcccgg ctaattttat atttttatta 11041 gagacagagt ttcactatgt tggtcaggct ggtctcaaac tcctgacctc aaacgatcca 11101 cccgcctcgg cctcccgaag tgctgggatc acaggcgtga gccaccgcgc ccggcctcaa 11161 acacctctca atgattcgct tgagtatgtg taaagtgatg cacaggccac cctgaagacc 11221 ttggtggttt ctattgaaac tacacatcaa gccgggcgcg gcggccggtg cgcggtgccc 11281 agaatcccag cactttgtga ggctggggtg gatcacctga gcccaggaat cccagaccaa 11341 cctgggcgaa aaagcgagac cccatctcta caaaaagtaa aaaattagcc ggacatggtg 11401 gtggcgtgcg cctgtagtcc cagctacacg gaaggctgag gctggaggtt gaagccctcg 11461 ctggggtgga gaggtcaagg ctgtagtgag ccctcatcgc gccactgcac tccagactgg 11521 gcgcagagtg agaccctgac tcaaaaagaa agaaaaggaa agaaagaagg aaagaaagga 11581 agaaagaaag agaattacaa ataaagctac acatccactt ctacacttct gcgtggactc 11641 gagcgttcaa gatgggcgtg gcaacccgat gacaggaaac cctacacggg cgtgctccca 11701 gcaggctcct ccgtgagggc caccagccac gggaaaccac gaccttgtcc ctcaccagca 11761 aaagctcagc ctgcggctcc gcgagcgtgg accgttccgt ggccatgagc gggaagaacc 11821 acgggccccc gccaggggct gggatggatc tcgcgggcgg gaggcggagg gtactcggcc 11881 caccccagcg gaggacaccg cgtgggtccg ctgagataac gctccgcggt gatggactcc 11941 atgactcccg gagcagagcc ctggctgcgc tgcaccgcgg gagggagggg ctgcgtctac 12001 tgcggggcca cgagagggag gggccggggc cggggccggg gccggggccg gggccggggc 12061 cggggccggg gccggggcgg gacaggaacc cgcgcaagcg cggaaatgcc cacggccgcg 12121 ccctcccgcg cgcaccgcgg cgacacccgg gccgccctcg gaaggcacgg acgtcgcgcc 12181 gcgctcccgg ggcgcttcac ttgcgcccac ggccgacgcc ccggcagcgg cgggcaggtg 12241 ccacgacctc tggacgtttg tcgccccttc tcgggaatgg aacgcttaga gaagcgcagg 12301 tccacaccca aaacacaagg cggcactgtt gcgtgtgcac cggcgactct gcggccagcg 12361 ccctgggtct cgccccggga acgcctagga gcctgagagc gccgcgccac acccggcagg 12421 gccgcagcgg cgcgcacgcg acggggtgtc gtccactggg cagcaggagc gtcagtcaag 12481 ggcgaggggc gggaattccg tgcaatctga gcatggtcat tggcgaggac aggcgaagag 12541 gaggacggat aggcggggct gggcggggct tcccgtgggc gcctggagct taccaggagc 12601 gaccgaagcg gaggcgggga gcgaggcagc tgtgggcccc ggcgggcgga gactagggct 12661 gtccctccac gccccgcccc cgcctcggca ccgccttccg cgcctgcaag tcctgggccg 12721 gccctgcggg gcatgcgccg agtccccttc ctttgcagac cggtggtcac tgtcttcccc 12781 cggggacgag gcggacaggc gtcgcctgag atcagggccc gggaagcccc gtgagccgct 12841 ctcgcctcgg cgccgtcgag gggccgcccc gcccttccca ctccactgtg ctgcgaaagt 12901 gcctcggggc gcgtctgggg ggtcgtgggg aagcagggcc gcgcccaccc ggcagcgtcc 12961 agggagaggt gggcagagcg gggcgggcgg ccgtcttggg ggaagtggcg tctggaggtg 13021 tctgtgcacc attccgcggt ggcgcgcggt ggggtaacaa agtcttcccc gtcccttgac 13081 agcggcggcc gccggggagg gaatcctgct ctgctgcggg cggtcgtggg ctggggcgct 13141 tcctggcctg actgtctcct gggggaggta ggaaaggcga gccccgcaga gaaggcggtg 13201 gcttccctcg acgtgggtgc caacaggagc catccctcca gccggtcttc ccgcgggagg 13261 gtgcctggga acacccgcag ggctgggttc cagcacccat gggaccgctg ggtccagccc 13321 agtgtttccg gggaagcccc gccttacccg ccaaccttcc cgcaccggga gggttcccag 13381 ctgctggtga tgctgctcag gcgactcccc gcagcgccac cacccacccc cccgggagcc 13441 ggccgctggg tttggcccgt ccgccctggt tcctctttgc tgggctctga ggcccggcgg 13501 gacttctccg cggcggctct ggcgacccag catctctttg ccctggggtc agacgggctc 13561 ccaaggaatt gcatcccggt ggagaagacg cttccttccc tctcccttcc tctcctccgc 13621 ccccttttgc tctcttgtcc ttgcactttg ccttgtccgt gtgtccggct ctgtgtgtgt 13681 gtgactgacg cgagccctta gtcacctctt gctactattt tctcccctta ttatcacttg 13741 tctcttgact gtgtggtttg cgggttgtgc ttggtcacag aggggatttt agttttcgtg 13801 tcatcagaat ctccaggctt tcctgatggc ttctgggggt tcttgtcggt ttgtttcctt 13861 ctggctcacg ccgtgtcttg ccccctggac gctgggcacc agttccgact actgggtaac 13921 agacgacttc acgtttcggt ccttccccac ctagcactcg tctcaccccc ccagcactcg 13981 tgtcaccccc ccaagggttc tgagggacag gctctgggag ggctctcggg gagccaggac 14041 tgtggctcag ggtccctctc aggctgcggt caggtgtcca ctggggccca gccatcctat 14101 ctgactgggt ggggagccct cgtgatgact ctccctctgg aggctgtggc aggaggcctc 14161 agttcctccc caccgggcct cctccccagg gctgctgggc ttcccccgga cctgcaatcc 14221 agtcgaacca gcaggagagg gcgccccaga cccggccaca gtctcttcat ggcctggtct 14281 cctaagtgac tcgcctcgcc tccgctgtgg tttttattgg tctcacaggt ccaccctggc 14341 gcgtgctgga gagggcgcca cgctcagggt ggaggctgtt cctgagtgct gcgagtcaga 14401 agcacccttg cacgtgtaga cctgcgtcgg ctgctggaga tcccgagcgg gccgcctcag 14461 gaacatgtcc ttggtgtcag cgggagcagc aggccacctc ctggggctga cactggccgg 14521 ggcctgcatt tctgagggcg gggcgtgctt tgacatcctg aaagggttgc aaaacccatc 14581 acagagaacc ccaagaaggg agactctgcc accgagaccc cgtgtgcccc gcagggtctg 14641 cgccgtgtgc tctctggctc ttcatggaag cctttcccag ccctggtcgg ggagccccgg 14701 tccaacatcc ccaggagccg gtgggaagcc cgttctggtc tgcatggcat gagtgcccca 14761 cgccagcccg ggtggcagga tgtgcccccc acctccaatg cccccctccc aggggctcat 14821 tcctgcccct gtcccctctc tgccccaggt ggggcagcct ctctctgcag acaggggagg 14881 cctctcctcc ccgcatgcag ccctgaaggg gaggggccgg ggcagggagg gggtgtgccg 14941 cgtcgtcatg gtgaaggcag agcagctcct tttctccgtg atgtgtcagg ttgccggata 15001 ctgggctccg gtccctggcc tggagggtgg gatgtggagg gccaggtggg tcctggaagg 15061 agctttccac tggggcccca ggaagacctt ggtgtctcgt ttccctggag acgtgcccca 15121 tccttcccac tgggactggc aagtgacttg ctgatgacgc tctccgtggc tctgagtttg 15181 tcacagaggt tccgtgggcg gtgagtgctt ttcctgttgg cccgcgccgc ctctgggcaa 15241 tgcgggggcc cttggtgaca gctcctggcc ccttccaaag ccgcctggcc cttcttgaac 15301 ccgacgagag caggttgccg ggccttccct gaggaatgtg ttcctggcaa gtggggacag 15361 cccctccccc agggtgtcgg gtacagggtc ctggggtgct tcaggacacg gtcaggccca 15421 ccgacccccc acactcctgg gccacctgcc tgctggctgt cctctgtgac cctctctccc 15481 ctgagccgag gtgccggccc gatcccgcgg gaagaacagc gaggagggcc ggctctccct 15541 gctatggggg cctcgggggc ctggacgggg actgtgtcct ccctacccgg gctgcccggc 15601 gccctcgctc agggagtgct ctcctagcga cctgttcggc agatgggcac agccacgaca 15661 gcagccagac cctgtcccgg cccatgcacc cacctggacg gatcggcgcc ctcctggact 15721 tactggctcc attgagggtc ccaaaggagc tgccccagag caccctaggg gcgtccggac 15781 ccctcggccc gcagcagcta ccagcagccc aggcaggacc caggggcttc cgggcacaga 15841 ggcgccgctc cctggctcgt gagcccaggg cctctccagc cgtcccctgt tgggggcctc 15901 cactcctgct ctggatcaag gtcctgcccc tagggagagc tggtgccttc gggggagcgg 15961 gccctggttg cagttggaga gtgggtccag ggatggtgtt ctgcttggct gccgccctgc 16021 ctcctggggg cctccccaga tcccccccgg ctgctgtcag ctggggctga ggggtcctgt 16081 ggcctagggg gcatcctggt tcgggagtgc cctgccctgg acaccgtgtc caggaaggat 16141 acatgctcac cctttgttgg gggccaggca cagggtgagg gggcactcgg ccccaccacc 16201 cgtgtctctg ggaaccctgg ctgggtacgg cccctcttcc gagagatcct gcccgggact 16261 ggtgccgcct gcctcgggcc tgccccccac acagttgggg accctggcgg ggcacctcct 16321 gccagccacc tgctgctgtc ctagcctcag ccaggccctg caagctgtcc atgtgcctgg 16381 gacacccggt ccctctgcaa atgtttgcaa acacccctgc ccccacaggc atgtctgggg 16441 gcctctctgg gcctttgtgt gccgagttgg gcgtgggtgg cggcgggggt ctttccatag 16501 tcacctactt cctgtcctcc tcggccccag gtccagcggt gccctgtctg ctcggtttcc 16561 agcctgagcc ctttcctgag ctggcccggt cccctaagct tctgcagcag gttcccaggt 16621 ccaccccgag ggactcctgg ccagtgttcc caacagtgtc tgagtgctgg cctcagtccc 16681 tggccatgga ttttggggaa ccccgctctg aggggccgtt gcttcccttt cctctccagg 16741 tcccctccca gtttcctcgg agagcgggga tggggctgtt gggaggcgga agggagggtt 16801 ggggtgcaga ggtgtggctt gggggaggga aggtcaactg ggctaccgcc cctccctgcc 16861 ctccctggac tcagtccaga cacaacctgc gggttggagg cctgtgcacc ctttatctcc 16921 tcctagtagc ctggcgggtc ctcctgggtc ttcctcagtg tgtgtgagtc cccaggccat 16981 aggacccggc agggtggggg cccgtgctct gctcttgttt ttgctcctac ttgcccgccc 17041 ttcccggggg gggccctggc caggaccgtg ccagtctctt tccctgctgc ctgcctgcct 17101 gcctgggtgg cctccagctc accaccctgt gaggcctgtg gcctcccgcc aggctgtcct 17161 caggggcccg gatcccccta gtttatcttc tgcccttgcc gccccacatg tgtccctccc 17221 tgcagcctgg ggacgatcgg caggtttggg ggaccggtgc ccttggttca cctagcccag 17281 ccctagggcg gtggccctgc caggcatggg gcttgagagg ggatgtgatg gagcagacac 17341 agggaggggg caggagaccc tgggtgactc gggacaggag ggagtgcccc agaggttcag 17401 aatcctgggc acacagtggg ctgagcggac aagtcgcagc ctcaggggga cctcccgtcc 17461 tcccaactgg cactgcatct ttctgggcct ggctctgctg cctcacagcc ccgttcagct 17521 ggtggctttt agaggcttcc agagtgtgct tggccccttt acctctatgc cattgggccc 17581 agggggagca gtagagtggc tgcggctggg ggtgggactt cccctttctg tgtcttgctt 17641 gccccgtgtc tcccagtgag tggccgccct gagcctgggg ccagcagccc agccccagtg 17701 agaaataaaa gtagccatcc tgcctgaact gccgctgcct tttctacttt gccctctcaa 17761 caggggtcaa ggagggtacg cgtggtgttg aggtctcaga aggtctgggt ttgacgtgtg 17821 ccatggctgc aggctacggc ctggtcgtgt cagacgctgt ggagctgagg gaggatgtgg 17881 cgagaaagct gagcgttgcc cccgagagcc ttgcttcctg gcccttgtct gcagatgtct 17941 gaggtggtgg tgtccttgtc aggggtttga tcgagggcca aagcctttgg cccagggagc 18001 aggcagcttt gggtgctgca gttgggatgt cccgatagca tgcaggcagc accaggagcc 18061 aggacgactt gggatgttat gtgtttgggg gcaatcttta cagtgggaga cacataggtt 18121 ggtgtgggaa tccacggcag gggtaccaat gaggggatat tcagcagggg gcccagggta 18181 cctgggattg gcaccttgaa tgcatccacc tcctgccacc tgttaactaa agggccctga 18241 ggcccagagc ctgggggagg gggcctcatc ccagtccctt gtttggggga atggtggcat 18301 ggtggcatgt tcaagttacc tggcagatgg ccagggcctg ctgtgatttt tttttttttt 18361 tttttttgag acagagtctc gctctgtcgc ccaggctgga gggcagtggc gcgatctcgg 18421 ctcactgcaa gctctgtctc ctgggttcgc gccattctcc tgcctcagcc tcccgagtag 18481 ctgggactac aggcgcccgc caccacacct ggctaatttt ttgtattttt agtagagacg 18541 gggtttcacc gtgttagcca ggatggtctt gatctcctga ccttgtgatc tgcctgcctc 18601 ggcctcccga agtgctggga ttacaggcgt gagccaccac acccggccgg gcctgctgtt 18661 gattatctgg ccacctccac cagggagcct tcctgacacc accctaccca gtaatgggct 18721 ctcctgccca gctctttgcc tccttgctgc ctgggctcca ttccaggttg ctacatgttc 18781 ctctccttac gttggcttga actcagggag gcagaggaag caaagataga tggtgccttc 18841 tggagtcagg gtggggtgaa gcaggagccc tggagggagg tgacctgtgc tcagcccatg 18901 ctttgatctg ctcattggca cccctgtcag gttggccttg actggtcccc caaccccaac 18961 aaagctacag ccacgcaaag gagaatggaa gcaaaacttt attcctcttg gctggagaag 19021 agaactagtg ggtggttgtg tacaggaccc ccatccctca cccctcccag aaccaaagaa 19081 gacaagcagc gccaccaaat ggctccctct gcccaagtga aagccgagag gtcagcggct 19141 ggctggggag gcaggtgagc gcagcacggc acagggcagg ggcggctgca gtgacaggcg 19201 ggcggccagg gcggcctggg ccggggttga ggggaagagg gcggggctgc ttgggtagcg 19261 gggcaggctt gggggctgcc ggctggcacg ggccccagac tcagggcacc acaacgcggt 19321 aggggctgcc tgggatgtgc tcgtcccccc atttgaccac cagtgtgtac tcccccttgt 19381 ccttgagcag gtaggacacg ctgtagagcc ggctgcccac gtgcttcacc aggatctcct 19441 cgcagggggt ccttgggcca tgaaccccca ccagcagcat gttgttgcct gaggcaagag 19501 gggtcctcag tcccaggtcc cagccccctt cctcccgctg gtgtcctgct ctcctctcct 19561 gtccctaagc cagttctggg gtgacagttc tctcttccta aagagctgcc agcacctccc 19621 caacccaggc cagcttagca gattccagct cctccctggt ctttgtttga ggggtccagg 19681 aggtggcagg gaggtggctc tagaagtgaa agccacgccc caaaggcggt ccctgctctc 19741 ccagacactg gccatcctgt gatttctggc ctcattttgg tgggaaggtg ggccgggggc 19801 ccaggttgcc cacctgcttt gctgcagtct actgtgaagc tgctcttctg gcctacgtag 19861 gccttgctca gccccaggcc cttggccacc accttgctgg cgtcagcagg cccaggaccc 19921 ggggccccat gctggggggc acaggtggcc ttggtcagag agtctacaaa cactgatgat 19981 gtctcgtgga ggctgtggtt gctgacgaga cgggggcctg caaggcagag tgggtggggc 20041 taagaggtgg ctgtggtgcc gggcgttggg cagatgccaa tagcttggcc ccaggctcac 20101 ctgtgacttt ggccttgaag gggctgcccc caatgtggta ggggccgccg tacttgatgg 20161 agatgaggta gctgccaggt gccatggggg tataggtgac gcggtagccc tcagggcact 20221 cctggcaatc catcttcacc ttggaggggc cgtcaatggt caccgacagg gcaccagctc 20281 ccgcattgct cgtgttcacg acgaactcag ctgggttccc tgggagcaca ggaggtcagg 20341 cagagccaga ctggcctgcc tccctgtcct cggccaccca gcacaggctt gtcacctcca 20401 ggtgcctcct gttgtcacca agagcttgga ggcaagggcc tagcagctgt gtgcacacgt 20461 gcagccgcac cgacagactt acctgtgaca ccgccttcca gacctgctcc gtaagcagac 20521 accaagcctg ggtcccctcc atgcccaggc tccccaactc ggatcttgaa ggggcttcca 20581 gggatgtggg tgccgttgaa cttgacgtca atcaggtaaa cgccattctc ccgagggatg 20641 aagcgcacag catacttatc tgaggagcag ggagtcatgc tgtgggcctg gggcccctcc 20701 tcaaaccagc agatggtgtc tgtgaggaac agcctcaacc ctggccctcc ccattttgcc 20761 ggtccatcag tgtgagccca gccacgctgg gcacctgcac cccgcagcag acctcctgag 20821 ggccgaaggt ttagacactc aagctgccca ccgtcagggt gcattgggac ggaaatcttt 20881 tacaagccaa acccaggcct tgctgcctcc taagaaaggt taccgtgaac accatatccc 20941 ctgcatgtta acaggctcag agacgggcag gaacttgctt caggtcatgc agctgggaag 21001 ggcaggacta ggccgtgaaa tgaagcccac tgggctcctc ctacccacaa ggctgcccca 21061 ggcctgtgca caagtaccat ggccgtggct gaagggccgg gcagcactga ggctggactt 21121 tgagtctttc cctgtggaga tggcatggta ctgcagggta gaaggccttc ctaatagctg 21181 tgggtggcga gccacatccc cagcggcttg ggtcaccaat tgggaaaggg ttgcctggct 21241 tagggggccc aggagttggg gccccgtctt ggctgcttac agaagcggta ccttcctttt 21301 gtactctcag cctgcttcca gccagcaggg cagggcggcc gggcagggac agggcctcac 21361 cttggtcaat ttctgtgaca tagcactcct ccagggctcc tgaggggctg tgcaccttgg 21421 catcgatcgc ccccttggcc ccgttcaggc tgactgcaaa agaggctggc tggttgacct 21481 ttagccctga ctcctggaag cacagcagac atggttagat gggggtgctt gagccagggc 21541 aagggaggac ggaaaggccc caggagcaag aggcccgcgg gtcgcactta cccatctggc 21601 gtccaacttg gtttaaagag ggagagccac tgatttgcaa ggggcagcaa gccacaggcg 21661 cacatccccg cccctgcccc cacactctgc cgtcgcacac acccaggtgg caggaaggga 21721 gggctgcccc accctgtggg cagcacccag ccttggcagg ctgtccctct ttaaaccgca 21781 gccctggtgc tcaaggggca caggcttctg ttcaaagcct ttagacctga gacacgagaa 21841 aaactccacg tggtgggcag gatatttcct gccgacccca agcgtgggct gcacccggcc 21901 agagcagagg cctggaaccc agagctgggc caggggggtc actgaacaca gggccaaggc 21961 ctggtctctg cggtggctct ggtctgtccg ggcccaggag ccccaggtgg gcggtttctc 22021 tcggtgcctc acctgaaggc tagaaacagt gaggcggcgg gcgtcgccag acggagaagc 22081 cacaggcacc acgaaggggc tgtcgggaat gtgttcctcg ttgaacttga ctgagacttc 22141 gtagtcacct gggcagggaa agagtgtaag accggctcat cagcctttgg gcctcccacc 22201 tcccacgaac agcctgggtg gccctgtgac ctcaaagtga agagtgagtg ccaggcaggg 22261 ctatctgtgc cagccctggg tccaataccc acactcaggc cccacctcct ccaaccccag 22321 ccgggttccc agtacctggc tcctggacca cataagccac accacaggag ccgtccttgc 22381 ggtcctcaaa agagatctca gccttgctgg ggccctcgac agcaatggcc aggcctccag 22441 caccagcttc ccgggtccag atactgaatt cggctgtggg agaacagttt gtcctcactg 22501 aaggctgctt caccagcctc ggccccctcc aggcatagcc aaggtagaca tgcctgccgg 22561 ctctttcctc cacccccaac cccaccctgg ccaacgcagg agagcgagca ctcggggtgt 22621 gagcaggcct acctggcact ccagcttcag ctctctccag gccagggccc ccagctcgga 22681 ccttgtgggc tcccccttcc cctaggggcc ccacggtgaa ctggaagggg ctcccaggca 22741 cgtgctggcc cttgtacttc acgctgactg tgtgtgtgcc catctcagcg ggaacaaagc 22801 ggatgcagta ggtgtggttc tccccttcca cgatctcggc ctcatgggtc ttgcccgatg 22861 ggctggtcac ctgggctgtc atatcctgga tgctaatttc tgcagggtgg ggatgggcta 22921 gtgagcagca gccctgggct ccacccctcc tcttggggct gcttgagccc caggacccct 22981 ccccaggctt cccacagccc ctaccaggga ttttcaggct gaggtcacaa tgactaccaa 23041 cgttggccac tgaaggagcc cgacgcctgc gggtgatgct ctctttcacc cggccctcgc 23101 ctgtcacctt cacagagaag gggctgcctg caggaagaaa gcacggccca cgcccctcca 23161 gtcacatact gcctgtgggc cctggtgtag tgaggggggc tgccgaggca ctgctgcact 23221 caccaggcac gtgctggtcg gcaaacttga tgttgatgat gtagttgcct ggctctgtgg 23281 ggcagtaggt gaccctgcac gtcccgtcct ccaggtcctc tgtgttgatg tccaccttgc 23341 tggggccctc aatggacagg ctgagcccac catagcctag gggatggata cccctgagcc 23401 tcggtgctat gcacagtgct cccgccccag ctggtgggca gccactgcct acctgcatcg 23461 cgggtatcaa tgataaactc tgcaggctca aaggtgtggc cttcgtgaag gccctgacca 23521 gagacccgaa cacgactggc atccccaatt tccgactggc tgatcaccac cgggatgggg 23581 ctgctggcca cgtgctggcc atttttcttc acatgcacca ggtgctcccc cgtctccttg 23641 ggcacgaatg aaatccctgg acacagggca tggctgtcag tcagggaggg catggctccc 23701 cacaggctgc ctcctttctg aaccccctgg acccttcagc cgcttaccca cgtggccatt 23761 acgcagccgc ttcagcaaac agggctcctc ccggcccgag ggcgggacca cagtggccgt 23821 cagcaggctg agatccgtct ctgagatgtt gatggggatg tcggcagcag agccgacctt 23881 taggtgggac atacgcatgg agtcgtcacc tggtggggac aggccagcca tcagtgtgcg 23941 tccagccatg ggagaccatg cccaccctgc cacccgtttc tgtcactgct cccagtgcca 24001 cccacctgtg acccgagcag tgaaggggct gcctgggacg tgctgttcat tgtacttgac 24061 tagaatgctg tagtcccccg gcagcacagg caggtaggac acgctgcatg tcccatcctg 24121 gttgtcagtg cagctgattt ctgctttgga cgggccctca atggccagag acaggccccc 24181 tggagagagc cgtgggtgag catgggaact atgctgggga cactccagtt ggcctgcact 24241 gaggagctcc gatggccagg gcagcagggc agggagcccc attcaggagt atctcctgag 24301 tccagccctt accccggtga gggcatcccg ggcacagagc aggtcaagac cagagctatt 24361 gctcaccctc tcctgcatcc ttggtgttga cggtgaaggt ggcaggcttg ttcactactc 24421 catgggtgag gccaggccca taggcagtga catggccaca gttgacgtaa tccacataga 24481 actgcaaggg gcttcctgag gcaggaagaa gggccttgtg gaatggcagc ctcgtgtgtc 24541 ccccagcccc atctctctgt gaggtggtgg cggtggagtg ggcagaggca gggcaggccc 24601 acctgggatg tgcatgttgt catagcggat gtccatctcg tgcaggccag cctcgctggg 24661 tgcataccgc acggtcacgg tgccgtcttt gttgtcagtg atggtgggct gcgccacctt 24721 gcctgagggc atccgaacct cccctgtggg gcagtggggc tgaggtcagg gcagctcatt 24781 ggcacagtct ggcctccttg gctcccgagc tccttcccaa gtccccactc acctgtgatc 24841 tcgcccttct tgatggtgaa ggggatgaca aggtcaaagg gcctcaggct ggtcacatcc 24901 agcccattga cacccaccag gggcctctcc ggggcctgca gtggagacac agggcatggg 24961 tgagggacag tgggaggatc tggggtccct cctcacacat ggtaaactag gggctcacag 25021 aacacccagc tggctagccc cagtgtccct agctggccag gccgtaccca agtctgctgg 25081 ccgccctggg cgtaggtgta ctgtggggcc agctgctgag accgtagagg gggctgcacc 25141 gagggctggt ccccagccag agcctgcagg gcaaagcaga gagctgctgg agagtctgtt 25201 gtcacagagg ggccccaggg gaacaggccg ggacctgcca gacacccctg ctgacctacc 25261 ccccacccct cctcaccgtc acttggaagg ggctgttggg cacgtgctcg ccaccaaagc 25321 gcacacagat gacgtatttg cccggctggg gggccgtgta gaagatgtcg aaagtgccgt 25381 cctcattctc caccacgtcc acatccacct ctgagccatc aggcgtgcac acggtgcacg 25441 tcactttgcc tttgcctgcc gccttagtgt ccacagtgat caccgtctcc tccccaatct 25501 gaatggtggg gccgatgcca gcacctggtg gggcagggtg ggtccccaaa ggggggccag 25561 cgtgtgagct cgggttcagg ttgttcccgt ccgcctgccg cccacacagg cctctcattg 25621 ccaccaccaa gcacagccag gccgccaggc ctcctgcccc gccctgcccc tccaagctgg 25681 gatggcgagt ggtttctcag aaggcaggaa ggtttctcct gaccacctca agtgcccatg 25741 tgggagaggc ccgacagtag cacccaggtg cctgggagga aaaggaaggg cccactccag 25801 cctaagtgtt gacagccact cccttgccta ctcagcctcg tccagcacca tggcccacac 25861 tgctgaacgc cagcatctgg ctgggcacac ggctgcctgc gcccttctgg cctggccccg 25921 gtccccattg ctgtgtgtgg acgcggtgag gcggagtcct gctgcagcgt ggggggatgt 25981 gacacttgcc tccccacccc ccaggcctga tggtgaaccg gagtttcctg acaactgaga 26041 aggctcgcgg tgaaaagcag cccggtgcag acaggagtgg gagcggggaa gcgggaggca 26101 gcggggttag cacccgagta gcgcacgcag caggcttggc agagcgggga tagcacccag 26161 ggagacaggg aacaggaggc ccaagctaca gccaccactg ctgggttgcg agggcagagc 26221 agctgagcag gctccagggc cccagagtgc ccgaggaaag gggagagaca gtgacgttat 26281 gatctgggtc tccgccgtgg aggcttgggg ggctgtgggc tgtgaggtct ggcagtgaga 26341 gtgccccctg cagcctggcc tcctgccagg gaagcagcca ggagcacgaa acctacttga 26401 aggccaagct cgaggggccc tgccaccaag tcccactatc ggccgagggc tcctgctgca 26461 ctaggctcca tgctggcctg ccgcggccag gcagctctcc caggctccag acctggctgc 26521 cggctgccca cgctgccctg ggaggcttca ctcatgtgca ccccatgcca tctagttagt 26581 ggcttgggcc ccaacctcac aggtacccga ctgcccccac tttccctgcc cggctcccag 26641 ccaaccgcag gtctcagcca ctccatcctc aaagtatgcc tcctaatttg gctctcactg 26701 aagccccgtg gccgtcaagc ttgccagaga gcaacttcag cgcccccttc cccaccccgg 26761 gctccctgtc tccattctgc aaacagtagc cagggaaact gaggttactg agtcttgctc 26821 ccaccctcaa agcccccaag gccctgcaat cagtactccc cagcctcccc cgccctgcgc 26881 acacacgcct gtcacacaca catcggatcc tacggcctgc tctataaagc catctcgctc 26941 ctgccgcaaa actctagcca ccctctcacc ctttccttag ccacctcgcc caagtgccag 27001 gcctgtagcc tctcatcagc atccctgggt gtccctggct ggggcaggtg accctccatg 27061 cctgttgggg aatgtctgaa taacgtcact gctgggggat aggggaaggg ggaaggaggg 27121 agacgggcca cagaatgcac tttggggctc tgaggctggg ccgggccccg ccatgctggc 27181 cgcccaagca gtcgacactg gcgctactgc actaccagcc agccccaccc gctcatgcac 27241 ctcccccggc ccgggacgca gaggcccctg gaaggtgggg atgggggtct gtggatctgg 27301 ccagggctgg ggctcccatt gagttgggga gaccccttgt tccctcagcc cacgcaggag 27361 cccttgtgcc tccctgggtg ccaagatccc gagaccccaa agtccccagt ggggtgcctc 27421 tcaggatgca cctcacccca aagctctcag ctgcaacccg agttctcaga tggggaaata 27481 aaggcccaga gaggggaaga gctggagcgg tcatgctcag ctccggaagc ctccctgggc 27541 cctcgccctc cagcccacgg cctcatcttc tgcaccccac ccaactctgt ccctgcctag 27601 agctgcagct ggaactgtcc tgggaatcgg ccccaagagg aaaggaagct gcccctctgg 27661 gcaggagggg cattgggagt ggccccagca agcagcttac ctagcccgtg acctccgatt 27721 gacactgagg tgagagggca gagagcaagg agaaaggtca ggtgagtcac tcaagggggc 27781 ggcccccact cccacacgcc gccccaagca gcgagccctt gcacacaggc acaacccaag 27841 ccccgtgccg agcgccgcag cggccaacag cgggcgggct cacctgtgac agtgcacttg 27901 ctggcgtccc cggtgggcac ggcacgcacg cggtacgggg agaaggggat ctcgtcacca 27961 ccgtacttga tgaggatggt gtagcgacct gtcacgtctg gcacgtaggc cactgtatac 28021 gtgccgtcat ggttgtcttg gatgtgtgtc ttcttcggct tgccttcggg atcctgtgtg 28081 gcagaggcag gggaggcagt tggcccaagc ccgagtagcc ccgggcctgc ctcaggcgct 28141 cccagagtgc ccagcgctgc tgctacaggg acagtctctg ggcctcaggc atcttaactg 28201 ctatggccca aaagggctgc ctgcacctgg cagagtccag aatggagact aaaacatccc 28261 acacttccca catgacaaca ggcgcggcca gccagcttgg cacccatcct cagccccagg 28321 gcctggccag cactaggagg agctcaccgg ggtgatctca agcttctatc ctatacctgc 28381 cccatgagaa tattacaggt ggggaaactg aggcacagaa ggctttggtg acgtgctcaa 28441 agttagcagc aagtgcacgt ctggggaagc ctgtgcctgt caccaccatc ctaaggtagt 28501 cacaacctta gcaaaccaaa gacaggggcc ctgtccttcc ctgccctcct tggtgcagct 28561 ccgcagggag atcagacacc agccacccgc agcccacact ccagccgccc aggcccccct 28621 gcctcccctg cctgtgcccg gagctcaccg tgatctggac agccagcagg ccctccccgg 28681 cgtcctttgc atcgatggtg aactccacgg gcaggctggc aggcacgcca gtggtgttga 28741 gcccggggcc actggccttc accttgctgg catcatgagt aggcagcacc ttgaccttga 28801 aggggctgtg agggattggt gttgtgagca gtcagacagg ttctcagcat ccagcctggg 28861 ccactcccca caggcagcag gccctgcctc ttacctccgg ggtacctctt catctccata 28921 cagtactgag atgctgtagg gcccttctcg gctgggcaca taattgacgg tctgggtgcc 28981 atcagcgttg tctaccacgt ccactggctc caccaggcct ggccccagcc ccagggacag 29041 agcatcagct agtctcctgg gcctccattc ctaccccacc acactggacg gccaggaccc 29101 agccccaggc ctagacctcg ctttatcctc atccacccga ccctgggaat ggccttgacc 29161 ccctctggca cgaaagacac ccatctgggt gccagctggg acccttgcct gcctgccttc 29221 ctgccacatc tgctcagtga ccagtcccat ctgctctacc tcctccaagt cccccacctg 29281 gctgcactgt cacctgaggc tcttccaaaa acacctacac acttgcacca cctctgcact 29341 ctggtcctca ctttgggcct ccagtttcct ttcccggccc tcaaacagga cactgccctc 29401 ctgacccctg gctccaggca tgcaaacact cacctttggg cccttgcact ttgacctgca 29461 atggggccac accagccttg cttgtgtcca cctggaagga ctgagggagg ttggcacgaa 29521 ccatgcctgg gctcaggccg ggcccagagc acttgacctt ggacgcatct gtcacatcat 29581 gcacagggac cttgaaagga ctgcctgagg gttggggcaa agggatggcg gctgtatgag 29641 acagggtggg gacgagcaga cagtcccagc cctgccctgg ccgccacctc ctcacctggc 29701 acttgatggc caccataggt gacgttgagg ctgtaggtgc cagcctcata agggatgtac 29761 tcgaccgagc agctgccgtc cttgttatcc atgcaggaca tcttggcctc ggaggggccc 29821 tctacagcca ggcccaggcc gcccgtgcca gctcccctgg tccaaacaga cagccggtca 29881 ttcctggggt tcccaggccc accagccaca cgggctcctg ggggctccct tacctggtct 29941 ccacagtgaa cttgttgggc ttgttggtgg tgccactttg gatgcctggc ccgtggacac 30001 gcacccggga ggggtcgcag ccctcggtca cgggcacctg gaaggggctg ctgggcacgg 30061 gactgccgtc ataggtcacg tccacggagt gcagtcctgg aggagtgcag gccaggtcag 30121 gaggagcccg ggccacccca cccaccccgt ctgccagcct gtgggagtcc ccagcacgca 30181 ccctcctcgt aaggcgtgta ctccactttg tacatgccat cgccacggtc ctgaacgtag 30241 gtctccgtca ggttgcctga ggggttggcc acacgggcct tgacgtgcgg ccctccggtc 30301 tgtgtcagag cccgggcgtc cacactgaac tcagtggtgg cctcacggaa gacacctgca 30361 aaggcacaga gaggaggctt ggggctcggg ggttctggtc cctgtccccc gtcacatacc 30421 ccacggcagg gcaactcacc ctggccctca ataccaggcc catagcactg gacaccggaa 30481 gtgtccaccg caggttccac ctgcagcttg ctggggaagt tgggcacggg ctggccgccg 30541 tacttgatgg tgacggtgta ggccccgggg cagaggggaa tgtaggtaat ggtgtgcgtg 30601 ccatcaccgt ggtcctggat gtacacctcg gccggaagcc ccgcctccga gcagatctca 30661 atggtcagct ccgcgctgcc cgcgctcgag cagtccactt ggaattggcc cacctcccca 30721 gcggtggccc gctccagccc ggggcctgag cacttgactt tggatgcgtc aaagcaggga 30781 accacgtggg ccttgaatgg ggagccaggg atgtgggtgt cagcgaagag gatgttgatg 30841 ttgtagtccc cgggctcggt gggcacgtag gacacggaac atgtgccatc cccattgtcc 30901 aagcactcga gctgcgcctc acaggggccc tccaccgtca ggcccaggcc acctgtgccg 30961 gcgcccttgg tgtcgatggt gaagcgggcg ggggagcccg cactgcctcc ctgcagcccc 31021 ggcccaaacg ccttcacctg agggaagaag gggtcaggag ccaaggccac actatgcccc 31081 gatcccagac ctcctgcttg actttccacc tgcctcccct tgccctggct tcctgccctc 31141 accaaacagg gacacgggcg ggcccaggct gcctgccaga tgggccatcc atcagtcata 31201 aggacaaaaa ggagggcagg gcgcggtcac tcacgcctgt aatctcagcc ctttgggagg 31261 ccaaggtggg tggatcacct gaggtcagga gtttgagaac agcctggcca acatggtgaa 31321 atctcgtctc tactaaaatt accaaaatta gctgggcatg gtggcgggca cctgtaatcc 31381 cagctacttg ggaggctgag gcaggagtat tgcttgaagc caggaggcgg agggtgccgt 31441 gagccaatat cgcgccactg cactccagcc tgggcgacag agtgagactc tttctcaaaa 31501 aaaaaaaaaa aaaaaaaaaa aaaaaaagga aggaaaaaaa aaaaaaagga aattcttcaa 31561 ccctgatgac tttttgtggg tttctatttc cctcctcccc tttactctca cttaaacagg 31621 gtcaaggttg acagtctgtc tcaggaaaaa aaaaaaaaaa aaaggattta gggcaggtct 31681 ggagaagatg gggtaccttt ggggccctca gagagcacag tgggttctac ccttagggcc 31741 tcccataccc caattacctt gctaggcttg gtgggggcca cagcttccag aggaaagggg 31801 ctgccaggca cgggcacgcc gtcataggtc acctccacct catagggccc ttcctcacgg 31861 ggcaggaagc gcaccacact gttgtcagcc cccaggcctg gctccacctt gcagggcacc 31921 gctgcacccg aggggcccac aatcttggat gccactttgc cttgaccacc agcacccttt 31981 gatttgactg tgaactcctg gtctttgcca acgtccacct ctgtggaaac gatgaaagga 32041 aggagagaga catgacaccc agctcagcca atccctggat gtgacaaagg cctttgcgac 32101 aagggcccca actacttact ctctcccagg ccagacacct tgatcttgct gaggtccagg 32161 cttggagata ctgccactga gaaagggctc ttagggatgg gatcccctcc ataagtgaca 32221 ttgacgccta ctggaccctg ggaagggtgc agaagggaag ggggtattta gagacaccag 32281 aattgctccc caaccccctc ctggacctcc atgcctgtcc tctcccaggt aaggaatgag 32341 agtgggcaga aagtccctcg gagctgtccc ctaggctgct gcatgaggag gctggggact 32401 cggtgactgt agtggagggt gtggctacct gctggacagg cgtgtacttg actgtgtagg 32461 tgttgtcatg gtggtcgatg atgtccacat ctcgcactgc atcccccttg gtgagtcctg 32521 agaactggac gtccagcttg cctttgccag cagctttggc atttactgtg aagtgggtgg 32581 gcttgccaag ctcgacacct gaggaacaca cagggaccat gtaggggcac cctgccccaa 32641 gccctcctac ccttgatgcc ccgcaacctg ccatggggta cctgtcctca ccagtgcgac 32701 tgaggccagg gccctcggcc ttcaccttac tggcgtcatg agagggctcc accttgactc 32761 ggatggggct ggtgggcgtg gcctgcaggc agtgggagga gaaggcctta gaggagggca 32821 gacgtcatcc gcaatgacat cttagcggcc aggagcgcag cacccacctg gtcagcaaag 32881 aggaccataa tggtgtagct gccagccccc cggggcgtgt acttgaccgt gaaggtgtca 32941 ttgtcattgc ggatgatgtc gaagtcgatg tcagcttcgg cggggcctac cactccaggg 33001 gcacacttga tgccgatgct gacgtcccct gcggcgggga gaggagcgga ggctgagacc 33061 tcgcagggac accccagcca cctgccctcc cacccacagc caggccttac cctggccagc 33121 ctcggcgcag tccacagtga agtaggtggg ctcgtgggcc ttgagccctg tcttggctac 33181 tccggggccg tatactttga ccttgttggg gtggctgcca gctcccacat tcacctgcag 33241 ggcacagggg caagggcaag ggcatgagca gcctggagga gactcagaag ctccctcagc 33301 tacactggca ggcacacaaa atggtccagc tgccttggga agagtctggc agttcctcag 33361 ttgaccacag agggactgtg tgatgctccg gttcactccg aggcaactac ccaagagaaa 33421 tgaaagcaaa tgtccacgca aacacctgga ctcactgtta acagcactgt tcatgatcac 33481 caaaggcaag ctcatcctca atatctgtta gcaagtgaac ggaaaaatgg aatgtggtct 33541 ggccacacaa tggagtatta tgcggccata aaaataaatg aagcggccgg acgtggtggc 33601 tcatgcctgt aatcccagca ctttggaagg ccaaggtggg cggaccacct ggggtcaggc 33661 ggttgagacc agtctgacca acatggagaa accccgtctc tactaaaaat acaaaattag 33721 ctgggaatgg tggcgtatgc ctgcaatccc agctactcag gaggctgagg caggagagaa 33781 ttgcttgaat ctggcaggca gaggttgcag tgagctgaga tcacgtcatt gcactccagc 33841 ctggcaataa gagcaaaact ccggctcaaa aaaaaaaaac caggatgggc gcggtggctc 33901 atgcctgtca tcccagcact ttgggaggcc gaggtgggca gatcacaagg tcaggagatc 33961 aacaccatcc tggctaacac agtgaaacct tgtctctact aaaaatacaa aaaattagcc 34021 gggggggtgg cgggcgcctg tagtcccagc tacccgggag gctgaggcaa gacaatggtg 34081 tgaacctggg agaaagagcc tgcagtgagc cgagatcgcg ccattgcact ccagcctggg 34141 cgacagtgtg agactccatc tcaaataaat aaataaataa acaaaataat tataaaggaa 34201 caaagtgcca agacatgcca aacatgaacg aaaacacgga aaacttcgtg gtgagcgaaa 34261 gaagccagac acaaaaggcc acatggcaca cagttccatc tatagaacat gtccaggata 34321 ggcaaagcca cagcagaaag caggttagtg gtcaccagga gctgtgggac aggggagtca 34381 ggatggtgtg ccacaaccac ttgaaatgga cgaatcgtgt gctacgtgaa tgctatcgag 34441 atggactaaa ggccggtgga ggttggctca ccctgaaggg gctgttgggg atgctgacgc 34501 ctccccagga caccatggct gtgtgcttca ccggcttcct gggcacgtag gagcagctgt 34561 aagtgccatt gccgttgtcc ttgaccaacg cctccacagg gcagccttca ttgtcctgtc 34621 aggcagatag gagcaggtgg cctgctggtc agtgcccagg cctgggtgcc cacacctgcc 34681 ctgcccccaa cacccgtggg tgctctacct ggacttggac ccgaagtggg gccttgccac 34741 cgtgcttggc atccactgtg aactctgctg gcttgttgac ggccacacct gtcttctcca 34801 atccaggccc acgtgccttc acctagcggg agaccaccca gctgtcaggg ggccaggtcc 34861 aggctgccag agctacaacc caggcagggt ggccagggac acagagtgcc atccccacca 34921 gaccccaagc aggagcagca gggcgagact taggccatca cagcctgctc tttaccctgt 34981 ctgggtggaa gtcctggggc gcgtcacgga tgtcagccat gaaggggctg aggcggatgt 35041 cttcgctgtt gcacagcacg tgaacggcat actcgccagc ctcctgcggc cagtagcgca 35101 catcacagga gccgtcgccc ttgtcgtcac attcgatctt agcctgcgat ggcccttcca 35161 ccgagaagcc tgacaacagc caccagtccc ctcagtgccc tggagcctca gggtgggccg 35221 tccttgccat cgtctgtccc caggtgccca tgctgcagcc tccaacttac ccagcgtgcc 35281 cacgtcgtcc ccgatagcct ccaccacaaa gtctgctgac ttgccaacga cgccgccctc 35341 cagcccaggg ccccaggccc gtaccttctg attgccacac tcggtgccca ccttcacttc 35401 gaagggactg caaatgcgag agccacacag ggaacaccga ggatcaccac atgagccagc 35461 gtgggcccca ctgtggcggc caggcaggaa gagcccatgt ggcccctcat catcaggtgg 35521 ggaggcagaa ggaagagaag aggcagagtg tgcagagctg ggagagggat gcctgggggc 35581 ctcacctgcg cccgatgttc tgaccacccc acgtgatggt gacgatatag gttccaggga 35641 ccatggggta atactcgaag ccatacacgc catcccccag gtccttctgc ttcacgcgct 35701 cctctccctc tgccaagaca aggagggcct caggcctgcc cagcagtgaa cccggggctg 35761 ccgccaccca tcctggcctg gctccaggcc aacttactgg ggcccttcac ggtgaccttc 35821 agctccccac tgccagcgcc ctttgtgtac accttgaagt cagctgtctc cttcacccgc 35881 acacccttgg gctggaggcc ccggccaacc gcccggcagg cactcgggtt acaggctgca 35941 ggcagagggg ccagctgagc accagcagct cggctgggcg acccctccct tgcctcccac 36001 agggccgggc tgtcaggatt ggtgggtccc tcagagcttg gctggagggg gacatgcaag 36061 acagaactgg aagggactgt gaccccagga actgtggcct gggccaggac atggcaggcc 36121 tcctcccacc tgctccacac cagcccagag ccccccaccc tcccccttcc aagaagaaga 36181 cagatccaag taccgttgac cctgtgggca gagcagagag cagcaggttt ctagacagct 36241 ggagcgagct cttccgaagg tgaaagcgtg aggagagaga tggagagggg cgaggagagg 36301 agaggaggaa gggccccagc aggggaggaa aaacagcatg tgcccagaca gtagaagctc 36361 aaagagtagg ggccccgggg cgggctgcag cgggactggc ccagggggtc cccctcctgt 36421 gggaggccca gactgcagtg ccacagcaga gggcagtcag ggccgggcct accttggcca 36481 acagtgacag tgtaggggct gcgagggatg ggcacgccgg caaacgtgac gtgcacggtg 36541 tggacgccct ccatggtggg ctggtagctg cagcggtatg tgctgtcgcc ccgggcctcc 36601 agctgaggct ctaccgtgcc cttctgtccc atggggtcct ggatcacaac ctcgacctcg 36661 cccgtgccag ctcctgccac gaggcacctg ctcagctccc aggccccagt gcggctctcc 36721 ccacagacca gctgggcctt ggcagcctcc cctcacctgc cgtaaagatc tcaaagtagg 36781 tggtcttgtt ggcgatgttg ccactgggct ccaggccggg accttgggct gtcactttgc 36841 tggcgtcacc ctgtgactta tccacgtaca cctcgaaggg gctcttggcg atgtgctggc 36901 cagcaaagag cacagtaacc tgtccccaga agggtgggcc gtgagggtag ggctgggggc 36961 ctccagccac tgcctgaggt cacaagcctc ccccctggcc aagggctcac cttatgagtc 37021 cccgtcacct cggggacgta ccagacggag aaggtgcggt tcttgtcgtt attggcggtc 37081 acttttgcct gcagtgggaa ggagcctgtg agcctttgct aagagcagcc ccactgaaag 37141 ggagcgctgc ggggcctctg ctgccagcag ctggccctac ctcctcctgg tgtccggccg 37201 ggtcctccac gtacaccagc acctctccct ggccagcact tctggtctcc acagtgaact 37261 ctgcccgctt cttcaccatg ttgcctgtgg gctcgatgcc tggcagggga aggcgagcca 37321 accacgggcc agctgttaag gccacagcct cacccctcca cccttcagcc ctccctgctg 37381 gcccctaagt caggcacatt ccaaactggg caagttcatt cagtgtgagc tgtggcacag 37441 agcctggcct cgggacccag ctgtgctctc gctgctggga caggcccagc ctgttttctg 37501 gaacattcat tctcaccgca ctgggccctt gttcctgtgt gagccttgac tgggttttag 37561 gtccctgtta ctgagacctg gttcatctcc atttataata aagaccgctg gatgctacct 37621 ccccgaggac cccacgcagc aggcttctgt gactggagga acctattagc attctcttgt 37681 cgttttcact agtcccaaaa acccactctt gtctgactct tgggtggctt gatcacctag 37741 ccgccatgta acccaagacc cctggggacc gccacgttta gatgggacac tgtcttatgg 37801 ggaagacgtt ggcacacggg tgcacccctg gtggggctcc ctcacctggc ccgtaggcac 37861 gggctttctt cgggttcagt ttgggccgca agggagcccc tggcttcagc ttggccttgg 37921 ggaactggga caggtaggtc atgacagagt gctcgtccac gttggggtcc acaatctcct 37981 cgggggtgat cacctgtcac aggcagaaaa caggagccat cgggcctccg agtctctccc 38041 aactgccgat ccggtcccct acagctgtag ccaggccggg cgggtgtacc tgggggatgc 38101 ccagccagtc atccgcctgc tgcatggcct ctcgcgcatt ggtaacgggc ttgctggcgt 38161 cccaagagtc ccagtcagga cacaggcctg tggcgcaagg gaggctgtga gtctgggggc 38221 cgcagaaccc cctcaagggc cacccatggg tgaccccagc ccagtctctc ctgcctctgc 38281 gccccctcac ccggggcaca gctgtccacc agggcgccca gggcccggcc gctctgccag 38341 tcccggctga agttggtgat gggcagctgc ggcagcttgt tctggatcca gcccaggagc 38401 ctctgcttgg gggtctgctt cttggcctcc tcatcctcct cctcgtccca catgggcatg 38461 gagatggagt agtgcaggat cagggtccag atgaggccca ggatcagctt caggttcccg 38521 tccacgatgg ccttgctgtc tgtgagtaga agagtggcca cgctgggcac acggctgtgc 38581 gggaggggcc gatcccaggc tttggggtca gggtctggca gcacggggta gcaggggcca 38641 aggaggaggt agacaccccc tcttggccag tggtaggatg tggatgaggc cctctctgcc 38701 cagggtccca gggggtgttc agaggtgaag gagactttgg ggtgagggca gatacacaca 38761 gacacacaca cacgatgcca gcagggtatg attactgcca gaagccctgg gcccagagga 38821 agggcagagg gcagacccag cctgcagatg ggcagctctg gagaacagct ggacagtgtc 38881 cacagctgcc agagcagggg tccgccacac acctcagggg cctatgggga accaggagga 38941 ggaggctgac tccagctggc ccagaatcca gggcccatga cctggagtgg ggctggggct 39001 gaggaagagg aaggtgggtg gccctgggtg gttgggaaat acaactgttg ttccagggtg 39061 ggtggggtga ggcaggggaa gtgggggagt tgggcaaacg ccctttgggt gagggattgt 39121 gacccccgcc accaccccca cttcctctac cacctcgaag actcaagctt ggagttggga 39181 ggccacaggc ctagccaggg gaggggccca gagcccaaag gaaccagcag ttgggccaca 39241 tggccctgac tcactggggc ccatcgggag atgccggggt ggcagcagga gacactggga 39301 acccccttgc cacccactgc ctgacccact tgccacctaa cgggcctctg ggcctcggca 39361 ggctcgccac aggggaagtg gttagggcgg gtccctgaca ccccccatca ccatggcaac 39421 cctgctgggc tctgggcccc acccaaggct acttctgggt ctcaggctca aggcaggatg 39481 gctcatcttt agaagccctg tcagcctccc ccataatcag acaaaatact tctctggtga 39541 gtcaactagg gggcgggcct atgaggaggg gctgggctgt ccatggcagc tgcttcccag 39601 cccagcaggc agcacctgga gggccaggtg agccgctcag agtggcctcc ccagttccca 39661 ggttggtcct ggccccacga ggctgccttt cactctcccc aaatggcagc agggaggggg 39721 actcggagag gaccaagggt gcagggccag ggaacaaggc cagccagggc ttcccatcag 39781 gcccccaccc cagtccagcc tcagacccct caacgacccc cacacgggga gcaaccaggc 39841 ccaccaggtc catgttaggc cagcctactc cgtccccatc cccctcccaa aagagagagg 39901 gaaggaccag ggcccagggc tgggagactg tctggctgga tggccccgcc ccatcccacc 39961 cccctttccc cagcccctgg gccagccaac ccctccctcc cctaatcact gctgctttcc 40021 aacatttttt tgccttatat ggcaaagctc tgggaactag gcctgctcca gccagctcac 40081 aaggaggagg ggagctggga ggagggagga gaccccccct aaagagctgg ggtgtggccc 40141 gcctccccac atccggtcgc cccccctacc atccaggact gagatgactc agcttggcta 40201 gactgaggct gggacttgtg ggggtgggcc tgcccctcgc ctcaactggc attccaaaga 40261 aagaactttg gttcttggga tctgccctct gagtggctgg gggaacccaa gtcttggtcc 40321 ctcactttct tgggttcagc aagggccatt ccagggctgc agggtaactg gaaggcctct 40381 caattctgaa gaagagaaga ctcatcatcc ctctcctctc cctgttccac atcacccccg 40441 agtcacccat tctggggcct gccctctcat gcagccccta tcttgagaac gaattgggct 40501 gctgcagcag gcctggtcaa tgccagcctc tccagtccag tgacacagac actgggctgg 40561 cacaatgcac gctggagcat ctaccaacca actcctgcgt gctacacgca gaaaagcaag 40621 gcctacatgg aagcaagagt ttggcccaag atgaagagaa aagttggcgt ccaagtagtc 40681 tacaagccag gtctctcagt caaccacgcc caactcaccc cgggtgcagt tttgctaact 40741 atgcttcttt ctggccccca ggttgcctgt gtgtgtgtgg gggggtagat ctaaactggt 40801 ggttggagca tctccgatgg gcgacccatc ctccagggac acaaagagcc cagagcccca 40861 gacaatggga cgcacgagct ccaacccaca cttcagaacc aacgggcctc tgaaggccag 40921 agggccctct ctaaacagct gtggggggag agcccgggcc tcctcctcct cggggacggg 40981 gcgcagggcc aggagcctgc ccagtgcccg accccccagg cccactccca ccgcgagcac 41041 agccgcggag gcgcggctgc ggaggatgtg agttcagcct gggcgccggc cagccggggc 41101 gcagcgggcg ggggcgcgta cggttcgcac gcccggccgg ccagccagcc agcccgcgcg 41161 cctttgttct gcaggcccgc gtggagcaga gcagccggag gcctcgggag ttgcgctgcg 41221 gcccggaagg gggtggtttg gaggggtccg cccggggaga tggaaggacg cacggagtcc 41281 ccgcccccgc ccgcccggcg cttcggggcg tccctcaccg atggacacca gtttgatgct 41341 ctcgcggtcc aggaactcga gcgccaccga cacgttctca agctgcattt ggcggaaagt 41401 gggccgctgg ttgtgcttgc ggtgcatctt cttctggctg agcacctcca acagcgcgat 41461 aagccgcagc ccgtcgctca ggtccgtctg caggttggcg atgcgcttgc tcacgcactt 41521 caggtgctcg ttgcaccagc gcgtgaaagt gttctgctgg atcttcttcc acggcgcgtc 41581 ctccgccagg tccttctcgg tggccggcat ctcggcgtcc cgcgtgtcga cgccgccgcc 41641 cggagccgcg cctgctgcgc tctggcccgc ccgagagtgg gagctactca ttttgaggcg 41701 cgagaagccg ggggggcggt gctgcagcct cggcgagggg acggcccttt aattaaagtc 41761 gcaggcacct aggcgcgcgg gaggcgaggc agggagcaga ggttgcgctg cggagagagc 41821 gagcccttta aatgcgggag gagggcgggg ccagagggcg ggcctcctgc ggggaggggc 41881 cgtggggtgg ggcttcgagg gcgcgcctgc cccaccccgc cccgcccgtt ggaatgcccc 41941 cactaggccc ccgggttcgg ctgatcagac gcgaaacccg ggctccaggg tgggtcgctg 42001 ggcagtgggg tgggcaagga tgctcccagc cccgcagcct cgcgttggcg ctgcaggaaa 42061 cgcgccctag agcgaggaaa ggctgagcag tgtctggcgc gggactgctt ggcctgcagc 42121 cggcgcgcct tacaagggac tttccttccc cagggcccct gcggggggtg gggtggctct 42181 cttccctaag tcacttgggc tctgccccgt ccctgcacat ccctaccccc cgccgtcatc 42241 ccccttcccc gggcccccag ggcctagggt tccccggcgg catccccgtc cgccggcccg 42301 gcagcggcgg gaaggggcca accctaagag cgaacccctg gagagcggca gccccggacc 42361 aggcactgcc gagggcgctt tgtgtgctca tcacctggga agcggctgca gggcaaggcc 42421 tgcgcctacg cggtccccgc agaccccctg ccacgggccc gccttcccag tccgtagggc 42481 cctcgcccgc gcacctgtgc ccgctgcttc ccaccgcctc cctctctgct tccctccacc 42541 cgggaggata tggggcactg ggccaggagg ccgccggcac tccaggagca agcagccttc 42601 gaaggggtcc gggcatgccc acgggcacgc cagtgggcgc gggcagaggg cgccgtgcga 42661 tgcatgcaga gacgtgtgcc cgcctaaccc ttcatgccag ttcacccggg gcactggggg 42721 gagctgaagc ggggggtcct ctcctgtcgg aaagtcccct cagcccagcg tggccagggc 42781 agggggcaaa cacgggagtg gcattccaga cccagagggg cctgctcaga ggttcccata 42841 agacttgtca ggctctgagc acctgggtct gggtgggctt caggtgtggt ggcaggtgca 42901 gaggaggcct gaggacaggg atcaggggcc tggggggttg tgtccccaga gctgtggggg 42961 ctcttcactc aaaattttct ggtagataac tctggagatg gtgcagtggc tgctgtgccg 43021 agatgtagac tgggaccagg ggacggctgg ggacgctcac tgctactatc ttcagtcaga 43081 agccagcctg cctatggctt cctccccatt gtgcactccc ctccctctca gtcctcggta 43141 cctcccctct ccagaatgtt tcctcctctg agtgtttttt ttttttttct aggaaggcag 43201 gtatctgtcc ctcctgttca attctagtaa taattaattt atctgctaaa gaccagtgag 43261 agatggaggc cctgggttgg ccagcacgtc atgcccaaat atggttgact ctcctgtagc 43321 cctgggtgtc tacatgacac gtggtggggc tgtgatccgc ccagcacatt cacttcgggg 43381 cagcctgtgg ccccactggt ttttcccact agtagtggtg ttagggggtg caaatggcat 43441 tttccagtac tgaggccaac ccctgtatat tgtcccctct cctgaccact ttgagacaca 43501 cctgcaccaa gtgcgcctgg gcgtctggga gctggggtgg aggtgggagg gagggctggg 43561 aagggggaac ccatacagac ccctccctgc acgggcccca ttcaagacgg gcttgaagtt 43621 gtccttctgg tccccctgcc ccactggaaa gcaagttccc aggcacggaa ttccatcaat 43681 aattgactct ctctccacta agggccattg agctgggttg tctgggttca ttccttaatc 43741 ttcagagctc agggaaactg agtcagagtg accagtggtc aattccagca gaaagagtgt 43801 caggaggtgg gtgccgtgtc ttccagccct agcttttccc tatttagagg aaacaaggaa 43861 tgggcgtgta gagggttctt gtccctagaa catagtatca accgccagat gtcgcccgat 43921 cccaaagcca ccagatcggg cagtcctggg aagccaaaat ccagctcttt catattttcc 43981 cttatttgga gtagattctt ggatgttagg cacaccccgc ccaccacacc acccctggcc 44041 ctcagacccc aggccagacc ctcccactgg gaggtatagc ccaagcccag gagctgttca 44101 gctgggatcc ctaaggagct gtttccaggc ctgcgtagcc agggggcgca ggagggaagg 44161 aagccgcaga ctggcaaggt gagaagagac ctgtgctggc aagtccaggt gcggcaggtg 44221 aggtcgccca ttcccaagct cccaccttga cggtgcactc ggagcactta caaatgggtc 44281 tgccagtgga ttttaaccta ttttgcctgg ccccaggtgg cacgggaatc tggggacatt 44341 gttggttata tcctggctgg cagtgcgcat ccaactcaga gccaacctgg catagggccc 44401 agggcccaac caaggaacct ggcctggtct catcctcacc atggccccca tggagtagag 44461 cagccccaga accagctccc ctgtgcccaa agcagctggt gccctgtgga ctcgagccag 44521 caagggacac gctgtccccc tcccagtggg cacactggct gagaagtgtc cttccaggtg 44581 gccactcgcc ctgagtccac acaagttcct ggtgtcaggg cccttctctc tcaggcatgc 44641 agtggtccgt tcatgggcaa gtgcccagac accctctctc tctcggggac cccccatggc 44701 ctcaggctcc cacattcgtg ggcacagaca tggctcgtgg gtcaactagg gaagctattc 44761 tctggacact cactcccaga tgccagggga gcctccatcg gctcacgcat cccagctgaa 44821 cctggcccgg acctgggcgc gagctggcgc ctgcgttttc cggcctctct gcccgccttc 44881 tccaaccccg cccccggccg acgaggagcg cggcgaccct acggcgcgaa gccgggccgc 44941 ggagacctca cctgctgttc ctgagagcga ccggtgaccg atgaccgcgg ggtggcgccc 45001 ggatcgcctt cgcgcccgcg cccgcgccag gcgcctcggg gattctgtcg gcgtccgctg 45061 cgcgcgacgc gcctccacgc gaatgggccg ccgccgcccg ccttcttgtt ggccgcaccc 45121 ccgccccgcg cccgccccgc gcccggcccg gcccggcgag aaagccttaa ttggtaaaat 45181 tgcccaggag cccgggacgg gtgcgtgggg ggcgggggtg cgggggcacg ccgtgagctc 45241 cagcgacccg ccgccggcga cgccgccccc cgagatgagc tcaccgccgg cgagggccgc 45301 caggccctgg gggagggagg gcttcgtggg ggagtcgcct ccagcgccca cagggactgc 45361 agggctcgtg tctaggccca atcacaagga tcctgcgtgt ctgagtctgg gggagccaag 45421 cgcaccccag gtggaaaggc cgaggcccaa ggccaccttc tccaaggagc cacccacagt 45481 cacagcccct ggtgtctgtc ccaagccggg attgtttcct ggggtggggg caacaggtcg 45541 ttgcaggtgt tggctgggcg cctacgtcag gcagggcccc gggacgggac tgcagggctc 45601 cgagccctgg gatgaacctg actgccatgg atcaggcgct gccaggtctg gcctggggtg 45661 ggcagcagaa acaggatcgc tcagggggac gagaaagtta agtttgttta gtattaagaa 45721 gattctgatc ttaaagaaaa cattggccgg gcatggtggc tcatgcctat aatcctagca 45781 ctgtgggaag ctgaggcagg tggattaatt gagcccggga gtttgagacc aacctgggca 45841 acacagcaag acactatctc taccaaaaat ataaaaatta gctgggcgtg gtgacgtgca 45901 ccggtagtct cagctgctca ggagacagat gggaggatcg tttaagcccg ggagtttgag 45961 gctgcagtga gctataattt caccactaca ctccagcctg ggtcacacag tgaaaccctg 46021 tcactaaaca taaaaaaaaa ggccaggtgc ggtggttcat gcctgtaatc ccagcacttt 46081 gggaggccga agtgggtgga tcacgaggtc aggagttcaa gaacagcctg gccaagatgg 46141 tgaaaccccg tctctactaa aaatataaaa actagccggg cgcagtggca ggcgcctgta 46201 atcccagcta ctcgggaggc tgaggcggga aaatcgcgtg aacccgggcg gcagaggttg 46261 cagtgagccg agatcacgtc actgcactcc aacctgggtg atagagactc cgtctcaaaa 46321 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaggccgg gcgtggtgga tcccagcact 46381 ttgggaggct gaggcaggtg gatcacgagg tcaggagttt gagaccagcc tggtcaacac 46441 agtgaaaccc ggtctctact aaaaatacaa aaattagctg ggcgtggtgg caggcgcctg 46501 taatcccagc tactcgggag gctgagacag gagaatcgcg tgaacctggg aggcggaggt 46561 tgcagtgagc tgagattgcg ccactgcact ccagcctggg tgatagagct agactgtgtc 46621 ttggaaaaaa ataataaata taaaaataaa aatacaaata aaaaaataac gttaacagct 46681 cagtgaacaa tcatctactt gtcatagaca ctgaggaatg tttccatgtt gatatttttt 46741 cctcgtaact caatcatgtt gggattgtaa gttatacagc atatactgct tgcaagccat 46801 tccatcctta aggagcaact cagttttctt attcttgact tagaattggt tgcatcccga 46861 tgtctcctct gcctgcactt cccaggaaga accccctgag gccctcagat tttcctgcac 46921 acgcccttcc cagaccctct gatgactgca aatcccaccc tacccttgac gggaaactct 46981 ctgaggctgt ccaggtcatc acccccaggg accaggtact ggtgggggaa tgttccatga 47041 attaattgct gtactcaggg ctcttggaca agtctcccct gatcgcaggg tcctgactct 47101 ccctgtataa gatggaggct ctgctgctca ggagcctgcc tggcttccag tggtctggag 47161 tggagctgca ctgcaatggg gctggactta ggctccaggt acccacaggg tgggtgcagg 47221 gggaccatca cagttcagct tcctggtgcc atctgtggtg ggacctgatt gtggggcaaa 47281 ggggtgggac ataaaaggca gagtgtcgag gcaggtacag aacgggtggg gggtgctcct 47341 tagggagaag gcaaatctag aattccttga cctcataagg aaacagcttg gatccgtatg 47401 gggaggggtg gcatggggga ggatggagca gcacttagta cacagaggcc tctgggagag 47461 gggccaggat tgcagaaggc ttgcagaggt gctggggaat ctgattttgt caaaactaaa 47521 aaatccacaa ggcagcctct gagcagccca gagtccaaac tgccatctgt cccctcagct 47581 ctgaaagatg aggtgcaaac ccctgcccta cccaaggctc tcctgctgca cactgccagg 47641 cactgtcttc atggacccac acttccttcc atattgctgg ggacacagtg gggccttgcc 47701 aacagtgatt cctctttgtt aggatctttg cgtgtagtca aaacagggcc ccatctcctc 47761 attactcccc caggtgtcta cctcttccca ccccatgggc aaacctccat ggatgactgg 47821 tctgtgaccc tcaccagcca ggccagccat tctggacatg accagagccc tcccctcttc 47881 ttcccttgtc accagccctc aactgtgaaa ctacagatgg catgtcacct cttcatgctc 47941 ccagtttggg gcctacccag tgaggggcac actagttgat tccatacacc ctcagctggc 48001 tttgtttaga atgtgaatgt cacagccagg cgcggtggct tatgcctgta atcccagcac 48061 tttgggaggc cgagccggac tgatcacttg aggtcaggag ttcgagacca gcctggccaa 48121 catggtgaaa tcctgtttct actaaaaata caaatactaa atactaaaaa tacaaagcca 48181 ggcatggtgg catgtgactg tactcccagc tactcgggag gctgagacag gagaatcact 48241 tgaacccggg aggtggaggc tacagagagc caagattgca ccactgcact ccagcctggg 48301 tgacagagcg agactgtctc aaaaaaaaaa aaaaaaaaaa aagaatatga atgatgaatg 48361 tcacctcctt tctcttttcc atgagacagc caatatctga tcaaggccct gggaatggga 48421 atccatagct cgccacccac gtctgccccc tgcctagcca cgctgcgctc agggaccatt 48481 ccgacaggag aggacgcccc acttcagctc cccccagtgc cccgggtgaa tgaacggcat 48541 gaagggttag gcgggcacac gtcgtctctg cacgcatcgc acggcgccct ctgcccgcgc 48601 ccactggcgt gcccgtcggc atgcccgggc cccttcgaag actgcttgct tctggagtgc 48661 cagcggcctc ctggcccagt gccccatttc ctcccgggtg gagcgaggga ggaggaaggg 48721 aagcagaggg gtaggcggtg gtaagcagca ggcacatgtg cgcgggcgag agccccgcag 48781 actgggaagg cgggcccatg gcagggggtc tgcggggatc gcgtaggcgc gggccttgcc 48841 ctgcagccgc ttcccgggcg atgcgcacac aaagcgccct cggcagcgcc tggcccgggg 48901 cccgcgctct cttagggtcc gggcccttcc cgccgctgct ggggcggccg accgggatgc 48961 ggccggggaa ccctaggctc tggggccctg ggaaggggga cgacggcggg gagtagggat 49021 gtgcagggat cgggcgaagc ccaagcgact tagggaccag agccacccca agcgccgact 49081 tcgcccgctc gcacgtcccg ggtccctcgc gcgcaggccc cgcccctctc accccgccgc 49141 acgccacagg gtgacgtctg ggctcccagc cgcatcgccc tgactcccgc gcgggccccg 49201 ccccctgccg ctagccaatc tgtgcgtttg tgacttttgg gcccgcagcc ccgcctgctc 49261 ccacagcgat accggtttgc attgccctga ctcccgcgcg ggccccgccc cctacgccgc 49321 tagccaatcc atgcattagt ggcgtccggg ctcgcagtac cgctcgctcc caccgcgaga 49381 ccttctgctc cgcgcccgcg cgggcccctc cccctccatc gctagccaat ccccgttttg 49441 tgacgtatgg gctcgcggcc ccgctcgctc ccaccgcgag accttttgct ccgcgcccgc 49501 gcgggccccg ccccctccat cactagccaa tccccgtgct tgtgacatat gggcttgcgg 49561 ccccgcccgc tcccatcgcg agaccggttc ccaccgccct gactcccggg cgggccccgc 49621 cctctccgcc gctagccaat cctcgcgttg atgacgtttg ggctcgcggc cccagcctcc 49681 cagctctcag ggcacggccg gtctgtgccg gctgctcccg cggttaggtc ccgccccgcg 49741 cagcgcgcgc agcctgcgga gccagcggcc gtgacgcgac aacgattcgg ctgtgacgcg 49801 acaacgattc ggctgtgacg cgagcgcggc cgctcccgat gcgctcgtgc cgcccccgcc 49861 gtgctcctcg gcagccgttg ctcggccggt tttggtaggc ccgggccgcc gccaggcctc 49921 cgcctgagcc cgcacccgcc atggacaact acgcagatct ttcggatacc gagctgacca 49981 ccttgctgcg ccggtacaac atcccgcacg ggcctgtagt aggtacgcgg cggcgggcgg 50041 gaccccttcc gggccccctc ctcgtgctcc gcctcgcgac ctccccgctg ccctccccgc 50101 gcgccttccc cggcccgcgg ccctgaccgc cccgtgtccg gccaggatca actcgtaggc 50161 tttacgagaa gaagatcttc gagtacgaga cccagaggcg gcggctctcg ccccccagct 50221 cgtccgccgc ctcctcttat agcttctctg gtgagagcct cgcctgtggg gacagcctgg 50281 gacgcgggga ggatggggtc gcgagggtgt ggcagggggg ccggtcgaga gcggcactgg 50341 agaaagggga gggaagtctg ggggggcaaa cagttctgtc tcctcctttc aatccagact 50401 tgaattcgac tagaggggat gcagatatgt atgatcttcc caagaaagag gacgctttac 50461 tctaccagag caagggtaag gcaggggttg ggtgggcacg ctggcacctt cacccgactt 50521 cgtcagggac cccgctcaca gggaggacct gagacctcag tcccaaccac tccagcagcc 50581 ttaggaggga gaaactgtta caggtcccga aatgggattc agattagggc catcaggcca 50641 ggcggggcac accgatgccc cctctgctac cgctgccccc cttcccaagg ctacaatgac 50701 gactactatg aagagagcta cttcaccacc aggacttatg gggagcccga gtctgccggc 50761 ccgtccaggg ctgtccgcca gtcagtgact tcattcccag atgctgacgc tttccatcac 50821 caggtgagct ggctggcagg cgtcctgtac ttgggtacaa cctaggggat cgcggctgtg 50881 tttggataaa tccagggggg cactgggtac aaatggtggc tcttgggcct ccggggagac 50941 tctgtgtgac tagagcaccc tggtctggga tctaggctca gactcttcct gagagtcctg 51001 ggggcaaaag gggatgctgg ggcatgagca caagtggcaa ggccccatgg ataaagggct 51061 gaacacccag agccattcag gagggtgtgg gttcctggcc tctaaccaaa ggtcagaggg 51121 gactggctgg ggaagtttgg actgagggac atgacagggc catggtggcc ctgccagcca 51181 gtcccctcgc cctgactctc ttctgcaggt gcatgatgac gatcttttgt cttcttctga 51241 agaggagtgc aaggataggt gcgtagtggg ggagcccagg gacgggctgg ttctgggtcc 51301 aggctcctgg cccacttgct cccctctttt gcctcaggga acgccccatg tacggccggg 51361 acagtgccta ccagagcatc acgcactacc gccctgtttc agcctccagg agctccctgg 51421 acctgtccta ttatcctact tcctcctcca cctcttttat gtcctcctca tcatcttcct 51481 cttcatggct cacccgccgt gccatccggc ctgaaaaccg tgctcctggg gctgggctgg 51541 gccaggatcg ccaggtcccg ctctggggcc agctgctgct tttcctggtc tttgtgatcg 51601 tcctcttctt catttaccac ttcatgcagg ctgaagaagg caaccccttc tagagggagc 51661 catgagggtc tgggcttcag agctaggtct ttggggaagt cctggctgac tgccttagca 51721 gtgggggtgg gggtgggggc aggggcaggg gctttatgtg tttttgcttg gggggcgctg 51781 ggcctagccc agagtagtgc ttgctccccc tgccttgtcc caccagggag gcagcagact 51841 caggccctcc atggtcctct ttgtcatttt gttgacatgc attcctcctt ttgtcatctt 51901 gttgggggga ggggattaac caaaggccac cctgactttg tttttgtgga cacacaataa 51961 aagccccgtt tatttgtaat gcgttggctc ttcctggagg agagggttgg gctcccatgg 52021 caagggcctc tgcgtcttgg ggctccagga ttgcaatccg gctttgttgg gtccgcattt 52081 ttgctttagt ctggggatag gaatcaaatg ttacccagag atgtttgtgt tttgtttggg 52141 agttttattc cctaactcat tccccaaagc acgtgtaact gcttatacat ataatcgtgg 52201 tacaacaagg tatatacaga gaacccactt ggaaattcag gcaaagctgc atgcacgcta 52261 ccagcagtct gcgggtgttt taactggaaa aagctgaagt ccacctcggt gtccaatggc 52321 atggggatgg aaagaaaatg aggcgtctct ggcacatcat tctcagctcc tggaactgct 52381 gcttgtttaa catgggagaa aagctccaaa ggctgaaatg ccccatcatc cctgggtgat 52441 tgaattcacc tgcctatatt ctcacttagc tcccaggttt tctgccctgg gtatgtattt 52501 cataatcttc aaagtatata tatatataat tttttttttt gagacggagt catgctctgt 52561 cacccaggct ggagtgcaat ggcacaatct tggctcactg caacctctgc ctccgggttc 52621 aagcaattct tctgtctcag cctccctagt agctgggact acagtcgcgc accaccatac 52681 ctggctaatt tttgtatttt tagtagagac gaggtttcac cgtattggtc aggctggtct 52741 caaactcctg acctcaggtg atccacccat cttggcctcc caaagtgctg ggattacaag 52801 tatgagccac cgtgcccagc cgaagtttac tatatatatt atatatttat ttattttttt 52861 gagatgcagt ctcactgtcg ccaggctgga gtgcagtggc gcaatctcag ctcatcgcaa 52921 cctctgcctc ctgagtccaa gcgattcccc tccctcagcc tcccgagtag ctgggactac 52981 aggtgtgcac caccgtgcac ggctattgct ttgtatttta gtagagacag ggtttcacca 53041 tgttggccag gatggtctcc atctcctgac ctcatgatcc acccactttg gcctcccaaa 53101 gtgctgggat tacatgtgtg agccaccgtg cccagtctac atatatgttt ttaagtggaa 53161 agtgggtact aaaacccaag acaatgccac gtgttgctgt gcatgcctgg ctctatgctg 53221 ctggcagagg gcgggagagc acagaggggc tgccgtccac tgcacacggg caggtgggaa 53281 gcagaggccc agcaaggggt ggggtgctgg cattagtcac ccaagccttt ggctccttgc 53341 tgcaggtagt gggacagccc agcctcccac acttggcaca gctcccatgt agtgcctgca 53401 catgcatttg tggaggcagg gctggggctg ggaccctccc tggctactcc tgaccattta 53461 tcttggctgc atcctcgtct ctcctcacgt atatttttgg tcctaaaata gtacatggtc 53521 cttgtgtgtt gcaaacatgc agaaaagagt gaatcacttc taattttctt accactcaca 53581 gtttcagcct ttcttgtttc attgctatga gctcctctta cctggctcac cccgcagtgc 53641 ccagccactt gctgtgtgga ttccccaagc ccctgccctc tgcccctcac catcccaagc 53701 ccaatgcaaa gactgatggc tggccctgca gcaggcgaga cctggatggt ccatggcggg 53761 tggtaggcat gggcagggga tgccacctcc tcacaggtcc accttaggga ttcaatggtc 53821 ccattaatga gcctgtacat ggggggcaga acaaacaggt tgctactgcc aggaccctag 53881 aaggacgcag tgacagagcc tcctgtttca gcagctgata aaaaccaagt acactgacag 53941 gaagaaccga actcacctca gtccagttcc cccaggaagc actcactacc tgggattggg 54001 gctggactgc agcccagtgt ggctggacag aggcagatgg ggcctagcag gaacctacat 54061 gggacctggg gtggtggcaa ggttgacatc ccaaagctct ggagtaatag ctggtgtcag 54121 ggagatgggg gatgtggagc tgggcttggg gcatcccaag gagggaaaag atatccagat 54181 atcgggacga aatggtgctt tctaggcctt aggtcaggtc ctggatctgt tccaggtggg 54241 gagtaaagag gaggtagctg ggtgtacttg gcaaccactg tggtgtctgg cgtgagggaa 54301 gagagggaag gaaggaacag cggccacccc tgacgtagct tgcccctttt cattcattcc 54361 aaaagagctc ccaagattgg atggaagagg gctcccggag tccccacccc agttccccag 54421 ggaatgaaaa gggctgcgag actccctgct cctgagggct catggcgttc cccagggctc 54481 tgctcccacc acctccctga gctccctctg gattcctgag gcctcccact tgggcttggt 54541 caccaatccc cagggctcca tgggccttgg agtactcccc ccaccgtttc agggctcctc 54601 agggttctgc caaccccatc ctgtgatctg tccaaccccc agggcttcga accccatgaa 54661 ggatcctgac tattctggac acaccgaagt gcactgggtg ccccagccgg aaccagagga 54721 ttaggaaggt tccatgggcc tcaagagcct ccaggcctcc acttacatgt tctccagagc 54781 tctgagagac cactgtggcc ccactttcca aagttcccaa agaatctggg gccccccgca 54841 gggctccagc catgctcaga catacagcag ggggcagcag ctgcccaagg tccaagcgag 54901 aggccactct gggcactgtc caagaagata ctggattttt ttcctccgct accaccaggc 54961 cctgccagac cagggaggag aaccacctga gggggctggg agagtgtcga ggcagccagt 55021 ggagaaggag actgtggaga gggactgcag ggccagggtc cagaacccat ggtcctggca 55081 aggcccttga ccttggcccc agacaggtcc aaaagaggtg acagcgccaa gaccggccaa 55141 gaaggacatt tgccttgacc ctcagctggg ttggaggtac gaagggtgtc tgcggggctg 55201 ctaccacttt caagaagaga gtgccagggg tgtaggacct gtggcttcat gagcatggca 55261 cactcctgac tctgatggca taaaggaaag gaaaacggca gaggaaccct gcctgacttc 55321 cacctggggc tgggctgctg gccccaggct cagggcggcc actcactggg agacacgggg 55381 caagcaagac acagaaaggg gaagtcccac ccccagccgc agccactcta ctgctccccc 55441 tgggcccaat ggcatagagg taaaggggcc aagcacactc tggaagcctc taaaagccac 55501 cagctgaacg gggctgtgag gcagcagagc caggcccaga aagatgcagt gccagttggg 55561 aggacgggag gtccccctga ggctgcgact tgtccgctca gcccactgtg tgcccaggat 55621 tctgaacctc tggggcactc cctcctgtcc cgagtcaccc agggtctcct gccccctccc 55681 tgtgtctgct ccatcacatc ccctctcaag ccccatgcct ggcagggcca ccgccctagg 55741 gctgggctag gtgaaccaag ggcaccggtc ccccaaacct gccgatcgtc cccaggctgc 55801 agggagggac acatgtgggg cggcaagggc agaagataaa ctagggggat ccgggcccct 55861 gaggacagcc ctggcgggag gccacaggcc tcacagggtg gtgagctgga ggccacccag 55921 gcaggcaggc aggcagcagg gaaagagact ggcacggtcc tggccagggc cccccccggg 55981 aagggcgggc aagtaggagc aaaaacaaga gcagagcacg ggcccccacc tgccgggtcc 56041 tatggcctgg ggactcacac acactgagga agacccagga ggacccgcca ggctactagg 56101 aggagataaa gggtgcacag gcctccaacc cgcaggttgt gtctggactg agtccaggga 56161 gggcagggag gggcggtagc ccagttgacc ttccctcccc caagccacac ctctgcaccc 56221 caaccctccc ttccgcctcc caacagcccc atccccgctc tccgaggaaa ctgggagggg 56281 acctggagag gaaagggaag caacggcccc tcagagcggg gttccccaaa atccatggcc 56341 agggactgag gccagcactc agacactgtt gggaacactg gccaggagtc cctcggggtg 56401 gacctgggaa cctgctgcag aagcttaggg gaccgggcca gctcaggaaa gggctcaggc 56461 tggaaaccga gcagacaggg caccgctgga cctggggccg aggaggacag gaagtaggtg 56521 actatggaaa gacccccgcc gccacccacg cccaactcgg cacacaaagg cccagagagg 56581 cccccagaca tgcctgtggg ggcaggggtg tttgcaaaca tttgcagagg gaccgggtgt 56641 cccaggcaca tggacagctt gcagggcccg gctgaggcta ggacagcagc aggtggctgg 56701 caggaggtgc cccgccaggg tccccaactg tgtggggggc aggcccgagg caggcggcac 56761 cagtcccggg caggatctct cggaagaggg gccgtaccca gccagggttc ccagagacac 56821 gggtggtggg gccgagtgcc ccctcaccct gtgcctggcc cccaacaaag ggtgagcatg 56881 tatccttcct ggacacggtg tccagggcag ggcactcccg aaccaggatg ccccctaggc 56941 cacaggaccc ctcagcccca gctgacagca gccggggggg atctggggag gcccccagga 57001 ggcagggcgg cagccaagca gaacaccatc cctggaccca ctctccaact gcaaccaggg 57061 cccgctcccc cgaaggcacc agctctccct aggggcagga ccttgatcca gagcaggagt 57121 ggaggccccc aacaggggac ggctggagag gccctgggct cacgagccag ggagcggcgc 57181 ctctgtgccc ggaagcccct gggtcctgcc tgggctgctg gtagctgctg cgggccgagg 57241 ggtccggacg cccctagggt gctctggggc agctcctttg ggaccctcaa tggagccagt 57301 aagtccagga gggcgccgat ccgtccaggt gggtgcatgg gccgggacag ggtctggctg 57361 ctgtcgtggc tgtgcccatc tgccgaacag gtcgctagga gagcactccc tgagcgaggg 57421 cgccgggcag cccggggagg gaggacacag tccccgtcca ggcccccgag gcccccatag 57481 cagggagagc cggccctcct cgctgttctt cccgcgggat cgggccggca cctcggctca 57541 ggggagagag ggtcacagag gacagccagc aggcaggtgg cccaggagtg tggggggtcg 57601 gtgggcctga ccgtgtcctg aagcacccca ggaccctgta cccgacaccc tgggggaggg 57661 gctgtcccca cttgccagga acacattcct cagggaaggc ccggcaacct gctctcgtcg 57721 ggttcaagaa gggccaggcg gctttggaag gggccaggag ctgtcaccaa gggcccccgc 57781 attgcccaga ggcggcgcgg gccaacagga aaagcactca ccgcccacgg aacctctgtg 57841 acaaactcag agccacggag agcgtcatca gcaagtcact tgccagtccc agtgggaagg 57901 atggggcacg tctccaggga aacgagacac caaggtcttc ctggggcccc agtggaaagc 57961 tccttccagg acccacctgg ccctccacat cccaccctcc aggccaggga ccggagccca 58021 gtatccggca acctgacaca tcacggagaa aaggagctgc tctgccttca ccatgacgac 58081 gcggcacacc ccctccctgc cccggcccct ccccttcagg gctgcatgcg gggaggagag 58141 gcctcccctg tctgcagaga gaggctgccc cacctggggc agagagggga caggggcagg 58201 aatgagcccc tgggaggggg gcattggagg tggggggcac atcctgccac ccgggctggc 58261 gtggggcact catgccatgc agaccagaac gggcttccca ccggctcctg gggatgttgg 58321 accggggctc cccgaccagg gctgggaaag gcttccatga agagccagag agcacacggc 58381 gcagaccctg cggggcacac ggggtctcgg tggcagagtc tcccttcttg gggttctctg 58441 tgatgggttt tgcaaccctt tcaggatgtc aaagcacgcc ccgccctcag aaatgcaggc 58501 cccggccagt gtcagcccca ggaggtggcc tgctgctccc gctgacacca aggacatgtt 58561 cctgaggcgg cccgctcggg atctccagca gccgacgcag gtctacacgt gcaagggtgc 58621 ttctgactcg cagcactcag gaacagcctc caccctgagc gtggcgccct ctccagcacg 58681 cgccagggtg gacctgtgag accaataaaa accacagcgg aggcgaggcg agtcacttag 58741 gagaccaggc catgaagaga ctgtggccgg gtctggggcg ccctctcctg ctggttcgac 58801 tggattgcag gtccggggga agcccagcag ccctggggag gaggcccggt ggggaggaac 58861 tgaggcctcc tgccacagcc tccagaggga gagtcatcac gagggctccc cacccagtca 58921 gataggatgg ctgggcccca gtggacacct gaccgcagcc tgagagggac cctgagccac 58981 agtcctggct ccccgagagc cctcccagag cctgtccctc agaacccttg ggggggtgac 59041 acgagtgctg ggggggtgag acgagtgcta ggtggggaag gaccgaaacg tgaagtcgtc 59101 tgttacccag tagtcggaac tggtgcccag cgtccagggg gcaagacacg gcgtgagcca 59161 gaaggaaaca aaccgacaag aacccccaga agccatcagg aaagcctgga gattctgatg 59221 acacgaaaac taaaatcccc tctgtgacca agcacaaccc gcaaaccaca cagtcaagag 59281 acaagtgata ataaggggag aaaatagtag caagaggtga ctaagggctc gcgtcagtca 59341 cacacacaca gagccggaca cacggacaag gcaaagtgca aggacaagag agcaaaaggg 59401 ggcggaggag aggaagggag agggaaggaa gcgtcttctc caccgggatg caattccttg 59461 ggagcccgtc tgaccccagg gcaaagagat gctgggtcgc cagagccgcc gcggagaagt 59521 cccgccgggc ctcagagccc agcaaagagg aaccagggcg gacgggccaa acccagcggc 59581 cggctcccgg gggggtgggt ggtggcgctg cggggagtcg cctgagcagc atcaccagca 59641 gctgggaacc ctcccggtgc gggaaggttg gcgggtaagg cggggcttcc ccggaaacac 59701 tgggctggac ccagcggtcc catgggtgct ggaaccagcc ctgcgggtgt tcccaggcac 59761 cctcccgcgg gaagaccggc tggagggatg gctcctgttg gcacccacgt cgagggaagc 59821 caccgccttc tctgcggggc tcgcctttcc tacctccccc aggagacagt caggccagga 59881 agcgccccag cccacgaccg cccgcagcag agcaggattc cctccccggc ggccgccgct 59941 gtcaagggac ggggaagact ttgttacccc accgcgcgcc accgcggaat ggtgcacaga 60001 cacctccaga cgccacttcc cccaagacgg ccgcccgccc cgctctgccc acctctccct 60061 ggacgctgcc gggtgggcgc ggccctgctt ccccacgacc ccccagacgc gccccgaggc 60121 actttcgcag cacagtggag tgggaagggc ggggcggccc ctcgacggcg ccgaggcgag 60181 agcggctcac ggggcttccc gggccctgat ctcaggcgac gcctgtccgc ctcgtccccg 60241 ggggaagaca gtgaccaccg gtctgcaaag gaaggggact cggcgcatgc cccgcagggc 60301 cggcccagga ctcgcaggcg cggaaggcgg tgccgaggcg ggggcggggc gtggagggac 60361 agccctagtc tccgcccgcc ggggcccaca gctgcctcgc tccccgcctc cgcttcggtc 60421 gctcccggta agctccaggc gcccacggga agccccgccc agccccgcct atccgtcctc 60481 ctcttcgcct gtcctcgcca atgaccatgc tcagattgca cggaattccc gcccctcgcc 60541 cttgactgac gctcctgctg cccagtggac gacaccccgt cgcgtgcgcg ccgctgcggc 60601 cctgccgggt gtggcgcggc gctctcaggc tcctaggcgt tcccggggcg agacccaggg 60661 cgctggccgc agagtcgccg gtgcacacgc aacagtgccg ccttgtgttt tgggtgtgga 60721 cctgcgcttc tctaagcgtt ccattcccga gaaggggcga gaaacgtcca gaggtcgtgg 60781 cacctgcccg ccgctgccgg ggcgtcggcc gtgggcgcaa gtgaaagcgc cccgggagcg 60841 cggcgcgacg tccgtgcctt ccgagggcgg cccgggtgtc gccgcggtgc gcgcgggagg 60901 gcgcggccgt gggcatttcc gcgcttgcgc gggttcctgt cccgccccgg ccccggcccc 60961 ggccccggcc ccggccccgg ccccggcccc ggccccggcc cctccctctc gtggccccgc 61021 agtagacgca gcccctccct cccgcggtgc agcgcagcca gggctctgct ccgggagtca 61081 tggagtccat caccgcggag cgttatctca gcggacccac gcggtgtcct ccgctggggt 61141 gggccgagta ccctccgcct cccgcccgcg agatccatcc cagcccctgg cgggggcccg 61201 tggttcttcc cgctcatggc cacggaacgg tccacgctcg cggagccgca ggctgagctt 61261 ttgctggtga gggacaaggt cgtggtttcc cgtggctggt ggccctcacg gaggagcctg 61321 ctgggagcac gcccgtgtag ggtttcctgt catcgggttg ccacgcccat cttgaacgct 61381 cgagtccacg cagaagtgta gaagtggatg tgtagcttta tttgtaattc tctttctttc 61441 ttcctttctt tccttctttc tttccttttc tttctttttg agtcagggtc tcactctgcg 61501 cccagtctgg agtgcagtgg cgcgatgagg gctcactaca gccttgacct ctccacccca 61561 gcgagggctt caacctccag cctcagcctt ccgtgtagct gggactacag gcgcacgcca 61621 ccaccatgtc cggctaattt tttacttttt gtagagatgg ggtctcgctt tttcgcccag 61681 gttggtctgg gattcctggg ctcaggtgat ccaccccagc ctcacaaagt gctgggattc 61741 tgggcaccgc gcaccggccg ccgcgcccgg cttgatgtgt agtttcaata gaaaccacca 61801 aggtcttcag ggtggcctgt gcatcacttt acacatactc aagcgaatca ttgagaggtg 61861 tttgaggccg ggcgcggtgg ctcacgcctg tgatcccagc acttcgggag gccgaggcgg 61921 gtggatcgtt tgaggtcagg agtttgagac cagcctgacc aacatagtga aactctgtct 61981 ctaataaaaa tataaaatta gccgggcgtg gtggcgccct cgcctataat cccaactact 62041 caggaggctg aggcaggaga atcgcttgaa cccaggaggc agaggttgca ctgagccgag 62101 atcgcaccac ggcactccag cctgggtgac agagcaagat tctgtctcaa aaaaaaaaaa 62161 aaaaaaacgt ttttgaaaca acttgaagca agttatccca tgtataaaaa agcatgaagt 62221 tatttatact atttttctgt gcaatcaccc acataattag ccaagttaag acccattatg 62281 cggccgggcg cggtgctcac gcctgtaatc ccagcacttt gggaggctga ggcgcgtgga 62341 tcacgaggtc aggagttcga gaccagcctg gctaatatgg tgaaacctcg tttctactaa 62401 aaatacaaaa attagctggg cgtggtggcg tgtgcctgta gtcccagcta ctcgggaggc 62461 tgaggcagaa gaatcgcttg aacccgggag gtgaaggttg cagcaagccg agatcgcacc 62521 actgcactcc agcctgggtg acacagtgag actccgtctg aaaaacaaaa acaaaaaaaa 62581 accgttatgc tacacgtttt gtagtgattt tgttttgttt gttttagctt ttttataatt 62641 tcaacttttt tgtttttgag atagagtctc actcttgtcg cccaggctga gtgcaatggc 62701 accatctcgg ctcattgcaa cctttgcctc ccaggctgaa gggatcctct cacctctcac 62761 ctgggttggt caggctggtc tcaaattcct gacctcagtg atgcaccagc ctcggcctcc 62821 caaagtgctg ggattacagg catgagccac cgcgcccggc ctgatgtact ttaaactttt 62881 ttttttcttt ttttgagaca gagagtttca ctcatgttgc ccaggctgga gtgcaatggc 62941 gtgatctcag ctcaccacaa cctccacctc ctgggctcaa gtgattctcc tgcctcagcc 63001 tcccgagtac ctgggattac aggcatgtgc cactgcaccg ggcctccatt tgatgtactt 63061 ttaaagcatt attatttcta ccttgcattt actatgaaca cagccttgaa cgtgcatcca 63121 tctctctcta gaataattac ctgtaagtgg aattgctggg tcgagatata tgtacattaa 63181 aaaaaaaatt tttttttttt tttgagacag gatctcagtc tgtcgctcag gctggagtgg 63241 agtggctcga tcacagctca ccaccgcctc aacctcctgg gctcaagtga tcttcctgtt 63301 tcagcctcct gagctgctgg gaccacaagt gtgtgccacc atccctggct tttttagaaa 63361 acttcttttt gtacagacag ggtctcactg tgttgcccag gctggtcttg aacccctggg 63421 ctcaagtcct ccctccttgg cctctcaaag tgctggtttt acagatgtgt cccaccttgt 63481 gtggcctgca tttaaaattg taagagacac tagggaattg cctcccctaa agaccctacc 63541 tttgatactg tccgtccctc gcattgtacc ctcagcgttt acccacagcc tcaccagcac 63601 cttttgggct aacagctttt taggtagctg aaaattagat gggagaactg tagcttttcg 63661 ctttttcctt cttccctgtt tcctagtcct gttcccctcc ctcccaaagg ggtcatttaa 63721 aatgtggtgt catttggaat gtgtttgttc aattgtattt tagttttctg agacagggcc 63781 tcgctttgtc acccaggctg gagtgcagtg gtgtgatcat agctcacggc agccttgatg 63841 tcccagactc aagtgatcct cccacctcag cttctggagt aactaggact acaggcatgt 63901 gccaccacgc cagctaattt tatttttttg tagagacgag gtctcaccct gttgcccagg 63961 ttggtctcct atttctggct tcaagcgatc tgcccgtctc ggcctcccaa agtgcttgga 64021 tgacaggcgt gagccactgc cctgcacctg tggatttttt gtgtgtatgg ttttgcccac 64081 cgtaactgtc cctgtgctgc attcctattc tgctgactgc ttccgtcatt caacagggtt 64141 tctcagctct ttgtctgtgg ccctggagac gtacagttga ctcctgtgct cctgcagagg 64201 atttcagagc ttggatgcaa cacatttgat cattcccacc ctgagtgatg caaccccagg 64261 gcttcagctc ctggtgtgcc acaaaccatg ccacgaaggc catggcacgg aatgtggcct 64321 tagcgagaag ggagaattgc tcttgctgag gttatcaagg ggtaggattg ctgggttcaa 64381 gaacactgat gtttcatttt cttatgtttg ttctgttctt tttttttttt tttttttgaa 64441 agagtctcac tctgtcaccc aggctggagt gtagtggcgc gatctcggct cactgcaaca 64501 ttcacctcct gggtttaagc aattctccca tctcagccta ccgagtagct gggactaaag 64561 gcgcacgcca gcatgcctgg ctagtttttg catttttagt agagacgggg tttcaccgtg 64621 ttggccaggc tggtcttgaa ctccggacct caagtgatcc ccccgcctcg gcctcccaca 64681 gtgctgggat gacaggcgtg agccaccgcg cctggctcct ttcccatttt ctaagggaca 64741 gtcagtctcc tccataccat ttgtcacaca gcctttcccc catgatggca ctgccatact 64801 cccgctaaat gcgcaggcat ctgtggtctc ctctgcactt gtccttccag ggaacgatgc 64861 agtgggctcc ttgctttgca tctcgccata cctggtgcac ctgccagctc ccttagcagg 64921 accttgcaga cagtttcagt cattcttggg cattttcttc tccatgtgaa gtttcccatc 64981 atcttgtccc ggaccattag acacccacat ccctggcttc ctgggactct ggtggcgtga 65041 tgggaagagc ccacttctta atattgtttt tttcaagagg atctcattct tagtccaggt 65101 ggcaggaatg gtttcccctg ctggcctcag ctctgttggc aagggatacc tgcagtctgg 65161 ggcctggagt tggagcgctc cagacacggt cccttccttc cgggagtttc tttctcgtgg 65221 gtgagccaga agcaaacacc tcatcccaca gttctgaggg cctgggtggg gccatcgtcc 65281 tggtccaggt aagagaccat ggagtctggc caagcacgtt ggctcctgcc tgtaatcccc 65341 gcactttggg ttagggttag ggcaggagga tcacttggct ccagcagttc aagaccagtc 65401 tgggcaacct aataaggccc catctctaca aaagaaaaaa aaaattatcc aggcatggtg 65461 gcgtgtgcct gtagtcccag ccgattggga ggctgaggca ggagcatctc tggagcccag 65521 gagttcgagg ctgcagtgag tgttgatcat gccactgcat tccagtctgg gcaacggagc 65581 aagaccctgt ctcaattaaa aaaaaaaaaa agccgggcgt ggtggctcac acctgtaatt 65641 ccagcacatt gggaggctga ggcgggcaga tcacctgagg tcaggagttt gagaccagcc 65701 tgaccacacg gagaaaccct gtctctacta aaaatacaaa ataagccagg catggtggca 65761 catgcctgtg atcccagcta ctcgggaggc tgaggcagga gaattgcttg aacccgggag 65821 gcagaggttg cactgagctg agatcgtgcc atggcactcc agcctgggca acaagagcga 65881 aactccatct cagaaaaaaa aaaaaaaaaa aaaaaagaca gtggagtcct agatggcgag 65941 ggtagcaatg gagaaaccca cttcaatttc tttgagccaa cagttggaag gattcccgtg 66001 ggaaggtcag gagggagagg agccaacagg tggaactccg acagttcccc cgtcctcacc 66061 ttgtagggct gcaagattcc ttggtgcctg tctgggctcc agggccccca cccacccctg 66121 caggcttatg tgcggatcag caagaggcac tctgctctcg gggcaggtac tcggctggag 66181 gcgaggacac gggagggatt tcagccctct cgctccccag gaagatgtcc ttgtgcagag 66241 aacagcctgc acgacctact agggcaccca ggattaggct gtatcctccg tctgcccaaa 66301 ctggctacag aaagctcatt ttaacgccca ctagagagtg aggagactgt aggtgcaatt 66361 tcttttttct tttttttctg agacagatga gtcttgctct gtggtcccag gctggagtgc 66421 agtggtgtga tcttggctca ctcaaacctc cacctcccag gttcaagcca gattctcctg 66481 cctcagcctc ccgagtagct gagattacag gcatgcgcca ccatgcccag ctaatttttt 66541 tatattttta gtagagacgt ggtttcaccc tgttggccag gctggtctcc aactcctgac 66601 ctcaattgat ccacccacct cagcctccca aagtgctggg attataggcg tgagccactg 66661 cacccgccct gtaggcagat tttttttttt ttttctgaga tagtttctct ctgtagacta 66721 ggctggagtg cagtggcatg atctcggctc actgcaacct ctgtcttccg ggctcaagca 66781 attctcctgc ctcagcctcc caagtagcta gtattacagg tgtgtgccac cacacccggc 66841 tcatttttgt atttttagta gacagggttt caccatgttg gccaggttgg ccttgaactc 66901 ctgacctcag gtaatccgcc cgcctcggcc tcccaaagcg ctcggattac aggcgtgagc 66961 cacagcgccc ggccatacac gaattttttt gagacagtct tgctctgtcg cccaggctgg 67021 agtacagtgg tgtgatctca gctcactgca atctctgcgt cctgagctca agcgatcctc 67081 ccacctcagc tgggattaga gtgtgccgcc acgcctggct aatttttaaa tatatatata 67141 tatatatatt tttttttttt tttttttttt tctgagacgg agtctcgctc tgtcacccag 67201 gctggagtgc aatggcgtgg tctcggctca ctgcaacctc cgcctcccgg gttcaagcga 67261 ttctcctgcc tcagcctccg aagcagctgg gattacaggc accagccagc atgcccagct 67321 aatttttgta tttttagtag agacggggtt tcaccatgtt ggccaggctg gtctcgatct 67381 gctgacctca tgatccgccc gcctcggcct cccaaagtgc tggaattaca ggcgtgagca 67441 cttgcgccca gccaattttt agtttttaat ttttgtagag gcggtttttc gccatgttgt 67501 ccaggtctca aattcctggg ctcaagtgat ccgcccacct ccgcccctca aagtgctgag 67561 atcacaggcg tgagccacgg cactccgccg caattcctga cttgtactgc cccagtggtg 67621 ttacagcctc gaacccagtt tccagaccta agtcaccgcc ggcatccaga gtcccatgat 67681 ggaaagggag gctaggttcc cctgggaagg accttgcacc actgctgcaa gtgcacccca 67741 gaaatcgtcc caagcttttc tcaaagggac ctgcagccat ttataaaagc gacatatcat 67801 cctggaaaac aggtgcgctc gagcaggatt tcctcccgtc cttcctgtca aaggacggga 67861 agactttgtt accccaccgc gccccacctg cagaatggtg gacagatacc tccagatgcc 67921 acttccccca ggacgcccgc ctgctctgcg cacctctccc cggatgctgc cccgtgggcg 67981 ggtgggggcg gccctgcttc cccacgaccc ccagacgcac ccggagggga ctcttgagca 68041 cagtggagtg ggaagggcga ggtggggcgg tgcccaggcg agagcggctc atgggaggcg 68101 gcgcccgaga cgcagctggt cgggacggtg cgggtcaggg tgggcggacg gggctagaga 68161 tgccccgggg tttcccaggc catgagtctc cgtggagatt tctcctcgac ctcttccccg 68221 cggcaatgtg cgaaccctgg gtctccagga aacggggata cggggcatgg ctcccagcaa 68281 ggcctggtcc agcctctccg gtaggggaat gggtctcccc ctccggcctc ccgggttgac 68341 aaaggaacgc gggcccagat ccccgtatgg cgcttcaccg ccggggcctc tagcctagaa 68401 ggaggcacgg agcgcgtgtc cgagacccgt gcaagctcag ggacactctc gcggtcgccg 68461 ggaggcccac ctagggtact ttcctttttt ccactctcag aaatatacgt ctgtcacagt 68521 taacggcaaa gcctagggca agagttctac gcccaagatg gccagccgga agcgggcttc 68581 tcgcgaccat gtggcgaagc cccattcgtc agctggccgc ccgcggccct ggtacccggt 68641 cacctctctg atctgcgcat gtgctgggct acgcccgggc gcaaggccaa gagcggctgc 68701 gtctatggtc atgacgtctg acagagcgtc cacccgtctt cgacaggact ctatggttct 68761 tacgcgcgca gacagaccgc ctatataagc catgcgcagg cggaggagcg cctctttccc 68821 ttcggtgtgg tgagtaagcg cagttgtcgt ctcttgcggt gccgttgctg gttctcacac 68881 cttttaggtc tgttctcgtc ttccgttccg actctctctt tttcgttgca gccactgaag 68941 atcctggtgt cgccatgggc cgccgccccg cccgttggtg agtcttgaat ccgtgtactt 69001 tcactgctgg gaaacgggcg gggaaagaag tgcctatggc ccgctgaaaa caattgtggg 69061 gtggagcctc ccccgtgcgg cggccctgtc ttgggaactg accctatgtt ttacacctcc 69121 cggctatttt ttagtctgca atattactgt ctgttccttc gttcccgtgt cggtgtggaa 69181 gcgacggttc cctcgtatct ctgcctgtgt cctgcaagct caccgcattt tcgggcgcta 69241 gatacgctct tggggccttt gtgtgcgttc tctgtcttat ttccgggcga cgtgccgcgt 69301 gcgttgtcgg acgtgaaggg cagtccggga aaaacgggtg cggccgcctc ctgtgtccta 69361 cagggggcgc cagacgcatt tccactcggc ttggaggtgg atttagtgcc acgtgcccga 69421 aagtcttaaa ttgggtgacc tgagctgtgc aatgataatg cggcgatttt gtagtacgca 69481 gtgtcttgaa gagagaattt ttaactagga aagtttgttg caaagtgttt ataggaagcg 69541 taagacaaag taacggaaga tggtgtctgt tgtttctagt cttgggtgtt gtctgtgttg 69601 cagcagccaa ctgttgcttt gtagtttatt tccccgaatg gaaacggttt agaagtggac 69661 gtgcattccc cacccttttc ccgtccctcg tttgggttgt tccttgaggg gcaaagtgcc 69721 tgttgggctt tctgtgaacc tcacctaacc tgtgtttttt cactcccctg cagttaccgg 69781 tattgtaaga acaagccgta cccaaagtct cgcttctgcc gaggtgtccc tggtaagtag 69841 tggaagagcc cctgcactgg ttggctctgc ggactccgcg tccgtctgtg acaccccctg 69901 cacacttacc caatcctttt agatgccaag attcgcattt ttgacctggg gcggaaaaag 69961 gcaaaagtgg atgagtttcc gctttgtggc cacatggtgt cagatgaata tgagcagctg 70021 tcctctgaag gtaaggcagg attctttgtt cgtcaccccc cagtccttcc tccgtgctcc 70081 ctcaacccca cccacataca ctgcactgga attgggagtt gatgaaacat gagccttaca 70141 aaactcagcc aacacagttc ccctgagctg gagatagtcg tggtgaatgt ttctctatta 70201 tcttctctca ctttgctgct tttcttctcc ctacctagcc ctggaggctg cccgaatttg 70261 tgccaataag tacatggtaa aaagttgtgg caaagatggc ttccatatcc gggtgcggct 70321 ccaccccttc cacgtcatcc gcatcaacaa gatgttgtcc tgtgctgggg ctgacaggtg 70381 agcttggtct gggcctttta aggcagttgg agtctctaca ttattaggct tgcattattt 70441 tatcagagca tagaggtggc cccagtgact cagccactat ggctactaga aaagccaggc 70501 tggcaagtga ctttcagtgg tcactcagga ccccttcctg cagatacaaa caaagcatga 70561 gtaagtctta gcaagctctt ccccacaggt taggggaaac actggtgatg ggagtagctc 70621 ttcttgcttt tagctgaatg acaaccatct tgccagcagg tagctgaagc tggcagagga 70681 gcccagtggc gccttttcag tggttcttgg gatctccgca gccaattaag ccgactgagt 70741 tcctttcctc atggggaccc agtgtgcgat ggctgcacac agcagcttcc ttggtagtgt 70801 acgcagcctg ttggttgtat gggttgctct aagggacctt ggagacaggc ctttcaggtg 70861 gatgttcatg tttctgacct tgcactaccc caatgtaggc tccaaacagg catgcgaggt 70921 gcctttggaa agccccaggg cactgtggcc agggttcaca ttggccaagt tatcatgtcc 70981 atccgcacca agctgcagaa caaggagcat gtgattgagg ccctgcgcag ggccaagttc 71041 aagtttcctg gccgccagaa ggtatgtagt gctgcagccc ccttctccca cctttgcccc 71101 aggcctcctg actcagttct ttccattgct ccttagatcc acatctcaaa gaagtggggc 71161 ttcaccaagt tcaatgctga tgaatttgaa gacatggtgg ctgaaaagcg gctcatccca 71221 gatggctgtg gggtcaagta catccccagt cgtggccctc tggacaagtg gcgggccctg 71281 cactcatgag ggcttccaat gtgctgcccc cctcttaata ctcaccaata aattctactt 71341 cctgtccacc tatgtctttg tatctacatt cttgacgggg aaggaacttc ctctgggaac 71401 ctttgggtca ttgccctttc acttcagaaa caggttgaca actcagccct gctcatgagg 71461 cagcaaaccc tgcaaagggc tgggactggt ggccttatgt cagttgtcta ctctggagct 71521 tgacttggac ctccccaggt cctaggcagt aggttgaaaa acactgaagt gcttttcatg 71581 aagcacagct gcagcaaagc cttgcaatcc caggctgggg tcagcctaca gttgtgttgc 71641 ttattacaac acatgcggac caagaggggc ttgtgggcta gaggctgacc agcagcgttt 71701 atttagcaag ggtaggtgtg catcacattg ggcttgttct cacccatctg gtttggccat 71761 tcctccttgg tgggaatcat ccaggtactg ctgaggtcac ctgcgatttg ccccatttcc 71821 tatctctagc aacctcctgg gccccatgcc cccacccctt ctagaacctg cattcccagg 71881 gccttcacca cctgaccaaa ggtctaggct aacctttggt catttgtaac aagacctcgg 71941 aacagacacg tgtgtggcat ggtttggcct ggggatctta gatgtctgac ctgaactatt 72001 gtagaacagc gctggctttt gggggagcag caaaaatgag aggagtgcta ggtgggtggc 72061 ctgagcatct gtatccaggg acaggactcc aaaggctttt ggtcccagag ctggggtatg 72121 ttggccccag cccccagcct gtggctccca aaaggcctct ggttttttgt aatctcagtt 72181 tacagccatt tcttaggttt ttaattacct ttattttatt ttgccaaaca tacctgggaa 72241 taccttttat ttttttttta ccttggggtg atggttccaa accataaatg tgattatagt 72301 taacacatga cccttctagc gtcccagcca gtgtttttcc tgacctctct tctttggaga 72361 ggaggatgga agggaggggt ccggcatgct gctggcattt tgctgtgtcc tgcagcccct 72421 ttccgggaca cctgggttca cacagctttt tagcttacat aactggtgca gattttctgt 72481 gtggagatgt tgccttgacc agccttggct ggacctttac caggcatgca gaagcctgta 72541 ccaacacaga ctacagcacc caggaggtgc gagtgtggct gctcagcggt tataacaggc 72601 ctgactgcat tgttcaccgg attataatga gccaaaatgt ttcccggtgt ttgctggttt 72661 cagggaagga gtttgatata gcagattaac caccctcctt gtagctattg gggcttaatg 72721 gtttcctggt gattcttacc aatccacaat aaacatggcc cattggcata tctgctgcac 72781 aagtgtccta tctcaccaat ctgggttttt gttctcagta actttccttc ttgtcataca 72841 acatcttcat tcctctttct gaaccctccc ttcccctacc ccaacccaga gcccactttg 72901 tctccactcc tgatactaca ctacctggca ggtggcatga gtgcagggcc cctggcttcc 72961 tctcctaatc taggcacaag cccaaccaaa gaacaagagc caaatcaaac aaggcaggca 73021 ggggtggact acagtcacag ggcaactata gttgaagccc cccagcccca gggctggatg 73081 gacgggggag gctggggttt aagtcccaaa aggcagcagg ccctgggggg gtagggggac 73141 gctcaggcag cagggcacag ctgaggggac aggagtgata gcagcaacag aacagtgagg 73201 ctgagaggct ggacgctgtg cgcctggctc agcttcagct ccacctccac ggggtagtgg 73261 tcactgatgt tgagggcctg ggaatgggtg gtggggcagt gtcagtgccc ggggccgagg 73321 ggcttccctg tgaccctgct gctgccctcc ctcctcatgc cagccccatc ccttcacctc 73381 ctcctcggtg agctggaagc tcgtggggaa gtcaaaggca gccgcagtgt gcagcagact 73441 ccggcagcgc tccccgtgca gcacgacgcg ggcataggtg cagtgggtgc tggcccgcac 73501 tgtggtgtcc tccccatcgg caatcaccca gtggaagcct ggctcagtcc gcagctccag 73561 cttgtccagg cgctttttgg tcagtgaagc gcagtcagca ttgaagtccc caagcaggat 73621 cacgtcctag agacacagca gccatcaggc tggccgctgg gccaccctat ccttgtccct 73681 cctgtacaaa ctgcccaggc ctaccttgct ctgccagtgc tgggagacct ccagaaacac 73741 atcgtagagg gcgttcagct ccttctctac ggccttagga gtggtgtgca gcgggaccaa 73801 caccaggctg ggaaggactg caaggcccag ggaccgaggg ctgccatcca ctcctggacc 73861 ttctccccct agcccgtctg ggtgaggaag cccccagtgg agccagccag aactcccctt 73921 ctgctcccac ataccaagcc cacccttccc ttcctaccat tgctgggcaa agagaactgg 73981 gccacaaatg gctcccgggc aaagacgtca tcctcatcgt tgtacacgta ggaactcagg 74041 acctgtgttt tgtgtgacct gggaggacaa agggatgggc agcagtggcc ccacgtcttt 74101 gattttggct tgcaagtgcc aaacggcgct cagggagggg ccctccacta acagcggcaa 74161 tgtgggcaca gccctcccaa cctgtctcta gaactatcaa aaatcaggca taattcatat 74221 gccataaaat tcacctgctt caagtataca acacggttct tttgtacatt cataaggctg 74281 tgcaactatt tctacaatca attttaaaac attggtatcc attactccta ataaaaacat 74341 aacatccatt accagttact ctccattctc ccctcccccc agcccctgca accattaatc 74401 tgttctctat cttgtacatt tgcctattct ggatatttca tatgaatgaa atcagctcat 74461 caatgttttc tgcgttcgtc catatgctgt ggcacgtctg tgcttcattc ctctttctgg 74521 ctgaataata ttccacggac tgggtatagt acattttgac atttggacac tttgatgtca 74581 ttcttttttt tttttttttt ttttttgaga catggtctca ctctgttgct cgggctgcag 74641 tgcactggca caatcatagc tcactgcagc cttgaactct tgggttcaag tgatcctgct 74701 gcttcagctc ccaaagcgtt gggattacag gcatgagcca ctgtaccctg cctacccagt 74761 aattttggag cgcataccag atgtcatatc attttattta tatagagaat cagttctcct 74821 tccctgcagc ctcccttgcc ctctggatcc tgactccaaa gggctccctg ccagcctgtc 74881 ctcccaggga cttagaagat cgggtctacc caaggcccat aagtggtccc cacggagcct 74941 cttcccagga gctgcagccc ttgctggcca ctttcataga gggggccagc cacaccctga 75001 gccccagcag ctcccacagg agagaggagg agggacccca catttgtgct tatagaagga 75061 aggcctcctg ctctatggac actctggcgg gggtcgagga tgggaagagc ccagaataaa 75121 aggtgttgag aagagaaagc atgcaaagca ggtgaggctg tcccaggatg gggggcctgt 75181 ggccgagggg actaatgagc ctacctcaca ttagggccaa ggaccgcccc ccaccctgct 75241 gcgctgccag ctccgtgctc accgatagaa gtacacatac gtctccatgt aggtgctgcg 75301 ccccagctgg gggctgctca gggtgctgta gggcccagag ccatcaaatc tgccaagaaa 75361 aggggaggcg gggtcagggc ctcaagcctc tgggtgtgcc cctgattaga cccacagaat 75421 cttcctcacc gattgagttc tcgaagcagg agcgggatgg cgctgccgga agagtccacc 75481 acctcctgca gcaccatgat gtcacagcga gccagtatct gtgggacatc aggaaggcca 75541 gcctgactca gtgccaactg ccctgagagg aggcagtggg cccaggggac tggcactgtt 75601 gtggcccaga gtgttaaggc tcactgactg cagcccacag aaagcagtgg gactggaaat 75661 gccccaggaa aagactcatc ccacccacat gtcacctgtt tcacagagga ggagactgag 75721 gccagcagca ggcaagcgcc tgcgctgcat ctccgtgcca cacctcttct tctcccctct 75781 tctctagaac atacccctgg tcaggcaggg atgggggctt tggctcatgg tggttagggg 75841 cacggcttcc ctgaagtgat gaacttaccc gaactaaggt gtccatcacc tgctccctgg 75901 ccaccttggc cagtgtcagc cgctgggcat tgaaggcgca gatgcgaaag gcctgggccc 75961 cattggccag gatgaggaag aggagtgcag ttgggtagtg catggctgtg tgtggctgcc 76021 ggggacaccc caggaatcca ggctgcccca gggtgcgctc tcactgggct cagttctggc 76081 tctggaggcg ctgaaatatg ggacagagga agtggcacta gctgctgccc agcaccagcc 76141 tcccagtgac ctcccgagat cataccactg cactccagcc tgggtgacag agtgagactc 76201 tgtctccaaa aaaaaaaatt gtttttgaga cagagtcttg ctctgtgaag atcaagccaa 76261 gatcgcacca ctgcactcca gcctgggcga caataagact ccgtctcaaa aaaaagtgaa 76321 atcctacagt atttgtcttt ttgtgactgg ctcatttcac tgagcaaaat gtcctcaagg 76381 gtcattcatg ttggagcata tgtcagaatt tttttttttt ttttgagacg gagtctcact 76441 ctgtcgccca ggctgagtgc agtggcgcaa acgcggctca ctgcaactcc gcctcctggg 76501 ttcacgccat tctcctgcct cagcctcccg agtagctggg actacaggtg cctgccacac 76561 gcccggctaa ttttttgtat ttttagtaga gatggggttt caccatgtta gccagaatgg 76621 tcttgatctc ctgacctcgt gatctgcccg cctcggcctc ccaaaatgct gggattacag 76681 gcgtgagcca ccgcacccgg ctgtagaatt tctttccttt tcatggctga atcatattcc 76741 attgtctggg catctgggta gaccacattt tgttcattca ttcattcatc catccatctg 76801 gttttttttt tttttttttg agacggagtt tcgctcttgc tgcccgggct ggagtgcaat 76861 ggtgcgatct cggcttactg caacctctgc ctcccaggtt caagcgattc tcctgcctca 76921 gcctcccaaa tagctgggac tacaggcacg ctccaccacg cccagctaat tttgtatttt 76981 tagtagagat ggggtttctt catgttggtc aggctggtct cgaactcctg acctcaggtg 77041 aactgcccac cttggcctcc caaagtgctg ggattacagg tgtgagccac cgtgtccttt 77101 tttttttttt tttttttttt taaatagaga caaggtctca ctatgttgcc caggctggtc 77161 ttgaactcct gggctcaagc aattctcctg cctcagcctg ctaaagtgct gggattgcag 77221 gaatgagccc tggtgcccca gcctcatgca tctcttgacc tcgtgatctg cccgcctcgg 77281 cctctcaaag tgctgggatt acaggcgtga gccagagcgc ccagcccctg ctttcagttc 77341 ttttgggtat atacccagaa gtggaattgc tggatcatat ggtagttcta gttttaattt 77401 tctgaggaag tgtcatattg gtctccatgg tgactgcacc attttacgtt ctcgccaaca 77461 gtgcacaagc ggtcaagctt attcttttat tgtttattta tttgggacaa agtcttgctc 77521 tgtcgcccag actggagtgc aatggtgcca tcacagctca ctgcagcctc ctgaattcta 77581 tgtttcccag gctggtctca aactgctgga ctcaagtgat cttcctacct cagcctccca 77641 aagtgctggg attttttttt tttttttttt tttttttttt ttttttttga gacagagtct 77701 cactctgtcg cccagactgg agagcagtgg cgtgatctcg gcttactgca acctcagcct 77761 cctgggttca agcaactccc tgcctcagcc tcccgagtag ctgggattac aggcacccac 77821 caccacgccc agctaatttt tatatttttt agtagagaca gggtttcacc atcttggcca 77881 ggatggtctc gaactcctga cttcaggtga tcagccggcc ttggcctccc aaagtgctgg 77941 gattacaggt gtgagccact gcgcccagcc agtgctggga tattataggc ctgacccact 78001 gcgcctggcc tagtctacct tttttttttt tttttttttt ttttttgaga tggagtcttg 78061 ctctgtcacc aggctggagt gcagtggcgt gatcttggct cactgcaacc tctgccttcc 78121 gggttcaagt gattctcctg cctcagcctc ctgagtagtt ggggctacag gcgtgagcca 78181 ccgcgcccgg ccccccccct tttttttttt tttttttttg atagtctcat tctgttgctc 78241 aggctggagg gcagtaatgt gatctcagct cactgcaacc tccaccttcc aggctcaagt 78301 gatgctctca cctcagcctt ccgagtagat gggactacaa ggtgcatgct gccacaccaa 78361 gctaattttt tttttctttt tttgagatgg agtctcactc tgtctcccag gctggagtgc 78421 agtggcttga tctcagttca ctgcaacctc cgtctcccag gctcaagaga ttatcctgcc 78481 tcagcctccc aagtagctag gattacaggc acgtgccacc acacccggct aatttttgta 78541 cttttagtag agacagggtt tcaccatgtt ggccaggctg gtctcaaact cctgacctca 78601 agtaatcctc ccacctcagc ctcccaaagt gctggaatta caggtgtgag ccaccgcgcc 78661 cggctaccaa gctaattttt aaaatttgtt tggagagatg gggtttcacc atgttgttcc 78721 aagaccccag tgcttttcgc ctccattagc aactacagga acatcttcgt tcatctctgt 78781 ctccccactt gtgagatgtt ctctttaaga taaactcccg gcagtgaagg gtgtgtgcat 78841 tttaaaaggt ctcagatatg gccaaatcat ccccattgca acctgtgatg gggactcttt 78901 cccatgcccc tgccaccact ggggagccct tcttgttcat ctcaccagct ggagcccccc 78961 tctctcagtt tatctttgtc tcacactcac ctgaccttgc agggctcggc tccttgctcc 79021 acgccctgtc catgtggctt ttgcatcctg aggcatgagt tcctctgctc tgtctgcctg 79081 cttttctcta tcaacctgtc accttccctt cgggctcact tcctacctct ctgaggctct 79141 gtccgtctct gtctcacttt ccctctcgtc tctggtttcc cttctctgtc tctctgttgc 79201 ggtgtctctc tcctggctgg gtttgatgtc tgtttctccc tgaccacctg ctctgtcact 79261 catgctagtt tgggttccct accctggtgc ctattctggg atatacatgg gtatgtgggc 79321 agccctgggc tggcacatgc agccttggag gctctggcgt cagggacccc catgagggct 79381 accagtaaga accagagaga gcgacggttc ctggcttcag ccccaaaccc tggctgtcaa 79441 tatcagcctt ggtgggagcc ctaggcttcc ctctccctct caaaccctcc ccaggcagaa 79501 ggcaacttcc tcccttcttc cgtctcccca gggggctccc atacctgctc agaccagctt 79561 gtcaccaagt ctgcctcatc cttaagtgga tcgcgatacc ccagaagagc agctcctgcc 79621 tgaatagggc taggagggga aaaaaaagag gaagaggctg tgttcctgag ccacacggcc 79681 agaaagggcc tggctgggcc ggacgcccca cccttgccaa caggcaggcc cgccacgtct 79741 ccgcctcgcc gccaacaagg aggaccagca ccaggcctgg gaactgctga cctgcccgcc 79801 caccttccca cccagacccc gccttcccgg gccagcggga catgacagac ctgtgagtca 79861 ctcaagcctg gggcaaggag gggctggtgc ctgagtgaga ccccgagctt cgctcgccag 79921 gcccgtcgct cagctcccgg tctgggtccc cagtcagcag ttgctgggga aggcgcagga 79981 gctggaagtc atgcctgatg ccacgagacc gggcacccca agggcgtgag tcttctgtcc 80041 ctgttgttgg gttctgagtc ccagtgtggc tgcacacttc tgagttaggg gtgtttgatg 80101 caagcgctct ggagacccaa agagagggac agtccacaca tgatctggcc acagtcgtga 80161 agtcaaggga ggactgagga tctgccccag agaggaggag acccagggag ctggcagtcc 80221 aacgtcctgg gggtcctgca gtaatgctgt tgggacaaat gaggtgctcg gacagggtct 80281 gtgagcagga ggtggggctg aatccactca gcttctcatg ggatggctgt agctaatgga 80341 ttcgacgtgg taatagaagg ctggacaaat tttaaaaaat caataaatta tgggctgggc 80401 gtggtggctc atgcctgtaa tcccagcact ttgggaggct gaggcgggtg gatacgaggt 80461 caggcgttca agaccagcct ggccaacata atgaaacccg tttctactaa aaatacaaaa 80521 aattagcccg gcgtggtggt gggcgcctgt aatcccagct actcaggagg ctgaggcagg 80581 agaattgctt gagcctggaa ggcggagctt gcagtgagcc gagatcacac cactgcactc 80641 gcactccagc ccgggcgaca gtgcgagact ccatctcaaa aaaaaaaatc aataaattat 80701 ggtttaaaaa taaaaaacag gccaggtagg atgactcaca ctttgggagg agaggcagac 80761 agatcacctg aggccgggag ttcgagacca gcttgaccaa catgaagaaa cccagcctct 80821 actaaaaata caaaaattag ccaggtgtgg tggcttgcgc ctgtaatccc agctactcag 80881 gaggctgaga cagaagaatc gcttgaacct gggaggcaga ggttgcaatg agctgagatc 80941 tcgtcactgc actccagcct gggcaacaga gcaagactct gtctcaaaaa taaaaataaa 81001 taaaaataaa aataaaaaac tgccgggtgc ggtggcccat gcctgtaatc ctagcacttt 81061 gggaggctga ggtgggtgga ctgcctgagc tcaggagttc gagaccaacc tgggcaacat 81121 ggtgaaaccc catctctact aaaatacaaa aaaaaaaaaa aattagccgg gcgtggcgac 81181 ctgcgcctgt agtcccagct actcaggagg ctgaggcagg agaatagctt gaacccggga 81241 ggcggaggtt gcagtgagcc gagattgtgc cactgcactc cagctggcga cagagcgaga 81301 ctcctaaaaa aataataata aataaataaa tataaaataa aaataaaaaa ctggctgctt 81361 aatgcatagg gggctttctt tccgggtcat gaatatgttt cagaactaga ggtagtagct 81421 gacattgtga ctgtatcaga tggtataaaa tactgtttct tgtattacat atgatacaca 81481 atatgtttgt atatattgct gtatatattg tatattttgc catacagtag attatacatt 81541 gctacataca catagtatca tgtagtactg taactatgtc aaattgtata tgtggctggg 81601 cggggtggct cacggctgtc atcccaacac tttgggaggc cgaggcgggt ggatcagagt 81661 gtgatggcgg gcgcctgtgg tcccagctac tcgggaggct gaggcaggag aatcgcttga 81721 gtctgggaga tcgaggctgc agtgagccat gatcccatca ctgcactcca gcctgggtct 81781 ggctggaccc tgtgtcaaac aaaaaacaga aacactagag agacaaaagg cgagcgcagg 81841 aggctgccag gaagcgggag ccccacgggg tgacggccac ttcccgccgc cccggggctg 81901 gcctcgctgc cgtcaagcgg cccgtgcgcg ctttcgacag tcggtcgggc tgggcaccgg 81961 cgcaggttcg ctttccggcg gttgcaccgg gccggggtgc cagcgcccgc tttcccgttt 82021 cctcccgttc cgcagcgcgc ccacggcctg tgacttcgga gaccgctccc cagtgacgag 82081 agagcggggc cgggcgctgc tccggcctga cctgcgaagg gacctcggtc cagtaccctg 82141 ttgcgccgcg cccccgtccg tccgtgcgcg ggccagtcag gggccagtgt gtcgaggcgg 82201 tagaggtcgc agcctagagg cgttccacag gtcggcccgg ggcgctggga gcccggccgg 82261 cggtcgggtg gggtggcctc tgcacgtgaa agtggccgtt ccccgcggtg ccgccgctca 82321 cctggaccct ggccagcagc gtcgtcatgg gcttggtggg cacctacagc tgcttctgga 82381 ccagtgagtg ggcccaggcc gaggcaggcc cgcccgggta cccatgcccg gccggaggtg 82441 ggacttaggg tgggggacgc cgcggccccg acctagcggg cgagcccgga gcgcctgacc 82501 tctccttccc cgccagagta catgaaccac ctgaccgtgc acaacaggga ggtgctgtac 82561 gagctcatcg agaagcgagg cccggccacg cccctcatca ccgtgtccaa tcaccagtcc 82621 tgcatggacg accctcatct ctggggtacc cgggccagtg tgctgggcag ggggaggaaa 82681 ggcgaggatt cgggacgggc ccagcctcgt cagaacaaac cccttctcgc tgccttttca 82741 tggagccctg gcactccggt ttccgggcgt tctgccccag ggctgagact tggatttgtg 82801 gctcctgcag catccctgag ggagcatgat ttggagaggg ccttgggcat ggagcaggac 82861 caaggaacgg ccacagggca tatcttacag ctcctttaac gtcaggcctc cagaaccctc 82921 tgttctctaa accatccagg gccccacagt cttgggaaaa atttgggggt tggacatagg 82981 atatcctgtg aggaagttct gggccaggga ctggaagccc agtgtcctgg cttcaggttt 83041 gcccctggct gggcgtgtac cgttcagccc tatttgggag cctggttgcc acacctagag 83101 gttcctgctc tgctgacttc acaggaagtt gcttggctct ggtaaccaga tggccgggtg 83161 gaggggtgcc tggggcttcc actcctgaac agagagctta actgtcctag agttcatggg 83221 gagccaccta gccacactgg agcccttcag agctcaccct ggctgctggg tggagaaagg 83281 cctatagatg tcacaggtgg atcctgggag gccaggaagg aggacttgga gccatccact 83341 agcctagact gggctggttg cagtgacagt acagggagaa gtatagaaga atgggcattg 83401 tccaagtatg ttggggaggt aggcagtcca ggtgcagtga ttagtaagat gtggagggtg 83461 ggggagaggg aggagccagg gagaactgaa tgatctggcc taagggtctg aaagatgcca 83521 ctttgggctg tagggaaatg gtagtgctgc tgtccctcat tccctgagat ggggtgaagg 83581 aagggcactc cctggggata tgggaagttg gggcatgaag cctttcctgt cctctaggga 83641 tcctgaaact ccgccacatc tggaacctga agttgatgcg ttggtgagga ggaatgggcc 83701 cctcgaagtg ggccgggccg gccccacctg cctctgccca gatttgccct cctcctgctc 83761 tgcccaggag gtggcgtcca gcagtccagg cagggcatgg ggtggggcac cgcaggccct 83821 cccctgtgcc tcccttctgc atgctgatgc tgggggcagg gtggtggagc ggggtgcagg 83881 gggcaggact aattgcatct gtccctgctt aggacccctg cagctgcaag acatctgctt 83941 caccaaggag ctacactccc acttcttcag cttgggcaag tgtgtgcctg tgtgccgagg 84001 tgagctgctc ctccagcgag tgcagggagg cacttctggg gcaggaaagc tggtggccag 84061 tgtctctgtt ttggagggac ctatgggggt aggcatcccg ggagccaaca tgaggcccag 84121 ctccatcaca tgccaccaga gggagcagag gctgtgagag gagggatggg tccttggaaa 84181 gtcctcaact ctctgcaact gcccagggat cctgtggacc tggagccttg gcattagaat 84241 tcaaggaggc attatccata accccttctc tcagggatgt cactaagaga ctcattgaga 84301 acagtgcgtt gtcattggtc catggtggca gtagcatgta cctgcaggcc agagggtttt 84361 gcttctagga gaacaaacag gcttgtgggg tgagaacccc aggggatagg ggccattttg 84421 atccctattc tccagacagg gaggatcaca gagattaagc tgctggccca aggtcatggg 84481 gtaggaggtg caaagccagg atttgaatcc agattgctcc ttcctctgca ggagcagaat 84541 ttttccaagc agagaatgag gggaaaggtg ttctagacac aggcaggcac atgccaggtg 84601 ctggaaaaag aagagagaaa ggtaagccag gcatagcggt tcacacttgt catcccagca 84661 ctttgtgagg tcgaggtgag aggatcactc aagcccagga gtttgagacc agcctgggca 84721 acatagtaag accctgtctc tttaaaaata aaaatatttt aaacaaaaga aaggtagaaa 84781 atatgagttg tatgtagaat atgaaggttc tagaataatc ctattaagaa ttttggattc 84841 tggctgggcg cggtgctcac gcctgtaatc ccagcacttt gggaggctga ggccagcgga 84901 tcacgaggtc aggagatcga gaccatcctg gctaacacgg tgaaaccccg tctctactaa 84961 aaatacaaaa aattagctgg gcatggtagt gggcgcctgt agtcccagct actcgggagg 85021 ctgaggcagg agaatggcgt gaacccggaa ggtggagctt gcagtgagcc aagatcacgc 85081 cactgcactc cagcctgggt gacagagtga gactccgtct caaaaaaaaa aaaaaaaagt 85141 gccggccacg gtggctcatg cctgtaatcc cagcactctg ggaggccgag gcgggcggat 85201 cacgaggtca ggaattcgag accagcctaa ccaacatggt gaaaccccgt ctctactaaa 85261 aatacaaaaa ttagccgggc gtggtggtgt gtgcctgtaa tctcagctac ttgggagact 85321 gagacgggag cattgcttga acccgggagg tggaggttgt ggtgagctga gatcctgcca 85381 ttgcactcca gcctgggcaa taagagcgaa actccgtctg gaaaaaaaaa aaaaaaaaaa 85441 gaattttgaa ttagctggct gcattggtgc atgcctacag tcccagctcc tcaggaggct 85501 gaggtgggag gatcacttga gccctcgatt ttctttttcc ttgagacaat gtctgtttct 85561 gttgtccagg ctggagtgcc gtggcgtgat gatggctcac tgcagcttct gtttcctggg 85621 ttcaagggat cctgtcacct cagcctcctg agcagctggg actaaggagt gtgcccagct 85681 aattcttttt tttttttggt agagttgggg tcttactaca ttgccagggc tggtcttgaa 85741 ctcctagact caagcgatcc tcccgcctca gcctctcaaa gtgctgggat cacaggcatg 85801 agccattgtg cccaacccag cccaggaatt tgaggccagc ctgagcaata tagtgagacc 85861 ccaggtatag atagataggt agatagatag gtaggtaggt aggtagatag atagataggt 85921 aggtgggtag gtaggtgggt agataggtag gtaggtacgt aggtagatag atagatagat 85981 acatacatac atagatagat aagatagata gattgattag atagattaga tagccaggtg 86041 gggtggctca cgcctgtaat cccagtgctt tgggaggctg aggtgggtgg atcacctgag 86101 gtcaggagtt tgagaccagc ctggctaatg tggcgaaacc ctgtctacta aaaatacaaa 86161 aattagctgg atgtggtggc gcacgctgta atctcagcta ctcgggaggc tggggtggga 86221 gaatcgcttg accccaggag gtggaggttg cagtgagctg agatcatgcc actgcactcc 86281 agcctgggcc acagagggag acctggtctc aaaaaaaaaa aaaaaaaaaa aaaagatttt 86341 tgtatttgac tgtgctctct catccaggtg gtgacatgct tgtatctgtg gttaagaggt 86401 agcatctgcc aggcgcggtg gctcatgcct gtaatcccag tactttgcga ggccgaggcg 86461 ggtggttcac ctgaggtcag gagtttgaga ccagcctgac caacatggcg aaaccccgtc 86521 tctactaaaa atataaaaat tagccgaggc tgggcgcggg tgtcacgcct gtaatcccag 86581 cactttggga ggccgaggcg ggtggatcac gaggtcagga gatcgagacc atcctggcta 86641 acaggatgaa accccgtctc tactaaaaac acaaaattag ctgggcatgg tggcaggtgc 86701 ctgtagtccc agctactcgg gaggctgagg caggagaatg gcacaaaccc gggaggcgga 86761 gcttgaggca ggagaatggc gtgaacccgg gaggcagagc ttgctatgag cagagatcgc 86821 gccactgcac tccagcctgg gtgacagagc gagactccgt ctcaaaaaaa aaaaaaagag 86881 agagatacca ttccagttgc aatgtggctg cagcagggct aggtggacat gaggagccca 86941 gttcagggct gtcacagtag ctccagcagc agatgattgt ggctgggcct cccaagtgtc 87001 acgttggaga accggagaag gggacttctt tgggatgtac tctggacttg ttgatagatt 87061 aagtgtaggt ggggtgagga agagaactca aagatgacac caggtgttgg agctgagcca 87121 cggggagaag ggtgcaaagg gaaagcagtg cgggggctgg gaggggaaga gggtcagtcc 87181 tgttttgctt gtgctgcatc tgaggagccc ctcacctgtg gaaggagagc agtcccagag 87241 gcagtggggt gtgcagttct ggaacttaga agaatgatca gggggctggg tgcagtggct 87301 cacgcctgta atcccagcac tttgggaggc cgaggcgggc ggatcaagag gtcaggagat 87361 tgagaccatc ctggctaaca tggtgaaacc ccgtctctac taaaaatata aaaaattagc 87421 agcgcatggt ggcaggcacc tgtagtccca gctattcagg aggctgaggc aggagagtgg 87481 cgtgaacccg ggagacggag cttgcagtga gctgagattg cgccactgca ctccagcctg 87541 ggcgacagag cgagactccg tctcaaaaaa aaaaaaaaca aaaaattatc aggaataaag 87601 atactgggaa ttcaaccaca aaaaccacaa agtcacgaat gcagagtgag agtcttcaga 87661 gagagggcag aggcccagga tggcacagag ggagagagaa gccaccaaga tggcagggag 87721 ccatggccag agagaaaagc aaaccaggcg cctgtggcac catggaagca gcaggtgatg 87781 ctggagaagc gtttccatgc agtgacagct gccagagcca gattattatg gcctgagggc 87841 tggtggtgct gcagttgggg gggaatggag accgcagcgg gggaatggag accgcagggg 87901 cggcaactcc tcccagaggt tcagtttgga ggaaaaagag acagaaggaa gggtaaccaa 87961 aggatattag aggtcaagga ggggtttatt atttctagct ggtcaaagta gaatgtggta 88021 aaacccaata aagtaacagc aaataacaaa gaggggcttt taatggaaag ctgcagtgcc 88081 tgcccctcac ctccctagcc ccactcctca gaagcagccc cctggagcgg tttctctttc 88141 tgctgctttt ggaggctact tctgtctctg taaaggaact tctgtctctg taaaggaaca 88201 agcagaggcc acagtgtctt ggcaaatgac cgtggcattg tttatcccaa cctgctctga 88261 gagatgggga ctctgctgat tcacccctcg tccctgcctc tttagttcct tcagcaaaga 88321 actaataata tgtcattaag gcaaaggcag tatttggcac acttggcttc tgtgttccaa 88381 aaaattccct ttttctccaa gcgtagattt taatatggta tttaagtatc aaagtgattt 88441 gacacaagtg acacaagtga gcagtagtag catttgcaaa agctacaaat gatgaagtct 88501 tgctccaaaa gcatattatt attatttttt aaattccagt ttcaagctca atggataggt 88561 tgcgtcagtg ggttttagta gagaaggggc cagtacctat gaagactccc ctgtactatt 88621 acacatgctg cgtcccaagt agggaaactc cagtgggtca tcaccccctc cctctagccc 88681 aagtgggctg ctgggtcagg gcactgtttg ggttgggttg agattcctgg atccctaact 88741 agcaccttcc aaagaatcgg ccttgcagag aacaaagttc attgccaaag gtattcccaa 88801 tgtctgtctc tcgctctgtg atagggcttc tttactcccc actttataag atgaggatat 88861 ctgctatacc ccattccttc aacctctcca cctctttcct cttttgtctg ccatgtgatt 88921 acttttccat caccaaggtt gatgagatcc ttactgtgta cttactgtct ccaagcatcc 88981 ccaggcttcc agccagcacg cagcagccaa cattggcttc agaaggaagg aggtttgcat 89041 ttccttctct tgggtccagt gccatggtct cagggttatc tccaatgtgg aaagttgcta 89101 gaccaggagc agtttgactc cactgttttt cagtctagca agtttgagtt tttcccaagc 89161 tcttgatgtg tgttcaaatg aatgttctcc ctttcccaag agacctccct ataagggagg 89221 gtctagtttc ccggggtgcc ctgagaagaa ggggtcagca ggggggagga atggtggagg 89281 ctgaggcctc aggcaggtgc cctgtgagga gtggctgtga caagcggctg agatcccagc 89341 aaagggttgt tgcccagaat ctggggccta agcgaggttg gaaatcacaa atgcacttgg 89401 tttctctagc agctggcaca tagtaggatc agcccaggag tggcaccttc aaagagaaag 89461 gaacgcatga ggctccgaga gcggtcggct ggcagccagg gcctggctgg ggagggaggg 89521 caggtggtgc gctgacctgg ggatctttga gagtgtgcag gagtatgtgg ctgaagcggc 89581 ccagctggtg agccactgaa gatgggggtc ataggtagat ggaggaattg ccaggtcaga 89641 atctcggtag gagatgactt gagctgatgc tgcctggcac ataggggttt agaagatatt 89701 aggaaggaag ccccacagcg tgcccaagtg ccatgcaggc cctggagacg gtcattggag 89761 ggcgtctgtg gatggcgcag tgggggatgg gaagtcccta atctgctggg gctttcccag 89821 caagcaggga ctgaggggga tagtccccaa cacatgggcc ccagggagaa gggcctgttt 89881 cattgaggtt tggagagtgt tgagggtaag ctaacctgtc accccacgcc cccgagaatg 89941 gttactgata gggagaggcc ttttccttgc aggagatggc gtctaccaga aggggatgga 90001 cttcattttg gagaagctca accatgggga ctgggtgcat atcttcccag aaggtcagca 90061 gggctgactg ggtcgagccc ccccagtatg agcgggatgg gctcccaagc ctcgcctctg 90121 tgctctctca ccagggaaag tgaacatgag ttccgaattc ctgcgtttca agtggggtaa 90181 gggctgctgg tctctggcca cagccatcct cccggcccag agatggccct gtgggcccct 90241 ggctcccgcc ccctcgggct ggcttgtatg ggggtagatg ggcgtgtttg tagcgccagg 90301 aaggggacag gtgctgagac taggcctgcc tctcgcaggg gcttgcccaa gggagctgaa 90361 ttgaactgga ggatgtcggg ggtggcagtg gccagaggct ggcacagaag cttggctcag 90421 ggcccagctt atgctaacat ttctacctcc cccctgggca ggaatcgggc gcctgattgc 90481 tgagtgtcat ctcaacccca tcatcctgcc cctgtggcat gtcggtgagc ctggggacgg 90541 ggacagagag atggcatctg gggtgggggg cctgggactc cctctggtcc caggctgccc 90601 tgctccaccc cacgtctggc cttctgtcca ctgtgctgca ggaatgaatg acgtccttcc 90661 taacagtccg ccctacttcc cccgctttgg acaggtgggt ggggactgct gaccttcggc 90721 tgtctgcctg tctgctgtct gctccgtgtc tcccactcag cactatggag gccagctgca 90781 ggaggagctg agcatgaggc tacagcaggg gacagagtgg aacatagaca gggacttcct 90841 ggcaccgaga cattaaaatg agagcagagg aagtcaggct ggggcagagg tgggctgtgg 90901 ggtcaggaca ggacaggtca tctacagcac aggacccagg accaggacct tgttttagag 90961 gaagagtggc ccctggggag tgtgtggcac agcagggccg gggcttagct tctggctccc 91021 gggttgcttg ggctggggct gtgggcactc ctactgctcc tcatcactct tggcggccac 91081 cccacagaaa atcactgtgc tgatcgggaa gcccttcagt gccctgcctg tactcgagcg 91141 gctccgggcg gagaacaagt cggctgtgag tttcctcctg ggtcccccgt agctgtcccc 91201 ggaccccctg ctgctggctt ccagcagggt gcctccaccc tctccatccc gtcaccctcc 91261 cagggcaccc tcccagggca ccttggccaa gcttcccgag gggtgcaggc catccctggt 91321 cctttccctc aggtggagat gcggaaagcc ctgacggact tcattcaaga ggaattccag 91381 catctgaaga ctcaggcaga gcagctccac aaccacctcc agcctgggag ataggccttg 91441 cttgctgcct tctggattct tggcccgcac agagctgggg ctgagggatg gactgatgct 91501 tttagctcaa acgtggcttt tagacagatt tgttcataga ccctctcaag tgccctctcc 91561 gagctggtag gcattccagc tcctccgtgc ttcctcagtt acacaaagga cctcagctgc 91621 ttctcccact tggccaagca gggaggaaga agcttaggca gggctctctt tccttcttgc 91681 cttcagatgt tctctcccag gggctggctt caggagggag catagaaggc aggtgagcaa 91741 ccagttggct aggggagcag ggggcccacc agagctgtgg agaggggacc ctaagactcc 91801 tcggcctggc tcctacccac cgcccttgcc gaaccaggag ctgctcacta cctcctcagg 91861 gatggccgtt ggccacgtct tccttctgcc tgagcttccc ccccaccaca ggccctttcc 91921 tcaggcaagg tctggcctca ggtgggccgc aggcgggaaa agcagccctt ggccagaagt 91981 caagcccagc cacgtggagc ctagagtgag ggcctgaggt ctggctgctt gcccccatgc 92041 tggcgccaac aacttctcca tcctttctgc ctctcaacat cacttgaatc ctagggcctg 92101 ggttttcatg tttttgaaac agaaccataa agcatatgtg ttggcttgtt gtaaaatgtc 92161 tctggcctct ctgtaggggt gaaaatggga agtgactgct ggacagaaag gctgaaccct 92221 tagtgccctg gggacatgga gcacacttga gcagatgggc atcaacctct tctgcaccag 92281 tcacttaagg cttcagttat tgtctgtaat ccctgtaaca acattgcaga aatagggatt 92341 aatctcattt tgccaaggag gtggagtgag gctaatgaac ttgcttcaaa acccaaagct 92401 agccaggtag attggggcca cacttcagag agacccaagt gtggagaatc aattggatcc 92461 tacctgggct gcatcatagg tgggagagga gctctgaaga gcccggcaag tagccagcca 92521 tgagcagtgc acgctgtgat caggaggtag gattggggtt gcccagatgg agctgggggt 92581 cctgggtggg agcaaaggca aatccagggt ggtggcgtag acacagtatt aaaaatacta 92641 taacagaaac gtggaaagaa agaaaatata gaaccaccaa agatgagcca gcaacaagag 92701 aagattcaga caaccaggtc tttcagcagt gttggtgcct aattatttca aaataaaagg 92761 tttttaaaaa aatgaccaaa aagggaaaaa aattacctgg gaaaaaacaa acacttgtaa 92821 ttaaatatag atagaagcag ctgggcacca tggctcacgc ctgtaatccc aacactttag 92881 gaagccaagg tgggcggatc atttgaagtc aggagttcga gaccaacctg gccaacatgg 92941 tgaaacccca tctctactaa aatacaaaaa ttaggccagt cgcagtggct cacacctgta 93001 atcccaacac tttaggaggc cgaggcaggc agatcacctg aggtcaggag tttgagacca 93061 gcctgaccaa catggagaaa ccccgtctct actgaaaata caaaattagc agggcatggt 93121 agcgcatgcc tgtaatccca gctacttggg aaggctgagg cagaagaatc gcttgaacct 93181 gggaggcgga ggttgcggtg agccgagatc gagccattgc actccagcct gggaaacagc 93241 agcgaaactc cgtctcaaaa aaagaaaaaa aaaaggctgg gcctggtggc tcacacctgt 93301 aatccaagca ctttgggagg ccgaggcagg cagatcacga ggtcaggaga tcgagaccat 93361 cctggctaac atggtgaaac cccgtctcta ctaaaaatac aaaaaaatta gccaggcgtg 93421 gtggcgggtg cctgtagtcc cagctactct ggaggctgag gcaggagaat ggcgtgaacc 93481 cgggaggcgg agcttgcagt gagccaagat cacaccactg cactccagcc tggtcgacag 93541 agtgagactc cgtctcaaaa acaaacaaaa aacccccaaa aaattagctg gcgtgatggc 93601 gcacgcctgt aatcccagct actcaggagg ctgaggcagg agaatcactt gaacccaaga 93661 ggcagaggtt gcaatgagct gagatcgtgc cactggactc cagcctgggc gatagaggga 93721 gacttcatct caaaaaacaa acaaacaaat aaacaaacaa aaaaacacac caggcacagt 93781 ggctcatgcc tgtaatccag cactttggga ggccgaggca ggcagatcac aaggtcagga 93841 gatcgagacc atcctggcta acatggtgaa accccatctc tactaaaaat acaaaaaaaa 93901 aaaaaaaaaa aaattagcca ggcgtggtgg caggtgcctg tagtcccagc tggaggctga 93961 ggcaggataa tggcgtaaac ccaggaggtg gagcttacag tgagccgaga tcgtgccact 94021 gcactccagc ctgggcaaca gagcgagact ccgtctcaaa aaaaaaaaaa aattaaaaat 94081 aaaagaaaga aaaaggaata ttttgataca tgctatcaaa tggatactct caaaggattt 94141 tcaagaaaca ttatgttaac tgcaataagc cggaaacaaa aggctgcata ttatatgatt 94201 ccacttatat gatgtactta ccggcttcag atccagagtc aaaaagtaga aggttggttt 94261 ccagggcctg ggaggatggg ggaatgagaa attgtataat ggggacagag ctttagtttg 94321 ggaagatgaa aagagttctg aagatggatg gtggtgatga ctgtacaaca gtgtgaacgt 94381 acttaatgcc actgaactgt acagttaaaa gtggttaagg ccgggcgtgg tggctcacac 94441 ctgtaatccc agcactttgg gaggccgagg tgagcggatc acctgaggcc aggagtttga 94501 gaccagcctg gacaacatgg tgaaaccccc atctctacaa aaaaaaaaaa aaaaattagc 94561 tgggcatgtt ggtgggtgcc tgtaatccca gctactcagg aggctgaggc gggagaatca 94621 ctgaaacccg ggaagcagag gcttcagtga gccaagattg tgccattgca ctccagcctg 94681 gacaacaaga gcgaaattcc gtctcaaaaa aaaaaaaaaa aaaggcattt cacagaagag 94741 gaaacacaaa aggcagagtg tgcatggtat gggtttcctg tggctgcagt aacaaagtgc 94801 cagagactgg gaagtttaaa gcaacagaaa tttattctct cccagttctg aaggccagaa 94861 gtccgaaatc aaggtggcac cagggtgggt tccttctggg gctgtgaggg aaagccagct 94921 ccaggcctgt cttctggctt cttccggtgg ttcccagcaa tgctccgtgt tccttggctt 94981 gcagaggtgt cattccaatc tctgccttca gctgcgtgca gtgctctcct ccctgtgtct 95041 gtctcttttt ctcctccttt aaaaacacca tgcattgatt gaactagggg tcacccttat 95101 gcagtataac ctcaccttaa cttgattctg tctgcaatga cttttatggg gacactactc 95161 aacccagtgc atatactcta tgacctagca attccccacc tagttatata cctattgaaa 95221 tgaacatcta tgtttaacaa aagacacata catcggctgg gcgcagtggc tcacgcccat 95281 aatccaagca ctcaggccga ggcgggcaga tcttgaggtc aggagataga gaccatcctg 95341 gctaacacag tgaaaccccg tctctactaa aaatacaaaa aattagctgg gcatgggggt 95401 gggcacctgt agtcccagct actcaggggg ctgaggcagg agactggcgt gaacccggaa 95461 ggcggagctt gcagtgagct gagatcacgc cactgcactc cagcctaggt gacagagcga 95521 gactccgtct caaaaaaaaa aaaaaaaaga agaaaaagaa agacgtatac ataattgttc 95581 acagcagcat tagccataac agcccaaatg ggaaaccaac caaatgtcaa cagttgaagg 95641 acacaatcaa tggtgacgta ttcatccagg ggaatactat gcagcaacat tgttacatgc 95701 agccacttga attaatctgc caatgttgag caaagaatct agacacaaaa taatacttcc 95761 cttcagagtg gaggagaagg ctgagtgcgg tgactcacac ctgtaatctc agcactttgg 95821 gaggccaagg tgggtgggtc acctgaggtt aggagttcaa gaccagcctg accaacatgg 95881 tgaaatcccg tctctactaa atacaaaaaa ttagccgggc gtggtggcac atgcctgtaa 95941 tcccagctac tcgggaggct gaggcaggag aattgcttga atccgaaggt tgcagtgacc 96001 cgagatggcg ctattgcact ccagcctggg caacaagagc caaactctgt ctccaaaaaa 96061 aaaaaaaaaa aaaagagtgg aggagggaaa aatattatac tgaataattt cctttattta 96121 aagcctcaaa acagccaaaa caacacatcc catcattttt aggaatgcat gcttaactgg 96181 tcaaactata aagaccaaag tgattagcaa aagaatcaag ttcatggtca cattgggaag 96241 tgatagtaag ggtatgagag gggctttcag ggcaatgata ttcaaattct tgacctgggt 96301 gatgccatgg aactgtctgc tttgataagc attacactgt acatctttat tttgtgggtt 96361 tttttttttt tttttttttt tgagacagag tttcgctctg tcacccaggc tggatggagt 96421 acaatggcat gatctctggt cactgcaacc tctgcctcct gggttcaagc gattctcctg 96481 cctcagcctc ccgagtagct gggattacag gcgcatgcca ccatgcccgg ctaattatct 96541 tgtattttta atagagacgg ggtttcacca tgttggccag gctggtctcg aactcctgcc 96601 ctcaggtgat ccgcccgcct tggcctccca aagtgctggg attacaggcg tgagcaatgc 96661 gcctggccta cttcttttct tttttgggtt acgtttcacc atagaagtgc tttttttttt 96721 tttttttgag acagagtctt gctctgtcgc ccaggctgga gtgcagtggc acgatctcgg 96781 ctcactgcaa gccccatctc ccgggttcac gccattctcc tgcctcagcc tcccgagtag 96841 ctgggactac aggcgcccgc caccacgcct ggctaatttt tcatattttt agtagagatg 96901 gggtttcacc gtgttagcca ggatggtctt gatctcctga cctcgtgatc cgcccgcctc 96961 ggcctcccaa agtgctggga ttacaggcat gagccaaagt gcccggccct tttttttttt 97021 cttttttttg agatggagtt tcgctcttgt tgctcaggct ggagtgcaat ggcacgatct 97081 cggctcactg caacctccgc ctcccaggtt caagcaattg tcctgtctca gcctcctgag 97141 tagctgggac tacaggtgcc cgccaccacg cccagctaat ttttgtattt ttagtagaga 97201 cggggttttg ccatgtttgc taggctggtc tcaaactctt gacctcaggt tatctgcccg 97261 cctcagcctc ccagtgttgg gattacaggc gtgagccacc acgcccagcc aggatttcaa 97321 tgaacaaaat gcttttaccc taaagatgaa gtccccagac agaaggtgct cattggtgct 97381 gtcaacattt ggtgattggt tcagtttaga tatcacctcc tggtgggagc agcttcatcg 97441 gtgcccctct ccttacatcc ctcccagtcc catgtctcaa cctttggaat attcatgacc 97501 ctgtatcatg acctgctgtc ctgcatgtcc tctagaccaa actgtcaact tccggggcag 97561 agtccaggtg cctggcccat agcaggtgct tcaggaagcc tccagggcca tcactccagg 97621 gacccctagc aatgcaggga agaataaaag cagccacagg tgggtttctt ccctaagcat 97681 ggaagcagct cagcaacaga gtcctggcgg aggcaagatt gccttccgca ttccagagga 97741 agtatttgcc ttgaaactat gtaacaaaag taacagaaac ataatagaca tgctttcctt 97801 tttaagtttg ctggattctc ctgggataca gtgaagctcc agtgaaaacc tctctcctca 97861 gctagttaag gcctggaaat ggccacttgg gggtttccct aagcacccga aactcagtga 97921 gcccccaagt gacctcgtct cccagtcccc aactcagaga gacccaaaag ctacaatctt 97981 ctttgactgt attgtcagac ccacttcctc tctccagctc aactgccact atcctagtcc 98041 tagtctgaat cacatgtgac aatcatgact acaatctttt tttttttttt tttttgagat 98101 ggactctcac tctgtcgccc aggatggagt gcagtggcgc gatctcggcc cactgcaacc 98161 tccgcctcgc gggttcaagt gattctcctg cctcagcctc ccgtgtaact gggactacag 98221 gtgtgcacca ccatgcccag ctaatttttg tattttcagt agagatgggg tttcaccatg 98281 ttgtgcagga tggtctcgat ctcttgacct catgattcgc ccgcctcggg ctcccaaagt 98341 gctgggatta caggcgtgag ccacagcgcc cggcccgtta gggtgtcttt gatgccacgc 98401 ttcgctctgg gttttctcct tcctctttga ttgctccttg tccgatctct tttctggacc 98461 cttctcctcc tgcccaatgt aagtgctggg tgttccccta gatgctatcc tgagcccgct 98521 tcactcctcg ttctaaggga ttacagtctc ctctcacctg tatcagggag ggtccatctg 98581 caggaaaaag acaccacttt tgcaatttta aggaaaagga ttaaaacagg gattggccgg 98641 gtgcggcgat gcgcgcctgt agtcccactt ttccggaggc tgaggcggga ggatcgcttg 98701 agcccgggga ctcgagacca gcctggccaa catagctaga ccccgtctct aaagaaaaaa 98761 aaaagacaaa agaagagaac aacaacaaaa aaaccaaaat aaaaataaaa ataaaaacgg 98821 acgggcgaac gcgtcgggtg ggcgtagggt tccgcgaagc ctccctgtcc ctcttgtctc 98881 gtcctagtcg gtctagctgg gccccatccc ctttcccact cggagcgtga gcgcctggag 98941 acacgtacag ccaaccagtg agaaggagtg gccgcgagtg gcatgcactt ggtccaatta 99001 cctgcggccc tgccggtcgg cccgcgctgg ggccaatgga ggtgcgaggc ggggctcggg 99061 cgggggcaac ggtcacctga tctgcggctg tcgaggccgc tgaggcagtg gaggctgagg 99121 ctatgatggc ggccatggcg acggctcgag tgcggatggg gccgcggtgc gcccaggcgc 99181 tctggcgcat gccgtggctg ccggtgtttt tgtcgttggc ggcggcggcg gcggcggcag 99241 cggcggagca gcaggtcccg ctggtgctgt ggtcgagtga ccggtgagcg ggccggggtg 99301 ggatgcgctg tggcggctga ggcgccctcg cccgactccg gcgctgtcct aggcgagggg 99361 tggtgaggcc cggaggtgga ctgttccttg ctcgggggct cgcagcgaat ctgccggcga 99421 cagagctcca gtccacatgc gcccccgtct gacagcacct cttctgtgcc ctgccaggga 99481 cttgtgggct cctgcggccg acactcatga aggccacatc accagcgact tgcagctctc 99541 tacctactta gatcccgccc tggagctggg tcccaggaat gtgctgctgt tcctgcagga 99601 caaggtgcgc ccgccccagc ccactctccc ccggtcatcg ggaggcagcc aggccccctc 99661 cccccatgac actgacgccc attccccaag ggaagcttca gtgaccttgt cccaactgta 99721 gggaggtgtg ggtcgtctca tgggaaggcc tgtagtaaac gcttcagtgg gcatggcgac 99781 agcctcggaa atggcaccaa cttgattgga ggaagcgacg gaccagaggc caggtaccta 99841 ctgagtacca agcactttgg atatctgact tagtccaata tggtgggtgg ggattatcgt 99901 ccctgtttgt ttatagatga gaagactgag gctggaggtt aagtgacttg tccaagctca 99961 tacagctaat gggtggcaga gttgtaattc tagctgtgat gatcataata atgataattg 100021 gaaaatgctc acctgtttag tgctttgtag gcacttgcta cactgatgtc attatcttgg 100081 tttcactgcc aaggaaagta aggtttgtcg agatacaatg tttcactaca gttgatagat 100141 accactttgg ttccagcctg agtctgccaa tcctccctcc caattggagc tcataacccg 100201 catgctgttt tccttcccat ccaggatctt tgccccaaaa gcaggcgtgg agaccatgag 100261 atgactctaa gatagcaagt ctctgctagg ccctcccttg ttggtggcct gtatggctca 100321 ttgcccttgt agtctccctt ggttgctgtg tgggcagatg gtggtggcct gaagtctttt 100381 caggaggtgg taaataatat agtcaaatag gaactgagtc ctagttctac catgtaacca 100441 gccattggag gcatgaaata atctttgtga atgacagttt ctacatctgt aaaatgaaga 100501 caccacctat atcagagggc tgtgagtggg aaatcttttt tttttttttt tttgagacgg 100561 agtctcgctc tgttgcccag gctggaatgc agtggtgcaa tgtcagctca gctcactgca 100621 ccctccgcct cccgggttca agcagttctc tgcctcagcc tcccgagtag ctaggattac 100681 agggcgcccg ccactacacc cagctaattt ttgtattttt ttgtattttt agtagagatg 100741 gggtttcacc atgttggcca ggctggtctt gaactcctga cctcgtgatc cacccacctc 100801 agcctcccaa actgctggga ttacaggcgt gagccaccgc gcccggcctg tgagggggaa 100861 ttcttataat ccacataaaa catctagcca ggatccccca gtcttagggc attcacccat 100921 tcattcattt gcccaatatg gaaagcctac tatgtgccag gtattgtgaa gtgttgggga 100981 tatggcagtg agcagaagaa cccaagcccc caccctcagg aagctgacat tcttgtgagg 101041 gtggccagta aatactaagt cggtatagga tatgatgtca gtgttacggc ctttagggaa 101101 ggaagcaggc taaggggaca gagtgactgg gatgctattt tagataaggg tggtcaggcc 101161 aggcctcatt gaggaggtga tatatgagca gcgatcttaa tggagggagg gagccgtttg 101221 gggaaggaca ttcctggaac agcaaatgcg agggtcctgg gcgaggtgtg ctcttggcca 101281 gctcaaggaa gagctggtgt ggctggagca cagtgagtga gagaaagggg taggagatgt 101341 aggagatggt ttgcaggtgg ctaggcgatg aagtaggacc ttgtaggcca tgagggggaa 101401 ggtcagttgc agttcgaaat gtgtgagaaa gccttgggta acgtggagca gggagtggtg 101461 tgatttctca attaaagaaa gcccacagga cccaccagca gttcctgggc tttccgactg 101521 agaaccccgg tggccaaaca gcagcaaggt cctgcccaca agggagggga ggctgggcga 101581 gtgtgtatac caaacaggtt agggaagctg attaaaatct cctcagggcc taactgggaa 101641 gggccagagg aagctgggga tgggagtgga gggtaggagc aaaaggacaa aggacatctg 101701 taggttgtgg agaaaaaggg atggggtcgg ggccactgtg gtcctaagag ctcaaaagac 101761 ttcaatgctc gatgcttcct ccagcatgtt ctgagatcct cacctctccc cttccgccaa 101821 aagcaggtgg ggggagggtc ccgtccagac tggacatagc cgactctcct tttctctggc 101881 tgggaggcct gccacaaatg ctcttggctg ccccaccccc tccccgcagc ttccctgttc 101941 cctccccagt tcctcttgtc tgtagggtgg gcaaggcggc tgactcctac tcctgagtta 102001 ccacaagtca gctgcctgca gatctcccca ccccatgact gccttccatg tcttctcacc 102061 ctgccctgag agtgctggag ggaagaggtg agtctcccac cccaccccca ccaacacaca 102121 gttgtgctcc accatactga acctgactct cagtgagact ttgctggccc tgagaatgca 102181 cggggaaggt ggctgtcccc tgccttggcc cccatcactt gccaaacctc ctcagatgtc 102241 tcccctattt ccctaatagc tgagcattga ggatttcaca gcatatggcg gtgtgtttgg 102301 aaacaagcag gacagcgcct tttctaacct agaggtgaga gtcctctccc agccaggggc 102361 catgggggac attctgtgct ccttctccca gcataagaac tgtactctga cctcatatca 102421 gggttacttg ggtctgagtc tcttctgggt caaccatccc ccccaaaaac aacaacaaca 102481 acaaaagcca cctcatactc ttagggagca gagacgctgg ctttggggta gaaaggcctg 102541 gggcctgaat gtcaggggct ggggtgtggg ccattttctt ggtgagcctt tatgggtatg 102601 gcttgccaga ggagagctgc cagagaggtg tggggggctg ggccaggccc actggcccct 102661 ggctaactca ccttttgcct ctcccctgtc cccagaatgc cctggacctg gccccctcct 102721 cactggtgct tcctgccgtc gactggtatg cagtcagcac tctgaccact tacctgcagg 102781 agaagctcgg ggccagcccc ttgcatgtgg acctggccac cctgcgggag ctgaagctca 102841 atgccagcct ccctgctctg ctgctcattc gcctgcccta cacagccagg tactgcccgc 102901 atggcccagc caccagcctc ggggcccaga gagcagcaag gccctgggct atggcatgtg 102961 gtggcaccgt ttggctaaag atgccagtac tccccacccc ctcacagccc ttccagaagg 103021 tgccctgtgt ggccagaaga ggaggggagg gcctcttctg tgactcagga gttggtgcct 103081 agcactgagc tgggcctcca gagatgaggc agatgtgagc cctgcgggtg gggagccttc 103141 tcacctagcc aggcccttgt cacagagcag gtggccagga cttgagtgtg actgagaagc 103201 ctcggggtgg gatgggcttc caggaggggg cactgaggta agagtgtgca ggcttggtgc 103261 gtggggctag tggggaggag aagctggggt cggggaaggg tagtgggcag tgcaggtcca 103321 ggccgcctga ggactctggc gctctgtttg tcttccatag ctctggtctg atggcaccca 103381 gggaagtcct cacaggcaac ggtgagtaga atcagggagg atgcacgtcc tcatctagcc 103441 cttgggtggg ggccacagag agagctggcc tgcagttcct cggcctcctc actcccaggg 103501 tggcccgcct gcttctcggg gcagctggca gtggagcagg gaaatgctgg ctcccaggac 103561 cagccaggga ccacctgaca gaagaccctg tccagccaga ccccacttgg cagagggggg 103621 cggctctctg tgctggaggc tgtggagcca gtttctgtgc aggcagcgtg aatcctagag 103681 ttatgggttt aataagactg cagacatttg gaaaaacaat tctggggact catctatttg 103741 caataaagga cagtcactcc ccatggaaag gcagtcgtga acctttcatg ggtggatgtg 103801 gagttttctg tattgatatg tattttctta tgtcacatag gtagggcttg gcattaggga 103861 atcgatgact gaagccacag ttgtctctgg gggaggctgg gaatgggggt gggatgagaa 103921 gtagctgctc ctaaggatcc tcagtggaaa gtaaacacac cactttcaga tggtctagaa 103981 ggtttctaaa gcagacagtg gggagtgttt ctttatgcag ggtgcctggt gacctgagtc 104041 tctgctctcc ttgttgaccc tcagatgagg tcatcgggca ggtcctgagc acactcaagt 104101 ccgaagatgt cccatacaca gcggccctca cagcggtccg cccttccagg gtatgtgccc 104161 ttccagcagg ggctctgggg cgtgcaggga gaggcagtgt ggtgagtctg cttggaggtg 104221 gggagtgtat gccacaggta ggctgcccag gaggcccagc ataggggaag catgaggcac 104281 agaacccctg tggtgtgact atttggagtg gctgactctg ggggcagggc taggatgagc 104341 actcaagtcc tgcctcccca ttccaggggc agctttccat ctcgtcctgt gaggcttgtg 104401 ggtggcagac ttggacacta agtctaaatg accctaagtt tgagggacta ggaatgtcat 104461 cagggtttag ggggcatcag ggaaggcttc ctggaggaaa ggtctcctct ggctgatggg 104521 actttgagat agtggacagg gtgtcctgtt gggaggggaa ggagggcacg acaattgggt 104581 tcaagtgtga cactctccca gggctgctga aagaaggctg gttgggtgtt tctgcaggtg 104641 gcccgtgatg tagccgtggt ggccggaggg ctaggtcgcc agctgctaca aaaacagcca 104701 gtatcacctg tgatccatcc tcctgtgagt tacaatgaca ccgctccccg gatcctgttc 104761 tgggcccaaa acttctctgt ggcgtacaag gaccagtggg aggacctgac tcccctcacc 104821 tttggggtgc aggaactcaa cctgactggc tccttctgga atgactcctt tgccaggcaa 104881 gggcactagg ctggggagga ctgtgccacc acaggtgacc ttcccatcac tgtgggtcgc 104941 aggtggggca gggagccagg gtcaggtctg tgttttgggg tggggatggc catggtctgg 105001 ttggggtctg ggcagcgtga ggatgtagga ggtgtcatgg catggggtgc tgttggaggg 105061 ggagttgctc agacttctgc ctccccatgc ttgcgagtct ttggttggga ccttggttgt 105121 taacacctca aggttacctg ggaacctgtc acctccccct tgagcaaaac ctctgggact 105181 ttgtgtgggc aaataccagg ccagctatct gagccattta tggggcaacc atgcccttct 105241 gtttgctctc gcccccaaag cccagcccag gcaccttagt gaccctaaga gggagtaaaa 105301 ggaggggcag tgagctcagg ggaaccagag gaaggacccc catgctggtg gcaaggggga 105361 cctgggtcct ggcttccagc tctgttccga ccttgcaatg tgactggatc ctcagttttc 105421 ctatctgtca agtggggaca ctgcttcttc aggtggctgc aaggacccaa ggcagcttag 105481 gtaggagcag agctgaggaa cttgttcctc taagatgcca aaggccctcc cagagcctca 105541 cagtgcgcct ctttctctgg ccccacaggc tctcactgac ctatgaacga ctctttggta 105601 ccacagtgac attcaagtga gtcctggggg tggttgaggt atgggtgggc tggtgtggcc 105661 ccagcctccc cagctcacct gacatccctg ccaccttccc caggttcatt ctggccaacc 105721 gcctctaccc agtgtctgcc cggcactggt ttaccatgga gcgcctcgaa gtccacagca 105781 atggctccgt cgcctacttc aatgcttccc aggtcacagg gcccagcatc tactccttcc 105841 actgcgagta tgtcagcagc ctgagcaaga agggtagtct cctcgtggcc cgcacgcagc 105901 cctctccctg gcagatgatg cttcaggact tccaggtatg gagcgggcgt ggcccagctt 105961 caggtggggg agcccaggct agtggttgag agacgaagag aggcttgggc ttgggcatgt 106021 agttgagagt cctgtccctg cgctgcccct gtcccagttc ttgctgggcc tcccaacggt 106081 cctttctgac ccgtgtctgt gtgtctgcca gatccaggct ttcaacgtaa tgggggagca 106141 gttctcctac gccagcgact gtgccagctt cttctccccc ggcatctgga tggggctgct 106201 cacctccctg ttcatgctct tcatcttcac ctatggcctg cacatgatcc tcagcctcaa 106261 gaccatggat cgctttgatg accacaaggg ccccactatt tctttgaccc agattgtgtg 106321 accctgtgcc agtggggggg ttgagggtgg gacggtgtcc gtgttgttgc tttcccaccc 106381 tgcagcgcac tggactgaag agcttccctc ttcctactgc agcatgaact gcaagctccc 106441 ctcagcccat cttgctccct cttcagcccg ctgaggagct ttcttgggct gcccccatct 106501 ctcccaacaa ggtgtacata ttctgcgtag atgctagacc aaccagcttc ccagggttcg 106561 tcgctgtgag gcgtaaggga catgaattct agggtctcct ttctccttat ttattcttgt 106621 ggctacatca tccctggctg tggatagtgc ttttgtgtag caaatgctcc ctccttaagg 106681 ttatagggct ccctgagttt gggagtgtgg aagtactact taactgtctg tcctgcttgg 106741 ctgtcgttat cgttttctgg tgatgttgtg ctaacaataa gaagtacacg ggtttatttc 106801 tgtggcctga gaaggaaggg acctccacga caggtgggct gggtgcgatc gccggctgtt 106861 tggcatgttc ccaccgggag tgccgggcag gagcatgggg tgcttggttg tttccttcct 106921 aataaaataa acgcgggtcg ccatgcgttg ggcctcttgt cgctcatttc tgcgctgggc 106981 tttaggcgcg tgctcgccgg agggatgcgg gaggatggag cctggcgagg gggaggcgaa 107041 gaggcatccg cggagaccgg gaggggcttt agcaagctgg gccctaaccg ctcagggtag 107101 ggaggggcgg tcgggccgag tcccggtgcg cttcggcgtc tcaggaggcc gtcgtgacaa 107161 tggcagcctt cgcgtcgtga gggggccgcc ccaggctgcc gcagcggcgc gcgcccggcc 107221 gcggggccgc gccgagactc ccgggggggt gggcgggggt ggggaggtgg gcgggcgcgc 107281 gcggggcggg gcggggcggg gcgcggcggg tggggcgggg cgcgcgcggc tcccgcgacg 107341 gattcttcct gggccgcagc accggccgcc ggcccgcccc gcccgccccg ccccgcgcct 107401 ctctagactc tcggcgagca gcgtcgcccg cggagccgcg ctggtgctgt ggcccgggcc 107461 ctggcagccc tcccgggagg cgagccgggg aggcggtggc ccctgccggt tgcggcgggg 107521 cgcgggaaga ggcgggactc tttgacgcgg cggaggggtc gggcgacggc cgacgcgccg 107581 ccatctttgg tccagtgcgg tggcggcggc ggcggtggcg gcggcgactg ctgcggtgaa 107641 ggaggaggag gagccgagcg ggcgctggca ccgaggcctg accatggacg aggaatacga 107701 tgtgatcgtg ctggggaccg gtctcaccgt aagtgcggcc ccggcgcccc tggccctgcg 107761 tcgtgccctc gccatccccc gcgatgatgc ggccgccggc ccatcctctg gtcactgcca 107821 tcttctgcgc tccgctcccc gaaaagccgg cgccgtgccc accccgctgc tcccccagcg 107881 acgtcgcctc tgctcgttgc catgacccca tctccccgaa gaggccccct tcctggccgc 107941 ctcctgggct acccgggtcc gacccccggc gccctcccat cttccctacc cctgcccgct 108001 ggaccggaaa acgcaccccc actcggcccg atgcccacga ccccgcccac ccatcgcccc 108061 cacccatcac ggcatccacc cgtcgcccgc acccattgcc cgcctagcgt tgcctctcag 108121 cgtcccttcc catcttgcgc taatcccggt ggggatgaag tagaaggacc tggggtcagg 108181 gggcctcgcc agagtgggcc ctcagcccgc ctcctcatcg tcttctgggg ccaggggatc 108241 cagggaatgt tccagaagga acagcagtgg gggcagtaag cacattcccg caaccccacc 108301 tcggttgccc atggagacct caggctgggt agcatctgca gcctttgtcc ttgggctagt 108361 gacagtgact ggtgagggtg tttatcttcc ccctttcccc gaaaaccccg ttggcctact 108421 cccctcagag actcttcttc ctcggtccag gcccagatgc ccctcctcag cggcaggggc 108481 tcctgctatt tgaagaaaga gcagtccagg gtggggccta caatttcaag gactgtgcta 108541 agcctttctt ttcctccctt tcgagtgtcc tttgcagtcc agctgggtct tctctgtgcc 108601 ttattgtgtt ttatccgcct ttttgtgcgg ctctgcgtag gtagatgcct ctcggcgtgt 108661 tggtactgga aggccatcct gtgtctgctg gatgccctgc ctccctcaaa gcctccgtta 108721 tttcactctc cgccttgtga gggggacggg actcctttgc ccagcaaggt tgcaatggag 108781 agtgtggtcg gatgaagcca tcaccccaat tttaggcgag gcaactagcc agtggggatg 108841 gtgggtggaa ggagctgtcc actgaggcag gtttgggtgg ggaacaaatc aggtctgctg 108901 gtaggagcgg aggcaggtgg gagtgcagcg gatggtgtca ccctccccca ggaatgcatc 108961 ctgtcgggca tcatgtctgt gaacgggaag aaggtgctgc acatggaccg gaacccctac 109021 tacgggggcg agagctcctc catcacaccc ctggaggagg tgaggctgcc caggcccaag 109081 cctatccctg ttaacctccc acatagcggc acagggtgga agtagctccc caggtgcctg 109141 gagcaggagg aggtggtccc tgaggtgtct ctctctctcc cttctgctta cagctgtata 109201 agcgttttca gttgctggag gggccccctg agtcgatggg ccgaggccga gactggaatg 109261 ttgacctgat tcccaaattc ctcatggcta acggtgaggg acaggaagga ggtattgcag 109321 gtcgggggtg atagggaaga cccgaggacg gtcaggctcc agaggaggca cggccagacc 109381 agctagagcc tctggagaga gccgtgctga tgttgccctt gtgcaccccc acagggcagc 109441 tggtaaagat gctactgtat acagaggtga ctcgctacct ggacttcaag gtggtggagg 109501 gcagctttgt ctacaagggg ggcaagatct acaaagtgcc gtccactgag actgaggcct 109561 tggcttccag tgagtgtggg ccccctgagc cagagaaatc atggggtggc agggaagcca 109621 actgtctttg ggatgggtgg ctgtccagaa taaggcaccc ccttgtccaa agtcacattt 109681 cccgtgaagg accagagagg cccttcactc tgtattccct ctgccccatg tcatgctgca 109741 cggctggctg ccaggaggga ggtcctcaca cagcagggat gtgatgtgga ggggaacggg 109801 gctccttttg aaagacttga gcctggccag gtgcggtggc tcacgcctgt aatcccagca 109861 ctttgggagg cctaggcggg tggatcacga ggtcaggagt tcaagaccag cctggccaac 109921 atggtgaaac accgtctcat ctcaaaatac gaaaaattag ctgggtgtgg tggcatgcac 109981 ctgtaatccc agctactcgg gaggctgagg caagagaatg gtttgcgccc gggaggcgga 110041 ggttgtggtg agccgagatc acaccactgc actccagcct gggcgataga gggagactcc 110101 atctcaaaaa aaaaaacaaa aacttgagcc tttggcaggg cctccactga atccccgaga 110161 catcgccaag tgtctgagcc aggaaggccc ttagagccca tccaggccaa tcccctcttc 110221 atccaggtgg ggaaactgag gctaaaaagg gtggagaggt taacctaaga tcacaggaca 110281 gctgggctgt gagtgggggt gcctgccctg ctgggatgga gagttttgtg ctgcattgat 110341 gcagaacctc ctcccgccgc gttccttaga tctgatgggc atgtttgaga aacggcgctt 110401 ccgcaagttc ctggtgtttg tggcaaactt cgatgagaat gaccccaaga cctttgaggg 110461 cgttgacccc cagactacca gcatgcgtga cgtctaccgg aagtttgatc tgggccagga 110521 tgtcatcgat ttcactggcc atgccctggc gctctaccgc actgatgagt gaggggaagc 110581 tgggtggtgg cagccccctc gcccggcccg cccccaccct tcccacacct gcctgctgtc 110641 cccctgcccc gcatgtctct tgtactcagc cacaggctca cctatcaccc tttccatttc 110701 ccccttgcat ttgatcctca tcttccccag gccattcagg gtggtgggaa gactggcctg 110761 gtacctgggt gggggtaaat caggggtgct gctattcccc cagctacctg gaccagccct 110821 gccttgagac cgtcaaccgc atcaagttgt acagtgagtc cctggcccgg tatggcaaga 110881 gcccatattt atacccgctc tacggcttgg gcgagctgcc ccagggtttt gcaaggtgag 110941 gacatgggtt ttttcagttg gcatagctga gtaggactag ggccaggagt ggagatggcc 111001 gccttgagca ttttcttggt agtatttcct cctttccaat tcgtgctctt catttacttc 111061 actccatcct tgggtgatga gggcacaacc ccagtgcact gaagtacaga agtctccaca 111121 cttgaaaaaa cgagtctcca caaatgtgac ttgctcagtt cccggcagca ccaggactgg 111181 gcccagggtc ccagaacctt gagtcagggc ttagctggga gctgacctat ctccagagag 111241 tccattctgg tccctcaggc tgtcttggtg cggagcactg atccctcatg ccccctcccg 111301 ctggcccttg gcagtgtatt ctgagcagaa gctccagaga ggaaagagcc cgcggagatg 111361 gggccagcgt tcacggcagt gacagagctg ggatgaatcc cggcgtctcc tacctgctgc 111421 ctgcctgggg cagccttctt tgcactttgg tggaggaagg gtgcagtgga attggtcccc 111481 gtttctcctg ggaggggcct tcaccggagc tgctctcacc acagattgag tgccatctat 111541 ggggggacat atatgctgaa caaacctgtg gatgacatca tcatggagaa cggcaaggtg 111601 gtgggcgtga agtctgaggg agaggtgagc ccttcccgtg catgcagtgc agtcccatgg 111661 ttgaggcggg gcttggtcac tttctctgtg ctgtcgagtc tcctgctgca cagccccaag 111721 ggtcttggag ctgggaactg ggggtccaga agtcttcaga gtcatgactg tgggtgagag 111781 gtctctttca gaggccacat gccatgtctg tggtgcagaa agtgttcttg atcacagtgc 111841 ctctgggggg cgggggggcg ggcacccagc ccgtgcgtca aggcccacct atctgagcgg 111901 cagggtaggg ccatgcattc aggaggggca gtagtgatgg ctccgaggaa cgtcttgtct 111961 gccagagctg cctgcccttg ctgtgagcgc agccccacgt ctacctaagg gagggatgtg 112021 acaggggcga ggacttctgg tgccttctca ggtggcccgc tgcaagcagc tgatctgtga 112081 ccccagctac atcccggacc gtgtgcggaa ggctggccag gttatccgca tcatctgtat 112141 ccttagccac cccatcaaga acaccaacga cgccaactcc tgccaaataa tcatccccca 112201 gaaccaggtc aacaggaagt caggtaggcc gggcgctggt acagcttcct ctagggtgtt 112261 ctctttgggg ttactctttt tctccccatc agtgtgggag ctccctgcct tacctctctg 112321 gaaaaaaatt tgagccacat tgtagccacc acaacttgga gccctagggt ttgggtggcc 112381 tggactcttt ccctctggga gggtggcggc ttggatgcta cctgctctgc ggggctgggg 112441 aggggcagtc gggccagact tgtgtccagc cactggaacc cctctctgcg tatggccagc 112501 gcctctggcc tgagttgacg gagctcttcc tgtggccaga catctacgtg tgcatgatct 112561 cctatgcaca caacgtggcg gcccagggca agtacatagc tattgccagc actactgtgg 112621 agaccacgga ccctgaaaag gaggtggagc cggctctgga gctgttggag cccattgacc 112681 agaagtgagg gactgggctg agcccaaggg ggagagggag ggagcccagc agctgggctt 112741 agggaccagg aagggcatgg cccattgtga ggcctgtccc ctcccctctg tctgcccctc 112801 aggtttgtgg ctatcagtga cttgtatgag cccattgatg atggttgtga gagccaggta 112861 agcagctcgt cccagccctg ggctcctggc tgcccgccca ggatacctgt ctgactcacc 112921 ctgggcgctg ggctctggct gtttccaggt gttctgttcc tgctcctacg atgccaccac 112981 acactttgag acaacctgca acgacatcaa agacatctac aaacgcatgg ctggcacggc 113041 ctttgacttt gagaacatga agcgcaaaca gaacgacgtc tttggagaag ctgagcagtg 113101 attgtggccg cccccagccc ctgctgcccc agcctgtgtc tgttctcctc gagggctcca 113161 gcatcctctg cttcccccac cacgttccca tcacccacct cattgatcca ctgaccaaat 113221 ccttaaccct agcgatggct tgggagatgg ggggttggat agcatcctct ttcttggccc 113281 ttccttatcc taggaaaaga gggttcctct ccttgtgtgt gtctcttccc cccaccccta 113341 attcttctgc tctgtttggg aagacgtgga ggaaaaggtg acttctgccc ccaccgctct 113401 tacccccact gtagtggcct ttggagatgc ccccacctcc cccccaccaa ctctcgcgtg 113461 ttggagagaa ggggccctcc cagcacaaag ttgcattcct cccccctaat ttattctaat 113521 ttattaactt tgacccaccc tttctgagcc tgcagccttc ccgtgtggcc tgagggctgt 113581 cgagtgagct gccccagccc ctcccagccc ttgcccagcc tgggggagtg gggaaggctt 113641 gggcatggcc ccgttggagg ttgatttgct gttttgtttc ttgtctttgt gttctgtggt 113701 acttgctgag agaaaagaaa agtgagccaa gcagaaggag gtgggaaaac ggacccaaac 113761 cccagtgtgc cctgccccat gcctttcctt tagtggtggg aaacccttat cttgcaaagt 113821 gaatgtgtcc ccttccccac cctctagtgt atttcacaga aaacaaaacc tcccaataaa 113881 acggttgaaa cctgaatctc tggatcttga tggttatctc ccaaaagctg ggtacctggc 113941 agggctttgg ctgcctgggt tggggctctg gcttagcttc actctggact gtaaaaggtg 114001 gcattcttca ccagggaatt cctcacccag agcacgctgg ggccaaattg acttaaagct 114061 taaggctacc cagtgagtcg atgctggcac caggactgca gccctcagtg ctcaatgcag 114121 ggggagtgtg gtggagggcg ggtgtcaacg tggaggaagc agagggagcg ggggtcctgg 114181 cctctggctt gaaaggagag ggcagagggg accaggggcc agcctcggat gggattctaa 114241 gtgaagcagg gagacgggtg ccagttttcg gcttaatgcg ggggaactag ggcagccagg 114301 ggaccagcgt tggggtcacc aggaggaatt agagggaccg gggcccagcc ttggctcggg 114361 ttaacgtcga ggcagcgcga ggggctgggg aattcagcct ccggggtcca caggagccgg 114421 cggcgggggc tggggcggga gtccacgcag gaccaggggc ggtctccggc ggggcggggc 114481 gggcggtggt cacgctccgg gccagctggc gcgcggcggg gcggggcatc cgtgcgtctc 114541 ctggtggctg acgtcacggc gcgggcgtca gtgactgttc ggccgccacc gccgctgccg 114601 ctgccgctgt cgctgtcgcc gccgccgccg cccgccgccg ccgccgccgc cgccgccgct 114661 gccatggctc aatacaaggg cgccgcgagc gaggccggcc gcgccatgca cctgatgaag 114721 aagcgggaga agcagcgcga gcagatggag cagatgaagc agcgcatcgc ggaggtgcga 114781 gccggggagc ctcggagcat gcgcgctgcc caggtcccct ggcggccccg cgccacgctc 114841 cggccctgga agccctgggg cggtggccac ggagcgcggg cggcgcgccg gggcaggccc 114901 ctggctcgct gacttccagc agggcacaca gaagcaagcc agcgacagtc tcatctgtga 114961 aatgggcctg gagccgggcg ttctccgagg ctgaggccac tagctgatga ccggcctccg 115021 gcaggaggga gccgagccag tgtttctgtt gggaggcacg gcggggacgg gcctctcgtc 115081 gcctcgcctg ggcacccagc gtcctggctc tggcccctcg gccctcagag ccaggcctct 115141 gccccctcgg ccctcagagc caggcctctg ccctggtcct atggacaccg ggcagggcgg 115201 ttcgtctccc cctggagagg cggggaggcc tgagccagac ctggcaagtc cactggccag 115261 gaggctcctt ccagaacccc ttgcacctgt gaggcaggca cgaccctccc gggtgccagc 115321 ggttcaccca ctgcttccgt gagtggcctg tgatggtggt ggtcagggaa ggagcgtgga 115381 ggagcttgat ccttttgggg agactcactc atcaggactc ggggctccag gggagagcag 115441 gctcctgcct ggggaactgg ggaagcctgg ccagaggagc cttgaaacac caagtcctca 115501 tgccctggcc gcactgtctg agccggggtg cccagcaagc agaaggtgta tttctctgcc 115561 agccaccttc ttgggagacc actggtcagc gagggcttga catggtacca agtgggttgg 115621 ggctggggcc aaagagaggc ctcctgggaa gctgttgatg acattctggg gcctggaggg 115681 aggaaaatca ctcagggagg ggccaggccc ctcaaccact ctgttcttgt ttgccctctc 115741 ctcataaccc ttggcttggg tctcggaagc ttcagcttaa aggctggaga tgaaaaaggc 115801 caaaggtcag tagggtcggt ggcttcaggc tactgaggga cctgtgatcc ccagcaggag 115861 acctgaagag ccagagcagc tcagggatgg aaacagcccc atttggcact cttagcattt 115921 gaagagccct gcccagcctc atggaacctt ccatcgacaa gggcacaggg gcagatttgg 115981 actataacgc aggtccttga tggccagcct gccatttcca cagtgagggg tggttctcat 116041 ggtggctgtc actccccgca ggagaacatc atgaaatcca acattgacaa gaagttctct 116101 gcgcactacg acgcggtgga ggcagagctc aagtccagca ccgtgggtga gcagggtgcg 116161 ggtgcccctc acccgggccc cgggccactc ttccgctggc cctgcgacta ccttgcccct 116221 tgtgcctcct tgcttcaaag gtctcgtgac cctgaatgac atgaaggcca agcaggaggc 116281 tctggtgaag gagcgggaga agcagctggc caagaaggag cagtccaagg agctgcagat 116341 gtgggtcctc tcgccgcagc cctcaacatg gagctagtgc ttacggggtc ccgtggctgg 116401 tcgggctctt tttgcaccct gggggggacc ctgtctccag ctttgggaca ggggtagcat 116461 ctagcttgcg atttgcattt ctctctcccg cctcctcggg tccacgctca cgtcttttca 116521 ccatggcggc ttgtcttctt tctgaccctc taaatgcagt gggccctttc ccagcatccg 116581 gtgcctgagc agcctgcgtt ctccccattg ccagcctgtg cttggctctc ccctgtggcc 116641 ctctgtgctt gctgtttgta acagagccct cgcccatgag ctcaaagtga tccaggacag 116701 caggacccgt ctggggcaat ggggacccag cctgggctgg gcaggcccca catagggcac 116761 acaggtggct ctgggtcaag gaaggggctg ggtcagggtc gggaaggaca tgatgtgtga 116821 tggggccacc tgcacctcac taggaagctg gagaagcttc gagagaagga gcgtaagaag 116881 gaagccaagc ggaagatctc cagcctgtcc ttcaccctgg aggaggaaga agagggaggc 116941 gaggaggaag aggaggcggc catgtatgag gaggagatgg aaagggaagg tgagggctgg 117001 cttgagtcgg agccccgccc gcagcccctg gggaggggtc cccaatgtga gaggctgtca 117061 gcaaagctga tgccctgtca agatggctcc aaaagctgcc ctcttgaggc accgcgcaca 117121 gtagaccggg gagggaagag gctggtctag accaggctgg ctgggcacat tttctgtgat 117181 gatggaaatg ttagagtctg tgccacccaa tatggtagcc acttgcccca ggtgcctcgt 117241 atgactgggg cactaaatgg tgaattctgt ttgatttgtt tcagcgtaaa tgtatacagc 117301 cacagaaatt ccgaggctac tagtcagaca ctgcaggtct cagctgtgag aacaaggggc 117361 tgtcagggaa agctcaggat ggctggggag tgagggtaaa gccagcctgg cctccacgtg 117421 tgccacacac ctgtgctgtg cctggccagg gcacataggg tgctgtccct ctcctggtgg 117481 aagtcacggg tcaccatgca ctgaagacac tactcagggt tactgctctg ctcggggccc 117541 accgggctaa cccaggggtc aaggaaggag gggagggttg agttaggccc taaggatgag 117601 gggagttgca ttcgccagag aggcaggagg ggcgacgaga aagtacatgc cagaagccca 117661 gcaggaaaga gcttttcaaa acgtttgaag attgggagca agttgtgtca cgggcccctg 117721 gagggtgtgg tggcagttct ggaaattgtg tcacccaggg aatggcagag ggagagaagc 117781 gaggcagggg atggtgctgt gtggctgtgg taacaggctg ggtgccatgg ctttgagggg 117841 agtgcaggat tgtagggagg aagaacctgc cagccacctg ccatttgggt tccatgttcc 117901 agcagccaca gtaaaacata gaaaaagaaa caagcaggat tttagtatga cattgtattt 117961 aacccagcgt atccaaaata tcaccattgc ggcaggcgca ctagctgcaa tgctggggtg 118021 ctgtgtgcag cgggagctac tggtttgtgt ttgtgttttt gttttggttg ttgttgtttt 118081 tgagacagtg tctccctgtg gcccaggctg gagtgcagag gggcgatcac agcttacttc 118141 agccttgacc tcctggactc aagggaccct cccacctcag ccacttgagt agctgggact 118201 acaggcgctt tgtatttttg tattttttgt agagacggaa tctcgccatg ttgcccaagc 118261 tggtctcgaa cttgggagct actggcgtgg acagtgcagg cctgggggaa aggttgtcac 118321 cctcactgtg tgtcactggg gcttttacaa aacactgctt ggtgggctct tccctccctc 118381 cacaggttcc gtttaggggg cccagccaca agaccccagg ctctagaatc tctgcccttg 118441 agcatcaggc taaagagttt agatttgtct ttttttgttg ttcttttctt ttttttcttt 118501 cttttttttt tttttgagat gaggtctcac tctgttgccc aggctagagt gcagtggttt 118561 gatcacggct cactgcagcc ccgacctcgc aggctcaagc gatcctccca cctcagcctt 118621 ctgagtagct ggtactacag gtgtgccacc acacccggct gatttttttg agttttagta 118681 gagacgaggt cttgctatgt tgcccatgct ggtctccacc tcctgggctc aagtgatcct 118741 tctgcctcat cctccccaag tgttgggttg acaagcgtga gcctctgtgc ccgccctagg 118801 cttgtcttct agatcgactg ttctcaaggt atgcttgggg gaccctgggc gtcggccata 118861 tcaaaatcat ttttataata atgctatgat attcggcttc cttttgttcc agagatcacc 118921 acgaagaaga gaaaactggg gaagaaccca gacgttgaca caagcttctt gcctgatcga 118981 gaccgtgagg taaaggctgc ctggccctga ccccggcctc gactccagcc ctggtggaga 119041 cccttgctta gggagccagg ctctgcgtgc accctgagca gccctgggcc tcacctgtct 119101 gcctgctccc tctcccagga ggaggagaat cggcttcggg aagagctgcg gcaggagtgg 119161 gaagccaagc aggagaagat caagagtgag tgtttgcgga gtcagacgcg aggggcctgg 119221 gcccaagcca gcttccctgt ccacagtctc ctgggtggtg cgctgagccc caaggctgct 119281 ctcagtgggg ctggagacca agaggcagct ctgccttgta ggtgaggaga tcgagatcac 119341 cttcagctac tgggatggct ctgggcaccg gcggacagtc aaggtaggca gcgtgcagcc 119401 tgcttcctgc tcaccatggg cccagcctcc ctcaggttcc gtgggaagga actgaccttc 119461 ctgacttggg aagccccagg gatcctgagg ataactggct ccagggcctg gccccctgtc 119521 tggggagtga ggcccaggcc tggctgaggc tctgcagcgt ctggcagggc cctctgggcc 119581 tctgccttgt tgagagccct tctgcaggct tccgccttcc cttcccaact cctggattcc 119641 cggatacaga tgagaaaggg caacaccatg cagcagttcc tgcagaaggc gctcgagatc 119701 cttcggaaag acttcagtga gctgaggtgt gaggtgtgcg tgtgtgcacg gtggtgtgtg 119761 tggcacctgt ggctccagcc tgggtgccag acacccagtg tgatgtgggc gctgtctggg 119821 caagcaagca gcgtgctggg cactctcctc ccaaaagagg gggtcaggct cctcttccct 119881 tccctcccac cagagagctt tgctgagagc ggcctgcctt ggttccttag cccaaaggtg 119941 gagggtgctg ggcgccagtc ctgtgctcac taaaccatgg agcttccctc acagccaact 120001 ggccgctgct tcccagcccc agcatgggct ctcctgggct ggctgtctcc tccctccctg 120061 ggcaggggtg ctgtcctctt gcccacgccc tcctgccttc ctccctcagg tccgcagggg 120121 tggagcagct catgtacatc aaggaggact tgatcatccc tcacgtgagt cccttcagcc 120181 ccagtacccg cagtgggtgc agcaccctgg gctttggctc aggctggtgg gggcagtgtg 120241 cgcaggcggg gggtgtgggg aggggccgag atggaccatc aagacgccgc tttcctgccc 120301 ttggctccca gcatcacagc ttctacgact tcatcgtcac caaggcacgg gggaagagtg 120361 gtgagtgccg ccgacccagc cgcccccata gcaccttgcc gccgatgttg tcactggggc 120421 agcagcggga cattgggctg tcacttctcc cctgcaggac cactcttcaa ctttgatgtt 120481 catgacgatg tgcggttgct cagtgacgcc actgtggaga aggatgaggt acagcgtagg 120541 gggccgtgtg agggggaacg ggtgtccctg ggctatggag gagaagccaa gggaagggga 120601 ggtcagcagg cggccctgtg ccctcttcct cccttagtcc catgcaggca aggtggtgct 120661 gaggagctgg tacgagaaga acaagcacat ctttcccgcc agccgctggg aaccctacga 120721 ccctgaaaag aagtgggaca agtacacggt gaggaggggc tggcagggac ccctccaagt 120781 tggggacggc agccagcccc tgctcacccc tcgccttcct tgtctcctct gcccaccttg 120841 tcctcacact agatccgctg agcatccagg aggctgcgcg gccccggctc ctcagctccc 120901 tcagtgtgcc ccgtggtgtc accgggactc caggcacccg ctcccctgcg accatgccag 120961 gcacgctggg aggaggacgg cagctgctcg tgtcctgccc ctgccacatc agtgactgct 121021 ttattctttt ccaataaaga agtgcacgtg tcagagctgg agcgcctgca ttgtgagaaa 121081 ccatttgtgt tcggaccaaa ttcatttgtc attttgacca tttaaaactg tacaatttgg 121141 tggcatttag cacatttgcc atgttgtgta gtcatcaccc tggtctgctt ccaggacatt 121201 ctcatccctc caaaggccct gtggccatag gcaggctcct cccctcacat ccccccacca 121261 ggctctcatg tgctttctgt ctccatggat tgacctgttc tggacgtgtc atacgtgcct 121321 ggggagtctg tcaccgtggc agccccacca tccctgaggt gtctttgtgg cactgtgctt 121381 cttgaccgtg ggcacctgtc cttgggcggt ggtggctctg gtgggtggga agggggttcc 121441 tccccaggcc tctctcccag gcccttgggc tcctgttgag tttcccgtag ggacgtggcc 121501 gtcctgtggc cacgaaaacc catcctgcct tccttgctgt gggccgtccc agcacagtcg 121561 ttctgctgtg gtgccctgtg ctcgacactt gcctgccctg ccctagctag ccctgctcag 121621 acagcccttg gggcaggatc cagatcgggg gtaggtccag gacccctact ctgccacctc 121681 agcagctgat tccccaagtt ccacttggct aagctgaggc aggcaggcct ggctccagga 121741 ggctgaggaa ggccagctga cagctccctg tttcctcccc tggcctgaag ccaagaagaa 121801 agggcagggg atagcccctc actggccagg atcaccccag ccctgggtct ggcctcccct 121861 tcctggcaaa tgtggagatc tgatggtgcc cgatagtgca gaggcccagt gtctcattca 121921 gcgtggccgg gaagtagcca cctggcgtca ctaggaatag agatgagagt agactaatgt 121981 tttctttctt ttcttttcct tccctccttc cttccttctt ccttccttcc ttccttcttt 122041 cttccctccc ctcccctccc ctctccttcc tttcctcttt cttttttttt tttgagactg 122101 ggtcttgctc tgtctcccag gctggagtgc agtggtgcaa tctcagctcg ctgcagcccc 122161 cacctcctgg gctcaagtga tcctcccgcc tcagcctcct gagtagctgg gactacaggt 122221 gtgcaccaca atgcccagct atttttattt ttatgtacat gaaaattgga acagaattaa 122281 aaaaaaattt tttttgtaga gatgggatat tgctctgttg cccaggctgg tctcgaactg 122341 ctgggctcaa gtgatcctgc cacctcagcc tcccaaattg ctgggattgc aggtgtgagc 122401 cactgtgttt ggctgagtag gctatttgat gctctggaac cagataaggt caccaaggat 122461 cagtgtgtgc caggggtggt cacccacttt gctcacccgg gcctcgttcc tgggggctgt 122521 gtgtgtagaa aacagagagg gaaacaaaca aaacaaaatg tagaacaaga tgagctggcc 122581 tttgtggcgt ggagacattt aagctgacat ccaaatgaca agaagtcagc cattccgatg 122641 cgggagggag tgttccaggc agagggaaca ccaagtctgg gtcaaaagtg agctggaggc 122701 cgggtgcagt ggctcatgcc tgtaatccca gcactttggg aggctgaggc gggtggatca 122761 cttgaggtcg ggagttcgag accagcctga ccaacatggt gaaaccccgt ctctactaaa 122821 aatacaaaaa ttagctgggc gtggtggcag gtgcctgtta tctcagctac tcaggaggct 122881 gaagcaggag aatcgcttga acccgggagg cggatgttgc agtgagccga aatcacggca 122941 ccgcattcca gcctggatga caagagagaa acgccgtctc aaaaaaaaaa aaaaaaagaa 123001 agaaaaaagg tgagctgggc tggaacaagg ggaagctacc tttttccctc atcttcactg 123061 tccagcagaa gccatgtatt caggcggttc ccatgtcagc atggaagcca gttcctgcaa 123121 tgtccccaac agagatgtgc ttttcttcac tgcctgctta cctgttctct ttctctagac 123181 aaagacaagc tcctacggct gctcctcctg gccctcacct ttctcgcatg aaaggtattg 123241 cgcttgcatg ctgttcgaca ccgtgtgata tagtgtccat tagcgatata tcccaatact 123301 cggcatcttt ccacatgggt gcatagagac cctcattctc tcttcagctg cagagtcagc 123361 atggcagata ttggccaact ccactcctta gggcctgccc acttggcagc ctcatgggca 123421 ggatgagggc cggctccccc agccttgccc acacagtgcc ctcactcgcc gccaggagac 123481 gcgtgccaag ctgagatgga gacatggttt ctcagcatgg ggttttgttt cttctttgtg 123541 tttttaattt tttcaactga ggaaaaattc acataacaga aaagtagcca ttaaccattt 123601 taaagtttac aatttggtgg catttggtac attcacattg ttgtgtgacc attgcctctc 123661 tctactggaa cattttcatc cccaaaagga gaccccgtgg ttggttattc agcagtctct 123721 ccccattccc ccactcccca ggccctggca gctgctaatc taccgtttca atggattgga 123781 ctgttctgga agcttcacat caatggaatc atacaataca tgaccttttg tgtctggctt 123841 gtttttctta gcatcatgtt tttgaggttc atggcttact ttttataact aactgagtcg 123901 tagtccatta tctgtgtaca ccacatcttg tttatctatt catccatgga tgggcatttc 123961 aggtgttcct acctttcggc tattattgta aataatgctg ctatgaacat tcacgtacaa 124021 gtttttgttt gaccttagtg tggtttttaa tttgtatttc tcttttcaca tgtttaaaaa 124081 ctatgtgcat tttatttctc ctgcaaactg tctattcctg gcatttgccc attaattgta 124141 ttgttggtct tctttgtttt tttttttttt ttttttttaa tagagacagg atctcactat 124201 gttgcccagg ctggttttga aatcttgggc tcaagcaatc ctcctgcctt ggcctcttgt 124261 tggtcttttt ttatctagaa atcggtcagg tgcagtggct cccgcctgta atcccagcac 124321 tttgggaggc cgaggtgggc agatcacctg aggtcaggag ttcgagacca gcctggccaa 124381 catggtgaag ccccgtctct actaaaaata caaaaattag ccgggcgtgg tggtgggcac 124441 ctgtaatctc agctgctcag gaagctgagg caggagaata gcgtgaactc aggaggcgga 124501 ggttgcggtg agccgagatc gtaccactgt actccagcct gggtgacaga gcgaaactct 124561 gtctcaaata aataaataaa taatatgtag aatcctaggt accttccatg tatgagagag 124621 agtacccctt tgtgttatac agtgcaaata cttttcctcg tttttcattg atttgatggt 124681 gttttttaga cacgagaagt ttatttttag ctagctggac tcttttgtct tctatgcctt 124741 ctgggtttga ggtcatagtt agaaggtctt ctggaagtca gtgttataaa ggaagtgatt 124801 ggaggctttt accaggagca agacataatc ttgacttaag ttttccagtc actcttgtag 124861 cagtgtggca aagggactgg gggagggcaa gcatgggagg ccagtgagga ggccaccaca 124921 gctatcctga tgagaggctg gtggcttgga ccagggtggg gacagaatat cagcttctgg 124981 atagattttg gggggaaagc caacaagatt tgctgacgaa taggacatgg agtttgagag 125041 aaagagagga gtcaagaact ccactttggc ctgagcaaga ggaataaact ttactaagat 125101 gcggggcatg ggaagagaag caagtctggg gtgaacaatg aaggctttgc ttgtggaaat 125161 ggtatgttga gatgcctgtt agatccaagc agagatttgg atctatgaac ctggagcctg 125221 gagttcacag gagagcttga gaggatggag tgagaaattg cagagctgtc aagacagaca 125281 tggctgatgc tctggaacta gaaagatcgc caagggaatc aaagggtaac acctgctcgt 125341 ctggtcaagc caatggatga gatgacaaag atgggactaa ggccaggcac agtggctcac 125401 gcctgtaatc ccagcacttt gggaggccta ggcaggtgga ttgtttgagc ccaggagttc 125461 gagaccagct gggcaacata gtgagactct gtgtgtatag caaatacaaa gattagctgg 125521 gtgtggtggc acacgcctgt agtcccagct acttgggagg cttaggtggg cggatcactt 125581 gagcccagga ggtggagacc agcctgggca acatggtgaa accctgtctc tacaaaaaat 125641 tcaaaaatta gcctggcgtg gtggcatgca cctgtagtcc tagctacttg ggaggctgag 125701 gtgggaagat tgcttgagcc tgggaagacg aggctgcagt gagtggtgtt cataccactg 125761 cactccagcc tgggtgacaa agtgagacat tgtctccaaa aagatcatta tacaaaaatc 125821 aatttttaaa agttttttga gatagagtct tgctctctgt tgcctaggct ggagtgcaat 125881 ggcatgatct tggctcactg caacctctgc ctcttgggtt caagcgattg cccccctcct 125941 cagcctccca agtaactggg attacaggtg cccgccacca tgcctggcca attttcgtat 126001 ttttagtaga gacggggttt tgtcatcttg gccaggctgg tctcgaattc ctgatctcaa 126061 gtgatccgcc cacctctgcc tcccaaagtg cttgaattgc aggcatgagc caccacacct 126121 ggccaatcaa ttgtatttct atacacaatc caaaatgaaa ttaagaaaat aattccattt 126181 ataacagtgt caaaaaagaa taaaatactt attaatagat ttttttaaaa gaagcatgag 126241 atttgtacac tgaaaagtac aaaacatcat tgaaagaaat tcaagaagac ctaaataaat 126301 ggaaagacat ctgtattagt catggttctc cagagaaaca gaaccaacag gacacacaca 126361 cacacagttt attttaagga tttggttcac ccaattgtcg gggctgacaa gttggtgtga 126421 aatttgtagg acagactgac gggctggaaa ctcggagaat ttctgtgtta cagtcttgtt 126481 gttgttgttg ttgagatgga gtctcactct tgtcgcccag gctggagtgc agtggcatga 126541 tatcagctca ctgcaaccac tgcctcctgg gttcaagcga ttctcctgcc tcagcttccc 126601 gagtagctgg gattacagat gcccgccacc atgcccaact aatttttgta tttttagtag 126661 agacggggtt tcatcacgtt ggtcaggctg gtctcgaact cctgacctca ggcgatccgc 126721 ctgccttggc ctcccaaagt gctgggatga cagacatgag ccaccgcgcc tggcctgtgt 126781 tacagtcttg aggcagaatt tctgcttctc caggaaacct cgggttttgc tcttaaagcc 126841 atccaactaa atggatgagg cccacacaca ttatcaaggg tgatctcttt tacttaaagt 126901 cagttgaggg ccaggcatgg tggctcatgc ctataatctc agtgcttttt tttttttaga 126961 cagagtcttg ctctatctcc caggctggag tgcaatggca cgatcttggc tcactgcaac 127021 ctccccctcc caggttcaag agattctcgg ccgggcgcgg tggctcacgc ctgtaatccc 127081 agcactttgg gaggccgaga tgggcggatc acaaggtcag gagatcgaga ccatcctggc 127141 taacacggtg aaacctcgtc tctaccaaaa atacaaaaaa attagccggg cgtggtggcg 127201 agtgcctgta gtcccagcta cttgggaggc tgagacagga gaatggcgtg aacccgggag 127261 gcggagcttg cagtgagccg aggtcacacc actgcactcc agccttggcg acagagcaag 127321 actttgtcta aaaaaaaaaa aaaaaaaaaa aaaattctcc tgcttcagcc tcctgagtag 127381 ctgggactac aggcatatga caccatgtcc ggctaatttt tgtatttttt gtagagatgg 127441 ggttttgcca tgttggccag gctggtctcg aactcctgac ctcaggtgat ccgctcactt 127501 cagccttcca aaatgctggg attacaggct tgagtcactg cacccagcat atgccagtac 127561 tttggaaggc tgaggcagga ggattactta agcccaggag ttcaagacta gcctgggcag 127621 tgtagcaaga ccctgtctct ctctacaaga ataggaaaag ttagccaggc gtggtggcac 127681 ttgcctgtgg tcccagctac ctgggaggct gaggcaggag gatcccttga gcccaggaag 127741 tcgaggctgc agtgagccat gatccaatgc cgctacactg cagcctgggt gatagagcgc 127801 gacctcaact tggggggaaa aatctgtcta tctatctatc tctccatata tgtgtttaat 127861 aaaaatgggc aaatgacttg aatagacatt tcttcagaga tcatatacaa aaggccagca 127921 agcacatgaa aagatgctga acaccattag tcatcaggga aatgaaaaag aaaaccttaa 127981 cggggcgcgg tggctcacac ctgtagtccc agcactttgg gaggctgagg cgggagcatc 128041 gtttgagcct ggggcggtcg aggctgcagt gagctatgat ctcaccactg cactccagcc 128101 cgggtgacag agcgagaccc tgtgtcaaaa gaaaacaaaa aaagaaaagg aaaaagaaag 128161 caagggaagg aagaaagccc tcggttcagg gtcgccgggg aaggggctgg tttggacggc 128221 tgtcccgcag ccctccgggt tagagaccgt cgggggcgtg tcttgagctg gggacgcgac 128281 tcctcggcgg gggtgcacct gagaggccgg gaccagcgag gccgcgcccc ggtggcaggt 128341 gaccccgggc gcgtctccga gcgccagggt ggggcgaggc ggacgcgcgc cccaggcccg 128401 agggggtgtg gcgcgggcgc gcgcgggttc cggggtcgcc cgaggcggcg ggcggggagc 128461 cggggcgcgg gggaggcgcg cgccgcggaa cgcgccatcg agcgaagggg gtcggctggc 128521 cgggcgggtc ccccttcagg tcgcgggggt cagggctgcg cgcgaggtcc ctggccccgg 128581 ggagcagcgg ccccgcctgc gtcccctgcc gccgcgtccc cctcggccgg gcgtccccgg 128641 acgcccccgc cccgcgcggt ggtacggtcc gggtttaaaa ggctccggcg gcccccgggc 128701 cagcccagtg tgtgggcggc ggcggcggcg gctgcgcgct tggggcccgg ggcgcggggc 128761 gaggccgtgg ggacgtgcga gcggggccgc ggtggggctc gggggcgcgt ccacgtggcc 128821 agagggcggc cacccggagc cagcggaggg acaggtaggt ggccgtcatc gtcgccgtcc 128881 gtgcggggcc aaacagctcg gggcccgggg ctccggcagc ccgggggtcc ggctcggcct 128941 gtgggggtca ggccctatcc ctgcccgtcg gcgccggcct gcccgactcc ttgtgccccg 129001 catggtgggg acgccaaccc aagaccaaca ccctgcccaa agaggaccct ttgtcgcagg 129061 ggagcggccc ctcggaggat caacgctgta cttccctccc gctaagggct ccgcgagtcc 129121 gccgaggggt gccctgatgg ggcccgggca aaccggacca gcagcacggt ggggggtgtg 129181 gggggcggcc gctggacccc cgtctctgcg tggggctgcc ccttcctcag gctctctgct 129241 ggcccagctg tacctgtgat gcccgaggtt gccacttaca ggtgggggct gggcctgaaa 129301 ctgcctcccc aggtctccct cccgacctgt ccccagcctg ttcctaggct tgtgttcccg 129361 gggtgtagag gaggtgaaga caggatgtgg cctctctttt ctttggcctt cctttcttct 129421 ccggctgttg catgaggggt ccagctgaca aaagagccct ggagaggtga tcaatctgag 129481 ctgaaacttt gcagtttcaa gctgggcccg ctgcttgttc ctggcaaatg ccttattttt 129541 cctttctctc tcctgtgcct ccttggtttg gttccctgca ccaccccccg ctcccccccc 129601 gcctcccgcc ctcaagctgg gtacctccta tctcccccgg gagctctggc actgagaggg 129661 tagcgctgag ccctgccctg ttggctgcca cgacactgct ctgctgtctt agcagcctgg 129721 ggctgactgg gtgtggcctg gggactaagt gccagtccca gcactgtccc tgtggctcct 129781 gccctgtctc tccaaggcct tgactgtatt tggggacatt gctcctcctc ttgtaaggct 129841 cctgggactc caaagggcag gtagggcccc ctgtacattc ccttctccag tgaacccctg 129901 ggtctggagg gggagccatg caggctgtgg ctgagcagag cttgactgat gtgtgtctgg 129961 gtgcgtggct gaggggcagg cgcgacttgg ctttgtgttg catcggggag gccagacagc 130021 gggctcctgg atggccagct gaggcccttg tcggggaggc agctggggcc cctgcagggg 130081 acatggtggt gctggatcct gcatcaggcc tccattcccc agacctctgt cccagggctc 130141 ctggtggcag tgccactcca ggtggtgttc ctggccctgc ccctctcccc cagcttgcga 130201 ttattctgtg cctgcctccc tctcccctca gctcaggcgt tgggggcagc ccagaggcct 130261 ggggcttgct gttctttgct gttggttgcc ccgggctctg agatgtggca tctggggtcc 130321 cagttccagc atctgctctg ctaccccctt gggtgtggcc cagctgaggc gaggcctcct 130381 ggctcctttc cagcatgggg gctgagggac gattgatggc ccccaagcct gccctcgctg 130441 gctgctcact catgcgccct ggcggaggcc cctcctctct ccctgtcacg gtggccatct 130501 ggggctttgg cggggtggtg ggtgaatggc ccagccctct gtggctcgag gcctctgact 130561 cccacacacc gtctctccct aggcctgtcc ccaggcgcgg ctgccggcca tgccctctgt 130621 ctgcctcctc ctgctgctct tccttgccgt ggggggggcc ctgggcaaca ggcccttccg 130681 tgccttcgtg gtgacagaca ccacgcttac ccacctggct gtgcaccggg tgactgggga 130741 ggtgttcgtg ggcgcagtga accgagtctt taagctggcc cccaacctga ctgagctgcg 130801 ggcccatgtc acggggcccg tcgaggacaa cgctcgctgc tacccgcccc ccagcatgcg 130861 cgtgtgtgcc caccgcctgg cccccgtgga caacatcaac aagctgctgc tcatagacta 130921 tgcggcccgc cgcctggtgg cctgcggcag catctggcag ggcatctgcc agttcctgcg 130981 tctggacgac ctcttcaagc tgggtgagcc gcaccaccgc aaggagcact acctgtcggg 131041 ggcccaggag cccgactcca tggctggtgt cattgtggag cagggccagg ggcccagcaa 131101 gctgtttgtg ggcactgctg tcgacggcaa gtcggagtac ttccccacct tgagctcccg 131161 caagctcatc agtgatgaag acagcgcgga catgttcagt ctcgtgcgtg agccttcctt 131221 ctcttcttcc tccacccagt cctggctctg cctcccagaa gagccctgtt ctttctgaga 131281 ggggactttc ggctcctcag tgtctgggat gcagacaggt tgcggagggt gggatgacag 131341 cctgaaccca ggttgcgggg gtcccctgtg tgcagggagg ctggtcaccc tgccctctgc 131401 agcctttctg gcctgggcct ctgtgatcat ccaggcggga gggggatgca gagggaagct 131461 ggcggtggcc cgtgatgttg gcacagggcc tcaccggatg ctgtctcctc cccctgctcc 131521 ccaggtgtac caggatgagt ttgtgtcctc ccagatcaag atcccctcag acacgctgtc 131581 cttgtaccct gcctttgaca tctactacat ctacggcttc gtcagcgcct ccttcgtgta 131641 cttcctgacg ctgcagctgg acacccagca gacgctgttg gacacagcgg gcgagaaatt 131701 tttcacgtcc aagatcgtgc gcatgtgcgc gggagactca gagttctact catacgtgga 131761 attccccatc ggctgctcct ggcgcggcgt ggagtaccgc ttggtgcaga gcgcccacct 131821 ggccaagcct ggcctgctgc tggcccaggc cctgggcgtg ccggctgatg aggacgtcct 131881 cttcaccatc ttctctcagg gccagaagaa ccgggccagc ccaccccggc agaccatcct 131941 ctgcctcttc accctcagca acatcaatgc ccacatccgg cgccgcatcc agtcctgcta 132001 tcgtggggag ggcactctgg ctctgccctg gctgctgaac aaggagctgc cctgcatcaa 132061 caccgtgagc ccctcatcac cccacactgg tcctctgccc tgtcccaggt ctaccatgcc 132121 cagcgtgggc tcacggccag tcatcctgtc ccaggctttg ccatggccct gggagtggca 132181 cctttgccct cccacctgtt tctccctccc aggggtccct gccctctcct tgcctctctc 132241 tgggctgccc agtgccccac ttctgcttcc ttccgttccc agctcgttcc tccttgctcc 132301 gtctcccggt tgccccttgt cttgagccaa caagagtctt gtgggcccag gcttcgctcc 132361 tcgctccctg ctgtccctcc tctgtctccc tccgcagcca ccaatttcct ggagcaccct 132421 agggcgagca gtcagtcctg gctggagggg accgggttct agatcccgct ctcttctggg 132481 actcaggaac tgccggcctc catcttgcgc ctgggagctt tgactctcac gggttcctcc 132541 tctgtttcac cagcccatgc agatcaacgg caacttctgt gggctggtgt tgaaccagcc 132601 tctgggaggc ctgcatgtga tcgaggggct gcccctgctg gccgacagca ccgacggcat 132661 ggccagcgtg gccgcctaca cctaccgcca gcactctgtg gtcttcattg gcacgcgcag 132721 cggcagcttg aagaaggtgg cccccagagc cctgggcatg tgggggtggg gacagtctca 132781 gatatgggac aaggctggtg gtgggaacac acgggagagc tctgaggaca ggacaggaga 132841 ggacacggct ggtgcaggga gggagatttc cctggctggg tcccaggcgc ctggccctgc 132901 tcctcgggtc gccccttggc cgcccaccct caccattgcc agttcccact gtggactgaa 132961 ttgccagctt cgtatggctc tggggacaga gaacagacct attgtggagt ctccctgcca 133021 cgagagcagg cctggggctg agccagcatt gctgggtcct ctgttgaatg aacaagggag 133081 caaatgcctt gcccctcaag cctccctatg cctgggacag acccacttgc ccctgccctc 133141 tccttggctc actacttccc atccttcccc tcctacccat gggcccgggg gctgaagctg 133201 cttccattgg ggatctgggg gtagcttggc atgaaggggg agatgaactt ctgggcactc 133261 aggtacgtgt tgaataggta ggagtcgggg gactgcagtg ctctcatcct gcttgcctcg 133321 tagatgcagt gccatccccc aggagggcaa gtgttggtct gggggcgcag ggaagggttt 133381 gtggagaagg atgatgaact ctcctggccc acctgctctg ttcctgggac cctcttttcc 133441 tttgtagcag ctttgttgaa cgcgtactat ccaattcacc cattttaaag tgcacagttc 133501 agtgggtttc tgtggatctg cagagtggca caaccatcgc cactacctca ttctcctaat 133561 caatgccatc ttctagctct tcccctcccc tgcatcccct gaccttgggc aacacgaatc 133621 actccgtcct tctgtttgct gagggcattt tctggcgtct gcaccctcct ttttttggcc 133681 tccgggctgt gggtgtgagt gctgaacccc accaggtcct gacgtcagct cagccacaac 133741 ctccaaagtc tttggctgga gaagacagtg tcccaggtgc aggaggctgt ggtggcatca 133801 ggctggtctt gtggctcagg tgcgggtcga tggcttccag gatgcccacc tgtatgagac 133861 agtccccgtg gtggatggca gccccatcct ccgagacctg ctcttcagcc cggaccaccg 133921 gcacatctat ctcctgagtg agaagcaggt gggcctgtgg tgggtggcgg tggctggtgg 133981 ggcgggggtg ggctgggtgc tgatgtggct gtccccaggt gagccagctc ccggtggaga 134041 cctgtgagca gtaccagagc tgcgcagcct gcctgggctc cggggacccg cactgtggtt 134101 ggtgtgtgct gcgacacagg tgagggcggg ggcccctgct cggggagttg gagggcccca 134161 ctggccaggg ggagccagcc acacccagcc gtgcccttgc ggcctccccc cgccctcctc 134221 cactttctcc agctaatgag tggaacctcc accccgagtg acgtccctcg ggccccaggg 134281 gacgcggcca aggctcgctg ttgcctggca ctgtgagctg gacaggcctg ggccctcccg 134341 gggcagtggc gggaccggct ctggcccgac cccgtgcagg tgctgccgcg aaggggcctg 134401 tctgggcgcc tctgccccac acggctttgc tgaggagctg agcaagtgtg tccaggtgcg 134461 ggtccggccc aacaatgtgt cagtgacgtc acctggggtg caggtgagca gcttgggggt 134521 gcccggctgg gtgtgcacat gtgtgctggg agtccgccct gccctgagcc ctctgcttcc 134581 cccagctgac cgtcaccctg cacaacgtgc cagacctcag tgcgggcgtg agctgcgcct 134641 tcgaggcggc ggcggagaac gaggcggtcc tgctgccctc cggtgaactg ctctgcccct 134701 caccctccct ccaggagctc cgagctctta ccagggggca tggtcagtgg gttggggctg 134761 cccaggatgg ggcagagtgg ggcctctccc tacccccagc gagttcacgg ccaccccggg 134821 actgctgcag gggccacccg cactgtgcgg ctgcagcttc tctccaagga gacaggcgtg 134881 aggtttgccg gtgctgactt tgtcttctac aactgcagcg tcctccagtc gtgagtacct 134941 ggccagcacc cgtccctacc cctggggata gaggggacct ctgggacagg cgacaccttc 135001 agggcagctg tttctggggc atcaggggct aactgtcagg ccctgatagc ccctgacatc 135061 cccacatctc tctgtcccct gatatggctg ccttcttcct tcagttcccg atgtgtccta 135121 ttggggagca gtgaggggca gtgcagcctc ctctgttatt tctgaagcca cttccccccc 135181 aggtgcatgt cctgtgttgg cagcccttac ccctgccact ggtgtaagta ccgccacacg 135241 tgtaccagcc gcccccacga gtgctccttc caggagggca gggtccacag ccctgaggtg 135301 aggcgggcgc cgcatgtgag gggctgggct ctgtggtgcg ggcggggcca ccggcttcta 135361 tgcgttctcg gtttctttga gccttctcca ggttgggcct ggagaagggt ggccctgtgt 135421 gcagctgaca ggtgctttcc ccgcagggct gccctgagat cctgcccagt ggggacctcc 135481 tgatccccgt tggggtcatg cagcctctta ccttgcgggc taagaaccta cctcagccgc 135541 agtcgggcca gaagaactat gagtgcgtgg tgcgggtgca ggggcggcag cagcgggtgc 135601 ctgccgtgcg cttcaacagc agcagtgtgc agtgccagaa cgcctcggtg aggtcccacc 135661 cgctgcctcc cttcggggtc tgggctgtgg cgtggtgtgg atcgttcttg tctatgccca 135721 gctctctggc agggaactgt ctgagtctcc tgctagggat tcctgtacct ggaggagggg 135781 ggcagctggc aaaaatcttg ggtaggggtt ctgcacatct tatctggggt cccatgggtt 135841 cctttcctcc agtactccta tgaaggtgat gagcatggtg acaccgagct ggacttctcc 135901 gtggtctggg atggagactt ccccatagac aagcctccca gcttccgagg tgaaggcatg 135961 ggccagggag cttccctcct aaggcaattg gcactgctgg ggtcgggtct gggggcccta 136021 gtgctcccca ctttccactt ttctgccact gcctgctccg tcagcagtgc cttctgtgcc 136081 tgcagccctc ctgtacaagt gctgggcgca gcggcccagc tgtggcctct gcctcaaggc 136141 tgatccccgc ttcaactgtg gctggtgcat ctcagagcac aggtgccagc tgcggaccca 136201 ctgcccggcc ccgaagacca actggatgca cctgagccag aagggcaccc ggtgcagcca 136261 cccccgcatc acgcaggtca gcctccctca ccgcccctgc ccactgccaa cagggcccct 136321 gggagtctga gccaactctc tcactgccca tcctgctcca cagatccacc ctctcgtggg 136381 gcccaaggaa ggaggcaccc gggtcaccat cgtgggtgac aacctgggcc tcttgtcccg 136441 agaggtgggc ctgcgggtgg ctggcgtgcg ttgcaactcc attccggccg agtacatcag 136501 tgctgagagg tgagtgcggc tctgtgggtg cccgggccgt atgtggcctg gccggccctg 136561 acgctctctg agccctagga tcgtgtgtga gatggaggag tcgctggtgc ccagcccgcc 136621 gccggggccc gtggagctgt gtgtgggtga ctgttcagcc gacttccgca cgcagtcgga 136681 gcaggtctac agctttgtgg tgcgtggctg ccggccctac cccttcctgt cccttctctc 136741 tcccgcaagg ggcgtgtgga gcagcccggc ccggctcctc ccctcagggc agcttctccc 136801 gcagacccca acgtttgacc aagtgagtcc cagccgtggc ccggcgtccg ggggcacacg 136861 gcttaccatc tcaggcagct ctctggatgc tggcagcagg gtcacagtga ctgtgaggga 136921 cagcgagtgc cagtttgtaa ggtgggccgg ggccctgcca gctttgggtt gggcatcgtg 136981 tggggggccg tggggacggg tggctgaggg ccctgggcca cccgctccaa gcaccctgct 137041 tgccaatgta ggagagatgc caaggcgatc gtgtgcatct cacctctctc caccctgggc 137101 cccagccagg cccccatcac acttgccatt gaccgggcta acatctccag ccccgggctc 137161 atctacacct acactcagga ccccaccgtc acccgccttg agcccacctg gagcatcatc 137221 aagtaagacc ctgggggact ggggagcctg gcagtgtcca gagggctcag ggactgggtg 137281 gtctctgagc tccggtgggg cccctcctgg agcctaggcc ctcattggtg gagggggtgg 137341 agctgtcctg gctgcacagt acccggcacc cactaggcga tccagggtca gggaaggcct 137401 ggctgccatg gcagccttga tcgctctcca gggctgggtg ggtgcactgg aggagggagc 137461 ctgaggcccc tgccctgtcc ctagtggaag cactgccatc actgtgagtg ggacccacct 137521 gctgacggtc caggagcccc gggtccgtgc caagtaccgc ggcattgaga ccaccaatgt 137581 gagtaccagc tgccccgccc caccccgacc ctgcagccca tggcttcacg tgcctggctg 137641 tccacctttg tccccacaga catgccaagt gatcaacgac actgccatgc tgtgtaaggc 137701 ccccggcatc tttcttgggc ggccccagcc tcgggcgcaa ggcgagcacc ctgatgagtt 137761 tggcttcctg ctggaccacg tgcaaacggc ccgctccctc aaccgctcct cctttaccta 137821 ctaccctgat cccagctttg agccgctggg gccctctggc gtgctggacg tcaaaccggg 137881 ctcccacgtg gtgctgaagg tgcgggcggg gtgggggcgg ggaggggcgg gaaagtggag 137941 agtcctgggc tgaagttgtc ctccaccccc agggcaagaa cctgattccc gcggcagccg 138001 gcagctcccg cctcaactac actgtgctga taggaggcca gccgtgttcg ctcactgtct 138061 cggacacaca actcctgtgc gactcaccca gccagactgg ccggcagcct gtcatggtag 138121 gtggggatgg ggagaccccc tgggcagccc agggtgggcg tggtggtcag ctcacctcag 138181 gcctgtcccc acaggtgctg gtgggtggcc tggagttctg gctgggcacc ctgcacatct 138241 cggcagagcg ggcgctgacc ctaccggcca tgatggggct ggcggcgggg ggtgggctcc 138301 tgctgctggc catcacagcc gtgctggtgg cgtacaagcg caagactcag gacgcggacc 138361 gtaccctcaa gcgtctgcag ctgcagatgg acaacctgga gtcccgtgtg gccctggagt 138421 gcaaggaagg tgcctgaggc ggggcgggat gtggtgtgga agctggggac ctccctcctg 138481 cccactcatt ccctctctcc accccccagc ttttgcagag ctgcagacgg acatcaatga 138541 gctgactaac cacatggacg aggtgcagat ccccttcctg gactaccgga cttacgccgt 138601 gcgcgtgctc ttcccgggca tcgaggccca cccggtgctc aaggagctgg atgtgagcct 138661 ctgcctggcc tgccccccac cattcccttc agggccgccc ccaccctctg agacctcctg 138721 ctccccacag acgccaccca acgtggagaa ggccctgcgc ctcttcgggc agctgctgca 138781 cagccgcgcg ttcgtgctta ccttcatcca cacgctggag gcccagagca gcttctccat 138841 gcgcgaccgc ggcaccgtgg cctcgctcac catggtggcc ctgcagagcc ggctcgacta 138901 tgccacgggg ctgctcaagc aactgctggc cgacctcatc gagaagaacc tcgagagcaa 138961 gaaccacccc aagctgctgc tacgcaggta cctgcctcgc tctatccagg ccacaccttt 139021 gtcctgcctg ggcctgcccc ctgtccaagc cccacccctg ccaggcccga gcccccttct 139081 cttgccctag gacagagtca gtggctgaga agatgcttac caactggttc acgttcctgc 139141 tgcataagtt tctgaaggtg cgccaggtgg gtgggcggca gggaggtggt ggcagaggac 139201 cggccagtgg tcaggcaggc agccagctgt gcacctgtcc ctggctgcag gagtgtgctg 139261 gggagcctct cttcctgctt tactgtgcca tcaagcagca gatggagaag ggccccattg 139321 atgccatcac gggcgaggca cgatactccc tgagcgagga caagctcatc cgtcagcaga 139381 tcgactacaa gacactggtg agcgcagggc caggcgggcc agaggtaggg gcgcagagag 139441 aggcctcgcc ccagactgac actggagtcc gctttcccct cagacccttc actgcgtgtg 139501 tccggagaac gagggcagcg cccaggtccc agtgaaggtt ctcaactgtg acagcatcac 139561 ccaggccaaa gataagctgc tggacactgt gtacaagggc attccgtact cccagcgtcc 139621 caaagctgag gacatggacc tgggtgaggt ccccaccctc tcctcctggc tcccactcat 139681 accctcctgt gccgtgtgac ctccgtgggc tctcccgctc cccacctgaa ccctgtttcc 139741 taagaggagt gggccaggca cttgggggct gccagcataa tcttaagggc tgtctgtggc 139801 ccacagagtg gcgccagggc cgcatgactc gcatcatcct ccaggatgag gatgtcacca 139861 ccaagatcga gtgtgactgg aagaggctca actcactggc ccactaccag gtgaggggtt 139921 ggggcccatt cctccccagg gccacctggg agccaggacc ctgccttgag ctgcagcagg 139981 acacgggagc agtggccctg gctcccctcg cccttctccc tgggctcgtg ggctcccctc 140041 ctgggtgtgg gtgggtgggg gcgccttggc ttctcagctt cctgcactgc ccccctctgt 140101 ctctggggcc tccaggtgac agacggttcc ttggtggcat tggtgcccaa acaagtgtct 140161 gcctataaca tggccaactc cttcaccttc acccgctccc tcagccgcta cggtaggtgt 140221 cctcagtgtg gtggccatgt gcccttcgag ggaaccccca cttccaagtg ccatcgattc 140281 tgtagagtgt agacggaggg tcggccagcg agggcagatg ggtccccaca tgctgctgag 140341 ctcccgggag agtggggcag gggccaggtg gtggcctaag ggtcacatgc attctctgct 140401 ccagagagct tgctccgcac ggccagcagc cctgatagcc tccgctcacg ggcacccatg 140461 attacgcctg accaggagac aggcaccaaa ttgtggcacc tggtgaaaaa ccacgaccat 140521 gccgaccatc gcgaggggga ccgtggcagc aagatggtct ccgagatcta cctgacacgg 140581 ctgctggcca ccaaggtatg ggcctgcctc tcgccacctt ggcctccgcc cagaggttgt 140641 cccggcctta caggggaggg gccatggatc atttgcttcc agtgggcctt tcttcgggtc 140701 tttgctgtgg gaactctgag ccttagataa aggccatgtc tgtctgagag accactggcc 140761 ctgtagggaa gcccctctct ttgccaagcc tttctgcctc catctgtcct gctctgggga 140821 catcccgacc tggctccctc atagcctgtg acctctctgc cccacacagg gcacactgca 140881 gaagttcgtg gatgacctct ttgagacagt gttcagcaca gcccaccggg gctcggccct 140941 gcccctggcc atcaagtaca tgttcgactt cctggatgag caggcggacc agcgccagat 141001 cagcgacccc gatgtgcgcc acacctggaa gagcaactgg tatcaccccg tgctgggctg 141061 ccagcagcct gtctggagac tggtgggcgg aagacgctgg tggctctgct gagacccagg 141121 gcctggagta gcatccaggc aggagggaat gaggcctggc ctggggttga cactgacggt 141181 gccatcaagc caaggggcct gttgcgtcca gttctggagc tggactgcct gggtgggtgg 141241 gatggtggtg gtgaggcctt tgccctctgg ggtcttgcac caggtagaga tgctggtggt 141301 cggtcgcagg tggtgagagc ggtgacagga gaagctgcgg agggggttga gccctattga 141361 gaacagaagt tcagcagaga cttagaggaa aggccgttca ccctaaatgt gttgctctca 141421 ccacctctgg acaagatgtg agccggccgg acaggaaaga gatgtgcaca tgggaggggc 141481 cccttggggc taactaggga gcctcaggcg cacgtccctc tgttgtccac agcctgccgc 141541 tgcgcttctg ggtgaatgtg atcaagaacc cgcagttcgt gttcgacatc cacaagaaca 141601 gcatcacgga tgcctgcctg tcggtggtag cccagacctt catggactcc tgctctacat 141661 ccgagcaccg cctggggaag gactcgccct ccaacaaact gctctacgcc aaggacatcc 141721 ccaactacaa gagctgggtg gagaggtggg ctccgcccgc tgtgggtggc agagggcagg 141781 acctgcccct ccctgggctg cctgtgcgag gcaggccagc tacaggcagg aagcccgggg 141841 gctggctggg cagtgagggc aggggttcta gtttttgtga tgtcgcctga gtgacatact 141901 cagggtccca acaggtatta tcgagacatt gcaaagatgg catccatcag cgaccaggac 141961 atggatgcct acctggtgga gcagtcccgc ctccacgcca gcgacttcag cgtcctgagt 142021 gcgctcaacg agctgtattt ctatgtcacc aagtaccgcc aggaggtgtg tgtcatcccc 142081 acagactccc agttacttgg tcctgaaggt gcagccagtg acaaaggcag gagggttggt 142141 gggggttggg gattgttcac ggtctctccc ctgaccaggg cctggggccg gcacaggtga 142201 tgctctctgg gatcacaggt agatgctgga atttgaggga cccaggagat aagccactgg 142261 ctgggtacaa agtggccact gggcaaaagt taaagggcct gagcactgga agggatgggt 142321 caggctttgt gatgagaaat agccagaggc aggaggacat agaggtgtgg actttaggat 142381 ggagtctggg ctggacacat ttgcaagcct gtggatggca tttaaagccc tgagcctgga 142441 tgagctcaca gggaagccac aagtgggtgt ggagggagaa gcggcccaaa cactgagccc 142501 agggccccag gacacaaaga ggggagggga ggggaggtgc agcagccagc ggggaggatg 142561 cctgagagcg cctggtgggg tctggcaacc tgagccagtg aggacccctc acaggtcagg 142621 gagtggcagt ggccggctcc atcttccagt gctgctgctg gtgaaggatg gtgacctgca 142681 gaacactcac tgggtctggc gagggtcacg gggtcctggc gcgagggagg cttgggctag 142741 gaggtaatgg gagggggaat ccagcacggc gagtccagag tccaggtcac tctagggaca 142801 ggccgcaccc tagctgtgga gaggaggagg agaaagtgct gtcgtcggag gggcatttgg 142861 ggctggggag agctggtggg tttgtacagg ggctgcgggg gaggacggag ggatgaggcg 142921 tgtgggaggg aggagcccag ctgggaagtg gggactttgc atggcaggag ctgcgccccc 142981 tacagagccc agctccaggg gactcctctc ccccagattc tcacggctct ggaccgagat 143041 gcctcttgtc ggaagcataa gttgcggcag aaactggaac agatcatcag cctcgtgtcc 143101 agcgacagct aaggtggtgg aatcggtgag gagggggctt ctcagtcctg tgccgtcctc 143161 ccatccaggg gagtggctgg ctcaagcctg ggtccccggg ctgagccctg gattgggtat 143221 cgtggggcag gtcaccctgg ccacgatgcc cccggcacac ccaggccccc ttcattagtg 143281 ccttgctttg ggccctgcag ggggaggggt gacagggcga gcccccaccc cagcagcagc 143341 aataccccca ccctcctgcc ctgtgcccag gtgttgggac agtcccaccc tccctgctat 143401 ttatatccct ctgcctattt attgaatcga acttcgcctc tgtctccatc tgtaaatatg 143461 tgtcccccca ccggatgtcg ccaccctcac tcacctgcct cttcttgagc tgtcctgggc 143521 cctgccaccc gtctgggctc ctttgtgtag cattatcagc ctcggtctgg cctctggcac 143581 ctcacccttg ccatggctga ccccacccat tccaaggcgg ggtcacggta ccagcagcac 143641 ttggggtgag gcctccaaag cttcctcaga attgtggctg tgccacgctg gaccacaggg 143701 tccccctcaa gcatctcggg gccctattct ctctgagcac ctggagggct ggactcaggc 143761 ttgtgccagg gcctgacttg ggcctggggg ccctagaaca ctcctcctcc tgagcctact 143821 gccaaacgtc ctcagtgttg tctgcacctg ctccgactcc ttcagccgcc ccattcagcg 143881 cccgctccgt ccagtgcccg ccctgtgggg ccaaggcggc cgtgccttac tactctgtgt 143941 cttctgcctc ctctgaggaa tctggccctg tctgacagtc ccagaccccc cgttctctcc 144001 tctttagttg catgagtttt tctttgttca tggaatgttt tttcctgatt aaatgttggg 144061 gaaatgccat ccatgtgggg ctctgtcctt ggggcaggtc agtagacctg tggatgctgc 144121 atgagctgga gccccattgt gggagaaagg accactcctt gcagggtccc agagggtttg 144181 gggaggttca gggccatacc cgacccatca gcaaaccccg gtggctctac cttcaagaca 144241 gattcaaggt agatagattc tcctcccttc caccaccccc acccagctct cctgtgccat 144301 cgtctatctc tgcctggatg ccagtctccc tgcttctgcc ctggctctct tcagcctgtt 144361 ctcagcgtgg gaggcagaga gatcctgtta tgatataagg tggatcatgt tgcctcactg 144421 tttagaaccc tccggcggtt tcagctgcag ccagctcctt caaaacctga caattgcata 144481 ctgtcgaggt tgtggcaccc atcatggtgc acctaccaaa gtcttactcc caggcctgtt 144541 aagtaatgca ggtggggaga aggtgacatg caatttattt atttatttat tttgagacag 144601 agtgtagctg tgttgcccag gctggagtgc agtggcaaga tcacagctca gtgcagtctc 144661 aaactcctgg gctcaagtga tcctcccacc tcagcctcct gaatagctgg gtgtattagt 144721 gcactctact atgccaggct tttttttttt tttttttgag ataggttgcc gctatgttgt 144781 tcaggctggt cttgaattcc tggtctcaag tgatcctcac ccctcagcct cccaaagtat 144841 tgggattaca ggcatgagcc actgtgcccg gccaggtgag atggaatctt atttttactt 144901 tttatgtttt tttttttttc tttgacggag tgtcgctctg tcgcccaggc tggagtgcag 144961 tggcatgatc tcggctcact gcaagctccg cctcccaggt tcacgccatt ctcctgcctc 145021 agcctcccga atagctggaa ctacaggtgc ccaccaccac gcccggctaa ttttttgtat 145081 ttttagtaga gacggggttt tactgtgtta gccaggatgg tctcaatctc ctgacctcgt 145141 gatccacctg cctcagcctc ccaaagtgct gggattacag gcgtgagcca ccatgcctga 145201 cctttttttt tttttttttt tttttttttt ggagacggga tcttgccctg ttgcccacgc 145261 tggagtgcag tggcaaaatc aactcactgc agccttgatg tcctgggctc aagtgatctt 145321 cctgcctcag ccttctgagt agctaggatt acaagtgctt gccaccatac ccagctgttt 145381 tttttttttt ttttttttct ttagtagaga tggagtcttg ctggttttga acctcctggg 145441 ctcaagtgat cctcctgttt tggcctccca aaattctggg attacagttg tgagccactg 145501 tgcccagcca agtgagatgc aatttttttt tttttttttt tgagacagag tttcactctt 145561 gttgcccagg ctggagtgca atggtatgat cttggctcac cgcaacctcc gcctcccggg 145621 ttcaagtgat tctcctgcct cagcctcctg agtagctgag attacaggca tgtgccacca 145681 cgcccagcta attttgtatt tttagtagag acagggtttc tccatgttgg tcaggctggt 145741 ctcgagctcc cgacctcagg tgatccgccc gcctcagcct cccaaagtgc tgggattaca 145801 ggcatgagcc actgttcctg gcctttcttc tttttttgtt tttgtttttt tttgagatgc 145861 aattttaaat agggtggctt ggacagccac ctgagttggg tttaaaagct gccagctcca 145921 gccgtccagc tgtggggcca cggtgccaga cagtggtggt ggttatagga ccgtaggtgt 145981 ttaccaaaac tagacaagtc cacataacaa ccaactgcac aaataaggag taaaataata 146041 aagagaagga cgtgtgaggc aggactggat tgtagcccag gataaagaga agttgcggga 146101 tgacagctgt gccgtcggcc cagacagaac catctggaca gaggacagag gataagggag 146161 atggttgaaa atactaccat catatgataa tgacaactaa gccacaggaa aacaaggcag 146221 ttttaacctc aggggaaatc cctcagaaat ggatggaatc taccagaagc atcatgagag 146281 gaagaggaag ggaaacagag ctaaattggg gccaggtgtg gtggctcgca cttgtcatcc 146341 cagcactttt ggaggcccag gcgggaagat cactggagcc caggagttca ggaccagcct 146401 gggcaacaaa gcgagatccc ctctctacaa aaaataaaaa actagccagg catggtggtg 146461 catgcctgtg gtcccagtta ctcaggaggc tgaggtgctg gggaacaagg ctgcagtgag 146521 ctgtgattgt gccagcctag gtgacagagc cagaccctgt atcaaaaaaa ataataataa 146581 ttaaaaaaca aaagatcggc tgggcgcggt ggctcatgcc tgtaatccca gcactttggg 146641 aggctgaggt gggtggacca cctgaggtcg ggagttcgag accagcctca ccaacatgga 146701 gaaaccctgt gtctactaaa aatacaaaat tagctgggcg tggtggtgca tgcctgtaat 146761 tctagctact tgggaggctg aggcaggaga attcgcttga acccgggaag tggaggttgc 146821 ggtgagccaa gatcgcacca ttgcactcca gcctgggcaa caagagtgaa acactgtctc 146881 aaaaaaaaaa aaaaaaaagt gaaagatcta aattgtcaac tattcgtcaa taattaatgt 146941 ctaaaaaata ttaagaaagg aagaaaggaa tgagagcaaa agagagggag agaggaaagt 147001 gtcatcttca taagctgaag gccagtgcca ccgtcaccac tggcagagct ccccagattc 147061 tacaaaggga aactgttcct ccatatggag taatagggac tggatatgtc atgccaaatg 147121 aaagaacacg ccaaaccaga caaaatatat ggaatgcaga gtattaaaaa actggctgtg 147181 ggccagtgag ggacagcgac gtgtgggagc tgggaaacaa ggtaagccat gcgactggcc 147241 cagtgtcctg tgtggagttt cctggtggag cctggcagac acccggactt taggagttga 147301 aactgattca gagcgtccca gcagctggag tttgcagggc ggagaaccag agagaagaga 147361 gctgcacagg gaaagaaccc tggatatcca cagaggctcc cgttccagtg tgcagcggac 147421 tacgactact ggtgagcaca ggcgtgtgag gaagctaccc aagaccaggg aaggccatca 147481 ccccagcagg agaggatgaa cggtaactcc agtgtgcaag caggacaggg actagcagct 147541 gtccccaccc accaggatag agaagctcat gttcacctag gcagcactgg ccagggtgcc 147601 cagaatggtg tttgttgcct ccatagaggg aaacaagcct acagttaaga gctgctctgg 147661 tcccacacag cacatcttca aagcaagact caaaaggatc aaaccatttc cttgttttct 147721 gagaataaaa gcagttgtac ctcttccttt tcactctcag aaaatctgat agaatctata 147781 aaagagccct aaggctggag acactgtatt ttccagggag ccaactcctt gccccagaca 147841 ctagtcaatt gccccagcac tttcttcact gatcgttaga ggaaaaaagc tcatgttaga 147901 tttaaaaatc aggaaaaaaa tttaacacta aacttgcaaa tattatgtgg tgtatattac 147961 agttacacat ttactatgta agtgcaactg taaaactgag gaagtattcc aaaatgttga 148021 cagtagttaa tcctgaatga cagaacccaa ttgattttca ttttcttcca tttcctaaat 148081 tgtcttcaat ttccttgtac tccttgaata tctgaattct agaacccccc cccagcaacc 148141 caagcacaac aagaatattg ggaggtgccc cagctgcaaa taaactcaga cttggaatag 148201 tagcgcagtt ttattttctg tagtaacaaa catttaagaa ccagataaaa catgaggccc 148261 tcccaggaaa gtagcaactg tgggaattcc tgccctaggg aaggatggac gcactgccta 148321 caaggagacg caaagtggga cctcgctcca tttgcccagg ccaggcttag cgggaaacgg 148381 ggggcccaaa gcgctgcatg gtccgcacca ccagggaaag ctggtcaaga aagttgatga 148441 cggaaattcg gagcaggcga cagtcttcag ctttccagcg gctaaaagtg aggggaaaag 148501 aaaatcaaat tggaggtggc aactcctggt ccctcgctct gcccctcagg cacccaatat 148561 agcaaaccct tgggagccct gccattacac agcccctcag gcagagaccg gcccagcgca 148621 cctggcgctc cttagggctc atccttggat gtcccccatg tctccatggg ccgcctcccc 148681 caaccccgtc cctttcagaa cttacacgac caggatcctg ccactcactg tgagatcctt 148741 cccaaccacc ctttggtggg gctcggcatc tggtgccagg gacccatggg cgatttccgc 148801 ctccaagggg gtcgggaaag gcacgctgag ggtgctgggg gtcattcggt taaggtgcca 148861 tctgatacga tcttgcctat gtctcaggcc ttcttgctcc tctgaagccc cctctcaggt 148921 caccctgcct agtgtcaccc ccttcactcc ttacctcgcc caggccaccg ttagcagagc 148981 ttctccctcc atttcaatgt cttcccctcc tccgaccacc tctgaccgta tacgccagcc 149041 ccggtccccc cctgctaccc tttccgtcgg ccccccgccc gccctccttt ccttctcccc 149101 gagtccagcc ctcgtccgcc gcaagcagcc cccagctcgg aaaggataca atatgtgcgg 149161 ccgcattcgt gaccccctgg cgcagacgcg gcgtctctgc ccggacctgg cgcgtgcgct 149221 gggggagctc caccggccgg agctgcggct gtgtccacgc ccccgcggca gctgtggcca 149281 ccccggccat ccccgccgtc agcgcctccg cctgcgtctg catccgcgtc ccgcatgacc 149341 gccgccgcgc cgctccgact ccacccccga agcgcaggtc ctacgccccg ccctctctgt 149401 ggctccttcc cgaagccccg ccccctgcgc acgactccgc ccacacgcgc ctgcgcaggt 149461 ccctggagag cctgactggc gcgtggtcag ttcccgccaa gcggccctgc cgggggcctt 149521 ctgagacccg gtcagcggtt gagaggctgc ggtttcctcc agaaactctg ccctttcgcg 149581 cgtaactcga ttccagagcg ctggtgcaaa ctgacccaca gtcggtccgg ccccaggaag 149641 ccaaccctga cggggctttt gaagacaccg tggtcgccta tcccagaggc gcaccccgcg 149701 aggctccgcc cctaagcgcg ccttgtgacc ttagcgcgcc tgcgctggcg gggccttccc 149761 ccttgtccct ggaggcccca gtgaggccct gcggtgcctg gacggcttcc cgccccgttt 149821 cttacctgga cccaagagcc ccaggagatg tggcacaacg tccatgttgc gattcgcaga 149881 gggagggtgg ccgggtcggg taacccccaa ggatactgct cttgccctca accccagtgc 149941 ccaaccaacg gggcttcacc atggactgtg aagcatcgga tgtgcatttt agaggaagca 150001 ttttagagcg ggtgggctgt gcagagcggg ggagcctgtc agcaggaggc ctgcacggag 150061 gcgaaaaatg acagggcctt gaatgaaagt actggcggtg ggtctaggtc taatataacg 150121 tgaaaaatca gactgcacaa ctatgagtat gaaaatgcca gtttaggcca ggcaaagtgg 150181 cccccgcttg gaatcccagc actttggaag gccaagggag gaggatcact tgagcccagg 150241 agttcaagac aaacacaggc aacatagtga gaccccgtct ctacaaaaaa tacaaaaata 150301 aattttaaaa gttagctggg agtggtggag cgtgcctgta gtcatagctt cttggaggct 150361 gaggagggtt gattgcttga gcccgggagt tcgacaccag cctgggcaac atagtgaaac 150421 cctgcctcta caaaaaatac aaaaattagc tgggtgtgat ggtgcacacc tgtggtccca 150481 gctactaggg aggctgagat gggaggacag cctgagcccg ggaggcagag gctgcagtta 150541 gccgtgatcg ccccaccgca ctccagcctg gttgatagag ccagaccccg cctcaaaaaa 150601 taaaagaaaa tgccaactta gtctcttact gtacatatgt agagaaatcc tagaacaata 150661 cagcgtcagg taaatcctca tgatctctgc taatagtatg gtgggtgact ttaatttcct 150721 ttttgtgttt tccctccaat ttttccgaag tggtctttta tgacatttac aataaaaaaa 150781 aaaagtattt aaagtgacat cagtccgggc acggtggctc acgcctataa tcccagtact 150841 ttgggaggcc gaggcgggtg gatcacttga agtcaggagt tcaacaccat cctggccaac 150901 atggtgaaac ccccggccct actaaaaata acaaaaatta gctgcgtgtc gtggtgtgcg 150961 cctgtagtcc cagctattcg ggaggctggg acagaagaat cacttgaacc cgggaggtgg 151021 aggttgtggt gagccgacat cacgccactg cactccagcc tgggcgacag agggagactg 151081 tctcaaaaag aaaaataata aaaataaaaa atgaataaat aaagtgacat aaaatttcca 151141 acagtgtttg tcatgtttgg acaacctatt ttcctccttt gcagtcataa ggaaggatta 151201 aacactgaac tatatcagca aatcctgttg ggtttccctc tgaaataggt tccacatcag 151261 cccacttctc actctccact gctagtctgc tgtggaagcc agcatctttc cttgacctcc 151321 ctcaacagct tcctggctga ttctctttta cactttcccc tctacaacct attcaccaca 151381 ccgcagaatg actttttaat ttctaattta aattgcatgg ggctgggcgc agtggctcat 151441 gcctgtaatc ccagcacttt gggaggccaa ggcaggcaga tcacttgagg tcaggagttc 151501 cagaccagcc tgaccaacat ggtgaaaccc catctctact aaaaatacaa aaattagcca 151561 gttgtggtgg caggagcttg ttatcccagc tatttgggag gctgagacag gagaatggct 151621 tgaacccggg aggtggaggt tgtggtgagc ccagatcgca ccattgcact ccagcctggg 151681 caacaagagc aaaactctgt ctcaaaaaaa aaaaaaaaaa aaattagccg gctgtgacag 151741 tgtgtgcctg cggtcccagc taccagagag gctgacgcag gaggattgct tgagcccagg 151801 aagtcaaggc tacagtgagc tatgatctag ccactgcact ctgcacaagt aacagactga 151861 gactctgtct caatcaaaca aacaaacaaa aaagtccatg atattcagga aagaaagcag 151921 agataaagga aaggttttta tagggaaatg gcaagtaata gctgtagaaa aaactatcag 151981 aaaattacca tctttgcaac caccaatgta ataattaatt gttattaact taataattga 152041 ttcaggatgg ccaggtgcag tggctcacgc ctgtaattcc agcactttgg gaggccgagg 152101 cgagtggatt gtgaggtcag gagtttgaga ccagcctggc caacacagtg aaatcccatc 152161 tctactaaaa gtacaaaaaa aattagccgg ctgtggtggc atgcatctgt agtctcagct 152221 actcaggagg ctgaggcggg agaatctctt gaacccagga ggcagagatt gcagtgagcc 152281 gagaccacgc cattgcactc cagcctgggt gacagagtga ggctccgtct cacaaaaaaa 152341 aaaaaatcga ttcaggcaag gaattgtcgc tggatggcaa aagcattagc tgaaaggttg 152401 ttggggacag gatattcaca aagtccccaa atattacccc aaattacttg tgaatcagga 152461 agggaaactg cgtctttaca ggggagagat ctggcagcag ccacctggat cacgtgatca 152521 aactgagcgt cctcaacagt ggggcagcct gaccctgcca gctgcctgat gaggggctgg 152581 gggtgagtgt ggtcagagga ggaaaggata cagtagcagc ttccctggat gcttccaaaa 152641 atattcttgc ctgattgcct atggagaatg gctagttggg agaggcaaag atggcctcct 152701 gctccctcag aagcctggct gagtggcatt gccagtcctc cagccgggcc gtggagaaca 152761 cgtgtgtgct gaggagccgg ccagtgctgg acgtgagtgc caagagggaa gggactgtgt 152821 cttccagtct cagggttgcc agcacctggg acacaagggg gcactcagtg atgagcaggg 152881 tgtcgtctgc agcaaagtca ggcgctgtct ggaagacatg ggcaggagct caggagagag 152941 ggccgggatg agacatagac atggcagtga ggcacagaaa tagaaagcaa gctggatgag 153001 ctccccgagg agtgatacag agggaagaga aggggctcag ggacagagcc ctggggcacc 153061 aacattgagg ggtggggtgt gtgtgaagaa ccagcaagag agcctgagta ggagcaagtg 153121 gtggaaacag cgggagagca aaggcagaag agggtgagga atgaggactg ccggccattc 153181 agtggggccc tgaaggtgta tgcaggccgc tccagctggg ccctggagga ggacagggca 153241 gttgctgtgt gtctctcatc tctcctgaag gcttttcagg gctgggaggc tcaaggctgg 153301 ccacctctgc tgcatctact cctgggagta ccaggcccgg gcagatctat tggagctggt 153361 gaggctcaca tgggacacca gctgatccat tgccagggaa cctccccacc ccaaggcaaa 153421 tgctgctgtc ctctgaggcc gctcttcctc ccgtgggccc ttgggaaacg acaccaaaag 153481 cacctcatcc cagctgggcc cttcactcat gctgctatct cctgatgtca ttgtagtcac 153541 accatggccg caggactcag atactgtccc gaggcccctc cctgaggggc aggtggtggg 153601 tctcagggcc cttcaccggg cagggggcag tggatgactc ggtgctacct ctcagggcgg 153661 acaattggct cagaggggcg cagcacctgc agtaaacatt ctgacccctg ttaaaggatc 153721 tcagcattgg gggggctgca gcctgtggct gctccatctt catgcagaat ccccagtgtc 153781 ctctaacgat ccccccactt cctgtctccc atccctgcag tgtttcctct ctctggtgct 153841 ttccctaacc caagcctcag aaaacgcttt ccctgtgtga tcttgacgcc tccagccctc 153901 tcctccacta cctgcgaggt ggccctgatc tgacacatgc atggcccctg tagtggacag 153961 ctgccatttg cggccaccat tgaggatatc cattcatcca gccctccccc agagccttct 154021 gtcccttgca tagcccatgt gtgtccaggg aagtggcccc atgcctagct ccagggagcc 154081 agccctgacg gagaattacc ccctacccca gccaaaatga ttggacgcac aatccagttc 154141 tgaccaacga gaagagtgtt tatttgctgg gggcatctga gacagcagag gtggtacaca 154201 gagaagatgc tgctctggca gtgagggagg gattccgtgg aagctgcagg cagccatctt 154261 ttcatcctcg gaggtggtgg cctgggatgt agcagatacc aacgcagagg gaggcagaag 154321 acactgggtg aaattgctca cctgctgcca atgccaatac cactggactt ttcactttca 154381 taagccaaga aatttccgtt aactgattaa acaaattgcc tttattgtta aaccaatgat 154441 tggatatcag aaacagccta gctaatatag ctccttttgg ataaaagtag aagcctcaca 154501 ttcaccttta atgttatttc actttttcca aaatcaattt cctggcacca cacaaacaag 154561 aacactggta aatgtatcac ctgaactgca ggccacacat gctcacgagt gtgtccctgc 154621 cgtagtggct ttgaagccat atacatggct ctccagctct cgtggggggc caggggcttc 154681 tgtgtgctct tcaaagcctt cacccattgg gcctctgtcc atctttccag cctggggacc 154741 caatatttgc cccacaaggc ccaggcagga gggcctccag cacgcgcagc cctgaccaca 154801 ctgtgagcag ctcacttggc aagggtgaag ggccggagtg gcgtcctcct gatgtggctt 154861 ccttccagcc tctaccaccc ctgggggcct cctccaagga gcaggtggtt agatgtcctc 154921 ttgtgcccac acacacggca gggtggagga ggatgaaccg gaacccagcc catgcttctg 154981 cccccaccct ggcaccccaa ggtttcttct caggcaacaa ggagcacaca aagtgagccc 155041 ctgtacacag gtggctggga tatgttcatc ttccacaggg aggccagtgc tggggatgag 155101 gacaaaggcc tacgcccagt cccagctctg acactgcctt gctgagtcac ccatcagccc 155161 taacgcttcc tggacgagac aagcagatag ggtgccccag agaggagctg tgtcactaag 155221 ggccaggaca ccccatctct gggtcacgct tggccacggc ttcatttttg gcttcagttt 155281 gtttactgaa cagaagggag aggggccctg tgagacagag gacactgact caggtcggtg 155341 agtggccctc actgtataga gtccatggct ggggaccaag tggcactgag actcgcccaa 155401 agtcacatca cagagctggg gtgccttgct gactccctgg ccccaatctg cttttccatt 155461 tgtcgaacgc acactggcac actggctgca gcagctgcgg gggtcggggg gttccagggc 155521 tggctgtcac tcggcctcct gcagaggtaa gctgagggct atgttggacc cgccagaagt 155581 caggtgacag ctgctttccc tttcaggatc gggagggagt cacctgaggg tatattctca 155641 gtgggctaaa catgacatgc gagagcccag ctctccaagc ggccccttcc tccttctcac 155701 accaaggcac tttcagaagc cgctggattc cgactggggc tgtgccactt catcgccggt 155761 gcctctgctg tgccggctct cctcttctga gggtgtggaa gcaaacccag gtcaagctgt 155821 gaggagtgcc tagcacctgg ccagagccag gtacatgcca gctatgacga tgatgagccc 155881 aactgcaaca ggagaacaca cttagtgcga catgcagcgg tagcctggcg ttgggcacct 155941 ccccatgctc cgagaattct atttggagaa gcccttctcc attgtctcag tcacttcagg 156001 gtgcaggaag cggctggcca accgttcgat gtcgtccagc gtcaggcgac tcagggacct 156061 ctcgtaatcc tggggagaga agggcaggaa gattcagggg ctgccagtgg ggggggggca 156121 tacactgcaa cagaggtagg ggctgggagc aactgggcct gaagaggatg cccagggtta 156181 cccttctcac cctctgtagc tgttccagga ccctgctggc atctgccgca ctgaagtggc 156241 gggccaagac tttggagatc agctgccaga cctgcggggg tggggagtcg gccagcctct 156301 gggcctcgcc ttcttctagt agcaccttct ccaggggttt gaccactagg ttgagcttgg 156361 agttgggccc gatgctataa tccgagagtc gtttcccatc tgccaaagac agatcgatgg 156421 gttcctcctc ctccaccgcc gtgggggtgg cccggggcgc gaggccgcct cgggcgcggg 156481 agcggagcgc tgcggcatcc ggctgtgagg catgcggccg cctcccttgg ggtcatgtgg 156541 cgcttggcgc ccgcaccccg acccgacaca cggctgctgg cctcggcccg cgcaccccgg 156601 tcccgcttcc tgcggggctc cccgggcgtc tcctctccct gggtacctgc cagggccttg 156661 cccttgaaca gcagccgctg ctggcgcact gggacgttca gcttctcgga gaccagctgc 156721 ttcagcgtgg acaccagctc gtcctctggc acctggccgc gcggagggtg ggaggcaagg 156781 gttgagcccg cggcccccag acggcgaagc ccacgcgtcc gcccccgagg gcagctctcc 156841 cggctcccgg ccctcggccc ggcctccccg tgcccgccct cggccctgcc ggtccccctg 156901 ccaggagcgg gcgagagagc gcgccggacc cggcggcccg gcccggggac cctacctgca 156961 ggctgcactc gcggccctgc agcgccttca ccgtcagctg catggcggtc gcgacggcgt 157021 ccactcgggc cggcgcgcac cccaaccacc ccccgccgcg cgccgccgcc ccgggcgcgc 157081 gctggaaccg cccgccgccg ccggaagcag cgagaggggc gcgcccgccg cccgcgcttc 157141 ccggccccgc cctcctgccc cgtccgcgcc ccggccccgc cccccggccc aattgcgccc 157201 cgggaggagt ccccccgggg tcccgcctag cacgcgcgca tcgaccgccc tggccggcgg 157261 acccctccca aagcaacgac gcgggggagt tgggagcagg aaccagtcgg gccaggttgg 157321 cgtcatctgg gagctgggac atggagaagg gagggccaga aggtacgctc gcccccgcgg 157381 actgcctggc ccaccccacg cgccaggact gctggcggcg cctggccctg catacccaca 157441 gacggcccgc ccccgctccc tgtctggtgg cttcctacct cctcctttct ggtctggctg 157501 cgaaacgctc cactgagctt gagagggggc aggggggcta cgcctctgcc ctcatcctcc 157561 cacctctccc agagtgggga cacccccggg gagggctgat agggaaagcc tatgcagggg 157621 tcaccgtgtc acccaagctg gccctccacc ctggtccaca cttacccagc acactctttc 157681 tacccccacg cccaggcccc aaagggcaaa gagtagaagg gacccaagct tgcttctctc 157741 ttttattgaa atatattttc tgggcagcgc ccacctacct aactcagcat ggcctctttg 157801 gcactgaaag ctggagaata aaaaatcagt aaaaaaaata agcctggatt tttctgagtg 157861 catagtgcat gagaactttg gtggagtgtg ggggctgggg ctgatgaaag cttgacccag 157921 aggcctcagg gaactgggaa caggctgctg tagatgaagt ggccaatgac caaggccagc 157981 atctcggagg tgccgctcag cgccacaatg aagggggcct gggaggcata gtcagcttga 158041 aggcggcgga gggatagctg cagcatggcc aaggccagca ggctgttctg cacccctacc 158101 tcaatgctga ccgtccgccg ctgggccact ggcagcttca gacacgtggc taggcagtag 158161 cccaccaaca ggccaaccag gggcaccgtg atacccacca gtacgatggg tagccggatg 158221 cctgccagga tgaagacccc catgcgatag gccaggaaga ggccgcccag gaggagcaca 158281 aagctgaagg gcttgacgac ctgcagcagc agctgggaga acttggggag cttggacttg 158341 atcagcacgc ccacggctat ggggatggca atgaacagca gggtccccag gatcttggag 158401 atgggcacgt ggagcgtctc atggatgctg agcaggcggc tgtagatggc cgaagacaga 158461 ggcaagaagc cagtggcagc caccgtagag aggaaagtca tggagatggc cagggtgacg 158521 tcccctccaa gaaggaggct gaagaggtag ctccccccgc cgccaggcga cgagcaggtg 158581 atgatgaggc ccagagccag ggccttgggc agcatgaaga ccttggccat gaggaaagcg 158641 tacaagggca tgaccagaaa ctggcccagg aggcccagca gcatgggctg ggggctctgc 158701 atgagcccct tcagaacctc gagttccact ttgcacccaa acgaacactt gttgacaaag 158761 ataagaggca ggagcaggta gaggattggg ttttccgaga agtgggccag gtcggcgctg 158821 agggtggcag gcgtgtcttc agcaggtgag accttgatgc agaagtctct ccgctcctca 158881 atcagtgtgg gcggggcctc atgggcgtcc acgagctgga tgtggagtgg ggccagccca 158941 gccaggcctg agtggatgct caccacaaag ccacccccgc ctccccaggt tatagcactc 159001 acgttcttga tggtcagcac ctctgtgtcc agggaggtga ccctgagcat ggggccaggc 159061 gccgtcctgt tggcctggcc tgggtactgg ctggagatca cgatgatgcc ctcactgtcc 159121 tcaggaaact caaactccat cacagagcca tctccaatgc tcaagtagcg gcccccagtc 159181 ggtggcacgg tgtgaccccc agcagtgctg aggctggtgc tggctgtccc ttgggccccc 159241 catggcaggc tgatgagcag cagggcagct ctgagcatgc ttaagggacc tgtgccacca 159301 ccctcgcccc ccagaccagg ccactgctga gagctgccct tgtcctgcat taacaccatg 159361 gctcttcctg gaggacggat ggcgtgccca ggccctctgc ggcttaggag aacatcccca 159421 ctggtgctgt tgggttgtcc tgagaagggt tcatgggtgg gtcccagtcc tgtgctgtct 159481 tccttgatgg cagggctgtc cctgaggagg taagggacac agagaggcag ctggtgaggg 159541 tcacttaccc tggaggcctt ggtccagcca tgtgtcttac ctctgaagag ggaggcagtc 159601 aagaacccag gagcatgggg atctggaagc ccactgctaa gagtaaaact cagcagggac 159661 ccttgcccca cactcaggaa tggagaggca atgcctagtg atggcccttg agtgccattc 159721 agctgtggct ccctggccca ctctacctgc ccctacagtg actcgatgaa ggtcctgctg 159781 ggaagcaggt cctccctccc cagggaagtt gccagaaggg gcggcaggag tggatcagac 159841 agagccacat gtctactggg ggggtttctc ctccttttgt cagagctccc actgggctga 159901 gactttgagc tcccagagcc actgcgttac atctgagagg ctgtgctcag agtgcagggc 159961 tccttcccat gcccacatgc gcccctacca gagcctcaaa ctcagtggaa ttatgagtgt 160021 ttctagcggc cagcaacacc cccttccact tctcaatctg ggcagcgacc accgtagtgt 160081 cacatcacct gctagaactg gggcaaccag gatggagaca caacagcttt acatcacccc 160141 tgtcagttgc ctcatgcact gttgtgtccc agtcctagaa gaggactgcc tctatagatc 160201 ctatccgtct ttggcattgg cctcccgtca tctgagactg gggtgaggca gaagtaggga 160261 gcagaggtta ggttaagcca aaggctcatc ccggcgagcc tcccctatcc ctagcttagg 160321 atgacatggc ctctgctggc ttgaagttcg ggtcacttac cctggaggcc ttggtccagc 160381 catgtgtctt acctctgaag agggaggcag gaaaaccagt ggggtgggag aggcaaagta 160441 caggagctaa gatgaaggca ccaggactga agccaccggg gtgatacaaa ccagctcaga 160501 agggaaaggc taggggccat ccaccccatg ccccatccct ccagtagggc tggcggctct 160561 gagtcactcc aagaggggtt tggcccagaa cccacaccat ttctgcttga ggggccgaca 160621 gagtttgagc agatgcaccc acaaggcagg ggagggagtc ccggcccttc caaaacggcc 160681 aagtgttcgg ggtaactggc agtgctaaca ggggctcctc cctacctggg agtccggaag 160741 ttgtcggacg ggtggctagc agagctcggg cgcgaacctg ggagggagga aaggaagggc 160801 acgccgccgt cccacagccg gcgacccgct cgggccgtct cggagtctgg gggggcccct 160861 tacgccattc ctgggccccg cggccccgcc aggcctcgca aacgcgcggc ggcggcggcg 160921 gcccttccct gccaggcccg gccgggcgcc cgagtgggcg atcgcggagc agggtcgggg 160981 ccagaggccg cctcccttcc ggaggctctc acctgccaca gccaccgcgc agcttagctg 161041 cagcgtctgc ctggcaggca gtcgcccgcg agctcggcct gtcagggccc cgccccaccg 161101 aagcccctcc cttccccgcc ccgcgcgtca gggccccgcc cccccctcag cgtccagctc 161161 aactccggct gctcgctggg ctccgaccgg ttgtgacgtc tctgggagtg acgcgccggc 161221 gcgggcgcgg tgcgggtgag ctggtcggcg cttggcgctg cctggaccaa tcgcttggca 161281 gcgcatctga agctgttgtg atcgctgggg cgacccgctc tgggagggag ccccaggcat 161341 tcccgattac gctgcgacct ctaaaggcaa tgggacagag ggttagacgt tagcagggct 161401 gggtgcccgg ggttagggtt ttggccacgt tgccgctcaa ggtgcccagc cctggggccc 161461 cagggttgag tttgaccgac ctggacgcca ggctctgtgc ggagttcctc ctttcgttca 161521 agtattgacc gagcgcctat ttctatttta tgtatttatt ttctgagact gagtcttgct 161581 ctgtcaccca gcccggagtg cagtggcgcg atctcggctc actgcaacct ccgcctccct 161641 gcaacctccg cctcccgggt tcaatccatt ctcctctcta agcctcccgc gtagctaaga 161701 ttacaggcgt ccgcccccgc gcccggctaa tttttgtatt tttagtagag acggggtttc 161761 accatgttgg ccaggtaggt ctcgaactcc tgacctcgtg agccacccgc ctcggcctcg 161821 caaagttctg ggattacagg cgtgagccac tgcgcccggc gtaggccagt ttctatactg 161881 tggaacatta agtggacttg tctgcttttc caaactttgg cgccactagc aaggtgcaga 161941 ggctttgtgg taaaggaaga gaaagaaggc agaaagttgg ggaaaaatat gatgggaatg 162001 tgtagaaaga aggtcggaag gcagtacaga agacagtaca gagttggggg ggggggatca 162061 tatgttccca aagcttattg gtgcaactgg aatcaatctg cacacttcaa cctcttgtca 162121 agactgtgat ggagtcggag gtatttgccc cgggtaggtg cacaagacgc atggttgcag 162181 atgtggaaac agaacagaac aaaaccaccc acgtctctaa tgtttcaaat gaatgatata 162241 catttgtttt tattacttat ttatttattt ttgagacagg gtctcgctct gtcgcctagg 162301 ctggagtgca gtggcacaag ctcggctgac tgcaacctcc gcctcccgga ttcaagcgat 162361 tctcctgcct cagcctcctg agtagctggg attacaggcg tgcgccacta ccacccggct 162421 aatttttata ttttttagta gagacagggt ttgccatgtt ggccaggctg gtctcaaatt 162481 cgtgacctca aacgatccac ccgcctcagc ctccctaagt gctgggatta caggtgtgag 162541 ccaccgtgcc cagccaaata atatactttt aataaattta ttgacattta aatttatttt 162601 aaaactttat ttatttattt agagacaggg tctcgctctg ttacccaggc tggaatgcag 162661 tagtgccatc ataatcactg cagcctcgaa cccctgggct caagcgatct tcccacctca 162721 gtatcccaag tagctgggac tacaggagca tgccaccatg cccagcttaa aaagtttttt 162781 tggccgggtg cggtggctaa tgcctgtaat cctagcactt tgggaggccg aggcgggcgg 162841 atcacgaggt caggagattg agaccatcct ggctaacacg gtgagacccc gtctctacta 162901 aaaatacaaa caaattagcc gggcgtggtg gtgggcgcct gtagtcccag ctatttggga 162961 ggctgaggca ggacaatctc ttgaaccccg ggtgggggga ggttgcagtg agccgagatc 163021 gcgccactct cctccagcct gggcaacaga gtgagactcc catctcaaaa aaaaaaaaat 163081 tgatttttgt gtgtgtttta caaagttcaa gttagtatgt gcgtataata tgagcaaata 163141 tacatatgct gagggttgtt acaagggttt gatgtgtgga tgatcaaaaa catttgcagc 163201 catagctctc agccattgtt tagcctccag ggatcactaa ggcagaagcg atttgtgtgt 163261 ttgtacttgg tgttctggaa aggttgttct gtccctttca aatggggttg aagggtcctg 163321 atggaagtag acagtattta gactaggaca ggtgaaaggg ctggactagg cacatgctgg 163381 ggctgaagat tgcaatttgg aaaacggcag tgagggtagg gattgtctga ggatggggta 163441 aaattggcca gagggagaaa ggagtcagct ggaaaagcca aagaaggaaa acccaaaaag 163501 ttgtgtcctt cttgaaggaa aagggaattt ttgtattcca cttgcctgtg ctgttgggcc 163561 tgcctgacac tgtcagtaaa attcacctgg aactagccac ttggcagcca ccaattcttg 163621 tttttgccag gtgtcaagga gagcaaatgc agtcagaaga ctgctgtctg ttattgtcaa 163681 tcacttcagc cttccctgaa aagcccaagt gggcaggtgg gatatctgac atgaagaatt 163741 tggtggcttt gtctgcagga aaagggaata gcaaaattaa aggcaagaca atgttcctag 163801 ttggtttagg gaatcagtaa tttaccccag ggtgcgggtg gctcacgcct gttatctcag 163861 cactttggga gaccgaggcg ggtggatcac ttgagctcag gagttcaaga ccagcctggg 163921 caacatggca aaaccccatc tctacaaaaa atacaaaaat tagcctggcg tggtggcatg 163981 cgcctgtact cccagctact cggaaggctg acgtgggagg gtcgcttgaa ccccgggaga 164041 tggaggctgc agtgagctga gatggcacca ctgcactcca gcctgggcaa caaagtgaga 164101 ccttgactca aaataataat aataataata gtaataataa taataattta ccctagaaga 164161 gaaggaggaa gaggtgagga tatctagtgc ccaaagagaa gtcatctgga ccgaacaaac 164221 agatggaaca aatgatagca acagctgaaa caacatcaac acatttcaga tgggaggcct 164281 gccaggtatt atgtaataag gtcacagtgg tgaccctgag aggcatgcct acctctgctc 164341 ttgaggcagg ggcctctaga aacaggattt tttttttttt tttttttttt ttgagacaga 164401 gtctcactct atcgcccggg ctggagtgca gtggcacagt cttggctcac tgcaacctct 164461 gcctcccggg ttcaagtgat tctcctacct cagcctccca agtagctggg attacaggta 164521 tgtaccacca cacccagcta agttttgtat ttttagtaga ggtggggttt caccatgttg 164581 gccaggctgg tctcaaactc ctgacctcag gtgatccacc cacctcgacc tcccaaagtg 164641 ctgggattac aggcataagc caccgtgcct ggcccagaaa cagtatttga taaataaaga 164701 ggccaggtga ggtggctcat acctgtaatc ccagcatttt gggaggctga ggagggaaga 164761 ttgcttgagc ccaggagttt gagaccagcc cgggcatcat gggaagaccc tatttctaca 164821 taaacagatt tttaaaaaaa tttaaaaagg catggaaggc atatgctagt gtaagtatga 164881 ggtacattga gtatccacag ggaagtgtaa tgacacctgt cacttttgcc tgcccagcat 164941 caggttgcag caccttagtt ttctctgggg gaaccatccc tcctccactc tcagtacaca 165001 tggtttgcat gtggatgacc ccaacgctgg gccctggagt gggcacataa ccaggtataa 165061 ccagccagcc actcagagtg ggcagacatc atgccataat ccagtctgga ccagtgaggg 165121 ttagtcccca ggaacttgta ctgaagtatt gggaatgcca tcctaagtca cagagaggga 165181 gactaaaaga aaatggtatt tatttggaat gggtactgca atgggaatgt gtgtgccata 165241 gtaaactgtg tatattcagg gaggcgaagg gagaaaaagg gttttgttgt tgttgttttt 165301 taaggtaggg tctcactctg ttgcccaggg tggagtgcag tggtgtgata ctggctaact 165361 gcagccttca cctcctggac tcgggtgacc cttccacttc agccccctcg agtagctggg 165421 actacaggca tgtgccacca tgcctggcta atttttttgc attttttgta gagatggggt 165481 ttaaccatgt tgccttacac tggtcccaaa ctcctgggct taagcaattc tcctgtctcg 165541 gccttccaaa gtgctgggat tacaaacatg agccattgca gcaggctgac aaaggtgttt 165601 tttgtttgtt tgtttgtttg tttgtttgtc tgtttgtttg tttttgagat ggagtttggc 165661 tcttgttgcc caggctggag tgcaatggcg cgatctcagc tcactgcacc ctccgcctcc 165721 tggattcagg cgattctcct gcctcagcct cctgagtagc tgggattacg ggcacccgcc 165781 accacgcccg gctgattttg tatttttagt agagacgggg tttcactatg ttggtcagac 165841 tggtcttgaa ctcccgacct caggtgatcc gccttcctca gcctcccaaa gtgctgggat 165901 tacaggtgtg aaccactgtg cccggctgag aaaagttttt aaagaaaaaa tgagaattac 165961 ataattgttt tcaggtaatt attcttggct acaaggatta ataacaaagg tggcatcagt 166021 ccaaatttgg acaggcagct gctgggtaga tgtgttcaca gaagtatttt ctgtgtgtaa 166081 ggtgatgaga ctgcaaaaac tatatcactc tggtgatgat agccgttgta cagggttata 166141 gtttttgcag tctcatgatg ggttttgcta tcaggcatca cacatgagaa ccttcccttc 166201 atggccttcc ccagctccgg ttttcgggtt tttttttttt ttttaaggtg gggtcttgct 166261 atgtgaccct ggctggaagt gcagtggctt ttcacggagc gatcctacta ctgatcagca 166321 tgggagtttt tatttgctct gtttctgagc tgggctggtt cactcttcct taggaaacct 166381 ggtggtctcc tgctcccagg agattaccat attgatgctg agtttagtgg agacaccaga 166441 tcagaatagc ataccacagc ccagaaatcc tgggctcaag tggtcctcct gcctcagcct 166501 ccccagtagc taggaataca ggtgtgtact accatgcctg gctaattttt aaaaattttt 166561 cgtagagaca gggtctccct atgttgctca ggctcatctc gaactcctag gctcaaggga 166621 tcctcccatg acagcctccc aaagtgctag actacagggg tgagccaccg cacctggcct 166681 agcatggaag ttttaacctg ctccatttcc agcctgggct ggttcattcc tccctagaca 166741 acctagtggt cctccattcc caggatactg atgctgagct ttttgtggac acctgatcgg 166801 cctggcatac tacagcccag aactcctggg ctcaaacaat tctcctgctg cagcctcctg 166861 agtagctagg actacaggca tgtgccacca cacctggcag ggtttgtttg tttttttttt 166921 tttttaatga gacagagtct cactttgtca cccaggctgg agtgcagtgg tgcgatcttg 166981 gctcactgca acctccgtct cccaggttcc agcaattctc ccacctcaac ctcctgagta 167041 gctgggacta caggcctgca tcaccatgcc tggctaattt ttgtattttt actagagaca 167101 gggttttacc atgttggcca ggatggtctc aaactcctga cctcaagtga tccacccacc 167161 tcagcctccc aaagtgctgg gattacaggc gtgagccact gcacctgttt tgtttttaat 167221 acaggtgact ccattttgct tctggcaact ttcacatttc ccctttttga tcgtgatctt 167281 tcttcaaaag cagtaggttt ggattgtccc tcagtgctgg gatggacctg tcctgggttg 167341 ttggtctggt tccagataag agggagtgat tgacaactaa gagtcagtgt caaaacccct 167401 ttaggcacat ctgagcaaca gtggaaattt ggagagagtt gcactcagac taagtctacc 167461 tgaagtccat cgttaaattc tattttttct gttctgtggt cttttggcca tcatctggaa 167521 gccccgtgcc agcatactct gttagcagct gtacttctgc agaagtttca caagtaacaa 167581 gtacaaattg ttaaaaggaa aatacaaatt aatattagta gtaatatgat acccccagtt 167641 ttcataatag ttttgagcta tgaacctagg cttaaaggca atcaagtgaa taaatcaaat 167701 gaccatgggg aattaggtga gacccattat aacctatgtg acctgttata acctatgttt 167761 tgtaatttcg tgtatatggg gcctcaagtt ccccagggga atttatccag gttcagcatg 167821 tgctattagc aatagaacag atatttcctg atttaaccaa tagatgctga aggatctctt 167881 aggtcaggtt ctgtgaggtt actgacagaa gctattgatt gtgcaatttc aattacacca 167941 ttatcctgtc aacgaaaagg cagacataag aaaggaaaaa ttaagagtga caagattttc 168001 attttgatgg gtgaagctct caacttcttt tttttttgag atggagtctc gctctgtcgc 168061 ccaggctgga gtgcagcagc atgatcttgg cttactgcaa cctccaactc caggttcaag 168121 caattctcct gcctcagcct cccgagtagc tgggattaca ggtgcgtgcc accacagcca 168181 gctaattttt aaaaaatatt tttggtagag atgggtttca ccatgttggc cagactggtc 168241 ttgaactcct gacccccaag tgatccgccc gcctcggcct cccaaagtgc tgggatgaca 168301 ggcctgaagc caccgcgccc agctgaagcg ctcaacttct tgttctggtt tgcagtttga 168361 gtgtctctgg ttatggcatc aggtggtttg gagaacttcc tgtgtgaccc atacatcagg 168421 catgagactt gtccctggat atttatacca agttttccag cttcagcttc aaggccttta 168481 gcaacataac aggttttttt cttagttgga gagttttagc caaatattag aggaaattag 168541 gaggatttgg ggtctagtcc cgtcttcatg tagataagaa acaatgcaaa gggctgcaat 168601 ctaatagcag gcatatttta gtgtttttcc tttagaaaca actttttctg ggaggccaag 168661 acgggcggat cacgaggtca ggagatcgag accatcctgg ctaacacggt gaaaccccgt 168721 ctctactaaa aatacaaaaa aaaaaaaaaa aaaaaaaaat agccaggcgt ggtggcacgc 168781 acctgtagtc ccagctacag attttttttg agacagagtc tcgctctgtt gcccaggctg 168841 gagtgcagtg gtgcaatctc agctcactgc aacccccacc tcccaggttg aagcaacact 168901 catgcttcca cctcccaagt agctgggatt acaagcatga gccactgttt ttgagacagg 168961 gtctcactct gttgcccagg ctggagtgca gtggtgccat catggctcac tacagtctca 169021 acctcctggg ctcaagcaat cctctcctcc tacctgagct ccctgagtag ctgggcatac 169081 aggtgcgcac taccacactg gctatttttt ttttttttgt ggtagagatg ggggtctcct 169141 tatgttgccc aggctagttt tttttttttt ttttttaact tgggtacctt tggtattttt 169201 agtagagact aatttttgta tttttagtag agatggggtt tcaccatgtt gggcaggctg 169261 gtctcaaact cctgacctcg tgatccgccc gccttggcct cccaaagtgc tgggattaca 169321 ggcgtgaagc accgcgcccc gcctgcccag gctagtttta gaactcctgg gcttaagtca 169381 tcacctgtct cagccttcta aagtgctgag attacaggcg tgactcacca tgcccaccta 169441 attactatat ttttgataaa ataattaatg cttttaattt ttccctttaa ttaattagat 169501 atttttcata tattttggta gaaaatatta cacagaatgt gaaaacgtac ggacacatga 169561 ttacatagaa acagattccc tagctttcat tttgaaattt ttgtcatgag acagtacaat 169621 atagtaatat acgcttgcca gtttataaaa ggacagttgg atccaatttc tgacaaaatg 169681 agacccgttc gtgtggctga gcattatttg accccatagg taatcttatg aaggctgaag 169741 atcaaaattt tgggtaaagc agcgtctaca gcagtttgat ttaaaatatc tttttaaaat 169801 ccttttattc catatcaaat gagtttagag gttaaatatt caaatgttca catttcagtt 169861 aggactagct gaattgtatg agaaaaacag aatttccagt ggccaatctg gtttgcttga 169921 ttagtcagca caggtggaga ggcacttttc aaaaagatat ttacagttgt tttttttcct 169981 cagctttttc tggattgtac atgacagaca aagccaaatt tttatgctgg acagagatac 170041 cttatgtgat tgctgtgtgc tcaagagttt gacctgtttg atctgagaac ctaactttca 170101 caaacattga tctagttctt tcctttttag actatcaatc ttttaattaa ctgttccgtc 170161 accctaagca attgtcagct aggcaaacct ccatacatgt ttctgaaagg aatgactcct 170221 aagtgtacaa ggctaggttt ttggttgcca tggagctgtt gtaatttgaa agcacccttt 170281 tttcttttct ttttcttggc tgaaatgccc taaggaaaag ttataaacag cttttgaaat 170341 caccaatgtt agattatctt tttttattct tatttttttg agacaagatc tcactccatc 170401 gcccaggcta gagtgcagta gtgtgtcacg gctcaacgca gccttgaact tccaggctca 170461 agtgatcctc ccacctcagc ctcctgaata gctgggacca caggtgtgtg cctccacgct 170521 agctaccttt ttgtattttt aatgtagggt tggggtctga tcaggttgcc cagggtgaaa 170581 ttatcttaat acaattatct taatacaagt gaagaagctg gctgggcacg gtggccacgc 170641 ctgtaattct agcactttga gaggccaaga tgagtggatc atgtgagctc aggagataga 170701 gaccagcctg aacaacatga tgaaacccca tctctacaaa aattagccag gcgtggtggc 170761 aggtgcctgt gttcccaact ccttgggagg ctgaggtggg aggatcgctt gagcccagga 170821 ggttgaggct gcagtgagct gagactgcac cactgcactc cagcctgggt ggcagagtga 170881 gaccctgtct caaaaaaaaa aaaaaaaaaa aagaagaaaa aggccaggtg cggtggttca 170941 cacctgtaat cccagcactt tgggaggccg aggcaggcag atcatgaggt caggagatcg 171001 agaccatcct ggccaacatg ataaaataca aaaaattagc caggcatggt ggcgcatgcc 171061 tgtagtccca gctactcggg acgctaaggc agtggaattc cttgaacccg ggaggcagag 171121 gttgccgaga tcgtgccagc ctggtgacag agcaagactc cgcctcaaaa aaaaaaaaaa 171181 agaagaagaa gaagaaaaag ccaacagagt cagcagagag gaggaagaaa agaaaagcag 171241 atagagaagt taggcgcctc tacataccag tgttttagtt ttatttattt aattttattt 171301 tttattttaa ttttattttt ttgaaacagg gtctcactct gtttcccagg ctggccttga 171361 attcctgggc tcaaataatc ctcctgtctt ggcctccaaa agtgctggga taacaggcat 171421 gagccaccgt gcccagcccc aattcctttt aaaggtgatt ttgtttcggg tctcacttat 171481 tttttttttc ttttcttttc ttttttcttt ctttctttct tttttttttt tctgagatag 171541 ccttgacccc tgggctcaag ccatcctccc acctcagcct ccctagtagc tgggactaca 171601 ggtgtgcacc accacacctg gctaattaat tttttttttt tttttttaga gacagggtct 171661 tgctgtgttg cccaggatgg tgtcaaattc ctgggctcaa gtgatcctcc tgccttggac 171721 tctcaagtgc tgggatgaca ggtgtgagcc accacgccca gctgtaggtc ccacttctga 171781 ctctagttat gtcatcctaa ataacaggct ctctgagaga aaatgatatt tattctggaa 171841 tgcgttgcag tgggaatgca tgtgccatgg taaactgtat ggtaaactgt gtgcatatac 171901 agggaggcaa aagaaggcaa aggtttttca aggaaaagtg aagaggatca cctaattgtt 171961 ttgagagaat tctccttggc tacaaggatc gataacaagg gtgacctcag tatgaggttg 172021 gacagggtta gggttaagtt gctcagctgg gtgcggtggc tcacgcctat aatcctagca 172081 ctttgggagg ctgaggcggg tggatcacct gaggtcagga gttcaagagc agcctggcca 172141 acaaggtaaa accctgtctc tactaaaaac acaaaaatga gccaggcgtg gtagcgcatg 172201 cctgtgatcc cagctactca ggaggctgag gcaggagaat cgcttgaacc taggaggcgg 172261 aggttgtggt gagccaagat tgtgccacta tactccagcc tgggtgactg atccagactc 172321 cgtttcagaa aaaaaaaaaa aaaaaagtca gccgggcacg gtagctcacg cctttaatcc 172381 cagcactttg ggaggctgag gcaggtggat cacttgaagt caggagttca agaccagcct 172441 ggccaacatg gtgaaaccgt gtctctacta aaaatacaaa aattagctgg gtgtggtggt 172501 gcacacctgc aatcccagct acttgggagg ctgaggcaga attgcttgaa cccaggaggc 172561 ggaggttgca gtgagccgag attgcgccac tgcgcttcag cctgggtaac aagagcgaaa 172621 ctccgtctca aaaagttgct gggcagatgt ccttgcagta ttttttttct gtggaaagct 172681 gtggtttctg catttgcagc cttccccagc tctgtttttt gttttttaac acaagtgact 172741 ccattttgat tctgacaact gtcatggcaa ggaaggctgc catgttggaa aatgaagcca 172801 atacagagag aaaaacaaag ccaagaggtg gagagagccc aagtcccttg ggcttcgggg 172861 tccattacag gaacaatgca attctgccat cctccctttt tttttttttt tttttttgct 172921 tatgccagtt tgagtgttga attctggcat tgtaagtgag gacatcctaa ctcctaaagg 172981 gcaggcccag ggggcgatac ccctggcaaa accaggaccg tgggatatgt caagcttacc 173041 tcctctttgg gaagtcaggc aggctgtgct ccatgccatc agcttcctag cctgtgttgg 173101 gctgtgacac tggtgacccc agtgagggtg cagccccatg aagaggtaga ggctgtcact 173161 gtggctcact acggtgttcc tggaccagtt caggcacttg gtgaatgttt agtgaacaaa 173221 tgagctgaag gaacaaatgg aggaaggctg ggcaatcaca gcagttgctg gagaagtgtc 173281 ggggagccca agcccctccc agcctggcag tatagacact ccgtgagttt ccgtttcatt 173341 taggggcaag ctagtaacct cgtggaattg tacttatctg gcttgtagtc ttgctacacc 173401 gggaactctc ctatccccac tagggacctt tacttttttg tacagtgggg acaatggcca 173461 cacttactgg ctagggttgt ggtgaggtgt tacctatggt ataactgagg tattgatgca 173521 agaaggaagt tgagatttgt atcaacatat gggaccatgt gggtacagat gagacagcac 173581 tggtcacatg tcgatcacta gagggatgag cacatcagag ttgatgacag tcatctgttg 173641 gtatccgata ggtatttgtt ccagggcacc ctatagctgc caaagtccag ggatgctcaa 173701 gtccctgata taaaatggat acattatagt tatacatatt tattcccata gatttttttt 173761 tcttttgaga tggagtctcg ctctgtcacc caggctggag ggcagtggtg caatctcagc 173821 tcactgtaag ctccgcctcc cgggttcaag cgattctcct gcttcagcct cccgagtaac 173881 tgggactaca ggcacccgcc accatgctca gctaattttt cgtattttta gtagagacgg 173941 ggtttcaccg tgttagccag gatggtcttg atctcctgac ctcttgatct cctgaccttg 174001 tgatccgccc acctcggcct ctcaaagtgc tgggattaca ggcgtgagcc actgtgcgcc 174061 cagccctttt ttcttttttt ttgagacagt gtgtcactct gtcgtccagg ctggagtgca 174121 gtggcgtgat ctcggctcac tgcaacctcc acctcccagg ttcaagtgat tctgcctcag 174181 cctcccgagt agctgggatc acaggcatgc gccagcacgc ctggctaatt tttgtgtttt 174241 tagtagagac agggttttac catgttggct aggctggtct gaaactcccg acttcagatt 174301 atctgcccac ctcggtctcc caaagtgctg ggattacagg tgtgagccac cgtgcgcggc 174361 ctgatttttt tttttttttt tttttgagac aaagtctcgc tctgtcaccc tggctggagt 174421 gcagtggcac aatcacggct cactgcagcc tcaacctcct ggactgcagc aatctgcctg 174481 cttcaacctc tgtaagtgct gggattacag acatgagcca ccaagcctgg tcccatagaa 174541 tctaaaccat cttgagatta tttatcatgc ttaatacaat gtaaatgctc tgcaaataac 174601 aattacactg tattgtgtgt gtgttttttt ttgataggat ctcactctgt tgcccaggct 174661 gaagtgcagt ggtgtgatct tggctcactg catccttgac ctcctgggct caagtaatcc 174721 tgtctcagtc tctcaagtag ctgggactac agatgtgtgt caccacgact ggctaatttt 174781 atttttttgt agagatgggg tctcgctatt ttgcctaggc tggtcttgaa tgcttgggct 174841 caagtgatct gcccactttg gcctcccgaa ggctgggtta tagggcatga gccaccgtgc 174901 cagcctacac tgtatttttt tttttttttt taagacgaag ttttgctttt gttgcccagg 174961 ctggagtgca atggcacaat cttggctcac tgcaacttct gcctcctgag ttcaagtgat 175021 tctcctgcct cggcctccca agtagctggg attataggcg cccgccacca cgcctggcta 175081 attttttttt ttttgagacg ggagtcacgc tctgtcaccc aggctggagt gcagtcatgt 175141 gatctcagct cactgcaagc tccgcctccc gggttcacgc cattctcctg cctcagcctc 175201 ccgagtagct gggactacag gcgcccgcca ccacgcccgg ctaatttttt gtatttctta 175261 gtagagacga ggtttcaccg tgttagccag gatggtctcg atctcctgac ctcgtgatcc 175321 acctgccttg gcctcccaaa gtgctgggat tataggcgtg agccaccacg cccggccact 175381 gtattgtttt ttacttgtac tattttttat tcttgtattt ttattttttc caagtatttc 175441 caatctgtgg ttggttgaat ctgtggacat ggaaccccca tggatatgaa gggccgattg 175501 tatgctagtc tgtctcctgg agatgtttgc aattttccat agcaagtcta aagagtgaag 175561 tgtgtatgat cctgcagtgg atccatcatt acacattttt gtccaaaccc actgaacgtc 175621 caacagcaaa ctgaagcctc atgtgagcac cgatactggt tcattgacag caacagatgg 175681 gaagctgggg gctggcaggg ggcacacagg aactctgacc ttttcactca gttttgctgt 175741 gaacctaaaa tggctctaag aaggaaagtt tacaaaaaca aaaaaggaat ggaatacacg 175801 tagaaagttc ctggtgcaga ggcaggactg gcatcaccat cgtttcctcc ctggtgagcc 175861 gctcctccag ggcacccctg tgcctccaac cccacatgtg cctgctccag aagcactttt 175921 gtcttctgag gctctgtctg cgctctgtac tcttcctggg attggctcct aagtccatcg 175981 cccctaacgt cgccccatga aggacaccac agtgcctccc atctaggaac tgccttctct 176041 cctttcgctg aactcagact gcttctgggc ctctggagtt gatattccag caaaattgct 176101 tctaatttgc tggctctcgc ttagcaaagg gaggtattaa ccgtaacctg ggagagacgc 176161 aaaaagccca agtgactcga cccctcacaa ccctgacttc aaaggcagcc gaagtctgtg 176221 ggcgacatgt gctcctggat gatgatgggg tgtggtgggg gcctgggctg gtgttaatcc 176281 caatccctgt ccctgtccgt acaggcctgc actgggtcag agggagttgt tgagtcagct 176341 tcctgcagcc cgagtgtgca atcttctctg ccagacgaaa agagtcccac tgctctgttg 176401 actgcttgag tcacatctca gtcgcagcgg ggcctgttgg tggcaggagc cctggagaca 176461 aaaggctgtc tgctcgggct gtgggagcag gttggttaag actctggaat ctgtcctccc 176521 gcaccccaga ggacaccgcc acctctgact ccgcactgac aacagcatct gaaggcattt 176581 cttctgtggg gccatcactt tattaagggg tcatctagaa ggtgggcccc ctgacaaacc 176641 gcgggactgt gatcgggctc cagctacttc accaccccgg gccagcctgc tccaggggtc 176701 ccttcctgct gagagcaggc gagaggcagt caggctcatg aagcagccac cgggtttggc 176761 tcactggaag gaatcacact ggaaacatgt ttagcccgca gtgcagagtg gctccagaag 176821 ggagaggttc tggaagacgc cccaacctgc cgggctgctc ccagagatgc acagtgaggg 176881 gcaggcaccc agggccgttc caggactcaa gatggggatg gagatggcgt gaggaatgga 176941 ggacagggct agatgggctg accgggggga acagtgttac gaaaaggagg cgggtaccct 177001 gggctcccgt gacaaagtgc ggcagggcta ccccctgcag cccccatagc ccccaccacc 177061 ttgagaccac gttctggtca ctgcctcgga gccccgatgt gttggggcca gggagcgctc 177121 ctgcccgggt gtggggtgtg agcctcagcc tctgtccgcc cggcagcgcg cgtgcctccc 177181 ttggtctggc ctccctcggc ccggtcctgg cactggccgt gctagctggc cgtgcttctc 177241 cgcgggatac agccttccat ctccagcgcc tcggggcagc cttcgtactt gttgctgtgc 177301 ttactgttct tcacgtgctg tggggagggg agcagaaagt catgactttg gccctcaggg 177361 acaggggaca ggagtggacc aggacccagg ctgtcagtca actgtgctgt gttggccaac 177421 ggtgtactct cagggcactt ggcccctcca cccatcccag ggaaacatgg catcagctgc 177481 actggcccac ccctcatagc tggactgggg acggtcccct gctggggcgt gagcagctgg 177541 ggaggctcgt cgcatgcagg cagaaggggc gacaggctgc cccggtgggg gtgatgccag 177601 gcaacccccc atacctgctc aaaggggctc ttgttctgca cacccttggc cccgacaaac 177661 acccagctgt cccggaaggc cagctccttg gcgttcctgc tgcccagctc actgaagagc 177721 tttctggtct cttcattcat cctgcagcat gggaggaagg ggtgtcagct gttgctgcag 177781 aagggcaagg ctgaggctgg cctccccaag cacctacttg gtggctgggt cgtcgtagga 177841 tgccacgaac accagggtgc cttcgtgcag tggccgaata aacttcaaca ggtcgttgac 177901 atctgggggg gcaggtgcca cggaacaggg gtcatcaggc accactgagc accctcccac 177961 aagcccgttc tccaaacaag gaaacagaag tagggattgg gcctgggaca tgccccgcca 178021 agggacaggc tggtgtagga agctgaggca tccccgtgac ctcccacagc ctccagacca 178081 agctggcgat gcccctgtcc tttctgaacc cacggaggtg actgagaagc tccaagtaca 178141 gacagagaga gcagtgtctc aggggaagct gcccaggtca ttttgctgca gtcaaaatgc 178201 aagacgctgc gctgcaactc acctccggcc cacatgtcaa aggcccgggc ctcgatgagc 178261 tcgccgctga cccctgtgtc aggagggagg ggccctgctg ggtaagccag gccagcctca 178321 gggcacggca ggcgggtagc accgggggtg gggggcagcc cttctggacg ctttcaaaca 178381 gtcccccaat attcatccat ttgttcaggc agctgcatcc caccccaagc cgtggtgcct 178441 ctgtctggca tttgaggccc tccgggggct ctccatgaag caccctcaac ctcccagaac 178501 cttccctagt tctttattgc catcatgctt ttttgttcat tgggtcactc ctctgtggag 178561 ccttgcgctt gcacagactc cttcccaggc ctccctggga gatcttgaat gggaggggaa 178621 gggagggcaa acacccagag gcctcctggg gagggagcag ggaaggaggg cggcaggcca 178681 cgctcagcca ggggcctcac cgttcaccag ggcgatgttc agcccgcggc ccacgttgtc 178741 cttgacgctg ctcatcagcc tagttggggg ggggtggggg ggacggggag atcccacatg 178801 ggcacgtctg ctccactctc ccttgcagac ccggccctgg atcccccact ccctccctga 178861 tccccagtca ccccagggag ctcacatctt gtcctcgagg cagatcttgg gcccaatgac 178921 gttggcggcc ccgctgacca cgcggaaggc caggtgctcc tcaggacacg gctggggcag 178981 gccacacttg tacttcctgg cccgtggcgc tgggcaggga tagcaggtgt tatccatggg 179041 cctggccctg aggtcacctg agatctgaag gggaggtcag gaccagcaca catggtagaa 179101 caagtgaccg cagagcagga cacaagtagg gggatgtggc tcaggccgag gctggggaca 179161 cttccatact ctgtgtccta ctggtgaggc cacagggacc caagacagaa ggattttctg 179221 ttactggtgg gagaaaagaa aggatgctga gggctggcct gggccactgc acagccagga 179281 acaaagcagg ggtgaagtaa gtcaggctgg gggtctagct gcatggatga ggccaacctg 179341 agcaaaggca gtgtggcagg gggtagtgtg gcagggggtg cagcctgtct cccgattttg 179401 atgggggttg gttgggagga ctggagcagg atggcttcta ttatgggttg aattctgtcc 179461 cctcaaggta tgctgaagtt ctaatctgtg gtacttcaga atgtgatgtt tgcaaatagg 179521 gcctttgcag ttcagatggg gtcatactgg agtaggttgg gtccttaatg caacatgact 179581 gatgtcctta taagaacaga agtgacaaag acacacagag ggaagacagc catggtacag 179641 tggaggaaga gacaggagtg acacagccag aggccaaggt atgcctgggg ccactggaac 179701 ctggggaggt caggaaggac cctcctgtag aactttcagg gtgagtaacg ccctgctgac 179761 actttgattt cagacttctg gctttcagaa tggggagaga atccatttct gttgtttaag 179821 ccacccagtt gaaatggcag ttccaggaaa tggatacagg ttttggcacc aggaagtgtg 179881 gtgcttaggt aacaaatacc taaaaatgtg gaaatggctt tggaactggg tagtcagtag 179941 aggcttgtct tagtccattt gctgctgcca taacagaata ccacagactg ggtaatttat 180001 aaagacaaca gattcatttg gctcatggtt ctggaggctg ggaagtccaa aagcatggcg 180061 ccggcatctg gtgagggtca tcccgtggca gaaaggtgga aggtggaagc aagcatatga 180121 gacgcaggaa aaaagggggc tgaacttcaa ggtaactaac tcacccataa taatggcatt 180181 catccattca tgagggcaga gccctcatga cgtaactacc tcaaaaaggc cctcctctca 180241 ataccaatac attgacaaat ttcaacatga gttttagaga tgacatttaa accatatcaa 180301 ggctatagaa gtttgaggta catgttagaa aaggtctgga ttgcctttaa gagactgttg 180361 gtaggagtat ggatgttaaa ggtaattctg gtgagaggag gggagagcaa tagagacagc 180421 ctctgtggtc ttagagaata caaatatcat caagaacaga acgttgctag aaacatgaat 180481 gttaaaggtg cttttgggtt gggtgtggtg attcacgcat gtgatcccag caccttggga 180541 ggccaaggca ggagcatcac ttgaggccag gtgatcaaga ccagcctggg caacatagtg 180601 agaccccgtc tctacaaaaa atacaaaata agctgggtgt ggtggtacat gcctgtaatc 180661 ccagctactc aggaggctga ggcaggagaa tcacttgagc tatgattgcg ccactgccct 180721 ccagcctggg tgagagagca agaccctgtc tcaatacatt caaaaaataa ataataaaaa 180781 ataaaacatt agctgggtgt ggtggcacat gcctgtagtc gcagctactt gggagactga 180841 ggtgagagga tcgcttgagc ccaggagttt gagcttacag tgcacgggga tgataccact 180901 gaactccagc ctgagcaaca gagtaagacc tctgtctcta ccaaaaaaaa aaaaaaaaaa 180961 aaagggtgct tttggttaag cctcagagga actggaggga catgttattg agttattggg 181021 cgctggagga aagctgatcc ttgttataaa ggaacttggc ccaggtgtgg tggctcacgc 181081 ctataatccc agcactttgc ggggccaagg cgggaggatc acttgaggtc aggaattcga 181141 gaccagcctg accaacacgg tgaaacccca tctctactaa atatgaaaat tagttgggta 181201 tggtggcagg tgcctgtaat cccagctaat caggaggctg aggtgggaga atcgcttgaa 181261 cccgggaggc ggaggttgca gtgagccaag atggctccac tgcactccag cctgggtgac 181321 agagcaagac tccgtctcaa aaaaaaaaaa aaacttggtt gaattgtgtt ctagtgtttt 181381 gtcgaaagga gaatgtgtaa gtgctgggtg ctgaacttgg atattcagct gaggagattt 181441 ccaagcagtg ttgagagcgt agctgacttc tccttgctgc ttatagtaaa atgtgagaga 181501 cataaggaag aaactgttga gcacaaagga gccagaaagc gaagatgtgg aaaattcttg 181561 gcctaatcat attacaaaaa ctgcaaaagt gtcccctgga cagaacagca aggctgttca 181621 agcgtttgcc ccagagctga ggtatgtgac tcgtgggtcc actcaaccct ttcagcagaa 181681 atctacaatg aagatggggt tatctaggaa gcgtctgtgg gaaaactctc tgattttata 181741 gcttgggcct ctataacttg caaaagagac caacaaggct cttgagaatc tgacaccagc 181801 agaaacactg ccggcctgga gtgaaggggg cagagatgag ataaagggaa ggaagggtgg 181861 ccccgaaggt ggggctgtct cgtttcagag catgggtcat tccagcaggc ccagaagaaa 181921 gaatctcagg ccacagagga ttcccctcag gccttgaaac caaatgaatt ttgccctgct 181981 gggctttgga cttgcttggg acgggtgacc tcttttttcc ttccaagttg cccttttgca 182041 gtgggagtat ctatcctagg cctgccccac cgtcatattc tggcaggaag taacttggca 182101 ggaagtaact tgttttgtag gttgacagtg gcaagcagcc cccgtggacc accacagctg 182161 gtttggggag gaataaatgg aaaaccgaag ccctacggga ccgggggcag gcagggatgt 182221 cacaaagggg aggagcaggg aggtgacagg ggccttacct gcagtcaccg agctctctgg 182281 acctgtggat acagagtggt ttctgttagt gtggagcctc ccggggcctg cggcagggca 182341 gcctggccct ccttacaccg cgaagtgtag acacaggcag tcaggctcag gggccggggg 182401 ctaagccagg ccgaccttgg catgcttcct gctcaagtgt aagtcatgtt cagatgaaaa 182461 actccctact ctaggaacag ggggaaatgg aattagaaaa actacttaga tgacctggcc 182521 gggcatagtg gctcatgcct gtagcacttt gggaggctga ggcgggtaga ttgcttgagc 182581 ccaggagttt gagaccagcc tgggcaacat ggtgaaaccc catctctact aaaaatacaa 182641 taataataat aataataata agaagaagaa gaagaagaag aagaagaaga agccgggttt 182701 ggtggtgcgc acctgtagtc tcagctactc gggaggctgg ggcactagaa tcgcttgaac 182761 ccaggaggta gaggttgcag tgaaccgaga ttgcaccact gcactccagc ctgggagaca 182821 aagtgagaca ctgtctcaaa caaacaaaca aaaaagtact caggtgacct attaccagtc 182881 aacaaggtgg agagtcccct ctccatctca gagcctgcag ggagcaaccg gggcatcact 182941 ccccatgttc agtggaagct ccctacccct tactctggcc ctactgactc cagttgggca 183001 tataattcta gtgatcaggt attggctttc ctagatgaag caactgggtg ttgggggagg 183061 ctgcgcctga tgccaggaga cagactgggg acagccaaag gtagcctgcc agggaggaga 183121 tagaaaaccc aagtcaactc ccaacccaga gcctcaacca caggcccagg tggccacctt 183181 ggtggcctcc agagaaggat aaggcggggt cccacactca ctggtgaaga gttgctggat 183241 gcgaggaaag ccactgccag gcccacccag gaggatgctg accacgatcc atgtgacacc 183301 cacactgacg actaggacca caatgcggag agggcctgga ggcaggatag acacagggtg 183361 gggtcacctc agataccagc gaaccccaga atgaccccat agccttgcat gactggatct 183421 cccctgccac ttgggaagaa gtggctggag gacaaggagg tggagacctg ttgccttcta 183481 ccagggctga aagggagact tgagctccct ggcaggtgga aggaggaaat tgagttcaaa 183541 tcccaaggct gctgggtttg gttttccctg gcgggtctgg gagggggcct tccagaactc 183601 tctgctcgag aagggctcca gccctctttc caagaacccc ctcaacttcc tgcctctgaa 183661 accaccaata tatttgggca tccacctgcc cctgctttcc tcctgttctg agggagaagg 183721 gtccctactc ccttttaaga ccagtcccgg ccgggcgcag tggctcacgc ctgtaatccc 183781 agaactttag gaggccgagg cgggtagatc atgaggtcag gtgttcgaga ccagcctgac 183841 aaagatggtg aaaccctgcc tctacagaaa atacaaaaat tagccgggca tggtggcgcg 183901 cacctgtaat cccagctact caggaggctg aagcaggaga atcagttgaa cctgggtggc 183961 ggaggttgca gttagctgag accatgccac tgcactccag cctggcgaca gagtgagact 184021 ctgcctcaaa acaaaaacaa aaacaaaaca aaacaaaacc aaaacaacaa cagtcccgtc 184081 atctgctggg acctggattg caagctcccc agccttctct gatttcttgt aatagtgatc 184141 actctctcta gttcgctctt tctcctgtat tttcaacatc tccaatgtta aggaagcaaa 184201 gaaaacactc ttcttcaaaa ccacatcccc atcctctttc ccctaacctt tcttttccct 184261 gtcaaccccc tcaaaagtca tctatactca ttccatcatc tcacctccta ctccctccca 184321 ccctactcca gtctggcctg gcttccacca cctcaccaaa tcagcttgtt gccaaggcca 184381 ctgatgacct ttgtggtgtc accacccctg gtgacctgtc agccttgatc ttccttgacc 184441 tctcagtggg ctgcccatgt ccacctgaca gaagcactaa cttctctgct ggctttccgg 184501 acagcacact ctcctgaccc aatctctctg gacactcctc attctcctta gttagatcgt 184561 ctcctctacc ctccccttaa aagctggact ttcccaggct cccatctcag gctgccttct 184621 ggtccctacc tgtagtgtct ttttattttt tttatttttg agacggagtc ttgctcttgc 184681 cccccaggct ggagtgcaat ggcgcgatgt cggctcactg caacctccgc ctcctgggtt 184741 caagcgattc tcctgcctca gcctcccaag tagctgggat tacaggcgcg caccaccgtg 184801 cctggctaat tttttttttt gagacagagt cttgctctgt tgcccagact gaagtgcagt 184861 ggcgtgatct tggctcactg caagctccgc ctcccggtta cacgccattc tcctgcctca 184921 gcctccctgg tagctgggac tacaggcgcc cgccaccacg cccggctaat ttttttgtat 184981 ttttagtaaa gacggggttt caccatgtta gccaggatgg tcttgatctc cagacctcgt 185041 gatccgcccg cctcggcctc ccaaagtgct gggattacag gtgtgagcca ccgcacccgg 185101 ccatgcctgg ctaatttttg tatttttagt agagacgggg tttcaccatc ttggccagac 185161 tggtctcgaa ctcttgacct caggtgatcc acccgccttg gcctcccaaa gtgttgagat 185221 tacaggcgtg agccactgcg cccagctgtc tttacttttt tgagacggag tctcgctctg 185281 tcgcccaggc tggagttatg cggtggcgca atctcggctc actgcaacct cagcctcccg 185341 ggtagctggg attacaggca cccgccacca cgccaggctc atttttttgt atttttttag 185401 tagagatgag gattcaccat gttggccagg ctggtctcga actcctgact tcaagtgatc 185461 ctcccacctc ggcctcccaa agtgctagga ttacaggcgt gagccaccgc gcccggcctg 185521 tagtgtgttt ttagatgatg tcatctactc ctagctcttt aattaagcac catgtgtcaa 185581 tgactttcta ttttctagtt ccagccgtgg cctctcttga aaccactccc tgccagctgg 185641 ttcctcagtg gccagctcca cctggatatg tccacaaagt taccacctta gcaagtctga 185701 agtccaattc ccgagttgtt tctactctcc acccagcccc aacctatccc tccccaagga 185761 tcttcatctt acttgtccag gtgacagtag ggctgtcatc gggttcatgt cgaagcccac 185821 agccactttt gtcacttccc ctctaagacc acgcctctcc acctggactg cccacctcct 185881 ctctctcacc actcattttc caaaaagcag ctggtctgct acctgcgttg ggagatactg 185941 actatactaa tcctcatgtc atagctcggg gagctgacgt cctcagaggt gaaaggattt 186001 ggttaaggtc aaggaacgag tgtttgtgca gagtcctcca tttccaaaac caaacccagt 186061 ggttcacccg ggcctctcca gacacagatg tcgcgacgct gggcctccag ccaggtgccc 186121 ttttcaactc cctcccgttc gctgtggaag ccacccacct gccaacctca tgtccacttc 186181 tcctgctggg ttgggaccgc cggcaagtgc actgtttggg ggcaaagcgg aaggacaggt 186241 ttgggtgggg gtcaggggca agcctgtgcg ggacccctgc tgggggcggg cgcgatcggt 186301 gctacgacct gtttgtccag ccgcccggcc aggcagggct cctcggaaac gcgcggggaa 186361 accctgcctg gccaggaggg gcctcaggaa cccgttggct cacgatcttg cccacaggag 186421 cctccggggc tgggacgaag atgtggttct cctgggctcg ggccgttcct ccgggcctgg 186481 gggctggcga tgcgggccgg gccggagatt tctctgtccg tggggagacg tgggctcggt 186541 cagggaggtt ccctgtcagg cccgccgacc cgctccgccc ccggagagga cgcaaggccc 186601 ctgcgcaggg tggccaggca ggcccggcaa tggcggctaa gggcggggcc acagcccgcc 186661 ggccccgccc ccggcagcag ctgcgccgtc tggctccacg caggccccgc ctccagaccc 186721 gcctctaccg ctcgcgctgg ccgccgcatg ggaggggcct gaacctgtcc cgccccgaac 186781 ctgtcccgcc cctggatccc tcgtcggccc cacctccgcg gtgcgttcag gctgaaccca 186841 ccggccgccg agtgggaggg gcccacgtcg gtcccgcccc ggaagtctcc cctcggccct 186901 ttctcgggtg ggagagccct gcctacccag gcttagaggg ctggggctgc gcttgcaccc 186961 tgaggtcaga tttgtgagtc gtgggaagac gcagatggat ctggggcaga attagaacct 187021 gttccaaagt tgatccatgc gctttgtaaa ggcaagggag gtacagaggc gcagataggg 187081 aagaccgcag gatccagcgt gagcctctgg ctgcctttcc ttagggtctg ttttctgtcc 187141 agaagcgcgt tcctgctgtc tctcgttatc acttttataa tgggacgggg cgtctcaaac 187201 tatttcgaca tccttgaaaa ctcttgcctg ctccgcctgc ctggtctcta acaccctcct 187261 cctcagagct gctcaagctg tgttgtttac gccacacccc accctaaaag cagaggagaa 187321 tacaactttc gggggcagtg gaggggggag tttagcgcta taacccgagg ctttcacagc 187381 aaaccagaat ttattttgag gcctctcagg ttcagctttc aagtcacgtg actttcgcgt 187441 cctacacaac tcttattttc acttgaaaac aagtgtgacg tcgtgaattc cctaacaata 187501 ggaaaaaaga aggctccaga aagagcggtg cttggctttc ccagcggctc caagcggtgc 187561 ccggcaccca gctggctgta tggacctggg gtggtctccg gccgcaggca gacgcggccc 187621 ttcggccttt ctccagtcac cccaattctc cctgactgcc tggctatctc ccctccgcgc 187681 ctgcccgcac gctgccccca acactgcccg ggctccctct gcccatgaga acacgggtca 187741 agctcccgtc ccttctgccc tgccggctgc cccaagtgcg ccctgaccac ctactgcgct 187801 ggcatggccg gggccaccct gacagctgat ccatcatctg tctggagagc agcagcaggg 187861 cccaatgaga accacggaaa cctccctgcc ctttgaccca acttacggga atctcctacc 187921 cttagggaaa caatgcaaaa tatagagaaa gtttatagag ccgggcatgg gagagccctg 187981 gttagaatcc agcagcttct atactggagt actcaacagg gtcaaggtcc ctttcagctg 188041 cctgctgctg gggccagggc tttgcatttg cgttttgtca tggaatttga cagcctctca 188101 agtaattatg gtaacgaatc ttgggctcca tgccctcctc aggggggccc agtcgaagaa 188161 gctccaggtt actcacaccc ttgagctgcc agccggtgtt tttcaaagac aatatttaca 188221 tgttgttcag agcaggcatt tatgaaagtg ttgctttaat gagtccccct gagcattccg 188281 agacaactct cccttccttg gggacaccac gtgggtgatg ggccagaggc ctgggattca 188341 gttccacgtc ttccgtgctt cctggggcag ccttgcaaaa gtcccttatt ttctctggga 188401 tggtttggat tcctgtgggg gtctggccag cacggtgact agagcacatg gtctgtcacc 188461 atccaggcca acgtcctggg gtacactcac agtcatctgt tcttccaggt cagccctcat 188521 agatcagggc agcttcagca gttctcaaat agcagatgtt acaaaatggg ggtgggcaat 188581 cctgcagcag acgtctctgc actttctgtt agaaggacac cccaggtatt tttgagcact 188641 gttctgaaga cggaaaaatc cagagtatac atacacttaa agctgggagg ggcacctgcc 188701 ggggagaatg cataagggat agaaacaaaa acagagggcc gaagcctctg ccaggcagaa 188761 gacactgagt tctcagcaaa gagaacacct ttcagtgctg tggtcaaagg gatgtcaagc 188821 ctcctgaggc ccggaaagag caggaaactg agggcagaat tggaaaaaat caagttacag 188881 acacacacca cgggataacc tctatacaga gaggagcaca cccaaaataa tactatatat 188941 ttatagatgt acttttgtat tggggtaaga catatataac ataacatttg ccatccaaat 189001 cattttaagt atacgatcct gtggcatcag ttacattcac aatgttgtac aaccatcact 189061 actatctttt tgtgtgtgtg tatattactt tttttttttt ttccccgaga cagagtcttg 189121 ctctgtctcc aggctggagt gcagtggctc aatcttggct cattgcaacc tctgcctacc 189181 aggttcaagc aattctcctg cctcagtctc ctaagtagct gggattacag gtgcctgcca 189241 tcatacccag ctaattttta tatttttagt agagacgggg tttcaacatg ctggccaggc 189301 tcgtttcaaa ctcctgagct cgtgttctgc ccgccttggc ctcccaaagt gctgggatta 189361 caggcgtgag ctactgtgcc cggccatgta ttacttttat gggctcatat atatgtccat 189421 agaaatatag agttctttta aaaattctat caatagatat ttctttttct cttttctttt 189481 tttttttttt tttttgagtc agaatctcac tctgttgccc aggctggagt gcagttgcgt 189541 gatctcggct cactgcaact cctgcttccc gggttcaagt gattctcctg catcagcctc 189601 ccgagaagct ggaactacag gcgtctgtca ccatacccgg ctaatttttt tgtattttta 189661 gtagagatgg ggtttcacta gttggctagg ctggtctcga actcctgacc tcaggtgatc 189721 cgcctgcctc ggcctcccaa agtgctagga ttacaggcat gtgccaccat gcacggaaat 189781 caatagatat ttcatttcca ccaatacaga agcatcagtt aacagtgact ggttgacact 189841 tttagaaagt ggcttgagaa gggccctgaa tgagccacct acgtggaaca cagaatttga 189901 gatgtgcaaa tgtggagacc tgaaggcttg ttatgggaac aaaaggtggc tgtgtataga 189961 gaggagcgca gataaagggg acttatcagc agtcgcaggg agttcatgct caacaactgg 190021 caacaagcgg ggcagctaca ggctactcaa ctggggattg atgcgcggtg cctctgaggt 190081 tcagctctgg gcctgatcgt ccaagaaagg gcagctaagg tgggcatagt gaatcacgcc 190141 tgtaatccta ggactttggg aggctgaggt gggaggattg cttgaagcca ggaacttgag 190201 accagcctgg gcaacaaagt gagaccctgc atatatatat atatatatat atatatattt 190261 acacaaataa ataaagggca gctagactcc acagaaacca tgttgaagat tcctgtgcag 190321 gaagggggtg ataaaatgca gaaacctgac ctgacacagc caacacataa ctgcattgcc 190381 tcatttttca cgattttttt ctttcttttt ggagagaggg tctcattgtg ttgcccaggc 190441 tggagtgcag tggcgcagtc acagctcact gcaacctctg cctcccaggc tcaagcgatt 190501 ctcctgcctc agcctcctca gcagctggga ttacaggcat gtgccactac tggccggcta 190561 atttctgtat ttttagtaga gacggggttt caccatgttg gccaggctgg tctcgaactc 190621 ctggcctcaa atgatccacc tgcctcggcc tcctaaagtg ctgggattac aggcgtgagt 190681 caccacgccc agccattaat tgccaagtct cattggctcc tagtgtctta atccacatct 190741 gctaaagttg tttaaggtac agctaaaaca tcacccaggc tgggctcagt ggctcacacc 190801 tgtaatccca gtaatttgga aggcccaggc gggagtatca cttgagccca ggagttcgag 190861 accagcctgg acaacacagg aagaccttat ctctaccaaa aatacaaaaa ttagcagcat 190921 gtggtggtgc acacctgtgg tcccagatac ttgggaggct gaggtgggag gatcacttga 190981 gcccaggagg tcaaggctgc agtgagctat gatcacacca ctgcactcca gcctgggcag 191041 cagagcaagg ccctgtctca aaaaaataaa ataggggcca ggcacggtgg ctcacgcctg 191101 taatctcagc actttgggag gctgaggcag gtgaatcacc tgaggtcagg agttcaggac 191161 cagcctggcc aacatggtga aacctcgtct ctactaaaaa tacaaaaatt agctgggcat 191221 ggtggtacac gcctgtaata ccagatactc gggaggctga ggcaggagaa tcgctttaac 191281 ctgggaggtg gaggttgcag tgagccgaga ttgtgccact gcactgcagc ctgggtgaca 191341 gagcgagact ctgtctctaa ataaataaat aaaataaata aataaataaa taaggctggg 191401 tgcggtggct cacgcctgta atcccagcac tttgggaggc caagtgtagg gaaaagagag 191461 atcagactgt cactgtgtct acgtagaaag ggaagacata agagactcca ttttgaaaaa 191521 gacctgtact ttaaacaatt gctttgctga gatgttgtta atttgtagct ttgccccagc 191581 cactttgacc caaccacttt gatccaatct ggagctcaca aaaacatgtg ttgtataaaa 191641 tcaaggttta aaagggacct agggctgtgc aggacgtgcc ttgttaacaa aatgcttaca 191701 ggcagtatgc ttggtaaaag tcatcgccat tctctagtct caataaacca ggggcacaat 191761 gcactgcgga aagccgcagg gacctctgcc ctggaaagcc gggtattgtc caaggtttct 191821 ccccatgtga tagtctgaaa tatggcctcg tgggatgaga aagacctgac cgtcccccag 191881 cccaacaccc gtaaagggtc tgtgctgagg tggattagta aaagaggaag gcctcttgca 191941 gttgagatag aggaaggcca ctgtctcctg cctgcccctg ggaactgaat gtctcagtat 192001 aaaacctgat tgtacatttg ttcaattctg agatcggaga agaaccgccc tatggcggga 192061 ggcgagacat gtttgcagta atactgcctt gttattcttt actccactga gatgtttggg 192121 tggagagaaa cataaatctg gcctacgtgc acatccaggc atagtacctt cccttgaact 192181 taattatgac acagattctt ttgctcacgt tttttgctga ccttctcctt attatcaccc 192241 tgctctccta ctgcattcct ttttgctgaa ataatgaaaa taataatcaa taaaaactga 192301 gggaactcag aggccagtgc aggtccttgg tgtgctgagc gccggtcccc tgggcccact 192361 attgtttctc tatactttgt ctctgtgtct tagttctttt ctcagtctct cgtcccaccc 192421 gactagaaat acccacagat atggaggggc aggccacccc ttcaccaagg ggccaaggca 192481 ggcggatcat gaggtcagga gttcgagacc agcctggcca agagaccagc ctggccaata 192541 tggtgaaacc cccgtctcta ctaaaaatac aaaaattagc caggcatggt ggcgggtgcc 192601 tgtaatctca gctactgggg aggctgaggc aggataattg cttgaaccca ggaggtggag 192661 gttgtggcga gccgagatca tgccactgta ctccagcagc ctgggcaata agggtgagac 192721 ttcatctcac caaaaaaaaa aaaaaagcag ctccactagg aagactaaaa tactaaaatg 192781 atggtgtgct aacttgaaca gttaggttgt acaagaaagg acttgacagt tccctttctg 192841 atgtcctcca agacatgggt tccatttata tgcaccttgt tctttccctc agacctgtaa 192901 cttcagcctg gagttgagca gaaacatggc ttccttgtct tcaagtcatt cttgggcttc 192961 agagcgaaga tgctggacct ttgaaccaac aagcaggtta ctggtacctt tgccctgaga 193021 atacgctggt ggtgcttgtg gctgcagtgt ttaccccgag ataactttgc catgaagtat 193081 cttcctttta ttattttttc atcgctctag tatatcgact ttggaaacaa aagacatcac 193141 tctatttaga gcattccttt cttagtagtg gtatttccat tgacaaaaaa atagtaattc 193201 tgaattgccg aaaatgtcaa atcgtagaaa atgttgttag ccgaagattc atctgatgaa 193261 tcagattttt ccaaaataga tgattctgat gttagttctg tttagaaata actccaagta 193321 cagtttttat atattatttt cacattgaaa atcagtcaga tttacttcag cctcaaagag 193381 tgtgtttatg taaaaataaa tgagcactgg caacctccgc ctcccgggtt caagcaattc 193441 tcctgcctca gcctccccag tagctgaaat tacaggcgcc cgccaccaca cctggctaat 193501 ttttgtattt ttagtagaga cgaggtttca ctatgttggc caggctgatc ttgaactcct 193561 gacctcaggt gatctgcctg cctcagcctc ccaaagtgct gagatatata ggtgtgagcc 193621 acggcacccg gctggctttt caatggtata tgggaatgag ctgcttggtc agctggataa 193681 atgtactgtc agcctcagga ctgcctgcat tcaccctggg aagcagtcaa accccatgtg 193741 atgggtcatt gggtcatgag atgctggaac atcactgtat tgctggcttt ggagaaagat 193801 agattttttt tttttttttt gagacgccgt ttcactccgt tgcccaggct ggagtgcaat 193861 ggtgtgacct cgcctcactg caacctccgc ctcccgggtc caagtgattc tcctgcctca 193921 gcctcccgag tagctgggat tacaggcacc caccaccaag cctggctaat ttttgtattt 193981 ttagtagaga cagggtttca ccatgttggc caggctggtc ttgaactcct gacctcaagt 194041 gatccgcctg cctcatcctc ccaaagtcct ggaattacag gtgtgagcca ccgtgcctgg 194101 ccaaaaggta gatcttgtct cattttcctg ccagaaagct ccctgaggct gaacatctga 194161 aaaaagaact cagaaagacg accccagaca tttgctgtgt acagcgctac ccagaattat 194221 ttgaagaaaa tgttcaattt ccttcttaat ctcttcactg actcactggt cattcaggat 194281 catattgttt aagcagaaaa tgtttgggaa aaaaaaaaaa aaccaccacc accaccacta 194341 aaaaataaca gcactgggtg ggtgtggtgg ctcatgcttg taatcccagc actttgaaag 194401 gccaatgagg gcggatcacc tgaggtcaag agttcaagac cagctcggtc aacatggcaa 194461 aattctgtct ctactaaaaa tacaaagatt agccaggtgg tagtgcgtgc ctgttaattc 194521 cagcttttca ggtggctgag gcacaagaat cgcttgaacc aatgaggtga aggttgcagt 194581 gagccgagat tgtggccact gcactccaac ctgggctaca gagttggact gtgtctcaaa 194641 aaaaaaaaaa aaaaaaaaaa aaaagcagca agacagtgaa acaactattt acgtagtact 194701 tatattgtat tagatactat aagttatcta gagatgatta gattatttca agtgtatggg 194761 aggactgggg agttctgcac tttgggaggc tgaggcagga ggattgcttc agcccaggag 194821 ttcaagagca gccttggcaa catggcaaga tcccaactct attaaaaaaa aaatacaaat 194881 ttaaaaagta cacaggaggg gccgggcacg gtggctcacg cctataagca ctttgggagg 194941 ccgaggtggg cggatcacga ggtcaggagt tcgagaccag actggccaat atagtgaaac 195001 cccacctcta ctaaaaataa aaaaattagc tgggcgtggt ggcatatacc tgtaatccca 195061 gtgactcggg aggctgaggg aggagaatcg ctttaatctg ggagacgaag gttgcagtga 195121 gccgagattg tgccactgca ctcaagcctg ggtgacagac aagactctgt ctcaaaaaaa 195181 aaaaaaaaaa gaagtataca ggaggatgtg cgtaggttat atgctgtgcc attttatatc 195241 agggacttga gcatctgtgg attttgatac ccctgggggt cctggaacca atccccgtgg 195301 ataccaaggg aagactccac tcaactctgc cactacaaac agcatgtaag caaatgggtg 195361 tggctgggtt cccataaaac tgtatttaca aacacccaat gtctacttag gtgagctgca 195421 cattgacctc tttagctcgt tattccttct tgcatctcag attgttccat ctagtactgg 195481 gtttcttctg cttgaggcac acactttttg gtgagaaaat gttggtggca aattctttgc 195541 ttttgtttgc acttcaccat catacttgaa agattttttt ttttttaagt cggagtctcg 195601 ctcttttgcc caggctggag tgcagtggca caatctaggc tcactacaac ctctgcctcc 195661 cgggttcaag tgattctgct tcagcctccc acgcagctgg gattacaggc aagcgccacc 195721 atgcctggct aatatttgta tttttattac agacggggtt tcaccatgtt gtctaggctg 195781 gtctggaact cctgacctca ggtgatccgg ccacctcggc ctcccaaagt gctgggatca 195841 caggtgtgag ccaccacgcc tggccagcga ccatcatgcc cagccagcca tcattatatc 195901 ttcaagtttt gcttctgcca tacttgctgt cttctgtttc tgtgaaatca gttaaatgta 195961 tggaacacct tctcattctc cttgcggctt cttcctctgt ctttagtgtc ttccctgaac 196021 ttgtctttct gggttgtatt ctgaatacat tcttctgagc cagtttttgg ttcccgcatt 196081 ctctctttga tgttgcgtaa tttgccacta aaccagtcca tctactttta aacttactag 196141 ctcacatttt accttttttg agacagggtc tcactatgtc gcccaggctg gagtgcagtg 196201 gcttgatctt ggctcactgc aacctccact tcctggcctc aagcgaccct cccatctcag 196261 cttcctgagt agatgggact acaggcactc gtcaccacac ctggctagtt tttgtatttt 196321 ttgtagacgt ggggtctcac catgttgccc aggctggcct agaacccctg ggctcaagca 196381 atctgtctgc ttgatccctc ccaaagtgct gggattacag gcatgagcca ctgcacccag 196441 ctgtcccacc acgacttatt atttcattag atctggttaa gtacggttat tttatgtctg 196501 gctaacaatt ccagcgcctg aagacctcac gatctgtcgc tattgctcat tgtttctctt 196561 gcaatgttgt ttcctggtgc gcccggttat ctctgatgat gtgcttgccc tcatgcttta 196621 aaaagtacct gtaggaggaa ggagtctctc ctgagagggt ttgctctggg ttctgtcaga 196681 caggcagaac catctgcatt ggggaactgc ccatccagga ccccttaagc caagtgaaag 196741 gcttaaggtt ccatggtcaa ccctgggaat ttgaaccctg ggaacttgaa ccctgggaat 196801 ttgaaccctg ggaatttgaa ccctgggaat ttgcaccctg gctcagttgc aattctgggt 196861 ttcctctcag tatgatgggg cagcaagttg ggggcctcaa gatttgattt ctgtcctctt 196921 cgctttatag ggctgtcaag gaaaagttcc tgggtgggca aaggcaactt gggggttaat 196981 ttgtatcatg gggtcctagt tttctctgaa aatttggcct tgcaaatgtg tgtgtgtgtg 197041 tgtgtgtatg agtgtgtgtg tgtgtatata tatatattta tatatatatt tgagacaggg 197101 tcttgctgtg tggcccaggc tggagtgcaa tggtgtgatc atggcccaag tgaacctccc 197161 acctcagtct cctgagtagc taggactaca ggcatgtgcc aacatgcccg gctaattttt 197221 gtattattgt agaaacaggg tttcatcatg ttgctctgct ggtctggaac tcttgggctc 197281 aagcaaaatg cccaaagtgc tgggattaca ggagtgagac actgtgtcca gccatagagt 197341 ctcactgtcc ttacagaaag ccagaattca gtctgcttct tgttccctca aagcttcctc 197401 gaagtgttct agtggaggcg agcagtgaag ctctaggagg caagaggcct tctcagggac 197461 cccagctttg gcctgtatgt ctgggtgtcg tcctggacta gtctgatcac gtctcacagc 197521 tgagtttctc agcttctcct cctgcagtac aggaaggaca atcccttccg actgcttctc 197581 acagggagag ggcagtggtg agcctagaaa ataaaggccc gaagccagcc ccgccccatt 197641 tgtgcctctc cctcctctcc ttttctgctt cgcagagcag acaatgcaac agtcgccccc 197701 gcccagctca gcctgaagac cccttctcca ctcccatccc tttctggctc agccacccgg 197761 cagtgggcca ggttcggatg gagggaggga gaggagctgg agagagtttt aagtactctc 197821 tgggcttctc acattgctgg cttcagtggt tgctggacgc aaattgatgt gatgtcagac 197881 accggcagca gagaaggaag ctaaggcatt tgggtctgtg ctgctgtggc acctgggact 197941 cagtgtgaag ggatcaggct aggtttgaaa ttctcttccg gcttaataat ttgcaggtct 198001 gggctgggag gtaaagctct gagtacactg cgagtgtcag gggtccggca ggggggcttt 198061 ttccagctcc tacccactcc cgcgtggggt gaaaccttga gagatgagac accaatccag 198121 ggcctctgaa gacccacgag gtagagtgca cttaacagta aattgtgggc tgggtgcggt 198181 ggctcttgcc tgtaatccca gcactttgag aagcctaggc gggtggatca cggggtcaag 198241 agatagagac catcctgacc aacatggtga aaccccatct ctactaaaaa tacaaaaatt 198301 agctgtgtgt gctggcgggt gcctgtagtc ccagctactt gggaggctga ggcaggagaa 198361 tcacttgacc ccaggtggtg gaggttgcag tgaactgaga tcacgccact gcactcctgc 198421 acgccagcct ggcgaaagag caagagtctg tctcaaaaaa aaaaaaagaa aaaaaagaaa 198481 aaaagaaaaa aaaaaagagt aaattgtggc caggtgcggt ggcccatgcc tgtaatccca 198541 gcactttggg aggctgaggc gggcggatca cgggagttca agatcagcct ggccagcatg 198601 gggaaacccc atctctacta aaaataaaaa ttagctgggc atggtagcgg gcgcctgtaa 198661 taccagctac tcaggatgct gagacagaag aattgcttga acctgggagg tagaggttgc 198721 agcgagcaga gatcgtgcca ctgcactcca gcctgagtgc cagagccaca ctctgtctca 198781 aaaaacaaaa acaaaaacaa acaaaaaacc cggtaaattg cggggccgag tgcagtggct 198841 cacgcctgta atcccagcac tttgggccgc caaggctggg ggatcaccta aggttaagag 198901 ttcgagacca acctgcccaa catgatgaaa ccccgtctct actaaaaata caaaaattag 198961 ccaggcgtgg tggcgtacac ctgtagcctc agctacttga gaggctgagg caggagaatc 199021 acttgaaccc aggaggtgga ggttgcagtg agccgagatc acgccagtta gtccagactg 199081 ggcgacagag caaaactcca tctcaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaggg 199141 gtcaatggaa gcccaggggt gggggttaac gcctttggcc ccagtttttc agggggttga 199201 ggtggggggt ttttggaccc tggggggtgg aagttgaagg aattggtgat ggggtaattg 199261 aactccaccc tggaaaaaag ggtgagtccc agtttaaaaa aaaaaaaaaa aaaaaaaaaa 199321 aaagcaaatt gtggctgaat tctttggcat tgcaggcaga gaaggggtga aaacagggct 199381 ccagccttga accggtcagc aggggtcagg gaggctgagt gagtcaccct atggaatggg 199441 ccagggtgga tctgggctca caggcaaggg cctcagtgct cctgtgagct ccaaggggct 199501 ggggaggact gccttcttgt tcagggtaca gctcgtgttg ttttctcggg cttatgtcca 199561 tgtctcagtg tggacacccg caactgaggt gacagtggaa tgccctgacc tgatctctca 199621 ttctctccca ttgtggaccc agtgctgtcc tcacctggga cctcctgcct ccacccaagt 199681 aggcgcccag agctgaaggg acagttgcct gatgggttca ggtgccagtg gatggtcttc 199741 cacctcccgg gttcaagcga ttgtcctgcc tcagcctcct gagtagctgg gaccacaggt 199801 gcccgccacc acgcctggca aatttttgta ttttttttgg tagggacatg ttttcgtcat 199861 gttggtcagg ctggtcttga acttctggcc tcaagtgatc tgcccatctt ggcttcctaa 199921 agtgctgaga ttacacgctt gagccactgc gcctggcctt acttgggtgt tgttccctgc 199981 cctggaacct accctgctta gccttaccac accccatcag cccccagccc aacccgactc 200041 tcagccttgc cacagcccct tctggaggcc acaggaacca ggacaagagg actaaaacca 200101 ctgcccctgt ggccgaagcc tggcccccaa tcctgtgcac acagagccca ctgagcctgt 200161 gacctgtcct tggcccccat caagctcctt tccctatcta gggtccaggt gaacttggca 200221 tacttcctcc ctctgaaaac tcaagaggtg ccactcccca ttcagggcac aggaagagca 200281 agaattgaac accgggatgc agcatgtgtc cctgttgcca caggtctcac cagaggcctc 200341 cttcgttcta cgacatcggg ccctgctacc tgcagcgccc agggtgcgcg tcctcacctg 200401 tgctgcgggt ggaacactcc tccctggcca ctttccaggc cactggtttg ctctccgcaa 200461 aggacaaatc ccagcaccac ctggttattt tgttccctgc cggcctcacc tcctagaacc 200521 cgggccacaa gtgcagggac catagtgtct ggtttcgcca tctccctggc aggacacatg 200581 ctgcactgaa acacccaggg gctgtgtggg gagcagtgca ggggctgaac agcggccccc 200641 aaatacatgt ctcctgggag ttcctgatgt ggccttattt ggaaatagga ggtctgcaga 200701 tgtcgctgag ttaggacata gaagtcacac catggggagg ccacgagaag acaggcagag 200761 tgagtggagt gaggtgacca ccagcccaag atgcctggtt ccccaggagc tggagagggc 200821 aggaaggacc ctccccttgg gcctctggcg ggagtgctgg atcttggact ctggcctcca 200881 gaactgggag aatcagccta tgttgagagc tgcccagtgt gtggcccttc tgcctggaag 200941 gggtggctgg gcctggctgt gcatcactgt ggactgtccc ctctcccttg gcctctgaca 201001 tcagtcacaa aacccatccc ctctccttcc taaatacttc tgtggactgg cagtgttgct 201061 ggaagtcatc ttgggtgggg cagggacatg gacagtaaga gcggaaggtg accccacatc 201121 cttgcacctc aagctggtct cacaatgaca atatgcgtgg agcggaggag gactcaggag 201181 gtgtctctga caccaaacat ggtcatggct ggacatgggg tcagaaccag aagtgaggag 201241 gctcctcagt ggccacagag gaagcacccg ggtgacttgg ggagtgggac aaggaagtgg 201301 gtcctcaggg aagcaaatga caaggggaca ggaggggctg tttgcggatt taatggcagg 201361 gcattgaggt tgggaggggt cccagctgct gcgtctgctt ttcttatagc agagaggctg 201421 cctacgggtg gcacggggtg gccatggagt gcagagttgg tgggacaggg gacatccagg 201481 gggctcgaga tgttgctggt gacaaggaat gtcaagtggc actgaggctg gcgggcaagg 201541 ccacaggcag attctctcac gtgggtgctc gcccctttcc tcccccttgt ccctccctcc 201601 caccctggcc ccactcagga gtgagaccca gtggccaata agctctggga cagacgaatg 201661 ggcgccctcc tccttccttc tgttgggctg gagtgagtgg aggaggtgac tcagctcctg 201721 ggctcaggca gggtctggag gggccaggat ggtctcgagt gcttggcagc tgaggaatgt 201781 agctgggctc gggtagtagc agcagcgagg ggcgggccag ggtggccaga gcccggggcc 201841 aggaatgtgc agctgaggtc aatggtcccg gagtcctccc gactcggggt cgggcggcgg 201901 gaaggagggt ggccgtggcg ggggtggagg tgggtgccca gggctcagag cttgtggggg 201961 ttcacccact tgtaggtgcc ctcatactgg aaacccactc tcttcatcag ctcgtctgcc 202021 tccgtggggc ctcggctgga gagtgacggg tggaggagag gcatgaggta gctccaccct 202081 caccccgccc ctgcccgctg ggctctgtcc ccagccccca ccctttcctc acctgccata 202141 aatatagggg atgggcttgg gcttctccag ctcaatctgg tgcagcagtg gggtgaaaat 202201 acgccaggcc tcacggagct cgtcgctgag gggacatagt atggcttggg aggccggtgg 202261 cacacaggga gggagggcaa aggccacccc atagcccaca ggtatgcagg ggccggcagc 202321 tgggcctcac ctgcgcacga agtgcatctg gctcccgcag aagacgtcca ggatgaggcg 202381 ctcgtaggcg tcagggagct tcacgttctg tgagggagag agtgtcttgc tgatgccact 202441 gcctgccacc atgtggagtc ccccgggccc aggccgccca ccctccacac tgctccttct 202501 ctgtagggca ccttgtatct gttgccgtag gtcaggtcca gctccgactc ctcggggttg 202561 aagaacatgc ccggcttctt ggtcatcatc ttggtgtaca cggcctcgtt gggctgcacg 202621 cggatcacca gctcgttgcg cttgcactgc tggtggaaga tgtcgccggc cacatcatgg 202681 aactgcagcc tcacctcggc cttgcgctcg ttcagggcct tgccgcagcg caggatgaag 202741 ggcaccccta cgtggcggaa agggcagcct cagcaccagc tctctcaggg tgtggaccag 202801 tgcgtgagtg tctcagtggg agctccagtg cccgcacaca gggcatgccc agttctgcct 202861 tgctgggcct cgaaggcatc acctaccatc ccacctctca ttctccacat agaggacgac 202921 ggctgcaaaa gtggcggtgg tggacccgcg gggcaccgtg gggtcgtcca ggtacccttt 202981 ggtggcctcg ccctctccat cggggttccc cacgtactgg cccaggacca cattgttggc 203041 ctgcacctct gagatgcatt tcaacacctt gacctgagag aaagccaagg gagagaatgg 203101 gctccttggg tgttgagttg gggtgcaggg atgactgtgg ccacagatgt gcagccctca 203161 gggcaggagg aggcccctgc ttggctctgc tcaccctgcc agaggcccag ctcagggccc 203221 ctccctgagg accctccagg accaccctgg tccatctcga gtctattctg atgaacaagc 203281 tgaggcccag agaggcaatg gccttccctg gtcacatagt gactgtcagg cctgggacat 203341 gacaacttgg gcttcatgac tgccagtcca ggtcacctcc gggaggccac gctgtgctca 203401 gaggtggtga cttctccggg gttgaggaca cctgctctgc atgcacaccc cagctcagtg 203461 cctcgtcaca gatgggcctg cgacagggca tgctcctggg gactggggtg caccccctac 203521 cttctcatca cggacgtcat ctgagttggt ggaggcgggc ttctccatgg ccaccagaca 203581 cagcatctgc agtaggtggt tctgcatcac gtccctgggg acggaagagg ccagagctcg 203641 ccttagctcc ccgccctgtt cacctggttc aagggcatgg ggaccccaaa caaggcttcc 203701 tagtgacaag ctgcaagact cacttcctga tcccccttgt cttccaggtc cccttaaatc 203761 aaggaaggac atggtgatgc tatcactgaa tcataaaacc gtggagtgct tggctgtgta 203821 ggggtccagc cctttccagc ccggtctgat agctcagaca cttaggtttt gaactgcagg 203881 gtgaggagga gctcccccaa gatagggaag agtagccctg cagggtgact ggctctgcca 203941 ccctgtgcca gcctcccagg agagaggaag agctctcacc ggatgatccc aaattcatcg 204001 aaatagcccc cgcgaccctc agtgccaaag ggctccttga aggtgaggat aacgcaggcg 204061 atgttgtccc ggttccagat ggggccgaag atcctgttgg caaatctgca gggaggggca 204121 aggtggagga actgaccttg ggcctctgtg gtgcaggggc cacatgtgag gggtcaccct 204181 tgtctgagtt ctggaggaat tcgtcctcgg ggaggcagtg ggccaggtga ggctcctgag 204241 taccaccccc accctggtcc cccggcccag gcttggcccc acctcagcac catgaggttc 204301 tgcaccatct ccttgcccag gtagtggtcg atgcggtaga tctggtcctc acggaacagg 204361 gaggagatgt ggttggacag ccggtcagag ctctgcaggt ccctcccgaa gggcttctcc 204421 acgatgatgc ggttccagcc tctgctggga gcccggagct gcgttacccc cttgaacccc 204481 tcttcgggga gtgaggatca cagctgcatc attcagacgc cctcccaggg gagtagaggc 204541 cagaacctcc tcgcccccgt ggccacctcc cgtgccttgt gttcccagat gaccctctgg 204601 ctcaacacct tccgggaagc cttcccaccc tgggtgccag ggtggcattg ctggcatgct 204661 gccgggggcc cgttcccctc ctgcctcatg ctgggtcccc ggtgggtctg agtggcctga 204721 aggcctgtag gggagcaggg ggaggaggca tccaagccat ggcttcctca ttggttttga 204781 aaatgagaag aggacctttg ctttactacc cccgcattca aaaccagcca gaggacaaga 204841 gcctctggct gggttgggaa accacccagc gcggccatgc tgcattcgca gagcaaggct 204901 gccaccctgc ggcctggccg ggcctttggg gaagcagagc ggaaaggcgg tgtttcgtgg 204961 agcaacgctg ccaccttgtg gtcccgctgg ggatggcccc ggcaccatgg atgctgccca 205021 gatccccggc cccggacacg ctcatagagt ggtgggagca ctgcctgggc cagcctggca 205081 ggcgggaagg gagggcaacg gcaagcctta catctggctc atgcaggact cgtgaatgtt 205141 cttggtgacg gcctcgtaga cggtcggggg caaggccagg tagaagaggc ggttggcctg 205201 tgaccccagg tggagggcat ccatgtggct gttgaggcgc tggtaggagg ctgcatcatc 205261 gtactggcca gccacatagg agttgcgggc aaagaagtcc tccagcttga gcttctcctc 205321 tggggtggcc tgggagacac ggacagacag acacacagac agatgtcagc ccctctcttt 205381 gagtccgtgt gtgttctgcc ccaggggagt ggagggtctt ccctggaggt ccagggaggg 205441 tgccctcaga agtcagtgtc cccgtcccac actgggttca gccccatctt agcagctctg 205501 cacatccaga gggagggagg ccaaagcagg cagcacagac actgccccag gctgggaccc 205561 ctgtgcctta ggacagaggc cagatttcag gatattttga cctgggagaa atacactgga 205621 gaaagctctc tctccaaaat catgacaccc aactatgatt ggcggagaaa acgcagcaga 205681 gcacagcagg agggacctgt gggtcctggt cacgggggct ggtaatgggg gtctcaagaa 205741 gtacgagagc aggcggggcg gggcaggaga ggaggagagc atcccgggat gggatggggg 205801 gaggtccccg aagctggcca tgctgggggc tggtagagag ggcagaacca ggctggggga 205861 ggccctgaca ccacccacct tgaagaaggg ctcactctgt ttgcggatgt cagccactgt 205921 gaggcgggaa cgggcatagc ccatgatgaa ggtgttttcg ggcagaaggc catcccggaa 205981 cagccacctg agggcagggc acagctgtaa ccagtgcggg cagggcagga ccaggcctgt 206041 ccctggcggg aggtcacagg ggcagtggtg ggacacactt accagatggt ggggtagatc 206101 ttcttcttgg ccaggtcacc ctgtggcaga gggaacaggt gtgtggttag aagtggctgg 206161 ggacacgacc tacatacatc atcctccacc cttggtgatc tgggcactac tcaggatcac 206221 tactgggcca caagcatgtc tgtcttgcct gatacacaga cagaagagcc ccgagattca 206281 agcctgtgcc agccctccat ccctgaccca gaagacataa acgcatctcc tgcgttccag 206341 gagaaaacct gaaaattcaa ttcagtagaa ccaagggacg acagaagtat catgtagcca 206401 catttgtgag acgtgacctc aagtgcttac attagcaaaa aagagtggtt caaaaataag 206461 aagctaagca ttccacttaa gaagtgagaa aaagggccag atgcggtggc tcacacctgt 206521 aatcccagca ctttgggagg ctgaggcagg cagatcacct gaagtcagga gttcaagacc 206581 agccaatgtg gcaaaacccc atctctatta aaaatacaaa aattagccag gtgtggtggc 206641 aggcacctgt aatcccagtt actcaggagg ctgaggcagg agaatcactt gaatccggga 206701 ggcagaggtt gcagtgagcc gagatcgtgc cactgcactc cagcctgggt gacaagagtg 206761 agacttcatc tccaaaaaaa aagtgagaaa aagatagaac tgcaagccaa attaccagga 206821 agaaaatagg aaataatgac aagtaaaatt aataaaatta aaagctaacc caaagaaaag 206881 atgaacaaag ctggaaattg gtgtttgaaa aagacgaata gaaacagaca accccctggc 206941 atggttgctt ttttaaaagg gcataggccg ggcacagtgg cttactcctg taatcccagc 207001 actttgagag gccgaggcag gcggatcaag aggtcaggag atcgagacca tcctggctga 207061 cacggtgaaa ccccgcctct accaaaaata caaaaaatta gccaggcgtg gtggcgggtg 207121 tctgtagtcc cagctactcg gaaggctgag gtaggagaat cacttgaacc cgggagatgg 207181 aggttgcagt gagccaagat tgcaccattg caccccagcc tgggcgacaa gagtgaaact 207241 ccgtctcaaa aaacaaaaaa caaaaaaaca actaaacagc tggacgcggt ggctcaagcc 207301 tataatccca gcactttgga aggccgaggc ggatggatta cctgagttcg ggagttcaag 207361 accagcctga ccaacatgga gaaaccgttt ctactaaaaa tacaaaattg gctgggcatg 207421 gtggcccatg cctgtaatcc cagctactca ggaggctgag gcaggagaat cgcttgaacc 207481 tgggaggcgg aggttgcagt gagccaagat caagccactg cactccagcc tgggctacaa 207541 aagcgaaact ccgtctcaaa caaacaaaaa acagagcaat ggacctgaga ggggacagtg 207601 gccacaaatc ttcctacaga aacaaccccc tggcatagag agtgttgcca gtgggttcta 207661 ccaaaatgca agcacaagag aattccagcc tgaagcaaac tcttcctgca gaaggaagaa 207721 aggggaacac ttcccaagcc accttatggg gcctacagag ccttggtacg gaaacctgat 207781 gagaaaggca catgggaaaa cccatacgcc tcattcacac acacatgcag atgcagaaat 207841 cccacacgaa atatcagcag gccgggcaca actcacacac cccagcactt tgggagacca 207901 aggtgggagg aagattgctt gaaatcagga ggtttttttt tttttttttt tttaaagaca 207961 tggtctcact ctgtcaccta ggctggagtg cagtggcacg atctcagttc actgcaacct 208021 ctgcctctgg gctcaagcaa tcctcccacc tcagcctctg gagtagctgg gactacaggt 208081 acatgccaac acacccagct aattttttgt attttttgta gagacagggt ttccccatgt 208141 tgtccaggct gctcttgaac tcctgggctc aagtgatctg cccgcctcag cctcccaaag 208201 tgttgggatt acaggcgtga gccacgacga cccactgacc ctgtctcttt aaaaaaaaag 208261 aagaaaacat cagcaaacca aatcctgcaa tgttaaaaac ggtaggtttt ttatttacag 208321 aataaataca tctgtaaata aaaagttggt ttgatattaa cagaaaaaaa tcattgaaat 208381 tcaccatatt aagagattaa aaaaaacctc taagaactct ctgcctactg ctatgttttg 208441 attttttaaa tctagcaatc ttgctcaacc caacagatgt agatcaagtg tttcagaaat 208501 cacctattca cgctaaaact cttagcagag ggggaatgca agggaacttc actaacttga 208561 taaagggcat ctataaaaca ccattctggc tgggcgtggt ggctcaagcc tgtaatccca 208621 gcactttggg aggccgaggc ggggaatcgc ttgaggtcag gagtttgaga ccagcctgac 208681 caacatggtg aaaccccatt tctactaaaa atacaaaact tagctgaaca tggtggtgta 208741 tgcctgtaat cccagctatt caggagactg aggcaggaga atcgcttgaa ccaggaggga 208801 gagactggag tgagccgaga tcacaccact gtagcctgag caacagagct agacgtcgtc 208861 tcaaaaaaaa aaaccaccac caaaaaacaa agaacaaaaa aacccccaaa ccatcatcct 208921 tacaaatact acaaatatca cacttatagg aaatgctgaa atattaatca tccctttaag 208981 caatcaggaa caagaaaagt gcagtaaggc aagaaaatga aaatggctga agttgagctg 209041 tgaggtggtg cgggtgctcc ctctgatcaa gtgaaacaca cacctatcct gtgacctggc 209101 cgttccaccc caagagaaat gaaagtgcaa gtccacacaa agacctgcac acgaatggtc 209161 acagcagcct tggttcatag cggccccaaa ctggaaacaa gtcaaacact tgaattaaca 209221 tgtgacccgg caaactgtgc attcatacaa cggatactac tcagcaacac tgggaaatct 209281 cagcatcaga gatgcaaccc attgtgacag aaatcagatc agcggttgcc tggagcaggg 209341 cagattggct gggaatgggt acccaggaac tttctgggga gatgacgatg gcctttattg 209401 tgatgagggt gtacagccaa ttgtacagtt aagatttgcg caccaacaaa agacggcgag 209461 aaaaccaaca atggagcgga gggtggggag tgggtcagaa atgacccttt gagggccagt 209521 gactgtgaga gccccaggtg agggggcttc ctgggtgcag agcatattct gtttcttgat 209581 ctaggagcag tttctgtggc tgtgctgagt ttgtgaaaag tcatcaacaa gctagacctg 209641 caccatctgt gttcttcatg acatgtattt cttttttttt ttgagacgga gtctcactct 209701 gtcggccagg ctggagtgca gtggcgtgat cttggctcac tgaaatctcc acctcctggg 209761 ttcaagccat tctcctgcct cagcctcctg agtagctggg actccaggca cctgccacca 209821 cgcccggcta atttttgtat ttttactaga gatggggttt cgccatgttg gccaggctgg 209881 cctcaaactc ctgaccttgt gatccacctg ccgtggcctc ccaaagtgct ggcattacag 209941 gcgtgagcca ccgtggccgg cccatgacat atatttcaat aaaacatggt aaaaactaat 210001 agtctgggcg tggtggctca tgcctgtaac cccagcactt tgggaggctg aggagggtgg 210061 atcacctgag gtcaggagtt cgagaccagc ctggcaaaca tggtgaaacc ccgtctctac 210121 taaaaataca ggccaggcgc ggtggctcat gcctgtaatc ctagcacttt gggaggttga 210181 ggtgggtgga tcacgaggtc aggagattga gaccatcctg gccaacatgg tgagaccccg 210241 tttctactaa aatacaaaaa attagctgga tgtggtggtg cgcgcctgta gtcccagcta 210301 ctcgggaggc tgaggcaggg gaattgcttg aacctcgcag gtggagattg cagtgagcca 210361 agattgcgcc actgtactcc agcctggtga aagagcaaga ctccatctca aaaaaaagaa 210421 aaaacaaaaa ttagctggat gtggtggcag gcacctggaa tcccagctac tcaggaggct 210481 aaggcaggag aatcacttga atccaggaga cggaggttgc agtgagccaa gattgcgcca 210541 ccaaactcca gcctgggtga cagcaagact ccatgtcacg aaaaaaaaaa aaaaaaaaca 210601 catgctcttc gaagcaatca ggcactgcaa ccatggatgg gtccatttct acaaaccatt 210661 acagaatggg ggccatgcca ccaggtcaaa cgccctcacc cagagccagg cagttaggaa 210721 gccaccacag ggcttccagg agggcaactg ctatggtgtg ggtgccttgt gtgcttgcgc 210781 tcttcagggg ccctgctcca cttctgatgc tcctcaggtg agctcggtgc caggactaca 210841 gcgacacgga cctctcactg ctgcaccctt caaagtccag cagacaaccc ctcgccctgg 210901 ggtaggagct cctctactca gctcctcaca gcacctgcca tcagtggggc tggggagggg 210961 aacgttgccc atgtcacatc aactcacggc tggccccaca cagaaagggt cagcttactg 211021 ggaactaact ggaccctgaa ataaacggaa agaaccacag cctgcaaata cctctaagcc 211081 tcactcaaag caggtcatgt ttgctcatga ccaaattccg gaagactggg aggcatcggg 211141 acagactcca gaaaggtact ggtggcaggg gactggtctg ctgagtcaca gcaaccccac 211201 gctccaggcc gtttgttcct gctgctccct ttgggattaa ctagagattg aaccggctca 211261 ccttgtttca gctcaggccc tgcaaccgac cgaccgtctg ctgggctcca agtcacctga 211321 gaggcgaggt gaggtaagac ggacagtagg ggtggggtaa ggcaggccgc tctctcccat 211381 ggtgacaggg agctaagctg agctgagccg gactgagctg ctctcctcag gactcagggg 211441 tagccctgcc ctcccagctg gctgcgatgc tctagtggga ctttctggag gaacaaacat 211501 gggaagacat cgggaaacct gacagcttgg tcaagagtct actcgtgcca tgtgggtggg 211561 ggcaattcaa ggggtcctta gtgaaaagca gttgggagct tctcatcttc cccacgccct 211621 cctctttcct gaggaccaca gcaggggagg gatctgccca aggacacaag gtgacttagt 211681 agaagcaggg catgatgaag atctctagcc accaacatct ggctaggcca gggcaaaggg 211741 acctgggcca tgacctgtct gttttataaa ccgaaatcgc tgcagccctt gtcttggcat 211801 gagggtttca tggtagagca cgatgcctgc tggagggcca gcgggggtgg gccatcccat 211861 ccccaagggt cactggggtt aaggacctgc ccagactccc tgggtggagc cctgggcctt 211921 ggcaggggcc cagctggttc ctggccactg gggccagaga acctgggggc tgctgagtta 211981 gttcagctga aaaaccaaca aaagggcctc cctgctatac ccgggggctc atgagtcacc 212041 ggggctttgg gggagtgcca acatcatcat gacacagcaa gaatcttatc agaccaatgg 212101 ggaagtcagc ccagaaatgt tctgaggaaa ggggaggcgg gggccgctgg gtcatccctc 212161 ccacaggccc atcattggga tgcgtcctga acgcccatca agcccatggc ccttgtgatc 212221 caggtgggga aactaaggcc cagagaagtg aggaccccgc agactatcaa tcccagtctc 212281 ttcccctcac tccctgtgaa gctctccagc atcatcgagg tcccatcagg tggggaaaga 212341 tgctgttcca ggcgcacact agtctacaag gccagagctt tctggaaggg ggcagtaagt 212401 acctcggctc cctttctggt aggggtggga gtcctgagaa ggcaggaagt ggcccacttg 212461 gtaactctga ggtgccatca gggcccccag gaaggaagct gggtgtgtgg gcaagtgtga 212521 ggtaagctgg ccagggagga ggaagggaca gaggaaggcc acgtgggtcc agcctgcccc 212581 agggtgtcct gcttgcccag gctgtgggtc tgccagccac ttgcctgctt tcagtttcta 212641 ggtcatgctg agcttgttcc caactcgggg ctccgggcat ttgctcttcc ttctgtgttt 212701 gtgctctccc ggccctcttt gtgggatcta tgcgctttgg gggactgggg acacagggcc 212761 catgtgtatc ttctgaaaca cacgtcagcc tgaactcttg ctggtctgct tacttgccgt 212821 ggttccctgg ccgcaatagc acataggtcc catgaggagg gcagggactg gctgctgtgg 212881 cccagaagta cccaatcgct gtctgctcaa tgaacggaga atgggctgct ttcctggaac 212941 agcagattct aggatcacgt gccctcaagt gccaccctgc ctacctcccg caccgagtga 213001 ggcatcaggc gtggaagaag cctgggagcc ggagctgttc caggtgctgc ctgagcacgc 213061 cacctcccat ctcccccaca gcaggaagag gaaaaaacaa accacgaggc tcttcagaga 213121 gaggacccct tgtcccctac ccacagtgct ggagctggca cttcctattt ctgctttgaa 213181 agcctcaggt tgtcactctc agaacagagg agagcaaagg ggaaccctac tgatttcaac 213241 aaaacaaagt tgcccaaccc gaagctggcc acaggcggag gcagtgaaat gacaaaatca 213301 gcttggaaaa acctaaggac cctgggccct cgtttcaaaa gctgtcattt gctacacgaa 213361 atgctgaggt tcaggaaaag gaagacactt gctccaactc acacagcaag cttggatgct 213421 ctcaatgagg tgtctaataa gagctgaagg ccaggagtca taagctatca ttgtggcctg 213481 ggttgctctg ttgcagtgcc ttgtgacagc atggggtggg gatggagaaa gcaacctcag 213541 ttaccttctc ctccagtctg cttcttggac taagggttta acggtcagag tcctggctgt 213601 taaggtttgt ggctgatgca ggtatggctc tttttatttt tatttttttg agatggagtc 213661 tcactcactc cgtcacccag gatggagtgc aatggtgcca tctcagctca ctgcaacctc 213721 cacctcctgg gttcaagcga ttttcctgcc tcagcctccc aagtagctgg gattacagat 213781 gggtgccact acacctaact aatttttgtt attttttttt tttttttgag atggagtctc 213841 actctgtcac ccaggctgga atgcagtgtt gcgatttctg ctcactgcaa gctccgcctc 213901 ccgagttcac gcctttctcc tgcctcagcc tcctgagtag ctggactaca ggtgcccaca 213961 ccaggcctgg ctaatttttt atttagtaga gacgggattt caccatgtta gccaggatgg 214021 tctcaatctc ctgacctcat gatccacccg cttcagcctc ccaaagtgct gggattacag 214081 acgtgagcca ctgcgcccgg cctttttgtt ttgttttgtt ttgagacaga gtctcgctct 214141 gtcgccaggc tggagtgcag tggcgcgatc tcagctcact gcaacctctg actccctgat 214201 tcaagcgatt ctactgcctc agcctactga gtagctggga ctacaggcac gcaccaccac 214261 gcccagctaa tttttgtatt tttagtagag atggggtttc accatgttgg ccaggatggt 214321 ctcgatctct tgaccttgtg atctgcccac cttggccccc caaagtgctg ggattacaga 214381 tgtgagccac tgcgcccagc caatttttgt attttaagta gagactgggt ttcgccatgt 214441 tggccaggct ggtgtcaaac tcctgacctc aggcgatcca cctgcctcgg cctcccaaag 214501 tgctgggaat acaggcgtga gccaccgtgc ccggccaggt atggctcttc tgaggggacc 214561 aggctggggc tggggctgag gccaagccca atctactgtg ggctccacct ggtacctctc 214621 ctgggtctca ggcttatggg gagtcagagg acaatggccc ctccttactc tgccactggc 214681 agagcccttc tccctcggct gcctcctcat tcccttttgg ctctcctttt ctaagttctg 214741 atcagaagta caaaggtgtc aaggagtagg tttgacaaag tgtgacagcg cgttgttcta 214801 tgtgaacaaa gaaccactga gctcagccag cactgagggg cgcacgatgt ggaagaacta 214861 actagttttg atagagctcc tgctcaaggt tacaaggtaa gttaatggca aagatggtca 214921 tacagtaatg agcagagaga ttcttgagtg catccagggt taaaagtgaa aactgagacc 214981 acgtgtaatt tgagatgaag cccttgttcc accagcccct gcacagtctc actgccccat 215041 gggacacagg ggagggtgct ctagtctcgg gcgatggctg tcctaggacc acccctccct 215101 cccctcccat ggaaatcctc atgctatact attcttttgt ctttcctatg agtgcagaat 215161 ggcggttcta caagctggac aattggggcc aggtggggtg aggcagagct ccttggtagg 215221 ctttcgaaat tgaggcaaag agacagtgtt caaagaaaag ctaagtgttt gtatcgacgg 215281 aacttgaagt gttagtgaag aggcagagat caggcctcag atcctgcctt tggaactcat 215341 ttgttaaggc atgaacaggt ctgaaaaaac tacagaatga atagcattcc ctgttttccc 215401 caagaagtcc atctagacag tccctaaaga gcctgcaact ccaggattaa gggctacatt 215461 cagcggctag gcacagggtt cagaaacgtc ctgcagccgg gcgtggtggc tcatgcctgt 215521 catcccagca ctttgggagg ccgaggcggg cggatcactt gaggtcagga gtttgagacc 215581 agcctggtca acatggtgaa actccgtctc tactaacaat acaaaaatta gccgggtgtg 215641 gtggtgcatg cctgtaatct cagctacttg gggggctgag gcaggagaat cgcttgaacc 215701 caggaggccg aggttgcagt gagccaagat ggcgccactg caccccatcc tgggcgacca 215761 gagcaaaact ccgtctccaa aaaaaaaaaa aaaaaaaagt cttgcaagtg catatgcaca 215821 ccaggtagag ccgggatgat cctggcgcac tagcaggagc gggaggagga gctcaactta 215881 gcagagcctg tggggccctg caacaattag ttggaaaagc tgaggcatgg agcaggcact 215941 tcctggcttt taagattggg gcctgggaga tactcaccga tgcacccatg atgatgaata 216001 tgtgtgtatc cgactgatgg aaggcatcgc cctggaaaag ctcttcccgc aggatcccgc 216061 acacctgggt ccggctcagg gccacctgct ctgccatgac gctgtctggt ggaagaaagg 216121 ctcgttaaca aggcagaaga acaggagagc attgagaagt tagccccttt cttgagagtt 216181 cctctggggt ctcacaccag ggtgacacct gattgcccaa atcactcctt gtgaacggct 216241 gggcattggg gagtggttga tgggttagaa actgctggct ctgggctcca gccactctct 216301 cctaggcagc cttcattcag ggtgtacatt acaaaattac tcaaagcatt ggtgtattcc 216361 gactacagca tcaattctgg acaaccgagt aaaatccttg tggggcatgg aactgcgtgc 216421 ccaggaccat ctcattcccg actgtagcgg gaactccaca atgacctgga gcatgggaga 216481 tggtgcagat gctggagttc agctccagcc ctccccttgc caagctgggt gaccccggtc 216541 ggcaagtccc cttcgctctc ggggtctcag gggctcaaga gaggaggtgc ggggtataaa 216601 gggattggtt aagaccctct cgattctgct cggttctcaa gcacaacaaa cagcgtgtat 216661 tttaccgccg cgcggcgcag cgcgggacag tacgctcctc cgcctgcgcg gcgcccgccc 216721 ggccggttac ctgcgcttcg tcgtcgtcgc cctccgcgct cgcagccccg aagtgtacga 216781 ccgtttccgg gggctgagcc ccgccggccc atttaatcgg cgggggcggg ggcgggcgcc 216841 tgggctgagc ggacccgcct caggcgaggc gtgcggggcg gggcctcggc caccacccct 216901 cgtgcgggcg gggcggggcg agggcaggtg cgcgcgcatc ccaggccagc ccctgcctct 216961 cgggcacctg cgctggaggc cggcccgccg gctgcctgcc atacccgctg ccgctgctct 217021 gcatccccaa ttccggcggg cacgggtgca gctccgcgta gtgctcccgc atccccatcg 217081 ccggcccggc cccgccccta ctgtccggtt tccccgcctg ccccggcggc ctcgcgcgct 217141 cgcggagggc tccacttccg ccggcagcgt ggccacgcct cctctgggct gtccctcggc 217201 tcctgggcgg ggccttctgc cgcgccactc ccccgggaac cctcgcctgt tcgcgggtca 217261 agccggccct attgggcagt ctcttcttga ttcctccccc ggagagggcg gggcggccga 217321 gcgccccgag gctagacgcc gccgtccgag agacgagggg gcgtgtaggc ggtgacgccc 217381 tgcaaagtgg ccggcgtgct tatcattacc gagcttccgc gggcctgcag agcctggcgg 217441 actcagactt ctctccggag cgggatgcgg ccctaccgcg gcctcacact tctcgccggc 217501 ttcccgagtt ctcgggggcg gggcttgtgt ttttacttcc ggatcccaca gctatgacac 217561 cggaagccgg aagcgtggta gggaagggcg accgcgaaac tgggactttc tcggagcgcc 217621 ggggccctac cagcgttcac agtccgccgc tcccaccctt ctcacgtctg acggactctg 217681 ctgacaggtg tggtcctttt ccccaaagac agggttccat ccgtgggcgt tccgccgcct 217741 ccgaaacttc ccccggacgt tcaggctccc ccctcttttt tgggccccag cccgttcctg 217801 ctccgcgctt ctggagcact ggccaaggcg ggccgattca ggacccaggt tacttgggcg 217861 gcgagctgga ctgtttctac tcctccctcc tcctccactg cggggtctga ccctactcct 217921 tgtgtgagga ctcctctagt tcagagacat attctgttca ccaaacttga ctgcgctcta 217981 tcgaggtcgt taaattcttc ggaaatgcct cacatatagt ttggcagcta ggtatctgat 218041 ttcatatgcc tgtttgctcg ttttgcaaga caacatctgc ctatcgtcat actgtttctg 218101 tgatctgaga atgaatgggc tctcctggca cattagagga atagcacgga ggtcactata 218161 gctggagcag agtgagggag gggagcacag ttggagagga agggatgggg aacccgatgg 218221 tacagggcct tgtaggccgt tgtagggaga ttggctttta cgcagaaggc aacggggatc 218281 catggtaggg ttctcagtag aggaggaatg tgatcggaat tacgtcttac actgatcgtt 218341 ctagcagtgg tggggagacc agaccatagt gggcaagggt aaaagaaggg tggcgtggta 218401 agaggtaatt gcagtaatcc agctcagaga cagtgctttg agcattttat tattggaaat 218461 ttcaaaaatg tacgaaagta gaaaaatgag tatagtaaat ctttttggtt tgtttgtttt 218521 tgagagatgg agtctcgctc tgtcgcccag gctggagtgc agtggcatga tctcagctca 218581 ctctaacctc ctcctcctgg gttcaagcga ttctcctgcc tcagcttccc aggtagctgg 218641 gactgcaggc gtgcgccact acgcccagct aatttttgta tttttagtag agacggggtt 218701 tcactatatg ttggccagtc tggtcttgaa ctcctgacct caggtgatct gcctgcctca 218761 gactcccaaa gtgctgggat tacaggtgtg agccactgcg ctcggccagt atagtaaatc 218821 ttacattcac agctaccaac ttcaacaatt gaaggtatca acttctgact aattttgttt 218881 atatccctcg caatcctctc ccattattat tttatttctt tttctttctt tctttctttt 218941 tttttttttt ttgagacaga atctcgctct gtcaccaggc tggagtgcag tggtgtgatc 219001 tccacccact gcaacttccg cctcctggat tcaagcgatt cttctgcctc agccgctggg 219061 actacagttg tgcaccacca cgcccagcta atttttgtat ttttagtaga gatggggttt 219121 catcatgttg gccaggatgg tcttgatctc ttgaccttgt catccgcccg cctcggcctc 219181 ccaaagtgct gggattacag gcatgagcca ccgcgcccgg cctctttttt tgagacccag 219241 tctcactctg tcgcccagaa tggagtgcag tggcacgatc ttggctcacc gcaacctcca 219301 tctcccaggt ttgagcgatt ctcctgcctc agcttcccaa atagctggga ttacaggcat 219361 gcaccatcac tcccagctat tttttttttt tttgtatttt tggtaaatac agggttttgt 219421 catgttggcc aggctggtct tgaattc // LOCUS HUMFMLP 1866 bp mRNA PRI 23-JAN-1991 DEFINITION Human N-formylpeptide receptor (fMLP-R98) mRNA, complete cds. ACCESSION M60626 M33537 NID g182662 KEYWORDS N-formyl peptide receptor; N-formylpeptide receptor fMLP-R98. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1866) AUTHORS Boulay,F., Tardif,M., Brouchon,L. and Vignais,P. TITLE The human N-formylpeptide receptor. Characterization of two cDNA isolates and evidence for a new subfamily of G-Protein-Coupled receptors JOURNAL Biochemistry 29, 11123-11133 (1990) MEDLINE 91105045 FEATURES Location/Qualifiers source 1..1866 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..1866 CDS 46..1098 /codon_start=1 /product="N-formylpeptide receptor fMLP-R98" /db_xref="PID:g182663" /translation="METNSSLPTNISGGTPAVSAGYLFLDIITYLVFAVTFVLGVLGN GLVIWVAGFRMTHTVTTISYLNLAVADFCFTSTLPFFMVRKAMGGHWPFGWFLCKFLF TIVDINLFGSVFLIALIALDRCVCVLHPVWTQNHRTVSLAKKVIIGPWVMALLLTLPV IIRVTTVPGKTGTVACTFNFSPWTNDPKERINVAVAMLTVRGIIRFIIGFSAPMSIVA VSYGLIATKIHKQGLIKSSRPLRVLSFVAAAFFLCWSPYQVVALIATVRIRELLQGMY KEIGIAVDVTSALAFFNSCLNPMLYVFMGQDFRERLIHALPASLERALTEDSTQTSDT ATNSTLPSAEVALQAK" BASE COUNT 457 a 471 c 469 g 469 t ORIGIN 1 cccagagcaa gaccacagct ggtgaacagt ccaggagcag acaagatgga gacaaattcc 61 tctctcccca cgaacatctc tggagggaca cctgctgtat ctgctggcta tctcttcctg 121 gatatcatca cttatctggt atttgcagtc acctttgtcc tcggggtcct gggcaacggg 181 cttgtgatct gggtggctgg attccggatg acacacacag tcaccaccat cagttacctg 241 aacctggccg tggctgactt ctgtttcacc tccactttgc cattcttcat ggtcaggaag 301 gccatgggag gacattggcc tttcggctgg ttcctgtgca aattcctctt taccatagtg 361 gacatcaact tgttcggaag tgtcttcctg atcgccctca ttgctctgga ccgctgtgtt 421 tgcgtcctgc atccagtctg gacccagaac caccgcaccg tgagcctggc caagaaggtg 481 atcattgggc cctgggtgat ggctctgctc ctcacattgc cagttatcat tcgtgtgact 541 acagtacctg gtaaaacggg gacagtagcc tgcactttta acttttcgcc ctggaccaac 601 gaccctaaag agaggataaa tgtggccgtt gccatgttga cggtgagagg catcatccgg 661 ttcatcattg gcttcagcgc acccatgtcc atcgttgctg tcagttatgg gcttattgcc 721 accaagatcc acaagcaagg cttgattaag tccagtcgtc ccttacgggt cctctccttt 781 gtcgcagcag ccttttttct ctgctggtcc ccatatcagg tggtggccct tatagccaca 841 gtcagaatcc gtgagttatt gcaaggcatg tacaaagaaa ttggtattgc agtggatgtg 901 acaagtgccc tggccttctt caacagctgc ctcaacccca tgctctatgt cttcatgggc 961 caggacttcc gggagaggct gatccacgcc cttcccgcca gtctggagag ggccctgacc 1021 gaggactcaa cccaaaccag tgacacagct accaattcta ctttaccttc tgcagaggtg 1081 gcgttacagg caaagtgagg agggagctgg gggacacttt cgagctccca gctccagctt 1141 cgtctcacct tgagttaggc tgagcacagg catttcctgc ttattttagg attacccact 1201 catcagaaaa aaaaaaaaag cctttgtgtc ccctgatttg gggagaataa acagatatga 1261 gtttattatt gacttctttt ttgattttgg acctcagcct cgggtggtca gggtgggaaa 1321 tgataggaag aagctgtcat ctgcatccta gtttgcctga aatgaaccca aataataccc 1381 attattatta gtcctgaatt atgagtagtg aatgataccc atcattctgg catcatgatg 1441 agtagtgtcc acttccattc tgaaaagtgc cctgctgtga aaaataaatt atatagtcat 1501 cctaggtaaa tgaaggagga gggagaagtg tgaaagagta tggcttaaat cagacaagat 1561 atacaagaag atactttata tagggcagga gcggtggctc atgcctgtaa tcccagcact 1621 ttgggaggcc gaggcaggcg gatcaccaga ggtcaggaat tcgagaacag cctggccaac 1681 atggtgaaac cctgtctcta ctaaaaatac aaaaattagc tgggcgtagt ggcaggctcc 1741 cgtaatccca gctactcagg agaccgaggc aggagaatcg cttggacctg gaaggcggag 1801 gttgtagtga gccaagaaaa cgccactaca ctccagcctg ggtgacagag agagactccg 1861 gctcag // LOCUS HUMFMLPX 1058 bp mRNA PRI 31-DEC-1994 DEFINITION Human FMLP-related receptor II (FMLP R II) mRNA, complete cds. ACCESSION M76672 NID g182666 KEYWORDS FMLP-related receptor II; GTP-binding protein; plasma membrane protein; protein coupled. SOURCE Homo sapiens (tissue library: human genomic) lung cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1058) AUTHORS Bao,L., Gerard,N.P., Eddy,R.L. Jr., Shows,T.B. and Gerard,C. TITLE Mapping of genes for the human C5a receptor (C5AR), human FMLP receptor (FPR), and two FMLP receptor homologue orphan receptors (FPRH1, FPRH2) to chromosome 19 JOURNAL Genomics 13 (2), 437-440 (1992) MEDLINE 92307681 FEATURES Location/Qualifiers source 1..1058 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="lung" /tissue_lib="human genomic" /map="chromosome 19" gene 1..1056 /gene="FMLP R II" CDS 1..1056 /gene="FMLP R II" /codon_start=1 /product="FMLP-related receptor II" /db_xref="PID:g182667" /translation="METNFSTPLNEYEEVSYESAGYTVLRILPLVVLGVTFVLGVLGN GLVIWVAGFRMTRTVTTICYLNLALADFSFTATLPFLIVSMAMGEKWPFGWFLCKLIH IVVDINLFGSVFLIGFIALDRCICVLHPVWAQNHRTVSLAMKVIVGPWILALVLTLPV FLFLTTVTIPNGDTYCTFNFASWGGTPEERLKVAITMLTARGIIRFVIGFSLPMSIVA ICYGLIAAKIHKKGMIKSSRPLRVLTAVVASFFICWFPFQLVALLGTVWLKEMLFYGK YKIIDILVNPTSSLAFFNSCLNPMLYVFVGQDFRERLIHSLPTSLERALSEDSAPTND TAANCASPPAETELQAM" BASE COUNT 210 a 288 c 255 g 305 t ORIGIN chromosome 19. 1 atggaaacca acttctccac tcctctgaat gaatatgaag aagtgtccta tgagtctgct 61 ggctacactg ttctgcggat cctcccattg gtggtgcttg gggtcacctt tgtcctcggg 121 gtcctgggca atgggcttgt gatctgggtg gctggattcc ggatgacacg cacagtcacc 181 accatctgtt acctgaacct ggccctggct gacttttctt tcacggccac attaccattc 241 ctcattgtct ccatggccat gggagaaaaa tggccttttg gctggttcct gtgtaagtta 301 attcacatcg tggtggacat caacctcttt ggaagtgtct tcttgattgg tttcattgca 361 ctggaccgct gcatttgtgt cctgcatcca gtctgggccc agaaccaccg cactgtgagt 421 ctggccatga aggtgatcgt cggaccttgg attcttgctc tagtccttac cttgccagtt 481 ttcctctttt tgactacagt aactattcca aatggggaca catactgtac tttcaacttt 541 gcatcctggg gtggcacccc tgaggagagg ctgaaggtgg ccattaccat gctgacagcc 601 agagggatta tccggtttgt cattggcttt agcttgccga tgtccattgt tgccatctgc 661 tatgggctca ttgcagccaa gatccacaaa aagggcatga ttaaatccag ccgtccctta 721 cgggtcctca ctgctgtggt ggcttctttc ttcatctgtt ggtttccctt tcaactggtt 781 gcccttctgg gcaccgtctg gctcaaagag atgttgttct atggcaagta caaaatcatt 841 gacatcctgg ttaacccaac gagctccctg gccttcttca acagctgcct caaccccatg 901 ctttacgtct ttgtgggcca agacttccga gagagactga tccactccct gcccaccagt 961 ctggagaggg ccctgtctga ggactcagcc ccaactaatg acacggctgc caattgtgct 1021 tcacctcctg cagagactga gttacaggca atgtgagg // LOCUS HUMFMO1 2134 bp mRNA PRI 08-NOV-1994 DEFINITION Human flavin-containing monooxygenase (FMO1) mRNA, complete cds. ACCESSION M64082 NID g182670 KEYWORDS flavin-containing monooxygenase. SOURCE Homo sapiens adult and fetal liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2134) AUTHORS Dolphin,C., Shephard,E.A., Povey,S., Palmer,C.N., Ziegler,D.M., Ayesh,R., Smith,R.L. and Phillips,I.R. TITLE Cloning, primary sequence, and chromosomal mapping of a human flavin-containing monooxygenase (FMO1) JOURNAL J. Biol. Chem. 266 (19), 12379-12385 (1991) MEDLINE 91286259 FEATURES Location/Qualifiers source 1..2134 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult and fetal" /tissue_type="liver" /map="1q23-q25" gene 99..1697 /gene="FMO1" CDS 99..1697 /gene="FMO1" /EC_number="1.14.13.8" /codon_start=1 /db_xref="GDB:G00-126-689" /product="flavin-containing monooxygenase" /db_xref="PID:g182671" /translation="MAKRVAIVGAGVSGLASIKCCLEEGLEPTCFERSDDLGGLWRFT EHVEEGRASLYKSVVSNSCKEMSCYSDFPFPEDYPNYVPNSQFLEYLKMYANHFDLLK HIQFKTKVCSVTKCSDSAVSGQWEVVTMHEEKQESAIFDAVMVCTGFLTNPYLPLDSF PGINAFKGQYFHSRQYKHPDIFKDKRVLVIGMGNSGTDIAVEASHLAEKVFLSTTGGG WVISRIFDSGYPWDMVFMTRFQNMLRNSLPTPIVTWLMERKINNWLNHANYGLIPEDR TQLKEFVLNDELPGRIITGKVFIRPSIKEVKENSVIFNNTSKEEPIDIIVFATGYTFA FPFLDESVVKVEDGQASLYKYIFPAHLQKPTLAIIGLIKPLGSMIPTGETQARWAVRV LKGVNKLPPPSVMIEEINARKENKPSWFGLCYCKALQSDYITYIDELLTYINAKPNLF SMLLTDPHLALTVFFGPCSPYQFRLTGPGKWEGARNAIMTQWDRTFKVIKARVVQESP SPFESFLKVFSFLALLVAIFLIFL" BASE COUNT 613 a 496 c 452 g 573 t ORIGIN 1 acccaagcca gcactggctc atactgattc attttgatct ctgctaatac cagagtcctg 61 cgtggcagag ccattggcac cagaaattac aagagaacat ggccaagcga gttgccattg 121 tgggagctgg ggtcagcggc ctggcctcca tcaagtgctg tctggaagaa ggactggagc 181 ccacctgctt tgagaggagc gatgaccttg gggggctgtg gagattcacc gaacatgttg 241 aagaaggcag agccagtctc tacaagtctg tggtttccaa cagctgcaag gagatgtctt 301 gttactcaga ctttccattc ccagaagatt atccaaacta tgtgccaaat tctcaattcc 361 tggaatatct caaaatgtat gcaaaccact ttgaccttct gaaacacatt caattcaaga 421 ccaaagtctg cagtgtaaca aaatgctcag attctgctgt ctctggccaa tgggaggtgg 481 tcactatgca tgaagagaag caagagtcag ccatctttga tgctgtcatg gtctgcactg 541 gctttcttac taatccttat ttgccactgg attcctttcc aggtattaat gcctttaaag 601 gccagtactt tcatagccgg caatataagc atccagatat atttaaggac aagagagtcc 661 ttgtgattgg aatgggaaat tctggcacag acattgctgt ggaggccagc cacctggcgg 721 aaaaggtgtt cctcagcacc accggagggg gatgggtgat cagccgaatc tttgactcgg 781 gctacccatg ggacatggtg ttcatgacac gctttcagaa catgttgaga aattccctcc 841 caaccccaat tgtgacttgg ttgatggagc gaaagataaa caactggctc aatcatgcaa 901 attacggctt aataccagaa gacaggactc agctgaaaga gtttgtgcta aatgatgagc 961 tcccaggacg catcatcact gggaaagtgt tcatcaggcc aagcataaaa gaggtaaagg 1021 aaaactctgt catatttaac aatacttcaa aggaagagcc tattgacatc attgtctttg 1081 ccactggata cacatttgct ttccccttcc ttgatgagtc tgtagtgaaa gttgaagatg 1141 gccaggcctc actgtacaag tatatcttcc ctgcacatct gcaaaagcca accctggcca 1201 ttattggcct catcaaaccc ttgggctcca tgatacctac aggagaaaca caagctcggt 1261 gggctgttcg agtcctgaaa ggtgtaaata agttaccacc accaagtgtc atgatagagg 1321 aaattaatgc aaggaaagaa aacaagccca gttggtttgg cttgtgctac tgcaaggctt 1381 tacaatcaga ttatatcaca tacatagatg aactcctgac ctatatcaat gcaaaaccca 1441 acctgttctc tatgctccta acggatccac atctggctct gaccgtcttc tttggcccat 1501 gctcaccata ccagttccgc ttgactggcc caggaaaatg ggaaggagcc agaaatgcca 1561 tcatgaccca gtgggaccga acattcaagg tcatcaaagc tcgagttgta caagagtctc 1621 catctccctt tgaaagtttt cttaaagtct ttagctttct ggctttgctt gtggctattt 1681 ttctgatttt cctataagta aaagatctcc taaatggaag atgcacagag tagatttaca 1741 atgctccaat tcctctctta cagcaatatt gccttcacag ttataaactg tattcaaata 1801 gtaaaggcca ccctctcgct tccctggctg gccccagggc taccactggt attcctgagc 1861 ctctcccagc tccacttcta atgctagaga atgataacta agacttctgt gcatttgaag 1921 gttgttggaa agttacaggt tcattttaga aagaaagctg ttcttgacag cactctgagc 1981 catcatacct ctttcccata taaactattt tcacagatct caactaaaac cccttacttc 2041 acaaatgatt gtgttgtgct gaaatggtgc tcttatagta ctggcttatt aaaagtaaaa 2101 aataaaccta aaaattgatt gttttaaaaa aaaa // LOCUS HUMFOLYSYN 2158 bp mRNA PRI 18-MAR-1993 DEFINITION Homo sapiens folylpolyglutamate synthetase mRNA, complete cds. ACCESSION M98045 NID g292028 KEYWORDS folylpolyglutamate synthetase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2158) AUTHORS Garrow,T.A., Admon,A. and Shane,B. TITLE Expression cloning of a human cDNA encoding folylpoly(gamma-glutamate) synthetase and determination of its primary structure JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89, 9151-9155 (1992) MEDLINE 93028422 FEATURES Location/Qualifiers source 1..2158 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 71..1708 /codon_start=1 /evidence=experimental /product="folylpolyglutamate synthetase" /db_xref="PID:g292029" /translation="MEYQDAVRMLNTLQTNAGYLEQVKRQRGDPQTQLEAMELYLARS GLQVEDLDRLNIIHVTGTKGKGSTCAFTECILRSYGLKTGFFSSPHLVQVRERIRING QPISPELFTKYFWRLYHRLEETKDGSCVSMPPYFRFLTLMAFHVFLQEKVDLAVVEVG IGGAYDCTNIIRKPVVCGVSSLGIDHTSLLGDTVEKIAWQKGGIFKQGVPAFTVLQPE GPLAVLRDRAQQISCPLYLCPMLEALEEGGPPLTLGLEGEHQRSNAALALQLAHCWLQ RQDRHGAGEPKASRPGLLWQLPLAPVFQPTSHMRLGLRNTEWPGRTQVLRRGPLTWYL DGAHTASSAQACVRWFRQALQGRERPSGGPEVRVLLFNATGDRDPAALLKLLQPCQFD YAVFCPNLTEVSSTGNADQQNFTVTLDQVLLRCLEHQQHWNHLDEEQASPDLWSAPSP EPGGSASLLLAPHPPHTCSASSLVFSCISHALQWISQGRDPIFQPPSPPKGLLTHPVA HSGASILREAAAIHVLVTGSLHLVGGVLKLLEPALSQ" polyA_signal 2140..2146 polyA_site 2158 BASE COUNT 366 a 695 c 673 g 424 t ORIGIN 1 gcgcggcata acgacccagg tcgcggcgcg gcggggcttg agcgcgtggc cggtgccgca 61 ggagccgagc atggagtacc aggatgccgt gcgcatgctc aataccctgc agaccaatgc 121 cggctacctg gagcaggtga agcgccagcg gggtgaccct cagacacagt tggaagccat 181 ggaactgtac ctggcacgga gtgggctgca ggtggaggac ttggaccggc tgaacatcat 241 ccacgtcact gggacgaagg ggaagggctc cacctgtgcc ttcacggaat gtatcctccg 301 aagctatggc ctgaagacgg gattctttag ctctccccac ctggtgcagg ttcgggagcg 361 gatccgcatc aatgggcagc ccatcagtcc tgagctcttc accaagtact tctggcgcct 421 ctaccaccgg ctggaggaga ccaaggatgg cagctgtgtc tccatgcccc cctacttccg 481 cttcctgaca ctcatggcct tccacgtctt cctccaagag aaggtggacc tggcagtggt 541 ggaggtgggc attggcgggg cttatgactg caccaacatc atcaggaagc ctgtggtgtg 601 cggagtctcc tctcttggca tcgaccacac cagcctcctg ggggatacgg tggagaagat 661 cgcatggcag aaagggggca tctttaagca aggtgtccct gccttcactg tgctccaacc 721 tgaaggtccc ctggcagtgc tgagggaccg agcccagcag atctcatgtc ctctatacct 781 gtgtccgatg ctggaggccc tcgaggaagg ggggccgccg ctgaccctgg gcctggaggg 841 ggagcaccag cggtccaacg ccgccttggc cttgcagctg gcccactgct ggctgcagcg 901 gcaggaccgc catggtgctg gggagccaaa ggcatccagg ccagggctcc tgtggcagct 961 gcccctggca cctgtgttcc agcccacatc ccacatgcgg ctcgggcttc ggaacacgga 1021 gtggccgggc cggacgcagg tgctgcggcg cgggcccctc acctggtacc tggacggtgc 1081 gcacaccgcc agcagcgcgc aggcctgcgt gcgctggttc cgccaggcgc tgcagggccg 1141 cgagaggccg agcggtggcc ccgaggttcg agtcttgctc ttcaatgcta ccggggaccg 1201 ggacccggcg gccctgctga agctgctgca gccctgccag tttgactatg ccgtcttctg 1261 ccctaacctg acagaggtgt catccacagg caacgcagac caacagaact tcacagtgac 1321 actggaccag gtcctgctcc gctgcctgga acaccagcag cactggaacc acctggacga 1381 agagcaggcc agcccggacc tctggagtgc ccccagccca gagcccggtg ggtccgcatc 1441 cctgcttctg gcgccccacc caccccacac ctgcagtgcc agctccctcg tcttcagctg 1501 catttcacat gccttgcaat ggatcagcca aggccgagac cccatcttcc agccacctag 1561 tcccccaaag ggcctcctca cccaccctgt ggctcacagt ggggccagca tactccgtga 1621 ggctgctgcc atccatgtgc tagtcactgg cagcctgcac ctggtgggtg gtgtcctgaa 1681 gctgctggag cccgcactgt cccagtagcc aaggcccggg gttggaggtg ggagcttccc 1741 acacctgcct gcgttctccc catgaactta catactaggt gccttttgtt tttggctttc 1801 ctggttctgt ctagactggc ctaggggcca gggctttggg atgggaggcc gggagaggat 1861 gtctttttta aggctctgtg ccttggtctc tccttcctct tggctgagat agcagagggg 1921 ctccccgggt ctctcactgt tgcagtggcc tggccgttca gcctgtctcc cccaacaccc 1981 cgcctgcctc ctggctcagg cccagcttat tgtgtgcgct gcctggccag gccctgggtc 2041 ttgccatgtg ctgggtggta gatttcctcc tcccagtgcc ttctgggaag ggagagggcc 2101 tctgcctggg acactgcggg acagagggtg gctggagtga attaaagcct ttgttttt // LOCUS HUMFPTA 1344 bp mRNA PRI 08-MAY-1993 DEFINITION Human farnesyl-protein transferase alpha-subunit mRNA, complete cds. ACCESSION L00634 NID g292030 KEYWORDS farnesyl-protein transferase; farnesyl-protein transferase alpha-subunit. SOURCE Homo sapiens placenta cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1344) AUTHORS Omer,C.A. TITLE Characterization of recombinant human farnesyl-protein transferase: Cloning, expression, farnesyl diphosphate binding and functional homology with Yeast prenyl-protein transferases JOURNAL Biochemistry (1993) In press FEATURES Location/Qualifiers source 1..1344 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" CDS 17..1156 /codon_start=1 /product="farnesyl-protein transferase alpha-subunit" /db_xref="PID:g292031" /translation="MAATEGVGEAAQGGEPGQPAQPPPQPHPPPPQQQHKEEMAAEAG EAVASPMDDGFVSLDSPSYVLYRDRAEWADIDPVPQNDGPNPVVQIIYSDKFRDVYDY FRAVLQRDERSERAFKLTRDAIELNAANYTVWHFRRVLLKSLQKDLHEEMNYITAIIE EQPKNYQVWHHRRVLVEWLRDPSQELEFIADILNQDAKNYHAWQHRQWVIQEFKLWDN ELQYVDQLLKEDVRNNSVWNQRYFVISNTTGYNDRAVLEREVQYTLEMIKLVPHNESA WNYLKGILQDRGLSKYPNLLNQLLDLQPSHSSPYLIAFLVDIYEDMLENQCDNKEDIL NKALELCEILAKEKDTIRKEYWRYIGRSLQSKHSTENDSPTNVQQ" BASE COUNT 400 a 285 c 322 g 337 t ORIGIN 1 tcggtccgca gccgagatgg cggccaccga gggggtcggg gaggctgcgc aagggggcga 61 gcccgggcag ccggcgcaac ccccgcccca gccgcaccca ccgccgcccc agcagcagca 121 caaggaagag atggcggccg aggctgggga agccgtggcg tcccccatgg acgacgggtt 181 tgtgagcctg gactcgccct cctatgtcct gtacagggac agagcagaat gggctgatat 241 agatccggtg ccgcagaatg atggccccaa tcccgtggtc cagatcattt atagtgacaa 301 atttagagat gtttatgatt acttccgagc tgtcctgcag cgtgatgaaa gaagtgaacg 361 agcttttaag ctaacccggg atgctattga gttaaatgca gccaattata cagtgtggca 421 tttccggaga gttcttttga agtcacttca gaaggatcta catgaggaaa tgaactacat 481 cactgcaata attgaggagc agcccaaaaa ctatcaagtt tggcatcata ggcgagtatt 541 agtggaatgg ctaagagatc catctcagga gcttgaattt attgctgata ttcttaatca 601 ggatgcaaag aattatcatg cctggcagca tcgacaatgg gttattcagg aatttaaact 661 ttgggataat gagctgcagt atgtggacca acttctgaaa gaggatgtga gaaataactc 721 tgtctggaac caaagatact tcgttatttc taacaccact ggctacaatg atcgtgctgt 781 attggagaga gaagtccaat acactctgga aatgattaaa ctagtaccac ataatgaaag 841 tgcatggaac tatttgaaag ggattttgca ggatcgtggt ctttccaaat atcctaatct 901 gttaaatcaa ttacttgatt tacaaccaag tcatagttcc ccctacctaa ttgcctttct 961 tgtggatatc tatgaagaca tgctagaaaa tcagtgtgac aataaggaag acattcttaa 1021 taaagcatta gagttatgtg aaatcctagc taaagaaaag gacactataa gaaaggaata 1081 ttggagatac attggaagat cccttcaaag caaacacagc acagaaaatg actcaccaac 1141 aaatgtacag caataacacc atccagaaga acttgatgga atgcttttat tttttattaa 1201 gggaccctgc aggagtttca cacgagagtt ccttcccttt tgtggtgtaa aagtgcatca 1261 cacaggtatt gctttttaca gactgatgct ccttggtgct gctgcatcta tctcagacta 1321 gctctagtat gtgatctcta agca // LOCUS HUMFPTB 1582 bp mRNA PRI 20-MAY-1993 DEFINITION Human farnesyl-protein transferase beta-subunit mRNA, complete cds. ACCESSION L00635 NID g292032 KEYWORDS farnesyl-protein transferase; farnesyl-protein transferase beta-subunit. SOURCE Homo sapiens placenta cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1582) AUTHORS Omer,C.A. TITLE Characterization of recombinant human farnesyl-protein transferase: Cloning, expression, farnesyl diphosphate binding and functional homology with Yeast prenyl-protein transferases JOURNAL Biochemistry (1993) In press FEATURES Location/Qualifiers source 1..1582 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" CDS 20..1333 /codon_start=1 /product="farnesyl-protein transferase beta-subunit" /db_xref="PID:g292033" /translation="MASPSSFTYYCPPSSSPVWSEPLYSLRPEHARERLQDDSVETVT SIEQAKVEEKIQEVFSSYKFNHLVPRLVLQREKHFHYLKRGLRQLTDAYECLDASRPW LCYWILHSLELLDEPIPQIVATDVCQFLELCQSPEGGFGGGPGQYPHLAPTYAAVNAL CIIGTEEAYDIINREKLLQYLYSLKQPDGSFLMHVGGEVDVRSAYCAASVASLTNIIT PDLFEGTAEWIARCQNWEGGIGGVPGMEAHGGYTFCGLAALVILKRERSLNLKSLLQW VTSRQMRFEGGFQGRCNKLVDGCYSFWQAGLLPLLHRALHAQGDPALSMSHWMFHQQA LQEYILMCCQCPAGGLLDKPGKSRDFYHTCYCLSGLSIAQHFGSGAMLHDVVLGVPEN ALQPTHPVYNIGPDKVIQATTYFLQKPVPGFEELKDETSAEPATD" BASE COUNT 361 a 434 c 417 g 370 t ORIGIN 1 ctctgctgct ctcctgatca tggcttctcc gagttctttc acctactatt gccctccatc 61 ttcctccccc gtctggtcag agccgctgta cagtctgagg cccgagcacg cgcgagagcg 121 gttgcaggac gactcggtgg aaacagtcac gtccatagaa caggcaaaag tagaagaaaa 181 gatccaagag gtcttcagtt cttacaagtt caaccacctt gtaccaaggc ttgttttgca 241 gagggagaag cacttccatt atctgaaaag aggccttcga caactgacag atgcctatga 301 gtgtctggat gccagccgcc catggctctg ctattggatc ctgcacagct tggaactgct 361 agatgaaccc atcccccaga tagtggctac agatgtgtgt cagttcctgg agctgtgtca 421 gagcccagaa ggtggctttg gaggaggacc cggtcagtat ccacaccttg cacccacata 481 tgcagcagtc aatgcattgt gcatcattgg caccgaggag gcctatgaca tcattaacag 541 agagaagctt cttcagtatt tgtactccct gaagcaacct gacggctcct ttctcatgca 601 tgtcggaggt gaggtggatg tgagaagcgc atactgtgct gcctccgtag cctcgctgac 661 caacatcatc actccagacc tctttgaggg cactgctgaa tggatagcaa ggtgtcagaa 721 ctgggaaggt ggcattggcg gggtaccagg gatggaagcc catggtggct ataccttctg 781 tggcctggcc gcgctggtaa tcctcaagag ggaacgttcc ttgaacttga agagcttatt 841 acaatgggtg acaagccggc agatgcgatt tgaaggagga tttcagggcc gctgcaacaa 901 gctggtggat ggctgctact ccttctggca ggcggggctc ctgcccctgc tccaccgcgc 961 actgcacgcc caaggtgacc ctgcccttag catgagccac tggatgttcc atcagcaggc 1021 cctgcaggag tacatcctga tgtgctgcca gtgccctgcg ggggggcttc tggataaacc 1081 tggcaagtcg cgtgatttct accacacctg ctactgcctg agcggcctgt ccatagccca 1141 gcacttcggc agcggagcca tgttgcatga tgtggtcctg ggtgtgcccg aaaacgctct 1201 gcagcccact cacccagtgt acaacattgg accagacaag gtgatccagg ccactacata 1261 ctttctacag aagccagtcc caggttttga ggagcttaag gatgagacat cggcagagcc 1321 tgcaaccgac tagaggacct gggtcccggc agctctttgc tcacccatct ccccagtcag 1381 acaaggttta tacgtttcaa tacatactgc attctgtgct acacaagcct tagcctcagt 1441 ggagctgtgg ttctcttggt actttcttgt caaacaaaac caatggctct gggtttggag 1501 aacacagtgg ctggttttaa aattctttcc acacctgtca aaccaaaaat ctatcagccc 1561 acgtggtgtg gttggtgaac ca // LOCUS HUMFRAPX 7943 bp mRNA PRI 31-DEC-1994 DEFINITION Human FKBP-rapamycin associated protein (FRAP) mRNA, complete cds. ACCESSION L34075 NID g508481 KEYWORDS FKBP-rapamycin associated protein; phosphatidylinositol 3-kinase. SOURCE Homo sapiens (tissue library: lambda ZAPII) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7943) AUTHORS Brown,E.J., Albers,M.W., Shin,T.B., Ichikawa,K., Keith,C.T., Lane,W.S. and Schreiber,S.L. TITLE A mammalian protein targeted by G1-arresting rapamycin-receptor complex JOURNAL Nature 369 (6483), 756-758 (1994) MEDLINE 94277209 FEATURES Location/Qualifiers source 1..7943 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="jurkat" /cell_type="T-cell" /tissue_lib="lambda ZAPII" 5'UTR 1..79 /gene="FRAP" gene 1..7943 /gene="FRAP" CDS 80..7729 /gene="FRAP" /note="homologue of phosphatidylinositol-3'-kinase (PI3K), TOR1, and TOR2 genes" /codon_start=1 /function="acts as the target for the cell-cycle arrest and immunosuppressive effects of the FKBP12-rapamycin complex" /product="FKBP-rapamycin associated protein" /db_xref="PID:g508482" /translation="MLGTGPAAATTAATTSSNVSVLQQFASGLKSRNEETRAKAAKEL QHYVTMELREMSQEESTRFYDQLNHHIFELVSSSDANERKGGILAIASLIGVEGGNAT RIGRFANYLRNLLPSNDPVVMEMASKAIGRLAMAGDTFTAEYVEFEVKRALEWLGADR NEGRRHAAVLVLRELAISVPTFFFQQVQPFFDNIFVAVWDPKQAIREGAVAALRACLI LTTQREPKEMQKPQWYRHTFEEAEKGFDETLAKEKGMNRDDRIHGALLILNELVRISS MEGERLREEMEEITQQQLVHDKYCKDLMGFGTKPRHITPFTSFQAVQPQQSNALVGLL GYSSHQGLMGFGTSPSPAKSTLVESRCCRDLMEEKFDQVCQWVLKCRNSKNSLIQMTI LNLLPRLAAFRPSAFTDTQYLQDTMNHVLSCVKKEKERTAAFQALGLLSVAVRSEFKV YLPRVLDIIRAALPPKDFAHKRQKAMQVDATVFTCISMLARAMGPGIQQDIKELLEPM LAVGLSPALTAVLYDLSRQIPQLKKDIQDGLLKMLSLVLMHKPLRHPGMPKGLAHQLA SPGLTTLPEASDVGSITLALRTLGSFEFEGHSLTQFVRHCADHFLNSEHKEIRMEAAR TCSRLLTPSIHLISGHAHVVSQTAVQVVADVLSKLLVVGITDPDPDIRYCVLASLDER FDAHLAQAENLQALFVALNDQVFEIRELAICTVGRLSSMNPAFVMPFLRKMLIQILTE LEHSGIGRIKEQSARMLGHLVSNAPRLIRPYMEPILKALILKLKDPDPDPNPGVINNV LATIGELAQVSGLEMRKWVDELFIIIMDMLQDSSLLAKRQVALWTLGQLVASTGYVVE PYRKYPTLLEVLLNFLKTEQNQGTRREAIRVLGLLGALDPYKHKVNIGMIDQSRDASA VSLSESKSSQDSSDYSTSEMLVNMGNLPLDEFYPAVSMVALMRIFRDQSLSHHHTMVV QAITFIFKSLGLKCVQFLPQVMPTFLNVIRVCDGAIREFLFQQLGMLVSFVKSHIRPY MDEIVTLMREFWVMNTSIQSTIILLIEQIVVALGGEFKLYLPQLIPHMLRVFMHDNSP GRIVSIKLLAAIQLFGANLDDYLHLLLPPIVKLFDAPEAPLPSRKAALETVDRLTESL DFTDYASRIIHPIVRTLDQSPELRSTAMDTLSSLVFQLGKKYQIFIPMVNKVLVRHRI NHQRYDVLICRIVKGYTLADEEEDPLIYQHRMLRSGQGDALASGPVETGPMKKLHVST INLQKAWGAARRVSKDDWLEWLRRLSLELLKDSSSPSLRSCWALAQAYNPMARDLFNA AFVSCWSELNEDQQDELIRSIELALTSQDIAEVTQTLLNLAEFMEHSDKGPLPLRDDN GIVLLGERAAKCRAYAKALHYKELEFQKGPTPAILESLISINNKLQQPEAAAGVLEYA MKHFGELEIQATWYEKLHEWEDALVAYDKKMDTNKDDPELMLGRMRCLEALGEWGQLH QQCCEKWTLVNDETQAKMARMAAAAAWGLGQWDSMEEYTCMIPRDTHDGAFYRAVLAL HQDLFSLAQQCIDKARDLLDAELTAMAGESYSRAYGAMVSCHMLSELEEVIQYKLVPE RREIIRQIWWERLQGCQRIVEDWQKILMVRSLVVSPHEDMRTWLKYASLCGKSGRLAL AHKTLVLLLGVDPSRQLDHPLPTVHPQVTYAYMKNMWKSARKIDAFQHMQHFVQTMQQ QAQHAIATEDQQHKQELHKLMARCFLKLGEWQLNLQGINESTIPKVLQYYSAATEHDR SWYKAWHAWAVMNFEAVLHYKHQNQARDEKKKLRHASGANITNATTAATTAATATTTA STEGSNSESEAESTENSPTPSPLQKKVTEDLSKTLLMYTVPAVQGFFRSISLSRGNNL QDTLRVLTLWFDYGHWPDVNEALVEGVKAIQIDTWLQVIPQLIARIDTPRPLVGRLIH QLLTDIGRYHPQALIYPLTVASKSTTTARHNAANKILKNMCEHSNTLVQQAMMVSEEL IRVAILWHEMWHEGLEEASRLYFGERNVKGMFEVLEPLHAMMERGPQTLKETSFNQAY GRDLMEAQEWCRKYMKSGNVKDLTQAWDLYYHVFRRISKQLPQLTSLELQYVSPKLLM CRDLELAVPGTYDPNQPIIRIQSIAPSLQVITSKQRPRKLTLMGSNGHEFVFLLKGHE DLRQDERVMQLFGLVNTLLANDPTSLRKNLSIQRYAVIPLSTNSGLIGWVPHCDTLHA LIRDYREKKKILLNIEHRIMLRMAPDYDHLTLMQKVEVFEHAVNNTAGDDLAKLLWLK SPSSEVWFDRRTNYTRSLAVMSMVGYILGLGDRHPSNLMLDRLSGKILHIDFGDCFEV AMTREKFPEKIPFRLTRMLTNAMEVTGLDGNYRITCHTVMEVLREHKDSVMAVLEAFV YDPLLNWRLMDTNTKGNKRSRTRTDSYSAGQSVEILDGVELGEPAHKKTGTTVPESIH SFIGDGLVKPEALNKKAIQIINRVRDKLTGRDFSHDDTLDVPTQVELLIKQATSHENL CQCYIGWCPFW" 3'UTR 7730..7943 /gene="FRAP" BASE COUNT 1972 a 2103 c 2094 g 1774 t ORIGIN 1 acggggcctg aagcggcggt accggtgctg gcggcggcag ctgaggcctt ggccgaagcc 61 gcgcgaacct cagggcaaga tgcttggaac cggacctgcc gccgccacca ccgctgccac 121 cacatctagc aatgtgagcg tcctgcagca gtttgccagt ggcctaaaga gccggaatga 181 ggaaaccagg gccaaagccg ccaaggagct ccagcactat gtcaccatgg aactccgaga 241 gatgagtcaa gaggagtcta ctcgcttcta tgaccaactg aaccatcaca tttttgaatt 301 ggtttccagc tcagatgcca atgagaggaa aggtggcatc ttggccatag ctagcctcat 361 aggagtggaa ggtgggaatg ccacccgaat tggcagattt gccaactatc ttcggaacct 421 cctcccctcc aatgacccag ttgtcatgga aatggcatcc aaggccattg gccgtcttgc 481 catggcaggg gacactttta ccgctgagta cgtggaattt gaggtgaagc gagccctgga 541 atggctgggt gctgaccgca atgagggccg gagacatgca gctgtcctgg ttctccgtga 601 gctggccatc agcgtcccta ccttcttctt ccagcaagtg caacccttct ttgacaacat 661 ttttgtggcc gtgtgggacc ccaaacaggc catccgtgag ggagctgtag ccgcccttcg 721 tgcctgtctg attctcacaa cccagcgtga gccgaaggag atgcagaagc ctcagtggta 781 caggcacaca tttgaagaag cagagaaggg atttgatgag accttggcca aagagaaggg 841 catgaatcgg gatgatcgga tccatggagc cttgttgatc cttaacgagc tggtccgaat 901 cagcagcatg gagggagagc gtctgagaga agaaatggaa gaaatcacac agcagcagct 961 ggtacacgac aagtactgca aagatctcat gggcttcgga acaaaacctc gtcacattac 1021 ccccttcacc agtttccagg ctgtacagcc ccagcagtca aatgccttgg tggggctgct 1081 ggggtacagc tctcaccaag gcctcatggg atttgggacc tcccccagtc cagctaagtc 1141 caccctggtg gagagccggt gttgcagaga cttgatggag gagaaatttg atcaggtgtg 1201 ccagtgggtg ctgaaatgca ggaatagcaa gaactcgctg atccaaatga caatccttaa 1261 tttgttgccc cgcttggctg cattccgacc ttctgccttc acagataccc agtatctcca 1321 agataccatg aaccatgtcc taagctgtgt caagaaggag aaggaacgta cagcggcctt 1381 ccaagccctg gggctacttt ctgtggctgt gaggtctgag tttaaggtct atttgcctcg 1441 cgtgctggac atcatccgag cggccctgcc cccaaaggac ttcgcccata agaggcagaa 1501 ggcaatgcag gtggacgcca cagtcttcac ttgcatcagc atgctggctc gagcaatggg 1561 gccaggcatc cagcaggata tcaaggagct gctggagccc atgctggcag tgggactaag 1621 ccctgccctc actgcagtgc tctacgacct gagccgtcag attccacagc taaagaagga 1681 cattcaagat gggctactga aaatgctgtc cctggtcctt atgcacaaac cccttcgcca 1741 cccaggcatg cccaagggcc tggcccatca gctggcctct cctggcctca cgaccctccc 1801 tgaggccagc gatgtgggca gcatcactct tgccctccga acgcttggca gctttgaatt 1861 tgaaggccac tctctgaccc aatttgttcg ccactgtgcg gatcatttcc tgaacagtga 1921 gcacaaggag atccgcatgg aggctgcccg cacctgctcc cgcctgctca caccctccat 1981 ccacctcatc agtggccatg ctcatgtggt tagccagacc gcagtgcaag tggtggcaga 2041 tgtgcttagc aaactgctcg tagttgggat aacagatcct gaccctgaca ttcgctactg 2101 tgtcttggcg tccctggacg agcgctttga tgcacacctg gcccaggcgg agaacttgca 2161 ggccttgttt gtggctctga atgaccaggt gtttgagatc cgggagctgg ccatctgcac 2221 tgtgggccga ctcagtagca tgaaccctgc ctttgtcatg cctttcctgc gcaagatgct 2281 catccagatt ttgacagagt tggagcacag tgggattgga agaatcaaag agcagagtgc 2341 ccgcatgctg gggcacctgg tctccaatgc cccccgactc atccgcccct acatggagcc 2401 tattctgaag gcattaattt tgaaactgaa agatccagac cctgatccaa acccaggtgt 2461 gatcaataat gtcctggcaa caataggaga attggcacag gttagtggcc tggaaatgag 2521 gaaatgggtt gatgaacttt ttattatcat catggacatg ctccaggatt cctctttgtt 2581 ggccaaaagg caggtggctc tgtggaccct gggacagttg gtggccagca ctggctatgt 2641 agtagagccc tacaggaagt accctacttt gcttgaggtg ctactgaatt ttctgaagac 2701 tgagcagaac cagggtacac gcagagaggc catccgtgtg ttagggcttt taggggcttt 2761 ggatccttac aagcacaaag tgaacattgg catgatagac cagtcccggg atgcctctgc 2821 tgtcagcctg tcagaatcca agtcaagtca ggattcctct gactatagca ctagtgaaat 2881 gctggtcaac atgggaaact tgcctctgga tgagttctac ccagctgtgt ccatggtggc 2941 cctgatgcgg atcttccgag accagtcact ctctcatcat cacaccatgg ttgtccaggc 3001 catcaccttc atcttcaagt ccctgggact caaatgtgtg cagttcctgc cccaggtcat 3061 gcccacgttc cttaatgtca ttcgagtctg tgatggggcc atccgggaat ttttgttcca 3121 gcagctggga atgttggtgt cctttgtgaa gagccacatc agaccttata tggatgaaat 3181 agtcaccctc atgagagaat tctgggtcat gaacacctca attcagagca cgatcattct 3241 tctcattgag caaattgtgg tagctcttgg gggtgaattt aagctctacc tgccccagct 3301 gatcccacac atgctgcgtg tcttcatgca tgacaacagc ccaggccgca ttgtctctat 3361 caagttactg gctgcaatcc agctgtttgg cgccaacctg gatgactacc tgcatttact 3421 gctgcctcct attgttaagt tgtttgatgc ccctgaagct ccactgccat ctcgaaaggc 3481 agcgctagag actgtggacc gcctgacgga gtccctggat ttcactgact atgcctcccg 3541 gatcattcac cctattgttc gaacactgga ccagagccca gaactgcgct ccacagccat 3601 ggacacgctg tcttcacttg tttttcagct ggggaagaag taccaaattt tcattccaat 3661 ggtgaataaa gttctggtgc gacaccgaat caatcatcag cgctatgatg tgctcatctg 3721 cagaattgtc aagggataca cacttgctga tgaagaggag gatcctttga tttaccagca 3781 tcggatgctt aggagtggcc aaggggatgc attggctagt ggaccagtgg aaacaggacc 3841 catgaagaaa ctgcacgtca gcaccatcaa cctccaaaag gcctggggcg ctgccaggag 3901 ggtctccaaa gatgactggc tggaatggct gagacggctg agcctggagc tgctgaagga 3961 ctcatcatcg ccctccctgc gctcctgctg ggccctggca caggcctaca acccgatggc 4021 cagggatctc ttcaatgctg catttgtgtc ctgctggtct gaactgaatg aagatcaaca 4081 ggatgagctc atcagaagca tcgagttggc cctcacctca caagacatcg ctgaagtcac 4141 acagaccctc ttaaacttgg ctgaattcat ggaacacagt gacaagggcc ccctgccact 4201 gagagatgac aatggcattg ttctgctggg tgagagagct gccaagtgcc gagcatatgc 4261 caaagcacta cactacaaag aactggagtt ccagaaaggc cccacccctg ccattctaga 4321 atctctcatc agcattaata ataagctaca gcagccggag gcagcggccg gagtgttaga 4381 atatgccatg aaacactttg gagagctgga gatccaggct acctggtatg agaaactgca 4441 cgagtgggag gatgcccttg tggcctatga caagaaaatg gacaccaaca aggacgaccc 4501 agagctgatg ctgggccgca tgcgctgcct cgaggccttg ggggaatggg gtcaactcca 4561 ccagcagtgc tgtgaaaagt ggaccctggt taatgatgag acccaagcca agatggcccg 4621 gatggctgct gcagctgcat ggggtttagg tcagtgggac agcatggaag aatacacctg 4681 tatgatccct cgggacaccc atgatggggc attttataga gctgtgctgg cactgcatca 4741 ggacctcttc tccttggcac aacagtgcat tgacaaggcc agggacctgc tggatgctga 4801 attaactgca atggcaggag agagttacag tcgggcatat ggggccatgg tttcttgcca 4861 catgctgtcc gagctggagg aggttatcca gtacaaactt gtccccgagc gacgagagat 4921 catccgccag atctggtggg agagactgca gggctgccag cgtatcgtag aggactggca 4981 gaaaatcctt atggtgcggt cccttgtggt cagccctcat gaagacatga gaacctggct 5041 caagtatgca agcctgtgcg gcaagagtgg caggctggct cttgctcata aaactttagt 5101 gttgctcctg ggagttgatc cgtctcggca acttgaccat cctctgccaa cagttcaccc 5161 tcaggtgacc tatgcctaca tgaaaaacat gtggaagagt gcccgcaaga tcgatgcctt 5221 ccagcacatg cagcattttg tccagaccat gcagcaacag gcccagcatg ccatcgctac 5281 tgaggaccag cagcataagc aggaactgca caagctcatg gcccgatgct tcctgaaact 5341 tggagagtgg cagctgaatc tacagggcat caatgagagc acaatcccca aagtgctgca 5401 gtactacagc gccgccacag agcacgaccg cagctggtac aaggcctggc atgcgtgggc 5461 agtgatgaac ttcgaagctg tgctacacta caaacatcag aaccaagccc gcgatgagaa 5521 gaagaaactg cgtcatgcca gcggggccaa catcaccaac gccaccactg ccgccaccac 5581 ggccgccact gccaccacca ctgccagcac cgagggcagc aacagtgaga gcgaggccga 5641 gagcaccgag aacagcccca ccccatcgcc gctgcagaag aaggtcactg aggatctgtc 5701 caaaaccctc ctgatgtaca cggtgcctgc cgtccagggc ttcttccgtt ccatctcctt 5761 gtcacgaggc aacaacctcc aggatacact cagagttctc accttatggt ttgattatgg 5821 tcactggcca gatgtcaatg aggccttagt ggagggggtg aaagccatcc agattgatac 5881 ctggctacag gttatacctc agctcattgc aagaattgat acgcccagac ccttggtggg 5941 acgtctcatt caccagcttc tcacagacat tggtcggtac cacccccagg ccctcatcta 6001 cccactgaca gtggcttcta agtctaccac gacagcccgg cacaatgcag ccaacaagat 6061 tctgaagaac atgtgtgagc acagcaacac cctggtccag caggccatga tggtgagcga 6121 ggagctgatc cgagtggcca tcctctggca tgagatgtgg catgaaggcc tggaagaggc 6181 atctcgtttg tactttgggg aaaggaacgt gaaaggcatg tttgaggtgc tggagccctt 6241 gcatgctatg atggaacggg gcccccagac tctgaaggaa acatccttta atcaggccta 6301 tggtcgagat ttaatggagg cccaagagtg gtgcaggaag tacatgaaat cagggaatgt 6361 caaggacctc acccaagcct gggacctcta ttatcatgtg ttccgacgaa tctcaaagca 6421 gctgcctcag ctcacatcct tagagctgca atatgtttcc ccaaaacttc tgatgtgccg 6481 ggaccttgaa ttggctgtgc caggaacata tgaccccaac cagccaatca ttcgcattca 6541 gtccatagca ccgtctttgc aagtcatcac atccaagcag aggccccgga aattgacact 6601 tatgggcagc aacggacatg agtttgtttt ccttctaaaa ggccatgaag atctgcgcca 6661 ggatgagcgt gtgatgcagc tcttcggcct ggttaacacc cttctggcca atgacccaac 6721 atctcttcgg aaaaacctca gcatccagag atacgctgtc atccctttat cgaccaactc 6781 gggcctcatt ggctgggttc cccactgtga cacactgcac gccctcatcc gggactacag 6841 ggagaagaag aagatccttc tcaacatcga gcatcgcatc atgttgcgga tggctccgga 6901 ctatgaccac ttgactctga tgcagaaggt ggaggtgttt gagcatgccg tcaataatac 6961 agctggggac gacctggcca agctgctgtg gctgaaaagc cccagctccg aggtgtggtt 7021 tgaccgaaga accaattata cccgttcttt agcggtcatg tcaatggttg ggtatatttt 7081 aggcctggga gatagacacc catccaacct gatgctggac cgtctgagtg ggaagatcct 7141 gcacattgac tttggggact gctttgaggt tgctatgacc cgagagaagt ttccagagaa 7201 gattccattt agactaacaa gaatgttgac caatgctatg gaggttacag gcctggatgg 7261 caactacaga atcacatgcc acacagtgat ggaggtgctg cgagagcaca aggacagtgt 7321 catggccgtg ctggaagcct ttgtctatga ccccttgctg aactggaggc tgatggacac 7381 aaataccaaa ggcaacaagc gatcccgaac gaggacggat tcctactctg ctggccagtc 7441 agtcgaaatt ttggacggtg tggaacttgg agagccagcc cataagaaaa cggggaccac 7501 agtgccagaa tctattcatt ctttcattgg agacggtttg gtgaaaccag aggccctaaa 7561 taagaaagct atccagatta ttaacagggt tcgagataag ctcactggtc gggacttctc 7621 tcatgatgac actttggatg ttccaacgca agttgagctg ctcatcaaac aagcgacatc 7681 ccatgaaaac ctctgccagt gctatattgg ctggtgccct ttctggtaac tggaggccca 7741 gatgtgccca tcacgttttt tctgaggctt ttgtacttta gtaaatgctt ccactaaact 7801 gaaaccatgg tgagaaagtt tgactttgtt aaatattttg aaatgtaaat gaaaagaagt 7861 actgtatatt aaaagttggt ttgaaccaac tttctagctg ctgttgaaga atatattgtc 7921 agaaacacaa ggcttgattt ggt // LOCUS HUMFRG1R 1042 bp mRNA PRI 02-APR-1996 DEFINITION Homo sapiens FRG1 mRNA, complete cds. ACCESSION L76159 NID g1246232 KEYWORDS FRG1 gene; FSHD candidate gene; multigene family. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1042) AUTHORS Van Deutekom,J.C.T., Lemmers,R.R.J., Grewal,P.K., Van Geel,M., Hofker,M.H., Hewitt,J.E. and Frants,R.R. TITLE Identification of the first gene (FRG1) from the FSHD region on human chromosome 4q35 JOURNAL Hum. Mol. Genet. 5(5) (1996) In press FEATURES Location/Qualifiers source 1..1042 /organism="Homo sapiens" /db_xref="taxon:9606" /map="4q35" 5'UTR 1..191 /gene="FRG1" gene 1..1042 /gene="FRG1" mRNA <1..>1042 /gene="FRG1" CDS 192..968 /gene="FRG1" /codon_start=1 /db_xref="PID:g1246233" /translation="MAEYSYVKSTKLVLKGTKTKSKKKKSKDKKRKREEDEETQLDIV GIWWTVTNFGEISGTIAIEMDKGTYIHALDNGLFTLGAPHKEVDEGPSPPEQFTAVKL SDSRIALKSGYGKYLGINSDGLVVGRSDAIGPREQWEPVFQNGKMALLASNSCFIRCN EAGDIEAKSKTAGEEEMIKIRSCAERETKKKDDIPEEDKGNVKQCEINYVKKFQSFQD HKLKISKEDSKILKKARKDGFLHETLLDRRAKLKADRYCK" 3'UTR 969..>1042 /gene="FRG1" BASE COUNT 350 a 195 c 252 g 245 t ORIGIN 1 gaaacccgga agtggaactc tgagccattc agcgtttggg tgaagacgga ggcgggttct 61 acagagacgt aggctgtcag ggagtgttta tttcgcgtcc gcttctgttc ctccgcgccc 121 ctgtgctgcc ccgactcaca tactcgtcca gaaccggcct cagcctctcc gcgcagaagt 181 tgcccggagc catggccgag tactcctatg tgaagtctac caagctcgtg ctcaagggaa 241 ccaagacgaa gagtaagaag aaaaagagca aagataagaa aagaaaaaga gaagaagatg 301 aagaaaccca gcttgatatt gttggaatct ggtggacagt aacaaacttt ggtgaaattt 361 caggaaccat agccattgaa atggataagg gaacctatat acatgcactc gacaatggtc 421 tttttaccct gggagctcca cacaaagaag ttgatgaggg ccctagtcct ccagagcagt 481 ttacggctgt caaattatct gattccagaa ttgccctgaa gtctggctat ggaaaatatc 541 ttggtataaa ttcagatgga cttgttgttg ggcgttcaga tgcaattgga ccaagagaac 601 aatgggaacc agtctttcaa aatgggaaaa tggctttgtt ggcctcaaat agctgcttta 661 ttagatgcaa tgaagcaggg gacatagaag caaaaagtaa aacagcagga gaagaagaaa 721 tgatcaagat tagatcctgt gctgaaagag aaaccaagaa aaaagatgac attccagaag 781 aagacaaagg aaatgtaaaa caatgtgaaa tcaattatgt aaagaaattt cagagcttcc 841 aagaccacaa acttaaaata agtaaagaag acagtaaaat tcttaaaaag gctcggaaag 901 atggattttt gcatgagacg cttctggaca ggagagccaa attgaaagcc gacagatact 961 gcaagtgact gggatttttg tttctgcctt atctttctgt gtttttttct gaataaaata 1021 ttcagaggaa atgcttttac ag // LOCUS HUMFSHD 3303 bp DNA PRI 01-AUG-1996 DEFINITION Human facioscapulohumeral muscular dystrophy (FSHD) gene region, D4Z4 tandem repeat unit. ACCESSION D38024 NID g871846 KEYWORDS facioscapulohumeral muscular dystrophy; FSHD; D4Z4 repeat family; microsatellite; homeodomain; LSau-like sequence. SOURCE Homo sapiens DNA, clone:c51. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3303) AUTHORS Lee,J.H., Goto,K., Matsuda,C. and Arahata,K. TITLE Characterization of a tandemly repeated 3.3-kb KpnI unit in the facioscapulohumeral muscular dystrophy (FSHD) gene region on chromosome 4q35 JOURNAL Muscle Nerve 2, S6-S13 (1995) MEDLINE 95258038 REFERENCE 2 (bases 1 to 3303) AUTHORS Lee,J. TITLE Direct Submission JOURNAL Submitted (22-AUG-1994) to the DDBJ/EMBL/GenBank databases. Je Hyeon Lee, National Institute of Neuroscience, NCNP, Dept. of neuromusclar Research; 4-1-1, Ogawa-Higashi, Kodaira, Tokyo 187, Japan (Tel:0423-46-1712, Fax:0423-46-1742) FEATURES Location/Qualifiers source 1..3303 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="4q35-qter" /clone="c51" repeat_unit 1..3303 /note="3303bps KpnI fragment in FSHD gene region contains other repeat and sequence motive" /function="unknown" /rpt_family="D4Z4" /rpt_type=tandem repeat_region 1..300 /note="region with similarity to Lsau(GenBank X59423), Part of tandem repeat D4Z4" /function="unknown" /rpt_type=other misc_feature 393..578 /note="extremly G-rich region in 186bps. Part of tandem repeat locus D4Z4" /function="unknown" satellite 398..578 /note="microsatellite of GGAGG, Part of tantem repeat locus D4Z4" /rpt_type=direct CDS 417..2978 /note="ORF" /codon_start=1 /db_xref="PID:d1007805" /db_xref="PID:g1435038" /translation="MERGTGETRGAEGTLGGRQGGREAGRNGGRDRATQGLGAGPREP GTDGGRKAGRKSGPRPPGVAGPPASGKTVSVRRGLRAGPTAAAPAGGAPPIRPGSGAQ GVGGFLRDKRPGLGLPSGLHPRGSQTAHPQAEPCNAARGPQTRPRRSHTQDDGGVILV SEWLCPPEGGLLLTSLRPPKGWPCRLFAPGALRHPETCREGCKPGMVPSLSLPGSKPA TLQTPPRCRTRESIVRPSRRGGISSLGSRSGLLRGNEREPHACVCETVPATATPTGIA SFTERGPGTLKTPTEVQFHTPLHPPRLVSPCCRRVGAQRAASRSRGIPGEVRRAGPRN APPSPLPPLPLPLRLSGPTTTTATTPPPPPPPPTTTTTTTPPAGPRPRRPGSLPGWGG LSQGGSPPFMKGWSLPACGPLQGRLAGWLAVRAGLLAAPAAVHSPAEVHGSPPASLCP RPSVKFRPGLTAMALPTPSDSTLPAEARGRGRPRRLVWTPSQSEALRACFERNPYPGI ATRERLAQAIGIPEPRVQIWFQNERSRQLRQHRRESRPWPGRRGPPEGRRKRTAVTGS QTALLLRAFEKDRFPGIAAREELARETGLPESRIQIWFQNRRARHPGQGGRAPAQAGG LCSAAPGGGHPAPSWVAFAHTGAWGTGLPAPHVPCAPGALPQGAFVSQAARAAPALQP SQAAPAEGVSQPAPARGDFAYAAPAPPEPGRSPTLRLLGGLRTRAKAGRTGTRSATAC RAPARWHSLGPLKRGRRPRGACATHVPGESVVGLGPGSPGRRGGVGTPSRGSSTSPAR APGTPPPPRGRGRCKASRRPPRRSRSRRPGLHSPAACCWMSSWRARSFCSRRNLS" misc_feature 1405..1554 /note="Extremly C-rich region(69%) in 186bps. Part of tandem repeat locus 4DZ4." /function="unknown" satellite 1470..1536 /note="microsatellite of CCA, Part of tandem repeat locus D4Z4" /rpt_type=direct satellite 1590..1703 /note="microsatellite of GGCT, Part of tandem repeat locus D4Z4" /rpt_type=direct misc_feature 1863..2037 /note="paired type homeodomain seq I, translation;PRRLVWTPSQSEALRACFERNPYPGIATRERLAQAIGIPEPRVQIW FQNERSRQLR" /function="unknown" misc_feature 2082..2264 /note="paired type homeodomain seq II, translationGRRKRTAVTGSQTALLLRAFEKDRFPGIAAREELARETGLPESRIQI WFQNRRARHPGQG" /function="unknown" BASE COUNT 486 a 1257 c 1142 g 418 t ORIGIN Chromosome 4q35-qter. 1 ggtaccagca ggtgggccgc ctactgcgca cgcgcgggtt tgcgggcagc cgcctgggct 61 gtgggagcag cccgggccag agctctcctg cctctccacc agcccacccc gccgcctgac 121 cgccccctcc ccacccccca ccccccaccc ccggaaaacg cgtcgtcccc tgggctgggt 181 ggagaccccc gtcccgcgaa acaccgggcc ccgcgcagcg tccgggcctg acaccgctcc 241 gccggctcgc ctcctcctgt cgcccccggg ccaccgtcgc ccgcccgccc gggcccctgc 301 gggcccctgc agccgcccag ctgccagcac gggcggctgg cggcggaacg cagaccccag 361 gcccggcgca caccggggac gctgagcgtt ccaggcggga gggaaggcgg gcagagatgg 421 agagaggaac gggagagact agaggggcgg aagggacgtt aggagggagg cagggaggca 481 gggaggcagg gaggaacgga gggagagaca gagcgacgca gggactgggg gcggggccga 541 gggagccggg gacggacggg gggaggaagg cagggaggaa aagcggtcct cggcctccgg 601 gagtagcggg accgcccgcc tccgggaaaa cggtcagcgt ccggcgcggg ctgagggctg 661 ggcccacagc cgccgcgccg gccggcgggg caccacccat tcgccccggt tccggggccc 721 agggagtggg cggtttcctc cgggacaaaa gaccgggact cgggttgccg tcgggtcttc 781 acccgcgcgg ttcacagacc gcacatcccc aggctgagcc ctgcaacgcg gcgcgaggcc 841 cacagacccg gccacggagg agccacacgc aggacgacgg aggcgtgatt ttggtttccg 901 agtggctttg ccctcccgaa ggcggcctgt tgctcacgtc tctccggccc ccgaaaggct 961 ggccatgccg actgtttgct cccggagctc tgcggcaccc ggaaacatgc agggaagggt 1021 gcaagcccgg catggtgcct tcgctctcct tgccaggttc caaacccgcc acactgcaga 1081 ctcccccacg ttgccgcacg cgggaatcca tcgtcaggcc atcacgccgg ggaggcatct 1141 cctctctggg gtctcgctct ggtcttctac gtggaaatga acgagagcca cacgcctgcg 1201 tgtgcgagac cgtcccggca acggcgacgc ccacaggcat tgcctccttc acggagagag 1261 ggcctggcac actcaagact cccacggagg ttcagttcca cactcccctc caccctccca 1321 ggctggtttc tccctgctgc cgacgcgtgg gagcccagag agcggcttcc cgttcccgcg 1381 ggatccctgg agaggtccgg agagccggcc cccgaaacgc gcccccctcc cccctccccc 1441 ctctccccct tcctcttcgt ctctccggcc ccaccaccac caccgccacc acgcctcccc 1501 caccaccccc cccccccacc accaccacca ccaccacccc gccggccggc cccaggcctc 1561 gacgccctgg gtcccttccg gggtggggcg ggctgtccca ggggggctca ccgccattca 1621 tgaaggggtg gagcctgcct gcctgtgggc ctttacaagg gcggctggct ggctggctgg 1681 ctgtccgggc aggcctcctg gctgcacctg ccgcagtgca cagtccggct gaggtgcacg 1741 ggagcccgcc ggcctctctc tgcccgcgtc cgtccgtgaa attccggccg gggctcaccg 1801 cgatggccct cccgacaccc tcggacagca ccctccccgc ggaagcccgg ggacgaggac 1861 ggccacggag actcgtttgg accccgagcc aaagcgaggc cctgcgagcc tgctttgagc 1921 ggaacccgta cccgggcatc gccaccagag aacggctggc ccaggccatc ggcattccgg 1981 agcccagggt ccagatttgg tttcagaatg agaggtcacg ccagctgagg cagcaccggc 2041 gggaatctcg gccctggccc gggagacgcg gcccgccaga aggccggcga aagcggaccg 2101 ccgtcaccgg atcccagacc gccctgctcc tccgagcctt tgagaaggat cgctttccag 2161 gcatcgccgc ccgggaggag ctggccagag agacgggcct cccggagtcc aggattcaga 2221 tctggtttca gaatcgaagg gccaggcacc cgggacaggg tggcagggcg cccgcgcagg 2281 caggcggcct gtgcagcgcg gcccccggcg ggggtcaccc tgctccctcg tgggtcgcct 2341 tcgcccacac cggcgcgtgg ggaacggggc ttcccgcacc ccacgtgccc tgcgcgcctg 2401 gggctctccc acagggggct ttcgtgagcc aggcagcgag ggccgccccc gcgctgcagc 2461 ccagccaggc cgcgccggca gagggggtct cccaacctgc cccggcgcgc ggggatttcg 2521 cctacgccgc cccggctcct ccggagccgg ggcgctctcc caccctcagg ctcctcggtg 2581 gcctccgcac ccgggcaaaa gccgggagga ccgggacccg cagcgcgacg gcctgccggg 2641 cccctgcgcg gtggcacagc ctgggcccgc tcaagcgggg ccgcaggcca aggggtgctt 2701 gcgccaccca cgtcccaggg gagtccgtgg tggggctggg gccggggtcc ccaggtcgcc 2761 ggggcggcgt gggaacccca agccggggca gctccacctc cccagcccgc gcccccggga 2821 cgcctccgcc tccgcgcggc aggggcagat gcaaggcatc ccggcgccct cccaggcgct 2881 ccaggagccg gcgccctggt ctgcactccc ctgcggcctg ctgctggatg agctcctggc 2941 gagcccggag tttctgcagc aggcgcaacc tctcctagaa acggaggccc cgggggagct 3001 ggaggcctcg gaagaggcgc ctcgctggaa gcacccctca gcgaggaaga ataccgggct 3061 ctgctggagg agctttagga cgcggggttg ggacggggtc gggtggttcg gggcagggcg 3121 gtggcctctc tttcgcgggg aacacctggc tggctacgga ggggcgtgtc tccgccccgc 3181 cccctccacc gggctgaccg gcctgggatt cctgccttct aggtccaggc ccggtgagag 3241 actccacacc gcggagaact gccattcttt cctgggcatc ccggggatcc cagagccggc 3301 cca // LOCUS HUMFSHRE 2393 bp mRNA PRI 08-NOV-1994 DEFINITION Human follicle stimulating hormone receptor mRNA, complete cds. ACCESSION M65085 NID g182770 KEYWORDS follicle stimulating hormone (FSH) receptor. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2393) AUTHORS Minegishi,T., Nakamura,K., Takakura,Y., Ibuki,Y., Igarashi,M. and Minegish T [corrected to Minegishi,T.]. TITLE Cloning and sequencing of human FSH receptor cDNA [published erratum appears in Biochem Biophys Res Commun 1994 Jun 15;201(2):1057] JOURNAL Biochem. Biophys. Res. Commun. 175 (3), 1125-1130 (1991) MEDLINE 91222171 FEATURES Location/Qualifiers source 1..2393 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Unassigned" gene 67..2154 /gene="FSHR" CDS 67..2154 /gene="FSHR" /codon_start=1 /db_xref="GDB:G00-127-510" /product="follicle stimulating hormone receptor" /db_xref="PID:g182771" /translation="MALLLVSLLAFLSLGSGCHHRICHCSNRVFLCQESKVTEIPSDL PRNAIELRFVLTKLRVIQKGAFSGFGDLEKIEISQNDVLEVIEADVFSNLPKLHEIRI EKANNLLYITPEAFQNLPNLQYLLISNTGIKHLPDVHKIHSLQKVLLDIQDNINIHTI ERNSFVGLSFESVILWLNKNGIQEIHNCAFNGTQLDAVNLSDNNNLEELPNDVFHGAS GPVILDISRTRIHSLPSYGLENLKKLRARSTYNLKKLPTLEKLVALMEASLTYPSHCC AFANWRRQISELHPICNKSILRQEVDYMTQARGQRSSLAEDNESSYSRGFDMTYTEFD YDLCNEVVDVTCSPKPDAFNPCEDIMGYNILRVLIWFISILAITGNIIVLVILTTSQY KLTVPRFLMCNLAFADLCIGIYLLLIASVDIHTKSQYHNYAIDWQTGAGCDAAGFFTV FASELSVYTLTAITLERWHTITHAMQLDCKVQLRHAASVMVMGWIFAFAAALFPIFGI SSYMKVSICLPMDIDSPLSQLYVMSLLVLNVLAFVVICGCYIHIYLTVRNPNIVSSSS DTRIAKRMAMLIFTDFLCMAPISFFAISASLKVPLITVSKAKILLVLFHPINSCANPF LYAIFTKNFRRDFFILLSKCGCYEMQAQIYRTETSSTVHNTHPRNGHCSSAPRVTSGS TYILVPLSHLAQN" BASE COUNT 648 a 596 c 484 g 665 t ORIGIN 1 cgctgagatc tgtggaggtt tttctctgca aatgcagaaa gaaatcaggt ggatggatgc 61 ataattatgg ccctgctcct ggtctctttg ctggcattcc tgagcttggg ctcaggatgt 121 catcatcgga tctgtcactg ctctaacagg gtttttctct gccaagagag caaggtgaca 181 gagattcctt ctgacctccc gaggaatgcc attgaactga ggtttgtcct caccaagctt 241 cgagtcatcc aaaaaggtgc attttcagga tttggggacc tggagaaaat agagatctct 301 cagaatgatg tcttggaggt gatagaggca gatgtgttct ccaaccttcc caaattacat 361 gaaattagaa ttgaaaaggc caacaacctg ctctacatca cccctgaggc cttccagaac 421 cttcccaacc ttcaatatct gttaatatcc aacacaggta ttaagcacct tccagatgtt 481 cacaagattc attctctcca aaaggtttta cttgacattc aagataacat aaacatccac 541 acaattgaaa gaaattcttt cgtggggctg agctttgaaa gtgtgattct atggctgaat 601 aagaatggga ttcaagaaat acacaactgt gcattcaatg gaacccaact agatgcagtg 661 aatctaagcg ataataataa tttagaagaa ttgcctaatg atgttttcca cggagcctct 721 ggaccagtca ttctagatat ttcaagaaca aggatccatt ccctgcctag ctatggctta 781 gaaaatctta agaagctgag ggccaggtcg acttacaact taaaaaagct gcctactctg 841 gaaaagcttg tcgccctcat ggaagccagc ctcacctatc ccagccattg ctgtgccttt 901 gcaaactgga gacggcaaat ctctgagctt catccaattt gcaacaaatc tattttaagg 961 caagaagttg attatatgac tcaggctagg ggtcagagat cctctctggc agaagacaat 1021 gagtccagct acagcagagg atttgacatg acgtacactg agtttgacta tgacttatgc 1081 aatgaagtgg ttgacgtgac ctgctcccct aagccagatg cattcaaccc atgtgaagat 1141 atcatggggt acaacatcct cagagtcctg atatggttta tcagcatcct ggccatcact 1201 gggaacatca tagtgctagt gatcctaact accagccaat ataaactcac agtccccagg 1261 ttccttatgt gcaacctggc ctttgctgat ctctgcattg gaatctacct gctgctcatt 1321 gcatcagttg atatccatac caagagccaa tatcacaact atgccattga ctggcaaact 1381 ggggcaggct gtgatgctgc tggctttttc actgtctttg ccagtgagct gtcagtctac 1441 actctgacag ctatcacctt ggaaagatgg cataccatca cgcatgccat gcagctggac 1501 tgcaaggtgc agctccgcca tgctgccagt gtcatggtga tgggctggat ttttgctttt 1561 gcagctgccc tctttcccat ctttggcatc agcagctaca tgaaggtgag catctgcctg 1621 cccatggata ttgacagccc tttgtcacag ctgtatgtca tgtccctcct tgtgctcaat 1681 gtcctggcct ttgtggtcat ctgtggctgc tatatccaca tctacctcac agtgcggaac 1741 cccaacatcg tgtcctcctc tagtgacacc aggatcgcca agcgcatggc catgctcatc 1801 ttcactgact tcctctgcat ggcacccatt tctttctttg ccatttctgc ctccctcaag 1861 gtgcccctca tcactgtgtc caaagcaaag attctgctgg ttctgtttca ccccatcaac 1921 tcctgtgcca accccttcct ctatgccatc tttaccaaaa actttcgcag agatttcttc 1981 attctgctga gcaagtgtgg ctgctatgaa atgcaagccc aaatttatag gacagaaact 2041 tcatccactg tccacaacac ccatccaagg aatggccact gctcttcagc tcccagagtc 2101 accagtggtt ccacttacat acttgtccct ctaagtcatt tagcccaaaa ctaaaacaca 2161 atgtgaaaat gtatctgagt attgaatgat aattcagtcc ttgcctttga agggtatgtc 2221 acaaggagct gacagtgctt ctacacattt catctaattt aatattcctg gcataccttt 2281 aaggtaaatt ggtcaggaac tattaattcc atgtgataca ttaggaagct gaattattag 2341 taacaacaat aataattaaa gaatgcaata ctgtaaaaaa gcggccgcga att // LOCUS HUMFUMA 1447 bp mRNA PRI 08-NOV-1994 DEFINITION Human mitochondrial fumarase mRNA, complete cds. ACCESSION M15502 NID g182793 KEYWORDS fumarase. SOURCE Human liver, cDNA to mRNA, clone pUC18. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1447) AUTHORS Kinsella,B.T. and Doonan,S. TITLE Nucleotide sequence of a cDNA coding for mitochondrial fumarase from human liver JOURNAL Biosci. Rep. 6 (10), 921-929 (1986) MEDLINE 87157989 FEATURES Location/Qualifiers source 1..1447 /organism="Homo sapiens" /db_xref="taxon:9606" /map="1q42.1" mRNA <1..1447 /gene="FH" /note="fumarase mRNA; G00-119-133" gene 1..1447 /gene="FH" CDS 1..1407 /gene="FH" /note="fumarase precursor (EC 4.2.1.2)" /codon_start=1 /db_xref="GDB:G00-119-133" /db_xref="PID:g182794" /translation="MASQNSFRIEYDTFGELKVPNDKYYGAQTVRSTMNFKIGGVTER MPTPVIKAFGILKRAAAEVNQDYGLDPKIANAIMKAADEVAEGKLNDHFPLVVWQTGS GTQTNMNVNEVISNRAIEMLGGELGSKIPVHPNDHVNKSQSSNDTFPTAMHIAAAIEV HEVLLPGLQKLHDALDAKSKEFAQIIKIGRTHTQDAVPLTLGQEFSGYVQQVKYAMTR IKAAMPRIYELARGGTAVGTGLNTRIGFAEKLAAKVAALTGLPFVTAPNKFEALAAHD ALVELSGAMNTSSCSLMKIANDIRFLGSGPRSGLGELILPENEPGSSIMPGKVNPTQC EAAMTMVAAQVMGNHVAVTVGGSNGHFELNVFKPMMIKNVLHSARLLGDASVSFTENC VVGIQANTERINKLMNESLMLVTALNPHIGYDKAAKIAKTAHKNGSTLKETAIELGYL TAEQFDEWVKPKDMLGPK" mat_peptide 4..1404 /gene="FH" /note="fumarase; G00-119-133" BASE COUNT 456 a 274 c 344 g 373 t ORIGIN 871 bp upstream of PstI site. 1 atggcaagcc aaaattcctt ccggatagaa tatgatacct ttggtgaact aaaggtgcca 61 aatgataagt attatggcgc ccagaccgtg agatctacga tgaactttaa gattggaggt 121 gtgacagaac ggatgccaac cccagttatt aaagcttttg gcatcttgaa gcgagcggcc 181 gctgaagtaa accaggatta tggtcttgat ccaaagattg ctaatgcaat aatgaaggca 241 gcagatgagg tagctgaagg taaattaaat gatcattttc ctctcgtggt atggcagact 301 ggatcaggaa ctcagacaaa tatgaatgta aatgaagtca ttagcaatag agcaattgaa 361 atgttaggag gtgaacttgg cagcaagata cctgtgcatc ccaacgatca tgttaataaa 421 agccagagct caaatgatac ttttcccaca gcaatgcaca ttgctgctgc aatagaagtt 481 catgaagtac tgttaccagg actacagaag ttacatgatg ctcttgatgc aaaatccaaa 541 gagtttgcac agatcatcaa gattggacgt actcatactc aggatgctgt tccacttact 601 cttgggcagg aatttagtgg ttatgttcaa caagtaaaat atgcaatgac aagaataaaa 661 gctgccatgc caagaatcta tgagctcgca cgtggaggca ctgctgttgg tacaggttta 721 aatactagaa ttggctttgc agaaaagctt gctgcaaaag tggctgcact tacaggcttg 781 ccttttgtca ctgctccgaa taaatttgaa gctctggctg ctcatgacgc tctggttgag 841 ctcagtggag ccatgaatac tagctcctgc agtctgatga agatagcaaa tgatattcga 901 tttttgggtt ctggtcctcg gtcaggtctg ggagaattga tcttgcctga aaatgaacca 961 ggaagcagta tcatgccagg caaggtgaac cctactcagt gtgaagctgc aatgaccatg 1021 gttgcagccc aagtcatggg gaaccatgtt gctgtcactg tcggaggcag caatggacat 1081 tttgagttga atgttttcaa gccaatgatg attaaaaatg tgttacactc agccaggctg 1141 ctgggggatg cttcagtttc ctttacagaa aactgcgtgg tgggaatcca ggccaataca 1201 gaaaggatca acaagctgat gaatgagtct ctaatgttgg tgacagctct caatcctcat 1261 atagggtatg acaaggcagc aaagattgct aagacagcac acaaaaatgg atcaacctta 1321 aaggaaactg ctatcgaact tggctatctc acagcagagc agtttgacga atgggtaaaa 1381 cctaaggaca tgctgggtcc aaagtgattt acaaaattta taatgaaaat aaacatgtat 1441 aaaattt // LOCUS HUMFXI 2087 bp mRNA PRI 08-NOV-1994 DEFINITION Human factor XI (blood coagulation factor) mRNA, complete cds. ACCESSION M13142 NID g182832 KEYWORDS blood coagulation factor; factor XI; serine protease; serum glycoprotein. SOURCE Human liver, cDNA to mRNA, clone lambda-HXI-12. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2087) AUTHORS Fujikawa,K., Chung,D.W., Hendrickson,L.E. and Davie,E.W. TITLE Amino acid sequence of human factor XI, a blood coagulation factor with four tandem repeats that are highly homologous with plasma prekallikrein JOURNAL Biochemistry 25 (9), 2417-2424 (1986) MEDLINE 86243360 COMMENT During activation of factor XI an internal peptide bond is cleaved in each of the two chains, resulting in factor XI-a, a serine protease composed of two heavy and two light chains held together by disulfide bonds. FEATURES Location/Qualifiers source 1..2087 /organism="Homo sapiens" /db_xref="taxon:9606" /map="4q35" mRNA <1..2087 /note="FXI mRNA" sig_peptide 44..97 /gene="F11" /note="factor XI signal peptide" gene 44..1921 /gene="F11" CDS 44..1921 /gene="F11" /note="preprofactor XI" /codon_start=1 /db_xref="GDB:G00-119-891" /db_xref="PID:g182833" /translation="MIFLYQVVHFILFTSVSGECVTQLLKDTCFEGGDITTVFTPSAK YCQVVCTYHPRCLLFTFTAESPSEDPTRWFTCVLKDSVTETLPRVNRTAAISGYSFKQ CSHQISACNKDIYVDLDMKGINYNSSVAKSAQECQERCTDDVHCHFFTYATRQFPSLE HRNICLLKHTQTGTPTRITKLDKVVSGFSLKSCALSNLACIRDIFPNTVFADSNIDSV MAPDAFVCGRICTHHPGCLFFTFFSQEWPKESQRNLCLLKTSESGLPSTRIKKSKALS GFSLQSCRHSIPVFCHSSFYHDTDFLGEELDIVAAKSHEACQKLCTNAVRCQFFTYTP AQASCNEGKGKCYLKLSSNGSPTKILHGRGGISGYTLRLCKMDNECTTKIKPRIVGGT ASVRGEWPWQVTLHTTSPTQRHLCGGSIIGNQWILTAAHCFYGVESPKILRVYSGILN QSEIKEDTSFFGVQEIIIHDQYKMAESGYDIALLKLETTVNYTDSQRPICLPSKGDRN VIYTDCWVTGWGYRKLRDKIQNTLQKAKIPLVTNEECQKRYRGHKITHKMICAGYREG GKDACKGDSGGPLSCKHNEVWHLVGITSWGEGCAQRERPGVYTNVVEYVDWILEKTQA V" mat_peptide 98..1204 /gene="F11" /note="factor XI heavy chain" mat_peptide 1205..1918 /gene="F11" /note="factor XI light chain" BASE COUNT 605 a 467 c 494 g 521 t ORIGIN 247 bp upstream of BamHI site. 1 ttattaagaa ttgcagcaag taagccaaca aggtcttttc aggatgattt tcttatatca 61 agtggtacat ttcattttat ttacttcagt ttctggtgaa tgtgtgactc agttgttgaa 121 ggacacctgc tttgaaggag gggacattac tacggtcttc acaccaagcg ccaagtactg 181 ccaggtagtc tgcacttacc acccaagatg tttactcttc actttcacgg cggaatcacc 241 atctgaggat cccacccgat ggtttacttg tgtcctgaaa gacagtgtta cagaaacact 301 gccaagagtg aataggacag cagcgatttc tgggtattct ttcaagcaat gctcacacca 361 aataagcgct tgcaacaaag acatttatgt ggacctagac atgaagggca taaactataa 421 cagctcagtt gccaagagtg ctcaagaatg ccaagaaaga tgcacggatg acgtccactg 481 ccactttttc acgtacgcca caaggcagtt tcccagcctg gagcatcgta acatttgtct 541 actgaagcac acccaaacag ggacaccaac cagaataacg aagctcgata aagtggtgtc 601 tggattttca ctgaaatcct gtgcactttc taatctggct tgtattaggg acattttccc 661 taatacggtg tttgcagaca gcaacatcga cagtgtcatg gctcccgatg cttttgtctg 721 tggccgaatc tgcactcatc atcccggttg cttgtttttt accttctttt cccaggaatg 781 gcccaaagaa tctcaaagaa atctttgtct ccttaaaaca tctgagagtg gattgcccag 841 tacacgcatt aaaaagagca aagctctttc tggtttcagt ctacaaagct gcaggcacag 901 catcccagtg ttctgccatt cttcatttta ccatgacact gatttcttgg gagaagaact 961 ggatattgtt gctgcaaaaa gtcacgaggc ctgccagaaa ctgtgcacca atgccgtccg 1021 ctgccagttt tttacctata ccccagccca agcatcctgc aacgaaggga agggcaagtg 1081 ttacttaaag ctttcttcaa acggatctcc aactaaaata cttcacggga gaggaggcat 1141 ctctggatac acattaaggt tgtgtaaaat ggataatgag tgtaccacca aaatcaagcc 1201 caggatcgtt ggaggaactg cgtctgttcg tggtgagtgg ccgtggcagg tgaccctgca 1261 cacaacctca cccactcaga gacacctgtg tggaggctcc atcattggaa accagtggat 1321 attaacagcc gctcactgtt tctatggggt agagtcacct aagattttgc gtgtctacag 1381 tggcatttta aatcaatctg aaataaaaga ggacacatct ttctttgggg ttcaagaaat 1441 aataatccat gatcagtata aaatggcaga aagcgggtat gatattgcct tgttgaaact 1501 ggaaaccaca gtgaattaca cagattctca acgacccata tgcctgcctt ccaaaggaga 1561 tagaaatgta atatacactg attgctgggt gactggatgg gggtacagaa aactaagaga 1621 caaaatacaa aatactctcc agaaagccaa gataccctta gtgaccaacg aagagtgcca 1681 gaagagatac agaggacata aaataaccca taagatgatc tgtgccggct acagggaagg 1741 agggaaggac gcttgcaagg gagattcggg aggccctctg tcctgcaaac acaatgaggt 1801 ctggcatctg gtaggcatca cgagctgggg cgaaggctgt gctcaaaggg agcggccagg 1861 tgtttacacc aacgtggtcg agtacgtgga ctggattctg gagaaaactc aagcagtgtg 1921 aatgggttcc caggggccat tggagtccct gaaggaccca ggatttgctg ggagagggtg 1981 ttgagttcac tgtgccagca tgcttcctcc acagtaacac gctgaagggg cttggtgttt 2041 gtaagaaaat gctagaagaa aacaaactgt cacaagttgt tatgtcc // LOCUS HUMFXIIIA 3816 bp mRNA PRI 08-NOV-1994 DEFINITION Human placental FXIIIa mRNA, complete cds. ACCESSION M14354 NID g182834 KEYWORDS factor XIIIa; fibrinoligase. SOURCE Human placenta, cDNA to mRNA, clones lambda-gt10-[11,12,20]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3816) AUTHORS Grundmann,U., Amann,E., Zettlmeissl,G. and Kupper,H.A. TITLE Characterization of cDNA coding for human factor XIIIa JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83 (21), 8024-8028 (1986) MEDLINE 87041394 COMMENT Computer-readable sequence of [1] kindly provided by U.Grundmann (27-JAN-1987). FEATURES Location/Qualifiers source 1..3816 /organism="Homo sapiens" /db_xref="taxon:9606" /map="6p25-p24" mRNA <1..3816 /note="FXIIIa mRNA" gene 85..2283 /gene="F13A1" CDS 85..2283 /gene="F13A1" /note="clotting factor XIIIa precursor (EC 2.3.2.13)" /codon_start=1 /db_xref="GDB:G00-120-614" /db_xref="PID:g182835" /translation="MSETSRTAFGGRRAVPPNNSNAAEDDLPTVELQGVVPRGVNLQE FLNVTSVHLFKERWDTNKVDHHTDKYENNKLIVRRGQSFYVQIDLSRPYDPRRDLFRV EYVIGRYPQENKGTYIPVPIVSELQSGKWGAKIVMREDRSVRLSIQSSPKCIVGKFRM YVAVWTPYGVLRTSRNPETDTYILFNPWCEDDAVYLDNEKEREEYVLNDIGVIFYGEV NDIKTRSWSYGQFEDGILDTCLYVMDRAQMDLSGRGNPIKVSRVGSAMVNAKDDEGVL VGSWDNIYAYGVPPSAWTGSVDILLEYRSSENPVRYGQCWVFAGVFNTFLRCLGIPAR IVTNYFSAHDNDANLQMDIFLEEDGNVNSKLTKDSVWNYHCWNEAWMTRPDLPVGFGG WQAVDSTPQENSDGMYRCGPASVQAIKHGHVCFQFDAPFVFAEVNSDLIYITAKKDGT HVVENVDATHIGKLIVTKQIGGDGMMDITDTYKFQEGQEEERLALETALMYGAKKPLN TEGVMKSRSNVDMDFEVENAVLGKDFKLSITFRNNSHNRYTITAYLSANITFYTGVPK AEFKKETFDVTLEPLSFKKEAVLIQAGEYMGQLLEQASLHFFVTARINETRDVLAKQK STVLTIPEIIIKVRGTQVVGSDMTVTVQFTNPLKETLRNVWVHLDGPGVTRPMKKMFR EIRPNSTVQWEEVCRPWVSGHRKLIASMSSDSLRHVYGELDVQIQRRPSM" mat_peptide 88..2280 /gene="F13A1" /note="clotting factor XIIIa" BASE COUNT 1086 a 866 c 897 g 967 t ORIGIN 196 bp upstream of SmaI site; chromosome 6p24-p21.3. 1 gaggaagtcc ccgaggcgca cagagcaagc ccacgcgagg gcacctctgg aggggagcgc 61 ctgcaggacc ttgtaaagtc aaaaatgtca gaaacttcca ggaccgcctt tggaggcaga 121 agagcagttc cacccaataa ctctaatgca gcggaagatg acctgcccac agtggagctt 181 cagggcgtgg tgccccgggg cgtcaacctg caagagtttc ttaatgtcac gagcgttcac 241 ctgttcaagg agagatggga cactaacaag gtggaccacc acactgacaa gtatgaaaac 301 aacaagctga ttgtccgcag agggcagtct ttctatgtgc agattgacct cagtcgtcca 361 tatgacccca gaagggatct cttcagggtg gaatacgtca ttggtcgcta cccacaggag 421 aacaagggaa cctacatccc agtgcctata gtctcagagt tacaaagtgg aaagtggggg 481 gccaagattg tcatgagaga ggacaggtct gtgcggctgt ccatccagtc ttcccccaaa 541 tgtattgtgg ggaaattccg catgtatgtt gctgtctgga ctccctatgg cgtacttcga 601 accagtcgaa acccagaaac agacacgtac attctcttca atccttggtg tgaagatgat 661 gctgtgtatc tggacaatga gaaagaaaga gaagagtatg tcctgaatga catcggggta 721 attttttatg gagaggtcaa tgacatcaag accagaagct ggagctatgg tcagtttgaa 781 gatggcatcc tggacacttg cctgtatgtg atggacagag cacaaatgga cctctctgga 841 agagggaatc ccatcaaagt cagccgtgtg gggtctgcaa tggtgaatgc caaagatgac 901 gaaggtgtcc tcgttggatc ctgggacaat atctatgcct atggcgtccc cccatcggcc 961 tggactggaa gcgttgacat tctattggaa taccggagct ctgagaatcc agtccggtat 1021 ggccaatgct gggtttttgc tggtgtcttt aacacatttt tacgatgcct tggaatacca 1081 gcaagaattg ttaccaatta tttctctgcc catgataatg atgccaattt gcaaatggac 1141 atcttcctgg aagaagatgg gaacgtgaat tccaaactca ccaaggattc agtgtggaac 1201 taccactgct ggaatgaagc atggatgaca aggcctgacc ttcctgttgg atttggaggc 1261 tggcaagctg tggacagcac cccccaggaa aatagcgatg gcatgtatcg gtgtggcccc 1321 gcctcggttc aagccatcaa gcacggccat gtctgcttcc aatttgatgc accttttgtt 1381 tttgcagagg tcaacagcga cctcatttac attacagcta agaaagatgg cactcatgtg 1441 gtggaaaatg tggatgccac ccacattggg aaattaattg tgaccaaaca aattggagga 1501 gatggcatga tggatattac tgatacttac aaattccaag aaggtcaaga agaagagaga 1561 ttggccctag aaactgccct gatgtacgga gctaaaaagc ccctcaacac agaaggtgtc 1621 atgaaatcaa ggtccaacgt tgacatggac tttgaagtgg aaaatgctgt gctgggaaaa 1681 gacttcaagc tctccatcac cttccggaac aacagccaca accgttacac catcacagct 1741 tatctctcag ccaacatcac cttctacacc ggggtcccga aggcagagtt caagaaggag 1801 acgttcgacg tgacgctgga gcccttgtcc ttcaagaaag aggcggtgct gatccaagcc 1861 ggcgagtaca tgggtcagct gctggaacaa gcgtccctgc acttctttgt cacagctcgc 1921 atcaatgaga ccagggatgt tctggccaag caaaagtcca ccgtgctaac catccctgag 1981 atcatcatca aggtccgtgg cactcaggta gttggttctg acatgactgt gacagttcag 2041 tttaccaatc ctttaaaaga aaccctgcga aatgtctggg tacacctgga tggtcctgga 2101 gtaacaagac caatgaagaa gatgttccgt gaaatccggc ccaactccac cgtgcagtgg 2161 gaagaagtgt gccggccctg ggtctctggg catcggaagc tgatagccag catgagcagt 2221 gactccctga gacatgtgta tggcgagctg gacgtgcaga ttcaaagacg accttccatg 2281 tgaatgcaca ggaagctgag atgaaccctg gcatttggcc tcttgtagtc ttggctaagg 2341 aaattctaac gcaaaaatag ctcttgcttt gacttaggtg tgaagaccca gacaggactg 2401 cagagggccc cagagtggag atcccacata tttcaaaaac atacttttcc aaacccaggc 2461 tattcggcaa ggaagttagt ttttaatctc tccaccttcc aaagagtgct aagcattagc 2521 tttaattaag ctctcatagc tcataagagt aacagtcatc atttatcatc acaaatggct 2581 acatctccaa atatcagtgg gctctcttac cagggagatt tgctcaatac ctggcctcat 2641 ttaaaacaag acttcagatt ccccactcag ccttttggga ataatagcac atgatttggg 2701 ctctagaatt ccagtcccct ttctcggggt caggttctac cctccatgtg agaatatttt 2761 tcccaggact agagcacaac ataattttta tttttggcaa agccagaaaa agatctttca 2821 ttttgcacct gcagccaagc aaatgcctgc caaattttag atttaccttg ttagaagagg 2881 tggccccata ttaacaaatt gcatttgtgg gaaacttaac cacctacaag gagataagaa 2941 agcaggtgca acactcaagt ctattgaata atgtagtttt gtgatgcatt ttatagaatg 3001 tgtcacactg tggcctgatc agcaggagcc aatatccctt actttaaccc tttctgggat 3061 gcaatactag gaagtaaagt gaagaattta tctctttagt tagtgattat atttcaccca 3121 tctctcagga atcatctcct ttgcagaatg atgcaggttc aggtcccctt tcagagatat 3181 aataagccca acaagttgaa gaagctggcg gatctagtga ccagatatat agaaggactg 3241 cagccactga ttctctcttg tccttcacat caccattttg agacctcagc ttggcactca 3301 ggtgctgaag ggtaatatgg actcagcctt gcaaatagcc agtgctagtt ctgacccaac 3361 cacagaggat gctgacatca tttgtattat gttccaaggc tactacagag aaggctgcct 3421 gctatgtatt tgcaaggctg atttatggtc agaatttccc tctgatatgt ctagggtgtg 3481 atttaggtca gtagactgtg attcttagca aaaaatgaac agtgataagt atactggggg 3541 caaaatcaga atggaatgct ctggtctata taaccacatt tctgagcctt tgagactgtt 3601 cctgagcctt cagcactaac ctatgagggt gagctggtcc cctctatata tacatcatac 3661 ttaactttac taagtaatct cacagcattt gccaagtctc ccaatatcca attttaaaat 3721 gaaatgcatt ttgctagaca gttaaactgg cttaacttag tatattatta ttaattacaa 3781 tgtaatagaa gcttaaaata aagttaaact gattat // LOCUS HUMG 1376 bp mRNA PRI 03-JAN-1994 DEFINITION Human prostaglandin receptor ep1 subtype mRNA, complete cds. ACCESSION L22647 NID g410208 KEYWORDS prostaglandin receptor ep1 subtype. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1376) AUTHORS Funk,C.D., Furci,L., FitzGerald,G.A., Grygorczyk,R., Rochette,C., Bayne,M.A., Abramovitz,M., Adam,M. and Metters,K.M. TITLE Cloning and expression of a cDNA for the human prostaglandin E receptor EP1 subtype JOURNAL J. Biol. Chem. 268 (35), 26767-26772 (1993) MEDLINE 94075377 REFERENCE 2 (bases 1 to 1376) AUTHORS Funk,C.D. TITLE Direct Submission JOURNAL Submitted (01-NOV-1993) Colin D. Funk, Department of Pharmacology, Vanderbilt University, Nashville, TN 37232, USA FEATURES Location/Qualifiers source 1..1376 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HEL" /cell_type="erythroleukemia cell" /tissue_lib="lambda gt11 of G.J. Roth" 5'UTR 1..74 CDS 75..1283 /codon_start=1 /product="prostaglandin receptor ep1 subtype" /db_xref="PID:g410209" /translation="MSPCGPLNLSLAGEATTCAAPWVPNTSAVPPSGASPALPIFSMT LGAVSNLLALALLAQAAGRLRRRRSATTFLLFVASLLATDLAGHVIPGALVLRLYTAG RAPAGGACHFLGGCMVFFGLCPLLLGCGMAVERCVGVTRPLLHAARVSVARARLALAA VAAVALAVALLPLARVGRYELQYPGTWCFIGLGPPGGWRQALLAGLFASLGLVALLAA LVCNTLSGLALHRARWRRRSRRPPPASGPDSRRRWGAHGPRSASASSASSIASASTFF GGSRSSGSARRARAHDVEMVGQLVGIMVVSCICWSPMLVLVALAVGGWSSTSLQRPLF LAVRLASWNQILDPWVYILLRQAVLRQLLRLLPPRAGAKGGPAGLGLTPSAWEASSLR SSRHSGLSHF" 3'UTR 1284..1376 polyA_signal 1358..1363 BASE COUNT 138 a 526 c 484 g 228 t ORIGIN 1 gggggcggca gggctgagcg gccggtgatg gggaccccac atcccaggca gtgccggcac 61 ccctggcgcc tgacatgagc ccttgcgggc ccctcaacct gagcctggcg ggcgaggcga 121 ccacatgcgc ggcgccctgg gtccccaaca cgtcggccgt gccgccgtcg ggcgcttcgc 181 ccgcgctgcc catcttctcc atgacgctgg gcgccgtgtc caacctgctg gcgctggcgc 241 tgctggcgca ggccgcgggc cgcctgcgac gccgccgctc ggccaccacc ttcctgctgt 301 tcgtggccag cctgctggcc accgacctgg cgggccacgt gatcccgggc gcgctggtgc 361 tgcgtctgta cactgcgggg cgcgctccgg ccggcggggc ctgccacttc ctgggcggct 421 gcatggtctt cttcggcctg tgcccgctgc tgctgggctg tggcatggcc gtggagcgct 481 gcgtgggcgt cacgcggccg ctgctccacg ccgcgcgggt ctcggtcgcc cgcgcgcgcc 541 tggcgctggc cgcggtggcc gcggtggcct tggccgtggc gctgctgccg ctggcgcgcg 601 tgggccgcta tgagctgcag tacccgggca cgtggtgctt catcggcctg ggtcccccgg 661 gcggctggcg ccaggcactg cttgctggcc tcttcgccag cctcggcctg gtcgcgctcc 721 tcgccgcgct ggtgtgcaac acgctcagcg gcctggccct gcatcgcgcc cgctggcgac 781 gccgctcccg acggcctccc ccggcctcag gccccgacag ccggcgtcgc tggggggcgc 841 acggaccccg ctcggcctcc gcctcgtccg cctcgtccat cgcttcggcc tccaccttct 901 ttggcggctc tcggagcagc ggctcggcac gcagagctcg cgcccacgac gtggagatgg 961 tgggccagct tgtcggtatc atggtggtgt cgtgcatctg ctggagccca atgctggtgt 1021 tggtggcgct ggccgtcggc ggctggagct ctacctccct gcagcggcca ctgttcctgg 1081 ccgtgcgcct tgcctcctgg aaccagatcc tggacccttg ggtgtacatc ctactgcgcc 1141 aggccgtgct gcgccaactg cttcgcctct tgcccccgag ggccggagcc aagggcggcc 1201 ccgcggggct gggcctaaca ccgagcgcct gggaggccag ctcgctgcgc agctcccggc 1261 acagcggcct cagccacttc taagcacaac cagaggccca acgactaagc cagcccaccc 1321 tgggctgggc ccaggtgcgc ggcgcagagc ctttgggaat aaaaagccat tctgcg // LOCUS HUMG0S2PE 4466 bp DNA PRI 05-JAN-1995 DEFINITION Human GOS2 gene, 5' flank and cds. ACCESSION M72885 NID g182852 KEYWORDS . SOURCE Homo sapiens blood DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4466) AUTHORS Russell,L. and Forsdyke,D.R. TITLE A human putative lymphocyte G0/G1 switch gene containing a CpG-rich island encodes a small basic protein with the potential to be phosphorylated JOURNAL DNA Cell Biol. 10 (8), 581-591 (1991) MEDLINE 92029620 FEATURES Location/Qualifiers source 1..4466 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lymphocyte" /tissue_type="blood" enhancer 150..156 /note="c-mos enhancer homology; putative" misc_feature 270..286 /note="homeobox homology; putative" misc_feature 292..320 /note="dyad-symmetry element" protein_bind 375..384 /note="AP2 site homology; putative" /bound_moiety="AP2" protein_bind 405..414 /note="GC box homology (Sp1); putative" /bound_moiety="Sp1" misc_feature 499..507 /note="T cell element homology; putative" repeat_region 602..666 /note="TCAGTTT-containing repeats" enhancer 733..742 /note="c-mos enhancer homology; putative" protein_bind 1049..1059 /note="GC box homology (Sp1); putative" /bound_moiety="Sp1" protein_bind 1208..1216 /note="AP3 site homology; putative" /bound_moiety="AP3" protein_bind 1291..1299 /note="AP1 site homology; putative" /bound_moiety="AP1" misc_feature 1607..1627 /note="region of dyad symmetry" protein_bind 1732..1740 /note="AP1 site homology; putative" /bound_moiety="AP1" misc_binding 1810..1818 /bound_moiety="c_myc" protein_bind 1810..1818 /note="AP1 site homology; putative" /bound_moiety="AP1" protein_bind 1828..1841 /note="GC box homology (Sp1); putative" /bound_moiety="Sp1" misc_feature 1829..1858 /note="GC-rich mini-island" repeat_region 1961..2002 /note="CCAAT-containing repeat element" CAAT_signal 1963..1974 repeat_region 2003..2044 /note="CCAAT-containing repeat element" CAAT_signal 2005..2016 misc_feature 2248..2257 /note="TGF-beta consensus; putative" enhancer 2390..2397 /note="Adenovirus E1A enhancer homology; putative" repeat_region 2475..2544 /note="CT/CA repeat element" enhancer 2674..2680 /note="Adenovirus E4F1 enhancer homology; putative" misc_feature 2796..2805 /note="T cell element homology; putative" misc_feature 2863..3965 /note="CpG rich island" misc_feature 2895..2905 /note="CpG rc-fos dyad symmetry SRE arm" misc_feature 3076..3085 /note="T cell element homology; putative" protein_bind 3113..3122 /note="AP2 site homology; putative" /bound_moiety="AP2" misc_feature 3149..3168 /note="dyad symmetry element" TATA_signal 3190..3194 mRNA join(3228..3355,3459..4149) /gene="G0S2" gene join(3228..3355,3459..4149) /gene="G0S2" exon 3228..3355 /gene="G0S2" /number=1 intron 3356..3458 /number=1 misc_feature 3361..3368 /note="thyroid hormone response element" exon 3459..4149 /gene="G0S2" /number=1 protein_bind 3462..3472 /gene="G0S2" /note="GC box homology (Sp1); putative" /bound_moiety="Sp1" CDS 3491..3802 /gene="G0S2" /note="ORF 103 amino acids" /codon_start=1 /db_xref="PID:g182853" /translation="METVQELIPLAKEMMAQKRKGKMVKLYVLGSVLALFGVVLGLME TVCSPFTAARRLRDQEAAVAELQAALERQALQKQALQEKGKQQDTVLGGRALSNRQHA S" protein_bind 3513..3522 /gene="G0S2" /note="AP2 site homology; putative" /bound_moiety="AP2" protein_bind 3674..3683 /gene="G0S2" /note="AP2 site homology; putative" /bound_moiety="AP2" repeat_region 4129..4147 /note="ATTA repeats" polyA_signal 4173..4178 misc_feature 4277..4283 /note="CK-2 cytokine motif homology; putative" BASE COUNT 1174 a 1200 c 980 g 1112 t ORIGIN 1 tctagatctc tagtctataa gaccagagga gacagtggct acacatataa atttcagtgt 61 cttccactga tatccgagtg ataagcaact tcctttctaa aattatgaaa gtaactaagg 121 gtctaaaaaa aatttctagt gcggtagtct taaaactaca aatagttttg tcatatctcc 181 tatgatgact ccctcccttc agctgcctgg agcccagggg tgggcagtga ctttgtgtag 241 gcagcaagca gccggtaaca aaataataat tattattgtt attattatat tataataatt 301 gtaacaataa caattattat tgaagctcat ttacaactaa ccatccaaaa gacctctttc 361 ccctgtgtct tcaatcccca aggcagaggg gtagggacag ttctatccct cctctgcact 421 taacccttga aacacatgca cccctttgtg actttaccct ctgcagatgg ctctgaatgt 481 cttaatgtct gagagaaagg gatttagaaa gcaaaatata aaaattttaa actagtcctt 541 cctaccttcc tagaagtggc aagagttaaa tgttgagata gactcaaggg taggatgact 601 atttcagttt gcctggaagt gtttcagttt gcctggaagt atctcagttt tagtgctcag 661 ttttagcagg agtcctgcat ttcagagaac ccctcaaccc tgagcaaaat aagatggttg 721 gtcaccctat ggcttggttt gaccatcatc gctataacct gtatcttacc tgcatgcatg 781 acacaacaca gtgctttact tccctaaaaa tgacatcagc cccactaaaa tatatgttta 841 gtttccaagc cctacctagt caccttctac cctaggagcc aggttctctt ttcccaccca 901 gacagaggag ctgcactcag aaattcctag acatgagtta acactggatt ccttagcctt 961 ctactcccat catctcctgc tcagccccag ctaccaccta aactaggaag atcaagtcta 1021 ccagtgacct ccgtccatgg cacttgctgc ccctcctctg tccagctctt accaatatag 1081 ctgctggaac ctggaggtca aagtcaaatt atcaaaaaaa ggaactgagc tggtgatgtg 1141 cactaacaca gcaaatcaca ggaaagggga acccaggtaa attacagcct tctgacctag 1201 gaagacgtgt ggtttgcgtc tctgagttac agaaacacag gaaatgctta ctggaccagt 1261 caatttcaga attttgggtc ccaagctagg ctgactcacc ttcagaatgg aaaccacgtg 1321 acagccctta tatcagggca cacatcacat gctcttccag aagtcaatgg gtttggaacc 1381 ctcacagata ttgggaaagc tcactaatca tttctgccag ttatcagagg ttgctctgaa 1441 actataagga agattcaaag aaaatgccaa gactgatatt aaacttggca ggaacccttg 1501 ttacagaatt ttcctgcctg acaaggttaa aagaacaata agcaggaaac acagtcctcc 1561 aggaataatc aattctattt ggcccctggt caccttcact cagactaaat tctaaaacat 1621 agaatttcaa ataagctatt tagataacct tgaccattct ccacacacaa gcccttgcct 1681 gaactattaa tagtcaaggc aaagggtagt tgttattgct gcctttttaa actgaatcat 1741 ctgagaaatt gcttcagacc cccaaagaaa gattactgtt aacaattcaa aaactaaaat 1801 atttgatccc tgagacagcc ttttcccccg acccgccctt cagggctcag tccgaccgac 1861 tgcaaaggct gttgcaagat tgcatcactg acctttgcaa ttttctggcc agtttgattc 1921 cccttctttc ccctgccccc tccctttctc tgcttaaagg cctttggcca atttgcctct 1981 ccttttcccc aagtttgcta accctttggc caatttgcct ctctttttcc ccaagtttgc 2041 taactctagc atatccataa ccaaagccaa actagaacgc tccctcagcc ccaggtgatt 2101 acagctaacc ctggtcaaaa tcaatcctac atcttcacac gtccaagagt actcacactc 2161 tggattctta cctaagctgt ctactacacg cccttctgcc cacaaactgc ttcaaagctg 2221 aagttgagct ggagcagtga agttgtaccc ccaaacccag gagggtggca gagaattatt 2281 gaggagagca tgaaatactt ccattctaaa atggcaagat gaacttctac caacagcccc 2341 ttccatactt gaccccctac ccccaagctc ccaactccac ttctcaagtg gaagtgagaa 2401 caatttgaat ttgaaggctc ttccctgata cacggaagta cataaggaaa cagctcgcag 2461 gcaaagagac taatctctct ctctctctct ctctctctct ctctctctct ctcacacaca 2521 cacacacaca cacacacaca cacacctctg tgcataaaag caccatcaat gaatagtttt 2581 ctatcaactg actctagtta tacatgcatg tacctctaaa taaaaccaac caggcaggaa 2641 agaaacaata ttagcacata ttgctttatc caagcgtaac ctgttctgtc ctgttaccca 2701 gatccttccc ccttgccttc tcctctctga tccattgcca cacacgtggg aaggtgacaa 2761 cccttccgaa taaaaatgaa agctttcttc tttagatgga acccccaaat tccctcatta 2821 tttataatgt caggctgtcc tggacaaggg aagctgtgca cccgctgaca ccagtaagaa 2881 ggttgccgcc atgtcagaga tgtccgcgga cacctccctg ggctccgggt cctcccctgc 2941 gctcgcctgg agtgggacct tcgcgtgcac actggccttc ccacgcgccc cgctgcgatg 3001 gcacccgcgc cgggccccct agctcacaca gtcggagcgt gctcagcgcg tggccacctc 3061 ttgccaggtc ccagccgggt tccaccccct ccttttcccc tcctcttctt cctccccctc 3121 cgagttcccc tggctctgac cgcgctggcc tgggcccgag agcccaggag gcgtgtctca 3181 gagaaaagat ataagcggcc cccggacgct aaagcggtgc cagcggcgga gtctccaact 3241 gggagagctg cagctgccga gaggaggaga acgctgaggt cggtcggacc aacggacgcg 3301 ctgaccgctg ccaactgcag ctcgcgctgc ctcctgctcg cgccgtgcca ctaaggtagt 3361 ccgcctttct atgagccctc cccaagatta gctgggtgcg gggtggtggg agccgttctt 3421 tggtggctga agcccctctc ctgctgctcc tcctgcaggt cattcccgcc tccgagagcc 3481 cagagccgag atggaaacgg tccaggagct gatccccctg gccaaggaga tgatggccca 3541 gaagcgcaag gggaagatgg tgaagctgta cgtgctgggc agcgtgctgg ccctcttcgg 3601 cgtggtgctc ggcctgatgg agactgtgtg cagccccttc acggccgcca gacgtctgcg 3661 ggaccaggag gcagccgtgg cggagctgca ggccgccctg gagcgacagg ctctccagaa 3721 gcaagccctg caggagaaag gcaagcagca ggacacggtc ctcggcggcc gggccctgtc 3781 caaccggcag cacgcctcct aggaactgtg ggagaccagc ggagtgggag ggagacgcag 3841 tagacagaga cagaccgaga gaggaaggga gagacagagg gggcgcgcgc acaggagcct 3901 gactccgctg ggagagtgca ggagcacgtg ctgtttttta tttggactta acttcagaga 3961 aaccgctgac atctagaact gacctaccac aagcagccac caaaggagtt tgggattgag 4021 ttttgctgct gtgcagcact gcattgtcat gacatttcca acactgtgtg aattatctaa 4081 atgcgtctac cattttgcac tagggaggaa ggataaatgc tttttatgtt attattatta 4141 attattacaa tgaccaccat tttgcatttt gaaataaaaa actttttata ccatatctca 4201 tgtaattcct gagaggtgtg gtgtcctggg gtgggaagca gggagggtga gcaggtgggc 4261 gatggtgatg ggttcttacc tgagcactgc agagggagca gcttcctgag ggtcagacac 4321 ttgcttcaca cctaggaact gtgtaataag ttactacatg catataagtc tgttgaggac 4381 ttgtttttcc ttcttgttag gggtgggaag agagaaaatt ttataacttc cgtgagattt 4441 agcattttaa catcaaaagg tagatc // LOCUS HUMG0S3R 3775 bp mRNA PRI 09-MAY-1997 DEFINITION Human G0S3 mRNA, complete cds. ACCESSION L49169 NID g1082037 KEYWORDS GOS3 gene; oncogene; transcription factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3775) AUTHORS Siderovski,D.P., Blum,S., Forsdyke,R.E. and Forsdyke,D.R. TITLE A set of human putative lymphocyte G0/G1 switch genes includes genes homologous to rodent cytokine and zinc finger protein-encoding genes JOURNAL DNA Cell Biol. 9 (8), 579-587 (1990) MEDLINE 91103878 REFERENCE 2 (bases 1 to 3775) AUTHORS Heximer,S.P., Cristillo,A.D., Russell,L. and Forsdyke,D.R. TITLE Sequence analysis and expression in cultured lymphocytes of the human FOSB gene (G0S3) JOURNAL DNA Cell Biol. 15 (12), 1025-1038 (1996) MEDLINE 97138090 REFERENCE 3 (bases 1 to 3775) AUTHORS Forsdyke,D.R. TITLE Direct Submission JOURNAL Submitted (05-MAY-1997) Biochemistry, Queen's University, Kingston, Ontario K7L3N6, Canada COMMENT G0S3 (putative G0/G1 switch regulatory gene 3), was the third cDNA clone picked from a cDNA library prepared from concanavalin-A and cycloheximide stimulated human blood mononuclear cells The corresponding mRNA increases rapidly in response to either concanavalin-A or cycloheximide. Restriction maps of genomic DNA give different Southern blot profiles for FOSB (G0S3) and FOS (G0S7) (Siderovski et al., 1990). The mRNA start site was determined experimentally by primer extension. There is an apparent TATA box in the genomic sequence, which begins 29 nt upstream from the mRNA start site. The murine homolog has a degenerate TATA box in this position (CATA). The murine and human genomic sequences show high homology in this region, suggesting that the CATA box in mice is functional. FEATURES Location/Qualifiers source 1..3775 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lymphoid" /dev_stage="adult" /tissue_type="blood" mRNA 1..3775 /gene="G0S3" /note="a partial cDNA clone extended from base 3775 to terminate in intron 3. 1st and 2nd exon encoded sequences were determined from reverse transcriptase PCR products, up to nt 84. Nc 1-83 are from gene." /evidence=experimental 5'UTR 1..593 /gene="G0S3" gene 1..3775 /gene="G0S3" CDS 594..1610 /gene="G0S3" /note="GOS3 is human homolog of mouse FOSB gene" /codon_start=1 /db_xref="PID:g1082038" /translation="MFQAFPGDYDSGSRCSSSPSAESQYLSSVDSFGSPPTAAASQEC AGLGEMPGSFVPTVTAITTSQDLQWLVQPTLISSMAQSQGQPLASQPPVVDPYDMPGT SYSTPGMSGYSSGGASGSGGPSTSGTTSGPGPARPARARPRRPREETLTPEEEEKRRV RRERNKLAAAKCRNRRRELTDRLQAETDQLEEEKAELESEIAELQKEKERLEFVLVAH KPGCKIPYEEGPGPGPLAEVRDLPGSAPAKEDGFSWLLPPPPPPPLPFQTSQDAPPNL TASLFTHSEVQVLGDPFPVVNPSYTSSFVLTCPEVSAFAGAQRTSGSDQPSDPLNSPS LLAR" allele 1605..1607 /gene="G0S3" /note="non-synonymous codon change relative to genomic sequence HUMMMDBC" 3'UTR 1611..3775 /gene="G0S3" misc_feature 3140..3144 /gene="G0S3" /note="upstream of our originally reported partial cDNA (Siderovski et al., 1990) is a putative mRNA instability signal (ATTTA)" polyA_signal 3754..3759 /gene="G0S3" /note="assigned on basis of position and identity to consensus" polyA_site 3775 /gene="G0S3" /note="polyA tail sequenced in cDNA clone, generated in oligodT-primed cDNA library" /evidence=experimental BASE COUNT 715 a 1175 c 1004 g 881 t ORIGIN 1 cattcataag actcagagct acggccacgg cagggacacg cggaaccaag acttggaaac 61 ttgattgttg tggttcttct tgggggttat gaaatttcat taatcttttt tttttccggg 121 gagaaagttt ttggaaagat tcttccagat atttcttcat tttcttttgg aggaccgact 181 tacttttttt ggtcttcttt attactcccc tccccccgtg ggacccgccg gacgcgtgga 241 ggagaccgta gctgaagctg attctgtaca gcgggacagc gctttctgcc cctgggggag 301 caacccctcc ctcgcccctg ggtcctacgg agcctgcact ttcaagaggt acagcggcat 361 cctgtggggg cctgggcacc gcaggaagac tgcacagaaa ctttgccatt gttggaacgg 421 gacgttgctc cttccccgag cttccccgga cagcgtactt tgaggactcg ctcagctcac 481 cggggactcc cacggctcac cccggacttg caccttactt ccccaacccg gccatagcct 541 tggcttcccg gcgacctcag cgtggtcaca ggggcccccc tgtgcccagg gaaatgtttc 601 aggctttccc cggagactac gactccggct cccggtgcag ctcctcaccc tctgccgagt 661 ctcaatatct gtcttcggtg gactccttcg gcagtccacc caccgccgcg gcctcccagg 721 agtgcgccgg tctcggggaa atgcccggtt ccttcgtgcc cacggtcacc gcgatcacaa 781 ccagccagga cctccagtgg cttgtgcaac ccaccctcat ctcttccatg gcccagtccc 841 aggggcagcc actggcctcc cagcccccgg tcgtcgaccc ctacgacatg ccgggaacca 901 gctactccac accaggcatg agtggctaca gcagtggcgg agcgagtggc agtggtgggc 961 cttccaccag cggaactacc agtgggcctg ggcctgcccg cccagcccga gcccggccta 1021 ggagaccccg agaggagacg ctcaccccag aggaagagga gaagcgaagg gtgcgccggg 1081 aacgaaataa actagcagca gctaaatgca ggaaccggcg gagggagctg accgaccgac 1141 tccaggcgga gacagatcag ttggaggaag aaaaagcaga gctggagtcg gagatcgccg 1201 agctccaaaa ggagaaggaa cgtctggagt ttgtgctggt ggcccacaaa ccgggctgca 1261 agatccccta cgaagagggg cccgggccgg gcccgctggc ggaggtgaga gatttgccgg 1321 gctcagcacc ggctaaggaa gatggcttca gctggctgct gccgcccccg ccaccaccgc 1381 ccctgccctt ccagaccagc caagacgcac cccccaacct gacggcttct ctctttacac 1441 acagtgaagt tcaagtcctc ggcgacccct tccccgttgt taacccttcg tacacttctt 1501 cgtttgtcct cacctgcccg gaggtctccg cgttcgccgg cgcccaacgc accagcggca 1561 gtgaccagcc ttccgatccc ctgaactcgc cctccctcct cgctcggtga actctttaga 1621 cacacaaaac aaacaaacac atgggggaga gagacttgga agaggaggag gaggaggaga 1681 aggaggagag agaggggaag agacaaagtg ggtgtgtggc ctccctggct cctccgtctg 1741 accctctgcg gccactgcgc cactgccatc ggacaggagg attccttgtg ttttgtcctg 1801 cctcttgttt ctgtgccccg gcgaggccgg agagctggtg actttgggga cagggggtgg 1861 gaaggggatg gacaccccca gctgactgtt ggctctctga cgtcaaccca agctctgggg 1921 atgggtgggg aggggggcgg gtgacgccca ccttcgggca gtcctgtgtg aggatgaagg 1981 gacgggggtg ggaggtaggc tgtggggtgg gctggagtcc tctccagaga ggctcaacaa 2041 ggaaaaatgc cactccctac ccaatgtctc ccacacccac cctttttttg gggtgcccag 2101 gttggtttcc cctgcactcc cgaccttagc ttattgatcc cacatttcca tggtgtgaga 2161 tcctctttac tctgggcaga agtgagcccc cccttaaagg gaattcgatg cccccctaga 2221 ataatctcat ccccccaccc gacttctttt gaaatgtgaa cgtccttcct tgactgtcta 2281 gccactccct cccagaaaaa ctggctctga ttggaatttc tggcctccta aggctcccca 2341 ccccgaaatc agcccccagc cttgtttctg atgacagtgt tatcccaaga ccctgccccc 2401 tgccagccga ccctcctggc cttcctcgtt gggccgctct gatttcaggc agcaggggct 2461 gctgtgatgc cgtcctgctg gagtgattta tactgtgaaa tgagttggcc agattgtggg 2521 gtgcagctgg gtggggcagc acacctctgg ggggataatg tccccactcc cgaaagcctt 2581 tcctcggtct cccttccgtc catccccctt cttcctcccc tcaacagtga gttagactca 2641 agggggtgac agaaccgaga agggggtgac agtcctccat ccacgtggcc tctctctctc 2701 tcctcaggac cctcagccct ggcctttttc tttaaggtcc cccgaccaat ccccagccta 2761 ggacgccaac ttctcccacc ccttggcccc tcacatcctc tccaggaagg cagtgagggg 2821 ctgtgacatt tttccggaga agatttcaga gctgaggctt tggtaccccc aaacccccaa 2881 tatttttgga ctggcagact caaggggctg gaatctcatg attccatgcc cgagtccgcc 2941 catccctgac catggttttg gctctcccac cccgccgttc cctgcgcttc atctcatgag 3001 gatttcttta tgaggcaaat ttatattttt taatatcggg gggtggacca cgccgccctc 3061 catccgtgct gcatgaaaaa cattccacgt gccccttgtc gcgcgtctcc catcctgatc 3121 ccagacccat tccttagcta tttatccctt tcctggtttc cgaaaggcaa ttatatctat 3181 tatgtataag taaatatatt atatatggat gtgtgtgtgt gcgtgcgcgt gagtgtgtga 3241 gcgcttctgc agcctcggcc taggtcacgt tggccctcaa agcgagccgt tgaattggaa 3301 actgcttcta gaaactctgg ctcagcctgt ctcgggctga cccttttctg atcgtctcgg 3361 cccctctgat tgttcccgat ggtctctctc cctctgtctt ttctcctccg cctgtgtcca 3421 tctgaccgtt ttcacttgtc tcctttctga ctgtccctgc caatgctcca gctgtcgtct 3481 gactctgggt tcgttgggga catgagattt tattttttgt gagtgagact gagggatcgt 3541 agatttttac aatctgtatc tttgacaatt ctgggtgcga gtgtgagagt gtgagcaggg 3601 cttgctcctg ccaaccacaa ttcaatgaat ccccgacccc cctaccccat gctgtacttg 3661 tggttctctt tttgtatttt gcatctgacc ccggggggct gggacagatt ggcaatgggc 3721 cgtcccctct ccccttggtt ctgcactgtt gccaataaaa agctcttaaa aacgc // LOCUS HUMG13A 1402 bp mRNA PRI 10-AUG-1995 DEFINITION Human guanine nucleotide regulatory protein (G13) mRNA, complete cds. ACCESSION L22075 NID g404721 KEYWORDS guanine nucleotide regulatory protein. SOURCE Homo sapiens infant thymus cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1402) AUTHORS Kabouridis,P.S., Waters,S.T., Escobar,S., Stanners,J. and Tsoukas,C.D. TITLE Expression of GTP-binding protein alpha subunits in human thymocytes JOURNAL Mol. Cell. Biochem. 144 (1), 45-51 (1993) MEDLINE 95311934 FEATURES Location/Qualifiers source 1..1402 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="thymocyte" /dev_stage="infant" /tissue_type="thymus" gene 42..1175 /gene="G13" CDS 42..1175 /gene="G13" /codon_start=1 /function="binding of GTP; signal transduction" /product="guanine nucleotide regulatory protein" /db_xref="PID:g404722" /translation="MADFLPSRSVLSVCFPGCLLTSGEAEQQRKSKEIDKCLSREKTY VKRLVKILLLGAGESGKSTFLKQMRIIHGQDFDQRAREEFRPTIYSNVIKGMRVLVDA REKLHIPWGDNSNQQHGDKMMSFDTRAPMAAQGMVETRVFLQYLPAIRALWADSGIQN AYDRRREFQLGESVKYFLDNLDKLGEPDYIPSQQDILLARRPTKGIHEYDFEIKNVPF KMLDVGGQRSERKRWFECFDSVTSILFLVSSSEFDQVLMEDRLTNRLTESLNIFETIV NNRVFSNVSIILFLNKTDLLEEKVQIVSIKDYFLEFEGDPHCLRDVQKFLVECFRNKR RDQQQKPLYHHFTTAINTENIRLVFRDVKDTILHDNLKQLMLQ" misc_binding 192..242 /gene="G13" /bound_moiety="GTP" misc_binding 705..722 /gene="G13" /bound_moiety="GTP" misc_binding 903..923 /gene="G13" /bound_moiety="GTP" BASE COUNT 375 a 317 c 344 g 366 t ORIGIN 1 tggggccgga gaggcggcga ggcggcggcg gcggcggcaa gatggcggac ttcctgccgt 61 cgcggtccgt gctgtccgtg tgcttccccg gctgcctgct gacgagtggc gaggccgagc 121 agcaacgcaa gtccaaggag atcgacaaat gcctgtctcg ggaaaagacc tatgtgaagc 181 ggctggtgaa gatcctgctg ctgggcgcgg gcgagagcgg caagtccacc ttcctgaagc 241 agatgcggat catccacggg caggacttcg accagcgcgc gcgcgaggag ttccgcccca 301 ccatctacag caacgtgatc aaaggtatga gggtgctggt tgatgctcga gagaagcttc 361 atattccctg gggagacaac tcaaaccaac aacatggaga taagatgatg tcgtttgata 421 cccgggcccc catggcagcc caaggaatgg tggaaacaag ggttttctta caatatcttc 481 ctgctataag agcattatgg gcagacagcg gcatacagaa tgcctatgac cggcgtcgag 541 aatttcaact gggtgaatct gtaaaatatt tcctggataa cttggataaa cttggagaac 601 cagattatat tccatcacaa caagatattc tgcttgccag aagacccacc aaaggcatcc 661 atgaatacga ctttgaaata aaaaatgttc ctttcaaaat gcttgatgta ggtggtcaga 721 gatcagaaag gaaacgttgg tttgaatgtt tcgacagtgt gacatcaata cttttccttg 781 tttcctcaag tgaatttgac caggtgctta tggaagatcg actgaccaat cgccttacag 841 agtctctgaa catttttgaa acaatcgtca ataaccgggt tttcagcaat gtctccataa 901 ttctgttctt aaacaagaca gacttgcttg aggagaaggt gcaaattgtg agcatcaaag 961 actatttcct agaatttgaa ggggatcccc actgcttaag agacgtccaa aaattcctgg 1021 tggaatgttt ccggaacaaa cgccgggacc agcaacagaa gcccttatac caccacttca 1081 ccactgctat caacacggag aacatccgcc ttgttttccg cgacgtgaag gatactattc 1141 tgcatgacaa cctcaagcag cttatgctac agtgatgtac aaaagacttg ctgttttaat 1201 atctttttgt gtttttgatg ttttctgttt gttttgtttt ttaaaatagc agtttacaac 1261 cagaattaga acaatcttaa ttctacgttt aacttcttga aaatcttagt actttttctg 1321 cggcctttgg tttgtggctg aaagctgttg agtgactcat cgccaagatt tgctgtaatg 1381 caggctttga tctgtttcac cc // LOCUS HUMG19P1A 2056 bp mRNA PRI 08-NOV-1994 DEFINITION Human 80K-H protein (kinase C substrate) mRNA, complete cds. ACCESSION J03075 NID g182854 KEYWORDS 80K-H protein; phosphoprotein. SOURCE Human squamous carcinoma Ca9-22 cell line A431, cDNA to mRNA, clones lambda-80H-[1,2]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2056) AUTHORS Sakai,K., Hirai,M., Minoshima,S., Kudoh,J., Fukuyama,R. and Shimizu,N. TITLE Isolation of cDNAs encoding a substrate for protein kinase C: nucleotide sequence and chromosomal mapping of the gene for a human 80K protein JOURNAL Genomics 5 (2), 309-315 (1989) MEDLINE 90007553 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by N.Shimizu, 19-APR-1989. FEATURES Location/Qualifiers source 1..2056 /organism="Homo sapiens" /db_xref="taxon:9606" /map="19" mRNA <1..2056 /note="80K-H mRNA" gene 137..1720 /gene="G19P1" CDS 137..1720 /gene="G19P1" /note="80K-H protein" /codon_start=1 /db_xref="GDB:G00-119-961" /db_xref="PID:g182855" /translation="MLLPLLLLLPMCWAVEVKRPRGVSLTNHHFYDESKPFTCLDGSA TIPFDQVNDDYCDCKDGSDEPGTAACPNGSFHCTNTGYKPLYIPSNRVNDGVCDCCDG TDEYNSGVICENTCKEKGRKERESLQQMAEVTREGFRLKKILIEDWKKAREEKQKKLI ELQAGKKSLEDQVEMLRTVKEEAEKPEREAKEQHQKLWEEQLAAAKAQQEQELAADAF KELDDDMDGTVSVTELQTHPELDTDGDGALSEAEAQALLSGDTQTDATSFYDRVWAAI RDKYRSEALPTDLPAPSAPDLTEPKEEQPPVPSSPTEEEEEEEEEEEEAEEEEEEEDS EEAPPPLSPPQPASPAEEDKMPPYDEQTQAFIDAAQEARNKFEEAERSLKDMEESIRN LEQEISFDFGPNGEFAYLYSQCYELTTNEYVYRLCPFKLVSQKPKLGGSPTSLGTWGS WIGPDHDKFSAMKYEQGTGCWQGPNRSTTVRLLCGKETMVTSTTEPSRCEYLMELMTP AACPEPPPEAPTEDDHDEL" polyA_signal 2036..2041 BASE COUNT 460 a 635 c 623 g 338 t ORIGIN 1 ggaaccgcgg ctgctggaca agaggggtgc ggtggatact gacctttgct ccggcctcgt 61 cgtgaagaca cagcgcatct ccccgctgta ggcttctccc acagaacccg tttcgggcct 121 cagagcgtct ggtgagatgc tgttgccgct gctgctgctg ctacccatgt gctgggccgt 181 ggaggtcaag aggccccggg gcgtctccct caccaatcat cacttctacg atgagtccaa 241 gcctttcacc tgcctggacg gttcggccac catcccattt gatcaggtca acgatgacta 301 ttgcgactgc aaagatggct ctgacgagcc aggcacggct gcctgtccta atggcagctt 361 ccactgcacc aacactggct ataagcccct gtatatcccc tccaaccggg tcaacgatgg 421 tgtttgtgac tgctgcgatg gaacagacga gtacaacagc ggcgtcatct gtgagaacac 481 ctgcaaagag aagggccgta aggagagaga gtccctgcag cagatggccg aggtcacccg 541 cgaagggttc cgtctgaaga agatccttat tgaggactgg aagaaggcac gggaggagaa 601 gcagaaaaag ctcattgagc tacaggctgg gaagaagtct ctggaagacc aggtggagat 661 gctgcggaca gtgaaggagg aagctgagaa gccagagaga gaggccaaag agcagcacca 721 gaagctgtgg gaagagcagc tggctgctgc caaggcccaa caggagcagg agctggcggc 781 tgatgccttc aaggagctgg atgatgacat ggacgggacg gtctcggtga ctgagctgca 841 gactcacccg gagctggaca cagatgggga tggggcgttg tcagaagcgg aagctcaggc 901 cctcctcagt ggggacacac agacagacgc cacctctttc tacgaccgcg tctgggccgc 961 catcagggac aagtaccggt ccgaggcact gcccaccgac cttccagcac cttctgcccc 1021 tgacttgacg gagcccaagg aggagcagcc gccagtgccc tcgtcgccca cagaggagga 1081 ggaggaggag gaggaggagg aagaagaggc tgaagaagag gaggaggagg aggattccga 1141 ggaggcccca ccgccactgt cacccccgca gccggccagc cctgctgagg aagacaaaat 1201 gccgccctac gacgagcaga cgcaggcctt catcgatgct gcccaggagg cccgcaacaa 1261 gttcgaggag gccgagcggt cgctgaagga catggaggag tccatcagga acctggagca 1321 agagatttct tttgactttg gccccaacgg ggagtttgct tacctgtaca gccagtgcta 1381 cgagctcacc accaacgaat acgtctaccg cctctgcccc ttcaagcttg tctcgcagaa 1441 acccaaactc gggggctctc ccaccagcct tggcacctgg ggctcatgga ttggccccga 1501 ccacgacaag ttcagtgcca tgaagtatga gcaaggcacg ggctgctggc agggccccaa 1561 ccgctccacc accgtgcgcc tcctgtgcgg gaaagagacc atggtgacca gcaccacaga 1621 gcccagtcgc tgcgagtacc tcatggagct gatgacgcca gccgcctgcc cggagccacc 1681 gcctgaagca cccaccgaag acgaccatga cgagctctag ctggatgggc gcagagaacc 1741 tcaagaaggc atgaagccag cccctgcagt gccgtccacc cgcccctctg ggcctgcctg 1801 tggctctgtt gccctcctct gtggcggcag gacctttgtg gggcttcgtg ccctgctctg 1861 gggcccaggc ggggctggtc cacattccca ggccccaaca gcctccaaag atgggtaaag 1921 gagcttgccc tccctgggcc ccccaccttg gtgactcgcc ccaccacccc cagccctgtc 1981 cctgccaccc ctcctagtgg ggactagtga atgacttgac ctgtgacctc aatacaataa 2041 atgtgatccc ccaccc // LOCUS HUMG25KA 1014 bp mRNA PRI 08-NOV-1994 DEFINITION Human GTP-binding protein (G25K) mRNA, complete cds. ACCESSION M35543 NID g182856 KEYWORDS G25K gene; GTP-binding protein G25K. SOURCE Human fetal brain, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1014) AUTHORS Munemitsu,S., Innis,M.A., Clark,R., McCormick,F., Ullrich,A. and Polakis,P. TITLE Molecular cloning and expression of a G25K cDNA, the human homolog of the yeast cell cycle gene CDC42 JOURNAL Mol. Cell. Biol. 10 (11), 5977-5982 (1990) MEDLINE 91042529 COMMENT Draft entry and computer-readable sequence for [Unpublished (1990)] kindly submitted by S.Munemitsu, 25-JUN-1990. Author address: S.Munemitsu Cetus Corporation 1400 53rd Street Emeryville, CA 94608. FEATURES Location/Qualifiers source 1..1014 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Unassigned" gene 4..579 /gene="CDC42" CDS 4..579 /gene="CDC42" /note="GTP-binding protein G25K" /codon_start=1 /db_xref="GDB:G00-127-540" /db_xref="PID:g182857" /translation="MQTIKCVVVGDGAVGKTCLLISYTTNKFPSEYVPTVFDNYAVTV MIGGEPYTLGLFDTAGQEDYDRLRPLSYPQTDVFLVCFSVVSPSSFENVKEKWVPEIT HHCPKTPFLLVGTQIDLRDDPSTIEKLAKNKQKPITPETAEKLARDLKAVKYVECSAL TQRGLKNVFDEAILAALEPPETQPKRKCCIF" BASE COUNT 286 a 204 c 210 g 314 t ORIGIN 1 gcaatgcaga caattaagtg tgttgttgtg ggcgatggtg ctgttggtaa aacatgtctc 61 ctgatatcct acacaacaaa caaatttcca tcggaatatg taccgactgt ttttgacaac 121 tatgcagtca cagttatgat tggtggagaa ccatatactc ttggactttt tgatactgca 181 gggcaagagg attatgacag attacgaccg ctgagttatc cacaaacaga tgtatttcta 241 gtctgttttt cagtggtctc tccatcttca tttgaaaacg tgaaagaaaa gtgggtgcct 301 gagataactc accactgtcc aaagactcct ttcttgcttg ttgggactca aattgatctc 361 agagatgacc cctctactat tgagaaactt gccaagaaca aacagaagcc tatcactcca 421 gagactgctg aaaagctggc ccgtgacctg aaggctgtca agtatgtgga gtgttctgca 481 cttacacaga gaggtctgaa gaatgtgttt gatgaggcta tcctagctgc cctcgagcct 541 ccggaaactc aacccaaaag gaagtgctgt atattctaaa ctgttttctc cttcccttct 601 ttgctgctgc ttcctgtccc actactgtag aaagatcgtt taaaaacaaa ggaataaaac 661 catcctgttt gaaagcctct gcgtcttttt actcaccacc ttagagcaac ctctgtatta 721 gtttttgatc aagaattgca atatcatata aattttttgt gatcagtagt caagttggac 781 ttgttttaac gttctgctgc ttgagttgcc tgatgctcag agctttttgg tttggattac 841 tattgcaaag ggaacttggt ctggcttaga tgtcctcttg gagaaaataa caagagtttt 901 aacacttcta gatcttagtt cagatggaga aagtaacaca aacatcattt tactcttatg 961 atcaattgtt aattgtaatt gcatgacaaa ccttatggaa aaggggtgac ctgg // LOCUS HUMG6PA 1464 bp mRNA PRI 08-NOV-1994 DEFINITION Human glucose-6-phosphate dehydrogenase, complete cds. ACCESSION M24470 M27958 NID g182866 KEYWORDS glucose-6-phosphate dehydrogenase. SOURCE Human, cDNA to mRNA, clone NG6PD 1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1464) AUTHORS Kanno,H., Huang,I.Y., Kan,Y.W. and Yoshida,A. TITLE Two structural genes on different chromosomes are required for encoding the major subunit of human red cell glucose-6-phosphate dehydrogenase JOURNAL Cell 58 (3), 595-606 (1989) MEDLINE 89336791 COMMENT Draft entry and sequence for [1] kindly submitted by A.Yoshida, 02-MAY-1989. FEATURES Location/Qualifiers source 1..1464 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Xq28" mRNA <1..1464 /note="glucose-6-phosphate dehydrogenase mRNA" gene 72..1109 /gene="G6PD" CDS 72..1109 /gene="G6PD" /EC_number="1.1.1.49" /codon_start=1 /db_xref="GDB:G00-120-621" /product="glucose-6-phosphate dehydrogenase" /db_xref="PID:g182867" /translation="MPRIDADLKLDFKDVLLRPKRSSLKSRAEVDLERTFTFRNSKQT YSGIPIIVANMDTVGTFEMAAVMSQHSMFTAIHKHYSLDDWKLFATNHPECLQNVAVS SGSGQNDLEKMTSILEAVPQVKFICLDVANGYSEHFVEFVKLVRAKFPEHTIMAGNVV TGEMVEELILSGADIIKVGVGPGSVCTTRTKTGVGYPQLSAVIECADSAHGLKGHIIS DGGCTCPGDVAKAFGTGADFVMLGGMFSGHTECAGEVIERNGRKLKLFYGMSSDTAMN KHAGGVAEYRASEGKTVEVPYKGDVENTILDILGGLRSTCTYVGAAKLKELSRRATFI RVTQQHNTVFS" BASE COUNT 331 a 404 c 389 g 340 t ORIGIN 1 ctccccgcgc cgccccgcgc aggcgccccc gccccgccgt cgccgccgcc gcagccagga 61 gccgctgcac catgccccgc atagatgcgg acctcaagct cgacttcaag gacgtcctgc 121 tccgacctaa gcggagcagc ctcaagagcc gagccgaggt ggatcttgaa cgcaccttca 181 cgtttcgaaa ttcaaagcag acctactcag ggattcccat catcgtggcc aacatggaca 241 ctgtgggcac gtttgagatg gcagccgtga tgtcacagca ctccatgttt acagcaattc 301 ataagcatta ctccctggat gactggaagc tctttgccac aaatcaccca gaatgcctgc 361 agaatgtagc cgtgagttca ggcagtgggc agaatgatct ggaaaagatg accagcatcc 421 tggaagctgt gccacaggtt aagtttattt gcctggatgt ggccaatggg tattcagaac 481 attttgtgga attcgtgaaa cttgtccgtg ccaaatttcc tgaacacacc attatggcag 541 ggaacgtggt gacaggagaa atggtagaag agcttattct ttccggagca gatatcatca 601 aagtgggagt tggaccaggt tctgtgtgca ccacccgcac caagacggga gtggggtacc 661 cccagctgag tgccgtcatt gagtgtgccg actctgccca cggcctgaag ggccacatca 721 tctctgatgg aggctgtacg tgtccagggg atgtcgccaa agcctttgga actggagcag 781 attttgtcat gctgggagga atgttttcgg gtcatacgga gtgtgctgga gaagtgattg 841 agaggaacgg acggaagctc aagctcttct acgggatgag ctctgacacc gccatgaaca 901 agcacgcagg aggagttgct gagtacagag cctctgaggg taagactgtg gaagttcctt 961 acaaaggaga tgtggaaaac actatcctgg atattctcgg gggactgagg tccacgtgca 1021 cctacgtggg ggccgccaaa ctcaaggagc tcagcaggag ggcaacattc atccgggtga 1081 cccagcagca caacaccgtg ttcagctaac cctggggaca aagcagcgtc tggctcgatg 1141 gaagcgtcca aacctgcttt tcccatctcc ccccaagtct gttccgtcag agcttctggc 1201 tgctcctgaa tggtggaatg cctgtgtcct ctcttctgtc tcctgccgcc tggaggcttc 1261 ggggctctcc cgcctgcctt ctcggggccc agacgcaagg caccgattgg gccaacatca 1321 gagccctgct gcccagaact cataacctca ttgttcaaac caacacttgc acctttctct 1381 ttttctcttt ctctctccct ttctttgttt ttctttcttt tttaaaagaa gatggtttca 1441 gctttaatat aatgctatta tctt // LOCUS HUMGA16 2060 bp mRNA PRI 23-JUL-1991 DEFINITION Human G-alpha 16 protein mRNA, complete cds. ACCESSION M63904 NID g182891 KEYWORDS G-alpha 16 protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Amatruda,T.T.III., Steele,D.A., Slepak,V.Z. and Simon,M.I. TITLE G-alpha16, a G protein alpha subunit specifically expressed in hematopoietic cells JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88, 5587-5591 (1991) MEDLINE 91288509 FEATURES Location/Qualifiers source 1..2060 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HL-60" mRNA <1..2060 /product="G-alpha-16 protein" CDS 220..1344 /codon_start=1 /product="G-alpha-16 protein" /db_xref="PID:g182892" /translation="MARSLTWRCCPWCLTEDEKAAARVDQEINRILLEQKKQDRGELK LLLLGPGESGKSTFIKQMRIIHGAGYSEEERKGFRPLVYQNIFVSMRAMIEAMERLQI PFSRPESKHHASLVMSQDPYKVTTFEKRYAAAMQWLWRDAGIRACYERRREFHLLDSA VYYLSHLERITEEGYVPTAQDVLRSRMPTTGINEYCFSVQKTNLRIVDVGGQKSERKK WIHCFENVIALIYLASLSEYDQCLEENNQENRMKESLALFGTILELPWFKSTSVILFL NKTDILEEKIPTSHLATYFPSFQGPKQDAEAAKRFILDMYTRMYTGCVDGPEGSKKGA RSRRLFSHYTCATDTQNIRKVFKDVRDSVLARYLDEINLL" BASE COUNT 415 a 631 c 640 g 374 t ORIGIN 1 tgttcccagc actcaagcct tgccaccgcc gagccgggct tcctgggtgt ttcaggcaag 61 gaagtctagg tccctggggg gtgaccccca aggaaaaggc agcctccctg cgcacccggt 121 tgcccggagc cctctccagg gccggctggg ctgggggttg ccctggccag caggggcccg 181 ggggcgatgc cacccggtgc cgactgaggc caccgcacca tggcccgctc gctgacctgg 241 cgctgctgcc cctggtgcct gacggaggat gagaaggccg ccgcccgggt ggaccaggag 301 atcaacagga tcctcttgga gcagaagaag caggaccgcg gggagctgaa gctgctgctt 361 ttgggcccag gcgagagcgg gaagagcacc ttcatcaagc agatgcggat catccacggc 421 gccggctact cggaggagga gcgcaagggc ttccggcccc tggtctacca gaacatcttc 481 gtgtccatgc gggccatgat cgaggccatg gagcggctgc agattccatt cagcaggccc 541 gagagcaagc accacgctag cctggtcatg agccaggacc cctataaagt gaccacgttt 601 gagaagcgct acgctgcggc catgcagtgg ctgtggaggg atgccggcat ccgggcctgc 661 tatgagcgtc ggcgggaatt ccacctgctc gattcagccg tgtactacct gtcccacctg 721 gagcgcatca ccgaggaggg ctacgtcccc acagctcagg acgtgctccg cagccgcatg 781 cccaccactg gcatcaacga gtactgcttc tccgtgcaga aaaccaacct gcggatcgtg 841 gacgtcgggg gccagaagtc agagcgtaag aaatggatcc attgtttcga gaacgtgatc 901 gccctcatct acctggcctc actgagtgaa tacgaccagt gcctggagga gaacaaccag 961 gagaaccgca tgaaggagag cctcgcattg tttgggacta tcctggaact accctggttc 1021 aaaagcacat ccgtcatcct ctttctcaac aaaaccgaca tcctggagga gaaaatcccc 1081 acctcccacc tggctaccta tttccccagt ttccagggcc ctaagcagga tgctgaggca 1141 gccaagaggt tcatcctgga catgtacacg aggatgtaca ccgggtgcgt ggacggcccc 1201 gagggcagca agaagggcgc acgatcccga cgccttttca gccactacac atgtgccaca 1261 gacacacaga acatccgcaa ggtcttcaag gacgtgcggg actcggtgct cgcccgctac 1321 ctggacgaga tcaacctgct gtgacccagg ccccacctgg ggcaggcggc accggcgggc 1381 gggtgggagg tgggagtggc tgcagggacc ctagtgtcct ggtctatctc tccagcctcg 1441 gcccacacgc aagggagtcg ggggacggcc cgctgctggc cgctctcttc tctgcctctc 1501 accaggacag ccgcccccca gggtactcct gcccttgctt gactcagttt ccctcctttg 1561 aaagggaagg agcaaaacgg ccatttggga tgccagggtg gatgaaaagg tgaagaaatc 1621 aggggattga gacttgggtg ggtgggcatc tctcaggagc cccatctccg ggcgtgtcac 1681 ctcctgggca gggttctggg accctctgtg ggtgacgcac accctgggat ggggctagta 1741 gagccttcag gcgccttcgg gcgtggactc tggcgcactc tagtggacag gagaaggaac 1801 gccttccagg aacctgtgga ctaggggtgc agggacttcc ctttgcaagg ggtaacagac 1861 cgctggaaaa cactgtcact ttcagagctc ggtggctcac agcgtgtcct gccccggttt 1921 gcggacgaga gaaatcgcgg cccacaagca tcccccatcc cttgcaggct gggggctggg 1981 catgctgcat cttaaccttt tgtatttatt ccctcacctt ctgcagggct ccgtgcgggc 2041 tgaaattaaa gatttcttag // LOCUS HUMGABAR 1989 bp mRNA PRI 08-NOV-1994 DEFINITION Human gamma-aminobutyric acid receptor type A rho-1 subunit (GABA-A rho-1) mRNA, complete cds. ACCESSION M62400 M62323 NID g182910 KEYWORDS chloride channel; gamma-aminobutyric acid receptor type A rho-1 subunit. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1989) AUTHORS Cutting,G.R., Lu,L., O'Hara,B.F., Kasch,L.M., Montrose-Rafizadeh,C., Donovan,D.M., Shimada,S., Antonarakis,S.E., Guggino,W.B., Uhl,G.R. and Kazazian,H.H.Jr.. TITLE Cloning of the gamma-aminobutyric acid (GABA) rho 1 cDNA: a GABA receptor subunit highly expressed in the retina JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88 (7), 2673-2677 (1991) MEDLINE 91187854 FEATURES Location/Qualifiers source 1..1989 /organism="Homo sapiens" /db_xref="taxon:9606" /map="5q34-q35" sig_peptide 47..61 /gene="GABRA1" /product="gamma-aminobutyric acid receptor type A rho-1 subunit" CDS 47..1468 /gene="GABRA1" /codon_start=1 /db_xref="GDB:G00-119-966" /product="gamma-aminobutyric acid receptor type A rho-1 subunit" /db_xref="PID:g182911" /translation="MRFGIFLLWWGWVLATESRMHWPGREVHEMSKKGRPQRQRREVH EDAHKQVSPILRRSPDITKSPLTKSEQLLRIDDHDFSMRPGFGGPAIPVGVDVQVESL DSISEVDMDFTMTLYLRHYWKDERLSFPSTNNLSMTFDGRLVKKIWVPDMFFVHSKRS FIHDTTTDNVMLRVQPDGKVLYSLRVTVTAMCNMDFSRFPLDTQTCSLEIESYAYTED DLMLYWKKGNDSLKTDERISLSQFLIQEFHTTTKLAFYSSTGWYNRLYINFTLRRHIF FFLLQTYFPATLMVMLSWVSFWIDRRAVPARVPLGITTVLTMSTIITGVNASMPRVSY IKAVDIYLWVSFVFVFLSVLEYAAVNYLTTVQERKEQKLREKLPCTSGLPPPRTAMLD GNYSDGEVNDLDNYMPENGEKPDRMMVQLTLASERSSPQRKSQRSSYVSMRIDTHAID KYSRIIFPAAYILFNLIYWSIFS" gene 47..1468 /gene="GABRA1" mat_peptide 62..1465 /gene="GABRA1" /product="gamma-aminobutyric acid receptor type A rho-1 subunit" BASE COUNT 537 a 504 c 444 g 503 t 1 others ORIGIN 1 cgagaaggat gtttgaattt ggaaacccat gttggctgtc ccaaatatga gatttggcat 61 ctttcttttg tggtggggat gggttttggc cactgaaagc agaatgcact ggcccggaag 121 agaagtccac gagatgtcta agaaaggcag gccccaaaga caaagacgag aagtacatga 181 agatgcccac aagcaagtca gcccaattct gagacgaagt cctgacatca ccaaatcgcc 241 tctgacaaag tcagaacagc ttctgaggat agatgaccat gatttcagca tgaggcctgg 301 ctttggaggc cctgccattc ctgttggtgt ggatgtgcag gtggagagtt tggatagcat 361 ctcagaggtt gacatggact ttacgatgac cctctacctg aggcactact ggaaggacga 421 gaggctgtct tttccaagca ccaacaacct cagcatgacg tttgatggcc ggctggtcaa 481 gaagatctgg gtccctgaca tgtttttcgt gcactccaaa cgctccttca tccacgacac 541 caccacagac aacgtcatgt tgcgggtcca gcctgatggg aaagtgctct atagtctcag 601 ggttacagta actgcaatgt gcaacatgga cttcagccga tttcccttgg acacacaaac 661 gtgctctctt gaaattgaaa gctatgccta tacagaagat gacctcatgc tgtactggaa 721 aaagggcaat gactccttaa agacagatga acggatctca ctctcccagt tcctcattca 781 ggaattccac accaccacca aactggcttt ctacagcagc acaggctggt acaaccgtct 841 ctacattaat ttcacgttgc gtcgccacat cttcttcttc ttgctccaaa cttatttccc 901 cgctaccctg atggtcatgc tgtcctgggt gtccttctgg atcgaccgca gagccgtgcc 961 tgccagagtc cccttaggta tcacaacggt gctgaccatg tccaccatca tcacgggcgt 1021 gaatgcctcc atgccgcgcg tctcctacat caaggccgtg gacatctacc tctgggtcag 1081 ctttgtgttc gtgttcctct cggtgctgga gtatgcggcc gtcaactacc tgaccactgt 1141 gcaggagagg aaggaacaga agctgcggga gaagcttccc tgcaccagcg gattacctcc 1201 gccccgcact gcaatgctgg acggcaacta cagtgatggg gaggtgaatg acctggacaa 1261 ctacatgcca gagaatggag agaagcccga caggatgatg gtgcagctga ccctggcctc 1321 agagaggagc tccccacaga ggaaaagtca gagaagcagc tatgtgagca tgagaatcga 1381 cacccacgcc attgataaat actccaggat catctttcca gcagcataca ttttattcaa 1441 tttaatatac tggtctattt tctcctagat gcttgtaatt ctacaaattt cacatttcca 1501 tggcatgcac tacagaaata actgtataat gaaaaagtat ttaaggatat ggttaaaaaa 1561 aaatcccagg acccacccat gttttcacta tcccttctgc agctttccaa agctacattg 1621 acgagacact tactggttta atttgcactt attaaccgtc tgttgaatac acagcattat 1681 attaggtgct gcagaaatac gacactgtag cgactgatgt tagttgttac ccagataaaa 1741 tggaaaagca cactaccagt gttgtgggca catttagytc cacccgatta gacccttgat 1801 gctattcaca tgaataattt atttttccct aaaagtgtca ttacattgtt caggctacgt 1861 gaacttggaa gcaccatcag gccatttgca tgaaattcac atgcacctaa atcctcactt 1921 tgacagaaac tcatgcttca gttataacct attacctatt ttgtatgcga ctccacctcc 1981 gcatgttcg // LOCUS HUMGABAT 1705 bp mRNA PRI 11-AUG-1995 DEFINITION Human 4-aminobutyrate aminotransferase (GABAT) mRNA, complete cds. ACCESSION L32961 NID g602704 KEYWORDS 4-aminobutyrate aminotransferase. SOURCE Homo sapiens (tissue library: lambda ZAPII) brain cortex cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1705) AUTHORS Osei,Y.D. and Churchich,J.E. TITLE Screening and sequence determination of a cDNA encoding the human brain 4-aminobutyrate aminotransferase JOURNAL Gene 155 (2), 185-187 (1995) MEDLINE 95237607 FEATURES Location/Qualifiers source 1..1705 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain cortex" /tissue_lib="lambda ZAPII" 5'UTR 1..98 /gene="GABAT" gene 1..1601 /gene="GABAT" CDS 99..1601 /gene="GABAT" /EC_number="2.6.1.19" /codon_start=1 /function="converts 4-aminobutyrate into succinic semi-aldehyde" /product="4-aminobutyrate aminotransferase" /db_xref="PID:g602705" /translation="MASMLLAQRLACSFQHTYRLLVPGSRHISQAAAKVDVEFDYDGP LMKTEVPGPRSQELMKQLNIIQNAEAVHFFCNYEESRGNYLVDVDGNRMLDLYSQISS VPIGYSDPALVKLIQQPQNASMFVNRPALEILPPENFVEKLRQSLLSVAPKGMSQLIT MACGSCSNENALKTIFMWYRSKERGQRGFSKEELETCMINQAPWCPDYSILSFMGSFH GRTMGCLATTHSKAIHKIDIPSFDWPIAPFPRLKYPLEEFVKENQQEEAGCLEEVEDL IVKYRKKKKTVAGIIVEPIQSEGGDNHASDDFFRKLRDIARKHCCAFLVDEVQTGGGC TGKFWAHEHWGLDDPADVMTFSKKMMTGGFFLKEEFRPNAPYRIFNTWLGDPSKNLLL AEVINIIKREDLLNNAAHAGKALLTGLLDLQARYPQFISRVRGRGTFCSFDTPDDSIR NKLILIARNKGVVLGGCGDKSIRFRPTLVFRDHHAHLFLNIFSDILADFK" misc_binding 357 /gene="GABAT" /bound_moiety="cofactor" BASE COUNT 413 a 462 c 451 g 379 t ORIGIN 1 ggctcactgc atctccggct cctggactca agcgattctc ctgcctcagg ctcccaaggt 61 ggcagcacgc aaagggtgtc cctgtccctc aaggggtcat ggcctccatg ttgctcgccc 121 agcggctggc ctgcagcttc cagcacacgt accgcctgct ggtgcctgga tccagacaca 181 ttagtcaagc tgcagccaaa gtcgacgttg aatttgatta tgatgggcct ctgatgaaga 241 cggaagtccc agggcctaga tctcaggagt taatgaaaca gctgaatata attcagaatg 301 cagaggctgt gcattttttc tgcaattacg aagagagccg aggcaattac ctggttgatg 361 tggacggcaa ccgaatgctg gatctttatt cccagatctc ctctgttccc ataggttaca 421 gcgacccggc cctcgtgaaa ctcatccaac agccacaaaa tgcgagcatg tttgtcaaca 481 gacccgccct cgaaatcctg cctccggaga actttgtgga gaagctccgg cagtccttgc 541 tctcggtggc tcccaaaggg atgtcccagc tcatcaccat ggcctgcggc tcctgctcca 601 atgaaaacgc cttaaagacc atcttcatgt ggtaccggag caaggaaaga gggcagaggg 661 gattctccaa agaggagctg gagacgtgca tgattaacca ggccccctgg tgccccgact 721 acagcatcct ctccttcatg ggttccttcc atgggaggac catgggttgc ttagcgacca 781 cgcactctaa agccattcac aagatcgata tcccttcctt tgactggccc atcgcaccgt 841 tcccacggct gaaataccct ctggaagagt ttgtgaaaga gaaccaacag gaagaggccg 901 gctgtctgga agaggttgag gatctgattg tgaaatatcg aaaaaagaag aagacggtgg 961 ccgggatcat cgtggagccc atccagtccg agggtggaga caaccatgca tccgatgact 1021 tctttcggaa gctgagagac atcgccagga agcactgctg cgccttcttg gtggacgagg 1081 tccagaccgg aggaggctgc acgggcaagt tctgggccca tgagcactgg ggcctggatg 1141 acccagcaga cgtgatgacc ttcagcaaga agatgatgac tgggggcttc ttcctcaagg 1201 aggagttcag gcctaatgct ccctaccgga tcttcaacac gtggctgggg gacccgtcca 1261 agaacctgtt gctggctgag gtcatcaaca tcatcaagcg ggaggacctg ctaaataatg 1321 cagcccatgc cgggaaggcc ctgctcacag gactgctgga cctccaggcc cggtaccccc 1381 agttcatcag cagggtgaga ggacgaggca ccttttgctc cttcgatact cccgatgatt 1441 ccatacggaa taagctcatt ttaattgcca gaaacaaagg tgtggtgttg ggtggctgtg 1501 gtgacaaatc cattcgtttc cgtcccacgc tggtgttcag ggatcaccac gctcacctgt 1561 tcctcaatat tttcagtgac atcttagcag acttcaagta aagaagccat ttccactaca 1621 gtgagaaagc ccggatccca acagttgtca aattgattag tttgcctaat tcatgttttc 1681 acttaaaagt atcagaggtg gaatt // LOCUS HUMGABRA5Y 2318 bp mRNA PRI 31-DEC-1994 DEFINITION Human GABA-benzodiazepine receptor alpha-5-subunit (GABRA5) mRNA, complete cds. ACCESSION L08485 NID g182915 KEYWORDS GABA-benzodiazepine receptor; alpha-5-subunit; gamma-aminobutyric acid receptor. SOURCE Homo sapiens (tissue library: lambda ZAP) adult brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2318) AUTHORS Knoll,J.H., Sinnett,D., Wagstaff,J., Glatt,K., Wilcox,A.S., Whiting,P.M., Wingrove,P., Sikela,J.M. and Lalande,M. TITLE FISH ordering of reference markers and of the gene for the alpha 5 subunit of the gamma-aminobutyric acid receptor (GABRA5) within the Angelman and Prader-Willi syndrome chromosomal regions JOURNAL Hum. Mol. Genet. 2 (2), 183-189 (1993) MEDLINE 93271965 FEATURES Location/Qualifiers source 1..2318 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="brain" /tissue_lib="lambda ZAP" /map="15q11q13" sig_peptide 306..398 /gene="GABRA5" CDS 306..1694 /gene="GABRA5" /note="putative" /codon_start=1 /product="GABA-benzodiazepine receptor alpha-5-subunit" /db_xref="PID:g182916" /translation="MDNGMFSGFIMIKNLLLFCISMNLSSHFGFSQMPTSSVKDETND NITIFTRILDGLLDGYDNRLRPGLGERITQVRTDIYVTSFGPVSDTEMEYTIDVFFRQ SWKDERLRFKGPMQRLPLNNLLASKIWTPDTFFHNGKKSIAHNMTTPNKLLRLEDDGT LLYTMRLTISAECPMQLEDFPMDAHACPLKFGSYAYPNSEVVYVWTNGSTKSVVVAED GSRLNQYHLMGQTVGTENISTSTGEYTIMTAHFHLKRKIGYFVIQTYLPCIMTVILSQ VSFWLNRESVPARTVFGVTTVLTMTTLSISARNSLPKVAYATAMDWFIAVCYAFVFSA LIEFATVNYFTKRGWAWDGKKALEAAKIKKKREVILNKSTNAFTTGKMSHPPNIPKEQ TPAGTSNTTSVSVKPSEEKTSESKKTYNSISKIDKMSRIVFPVLFGTFNLVYWATYLN REPVIKGAASPK" gene 306..1694 /gene="GABRA5" mat_peptide 399..1691 /gene="GABRA5" /product="GABA-benzodiazepine receptor alpha-5-subunit" BASE COUNT 659 a 584 c 513 g 562 t ORIGIN 1 aattgcaaga attcccccct tgcaggccga gccggggccc tgcgccctcc ccctccgccc 61 agctcggcca agggcgcatt tgctgagcgt ctggcggcct ctaccggagc acctctgcag 121 agggccgatc ctccagccca gagacgacat gtggcgctcg ggcgagtgcc ttgcagagag 181 aggagtagct tgctggcttt gaacgcgtgg cgtggcagat atttcagaaa gcttcaagaa 241 caagctggag aagggaagag ttattcctcc atattcacct gcttcaacta ctattcttat 301 tgggaatgga caatggaatg ttctctggtt ttatcatgat caaaaacctc cttctctttt 361 gtatttccat gaacttatcc agtcactttg gcttttcaca gatgccaacc agttcagtga 421 aagatgagac caatgacaac atcacgatat ttaccaggat cttggatggg ctcttggatg 481 gctacgacaa cagacttcgg cccgggctgg gagagcgcat cactcaggtg aggaccgaca 541 tctacgtcac cagcttcggc ccggtgtccg acacggaaat ggagtacacc atagacgtgt 601 ttttccgaca aagctggaaa gatgaaaggc ttcggtttaa ggggcccatg cagcgcctcc 661 ctctcaacaa cctccttgcc agcaagatct ggaccccaga cacgttcttc cacaacggga 721 agaagtccat cgctcacaac atgaccacgc ccaacaagct gctgcggctg gaggacgacg 781 gcaccctgct ctacaccatg cgcttgacca tctctgcaga gtgccccatg cagcttgagg 841 acttcccgat ggatgcgcac gcttgccctc tgaaatttgg cagctatgcg taccctaatt 901 ctgaagtcgt ttacgtctgg accaacggct ccaccaagtc ggtggtggtg gcggaagatg 961 gctccagact gaaccagtac cacctgatgg ggcagacggt gggcactgag aacatcagca 1021 ccagcacagg cgaatacaca atcatgacag ctcacttcca cctgaaaagg aagattggct 1081 actttgtcat ccagacctac cttccctgca taatgaccgt gatcttatca caggtgtcct 1141 tttggctgaa ccgggaatca gtcccagcca ggacagtttt tggggtcacc acggtgctga 1201 ccatgacgac cctcagcatc agcgccagga actctctgcc caaagtggcc tacgccaccg 1261 ccatggactg gttcatagct gtgtgctatg ccttcgtctt ctcggcgctg atagagtttg 1321 ccacggtcaa ttactttacc aagagaggct gggcctggga tggcaaaaaa gccttggaag 1381 cagccaagat caagaaaaag cgtgaagtca tactaaataa gtcaacaaac gcttttacaa 1441 ctgggaagat gtctcacccc ccaaacattc cgaaggaaca gaccccagca gggacgtcga 1501 atacaacctc agtctcagta aaaccctctg aagagaagac ttctgaaagc aaaaagactt 1561 acaacagtat cagcaaaatt gacaaaatgt cccgaatcgt attcccagtc ttgttcggca 1621 ctttcaactt agtttactgg gcaacgtatt tgaataggga gccggtgata aaaggagccg 1681 cctctccaaa ataaccggcc acactcccaa actccaagac agccatactt ccagcgaaat 1741 ggtaccaagg agaggttttg ctcacaggga ctctccatat gtgagcacta tctttcagga 1801 aatttttgca tgtttaataa tatgtacaaa taatattgcc ttgatgtttc tatatgtaac 1861 ttcagatgtt tccaagatgt cccattgata attcgagcaa acaactttct ggaaaaacag 1921 gatacgatga ctgacactca gatgcccagt atcatacgtt gatagtttac aaacaagata 1981 cgtatatttt taactgcttc aagtgttacc taacaatgtt ttttatactt caaatgtcat 2041 ttcatacaaa ttttcccagt gaataaatat tttaggaaac tctccatgat tattagaaga 2101 ccaactatat tgcgagaaac agagatcata aagagcacgt tttccattat gaggaaactt 2161 ggacatttat gtacaaaatg aattgccttt gataattctt actgttctga aattaggaaa 2221 gtacttgcat gatcttacac gaagaaatag aataggcaaa cttttatgta ggcagattaa 2281 taacagaaat acatcatatg ttagatacac aaaatatt // LOCUS HUMGABRB3A 1634 bp mRNA PRI 08-NOV-1994 DEFINITION Human gamma amino butyric acid (GABAA) receptor beta-3 subunit mRNA, complete cds. ACCESSION M82919 NID g182924 KEYWORDS GABA-alpha receptor beta-3 subunit; gamma amino butyric acid receptor beta-3 subunit. SOURCE Homo sapiens fetal cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1634) AUTHORS Wagstaff,J., Chaillet,J.R. and Lalande,M. TITLE The GABAA receptor beta 3 subunit gene: characterization of a human cDNA from chromosome 15q11q13 and mapping to a region of conserved synteny on mouse chromosome 7 JOURNAL Genomics 11 (4), 1071-1078 (1991) MEDLINE 92147103 FEATURES Location/Qualifiers source 1..1634 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /map="15q11.2-q12" gene 54..1475 /gene="GABRB3" CDS 54..1475 /gene="GABRB3" /codon_start=1 /db_xref="GDB:G00-127-549" /product="GABA-alpha receptor beta-3 subunit" /db_xref="PID:g182925" /translation="MWGLAGGRLFGIFSAPVLVAVVCCAQSVNDPGNMSFVKETVDKL LKGYDIRLRPDFGGPPVCVGMNIDIASIDMVSEVNMDYTLTMYFQQYWRDKRLAYSGI PLNLTLDNRVADQLWVPDTYFLNDKKSFVHGVTVKNRMIRLHPDGTVLYGLRITTTAA CMMDLRRYPLDEQNCTLEIESYGYTTDDIEFYWRGGDKAVTGVERIELPQFSIVEHRL VSRNVVFATGAYPRLSLSFRLKRNIGYFILQTYMPSILITILSWVSFWINYDASAARV ALGITTVLTMTTINTHLRETLPKIPYVKAIDMYLMGCFVFVFLALLEYAFVNYIFFGR GPQRQKKLAEKTAKAKNDRSKSESNRVDAHGNILLTSLEVHNEMNEVSGGIGDTRNSA ISFDNSGIQYRKQSMPREGHGRFLGDRSLPHKKTHLRRRSSQLKIKIPDLTDVNAIDR WSRIVFPFTFSLFNLVYWLYYVN" BASE COUNT 421 a 399 c 409 g 405 t ORIGIN 1 cgtcgcgacg gcggcggggc gccccctccc ccgtgccggg gcgcggcgga gggatgtggg 61 gccttgcggg aggaaggctt ttcggcatct tctcggcccc ggtgctggtg gctgtggtgt 121 gctgcgccca gagtgtgaac gatcccggga acatgtcctt tgtgaaggag acggtggaca 181 agctgttgaa aggctacgac attcgcctaa gacccgactt cgggggtccc ccggtctgcg 241 tggggatgaa catcgacatc gccagcatcg acatggtttc cgaagtcaac atggattata 301 ccttaaccat gtattttcaa caatattgga gagataaaag gctcgcctat tctgggatcc 361 ctctcaacct cacgcttgac aatcgagtgg ctgaccagct atgggtgccc gacacatatt 421 tcttaaatga caaaaagtca tttgtgcatg gagtgacagt gaaaaaccgc atgatccgtc 481 ttcaccctga tgggacagtg ctgtatgggc tcagaatcac cacgacagca gcatgcatga 541 tggacctcag gagatacccc ctggacgagc agaactgcac tctggaaatt gaaagctatg 601 gctacaccac ggatgacatt gagttttact ggcgaggcgg ggacaaggct gttaccggag 661 tggaaaggat tgagctcccg cagttctcca tcgtggagca ccgtctggtc tcgaggaatg 721 ttgtcttcgc cacaggtgcc tatcctcgac tgtcactgag ctttcggttg aagaggaaca 781 ttggatactt cattcttcag acttatatgc cctctatact gataacgatt ctgtcgtggg 841 tgtccttctg gatcaattat gatgcatctg ctgctagagt tgccctcggg atcacaactg 901 tgctgacaat gacaaccatc aacacccacc ttcgggagac cttgcccaaa atcccctatg 961 tcaaagccat tgacatgtac cttatgggct gcttcgtctt tgtgttcctg gcccttctgg 1021 agtatgcctt tgtcaactac attttctttg gaagaggccc tcaaaggcag aagaagcttg 1081 cagaaaagac agccaaggca aagaatgacc gttcaaagag cgaaagcaac cgggtggatg 1141 ctcatggaaa tattctgttg acatcgctgg aagttcacaa tgaaatgaat gaggtctcag 1201 gcggcattgg cgataccagg aattcagcaa tatcctttga caactcagga atccagtaca 1261 ggaaacagag catgcctcga gaagggcatg ggcgattcct gggggacaga agcctcccgc 1321 acaagaagac ccatctacgg aggaggtctt cacagctcaa aattaaaata cctgatctaa 1381 ccgatgtgaa tgccatagac agatggtcca ggatcgtgtt tccattcact ttttctcttt 1441 tcaacttagt ttactggctg tactatgtta actgagtgac tgtacttgat ttttcaaaga 1501 cttcatttaa cactgagtga aatattactc tgcctgtcaa gtttttatac ctgtacacac 1561 acagacacac aagcagacac acacatatat acatacgcaa ttgtatatat atgtgaactt 1621 ctcagcatat atat // LOCUS HUMGACA 1375 bp mRNA PRI 26-FEB-1997 DEFINITION Human mRNA for Mr 110,000 antigen, complete cds. ACCESSION D64154 NID g994759 KEYWORDS Mr 110,000 antigen. SOURCE Homo sapiens stomach cancer cell_line:GaCa cDNA to mRNA, clone:GP110, G10, G13. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1375) AUTHORS Shimada,S., Ogawa,M., Takahashi,M., Schlom,J. and Greiner,J.W. TITLE Molecular cloning and characterization of the complementary DNA of an M(r) 110,000 antigen expressed by human gastric carcinoma cells and upregulated by gamma-interferon JOURNAL Cancer Res. 54 (14), 3831-3836 (1994) MEDLINE 94306392 REFERENCE 2 (bases 1 to 1375) AUTHORS Shimada,S. TITLE Direct Submission JOURNAL Submitted (16-SEP-1995) to the DDBJ/EMBL/GenBank databases. Shinya Shimada, Kumamoto National Hospital, Surgery; Ninomaru 1-5, Kumamoto, Kumamoto 860, Japan (Tel:096-353-6501(ex.228), Fax:096-325-2519) FEATURES Location/Qualifiers source 1..1375 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="GaCa" /clone="GP110, G10, G13" /tissue_type="stomach cancer" CDS 47..1270 /codon_start=1 /product="Mr 110,000 antigen" /db_xref="PID:d1011683" /db_xref="PID:g1853971" /translation="MTTSGALFPSLVPGSRGASNKYLVEFRAGKMSLKGTTVTPDKRK GLVYIQQTDDSLIHFCWKDRTSGNVEDDLIIFPDDCEFKRVPQCPSGRVYVLKFKAGS KRLFFWMQEPKTDQDEEHCRKVNEYLNNPPMPGALGASGTSGHELSALGGEGGLQSLL GNMSHSQLMQLIGPAGLGGLGGLGALTGPGLASLLGSSGPPGSSSSSSSRSQSAAVTP SSTTSSTRATPAPSAPAAASATSPSPAPSSGNGASTAASPTQPIQLSDLQSILATMNV PAGPAGGQQVDLASVLTPEIMAPILANADVQERLLPYLPSGESLPQTADEIQNTLTSP QFQQALGMFSAALASGQLGPLMCQFGLPAEAVEAANKGDVEAFAKAMQNNAKPEQKEG DTKDKKDEEEDMSLD" sig_peptide 47..91 mat_peptide 92..1267 /product="Mr 110,000 antigen" polyA_signal 1345..1350 BASE COUNT 288 a 440 c 428 g 219 t ORIGIN 1 gcgagcccgg acggcgcctc tcgaacgagt gtgggcgcga ggcaggatga cgacctcagg 61 cgcgctcttt ccaagcctgg tgccaggctc tcggggcgcc tccaacaagt acttggtgga 121 gtttcgggcg ggaaagatgt ccctgaaggg gaccaccgtg actccggata agcggaaagg 181 gctggtgtac attcagcaga cggacgactc gcttattcac ttctgctgga aggacaggac 241 gtccgggaac gtggaagacg acttgatcat cttccctgac gactgtgagt tcaagcgggt 301 gccgcagtgc cccagcggga gggtctacgt gctgaagttc aaggcagggt ccaagcggct 361 tttcttctgg atgcaggaac ccaagacaga ccaggatgag gagcattgcc ggaaagtcaa 421 cgagtatctg aacaaccccc cgatgcctgg ggcactgggg gccagcggaa cgagcggcca 481 cgaactctct gcgctaggcg gtgagggtgg cctgcagagc ctgctgggaa acatgagcca 541 cagccagctc atgcagctca tcggaccagc cggcctcgga ggactgggtg ggctgggggc 601 cctgactgga cctggcctgg ccagcttact ggggagcagt gggcctccag ggagcagctc 661 ctcctccagc tcccggagcc agtcggcagc ggtcaccccg tcatccacca cctcttccac 721 ccgtgccacc ccagcccctt ctgctccagc agctgcctca gcaactagcc cgagccccgc 781 gcccagttcc gggaatggag ccagcacagc agccagcccg acccagccca tccagctgag 841 cgacctccag agcatcctgg ccacgatgaa cgtaccagcc gggccagcag gcggccagca 901 agtggacctg gccagtgtgc tgacgccgga gataatggct cccatcctcg ccaacgcgga 961 tgtccaggag cgcctgcttc cctacttgcc atctggggag tcgctgccgc agaccgcgga 1021 tgagatccag aataccctga cctcgcccca gttccagcag gccctgggca tgttcagcgc 1081 agccttggcc tcggggcagc tgggccccct catgtgccag ttcggtctgc ctgcagaggc 1141 tgtggaggcc gccaacaagg gcgatgtgga agcgtttgcc aaagccatgc agaacaacgc 1201 caagcccgag cagaaagagg gcgacacgaa ggacaagaag gacgaagagg aggacatgag 1261 cctggactga gccacgcgcc gtcctccgag gaactgggcg cttgcagtgc gttgcacacc 1321 ctcacctccc acccactgat tattaataaa gtcttttcct ttacctgcaa aaaaa // LOCUS HUMGAD67A 3610 bp mRNA PRI 23-FEB-1995 DEFINITION Human glutamate decarboxylase (GAD67) mRNA, complete cds. ACCESSION M81883 NID g182935 KEYWORDS glutamate decarboxylase; pyridoxal phosphate coenzyme. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3610) AUTHORS Bu,D.F., Erlander,M.G., Hitz,B.C., Tillakaratne,N.J., Kaufman,D.L., Wagner-McPherson,C.B., Evans,G.A. and Tobin,A.J. TITLE Two human glutamate decarboxylases, 65-kDa GAD and 67-kDa GAD, are each encoded by a single gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (6), 2115-2119 (1992) MEDLINE 92196068 REFERENCE 2 (bases 1 to 3610) AUTHORS Bu,D.F. and Tobin,A.J. TITLE The exon-intron organization of the genes (GAD1 and GAD2) encoding two human glutamate decarboxylases (GAD67 and GAD65) suggests that they derive from a common ancestral GAD JOURNAL Genomics 21 (1), 222-228 (1994) MEDLINE 94375018 FEATURES Location/Qualifiers source 1..3610 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="22 week fetus" /tissue_lib="whole brain, lambda gt11" mRNA <1..>3610 /gene="GAD67" /evidence=experimental gene 1..3610 /gene="GAD67" CDS 551..2335 /gene="GAD67" /EC_number="4.1.1.15" /note="67-kDa isoenzyme" /codon_start=1 /function="directs the synthesis of enzymatically active proteins in prokaryotic and eukaryotic expression systems" /product="glutamate decarboxylase" /db_xref="PID:g182936" /translation="MASSTPSSSATSSNAGADPNTTNLRPTTYDTWCGVAHGCTRKLG LKICGFLQRTNSLEEKSRLVSAFRERQSSKNLLSCENSDRDARFRRTETDFSNLFARD LLPAKNGEEQTVQFLLEVVDILLNYVRKTFDRSTKVLDFHHPHQLLEGMEGFNLELSD HPESLEQILVDCRDTLKYGVRTGHPRFFNQLSTGLDIIGLAGEWLTSTANTNMFTYEI APVFVLMEQITLKKMREIVGWSSKDGDGIFSPGGAISNMYSIMAARYKYFPEVKTKGM AAVPKLVLFTSEQSHYSIKKAGAALGFGTDNVILIKCNERGKIIPADFEAKILEAKQK GYVPFYVNATAGTTVYGAFDPIQEIADICEKYNLWLHVDAAWGGGLLMSRKHRHKLNG IERANSVTWNPHKMMGVLLQCSAILVKEKGILQGCNQMCAGYLFQPDKQYDVSYDTGD KAIQCGRHVDIFKFWLMWKAKGTVGFENQINKCLELAEYLYAKIKNREEFEMVFNGEP EHTNVCFWYIPQSLRGVPDSPQRREKLHKVAPKIKALMMESGTTMVGYQPQGDKANFF RMVISNPAATQSDIDFLIEEIERLGQDL" BASE COUNT 985 a 838 c 861 g 926 t ORIGIN chromosome 2q31. 1 gaattcttcg taggaattat cttttccctc ctctcacccg acagcctgcc tatttccaaa 61 ggaaaaaaaa aaagcgtgtt gagtacgttc tggattactc ataagacctt ttttttttcc 121 ttccgggcgc aaaaccgtga gctggattta taatcgccct ataaagctcc agaggcggtc 181 aggcacctgc agaggagccc cgccgctccg ccgactagct gcccccgcga gcaacggcct 241 cgtgatttcc ccgccgatcc ggtccccgcc tccccactct gcccccgcct accccggagc 301 cgtgcagccg cctctccgaa tctctctctt ctcctggcgc tcgcgtgcga gagggaacta 361 gcgagaacga ggaagcagct ggaggtgacg ccgggcagat tacgcctgtc agggccgagc 421 cgagcggatc gctgggcgct gtgcagagga aaggcgggag tgcccggctc gctgtcgcag 481 agccgagcct gtttctgcgc cggaccagtc gaggactctg gacagtagag gccccgggac 541 gaccgagctg atggcgtctt cgaccccatc ttcgtccgca acctcctcga acgcgggagc 601 ggaccccaat accactaacc tgcgccccac aacgtacgat acctggtgcg gcgtggccca 661 tggatgcacc agaaaactgg ggctcaagat ctgcggcttc ttgcaaagga ccaacagcct 721 ggaagagaag agtcgccttg tgagtgcctt cagggagagg caatcctcca agaacctgct 781 ttcctgtgaa aacagcgacc gggatgcccg cttccggcgc acagagactg acttctctaa 841 tctgtttgct agagatctgc ttccggctaa gaacggtgag gagcaaaccg tgcaattcct 901 cctggaagtg gtggacatac tcctcaacta tgtccgcaag acatttgatc gctccaccaa 961 ggtgctggac tttcatcacc cacaccagtt gctggaaggc atggagggct tcaacttgga 1021 gctctctgac caccccgagt ccctggagca gatcctggtt gactgcagag acaccttgaa 1081 gtatggggtt cgcacaggtc atcctcgatt tttcaaccag ctctccactg gattggatat 1141 tattggccta gctggagaat ggctgacatc aacggccaat accaacatgt ttacatatga 1201 aattgcacca gtgtttgtcc tcatggaaca aataacactt aagaagatga gagagatagt 1261 tggatggtca agtaaagatg gtgatgggat attttctcct gggggcgcca tatccaacat 1321 gtacagcatc atggctgctc gctacaagta cttcccggaa gttaagacaa agggcatggc 1381 ggctgtgcct aaactggtcc tcttcacctc agaacagagt cactattcca taaagaaagc 1441 tggggctgca cttggctttg gaactgacaa tgtgattttg ataaagtgca atgaaagggg 1501 gaaaataatt ccagctgatt ttgaggcaaa aattcttgaa gccaaacaga agggatatgt 1561 tcccttttat gtcaatgcaa ctgctggcac gactgtttat ggagcttttg atccgataca 1621 agagattgca gatatatgtg agaaatataa cctttggttg catgtcgatg ctgcctgggg 1681 aggtgggctg ctcatgtcca ggaagcaccg ccataaactc aacggcatag aaagggccaa 1741 ctcagtcacc tggaaccctc acaagatgat gggcgtgctg ttgcagtgct ctgccattct 1801 cgtcaaggaa aagggtatac tccaaggatg caaccagatg tgtgcaggat atctcttcca 1861 gccagacaag cagtatgatg tctcctacga caccggggac aaggcaattc agtgtggccg 1921 ccacgtggat atcttcaagt tctggctgat gtggaaagca aagggcacag tgggatttga 1981 aaaccagatc aacaaatgcc tggaactggc tgaatacctc tatgccaaga ttaaaaacag 2041 agaagaattt gagatggttt tcaatggcga gcctgagcac acaaacgtct gtttttggta 2101 tattccacaa agcctcaggg gtgtgccaga cagccctcaa cgacgggaaa agctacacaa 2161 ggtggctcca aaaatcaaag ccctgatgat ggagtcaggt acgaccatgg ttggctacca 2221 gccccaaggg gacaaggcca acttcttccg gatggtcatc tccaacccag ccgctaccca 2281 gtctgacatt gacttcctca ttgaggagat agaaagactg ggccaggatc tgtaatcatc 2341 cttcgcagaa catgagttta tgggaatgcc ttttccctct ggcactccag aacaaacctc 2401 tatatgttgc tgaaacacac aggccatttc attgagggaa aacataatat cttgaagaat 2461 attgttaaaa ccttacttaa agcttgtttg ttctagttag caggaaatag tgttcttttt 2521 aaaaagttgc acattaggaa cagagtatat atgtacagtt atacatacct ctctctatat 2581 atacatgtat agtgagtgtg gcttagtaat agatcacggc atgtttcccg ctccaagaga 2641 attcacttta ccttcagcag ttaccgagga gctaaacatg ctgccaacca gcttgtccaa 2701 caactccagg aaaactgttt ttcaaaacgc catgtcctag gggccaaggg aaatgctgtt 2761 ggtgagaatc gacctcactg tcagcgtttc tccacctgaa gtgatgatgg atgagaaaaa 2821 acaccaccaa atgacaagtc acaccctccc cattagtatc ctgttagggg aaaatagtag 2881 cagagtcatt gttacaggtg tactatggct gtattttaga gattaatttg tgtagattgt 2941 gtaaattcct gttgtctgac cttggtggtg ggaggggaga ctatgtgtca tgatttcaat 3001 gattgtttaa ttgtaggtca atgaaatatt tgcttattta tattcagaga tgtaccatgt 3061 taaagaggcg tcttgtattt tcttcccatt tgtaatgtat cttatttata tatgaagtaa 3121 gttctgaaaa ctgtttatgg tattttcgtg catttgtgag ccaaagagaa aagattaaaa 3181 ttagtgagat ttgtatttat attagagtgc ccttaaaata atgatttaag cattttactg 3241 tctgtaagag aattctaaga ttgtacatga cataagttat agtaatcatg gcaaatcctg 3301 ttacttaaat agcatctgct cttctcttac gctctctgtc tggctgtacg tctggtgttc 3361 tcaatgcttt tctagcaact gttggataat aactagatct cctgtaattt tgtagtagtt 3421 gatgaccaat ctctgtgact cgcttagctg aaacctaagg caacatttcc gaagaccttc 3481 tgaagatctc agataaagtg accaggctca caactgtttt tgaagaaggg aaattcacac 3541 tgtgcgtttt gagtatgcaa gaagaatata aataaataaa atatctcatg gagattgaca 3601 aaaaaaaaaa // LOCUS HUMGALAREC 1053 bp mRNA PRI 21-OCT-1994 DEFINITION Human galanin receptor mRNA, complete cds. ACCESSION L34339 NID g559047 KEYWORDS galanin receptor. SOURCE Homo sapiens melanoma cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1053) AUTHORS Habert-Ortoli,E., Amiranoff,B., Loquet,I., Laburthe,M. and Mayaux,J.-F. TITLE Molecular cloning of a functional human galanin receptor JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91, 9780-9783 (1994) MEDLINE 95024044 FEATURES Location/Qualifiers source 1..1053 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="melanoma" CDS 1..1050 /codon_start=1 /product="galanin receptor" /db_xref="PID:g559048" /translation="MELAVGNLSEGNASCPEPPAPEPGPLFGIGVENFVTLVVFGLIF ALGVLGNSLVITVLARSKPGKPRSTTNLFILNLSIADLAYLLFCIPFQATVYALPTWV LGAFICKFIHYFFTVSMLVSIFTLAAMSVDRYVAIVHSRRSSSLRVSRNALLGVGCIW ALSIAMASPVAYHQGLFHPRASNQTFCWEQWPDPRHKKAYVVCTFVFGYLLPLLLICF CYAKVLNHLHKKLKNMSKKSEASKKKTAQTVLVVVVVFGISWLPHHIIHLWAEFGVFP LTPASFLFRITAHCLAYSNSSVNPIIYAFLSENFRKAYKQVFKCHIRKDSHLSDTKEN KSRIDTPPSTNCTHV" BASE COUNT 203 a 343 c 270 g 237 t ORIGIN 1 atggagctgg cggtcgggaa cctcagcgag ggcaacgcga gctgtccgga gccccccgcc 61 ccggagcccg ggccgctgtt cggcatcggc gtggagaact tcgtcacgct ggtggtgttc 121 ggcctgatct tcgcgctggg cgtgctgggc aacagcctag tgatcaccgt gctggcgcgc 181 agcaagccgg gcaagccgcg gagcaccacc aacctgttca tcctcaacct gagcatcgcc 241 gacctggcct acctgctctt ctgcatcccc ttccaggcca ccgtgtacgc gctgcccacc 301 tgggtgctgg gcgccttcat ctgcaagttc atccactact tcttcaccgt gtccatgctg 361 gtgagcatct tcaccctggc cgcgatgtcc gtggaccgct acgtggccat cgtgcactcg 421 cggcgctcct cctccctcag ggtgtcccgc aacgcgctgc tgggcgtggg ctgcatctgg 481 gcgctgtcca ttgccatggc ctcgcccgtg gcctaccacc agggcctctt ccacccgcgc 541 gccagcaacc agaccttctg ctgggagcag tggcccgacc ctcgccacaa gaaggcctac 601 gtggtgtgca ccttcgtctt cggctacctg ctgccgctcc tgctcatctg cttctgctat 661 gccaaggtcc ttaatcactt gcataaaaag ttgaagaaca tgtcaaagaa gtctgaagca 721 tccaagaaaa agactgcaca gacagttctg gtggtggttg tggtgtttgg aatctcctgg 781 ctgccgcacc acatcatcca tctctgggct gagtttggag ttttcccgct gacgccggct 841 tccttcctct tcagaatcac cgcccactgc ctggcgtaca gcaattcctc cgtgaatcct 901 atcatttatg catttctctc tgaaaatttc aggaaggcct ataaacaagt gttcaagtgt 961 cacattcgca aagattcaca cctgagtgat actaaagaaa ataaaagtcg aatagacacc 1021 ccaccatcaa ccaattgtac tcatgtgtga taa // LOCUS HUMGALC 3777 bp mRNA PRI 10-MAR-1994 DEFINITION Homo sapiens galactocerebrosidase (GALC) mRNA, complete cds. ACCESSION L23116 NID g431309 KEYWORDS galactocerebrosidase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3777) AUTHORS Chen,Y.Q., Rafi,M.A., de Gala,G. and Wenger,D.A. TITLE Cloning and expression of cDNA encoding human galactocerebrosidase, the enzyme deficient in globoid cell leukodystrophy JOURNAL Hum. Mol. Genet. 2 (11), 1841-1845 (1993) MEDLINE 94108435 FEATURES Location/Qualifiers source 1..3777 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain, kidney" /tissue_lib="Clontech" gene 48..3777 /gene="GALC" CDS 48..2057 /gene="GALC" /codon_start=1 /product="galctocerebrosidase" /db_xref="PID:g431310" /translation="MTAAAGSAGRAAVPLLLCALLAPGGAYVLDDSDGLGREFDGIGA VSGGGATSRLLVNYPEPYRSQILDYLFKPNFGASLHILKVEIGGDGQTTDGTEPSHMH YALDENYFRGYEWWLMKEAKKRNPNITLIGLPWSFPGWLGKGFDWPYVNLQLTAYYVV TWIVGAKRYHDLDIDYIGIWNERSYNANYIKILRKMLNYQGLQRVKIIASDNLWESIS ASMLLDAELFKVVDVIGAHYPGTHSAKDAKLTGKKLWSSEDFSTLNSDMGAGCWGRIL NQNYINGYMTSTIAWNLVASYYEQLPYGRCGLMTAQEPWSGHYVVESPVWVSAHTTQF TQPGWYYLKTVGHLEKGGSYVALTDGLGNLTIIIETMSHKHSKCIRPFLPYFNVSQQF ATFVLKGSFSEIPELQVWYTKLGKTSERFLFKQLDSLWLLDSDGSFTLSLHEDELFTL TTLTTGRKGSYPLPPKSQPFPSTYKDDFNVDYPFFSEAPNFADQTGVFEYFTNIEDPG EHHFTLRQVLNQRPITWAADASNTISIIGDYNWTNLTIKCDVYIETPDTGGVFIAGRV NKGGILIRSARGIFFWIFANGSYRVTGDLAGWIIYALGRVEVTAKKWYTLTLTIKGHF ASGMLNDKSLWTDIPVNFPKNGWAAIGTHSFEFAQFDNFLVEATR" polyA_site 3777 /gene="GALC" BASE COUNT 1040 a 744 c 813 g 1180 t ORIGIN 1 tggctgagtg gctactctcg gcttcctggc aacgccgagc gaaagctatg actgcggccg 61 cgggttcggc gggccgcgcc gcggtgccct tgctgctgtg tgcgctgctg gcgcccggcg 121 gcgcgtacgt gctcgacgac tccgacgggc tgggccggga gttcgacggc atcggcgcgg 181 tcagcggcgg cggggcaacc tcccgacttc tagtaaatta cccagagccc tatcgttctc 241 agatattgga ttatctcttt aagccgaatt ttggtgcctc tttgcatatt ttaaaagtgg 301 aaataggtgg tgatgggcag acaacagacg gcactgagcc ctcccacatg cattatgcac 361 tagatgagaa ttatttccga ggatacgagt ggtggttgat gaaagaagct aagaagagga 421 atcccaatat tacactcatt gggttgccat ggtcattccc tggatggctg ggaaaaggtt 481 tcgactggcc ttatgtcaat cttcagctga ctgcctatta tgtcgtgacc tggattgtgg 541 gcgccaagcg ttaccatgat ttggacattg attatattgg aatttggaat gagaggtcat 601 ataatgccaa ttatattaag atattaagaa aaatgctgaa ttatcaaggt ctccagcgag 661 tgaaaatcat agcaagtgat aatctctggg agtccatctc tgcatccatg ctccttgatg 721 ccgaactttt caaggtggtt gatgttatag gggctcatta tcctggaacc cattcagcaa 781 aagatgcaaa gttgactggg aagaagcttt ggtcttctga agactttagc actttaaata 841 gtgacatggg tgcaggctgc tggggtcgca ttttaaatca gaattatatc aatggctata 901 tgacttccac aatcgcatgg aatttagtgg ctagttacta tgaacagttg ccttatggga 961 gatgcgggtt gatgacggcc caagagccat ggagtgggca ctacgtggta gaatctcctg 1021 tctgggtatc agctcatacc actcagttta ctcaacctgg ctggtattac ctgaagacag 1081 ttggccattt agagaaagga ggaagctacg tagctctgac tgatggctta gggaacctca 1141 ccatcatcat tgaaaccatg agtcataaac attctaagtg catacggcca tttcttcctt 1201 atttcaatgt gtcacaacaa tttgccacct ttgttcttaa gggatctttt agtgaaatac 1261 cagagctaca ggtatggtat accaaacttg gaaaaacatc cgaaagattt ctttttaagc 1321 agctggattc tctatggctc cttgacagtg atggcagttt cacactgagc ctgcatgaag 1381 atgagctgtt cacactcacc actctcacca ctggtcgcaa aggcagctac ccgcttcctc 1441 caaaatccca gcccttccca agtacctata aggatgattt caatgttgat tacccatttt 1501 ttagtgaagc tccaaacttt gctgatcaaa ctggtgtatt tgaatatttt acaaatattg 1561 aagaccctgg cgagcatcac ttcacgctac gccaagttct caaccagaga cccattacgt 1621 gggctgccga tgcatccaac acaatcagta ttataggaga ctacaactgg accaatctga 1681 ctataaagtg tgatgtttac atagagaccc ctgacacagg aggtgtgttc attgcaggaa 1741 gagtaaataa aggtggtatt ttgattagaa gtgccagagg aattttcttc tggatttttg 1801 caaatggatc ttacagggtt acaggtgatt tagctggatg gattatatat gctttaggac 1861 gtgttgaagt tacagcaaaa aaatggtata cactcacgtt aactattaag ggtcatttcg 1921 cctctggcat gctgaatgac aagtctctgt ggacagacat ccctgtgaat tttccaaaga 1981 atggctgggc tgcaattgga actcactcct ttgaatttgc acagtttgac aactttcttg 2041 tggaagccac acgctaatac ttaacagggc atcatagaat actctggatt ttcttccctt 2101 ctttttggtt ttggttcaga gccaattctt gtttcattgg aacagtatat gaggcttttg 2161 agactaaaaa taatgaagag taaaagggga gagaaattta tttttaattt accctgtgga 2221 agattttatt agaattaatt ccaaggggaa aactggtgaa tctttaacat tacctggtgt 2281 gttccctaac attcaaactg tgcattggcc atacccttag gagtggtttg agtagtacag 2341 acctcgaagc cttgctgcta acacctgagg tagctctctt catcttattt gcgagcggtc 2401 tctgtagagt ggcagtaact tgatcatcac tgagatgtat tgtatgcatg ctgaccgtgt 2461 gtccaagtga gccagtgtct gtcatcacaa gatgatgctg ccataataga aagctgaaga 2521 acactagaag tagcttcttg aaaaccactt caacctgtta tgctttatgc tctaaaaagt 2581 atttttttat tttccttttt aagatgatac ttttgaaatg caggatatgg atgagtggga 2641 tgattttaaa aacgcctgtt taataaacta cctctaacac tatttctgcg gtaatagata 2701 ttagcagatt aattgggtta tttgcattat ttaatttttt tgattccaag gttttggtct 2761 tgtaaccact atcactctct gtgaacgttt ttccaggtgg ctggaagaag gaagaaaacc 2821 tgatatagcc aatgctgttg tagtcgtttc ctcagcctca tctcactgtg ctgtggtctg 2881 tcctcacatg tgcactggta acagactcac acagctgatg aatgcttttc tctccttatg 2941 tgtggaagga ggggagcact tagacatttg ctaactccca gagttggatc atctcctaag 3001 atgtacttac tttttaaagt ccaaatatgt ttatatttaa atatacgtga gcatgttcat 3061 catgttgtat gatttatact aagcattaat gtggctctat gtagcaaatc agttattcat 3121 gtaggtaaag taaatctaga attatttata agaattactc attgaactaa ttctactatt 3181 taggaatttg taagagtcta acataggctt agctacagtg aagttttgca ttgcttttga 3241 agacaagaaa agtgctagaa taaataagat tacagagaaa attttttgtt aaaaccaagt 3301 gatttccagc tgatgtatct aatatttttt aaaacaaaca ttatagaggt gtaatttatt 3361 tacaataaaa tgttcctact ttaaatatac aattcagtga gttttgataa attgatatac 3421 ccatgtaacc aacactccag tcaagcttca gaatatttcc atcaccccag aaggttctct 3481 tgtatacctg ctcagtcagt tcctttcact cccaattgtt ggcagccatt gataggaatt 3541 ctatcactat aggttagttt tctttgttcc agaacatcat gaaagcggcg tcatgtactg 3601 tgtattctta tgaatggttt ctttccatca gcataatgct ttgagattgg tccatgttgt 3661 gtgattcagt ggtttgttcc ttcttatttc tgaaaagttt tccattgtat gaatatacca 3721 caatttgttt cctccccacc agtttctgat actacaatta aaactgtcta catttac // LOCUS HUMGALE 1488 bp mRNA PRI 18-NOV-1997 DEFINITION Homo sapiens UDP-galactose-4-epimerase (GALE) mRNA, complete cds. ACCESSION L41668 NID g2623737 KEYWORDS UDP-galactose-4-epimerase; galactose metabolism; galactosemia. SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1488) AUTHORS Daude,N., Gallaher,T.K., Zeschnigk,M., Starzinski-Powitz,A., Petry,K.G., Haworth,I.S. and Reichardt,J.K. TITLE Molecular cloning, characterization, and mapping of a full-length cDNA encoding human UDP-galactose 4'-epimerase JOURNAL Biochem. Mol. Med. 56 (1), 1-7 (1995) MEDLINE 96161433 FEATURES Location/Qualifiers source 1..1488 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda gt11" /cell_type="foreskin" /chromosome="1" /map="1p36-p35" 5'UTR 1..93 /gene="GALE" /note="G00-119-245" gene 1..1488 /gene="GALE" CDS 94..1140 /gene="GALE" /codon_start=1 /product="UDP-galactose-4-epimerase" /db_xref="PID:g1119217" /db_xref="GDB:G00-119-245" /translation="MAEKVLVTGGAGYIGSHTVLELLEAGYLPVVIDNFHNAFRGGGS LPESLRRVQELTGRSVEFEEMDILDQGALQRLFKKYSFMAVIHFAGLKAVGESVQKPL DYYRVNLTGTIQLLEIMKAHGVKNLVFSSSATVYGNPQYLPLDEAHPTGGCTNPYGKS KFFIEEMIRDLCQADKTWNVVLLRYFNPTGAHASGCIGEDPQGIPNNLMPYVSQVAIG RREALNVFGNDYDTEDGTGVRDYIHVVDLAKGHIAALRKLKEQCGCRIYNLGTGTGYS VLQMVQAMEKASGKKIPYKVVARREGDVAACYANPSLAQEELGWTAALGLDRMCEDLW RWQKQNPSGFGTQA" 3'UTR 1141..1488 /gene="GALE" /note="G00-119-245" BASE COUNT 338 a 414 c 439 g 297 t ORIGIN 1 cgcgacggct gagcaaggac tctccagtcc tcagtcacct tggacaaaga agtgtggatc 61 ctcagattcc atcttttcca actccaaggt gccatggcag agaaggtgct ggtaacaggt 121 ggggctggct acattggcag ccacacggtg ctggagctgc tggaggctgg ctacttgcct 181 gtggtcatcg ataacttcca taatgccttc cgtggagggg gctccctgcc tgagagcctg 241 cggcgggtcc aggagctgac aggccgctct gtggagtttg aggagatgga cattttggac 301 cagggagccc tacagcgtct cttcaaaaag tacagcttta tggcggtcat ccactttgcg 361 gggctcaagg ccgtgggcga gtcggtgcag aagcctctgg attattacag agttaacctg 421 accgggacca tccagcttct ggagatcatg aaggcccacg gggtgaagaa cctggtgttc 481 agcagctcag ccactgtgta cgggaacccc cagtacctgc cccttgatga ggcccacccc 541 acgggtggtt gtaccaaccc ttacggcaag tccaagttct tcatcgagga aatgatccgg 601 gacctgtgcc aggcagacaa gacttggaac gtagtgctgc tgcgctattt caaccccaca 661 ggtgcccatg cctctggctg cattggtgag gatccccagg gcatacccaa caacctcatg 721 ccttatgtct cccaggtggc gatcgggcga cgggaggccc tgaatgtctt tggcaatgac 781 tatgacacag aggatggcac aggtgtccgg gattacatcc atgtcgtgga tctggccaag 841 ggccacattg cagccttaag gaagctgaaa gaacagtgtg gctgccggat ctacaacctg 901 ggcacgggca caggctattc agtgctgcag atggtccagg ctatggagaa ggcctctggg 961 aagaagatcc cgtacaaggt ggtggcacgg cgggaaggtg atgtggcagc ctgttacgcc 1021 aaccccagcc tggcccaaga ggagctgggg tggacagcag ccttagggct ggacaggatg 1081 tgtgaggatc tctggcgctg gcagaagcag aatccttcag gctttggcac gcaagcctga 1141 ggaccctccc ctaccaagga ccaggaaaag cagcagctgc ctgctctcca gcctctggag 1201 gaactcaggg ccctggagct gctggggcca agccaagggc ctcccctacc tcaaacccca 1261 gctgggcccg cttagcccac caggcatgag gccaaggctc cactgaccag gaggccgagg 1321 tctctaactc ttatcttcca cagggtccaa gagttcatca ggacccccaa gagtgagtga 1381 gggggcaagg ctctggcaca aaacctcctc ctcccaggca ctcatttata ttgctctgaa 1441 agagctttcc aaagtattta aaaataaaaa caagttttct tacactgg // LOCUS HUMGAP43A 1231 bp mRNA PRI 08-NOV-1994 DEFINITION Human neuronal growth protein 43 (GAP-43) mRNA, complete cds. ACCESSION M25667 NID g182969 KEYWORDS GAP-43 gene; neuronal growth protein 43. SOURCE Human fetal brain, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1231) AUTHORS Kosik,K.S., Orecchio,L.D., Bruns,G.A., Benowitz,L.I., MacDonald,G.P., Cox,D.R. and Neve,R.L. TITLE Human GAP-43: its deduced amino acid sequence and chromosomal localization in mouse and human JOURNAL Neuron 1 (2), 127-132 (1988) MEDLINE 90166498 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.S.Kosik, 21-JUN-1989. FEATURES Location/Qualifiers source 1..1231 /organism="Homo sapiens" /db_xref="taxon:9606" /map="3q21-qter" gene 92..808 /gene="GAP43" CDS 92..808 /gene="GAP43" /note="neuronal growth factor 43" /codon_start=1 /db_xref="GDB:G00-119-972" /db_xref="PID:g182970" /translation="MLCCMRRTKQVEKNDDDQKIEQDGIKPEDKAHKAATKIQASFRG HITRKKLKGEKKDDVQAAEAEANKKDEAPVADGVEKKGEGTTTAEAAPATGSKPDEPG KAGETPSEEKKGEGDAATEQAAPQAPASSEEKAGSAETESATKASTDNSPSSKAEDAP AKEEPKQADVPAAVTAAAATTPAAEDAAAKATAQPPTETGESSQAEENIEAVDETKPK ESARQDEGKEEEPEADQEHA" BASE COUNT 365 a 302 c 320 g 244 t ORIGIN Chromosome 3. 1 gaattccaga aaagaggtgg agaggggggg aataagaaag agagagaagg aaaggagaga 61 aggcaggaag aaggcaaggg acgagacaac catgctgtgc tgtatgagaa gaaccaaaca 121 ggttgaaaaa aatgatgacg accaaaagat tgaacaagat ggtatcaaac cagaagataa 181 agctcataag gccgcaacca aaattcaggc tagcttccgt ggacacataa caaggaaaaa 241 gctcaaagga gagaagaagg atgatgtcca agctgctgag gctgaagcta ataagaagga 301 tgaagcccct gttgccgatg gggtggagaa gaagggagaa ggcaccacta ctgccgaagc 361 agccccagcc actggctcca agcctgatga gcccggcaaa gcaggagaaa ctccttccga 421 ggagaagaag ggggagggtg atgctgccac agagcaggca gccccccagg ctcctgcatc 481 ctcagaggag aaggccggct cagctgagac agaaagtgcc actaaagctt ccactgataa 541 ctcgccgtcc tccaaggctg aagatgcccc agccaaggag gagcctaaac aagccgatgt 601 gcctgctgct gtcactgctg ctgctgccac cacccctgcc gcagaggatg ctgctgccaa 661 ggcaacagcc cagcctccaa cggagactgg ggagagcagc caagctgaag agaacataga 721 agctgtagat gaaaccaaac ctaaggaaag tgcccggcag gacgagggta aagaagagga 781 acctgaggct gaccaagaac atgcctgaac tctaagaaat ggctttccac atccccaccc 841 tcccctctcc tgagcctgtc tctccctacc ctcttctcag ctccactctg aagtcccttc 901 ctgtcctgct cacgtctgtg agtctgtcct ttcccaccca ctagccctct ttctctctgt 961 gtggcaaaca tttaaaaaaa aaaaaaaaaa gcaggaaaga tcccaagtca aacagtgtgg 1021 cttaaacatt ttttgtttct tggtgttgtt atggcaagtt tttggtaatg atgattcaat 1081 cattttggga aattcttgca ctgtatccaa gttatttgat ctggtgcgtg tggccctgtg 1141 ggagtccact ttcctctctc tctctctctc tgttccaagt gtgtgtgcaa tgttccgttc 1201 atctgaggag tccaaaatat tgagtgaatt c // LOCUS HUMGAPA 4307 bp mRNA PRI 08-NOV-1994 DEFINITION Human GTPase-activating protein ras p21 (RASA) mRNA, complete cds. ACCESSION M23379 NID g182971 KEYWORDS GTPase-activating protein. SOURCE Human placenta, cDNA to mRNA, clone 101. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4307) AUTHORS Trahey,M., Wong,G., Halenbeck,R., Rubinfeld,B., Martin,G.A., Ladner,M., Long,C.M., Crosier,W.J., Watt,K., Koths,K. and McCormick,F. TITLE Molecular cloning of two types of GAP complementary DNA from human placenta JOURNAL Science 242 (4886), 1697-1700 (1988) MEDLINE 89072759 COMMENT Draft entry and computer readable sequence for [1] kindly submitted by C.M.Long, 31-MAR-1989. For sequence of clone 16 refer to M23612. FEATURES Location/Qualifiers source 1..4307 /organism="Homo sapiens" /db_xref="taxon:9606" /map="5q13" gene 119..3262 /gene="RASA" CDS 119..3262 /gene="RASA" /note="ras p21 GTP-ase-activating protein (GAP)" /codon_start=1 /db_xref="GDB:G00-120-339" /product="GTPase-activating protein" /db_xref="PID:g182972" /translation="MMAAEAGSEEGGPVTAGAGGGGAAAGSSAYPAVCRVKIPAALPV AAAPYPGLVETGVAGTLGGGAALGSEFLGAGSVAGALGGAGLTGGGTAAGVAGAAAGV AGAAVAGPSGDMALTKLPTSLLAETLGPGGGFPPLPPPPYLPPLGAGLGTVDEGDSLD GPEYEEEEVAIPLTAPPTNQWYHGKLDRTIAEERLRQAGKSGSYLIRESDRRPGSFVL SFLSQMNVVNHFRIIAMCGDYYIGGRRFSSLSDLIGYYSHVSCLLKGEKLLYPVAPPE PVEDRRRVRAILPYTKVPDTDEISFLKGDMFIVHNELEDGWMWVTNLRTDEQGLIVED LVEEVGREEDPHEGKIWFHGKISKQEAYNLLMTVGQVCSFLVRPSDNTPGDYSLYFRT NENIQRFKICPTPNNQFMMGGRYYNSIGDIIDHYRKEQIVEGYYLKEPVPMQDQEQVL NDTVDGKEIYNTIRRKTKDAFYKNIVKKGYLLKKGKGKRWKNLYFILEGSDAQLIYFE SEKRATKPKGLIDLSVCSVYVVHDSLFGRPNCFQIVVQHFSEEHYIFYFAGETPEQAE DWMKGLQAFCNLRKSSPGTSNKRLRQVSSLVLHIEEAHKLPVKHFTNPYCNIYLNSVQ VAKTHAREGQNPVWSEEFVFDDLPPDINRFEITLSNKTKKSKDPDILFMRCQLSRLQK GHATDEWFLLSSHIPLKGIEPGSLRVRARYSMEKIMPEEEYSEFKELILQKELHVVYA LSHVCGQDRTLLASILLRIFLHEKLESLLLCTLNDREISMEDEATTLFRATTLASTLM EQYMKATATQFVHHALKDSILKIMESKQSCELSPSKLEKNEDVNTNLTHLLNILSELV EKIFMASEILPPTLRYIYGCLQKSVQHKWPTNTTMRTRVVSGFVFLRLICPAILNPRM FNIISDSPSPIAARTLILVAKSVQNLANLVEFGAKEPYMEGVNPFIKSNKHRMIMFLD ELGNVPELPDTTEHSRTDLSRDLAALHEICVAHSDELRTLSNERGAQQHVLKKLLAIT ELLQQKQNQYTKTNDVR" BASE COUNT 1280 a 856 c 957 g 1214 t ORIGIN 1 cctcagcctg gggagctgaa ggggagacgc gtctgggtgg ggctgctcgg agcccgggcc 61 tggtggcccc tggggctccc gggcgggcag ggtagggcag agtagagcgg gcttcaacat 121 gatggcggcc gaggccggca gtgaggaggg cggcccggta acagccggag ctggaggagg 181 cggcgcggca gcgggctcca gtgcctatcc cgcagtgtgt cgggtgaaga tacccgcggc 241 cctgcctgtg gcagccgccc cctatcctgg gctggtggag accggagtgg ctggaactct 301 gggtggcgga gccgctttgg ggtcagagtt cctaggagcc gggtctgtgg caggggcact 361 ggggggagct ggactgacag ggggaggtac tgctgctggc gtagctggtg ctgctgctgg 421 cgtggccggt gctgctgttg ctggacctag tggagacatg gctctcacca aactgcccac 481 ttcgttgctt gctgagactc tcgggccagg cggcggtttt ccccctctgc cccctccccc 541 ttacctgccc cctttggggg cgggcctcgg gacagtggac gaaggtgact ctctggatgg 601 accagaatac gaggaggaag aggtggccat accgttgacc gctcctccaa ctaaccagtg 661 gtatcacgga aaacttgaca gaacgatagc agaagaacgc ctcaggcagg cagggaagtc 721 tggcagttat cttataagag agagtgatcg gaggccaggg tcctttgtac tttcatttct 781 tagccagatg aatgttgtca accattttag gattattgct atgtgtggag attactacat 841 tggtggaaga cgtttttctt cactgtcaga cctaataggt tattacagtc atgtttcttg 901 tttgcttaaa ggagaaaaat tactttaccc agttgcacca ccagagccag tagaagatag 961 aaggcgtgta cgagctattc taccttacac aaaagtacca gacactgatg aaataagttt 1021 cttaaaagga gatatgttca ttgttcataa tgaattagaa gatggatgga tgtgggttac 1081 aaatttaaga acagatgaac aaggccttat tgttgaagac ctagtagaag aggtgggccg 1141 ggaagaagat ccacatgaag gaaaaatatg gttccatggg aagatttcca aacaggaagc 1201 ttataattta ctaatgacag ttggtcaagt ctgcagtttt cttgtgaggc cctcagataa 1261 tactcctggc gattattcac tttatttccg gaccaatgaa aatattcagc gatttaaaat 1321 atgtccaacg ccaaacaatc agtttatgat gggaggccgg tattataaca gcattgggga 1381 catcatagat cactatcgaa aagaacagat tgttgaagga tattatctta aggaacctgt 1441 accaatgcag gatcaagaac aagtactcaa tgacacagtg gatggcaagg aaatctataa 1501 taccatccgt cgtaaaacaa aggatgcctt ttataaaaac attgttaaga aaggttatct 1561 tctgaaaaag ggcaaaggaa aacgttggaa aaatttatat tttatcttag agggtagtga 1621 tgcccaactt atttattttg aaagcgaaaa acgagctacc aaaccaaaag gattaataga 1681 tctcagtgta tgttctgtct atgtcgttca tgatagtctc tttggcaggc caaactgttt 1741 tcagatagta gttcagcact ttagtgaaga acattacatc ttttactttg caggagaaac 1801 tccagaacaa gcagaggatt ggatgaaagg tctgcaggca ttttgcaatt tacggaaaag 1861 tagtccaggg acatccaata aacgccttcg tcaggtcagc agccttgttt tacatattga 1921 agaagcccat aaactcccag taaaacattt tactaatcca tattgtaaca tctacctgaa 1981 tagtgtccaa gtagcaaaaa ctcatgcaag ggaagggcaa aacccagtat ggtcagaaga 2041 gtttgtcttt gatgatcttc ctcctgacat caatagattt gaaataactc ttagtaataa 2101 aacaaagaaa agcaaagatc ctgatatctt atttatgcgc tgccagttga gccgattaca 2161 gaaagggcat gccacagatg aatggtttct gctcagctcc catataccat taaaaggtat 2221 tgaaccaggg tccctgcgtg ttcgagcacg atactctatg gaaaaaatca tgccagaaga 2281 agagtacagt gaatttaaag agcttatact gcaaaaggaa cttcatgtag tctatgcttt 2341 atcacatgta tgtggacaag accgaacact actggccagc atcctactga ggatttttct 2401 tcacgaaaag cttgaatcgt tgttgttatg cacactaaat gacagagaaa taagcatgga 2461 agatgaagcc actaccctat ttcgagccac aacacttgca agcaccttga tggagcagta 2521 tatgaaagcc actgctacac agtttgttca tcatgctttg aaagactcta ttttaaagat 2581 aatggaaagc aagcagtctt gtgagttaag tccatcaaag ttagaaaaaa atgaagatgt 2641 gaacactaat ttaacacacc tattgaacat actttcagag cttgtggaga aaatattcat 2701 ggcttcagaa atacttccac cgacattgag atatatttat gggtgtttac agaaatctgt 2761 tcagcataag tggcctacaa ataccaccat gagaacaaga gttgttagtg gttttgtttt 2821 tcttcgactc atctgtcctg ccatcctgaa tccacggatg ttcaatatca tctcagattc 2881 tccatctcct attgctgcaa gaacactgat attagtggct aaatctgtgc agaacttagc 2941 aaatcttgtg gaatttggag ctaaggagcc ctacatggaa ggtgtcaatc cattcatcaa 3001 aagcaacaaa catcgtatga tcatgttttt agatgaactt gggaatgtac ctgaacttcc 3061 ggacactaca gagcattcta gaacggacct gtcccgtgat ttagcagcat tgcatgagat 3121 ttgcgtggct cattcagatg aacttcgaac gctcagtaat gagcgtggtg cacagcagca 3181 cgtattgaaa aagcttctgg ctataacaga actgcttcaa caaaaacaaa accagtatac 3241 aaaaaccaat gatgtcaggt agcagccttc gccccagtgt tctgcatgga ttcagcatgt 3301 ccaacatggt aattcacttc agtttaatgt ctcctttgct cttgccaaaa aatagcacac 3361 ttttccacat tccagtgatg tgtgagctat gcaaacaaaa tccaagattc tgctggtgaa 3421 taactatgcc agcaaccttg taagctatct gtgcaggata tttgcactat ttccacatgg 3481 aatcaatctt taacaacctc tgagccttgg tgtacagacc acctttcaca aaacgaaatg 3541 ctatgactgt atcttgatat ctcgaacttt caaaatatat tttcagtaca cccagttgcc 3601 aaagttttgc tgtctcttag agaaagaact atgaaatcaa ctgacaagaa acacattctt 3661 attgacaatt gtgtataact ggattgcaga ctgttcttac tgtaactact tcctgattag 3721 gaatatgacc atttgactgt tcaatgatta tttgtattta cagtttccag agtttgtcat 3781 tataatagga acaatctttg ctgtatactt ttaaaaaata ctctgctatt tctcttgctg 3841 gaactgttga aagaaaatat atagaatgat ctattgctca tcagctttat tttttaaaca 3901 tacgacttat tttgttgaaa ttgtcaaaga ctgtatttag atctcataat gctttgttaa 3961 atgtttacaa gtaaatagtt tgaattcagt aaatattatt ggttgttgta ttgatcaatg 4021 catgttaccc attcaaccat tttatagact accaatttct tttatgttaa ctagaatgct 4081 tttgttaaaa gttatttgtt cattatttgt gctacccctt tgattatgca gacaacctca 4141 tcagctgcct aacttatcca tctttgaact tctgactact tgttgtatct gctggatatt 4201 tagttcaact gtatagtttt atttacttct gtatgtgtat ttttgtgaag tattcacaaa 4261 ggttaagtta aaataaaacc aagggatatc ttgcaaaaaa aaaaaaa // LOCUS HUMGAS 2461 bp mRNA PRI 31-DEC-1994 DEFINITION Homo sapiens growth-arrest-specific protein (gas) mRNA, complete cds. ACCESSION L13720 NID g401766 KEYWORDS growth-arrest-specific protein; vitamin K-dependent protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2461) AUTHORS Manfioletti,G., Brancolini,C., Avanzi,G. and Schneider,C. TITLE The protein encoded by a growth arrest-specific gene (gas6) is a new member of the vitamin K-dependent proteins related to protein S, a negative coregulator in the blood coagulation cascade JOURNAL Mol. Cell. Biol. 13 (8), 4976-4985 (1993) MEDLINE 93330291 FEATURES Location/Qualifiers source 1..2461 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa cells" gene 135..2171 /gene="gas6" CDS 135..2171 /gene="gas6" /codon_start=1 /function="negative coregulator in the blood coagulation cascade" /product="growth-arrest-specific protein" /db_xref="PID:g401767" /translation="MAPSLSPGPAALRRAPQLLLLLLAAECALAALLPAREATQFLRP RQRRAFQVFEEAKQGHLERECVEELCSREEAREVFENDPETDYFYPRYLDCINKYGSP YTKNSGFATCVQNLPDQCTPNPCDRKGTQACQDLMGNFFCLCKAGWGGRLCDKDVNEC SQENGGCLQICHNKPGSFHCSCHSGFELSSDGRTCQDIDECADSEACGEARCKNLPGS YSCLCDEGFAYSSQEKACRDVDECLQGRCEQVCVNSPGSYTCHCDGRGGLKLSQDMDT CEDILPCVPFSVAKSVKSLYLGRMFSGTPVIRLRFKRLQPTRLVAEFDFRTFDPEGIL LFAGGHQDSTWIVLALRAGRLELQLRYNGVGRVTSSGPVINHGMWQTISVEELARNLV IKVNRDAVMKIAVAGDLFQPERGLYHLNLTVGGIPFHEKDLVQPINPRLDGCMRSWNW LNGEDTTIQETVKVNTRMQCFSVTERGSFYPGSGFAFYSLDYMRTPLDVGTESTWEVE VVAHIRPAADTGVLFALWAPDLRAVPLSVALVDYHSTKKLKKQLVVLAVEHTALALME IKVCDGQEHVVTVSLRDGEATLEVDGTRGQSEVSAAQLQERLAVLERHLRSPVLTFAG GLPDVPVTSAPVTAFYRGCMTLEVNRRLLDLDEAAYKHSDITAHSCPPVEPAAA" BASE COUNT 454 a 782 c 793 g 432 t ORIGIN 1 ccgcagccgc cgccgccgcc gccgccgcga tgtgaccttc agggccgcca ggacgggatg 61 accggagcct ccgccccgcg gcgcccgctc gcctcggcct cccgggcgct ctgaccgcgc 121 gtccccggcc cgccatggcc ccttcgctct cgcccgggcc cgccgccctg cgccgcgcgc 181 cgcagctgct gctgctgctg ctggccgcgg agtgcgcgct tgccgcgctg ttgccggcgc 241 gcgaggccac gcagttcctg cggcccaggc agcgccgcgc ctttcaggtc ttcgaggagg 301 ccaagcaggg ccacctggag agggagtgcg tggaggagct gtgcagccgc gaggaggcgc 361 gggaggtgtt cgagaacgac cccgagacgg attattttta cccaagatac ttagactgca 421 tcaacaagta tgggtctccg tacaccaaaa actcaggctt cgccacctgc gtgcaaaacc 481 tgcctgacca gtgcacgccc aacccctgcg ataggaaggg gacccaagcc tgccaggacc 541 tcatgggcaa cttcttctgc ctgtgtaaag ctggctgggg gggccggctc tgcgacaaag 601 atgtcaacga atgcagccag gagaacgggg gctgcctcca gatctgccac aacaagccgg 661 gtagcttcca ctgttcctgc cacagcggct tcgagctctc ctctgatggc aggacctgcc 721 aagacataga cgagtgcgca gactcggagg cctgcgggga ggcgcgctgc aagaacctgc 781 ccggctccta ctcctgcctc tgtgacgagg gctttgcgta cagctcccag gagaaggctt 841 gccgagatgt ggacgagtgt ctgcagggcc gctgtgagca ggtctgcgtg aactccccag 901 ggagctacac ctgccactgt gacgggcgtg ggggcctcaa gctgtcccag gacatggaca 961 cctgtgagga catcttgccg tgcgtgccct tcagcgtggc caagagtgtg aagtccttgt 1021 acctgggccg gatgttcagt gggacccccg tgatccgact gcgcttcaag aggctgcagc 1081 ccaccaggct ggtagctgag tttgacttcc ggacctttga ccccgagggc atcctcctct 1141 ttgccggagg ccaccaggac agcacctgga tcgtgctggc cctgagagcc ggccggctgg 1201 agctgcagct gcgctacaac ggtgtcggcc gtgtcaccag cagcggcccg gtcatcaacc 1261 atggcatgtg gcagacaatc tctgttgagg agctggcgcg gaatctggtc atcaaggtca 1321 acagggatgc tgtcatgaaa atcgcggtgg ccggggactt gttccaaccg gagcgaggac 1381 tgtatcatct gaacctgacc gtgggaggta ttcccttcca tgagaaggac ctcgtgcagc 1441 ctataaaccc tcgtctggat ggctgcatga ggagctggaa ctggctgaac ggagaagaca 1501 ccaccatcca ggaaacggtg aaagtgaaca cgaggatgca gtgcttctcg gtgacggaga 1561 gaggctcttt ctaccccggg agcggcttcg ccttctacag cctggactac atgcggaccc 1621 ctctggacgt cgggactgaa tcaacctggg aagtagaagt cgtggctcac atccgcccag 1681 ccgcagacac aggcgtgctg tttgcgctct gggcccccga cctccgtgcc gtgcctctct 1741 ctgtggcact ggtagactat cactccacga agaaactcaa gaagcagctg gtggtcctgg 1801 ccgtggagca tacggccttg gccctaatgg agatcaaggt ctgcgacggc caagagcacg 1861 tggtcaccgt ctcgctgagg gacggtgagg ccaccctgga ggtggacggc accaggggcc 1921 agagcgaggt gagcgccgcg cagctgcagg agaggctggc cgtgctcgag aggcacctgc 1981 ggagccccgt gctcaccttt gctggcggcc tgccagatgt gccggtgact tcagcgccag 2041 tcaccgcgtt ctaccgcggc tgcatgacac tggaggtcaa ccggaggctg ctggacctgg 2101 acgaggcggc gtacaagcac agcgacatca cggcccactc ctgccccccc gtggagcccg 2161 ccgcagccta ggcccccacg ggacgcggca ggcttctcag tctctgtccg agacagccgg 2221 gaggagcctg ggggctcctc accacgtggg gccatgctga gagctgggct ttcctctgtg 2281 accatcccgg cctgtaacat atctgtaaat agtgagatgg acttggggcc tctgacgccg 2341 cgcactcagc cgtgggcccg ggcgcgggga ggccggcgca gcgcagagcg ggctcgaaga 2401 aaataattct ctattatttt tattaccaag cgcttctttc tgactctaaa atatggaaaa 2461 t // LOCUS HUMGAS1A 2828 bp mRNA PRI 15-APR-1994 DEFINITION Human gas1 gene, complete cds. ACCESSION L13698 NID g472859 KEYWORDS . SOURCE Homo sapiens liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2828) AUTHORS Del Sal,G., Collavin,L., Ruaro,M.E., Edomi,P., Saccone,S., della Valle,G. and Schneider,C. TITLE Structure, function, and chromosome mapping of the growth-suppressing human homologue of the murine gas1 gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91, 1848-1852 (1994) MEDLINE 94173926 FEATURES Location/Qualifiers source 1..2828 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /map="9q21.3-22.1" CDS 411..1448 /codon_start=1 /db_xref="PID:g472860" /translation="MVAALLGGGGEARGGTVPGAWLCLMALLQLLGSAPRGSGLAHGR RLICWQALLQCQGEPECSYAYNQYAEACAPVLAQHGGGDAPGAAAAAFPASAASFSSR WRCPSHCISALIQLNHTRRGPALEDCDCAQDENCKSTKRAIEPCLPRTSGGGAGGPGA GGVMGCTEARRRCDRDSRCNLALSRYLTYCGKVFNGLRCTDECRTVIEDMLAMPKVAL LNDCVCDGLERPICESVKENMARLCFGAELGNGPGSSGSDGGLDDYYDEDYDDEQRTG GAGGEQPLDDDDGVPHPPRPGSGAAASGGRGDLPYGPGRRSSGGGGRLAPRGAWTPLA SILLLLLGPLF" BASE COUNT 570 a 809 c 822 g 627 t ORIGIN 1 agcagccggc acggggacag ccggccgcac aacggatctg caggcgcgga gcaaaatgca 61 cccgccgcgc cgcgcggtcc tgcagccccg ccacggcccc gcggcccgca cccccccggg 121 gcgacagtga gcctctcccg ccaccaccgg gggccgagcg gagggctctc gggtgggaga 181 gcgggaccag atctcgacag ctgttcattt ccaggaagcc accgcagcca gagcgaaagg 241 ggaccttctg ccaccagcgg ggcatcagcc agcggcgcgc atggatttat gaagacactc 301 atgcaagaag tgggcaggac ttggacaaac ttttccaccg gctccgcgtc cgccgctccc 361 cgcgcctcgt ctcctttccc ctcctctccc ggcggccgcc gctgcccgcg atggtggccg 421 cgctgctggg cggcggcggc gaggcccgcg gggggacagt gccgggcgcc tggctgtgcc 481 tgatggcgct gctgcagctg ctgggctcgg cgccgcgggg atcggggctg gcgcacggcc 541 gccgcctcat ctgctggcag gcgctgctgc agtgccaggg ggagccggag tgcagctacg 601 cctacaacca gtacgccgag gcgtgcgcgc cggtgctggc gcagcacggc gggggcgacg 661 cgcccggggc cgccgccgcc gctttcccgg cctcggccgc ctctttctcg tcgcgctggc 721 gctgcccgag tcactgcatc tcggccctca ttcagctcaa ccacacgcgc cgcgggcccg 781 ccctggagga ctgtgactgc gcgcaggacg agaactgcaa gtccaccaag cgcgccattg 841 agccgtgcct gccccggacg agcggcggcg gcgcgggcgg ccccggcgcg ggcggggtca 901 tgggctgcac cgaggcccgg cggcgctgcg accgcgacag ccgctgcaac ctggcgctga 961 gccgctacct gacctactgc ggcaaagtct tcaacgggct gcgctgcacg gacgaatgcc 1021 gcaccgtcat tgaggacatg ctggctatgc ccaaggtggc gctgctcaac gactgcgtgt 1081 gcgacggcct cgagcggccc atctgcgagt cggtcaagga gaacatggcc cgcctgtgct 1141 tcggcgccga gctgggcaac ggccccggca gcagcggctc ggacgggggc ctggacgact 1201 actacgatga ggactacgat gacgagcagc gcaccggggg cgcgggtggt gagcagccgc 1261 tggacgacga cgacggcgtc ccgcacccac cgcgcccggg cagcggcgct gctgcatcgg 1321 gcggccgcgg ggacctgccc tatgggcctg ggcgcaggag cagcggcggc ggcggccgct 1381 tggcgccccg gggcgcctgg accccactcg cctccatctt gctgctgctg cttgggccgc 1441 tcttttagcc ctcgcgcccc ccgccgttgg ctgcgggaga gcccgcgtcc cactcccgtg 1501 ctcgcctcga ccccgcgccg ggcacctgtg gcttgggaca gatagaaggg atggttgggg 1561 atacttccca aaactttttc caagtcaact tggtgtagcc ggttccccgg ccacgactct 1621 gggcacttcc cctgaagctc ctctccggag cttgacttct tggacctcct cccccgcccc 1681 aattccaagc tccagaaact cccaactcgt ctgccgtcca gaaagctagc tgcagtgttc 1741 aggacgtccg ggaggaagca agcatgtggg ggacagaaca gtagtcctgg actcgaaagg 1801 gaaggtgctg accagtgggg ccttagcaat ttgaagggtt gggaaggagg aattatattt 1861 gcaaaggggc tgtctattag catatttcct ttgagggggc aaaaaaaagt gccagtatcg 1921 acttttacag attgtggcca gtgaggatat tataatccta tgtaaacaga aaagtcccac 1981 ttaccgattc attctttcac tgtttgtatc tgcgcccaga attctcagtg acgtgggggt 2041 gagggtgggt ggcgattgcc ttagagggaa cccctaaatt ggttttggat aagtttgagc 2101 ccttgacctt aatttcattg ctaccactct gatctcttag cacatttctt aggattaagg 2161 gtccaaaaat gctgatctaa ggggttgcca tggtgttgaa caatgcaact ttttatttaa 2221 aaaagctctg cactgccatg tatgaaagtc tctttatgat gtttgttttt ttgtcatttt 2281 tgttctttac atcaagaaat tttatgttta aatatgcgga gaatgtatat tgcctctgct 2341 cctatcaggg ttgctaaacc ctggtacatc gtatataaaa tgtattaaaa ctggggtttg 2401 ttaccagttg ctgtactttg tatatagaat ttttataaat tgtatgcttc agaaataatt 2461 tatttttaaa aagaaattaa aagttttaaa ctcacatcca tattacacct ttcccccctg 2521 aaatgtatag aatccatttg tcatcaggaa tcaaaaccca cagtccattg tgaagtgtgc 2581 tatatttaga acagtcttaa aatgtacagt gtattttata gaattgaagt taacattctt 2641 attttcaaga gaatttatgg acgttgtaga aatgtacaaa tgcatttcca aactgcctta 2701 aacgttgtat ttttatagac atgttttttt aaaaatccta agtttttaaa taactatgga 2761 tttgtgtatt ttttttggtt atttgtttta ttaaaacatg tacatcagta aagagtttta 2821 aacaatga // LOCUS HUMGAS3X 1716 bp mRNA PRI 31-DEC-1994 DEFINITION Human peripheral myelin protein 22 (GAS3) mRNA, complete cds. ACCESSION L03203 NID g182984 KEYWORDS peripheral myelin protein 22. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1716) AUTHORS Suter,U., Lupski,J.R., Shooter,E.M., Francke,U., Garcia,C.A., Snipes,G.J., Pentao,L., Trask,B., Schoener-Scott,R., Welcher,A.A., Roa,B. and Patel,P.I. TITLE The gene for the peripheral myelin protein PMP-22 is a candidate for Charcot-Marie-Tooth disease type 1A JOURNAL Nature New Genet. 1, 159-165 (1992) REFERENCE 2 (bases 1 to 1716) AUTHORS Edomi,P., Martinotti,A., Colombo,M.P. and Schneider,C. TITLE Sequence of human GAS3/PMP22 full-length cDNA JOURNAL Gene 126 (2), 289-290 (1993) MEDLINE 93246261 FEATURES Location/Qualifiers source 1..1716 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Hela" /map="17p" 5'UTR 1..114 /gene="GAS3" gene 1..1716 /gene="GAS3" CDS 115..597 /gene="GAS3" /codon_start=1 /product="peripheral myelin protein 22" /db_xref="PID:g182985" /translation="MLLLLLSIIVLHVAVLVLLFVSTIVSQWIVGNGHATDLWQNCST SSSGNVHHCFSSSPNEWLQSVQATMILSIIFSILSLFLFFCQLFTLTKGGRFYITGIF QILAGLCVMSAAAIYTVRHPEWHLNSDYSYGFAYILAWVAFPLALLSGVIYVILRKRE " 3'UTR 598..1716 /gene="GAS3" BASE COUNT 438 a 425 c 370 g 483 t ORIGIN 1 cggcgccagc agcggagcca acgcacccga gtttgtgttt gaggccaccc tgaggatcgg 61 gacagctgtt cctttgggct gcagaaactc cgctgagcag aacttgccgc cagaatgctc 121 ctcctgttgc tgagtatcat cgtcctccac gtcgcggtgc tggtgctgct gttcgtctcc 181 acgatcgtca gccaatggat cgtgggcaat ggacacgcaa ctgatctctg gcagaactgt 241 agcacctctt cctcaggaaa tgtccaccac tgtttctcat catcaccaaa cgaatggctg 301 cagtctgtcc aggccaccat gatcctgtcg atcatcttca gcattctgtc tctgttcctg 361 ttcttctgcc aactcttcac cctcaccaag gggggcaggt tttacatcac tggaatcttc 421 caaattcttg ctggtctgtg cgtgatgagt gctgcggcca tctacacggt gaggcacccg 481 gagtggcatc tcaactcgga ttactcctac ggtttcgcct acatcctggc ctgggtggcc 541 ttccccctgg cccttctcag cggtgtcatc tatgtgatct tgcggaaacg cgaatgaggc 601 gcccagacgg tctgtctgag gctctgagcg tacataggga agggaggaag ggaaaccaga 661 aagcagacaa agaaaaaaga gctagcccaa aatcccaaac tcaaaccaaa cagaaagcag 721 tggaggtggg ggttgctgtt gattgaagat gtatataata tctccggttt ataaaaccta 781 tttataacac tttttacata tatgtacata gtattgtttg ctttttatgt tgaccatcag 841 cctcgtgttg agccttaaag aagtagctaa ggaactttac atcctaacag tataatccag 901 ctcagtattt ttgttttgtt ttttgtttgt ttgttttgtt ttacccagaa ataagataac 961 tccatctcgc cccttccctt tcatctgaaa gaagatacct ccctcccagt ccacctcatt 1021 tagaaaacca aagtgtgggt agaaacccca aatgtccaaa agcccttttc tggtgggtga 1081 cccagtgcat ccaacagaaa cagccgctgc ccgaacctct gtgtgaagct ttacgcgcac 1141 acggacaaaa tgcccaaact ggagcccttg caaaaacacg gcttgtggca ttggcatact 1201 tgcccttaca ggtggagtat cttcgtcaca catctaaatg agaaatcagt gacaacaagt 1261 ctttgaaatg gtgctatgga tttaccattc cttattatca ctaatcatct aaacaactca 1321 ctggaaatcc aattaacaat tttacaacat aagatagaat ggagacctga ataattctgt 1381 gtaatataaa tggtttataa ctgcttttgt acctagctag gctgctatta ttactataat 1441 gagtaaatca taaagccttc atcactccca catttttctt acggtcggag catcagaaca 1501 agcgtctaga ctccttggga ccgtgagttc ctagagcttg gctgggtcta ggctgttctg 1561 tgcctccaag gactgtctgg caatgacttg tattggccac caactgtaga tgtatatatg 1621 gtgcccttct gatgctaaga ctccagacct tttgtttttg ctttgcattt tctgatttta 1681 taccaactgt gtggactaag atgcattaaa ataaac // LOCUS HUMGASUB 1146 bp mRNA PRI 17-MAR-1993 DEFINITION Homo sapiens (clone 58N-1) Ga subunit mRNA, complete cds. ACCESSION L01694 NID g182993 KEYWORDS Ga subunit; oncogene. SOURCE Homo sapiens (library: pCEV27) male adult sarcoma cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1146) AUTHORS Chan,A.M. L., Fleming,T.P., McGovern,E.S., Chedid,M., Miki,T. and Aaronson,S.A. TITLE Expression cDNA cloning of a transforming gene encoding the wild-type G alpha 12 gene product JOURNAL Mol. Cell. Biol. 13, 762-768 (1993) MEDLINE 93140773 FEATURES Location/Qualifiers source 1..1146 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="RD-ES-1, A2095" /cell_type="transformed cell" /dev_stage="adult" /sex="male" /tissue_type="sarcoma" /tissue_lib="pCEV27" CDS 1..1146 /codon_start=1 /product="Ga subunit" /db_xref="PID:g182994" /translation="MSGVVRTLSRCLLPAEAGGARERRAGSGARDAEREARRRSRDID ALLARERRAVRRLVKILLLGAGESGKSTFLKQMRIIHGREFDQKALLEFRDTIFDNIL KGSRVLVDARDKLGIPWQYSENEKHGMFLMAFENKAGLPVEPATFQLYVPALSALWRD SGIREAFSRRSEFQLGESVKYFLDNLDRIGQLNYFPSKQDILLARKATKGIVEHDFVI KKIPFKMVDVGGQRSQRQKWFQCFDGITSILFMVSSSEYDQVLMEDRRTNRLVESMNI FETIVNNKLFFNVSIILFLNKMDLLVEKVKTVSIKKHFPDFRGDPHQLEDVQRYLVQC FDRKRRNRSKPLFHHFTTAIDTENVRFVFHAVKDTILQENLKDIMLQ" BASE COUNT 241 a 338 c 355 g 212 t ORIGIN 1 atgtccgggg tggtgcggac cctcagccgc tgcctgctgc cggccgaggc cggcggggcc 61 cgcgagcgca gggcgggcag cggcgcgcgc gacgcggagc gcgaggcccg gaggcgtagc 121 cgcgacatcg acgcgctgct ggcccgcgag cggcgcgcgg tccggcgcct ggtgaagatc 181 ctgctgctgg gcgcgggcga gagcggcaag tccacgttcc tcaagcagat gcgcatcatc 241 cacggccgcg agttcgacca gaaggcgctg ctggagttcc gcgacaccat cttcgacaac 301 atcctcaagg gctcaagggt tcttgttgat gcacgagata agcttggcat tccttggcag 361 tattctgaaa atgagaagca tgggatgttc ctgatggcct tcgagaacaa ggcggggctg 421 cctgtggagc cggccacctt ccagctgtac gtcccggccc tgagcgcact ctggagggat 481 tctggcatca gggaggcttt cagccggaga agcgagtttc agctggggga gtcggtgaag 541 tacttcctgg acaacttgga ccggatcggc cagctgaatt actttcctag taagcaagat 601 atcctgctgg ctaggaaagc caccaaggga attgtggagc atgacttcgt tattaagaag 661 atccccttta agatggtgga tgtgggcggc cagcggtccc agcgccagaa gtggttccag 721 tgcttcgacg ggatcacgtc catcctgttc atggtctcct ccagcgagta cgaccaggtc 781 ctcatggagg acaggcgcac caaccggctg gtggagtcca tgaacatctt cgagaccatc 841 gtcaacaaca agctcttctt caacgtctcc atcattctct tcctcaacaa gatggacctc 901 ctggtggaga aggtgaagac cgtgagcatc aagaagcact tcccggactt caggggcgac 961 ccgcaccagc tggaggacgt ccagcgctac ctggtccagt gcttcgacag gaagagacgg 1021 aaccgcagca agccactctt ccaccacttc accaccgcca tcgacaccga gaacgtccgc 1081 ttcgtgttcc atgctgtgaa agacaccatc ctgcaggaga acctgaagga catcatgctg 1141 cagtga // LOCUS HUMGATAA 2226 bp mRNA PRI 31-DEC-1994 DEFINITION Homo sapiens GATA-4 mRNA, complete cds. ACCESSION L34357 NID g508483 KEYWORDS cardiac myosin heavy chain. SOURCE Homo sapiens heart cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2226) AUTHORS Huang,W.-Y., Cukerman,E. and Liew,C.-C. TITLE Identification of a putative GATA motif of the cardiac myosin heavy chain gene and cloning of human GATA-4 JOURNAL Unpublished (1994) FEATURES Location/Qualifiers source 1..2226 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="heart" gene 241..1569 /gene="GATA-4" CDS 241..1569 /gene="GATA-4" /note="putative" /codon_start=1 /db_xref="PID:g508484" /translation="MYQSLAMAANHGPPPGAYQAGGPGPFMHGAGAASSPVYLPTPRV PSSVLGLSYLQGGGAGSASGGPSGGSPGGAASGAGPGTQQGSPGWSQAGATGAAYTPP PVSPRFSFPGTTGSLAAAAAAAAAREAAAYSSGGGAAGAGLAGREQYGRAGFAGSYSS PYPAYMADVGASWAAAAAASAGPFDSPVLHSLPGRANPAARHPNLDMFDDFSEGRECV NCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSASRRVGLSCANCQT TTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLAMRKEGIQTRKRKPKNLNKSKTPAAP SGSESLPPASGASSNSSNATTSSSEEMRPIKTEPGLSSHYGHSSSVSQTFSVSAMSGH GPSIHPVLSALKLSPQGYASPVSQSPQTSSKQDSWNSLVLADSHGDIITA" BASE COUNT 402 a 766 c 644 g 414 t ORIGIN 1 accaccaaaa attcaaattg ggattttccg gagtaaacaa gagcctagag ccctttgctc 61 aatgctggat ttaatacgta tatattttta agcgagttgg ttttttcccc tttgattttt 121 gatcttcgcg acagttcctc ccacgcatat tatcgttgtt gccgtcgttt tctctccccg 181 cgtggctcct tgacctgcga gggagagaga ggacaccgaa gccgggagct cgcagggacc 241 atgtatcaga gcttggccat ggccgccaac cacgggccgc cccccggtgc ctaccaggcg 301 ggcggccccg gccccttcat gcacggcgcg ggcgccgcgt cctcgccagt ctacctgccc 361 acaccgcggg tgccctcctc cgttctgggc ctgtcctacc tccagggcgg aggcgcgggc 421 tctgcgtccg gaggcccctc gggcggcagc cccggtgggg ccgcgtctgg tgcggggccc 481 gggacccagc agggcagccc gggatggagc caggcgggag cgaccggagc cgcttacacc 541 ccgccgccgg tgtcgccgcg cttctccttc ccggggacca ccgggtccct ggcggcggcg 601 gcggcggctg ccgccgcccg ggaagctgcg gcctacagca gtggcggcgg agcggcgggt 661 gcgggcctgg cgggccgcga gcagtacggg cgcgccggct tcgcgggctc ctactccagc 721 ccctacccgg cttacatggc cgacgtgggc gcgtcctggg ccgcagccgc cgccgcctcc 781 gccggcccct tcgacagccc ggtcctgcac agcctgcccg gccgggccaa cccggccgcc 841 cgacacccca atctcgatat gtttgacgac ttctcagaag gcagagagtg tgtcaactgt 901 ggggctatgt ccaccccgct ctggaggcga gatgggacgg gtcactatct gtgcaacgcc 961 tgtggcctct accacaagat gaacggcatc aaccggccgc tcatcaagcc tcagcgccgg 1021 ctgtccgcct cccgccgagt gggcctctcc tgtgccaact gccagaccac caccaccacg 1081 ctgtggcgcc gcaatgcgga gggcgagcct gtgtgcaatg cctgcggcct ctacatgaag 1141 ctccacgggg tgcccaggcc tcttgcaatg cggaaagagg ggatccaaac cagaaaacgg 1201 aagcccaaga acctgaataa atctaagaca ccagcagctc cttcaggcag tgagagcctt 1261 cctcccgcca gcggtgcttc cagcaactcc agcaacgcca ccaccagcag cagcgaggag 1321 atgcgtccca tcaagacgga gcctggcctg tcatctcact acgggcacag cagctccgtg 1381 tcccagacgt tctcagtcag tgcgatgtct ggccatgggc cctccatcca ccctgtcctc 1441 tcggccctga agctctcccc acaaggctat gcgtctcccg tcagccagtc tccacagacc 1501 agctccaagc aggactcttg gaacagtctg gtcttggccg acagtcacgg ggacataatc 1561 actgcgtaat cttccctctt ccctcctcaa attcctgcac ggacctggga cttggaggat 1621 agcaaagaag gaggccctgg gctcccaggg gccggcctcc tctgcctggt aatgactcca 1681 gaacaacaac tgggaagaaa cttgaagtcg acaatctggt taggggaagc gggtgttgga 1741 ttttctcaga tgcctttaca cgctgatggg actggaggga gcccaccctt cagcacgagc 1801 acactgcatc tctcctgtga gttggagact tctttcccaa gatgtccttg tcccctgcgt 1861 tccccactgt ggcctagacc gtgggttttg cattgtgttt ctagcaccga aggatctgag 1921 aacaagcgga gggccgggcc ctgggacccc tgctccagcc cgaatgacgg catctgtttg 1981 ccatgtacct ggatgtgacg ggcccctggg gacaggccct tgccccatcc atccgcttga 2041 ggcatggcac cgccctgcat ccctaatacc aaatctgact ccaaaactgt ggggtgtgac 2101 acacaagtga ctgagcactt cctggggagc tacaggggca cttaacccac cacagcgcag 2161 cctcatcaaa atgcagctgg caacttctcc cccaggtgcc ttccccctgc tgccggcctt 2221 tgctcc // LOCUS HUMGBP1 2881 bp mRNA PRI 09-SEP-1991 DEFINITION Human guanylate binding protein isoform I (GBP-2) mRNA, complete cds. ACCESSION M55542 NID g183001 KEYWORDS guanylate binding protein isoform I. SOURCE Human foreskin fibroblast, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2881) AUTHORS Cheng,Y.-S.E., Patterson,C.E. and Staeheli,P. TITLE Interferon-induced guanylate-binding protein lack and N(T)KXD consensus motif and bind GMP in addition to GDP and GTP JOURNAL Mol. Cell. Biol. 11, 4717-4725 (1991) MEDLINE 91342675 FEATURES Location/Qualifiers source 1..2881 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="FS-2" /cell_type="fibroblast" /tissue_type="foreskin" mRNA <1..>2881 /gene="GBP-1" /product="guanylate binding protein isoform I" gene 1..2881 /gene="GBP-1" CDS 69..1847 /gene="GBP-1" /codon_start=1 /evidence=experimental /product="guanylate binding protein isoform I" /db_xref="PID:g183002" /translation="MASEIHMTGPMCLIENTNGRLMANPEALKILSAITQPMVVVAIV GLYRTGKSYLMNKLAGKKKGFSLGSTVQSHTKGIWMWCVPHPKKPGHILVLLDTEGLG DVEKGDNQNDSWIFALAVLLSSTFVYNSIGTINQQAMDQLYYVTELTHRIRSKSSPDE NENEVEDSADFVSFFPDFVWTLRDFSLDLEADGQPLTPDEYLTYSLKLKKGTSQKDET FNLPRLCIRKFFPKKKCFVFDRPVHRRKLAQLEKLQDEELDPEFVQQVADFCSYIFSN SKTKTLSGGIQVNGPRLESLVLTYVNAISSGDLPCMENAVLALAQIENSAAVQKAIAH YEQQMGQKVQLPTESLQELLDLHRDSEREAIEVFIRSSFKDVDHLFQKELAAQLEKKR DDFCKQNQEASSDRCSGLLQVIFSPLEEEVKAGIYSKPGGYRLFVQKLQDLKKKYYEE PRKGIQAEEILQTYLKSKESMTDAILQTDQTLTEKEKEIEVERVKAESAQASAKMLQE MQRKNEQMMEQKERSYQEHLKQLTEKMENDRVQLLKEQERTLALKLQEQEQLLKEGFQ KESRIMKNEIQDLQTKMRRRKACTIS" BASE COUNT 945 a 592 c 650 g 694 t ORIGIN 1 acagaagtgc tagaagccag tgctcgtgaa ctaaggagaa aaagaacaga caagggaaca 61 gcctggacat ggcatcagag atccacatga caggcccaat gtgcctcatt gagaacacta 121 atgggcgact gatggcgaat ccagaagctc tgaagatcct ttctgccatt acacagccta 181 tggtggtggt ggcaattgtg ggcctctacc gcacaggcaa atcctacctg atgaacaagc 241 tggctggaaa gaaaaagggc ttctctctgg gctccacggt gcagtctcac actaaaggaa 301 tctggatgtg gtgtgtgccc caccccaaga agccaggcca catcctagtt ctgctggaca 361 ccgagggtct gggagatgta gagaagggtg acaaccagaa tgactcctgg atcttcgccc 421 tggccgtcct cctgagcagc accttcgtgt acaatagcat aggaaccatc aaccagcagg 481 ctatggacca actgtactat gtgacagagc tgacacatag aatccgatca aaatcctcac 541 ctgatgagaa tgagaatgag gttgaggatt cagctgactt tgtgagcttc ttcccagact 601 ttgtgtggac actgagagat ttctccctgg acttggaagc agatggacaa cccctcacac 661 cagatgagta cctgacatac tccctgaagc tgaagaaagg taccagtcaa aaagatgaaa 721 cttttaacct gcccagactc tgtatccgga aattcttccc aaagaaaaaa tgctttgtct 781 ttgatcggcc cgttcaccgc aggaagcttg cccagctcga gaaactacaa gatgaagagc 841 tggaccccga atttgtgcaa caagtagcag acttctgttc ctacatcttt agtaattcca 901 aaactaaaac tctttcagga ggcatccagg tcaacgggcc tcgtctagag agcctggtgc 961 tgacctacgt caatgccatc agcagtgggg atctgccgtg catggagaac gcagtcctgg 1021 ccttggccca gatagagaac tcagctgcag tgcaaaaggc tattgcccac tatgaacagc 1081 agatgggcca gaaggtgcag ctgcccacag aaagcctcca ggagctgctg gacctgcaca 1141 gggacagtga gagagaggcc attgaagtct tcatcaggag ttccttcaaa gatgtggacc 1201 atctatttca aaaggagtta gcggcccagc tagaaaaaaa gcgggatgac ttttgtaaac 1261 agaatcagga agcatcatca gatcgttgct caggtttact tcaggtcatt ttcagtcctc 1321 tagaagaaga agtgaaggcg ggaatttatt cgaaaccagg gggctatcgt ctctttgttc 1381 agaagctaca agacctgaag aaaaagtact atgaggaacc gaggaagggg atacaggctg 1441 aagagattct gcagacatac ttgaaatcca aggagtctat gactgatgca attctccaga 1501 cagaccagac tctcacagaa aaagaaaagg agattgaagt ggaacgtgtg aaagctgagt 1561 ctgcacaggc ttcagcaaaa atgttgcagg aaatgcaaag aaagaatgag cagatgatgg 1621 aacagaagga gaggagttat caggaacact tgaaacaact gactgagaag atggagaacg 1681 acagggtcca gttgctgaaa gagcaagaga ggaccctcgc tcttaaactt caggaacagg 1741 agcaactact aaaagaggga tttcaaaaag aaagcagaat aatgaaaaat gagatacagg 1801 atctccagac gaaaatgaga cgacgaaagg catgtaccat aagctaaaga ccagagcctt 1861 cctgtcaccc ctaaccaagg cataattgaa acaattttag aatttggaac aagcgtcact 1921 acatttgata ataattagat cttgcatcat aacaccaaaa gtttataaag gcatgtggta 1981 caatgatcaa aatcatgttt tttcttaaaa aaaaaaaaaa gactgtaaat tgtgcaacaa 2041 agatgcattt acctctgtat caactcagga aatctcataa gctggtacca ctcaggagaa 2101 gtttattctt ccagatgacc agcagtagac aaatggatac tgagcagagt cttaggtaaa 2161 agtcttggga aatatttggg cattggtctg gccaagtcta caatgtccca atatcaagga 2221 caaccaccct agcttcttag tgaagacaat gtacagttat ccattagatc aagactacac 2281 ggtctatgag caataatgtg atttctggac attgcccatg tataatcctc actgatgatt 2341 tcaagctaaa gcaaaccacc ttatacagag atctagaatc tctttatgtt ctccagagga 2401 aggtggaaga aaccatgggc aggagtagga attgagtgat aaacaattgg gctaatgaag 2461 aaaacttctc ttattgttca gttcatccag attataactt caatgggaca ctttagacca 2521 ttagacaatt gacactggat taaacaaatt cacataatgc caaatacaca atgtatttat 2581 agcaacgtat aatttgcaaa gatggacttt aaaagatgct gtgtaactaa actgaaataa 2641 ttcaattact tattatttag aatgttaaag cttatgatag tcttttctaa ttcttaacac 2701 tcatacttga aatctttccg agtttcccca gaagagaata tgggattttt tttgacattt 2761 ttgacccatt taataatgct cttgtgttta cctagtatat gtagactttg tcttatgtgt 2821 caaaagtcct aggaaagtgg ttgatgtttc ttatagcaat taaaaattat ttttgaactg 2881 a // LOCUS HUMGCL 1652 bp mRNA PRI 31-DEC-1994 DEFINITION Human grancalcin mRNA, complete cds. ACCESSION M81637 NID g183030 KEYWORDS calcium-binding protein; grancalcin. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1652) AUTHORS Boyhan,A., Casimir,C.M., French,J.K., Teahan,C.G. and Segal,A.W. TITLE Molecular cloning and characterization of grancalcin, a novel EF-hand calcium-binding protein abundant in neutrophils and monocytes JOURNAL J. Biol. Chem. 267 (5), 2928-2933 (1992) MEDLINE 92147631 FEATURES Location/Qualifiers source 1..1652 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HL60" /cell_type="promyelocyte" gene 120..773 /gene="grancalcin" CDS 120..773 /gene="grancalcin" /note="putative" /codon_start=1 /product="grancalcin" /db_xref="PID:g183031" /translation="MAYPGYGGGFGNFSIQVPGMQMGQPVPETGPAILLDGYSGPAYS DTYSSAGDSVYTYFSAVAGQDGEVDAEELQRCLTQSGINGTYSPFSLETCRIMIAMLD RDHTGKMGFNAFKELWAALNAWKENFMTVDQDGSGTVEHHELRQAIGLMGYRLSPQTL TTIVKRYSKNGRIFFDDYVACCVKLRALTDFFRKRDHLQQGSANFIYDDFLQGTMAI" BASE COUNT 482 a 283 c 327 g 560 t ORIGIN 1 cccccccccc ctttcagcct cacctgcagc tgcgcctcct tgcacctgcg cctgtgcttt 61 ttctcccagc actgcggacg cgactcgagg gtgacgctcg ctccgctcgt ccgctcgtca 121 tggcctaccc gggatacgga ggagggtttg gaaattttag cattcaggtg ccaggaatgc 181 agatgggaca gccagtgcca gaaacaggcc cagctatact cctcgatgga tactctgggc 241 cagcatattc agacacttat tcctcagctg gtgactccgt gtatacttac ttcagtgctg 301 ttgctggaca ggatggtgaa gtggatgctg aagaacttca gagatgtttg acacagtctg 361 gaattaatgg aacttactct cccttcagtt tggaaacctg cagaattatg attgccatgt 421 tggatagaga tcacacagga aaaatgggat ttaatgcatt caaagagcta tgggcagctc 481 ttaatgcctg gaaggaaaac ttcatgactg ttgatcaaga tggaagtggc acagtagaac 541 atcatgagtt gcgtcaagcc attggtctta tgggttatag gttgagtcct caaacattaa 601 ctactattgt taaacgttat agcaagaatg gcagaatttt ctttgatgat tatgttgctt 661 gctgtgtgaa gcttcgagca ttgacagatt tctttaggaa aagagaccac ttgcaacaag 721 ggtctgcgaa tttcatatat gacgattttt tgcagggcac tatggcaatt tgaatgctta 781 gaattttaaa cctgaagaga cactgtgaat tcttttgttt ggaagaagtg aactggacta 841 ctttaaaact tttaagggtt ttctatgttc ttcctacctg ttaaacctct tccctttctg 901 tgtgttttta ttttagcaga tagttcaaag caataaaaga tttctttttt aatttgaggt 961 attactgctt ttggaaaagt tattttataa atatgtgcat attgtcataa aatattgtat 1021 gattaattga tttaaataat gcttagcctt aattttagat aatgtaaatt tagaggaatg 1081 tactttacaa gatagattgt ataagaagcc aaataatgaa agcctagaaa aaactaattt 1141 atacttatct gaaggttaca aattagactt ttaaattttc tttgtagttg gtggtgtttg 1201 agggttggct agaaatgaaa gcctggcatt ttgtgccatg tttgtaatat agtttgttcc 1261 ttgatcaaat aatcagagaa aagaaactta aagatctttg tctgtgaaga agaaaattat 1321 ctccctagtt caatctgtag tgaaataaga ctacagaagg cattgttttt tcctttttta 1381 ttttttgtat tatatatttt tcttaaatat gttttattgt cttctctaag caaaaagttg 1441 cttaataaac atagtatttc tctctgcgtc ctatttcatt agtgaagaca tagttcacct 1501 aaaatggcat cctgctctga atctagactt tttagaaatg gcatatgttt ttgatgatat 1561 gtcaacattc aaaattgtcc taattaaatt gttgtttaaa tgtaatgtca actctttata 1621 aacttaaaaa taaacaagta attaaccact ca // LOCUS HUMGCSH 2634 bp mRNA PRI 31-DEC-1994 DEFINITION Human gamma-glutamylcysteine synthetase (GCS) mRNA, complete cds. ACCESSION M90656 NID g183038 KEYWORDS gamma-glutamylcysteine synthetase. SOURCE Homo sapiens (tissue library: statagene) liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2634) AUTHORS Gipp,J.J., Chang,C. and Mulcahy,R.T. TITLE Cloning and nucleotide sequence of a full-length cDNA for human liver gamma-glutamylcysteine synthetase JOURNAL Biochem. Biophys. Res. Commun. 185 (1), 29-35 (1992) MEDLINE 92287108 FEATURES Location/Qualifiers source 1..2634 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /tissue_lib="statagene" gene 93..2006 /gene="GCS" CDS 93..2006 /gene="GCS" /EC_number="6.3.2.2" /codon_start=1 /product="gamma-glutamylcysteine synthetase" /db_xref="PID:g183039" /translation="MGLLSQGSPLSWEETKRHADHVRRHGILQFLHIYHAVKDRHKDV LKWGDEVEYMLVSFDHENKKVRLVLSGEKVLETLQEKGERTNPNHPTLWRPEYGSYMI EGTPGQPYGGTMSEFNTVEANMRKRRKEATSILEENQALCTITSFPRLGCPGFTLPEV KPNPVEGGASKSLFFPDEAINKHPRFSTLTRNIRHRRGEKVVINVPIFKDKNTPSPFI ETFTEDDEASRASKPDHIYMDAMGFGMGNCCLQVTFQACSISEARYLYDQLATICPIV MALSAASPFYRGYVSDIDCRWGVISASVDDRTREERGLEPLKNNNYRISKSRYDSIDS YLSKCGEKYNDIDLTIDKEIYEQLLQEGIDHLLAQHVAHLFIRDPLTLFEEKIHLDDA NESDHFENIQSTNWQTMRFKPPPPNSDIGWRVEFRPMEVQLTDFENSAYVVFVVLLTR VILSYKLDFLIPLSKVDENMKVAQKRDAVLQGMFYFRKDICKGGNAVVDGCGKAQNST ELAAEEYTLMSIDTIINGKEGVFPGLIPILNSYLENMEVDVDTRCSILNYLKLIKKRA SGELMTVARWMREFIANHPDYKQDSVITDEMNYSLILKCNQIANELCECPELLGSAFR KVKYSGSKTDSSN" BASE COUNT 792 a 527 c 609 g 706 t ORIGIN 1 ggcacgaggc tgagtgtccg tctcgcgccc ggaagcgggc gaccgccgtc agcccggagg 61 aggaggagga ggaggaggag gagggggcgg ccatggggct gctgtcccag ggctcgccgc 121 tgagctggga ggaaaccaag cgccatgccg accacgtgcg gcggcacggg atcctccagt 181 tcctgcacat ctaccacgcc gtcaaggacc ggcacaagga cgttctcaag tggggcgatg 241 aggtggaata catgttggta tcttttgatc atgaaaataa aaaagtccgg ttggtcctgt 301 ctggggagaa agttcttgaa actctgcaag agaaggggga aaggacaaac ccaaaccatc 361 ctaccctttg gagaccagag tatgggagtt acatgattga agggacacca ggacagccct 421 acggaggaac aatgtccgag ttcaatacag ttgaggccaa catgcgaaaa cgccggaagg 481 aggctacttc tatattagaa gaaaatcagg ctctttgcac aataacttca tttcccagat 541 taggctgtcc tgggttcaca ctgcccgagg tcaaacccaa cccagtggaa ggaggagctt 601 ccaagtccct cttctttcca gatgaagcaa taaacaagca ccctcgcttc agtaccttaa 661 caagaaatat ccgacatagg agaggagaaa aggttgtcat caatgtacca atatttaagg 721 acaagaatac accatctcca tttatagaaa catttactga ggatgatgaa gcttcaaggg 781 cttctaagcc ggatcatatt tacatggatg ccatgggatt tggaatgggc aattgctgtc 841 tccaggtgac attccaagcc tgcagtatat ctgaggccag atacctttat gatcagttgg 901 ctactatctg tccaattgtt atggctttga gtgctgcatc tcccttttac cgaggctatg 961 tgtcagacat tgattgtcgc tggggagtga tttctgcatc tgtagatgat agaactcggg 1021 aggagcgagg actggagcca ttgaagaaca ataactatag gatcagtaaa tcccgatatg 1081 actcaataga cagctattta tctaagtgtg gtgagaaata taatgacatc gacttgacga 1141 tagataaaga gatctacgaa cagctgttgc aggaaggcat tgatcatctc ctggcccagc 1201 atgttgctca tctctttatt agagacccac tgacactgtt tgaagagaaa atacacctgg 1261 atgatgctaa tgagtctgac cattttgaga atattcagtc cacaaattgg cagacaatga 1321 gatttaagcc ccctcctcca aactcagaca ttggatggag agtagaattt cgacccatgg 1381 aggtgcaatt aacagacttt gagaactctg cctatgtggt gtttgtggta ctgctcacca 1441 gagtgatcct ttcctacaaa ttggattttc tcattccact gtcaaaggtt gatgagaaca 1501 tgaaggtagc acagaaaaga gatgctgtct tgcagggaat gttttatttc aggaaagata 1561 tttgcaaagg tggcaatgca gtggtggatg gttgtggcaa ggcccagaac agcacggagc 1621 tcgctgcaga ggagtacacc ctcatgagca tagacaccat catcaatggg aaggaaggtg 1681 tgtttcctgg actgatccca attctgaact cttaccttga aaacatggaa gtggatgtgg 1741 acaccagatg tagtattctg aactacctaa agctaattaa gaagagagca tctggagaac 1801 taatgacagt tgccagatgg atgagggagt ttatcgcaaa ccatcctgac tacaagcaag 1861 acagtgtcat aactgatgaa atgaattata gccttatttt gaagtgtaac caaattgcaa 1921 atgaattatg tgaatgccca gagttacttg gatcagcatt taggaaagta aaatatagtg 1981 gaagtaaaac tgactcatcc aactagacat tctacagaaa gaaaaatgca ttattgacga 2041 actggctaca gtaccatgcc tctcagcccg tgtgtataat atgaagacca aatgatagaa 2101 ctgtactgtt ttctgggcca gtgagccaga aattgattaa ggctttcttt ggtaggtaaa 2161 tctagagttt atacagtgta catgtacata gtaaagtatt tttgattaac aatgtatttt 2221 aataacatat ctaaagtcat catgaactgg cttgtacatt tttaaattct tactctggag 2281 caacctactg tctaagcagt tttgtaaatg tactggtaat tgtacaatac ttgcattcca 2341 gagttaaaat gtttactgta aatttttgtt cttttaaaga ctacctggga cctgatttat 2401 tgaaattttt ctctttaaaa acattttctc tcgttaattt tcctttgtca tttcctttgt 2461 tgtctacatt aaatcacttg aatccattga aagtgcttca agggtaatct tgggtttcta 2521 gcaccttatc tatgatgttt cttttgcaat tggaataatc acttggtcac cttgccccaa 2581 gctttcccct ctgaataaat acccattgaa ctctgaaaaa aaaaaaaaaa aaaa // LOCUS HUMGCSL 1610 bp mRNA PRI 14-APR-1995 DEFINITION Homo sapiens gamma-glutamylcysteine synthetase light subunit mRNA, complete cds. ACCESSION L35546 NID g530136 KEYWORDS gamma-glutamylcysteine synthetase; gamma-glutamylcysteine synthetase light subunit. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1610) AUTHORS Gipp,J.J., Bailey,H.H. and Mulcahy,R.T. TITLE Cloning and sequencing of the cDNA for the light subunit of human liver gamma-glutamylcysteine synthetase and relative mRNA levels for heavy and light subunits in human normal tissues JOURNAL Biochem. Biophys. Res. Commun. 206 (2), 584-589 (1995) MEDLINE 95126958 FEATURES Location/Qualifiers source 1..1610 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /tissue_type="liver" mRNA 1..1610 5'UTR 1..253 CDS 254..1078 /codon_start=1 /product="gamma-glutamylcysteine synthetase light subunit" /db_xref="PID:g530137" /translation="MGTDSRAAKALLARARTLHLQTGNLLNWGRLRKKCPSTHSEELH DCIQKTLNEWSSQINPDLVREFPDVLECTVSHAVEKINPDEREEMKVSAKLFIVESNS SSSTRSAVDMACSVLGVAQLDSVIIASPPIEDGVNLSLEHLQPYWEELENLVQSKKIV AIGTSDLDKTQLEQLYQWAQVKPNSNQVNLASCCVMPPDLTAFAKQFDIQLLTHNDPK ELLSEASFQEALQESIPDIQAHEWVPLWLLRYSVIVKSRGIIKSKGYILQAKRRGS" 3'UTR 1079..1610 polyA_signal 1588..1593 polyA_site 1610 BASE COUNT 435 a 378 c 355 g 442 t ORIGIN 1 ggcacgaggc tgcggccgca gtagccggag ccggagccgc agccaccggt gccttccttt 61 cccgccgccg cccagccgcc gtccggcctc cctcgggccc gagcgcagac caggctccag 121 ccgcgcggcg ccggcagcct cgcgctccct ctcgggtctc tctcgggcct cgggcaccgc 181 gtcctgtggg cggccgcctg cctgcccgcc cgcccgcagc cccttgcctg ccggcccctg 241 ggcggcccgt gccatgggca ccgacagccg cgcggccaag gcgctcctgg cgcgggcccg 301 caccctgcac ctgcagacgg ggaacctgct gaactggggc cgcctgcgga agaagtgccc 361 gtccacgcac agcgaggagc ttcatgattg tatccaaaaa accttgaatg aatggagttc 421 ccaaatcaac ccagatttgg tcagggagtt tccagatgtc ttggaatgca ctgtatctca 481 tgcagtagaa aagataaatc ctgatgaaag agaagaaatg aaagtttctg caaaactgtt 541 cattgtagaa tcaaactctt catcatcaac tagaagtgca gttgacatgg cctgttcagt 601 ccttggagtt gcacagctgg attctgtgat cattgcttca cctcctattg aagatggagt 661 taatctttcc ttggagcatt tacagcctta ctgggaggaa ttagaaaact tagttcagag 721 caaaaagatt gttgccatag gtacctctga tctagacaaa acacagttgg aacagctgta 781 tcagtgggca caggtaaaac caaatagtaa ccaagttaat cttgcctcct gctgtgtgat 841 gccaccagat ttgactgcat ttgctaaaca atttgacata cagctgttga ctcacaatga 901 tccaaaagaa ctgctttctg aagcaagttt ccaagaagct cttcaggaaa gcattcctga 961 cattcaagcg cacgagtggg tgccgctgtg gctactgcgg tattcggtca ttgtgaaaag 1021 tagaggaatt atcaaatcaa aaggctacat tttacaagct aaaagaaggg gttcttaact 1081 gacttaggag cataacttac ctgtaatttc cttcaatatg agagaaaatt gagatgtgta 1141 aaatctagtt actgcctgta aatggtgtca ttgaggcaga tattctttcg tcatatttga 1201 cagtatgttg tctgtcaagt tttaaatact tatcttgcct ccatatcaat ccattctcat 1261 gaacctctgt attgctttcc ttaaactatt gttttctaat tgaaattgtc tataaagaaa 1321 atacttgcaa tatatttttc ctttattttt atgactaata taaatcaaga aaatttgttg 1381 ttagatatat tttggcctag gtatcagggt aatgtatata catatttttt atttccaaaa 1441 aaaattcatt aattgcttct taactcttat tataaccaag caatttaatt acaattgtta 1501 aaactgaaat actggaagaa gatatttttc ctgtcattga tgagatatat cagagtaact 1561 ggagtagctg ggatttacta gtagtgtaaa taaaattcac tcttcaatac // LOCUS HUMGCSTP 2102 bp mRNA PRI 18-APR-1996 DEFINITION Human mRNA for glycine cleavage system T-protein, complete cds. ACCESSION D13811 NID g391720 KEYWORDS glycine cleavage system T-protein. SOURCE Homo sapiens fetus liver, lambda gt11 library, cDNA to mRNA, clones HT15 and HT24. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2102) AUTHORS Hayasaka,K., Nanao,K., Takada,G., Okamura-Ikeda,K. and Motokawa,Y. TITLE Isolation and sequence determination of cDNA encoding human T-protein of the glycine cleavage system JOURNAL Cellular and Molecular Biology Research 192, 766-771 (1993) REFERENCE 2 (bases 1 to 2102) AUTHORS Hayasaka,K. TITLE Direct Submission JOURNAL Submitted (03-DEC-1992) to the DDBJ/EMBL/GenBank databases. Kiyoshi Hayasaka, Yamagata University School of Medicine, Department of Pediatrics; 2-2-2 Iida Nishi, Yamagata, Yamagata 990-23, Japan (Fax:0236-25-7089) COMMENT Submitted (03-DEC-1992) to DDBJ by: Kiyoshi Hayasaka Department of Pediatrics Akita University School of Medicine 1-1-1 Hondo, Akita 010 Japan Phone: 0188-34-1111 x2533 Fax: 0188-36-2620. FEATURES Location/Qualifiers source 1..2102 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="liver" CDS 130..1341 /codon_start=1 /product="glycine cleavage system T-protein" /db_xref="PID:d1003473" /db_xref="PID:g391721" /translation="MQRAVSVVARLGFRLQAFPPALCRPLSCAQEVLRRTPLYDFHLA HGGKMVAFAGWSLPVQYRDSHTDSHLHTRQHCSLFDVSHMLQTKILGSDRVKLMESLV VGDIAELRPNQGTLSLFTNEAGGILDDLIVTNTSEGHLYVVSNAGCWEKDLALMQDKV RELQNQGRDVGLEVLDNALLALQGPTAAQVLQAGVADDLRKLPFMTSAVMEVFGVSGC RVTRCGYTGEDGVEISVPVAGAVHLATAILKNPEVKLAGLAARDSLRLEAGLCLYGND IDEHTTPVEGSLSWTLGKRRRAAMDFPGAKVIVPQLKGRVQRRRVGLMCEGAPMRAHS PILNMEGTKIGTVTSGCPSPSLKKNVAMGYVPCEYSRPGTMLLVEVRRKQQMAVVSKM PFVPTNYYTLK" sig_peptide 130..213 mat_peptide 214..1338 /product="glycine cleavage system T-protein" polyA_site 2102 BASE COUNT 435 a 562 c 625 g 480 t ORIGIN Chromosome 3, p21.1-p21.2. 1 tgcccacgcc cccttcagat cctttgctcc ggagagagac ctgtccgagc agaggcctgg 61 actacatctc ccggcgtgcc tggcagtgtg gtggcctctg tgcgccgtct gcactcgttg 121 caggcgacga tgcagagggc tgtaagtgtg gtggcccgtc tgggctttcg cctgcaggca 181 ttccccccgg ccttgtgtcg tccacttagt tgcgcacagg aggtgctccg caggacaccg 241 ctctatgact tccacctggc ccacggcggg aaaatggtgg cgtttgcggg ttggagtctg 301 ccagtgcagt accgggacag tcacactgac tcgcacctgc acacacgcca gcactgctcg 361 ctctttgacg tgtctcatat gctgcagacc aagatacttg gtagtgaccg ggtgaagctg 421 atggagagtc tagtggttgg agacattgca gagctaagac caaaccaggg gacactgtcg 481 ctgtttacca acgaggctgg aggcatctta gatgacttga ttgtaaccaa tacttctgag 541 ggccacctgt atgtggtgtc caacgctggc tgctgggaga aagatttggc cctcatgcag 601 gacaaggtca gggagcttca gaaccagggc agagatgtgg gcctggaggt gttggataat 661 gccctgctag ctctgcaagg ccccactgca gcccaggtac tacaggccgg cgtggcagat 721 gacctgagga aactgccctt catgaccagt gctgtgatgg aggtgtttgg cgtgtctggc 781 tgccgcgtga cccgctgtgg ctacacagga gaggatggtg tggagatctc ggtgccggta 841 gcgggggcag ttcacctggc aacagctatt ctgaaaaacc cagaggtgaa gctggcaggg 901 ctggcagcca gggacagcct gcgcctggag gcaggcctct gcctgtatgg gaatgacatt 961 gatgaacaca ctacacctgt ggagggcagc ctcagttgga cactggggaa gcgccgccga 1021 gctgctatgg acttccctgg agccaaggtc attgttcccc agctgaaggg cagggtgcag 1081 cggaggcgtg tggggttgat gtgtgagggg gcccccatgc gggcacacag tcccatcctg 1141 aacatggagg gtaccaagat tggtactgtg actagtggct gcccctcccc ctctctgaag 1201 aagaatgtgg cgatgggtta tgtgccctgc gagtacagtc gtccagggac aatgctgctg 1261 gtagaggtgc ggcggaagca gcagatggct gtagtcagca agatgccctt tgtgcccaca 1321 aactactata ccctcaagtg aagctggctc agggtggggc tgtcccttcc aggagttttg 1381 cccctacaag gggttagtca agaagctgag gcagaactca ctgggggtgg gcagttaagg 1441 tggaggctga ttctaattgt ctggttgagg ggccacacca cctattcccc ccacctaact 1501 catgccattc cagcttcctt caggaccctg cttctgagtg acggaccagc tcacacaatg 1561 tcttgtttca gtccatgatc ccactgacct actcttgcct gctggagggt aatgagaagc 1621 tttggttctg ccatctctcc cactctgcca ggtgctggct gtggagcaaa ggctcacctt 1681 tgtggagagg ataaaacctg cccaacctac ctcaccatgg tttttcacat tgcaaagggt 1741 aataacatgg gcagtgcgga cttaggctac cccctccagt ttgctttccg taaatgcaaa 1801 ttgtccttac tgcaagtcag gaatgattgc tgactcacag tagggctgct atgcctgtgt 1861 gtaaacttgg ggatggctga gggaacatag actcactctt ccacattccc aagttggtct 1921 agtgtgctgc ccagtagcaa accatggcag actcaccacc tattctgagt tccagggctg 1981 ctgtagggca gggtgggctt cctcccagac ttgccttacc ctgggctgat ctttgcccct 2041 ggtatgcatt aatggactcc actgaatcct gaaaaaaaaa ttaaacttcc ttcttacttg 2101 cc // LOCUS HUMGDF1 2510 bp mRNA PRI 31-DEC-1994 DEFINITION Human growth/differentiation factor 1 (GDF-1) mRNA, complete cds. ACCESSION M62302 NID g183050 KEYWORDS growth/differentiation factor 1; transforming growth factor-beta. SOURCE Homo sapiens (tissue library: Stratagene) adult cerebellum and fetal brain (17-18 week abortus) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2510) AUTHORS Lee,S.J. TITLE Expression of growth/differentiation factor 1 in the nervous system: conservation of a bicistronic structure JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88 (10), 4250-4254 (1991) MEDLINE 91239545 FEATURES Location/Qualifiers source 1..2510 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="adult cerebellum and fetal brain (17-18 week abortus)" /tissue_lib="Stratagene" gene 25..1077 /gene="UOG-1" CDS 25..1077 /gene="UOG-1" /note="ORF" /codon_start=1 /db_xref="PID:g183051" /translation="MAAAGPAAGPTGPEPMPSYAQLVQRGWGSALAAARGCTDCGWGL ARRGLAEHAHLAPPELLLLALGALGWTALRSAATARLFRPLAKRCCLQPRDAAKMPES AWKFLFYLGSWSYSAYLLFGTDYPFFHDPPSVFYDWTPGMAVPRDIAAAYLLQGSFYG HSIYATLYMDTWRKDSVVMLLHHVVTLILIVSSYAFRYHNVGILVLFLHDISDVQLEF TKLNIYFKSRGGSYHRLHALAADLGCLSFGFSWFWFRLYWFPLKVLYATSHCSLRTVP DIPFYFFFNALLLLLTLMNLYWFLYIVAFAAKVLTGQVHELKDLREYDTAEAQSLKPS KAEKPLRNGLVKDKRF" gene 1347..2465 /gene="GDF-1" CDS 1347..2465 /gene="GDF-1" /codon_start=1 /product="growth/differentiation factor 1" /db_xref="PID:g183052" /translation="MPPPQQGPCGHHLLLLLALLLPSLPLTRAPVPPGPAAALLQALG LRDEPQGAPRLRPVPPVMWRLFRRRDPQETRSGSRRTSPGVTLQPCHVEELGVAGNIV RHIPDRGAPTRASEPVSAAGHCPEWTVVFDLSAVEPAERPSRARLELRFAAAAAAAPE GGWELSVAQAGQGAGADPGPVLLRQLVPALGPPVRAELLGAAWARNASWPRSLRLALA LRPRAPAACARLAEASLLLVTLDPRLCHPLARPRRDAEPVLGGGPGGACRARRLYVSF REVGWHRWVIAPRGFLANYCQGQCALPVALSGSGGPPALNHAVLRALMHAAAPGAADL PCCVPARLSPISVLFFDNSDNVVLRQYEDMVVDECGCR" BASE COUNT 313 a 968 c 814 g 415 t ORIGIN 1 ggacacggcg ggcgagcggg cggtatggcg gcggcggggc ccgcggcggg gccgacgggg 61 cccgagccca tgccgagcta cgcgcagcta gtgcagcgcg gctggggcag cgcgctggcg 121 gcggcgcggg gctgcacgga ctgcggctgg gggctggcgc gtcgcggcct ggctgagcac 181 gcgcacctgg cgccgcccga gctgctgctg ctggcgctcg gcgcgctggg ctggaccgcg 241 ctgcgctccg cggccactgc gcgcctcttt cggcccctgg cgaagcggtg ctgcctccag 301 cccagagatg ccgccaagat gcccgagagc gcttggaagt ttctcttcta cctgggcagc 361 tggagctaca gtgcctacct gctgtttggc accgactacc ccttcttcca tgacccacca 421 tctgtcttct acgactggac gccgggcatg gcagtgccac gggacattgc agccgcctac 481 ctgctccagg gaagcttcta tggccactcc atctacgcta cgctatacat ggacacctgg 541 cgcaaggact cggtggtcat gctgctccac cacgtggtca ctctcatcct catcgtctcc 601 tcctacgcct tccggtacca caatgtgggc atccttgtgc tcttcctgca cgatatcagt 661 gacgtgcagc ttgagttcac caagctcaac atttacttca agtcccgcgg cggctcctac 721 catcggctgc atgccttggc agcagacttg ggctgcctca gcttcggctt cagctggttc 781 tggttccgcc tctactggtt cccgctcaag gtcctgtatg ccaccagtca ctgcagtctg 841 cgcacggtgc ctgacatccc cttctacttc ttcttcaatg cgctcctgct gctgctcacc 901 cttatgaacc tctactggtt cctgtacatc gtggcgtttg cagccaaggt gttgacaggc 961 caggtgcacg agctgaagga cctgcgggag tatgacacag ccgaggccca gagcctgaag 1021 cccagcaaag ccgagaagcc actgaggaac ggcctggtga aggacaagcg cttctgaacc 1081 cctcggcccc gcccccgtgg acccggcccc accccgaata ccccggccac gctccccgtc 1141 cttggccgcc cctccacccc ctccaactct gctcctctag ggccgccgcc acctcccctg 1201 ggaccccgcc ccctcatcct gcctccattt cccggccacg ccccccagga cccctgcccc 1261 tccggggaca ccggccccgc cctcagccca ctggtcccgg gccgccgcgg accctgcgca 1321 ctctctggtc atcgcctggg aggaagatgc caccgccgca gcaaggtccc tgcggccacc 1381 acctcctcct cctcctggcc ctgctgctgc cctcgctgcc cctgacccgc gcccccgtgc 1441 ccccaggccc agccgccgcc ctgctccagg ctctaggact gcgcgatgag ccccagggtg 1501 cccccaggct ccggccggtt cccccggtca tgtggcgcct gtttcgacgc cgggaccccc 1561 aggagaccag gtctggctcg cggcggacgt ccccaggggt caccctgcaa ccgtgccacg 1621 tggaggagct gggggtcgcc ggaaacatcg tgcgccacat cccggaccgc ggtgcgccca 1681 cccgggcctc ggagcctgtc tcggccgcgg ggcattgccc tgagtggaca gtcgtcttcg 1741 acctgtcggc tgtggaaccc gctgagcgcc cgagccgggc ccgcctggag ctgcgtttcg 1801 cggcggcggc ggcggcagcc ccggagggcg gctgggagct gagcgtggcg caagcgggcc 1861 agggcgcggg cgcggacccc gggccggtgc tgctccgcca gttggtgccc gccctggggc 1921 cgccagtgcg cgcggagctg ctgggcgccg cttgggctcg caacgcctca tggccgcgca 1981 gcctccgcct ggcgctggcg ctacgccccc gggcccctgc cgcctgcgcg cgcctggccg 2041 aggcctcgct gctgctggtg accctcgacc cgcgcctgtg ccaccccctg gcccggccgc 2101 ggcgcgacgc cgaacccgtg ttgggcggcg gccccggggg cgcttgtcgc gcgcggcggc 2161 tgtacgtgag cttccgcgag gtgggctggc accgctgggt catcgcgccg cgcggcttcc 2221 tggccaacta ctgccagggt cagtgcgcgc tgcccgtcgc gctgtcgggg tccggggggc 2281 cgccggcgct caaccacgct gtgctgcgcg cgctcatgca cgcggccgcc ccgggagccg 2341 ccgacctgcc ctgctgcgtg cccgcgcgcc tgtcgcccat ctccgtgctc ttctttgaca 2401 acagcgacaa cgtggtgctg cggcagtatg aggacatggt ggtggacgag tgcggctgcc 2461 gctaacccgg ggcgggcagg gacgcgggcc caacaataaa tgccgcgtgg // LOCUS HUMGFAT 3082 bp mRNA PRI 31-DEC-1994 DEFINITION Human glutamine:fructose-6-phosphate amidotransferase (GFAT) mRNA, complete cds. ACCESSION M90516 NID g183081 KEYWORDS fructose-6-phosphate amidotransferase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3082) AUTHORS McKnight,G.L., Mudri,S.L., Mathewes,S.L., Traxinger,R.R., Marshall,S., Sheppard,P.O. and O'Hara,P.J. TITLE Molecular cloning, cDNA sequence, and bacterial expression of human glutamine:fructose-6-phosphate amidotransferase JOURNAL J. Biol. Chem. 267 (35), 25208-25212 (1992) MEDLINE 93094229 FEATURES Location/Qualifiers source 1..3082 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="pheochromocytoma" mRNA 1..2082 /gene="GFAT" gene 1..2168 /gene="GFAT" CDS 123..2168 /gene="GFAT" /note="putative" /codon_start=1 /product="glutamine:fructose-6-phosphate amidotransferase" /db_xref="PID:g183082" /translation="MCGIFAYLNYHVPRTRREILETLIKGLQRLEYRGYDSAGVGFDG GNDKDWEANACKTQLIKKKGKVKALDEEVHKQQDMDLDIEFDVHLGIAHTRWATHGEP SPVNSHPQRSDKNNEFIVIHNGIITNYKDLKKFLESKGYDFESETDTETIAKLVKYMY DNRESQDTSFTTLVERVIQQLEGAFALVFKSVHFPGQAVGTRRGSPLLIGVRSEHKLS TDHIPILYRTGKDKKGSCNLSRVDSTTCLFPVEEKAVEYYFASDASAVIEHTNRVIFL EDDDVAAVVDGRLSIHRIKRTAGDHPGRAVQTLQMELQQIMKGNFSSFMQKEIFEQPE SVVNTMRGRVNFDDYTVNLGGLKDHIKEIQRCRRLILIACGTSYHAGVATRQVLEELT ELPVMVELASDFLDRNTPVFRDDVCFFLSQSGETADTLMGLRYCKERGALTVGITNTV GSSISRETDCGVHINAGPEIGVASTKAYTSQFVSLVMFALMMCDDRISMQERRKEIML GLKRLPDLIKEVLSMDDEIQKLATELYHQKSVLIMGRGYHYATCLEGALKIKEITYMH SEGILAGELKHGPLALVDKLMPVIMIIMRDHTYAKCQNALQQVVARQGRPVVICDKED TETIKNTKRTIKVPHSVDCLQGILSVIPLQLLAFHLAVLRGYDVDFPRNLAKSVTVE" BASE COUNT 896 a 578 c 697 g 911 t ORIGIN 1 agggagtcgt gtcggcgcca ccccggcccc cgagcccgca gattgcccac cgaagctcgt 61 gtgtgcaccc ccgatcccgc cagccactcg cccctggcct cgcgggccgt gtctccggca 121 tcatgtgtgg tatatttgct tacttaaact accatgttcc tcgaacgaga cgagaaatcc 181 tggagaccct aatcaaaggc cttcagagac tggagtacag aggatatgat tctgctggtg 241 tgggatttga tggaggcaat gataaagatt gggaagccaa tgcctgcaaa acccagctta 301 ttaagaagaa aggaaaagtt aaggcactgg atgaagaagt tcacaagcaa caagatatgg 361 atttggatat agaatttgat gtacaccttg gaatagctca tacccgttgg gcaacacatg 421 gagaacccag tcctgtcaat agccaccccc agcgctctga taaaaataat gaatttatcg 481 ttattcacaa tggaatcatc accaactaca aagacttgaa aaagtttttg gaaagcaaag 541 gctatgactt cgaatctgaa acagacacag agacaattgc caagctcgtt aagtatatgt 601 atgacaatcg ggaaagtcaa gataccagct ttactacctt ggtggagaga gttatccaac 661 aattggaagg tgcttttgca cttgtgttta aaagtgttca ttttcccggg caagcagttg 721 gcacaaggcg aggtagccct ctgttgattg gtgtacggag tgaacataaa ctttctactg 781 atcacattcc tatactctac agaacaggca aagacaagaa aggaagctgc aatctctctc 841 gtgtggacag cacaacctgc cttttcccgg tggaagaaaa agcagtggag tattactttg 901 cttctgatgc aagtgctgtc atagaacaca ccaatcgcgt catctttctg gaagatgatg 961 atgttgcagc agtagtggat ggacgtcttt ctatccatcg aattaaacga actgcaggag 1021 atcaccccgg acgagctgtg caaacactcc agatggaact ccagcagatc atgaagggca 1081 acttcagttc atttatgcag aaggaaatat ttgagcagcc agagtctgtc gtgaacacaa 1141 tgagaggaag agtcaacttt gatgactata ctgtgaattt gggtggtttg aaggatcaca 1201 taaaggagat ccagagatgc cggcgtttga ttcttattgc ttgtggaaca agttaccatg 1261 ctggtgtagc aacacgtcaa gttcttgagg agctgactga gttgcctgtg atggtggaac 1321 tagcaagtga cttcctggac agaaacacac cagtctttcg agatgatgtt tgctttttcc 1381 ttagtcaatc aggtgagaca gcagatactt tgatgggtct tcgttactgt aaggagagag 1441 gagctttaac tgtggggatc acaaacacag ttggcagttc catatcacgg gagacagatt 1501 gtggagttca tattaatgct ggtcctgaga ttggtgtggc cagtacaaag gcttatacca 1561 gccagtttgt atcccttgtg atgtttgccc ttatgatgtg tgatgatcgg atctccatgc 1621 aagaaagacg caaagagatc atgcttggat tgaaacggct gcctgatttg attaaggaag 1681 tactgagcat ggatgacgaa attcagaaac tagcaacaga actttatcat cagaagtcag 1741 ttctgataat gggacgaggc tatcattatg ctacttgtct tgaaggggca ctgaaaatca 1801 aagaaattac ttatatgcac tctgaaggca tccttgctgg tgaattgaaa catggccctc 1861 tggctttggt ggataaattg atgcctgtga tcatgatcat catgagagat cacacttatg 1921 ccaagtgtca gaatgctctt cagcaagtgg ttgctcggca ggggcggcct gtggtaattt 1981 gtgataagga ggatactgag accattaaga acacaaaaag aacgatcaag gtgccccact 2041 cagtggactg cttgcagggc attctcagcg tgatcccttt acagttgctg gctttccacc 2101 ttgctgtgct gagaggctat gatgttgatt tcccacggaa tcttgccaaa tctgtgactg 2161 tagagtgagg aatatctata caaaatgtac gaaactgtat gattaagcaa cacaagacac 2221 cttttgtatt taaaaccttg atttaaaata tcaccccttg aagccttttt ttagtaaatc 2281 cttatttata tatcagttat aattattcca ctcaatatgt gatttttgtg aagttacctc 2341 ttacattttc ccagtaattt gtggaggact ttgaataatg gaatctatat tggaatctgt 2401 atcagaaaga ttctagctat tattttcttt aaagaatgct gggtgttgca tttctggacc 2461 ctccacttca atctgagaag acaatatgtt tctaaaaatt ggtacttgtt tcaccatact 2521 tcattcagac cagtgaaaga gtagtgcatt taattggagt atctaaagcc agtggcagtg 2581 tatgctcata cttggacagt tagggaaggg tttgccaagt tttaagagaa gatgtgattt 2641 attttgaaat ttgtttctgt tttgttttta aatcaaactg taaaacttaa aactgaaaaa 2701 ttttattggt aggatttata tctaagtttg gttagcctta gtttctcaga cttgttgtct 2761 attatctgta ggtggaagaa atttaggaag cgaaatatta cagtagtgca ttggtgggtc 2821 tcaatcctta acatatttgc acaattttat agcacaaact ttaaattcaa gctgctttgg 2881 acaactgaca atatgatttt aaatttgaag atgggatgtg tacatgttgg gtatcctact 2941 actttgtgtt ttcatctcct aaaagtgttt tttatttcct tgtatctgta gtcttttatt 3001 ttttaaatga ctgctgaatg acatatttta tcttgttctt taaaatcaca acacagagct 3061 gctattaaat taatattgat at // LOCUS HUMGGTBS 1969 bp mRNA PRI 29-MAR-1994 DEFINITION Human geranylgeranyltransferase type I beta-subunit mRNA, complete cds. ACCESSION L25441 NID g466490 KEYWORDS geranylgeranyltransferase type I beta-subunit. SOURCE Homo sapiens placenta, kidney cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1969) AUTHORS Zhang,F.L., Diehl,R.E., Kohl,N.E., Gibbs,J.B., Giros,B., Casey,P.J. and Omer,C.A. TITLE cDNA cloning and expression of rat and human protein geranylgeranyltransferase type-I JOURNAL J. Biol. Chem. 269, 3175-3180 (1994) MEDLINE 94148804 FEATURES Location/Qualifiers source 1..1969 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta, kidney" CDS 313..1446 /codon_start=1 /product="geranylgeranyltransferase type I beta-subunit" /db_xref="PID:g466491" /translation="MVATEDERLAGSGEGERLDFLRDRHVRFFQRCLQVLPERYSSLE TSRLTIAFFALSGLDMLDSLDVVNKDDIIEWIYSLQVLPTEDRSNLNRCGFRGSSYLG IPFNPSKAPGTAHPYDSGHIAMTYTGLSCLVILGDDLSRVNKEACLAGLRALQLEDGS FCAVPEGSENDMRFVYCASCICYMLNNWSGMDMKKAITYIRRSMSYDNGLAQGAGLES HGGSTFCGIASLCLMGKLEEVFSEKELNRIKRWCIMRQQNGYHGRPNKPVDTCYSFWV GATLKLLKIFQYTNFEKNRNYILSTQDRLVGGFAKWPDSHPDALHAYFGICGLSLMEE SGICKVHPALNVSTRTSERLLDLHQSWKTKDSKQCSENVHIST" BASE COUNT 559 a 371 c 443 g 596 t ORIGIN 1 gaataaaatg aacaattcag ttcctcagtc acatgagctg tgtgtcaaat gcacaacagc 61 cgtatgtggc tcgtggcccc tgtaccggac actcccatcc ctgcagagtt actggacagt 121 gctgatctag ggattctgtt acaaaatcca tgaaagtgtt cagcacaatg ccgggcccat 181 ataaacgtca gtagttgttg ttattataat tagtcttgac ccaacggcaa attcactttg 241 agaccttaga taaatcactc tacctctctg agcctggttt ccttgcccta aaaggatggc 301 aaggggctgg gcatggtggc cactgaggat gagaggctag cagggagcgg tgagggagag 361 cggctggatt tcttacggga tcggcacgtg cgatttttcc agcgctgcct ccaggttttg 421 ccggagcgct attcttcact cgagacaagc aggttgacaa ttgcattttt tgcactctcc 481 gggctggata tgttggattc cttagatgtg gtgaacaaag atgatataat agagtggatt 541 tattccctgc aggtccttcc cacagaagac agatcaaatc taaatcgctg tggtttccga 601 ggctcttcat acctgggtat tccgttcaat ccatcaaagg ctcctggaac agctcatcct 661 tatgatagtg gccacattgc aatgacctac actggcctct catgcttagt tattcttgga 721 gacgacttaa gccgagtaaa taaagaagct tgcttagcgg gcttgagagc ccttcagctg 781 gaagatggga gtttttgtgc agtacctgaa ggcagtgaaa atgacatgcg atttgtgtac 841 tgtgcttcct gtatttgcta tatgctcaac aactggtcag gcatggatat gaaaaaagcc 901 atcacctata ttagaaggag tatgtcctat gacaatggac tggcacaggg agctggactt 961 gaatctcatg gaggatcaac tttttgtggc attgcctcac tatgtctgat gggtaaacta 1021 gaagaagttt tttcagaaaa agaattgaac aggataaaga ggtggtgtat aatgaggcaa 1081 caaaatggtt atcatggaag acctaataag cctgtagaca cctgttattc tttttgggtg 1141 ggagcaactc tgaagcttct aaaaattttc caatacacta actttgagaa aaatagaaat 1201 tacatcttat caactcaaga tcgccttgta gggggatttg ccaagtggcc agacagtcat 1261 ccagatgctt tgcatgcata ctttgggatc tgtggcctgt cactaatgga ggaaagtgga 1321 atttgtaaag ttcatcctgc tctgaatgta agcacacgga cttctgaacg ccttctagat 1381 ctccatcaaa gctggaaaac caaggactct aaacaatgct cagagaatgt acatatctcc 1441 acatgactga ttttagattg ggagggtggg ggggatttgt agcataactg tagctcaagt 1501 ttaaaagcca tgtataacca agtgtgctct ttttttaaaa ggtagagtct tacaatcaaa 1561 tctcctgctg atttcacttt gggatatggt cttgagccag taatctttat actgggtttc 1621 aagaaaatct ttgttgaagt ttgaaccaca actttgtcgt ggttcttaaa tgtttatact 1681 gtatttctaa gaagttgttt gaggcaaatt aactgtatgt gtgtaggtta tctttttaaa 1741 aactcttcag tgcaaattgt atcttattat aaaatggaca caaattttca agtttacact 1801 tcatatagca ttgataatct tcaggtgaac acttagtgat catttaaaaa gctcactgct 1861 gatcgtagaa aatttgcttt aattaattaa gtatctggga ttattctttg aaaacagatg 1921 accataattt tttttaaaga agagtgactt attttgtctt attcttaag // LOCUS HUMGGTR 2414 bp mRNA PRI 31-DEC-1994 DEFINITION Human gamma-glutmyl transpeptidase-related protein (GGT-Rel) mRNA, complete cds. ACCESSION M64099 NID g183141 KEYWORDS gamma-glutamyltranspeptidase; transmembrane protein. SOURCE Homo sapiens placenta cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2414) AUTHORS Heisterkamp,N., Rajpert-De Meyts,E., Uribe,L., Forman,H.J. and Groffen,J. TITLE Identification of a human gamma-glutamyl cleaving enzyme related to, but distinct from, gamma-glutamyl transpeptidase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88 (14), 6303-6307 (1991) MEDLINE 91296809 FEATURES Location/Qualifiers source 1..2414 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" gene 342..2102 /gene="GGT-Rel" CDS 342..2102 /gene="GGT-Rel" /codon_start=1 /product="gamma-glutmyl transpeptidase-related protein" /db_xref="PID:g183142" /translation="MARGYGATVSLVLLGLGLALAVIVLAVVLSRHQAPCGPQAFAHA AVAADSKVCSDIGRAILQQQGSPVDATIAALVCTSVVNPQSMGLGGGVIFTIYNVTTG KVEVINARETVPASHAPSLLDQCAQALPLGTGAQWIGVPGELRGYAEAHRRHGRLPWA QLFQPTIALLRGGHVVAPVLSRFLHNSILRPSLQASTLRQLFFNGTEPLRPQDPLPWP ALATTLETVATEGVEVFYTGRLGQMLVEDIAKEGSQLTLQDLAKFQPEVVDALEVPLG DYTLYSPPPPAGGAILSFILNVLRGFNFSTESMARPEGRVNVYHHLVETLKFARGQRW RLGDPRSHPKLQNASRDLLGETLAQLIRQQIDGRGDHQLSHYSLAEAWGHGTGTSHVS VLGEDGSAVAATSTINTPFGAMVYSPRTGIILNNELLDLCERCPWGSGTTPSPVSGDR VGGAPGRCWPPVPGERSPSSMVPSILINKAQGSKLVIGGAGGELIISAVAQAIMSKLW LGFDLRAAIAAPILHVNSKGCVEYEPNFSQEVQRGLQDRGQNQTQRPFFLNVVQAVSQ EGACVYAVSDLRKSGEAAGY" BASE COUNT 419 a 800 c 723 g 472 t ORIGIN 1 ggggtgaggg cagcagctcg ccacagctgc cagccatctg tccattcacc catctgtcca 61 tctggcagcc cgctgttcag acctgtctgt ctgtccgccc atctctgtaa gcccatctct 121 gtcccattgt ctatctgacc atctttctct tactgtcctc tttgtctagc tatctggcct 181 atctgtcgat ccatcttcgt gtctgtcttc agcccccacc tgtttttgtc catctgtcca 241 attacctgtg actctgtgca tcttcttgtc cattcatctg cccacccatc cgtccctccg 301 tctgcccacc agccgcccct ctcctcctgg gctgcagagc catggcccgg ggctacgggg 361 ccacggtcag cctagtcctg ctgggtctgg ggctggcgct ggctgtcatt gtgctggctg 421 tggtcctctc tcgacaccag gccccatgtg gcccccaggc ctttgcccac gctgctgttg 481 ccgccgactc caaggtctgc tcggatattg gacgagccat cctccagcag cagggctcac 541 ccgtggatgc caccatcgcg gctctggtct gcaccagcgt cgtcaaccct cagagcatgg 601 gcctgggcgg aggggtcatc ttcaccatct acaatgtgac aacagggaag gtggaggtca 661 tcaatgcccg ggagacggtg ccggccagcc acgccccgag cctgctggac cagtgtgcac 721 aggctctgcc actgggcaca ggggcccagt ggatcggggt gcccggggag ctccgtggct 781 atgccgaggc ccaccgccgc catggccgcc tgccctgggc gcagctgttc cagcccacca 841 tcgcgctgct ccgagggggg catgtggtgg cccctgtcct cagccgtttc ctgcacaaca 901 gcatcctgcg gccttccttg caggcgtcaa ccctgcgcca gctcttcttc aacgggacag 961 aacccctgag gcctcaggac ccactcccat ggcctgcact ggccaccacc ctggagaccg 1021 tggccacaga gggcgtggag gtcttctaca cggggaggct gggccagatg ctggtggagg 1081 acattgccaa ggaagggagc cagctgacgc tgcaggacct ggccaagttc cagcccgagg 1141 tggtggatgc cctggaggtg cccctggggg actataccct gtactcacca ccgccgcctg 1201 cagggggtgc cattctcagc tttatcctca acgtgctaag agggttcaac ttctcaacag 1261 agtctatggc caggcctgaa gggagggtga acgtgtacca ccaccttgta gagacgctca 1321 agtttgccag ggggcagagg tggaggctgg gggaccctcg aagccacccg aagctccaga 1381 atgcctcccg ggacctgctg ggggagaccc tggcccagct catccgccaa cagatcgatg 1441 gccgggggga ccaccagctc agccactaca gcttggccga ggcctggggc cacgggacag 1501 gcacgtccca tgtgtctgtg ctgggggagg atggcagcgc cgtggctgcc accagcacca 1561 tcaacacacc ctttggagcg atggtgtatt caccacggac aggcatcatc ctcaacaacg 1621 agctcctgga cttatgcgag cgatgcccct ggggttccgg caccaccccc tcacctgtga 1681 gtggagacag ggtgggtgga gctcccggaa ggtgctggcc cccagttcca ggcgagcgtt 1741 ccccatcctc catggtgccc tccatcttga tcaacaaagc ccaggggtcg aagctagtga 1801 ttggcggggc tggcggggag ctcatcatct ctgctgtggc ccaggccatc atgagcaagc 1861 tgtggcttgg ctttgacctg agagcggcca ttgcagcccc catcctgcat gtcaacagca 1921 agggctgtgt ggagtacgag cccaacttca gccaggaggt gcagagggga ctccaagacc 1981 gtggccagaa ccagacccag aggcccttct tcctgaacgt ggtccaggct gtgtcccagg 2041 agggggcctg tgtgtacgcc gtctcggacc tgaggaagag tggggaggcc gcaggctact 2101 aagacactgc tctgcccaga gctgaagtct ggccccacca tgagtcctgt gtccaggccg 2161 gacatggctg ggggaccaac tactctggca ggatctggac ccctggcagg ggagtccagc 2221 tgagagtgga agaggtggcg gggaccagct gggcagatga gaggctgagc ctcatcccta 2281 accccctttc ccagagcccc tggtggtcct gaaccggccc ctctatccct ccgcaggcct 2341 cttacctggg gccactctcc caccctctcg atctgtatat cctccagtcc aagattaaag 2401 aagaggcgga ctgt // LOCUS HUMGHRHREC 1617 bp mRNA PRI 10-NOV-1992 DEFINITION Human growth hormone-releasing hormone receptor mRNA, complete cds. ACCESSION L01406 NID g183172 KEYWORDS growth hormone; growth hormone-releasing hormone; growth hormone-releasing hormone receptor; hormone receptor; pituitary specific. SOURCE Homo sapiens (library: Lambda bluemid from Clonetech) female pituitary tumor cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1617) AUTHORS Mayo,K.E. TITLE Molecular cloning and expression of a pituitary-specific receptor for growth hormone-releasing hormone JOURNAL Mol. Endocrinol. 6, 1734-1744 (1992) MEDLINE 93078807 FEATURES Location/Qualifiers source 1..1617 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="female" /tissue_type="pituitary tumor" /tissue_lib="Lambda bluemid from Clonetech" CDS 52..1323 /standard_name="GHRH receptor" /codon_start=1 /product="growth hormone-releasing hormone receptor" /db_xref="PID:g183173" /translation="MDRRMWGAHVFCVLSPLPTVLGHMHPECDFITQLREDESACLQA AEEMPNTTLGCPATWDGLLCWPTAGSGEWVTLPCPDFFSHFSSESGAVKRDCTITGWS EPFPPYPVACPVPLELLAEEESYFSTVKIIYTVGHSISIVALFVAITILVALRRLHCP RNYVHTQLFTTFILKAGRVFLKDAALFHSDDTDHCSFSTVLCKVSVAASHFATMTNFS WLLAEAVYLNCLLASTSPSSRRAFWWLVLAGWGLPVLFTGTWVSCKLAFEDIACWDLD DTSPYWWIIKGPIVLSVGVNFGLFLNIIRILVRKLEPAQGSLHTQSQYWRLSKSTLFL IPLFGIHYIIFNFLPDNAGLGIRLPLELGLGSFQGFIVAILYCFLNQEVRTEISRKWH GHDPELLPAWRTRAKWTTPSRSAAKVLTSMC" BASE COUNT 294 a 516 c 422 g 385 t ORIGIN 1 agcagccaag gcttactgag gctggtggag ggagccactg ctgggctcac catggaccgc 61 cggatgtggg gggcccacgt cttctgcgtg ttgagcccgt taccgaccgt attgggccac 121 atgcacccag aatgtgactt catcacccag ctgagagagg atgagagtgc ctgtctacaa 181 gcagcagagg agatgcccaa caccaccctg ggctgccctg cgacctggga tgggctgctg 241 tgctggccaa cggcaggctc tggcgagtgg gtcaccctcc cctgcccgga tttcttctct 301 cacttcagct cagagtcagg ggctgtgaaa cgggattgta ctatcactgg ctggtctgag 361 ccctttccac cttaccctgt ggcctgccct gtgcctctgg agctgctggc tgaggaggaa 421 tcttacttct ccacagtgaa gattatctac accgtgggcc atagcatctc tattgtagcc 481 ctcttcgtgg ccatcaccat cctggttgct ctcaggaggc tccactgccc ccggaactac 541 gtccacaccc agctgttcac cacttttatc ctcaaggcgg gacgtgtgtt cctgaaggat 601 gctgcccttt tccacagcga cgacactgac cactgcagct tctccactgt tctatgcaag 661 gtctctgtgg ccgcctccca tttcgccacc atgaccaact tcagctggct gttggcagaa 721 gccgtctacc tgaactgcct cctggcctcc acctccccca gctcaaggag agccttctgg 781 tggctggttc tcgctggctg ggggctgccc gtgctcttca ctggcacgtg ggtgagctgc 841 aaactggcct tcgaggacat cgcgtgctgg gacctggacg acacctcccc ctactggtgg 901 atcatcaaag ggcccattgt cctctcggtc ggggtgaact ttgggctttt tctcaatatt 961 atccgcatcc tggtgaggaa actggagcca gctcagggca gcctccatac ccagtctcag 1021 tattggcgtc tctccaagtc gacacttttc ctgatcccac tctttggaat tcactacatc 1081 atcttcaact tcctgccaga caatgctggc ctgggcatcc gcctccccct ggagctggga 1141 ctgggttcct tccagggctt cattgttgcc atcctctact gcttcctcaa ccaagaggtg 1201 aggactgaga tctcacggaa gtggcatggc catgaccctg agcttctgcc agcctggagg 1261 acccgtgcta agtggaccac gccttcccgc tcggcggcaa aggtgctgac atctatgtgc 1321 taggctgcct catcacgcca ctggagtcca cacttgaatt tgggcagcta ccacgggtct 1381 gccatgctct ggaggagcaa gggggccaca tccccacccc agctgttacc cagcccgggg 1441 caggtgcagc ccttcctccc tgtctctgca tctgactctc ttttgaggtc cctgtatgtc 1501 tacctctgac ttctgtggtc cctctgtgtc tgctctcatc cattcctctt actggggcct 1561 ggggctctag cccaaggctc agaggagcca ataaacctgt aaatgaaaaa aaaaaaa // LOCUS HUMGIPA 711 bp mRNA PRI 13-FEB-1996 DEFINITION Human gastric inhibitory polypeptide (GIP) mRNA, complete cds. ACCESSION M18185 NID g183212 KEYWORDS gastric inhibitory polypeptide. SOURCE Homo sapiens duodenum cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 711) AUTHORS Takeda,J., Seino,Y., Tanaka,K.I., Fukumoto,H., Kayano,T., Takahashi,H., Mitani,T., Kurono,M., Suzuki,T., Tobe,T. and Imura,H. TITLE Sequence of an intestinal cDNA encoding human gastric inhibitory polypeptide precursor JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84 (20), 7005-7008 (1987) MEDLINE 88041039 FEATURES Location/Qualifiers source 1..711 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="duodenum" /map="17q21.3-q22" mRNA <1..711 /gene="GIP" /note="G00-119-985" gene 1..711 /gene="GIP" sig_peptide 99..161 /gene="GIP" /note="G00-119-985" CDS 99..560 /gene="GIP" /note="precursor" /codon_start=1 /db_xref="GDB:G00-119-985" /product="gastric inhibitory polypeptide" /db_xref="PID:g183213" /translation="MVATKTFALLLLSLFLAVGLGEKKEGHFSALPSLPVGSHAKVSS PQPRGPRYAEGTFISDYSIAMDKIHQQDFVNWLLAQKGKKNDWKHNITQREARALELA SQANRKEEEAVEPQSSPAKNPSDEDLLRDLLIQELLACLLDQTNLCRLRSR" mat_peptide 252..377 /gene="GIP" /note="G00-119-985" /product="gastric inhibitory polypeptide" BASE COUNT 191 a 188 c 190 g 142 t ORIGIN 1 aggctcagaa ggtccagaaa tcaggggaag gagaccccta tctgtccttc ttctggaaga 61 gctggaaagg aagtctgctc aggaaataac cttggaagat ggtggccacg aagacctttg 121 ctctgctgct gctgtccctg ttcctggcag tgggactagg agagaagaaa gagggtcact 181 tcagcgctct cccctccctg cctgttggat ctcatgctaa ggtgagcagc cctcaacctc 241 gaggccccag gtacgcggaa gggactttca tcagtgacta cagtattgcc atggacaaga 301 ttcaccaaca agactttgtg aactggctgc tggcccaaaa ggggaagaag aatgactgga 361 aacacaacat cacccagagg gaggctcggg cgctggagct ggccagtcaa gctaatagga 421 aggaggagga ggcagtggag ccacagagct ccccagccaa gaaccccagc gatgaagatt 481 tgctgcggga cttgctgatt caagagctgt tggcctgctt gctggatcag acaaacctct 541 gcaggctcag gtctcggtga ctctgaccac acccagctca ggactcgatt ctgcccttca 601 cttagcacct gcctcagccc cactccagaa tagccaagag aacccaaacc aataaagttt 661 atgctaagtc gagcccattg tgaaaattta ttaaaatgac tactgagcac t // LOCUS HUMGJA4A 1601 bp mRNA PRI 08-NOV-1994 DEFINITION Homo sapiens connexin 37 (GJA4) mRNA, complete cds. ACCESSION M96789 NID g183222 KEYWORDS connexin 37; gap junction protein. SOURCE Homo sapiens umbilical vein cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1601) AUTHORS Reed,K.E., Westphale,E.M., Larson,D.M., Wang,H.Z., Veenstra,R.D. and Beyer,E.C. TITLE Molecular cloning and functional expression of human connexin37, an endothelial cell gap junction protein JOURNAL J. Clin. Invest. 91 (3), 997-1004 (1993) MEDLINE 93195088 FEATURES Location/Qualifiers source 1..1601 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="primary culture" /cell_type="endothelial cell" /tissue_type="umbilical vein" /map="1p36-q12" 5'UTR 1..64 /gene="GJA4" /note="G00-127-818" gene 1..1601 /gene="GJA4" CDS 65..1066 /gene="GJA4" /standard_name="gap juncion" /codon_start=1 /function="intercellular channel" /db_xref="GDB:G00-127-818" /product="connexin 37" /db_xref="PID:g183223" /translation="MGDWGFLEKLLDQVREHSTVVGKIWLTVLFIFRILILGLAGESV WGDEQSDFECNTAQPGCTNVCYDQAFPISHIRYWVLQFLFVSTPTLVYLGHVIYLSRR EERLAQKEGELRALPAKDPQVERALAGIELQMAKISVAEDGRLRIPRALMGTYVASVL CKSVLEAGFLYGQWRLYGWTMEPVFVCQRAPCPYLVDCFVSRPTEKTIFIIFMLVVGL ISLVLNLLELVHLLCRCLSRGMRARQGQDAPPTQGTSSDPYTDQGLLLPPRGQGPSSP PCPTYNGLSSSEQNWANLTTEERLASSRPPLFLDPPPQNGQKPPSRPSSSASKKQYV" 3'UTR 1067..1589 /gene="GJA4" /note="G00-127-818" polyA_site 1590..1601 /gene="GJA4" /note="G00-127-818; putative" BASE COUNT 306 a 507 c 455 g 333 t ORIGIN 1 ctccggccat cgtccccacc tccacctggg ccgcccgcga ggcagcggac ggaggccggg 61 agccatgggt gactggggct tcctggagaa gttgctggac caggtccgag agcactcgac 121 cgtggtgggt aagatctggc tgacggtgct cttcatcttc cgcatcctca tcctgggcct 181 ggccggcgag tcagtgtggg gtgacgagca gtcagatttc gagtgtaaca cggcccagcc 241 aggctgcacc aacgtctgct atgaccaggc cttccccatc tcccacatcc gctactgggt 301 gctgcagttc ctcttcgtca gcacacccac cctggtctac ctgggccatg tcatttacct 361 gtctcggcga gaagagcggc tggcgcagaa ggagggggag ctgcgggcac tgccggccaa 421 ggacccacag gtggagcggg cgctggccgg catagagctt cagatggcca agatctcggt 481 ggcagaagat ggtcgcctgc gcattccgcg agcactgatg ggcacctatg tcgccagtgt 541 gctctgcaag agtgtgctag aggcaggctt cctctatggc cagtggcgcc tgtacggctg 601 gaccatggag cccgtgtttg tgtgccagcg agcaccctgc ccctacctcg tggactgctt 661 tgtctctcgc cccacggaga agaccatctt catcatcttc atgttggtgg ttggactcat 721 ctccctggtg cttaacctgc tggagttggt gcacctgctg tgtcgctgcc tcagccgggg 781 gatgagggca cggcaaggcc aagacgcacc cccgacccag ggcacctcct cagaccctta 841 cacggaccag ggtcttcttc tacctccccg tggccagggg ccctcatccc caccatgccc 901 cacctacaat gggctctcat ccagtgagca gaactgggcc aacctgacca cagaggagag 961 gctggcgtct tccaggcccc ctctcttcct ggacccaccc cctcagaatg gccaaaaacc 1021 cccaagtcgt cccagcagct ctgcttctaa gaagcagtat gtatagaggc ctgtggctta 1081 tgtcacccaa cagaggggtc ctgagaagtc tggctgcctg ggatgccccc tgccccctcc 1141 tggaaggctc tgcagagatg actgggctgg ggaagcagat gcttgctggc catggagcct 1201 cattgcaagt tgttcttgaa cacctgaggc cttcctgtgg cccaccaggc actacggctt 1261 cctctccaga tgtgctttgc ctgagcacag acagtcagca tggaatgctc ttggccaagg 1321 gtactggggc cctctggcct tttgcagctg atccagagga acccagagcc aacttacccc 1381 aacctcaccc tatggaacag tcacctgtgc gcaggttgtc ctcaaaccct ctcctcacag 1441 gaaaaggcgg attgaggctg ctgggtcagc cttgatcgca cagacagagc ttgtgccgga 1501 tttggccctg tcaaggggac tggtgccttg ttttcatcac tccttcctag ttctactgtt 1561 caagcttctg aaataaacag gacttgatca caaaaaaaaa a // LOCUS HUMGLBA 2767 bp mRNA PRI 08-NOV-1994 DEFINITION Human co-beta glucosidase (proactivator) mRNA, complete cds. ACCESSION J03077 NID g183230 KEYWORDS beta glucosidase; glucosidase activator; proactivator. SOURCE Human placenta, cDNA to mRNA, clone EGTISI. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2767) AUTHORS Rorman,E.G. and Grabowski,G.A. TITLE Molecular cloning of a human co-beta-glucosidase cDNA: evidence that four sphingolipid hydrolase activator proteins are encoded by single genes in humans and rats JOURNAL Genomics 5 (3), 486-492 (1989) MEDLINE 90129043 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.A.Grabowski, 08-JUN-1989. FEATURES Location/Qualifiers source 1..2767 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Unassigned" sig_peptide 39..86 /gene="GLBA" /note="co-beta glucosidase signal peptide (pot.); putative" CDS 39..1613 /gene="GLBA" /note="co-beta glucosidase precursor" /codon_start=1 /db_xref="GDB:92" /db_xref="PID:g183231" /translation="MYALFLLASLLGAALAGPVLGLKECTRGSAVWCQNVKTASDCGA VKHCLQTVWNKPTVKSLPCDICKDVVTAAGDMLKDNATEEEILVYLEKTCDWLPKPNM SASCKEIVDSYLPVILDIIKGEMSRPGEVCSALNLCESLQKHLAELNHQKQLESNKIP ELDMTEVVAPFMANIPLLLYPQDGPRSKPQPKDNGDVCQDCIQMVTDIQTAVRTNSTF VQALVEHVKEECDRLGPGMADICKNYISQYSEIAIQMMMHMQPKEICALVGFCDEVKE MPMQTLVPAKVASKNVIPALELVEPIKKHEVPAKSDVYCEVCEFLVKEVTKLIDNNKT EKEILDAFDKMCSKLPKSLSEECQEVVDTYGSSILSILLEEVSPELVCSMLHLCSGTR LPALTVHVTQPKDGGFCEVCKKLVGYLDRNLEKNSTKQEILAALEKGCSFLPDPYQKQ CDQFVAEYEPVLIEILVEVMDPSFVCLKIGACPSAHKPLLGTEKCIWGPSYWCQNTET AAQCNAVEHCKRHVWN" gene 39..1613 /gene="GLBA" mat_peptide 87..1610 /gene="GLBA" /note="co-beta glucosidase (pot.); putative" BASE COUNT 590 a 709 c 796 g 672 t ORIGIN Chromosome 10q21. 1 gggcgggcgc attgcagact gcggagtcag acggtgctat gtacgccctc ttcctcctgg 61 ccagcctcct gggcgcggct ctagccggcc cggtccttgg actgaaagaa tgcaccaggg 121 gctcggcagt gtggtgccag aatgtgaaga cggcgtccga ctgcggggca gtgaagcact 181 gcctgcagac cgtttggaac aagccaacag tgaaatccct tccctgcgac atatgcaaag 241 acgttgtcac cgcagctggt gatatgctga aggacaatgc cactgaggag gagatccttg 301 tttacttgga gaagacctgt gactggcttc cgaaaccgaa catgtctgct tcatgcaagg 361 agatagtgga ctcctacctc cctgtcatcc tggacatcat taaaggagaa atgagccgtc 421 ctggggaggt gtgctctgct ctcaacctct gcgagtctct ccagaagcac ctagcagagc 481 tgaatcacca gaagcagctg gagtccaata agatcccaga gctggacatg actgaggtgg 541 tggccccctt catggccaac atccctctcc tcctctaccc tcaggacggc ccccgcagca 601 agccccagcc aaaggataat ggggacgttt gccaggactg cattcagatg gtgactgaca 661 tccagactgc tgtacggacc aactccacct ttgtccaggc cttggtggaa catgtcaagg 721 aggagtgtga ccgcctgggc cctggcatgg ccgacatatg caagaactat atcagccagt 781 attctgaaat tgctatccag atgatgatgc acatgcaacc caaggagatc tgtgcgctgg 841 ttgggttctg tgatgaggtg aaagagatgc ccatgcagac tctggtcccc gccaaagtgg 901 cctccaagaa tgtcatccct gccctggaac tggtggagcc cattaagaag cacgaggtcc 961 cagcaaagtc tgatgtttac tgtgaggtgt gtgaattcct ggtgaaggag gtgaccaagc 1021 tgattgacaa caacaagact gagaaagaaa tactcgacgc ttttgacaaa atgtgctcga 1081 agctgccgaa gtccctgtcg gaagagtgcc aggaggtggt ggacacgtac ggcagctcca 1141 tcctgtccat cctgctggag gaggtcagcc ctgagctggt gtgcagcatg ctgcacctct 1201 gctctggcac gcggctgcct gcactgaccg ttcacgtgac tcagccaaag gacggtggct 1261 tctgcgaagt gtgcaagaag ctggtgggtt atttggatcg caacctggag aaaaacagca 1321 ccaagcagga gatcctggct gctcttgaga aaggctgcag cttcctgcca gacccttacc 1381 agaagcagtg tgatcagttt gtggcagagt acgagcccgt gctgatcgag atcctggtgg 1441 aggtgatgga tccttccttc gtgtgcttga aaattggagc ctgcccctcg gcccataagc 1501 ccttgttggg aactgagaag tgtatatggg gcccaagcta ctggtgccag aacacagaga 1561 cagcagccca gtgcaatgct gtcgagcatt gcaaacgcca tgtgtggaac taggaggagg 1621 aatattccat cttggcagaa accacagcat tggttttttt ctacttgtgt gtctggggga 1681 atgaacgcac agatctgttt gactttgtta taaaaatagg gctcccccac ctcccccatt 1741 tctgtgtcct ttattgtagc attgctgtct gcaagggagc ccctagcccc tagcccctgg 1801 cagacatagc tgcttcagtg ccccttttct ctctgctaga tggatgttga tgcactggag 1861 gtcttttagc ctgcccttgc atggcgcctg ctggaggagg agagagctct gctggcatga 1921 gccacagttt cttgactgga ggccatcaac cctcttggtt gaggccttgt tctgagccct 1981 gacatgtgct tgggcactgg tgggcctggg cttctgaggt ggcctcctgc cctgatcagg 2041 gaccctcccc gctttcctgg gcctctcagt tgaacaaagc agcaaaacaa aggcagtttt 2101 atatgaaaga ttagaagcct ggaataatca ggctttttaa atgatgtaat tcccactgta 2161 atagcatagg gattttggaa gcagctgctg gtggcttggg acatcagtgg ggccaagggt 2221 tctctgtccc tggttcaact gtgatttggc tttcccgtgt ctttcctggt gatgccttgt 2281 ttggggttct gtgggtttgg gtgggaagag ggccatctgc ctgaatgtaa cctgctagct 2341 ctccgaagcc ctgcgggcct ggcttgtgtg agcgtgtgga cagtggtggc cgcgctgtgc 2401 ctgctcgtgt tgcctacatg tccctggctg ttgaggcgct gcttcagcct gcacccctcc 2461 cttgtctcat agatgctcct tttgaccttt tcaaataaat atggatggcg agctcctagg 2521 cctctggctt cctggtagag ggcggcatgc cgaagggtct gtctgggtgt ggattggatg 2581 ctgggggtgt gggggttgga agctgtctgt ggcccacttg ggcacccacg cttctgtcca 2641 cttctggttg ccaggagaca gcaagcaaag ccagcaggac atgaagttgc tattaaatgg 2701 acttcgtgat ttttgttttg cactaaagtt tctgtgattt aacaataaaa ttctgttagc 2761 ccccccg // LOCUS HUMGLCNAC 2602 bp mRNA PRI 08-NOV-1994 DEFINITION Human N-acetylglucosaminyltransferase I (GlcNAc-TI) mRNA, complete cds. ACCESSION M55621 NID g183236 KEYWORDS N-acetylglucosaminyltransferase I. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2602) AUTHORS Kumar,R., Yang,J., Larsen,R.D. and Stanley,P. TITLE Cloning and expression of N-acetylglucosaminyltransferase I, the medial Golgi transferase that initiates complex N-linked carbohydrate formation JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87 (24), 9948-9952 (1990) MEDLINE 91088628 FEATURES Location/Qualifiers source 1..2602 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="A431" /tissue_lib="cDNA in CDM7 vector" /map="5" gene 162..1499 /gene="MGAT" CDS 162..1499 /gene="MGAT" /EC_number="2.4.1.51" /codon_start=1 /db_xref="GDB:G00-128-225" /product="N-acetylglucosaminyltransferase I" /db_xref="PID:g183237" /translation="MLKKQSAGLVLWGAILFVAWNALLLLFFWTRPAPGRPPSVSALD GDPASLTREVIRLAQDAEVELERQRGLLQQIGDALSSQRGRVPTAAPPAQPRVPVTPA PAVIPILVIACDRSTVRRCLDKLLHYRPSAELFPIIVSQDCGHEETAQAIASYGSAVT HIRQPDLSSIAVPPDHRKFQGYYKIARHYRWALGQVFRQFRFPAAVVVEDDLEVAPDF FEYFRATYPLLKADPSLWCVSAWNDNGKEQMVDASRPELLYRTDFFPGLGWLLLAELW AELEPKWPKAFWDDWMRRPEQRQGRACIRPEISRTMTFGRKGVSHGQFFDQHLKFIKL NQQFVHFTQLDLSYLQREAYDRDFLARVYGAPQLQVEKVRTNDRKELGEVRVQYTGRD SFKAFAKALGVMDDLKSGVPRAGYRGIVTFQFRGRRVHLAPPLTWEGYDPSWN" gene 2566..2571 /gene="GlcNAc-TI" polyA_signal 2566..2571 /gene="GlcNAc-TI" BASE COUNT 471 a 785 c 794 g 552 t ORIGIN 1 ggccaagttc ggggccagga cgtcgggagg acctggtgca tggctgcctc ctaatcccat 61 agtccagagg aggcatccct aggactgcgg gcaagggagc cgggcaagcc cagggcagcc 121 ttgaaccgtc ccctggcctg ccctccccgg tgggggccag gatgctgaag aagcagtctg 181 cagggcttgt gctgtggggc gctatcctct ttgtggcctg gaatgccctg ctgctcctct 241 tcttctggac gcgcccagca cctggcaggc caccctcagt cagcgctctc gatggcgacc 301 ccgccagcct cacccgggaa gtgattcgcc tggcccaaga cgccgaggtg gagctggagc 361 ggcagcgtgg gctgctgcag cagatcgggg atgccctgtc gagccagcgg gggagggtgc 421 ccaccgcggc ccctcccgcc cagccgcgtg tgcctgtgac ccccgcgccg gcggtgattc 481 ccatcctggt catcgcctgt gaccgcagca ctgttcggcg ctgcctggac aagctgctgc 541 attatcggcc ctcggctgag ctcttcccca tcatcgttag ccaggactgc gggcacgagg 601 agacggccca ggccatcgcc tcctacggca gcgcggtcac gcacatccgg cagcccgacc 661 tgagcagcat tgcggtgccg ccggaccacc gcaagttcca gggctactac aagatcgcgc 721 gccactaccg ctgggcgctg ggccaggtct tccggcagtt tcgcttcccc gcggccgtgg 781 tggtggagga tgacctggag gtggccccgg acttcttcga gtactttcgg gccacctatc 841 cgctgctgaa ggccgacccc tccctgtggt gcgtctcggc ctggaatgac aacggcaagg 901 agcagatggt ggacgccagc aggcctgagc tgctctaccg caccgacttt ttccctggcc 961 tgggctggct gctgttggcc gagctctggg ctgagctgga gcccaagtgg ccaaaggcct 1021 tctgggacga ctggatgcgg cggccggagc agcggcaggg gcgggcctgc atacgccctg 1081 agatctcaag aacgatgacc tttggccgca agggtgtgag ccacgggcag ttctttgacc 1141 agcacctcaa gtttatcaag ctgaaccagc agtttgtgca cttcacccag ctggacctgt 1201 cttacctgca gcgggaggcc tatgaccgag atttcctcgc ccgcgtctac ggtgctcccc 1261 agctgcaggt ggagaaagtg aggaccaatg accggaagga gctgggggag gtgcgggtgc 1321 agtatacggg cagggacagc ttcaaggctt tcgccaaggc tctgggtgtc atggatgacc 1381 ttaagtcggg ggttccgaga gctggctacc ggggtattgt caccttccag ttccggggcc 1441 gccgtgtcca cctggcgccc ccactgacgt gggagggcta tgatcctagc tggaattagc 1501 acctgcctgt ccttcctggg cccctccttg ccacatcatg agctgaggtg ggaccacagt 1561 ccccaggctg catcggcctg cctgtgtttc cctcttaggt gcatttatct ttttgatttt 1621 tccgagtggc atttaagtgc acaaatgata acaagaggat tattctcccg ttctcaaggg 1681 agtcagatca ggggaactat tctagggtat gttgcggggt attaagcagg aaaccactgt 1741 gtggtggggg gcactgggct tgttggggcc agaaatgtcc acgtcctgag ctttctcctg 1801 gagcatgtgc agagagtttg gcaacgttcg ctctcttgac cagacccctt ctccctgacc 1861 tggctcttcc agccagggca cgagccctcc ttctatacct gctccccttc ccccagtggg 1921 gactgagtta tgggagaagg ggacatattt gtggccaaaa tgatactaac caaaggggct 1981 tccttgtcag ggcctggtgg agttggtggg tcatcggggc tcactgcctc ctgcccttct 2041 ctcctgtctg acccccactt agcccttctc tccttgcagc ctagcagttt atagttctga 2101 gatggaaagt tgaagggggc aagcaagacc tctcctcagc ccatgcccag ctgtcaggag 2161 agaggtgcag ggaggaaggc cttgtgctgg gacaacctct ctcttgcctt acctcagaga 2221 gggactatgc cctgacccct cctttctgaa aatcagtgcc ctccctgttg ctctaggagg 2281 ctcctgctgg cttggtagaa gacagaattc gatctgcctg tccctttttc ccctggggtt 2341 tgacacacag gctcctctca gcatgaggtg gagcagtgac caggtggagc agtgaccagg 2401 acgcctctgg cccagtgctg cccagcctcc ccgcccgctc ccaggcgccc catgtcctca 2461 caggccagga cgccatggca ggatggagag gacttggtgg atttttgttt cttgcctgac 2521 ctcagtttca tgaaagaaag tggaagctac agaattattt tctaaaataa aggctgaatt 2581 gtctgaaaaa aaaaaaaaaa aa // LOCUS HUMGLI3A 5055 bp mRNA PRI 08-NOV-1994 DEFINITION Human DNA-binding protein (GLI3) mRNA, complete cds. ACCESSION M57609 M34366 NID g183247 KEYWORDS GLI3 gene. SOURCE Human glioblastoma multiforme xenograft D245MG, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5055) AUTHORS Ruppert,J.M., Vogelstein,B., Arheden,K. and Kinzler,K.W. TITLE GLI3 encodes a 190-kilodalton protein with multiple regions of GLI similarity JOURNAL Mol. Cell. Biol. 10 (10), 5408-5415 (1990) MEDLINE 90377231 FEATURES Location/Qualifiers source 1..5055 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="xenograft D245MG" /cell_type="glioblastoma multiforme" /map="7p13" mRNA <1..5046 /gene="GLI3" /note="G00-119-990" /product="DNA-binding protein" gene 1..5046 /gene="GLI3" CDS 55..4845 /gene="GLI3" /codon_start=1 /db_xref="GDB:G00-119-990" /product="DNA-binding protein" /db_xref="PID:g183248" /translation="MEAQSHSSTTTEKKKVENSIVKCSTRTDVSEKAVASSTTSNEDE SPGQTYHRERRNAITMQPQNVQGLSKVSEEPSTSSDERASLIKKEIHGSLPHVAEPSV PYRGTVFAMDPRNGYMEPHYHPPHLFPAFHPPVPIDARHHEGRYHYDPSPIPPLHMTS ALSSSPTYPDLPFIRISPHRNPAAASESPFSPPHPYINPYMDYIRSLHSSPSLSMISA TRGLSPTDAPHAGVSPAEYYHQMALLTGQRSPYADIIPSAATAGTGAIHMEYLHAMDS TRFSSPRLSARPSRKRTLSISPLSDHSFDLQTMIRTSPNSLVTILNNSRSSSSASGSY GHLSASAISPALSFTYSSAPVSLHMHQQILSRQQSLGSAFGHSPPLIHPAPTFPTQRP IPGIPTVLNPVQVSSGPSESSQNKPTSESAVSSTGDPMHNKRSKIKPDEDLPSPGARG QQEQPEGTTLVKEEGDKDESKQEPEVIYETNCHWEGCAREFDTQEQLVHHINNDHIHG EKKEFVCRWLDCSREQKPFKAQYMLVVHMRRHTGEKPHKCTFEGCTKAYSRLENLKTH LRSHTGEKPYVCEHEGCNKAFSNASDRAKHQNRTHSNEKPYVCKIPGCTKRYTDPSSL RKHVKTVHGPEAHVTKKQRGDIHPRPPPPRDSGSHSQSRSPGRPTQGALGEQQDLSNT TSKREECLQVKTVKAEKPMTSQPSPGGQSSCSSQQSPISNYSNSGLELPLTDGGSIGD LSAIDETPIMDSTISTATTALALQARRNPAGTKWMEHVKLERLKQVNGMFPRLNPILP PKAPAVSPLIGNGTQSNNTCSLGGPMTLLPGRSDLSGVDVTMLNMLNRRDSSASTISS AYLSSRRSSGISPCFSSRRSSEASQAEGRPQNVSVADSYDPISTDASRRSSEASQSDG LPSLLSLTPAQQYRLKAKYAAATGGPPPTPLPNMERMSLKTRLALLGDALEPGVALPP VHAPRRCSDGGAHGYGRRHLQPHDALGHGVRRASDPVRTGSEGLALPRVPRFSSLSSC NPPAMATSAEKRSLVLQNYTRPEGGQSRNFHSSPCPPSITENVTLESLTMDADANLND EDFLPDDVVQYLNSQNQAGYEQHFPSALPDDSKVPHGPGDFDAPGLPDSHAGQQFHAL EQPCPEGSKTDLPIQWNEVSSGSADLSSSKLKCGPRPAVPQTRAFGFCNGMVVHPQNP LRSGPAGGYQTLGENSNPYGGPEHLMLHNSPGSGTSGNAFHEQPCKAPQYGNCLNRQP VAPGALDGACGAGIQASKLKSTPMQGSGGQLNFGLPVAPNESAGSMVNGMQNQDPVGQ GYLAHQLLGDSMQHPGAGRPGQQMLGQISATSHINIYQGPESCLPGAHGMGSQPSSLA VVRGYQPCASFGGSRRQAMPRDSLALQSGQLSDTSQTCRVNGIKMEMKGQPHPLCSNL QNYSGQFYDQTVGFSQQDTKAGSFSISDASCLLQGTSAKNSELLSPGANQVTSTVDSL DSHDLEGVQIDFDAIIDDGDHSSLMSGALSPSIIQNLSHSSSRLTTPRASLPFPVAVH EHHQHGYRGHEFFADLPSGRKQIPCSYAIGFRKKRLQPTEINRS" BASE COUNT 1243 a 1591 c 1290 g 931 t ORIGIN 1 cgatactacg tgggcatttt tggtcgaaga gagctgaagt aatgagaaga catcatggag 61 gcccagtccc acagctccac gaccactgaa aagaaaaaag ttgagaattc catagtgaag 121 tgctccactc gaacagatgt gagcgagaaa gccgttgcct ccagcaccac ttctaatgag 181 gatgaaagtc ctggacagac ttatcacaga gagagaagaa acgcaatcac tatgcagcca 241 cagaatgtcc aggggctcag caaagtcagt gaggaacctt caacatcgag tgacgagagg 301 gcctcattga tcaagaaaga gatccatggg tccctgccac acgtggcgga gccctctgtg 361 ccgtaccgcg ggacggtgtt tgccatggac cccaggaatg gttacatgga gccccactac 421 caccctcctc atcttttccc tgccttccat cctcctgtac caattgatgc cagacatcat 481 gagggccgtt accattacga tccatctccg attcctccat tgcatatgac ttccgcctta 541 tctagtagcc ctacgtatcc ggacctgccc ttcattagga tctccccaca ccggaacccc 601 gctgctgctt ccgagtctcc cttcagccct ccacatccct acattaatcc ctacatggac 661 tatatccgct ccttgcacag cagcccatcg ctctccatga tctcagcaac ccgtgggctg 721 agccctacag atgcgcccca tgcaggagtc agcccagcag aatactatca tcagatggcc 781 ctgctaactg gccagcgcag cccctatgca gacattattc cctcagctgc caccgccggc 841 acgggggcca tccacatgga atatcttcat gctatggata gcaccagatt ctccagcccc 901 aggctgtcag ccaggccgag ccgaaaacgt acactgtcca tatcaccact ctccgatcat 961 agctttgacc ttcagaccat gataaggacg tctcccaact ccttggtcac gattctcaat 1021 aattcccgta gcagctcttc agcaagtggc tcctatggtc acttatctgc aagtgcaatc 1081 agccctgcct tgagcttcac ctactcttcc gcgcccgtct ctctccacat gcatcagcag 1141 atcctaagcc gacaacagag cttaggttca gcctttggac acagccctcc actcatccac 1201 cctgccccaa cttttccaac acagaggcct attccaggga tccctacggt tctgaacccc 1261 gtccaggtca gctccggccc ttctgagtcc tcacagaaca agcccacgag tgagtctgca 1321 gtgagcagca ctggtgaccc gatgcacaac aagaggtcca agatcaaacc cgatgaagac 1381 ctccccagcc caggggctcg ggggcagcag gaacagcccg aaggaacaac ccttgtcaag 1441 gaggaagggg acaaagatga aagcaaacag gagcctgaag tcatctatga gacaaactgc 1501 cactgggaag gctgcgcgag ggagttcgac acccaagagc agcttgtgca ccatataaat 1561 aacgaccata ttcatggaga gaagaaggag ttcgtgtgca ggtggctgga ctgctcaaga 1621 gagcagaaac ccttcaaagc ccagtatatg ttggtagtgc atatgagaag acacacgggc 1681 gagaagcctc acaaatgcac ttttgaaggt tgcacaaagg cctactcgag actagaaaac 1741 ttgaaaacac acttgagatc tcacactgga gagaaaccat acgtctgtga gcacgaaggt 1801 tgcaacaagg ctttctcaaa tgcctctgat cgcgccaaac accaaaacag aacgcattcc 1861 aatgagaaac catatgtgtg caaaatccca ggctgcacta agcgttacac agacccaagc 1921 tccctccgga aacatgtgaa gacagtgcat ggcccagagg ctcatgtcac caagaagcag 1981 cgaggggaca tccatcctcg gccgccaccc ccgagagatt ccggcagcca ttcacagtcc 2041 aggtcgcctg gccgaccgac tcagggagcc cttggtgagc agcaggacct cagcaacact 2101 acctcaaagc gggaagaatg cctccaggtg aaaaccgtca aggcagagaa gccaatgaca 2161 tctcagccaa gccctggtgg tcagtcttca tgcagcagcc aacagtcccc catcagcaac 2221 tattccaaca gtgggctcga gcttcctctg accgatggag gtagtatagg agacctcagt 2281 gccatcgatg aaaccccaat catggactca accatttcca ctgcaaccac agcccttgct 2341 ttgcaagcca ggagaaaccc ggcagggacc aaatggatgg agcacgtaaa actagaaagg 2401 ctaaaacaag tgaatggaat gtttccgcga ctgaacccca ttctaccccc taaagcccct 2461 gcggtctctc ctctcatagg aaatggcaca cagtccaaca acacctgcag cttgggtggg 2521 cccatgacgc ttctcccggg cagaagcgac ctctctgggg tggacgtcac tatgctgaac 2581 atgctcaaca gaagggacag cagcgccagc accatcagct cggcctacct gagcagccgc 2641 cgctcctcag ggatctcgcc ctgcttctcc agccgccgct ccagcgaggc gtcacaggcc 2701 gagggccggc cgcagaacgt gagcgtggcc gactcctacg accccatctc caccgacgcc 2761 tcgcgccgct ccagcgaagc cagccagagc gacggcctgc ccagcctgct cagcctcacg 2821 cccgcccagc agtaccgcct caaggccaag tacgcggctg ccacaggagg gccgccgccg 2881 acgcccctgc ccaacatgga gaggatgagc ctgaagacgc gcctggcgct gctcggggat 2941 gccctcgagc ctggcgtggc cctgcctcca gttcatgccc cgaggaggtg cagcgacggg 3001 ggagcccacg gctacgggcg gcgccacctg cagccgcacg atgcgctggg ccacggcgtg 3061 aggagggcca gcgacccggt gcggacaggc tccgagggcc tggccctgcc tcgtgtgccg 3121 cgcttcagca gcctcagcag ctgcaacccc ccggcgatgg ccacgtccgc ggagaagcgc 3181 agtctcgtgc ttcagaatta cacgcggccc gagggcggcc agtcccgaaa cttccactcg 3241 tccccctgtc ctcccagcat caccgagaac gtcaccctgg agtccctgac catggacgct 3301 gatgccaacc tgaacgatga ggatttcctg ccggacgacg tggtgcagta tttaaattcc 3361 cagaaccaag cagggtacga gcagcacttc cccagcgccc tcccggacga cagcaaagtg 3421 ccccacgggc ccggtgactt tgacgcgccc gggctgccag acagccacgc tggccagcag 3481 ttccatgccc tcgagcagcc ctgccccgag ggcagcaaaa ccgacctgcc cattcagtgg 3541 aacgaagtca gctccggaag cgccgacctg tcctcctcca agctcaagtg tgggccgcgg 3601 cccgctgtgc cgcagactcg cgcctttggg ttctgcaacg gcatggtcgt ccacccgcag 3661 aaccccttga ggagcgggcc tgctgggggc tatcagaccc tcggggagaa cagcaacccc 3721 tacggtggcc cagagcactt gatgctccac aacagccccg gaagtggcac cagtggaaac 3781 gccttccatg aacagccctg taaggccccg cagtatggga actgtctcaa caggcagcca 3841 gtggcccctg gtgcactcga cggtgcctgt ggtgccggga ttcaagcctc aaagctgaag 3901 agcaccccca tgcaagggag cgggggccag ctgaatttcg gcctgccggt agcgccaaat 3961 gagtcagctg gcagcatggt gaatggcatg cagaaccagg acccagtggg acaggggtac 4021 ctggctcacc agctcctcgg cgacagcatg cagcacccgg gggcaggccg ccccggtcag 4081 cagatgcttg ggcagattag tgctacctca cacatcaaca tctaccaagg gccagagagc 4141 tgcctgccag gggctcacgg catgggcagc cagccgtcaa gcttggcagt tgtcaggggc 4201 taccagccat gtgccagctt tgggggcagc aggcgccagg ctatgccgag ggacagcctt 4261 gctctgcagt caggacagct cagtgacaca agtcagacct gcagggtgaa tggtatcaag 4321 atggagatga aagggcagcc ccatccgctg tgctctaatc tgcagaatta ctctggtcag 4381 ttctatgacc aaaccgtggg cttcagtcag caagacacga aagctggttc attctctatt 4441 tcagacgcca gctgcctgct acaggggacc agcgccaaaa actctgagtt actttcccca 4501 ggtgctaatc aggtgacaag cacagtggac agcctcgaca gccatgacct ggaaggggta 4561 cagattgact tcgatgccat catagacgat ggggaccact ccagcctgat gtcgggggcc 4621 ctgagcccaa gtatcattca gaacctttcc catagctcct cccgcctcac cacgcctcgg 4681 gcgtccctcc cattcccagt cgctgtccat gagcaccacc aacatggcta tcggggacat 4741 gagttctttg ctgacctccc tagcggaaga aagcaaattc cttgcagtta tgcaataggc 4801 tttaggaaaa aaagactgca accaacggaa atcaatagga gttgaagaga ttaaactgac 4861 tttgttttgg ctgttttttt agttctgtat gtattttagc aatctcatct cacctaactg 4921 agatgtgttt caattatatt ccttttatgg aaaaggactc tgaaaaaccc taaagtattc 4981 tagggagaaa ctgtcttcca tttcagtttt gaatcagtat tgttacactc aaaccaccct 5041 ctttttaaaa aaaaa // LOCUS HUMGLO1A 623 bp mRNA PRI 08-NOV-1994 DEFINITION Human glyoxalase-1 mRNA, complete cds. ACCESSION L07837 NID g183257 KEYWORDS glyoxalase I; lactoylglutathione lyase. SOURCE Homo sapiens colon cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ranganathan,S., Walsh,E.S., Godwin,A.K. and Tew,K.D. TITLE Cloning and characterization of human colon glyoxalase-I JOURNAL J. Biol. Chem. 268 (8), 5661-5667 (1993) MEDLINE 93194863 FEATURES Location/Qualifiers source 1..623 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="colon" /map="6p21.3-p21.1" gene 61..615 /gene="GLO1" CDS 61..615 /gene="GLO1" /EC_number="4.4.1.5" /codon_start=1 /db_xref="GDB:G00-119-992" /product="glyoxaslase I" /db_xref="PID:g183258" /translation="MAEPQPPSGGLTDEAALSCCSDADPSTKDFLLQQTMLRVKDPKK SLDFYTRVLGMTLIQKCDFPIMKFSLYFLAYEDKNDIPKEKDEKIAWALSRKATLELT HNWGTEDDATQSYHNGNSDPRGFGHIGIAVPDVYSACKRFEELGVKFVKKPDDGKMKG LAFIQDPDGYWIEILNPNKMATLM" BASE COUNT 174 a 140 c 150 g 159 t ORIGIN 1 ccggtgtggg tgactcctcc gttccttggg tcccgtcgtc tgtgatactg cagcgcagcc 61 atggcagaac cgcagccccc gtccggcggc ctcacggacg aggccgccct cagttgctgc 121 tccgacgcgg accccagtac caaggatttt ctattgcagc agaccatgct acgagtgaag 181 gatcctaaga agtcactgga tttttatact agagttcttg gaatgacgct aatccaaaaa 241 tgtgattttc ccattatgaa gttttcactc tacttcttgg cttatgagga taaaaatgac 301 atccctaaag aaaaagatga aaaaatagcc tgggcgctct ccagaaaagc tacacttgag 361 ctgacacaca attggggcac tgaagatgat gcgacccaga gttaccacaa tggcaattca 421 gaccctcgag gattcggtca tattggaatt gctgttcctg atgtatacag tgcttgtaaa 481 aggtttgaag aactgggagt caaatttgtg aagaaacctg atgatggtaa aatgaaaggc 541 ctggcattta ttcaagatcc tgatggctac tggattgaaa ttttgaatcc taacaaaatg 601 gcaaccttaa tgtagtgctg tga // LOCUS HUMGLSYKIN 1389 bp mRNA PRI 16-MAY-1995 DEFINITION Human protein kinase mRNA, complete cds. ACCESSION L33801 NID g529236 KEYWORDS glycogen synthase kinase 3; protein kinase. SOURCE Homo sapiens (library: lambda ZAP) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1389) AUTHORS Stambolic,V. and Woodgett,J.R. TITLE Mitogen inactivation of glycogen synthase kinase-3 beta in intact cells via serine 9 phosphorylation JOURNAL Biochem. J. 303, 701-704 (1994) MEDLINE 95071278 FEATURES Location/Qualifiers source 1..1389 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HepG2" /cell_type="hepatoma" /tissue_lib="lambda ZAP" CDS 40..1302 /codon_start=1 /product="protein kinase" /db_xref="PID:g529237" /translation="MSGRPRTTSFAESCKPVQQPSAFGSMKVSRDKDGSKVTTVVATP GQGPDRPQEVSYTDTKVIGNGSFGVVYQAKLCDSGELVAIKKVLQDKRFKNRELQIMR KLDHCNIVRLRYFFYSSGEKKDEVYLNLVLDYVPETVYRVARHYSRAKQTLPVIYVKL YMYQLFRSLAYIHSFGICHRDIKPQNLLLDPDTAVLKLCDFGSAKQLVRGEPNVSYIC SRYYRAPELIFGATDYTSSIDVWSAGCVLAELLLGQPIFPGDSGVDQLVEIIKVLGTP TREQIREMNPNYTEFKFPQIKAHPWTKVFRPRTPPEAIALCSRLLEYTPTARLTPLEA CAHSFFDELRDPNVKHPNGRDTPALFNFTTQELSSNPPLATILIPPHARIQAAASTPT NATAASDANTGDRGQTNNAASASASNST" polyA_site 1389 BASE COUNT 402 a 326 c 326 g 335 t ORIGIN 1 ggagaaggaa ggaaaaggtg attcgcgaag agagtgatca tgtcagggcg gcccagaacc 61 acctcctttg cggagagctg caagccggtg cagcagcctt cagcttttgg cagcatgaaa 121 gttagcagag acaaggacgg cagcaaggtg acaacagtgg tggcaactcc tgggcagggt 181 ccagacaggc cacaagaagt cagctataca gacactaaag tgattggaaa tggatcattt 241 ggtgtggtat atcaagccaa actttgtgat tcaggagaac tggtcgccat caagaaagta 301 ttgcaggaca agagatttaa gaatcgagag ctccagatca tgagaaagct agatcactgt 361 aacatagtcc gattgcgtta tttcttctac tccagtggtg agaagaaaga tgaggtctat 421 cttaatctgg tgctggacta tgttccggaa acagtataca gagttgccag acactatagt 481 cgagccaaac agacgctccc tgtgatttat gtcaagttgt atatgtatca gctgttccga 541 agtttagcct atatccattc ctttggaatc tgccatcggg atattaaacc gcagaacctc 601 ttgttggatc ctgatactgc tgtattaaaa ctctgtgact ttggaagtgc aaagcagctg 661 gtccgaggag aacccaatgt ttcgtatatc tgttctcggt actatagggc accagagttg 721 atctttggag ccactgatta tacctctagt atagatgtat ggtctgctgg ctgtgtgttg 781 gctgagctgt tactaggaca accaatattt ccaggggata gtggtgtgga tcagttggta 841 gaaataatca aggtcctggg aactccaaca agggagcaaa tcagagaaat gaacccaaac 901 tacacagaat ttaaattccc tcaaattaag gcacatcctt ggactaaggt cttccgaccc 961 cgaactccac cggaggcaat tgcactgtgt agccgtctgc tggagtatac accaactgcc 1021 cgactaacac cactggaagc ttgtgcacat tcattttttg atgaattacg ggacccaaat 1081 gtcaaacatc caaatgggcg agacacacct gcactcttca acttcaccac tcaagaactg 1141 tcaagtaatc cacctctggc taccatcctt attcctcctc atgctcggat tcaagcagct 1201 gcttcaaccc ccacaaatgc cacagcagcg tcagatgcta atactggaga ccgtggacag 1261 accaataatg ctgcttctgc atcagcttcc aactccacct gaacagtccc gacgagccag 1321 ctgcacagga aaaaccacca gttacttgag tgtcactcag caacactggt cacgtttgga 1381 aagaatatt // LOCUS HUMGLTKIN 1510 bp mRNA PRI 31-DEC-1994 DEFINITION H.sapiens galactokinase (GK2) mRNA, complete cds. ACCESSION M84443 NID g183265 KEYWORDS galactokinase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1510) AUTHORS Lee,R.T., Peterson,C.L., Calman,A.F., Herskowitz,I. and O'Donnell,J.J. TITLE Cloning of a human galactokinase gene (GK2) on chromosome 15 by complementation in yeast JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (22), 10887-10891 (1992) MEDLINE 93066348 FEATURES Location/Qualifiers source 1..1510 /organism="Homo sapiens" /db_xref="taxon:9606" /map="15" gene 21..1397 /gene="GK2" CDS 21..1397 /gene="GK2" /codon_start=1 /product="galactokinase" /db_xref="PID:g183266" /translation="MATESPATRRVQVAEHPRLLKLKEMFNSKFGSIPKFYVRAPGRV NIIGEHIDYCGYSVLPMAVEQDVLIAVEPVKTYALQLANTNPLYPDFSTSANNIQIDK TKPLWHNYFLCGLKGIQEHFGLSNLTGMNCLVDGNIPPSSGLSSSSALVCCAGLVTLT VLGRNLSKVELAEICAKSERYIGTEGGGMDQSISFLAEEGTAKLIEFSPLRATDVKLP SGAVFVIANSCVEMNKAATSHFNIRVMECRLAAKLLAKYKSLQWDKVLRLEEVQAKLG ISLEEMLLVTEDALHPEPYNPEEICRCLGISLEELRTQILSPNTQDVLIFKLYQRAKH VYSEAARVLQFKKICEEAPENMVQLLGELMNQSHMSCRDMYECSCPELDQLVDICRKF GAQGSRLTGAGWGGCTVSMVPADKLPSFLANVHKAYYQRSDGSLAPEKQSLFATKPGG GALVLLEA" BASE COUNT 421 a 307 c 402 g 380 t ORIGIN 1 agatctgaat tcggcgaaat atggctacag agagccctgc tacgcgtcgg gtccaggtgg 61 cagaacatcc taggttactg aagctaaagg agatgtttaa ctccaagttt ggatctattc 121 ccaagtttta tgttcgagca ccaggaagag tcaacataat aggagagcat atagattatt 181 gtggatattc tgttcttcct atggctgtag aacaagatgt gctaatagct gtagaacctg 241 tgaaaacgta cgctctccaa ctggccaata caaatccctt gtatccggac ttcagtacta 301 gtgctaataa catccagatt gataaaacca agcctttgtg gcacaactat ttcttatgtg 361 gacttaaagg aattcaggaa cactttggtc ttagtaacct gactggaatg aactgcctgg 421 tagatggaaa tatcccacca agttctggcc tctccagctc cagtgctttg gtctgttgtg 481 ctggcttggt gacgctcaca gtgctgggaa ggaatctatc caaggtggaa cttgcagaaa 541 tctgtgccaa gagtgagcgt tacattggca ctgaaggagg aggcatggac cagtctatat 601 catttcttgc agaagaagga actgccaagt tgatagaatt tagtcctctg agggcaaccg 661 atgtaaaact cccaagtgga gcagtgtttg tgattgccaa cagttgtgtg gagatgaata 721 aggcagcaac ttcccatttc aatatcaggg tgatggagtg tcggctggct gcgaagctcc 781 tggctaaata caaaagcttg caatgggaca aagtactgag gctggaggag gtgcaggcta 841 aactagggat tagtctagaa gaaatgctgt tggtcacaga agatgccctt catcctgaac 901 cctataaccc tgaggagatc tgcaggtgtc tgggaattag cctggaggaa ctccgaaccc 961 aaatcctgag tccaaacact caagatgtgc tcatcttcaa actctatcag cgggcaaagc 1021 atgtgtacag cgaggctgcg cgagtgctcc agtttaagaa gatatgtgaa gaagcacctg 1081 aaaacatggt ccagctgctg ggagagttga tgaaccagag ccacatgagc tgccgggaca 1141 tgtatgagtg cagctgcccc gagctggatc agctggtgga catctgtcgg aagtttgggg 1201 ctcaagggtc acgacttact ggagcaggat ggggaggctg tacagtatca atggtacctg 1261 cggacaagct gcccagcttt ctagcaaatg tgcacaaagc ttattaccag aggagtgatg 1321 gaagcttagc accggagaag caaagtttgt ttgctaccaa acctggaggt ggggctttgg 1381 ttttgcttga ggcctgaaaa aatgtaaaaa gtctgagaga aactacttag ggcacttagg 1441 aattggcagg actttctgtg ccacagtaaa ttaatcttcc ttctgttttg tattatgatg 1501 aacggttgct // LOCUS HUMGLUR2A 2621 bp mRNA PRI 29-SEP-1995 DEFINITION Human rearranged metabotropic glutamate receptor type II (GLUR2) mRNA, complete cds. ACCESSION L35318 NID g999415 KEYWORDS G-protein coupled receptor; glutamate receptor type 2; rearranged. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2621) AUTHORS Flor,P.J., Lindauer,K., Puttner,I., Ruegg,D., Lukic,S., Knopfel,T. and Kuhn,R. TITLE Molecular cloning, functional expression and pharmacological characterization of the human metabotropic glutamate receptor type 2 JOURNAL Eur. J. Neurosci. 7 (4), 622-629 (1995) MEDLINE 95346007 FEATURES Location/Qualifiers source 1..2621 /organism="Homo sapiens" /db_xref="taxon:9606" /map="4q32-q33" mRNA 1..2621 /gene="GLUR2" /note="G00-131-458" gene 1..2621 /gene="GLUR2" CDS 3..2621 /gene="GLUR2" /codon_start=1 /db_xref="GDB:G00-131-458" /product="metabotropic glutamate receptor type II" /db_xref="PID:g999416" /translation="MGSLLALLALLPLWGAVAEGPAKKVLTLEGDLVLGGLFPVHQKG GPAEDCGPVNEHRGIQRLEAMLFALDRINRDPHLLPGVRLGAHILDSCSKDTHALEQA LDFVRASLSRGADGSRHICPDGSYATHGDAPTAITGVIGGSYSDVSIQVANLLRLFQI PQISYASTSAKLSDKSRYDYFARTVPPDFFQAKAMAEILRFFNWTYVSTEASEGDYGE TGIEAFELEARARNICVATSEKVGRAMSRAAFEGVVRALLQKPSARVAVLFTRSEDAR ELLAASQRLNASFTWVASDGWGALESVVAGSEGAAEGAITIELASYPISDFASYFQSL DPWNNSRNPWFREFWEQRFRCSFRQRDCAAHSLRAVPFEQESKIMFVVNAVYAMAHAL HNMHRALCPNTTRLCDAMRPVNGRRLYKDFVLNVKFDAPFRPADTHNEVRFDRFGDGI GRYNIFTYLRAGSGRYRYQKVGYWAEGLTLDTSLIPWASPSAGPLAASRCSEPCLQNE VKSVQPGEVCCWLCIPCQPYEYRLDEFTCADCGLGYWPNASLTGCFELPQEYIRWGDA WAVGPVTIACLGALATLFVLGVFVRHNATPVVKASGRELCYILLGGVFLCYCMTFIFI AKPSTAVCTLRRLGLGTAFSVCYSALLTKTNRIARIFGGAREGAQRPRFISPASQVAI CLALISGQLLIVVAWLVVEAPGTGKETAPERREVVTLRCNHRDASMLGSLAYNVLLIA LCTLYAFNTRKCPENFNEAKFIGFTMYTTCIIWLALLPIFYVTSSDYRVQTTTMCVSV SLSGSVVLGCLFAPKLHIILFQPQKNVVSHRAPTSRFGSAAARASSSLGQGSGSQFVP TVCNGREVVDSTTSSL" BASE COUNT 443 a 838 c 758 g 582 t ORIGIN 1 ccatgggatc gctgcttgcg ctcctggcac tgctgccgct gtggggtgct gtggctgagg 61 gcccagccaa gaaggtgctg accctggagg gagacttggt gctgggtggg ctgttcccag 121 tgcaccagaa gggcggccca gcagaggact gtggtcctgt caatgagcac cgtggcatcc 181 agcgcctgga ggccatgctt tttgcactgg accgcatcaa ccgtgacccg cacctgctgc 241 ctggcgtgcg cctgggtgca cacatcctcg acagttgctc caaggacaca catgcgctgg 301 agcaggcact ggactttgtg cgtgcctcac tcagccgtgg tgctgatgga tcacgccaca 361 tctgccccga cggctcttat gcgacccatg gtgatgctcc cactgccatc actggtgtta 421 ttggcggttc ctacagtgat gtctccatcc aggtggccaa cctcttgagg ctatttcaga 481 tcccacagat tagctacgcc tctaccagtg ccaagctgag tgacaagtcc cgctatgact 541 actttgcccg cacagtgcct cctgacttct tccaagccaa ggccatggct gagattctcc 601 gcttcttcaa ctggacctat gtgtccactg aggcctctga gggcgactat ggcgagacag 661 gcattgaggc ctttgagcta gaggctcgtg cccgcaacat ctgtgtggcc acctcggaga 721 aagtgggccg tgccatgagc cgcgcggcct ttgagggtgt ggtgcgagcc ctgctgcaga 781 agcccagtgc ccgcgtggct gtcctgttca cccgttctga ggatgcccgg gagctgcttg 841 ctgccagcca gcgcctcaat gccagcttca cctgggtggc cagtgatggt tggggggccc 901 tggagagtgt ggtggcaggc agtgaggggg ctgctgaggg tgctatcacc atcgagctgg 961 cctcctaccc catcagtgac tttgcctcct acttccagag cctggaccct tggaacaaca 1021 gccggaaccc ctggttccgt gaattctggg agcagaggtt ccgctgcagc ttccggcagc 1081 gagactgcgc agcccactct ctccgggctg tgccctttga acaggagtcc aagatcatgt 1141 ttgtggtcaa tgcagtgtac gccatggccc atgcgctcca caacatgcac cgtgccctct 1201 gccccaacac cacccggctc tgtgacgcga tgcggccagt taacgggcgc cgcctctaca 1261 aggactttgt gctcaacgtc aagtttgatg ccccctttcg cccagctgac acccacaatg 1321 aggtccgctt tgaccgcttt ggtgatggta ttggccgcta caacatcttc acctatctgc 1381 gtgcaggcag tgggcgctat cgctaccaga aggtgggcta ctgggcagaa ggcttgactc 1441 tggacaccag cctcatccca tgggcctcac cgtcagccgg ccccctggcc gcctctcgct 1501 gcagtgagcc ctgcctccag aatgaggtga agagtgtgca gccgggcgaa gtctgctgct 1561 ggctctgcat tccgtgccag ccctatgagt accgattgga cgaattcact tgcgctgatt 1621 gtggcctggg ctactggccc aatgccagcc tgactggctg cttcgaactg ccccaggagt 1681 acatccgctg gggcgatgcc tgggctgtgg gacctgtcac catcgcctgc ctcggtgccc 1741 tggccaccct gtttgtgctg ggtgtctttg tgcggcacaa tgccacacca gtggtcaagg 1801 cctcaggtcg ggagctctgc tacatcctgc tgggtggtgt cttcctctgc tactgcatga 1861 ccttcatctt cattgccaag ccatccacgg cagtgtgtac cttacggcgt cttggtttgg 1921 gcactgcctt ctctgtctgc tactcagccc tgctcaccaa gaccaaccgc attgcacgca 1981 tcttcggtgg ggcccgggag ggtgcccagc ggccacgctt catcagtcct gcctcacagg 2041 tggccatctg cctggcactt atctcgggcc agctgctcat cgtggtcgcc tggctggtgg 2101 tggaggcacc gggcacaggc aaggagacag cccccgaacg gcgggaggtg gtgacactgc 2161 gctgcaacca ccgcgatgca agtatgttgg gctcgctggc ctacaatgtg ctcctcatcg 2221 cgctctgcac gctttatgcc ttcaatactc gcaagtgccc cgaaaacttc aacgaggcca 2281 agttcattgg cttcaccatg tacaccacct gcatcatctg gctggcattg ttgcccatct 2341 tctatgtcac ctccagtgac taccgggtac agaccaccac catgtgcgtg tcagtcagcc 2401 tcagcggctc cgtggtgctt ggctgcctct ttgcgcccaa gctgcacatc atcctcttcc 2461 agccgcagaa gaacgtggtt agccaccggg cacccaccag ccgctttggc agtgctgctg 2521 ccagggccag ctccagcctt ggccaagggt ctggctccca gtttgtcccc actgtttgca 2581 atggccgtga ggtggtggac tcgacaacgt catcgctttg a // LOCUS HUMGLUTRN 2856 bp mRNA PRI 08-NOV-1994 DEFINITION Human (HepG2) glucose transporter gene mRNA, complete cds. ACCESSION K03195 NID g183302 KEYWORDS glucose transport protein; membrane glycoprotein; membrane protein. SOURCE Human hepatoma cell line HepG2, cDNA to mRNA, clone lambda-GT25. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2856) AUTHORS Mueckler,M., Caruso,C., Baldwin,S.A., Panico,M., Blench,I., Morris,H.R., Allard,W.J., Lienhard,G.E. and Lodish,H.F. TITLE Sequence and structure of a human glucose transporter JOURNAL Science 229 (4717), 941-945 (1985) MEDLINE 85272595 COMMENT A draft entry and printed copy of this sequence were kindly provided by M.Mueckler (15-NOV-1985). FEATURES Location/Qualifiers source 1..2856 /organism="Homo sapiens" /db_xref="taxon:9606" /map="22q13.1" gene 180..1658 /gene="SGLT1" CDS 180..1658 /gene="SGLT1" /note="glucose transporter glycoprotein" /codon_start=1 /db_xref="GDB:G00-120-375" /db_xref="PID:g183303" /translation="MEPSSKKLTGRLMLAVGGAVLGSLQFGYNTGVINAPQKVIEEFY NQTWVHRYGESILPTTLTTLWSLSVAIFSVGGMIGSFSVGLFVNRFGRRNSMLMMNLL AFVSAVLMGFSKLGKSFEMLILGRFIIGVYCGLTTGFVPMYVGEVSPTAFRGALGTLH QLGIVVGILIAQVFGLDSIMGNKDLWPLLLSIIFIPALLQCIVLPFCPESPRFLLINR NEENRAKSVLKKLRGTADVTHDLQEMKEESRQMMREKKVTILELFRSPAYRQPILIAV VLQLSQQLSGINAVFYYSTSIFEKAGVQQPVYATIGSGIVNTAFTVVSLFVVERAGRR TLHLIGLAGMAGCAILMTIALALLEQLPWMSYLSIVAIFGFVAFFEVGPGPIPWFIVA ELFSQGPRPAAIAVAGFSNWTSNFIVGMCFQYVEQLCGPYVFIIFTVLLVLFFIFTYF KVPETKGRTFDEIASGFRQGGASQSDKTPEELFHPLGADSQV" BASE COUNT 602 a 804 c 753 g 697 t ORIGIN 143 bp upstream of RsaI site. 1 tagtcgcggg tccccgagtg agcacgccag ggagcaggag accaaacgac gggggtcgga 61 gtcagagtcg cagtgggagt ccccggaccg gagcacgagc ctgagcggga gagcgccgct 121 cgcacgcccg tcgccacccg cgtacccggc gcagccagag ccaccagcgc agcgctgcca 181 tggagcccag cagcaagaag ctgacgggtc gcctcatgct ggctgtggga ggagcagtgc 241 ttggctccct gcagtttggc tacaacactg gagtcatcaa tgccccccag aaggtgatcg 301 aggagttcta caaccagaca tgggtccacc gctatgggga gagcatcctg cccaccacgc 361 tcaccacgct ctggtccctc tcagtggcca tcttttctgt tgggggcatg attggctcct 421 tctctgtggg ccttttcgtt aaccgctttg gccggcggaa ttcaatgctg atgatgaacc 481 tgctggcctt cgtgtccgcc gtgctcatgg gcttctcgaa actgggcaag tcctttgaga 541 tgctgatcct gggccgcttc atcatcggtg tgtactgcgg cctgaccaca ggcttcgtgc 601 ccatgtatgt gggtgaagtg tcacccacag cctttcgtgg ggccctgggc accctgcacc 661 agctgggcat cgtcgtcggc atcctcatcg cccaggtgtt cggcctggac tccatcatgg 721 gcaacaagga cctgtggccc ctgctgctga gcatcatctt catcccggcc ctgctgcagt 781 gcatcgtgct gcccttctgc cccgagagtc cccgcttcct gctcatcaac cgcaacgagg 841 agaaccgggc caagagtgtg ctaaagaagc tgcgcgggac agctgacgtg acccatgacc 901 tgcaggagat gaaggaagag agtcggcaga tgatgcggga gaagaaggtc accatcctgg 961 agctgttccg ctcccccgcc taccgccagc ccatcctcat cgctgtggtg ctgcagctgt 1021 cccagcagct gtctggcatc aacgctgtct tctattactc cacgagcatc ttcgagaagg 1081 cgggggtgca gcagcctgtg tatgccacca ttggctccgg tatcgtcaac acggccttca 1141 ctgtcgtgtc gctgtttgtg gtggagcgag caggccggcg gaccctgcac ctcataggcc 1201 tcgctggcat ggcgggttgt gccatactca tgaccatcgc gctagcactg ctggagcagc 1261 taccctggat gtcctatctg agcatcgtgg ccatctttgg ctttgtggcc ttctttgaag 1321 tgggtcctgg ccccatccca tggttcatcg tggctgaact cttcagccag ggtccacgtc 1381 cagctgccat tgccgttgca ggcttctcca actggacctc aaatttcatt gtgggcatgt 1441 gcttccagta tgtggagcaa ctgtgtggtc cctacgtctt catcatcttc actgtgctcc 1501 tggttctgtt cttcatcttc acctacttca aagttcctga gactaaaggc cggaccttcg 1561 atgagatcgc ttccggcttc cggcaggggg gagccagcca aagtgataag acacccgagg 1621 agctgttcca tcccctgggg gctgattccc aagtgtgagt cgccccagat caccagcccg 1681 gcctgctccc agcagcccta aggatctctc aggagcacag gcagctggat gagacttcca 1741 aacctgacag atgtcagccg agccgggcct ggggctcctt tctccagcca gcaatgatgt 1801 ccagaagaat attcaggact taacggctcc aggattttaa caaaagcaag actgttgctc 1861 aaatctattc agacaagcaa caggttttat aattttttta ttactgattt tgttattttt 1921 atatcagcct gagtctcctg tgcccacatc ccaggcttca ccctgaatgg ttccatgcct 1981 gagggtggag actaagccct gtcgagacac ttgccttctt cacccagcta atctgtaggg 2041 ctggacctat gtcctaagga cacactaatc gaactatgaa ctacaaagct tctatcccag 2101 gaggtggcta tggccacccg ttctgctggc ctggatctcc ccactctagg ggtcaggctc 2161 cattaggatt tgccccttcc catctcttcc tacccaacca ctcaaattaa tctttcttta 2221 cctgagacca gttgggagca ctggagtgca gggaggagag gggaagggcc agtctgggct 2281 gccgggttct agtctccttt gcactgaggg ccacactatt accatgagaa gagggcctgt 2341 gggagcctgc aaactcactg ctcaagaaga catggagact cctgccctgt tgtgtataga 2401 tgcaagatat ttatatatat ttttggttgt caatattaaa tacagacact aagttatagt 2461 atatctggac aagccaactt gtaaatacac cacctcactc ctgttactta cctaaacaga 2521 tataaatggc tggtttttag aaacatggtt ttgaaatgct tgtggattga gggtaggagg 2581 tttggatggg agtgagacag aagtaagtgg ggttgcaacc actgcaacgg cttagacttc 2641 gactcaggat ccagtccctt acacgtacct ctcatcagtg tcctcttgct caaaaatctg 2701 tttgatccct gttacccaga gaatatatac attctttatc ttgacattca aggcatttct 2761 atcacatatt tgatagttgg tgttcaaaaa aacactagtt ttgtgccagc cgtgatgctc 2821 aggcttgaaa tcgcattatt ttgaatgtga agggaa // LOCUS HUMGLVR1X 3220 bp mRNA PRI 08-NOV-1994 DEFINITION Human leukemia virus receptor 1 (GLVR1) mRNA, complete cds. ACCESSION L20859 NID g306769 KEYWORDS leukemia virus receptor 1. SOURCE Homo sapiens (tissue library: lambda HGR6, 7, and 16; Clontech #1020b) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3220) AUTHORS O'Hara,B., Johann,S.V., Klinger,H.P., Blair,D.G., Rubinson,H., Dunn,K.J., Sass,P., Vitek,S.M. and Robins,T. TITLE Characterization of a human gene conferring sensitivity to infection by gibbon ape leukemia virus JOURNAL Cell Growth Differ. 1 (3), 119-127 (1990) MEDLINE 91175479 FEATURES Location/Qualifiers source 1..3220 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HL60" /tissue_lib="lambda HGR6, 7, and 16; Clontech #1020b" /map="2q11-q14" gene 371..2410 /gene="GLVR1" CDS 371..2410 /gene="GLVR1" /codon_start=1 /db_xref="GDB:G00-125-248" /product="leukemia virus receptor 1" /db_xref="PID:g306770" /translation="MATLITSTTAATAASGPLVDYLWMLILGFIIAFVLAFSVGANDV ANSFGTAVGSGVVTLKQACILASIFETVGSVLLGAKVSETIRKGLIDVEMYNSTQGLL MAGSVSAMFGSAVWQLVASFLKLPISGTHCIVGATIGFSLVAKGQEGVKWSELIKIVM SWFVSPLLSGIMSGILFFLVRAFILHKADPVPNGLRALPVFYACTVGINLFSIMYTGA PLLGFDKLPLWGTILISVGCAVFCALIVWFFVCPRMKRKIEREIKCSPSESPLMEKKN SLKEDHEETKLSVGDIENKHPVSEVGPATVPLQAVVEERTVSFKLGDLEEAPERERLP SVDLKEETSIDSTVNGAVQLPNGNLVQFSQAVSNQINSSGHSQYHTVHKDSGLYKELL HKLHLAKVGDCMGDSGDKPLRRNNSYTSYTMAICGMPLDSFRAKEGEQKGEEMEKLTW PNADSKKRIRMDSYTSYCNAVSDLHSASEIDMSVKAAMGLGDRKGSNGSLEEWYDQDK PEVSLLFQFLQILTACFGSFAHGGNDVSNAIGPLVALYLVYDTGDVSSKVATPIWLLL YGGVGICVGLWVWGRRVIQTMGKDLTPITPSSGFSIELASALTVVIASNIGLPISTTH CKVGSVVSVGWLRSKKAVDWRLFRNIFMAWFVTVPISGVISAAIMAIFRYVILRM" BASE COUNT 783 a 710 c 763 g 964 t ORIGIN 1 gagctgtccc cggtgccgcc gacccgggcc gtgccgtgtg cccgtggctc cagccgctgc 61 cgcctcgatc tcctcgtctc ccgctccgcc ctcccttttc cctggatgaa cttgcgtcct 121 ttctcttctc cgccatggaa ttctgctccg tgcttttagc cctcctgagc caaagaaacc 181 ccagacaaca gatgcccata cgcagcgtat agcagtaact ccccagctcg gtttctgtgc 241 cgtagtttac agtatttaat tttatataat atatattatt tattatagca tttttgatac 301 ctcatattct gtttacacat cttgaaaggc gctcagtagt tctcttacta aacaaccact 361 actccagaga atggcaacgc tgattaccag tactacagct gctaccgccg cttctggtcc 421 tttggtggac tacctatgga tgctcatcct gggcttcatt attgcatttg tcttggcatt 481 ctccgtggga gccaatgatg tagcaaattc ttttggtaca gctgtgggct caggtgtagt 541 gaccctgaag caagcctgca tcctagctag catctttgaa acagtgggct ctgtcttact 601 gggggccaaa gtgagcgaaa ccatccggaa gggcttgatt gacgtggaga tgtacaactc 661 gactcaaggg ctactgatgg ccggctcagt cagtgctatg tttggttctg ctgtgtggca 721 actcgtggct tcgtttttga agctccctat ttctggaacc cattgtattg ttggtgcaac 781 tattggtttc tccctcgtgg caaaggggca ggagggtgtc aagtggtctg aactgataaa 841 aattgtgatg tcttggttcg tgtccccact gctttctgga attatgtctg gaattttatt 901 cttcctggtt cgtgcattca tcctccataa ggcagatcca gttcctaatg gtttgcgagc 961 tttgccagtt ttctatgcct gcacagttgg aataaacctc ttttccatca tgtatactgg 1021 agcaccgttg ctgggctttg acaaacttcc tctgtggggt accatcctca tctcggtggg 1081 atgtgcagtt ttctgtgccc ttatcgtctg gttctttgta tgtcccagga tgaagagaaa 1141 aattgaacga gaaataaagt gtagtccttc tgaaagcccc ttaatggaaa aaaagaatag 1201 cttgaaagaa gaccatgaag aaacaaagtt gtctgttggt gatattgaaa acaagcatcc 1261 tgtttctgag gtagggcctg ccactgtgcc cctccaggct gtggtggagg agagaacagt 1321 ctcattcaaa cttggagatt tggaggaagc tccagagaga gagaggcttc ccagcgtgga 1381 cttgaaagag gaaaccagca tagatagcac cgtgaatggt gcagtgcagt tgcctaatgg 1441 gaaccttgtc cagttcagtc aagccgtcag caaccaaata aactccagtg gccactccca 1501 gtatcacacc gtgcataagg attccggcct gtacaaagag ctactccata aattacatct 1561 tgccaaggtg ggagattgca tgggagactc cggtgacaaa cccttaaggc gcaataatag 1621 ctatacttcc tataccatgg caatatgtgg catgcctctg gattcattcc gtgccaaaga 1681 aggtgaacag aagggcgaag aaatggagaa gctgacatgg cctaatgcag actccaagaa 1741 gcgaattcga atggacagtt acaccagtta ctgcaatgct gtgtctgacc ttcactcagc 1801 atctgagata gacatgagtg tcaaggcagc gatgggtcta ggtgacagaa aaggaagtaa 1861 tggctctcta gaagaatggt atgaccagga taagcctgaa gtctctctcc tcttccagtt 1921 cctgcagatc cttacagcct gctttgggtc attcgcccat ggtggcaatg acgtaagcaa 1981 tgccattggg cctctggttg ctttatattt ggtttatgac acaggagatg tttcttcaaa 2041 agtggcaaca ccaatatggc ttctactcta tggtggtgtt ggtatctgtg ttggtctgtg 2101 ggtttgggga agaagagtta tccagaccat ggggaaggat ctgacaccga tcacaccctc 2161 tagtggcttc agtattgaac tggcatctgc cctcactgtg gtgattgcat caaatattgg 2221 ccttcccatc agtacaacac attgtaaagt gggctctgtt gtgtctgttg gctggctccg 2281 gtccaagaag gctgttgact ggcgtctctt tcgtaacatt tttatggcct ggtttgtcac 2341 agtccccatt tctggagtta tcagtgctgc catcatggca atcttcagat atgtcatcct 2401 cagaatgtga agctgtttga gattaaaatt tgtgtcaatg tttgggacca tcttaggtat 2461 tcctgctccc ctgaagaatg attacagtgt taacagaaga ctgacaagag tctttttatt 2521 tgggagcaga ggagggaagt gttacttgtg ctataactgc ttttgtgcta aatatgaatt 2581 gtctcaaaat tagctgtgta aaatagcccg ggttccactg gctcctgctg aggtcccctt 2641 tccttctggg ctgtgaattc ctgtacatat ttctctactt tttgtatcag gcttcaattc 2701 cattatgttt taatgttgtc tctgaagatg acttgtgatt tttttttctt ttttttaaac 2761 catgaagagc cgtttgacag agcatgctct gcgttgttgg tttcaccagc ttctgccctc 2821 acatgcacag ggatttaaca acaaaaatat aactacaact tcccttgtag tctcttatat 2881 aagtagagtc cttggtactc tgccctcctg tcagtagtgg caggatctat tggcatattc 2941 gggagcttct tagagggatg aggttctttg aacacagtga aaatttaaat tagtaacttt 3001 tttgcaagca gtttattgac tgttattgct aagaagaagt aagaaagaaa aagcctgttg 3061 gcaatcttgg ttatttcttt aagatttctg gcagtgtggg atggatgaat gaagtggaat 3121 gtgaactttg ggcaagttaa atgggacagc cttccatgtt catttgtcta cctcttaact 3181 gaataaaaaa gcctacagtt tttagaaaaa acccgaattc // LOCUS HUMGLVR2X 3175 bp mRNA PRI 11-MAY-1994 DEFINITION Human leukemia virus receptor 2 (GLVR2) mRNA, complete cds. ACCESSION L20852 NID g306771 KEYWORDS leukemia virus receptor 2. SOURCE Homo sapiens (library: Stratagene #936203) male placenta cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3175) AUTHORS van Zeijl,M., Johann,S.V., Closs,E., Cunningham,J., Eddy,R., Shows,T.B. and O'Hara,B. TITLE A human amphotropic retrovirus receptor is a second member of the gibbon ape leukemia virus receptor family JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91 (3), 1168-1172 (1994) MEDLINE 94134719 FEATURES Location/Qualifiers source 1..3175 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /tissue_type="placenta" /tissue_lib="Stratagene #936203" gene 244..2202 /gene="GLVR2" CDS 244..2202 /gene="GLVR2" /note="homologous to GLVR1" /codon_start=1 /product="leukemia virus receptor 2" /db_xref="PID:g306772" /translation="MAMDEYLWMVILGFIIAFILAFSVGANDVANSFGTAVGSGVVTL RQACILASIFETTGSVLLGAKVGETIRKGIIDVNLYNETVETLMAGEVSAMVGSAVWQ LIASFLRLPISGTHCIVGSTIGFSLVAIGTKGVQWMELVKIVASWFISPLLSGFMSGL LFVLIRIFILKKEDPVPNGLRALPVFYAATIAINVFSIMYTGAPVLGLVLPMWAIALI SFGVALLFAFFVWLFVCPWMRRKITGKLQKEGALSRVSDESLSKVQEAESPVFKELPG AKANDDSTIPLTGAAGETLGTSEGTSAGSHPRAAYGRALSMTHGSVKSPISNGTFGFD GHTRSDGHVYHTVHKDSGLYKDLLHKIHIDRGPEEKPAQESNYRLLRRNNSYTCYTAA ICGLPVHATFRAADSSAPEDSEKLVGDTVSYSKKRLRYDSYSSYCNAVAEAEIEAEEG GVEMKLASELADPDQPREDPAEEEKEEKDAPEVHLLFHFLQVLTACFGSFAHGGNDVS NAIGPLVALWLIYKQGGVTQEAATPVWLLFYGGVGICTGLWVWGRRVIQTMGKDLTPI TPSSGFTIELASAFTVVIASNIGLPVSTTHCKVGSVVAVGWIRSRKAVDWRLFRNIFV AWFVTVPVAGLFSAAVMALLMYGILPYV" BASE COUNT 739 a 810 c 815 g 811 t ORIGIN 1 cagatcggga agaaaaatat ggaatgtgtt ttaccgctga ctgaacacaa ccaaatgaac 61 tgtcctgaca gtagtttgca aaccagcagc tagcagtttg tccagcctct aacattgtcc 121 agcactttcc agagcaaact cactgtttac aagaactctt ggccttacga agtttataac 181 ctcaagcttt gtttatttaa aatattcctg caaaagaaaa gtacccggca cccactttcc 241 aaaatggcca tggatgagta tttgtggatg gtcattttgg gtttcatcat agctttcatc 301 ttggcctttt ctgttggtgc aaacgatgtt gccaactcct ttggtacagc cgtgggctct 361 ggtgtggtga ccttgaggca ggcatgcatt ttagcttcaa tatttgaaac caccggctcc 421 gtgttactag gcgccaaagt aggagaaacc attcgcaaag gtatcattga cgtgaacctg 481 tacaacgaga cggtggagac tctcatggct ggggaagtta gtgccatggt tggttccgct 541 gtgtggcagc tgattgcttc cttcctgagg cttccaatct caggaacgca ctgcattgtg 601 ggttctacta taggattctc actggtcgca atcggtacca aaggtgtgca gtggatggag 661 cttgtcaaga ttgttgcttc ttggtttata tctccactgt tgtctggttt catgtctggc 721 ctgctgtttg tactcatcag aattttcatc ttaaaaaagg aagaccctgt tcccaatggc 781 ctccgggcac tcccagtatt ctatgctgct accatagcaa tcaatgtctt ttccatcatg 841 tacacaggag caccagtgct cggccttgtt ctccccatgt gggccatagc cctcatttcc 901 tttggtgtcg ccctcctgtt cgcttttttt gtgtggctct tcgtgtgtcc gtggatgcgg 961 aggaaaataa caggcaaatt acaaaaagaa ggtgctttat cacgagtatc tgacgaaagc 1021 ctcagtaagg ttcaggaagc agagtcccca gtatttaaag agctaccagg tgccaaggct 1081 aatgatgaca gcaccatccc gctcacggga gcagcagggg agacactggg gacctcggaa 1141 ggcacttctg cgggcagcca ccctcgggct gcatacggaa gagcactgtc catgacccat 1201 ggctctgtga aatcgcccat ctccaacggc accttcggct tcgacggcca caccaggagc 1261 gacggtcatg tgtaccacac cgtgcacaaa gactcggggc tctacaaaga tctgctgcac 1321 aaaatccaca tcgacagggg ccccgaggag aagccagccc aggaaagcaa ctaccggctg 1381 ctccgccgaa acaacagtta cacctgctac accgcagcca tttgtgggct gccagtgcac 1441 gccacctttc gagctgcgga ctcatcggcc ccagaggaca gtgagaagct ggtgggcgac 1501 accgtgtcct actccaagaa gaggctgcgc tacgacagct actcgagcta ctgtaacgcg 1561 gtggcagagg cggagatcga ggcggaggag ggcggcgtgg agatgaagct ggcgtcggag 1621 ctggccgacc ctgaccagcc gcgagaggac cctgcagagg aggagaagga ggagaaggac 1681 gcacccgagg ttcacctcct gttccatttc ctgcaggtcc tcaccgcctg tttcgggtcc 1741 tttgctcacg gcggcaatga cgtgagtaat gccatcggtc ccctggtagc cttgtggctg 1801 atttacaaac aaggcggggt aacgcaagaa gcagctacac ccgtctggct gctgttttat 1861 ggaggagttg gaatctgcac aggcctctgg gtctggggga gaagagtgat ccagaccatg 1921 gggaaggacc tcactcccat cacgccgtcc agcggcttca cgatcgagct ggcctcagcc 1981 ttcacagtgg tgatcgcctc caacatcggg cttccagtca gcaccacgca ctgtaaggtg 2041 ggctcggtgg tggccgtggg ctggatccgc tcccgcaagg ctgtggactg gcgcctcttt 2101 cggaacatct tcgtggcctg gttcgtgacc gtccctgtgg ctgggctgtt cagcgctgct 2161 gtcatggctc ttctcatgta tgggatcctt ccatatgtgt gatttgtctt cttccagctg 2221 caaacagcta aagggatggt ctggtgttgg cgtgtgggag acatgtgtgc tcgtgccgca 2281 catacacatc ctggccgtgc acggctctct catgaccagc tctctgcctc ccttccagga 2341 ggctccatcc cacactgttc acccaggctg cggagactca ccttcccgag ctaacttaac 2401 tactgtacat aataatatgt attaaactgg tatcgtggtg atataatgtg gtgcagttac 2461 ttatatatta aatatctatt gtatccatag aataggcagc attatttcaa acatattcaa 2521 gttgggagtg gagatcattg cctagaagtc aatattcaat aaatcttgta cataactatt 2581 tcgatggcaa atgttaagcc ttctaaaagg aaagtgtaga ttggaaaatg attttttttc 2641 caaatgatgt ttttgccttc taatatactg taaggtaatg agcttcagaa caggcaacct 2701 gaccctgcag aggtcgcgtg ctgtgggatg acagcgggac gggagctcac aagtgctttc 2761 actgaagatt tgttcatata ctgtgtattg attgttgtgt aatatatcat cattgctttt 2821 gtaaatacgt aaaactgtaa ttttttaatg gtgtgcttcc cttatacttt ttgatcagag 2881 aattttggaa agtaccaaag aagcagggga atcattggcc agtgttacgt tttcacattg 2941 tctgtctccc accctcactg atcacgcctg ccccagagca gtgtgtggcg gtgacaccgt 3001 cacccagcat gcgccacgcc gtcgtcccac cagcagtgcc accgccacca caccccagat 3061 cccacccacc ttgcagtggc tttcttgtca tcagagtaga gaatgcacag gtgttggtga 3121 gggcgtgtgg ctgagcacta catgtcaagt cagagtcagt ttctatccaa ttctc // LOCUS HUMGLYCOPR 3533 bp mRNA PRI 26-OCT-1993 DEFINITION Homo sapiens platelet membrane glycoprotein V mRNA, complete cds. ACCESSION L11238 NID g388759 KEYWORDS adhesive glycoprotein; leucine-rich glycoprotein; platelet membrane glycoprotein V. SOURCE Homo sapiens (library: Stratagene 944201) lung cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3533) AUTHORS Hickey,M.J., Hagen,F.S., Yagi,M. and Roth,G.J. TITLE Human platelet glycoprotein V: characterization of the polypeptide and the related Ib-V-IX receptor system of adhesive, leucine-rich glycoproteins JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90 (18), 8327-8331 (1993) MEDLINE 93391348 FEATURES Location/Qualifiers source 1..3533 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="WI38" /cell_type="fibroblast" /tissue_type="lung" /tissue_lib="Stratagene 944201" sig_peptide 73..120 CDS 73..1755 /note="putative" /codon_start=1 /product="platelet membrane glycoprotein V" /db_xref="PID:g388760" /translation="MLRGTLLCAVLGLLRAQPFPCPPACKCVFRDAAQCSGGDVARIS ALGLPTNLTHILLFGMGRGVLQSQSFSGMTVLQRLMISDSHISAVAPGTFSDLIKLKT LRLSRNKITHLPGALLDKMVLLEQLFLDHNALRGIDQNMFQKLVNLQELALNQNQLDF LPASLFTNLENLKLLDLSGNNLTHLPKGLLGAQAKLERLLLHSNRLVSLDSGLLNSLG ALTELQFHRNHIRSIAPGAFDRLPNLSSLTLSRNHLAFLPSALFLHSHNLTLLTLFEN PLAELPGVLFGEMGGLQELWLNRTQLRTLPAAAFRNLSRLRYLGVTLSPRLSALPQGA FQGLGELQVLALHSNGLTALPDGLLRGLGKLRQVSLRRNRLRALPRALFRNLSSLESV QLDHNQLETLPGDVFGALPRLTEVLLGHNSWRCDCGLGPFLGWLRQHLGLVGGEEPPR CAGPGAHAGLPLWALPGGDAECPGPRGPPPRPAADSSSEAPVHPALAPNSSEPWVWAQ PVTTGKGQDHSPFWGFYFLLLAVQAMITVIIVFAMIKIGQLFRKLIRERALG" mat_peptide 121..1752 /note="putative" /function="thrombin substrate" /product="platelet membrane glycoprotein V" repeat_region complement(2318..2612) /note="putative" /rpt_family="Alu" polyA_signal 3270..3275 BASE COUNT 661 a 1055 c 927 g 890 t ORIGIN 1 ggatcccagg gttcagtaca ggcgcgaacg ctcctgtgtg ttgaccacac tcccacggtt 61 gctttttcag acatgctgag ggggactcta ctgtgcgcgg tgctcgggct tctgcgcgcc 121 cagcccttcc cctgtccgcc agcttgcaag tgtgtcttcc gggacgccgc gcagtgctcg 181 gggggcgacg tggcgcgcat ctccgcgctg ggcctgccca ccaacctcac gcacatcctg 241 ctcttcggaa tgggccgcgg cgtcctgcag agccagagct tcagcggcat gaccgtcctg 301 cagcgcctca tgatctccga cagccacatt tccgccgttg cccccggcac cttcagtgac 361 ctgataaaac tgaaaaccct gaggctgtcg cgcaacaaaa tcacgcatct tccaggtgcg 421 ctgctggata agatggtgct cctggagcag ttgtttttgg accacaatgc gctaaggggc 481 attgaccaaa acatgtttca gaaactggtt aacctgcagg agctcgctct gaaccagaat 541 cagctcgatt tccttcctgc cagtctcttc acgaatctgg agaacctgaa gttgttggat 601 ttatcgggaa acaacctgac ccacctgccc aaggggttgc ttggagcaca ggctaagctc 661 gagagacttc tgctccactc gaaccgcctt gtgtctctgg attcggggct gttgaacagc 721 ctgggcgccc tgacggagct gcagttccac cgaaatcaca tccgttccat cgcacccggg 781 gccttcgacc ggctcccaaa cctcagttct ttgacgcttt cgagaaacca ccttgcgttt 841 ctcccctctg cgctctttct tcattcgcac aatctgactc tgttgactct gttcgagaac 901 ccgctggcag agctcccggg ggtgctcttc ggggagatgg ggggcctgca ggagctgtgg 961 ctgaaccgca cccagctgcg caccctgccc gccgccgcct tccgcaacct gagccgcctg 1021 cggtacttag gggtgactct gagcccgcgg ctgagcgcgc ttccgcaggg cgccttccag 1081 ggccttggcg agctccaggt gctcgccctg cactccaacg gcctgaccgc cctccccgac 1141 ggcttgctgc gcggcctcgg caagctgcgc caggtgtccc tgcgccgcaa caggctgcgc 1201 gccctgcccc gtgccctctt ccgcaatctc agcagcctgg agagcgtcca gctcgaccac 1261 aaccagctgg agaccctgcc tggcgacgtg tttggggctc tgccccggct gacggaggtc 1321 ctgttggggc acaactcctg gcgctgcgac tgtggcctgg ggcccttcct ggggtggctg 1381 cggcagcacc taggcctcgt gggcggggaa gagcccccac ggtgcgcagg ccctggggcg 1441 cacgccggcc tgccgctctg ggccctgccg gggggtgacg cggagtgccc gggcccccgg 1501 ggcccgcctc cccgccccgc tgcggacagc tcctcggaag cccctgtcca cccagccttg 1561 gctcccaaca gctcagaacc ctgggtgtgg gcccagccgg tgaccacggg caaaggtcaa 1621 gatcatagtc cgttctgggg gttttatttt ctgcttttag ctgttcaggc catgatcacc 1681 gtgatcatcg tgtttgctat gattaaaatt ggccaactct ttcgaaaatt aatcagagag 1741 agagcccttg ggtaaaccaa tgggaaaatc ttctaattac ttagaacctg accagatgtg 1801 gctcggaggg gaatccagac ccgctgctgt cttgctctcc ctcccctccc cactcctcct 1861 ctcttcttcc tcttctctct cactgccacg ccttcctttc cctcctcctc cccctctccg 1921 ctctgtgctc ttcattctca caggcccgca acccctcctc tctgtgtccc ccgcccgttc 1981 ctggaaactg agcttgacgt ttgtaaactg tggttgcctg ccttccccag ctcccacgcg 2041 ggtgtgcgct gacactgccg ggggcgctgg actgtgttgg acccatccgt gctccgctgt 2101 gcctggcttg gcgtctggtg gagagagggg cctcttcagt gtctactgag taaggggaca 2161 gctccaggcc ggggcctgtc tcctgcacag agtaagccgg taaatgtttg tgaaatcaat 2221 gcgtggataa aggaactcat gccatccaag tgatgatggc ttttcctgga gggaaaggat 2281 aggctgttgc tctatctaat tttttgtttt tgtttttgga cagtctagct ctgtggccca 2341 ggctggcgtg cagtgggccg tctcagttca ctgcagcctc cgcctcccag gttcaagtga 2401 ttctcatgcc tcagcgttct gagtagctgg gattagaggc gtgtgccact acacccggct 2461 aatttttgta ctttttaaag tagagacggg gctttgccat attggcctgg ctgatctcaa 2521 actcctggtc ttgaactcct ggccacaagt gatctgcccg ccttggcctc ccaaagtgct 2581 gggattacag gcgtaagcca ctacacctgg ccctcttcat cgaattttat ttgagaagta 2641 gagctcttgc cattttttcc cttgctccat ttttctcact ttatgtctct ctgacctatg 2701 ggctacttgg gagagcactg gactccattc atgcatgagc attttcagga taagcgactt 2761 ctgtgaggct gagagaggaa gaaaacacgg agccttccct ccaggtgccc agtgtaggtc 2821 cagcgtgttt cctgagcctc ctgtgagttt ccacttgctt tacatccatg caacatgtca 2881 ttttgaaact ggattgattt gcatttcctg gaactctgcc acctcatttc acaagcattt 2941 atggagcagt taacatgtga ctggtattca tgaatataat gataagcttg attctagttc 3001 agctgctgtc acagtctcat ttgttcttcc aactgaaagc cgtaaaacct ttgttgcttt 3061 aattgaatgt ctgtgcttat gagaggcagt ggttaaaaca ggggctggcg agttgacaac 3121 tgtgggttca aatcccagct ctaccactta ctaactgcat gggactttgg gtaagacacc 3181 tgcttacatt ctctaagcct tggtttcctg aaccttaaaa caggataaca tagtacctgc 3241 ttcgtagagt ttttgtgaga attaaaggca ataaagcata taatgactta gcccagcggc 3301 ctgcaggcaa tacatgttaa tgaatgttag ctattattac taaaggatga gcaattatta 3361 ttggcatcat gatttctaaa gaagagcttt gagttggtat ttttctctgt gtataagggt 3421 aagtccgaac tttctcagac tggaggttac attcacatca gtctgtcttc cctgcggatg 3481 gcctcagccc tgggtggcca gactctgtgc tcacaatcca gagcaatgga tcc // LOCUS HUMGLYPL 2828 bp mRNA PRI 08-NOV-1994 DEFINITION Human liver glycogen phosphorylase mRNA, complete cds. ACCESSION M14636 NID g183352 KEYWORDS glycogen phosphorylase; phosphorylase. SOURCE Human liver, cDNA to mRNA (library of A.DiLella and S.Woo), clones HL-[1-10] and (library of J.-H.Ou), clone HL-11. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2828) AUTHORS Newgard,C.B., Nakano,K., Hwang,P.K. and Fletterick,R.J. TITLE Sequence analysis of the cDNA encoding human liver glycogen phosphorylase reveals tissue-specific codon usage JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83 (21), 8132-8136 (1986) MEDLINE 87041414 COMMENT There is a polyadenylation signal sequence at positions 2806 to 2811. FEATURES Location/Qualifiers source 1..2828 /organism="Homo sapiens" /db_xref="taxon:9606" /map="14q11.2-q24.3" gene 114..2657 /gene="PYGL" CDS 114..2657 /gene="PYGL" /note="glycogen phosphorylase (EC 2.4.1.1)" /codon_start=1 /db_xref="GDB:G00-120-328" /db_xref="PID:g183353" /translation="MGEPLTDQEKRRQISIRGIVGVENVAELKKSFNRHLHFTLVKDR NVATTRDYYFALAHTVRDHLVGRWIRTQQHYYDKCPKREYYLSLEFYMGRTLQNTMIN LGLQNACDEAIYQLGLDIEELEEIEEDAGLGNGGLGRLAACFLDSMATLGLAAYGYGI RYEYGIFNQKIRDGWQVEEADDWLRYGNPWEKSRPEFMLPVHFYGKVEHTNTGTKWID TQVVLALPYDTPEPGYMNNTVNTMRLWSARAPNDFNLRDFNVGDYIQAVLDRNLAENI SRVLYPNDNFFEGKELRLKQEYFVVAATLQDIIRRFKASKFGSTRGQGTVFDAFPDQV AIQLNDTHPRIAIPELMRIFVDIEKLPWSKAWELNQKTFAYTNHTVLPEALERWPVDL VEKLLPRHLEIIYEINQKHLDRIVALFPKDVDPLRRMSLIEEEGSKRINMAHLCIVGS HAVNGVAKIHSDIVKTKVFKDFSELEPDKFQNKTNGITPRRWLLLCNPGLAELIAEKI GEDYVKDLSQLTKLHSFLGDDVFLRELAKVKQENKLKFSQFLETEYKVKINPSSMFDV QVKRIHEYKRQLLNCLHVITMYNRIKKDPKKLFVPRTVIIGGKAAPGYHMAKMIIKLI TSVADVVNNDPMVGSKLKVIFLENYRVSLAEKVIPATDLSEQISTAGTEASGTGNMKF MLNGALTIGTMDGANVEMAEEAGEENLFIFGMSIDDVAALDKKGYEAKEYYEALPELK LVIDQIDNGFFSPKQPDLFKDIINMLFYHDRFKVFADYEAYVKCQDKVSQLYMNPKAW NTMVLKNIAASGKFSSDRTIKEYAQNIWNVEPSDLKISLSNESNKVNGN" BASE COUNT 800 a 673 c 709 g 646 t ORIGIN 699 bp upstream of EcoRI site. 1 gttgaaagct cctggcgcgg cggggcggac tccacccctg cccggcagcc cagcgcctcc 61 ggccgcactt ccagctctct gcgcagcccg ccgcgcagcc cgccgcccca gccatgggcg 121 aaccgctgac agaccaggag aagcggcggc agatcagcat ccgcggcatc gtgggcgtgg 181 agaacgtggc agagctgaag aagagtttca accggcacct gcacttcacg ctggtcaagg 241 accgcaacgt ggccaccacc cgcgactact acttcgcgct ggcgcacacg gtgcgggacc 301 acctggtggg gcgctggatc cgcacgcagc agcactacta cgacaagtgc cccaagaggg 361 aatattacct ctctctggaa ttttacatgg gccgaacatt acagaacacc atgatcaacc 421 tcggtctgca aaatgcctgt gatgaggcca tttaccagct tggattggat atagaagagt 481 tagaagaaat tgaagaagat gctggacttg gcaatggtgg tcttgggaga cttgctgcct 541 gcttcttgga ttccatggca accctgggac ttgcagccta tggatacggc attcggtatg 601 aatatgggat tttcaatcag aagatccgag atggatggca ggtagaagaa gcagatgatt 661 ggctcagata tggaaaccct tgggagaagt cccgcccaga attcatgctg cctgtgcact 721 tctatggaaa agtagaacac accaacaccg ggaccaagtg gattgacact caagtggtcc 781 tggctctgcc atatgacacc cccgagcccg gctacatgaa taacactgtc aacaccatgc 841 gcctctggtc tgctcgggca ccaaatgact ttaacctcag agactttaat gttggagact 901 acattcaggc tgtgctggac cgaaacctgg ccgagaacat ctcccgggtc ctctatccca 961 atgacaattt ttttgaaggg aaggagctaa gattgaagca ggaatacttt gtggtggctg 1021 caaccttgca agatatcatc cgccgtttca aagcctccaa gtttggctcc acccgtggtc 1081 aaggaactgt gtttgatgcc ttcccggatc aggtggccat ccagctgaat gatactcacc 1141 ctcgcatcgc gatccctgag ctgatgagga tttttgtgga tattgaaaaa ctgccctggt 1201 ccaaggcatg ggagctcaac cagaagacct tcgcctacac caaccacaca gtgctcccgg 1261 aagccctgga gcgctggccc gtggacctgg tggagaagct gctccctcga catttggaaa 1321 tcatttatga gataaatcag aagcatttag atagaattgt ggccttgttt cctaaagatg 1381 tggaccctct gagaaggatg tctctgatag aagaggaagg aagcaaaagg atcaacatgg 1441 cccatctctg cattgtcggt tcccatgctg tgaatggcgt ggctaaaatc cactcagaca 1501 tcgtgaagac taaagtattc aaggacttca gtgagctaga acctgacaag tttcagaata 1561 aaaccaatgg gatcactcca aggcgctggc tcctactctg caacccagga cttgcagagc 1621 tcatagcaga gaaaattgga gaagactatg tgaaagacct gagccagctg acgaagctcc 1681 acagcttcct gggtgatgat gtcttcctcc gggaactcgc caaggtgaag caggagaata 1741 agctgaagtt ttctcagttc ctggagacgg agtacaaagt gaagatcaac ccatcctcca 1801 tgtttgatgt ccaggtgaag aggatacatg agtacaagcg acagctcttg aactgtctgc 1861 atgtgatcac gatgtacaac cgcattaaga aagaccctaa gaagttattc gtgccaagga 1921 cagttatcat tggtggtaaa gctgccccag gatatcacat ggccaaaatg atcataaagc 1981 tgatcacttc agtggcagat gtggtgaaca atgaccctat ggttggaagc aagttgaaag 2041 tcatcttctt ggagaactac agagtatctc ttgctgaaaa agtcattcca gccacagatc 2101 tgtcagagca gatttccact gcaggcaccg aagcctcggg gacaggcaat atgaagttca 2161 tgctaaatgg ggccctaact atcgggacca tggatggggc caatgtggaa atggcagaag 2221 aagctgggga agagaacctg ttcatctttg gcatgagcat agatgatgtg gctgctttgg 2281 acaagaaagg gtacgaggca aaagaatact atgaggcact tccagagctg aagctggtca 2341 ttgatcaaat tgacaatggc tttttttctc ccaagcagcc tgacctcttc aaagatatca 2401 tcaacatgct attttatcat gacaggttta aagtctttgc agactacgaa gcctatgtca 2461 agtgtcaaga taaagtgagt cagctgtaca tgaatccaaa ggcctggaac acaatggtac 2521 tcaaaaacat agctgcctcg gggaaattct ccagtgaccg aacaattaaa gaatatgccc 2581 aaaacatctg gaacgtggaa ccttcagatc taaagatttc tctatccaat gaatctaaca 2641 aagtcaatgg aaattgaact ctacaatgtc tctagaaaac atagcttctt actgaacttg 2701 aacattttta caacattcac tggtttttgt tttgttagct aataatctat aatagttgag 2761 tatctctggg aatggggagg gaaattatat gtaatagagc ttaaaaataa agtgtcaatt 2821 tccaagga // LOCUS HUMGLYSA 3531 bp mRNA PRI 13-FEB-1996 DEFINITION Human muscle glycogen synthase mRNA, complete cds. ACCESSION J04501 NID g183354 KEYWORDS glycogen synthase. SOURCE Homo sapiens fetus muscle cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3531) AUTHORS Browner,M.F. JOURNAL Unpublished (1989) REFERENCE 2 (bases 1 to 3531) AUTHORS Browner,M.F., Nakano,K., Bang,A.G. and Fletterick,R.J. TITLE Human muscle glycogen synthase cDNA sequence: a negatively charged protein with an asymmetric charge distribution JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86 (5), 1443-1447 (1989) MEDLINE 89160794 COMMENT Computer readable copy of sequence [2] kindly submitted by M.F. Browner 09-FEB-1989. FEATURES Location/Qualifiers source 1..3531 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="muscle" CDS 161..2374 /note="muscle glycogen synthase" /codon_start=1 /product="glycogen synthase" /db_xref="PID:g183355" /translation="MPLNRTLSMSSLPGLEDWEDEFDLENAVLFEVAWEVANKVGGIY TVLQTKAKVTGDEWGDNYFLVGPYTEQGVRTQVELLEAPTPALKRTLDSMNSKGCKVY FGRWLIEGGPLVVLLDVGASAWALERWKGELWDICNIGVPWYDREANDAVLFGFLTTW FLGEFLAQSEEKPHVVAHFHEWLAGVGLCLCRARRLPVATIFTTHATLLGRYLCAGAV DFYNNLENFNVDKEAGERQIYHRYCMERAAAHCAHVFTTVSQITAIEAQHLLKRKPDI VTPNGLNVKKFSAMHEFQNLHAQSKARIQEFVRGHFYGHLDFNLDKTLYFFIAGRYEF SNKGADVFLEALARLNYLLRVNGSEQTVVAFFIMPARTNNFNVETLKGQAVRKQLWDT ANTVKEKFGRKLYESLLVGSLPDMNKMLDKEDFTMMKRAIFATQRQSFPPVCTHNMLD DSSDPILTTIRRIGLFNSSADRVKVIFHPEFLSSTSPLLPVDYEEFVRGCHLGVFPSY YEPWGYTPAECTVMGIPSISTNLSGFGCFMEEHIADPSAYGIYILDRRFRSLDDSCSQ LTSFLYSFCQQSRRQRIIQRNRTERLSDLLDWKYLGRYYMSARHMALSKAFPEHFTYE PNEADAAQGYRYPRPASVPPSPSLSRHSSPHQSEDEEDPRNGPLEEDGERYDEDEEAA KDRRNIRAPEWPRRASCTSSTSGRKRNSVDTATSSSLSTPSEPLSPTSSLGEERN" old_sequence 676..678 /citation=[2] old_sequence 706..707 /citation=[2] BASE COUNT 721 a 1096 c 973 g 741 t ORIGIN 1 cgcttcgggc aggggtgcgg tcttgcaata ggaagccgag cgtcttgcaa gcttcccgtc 61 ggcaccagct actcggcccc gcaccctacc tggtgcattc cctagacacc tccggggtcc 121 ctacctggag atccccggag ccccccttcc tgcgccagcc atgcctttaa accgcacttt 181 gtccatgtcc tcactgccag gactggagga ctgggaggat gaattcgacc tggagaacgc 241 agtgctcttc gaagtggcct gggaggtggc taacaaggtg ggtggcatct acacggtgct 301 gcagacgaag gcgaaggtga caggggacga atggggcgac aactacttcc tggtggggcc 361 gtacacggag cagggcgtca ggacccaggt ggaactgctg gaggccccca ccccggccct 421 gaagaggaca ctggattcca tgaacagcaa gggctgcaag gtgtatttcg ggcgctggct 481 gatcgaggga ggccctctgg tggtgctcct ggacgtgggt gcctcagctt gggccctgga 541 gcgctggaag ggagagctct gggatatctg caacatcgga gtgccgtggt acgaccgcga 601 ggccaacgac gctgtcctct ttggctttct gaccacctgg ttcctgggtg agttcctggc 661 acagagtgag gagaagccac atgtggttgc tcacttccat gagtggttgg caggcgttgg 721 actctgcctg tgtcgtgccc ggcgactgcc tgtagcaacc atcttcacca cccatgccac 781 gctgctgggg cgctacctgt gtgccggtgc cgtggacttc tacaacaacc tggagaactt 841 caacgtggac aaggaagcag gggagaggca gatctaccac cgatactgca tggaaagggc 901 ggcagcccac tgcgctcacg tcttcactac tgtgtcccag atcaccgcca tcgaggcaca 961 gcacttgctc aagaggaaac cagatattgt gacccccaat gggctgaatg tgaagaagtt 1021 ttctgccatg catgagttcc agaacctcca tgctcagagc aaggctcgaa tccaggagtt 1081 tgtgcggggc catttttatg ggcatctgga cttcaacttg gacaagacct tatacttctt 1141 tatcgccggc cgctatgagt tctccaacaa gggtgctgac gtctttctgg aggcattggc 1201 tcggctcaac tatctgctca gagtgaacgg cagcgagcag acagtggttg ccttcttcat 1261 catgccagcg cggaccaaca atttcaacgt ggaaaccctc aaaggccaag ctgtgcgcaa 1321 acagctttgg gacacggcca acacggtgaa ggaaaagttc gggaggaagc tttatgaatc 1381 cttactggtt gggagccttc ccgacatgaa caagatgctg gataaggaag acttcactat 1441 gatgaagaga gccatctttg caacgcagcg gcagtctttc ccccctgtgt gcacccacaa 1501 tatgctggat gactcctcag accccatcct gaccaccatc cgccgaatcg gcctcttcaa 1561 tagcagtgcc gacagggtga aggtgatttt ccacccggag ttcctctcct ccacaagccc 1621 cctgctccct gtggactatg aggagtttgt ccgtggctgt caccttggag tcttcccctc 1681 ctactatgag ccttggggct acacaccggc tgagtgcacg gttatgggaa tccccagtat 1741 ctccaccaat ctctccggct tcggctgctt catggaggaa cacatcgcag acccctcagc 1801 ttacggtatc tacattcttg accggcggtt ccgcagcctg gatgattcct gctcgcagct 1861 cacctccttc ctctacagtt tctgtcagca gagccggcgg cagcgtatca tccagcggaa 1921 ccgcacggag cgcctctccg accttctgga ctggaaatac ctaggccggt actatatgtc 1981 tgcgcgccac atggcgctgt ccaaggcctt tccagagcac ttcacctacg agcccaacga 2041 ggcggatgcg gcccaggggt accgctaccc acggccagcc tcggtgccac cgtcgccctc 2101 gctgtcacga cactccagcc cgcaccagag tgaggacgag gaggatcccc ggaacgggcc 2161 gctggaggaa gacggcgagc gctacgatga ggacgaggag gccgccaagg accggcgcaa 2221 catccgtgca ccagagtggc cgcgccgagc gtcctgcacc tcctccacca gcggccgcaa 2281 gcgcaactct gtggacacgg ccacctccag ctcactcagc accccgagcg agcccctcag 2341 ccccaccagc tccctgggcg aggagcgtaa ctaagtccgc cccaccacac tccccgcctg 2401 tcctgcctct ctgctccaga gagaggatgc agaggggtgc tgctcctaaa cccccgatcc 2461 agatctgcac ggggtgcggc cccgcagtgc ccccacccag tccgccaaac actccacccc 2521 ctccagctcc agtttccaag ttcctgcact cctgaatcca caaagccgtg cctttctctg 2581 gctccagaat atgcataatc agcgccctgg agtcccctgg gcctggaccg cttcccagag 2641 gccaggaaat ctgccattac tctgcggtgg tgccagaggt tttaggaaac ctggcatggt 2701 gctttcaggt ctggggcttt tagagccccc cgtgtggctt acaaattcta cagcatacag 2761 agcaggccac gctcaggccc ggcatgcggg ccaccaagtt ctggaaacca cgtggtgtcc 2821 ctgcgaatgg ggcgatcaag tccagagccg gggcactttc agagtttgaa ggtaactgag 2881 agcagatggt cctccatttc aactccagaa gtggggctct gggagggatg ttctagccct 2941 ccctggctgt cagagccagg ctctgcctgg aggatccctc catccggctc ctgtcatccc 3001 ctacactttg gccaagcaag aggtggtaga accacttggc tgctcattcc ttctggagga 3061 cacacagtct cagtccagat gccttcctgt ctttctggtc ctttctggac cagatcctac 3121 tcttcctttc taaatctgag atctccctcc agggaatccg cctgcagagg acagagctgg 3181 ctgtcttccc ccacccctaa cctggcttat tcccaactgc tctgcccact gtgaaaccac 3241 taggttctag gtcctggctt ctagatctgg aaccttacca cgttactgca tactgatccc 3301 tttcccatga tccagaactg aggtcactgg gttctagaac ccccacattt acctcgaggc 3361 tcttccatcc ccaaactgtg ccctgccttc agctttggtg aaagggaggg cccctcatgt 3421 gtgctgtgct gtgtctgcac cgcttggttt gcagttgaga ggggagggca ggaggggtgt 3481 gattggagtg tgtccggaga tgagatgaaa aaaatacatc tatatttaag a // LOCUS HUMGLYSYN 2169 bp mRNA PRI 22-FEB-1995 DEFINITION Homo sapiens glycogen synthase kinase 3 mRNA, complete cds. ACCESSION L40027 NID g682744 KEYWORDS glycogen synthase kinase 3; protein kinase. SOURCE Homo sapiens male foreskin cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2169) AUTHORS He,X., Saint-Jeannet,J.-P., Woodgett,J.R., Varmus,H.E. and Dawid,I.B. TITLE Glycogen synthase kinase 3 and dorsoventral patterning in Xenopus embryos JOURNAL Nature (1995) In press FEATURES Location/Qualifiers source 1..2169 /organism="Homo sapiens" /note="Cloning Vector: lambda ZAP" /db_xref="taxon:9606" /cell_type="fibroblast" /sex="male" /tissue_type="foreskin" mRNA 1..2169 CDS 115..1566 /codon_start=1 /product="glycogen synthase kinase 3" /db_xref="PID:g682745" /translation="MSGGGPSGGGPGGSGRARTSSFAEPGGGGGGGGGGPGGSASGPG GTGGGKASVGAMGGGVGASSSGGGPGGSGGGGSGGPGAGTSFPPPGVKLGRDSGKVTT VVATLGQGPERSQEVAYTDIKVIGNGSFGVVYQARLAETRELVAIKKVLQDKRFKNRE LQIMRKLDHCNIVRLRYFFYSSGEKKDELYLNLVLEYVPETVYRVARHFTKAKLTIPI LYVKVYMYQLFRSLAYIHSQGVCHRDIKPQNLLVDPDTAVLKLCDFGSAKQLVRGEPN VSYICSRYYRAPELIFGATDYTSSIDVWSAGCVLAELLLGQPIFPGDSGVDQLVEIIK VLGTPTREQIREMNPNYTEFKFPQIKAHPWTKVFKSRTPPEAIALCSSLLEYTPSSRL SPLEACAHSFFDELRCLGTQLPNNRPLPPLFNFSAGELSIQPSLNAILIPPHLRSPSG TTTLTPSSQALTETPTSSDWQSTDATPTLTNSS" BASE COUNT 428 a 714 c 614 g 413 t ORIGIN 1 gccagagcgg cgcggcctgg aagaggccag ggcccggggg aggcgacggc agcggcggcg 61 gctggggcag cccgggcagc ccgagccccg cagcctgggc ctgtgctcgg cgccatgagc 121 ggcggcgggc cttcgggagg cggccctggg ggctcgggca gggcgcggac tagctcgttc 181 gcggagcccg gcggcggagg cggaggaggc ggcggcggcc ccggaggctc ggcctccggc 241 ccaggcggca ccggcggcgg aaaggcatct gtcggggcca tgggtggggg cgtcggggcc 301 tcgagctccg ggggtggacc cggcggcagc ggcggaggag gcagcggagg ccccggcgca 361 ggcactagct tcccgccgcc cggggtgaag ctgggccgtg acagcgggaa ggtgaccaca 421 gtcgtagcca ctctaggcca aggcccagag cgctcccaag aagtggctta cacggacatc 481 aaagtgattg gcaatggctc atttggggtc gtgtaccagg cacggctggc agagaccagg 541 gaactagtcg ccatcaagaa ggttctccag gacaagaggt tcaagaaccg agagctgcag 601 atcatgcgta agctggacca ctgcaatatt gtgaggctga gatacttttt ctactccagt 661 ggcgagaaga aagacgagct ttacctaaat ctggtgctgg aatatgtgcc cgagacagtg 721 taccgggtgg cccgccactt caccaaggcc aagttgacca tccctatcct ctatgtcaag 781 gtgtacatgt accagctctt ccgcagcttg gcctacatcc actcccaggg cgtgtgtcac 841 cgcgacatca agccccagaa cctgctggtg gaccctgaca ctgctgtcct caagctctgc 901 gattttggca gtgcaaagca gttggtccga ggggagccca atgtctccta catctgttct 961 cgctactacc gggccccaga gctcatcttt ggagccactg attacacctc atccatcgat 1021 gtttggtcag ctggctgtgt actggcagag ctcctcttgg gccagcccat cttccctggg 1081 gacagtgggg tggaccagct ggtggagatc atcaaggtgc tgggaacacc aacccgggaa 1141 caaatccgag agatgaaccc caactacacg gagttcaagt tccctcagat taaagctcac 1201 ccctggacaa aggtgttcaa atctcgaacg ccgccagagg ccatcgcgct ctgctctagc 1261 ctgctggagt acaccccatc ctcaaggctc tccccactag aggcctgtgc gcacagcttc 1321 tttgatgaac tgcgatgtct gggaacccag ctgcctaaca accgcccact tccccctctc 1381 ttcaacttca gtgctggtga actctccatc caaccgtctc tcaacgccat tctcatccct 1441 cctcacttga ggtcccccag cggcactacc accctcaccc cgtcctcaca agctttaact 1501 gagactccga ccagctcaga ctggcagtcg accgatgcca cacctaccct cactaactcc 1561 tcctgagggc cccaccaagc acccttccac ttccatctgg gagccccaag agggcgtggg 1621 aaggggggcc atagcccatc aagctcctgc cctggctggg cccctagact agagggcaga 1681 ggtaaatgag tccctgtccc cacctccagt ccctccctca ccagcctcac ccctgtggtg 1741 ggctttttaa gaggatttta actggttgtg gggagggaag agaaggacag ggtgttgggg 1801 ggatgaggac ctcctacccc cttggccccc tcccctcccc cagacctcca cctcctccag 1861 accccctccc ctcctgtgtc ccttgtaaat agaaccagcc cagcccgtct cctcttccct 1921 tccctggccc ccgggtgtaa atagattgtt ataatttttt tcttaaagaa aacgtcgatt 1981 cgcaccgtcc aacctgcccc gcccctccta cagctgtaac tcccctcctg tcctctgccc 2041 ccaaggtcta ctccctcctc accccaccct ggagggccag gggagtggag agagctcctg 2101 atgtcttagt ttccacagta aggtttgcct gtgtacagac ctccgttcaa taaattattg 2161 gcatgaaaa // LOCUS HUMGMCSFRB 2996 bp mRNA PRI 16-MAY-1994 DEFINITION Human GM-CSF receptor beta chain mRNA, complete cds. ACCESSION M59941 M38275 NID g487424 KEYWORDS GM-CSF receptor; cytokine receptor; growth factor receptor; lymphokine receptor. SOURCE Human cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2996) AUTHORS Hayashida,K., Kitamura,T., Gorman,D.M., Arai,K., Yokota,T. and Miyajima,A. TITLE Molecular cloning of a second subunit of the receptor for human granulocyte-macrophage colony-stimulating factor (GM-CSF): reconstitution of a high-affinity GM-CSF receptor JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87 (24), 9655-9659 (1990) MEDLINE 91088571 REFERENCE 2 (bases 1 to 2996) AUTHORS Kitamura,T. TITLE Direct Submission JOURNAL Submitted (06-FEB-1991) Toshio Kitamura, Department of Molecular Biology, DNAX Research Institute of Molecular and Cellular Biology, 901 California Avenue, Palo Alto, CA 94304-1104, USA FEATURES Location/Qualifiers source 1..2996 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 29..172 CDS 29..2722 /codon_start=1 /product="GM-CSF receptor beta chain" /db_xref="PID:g487425" /translation="MVLAQGLLSMALLALCWERSLAGAEETIPLQTLRCYNDYTSHIT CRWADTQDAQRLVNVTLIRRVNEDLLEPVSCDLSDDMPWSACPHPRCVPRRCVIPCQS FVVTDVDYFSFQPDRPLGTRLTVTLTQHVQPPEPRDLQISTDQDHFLLTWSVALGSPQ SHWLSPGDLEFEVVYKRLQDSWEDAAILLSNTSQATLGPEHLMPSSTYVARVRTRLAP GSRLSGRPSKWSPEVCWDSQPGDEAQPQNLECFFDGAAVLSCSWEVRKEVASSVSFGL FYKPSPDAGEEECSPVLREGLGSLHTRHHCQIPVPDPATHGQYIVSVQPRRAEKHIKS SVNIQMAPPSLNVTKDGDSYSLRWETMKMRYEHIDHTFEIQYRKDTATWKDSKTETLQ NAHSMALPALEPSTRYWARVRVRTSRTGYNGIWSEWSEARSWDTESVLPMWVLALIVI FLTIAVLLALRFCGIYGYRLRRKWEEKIPNPSKSHLFQNGSAELWPPGSMSAFTSGSP PHQGPWGSRFPELEGVFPVGFGDSEVSPLTIEDPKHVCDPPSGPDTTPAASDLPTEQP PSPQPGPPAASHTPEKQASSFDFNGPYLGPPHSRSLPDILGQPEPPQEGGSQKSPPPG SLEYLCLPAGGQVQLVPLAQAMGPGQAVEVERRPSQGAAGSPSLESGGGPAPPALGPR VGGQDQKDSPVAIPMSSGDTEDPGVASGYVSSADLVFTPNSGASSVSLVPSLGLPSDQ TPSLCPGLASGPPGAPGPVKSGFEGYVELPPIEGRSPRSPRNNPVPPEAKSPVLNPGE RPADVSPTSPQPEGLLVLQQVGDYCFLPGLGPGPLSLRSKPSSPGPGPEIKNLDQAFQ VKKPPGQAVPQVPVIQLFKALKQQDYLSLPPWEVNKPGEVC" mat_peptide 173..2719 /product="GM-CSF receptor beta chain" BASE COUNT 591 a 1017 c 857 g 531 t ORIGIN 1 gcctgcctgt ccagagctga ccagggagat ggtgctggcc caggggctgc tctccatggc 61 cctgctggcc ctgtgctggg agcgcagcct ggcaggggca gaagaaacca tcccgctgca 121 gaccctgcgc tgctacaacg actacaccag ccacatcacc tgcaggtggg cagacaccca 181 ggatgcccag cggctcgtca acgtgaccct cattcgccgg gtgaatgagg acctcctgga 241 gccagtgtcc tgtgacctca gtgatgacat gccctggtca gcctgccccc atccccgctg 301 cgtgcccagg agatgtgtca ttccctgcca gagttttgtc gtcactgacg ttgactactt 361 ctcattccaa ccagacaggc ctctgggcac ccggctcacc gtcactctga cccagcatgt 421 ccagcctcct gagcccaggg acctgcagat cagcaccgac caggaccact tcctgctgac 481 ctggagtgtg gcccttggga gtccccagag ccactggttg tccccagggg atctggagtt 541 tgaggtggtc tacaagcggc ttcaggactc ttgggaggac gcagccatcc tcctctccaa 601 cacctcccag gccaccctgg ggccagagca cctcatgccc agcagcacct acgtggcccg 661 agtacggacc cgcctggccc caggttctcg gctctcagga cgtcccagca agtggagccc 721 agaggtttgc tgggactccc agccagggga tgaggcccag ccccagaacc tggagtgctt 781 ctttgacggg gccgccgtgc tcagctgctc ctgggaggtg aggaaggagg tggccagctc 841 ggtctccttt ggcctattct acaagcccag cccagatgca ggggaggaag agtgctcccc 901 agtgctgagg gaggggctcg gcagcctcca caccaggcac cactgccaga ttcccgtgcc 961 cgaccccgcg acccacggcc aatacatcgt ctctgttcag ccaaggaggg cagagaaaca 1021 cataaagagc tcagtgaaca tccagatggc ccctccatcc ctcaacgtga ccaaggatgg 1081 agacagctac agcctgcgct gggaaacaat gaaaatgcga tacgaacaca tagaccacac 1141 atttgagatc cagtacagga aagacacggc cacgtggaag gacagcaaga ccgagaccct 1201 ccagaacgcc cacagcatgg ccctgccagc cctggagccc tccaccaggt actgggccag 1261 ggtgagggtc aggacctccc gcaccggcta caacgggatc tggagcgagt ggagtgaggc 1321 gcgctcctgg gacaccgagt cggtgctgcc tatgtgggtg ctggccctca tcgtgatctt 1381 cctcaccatc gctgtgctcc tggccctccg cttctgtggc atctacgggt acaggctgcg 1441 cagaaagtgg gaggagaaga tccccaaccc cagcaagagc cacctgttcc agaacgggag 1501 cgcagagctt tggcccccag gcagcatgtc ggccttcact agcgggagtc ccccacacca 1561 ggggccgtgg ggcagccgct tccctgagct ggagggggtg ttccctgtag gattcgggga 1621 cagcgaggtg tcacctctca ccatagagga ccccaagcat gtctgtgatc caccatctgg 1681 gcctgacacg actccagctg cctcagatct acccacagag cagcccccca gcccccagcc 1741 aggcccgcct gccgcctccc acacacctga gaaacaggct tccagctttg acttcaatgg 1801 gccctacctg gggccgcccc acagccgctc cctacctgac atcctgggcc agccggagcc 1861 cccacaggag ggtgggagcc agaagtcccc acctccaggg tccctggagt acctgtgtct 1921 gcctgctggg gggcaggtgc aactggtccc tctggcccag gcgatgggac cgggacaggc 1981 cgtggaagtg gagagaaggc cgagccaggg ggctgcaggg agtccctccc tggagtccgg 2041 gggaggccct gcccctcctg ctcttgggcc aagggtggga ggacaggacc aaaaggacag 2101 ccctgtggct atacccatga gctctgggga cactgaggac cctggagtgg cctctggtta 2161 tgtctcctct gcagacctgg tattcacccc aaactcaggg gcctcgtctg tctccctagt 2221 tccctctctg ggcctcccct cagaccagac ccccagctta tgtcctgggc tggccagtgg 2281 accccctgga gccccaggcc ctgtgaagtc agggtttgag ggctatgtgg agctccctcc 2341 aattgagggc cggtccccca ggtcaccaag gaacaatcct gtcccccctg aggccaaaag 2401 ccctgtcctg aacccagggg aacgcccggc agatgtgtcc ccaacatccc cacagcccga 2461 gggcctcctt gtcctgcagc aagtgggcga ctattgcttc ctccccggcc tggggcccgg 2521 ccctctctcg ctccggagta aaccttcttc cccgggaccc ggtcctgaga tcaagaacct 2581 agaccaggct tttcaagtca agaagccccc aggccaggct gtgccccagg tgcccgtcat 2641 tcagctcttc aaagccctga agcagcagga ctacctgtct ctgccccctt gggaggtcaa 2701 caagcctggg gaggtgtgtt gagaccccca ggcctagaca ggcaagggga tggagagggc 2761 ttgccttccc tcccgcctga ccttcctcag tcatttctgc aaagccaagg ggcagcctcc 2821 tgtcaaggta gctagaggcc tgggaaagga gatagccttg ctccggcccc cttgaccttc 2881 agcaaatcac ttctctccct gcgctcacac agacacacac acacacacgt acatgcacac 2941 atttttcctg tcaggttaac ttatttgtag gttctgcatt attagaactt tctaga // LOCUS HUMGMFB 700 bp mRNA PRI 31-DEC-1994 DEFINITION Human glia maturation factor beta mRNA, complete cds. ACCESSION M86492 M31742 NID g183369 KEYWORDS glia maturation factor. SOURCE Homo sapiens (tissue library: ATCC 37432) neonatal brain stem cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 700) AUTHORS Kaplan,R., Zaheer,A., Jaye,M. and Lim,R. TITLE Molecular cloning and expression of biologically active human glia maturation factor-beta JOURNAL J. Neurochem. 57 (2), 483-490 (1991) MEDLINE 91303115 FEATURES Location/Qualifiers source 1..700 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="neonatal" /tissue_type="brain stem" /tissue_lib="ATCC 37432" mRNA 13..700 /gene="glia maturation factor" gene 13..700 /gene="glia maturation factor" CDS 41..469 /gene="glia maturation factor" /codon_start=1 /product="glia maturation factor beta" /db_xref="PID:g183370" /translation="MSESLVVCDVAEDLVEKLRKFRFRKETNNAAIIMKIDKDKRLVV LDEELEGISPDELKDELPERQPRFIVYSYKYQHDDGRVSYPLCFIFSSPVGCKPEQQM MYAGSKNKLVQTAELTKVFEIRNTEDLTEEWLREKLGFFH" BASE COUNT 224 a 110 c 153 g 213 t ORIGIN 1 gaattcgggg ggcgacaggc cgctgacggc cggaaggaaa atgagtgagt ctttggttgt 61 ttgtgatgtt gccgaagatt tagtggaaaa gctgagaaag tttcgttttc gcaaagaaac 121 gaacaacgct gctattataa tgaagattga caaggataaa cgcctggtgg tactggatga 181 ggagcttgag ggcatttcac cagatgaact taaagatgaa ctacctgaac gacaacctcg 241 cttcattgtg tatagttata aatatcaaca tgatgatgga agagtttcat atcctctgtg 301 ctttattttc tccagtcctg ttggatgtaa gcctgaacaa cagatgatgt atgctggaag 361 taagaataag ctagtccaga cagctgaact aaccaaggta tttgaaataa gaaataccga 421 agacctaact gaagaatggt tacgtgagaa acttggattt tttcactaat gtgaacttct 481 gtgtttctaa agtatttatg tattaacctg accatactgg aatcagacat aaatacttat 541 ttatgcctaa aaatgcactg ttacttacag tttgtttcct gcagtaaaga aaaattcttc 601 atttgtgcaa aatttgaaca aagaggaaat catcttcata gtaatgaaac tttgtaaagt 661 gtttccttat attggtaatt gttaggtgga ctacttttcc // LOCUS HUMGMP140 3142 bp mRNA PRI 15-JUN-1990 DEFINITION Human granule membrane protein-140 mRNA, complete cds. ACCESSION M25322 NID g183390 KEYWORDS granule membrane protein. SOURCE Human umbilical vein endothelium and platelets, cDNA to mRNA, clones lambda-GMPE[1-4]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3142) AUTHORS Johnston,G.I., Cook,R.G. and McEver,R.P. TITLE Cloning of GMP-140, a granule membrane protein of platelets and endothelium: Sequence similarity to proteins involved in cell adhesion and inflammation JOURNAL Cell 56, 1033-1044 (1989) MEDLINE 89168432 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.P.McEver, 22-FEB-1990. FEATURES Location/Qualifiers source 1..3142 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..3142 /note="GMP-140 mRNA" sig_peptide 39..161 /note="granule membrane protein-140 signal peptide" CDS 39..2531 /note="granule membrane protein-140 (GMP-140) precursor" /codon_start=1 /db_xref="PID:g183391" /translation="MANCQIAILYQRFQRVVFGISQLLCFSALISELTNQKEVAAWTY HYSTKAYSWNISRKYCQNRYTDLVAIQNKNEIDYLNKVLPYYSSYYWIGIRKNNKTWT WVGTKKALTNEAENWADNEPNNKRNNEDCVEIYIKSPSAPGKWNDEHCLKKKHALCYT ASCQDMSCSKQGECLETIGNYTCSCYPGFYGPECEYVRECGELELPQHVLMNCSHPLG NFSFNSQCSFHCTDGYQVNGPSKLECLASGIWTNKPPQCLAAQCPPLKIPERGNMICL HSAKAFQHQSSCSFSCEEGFALVGPEVVQCTASGVWTAPAPVCKAVQCQHLEAPSEGT MDCVHPLTAFAYGSSCKFECQPGYRVRGLDMLRCIDSGHWSAPLPTCEAISCEPLESP VHGSMDCSPSLRAFQYDTNCSFRCAEGFMLRGADIVRCDNLGQWTAPAPVCQALQCQD LPVPNEARVNCSHPFGAFRYQSVCSFTCNEGLLLVGASVLQCLATGNWNSVPPECQAI PCTPLLSPQNGTMTCVQPLGSSSYKSTCQFICDEGYSLSGPERLDCTRSGRWTDSPPM CEAIKCPELFAPEQGSLDCSDTRGEFNVGSTCHFSCNNGFKLEGPNNVECTTSGRWSA TPPTCKGIASLPTPGLQCPALTTPGQGTMYCRHHPGTFGFNTTCYFGCNAGFTLIGDS TLSCRPSGQWTAVTPACRAVKCSELHVNKPIAMNCSNLWGNFSYGSICSFHCLEGQLL NGSAQTACQENGHWSTTVPTCQAGPLTIQEALTYFGGAVASTIGLIMGGTLLALLRKR FRQKDDGKCPLNPHSHLGTYGVFTNAAFDPSP" mat_peptide 162..2528 /note="granule membrane protein-140" repeat_region 628..813 /note="repeat 1" variation 684 /note="c or a" repeat_region 814..999 /note="repeat 2" variation 859 /note="t or c" repeat_region 1000..1185 /note="repeat 3" variation 1030 /note="g or a" variation 1088 /note="c or t" repeat_region 1186..1371 /note="repeat 4" repeat_region 1372..1557 /note="repeat 5" repeat_region 1558..1743 /note="repeat 6" repeat_region 1744..1929 /note="repeat 7" variation 1744..1929 /note="deleted in clone lambda-GMPE1" variation 1832 /note="t or c" variation 1845 /note="a or g" variation 1850 /note="t or c" repeat_region 1930..2139 /note="repeat 8" variation 1956 /note="t or g" repeat_region 2140..2325 /note="repeat 9" variation 2326..2445 /note="deleted in clones lambda-GMPE[2,3]" BASE COUNT 835 a 812 c 741 g 754 t ORIGIN Chromosome 1q21-q24. 1 tgggcagaag gcagaaaacc agcagagtca cagaggagat ggccaactgc caaatagcca 61 tcttgtacca gagattccag agagtggtct ttggaatttc ccaactcctt tgcttcagtg 121 ccctgatctc tgaactaaca aaccagaaag aagtggcagc atggacttat cattacagca 181 caaaagcata ctcatggaat atttcccgta aatactgcca gaatcgctac acagacttag 241 tggccatcca gaataaaaat gaaattgatt acctcaataa ggtcctaccc tactacagct 301 cctactactg gattgggatc cgaaagaaca ataagacatg gacatgggtg ggaaccaaaa 361 aggctctcac caacgaggct gagaactggg ctgataatga acctaacaac aaaaggaaca 421 acgaggactg cgtggagata tacatcaaga gtccgtcagc ccctggcaag tggaatgatg 481 agcactgctt gaagaaaaag cacgcattgt gttacacagc ctcctgccag gacatgtcct 541 gcagcaaaca aggagagtgc ctcgagacca tcgggaacta cacctgctcc tgttaccctg 601 gattctatgg gccagaatgt gaatacgtga gagagtgtgg agaacttgag ctccctcaac 661 acgtgctcat gaactgcagc caccctctgg gaaacttctc ttttaactcg cagtgcagct 721 tccactgcac tgacgggtac caagtaaatg ggcccagcaa gctggaatgc ttggcttctg 781 gaatctggac aaataagcct ccacagtgtt tagctgccca gtgcccaccc ctgaagattc 841 ctgaacgagg aaacatgatc tgccttcatt ctgcaaaagc attccagcat cagtctagct 901 gcagcttcag ttgtgaagag ggatttgcat tagttggacc ggaagtggtg caatgcacag 961 cctcgggggt atggacagcc ccagccccag tgtgtaaagc tgtgcagtgt cagcacctgg 1021 aagcccccag tgaaggaacc atggactgtg ttcatccgct cactgctttt gcctatggct 1081 ccagctgcaa atttgagtgc cagcccggct acagagtgag gggcttggac atgctccgct 1141 gcattgactc tggacactgg tctgcaccct tgccaacctg tgaggctatt tcgtgtgagc 1201 cgctggagag tcctgtccac ggaagcatgg attgctctcc atccttgaga gcgtttcagt 1261 atgacaccaa ctgtagcttc cgctgtgctg aaggtttcat gctgagagga gccgatatag 1321 ttcggtgtga taacttggga cagtggacag caccagcccc agtctgtcaa gctttgcagt 1381 gccaggatct cccagttcca aatgaggccc gggtgaactg ctcccacccc ttcggtgcct 1441 ttaggtacca gtcagtctgc agcttcacct gcaatgaagg cttgctcctg gtgggagcaa 1501 gtgtgctaca gtgcttggct actggaaact ggaattctgt tcctccagaa tgccaagcca 1561 ttccctgcac acctttgcta agccctcaga atggaacaat gacctgtgtt caacctcttg 1621 gaagttccag ttataaatcc acatgtcaat tcatctgtga cgagggatat tctttgtctg 1681 gaccagaaag attggattgt actcgatcgg gacgctggac agactcccca ccaatgtgtg 1741 aagccatcaa gtgcccagaa ctctttgccc cagagcaggg cagcctggat tgttctgaca 1801 ctcgtggaga attcaatgtt ggctccacct gtcatttctc ttgtaacaat ggctttaagc 1861 tggaggggcc caataatgtg gaatgcacaa cttctggaag atggtcagct actccaccaa 1921 cctgcaaagg catagcatca cttcctactc cagggttgca atgtccagcc ctcaccactc 1981 ctgggcaggg aaccatgtac tgtaggcatc atccgggaac ctttggtttt aataccactt 2041 gttactttgg ctgcaacgct ggattcacac tcataggaga cagcactctc agctgcagac 2101 cttcaggaca atggacagca gtaactccag catgcagagc tgtgaaatgc tcagaactac 2161 atgttaataa gccaatagcg atgaactgct ccaacctctg gggaaacttc agttatggat 2221 caatctgctc tttccattgt ctagagggcc agttacttaa tggctctgca caaacagcat 2281 gccaagagaa tggccactgg tcaactaccg tgccaacctg ccaagcagga ccattgacta 2341 tccaggaagc cctgacttac tttggtggag cggtggcttc tacaataggt ctgataatgg 2401 gtgggacgct cctggctttg ctaagaaagc gtttcagaca aaaagatgat gggaaatgcc 2461 ccttgaatcc tcacagccac ctaggaacat atggagtttt tacaaacgct gcatttgacc 2521 cgagtcctta aggtttccat aaacacccat gaatcaaaga catggaatta ccttagatta 2581 gctctggacc agcctgttgg acccgctctg gaccaaccct gtttcctgag tttgggattg 2641 tggtacaatc tcaaattctc aacctaccac cccttcctgt cccacctctt ctcttcctgt 2701 aacacaagcc acagaagcca ggagcaaatg tttctgcagt agtctctgtg ctttgactca 2761 cctgttactt gaaataccag tgaaccaaag agactggagc atctgactca caagaagacc 2821 agactgtgga gaaataaaaa tacctcttta ttttttgatt gaaggaaggt tttctccact 2881 ttgttggaaa gcaggtggca tctctaattg gaagaaattc ctgtagcatc ttctggagtc 2941 tccagtggtt gctgttgatg aggcctcttg gacctctgct ctgaggcttc cagagagtcc 3001 tctggatggc accagaggct gcagaaggcc aagaatcaag ctagaaggcc acatgtcacc 3061 gtggaccttc ctgccaccag tcactgtccc tcaaatgacc caaagaccaa tattcaaatg 3121 cgtaattaaa agaattttcc cc // LOCUS HUMGMPIPDI 3980 bp mRNA PRI 27-MAY-1992 DEFINITION Human cGMP-inhibited cAMP phosphodiesterase mRNA, complete cds. ACCESSION M91667 NID g183392 KEYWORDS cGMP-inhibited cAMP phosphodiesterase. SOURCE Homo sapiens myocardium cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3980) AUTHORS Meacci,E., Taira,M., Moos,M., Smith,C.J., Movsesian,M.A., Degerman,E., Belfrage,P. and Manganiello,V. TITLE Molecular cloning and expression of human myocardial cGMP-inhibited cAMP phosphodiesterase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89, 3721-3725 (1992) MEDLINE 92237240 FEATURES Location/Qualifiers source 1..3980 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="myocardium" CDS 23..3448 /codon_start=1 /evidence=experimental /product="cGMP-inhibited cAMP phosphodiesterase" /db_xref="PID:g183393" /translation="MAVPGDAARVRNKPVHSGVSQAPTAGRDCHHRADPASPRDSGCR GCWGDLVLQPLRSSRKLSLPLCAGSLSFLLALLVRLVRGEVGCDLEQCKEAAAAEEEE AAPGAEGGVFPGPRGGAPGGGARLSPWLQPSALLFSLLCAFFWMGLYLLRAGVRLPLA VALLAACCGGEALVQIGLGVGEDHLLSLPAAGVVLSCLAAATWLVLRLRLGVLMIALT SAVRTVSLISLERFKVAWRPYLAYLAGVLGILLARYVEQILPQSAEAAPREHLGSQLI AGTKEDIPVFKRRRRSSSVVSAEMSGCSSKSHRRTSLPCIPREQLMGHSEWDHKRGPR GSQSSGTSITVDIAVMGEATASLPTSWQTLLFHQTCATSLRAVSNLLSTQLTFQAIHK PRVNPVTSLSENYTCSDSEESSEKDKLAIPKRLRRSLPPGLLRRVSSTWTTTTSATGL PTLEPAPVRRDRSTSIKLQEAPSSSPDSWNNPVMMTLTKSRSFTSSYAISAANHVKAK KQSRPGALAKISPLSSPCSSPLQGTPASSLVSKISAVQFPESADTTAKQSLGSHRALT YTQSAPDLSPQILTPPVICSSCGRPYSQGNPADEPLERSGVATRTPSRTDDTAQVTSD YETNNNSDSSDIVQNEDETECLREPLRKASACSTYAPETMMFLDKPILAPEPLVMDNL DSIMEQLNTWNFPIFDLVENIGRKCGRILSQVSYRLFEDMGLFEAFKIPIREFMNYFH ALEIGYRDIPYHNRIHATDVLHAVWYLTTQPIPGLSTVINDHGSTSDSDSDSGFTHGH MGYVFSKTYNVTDDKYGCLSGNIPALELMALYVAAAMHDYDHPGRTNAFLVATSAPQA VLYNDRSVLENHHAAAAWNLFMSRPEYNFLINLDHVEFKHFRFLVIEAILATDLKKHF DFVAKFNGKVNDDVGIDWTNENDRLLVCQMCIKLADINGPAKCKELHLQWTDGIVNEF YEQGDEEASLGLPISPFMDRSAPQLANLQESFISHIVGPLCNSYDSAGLMPGKWVEDS DESGDTDDPEEEEEEAPAPNEEETCENNESPKKKTFKRRKIYCQITQHLLQNHKMWKK VIEEEQRLAGIENQSLDQTPQSHSSEQIQAIKEEEEEKGKPRGEEIPTQKPDQ" BASE COUNT 1038 a 983 c 1008 g 951 t ORIGIN 1 agtgaagagg gcaccctata ccatggcagt gcccggcgac gctgcacgag tcaggaacaa 61 gcccgtccac agtggggtga gtcaagcccc cacggcgggc cgggactgcc accatcgtgc 121 ggaccccgca tcgccgcggg actcgggctg ccgtggctgc tggggagacc tggtgctgca 181 gccgctccgg agctctcgga aactttccct gccgctgtgc gcgggctccc tatcctttct 241 gctggcgctg ctggtgaggc tggtccgcgg ggaggtcggc tgtgacctgg agcagtgtaa 301 ggaggcggcg gcggcggagg aggaggaagc agccccggga gcagaagggg gcgtcttccc 361 ggggcctcgg ggaggtgctc ccgggggcgg tgcgcggctc agcccctggc tgcagccctc 421 ggcgctgctc ttcagtctcc tgtgtgcctt cttctggatg ggcttgtacc tcctgcgcgc 481 cggggtgcgc ctgcctctgg ctgtcgcgct gctggccgcc tgctgcgggg gggaagcgct 541 cgtccagatt gggctgggcg tcggggagga tcacttactc tcactccccg ctgcgggggt 601 ggtgctcagc tgcttggccg ccgcgacatg gctggtgctg aggctgaggc tgggcgtcct 661 catgatcgcc ttgactagcg cggtcaggac cgtgtccctc atttccttag agaggttcaa 721 ggtcgcctgg agaccttacc tggcgtacct ggccggcgtg ctggggatcc tcttggccag 781 gtacgtggaa caaatcttgc cgcagtccgc ggaggcggct ccaagggagc atttggggtc 841 ccagctgatt gctgggacca aggaagatat cccggtgttt aagaggagga ggcggtccag 901 ctccgtcgtg tccgccgaga tgtccggctg cagcagcaag tcccatcgga ggacctccct 961 gccctgtata ccgagggaac agctcatggg gcattcagaa tgggaccaca aacgagggcc 1021 aagaggatca cagtcttcag gaaccagtat tactgtggac atcgccgtca tgggcgaagc 1081 cacggcctca ttaccgacct cctggcagac ccttctcttc caccaaacgt gtgccacatc 1141 cttgagagcc gtgagcaact tgctcagcac acagctcacc ttccaggcca ttcacaagcc 1201 cagagtgaat cccgttactt cgctcagtga aaactatacc tgttctgact ctgaagagag 1261 ctctgaaaaa gacaagcttg ctattccaaa gcgcctgaga aggagtttgc ctcctggctt 1321 gttgagacga gtttcttcca cttggaccac caccacctcg gccacaggtc tacccacctt 1381 ggagcctgca ccagtacgga gagaccgcag caccagcatc aaactgcagg aagcaccttc 1441 atccagtcct gattcttgga ataatccagt gatgatgacc ctcaccaaaa gcagatcctt 1501 tacttcatcc tatgctattt ctgcagctaa ccatgtaaag gctaaaaagc aaagtcgacc 1561 aggtgccctc gctaaaattt cacctctttc atcgccctgc tcctcacctc tccaagggac 1621 tcctgccagc agcctggtca gcaaaatttc tgcagtgcag tttccagaat ctgctgacac 1681 aactgccaaa caaagcctag gttctcacag ggccttaact tacactcaga gtgccccaga 1741 cctatcccct caaatcctga ctccacctgt tatatgtagc agctgtggca gaccatattc 1801 ccaagggaat cctgctgatg agcccctgga gagaagtggg gtagccactc ggacaccaag 1861 tcgaacagat gacactgctc aagttacctc tgattatgaa accaataaca acagtgacag 1921 cagtgacatt gtacagaatg aagatgaaac agagtgcctg agagagcctc tgaggaaagc 1981 atcggcttgc agcacctatg ctcctgagac catgatgttt ctggacaaac caattcttgc 2041 tcccgaacct cttgtcatgg ataacctgga ctcaattatg gagcagctaa atacttggaa 2101 ttttccaatt tttgatttag tggaaaatat aggaagaaaa tgtggccgta ttcttagtca 2161 ggtatcttac agactttttg aagacatggg cctctttgaa gcttttaaaa ttccaattag 2221 ggaatttatg aattattttc atgctttgga gattggatat agggatattc cttatcataa 2281 cagaatccat gccactgatg ttttacatgc tgtttggtat cttactacac agcctattcc 2341 aggcctctca actgtgatta atgatcatgg ttcaaccagt gattcagatt ctgacagtgg 2401 atttacacat ggacatatgg gatatgtatt ctcaaaaacg tataatgtga cagatgataa 2461 atacggatgt ctgtctggga atatccctgc cttggagttg atggcgctgt atgtggctgc 2521 agccatgcac gattatgatc atccaggaag gactaatgct ttcctggttg caactagtgc 2581 tcctcaggcg gtgctatata acgatcgttc agttttggag aatcatcacg cagctgctgc 2641 atggaatctt ttcatgtccc ggccagagta taacttctta attaaccttg accatgtgga 2701 atttaagcat ttccgtttcc ttgtcattga agcaattttg gccactgacc tgaagaaaca 2761 ctttgacttc gtagccaaat ttaatggcaa ggtaaatgat gatgttggaa tagattggac 2821 caatgaaaat gatcgtctac tggtttgtca aatgtgtata aagttggctg atatcaatgg 2881 tccagctaaa tgtaaagaac tccatcttca gtggacagat ggtattgtca atgaatttta 2941 tgaacagggt gatgaagagg ccagccttgg attacccata agccccttca tggatcgttc 3001 tgctcctcag ctggccaacc ttcaggaatc cttcatctct cacattgtgg ggcctctgtg 3061 caactcctat gattcagcag gactaatgcc tggaaaatgg gtggaagaca gcgatgagtc 3121 aggagatact gatgacccag aagaagagga ggaagaagca ccagcaccaa atgaagagga 3181 aacctgtgaa aataatgaat ctccaaaaaa gaagactttc aaaaggagaa aaatctactg 3241 ccaaataact cagcacctct tacagaacca caagatgtgg aagaaagtca ttgaagagga 3301 gcaacggttg gcaggcatag aaaatcaatc cctggaccag acccctcagt cgcactcttc 3361 agaacagatc caggctatca aggaagaaga agaagagaaa gggaaaccaa gaggcgagga 3421 gataccaacc caaaagccag accagtgaca atggatagaa tgggctgtgt ttccaaacag 3481 attgacttgt caaagactct cttcaagcca gcacaagcat ttagatcaca acactgtaga 3541 aatttgagat gggcaaatgg ctattgcatt ttgggattct tcgcattttg tgtgtatatt 3601 tttacagtga ggtacattgt taaaaacttt ttgctcaaag aagctttcac attgcaacac 3661 cagcttctaa ggatttttta aggagggaat atatatgtgt gtgtgtatat aagctcccac 3721 atagatacat gtaaaacata ttcacaccca tgcacgcaca cacatacaca ctgaaggcca 3781 cgattgctgg ctccacaatt tagtaacatt tatattaaga tatatatata gtggtcactg 3841 tgatataata aatcataaag gaaaccaaat cacaaaggag atggtgtggc ttagcaagga 3901 aacagtgcag gaaatgtagg ttaccaacta agcagctttt gctcttagta ctgagggatg 3961 aaagttccag agcattattt // LOCUS HUMGNAZ 2679 bp mRNA PRI 08-NOV-1994 DEFINITION Human transducin alpha-subunit (GNAZ) mRNA, complete cds. ACCESSION J03260 NID g183408 KEYWORDS guanine nucleotide-binding regulatory protein; transducin. SOURCE Human retina, cDNA to mRNA, clone lambda-alpha-161. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2679) AUTHORS Fong,H.K., Yoshimoto,K.K., Eversole-Cire,P. and Simon,M.I. TITLE Identification of a GTP-binding protein alpha subunit that lacks an apparent ADP-ribosylation site for pertussis toxin JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85 (9), 3066-3070 (1988) MEDLINE 88203641 COMMENT Draft entry and printed copy of sequence for [1] kindly provided by H.K.W.Fong, 07-APR-1988. FEATURES Location/Qualifiers source 1..2679 /organism="Homo sapiens" /db_xref="taxon:9606" /map="22q11.1-q11.2" mRNA <1..2679 /note="GNAZ mRNA" gene 13..1080 /gene="GNAZ" CDS 13..1080 /gene="GNAZ" /note="transducin alpha-subunit" /codon_start=1 /db_xref="GDB:G00-120-003" /db_xref="PID:g306775" /translation="MGCRQSSEEKEAARRSRRIDRHLRSESQRQRREIKLLLLGTSNS GKSTIVKQMKIIHSGGFNLEACKEYKPLIIYNAIDSLTRIIRALAALRIDFHNPDRAY DAVQLFALTGPAESKGEITPELLGVMRRLWADPGAQACFSRSSEYHLEDNAAYYLNDL ERIAAADYIPTVEDILRSRDMTTGIVENKFTFKELTFKMVDVGGQRSERKKWIHCFEG VTAIIFCVELSGYDLKLYEDNQTSRMAESLRLFDSICNNNWFINTSLILFLNKKDLLA EKIRRIPLTICFPEYKGQNTYEEAAVYIQRQFEDLNRNKETKEIYSHFTCATDTSNIQ FVFDAVTDVIIQNNLKYIGLC" BASE COUNT 650 a 728 c 700 g 601 t ORIGIN 647 bp upstream of BamHI site. 1 gagaccagga ccatgggatg tcggcaaagc tcagaggaaa aagaagcagc ccggcggtcc 61 cggagaattg accgccacct gcgctcagag agccagcggc aacgccgcga aatcaagctg 121 ctcctgctgg gcaccagcaa ctcaggcaag agcaccatcg tcaaacagat gaagatcatc 181 cacagcggcg gcttcaacct ggaggcctgc aaggagtaca agcccctcat catctacaat 241 gccatcgact cgctgacccg catcatccgg gccctggccg ccctcaggat cgacttccac 301 aaccccgacc gcgcctacga cgctgtgcag ctctttgcgc tgacgggccc cgctgagagc 361 aagggcgaga tcacacccga gctgctgggt gtcatgcgac ggctctgggc cgacccaggg 421 gcacaggcct gcttcagccg ctccagcgag taccacctgg aggacaacgc ggcctactac 481 ctgaacgacc tggagcgcat cgccgcagct gactatatcc ccactgtcga ggacatcctg 541 cgctcccggg acatgaccac gggcattgtg gagaacaagt tcaccttcaa ggagctcacc 601 ttcaagatgg tggacgtggg ggggcagagg tcagagcgca aaaagtggat ccactgcttc 661 gagggcgtca cagccatcat cttctgtgtg gagctcagcg gctacgacct gaaactctac 721 gaggataacc agacaagtcg gatggcagag agcttgcgcc tctttgactc catctgcaac 781 aacaactggt tcatcaacac ctcactcatc ctcttcctga acaagaagga cctgctggca 841 gagaagatcc gccgcatccc gctcaccatc tgctttcccg agtacaaggg ccagaacacg 901 tacgaggagg ccgctgtcta catccagcgg cagtttgaag acctgaaccg caacaaggag 961 accaaggaga tctactccca cttcacctgc gccaccgaca ccagtaacat ccagtttgtc 1021 ttcgacgcgg tgacagacgt catcatacag aacaatctca agtacattgg cctttgctga 1081 ggagctgggc ccggggcgcc tgcctatggt gaaacccacg gggtgtcatg ccccaacgcg 1141 tgctagagag gcccaatcca ggggcagaaa acagggggcc taaagaatgt cccccacccc 1201 ttggcctctg cctccttggc cccacatttc tgcaaacata aatatttacg gatagattgc 1261 taggtagata gacacacaca catgcacaca cacacatctg gagatggcaa aatcctctaa 1321 aatgtcgagg tctcttgaag acttgagaag ctgtcacaag gtcactacaa gcccaacctg 1381 ccccttcact ttgccttcct gagttggccc cactccactt gggggtctgc attggattgt 1441 tagggatagg cagcagggct gaggcaaggt aggccaactg cacccctgtc acctggagga 1501 gggccggctc gctgcccgag ctctggccta gggaccttgc cgctgaccaa gagggaggac 1561 cagtgcaggg tctgtgcacc ttccctgctg gcctgcacac agctgctcag caccatttca 1621 ttctggacct gggaccttag gagccgggtg acagcactaa ccagacctcc agccactcac 1681 agctcttttt aaaaaacagc ttcaaaatat gcagcaaaaa ccaatacaac aaaacgagtg 1741 gcacgattta tttcaaacta ggccagctgg gattccagct tttcttctac tagtctgatg 1801 ttttataaat caaaacctgg ttttccttct ctggcatttt tttttgtttt ttgttttttg 1861 gttttttttt tttttttggc caaatctcgt ggtgtttcgc agaaaaaaat ccagaaaatt 1921 tcaaatgcag ttgagtattc ttttttaaat gcagattttc aaaacatatt ttttttcagg 1981 tggtcttttt tgtgtctggc ttgctgagtg taaaagttgt tatctggacg atctgtctct 2041 ctgctccaaa gaaattttgg agtgagtggc agtcctgcgc cagcctcgcg ggacacgtgt 2101 tgtacataag cctctgcagt gtcctcttgt taatggtggg gttttctgct ttgtttttat 2161 ttaagaaaat aaacacgaca tatttaaaga aggttctttc acctgggagc aaatgaacaa 2221 tagctaagtg tcttggtatt taaagagtaa attatttgtg gctttgctga gtgaaggaag 2281 gggagcaagg ggtggtgccc ctggtcccag catgccccgc gcctgagact ggctggaaat 2341 gctctgactc ctgtgaaggc acagccagcg ttgtggcctg agggaggccc tgctgggacc 2401 ctgatctggg ccttcctgtc ccagggccta tgggcaactg cgttgaaagg acgttcgcca 2461 agggccgtgt gtaaatacga actgcgccat ggagaggaga ggcactgccg gagcccttgc 2521 cagatctccc tccctctctc tgtgcagtag ctgtgtgtcc gaggtcagtg tgcggaatca 2581 cagccaagga cgtgaagaga tgtacggggg aaagagaagc tggggattgg atgaaagtca 2641 aaggttgtct actttaagaa aataaaatac cctgaatgg // LOCUS HUMGNEFA 4002 bp mRNA PRI 06-JUL-1993 DEFINITION Human guanine nucleotide exchange factor mRNA, complete cds. ACCESSION L13857 NID g306777 KEYWORDS guanine nucleotide exchange factor. SOURCE Homo sapiens (library: lambda gt10) fetus brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4002) AUTHORS Chardin,P., Camonis,J.H., Gale,N.W., Van Aelst,L., Wigler,M.H. and Bar-Sagi,D. TITLE Human Sos 1: A guanine nucleotide exchange factor for Ras that binds to GRB2 JOURNAL Science 260, 1338-1343 (1993) MEDLINE 93262494 FEATURES Location/Qualifiers source 1..4002 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="brain" /tissue_lib="lambda gt10" CDS 1..4002 /codon_start=1 /product="guanine nucleotide exchange factor" /db_xref="PID:g306778" /translation="MQAQQLPYEFFSEENAPKWRGLLVPALKKVQGQVHPTLESNDDA LQYVEELILQLLNMLCQAQPRSASDVEERVQKSFPHPIDKWAIADAQSAIEKRKRRNP LSLPVEKIHPLLKEVLGYKIDHQVSVYIVAVLEYISADILKLVGNYVRNIRHYEITKQ DIKVAMCADKVLMDMFHQDVEDINILSLTDEEPSTSGEQTYYDLVKAFMAEIRQYIRE LNLIIKVFREPFVSNSKLFSANDVENIFSRIVDIHELSVKLLGHIEDTVEMTDEGSPH PLVGSCFEDLAEELAFDPYESYARDILRPGFHDRFLSQLSKPGAALYLQSIGEGFKEA VQYVLPRLLLAPVYHCLHYFELLKQLEEKSEDQEDKECLKQAITALLNVQSGMEKICS KSLAKRRLSESACRFYSQQMKGKQLAIKKMNEIQKNIDGWEGKDIGQCCNEFIMEGTL TRVGAKHERHIFLFDGLMICCKSNHGQPRLPGASNAEYRLKEKFFMRKVQINDKDDTN EYKHAFEIILKDENSVIFSAKSAEEKNNWMAALISLQYRSTLERMLDVTMLQEEKEEQ MRLPSADVYRFAEPDSEENIIFEENMQPKAGIPIIKAGTVIKLIERLTYHMYADPNFV RTFLTTYRSFCKPQELLSLIIERFEIPEPEPTEADRIAIENGDQPLSAELKRFRKEYI QPVQLRVLNVCRHWVEHHFYDFERDAYLLQRMEEFIGTVRGKAMKKWVESITKIIQRK KIARDNGPGHNITFQSSPPTVEWHISRPGHIETFDLLTLHPIEIARQLTLLESDLYRA VQPSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIVETENLEERVAVVSRII EILQVFQELNNFNGVLEVVSAMNSSPVYRLDHTFEQIPSRQKKILEEAHELSEDHYKK YLAKLRSINPPCVPFFGIYLTNILKTEEGNPEVLKRHGKELINFSKRRKVAEITGEIQ QYQNQPYCLRVESDIKRFFENLNPMGNSMEKEFTDYLFNKSLEIEPRNPKPLPRFPKK YSYPLKSPGVRPSNPRPGTMRHPTPLQQEPRKISYSRIPESETESTASAPNSPRTPLT PPPASGASSTTDVCSVFDSDHSSPFHSSNDTVFIQVTLPHGPRSASVSSISLTKGTDE VPVPPPVPPRRRPESAPAESSPSKIMSKHLDSPPAIPPRQPTSKAYSPRYSISDRTSI SDPPESPPLLPPREPVRTPDVFSSSPLHLQPPPLGKKSDHGNAFFPNSPSPFTPPPPQ TPSPHGTRRHLPSPPLTQEVDLHSIAGPPVPPRQSTSQHIPKLPPKTYKREHTHPSMH RDGPPLLENAHSS" BASE COUNT 1341 a 829 c 783 g 1049 t ORIGIN 1 atgcaggcgc agcagctgcc ctacgagttt ttcagcgaag agaacgcgcc caagtggcgg 61 ggactactgg tgcctgcgct gaaaaaggtc caggggcaag ttcatcctac tctcgagtct 121 aatgatgatg ctcttcagta tgttgaagaa ttaattttgc aattattaaa tatgctatgc 181 caagctcagc cccgaagtgc ttcagatgta gaggaacgtg ttcaaaaaag tttccctcat 241 ccaattgata aatgggcaat agctgatgcc caatcagcta ttgaaaagag gaagcgaaga 301 aaccctttat ctctcccagt agaaaaaatt catcctttat taaaggaggt cctaggttat 361 aaaattgacc accaggtttc tgtttacata gtagcagtct tagaatacat ttctgcagac 421 attttaaagc tggttgggaa ttatgtaaga aatatacggc attatgaaat tacaaaacaa 481 gatattaaag tggcaatgtg tgctgacaag gtattgatgg atatgtttca tcaagatgta 541 gaagatatta atatattatc tttaactgac gaagagcctt ccacctcagg agaacaaact 601 tactatgatt tggtaaaagc atttatggca gaaattcgac aatatataag ggaactaaat 661 ctaattataa aagtttttag agagcccttt gtctccaatt caaaattgtt ttcagctaat 721 gatgtagaaa atatatttag tcgcatagta gatatacatg aacttagtgt aaagttactg 781 ggccatatag aagatacagt agaaatgaca gatgaaggca gtccccatcc actagtagga 841 agctgctttg aagacttagc agaggaactg gcatttgatc catatgaatc gtatgctcga 901 gatattttgc gacctggttt tcatgatcgt ttccttagtc agttatcaaa gcctggggca 961 gcactttatt tgcagtcaat aggcgaaggt ttcaaagaag ctgttcaata tgttttaccc 1021 aggctgcttc tggcccctgt ttaccactgt ctccattact ttgaactttt gaagcagtta 1081 gaagaaaaaa gtgaagatca agaagacaag gaatgtttaa aacaagcaat aacagctttg 1141 cttaatgttc agagtggtat ggaaaaaata tgttctaaaa gtcttgcaaa acgaagactg 1201 agtgaatctg catgtcggtt ttatagtcag caaatgaagg ggaaacaact agcaatcaag 1261 aagatgaacg agattcagaa gaatattgat ggttgggagg gaaaagacat tggacagtgt 1321 tgtaatgaat ttataatgga aggaactctt acacgtgtag gagccaaaca tgagagacac 1381 atatttctct ttgatggctt aatgatttgc tgtaaatcaa atcatgggca gccaagactt 1441 cctggtgcta gcaatgcaga atatcgtctt aaagaaaagt tttttatgcg aaaggtacaa 1501 attaatgata aagatgacac caatgaatac aagcatgctt ttgaaataat tttaaaagat 1561 gaaaatagtg ttatattttc tgccaagtca gctgaagaga aaaacaattg gatggcagca 1621 ttgatatctt tacagtaccg gagtacactg gaaaggatgc ttgatgtaac aatgctacag 1681 gaagagaaag aggagcagat gaggctgcct agtgctgatg tttatagatt tgcagagcct 1741 gactctgaag agaatattat atttgaagag aacatgcagc ccaaggctgg aattccaatt 1801 atcaaagcag gaactgttat taaacttata gagaggctta cgtaccatat gtacgcagat 1861 cccaattttg ttcggacatt tcttacaaca tacagatcct tttgcaaacc tcaagaacta 1921 ctgagtctta taatagaaag gtttgaaatt ccagagcctg agccaacaga agctgatcgc 1981 atagctatag agaatggaga tcaacccttg agtgcagaac tgaaaagatt tagaaaagaa 2041 tatatacagc ctgtgcaact gcgagtatta aatgtatgtc ggcactgggt agagcaccac 2101 ttctatgatt ttgaaagaga tgcatatctt ttgcaacgaa tggaagaatt tattggaaca 2161 gtaagaggta aagcaatgaa aaaatgggtt gaatccatca ctaaaataat ccaaaggaaa 2221 aaaattgcaa gagacaatgg accaggtcat aatattacat ttcagagttc acctcccaca 2281 gttgagtggc atataagcag acctgggcac atagagactt ttgacctgct caccttacac 2341 ccaatagaaa ttgctcgaca actcacttta cttgaatcag atctataccg agctgtacag 2401 ccatcagaat tagttggaag tgtgtggaca aaagaagaca aagaaattaa ctctcctaat 2461 cttctgaaaa tgattcgaca taccaccaac ctcactctgt ggtttgagaa atgtattgta 2521 gaaactgaaa atttagaaga aagagtagct gtggtgagtc gaattattga gattctacaa 2581 gtctttcaag agttgaacaa ctttaatggt gtccttgagg ttgtcagtgc tatgaattca 2641 tcacctgttt acagactaga ccacacattt gagcaaatac caagtcgcca gaagaaaatt 2701 ttagaagaag ctcatgaatt gagtgaagat cactataaga aatatttggc aaaactcagg 2761 tctattaatc caccatgtgt gcctttcttt ggaatttatc tcactaatat cttgaaaaca 2821 gaagaaggca accctgaggt cctaaaaaga catggaaaag agcttataaa ctttagcaaa 2881 aggaggaaag tagcagaaat aacaggagag atccagcagt accaaaatca gccttactgt 2941 ttacgagtag aatcagatat caaaaggttc tttgaaaact tgaatccgat gggaaatagc 3001 atggagaagg aatttacaga ttatcttttc aacaaatccc tagaaataga accacgaaac 3061 cctaagcctc tcccaagatt tccaaaaaaa tatagctatc ccctaaaatc tcctggtgtt 3121 cgtccatcaa acccaagacc aggtaccatg aggcatccca cacctctgca gcaggagcca 3181 aggaaaatta gttatagtag gatccctgaa agtgaaacag aaagtacagc atctgcacca 3241 aattctccaa gaacaccgtt aacacctccg cctgcttctg gtgcttccag taccacagat 3301 gtttgcagtg tatttgattc cgatcattcg agcccttttc actcaagcaa tgataccgtc 3361 tttatccaag ttactctgcc ccatggccca agatctgctt ctgtatcatc tataagttta 3421 accaaaggca ctgatgaagt gcctgtccct cctcctgttc ctccacgaag acgaccagaa 3481 tctgccccag cagaatcttc accatctaag attatgtcta agcatttgga cagtccccca 3541 gccattcctc ctaggcaacc cacatcaaaa gcctattcac cacgatattc aatatcagac 3601 cggacctcta tctcagaccc tcctgaaagc cctcccttat taccaccacg agaacctgtg 3661 aggacacctg atgttttctc aagctcacca ctacatctcc aacctccccc tttgggcaaa 3721 aaaagtgacc atggcaatgc cttcttccca aacagccctt ccccctttac accacctcct 3781 cctcaaacac cttctcctca cggcacaaga aggcatctgc catcaccacc attgacacaa 3841 gaagtggacc ttcattccat tgctgggccg cctgttcctc cacgacaaag cacttctcaa 3901 catatcccta aactccctcc aaaaacttac aaaagggagc acacacaccc atccatgcac 3961 agagatggac caccactgtt ggagaatgcc cattcttcct ga // LOCUS HUMGNEFB 3999 bp mRNA PRI 06-JUL-1993 DEFINITION Human guanine nucleotide exchange factor mRNA, complete cds. ACCESSION L13858 NID g306779 KEYWORDS guanine nucleotide exchange factor. SOURCE Homo sapiens (library: lambda gt10) fetus brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3999) AUTHORS Chardin,P., Camonis,J.H., Gale,N.W., Van Aelst,L., Wigler,M.H. and Bar-Sagi,D. TITLE Human Sos 1: A guanine nucleotide exchange factor for Ras that binds to GRB2 JOURNAL Science 260, 1338-1343 (1993) MEDLINE 93262494 FEATURES Location/Qualifiers source 1..3999 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="brain" /tissue_lib="lambda gt10" CDS 1..3999 /codon_start=1 /product="guanine nucleotide exchange factor" /db_xref="PID:g306780" /translation="MQQAPQPYEFFSEENSPKWRGLLVSALRKVQVQVHPTLSANEES LYYIEELIFQLLNKLCMAQPRTVQDVEERVQKTFPHPIDKWAIADAQSAIEKRKRRNP LLLPVDKIHPSLKEVLGYKVDYHVSLYIVAVLEYISADILKLAGNYVFNIRHYEISQQ DIKVSMCADKVLMDMFDQDDIGLVSLCEDEPCSSGELNYYDLVRTEIAEERQYLRELN MIIKVFREAFLSDRKLFKPSVYEKIFSNISDIHELTVKLLGLIEDTVEMTDESSPHPL AGSCFEDLAEEQAFDPYETLSQDILSPEFHEHFNKLMARPAVALHFQSIADGFKEAVR YVLPRLMLVPVYHCWHYFELLKQLKACSEEQEDRECLNQAITALMNHQGSMDRIYKQY SPRRRPGDPVCPFYSHQLRSKHLAIKKMNEIQKNIDGWEGKDIGQCCNEFIMEGPLTR IGAKHERHIFLFDGLMISCKPNHGQTRLPGYTSAEYRLKEKFVMRKIQICDKEDTCEH KHAFELVSKDENSIIFAAKSAEEKNNWMAALISLHYRSTLDRMLDSVLLKEENEQPLR LPSPEVYRFVVKDSEENIVFEDNLQSRSGIPIIKGGTVVKLIERLTYHMYADPNFVRT FLTTYRSFCKPQELLSLLIERFEIPEPEPTDADKLAIEKGEQPISADLKRFRKEYVQP VQLRVLNVFRHWVDHHYYDFERDLELLERLESFISSVRGKAMKKWVESIAKIIRRKKQ AQANGVSHNITFESPPPPIEWHISKPGQFETFDLMTLDPIEIARQLTLLESDLYRKVQ PSELVGSVWTKEDKEINSPNLLKMIRHTTNLTLWFEKCIVEAENFEERVAVLSRIIEI LQVFQDLNNFNGVLEIVSAVNSVSVYRLDHTFEALQERKRKILDEAVELSQDHFKKYL VKLKSINPPCVPFFGIYLTNILKTEEGNNDFLKRKGKDLINFSKRRKVAEITGEIQQY QNQPYCLRIEPDMRRFFENLNPMGSASEKEFTDYLFNKSLEIEPRNCKQPPRFPRKST FSLKSPGIRPNTGRHGSTSGTLRGHPTPLEREPCKISFSRIAETELESTVSAPTSPNT PSTPPVSASSDLSVFLDVDLNSSCGSNSIFAPVLLPHSKSFFSSCGSLHKLSEEPLIP PPLPPRKKFDHDASNSKGNMKSDDDPPAIPPRQPPPPKVKPRVPVPTGAFDGPLHSPP PPPPRDPLPDTPPPVPLRPPEHFINCPFNLQPPPLGHLHRDSDWLRDISTCPNSPSTP PSTPSPRVPRRCYVLSSSQNNLAHPPAPPVPPRQNSSPHLPKLPPKTYKRELSHPPLY RLPLLENAETPQ" BASE COUNT 1279 a 810 c 799 g 1111 t ORIGIN 1 atgcagcagg cgccgcagcc ttacgagttc ttcagcgagg agaacagtcc gaaatggcgg 61 ggactgttgg tctcggccct gcggaaggtt caggttcaag tgcatcccac tctctcagct 121 aatgaagagt ctctctatta tattgaagag ctgatttttc agctgcttaa taaattatgc 181 atggcccagc caaggactgt tcaagatgta gaggagcgag ttcagaagac ctttcctcac 241 ccaattgata aatgggccat tgctgatgca caatctgcta tagaaaaacg aaaacgaaga 301 aatcctcttt tactgcctgt ggacaaaatc catccttcgt tgaaggaagt attagggtac 361 aaagtggact accatgtatc cctatatatt gtggctgtac tagagtatat ctcagctgat 421 attttaaaat tggctggtaa ttatgttttt aatatccggc attatgaaat atctcagcag 481 gacattaaag tgtcaatgtg tgcggataag gttttgatgg acatgtttga tcaggatgac 541 ataggtttgg tttctctctg tgaagatgaa ccctgttctt ctggtgaatt aaactactat 601 gatcttgtca gaactgaaat cgcagaagaa agacagtatc tacgggaatt aaatatgatc 661 ataaaagtgt ttcgagaagc ctttctttct gatagaaagc tgtttaaacc ttctgtatac 721 gaaaagattt ttagtaacat ttcagatata catgaattga ctgtgaaact tttaggtttg 781 attgaagaca cagttgaaat gactgatgaa agcagtcctc atcccttagc tggcagctgt 841 tttgaagatt tggcagaaga gcaagcattt gatccttatg aaacattatc acaggacatt 901 ctttcaccag agtttcatga acatttcaat aaattgatgg ccagacctgc agttgctcta 961 cactttcagt ccattgctga tggttttaaa gaggcagttc gttatgtcct tccacgtctt 1021 atgctggtgc cagtgtatca ctgttggcac tactttgagt tactaaagca attgaaagca 1081 tgtagtgaag aacaagaaga cagagaatgt ttgaaccaag ctattactgc tctcatgaat 1141 caccaaggta gcatggaccg aatttacaag cagtattcac ctagacgtcg acctggagat 1201 cctgtttgcc ctttttatag tcaccaatta agaagcaaac acctggctat caaaaaaatg 1261 aatgaaattc agaaaaatat cgatggatgg gaaggcaaag atattggaca gtgttgtaat 1321 gaattcatta tggagggacc attgacaaga atcggtgcca aacatgaacg gcatattttt 1381 ctgtttgatg gcttaatgat cagttgtaaa cctaatcatg gccagactcg gcttccaggt 1441 tacactagtg cagaatacag gttaaaagaa aaatttgtca tgaggaaaat acaaatttgt 1501 gataaagaag atacttgtga gcacaagcat gcatttgaat tagtatccaa agatgagaac 1561 agcataatat ttgctgctaa gtctgctgaa gaaaaaaaca actggatggc agcccttatt 1621 tctcttcatt atcgtagtac tctagatcga atgttagatt cagtattatt gaaagaagaa 1681 aatgagcaac cactgagatt accaagtcct gaagtatatc gttttgtagt aaaagactct 1741 gaggaaaaca ttgtttttga agacaacttg caaagtagaa gtggcatccc cattattaaa 1801 ggaggaactg tagtgaaatt aattgaaagg ttaacatatc atatgtatgc agatcccaat 1861 tttgttcgta cttttcttac cacatatcgt tcattttgta aaccacagga attgctgagc 1921 ttactgattg aacggtttga aattccagag ccagaaccta ctgacgcaga caaattggca 1981 atagagaaag gcgagcagcc aatcagtgca gaccttaaaa gatttcgcaa ggaatatgtc 2041 caaccagtac aacttagggt acttaatgta ttccgccatt gggttgacca tcattattat 2101 gactttgaaa gagacctgga attgctggaa agactagaat ccttcatttc aagtgtaaga 2161 gggaaagcta tgaaaaaatg ggtagagtca attgctaaga tcatcaggag gaagaagcaa 2221 gctcaggcaa atggagtaag ccataatatt acctttgaaa gtccacctcc accaattgaa 2281 tggcatatca gcaaaccagg acagtttgaa acatttgatc tcatgacact tgatccaata 2341 gaaattgcac gtcagctgac acttttggag tctgatcttt acaggaaagt tcaaccgtct 2401 gaacttgtag ggagtgtgtg gaccaaagaa gataaagaaa taaattctcc aaatttatta 2461 aaaatgattc gccataccac aaatctcacc ctctggtttg aaaaatgcat tgtggaagca 2521 gaaaattttg aagaacgggt ggcagtacta agtagaatta tagaaattct gcaagttttt 2581 caagatttga ataatttcaa tggcgtattg gagatagtca gtgcagtaaa ttcagtgtca 2641 gtatacagac tagaccatac ctttgaggca ctgcaggaaa ggaaaaggaa aattttggac 2701 gaagctgtgg aattaagtca agatcacttt aaaaaatacc tagtaaaact taagtcaatc 2761 aatccacctt gtgtgccttt ttttggaata tatttaacaa atattctgaa gaccgaagaa 2821 gggaataatg attttttaaa aagaaaggga aaagatttaa tcaatttcag taagaggagg 2881 aaagtagctg aaattactgg agaaattcag cagtatcaga atcagcctta ctgtttacgg 2941 atagaaccag atatgaggag attctttgaa aaccttaacc ccatgggaag tgcatctgaa 3001 aaagagttta cagattattt gttcaacaag tcactagaaa ttgaacctcg aaactgcaaa 3061 cagccacctc gatttcctag gaaatcaact ttttccttaa aatctcctgg aataaggcct 3121 aacacaggcc gacatggctc tacctcaggt actttacgag gtcacccaac accattagaa 3181 agagaaccat gtaaaataag ctttagtcgg attgctgaaa ctgagctgga atcaacagtg 3241 tcagcaccaa cctctccaaa tacaccatct actccaccag tatctgcttc ttcagacctt 3301 agtgtatttt tagatgtgga tctcaacagc tcctgtggca gcaatagcat ctttgctcca 3361 gtgcttttgc cacattcaaa gtctttcttt agttcatgtg gtagtttaca taaactaagt 3421 gaagagcccc tgattcctcc tcctcttcct cctcgaaaaa agtttgatca tgatgcttca 3481 aattccaagg gaaatatgaa atctgatgat gatcctcctg ctattccacc gagacagcct 3541 cctcctccaa aggtaaaacc cagagttcct gttcctactg gtgcatttga tgggcctctg 3601 catagtccac ctccgccacc accaagagat cctcttcctg atacccctcc accagttccc 3661 cttcggcctc cagaacactt tataaactgt ccatttaatc ttcagccacc tccactgggg 3721 catcttcaca gagattcaga ctggctcaga gacattagta cgtgtccaaa ttcgccaagc 3781 actcctccta gcacaccctc tccaagggta ccgcgtcgat gctatgtgct cagttctagt 3841 cagaataatc ttgctcatcc tccagctccc cctgttccac caaggcagaa ttcaagccct 3901 catctgccaa aactgccacc aaagacttac aaacgggagc tttcgcaccc cccattgtac 3961 agactgcctt tgctagaaaa tgcagaaact ccccaatga // LOCUS HUMGNLNA 571 bp mRNA PRI 05-NOV-1992 DEFINITION Homo sapiens guanylin mRNA, complete cds. ACCESSION M97496 NID g183414 KEYWORDS guanylin. SOURCE Homo sapiens Adult Duodenum cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 571) AUTHORS Wiegand,R.C., Kato,J., Huang,M.D., Fok,K.F., Kachur,J.F. and Currie,M.G. TITLE Human guanylin: cDNA isolation, sequence, and activity JOURNAL FEBS Lett. 311, 150-154 (1992) MEDLINE 93011964 FEATURES Location/Qualifiers source 1..571 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="Adult" /tissue_type="Duodenum" CDS 9..356 /note="precursor" /codon_start=1 /product="guanylin" /db_xref="PID:g183415" /translation="MNAFLLFALCLLGAWAALAGGVTVQDGNFSFSLESVKKLKDLQE PQEPRVGKLRNFAPIPGEPVVPILCSNPNFPEELKPLCKEPNAQEILQRLEEIAEDPG TCEICAYAACTGC" mat_peptide 309..353 /product="guanylin" polyA_signal 552..557 polyA_site 571 BASE COUNT 120 a 185 c 151 g 115 t ORIGIN 1 tcgctgccat gaatgccttc ctgctcttcg cactgtgcct ccttggggcc tgggccgcct 61 tggcaggagg ggtcaccgtg caggatggaa atttctcctt ttctctggag tcagtgaaga 121 agctcaaaga cctccaggag ccccaggagc ccagggttgg gaaactcagg aactttgcac 181 ccatccctgg tgaacctgtg gttcccatcc tctgtagcaa cccgaacttt ccagaagaac 241 tcaagcctct ctgcaaggag cccaatgccc aggagatact tcagaggctg gaggaaatcg 301 ctgaggaccc gggcacatgt gaaatctgtg cctacgctgc ctgtaccgga tgctaggggg 361 gcttgcccac tgcctgcctc ccctccgcag cagggaagct cttttctcct gcagaaaggg 421 ccacccatga tactccactc ccagcagctc aacctaccct ggtccagtcg ggaggagcag 481 cccggggagg aactgggtga ctggaggcct cgccccaaca ctgtccttcc ctgccacttc 541 aacccccagc taataaacca gattccagag t // LOCUS HUMGNRHR 2160 bp mRNA PRI 28-MAY-1993 DEFINITION Homo sapiens GnRH receptor mRNA, complete cds. ACCESSION L07949 NID g292052 KEYWORDS GnRH receptor. SOURCE Homo sapiens 5 females & 4 males ages 15, 34, 35, 43, 53, 64, 70, 80 & 83 whole pituitary gland cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2160) AUTHORS Chi,L., Zhou,W., Prikhozhan,A., Flanagan,C., Davidson,J.S., Golembo,M., Illing,N., Millar,R.P. and Sealfon,S.C. TITLE Cloning and characterization of the human GNRH receptor JOURNAL Mol. Cell. Endocrinol. 91, 1-6 (1993) MEDLINE 93231378 FEATURES Location/Qualifiers source 1..2160 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="ages 15, 34, 35, 43, 53, 64, 70, 80 & 83" /sex="5 females & 4 males" /tissue_type="whole pituitary gland" CDS 25..1011 /codon_start=1 /product="GnRH receptor" /db_xref="PID:g292053" /translation="MANSASPEQNQNHCSAINNSIPLMQGNLPTLTLSGKIRVTVTFF LFLLSATFNASFLLKLQKWTQKKEKGKKLSRMKLLLKHLTLANLLETLIVMPLDGMWN ITVQWYAGELLCKVLSYLKLFSMYAPAFMMVVISLDRSLAITRPLALKSNSKVGQSMV GLAWILSSVFAGPQLYIFRMIHLADSSGQTKVFSQCVTHCSFSQWWHQAFYNFFTFSC LFIIPLFIMLICNAKIIFTLTRVLHQDPHELQLNQSKNNIPRARLKTLKMTVAFATSF TVCWTPYYVLGIWYWFDPEMLNRLSDPVNHFFFLFAFLNPCFDPLIYGYFSL" BASE COUNT 660 a 465 c 388 g 647 t ORIGIN 1 cggagccttg tgtcctggga aaatatggca aacagtgcct ctcctgaaca gaatcaaaat 61 cactgttcag ccatcaacaa cagcatccca ctgatgcagg gcaacctccc cactctgacc 121 ttgtctggaa agatccgagt gacggttact ttcttccttt ttctgctctc tgcgaccttt 181 aatgcttctt tcttgttgaa acttcagaag tggacacaga agaaagagaa agggaaaaag 241 ctctcaagaa tgaagctgct cttaaaacat ctgaccttag ccaacctgtt ggagactctg 301 attgtcatgc cactggatgg gatgtggaac attacagtcc aatggtatgc tggagagtta 361 ctctgcaaag ttctcagtta tctaaagctt ttctccatgt atgccccagc cttcatgatg 421 gtggtgatca gcctggaccg ctccctggct atcacgaggc ccctagcttt gaaaagcaac 481 agcaaagtcg gacagtccat ggttggcctg gcctggatcc tcagtagtgt ctttgcagga 541 ccacagttat acatcttcag gatgattcat ctagcagaca gctctggaca gacaaaagtt 601 ttctctcaat gtgtaacaca ctgcagtttt tcacaatggt ggcatcaagc attttataac 661 tttttcacct tcagctgcct cttcatcatc cctcttttca tcatgctgat ctgcaatgca 721 aaaatcatct tcaccctgac acgggtcctt catcaggacc cccacgaact acaactgaat 781 cagtccaaga acaatatacc aagagcacgg ctgaagactc taaaaatgac ggttgcattt 841 gccacttcat ttactgtctg ctggactccc tactatgtcc taggaatttg gtattggttt 901 gatcctgaaa tgttaaacag gttgtcagac ccagtaaatc acttcttctt tctctttgcc 961 tttttaaacc catgctttga tccacttatc tatggatatt tttctctgtg attgatagac 1021 tacacaagaa gtcatatgaa gaagggtaag gtaatgaatc tctccatctg ggaatgatta 1081 acacaaatgt tggagcatgt ttacatacaa acaaagtagg atttacactt aagttatcat 1141 tcttttagaa actcagtctt cagagcctca attattaagg aaaagtcttc aggaaaaata 1201 ctaaaatatt ttctcttcct cataagcttc taaattaatc tctgcctttt ctgacctcat 1261 ataacacatt atgtaggttt cttatcactt tctctttgca taataatgta ctaatattta 1321 aaataccttc agcctaaggc acaaggatgc caaaaaaaca aaggtgagaa cccacaacac 1381 aggtctaaac tcagcatgct tggtgagttt ttctccaaag gggcatatta gcaattagag 1441 ttgtatgcta tataatacat agagcacaga gccctttgcc cataatatca actttccctc 1501 ctatagttaa aaagaaaaaa aaatgaatct atttttctct ttggcttcaa aagcattctg 1561 acatttggag gagtcagtaa ccaatcccac caaccactcc agcaacctga caagactatg 1621 agtagttctc cttcatccta tttatgtggt acaggttgtg aagtatctct atataaaggg 1681 aaattttaga ggggttagga tttggacagg ggtttagaac attcctctaa gctatctagt 1741 ctgtggagtt tgtggcaatt aattgccata aaataacatg tttccaaatg caactaagaa 1801 aatactcata gtgagtacgc tctatgcata gtatgacttc tatttaatgt gaagaatttt 1861 ttgtctctct cctgatctta ctaaatccat atttcataaa tgaactgaga ataattaaca 1921 aaattaagca aatgcacaag caaaagatgc ttgatacaca aaaggaactc tggagagaaa 1981 actacagctt cagtctgtac agatcaaaga agacagaaca tgtcagggga aggaggaaag 2041 atcttgatgc agggtttctt aacctgcagt ctatgcacaa cactatattt ccatgtaatg 2101 tttttatttc agccctattt gtattatttt gtgcatttaa aaaacacaat cttaaggccg // LOCUS HUMGOLGINA 2067 bp mRNA PRI 15-JUL-1993 DEFINITION Human (clone SY11) golgin-95 mRNA, complete cds. ACCESSION L06147 NID g306781 KEYWORDS golgin-95. SOURCE Homo sapiens (library: lambda ZAP) adult liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2067) AUTHORS Fritzler,M.J., Hamel,J.C., Ochs,R.L. and Chan,E.K.L. TITLE Molecular characterization of two human autoantigens: Unique cDNAs encoding 95- and 160-kD proteins of a putative family in the Golgi complex JOURNAL J. Exp. Med. 178, 49-62 (1993) MEDLINE 93301617 FEATURES Location/Qualifiers source 1..2067 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="hep G2" /cell_type="epithelial cell" /dev_stage="adult" /germline /tissue_type="liver" /tissue_lib="lambda ZAP" CDS 19..1881 /note="putative" /codon_start=1 /product="golgin-95" /db_xref="PID:g306782" /translation="MESVRQLQMERDKYAENLKGESAMWRQRMQQMSEQVHTLREEKE CSMSRVQELETSLAELRNQMAEPPPPEPPAGPSEVEQQLQAEAEHLRKELEGLAGQLQ AQVQDNEGLSRLNREQEERLLELERAAELWGEQAEARRQILETMQNDRTTISRALSQN RELKEQLAELQSGFVKLTNENMEITSALQSEQHVKRELGKKLGELQEKLSELKETVEL KSQEAQSLQQQRDQYLGHLQQYVAAYQQLTSEKEVLHNQLLLQTQLVDQLQQQEAQGK AVAEMARQELQETQERLEAATQQNQQLRAQLSLMAHPGEGDGLDREEEEDEEEEEEEA VAVPQPMPSIPEDLESREAMVAFFNSAVASAEEEQARLRGQLKEQRVRCRRLAHLLAS AQKEPEAAAPAPGTGGDSVCGETHRALQGAMEKLQSRFMELMQEKADLKERVEELEHR CIQLSGETDTIGEYIALYQSQRAVLKERHREKEEYISRLAQDKEEMKVKLLELQELVL RLVGDRNEWHGRFLAAAQNPADEPTSGAPAPQELGAANQQGDLCEVSLAGSVEPAQGE AREGSPRDNPTAQQIMQLLREMQNPRERPGLGSNPCIPFFYRADENDEVKITVI" polyA_site 2067 BASE COUNT 519 a 533 c 712 g 303 t ORIGIN 1 tagcatcagg ggctggtaat ggagtcggtt agacaactac aaatggagag agataaatat 61 gcggagaatc tcaaaggaga gagcgccatg tggcggcaga ggatgcagca gatgtcagag 121 caggtgcaca cattgagaga ggagaaggaa tgtagcatga gtcgggtaca ggagctggag 181 acgagcttgg ctgaactgag gaaccagatg gctgaacccc cgcccccaga gcccccagca 241 gggccctccg aggtggagca gcagctacaa gcggaggctg agcacctgcg gaaggagctg 301 gagggtctgg caggacagct tcaagcccag gtgcaagaca atgagggctt gagtcgcctg 361 aaccgggagc aggaggagag gctgctggag ctggagcggg cggccgagct ctggggggag 421 caggcggagg cgcgcaggca aatcctggag accatgcaga acgaccgcac taccatcagc 481 cgcgcactct cccagaaccg ggagctcaag gagcagctgg ctgagctgca gagcggattt 541 gtaaagctga ctaatgagaa catggagatc accagcgcac tgcagtcgga gcagcacgtc 601 aagagggagc tgggaaagaa gctgggcgag ctgcaggaga agctgagcga gctgaaggaa 661 acggtggagc tgaagagcca agaggctcaa agtctgcagc agcagcgaga ccagtacctg 721 ggacacctgc agcagtatgt ggccgcctat cagcagctga cctctgagaa ggaggtgctg 781 cataatcagc tactgctgca gacccagctc gtggaccagc tgcagcagca ggaagctcag 841 ggcaaagcgg tggccgagat ggcccgccaa gagttgcagg aaacccagga gcgcctggaa 901 gctgccaccc agcagaatca gcagctacgg gcccagttga gcctcatggc tcaccctggg 961 gaaggagatg gactggaccg ggaggaggag gaggatgagg aggaggagga ggaggaggcg 1021 gtggcagtac ctcagcccat gccaagcatc ccggaggacc tggagagccg ggaagccatg 1081 gtggcatttt tcaactcagc tgtagccagt gccgaggagg agcaggcaag gctacgtggg 1141 cagctgaagg agcaaagggt gcgctgccgg cgcctggctc acctgctggc ctcggcccag 1201 aaggagcctg aggcagcagc cccagcccca gggaccgggg gtgattctgt gtgtggggag 1261 acccaccggg ccctgcaggg ggccatggag aagctgcaga gccgctttat ggagctcatg 1321 caggagaagg cagacctgaa ggagagggta gaggaactgg aacatcgctg catccagctt 1381 tctggagaga cagacaccat tggagagtac attgcactgt accagagcca gagggcagtg 1441 ctgaaggagc ggcaccggga gaaggaggag tacatcagca ggctggccca agacaaggag 1501 gagatgaagg tgaagctgct ggagctgcag gagctggtct tacggcttgt gggcgaccgc 1561 aacgagtggc atggcagatt cctggcagct gcccagaacc ctgctgatga gcccacttca 1621 ggggccccag ccccccagga acttggggct gccaaccagc agggtgatct ttgcgaggtg 1681 agcctcgccg gcagtgtgga gcctgcccaa ggagaggcca gggagggttc tccccgtgac 1741 aaccccactg cacagcagat catgcagctg cttcgtgaga tgcagaaccc ccgggagcgc 1801 ccaggcttgg gcagcaaccc ctgcattcct tttttttacc gggctgacga gaatgatgag 1861 gtgaagatca ctgtcatcta aaagccggct actgtcagca aagcctgaag aagtggggct 1921 ggataccctg cccccaccat atccctacca tcccttctca gtcaaccctt tacccttaca 1981 gtagcaagca tagacccctg tctaacgggg gtagacaggt gcagatgagg tgaagatcac 2041 tgtcatctaa aagctggcca ctaaatt // LOCUS HUMGP3A 3170 bp mRNA PRI 08-NOV-1994 DEFINITION Human endothelial membrane glycoprotein IIIa (GPIIIa) mRNA, complete cds. ACCESSION J02703 NID g183452 KEYWORDS glycoprotein; glycoprotein IIIa. SOURCE Human umbilical vein endothelial cell, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3170) AUTHORS Fitzgerald,L.A., Steiner,B., Rall,S.C. Jr., Lo,S.S. and Phillips,D.R. TITLE Protein sequence of endothelial glycoprotein IIIa derived from a cDNA clone. Identity with platelet glycoprotein IIIa and similarity to 'integrin' JOURNAL J. Biol. Chem. 262 (9), 3936-3939 (1987) MEDLINE 87165991 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by L.A.Fitzgerald, 10-FEB-1987. The endothelial membrane glycoprotein IIIa is probably identical to the platelet glycoprotein IIIa. FEATURES Location/Qualifiers source 1..3170 /organism="Homo sapiens" /db_xref="taxon:9606" /map="17q21.32" sig_peptide 21..98 /gene="ITGB3" /note="glycoprotein IIIa signal peptide (putative); putative" CDS 21..2387 /gene="ITGB3" /note="glycoprotein IIIa precursor" /codon_start=1 /db_xref="GDB:G00-120-013" /db_xref="PID:g306786" /translation="MRARPRPRPLWVTVLALGALAGVGVGGPNICTTRGVSSCQQCLA VSPMCAWCSDEALPLGSPRCDLKENLLKDNCAPESIEFPVSEARVLEDRPLSDKGSGD SSQVTQVSPQRIALRLRPDDSKNFSIQVRQVEDYPVDIYYLMDLSYSMKDDLWSIQNL GTKLATQMRKLTSNLRIGFGAFVDKPVSPYMYISPPEALENPCYDMKTTCLPMFGYKH VLTLTDQVTRFNEEVKKQSVSRNRDAPEGGFDAIMQATVCDEKIGWRNDASHLLVFTT DAKTHIALDGRLAGIVQPNDGQCHVGSDNHYSASTTMDYPSLGLMTEKLSQKNINLIF AVTENVVNLYQNYSELIPGTTVGVLSMDSSNVLQLIVDAYGKIRSKVELEVRDLPEEL SLSFNATCLNNEVIPGLKSCMGLKIGDTVSFSIEAKVRGCPQEKEKSFTIKPVGFKDS LIVQVTFDCDCACQAQAEPNSHRCNNGNGTFECGVCRCGPGWLGSQCECSEEDYRPSQ QDECSPREGQPVCSQRGECLCGQCVCHSSDFGKITGKYCECDDFSCVRYKGEMCSGHG QCSCGDCLCDSDWTGYYCNCTTRTDTCMSSNGLLCSGRGKCECGSCVCIQPGSYGDTC EKCPTCPDACTFKKECVECKKFDREPYMTENTCNRYCRDEIESVKELKDTGKDAVNCT YKNEDDCVVRFQYYEDSSGKSILYVVEEPECPKGPDILVVLLSVMGAILLIGLAALLI WKLLITIHDRKEFAKFEEERARAKWDTANNPLYKEATSTFTNITYRGT" gene 21..2387 /gene="ITGB3" mat_peptide 99..2384 /gene="ITGB3" /note="glycoprotein IIIa" BASE COUNT 705 a 809 c 909 g 747 t ORIGIN 132 bp upstream of SacI site. 1 cgccgcggga ggcggacgag atgcgagcgc ggccgcggcc ccggccgctc tgggtgactg 61 tgctggcgct gggggcgctg gcgggcgttg gcgtaggagg gcccaacatc tgtaccacgc 121 gaggtgtgag ctcctgccag cagtgcctgg ctgtgagccc catgtgtgcc tggtgctctg 181 atgaggccct gcctctgggc tcacctcgct gtgacctgaa ggagaatctg ctgaaggata 241 actgtgcccc agaatccatc gagttcccag tgagtgaggc ccgagtacta gaggacaggc 301 ccctcagcga caagggctct ggagacagct cccaggtcac tcaagtcagt ccccagagga 361 ttgcactccg gctccggcca gatgattcga agaatttctc catccaagtg cggcaggtgg 421 aggattaccc tgtggacatc tactacttga tggacctgtc ttactccatg aaggatgatc 481 tgtggagcat ccagaacctg ggtaccaagc tggccaccca gatgcgaaag ctcaccagta 541 acctgcggat tggcttcggg gcatttgtgg acaagcctgt gtcaccatac atgtatatct 601 ccccaccaga ggccctcgaa aacccctgct atgatatgaa gaccacctgc ttgcccatgt 661 ttggctacaa acacgtgctg acgctaactg accaggtgac ccgcttcaat gaggaagtga 721 agaagcagag tgtgtcacgg aaccgagatg ccccagaggg tggctttgat gccatcatgc 781 aggctacagt ctgtgatgaa aagattggct ggaggaatga tgcatcccac ttgctggtgt 841 ttaccactga tgccaagact catatagcat tggacggaag gctggcaggc attgtccagc 901 ctaatgacgg gcagtgtcat gttggtagtg acaatcatta ctctgcctcc actaccatgg 961 attatccctc tttggggctg atgactgaga agctatccca gaaaaacatc aatttgatct 1021 ttgcagtgac tgaaaatgta gtcaatctct atcagaacta tagtgagctc atcccaggga 1081 ccacagttgg ggttctgtcc atggattcca gcaatgtcct ccagctcatt gttgatgctt 1141 atgggaaaat ccgttctaaa gtcgagctgg aagtgcgtga cctccctgaa gagttgtctc 1201 tatccttcaa tgccacctgc ctcaacaatg aggtcatccc tggcctcaag tcttgtatgg 1261 gactcaagat tggagacacg gtgagcttca gcattgaggc caaggtgcga ggctgtcccc 1321 aggagaagga gaagtccttt accataaagc ccgtgggctt caaggacagc ctgatcgtcc 1381 aggtcacctt tgattgtgac tgtgcctgcc aggcccaagc tgaacctaat agccatcgct 1441 gcaacaatgg caatgggacc tttgagtgtg gggtatgccg ttgtgggcct ggctggctgg 1501 gatcccagtg tgagtgctca gaggaggact atcgcccttc ccagcaggac gagtgcagcc 1561 cccgagaggg tcagcccgtc tgcagccagc ggggcgagtg cctctgtggt caatgtgtct 1621 gccacagcag tgactttggc aagatcacgg gcaagtactg cgagtgtgac gacttctcct 1681 gtgtccgcta caagggggag atgtgctcag gccatggcca gtgcagctgt ggggactgcc 1741 tgtgtgactc cgactggacc ggctactact gcaactgtac cacgcgtact gacacctgca 1801 tgtccagcaa tgggctgctg tgcagcggcc gcggcaagtg tgaatgtggc agctgtgtct 1861 gtatccagcc gggctcctat ggggacacct gtgagaagtg ccccacctgc ccagatgcct 1921 gcacctttaa gaaagaatgt gtggagtgta agaagtttga ccgggagccc tacatgaccg 1981 aaaatacctg caaccgttac tgccgtgacg agattgagtc agtgaaagag cttaaggaca 2041 ctggcaagga tgcagtgaat tgtacctata agaatgagga tgactgtgtc gtcagattcc 2101 agtactatga agattctagt ggaaagtcca tcctgtatgt ggtagaagag ccagagtgtc 2161 ccaagggccc tgacatcctg gtggtcctgc tctcagtgat gggggccatt ctgctcattg 2221 gccttgccgc cctgctcatc tggaaactcc tcatcaccat ccacgaccga aaagaattcg 2281 ctaaatttga ggaagaacgc gccagagcaa aatgggacac agccaacaac ccactgtata 2341 aagaggccac gtctaccttc accaatatca cgtaccgggg cacttaatga taagcagtca 2401 tcctcagatc attatcagcc tgtgccagga ttgcaggagt ccctgccatc atgtttacag 2461 aggacagtat ttgtggggag ggatttcggg gctcagagtg gggtaggttg ggagaatgtc 2521 agtatgtgga agtgtgggtc tgtgtgtgtg tatgtggggg tctgtgtgtt tatgtgtgtg 2581 tgttgtgtgt gggagtgtgt aatttaaaat tgtgatgtgt cctgataagc tgagctcctt 2641 agcctttgtc ccagaatgcc tcctgcaggg attcttcctg cttagcttga gggtgactat 2701 ggagctgagc aggtgttctt cattacctca gtgagaagcc agctttcctc atcaggccat 2761 tgtccctgaa gagaagggca gggctgaggc ctctcattcc agaggaaggg acaccaagcc 2821 ttggctctac cctgagttca taaatttatg gttctcaggc ctgactctca gcagctatgg 2881 taggaactgc tggcttggca gcccgggtca tctgtacctc tgcctccttt cccctccctc 2941 aggccgaagg aggagtcagg gagagctgaa ctattagagc tgcctgtgcc ttttgccatc 3001 ccctcaaccc agctatggtt ctctcgcaag ggaagtcctt gcaagctaat tctttgacct 3061 gttgggagtg aggatgtctg ggccactcag gggtcattca tggcctgggg gatgtaccag 3121 catctcccag ttcataatca caacccttca gatttgcctt attggcagcg // LOCUS HUMGPCRAA 2816 bp mRNA PRI 08-FEB-1996 DEFINITION Human mRNA for G protein-coupled receptor, complete cds. ACCESSION D38449 NID g556519 KEYWORDS G protein-coupled receptor. SOURCE Homo sapiens female 17-18 wk gestation fetal brain (library: lambda ZAPU) cDNA to mRNA, clone HB-954. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2816) AUTHORS Hata,S., Emi,Y., Iyanagi,T. and Osumi,T. TITLE cDNA cloning of a putative G protein-coupled receptor from brain JOURNAL Biochim. Biophys. Acta 1261 (1), 121-125 (1995) MEDLINE 95200959 REFERENCE 2 (bases 1 to 2816) AUTHORS Hata,S. TITLE Direct Submission JOURNAL Submitted (29-OCT-1994) to the DDBJ/EMBL/GenBank databases. Shingo Hata, Kyoto University, Faculty of Agriculture, Lab. of Applied Botany; Oiwake-cho, Kitashirakawa, Sakyo-ku, Kyoto, Kyoto 606-01, Japan (Tel:075-753-6141, Fax:075-753-6146) COMMENT Submitted (29-Oct-1994) to DDBJ by: Shingo Hata Laboratory of Applied Botany Faculty of Agriculture Kyoto University Kitashirakawa, Oiwake-cho Kyoto 606-01 Japan Phone: 075-753-6142 Fax: 075-753-6146. FEATURES Location/Qualifiers source 1..2816 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda ZAPU" /dev_stage="17-18 wk gestation" /sex="female" /tissue_type="fetal brain" CDS 867..2414 /note="putative" /codon_start=1 /product="G protein-coupled receptor" /db_xref="PID:d1008061" /db_xref="PID:g1088443" /translation="MGHNGSWISPNASEPHNASGAEAAGVNRSALGEFGEAQLYRQFT TTVQVVIFIGSLLGNFMVLWSTCRTTVFKSVTNRFIKNLACSGICASLVCVPFDIILS TSPHCCWWIYTMLFCKVVKFLHKVFCSVTILSFPAIALDRYYSVLYPLERKISDAKSR ELVMYIWAHAVVASVPVFAVTNVADIYATSTCTEVWSNSLGHLVYVLVYNITTVIVPV VVVFLFLILIRRALSASQKKKVIIAALRTPQNTISIPYASQREAELHATLLSMVMVFI LCSVPYATLVVYQTVLNVPDTSVFLLLTAVWLPKVSLLANPVLFLTVNKSVRKCLIGT LVQLHHRYSRRNVVSTGSGMAEASLEPSIRSGSQLLEMFHIGQQQIFKPTEDEEESEA KYIGSADFQAKEIFSTCLEGEQGPQFAPSAPPLSTVDSVSQVAPAAPVEPETFPDKYS LQFGFGPFELPPQWLSETRNSKKRLLPPLGNTPEELIQTKVPKVGRVERKMSRNNKVS IFPKVDS" BASE COUNT 558 a 813 c 846 g 599 t ORIGIN 1 tgggggcgtc ctccttcgtc cccgcccggc tgtcaagctg tgttctagcg gccgagggac 61 cgaggggggc taagaaaggg ggcgcccagc catgcagagg caaaaaggcg ctgcggaacg 121 gggtccccgt cgccagtgct gaggcaggag gtcggagcca caagtgaggg gctgggaagc 181 aggacccagc acgggcgtct tggcaggcgg ccgggcgcag ggccaggctg ctggggacgc 241 tcagggcttt ccacccaagc catgggcgct gtcgggcact cgggggtccc ctcgtggctc 301 cggccactcg gcgtgggcat tacgttggct tcacatcgcc atccagcctc gaagccaaca 361 ggactgaaaa atagcttcgg ccaaacgttc tcctcccgct aaggagaggg gtcgagtgcg 421 tcagcccgag gggactggag agggatgccc tagccctcga ggggcggagg acccgcggtt 481 gaaggaggca gcgggagcgg agagcgccct ccttgaccat cgaatgcctc cttctgtgtt 541 tccattcctg tcgagtgggc tgggccacgc tgaccaccct ggaggaggga cggacgacgc 601 tcggcgggct ctgaccgtgc cgccttcttg tggctgctga ctgggatcca ggagggagtg 661 ggcatggggc gcagccgcgc ctccctccct ccccgcctcc cgggcgccgg ggttggcgat 721 gtggagacgt gaggggaccc gtcggctgct ccggcttctc caggactccg ccaggcgccc 781 gcgcgtccct cctcacccgg aggaggagag gctccgcgcg gggctccgag gcgggcggcg 841 cgcggagccg gagtcccagc ctcgccatgg gacataacgg gagctggatc tctccaaatg 901 ccagcgagcc gcacaacgcg tccggcgccg aggctgcggg tgtgaaccgc agcgcgctcg 961 gggagttcgg cgaggcgcag ctgtaccgcc agttcaccac caccgtgcag gtcgtcatct 1021 tcataggctc gctgctcgga aacttcatgg tgttatggtc aacttgccgc acaaccgtgt 1081 tcaaatctgt caccaacagg ttcattaaaa acctggcctg ctcggggatt tgtgccagcc 1141 tggtctgtgt gcccttcgac atcatcctca gcaccagtcc tcactgttgc tggtggatct 1201 acaccatgct cttctgcaag gtcgtcaaat ttttgcacaa agtattctgc tctgtgacca 1261 tcctcagctt ccctgctatt gctttggaca ggtactactc agtcctctat ccactggaga 1321 ggaaaatatc tgatgccaag tcccgtgaac tggtgatgta catctgggcc catgcagtgg 1381 tggccagtgt ccctgtgttt gcagtaacca atgtggctga catctatgcc acgtccacct 1441 gcacggaagt ctggagcaac tccttgggcc acctggtgta cgttctggtg tataacatca 1501 ccacggtcat tgtgcctgtg gtggtggtgt tcctcttctt gatactgatc cgacgggccc 1561 tgagtgccag ccagaagaag aaggtcatca tagcagcgct ccggacccca cagaacacca 1621 tctctattcc ctatgcctcc cagcgggagg ccgagctgca cgccaccctg ctctccatgg 1681 tgatggtctt catcttgtgt agcgtgccct atgccaccct ggtcgtctac cagactgtgc 1741 tcaatgtccc tgacacttcc gtcttcttgc tgctcactgc tgtttggctg cccaaagtct 1801 ccctgctggc aaaccctgtt ctctttctta ctgtgaacaa atctgtccgc aagtgcttga 1861 tagggaccct ggtgcaacta caccaccggt acagtcgccg taatgtggtc agtacaggga 1921 gtggcatggc tgaggccagc ctggaaccca gcatacgctc gggtagccag ctcctggaga 1981 tgttccacat tgggcagcag cagatcttta agcccacaga ggatgaggaa gagagtgagg 2041 ccaagtacat tggctcagct gacttccagg ccaaggagat atttagcacc tgcctggagg 2101 gagagcaggg gccacagttt gcgccctctg ccccacccct gagcacagtg gactctgtat 2161 cccaggtggc accggcagcc cctgtggaac ctgaaacatt ccctgataag tattccctgc 2221 agtttggctt tgggcctttt gagttgcctc ctcagtggct ctcagagacc cgaaacagca 2281 agaagcggct gcttcccccc ttgggcaaca ccccagaaga gctgatccag acaaaggtgc 2341 ccaaggtagg cagggtggag cggaagatga gcagaaacaa taaagtgagc atttttccaa 2401 aggtggattc ctagcaagga ttgtaaattc ttggaagcaa cggggggctt ccatattccc 2461 accagagtgt gggaatgctg tggccatgtg attgtatgat ctccttgcaa ctcagtgtga 2521 gttgattcct ccaatatggg ccagatgctt ttgaatgata gggaaatcta cataaaatcc 2581 agtgtcctct ttattgaggg agtatatgta tccatctcag tgatccatgt ccttagtgaa 2641 gtccacatta ttctctgtgg ggacaagagc tgggcagttt tgaatgggtc ttgaggtggg 2701 taccccatgt gcactttctg aggatgcctc acttccctgg gctctgcaga gaacacacag 2761 agagaagact ttcagagctc acaggagcag ggagcaggag cactctaagg gaattc // LOCUS HUMGPCRB 1643 bp mRNA PRI 14-APR-1993 DEFINITION Human EBV induced G-protein coupled receptor (EBI2) mRNA, complete cds. ACCESSION L08177 NID g292056 KEYWORDS Epstein-Barr virus induced gene; G-protein coupled receptor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1643) AUTHORS Birkenbach,M.P., Josefsen,K., Yalamanchili,R.R., Lenoir,G.M. and Elliott,K. TITLE Epstein-Barr virus induced genes: First lymphocyte specific G protein coupled peptide receptors JOURNAL J. Virol. 67, 2209-2220 (1993) MEDLINE 93188173 FEATURES Location/Qualifiers source 1..1643 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="BL41/B95-8" /cell_type="B lymphocyte, EBV-converted Burkitt lymphoma" /germline CDS 34..1119 /codon_start=1 /product="EBI 2: EBV induced G-protein coupled receptor" /db_xref="PID:g292057" /translation="MDIQMANNFTPPSATPQGNDCDLYAHHSTARIVMPLHYSLVFII GLVGNLLALVVIVQNRKKINSTTLYSTNLVISDILFTTALPTRIAYYAMGFDWRIGDA LCRITALVFYINTYAGVNFMTCLSIDRFIAVVHPLRYNKIKRIEHAKGVCIFVWILVF AQTLPLLINPMSKQEAERITCMEYPNFEETKSLPWILLGACFIGYVLPLIIILICYSQ ICCKLFRTAKQNPLTEKSGVNKKALNTIILIIVVFVLCFTPYHVAIIQHMIKKLRFSN FLECSQRHSFQISLHFTVCLMNFNCCMDPFIYFFACKGYKRKVMRMLKRQVSVSISSA VKSAPEENSREMTETQMMIHSKSSNGK" BASE COUNT 509 a 339 c 285 g 510 t ORIGIN 1 ggaattccct gatatacacc tggaccacca ccaatggata tacaaatggc aaacaatttt 61 actccgccct ctgcaactcc tcagggaaat gactgtgacc tctatgcaca tcacagcacg 121 gccaggatag taatgcctct gcattacagc ctcgtcttca tcattgggct cgtgggaaac 181 ttactagcct tggtcgtcat tgttcaaaac aggaaaaaaa tcaactctac caccctctat 241 tcaacaaatt tggtgatttc tgatatactt tttaccacgg ctttgcctac acgaatagcc 301 tactatgcaa tgggctttga ctggagaatc ggagatgcct tgtgtaggat aactgcgcta 361 gtgttttaca tcaacacata tgcaggtgtg aactttatga cctgcctgag tattgaccgc 421 ttcattgctg tggtgcaccc tctacgctac aacaagataa aaaggattga acatgcaaaa 481 ggcgtgtgca tatttgtctg gattctagta tttgctcaga cactcccact cctcatcaac 541 cctatgtcaa agcaggaggc tgaaaggatt acatgcatgg agtatccaaa ctttgaagaa 601 actaaatctc ttccctggat tctgcttggg gcatgtttca taggatatgt acttccactt 661 ataatcattc tcatctgcta ttctcagatc tgctgcaaac tcttcagaac tgccaaacaa 721 aacccactca ctgagaaatc tggtgtaaac aaaaaggctc tcaacacaat tattcttatt 781 attgttgtgt ttgttctctg tttcacacct taccatgttg caattattca acatatgatt 841 aagaagcttc gtttctctaa tttcctggaa tgtagccaaa gacattcgtt ccagatttct 901 ctgcacttta cagtatgcct gatgaacttc aattgctgca tggacccttt tatctacttc 961 tttgcatgta aagggtataa gagaaaggtt atgaggatgc tgaaacggca agtcagtgta 1021 tcgatttcta gtgctgtgaa gtcagcccct gaagaaaatt cacgtgaaat gacagaaacg 1081 cagatgatga tacattccaa gtcttcaaat ggaaagtgaa atggattgta ttttggttta 1141 tagtgacgta aactgtatga caaactttgc aggacttccc ttataaagca aaataattgt 1201 tcagcttcca attagtattc ttttatattt ctttcattgg gcgctttccc atctccaact 1261 cggaagtaag cccaagagaa caacataaag caaacaacat aaagcacaat aaaaatgcaa 1321 ataaatattt tcatttttat ttgtaaacga atacaccaaa aggaggcgct cttaataact 1381 cccaatgtaa aaagttttgt tttaataaaa aattaattat tattcttgcc aacaaatggc 1441 tagaaaggac tgaatagatt atatattgcc agatgttaat actgtaacat actttttaaa 1501 taacatattt cttaaatcca aatttctctc aatgttagat ttaattccct caataacacc 1561 aatgttttgt tttgtttcgt tctgggtcat aaaactttgt taaggaactc ttttggaata 1621 aagagcagga tgctgcggaa ttc // LOCUS HUMGPCRD 1262 bp DNA PRI 31-JUL-1995 DEFINITION Homo sapiens G protein-coupled receptor (GPR3) gene, complete cds. ACCESSION L32831 NID g602311 KEYWORDS G protein-coupled receptor; G protein-coupled receptor GPR3. SOURCE Homo sapiens. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1262) AUTHORS Iismaa,T.P., Kiefer,J., Liu,M.L., Baker,E., Sutherland,G.R. and Shine,J. TITLE Isolation and chromosomal localization of a novel human G-protein-coupled receptor (GPR3) expressed predominantly in the central nervous system JOURNAL Genomics 24 (2), 391-394 (1994) MEDLINE 95213036 FEATURES Location/Qualifiers source 1..1262 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Kelly" /cell_type="neuroblastoma" intron <1..187 exon 188..1262 CDS 193..1185 /codon_start=1 /product="G protein-coupled receptor GPR3" /db_xref="PID:g602312" /translation="MMWGAGSPLAWLSAGSGNVNVSSVGPAEGPTGPAAPLPSPKAWD VVLCISGTLVSCENALVVAIIVGTPAFRAPMFLLVGSLAVADLLAGLGLVLHFAAVFC IGSAEMSLVLVGVLAMAFTASIGSLLAITVDRYLSLYNALTYYSETTVTRTYVMLALV WGGALGLGLLPVLAWNCLDGLTTCGVVYPLSKNHLVVLAIAFFMVFGIMLQLYAQICR IVCRHAQQIALQRHLLPASHYVATRKGIATLAVVLGAFAACWLPFTVYCLLGDAHSPP LYTYLTLLPATYNSMINPIIYAFRNQDVQKVLWAVCCCCSSSKIPFRSRSPSDV" BASE COUNT 213 a 415 c 332 g 302 t ORIGIN 1 attggagggg acagcggtat cctgggaaga gccccagggc atgaatgtgg ggataaggca 61 ttgggaccct atcaggtatc ctgaggagag actcccacca cgtatcctga gaagcacctc 121 accccctcca gaccccaact cccatcaccc agcttggtca gcttctcaca aggcctttct 181 cctgcaggta ccatgatgtg gggtgcaggc agccctctgg cctggctctc agctggctca 241 ggcaacgtga atgtaagcag cgtgggccca gcagaggggc ccacaggtcc agccgcacca 301 ctgccctcgc ctaaggcctg ggatgtggtg ctctgcatct caggcaccct ggtgtcctgc 361 gagaatgcgc tagtggtggc catcatcgtg ggcactcctg ccttccgtgc ccccatgttc 421 ctgctggtgg gcagcctggc cgtggcagac ctgctggcag gcctgggcct ggtcctgcac 481 tttgctgctg tcttctgcat cggctcagcg gagatgagcc tggtgctggt tggcgtgctg 541 gcaatggcct ttaccgccag catcggcagt ctactggcca tcactgtcga ccgctacctt 601 tctctgtaca atgccctcac ctactattca gagacaacag tgacacggac ctatgtgatg 661 ctggccttag tgtggggagg tgccctgggc ctggggctgc tgcctgtgct ggcctggaac 721 tgcctggatg gcctgaccac atgtggcgtg gtttatccac tctccaagaa ccatctggta 781 gttctggcca ttgccttctt catggtgttt ggcatcatgc tgcagctcta cgcccaaatc 841 tgccgcatcg tctgccgcca tgcccagcag attgcccttc agcggcacct gctgcctgcc 901 tcccactatg tggccacccg caagggcatt gccacactgg ccgtggtgct tggagccttt 961 gccgcctgct ggttgccctt cactgtctac tgcctgctgg gtgatgccca ctctccacct 1021 ctctacacct atcttacctt gctccctgcc acctacaact ccatgatcaa ccctatcatc 1081 tacgccttcc gcaaccagga tgtgcagaaa gtgctgtggg ctgtctgctg ctgctgttcc 1141 tcttccaaga tccccttccg atcccgctcc cccagtgatg tctagctgag tcttcatgac 1201 ccttcaaccc tgattactac agaattccag aatgttaggc tctccagggc ttctttccaa 1261 ac // LOCUS HUMGPIBA 2480 bp mRNA PRI 08-NOV-1994 DEFINITION Human platelet glycoprotein Ib alpha chain mRNA, complete cds. ACCESSION J02940 NID g183499 KEYWORDS glycoprotein Ib; surface membrane glycoprotein. SOURCE Human erythroleukemia (HEL) cell line, cDNA to mRNA, clones lambda-GPIb[1.1,2.4]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2480) AUTHORS Lopez,J.A., Chung,D.W., Fujikawa,K., Hagen,F.S., Papayannopoulou,T. and Roth,G.J. TITLE Cloning of the alpha chain of human platelet glycoprotein Ib: a transmembrane protein with homology to leucine-rich alpha 2-glycoprotein JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84 (16), 5615-5619 (1987) MEDLINE 87289655 COMMENT Draft entry and printed copy of sequence [1] kindly provided by G.J.Roth, 18-MAY-1987. The alpha and beta chains of platelet glycoprotein Ib are encoded by different mRNAs generated from two different genes. FEATURES Location/Qualifiers source 1..2480 /organism="Homo sapiens" /db_xref="taxon:9606" /map="17pter-p12" mRNA <1..2480 /note="PGIB mRNA" sig_peptide 43..90 /gene="GP1BA" /note="platelet glycoprotein Ib alpha chain signal peptide" gene 43..1923 /gene="GP1BA" CDS 43..1923 /gene="GP1BA" /note="platelet glycoprotein Ib alpha chain precursor" /codon_start=1 /db_xref="GDB:G00-118-806" /db_xref="PID:g306793" /translation="MPLLLLLLLLPSPLHPHPICEVSKVASHLEVNCDKRNLTALPPD LPKDTTILHLSENLLYTFSLATLMPYTRLTQLNLDRCELTKLQVDGTLPVLGTLDLSH NQLQSLPLLGQTLPALTVLDVSFNRLTSLPLGALRGLGELQELYLKGNELKTLPPGLL TPTPKLEKLSLANNNLTELPAGLLNGLENLDTLLLQENSLYTIPKGFFGSHLLPFAFL HGNPWLCNCEILYFRRWLQDNAENVYVWKQGVDVKAMTSNVASVQCDNSDKFPVYKYP GKGCPTLGDEGDTDLYDYYPEEDTEGDKVRATRTVVKFPTKAHTTPWGLFYSWSTASL DSQMPSSLHPTQESTKEQTTFPPRWTPNFTLHMESITFSKTPKSTTEPTPSPTTSEPV PEPAPNMTTLEPTPSPTTPEPTSEPAPSPTTPEPTPIPTIATSPTILVSATSLITPKS TFLTTTKPVSLLESTKKTIPELDQPPKLRGVLQGHLESSRNDPFLHPDFCCLLPLGFY VLGLFWLLFASVVLILLLSWVGHVKPQALDSGQGAALTTATQTTHLELQRGRQVTVPR AWLLFLRGSLPTFRSSLFLWVRPNGRVGPLVAGRRPSALSQGRGQDLLSTVSIRYSGH SL" mat_peptide 91..1920 /gene="GP1BA" /note="platelet glycoprotein Ib alpha chain" BASE COUNT 561 a 757 c 580 g 582 t ORIGIN 290 bp upstream od SstI site. 1 gacgctctgt gccttcggag gtctttctgc ctgcctgtcc tcatgcctct cctcctcttg 61 ctgctcctgc tgccaagccc cttacacccc caccccatct gtgaggtctc caaagtggcc 121 agccacctag aagtgaactg tgacaagagg aatctgacag cgctgcctcc agacctgccg 181 aaagacacaa ccatcctcca cctgagtgag aacctcctgt acaccttctc cctggcaacc 241 ctgatgcctt acactcgcct cactcagctg aacctagata ggtgcgagct caccaagctc 301 caggtcgatg ggacgctgcc agtgctgggg accctggatc tatcccacaa tcagctgcaa 361 agcctgccct tgctagggca gacactgcct gctctcaccg tcctggacgt ctccttcaac 421 cggctgacct cgctgcctct tggtgccctg cgtggtcttg gcgaactcca agagctctac 481 ctgaaaggca atgagctgaa gaccctgccc ccagggctcc tgacgcccac acccaagctg 541 gagaagctca gtctggctaa caacaacttg actgagctcc ccgctgggct cctgaatggg 601 ctggagaatc tcgacaccct tctcctccaa gagaactcgc tgtatacaat accaaagggc 661 ttttttgggt cccacctcct gccttttgct tttctccacg ggaacccctg gttatgcaac 721 tgtgagatcc tctattttcg tcgctggctg caggacaatg ctgaaaatgt ctacgtatgg 781 aagcaaggtg tggacgtcaa ggccatgacc tctaacgtgg ccagtgtgca gtgtgacaat 841 tcagacaagt ttcccgtcta caaataccca ggaaaggggt gccccaccct tggtgatgaa 901 ggtgacacag acctatatga ttactaccca gaagaggaca ctgagggcga taaggtgcgt 961 gccacaagga ctgtggtcaa gttccccacc aaagcccata caaccccctg gggtctattc 1021 tactcatggt ccactgcttc tctagacagc caaatgccct cctccttgca tccaacacaa 1081 gaatccacta aggagcagac cacattccca cctagatgga ccccaaattt cacacttcac 1141 atggaatcca tcacattctc caaaactcca aaatccacta ctgaaccaac cccaagcccg 1201 accacctcag agcccgtccc ggagcccgcc ccaaacatga ccaccctgga gcccactcca 1261 agcccgacca ccccagagcc cacctcagag cccgccccca gcccgaccac cccggagccc 1321 accccaatcc cgaccatcgc cacaagcccg accatcctgg tgtctgccac aagcctgatc 1381 actccaaaaa gcacattttt aactaccaca aaacccgtat cactcttaga atccaccaaa 1441 aaaaccatcc ctgaacttga tcagccacca aagctccgtg gggtgctcca agggcatttg 1501 gagagctcca gaaatgaccc ttttctccac cccgactttt gctgcctcct ccccctgggc 1561 ttctatgtct tgggtctctt ctggctgctc tttgcctctg tggtcctcat cctgctgctg 1621 agctgggttg ggcatgtgaa accacaggcc ctggactctg gccaaggtgc tgctctgacc 1681 acagccacac aaaccacaca cctggagctg cagaggggac ggcaagtgac agtgccccgg 1741 gcctggctgc tcttccttcg aggttcgctt cccactttcc gctccagcct cttcctgtgg 1801 gtacggccta atggccgtgt ggggcctcta gtggcaggaa ggaggccctc agctctgagt 1861 cagggtcgtg gtcaggacct gctgagcaca gtgagcatta ggtactctgg ccacagcctc 1921 tgagggtggg aggtttgggg accttgagag aagagcctgt gggctctcct attggaatct 1981 agttgggggt tggaggggta aggaacacag ggtgataggg gaggggtctt agttcctttt 2041 tctgtatcag aagccctgtc ttcacaacac aggcacacaa tttcagtccc agccaaagca 2101 gaaggggtaa tgacatggac ttggcggggg gacaagacaa agctcccgat gctgcatggg 2161 gcgctgccag atctcacggt gaaccatttt ggcagaatac agcatggttc ccacatgcat 2221 ttatgcacag aagaaaatct ggaaagtgat ttatcaggat gtgagcactc gttgtgtctg 2281 gatgttacaa atatgggtgg ttttattttc tttttccctg tttagcattt tctagttttc 2341 ttatcaggat gtgagcactc gttgtgtctg gatgttacaa atatgggtgg ttttattttc 2401 tttttccctg tttagcattt tctagttttc cactattatt gtatattatc tgtataataa 2461 aaaataattt tagggttggg // LOCUS HUMGPIH 1394 bp mRNA PRI 06-DEC-1993 DEFINITION Human GPI-H mRNA, complete cds. ACCESSION L19783 NID g404725 KEYWORDS glycosyl phosphatidylinositol. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1394) AUTHORS Kamitani,T., Chang,H.M., Rollins,C., Waneck,G.L. and Yeh,E.T. TITLE Correction of the class H defect in glycosylphosphatidylinositol anchor biosynthesis in Ltk- cells by a human cDNA clone JOURNAL J. Biol. Chem. 268, 20733-20736 (1993) MEDLINE 94012603 FEATURES Location/Qualifiers source 1..1394 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="Ltk(-)" /tissue_type="placenta" /tissue_lib="Invitrogen #A900-11" gene 55..1382 /gene="GPI-H" misc_feature 55..64 /gene="GPI-H" CDS 61..627 /gene="GPI-H" /codon_start=1 /function="synthesis of GPI anchor precursor" /db_xref="PID:g404726" /translation="MEDERSFSDICGGRLALQRRYYSPSCREFCLSCPRLSLRSLTAV TCTVWLAAYGLFTLCENSMILSAAIFITLLGLLGYLHFVKIDQETLLIIDSLGIQMTS SYASGKESTTFIEMGKVKDIVINEAIYMQKVIYYLCILLKDPVEPHGISQVVPVFQSA KPRLDCLIEVYRSCQEILAHQKATSTSP" polyA_signal 1377..1382 /gene="GPI-H" BASE COUNT 374 a 321 c 323 g 376 t ORIGIN 1 gaccagggcg ggcgagcgca gtgcagcgcc gcgcggtgcg ggcggccgag tgggggcgtc 61 atggaggatg agcggagctt ttcggatatc tgcggcggcc gcctggcgct gcagcgccgc 121 tactactccc cgtcctgccg ggaattctgc ctcagctgcc ctcggctctc gctgcgttcg 181 ctcaccgctg tcacctgcac ggtgtggctg gcggcctacg gactcttcac cctctgcgag 241 aacagcatga tcctctctgc tgccatcttc atcaccctct taggtctgct tggttatctc 301 cattttgtga agattgatca ggagactctg ttaatcattg attcccttgg cattcagatg 361 acttcatctt atgcttcagg caaagaaagc actaccttca tagaaatggg caaggtcaag 421 gatattgtca tcaatgaggc catttacatg cagaaggtga tttactacct ctgcatctta 481 ttgaaagatc cagtggaacc acatgggata tcccaagtag tacccgtctt ccagagtgcc 541 aagccccggc tggactgctt gattgaagta tacaggagct gccaggagat cctggcacac 601 cagaaagcca catcaacaag cccatgagcc ccagcgttca gaaggccagc attgtcttcc 661 atgggagatg actcttaagc cataggggct ggttttccgt actccaaacc atcaggtgga 721 cacagtccta ggaaccatta tggatgtagt gcatcttaga gccatagagc aggtgactgg 781 aaatctaact cagtatattt ccgtgtatta tagtgttttc cttgtaaggt tttgcctact 841 ttaccaaagg aggggagacc ttaagaattt tgacacagta tgtcaaaagt aatgtcagca 901 aagaacactt tcggaaattg ctaagcatgt tcaggtttac tttgttgatg tttgtgaagc 961 aaaacaatgg gaaactgaca tcagatgtct tgagaaagct atattttcca atagtacccc 1021 ttgtgtaaag ttacgaaaaa aaacaagccc ttcagtactg gttagcagga agaaatgtcc 1081 tacaaaagac gagtctgtca acctagtgcc tgtgttctgt acagggctta tttatttact 1141 gttaataaac agatcttcca aaacacacag ttacctggtt cacagatagc tgctttttat 1201 tgacaaacaa gatcatagta tttcagtcat ggtgtcaagc acattaatac ttttctgtta 1261 ctcagcattt ttttttatat cagttcaaaa gctacttttg ccaagaagga tgagacaaaa 1321 ggccagacag aaattggact ttggttttca gacatgggac tctcatgtta acaaagaata 1381 aataagcaaa tatt // LOCUS HUMGPIIBA 3333 bp mRNA PRI 11-JUN-1993 DEFINITION Human platelet glycoprotein IIb (GPIIb) mRNA, complete cds. ACCESSION M34480 NID g183510 KEYWORDS platelet glycoprotein IIb. SOURCE Human megakaryocytes, cDNA to mRNA, clone IIb[3,4]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3333) AUTHORS Frachet,P., Uzan,G., Thevenon,D., Denarier,E., Prandini,M.H. and Marguerie,G. TITLE GPIIb and GPIIIa amino acid sequences deduced from human megakaryocyte cDNAs JOURNAL Mol. Biol. Rep. 14, 27-33 (1990) MEDLINE 90265363 FEATURES Location/Qualifiers source 1..3333 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA 1..3333 /note="GPIIb mRNA" CDS 33..3152 /note="platelet glycoprotein IIb (GPIIb)" /codon_start=1 /db_xref="PID:g306794" /translation="MARALCPLQALWLLEWVLLLLGACAAPPAWALNLDPVQLTFYAG PNGSQFGFSLDFHKDSHGRVAIVVGAPRTLGPSQEETGGVFLCPWRAEGGQCPSLLFD LRDETRNVGSQTLQTFKARQGLGASVVSWSDVIVACAPWQHWNVLEKTEEAEKTPVGS CFLAQPESGRRAEYSPCRGNTLSRIYVENDFSWDKRYCEAGFSSVVTQAGELVLGAPG GYYFLGLLAQAPVADIFSSYRPGILLWHVSSQSLSFDSSNPEYFDGYWGYSVAVGEFD GDLNTTEYVVGAPTWSWTLGAVEILDSYYQRLHRLRAEQMASYFGHSVAVTDVNGDGR HDLLVGAPLYMDSRADRKLAEVGRVYLFLQPRGPHALGAPSLLLTGTQLYGRFGSAIA PLGDLDRDGYNDIAVAAPYGGPSGRGQVLVFLGQSEGLRSRPSQVLDSPFPTGSAFGF SLRGAVDIDDNGYPDLIVGAYGANQVAVYRAQPVVKASVQLLVQDSLNPAVKSCVLPQ TKTPVSCFNIQMCVGATGHNIPQKLSLNAELQLDRQKPRQGRRVLLLGSQQAGTTLDL DLGGKHSPICHTTMAFLRDEADFRDKLSPIVLSLNVSLPPTEAGMAPAVVLHGDTHVQ EQTRIVLDCGEDDVCVPQLQLTASVTGSPLLVGADNVLELQMDAANEGEGAYEAELAV HLPQGAHYMRALSNVEGFERLICNQKKENETRVVLCELGNPMKKNAQIGIAMLVSVGN LEEAGESVSFQLQIRSKNSQNPNSKIVLLDVPVRAEAQVELRGNSFPASLVVAAEEGE REQNSLDSWGPKVEHTYELHNNGPGTVNGLHLSIHLPGQSQPSDLLYILDIQPQGGLQ CFPQPPVNPLKVDWGLPIPSPSPIHPAHHKRDRRQIFLPEPEQPSRLQDPVLVSCDSA PCTVVQCDLQEMARGQRAMVTVLAFLWLPSLYQRPLDQFVLQSHAWFNVSSLPYAVPP LSLPRGEAQVWTQLLRALEERAIPIWWVLVGVLGGLLLLTILVLAMWKVGFFKRNRHT LEEDDEEGE" BASE COUNT 626 a 998 c 1040 g 669 t ORIGIN 1 attcctgcct gggaggttgt ggaagaagga agatggccag agctttgtgt ccactgcaag 61 ccctctggct tctggagtgg gtgctgctgc tcttgggagc ttgtgctgcc cctccagcct 121 gggccttgaa cctggaccca gtgcagctca ccttctatgc aggccccaat ggcagccagt 181 ttggattttc actggacttc cacaaggaca gccatgggag agtggccatc gtggtgggcg 241 ccccgcggac cctgggcccc agccaggagg agacgggcgg cgtgttcctg tgcccctgga 301 gggccgaggg cggccagtgc ccctcgctgc tctttgacct ccgtgatgag acccgaaatg 361 taggctccca aactttacaa accttcaagg cccgccaagg actgggggcg tcggtcgtca 421 gctggagcga cgtcattgtg gcctgcgccc cctggcagca ctggaacgtc ctagaaaaga 481 ctgaggaggc tgagaagacg cccgtaggta gctgcttttt ggctcagcca gagagcggcc 541 gccgcgccga gtactccccc tgtcgcggga acaccctgag ccgcatttac gtggaaaatg 601 attttagctg ggacaagcgt tactgtgaag cgggcttcag ctcggtggtc actcaggccg 661 gagagctggt gcttggggct cctggcggct attatttctt aggtctcctg gcccaggctc 721 cagttgcgga tattttctcg agttaccgcc caggcatcct tttgtggcac gtgtcctccc 781 agagcctctc ctttgactcc agcaacccag agtacttcga cggctactgg gggtactcgg 841 tggccgtggg cgagttcgac ggggatctca acactacaga atatgtcgtc ggtgccccca 901 cttggagctg gaccctggga gcggtggaaa ttttggattc ctactaccag aggctgcatc 961 ggctgcgcgc agagcagatg gcgtcgtatt ttgggcattc agtcgctgtc actgacgtca 1021 acggggatgg gaggcatgat ctgctggtgg gcgctccact gtatatggac agccgggcag 1081 accgaaaact ggccgaagtg gggcgtgtgt atttgttcct gcagccgcga ggcccccacg 1141 cgctgggtgc ccccagcctc ctgctgactg gcacacagct ctatgggcga ttcggctctg 1201 ccatcgcacc cctgggcgac ctcgaccggg atggctacaa tgacattgca gtggctgccc 1261 cctacggggg tcccagtggc cggggccaag tgctggtgtt cctgggtcag agtgaggggc 1321 tgaggtcacg tccctcccag gtcctggaca gccccttccc cacaggctct gcctttggct 1381 tctcccttcg aggtgccgta gacatcgatg acaacggata cccagacctg atcgtgggag 1441 cttacggggc caaccaggtg gctgtgtaca gagctcagcc agtggtgaag gcctctgtcc 1501 agctactggt gcaagattca ctgaatcctg ctgtgaagag ctgtgtccta cctcagacca 1561 agacacccgt gagctgcttc aacatccaga tgtgtgttgg agccactggg cacaacattc 1621 ctcagaagct atccctaaat gccgagctgc agctggaccg gcagaagccc cgccagggcc 1681 ggcgggtgct gctgctgggc tctcaacagg caggcaccac cctggacctg gatctgggcg 1741 gaaagcacag ccccatctgc cacaccacca tggccttcct tcgagatgag gcagacttcc 1801 gggacaagct gagccccatt gtgctcagcc tcaatgtgtc cctaccgccc acggaggctg 1861 gaatggcccc tgctgtcgtg ctgcatggag acacccatgt gcaggagcag acacgaatcg 1921 tcctggactg tggggaagat gacgtatgtg tgccccagct tcagctcact gccagcgtga 1981 cgggctcccc gctcctagtt ggggcagata atgtcctgga gctgcagatg gacgcagcca 2041 acgagggcga gggggcctat gaagcagagc tggcggtgca cctgccccag ggcgcccact 2101 acatgcgggc cctaagcaat gtcgagggct ttgagagact catctgtaat cagaagaagg 2161 agaatgagac cagggtggtg ctgtgtgagc tgggcaaccc catgaagaag aacgcccaga 2221 taggaatcgc gatgttggtg agcgtgggga atctggaaga ggctggggag tctgtgtcct 2281 tccagctgca gatacggagc aagaacagcc agaatccaaa cagcaagatt gtgctgctgg 2341 acgtgccggt ccgggcagag gcccaagtgg agctgcgagg gaactccttt ccagcctccc 2401 tggtggtggc agcagaagaa ggtgagaggg agcagaacag cttggacagc tggggaccca 2461 aagtggagca cacctatgag ctccacaaca atggccctgg gactgtgaat ggtcttcacc 2521 tcagcatcca ccttccggga cagtcccagc cctccgacct gctctacatc ctggatatac 2581 agccccaggg gggccttcag tgcttcccac agcctcctgt caaccctctc aaggtggact 2641 gggggctgcc catccccagc ccctccccca ttcacccggc ccatcacaag cgggatcgca 2701 gacagatctt cctgccagag cccgagcagc cctcgaggct tcaggatcca gttctcgtaa 2761 gctgcgactc ggcgccctgt actgtggtgc agtgtgacct gcaggagatg gcgcgcgggc 2821 agcgggccat ggtcacggtg ctggccttcc tgtggctgcc cagcctctac cagaggcctc 2881 tggatcagtt tgtgctgcag tcgcacgcat ggttcaacgt gtcctccctc ccctatgcgg 2941 tgcccccgct cagcctgccc cgaggggaag ctcaggtgtg gacacagctg ctccgggcct 3001 tggaggagag ggccattcca atctggtggg tgctggtggg tgtgctgggt ggcctgctgc 3061 tgctcaccat cctggtcctg gccatgtgga aggtcggctt cttcaagcgg aaccggcaca 3121 ccctggaaga agatgatgaa gagggggagt gatggtgcag cctacactat tctagcagga 3181 gggttgggcg tgctacctgc accgcccctt ctccaacaag ttgcctccaa gctttgggtt 3241 ggagctgttc cattgggtcc tcttggtgtc gtttccctcc caacagagct gggctacccc 3301 ccctcctgct gcctaataaa gagactgagc cct // LOCUS HUMGRB14R 2376 bp mRNA PRI 17-JUN-1996 DEFINITION Homo sapiens Grb14 mRNA, complete cds. ACCESSION L76687 NID g1369836 KEYWORDS . SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2376) AUTHORS Daly,R.J., Sanderson,G.M., Janes,P.W. and Sutherland,R.L. TITLE Cloning and characterization of GRB14, a novel member of the GRB7 gene family JOURNAL J. Biol. Chem. 271 (21), 12502-12510 (1996) MEDLINE 96218175 FEATURES Location/Qualifiers source 1..2376 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA 1..2376 /gene="Grb14" gene 1..2376 /gene="Grb14" 5'UTR 1..540 /gene="Grb14" CDS 541..2163 /gene="Grb14" /codon_start=1 /db_xref="PID:g1369837" /translation="MTTSLQDGQSAASRAAARDSPLAAQVCGAAQGRGDAHDLAPAPW LHARALLPLPDGTRGCAADRRKKKDLDVPEMPSIPNPFPELCCSPITSVLSADLFPKA NSRKKQVIKVYSEDETSRALDVPSDITARDVCQLLILKNHYIDDHSWTLFEHLPHIGV ERTIEDHELVIEVLSNWGIEEENKLYFRKNYAKYEFFKNPMYFFPEHMVSFATETNGE ISPTQILQMFLSSSTYPEIHGFLHAKEQGKKSWKKIYFFLRRSGLYFSTKGTSKEPRH LQFFSEFGNSDIYVSLAGKKKHGAPTNYGFCFKPNKAGGPRDLKMLCAEEEQSRTCWV TAIRLLKYGMQLYQNYMHPYQGRSGCSSQSISPMRSISENSLVAMDFSGQKSRVIENP TEALSVAVEEGLAWRKKGCLRLGTHGSPTASSQSSATNMAIHRSQPWFHHKISRDEAQ RLIIQQGLVDGVFLVRDSQSNPKTFVLSMSHGQKIKHFQIIPVEDDGEMFHTLDDGHT RFTDLIQLVEFYQLNKGVLPCKLKHYCARIAL" 3'UTR 2164..2376 /gene="Grb14" BASE COUNT 631 a 652 c 583 g 510 t ORIGIN 1 cggatgaggg tcagggctgc gcggacccct atcccgcctg cgtcctcccg gcaagcccag 61 cgggagcgcc cgctcggctg ggtccccgcc tccagcgcgc cggggccgcc cagaccctgg 121 gctcagcctc gcgccccggt gcccacctga ggaggcggcg gtcccggcct cgcgtcccgg 181 atgggacggc gcgggagcaa tgccagtggc cccgagcgcc ccgggccacg cgcggggccg 241 gccagccgct ctcgcgccct ccccgccccc tccgcgcctt gcctcgccgc ccgcgcgccc 301 cacccaccgg ccgctcctcc cctctcccca ccctcctcct ccgccccctc ccctcccccg 361 ccgcctcgca gatagctcgg ccgcgcgtct cagccgccgg ggccccgagc gcaggcggcg 421 aggccaccac acctgcagag cgctcgggct gcctaggcgg cacctcgcct cccgccgcgc 481 aaaccccttc tccccacgcg ccgagtctcc catgacgccc gagccccccg gccggcgaca 541 atgaccactt ccctgcaaga tgggcagagc gccgcgagca gggcggctgc ccgggattcg 601 ccgctggccg cccaggtgtg tggcgctgcc caggggaggg gcgacgccca cgacctggcg 661 ccggccccct ggctgcacgc gcgagcgctc ctgccccttc cggacgggac ccgcggctgt 721 gctgcagaca ggagaaaaaa gaaagatctt gatgttccgg aaatgccatc tattccaaac 781 ccttttcctg agctatgctg ttctccaatt acatctgtgt tgtcagcaga cctatttccc 841 aaagcaaatt caaggaaaaa acaggtgatt aaagtataca gtgaagatga aaccagcagg 901 gctttagatg tacccagtga cataacggct cgagatgttt gtcagctgtt gatcctgaag 961 aatcattaca ttgatgacca cagctggacc ctttttgagc acctgcctca cataggtgta 1021 gaaagaacaa tagaagacca cgaactggtg attgaagtgc tatccaactg ggggatagaa 1081 gaagaaaaca aactatactt tagaaaaaat tatgccaaat atgagttctt taaaaaccca 1141 atgtattttt ttccagagca tatggtatct tttgcaactg aaaccaatgg tgaaatatcc 1201 cccacacaga ttttgcagat gtttctgagt tcaagcacat atcctgaaat tcatggtttc 1261 ttacatgcga aagaacaggg aaagaagtct tggaaaaaaa tttacttttt tctaagaaga 1321 tctggtttat atttttctac taaaggaaca tcaaaggaac cgcggcattt gcagtttttc 1381 agcgaatttg gcaatagtga tatttatgtg tcactggcag gcaaaaaaaa acatggagca 1441 ccgactaact atggattctg ctttaagcct aacaaagcgg gagggccccg agacctgaaa 1501 atgctctgtg cagaagaaga gcagagtagg acgtgctggg tgaccgcgat tagattgctt 1561 aagtatggca tgcagctgta ccagaattat atgcatccat atcaaggtag aagtggctgc 1621 agttcacaga gcatatcacc tatgagaagt atatcagaga attccctggt agcaatggac 1681 ttctcaggcc agaaaagcag agttatagaa aatcccactg aagccctttc agttgcggtt 1741 gaagaaggac tcgcttggag gaaaaaagga tgtttacgcc tgggcactca cggtagcccc 1801 actgcctctt cacagagctc tgccacaaac atggctatcc accggtccca gccatggttt 1861 caccacaaaa tttctagaga tgaggctcag cgattgatta ttcagcaagg acttgtggat 1921 ggagttttct tggtacggga tagtcagagt aaccccaaaa ctttcgtact gtcaatgagt 1981 catggacaaa aaataaagca ctttcaaatt ataccagtag aagatgacgg tgaaatgttc 2041 cacacactgg atgatggcca cacaagattt acagatctaa tacagctggt ggagttctat 2101 caactcaata agggcgttct tccttgcaag ttgaaacatt attgtgctag gattgctctc 2161 tagacaagcc agaagtgact tattaaacta ttgaaggaaa aggactcaag aaaaataata 2221 aaagaccata aataagggcg aaaacattat catgtgaaaa gaatgtattt cacctgcaag 2281 ttacaaaaaa atagtttgtg cattgcaaat aagcaaagac ttggattgac tttacattca 2341 tcatttaaaa ttcattagtt aaaattaaac cttagg // LOCUS HUMGRB7 2205 bp mRNA PRI 17-APR-1997 DEFINITION Human squamous cell carcinama of esophagus mRNA for GRB-7 SH2 domain protein, complete cds. ACCESSION D43772 NID g601890 KEYWORDS GRB-7 SH2 domain protein; growth factor receptor-bound protein 7. SOURCE Homo sapiens squamous cell carcinama of esophagus cell-line TE 6 cDNA to mRNA, clone GRB-7. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2205) AUTHORS Kishi,T. TITLE Direct Submission JOURNAL Submitted (08-DEC-1994) to the DDBJ/EMBL/GenBank databases. Tatsuya Kishi, National Cancer Center Research Institute, Genetics Division; 5-1-1, Tsukiji, Chuo-ku, Tokyo 104, Japan (Tel:03-3542-2511(ex.4400), Fax:03-3541-2685) REFERENCE 2 (bases 1 to 2205) AUTHORS Kishi,T., Sasaki,H., Akiyama,N., Hosokawa,K., Sakamoto,H., Aizawa,S., Sugimura,T. and Terada,M. TITLE Moleucular cloning of human GRB-7 located in the c-ERBB-2 amplification unit, and amplification frequency of the five genes lying close to c-ERBB-2 in gastric cancer JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Kishi,T., Sasaki,H., Akiyama,N., Ishizuka,T., Sakamoto,H., Aizawa,S., Sugimura,T. and Terada,M. TITLE Molecular cloning of human GRB-7 co-amplified with CAB1 and c-ERBB-2 in primary gastric cancer JOURNAL Biochem. Biophys. Res. Commun. 232 (1), 5-9 (1997) MEDLINE 97236270 FEATURES Location/Qualifiers source 1..2205 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="TE 6" /clone="GRB-7" /tissue_type="squamous cell carcinama of esophagus" gene 220..1818 /gene="GRB-7" CDS 220..1818 /gene="GRB-7" /standard_name="growth factor receptor-bound protein 7" /codon_start=1 /product="GRB-7 SH2 domain protein" /db_xref="PID:d1008413" /db_xref="PID:g601891" /translation="MELDLSPPHLSSSPEDLWPAPGTPPGTPRPPDTPLPEEVKRSQP LLIPTTGRKLREEERRATSLPSIPNPFPELCSPPSQSPILGGPSSARGLLPRDASRPH VVKVYSEDGACRSVEVAAGATARHVCEMLVQRAHALSDETWGLVECHPHLALERGLED HESVVEVQAAWPVGGDSRFVFRKNFAKYELFKSSPHSLFPEKMVSSCLDAHTGISHED LIQNFLNAGSFPEIQGFLQLRGSGRKLWKRFFCFLRRSGLYYSTKGTSKDPRHLQYVA DVNESNVYVVTQGRKLYGMPTDFGFCVKPNKLRNGHKGLRIFCSEDEQSRTCWLAAFR LFKYGVQLYKNYQQAQSRHLHPSCLGSPPLRSASDNTLVAMDFSGHAGRVIENPREAL SVALEEAQAWRKKTNHRLSLPMPASGTSLSAAIHRTQLWFHGRISREESQRLIGQQGL VDGLFLVRESQRNPQGFVLSLCHLQKVKHYLILPSEEEGRLYFSMDDGQTRFTDLLQL VEFHQLNRGILPCLLRHCCTRVAL" polyA_signal 2184..2189 BASE COUNT 421 a 708 c 619 g 457 t ORIGIN Chromosome 17q11-q12. 1 cacagggctc ccccccgcct ctgacttctc tgtccgaagt cgggacaccc tcctaccacc 61 tgtagagaag cgggagtgga tctgaaataa aatccaggaa tctgggggtt cctagacgga 121 gccagacttc ggaacgggtg tcctgctact cctgctgggg ctcctccagg acaagggcac 181 acaactggtt ccgttaagcc cctctctcgc tcagacgcca tggagctgga tctgtctcca 241 cctcatctta gcagctctcc ggaagacctt tggccagccc ctgggacccc tcctgggact 301 ccccggcccc ctgatacccc tctgcctgag gaggtaaaga ggtcccagcc tctcctcatc 361 ccaaccaccg gcaggaaact tcgagaggag gagaggcgtg ccacctccct cccctctatc 421 cccaacccct tccctgagct ctgcagtcct ccctcacaga gcccaattct cgggggcccc 481 tccagtgcaa gggggctgct cccccgcgat gccagccgcc cccatgtagt aaaggtgtac 541 agtgaggatg gggcctgcag gtctgtggag gtggcagcag gtgccacagc tcgccacgtg 601 tgtgaaatgc tggtgcagcg agctcacgcc ttgagcgacg agacctgggg gctggtggag 661 tgccaccccc acctagcact ggagcggggt ttggaggacc acgagtccgt ggtggaagtg 721 caggctgcct ggcccgtggg cggagatagc cgcttcgtct tccggaaaaa cttcgccaag 781 tacgaactgt tcaagagctc cccacactcc ctgttcccag aaaaaatggt ctccagctgt 841 ctcgatgcac acactggtat atcccatgaa gacctcatcc agaacttcct gaatgctggc 901 agctttcctg agatccaggg ctttctgcag ctgcggggtt caggacggaa gctttggaaa 961 cgctttttct gtttcttgcg ccgatctggc ctctattact ccaccaaggg cacctctaag 1021 gatccgaggc acctgcagta cgtggcagat gtgaacgagt ccaacgtgta cgtggtgacg 1081 cagggccgca agctctacgg gatgcccact gacttcggtt tctgtgtcaa gcccaacaag 1141 cttcgaaatg gacacaaggg gcttcggatc ttctgcagtg aagatgagca gagccgcacc 1201 tgctggctgg ctgccttccg cctcttcaag tacggggtgc agctgtacaa gaattaccag 1261 caggcacagt ctcgccatct gcatccatct tgtttgggct ccccaccctt gagaagtgcc 1321 tcagataata ccctggtggc catggacttc tctggccatg ctgggcgtgt cattgagaac 1381 ccccgggagg ctctgagtgt ggccctggag gaggcccagg cctggaggaa gaagacaaac 1441 caccgcctca gcctgcccat gccagcctcc ggcacgagcc tcagtgcagc catccaccgc 1501 acccaactct ggttccacgg gcgcatttcc cgtgaggaga gccagcggct tattggacag 1561 cagggcttgg tagacggcct gttcctggtc cgggagagtc agcggaaccc ccagggcttt 1621 gtcctctctt tgtgccacct gcagaaagtg aagcattatc tcatcctgcc gagcgaggag 1681 gagggtcgcc tgtacttcag catggatgat ggccagaccc gcttcactga cctgctgcag 1741 ctcgtggagt tccaccagct gaaccgcggc atcctgccgt gcttgctgcg ccattgctgc 1801 acgcgggtgg ccctctgacc aggccgtgga ctggctcatg cctcagcccg ccttcaggct 1861 gcccgccgcc cctccaccca tccagtggac tctggggcgc ggccacaggg gacgggatga 1921 ggagcgggag ggttccgcca ctccagtttt ctcctctgct tctttgcctc cctcagatag 1981 aaaacagccc ccactccagt ccactcctga cccctctcct caagggaagg ccttgggtgg 2041 ccccctctcc ttctcctagc tctggaggtg ctgctctagg gcagggaatt atgggagaag 2101 tgggggcagc ccaggcggtt tcacgcccca cactttgtac agaccgagag gccagttgat 2161 ctgctctgtt ttatactagt gacaataaag attatttttt gatac // LOCUS HUMGRF1A 3233 bp mRNA PRI 31-DEC-1994 DEFINITION Human glucocorticoid receptor repression factor 1 (GRF-1) mRNA, complete cds. ACCESSION M73077 NID g183617 KEYWORDS glucocorticoid receptor repression factor 1. SOURCE Homo sapiens (tissue library: lambda-GT11, lambda-GT10) breast cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3233) AUTHORS LeClerc,S., Palaniswami,R., Xie,B.X. and Govindan,M.V. TITLE Molecular cloning and characterization of a factor that binds the human glucocorticoid receptor gene and represses its expression JOURNAL J. Biol. Chem. 266 (26), 17333-17340 (1991) MEDLINE 91373352 FEATURES Location/Qualifiers source 1..3233 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="MCF-7" /tissue_type="breast" /tissue_lib="lambda-GT11, lambda-GT10" gene 34..2541 /gene="GRF-1" CDS 34..2541 /gene="GRF-1" /codon_start=1 /product="glucocorticoid receptor repression factor 1" /db_xref="PID:g183618" /translation="MDATSHIDNMENERIPFDLMDTVPAEALYEAHLEKLRNERKRVE MRRAFKENLETSPFITPGKPWEEARSFIMNEDFYQWLEESVYTDIYGKHQKQIIDKAK EEFQELLLEYSELFYELELDAKPSKEKMGVIQDVLGEEQRFKAIYKSSKQSVDALILK HIHFVYHPTKETCPSCPACVDAKIEHLISSRFIRPSDRNQKNSLSDPNIDRINLVILG KDALPESWPMEIRALCTNDDKYVIDGKMYELSLRPIEGNVRLPVNSFQTPTFQPHGCL CLYNSKESLSYVVESIEKSRESTLGRRDNHLVHLPLTLILVNKRGDTSGETLHSLIQQ GQQIASKLQCVFLDPASAGIGYGRNINEKQISQVLKGLLDSKRNLNLVSSTASIKDLA DVDLRIVMCLMCGDPFSADDILFPVLQSQTCKSSHCGSNNSVLLELPIGLHKKRIELS VLSYHSSFSIRKSRLVHGYIVFYSAKRKASLAMLRAFLCEVQDIIPIQLVALTDGAVD VLDNDLSREQLTEGEEIAQEIDGRFTSIPCSQPQHKLEIFHPFFKDVVEKKNIIEATH MYDNAAEACSTTEEVFNSPRAGSPLCNSNLQDSEEDIEPSYSLFREDTSLPSLSKDHS KLSMELEGNDGLSFIMSNFESKLNNKVPPPVKPKPPVHFEITKGDLSYLDQGHRDGQR KSVSSSPWLPQDGFDPSDYAEPMDAVVKPRNEEENIYSVPHDSTQGKIITIRNINKAQ SNGSGNGSDSEMDTSSLERGRKVSIVSKPVLYRTRCTRLGGLLVTGPASAWGVMMSWG PSGRKRRIRHPRVIKGTMLSFHTKQTKTRGGGIFFAA" BASE COUNT 929 a 751 c 771 g 782 t ORIGIN 1 cttgaagtgg tttgttgtgc ttgaagagac cccatggatg ccaccagtca cattgacaac 61 atggaaaacg aacggattcc ctttgattta atggataccg tccctgcaga ggcactatac 121 gaggcccact tagagaagct gaggaacgaa aggaaaagag ttgagatgcg aagggcgttt 181 aaagaaaacc tggagacttc tcctttcata actcccggaa agccttggga agaggcccgt 241 agttttatta tgaatgagga tttctaccag tggctggagg aatctgtata cacggatatt 301 tatggcaaac accaaaagca aattatagat aaagcaaagg aagaatttca ggagttgctt 361 ttggaatatt cagaattgtt ttatgaactg gagctggatg ctaagcccag caaggagaag 421 atgggtgtta ttcaggatgt tctgggagag gaacagcgat ttaaagccat ttacaaaagc 481 tccaagcaga gcgttgatgc ccttattctg aaacacattc attttgtgta ccacccaaca 541 aaggagacat gccccagctg cccagcttgt gtggacgcta agattgagca cttgattagt 601 tctcggttta tccggccgtc tgaccggaat cagaaaaatt cactctctga ccctaacatt 661 gatagaatca acttggttat attgggcaaa gacgccttgc ccgagagttg gccaatggag 721 attagagctc tttgtacaaa tgatgacaag tatgtgatag atggtaaaat gtatgagctt 781 tccctgaggc caatagaggg gaatgtcagg cttcctgtga actctttcca gacgccaaca 841 tttcagcccc acggctgtct ctgcctttac aattcaaagg aatcgctatc ctatgtagtg 901 gaaagtatag agaagagtag agagtccacg ctgggccggc gggataatca tttagtccat 961 ctccccctta cattaatttt ggttaacaag agaggagaca ccagtggaga gactctgcat 1021 agcttaatac agcaaggtca acaaattgct agcaaacttc agtgtgtctt tctcgaccct 1081 gcttctgctg gcattggtta cggacgcaac attaatgaaa agcaaatcag tcaagttttg 1141 aagggactcc tggactctaa gcgtaactta aacctggtca gttctactgc tagcatcaaa 1201 gatttggctg atgttgatct gcgaattgtt atgtgtctga tgtgtggaga tccttttagt 1261 gcagatgata tactttttcc tgtccttcag tcccaaacct gtaaatcttc ccattgtgga 1321 agcaacaact ctgttttact tgaactacca atcggactgc acaagaagcg gattgaactg 1381 tctgttcttt cataccattc ctcctttagc atcagaaaga gccggttggt tcatgggtac 1441 attgtttttt attcagccaa acgtaaggcc tctttggcta tgttacgtgc ctttctttgt 1501 gaagtgcagg atattatccc tattcagctt gtagcactca ctgatggcgc tgtagatgtc 1561 ctggacaatg acttaagtag ggaacagcta actgaggggg aggagattgc tcaagaaatt 1621 gacggaaggt tcacaagcat cccctgtagc caaccccagc ataaacttga gatctttcac 1681 ccatttttta aagatgtggt ggaaaaaaag aacataatcg aggctactca tatgtacgat 1741 aatgctgccg aggcctgtag caccaccgaa gaggtgttta actccccccg ggcaggatca 1801 ccgctctgca actcaaacct gcaggattca gaagaagata tcgagccatc ttacagcctg 1861 tttcgagaag acacatcact gccttctctg tccaaagacc attctaagct ctctatggaa 1921 ctggagggaa atgatgggct gtctttcatt atgagcaatt ttgagagtaa actgaacaac 1981 aaagtacctc cgccagtcaa accaaagcct cctgtccatt ttgaaattac aaagggggat 2041 ctatcttatt tagaccaagg ccatagggat ggacagagga agtctgtgtc ttctagcccc 2101 tggctgcctc aggatgggtt tgatccttct gactatgctg aacccatgga tgctgtggtg 2161 aagccaagga atgaagaaga aaacatatac tccgtgcccc atgacagcac ccaaggcaaa 2221 atcatcacca ttcggaatat caacaaagcc cagtccaacg gcagcgggaa tggttctgac 2281 agtgaaatgg acaccagctc tctagagcga gggcgcaagg tttccatcgt gagcaagcca 2341 gtgctgtaca ggacgagatg cacccggctg ggcggtttgc tagttaccgg accagcttca 2401 gcgtggggag tgatgatgag ctggggccca tccggaagaa agaggaggat caggcatccc 2461 agggttataa aggggacaat gctgtcattc catacgaaac agacgaagac ccgcggagga 2521 ggaatattct tcgcagccta aggaggaaca ctaagaaacc aaagcccaaa ccccggccat 2581 ccatcacaaa ggccaacctg ggagagtaac tattttgggg tgcccttaac aactgtcgtg 2641 actccagaga agccgatccc catttttatt gaaagatgta ttgagtacat tgaagccaca 2701 ggactgagca cggaaggcat ctaccgggtc agcgggaaca agtctgagat ggagagtctg 2761 cagagacagt ttgatcaaga ccacaacctg gacctggcag agaaagactt tacggtgaat 2821 accgtggctg gtgccatgaa gagctttttc tcagaactgc ctgaccccct ggtccgtata 2881 acatgcagat cgacttggtg gaagcacaca aaatcaacga ccgggagcag aagttgcatg 2941 cccttaagga ggtattaaag aaatttccaa aggaaaacca cgaagtcttc aagtatgtca 3001 tctctcacct aaacaaggtc agccacaaca acaaggtgaa tctcatgacc agcgagaacc 3061 tctccatctg cttctggccc accttgatga gacctgattt cagcactatg gacgccctca 3121 cagccacgcg cacctaccag acaatcattg aactctttat ccagcagtgc cccttcttct 3181 tctacaatcg gcccatcacc gagcccccgg cgccaggccc agctcccgga att // LOCUS HUMGRK5A 2557 bp mRNA PRI 31-DEC-1994 DEFINITION Human G protein-coupled receptor kinase (GRK5) mRNA, complete cds. ACCESSION L15388 NID g306804 KEYWORDS G-protein coupled receptor kinase; G-protein receptor kinase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2557) AUTHORS Kunapuli,P. and Benovic,J.L. TITLE Cloning and expression of GRK5: a member of the G protein-coupled receptor kinase family JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90 (12), 5588-5592 (1993) MEDLINE 93296183 FEATURES Location/Qualifiers source 1..2557 /organism="Homo sapiens" /db_xref="taxon:9606" gene 221..1993 /gene="GRK5" CDS 221..1993 /gene="GRK5" /codon_start=1 /product="G protein-coupled receptor kinase" /db_xref="PID:g306805" /translation="MELENIVANTVLLKAREGGGGKRKGKSKKWKEILKFPHISQCED LRRTIDRDYCSLCDKQPIGRLLFRQFCETRPGLECYIQFLDSVAEYEVTPDEKLGEKG KEIMTKYLTPKSPVFIAQVGQDLVSQTEEKLLQKPCKELFSACAQSVHEYLRGEPFHE YLDSMFFDRFLQWKWLERQPVTKNTFRQYRVLGKGGFGEVCACQVRATGKMYACKRLE KKRIKKRKGESMALNEKQILEKVNSQFVVNLAYAYETKDALCLVLTIMNGGDLKFHIY NMGNPGFEEERALFYAAEILCGLEDLHRENTVYRDLKPENILLDDYGHIRISDLGLAV KIPEGDLIRGRVGTVGYMAPEVLNNQRYGLSPDYWGLGCLIYEMIEGQSPFRGRKEKV KREEVDRRVLETEEVYSHKFSEEAKSICKMLLTKDAKQRLGCQEEGAAEVKRHPFFRN MNFKRLEAGMLDPPFVPDPRAVYCKDVLDIEQFSTVKGVNLDHTDDDFYSKFSTGSVS IPWQNEMIETECFKELNVFGPNGTLPPDLNRNHPPEPPKKGLLQRLFKRQHQNNSKSS PSSKTSFNHHINSNHVSSNSTGSS" BASE COUNT 666 a 639 c 726 g 526 t ORIGIN 1 cagagggagg aagaagcggc ggcgcggcgg cggcggctcc tctttgcaga gggggaaact 61 cttgggctga gagcaggaac aacgcggtag gcaaggcggg ctgctggctc ccccggctcc 121 ggcagcagcg gcggcagccc gagcagcggc agcagcagcg gcagcacccc aggcgctgac 181 agccccgccg gccggctccg ttgctgaccg ccgactgtca atggagctgg aaaacatcgt 241 ggccaacacg gtcttgctga aagccaggga agggggcgga ggaaagcgca aagggaaaag 301 caagaagtgg aaagaaatcc tgaagttccc tcacattagc cagtgtgaag acctccgaag 361 gaccatagac agagattact gcagtttatg tgacaagcag ccaatcggga ggctgctttt 421 ccggcagttt tgtgaaacca ggcctgggct ggagtgttac attcagttcc tggactccgt 481 ggcagaatat gaagttactc cagatgaaaa actgggagag aaagggaagg aaattatgac 541 caagtacctc accccaaagt cccctgtttt catagcccaa gttggccaag acctggtctc 601 ccagacggag gagaagctcc tacagaagcc gtgcaaagaa ctcttttctg cctgtgcaca 661 gtctgtccac gagtacctga ggggagaacc attccacgaa tatctggaca gcatgttttt 721 tgaccgcttt ctccagtgga agtggttgga aaggcaaccg gtgaccaaaa acactttcag 781 gcagtatcga gtgctaggaa aagggggctt cggggaggtc tgtgcctgcc aggttcgggc 841 cacgggtaaa atgtatgcct gcaagcgctt ggagaagaag aggatcaaaa agaggaaagg 901 ggagtccatg gccctcaatg agaagcagat cctcgagaag gtcaacagtc agtttgtggt 961 caacctggcc tatgcctacg agaccaagga tgcactgtgc ttggtcctga ccatcatgaa 1021 tgggggtgac ctgaagttcc acatctacaa catgggcaac cctggcttcg aggaggagcg 1081 ggccttgttt tatgcggcag agatcctctg cggcttagaa gacctccacc gtgagaacac 1141 cgtctaccga gatctgaaac ctgaaaacat cctgttagat gattatggcc acattaggat 1201 ctcagacctg ggcttggctg tgaagatccc cgagggagac ctgatccgcg gccgggtggg 1261 cactgttggc tacatggccc ccgaagtcct gaacaaccag aggtacggcc tgagccccga 1321 ctactggggc cttggctgcc tcatctatga gatgatcgag ggccagtcgc cgttccgcgg 1381 ccgtaaggag aaggtgaagc gggaggaggt ggaccgccgg gtcctggaga cggaggaggt 1441 gtactcccac aagttctccg aggaggccaa gtccatctgc aagatgctgc tcacgaaaga 1501 tgcgaagcag aggctgggct gccaggagga gggggctgca gaggtcaaga gacacccctt 1561 cttcaggaac atgaacttca agcgcttaga agccgggatg ttggaccctc ccttcgttcc 1621 agacccccgc gctgtgtact gtaaggacgt gctggacatc gagcagttct ccactgtgaa 1681 gggcgtcaat ctggaccaca cagacgacga cttctactcc aagttctcca cgggctctgt 1741 gtccatccca tggcaaaacg agatgataga aacagaatgc tttaaggagc tgaacgtgtt 1801 tggacctaat ggtaccctcc cgccagatct gaacagaaac caccctccgg aaccgcccaa 1861 gaaagggctg ctccagagac tcttcaagcg gcagcatcag aacaattcca agagttcgcc 1921 cagctccaag accagtttta accaccacat aaactcaaac catgtcagct cgaactccac 1981 cggaagcagc tagtttcggc tctggcctcc aagtccacag tggaaccagc ccagaccctt 2041 ctccttagaa gtggaagtag tggagcccct gctctggtgg ggctgccagg ggagaccccg 2101 ggagccggaa ggaggccgtc catcccgtcg acgtagaacc tcgaggtttc tcaaagaaat 2161 ttccactcag gtctgttttc cgaggcggcc ccgggcgggt ggattggatt tgtctttggt 2221 gaacattgca atagaaatcc aattggatac gacaacttgc acgtatttta atagcgtcat 2281 aactagaact gaattttgtc tttatgattt ttaaagaaaa gttttgtaaa tttctctact 2341 gtctcagttt acattttcgg tatatttgta tttaaatgaa gtgagacttt gagggtgtat 2401 attttctgtg cagccactgt taagccatgt gttccaaggc attttagcgg ggagggggtt 2461 atcaaaaaaa aaaaaaatgt gactcaagac ttccagagcc tcaaatgaga aaatgtcttt 2521 attaaatgta gaaagtgatc catacttcaa aaaaaaa // LOCUS HUMGRP5E 797 bp mRNA PRI 08-NOV-1994 DEFINITION Human gastrin-releasing peptide mRNA, complete cds. ACCESSION K02054 NID g183642 KEYWORDS gastrin-releasing peptide. SOURCE Human pulmonary carcinoid tumor, cDNA to mRNA, clone pB-1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 797; 1 to 797) AUTHORS Spindel,E.R., Chin,W.W., Price,J., Rees,L.H., Besser,G.M. and Habener,J.F. TITLE Cloning and characterization of cDNAs encoding human gastrin-releasing peptide JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81 (18), 5699-5703 (1984) MEDLINE 85014836 REFERENCE 2 (bases 263 to 485) AUTHORS Spindel,E.R., Zilberberg,M.D., Habener,J.F. and Chin,W.W. TITLE Two prohormones for gastrin-releasing peptide are encoded by two mRNAs differing by 19 nucleotides JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83 (1), 19-23 (1986) MEDLINE 86094341 COMMENT [2] revises [1]. GRP and amphibian bombesin have similar biological effects. They increase plasma immunoreactive levels of gastrin, pancreatic polypeptide, glucagon, gastrin inhibitory peptide and insulin. FEATURES Location/Qualifiers source 1..797 /organism="Homo sapiens" /db_xref="taxon:9606" /map="18q21" mRNA <1..797 /note="proGRP mRNA" sig_peptide 56..124 /gene="GRP" /note="gastrin releasing peptide signal pept" gene 56..502 /gene="GRP" CDS 56..502 /gene="GRP" /note="pre-progastrin releasing peptide" /codon_start=1 /db_xref="GDB:G00-119-284" /db_xref="PID:g306807" /translation="MRGSELPLVLLALVLCLAPRGRAVPLPAGGGTVLTKMYPRGNHW AVGHLMGKKSTGESSSVSERGSLKQQLREYIRWEEAARNLLGLIEAKENRNHQPPQPK ALGNQQPSWDSEDSSNFKDVGSKGKVGRLSAPGSQREGRNPQLNQQ" mat_peptide 125..205 /gene="GRP" /note="gastrin releasing peptide" BASE COUNT 208 a 200 c 205 g 184 t ORIGIN Chromosome 18q21; 215 bp upstream of HpaII site. 1 agtctctgct cttcccagcc tctccggcgc gctccaaggg cttcccgtcg ggaccatgcg 61 cggcagtgag ctcccgctgg tcctgctggc gctggtcctc tgcctagcgc cccgggggcg 121 agcggtcccg ctgcctgcgg gcggagggac cgtgctgacc aagatgtacc cgcgcggcaa 181 ccactgggcg gtggggcact taatggggaa aaagagcaca ggggagtctt cttctgtttc 241 tgagagaggg agcctgaagc agcagctgag agagtacatc aggtgggaag aagctgcaag 301 gaatttgctg ggtctcatag aagcaaagga gaacagaaac caccagccac ctcaacccaa 361 ggccttgggc aatcagcagc cttcgtggga ttcagaggat agcagcaact tcaaagatgt 421 aggttcaaaa ggcaaagttg gtagactctc tgctccaggt tctcaacgtg aaggaaggaa 481 cccccagctg aaccagcaat gataatgatg gcctctctca aaagagaaaa acaaaacccc 541 taagagactg agttctgcaa gcatcagttc tacggatcat caacaagatt tccttgtgca 601 aaatatttga ctattctgta tctttcatcc ttgactaaat tcgtgatttt caagcagcat 661 cttctggttt aaacttgttt gctgtgaaca attgtcgaaa agagtcttcc aattaatgct 721 tttttatatc taggctacct gttggttaga ttcaaggccc cgagctgtta ccattcacaa 781 taaaagctta aacacat // LOCUS HUMGRP75 2131 bp mRNA PRI 26-MAY-1995 DEFINITION Homo sapiens mitochondrial HSP75 mRNA, complete cds. ACCESSION L15189 NID g292058 KEYWORDS glucose-regulated protein. SOURCE Homo sapiens (tissue library: lambda ZAPII) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2131) AUTHORS Bhattacharyya,T., Karnezis,A.N., Murphy,S.P., Hoang,T., Freeman,B.C., Phillips,B. and Morimoto,R.I. TITLE Cloning and subcellular localization of human mitochondrial hsp70 JOURNAL J. Biol. Chem. 270 (4), 1705-1710 (1995) MEDLINE 95130547 FEATURES Location/Qualifiers source 1..2131 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /tissue_lib="lambda ZAPII" gene 30..2069 /gene="mthsp75" CDS 30..2069 /gene="mthsp75" /note="75kD glucose-regulated protein" /codon_start=1 /product="MTHSP75" /db_xref="PID:g292059" /translation="MISASRAAAARLVGAAASRGPTAARHQDSWNGLSHEAFRLVSRR DYASEAIKGAVVGIDLGTTNSCVAVMEGKQAKVLENAEGARTTPSVVAFTADGERLVG MPAKRQAVTNPNNTFYATKRLIGRRYDDPEVQKDIKNVPFKIVRASNGDAWVEAHGKL YSPSQIGAFVLMKMKETAENYLGHTAKNAVITVPAYFNDSQRQATKDAGQISGLNVLR VINEPTAAALAYGLDKSEDKVIAVYDLGGGTFDISILEIQKGVFEVKSTNGDTFLGGE DFDQALLRHIVKEFKRETGVDLTKDNMALQRVREAAEKAKCELSSSVQTDINLPYLTM DSSGPKHLNMKLTRAQFEGIVTDLIRRTIAPCQKAMQDAEVSKSDIGEVILVGGMTRM PKVQQTVQDLFGRAPSKAVNPDEAVAIGAAIQGGVLAGDVTDVLLLDVTPLSLGIETL GGVFTKLINRNTTIPTKKSQVFSTAADGQTQVEIKVCQGEREMAGDNKLLGQFTLIGI PPAPRGVPQIEVTFDIDANGIVHVSAKDKGTRREQQIVIQSSGGLSKDDIENMVKNAE KYAEEDRRKKERVEAVNMAEGIIHDTETKMEEFKDQLPADECNKLKEEISKMRELLAR KDSETGENIRQAASSLQQASLKLFEMAYKKMASEREGSGSSGTGEQKEDQKEEKQ" BASE COUNT 639 a 428 c 555 g 509 t ORIGIN 1 cctgcctcgt actcctccat ttatccgcca tgataagtgc cagccgagct gcagcagccc 61 gtctcgtggg cgccgcagcc tcccggggcc ctacggccgc ccgccaccag gatagctgga 121 atggccttag tcatgaggct tttagacttg tttcaaggcg ggattatgca tcagaagcaa 181 tcaagggagc agttgttggt attgatttgg gtactaccaa ctcctgcgtg gcagttatgg 241 aaggtaaaca agcaaaggtg ctggagaatg ccgaaggtgc cagaaccacc ccttcagttg 301 tggcctttac agcagatggt gagcgacttg ttggaatgcc ggccaagcga caggctgtca 361 ccaacccaaa caatacattt tatgctacca agcgtctcat tggccggcga tatgatgatc 421 ctgaagtaca gaaagacatt aaaaatgttc cctttaaaat tgtccgtgcc tccaatggtg 481 atgcctgggt tgaggctcat gggaaattgt attctccgag tcagattgga gcatttgtgt 541 tgatgaagat gaaagagact gcagaaaatt acttggggca cacagcaaaa aatgctgtga 601 tcacagtccc agcttatttc aatgactcgc agagacaggc cactaaagat gctggccaga 661 tatctggact gaatgtgctt cgggtgatta atgagcccac agctgctgct cttgcctatg 721 gtctagacaa atcagaagac aaagtcattg ctgtatatga tttaggtggt ggaacttttg 781 atatttctat cctggaaatt cagaaaggag tatttgaggt gaaatccaca aatggggata 841 ccttcttagg tggggaagac tttgaccagg ccttgctacg gcacattgtg aaggagttca 901 agagagagac aggggttgat ttgactaaag acaacatggc acttcagagg gtacgggaag 961 ctgctgaaaa ggctaaatgt gaactctcct catctgtgca gactgacatc aatttgccct 1021 atcttacaat ggattcttct ggacccaagc atttgaatat gaagttgacc cgtgctcaat 1081 ttgaagggat tgtcactgat ctaatcagaa ggactatcgc tccatgccaa aaagctatgc 1141 aagatgcaga agtcagcaag agtgacatag gagaagtgat tcttgtgggt ggcatgacta 1201 ggatgcccaa ggttcagcag actgtacagg atctttttgg cagagcccca agtaaagctg 1261 tcaatcctga tgaggctgtg gccattggag ctgccattca gggaggtgtg ttggccggcg 1321 atgtcacgga tgtgctgctc cttgatgtca ctcccctgtc tctgggtatt gaaactctag 1381 gaggtgtctt taccaaactt attaatagga ataccactat tccaaccaag aagagccagg 1441 tattctctac tgccgctgat ggtcaaacgc aagtggaaat taaagtgtgt cagggtgaaa 1501 gagagatggc tggagacaac aaactccttg gacagtttac tttgattgga attccaccag 1561 cccctcgtgg agttcctcag attgaagtta catttgacat tgatgccaat gggatagtac 1621 atgtttctgc taaagataaa ggcacaagac gtgagcagca gattgtaatc cagtcttctg 1681 gtggattaag caaagatgat attgaaaata tggttaaaaa tgcagagaaa tatgctgaag 1741 aagaccggcg aaagaaggaa cgagttgaag cagttaatat ggctgaagga atcattcacg 1801 acacagaaac caagatggaa gaattcaagg accaattacc tgctgatgag tgcaacaagc 1861 tgaaagaaga gatttccaaa atgagggagc tcctggctag aaaagacagt gaaacaggag 1921 aaaatattag acaggcagca tcctctcttc agcaggcatc attgaagctg ttcgaaatgg 1981 catacaaaaa gatggcatct gagcgagaag gctctggaag ttctggcact ggggaacaaa 2041 aggaagatca aaaggaggaa aaacagtaat aatagcagaa attttgaagc cagaaggaca 2101 acatatgaag cttaggagtg aagagacttc c // LOCUS HUMGRPR 1726 bp mRNA PRI 13-FEB-1996 DEFINITION Human gastrin releasing peptide receptor (GRPR) mRNA, complete cds. ACCESSION M73481 NID g183649 KEYWORDS G protein-coupled receptor; GRPR gene; bombesin peptide receptor; gastrin-releasing peptide receptor; growth factor receptor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1726) AUTHORS Corjay,M.H., Dobrzanski,D.J., Way,J.M., Viallet,J., Shapira,H., Worland,P., Sausville,E.A. and Battey,J.F. TITLE Two distinct bombesin receptor subtypes are expressed and functional in human lung carcinoma cells JOURNAL J. Biol. Chem. 266 (28), 18771-18779 (1991) MEDLINE 92011639 FEATURES Location/Qualifiers source 1..1726 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="NCI-H345" /map="Xp22.2-p22.13" mRNA 1..1726 /gene="GRPR" /note="G00-128-035" gene 1..1726 /gene="GRPR" CDS 399..1553 /gene="GRPR" /codon_start=1 /db_xref="GDB:G00-128-035" /product="gastrin releasing peptide receptor" /db_xref="PID:g183650" /translation="MALNDCFLLNLEVDHFMHCNISSHSADLPVNDDWSHPGILYVIP AVYGVIILIGLIGNITLIKIFCTVKSMRNVPNLFISSLALGDLLLLITCAPVDASRYL ADRWLFGRIGCKLIPFIQLTSVGVSVFTLTALSADRYKAIVRPMDIQASHALMKICLK AAFIWIISMLLAIPEAVFSDLHPFHEESTNQTFISCAPYPHSNELHPKIHSMASFLVF YVIPLSIISVYYYFIAKNLIQSAYNLPVEGNIHVKKQIESRKRLAKTVLVFVGLFAFC WLPNHVIYLYRSYHYSEVDTSMLHFVTSICARLLAFTNSCVNPFALYLLSKSFRKQFN TQLLCCQPGLIIRSHSTGRSTTCMTSLKSTNPSVATFSLINGNICHERYV" BASE COUNT 423 a 449 c 380 g 474 t ORIGIN 1 ccagattcta aatatcagga aagacgctgt gggaaaatag caggccaaaa gttcttagta 61 aactgcagcc agggagactc agactagaat ggaggtagaa agaactgatg cagagtgggt 121 ttaattctaa gcctttttgt ggctaagttt tgttgttgtt aacttattga atttagagtt 181 gtattgcact ggtcatgtga aagccagagc agcaccagtg tcaaaatagt gacagagagt 241 tttgaatacc atagttagta tatatgtact cagagtattt ttattaaaga aggcaaagag 301 cccggcatag atcttatctt catcttcact cggttgcaaa atcaatagtt aagaaatagc 361 atctaaggga acttttaggt gggaaaaaaa atctagagat ggctctaaat gactgtttcc 421 ttctgaactt ggaggtggac catttcatgc actgcaacat ctccagtcac agtgcggatc 481 tccccgtgaa cgatgactgg tcccacccgg ggatcctcta tgtcatccct gcagtttatg 541 gggttatcat tctgataggc ctcattggca acatcacttt gatcaagatc ttctgtacag 601 tcaagtccat gcgaaacgtt ccaaacctgt tcatttccag tctggctttg ggagacctgc 661 tcctcctaat aacgtgtgct ccagtggatg ccagcaggta cctggctgac agatggctat 721 ttggcaggat tggctgcaaa ctgatcccct ttatacagct tacctctgtt ggggtgtctg 781 tcttcacact cacggcgctc tcggcagaca gatacaaagc cattgtccgg ccaatggata 841 tccaggcctc ccatgccctg atgaagatct gcctcaaagc cgcctttatc tggatcatct 901 ccatgctgct ggccattcca gaggccgtgt tttctgacct ccatcccttc catgaggaaa 961 gcaccaacca gaccttcatt agctgtgccc catacccaca ctctaatgag cttcacccca 1021 aaatccattc tatggcttcc tttctggtct tctacgtcat cccactgtcg atcatctctg 1081 tttactacta cttcattgct aaaaatctga tccagagtgc ttacaatctt cccgtggaag 1141 ggaatataca tgtcaagaag cagattgaat cccggaagcg acttgccaag acagtgctgg 1201 tgtttgtggg cctgttcgcc ttctgctggc tccccaatca tgtcatctac ctgtaccgct 1261 cctaccacta ctctgaggtg gacacctcca tgctccactt tgtcaccagc atctgtgccc 1321 gcctcctggc cttcaccaac tcctgcgtga acccctttgc cctctacctg ctgagcaaga 1381 gtttcaggaa acagttcaac actcagctgc tctgttgcca gcctggcctg atcatccggt 1441 ctcacagcac tggaaggagt acaacctgca tgacctccct caagagtacc aacccctccg 1501 tggccacctt tagcctcatc aatggaaaca tctgtcacga gcggtatgtc tagattgacc 1561 cttgattttg ccccctgagg gacggttttg ctttatggct agacaggaac ccttgcatcc 1621 attgttgtgt ctgtgccctc caaagagcct tcagaatgct cctgagtggt gtaggtgggg 1681 gtggggaggc ccaaatgatg gatcaccatt atattttgaa agaagc // LOCUS HUMGS1 2058 bp mRNA PRI 31-DEC-1994 DEFINITION Human GS1 (protein of unknown function) mRNA, complete cds. ACCESSION M86934 NID g183652 KEYWORDS . SOURCE Homo sapiens (tissue library: Clontech) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2058) AUTHORS Salido,E.C., Yen,P.H., Koprivnikar,K., Yu,L.C. and Shapiro,L.J. TITLE The human enamel protein gene amelogenin is expressed from both the X and the Y chromosomes [see comments] JOURNAL Am. J. Hum. Genet. 50 (2), 303-316 (1992) MEDLINE 92133605 REFERENCE 2 (sites) AUTHORS Yen,P.H., Ellison,J., Salido,E.C., Mohandas,T. and Shapiro,L. TITLE Isolation of a new gene from the distal short arm of the human X chromosome that escapes X-inactivation JOURNAL Hum. Mol. Genet. 1 (1), 47-52 (1992) MEDLINE 93244753 FEATURES Location/Qualifiers source 1..2058 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_lib="Clontech" /map="Xp22.3" gene 36..2035 /gene="GS1" CDS 36..680 /gene="GS1" /note="Gene from Xp22.3 which escapes X-inactivation. Function unknown." /codon_start=1 /db_xref="PID:g183653" /translation="MDGLLLDTERLYSVVFQEICNRYDKKYSWDVKSLVMGKKALEAA QIIIDVLQLPMSKEELVEESQTKLKEVFPMAALMPGAEKLIIHLRKHGIPFALATSSG SASFDMKTSRHKEFFSLFSHIVLGDDPEVQHGKPDPDIFLACAKRFSPPPAMEKCLVF EDAPNGVEAALAAGMQAVMVPDGNLSRDLTTKATLVLNSLQDFQPELFGLPSYE" polyA_signal 2030..2035 /gene="GS1" polyA_site 2058 /gene="GS1" BASE COUNT 542 a 455 c 467 g 594 t ORIGIN Xp22.3. 1 cgcccccgca gcccgtcacc cacctcatct ttgacatgga cggacttctt ctggatactg 61 aacggctgta ttcagtggtg tttcaagaaa tatgtaatcg ctatgacaag aaatacagct 121 gggatgtaaa gtccctggtt atgggtaaga aggcattaga ggcggcacag attataatag 181 acgtcttgca gctcccgatg tccaaagagg agctggtgga agaaagccaa acgaagttaa 241 aggaagtgtt ccccatggct gcgctcatgc caggggcgga gaaactcatc atccacctgc 301 ggaaacatgg catccccttt gcactagcca ccagctcggg gtccgcgtcg ttcgatatga 361 agacaagccg ccacaaggag ttcttcagct tgttttccca cattgtgctg ggagatgacc 421 ccgaagtgca gcatggcaag ccagacccag acatcttcct agcttgtgcc aagaggttct 481 ctccccctcc tgctatggag aagtgccttg tctttgaaga tgctcccaat ggggtggagg 541 cggccctggc agctgggatg caggcggtca tggttcctga cggaaacttg agccgagatc 601 tgacaacaaa ggccaccctg gtgctgaatt ccctgcagga cttccagccc gagctgtttg 661 gtttgccctc ctatgagtga gagggagggc ctcagtcttc cgcccccagc ccactctcat 721 ggtccacact gctgggggaa agggaaagga aatcagcaac tcttcaatcc caacctgcgc 781 tgtgatttta gcctcctgag attggagttt ccatcccatg ttggtttgtc ccagtctaac 841 gtgttgataa aatgtgactt gacggttgag acaaaaaata cagtagagac agaaacgaag 901 cccagaacaa agatgaaact tgaattacca tctcagaagt caagctgatg gagtatgtga 961 taaagtgaat gtacatgtat atacacacac acctccatat atacacgtgt gtatcagttt 1021 ggtaatatgc aggtaggcat tacatgcata tgtatgtaga catatgcatg catgtatatg 1081 taaaatatat acttttccaa gacaaaatgg aacatcactt ctcctagttt ttctgaacac 1141 tggctgggaa atgtaaactg tgtatgcata taagtatatg ctttatgtat gcatatgtat 1201 gtagatatgt ttatatctat cgtctgcatc actctcctca gtgttgatgt caacatgcaa 1261 tgacaactga taaagcgaga tggtagttct gcctggtttg cagtctgagt gggaaagtcc 1321 tgtttttgat gagcactcct tgttagctaa catttaaatt ctttttgtga cctcagaatg 1381 tctctggatc tttcctcatt gactgactct gtgccacgtc atccatagtt tattgttagt 1441 atgaacacaa ctgtaacatt tacctggtat ctacatcctt acctgcattg gaaaatgttt 1501 gctacctcac aacaaccatt tgcctccttt aagaacactg atgggctgca ctttttggat 1561 agaaatagaa tttgatttca gaatgtatgc ttggtgagtc tcagtgccca ggaacacttt 1621 tggaataatt tatcagacat tgaacttctg tgattaatcg cttttataga tttactcagt 1681 ctttaaaatt cgtctctgat ttgccagaga aaaacggtgg tagccatgga aatcgggagt 1741 gaaggagcac tgcttcattg tggctcagcc cttcctaggg gcctctgccc tttgatgtcc 1801 ttgagctact cttcagctct ggaagttgtg gacaaaccgt aggaatgtat gtgtgcgtgt 1861 ggtggagtga ttgtctgtga atgacaggcc ctggctattg attgatgttg catcaattta 1921 gcaaattcat ttcctcattc ttgatggcct gaatatatgt ctgcactttt aatgctcctc 1981 ttaaccagtt gtaacatctt accatttccc taccaaattg aattagttta ataaaatctt 2041 ttgacacatg ttaaaaac // LOCUS HUMGSHS 1811 bp mRNA PRI 01-JUL-1995 DEFINITION Homo sapiens (clone pGSH1) glutathione synthetase (gsh-s) mRNA, complete cds. ACCESSION L42531 NID g886283 KEYWORDS glutathione synthetase. SOURCE Homo sapiens (clone: pGSH1) brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1811) AUTHORS Gali,R.R. and Board,P.G. TITLE Sequencing and expression of a cDNA for human glutathione synthetase JOURNAL Biochem. J. (1995) In press FEATURES Location/Qualifiers source 1..1811 /organism="Homo sapiens" /note="(vector lambda gt11)" /db_xref="taxon:9606" /clone="pGSH1" /tissue_type="brain" 5'UTR 1..39 /gene="gsh-s" gene 1..1811 /gene="gsh-s" mRNA 1..1811 /gene="gsh-s" CDS 40..1464 /gene="gsh-s" /EC_number="6.3.2.3" /codon_start=1 /product="glutathione synthetase" /db_xref="PID:g886284" /translation="MATNWGSLLQDKQQLEELARQAVDRALAEGVLLRTSQEPTSSEV VSYAPFTLFPSLVPSALLEQAYAVQMDFNLLVDAVSQNAAFLEQTLSSTIKQDDFTAR LFDIHKQVLKEGIAQTVFLGLNRSDYMFQRSADGSPALKQIEINTISASFGGLASRTP AVHRHVLSVLSKTKEAGKILSNNPSKGLALGIAKAWELYGSPNALVLLIAQEKERNIF DQRAIENELLARNIHVIRRTFEDISEKGSLDQDRRLFVDGQEIAVVYFRDGYMPRQYS LQNWEARLLLERSHAAKCPDIATQLAGTKKVQQELSRPGMLEMLLPGQPEAVARLRAT FAGLYSLDVGEEGDQAIAEALAAPSRFVLKPQREGGGNNLYGEEMVQALKQLKDSEER ASYILMEKIEPEPFENCLLRPGSPARVVQCISELGIFGVYVRQEKTLVMNKHVGHLLR TKAIEHADGGVAAGVAVLDNPYPV" 3'UTR 1465..1811 /gene="gsh-s" BASE COUNT 450 a 483 c 500 g 378 t ORIGIN 1 ggagaaccgt tcgcggagga aaggcgaact agtgttggga tggccaccaa ctgggggagc 61 ctcttgcagg ataaacagca gctagaggag ctggcacggc aggccgtgga ccgggccctg 121 gctgagggag tattgctgag gacctcacag gagcccactt cctcggaggt ggtgagctat 181 gccccattca cgctcttccc ctcactggtc cccagtgccc tgctggagca agcctatgct 241 gtgcagatgg acttcaacct gctagtggat gctgtcagcc agaacgctgc cttcctggag 301 caaactcttt ccagcaccat caaacaggat gactttaccg ctcgtctctt tgacatccac 361 aagcaagtcc taaaagaggg cattgcccag actgtgttcc tgggcctgaa tcgctcagac 421 tacatgttcc agcgcagcgc agatggctcc ccagccctga aacagatcga aatcaacacc 481 atctctgcca gctttggggg cctggcctcc cggaccccag ctgtgcaccg acatgttctc 541 agtgtcctga gtaagaccaa agaagctggc aagatcctct ctaataatcc cagcaaggga 601 ctggccctgg gaattgccaa agcctgggag ctctacggct cacccaatgc tctggtgcta 661 ctgattgctc aagagaagga aagaaacata tttgaccagc gtgccataga gaatgagcta 721 ctggccagga acatccatgt gatccgacga acatttgaag atatctctga aaaggggtct 781 ctggaccaag accgaaggct gtttgtggat ggccaggaaa ttgctgtggt ttacttccgg 841 gatggctaca tgcctcgtca gtacagtcta cagaattggg aagcacgtct actgctggag 901 aggtcacatg ctgccaagtg cccagacatt gccacccagc tggctgggac taagaaggtg 961 cagcaggagc taagcaggcc gggcatgctg gagatgttgc tccctggcca gcctgaggct 1021 gtggcccgcc tccgcgccac ctttgctggc ctctactcac tggatgtggg tgaagaaggg 1081 gaccaggcca tcgccgaggc ccttgctgcc cctagccggt ttgtgctaaa gccccagaga 1141 gagggtggag gtaacaacct atatggggag gaaatggtac aggccctgaa acagctgaag 1201 gacagtgagg agagggcctc ctacatcctc atggagaaga tcgaacctga gccttttgag 1261 aattgcctgc tacggcctgg cagccctgcc cgagtggtcc agtgcatttc agagctgggc 1321 atctttgggg tctatgtcag gcaggaaaag acactcgtga tgaacaagca cgtggggcat 1381 ctacttcgaa ccaaagccat cgagcatgca gatggtggtg tggcagcggg agtggcagtc 1441 ctggacaacc cataccctgt gtgagggcac aaccaggcca cgggaccttc tatcctctgt 1501 atttgtcatt cctctcctag ccctcctgag gggtatcctc ctaaagacct ccaaagtttt 1561 tatggaaggg taaatactgg taccttcccc cagctttcca tctgaggacc agaaaagttg 1621 tgtctccctt agatgagatc tagacgcccc caaatccttg agatgtgggt atagctcagg 1681 gtaagctgct aaaccattta cccaaataaa gtataggcga tagaaattga aacctggcgc 1741 aatagatata gtaccgcaag ggaaagatga aaaattataa ccaagcataa tatagcaagg 1801 actaacccct g // LOCUS HUMGST 909 bp mRNA PRI 11-JUN-1993 DEFINITION Human glutathione S-transferase mRNA, complete cds. ACCESSION J03746 NID g183655 KEYWORDS glutathione transferase. SOURCE Human liver, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 909) AUTHORS DeJong,J.L., Morgenstern,R., Joernvall,H., DePierre,J.W. and Tu,C.-P.D. TITLE Gene expression of rat and human microsomal glutathione S-transferases JOURNAL J. Biol. Chem. 263, 8430-8436 (1988) MEDLINE 88228077 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by J.L.DeJong, 08-JUN-1988. FEATURES Location/Qualifiers source 1..909 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..909 /note="GST mRNA" CDS 74..541 /note="glutathione S-transferase" /codon_start=1 /db_xref="PID:g306808" /translation="MVDLTQVMDDEVFMAFASYATIILSKMMLMSTATAFYRLTRKVF ANPEDCVAFGKGENAKKYLRTDDRVERVRRAHLNDLENIIPFLGIGLLYSLSGPDPST AILHFRLFVGARIYHTIAYLTPLPQPNRALSFFVGYGVTLSMAYRLLKSKLYL" BASE COUNT 291 a 167 c 160 g 291 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcaagt cctaaagcct acagttttga atactactga aatgacaagt tattccagac 61 caaaattgaa aaaatggttg acctcaccca ggtaatggat gatgaagtat tcatggcttt 121 tgcatcctat gcaacaatta ttctttcaaa aatgatgctt atgagtactg caactgcatt 181 ctatagattg acaagaaagg tttttgccaa tccagaagac tgtgtagcat ttggcaaagg 241 agaaaatgcc aagaagtatc ttcgaacaga tgacagagta gaacgtgtac gcagagccca 301 cctgaatgac cttgaaaata ttattccatt tcttggaatt ggcctcctgt attccttgag 361 tggtcccgac ccctctacag ccatcctgca cttcagacta tttgtcggag cacggatcta 421 ccacaccatt gcatatttga caccccttcc ccagccaaat agagctttga gtttttttgt 481 tggatatgga gttactcttt ccatggctta caggttgctg aaaagtaaat tgtacctgta 541 aagaaaatca tacaactcaa catccagttg gctttttaag aattctgtac ttccaattta 601 taatgaatac tttcttagat tttaggtagg aggggagcag aggaattatg aactggggta 661 aacccatttt gaatattagc attgccaata tcctgtattc ttgttttaca tttggattag 721 aaatttaaca tagtaattct taagtctttt gtctgatttt taaagtactt tcttataaat 781 ttggatcatg ttatgatttg taacattcac acaacacctc acttttgaat ctataaaaga 841 attgcacgta tgagaaacct atatttcaat actgctgaaa cagacatgaa ataaagaatt 901 taaagaatg // LOCUS HUMGST2 797 bp mRNA PRI 21-JUL-1995 DEFINITION Human glutathione S-transferase 2 (GST) mRNA, complete cds. ACCESSION M15872 NID g183657 KEYWORDS glutathione transferase; glutathione transferase II; transferase. SOURCE Homo sapiens (clone: lambda-GST2-3) liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 797) AUTHORS Board,P.G. and Webb,G.C. TITLE Isolation of a cDNA clone and localization of human glutathione S-transferase 2 genes to chromosome band 6p12 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84 (8), 2377-2381 (1987) MEDLINE 87175676 COMMENT Draft entry and clean copy of sequence [1] kindly provided by P.G.Board, 09-JUN-1987. A polyadenylation signal is located at nucleotides 781-786. FEATURES Location/Qualifiers source 1..797 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="lambda-GST2-3" /tissue_type="liver" /map="6p12.2" mRNA <1..797 /note="GST2 mRNA" gene 56..724 /gene="GST2" CDS 56..724 /gene="GST2" /EC_number="2.5.1.18" /codon_start=1 /db_xref="GDB:G00-120-023" /product="glutathione S-transferase" /db_xref="PID:g306809" /translation="MAEKPKLHYFNARGRMESTRWLLAAAGVEFEEKFIKSAEDLDKL RNDGYLMFQQVPMVEIDGMKLVQTRAILNYIASKYNLYGKDIKERALIDMYIEGIADL GEMILLLPVCPPEEKDAKLALIKEKIKNRYFPAFEKVLKSHGQDYLVGNKLSRADIHL VELLYYVEELDSSLISSFPLLKALKTRISNLPTVKKFLQPGSPRKPPMDEKSLEEARK IFRF" BASE COUNT 250 a 173 c 188 g 186 t ORIGIN Chromosome 6p12.2; 24 bp upstream of HindIII site. 1 caggacggtg acagcgttta acaaagctta gagaaacctc caggagactg ctatcatggc 61 agagaagccc aagctccact acttcaatgc acggggcaga atggagtcca cccggtggct 121 cctggctgca gctggagtag agtttgaaga gaaatttata aaatctgcag aagatttgga 181 caagttaaga aatgatggat atttgatgtt ccagcaagtg ccaatggttg agattgatgg 241 gatgaagctg gtgcagacca gagccattct caactacatt gccagcaaat acaacctcta 301 tgggaaagac ataaaggaga gagccctgat tgatatgtat atagaaggta tagcagattt 361 gggtgaaatg atcctccttc tgcccgtatg tccacctgag gaaaaagatg ccaagcttgc 421 cttgatcaag gagaaaataa aaaatcgcta cttccctgcc tttgaaaaag tcttaaagag 481 ccatggacaa gactaccttg ttggcaacaa gctgagccgg gctgacattc atctggtgga 541 acttctctac tacgtcgagg agcttgactc cagtcttatc tccagcttcc ctctgctgaa 601 ggccctgaaa accagaatca gcaacctgcc cacagtgaag aagtttctac agcctggcag 661 cccaaggaag cctcccatgg atgagaaatc tttagaagaa gcaaggaaga ttttcaggtt 721 ttaataacgc agtcatggag gccaagaact tgcaatacca atgttctaaa gttttgcaac 781 aataaagtac tttacct // LOCUS HUMGSTM3A 1266 bp mRNA PRI 23-DEC-1994 DEFINITION Human glutathione transferase M3 (GSTM3) mRNA, complete cds. ACCESSION J05459 NID g183680 KEYWORDS GSTM3 gene; glutathione S-transferase; glutathione S-transferase M3; glutathione transferase. SOURCE clone HTGT-[6,18]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1266) AUTHORS Campbell,E., Takahashi,Y., Abramovitz,M., Peretz,M. and Listowsky,I. TITLE A distinct human testis and brain mu-class glutathione S-transferase. Molecular cloning and characterization of a form present even in individuals lacking hepatic type mu isoenzymes JOURNAL J. Biol. Chem. 265 (16), 9188-9193 (1990) MEDLINE 90264406 COMMENT Authorin submission for [1] kindly submitted by E.Campbell, 13-APR-1990. FEATURES Location/Qualifiers source 1..1266 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain and testis" /map="1P13.3" mRNA 1..1266 /gene="GSTM3" /note="G00-128-874" gene 1..1266 /gene="GSTM3" CDS 18..695 /gene="GSTM3" /EC_number="2.5.1.18" /codon_start=1 /db_xref="GDB:G00-128-874" /product="glutathione transferase M3" /db_xref="PID:g306820" /translation="MSCESSMVLGYWDIRGLAHAIRLLLEFTDTSYEEKRYTCGEAPD YDRSQWLDVKFKLDLDFPNLPYLLDGKNKITQSNAILRYIARKHNMCGETEEEKIRVD IIENQVMDFRTQLIRLCYSSDHEKLKPQYLEELPGQLKQFSMFLWKFSWFAGEKLTFV DFLTYDILDQNRIFDPKCLDEFPNLKAFMCRFEALEKIAAYLQSDQFCKMPINNKMAQ WGNKPVC" BASE COUNT 350 a 274 c 317 g 325 t ORIGIN 1 ctcggaagcc cgtcaccatg tcgtgcgagt cgtctatggt tctcgggtac tgggatattc 61 gtgggctggc gcacgccatc cgcctgctcc tggagttcac ggatacctct tatgaggaga 121 aacggtacac gtgcggggaa gctcctgact atgatcgaag ccaatggctg gatgtgaaat 181 tcaagctaga cctggacttt cctaatctgc cctacctcct ggatgggaag aacaagatca 241 cccagagcaa tgccatcttg cgctacatcg ctcgcaagca caacatgtgt ggtgagactg 301 aagaagaaaa gattcgagtg gacatcatag agaaccaagt aatggatttc cgcacacaac 361 tgataaggct ctgttacagc tctgaccacg aaaaactgaa gcctcagtac ttggaagagc 421 tacctggaca actgaaacaa ttctccatgt ttctgtggaa attctcatgg tttgccgggg 481 aaaagctcac ctttgtggat tttctcacct atgatatctt ggatcagaac cgtatatttg 541 accccaagtg cctggatgag ttcccaaacc tgaaggcttt catgtgccgt tttgaggctt 601 tggagaaaat cgctgcctac ttacagtctg atcagttctg caagatgccc atcaacaaca 661 agatggccca gtggggcaac aagcctgtat gctgagcagg aggcagactt gcagagcttg 721 ttttgtttca tcctgtccgt aaggggtcag cgctcttgct ttgctctttt caatgaatag 781 cacttatgtt actggtgtcc agctgagttt ctcttgggta taaaggctaa aagggaaaaa 841 ggatatgtgg agaatcatca agatatgaat tgaatcgctg cgatactgtg gcatttccct 901 actccccaac tgagttcaag ggctgtaggt tcatgcccaa gccctgagag tgggtactag 961 aaaaaacgag attgcacagt tggagagagc aggtgtgtta aatggactgg agtccctgtg 1021 aagactgggt gaggataaca caagtaaaac tgtggtactg atggacttaa ccggagttcg 1081 gaaaccgtcc tgtgtacaca tgggagttta gtgtgataaa ggcagtattt cagactggtg 1141 ggctagccaa tagagttggc aattgcttat tgaaactcat taaaaataat agagccccac 1201 ttgacactat tcactaaaat taatctggaa tttaaggccc aacattaaac acaaagctgt 1261 attgat // LOCUS HUMGSTT2A 1036 bp mRNA PRI 24-JUL-1997 DEFINITION Homo sapiens glutathione S-transferase theta 2 (GSTT2) mRNA, complete cds. ACCESSION L38503 NID g601917 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1036) AUTHORS Tan,K.L., Webb,G.C., Baker,R.T. and Board,P.G. TITLE Molecular cloning of a cDNA and chromosomal localization of a human theta-class glutathione S-transferase gene (GSTT2) to chromosome 22 JOURNAL Genomics 25 (2), 381-387 (1995) MEDLINE 95309904 REFERENCE 2 (bases 1 to 1036) AUTHORS Board,P.G., Tan,K.L. and Baker,R.T. TITLE Direct Submission JOURNAL Submitted (12-DEC-1994) Molecular Genetics Group, John Curtin School of Medical Research, Australian National University, P.O. Box 334, Canberra, Australian Capital Territory 2601, Australia FEATURES Location/Qualifiers source 1..1036 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /sex="female" /tissue_type="liver" /map="22" gene 1..1036 /gene="GSTT2" CDS 1..735 /gene="GSTT2" /EC_number="2.5.1.18" /codon_start=1 /db_xref="GDB:G00-376-372" /product="glutathione S-transferase theta 2" /db_xref="PID:g601918" /translation="MGLELFLDLVSQPSRAVYIFAKKNGIPLELRTVDLVKGQHKSKE FLQINSLGKLPTLKDGDFILTESSAILIYLSCKYQTPDHWYPSDLQARARVHEYLGWH ADCIRGTFGIPLWVQVLGPLIGVQVPEEKVERNRTAMDQALQWLEDKFLGDRPFLAGQ QVTLADLMALEELMQPVALGYELFEGRPRLAAWRGRVEAFLGAELCQEAHSIILSILE QAAKKTLPTPSPEAYQAMLLRIARIP" BASE COUNT 243 a 286 c 279 g 228 t ORIGIN 1 atgggcctag agctgtttct tgacctggtg tcccagccca gccgcgccgt ctacatcttc 61 gccaagaaga atggcatccc cttagagctg cgcaccgtgg atttggtcaa agggcagcac 121 aagagcaagg agttcttgca gatcaacagc ctggggaaac tgccgacgct caaggatggt 181 gatttcatct tgaccgaaag ctcggccatc ctgatttacc tgagctgtaa gtaccagacg 241 ccggaccact ggtatccatc tgacctgcag gctcgtgccc gtgttcatga gtacctgggc 301 tggcatgccg actgcatccg tggcaccttt ggtatacccc tgtgggtcca ggtgttgggg 361 ccactcattg gggtccaggt gcccgaggag aaggtggaac gcaacaggac tgccatggac 421 caggccctgc aatggctgga ggacaagttc ctgggggaca ggcccttcct cgctggccag 481 caggtgacac tggctgatct catggccctg gaggagctga tgcagccggt ggctctcggc 541 tacgaactgt ttgagggacg gccacgactg gcagcatggc gtggacgagt ggaggctttc 601 ctgggtgctg agctatgcca ggaggcccac agcatcatct tgagcatcct ggaacaggcg 661 gccaagaaaa ccctcccaac accctcacca gaggcctatc aggctatgct gcttcgaatc 721 gccaggatcc cctgaagggt ctgggatggg ggccaggaga ttagcaacaa ggattcattc 781 tgttacttac ttgccccttt ttatctttcc ctcttgcccc agtcccttct ctccagcttc 841 atgtgaagct ctgcacagac aagacactca gtgtccttgg cagtgctgct actcctcagg 901 tgcagcatac ataaccagta agagactaaa tctgcaatat ataaagagct cctacaaatc 961 agtaacatga agaacactca aaaattggca aatgtcatca gtgttttaaa cagaataaag 1021 attccaaaca ctttga // LOCUS HUMGT198A 1438 bp mRNA PRI 03-OCT-1995 DEFINITION Homo sapiens GT198 mRNA, complete ORF. ACCESSION L38933 NID g1008841 KEYWORDS . SOURCE Homo sapiens (clone: GT198) (tissue library: ATCC #CRL1500; Clontech #HL1059b) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1438) AUTHORS Rommens,J.M., Durocher,F., McArthur,J., Tonin,P., Leblanc,J.-F., Allen,T., Samson,C., Ferri,L., Narod,S., Morgan,K. and Simard,J. TITLE Generation of a transcription map at the HSD17B locus centromeric to BRCA1 at 17q21 JOURNAL Genomics 28 (3), 530-542 (1995) MEDLINE 96039267 FEATURES Location/Qualifiers source 1..1438 /organism="Homo sapiens" /note="Cloning Vector: Lambda gt11" /db_xref="taxon:9606" /clone="GT198" /cell_line="ZR-75-1 0)" /tissue_lib="ATCC #CRL1500; Clontech #HL1059b" mRNA 1..1436 /note="putative" 5'UTR 1..133 /note="putative" CDS 134..745 /note="the longest open reading frame predicts a protein of 202 amino acids, with fair Kozak consensus at the initial ATG codon; an in-frame TGA codon is seen at nucleotide 8; ORF; putative" /codon_start=1 /db_xref="PID:g1008842" /translation="MSKGRAEAAGAAGILLRYLQEQNRPYSSQDVFGNLQREHGLGKA VVVKTLEQLAQQGKIKEKMYGKQKIYFADQDQFDMVSDADLQVLDGKIVALTAKVQSL QQTCRYMEAELKELSSALTTPEMQKEIQELKKECAGYRERLKNIKAATNHVTPEEKEQ VYRERQKYCKEWREEEEDGYRAVLMQYLKDTPRARSSSLRKLG" 3'UTR 746..1438 /note="putative" polyA_signal 1422..1427 /note="putative" polyA_site 1438 /note="putative" BASE COUNT 426 a 297 c 407 g 308 t ORIGIN 1 atcaaggtga tcccaaaacg aaccaacaga ccaggcatca gcacaacaga ccggggtttt 61 ccacgagccc gctaccgcgc ccggaccacc aactacaacg tccggctttc tgagttgggt 121 ggcgggaaag gcgatgagta aaggccgggc agaagctgcg ggagccgccg ggatcctcct 181 gaggtacctg caggagcaga accggcccta cagctcccag gatgtgttcg ggaacctaca 241 gcgggaacac ggactgggca aggcggtggt ggtgaagacg ctggagcagc tggcgcaaca 301 aggcaagatc aaagagaaga tgtacggcaa gcagaagatc tattttgcgg atcaggacca 361 gtttgacatg gtgagtgatg ctgaccttca agtcctagat ggcaaaatcg tggccctcac 421 tgctaaggtg cagagcttgc agcagacgtg ccgctacatg gaggctgagc tcaaggaatt 481 atctagtgcc ctgaccacac cagagatgca gaaagaaatc caggagttaa agaaggaatg 541 cgctggctac agagagagat tgaagaacat taaagcagct accaatcatg tgactccaga 601 agagaaagag caggtgtaca gagagaggca gaagtactgt aaggagtgga gggaagagga 661 agaggatggc tacagagctg tcttgatgca atacttgaag gataccccaa gagcaagaag 721 cagttctttg aggaagttgg gatagagacg gatgaagatt acaacgtcac actcccagac 781 ccctgagggg cccacggtca ggactggtgg ggactgcagg atgtcagaag agtgagatgt 841 cttgcactgg ctaccttgtt tttggttggc ttttgttgtt gttcctctta cttttcactt 901 tagcagagca gtcaggagac aagcataaac cagagcactg ggtagagagg atgagggctg 961 gtggctgggg gtagacccca cgcatttcat tgtctaaatt gcagtagctt gaggttaaca 1021 tttagacttg gaacaatgct aaaggaaagc atttggcaat atttattata atttaatttt 1081 atataaaaat atttaatttc ctctggatag tcaaacctgc cagatatcaa acctgaggaa 1141 ggcagaagtg aatttggaga actagggtag agagaggttg ctataaaacg agcatttgga 1201 gggcccacgg cttcactcag gacctgctgg gcttgtgtac cccaggagcc cttttaagta 1261 tcttttgtac gcttttcacc ccacccccaa gtcctgggag aaatgcaggc aacactgaga 1321 catgggagag gccaagatat gcttgacaga aagggtgatt ttgaggctca gttaatattt 1381 caaaattgta accgtagcaa aactgcattg gtatttagaa aaataaaaaa tttccaat // LOCUS HUMGTLPA 3915 bp mRNA PRI 12-JUN-1997 DEFINITION Human glucose transporter-like protein-III (GLUT3), complete cds. ACCESSION M20681 NID g183684 KEYWORDS Alu repeat; glucose transporter-like protein. SOURCE Human fetal skeletal muscle, cDNA to mRNA, clone lambda-hMGT-8. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3915) AUTHORS Kayano,T., Fukumoto,H., Eddy,R.L.J.r., Fan,Y.-S., Byers,M.G., Shows,T.B.J.r. and Bell,G.I. TITLE Evidence for a family of human glucose transporter-like proteins: Sequence and gene localization of a protein expressed in fetal skeletal muscle and other tissues JOURNAL J. Biol. Chem. 263, 15245-15248 (1988) MEDLINE 89008414 COMMENT Draft entry and computer readable sequence [1] kindly submitted by G.I.Bell, 14-SEP-1988. FEATURES Location/Qualifiers source 1..3915 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..2597 /note="GTLP mRNA (alt.)" mRNA <1..3915 /note="GTLP mRNA (alt.)" CDS 243..1733 /note="glucose transporter-like protein" /codon_start=1 /db_xref="PID:g306821" /translation="MGTQKVTPALIFAITVATIGSFQFGYNTGVINAPEKIIKEFINK TLTDKGNAPPSEVLLTSLWSLSVAIFSVGGMIGSFSVGLFVNRFGRRNSMLIVNLLAV TGGCFMGLCKVAKSVEMLILGRLVIGLFCGLCTGFVPMYIGEISPTALRGAFGTLNQL GIVVGILVAQIFGLEFILGSEELWPLLLGFTILPAILQSAALPFCPESPRFLLINRKE EENAKQILQRLWGTQDVSQDIQEMKDESARMSQEKQVTVLELFRVSSYRQPIIISIVL QLSQQLSGINAVFYYSTGIFKDAGVQEPIYATIGAGVVNTIFTVVSLFLVERAGRRTL HMIGLGGMAFCSTLMTVSLLLKDNYNGMSFVCIGAILVFVAFFEIGPGPIPWFIVAEL FSQGPRPAAMAVAGCSNWTSNFLVGLLFPSAAHYLGAYVFIIFTGFLITFLAFTFFKV PETRGRTFEDITRAFEGQAHGADRSGKDGVMEMNSIEPAKETTTNV" repeat_region 3429..3712 /note="Alu repetitive sequence" BASE COUNT 932 a 863 c 933 g 1187 t ORIGIN 116 bp upstream of BamHI site; chromosome 12p13.3. 1 gtggggtggg gtggggctgg gggcttgtcg ccctttcagg ctccaccctt tgcggagatt 61 ataaatagtc atgatcccag cgagacccag agatgcctgt aatggtgaga ctttggatcc 121 ttcctgagga cgtggagaaa actttctgct gagaaggaca ttttgaaggt tttgttggct 181 gaaaaagctg tttctggaat cacccctaga tctttcttga agacttgaat tagattacag 241 cgatggggac acagaaggtc accccagctc tgatatttgc catcacagtt gctacaatcg 301 gctctttcca atttggctac aacactgggg tcatcaatgc tcctgagaag atcataaagg 361 aatttatcaa taaaactttg acggacaagg gaaatgcccc accctctgag gtgctgctca 421 cgtctctctg gtccttgtct gtggccatat tttccgtcgg gggtatgatc ggctcctttt 481 ccgtcggact cttcgtcaac cgctttggca ggcgcaattc aatgctgatt gtcaacctgt 541 tggctgtcac tggtggctgc tttatgggac tgtgtaaagt agctaagtcg gttgaaatgc 601 tgatcctggg tcgcttggtt attggcctct tctgcggact ctgcacaggt tttgtgccca 661 tgtacattgg agagatctcg cctactgccc tgcggggtgc ctttggcact ctcaaccagc 721 tgggcatcgt tgttggaatt ctggtggccc agatctttgg tctggaattc atccttgggt 781 ctgaagagct atggccgctg ctactgggtt ttaccatcct tcctgctatc ctacaaagtg 841 cagcccttcc attttgccct gaaagtccca gatttttgct cattaacaga aaagaagagg 901 agaatgctaa gcagatcctc cagcggttgt ggggcaccca ggatgtatcc caagacatcc 961 aggagatgaa agatgagagt gcaaggatgt cacaagaaaa gcaagtcacc gtgctagagc 1021 tctttagagt gtccagctac cgacagccca tcatcatttc cattgtgctc cagctctctc 1081 agcagctctc tgggatcaat gctgtgttct attactcaac aggaatcttc aaggatgcag 1141 gtgttcaaga gcccatctat gccaccatcg gcgcgggtgt ggttaatact atcttcactg 1201 tagtttctct atttctggtg gaaagggcag gaagaaggac tctgcatatg ataggccttg 1261 gagggatggc tttttgttcc acgctcatga ctgtttcttt gttattaaag gataactata 1321 atgggatgag ctttgtctgt attggggcta tcttggtctt tgtagccttc tttgaaattg 1381 gaccaggccc cattccctgg tttattgtgg ccgaactctt cagccagggc ccccgcccag 1441 ctgcgatggc agtggccggc tgctccaact ggacctccaa cttcctagtc ggattgctct 1501 tcccctccgc tgctcactat ttaggagcct acgtttttat tatcttcacc ggcttcctca 1561 ttaccttctt ggcttttacc ttcttcaaag tccctgagac ccgtggcagg acttttgagg 1621 atatcacacg ggcctttgaa gggcaggcac acggtgcaga tagatctgga aaggacggcg 1681 tcatggagat gaacagcatc gagcctgcta aggagaccac caccaatgtc taagtcgtgc 1741 ctccttccac ctccctcccg gcatgggaaa gccacctctc cctcaacaag ggagagacct 1801 catcaggatg aacccaggac gcttctgaat gctgctactt aattcctttc tcatcccacg 1861 cactccatga gcaccccaag gctgcggttt gttggatctt caatggcttt ttaaatttta 1921 tttcctggac atcctcttct gcttaggaga gaccgagtga acctaccttc atttcaggag 1981 ggattggccg cttggcacat gacaactttg ccagcttttc ctcccttggg ttctgatatt 2041 gccgcactag gggatatagg agaggaaaag taaggtgcag ttcccccaac ctcagactta 2101 ccaggaagca gatacatatg agtgtggaag ccggagggtg tttatgtaag agcaccttcc 2161 tcacttccat acagctctac gtggcaaatt aacttgagtt ttatttattt tatcctctgg 2221 tttaattaca taattttttt ttttttactt taagtttcag gatacatgtg ccgaatgtgc 2281 aggtttgtta cataggtata tatatgccat gatggaaata tttatttttt taagcgtaat 2341 tttgccaaat aataaaaaca gaaggaaatt gagattagag ggaggtgttt aaagagaggt 2401 tatagagtag aagatttgat gctggagagg ttaaggtgca ataagaattt agggagaaat 2461 gttgttcatt attggagggt aaatgatgtg gtgcctgagg tctgtacgtt acctcttaac 2521 aatttctgtc cttcagatgg aaactcttta acttctcgta aaagtcatat acctatataa 2581 taaagctact gatttccttg gagctttttt ctttaagata atagtttaca tgtagtagta 2641 cttgaaatct aggattatta actaatatgg gcattgtagt taatgatggt tgatgggttc 2701 taattttgga tggagtccag ggaagagaaa gtgatttcta gaaagcctgt tcccctcact 2761 ggatgaaata actccttctt gtagtagtct cattactttt gaagtaatcc cgccacctat 2821 ctcgtgggag agccatccaa ataagaaacc taaaataatt ggttcttggt agagattcat 2881 tatttttcca ctttgttctt taggagattt taggtgttga ttttctgttg tattttaact 2941 cataccttta aaggaattcc ccaaagaatg tttatagcaa acttggaatt tgtaacctca 3001 gctctgggag aggatttttt tctgagcgat tattatctaa agtgtgttgt tgctttaggc 3061 tcacggcacg cttgcgtatg tctgttacca tgtcactgtg gtcctatgcc gaatgccctc 3121 aggggacttg aatctttcca ataaaccagg tttagacagt atgagtcaat gtgcagtgta 3181 gcccacactt gagaggatga atgtatgtgc actgtcactt tgctctgggt ggaagtacgt 3241 tattgttgac ttattttctc tgtgtttgtt cctacagccc ctttttcata tgttgctcag 3301 tctccctttc ccttcttggt gcttacacat ctcagaccct ttagccaaac ccttgtcagt 3361 gacagtattt tggttcttag ttctcactgt tccctctgct cctggagcct ttgaataaaa 3421 atgcacgtag ctgaggccgg atgcggtggc tcacgcctgt aatcccagca ctttgggagg 3481 cctaggcggg cggtcagggg ttcgagacca gtctggccaa catcgtgaaa ccctgtctct 3541 actaaaaatg caaaaattag ccgggcgtgg tggcgggcgc ctgtaatccc agctacttgg 3601 gaagctgagg cgggagaatc atgtgaaccc gggacgcagg ggttgcagtg agcggagatc 3661 gcatcattgc actctagcct gggccacagg gcgagactcc gtctcaaaaa aaaaaaaatg 3721 cacatagcta tcgagtgtgc tttagcttga aaaggtgacc ttgcaacttc atgtcaactt 3781 tctggctcct caaacagtag gttggcagta aggcagggtc ccatttctca ctgagaagat 3841 tgtgaatatt tccatatgga ttttctattg ttactctggt tctttgtttt aaaataaaaa 3901 ttctgaatgt acacg // LOCUS HUMGTPBRPA 1540 bp mRNA PRI 31-DEC-1994 DEFINITION Human guanine nucleotide-binding regulatory protein (G-y-alpha) mRNA, complete cds. ACCESSION M69013 NID g183690 KEYWORDS guanine nucleotide-binding regulatory protein. SOURCE Human retinal pigment epithelium, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1540) AUTHORS Jiang,M., Pandey,S., Tran,V.T. and Fong,H.K. TITLE Guanine nucleotide-binding regulatory proteins in retinal pigment epithelial cells JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88 (9), 3907-3911 (1991) MEDLINE 91219481 FEATURES Location/Qualifiers source 1..1540 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="retinal pigment epithelium" gene 187..1266 /gene="G-y-alpha" CDS 187..1266 /gene="G-y-alpha" /codon_start=1 /product="guanine nucleotide-binding regulatory protein" /db_xref="PID:g183691" /translation="MTLESMMACCLSDEVKESKRINAEIEKQLRRDKRDARRELKLLL LGTGESGKSTFIKQMRIIHGAGYSEEDKRGFTKLVYQNIFTAMQAMIRAMETLKILYK YEQNKANALLIREVDVEKVTTFEHQYVSAIKTLWEDPGIQECYDRRREYQLSDSAKYY LTDVDRIATLGYLPTQQDVLRVRVPTTGIIEYPFDLENIIFRMVDVGGQRSERRKWIH CFENVTSIMFLVALSEYDQVLVESDNENRMEESKALFRTIITYPWFQNSSVILFLNKK DLLEDKILYSHLVDYFPEFDGPQREPQAAREFILKMFVDLNPDSDKIIYSHFTCATDT ENIRFVFAAVKDTILQLNLKEYNLV" polyA_site 1540 /gene="G-y-alpha" BASE COUNT 295 a 489 c 484 g 272 t ORIGIN 1 gccctcggcc ccgggccggc ccgccccgcc tcggccgccg cctggcgagc cgccgggtcc 61 ccgctcggcc ggtggccgag gccggagggc cgcggcgggc ggcggccgag gcggctccgg 121 ccagggccgg gccgggggcc ggggggcggc ggcgggcagg cggccgcgtc ggccggggcc 181 gggacgatga ctctggagtc catgatggcg tgttgcctga gcgatgaggt gaaggagtcc 241 aagcggatca acgccgagat cgagaagcag ctgcggcggg acaagcgcga cgcccggcgc 301 gagctcaagc tgctgctgct cggcacgggc gagagcggga agagcacgtt catcaagcag 361 atgcgcatca tccacggcgc cggctactcg gaggaggaca agcgcggctt caccaagctc 421 gtctaccaga acatcttcac cgccatgcag gccatgatcc gggccatgga gacgctcaag 481 atcctctaca agtacgagca gaacaaggcc aatgcgctcc tgatccggga ggtggacgtg 541 gagaaggtga ccaccttcga gcatcagtac gtcagtgcca tcaagaccct gtgggaggac 601 ccgggcatcc aggaatgcta cgaccgcagg cgcgagtacc agctctccga ctctgccaag 661 tactacctga ccgacgttga ccgcatcgcc accttgggct acctgcccac ccagcaggac 721 gtgctgcggg tccgcgtgcc caccaccggc atcatcgagt accctttcga cctggagaac 781 atcatcttcc ggatggtgga tgtggggggc cagcggtcgg agcggaggaa gtggatccac 841 tgctttgaga acgtgacatc catcatgttt ctcgtcgccc tcagcgaata cgaccaagtc 901 ctggtggagt cggacaacga gaaccggatg gaggagagca aagccctgtt ccggaccatc 961 atcacctacc cctggttcca gaactcctcc gtcatcctct tcctcaacaa gaaggacctg 1021 ctggaggaca agatcctgta ctcgcacctg gtggactact tccccgagtt cgatggtccc 1081 cagcgggagc cccaggcggc gcgggagttc atcctgaaga tgttcgtgga cctgaacccc 1141 gacagcgaca agatcatcta ctcacacttc acgtgtgcca ccgacacgga gaacatccgc 1201 ttcgtgttcg cggccgtgaa ggacaccatc ctgcagctca acctcaagga gtacaacctg 1261 gtctgagcgc cccaggccca gggagacggg atggagacac ggggcaggac cttccttcca 1321 cggagcctgc gctgccgggc gggtggcgct gccgagtccg ggccggggct ctgccgcggg 1381 aggagatttt ttttttttca tatttttaac aaatggtttt tatttcacag ttatcagggg 1441 atgtacatct ctccctccgt acacttcgcg caccttctca ccttttgtca acggcaaagg 1501 cagccttttt ctggccttga cttatggctc gcttttttct // LOCUS HUMGTUB 1568 bp mRNA PRI 08-NOV-1994 DEFINITION Human gamma-tubulin mRNA, complete cds. ACCESSION M61764 NID g183702 KEYWORDS gamma-tubulin. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1568) AUTHORS Zheng,Y., Jung,M.K. and Oakley,B.R. TITLE Gamma-tubulin is present in Drosophila melanogaster and Homo sapiens and is associated with the centrosome JOURNAL Unpublished (1991) FEATURES Location/Qualifiers source 1..1568 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /map="Unassigned" mRNA 1..1568 /gene="TUBG" /note="G00-128-600" gene 1..1568 /gene="TUBG" CDS 25..1380 /gene="TUBG" /codon_start=1 /db_xref="GDB:G00-128-600" /product="gamma-tubulin" /db_xref="PID:g183703" /translation="MPREIITLQLGQCGNQIGFEFWKQLCAEHGISPEAIVEEFATEG TDRKDVFFYQADDEHYIPRAVLLDLEPRVIHSILNSPYAKLYNPENIYLSEHGGGAGN NWASGFSQGEKIHEDIFDIIDREADGSDSLEGFVLCHSIAGGTGSGLGSYLLERLNDR YPKKLVQTYSVFPNQDEMSDVVVQPYNSLLTLKRLTQNADCLVVLDNTALNRIATDRL HIQNPSFSQINQLVSTIMSASTTTLRYPGYMNNDLIGLIASLIPTPRLHFLMTGYTPL TTDQSVASVRKTTVLDVMRRLLQPKNVMVSTGRDRQTNHCYIAILNIIQGEVDPTQVH KSLQRIRERKLANFIPWGPASIQVALSRKSPYLPSAHRVSGLMMANHTSISSLFERTC RQYDKLRKREAFLEQFRKEDMFKDNFDEMDTSREIVQQLIDEYHAATRPDYISWGTQE Q" BASE COUNT 364 a 493 c 408 g 303 t ORIGIN 1 cgcaacgccg gtgcctgagg agcgatgccg agggaaatca tcaccctaca gttgggccag 61 tgcggcaatc agattgggtt cgagttctgg aaacagctgt gcgccgagca tggtatcagc 121 cccgaggcga tcgtggagga gttcgccacc gagggcactg accgcaagga cgtctttttc 181 taccaggcag acgatgagca ctacatcccc cgggccgtgc tgctggactt ggaaccccgg 241 gtgatccact ccatcctcaa ctccccctat gccaagctct acaacccaga gaacatctac 301 ctgtcggaac atggaggagg agctggcaac aactgggcca gcggattctc ccagggagaa 361 aagatccatg aggacatttt tgacatcata gaccgggagg cagatggtag tgacagtcta 421 gagggctttg tgctgtgtca ctccattgct ggggggacag gctctggact gggttcctac 481 ctcttagaac ggctgaatga caggtatcct aagaagctgg tgcagacata ctcagtgttt 541 cccaaccagg acgagatgag cgatgtggtg gtccagcctt acaattcact cctcacactc 601 aagaggctga cgcagaatgc agactgtctg gtggtgctgg acaacacagc cctgaaccgg 661 attgccacag accgcctgca catccagaac ccatccttct cccagatcaa ccagctggtg 721 tctaccatca tgtcagccag caccaccacc ctgcgctacc ctggctacat gaacaatgac 781 ctcatcggcc tcatcgcctc gctcattccc accccacggc tccacttcct catgaccggc 841 tacacccctc tcactacgga ccagtcagtg gccagcgtga ggaagaccac ggtcctggat 901 gtcatgaggc ggctgctgca gcccaagaac gtgatggtgt ccacaggccg agaccgccag 961 accaaccact gctacatcgc catcctcaac atcatccagg gagaggtgga ccccacccag 1021 gtccacaaga gcttgcagag gatccgggaa cgcaagttgg ccaacttcat cccgtggggc 1081 cccgccagca tccaggtggc cctgtcgagg aagtctccct acctgccctc ggcccaccgg 1141 gtcagcgggc tcatgatggc caaccacacc agcatctcct cgctcttcga gagaacctgt 1201 cgccagtatg acaagctgcg taagcgggag gccttcctgg agcagttccg caaggaggac 1261 atgttcaagg acaactttga tgagatggac acatccaggg agattgtgca gcagctcatc 1321 gatgagtacc atgcggccac acggccagac tacatctcct ggggcaccca ggagcagtga 1381 gtcccccagg acaggggacc ctcatctgcc ttactggttg gcccaagccc tgcctgactg 1441 accaccccct cagagcacag atcagggacc tcacgcatct ctttctcata tacatggact 1501 ctctgttggc ctgcaaacac atttacttct cctcttatga gactatttat ctttaataaa 1561 gcactggg // LOCUS HUMGUABIND 3334 bp mRNA PRI 05-MAY-1993 DEFINITION Human nucleotide binding protein mRNA, complete cds. ACCESSION L04510 NID g292069 KEYWORDS ADP-ribosylation factor; DNA-binding; guanine nucleotide-binding protein; nucleotide binding protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3334) AUTHORS Mishima,K., Tsuchiya,M., Nightingale,M.S., Moss,J. and Vaughan,M. TITLE ARD 1, a 64-kDa guanine nucleotide-binding protein with a carboxyl- terminal ADP-ribosylation factor (ARF) domain JOURNAL J. Biol. Chem. 268, 8801-8807 (1993) MEDLINE 93232038 FEATURES Location/Qualifiers source 1..3334 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 23..1747 /standard_name="ARD 1" /note="64 kDa protein; contains ADP-ribosylation factor domain" /codon_start=1 /product="nucleotide binding protein" /db_xref="PID:g292070" /translation="MATLVVNKLGAGVDSGRQGSRGTAVVKVLECGVCEDVFSLQGDK VPRLLLCGHTVCHDCLTRLPLHGRAIRCPFDRQVTDLGDSGVWGLKKNFALLELLERL QNGPIGQYGAAEESIGISGESIIRCDEDEAHLASVYCTVCATHLCSECSQVTHSTKTL AKHRRVPLADKPHEKTMCSQHQVHAIEFVCLEEGCQTSPLMCCVCKEYGKHQGHKHSV LEPEANQIRASILDMAHCIRTFTEEISDYSRKLVGIVQHIEGGEQIVEDGIGMAHTEH VPGTAENARSCIRAYFYDLHETLCRQEEMALSVVDAHVREKLIWLRQQQEDMTILLSE VSAACLHCEKTLQQDDCRVVLAKQEITRLLETLQKQQQQFTEVADHIQLDASIPVTFT KDNRVHIGPKMEIRVVTLGLDGAGKTTILFKLKQDEFMQPIPTIGFNVETVEYKNLKF TIWDVGGKHKLRPLWKHYYLNTQAVVFVVDSSHRDRISEAHSELAKLLTEKELRDALL LIFANKQDVAGALSVEEITELLSLHKLCCGRSWYIQGCDARSGMGLYEGLDWLSRQLV AAGVLDVA" BASE COUNT 1068 a 507 c 718 g 1041 t ORIGIN 1 ctgtggcgct tcccctgcga ggatggctac cctggttgta aacaagctcg gagcgggagt 61 agacagtggc cggcagggca gccgggggac agctgtagtg aaggtgctag agtgtggagt 121 ttgtgaagat gtcttttctt tgcaaggaga caaagttccc cgtcttttgc tttgtggcca 181 taccgtctgt catgactgtc tcactcgcct acctcttcat ggaagagcaa tccgttgccc 241 atttgatcga caagtaacag acctaggtga ttcaggtgtc tggggattga aaaaaaattt 301 tgctttattg gagcttttgg aacgactgca gaatgggcct attggtcagt atggagctgc 361 agaagaatcc attgggatat ctggagagag catcattcgt tgtgatgaag atgaagctca 421 ccttgcctct gtatattgca ctgtgtgtgc aactcatttg tgctctgagt gttctcaagt 481 tactcattct acaaagacat tagcaaagca caggcgagtt cctctagctg ataaacctca 541 tgagaaaact atgtgctctc agcaccaggt gcatgccatt gagtttgttt gcttggaaga 601 aggttgtcaa actagcccac tcatgtgctg tgtctgcaaa gaatatggaa aacaccaggg 661 tcacaagcat tcagtattgg aaccagaagc taatcagatc cgagcatcaa ttttagatat 721 ggctcactgc atacggacct tcacagagga aatctcagat tattccagaa aattagttgg 781 aattgtgcag cacattgaag gaggagaaca aatcgtggaa gatggaattg gaatggctca 841 cacagaacat gtaccaggga ctgcagagaa tgcccggtca tgtattcgag cttattttta 901 tgatctacat gaaactctgt gtcgtcaaga agaaatggct ctaagtgttg ttgatgctca 961 tgttcgtgaa aaattgattt ggctcaggca gcaacaagaa gatatgacta ttttgttgtc 1021 agaggtttct gcagcctgcc tccactgtga aaagactttg cagcaggatg attgtagagt 1081 tgtcttggca aaacaggaaa ttacaaggtt actggaaaca ttgcagaaac agcagcagca 1141 gtttacagaa gttgcagatc acattcagtt ggatgccagc atccctgtca cttttacaaa 1201 ggataatcga gttcacattg gaccaaaaat ggaaattcgg gtcgttacgt taggattgga 1261 tggtgctgga aaaactacta tcttgtttaa gttaaaacag gatgaattca tgcagcccat 1321 tccaacaatt ggttttaacg tggaaactgt agaatataaa aatctaaaat tcactatttg 1381 ggatgtaggt ggaaaacaca aattaagacc attgtggaaa cattattacc tcaatactca 1441 agctgttgtg tttgttgtag atagcagtca tagagacaga attagtgaag cacacagcga 1501 acttgcaaag ttgttaacgg aaaaagaact ccgagatgct ctgctcctga tttttgctaa 1561 caaacaggat gttgctggag cactgtcagt agaagaaatc actgaactac tcagtctcca 1621 taaattatgc tgtggccgta gctggtatat tcagggctgt gatgctcgaa gtggtatggg 1681 actgtatgaa gggttggact ggctctcacg gcaacttgta gctgctggag tattggatgt 1741 tgcttgattt taaaggcagc agttgtttga agttttgtgg ttaaaagtaa ctttgcacat 1801 agtatgtttt aagaaattat acatctcaaa agatggtaat ttaggatgca tatatatata 1861 tatatatata aaggaatctt ggattgggaa ttcagtactt tgctttaaaa aaattttgtg 1921 gcagaattaa atttctaatt gagcagatta gattgaatta aatagaaact tattgaatat 1981 acattctttt aaaaagtata tttgttattt aagtttttca gataatatgt gaccaatata 2041 ctgggaaaga ggtagtcaca gagaaagggt aagtgaaggt ttattctttc agtgaaaaaa 2101 gaatagccaa ttgagtgcct aatgagacct ctgtgtgaag caagtgaagt atagctgctt 2161 cttttaacct gccttttcac tgaatgttgg cagcatttag tagtagaaat gacagttgct 2221 taatgaaata gaatccaaac tacatatttg gataatagga ttactttatg tttatgttca 2281 gagttaacag aacaccttta atgctaagaa ctataaggta cagaaaatta atactttata 2341 tagtgtttta ttaactttct cctacagcat tttgtataaa acacaatgag ggagtgaaat 2401 gttacccaat taggcttgtc aggttagtaa taaactgaac agtaataaaa ctgtggaagt 2461 aattggatct gaatttatga aagacccatt tccaggactg aacctaggtc agagctctaa 2521 attggtcctt ctatttttca acaaatttaa agtaatattt ctttctaata taatattgca 2581 tcctttgtgg gaatgactat aggtaaaatg tagtaagtaa cgcagaacca gggttggctt 2641 tatttaaaag ctagtgacct aaatagaaag cgaacttcaa gagaagttgt aagtacagtg 2701 gcaaatgctt attacttact tcaaactgtt tcccaaaata agtgcattta ttttgacaat 2761 aaaacttaag gctgttcatg agaaggcctt gaaaagttac tctagaggaa aaatgtctaa 2821 agaaaaaaaa aattcaaaaa gtttacatta attattcagt gttgtgagta aataaaaatg 2881 tgtgctcttt actgtttttc atttttaaag aatattatta tggaagcacg atttatttaa 2941 ataggtacat tgagactttt ttttttaatg ttctgataca ttaggatgaa gttaaatctt 3001 aaatcttatt agttgaattg ttgtaaggac agtgatgtct ggtaacaaga tgtgactttt 3061 tggtagcact gttgtggttc attcttttca aatctatttt tgtttaaaaa caatacaagt 3121 tttagaaaac aaagcattaa aaaaaaagcc tatcagtatt atgggcaata tgtaaataaa 3181 taaatgtaat atttcatcct ttatttttca ggtaaaaggt catgctgtta caggtgtagt 3241 ttgtgtgcat aaataatact tccgaattaa attatttaat atttgactga tttcaataac 3301 tgtgaaaata aaaaggtgtt gtattgcttg tgag // LOCUS HUMH1T 1759 bp DNA PRI 18-OCT-1991 DEFINITION Human testicular H1 histone (H1) gene, complete cds. ACCESSION M60094 NID g183750 KEYWORDS histone H1. SOURCE Human leukocyte DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1759) AUTHORS Drabent,B., Kardalinou,E. and Doenecke,D. TITLE Structure and expression of the human gene encoding testicular H1 histone (Hlt) JOURNAL Gene 103, 263-268 (1991) MEDLINE 91365256 FEATURES Location/Qualifiers source 1..1759 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="leukocyte" mRNA 487..1211 /evidence=experimental gene 530..1153 /gene="H1t" CDS 530..1153 /gene="H1t" /codon_start=1 /product="testicular H1 histone" /db_xref="PID:g183751" /translation="MSETVPAASASAGLAAMEKLPTKKRGRKPAGLISASRKVPNLSV SKLITEALSVSQERVGMSLVALKKALAAAGYDVEKNNSRIKLSLKSLVNKGILVQTRG TGASGSFKLSKKVIPKSTRSKAKKSVSAKTKKLVLSRDSKSPKTAKTNKRAKKPRATT PKTVRSGRKAKGAKGKQQQKSPVKARASKSKLTQHHEVNVRKATSKK" BASE COUNT 509 a 391 c 418 g 441 t ORIGIN 1 gtcactccgc aattagacag ctaagagatc tgtgttactt ccctcacata tataaataat 61 tttaaataaa aatcatggcg tgaataattt ctttcctcta ccgatttgaa gctatccatt 121 tggaagacca ctctgaagag atgaaataag tcttctgcca aagattactt attaatttac 181 aaggaaaagg ggaagttttg ttcctctccg tgaatttgat tgaaaatcga gggctttctc 241 gaatagtttt ggcatccagg gtcatttttc attaaaaaga gaaaagtcat gtcaaatatg 301 aatttccgca gattattcag cactagaccc tgggagattc tgtaaagagg ggttttgtta 361 tactcaactt ttccgggtaa aacaaacaca aatactcctc ctccaagggg cgggggcggt 421 gcctaggtga tgcaccaatc acagcgcgcc ctaccctata taagccccga ggccgcccgg 481 gtgtttcatg cttttcgctg gttattacat cttgcgtttc tctgttgtta tgtctgaaac 541 cgtgcctgca gcttctgcca gtgctggtct agccgctatg gagaaacttc caaccaagaa 601 gcgagggagg aagccggctg gcttgataag tgcaagtcgc aaagtgccga acctctctgt 661 gtccaagttg atcaccgagg ccctttcagt gtcacaggaa cgagtaggta tgtctttggt 721 tgcgctcaag aaggcattgg ccgctgctgg ctacgacgta gagaagaata acagccgcat 781 caaactgtcc ctcaagagct tagtgaacaa gggaatcctg gtgcaaacca ggggtactgg 841 tgcttccggt tcctttaagc ttagtaagaa ggtgattcct aaatctacac gaagcaaggc 901 taaaaagtca gtttctgcca agaccaagaa gctggtttta tccagggact ccaagtcacc 961 aaagactgct aaaaccaata agagagccaa gaagccgaga gcgacaactc ctaaaactgt 1021 taggagcggg agaaaggcta aaggagccaa gggtaagcaa cagcagaaga gcccagtgaa 1081 ggcaagggct tcgaagtcaa aattgaccca acatcatgaa gttaatgtta gaaaggccac 1141 atctaagaag taaagagctt tccgggaggc caatttggaa agaacccaaa ggctctttta 1201 agagccaccc acattatttt aagatggcgt aacactggaa acaagtttct gtgacagtta 1261 tctataggtt taagttgtga tgcagctgag ttgaaaaggc ttgagattgg agaattaatt 1321 caggccaggc ttcaagacca tcctgggcaa catagccaga ctaccatcta taccaggggt 1381 cctcattccc ccggccaccg accggtaacc ggtccctgtc catggcacgt tatgaattga 1441 gccgcacagc tgaggggtga gcgaacatta accaactgag ctccaccgcc tgtcaggtta 1501 gctgcagcat tagatagatt ctcataagct caaactgtat tgtgaatggc acatgcaagg 1561 gatctaggtt tcaggctcct tgtgacaatc taatgcctga tgatctgagg ttggagcagt 1621 tttagtccgg aaatcattgc tcccagcccc tgcaccccct ggtccgtggt ataattgtct 1681 tacacaaacg gtctcttgtg tcaaaaaggt tggagactac tggtttttac aaaaaagtaa 1741 attagtcaag catggttgg // LOCUS HUMHA3G 1741 bp mRNA PRI 24-JAN-1994 DEFINITION Human glycoprotein mRNA, complete cds. ACCESSION M80927 NID g348911 KEYWORDS glycoprotein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1741) AUTHORS Hakala,B.E., White,C. and Recklies,A.D. TITLE Human cartilage gp-39, a major secretory product of articular chondrocytes and synovial cells, is a mammalian member of a chitinase protein family JOURNAL J. Biol. Chem. 268 (34), 25803-25810 (1993) MEDLINE 94064658 FEATURES Location/Qualifiers source 1..1741 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="chondrocyte" /tissue_type="cartilage" 5'UTR 1..71 sig_peptide 72..134 CDS 72..1223 /note="articular 39kDa glycoprotein" /codon_start=1 /evidence=experimental /product="glycoprotein" /db_xref="PID:g348912" /translation="MGVKASQTGFVVLVLLQCCSAYKLVCYYTSWSQYREGDGSCFPD ALDRFLCTHIIYSFANISNDHIDTWEWNDVTLYGMLNTLKNRNPNLKTLLSVGGWNFG SQRFSKIASNTQSRRTFIKSVPPFLRTHGFDGLDLAWLYPGRRDKQHFTTLIKEMKAE FIKEAQPGKKQLLLSAALSAGKVTIDSSYDIAKISQHLDFISIMTYDFHGAWRGTTGH HSPLFRGQEDASPDRFSNTDYAVGYMLRLGAPASKLVMGIPTFGRSFTLASSETGVGA PISGPGIPGRFTKEAGTLAYYEICDFLRGATVHRTLGQQVPYATKGNQWVGYDDQESV KSKVQYLKDRQLAGAMVWALDLDDFQGSFCGQDLRFPLTNAIKDALAAT" mat_peptide 135..1220 /product="glycoprotein" 3'UTR 1224..1741 polyA_signal 1715..1720 polyA_site 1741 BASE COUNT 416 a 510 c 447 g 368 t ORIGIN 1 ctaggtagct ggcaccagga gccgtgggca agggaagagg ccacaccctg ccctgctctg 61 ctgcagccag aatgggtgtg aaggcgtctc aaacaggctt tgtggtcctg gtgctgctcc 121 agtgctgctc tgcatacaaa ctggtctgct actacaccag ctggtcccag taccgggaag 181 gcgatgggag ctgcttccca gatgcccttg accgcttcct ctgtacccac atcatctaca 241 gctttgccaa tataagcaac gatcacatcg acacctggga gtggaatgat gtgacgctct 301 acggcatgct caacacactc aagaacagga accccaacct gaagactctc ttgtctgtcg 361 gaggatggaa ctttgggtct caaagatttt ccaagatagc ctccaacacc cagagtcgcc 421 ggactttcat caagtcagta ccgccattcc tgcgcaccca tggctttgat gggctggacc 481 ttgcctggct ctaccctgga cggagagaca aacagcattt taccacccta atcaaggaaa 541 tgaaggccga atttataaag gaagcccagc cagggaaaaa gcagctcctg ctcagcgcag 601 cactgtctgc ggggaaggtc accattgaca gcagctatga cattgccaag atatcccaac 661 acctggattt cattagcatc atgacctacg attttcatgg agcctggcgt gggaccacag 721 gccatcacag tcccctgttc cgaggtcagg aggatgcaag tcctgacaga ttcagcaaca 781 ctgactatgc tgtggggtac atgttgaggc tgggggctcc tgccagtaag ctggtgatgg 841 gcatccccac cttcgggagg agcttcactc tggcttcttc tgagactggt gttggagccc 901 caatctcagg accgggaatt ccaggccggt tcaccaagga ggcagggacc cttgcctact 961 atgagatctg tgacttcctc cgcggagcca cagtccatag aaccctcggc cagcaggtcc 1021 cctatgccac caagggcaac cagtgggtag gatacgacga ccaggaaagc gtcaaaagca 1081 aggtgcagta cctgaaggat aggcagctgg caggcgccat ggtatgggcc ctggacctgg 1141 atgacttcca gggctccttc tgcggccagg atctgcgctt ccctctcacc aatgccatca 1201 aggatgcact cgctgcaacg tagccctctg ttctgcacac agcacggggg ccaaggatgc 1261 cccgtccccc tctggctcca gctggccggg agcctgatca cctgccctgc tgagtcccag 1321 gctgagcctc agtctccctc ccttggggcc tatgcagagg tccacaacac acagatttga 1381 gctcagccct ggtgggcaga gaggtaggga tggggctgtg gggatagtga ggcatcgcaa 1441 tgtaagactc gggattagta cacacttgtt gatgattaat ggaaatgttt acagatcccc 1501 aagcctggca agggaatttc ttcaactccc tgccccctag ccctccttat caaaggacac 1561 cattttggca agctctatca ccaaggagcc aaacatccta caagacacag tgaccatact 1621 aattataccc cctgcaaagc cagcttgaaa ccttcactta ggaacgtaat cgtgtcccct 1681 atcctacttc cccttcctaa ttccacagct gctcaataaa gtacaagagt ttaacagtgt 1741 g // LOCUS HUMHAAA 1266 bp mRNA PRI 21-MAY-1991 DEFINITION Human factor H homologue mRNA, complete cds. ACCESSION M65292 NID g183762 KEYWORDS factor H. SOURCE Human liver, cDNA to mRNA, clone pFH1.4a. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1266) AUTHORS Estaller,C., Koistinen,V., Schwaeble,W., Dierich,M.P. and Weiss,E.H. TITLE Cloning of the 1.4-kb mRNA species of human complement factor H reveals a novel member of the short consensus repeat family related to the carboxy terminal of the classical 150-kDa molecule JOURNAL J. Immunol. 146, 3190-3196 (1991) MEDLINE 91201892 FEATURES Location/Qualifiers source 1..1266 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" CDS 78..1070 /codon_start=1 /product="factor H homologue" /db_xref="PID:g183763" /translation="MWLLVSVILISRISSVGGEATFCDFPKINHGILYDEEKYKPFSQ VPTGEVFYYSCEYNFVSPSKSFWTRITCTEEGWSPTPKCLRLCFFPFVENGHSESSGQ THLEGDTVQIICNTGYRLQNNENNISCVERGWSTPPKCRSTDTSCVNPPTVQNAYIVS RQMSKYPSGERVRYQCRSPYEMFGDEEVMCLNGNWTEPPQCKDSTGKCGPPPPIDNGD ITSFPLSVYAPASSVEYQCQNLYQLEGNKRITCRNGQWSEPPKCLHPCVISREIMENY NIALRWTAKQKLYLRTGESAEFVCKRGYRLSSRSHTLRTTCWDGKLEYPTCAKR" BASE COUNT 414 a 237 c 249 g 366 t ORIGIN 1 tgttaatgaa agcagattca aagcaacacc accaccactg aagtattttt agttatataa 61 gattggaact accaagcatg tggctcctgg tcagtgtaat tctaatctca cggatatcct 121 ctgttggggg agaagcaaca ttttgtgatt ttccaaaaat aaaccatgga attctatatg 181 atgaagaaaa atataagcca ttttcccagg ttcctacagg ggaagttttc tattactcct 241 gtgaatataa ttttgtgtct ccttcaaaat cattttggac tcgcataaca tgcacagaag 301 aaggatggtc accaacacca aagtgtctca gactgtgttt ctttcctttt gtggaaaatg 361 gtcattctga atcttcagga caaacacatc tggaaggtga tactgtgcaa attatttgca 421 acacaggata cagacttcaa aacaatgaga acaacatttc atgtgtagaa cggggctggt 481 ccacccctcc caaatgcagg tccactgaca cttcctgtgt gaatccgccc acagtacaaa 541 atgcttatat agtgtcgaga cagatgagta aatatccatc tggtgagaga gtacgttatc 601 aatgtaggag cccttatgaa atgtttgggg atgaagaagt gatgtgttta aatggaaact 661 ggacggaacc acctcaatgc aaagattcta cgggaaaatg tgggccccct ccacctattg 721 acaatgggga cattacttca ttcccgttgt cagtatatgc tccagcttca tcagttgagt 781 accaatgcca gaacttgtat caacttgagg gtaacaagcg aataacatgt agaaatggac 841 aatggtcaga accaccaaaa tgcttacatc cgtgtgtaat atcccgagaa attatggaaa 901 attataacat agcattaagg tggacagcca aacagaagct ttatttgaga acaggtgaat 961 cagctgaatt tgtgtgtaaa cggggatatc gtctttcatc acgttctcac acattgcgaa 1021 caacatgttg ggatgggaaa ctggagtatc caacttgtgc aaaaagatag aatcaatcat 1081 aaaatgcaca cctttattca gaactttagt attaaatcag ttcttaattt aatttttaag 1141 tattgtttta ctccttttta ttcatacgta aaattttgga ttaatttgtg aaaatgtaat 1201 tataagctga gaccggtggc tctcttctta aaagcaccat attaaaactt ggaaaactgg 1261 aaaact // LOCUS HUMHBEGF 2360 bp mRNA PRI 07-MAR-1991 DEFINITION Human heparin-binding EGF-like growth factor mRNA, complete cds. ACCESSION M60278 NID g183866 KEYWORDS heparin-binding EGF-like growth factor. SOURCE Human histiocytic lymphoma derived cell, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2360) AUTHORS Higashiyama,S., Abraham,J.A., Miller,J.L., Fiddes,J.C. and Klagsbrun,M. TITLE A heparin-binding growth factor secreted by macrophage-like cells that is related to EGF JOURNAL Science 251, 936-939 (1991) MEDLINE 91157008 FEATURES Location/Qualifiers source 1..2360 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="histiocytic lymphoma derived" sig_peptide 262..318 /note="putative" CDS 262..888 /note="putative" /codon_start=1 /product="heparin-binding EGF-like growth factor" /db_xref="PID:g183867" /translation="MKLLPSVVLKLFLAAVLSALVTGESLERLRRGLAAGTSNPDPPT VSTDQLLPLGGGRDRKVRDLQEADLDLLRVTLSSKPQALATPNKEEHGKRKKKGKGLG KKRDPCLRKYKDFCIHGECKYVKELRAPSCICHPGYHGERCHGLSLPVENRLYTYDHT TILAVVAVVLSSVCLLVIVGLLMFRYHRRGGYDVENEEKVKLGMTNSH" mat_peptide 481..702 /evidence=experimental /product="heparin-binding EGF-like growth factor" BASE COUNT 599 a 579 c 605 g 577 t ORIGIN 1 gctacgcggg ccacgctgct ggctggcctg acctaggcgc gcggggtcgg gcggccgcgc 61 gggcgggctg agtgagcaag acaagacact caagaagagc gagctgcgcc tgggtcccgg 121 ccaggcttgc acgcagaggc gggcggcaga cggtgcccgg cggaatctcc tgagctccgc 181 cgcccagctc tggtgccagc gcccagtggc cgccgcttcg aaagtgactg gtgcctcgcc 241 gcctcctctc ggtgcgggac catgaagctg ctgccgtcgg tggtgctgaa gctctttctg 301 gctgcagttc tctcggcact ggtgactggc gagagcctgg agcggcttcg gagagggcta 361 gctgctggaa ccagcaaccc ggaccctccc actgtatcca cggaccagct gctaccccta 421 ggaggcggcc gggaccggaa agtccgtgac ttgcaagagg cagatctgga ccttttgaga 481 gtcactttat cctccaagcc acaagcactg gccacaccaa acaaggagga gcacgggaaa 541 agaaagaaga aaggcaaggg gctagggaag aagagggacc catgtcttcg gaaatacaag 601 gacttctgca tccatggaga atgcaaatat gtgaaggagc tccgggctcc ctcctgcatc 661 tgccacccgg gttaccatgg agagaggtgt catgggctga gcctcccagt ggaaaatcgc 721 ttatatacct atgaccacac aaccatcctg gccgtggtgg ctgtggtgct gtcatctgtc 781 tgtctgctgg tcatcgtggg gcttctcatg tttaggtacc ataggagagg aggttatgat 841 gtggaaaatg aagagaaagt gaagttgggc atgactaatt cccactgaga gagacttgtg 901 ctcaaggaat cggctgggga ctgctacctc tgagaagaca caaggtgatt tcagactgca 961 gaggggaaag acttccatct agtcacaaag actccttcgt ccccagttgc cgtctaggat 1021 tgggcctccc ataattgctt tgccaaaata ccagagcctt caagtgccaa acagagtatg 1081 tccgatggta tctgggtaag aagaaagcaa aagcaaggga ccttcatgcc cttctgattc 1141 ccctccacca aaccccactt cccctcataa gtttgtttaa acacttatct tctggattag 1201 aatgccggtt aaattccata tgctccagga tctttgactg aaaaaaaaaa agaagaagaa 1261 gaaggagagc aagaaggaaa gatttgtgaa ctggaagaaa gcaacaaaga ttgagaagcc 1321 atgtactcaa gtaccaccaa gggatctgcc attgggaccc tccagtgctg gatttgatga 1381 gttaactgtg aaataccaca agcctgagaa ctgaattttg ggacttctac ccagatggaa 1441 aaataacaac tatttttgtt gttgttgttt gtaaatgcct cttaaattat atatttattt 1501 tattctatgt atgttaattt atttagtttt taacaatcta acaataatat ttcaagtgcc 1561 tagactgtta ctttggcaat ttcctggccc tccactcctc atccccacaa tctggcttag 1621 tgccacccac ctttgccaca aagctaggat ggttctgtga cccatctgta gtaatttatt 1681 gtctgtctac atttctgcag atcttccgtg gtcagagtgc cactgcggga gctctgtatg 1741 gtcaggatgt aggggttaac ttggtcagag ccactctatg agttggactt cagtcttgcc 1801 taggcgattt tgtctaccat ttgtgttttg aaagcccaag gtgctgatgt caaagtgtaa 1861 cagatatcag tgtctccccg tgtcctctcc ctgccaagtc tcagaagagg ttgggcttcc 1921 atgcctgtag ctttcctggt ccctcacccc catggcccca ggccacagcg tgggaactca 1981 ctttcccttg tgtcaagaca tttctctaac tcctgccatt cttctggtgc tactccatgc 2041 aggggtcagt gcagcagagg acagtctgga gaaggtatta gcaaagcaaa aggctgagaa 2101 ggaacaggga acattggagc tgactgttct tggtaactga ttacctgcca attgctaccg 2161 agaaggttgg aggtggggaa ggctttgtat aatcccaccc acctcaccaa aacgatgaag 2221 gtatgctgtc atggtccttt ctggaagttt ctggtgccat ttctgaactg ttacaacttg 2281 tatttccaaa cctggttcat atttatactt tgcaatccaa ataaagataa cccttattcc 2341 ataaaaaaaa aaaaaaaaaa // LOCUS HUMHBLOD 3373 bp mRNA PRI 08-NOV-1994 DEFINITION Human GDP-L-fucose:beta-D-galactoside 2-alpha-l-fucosyltransferase mRNA, complete cds. ACCESSION M35531 NID g183887 KEYWORDS GDP-L-fucose:beta-D-galactoside 2-alpha-l-fucosyltransferase. SOURCE Human epidermal carcinoma cell line A431, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3373) AUTHORS Larsen,R.D., Ernst,L.K., Nair,R.P. and Lowe,J.B. TITLE Molecular cloning, sequence, and expression of a human GDP-L-fucose:beta-D-galactoside 2-alpha-L-fucosyltransferase cDNA that can form the H blood group antigen JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87 (17), 6674-6678 (1990) MEDLINE 90370848 COMMENT Draft entry and computer-readable [or printed] sequence for [Proc. Natl. Acad. Sci. U.S.A. (1990) In press] kindly submitted by J.B.Lowe, 22-JUN-1990. FEATURES Location/Qualifiers source 1..3373 /organism="Homo sapiens" /db_xref="taxon:9606" /map="11q" gene 104..2385 /gene="FCT3A" CDS 104..1201 /gene="FCT3A" /note="GDP-L-fucose:beta-D-galactoside 2-alpha-L-fucosyltransferase" /codon_start=1 /db_xref="GDB:2410" /db_xref="PID:g306830" /translation="MWLRSHRQLCLAFLLVCVLSVIFFLHIHQDSFPHGLGLSILCPD RRLVTPPVAIFCLPGTAMGPNASSSCPQHPASLSGTWTVYPNGRFGNQMGQYATLLAL AQLNGRRAFILPAMHAALAPVFRITLPVLAPEVDSRTPWRELQLHDWMSEEYADLRDP FLKLSGFPCSWTFFHHLREQIRREFTLHDHLREEAQSVLGQLRLGRTGDRPRTFVGVH VRRGDYLQVMPQRWKGVVGDSAYLRQAMDWFRARHEAPVFVVTSNGMEWCKENIDTSQ GDVTFAGDGQEATPWKDFALLTQCNHTIMTIGTFGFWAAYLAGGDTVYLANFTLPDSE FLKIFKPEAAFLPEWVGINADLSPLWTLAKP" misc_feature 1744..2385 /gene="FCT3A" /note="Alu sequence homologue; 2410; putative" BASE COUNT 687 a 925 c 905 g 856 t ORIGIN 1 gcctggcgtt ccaggggcgg ccggatgtgg cctgcctttg cggagggtgc gctccggcca 61 cgaaaagcgg actgtggatc tgccacctgc aagcagctcg gccatgtggc tccggagcca 121 tcgtcagctc tgcctggcct tcctgctagt ctgtgtcctc tctgtaatct tcttcctcca 181 tatccatcaa gacagctttc cacatggcct aggcctgtcg atcctgtgtc cagaccgccg 241 cctggtgaca cccccagtgg ccatcttctg cctgccgggt actgcgatgg gccccaacgc 301 ctcctcttcc tgtccccagc accctgcttc cctctccggc acctggactg tctaccccaa 361 tggccggttt ggtaatcaga tgggacagta tgccacgctg ctggctctgg cccagctcaa 421 cggccgccgg gcctttatcc tgcctgccat gcatgccgcc ctggccccgg tattccgcat 481 caccctgccc gtgctggccc cagaagtgga cagccgcacg ccgtggcggg agctgcagct 541 tcacgactgg atgtcggagg agtacgcgga cttgagagat cctttcctga agctctctgg 601 cttcccctgc tcttggactt tcttccacca tctccgggaa cagatccgca gagagttcac 661 cctgcacgac caccttcggg aagaggcgca gagtgtgctg ggtcagctcc gcctgggccg 721 cacaggggac cgcccgcgca cctttgtcgg cgtccacgtg cgccgtgggg actatctgca 781 ggttatgcct cagcgctgga agggtgtggt gggcgacagc gcctacctcc ggcaggccat 841 ggactggttc cgggcacggc acgaagcccc cgttttcgtg gtcaccagca acggcatgga 901 gtggtgtaaa gaaaacatcg acacctccca gggcgatgtg acgtttgctg gcgatggaca 961 ggaggctaca ccgtggaaag actttgccct gctcacacag tgcaaccaca ccattatgac 1021 cattggcacc ttcggcttct gggctgccta cctggctggc ggagacactg tctacctggc 1081 caacttcacc ctgccagact ctgagttcct gaagatcttt aagccggagg cggccttcct 1141 gcccgagtgg gtgggcatta atgcagactt gtctccactc tggacattgg ctaagccttg 1201 agagccaggg agactttctg aagtagcctg atctttctag agccagcagt acgtggcttc 1261 agaggcctgg catcttctgg agaagcttgt ggtgttcctg aagcaaatgg gtgcccgtat 1321 ccagagtgat tctagttggg agagttggag agaaggggga cgtttctgga actgtctgaa 1381 tattctagaa ctagcaaaac atcttttcct gatggctggc aggcagttct agaagccaca 1441 gtgcccacct gctcttccca gcccatatct acagtacttc cagatggctg cccccaggaa 1501 tggggaactc tccctctggt ctactctaga agaggggtta cttctcccct gggtcctcca 1561 aagactgaag gagcatatga ttgctccaga gcaagcattc accaagtccc cttctgtgtt 1621 tctggagtga ttctagaggg agacttgttc tagagaggac caggtttgat gcctgtgaag 1681 aaccctgcag ggcccttatg gacaggatgg ggttctggaa atccagataa ctaaggtgaa 1741 gaatcttttt agtttttttt tttttttttt ggagacaggg tctcgctctg ttgcccaggc 1801 tggagtgcag tggcgtgatc ttggctcact gcaacttccg cctcctgtgt tcaagcgatt 1861 ctcctgtctc agcctcctga gtagatggga ctacaggcac aggccattat gcctggctaa 1921 tttttgtatt tttagtagag acagggtttc accatgttgg ccgggatggt ctcgatctcc 1981 tgaccttgtc atccacctgt cttggcctcc caaagtgctg ggattactgg catgagccac 2041 tgtgcccagc ccggatattt ttttttaatt atttatttat ttatttattt attgagacgg 2101 agtcttgctc tgtagcccag gccagagtgc agtggcgcga tctcagctca ctgcaagctc 2161 tgcctcccgg gttcatgcca ttctgcctca gcctcctgag tagctgggac tacaggcgcc 2221 cgccaccacg cccggctaat tttttttgta tttttagtag agacggggtt tcatcgtgtt 2281 aaccaggatg gtctcgatct cctgacctcg tgatctgccc acctcggcct cccacagtgc 2341 tgggattacc ggcgtgagcc accatgcctg gcccggataa ttttttttaa tttttgtaga 2401 gacgaggtct tgtgatattg cccaggctgt tcttcaactc ctgggctcaa gcagtcctcc 2461 caccttggcc tcccagaatg ctgggtttat agatgtgagc cagcacaccg ggccaagtga 2521 agaatctaat gaatgtgcaa cctaattgta gcatctaatg aatgttccac cattgctgga 2581 aaaattgaga tggaaaacaa accatctcta gttggccagc gtcttgctct gttcacagtc 2641 tctggaaaag ctggggtagt tggtgagcag agcgggactc tgtccaacaa gccccacagc 2701 ccctcaaaga cttttttttg tttgttttga gcagacaggc taaaatgtga acgtggggtg 2761 agggatcact gccaaaatgg tacagcttct ggagcagaac tttccaggga tccagggaca 2821 ctttttttta aagctcataa actgccaaga gctccatata ttgggtgtga gttcaggttg 2881 cctctcacaa tgaaggaagt tggtctttgt ctgcaggtgg gctgctgagg gtctgggatc 2941 tgttttctgg aagtgtgcag gtataaacac accctctgtg cttgtgacaa actggcaggt 3001 accgtgctca ttgctaacca ctgtctgtcc ctgaactccc agaaccacta catctggctt 3061 tgggcaggtc tgagataaaa cgatctaaag gtaggcagac cctggaccca gcctcagatc 3121 caggcaggag cacgaggtct ggccaaggtg gacggggttg tcgagatctc aggagcccct 3181 tgctgttttt tggagggtga aagaagaaac cttaaacata gtcagctctg atcacatccc 3241 ctgtctactc atccagaccc catgcctgta ggcttatcag ggagttacag ttacaattgt 3301 tacagtactg ttcccaactc agctgccacg ggtgagagag caggaggtat gaattaaaag 3361 tctacagcac taa // LOCUS HUMHBP 4354 bp mRNA PRI 24-JUL-1992 DEFINITION Human high density lipoprotein binding protein (HBP) mRNA, complete cds. ACCESSION M64098 M83789 NID g183891 KEYWORDS HX protein homologue; high density lipoprotein binding protein; tandem amphipathic repeat. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4354) AUTHORS McKnight,G.L., Reasoner,J., Gilbert,T., Sundquist,K.O., Hokland,B., McKernan,P.A., Champagne,J., Johnson,C.J., Bailey,M.C., Holly,R., O'Hara,P.J. and Oram,J.F. TITLE cloning and expression of a cellular high density liporotein-binding protein that is up-regulated by cholesterol loading of cells JOURNAL J. Biol. Chem. 267, 12131-12141 (1992) MEDLINE 92291094 FEATURES Location/Qualifiers source 1..4354 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="human erythroleukemic (Hel) cells" gene 155..3961 /gene="HBP" CDS 155..3961 /gene="HBP" /codon_start=1 /product="high density lipoprotein binding protein" /db_xref="PID:g183892" /translation="MSSVAVLTQESFAEHRSGLVPQQIKVATLNSEEESDPPTYKDAF PPLPEKAACLESAQEPAGAWGNKIRPIKASVITQVFHVPLEERKYKDMNQFGEGEQAK ICLEIMQRTGAHLELSLAKDQGLSIMVSGKLDAVMKARKDIVARLQTQASATVAIPKE HHRFVIGKNGEKLQDLELKTATKIQIPRPDDPSNQIKITGTKEGIEKARHEVLLISAE QDKRAVERLEVEKAFHPFIAGPYNRLVGEIMQETGTRINIPPPSVNRTEIVFTGEKEQ LAQAVARIKKIYEEKKKKTTTIAVEVKKSQHKYVIGPKGNSLQEILERTGVSVEIPPS DSISETVILRGEPEKLGQALTEVYAKANSFTVSSVAAPSWLHRFIIGKKGQNLAKITQ QMPKVHIEFTEGEDKITLEGPTEDVNVAQEQIEGMVKDLINRMDYVEINIDHKFHRHL IGKSGANINRIKDQYKVSVRIPPDSEKSNLIRIEGDPQGVQQAKRELLELASRMENER TKDLIIEQRFHRTIIGQKGERIREIRDKFPEVIINFPDPAQKSDIVQLRGPKNEVEKC TKYMQKMVADLVENSYSISVPIFKQFHKNIIGKGGANIKKIREESNTKIDLPAENSNS ETIIITGKRANCEAARSRILSIQKDLANIAEVEVSIPAKLHNSLIGTKGRLIRSIMEE CGGVHIHFPVEGSGSDTVVIRGPSSDVEKAKKQLLHLAEEKQTKSFTVDIRAKPEYHK FLIGKGGGKIRKVRDSTGARVIFPAAEDKDQDLITIIGKEDAVREAQKELEALIQNLD NVVEDSMLVDPKHHRHFVIRRGQVLREIAEEYGGVMVSFPRSGTQSDKVTLKGAKDCV EAAKKRIQEIIEDLEAQVTLECAIPQKFHRSVMGPKGSRIQQITRDFSVQIKFPDREE NAVHSTEPVVQENGDEAGEGREAKDCDPGSPRRCDIIIISGRKEKCEAAKEALEALVP VTIEVEVPFDLHRYVIGQKGSGIRKMMDEFEVNIHVPAPELQSDIIAITGLAANLDRA KAGLLERVKELQAEQEDRALRSFKLSVTVDPKYHPKIIGRKGAVITQIRLEHDVNIQF PDKDDGNQPQDQITITGYEKNTEAARDAILRIVGELEQMVSEDVPLDHRVHARIIGAR GKAIRKIMDEFKVDIRFPQSGAPDPNCVTVTGLPENVEEAIDHILNLEEEYLADVVDS EALQVYMKPPAHEEAKAPSRGFVVRDAPWTASSSEKAPDMSSSEEFPSFGAQVAPKTL PWGPKR" BASE COUNT 1209 a 1124 c 1180 g 841 t ORIGIN 1 gaattcgggg ggcgagtaag ccagcggcag gaccagcggg cgggggccac aacaaaagct 61 ggcaggctga cagaggcggc ctcaggacgg accttctggc tactgaccgt tttgctgtgg 121 ttttcccgga ttgtgtgtag gtgtgagatc aaccatgagt tccgttgcag ttttgaccca 181 agagagtttt gctgaacacc gaagtgggct ggttccgcaa caaatcaaag ttgccactct 241 aaattcagaa gaggagagcg accctccaac ctacaaggat gccttccctc cacttcctga 301 gaaagctgct tgcctggaaa gtgcccagga acccgctgga gcctggggga acaagatccg 361 acccatcaag gcttctgtca tcactcaggt gttccatgta cccctggagg agagaaaata 421 caaggatatg aaccagtttg gagaaggtga acaagcaaaa atctgccttg agatcatgca 481 gagaactggt gctcacttgg agctgtcttt ggccaaagac caaggcctct ccatcatggt 541 gtcaggaaag ctggatgctg tcatgaaagc tcggaaggac attgttgcta gactgcagac 601 tcaggcctca gcaactgttg ccattcccaa agaacaccat cgctttgtta ttggcaaaaa 661 tggagagaaa ctgcaagact tggagctaaa aactgcaacc aaaatccaga tcccacgccc 721 agatgacccc agcaatcaga tcaagatcac tggcaccaaa gagggcatcg agaaagctcg 781 ccatgaagtc ttactcatct ctgccgagca ggacaaacgt gctgtggaga ggctagaagt 841 agaaaaggca ttccacccct tcatcgctgg gccgtataat agactggttg gcgagatcat 901 gcaggagaca ggcacgcgca tcaacatccc cccacccagc gtgaaccgga cagagattgt 961 cttcactgga gagaaggaac agttggctca ggctgtggct cgcatcaaga agatttatga 1021 ggagaagaaa aagaagacta caaccattgc agtggaagtg aagaaatccc aacacaagta 1081 tgtcattggg cccaagggca attcattgca ggagatcctt gagagaactg gagtttccgt 1141 tgagatccca ccctcagaca gcatctctga gactgtaata cttcgaggcg aacctgaaaa 1201 gttaggtcag gcgttgactg aagtctatgc caaggccaat agcttcaccg tctcctctgt 1261 cgccgcccct tcctggcttc accgtttcat cattggcaag aaagggcaga acctggccaa 1321 aatcactcag cagatgccaa aggttcacat cgagttcaca gagggcgaag acaagatcac 1381 cctggagggc cctacagagg atgtcaatgt ggcccaggaa cagatagaag gcatggtcaa 1441 agatttgatt aaccggatgg actatgtgga gatcaacatc gaccacaagt tccacaggca 1501 cctcattggg aagagcggtg ccaacataaa cagaatcaaa gaccagtaca aggtgtccgt 1561 gcgcatccct cctgacagtg agaagagcaa tttgatccgc atcgaggggg acccacaggg 1621 cgtgcagcag gccaagcgag agctgctgga gcttgcatct cgcatggaaa atgagcgtac 1681 caaggatcta atcattgagc aaagatttca tcgcacaatc attgggcaga agggtgaacg 1741 gatccgtgaa attcgtgaca aattcccaga ggtcatcatt aactttccag acccagcaca 1801 aaaaagtgac attgtccagc tcagaggacc taagaatgag gtggaaaaat gcacaaaata 1861 catgcagaag atggtggcag atctggtgga aaatagctat tcaatttctg ttccgatctt 1921 caaacagttt cacaagaata tcattgggaa aggaggcgca aacattaaaa agattcgtga 1981 agaaagcaac accaaaatcg accttccagc agagaatagc aattcagaga ccattatcat 2041 cacaggcaag cgagccaact gcgaagctgc ccggagcagg attctgtcta ttcagaaaga 2101 cctggccaac atagccgagg tagaggtctc catccctgcc aagctgcaca actccctcat 2161 tggcaccaag ggccgtctga tccgctccat catggaggag tgcggcgggg tccacattca 2221 ctttcccgtg gaaggttcag gaagcgacac cgttgttatc aggggccctt cctcggatgt 2281 ggagaaggcc aagaagcagc tcctgcatct ggcggaggag aagcaaacca agagtttcac 2341 tgttgacatc cgcgccaagc cagaatacca caaattcctc atcggcaagg ggggcggcaa 2401 aattcgcaag gtgcgcgaca gcactggagc acgtgtcatc ttccctgcgg ctgaggacaa 2461 ggaccaggac ctgatcacca tcattggaaa ggaggacgcc gtccgagagg cacagaagga 2521 gctggaggcc ttgatccaaa acctggataa tgtggtggaa gactccatgc tggtggaccc 2581 caagcaccac cgccacttcg tcatccgcag aggccaggtc ttgcgggaga ttgctgaaga 2641 gtatggcggg gtgatggtca gcttcccacg ctctggcaca cagagcgaca aagtcaccct 2701 caagggcgcc aaggactgtg tggaggcagc caagaaacgc attcaggaga tcattgagga 2761 cctggaagct caggtgacat tagaatgtgc tataccccag aaattccatc gatctgtcat 2821 gggccccaaa ggttccagaa tccagcagat tactcgggat ttcagtgttc aaattaaatt 2881 cccagacaga gaggagaacg cagttcacag tacagagcca gttgtccagg agaatgggga 2941 cgaagctggg gaggggagag aggctaaaga ttgtgacccc ggctctccaa ggaggtgtga 3001 catcatcatc atctctggcc ggaaagaaaa gtgtgaggct gccaaggaag ctctggaggc 3061 attggttcct gtcaccattg aagtagaggt gccctttgac cttcaccgtt acgttattgg 3121 gcagaaagga agtgggatcc gcaagatgat ggatgagttt gaggtgaaca tacatgtccc 3181 ggcacctgag ctgcagtctg acatcatcgc catcacgggc ctcgctgcaa atttggaccg 3241 ggccaaggct ggactgctgg agcgtgtgaa ggagctacag gccgagcagg aggaccgggc 3301 tttaaggagt tttaagctga gtgtcactgt agaccccaaa taccatccca agattatcgg 3361 gagaaagggg gcagtaatta cccaaatccg gttggagcat gacgtgaaca tccagtttcc 3421 tgataaggac gatgggaacc agccccagga ccaaattacc atcacagggt acgaaaagaa 3481 cacagaagct gccagggatg ctatactgag aattgtgggt gaacttgagc agatggtttc 3541 tgaggacgtc ccgctggacc accgcgttca cgcccgcatc attggtgccc gcggcaaagc 3601 cattcgcaaa atcatggacg aattcaaggt ggacattcgc ttcccacaga gcggagcccc 3661 agaccccaac tgcgtcactg tgacggggct cccagagaat gtggaggaag ccatcgacca 3721 catcctcaat ctggaggagg aatacctagc tgacgtggtg gacagtgagg cgctgcaggt 3781 atacatgaaa cccccagcac acgaagaggc caaggcacct tccagaggct ttgtggtgcg 3841 ggacgcaccc tggaccgcca gcagcagtga gaaggctcct gacatgagca gctctgagga 3901 atttcccagc tttggggctc aggtggctcc caagaccctc ccttggggcc ccaaacgata 3961 atgatcaaaa agaacagaac cctctccagc ctgctgaccc gaacccaacc acacaatggt 4021 ttgtctcaat ctgacccagc ggctggaccc tccgtaaatt gttgagcgct cttccccttc 4081 ccgaggtccg cagggagcct agcgcctggc tgtgtgtgcg gccgctcctc caggcctggc 4141 cgtgcccgct caggacctgc tccactgttt aacaataaac caaggtcatg agcattcgag 4201 ctaagataac agactccagc tcctggtcca cccggcatgt cagtcagcac tctggccttc 4261 atcacgagag ctccgcagcc gtggctagga ttccacttcc tgtgtcatga cctcaggaaa 4321 taaacgtcct tgactttata aaagccccga attc // LOCUS HUMHCCA 2595 bp mRNA PRI 22-FEB-1994 DEFINITION Homo sapiens splicing factor (CC1.3) mRNA, complete cds. ACCESSION L10910 NID g405191 KEYWORDS splicing factor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2595) AUTHORS Imai,H., Chan,E.K., Kiyosawa,K., Fu,X.D. and Tan,E.M. TITLE Novel nuclear autoantigen with splicing factor motifs identified with antibody from hepatocellular carcinoma JOURNAL J. Clin. Invest. 92 (5), 2419-2426 (1993) MEDLINE 94043761 FEATURES Location/Qualifiers source 1..2595 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Hep G2" /cell_type="epithelial cell" /tissue_type="liver" gene 150..2578 /gene="CC1.3" CDS 150..1724 /gene="CC1.3" /note="putative" /codon_start=1 /product="splicing factor" /db_xref="PID:g405192" /translation="MADDIDIEAMLEAPYKKDENKLSSANGHEERSKKRKKSKSRSRS HERKRSKSKERKRSRDRERKKSKSRERKRSRSKERRRSRSRSRDRRFRGRYRSPYSGP KFNSAIRGKIGLPHSIKLSRRRSRSKSPFRKDKSPVREPIDNLTPEERDARTVFCMQL AARIRPRDLEEFFSTVGKVRDVRMISDRNSRRSKGIAYVEFVDVSSVPLAIGLTGQRV LGVPIIVQASQAEKNRAAAMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRI ESIQLMMDSETGRSKGYGFITFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASS ASSFLDSDELERTGIDLGTTGRLQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAD LQTRLSQQTEASALAAAASVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNK HGGVIHIYVDKNSAQGNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHNL FPDSMTATQLLVPSRR" polyA_signal 1804..1809 /gene="CC1.3" polyA_signal 1860..1865 /gene="CC1.3" polyA_signal 2573..2578 /gene="CC1.3" BASE COUNT 828 a 461 c 609 g 697 t ORIGIN 1 cgggctgggc ggttccgcgg cctgggccta ggggcttaac agtagcaaca gaagcggcgg 61 cggcggcagc agcagcagca gcagcagcaa tctcttcccg aacacgagca ccacaggcgc 121 ccgaaggccg gaacaggcgt ttagagaaaa tggcagacga tattgatatt gaagcaatgc 181 ttgaggctcc ttacaagaag gatgagaaca agttgagcag tgccaacggc catgaagaac 241 gtagcaaaaa gaggaaaaaa agcaagagca gaagtcgtag tcatgaacga aagagaagca 301 aaagtaagga acggaagcga agtagagaca gagaaaggaa aaagagcaaa agccgtgaaa 361 gaaagcgaag tagaagcaaa gagaggcgac ggagccgctc aagaagtcga gatcgaagat 421 ttagaggccg ctacagaagt ccttactccg gaccaaaatt taacagtgcc atccgaggaa 481 agattgggtt gcctcatagc atcaaattaa gcagacgacg ttcccgaagc aaaagtccat 541 tcagaaaaga caagagccct gtgagagaac ctattgataa tttaactcct gaggaaagag 601 atgcaaggac agtcttctgt atgcagctgg cggcaagaat tcgaccaagg gatttggaag 661 agtttttctc tacagtagga aaggttcgag atgtgaggat gatttctgac agaaattcaa 721 gacgttccaa aggaattgct tatgtggagt tcgtcgatgt tagctcagtg cctctagcaa 781 taggattaac tggccaacga gttttaggcg tgccaatcat agtacaggca tcacaggcag 841 aaaaaaacag agctgcagca atggcaaaca atttacaaaa gggaagtgct ggacctatga 901 ggctttatgt gggctcatta cacttcaaca taactgaaga tatgcttcgt gggatctttg 961 agccttttgg aagaattgaa agtatccagc tgatgatgga cagtgaaact ggtcgatcca 1021 agggatatgg atttattaca ttttctgact cagaatgtgc caaaaaggct ttggaacaac 1081 ttaatggatt tgaactagca ggaagaccaa tgaaagttgg tcatgttact gaacgtactg 1141 atgcttcgag tgctagttca tttttggaca gtgatgaact ggaaaggact ggaattgatt 1201 tgggaacaac tggtcgtctt cagttaatgg caagacttgc agagggtaca ggtttgcaga 1261 ttccgccagc agcacagcaa gctctacaga tgagtggctc tttggcattt ggtgctgtgg 1321 cagatttgca aacaagactt tcccagcaga ctgaagcttc agctttagct gcagctgcct 1381 ctgttcagcc acttgcaaca caatgtttcc aactctctaa catgtttaac cctcaaacag 1441 aagaagaagt tggatgggat accgagatta aggatgatgt gattgaagaa tgtaataaac 1501 atggaggagt tattcatatt tatgttgaca aaaattcagc tcagggcaat gtgtatgtga 1561 agtgcccatc aattgctgca gctattgctg ctgtcaatgc attgcatggc aggtggtttg 1621 ctggtaaaat gataacagca gcatatgtac ctcttccaac ttaccacaac ctgtttcctg 1681 attctatgac agcaacacag ctactggttc caagtagacg atgaaggaag atatagtccc 1741 ttatgtatat agcttttttt ctttcttgag aattcatctt gagttatctt ttatttagat 1801 aaaaataaag aggcaaggat ctactgtcat ttgtatgcaa tttcctgtta ccttgaaaaa 1861 ataaaaatgt taacaggaat gcagtgtgct cattctccct aaatagtaaa tcccactgta 1921 tacaaaactg ttctcttgtt ctgcctttta aaatgttcat gtagaaaatt aatgaactat 1981 aggaatagct ctaggagaac aaatgtgctt tctgtaaaaa ggcagaccag ggatgtaatg 2041 tttttaatgt ttcagaagcc taacttttta cacagtggtt acatttcaca tttcactaat 2101 gttgatattt ggctgatggt tgagcagttt ctgaaataca catttagtgt atggaaatac 2161 aagacagcta aagggctgtt tggttagcat ctcatcttgc attctgatca attggcaaga 2221 aagggagatt tcaaaattat atttcttgat ggtatctttt caattaatgt atctgtaaaa 2281 gtttctttgt aaatactatg tgttctggtg tgtcttaaaa ttccaaacaa aatgatccct 2341 gcatttcctg aagatgttta aacgtgagag tctggtaggc aaagcagtct gagaaagaaa 2401 taggaaatgc agaaataggt tttgtctggt tgcatataat ctttgctctt tttaagctct 2461 gtgagctctg aaatatattt ttgggttact tcagtgtgtt tgacaagaca gcttgatatt 2521 tctatcaaac aaatgacttt catattgcaa caatctttgt aagaaccact caaataaaag 2581 tctcttaaaa aggcc // LOCUS HUMHDGF 2376 bp mRNA PRI 13-JUN-1996 DEFINITION Human mRNA for hepatoma-derived growth factor, complete cds. ACCESSION D16431 NID g598955 KEYWORDS hepatoma-derived GF; hepatoma-derived growth factor. SOURCE Homo sapiens adult hepatoma hepatocyte cell_line:HuH-7 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2376) AUTHORS Nakamura,H., Izumoto,Y., Kambe,H., Kuroda,T., Mori,T., Kawamura,K., Yamamoto,H. and Kishimoto,T. TITLE Molecular cloning of complementary DNA for a novel human hepatoma-derived growth factor. Its homology with high mobility group-1 protein JOURNAL J. Biol. Chem. 269 (40), 25143-25149 (1994) MEDLINE 95014294 REFERENCE 2 (bases 1 to 2376) AUTHORS Izumoto,Y. TITLE Direct Submission JOURNAL Submitted (10-JUN-1993) to the DDBJ/EMBL/GenBank databases. Yoshitaka Izumoto, Osaka University Medical School, 3rd Department of Medicine; Yamadaoka 2-2, Suita, Osaka 553, Japan (Tel:06-879-3837, Fax:06-879-3839) FEATURES Location/Qualifiers source 1..2376 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HuH-7" /cell_type="hepatocyte" /dev_stage="adult" /tissue_type="hepatoma" CDS 316..1038 /codon_start=1 /evidence=experimental /product="hepatoma-derived GF" /db_xref="PID:d1004419" /db_xref="PID:g598956" /translation="MSRSNRQKEYKCGDLVFAKMKGYPHWPARIDEMPEAAVKSTANK YQVFFFGTHETAFLGPKDLFPYEESKEKFGKPNKRKGFSEGLWEIENNPTVKASGYQS SQKKSCVEEPEPEPEAAEGDGDKKGNAEGSSDEEGKLVIDEPAKEKNEKGALKRRAGD LLEDSPKRPKEAENPEGEEKEAATLEVERPLPMEVEKNSTPSEPGSGRGPPQEEEEEE DEEEEATKEDAEAPGIRDHESL" polyA_signal 1936..1941 polyA_signal 2357..2362 BASE COUNT 543 a 656 c 679 g 498 t ORIGIN 1 gaggaggagt ggggaccggg cggggggtgg aggaagaggc ctcgcgcaga ggagggagca 61 attgaatttc aaacacaaac aactcgacga gcgcgcaccc accgcgccgg agccttgccc 121 cgatccgcgc ccgccccgtc cgtgcggcgc gcgggcggag acgccgtggc cgcgccggag 181 ctcgggccgg gggccaccat cgaggcgggg gccgcgcgag ggccggagcg gagcggcgcc 241 gccaccgccg cacgcgcaaa cttgggctcg cgcttcccgg cccggcgcgg agcccggggc 301 gcccggagcc ccgccatgtc gcgatccaac cggcagaagg agtacaaatg cggggacctg 361 gtgttcgcca agatgaaggg ctacccacac tggccggccc ggattgacga gatgcctgag 421 gctgccgtga aatcaacagc caacaaatac caagtctttt ttttcgggac ccacgagacg 481 gcattcctgg gccccaaaga cctcttccct tacgaggaat ccaaggagaa gtttggcaag 541 cccaacaaga ggaaagggtt cagcgagggg ctgtgggaga tcgagaacaa ccctactgtc 601 aaggcttccg gctatcagtc ctcccagaaa aagagctgtg tggaagagcc tgaaccagag 661 cccgaagctg cagagggtga cggtgataag aaggggaatg cagagggcag cagcgacgag 721 gaagggaagc tggtcattga tgagccagcc aaggagaaga acgagaaagg agcgttgaag 781 aggagagcag gggacttgct ggaggactct cctaaacgtc ccaaggaggc agaaaaccct 841 gaaggagagg agaaggaggc agccaccttg gaggttgaga ggccccttcc tatggaggtg 901 gaaaagaata gcaccccctc tgagcccggc tctggccggg ggcctcccca agaggaagaa 961 gaagaggagg atgaagagga agaggctacc aaggaagatg ctgaggcccc aggcatcaga 1021 gatcatgaga gcctgtagcc accaatgttt caagaggagc ccccaccctg ttcctgctgc 1081 tgtctgggtg ctactgggga aactggccat ggcctgcaaa ctgggaaccc ctttcccacc 1141 ccaacctgct ctcctcttct actcactttt cccactccaa gcccagccca tggagattga 1201 cctggatggg gcaggccacc tggctctcac ctctaggtcc ccatactcct atgatctgag 1261 tcagagccat gtcttctccc tggaatgagt tgaggccact gtgttccttc cgcttggagc 1321 tattttccag gcttctgctg gggcctggga caactgctcc cacctcctga cacccttctc 1381 ccactctcct aggcattctg gacctctggg ttgggatcag gggtaggaat ggaaggatgg 1441 agcatcaaca gcagggtggg cttgtggggc ctgggagggg caatcctcaa atgcggggtg 1501 ggggcagcac aggagggcgg cctccttctg agctcctgtc ccctgctaca cctattatcc 1561 cagctgccta gattcaggga aagtgggaca gcttgtaggg gaggggctcc tttccataaa 1621 tccttgatga ttgacaacac ccatttttcc ttttgccgac cccaagagtt ttgggagttg 1681 tagttaatca tcaagagaat ttggggcttc caagttgttc gggccaagga cctgagacct 1741 gaagggttga ctttacccat ttgggtggga gtgttgagca tctgtccccc tttagatctc 1801 tgaagccaca aataggatgc ttgggaagac tcctagctgt cctttttcct ctccacacag 1861 tgctcaaggc cagcttatag tcatatatat cacccagaca taaaggaaaa gacacatttt 1921 ttaggaaatg tttttaataa aagaaaatta caaaaaaaaa ttttaaagac ccctaaccct 1981 ttgtgtgctc tccattctgc tccttcccca tcgttgcccc catttctgag gtgcactggg 2041 aggctcccct tctatttggg gcttgatgac tttctttttg tagctggggc tttgatgttc 2101 cttccagtgt catttctcat ccacataccc tgacctggcc ccctcagtgt tgtcaccaga 2161 tctgatttgt aacccactga gaggacagag agaaataagt gccctctccc accctcttcc 2221 tactggtctc tctatgcctc tctacagtct cgtctctttt accctggccc ctctcccttg 2281 ggctctgatg aaaaattgct gactgtagct ttggaagttt agctctgaga accgtagatg 2341 atttcagttc taggaaaata aaacccgttg attact // LOCUS HUMHEB 4126 bp mRNA PRI 31-DEC-1994 DEFINITION Human HEB helix-loop-helix protein (HEB) mRNA, complete cds. ACCESSION M80627 NID g183929 KEYWORDS helix-loop-helix protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4126) AUTHORS Hu,J.S., Olson,E.N. and Kingston,R.E. TITLE HEB, a helix-loop-helix protein related to E2A and ITF2 that can modulate the DNA-binding ability of myogenic regulatory factors JOURNAL Mol. Cell. Biol. 12 (3), 1031-1042 (1992) MEDLINE 92186835 FEATURES Location/Qualifiers source 1..4126 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa and HPB-ALL" gene 214..2262 /gene="HEB" CDS 214..2262 /gene="HEB" /codon_start=1 /product="helix-loop-helix protein" /db_xref="PID:g183930" /translation="MNPQQQRMAAIGTDKELSDLLDFSAMFSPPVNSGKTRPTTLGSS QFSGSGIDERGGTTSWGTSGQPSPSYDSSRGFTDSPHYSDHLNDSRLGAHEGLSPTPF MNSNLMGKTSERGSFSLYSRDTGLPGCQSSLLRQDLGLGSPAQLSSSGKPGTAYYSFS ATSSRRRPLHDSAALDPLQAKKVRKVPPGLPSSVYAPSPNSDDFNRESPSYPSPKPPT SMFASTFFMQDGTHNSSDLWSSSNGMSQPGFGGILGTSTSHMSQSSSYGNLHSHDRLS YPPHSVSPTDINTSLPPMSSFHRGSTSSSPYVAASHTPPINGSDSILGTRGNAAGSSQ TGDALGKALASIYSPDHTSSSFPSNPSTPVGSPSPLTGTSQWPRPGGQAPSSPSYENS LHSLQSRMEDRLDRLDDAIHVLRNHAVGPSTSLPAGHSDIHSLLGPSHNAPIGSLNSN YGGSSLVASSRSASMVGTHREDSVSLNGNHSVLSSTVTTSSTDLNHKTQENYRGGLQS QSGTVVTTEIKTENKEKDENLHEPPSSDDMKSDDESSQKDIKVSSRGRTSSTNEDEDL NPEQKIEREKERRMANNARERLRVRDINEAFKELGRMCQLHLKSEKPQTKLLILHQAV AVILSLEQQVRERNLNPKAACLKRREEEKVSAVSAEPPTTLPGTHPGLSETTNPMGHM " misc_feature 1939..2106 /gene="HEB" /note="basic-helix-loop-helix domain" BASE COUNT 1230 a 964 c 863 g 1069 t ORIGIN 1 taaagggacc gacagcccgc cccgggagga aggggcgcca ggcccgaaag ccgcctcccc 61 ctcccagacc cgagagctcg tgcggggcaa agtgaaccga gccgctgggc gtgcaagggg 121 aagcccaagc ccgttctccc ggccaaagtg aactttaatc ggggtggttg gatgcggaga 181 cggggcggca ggacctgcta gaagtggccg aagatgaatc cccagcaaca acgcatggcc 241 gctataggga ccgacaagga gctgagcgac ctactggact tcagtgcgat gttttcccca 301 cctgttaata gtgggaaaac tagaccaact acactgggaa gcagtcaatt cagtggatca 361 ggtattgatg aaagaggagg tacaacatct tggggaacaa gtggtcaacc aagtccttcc 421 tatgattcat ctagaggttt tacagacagc cctcattaca gtgatcactt gaatgacagt 481 cgattaggag cccatgaagg cttgtcccca acacctttca tgaactcaaa tctgatggga 541 aaaacatcag agagaggctc attttccctg tacagcagag atactggatt accaggctgt 601 caatctagtc tcctgagaca agatctgggg cttgggagcc cagcacagct atcttcttca 661 ggaaaacctg ggacagcata ctattcattc tctgctacaa gttccaggag gagaccactc 721 catgactctg cagcgcttga tcccttgcaa gcaaaaaaag tcagaaaggt gcctcctggt 781 ttgccttctt ctgtatatgc accatcccca aattcagatg atttcaaccg tgaatctcct 841 agttatccat ctcctaagcc accaaccagt atgttcgcta gcactttctt tatgcaagat 901 gggacccaca attcttctga cctttggagt tcatcaaatg ggatgagcca gcctggtttt 961 ggtggaattc tggggacctc cacttcccac atgtctcaat ccagtagtta tggcaacctt 1021 cattcacatg accgcttgag ttatcctcca cactcagttt caccaacaga cataaacacg 1081 agtcttccac caatgtccag ctttcatcgc ggcagtacca gcagttcacc ttacgttgct 1141 gcctcacaca ctcctcccat caatggatca gacagcattc taggaaccag agggaatgct 1201 gctggaagct cacagacagg tgatgcactt ggaaaggctt tggcatctat ttattctcct 1261 gaccatacca gcagtagttt tccgtcaaat ccatcaacac cagttggatc accttcacct 1321 ctcacaggta ccagtcagtg gccaagacct ggagggcaag caccttcatc cccaagctat 1381 gaaaactcac tccactccct gcagtctcga atggaggatc gtttagacag actggatgat 1441 gcaatccatg tgctgcggaa ccatgctgtg ggaccttcca ccagtttgcc tgctggtcac 1501 agtgatatac atagtttatt gggaccatcc cataatgcac caattggaag cctcaattca 1561 aactatggag gatcaagcct tgttgcaagc agtcgatcag cttcaatggt tggaactcat 1621 cgggaagact ctgtcagtct caatggcaat cattcagtcc tgtctagtac agtcactact 1681 tcaagcacag acctgaacca taaaacacaa gaaaattata gaggtggctt gcaaagtcag 1741 tctggaactg ttgttacaac agaaatcaag actgaaaaca aagaaaagga tgaaaacctt 1801 catgaacctc cttcatcaga tgacatgaag tcagatgatg aatcctccca aaaagatatc 1861 aaggtttcat ctagaggcag aacaagcagt actaatgaag atgaggattt gaaccctgaa 1921 cagaagatag aaagggagaa ggagaggcgg atggctaaca atgccagaga acgcttacgc 1981 gtgcgggata ttaatgaagc attcaaagag cttggccgaa tgtgtcagct tcacttgaag 2041 agtgaaaaac cccaaacaaa actccttatt cttcatcaag ccgtggcagt catccttagt 2101 ctagaacagc aagtcagaga gaggaacctt aaccccaaag cagcctgcct taagagaagg 2161 gaagaagaaa aagtttctgc cgtatcggca gagccgccaa ccacactgcc aggaacccat 2221 cctgggctta gtgaaactac caaccctatg ggtcatatgt aaacatcagc cagttccaga 2281 gttatcagta ggctagatag aaggtgacct ctcctcataa ggacttggac aactcagatt 2341 atctgaagac acaaacctga caggagggag aagaaaaaac aaaacacttg aaccaagaaa 2401 ctcaaatgta atcctacgat caaagcaact ggtcaacact tccatcagaa gtgaagatag 2461 gaagctcatc agatagaaca tcagcccatg agatgtttgc aacaaacctt ttgttgcaag 2521 cagtgtgtcg cttctgcaca atcagagact gtctcgatct ctccactcac cgtggaagtt 2581 gccttgtgcc taaactgaat tgacaaatgc attgtaacta caaattttat ttattgttat 2641 gaaactgtaa ggtctacata taaagggaaa aagttaatgt ggaaagctga tctacactca 2701 gctgatgcca gcatacatta aagcggttca cgtgcagaga acaaagcagt gacaaccatt 2761 ggcccttagc attcccggca tacctattag tgtcttaaaa aggaagggaa aagtcttttg 2821 ttgccctctc ctatcctctt gccatatgaa tagcgttttc catgaaatag gaaaatatta 2881 cttggtatag catttctctt gctctcattt tttgatttat ttttattttc tctttgtggg 2941 tgttatattt gatctctaaa tctgaacagt ttatggtcac agtccagcct cctccgtgca 3001 gccctgtgtg ctttgcacat ttaccttaca gtggtaagca gagaccatct gtgaccatag 3061 cctagctagc attttaaacg gggaaatttt gttctctagg ttttccccca aataaacatt 3121 gctttatttc taataataac caagactttt caagcttcta gatctcatag gaaagcttgt 3181 aatagcaaaa ttgtaaatta caagggaaga atctactttt tagaaatcgc tttgttttcc 3241 aagcagtaag tactacatac agtacttgta aagtgttagc tgtaagtaag cacaaaatac 3301 atttaaaata caaagacgat tttttcaggc tgtgattatg gtgaacataa caaaacccag 3361 tagtcaccga ggcaggtagt gtgataaatg aacacaccac tctgaggcta attacctaat 3421 ggaatacaag agcaatggtc acccgtattt ccttatccta gcctttattt ctctgtcatt 3481 tggatggctg gtcaatgggg aagaattgag tgggtgattt aatcaactgc aaaccatctg 3541 cccctgtccc aaaatgatga gccagattag cattaaacca gtacttgtca gtccatctta 3601 atactgttca ttaaggcact ctctgtctct aatccttagg agttgtttta aaagacataa 3661 tcactttgaa cttccatgaa acctgcctgc caccacaaca accctgggag agaaaaacat 3721 gctaaaggag gtatcttggc ttaataattc cttatagcca atatcaacag tggcaatcag 3781 cacacagagg aaaggaccca aatcactatg tagcttaaag atttctgtta atttgaaaga 3841 acaaaaacaa gacagaactt ctggtactct aatcaggatg attcctaaca agtcagtcat 3901 ttgtgaactt agtggacttt ttggttactt taatttgcat atattctcca gttacatcgg 3961 actctatctg tggccttgtt cttcatttca gtgttaatca gctaaacaga agttgttgct 4021 tatgatgtgt gagtgaacat atgccactgc ctggcccttt tttcttcaga gcttgttgtc 4081 tttttcgcta tattagactt tgcagtatgc ccagaagctt tccttc // LOCUS HUMHEK 3149 bp mRNA PRI 31-DEC-1994 DEFINITION Human receptor tyrosine kinase (HEK) mRNA, complete cds. ACCESSION M83941 NID g183931 KEYWORDS receptor protein-tyrosine kinase. SOURCE Homo sapiens lymphoid tumor cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3149) AUTHORS Wicks,I.P., Wilkinson,D., Salvaris,E. and Boyd,A.W. TITLE Molecular cloning of HEK, the gene encoding a receptor tyrosine kinase expressed by human lymphoid tumor cell lines JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (5), 1611-1615 (1992) MEDLINE 92179233 FEATURES Location/Qualifiers source 1..3149 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="LK63" /tissue_type="lymphoid tumor" gene 101..3052 /gene="HEK" CDS 101..3052 /gene="HEK" /codon_start=1 /product="receptor protein kinase" /db_xref="PID:g183932" /translation="MDCQLSILLLLSCSVLDSFGELIPQPSNEVNLLDSKTIQGELGW ISYPSHGWEEISGVDEHYTPIRTYQVCNVMDHSQNNWLRTNWVPRNSAQKIYVELKFT LRDCNSIPLVLGTCKETFNLYYMESDDDHGVKFREHQFTKIDTIAADESFTQMDLGDR ILKLNTEIREVGPVNKKGFYLAFQDVGACVALVSVRVYFKKCPFTVKNLAMFPDTVPM DSQSLVEVRGSCVNNSKEEDPPRMYCSTEGEWLVPIGKCSCNAGYEERGFMCQACRPG FYKALDGNMKCAKCPPHSSTQEDGSMNCRCENNYFRADKDPPSMACTRPPSSPRNVIS NINETSVILDWSWPLDTGGRKDVTFNIICKKCGWNIKQCEPCSPNVRFLPRQFGLTNT TVTVTDLLAHTNYTFEIDAVNGVSELSSPPRQFAAVSITTNQAAPSPVLTIKKDRTSR NSISLSWQEPEHPNGIILDYEVKYYEKQEQETSYTILRARGTNVTISSLKPDTIYVFQ IRARTAAGYGTNSRKFEFETSPDSFSISGESSQVVMIAISAAVAIILLTVVIYVLIGR FCGYKSKHGADEKRLHFGNGHLKLPGLRTYVDPHTYEDPTQAVHEFAKELDATNISID KVVGAGEFGEVCSGRLKLPSKKEISVAIKTLKVGYTEKQRRDFLGEASIMGQFDHPNI IRLEGVVTKSKPVMIVTEYMENGSLDSFLRKHDAQFTVIQLVGMLRGIASGMKYLSDM GYVHRDLAARNILINSNLVCKVSDFGLSRVLEDDPEAAYTTRGGKIPIRWTSPEAIAY RKFTSASDVWSYGIVLWEVMSYGERPYWEMSNQDVIKAVDEGYRLPPPMDCPAALYQL MLDCWQKDRNNRPKFEQIVSILDKLIRNPGSLKIITSAAARPSNLLLDQSNVDISTFR TTGDWLNGVRTAHCKEIFTGVEYSSCDTIAKISTDDMKKVGVTVVGPQKKIISSIKAL ETQSKNGPVPV" BASE COUNT 891 a 711 c 768 g 779 t ORIGIN 1 ccatggatgg taacttctcc agcaatcaga gcgctccccc tcacatcagt ggcatgcttc 61 atggagatat gctcctctca ctgccctctg caccagcaac atggattgtc agctctccat 121 cctcctcctt ctcagctgct ctgttctcga cagcttcggg gaactgattc cgcagccttc 181 caatgaagtc aatctactgg attcaaaaac aattcaaggg gagctgggct ggatctctta 241 tccatcacat gggtgggaag agatcagtgg tgtggatgaa cattacacac ccatcaggac 301 ttaccaggtg tgcaatgtca tggaccacag tcaaaacaat tggctgagaa caaactgggt 361 ccccaggaac tcagctcaga agatttatgt ggagctcaag ttcactctac gagactgcaa 421 tagcattcca ttggttttag gaacttgcaa ggagacattc aacctgtact acatggagtc 481 tgatgatgat catggggtga aatttcgaga gcatcagttt acaaagattg acaccattgc 541 agctgatgaa agtttcactc aaatggatct tggggaccgt attctgaagc tcaacactga 601 gattagagaa gtaggtcctg tcaacaagaa gggattttat ttggcatttc aagatgttgg 661 tgcttgtgtt gccttggtgt ctgtgagagt atacttcaaa aagtgcccat ttacagtgaa 721 gaatctggct atgtttccag acacggtacc catggactcc cagtccctgg tggaggttag 781 agggtcttgt gtcaacaatt ctaaggagga agatcctcca aggatgtact gcagtacaga 841 aggcgaatgg cttgtaccca ttggcaagtg ttcctgcaat gctggctatg aagaaagagg 901 ttttatgtgc caagcttgtc gaccaggttt ctacaaggca ttggatggta atatgaagtg 961 tgctaagtgc ccgcctcaca gttctactca ggaagatggt tcaatgaact gcaggtgtga 1021 gaataattac ttccgggcag acaaagaccc tccatccatg gcttgtaccc gacctccatc 1081 ttcaccaaga aatgttatct ctaatataaa cgagacctca gttatcctgg actggagttg 1141 gcccctggac acaggaggcc ggaaagatgt taccttcaac atcatatgta aaaaatgtgg 1201 gtggaatata aaacagtgtg agccatgcag cccaaatgtc cgcttcctcc ctcgacagtt 1261 tggactcacc aacaccacgg tgacagtgac agaccttctg gcacatacta actacacctt 1321 tgagattgat gccgttaatg gggtgtcaga gctgagctcc ccaccaagac agtttgctgc 1381 ggtcagcatc acaactaatc aggctgctcc atcacctgtc ctgacgatta agaaagatcg 1441 gacctccaga aatagcatct ctttgtcctg gcaagaacct gaacatccta atgggatcat 1501 attggactac gaggtcaaat actatgaaaa gcaggaacaa gaaacaagtt ataccattct 1561 gagggcaaga ggcacaaatg ttaccatcag tagcctcaag cctgacacta tatacgtatt 1621 ccaaatccga gcccgaacag ccgctggata tgggacgaac agccgcaagt ttgagtttga 1681 aactagtcca gactctttct ccatctctgg tgaaagtagc caagtggtca tgatcgccat 1741 ttcagcggca gtagcaatta ttctcctcac tgttgtcatc tatgttttga ttgggaggtt 1801 ctgtggctat aagtcaaaac atggggcaga tgaaaaaaga cttcattttg gcaatgggca 1861 tttaaaactt ccaggtctca ggacttatgt tgacccacat acatatgaag accctaccca 1921 agctgttcat gagtttgcca aggaattgga tgccaccaac atatccattg ataaagttgt 1981 tggagcaggt gaatttggag aggtgtgcag tggtcgctta aaacttcctt caaaaaaaga 2041 gatttcagtg gccattaaaa ccctgaaagt tggctacaca gaaaagcaga ggagagactt 2101 cctgggagaa gcaagcatta tgggacagtt tgaccacccc aatatcattc gactggaagg 2161 agttgttacc aaaagtaagc cagttatgat tgtcacagaa tacatggaga atggttcctt 2221 ggatagtttc ctacgtaaac acgatgccca gtttactgtc attcagctag tggggatgct 2281 tcgagggata gcatctggca tgaagtacct gtcagacatg ggctatgttc accgagacct 2341 cgctgctcgg aacatcttga tcaacagtaa cttggtgtgt aaggtttctg atttcggact 2401 ttcgcgtgtc ctggaggatg acccagaagc tgcttataca acaagaggag ggaagatccc 2461 aatcaggtgg acatcaccag aagctatagc ctaccgcaag ttcacgtcag ccagcgatgt 2521 atggagttat gggattgttc tctgggaggt gatgtcttat ggagagagac catactggga 2581 gatgtccaat caggatgtaa ttaaagctgt agatgagggc tatcgactgc caccccccat 2641 ggactgccca gctgccttgt atcagctgat gctggactgc tggcagaaag acaggaacaa 2701 cagacccaag tttgagcaga ttgttagtat tctggacaag cttatccgga atcccggcag 2761 cctgaagatc atcaccagtg cagccgcaag gccatcaaac cttcttctgg accaaagcaa 2821 tgtggatatc tctaccttcc gcacaacagg tgactggctt aatggtgtcc ggacagcaca 2881 ctgcaaggaa atcttcacgg gcgtggagta cagttcttgt gacacaatag ccaagatttc 2941 cacagatgac atgaaaaagg ttggtgtcac cgtggttggg ccacagaaga agatcatcag 3001 tagcattaaa gctctagaaa cgcaatcaaa gaatggccca gttcccgtgt aaagcacgac 3061 ggaagtgctt ctggacggaa gtggtggctg tggaaggcgt caagtcatcc tgcagacaga 3121 caataattct ggagatactg gtggaagtt // LOCUS HUMHEM1 3837 bp mRNA PRI 14-OCT-1993 DEFINITION Human membrane-associated protein (HEM-1) mRNA, complete cds. ACCESSION M58285 NID g407955 KEYWORDS membrane-associated protein. SOURCE Human adult blood myeloid from Patient S, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3837) AUTHORS Hromas,R.A., Collins,S., Raskind,W., Deaven,L. and Kaushansky,K. TITLE Hem-1, a potential membrane protein, with expression restricted to blood cells JOURNAL Biochim. Biophys. Acta 109, 241-244 (1991) MEDLINE 66099942 FEATURES Location/Qualifiers source 1..3837 /organism="Homo sapiens" /isolate="patient S" /db_xref="taxon:9606" /cell_type="myeloid" /dev_stage="adult" /tissue_type="blood" CDS 1583..3424 /codon_start=1 /product="membrane-associated protein HEM-1" /db_xref="PID:g183941" /translation="MNLIVFHSRMLDSVEKLLVETSDLSTFCFHLRIFEKMFAMTLEE SAMLRYAIAFPLICAHFVHCTHEMCPEEYPHLKNHGLHHCNSFLEELAKQTSNCVLEI CAEQRNLSEQLLPKHCATTISKAKNKKTRKQRQTPRKGEPERDKPGAESHRKNRSIVT NMDKLHLNLTELALTMNHVYSFSVFEHTIFPSEYLSSHLEARLNRAIVWLAGYNATTQ EIVRPSELLAGVKAYIGFIQSLAQFLGADASRVIRKPLLQQTQPLDSCGEQTITTLYT NWYLESLLRQASSGTIILSPAMQAFVSLPREGEQNFSAEEFSDISEMRALAELLGPYG MKFLSENLMWHVTSQIVELKKLVVENMDILVQIRSNFSKPDLMASLLPQLTGAENVLK RMTIIGVILSFRAMAQEGLREVFSSHCPFLMGPIECLKEFVTPDTDIKVTLSIFELAS AAGVGCDIDPALVAAIANLKADTSSPEEEYKVACLLLIFLAVSLPLLATDPSSFYSIE KDGYNNNIHCLTKAIIQVSAALFTLYNKNIETHLKEFVVVASVSLLQLGQETDKLKTR NRESISLLMRLVVEESSFLTLDMLESCFPYVLLRNAYREVSRAFHLN" BASE COUNT 961 a 958 c 944 g 974 t ORIGIN Map position 12q13. 1 tcagacattg ctgtctggtg ctcctctctc agtggccatc atgtctttga catctgctta 61 ccagcataaa ttagcagaga agctcactat cctgaatgat cgcggtcagg gggttctcat 121 ccgtatgtat aacatcaaga agacttgttc agaccccaaa tctaagccac ctttcttact 181 ggaaaagtcc atggaaccat ctctcaagta tatcaacaag aaatttccca acatagatgt 241 ccgaaacagc acgcaacatt taggaccagt acatcgtgaa aaagccgaga taattagatt 301 cctcaccaac tactaccagt catttgtgga tgtcatggaa tttcgggatc atgtatatga 361 acttctcaac accattgatg cctgccagtg ccattttgat atcaatctca actttgattt 421 cactcggagt tacctggact tgattgtaac ttacacctca gtcattttac ttctgtcacg 481 gattgaagat cggcggatac tcattggcat gtacaattgt gcccatgaga tgctgcatgg 541 gcatggtgac cccagttttg cccgtctggg tcagatggtc ttggagtatg accaccctct 601 gaagaagctg acagaagagt ttgggcctca cacaaaggct gtgagtggag ccctcctctc 661 tttgcatttc ctctttgtcc gaagaaacca gggggctgag cagtggcgca gtgcccaact 721 tctaagcctc atcagcaacc ccccagccat gattaaccct gctaattcag atacaatggc 781 ctgtgagtat ctgtctgtgg aagtaatgga gcgctggatt atcattgggt ttcttctttg 841 tcatgggtgc ctcaactcca atagccagtg ccagaagctg tggaagctgt gtctgcaggg 901 ctccctctac atcaccctta tccgtgagga tgtgctgcag gtgcacaaag tcaccgagga 961 cctgtttagc agtttgaaag ggtatggcaa gagagtggca gacataaagg agagcaagga 1021 acatgtaatt gcaaacagtg gccagtttca ttgtcaacgg cggcaatttc tgcggatggc 1081 agtgaaggag ctggagactg tgttggctga tgaaccggga ctactgggtc ctaaggctct 1141 ttttgctttc atggccctgt ccttcattcg tgatgaggtc acctggctgg ttcgccacac 1201 agagaatgtc accaagacaa agacacctga ggactatgct gactcgagca ttgcagagct 1261 acttttcttg ttggagggga ttaggtctct ggtccgaaga cacatcaaag tgatacagca 1321 ataccacctt cagtacttgg caagatttga tgctcttgtg ctcagtgaca tcattcagaa 1381 cttgtctgtg tgtccagagg aggagtccat catcatgtcc tcattcgtca gtatcctctc 1441 ctctctgaat ctcaaacaag ttgataatgg agaaaaattt gaattctcag gattgaggct 1501 ggactggttc cgcctacagg cataccatag cgtggctaag gcccctctgc acctgcatga 1561 gaaccctgac ttagccaagg tgatgaacct cattgtcttc cactcccgaa tgctggactc 1621 cgtagaaaaa ttgctggtgg aaacttctga tctgtctact ttctgctttc atcttcgtat 1681 ctttgagaag atgtttgcca tgaccttgga ggaatctgcc atgttgcgtt atgccattgc 1741 tttccccctg atttgtgctc actttgtcca ctgcactcat gagatgtgcc cagaggagta 1801 cccccacctc aagaaccatg gtcttcacca ctgcaactcc ttcctggaag agttggccaa 1861 gcagaccagc aattgcgtcc tggagatctg tgctgagcag cgaaacctga gcgagcagct 1921 tctacctaag cactgtgcca ctacaatcag caaagccaag aacaagaaaa ccaggaagca 1981 gaggcagact cccagaaaag gagagcccga gagggacaag ccaggagctg agagtcaccg 2041 gaagaaccgc agcattgtca ccaacatgga caagctacac ctaaacttga cagaactggc 2101 actgacaatg aatcatgtat acagtttctc cgtgtttgaa catactatct tcccttctga 2161 gtacctcagc agccacctgg aggccagact caacagagcc attgtgtggc tggctggcta 2221 caatgccacg acccaggaga tcgtacggcc ttctgagctg ttggcaggag tcaaagcata 2281 cattggtttc atacagtcac tggcccagtt tttgggtgca gatgcttcca gagtcatccg 2341 caagcccctc ctgcagcaga cacaaccact ggattcctgt ggggaacaga caatcaccac 2401 actctacaca aactggtacc tggaaagtct gcttagacag gcaagcagtg ggaccatcat 2461 cctctcccca gccatgcagg ccttcgtcag cctgcccaga gaaggggagc agaacttcag 2521 tgcagaggag ttctctgaca tctctgagat gcgggccttg gcagaactcc tgggccccta 2581 tggcatgaag ttcctgagtg aaaacctgat gtggcatgtg acctctcaga ttgtggagct 2641 gaagaagctg gtggtggaaa acatggacat acttgttcag atcagatcca actttagcaa 2701 gccggacttg atggcttccc tgctgcccca gctgacaggg gctgaaaatg tgctaaagcg 2761 catgaccatc attggggtta tcctcagttt cagggccatg gcccaagagg gacttcggga 2821 ggttttctcc tcccactgcc catttcttat gggtcccatt gagtgcttga aggagtttgt 2881 cactccagac acagacatca aggtgacctt gagtatcttt gagctggcat ctgctgcagg 2941 tgtgggctgt gacattgacc cagccttggt ggctgccatt gctaatctga aagctgatac 3001 ttcatctcct gaggaggaat ataaggtggc ctgcctgctc ttgatctttc tggcagtttc 3061 cctcccactc cttgccactg acccttcttc cttttatagc attgagaagg atggttacaa 3121 caacaatatt cattgcttga ccaaagccat catccaggtg tctgctgccc tcttcacgct 3181 ctacaacaag aacattgaaa ctcacctcaa ggaatttgtg gtggtggcct ctgtcagcct 3241 cttgcagctg ggccaggaga ctgacaagct taaaaccaga aatcgagaat ccatttctct 3301 gctcatgcgc ttggtggtgg aggagtcatc cttcctgacc ctggacatgc tggagtcctg 3361 tttcccttat gtcctgcttc gaaatgccta tcgggaggtg tctcgggcct tccacctaaa 3421 ctgaatgcct gccagtaccc actgaagagc cctttggacc ttcctaaacc cttgccatag 3481 tggaagctgt ggtcactttc gcagggggtg ggaatggggt ggggtcacta aggagagagg 3541 gtcaggagcc agagttgatg agcagatctg tggaagaaca atccagggct gagaaatcgt 3601 agagcagtga ggcaggctgg gagcatggag gacagcttat ggaaaaagtt agggcgtggg 3661 gccacatgtg tgaattttac aatgaaaaaa ggagtaacgt acaagtatat tttctatctt 3721 ctggtgactt gagcttgagc tctgacaggc atgggcctct ccgaccttca tcactattct 3781 taggataatg ctggcgggca gagatgatca atcatcatat taaatcataa tgagccc // LOCUS HUMHEN1A 600 bp DNA PRI 31-DEC-1994 DEFINITION Homo sapiens helix-loop-helix protein (HEN1) gene, complete cds. ACCESSION M97507 NID g183946 KEYWORDS helix-loop-helix protein. SOURCE Homo sapiens (tissue library: RPMI-8402/2001) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 600) AUTHORS Brown,L., Espinosa,R. III., Le Beau,M.M., Siciliano,M.J. and Baer,R. TITLE HEN1 and HEN2: a subgroup of basic helix-loop-helix genes that are coexpressed in a human neuroblastoma JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (18), 8492-8496 (1992) MEDLINE 92409542 FEATURES Location/Qualifiers source 1..600 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="RPMI-8402" /tissue_lib="RPMI-8402/2001" intron <1..24 /gene="HEN1" gene 1..600 /gene="HEN1" CDS 199..600 /gene="HEN1" /codon_start=1 /product="helix-loop-helix protein" /db_xref="PID:g183947" /translation="MMLNSDTMELDLPPTHSETESGFSDCGGGAGPDGAGPGGPGGGQ ARGPEPGEPGRKDLQHLSREERRRRRRATAKYRTAHATRERIRVEAFNLAFAELRKLL PTLPPDKKLSKIEILRLAICYISYLNHVLDV" BASE COUNT 109 a 209 c 180 g 102 t ORIGIN 1 tcacttctct cttctctttt tcaggcttca gactggcacc ctgaccatgg aaccctgaag 61 tggcagtgac ttctagagct cagtggcaga ccccacgacc cttcctcccc cttcctcccc 121 ctcccaccac cagctttcaa gtcccagagg gaggggtggg gaggggatcc tgatctcaca 181 gggcaggggg cttccatcat gatgctcaac tcagacacca tggagctgga cctgccgccc 241 acccactcag agactgagtc gggcttcagt gactgtgggg gcggggcggg ccctgatggt 301 gccgggcctg ggggtccggg agggggccag gcccgaggcc cagagccggg agagcctggc 361 cggaaagacc tgcagcatct gagccgcgag gagcgccggc gccggcgccg cgccacagcc 421 aagtaccgca cggcccacgc cacgcgagaa cgcatccgcg tggaagcctt caacctggcc 481 ttcgccgagc tgcgcaagct gctgcctacg ctgccccccg acaagaagct ctccaagatt 541 gagattctgc gcctggccat ctgctatatc tcctacctga accacgtgct ggacgtctga // LOCUS HUMHEOF 3817 bp mRNA PRI 01-MAY-1996 DEFINITION Homo sapiens enhancer of filamentation (HEF1) mRNA, complete cds. ACCESSION L43821 NID g1294780 KEYWORDS enhancer of filamentation. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3817) AUTHORS Law,S.F., Estojak,J., Wang,B., Mysliwiec,T.H., Kruh,G. and Golemis,E.A. TITLE Human enhancer of filamentation 1 (HEF1), a novel p130cas-like docking protein, associates with FAK, and induces pseudohyphal growth in yeast JOURNAL Mol. Cell. Biol. (1996) In press FEATURES Location/Qualifiers source 1..3817 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" 5'UTR <1..163 /gene="HEF1" gene 1..3817 /gene="HEF1" mRNA <1..>3817 /gene="HEF1" CDS 164..2668 /gene="HEF1" /codon_start=1 /product="enhancer of filmentation 1" /db_xref="PID:g1280212" /translation="MKYKNLMARALYDNVPECAEELAFRKGDILTVIEQNTGGLEGWW LCSLHGRQGIVPGNRVKLLIGPMQETASSHEQPASGLMQQTFGQQKLYQVPNPQAAPR DTIYQVPPSYQNQGIYQVPTGHGTQEQEVYQVPPSVQRSIGGTSGPHVGKKVITPVRT GHGYVYEYPSRYQKDVYDIPPSHTTQGVYDIPPSSAKGPVFSVPVGEIKPQGVYDIPP TKGVYAIPPSACRDEAGLREKDYDFPPPMRQAGRPDLRPEGVYDIPPTCTKPAGKDLH VKYNCDIPGAAEPVARRHQSLSPNHPPPQLGQSVGSQNDAYDVPRGVQFLEPPAETSE KANPQERDGVYDVPLHNPPDAKGSRDLVDGINRLSFSSTGSTRSNMSTSSTSSKESSL SASPAQDKRLFLDPDTAIERLQRLQQALEMGVSSLMALVTTDWRCYGYMERHINEIRT AVDKVELFLKEYLHFVKGAVANAACLPELILHNKMKRELQRVEDSHQILSQTSHDLNE CSWSLNILAINKPQNKCDDLDRFVMVAKTVPDDAKQLTTTINTNAEALFRPGPGSLHL KNGPESIMNSTEYPHGGSQGQLLHPGDHKAQAHNKALPPGLSKEQAPDCSSSDGSERS WMDDYDYVHLQGKEEFERQQKELLEKENIMKQNKMQLEHHQLSQFQLLEQEITKPVEN DISKWKPSQSLPTTNSGVSAQDRQLLCFYYDQCETHFISLLNAIDALFSCVSSAQPPR IFVAHSKFVILSAHKLVFIGDTLTRQVTAQDIRNKVMNSSNQLCEQLKTIVMATKMAA LHYPSTTALQEMVHQVTDLSRNAQLFKRSLLEMATF" 3'UTR 2669..>3817 /gene="HEF1" BASE COUNT 1049 a 977 c 922 g 869 t ORIGIN 1 tgaattcgtg agagacttga gggaggcgct gcgactgaca agcggctctg cccgggacct 61 tctcgctttc atctagcgct gcactcaatg gaggggcggg caccgcagtg cttaatgctg 121 tcttaactag tgtaggaaaa cggctcaacc caccgctgcc gaaatgaagt ataagaatct 181 tatggcaagg gccttatatg acaatgtccc agagtgtgcc gaggaactgg cctttcgcaa 241 gggagacatc ctgaccgtca tagagcagaa cacaggggga ctggaaggat ggtggctgtg 301 ctcgttacac ggtcggcaag gcattgtccc aggcaaccgg gtgaagcttc tgattggtcc 361 catgcaggag actgcctcca gtcacgagca gcctgcctct ggactgatgc agcagacctt 421 tggccaacag aagctctatc aagtgccaaa cccacaggct gctccccgag acaccatcta 481 ccaagtgcca ccttcctacc aaaatcaggg aatttaccaa gtccccactg gccacggcac 541 ccaagaacaa gaggtatatc aggtgccacc atcagtgcag agaagcattg ggggaaccag 601 tgggccccac gtgggtaaaa aggtgataac ccccgtgagg acaggccatg gctacgtata 661 cgagtaccca tccagatacc aaaaggatgt ctatgatatc cctccttctc ataccactca 721 aggggtatac gacatccctc cctcatcagc aaaaggccct gtgttttcag ttccagtggg 781 agagataaaa cctcaagggg tgtatgacat cccgcctaca aaaggggtat atgccattcc 841 gccctctgct tgccgggatg aagcagggct tagggaaaaa gactatgact tcccccctcc 901 catgagacaa gctggaaggc cggacctcag accggagggg gtttatgaca ttcctccaac 961 ctgcaccaag ccagcaggga aggaccttca tgtaaaatac aactgtgaca ttccaggagc 1021 tgcagaaccg gtggctcgaa ggcaccagag cctgtccccg aatcacccac ccccgcaact 1081 cggacagtca gtgggctctc agaacgacgc atatgatgtc ccccgaggcg ttcagtttct 1141 tgagccacca gcagaaacca gtgagaaagc aaacccccag gaaagggatg gtgtttatga 1201 tgtccctctg cataacccgc cagatgctaa aggctctcgg gacttggtgg atgggatcaa 1261 ccgattgtct ttctccagta caggcagcac ccggagtaac atgtccacgt cttccacctc 1321 ctccaaggag tcctcactgt cagcctcccc agctcaggac aaaaggctct tcctggatcc 1381 agacacagct attgagagac ttcagcggct ccagcaggcc cttgagatgg gtgtctccag 1441 cctaatggca ctggtcacta ccgactggcg gtgttacgga tatatggaaa gacacatcaa 1501 tgaaatacgc acagcagtgg acaaggtgga gctgttcctg aaggagtacc tccactttgt 1561 caagggagct gttgcaaatg ctgcctgcct cccggaactc atcctccaca acaagatgaa 1621 gcgggagctg caacgagtcg aagactccca ccagatcctg agtcaaacca gccatgactt 1681 aaatgagtgc agctggtccc tgaatatctt ggccatcaac aagccccaga acaagtgtga 1741 cgatctggac cggtttgtga tggtggcaaa gacggtgccc gatgacgcca agcagctcac 1801 cacaaccatc aacaccaacg cagaggccct cttcagaccc ggccctggca gcttgcatct 1861 gaagaatggg ccggagagca tcatgaactc aacggagtac ccacacggtg gctcccaggg 1921 acagctgctg catcctggtg accacaaggc ccaggcccac aacaaggcac tgcccccagg 1981 cctgagcaag gagcaggccc ctgactgtag cagcagtgat ggttctgaga ggagctggat 2041 ggatgactac gattacgtcc acctacaggg taaggaggag tttgagaggc aacagaaaga 2101 gctattggaa aaagagaata tcatgaaaca gaacaagatg cagctggaac atcatcagct 2161 gagccagttc cagctgttgg aacaagagat tacaaagccc gtggagaatg acatctcgaa 2221 gtggaagccc tctcagagcc tacccaccac aaacagtggc gtgagtgctc aggatcggca 2281 gttgctgtgc ttctactatg accaatgtga gacccatttc atttcccttc tcaacgccat 2341 tgacgcactc ttcagttgtg tcagctcagc ccagcccccg cgaatcttcg tggcacacag 2401 caagtttgtc atcctcagtg cacacaaact ggtgttcatt ggagacacgc tgacacggca 2461 ggtgactgcc caggacattc gcaacaaagt catgaactcc agcaaccagc tctgcgagca 2521 gctcaagact atagtcatgg caaccaagat ggccgccctc cattacccca gcaccacggc 2581 cctgcaggaa atggtgcacc aagtgacaga cctttctaga aatgcccagc tgttcaagcg 2641 ctctttgctg gagatggcaa cgttctgaga agaaaaaaaa gaggaagggg actgcgttaa 2701 cggttactaa ggaaaactgg aaatactgtc tggtttttgt aaatgttatc tatttttgta 2761 gataatttta tataaaaatg aaatatttta acattttatg ggtcagacaa ctttcagaaa 2821 ttcagggagc tggagaggga aatctttttt tcccccctga gtgttcttat gtatacacag 2881 aagtatctga gacataaact gtacagaaaa cttgtccacg tccttttgta tgcccatgta 2941 ttcatgtttt tgtttgtaga tgtttgtctg atgcatttca ttaaaaaaaa aaccatgaat 3001 tacgaagcac cttagtaagc accttctaat gctgcatttt ttttgttgtt gttaaaaaca 3061 tccagctggt tataatattg ttctccacgt ccttgtgatg attctgagcc tggcactggg 3121 aatctgggaa gcatagttta tttgcaagtg ttcaccttcc aaatcatgag gcatagcatg 3181 acttattctt gttttgaaaa ctcttttcaa aactgaccat cttaaacaca tgatggccaa 3241 gtgccacaaa gccctcttgc ggagacattt acgaatatat atgtggatcc aagtctcgat 3301 agttaggcgt tggagggaag agagaccaga gagtttagag gccaggacca cagttaggat 3361 tgggttgttt caatactgag agacagctac aataaaagga gagcaattgc ctccctgggg 3421 ctgttcaatc ttctgcattt gtgagtggtt cagtcatgag gttttccaaa agatgttttt 3481 agagttgtaa aaaccatatt tgcagcaaag atttacaaag gcgtatcaga ctatgattgt 3541 tcaccaaaat aggggaatgg tttgatccgc cagttgcaag tagaggcctt tctgactctt 3601 aatattcact ttggtgctac tacccccatt acctgaggaa ctggccaggt ccttgatcat 3661 ggaactatag agctaccaga catatcctgc tctctaaggg aatttattgc tatcttgcac 3721 cttctttaaa actcaaaaaa catatgcaga cctgacactc aagagtggct agctacacag 3781 agtccatcta atttttgcaa cttccccccc cgaattc // LOCUS HUMHEPBP 1163 bp mRNA PRI 31-DEC-1994 DEFINITION Human heparin binding protein (HBp17) mRNA, complete cds. ACCESSION M60047 NID g183950 KEYWORDS heparin binding protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1163) AUTHORS Wu,D.Q., Kan,M.K., Sato,G.H., Okamoto,T. and Sato,J.D. TITLE Characterization and molecular cloning of a putative binding protein for heparin-binding growth factors JOURNAL J. Biol. Chem. 266 (25), 16778-16785 (1991) MEDLINE 91358475 FEATURES Location/Qualifiers source 1..1163 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="A631" /cell_type="epideromoid carcinoma" sig_peptide 98..196 /gene="HBp17" CDS 98..802 /gene="HBp17" /codon_start=1 /product="heparin binding protein" /db_xref="PID:g183951" /translation="MKICSLTLLSFLLLAAQVLLVEGKKKVKNGLHSKVVSEQKDTLG NTQIKQKSRPGNKGKFVTKDQANCRWAATEQEEGISLKVECTQLDHEFSCVFAGNPTS CLKLKDERVYWKQVARNLRSQKDICRYSKTAVKTRVCRKDFPESSLKLVSSTLFGNTK PRKEKTEMSPREHIKGKETTPSSLAVTQTMATKAPECVEDPDMANQRKTALEFCGETW SSLCTFFLSIVQDTSC" gene 98..802 /gene="HBp17" mat_peptide 197..799 /gene="HBp17" /product="heparin binding protein" BASE COUNT 336 a 273 c 275 g 279 t ORIGIN 1 ctctacctga cacagctgca gcctgcaatt cactcccact gcctgggatt gcactggatc 61 cgtgtgctca gaacaaggtg aacgcccagc tgcagccatg aagatctgta gcctcaccct 121 gctctccttc ctcctactgg ctgctcaggt gctcctggtg gaggggaaaa aaaaagtgaa 181 gaatggactt cacagcaaag tggtctcaga acaaaaggac actctgggca acacccagat 241 taagcagaaa agcaggcccg ggaacaaagg caagtttgtc accaaagacc aagccaactg 301 cagatgggct gctactgagc aggaggaggg catctctctc aaggttgagt gcactcaatt 361 ggaccatgaa ttttcctgtg tctttgctgg caatccaacc tcatgcctaa agctcaagga 421 tgagagagtc tattggaaac aagttgcccg gaatctgcgc tcacagaaag acatctgtag 481 atattccaag acagctgtga aaaccagagt gtgcagaaag gattttccag aatccagtct 541 taagctagtc agctccactc tatttgggaa cacaaagccc aggaaggaga aaacagagat 601 gtcccccagg gagcacatca agggcaaaga gaccaccccc tctagcctag cagtgaccca 661 gaccatggcc accaaagctc ccgagtgtgt ggaggaccca gatatggcaa accagaggaa 721 gactgccctg gagttctgtg gagagacttg gagctctctc tgcacattct tcctcagcat 781 agtgcaggac acgtcatgct aatgaggtca aaagagaacg ggttccttta agagatgtca 841 tgtcgtaagt ccctctgtat actttaaagc tctctacagt ccccccaaaa tatgaacttt 901 tgtgcttagt gagtgcaacg aaatatttaa acaagttttg tattttttgc ttttgtgttt 961 tggaatttgc cttatttttc ttggatgcga tgttcagagg ctgtttcctg cagcatgtat 1021 ttccatggcc cacacagcta tgtgtttgag cagcgaagag tctttgagct gaatgagcca 1081 gagtgataat ttcagtgcaa cgaactttct gctgaattaa tggtaataaa actctgggtg 1141 tttttcaaaa aaaaaaaaaa aaa // LOCUS HUMHEPGFA 2216 bp mRNA PRI 07-OCT-1994 DEFINITION Human hepatocyte growth factor-like protein mRNA, complete cds. ACCESSION M74178 NID g183976 KEYWORDS hepatocyte growth factor-like protein; serine protease-like domain. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2216) AUTHORS Han,S., Stuart,L.A. and Degen,S.J. TITLE Characterization of the DNF15S2 locus on human chromosome 3: identification of a gene coding for four kringle domains with homology to hepatocyte growth factor JOURNAL Biochemistry 30 (40), 9768-9780 (1991) MEDLINE 92002016 REFERENCE 2 (bases 1 to 2216) AUTHORS Degen,S.J. TITLE Direct Submission JOURNAL Submitted (02-DEC-1991) S.J.F. Degen, Division of Basic Science Research, Children's Hospital Research Foundation, Cincinnati, OH 45229-3039, USA FEATURES Location/Qualifiers source 1..2216 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /chromosome="3" /map="3p21" /tissue_type="liver" CDS 1..2136 /codon_start=1 /product="hepatocyte growth factor-like protein" /db_xref="PID:g183977" /translation="MGWLPLLLLLTQYLGVPGQRSPLNDFQVLRGTELQHLLHAVVPG PWQEDVADAEECAGRCGPLMDCRAFHYNVSSHGCQLLPWTQHSPHTRLRRSGRCDLFQ KKDYVRTCIMNNGVGYRGTMATTVGGLPCQAWSHKFPNDHKYTPTLRNGLEENFCRNP DGDPGGPWCYTTDPAVRFQSCGIKSCREAACVWCNGEEYRGAVDRTESGRECQRWDLQ HPHQHPFEPGKFLDQGLDDNYCRNPDGSERPWCYTTDPQIEREFCDLPRCGSEAQPRQ EATTVSCFRGKGEGYRGTANTTTAGVPCQRWDAQIPHQHRFTPEKYACKDLRENFCRN PDGSEAPWCFTLRPGMRAAFCYQIRRCTDDVRPQDCYHGAGEQYRGTVSKTRKGVQCQ RWSAETPHKPQFTFTSEPHAQLEENFCRNPDGDSHGPWCYTMDPRTPFDYCALRRCAD DQPPSILDPPDQVQFEKCGKRVDRLDQRRSKLRVVGGHPGNSPWTVSLRNRQGQHFCG GSLVKEQWILTARQCFSSCHMPLTGYEVWLGTLFQNPQHGEPSLQRVPVAKMVCGPSG SQLVLLKLERSVTLNQRVALICLPPEWYVVPPGTKCEIAGWGETKGTGNDTVLNVALL NVISNQECNIKHRGRVRESEMCTEGLLAPVGACEGDYGGPLACFTHNCWVLEGIIIPN RVCARSRWPAVFTRVSVFVDWIHKVMRLG" misc_feature 328..558 /note="first kringle domain" misc_feature 571..804 /note="second kringle domain" misc_feature 847..1083 /note="third kringle domain" misc_feature 1108..1344 /note="fourth kringle domain" misc_feature 1449..1450 /note="activation site" misc_feature 1450..2133 /note="serine protease-like domain" polyA_signal 2187..2192 BASE COUNT 445 a 666 c 682 g 423 t ORIGIN 1 atggggtggc tcccactcct gctgcttctg actcaatact taggggtccc tgggcagcgc 61 tcgccattga atgacttcca agtgctccgg ggcacagagc tacagcacct gctacatgcg 121 gtggtgcccg ggccttggca ggaggatgtg gcagatgctg aagagtgtgc tggtcgctgt 181 gggcccttaa tggactgccg ggccttccac tacaacgtga gcagccatgg ttgccaactg 241 ctgccatgga ctcaacactc gccccacacg aggctgcggc gttctgggcg ctgtgacctc 301 ttccagaaga aagactacgt acggacctgc atcatgaaca atggggttgg gtaccggggc 361 accatggcca cgaccgtggg tggcctgccc tgccaggctt ggagccacaa gttcccgaat 421 gatcacaagt acacgcccac tctccggaat ggcctggaag agaacttctg ccgtaaccct 481 gatggcgacc ccggaggtcc ttggtgctac acaacagacc ctgctgtgcg cttccagagc 541 tgcggcatca aatcctgccg ggaggccgcg tgtgtctggt gcaatggcga ggaataccgc 601 ggcgcggtag accgcacgga gtcagggcgc gagtgccagc gctgggatct tcagcacccg 661 caccagcacc ccttcgagcc gggcaagttc ctcgaccaag gtctggacga caactattgc 721 cggaatcctg acggctccga gcggccatgg tgctacacta cggatccgca gatcgagcga 781 gagttctgtg acctcccccg ctgcgggtcc gaggcacagc cccgccaaga ggccacaact 841 gtcagctgct tccgcgggaa gggtgagggc taccggggca cagccaatac caccactgcg 901 ggcgtacctt gccagcgttg ggacgcgcaa atccctcatc agcaccgatt tacgccagaa 961 aaatacgcgt gcaaagacct tcgggagaac ttctgccgga accccgacgg ctcagaggcg 1021 ccctggtgct tcacactgcg gcccggcatg cgcgcggcct tttgctacca gatccggcgt 1081 tgtacagacg acgtgcggcc ccaggactgc taccacggcg caggggagca gtaccgcggc 1141 acggtcagca agacccgcaa gggtgtccag tgccagcgct ggtccgctga gacgccgcac 1201 aagccgcagt tcacgtttac ctccgaaccg catgcacaac tggaggagaa cttctgccgg 1261 aacccagatg gggatagcca tgggccctgg tgctacacga tggacccaag gaccccattc 1321 gactactgtg ccctgcgacg ctgcgctgat gaccagccgc catcaatcct ggacccccca 1381 gaccaggtgc agtttgagaa gtgtggcaag agggtggatc ggctggatca gcggcgttcc 1441 aagctgcgcg tggttggggg ccatccgggc aactcaccct ggacagtcag cttgcggaat 1501 cggcagggcc agcatttctg cggggggtct ctagtgaagg agcagtggat actgactgcc 1561 cggcagtgct tctcctcctg ccatatgcct ctcacgggct atgaggtatg gttgggcacc 1621 ctgttccaga acccacagca tggagagcca agcctacagc gggtcccagt agccaagatg 1681 gtgtgtgggc cctcaggctc ccagcttgtc ctgctcaagc tggagagatc tgtgaccctg 1741 aaccagcgcg tggccctgat ctgcctgccc cctgaatggt atgtggtgcc tccagggacc 1801 aagtgtgaga ttgcaggctg gggtgagacc aaaggtacgg gtaatgacac agtcctaaat 1861 gtggccttgc tgaatgtcat ctccaaccag gagtgtaaca tcaagcaccg aggacgtgtg 1921 cgtgagagtg agatgtgcac tgagggactg ttggcccctg tgggggcctg tgagggtgac 1981 tacgggggcc cacttgcctg ctttacccac aactgctggg tcctggaagg aattataatc 2041 cccaaccgag tatgcgcaag gtcccgctgg ccagctgtct tcacgcgtgt ctctgtgttt 2101 gtggactgga ttcacaaggt catgagactg ggttaggccc agccttgatg ccatatgcct 2161 tggggaggac aaaacttctt gtcagacata aagccatgtt tcctctttat gcctgt // LOCUS HUMHEXKIN 3580 bp mRNA PRI 08-NOV-1994 DEFINITION Human hexokinase 1 (HK1) mRNA, complete cds. ACCESSION M75126 X61091 NID g184020 KEYWORDS hexokinase 1. SOURCE Homo sapiens adult kidney cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3580) AUTHORS Nishi,S., Seino,S. and Bell,G.I. TITLE Human hexokinase: sequences of amino- and carboxyl-terminal halves are homologous JOURNAL Biochem. Biophys. Res. Commun. 157 (3), 937-943 (1988) MEDLINE 89087485 FEATURES Location/Qualifiers source 1..3580 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="kidney" 5'UTR 1..81 gene 82..2835 /gene="HK1" CDS 82..2835 /gene="HK1" /codon_start=1 /db_xref="GDB:G00-120-044" /product="hexokinase 1" /db_xref="PID:g184021" /translation="MIAAQLLAYYFTELKDDQVKKIDKYLYAMRLSDETLIDIMTRFR KEMKNGLSRDFNPTATVKMLPTFVRSIPDGSEKGDFIALDLGGSSFRILRVQVNHEKN QNVHMESEVYDTPENIVHGSGSQLFDHVAECLGDFMEKRKIKDKKLPVGFTFSFPCQQ SKIDEAILITWTKRFKASGVEGADVVKLLNKAIKKRGDYDANIVAVVNDTVGTMMTCG YDDQHCEVGLIIGTGTNACYMEELRHIDLVEGDEGRMCINTEWGAFGDDGSLEDIRTE FDREIDRGSLNPGKQLFEKMVSGMYLGELVRLILVKMAKEGLLFEGRITPELLTRGKF NTSDVSAIEKNKEGLHNAKEILTRLGVEPSDDDCVSVQHVCTIVSFRSANLVAATLGA ILNRLRDNKGTPRLRTTVGVDGSLYKTHPQYSRRFHKTLRRLVPDSDVRFLLSESGSG KGAAMVTAVAYRLAEQHRQIEETLAHFHLTKDMLLEVKKRMRAEMELGLRKQTHNNAV VKMLPSFVRRTPDGTENGDFLALDLGGTNFRVLLVKIRSGKKRTVEMHNKIYAIPIEI MQGTGEELFDHIVSCISDFLDYMGIKGPRMPLGFTFSFPCQQTSLDAGILITWTKGFK ATDCVGHDVVTLLRDAIKRREEFDLDVVAVVNDTVGTMMTCAYEEPTCEVGLIVGTGS NACYMEEMKNVEMVEGDQGQMCINMEWGAFGDNGCLDDIRTHYDRLVNEYSLNAGKQR YEKMISGMYLGEIVRNILIDFTKKGFLFRGQISETMKTRGIFETKFLSQIESDRLALL QVRAILQQLGLNSTCDDSILVKTVCGVVSRRAAQLCGAGMAAVVDKIRENRGLDRLNV TVGVDGTLYKLHPHFSRIMHQTVKELSPKCNVSFLLSEDGSGKGAALITAVGVRLRTE ASS" 3'UTR 2836..3580 BASE COUNT 867 a 905 c 1030 g 778 t ORIGIN chromosome 10q. 1 ccgccggagg accacggctc gccagggctg cggaggaccg accgtcccca cgcctgccgc 61 cccgcgaccc cgaccgccag catgatcgcc gcgcagctcc tggcctatta cttcacggag 121 ctgaaggatg accaggtcaa aaagattgac aagtatctgt atgccatgcg gctctccgat 181 gaaactctca tagatatcat gactcgcttc aggaaggaga tgaagaatgg cctctcccgg 241 gattttaatc caacagccac agtcaagatg ttgccaacat tcgtaaggtc cattcctgat 301 ggctctgaaa agggagattt cattgccctg gatcttggtg ggtcttcctt tcgaattctg 361 cgggtgcaag tgaatcatga gaaaaaccag aatgttcaca tggagtccga ggtttatgac 421 accccagaga acatcgtgca cggcagtgga agccagcttt ttgatcatgt tgctgagtgc 481 ctgggagatt tcatggagaa aaggaagatc aaggacaaga agttacctgt gggattcacg 541 ttttcttttc cttgccaaca atccaaaata gatgaggcca tcctgatcac ctggacaaag 601 cgatttaaag cgagcggagt ggaaggagca gatgtggtca aactgcttaa caaagccatc 661 aaaaagcgag gggactatga tgccaacatc gtagctgtgg tgaatgacac agtgggcacc 721 atgatgacct gtggctatga cgaccagcac tgtgaagtcg gcctgatcat cggcactggc 781 accaatgctt gctacatgga ggaactgagg cacattgatc tggtggaagg agacgagggg 841 aggatgtgta tcaatacaga atggggagcc tttggagacg atggatcatt agaagacatc 901 cggacagagt ttgacaggga gatagaccgg ggatccctca accctggaaa acagctgttt 961 gagaagatgg tcagtggcat gtacttggga gagctggttc gactgatcct agtcaagatg 1021 gccaaggagg gcctcttatt tgaagggcgg atcaccccgg agctgctcac ccgagggaag 1081 tttaacacca gtgatgtgtc agccatcgaa aagaataagg aaggcctcca caatgccaaa 1141 gaaatcctga cccgcctggg agtggagccg tccgatgatg actgtgtctc agtccagcac 1201 gtttgcacca ttgtctcatt tcgctcagcc aacttggtgg ctgccacact gggcgccatc 1261 ttgaaccgcc tgcgtgataa caagggcaca cccaggctgc ggaccacggt tggtgtcgac 1321 ggatctcttt acaagacgca cccacagtat tcccggcgtt tccacaagac tctaaggcgc 1381 ttggtgccag actccgatgt gcgcttcctc ctctcggaga gtggcagcgg caagggggct 1441 gccatggtga cggcggtggc ctaccgcttg gccgagcagc accggcagat agaggagacc 1501 ctggctcatt tccacctcac caaagacatg ctgctggagg tgaagaagag gatgcgggcc 1561 gagatggagc tggggctgag gaagcagacg cacaacaatg ccgtggttaa gatgctgccc 1621 tccttcgtcc ggagaactcc cgacgggacc gagaatggtg acttcttggc cctggatctt 1681 ggaggaacca atttccgtgt gctgctggtg aaaatccgta gtgggaaaaa gagaacggtg 1741 gaaatgcaca acaagatcta cgccattcct attgaaatca tgcagggcac tggggaagag 1801 ctgtttgatc acattgtctc ctgcatctct gacttcttgg actacatggg gatcaaaggc 1861 cccaggatgc ctctgggctt cacgttctca tttccctgcc agcagacgag tctggacgcg 1921 ggaatcttga tcacgtggac aaagggtttt aaggcaacag actgcgtggg ccacgatgta 1981 gtcaccttac taagggatgc gataaaaagg agagaggaat ttgacctgga cgtggtggct 2041 gtggtcaacg acacagtggg caccatgatg acctgtgctt atgaggagcc cacctgtgag 2101 gttggactca ttgttgggac cggcagcaat gcctgctaca tggaggagat gaagaacgtg 2161 gagatggtgg agggggacca ggggcagatg tgcatcaaca tggagtgggg ggcctttggg 2221 gacaacgggt gtctggatga tatcaggaca cactacgaca gactggtgaa cgaatattcc 2281 ctaaatgctg ggaaacaaag gtatgagaag atgatcagtg gtatgtacct gggtgaaatc 2341 gtccgcaaca tcttaatcga cttcaccaag aagggattcc tcttccgagg gcagatctct 2401 gagacgatga agacccgggg catctttgag accaagtttc tctctcagat cgagagtgac 2461 cgattagcac tgctccaggt ccgggctatc ctccagcagc taggtctgaa tagcacctgc 2521 gatgacagta tcctcgtcaa gacagtgtgc ggggtggtgt ccaggagggc cgcacagctg 2581 tgtggcgcag gcatggctgc ggttgtggat aagatccgcg agaacagagg actggaccgt 2641 ctgaatgtga ctgtgggagt ggacgggaca ctctacaagc ttcatccaca cttctccaga 2701 atcatgcacc agacggtgaa ggaactgtca ccaaaatgta acgtgtcctt cctcctgtct 2761 gaggatggca gcggcaaggg ggccgccctc atcacggccg tgggcgtgcg gttacgcaca 2821 gaggcaagca gctaagagtc cgggatcccc agcctactgc ctctccagca cttctctctt 2881 caagcggcga ccccctaccc tcccagcgag ttgcgctggg agacgctggc gccagggcct 2941 gccggcgcgg ggaggaaagc aaaatccaac taatggtata tattgtaggg tacagaatag 3001 agcgtgtgct gttgataata tctctcaccc ggatccctcc tcacttgccc tgccactttg 3061 catggtttga ttttgacctg gtcccccacg tgtgaagtgt agtggcatcc atttctaatg 3121 tatgcattca tccaacagag ttatttattg gctggagatg gaaaatcaca ccacctgaca 3181 ggccttctgg gcctccaaag cccatccttg gggttccccc tccctgtgtg aaatgtatta 3241 tcaccagcag acactgccgg gcctccctcc cgggggcact gcctgaaggc gagtgtgggc 3301 atagcattag ctgcttcctc ccctcctggc acccactgtg gcctggcatc gcatcgtggt 3361 gtgtcaatgc cacaaaatcg tgtgtccgtg gaaccagtcc tagccgcgtg tgacagtctt 3421 gcattctgtt tgtctcgtgg ggggaggtgg acagtcctgc ggaaatgtgt cttgtcttcc 3481 atttggataa aaggaaccaa ccaacaaaca atgccatcac tggaatttcc caccgctttg 3541 tgagccgtgt cgtatgacct agtaaacttt gtaccaattc // LOCUS HUMHFH3 2089 bp mRNA PRI 27-MAR-1997 DEFINITION Human HNF-3/fork-head homolog-3 HFH-3 mRNA, complete cds. ACCESSION L13203 NID g1911184 KEYWORDS forkhead homolog; hepatocyte nuclear factor; transcription factor. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2089) AUTHORS Clevidence,D.E., Overdier,D.G., Tao,W., Qian,X., Pani,L., Lai,E. and Costa,R.H. TITLE Identification of nine tissue-specific transcription factors of the hepatocyte nuclear factor 3/forkhead DNA-binding-domain family JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90 (9), 3948-3952 (1993) MEDLINE 93248207 REFERENCE 2 (bases 1 to 2089) AUTHORS Overdier,D.G., Ye,H., Peterson,R.S., Clevidence,D.E. and Costa,R.H. TITLE The Winged Helix Transcriptional Activator HFH-3 is Expressed in the Distal Tubules of Embryonic and Adult Kidney JOURNAL J. Biol. Chem. (1997) In press REFERENCE 3 (bases 1 to 2089) AUTHORS Costa,R.H. TITLE Direct Submission JOURNAL Submitted (24-MAR-1997) Biochemistry, University of Illinois at Chicago, Chicago, IL 60612-7334, USA FEATURES Location/Qualifiers source 1..2089 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" /tissue_lib="Clontech" CDS 79..1134 /function="DNA binding domain" /note="HFH-3 winged helix transcriptional activator" /codon_start=1 /product="HNF-3/fork-head homolog-3" /db_xref="PID:g1911185" /translation="MNLYYENFFHPQGVPSPQRPSFEGGGEYGATPNPYLWFNGPTMT PPPYLPGPNASPFLPQAYGVQRPLLPSVSGLGGSDLGWLPIPSQEELMKLVRPPYSYS ALIAMAIHGAPDKRLTLSQIYQYVADNFPFYNKSKAGWQNSIRHNLSLNDCFKKVPRD EDDPGKGNYWTLDPNCEKMFDNGNFRRKRKRKSDVSSSTASLALEKTESSLPVDSPKT TEPQDILDGASPGGTTSSPEKRPSPPPSGAPCLNSFLSSMTAYVSGGSPTSHPLVTPG LSPEPSDKTGQNSLTFNSFSPLTNLSNHSGGGDWANPMPTNMLSYGGSVLSQFSPHFY NSVNTSGVLYPREGTEV" misc_feature 358..660 /note="winged helix DNA binding domain" misc_feature 718..1131 /note="transcriptional activation domain" BASE COUNT 504 a 654 c 543 g 384 t 4 others ORIGIN 1 aattccggcg acctgccggc gccctcccca cctcgctgca gcccccagtt ccccagcatc 61 ggccaggagc cccccgagat gaacctctac tatgagaact tcttccaccc acagggcgtg 121 cccagccctc agcggccctc cttcgagggg ggcggcgagt atggggccac ccccaacccc 181 tacctctggt tcaacgggcc caccatgacc ccgccaccct acctgcccgg ccccaacgcc 241 agccccttcc tgccccaggc ctatggagtg cagaggccgc tgctgcccag cgtgtcgggg 301 cttgggggga gcgacctggg ctggctgccc atcccctcgc aggaggagct gatgaagctg 361 gtgcggccac cctattccta ctcggctctc atcgccatgg ccatccacgg ggcacccgac 421 aagcgcctca ctctcagcca gatctaccag tacgtggccg acaacttccc cttctacaac 481 aagagcaagg ccggctggca gaactccatc cgccacaacc tgtcgctcaa cgactgcttc 541 aagaaggtgc cccgcgacga ggacgacccg ggcaaaggga attactggac cctggacccc 601 aactgtgaga aaatgttcga caatggaaat ttccgcagga aaaggaagag aaaatcagat 661 gtttcctcta gcacagcctc cttggcctta gagaagacag agagcagtct cccggtggac 721 agccccaaga ccacggagcc tcaggacatc ttggatggag cctcaccagg gggcaccacc 781 agctccccag agaagcggcc ctcccctccc ccatcaggcg ccccttgcct taacagcttc 841 ctttcctcta tgacagccta tgtgagcggg gggagcccca cgagccaccc cttggtcaca 901 ccaggactga gccctgagcc cagtgacaag acggggcaga actcactgac cttcaactcc 961 ttctccccgc tcaccaacct cagcaaccac agcggtgggg gtgactgggc gaaccccatg 1021 cccaccaaca tgctcagcta cggaggatct gtgctcagcc aattcagccc tcacttctac 1081 aacagtgtca acaccagtgg tgtcctctac cccagggagg gcaccgaggt ctaggtacag 1141 aacagctcct gagccaggtg gacatgccag agagaaaagc agtagaggtc ctccatgcca 1201 gccccacggt ggtccatgac tgcggaactg cccagacata agcaggagcc tccgaggaat 1261 ccaccctctt tctagaacac tggttaaggc ttctgtttat cacacatagg cccacacaca 1321 gactcaccaa ctttgcaata gaaatactgg tgcctgcaga gcagcactaa cagtggcagg 1381 tgctgtacta ggctctgtac tggccacact tactattgac agtcanyccg taaggttcac 1441 aaaccacccc attgaacaga tgaggaactg aggctcaagg aggttaagta acatttccag 1501 ggttatataa actagtaaat ggcagagcta agagtcaaat ccaggtctat gtgatcctca 1561 gagattggag gccaggatgg agaattggtt gagtagccaa ggaaggtcag tgtgaaaagc 1621 ttgctatggc aaatatagcg aaatctctcc actgccttct gtccaccagc atttagtgcc 1681 agcctaggca caacttgtcc tggtccaagt ccttattctg cttgccccaa cttacctgca 1741 gacactcctt ttgctaccac tcaaggaagg aagtcaccag tggccttagt gcaggaaact 1801 cagccctggt ggccctgcag aacaatgcat ctgacatgtg cgatgcatcc ccatggagag 1861 acangcattg ctccccagcc ctccagaaac cttngagact ctgcggtagg acagaatgct 1921 gttctggggg gcacaggatc tctttgtggg gagggaatca gaggaggaaa tctgaagtga 1981 agacaggtgg gctggggcta gtgacaagga tgagatggga gaggtagggg agaaggagtg 2041 gggcactttg tacccccatg aatccagagc agactccaga cattctttt // LOCUS HUMHFREP1 1215 bp mRNA PRI 03-SEP-1993 DEFINITION Human HFREP-1 mRNA for unknown protein, complete cds. ACCESSION D14446 NID g393314 KEYWORDS fibrinogen-related; liver specific. SOURCE Homo sapiens liver cDNA to mRNA, clone_lib:lambda gt10 cDNA clone:S29. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1215) AUTHORS Yamamoto,T., Gotoh,M., Sasaki,H., Terada,M., Kitajima,M. and Hirohashi,S. TITLE Molecular cloning and initial characterization of a novel fibrinogen-related gene, HFREP-1 JOURNAL Biochemical and Biophysical Research Communication 193, 681-687 (1993) REFERENCE 2 (bases 1 to 1215) AUTHORS Hirohashi,S. TITLE Direct Submission JOURNAL Submitted (15-FEB-1993) to the DDBJ/EMBL/GenBank databases. Setsuo Hirohashi, National Cancer Center, Research Institute, Pathology Division; 5-1-1 Tsukiji, Chuo-ku, Tokyo 104, Japan (Tel:03-3542-2511(ex.4200), Fax:03-3248-2737) COMMENT Submitted (15-FEB-1993) to DDBJ by: Setsuo Hirohashi National Cancer Center Res.Institute 5-1-1 Tsukiji Chuo-ku, Tokyo 104 Japan Phone: 03-3542-2511 x4200 Fax: 03-3248-2737. FEATURES Location/Qualifiers source 1..1215 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="S29" /clone_lib="lambda gt10 cDNA" /tissue_type="liver" gene 79..1017 /gene="HFREP-1" CDS 79..1017 /gene="HFREP-1" /codon_start=1 /product="unknown protein precursor" /db_xref="PID:d1003846" /db_xref="PID:g393315" /translation="MAKVFSFILVTTALIMGREISALEDCAQEQMRLRAQVRLLETRV KQQQVKIKQLLQENEVQFLDKGDEDTVVDLGSKRQYADCSEIFNDGYKLSGFYKIKPL QSPAEFSVYCDMSDGGGWTVIQRRSDGSENFNRGWKDYENGFGNFVQKHGEYWLGNKN LHFLTTQEDYTLKIDLADFEKNSRYAQYKNFKVGDEKNFYELNIGEYSGTAGDSLAGN FHPEVQWWASHQRMKFSTWDRDHDNYEGNCAEEDQSGWWFNRCHSANLNGVYYSGPYT AKTDNGIVWYTWHGWWYSLKSVVMKIRPNDFIPNVI" sig_peptide 79..129 /gene="HFREP-1" mat_peptide 130..1014 /gene="HFREP-1" /product="unknown protein" polyA_site 1215 BASE COUNT 354 a 218 c 300 g 343 t ORIGIN 1 ctgggaagca gagtgtctgg atggaacctg agctgggtct ctgactcact tctgacttta 61 gttttttcaa gggggaacat ggcaaaggtg ttcagtttca tccttgttac caccgctctg 121 ataatgggca gggaaatttc ggcgctcgag gactgtgccc aggagcagat gcggctcaga 181 gcccaggtgc gcctgcttga gacccgggtc aaacagcaac aggtcaagat caagcagctt 241 ttgcaggaga atgaagtcca gttccttgat aaaggagatg aggatactgt cgttgatctt 301 ggaagcaaga ggcagtatgc agattgttca gagattttca atgatgggta taagctcagt 361 ggattttaca aaatcaaacc tctccagagc ccagcagaat tttctgttta ttgtgacatg 421 tccgatggag gaggatggac tgtaattcag agacgatctg atggcagtga aaactttaac 481 agaggatgga aagactatga aaatggcttt ggaaattttg tccaaaaaca tggtgaatat 541 tggctgggca ataaaaatct tcacttcttg accactcaag aagactacac tttaaaaatc 601 gaccttgcag attttgaaaa aaatagccgt tatgcacaat ataagaattt caaagttgga 661 gatgaaaaga atttctacga gttgaatatt ggggaatatt ctggaacagc tggagattcc 721 cttgcgggga attttcatcc tgaggtgcag tggtgggcta gtcaccaaag aatgaaattc 781 agcacgtggg acagagatca tgacaactat gaagggaact gcgcagaaga agatcagtct 841 ggctggtggt ttaacaggtg tcactctgca aacctgaatg gtgtatacta cagcggcccc 901 tacacggcta aaacagacaa tgggattgtc tggtacacct ggcatgggtg gtggtattct 961 ctgaaatctg tggttatgaa aattaggcca aatgatttta ttccaaatgt aatttaattg 1021 ctgctgttgg gcttcgtttc tgcaattcag ctttgtttaa agtgatttga aaaatactca 1081 ttctgaacat atccatgcgc aatcatgata actgttgtga gtagtgcttt tcattcttct 1141 cacttgcctt tgttacttaa tgtgctttca gtacagcaga tatgcaatat tcaccaaata 1201 aatgtagact gtgtt // LOCUS HUMHFSP 849 bp mRNA PRI 08-NOV-1994 DEFINITION Human Hanukah factor serine protease (HuHF) mRNA, complete cds. ACCESSION M18737 J03608 NID g184022 KEYWORDS Hanukah factor; T-cell-specific serine protease; natural killer cell-specific serine protease; serine protease. SOURCE Human peripheral blood lymphocyte, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 849) AUTHORS Gershenfeld,H.K., Hershberger,R.J., Shows,T.B. and Weissman,I.L. TITLE Cloning and chromosomal assignment of a human cDNA encoding a T cell- and natural killer cell-specific trypsin-like serine protease JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85 (4), 1184-1188 (1988) MEDLINE 88125000 FEATURES Location/Qualifiers source 1..849 /organism="Homo sapiens" /db_xref="taxon:9606" /map="5" mRNA <1..>840 /note="HFSP mRNA" sig_peptide 1..84 /gene="GJA1P1" /note="Hanukah factor serine protease signal peptide" gene 1..789 /gene="GJA1P1" CDS 1..789 /gene="GJA1P1" /note="Hanukah factor serine protease precursor" /codon_start=1 /db_xref="GDB:G00-125-920" /db_xref="PID:g306845" /translation="MRNSYRFLASSLSVVVSLLLIPEDVCEKIIGGNEVTPHSRPYMV LLSLDRKTICAGALIAKDWVLTAAHCNLNKRSQVILGAHSITREEPTKQIMLVKKEFP YPCYDPATREGDLKLLQLTEKAKINKYVTILHLPKKGDDVKPGTMCQVAGWGRTHNSA SWSDTLREVNITIIDRKVCNDRNHYNFNPVIGMNMVCAGSLRGGRDSCNGDSGSPLLC EGVFRGVTSFGLENKCGDPRGPGVYILLSKKHLNWIIMTIKGAV" mat_peptide 85..786 /gene="GJA1P1" /note="Hanukah factor serine protease" BASE COUNT 251 a 176 c 186 g 236 t ORIGIN Chromosome 5. 1 atgaggaact cctatagatt tctggcatcc tctctctcag ttgtcgtttc tctcctgcta 61 attcctgaag atgtctgtga aaaaattatt ggaggaaatg aagtaactcc tcattcaaga 121 ccctacatgg tcctacttag tcttgacaga aaaaccatct gtgctggggc tttgattgca 181 aaagactggg tgttgactgc agctcactgt aacttgaaca aaaggtccca ggtcattctt 241 ggggctcact caataaccag ggaagagcca acaaaacaga taatgcttgt taagaaagag 301 tttccctatc catgctatga cccagccaca cgcgaaggtg accttaaact tttacagctg 361 acggaaaaag caaaaattaa caaatatgtg actatccttc atctacctaa aaagggggat 421 gatgtgaaac caggaaccat gtgccaagtt gcagggtggg ggaggactca caatagtgca 481 tcttggtccg atactctgag agaagtcaat atcaccatca tagacagaaa agtctgcaat 541 gatcgaaatc actataattt taaccctgtg attggaatga atatggtttg tgctggaagc 601 ctccgaggtg gaagagactc gtgcaatgga gattctggaa gccctttgtt gtgcgagggt 661 gttttccgag gggtcacttc ctttggcctt gaaaataaat gcggagaccc tcgtgggcct 721 ggtgtctata ttcttctctc aaagaaacac ctcaactgga taattatgac tatcaaggga 781 gcagtttaaa taaccgtttc ctttcattta ctgtggcttc ttaatctttt cacaaataaa 841 atcaatttg // LOCUS HUMHGBE 2955 bp mRNA PRI 31-DEC-1994 DEFINITION Homo sapiens 1,4-alpha-glucan branching enzyme (HGBE) mRNA, complete cds. ACCESSION L07956 NID g184025 KEYWORDS 1,4-alpha-glucan branching enzyme. SOURCE Homo sapiens liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2955) AUTHORS Thon,V.J., Khalil,M. and Cannon,J.F. TITLE Isolation of human glycogen branching enzyme cDNAs by screening complementation in yeast JOURNAL J. Biol. Chem. 268 (10), 7509-7513 (1993) MEDLINE 93216700 FEATURES Location/Qualifiers source 1..2955 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="hepatocyte" /tissue_type="liver" misc_feature 1..12 /note="this is the 5' adaptor sequence used to clone cDNAs in this library" /evidence=experimental gene 91..2199 /gene="HGBE" CDS 91..2199 /gene="HGBE" /standard_name="glycogen branching enzyme" /EC_number="2.4.1.18" /codon_start=1 /function="branch glycogen" /evidence=experimental /product="1,4-alpha-glucan branching enzyme" /db_xref="PID:g184026" /translation="MAAPMTPAARPEDYEAALNAALADVPELARLLEIDPYLKPYAVD FQRRYKQFSQILKNIGENEGGIDKFSRGYESFGVHRCADGGLYSKEWAPGAEGVFLTG DFNGWNPFSYPYKKLDYGKWELYIPPKQNKSVLVPHGSKLKVVITSKSGEILYRISPW AKYVVREGDNVNYDWIHWDPEHSYEFKHSRPKKPRSLRIYESHVGISSHEGKVASYKH FTCNVLPRIKGLGYNCIQLMAIMEHAYYASFGYQITSFFAASSRYGTPEELQELVDTA HSMGIIVLLDVVHSHASKNSADGLNMFDGTDSCYFHSGPRGTHDLWDSRLFAYSSWEV LRFLLSNIRWWLEEYRFDGFRFDGVTSMLYHHHGVGQGFSGDYSEYFGLQVDEDALTY LMLANHLVHTLCPDSITIAEDVSGMPALCSPISQGGGGFDYRLAMAIPDKWIQLLKEF KDEDWNMGDIVYTLTNRRYLEKCIAYAESHDQALVGDKSLAFWLMDAEMYTNMSVLTP FTPVIDRGIQLHKMIRLITHGLGGEGYLNFMGNEFGHPEWLDFPRKGNNESYHYARRQ FHLTDDDLLRYKFLNNFDRDMNRLEERYGWLAAPQAYVSEKHEGNKIIAFERAGLLFI FNFHPSKSYTDYRVGTALPGKFKIVLDSDAAEYGGHQRLDHSTDFFSEAFEHNGRPYS LLVYIPSRVALILQNVDLPN" misc_feature 2900..2955 /note="this is the 3' adaptor used in the cDNA library construction; the 16 a's should be considered the true 3' end, although in vivo length may actually be longer" /evidence=experimental BASE COUNT 853 a 562 c 650 g 890 t ORIGIN 1 gatctgaatt cggtcccagc tagagctcca gcgcccgctc aggccccact cgaccctctc 61 gggcctcggc tacttggact gcggcggaat atggcggctc cgatgactcc cgcggctcgg 121 cccgaggact acgaggcggc gctcaatgcc gccctggctg acgtgcccga actggccaga 181 ctcctggaga tcgacccgta cttgaagccc tacgccgtgg acttccagcg caggtataag 241 cagtttagcc aaattttgaa gaacattgga gaaaatgaag gtggtattga taagttttcc 301 agaggctatg aatcatttgg cgtccacaga tgtgctgatg gtggtttata ctccaaagaa 361 tgggccccgg gagcagaagg agtttttctt actggagatt ttaatggttg gaatccattt 421 tcgtacccat acaaaaaact ggattatgga aaatgggagc tgtatatccc accaaagcag 481 aataaatctg tactcgtgcc tcatggatcc aaattaaagg tagttattac tagtaaaagc 541 ggagagatct tgtatcgtat ttcaccgtgg gcaaagtatg tggttcgtga aggtgataat 601 gtgaattatg attggataca ctgggatcca gaacactcat atgagtttaa gcattccaga 661 ccaaagaagc cacggagtct aagaatttat gaatctcatg tgggaatttc ttcccatgaa 721 ggaaaagtag cttcttataa acattttaca tgcaatgtac taccaagaat caaaggcctt 781 ggatacaact gcattcagtt gatggcaatc atggagcatg cttactatgc cagctttggt 841 taccaaatca caagcttctt tgcagcttcc agccgttatg gaacacctga agagctacaa 901 gaactggtag acacagctca ttccatgggt atcatagtcc tcttagatgt ggtacacagc 961 catgcttcaa aaaattcagc agatggattg aatatgtttg atgggacaga ttcctgttat 1021 tttcattctg gacctagagg gactcatgat ctttgggata gcagattgtt tgcctactcc 1081 agctgggaag ttttaagatt ccttctgtca aacataagat ggtggttgga agaatatcgc 1141 tttgatggat ttcgttttga tggtgttacg tccatgcttt atcatcacca tggagtgggt 1201 caaggtttct caggtgatta cagtgaatat ttcggactac aagtagatga agatgccttg 1261 acttacctca tgttggcaaa tcatttggtt cacacgctgt gtcccgattc tataacaata 1321 gctgaggatg tatcaggaat gccagctctg tgctctccaa tttcccaggg agggggtggt 1381 tttgactatc gactagccat ggcaattcca gataagtgga ttcagctact taaagagttt 1441 aaagatgaag actggaacat gggcgatata gtatacacgc tcacaaacag gcgctacctt 1501 gaaaagtgca ttgcttatgc agagagccat gatcaggcat tggttgggga taagtcgctg 1561 gcattttggt tgatggatgc cgaaatgtat acaaacatga gtgtcctgac tccttttact 1621 ccagttattg atcgtggaat acagcttcat aaaatgattc gactcattac gcatgggctt 1681 ggtggagaag gctatctcaa tttcatgggt aatgaatttg ggcatcctga atggttagac 1741 ttcccaagaa aaggaaataa tgagagttac cattatgcca ggcggcagtt tcatttaact 1801 gacgacgacc ttcttcgcta caagttccta aataattttg acagggatat gaatagattg 1861 gaagaaagat atggttggct tgcagctcca caggcctacg tgagtgaaaa acatgaaggc 1921 aataagatca ttgcttttga aagagcaggt cttcttttca ttttcaactt ccatccaagc 1981 aagagctaca ctgactaccg agttggaaca gcattgccag ggaaattcaa aattgtgcta 2041 gattcagatg cagcggaata tggagggcat cagagactgg accacagcac tgactttttt 2101 tctgaggctt ttgaacataa tgggcgtccc tattctcttt tggtgtacat tccaagcaga 2161 gtggccctca tccttcagaa tgtggatctg ccgaattgaa gaggcctgat ttcagctcca 2221 ccagatgcag atttgtgttt tgttttcttg ttatcactgt cacacagctt ataacatgta 2281 tgcttttcag aatacagttg tctagccaag ccatcaagtg tctgaaattc aatattggtt 2341 tatgcaaata cagcaaactt ttatttaagt agataggaga atatgtttaa aatattagga 2401 atcctagacc atattttcaa gtcatcttag cagctaggat tctcaaatgg aagtgttata 2461 tataatatgt taaaaacatt ttgctttcct ggctaattat ttgatccttt taaatccaaa 2521 tttgaatcat ttgtcatgta tgattatttc tgttaaatgt acacagtatt taagatggat 2581 atttggtggc tctatttgtt ctgatatctt ttggtctaaa ttatgaggta ccaagattgt 2641 ttctttgttt ctttttttca aattgtgttt agaaatactg taataaatat gcagtagtga 2701 tataaagaat tatatccaag gtaatataaa agccattacg tatgaactca tccgtgtctc 2761 attttgtgtt ttattttgtg atctcttgtc cactaagtat cttgttaaat gccagtatct 2821 cagtctttct gaagccctga aatggtaatt gtagcatttc agaaaatgtc tttcatttca 2881 atcaataaaa agcttttgta aaaaaaaaaa aaaaaaaaaa aaaaaccgtc gacaaagcgg 2941 ccgcaaaccg aattc // LOCUS HUMHGFAL 3008 bp mRNA PRI 25-JUL-1996 DEFINITION Human mRNA for HGF activator like protein, complete cds. ACCESSION D49742 NID g736706 KEYWORDS HGF activator like protein; serin protease. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3008) AUTHORS Kitamura,N. JOURNAL Unpublished (1995) REFERENCE 2 (bases 1 to 3008) AUTHORS Kitamura,N. TITLE Direct Submission JOURNAL Submitted (17-MAR-1995) to the DDBJ/EMBL/GenBank databases. Naomi Kitamura, Institute for Liver Research, Kansai Medical University; Moriguchi, Osaka 570, Japan (Tel:06-992-1001(ex.2530), Fax:06-994-6099) FEATURES Location/Qualifiers source 1..3008 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 97..1779 /codon_start=1 /product="HGF activator like protein" /db_xref="PID:d1009187" /db_xref="PID:g1345398" /translation="MFARMSDLHVLLLMALVGKTACGFSLMSLLESLDPDWTPDQYDY SYEDYNQEENTSSTLTHAENPDWYYTEDQADPCQPNPCEHGGDCLVHGSTFTCSCLAP FSGNKCQKVQNTCKDNPCGRGQCLITQSPPYYRCVCKHPYTGPSCSQVVPVCRPNPCQ NGATCSRHKRRSKFTCACPDQFKGKFCEIGSDDCYVGDGYSYRGKMNRTVNQHACLYW NSHLLLQENYNMFMEDAETHGIGEHNFCRNPDADEKPWCFIKVTNDKVKWEYCDVSAC SAQDVAYPEESPTEPSTKLPGFDSCGKTEIAERKIKRIYGGFKSTAGKHPWQASLQSS LPLTISMPQGHFCGGALIHPCWVLTAAHCTDIKTRHLKVVLGDQDLKKEEFHEQSFRV EKIFKYSHYNERDEIPHNDIALLKLKPVDGHCALESKYVKTVCLPDGSFPSGSECHIS GWGVTETGKGSRQLLDAKVKLIANTLCNSRQLYDHMIDDSMICAGNLQKPGQDTCQGD SGGPLTCEKDGTYYVYGIVSWGLECGKRPGVYTQVTKFLNWIKATIKSESGF" polyA_site 2312 polyA_site 3008 BASE COUNT 791 a 837 c 703 g 677 t ORIGIN 1 cctgaatcct tggagactga catttttccc ccctaaaggc atagacaaca aaagaaattt 61 tattgagagg aaaacacaag tccttaaact gcaaagatgt ttgccaggat gtctgatctc 121 catgttctgc tgttaatggc tctggtggga aagacagcct gtgggttctc cctgatgtct 181 ttattggaaa gcctggaccc agactggacc cctgaccagt atgattacag ctacgaggat 241 tataatcagg aagagaacac cagtagcaca cttacccatg ctgagaatcc tgactggtac 301 tacactgagg accaagctga tccatgccag cccaacccct gtgaacacgg tggggactgc 361 ctcgtccatg ggagcacctt cacatgcagc tgcctggctc ctttctctgg gaataagtgt 421 cagaaagtgc aaaatacgtg caaggacaac ccatgtggcc ggggccaatg tctcattacc 481 cagagtcctc cctactaccg ctgtgtctgt aaacaccctt acacaggtcc cagctgctcc 541 caagtggttc ctgtatgcag gccaaacccc tgccagaatg gggctacctg ctcccggcat 601 aagcggagat ccaagttcac ctgtgcctgt cccgaccagt tcaaggggaa attctgtgaa 661 ataggttctg atgactgcta tgttggcgat ggctactctt accgagggaa aatgaatagg 721 acagtcaacc agcatgcgtg cctttactgg aactcccacc tcctcttgca ggagaattac 781 aacatgttta tggaggatgc tgaaacccat gggattgggg aacacaattt ctgcagaaac 841 ccagatgcgg acgaaaagcc ctggtgcttt attaaagtta ccaatgacaa ggtgaaatgg 901 gaatactgtg atgtctcagc ctgctcagcc caggacgttg cctacccaga ggaaagcccc 961 actgagccat caaccaagct tccggggttt gactcctgtg gaaagactga gatagcagag 1021 aggaagatca agagaatcta tggaggcttt aagagcacgg cgggcaagca cccatggcag 1081 gcgtccctcc agtcctcgct gcctctgacc atctccatgc cccagggcca cttctgtggt 1141 ggggcgctga tccacccctg ctgggtgctc actgctgccc actgcaccga cataaaaacc 1201 agacatctaa aggtggtgct aggggaccag gacctgaaga aagaagaatt tcatgagcag 1261 agctttaggg tggagaagat attcaagtac agccactaca atgaaagaga tgagattccc 1321 cacaatgata ttgcattgct caagttaaag ccagtggatg gtcactgtgc tctagaatcc 1381 aaatacgtga agactgtgtg cttgcctgat gggtcctttc cctctgggag tgagtgccac 1441 atctctggct ggggtgttac agaaacagga aaagggtccc gccagctcct ggatgccaaa 1501 gtcaagctga ttgccaacac tttgtgcaac tcccgccaac tctatgacca catgattgat 1561 gacagtatga tctgtgcagg aaatcttcag aaacctgggc aagacacctg ccagggtgac 1621 tctggaggcc ccctgacctg tgagaaggac ggcacctact acgtctatgg gatagtgagc 1681 tggggcctgg agtgtgggaa gaggccaggg gtctacaccc aagttaccaa attcctgaat 1741 tggatcaaag ccaccatcaa aagtgaaagt ggcttctaag gtactgtctt ctggacctca 1801 gagcccactc tccttggcac cctgacaccg ggaggcctca tggccaacaa tggacacctc 1861 cagagcctcc aggggaccac acagtagact atccctactc taagcagaga caactgccac 1921 ccagcctggg ccttcccaga ccagcatttg cacaatatca ccaggcttct tctgcctccc 1981 ttggtaaccc aaggaatgat ggaatcaaca caacatagta tgtttgcttt ccttacccaa 2041 ttgtaccttc tagaaaatca gtgttcacag agactgcctc caccacaggc atcctgcaaa 2101 tgcagactcc agaatcccca gcatcagcgg gaaccaccat cacatcttta ttcctcagcc 2161 cagacactcg aggcactcaa cagaatcagc catccacgtc taggtatcag agaggaccac 2221 aaatacaaca ttctccatct gctttcagag ttattatttt aataaaggaa gatctgggat 2281 gggctggtgg gccattccag cttgccgaaa tcaaagccat ctgaagcctg tctctggtga 2341 acaaacttcc tctctggcct ctcaggaatc agggtggcat ggctcacaac agcagggcct 2401 tcttcttttt gacgtgcaga atctcagtgg catctgggtt cacctcccca ctctgatgat 2461 ctccagcctc cactgcttct gccccccgct gctgaaatca aacatacccc aagttaaaat 2521 gaagctcccc cacccccact cccggccccg gttcccacag gacacgctaa gaagcacagg 2581 gagcatttaa caggctcacc ctccctttcc ttttcccctc ttctaccctc cccaagaaaa 2641 agggccttca aggcaggaat gagaaagcaa agccaatctc tcatttagac ctggcttctt 2701 tcttctgaac aaagtagggt tcaaaatgca gactgtcata tccagcgagt ccctgaccct 2761 ttctgcgaat gtaacgagca agcagtcagc acagcctggg ctgccctggc ccgggattga 2821 tgtagccccg gtaggtttgc ctctgcagaa ctaatggctg tgacttcaga gaagccctgc 2881 aggaagttta acctacgtgt catctgcctg gtcatctcag acccatgaaa ttaggcgcct 2941 tgtttgagct gcgtttcaca cttctttaga cgctagctga cctttggcca aaaataaact 3001 ttgaaaag // LOCUS HUMHHR23B 2905 bp mRNA PRI 03-JUL-1996 DEFINITION Human mRNA for XP-C repair complementing protein (p58/HHR23B), complete cds. ACCESSION D21090 NID g498147 KEYWORDS XP-C repair complementing protein (p58/HHR23B); repair protein. SOURCE Homo sapiens cervical carcinoma epitherial cell_line:HeLa cDNA to mRNA, clone_lib:lambda gt10. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2905) AUTHORS Masutani,C., Sugasawa,K., Yanagisawa,J., Sonoyama,T., Ui,M., Enomoto,T., Takio,K., Tanaka,K., Spek,P.V., Bootsma,D., Hoeijmakers,J.H. and Hanaoka,F. TITLE Purification and cloning of a nucleotide excision repair complex involving the xeroderma pigmentosum group C protein and a human homologue of yeast RAD23 JOURNAL EMBO J. 13 (8), 1831-1843 (1994) MEDLINE 94222030 REFERENCE 2 (sites) AUTHORS van der Spek,P.J., Eker,A., Rademakers,S., Visser,C., Sugasawa,K., Masutani,C., Hanaoka,F., Bootsma,D. and Hoeijmakers,J.H. TITLE XPC and human homologs of RAD23: intracellular localization and relationship to other nucleotide excision repair complexes JOURNAL Nucleic Acids Res. 24 (13), 2551-2559 (1996) MEDLINE 96292259 REFERENCE 3 (bases 1 to 2905) AUTHORS Hanaoka,F. TITLE Direct Submission JOURNAL Submitted (07-OCT-1993) to the DDBJ/EMBL/GenBank databases. Fumio Hanaoka, Riken Institute of Physical and Chemical Research; 2-1 Hirosawa, Wako, Saitama 351-01, Japan (Tel:048-462-1111(ex.5561), Fax:048-462-4673) FEATURES Location/Qualifiers source 1..2905 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /cell_type="epitherial" /clone_lib="lambda gt10" /tissue_type="cervical carcinoma" gene 314..1543 /gene="XPCC" CDS 314..1543 /gene="XPCC" /codon_start=1 /product="XP-C repair complementing protein (p58/HHR23B)" /db_xref="PID:d1005181" /db_xref="PID:g498148" /translation="MQVTLKTLQQQTFKIDIDPEETVKALKEKIESEKGKDAFPVAGQ KLIYAGKILNDDTALKEYKIDEKNFVVVMVTKPKAVSTPAPATTQQSAPASTTAVTSS TTTTVAQAPTPVPALAPTSTPASITPASATASSEPAPASAAKQEKPAEKPAETPVATS PTATDSTSGDSSRSNLFEDATSALVTGQSYENMVTEIMSMGYEREQVIAALRASFNNP DRAVEYLLMGIPGDRESQAVVDPPQAASTGAPQSSAVAAAAATTTATTTTTSSGGHPL EFLRNQPQFQQMRQIIQQNPSLLPALLQQIGRENPQLLQQISQHQEHFIQMLNEPVQE AGGQGGGGGGGSGGIAEAGSGHMNYIQVTPQEKEAIERLKALGFPEGLVIQAYFACEK NENLAANFLLQQNFDED" polyA_signal 2592..2597 polyA_signal 2757..2762 BASE COUNT 829 a 629 c 662 g 785 t ORIGIN 1 tagcgattcc ctgcttgtct cgccgacccc ctcgcgcctt ctgcagactc cgtggctggc 61 gctcggcgcg tgaggaagca cggcggcccg agttcgcggg gaaggccgca gtcgcggagg 121 cagcggcgcg gtccggggca cgggctgggg gagaggccgc tccgctgggc gaatgtgaca 181 agcccccacc cccaccgcct tcctccccag agcgcgagga gcgcgggcga ccccggggcc 241 ccgccaggcc acagaccccg cccagcggcc agcacccggc gcaggcccgg cagccgagct 301 gcgcggcggc accatgcagg tcaccctgaa gaccctccag cagcagacct tcaagataga 361 cattgacccc gaggagacgg tgaaagcact gaaagagaag attgaatctg aaaaggggaa 421 agatgccttt ccagtagcag gtcaaaaatt aatttatgca ggcaaaatcc tcaatgatga 481 tactgctctc aaagaatata aaattgatga gaaaaacttt gtggtggtta tggtgaccaa 541 acccaaagca gtgtccacac cagcaccagc tacaactcag cagtcagctc ctgccagcac 601 tacagcagtt acttcctcca ccaccacaac tgtggctcag gctccaaccc ctgtccctgc 661 cttggccccc acttccacac ctgcatccat cactccagca tcagcgacag catcttctga 721 acctgcacct gctagtgcag ctaaacaaga gaagcctgca gaaaagccag cagagacacc 781 agtggctact agcccaacag caactgacag tacatcgggt gattcttctc ggtcaaacct 841 ttttgaagat gcaacgagtg cacttgtgac gggtcagtct tacgagaata tggtaactga 901 gatcatgtca atgggctatg aacgagagca agtaattgca gccctgagag ccagtttcaa 961 caaccctgac agagcagtgg agtatctttt aatgggaatc cctggagata gagaaagtca 1021 ggctgtggtt gacccccctc aagcagctag tactggggct cctcagtctt cagcagtggc 1081 tgcagctgca gcaactacga cagcaacaac tacaacaaca agttctggag gacatcccct 1141 tgaattttta cggaatcagc ctcagtttca acagatgaga caaattattc agcagaatcc 1201 ttccttgctt ccagcgttac tacagcagat aggtcgagag aatcctcaat tacttcagca 1261 aattagccaa caccaggagc attttattca gatgttaaat gaaccagttc aagaagctgg 1321 tggtcaagga ggaggaggtg gaggtggcag tggaggaatt gcagaagctg gaagtggtca 1381 tatgaactac attcaagtaa cacctcagga aaaagaagct atagaaaggt taaaggcatt 1441 aggatttcct gaaggacttg tgatacaagc gtattttgct tgtgagaaga atgagaattt 1501 ggctgccaat tttcttctac agcagaactt tgatgaagat tgaaagggac ttttttatat 1561 ctcacacttc acaccagtgc attacactaa cttgttcact ggattgtctg ggatgacttg 1621 ggctcatatc cacaatactt ggtataaggt agtagattgt tgggggtggg gagggaggga 1681 tctaggatac agggcaggga taaatacagt gcatgtctgc ttcaattagc agatgccgca 1741 actccacaca gtgtgtaaaa tatatacaac caaaaatcag cttttgcagg tctttatttc 1801 ttctgtaaaa cagtaggtaa cttttcctag gtttcactct ttttagtgta ctagatccag 1861 aaacttagtg taatgccctg ctttatatat ctttgactta acattggttt cagaaagaat 1921 cttagctacc tagaatttac agtctctgtt tcatggcaac actggataat ggctttgtga 1981 aatttaaaaa atttttgtag cgactgtaaa cagaaatgcc aaattgatgg ttaattgttg 2041 ctgcttcaaa aataagtata aaattaatat gtaaggaagc ccattctttc atgttaaata 2101 cttggggtgg gaggggagaa agggaacctt ttcttaaaat gaaaataatt actgctattt 2161 taaaatttct tgatcattga atgtgagacc cttctaacat gatttgagaa gctgtacaag 2221 tataggcaga gttattttcc tgtttacatt ttttttttgt tttggggaaa aaattggtag 2281 gtgtctaatt actgtttact tcattgttat attgcagtaa aagttttaaa acaaccattg 2341 catgtttgct tttgatgtat ccctttgtga aattagcact tttggggcca atggagaaat 2401 gcagcattca ctctccctgt cttttcccct tccctcagca gaaacgtgtt tatcagcaag 2461 tcgtgagtca aactgctgcc ttttaaaaaa cccacaaaat gctgattcag ttcaaaatta 2521 atgcaaatgt ttcaaaactg ggtttctgat atttgtaaat gtgtttcttt attagataag 2581 agtgtattac cattaaagtc attagtataa tattgctttc aaaaagaaat ggtagacaaa 2641 actataatcc agcatctttt attgcattgg aaagactggc aaagtctttt ggatgggttg 2701 ggagatgtgg ctggaaagta ctttggaaaa tatacaatca agatatctca tggcatatta 2761 aaagaaaaat cttaatagca gtgttggctt ttatttggat tttttcatct cagttttttc 2821 tgtggaatct ccttcattgg cattgttatt taatcataaa cggggcagat gtctacttgt 2881 tcagtttttc aaatctgttt tcctg // LOCUS HUMHIP116A 3418 bp mRNA PRI 25-MAY-1995 DEFINITION Human ATPase, DNA-binding protein (HIP116) mRNA, 3' end. ACCESSION L34673 NID g531195 KEYWORDS ATPase; DNA-binding protein; SNF2/SWI2-related protein; transcription factor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3418) AUTHORS Sheridan,P.L., Schorpp,M., Voz,M.L. and Jones,K.A. TITLE Cloning of an SNF2/SWI2-related protein that binds specifically to the SPH motifs of the SV40 enhancer and to the HIV-1 promoter JOURNAL J. Biol. Chem. 270 (9), 4575-4587 (1995) MEDLINE 95181452 FEATURES Location/Qualifiers source 1..3418 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa S3" 5'UTR 1..177 mRNA 1..3418 gene 178..3207 /gene="HIP116" CDS 178..3207 /gene="HIP116" /note="SNF2/SWI2-related protein; DNA-binding protein" /codon_start=1 /function="putative transcription factor" /product="ATPase" /db_xref="PID:g531196" /translation="MSWMFKRDPVWKYLQTVQYGVHGNFPRLSYPTFFPRFEFQDVIP PDDFLTSDEEVDSVLFGSLRGHVVGLRYYTGVVNNNEMVALQRDPNNPYDKNAIKVNN VNGNQVGHLKKELAGALAYIMDNKLAQIEGVVPFGANNAFTMPLHMTFWGKEENRKAV SDQLKKHGFKLGPAPKTLGFNLESGWGSGRAGPSYSMPVHAAVQMTTEQLKTEFDKLF EDLKEDDKTHEMEPAEAIETPLLPHQKQALAWMVSRENSKELPPFWEQRNDLYYNTIT NFSEKDRPENVHGGILADDMGLGKTLTAIAVILTNFHDGRPLPIERVKKNLLKKEYNV NDDSMKLGGNNTSEKADGLSKDASRCSEQPSISDIKEKSKFRMSELSTSRPKRRKTAV QYIESSDSEEIETSELPQKMKGKLKNVQSETKGRAKAGSSKVIEDVAFACALTSSVPT TKKKMLKKGACAVEGSKKTDVEERPRTTLIICPLSVLSNWIDQFGQHIKSDVHLNFYV YYGPDRIREPALLSKQDIVLTTYNILTHDYGTKGDSPLHSIRWLRVILDEGHAIRNPN AQQTKAVLDLESERRWVLTGTPIQNSLKDLWSLLSFLKLKPFIDREWWHRTIQRPVTM GDEGGLRRLQSLIKNITLRRTKTSKIKGKPVLELPERKVFIQHITLSDEERKIYQSVK NEGRATIGRYFNEGTVLAHYADVLGLLLRLRQICCHTYLLTNAVSSNGPSGNDTPEEL RKKLIRKMKLILSSGSDEECAICLDSLTVPVITHCAHVFCKPCICQVIQNEQPHAKCP LCRNDIHEDNLLECPPEELARDSEKKSDMEWTSSSKINALMHALTDLRKKNPNIKSLV VSQFTTFLSLIEIPLKASGFVFTRLDGSMAQKKRVESIQCFQNTEAGSPTIMLLSLKA GGVGLNLSAASRVFLMDPAWNPAAEDQCFDRCHRLGQKQEVIITKFIVKDSVEENMLK IQNKKRELAAGAFGTKKPNADEMKQAKINEIRTLIDL" 3'UTR 3208..3418 BASE COUNT 1135 a 602 c 744 g 937 t ORIGIN 1 attcccgggg tctgactgga ctcgcggcga cttacctttc agtcgtgcgc tcctgatccg 61 gcgctcggaa tttgtccccg gcttcagggc tgcggggcct ggaaggaggc gtatcgaggc 121 ggctcgaaaa cgatccaggg gagccgaggc gctcctcttg tcatcccact cagcgccatg 181 tcctggatgt tcaagaggga tccagtttgg aagtacttgc agactgtcca gtatggagtt 241 catggaaatt ttccacgcct ctcatatcca actttctttc cacgttttga attccaagat 301 gttatccctc cagatgactt tctaactagt gatgaagaag tagattccgt tttatttgga 361 agtttgagag gtcatgtggt tggactacgc tattacacgg gagtagttaa taataatgaa 421 atggttgcat tacaacgaga tcctaataac ccttatgata agaatgcaat taaagtaaac 481 aatgtgaatg gaaatcaagt tggccattta aagaaagagc ttgcaggtgc tttggcctat 541 atcatggaca acaaattggc acaaattgaa ggggtagttc cttttggtgc aaacaatgct 601 tttaccatgc ctctgcatat gactttttgg ggaaaagaag aaaatagaaa agcggtttca 661 gatcagttga agaaacatgg atttaaattg ggtcctgcac caaaaacttt aggattcaat 721 ttggaaagtg gttggggctc tggaagagct ggaccaagct atagtatgcc agtgcatgct 781 gcagtacaga tgacaactga acagcttaaa acagaatttg acaaattgtt tgaagattta 841 aaagaagatg ataaaaccca tgaaatggaa ccagctgagg ctattgaaac accactgctt 901 ccacatcaaa aacaagctct agcttggatg gtgtcacggg aaaatagcaa agaacttcca 961 ccattctggg aacagcgaaa tgacttatac tataacacaa taacaaattt ttctgagaag 1021 gaccgaccag aaaatgtcca tggaggaatt ttagctgatg atatgggttt gggtaaaact 1081 cttacagcca ttgcagtaat ccttaccaac ttccatgatg gcagacctct tcctattgaa 1141 agagttaaaa agaatctact gaagaaggaa tataatgtta acgatgactc tatgaaactt 1201 ggaggaaaca ataccagtga aaaggcagat ggactaagca aagacgcatc tagatgtagt 1261 gaacaaccca gtatttcaga tatcaaggag aagagtaagt ttcgcatgtc agaattgtct 1321 acgtcccgcc ccaaaagaag aaaaactgct gtccagtaca tagaaagcag tgattcagag 1381 gaaattgaaa caagtgaatt gccgcagaaa atgaaaggca aactgaaaaa tgtacagtct 1441 gaaactaaag gcagggcgaa agcaggatct tctaaggtta tagaagatgt ggcatttgca 1501 tgtgcattaa cttcatctgt tcctacaaca aaaaagaaaa tgttgaaaaa gggagcttgt 1561 gcagtggagg ggtcaaagaa aactgatgtt gaggagagac caagaacaac actgatcatc 1621 tgtccgcttt ctgtgttaag caactggatt gaccagtttg gacaacatat aaaatcagat 1681 gtacacttga atttttatgt ttattatggt cctgatcgta ttagagaacc ggccttactt 1741 tcaaaacagg atattgtttt gactacgtat aatattttaa ctcatgacta tggaactaaa 1801 ggagatagtc cattacatag cataaggtgg ctaagagtga tcctggatga aggacatgcc 1861 atacgaaatc caaatgctca gcagacaaaa gctgtacttg acttagaatc agaaagaaga 1921 tgggttttga caggtactcc aatccagaat tctttaaagg acttgtggtc tcttctttcc 1981 tttttaaaac ttaaaccatt tattgataga gaatggtggc atagaacaat acagcgtcct 2041 gtcacaatgg gagatgaagg aggacttagg cgtttacagt ccctaattaa aaatattaca 2101 cttagaagaa caaagacaag caaaattaaa ggaaaacctg ttttggagtt accagaacgt 2161 aaagtattta ttcagcacat tacactttca gatgaagaga gaaagattta tcagtctgtg 2221 aaaaatgaag gcagagccac tattggaagg tattttaatg aagggactgt cctggcacat 2281 tatgcagatg tcctgggtct tttgcttaga ctgcggcaaa tttgttgcca tacttacctt 2341 cttacaaatg cagtgtcttc caatggcccc tcaggaaatg atacacctga agaactgaga 2401 aagaagttaa taaggaagat gaagttaatt ctgagctcag gttcagatga ggaatgtgca 2461 atttgcctgg attctttaac agttcctgtg ataacacatt gtgcacatgt attttgtaaa 2521 ccctgtattt gccaagtcat tcagaatgag cagccacatg ctaaatgccc tttatgcaga 2581 aatgatatac atgaagataa tttattagaa tgtcctccag aagaattagc acgtgacagt 2641 gagaaaaagt ctgatatgga atggacatcc agttcaaaga ttaatgcgct aatgcacgca 2701 ttgactgact taagaaagaa gaatcccaac ataaaaagtt tggttgtttc tcagtttaca 2761 acattcctgt ctttaataga aataccactt aaagcctctg gatttgtgtt tactcgtttg 2821 gatggttcca tggcccaaaa gaaaagagtt gaatcaattc agtgttttca aaacactgaa 2881 gcaggatctc caactataat gcttctgtcc ttaaaagcag gtggagttgg tttgaatctg 2941 tctgcagctt ctcgagtgtt tttaatggat ccagcctgga atcctgctgc tgaagatcag 3001 tgctttgaca gatgccatag acttggtcag aagcaagaag ttatcatcac aaaattcatt 3061 gtaaaggact ctgttgaaga aaatatgctg aaaatacaaa acaaaaagag agaacttgca 3121 gcaggagcct ttggaactaa aaaaccaaat gctgacgaaa tgaaacaagc caaaattaat 3181 gaaatcagaa cattaattga cttataattt gtgggatttt agtaagaaga ctactatatg 3241 tgagaggcgt gatatctgga tggaagttgg gctggatgat ctccaaagtc gtttcaactc 3301 ttaaagacat cttaatcctg aatgtaaaca attgttatgt gtttagaatc agaatttgat 3361 tttgaacttg agtaattcat ccttacagct atctgtagaa ttagtcatct tttttctt // LOCUS HUMHISAC 1978 bp DNA PRI 07-MAR-1995 DEFINITION Human histone H1 (H1F4) gene, complete cds. ACCESSION M60748 NID g184073 KEYWORDS histone H1. SOURCE Human blood DNA, clone C3. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1978) AUTHORS Albig,W., Kardalinou,E., Drabent,B., Zimmer,A. and Doenecke,D. TITLE Isolation and characterization of two human H1 histone genes within clusters of core histone genes JOURNAL Genomics 10 (4), 940-948 (1991) MEDLINE 92009931 FEATURES Location/Qualifiers source 1..1978 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="C3" /tissue_type="blood" /map="12q11-q21" gene 730..1389 /gene="H1F4" CDS 730..1389 /gene="H1F4" /note="putative" /codon_start=1 /db_xref="GDB:G00-120-030" /product="histone H1" /db_xref="PID:g184074" /translation="MSETAPAAPAAPAPAEKTPVKKKARKSAGAAKRKASGPPVSELI TKAVAASKERSGVSLAALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGAS GSFKLNKKAASGEAKPKAKKAGAAKAKKPAGAAKKPKKATGAATPKKSAKKTPKKAKK PAAAAGAKKAKSPKKAKAAKPKKAPKSPAKAKAVKPKAAKPKTAKPKAAKPKKAAAKK K" BASE COUNT 532 a 494 c 544 g 408 t ORIGIN 1 aagggaaaga attatccaag aattgtttaa aaactcagat gtagcggaca gatgtaaaac 61 catggctgta tagattgatg tcccaggggt ccaaaactta atctcaaatg ggcaataatt 121 tgtttggcat taaactaaac cagtttgatg aactcaaatg ccctcggctc aataggcagg 181 actctccgag gagcctgtgt tacttccctc acttaagtgc agatttgtaa taaaaatctt 241 aatgccagtg gcatgctttt tggatatata agaagctaac cacttggagt atcatatttg 301 agaggtcaga aaagtccaca gttaaagatc ggtttataat ttacgaagaa atagaaagtt 361 ttgtttcctc ctgagttgaa atttgccaag cacggaggaa atattgcaag tttttggcac 421 aaggctttct gcttcccctt ataatttgag atctgcgtga agcctgaggg ttcggggatc 481 attatctgag aaaaaccggg cagttcggtg tagacaattt ttatattttt ggcttttttt 541 gaggtgtaac aaacacaact cgggatccga gaggacactc tgcggctgcc agcgaggcgg 601 gctggacagc gcaccaatca cggcgcacgt ccgccctata taaacgggcg ggcgcagcgc 661 cgcggctcga gtcccggcca gtgcctctgc ttccggctcg aattgctctc gctcacgctt 721 gccttcaaca tgtccgagac tgcgcctgcc gcgcccgctg ctccggcccc tgccgagaag 781 actcccgtga agaagaaggc ccgcaagtct gcaggtgcgg ccaagcgcaa agcgtctggg 841 cccccggtgt ccgagctcat tactaaagct gttgccgcct ccaaggagcg cagcggcgta 901 tctttggccg ctctcaagaa agcgctggca gccgctggct atgacgtgga gaaaaacaac 961 agccgcatca agctgggtct caagagcctg gtgagcaagg gcaccctggt gcagaccaag 1021 ggcaccggcg cgtcgggttc cttcaaactc aacaagaagg cggcctctgg ggaagccaag 1081 cctaaggcta aaaaggcagg cgcggccaag gccaagaagc cagcaggagc ggcgaagaag 1141 cccaagaagg cgacgggggc ggccaccccc aagaagagcg ccaagaagac cccaaagaag 1201 gcgaagaagc cggctgcagc tgctggagcc aaaaaagcga aaagcccgaa aaaggcgaaa 1261 gcagccaagc caaaaaaggc gcccaagagc ccagcgaagg ccaaagcagt taaacccaag 1321 gcggctaaac caaagaccgc caagcccaag gcagccaagc caaagaaggc ggcagccaag 1381 aaaaagtaga aagttccttt ggccaactgc ttagaagccc aacacaaccc aaaggctctt 1441 ttcagagcca cccaccgctc tcagtaaaag agctgttgca ctattagggg gcgtggctcg 1501 ggaaaacgct gctaagcagg ggcgggtctc ccgggaacaa agtcggggag aggagtggga 1561 ttttgtgtgt ctccggagct atttttgact atggcgtcgc gtcgcccaag ccggagtgca 1621 gtggcgtcat ctcgattttg cgttctcgag tgtcggagtt gaacccattt gggcctccct 1681 tgtgctttgc cttttagcag gccctggctc cagatagcat gggaaaaaaa atgttgggat 1741 tttccccggg tttctaagct gggtttttcc gagttccaaa cacggcacag tgtatcagtt 1801 tctgtgctgg ttacaagcct actggttatc cctatcgagt atggcaggca gtgagggact 1861 tcagaggagt acgtcttagg acaagtggca tagtactgac attatttccg aagggctaca 1921 tttcaagtgc ttggggagac tactgccaca taactgaaat tagaaaccaa cactgcag // LOCUS HUMHISD 3085 bp mRNA PRI 29-MAY-1994 DEFINITION Human mRNA for histidase, complete cds. ACCESSION D16626 NID g451209 KEYWORDS histidase. SOURCE Homo sapiens liver (library: CLHL1115a) cDNA to mRNA, clones pKS-hHAL-[2 and 7]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3085) AUTHORS Suchi,M., Harada,N., Wada,Y. and Takagi,Y. TITLE Molecular cloning of a cDNA encoding human histidase JOURNAL Biochim. Biophys. Acta 1216 (2), 293-295 (1993) MEDLINE 94060103 REFERENCE 2 (bases 1 to 3085) AUTHORS Suchi,M. TITLE Direct Submission JOURNAL Submitted (08-JUL-1993) to the DDBJ/EMBL/GenBank databases. Mariko Suchi, Nagoya City University Medical School, Pediatrics; 1 Kawasumi Mizuho-cho Mizuho-ku, Nagoya, Aichi 467, Japan (Tel:052-851-5511(ex.8200), Fax:052-842-3316) COMMENT Submitted (08-JUL-1993) to DDBJ by: Mariko Suchi Department of Pediatrics Nagoya City University Medical School 1 Kawasumi, Mizuho-cho Mizuho-ku, Nagoya Aichi 467 Japan Phone: 052-851-5511 x8200 Fax: 052-842-3316. FEATURES Location/Qualifiers source 1..3085 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="CLHL1115a" /tissue_type="liver" 5'UTR 1..243 CDS 244..2217 /EC_number="4.3.1.3" /codon_start=1 /product="histidase" /db_xref="PID:d1004565" /db_xref="PID:g451210" /translation="MPRYTVHVRGEWLAVPCQDAQLTVGWLGREAVRRYIKNKPDNGG FTSVDDAHFLVRRCKGLGLLDNEDRLEVALENNEFVEVVIEGDAMSPDFIPSQPEGVY LYSKYREPEKYIELDGDRLTTEDLVNLGKGRYKIKLTPTAEKRVQKSREVIDSIIKEK TVVYGITTGFGKFARTVIPINKLQELQVNLVRSHSSGVGKPLSPERCRMLLALRINVL AKGYSGISLETLKQVIEMFNASCLPYVPEKGTVGASGDLAPLSHLALGLVGEGKMWSP KSGWADAKYVLEAHGLKPVILKPKEGLALINGTQMITSLGCEAVERASAIARQADIVA ALTLEVLKGTTKAFDTDIHALRPHRGQIEVAFRFRSLLDSDHHPSEIAESHRFCDRVQ DAYTLRCCPQVHGVVNDTIAFVKNIITTELNSATDNPMVFANRGETVSGGNFHGEYPA KALDYLAIGIHELAAISERRIERLCNPSLSELPAFLVAEGGLNSGFMIAHCTAAALVS ENKALCHPSSVDSLSTSAATEDHVSMGGWAARKALRVIEHVEQVLAIELLAACQGIEF LRPLKTTTPLEKVYDLVRSVVRPWIKDRFMAPDIEAAHRLLLEQKVWEVAAPYIEKYR MEHIPESRPLSPTAFSLQFLHKKSTKIPESEDL" 3'UTR 2218..3085 BASE COUNT 810 a 730 c 749 g 796 t ORIGIN 1 agcagcaggt aggtgccatc agggacaaga acagcacctc ccagggtggg agaccccagg 61 cctttctggc agcaggtctg gatggaaagt ggacaggagg ctcacccgtc tgcatcccct 121 gctcctgccc ctgctcggct acaaaaacca aagggacagc agctgaccac accccggtag 181 ccactcctgc ataaagctct cccctcctgt gaccagctga ggacctcagg ctgcagcgga 241 gccatgccca gatacacggt gcacgtacgt ggggaatggc tggcagtgcc ctgccaggac 301 gcgcagctca ctgtgggctg gctgggccgg gaggccgtga ggcgctatat caagaataag 361 cccgacaatg gtggcttcac ctccgtggat gacgcgcact tccttgtgcg ccggtgcaag 421 ggcctgggcc tgctggacaa cgaggaccgg ctcgaggtgg ccctagagaa caacgagttc 481 gtggaagtgg ttatagaggg tgatgccatg tctcctgact tcattccatc tcaaccagaa 541 ggagtttatc tatacagcaa gtaccgggag cctgaaaagt acatcgagtt agatggagac 601 cgtctgacca cggaggatct ggtcaacttg ggaaagggac gctacaaaat aaagctcacc 661 ccaacagctg agaagagggt gcagaaatcc agggaggtca tagatagcat cataaaagag 721 aaaacagttg tttacggtat tactacaggt tttgggaaat ttgccagaac tgtaattcct 781 atcaataagc tacaggagct tcaggtcaac ttagtacgct cacattcttc aggtgttggg 841 aaaccactaa gtcctgagag gtgtcggatg ctcttggctt taaggatcaa tgtcttagcc 901 aaaggataca gtggcatttc cctggagacc ctcaaacaag tcatagaaat gtttaatgcc 961 tcctgcctgc cctatgtccc agagaaagga accgttggtg ccagtggaga ccttgcccca 1021 ctctctcatc ttgctcttgg gctagttgga gaagggaaga tgtggtctcc gaagagtggc 1081 tgggctgatg ctaaatacgt gctagaagcc catggattga aaccagttat tttaaaacca 1141 aaagagggcc tggcactcat caatgggacg cagatgatca catccctggg ctgtgaagct 1201 gtagagcgag ccagtgctat tgcacggcag gctgacattg tggcagccct gacccttgag 1261 gtgctgaagg gcaccaccaa agcctttgac actgacattc atgctcttcg acctcaccgt 1321 gggcaaattg aagttgcttt tcggtttcgg tcactcttgg actcagatca ccacccatca 1381 gaaatagcag agagtcacag gttctgtgat cgcgtccagg atgcatacac cttgcgctgc 1441 tgtccacagg tccatggtgt ggtgaatgat acaatagcat ttgtgaagaa catcattacc 1501 acagaactga acagcgcaac agataatcct atggtctttg ccaatagggg agagacagtt 1561 tctggaggaa acttccatgg tgaataccca gccaaagccc tagactactt ggccattggc 1621 atccatgaac ttgctgcaat cagtgagaga agaatcgagc ggctctgcaa tccctccctc 1681 agtgagctgc ctgccttcct ggtggctgaa ggtggtctga actctgggtt catgatagct 1741 cactgcacgg cagcagccct tgtttctgag aacaaggctc tgtgccatcc ctcgtctgtt 1801 gactccctct ccaccagcgc agccacggag gaccacgtct ccatgggagg atgggcagca 1861 aggaaagccc tcagggtcat cgagcatgtg gagcaagtgc tggccatcga gctccttgca 1921 gcctgccagg gcatagagtt tctacgtccc ctgaaaacaa ccactccgct ggagaaggtc 1981 tatgacctgg tgcgctctgt tgtaaggccc tggataaaag atcgcttcat ggccccggac 2041 atcgaggcag cccacaggct gctcctggag cagaaggttt gggaagtagc tgctccatac 2101 attgaaaaat acagaatgga gcatattcca gaatcaagac ctctttctcc aacagccttt 2161 tcactgcaat ttctgcacaa gaaatccacc aaaatcccgg agtctgagga cctttaatgg 2221 gctttgtcat gaagtagcag atgagagggc agtcagttta gcacaaagca atactaggct 2281 gaaggagaga cctgagaact ttcctaggta gatcaatcca ttgtatcatt cagttcttct 2341 aaagcctacg ttggttaggc tgatggcagt attatagttg ctaaattcag cactgtgttc 2401 ctgttgtcgt ggttcaagac ccaccaggta ttttcagatt ataaaacttt tctttctttc 2461 ttaacagttt caacaggcca ctcactctta agggtgagaa gaataaccac aattgtatgt 2521 gcctgttttt tactcttagc attagatgaa ttcaaatttg gaaacagatt gatagcaatt 2581 ttttctaaaa acattagact tttgttaacc tttttttttt tttttaaatt tgcttcaaca 2641 agctctccac cagttgactt tctttggcta attttacttt gcatgatatg ccttaatatg 2701 ccttcataaa taaccatttt aagtcataat ttgtccttaa gctgcttttt tcttctatta 2761 attggatcat agtaaagagt agtcaatagg gtcttcagct attaattgta gaggtgatta 2821 aaaccaacaa ggagtttcat gtgcaaagga gataaggaat gaatataaag attgctattt 2881 gggtggctct tattaaactg tgtattttgt acttatcact acacgtatcc cccaaatgct 2941 tacatgggag tttgaggtta gtattttcac ttccttggtg ttagtactct attcacattc 3001 ttattgtaac cttcctcatt tcacagataa ggaatctttg gggattaacc aacctccttt 3061 ctgtaatggt aatcattaaa ataag // LOCUS HUMHISH2R 1191 bp DNA PRI 31-DEC-1994 DEFINITION Human histamine H2 receptor gene, complete cds. ACCESSION M64799 NID g184087 KEYWORDS histamine H2 receptor. SOURCE Homo sapiens (tissue library: Clontech) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1191) AUTHORS Gantz,I., Munzert,G., Tashiro,T., Schaffer,M., Wang,L., DelValle,J. and Yamada,T. TITLE Molecular cloning of the human histamine H2 receptor JOURNAL Biochem. Biophys. Res. Commun. 178 (3), 1386-1392 (1991) MEDLINE 91337087 FEATURES Location/Qualifiers source 1..1191 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_lib="Clontech" gene 1..1080 /gene="histamine H2 receptor" CDS 1..1080 /gene="histamine H2 receptor" /codon_start=1 /product="histamine H2 receptor" /db_xref="PID:g184088" /translation="MAPNGTASSFCLDSTACKITITVVLAVLILITVAGNVVVCLAVG LNRRLRNLTNCFIVSLAITDLLLGLLVLPFSAIYQLSCKWSFGKVFCNIYTSLDVMLC TASILNLFMISLDRYCAVMDPLRYPVLVTPVRVAISLVLIWVISITLSFLSIHLGWNS RNETSKGNHTTSKCKVQVNEVYGLVDGLVTFYLPLLIMCITYYRIFKVARDQAKRINH ISSWKAATIREHKATVTLAAVMGAFIICWFPYFTAFVYRGLRGDDAINEVLEAIVLWL GYANSALNPILYAALNRDFRTGYQQLFCCRLANRNSHKTSLRSNASQLSRTQSREPRQ QEEKPLKLQVWSGTEVTAPQGATDR" BASE COUNT 250 a 377 c 302 g 262 t ORIGIN 1 atggcaccca atggcacagc ctcttccttt tgcctggact ctaccgcatg caagatcacc 61 atcaccgtgg tccttgcggt cctcatcctc atcaccgttg ctggcaatgt ggtcgtctgt 121 ctggccgtgg gcttgaaccg ccggctccgc aacctgacca attgtttcat cgtgtccttg 181 gctatcactg acctgctcct cggcctcctg gtgctgccct tctctgccat ctaccagctg 241 tcctgcaagt ggagctttgg caaggtcttc tgcaatatct acaccagcct ggatgtgatg 301 ctctgcacag cctccattct taacctcttc atgatcagcc tcgaccggta ctgcgctgtc 361 atggacccac tgcggtaccc tgtgctggtc accccagttc gggtcgccat ctctctggtc 421 ttaatttggg tcatctccat taccctgtcc tttctgtcta tccacctggg gtggaacagc 481 aggaacgaga ccagcaaggg caatcatacc acctctaagt gcaaagtcca ggtcaatgaa 541 gtgtacgggc tggtggatgg gctggtcacc ttctacctcc cgctactgat catgtgcatc 601 acctactacc gcatcttcaa ggtcgcccgg gatcaggcca agaggatcaa tcacattagc 661 tcctggaagg cagccaccat cagggagcac aaagccacag tgacactggc cgccgtcatg 721 ggggccttca tcatctgctg gtttccctac ttcaccgcgt ttgtgtaccg tgggctgaga 781 ggggatgatg ccatcaatga ggtgttagaa gccatcgttc tgtggctggg ctatgccaac 841 tcagccctga accccatcct gtatgctgcg ctgaacagag acttccgcac cgggtaccaa 901 cagctcttct gctgcaggct ggccaaccgc aactcccaca aaacttctct gaggtccaac 961 gcctctcagc tgtccaggac ccaaagccga gaacccaggc aacaggaaga gaaacccctg 1021 aagctccagg tgtggagtgg gacagaagtc acggcccccc agggagccac agacaggtaa 1081 aagctccagg tgtggagtgg gacagaagtc acggcccccc agggagccac agacaggtaa 1141 gcgctgaaca gagacttccg caccgggtac caacagctct tctgctgcag g // LOCUS HUMHISH3B 1039 bp mRNA PRI 08-NOV-1994 DEFINITION Human H3.3 histone, class B mRNA, complete cds. ACCESSION M11354 NID g184090 KEYWORDS histone; histone H3. SOURCE Human cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1039) AUTHORS Wells,D. and Kedes,L. TITLE Structure of a human histone cDNA: evidence that basally expressed histone genes have intervening sequences and encode polyadenylylated mRNAs JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82 (9), 2834-2838 (1985) MEDLINE 85190590 FEATURES Location/Qualifiers source 1..1039 /organism="Homo sapiens" /db_xref="taxon:9606" /map="1q21" gene 109..519 /gene="H3F2" CDS 109..519 /gene="H3F2" /note="H3.3 histone" /codon_start=1 /db_xref="GDB:G00-120-031" /db_xref="PID:g306848" /translation="MARTKQTARKSTGGKAPRKQLATKAARKSAPSTGGVKKPHRYRP GTVALREIRRYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSAAIGALQEASEAYLV GLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA" BASE COUNT 291 a 223 c 225 g 300 t ORIGIN 1 tgttcgcagc cgccgccgcg ccgccgtcgc tctccaacgc cagcgccgcc tctcgctcgc 61 gtaagtaagg aggtctctgc gagctccagc cgaagagaag ggggtaccat ggctcgtaca 121 aagcagactg cccgcaaatc gaccggtggt aaagcaccca ggaagcaact ggctacaaaa 181 gccgctcgca agagtgcgcc ctctactgga ggggtgaaga aacctcatcg ttacaggcct 241 ggtactgtgg cgctccgtga aattagacgt tatcagaagt ccactgaact tctgattcgc 301 aaacttccct tccagcgtct ggtgcgagaa attgctcagg actttaaaac agatctgcgc 361 ttccagagcg cagctatcgg tgctttgcag gaggcaagtg aggcctatct ggttggcctt 421 tttgaagaca ccaacctgtg tgctatccat gccaaacgtg taacaattat gccaaaagac 481 atccagctag cacgccgcat acgtggagaa cgtgcttaag aatccactat gatgggaaac 541 atttcattct caaaaaaaaa aaaaaaattt ctcttcttcc tgttattggt agttctgaac 601 gttagatatt ttttttccat ggggtcaaag gtacctaagt atatgattgc gagtggaaaa 661 ataggggaca gaaatcaggt attggcagtt tttccatttt catttgtgtg tgaattttta 721 atataaatgc ggagacgtaa agcattaatg caagttaaaa tgtttcagtg aacaagtttc 781 agcggttcaa ctttataata attataaata aacctgttaa atttttctgg acaatgccag 841 catttggatt tctttaaaac aagtaaattt cttattgatg gcaactaaat ggtgtttgta 901 gcatttttat catacagtag attccatcca ttcactatac ttttctaact gagttgtcct 961 acatgcaagt acatgttttt aatgttgtct gtcttctgtg ctgttcctgt aagtttgcta 1021 ttaaaataca ttaaactat // LOCUS HUMHK1A 4134 bp mRNA PRI 15-NOV-1994 DEFINITION Homo sapiens calcium-ATPase (HK1) mRNA, complete cds. ACCESSION M23114 J04025 NID g184100 KEYWORDS ATPase; Ca2+-ATPase; alternative splicing; calcium-ATPase. SOURCE Human kidney cortex, cDNA to mRNA, clones lambda HK[1a,1b]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4134) AUTHORS Lytton,J. and MacLennan,D.H. TITLE Molecular cloning of cDNAs from human kidney coding for two alternatively spliced products of the cardiac Ca2+-ATPase gene JOURNAL J. Biol. Chem. 263 (29), 15024-15031 (1988) MEDLINE 89008384 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Lytton, 14-MAR-1989. Two alternative splicing products, HK1 and HK2, are realized in human kidney cDNAs. HK2 codes for a protein identical to rabbit cardiac muscle Ca2+ ATPase, with the exception of 6 scattered amino acid replacements, whereas HK1 codes for a protein identical to that encoded by HK2, but with the carboxyl-terminal 4 amino acids replaced by an extended sequence of 49 amino acids. See accession M23115 and J04703. FEATURES Location/Qualifiers source 1..4134 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA 1..4134 /note="calcium-ATPase mRNA" gene 164..3292 /gene="HK1" CDS 164..3292 /gene="HK1" /EC_number="3.6.1.3" /codon_start=1 /db_xref="GDB:G00-120-044" /db_xref="PID:g306850" /translation="MENAHTKTVEEVLGHFGVNESTGLSLEQVKKLKERWGSNELPAE EGKTLLELVIEQFEDLLVRILLLAACISFVLAWFEEGEETITAFVEPFVILLILVANA IVGVWQERNAENAIEALKEYEPEMGKVYRQDRKSVQRIKAKDIVPGDIVEIAVGDKVP ADIRLTSIKSTTLRVDQSILTGESVSVIKHTDPVPDPRAVNQDKKNMLFSGTNIAAGK AMGVVVATGVNTEIGKIRDEMVATEQERTPLQQKLDEFGEQLSKVISLICIAVWIINI GHFNDPVHGGSWIRGAIYYFKIAVALAVAAIPEGLPAVITTCLALGTRRMAKKNAIVR SLPSVETLGCTSVICSDKTGTLTTNQMSVCRMFILDRVEGDTCSLNEFTITGSTYAPI GEVHKDDKPVNCHQYDGLVELATICALCNDSALDYNEAKGVYEKVGEATETALTCLVE KMNVFDTELKGLSKIERANACNSVIKQLMKKEFTLEFSRDRKSMSVYCTPNKPSRTSM SKMFVKGAPEGVIDRCTHIRVGSTKVPMTSGVKQKIMSVIREWGSGSDTLRCLALATH DNPLRREEMHLEDSANFIKYETNLTFVGCVGMLDPPRIEVASSVKLCRQAGIRVIMIT GDNKGTAVAICRRIGIFGQDEDVTSKAFTGREFDELNPSAQRDACLNARCFARVEPSH KSKIVEFLQSFDEITAMTGDGVNDAPALKKAEIGIAMGSGTAVAKTASEMVLADDNFS TIVAAVEEGRAIYNNMKQFIRYLISSNVGEVVCIFLTAALGFPEALIPVQLLWVNLVT DGLPATALGFNPPDLDIMNKPPRNPKEPLISGWLFFRYLAIGCYVGAATVGAAAWWFI AADGGPRVSFYQLSHFLQCKEDNPDFEGVDCAIFESPYPMTMALSVLVTIEMCNALNS LSENQSLLRMPPWENIWLVGSICLSMSLHFLILYVEPLPLIFQITPLNVTQWLMVLKI SLPVILMDETLKFVARNYLEPGKECVQPATKSCSFSACTDGISWPFVLLIMPLVIWVY STDTNFSDMFWS" misc_feature 3143..3144 /gene="HK1" /note="alternative splice site" BASE COUNT 1056 a 938 c 1012 g 1128 t ORIGIN 254 bp upstream of HindIII site. 1 gggtgattca gcgcccggcg aggcggaacg ggccgcaaga ggaggagggg agagcccgtc 61 cgcgcctggg ctcccggggt ggcacgagcc cgcggccgga gtgcgaggcg gaggcgagga 121 ggccgcgggg acgggaggcg aggccggccg ggcccccgaa gccatggaga acgcgcacac 181 caagacggtg gaggaggtgc tgggccactt cggcgtcaac gagagtacgg ggctgagcct 241 ggaacaggtc aagaagctta aggagagatg gggctccaac gagttaccgg ctgaagaagg 301 aaaaaccttg ctggaacttg tgattgagca gtttgaagac ttgctagtta ggattttatt 361 actggcagca tgtatatctt ttgttttggc ttggtttgaa gaaggtgaag aaacaattac 421 agcctttgta gaaccttttg taattttact catattagta gccaatgcaa ttgtgggtgt 481 atggcaggaa agaaatgctg aaaatgccat cgaagccctt aaggaatatg agcctgaaat 541 gggcaaagtg tatcgacagg acagaaagag tgtgcagcgg attaaagcta aagacatagt 601 tcctggtgat attgtagaaa ttgctgttgg tgacaaagtt cctgctgata taaggttaac 661 ttccatcaaa tctaccacac taagagttga ccagtcaatt ctcacaggtg aatctgtctc 721 tgtcatcaag cacactgatc ccgtccctga cccacgagct gtcaaccaag ataaaaagaa 781 catgctgttt tctggtacaa acattgctgc tgggaaagct atgggagtgg tggtagcaac 841 tggagttaac accgaaattg gcaagatccg ggatgaaatg gtggcaacag aacaggagag 901 aacacccctt cagcaaaaac tagatgaatt tggggaacag ctttccaaag tcatctccct 961 tatttgcatt gcagtctgga tcataaatat tgggcacttc aatgacccgg ttcatggagg 1021 gtcctggatc agaggtgcta tttactactt taaaattgca gtggccctgg ctgtagcagc 1081 cattcctgaa ggtctgcctg cagtcatcac cacctgcctg gctcttggaa ctcgcagaat 1141 ggcaaagaaa aatgccattg ttcgaagcct cccgtctgtg gaaacccttg gttgtacttc 1201 tgttatctgc tcagacaaga ctggtacact tacaacaaac cagatgtcag tctgcaggat 1261 gttcattctg gacagagtgg aaggtgatac ttgttccctt aatgagttta ccataactgg 1321 atcaacttat gcacctattg gagaagtgca taaagatgat aaaccagtga attgtcacca 1381 gtatgatggt ctggtagaat tagcaacaat ttgtgctctt tgtaatgact ctgctttgga 1441 ttacaatgag gcaaagggtg tgtatgaaaa agttggagaa gctacagaga ctgctctcac 1501 ttgcctagta gagaagatga atgtatttga taccgaattg aagggtcttt ctaaaataga 1561 acgtgcaaat gcctgcaact cagtcattaa acagctgatg aaaaaggaat tcactctaga 1621 gttttcacgt gacagaaagt caatgtcggt ttactgtaca ccaaataaac caagcaggac 1681 atcaatgagc aagatgtttg tgaagggtgc tcctgaaggt gtcattgaca ggtgcaccca 1741 cattcgagtt ggaagtacta aggttcctat gacctctgga gtcaaacaga agatcatgtc 1801 tgtcattcga gagtggggta gtggcagcga cacactgcga tgcctggccc tggccactca 1861 tgacaaccca ctgagaagag aagaaatgca ccttgaggac tctgccaact ttattaaata 1921 tgagaccaat ctgaccttcg ttggctgcgt gggcatgctg gatcctccga gaatcgaggt 1981 ggcctcctcc gtgaagctgt gccggcaagc aggcatccgg gtcatcatga tcactgggga 2041 caacaagggc actgctgtgg ccatctgtcg ccgcatcggc atcttcgggc aggatgagga 2101 cgtgacgtca aaagctttca caggccggga gtttgatgaa ctcaacccct ccgcccagcg 2161 agacgcctgc ctgaacgccc gctgttttgc tcgagttgaa ccctcccaca agtctaaaat 2221 cgtagaattt cttcagtctt ttgatgagat tacagctatg actggcgatg gcgtgaacga 2281 tgctcctgct ctgaagaaag ccgagattgg cattgctatg ggctctggca ctgcggtggc 2341 taaaaccgcc tctgagatgg tcctggcgga tgacaacttc tccaccattg tggctgccgt 2401 tgaggagggg cgggcaatct acaacaacat gaaacagttc atccgctacc tcatctcgtc 2461 caacgtcggg gaagttgtct gtattttcct gacagcagcc cttggatttc ccgaggcttt 2521 gattcctgtt cagctgctct gggtcaatct ggtgacagat ggcctgcctg ccactgcact 2581 ggggttcaac cctcctgatc tggacatcat gaataaacct ccccggaacc caaaggaacc 2641 attgatcagc gggtggctct ttttccgtta cttggctatt ggctgttacg tcggcgctgc 2701 taccgtgggt gctgctgcat ggtggttcat tgctgctgac ggtggtccaa gagtgtcctt 2761 ctaccagctg agtcatttcc tacagtgtaa agaggacaac ccggactttg aaggcgtgga 2821 ttgtgcaatc tttgaatccc catacccgat gacaatggcg ctctctgttc tagtaactat 2881 agaaatgtgt aacgccctca acagcttgtc cgaaaaccag tccttgctga ggatgccccc 2941 ctgggagaac atctggctcg tgggctccat ctgcctgtcc atgtcactcc acttcctgat 3001 cctctatgtc gaacccttgc cactcatctt ccagatcaca ccgctgaacg tgacccagtg 3061 gctgatggtg ctgaaaatct ccttgcccgt gattctcatg gatgagacgc tcaagtttgt 3121 ggcccgcaac tacctggaac ctggtaaaga gtgtgtgcag cctgccacca aatcctgctc 3181 gttctcggca tgcaccgatg ggatttcctg gccgtttgtg ctgctcataa tgcccctggt 3241 gatctgggtc tatagcacag acactaactt tagcgatatg ttctggtctt gactgacagt 3301 tttccataaa gaagatgttt aacttaatca attaattttt ttattgttta aagcaactgt 3361 ctatttctgc tgaattttca catgaacata ctggctggtg atggaggttt catactctag 3421 attttgtttt gctttttctg actccagtgg ggcaagattt tcctttttta tacacataat 3481 taaagtgtcc attgacatgt acagagaact aacactattt tatgcaaata tttttttgta 3541 gatgaaaaag catgtacagt gttctgttta atactcatcc ttgtataaaa aaaatagttg 3601 agccagcaga cattgtcagc aaattaattg gcagcagatt ttaggaaatg aatgtgtgtg 3661 gttttttttc taaaactaaa tagcatgtat tgtgtctttt gcatgatgat ccggatttaa 3721 tttgatatca cagtctaatt tttattcata agccaatttt tctgcactga gcagagtctt 3781 gctacctcag tcagtattgt tttggtttgc tacttccctc acccactttg gcctccgttc 3841 accccacccc accccacctc tccccacctt acccccgccc cgcttggctt cttctttagg 3901 attgtgatgg ttcgttctgt ttacatcagt tttaacgaga ggtatgcctg tactcgcttg 3961 tgcagaaaac attgttccag attcaatcga ctgggtttat gtcccttcac atagttttta 4021 aggttattta tttaaatgtc taatgtattt tattgtaaca gacattgttt tgccaacatt 4081 gcctatttca gtggcacgtc atctagtttt aaaaaaataa aacattttaa aaag // LOCUS HUMHKATPB 1407 bp mRNA PRI 13-NOV-1991 DEFINITION Human H,K-ATPase beta subunit mRNA, complete cds. ACCESSION M75110 NID g184104 KEYWORDS H+/K+-ATPase beta subunit. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1407) AUTHORS Ma,J.-Y., Song,Y.-h., Sjostrand,S.E., Rask,L. and Mardh,S. TITLE cDNA cloning of the beta-subunit of the human gastric H,K-ATPase JOURNAL Biochem. Biophys. Res. Commun. 180, 39-45 (1991) MEDLINE 92028970 FEATURES Location/Qualifiers source 1..1407 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 18..893 /codon_start=1 /product="H,K-ATPase beta subunit" /db_xref="PID:g184105" /translation="MAALQEKKTCGQRMEEFQRYCWNPDTGQMLGRTLSRWVWISLYY VAFYVVMTGLFALCLYVLMQTVDPYTPDYQDQLRSPGVTLRPDVYGEKGLEIVYNVSD NRTWADLTQTLHAFLAGYSPAAQEDSINCTSEQYFFQESFRAPNHTKFSCKFTADMLQ NCSGLADPNFGFEEGKPCFIIKMNRIVKFLPSNGSAPRVDCAFLDQPRELGQPLQVKY YPPNGTFSLHYFPYYGKKAQPHYSNPLVAAKLLNIPRNAEVAIVCKVMAEHVTFNNPH DPYEGKVEFKLKIEK" BASE COUNT 361 a 394 c 331 g 321 t ORIGIN 1 atctcaggcc agggacgatg gcggctctgc aggagaagaa gacgtgtggc cagcgcatgg 61 aggagttcca gcgttactgc tggaacccgg acacggggca gatgctgggc cgcaccctgt 121 cccggtgggt gtggatcagc ctgtactacg tggccttcta cgtggtgatg actgggctct 181 tcgccctgtg cctctatgtg ctgatgcaga cagtggaccc gtacacaccg gactaccaag 241 accagctacg gtcaccaggg gtaaccttaa ggccggatgt ttacggggag aaaggcctgg 301 aaattgtcta caacgtctct gataacagaa cctgggcaga cctcacacag actctccacg 361 ccttcctagc aggctactct ccagcagccc aggaggacag catcaactgc acctccgagc 421 agtacttctt ccaggagagt ttccgcgctc ccaaccacac caagttctcc tgcaagttca 481 cggcagatat gctgcagaac tgctcaggcc tggcggatcc caacttcggc tttgaagaag 541 gaaagccatg ttttattatt aaaatgaaca ggatcgtcaa gttcctcccc agcaacggct 601 cggcccccag agtggactgc gccttcctgg accagccccg cgagctcggc cagccgctgc 661 aggtcaagta ctaccctccc aacggcacct tcagtctgca ctacttccct tattacggga 721 agaaagccca gccccactac agcaaccccc tggtggcagc gaagctcctc aacatcccca 781 ggaacgctga ggtcgccatc gtgtgcaagg tcatggcaga gcacgtgacc ttcaacaatc 841 cccacgaccc gtatgaaggg aaagtggagt tcaaactcaa gattgagaag tgaaacgttt 901 gcgcaggggt cctgggcacg cctgcggggt cgctcaagga caccctcctg gttgggctta 961 ccttgcccgt cagttccctg ccaaatcatc cccaaagtgg tttggagcaa cggtgttgtc 1021 agtgtgcgaa ctccagagaa gcgcccacat ctgaaggacc tgctcgcgag tatcagttct 1081 tccttgttga attcttacag tttttagatg gaatttgctg ctataagaat gtccagctac 1141 catgggaacg caaggcagca actctctaat taaccaggtc ataaaaacga ttcgtcttct 1201 atgtagacat cactttctta ctataattta tttttctaca cttcaatatg aactgccccc 1261 cccacattaa tataaaaact actaatgcac tgatatgaaa cacggcttac actaatgaca 1321 ttctgaattc ttgcttttaa aattgcaatt cctaagttgt aaacataaaa tatattaaag 1381 ttactcttat tgtatgtaaa aaaaaaa // LOCUS HUMHLGP85 2329 bp mRNA PRI 23-OCT-1992 DEFINITION Human mRNA for lysosomal sialoglycoprotein, complete cds. ACCESSION D12676 NID g219702 KEYWORDS lysosomal membrane sialoglycoprotein; transmembrane protein. SOURCE Homo sapiens pancreas islet tumor cell line QGP-1NL cDNA to mRNA, clone hLGP-1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2329) AUTHORS Fujita,H., Takata,Y., Kono,A., Tanaka,Y., Takahashi,T., Himeno,M. and Kato,K. TITLE Isolation and sequencing of a cDNA clone encoding the 85kDa human lysosomal sialoglycoprotein (hLGP85) in human metastatic pancreas islet tumor cells JOURNAL Biochemical and Biophysical Research Communication 184, 604-611 (1992) REFERENCE 2 (bases 1 to 2329) AUTHORS Fujita,H. TITLE Direct Submission JOURNAL Submitted (20-JUL-1992) to the DDBJ/EMBL/GenBank databases. Hideaki Fujita, Faculty of Pharmaceutical Sciences, Kyusyu University; Maidashi 3-1-1, Higashi-ku, Fukuoka 812, Japan (Tel:092-641-1151(ex.6167), Fax:092-641-8154) COMMENT Submitted (20-JUL-1992) to DDBJ by: Hideaki Fujita Faculty of Pharmaceutical Sciences Kyushu University 3-1-1 Maidashi Higashi-ku, Fukuoka Fukuoka 812 Japan Phone: 092-641-1151 x6167 Fax: 092-641-8154. FEATURES Location/Qualifiers source 1..2329 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="QGP-1NL" /cell_type="islet tumor cells" CDS 252..1688 /codon_start=1 /product="85kDa human lysosomal sialoglycoprotein" /db_xref="PID:d1002667" /db_xref="PID:g219703" /translation="MGRCCFYTAGTLSLLLLVTSVTLLVARVFQKAVDQSIEKKIVLR NGTEAFDSWEKPPLPVYTQFYFFNVTNPEEILRGETPRVEEVGPYTYRELRNKANIQF GDNGTTISAVSNKAYVFERDQSVGDPKIDLIRTLNIPVLTVIEWSQVHFLREIIEAML KAYQQKLFVTHTVDELLWGYKDEILSLIHVFRPDISPYFGLFYEKNGTNDGDYVFLTG EDSYLNFTKIVEWNGKTSLDWWITDKCNMINGTDGDSFHPLITKDEVLYVFPSDFCRS VYITFSDYESVQGLPAFRYKVPAEILANTSDNAGFCIPEGNCLGSGVLNVSICKNGAP IIMSFPHFYQADERFVSAIEGMHPNQEDHETFVDINPLTGIILKAAKRFQINIYVKKL DDFVETGDIRTMVFPVMYLNESVHIDKETASRLKSMINTTLIITNIPYIIMALGVFFG LVFTWLACKGQGSMDEGTADERAPLIRT" mat_peptide 255..1685 /product="85kDa human lysosomal sialoglycoprotein" polyA_signal 2280..2285 BASE COUNT 617 a 524 c 573 g 614 t 1 others ORIGIN 1 cacggctgcc cggcgaagga aaccgaaacc gagtccgggc ccgtccctcc gcggccccat 61 ccgcccggtg cacccggggc cgcgctcgcc aggccgcgga gccagagctg cgcgcacgaa 121 ccgtgcgcgg gagggcgtgg gcgttgcgcc gaagggtccc gagtcttcga cgcctctgcg 181 gcggctcctc cctccttgca gttggatccc tggcgggtgc ggcccggccc ggcccgtgag 241 cngcgcacag aatgggccga tgctgcttct acacggcggg gacgttgtcc ctgctcctgc 301 tggtgaccag cgtcacgctg ctggtggccc gggtcttcca gaaggctgta gaccagagta 361 tcgagaagaa aattgtgtta aggaatggta ctgaggcatt tgactcctgg gagaagcccc 421 ctctgcctgt gtatactcag ttctatttct tcaatgtcac caatccagag gagatcctca 481 gaggggagac ccctcgggtg gaagaagtgg ggccatacac ctacagggaa ctcagaaaca 541 aagcaaatat tcaatttgga gataatggaa caacaatatc tgctgttagc aacaaggcct 601 atgtttttga acgagaccaa tctgttggag accctaaaat tgacttaatt agaacattaa 661 atattcctgt attgactgtc atagagtggt cccaggtgca cttcctcagg gagatcatcg 721 aggccatgtt gaaagcctat cagcagaagc tctttgtgac tcacacagtt gacgaattgc 781 tctggggcta caaagatgaa atcttgtccc ttatccatgt tttcaggccc gatatctctc 841 cctattttgg cctattctat gagaaaaatg ggactaatga tggagactat gtttttctaa 901 ctggagaaga cagttacctt aactttacaa aaattgtgga atggaatggg aaaacgtcac 961 ttgactggtg gataacagac aagtgcaata tgattaatgg aacagatgga gattcttttc 1021 acccactaat aaccaaagat gaggtccttt atgtcttccc atctgacttt tgcaggtcag 1081 tgtatattac tttcagtgac tatgagagtg tacagggact gcctgccttt cggtataaag 1141 ttcctgcaga aatattagcc aatacgtcag acaatgccgg cttctgtata cctgagggaa 1201 actgcctggg ctcaggagtt ctgaatgtca gcatctgcaa gaatggtgca cccatcatta 1261 tgtctttccc acacttttac caagcagatg agaggtttgt ttctgccata gaaggcatgc 1321 acccaaatca ggaagaccat gagacatttg tggacattaa tcctttgact ggaataatcc 1381 taaaagcagc caagaggttc caaatcaaca tttatgtcaa aaaattagat gactttgttg 1441 aaacgggaga cattagaacc atggttttcc cagtgatgta cctcaatgag agtgttcaca 1501 ttgataaaga gacggcgagt cgactgaagt ctatgattaa cactactttg atcatcacca 1561 acatacccta catcatcatg gcgctgggtg tgttctttgg tttggttttt acctggcttg 1621 catgcaaagg acagggatcc atggatgagg gaacagcgga tgaaagagca cccctcattc 1681 gaacctaaac attgcctttg cttggtgaag aaactgtgtg agctgtcctg acctggacga 1741 tgacgtgggg aaaccctcca cctccttgca ggcttgttgc ctgttgaaag aaggaaaaag 1801 acacggcgct ggcaagtgat aggaacattc tggccagagg ttaaagagca ggctgacatg 1861 gctggccatt aagctttata aaatcatgtg ggctctgaaa ttgttctttt atgtgtctag 1921 caagtattta ataaaccctt gtatagtaat tttgttgttg ttgggtgctg gtagctccag 1981 aattttgtga ccactattgt gggtaaaatg tctctgcatc acttgttaat gctactggtc 2041 taacttcatt cagtatgctt cattcaccga actttgtgct caaaatgcgt atataccatt 2101 ttatgttgta ttcctccatt tcacttgcaa aacagaagta aataagagtt cgggacccag 2161 ggtaaaatgg tagcttcatc caatatatca ttcaaatgca tctgatttct aaaccatatt 2221 acattttatg ctgatcttca gttcataatt cttccaggaa aactcagtct tccaactgca 2281 ataaaatact ggggtaggaa tcaaatggga aagggggggg gggggggcc // LOCUS HUMHLGS 2373 bp DNA PRI 24-MAY-1996 DEFINITION Human gene for liver glycogen synthase, complete cds. ACCESSION D29685 NID g517111 KEYWORDS liver glycogen synthase. SOURCE Homo sapiens (strain caucasian) female liver DNA, clone HLGS. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2373) AUTHORS Nakabayashi,H. and Nakayama,T. TITLE Human liver glycogen synthase cDNA JOURNAL Unpublished (1994) REFERENCE 2 (bases 1 to 2373) AUTHORS Nakabayashi,H. TITLE Direct Submission JOURNAL Submitted (28-MAR-1994) to the DDBJ/EMBL/GenBank databases. Hiroki Nakabayashi, Nihon University School of Medicine, Medical Reserch Institute; Oyaguchikami-machi, Itabashi-ku, Tokyo 173, Japan (Tel:03-3972-8111(ex.2330), Fax:03-3972-8830) COMMENT Submitted (28-Mar-1994) to DDBJ by: Hiroki Nakabayashi Medical Research Institute Nihon Unviersity School of Medicine Oyaguchikami-Machi, Itabashi-ku Tokyo 173 Japan Phone: 03-3972-8111 x23301 Fax: 03-3972-8830. FEATURES Location/Qualifiers source 1..2373 /organism="Homo sapiens" /strain="caucasian" /db_xref="taxon:9606" /sex="female" /tissue_type="liver" gene 255..2369 /gene="human liver glycogen synthase gene" CDS 255..2369 /gene="human liver glycogen synthase gene" /standard_name="HLGS" /EC_number="2.4.1.11" /codon_start=1 /product="human liver glycogen synthase" /db_xref="PID:d1006716" /db_xref="PID:g517112" /translation="MLRGRSLSVTSLGGLPQWEVEELPVEELLLFEVAWEVTNKVGGI YTVIQTKAKTTADEWGENYFLIGPYFEHNMKTQVEQCEPVNDAVRRAVDAMNMHGCQV HFGRWLIEGSPYVVLFDIGYSAWNLDRWKGDLWEACSVGIPYHDREANDMLIFGSLTA WFLKEVTDHADGKYVVARFHEWQAGVGLILSRARKLPIATIFTTHATLLGRYLCAANI DFYNHLDKFNIDKEAGERQIYHRYCMERASVHCAHVFTTVSEITAIEAEHMLKRKPDV VTPNGLNVKKFSAVHEFQNLHAMYKARIQDFVRGHFYGHLDFDLEKTLFLFIAGRYEF FKTKGADIFLDSLSRLNFLLRMHKSDITVVVFFIMPAKTNNFNVETLKGQAVRKQLWD VAHSVKEKFGKKLYDALLRGEIPDLNDILDRDDLTIMKRAIFSTQRQSLAPVTTHNMI DDSTDPILSTIRRIGLFNNRTDRVKVILHPEFLSSTSPLLPMDYEEFVRGCHLGVFPS YYEPWGYTPAECTVMGIPSVTTNLSGFGCFMQEHVADPTAYGIYIVDRRFRSPDDSCN QLTKFLYGFCNMSRRQRFIQRNRTERLSDLLDWRYLGRYYQHARHLTLSRAFPDKFHV ELTSPPTTEGFKYPRPSSVPPSPSGSQASSPQSSDVEDEVEDERYDEEEEAERDRLNI KSPFSLSHVPHGKKKLHGEYKN" BASE COUNT 688 a 502 c 552 g 631 t ORIGIN Chromosome 12. 1 agatactgac agggcagata ccgtcctcac aatacctgcc cagaaagacg agaaagagga 61 ggaagaattc ctccttccac caggaattct gtgggaagca cataagattt catgctacta 121 gtttattccc aagagaagct accaaagcct ggtaactcta ccaactctaa cttttgtgcc 181 tgtaagttct cttctcctgg gattacaact aattgaaaca ggaatcaaag gagtctcggt 241 ggactgtaag aagaatgctt cgaggccgat ccctctctgt aacatccctg ggtgggcttc 301 cccagtggga agtcgaagaa cttcctgtgg aggagttact gctctttgaa gttgcttggg 361 aagtgaccaa taaagttgga ggcatctata ctgtgattca gacaaaggcc aaaacaacag 421 cagatgaatg gggagagaac tattttctga taggtccata ttttgagcat aatatgaaga 481 ctcaggtgga acagtgtgaa cctgtaaatg atgctgtcag aagagcagtg gacgcaatga 541 atatgcatgg ctgccaggtg cattttggaa gatggctgat agaaggaagt ccttatgtgg 601 tactttttga cataggctat tcagcttgga atctggacag gtggaagggt gacctctggg 661 aagcatgcag tgtcggcatt ccttatcatg accgagaagc caatgatatg ctgatatttg 721 gatctttaac tgcctggttc ttaaaagaag tgacagatca tgcagatggt aaatatgtcg 781 ttgcccggtt ccatgaatgg caggctggag ttggactgat cctttctcga gccaggaaac 841 ttcctattgc cacaatattt acaacccacg ctacactact tgggaggtat ctctgtgcag 901 caaatattga tttctacaac catcttgata agtttaacat tgacaaagag gctggggaaa 961 ggcagattta ccaccggtac tgcatggagc gagcttccgt tcattgcgct cacgtgttca 1021 ccacggtttc tgaaataaca gcaatagaag ctgaacatat gctgaagaga aagcctgatg 1081 tagttactcc aaacggcttg aatgttaaga aattttcagc agtgcatgag tttcaaaatc 1141 tacatgccat gtacaaggcc agaatccaag attttgttcg aggtcatttc tatggtcatc 1201 tcgactttga tcttgaaaag actttgttcc ttttcattgc tgggaggtat gagtttttca 1261 aaacaaaagg agctgacatc ttcctagatt ccttatccag gctaaatttc ctgctgagga 1321 tgcataaaag tgacatcaca gtggtggtgt ttttcattat gcctgccaag acaaataatt 1381 tcaacgtgga aaccctgaaa ggacaagcag tgcgaaaaca gctgtgggat gttgcacatt 1441 ctgtgaagga aaagtttgga aaaaaactct atgatgcatt attaagagga gaaattcctg 1501 acctgaacga tattttagat cgagatgatc taacaattat gaaaagagcc atcttttcaa 1561 ctcagcgaca gtcattagcc ccagtgacca cgcacaacat gattgatgac tccaccgacc 1621 ccatcctcag caccattaga cggattggac ttttcaacaa ccgcacagat agagtcaagg 1681 tgattttgca cccagagttt ctatcctcca ccagtccctt actacccatg gactatgaag 1741 agtttgttag aggttgtcat cttggagtat ttccatcata ctatgaaccc tggggttata 1801 ctccagctga atgcactgtg atgggtatcc ccagtgtgac cacgaatctc tccgggtttg 1861 gctgtttcat gcaggagcac gtggctgatc ctactgctta cggtatttac atcgttgaca 1921 ggcggttccg ttctccagat gattcttgca atcagctgac taagtttctc tatggatttt 1981 gcaacatgtc acgccgccaa aggtttatcc agaggaacag aactgagagg ctctcagatc 2041 ttctggattg gagatactta ggcagatatt accagcatgc cagacacctg acattaagca 2101 gagcttttcc agataaattc catgtggaac taacatcacc accaacgaca gaaggattta 2161 aatatcccag gccttcctca gtaccacctt ctccttcagg gtctcaggcc tccagtcctc 2221 agagcagtga tgtggaagat gaagtggagg atgagagata cgatgaggaa gaggaggctg 2281 aaagggatcg gttaaatatc aagtcaccat tttcactgag ccacgttcct catgggaaga 2341 aaaagctgca tggtgaatat aagaactgaa ttc // LOCUS HUMHM74 2051 bp mRNA PRI 17-FEB-1994 DEFINITION Human mRNA for HM74. ACCESSION D10923 NID g219866 KEYWORDS GTP-binding protein; plasma membrane protein; protein coupled. SOURCE Homo sapiens monocyte, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2051) AUTHORS Nomura,H., Nielsen,B.W. and Matsushima,K. TITLE Molecular cloning of cDNAs encoding a LD78 receptor and putative leukocyte chemotactic peptide receptors JOURNAL Int. Immunol. 5 (10), 1239-1249 (1993) MEDLINE 94092629 COMMENT Submitted (13-Apr-1992) to DDBJ by: Hideki Nomura Dept. of Pharmacol. Cancer Res. Inst., Kanazawa Univ. 13-1 Takaramachi Kanazawa, Ishikawa 920 Japan Phone: 0762-62-8151 x5875 Fax: 0762-60-7704. FEATURES Location/Qualifiers source 1..2051 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="human monocyte" /cell_type="monocyte" gene 61..1224 /gene="HM74" CDS 61..1224 /gene="HM74" /codon_start=1 /product="HM74" /db_xref="PID:d1002196" /db_xref="PID:g219867" /translation="MNRHHLQDHFLEIDKKNCCVFRDDFIAKVLPPVLGLEFIFGLLG NGLALWIFCFHLKSWKSSRIFLFNLAVADFLLIICLPFVMDYYVRRSDWNFGDIPCRL VLFMFAMNRQGSIIFLTVVAVDRYFRVVHPHHALNKISNWTAAIISCLLWGITVGLTV HLLKKKLLIQNGPANVCISFSICHTFRWHEAMFLLEFLLPLGIILFCSARIIWSLRQR QMDRHAKIKRAITFIMVVAIVFVICFLPSVVVRIRIFWLLHTSGTQNCEVYRSVDLAF FITLSFTYMNSMLDPVVYYFSSPSFPNFFSTLINRCLQRKMTGEPDNNRSTSVELTGD PNKTRGAPEALMANSGEPWSPSYLGPTSNNHSKKGHCHQEPASLEKQLGCCIE" BASE COUNT 470 a 530 c 525 g 526 t ORIGIN 1 cgccactttg ctggagcatt cactaggcga ggcgctccat cggactcact agccgcactc 61 atgaatcggc accatctgca ggatcacttt ctggaaatag acaagaagaa ctgctgtgtg 121 ttccgagatg acttcattgc caaggtgttg ccgccggtgt tggggctgga gtttatcttt 181 gggcttctgg gcaatggcct tgccctgtgg attttctgtt tccacctcaa gtcctggaaa 241 tccagccgga ttttcctgtt caacctggca gtagctgact ttctactgat catctgcctg 301 ccgttcgtga tggactacta tgtgcggcgt tcagactgga actttgggga catcccttgc 361 cggctggtgc tcttcatgtt tgccatgaac cgccagggca gcatcatctt cctcacggtg 421 gtggcggtag acaggtattt ccgggtggtc catccccacc acgccctgaa caagatctcc 481 aattggacag cagccatcat ctcttgcctt ctgtggggca tcactgttgg cctaacagtc 541 cacctcctga agaagaagtt gctgatccag aatggccctg caaatgtgtg catcagcttc 601 agcatctgcc ataccttccg gtggcacgaa gctatgttcc tcctggagtt cctcctgccc 661 ctgggcatca tcctgttctg ctcagccaga attatctgga gcctgcggca gagacaaatg 721 gaccggcatg ccaagatcaa gagagccatc accttcatca tggtggtggc catcgtcttt 781 gtcatctgct tccttcccag cgtggttgtg cggatccgca tcttctggct cctgcacact 841 tcgggcacgc agaattgtga agtgtaccgc tcggtggacc tggcgttctt tatcactctc 901 agcttcacct acatgaacag catgctggac cccgtggtgt actacttctc cagcccatcc 961 tttcccaact tcttctccac tttgatcaac cgctgcctcc agaggaagat gacaggtgag 1021 ccagataata accgcagcac gagcgtcgag ctcacagggg accccaacaa aaccagaggc 1081 gctccagagg cgttaatggc caactccggt gagccatgga gcccctctta tctgggccca 1141 acctcaaata accattccaa gaagggacat tgtcaccaag aaccagcatc tctggagaaa 1201 cagttgggct gttgcatcga gtaatgtcac tggactcggc ctaaggtttc ctggaacttc 1261 cagattcaga gaatctgatt tagggaaact gtggcagatg agtgggagac tggttgcaag 1321 gtgtgaccac aggaatcctg gaggaacaga gagtaaagct tctaggcatc tgaaacttgc 1381 ttcatctctg acgctcgcag gactgaagat gggcaaattg taggcgtttc tgctgagcag 1441 agttggagcc agagatctac ttgtgacttg ttggccttct tcccacatct gcctcagact 1501 ggggggggct cagctcctcg ggtgatatct agcctgcttg tgagctctag cagggataag 1561 gagagctgag attggaggga attgtgttgc tcctggagga agcccaggca tcattaaaca 1621 agccagtagg tcacctggct tccgtggacc aattcatctt tcagacaagc tttagagaaa 1681 tggactcagg gaagagactc acatgctttg gttagtatct gtgtttccgg tgggtgtaat 1741 aggggattag ccccagaagg gactgagcta aacagtgtta ttatgggaaa ggaaatggca 1801 ttgctgcttt caaccagcga ctaatgcaat ccattcctct cttgtttata gtaatctaag 1861 ggttgagcag ttaaaacggc ttcaggatag aaagctgttt cccacctgtt tcgttttacc 1921 attaaaaggg aaacgtgcct ctgccccacg ggtagagggg gtgcacgttc ctcctggttc 1981 cttcgcttgt gtttctgtac ttaccaaaaa tctaccactt caataaattt tgataggaga 2041 caaaaaaaaa a // LOCUS HUMHMCM2 2327 bp mRNA PRI 27-MAR-1996 DEFINITION Human mRNA for hMCM2, complete cds. ACCESSION D28480 NID g516759 KEYWORDS hMCM2. SOURCE Homo sapiens fetus lung cDNA to mRNA, clone pL1994. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2327) AUTHORS Nakatsuru,S., Sudo,K. and Nakamura,Y. TITLE Molecular cloning and chromosamal mapping of a novel human gene encoding a product homologous to yeast proteins involving in DNA replication JOURNAL Unpublished (1994) REFERENCE 2 (bases 1 to 2327) AUTHORS Nakamura,Y. TITLE Direct Submission JOURNAL Submitted (05-FEB-1994) to the DDBJ/EMBL/GenBank databases. Yusuke Nakamura, Cancer Institute, Department of Biochemistry; 1-37-1 Kami-Ikebukuro, Toshima-ku, Tokyo 170, Japan (E-mail:nakamura@ganvx1.jfcr.or.jp, Tel:03-3918-0111(ex.4501), Fax:03-3918-0342) COMMENT Submitted (05-Feb-1994) to DDBJ by: Yusuke Nakamura Department of Biochemistry Cancer Institute 1-37-1 Kami-Ikebukuro Toshima-ku, Tokyo 170 Japan Phone: 03-3918-0111 x4502 Email: nakamura@ganvx1.jfcr.or.jp Fax: 03-3918-0342. FEATURES Location/Qualifiers source 1..2327 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="lung" mRNA 1..2327 CDS 545..2176 /codon_start=1 /product="hMCM2" /db_xref="PID:d1006386" /db_xref="PID:g516760" /translation="MVVATYTCDQCGAETYQPIQSPTFMPLIMCPSQECQTNRSGGRL YLQTRGSRFIKFQEMKMQEHSDQVPVGNIPRSITVLVEGENTRIAQPGDHVSVTGIFL PILRTGFRQVVQGLLSETYLEAHRIVKMNKSEDDESGAGELTREELRQIAEEDFYEKL AASIAPEIYGHEDVKKALLLLLVGGVDQSPRGMKIRGNINICLMGDPGVAKSQLLSYI DRLAPRSQYTTGRGSSGVGLTAAVLRDSVSGELTLEGGALVLADQGVCCIDEFDKMAE ADRTAIHEVMEQQTISIAKAGILTTLNARCSILAAANPAYGRYNPRRSLEQNIQLPAA LLSRFDLLWLIQDRPDRDNDLRLAQHITYVHQHSRQPPSQFEPLDMKLMRRYIAMCRE KQPMVPESLADYITAAYVEMRREAWASKDATYTSARTLLAILRLSTALARLRMVDVVE KEDVNEAIRLMEMSKDSLLGDKGQTARTQRPADVIFATVRELVSGGRSVRFSEAEQRC VSRGFTPAQFQAALDEYEELNVWQVNASRTRITFV" misc_feature 930..1620 /note="sequence homologous to MCM2, MCM3, CDC21 and CDC46 in yeast." polyA_signal 2305..2310 polyA_site 2327 BASE COUNT 518 a 616 c 645 g 548 t ORIGIN Chromosome 7. 1 cgcaaacatt atgggacccc tttaaatcaa agataagaat ctggcattcc tgggctgcat 61 atagcccata gatctgtttg atttggctaa ttcggttttt aggattgaat ttattgacaa 121 ttctgaacat tcgggccatt tcccataaag ctcactattt ccaatttccc ttgagaaaat 181 ggaaagctcc cacataggtg ggtgtgttca cttggggctc agtagtggtg gccctgtttt 241 caagagggca gtcctcacca tgtcacccca tccccaccta gcctgcttct ctcaattatt 301 ttacctgcct ggtccctata ggcatttgtg tcagcccctt ttgctttaag tctctgtcgg 361 gaaagatgta gggattggtt ctccaggatc ttgtttgtga ctgttttctc cccttagtga 421 gctgtatttt caaggcccta gcagcaacaa gcctcgtgtg atccgggaag tgcgggctga 481 ctctgtgggg aagttggtaa ctgtgcgtgg aatcgtcact cgtgtctctg aagtcaaacc 541 caagatggtg gtggccactt acacttgtga ccagtgtggg gcagagacct accagccgat 601 ccagtctccc actttcatgc ctctgatcat gtgcccaagc caggagtgcc aaaccaaccg 661 ctcaggaggg cggctgtatc tgcagacacg gggctccaga ttcatcaaat tccaggagat 721 gaagatgcaa gaacatagtg atcaggtgcc tgtgggaaat atccctcgta gtatcacggt 781 gctggtagaa ggagagaaca caaggattgc ccagcctgga gaccacgtca gcgtcactgg 841 tattttcttg ccaatcctgc gcactgggtt ccgacaggtg gtacagggtt tactctcaga 901 aacctacctg gaagcccatc ggattgtgaa gatgaacaag agtgaggatg atgagtctgg 961 ggctggagag ctcaccaggg aggagctgag gcaaattgca gaggaggatt tctacgaaaa 1021 gctggcagct tcaatcgccc cagaaatata cgggcatgaa gatgtgaaga aggcactgct 1081 gctcctgcta gtcgggggtg tggaccagtc tcctcgaggc atgaaaatcc ggggcaacat 1141 caacatctgt ctgatggggg atcctggtgt ggccaagtct cagctcctgt catacattga 1201 tcgactggcg cctcgcagcc agtacacaac aggccggggc tcctcaggag tggggcttac 1261 ggcagctgtg ctgagagact ccgtgagtgg agaactgacc ttagagggtg gggccctggt 1321 gctggctgac cagggtgtgt gctgcattga tgagttcgac aagatggctg aggccgaccg 1381 cacagccatc cacgaggtca tggagcagca gaccatctcc attgccaagg ccggcattct 1441 caccacactc aatgcccgct gctccatcct ggctgccgcc aaccctgcct acgggcgcta 1501 caaccctcgc cgcagcctgg agcagaacat acagctacct gctgcactgc tctcccggtt 1561 tgacctcctc tggctgattc aggaccggcc cgaccgagac aatgacctac ggttggccca 1621 gcacatcacc tatgtgcacc agcacagccg gcagcccccc tcccagtttg aacctctgga 1681 catgaagctc atgaggcgtt acatagccat gtgccgcgag aagcagccca tggtgccaga 1741 gtctctggct gactacatca cagcagcata cgtggagatg aggcgagagg cttgggctag 1801 taaggatgcc acctatactt ctgcccggac cctgctggct atcctgcgcc tttccactgc 1861 tctggcacgt ctgagaatgg tggatgtggt ggagaaagaa gatgtgaatg aagccatcag 1921 gctaatggag atgtcaaagg actctcttct aggagacaag gggcagacag ctaggactca 1981 gagaccagca gatgtgatat ttgccaccgt ccgtgaactg gtctcagggg gccgaagtgt 2041 ccggttctct gaggcagagc agcgctgtgt atctcgtggc ttcacacccg cccagttcca 2101 ggcggctctg gatgaatatg aggagctcaa tgtctggcag gtcaatgctt cccggacacg 2161 gatcactttt gtctgattcc agcctgcttg caaccctggg gtcctcttgt tccctgctgg 2221 cctgcccctt gggaaggggc agtgatgcct ttgaggggaa ggaggagccc ctctttctcc 2281 catgctgcac ttactccttt tgctaataaa agtgtttgta gattgtc // LOCUS HUMHME 1778 bp mRNA PRI 31-DEC-1994 DEFINITION Human metalloproteinase (HME) mRNA, complete cds. ACCESSION L23808 NID g435969 KEYWORDS metalloproteinase. SOURCE Homo sapiens adult cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1778) AUTHORS Shapiro,S.D., Kobayashi,D.K. and Ley,T.J. TITLE Cloning and characterization of a unique elastolytic metalloproteinase produced by human alveolar macrophages JOURNAL J. Biol. Chem. 268 (32), 23824-23829 (1993) MEDLINE 94043200 FEATURES Location/Qualifiers source 1..1778 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="macrophage" /dev_stage="adult" gene 13..1425 /gene="HME" CDS 13..1425 /gene="HME" /codon_start=1 /product="metalloproteinase" /db_xref="PID:g435970" /translation="MKFLLILLLQATASGALPLNSSTSLEKNNVLFGERYLEKFYGLE INKLPVTKMKYSGNLMKEKIQEMQHFLGLKVTGQLDTSTLEMMHAPRCGVPDLHHFRE MPGGPVWRKHYITYRINNYTPDMNREDVDYAIRKAFQVWSNVTPLKFSKINTGMADIL VVFARGAHGDFHAFDGKGGILAHAFGPGSGIGGDAHFDEDEFWTTHSGGTNLFLTAVH EIGHSLGLGHSSDPKAVMFPTYKYVDINTFRLSADDIRGIQSLYGDPKENQRLPNPDN SEPALCDPNLSFDAVTTVGNKIFFFKDRFFWLKVSERPKTSVNLISSLWPTLPSGIEA AYEIEARNQVFLFKDDKYWLISNLRPEPNYPKSIHSFGFPNFVKKIDAAVFNPRFYRT YFFVDNQYWRYDERRQMMDPGYPKLITKNFQGIGPKIDAVFYSKNKYYYFFQGSNQFE YDFLLQRITKTLKSNSWFGC" polyA_site 1778 /gene="HME" BASE COUNT 548 a 357 c 337 g 536 t ORIGIN 1 tagaagttta caatgaagtt tcttctaata ctgctcctgc aggccactgc ttctggagct 61 cttcccctga acagctctac aagcctggaa aaaaataatg tgctatttgg tgagagatac 121 ttagaaaaat tttatggcct tgagataaac aaacttccag tgacaaaaat gaaatatagt 181 ggaaacttaa tgaaggaaaa aatccaagaa atgcagcact tcttgggtct gaaagtgacc 241 gggcaactgg acacatctac cctggagatg atgcacgcac ctcgatgtgg agtccccgat 301 ctccatcatt tcagggaaat gccagggggg cccgtatgga ggaaacatta tatcacctac 361 agaatcaata attacacacc tgacatgaac cgtgaggatg ttgactacgc aatccggaaa 421 gctttccaag tatggagtaa tgttaccccc ttgaaattca gcaagattaa cacaggcatg 481 gctgacattt tggtggtttt tgcccgtgga gctcatggag acttccatgc ttttgatggc 541 aaaggtggaa tcctagccca tgcttttgga cctggatctg gcattggagg ggatgcacat 601 ttcgatgagg acgaattctg gactacacat tcaggaggca caaacttgtt cctcactgct 661 gttcacgaga ttggccattc cttaggtctt ggccattcta gtgatccaaa ggctgtaatg 721 ttccccacct acaaatatgt cgacatcaac acatttcgcc tctctgctga tgacatacgt 781 ggcattcagt ccctgtatgg agacccaaaa gagaaccaac gcttgccaaa tcctgacaat 841 tcagaaccag ctctctgtga ccccaatttg agttttgatg ctgtcactac cgtgggaaat 901 aagatctttt tcttcaaaga caggttcttc tggctgaagg tttctgagag accaaagacc 961 agtgttaatt taatttcttc cttatggcca accttgccat ctggcattga agctgcttat 1021 gaaattgaag ccagaaatca agtttttctt tttaaagatg acaaatactg gttaattagc 1081 aatttaagac cagagccaaa ttatcccaag agcatacatt cttttggttt tcctaacttt 1141 gtgaaaaaaa ttgatgcagc tgtttttaac ccacgttttt ataggaccta cttctttgta 1201 gataaccagt attggaggta tgatgaaagg agacagatga tggaccctgg ttatcccaaa 1261 ctgattacca agaacttcca aggaatcggg cctaaaattg atgcagtctt ctattctaaa 1321 aacaaatact actatttctt ccaaggatct aaccaatttg aatatgactt cctactccaa 1381 cgtatcacca aaacactgaa aagcaatagc tggtttggtt gttagaaatg gtgtaattaa 1441 tggtttttgt tagttcactt cagcttaata agtatttatt gcatatttgc tatgtcctca 1501 gtgtaccact acttagagat atgtatcata aaaataaaat ctgtaaacca taggtaatga 1561 ttatataaaa tacataatat ttttcaattt tgaaaactct aattgtccat tcttgcttga 1621 ctctactatt aagtttgaaa atagttacct tcaaagcaag ataattctat ttgaagcatg 1681 ctctgtaagt tgcttcctaa catccttgga ctgagaaatt atacttactt ctggcataac 1741 taaaattaag tatatatatt ttggctcaaa taaaattg // LOCUS HUMHMG17 1187 bp mRNA PRI 08-NOV-1994 DEFINITION Human non-histone chromosomal protein HMG-17 mRNA, complete cds. ACCESSION M12623 NID g184233 KEYWORDS HMG-17; chromosomal protein; high mobility group protein; nonhistone protein. SOURCE Human (MCF-7 cells), cDNA to mRNA, clone pH17c. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1187) AUTHORS Landsman,D., Soares,N., Gonzalez,F.J. and Bustin,M. TITLE Chromosomal protein HMG-17. Complete human cDNA sequence and evidence for a multigene family [published erratum appears in J Biol Chem 1988 Nov 5;263(31):16512] JOURNAL J. Biol. Chem. 261 (16), 7479-7484 (1986) MEDLINE 86224021 REFERENCE 2 (bases 1 to 1187) AUTHORS Landsman,D. JOURNAL Unpublished (1988) COMMENT [2] revises [1]. Draft entry and computer-readable sequence for [1] kindly provided by D.Landsman, 08-DEC-1986 and for [2] 03-SEP-1988. FEATURES Location/Qualifiers source 1..1187 /organism="Homo sapiens" /db_xref="taxon:9606" /map="1p36.1-p35" mRNA <1..1184 /note="HMG-17 mRNA" gene 89..361 /gene="HMG17" CDS 89..361 /gene="HMG17" /note="high mobility group protein 17" /codon_start=1 /db_xref="GDB:G00-120-053" /db_xref="PID:g306864" /translation="MPKRKAEGDAKGDKAKVKDEPQRRSARLSAKPAPPKPEPKPKKA PAKKGEKVPKGKKGKADAGKEGNNPAENGDAKTDQAQKAEGAGDAK" BASE COUNT 329 a 257 c 274 g 327 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcggac ccccggaccg accaaagccc gcgcgccgct gcatcccgcg tccagcacct 61 acgtcccgct gccgtcgccg ccgccaccat gcccaagaga aaggctgaag gggatgctaa 121 gggagataaa gcaaaggtga aggacgaacc acagagaaga tccgcgaggt tgtctgctaa 181 acctgctcct ccaaagccag agcccaagcc taaaaaggcc cctgcaaaga agggagagaa 241 ggtacccaaa gggaaaaagg gaaaagctga tgctggcaag gaggggaata accctgcaga 301 aaatggagat gccaaaacag accaggcaca gaaagctgaa ggtgctggag atgccaagtg 361 aagtgtgtgc atttttgata actgtgtact tctggtgact gtacagtttg aaatactatt 421 ttttatcaag ttttataaaa atgcagaatt ttgttttact tttttttttt ttttaaaagc 481 tatgttgtta gcacacagaa cacttcattg ttgtttttgg gggaaggggc atatgtcact 541 aatagaatgt ctccaaagct ggattgatgt ggagaaaaca cctttccctt ctagttttga 601 gagacttcct cttggctccc aggaggaggg attccctgac tttgacacac atggccacct 661 tggcacaaaa gccttgtggt atagaaaaac aaatttgttt ttatgtcctc ttctcccttt 721 ccatctttca gcatagactt aactccctta agcccagaca tctgttgaga cctgacccct 781 agtcattggt taccagtgtg tcaggcaatc tggactttcc agtgatgcca ctgagatggc 841 acctgtcaaa agagcagtgg ttccatttct agattgtgga tcttcagata aattctgcca 901 ttttcatttc acttcctgaa agtcagggtc ggcttgtgaa aagttgttaa acaacatgct 961 aaatgtgaaa tgtcaaccct cactctaaac tttccctgtt cagagcatca gatgaagact 1021 tcattgggtt ttatagtggc tttctgattt ttggtagtcc attgaagaag ggagtttgaa 1081 agttgttgta tactgttaac gattgtctgc ccatgtcctg cctgaaatac catgattgtt 1141 tatggaaagt atctttaata aagctggata cagtttggcc cgaattc // LOCUS HUMHMGBP 2839 bp mRNA PRI 31-DEC-1994 DEFINITION Human high mobility group box (SSRP1) mRNA, complete cds. ACCESSION M86737 NID g184241 KEYWORDS high mobility group box protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2839) AUTHORS Bruhn,S.L., Pil,P.M., Essigmann,J.M., Housman,D.E. and Lippard,S.J. TITLE Isolation and characterization of human cDNA clones encoding a high mobility group box protein that recognizes structural distortions to DNA caused by binding of the anticancer agent cisplatin JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (6), 2307-2311 (1992) MEDLINE 92196107 FEATURES Location/Qualifiers source 1..2839 /organism="Homo sapiens" /db_xref="taxon:9606" gene 275..2805 /gene="SSRP1" CDS 275..2404 /gene="SSRP1" /note="recognizes structural distortions to DNA caused by binding of the anticancer agent cisplatin" /codon_start=1 /product="high mobility group box" /db_xref="PID:g184242" /translation="MAETLEFNDVYQEVKGSMNDGRLRLSRQGIIFKNSKTGKVDNIQ AGELTEGIWRRVALGHGLKLLTKNGHVYKYDGFRESEFEKLSDFFKTHYRLELMEKDL CVKGWNWGTVKFGGQLLSFDIGDQPVFEIPLSNVSQCTTGKNEVTLEFHQNDDAEVSL MEVRFYVPPTQEDGVDPVEAFAQNVLSKADVIQATGDAICIFRELQCLTPRGRYDIRI YPTFLHLHGKTFDYKIPYTTVLRLFLLPHKDQRQMFFVISLDPPIKQGQTRYHFLILL FSKDEDISLTLNMNEEEVEKRFEGRLTKNMSGSLYEMVSRVMKALVNRKITVPGNFQG HSGAQCITCSYKASSGLLYPLERGFIYVHKPPVHIRFDEISFVNFARGTTTTRSFDFE IETKQGTQYTFSSIEREEYGKLFDFVNAKKLNIKNRGLKEGMNPSYDEYADSDEDQHD AYLERMKEEGKIREENANDSSDDSGEETDESFNPGEEEEDVAEEFDSNASASSSSNEG DSDRDEKKRKQLKKAKMAKDRKSRKKPVEVKKGKDPNAPKRPMSAYMLWLNASREKIK SDHPGISITDLSKKAGEIWKGMSKEKKEEWDRKAEDARRDYEKAMKEYEGGRGESSKR DKSKKKKKVKVKMEKKSTPSRGSSSKSSSRQLSESFKSKEFVSSDESSSGENKSKKKR RRSEDSEEEELASTPPSSEDSASGSDE" polyA_signal 2800..2805 /gene="SSRP1" BASE COUNT 746 a 717 c 786 g 590 t ORIGIN 1 gaattccgta cggcttccgg tggcgggacg cggggccgcg cacgcgggaa aagcttcccc 61 ggtgtccccc catccccctc cccgcgcccc ccccgcgtcc ccccagcgcg cccacctctc 121 gcgccggggc cctcgcgagg ccgcagcctg aggagattcc caacctgctg agcatccgca 181 cacccactca ggagttgggg cccagctccc agtttacttg gtttcccttg tgcagcctgg 241 ggctctgccc aggccaccac aggcaggggt cgacatggca gagacactgg agttcaacga 301 cgtctatcag gaggtgaaag gttccatgaa tgatggtcga ctgaggttga gccgtcaggg 361 catcatcttc aagaatagca agacaggcaa agtggacaac atccaggctg gggagttaac 421 agaaggtatc tggcgccgtg ttgctctggg ccatggactt aaactgctta caaagaatgg 481 ccatgtctac aagtatgatg gcttccgaga atcggagttt gagaaactct ctgatttctt 541 caaaactcac tatcgccttg agctaatgga gaaggacctt tgtgtgaagg gctggaactg 601 ggggacagtg aaatttggtg ggcagctgct ttcctttgac attggtgacc agccagtctt 661 tgagataccc ctcagcaatg tgtcccagtg caccacaggc aagaatgagg tgacactgga 721 attccaccaa aacgatgacg cagaggtgtc tctcatggag gtgcgcttct acgtcccacc 781 cacccaggag gatggtgtgg accctgttga ggcctttgcc cagaatgtgt tgtcaaaggc 841 ggatgtaatc caggccacgg gagatgccat ctgcatcttc cgggagctgc agtgtctgac 901 tcctcgtggt cgttatgaca ttcggatcta ccccaccttt ctgcacctgc atggcaagac 961 ctttgactac aagatcccct acaccacagt actgcgtctg tttttgttac cccacaagga 1021 ccagcgccag atgttctttg tgatcagcct ggatccccca atcaagcaag gccaaactcg 1081 ctaccacttc ctgatcctcc tcttctccaa ggacgaggac atttcgttga ctctgaacat 1141 gaacgaggaa gaagtggaga agcgctttga gggtcggctc accaagaaca tgtcaggatc 1201 cctctatgag atggtcagcc gggtcatgaa agcactggta aaccgcaaga tcacagtgcc 1261 aggcaacttc caagggcact caggggccca gtgcattacc tgttcctaca aggcaagctc 1321 aggactgctc tacccgctgg agcggggctt catctacgtc cacaagccac ctgtgcacat 1381 ccgcttcgat gagatctcct ttgtcaactt tgctcgtggt accactacta ctcgttcctt 1441 tgactttgaa attgagacca agcagggcac tcagtatacc ttcagcagca ttgagaggga 1501 ggagtacggg aaactgtttg attttgtcaa cgcgaaaaag ctcaacatca aaaaccgagg 1561 attgaaagag ggcatgaacc caagctacga tgaatatgct gactctgatg aggaccagca 1621 tgatgcctac ttggagagga tgaaggagga aggcaagatc cgggaggaga atgccaatga 1681 cagcagcgat gactcaggag aagaaaccga tgagtcattc aacccaggtg aagaggagga 1741 agatgtggca gaggagtttg acagcaacgc ctctgccagc tcctccagta atgagggtga 1801 cagtgaccgg gatgagaaga agcggaaaca gctcaaaaag gccaagatgg ccaaggaccg 1861 caagagccgc aagaagcctg tggaggtgaa gaagggcaaa gaccccaatg cccccaagag 1921 gcccatgtct gcatacatgc tgtggctcaa tgccagccga gagaagatca agtcagacca 1981 tcctggcatc agcatcacgg atctttccaa gaaggcaggc gagatctgga agggaatgtc 2041 caaagagaag aaagaggagt gggatcgcaa ggctgaggat gccaggaggg actatgaaaa 2101 agccatgaaa gaatatgaag ggggccgagg cgagtcttct aagagggaca agtcaaagaa 2161 gaagaagaaa gtaaaggtaa agatggaaaa gaaatccacg ccctctaggg gctcatcatc 2221 caagtcgtcc tcaaggcagc taagcgagag cttcaagagc aaagagtttg tgtctagtga 2281 tgagagctct tcgggagaga acaagagcaa aaagaagagg aggaggagcg aggactctga 2341 agaagaagaa ctagccagta ctccccccag ctcagaggac tcagcgtcag gatccgatga 2401 gtagaaacgg aggaaggttc tctttgcgct tgccttctca caccccccga ctccccaccc 2461 atattttggt accagtttct cctcatgaaa tgcagtccct ggattctgtg ccatctgaac 2521 atgctctcct gttggtgtgt atgtcactag ggcagtgggg agacgtctta actctgctgc 2581 ttcccaagga tggctgttta taatttgggg agagataggg tgggaggcag ggcaatgcag 2641 gatccaaatc ctcatcttac tttcccgacc ttaaggatgt agctgctgct tgtcctgttc 2701 aagttgctgg agcaggggtc atgtgaggcc aggcctgtag ctcctacctg gggcctattt 2761 ctactttcat tttgtatttc tggtctgtga aaatgattta ataaagggaa ctgactttgg 2821 aaaccaaaaa aaggaattc // LOCUS HUMHMGCOA 2904 bp mRNA PRI 08-NOV-1994 DEFINITION Human 3-hydroxy-3-methylglutaryl coenzyme A reductase mRNA, complete cds. ACCESSION M11058 NID g184243 KEYWORDS 3-hydroxy-3-methylglutaryl coenzyme A reductase; glycoprotein. SOURCE Human fetal adrenal gland, cDNA to mRNA, library of T.Maniatis, clone pHRed-102. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2904) AUTHORS Luskey,K.L. and Stevens,B. TITLE Human 3-hydroxy-3-methylglutaryl coenzyme A reductase. Conserved domains responsible for catalytic activity and sterol-regulated degradation JOURNAL J. Biol. Chem. 260 (18), 10271-10277 (1985) MEDLINE 85261451 COMMENT Draft entry and sequence in computer readable form for [1] kindly provided by K.L.Luskey, 16-JAN-1986. HMG-CoA reductase is the rate-limiting enzyme for cholesterol synthesis and is regulated via a negative feedback mechanism mediated by sterols and non-sterol metabolites derived from mevalonate, the product of the reaction catalyzed by reductase. Normally in mammalian cells this enzyme is suppressed by cholesterol derived from the internalization and degradation of low density lipoprotein (LDL) via the LDL receptor. Competitive inhibitors of the reductase induce the expression of LDL receptors in the liver, which in turn increases the catabolism of plasma LDL and lowers the plasma concentration of cholesterol, an important determinant of atherosclerosis. The sequence coding for the highly conserved membrane bound region of the protein is located at positions 51-1067, that coding for the linker part of the protein at positions 1068-1397 and for the strongly conserved water-soluble catalytic part at positions 1398-2714. FEATURES Location/Qualifiers source 1..2904 /organism="Homo sapiens" /db_xref="taxon:9606" /map="5q13.3-q14" mRNA <1..>2904 /note="HMG CoA mRNA" gene 51..2717 /gene="HMGCR" CDS 51..2717 /gene="HMGCR" /note="3-hydroxy-3-methylglutaryl coenzyme A reductase" /codon_start=1 /db_xref="GDB:G00-119-312" /db_xref="PID:g306865" /translation="MLSRLFRMHGLFVASHPWEVIVGTVTLTICMMSMNMFTGNNKIC GWNYECPKFEEDVLSSDIIILTITRCIAILYIYFQFQNLRQLGSKYILGIAGLFTIFS SFVFSTVVIHFLDKELTGLNEALPFFLLLIDLSRASTLAKFALSSNSQDEVRENIARG MAILGPTFTLDALVECLVIGVGTMSGVRQLEIMCCFGCMSVLANYFVFMTFFPACVSL VLELSRESREGRPIWQLSHFARVLEEEENKPNPVTQRVKMIMSLGLVLVHAHSRWIAD PSPQNSTADTSKVSLGLDENVSKRIEPSVSLWQFYLSKMISMDIEQVITLSLALLLAV KYIFFEQTETESTLSLKNPITSPVVTQKKVPDNCCRREPMLVRNNQKCDSVEEETGIN RERKVEVIKPLVAETDTPNRATFVVGNSSLLDTSSVLVTQEPEIELPREPRPNEECLQ ILGNAEKGAKFLSDAEIIQLVNAKHIPAYKLETLMETHERGVSIRRQLLSKKLSEPSS LQYLPYRDYNYSLVMGACCENVIGYMPIPVGVAGPLCLDEKEFQVPMATTEGCLVAST NRGCRAIGLGGGASSRVLADGMTRGPVVRLPRACDSAEVKAWLETSEGFAVIKEAFDS TSRFARLQKLHTSIAGRNLYIRFQSRSGDAMGMNMISKGTEKALSKLHEYFPEMQILA VSGNYCTDKKPAAINWIEGRGKSVVCEAVIPAKVVREVLKTTTEAMIEVNINKNLVGS AMAGSIGGYNAHAANIVTAIYIACGQDAAQNVGSSNCITLMEASGPTNEDLYISCTMP SIEIGTVGGGTNLLPQQACLQMLGVQGACKDNPGENARQLARIVCGTVMAGELSLMAA LAAGHLVKSHMIHNRSKINLQDLQGACTKKTA" BASE COUNT 822 a 597 c 678 g 807 t ORIGIN 27 bp upstream of BamHI site; chromosome 5q13.3-q14. 1 ttcggtggcc tctagtgaga tctggaggat ccaaggattc tgtagctaca atgttgtcaa 61 gactttttcg aatgcatggc ctctttgtgg cctcccatcc ctgggaagtc atagtgggga 121 cagtgacact gaccatctgc atgatgtcca tgaacatgtt tactggtaac aataagatct 181 gtggttggaa ttatgaatgt ccaaagtttg aagaggatgt tttgagcagt gacattataa 241 ttctgacaat aacacgatgc atagccatcc tgtatattta cttccagttc cagaatttac 301 gtcaacttgg atcaaaatat attttgggta ttgctggcct tttcacaatt ttctcaagtt 361 ttgtattcag tacagttgtc attcacttct tagacaaaga attgacaggc ttgaatgaag 421 ctttgccctt tttcctactt ttgattgacc tttccagagc aagcacatta gcaaagtttg 481 ccctcagttc caactcacag gatgaagtaa gggaaaatat tgctcgtgga atggcaattt 541 taggtcctac gtttaccctc gatgctcttg ttgaatgtct tgtgattgga gttggtacca 601 tgtcaggggt acgtcagctt gaaattatgt gctgctttgg ctgcatgtca gttcttgcca 661 actacttcgt gttcatgact ttcttcccag cttgtgtgtc cttggtatta gagctttctc 721 gggaaagccg cgagggtcgt ccaatttggc agctcagcca ttttgcccga gttttagaag 781 aagaagaaaa taagccgaat cctgtaactc agagggtcaa gatgattatg tctctaggct 841 tggttcttgt tcatgctcac agtcgctgga tagctgatcc ttctcctcaa aacagtacag 901 cagatacttc taaggtttca ttaggactgg atgaaaatgt gtccaagaga attgaaccaa 961 gtgtttccct ctggcagttt tatctctcta aaatgatcag catggatatt gaacaagtta 1021 ttaccctaag tttagctctc cttctggctg tcaagtacat cttctttgaa caaacagaga 1081 cagaatctac actctcatta aaaaacccta tcacatctcc tgtagtgaca caaaagaaag 1141 tcccagacaa ttgttgtaga cgtgaaccta tgctggtcag aaataaccag aaatgtgatt 1201 cagtagagga agagacaggg ataaaccgag aaagaaaagt tgaggttata aaacccttag 1261 tggctgaaac agatacccca aacagagcta catttgtggt tggtaactcc tccttactcg 1321 atacttcatc agtactggtg acacaggaac ctgaaattga acttcccagg gaacctcggc 1381 ctaatgaaga atgtctacag atacttggga atgcagagaa aggtgcaaaa ttccttagtg 1441 atgctgagat catccagtta gtcaatgcta agcatatccc agcctacaag ttggaaactc 1501 tgatggaaac tcatgagcgt ggtgtatcta ttcgccgaca gttactttcc aagaagcttt 1561 cagaaccttc ttctctccag tacctacctt acagggatta taattactcc ttggtgatgg 1621 gagcttgttg tgagaatgtt attggatata tgcccatccc tgttggagtg gcaggacccc 1681 tttgcttaga tgaaaaagaa tttcaggttc caatggcaac aacagaaggt tgtcttgtgg 1741 ccagcaccaa tagaggctgc agagcaatag gtcttggtgg aggtgccagc agccgagtcc 1801 ttgcagatgg gatgactcgt ggcccagttg tgcgtcttcc acgtgcttgt gactctgcag 1861 aagtgaaagc ctggctcgaa acatctgaag ggttcgcagt gataaaggag gcatttgaca 1921 gcactagcag atttgcacgt ctacagaaac ttcatacaag tatagctgga cgcaaccttt 1981 atatccgttt ccagtccagg tcaggggatg ccatggggat gaacatgatt tcaaagggta 2041 cagagaaagc actttcaaaa cttcacgagt atttccctga aatgcagatt ctagccgtta 2101 gtggtaacta ttgtactgac aagaaacctg ctgctataaa ttggatagag ggaagaggaa 2161 aatctgttgt ttgtgaagct gtcattccag ccaaggttgt cagagaagta ttaaagacta 2221 ccacagaggc tatgattgag gtcaacatta acaagaattt agtgggctct gccatggctg 2281 ggagcatagg aggctacaac gcccatgcag caaacattgt caccgccatc tacattgcct 2341 gtggacagga tgcagcacag aatgttggta gttcaaactg tattacttta atggaagcaa 2401 gtggtcccac aaatgaagat ttatatatca gctgcaccat gccatctata gagataggaa 2461 cggtgggtgg tgggaccaac ctactacctc agcaagcctg tttgcagatg ctaggtgttc 2521 aaggagcatg caaagataat cctggggaaa atgcccggca gcttgcccga attgtgtgtg 2581 ggaccgtaat ggctggggaa ttgtcactta tggcagcatt ggcagcagga catcttgtca 2641 aaagtcacat gattcacaac aggtcgaaga tcaatttaca agacctccaa ggagcttgca 2701 ccaagaagac agcctgaata gcccgacagt tctgaactgg aacatgggca ttgggttcta 2761 aaggactaac ataaaatctg tgaattaaaa aagctcaatg cattgtcttg tggaggatga 2821 ataaatgtga tcactgagac agccacttgg tttttggctc tttcagagag gtctcaggtt 2881 ctttccatgc agactcctca gatc // LOCUS HUMHNRNA 1666 bp mRNA PRI 08-NOV-1994 DEFINITION Human nuclear ribonucleoprotein particle (hnRNP) C protein mRNA, complete cds. ACCESSION M16342 NID g184266 KEYWORDS C gene; nuclear protein; ribonucleoprotein particle. SOURCE Human premonocytic cell line U937, cDNA to mRNA, clones pHC[12,5]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1666) AUTHORS Swanson,M.S., Nakagawa,T.Y., LeVan,K. and Dreyfuss,G. TITLE Primary structure of human nuclear ribonucleoprotein particle C proteins: conservation of sequence and domain structures in heterogeneous nuclear RNA, mRNA, and pre-rRNA-binding proteins JOURNAL Mol. Cell. Biol. 7 (5), 1731-1739 (1987) MEDLINE 87257872 COMMENT Draft entry and clean copy for sequence [1] kindly provided by G.dreyfuss, 13-JUL-1988. FEATURES Location/Qualifiers source 1..1666 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Unassigned" mRNA <1..1288 /note="CP mRNA (alt.)" mRNA <1..1666 /note="CP mRNA (alt.)" gene 123..995 /gene="SNRPC" CDS 123..995 /gene="SNRPC" /note="C protein" /codon_start=1 /db_xref="GDB:G00-118-878" /db_xref="PID:g306875" /translation="MASNVTNKTDPRSMNSRVFIGNLNTLVVKKSDVEAIFSKYGKIV GCSVHKGFAFVQYVNERNARAAVAGEDGRMIAGQVLDINLAAEPKVNRGKAGVKRSAA EMYGSSFDLDYDFQRDYYDRMYSYPARVPPPPPIARAVVPSKRQRVSGNTSRRGKSGF NSKSGQRGSSKSGKLKGDDLQAIKKELTQIKQKVDSLLENLEKMEKEQSKQAVEMKND KSEEEQSSSSVKKDETNVKMESEGGADDSAEEGDLLDDDDNEDRGDDQLELIKDDEKE AEEGEDDRDSANGG" BASE COUNT 472 a 328 c 392 g 474 t ORIGIN 1 bp upstream of EcoRI site. 1 ggaattccaa acccgggagt aggagactca gaatcgaatc tcttctccct ccccttcttg 61 tgagattttt ttgatcttca gctacatttt cggctttgtg agaaacctta ccatcaaaca 121 cgatggccag caacgttacc aacaagacag atcctcgctc catgaactcc cgtgtattca 181 ttgggaatct caacactctt gtggtcaaga aatctgatgt ggaggcaatc ttttcgaagt 241 atggcaaaat tgtgggctgc tctgttcata agggctttgc cttcgttcag tatgttaatg 301 agagaaatgc ccgggctgct gtagcaggag aggatggcag aatgattgct ggccaggttt 361 tagatattaa cctggctgca gagccaaaag tgaaccgagg aaaagcaggt gtgaaacgat 421 ctgcagcgga gatgtacggc tcctcttttg acttggacta tgactttcaa cgggactatt 481 atgataggat gtacagttac ccagcacgtg tacctcctcc tcctcctatt gctcgggctg 541 tagtgccctc gaaacgtcag cgtgtatcag gaaacacttc acgaaggggc aaaagtggct 601 tcaattctaa gagtggacag cggggatctt ccaagtctgg aaagttgaaa ggagatgacc 661 ttcaggccat taagaaggag ctgacccaga taaaacaaaa agtggattct ctcctggaaa 721 acctggaaaa aatggaaaag gaacagagca aacaagcagt agagatgaag aatgataagt 781 cagaagagga gcagagcagc agctccgtga agaaagatga gactaatgtg aagatggagt 841 ctgagggggg tgcagatgac tctgctgagg agggggacct actggatgat gatgataatg 901 aagatcgggg ggatgaccag ctggagttga tcaaggatga tgaaaaagag gctgaggaag 961 gagaggatga cagagacagc gccaatggag gatgactctt aagcacatag tggggtttag 1021 aaatcttatc ccattatttc tttacctagg cgcttgtcta agatcaaatt tttcaccaga 1081 tcctctcccc tagtatcttc agcacatgct cactgttctc cccatccttg tccttcccat 1141 gttcattaat tcatattgcc ccgcgcctag tcccattttc acttcctttg acgctcctag 1201 tagttttgtt aagtcttacc ctgtaatttt tgcttttaat tttgatacct ctttatgact 1261 taacaataaa aaggatgtat ggtttttatc aactgtctcc aaaataatct cttgttatgc 1321 agggagtaca gttcttttca ttcatacata agttcagtag ttgcttccct aactgcaaag 1381 gcaatctcat ttagttgagt agctcttgaa agcagctttg agttagaagt atgtgtgtta 1441 caccctcaca ttagtgtgct gtgtggggca gttcaacaca aatgtaacaa ttatttttgt 1501 gaatgagagt tggcatgtca aatgcatcct ctagaaaaat aattagtgtt atagtcttaa 1561 gatttgtttt ctaaagttga tactgtggga tttttgtgaa cagcctgatg tttgggacct 1621 tttttcctca aaataaacaa gtccttatta aaccaggaat ttggag // LOCUS HUMHO1A 3986 bp mRNA PRI 13-JUL-1993 DEFINITION Homo sapiens adenosine triphosphatase mRNA, complete cds. ACCESSION M95541 NID g184269 KEYWORDS adenosine triphosphatase. SOURCE Homo sapiens (library: lambda zap library from NIDR/NIH) bone cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3986) AUTHORS Kumar,R., Haugen,J.D. and Penniston,J.T. TITLE Molecular cloning of a plasma membrane calcium pump from normal human osteoblasts JOURNAL J. Bone Miner. Res. 8, 505-513 (1993) MEDLINE 93235706 FEATURES Location/Qualifiers source 1..3986 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="osteoblast" /tissue_type="bone" /tissue_lib="lambda zap library from NIDR/NIH" CDS 182..3844 /note="plasma membrane calciun pump HO-1" /codon_start=1 /product="adenosine triphosphatase" /db_xref="PID:g184270" /translation="MGDMANNSVAYSGVKNSLKEANHDGDFGITLAELRALMELRSTD ALRKIQESYGDVYGICTKLKTSPNEGLSGNPADLERREAVFGKNFIPPKKPKTFLQLV WEALQDVTLIILEIAAIVSLGLSFYQPPEGDNALCGEVSVGEEEGEGETGWIEGAAIL LSVVCVVLVTAFNDWSKEKQFRGLQSRIEQEQKFTVIRGGQVIQIPVADITVGDIAQV KYGDLLPADGILIQGNDLKIDESSLTGESDHVKKSLDKDPLLLSGTHVMEGSGRMVVT AVGVNSQTGIIFTLLGAGGEEEEKKDEKKKEKKNKKQDGAIENRNKAKAQDGAAMEMQ PLKSEEGGDGDEKDKKKANLPKKEKSVLQGKLTKLAVQIGKAGLLMSAITVIILVLYF VIDTFWVQKRPWLAECTPIYIQYFVKFFIIGVTVLVVAVPEGLPLAVTISLAYSVKKM MKDNNLVRHLDACETMGNATAICSDKTGTLTMNRMTVVQAYINEKHYKKVPEPEAIPP NILSYLVTGISVNCAYTSKILPPEKEGGLPRHVGNKTECALLGLLLDLKRDYQDVRNE IPEEALYKVYTFNSVRKSMSTVLKNSDGSYRIFSKGASEIILKKCFKILSANGEAKVF RPRDRDDIVKTVIEPMASEGLRTICLAFRDFPAGEPEPEWDNENDIVTGLTCIAVVGI EDPVRPEVPDAIKKCQRAGITVRMVTGDNINTARAIATKCGILHPGEDFLCLEGKDFN RRIRNEKGEIEQERIDKIWPKLRVLARSSPTDKHTLVKGIIDSTVSDQRQVVAVTGDG TNDGPALKKADVGFAMGIAGTDVAKEASDIILTDDNFTSIVKAVMWGRNVYDSISKFL QFQLTVNVVAVIVAFTGACITQDSPLKAVQMLWVNLIMDTLASLALATEPPTESLLLR KPYGRNKPLISRTMMKNILGHAFYQLVVVFTLLFAGEKFFDIDSGRNAPLHAPPSEHY TIVFNTFVLMQLFNEINARKIHGERNVFEGIFNNAIFCTIVLGTFVVQIIIVQFGGKP FSCSELSIEQWLWSIFLGMGTLLWGQLISTIPTSRLKFLKEAGHGTQKEEIPEEELAE DVEEIDHAERELRRGQILWFRGLNRIQTQIRVVNAFRSSLYEGLEKPESRSSIHNFMT HPEFRIEDSEPHIPLIDDTDAEDDAPTKRNSSPPPSPNKNNNAVDSGIHLTIEMNKSA TSSSPGSPLHSLETSL" BASE COUNT 1289 a 686 c 888 g 1123 t ORIGIN 1 ggccaaaggt caagatactt ctctgggaaa tgttgctgct gatgctgctt tacaaagtca 61 tacaatgagt gtttggttta agaaagattt tcatacttaa aagattttca tcttggaaat 121 acatcaagtg aaaattaaat tcttttggga aacattttcc ttctgatata ttatacttgt 181 aatgggcgac atggcaaaca actcagttgc ttacagtggt gtgaaaaact ctttgaagga 241 agctaatcat gatggagact ttggaattac gctcgcagag ctgcgggctc tcatggagct 301 caggtccaca gatgcattac gaaaaataca ggaaagctat ggagatgtct atggaatttg 361 caccaaattg aaaacatctc ccaatgaagg tttaagtgga aaccctgcag atttagaaag 421 aagagaagca gtgtttggaa agaattttat acctcctaaa aagccaaaaa cctttcttca 481 attagtatgg gaagcattac aagatgtcac tttaattata ttagaaattg cagccatagt 541 atcattgggc ctttcttttt atcagcctcc agaaggggat aatgcacttt gtggagaagt 601 ttctgttggg gaggaagaag gtgaaggtga aactggttgg attgaaggag ctgcaatcct 661 cttgtctgta gtgtgtgtgg tgttagtaac agctttcaat gactggagta aggaaaaaca 721 gtttagaggt ttgcagagcc gaattgaaca agaacagaag ttcactgtca tcaggggtgg 781 tcaggtcatt cagatacctg tagctgacat tactgttgga gatattgctc aagtgaaata 841 tggtgatctt cttccagctg acggcatact tattcaaggc aacgatctta aaattgatga 901 aagctcattg actggtgaat cagatcatgt taaaaagtct ttagataagg atcccttact 961 tctatcaggt actcatgtaa tggaaggctc tggaagaatg gtagttacag ctgtaggtgt 1021 aaattctcaa actggaatta tctttacctt acttggagct ggaggtgaag aggaagagaa 1081 gaaagatgag aagaaaaagg aaaagaaaaa taagaaacaa gatggagcta ttgagaatcg 1141 caacaaagca aaagcccagg atggtgcagc catggaaatg cagccattga agagtgaaga 1201 aggtggagat ggtgatgaaa aagataaaaa gaaagcaaat ttgccaaaaa aggaaaaatc 1261 tgttttacaa gggaaactta caaaactggc tgttcagatt ggcaaagcag gtctgttgat 1321 gtctgccatc acagttatca ttctagtatt atattttgtc attgacacct tctgggttca 1381 gaaaagacca tggcttgctg agtgcacacc aatttatata caatactttg tgaagttctt 1441 cattattgga gttacagttt tagtggtcgc agtgccagaa ggtcttccac ttgcagtcac 1501 gatctcactg gcttattcag tcaaaaaaat gatgaaagat aataacttag taaggcatct 1561 ggatgcttgt gaaaccatgg gaaatgctac agctatttgt tcagataaaa caggaacttt 1621 gacaatgaac agaatgacag tcgttcaagc ttacataaat gaaaaacatt ataaaaaggt 1681 tcctgaacca gaagctattc caccaaatat tttgtcctat cttgtaacag gaatttctgt 1741 gaattgtgct tatacatcaa aaatattgcc accagagaaa gagggtggat tacctcgtca 1801 cgttggtaat aaaactgaat gtgccttgtt gggacttctt ttggatttaa aacgggatta 1861 tcaggatgtt agaaatgaaa taccagaaga agcactgtac aaagtctaca ccttcaattc 1921 tgttaggaag tccatgagta ctgtcctgaa aaattcagat ggaagttatc gaatattcag 1981 caagggtgca tctgagataa ttctgaaaaa gtgtttcaaa atcttgagtg ctaatggtga 2041 ggcaaaagta ttcagaccaa gggaccgtga tgatattgta aaaactgtga ttgaaccgat 2101 ggcatcagaa ggcttgagaa ccatatgtct tgcattcaga gattttccag caggagaacc 2161 agaaccagag tgggataatg aaaatgatat tgtcaccggc cttacatgca ttgctgttgt 2221 ggggattgaa gatcctgtga gacctgaggt gccagatgca attaaaaagt gtcagagggc 2281 tggaattact gtgcggatgg tcactggtga taatattaat actgctcggg ccattgctac 2341 caaatgtggt attttacatc ctggggaaga ttttctgtgc ctagaaggta aagattttaa 2401 cagaagaata cgaaatgaaa aaggagagat tgagcaagag aggatagaca agatttggcc 2461 aaaacttcga gtacttgcaa gatcatctcc tactgataag catacactgg ttaaaggtat 2521 aattgacagc actgtctcag accaacgcca ggttgtagct gtaactggtg atggtacaaa 2581 tgatggccca gcactaaaga aagcagatgt tggatttgca atgggtattg ctggaactga 2641 tgtagctaaa gaagcatccg atattattct cacagatgac aactttacaa gcattgttaa 2701 agcagttatg tggggacgaa atgtctatga cagcatctca aaattccttc agttccaact 2761 tactgttaat gtagtagcag tgattgttgc ttttacgggc gcctgcatta ctcaagactc 2821 accgcttaag gctgtgcaga tgctgtgggt aaacctcata atggatacac tcgcttccct 2881 ggctctggca acggaaccac ccactgagtc tctcttgctt cggaaacctt atggtagaaa 2941 taagcctctc atctcacgta caatgatgaa gaatattttg ggtcatgcat tctatcaact 3001 tgtagtagtc tttacactct tatttgctgg agaaaagttt tttgacattg atagtggaag 3061 aaatgctcct ttgcatgctc ctccttcaga acattatact attgttttta atacctttgt 3121 gctgatgcaa cttttcaacg aaataaatgc ccggaaaatt catggtgaaa gaaatgtatt 3181 cgaaggaatc tttaacaatg ccatcttctg cacaattgtt ttaggcactt ttgtggtaca 3241 gataataatt gtgcagtttg gtggaaaacc tttcagttgt tcagaacttt caatagaaca 3301 gtggctatgg tcaatattcc taggaatggg aacattactc tggggccagc ttatttcaac 3361 aattccaact agccgtttaa aattcctcaa agaagctggt catggaacac aaaaggaaga 3421 aatacctgag gaggaattag cagaggatgt tgaagagatt gatcacgctg aaagggagtt 3481 gcggcgtggc caaatcttgt ggtttagagg tctgaacaga atccaaacac agattcgagt 3541 ggtgaatgca tttcgtagtt ctttatatga agggttagaa aaaccggaat caagaagttc 3601 gattcacaac tttatgacac atcctgagtt taggatagaa gattcagagc ctcatatccc 3661 ccttattgat gacactgatg ccgaagatga tgctcctaca aaacgtaact ccagtcctcc 3721 accctctccc aacaaaaata acaatgctgt tgacagtgga attcacctta caatagaaat 3781 gaacaagtct gctacctctt catccccagg aagcccacta catagtttgg aaacatcact 3841 ctgattgtaa gctgaatgtt aacacactag ctgcattgta aagaaacaaa ttgaaactgg 3901 gtcttttcac atattgtgat ggacaagcta gtattcttgt ctttggactt caacagaaga 3961 cacacttgta cgaatgtaga tttatt // LOCUS HUMHO2SOS1 977 bp mRNA PRI 10-JUN-1995 DEFINITION Human mRNA for heme oxygenase-2, complete cds. ACCESSION D21243 NID g416226 KEYWORDS heme oxygenase-2. SOURCE Homo sapiens placenta cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 977) AUTHORS Ishikawa,K., Takeuchi,N., Takahashi,S., Matera,K.M., Sato,M., Shibahara,S., Rousseau,D.L., Ikeda-Saito,M. and Yoshida,T. TITLE Heme oxygenase-2. Properties of the heme complex of the purified tryptic fragment of recombinant human heme oxygenase-2 JOURNAL J. Biol. Chem. 270 (11), 6345-6350 (1995) MEDLINE 95197675 REFERENCE 2 (sites) AUTHORS Ishikawa,K. JOURNAL Unpublished (1995) REFERENCE 3 (bases 1 to 977) AUTHORS Ishikawa,K. TITLE Direct Submission JOURNAL Submitted (21-OCT-1993) to the DDBJ/EMBL/GenBank databases. Kazunobu Ishikawa, Yamagata University School of Medicine; 2-2-2 Iida-Nishi, Yamagata, Yamagata 990-23, Japan (E-mail:MF021@idw01.id.yamagata-u.ac.jp, Tel:0236-33-1122(ex.2123), Fax:0236-33-4020) COMMENT Submitted (21-Oct-1993) to DDBJ by: Kazunobu Ishikawa Yamagata University School of Medicine 2-2-2 Iida-nishi, Yamagata city Yamagata 990-23 Japan Phone: 0236-33-1122 x2123 Fax: 0236-33-4020. FEATURES Location/Qualifiers source 1..977 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" CDS 1..951 /EC_number="1.14.99.3" /codon_start=1 /product="heme oxygenase-2" /db_xref="PID:d1005322" /db_xref="PID:g443771" /translation="MSAEVETSEGVDESEKKNSGALEKENQMRMADLSELLKEGTKEA HDRAENTQFVKDFLKGNIKKELFKLATTALYFTYSALEEEMERNKDHPAFAPLYFPME LHRKEALTKDMEYFFGENWEEQVQCPKAAQKYVERIHYIGQNEPELLVAHAYTRYMGD LSGGQVLKKVAQRALKLPSTGEGTQFYLFENVDNAQQFKQLYRARMNALDLNMKTKER IVEEANKAFEYNMQIFNELDQAGSTLARETLEDGFPVHDGKGDMRKCPFYAAEQDKGA LEGSSCPFRTAMAVLRKPSLQFILAAGVALAAGLLAWYYM" BASE COUNT 264 a 254 c 289 g 170 t ORIGIN 1 atgtcagcgg aagtggaaac ctcagagggg gtagacgagt cagaaaaaaa gaactctggg 61 gccctagaaa aggagaacca aatgagaatg gctgacctct cggagctcct gaaggaaggg 121 accaaggaag cacacgaccg ggcagaaaac acccagtttg tcaaggactt cttgaaaggc 181 aacattaaga aggagctgtt taagctggcc accacggcac tttacttcac atactcagcc 241 ctcgaggagg aaatggagcg caacaaggac catccagcct ttgccccttt gtacttcccc 301 atggagctgc accggaagga ggcgctgacc aaggacatgg agtatttctt tggtgaaaac 361 tgggaggagc aggtgcagtg ccccaaggct gcccagaagt acgtggagcg gatccactac 421 atagggcaga acgagccgga gctactggtg gcccatgcat acacccgcta catgggggat 481 ctctcggggg gccaggtgct gaagaaggtg gcccagcgag cactgaaact ccccagcaca 541 ggggaaggga cccagttcta cctgtttgag aatgtggaca atgcccagca gttcaagcag 601 ctctaccggg ccaggatgaa cgccctggac ctgaacatga agaccaaaga gaggatcgtg 661 gaggaggcca acaaggcttt tgagtataac atgcagatat tcaatgaact ggaccaggcc 721 ggctccacac tggccagaga gaccttggag gatgggttcc ctgtacacga tgggaaagga 781 gacatgcgta aatgcccttt ctacgctgct gaacaagaca aaggtgccct ggagggcagc 841 agctgtccct tccgaacagc tatggctgtg ctgaggaagc ccagcctcca gttcatcctg 901 gccgctggtg tggccctagc tgctggactc ttggcctggt actacatgtg aagcacccat 961 catgccacac cggtacc // LOCUS HUMHOX 1872 bp mRNA PRI 31-DEC-1994 DEFINITION H.sapiens homeobox protein (HOX-11) mRNA, complete cds. ACCESSION M75952 NID g184286 KEYWORDS T-cell oncogene; homeobox protein. SOURCE Homo sapiens (tissue library: lambda gt10) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1872) AUTHORS Kennedy,M.A., Gonzalez-Sarmiento,R., Kees,U.R., Lampert,F., Dear,N., Boehm,T. and Rabbitts,T.H. TITLE HOX11, a homeobox-containing T-cell oncogene on human chromosome 10q24 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88 (20), 8900-8904 (1991) MEDLINE 92020958 FEATURES Location/Qualifiers source 1..1872 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="PER-225" /cell_type="T-cell" /germline /tissue_lib="lambda gt10" /map="10q24" gene 13..1005 /gene="HOX-11" CDS 13..1005 /gene="HOX-11" /codon_start=1 /product="homeobox protein" /db_xref="PID:g184287" /translation="MEHLGPHHLHPGHAEPISFGIDQILNSPDQGGCMGPASRLQDGE YGLGCLVGGAYTYGGGGSAAATGAGGAGAYGTGGPGGPGGPAGGGGACSMGPLTGSYN VNMALAGGPGPGGGGGSSGGAGALSAAGVIRVPAHRPLAGAVAHPQPLATGLPTVPSV PAMPGVNNLTGLTFPWMESNRRYTKDRFTGHPYQNRTPPKKKKPRTSFTRLQICELEK RFHRQKYLASAERAALAKALKMTDAQVKTWFQNRRTKWRRQTAEEREAERQQANRILL QLQQEAFQKSLAQPLPADPLCVHNSSLFALQNLQPWSDDSTKITSVTSVASACE" polyA_site 1872 BASE COUNT 398 a 613 c 535 g 326 t ORIGIN 10q24. 1 gccagggcca gcatggagca cctgggtccg caccacctcc acccgggtca cgcagagccc 61 attagcttcg gcatcgacca gatcctcaac agcccggacc agggtggctg catgggaccc 121 gcctcgcgcc tccaggacgg agaatacggc cttggctgct tggtcggagg cgcctacact 181 tacggcggcg ggggctccgc ggccgcgacg ggggctggag gagcgggggc ctatggtact 241 ggaggtcccg gcggccccgg aggcccggca ggcggcggcg gcgcctgcag catgggtcct 301 ctgaccggct cctacaacgt gaacatggcc ttggcaggcg gccccggtcc tggcggcggc 361 ggcggcagca gcggcggtgc cggggcactc agcgctgcgg gggtaatccg ggtgccggca 421 cacaggccgc tcgccggagc cgtggcccac ccccagcccc tggccaccgg cttgcccacc 481 gtgccctctg tgcctgccat gccgggcgtc aacaacctca ctggcctcac cttcccctgg 541 atggagagta accgcagata cacaaaggac aggttcacag gtcaccccta tcagaaccgg 601 acgcccccca agaagaagaa gccgcgcacg tccttcacac gcctgcagat ctgcgagctg 661 gagaagcgct tccaccgcca gaagtacctg gcctcggccg agcgcgccgc cctggccaag 721 gcgctcaaaa tgaccgatgc gcaggtcaaa acctggttcc agaaccggcg gacaaagtgg 781 agacggcaga ctgcggagga acgggaggcc gagaggcagc aagcgaaccg catcctcctg 841 cagttgcagc aggaggcctt ccagaagagc ctggcacagc cgctgcccgc tgaccctctg 901 tgcgtgcaca actcgtcgct cttcgccctg cagaatctgc agccgtggtc tgacgactcg 961 accaaaatca ctagcgtcac gtcggtggcg tcggcctgcg agtgagcctg cccattctgc 1021 cctgtgggac cccaggccca ctcaggggtc actgaggcct gagacccagg actcctcccc 1081 accctcctgg cctcagactg cacccaggag gggaacactg ccctcgcacg cccgaagggc 1141 ccccacattt gtgccgacac tgttctccct tcggtggaag agctcaaggg acaaggacac 1201 gcgcccccct cccagaggcg tcccgcacct gtctgaactg ttaagaaatc tgtttttgtt 1261 tatttcattt tattttaatt tttaacgtgg gattcagaga aaggcaaggg aggtaaggga 1321 ggaggagctt ctggggtccc cagggctgtc atctgaattt gccctgggaa accccttctc 1381 tgtgacccat ttctcatcac acacatggaa acccataggc ccacacacag gtggtgtcac 1441 tgtccctcct ggtgtcaccc cagagccaca catgggcatc tatgggagag tgtcaaccag 1501 acagagggtc acagtgttta cactttggac cttacgatca ggctcaggtc aggggtgaca 1561 cagactcatc ctgaacagca tggcactccc tccagcacaa acacaaggtc atggccacac 1621 tgtgacacac tacaccacac acagcagcca acagctacaa cagcctcact tggtctgcca 1681 ggcccccacc acacatccca gcccaatcca ggtacgcaca gacaggtttt cacataaatg 1741 cagcccattt ctccagaacc catttgaggg gtgggggggt gttaatttat gcacttataa 1801 ggtgttttct gtgtaaccat tttataaagt gcttgtgtaa tttatgtgaa aaaaataaat 1861 aaaagcctcc gg // LOCUS HUMHOX7 1713 bp mRNA PRI 08-NOV-1994 DEFINITION Homo sapiens (region 7) homeobox protein (HOX7) mRNA, complete cds. ACCESSION M97676 NID g184294 KEYWORDS homeobox protein. SOURCE Homo sapiens (tissue library: lambda ZAP II; craniofacial) day 42-54 ectomesenchyme cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1713) AUTHORS Padanilam,B.J., Stadler,H.S., Mills,K.A., McLeod,L.B., Solursh,M., Lee,B., Ramirez,F., Buetow,K.H. and Murray,J.C. TITLE Characterization of the human HOX 7 cDNA and identification of polymorphic markers JOURNAL Hum. Mol. Genet. 1 (6), 407-410 (1992) MEDLINE 93250782 FEATURES Location/Qualifiers source 1..1713 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="day 42-54" /tissue_type="ectomesenchyme" /tissue_lib="lambda ZAP II; craniofacial" /map="4p16.3-p16.1" 5'UTR 1..240 /gene="HOX7" /note="G00-120-683" gene 1..1713 /gene="HOX7" CDS 241..1134 /gene="HOX7" /codon_start=1 /db_xref="GDB:G00-120-683" /product="homeobox protein" /db_xref="PID:g184295" /translation="MTSLPLGVKVEDSAFGKPAGGGAGQAPSAAAATAAAMGADEEGA KPKVSPSLLPFSVEALMADHRKPGAKESALAPSEGVQAAGGSAQPLGVPPGSLGAPDA PSSPRPLGHFSVGGLLKLPEDALVKAESPEKPERTPWMQSPRFSPPPARRLSPPACTL RKHKTNRKPRTPFTTAQLLALERKFRQKQYLSIAERAEFSSSLSLTETQVKIWFQNRR AKAKRLQEAELEKLKMAAKPMLPPAAFGLSFPLGGPAAVAAAAGASLYGASGPFQRAA LPVAPVGLYTAHVGYSMYHLT" 3'UTR 1135..1713 /gene="HOX7" /note="G00-120-683" BASE COUNT 350 a 568 c 531 g 264 t ORIGIN 1 gcgcgagtgc tcccgggaac tctgcctgcg cggcggcagc gaccggaggc caggcccagc 61 acgccggagc tggcctgctg gggaggggcg ggaggcgcgc gcgggagggt ccgcccggcc 121 aggccccggg ccctcgcaga ggccggccgc gctcccagcc cgcccggagc ccatgcccgg 181 cggctggcca gtgctgcggc agaagggggg gcccggctct gcatggcccc ggctgctgac 241 atgacttctt tgccactcgg tgtcaaagtg gaggactccg ccttcggcaa gccggcgggg 301 ggaggcgcgg gccaggcccc cagcgccgcc gcggccacgg cagccgccat gggcgcggac 361 gaggaggggg ccaagcccaa agtgtcccct tcgctcctgc ccttcagcgt ggaggcgctc 421 atggccgacc acaggaagcc gggggccaag gagagcgccc tggcgccctc cgagggcgtg 481 caggcggcgg gtggctcggc gcagccactg ggcgtcccgc cggggtcgct gggagccccg 541 gacgcgccct cttcgccgcg gccgctcggc catttctcgg tggggggact cctcaagctg 601 ccagaagatg cgctcgtcaa agccgagagc cccgagaagc ccgagaggac cccgtggatg 661 cagagccccc gcttctcccc gccgccggcc aggcggctga gccccccagc ctgcaccctc 721 cgcaaacaca agacgaaccg taagccgcgg acgcccttca ccaccgcgca gctgctggcg 781 ctggagcgca agttccgcca gaagcagtac ctgtccatcg ccgagcgcgc ggagttctcc 841 agctcgctca gcctcactga gacgcaggtg aagatatggt tccagaaccg ccgcgccaag 901 gcaaagagac tacaagaggc agagctggag aagctgaaga tggccgccaa gcccatgctg 961 ccaccggctg ccttcggcct ctccttccct ctcggcggcc ccgcagctgt agcggccgcg 1021 gcgggtgcct cgctctacgg tgcctctggc cccttccagc gcgccgcgct gcctgtggcg 1081 cccgtgggac tctacacggc ccatgtgggc tacagcatgt accacctgac atagagggtc 1141 ccaggtcccc acctgtgggc cagccgattc ctccagccct ggtgctgtac ccccgacgtg 1201 ctcccctgct cggcaccgcc agccgccttc cctttaaccc tcacactgct ccagtttcac 1261 ctctttgctc cctgagttca ctctccgaag tctgatccct gccaaaaagt ggctggaaga 1321 gtcccttagt actcttctag catttagatc tacactctcg agttaaagat ggggaaactg 1381 agggcagaga ggttaacaga tttatctagg gtccccagca gaattgacag ttgaacagag 1441 ctagaggcca tgtctcctgc atagcttttc cctgtcctga caccaggcaa gaaaagcgca 1501 gagaaatcgg tgtctgacga ttttggaaat gagaacaatc tcaaaaaaaa aaaaaaaaaa 1561 aaaaaaaaaa gaaaagagaa aaaaaagact agccagccag gaagatgaat cctagcttct 1621 tccattggaa aatttaagac aagttcaaca acaaaacatt tgctctgggg ggcagggaaa 1681 acacagatgt gttgcaaagg taggttgaag gga // LOCUS HUMHOXY1 6188 bp mRNA PRI 05-JUL-1996 DEFINITION Human kidney mRNA for zinc-finger DNA-binding protein, complete cds. ACCESSION D45132 NID g1405347 KEYWORDS heme-oxygenase-1; zinc-finger DNA-binding protein; MTB-Zf. SOURCE Homo sapiens kidney cell_line:THP-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6188) AUTHORS Muraosa,Y., Takahashi,K., Yoshizawa,M. and Shibahara,S. TITLE cDNA cloning of a novel protein containing two zinc-finger domains that may function as a transcription factor for the human heme-oxygenase-1 gene JOURNAL Eur. J. Biochem. 235 (3), 471-479 (1996) MEDLINE 96184519 REFERENCE 2 (bases 1 to 6188) AUTHORS Shibahara,S. TITLE Direct Submission JOURNAL Submitted (11-JAN-1995) to the DDBJ/EMBL/GenBank databases. Shigeki Shibahara, Tohoku University School of Medicine, Dept. of Applied Physiol. and Mol. Biol.; 2-1 Seiryomachi, Aoba-ku, Sendai, Miyagi 980, Japan (Tel:022-717-8117, Fax:022-717-8118) COMMENT Sequence updated (30-May-96) by : Shigeki Shibahara. FEATURES Location/Qualifiers source 1..6188 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="THP-1" /tissue_type="kidney" CDS 178..4626 /codon_start=1 /product="zinc-finger DNA-binding protein" /db_xref="PID:d1008701" /db_xref="PID:g1405348" /translation="MRDSAEGPKEDEEKPSASALEQPATLQEVASQEVPPELATPAPA WEPQPEPDERLEAAACEVNDLGEEEEEEEEEDEEEEEDDDDDELEDEGEEEASMPNEN SVKEPEIRCDEKPEDLLEEPKTTSEETLEDCSEVTPAMQIPRTKEEANGDVFETFMFP CQHCERKFTTKQGLERHMHIHISTVNHAFKCKYCGKAFGTQINRRRHERRHEAGLKRK PSQTLQPSEDLADGKASGENVASKDDSSPPSLGPDCLIMNSEKASQDTINSSVVEENG EVKELHPCKYCKKVFGTHTNMRRHQRRVHERHLIPKGVRRKGGLEEPQPPAEQAQATQ NVYVPSTEPEEEGEADDVYIMDISSNISENLNYYIDGKIQTNNNTSNCDVIEMESASA DLYGINCLLTPVTVEITQNIKTTQVPVTEDLPKEPLGSTNSEAKKRRTASPPALPKIK AETDSDPMVPSCSLSLPLSISTTEAVSFHKEKSVYLSSKLKQLLQTQDKLTPPAGISA TEIAKLGPVCVSAPASMLPVTSSRFKRRTSSPPSSPQHSPALRDFGKPSDGKAAWTDA GLTSKKSKLESHSDSPAWSLSGRDERETVSPPCFDEYKMSKEWTASSAFSSVCNQQPL DLSSGVKQKAEGTGKTPVQWESVLDLSVHKKHCSDSEGKEFKESHSVQPTCSAVKKRK PTTCMLQKVLLNEYNGIDLPVENPADGTRSPSPCKSLEAQPDPDLGPGSGFPAPTVES TPDVCPSSPALQTPSLSSGQLPPLLIPTDPSSPPPCPPVLTVATPPPPLLPTVPLPAP SSSASPHPCPSPLSNATAQSPLPILSPTVSPSPSPIPPVEPLMSAASPGPPTLSSSSS SSSSSSSFSSSSSSSSPSPPPLSAISSVVSSGDNLEASLPMISFKQEELENEGLKPRE EPQSAAEQDVVVQETFNKNFVCNVCESPFLSIKDLTKHLSIHAEEWPFKCEFCVQLFK DKTDLSEHRFLLHGVGNIFVCSVCKKEFAFLCNLQQHQRDLHPDKVCTHHEFESGTLR PQNFTDPSKAHVEHMQSLPEDPLETSKEEEELNDSSEELYTTIKIMASGIKTKDPDVR LGLNQHYPSFKPPPFQYHHRNPMGIGVTATNFTTHNIPQTFTTAIRCTKCGKGVDNMP ELHKHILACASASDKKRYTPKKNPVPLKQTVQPKNGVVVLDNSGKNAFRRMGQPKRLN FSVELSKMSSNKLKLNALKKKNQLVQKAILQKNKSAKQKADLKNACESSSHICPYCNR EFTYIGSLNKHAAFSCPKKPLSPPKKKVSHSSKKGGHSSPASSDKNSNSNHRRRTADA EIKMQSMQTPLGKTRARSSGPTQVPLPSSSFRSKQNVKFAASVKSKKPSSSSLRNSSP IRMAKITHVEGKKPKAVAKNHSAQLSSKTSRSLHVRVQKSKAVLQSKSTLASKKRTDR FNIKSRERSGGPVTRSLQLAAAADLSENKREDGSAKQELKDFRNFL" BASE COUNT 1873 a 1427 c 1314 g 1574 t ORIGIN 1 gaagacaacc ctgagatagc agctgcgatt gaggaagagc gagccagcgc ccggagcaag 61 cggagctccc ccaagagccg gaaagggaag aaaaaatccc aggaaaataa aaacaaagga 121 aacaaaatcc aagacataca actgaagaca agtgagccag atttcacctc tgcaaatatg 181 agagattctg cagaaggtcc taaagaagac gaagagaagc cttcagcctc agcacttgag 241 cagccggcca ccctccagga ggtggccagt caggaggtgc ctccagaact agcaacccct 301 gcccctgcct gggagccaca gccagaacca gacgagcgat tagaagcggc agcttgtgag 361 gtgaatgatt tgggggaaga ggaggaggag gaagaggagg aggatgaaga agaagaagaa 421 gatgatgatg atgatgagtt ggaagacgag ggggaagaag aagccagcat gccaaatgaa 481 aattctgtga aagagccaga aatacggtgt gatgagaagc cagaagattt attagaggaa 541 ccaaaaacaa cttcagaaga aactcttgaa gactgctcag aagtaacacc tgccatgcaa 601 atccccagaa ctaaagaaga ggccaatggt gatgtatttg aaacgtttat gtttccgtgt 661 caacattgtg aaaggaagtt tacaaccaaa caggggcttg agcgtcacat gcatatccat 721 atatccaccg tcaatcatgc tttcaaatgc aagtactgtg ggaaagcctt tggcacacag 781 attaaccggc ggcgacatga gcggcgccat gaagcagggt taaagcggaa acccagccaa 841 acactacagc cgtcagagga tctggctgat ggcaaagcat ctggagaaaa cgttgcttca 901 aaagatgatt cgagtcctcc cagtcttggg ccagactgtc tgatcatgaa ttcagagaag 961 gcttcccaag acacaataaa ttcttctgtc gtagaagaga atggggaagt taaagaactt 1021 catccgtgca aatattgtaa aaaggttttt ggaactcata ctaatatgag acggcatcag 1081 cgtagagttc acgaacgtca tctgattccc aaaggtgtac ggcgaaaagg aggccttgaa 1141 gagccccagc ctccagcaga acaggcccag gccacccaga acgtgtatgt accaagcaca 1201 gagccggagg aggaagggga agcagatgat gtgtacatca tggacatttc tagcaatata 1261 tctgaaaact taaattacta tattgatggt aaaattcaaa ctaataacaa cactagtaac 1321 tgtgatgtga ttgagatgga gtctgcttcg gcagatttgt atggtataaa ttgtctgctc 1381 actccagtta cagtggaaat tactcaaaat ataaagacca cacaggtccc tgtaacagaa 1441 gatcttccta aagagccttt gggcagcaca aatagtgagg ccaagaagcg gagaactgcg 1501 agcccacctg cactgcccaa aattaaggcc gaaacagact ctgaccccat ggtcccctct 1561 tgctctttaa gtcttcctct tagcatatca acaacagagg cagtgtcttt ccacaaagag 1621 aaaagtgttt atttgtcatc aaagctcaaa caacttcttc aaacccaaga taaactaact 1681 cctcctgcag ggatttcagc aactgaaata gctaaattag gtcctgtttg tgtgtctgct 1741 cctgcatcaa tgttgcctgt gacctcaagt aggtttaaga ggcggaccag ctctcctccc 1801 agttctccac agcacagtcc tgcccttcga gactttggaa agccaagtga tgggaaagca 1861 gcatggaccg atgccgggct gacttccaaa aaatccaaat tagaaagtca cagcgactca 1921 ccagcatgga gtttgtctgg gagagatgag agagaaactg tgagccctcc atgctttgat 1981 gaatataaaa tgtctaaaga gtggacagct agttctgctt ttagcagtgt gtgcaaccag 2041 cagccactgg atttatccag cggtgtcaaa cagaaggctg agggtacagg caagactcca 2101 gtccagtggg aatctgtctt agatctcagt gtgcataaaa agcattgtag tgactctgaa 2161 ggcaaggaat tcaaagaaag tcattcagtg cagcctacgt gtagtgctgt aaagaaaagg 2221 aaaccaacca cctgcatgct gcagaaggtt cttctcaatg aatataatgg catcgattta 2281 cctgtagaaa accctgcaga tgggaccagg agcccaagtc cttgtaaatc cctagaagct 2341 cagccagatc ctgacctcgg tccgggctct ggtttccctg cccctactgt tgagtccaca 2401 cctgatgttt gtccttcatc acctgccctg cagacaccct ccctttcatc cggtcagctg 2461 cctcctctct tgatccccac agatccctct tcccctccac cctgtccccc ggtattaact 2521 gttgccactc cgccccctcc cctccttcct accgtacctc ttccagcccc ctcttccagt 2581 gcatctccac acccatgccc ctctccactc tcaaatgcca ccgcacagtc cccacttcca 2641 attctgtccc caacagtgtc cccctctccc tctcccattc ctcccgtgga gcccctgatg 2701 tctgccgcct cacccgggcc tccaacactt tcttcttcct cctcttcatc ttcctcctcc 2761 tcttcgtttt cttcttcatc ttcctcctct tctccttctc cacctcctct ctccgcaata 2821 tcatctgttg tttcctctgg tgataatctg gaggcttctc tccccatgat atctttcaaa 2881 caggaggaat tagagaatga aggtctgaaa cccagggaag agccccagtc tgctgctgaa 2941 caggatgttg ttgttcagga aacattcaac aaaaactttg tttgcaacgt ctgtgaatca 3001 ccttttcttt ccattaaaga tctaaccaaa catttatcta ttcatgctga agaatggccc 3061 ttcaaatgtg aattttgtgt gcagcttttt aaggataaaa cggacttgtc agaacatcgc 3121 tttttgcttc atggagttgg gaatatcttt gtgtgttctg tttgtaaaaa agaatttgct 3181 tttttgtgca atttgcagca gcaccagcga gatctccacc cagataaggt gtgcacacat 3241 cacgagtttg aaagcgggac tctgaggccc cagaacttta cagatcccag caaggcccat 3301 gtagagcata tgcagagctt gccagaagat cctttagaaa cttctaaaga agaagaggag 3361 ttaaatgatt cctctgaaga gctttacacg actataaaaa taatggcttc tggaataaag 3421 acaaaagatc cagatgttcg attgggcctc aatcagcatt acccaagctt taaaccacct 3481 ccatttcagt accatcaccg taaccccatg gggattggtg tgacagccac aaatttcact 3541 acacacaata ttccacagac tttcactacc gccattcgct gcacaaagtg tggaaaaggt 3601 gtcgacaata tgccggagtt gcacaaacat atcctggctt gtgcttctgc aagtgacaag 3661 aagaggtaca cgcctaagaa aaacccagta ccattaaaac aaactgtgca acccaaaaat 3721 ggcgtggtgg ttttagataa ctctgggaaa aatgccttcc gacgaatggg acagcccaaa 3781 aggcttaact ttagtgttga gctcagcaaa atgtcgtcga ataagctcaa attaaatgca 3841 ttgaagaaaa aaaatcagct agtacagaaa gcaattcttc agaaaaacaa atctgcaaag 3901 cagaaggccg acttgaaaaa tgcttgtgag tcatcctctc acatctgccc ttactgtaat 3961 cgagagttca cttacattgg aagcctgaat aaacacgccg ccttcagctg tcccaaaaaa 4021 cccctttctc ctcccaaaaa aaaagtttct cattcatcta agaaaggtgg acactcatca 4081 cctgcaagta gtgacaaaaa cagtaacagc aaccaccgca gacggacagc ggatgcggag 4141 attaaaatgc aaagcatgca gactccgttg ggcaagacca gagcccgcag ctcaggcccc 4201 acccaagtcc cacttccctc ctcatccttc aggtccaagc agaacgtcaa gtttgcagct 4261 tcggtgaaat ccaaaaaacc aagctcctcc tctttaagga actccagccc gataagaatg 4321 gccaaaataa ctcatgttga ggggaaaaaa cctaaagctg tggccaagaa tcattctgct 4381 cagctttcca gcaaaacatc acggagcctg cacgtgaggg tacagaaaag caaagctgtt 4441 ttacaaagca aatccacctt ggcgagtaag aaaagaacag accggttcaa tataaaatct 4501 agagagcgga gtggggggcc agtcacccgg agccttcagc tggcagctgc tgctgacttg 4561 agtgagaaca agagagagga cggcagcgcc aagcaggagc tgaaggactt caggaacttc 4621 ctgtagaaaa gcccccaaaa caaaacaaac cttaattgac taaaaagtat tgcatgctca 4681 acttaggata agcactacgg caaaggatac gaaatctacc aagcttgcaa gaccagttga 4741 agctgactca aaaatcctaa cattcagctg attgccggca gcttagagtc aggcatctgc 4801 tgcttcggtg ggggcccaac gcgcatgctg ggcgcccggg tgattgagat ccaaagagaa 4861 gggcactgta agacaggcca gatgaactgg ctcctcgtca tgggactggt acctcagatc 4921 tgagcatggc ccttgttttt ggcacgtagc agagaaagga ttgatttgaa cttaaccttg 4981 caaagcaagt tgcctgtttt agcagtagtt tgttgtaggt ttcagggatg acagatttgg 5041 atgcactcat ttaaaacgtt ttggtcacat atcagctctt gatgcctttc ttttaaatta 5101 attatagaca gagagaggca tttagctgat ctcttacccc tggtatttct tttttttgtt 5161 gttttcgttt ttttaaatca caagtagatt gccagcgtaa tggagaggaa accagattgg 5221 atatggactc ttttttatgc ctattctggt gttgcgtttg tatatccaaa tggacgttat 5281 cctctcagat tcttatctgg cactaattta taactattat attatcagag actatgtagc 5341 aatatatcag tgcacaggcg catcccaggc ctgtacagat gtatgtctac acgtaagtat 5401 aaatgaattt gcataccagg ttttacactt gcatctctaa tagagattaa aaacaacaaa 5461 ttggcctctt cctaagtata ttaatatcat ttatccttac attttatgcc tccccctaaa 5521 ttaatgactg agttggtgga aagcggctag gttttattca tactgttttt tgttctcaac 5581 ttcaaaagta atctacctct gaaaaatttg tagtttaata tttgtttggt gaatttgtgc 5641 cactttaatc cttccactat cattcccatt ttgttacatt tctgttatgg ggactttatg 5701 ttgaaatatt gtataaagca tttgtagcaa tttaaaaata aaatatttaa aattatttaa 5761 attgttttgg acgcttcaat tgtattatat gtgatttaca tttcactttt tttgttggcg 5821 ttgttaaccc ggagagtgct cctgtattga actttgctgt tagttatttt attgcttctt 5881 tttggagagt gctataaaag actattctaa tgaaaacatt aaaatttaca atttgacata 5941 caaaaagggg ttgtccattg attttaacca atgtagcact gagagagaga gaggttaatt 6001 atagatagac aagagtggtg tttgttgttt ttcccctccc agcattgaaa tcattggggc 6061 ttgtcagagg tattaaaaaa agatttgttg tgctattgct gcaaacactt aataactaga 6121 ggagaattta aacaatgcat tttatattat tgtaaccaat aaaaaacttt ctaaaaaaaa 6181 aaaaaaaa // LOCUS HUMHP1HOM 839 bp mRNA PRI 07-DEC-1992 DEFINITION Human heterochromatin protein homologue (HP1) mRNA, complete cds. ACCESSION L07515 NID g184310 KEYWORDS heterochromatin protein; homologue. SOURCE Homo sapiens (library: lambda gt11) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 839) AUTHORS Saunders,W.S., Chue,C., Goebl,M.G., Craig,C., Clark,R.F., Powers,J.A., Eissenberg,J.C., Elgin,S.C.R., Rothfield,N.F. and Earnshaw,W.C. TITLE Molecular cloning of a human homologue of Drosophila heterochromatin protein HP1 using anticentromere autoantibodies with anti-chromo specificity JOURNAL J. Cell Sci. (1993) In press FEATURES Location/Qualifiers source 1..839 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="fibroblast" /tissue_lib="lambda gt11" CDS 136..711 /codon_start=1 /db_xref="PID:g184311" /translation="MGKKTKRTADSSSSEDEEEYVVEKVLDRRVVKGQVEYLLKWKGF SEEHNTWEPEKNLDCPELISEFMKKYKKMKEGENNKPREKSESNKRKSNFSNSADDIK SKKKREQSNDIARGFERGLEPEKIIGATDSCGDLMFLMKWKDTDEADLVLAKEANVKC PQIVIAFYEERLTWHAYPEDAENKEKETAKS" BASE COUNT 271 a 157 c 222 g 189 t ORIGIN 1 gcgcagaagg cggcggcggt ggtggcttgt ggtgcggcct caccatacag gaacagggca 61 gacgttagcg tgagtgatca ctctcaatcc cggggacctg gtggccttag tctttcaggt 121 ggaacggtgt gcgacatggg aaagaaaacc aagcggacag ctgacagttc ttcttcagag 181 gatgaggagg agtatgttgt ggagaaggtg ctagacaggc gcgtggttaa gggacaagtg 241 gaatatctac tgaagtggaa aggcttttct gaggagcaca atacttggga acctgagaaa 301 aacttggatt gccctgagct aatttctgaa tttatgaaaa agtataagaa gatgaaggag 361 ggtgaaaata ataaacccag ggagaagtca gaaagtaaca agaggaaatc caatttctca 421 aacagtgccg atgacatcaa atctaaaaaa aagagagagc agagcaatga tatcgctcgg 481 ggctttgaga gaggactgga accagaaaag atcattgggg caacagattc ctgtggtgat 541 ttaatgttcc taatgaaatg gaaagacaca gatgaagctg acctggttct tgcaaaagaa 601 gctaatgtga aatgtccaca aattgtgata gcattttatg aagagagact gacatggcat 661 gcatatcctg aggatgcgga aaacaaagag aaagaaacag caaagagcta aaggagggga 721 tggtctctgt catttctctt tgtacataat acatttacct ccctgcctcc tctcctttct 781 acccacccct ttctatccta aacacatcca aaaaaaatgt gcttatcact gtgctccac // LOCUS HUMHPGI 1685 bp mRNA PRI 11-JUN-1993 DEFINITION Human hPGI mRNA encoding bone small proteoglycan I (biglycan), complete cds. ACCESSION J04599 NID g184339 KEYWORDS proteoglycan I. SOURCE Human bone cells (primary culture), cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1685) AUTHORS Fisher,L.W., Termine,J.D. and Young,M.F. TITLE Deduced protein sequence of bone small proteoglycan I (biglycan) shows homology with proteoglycan II (decorin) and several nonconnective tissue proteins in a variety of species JOURNAL J. Biol. Chem. 264, 4571-4576 (1989) MEDLINE 89174714 COMMENT Draft entry and printed copy of sequence [1] kindly provided by L.W.Fisher, 03-JAN-1989. FEATURES Location/Qualifiers source 1..1685 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 121..177 /note="proteoglycan I signal peptide (put.); putative" CDS 121..1227 /note="proteoglycan I precursor" /codon_start=1 /db_xref="PID:g306884" /translation="MWPLWRLVSLLALSQALPFEQRGFWDFTLDDGPFMMNDEEASGA DTSGVLDPDSVTPTYSAMCPFGCHCHLRVVQCSDLGLKSVPKEISPDTTLLDLQNNDI SELRKDDFKGLQHLYALVLVNNKISKIHEKAFSPLRKLQKLYISKNHLVEIPPNLPSS LVELRIHDNRIRKVPKGVFSGLRNMNCIEMGGNPLENSGFEPGAFDGLKLNYLRISEA KLTGIPKDLPETLNELHLDHNKIQAIELEDLLRYSKLYRLGLGHNQIRMIENGSLSFL PTLRELHLDNNKLARVPSGLPDLKLLQVVYLHSNNITKVGVNDFCPMGFGVKRAYYNG ISLFNNPVPYWEVQPATFRCVTDRLAIQFGNYKK" mat_peptide 232..1224 /note="proteoglycan I" BASE COUNT 357 a 593 c 436 g 299 t ORIGIN Chromosome X. 1 gagtagctgc tttcggtccg ccggacacac cggacagata gacgtgcgga cggcccacca 61 ccccagcccg ccaactagtc agcctgcgcc tggcgcctcc cctctccagg tccatccgcc 121 atgtggcccc tgtggcgcct cgtgtctctg ctggccctga gccaggccct gccctttgag 181 cagagaggct tctgggactt caccctggac gatgggccat tcatgatgaa cgatgaggaa 241 gcttcgggcg ctgacacctc aggcgtcctg gacccggact ctgtcacacc cacctacagc 301 gccatgtgtc ctttcggctg ccactgccac ctgcgggtgg ttcagtgctc cgacctgggt 361 ctgaagtctg tgcccaaaga gatctcccct gacaccacgc tgctggacct gcagaacaac 421 gacatctccg agctccgcaa ggatgacttc aagggtctcc agcacctcta cgccctcgtc 481 ctggtgaaca acaagatctc caagatccat gagaaggcct tcagcccact gcggaagctg 541 cagaagctct acatctccaa gaaccacctg gtggagatcc cgcccaacct acccagctcc 601 ctggtggagc tccgcatcca cgacaaccgc atccgcaagg tgcccaaggg agtgttcagc 661 gggctccgga acatgaactg catcgagatg ggcgggaacc cactggagaa cagtggcttt 721 gaacctggag ccttcgatgg cctgaagctc aactacctgc gcatctcaga ggccaagctg 781 actggcatcc ccaaagacct ccctgagacc ctgaatgaac tccacctaga ccacaacaaa 841 atccaggcca tcgaactgga ggacctgctt cgctactcca agctgtacag gctgggccta 901 ggccacaacc agatcaggat gatcgagaac gggagcctga gcttcctgcc caccctccgg 961 gagctccact tggacaacaa caagttggcc agggtgccct cagggctccc agacctcaag 1021 ctcctccagg tggtctatct gcactccaac aacatcacca aagtgggtgt caacgacttc 1081 tgtcccatgg gcttcggggt gaagcgggcc tactacaacg gcatcagcct cttcaacaac 1141 cccgtgccct actgggaggt gcagccggcc actttccgct gcgtcactga ccgcctggcc 1201 atccagtttg gcaactacaa aaagtagagg cagctgcagc caccgcgggg cctcagtggg 1261 ggtctctggg gaacacagcc agacatcctg atggggaggc agagccagga agctaagcca 1321 gggcccagct gcgtccaacc cagcccccca cctcaggtcc ctgaccccag ctcgatgccc 1381 catcaccgcc tctccctggc tcccaagggt gcaggtgggc gcaaggcccg gcccccatca 1441 catgttccct tggcctcaga gctgcccctg ctctcccacc acagccaccc agaggcaccc 1501 catgaagctt ttttctcgtt cactcccaaa cccaagtgtc caaagctcca gtcctaggag 1561 aacagtccct gggtcagcag ccaggaggcg gtccataaga atggggacag tgggctctgc 1621 cagggctgcc gcacctgtcc agaacaacat gttctgttcc tcctcctcat gcatttccag 1681 ccttg // LOCUS HUMHPROT 1086 bp mRNA PRI 12-JUN-1991 DEFINITION Human H-protein mRNA, complete cds. ACCESSION M69175 NID g184347 KEYWORDS H-protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1086) AUTHORS Fujiwara,K., Okamura-Ikeda,K., Hayasaka,K. and Motokawa,Y. TITLE The primary structure of human H-protein of the glycine cleavage system deduced by cDNA cloning JOURNAL Biochem. Biophys. Res. Commun. 176, 711-716 (1991) MEDLINE 91222237 FEATURES Location/Qualifiers source 1..1086 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..1086 /product="H-protein" CDS 25..546 /codon_start=1 /product="H-protein" /db_xref="PID:g184348" /translation="MALRVVRSVRALLCTLRAVPLPAAPCPPRPWQLGVGAVRTLRTG PALLSVRKFTEKHEWVTTENGIGTVGISNFAQEALGDVVYCSLPEVGTKLNKQDEFGA LESVKAASELYSPLSGEVTEINEALAENPGLVNKSCYEDGWLIKMTLSNPSELDELMS EEAYEKYIKSIEE" BASE COUNT 339 a 199 c 246 g 302 t ORIGIN 1 gggcgggccc gcacccctgc gaacatggcg ctgcgagtgg tgcggagcgt gcgggccctg 61 ctctgcaccc tgcgcgcggt cccgttaccc gccgcgccct gcccgccgag gccctggcag 121 ctgggggtgg gcgccgtccg tacgctgcgc actggacccg ctctgctctc ggtgcgtaaa 181 ttcacagaga aacacgaatg ggtaacaaca gaaaatggca ttggaacagt gggaatcagc 241 aattttgcac aggaagcgtt gggagatgtt gtttattgta gtctccctga agttgggaca 301 aaattgaaca aacaagatga gtttggtgct ttggaaagtg tgaaagctgc tagtgaactc 361 tattctcctt tatcaggaga agtaactgaa attaatgaag ctcttgcaga aaatccagga 421 cttgtaaaca aatcttgtta tgaagatggt tggctgatca agatgacact gagtaaccct 481 tcagaactag atgaacttat gagtgaagaa gcatatgaga aatacataaa atctattgag 541 gagtgaaaat ggaactccta aataaactag tatgaaataa cgcaagccag cagagttgtc 601 ttaaattagt ggtggataga agacttagaa tagaaacttt tagtattacc gatggggaaa 661 aaaaaactac tgttaacact gctaatgaaa gaaaatgccc tttaactttc taatgattat 721 agataaatat aatatgcgtc tttttcacaa tatcctatga tttttagact aggctctagt 781 gttcagaatt catgaaatta tccatggtaa aaactagtta taaaaattac ataattcaaa 841 gataacattg ttattcttaa gccttatata atattgtaac ttgcatgtat ccatacctgg 901 atttgggatg aaatacttaa tgatctttcc attggaaata actggaagtg aagaggtttt 961 gttgcttgta cagtgtcaga tgaggaacac cactatctta attttgcgat acactgcatt 1021 tgctggtgct atttttatac agtgaagcaa cagctttgca gcaaaataat aaaatacttc 1081 ttcgtt // LOCUS HUMHPSA 1728 bp mRNA PRI 17-JUN-1996 DEFINITION Human mRNA for phosphoribosypyrophosphate synthetase-associated protein 39, complete cds. ACCESSION D61391 NID g1381026 KEYWORDS phosphoribosypyrophosphate synthetase-associated protein 39; PAP39. SOURCE Homo sapiens Hepatoma Hepatocyte cell_line:HepG2 cells cDNA to mRNA, clone:hPAP39-1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1728) AUTHORS Ishizuka,T., Kita,K., Sonoda,T., Ishijima,S., Sawa,K., Suzuki,N. and Tatibana,M. TITLE Cloning and sequencing of human complementary DNA for the phosphoribosylpyrophosphate synthetase-associated protein 39 JOURNAL Biochim. Biophys. Acta 1306 (1), 27-30 (1996) MEDLINE 96201702 REFERENCE 2 (bases 1 to 1728) AUTHORS Ishizuka,T. TITLE Direct Submission JOURNAL Submitted (27-JUN-1995) to the DDBJ/EMBL/GenBank databases. Toshiharu Ishizuka, Chiba University,School of Medicine, Biochemistry; 1-8-1,Inohana,Chuo-ku, Chiba, Chiba 260, Japan (E-mail:tishizuk@aquarius.bekkoame.or.jp, Tel:+81-43-226-2040, Fax:+81-43-226-2041) COMMENT Sequence updated (24-Oct-1995) by: Toshiharu Ishizuka. FEATURES Location/Qualifiers source 1..1728 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HepG2 cells" /cell_type="Hepatocyte" /clone="hPAP39-1" /tissue_type="Hepatoma" CDS 50..1120 /codon_start=1 /product="phosphoribosypyrophosphate synthetase-associated protein 39" /db_xref="PID:d1010255" /db_xref="PID:g1381027" /translation="MNAARTGYRVFLANSTAACTELAKRITERLGAELGKSVVYQETN GETRVEIKEFVRGQDIFIIQTIPRDVNTAVMELLIMAYALKTACARNIIGVIPYFPYS KQSKMRKRGSIVCKLLASMLAKAGLTHIITMDLHQKEIQGFFSFPVDNLRASPFLLQY IQEEIPNYRNAVIVAKSPDAAKRAQSYAERLRLGLAVIHGEAQCTELDMDDGRHSPPM VKNATVHPGLELPLMMAKEKPPITVVGDVGGRIAIIVDDIIDDVESFVAAAEILKERG AYKIYVMATHGILSAEAPRLIEESSVDEVVVTNTVPHEVQKLQCPKIKTVDISLILSE AIRRIHNGESMAYLFRNITVDD" polyA_site 1728 BASE COUNT 466 a 390 c 429 g 443 t ORIGIN 1 ggtgcgcaag ggcacggacc tcggagctct ccccgttccc ccgccggcca tgaacgccgc 61 tcgcaccggc taccgagtct tcctcgccaa ctccacggcc gcctgcacgg agctggccaa 121 gcgcatcaca gagcgccttg gtgctgaatt ggggaagtct gttgtatatc aagagaccaa 181 tggagaaaca agagttgaaa taaaagaatt tgttcgtggc caagatattt tcattataca 241 gacaataccc agagatgtga atacagctgt gatggagttg ctcatcatgg cttacgcact 301 gaagactgcc tgtgccagga acattattgg ggtcatcccc tacttcccct acagcaagca 361 gagcaagatg aggaagaggg gttccattgt gtgcaagctg ctagcatcca tgctggcgaa 421 agcaggttta actcacatta tcactatgga tcttcatcaa aaggaaatac aaggcttttt 481 cagctttcct gtggacaacc ttagagcctc acctttcctg cttcagtata tccaggaaga 541 aattccaaat tacagaaatg cagtcattgt agctaagtct cctgatgctg caaagagggc 601 ccagtcctat gcggagagac tgcgtctggg tttggccgtc attcacgggg aagctcagtg 661 cacggaactg gacatggacg atggtcgtca ctccccgcct atggtcaaaa atgctactgt 721 gcacccaggc ctggagttgc cattgatgat ggccaaagag aagccaccga taactgtagt 781 tggagatgtt ggaggccgca tcgcaatcat cgtggatgac attattgacg atgtggagag 841 ttttgttgct gccgcggaga tcctgaaaga gagaggcgcc tataagatct atgttatggc 901 cacccacggc atcctgtctg cagaggcccc tcgcctgatt gaggagtcct ccgtagacga 961 ggtggtggtg acgaatactg tccctcatga ggttcagaag ctgcaatgtc ccaagataaa 1021 gactgtggat atcagtttga ttctttctga agccattcgg agaatccaca atggagagtc 1081 catggcctac cttttccgaa acatcactgt ggatgactag ctttcacgag ggtctcgacc 1141 ctggacctcc tgagggaaac atggaaaaag cagtgccatg agtgatacag tgtttccttg 1201 caagggagga ctcgaaacag cctggagtta gatatcttct tttgcccgga ttgatgggga 1261 ggagggatta aaagagtcag gaagaagaca gagctaatgg ataaatatca taacatggcc 1321 ttacatgtct gctgtcatca gccctgttcc ttaaaagttc tagctgcttt cttaaaaata 1381 atctgaaaat cttattgata ctaaagagga gttaaaggca cataaagtct taactctata 1441 atgttcattt agttgtttca gctccaggga aatggaggta ttgatgttga acctggttag 1501 ggaagctgag cgcctgtggc cctattacta tccagttggc ctctcccaaa tcaacttcaa 1561 gtcttttata gagaatcgta tttttctttc agaaattgct atgcctacag ccattgaaaa 1621 atgaagcatt catgttgtta catcttccaa ggatgtcaga ttagaaaata gcatcccacc 1681 tctgggtatc tgagtggctc tgaagttgca aataaaataa tttgttgt // LOCUS HUMHPV5B 8852 bp DNA PRI 08-FEB-1997 DEFINITION Human papillomavirus 5b genome integrated into human carcinoma DNA. ACCESSION D26561 NID g1065999 KEYWORDS L1 protein; E6 protein; E7 protein; E1 protein; provirus. SOURCE Homo sapiens carcinoma DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1383 to 4697) AUTHORS Yabe,Y., Sakai,A., Hitsumoto,T., Kato,H. and Ogura,H. TITLE A subtype of human papillomavirus 5 (HPV-5b) and its subgenomic segment amplified in a carcinoma: nucleotide sequences and genomic organizations JOURNAL Virology 183 (2), 793-798 (1991) MEDLINE 91306467 REFERENCE 2 (bases 1 to 8852) AUTHORS Yabe,Y. TITLE A subgenomic segment of human papillomavirus 5b integrated within a metastatic carcinoma: nucleotide sequence and genomic organization JOURNAL Unpublished (1994) REFERENCE 3 (bases 1 to 8852) AUTHORS Yabe,Y. TITLE Direct Submission JOURNAL Submitted (19-JAN-1994) to the DDBJ/EMBL/GenBank databases. Yoshiro Yabe, Inst. of Cell. Mol. Biol., Okayama Univ. Med. Sch., Department of Molecular Virology; 2-5-1 Shikata-cho, Okayama, Okayama 700, Japan (Tel:086-223-7151(ex.2630), Fax:086-222-2846) FEATURES Location/Qualifiers source 1..8852 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="carcinoma" CDS 1383..2772 /partial /codon_start=2 /product="ORF for L1 protein" /db_xref="PID:d1006100" /db_xref="PID:g1838926" /translation="PKVSGNQHRVFRLKLPDPNRFALADMSVYNPDKERLVWACRGLE IGRGQPLGVGSTGHPYFNKVKDTENSNAYITFSKDGQNTAFSKDDRLNTSFDPKQIQM FIVGCTPCIGEHWDKAVPCAKNDQQTGLCPPIELKNTYIEDGDMADIGFGNMNFKALQ DSRSDVSLDIVNETCKYPDFLKMQNDIYGDACFFYARREQCYARHFFVRGGKTGDDIP GAQIDNGTYKNQFYIPGADGQAQKTIGNAMYFPTVSGSLVSSDAQLFNRPFWLQRAQG HNNGILWANQMFITVVDNTRNTNFSISVYNQAGPLKDVADYNAEQFREYQRHVEEYEI SLILQLCKVPLKAEVLAQINAMNSSLLEDWQLGFVPTPDNPIQDTYRYIDSLATRCPD KNPPKEKEDPYKGLHFWDVDLTERLSLDLDQYSLGRKFLFQAGLQHTTVNGTKAVSYK GSNRGTKRKRKN" source 1383..4697 /note="Integrated sequence of HPV-5b DNA; corresponds to the sequence from 6111..7779, 1..1646 of HPV-5b genome (accession number D90252)" /organism="Human papillomavirus" /proviral /db_xref="taxon:10566" CDS 3259..3732 /codon_start=1 /product="ORF for E6 protein" /db_xref="PID:d1006101" /db_xref="PID:g1838927" /translation="MAEGAEHQQKLTEKDKAELPSTIRDLAETLGIPLIDCIIPCNFC GKFLNYLEACEFDYKKLSLIWKDYCVFACCRVCCGATATYEFNQFYEQTVLGRDIELA SGLSIFDIDIRCQTCLAFLDIIEKLDCCGRGLPFHKVRNAWKGICRQCKHFYHDW" CDS 3722..4033 /codon_start=1 /product="ORF for E7 protein" /db_xref="PID:d1006102" /db_xref="PID:g1838928" /translation="MIGKEVTVQDIILELSEVQPEVLPVDLFCEEELPNEQETEEEPD IERISYKVIAPCGCRHCEVKLRIFVHATEFGIRAFQQLLTGDLQLLCPDCRGNCKHDG S" CDS 4020..4697 /partial /codon_start=1 /product="ORF for E1 protein" /db_xref="PID:d1006103" /db_xref="PID:g1838929" /translation="MTDPNPKGSTSKEGFGDWCLLEADCSDVENDLGQLFERDTDSDI SDLLDDTELEQGNSLELFHQQECEQSEEQLQKLKRKYLSPKAIAQLSPRLESISLSPQ QKSKRRLFAEQDSGLELTLNNEAEDVTPEVEVPAIDSRPDDEGGSGDVDIHYTSLLRS SNKKATLMAKFKESFGVGFNELTRQFKSHKTCCKDWVVSVYAVHDDLFESSKQLLQQH CDYIWVRG" BASE COUNT 2653 a 1711 c 1797 g 2691 t ORIGIN 1 gcatgcatat tttatgccta aaaaatattt catcttctga ttttctagct ttcttctgca 61 ggttgttttt tttttttaga atatttttca atcattttaa ttctctattg taaaacttgg 121 aaatctctac ttatggactg ccaagaacag tctgaaatta aatcaattta ctctcactct 181 tagtgaatat tttgggagtg ctaggtattg aatgtgtccc ccaaaagttc atgtgttaga 241 aatgtaatcc ttctgcccct atgatgagat gaatggatga atgagggctc tgttctcatg 301 acgggatgaa tgttgctgtc ccaggagtgg gtttattatc acagcagcgt tattcttgtg 361 cctgtgagct ctccctggct cccttgctcc cactctgagc atgtgatgcc ctctgccatg 421 aggtgaggca gcaagaaggc cctcacagat gccggcactg tggacttcca gcctctagaa 481 ctatgagcta aataaacttt ttaatataaa ttacctagtt tgtcatcttt tgttaccaat 541 aagaaaaggc aataagacag ggaggttcta ttatgatgtc attcatggga aaatggtaga 601 caggacagaa tctttgctgg ctccctagat actgccttgg aatgttactc ttggaaaaca 661 gtggctacag agtgatgcct ttctccataa ctttaattta ccttaagaaa agctttgagt 721 gctactgagt cgggaatgaa aaacctagag aatgttctgt cctgtgggag caaggagtat 781 agttagtccc tgtattacta cccaaagggg catgaggctt caccctcctt gggctcccta 841 gggacttata atgccttcct gtaattatcg ggccagttaa tttaatacta atccacttaa 901 ttcagttaac atcagatttg tcctagggat cttgttacat ccttttgtat actaaaagag 961 aaaaagaaaa aaaaaagatt tgcttctctt tcaatattcc tgggaagttg gatatccaca 1021 tggaaaaggc agatctctag ttcacaacag ataaaaaata aattttactc aatgtgaaat 1081 taaatgtaaa aaataaaact tcaacattgt agaagaaaat atagaaaaag ctctttaggt 1141 ctgtgaagga gaaaagaatt atcgaaatca ggccataaag aaaaagtaca agtcataaag 1201 gcaataaata tacaaataga gtttaacctc tgaaacactt gaccgcaatg ttctgaggta 1261 aaaacaacaa accagccatt gaagggaaga acagattgac agtgcatata acacagaaag 1321 aagtagtaat caaactattt aaaattgcct aataaatcaa aaagaaaaga ccaaaaaagg 1381 aaccctaagg tttcaggaaa tcaacacaga gtatttcgcc taaaattacc agatcccaac 1441 agatttgcat tagctgatat gtctgtttac aaccctgaca aagaacgttt ggtttgggcc 1501 tgtagaggct tagaaatagg taggggccag ccattgggtg tagggagcac tggtcaccct 1561 tatttcaata aagtaaaaga tacagaaaac agtaatgcat acataacatt ttctaaagat 1621 ggacagaata cagcattttc taaagatgac agactgaata catcctttga tcctaaacaa 1681 atccaaatgt tcattgtagg atgcacacct tgcataggag agcattggga taaagctgtg 1741 ccttgtgcaa aaaatgacca gcaaactggc ctttgtcctc ctattgaatt aaaaaataca 1801 tatatagaag atggtgatat ggcagatata ggttttggaa atatgaactt taaggcactt 1861 caagatagta gatcagatgt cagtttggat attgtcaatg aaacttgcaa atatccagat 1921 tttttaaaga tgcaaaatga tatctatggc gatgcctgct ttttttatgc tcgtagggag 1981 caatgttatg ctagacactt ttttgttaga gggggtaaaa ctggtgatga cattccaggt 2041 gcacaaattg acaatggtac atacaaaaat caattttaca ttccaggagc tgatggccaa 2101 gctcaaaaga ctatcggaaa tgccatgtat ttcccaactg ttagtggctc attagtttcc 2161 agtgatgctc aattgtttaa caggcccttc tggctccaaa gagcccaagg tcataataat 2221 ggcatcctgt gggctaatca aatgtttatc acagtggttg acaacacaag aaatactaat 2281 ttcagtattt ctgtatataa tcaagctgga ccactaaaag atgttgcaga ctataatgca 2341 gagcaattta gagaatatca aagacatgta gaagaatatg aaatatcttt aattttacaa 2401 ctttgtaagg ttcctttaaa ggcggaggta ttggcacaga tcaatgcaat gaactcctct 2461 ttattggaag attggcagtt aggatttgtt cccactcctg ataatccaat tcaggatacc 2521 tacaggtata ttgactcttt ggctacacgg tgtccagata aaaatcctcc aaaagaaaag 2581 gaagaccctt ataaaggctt acatttttgg gatgtagatt taactgaaag attgtcatta 2641 gatttagatc aatattcctt aggcaggaaa tttttattcc aagctggttt acaacacacg 2701 accgttaacg gtacaaaagc agtgtcttat aaagggtcta atagaggaac aaagcgcaaa 2761 cgtaaaaatt gaggcctgac cgaaagtggt acatttttat aaacttttac acagtattca 2821 aggaatgttt gtttactctg actaagtata agtcttccaa ggataccgac cgcacccggt 2881 acactcagtc aggttgttgc caatatagaa tcagatcggt gccaaacaca ccgtcttgga 2941 ctcagaacag accgtgttcg ttataacatg ctcggattag ggacttcgcc aaagaagatt 3001 taatctacaa tcgcttttgg caatcacatt tggcactgct aaaggaccgt taacggtaag 3061 tagcaagttc cttgttcctt gtaccaggtg cggtattggg attttgcaat tgtaatggtt 3121 gttgccaact accataggca cattcaagtt tttgcctgta tcgttttcgt atcctgttaa 3181 caatatccaa tgtatgtata cataaataaa tatatatata tataagtgtc taagattggg 3241 ttattctgta atcaggcaat ggctgaggga gccgaacacc aacagaaact gacagaaaaa 3301 gataaggcag aattaccttc aaccattaga gacttagctg aaaccttagg catccctctt 3361 attgattgta taataccttg caatttttgt ggtaaatttt taaattattt ggaagcctgt 3421 gaattcgact acaaaaaact tagcctaatt tggaaagatt attgtgtgtt tgcgtgttgt 3481 cgcgtatgct gtggcgccac tgcaacatac gaatttaatc aattttatga gcagacagtg 3541 ttaggaagag atattgagtt agcttcagga ctctcgattt ttgatattga tatcaggtgt 3601 caaacttgct tagcatttct tgacattata gaaaagttag attgctgtgg cagaggcctt 3661 ccctttcaca aggtgaggaa cgcctggaag ggaatctgta ggcagtgtaa gcatttttat 3721 catgattggt aaagaggtca ccgtgcaaga tattattctg gagctcagtg aggtgcagcc 3781 cgaagtgcta ccagttgacc tgttttgtga agaggaatta ccaaacgagc aggaaacgga 3841 ggaggagcct gacatcgaaa ggatctctta caaagttata gctccgtgcg gttgcagaca 3901 ctgtgaggtc aagcttcgca tttttgtcca cgccacagaa tttggtatta gagctttcca 3961 acagctattg accggagatc tgcagctcct gtgtcctgac tgtcgcggaa actgcaaaca 4021 tgacggatcc taatcctaaa ggtagtacat ctaaagaagg gtttggtgat tggtgtttat 4081 tggaagctga ctgtagtgat gtagaaaatg atttgggaca attgtttgag agagatacag 4141 actctgatat atcggatttg ttagatgata ctgaactgga gcagggcaat tccctggaac 4201 tatttcatca acaggagtgt gagcagagcg aggagcaatt acaaaaacta aaacgaaagt 4261 atcttagtcc aaaagctatc gcacagctta gtccgcgact tgagtcaatt tcattgtcac 4321 ctcagcagaa gtctaagcga aggctctttg cagagcagga cagcggactt gagctgactt 4381 taaacaatga agctgaagat gttactcctg aggtggaggt accggctatt gactctcggc 4441 cggatgacga gggaggttca ggggatgtag atatacatta tacatcattg ttgcgttcta 4501 gcaacaaaaa agccacatta atggctaaat ttaaagagtc gtttggagta ggttttaatg 4561 aattgacacg gcaatttaaa agccacaaaa cctgctgtaa ggactgggtt gtctctgtat 4621 atgcagtgca tgatgattta tttgaaagct caaagcagct gttgcaacag cattgtgact 4681 atatctgggt ccgtgggcat agcctggcct tccataccca tgcccacacg cctgctcctt 4741 gcttttctgt tttgtttttt gttcatgcta ttaactatgc cttccttgcc tggctgattt 4801 ttattcatgt cccaaaatgt agctcggagt tgacttcctc tagggagggt tcctattgcc 4861 attaaagtct cttccttgca tatccttgat gaactattgt gggtaccagg gccataactt 4921 caattaccat actgttttgg gacaatctat ttactcgtct aacttctctt gctagactta 4981 cagaaatgca aacacggaaa tcatatctgt accttattta ctcttttaaa gccccacctc 5041 cagtttagag cttggtgtag agcagttcca ttgaaatact gacctgagga catttcaact 5101 tatatttagt tatgttctca tatttttcta agaggttcaa aacaaaactg ataaacatgt 5161 ttttaaaaaa tgagtatctg attgtttaga aatgcttttt acagaaaaag actgatcaga 5221 ttctaacaga agcaaaactg ccttttaaac tatttatttt aaagttcttc tctccccatg 5281 tagcttttac atggtctcgt gagttatgac atcaacttct tctatgacct ccagaaagcc 5341 gttttatgcc tcagttactt tgttggtaaa attagcgatg ataactactg ctaagcaata 5401 ttgataaaag ttgtttgatt tgaaagtgtt ataactgaat gctcccaaac aataattcca 5461 agcacatttt tctaaggttc tgatgattgt gttagtgata aagcagaagg atggggtttc 5521 tatttgacag tctggtattt ccatctctca aatcaagcta gcagagttaa atgttctggt 5581 catatttaac tcacatctct tttttccaaa ttattcaact atgccatgga aaataatatt 5641 tgctttatcc tatcattaga cagtatgctg acaggcttat aaaatgacaa gatttttcat 5701 ttcagtaaac catatttaaa tcatcatcgt attccttatc tatctacttg gaggtaattt 5761 ggtttactgc attaatatga attttatatt ttttgagtct tcactcaaaa ttggtcctat 5821 catctattac ccaaataaag actgtactca gagataatgt agtattaaag agaaatgtct 5881 aagcaacaca aataatctag gaaaattgta aaagaaagtt tgaggataat gctaaaattt 5941 gagttttggc ctcaacttga gtgcattgtg ccacatcctt tgctgttaag ttaaaatgca 6001 cctttggtca aatcatttcc tttgtctacc ccctgctgtc atagtttcag tgctcatgtc 6061 tatgtcgata aattgtgaac tgctagatct gatttctcag tctattttta taatctattt 6121 tctttggata tttcaatttg gatatttcaa ccttttaagt tccagtgagc aagagtagac 6181 caaaattcat atttgcattc aacatttggg gactagggtt ttttttttat taccacaata 6241 aatatatgca gagtgatttc tatgcgcgat aagattataa gagactgatt gaatgggtga 6301 actttccttt gcttcctcag tttcccaagg cagtgtgagg gaagtttggt gatccatgca 6361 gctggtgcct gtgtcaggta ggagggtccc agggatggtg ggggcaccga ctgcctttcc 6421 ctccaccagg aggctagggg gtgtcatagt ccacttagtg ttgctataat ggaacacctg 6481 atagtgagcc atttataaat aagagaagtt catttagctc acagttttgt agcctgggaa 6541 gttcaagagc atggccctgg cttctgacaa gggctttcac actgtgtcat aacctggaga 6601 aggccaaggg gaagctgaca cgttcaaaca gaggactgaa caagtgaagg aggctcactt 6661 tgtaacaacc cactgtcacg ggaatccagt ctcaaaagag caagaactca ctattgcagg 6721 aaggacacca agccattcat gagcgatcca aacccacgac ccaaacacct cccactagtc 6781 cccacctcca gcaccaccac attcacattg ggaatcacat ttcaacatgg aatttgacag 6841 gggacaaact caactatagc aaggagactt tgcagaagtc aagcctctgc ctggagttac 6901 agatggaccc tctacattct actgcttaca ggatctgagg tcctatagca cactgtactt 6961 ggcctgccat gtcctgcaga gaaagagcat tatactcaga aatgaaggtg ttgtcatttc 7021 ttcaaatcct acctctcctc actgtaatca gtggaccctc cattaccatc aaactgtagc 7081 catagtgaat tgggggtgtt gctggttaac cttccctgtt tatacctcat caaatctggc 7141 cacttcataa aagtctgttc tcaatcacat ttcttaaatg taccctttat tatttcagca 7201 catatctgca caatcagtaa tttttatgaa aaatcagaaa cttctattga aattaggtgg 7261 tcctgttctt aatctccctt ccagtttttg gtgcatccat tctttcagta tcacctttaa 7321 cccattagtt gttgactaat catttgcagt gtttctgcag ttgtgtctac aagactcaaa 7381 ctgacccctt gtcctggtcc cagtgttctt ccaattccca ccactcatct ttgacatgaa 7441 tgtcgtcacc atcacagtcg ccccctagcg tcccagtaac agcctgcact gtggatggct 7501 gctgctggtt cccgggagca cacctttctc tagatgactt ccttcctgac tccccggtcc 7561 agatggctct gtcaacttga tctatttttc aggaaatgtt atttgattct tccatcaatc 7621 tctggagcac agtgaccctg cctttgtttt acctgccttg gcaaggggca cagagcagga 7681 aatactatat aattccctta caattccaag tagggcttct gacacaggta aaatttgagg 7741 catagtgcca aaaccatttc taagcaggtt ggtgctcagg gtgggatact accctgctgt 7801 gaagtgaagg cacagcaggg aagcaggtgt gtcggaggcc catttcttgg tattgtggct 7861 ggagaaacct gtctgtgacc ttgggcattt gtctccaagt ctattggggt ttgagactac 7921 agtgggacaa tcaagaccag acctgacaca ggagtaaaat ctgaaagtcc tcccagtcct 7981 gatcaagagt gaggaaaaat gagacggctg agcccacaga ggcagggact caggagctga 8041 tgttcagggg ggctctttaa aagttgcatt ttgtaaacat tgttaacgtg cttgctcaat 8101 tattagaaga cagtaagcat ttggtaaaca ttgttaacgt gcttgctcaa ttatgagaag 8161 acagtgaaat caacataata caaactgagg ccatttacgc ggctaaactt ttttgcacgt 8221 ggccgtttta attgctcctg tctttctaaa gcaagaggaa atcccattca ttaatttctg 8281 tcaatcaaca agaaaaggat gacatttttt agccatgtag caatatttta ttgaggaaaa 8341 atctacccct ttttggctct tactgtaata agtattcacc catttgatca ttaagaaata 8401 gattctaagt tccaaggaca caactctttc aagtcaaagg cattatgtca ttatttaaat 8461 tgtaatacat gtcagctttg ggtcagaaca tgtgtttctt accagttgcc tcatgtgtcg 8521 tacaagccct atttgggaat tgctttcagg gctttttctt cagtcagagg ggcaaaggct 8581 gtaactcaag gaagaaggtg aattttgtca ctaattactg tcattgttca gaaggcaata 8641 aacactttcc ttgaacaagg taataggtga atcctgcttc ctgctccaac caccagcaga 8701 agggtttctc ccccacctct gggatcagct aggatagtgt cagtagtttg cttttgtttg 8761 cttatgtatg atgtacacgt ttgtattatt ttctttagag aactaagtat atcagaaggc 8821 aacagcccat aatataagtt aaaattgcat gc // LOCUS HUMHRC1X 1539 bp mRNA PRI 31-DEC-1994 DEFINITION Human DNA-binding protein (HRC1) mRNA, complete cds. ACCESSION M91083 NID g184389 KEYWORDS DNA-binding protein; Harvey rat sarcoma viral oncogene homolog. SOURCE Homo sapiens female placenta cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1539) AUTHORS Weitzel,J.N., Kasperczyk,A., Mohan,C. and Krontiris,T.G. TITLE The HRAS1 gene cluster: two upstream regions recognizing transcripts and a third encoding a gene with a leucine zipper domain JOURNAL Genomics 14 (2), 309-319 (1992) MEDLINE 93052330 FEATURES Location/Qualifiers source 1..1539 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="female" /tissue_type="placenta" /map="11p" mRNA 10..1539 /partial /gene="HRC1" gene 10..1539 /gene="HRC1" CDS 44..1165 /gene="HRC1" /note="putative" /codon_start=1 /product="DNA-binding protein" /db_xref="PID:g184390" /translation="MLLGLAAMELKVWVDGIQRVVCGVSEQTTCQEVVIALAQAIGQT GRFVLVQRLREKERQLLPQECPVGAQATCGQFASDVQFVLRRTGPSLAGRPSSDSCPP PERCLIRASLPVKPRAALGCEPRKTLTPEPAPSLSRPGPAAPVTPTPGCCTDLRGLEL RVQRNAEELGHEAFWEQELRREQAREREGQARLQALSAATAEHAARLQALDAQARALE AELQLAAEAPGPPSPMASATERLHQDLAVQERQSAEVQGSLALVSRALEAAERALQAQ AQELEELNRELRQCNLQQFIQQTGAALPPPPRPDRGPPGTQGPLPPAREESLLGAPSE SHAGAQPRPRGGPHDAELLEVAAAPAPEWCPLAAQPQAL" polyA_signal 1523 /gene="HRC1" BASE COUNT 277 a 493 c 525 g 244 t ORIGIN 1 gaattcgggg ggagggggca gtgtcctccg agccaggaca ggcatgttgt tgggactggc 61 ggccatggag ctgaaggtgt gggtggatgg catccagcgt gtggtctgtg gggtctcaga 121 gcagaccacc tgccaggaag tggtcatcgc actagcccaa gcaataggcc agactggccg 181 ctttgtgctt gtgcagcggc ttcgggagaa ggagcggcag ttgctgccac aagagtgtcc 241 agtgggcgcc caggccacct gcggacagtt tgccagcgat gtccagtttg tcctgaggcg 301 cacagggccc agcctagctg ggaggccctc ctcagacagc tgtccacccc cggaacgctg 361 cctaattcgt gccagcctcc ctgtaaagcc acgggctgcg ctgggctgtg agccccgcaa 421 aacactgacc cccgagccag cccccagcct ctcacgccct gggcctgcgg cccctgtgac 481 acccacacca ggctgctgca cagacctgcg gggcctggag ctcagggtgc agaggaatgc 541 tgaggagctg ggccatgagg ccttctggga gcaagagctg cgccgggagc aggcccggga 601 gcgagaggga caggcacgcc tgcaggcact aagtgcggcc actgctgagc atgccgcccg 661 gctgcaggcc ctggacgctc aggcccgtgc cctggaggct gagctgcagc tggcagcgga 721 ggcccctggg cccccctcac ctatggcatc tgccactgag cgcctgcacc aggacctggc 781 tgttcaggag cggcagagtg cggaggtgca gggcagcctg gctctggtga gccgggccct 841 ggaggcagca gagcgagcct tgcaggctca ggctcaggag ctggaggagc tgaaccgaga 901 gctccgtcag tgcaacctgc agcagttcat ccagcagacc ggggctgcgc tgccaccgcc 961 cccacggcct gacaggggcc ctcctggcac tcagggccct ctgcctccag ccagagagga 1021 gtccctcctg ggcgctccct ctgagtccca tgctggtgcc cagcctaggc cccgaggtgg 1081 cccccatgac gcagaactcc tggaggtagc agcagctcct gccccagagt ggtgtcctct 1141 ggcagcccag ccccaggctc tgtgacagcc tagtgagggc tgcaagacca tcctgcccgg 1201 accacagaag gagagttggc ggtcacagag ggctcctctg ccaggcagtg ggaagccctg 1261 ggtttggcct caggagctgg gggtgcagtg ggggactgcc ctagtccttg ccaggtcgcc 1321 cagcaccctg gagaagcatg gggcgtagcc agctcggaac ttgccaggcc ccaaaggcca 1381 cgactgcctg ttggggacag gagatgcatg gacagtgtgc tcaagctgtg ggcatgtgct 1441 tgcctgcggg agaggtcctt cactgtgtgt acacagcaag agcatgtgtg tgccacttcc 1501 cctaccccaa cgtgaaaacc tcaataaact gcccgaagc // LOCUS HUMHSEM 2530 bp mRNA PRI 08-MAY-1995 DEFINITION Homo sapiens semaphorin-III (Hsema-I) mRNA, complete cds. ACCESSION L26081 NID g799328 KEYWORDS semaphorin. SOURCE Homo sapiens (human). ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2530) AUTHORS Kolodkin,A.L., Matthes,D.J. and Goodman,C.S. TITLE The semaphorin genes encode a family of transmembrane and secreted growth cone guidance molecules JOURNAL Cell 75 (7), 1389-1399 (1993) MEDLINE 94094332 FEATURES Location/Qualifiers source 1..2530 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="foetus" /tissue_type="brain" gene 16..2331 /gene="Hsema-III" CDS 16..2331 /gene="Hsema-III" /codon_start=1 /product="semaphorin-III" /db_xref="PID:g436560" /translation="MGWLTRIVCLFWGVLLTARANYQNGKNNVPRLKLSYKEMLESNN VITFNGLANSSSYHTFLLDEERSRLYVGAKDHIFSFDLVNIKDFQKIVWPVSYTRRDE CKWAGKDILKECANFIKVLKAYNQTHLYACGTGAFHPICTYIEIGHHPEDNIFKLENS HFENGRGKSPYDPKLLTASLLIDGELYSGTAADFMGRDFAIFRTLGHHHPIRTEQHDS RWLNDPKFISAHLISESDNPEDDKVYFFFRENAIDGEHSGKATHARIGQICKNDFGGH RSLVNKWTTFLKARLICSVPGPNGIDTHFDELQDVFLMNFKDPKNPVVYGVFTTSSNI FKGSAVCMYSMSDVRRVFLGPYAHRDGPNYQWVPYQGRVPYPRPGTCPSKTFGGFDST KDLPDDVITFARSHPAMYNPVFPMNNRPIVIKTDVNYQFTQIVVDRVDAEDGQYDVMF IGTDVGTVLKVVSIPKETWYDLEEVLLEEMTVFREPTAISAMELSTKQQQLYIGSTAG VAQLPLHRCDIYGKACAECCLARDPYCAWDGSACSRYFPTAKRRTRRQDIRNGDPLTH CSDLHHDNHHGHSPEERIIYGVENSSTFLECSPKSQRALVYWQFQRRNEERKEEIRVD DHIIRTDQGLLLRSLQQKDSGNYLCHAVEHGFIQTLLKVTLEVIDTEHLEELLHKDDD GDGSKTKEMSNSMTPSQKVWYRDFMQLINHPNLNTMDEFCEQVWKRDRKQRRQRPGHT PGNSNKWKHLQENKKGRNRRTHEFERAPRSV" BASE COUNT 786 a 518 c 576 g 650 t ORIGIN 1 ggaattccct gcagcatggg ctggttaact aggattgtct gtcttttctg gggagtatta 61 cttacagcaa gagcaaacta tcagaatggg aagaacaatg tgccaaggct gaaattatcc 121 tacaaagaaa tgttggaatc caacaatgtg atcactttca atggcttggc caacagctcc 181 agttatcata ccttcctttt ggatgaggaa cggagtaggc tgtatgttgg agcaaaggat 241 cacatatttt cattcgacct ggttaatatc aaggattttc aaaagattgt gtggccagta 301 tcttacacca gaagagatga atgcaagtgg gctggaaaag acatcctgaa agaatgtgct 361 aatttcatca aggtacttaa ggcatataat cagactcact tgtacgcctg tggaacgggg 421 gcttttcatc caatttgcac ctacattgaa attggacatc atcctgagga caatattttt 481 aagctggaga actcacattt tgaaaacggc cgtgggaaga gtccatatga ccctaagctg 541 ctgacagcat cccttttaat agatggagaa ttatactctg gaactgcagc tgattttatg 601 gggcgagact ttgctatctt ccgaactctt gggcaccacc acccaatcag gacagagcag 661 catgattcca ggtggctcaa tgatccaaag ttcattagtg cccacctcat ctcagagagt 721 gacaatcctg aagatgacaa agtatacttt ttcttccgtg aaaatgcaat agatggagaa 781 cactctggaa aagctactca cgctagaata ggtcagatat gcaagaatga ctttggaggg 841 cacagaagtc tggtgaataa atggacaaca ttcctcaaag ctcgtctgat ttgctcagtg 901 ccaggtccaa atggcattga cactcatttt gatgaactgc aggatgtatt cctaatgaac 961 tttaaagatc ctaaaaatcc agttgtatat ggagtgttta cgacttccag taacattttc 1021 aagggatcag ccgtgtgtat gtatagcatg agtgatgtga gaagggtgtt ccttggtcca 1081 tatgcccaca gggatggacc caactatcaa tgggtgcctt atcaaggaag agtcccctat 1141 ccacggccag gaacttgtcc cagcaaaaca tttggtggtt ttgactctac aaaggacctt 1201 cctgatgatg ttataacctt tgcaagaagt catccagcca tgtacaatcc agtgtttcct 1261 atgaacaatc gcccaatagt gatcaaaacg gatgtaaatt atcaatttac acaaattgtc 1321 gtagaccgag tggatgcaga agatggacag tatgatgtta tgtttatcgg aacagatgtt 1381 gggaccgttc ttaaagtagt ttcaattcct aaggagactt ggtatgattt agaagaggtt 1441 ctgctggaag aaatgacagt ttttcgggaa ccgactgcta tttcagcaat ggagctttcc 1501 actaagcagc aacaactata tattggttca acggctgggg ttgcccagct ccctttacac 1561 cggtgtgata tttacgggaa agcgtgtgct gagtgttgcc tcgcccgaga cccttactgt 1621 gcttgggatg gttctgcatg ttctcgctat tttcccactg caaagagacg cacaagacga 1681 caagatataa gaaatggaga cccactgact cactgttcag acttacacca tgataatcac 1741 catggccaca gccctgaaga gagaatcatc tatggtgtag agaatagtag cacatttttg 1801 gaatgcagtc cgaagtcgca gagagcgctg gtctattggc aattccagag gcgaaatgaa 1861 gagcgaaaag aagagatcag agtggatgat catatcatca ggacagatca aggccttctg 1921 ctacgtagtc tacaacagaa ggattcaggc aattacctct gccatgcggt ggaacatggg 1981 ttcatacaaa ctcttcttaa ggtaaccctg gaagtcattg acacagagca tttggaagaa 2041 cttcttcata aagatgatga tggagatggc tctaagacca aagaaatgtc caatagcatg 2101 acacctagcc agaaggtctg gtacagagac ttcatgcagc tcatcaacca ccccaatctc 2161 aacacgatgg atgagttctg tgaacaagtt tggaaaaggg accgaaaaca acgtcggcaa 2221 aggccaggac ataccccagg gaacagtaac aaatggaagc acttacaaga aaataagaaa 2281 ggtagaaaca ggaggaccca cgaatttgag agggcaccca ggagtgtctg agctgcatta 2341 cctctagaaa cctcaaacaa gtagaaactt gcctagacaa taactggaaa aacaaatgca 2401 atatacatga acttttttca tggcattatg tggatgttta caatggtggg aaattcagct 2461 gagttccacc aattataaat taaatccatg agtaactttc ctaataggct tttttttcct 2521 aataccaccg // LOCUS HUMHSF1 2156 bp mRNA PRI 08-NOV-1994 DEFINITION Human heat shock factor 1 (TCF5) mRNA, complete cds. ACCESSION M64673 NID g184402 KEYWORDS DNA-binding transcription factor; heat shock factor 1. SOURCE Homo sapiens lymphoid cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2156) AUTHORS Rabindran,S.K., Giorgi,G., Clos,J. and Wu,C. TITLE Molecular cloning and expression of a human heat shock factor, HSF1 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88 (16), 6906-6910 (1991) MEDLINE 91334376 FEATURES Location/Qualifiers source 1..2156 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="B cell" /tissue_type="lymphoid" /map="20" gene 161..1750 /gene="TCF5" CDS 161..1750 /gene="TCF5" /codon_start=1 /db_xref="GDB:G00-126-373" /product="heat shock factor 1" /db_xref="PID:g184403" /translation="MDLPVGPGAAGPSNVPAFLTKLWTLVSDPDTDALICWSPSGNSF HVFDQGQFAKEVLPKYFKHNNMASFVRQLNMYGFRKVVHIEQGGLVKPERDDTEFQHP CFLRGQEQLLENIKRKVTSVSTLKSEDIKIRQDSVTKLLTDVQLMKGKQECMDSKLLA MKHENEALWREVASLRQKHAQQQKVVNKLIQFLISLVQSNRILGVKRKIPLMLNDSGS AHSMPKYSRQFSLEHVHGSGPYSAPSPAYSSSSLYAPDAVASSGPIISDITELAPASP MASPGGSIDERPLSSSPLVRVKEEPPSPPQSPRVEEASPGRPSSVDTLLSPTALIDSI LRESEPAPASVTALTDARGHTDTEGRPPSPPPTSTPEKCLSVACLDKNELSDHLDAMD SNLDNLQTMLSSHGFSVDTSALLDLFSPSVTVPDMSLPDLDSSLASIQELLSPQEPPR PPEAENSSPDSGKQLVHYTAQPLFLLDPGSVDTGSNDLPVLFELGEGSYFSEGDGFAE DPTISLLTGSEPPKAKDPTVS" BASE COUNT 435 a 739 c 628 g 354 t ORIGIN 1 cgggcccgtt gcaagatggc ggcggccatg ctgggccccg gggctgtgtg tgcgcagcgg 61 gcggcggcgc ggcccggaag gctggcgcgg cgacggcgtt agcccggccc tcggcccctc 121 tttgcggccg ctccctccgc ctattccctc cttgctcgag atggatctgc ccgtgggccc 181 cggcgcggcg gggcccagca acgtcccggc cttcctgacc aagctgtgga ccctcgtgag 241 cgacccggac accgacgcgc tcatctgctg gagcccgagc gggaacagct tccacgtgtt 301 cgaccagggc cagtttgcca aggaggtgct gcccaagtac ttcaagcaca acaacatggc 361 cagcttcgtg cggcagctca acatgtatgg cttccggaaa gtggtccaca tcgagcaggg 421 cggcctggtc aagccagaga gagacgacac ggagttccag cacccatgct tcctgcgtgg 481 ccaggagcag ctccttgaga acatcaagag gaaagtgacc agtgtgtcca ccctgaagag 541 tgaagacata aagatccgcc aggacagcgt caccaagctg ctgacggacg tgcagctgat 601 gaaggggaag caggagtgca tggactccaa gctcctggcc atgaagcatg agaatgaggc 661 tctgtggcgg gaggtggcca gccttcggca gaagcatgcc cagcaacaga aagtcgtcaa 721 caagctcatt cagttcctga tctcactggt gcagtcaaac cggatcctgg gggtgaagag 781 aaagatcccc ctgatgctga acgacagtgg ctcagcacat tccatgccca agtatagccg 841 gcagttctcc ctggagcacg tccacggctc gggcccctac tcggccccct ccccagccta 901 cagcagctcc agcctctacg cccctgatgc tgtggccagc tctggaccca tcatctccga 961 catcaccgag ctggctcctg ccagccccat ggcctccccc ggcgggagca tagacgagag 1021 gcccctatcc agcagccccc tggtgcgtgt caaggaggag ccccccagcc cgcctcagag 1081 cccccgggta gaggaggcga gtcccgggcg cccatcttcc gtggacaccc tcttgtcccc 1141 gaccgccctc attgactcca tcctgcggga gagtgaacct gcccccgcct ccgtcacagc 1201 cctcacggac gccaggggcc acacggacac cgagggccgg cctccctccc ccccgcccac 1261 ctccacccct gaaaagtgcc tcagcgtagc ctgcctggac aagaatgagc tcagtgacca 1321 cttggatgct atggactcca acctggataa cctgcagacc atgctgagca gccacggctt 1381 cagcgtggac accagtgccc tgctggacct gttcagcccc tcggtgaccg tgcccgacat 1441 gagcctgcct gaccttgaca gcagcctggc cagtatccaa gagctcctgt ctccccagga 1501 gccccccagg cctcccgagg cagagaacag cagcccggat tcagggaagc agctggtgca 1561 ctacacagcg cagccgctgt tcctgctgga ccccggctcc gtggacaccg ggagcaacga 1621 cctgccggtg ctgtttgagc tgggagaggg ctcctacttc tccgaagggg acggcttcgc 1681 cgaggacccc accatctccc tgctgacagg ctcggagcct cccaaagcca aggaccccac 1741 tgtctcctag aggccccgga ggagctgggc cagccgccca cccccacccc cagtgcaggg 1801 ctggtcttgg ggaggcaggg cagcctcgcg gtcttgggca ctggtgggtc ggccgccata 1861 gccccagtag gacaaacggg ctcgggtctg ggcagcacct ctggtcagga gggtcaccct 1921 ggcctgccag tctgccttcc cccaaccccg tgtcctgtgg tttggttggg gcttcacagc 1981 cacacctgga ctgaccctgc aggttgttca tagtcagaat tgtattttgg atttttacac 2041 aactgtcccg ttccccgctc cacagagata cacagatata tacacacagt ggatggacgg 2101 acaagacagg cagagatcta taaacagaca ggctctaaaa aaaaaaaaaa aaaaaa // LOCUS HUMHSF2 2411 bp mRNA PRI 06-SEP-1991 DEFINITION Human heat shock factor 2 (HSF2) mRNA, complete cds. ACCESSION M65217 NID g184404 KEYWORDS heat shock factor 2. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2411) AUTHORS Schuetz,T.J., Sheldon,L., Gallo,G.J., Tempst,P. and Kingston,R.E. TITLE Isolation of a cDNA for HSF2: Evidence for two heat shock factor genes in humans JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88, 6911-6915 (1991) MEDLINE 91334377 FEATURES Location/Qualifiers source 1..2411 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HPB-ALL" /cell_type="T-cells" /tissue_type="lymphocyte" gene 89..1699 /gene="heat shock factor 2" CDS 89..1699 /gene="heat shock factor 2" /codon_start=1 /product="HSF2" /db_xref="PID:g184405" /translation="MKQSSNVPAFLSKLWTLVEETHTNEFITWSQNGQSFLVLDEQRF AKEILPKYFKHNNMASFVRQLNMYGFRKVVHIDSGIVKQERDGPVEFQHPYFKQGQDD LLENIKRKVSSSKPEENKIRQEDLTKIISSAQKVQIKQETIESRLSELKSENESLWKE VSELRAKHAQQQQVIRKIVQFIVTLVQNNQLVSLKRKRPLLLNTNGAQKKNLFQHIVK EPTDNHHHKVPHSRTEGLKPRERISDDIIIYDVTDDNADEENIPVIPETNEDVISDPS NCSQYPDIVIVEDDNEDEYAPVIQSGEQNEPARESLSSGSDGSSPLMSSAVQLNGSSS LTSEDPVTMMDSILNDNINLLGKVELLDYLDSIDCSLEDFQAMLSGRQFSIDPDLLVD LFTSSVQMNPTDYINNTKSENKGLETTKNNVVQPVSEEGRKSKSKPDKQLIQYTAFPL LAFLDGNPASSVEQASTTASSEVLSSVDKPIEVDELLDSSLDPEPTQSKLVRLEPLTE AEASEATLFYLCELAPAPLDSDMPLLDS" BASE COUNT 734 a 458 c 497 g 722 t ORIGIN 1 gcgttctcgg gaagctgctg ccgtagctgc cgccgccgct accaccgcgt tcgggtgtag 61 aatttggaat ccctgcgccg cgttaacaat gaagcagagt tcgaacgtgc cggctttcct 121 cagcaagctg tggacgcttg tggaggaaac ccacactaac gagttcatca cctggagcca 181 gaatggccaa agttttctgg tcttggatga gcaacgattt gcaaaagaaa ttcttcccaa 241 atatttcaag cacaataata tggcaagctt tgtgaggcaa ctgaatatgt atggtttccg 301 taaagtagta catatcgact ctggaattgt aaagcaagaa agagatggtc ctgtagaatt 361 tcagcatcct tacttcaaac aaggacagga tgacttgttg gagaacatta aaaggaaggt 421 ttcatcttca aaaccagaag aaaataaaat tcgtcaggaa gatttaacaa aaattataag 481 tagtgctcag aaggttcaga taaaacagga aactattgag tccaggcttt ctgaattaaa 541 aagtgagaat gagtcccttt ggaaggaggt gtcagaatta cgagcaaagc atgcacaaca 601 gcaacaagtt attcgaaaga ttgtccagtt tattgttaca ttggttcaaa ataaccaact 661 tgtgagttta aaacgtaaaa ggcctctact tctaaacact aatggagccc aaaagaagaa 721 cctgtttcag cacatagtca aagaaccaac tgataatcat catcataaag ttccacacag 781 taggactgaa ggtttaaagc caagggagag gatttcagat gacatcatta tttatgatgt 841 tactgatgat aatgcagatg aagaaaatat cccagttatt ccagaaacta atgaggatgt 901 tatatctgat ccctccaact gtagccagta ccctgatatt gtcatcgttg aagatgacaa 961 tgaagatgag tatgcacctg tcattcagag tggagagcag aatgaaccag ccagagaatc 1021 cctaagttca ggcagtgatg gcagcagccc tctcatgtct agtgctgtcc agctaaatgg 1081 ctcatccagt ctgacctcag aagatccagt gaccatgatg gattccattt tgaatgataa 1141 catcaatctt ttgggaaagg ttgagctgtt ggattatctt gacagtattg actgcagttt 1201 agaggacttc caggccatgc tatcaggaag acaatttagc atagacccag atctcctggt 1261 tgatcttttc actagttctg tgcagatgaa tcccacagat tacatcaata atacaaaatc 1321 tgagaataaa ggattagaaa ctaccaagaa caatgtagtt cagccagttt cggaagaggg 1381 aagaaaatct aaatccaaac cagataagca gcttatccag tataccgcct ttccacttct 1441 tgcattcctc gatgggaacc ctgcttcttc tgttgaacag gcgagtacaa cagcatcatc 1501 agaagttttg tcctctgtag ataaacccat agaagttgat gagcttctgg atagcagcct 1561 agacccagaa ccaacccaaa gtaagcttgt tcgcctggag ccattgactg aagctgaagc 1621 tagtgaagct acactgtttt atttatgtga acttgctcct gcacctctgg atagtgatat 1681 gccactttta gatagctaaa tccccaggaa gtggacttta catgtatata ttcatcaaaa 1741 tgatgaacta tttattttaa agtatcattt ggtacttttt ttgtaaattg ctttgttttg 1801 tttaatcaga tactgtggaa taaaagcacc ttttgctttt ctcactaacc acacactctt 1861 gcagagcttt caggtgttac tcagctgcat agttacgcag atgtaatgca cattattggc 1921 gtatctttaa gttggattca aatggccatt tttctccaat tttggtaaat tggatatctt 1981 ttttttacaa atacgaccat taacctcagt taaatttttg tttgttttcc tgtttgatgc 2041 tgtctatttg cattgagtgt aagtcatttg aactaatggt ataactccta aagcttctct 2101 gctccagtta tttttattaa atatttttca cttggcttat ttttaaaact gggaacataa 2161 agtgcctgta tcttgtaaaa cttcatttgt ttcttttggt tcagagaagt tcatttatgt 2221 tcaaagacgt ttattcatgt tcaacaggaa agacaaagtg tacgtgaatg ctcgctgtct 2281 gatagggttc cagctccata tatatagaaa gatcgggggt gggatgggat ggagtgagcc 2341 ccatccagtt agttggacta gttttaaata aaggttttcc ggtttgtgtt tttttgaacc 2401 atactgttta g // LOCUS HUMHSH 1599 bp mRNA PRI 03-JAN-1994 DEFINITION Homo sapiens serine hydroxymethyltransferase mRNA, complete cds. ACCESSION L23928 NID g438633 KEYWORDS alternative splicing; serine hydroxymethyltransferase. SOURCE Homo sapiens (library: lambda ZAPII) fetus liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1599) AUTHORS Xu,L. TITLE Molecular Cloning and Characterization of the Rabbit and Human Liver Cytosolic Serine Hydroxymethyltransferase Genes JOURNAL Thesis (1992) REFERENCE 2 (bases 1 to 1599) AUTHORS Xu,L., Mangum,J.H. and Robertson,D.L. TITLE Cloning and analysis of the human liver cytosolic serine hydroxymethyltransferase gene JOURNAL Unpublished (1993) FEATURES Location/Qualifiers source 1..1599 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="liver" /tissue_lib="lambda ZAPII" /map="12q12-q14" gene join(13..826,944..1464) /gene="SHMT" CDS 13..1464 /gene="SHMT" /EC_number="2.1.2.1" /note="The ORF encodes a protein with considerable sequence homology with rabbit SHMT (J. Biol. Chem. 262:5499-5509).; putative" /citation=[2] /citation=[1] /codon_start=1 /function="Ser and THF conversion to Gly and n5n10methylene THF" /product="serine hydroxymethyltransferase" /db_xref="PID:g438636" /translation="MTMPVNGAHKDADLWSSHDKMLAQPLKDSDVEVYNIIKKESNRQ RVGLELIASENFASRAVLEALGSCLNNKYSEGYPGQRYYGGTEFIDELETLCQKRALQ AYKLDPQCWGVNVQPYSGSPANFAVYTALVEPHGRIMGLDLPDGGHLTHGFMTDKKKI SATSIFFESMPYKVNPDTGYINYDQLEENARLFHPKLIIAGTSCYSRNLEYARLRKIA DENGAYLMADMAHISGLVAAGVVPSPFEHCHVVTTTTHKTLRGCRAGMIFYRKGVKSV DPKTGKEILYNLESLINSAVFPGLQGGPHNHAIAGVAVALKQAMTLEFKVYQHQVVAN CRALSEALTELGYKIVTGGSDNHLILVDLRSKGTDGGRAEKVLEACSIACNKNTCPGD RSALRPSGLRLGTPALTSRGLLEKDFQKVAHFIHRGIELTLQIQSDTGVRATLKEFKE RLAGDKYQAAVQALREEVESFASLFPLPGLPDF" CDS join(13..826,944..1464) /gene="SHMT" /codon_start=1 /product="serine hydroxymethyltransferase" /db_xref="PID:g438634" /translation="MTMPVNGAHKDADLWSSHDKMLAQPLKDSDVEVYNIIKKESNRQ RVGLELIASENFASRAVLEALGSCLNNKYSEGYPGQRYYGGTEFIDELETLCQKRALQ AYKLDPQCWGVNVQPYSGSPANFAVYTALVEPHGRIMGLDLPDGGHLTHGFMTDKKKI SATSIFFESMPYKVNPDTGYINYDQLEENARLFHPKLIIAGTSCYSRNLEYARLRKIA DENGAYLMADMAHISGLVAAGVVPSPFEHCHVVTTTTHKTLRGCRAGMIFYRKGVAVA LKQAMTLEFKVYQHQVVANCRALSEALTELGYKIVTGGSDNHLILVDLRSKGTDGGRA EKVLEACSIACNKNTCPGDRSALRPSGLRLGTPALTSRGLLEKDFQKVAHFIHRGIEL TLQIQSDTGVRATLKEFKERLAGDKYQAAVQALREEVESFASLFPLPGLPDF" CDS join(13..826,1067..1464) /gene="SHMT" /codon_start=1 /product="serine hydroxymethyltransferase" /db_xref="PID:g438635" /translation="MTMPVNGAHKDADLWSSHDKMLAQPLKDSDVEVYNIIKKESNRQ RVGLELIASENFASRAVLEALGSCLNNKYSEGYPGQRYYGGTEFIDELETLCQKRALQ AYKLDPQCWGVNVQPYSGSPANFAVYTALVEPHGRIMGLDLPDGGHLTHGFMTDKKKI SATSIFFESMPYKVNPDTGYINYDQLEENARLFHPKLIIAGTSCYSRNLEYARLRKIA DENGAYLMADMAHISGLVAAGVVPSPFEHCHVVTTTTHKTLRGCRAGMIFYRKGGSDN HLILVDLRSKGTDGGRAEKVLEACSIACNKNTCPGDRSALRPSGLRLGTPALTSRGLL EKDFQKVAHFIHRGIELTLQIQSDTGVRATLKEFKERLAGDKYQAAVQALREEVESFA SLFPLPGLPDF" misc_binding 760..789 /gene="SHMT" /note="This AA sequence (VVTTTTHKTL) was originally identified in rabbit SHMT (Bossa et al., Eur. J. Biochem. 70:397-401 [1976]) as the PLP binding site.; putative" /citation=[2] /citation=[1] /bound_moiety="pyridoxal 5'-phosphate (enzyme cofactor)" /label=PLPbindingsite intron 827..943 /gene="SHMT" /note="An mRNA variant lacking 117 nucleotides from the normal, full-length SHMT open reading frame was isolated using PCR. This mRNA would produce a protein lacking 39 amino acids in this region of mRNA." /citation=[2] /citation=[1] /function="intron region removed from normal, mature mRNA" /evidence=experimental intron 827..1066 /gene="SHMT" /note="This splicing variant lacks 351 nucleotides from the full-length SHMT open reading frame. The shortened SHMT protein would produce a protein lacking 80 amino acids compared to mature SHMT." /citation=[2] /citation=[1] /function="intron removal from mature, full-length SHMT mRNA" /evidence=experimental BASE COUNT 380 a 433 c 458 g 328 t ORIGIN 1 cgaaccagtg caatgacgat gccagtcaac ggggcccaca aggatgctga cctgtggtcc 61 tcacatgaca agatgctggc acaacccctc aaagacagtg atgttgaggt ttacaacatc 121 attaagaagg agagtaaccg gcagagggtt ggattggagc tgattgcctc ggagaatttc 181 gccagccgag cagttttgga ggccctaggc tcttgcttaa ataacaaata ctctgagggg 241 tacccgggcc agagatacta tggcgggact gagtttattg atgaactgga gaccctctgt 301 cagaagcgag ccctgcaggc ctataagctg gacccacagt gctggggggt caacgtccag 361 ccctactcag gctcccctgc aaactttgct gtgtacactg ccctggtgga accccatggg 421 cgcatcatgg gcctggacct tccggatggg ggccacctga cccatgggtt catgacagac 481 aagaagaaaa tctctgccac gtccatcttc tttgaatcta tgccctacaa ggtgaaccca 541 gatactggct acatcaacta tgaccagctg gaggagaacg cacgcctctt ccacccgaag 601 ctgatcatcg caggaaccag ctgctactcc cgaaacctgg aatatgcccg gctacggaag 661 attgcagatg agaacggggc gtatctcatg gcggacatgg ctcacatcag cgggctggtg 721 gcggctggcg tggtgccctc cccatttgaa cactgccatg tggtgaccac caccactcac 781 aagaccctgc gaggctgccg agctggcatg atcttctaca ggaaaggagt gaaaagtgtg 841 gatcccaaga ctggcaaaga gattctgtac aacctggagt ctcttatcaa ttctgctgtg 901 ttccctggcc tgcagggagg tccccacaac cacgccattg ctggggttgc tgtggcactg 961 aagcaagcta tgactctgga atttaaagtt tatcaacacc aggtggtggc caactgcagg 1021 gctctgtctg aggccctgac ggagctgggc tacaaaatag tcacaggtgg ttctgacaac 1081 catttgatcc ttgtggatct ccgttccaaa ggcacagatg gtggaagggc tgagaaggtg 1141 ctagaagcct gttctattgc ctgcaacaag aacacctgtc caggtgacag aagcgctctg 1201 cggcccagtg gactgcggct ggggacccca gcactgacgt cccgtggact tttggaaaaa 1261 gacttccaaa aagtagccca ctttattcac agagggatag agctgaccct gcagatccag 1321 agcgacactg gtgtcagagc caccctgaaa gagttcaagg agagactggc aggggataag 1381 taccaggcgg ccgtgcaggc tctccgggag gaggttgaga gcttcgcctc tctcttccct 1441 ctgcctggcc tgcctgactt ctaaaggagc gggcccactc tggacccacc tggcgccaca 1501 gaggaagctg cctgccggag gacccccacc tgagagatgg atgagctgct ccaaaggggg 1561 actgttgaca ctcgggccct ttgagggggt ttcttttgg // LOCUS HUMHSP60A 2202 bp mRNA PRI 11-JUN-1993 DEFINITION Human chaperonin (HSP60) mRNA, complete cds. ACCESSION M34664 NID g184411 KEYWORDS chaperonin protein. SOURCE Human placenta cDNA to mRNA, and DNA, clone PGEM-10. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2202) AUTHORS Venner,T.J., Singh,B. and Gupta,R.S. TITLE Nucleotide sequences and novel structural features of human and chinese hamster hsp60 (Chaperonin) gene families JOURNAL DNA Cell Biol. 9, 545-552 (1990) MEDLINE 91103874 COMMENT Draft entry and computer-readable sequence for [Unpublished (1990)] kindly submitted by R.S.Gupta, 29-MAY-1990. Author address: R.S.Gupta McMaster University Dept of Biochemistry 1200 Main Street West Hamilton Ontario, CANADA L8N 3Z5 email: IN%GUPTAR.@SSCVAX.McMASTER.CA. FEATURES Location/Qualifiers source 1..2202 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 25..1746 /note="chaperonin (HSP60)" /codon_start=1 /db_xref="PID:g306890" /translation="MLRLPTVFRQMRPVSRVLAPHLTRAYAKDVKFGADARALMLQGV DLLADAVAVTMGPKGRTVIIEQGWGSPKVTKDGVTVAKSIDLKDKYKNIGAKLVQDVA NNTNEEAGDGTTTATVLARSIAKEGFEKISKGANPVEIRRGVMLAVDAVIAELKKQSK PVTTPEEIAQVATISANGDKEIGNIISDAMKKVGRKGVITVKDGKTLNDELEIIEGMK FDRGYISPYFINTSKGQKCEFQDAYVLLSEKKISSIQSIVPALEIANAHRKPLVIIAE DVDGEALSTLVLNRLKVGLQVVAVKAPGFGDNRKNQLKDMAIATGGAVFGEEGLTLNL EDVQPHDLGKVGEVIVTKDDAMLLKGKGDKAQIEKRIQEIIEQLDVTTSEYEKEKLNE RLAKLSDGVAVLKVGGTSDVEVNEKKDRVTDALNATRAAVEEGIVLGGGCALLRCIPA LDSLTPANEDQKIGIEIIKRTLKIPAMTIAKNAGVEGSLIVEKIMQSSSEVGYDAMAG DFVNMVEKGIIDPTKVVRTALLDAAGVASLLTTAEVVVTEIPKEEKDPGMGAMGGMGG GMGGGMF" BASE COUNT 699 a 371 c 538 g 594 t ORIGIN 1 cacgcttgcc gccgccccgc agaaatgctt cggttaccca cagtctttcg ccagatgaga 61 ccggtgtcca gggtactggc tcctcatctc actcgggctt atgccaaaga tgtaaaattt 121 ggtgcagatg cccgagcctt aatgcttcaa ggtgtagacc ttttagccga tgctgtggcc 181 gttacaatgg ggccaaaggg aagaacagtg attattgagc agggttgggg aagtcccaaa 241 gtaacaaaag atggtgtgac tgttgcaaag tcaattgact taaaagataa atacaagaac 301 attggagcta aacttgttca agatgttgcc aataacacaa atgaagaagc tggggatggc 361 actaccactg ctactgtact ggcacgctct atagccaagg aaggcttcga gaagattagc 421 aaaggtgcta atccagtgga aatcaggaga ggtgtgatgt tagctgttga tgctgtaatt 481 gctgaactta aaaagcagtc taaacctgtg accacccctg aagaaattgc acaggttgct 541 acgatttctg caaacggaga caaagaaatt ggcaatatca tctctgatgc aatgaaaaaa 601 gttggaagaa agggtgtcat cacagtaaag gatggaaaaa cactgaatga tgaattagaa 661 attattgaag gcatgaagtt tgatcgaggc tatatttctc catactttat taatacatca 721 aaaggtcaga aatgtgaatt ccaggatgcc tatgttctgt tgagtgaaaa gaaaatttct 781 agtatccagt ccattgtacc tgctcttgaa attgccaatg ctcaccgtaa gcctttggtc 841 ataatcgctg aagatgttga tggagaagct ctaagtacac tcgtcttgaa taggctaaag 901 gttggtcttc aggttgtggc agtcaaggct ccagggtttg gtgacaatag aaagaaccag 961 cttaaagata tggctattgc tactggtggt gcagtgtttg gagaagaggg attgaccctg 1021 aatcttgaag acgttcagcc tcatgactta ggaaaagttg gagaggtcat tgtgaccaaa 1081 gacgatgcca tgctcttaaa aggaaaaggt gacaaggctc aaattgaaaa acgtattcaa 1141 gaaatcattg agcagttaga tgtcacaact agtgaatatg aaaaggaaaa actgaatgaa 1201 cggcttgcaa aactttcaga tggagtggct gtgctgaagg ttggtgggac aagtgatgtt 1261 gaagtgaatg aaaagaaaga cagagttaca gatgccctta atgctacaag agctgctgtt 1321 gaagaaggca ttgttttggg agggggttgt gccctccttc gatgcattcc agccttggac 1381 tcattgactc cagctaatga agatcaaaaa attggtatag aaattattaa aagaacactc 1441 aaaattccag caatgaccat tgctaagaat gcaggtgttg aaggatcttt gatagttgag 1501 aaaattatgc aaagttcctc agaagttggt tatgatgcta tggctggaga ttttgtgaat 1561 atggtggaaa aaggaatcat tgacccaaca aaggttgtga gaactgcttt attggatgct 1621 gctggtgtgg cctctctgtt aactacagca gaagttgtag tcacagaaat tcctaaagaa 1681 gagaaggacc ctggaatggg tgcaatgggt ggaatgggag gtggtatggg aggtggcatg 1741 ttctaactcc tagactagtg ctttaccttt attaatgaac tgtgacagga agcccaaggc 1801 agtgttcctc accaataact tcagagaagt cagttggaga aaatgaagaa aaaggctggc 1861 tgaaaatcac tataaccatc agttactggt ttcagttgac aaaatatata atggtttact 1921 gctgtcattg tccatgccta cagataattt attttgtatt tttgaataaa aaacatttgt 1981 acattcctga tactgggtac aagagccatg taccagtgta ctgctttcaa cttaaatcac 2041 tgaggcattt ttactactat tctgttaaaa tcaggatttt agtgcttgcc accaccagat 2101 gagaagttaa gcagcctttc tgtggagagt gagaataatt gtgtacaaag tagagaagta 2161 tccaattatg tgacaacctt tgtgtaataa aaatttgttt aa // LOCUS HUMHSP70H 2391 bp mRNA PRI 08-SEP-1993 DEFINITION Human heat shock protein 70 (hsp70) mRNA, complete cds. ACCESSION L12723 NID g292159 KEYWORDS heat shock protein; heat shock protein 70; major histocompatibility complex class III. SOURCE Homo sapiens (individual_isolate patient RY) (library: Clonetech lambda gt11; Bluescript) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2391) AUTHORS Fathallah,D.M., Cherif,D., Dellagi,K. and Arnaout,M.A. TITLE Molecular cloning of a novel human hsp70 from a B cell line and its assignment to chromosome 5 JOURNAL J. Immunol. 151, 810-813 (1993) MEDLINE 93329076 FEATURES Location/Qualifiers source 1..2391 /organism="Homo sapiens" /isolate="patient RY" /db_xref="taxon:9606" /cell_line="EBV transformed B-cell" /cell_type="leukocyte" /tissue_lib="Clonetech lambda gt11; Bluescript" /map="5q31.1-q31.2" gene 134..2239 /gene="hsp70" CDS 134..2239 /gene="hsp70" /codon_start=1 /product="heat shock protein 70" /db_xref="PID:g292160" /translation="MSVVGIDLGFQSCYVAVARAGGIETIANEYSDRCTPACISFGPK NRSIGAAAKSQVISNAKNTVQGFKRFHGRAFSDPFVEAEKSNLAYDIVQWPTGLTGIK VTYMEEERNFTTEQVTAMLLSKLKETAESVLKKPVVDCVVSVPCFYTDAERRSVMDAT QIAGLNCLRLMNETTAVALAYGIYKQDLPRLEEKPRNVVFVDMGHSAYQVSVCAFNRG KLKVLATAFDTTLGGRKFDEVLVNHFCEEFGKKYKLDIKSKIRALLRLSQECEKLKKL MSANASDLPLSIECFMNDVDVSGTMNRGKFLEMCNDLLARVEPPLRSVLEQTKLKKED IYAVEIVGGATRIPAVKEKISKFFGKELSTTLNADEAVTRGCALQCAILSPAFKVREF SITDVVPYPISLRWNSPAEEGSSDCEVFSKNHAAPFSKVLTFYRKEPFTLEAYYSSPQ DLPYPDPAIAQFSVQKVTPQSDGSSSKVKVKVRVNVHGIFSVSSASLVEVHKSEENEE PMETDQNAKEEEKMQVDQEEPHVEEQQQQTPAENKAESEEMETSQAGSKDKKMDQPPQ CQEGKSEDQYCGPANRESAIWQIDREMLNLYIENEGKMIMQDKLEKERNDAKNAVEEY VYEMRDKLSGEYEKFVSEDDRNSFTLKLEDTENWLYEDGEDQPKQVYVDKLAELKNLG QPIKIRFQESEERPNYLKN" BASE COUNT 753 a 444 c 592 g 602 t ORIGIN 1 tgcggccgca cctgcaggcg cagagtaggt atggaagatc cctcgagatc cattgtgctc 61 taaagccgcc gggggtccgt gtcctgtctc ggttggccgg acccgggccc gagcccgagc 121 agtagccggc gccatgtcgg tggtgggcat agacctgggc ttccagagct gctacgtcgc 181 tgtggcccgc gccggcggca tcgagactat cgctaatgag tatagcgacc gctgcacgcc 241 ggcttgcatt tcttttggtc ctaagaatcg ttcaattgga gcagcagcta aaagccaggt 301 aatttctaat gcaaagaaca cagtccaagg atttaaaaga ttccatggcc gagcattctc 361 tgatccattt gtggaggcag aaaaatctaa ccttgcatat gatattgtgc agtggcctac 421 aggattaaca ggtataaagg tgacatatat ggaggaagag cgaaatttta ccactgagca 481 agtgactgcc atgcttttgt ccaaactgaa ggagacagcc gaaagtgttc ttaagaagcc 541 tgtagttgac tgtgttgttt cggttccttg tttctatact gatgcagaaa gacgatcagt 601 gatggatgca acacagattg ctggtcttaa ttgcttgcga ttaatgaatg aaaccactgc 661 agttgctctt gcatatggaa tctataagca ggatcttcct cgcttagaag agaaaccaag 721 aaatgtagtt tttgtagaca tgggccactc tgcttatcaa gtttctgtat gtgcatttaa 781 tagaggaaaa ctgaaagttc tggccactgc atttgacacg acattgggag gtagaaaatt 841 tgatgaagtg ttagtaaatc acttctgtga agaatttggg aagaaataca agctagacat 901 taagtccaaa atccgtgcat tattacgact ctctcaggag tgtgagaaac tcaagaaatt 961 gatgagtgca aatgcttcag atctcccttt gagcattgaa tgttttatga atgatgttga 1021 tgtatctgga actatgaata gaggcaaatt tctggagatg tgcaatgatc tcttagctag 1081 agtggagcca ccacttcgta gtgttttgga acaaaccaag ttaaagaaag aagatattta 1141 tgcagtggag atagttggtg gtgctacacg aatccctgcg gtaaaagaga agatcagcaa 1201 atttttcggt aaagaactta gtacaacatt aaatgctgat gaagctgtca ctcgaggctg 1261 tgcattgcag tgtgccatct tatcgcctgc tttcaaagtc agagaatttt ctatcactga 1321 tgtagtacca tatccaatat ctctgagatg gaattctcca gctgaagaag ggtcaagtga 1381 ctgtgaagtc ttttccaaaa atcatgctgc tcctttctct aaagttctta cattttatag 1441 aaaggaacct ttcactcttg aggcctacta cagctctcct caggatttgc cctatccaga 1501 tcctgctata gctcagtttt cagttcagaa agtcactcct cagtctgatg gctccagttc 1561 aaaagtgaaa gtcaaagttc gagtaaatgt ccatggcatt ttcagtgtgt ccagtgcatc 1621 tttagtggag gttcacaagt ctgaggaaaa tgaggagcca atggaaacag atcagaatgc 1681 aaaggaggaa gagaagatgc aagtggacca ggaggaacca catgttgaag agcaacagca 1741 gcagacacca gcagaaaata aggcagagtc tgaagaaatg gagacctctc aagctggatc 1801 caaggataaa aagatggacc aaccacccca atgccaagaa ggcaaaagtg aagaccagta 1861 ctgtggacct gccaatcgag aatcagctat atggcagata gacagagaga tgctcaactt 1921 gtacattgaa aatgagggta agatgatcat gcaggataaa ctggagaagg agcggaatga 1981 tgctaagaac gcagtggagg aatatgtgta tgaaatgaga gacaagctta gtggtgaata 2041 tgagaagttt gtgagtgaag atgatcgtaa cagttttact ttgaaactgg aagatactga 2101 aaattggttg tatgaggatg gagaagacca gccaaagcaa gtttatgttg ataagttggc 2161 tgaattaaaa aatctaggtc aacctattaa gatacgtttc caggaatctg aagaacgacc 2221 aaattatttg aagaactaga gaacttaggg aaacagatcc aacagtatat gaaaataatc 2281 agctcctttc aaaaacaagg aggaccaggt atgatcattt ggatgctgct gacatgacaa 2341 aggtagaaaa aagcacaaat gaagcaatgg agtggatgga agtcacacca a // LOCUS HUMHSPG2B 14327 bp mRNA PRI 08-NOV-1994 DEFINITION Human heparan sulfate proteoglycan (HSPG2) mRNA, complete cds. ACCESSION M85289 NID g184426 KEYWORDS HSPG2 gene; heparan sulfate proteoglycan. SOURCE Homo sapiens skin; colon cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 14327) AUTHORS Dodge,G.R., Kovalszky,I., Chu,M.L., Hassell,J.R., McBride,O.W., Yi,H.F. and Iozzo,R.V. TITLE Heparan sulfate proteoglycan of human colon: partial molecular cloning, cellular expression, and mapping of the gene (HSPG2) to the short arm of human chromosome 1 JOURNAL Genomics 10 (3), 673-680 (1991) MEDLINE 91365376 REFERENCE 2 (bases 1 to 14327) AUTHORS Murdoch,A.D., Dodge,G.R., Cohen,I., Tuan,R.S. and Iozzo,R.V. TITLE Primary structure of the human heparan sulfate proteoglycan from basement membrane (HSPG2/perlecan). A chimeric molecule with multiple domains homologous to the low density lipoprotein receptor, laminin, neural cell adhesion molecules, and epidermal growth factor JOURNAL J. Biol. Chem. 267 (12), 8544-8557 (1992) MEDLINE 92235084 FEATURES Location/Qualifiers source 1..14327 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="WiDr; CRL 1262" /cell_type="fibroblast; amnion" /tissue_type="skin; colon" /map="1p36.1-p35" gene 81..13256 /gene="HSPG2" CDS 81..13256 /gene="HSPG2" /codon_start=1 /db_xref="GDB:G00-126-372" /product="heparan sulfate proteoglycan" /db_xref="PID:g184427" /translation="MGWRAPGALLLALLLHGRLLAVTHGLRAYDGLSLPEDIETVTAS QMRWTHSYLSDDEYMLADSISGDDLGSGDLGSGDFQMVYFRALVNFTRSIEYSPQLED AGSREFREVSEAVVDTLESEYLKIPGDQVVSVVFIKELDGWVFVELDVGSEGNADGAQ IQEMLLRVISSGSVASYVTSPQGFQFRRLGTVPQFPRACTEAEFACHSYNECVALEYR CDRRPDCRDMSDELNCEEPVLGISPTFSLLVETTSLPPRPETTIMRQPPVTHAPQPLL PGSVRPLPCGPQEAACRNGHCIPRDYLCDGQEDCEDGSDELDCGPPPPCEPNEFPCGN GHCALKLWRCDGDFDCEDRTDEANCPTKRPEEVCGPTQFRCVSTNMCIPASFHCDEES DCPDRSDEFGCMPPQVVTPPRESIQASRGQTVTFTCVAIGVPTPIINWRLNWGHIPSH PRVTVTSEGGRGTLIIRDVKESDQGAYTCEAMNARGMVFGIPDGVLELVPQRGPCPDG HFYLEHSAACLPCFCFGITSVCQSTRRFRDQIRLRFDQPDDFKGVNVTMPAQPGTPPL SSTQLQIDPSLHEFQLVDLSRRFLVHDSFWALPEQFLGNKVDSYGGSLRYNVRYELAR GMLEPVQRPDVVLVGAGYRLLSRGHTPTQPGALNQRQVQFSEEHWVHESGRPVQRAEL LQVLQSLEAVLIQTVYNTKMASVGLSDIAMDTTVTHATSHGRAHSVEECRCPIGYSGL SCESCDAHFTRVPGGPYLGTCSGCSCNGHASSCDPVYGHCLNCQHNTEGPQCNKCKAG FFGDAMKATATSCRPCPCPYIDASRRFSDTCFLDTDGQATCDACAPGYTGRRCESCAP GYEGNPIQPGGKCRPVNQEIVRCDERGSMGTSGEACRCKNNVVGRLCNECADGSFHLS TRNPDGCLKCFCMGVSRHCTSSSWSRAQLHGASEEPGHFSLTNAASTHTTNEGIFSPT PGELGFSSFHRLLSGPYFWSLPSRFLGDKVTSYGGELRFTVTQRSQPGSTPLHGQPLV VLQGNNIILEHHVAQEPSPGQPSTFIVPFREQAWQRPDGQPATREHLLMALAGIDTLL IRASYAQQPAESRVSGISMDVAVPEETGQDPALEVEQCSCPPGYRGPSCQDCDTGYTR TPSGLYLGTCERCSCHGHSEACEPETGACQGCQHHTEGPRCEQCQPGYYGDAQRGTPQ DCQLCPCYGDPAAGQAAHTCFLDTDGHPTCDACSPGHSGRHCERCAPGYYGNPSQGQP CQRDSQVPGPIGCNCDPQGSVSSQCDAAGQCQCKAQVEGLTCSHCRPHHFHLSASNPD GCLPCFCMGITQQCASSAYTRHLISTHFAPGDFQGFALVNPQRNSRLTGEFTVEPVPE GAQLSFGNFAQLGHESFYWQLPETYQGDKVAAYGGKLRYTLSYTAGPQGSPLSDPDVQ ITGNNIMLVASQPALQGPERRSYEIMFREEFWRRPDGQPATREHLLMALADLDELLIR ATFSSVPLVASISAVSLEVAQPGPSNRPRALEVEECRCPPGYIGLSCQDCAPGYTRTG SGLYLGHCELCECNGHSDLCHPETGACSQCQHNAAGEFCELCAPGYYGDATAGTPEDC QPCACPLTNPENMFSRTCESLGAGGYRCTACEPGYTGQYCEQCGPGYVGNPSVQGGQC LPETNQAPLVVEVHPARSIVPQGGSHSLRCQVSGSPPHYFYWSREDGRPVPSGTQQRH QGSELHFPSVQPSDAGVYICTCRNLHQSNTSRAELLVTEAPSKPITVTVEEQRSQSVR PGADVTFICTAKSKSPAYTLVWTRLHNGKLPTRAMDFNGILTIRNVQLSDAGTYVCTG SNMFAMDQGTATLHVQASGTLSAPVVSIHPPQLTVQPGQLAEFRCSATGSPTPTLEWT GGPGGQLPAKAQIHGGILRLPAVEPTDQAQYLCRAHSSAGQQVARAVLHVHGGGGPRV QVSPERTQVHAGRTVRLYCRAAGVPSATITWRKEGGSLPPQARSERTDIATLLIPAIT TADAGFYLCVATSPAGTAQARMQVVVLSASDASPPGVKIESSSPSVTEGQTLDLNCVV AGSAHAQVTWYRRGGSLPPHTQVHGSRLRLPQVSPADSGEYVCRVENGSGPKEASITV SVLHGTHSGPSYTPVPGSTRPIRIEPSSSHVAEGQTLDLNCVVPGQAHAQVTWHKRGG SLPARHQTHGSLLRLHQVTPADSGEYVCHVVGTSGPLEASVLVTIEASVIPGPIPPVR IESSSSTVAEGQTLDLSCVVAGQAHAQVTWYKRGGSLPARHQVRGSRLYIFQASPADA GQYVCRASNGMEASITVTVTGTQGANLAYPAGSTQPIRIEPSSSQVAEGQTLDLNCVV PGQSHAQVTWHKRGGSLPVRHQTHGSLLRLYQASPADSGEYVCRVLGSSVPLEASVLV TIEPAGSVPALGVTPTVRIESSSSQVAEGQTLDLNCLVAGQAHAQVTWHKRGGSLPAR HQVHGSRLRLLQVTPADSGEYVCRVVGSSGTQEASVLVTIQQRLSGSHSQGVAYPVRI ESSSASLANGHTLDLNCLVASQAPHTITWYKRGGSLPSRHQIVGSRLRIPQVTPADSG EYVCHVSNGAGSRETSLIVTIQGSGSSHVPSVSPPIRIESSSPTVVEGQTLDLNCVVA RQPQAIITWYKRGGSLPSRHQTHGSHLRLHQMSVADSGEYVCRANNNIDALEASIVIS VSPSAGSPSAPGSSMPIRIESSSSHVAEGETLDLNCVVPGQAHAQVTWHKRGGSLPSH HQTRGSRLRLHHVSPADSGEYVCRVMGSSGPLEASVLVTIEASGSSAVHVPAPGGAPP IRIEPSSSRVAEGQTLDLKCVVPGQAHAQVTWHKRGGNLPARHQVHGPLLRLNQVSPA DSGEYSCQVTGSSGTLEASVLVTIEPSSPGPIPAPGLAQPIYIEASSSHVTEGQTLDL NCVVPGQAHAQVTWYKRGGSLPARHQTHGSQLRLHLVSPADSGEYVCRAASGPGPEQE ASFTVTVPPSEGSSYRLRSPVISIDPPSSTVQQGQDASFKCLIHDGAAPISLEWKTRN QELEDNVHISPNGSIITIVGTRPSNHGTYRCVASNAYGVAQSVVNLSVHGPPTVSVLP EGPVWVKVGKAVTLECVSAGEPRSSARWTRISSTPAKLEQRTYGLMDSHAVLQISSAK PSDAGTYVCLAQNALGTAQKQVEVIVDTGAMAPGAPQVQAEEAELTVEAGHTATLRCS ATGSPAPTIHWSKLRSPLPWQHRLEGDTLIIPRVAQQDSGQYICNATSPAGHAEATII LHVESPPYATTVPEHASVQAGETVQLQCLAHGTPPLTFQWSRVGSSLPGRATARNELL HFERAAPEDSGRYRCRVTNKVGSAEAFAQLLVQGPPGSLPATSIPAGSTPTVQVTPQL ETKSIGASVEFHCAVPSDQGTQLRWFKEGGQLPPGHSVQDGVLRIQNLDQSCQGTYIC QAHGPWGKAQASAQLVIQALPSVLINIRTSVQTVVVGHAVEFECLALGDPKPQVTWSK VGGHLRPGIVQSGGVVRIAHVELADAGQYRCTATNAAGTTQSHVLLLVQALPQISMPQ EVRVPAGSAAVFPCIASGYPTPDISWSKLDGSLPPDSRLENNMLMLPSVRPQDAGTYV CTATNRQGKVKAFAHLQVPERVVPYFTQTPYSFLPLPTIKDAYRKFEIKITFRPDSAD GMLLYNGQKRVPGSPTNLANRQPDFISFGLVGGRPEFRFDAGSGMATIRHPTPLALGH FHTVTLLRSLTQGSLIVGDLAPVNGTSQGKFQGLDLNEELYLGGYPDYGAIPKAGLSS GFIGCVRELRIQGEEIVFHDLNLTAHGISHCPTCRDRPCQNGGQCHDSESSSYVCVCP AGFTGSRCEHSQALHCHPEACGPDATCVNRPDGRGYTCRCHLGRSGLRCEEGVTVTTP SLSGAGSYLALPALTNTHHELRLDVEFKPLAPDGVLLFSGGKSGPVEDFVSLAMVGGH LEFRYELGSGLAVLRSAEPLALGRWHRVSAERLNKDGSLRVNGGRPVLRSSPGKSQGL NLHTLLYLGGVEPSVPLSPATNMSAHFRGCVGEVSVNGKRLDLTYSFLGSQGIGQCYD SSPCERQPCQHGATCMPAGEYEFQCLCRDGFKGDLCEHEENPCQLREPCLHGGTCQGT RCLCLPGFSGPRCQQGSGHGIAESDWHLEGSGGNDAPGQYGAYFHDDGFLAFPGHVFS RSLPEVPETIELEVRTSTASGLLLWQGVEVGEAGQGKDFISLGLQDGHLVFRYQLGSG EARLVSEDPINDGEWHRVTALREGRRGSIQVDGEELVSGRSPGPNVAVNAKGSVYIGG APDVATLTGGRFSSGITGCVKNLVLHSARPGAPPPQPLDLQHRAQAGANTRPCPS" sig_peptide 81..143 /gene="HSPG2" /note="G00-126-372" misc_feature 144..659 /gene="HSPG2" /note="unique region with 3 glycosamnioglycan attachment sites (domain I); G00-126-372" mat_peptide 144..13253 /gene="HSPG2" /note="G00-126-372" /product="heparan sulfate proteoglycan" misc_feature 660..1289 /gene="HSPG2" /note="shares homology with low density lipoprotein (LDL) receptor (domain II); G00-126-372" misc_feature 1593..5108 /gene="HSPG2" /note="shares homology with the short arm of laminin A chain (domain III); G00-126-372" repeat_region 5109..11138 /gene="HSPG2" /note="shares homology with immunoglobulin repeats of N-CAM (domain IV); G00-126-372" misc_feature 11139..13253 /gene="HSPG2" /note="shares homology with G-domain of laminin A chain and EGF (domain V); G00-126-372" polyA_site 14327 /gene="HSPG2" /note="G00-126-372" BASE COUNT 2615 a 4940 c 4299 g 2473 t ORIGIN 1 ggccggcgag cgggcggctg cgggcggcgc ggagcgggcg gcgcggagcg agcgagcgag 61 agagcggcgc gggccgggcc atggggtggc gggcgccggg cgcgctgctg ctggcgctgc 121 tgctgcacgg gcggctgctg gcggtgaccc atgggctgag ggcatacgat ggcttgtctc 181 tgcctgagga catagagacc gtcacagcaa gccaaatgcg ctggacacat tcgtaccttt 241 ctgatgatga gtacatgctg gctgacagca tctcaggaga cgacctgggc agtggggacc 301 tgggcagcgg ggacttccag atggtttatt tccgagccct ggtgaatttc actcgctcca 361 tcgagtacag ccctcagctg gaggatgcag gctccagaga gtttcgagag gtgtccgagg 421 ctgtggtaga cacgctggag tcggagtact tgaaaattcc cggagaccag gttgtcagtg 481 tggtgttcat caaggagctg gatggctggg tttttgtgga gctcgatgtg ggctcggaag 541 ggaatgcgga tggtgctcag attcaggaga tgctgctcag ggtcatctcc agcggctctg 601 tggcctccta cgtcacctct ccccagggat tccagttccg acgcctgggc acagtgcccc 661 agttcccaag agcctgcacg gaggccgagt ttgcctgcca cagctacaat gagtgtgtgg 721 ccctggagta tcgctgtgac cggcggcccg actgcaggga catgtctgat gagctcaatt 781 gtgaggagcc agtcctgggt atcagcccca cattctctct ccttgtggag acgacatctt 841 taccgccccg gccagagaca accatcatgc gacagccacc agtcacccac gctcctcagc 901 ccctgcttcc cggttccgtc aggcccctgc cctgtgggcc ccaggaggcc gcatgccgca 961 atgggcactg catccccaga gactacctct gcgacggaca ggaggactgc gaggacggca 1021 gcgatgagct agactgtggc cccccgccac cctgtgagcc caacgagttc ccctgcggga 1081 atggacattg tgccctcaag ctgtggcgct gcgatggtga ctttgactgt gaggaccgaa 1141 ctgatgaagc caactgcccc accaagcgtc ctgaggaagt gtgcgggccc acacagttcc 1201 gatgcgtctc taccaacatg tgcatcccag ccagcttcca ctgtgacgag gagagcgact 1261 gtcctgaccg gagcgacgag tttggctgca tgccccccca ggtggtgaca cctccccggg 1321 agtccatcca ggcttcccgg ggccagacag tgaccttcac ctgcgtggcc attggcgtcc 1381 ccacccccat catcaattgg aggctcaact ggggccacat cccctctcat cccagggtga 1441 cagtgaccag cgagggtggc cgtggcacac tgatcatccg tgatgtgaag gagtcagacc 1501 agggtgccta cacctgtgag gccatgaacg cccggggcat ggtgtttggc attcctgacg 1561 gtgtccttga gctcgtccca caacgaggcc cctgccctga cggccacttc tacctggagc 1621 acagcgccgc ctgcctgccc tgcttctgct ttggcatcac cagcgtgtgc cagagcaccc 1681 gccgcttccg ggaccagatc aggctgcgct ttgaccaacc cgatgacttc aagggtgtga 1741 atgtgacaat gcctgcgcag cccggcacgc cacccctctc ctccacgcag ctgcagatcg 1801 acccatccct gcacgagttc cagctagtag acctgtcccg ccgcttcctc gtccacgact 1861 ccttctgggc tctgcctgaa cagttcctgg gcaacaaggt ggactcctat ggcggctccc 1921 tgcgttacaa cgtgcgctac gagttggccc gtggcatgct ggagccagtg cagcggccgg 1981 acgtggtcct cgtgggtgcc gggtaccgcc tcctctcccg aggccacaca cccacccaac 2041 ctggtgctct gaaccagcgc caggtccagt tctctgagga gcactgggtc catgagtctg 2101 gccggccggt gcagcgcgcg gagctgctgc aggtgctgca gagcctggag gccgtgctca 2161 tccagaccgt gtacaacacc aagatggcta gcgtgggact tagcgacatc gccatggata 2221 ccaccgtcac ccatgccacc agccatggcc gtgcccacag tgtggaggag tgcagatgcc 2281 ccattggcta ttctggcttg tcctgcgaga gctgtgatgc ccacttcact cgggtgcctg 2341 gtgggcccta cctgggcacc tgctctggtt gcagttgcaa tggccatgcc agctcctgtg 2401 accctgtgta tggccactgc ctgaattgcc agcacaacac ggaggggcca cagtgcaaca 2461 agtgcaaggc tggcttcttt ggggacgcca tgaaggccac ggccacttcc tgccggccct 2521 gcccttgccc atacatcgat gcctcccgca gattctcaga cacttgcttc ctggacacgg 2581 atggccaagc cacatgtgac gcctgtgccc caggctacac tggccgccgc tgtgagagct 2641 gtgcccccgg atacgagggc aaccccatcc agcccggcgg gaagtgcagg cccgtcaacc 2701 aggagattgt gcgctgtgac gagcgtggca gcatggggac ctccggggag gcctgccgct 2761 gtaagaacaa tgtggtgggg cgcttgtgca atgaatgtgc tgacggctct ttccacctga 2821 gtacccgaaa ccccgatggc tgcctcaagt gcttctgcat gggtgtcagt cgccactgca 2881 ccagctcttc atggagccgt gcccagttgc atggggcctc tgaggagcct ggtcacttca 2941 gcctgaccaa cgccgcaagc acccacacca ccaacgaggg catcttctcc cccacgcccg 3001 gggaactggg attctcctcc ttccacagac tcttatctgg accctacttc tggagcctcc 3061 cttcacgctt cctgggggac aaggtgacct cctatggagg agagctgcgc ttcacagtga 3121 cccagaggtc ccagccgggc tccacacccc tgcacgggca gccgttggtg gtgctgcaag 3181 gtaacaacat catcctagag caccatgtgg cccaggagcc cagccccggc cagcccagca 3241 ccttcattgt gcctttccgg gagcaagcat ggcagcggcc cgatgggcag ccagccacac 3301 gggagcacct gctgatggca ctggcaggca tcgacaccct cctgatccga gcatcctacg 3361 cccagcagcc cgctgagagc agggtctctg gcatcagcat ggacgtggct gtgcccgagg 3421 aaaccggcca ggaccccgcg ctggaagtgg aacagtgctc ctgcccaccc gggtaccgtg 3481 ggccgtcctg ccaggactgt gacacaggct acacacgcac gcccagtggc ctctacctgg 3541 gtacctgtga acgctgcagc tgccatggcc actcagaggc ctgcgagcca gaaacaggtg 3601 cctgccaggg ctgccagcat cacacggagg gccctcggtg tgagcagtgc cagccaggat 3661 actacgggga cgcccagcgg gggacaccac aggactgcca gctgtgcccc tgctacggag 3721 accctgctgc cggccaggct gcccacactt gttttctgga cacagacggc caccccacct 3781 gtgatgcgtg ctccccaggc cacagtgggc gtcactgtga gaggtgcgcc cctggctact 3841 atggcaaccc cagccagggc cagccatgcc agagagacag ccaggtgcca gggcccatag 3901 gctgcaactg tgacccccaa ggcagcgtca gcagccagtg tgatgctgct ggtcagtgcc 3961 agtgcaaggc ccaggtagaa ggcctcactt gcagccactg ccggccccac cacttccacc 4021 tgagtgccag caacccagac ggctgcctgc cctgcttctg tatgggcatc acccagcagt 4081 gcgccagctc tgcctacaca cgccacctga tctccaccca ctttgcccct ggggacttcc 4141 aaggctttgc cctggtgaac ccacagcgaa acagccgcct gacaggagaa ttcactgtgg 4201 aacccgtgcc cgagggtgcc cagctctctt ttggcaactt tgcccaactc ggccatgagt 4261 ccttctactg gcagctgccg gagacatacc agggagacaa ggtggcggcc tacggtggga 4321 agttgcgata caccctctcc tacacagcag gcccacaggg cagcccactc tcggaccccg 4381 atgtgcagat cacgggcaac aacatcatgc tagtggcctc ccagccagcg ctgcagggcc 4441 cagagaggag gagctacgag atcatgttcc gagaggaatt ctggcgccgg cccgatgggc 4501 agccggccac acgcgagcac ctcctgatgg cactggccga cctggatgag ctcctgatcc 4561 gggccacgtt ctcctccgtg ccgctggtgg ccagcatcag cgcagtcagc ctggaggtcg 4621 cccagccggg gccctcaaac agaccccgcg ccctcgaggt ggaggagtgc cgctgcccgc 4681 caggctacat cggtctgtcc tgccaggact gtgcccccgg ctacacgcgc accgggagtg 4741 ggctctacct cggccactgc gagctatgtg aatgcaatgg ccactcagac ctgtgccacc 4801 cagagactgg ggcctgctcg caatgccagc acaacgccgc aggggagttc tgcgagcttt 4861 gtgcccctgg ctactacgga gatgccacag ccgggacgcc tgaggactgc cagccctgtg 4921 cctgcccact gaccaaccca gagaacatgt tttcccgcac ctgtgagagc ctgggagccg 4981 gcgggtaccg ctgcacggcc tgcgaacccg gctacactgg ccagtactgt gagcagtgtg 5041 gcccaggtta cgtgggtaac cccagtgtgc aagggggcca gtgcctgcca gagacaaacc 5101 aagccccact ggtggtcgag gtccatcctg ctcgaagcat agtgccccaa ggtggctccc 5161 actccctgcg gtgtcaggtc agtgggagcc caccccacta cttctattgg tcccgtgagg 5221 atgggcggcc tgtgcccagc ggcacccagc agcgacatca aggctccgag ctccacttcc 5281 ccagcgtcca gccctcggat gctggggtct acatttgcac ctgccgtaat ctccaccaat 5341 ccaataccag ccgggcagag ctgctggtca ctgaggctcc aagcaagccc atcacagtga 5401 ctgtggagga gcagcggagc cagagcgtgc gccccggagc tgacgtcacc ttcatctgca 5461 cagccaaaag caagtcccca gcctataccc tggtgtggac ccgcctgcac aacgggaaac 5521 tgcccacccg agccatggat ttcaatggca tcctgaccat tcgcaacgtc cagctgagtg 5581 atgcaggcac ctacgtgtgc accggctcca acatgtttgc catggaccag ggcacagcca 5641 ctctacatgt gcaggcctcg ggcaccttgt ccgcccccgt ggtctccatc catccgccac 5701 agctcacagt gcagcccggg caactggcgg agttccgctg cagcgccaca gggagcccca 5761 cgcccaccct cgagtggaca gggggccccg gcggccagct ccctgcgaag gcacaaatcc 5821 acggcggcat cctgcgcctg ccagctgtcg agcccacgga tcaggcccag tacttgtgcc 5881 gagcccacag cagcgctggg cagcaggtgg ccagggctgt gctccacgtg catgggggcg 5941 gtgggcccag agtccaagtg agcccagaga ggacccaggt ccacgcaggc cggaccgtca 6001 ggctgtactg cagggctgca ggcgtgccta gcgccaccat cacctggagg aaggaagggg 6061 gcagcctccc accacaggcc cggtcagagc gcacagacat cgcgacactg ctcatcccag 6121 ccatcacgac tgctgacgcc ggcttctacc tctgcgtggc caccagccct gcaggcactg 6181 cccaggcccg gatgcaagtg gttgtccttt cagcctcaga tgccagccca ccgggggtca 6241 agattgagtc ctcatcgcct tctgtgacag aagggcaaac actcgacctc aactgtgtgg 6301 tggcagggtc agcccatgcc caggtcacct ggtacaggcg agggggtagc ctgcctcccc 6361 acacccaggt gcacggctcc cgtctgcggc tcccccaggt ctcaccagct gattctggag 6421 aatatgtgtg ccgtgtggag aatggatcgg gccccaagga ggcctccatt actgtgtctg 6481 tgctccacgg cacccattct ggccccagct acaccccagt gcccggcagc acccggccca 6541 tccgcatcga gccctcctcc tcacacgtgg cggaagggca gaccctggat ctgaactgcg 6601 tggtgcccgg gcaggcccac gcccaggtca cgtggcacaa gcgtgggggc agcctccctg 6661 cccggcacca gacccacggc tcgctgctgc ggctgcacca ggtgaccccg gccgactcag 6721 gcgagtatgt gtgccatgtg gtgggcacct ccggccccct agaggcctca gtcctggtca 6781 ccatcgaagc ctctgtcatc cctggaccca tcccacctgt caggatcgag tcttcatcct 6841 ccacagtggc cgagggccag accctggatc tgagctgcgt ggtggcaggg caggcccacg 6901 cccaggtcac atggtacaag cgtgggggca gcctccctgc ccggcaccag gttcgtggct 6961 cccgcctgta catcttccag gcctcacctg ccgatgcggg acagtacgtc tgccgggcca 7021 gcaacggcat ggaggcctcc atcacggtca cagtaactgg gacccagggg gccaacttag 7081 cctaccctgc cggcagcacc cagcccatcc gcatcgagcc ctcctcctcg caagtggcgg 7141 aagggcagac cctggatctg aactgcgtgg tgcccgggca gtcccatgcc caggtcacgt 7201 ggcacaagcg tgggggcagc ctccctgtcc ggcaccagac ccacggctcc ctgctgagac 7261 tctaccaagc gtcccccgcc gactcgggcg agtacgtgtg ccgagtgttg ggcagctccg 7321 tgcctctaga ggcctctgtc ctggtcacca ttgagcctgc gggctcagtg cctgcacttg 7381 gggtcacccc cacggtccgg atcgagtcat cgtcttcgca agtggccgag gggcagaccc 7441 tggacctgaa ctgcctcgtt gctggtcagg cccatgccca ggtcacgtgg cacaagcgcg 7501 ggggcagcct cccggcccgg caccaggtgc atggctcgag gctacgcctg ctccaggtga 7561 ccccagctga ttcaggggag tacgtgtgcc gtgtggtcgg cagctcaggt acccaggaag 7621 cctcagtcct tgtcaccatc cagcagcgcc ttagtggctc ccactcccag ggtgtggcgt 7681 accccgtccg catcgagtcc tcctcagcct ccctggccaa tggacacacc ctggacctca 7741 actgcctggt tgccagccag gctccccaca ccatcacctg gtataagcgt ggaggcagct 7801 tacccagccg gcaccagatc gtgggctccc ggctgcggat ccctcaggtg actccggcag 7861 actcgggcga gtacgtgtgt cacgtcagta acggtgcagg ctcccgggag acctcgctca 7921 tcgtcaccat ccagggcagc ggttcctccc acgtgcccag cgtctcccca ccgatcagga 7981 tcgagtcgtc ttcccccacg gtggtggaag ggcagacctt ggatctgaac tgcgtggtcg 8041 ccaggcagcc ccaggctatc atcacatggt acaagcgtgg gggcagcctt ccctcccgac 8101 accagaccca tggctcccac ctgcggttgc accaaatgtc tgtggctgac tcgggcgagt 8161 atgtgtgccg ggccaacaac aacatcgatg ccctggaggc ctccatcgtc atctccgtct 8221 cccctagcgc cggcagcccc tccgcccctg gcagctccat gcccatcaga attgagtcat 8281 cctcctcaca cgtggccgaa ggggagaccc tggatctgaa ctgcgtggtc cccgggcagg 8341 cccatgccca ggtcacttgg cacaagcgtg ggggcagcct ccccagtcac catcagaccc 8401 gcggctcacg gctgcggctg caccatgtgt ccccggccga ctcgggtgaa tacgtgtgcc 8461 gggtgatggg cagctctggc cccctggagg cctcagtcct ggtcaccatc gaagcctctg 8521 gctcaagtgc tgtccacgtc cccgccccag gtggagcccc acccatccgc atcgagccct 8581 cctcctcccg agtggcagaa gggcagaccc tggatctgaa gtgcgtggtg cccgggcagg 8641 cccacgccca ggtcacatgg cacaagcgtg gaggaaacct ccctgcccgg caccaggtcc 8701 acggcccact gctgaggctg aaccaggtgt ccccggctga ctctggcgag tactcgtgcc 8761 aagtgaccgg aagctcaggc accctggagg catctgtcct ggtcacaatt gagccctcca 8821 gcccaggacc cattcctgct ccaggactgg cccagcccat ctacatcgag gcctcctctt 8881 cacacgtgac tgaagggcag actctggatc tgaactgtgt ggtgcccggg caggcccatg 8941 cccaggtcac gtggtacaag cgcgggggca gcctccccgc ccggcaccag acccatggct 9001 cccagctgcg gctccacctc gtctcccctg ccgactcagg cgagtatgtg tgtcgtgcag 9061 ccagcggccc aggccctgag caagaagcct ccttcacagt caccgtcccg cccagtgagg 9121 ggtcttccta ccgccttagg agcccggtca tctccatcga cccgcccagc agcaccgtgc 9181 agcagggcca ggatgccagc ttcaagtgcc tcatccatga cggggcagcc cccatcagcc 9241 tcgagtggaa gacccggaac caggagctgg aggacaacgt ccacatcagt cccaatggct 9301 ccatcatcac catcgtgggc acccggccca gcaaccacgg tacctaccgc tgcgtggcct 9361 ccaatgccta cggtgtggcc cagagtgtgg tgaacctcag tgtgcacggg ccccctacag 9421 tgtccgtgct ccccgagggc cccgtgtggg tgaaagtggg aaaggctgtc accctggagt 9481 gtgtcagtgc cggggagccc cgctcctctg ctcgttggac ccggatcagc agcacccctg 9541 ccaagttgga gcagcggaca tatgggctca tggacagcca cgcggtgctg cagatttcat 9601 cagctaaacc atcagatgcg ggcacttatg tgtgccttgc tcagaatgca ctaggcacag 9661 cacagaagca ggtggaggtg atcgtggaca cgggcgccat ggccccaggg gcccctcagg 9721 tccaagctga agaagctgag ctgactgtgg aggctggaca cacggccacc ttgcgctgct 9781 cagccacagg cagccccgcg cccaccatcc actggtccaa gctgcgttcc ccactgccct 9841 ggcagcaccg gctggaaggt gacacactca tcataccccg ggtagcccag caggactcgg 9901 gccagtacat ctgcaatgcc actagccctg ctgggcacgc tgaggccacc atcatcctgc 9961 acgtggagag cccaccatat gccaccacgg tcccagagca cgcttcggtg caggcagggg 10021 agacggtgca gctccagtgc ctggctcacg ggacaccccc actcaccttc cagtggagcc 10081 gcgtgggcag cagccttcct gggagggcga ccgccaggaa cgagctgctg cactttgagc 10141 gtgcagcccc tgaggactca ggccgctacc gctgccgggt caccaacaag gtgggctcag 10201 ccgaggcctt tgcccagctg ctcgtccaag gccctcccgg ctctctccct gccacctcca 10261 tcccagcagg gtccacgccc accgtgcagg tcacgcctca gctagagacc aagagcattg 10321 gggccagcgt tgagttccac tgtgctgtgc ccagcgacca gggtacccag ctccgttggt 10381 tcaaggaagg gggtcagctg cctccgggtc acagcgtgca ggatggggtg ctccgaatcc 10441 agaacttgga ccagagctgc caagggacgt atatatgcca ggcccatgga ccttggggga 10501 aggcccaggc cagtgcccag ctggttatcc aagccctgcc ctcggtgctc atcaacatcc 10561 ggacctctgt gcagaccgtg gtggttggcc acgccgtgga gttcgaatgc ctggcactgg 10621 gtgaccccaa gcctcaggtg acatggagca aagttggagg gcacctgcgg ccaggcattg 10681 tgcagagcgg aggtgtcgtc aggatcgccc acgtagagct ggctgatgcg ggacagtatc 10741 gctgcactgc caccaacgca gctggcacca cacaatccca cgtcctgctg cttgtgcaag 10801 ccttgcccca gatctcaatg ccccaagaag tccgtgtgcc tgctggttct gcagctgtct 10861 tcccctgcat agcctcaggc taccccactc ctgacatcag ctggagcaag ctggatggca 10921 gcctgccacc tgacagccgc ctggagaaca acatgctgat gctgccctca gtccgacccc 10981 aggacgcagg tacctacgtc tgcaccgcca ctaaccgcca gggcaaggtc aaagcctttg 11041 cccacctgca ggtgccagag cgggtggtgc cctacttcac gcagaccccc tactccttcc 11101 taccgctgcc caccatcaag gatgcctaca ggaagttcga gatcaagatc accttccggc 11161 ccgactcagc cgatgggatg ctgctgtaca atgggcagaa gcgagtccca gggagcccca 11221 ccaacctggc caaccggcag cccgacttca tctccttcgg cctcgtgggg ggaaggcccg 11281 agttccggtt cgatgcaggc tcaggcatgg ccaccatccg ccatcccaca ccactggccc 11341 tgggccattt ccacaccgtg accctgctgc gcagcctcac ccagggctcc ctgattgtgg 11401 gtgacctggc cccggtcaat gggacctccc agggcaagtt ccagggcctg gatctgaacg 11461 aggaactcta cctgggtggc tatcctgact atggtgccat ccccaaggcg gggctgagca 11521 gcggcttcat aggctgtgtc cgggagctgc gcatccaggg cgaggagatc gtcttccatg 11581 acctcaacct cacggcgcac ggcatctccc actgccccac ctgtcgggac cggccctgcc 11641 agaatggcgg tcagtgccat gactctgaga gcagcagcta cgtgtgcgtc tgcccagctg 11701 gcttcaccgg gagccgctgt gagcactcgc aggccctgca ctgccatcca gaggcctgtg 11761 ggcccgacgc cacctgtgtg aaccggcctg acggtcgagg ctacacctgc cgctgccacc 11821 tgggccgctc ggggttgcgg tgtgaggaag gtgtgacagt gaccaccccc tcgctgtcgg 11881 gtgctggctc ctacctggca ctgcccgccc tcaccaacac acaccacgag ctacgcctgg 11941 acgtggagtt caagccactc gcccctgacg gggtcctgct gttcagcggg gggaagagcg 12001 ggcctgtgga ggacttcgtg tccctggcga tggtgggcgg ccacctggag ttccgctatg 12061 agttggggtc agggctggcc gttctgcgga gcgccgagcc gctggccctg ggccgctggc 12121 accgtgtgtc tgcagagcgt ctcaacaagg acggcagcct gcgggtgaat ggtggacgcc 12181 ctgtgctgcg ctcctcgccc ggcaagagcc agggcctcaa cctgcacacc ctgctctacc 12241 tggggggtgt ggagccttcc gtgccactgt ccccggccac caacatgagc gctcacttcc 12301 gcggctgtgt gggcgaggtg tcagtgaatg gcaaacggct ggacctcacc tacagtttcc 12361 taggcagcca gggcatcggg caatgctatg atagctcccc atgtgagcgc cagccttgcc 12421 aacatggtgc cacgtgcatg cccgctggcg agtatgagtt ccagtgcctg tgtcgagatg 12481 gattcaaagg agacctgtgt gagcacgagg agaacccctg ccagctccgt gaaccctgtc 12541 tgcatggggg cacctgccag ggcacccgct gcctctgcct ccctggcttc tctggcccac 12601 gctgccaaca aggctctgga catggcatag cagagtccga ctggcatctt gaaggcagcg 12661 ggggcaatga tgcccctggg cagtacggag cctatttcca cgatgatggc ttcctcgcct 12721 tccctggcca tgtcttctcc aggagcctgc ccgaggtgcc cgagaccatc gagctggagg 12781 ttcggaccag cacagccagt ggcctcctgc tctggcaggg tgtggaggtg ggagaggccg 12841 gccaaggcaa ggacttcatc agcctcgggc ttcaagacgg gcaccttgtc ttcaggtacc 12901 agctgggtag tggggaggcc cgcctggtct ctgaggaccc catcaatgac ggcgagtggc 12961 accgggtgac agcactgcgg gagggccgca gaggttccat ccaagtcgac ggtgaggagc 13021 tggtcagcgg ccggtcccca ggtcccaacg tggcagtcaa cgccaagggc agcgtctaca 13081 tcggcggagc ccctgacgtg gccacgctga ccgggggcag attctcctcg ggcatcacag 13141 gctgtgtcaa gaacctggtg ctgcactcgg cccgacccgg cgccccgccc ccacagcccc 13201 tggacctgca gcaccgcgcc caggccgggg ccaacacacg cccctgcccc tcgtaggcac 13261 ctgcctgccc cacacggact cccgggccac gccccagccc gacaatgtcg agtatattat 13321 tattaatatt attatgaatt tttgtaagaa accgaggcga tgccacgctt tgctgctacc 13381 gccctgggct ggactggagg tgggcatgcc accctcacac acacagctgg gcaaagccac 13441 aaggctggcc agcaaggcag gttggatggg agtgggcacc tcagaaagtc accaggactt 13501 ggggtcagga acagtggctg ggtgggccca gaactgcccc cactgtcccc ctacccaccg 13561 atggagcccc cagatagagc tgggtggcct gtttctgcag cccttgggca gttctcactc 13621 ctaggagagc caacctcggc ttgtgggctg gtgccccaca gctacctgag acgggcatcg 13681 caggagtctc tgccacccac tcaggattgg gaattgtctt tagtgccggc tgtggagcaa 13741 aaggcagctc acccctgggc aggcggtccc catccccacc agctcgtttt tcagcacccc 13801 cacccacctc cacccagccc ctggcacctc ctctggcaga ctccccctcc taccacgtcc 13861 tcctggcctg cattcccacc ccctcctgcc agcacacagc ctggggtccc tccctcaggg 13921 gctgtaaggg aaggcccacc ccaactctta ccaggagctg ctacaggcag agcccagcac 13981 tgatagggcc ccgcccaccg ggccccgccc accccaggcc acatccccac ccatctggaa 14041 gtgaaggccc agggactcct ccaacagaca acggacggac ggatgccgct ggtgctcagg 14101 aagagctagt gccttaggtg ggggaaggca ggactcacga ctgagagaga gaggaggggg 14161 atatgaccac cctgccccat ctgcaggagc ctgaagatcc agctcaagtg ccatcctgcc 14221 agtggccccc agactgtggg gttgggacgc ctggcctctg tgtcctagaa gggaccctcc 14281 tgtggtcttt gtcttgattt ttcttaataa acggtgctat ccccgcc // LOCUS HUMHSPR 1968 bp mRNA PRI 16-MAY-1995 DEFINITION Human GTP-binding protein (HSR1) mRNA, complete cds. ACCESSION L25665 NID g807999 KEYWORDS GTP-binding protein; cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1968) AUTHORS Vernet,C., Ribouchon,M.T., Chimini,G. and Pontarotti,P. TITLE Structure and evolution of a member of a new subfamily of GTP-binding proteins mapping to the human MHC class I region JOURNAL Mamm. Genome 5 (2), 100-105 (1994) MEDLINE 94235953 FEATURES Location/Qualifiers source 1..1968 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="PHA-stimulated T-cell" /map="6p21.3" gene 482..1774 /gene="HSR1" CDS 482..1774 /gene="HSR1" /note="homology with MMR1; similarities with YRB1 HALCU, Obg BACSU, Era ECOLI, THDF BACSU, THDF PSEPU; putative" /codon_start=1 /product="GTP-binding protein" /db_xref="PID:g431091" /translation="MEAAVAVLEMSDIVLLITDIRHPVVNFPPALYEYVTGELGLALV LVLNKVDLAPRRLVVAWKHYFHQHYPQLHVVLFTSFPRDPRTPQDPSSVLKKSRRRGR GWTRALGPEQLLRACEAITVGKVDLSSWREKIARDAGATWGNGSGEEEEEEDGPAVLV EQQQTDSAMEPTGPTQERYKDGVVTIGCVGFPNVGKSSLINGLVGRKVVSVSRTPGHT RYFQTYFLTPSVKLCDCPGLIFPSLLPRQLQVLAGIYPIAQIQEPYTAVGYLASRIPV QALLHLRHPEAEDPSAEHPWCAWDICEAWAEKRGYKTAKAARNDVYRAANSLLRLAVD GRVSLCFHPPGYSEQKGTWESHPETTELVVLQGRVGPAGDEEEEEEEELSSSCEEEGE EDRDADEEGEGDEETPTSAPGSSLAGRNPYALLGEDEC" BASE COUNT 421 a 520 c 620 g 407 t ORIGIN 1 cgtgcaggac aaacgggagc ggaagagagg gcttcaagat gggctgcgct ccagttccaa 61 cagccgcagc ggagccggga gcgcagagga acagaccgac acctcggacg gggagtctgt 121 gacccatcat atccgcaggc ttaaccagca gccttctcag gggcctgggt ccacgaggct 181 acgacccaaa tcgataccga ctgcattttg agagagacag cagggaggag gtagagagga 241 gaaagagagc agcccgggag caagttctac agccgttcag tgctgagttg ttggagctgg 301 acatccggga ggtgtatcag cctggctcag ttctggactt tcctcgacgt ctccttggag 361 ctatgagatg tccaaggagc aactaatgag ccaagaggaa cgagcttcca agactatctt 421 gggaagattc atggggctta ctcctctgag aaactcagct acttgagcac aatctggaga 481 catggaggca gctgtggcgg tgttagagat gtctgacatc gtcctgctta tcactgatat 541 ccgacatcca gttgtgaatt tcccgccagc actttatgag tatgtgactg gagaacttgg 601 actggccctg gtgctggttt tgaacaaggt ggatctggcc ccgcgacgtc ttgtggttgc 661 ctggaagcat tatttccatc aacactatcc ccagctccac gtcgtccttt tcacctcttt 721 tcctcgggac ccccgcaccc cacaggatcc tagtagtgtc ttgaagaaga gtcggaggcg 781 ggggagagga tggactcggg ccctggggcc agagcagttg ctgagagcct gtgaagccat 841 cactgtgggg aaagtggact tgagcagctg gcgggagaag attgctcggg atgctggggc 901 cacctggggt aatggctctg gggaggagga ggaagaggag gatggcccag cagtcctggt 961 ggagcagcag cagactgatt cagcaatgga gccaactggc ccaacccaag agcgctacaa 1021 ggatggggtg gtgaccatcg gctgtgtggg tttccctaat gtgggaaagt cctcgctgat 1081 caatgggctg gtggggcgga aagtcgtgag tgtctccaga accccgggcc atacccgata 1141 ctttcagacc tactttctta ccccctctgt gaagctctgt gactgcccag gcctcatctt 1201 cccatctctt ctgcctaggc agttgcaggt tctggcgggg atctacccta tcgcccagat 1261 ccaggagccc tacactgctg tgggctacct ggcctcccga attcccgtgc aggccctgct 1321 ccacctgcgc cacccagagg ctgaggaccc ctcagcggaa cacccctggt gtgcctggga 1381 catctgtgaa gcctgggcag agaaacgtgg ttacaagaca gccaaggcgg ctcggaatga 1441 tgtgtacaga gcagccaaca gtctcttgcg gctggcagtg gacggccgcg tcagcctgtg 1501 ttttcatccc ccaggctaca gtgaacagaa aggcacctgg gagtcccatc cagagaccac 1561 ggagctggtg gttttgcagg gcagggtggg gccagcaggt gacgaggagg aggaggaaga 1621 ggaagagctg agcagctcct gtgaggagga gggagaggag gaccgggatg cggatgagga 1681 gggagaaggg gatgaggaga ccccaacctc ggctccaggg tccagcctgg ctggccgaaa 1741 cccttatgcc ctgctgggtg aggatgagtg ctgagttccg cccagcgcct acttccctcc 1801 cagatacttt cgttttggac tgacctgggg gtatctcccc tctgccaccc caattgtgaa 1861 taaagattgt ttgctttgta gccccttccc cagatgaact gaggtgagag cggctgttcc 1921 aggctaacag ctgtgggagg cttctctcct cttctccctt cttttttt // LOCUS HUMHST 6616 bp DNA PRI 22-AUG-1995 DEFINITION Human transforming protein (hst) gene, complete cds. ACCESSION J02986 M16338 NID g184430 KEYWORDS transforming protein. SOURCE Homo sapiens (clone: pLBS6.2) DNA; and Homo sapiens (clone: lambda-CT361-b3) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 6182 to 6616; 2313 to 2890; 3508 to 3611) AUTHORS Taira,M., Yoshida,T., Miyagawa,K., Sakamoto,H., Terada,M. and Sugimura,T. TITLE cDNA sequence of human transforming gene hst and identification of the coding sequence required for transforming activity JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84 (9), 2980-2984 (1987) MEDLINE 87204251 REFERENCE 2 (bases 1 to 6181) AUTHORS Yoshida,T., Miyagawa,K., Odagiri,H., Sakamoto,H., Little,P.F., Terada,M. and Sugimura,T. TITLE Genomic sequence of hst, a transforming gene encoding a protein homologous to fibroblast growth factors and the int-2-encoded protein [published erratum appears in Proc Natl Acad Sci U S A 1988 Mar;85(6):1967] JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84 (20), 7305-7309 (1987) MEDLINE 88041096 COMMENT Draft entry and printed copy of sequence for [1],[2] kindly provided by H.Sakamoto, 08/06/87. No polyadenylation site was found. FEATURES Location/Qualifiers source 1..6616 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pLBS6.2" /clone="lambda-CT361-b3" /cell_line="T361-2nd-1 stomach cancer" /map="11q13.3" exon 2313..2890 /note="transforming protein" /number=1 variation 2444 /note="c in DNA, t in cDNA" /replace="t" variation 2474 /note="c in DNA, t in cDNA" /replace="t" gene 2551..4326 /gene="FGF4" CDS join(2551..2890,3508..3611,4150..4326) /gene="FGF4" /codon_start=1 /db_xref="GDB:G00-120-066" /product="transforming protein" /db_xref="PID:g386788" /translation="MSGPGTAAVALLPAVLLALLAPWAGRGGAAAPTAPNGTLEAELE RRWESLVALSLARLPVAAQPKEAAVQSGAGDYLLGIKRLRRLYCNVGIGFHLQALPDG RIGGAHADTRDSLLELSPVERGVVSIFGVASRFFVAMSSKGKLYGSPFFTDECTFKEI LLPNNYNAYESYKYPGMFIALSKNGKTKKGNRVSPTMKVTHFLPRL" intron 2891..3507 /gene="FGF4" /note="hst intron A" exon 3508..3611 /gene="FGF4" /number=2 intron 3612..4149 /gene="FGF4" /note="hst intron B" exon 4150..>6181 /note="transforming protein" /number=3 variation 4383 /note="g in DNA, a in cDNA" /replace="a" CDS 4685..5149 /note="ORF; putative" /codon_start=1 /db_xref="PID:g567023" /translation="MAWSISVWKGPPEAWAASSGSSQAWLSASILRGPLPLCPVAINR DISVYFGYRKCGVEVLAATLFLDLPRLTFELSCSQSSSIYQMGETLGQLYKLLFAFFG SATAPIAVTIGEKTKLFHKFHGEESISHWKARNGQDSVFAITNKTLVMKNNL" BASE COUNT 1404 a 1833 c 1984 g 1395 t ORIGIN 1 bp upstream of BamHI site. 1 ggatccagaa ggcatccccg agtggctact ccaatggagt ggcttctcca ttcaggcaaa 61 cctgaatggg ataagtcatt ggcaggaaga tctggggccg ggggtcatcc agtgggaagg 121 ggagagatga cgcggtcagc atggcgggaa cacaggagca gaaaggaagc aggtgggaag 181 ccaggtcaag ggccaggggc acggaaaggg gtcagatgca gataagtgag tgcttcctgg 241 tgcatccttc atccgcaatt catccttacc tgtgcttttg ttgcctccat tgcacagctg 301 aggaggccag ggcctgcgga ggttgagagt gtgctcaggg agcccccgga gcaaagtgga 361 agccagattc cagatcagtt ctgctgggaa ttcccagctc ccaaaagccc tgctggctgt 421 cagtccccag tcaccacaag cacctatcct gtgtgggtgg gcctgcagtt ctgggagata 481 tatcagctgc ctgcagcgtc ctttgctgaa ctcacagcaa ataggagaga cagggagggg 541 tccttgggaa gccctaaatt gagcttgctg tgggagtcct gggaagaaag gagcctcatc 601 ctatcaaaag ccggggggaa gacatcagag tccctctgct caggtcagct ggcacaggtg 661 ggtctccagg cctgggtctc acttccccag agggtgtgtt cgggtggccc caggctgagg 721 gaggaaagcc cacctcccat gtcattttgc aaatggggag tcagggacct agagatggaa 781 agacaacaca gcaagtgagg gatgggttct aggtcccctg caccctgcac cctgcaccct 841 ggccaacgat gtctatttgg caccagatct gcaggctcat ctgggggacc ccaggaccca 901 gaggcagccg ggttgcatct cgaagctgtg agctgcagcc caggaaggtc caggtctggg 961 tggcgctgcc caagcaggct gcaggcccaa ggaggaacaa agatcctctc aaggggtgcg 1021 gagctgaggt tccggtcctg ccaaagccac ttgatgaccc ccaagtgccc ccctttctgc 1081 acctcagaga agagccctca agcctcccag gtcccctcca ggggcacgaa taagccccag 1141 cagggttctg aaggggtccc aggaatctcc ctgtggggat gcggtggagg tggaggaggc 1201 tgcggtggcc tggggacatc tctggtcaca ggtgctggtg gtatgagaga tggggtaggc 1261 accaagcccc ctgcagctgt ggctaggcgg gcctgcagga agggccaggc aggctcctca 1321 gggaccacaa agaacagggg ttttcacacc taggtgggcc tgcatctagc taggccagtc 1381 cccatcaggc cataatgggc acagtgggag gtagaaccat gagtgagaga ggggaggctt 1441 ccagaggcct ggcctgggtc cctgctagat tgagggctct ggctatggta catggatatt 1501 tctgctgtgg aatcaaagga gcaggggatg ctgaatatcc cctctggccc tatgccctgc 1561 tacctgtcct ttcacggaag ggtgtgtgtg tagggggtgc aggaccaggc ctccctgggt 1621 gcatctctgc caccttgccc tttggctcag gtggacctcc accaggtatt cagaactcca 1681 gcccagaaac gcgccaagcc tgtggggcca agacctaggg ggtgggggtg gcctccctcc 1741 cgcctgtagc caaagggtcc tcccttgccc agccaggccc cggtgtcgct tactgctctt 1801 atccacccct ccttcccagg ccggtcctca aggccccagc aaaggaacca agttcccgtg 1861 agcctccgaa aggcgaaggg caggcagcag ccgctggctt ctgcgcccac taggagcttc 1921 ggatgcccga gttagggctg cgccaaggcg gccggagcag agagggagac ggggacgggg 1981 acaggcaggg acaaagtgca agaggcaaaa ctggctgaaa agcagaagtg taggagccgc 2041 caaggggcgg gacgaacagg tccgtgggcc gggcggagcc aagggtgggg gccggggtcc 2101 ctccaggtgg cactcgcggc gctagtcccc agcctcctcc cttcccccgg ccctgattgg 2161 caggcggcct gcgaccagcc gcgaacgcca cagcgccccg ggcgcccagg agaacgcgaa 2221 cggccccccg cgggagcggg cgagtaggag ggggcgccgg gctatatata tagcggctcg 2281 gcctcgggcg ggcctggcgc tcagggaggc gcgcactgct cctcagagtc ccagctccag 2341 ccgcgcgctt tccgcccggc tcgccgctcc atgcagccgg ggtagagccc ggcgcccggg 2401 ggccccgtcg cttgcctccc gcacctcctc ggttgcgcac tcccgcccga ggtcggccgt 2461 gcgctcccgc gggccgccac aggcgcagct ctgcccccca gcttcccggg cgcactgacc 2521 gcctgaccga cgcacggccc tcgggccggg atgtcggggc ccgggacggc cgcggtagcg 2581 ctgctcccgg cggtcctgct ggccttgctg gcgccctggg cgggccgagg gggcgccgcc 2641 gcacccactg cacccaacgg cacgctggag gccgagctgg agcgccgctg ggagagcctg 2701 gtggcgctct cgttggcgcg cctgccggtg gcagcgcagc ccaaggaggc ggccgtccag 2761 agcggcgccg gcgactacct gctgggcatc aagcggctgc ggcggctcta ctgcaacgtg 2821 ggcatcggct tccacctcca ggcgctcccc gacggccgca tcggcggcgc gcacgcggac 2881 acccgcgaca gtgagtggcg cggccaggcg cgaaggggcg ggggcggggg gcaacggccg 2941 ccgggccaac ccgctcagtc acactctgag accctcggcg ggcacctgct cgggggcccc 3001 gggaaccggg gcggactcgg gctccggtcc cttctgacgc ggggctgggg acgcagacac 3061 tcttggctcc ggcagcccag cgcaacccct gaggtcgggc gccgcctccc gccttcagaa 3121 actcgggctc cgagcgccga attccagcgc cttcgcccgt gggcacaggg cgcgcggtgc 3181 agccacaggg ggcccgagac acgcgccccg gcctggccca ggctggggaa ccgctggggt 3241 cgggctcgcg tctgaaggtc cgggactggg tgcggccgcc gggggtcccc tacacaggca 3301 agctaatctg agctagcgca ggcttgggct ccggaggccc tagagggcag cttgggctct 3361 ggaggccctt gggggcggct gcgccgggaa ccctggccct ttatccccaa ccccacccca 3421 gaaatagggt ccccggaggc gaacaagccg aggggcggag tgggccaggg atcacctgcc 3481 ccgcaatgac ctgcgccccg cccccaggcc tgctggagct ctcgcccgtg gagcggggcg 3541 tggtgagcat cttcggcgtg gccagccggt tcttcgtggc catgagcagc aagggcaagc 3601 tctatggctc ggtgagtacc gcaggggtct ggctaggcac ctagttggga acagcggaca 3661 tggctagcag gctcgtggct tctccagccc cacctgtgcc tgggtcttgg aggggtggca 3721 gggtcaccag gtcacgggac cggcaggcct ccccagacaa aggaagcagc cccaaggcag 3781 gaacaatgag gttcctgcca tccctgagtg ggcccctccc agaccgagga aagggcgcta 3841 ttgagagccc ttcccttctc tagtccagag gggtaggtct cagtgttgga actgcgggct 3901 tgaggctgga cacgcaggga atgaattctc tggctgctag gtgcagggca ggtggtgaga 3961 gcaccagctg ttgtgggctg gccatgtccc cttctcaccc tgtgtgggtc ttgacacctt 4021 aactgctcag cagagacatc tcagcccagg gtggggggtg ggacagaagg gggttctgac 4081 ccctggcttc aggctgggta ccttgcccaa gaggtgcccc agccctgaca ctgccctgct 4141 ttgctgcagc ccttcttcac cgatgagtgc acgttcaagg agattctcct tcccaacaac 4201 tacaacgcct acgagtccta caagtacccc ggcatgttca tcgccctgag caagaatggg 4261 aagaccaaga aggggaaccg agtgtcgccc accatgaagg tcacccactt cctccccagg 4321 ctgtgaccct ccagaggacc cttgcctcag cctcgggaag cccctgggag ggcagtgccg 4381 agggtcacct tggtgcactt tcttcggatg aagagtttaa tgcaagagta ggtgtaagat 4441 atttaaatta attatttaaa tgtgtatata ttgccaccaa attatttata gttctgcggg 4501 tgtgtttttt aattttctgg ggggaaaaaa agacaaaaca aaaaaccaac tctgactttt 4561 ctggtgcaac agtggagaat cttaccattg gatttcttta acttgtcaaa agttgtcacg 4621 agtgtgctgc tattctgtgt tttaaaaaaa ggtgacattg gattccgatg tcatcccctg 4681 tagtatggcg tggagcatct ctgtctggaa aggcccgcct gaggcttggg cagccagttc 4741 agggagctcc caggcttggc tctcggctag catcctcaga ggcccactcc ctttgtgccc 4801 tgttgctatt aatcgggaca tatcggttta cttcgggtac agaaagtgcg gtgttgaagt 4861 cctcgctgcc actctgtttt tagatctgcc aagactgacc tttgaacttt cctgtagtca 4921 atcttcctcg atctaccaga tgggagagac ccttggacaa ctttataaac tcctgtttgc 4981 cttttttgga tcagcgacag cccccatcgc tgtgactatt ggggaaaaga cgaagctctt 5041 tcataaattc catggagagg aatcaatatc ccactggaag gctagaaatg gacaagatag 5101 tgtatttgca atcacaaaca aaaccctagt gatgaaaaat aatttgtgat ggcagatgct 5161 tctgatggtg tgatagaata tgtttttgaa aacaaaccat cgaacccccc gccccacccc 5221 caaaacgggc ttccctgtgt ttagggagct ttgggctaga actagctacg atttttaggt 5281 gaaatgtcct tgtaattgta caaagcactt ggtgcagtgt ttgcgtggag cagcctgctg 5341 ctttctgatg cattccctgt ttaagtgcgt ttaacatcta cctcacaagc cctgaaaccc 5401 caggcaaaac ccacagaaag ctcatacccg gtgcaggagt ttgccatccc aagtggcttt 5461 ttttccatat gtagccaaaa aggattgcag atagcgtcgg tgcgtcccat tcgaaccttg 5521 tcacgtttga gctatcttta ccctgtgatt tacttttagt aagggtgatc atggtgaaaa 5581 tatttgcaga cagctgttac agtacactat atggtcacca agtaacctta tatttttctt 5641 tatatatttt acaaatgtaa cccctgtcat tgaagcaacc gtggaagagg cagggtcggt 5701 gatgtttaaa aaaagttccg aggtgatggc aaacatttaa ttttaatgaa tgacttttta 5761 gagtttatac aaaatgacct tagcttgcta ccagaaatgc tccgaatgtt tcgtcaagac 5821 tttaatactc tcctaggatg tttctgaact gtctcccgaa ttaactttat gggagtctac 5881 agacagcaag actggaaaat ctgattggag tttttgtctt tcacattcct tttgaaaact 5941 ctttgttcga atgcaaatca tcgacttaaa atactattct taaccaaggc ctggaagaaa 6001 gaagacactt gcaaagccgc taagacagga ccacacatct taaactgctg ttcctaccat 6061 gcactaaact gtttttaagt tttaaaccac accctaggct ccaggagtgt tcaggaaaga 6121 tggtgtttgt aggtctccat gctgtttggc gttggggggt gtggagggat catccgtcga 6181 ctttctgaat tttaatgtat tcacttagta acaaaccatg attgtcttaa atgccttaaa 6241 ttattatgag atttcttgtc tcagagccca atcagattgt caggaattaa catgtgttag 6301 gtttgatcac ccttgaccac ttcttataga tatttcttca acaaatcatg tgtgatgcct 6361 gtaggaacac aactgtacct ttaaaatatt gttttcatat tgctgtgatg gggattcgag 6421 gttcctgtat gtgccactgt tttcagaatc tgtagtttta tacaggtgcc gaccctcgtt 6481 gtgatgtatg tgctgtgcac attgacatgc tgaccgacaa tgataagcgt ttatcgtgta 6541 taaaaagaca ccactggact ggatgtacac aactgggaaa ggaattaaaa gctattaaaa 6601 ttgtgccttg aaatgc // LOCUS HUMHSTNBP 2561 bp mRNA PRI 23-DEC-1992 DEFINITION Homo sapiens histone-binding protein mRNA, complete cds. ACCESSION M97856 NID g184432 KEYWORDS histone-binding protein. SOURCE Homo sapiens (library: Clontech) male 50 year old adult testis cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Kleinschmidt,J.A., Dingwall,C., Maier,G. and Franke,W.W. TITLE Molecualr characterization of a karyophilic, histone-binding protein: cDNA cloning, amino acid sequence and expression of nuclear protein N1/N2 of Xenopus laevis JOURNAL EMBO J. 5, 3547-3552 (1986) MEDLINE 87161764 REFERENCE 2 (sites) AUTHORS Welch,J.E., Zimmerman,L.J., Joseph,D.R. and O'Rand,M.G. TITLE Characterization of a nuclear autoantigenic sperm protein: Complete sequence and homology with the Xenopus protein, N1/N2 JOURNAL Biol. Reprod. 43, 559-568 (1990) MEDLINE 91145522 REFERENCE 3 (bases 1 to 2561) AUTHORS O'Rand,M.G., Richardson,R.T., Zimmerman,L.J. and Widgren,E.E. TITLE Sequence and localization of human NASP: Conservation of a Xenopus histone binding protein JOURNAL Dev. Biol. 154, 37-44 (1992) MEDLINE 93050782 FEATURES Location/Qualifiers source 1..2561 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="primary spermatocytes; round spermatids; spermatozoa" /dev_stage="50 year old adult" /germline /sex="male" /tissue_type="testis" /tissue_lib="Clontech" 5'UTR 1..85 CDS 86..2449 /note="putative histone binding domain I: aa 115-126; putativehistone binding domain II: aa 469-511; nuclear translocationsignal: aa 715-720; putative" /citation=[1] /citation=[2] /codon_start=1 /function="binds histones" /label=hbp /product="histone-binding protein" /db_xref="PID:g184433" /translation="MAMESTATAAVAADVVSADKIEDVPAPSTSADKVESLDVDSEAK KLLGLGQKHLVMGDIPAAVNAFQEAASLLGKKYGETANECGEAFFFYGKSLLELARME NGVLGNALEGVHVEEEEGEKTEDESLVENNDNIDEEAREELREQVYDAMGEKEEAKKT EDKSLAKPETDKEQDSEMEKGGREDMDISKSAEEPQEKVDLTLDWLTETSEEAKGGAA PEGPNEAEVTSGKPEQEVPDAEEEKSVSGTDVQEECREKGGQEKQGEVIVSIEEKPKE VSEEQPVVTLEKQGTAVEVEAESLDPTVKPVDVGGDEPEEKVVTSENEAGKAVLEQLV GQEVPPAEESPEVQTEAAEASAVEAGSEVSEKPGQEAPVLPKDGAVNGPSVVGDQTPI EPQTSIERLTETKDGSGLEEKVRAKLVPSQEETKLSVEESEAAGDGVDTKVAQGATEK SPEDKVQIAANEETQEREEQMKEGEETEGSEEDDKENDKTEEMPNDSVLENKSLQENE EEEIGNLELAWDMLDLAKIIFKRQETKEAQLYAAQAHLKLGEVSVESENYVQAVEEFQ SCLNLQEQYLEAHDRLLAETHYQLGLAYGYNSQYDEAVAQFSKSIEVIENRMAVLNEQ VKEAEGSSEYKKEIEELKELLPEIREKIEDAKESQRSGNVAELALKATLVESSTSGFT PGGGGSSVSMIASRKPTDGASSSNCVTDISHLVRKKRKPEEESPRKDDAKKAKQEPEV NGGSGDAVPSGNEVSENMEEEAENQLKRGAAVEGTLEAGATVESTAC" 3'UTR 2447..2561 /citation=[2] /evidence=experimental polyA_signal 2537..2542 /citation=[2] BASE COUNT 841 a 446 c 757 g 517 t ORIGIN 1 gcctgagtga gtctctggcg tcccaaattg cctgtttttc tcgcaggctc tattccgttc 61 gctggttcgc cacctcaggg gaacgatggc catggagtcc acagccactg ccgccgtcgc 121 cgcggacgtg gtttctgccg acaaaattga agatgtccct gctccttcta catctgcaga 181 taaagtggag agtctggatg tggatagtga agctaagaaa ctattgggtt taggacagaa 241 acatctggtg atgggggata ttccagcagc tgtcaatgca ttccaggaag cagctagtct 301 tttaggtaag aagtatggag agacagctaa tgagtgtgga gaagccttct ttttctatgg 361 gaaatcactt ctggagttgg caagaatgga gaatggtgtg ttgggaaacg ccttggaagg 421 tgtgcatgtg gaagaggaag aaggagaaaa aacagaagat gaatctctgg tagaaaataa 481 tgataacata gatgaggaag caagggaaga gttgagagaa caggtttatg acgccatggg 541 agaaaaagaa gaagccaaaa aaacagaaga caagtctttg gcaaagcctg aaactgataa 601 agaacaggac agtgaaatgg agaagggtgg aagagaagat atggatataa gtaaatctgc 661 agaggagcca caggaaaaag ttgacttgac tctagattgg ttaactgaaa cctctgaaga 721 ggcaaaagga ggagcagcac cagaaggacc gaatgaagct gaggtcactt ctgggaagcc 781 agaacaggaa gtaccagatg ctgaggaaga aaaatcagtt tctggaactg atgtccaaga 841 agagtgcaga gaaaaaggag gtcaggagaa gcagggagag gtaattgtga gcatagagga 901 gaagccaaaa gaagtttcag aagagcagcc tgtggtgact ctagaaaagc agggcactgc 961 agtggaggta gaagcagagt ctttagaccc gacagtcaag ccagtggatg tgggtgggga 1021 cgagccagag gagaaggtag ttacctctga aaacgaggca ggaaaggcgg ttcttgaaca 1081 actggtaggt caagaagtac cacctgctga agagtcacca gaggtgcaaa cagaggctgc 1141 agaggcctca gctgtagagg ctggatcaga agtctctgaa aagcctgggc aggaggctcc 1201 agttctccct aaggatggtg cagtcaatgg accgtcagtt gtaggagatc agactcctat 1261 tgaaccacag acttctatag aaagactgac agaaacaaaa gatggctcag gactagagga 1321 gaaggtcagg gcaaagctgg ttcctagtca ggaggagact aagctgtctg tagaagagtc 1381 tgaggcagct ggagatgggg ttgataccaa ggtagcccag ggagctactg agaaatcacc 1441 tgaagacaaa gttcagatag ctgctaatga agagacacaa gagagagaag aacagatgaa 1501 agagggtgaa gaaactgaag gctcggaaga ggatgataaa gaaaatgata agactgaaga 1561 aatgccaaat gattcagtcc ttgaaaacaa gtctcttcaa gaaaatgagg aggaggagat 1621 tgggaaccta gagcttgcct gggatatgct ggatttagca aagatcattt ttaaaaggca 1681 agaaacaaaa gaagcacagc tttatgctgc ccaggcacat cttaaactcg gagaagttag 1741 tgttgaatct gaaaactatg tgcaagctgt ggaggagttc cagtcctgcc ttaacctgca 1801 ggaacagtac ctggaagccc acgaccgtct gcttgcagag acccactacc agctgggctt 1861 ggcttatggg tacaactctc agtatgatga ggcagtggca cagttcagca aatctattga 1921 agtcattgag aacagaatgg ctgtactaaa cgagcaggtg aaggaggctg aaggatcgtc 1981 tgaatacaag aaagaaattg aggaactaaa ggaactgcta cccgaaatta gagagaagat 2041 agaagatgca aaggagtctc agcgtagtgg gaatgtagct gaactggctc tgaaagctac 2101 tctggtggag agttctactt caggtttcac tcctggtgga ggaggctctt cagtctccat 2161 gattgccagt agaaagccaa cagacggtgc ttcctcatca aattgtgtga ctgatatttc 2221 ccaccttgtc agaaagaaga ggaaaccaga ggaagagagt ccccggaaag atgatgcaaa 2281 gaaagccaaa caagagccgg aggtgaacgg aggcagtggg gatgctgtcc cgagtggaaa 2341 tgaagtttcg gaaaacatgg aggaggaggc tgagaatcag ctgaaacgcg gagcagcagt 2401 ggaggggaca ctggaggctg gagctacagt tgaaagcact gcatgttaag agggggcaca 2461 gcctcctccc aagggaaagt gtttttgtat ataatgtatt ttttcacttt tggaggattc 2521 tttttgtata acttcaataa agattgtaag caaaaaaaaa a // LOCUS HUMHTFP 979 bp mRNA PRI 27-JUL-1994 DEFINITION Homo sapiens tissue factor pathway inhibitor-2 mRNA, complete cds. ACCESSION L27624 NID g441149 KEYWORDS tissue factor pathway inhibitor-2. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 979) AUTHORS Sprecher,C.A., Kisiel,W., Mathewes,S. and Foster,D.C. TITLE Molecular cloning, expression, and partial characterization of a second human tissue-factor-pathway inhibitor JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91 (8), 3353-3357 (1994) MEDLINE 94211862 FEATURES Location/Qualifiers source 1..979 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 39..746 /note="putative" /codon_start=1 /product="tissue factor pathway inhibitor-2" /db_xref="PID:g441150" /translation="MDPARPLGLSILLLFLTEAALGDAAQEPTGNNAEICLLPLDYGP CRALLLRYYYDRYTQSCRQFLYGGCEGNANNFYTWEACDDACWRIEKVPKVCRLQVSV DDQCEGSTEKYFFNLSSMTCEKFFSGGCHRNRIENRFPDEATCMGFCAPKKIPSFCYS PKDEGLCSANVTRYYFNPRYRTCDAFTYTGCGGNDNNFVSREDCKRACAKALKKKKKM PKLRFASRIRKIRKKQF" BASE COUNT 258 a 217 c 231 g 273 t ORIGIN 1 ggacgccttg cccagcgggc cgcccgaccc cctgcaccat ggaccccgct cgccccctgg 61 ggctgtcgat tctgctgctt ttcctgacgg aggctgcact gggcgatgct gctcaggagc 121 caacaggaaa taacgcggag atctgtctcc tgcccctaga ctacggaccc tgccgggccc 181 tacttctccg ttactactac gacaggtaca cgcagagctg ccgccagttc ctgtacgggg 241 gctgcgaggg caacgccaac aatttctaca cctgggaggc ttgcgacgat gcttgctgga 301 ggatagaaaa agttcccaaa gtttgccggc tgcaagtgag tgtggacgac cagtgtgagg 361 ggtccacaga aaagtatttc tttaatctaa gttccatgac atgtgaaaaa ttcttttccg 421 gtgggtgtca ccggaaccgg attgagaaca ggtttccaga tgaagctact tgtatgggct 481 tctgcgcacc aaagaaaatt ccatcatttt gctacagtcc aaaagatgag ggactgtgct 541 ctgccaatgt gactcgctat tattttaatc caagatacag aacctgtgat gctttcacct 601 atactggctg tggagggaat gacaataact ttgttagcag ggaggattgc aaacgtgcat 661 gtgcaaaagc tttgaaaaag aaaaagaaga tgccaaagct tcgctttgcc agtagaatcc 721 ggaaaattcg gaagaagcaa ttttaaacat tcttaatatg tcatcttgtt tgtctttatg 781 gcttatttgc ctttatggtt gtatctgaag aataatatga cagcatgagg aaacaaatca 841 ttggtgattt attcaccagt ttttattaat acaagtcact ttttcaaaaa tttggatttt 901 tttatatata actagctgct attcaaatgt gagtctacca tttttaattt atggttcaac 961 tgtttgtgag actgaattc // LOCUS HUMHTPB 1991 bp mRNA PRI 18-JAN-1996 DEFINITION Human mRNA for mitochondrial 3-ketoacyl-CoA thiolase beta-subunit of trifunctional protein, complete cds. ACCESSION D16481 NID g473711 KEYWORDS 3-ketoacyl-CoA thiolase beta-subunit; trifunctional protein. SOURCE Homo sapiens (library: lambda gt11) cDNA to mRNA, clones Hbeta-[1 and 2]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1991) AUTHORS Kamijo,T., Aoyama,T., Komiyama,A. and Hashimoto,T. TITLE Structural analysis of cDNAs for subunits of human mitochondrial fatty acid beta-oxidation trifunctional protein JOURNAL Biochem. Biophys. Res. Commun. 199 (2), 818-825 (1994) MEDLINE 94183263 REFERENCE 2 (bases 1 to 1991) AUTHORS Kamijo,T. TITLE Direct Submission JOURNAL Submitted (17-JUN-1993) to the DDBJ/EMBL/GenBank databases. Takehiko Kamijo, Shinshu University School of Medicine, Pediaric Department; Asahi 3-1-1, Matsumoto, Nagano 390, Japan (E-mail:kkamijo, Tel:0263-35-4600, Fax:0263-33-6458) COMMENT Submitted (17-JUN-1993) to DDBJ by: Takehiko Kamijo Pediatric Department Shinshu University School of Medicine Asahi 3-1-1, Matsumoto Nagano 390 Japan Phone: 0263-35-4600 Fax: 0263-33-6458. FEATURES Location/Qualifiers source 1..1991 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda gt11" sig_peptide 47..145 /gene="HTP-beta" CDS 47..1471 /gene="HTP-beta" /EC_number="2.3.1.16" /codon_start=1 /product="3-ketoacyl-CoA thiolase beta-subunit of trifunctional protein" /db_xref="PID:d1004458" /db_xref="PID:g862458" /translation="MTILTYPFKNLPTASKWALRFSIRPLSCSSQLRAAPAVQTKTKK TLAKPNIRNVVVVDGVRTPFLLSGTSYKDLMPHDLARAALTGLLHRTSVPKEVVDYII FGTVIQEVKTSNVAREAALGAGFSDKTPAHTVTMACISANQAMTTGVGLIASGQCDVI VAGGVELMSDVPIRHSRKMRKLMLDLNKAKSMGQRLSLISKFRFNFLAPELPAVSEFS TSETMGHSADRLAAAFAVSRLEQDEYALRSHSLAKKAQDEGLLSDVVPFKVPGKDTVT KDNGIRPSSLEQMAKLKPAFIKPYGTVTAANSSFLTDGASAMLIMAEEKALAMGYKPK AYLRDFMYVSQDPKDQLLLGPTYATPKVLEKAGLTMNDIDAFEFHEAFSGQILANFKA MDSDWFAENYMGRKTKVGLPPLEKFNNWGGSLSLGHPFGATGCRLVMAAANRLRKEGG QYGLVAACAAGGQGHAMIVEAYPK" gene 47..1471 /gene="HTP-beta" mat_peptide 146..1468 /gene="HTP-beta" /product="3-ketoacyl-CoA thiolase beta-subunit of trifunctional protein" BASE COUNT 550 a 421 c 457 g 563 t ORIGIN 1 cttgctccga gagggagtcc tcgcggacgt cagccaagat tccagaatga ctatcttgac 61 ttaccccttt aaaaatcttc ccactgcatc aaaatgggcc ctcagatttt ccataagacc 121 tctgagctgt tcctcccagc tacgagctgc cccagctgtc cagaccaaaa cgaagaagac 181 gttagccaaa cccaatataa ggaatgttgt ggtggtggat ggtgttcgca ctccattttt 241 gctgtctggc acttcatata aagacctgat gccacatgat ttggctagag cagcgcttac 301 gggtttgttg catcggacca gtgtccctaa ggaagtagtt gattatatca tctttggtac 361 agttattcag gaagtgaaaa caagcaatgt ggctagagag gctgcccttg gagctggctt 421 ctctgacaag actcctgctc acactgtcac catggcttgt atctctgcca accaagccat 481 gaccacaggt gttggcttga ttgcttctgg ccagtgtgat gtgatcgtgg caggtggtgt 541 tgagttgatg tccgatgtcc ctattcgtca ctcaaggaaa atgagaaaac tgatgcttga 601 tctcaataag gccaaatcta tgggccagcg actgtcttta atctctaaat tccgatttaa 661 tttcctagca cctgagctcc ctgcggtttc tgagttctcc accagtgaga ccatgggcca 721 ctctgcagac cgactggccg ctgcctttgc tgtttctcgg ctggaacagg atgaatatgc 781 actgcgctct cacagtctag ccaagaaggc acaggatgaa ggactccttt ctgatgtggt 841 acccttcaaa gtaccaggaa aagatacagt taccaaagat aatggcatcc gtccttcctc 901 actggagcag atggccaaac taaaacctgc attcatcaag ccctacggca cagtgacagc 961 tgcaaattct tctttcttga ctgatggtgc atctgcaatg ttaatcatgg cggaggaaaa 1021 ggctctggcc atgggttata agccgaaggc atatttgagg gattttatgt atgtgtctca 1081 ggatccaaaa gatcaactat tacttggacc aacatatgct actccaaaag ttctagaaaa 1141 ggcaggattg accatgaatg atattgatgc ttttgaattt catgaagctt tctcgggtca 1201 gattttggca aattttaaag ccatggattc tgattggttt gcagaaaact acatgggtag 1261 aaaaaccaag gttggattgc ctcctttgga gaagtttaat aactggggtg gatctctgtc 1321 cctgggacac ccatttggag ccactggctg caggttggtc atggctgctg ccaacagatt 1381 acggaaagaa ggaggccagt atggcttagt ggctgcgtgt gcagctggag ggcagggcca 1441 tgctatgata gtggaagctt atccaaaata atagatccag aagaagtgac ctgaagtttc 1501 tgtgcaacac tcacactagg caatgccatt tcaatgcatt actaaatgac atttgtagtt 1561 cctagctcct cttaggaaaa cagttcttgt ggccttctat taaatagttt gcacttaagc 1621 cttgccagtg ttctgagctt ttcaataatc agtttactgc tctttcaggg atttctaagc 1681 caccagaatc tcacatgaga tgtgtgggtg gttgtttttg gtctctgttg tcactaaaga 1741 ctaaatgagg gtttgcagtt gggaaagagg tcaactgaga tttggaaatc atctttgtaa 1801 tatttgcaaa ttatacttgt tcttatctgt gtcctaaaga tgtgttctct ataaaataca 1861 aaccaacgtg cctaattaat tatggaaaaa taattcagaa tctaaacacc actgaaaact 1921 tataaaaaat gtttagatac ataaatatgg tggtcagcgt taataaagtg gagaaatatt 1981 ggaaaaaaaa a // LOCUS HUMHTR1DB 1959 bp DNA PRI 31-DEC-1994 DEFINITION Human serotonin 1Db receptor (HTR1D) gene, complete cds. ACCESSION M75128 NID g184459 KEYWORDS serotonin 1Db receptor. SOURCE Homo sapiens (tissue library: EMBL3 SP6/T7 (Clontech)) placenta DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1959) AUTHORS Demchyshyn,L., Sunahara,R.K., Miller,K., Teitler,M., Hoffman,B.J., Kennedy,J.L., Seeman,P., Van Tol,H.H. and Niznik,H.B. TITLE A human serotonin 1D receptor variant (5HT1D beta) encoded by an intronless gene on chromosome 6 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (12), 5522-5526 (1992) MEDLINE 92302275 FEATURES Location/Qualifiers source 1..1959 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /tissue_lib="EMBL3 SP6/T7 (Clontech)" gene 619..1791 /gene="HTR1D" CDS 619..1791 /gene="HTR1D" /codon_start=1 /product="serotonin 1Db receptor" /db_xref="PID:g184460" /translation="MEEPGAQCAPPPPAGSETWVPQANLSSAPSQNCSAKDYIYQDSI SLPWKVLLVMLLALITLATTLSNAFVIATVYRTRKLHTPANYLIASLAVTDLLVSILV MPISTMYTVTGRWTLGQVVCDFWLSSDITCCTASILHLCVIALDRYWAITDAVEYSAK RTPKRAAVMIALVWVFSISISLPPFFWRQAKAEEEVSECVVNTDHILYTVYSTVGAFY FPTLLLIALYGRIYVEARSRILKQTPNRTGKRLTRAQLITDSPGSTSSVTSINSRVPD VPSESGSPVYVNQVKVRVSDALLEKKKLMAARERKATKTLGIILGAFIVCWLPFFIIS LVMPICKDACWFHLAIFDFFTWLGYLNSLINPIIYTMSNEDFKQAFHKLIRFKCTS" BASE COUNT 393 a 627 c 516 g 423 t ORIGIN 1 gagctccggc gcgaggcgcg gcgcagcgct gctcctagac ttcaccccac ccagctctgg 61 cggccgctgc agccccccaa aagtgcccca gcttggggcg aggggtggga atgcaagatc 121 tcgggacctc tcgctggcct gcaagctttg gtctctacac ctaggaaact cctgtgggca 181 aagtctgcag atccaaaagc gtccaggtta ggagacgctc agcctcaagc aactggggta 241 agagatccca tttggtcaaa gccttctcct caagcagtac ttcaccctcc tgcactagac 301 gcctccaggg agctggagcg gagcagggct cggtgggcca gctcttagca acccaggtct 361 aagacccggt gtggagagga acaaccacag acgcggcggc ttagctaggc gctctggaag 421 tgcaggggag gcgccgcctg ccttggctgc cgcacccatg acctctagtt tcagctgtga 481 acctgggcgg aggaataatt gaggaactca cggaactatc aactggggac aaacctgcga 541 tcgccacggt ccttccgccc tctccttcgt ccgctccatg cccaagagct gcgctccgga 601 gctggggcga ggagagccat ggaggaaccg ggtgctcagt gcgctccacc gccgcccgcg 661 ggctccgaga cctgggttcc tcaagccaac ttatcctctg ctccctccca aaactgcagc 721 gccaaggact acatttacca ggactccatc tccctaccct ggaaagtact gctggttatg 781 ctattggcgc tcatcacctt ggccaccacg ctctccaatg cctttgtgat tgccacagtg 841 taccggaccc ggaaactgca caccccggct aactacctga tcgcctctct ggcggtcacc 901 gacctgcttg tgtccatcct ggtgatgccc atcagcacca tgtacactgt caccggccgc 961 tggacactgg gccaggtggt ctgtgacttc tggctgtcgt cggacatcac ttgttgcact 1021 gcctccatcc tgcacctctg tgtcatcgcc ctggaccgct actgggccat cacggacgcc 1081 gtggagtact cagctaaaag gactcccaag agggcggcgg tcatgatcgc gctggtgtgg 1141 gtcttctcca tctctatctc gctgccgccc ttcttctggc gtcaggctaa ggccgaagag 1201 gaggtgtcgg aatgcgtggt gaacaccgac cacatcctct acacggtcta ctccacggtg 1261 ggtgctttct acttccccac cctgctcctc atcgccctct atggccgcat ctacgtagaa 1321 gcccgctccc ggattttgaa acagacgccc aacaggaccg gcaagcgctt gacccgagcc 1381 cagctgataa ccgactcccc cgggtccacg tcctcggtca cctctattaa ctcgcgggtt 1441 cccgacgtgc ccagcgaatc cggatctcct gtgtatgtga accaagtcaa agtgcgagtc 1501 tccgacgccc tgctggaaaa gaagaaactc atggccgcta gggagcgcaa agccaccaag 1561 accctaggga tcattttggg agcctttatt gtgtgttggc tacccttctt catcatctcc 1621 ctagtgatgc ctatctgcaa agatgcctgc tggttccacc tagccatctt tgacttcttc 1681 acatggctgg gctatctcaa ctccctcatc aaccccataa tctataccat gtccaatgag 1741 gactttaaac aagcattcca taaactgata cgttttaagt gcacaagttg acttgccgtt 1801 tgcagtgggg tcgcctaagc gacctttggg gaccaagttg tgtctggttc cacaggtagg 1861 tcgaatcttc tttcgcggtt tctgggtccc agcgaggctc tctctcctgg gcaagggcaa 1921 tggatcctga gaagccagaa tagtcctgag agagagctc // LOCUS HUMHTR3A 1685 bp mRNA PRI 31-DEC-1994 DEFINITION Human chaperonin-like protein (HTR3) mRNA, complete cds. ACCESSION M94083 NID g184461 KEYWORDS chaperonin-like protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1685) AUTHORS Segel,G.B., Boal,T.R., Cardillo,T.S., Murant,F.G., Lichtman,M.A. and Sherman,F. TITLE Isolation of a gene encoding a chaperonin-like protein by complementation of yeast amino acid transport mutants with human cDNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (13), 6060-6064 (1992) MEDLINE 92335237 FEATURES Location/Qualifiers source 1..1685 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="b-lymphocyte" gene 162..1331 /gene="HTR3" CDS 162..1331 /gene="HTR3" /codon_start=1 /product="chaperonin-like protein" /db_xref="PID:g184462" /translation="MDRETLIDVARTSLRTKVHAELADVLTEAVVDSILAIKKQDEPI DLFMIEIMEMKHKSETDTSLIRGLVLDHGARHPDMKKRVEDAYILTCNVSLEYEKTEV NSGFFYKSAEEREKLVKAERKFIEDRVKKIIELKRKVCGDSDKGFVVINQKGIDPFSL DALSKEGIVALRRAKRRNMERLTLACGGVALNSFDDLSPDCLGHAGLVYEYTLGEEKF TFIEKCNNPRSVTLLIKGPNKHTLTQIKDAVRDGLRAVKNAIDDGCVVPGAGAVEVAM AEALIKHKPSVKGRAQLGVQAFADALLIIPKVLAQNSGFDLQETLVKIQAEHSESGQL VGVDLNTGEPMVAAEVGVWDNYCVKKQLLHSCTVIATNILLVDEIMRAGMSSLKG" BASE COUNT 523 a 296 c 403 g 463 t ORIGIN 1 ttcttctgcc ctatgtctcc aaggccatca ttggagagct gctgaaacag gcggatctct 61 acatttctga aggccttcat cctagaataa tcactgaagg atttgaagct gcaaaggaaa 121 aggcccttca gtttttggaa gaagtcaaag taagcagaga gatggacagg gaaacactta 181 tagatgtggc cagaacatct cttcgtacta aagttcatgc tgaacttgca gatgtcttaa 241 cagaggctgt agtggactcc attttggcca ttaaaaagca agatgaacct attgatctct 301 tcatgattga gatcatggag atgaaacata aatctgaaac tgatacaagc ttaatcagag 361 ggcttgtttt ggaccacgga gcacggcatc ctgatatgaa gaaaagggtg gaggatgcat 421 acatcctcac ttgtaacgtg tcattagagt atgagaaaac agaagtgaat tctggctttt 481 tttacaagag tgcagaagag agagaaaaac tcgtgaaagc tgaaagaaaa ttcattgaag 541 atagggttaa aaaaataata gaactgaaaa ggaaagtctg tggcgattca gataaaggat 601 ttgttgttat taatcaaaag ggaattgacc ccttttcctt agatgctctt tcaaaagaag 661 gcatagtcgc tctgcgcaga gctaaaagga gaaatatgga gaggctgact cttgcttgtg 721 gtggggtagc cctgaattct tttgacgacc taagtcctga ctgcttggga catgcaggac 781 ttgtatatga gtatacattg ggagaagaga agtttacctt tattgagaaa tgtaacaacc 841 ctcgttctgt cacattattg atcaaaggac caaataagca cacactcact cagatcaaag 901 atgcagtgag ggacggcttg agggctgtca aaaatgctat tgatgatggc tgtgtggttc 961 caggtgctgg tgccgtggaa gtggcaatgg cagaagccct gattaaacat aagcccagtg 1021 taaagggcag ggcacagctt ggagtccaag catttgctga tgcattgctc attattccca 1081 aggttcttgc tcagaactct ggttttgacc ttcaggaaac attagttaaa attcaagcag 1141 aacattcaga atcaggtcag cttgtgggtg tggacctgaa cacaggtgag ccaatggtgg 1201 cagcagaagt aggcgtatgg gataactatt gtgtaaagaa acagcttctt cactcctgca 1261 ctgtgattgc caccaacatt ctcttggttg atgagatcat gcgagctgga atgtcttctc 1321 tgaaaggttg aattgaagct tcctctgtat ctgaatcttg aagactgcaa agtgatcctg 1381 aggattacag ctgtggaatt tttgtccaag cttcaaataa ttttgaaaga aattttccca 1441 tatgaaaaaa ggagagaaca ctggcatctg ttgaaatttg gaagttctga aattatagta 1501 tttttaaaaa ttgcactgaa gtgtatacac ataaagcagg tcttttatcc agtgaacagg 1561 atgttttgct ttagcagcag tgacataaaa ttccatgtta gataagcata tgttacttac 1621 cttgttatta aatatttctt gaaaagcagg ccacgaaggc cggccttcgt ggcctcgagg 1681 aattc // LOCUS HUMHUC 1929 bp mRNA PRI 31-DEC-1994 DEFINITION Homo sapiens (huc) mRNA, complete cds. ACCESSION L26405 NID g431092 KEYWORDS . SOURCE Homo sapiens (tissue library: lambda ZAP) brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1929) AUTHORS Manley,T. and Furneaux,H.M. TITLE Isolation of the huc antigen JOURNAL Unpublished (1994) FEATURES Location/Qualifiers source 1..1929 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /tissue_lib="lambda ZAP" gene 147..1226 /gene="huc" CDS 147..1226 /gene="huc" /codon_start=1 /db_xref="PID:g431093" /translation="MVTQILGAMESQVGGGPAGPALPNGPLLGTNGATDDSKTNLIVN YLPQNMTQDEFKSLFGSIGDIESCKLVRDKITGRDLGYGFVNYPDPNDADKAINTLNG LKLQTKTIKVSYARPSSASIRDANLYVSGLPKTMSQKEMEQLFSQYGRIITSRILVDQ VTGVSRGVGFIRFDKRIEAEEAIKGLNGQKPLGAAEPITVKFANNPSQKTGQALLTHL YQSSARRYAGPLHHQTQRFRLDNLLNMAYGVKRFSPIAIDGMSGLAGVGLSGGAAGGW CIFVYNLSPEADESVLWQLFGPFGAVTNVKVIRDFTTNKCKGFGFVTMTNYDEAAMAI ASLNGYRLAERVLQVSFKTSKQHKA" BASE COUNT 462 a 582 c 576 g 309 t ORIGIN 1 cggagccgcc gccttcatcg ccacatctgc agcggccgca ccagagcgcc cgggcggacc 61 ccagcgtgac gatcgggcgc ccccctagga gtgcaccacc cccggagccc ccctcaacac 121 ggaccgcgcc cgccgggcac acaagaatgg tcactcagat actgggggcc atggagtctc 181 aggtgggggg gggcccggcc ggcccggccc tgcccaacgg gccactcctt ggtacaaatg 241 gagccactga cgacagcaag accaacctca tcgtcaacta cctgccccag aacatgaccc 301 aggatgagtt caagagtctc ttcggcagca ttggcgacat cgagtcctgc aagttggttc 361 gggacaagat cacaggcaga gaccttggct acgggtttgt gaactatcct gaccccaatg 421 atgcagacaa agccatcaac accctcaacg gcctcaaatt acagacgaag accatcaagg 481 tgtcctatgc cagacccagt tcagcatcca tccgggatgc taacctgtac gtcagcgggc 541 tccccaagac catgagccag aaagagatgg agcagctctt ctcccagtac ggccgcatca 601 tcacgtcccg catcctggtg gaccaggtca caggtgtctc tcggggtgtg ggattcatcc 661 gctttgacaa gaggattgag gccgaagagg ctatcaaagg actgaatggg cagaagccgc 721 tgggcgcagc tgagcccatc acagtcaagt tcgcgaacaa cccaagtcag aagacggggc 781 aggcgctgct cacccacctc taccagtcat ccgcccggcg ctacgcaggc cccctacacc 841 atcagaccca gcgtttccgg ctggacaatt tgctcaacat ggcctacggc gtcaagaggt 901 tctcgccgat cgccatcgat ggtatgagcg gcctggcggg cgtgggcctg tcggggggcg 961 cggcgggcgg ctggtgcatc ttcgtgtaca acctgtcacc ggaggcagac gagagcgtgc 1021 tgtggcagct gttcgggcct tttggggcag tcaccaacgt caaggtcatc cgtgatttca 1081 ccaccaacaa gtgcaagggt ttcggcttcg tgaccatgac caactatgac gaggcggcca 1141 tggccatcgc cagcctgaac ggctatcgcc tggccgagcg cgtgctgcag gtctccttca 1201 agaccagcaa acagcacaag gcgtgagccc accccgcctg ccctcccacc ccctccccgg 1261 gcagcagaga gagagagaga gaaagagaga gagagagaga gaaggggccc aagagagaca 1321 gcacaggcag ccccacggac gacgcgaggg ccccacgtcc ctgcggaagc cacagggtga 1381 gcactctggg gtgggagggt ctgcagggaa ttgggggggt gcccggggat cccccgcccc 1441 atcctcctgc ccccacccca ggctgggctg ttcactctct cgtcttggtt tggttcatgg 1501 tgaaggtttt tgtttctttt ttcggctaaa aagaatgcag agatgtgccc ccacccccac 1561 cctcgaccac ccccgatggg atggcttggg gggctccagg gggtgccctc ccagaccccc 1621 ttgcccaggc ctccccagca cctaggtggg gcctggggta ggaggaacag gtttaaaaat 1681 ccccaaaaaa gcgaaccgtg aggaggggtg tgggcacccc cggcccagtg ccccctggtg 1741 gaatgcgggg gagcaggcag tggggctgga agcagaaaca aaatgaaaaa aaaagggggg 1801 tgggagggga agaaaaactc tatttttgta aaaagggaaa aagacctcgt ggagaatttt 1861 tactggggat tcttgaactt gaaaaaaaaa aacacaaaaa aagacaaaaa aaaaaaaaaa 1921 aaaggaatt // LOCUS HUMHUGBR1 2351 bp mRNA PRI 07-MAR-1995 DEFINITION Human bilirubin UDP-glucuronosyltransferase isozyme 1 mRNA, complete cds. ACCESSION M57899 NID g184472 KEYWORDS UDP-glucuronosyltransferase; bilirubin UDP-glucuronosyltransferase. SOURCE Human adult female liver, cDNA to mRNA, clones Z6, Z11 and Z6MB2. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2351) AUTHORS Ritter,J.K., Crawford,J.M. and Owens,I.S. TITLE Cloning of two human liver bilirubin UDP-glucuronosyltransferase cDNAs with expression in COS-1 cells JOURNAL J. Biol. Chem. 266 (2), 1043-1047 (1991) MEDLINE 91093210 FEATURES Location/Qualifiers source 1..2351 /organism="Homo sapiens" /isolate="AK" /db_xref="taxon:9606" /clone="Z6" /dev_stage="adult" /sex="female" /tissue_type="liver" /tissue_lib="lambda-ZAP" /map="2" gene 16..1617 /gene="UGT1" CDS 16..1617 /gene="UGT1" /standard_name="bilirubin UDP-glucuronosyltransferase isozyme 1" /EC_number="2.4.1.17" /codon_start=1 /db_xref="GDB:G00-120-007" /product="UDP-glucuronosyltransferase 1" /db_xref="PID:g184473" /translation="MAVESQGGRPLVLGLLLCVLGPVVSHAGKILLIPVDGSHWLSML GAIQQLQQRGHEIVVLAPDASLYIRDGAFYTLKTYPVPFQREDVKESFVSLGHNVFEN DSFLQRVIKTYKKIKKDSAMLLSGCSHLLHNKELMASLAESSFDVMLTDPFLPCSPIV AQYLSLPTVFFLHALPCSLEFEATQCPNPFSYVPRPLSSHSDHMTFLQRVKNMLIAFS QNFLCDVVYSPYATLASEFLQREVTVQDLLSSASVWLFRSDFVKDYPRPIMPNMVFVG GINCLHQNPLSQEFEAYINASGEHGIVVFSLGSMVSEIPEKKAMAIADALGKIPQTVL WRYTGTRPSNLANNTILVKWLPQNDLLGHPMTRAFITHAGSHGVYESICNGVPMVMMP LFGDQMDNAKRMETKGAGVTLNVLEMTSEDLENALKAVINDKSYKENIMRLSSLHKDR PVEPLDLAVFWVEFVMRHKGAPHLRPAAHDLTWYQYHSLDVIGFLLAVVLTVAFITFK CCAYGYRKCLGKKGRVKKAHKSKTH" sig_peptide 43..75 /gene="UGT1" /note="G00-120-007" mat_peptide 76..1614 /gene="UGT1" /standard_name="bilirubin UDP-glucuronosyltransferase isozyme 1" /EC_number="2.4.1.17" /note="G00-120-007" /product="UDP-glucuronosyltransferase 1" BASE COUNT 601 a 540 c 557 g 653 t ORIGIN 1 aggagcaaag gcgccatggc tgtggagtcc cagggcggac gcccacttgt cctgggcctg 61 ctgctgtgtg tgctgggccc agtggtgtcc catgctggga agatactgtt gatcccagtg 121 gatggcagcc actggctgag catgcttggg gccatccagc agctgcagca gaggggacat 181 gaaatagttg tcctagcacc tgacgcctcg ttgtacatca gagacggagc attttacacc 241 ttgaagacgt accctgtgcc attccaaagg gaggatgtga aagagtcttt tgttagtctc 301 gggcataatg tttttgagaa tgattctttc ctgcagcgtg tgatcaaaac atacaagaaa 361 ataaaaaagg actctgctat gcttttgtct ggctgttccc acttactgca caacaaggag 421 ctcatggcct ccctggcaga aagcagcttt gatgtcatgc tgacggaccc tttccttcct 481 tgcagcccca tcgtggccca gtacctgtct ctgcccactg tattcttctt gcatgcactg 541 ccatgcagcc tggaatttga ggctacccag tgccccaacc cattctccta cgtgcccagg 601 cctctctcct ctcattcaga tcacatgacc ttcctgcagc gggtgaagaa catgctcatt 661 gccttttcac agaactttct gtgcgacgtg gtttattccc cgtatgcaac ccttgcctca 721 gaattccttc agagagaggt gactgtccag gacctattga gctctgcatc tgtctggctg 781 tttagaagtg actttgtgaa ggattaccct aggcccatca tgcccaatat ggtttttgtt 841 ggtggaatca actgccttca ccaaaatcca ctatcccagg aatttgaagc ctacattaat 901 gcttctggag aacatggaat tgtggttttc tctttgggat caatggtctc agaaattcca 961 gagaagaaag ctatggcaat tgctgatgct ttgggcaaaa tccctcagac agtcctgtgg 1021 cggtacactg gaacccgacc atcgaatctt gcgaacaaca cgatacttgt taagtggcta 1081 ccccaaaacg atctgcttgg tcacccgatg acccgtgcct ttatcaccca tgctggttcc 1141 catggtgttt atgaaagcat atgcaatggc gttcccatgg tgatgatgcc cttgtttggt 1201 gatcagatgg acaatgcaaa gcgcatggag actaagggag ctggagtgac cctgaatgtt 1261 ctggaaatga cttctgaaga tttagaaaat gctctaaaag cagtcatcaa tgacaaaagt 1321 tacaaggaga acatcatgcg cctctccagc cttcacaagg accgcccggt ggagccgctg 1381 gacctggccg tgttctgggt ggagtttgtg atgaggcaca agggcgcgcc acacctgcgc 1441 cccgcagccc acgacctcac ctggtaccag taccattcct tggacgtgat tggtttcctc 1501 ttggccgtcg tgctgacagt ggccttcatc acctttaaat gttgtgctta tggctaccgg 1561 aaatgcttgg ggaaaaaagg gcgagttaag aaagcccaca aatccaagac ccattgagaa 1621 gtgggtggga aataaggtaa aattttgaac cattccctag tcatttccaa acttgaaaac 1681 agaatcagtg ttaaattcat tttattctta ttaaggaaat actttgcata aattaatcag 1741 ccccagagtg ctttaaaaaa ttctcttaaa taaaaataat agactcgcta gtcagtaaag 1801 atatttgaat atgtatcgtg ccccctccgg tgtctttgat caggatgaca tgtgccattt 1861 ttcagaggac gtgcagacag gctggcattc tagattactt ttcttactct gaaacatggc 1921 ctgtttggga gtgcgggatt caaaggtggt cccaccgctg cccctactgc aaatggcagt 1981 tttaatctta tcttttggct tctgcagatg gttgcaattg atccttaacc aataatggtc 2041 agtcctcatc tctgtcctgc ttcataggtg ccaccttgtg tgtttaaaga agggaagctt 2101 tgtaccttta gagtgtaggt gaaatgaatg aatggcttgg agtgcactga gaacagcata 2161 tgatttcttg ctttggggaa aaagaatgat gctatgaaat tggtgggtgg tgtatttgag 2221 aagataatca ttgcttatgt caaatggagc tgaatttgat aaaaacccaa aatacagcta 2281 tgaagtgctg ggcaagttta ctttttttct gatgtttcct acaactaaaa ataaattaat 2341 aaatttataa a // LOCUS HUMHUNTDIS 5672 bp mRNA PRI 08-NOV-1994 DEFINITION Homo sapiens Huntington disease-associated protein (HD) mRNA, complete cds. ACCESSION L20431 NID g398028 KEYWORDS Huntington's chorea protein. SOURCE Homo sapiens brain (frontal cortex) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5672) AUTHORS Lin,B., Rommens,J.M., Graham,R.K., Kalchman,M., MacDonald,H., Nasir,J., Delaney,A., Goldberg,Y.P. and Hayden,M.R. TITLE Differential 3' polyadenylation of the Huntington disease gene results in two mRNA species with variable tissue expression JOURNAL Hum. Mol. Genet. 2 (10), 1541-1545 (1993) MEDLINE 94093536 FEATURES Location/Qualifiers source 1..5672 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain (frontal cortex)" gene 2..1750 /gene="HD" CDS 2..1750 /gene="HD" /codon_start=1 /db_xref="GDB:G00-119-307" /product="Huntington disease-associated protein" /db_xref="PID:g398029" /translation="MVSKRENIATHHLYQAWDPVPSLSPATTGALISHEKLLLQINPE RELGSMSYKLGQVSIHSVWLGNSITPLREEEWDEEEEEEADAPAPSSPPTSPVNSRKH RAGVDIHSCSQFLLELYSRWILPSSSARRTPAILISEVVRSLLVVSDLFTERNQFELM YVTLTELRRVHPSEDEILAQYLVPATCKAAAVLGMDKAVAEPVSRLLESTLRSSHLPS RVGALHGILYVLECDLLDDTAKQLIPVISDYLLSNLKGIAHCVNIHSQQHVLVMCATA FYLIENYPLDVGPEFSASIIQMCGVMLSGSEESTPSIIYHCALRGLERLLLSEQLSRL DAESLVKLSVDRVNVHSPHRAMAALGLMLTCMYTGKEKVSPGRTSDPNPAAPDSESVI VAMERVSVLFDRIRKGFPCEARVVARILPQFLDDFFPPQDIMNKVIGEFLSNQQPYPQ FMATVVYKVFQTLHSTGQSSMVRDWVMLSLSNFTQRAPVAMATWSLSCFFVSASTSPW VAAILPHVISRMGKLEQVDVNLFCLVATDFYRHQIEEELDRRAFQSVLEVVAAPGSPY HRLLTCLRNVHKVTTC" 3'UTR 1751 BASE COUNT 1154 a 1601 c 1613 g 1304 t ORIGIN 1 aatggtttca aagagagaga atattgccac ccatcattta tatcaggcat gggatcctgt 61 cccttctctg tctccggcta ctacaggtgc cctcatcagc cacgagaagc tgctgctaca 121 gatcaacccc gagcgggagc tggggagcat gagctacaaa ctcggccagg tgtccataca 181 ctccgtgtgg ctggggaaca gcatcacacc cctgagggag gaggaatggg acgaggaaga 241 ggaggaggag gccgacgccc ctgcaccttc gtcaccaccc acgtctccag tcaactccag 301 gaaacaccgg gctggagttg acatccactc ctgttcgcag tttttgcttg agttgtacag 361 ccgctggatc ctgccgtcca gctcagccag gaggaccccg gccatcctga tcagtgaggt 421 ggtcagatcc cttctagtgg tctcagactt gttcaccgag cgcaaccagt ttgagctgat 481 gtatgtgacg ctgacagaac tgcgaagggt gcacccttca gaagacgaga tcctcgctca 541 gtacctggtg cctgccacct gcaaggcagc tgccgtcctt gggatggaca aggccgtggc 601 ggagcctgtc agccgcctgc tggagagcac gctcaggagc agccacctgc ccagcagggt 661 tggagccctg cacggcatcc tctatgtgct ggagtgcgac ctgctggacg acactgccaa 721 gcagctcatc ccggtcatca gcgactatct cctctccaac ctgaaaggga tcgcccactg 781 cgtgaacatt cacagccagc agcacgtact ggtcatgtgt gccactgcgt tttacctcat 841 tgagaactat cctctggacg tagggccgga attttcagca tcaataatac agatgtgtgg 901 ggtgatgctg tctggaagtg aggagtccac cccctccatc atttaccact gtgccctcag 961 aggcctggag cgcctcctgc tctctgagca gctctcccgc ctggatgcag aatcgctggt 1021 caagctgagt gtggacagag tgaacgtgca cagcccgcac cgggccatgg cggctctggg 1081 cctgatgctc acctgcatgt acacaggaaa ggagaaagtc agtccgggta gaacttcaga 1141 ccctaatcct gcagcccccg acagcgagtc agtgattgtt gctatggagc gggtatctgt 1201 tctttttgat aggatcagga aaggctttcc ttgtgaagcc agagtggtgg ccaggatcct 1261 gccccagttt ctagacgact tcttcccacc ccaggacatc atgaacaaag tcatcggaga 1321 gtttctgtcc aaccagcagc cataccccca gttcatggcc accgtggtgt ataaggtgtt 1381 tcagactctg cacagcaccg ggcagtcgtc catggtccgg gactgggtca tgctgtccct 1441 ctccaacttc acgcagaggg ccccggtcgc catggccacg tggagcctct cctgcttctt 1501 tgtcagcgcg tccaccagcc cgtgggtcgc ggcgatcctc ccacatgtca tcagcaggat 1561 gggcaagctg gagcaggtgg acgtgaacct tttctgcctg gtcgccacag acttctacag 1621 acaccagata gaggaggagc tcgaccgcag ggccttccag tctgtgcttg aggtggttgc 1681 agccccagga agcccatatc accggctgct gacttgttta cgaaatgtcc acaaggtcac 1741 cacctgctga gcgccatggt gggagagact gtgaggcggc agctggggcc ggagcctttg 1801 gaagtctgtg cccttgtgcc ctgcctccac cgagccagct tggtccctat gggcttccgc 1861 acatgccgcg ggcggccagg caacgtgcgt gtctctgcca tgtggcagaa gtgctctttg 1921 tggcagtggc caggcaggga gtgtctgcag tcctggtggg gctgagcctg aggccttcca 1981 gaaagcagga gcagctgtgc tgcaccccat gtgggtgacc aggtcctttc tcctgatagt 2041 cacctgctgg ttgttgccag gttgcagctg ctcttgcatc tgggccagaa gtcctccctc 2101 ctgcaggctg gctgttggcc cctctgctgt cctgcagtag aaggtgccgt gagcaggctt 2161 tgggaacact ggcctgggtc tccctggtgg ggtgtgcatg ccacgccccg tgtctggatg 2221 cacagatgcc atggcctgtg ctgggccagt ggctgggggt gctagacacc cggcaccatt 2281 ctcccttctc tcttttcttc tcaggattta aaatttaatt atatcagtaa agagattaat 2341 tttaacgtaa ctctttctat gcccgtgtaa agtatgtgaa tcgcaaggcc tgtgctgcat 2401 gcgacagcgt ccggggtggt ggacagggcc cccggccacg ctccctctcc tgtagccact 2461 ggcatagccc tcctgagcac ccgctgacat ttccgttgta catgttcctg tttatgcatt 2521 cacaaggtga ctgggatgta gagaggcgtt agtgggcagg tggccacagc aggactgagg 2581 acaggccccc attatcctag gggtgcgctc aactgcagcc cctcctcctc gggcacagac 2641 gactgtcgtt ctccacccac cagtcaggga cagcagcctc cctgtcactc agctgagaag 2701 gccagccctc cctggctgtg agcagcctcc actgtgtcca gagacatggg cctcccactc 2761 ctgttccttg ctagccctgg ggtggcgtct gcctaggagc tggctggcag gtgttgggac 2821 ctgctgctcc atggatgcat gccctaagag tgtcactgag ctgtgttttg tctgagcctc 2881 tctcggtcaa cagcaaagct tggtgtcttg gcactgttag tgacagagcc cagcatccct 2941 tctgcccccg ttccagctga catcttgcac ggtgacccct tttagtcagg agagtgcaga 3001 tctgtgctca tcggagactg ccccacggcc ctgtcagagc cgccactcct atccccagga 3061 caggtccctg gaccagcctc ctgtttgcag gcccagagga gccaagtcat taaaatggaa 3121 gtggattctg gatggccggg ctgctgctga tgtaggagct ggatttggga gctctgcttg 3181 ccgactggct gtgagacgag gcaggggctc tgcttcctca gccctagagg cgagccaggc 3241 aaggttggcg actgtcatgt ggcttggttt ggtcatgccc gtcgatgttt tgggtattga 3301 atgtggtaag tggaggaaat gttggaactc tgtgcaggtg ctgccttgag acccccaagc 3361 ttccacctgt ccctctccta tgtggcagct ggggagcagc tgagatgtgg acttgtatgc 3421 tgcccacata cgtgaggggg agctgaaagg gagcccctgc tcaaagggag cccctcctct 3481 gagcagcctc tgccaggcct gtatgaggct tttcccacca gctcccaaca gaggcctccc 3541 ccagccagga ccacctcgtc ctcgtggcgg ggcagcagga gcggtagaaa ggggtccgat 3601 gtttgaggag gcccttaagg gaagctactg aattataaca cgtaagaaaa tcaccattct 3661 tccgtattgg ttgggggctc ctgtttctca tcctagcttt ttcctggaaa agcccgctag 3721 aaggtttggg aacgagggga aagttctcag aactgttgct gctccccacc cgcctcccgc 3781 ctcccccgca ggttatgtca gcagctctga gacagcagta tcacaggcca gatgttgttc 3841 ctggctagat gtttacattt gtaagaaata acactgtgaa tgtaaaacag agccattccc 3901 ttggaatgca tatcgctggg ctcaacatag agtttgtctt cctcttgttt acgacgtgat 3961 ctaaaccagt ccttagcaag gggctcagaa caccccgctc tggcagtagg tgtcccccac 4021 ccccaaagac ctgcctgtgt gctccggaga tgaatatgag ctcattagta aaaatgactt 4081 cacccacgca tatacataaa gtatccatgc atgtgcatat agacacatct ataattttac 4141 acacacacct ctcaagacgg agatgcatgg cctctaagag tgcccgtgtc ggttcttcct 4201 ggaagttgac tttccttaga cccgccaggt caagttagcc gcgtgacgga catccaggcg 4261 tgggacgtgg tcagggcagg gctcattcat tgcccactag gatcccactg gcgaagatgg 4321 tctccatatc agctctctgc agaagggagg aagactttat catgttccta aaaatctgtg 4381 gcaagcaccc atcgtattat ccaaattttg ttgcaaatgt gattaatttg gttgtcaagt 4441 tttgggggtg ggctgtgggg agattgcttt tgttttcctg ctggtaatat cgggaaagat 4501 tttaatgaaa ccagggtaga attgtttggc aatgcactga agcgtgtttc tttcccaaaa 4561 tgtgcctccc ttccgctgcg ggcccagctg agtctatgta ggtgatgttt ccagctgcca 4621 agtgctcttt gttactgtcc accctcattt ctgccagcgc atgtgtcctt tcaaggggaa 4681 aatgtgaagc tgaaccccct ccagacaccc agaatgtagc atctgagaag gccctgtgcc 4741 ctaaaggaca cccctcgccc ccatcttcat ggagggggtc atttcagagc cctcggagcc 4801 aatgaacagc tcctcctctt ggagctgaga tgagccccac gtggagctcg ggacggatag 4861 tagacagcaa taactcggtg tgtggccgcc tggcaggtgg aacttcctcc cgttgcgggg 4921 tggagtgagg ttagttctgt gtgtctggtg ggtggagtca ggcttctctt gctacctgtg 4981 agcatccttc ccagcagaca tcctcatcgg gctttgtccc tcccccgctt cctccctctg 5041 cggggaggac ccgggaccac agctgctggc cagggtagac ttggagctgt cctccagagg 5101 ggtcacgtgt aggagtgaga agaaggaaga tcttgagagc tgctgaggga ccttggagag 5161 ctcaggatgg ctcagacgag gacactcgct tgccgggcct ggccctcctg ggaaggaggg 5221 agctgctcag aatgccgcat gacaactgaa ggcaacctgg aaggttcagg gcccgctctt 5281 cccccatgtg cctgtcacgc tctggtgcag tcaaaggaac gccttcccct cagttgtttc 5341 taagagcaga gtctcccgct gcaatctggg tggtaactgc cagccttgga ggatcgtggc 5401 caacgtggac ctgcctacgg agggtgggct ctgacccaag tggggcctcc ttgcccaggt 5461 ctcactgctt tgcaccgtgg tcagagggac tgtcagctga gcttgagctc ccctggagcc 5521 agcagggctg tgatgggcga gtcccggagc cccacccaga cctgaatgct tctgagagca 5581 aagggaagga ctgacgagag atgtatattt aattttttaa ctgctgcaaa cattgtacat 5641 ccaaattaaa gggaaaaaat ggaaaccatc aa // LOCUS HUMHVDC 7032 bp mRNA PRI 09-JAN-1995 DEFINITION Homo sapiens (clone pcDNA-alpha1E-1) voltage-dependent calcium channel alpha-1E-1 subunit mRNA, complete cds. ACCESSION L29384 NID g495867 KEYWORDS neuronal calcium channel; voltage-dependent calcium channel. SOURCE Homo sapiens brain, hippocampus cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7032) AUTHORS Williams,M.E., Marubio,L.M., Deal,C.R., Hans,M., Brust,P.F., Philipson,L.H., Miller,R.J., Johnson,E.C., Harpold,M.M. and Ellis,S.B. TITLE Structure and functional characterization of neuronal alpha 1E calcium channel subtypes JOURNAL J. Biol. Chem. 269 (35), 22347-22357 (1994) MEDLINE 94350992 FEATURES Location/Qualifiers source 1..7032 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain, hippocampus" 5'UTR 1..165 CDS 166..6921 /note="brain specific" /codon_start=1 /product="voltage-dependent calcium channel alpha-1E-1" /db_xref="PID:g495868" /translation="MARFGEAVVARPGSGDGDSDQSRNRQGTPVPASGQAAAYKQTKA QRARTMALYNPIPVRQNCFTVNRSLFIFGEDNIVRKYAKKLIDWPPFEYMILATIIAN CIVLALEQHLPEDDKTPMSRRLEKTEPYFIGIFCFEAGIKIVALGFIFHKGSYLRNGW NVMDFIVVLSGILATAGTHFNTHVDLRTLRAVRVLRPLKLVSGIPSLQIVLKSIMKAM VPLLQIGLLLFFAILMFAIIGLEFYSGKLHRACFMNNSGILEGFDPPHPCGVQGCPAG YECKDWIGPNDGITQFDNILFAVLTVFQCITMEGWTTVLYNTNDALGATWNWLYFIPL IIIGSFFVLNLVLGVLSGEFAKERERVENRRAFMKLRRQQQIERELNGYRAWIDKAEE VMLAEENKNAGTSALEVLRRATIKRSRTEAMTRDSSDEHCVDISSVGTPLARASIKSA KVDGVSYFRHKERLLRISIRHMVKSQVFYWIVLSLVALNTACVAIVHHNQPQWLTHLL YYAEFLFLGLFLLEMSLKMYGMGPRLYFHSSFNCFDFGVTVGSIFEVVWAIFRPGTSF GISVLRALRLLRIFKITKYWASLRNLVVSLMSSMKSIISLLFLLFLFIVVFALLGMQL FGGRFNFNDGTPSANFDTFPAAIMTVFQILTGEDWNEVMYNGIRSQGGVSSGMWSAIY FIVLTLFGNYTLLNVFLAIAVDNLANAQELTKDEQEEEEAFNQKHALQKAKEVSPMSA PNMPSIERERRRRHHMSVWEQRTSQLRKHMQMSSQEALNREEAPTMNPLNPLNPLSSL NPLNAHPSLYRRPRAIEGLALGLALEKFEEERISRGGSLKGDGGDRSSALDNQRTPLS LGQREPPWLARPCHGNCDPTQQEAGGGEAVVTFEDRARHRQSQRRSRHRRVRTEGKES SSASRSRSASQERSLDEAMPTEGEKDHELRGNHGAKEPTIQEERAQDLRRTNSLMVSR GSGLAGGLDEADTPLVLPHPELEVGKHVVLTEQEPEGSSEQALLGNVQLDMGRVISQS EPDLSCITANTDKATTESTSVTVAIPDVDPLVDSTVVHISNKTDGEASPLKEAEIRED EEEVEKKKQKKEKRETGKAMVPHSSMFIFSTTNPIRRACHYIVNLRYFEMCILLVIAA SSIALAAEDPVLTNSERNKVLRYFDYVFTGVFTFEMVIKMIDQGLILQDGSYFRDLWN ILDFVVVVGALVAFALANALGTNKGRDIKTIKSLRVLRVLRPLKTIKRLPKLKAVFDC VVTSLKNVFNILIVYKLFMFIFAVIAVQLFKGKFFYCTDSSKDTEKECIGNYVDHEKN KMEVKGREWKRHEFHYDNIIWALLTLFTVSTGEGWPQVLQHSVDVTEEDRGPSRSNRM EMSIFYVVYFVVFPFFFVNIFVALIIITFQEQGDKMMEECSLEKNERACIDFAISAKP LTRYMPQNRHTFQYRVWHFVVSPSFEYTIMAMIALNTVVLMMKYYSAPCTYELALKYL NIAFTMVFSLECVLKVIAFGFLNYFRDTWNIFDFITVIGSITEIILTDSKLVNTSGFN MSFLKLFRAARLIKLLRQGYTIRILLWTFVQSFKALPYVCLLIAMLFFIYAIIGMQVF GNIKLDEESHINRHNNFRSFFGSLMLLFRSATGEAWQEIMLSCLGEKGCEPDTTAPSG QNENERCGTDLAYVYFVSFIFFCSFLMLNLFVAVIMDNFEYLTRDSSILGPHHLDEFV RVWAEYDRAACGRIHYTEMYEMLTLMSPPLGLGKRCPSKVAYKRLVLMNMPVAEDMTV HFTSTLMALIRTALDIKIAKGGADRQQLDSELQKETLAIWPHLSQKMLDLLVPMPKAS DLTVGKIYAAMMIMDYYKQSKVKKQRQQLEEQKNAPMFQRMEPSSLPQEIIANAKALP YLQQDPVSGLSGRSGYPSMSPLSPQDIFQLACMDPADDGQFQERQSLVVTDPSSMRRS FSTIRDKRSNSSWLEEFSMERSSENTYKSRRRSYHSSLRLSAHRLNSDSGHKSDTHPS GGRERRRSKERKHLLSPDVSRCNSEERGTQADWESPERRQSRSPSEGRSQTPNRQGTG SLSESSIPSVSDTSTPRRSRRQLPPVPPKPRPLLSYSSLIRHAGSISPPADGSEEGSP LTSQALESNNAWLTESSNSPHPQQRQHASPQRYISEPYLALHEDSHASDCVEEETLTF EAAVATSLGRSNTIGSAPPLRHSWQMPNGHYRRRRRGGPGPGMMCGAVNNLLSDTEED DKC" 3'UTR 6922..7032 BASE COUNT 1604 a 1953 c 1903 g 1572 t ORIGIN 1 gctgctgctg cctctccgaa gagctcgcgg agctccccag aggcggtggt ccccgtgctt 61 gtctggatgc ggctctgagt ctccgtgtgt ctttctgctt gttgctgtgt gcgggtgttc 121 ggccgcgatc acctttgtgt gtcttctgtc tgtttaaacc tcaggatggc tcgcttcggg 181 gaggcggtgg tcgccaggcc agggtccggc gatggagact cggaccagag caggaaccgg 241 caaggaaccc ccgtgccggc ctcggggcag gcggccgcct acaagcagac gaaagcacag 301 agggcgcgga ctatggcttt gtacaacccc attcccgtcc ggcagaactg tttcaccgtc 361 aacagatccc tgttcatctt cggagaagat aacattgtca ggaaatatgc caagaagctc 421 atcgattggc cgccatttga gtacatgatc ctggccacca tcattgccaa ctgcatcgtc 481 ctggccctgg agcagcatct tcctgaggat gacaagaccc ccatgtcccg aagactggag 541 aagacagaac cttatttcat tgggatcttt tgctttgaag ctgggatcaa aattgtggcc 601 ctggggttca tcttccataa gggctcttac ctccgcaatg gctggaatgt catggacttc 661 atcgtggtcc tcagtggcat cctggccact gcaggaaccc acttcaatac tcacgtggac 721 ctgaggaccc tccgggctgt gcgtgtcctg cggcctttga agctcgtgtc agggatacct 781 agcctgcaga ttgtgttgaa gtccatcatg aaggccatgg tacctcttct gcagattggc 841 cttctgctct tctttgccat cctgatgttt gctatcattg gtttggagtt ctacagtggc 901 aagttacatc gagcgtgctt catgaacaat tcaggtattc tagaaggatt tgacccccct 961 cacccatgtg gtgtgcaggg ctgcccagct ggttatgaat gcaaggactg gatcggcccc 1021 aatgatggga tcacccagtt tgataacatc ctttttgctg tgctgactgt cttccagtgc 1081 atcaccatgg aagggtggac cactgtgctg tacaatacca atgatgcctt aggagccacc 1141 tggaattggc tgtacttcat ccccctcatc atcattggat ccttctttgt tctcaaccta 1201 gtcctgggag tgctttccgg ggaatttgcc aaagagagag agagagtgga gaaccgaagg 1261 gctttcatga agctgcggcg ccagcagcag attgagcgtg agctgaatgg ctaccgtgcc 1321 tggatagaca aagcagagga agtcatgctc gctgaagaaa ataaaaatgc tggaacatcc 1381 gccttagaag tgcttcgaag ggcaaccatc aagaggagcc ggacagaggc catgactcga 1441 gactccagtg atgagcactg tgttgatatc tcctctgtgg gcacacctct ggcccgagcc 1501 agtatcaaaa gtgcaaaggt agacggggtc tcttatttcc ggcacaagga aaggcttctg 1561 cgcatctcca ttcgccacat ggttaaatcc caggtgtttt actggattgt gctgagcctt 1621 gtggcactca acactgcctg tgtggccatt gtccatcaca accagcccca gtggctcacc 1681 cacctcctct actatgcaga atttctgttt ctgggactct tcctcttgga gatgtccctg 1741 aagatgtatg gcatggggcc tcgcctttat tttcactctt cattcaactg ctttgatttt 1801 ggggtcacag tgggcagtat ctttgaagtg gtctgggcaa tcttcagacc tggtacgtct 1861 tttggaatca gtgtcttgcg agccctccgg cttctaagaa tatttaaaat aaccaagtat 1921 tgggcttccc tacggaattt ggtggtctcc ttgatgagct caatgaagtc tatcatcagt 1981 ttgcttttcc tcctcttcct cttcatcgtt gtctttgctc tcctaggaat gcagttattt 2041 ggaggcaggt ttaactttaa tgatgggact ccttcggcaa attttgatac cttccctgca 2101 gccatcatga ctgtgttcca gatcctgacg ggtgaggact ggaatgaggt gatgtacaat 2161 gggatccgct cccagggtgg ggtcagctca ggcatgtggt ctgccatcta cttcattgtg 2221 ctcaccttgt ttggcaacta cacgctactg aatgtgttct tggctatcgc tgtggataat 2281 ctcgccaacg cccaggaact gaccaaggat gaacaggagg aagaagaggc cttcaaccag 2341 aaacatgcac tgcagaaggc caaggaggtc agcccgatgt ctgcacccaa catgccttcg 2401 atcgagaggg agcggaggcg ccggcaccac atgtccgtgt gggagcagcg taccagccag 2461 ctgaggaagc acatgcagat gtccagccag gaggccctca acagagagga ggcgccgacc 2521 atgaacccgc tcaaccccct caacccgctc agctccctca acccgctcaa tgcccacccc 2581 agcctttatc ggcgacccag ggccattgag ggcctggccc tgggcctggc cctggagaag 2641 ttcgaggagg agcgcatcag ccgtgggggg tccctcaagg gggatggagg ggaccgatcc 2701 agtgccctgg acaaccagag gacccctttg tccctgggcc agcgggagcc accatggctg 2761 gccaggccct gtcatggaaa ctgtgacccg actcagcagg aggcaggggg aggagaggct 2821 gtggtgacct ttgaggaccg ggccaggcac aggcagagcc aacggcgcag ccggcatcgc 2881 cgcgtcagga cagaaggcaa ggagtcctct tcagcctccc ggagcaggtc tgccagccag 2941 gaacgcagtc tggatgaagc catgcccact gaaggggaga aggaccatga gctcaggggc 3001 aaccatggtg ccaaggagcc aacgatccaa gaagagagag cccaggattt aaggaggacc 3061 aacagtctga tggtgtccag aggctccggg ctggcaggag gccttgatga ggctgacacc 3121 cccctagtcc tgccccatcc tgagctggaa gtggggaagc acgtggtgct gacggagcag 3181 gagccagaag gcagcagtga gcaggccctg ctggggaatg tgcagctaga catgggccgg 3241 gtcatcagcc agagcgagcc tgacctctcc tgcatcacgg ccaacacgga caaggccacc 3301 accgagagca ccagcgtcac cgtcgccatc cccgacgtgg accccttggt ggactcaacc 3361 gtggtgcaca ttagcaacaa gacggatggg gaagccagtc ccttgaagga ggcagagatc 3421 agagaggatg aggaggaggt ggagaagaag aagcagaaga aggagaagcg tgagacaggc 3481 aaagccatgg tgccccacag ctcaatgttc atcttcagca ccaccaaccc gatccggagg 3541 gcctgccact acatcgtgaa cctgcgctac tttgagatgt gcatcctcct ggtgattgca 3601 gccagcagca tcgccctggc ggcagaggac cccgtcctga ccaactcgga gcgcaacaaa 3661 gtcctgaggt attttgacta tgtgttcacg ggcgtgttca cctttgagat ggttataaag 3721 atgatagacc aaggcttgat cctgcaggat gggtcctact tccgagactt gtggaacatc 3781 ctggactttg tggtggtcgt tggcgcattg gtggcctttg ctctggcgaa cgctttggga 3841 accaacaaag gacgggacat caagaccatc aagtctctgc gggtgctccg agttctaagg 3901 ccactgaaaa ccatcaagcg cttgcccaag ctcaaggccg tcttcgactg cgtagtgacc 3961 tccttgaaga atgtcttcaa catactcatt gtgtacaagc tcttcatgtt catctttgct 4021 gtcatcgcag ttcagctctt caagggaaag ttcttttatt gcacggacag ttccaaggac 4081 acagagaagg agtgcatagg caactatgta gatcacgaga aaaacaagat ggaggtgaag 4141 ggccgggaat ggaagcgcca tgaattccac tacgacaaca ttatctgggc cctgctgacc 4201 ctcttcaccg tctccacagg ggaaggatgg cctcaagttc tgcagcactc tgtagatgtg 4261 acagaggaag accgaggccc aagccgcagc aaccgcatgg agatgtctat cttttatgta 4321 gtctactttg tggtcttccc cttcttcttt gtcaatatct ttgtggctct catcatcatc 4381 accttccagg agcaagggga taagatgatg gaggagtgca gcctggagaa gaatgagagg 4441 gcgtgcatcg acttcgccat cagcgccaaa cctctcaccc gctacatgcc gcagaacaga 4501 cacaccttcc agtaccgcgt gtggcacttt gtggtgtctc cgtcctttga gtacaccatt 4561 atggccatga tcgccttgaa tactgttgtg ctgatgatga agtattattc tgctccctgt 4621 acctatgagc tggccctgaa gtacctgaat atcgccttca ccatggtgtt ttccctggaa 4681 tgtgtcctga aggtcatcgc ttttggcttt ttgaactatt tccgagacac ctggaatatc 4741 tttgacttca tcaccgtgat tggcagtatc acagaaatta tcctgacaga cagcaagctg 4801 gtgaacacca gtggcttcaa tatgagcttt ctgaagctct tccgagctgc ccgcctcata 4861 aagctcctgc gtcagggcta taccatacgc attttgctgt ggacctttgt gcagtccttt 4921 aaggccctcc cttatgtctg ccttttaatt gccatgcttt tcttcattta tgccatcatt 4981 gggatgcagg tatttggaaa cataaaatta gacgaggaga gtcacatcaa ccggcacaac 5041 aacttccgga gtttctttgg gtccctaatg ctactcttca ggagtgccac aggtgaggcc 5101 tggcaggaga ttatgctgtc atgccttggg gagaagggct gtgagcctga caccaccgca 5161 ccatcagggc agaacgagaa tgaacgctgc ggcaccgatc tggcctacgt gtactttgtc 5221 tccttcatct tcttctgctc cttcttgatg ctcaacctgt ttgtggccgt catcatggac 5281 aactttgagt acctgactcg ggactcctcc atcctggggc ctcaccactt ggacgagttt 5341 gtccgcgtct gggcagaata tgaccgagca gcatgtggcc gcatccatta cactgagatg 5401 tatgaaatgc tgactctcat gtcacctccg ctaggcctcg gcaagagatg tccctccaaa 5461 gtggcatata agaggttggt cctgatgaac atgccagtag ctgaggacat gacggtccac 5521 ttcacctcca cacttatggc tctgatccgg acagctctgg acattaaaat tgccaaaggt 5581 ggtgcagaca ggcagcagct agactcagag ctacaaaagg agaccctagc catctggcct 5641 cacctatccc agaagatgct ggatctgctt gtgcccatgc ccaaagcctc tgacctgact 5701 gtgggcaaaa tctatgcagc aatgatgatc atggactact ataagcagag taaggtgaag 5761 aagcagaggc agcagctgga ggaacagaaa aatgccccca tgttccagcg catggagcct 5821 tcatctctgc ctcaggagat cattgctaat gccaaagccc tgccttacct ccagcaggac 5881 cccgtttcag gcctgagtgg ccggagtgga tacccttcga tgagtccact ctctccccag 5941 gatatattcc agttggcttg tatggacccc gccgatgacg gacagttcca agaacggcag 6001 tctctggtgg tgacagaccc tagctccatg agacgttcat tttccactat tcgggataag 6061 cgttcaaatt cctcgtggtt ggaggaattc tccatggagc gaagcagtga aaatacctac 6121 aagtcccgtc gccggagtta ccactcctcc ttgcggctgt cagcccaccg cctgaactct 6181 gattcaggcc acaagtctga cactcacccc tcagggggca gggagcggcg acgatcaaaa 6241 gagcgaaagc atcttctctc tcctgatgtc tcccgctgca attcagaaga gcgagggacc 6301 caggctgact gggagtcccc agagcgccgt caatccaggt cacccagtga gggcaggtca 6361 cagacgccca acagacaggg cacaggttcc ctaagtgaga gctccatccc ctctgtctct 6421 gacaccagca ccccaagaag aagtcgtcgg cagctcccac ccgtcccgcc aaagccccgg 6481 cccctccttt cctacagctc cctgattcga cacgcgggca gcatctctcc acctgctgat 6541 ggaagcgagg agggctcccc gctgacctcc caagctctgg agagcaacaa tgcttggctg 6601 accgagtctt ccaactctcc gcacccccag cagaggcaac atgcctcccc acagcgctac 6661 atctccgagc cctacttggc cctgcacgaa gactcccacg cctcagactg tgttgaggag 6721 gagacgctca ctttcgaagc agccgtggct actagcctgg gccgttccaa caccatcggc 6781 tcagccccac ccctgcggca tagctggcag atgcccaacg ggcactatcg gcggcggagg 6841 cgcggggggc ctgggccagg catgatgtgt ggggctgtca acaacctgct aagtgacacg 6901 gaagaagatg acaaatgcta gaggctgctc ccccctccga tgcatgctct tctctcacat 6961 ggagaaaacc aagacagaat tgggaagcca gtgcggcccc gcggggagga agagggaaaa 7021 ggaagatgga ag // LOCUS HUMHVRP 1640 bp mRNA PRI 16-FEB-1995 DEFINITION Human helodermin-preferring VIP receptor (VIP2/PACAP receptor) mRNA, complete cds. ACCESSION L36566 NID g550477 KEYWORDS PACAP receptor; VIP receptor; helodermin-preferring VIP receptor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1640) AUTHORS Svoboda,M., Tastenoy,M., Van Rampelbergh,J., Goossens,J.F., De Neef,P., Waelbroeck,M. and Robberecht,P. TITLE Molecular cloning and functional characterization of a human VIP receptor from SUP-T1 lymphoblasts JOURNAL Biochem. Biophys. Res. Commun. 205 (3), 1617-1624 (1994) MEDLINE 95110300 FEATURES Location/Qualifiers source 1..1640 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="SUP T1 lymphoblast" /clone_lib="lamda ZAP II" sig_peptide 163..231 /note="putative; putative" CDS 163..1479 /note="human VIP2 receptor was previously called 'helodermin-preferring VIP receptor'; transmembrane domains are located at positions: 637-696; 772-844; 880-948; 1003-1071; 1147-1209; 1243-1302.; Potential glycosylation sites are located at :334-336; 424-426; 436-438." /codon_start=1 /function="VIP and PACAP receptor" /db_xref="PID:g550478" /translation="MRTLLPPALLTCWLLAPVNSIHPECRFHLEIQEEETKCTELLRS QTEKHKACSGVWDNITCWRPANVGETVTVPCPKVFSNFYSKAGNISKNCTSDGWSETF PDFVDACGYSDPEDESKITFYILVKAIYTLGYSVSLMSLATGSIILCLFRKLHCTRNY IHLNLFLSFILRAISVLVKDDVLYSSSGTLHCPDQPSSWVGCKLSLVFLQYCIMANFF WLLVEGLYLHTLLVAMLPPRRCFLAYLLIGWGLPTVCIGAWTAARLYLEDTGCWDTND HSVPWWVIRIPILISIIVNFVLFISIIRILLQKLTSPDVGGNDQSQYKRLAKSTLLLI PLFGVHYMVFAVFPISISSKYQILFELCLGSFQGLVVAVLYCFLNSEVQCELKRKWRS RCPTPSASRDYRVCGSSFSHNGSEGALQFHRASRAQSFLQTETSVI" mat_peptide 232..1476 /function="VIP and PACAP receptor" BASE COUNT 315 a 512 c 461 g 352 t ORIGIN 1 cgggacgagg gggcggcccc cgcgctcggg gcgctcggct acagctgcgg ggcccgaggt 61 ctccgcgcac tcgctcccgg cccatgctgg aggcggcgga acccggggga cctaggacgg 121 aggcggcggg cgctgggcgg cccccggcac gctgagctcg ggatgcggac gctgctgcct 181 cccgcgctgc tgacctgctg gctgctcgcc cccgtgaaca gcattcaccc agaatgccga 241 tttcatctgg aaatacagga ggaagaaaca aaatgtacag agcttctgag gtctcaaaca 301 gaaaaacaca aagcctgcag tggcgtctgg gacaacatca cgtgctggcg gcctgccaat 361 gtgggagaga ccgtcacggt gccctgccca aaagtcttca gcaattttta cagcaaagca 421 ggaaacataa gcaaaaactg tacgagtgac ggatggtcag agacgttccc agatttcgtc 481 gatgcctgtg gctacagcga cccggaggat gagagcaaga tcacgtttta tattctggtg 541 aaggccattt ataccctggg ctacagtgtc tctctgatgt ctcttgcaac aggaagcata 601 attctgtgcc tcttcaggaa gctgcactgc accaggaatt acatccacct gaacctgttc 661 ctgtccttca tcctgagagc catctcagtg ctggtcaagg acgacgttct ctactccagc 721 tctggcacgt tgcactgccc tgaccagcca tcctcctggg tgggctgcaa gctgagcctg 781 gtcttcctgc agtactgcat catggccaac ttcttctggc tgctggtgga ggggctctac 841 ctccacaccc tcctggtggc catgctcccc cctagaaggt gcttcctggc ctacctcctg 901 atcggatggg gcctccccac cgtctgcatc ggtgcatgga ctgcggccag gctctactta 961 gaagacaccg gttgctggga tacaaacgac cacagtgtgc cctggtgggt catacgaata 1021 ccgattttaa tttccatcat cgtcaatttt gtccttttca ttagtattat acgaattttg 1081 ctgcagaagt taacatcccc agatgtcggc ggcaacgacc agtctcagta caagaggctg 1141 gccaagtcca cgctcctgct tatcccgctg ttcggcgtcc actacatggt gtttgccgtg 1201 tttcccatca gcatctcctc caaataccag atactgtttg agctgtgcct cgggtcgttc 1261 cagggcctgg tggtggccgt cctctactgt ttcctgaaca gtgaggtgca gtgcgagctg 1321 aagcgaaaat ggcgaagccg gtgcccgacc ccgtccgcga gccgggatta cagggtctgc 1381 ggttcctcct tctcccacaa cggctcggag ggcgccctgc agttccaccg cgcgtcccga 1441 gcccagtcct tcctgcaaac ggagacctcg gtcatctagc cccacccctg cctgtcggac 1501 gcggcgggag gcccacggtt cggggcttct gcggggctga gacgccggct tcctccttcc 1561 agatgcccga gcaccgtgtc gggcaggtca gcgcggtcct gactccgtca agctggttgt 1621 ccactaaacc ccatacctgg // LOCUS HUMHXBP1 1818 bp mRNA PRI 11-JUN-1993 DEFINITION Human X box binding protein-1 (XBP-1) mRNA, complete cds. ACCESSION M31627 NID g184485 KEYWORDS DNA-binding protein; X box binding protein; major histocompatibility complex. SOURCE Human B cell JY, cDNA to mRNA, clone JY 113. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1818) AUTHORS Liou,H.-C., Boothby,M.R., Finn,P.W., Davidon,R., Nabavi,N., Zeleznik-Le,N.J., Ting,J.P.-Y. and Glimcher,L.H. TITLE A new member of the leucine zipper class of proteins that binds to the HLA DR-alpha promoter JOURNAL Science 247, 1581-1584 (1990) MEDLINE 90208323 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.-C.Liou, 25-JAN-1990, for release after publication. FEATURES Location/Qualifiers source 1..1818 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 13..795 /note="X box binding protein-1" /codon_start=1 /db_xref="PID:g306893" /translation="MVVVAAAPNPADGTPKVLLLSGQPASAAGAPAARLPLMVPAQRG ASPEAASGGLPQARKRQRLTHLSPEEKALRRKLKNRVAAQTARDRKKARMSELEQQVV DLEEENQKLLLENQLLREKTHGLVVENQELRQRLGMDALVAEEEAEAKGNEVRPVAGS AESAALRLRAPLQQVQAQLSPLQNISPWILAVLTLQIQSLISCWAFWTTWTQSCSSNA LPQSLPAWRSSQRSTQKDPVPYQPPFLCQWGRHQPSWKPLMN" BASE COUNT 480 a 437 c 437 g 464 t ORIGIN Chromosome 22. 1 tagtctggag ctatggtggt ggtggcagcc gcgccgaacc cggccgacgg gacccctaaa 61 gttctgcttc tgtcggggca gcccgcctcc gccgccggag ccccggcggc caggctgccg 121 ctcatggtgc cagcccagag aggggccagc ccggaggcag cgagcggggg gctgccccag 181 gcgcgcaagc gacagcgcct cacgcacctg agccccgagg agaaggcgct gaggaggaaa 241 ctgaaaaaca gagtagcagc tcagactgcc agagatcgaa agaaggctcg aatgagtgag 301 ctggaacagc aagtggtaga tttagaagaa gagaaccaaa aacttttgct agaaaatcag 361 cttttacgag agaaaactca tggccttgta gttgagaacc aggagttaag acagcgcttg 421 gggatggatg ccctggttgc tgaagaggag gcggaagcca aggggaatga agtgaggcca 481 gtggccgggt ctgctgagtc cgcagcactc agactacgtg cacctctgca gcaggtgcag 541 gcccagttgt cacccctcca gaacatctcc ccatggattc tggcggtatt gactcttcag 601 attcagagtc tgatatcctg ttgggcattc tggacaactt ggacccagtc atgttcttca 661 aatgcccttc cccagagcct gccagcctgg aggagctccc agaggtctac ccagaaggac 721 ccagttcctt accagcctcc ctttctctgt cagtggggac gtcatcagcc aagctggaag 781 ccattaatga actaattcgt tttgaccaca tatataccaa gcccctagtc ttagagatac 841 cctctgagac agagagccaa gctaatgtgg tagtgaaaat cgaggaagca cctctcagcc 901 cctcagagaa tgatcaccct gaattcattg tctcagtgaa ggaagaacct gtagaagatg 961 acctcgttcc ggagctgggt atctcaaatc tgctttcatc cagccactgc ccaaagccat 1021 cttcctgcct actggatgct acagtgactg tggatacggg ggttcccttt ccccattcag 1081 tgacatgtcc tctctgcttg gtgtaaacat tcttgggagg acacttttgc caatgaactc 1141 tttccccagc tgattagtgt ctaaggaatg atccaatact gttgcccttt tccttgacta 1201 ttacactgcc tggaggatag cagagaagcc tgtctgtact tcattcaaaa agccaaaata 1261 gagagtatac agtcctagag aatccctcta tttgttcaga tctcatagat gacccccagg 1321 tattgccttt tgacatccag cagtccaagg tattgagaca tattactgga agtaagaaat 1381 attactataa ttgagaacta cagcttttaa gattgtactt ttaagattgt acttttatct 1441 taaaagggtg gtagttttcc ctaaaatact tattatgtaa gggtcattag acaaatgtct 1501 tgaagtagac atggaattta tgaatggtct ttatcatttc tcttccccct ttttggcatc 1561 ctggcttgcc tccagtttta ggtcctttag tttgcttctg caagcaacgg gaacacctgc 1621 tgagggggct ctttccctca tgtatacttc aagtaagatc aagaatcttt tgtgaaatta 1681 tagaaattta ctatgtaaat gcttgatgga attttttcct gctagtgtag cttctgaaag 1741 gtgctttctc catttattta aaaactaccc atgcaattaa aaggtacaat gcaaaaaaaa 1801 aaaaaaaaaa attttttt // LOCUS HUMHYMEGLA 1568 bp mRNA PRI 20-MAR-1996 DEFINITION Human hydroxymethylglutaryl-CoA lyase mRNA, complete cds. ACCESSION L07033 NID g184502 KEYWORDS . SOURCE Homo sapiens male liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1568) AUTHORS Mitchell,G.A., Robert,M.F., Hruz,P.W., Wang,S., Fontaine,G., Behnke,C.E., Mende-Mueller,L.M., Schappert,K., Lee,C., Gibson,K.M. et,al. TITLE 3-Hydroxy-3-methylglutaryl coenzyme A lyase (HL). Cloning of human and chicken liver HL cDNAs and characterization of a mutation causing human HL deficiency JOURNAL J. Biol. Chem. 268 (6), 4376-4381 (1993) MEDLINE 93179448 FEATURES Location/Qualifiers source 1..1568 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /tissue_type="liver" misc_feature 1..7 /standard_name="EcoRI-linker" /note="putative" CDS 15..992 /standard_name="3-hydroxy-3-methylglutaryl CoA; HMG CoA Lyase" /EC_number="4.1.3.4" /codon_start=1 /evidence=experimental /product="hydroxymethylglutaryl-CoA lyase" /db_xref="PID:g184503" /translation="MAAMRKALPRRLVGLASLRAVSTSSMGTLPKRVKIVEVGPRDGL QNEKNIVSTPVKIKLIDMLSEAGLSVIETTSFVSPKWVPQMGDHTEVLKGIQKFPGIN YPVLTPNLKGFEAAVAAGAKEVVIFGAASELFTKKNINCSIEESFQRFDAILKAAQSA NISVRGYVSCALGCPYEGKISPAKVAEVTKKFYSMGCYEISLGDTIGVGTPGIMKDML SAVMQEVPLAALAVHCHDTYGQALTNTLMALQMGVSVVDSSVAGLGGCPYAQGASGNL ATEDLVYMLEGLGIHTGVNLQKLLEAGNFICQALNRKTSSKVAQATCKL" BASE COUNT 377 a 397 c 437 g 357 t ORIGIN 1 gaattccggc caagatggca gcaatgagga aggcgcttcc gcggcgactg gtgggcttgg 61 cgtccctccg ggctgtcagc acctcatcta tgggcacttt accaaagcgg gtgaaaattg 121 tggaagttgg tccccgagat ggactacaaa atgaaaagaa tatcgtatct actccagtga 181 aaatcaagct gatagacatg ctttctgaag caggactctc tgttatagaa accaccagct 241 ttgtgtctcc taagtgggtt ccccagatgg gtgaccacac tgaagtcttg aagggcattc 301 agaagtttcc tggcatcaac tacccagtcc tgaccccaaa tttgaaaggc ttcgaggcag 361 cggttgctgc tggagccaag gaagtagtca tctttggagc tgcctcagag ctcttcacca 421 agaagaacat caattgttcc atagaggaga gttttcagag gtttgacgca atcctgaagg 481 cagcgcagtc agccaatatt tctgtgcggg ggtacgtctc ctgtgctctt ggctgccctt 541 atgaagggaa gatctcccca gctaaagtag ctgaggtcac caagaagttc tactcaatgg 601 gctgctacga gatctccctg ggggacacca ttggtgtggg caccccaggg atcatgaaag 661 acatgctatc tgctgtcatg caggaagtgc ctctggctgc cctggctgtc cactgccatg 721 acacctatgg tcaagccctg accaacacct tgatggccct gcagatggga gtgagtgtcg 781 tggactcttc tgtggcagga cttggaggct gtccctacgc acagggggca tcaggaaact 841 tggccacaga agacctggtc tacatgctag agggcttggg cattcacacg ggtgtgaatc 901 tccagaagct tctggaagct ggaaacttta tctgtcaagc cctgaacaga aaaactagct 961 ccaaagtggc tcaggctacc tgtaaactct gagccccttg cccacctgaa gccctgggga 1021 tgatgtggaa ataggggcac acacagatga ttcatggatg gggacatgga aatgagaata 1081 ggttaaatgg tgcaggtacc tcatagccag ctctacacag aggtctctcc tggcagaaag 1141 caggcgaagg gcaggaggag ctgcttggca gaaggacctc ctgcccagac ctgaggagtg 1201 agaggctttg agggctgaag tctccctttg ttacggaccc tggcccagga gttgaatgcc 1261 tgaggacgtg tgggaacccc gttccctact tagcatgatc cttgagtctc ctctctggat 1321 ggaatccgcg agctggccac ctggccaccc tctacacggc tccaccctgc catggccgtg 1381 gggcccttgc tctctgactt ctcaggacac aggtcatgga ggttcttccc aagctggcag 1441 aggccatttg tggaaagtgg agagctacgt ggtggccgtc tgccaactcc agcatctctg 1501 gaaaatctcc acgctgaatg tgatttttga aaacagctta tgtaattaaa ggttgaatgg 1561 cacatcat // LOCUS HUMI1R 3590 bp mRNA PRI 05-MAR-1996 DEFINITION Homo sapiens interleukin-1 receptor-associated kinase (IRAK) mRNA, complete cds. ACCESSION L76191 NID g1220312 KEYWORDS interleukin 1 receptor; protein kinase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3590) AUTHORS Cao,Z., Henzel,W.J. and Gao,X. TITLE IRAK: a kinase associated with the interleukin-1 receptor JOURNAL Science 271 (5252), 1128-1131 (1996) MEDLINE 96180673 FEATURES Location/Qualifiers source 1..3590 /organism="Homo sapiens" /db_xref="taxon:9606" 5'UTR 1..79 mRNA 1..3590 CDS 80..2218 /codon_start=1 /product="interleukin-1 receptor-associated kinase" /db_xref="PID:g1220313" /translation="MAGGPGPGEPAAPGAQHFLYEVPPWVMCRFYKVMDALEPADWCQ FAALIVRDQTELRLCERSGQRTASVLWPWINRNARVADLVHILTHLQLLRARDIITAW HPPAPLPSPGTTAPRPSSIPAPAEAEAWSPRKLPSSASTFLSPAFPGSQTHSGPELGL VPSPASLWPPPPSPAPSSTKPGPESSVSLLQGARPSPFCWPLCEISRGTHNFSEELKI GEGGFGCVYRAVMRNTVYAVKRLKENADLEWTAVKQSFLTEVEQLSRFRHPNIVDFAG YCAQNGFYCLVYGFLPNGSLEDRLHCQTQACPPLSWPQRLDILLGTARAIQFLHQDSP SLIHGDIKSSNVLLDERLTPKLGDFGLARFSRFAGSSPSQSSMVARTQTVRGTLAYLP EEYIKTGRLAVDTDTFSFGVVVLETLAGQRAVKTHGARTKYLKDLVEEEAEEAGVALR STQSTLQAGLAADAWAAPIAMQIYKKHLDPRPGPCPPELGLGLGQLACCCLHRRAKRR PPMTQVYERLEKLQAVVAGVPGHLEAASCIPPSPQENSYVSSTGRAHSGAAPWQPLAA PSGASAQAAEQLQRGPNQPVESDESLGGLSAALRSWHLTPSCPLDPAPLREAGCPQGD TAGESSWGSGPGSRPTAVEGLALGSSASSSSEPPQIIINPARQKMVQKLALYEDGALD SLQLLSSSSLPGLGLEQDRQGPEESDEFQS" 3'UTR 2219..3590 BASE COUNT 712 a 1112 c 1129 g 637 t ORIGIN 1 cgcggacccg gccggcccag gcccgcgccc gccgcggccc tgagaggccc cggcaggtcc 61 cggcccggcg gcggcagcca tggccggggg gccgggcccg ggggagcccg cagcccccgg 121 cgcccagcac ttcttgtacg aggtgccgcc ctgggtcatg tgccgcttct acaaagtgat 181 ggacgccctg gagcccgccg actggtgcca gttcgccgcc ctgatcgtgc gcgaccagac 241 cgagctgcgg ctgtgcgagc gctccgggca gcgcacggcc agcgtcctgt ggccctggat 301 caaccgcaac gcccgtgtgg ccgacctcgt gcacatcctc acgcacctgc agctgctccg 361 tgcgcgggac atcatcacag cctggcaccc tcccgccccg cttccgtccc caggcaccac 421 tgccccgagg cccagcagca tccctgcacc cgccgaggcc gaggcctgga gcccccggaa 481 gttgccatcc tcagcctcca ccttcctctc cccagctttt ccaggctccc agacccattc 541 agggcctgag ctcggcctgg ttccaagccc tgcttccctg tggcctccac cgccatctcc 601 agccccttct tctaccaagc caggcccaga gagctcagtg tccctcctgc agggagcccg 661 cccctctccg ttttgctggc ccctctgtga gatttcccgg ggcacccaca acttctcgga 721 ggagctcaag atcggggagg gtggctttgg gtgcgtgtac cgggcggtga tgaggaacac 781 ggtgtatgct gtgaagaggc tgaaggagaa cgctgacctg gagtggactg cagtgaagca 841 gagcttcctg accgaggtgg agcagctgtc caggtttcgt cacccaaaca ttgtggactt 901 tgctggctac tgtgctcaga acggcttcta ctgcctggtg tacggcttcc tgcccaacgg 961 ctccctggag gaccgtctcc actgccagac ccaggcctgc ccacctctct cctggcctca 1021 gcgactggac atccttctgg gtacagcccg ggcaattcag tttctacatc aggacagccc 1081 cagcctcatc catggagaca tcaagagttc caacgtcctt ctggatgaga ggctgacacc 1141 caagctggga gactttggcc tggcccggtt cagccgcttt gccgggtcca gccccagcca 1201 gagcagcatg gtggcccgga cacagacagt gcggggcacc ctggcctacc tgcccgagga 1261 gtacatcaag acgggaaggc tggctgtgga cacggacacc ttcagctttg gggtggtagt 1321 gctagagacc ttggctggtc agagggctgt gaagacgcac ggtgccagga ccaagtatct 1381 gaaagacctg gtggaagagg aggctgagga ggctggagtg gctttgagaa gcacccagag 1441 cacactgcaa gcaggtctgg ctgcagatgc ctgggctgct cccatcgcca tgcagatcta 1501 caagaagcac ctggacccca ggcccgggcc ctgcccacct gagctgggcc tgggcctggg 1561 ccagctggcc tgctgctgcc tgcaccgccg ggccaaaagg aggcctccta tgacccaggt 1621 gtacgagagg ctagagaagc tgcaggcagt ggtggcgggg gtgcccgggc atttggaggc 1681 cgccagctgc atcccccctt ccccgcagga gaactcctac gtgtccagca ctggcagagc 1741 ccacagtggg gctgctccat ggcagcccct ggcagcgcca tcaggagcca gtgcccaggc 1801 agcagagcag ctgcagagag gccccaacca gcccgtggag agtgacgaga gcctaggcgg 1861 cctctctgct gccctgcgct cctggcactt gactccaagc tgccctctgg acccagcacc 1921 cctcagggag gccggctgtc ctcaggggga cacggcagga gaatcgagct gggggagtgg 1981 cccaggatcc cggcccacag ccgtggaagg actggccctt ggcagctctg catcatcgtc 2041 gtcagagcca ccgcagatta tcatcaaccc tgcccgacag aagatggtcc agaagctggc 2101 cctgtacgag gatggggccc tggacagcct gcagctgctg tcgtccagct ccctcccagg 2161 cttgggcctg gaacaggaca ggcaggggcc cgaagaaagt gatgaatttc agagctgatg 2221 tgttcacctg ggcagatccc ccaaatccgg aagtcaaagt tctcatggtc agaagttctc 2281 atggtgcacg agtcctcagc actctgccgg cagtgggggt gggggcccat gcccgcgggg 2341 gagagaagga ggtggccctg ctgttctagg ctctgtgggc ataggcaggc agagtggaac 2401 cctgcctcca tgccagcatc tgggggcaag gaaggctggc atcatccagt gaggaggctg 2461 gcgcatgttg ggaggctgct ggctgcacag acccgtgagg ggaggagagg ggctgctgtg 2521 caggggtgtg gagtagggag ctggctcccc tgagagccat gcagggcgtc tgcagcccag 2581 gcctctggca gcagctcttt gcccatctct ttggacagtg gccaccctgc acaatggggc 2641 cgacgaggcc tagggccctc ctacctgctt acaatttgga aaagtgtggc cgggtgcggt 2701 ggctcacgcc tgtaatccca gcactttggg aggccaaggc aggaggatcg ctggagccca 2761 gtaggtcaag accagccagg gcaacatgat gagaccctgt ctctgccaaa aaatttttta 2821 aactattagc ctggcgtggt agcgcacgcc tgtggtccca gctgctgggg aggctgaagt 2881 aggaggatca tttatgcttg ggaggtcgag gctgcagtga gtcatgattg tatgactgca 2941 ctccagcctg ggtgacagag caagaccctg tttcaaaaag aaaaaccctg ggaaaagtga 3001 agtatggctg taagtctcat ggttcagtcc tagcaagaag cgagaattct gagatcctcc 3061 agaaagtcga gcagcaccca cctccaacct cgggccagtg tcttcaggct ttactgggga 3121 cctgcgagct ggcctaatgt ggtggcctgc aagccaggcc atccctgggc gccacagacg 3181 agctccgagc caggtcaggc ttcggaggcc acaagctcag cctcaggccc aggcactgat 3241 tgtggcagag gggccactac ccaaggtcta gctaggccca agacctagtt acccagacag 3301 tgagaagccc ctggaaggca gaaaagttgg gagcatggca gacagggaag ggaaacattt 3361 tcagggaaaa gacatgtatc acatgtcttc agaagcaagt caggtttcat gtaaccgagt 3421 gtcctcttgc gtgtccaaaa gtagcccagg gctgtagcac aggcttcaca gtgattttgt 3481 gttcagccgt gagtcacact acatgccccc gtgaagctgg gcattggtga cgtccaggtt 3541 gtccttgagt aataaaaacg tatgttccct aaaaaaaaaa aaaggaattc // LOCUS HUMIA1X 2838 bp mRNA PRI 31-DEC-1994 DEFINITION Human zinc-finger DNA-binding motifs (IA-1) mRNA, complete cds. ACCESSION M93119 NID g184510 KEYWORDS . SOURCE Homo sapiens insulinoma cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2838) AUTHORS Goto,Y., De Silva,M.G., Toscani,A., Prabhakar,B.S., Notkins,A.L. and Lan,M.S. TITLE A novel human insulinoma-associated cDNA, IA-1, encodes a protein with 'zinc-finger' DNA-binding motifs JOURNAL J. Biol. Chem. 267 (21), 15252-15257 (1992) MEDLINE 92340582 FEATURES Location/Qualifiers source 1..2838 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="beta islet tumor" /tissue_type="insulinoma" gene 148..1680 /gene="IA-1" CDS 148..1680 /gene="IA-1" /codon_start=1 /db_xref="PID:g184511" /translation="MPRGFLVKRSKKSTPVSYRVRGGEDGDRALLLSPSCGGARAEPP APSPVPGPLPPPPPAERAHAALAAALACAPGPQPPPQGPRAAHFGNPEAAHPAPLYSP TRPVSREHEKHKYFERSFNLGSPVSAESFPTPAALLGGGGGGGASGAGGGGTCGGDPL LFAPAELKMGTAFSAGAEAARGPGPGPPLPPAAALRPPGKRPPPPTAAEPPAKAVKAP GAKKPKAIRKLHFEDEVTTSPVLGLKIKEGPVEAPRGRAGGAARPLGEFICQLCKEEY ADPFALAQHKCSRIVRVEYRCPECAKVFSCPANLASHRRWHKPRPAPAAARAPEPEAA ARAEAREAPGGGSDRDTPSPGGVSESGSEDGLYECHHCAKKFRRQAYLRKHLLAHHQA LQAKGAPLAPPAEDLLALYPGPDEKAPQEAAGDGEGAGVLGLSASAECHLCPVCGESF ASKGAQERHLRLLHAAQVFPCKYCPATFYSSPGLTRHINKCHPSENRQVILLQVPVRP AC" BASE COUNT 492 a 941 c 847 g 558 t ORIGIN 1 gggcgcagag ctgggccgag ccgtcgccgg cgccacgcga gtcccgcagc cgccgcgccc 61 gggcaatggg ccgggggcac tgagggccgc cggggccgag cgcggagggg ggaccgagcc 121 agtgccgtgc cctcgggccg cgccaacatg ccccgcggct tcctggtgaa gcgcagcaag 181 aagtccacgc ccgtttccta ccgggtccgc ggcggcgagg acggcgaccg cgcactgctg 241 ctctcgccca gctgcggggg cgcccgcgcc gagcccccgg cgccgagccc ggtccccggg 301 ccgctgccgc cgccgccgcc cgcggagcgc gcccatgcag cgctcgccgc cgcgcttgcc 361 tgcgcgcctg ggccgcagcc acccccgcag ggcccgcggg ccgcgcactt cggcaacccc 421 gaggctgcgc accccgcgcc gctctacagt cccacgcggc ccgtgagccg cgagcacgag 481 aagcacaagt acttcgaacg cagcttcaac ctgggctcgc cggtctcggc cgagtccttc 541 cccacgcccg ccgcgctgct cggagggggc ggcggcggcg gcgcgagcgg agctggcgga 601 ggcggcacct gcggcggcga cccgctgctc ttcgcgcccg ccgagctcaa gatgggcacg 661 gcgttctcgg ctggcgccga ggcggcccgc ggcccgggcc ccggcccccc actgccccct 721 gccgccgccc tgcggccccc gggaaagcgg cccccgcccc ctaccgccgc ggagccgccc 781 gccaaggcag tcaaggcccc gggcgccaag aagcccaagg ccatccgcaa gctgcacttc 841 gaggacgagg tgaccacgtc gcccgtgctg gggctcaaga tcaaggaggg cccggtggag 901 gcgccgcggg gccgcgcggg gggcgcggcg cggccgctgg gcgagttcat ctgccagctg 961 tgcaaggagg agtacgccga cccgttcgcg ctggcgcagc acaaatgctc gcgcatcgtg 1021 cgtgtggagt accgctgtcc cgagtgcgcc aaggtcttca gctgcccggc caacctggcc 1081 tcgcaccgcc gctggcacaa accgcggccc gcgcccgccg ccgcccgcgc gccggagcca 1141 gaagcagcag ccagggctga ggcgcgggag gcacccggcg gcggcagcga ccgggacacg 1201 ccgagccccg gcggcgtgtc cgagtcgggc tccgaggacg ggctctacga gtgccatcac 1261 tgcgccaaga agttccgccg ccaggcctac ctacgcaagc acctgctggc gcaccaccag 1321 gcgctgcagg ccaagggcgc gccgctagcg cccccggccg aggacctact ggccttgtac 1381 cccgggcccg acgagaaggc gccccaggag gcggccggcg acggcgaggg ggccggcgtg 1441 ctgggcctga gtgcgtccgc cgagtgccac ctgtgcccag tgtgcggaga gtcgttcgcc 1501 agcaagggcg ctcaggagcg ccacctgcgc ctgctgcacg ccgcccaggt gttcccctgc 1561 aagtactgcc cggccacctt ctacagctcg cccggcctta cgcggcacat caacaagtgc 1621 cacccatccg aaaacagaca ggtgatcctc ctgcaggtgc ccgtgcgccc ggcctgctag 1681 agcgcgccct ccaccccggc ccccgaactg tgccttcgct tggagaccca caaagagagt 1741 gcgccctgca cgccccgaac ccgagtccgc gctgggggag cctcgccccc gcccccaccg 1801 ggtgagagtg tcgtctccgc ttctctcggt gtggcgtgac ggtaacccca tactctcctt 1861 ttgactcctt ttggaacccc cacttttacg ttgtgtccct ccgcctcccc catggcgcaa 1921 caggagtcag tctctttctg tacaagggag aaaagctgta cgcgtttgtc tcgtggttgg 1981 aagcctcccc ttggcgggga gaagcttttt ttcttgctag tattcgctgt gttcatggtc 2041 tagaaatgcg gtctggtctc gcctcgccta ccaatctctg ctctctatgt atgtagcgta 2101 cgggttgttt tgggtgaatc ttgaggaata aatgccttta tatttcacag gctgtaaatt 2161 gaacttccca cacgattagc tttattatgg cttgtgaact gctggagtct ggctttacct 2221 ttttgtatgt gaacaaatca aattgcttaa aaaagagttt tctttagtat agccacaaat 2281 gccttgaact gttgtctggg attgttttgt ggggggaggg aagggagtgt tccgaagatg 2341 ctgtagtaac tgcctcagtg tttcacgtaa gactttttgg tttgatcatc tttgttgagg 2401 taggactatc agttccctct aaatgtatat gttgatttat gagtaattgt tatttattct 2461 ttatttattt atattaatta tgaagattat gatattattt gattgcagat ttttttggcg 2521 cgctgccccc tccccaccct gccactcttg acattccact gtgcgtttta gaagagagcc 2581 tttttctaaa gggatctgct taaagtttta acttttatac ctatctgagt gaattacaga 2641 caacctatca tttattctgc ttcgagggtc cccagggccc ttgtacaacc gacagctctt 2701 acttttaaat gcaatctctt ttctacatac attattttct taattgttag ctatttatag 2761 aaagcttcaa tagaactgtt tcaactgtat aactatttac tattcaaata aaatattttc 2821 aaagtcaaaa aaaaaaaa // LOCUS HUMIAS 3588 bp mRNA PRI 20-FEB-1996 DEFINITION Human mRNA for integrin alpha subunit, complete cds. ACCESSION D25303 NID g464180 KEYWORDS integrin alpha subunit; integrin alpha-RLC. SOURCE Homo sapiens fetal and adult cDNA to mRNA, clones FL1, L5F1 and F1F4. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3588) AUTHORS Hibi,K., Yamakawa,K., Ueda,R., Horio,Y., Murata,Y., Tamari,M., Uchida,K., Takahashi,T., Nakamura,Y. and Takahashi,T. TITLE Aberrant upregulation of a novel integrin alpha subunit gene at 3p21.3 in small cell lung cancer JOURNAL Oncogene 9 (2), 611-619 (1994) MEDLINE 94119603 REFERENCE 2 (bases 1 to 3588) AUTHORS Takahashi,T. TITLE Direct Submission JOURNAL Submitted (18-NOV-1993) to the DDBJ/EMBL/GenBank databases. Takashi Takahashi, Aichi Cancer Center Research Institute, Lab of Ultrastructure Research; 1-1 Kanokoden, Chikusaku, Nagoya 464, Japan (E-mail:j45960u@nucc.cc.nagoya-u.ac.jp, Tel:052-762-6111, Fax:052-763-5233) COMMENT Submitted (18-Nov-1993) to DDBJ by: Takashi Takahashi Aichi Cancer Center Research Inst. 1-1 Kanokoden, Chikusa-ku Nagoya 464 Japan Phone: 052-762-6111 Fax: 052-763-5233. FEATURES Location/Qualifiers source 1..3588 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal and adult" /tissue_type="lung" CDS 52..3159 /codon_start=1 /product="integrin alpha subunit" /db_xref="PID:d1005525" /db_xref="PID:g533327" /translation="MGGPAAPRGAGRLRALLLALVVAGIPAGAYNLDPQRPVHFQGPA DSFFGYAVLEHFHDNTRWVLVGAPKADSKYSPSVKSPGAVFKCRVHTNPDRRCTELDM ARGKNRGTSCGKTCREDRDDEWMGVSLARQPKADGRVLACAHRWKNIYYEADHILPHG FCYIIPSNLQAKGRTLIPCYEEYKKKYGEEHGSCQAGIAGFFTEELVVMGAPGSFYWA GTIKVLNLTDNTYLKLNDEVIMNRRYTYLGYAVTAGHFSHPSTIDVVGGAPQDKGIGK VYIFRADRRSGTLIKIFQASGKKMGSYFGSSLCAVDLNGDGLSDLLVGAPMFSEIRDE GQVTVYINRGNGALEEQLALTGDGAYNAHFGESIASLDDLDNDGFPDVAIGAPKEDDF AGAVYIYHGDAGGIVPQYSMKLSGQKINPVLRMFGQSISGGIDMDGNGYPDVTVGAFM SDSVVLLRARPVITVDVSIFLPGSINITAPQCHDGQQPVNCLNVTTCFSFHGKHVPEE IGLNYVLMADVAKKEKGQMPRVYFVLLGETMGQVTEKLQLTYMEETCRHYVAHVKRRV QDVISPIVFEAAYSLSEHVTGEEERELPPLTPVLRWKKGQKIAQKNQTVFERNCRSED CAADLQLQGKLLLSSMDEKTLYLALGAVKNISLNISISNLGDDAYDANVSFNVSRELF FINMWQKEEMGISCELLESDFLKCSVGFPFMRSKSKYEFSVIFDTSHLSGEEEVLSFI VTAQSGNTERSESLHDNTLVLMVPLMHEVDTSITGIMSPTSFVYGESVDAANFIQLDD LECHFQPINITLQVYNTGPSTLPGSSVSISFPNRLSSGGAEMFHVQEMVVGQEKGNCS FQKNPTPCIIPQEQENIFHTIFAFFTKSGRKVLDCEKPGISCLTAHCNFSALAKEESR TIDIYMLLNTEILKKDSSSVIQFMSRAKVKVDPALRVVEIAHGNPEEVTVVFEALHNL EPRGYVVGWIIAISLLVGILIFLLLAVLLWKMGFFRRRYKEIIEAEKNRKENEDSWDW VQKNQ" BASE COUNT 871 a 915 c 964 g 838 t ORIGIN Chromosome 3p21.3. 1 ccgcgctcgg cgccctgctc gccgggcaga ggggaaggcg gccggctggg gatgggcggc 61 ccggctgcgc cgaggggcgc cgggaggctc cgcgcgctgc tgctggcgct ggtggtcgcg 121 gggatccccg cgggcgccta caacctcgac ccgcagcgcc ccgtgcactt ccagggcccc 181 gctgactcgt tcttcggcta cgcagttctg gagcatttcc acgacaacac gcgctgggtc 241 cttgtgggcg caccaaaggc agattccaaa tacagccctt cagtgaagtc tcctggggct 301 gtgtttaagt gccgtgttca caccaaccct gaccggagat gcaccgaact ggacatggct 361 cgagggaaga atcggggcac gtcctgcgga aagacctgcc gggaagaccg cgatgatgag 421 tggatggggg tgagcctggc ccgacagccc aaggctgatg gccgtgtgtt ggcctgtgct 481 catcgctgga agaacatcta ctatgaagcc gaccacatcc taccccatgg cttctgctac 541 atcatcccct ccaacctcca ggccaaaggc aggacactga tcccttgcta tgaagagtat 601 aagaagaagt acggagagga acacggctcc tgccaggctg ggatagcggg cttcttcact 661 gaggagctgg tggtgatggg tgctccaggg tcattttatt gggctggaac catcaaagtg 721 ctgaacctta cggacaacac ctatttaaaa ctgaacgacg aagtgatcat gaacaggcgg 781 tacacctacc tgggctacgc agtgaccgct ggccacttct ctcacccgtc caccattgat 841 gtggtaggag gtgccccaca ggacaaaggc atcggcaagg tttatatttt cagagctgac 901 cgaagatcag gcaccttaat taagatcttt caagcatcag gtaaaaagat gggctcttac 961 ttcggctcct ccttgtgcgc agttgacctg aatggggacg gcctctctga cctgctggtg 1021 ggggccccca tgttttctga gatcagggat gagggacagg tcactgtcta catcaacaga 1081 ggaaatggag ccctcgagga gcagctggct ctgactgggg atggtgccta caatgcgcac 1141 tttggagaga gcattgccag cctggacgat ctggacaatg atgggttccc agatgtggcc 1201 attggtgcac ccaaggagga tgacttcgca ggggcggtct atatctatca tggtgatgcc 1261 ggtgggatag tccctcagta ctcaatgaaa ctgtctgggc agaagataaa tccagtgctc 1321 cggatgtttg gtcagtccat atcgggaggc attgatatgg atggaaatgg ctatcctgat 1381 gtcactgttg gagccttcat gtccgacagc gtggttcttc tcagagcaag gcctgtcatt 1441 acggtggatg tctccatctt cctcccgggc tccatcaaca tcacagcgcc tcagtgtcac 1501 gacggacagc agcctgtgaa ctgcctgaac gtcaccacct gcttcagctt ccatggcaaa 1561 cacgttccag aagagattgg cctgaattat gttctgatgg ctgacgtggc caaaaaggag 1621 aagggccaga tgcccagggt ctactttgtg ctgctgggag agaccatggg tcaggtcaca 1681 gagaagctgc agctgactta catggaggag acgtgtcgtc actatgtggc ccatgtgaag 1741 cggagggtgc aggacgtcat cagcccgatc gtgtttgaag cagcctacag cctcagtgag 1801 catgtgactg gagaggagga gagggaactg ccgcctctga caccagttct ccgctggaaa 1861 aagggacaaa agattgccca aaagaatcag actgtttttg aaaggaattg ccgttcagag 1921 gactgtgccg cagacctgca gcttcagggt aaactgctgc tctccagtat ggatgagaaa 1981 accctgtatc tagctttggg ggctgtgaag aacatctccc taaacatctc tatctccaac 2041 ctcggagatg atgcctatga tgccaacgtg tccttcaatg tttcccggga gctcttcttc 2101 atcaacatgt ggcagaagga ggagatgggc atctcctgtg agctgctgga atcggacttc 2161 ctcaaatgca gcgtgggatt tcctttcatg aggtcaaagt caaagtatga attcagcgtg 2221 atctttgata caagccacct gtctggggaa gaggaagttc tcagcttcat tgttactgct 2281 cagagtggca acacggagcg ctctgaatcc ctgcatgaca acaccctcgt gctgatggtg 2341 ccactgatgc acgaggtgga cacgtccatc accggaatca tgtctccaac ctcctttgta 2401 tatggcgagt ccgtggacgc agccaacttc attcagctgg atgacctgga gtgtcacttt 2461 cagcccatca atatcaccct tcaggtctac aacactggcc caagcaccct tccagggtca 2521 tctgtcagca tctctttccc taatcgactc tcatctggtg gtgcagagat gtttcatgtc 2581 caggaaatgg tggtgggcca agagaaggga aactgctctt tccagaaaaa cccaactccc 2641 tgcatcatcc ctcaagaaca agaaaatatc ttccacacaa tatttgcttt tttcacaaag 2701 tctggaagaa aagtcttgga ctgtgaaaaa ccaggaattt cttgcctaac agcacactgt 2761 aactttagtg ctcttgctaa agaagaaagt cgtactatag acatttacat gctgctgaac 2821 acagaaatac tgaaaaagga cagttcgtct gtcatccagt tcatgtcccg cgccaaggtg 2881 aaggtggatc ctgccctaag ggtggtggaa atagctcatg ggaacccaga agaggtgacg 2941 gtggtcttcg aggccctgca caatctggag ccccgtggct acgtcgtggg gtggatcatc 3001 gccatcagtt tgttggtggg aatcctcatc ttcctgctgc tggccgtgct gctctggaag 3061 atgggcttct ttcgccgaag gtacaaagaa attatcgaag ctgagaagaa ccggaaagag 3121 aatgaagaca gttgggactg ggtccagaaa aaccagtgag ctgccacacc agtcacatga 3181 cctgatcact agcctgtcat ccttggtctt tgtatcttcc atatttggaa aaaaaaaatc 3241 ttctccagat ttttcggagg ccccactgat gctgttctct tcttcattct atcaagccca 3301 ggtgccagcc tgaggcagcc acttcggcca ggtcacacga ccgggggcag caccacttcg 3361 ctttaaagac tctgaacttt ggagagtgac agagccgagc aatatttagg atgcaacacg 3421 catggtcacc ctcaggggaa aactgttaaa gtatttttat aaatataagc cttttatact 3481 gattattctt ttatatttgt atcgatatta tttctattaa atagttataa ttcactcaag 3541 cactgattct ggcctaaaat cttggaagtc catgaataca aattttaa // LOCUS HUMIBSUB 3110 bp mRNA PRI 08-NOV-1994 DEFINITION Human integrin beta-5 subunit mRNA, complete cds. ACCESSION M35011 NID g184524 KEYWORDS integrin. SOURCE Human placenta, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3110) AUTHORS Suzuki,S., Huang,Z.S. and Tanihara,H. TITLE Cloning of an integrin beta subunit exhibiting high homology with integrin beta 3 subunit JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87 (14), 5354-5358 (1990) MEDLINE 90319111 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Suzuki, 05-JUN-1990. FEATURES Location/Qualifiers source 1..3110 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Unassigned" gene 30..2420 /gene="ITGB5" CDS 30..2420 /gene="ITGB5" /note="integrin beta-5 subunit" /codon_start=1 /db_xref="GDB:G00-128-005" /db_xref="PID:g306894" /translation="MPRAPAPLYACLLGLCALLPRLAGLNICTSGSATSCEECLLIHP KCAWCSKEDFGSPRSITSRCDLRANLVKNGCGGEIESPASSFHVLRSLPLSSKGSGSA GWDVIQMTPQEIAVNLRPGDKTTFQLQVRQVEDYPVDLYYLMDLSLSMKDDLDNIRSL GTKLAEEMRKLTSNFRLGFGSFVDKDISPFSYAAPRYQTNPCIGYKLFPNCVPSFGFR HLLPLTDRVDSFNEEVRKQRVSRNRDAPEGGFDAVLQAAVCKEKIGWRKDALHLLVFT TDDVPHIALDGKLGGLVQPHDGQCHLNEANEYTASNQMDYPSLALLGEKLAENNINLI FAVTKNHYMLYKNFTALIPGTTVEILDGDSKNIIQLIINAYNSIRSKVELSVWDQPED LNLFFTATCQDGVSYPGQRKCEGLKIGDTASFEVSLEARSCPSRHTEHVFALRPVGFR DSLEVGVTYNCTCGCSVGLEPNSARCNGSGTYVCGLCECSPGYLGTRCECQDGENQSV YQNLCREAEGKPLCSGRGDCSCNQCSCFESEFGKIYGPFCECDNFSCARNKGVLCSGH GECHCGECKCHAGYIGDNCNCSTDISTCRGRDGQICSERGHCLCGQCQCTEPGAFGEM CEKCPTCPDACSTKRDCVECLLLHSGKPDNQTCHSLCRDEVITWVDTIVKDDQEAVLC FYKTAKDCVMMFTYVELPSGKSNLTVLREPECGNTPNAMTILLAVVGSILLVGLALLA IWKLLVTIHDRREFAKFQSERSRARYEMASNPLYRKPISTHTVDFTFNKSYNGTVD" BASE COUNT 726 a 809 c 887 g 688 t ORIGIN 1 cgcgccgccg ctgagggagg cgccccacca tgccgcgggc cccggcgccg ctgtacgcct 61 gcctcctggg gctctgcgcg ctcctgcccc ggctcgcagg tctcaacata tgcactagtg 121 gaagtgccac ctcatgtgaa gaatgtctgc taatccaccc aaaatgtgcc tggtgctcca 181 aagaggactt cggaagccca cggtccatca cctctcggtg tgatctgagg gcaaaccttg 241 tcaaaaatgg ctgtggaggt gagatagaga gcccagccag cagcttccat gtcctgagga 301 gcctgcccct cagcagcaag ggttcgggct ctgcaggctg ggacgtcatt cagatgacac 361 cacaggagat tgccgtgaac ctccggcccg gtgacaagac caccttccag ctacaggttc 421 gccaggtgga ggactatcct gtggacctgt actacctgat ggacctctcc ctgtccatga 481 aggatgactt ggacaatatc cggagcctgg gcaccaaact cgcggaggag atgaggaagc 541 tcaccagcaa cttccggttg ggatttgggt cttttgttga taaggacatc tctcctttct 601 cctacgcggc accgaggtac cagaccaatc cgtgcattgg ttacaagttg tttccaaatt 661 gcgtcccctc ctttgggttc cgccatctgc tgcctctcac agacagagtg gacagcttca 721 atgaggaagt tcggaaacag agggtgtccc ggaaccgaga tgcccctgag gggggctttg 781 atgcagtact ccaggcagcc gtctgcaagg agaagattgg ctggcgaaag gatgcactgc 841 atttgctggt gttcacaaca gatgatgtgc cccacatcgc attggatgga aaattgggag 901 gcctggtgca gccacacgat ggccagtgcc acctgaacga ggccaacgag tacactgcat 961 ccaaccagat ggactatcca tcccttgcct tgcttggaga gaaattggca gagaacaaca 1021 tcaacctcat ctttgcagtg acaaaaaacc attatatgct gtacaagaat tttacagccc 1081 tgatacctgg aacaacggtg gagattttag atggagactc caaaaatatt attcaactga 1141 ttattaatgc atacaatagt atccggtcta aagtggagtt gtcagtctgg gatcagcctg 1201 aggatcttaa tctcttcttt actgctacct gccaagatgg ggtatcctat cctggtcaga 1261 ggaagtgtga gggtctgaag attggggaca cggcatcttt tgaagtatca ttggaggccc 1321 gaagctgtcc cagcagacac acggagcatg tgtttgccct gcggccggtg ggattccggg 1381 acagcctgga ggtgggggtc acctacaact gcacgtgcgg ctgcagcgtg gggctggaac 1441 ccaacagcgc caggtgcaac gggagcggga cctatgtctg cggcctgtgt gagtgcagcc 1501 ccggctacct gggcaccagg tgcgagtgcc aggatgggga gaaccagagc gtgtaccaga 1561 acctgtgccg ggaggcagag ggcaagccac tgtgcagcgg gcgtggggac tgcagctgca 1621 accagtgctc ctgcttcgag agcgagtttg gcaagatcta tgggcctttc tgtgagtgcg 1681 acaacttctc ctgtgccagg aacaagggag tcctctgctc aggccatggc gagtgtcact 1741 gcggggaatg caagtgccat gcaggttaca tcggggacaa ctgtaactgc tcgacagaca 1801 tcagcacatg ccggggcaga gatggccaga tctgcagcga gcgtgggcac tgtctctgtg 1861 ggcagtgcca atgcacggag ccgggggcct ttggggagat gtgtgagaag tgccccacct 1921 gcccggatgc atgcagcacc aagagagatt gcgtcgagtg cctgctgctc cactctggga 1981 aacctgacaa ccagacctgc cacagcctat gcagggatga ggtgatcaca tgggtggaca 2041 ccatcgtgaa agatgaccag gaggctgtgc tatgtttcta caaaaccgcc aaggactgcg 2101 tcatgatgtt cacctatgtg gagctcccca gtgggaagtc caacctgacc gtcctcaggg 2161 agccagagtg tggaaacacc cccaacgcca tgaccatcct cctggctgtg gtcggtagca 2221 tcctccttgt tgggcttgca ctcctggcta tctggaagct gcttgtcacc atccacgacc 2281 ggagggagtt tgcaaagttt cagagcgagc gatccagggc ccgctatgaa atggcttcaa 2341 atccattata cagaaagcct atctccacgc acactgtgga cttcaccttc aacaaatcct 2401 acaatggcac tgtggactga tgtttccttc tccgaggggc tggagcgggg atctgatgaa 2461 aaggatcaga ctgaaacgcc ttgcacggct gctcggcttg atcacagctc cctaggtagg 2521 caccacagag aagaccttct agtgagcctg ggccaggagc ccacagtgcc tgtacaggaa 2581 ggtgcctggc catgtcacct ggctgctagg ccagagccat gccaggctgc gtccctccga 2641 gcttgggata aagcaagggg accttggcgc tctcagcttt ccctgccaca tccagcttgt 2701 tgtcccaatg aaatactgag atgctgggct gtctctccct tccaggaatg ctgggccccc 2761 agcctggcca gacaagaaga ctgtcaggaa gggtcggagt ctgtaaaacc agcatacagt 2821 ttggcttttt tcacattgat catttttata tgaaataaaa agatcctgca tttatggtgt 2881 agttctgagt cctgagactt ttctgcgtga tggctatgcc ttgcacacag gtgttggtga 2941 tggggctgtt gagatgcctg ttgaaggtac atcgtttgca aatgtgagtt tcctctcctg 3001 tccgtgtttg tttagtactt ttataatgaa aagaaacaag attgtttggg attggaagta 3061 aagattaaaa ccaaaagaat ttgtgtttgt ctgataaaaa aaaaaaaaaa // LOCUS HUMICOA 1881 bp mRNA PRI 08-NOV-1994 DEFINITION Human isovaleryl-coA dehydrogenase (IVD) mRNA, complete cds. ACCESSION M34192 NID g184538 KEYWORDS isovaleryl-CoA dehydrogenase. SOURCE Human placenta, cDNA to mRNA, (library of Clontech). ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1881) AUTHORS Matsubara,Y., Ito,M., Glassberg,R., Satyabhama,S., Ikeda,Y. and Tanaka,K. TITLE Nucleotide sequence of messenger RNA encoding human isovaleryl-coenzyme A dehydrogenase and its expression in isovaleric acidemia fibroblasts JOURNAL J. Clin. Invest. 85 (4), 1058-1064 (1990) MEDLINE 90203210 FEATURES Location/Qualifiers source 1..1881 /organism="Homo sapiens" /db_xref="taxon:9606" /map="15q14-q15" mRNA <1..1881 /gene="IVD" /note="isovaleryl-coA dehydrogenase mRNA; G00-119-354" gene 1..1881 /gene="IVD" CDS 16..1287 /gene="IVD" /note="isovaleryl-coA dehydrogenase (IVD)" /codon_start=1 /db_xref="GDB:G00-119-354" /db_xref="PID:g306897" /translation="MATATRLLGWRVASWRLRPPLAGFVSQRAHSLLPVDDAINGLSE EQRQLRQTMAKFLQEHLAPKAQEIDRSNEFKNLREFWKQLGNLGVLGITAPVQYGGSG LGYLEHVLVMEEISRASGAVGLSYGAHSNLCINQLVRNGNEAQKEKYLPKLISGEYIG ALAMSEPNAGSDVVSMKLKAEKKGNHYILNGNKFWITNGPDADVLIVYAKTDLAAVPA SRGITAFIVEKGMPGFSTSKKLDKLGMRGSNTCELIFEDCKIPAANILGHENKGVYVL MSGLDLERLVLAGGPLGLMQAVLDHTIPYLHVREAFGQKIGHFQLMQGKMADMYTRLM ACRQYVYNVAKACDEGHCTAKDCAGVILYSAECATQVALDGIQCFGGNGYINDFPMGR FLRDAKLYEIGAGTSEVRRLVIGRAFNADFH" BASE COUNT 396 a 497 c 554 g 434 t ORIGIN 1 tcgtgcatgg cagagatggc gactgcgact cggctgctgg ggtggcgtgt ggcgagctgg 61 aggctgcggc cgccgcttgc cggcttcgtt tcccagcggg cccactcgct tttgcccgtg 121 gacgatgcaa tcaatgggct aagcgaggag cagaggcagc ttcgtcagac catggctaag 181 ttccttcagg agcacctggc ccccaaggcc caggagatcg atcgcagcaa tgagttcaag 241 aacctgcgag aattttggaa gcagctgggg aacctgggcg tattgggcat cacagcccct 301 gttcagtatg gcggctccgg cctgggctac ctggagcatg tgctggtgat ggaggagata 361 tcccgagctt ccggagcagt ggggctcagt tacggtgccc actccaacct ctgcatcaac 421 cagcttgtac gcaatgggaa tgaggcccag aaagagaagt atctcccgaa gctgatcagt 481 ggtgagtaca tcggagccct ggccatgagt gagcccaatg caggctctga tgttgtctct 541 atgaagctca aagcggaaaa gaaaggaaat cactacatcc tgaatggcaa caagttctgg 601 atcactaatg gccctgatgc tgacgtcctg attgtctatg ccaagacaga tctggctgct 661 gtgccagctt ctcggggcat cacagccttc attgtggaga agggtatgcc tggctttagc 721 acctctaaga agctggacaa gctggggatg aggggctcta acacctgtga gctaatcttt 781 gaagactgca agattcctgc tgccaacatc ctgggccatg agaataaggg tgtctacgtg 841 ctgatgagtg ggctggacct ggagcggctg gtgctggccg gggggcctct tgggctcatg 901 caagcggtcc tggaccacac cattccctac ctgcacgtga gggaagcctt tggccagaag 961 atcggccact tccagttgat gcaggggaag atggctgaca tgtacacccg cctcatggcg 1021 tgtcggcagt atgtctacaa tgtcgccaag gcctgcgatg agggccattg cactgctaag 1081 gactgtgcag gtgtgattct ttactcagct gagtgtgcta cacaggtagc cctggacggc 1141 attcagtgtt ttggtggcaa tggctacatc aatgactttc ccatgggccg ctttcttcga 1201 gatgccaagc tgtatgagat aggggctggg accagcgagg tgaggcggct ggtcatcggc 1261 agagccttca atgcagactt tcactagtcc tgagaccctt cgcccccttt tcctgcacct 1321 agtggccttt cttgggaagt agagatgtgg cggctttccc accctgccca cagcaggccc 1381 tcctgcccag ctgctcttgt cagccctctg gcctctggat gaggttgagt tctccacaac 1441 agctcccaag catcatgggc ctcgcagccg ggcctgtgcc acggctagtg ttgtgtgatt 1501 taaaatggac tcagcaggaa gcatattgtc tggggattgt tgggacaggt tttggtgact 1561 ctgtgccctt gctctctaac ttctgagccc acctcccagg gtaggcacct gggggcatgc 1621 aggtgcccac ctcccagggt aggcacctgg gggcatgcag gtacccacct ctttctcttg 1681 ggtgaggctc tggcaaggag atctctctgc tcaagcacag cagaatcatg gcccctctcc 1741 atgaattgga acttggtaca ggttaagtat ccctaatcct gaaatctgaa acacttgtgg 1801 ttccaagcat tttggataag gcaaattcaa ctttcagtct cttttctggg ggaaaaaaat 1861 aataaaccta gcctagccag g // LOCUS HUMID2B 1167 bp mRNA PRI 18-MAR-1994 DEFINITION Human striated muscle contraction regulatory protein (Id2B) mRNA, complete cds. ACCESSION M96843 NID g397775 KEYWORDS DNA-binding inihibitory protein; calcium-binding protein; contractile protein; helix-loop-helix protein; striated muscle contraction regulatory protein. SOURCE Homo sapiens (human). ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1167) AUTHORS Kurabayashi,M., Jeyaseelan,R. and Kedes,L. TITLE Two distinct cDNA sequences encoding the human helix-loop-helix protein Id2 JOURNAL Gene 133 (2), 305-306 (1993) MEDLINE 94040830 FEATURES Location/Qualifiers source 1..1167 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="heart" gene 110..220 /gene="Id2B" CDS 110..220 /gene="Id2B" /codon_start=1 /function="regulates striated muscle contraction" /product="contractile protein" /db_xref="PID:g397776" /translation="MKAFSPVRSIRKNSLLDHRLGISQSKTPVDDLMSLL" BASE COUNT 338 a 268 c 234 g 327 t ORIGIN 1 attctgagcc aagtccggtg ccaagcgcag ctagctcagc aggccgcagc ggtggcctga 61 gcttcaggac agccagctcc ctcccggtct cgccttcctc gcggtcagca tgaaagcctt 121 cagtcccgtg aggtccatta ggaaaaacag cctgttggac caccgcctgg gcatctccca 181 gagcaaaacc ccggtggatg acctgatgag cctgctgtaa aacatgaatg actgctactc 241 caagctcaag gagctggtgc ccagcatccc ccagaacaag aagtggagca agatggaaat 301 cctgcagcac gtcatcgact acatcttgga cctgcagatc accctggact tgcatcccac 361 tattgtcagc ctgcatcacc agagacccgg gcagaaccag gcgtccagga cgccgctgac 421 caccctcaac acggatatca gcatcctgtc cttgcaggct tctgaattcc cttcggagtt 481 aatgtcaaat gacaggaaag cactgtgtgg ctgaataatc atgacttctt ttttttcttt 541 gcacaacaac gacaacaaca aattcacaga accttttcag cgctgaactt atttttcaac 601 catttcacaa ggaggcagtt gcatgacttt taaaagcaaa aaaggagaaa ctagatgaaa 661 aagactttta aatgcccttt ctgcagttgg aaggtttttt tcatatacta ttcccaccat 721 ggggagcaga aacgttaaaa tcacaaggaa ttgcccaatc taagcagact ttgccttttt 781 tcaaaggtgg agtgtcaata ccagaaggat ccagtatcag tctcttaaat gaagtctttt 841 cggtcagaaa ttaccttttt gacacaagcc tactgaatgc cgtgtatata tttatatata 901 aatatatctt atttaagtga aaccttgtta actctttaat tagagttttc ttgtatagtg 961 gcagagatgt ctatttctgc attaaaaagt gtaatgatgt acttattcat gatgaacttt 1021 ttataaaagt ttaaactttt agttgtaaac gtaacccttt tatacaaaat aaatcagtgt 1081 gtttattgaa tggtgattgc ctcgtttatt tcagaggacc agtgctttga ttttttatta 1141 tgctatgtta taactgaacc aaataaa // LOCUS HUMIDB 498 bp mRNA PRI 11-JUN-1993 DEFINITION Human insulinoma rig-analog mRNA encoding DNA-binding protein, complete cds. ACCESSION J02984 NID g184553 KEYWORDS . SOURCE Human insulinoma (surgically removed pancreas), cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 498) AUTHORS Inoue,C., Shiga,K., Takasawa,S., Kitagawa,M., Yamamoto,H. and Okamoto,H. TITLE Evolutionary conservation of the insulinoma gene rig and its possible function JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 6659-6662 (1987) MEDLINE 88016150 COMMENT Draft entry and computer-readable sequence [1] kindly submitted by K.Shiga, 18-DEC-1988. FEATURES Location/Qualifiers source 1..498 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..498 /note="rig-analog mRNA" CDS 30..467 /note="rig-analog protein (putative); putative" /codon_start=1 /db_xref="PID:g306898" /translation="MAEVEQKKKRTFRKFTYRGVDLDQLLDMSYEQLMQLYSARQRRR LNRGLRRKQHSLLKRLRKAKKEAPPMEKPEVVKTHLRDMIILPEMVGSMVGVYNGKTF NQVEIKPEMIGHYLGEFSITYKPVKHGRPGIGATHSSRFIPLK" BASE COUNT 118 a 154 c 151 g 75 t ORIGIN 101 bp upstream of PvuII site. 1 aaagcgatct cttctgagga tccggcaaga tggcagaagt agagcagaag aagaagcgga 61 ccttccgcaa gttcacctac cgcggcgtgg acctcgacca gctgctggac atgtcctacg 121 agcagctgat gcagctgtac agtgcgcgcc agcggcggcg gctgaaccgg ggcctgcggc 181 ggaagcagca ctccctgctg aagcgcctgc gcaaggccaa gaaggaggcg ccgcccatgg 241 agaagccgga agtggtgaag acgcacctgc gggacatgat catcctaccc gagatggtgg 301 gcagcatggt gggcgtctac aacggcaaga ccttcaacca ggtggagatc aagcccgaga 361 tgatcggcca ctacctgggc gagttctcca tcacctacaa gcccgtaaag catggccggc 421 ccggcatcgg ggccacccac tcctcccgct tcatccctct caagtaatgg ctcagctaat 481 aaaggcgcac atgactcc // LOCUS HUMIDE 3337 bp mRNA PRI 08-NOV-1994 DEFINITION Human insulin-degrading enzyme (IDE) mRNA, complete cds. ACCESSION M21188 NID g184555 KEYWORDS insulin-degrading enzyme; proteinase. SOURCE Homo sapiens hepatoma cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3337) AUTHORS Affholter,J.A., Fried,V.A. and Roth,R.A. TITLE Human insulin-degrading enzyme shares structural and functional homologies with E. coli protease III JOURNAL Science 242 (4884), 1415-1418 (1988) MEDLINE 89072709 REFERENCE 2 (sites) AUTHORS Affholter,J.A., Hsieh,C.L., Francke,U. and Roth,R.A. TITLE Insulin-degrading enzyme: stable expression of the human complementary DNA, characterization of its protein product, and chromosomal mapping of the human and mouse genes JOURNAL Mol. Endocrinol. 4 (8), 1125-1135 (1990) MEDLINE 91155945 COMMENT [2] revises [1]. Computer readable copy of sequence [1] kindly provided by R.A.Roth, 02-NOV-1988. FEATURES Location/Qualifiers source 1..3337 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lymphocyte" /tissue_type="hepatoma" /map="10" mRNA <1..3279 /gene="IDE" /note="G00-118-817" gene 1..3279 /gene="IDE" CDS 58..3117 /gene="IDE" /codon_start=1 /db_xref="GDB:G00-118-817" /product="insulin-degrading enzyme" /db_xref="PID:g184556" /translation="MRYRLAWLLHPALPSTFRSVLGARLPPPERLCGFQKKTYSKMNN PAIKRIGNHITKSPEDKREYRGLELANGIKVLLMSDPTTDKSSAALDVHIGSLSDPPN IAGLSHFCEHMLFLGTKKYPKENEYSQFLSEHAGSSNAFTSGEHTNYYFDVSHEHLEG ALDRFAQFFLCPLFDESCKDREVNAVDSEHEKNVMNDAWRLFQLEKATGNPKHPFSKF GTGNKYTLETRPNQEGIDVRQELLKFHSAYYSSNLMAVCVLGRESLDDLTNLVVKLFS EVENKNVPLPEFPEHPFQEEHLKQLYKIVPIKDIRNLYVTFPIPDLQKYYKSNPGHYL GHLIGHEGPGSLLSELKSKGWVNTLVGGQKEGARGFMFFIINVDLTEEGLLHVEDIIL HMFQYIQKLRAEGPQEWVFQECKDLNAVAFRFKDKERPRGYTSKIAGILHYYPLEEVL TAEYLLEEFRPDLIEMVLDKLRPENVRVAIVSKSFEGKTDRTEEWYGTQYKQEAIPDE VIKKWQNADLNGKFKLPTKNEFIPTNFEILPLEKEATPYPALIKDTVMSKLWFKQDDK KKKPKACLNFEFFSPFAYVDPLHCNMAYLYLELLKDSLNEYAYAAELAGLSYDLQNTI YGMYLSVKGYNDKQPILLKKIIEKMATFEIDEKRFEIIKEAYMRSLNNFRAEQPHQHA MYYLRLLMTEVAWTKDELKEALDDVTLPRLKAFIPQLLSRLHIEALLHGNITKQAALG IMQMVEDTLIEHAHTKPLLPSQLVRYREVQLPDRGWFVYQQRNEVHNNCGIEIYYQTD MQSTSENMFLELFCQIISEPCFNTLRTKEQLGYIVFSGPRRANGIQSLRFIIQSEKPP HYLESRVEAFLITMEKSIEDMTEEAFQKHIQALAIRRLDKPKKLSAECAKYWGEIISQ QYNFDRDNTEVAYLKTLTKEDIIKFYKEMLAVDAPRRHKVSVHVLAREMDSCPVVGEF PCQNDINLSQAPALPQPEVIQNMTEFKRGLPLFPLVKPHINFMAAKL" BASE COUNT 1060 a 694 c 694 g 889 t ORIGIN Chromosome 10./kastern. 1 ccggctcgaa gcgcaacgag gaagcgtttg cggtgatccc ggcgactgcg ctggctaatg 61 cggtaccggc tagcgtggct tctgcacccc gcactgccca gcaccttccg ctcagtcctc 121 ggcgcccgcc tgccgcctcc ggagcgcctg tgtggtttcc aaaaaaagac ttacagcaaa 181 atgaataatc cagccatcaa gagaatagga aatcacatta ccaagtctcc tgaagacaag 241 cgagaatatc gagggctaga gctggccaat ggtatcaaag tacttcttat gagtgatccc 301 accacggata agtcatcagc agcacttgat gtgcacatag gttcattgtc ggatcctcca 361 aatattgctg gcttaagtca tttttgtgaa catatgcttt ttttgggaac aaagaaatac 421 cctaaagaaa atgaatacag ccagtttctc agtgagcatg caggaagttc aaatgccttt 481 actagtggag agcataccaa ttactatttt gatgtttctc atgaacacct agaaggtgcc 541 ctagacaggt ttgcacagtt ttttctgtgc cccttgttcg atgaaagttg caaagacaga 601 gaggtgaatg cagttgattc agaacatgag aagaatgtga tgaatgatgc ctggagactc 661 tttcaattgg aaaaagctac agggaatcct aaacacccct tcagtaaatt tgggacaggt 721 aacaaatata ctctggagac tagaccaaac caagaaggca ttgatgtaag acaagagcta 781 ctgaaattcc attctgctta ctattcatcc aacttaatgg ctgtttgtgt tttaggtcga 841 gaatctttag atgacttgac taatctggtg gtaaagttat tttctgaagt agagaacaaa 901 aatgttccat tgccagaatt tcctgaacac cctttccaag aagaacatct taaacaactt 961 tacaaaatag tacccattaa agatattagg aatctctatg tgacatttcc catacctgac 1021 cttcagaaat actacaaatc aaatcctggt cattatcttg gtcatctcat tgggcatgaa 1081 ggtcctggaa gtctgttatc agaacttaag tcaaagggct gggttaatac tcttgttggt 1141 gggcagaagg aaggagcccg aggttttatg ttttttatca ttaatgtgga cttgaccgag 1201 gaaggattat tacatgttga agatataatt ttgcacatgt ttcaatacat tcagaagtta 1261 cgtgcagaag gacctcaaga atgggttttc caagagtgca aggacttgaa tgctgttgct 1321 tttaggttta aagacaaaga gaggccacgg ggctatacat ctaagattgc aggaatattg 1381 cattattatc ccctagaaga ggtgctcaca gcggaatatt tactggaaga atttagacct 1441 gacttaatag agatggttct cgataaactc agaccagaaa atgtccgggt tgccatagtt 1501 tctaaatctt ttgaaggaaa aactgatcgc acagaagagt ggtatggaac ccagtacaaa 1561 caagaagcta taccggatga agtcatcaag aaatggcaaa atgctgacct gaatgggaaa 1621 tttaaacttc ctacaaagaa tgaatttatt cctacgaatt ttgagatttt accgttagaa 1681 aaagaggcga caccataccc tgctcttatt aaggatacag tcatgagcaa actttggttc 1741 aaacaagatg ataagaaaaa aaagccgaag gcttgtctca actttgaatt tttcagccca 1801 tttgcttatg tggacccctt gcactgtaac atggcctatt tgtaccttga gctcctcaaa 1861 gactcactca acgagtatgc atatgcagca gagctagcag gcttgagcta tgatctccaa 1921 aataccatct atgggatgta tctttcagtg aaaggttaca atgacaagca gccaatttta 1981 ctaaagaaga ttattgagaa aatggctacc tttgagattg atgaaaaaag atttgaaatt 2041 atcaaagaag catatatgcg atctcttaac aatttccggg ctgaacagcc tcaccagcat 2101 gccatgtact acctccgctt gctgatgact gaagtggcct ggactaaaga tgagttaaaa 2161 gaagctctgg atgatgtaac ccttcctcgc cttaaggcct tcatacctca gctcctgtca 2221 cggctgcaca ttgaagccct tctccatgga aacataacaa agcaggctgc attaggaatt 2281 atgcagatgg ttgaagacac cctcattgaa catgctcata ccaaacctct ccttccaagt 2341 cagctggttc ggtatagaga agttcagctc cctgacagag gatggtttgt ttatcagcag 2401 agaaatgaag ttcacaataa ctgtggcatc gagatatact accaaacaga catgcaaagc 2461 acctcagaga atatgtttct ggagctcttc tgtcagatta tctcggaacc ttgcttcaac 2521 accctgcgca ccaaggagca gttgggctat atcgtcttca gcgggccacg tcgagctaat 2581 ggcatacaga gcttgagatt catcatccag tcagaaaagc cacctcacta cctagaaagc 2641 agagtggaag ctttcttaat taccatggaa aagtccatag aggacatgac agaagaggcc 2701 ttccaaaaac acattcaggc attagcaatt cgtcgactag acaaaccaaa gaagctatct 2761 gctgagtgtg ctaaatactg gggagaaatc atctcccagc aatataattt tgacagagat 2821 aacactgagg ttgcatattt aaagacactt accaaggaag atatcatcaa attctacaag 2881 gaaatgttgg cagtagatgc tccaaggaga cataaggtat ccgtccatgt tcttgccagg 2941 gaaatggatt cttgtcctgt tgttggagag ttcccatgtc aaaatgacat aaatttgtca 3001 caagcaccag ccttgccaca acctgaagtg attcagaaca tgaccgaatt caagcgtggt 3061 ctgccactgt ttccccttgt gaaaccacat attaacttca tggctgcaaa actctgaaga 3121 ttccccatgc atgggaaagt gcaagtggat gcattcctga gtcttccaga gcctaagaaa 3181 atcatcttgg ccactttaat agtttctgat tcactattag agaaacaaac aaaaaattgt 3241 caaatgtcat tatgtagaaa tattataaat ccaaagtaaa ttacaaaatc ttatagatgt 3301 agaatatttt ttaaatacat gccctcttaa atatttc // LOCUS HUMIDNAL 2155 bp mRNA PRI 22-NOV-1995 DEFINITION Human alpha-L-iduronidas (IDUA) mRNA, complete cds. ACCESSION M74715 NID g184558 KEYWORDS alpha-L-iduronidase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2155) AUTHORS Scott,H.S., Anson,D.S., Orsborn,A.M., Nelson,P.V., Clements,P.R., Morris,C.P. and Hopwood,J.J. TITLE Human alpha-L-iduronidase: cDNA isolation and expression JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88 (21), 9695-9699 (1991) MEDLINE 92052158 REFERENCE 2 (bases 1 to 2155) AUTHORS Morris,C.P. TITLE Direct Submission JOURNAL Submitted (06-AUG-1991) C. Phillip Morris, Adelaide Children's Hospital, Chemical Pathology, North Adelaide, 5006, Australia FEATURES Location/Qualifiers source 1..2155 /organism="Homo sapiens" /db_xref="taxon:9606" /map="4p16.3" 5'UTR 1..88 /gene="IDUA" /note="G00-119-327" gene 1..2155 /gene="IDUA" sig_peptide 89..166 /gene="IDUA" /note="G00-119-327" CDS 89..2050 /gene="IDUA" /codon_start=1 /db_xref="GDB:G00-119-327" /product="alpha-L-iduronidase" /db_xref="PID:g184559" /translation="MRPLRPRAALLALLASLLAAPPVAPAEAPHLVQVDAARALWPLR RFWRSTGFCPPLPHSQADQYVLSWDQQLNLAYVGAVPHRGIKQVRTHWLLELVTTRGS TGRGLSYNFTHLDGYLDLLRENQLLPGFELMGSASGHFTDFEDKQQVFEWKDLVSSLA RRYIGRYGLAHVSKWNFETWNEPDHHDFDNVSMTMQGFLNYYDACSEGLRAASPALRL GGPGDSFHTPPRSPLSWGLLRHCHDGTNFFTGEAGVRLDYISLHRKGARSSISILEQE KVVAQQIRQLFPKFADTPIYNDEADPLVGWSLPQPWRADVTYAAMVVKVIAQHQNLLL ANTTSAFPYALLSNDNAFLSYHPHPFAQRTLTARFQVNNTRPPHVQLLRKPVLTAMGL LALLDEEQLWAEVSQAGTVLDSNHTVGVLASAHRPQGPADAWRAAVLIYASDDTRAHP NRSVAVTLRLRGVPPGPGLVYVTRYLDNGLCSPDGEWRRLGRPVFPTAEQFRRMRAAE DPVAAAPRPLPAGGRLTLRPALRLPSLLLVHVCARPEKPPGQVTRLRALPLTQGQLVL VWSDEHVGSKCLWTYEIQFSQDGKAYTPVSRKPSTFNLFVFSPDTGAVSGSYRVRALD YWARPGPFSDPVPYLEVPVPRGPPSPGNP" mat_peptide 167..2047 /gene="IDUA" /note="G00-119-327" /product="alpha-L-iduronidase" 3'UTR 2048..2155 /gene="IDUA" /note="G00-119-327" BASE COUNT 333 a 810 c 667 g 345 t ORIGIN 1 gtcacatggg gtgcgcgccc agactccgac ccggaggcgg aaccggcagt gcagcccgaa 61 gccccgcagt ccccgagcac gcgtggccat gcgtcccctg cgcccccgcg ccgcgctgct 121 ggcgctcctg gcctcgctcc tggccgcgcc cccggtggcc ccggccgagg ccccgcacct 181 ggtgcaggtg gacgcggccc gcgcgctgtg gcccctgcgg cgcttctgga ggagcacagg 241 cttctgcccc ccgctgccac acagccaggc tgaccagtac gtcctcagct gggaccagca 301 gctcaacctc gcctatgtgg gcgccgtccc tcaccgcggc atcaagcagg tccggaccca 361 ctggctgctg gagcttgtca ccaccagggg gtccactgga cggggcctga gctacaactt 421 cacccacctg gacgggtact tggaccttct cagggagaac cagctcctcc cagggtttga 481 gctgatgggc agcgcctcgg gccacttcac tgactttgag gacaagcagc aggtgtttga 541 gtggaaggac ttggtctcca gcctggccag gagatacatc ggtaggtacg gactggcgca 601 tgtttccaag tggaacttcg agacgtggaa tgagccagac caccacgact ttgacaacgt 661 ctccatgacc atgcaaggct tcctgaacta ctacgatgcc tgctcggagg gtctgcgcgc 721 cgccagcccc gccctgcggc tgggaggccc cggcgactcc ttccacaccc caccgcgatc 781 cccgctgagc tggggcctcc tgcgccactg ccacgacggt accaacttct tcactgggga 841 ggcgggcgtg cggctggact acatctccct ccacaggaag ggtgcgcgca gctccatctc 901 catcctggag caggagaagg tcgtcgcgca gcagatccgg cagctcttcc ccaagttcgc 961 ggacaccccc atttacaacg acgaggcgga cccgctggtg ggctggtccc tgccacagcc 1021 gtggagggcg gacgtgacct acgcggccat ggtggtgaag gtcatcgcgc agcatcagaa 1081 cctgctactg gccaacacca cctccgcctt cccctacgcg ctcctgagca acgacaatgc 1141 cttcctgagc taccacccgc accccttcgc gcagcgcacg ctcaccgcgc gcttccaggt 1201 caacaacacc cgcccgccgc acgtgcagct gttgcgcaag ccggtgctca cggccatggg 1261 gctgctggcg ctgctggatg aggagcagct ctgggccgaa gtgtcgcagg ccgggaccgt 1321 cctggacagc aaccacacgg tgggcgtcct ggccagcgcc caccgccccc agggcccggc 1381 cgacgcctgg cgcgccgcgg tgctgatcta cgcgagcgac gacacccgcg cccaccccaa 1441 ccgcagcgtc gcggtgaccc tgcggctgcg cggggtgccc cccggcccgg gcctggtcta 1501 cgtcacgcgc tacctggaca acgggctctg cagccccgac ggcgagtggc ggcgcctggg 1561 ccggcccgtc ttccccacgg cagagcagtt ccggcgcatg cgcgcggctg aggacccggt 1621 ggccgcggcg ccccgcccct tacccgccgg cggccgcctg accctgcgcc ccgcgctgcg 1681 gctgccgtcg cttttgctgg tgcacgtgtg tgcgcgcccc gagaagccgc ccgggcaggt 1741 cacgcggctc cgcgccctgc ccctgaccca agggcagctg gttctggtct ggtcggatga 1801 acacgtgggc tccaagtgcc tgtggacata cgagatccag ttctctcagg acggtaaggc 1861 gtacaccccg gtcagcagga agccatcgac cttcaacctc tttgtgttca gcccagacac 1921 aggtgctgtc tctggctcct accgagttcg agccctggac tactgggccc gaccaggccc 1981 cttctcggac cctgtgccgt acctggaggt ccctgtgcca agagggcccc catccccggg 2041 caatccatga gcctgtgctg agccccagtg ggttgcacct ccaccggcag tcagcgagct 2101 ggggctgcac tgtgcccatg ctgccctccc atcaccccct ttgcaatata ttttt // LOCUS HUMIEF 2113 bp mRNA PRI 01-JAN-1995 DEFINITION Human transformation-sensitive protein (IEF SSP 3521) mRNA, complete cds. ACCESSION M86752 NID g184564 KEYWORDS transformation-sensitive protein. SOURCE Homo sapiens (tissue library: lambda-ZAPII MRC-5 V2) mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2113) AUTHORS Honore,B., Leffers,H., Madsen,P., Rasmussen,H.H., Vandekerckhove,J. and Celis,J.E. TITLE Molecular cloning and expression of a transformation-sensitive human protein containing the TPR motif and sharing identity to the stress-inducible yeast protein STI1 JOURNAL J. Biol. Chem. 267 (12), 8485-8491 (1992) MEDLINE 92235077 FEATURES Location/Qualifiers source 1..2113 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="SV40-transformed cell line MRC-5" /cell_type="fibroblast" /tissue_lib="lambda-ZAPII MRC-5 V2" gene 63..1694 /gene="IEF SSP 3521" CDS 63..1694 /gene="IEF SSP 3521" /codon_start=1 /product="transformation-sensitive protein" /db_xref="PID:g184565" /translation="MEQVNELKEKGNKALSVGNIDDALQCYSEAIKLDPHNHVLYSNR SAAYAKKGDYQKAYEDGCKTVDLKPDWGKGYSRKAAALEFLNRFEEAKRTYEEGLKHE ANNPQLKEGLQNMEARLAERKFMNPFNMPNLYQKLESDPRTRTLLSDPTYRELIEQLR NKPSDLGTKLQDPRIMTTLSVLLGVDLGSMDEEEEIATPPPPPPPKKETKPEPMEEDL PENKKQALKEKELGNDAYKKKDFDTALKHYDKAKELDPTNMTYITNQAAVYFEKGDYN KCRELCEKAIEVGRENREDYRQIAKAYARIGNSYFKEEKYKDAIHFYNKSLAEHRTPD VLKKCQQAEKILKEQERLAYINPDLALEEKNKGNECFQKGDYPQAMKHYTEAIKRNPK DAKLYSNRAACYTKLLEFQLALKDCEECIQLEPTFIKGYTRKAAALEAMKDYTKAMDV YQKALDLDSSCKEAADGYQRCMMAQYNRHDSPEDVKRRAMADPEVQQIMSDPAMRLIL EQMQKDPQALSEHLKNPVIAQKIQKLMDVGLIAIR" BASE COUNT 605 a 538 c 567 g 403 t ORIGIN 1 gtgcggttgg gaacgcggag cggacggatt cgattcaacg gggttccgga ccgcgctgcg 61 ctatggagca ggtcaatgag ctgaaggaga aaggcaacaa ggccctgagc gtgggtaaca 121 tcgatgatgc cttacagtgc tactccgaag ctattaagct ggatccccac aaccacgtgc 181 tgtacagcaa ccgttctgct gcctatgcca agaaaggaga ctaccagaag gcttatgagg 241 atggctgcaa gactgtcgac ctaaagcctg actggggcaa gggctattca cgaaaagcag 301 cagctctaga gttcttaaac cgctttgaag aagccaagcg aacctatgag gagggcttaa 361 aacacgaggc aaataaccct caactgaaag agggtttaca gaatatggag gccaggttgg 421 cagagagaaa attcatgaac cctttcaaca tgcctaatct gtatcagaag ttggagagtg 481 atcccaggac aaggacacta ctcagtgatc ctacctaccg ggagctgata gagcagctac 541 gaaacaagcc ttctgacctg ggcacgaaac tacaagatcc ccggatcatg accactctca 601 gcgtcctcct tggggtcgat ctgggcagta tggatgagga ggaagagatt gcaacacctc 661 caccaccacc ccctcccaaa aaggagacca agccagagcc aatggaagaa gatcttccag 721 agaataagaa gcaggcactg aaagaaaaag agctggggaa cgatgcctac aagaagaaag 781 actttgacac agccttgaag cattacgaca aagccaagga gctggacccc actaacatga 841 cttacattac caatcaagca gcggtatact ttgaaaaggg cgactacaat aagtgccggg 901 agctttgtga gaaggccatt gaagtgggga gagaaaaccg agaagactat cgacagattg 961 ccaaagcata tgctcgaatt ggcaactcct acttcaaaga agaaaagtac aaggatgcca 1021 tccatttcta taacaagtct ctggcagagc accgaacccc agatgtgctc aagaaatgcc 1081 agcaggcaga gaaaatcctg aaggagcaag agcggctggc ctacataaac cccgacctgg 1141 ctttggagga gaagaacaaa ggcaacgagt gttttcagaa aggggactat ccccaggcca 1201 tgaagcatta tacagaagcc atcaaaagga acccgaaaga tgccaaatta tacagcaatc 1261 gagctgcctg ctacaccaaa ctcctggagt tccagctggc actcaaggac tgtgaggaat 1321 gtatccagct ggagccgacc ttcatcaagg gttatacacg gaaagccgct gcgctggaag 1381 cgatgaagga ctacaccaaa gccatggatg tgtaccagaa ggcgctagac ctggactcca 1441 gctgtaagga ggcggcagac ggctaccagc gctgtatgat ggcgcagtac aaccggcacg 1501 acagccccga agatgtgaag cgacgagcca tggccgaccc tgaggtgcag cagatcatga 1561 gtgacccagc catgcgcctt atcctggaac agatgcagaa ggacccccag gcactcagcg 1621 aacacttaaa gaatcctgta atagcacaga agatccagaa gctgatggat gtgggtctga 1681 ttgcaattcg gtgatgactt gttcatcccc ccttcccttc gccctcatgt ggaaagagga 1741 gctgggaccg cggcgagcag cacggagcgg aagggagagc aggggagaga aggcctcatc 1801 tctctatatt tatacataac cccggggaag acacagagac tcgtacctgc gctgtttgtg 1861 ccgccgctgc ctctgggccc tcccagcaca cgcatggtct cttcaccgct gccctcgagt 1921 tccatgtctc tttcccctgc ccctagttgc tgtctcggct gctctcccat agttggtttt 1981 ttttttattt ggggcagtgg gcatgttatg gggaggggag ggggttcttc cagcctcagg 2041 tcccagctgt ctcacgttgt ttattctgcg tccccttctc caataaaaca agccagttgg 2101 gcgtggttat aac // LOCUS HUMIEF2G 1440 bp mRNA PRI 14-JUL-1994 DEFINITION Human translation initiation factor eIF-2 gamma subunit mRNA, complete cds. ACCESSION L19161 NID g306899 KEYWORDS eIF-2 gamma; translation initiation; translation initiation factor; translation initiation factor eIF; translation initiation factor eIF2. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1440) AUTHORS Gaspar,N.J., Kinzy,T.G., Scherer,B.J., Humbelin,M., Hershey,J.W. and Merrick,W.C. TITLE Translation initiation factor eIF-2. Cloning and expression of the human cDNA encoding the gamma-subunit JOURNAL J. Biol. Chem. 269, 3415-3422 (1994) MEDLINE 94148837 FEATURES Location/Qualifiers source 1..1440 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="Erythroid, progenitor" /tissue_type="leukemic" CDS 1..1419 /standard_name="eIF-2 gamma" /codon_start=1 /product="translation initiation factor eIF-2 gamma subunit" /db_xref="PID:g306900" /translation="MAGGEAGVTLGQPHLSRQDLTTLDVTKLTPLSHEVISRQATINI GTIGHVAHGKSTVVKAISGVHTVRFKNELERNITIKLGYANAKIYKLDDPSCPRPECY RSCGSSTPDEFPTDIPGTKGNFKLVRHVSFVDCPGHDILMATMLNGAAVMDAALLLIA GNESCPQPQTSEHLAAIEIMKLKHILILQNKIDLVKESQAKEQYEQILAFVQGTVAEG APIIPISAQLKYNIEVVCEYIVKKIPVPPRDFTSEPRLIVIRSFDVNKPGCEVDDLKG GVAGGSILKGVLKVGQEIEVRPGIVSKDSEGKLMCKPIFSKIVSLFAEHNDLQYAAPG GLIGVGTKIDPTLCRADRMVGQVLGAVGALPEIFTELEISYFLLRRLLGVRTEGDKKA AKVQKLSKNEVLMVNIGSLSTGGRVSAVKADLGKIVLTNPVCTEVGEKIALSRRVEKH WRLIGWGQIRRGVTIKPTVDDD" BASE COUNT 441 a 269 c 349 g 381 t ORIGIN 1 atggcgggcg gagaagctgg agtgactcta gggcagccgc atctttcgcg tcaggatctc 61 accaccttgg atgttaccaa gttgacgcca ctttcacatg aagttatcag cagacaagcc 121 acaattaaca taggtacaat tggtcatgta gctcatggga aatccacagt cgtcaaagct 181 atttctggag ttcatactgt caggttcaaa aatgaactag aaagaaatat tacaatcaag 241 cttggatatg ctaatgctaa gatttataag cttgatgacc caagttgccc tcggccagaa 301 tgttatagat cttgtgggag cagtacacct gacgagtttc ctacggacat tccagggacc 361 aaagggaact tcaaattagt cagacatgtt tcctttgttg actgtcctgg ccacgatatt 421 ttgatggcta ctatgctgaa cggtgcagca gtgatggatg cagctcttct gttgatagct 481 ggtaatgaat cttgccctca gcctcagaca tcggaacacc tggctgctat agagatcatg 541 aaactgaagc atattttgat tctacaaaat aaaattgatt tggtaaaaga aagtcaggct 601 aaagaacaat acgagcagat ccttgcattt gtccaaggta cagtagcaga gggagctccc 661 attattccaa tttcagctca gctgaaatac aatattgaag ttgtttgtga gtacatagta 721 aagaaaattc cagtaccccc aagagacttt acttcagagc cccggcttat tgttattaga 781 tcttttgatg tcaacaaacc tggctgtgaa gttgatgacc ttaagggagg tgtagctggt 841 ggtagtatcc taaaaggagt attaaaggtg ggccaggaga tagaagtaag acctggtatt 901 gtttccaaag atagtgaagg aaaactcatg tgtaaaccaa tcttttccaa aattgtatca 961 ctttttgcgg agcataatga tctgcaatat gctgctccag gcggtcttat tggagttgga 1021 acaaaaattg accccacttt gtgccgggct gacagaatgg tggggcaagt acttggtgca 1081 gtcggagctt tacctgagat attcacagaa ttggaaattt cctatttcct gcttagacgg 1141 cttctaggtg tacgcactga aggagacaag aaagcagcaa aggttcaaaa gctgtctaag 1201 aatgaagtgc tcatggtgaa cataggatcc ctgtcaacag gagggagagt tagtgctgtc 1261 aaggccgatt tgggtaaaat tgttttgacc aatccagtgt gcacagaggt aggagaaaaa 1321 attgccctta gccgaagagt tgaaaaacac tggcgtttaa ttggttgggg tcagataaga 1381 agaggagtga caatcaagcc aacagtagat gatgactgaa gaataccagt taaataatac // LOCUS HUMIERB 1646 bp mRNA PRI 22-FEB-1995 DEFINITION Homo sapiens IgE receptor beta chain (HTm4) mRNA, complete cds. ACCESSION L35848 NID g561638 KEYWORDS CD20 antigen; IgE receptor; IgE receptor beta chain; immunoglobulin; immunoglobulin E receptor. SOURCE Homo sapiens hematopoietic cells cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1646) AUTHORS Adra,C.N., Lelias,J.M., Kobayashi,H., Kaghad,M., Morrison,P., Rowley,J.D. and Lim,B. TITLE Cloning of the cDNA for a hematopoietic cell-specific protein related to CD20 and the beta subunit of the high-affinity IgE receptor: evidence for a family of proteins with four membrane-spanning regions JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91 (21), 10178-10182 (1994) MEDLINE 95024008 FEATURES Location/Qualifiers source 1..1646 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="K562" /tissue_type="hematopoietic cells" /map="11q12-13.1" mRNA <99..>743 /gene="HTm4" gene 99..743 /gene="HTm4" CDS 99..743 /gene="HTm4" /codon_start=1 /product="IgE receptor beta subunit" /db_xref="PID:g561639" /translation="MASHEVDNAELGSASAHGTPGSETGPEELNTSVYHPINGSPDYQ KAKLQVLGAIQILNAAMILALGVFLGSLQYPYHFQKHFFFFTFYTGYPIWGAVFFCSS GTLSVVAGIKPTRTWIQNSFGMNIASATIALVGTAFLSLNIAVNIQSLRSCHSSSESP DLCNYMGSISNGMVSLLLILTLLELCVTISTIAMWCNANCCNSREEISSPPNSV" BASE COUNT 490 a 352 c 314 g 490 t ORIGIN 1 tagtgatctt ttctgagtgt ctcctacttg cgacaaggtg gacttgggag gaaagccgtc 61 tgccaaagcc tgaagcctcc aagccataaa caaccccaat ggcctcccac gaagttgata 121 atgcagagct ggggtcagcc tctgcccatg gtaccccagg cagtgagacg ggaccagaag 181 agctgaatac ttctgtctac caccccataa atggatcacc agattatcag aaagcaaaat 241 tacaagttct tggggccatc cagatcctga atgcagcaat gattctggct ttgggtgtct 301 ttctgggttc cttgcaatac ccataccact tccaaaagca cttctttttc ttcaccttct 361 acacaggcta cccgatttgg ggtgctgtgt ttttctgtag ttcaggaacc ttgtctgttg 421 tagcagggat aaaacccaca agaacatgga tacagaacag ttttggaatg aacattgcca 481 gtgctacaat tgcactagtg gggactgctt ttctctcact aaatatagca gttaatatcc 541 agtcattaag gagttgtcac tcttcatcag agtcaccgga cctatgcaat tacatgggct 601 ccatatcaaa tggcatggtg tctctactgc tgattctcac cttgctggaa ttatgcgtaa 661 ctatctctac catagccatg tggtgcaatg caaactgctg taattcaaga gaggaaattt 721 cctcacctcc caattctgtg taatcaagaa tacctcctta tgaaaataat tctgagagca 781 tgaatatttg accttaaatc tccagtgact cagagcttca cccacaaact caggagaaca 841 taagcctgct cgtaaagctc aatccttcta tcatggcacc aatcacaaga accttggacg 901 tttgactgac tctatccttt ctctcctaac tataaatcct atttgtgtgt cgtgggtatg 961 gaaggacaga tatatttctt taggcattct tggatatctg taacttctat gatcattact 1021 ccaaagttgt ttccagaaat tggttctatt tcttcttatc cacctactcc attgctttat 1081 gaggtttaag gaaggaaggc ggtataatcc ctattcaata tattttttct aaaatccaac 1141 ttctgaccgc ccagtaggaa gaaaaatgag acattttttc cattacagag aaatgcttct 1201 tgactttaac atcagcatta taaaaagtgt caaataaaaa attaccatca ttatcattaa 1261 aataaatttt cactgtattt gagatgggag ggttaaggct cagggatttt atttcagtga 1321 actgctggaa ctcacacatg ccctgatatg taaatgatga tttatgttgg cgagtctgag 1381 agcaagccca aatgtgttct tcaaaggaca atgggaaact gtaaagtaga gaactaaaga 1441 ataaggcctt tagaatctga cacatctggg ttcaaattct gaaactgtca cttattacct 1501 gtatgaacat gggcaaatta tctaatctct ctgatctatt tttcctcatc tgtaaaatag 1561 gtgtaataat aacaactact ttgtcggttg ctctgagggt taaatgaaaa taaaaagaaa 1621 atgtgaaaca gcaccacagg tacttg // LOCUS HUMIF4E 1842 bp mRNA PRI 01-SEP-1995 DEFINITION Homo sapiens cap-binding protein mRNA, complete cds. ACCESSION M15353 NID g306486 KEYWORDS cap-binding protein; eIF-4E gene. SOURCE Homo sapiens (clone library: library of J. Whittaker) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1842) AUTHORS Rychlik,W., Domier,L.L., Gardner,P.R., Hellmann,G.M. and Rhoads,R.E. TITLE Amino acid sequence of the mRNA cap-binding protein from human tissues [published erratum appears in Proc Natl Acad Sci U S A 1992 Feb 1;89(3):1148] JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84 (4), 945-949 (1987) MEDLINE 87147214 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by L.Domier, 21-APR-1987. There are at least two species of CBP mRNA of about 1900 and 2500 nucleotides each. There may be alternative start sites. Potential polyadenylation signals are located at positions 1315-1320, 1621-1626, and 1811-1816. FEATURES Location/Qualifiers source 1..1842 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="library of J. Whittaker" /cell_line="IM9" /cell_type="lymphocyte" mRNA <1..1842 /note="CBP mRNA" CDS 19..672 /codon_start=1 /product="cap-binding protein" /db_xref="PID:g306487" /translation="MATVEPETTPTPNPPTTEEEKTESNQEVANPEHYIKHPLQNRWA LWFFKNDKSKTWQANLRLISKFDTVEDFWALYNHIQLSSNLMPGCDYSLFKDGIEPMW EDEKNKRGGRWLITLNKQQRRSDLDRFWLETLLCLIGESFDDYSDDVCGAVVNVRAKG DKIAIWTTECENREAVTHIGRVYKERLGLPPKIVIGYQSHADTATKSGSTTKNRFVV" polyA_signal one-of(1314..1319,1620..1625,1810..1815) polyA_site 1842 BASE COUNT 550 a 310 c 338 g 644 t ORIGIN 1 cgatcagatc gatctaagat ggcgactgtc gaaccggaaa ccacccctac tcctaatccc 61 ccgactacag aagaggagaa aacggaatct aatcaggagg ttgctaaccc agaacactat 121 attaaacatc ccctacagaa cagatgggca ctctggtttt ttaaaaatga taaaagcaaa 181 acttggcaag caaacctgcg gctgatctcc aagtttgata ctgttgaaga cttttgggct 241 ctgtacaacc atatccagtt gtctagtaat ttaatgcctg gctgtgacta ctcacttttt 301 aaggatggta ttgagcctat gtgggaagat gagaaaaaca aacggggagg acgatggcta 361 attacattga acaaacagca gagacgaagt gacctcgatc gcttttggct agagacactt 421 ctgtgcctta ttggagaatc ttttgatgac tacagtgatg atgtatgtgg cgctgttgtt 481 aatgttagag ctaaaggtga taagatagca atatggacta ctgaatgtga aaacagagaa 541 gctgttacac atatagggag ggtatacaag gaaaggttag gacttcctcc aaagatagtg 601 attggttatc agtcccacgc agacacagct actaagagcg gctccaccac taaaaatagg 661 tttgttgttt aagaagacac cttctgagta ttctcatagg agactgcgtc aagcaatcga 721 gatttgggag ctgaaccaaa gcctcttcaa aaagcagagt ggactgcatt taaatttgat 781 ttccatctta atgttactca gatataagag aagtctcatt cgcctttgtc ttgtacttct 841 gtgttcattt tttttttttt tttttggcta gagtttccac tatcccaatc aaagaattac 901 agtacacatc cccagaatcc ataaatgtgt tcctggccca ctctgtaata gttcagtaga 961 attaccatta attacataca gattttacct atccacaata gtcagaaaac aacttggcat 1021 ttctatactt tacaggaaaa aaaattctgt tgttccattt tatgcagaag catattttgc 1081 tggtttgaaa gattatgatg catacagttt tctagcaatt ttctttgttt ctttttacag 1141 cattgtcttt gctgtactct tgctgatggc tgctagattt taatttattt gtttccctac 1201 ttgataatat tagtgattct gatttcagtt tttcatttgt tttgcttaaa tttttttttt 1261 ttttttcctc atgtaacatt ggtgaaggat ccaggaatat gacacaaagg tggaataaac 1321 attaattttg tgcattcttt ggtaattttt tttgtttttt gtaactacaa agctttgcta 1381 caaatttatg catttcattc aaatcagtga tctatgtttg tgtgatttcc taaacataat 1441 tgtggattat aaaaaatgta acatcataat tacattccta actagaatta gtatgtctgt 1501 ttttgtatct ttatgctgta ttttaacact ttgtattact taggttattt tgctttggtt 1561 aaaaatggct caagtagaaa agcagtccca ttcatattaa gacagtgtac aaaactgtaa 1621 ataaaatgtg tacagtgaat tgtcttttag acaactagat ttgtcctttt atttctccat 1681 ctttatagaa ggaatttgta cttcttattg caggcaagtc tctatattat gtcctctttt 1741 gtggtgtctt ccatgtgaac agcataagtt tggagcacta gtttgattat tatgtttatt 1801 acaattttta ataaattgaa taggtagtat catatatatg ga // LOCUS HUMIFI16A 2709 bp mRNA PRI 01-JAN-1995 DEFINITION Human interferon-gamma induced protein (IFI 16) gene, complete cds. ACCESSION M63838 NID g184568 KEYWORDS interferon-gamma inducible protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Trapani,J.A., Browne,K.A., Dawson,M.J., Ramsay,R.G., Eddy,R.L., Show,T.B., White,P.C. and Dupont,B. TITLE A novel gene constitutively expressed in human lymphoid cells is inducible with interferon-gamma in myeloid cells JOURNAL Immunogenetics 36 (6), 369-376 (1992) MEDLINE 92406263 FEATURES Location/Qualifiers source 1..2709 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="CTL/NK cell" 5'UTR 1..264 /gene="IFI 16" gene 1..2686 /gene="IFI 16" CDS 265..2454 /gene="IFI 16" /codon_start=1 /product="interferon-gamma induced protein" /db_xref="PID:g184569" /translation="MGKKYKNIVLLKGLEVINDYHFRMVKSLLSNDLKLNLKMREEYD KIQIADLMEEKFRGDAGLGKLIKIFEDIPTLEDLAETLKKEKLKVKGPALSRKRKKEV HATSPAPSTSSTVKTEGAEATPGAQKRKKSTKEKAGPKGSKVSEEQTQPPSPAGAGMS TAMGRSPSPKTSLSAPPNSSSTENPKTVAKCQVTPRRNVLQKRPVIVKVLSTTKPFEY ETPEMEKKIMFHATVATQTQFFHVKVLNTSLKEKFNGKKIIIISDYLEYDSLLEVNEE STVSEAGPNQTFEVPNKIINRAKETLKIDILHKQASGNIVYGVFMLHKKTVNQKTTIY EIQDDRGKMDVVGTGQCHNIPCEEGDKLQLFCFRLRKKNQMSKLISEMHSFIQIKKKT NPRNNDPKSMKLPQEQRQLPYPSEASTTFPESHLRTPQMPPTTPSSSFFTKKSEDTIS KMNDFMRMQILKEGSHFPGPFMTSIGPAESHPHTPQMPPSTPSSSFLTTLKPRLKTEP EEVSIEDSAQSDLKEVMVLNATESFVYEPKEQKKMFHATVATENEVFRVKVFNIDLKE KFTPKKIIAIANYVCRNGFLEVYPFTLVADVNADRNMEIPKGLIRSASVTPKINQLCS QTKGSFVNGVFEVHKKNVRGEFTYYEIQDNTGKMEVVVHGRLNTINCEEGDKLKLTSF ELAPKSGNTGELRSVIHSHIKVIKTRKNKKDILNPDSSMETSPDFFF" polyA_signal 2677..2686 /gene="IFI 16" BASE COUNT 960 a 542 c 548 g 659 t ORIGIN 1 gggaatagca gaataggagc aagccagcac tagtcagcta actaagtgac tcaaccaagg 61 ccttttttcc ttgttatctt tgcagatact tcattttctt agcgtttctg gagattacaa 121 catcctgcgg ttccgtttct gggaacttta ctgatttatc tcccccctca cacaaataag 181 cattgattcc tgcatttctg aagatctcaa gatctggact actgttgaaa aaatttccag 241 tgaggctcac ttatgtctgt aaagatggga aaaaaataca agaacattgt tctactaaaa 301 ggattagagg tcatcaatga ttatcatttt agaatggtta agtccttact gagcaacgat 361 ttaaaactta atttaaaaat gagagaagag tatgacaaaa ttcagattgc tgacttgatg 421 gaagaaaagt tccgaggtga tgctggtttg ggcaaactaa taaaaatttt cgaagatata 481 ccaacgcttg aagacctggc tgaaactctt aaaaaagaaa agttaaaagt aaaaggacca 541 gccctatcaa gaaagaggaa gaaggaagtg catgctactt cacctgcacc ctccacaagc 601 agcactgtca aaactgaagg agcagaggca actcctggag ctcagaaaag aaaaaaatca 661 accaaagaaa aggctggacc caaagggagt aaggtgtccg aggaacagac tcagcctccc 721 tctcctgcag gagccggcat gtccacagcc atgggccgtt ccccatctcc caagacctca 781 ttgtcagctc cacccaacag ttcttcaact gagaacccga aaacagtggc caaatgtcag 841 gtaactccca gaagaaatgt tctccaaaaa cgcccagtga tagtgaaggt actgagtaca 901 acaaagccat ttgaatatga gaccccagaa atggagaaaa aaataatgtt tcatgctaca 961 gtggctacac agacacagtt cttccatgtg aaggttttaa acaccagctt gaaggagaaa 1021 ttcaatggaa agaaaatcat catcatatca gattatttgg aatatgatag tctcctagag 1081 gtcaatgaag aatctactgt atctgaagct ggtcctaacc aaacgtttga ggttccaaat 1141 aaaatcatca acagagcaaa ggaaactctg aagattgata ttcttcacaa acaagcttca 1201 ggaaatattg tatatggggt atttatgcta cataagaaaa cagtaaatca gaagaccaca 1261 atctacgaaa ttcaggatga tagaggaaaa atggatgtag tggggacagg acaatgtcac 1321 aatatcccct gtgaagaagg agataagctc cagcttttct gctttcgact tagaaaaaag 1381 aaccagatgt caaaactgat ttcagaaatg catagtttta tccagataaa gaaaaaaaca 1441 aacccgagaa acaatgaccc caagagcatg aagctacccc aggaacagcg tcagcttcca 1501 tatccttcag aggccagcac aaccttccct gagagccatc ttcggactcc tcagatgcca 1561 ccaacaactc catccagcag tttcttcacc aagaaaagtg aagacacaat ctccaaaatg 1621 aatgacttca tgaggatgca gatactgaag gaagggagtc attttccagg accgttcatg 1681 accagcatag gcccagctga gagccatccc cacactcctc agatgcctcc atcaacacca 1741 agcagcagtt tcttaaccac gttgaaacca agactgaaga ctgaacctga agaagtttcc 1801 atagaagaca gtgcccagag tgacctcaaa gaagtgatgg tgctgaacgc aacagaatca 1861 tttgtatatg agcccaaaga gcagaagaaa atgtttcatg ccacagtggc aactgagaat 1921 gaagtcttcc gagtgaaggt ttttaatatt gacctaaagg agaagttcac cccaaagaag 1981 atcattgcca tagcaaatta tgtttgccgc aatgggttcc tggaggtata tcctttcaca 2041 cttgtggctg atgtgaatgc tgaccgaaac atggagatcc caaaaggatt gattagaagt 2101 gccagcgtaa ctcctaaaat caatcagctt tgctcacaaa ctaaaggaag ttttgtgaat 2161 ggggtgtttg aggtacataa gaaaaatgta aggggtgaat tcacttatta tgaaatacaa 2221 gataatacag ggaagatgga agtggtggtg catggacgac tgaacacaat caactgtgag 2281 gaaggagata aactgaaact caccagcttt gaattggcac cgaaaagtgg gaataccggg 2341 gagttgagat ctgtaattca tagtcacatc aaggtcatca agaccaggaa aaacaagaaa 2401 gacatactca atcctgattc aagtatggaa acttcaccag actttttctt ctaaaatctg 2461 gatgtcattg acgataatgt ttatggagat aaggtctaag tccctaaaaa aatgtacata 2521 tacctggttg aaatacaaca ctatacatac acaccaccat atatactagc tgttaatcct 2581 atggaatggg ggtattggga gtgctttttt aatttttcat agtttttttt taataaaatg 2641 gcatattttg catctacaac ttctataata agaaaaaata aataaacatt atcttttttg 2701 tgaaaaaaa // LOCUS HUMIFN15K 634 bp mRNA PRI 11-JUN-1993 DEFINITION Human interferon-induced 17-kDa/15-kDa protein mRNA, complete cds. ACCESSION M13755 NID g184570 KEYWORDS interferon; interferon-inducible protein. SOURCE Human (Daudi cells) cDNA to mRNA, clone 31. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 634; 1 to 634) AUTHORS Blomstrom,D.C. JOURNAL Unpublished (1986) REFERENCE 2 (bases 1 to 634) AUTHORS Blomstrom,D.C., Fahey,D., Kutny,R., Korant,B.D. and Knight,E.Jr. TITLE Molecular characterization of the interferon-induced 15-kDa protein: Molecular cloning and nucleotide and amino acid sequence JOURNAL J. Biol. Chem. 261, 8811-8816 (1986) MEDLINE 86250802 COMMENT [1] revises [2]. [1] revises [1]. Draft entry and computer-readable sequence for [2],[1], kindly provided by D.C.Blomstrom, 15-DEC-1986, and for [1] 28-MAR-1989. The 15-kDa protein is formed by a post-translational modification of the 17-kDa protein in which eight amino acids are removed from the COOH terminus. FEATURES Location/Qualifiers source 1..634 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..>634 /note="17-kDa/15-kDa mRNA" mat_peptide 76..546 /note="15-kDa protein" CDS 76..573 /note="17-kDa protein" /codon_start=1 /db_xref="PID:g306901" /translation="MGWDLTVKMLAGNEFQVSLSSSMSVSELKAQITQKIGVHAFQQR LAVHPSGVALQDRVPLASQGLGPGSTVLLVVDKCDEPLSILVRNNKGRSSTYEVRLTQ TVAHLKQQVSGLEGVQDDLFWLTFEGKPLEDQLPLGEYGLKPLSTVFMNLRLRGGGTE PGGRS" BASE COUNT 130 a 187 c 219 g 98 t ORIGIN 239 bp upstream of PstI site. 1 cggctgagag gcagcgaact catctttgcc agtacaggag cttgtgccgt ggcccacagc 61 ccacagccca cagccatggg ctgggacctg acggtgaaga tgctggcggg caacgaattc 121 caggtgtccc tgagcagctc catgtcggtg tcagagctga aggcgcagat cacccagaag 181 attggcgtgc acgccttcca gcagcgtctg gctgtccacc cgagcggtgt ggcgctgcag 241 gacagggtcc cccttgccag ccagggcctg ggccctggca gcacggtcct gctggtggtg 301 gacaaatgcg acgaacctct gagcatcctg gtgaggaata acaagggccg cagcagcacc 361 tacgaggtcc ggctgacgca gaccgtggcc cacctgaagc agcaagtgag cgggctggag 421 ggtgtgcagg acgacctgtt ctggctgacc ttcgagggga agcccctgga ggaccagctc 481 ccgctggggg agtacggcct caagcccctg agcaccgtgt tcatgaatct gcgcctgcgg 541 ggaggcggca cagagcctgg cgggcggagc taagggcctc caccagcatc cgagcaggat 601 caagggccgg aaataaaggc tgttgtaaga gaat // LOCUS HUMIFNAII 1257 bp DNA PRI 08-NOV-1994 DEFINITION Human interferon-alpha class II (IFNA-II-1) gene, complete cds. ACCESSION M11003 NID g184610 KEYWORDS interferon alpha-II-1. SOURCE Homo sapiens foetus liver DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1257) AUTHORS Capon,D.J., Shepard,H.M. and Goeddel,D.V. TITLE Two distinct families of human and bovine interferon-alpha genes are coordinately expressed and encode functional polypeptides JOURNAL Mol. Cell. Biol. 5 (4), 768-779 (1985) MEDLINE 85187974 FEATURES Location/Qualifiers source 1..1257 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="foetus" /tissue_type="liver" /map="9p22" mRNA 140..>795 /gene="IFNA" /note="G00-119-328" gene 140..795 /gene="IFNA" CDS 208..795 /gene="IFNA" /note="class II" /codon_start=1 /db_xref="GDB:G00-119-328" /product="interferon-alpha" /db_xref="PID:g386800" /translation="MALLFPLLAALVMTSYSPVGSLGCDLPQNHGLLSRNTLVLLHQM RRISPFLCLKDRRDFRFPQEMVKGSQLQKAHVMSVLHEMLQQIFSLFHTERSSAAWNM TLLDQLHTELHQQLQHLETCLLQVVGEGESAGAISSPALTLRRYFQGIRVYLKEKKYS DCAWEVVRMEIMKSLFLSTNMQERLRSKDRDLGSS" BASE COUNT 386 a 272 c 235 g 364 t ORIGIN Chromosome 9p22-p13. 1 tagattgttg tcatcctctt aagtcatagg gagaacacac aaatgaaaac agtaaaagaa 61 actgaaagta cagagaaatg ttcagaaaat gaaaaccatg tgtttcctat taaaagccat 121 gcatacaagc aatgtcttca gaaaacctag ggtccaaggt taagccatat cccagctcag 181 taaagccagg agcatcctca tttcccaatg gccctcctgt tccctctact ggcagcccta 241 gtgatgacca gctatagccc tgttggatct ctgggctgtg atctgcctca gaaccatggc 301 ctacttagca ggaacacctt ggtgcttctg caccaaatga ggagaatctc ccctttcttg 361 tgtctcaagg acagaagaga cttcaggttc ccccaggaga tggtaaaagg gagccagttg 421 cagaaggccc atgtcatgtc tgtcctccat gagatgctgc agcagatctt cagcctcttc 481 cacacagagc gctcctctgc tgcctggaac atgaccctcc tagaccaact ccacactgaa 541 cttcatcagc aactgcaaca cctggagacc tgcttgctgc aggtagtggg agaaggagaa 601 tctgctgggg caattagcag ccctgcactg accttgagga ggtacttcca gggaatccgt 661 gtctacctga aagagaagaa atacagcgac tgtgcctggg aagttgtcag aatggaaatc 721 atgaaatcct tgttcttatc aacaaacatg caagaaagac tgagaagtaa agatagagac 781 ctgggctcat cttgaaatga ttctcattga ttaatttgcc ataataacac ttgcacatgt 841 gactctggtc aattcaaaag actcttattt cggctttaat cacagaatga ctgaattagt 901 tctgcaaata ctttgtcggt atattaagcc agtatatgtt aaaaagactt aggttcaggg 961 gcatcagtcc ctaagatgtt atttattttt actcatttat ttattcttac attttatcat 1021 atttatacta tttatattct tatataacaa atgtttgcct ttacattgta ttaagataac 1081 aaaacatgtt cagctttcca tttggttaaa tattgtattt tgttatttat taaattattt 1141 tcaaacaaaa cttcttgaag ttatttattc gaaaaccaaa atccaaacac tagttttctg 1201 aaccaaatca aggaatggac ggtaatatac acttacctat tcattcattc catttac // LOCUS HUMIFNRG 2064 bp mRNA PRI 08-NOV-1994 DEFINITION Human interferon-gamma receptor mRNA, complete cds. ACCESSION J03143 NID g184650 KEYWORDS interferon receptor. SOURCE Human lymphoid tissue cell line Raji, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2064) AUTHORS Aguet,M., Dembic,Z. and Merlin,G. TITLE Molecular cloning and expression of the human interferon-gamma receptor JOURNAL Cell 55 (2), 273-280 (1988) MEDLINE 89003065 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by M.Aguet, 08-SEP-1988. FEATURES Location/Qualifiers source 1..2064 /organism="Homo sapiens" /db_xref="taxon:9606" /map="6q23-q24" mRNA <1..2064 /note="IFNR-gamma mRNA" gene 49..1518 /gene="IFNGR1" CDS 49..1518 /gene="IFNGR1" /note="interferon-gamma receptor" /codon_start=1 /db_xref="GDB:G00-120-688" /db_xref="PID:g306915" /translation="MALLFLLPLVMQGVSRAEMGTADLGPSSVPTPTNVTIESYNMNP IVYWEYQIMPQVPVFTVEVKNYGVKNSEWIDACINISHHYCNISDHVGDPSNSLWVRV KARVGQKESAYAKSEEFAVCRDGKIGPPKLDIRKEEKQIMIDIFHPSVFVNGDEQEVD YDPETTCYIRVYNVYVRMNGSEIQYKILTQKEDDCDEIQCQLAIPVSSLNSQYCVSAE GVLHVWGVTTEKSKEVCITIFNSSIKGSLWIPVVAALLLFLVLSLVFICFYIKKINPL KEKSIILPKSLISVVRSATLETKPESKYVSLITSYQPFSLEKEVVCEEPLSPATVPGM HTEDNPGKVEHTEELSSITEVVTTEENIPDVVPGSHLTPIERESSSPLSSNQSEPGSI ALNSYHSRNCSESDHSRNGFDTDSSCLESHSSLSDSEFPPNNKGEIKTEGQELITVIK APTSFGYDKPHVLVDLLVDDSGKESLIGYRPTEDSKEFS" BASE COUNT 639 a 383 c 426 g 616 t ORIGIN 1 bp upstream of EcoRI site; chromosome 6q15-q21. 1 gaattccgca ggcgctcggg gttggagcca gcgaccgtcg gtagcagcat ggctctcctc 61 tttctcctac cccttgtcat gcagggtgtg agcagggctg agatgggcac cgcggatctg 121 gggccgtcct cagtgcctac accaactaat gttacaattg aatcctataa catgaaccct 181 atcgtatatt gggagtacca gatcatgcca caggtccctg tttttaccgt agaggtaaag 241 aactatggtg ttaagaattc agaatggatt gatgcctgca tcaatatttc tcatcattat 301 tgtaatattt ctgatcatgt tggtgatcca tcaaattctc tttgggtcag agttaaagcc 361 agggttggac aaaaagaatc tgcctatgca aagtcagaag aatttgctgt atgccgagat 421 ggaaaaattg gaccacctaa actggatatc agaaaggagg agaagcaaat catgattgac 481 atatttcacc cttcagtttt tgtaaatgga gacgagcagg aagtcgatta tgatcccgaa 541 actacctgtt acattagggt gtacaatgtg tatgtgagaa tgaacggaag tgagatccag 601 tataaaatac tcacgcagaa ggaagatgat tgtgacgaga ttcagtgcca gttagcgatt 661 ccagtatcct cactgaattc tcagtactgt gtttcagcag aaggagtctt acatgtgtgg 721 ggtgttacaa ctgaaaagtc aaaagaagtt tgtattacca ttttcaatag cagtataaaa 781 ggttctcttt ggattccagt tgttgctgct ttactactct ttctagtgct tagcctggta 841 ttcatctgtt tttatattaa gaaaattaat ccattgaagg aaaaaagcat aatattaccc 901 aagtccttga tctctgtggt aagaagtgct actttagaga caaaacctga atcaaaatat 961 gtatcactca tcacgtcata ccagccattt tccttagaaa aggaggtggt ctgtgaagag 1021 ccgttgtctc cagcaacagt tccaggcatg cataccgaag acaatccagg aaaagtggaa 1081 catacagaag aactttctag tataacagaa gtggtgacta ctgaagaaaa tattcctgac 1141 gtggtcccgg gcagccatct gactccaata gagagagaga gttcttcacc tttaagtagt 1201 aaccagtctg aacctggcag catcgcttta aactcgtatc actccagaaa ttgttctgag 1261 agtgatcact ccagaaatgg ttttgatact gattccagct gtctggaatc acatagctcc 1321 ttatctgact cagaatttcc cccaaataat aaaggtgaaa taaaaacaga aggacaagag 1381 ctcataaccg taataaaagc ccccacctcc tttggttatg ataaaccaca tgtgctagtg 1441 gatctacttg tggatgatag cggtaaagag tccttgattg gttatagacc aacagaagat 1501 tccaaagaat tttcatgaga tcagctaagt tgcaccaact ttgaagtctg attttcctgg 1561 acagttttct gctttaattt catgaaaaga ttatgatctc agaaattgta tcttagttgg 1621 tatcaaccaa atggagtgac ttagtgtaca tgaaagcgta aagaggatgt gtggcatttt 1681 cacttttggc ttgtaaagta cagacttttt ttttttttta aacaaaaaaa gcattgtaac 1741 ttatgaacct ttacatccag ataggttacc agtaacggaa catatccagt actcctggtt 1801 cctaggtgag caggtgatgc cccagggacc tttgtagcca cttcactttt tttcttttct 1861 ctgccttggt atagcatatg tgttttgtaa gtttatgcat acagtaattt taagtaattt 1921 cagaagaaat tctcgaagct tttcaaaatt ggacttaaaa tctaattcaa actaatagaa 1981 ttaatggaat atgtaaatag aaacgtgtat attttttatg aaacattaca gttagagatt 2041 tttaaataaa gaattttaaa actc // LOCUS HUMIFNRTF 1584 bp mRNA PRI 01-JAN-1995 DEFINITION Human IFN-responsive transcription factor subunit mRNA, complete cds. ACCESSION M87503 NID g184652 KEYWORDS DNA-binding protein; IFN-alpha responsive transcription factor; ISGF3-gamma protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1584) AUTHORS Veals,S.A., Schindler,C., Leonard,D., Fu,X.Y., Aebersold,R., Darnell,J.E. Jr. and Levy,D.E. TITLE Subunit of an alpha-interferon-responsive transcription factor is related to interferon regulatory factor and Myb families of DNA-binding proteins JOURNAL Mol. Cell. Biol. 12 (8), 3315-3324 (1992) MEDLINE 92334329 FEATURES Location/Qualifiers source 1..1584 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" gene 35..1568 /gene="ISGF3-gamma" CDS 35..1216 /gene="ISGF3-gamma" /codon_start=1 /product="IFN-alpha responsive transcription factor" /db_xref="PID:g184653" /translation="MASGRARCTRKLRNWVVEQVESGQFPGVCWDDTAKTMFRIPWKH AGKQDFREDQDAAFFKAWAIFKGKYKEGDTGGPAVWKTRLRCALNKSSEFKEVPERGR MDVAEPYKVYQLLPPGIVSGQPGTQKVPSKRQHSSVSSERKEEEDAMQNCTLSPSVLQ DSLNNEEEGASGGAVHSDIGSSSSSSSPEPQEVTDTTEAPFQGDQRSLEFLLPPEPDY SLLLTFIYNGRVVGEAQVQSLDCRLVAEPSGSESSMEQVLFPKPGPLEPTQRLLSQLE RGILVASNPRGLFVQRLCPIPISWNAPQAPPGPGPHLLPSNECVELFRTAYFCRDLVR YFQGLGPPPKFQVTLNFWEESHGSSHTPQNLITVKMEQAFARYLLEQTPEQQAAILSL V" polyA_signal 1563..1568 /gene="ISGF3-gamma" polyA_site 1584 /gene="ISGF3-gamma" BASE COUNT 364 a 434 c 428 g 358 t ORIGIN 1 gatcagaggg cgatcagctg gacagcaact caggatggca tcaggcaggg cacgctgcac 61 ccgaaaactc cggaactggg tggtggagca agtggagagt gggcagtttc ccggagtgtg 121 ctgggatgat acagctaaga ccatgttccg gattccctgg aaacatgcag gcaagcagga 181 cttccgggag gaccaggatg ctgccttctt caaggcctgg gcaatattta agggaaagta 241 taaggagggg gacacaggag gtccagctgt ctggaagact cgcctgcgct gtgcactcaa 301 caagagttct gaatttaagg aggttcctga gaggggccgc atggatgttg ctgagcccta 361 caaggtgtat cagttgctgc caccaggaat cgtctctggc cagccaggga ctcagaaagt 421 accatcaaag cgacagcaca gttctgtgtc ctctgagagg aaggaggaag aggatgccat 481 gcagaactgc acactcagtc cctctgtgct ccaggactcc ctcaataatg aggaggaggg 541 ggccagtggg ggagcagtcc attcagacat tgggagcagc agcagcagca gcagccctga 601 gccacaggaa gttacagaca caactgaggc cccctttcaa ggggatcaga ggtccctgga 661 gtttctgctt cctccagagc cagactactc actgctgctc accttcatct acaacgggcg 721 cgtggtgggc gaggcccagg tgcaaagcct ggattgccgc cttgtggctg agccctcagg 781 ctctgagagc agcatggagc aggtgctgtt ccccaagcct ggcccactgg agcccacgca 841 gcgcctgctg agccagcttg agaggggcat cctagtggcc agcaaccccc gaggcctctt 901 cgtgcagcgc ctttgcccca tccccatctc ctggaatgca ccccaggctc cacctgggcc 961 aggcccgcat ctgctgccca gcaacgagtg cgtggagctc ttcagaaccg cctacttctg 1021 cagagacttg gtcaggtact ttcagggcct gggcccccca ccgaagttcc aggtaacact 1081 gaatttctgg gaagagagcc atggctccag ccatactcca cagaatctta tcacagtgaa 1141 gatggagcag gcctttgccc gatacttgct ggagcagact ccagagcagc aggcagccat 1201 tctgtccctg gtgtagagcc tgggggaccc atcttccacc tcacctcttt gttcttcctg 1261 tctcctttga agtagactca ttcttcacac gattgacctg tcctctttgt gataattctc 1321 agtagttgtc cgtgataatc gtgtcctgaa aatcctcgca cacactggct ggtggagaac 1381 tcaaggctaa ttttttatcc tttttttttt tttatttttg agatatacgc cctctttcat 1441 ctgtaaggga ctaggaaatt ccaaatggtg tgaacccagg gggcctttcc ctcttccctg 1501 acctcccaac tctaaagcca agcactttat atttttctct tagatattca ctaaggactt 1561 aaaataaaat ttttttgaaa gagg // LOCUS HUMIGB7 1491 bp mRNA PRI 11-JUN-1993 DEFINITION Human Ig rearranged B7 protein mRNA VC1-region, complete cds. ACCESSION M27533 NID g184680 KEYWORDS C-region; V-region. SOURCE Human lymphoid B cell line Raji, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1491) AUTHORS Freeman,G.J., Freedman,A.S., Segil,J.M., Lee,G., Whitman,J.F. and Nadler,L.M. TITLE B7, a new member of the Ig superfamily with unique expression on activated and neoplastic B cells JOURNAL J. Immunol. 143, 2714-2722 (1989) MEDLINE 90010147 COMMENT Draft entry and computer readable copy of sequence [1] kindly provided by G.J. Freeman, 08-SEP-1989. FEATURES Location/Qualifiers source 1..1491 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 318..395 /note="transmembrane protein B1 signal peptide" CDS 318..1184 /note="transmembrane protein B1 precursor" /codon_start=1 /db_xref="PID:g306916" /translation="MGHTRRQGTSPSKCPYLNFFQLLVLAGLSHFCSGVIHVTKEVKE VATLSCGHNVSVEELAQTRIYWQKEKKMVLTMMSGDMNIWPEYKNRTIFDITNNLSIV ILALRPSDEGTYECVVLKYEKDAFKREHLAEVTLSVKADFPTPSISDFEIPTSNIRRI ICSTSGGFPEPHLSWLENGEELNAINTTVSQDPETELYAVSSKLDFNMTTNHSFMCLI KYGHLRVNQTFNWNTTKQEHFPDNLLPSWAITLISVNGIFVICCLTYCFAPRCRERRR NERLRRESVRPV" mat_peptide 396..1181 /note="transmembrane protein B1" BASE COUNT 419 a 343 c 311 g 418 t ORIGIN 1 ccaaagaaaa agtgatttgt cattgcttta tagactgtaa gaagagaaca tctcagaagt 61 ggagtcttac cctgaaatca aaggatttaa agaaaaagtg gaatttttct tcagcaagct 121 gtgaaactaa atccacaacc tttggagacc caggaacacc ctccaatctc tgtgtgtttt 181 gtaaacatca ctggagggtc ttctacgtga gcaattggat tgtcatcagc cctgcctgtt 241 ttgcacctgg gaagtgccct ggtcttactt gggtccaaat tgttggcttt cacttttgac 301 cctaagcatc tgaagccatg ggccacacac ggaggcaggg aacatcacca tccaagtgtc 361 catacctcaa tttctttcag ctcttggtgc tggctggtct ttctcacttc tgttcaggtg 421 ttatccacgt gaccaaggaa gtgaaagaag tggcaacgct gtcctgtggt cacaatgttt 481 ctgttgaaga gctggcacaa actcgcatct actggcaaaa ggagaagaaa atggtgctga 541 ctatgatgtc tggggacatg aatatatggc ccgagtacaa gaaccggacc atctttgata 601 tcactaataa cctctccatt gtgatcctgg ctctgcgccc atctgacgag ggcacatacg 661 agtgtgttgt tctgaagtat gaaaaagacg ctttcaagcg ggaacacctg gctgaagtga 721 cgttatcagt caaagctgac ttccctacac ctagtatatc tgactttgaa attccaactt 781 ctaatattag aaggataatt tgctcaacct ctggaggttt tccagagcct cacctctcct 841 ggttggaaaa tggagaagaa ttaaatgcca tcaacacaac agtttcccaa gatcctgaaa 901 ctgagctcta tgctgttagc agcaaactgg atttcaatat gacaaccaac cacagcttca 961 tgtgtctcat caagtatgga catttaagag tgaatcagac cttcaactgg aatacaacca 1021 agcaagagca ttttcctgat aacctgctcc catcctgggc cattacctta atctcagtaa 1081 atggaatttt tgtgatatgc tgcctgacct actgctttgc cccaagatgc agagagagaa 1141 ggaggaatga gagattgaga agggaaagtg tacgccctgt ataacagtgt ccgcagaagc 1201 aaggggctga aaagatctga aggtagcctc cgtcatctct tctgggatac atggatcgtg 1261 gggatcatga ggcattcttc ccttaacaaa tttaagctgt tttacccact acctcacctt 1321 cttaaaaacc tctttcagat taagctgaac agttacaaga tggctggcat ccctctcctt 1381 tctccccata tgcaatttgc ttaatgtaac ctcttctttt gccatgtttc cattctgcca 1441 tcttgaattg tcttgtcagc caattcatta tctattaaac actaatttga g // LOCUS HUMIGFACID 2125 bp mRNA PRI 07-OCT-1992 DEFINITION Human IGF binding protein complex acid-labile subunit a mRNA, complete cds. ACCESSION M86826 NID g184807 KEYWORDS acid-labile subunit; insulin-like growth factor binding protein. SOURCE Homo sapiens adult liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2125) AUTHORS Leong,S.R., Baxter,R.C., Camerato,T., Dai,J. and Wood,W.I. TITLE Structure and functional expression of the acid-labile subunit of the insulin-like growth factor binding protein complex JOURNAL Mol. Endocrinol. 6, 870-876 (1992) MEDLINE 92357025 FEATURES Location/Qualifiers source 1..2125 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="liver" CDS 57..1874 /note="acid-labile subunits a" /codon_start=1 /product="insulin-like growth factor binding protein complex" /db_xref="PID:g184808" /translation="MALRKGGLALALLLLSWVALGPRSLEGADPGTPGEAEGPACPAA CVCSYDDDADELSVFCSSRNLTRLPDGVPGGTQALWLDGNNLSSVPPAAFQNLSSLGF LNLQGGQLGSLEPQALLGLENLCHLHLERNQLRSLALGTFAHTPALASLGLSNNRLSR LEDGLFEGLGSLWDLNLGWNSLAVLPDAAFRGLGSLRELVLAGNRLAYLQPALFSGLA ELRELDLSRNALRAIKANVFVQLPRLQKLYLDRNLIAAVAPGAFLGLKALRWLDLSHN RVAGLLEDTFPGLLGLRVLRLSHNAIASLRPRTFKDLHFLEELQLGHNRIRQLAERSF EGLGQLEVLTLDHNQLQEVKAGAFLGLTNVAVMNLSGNCLRNLPEQVFRGLGKLHSLH LEGSCLGRIRPHTFTGLSGLRRLFLKDNGLVGIEEQSLWGLAELLELDLTSNQLTHLP HRLFQGLGKLEYLLLSRNRLAELPADALGPLQRAFWLDVSHNRLEALPNSLLAPLGRL RYLSLRNNSLRTFTPQPPGLERLWLEGNPWDCGCPLKALRDFALQNPSAVPRFVQAIC EGDDCQPPAYTYNNITCASPPEVVGLDLRDLSEAHFAPC" mat_peptide 138..1871 /note="acid-labile subunits a" /evidence=experimental /product="insulin-like growth factor binding protein complex" BASE COUNT 389 a 752 c 657 g 327 t ORIGIN 1 ggcacagcag acgtaccctc cctcgctgcc tgcctgcggc ctgccctgca tgcaggatgg 61 ccctgaggaa aggaggcctg gccctggcgc tgctgctgct gtcctgggtg gcactgggcc 121 cccgcagcct ggagggagca gaccccggaa cgccggggga agccgagggc ccagcgtgcc 181 cggccgcctg tgtctgcagc tacgatgacg acgcggatga gctcagcgtc ttctgcagct 241 ccaggaacct cacgcgcctg cctgacggag tcccgggcgg cacccaagcc ctgtggctgg 301 acggcaacaa cctctcgtcc gtccccccgg cagccttcca gaacctctcc agcctgggct 361 tcctcaacct gcagggcggc cagctgggca gcctggagcc acaggcgctg ctgggcctag 421 agaacctgtg ccacctgcac ctggagcgga accagctgcg cagcctggca ctcggcacgt 481 ttgcacacac gcccgcgctg gcctcgctcg gcctcagcaa caaccgtctg agcaggctgg 541 aggacgggct cttcgagggc ctcggcagcc tctgggacct caacctcggc tggaatagcc 601 tggcggtgct ccccgatgcg gcgttccgcg gcctgggcag cctgcgcgag ctggtgctgg 661 cgggcaacag gctggcctac ctgcagcccg cgctcttcag cggcctggcc gagctccggg 721 agctggacct gagcaggaac gcgctgcggg ccatcaaggc aaacgtgttc gtgcagctgc 781 cccggctcca gaaactctac ctggaccgca acctcatcgc tgccgtggcc ccgggcgcct 841 tcctgggcct gaaggcgctg cgatggctgg acctgtccca caaccgcgtg gctggcctcc 901 tggaggacac gttccccggt ctgctgggcc tgcgtgtgct gcggctgtcc cacaacgcca 961 tcgccagcct gcggccccgc accttcaagg acctgcactt cctggaggag ctgcagctgg 1021 gccacaaccg catccggcag ctggctgagc gcagctttga gggcctgggg cagcttgagg 1081 tgctcacgct agaccacaac cagctccagg aggtcaaggc gggcgctttc ctcggcctca 1141 ccaacgtggc ggtcatgaac ctctctggga actgtctccg gaaccttccg gagcaggtgt 1201 tccggggcct gggcaagctg cacagcctgc acctggaggg cagctgcctg ggacgcatcc 1261 gcccgcacac cttcaccggc ctctcggggc tccgccgact cttcctcaag gacaacggcc 1321 tcgtgggcat tgaggagcag agcctgtggg ggctggcgga gctgctggag ctcgacctga 1381 cctccaacca gctcacgcac ctgccccacc gcctcttcca gggcctgggc aagctggagt 1441 acctgctgct ctcccgcaac cgcctggcag agctgccggc ggacgccctg ggccccctgc 1501 agcgggcctt ctggctggac gtctcgcaca accgcctgga ggcattgccc aacagcctct 1561 tggcaccact ggggcggctg cgctacctca gcctcaggaa caactcactg cggaccttca 1621 cgccgcagcc cccgggcctg gagcgcctgt ggctggaggg taacccctgg gactgtggct 1681 gccctctcaa ggcgctgcgg gacttcgccc tgcagaaccc cagtgctgtg ccccgcttcg 1741 tccaggccat ctgtgagggg gacgattgcc agccgcccgc gtacacctac aacaacatca 1801 cctgtgccag cccgcccgag gtcgtggggc tcgacctgcg ggacctcagc gaggcccact 1861 ttgctccctg ctgaccaggt ccccggactc aagccccgga ctcaggcccc cacctggctc 1921 accttgtgct ggggacaggt cctcagtgtc ctcaggggcc tgcccagtgc acttgctgga 1981 agacgcaagg gcctgatggg gtggaaggca tggcggcccc cccagctgtc atcaattaaa 2041 ggcaaaggca atcgaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2101 aaaaaaaaaa aaaaaaaaaa aaaaa // LOCUS HUMIGFBP5A 1023 bp mRNA PRI 23-NOV-1994 DEFINITION Homo sapiens insulin-like growth factor binding protein 5 (IGFBP-5) mRNA, complete cds. ACCESSION M62782 NID g184817 KEYWORDS insulin-like growth factor binding protein 5. SOURCE Homo sapiens (tissue library: lambda gt11) adult placenta cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1023) AUTHORS Shimasaki,S., Shimonaka,M., Zhang,H.P. and Ling,N. TITLE Identification of five different insulin-like growth factor binding proteins (IGFBPs) from adult rat serum and molecular cloning of a novel IGFBP-5 in rat and human JOURNAL J. Biol. Chem. 266 (16), 10646-10653 (1991) MEDLINE 91244847 FEATURES Location/Qualifiers source 1..1023 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="placenta" /tissue_lib="lambda gt11" /map="2q33-34" sig_peptide 57..116 /gene="IGFBP5" /note="G00-126-837" CDS 57..875 /gene="IGFBP5" /codon_start=1 /db_xref="GDB:G00-126-837" /product="insulin-like growth factor binding protein 5" /db_xref="PID:g184818" /translation="MVLLTAVLLLLAAYAGPAQSLGSFVHCEPCDEKALSMCPPSPLG CELVKEPGCGCCMTCALAEGQSCGVYTERCAQGLRCLPRQDEEKPLHALLHGRGVCLN EKSYREQVKIERDSREHEEPTTSEMAEETYSPKIFRPKHTRISELKAEAVKKDRRKKL TQSKFVGGAENTAHPRIISAPEMRQESEQGPCRRHMEASLQELKASPRMVPRAVYLPN CDRKGFYKRKQCKPSRGRKRGICWCVDKYGMKLPGMEYVDGDFQCHTFDSSNVE" gene 57..875 /gene="IGFBP5" mat_peptide 117..872 /gene="IGFBP5" /note="G00-126-837" /product="insulin-like growth factor binding protein 5" BASE COUNT 224 a 352 c 284 g 163 t ORIGIN 1 ccctgcactc tcgctctcct gccccacccc gaggtaaagg gggcgactaa gagaagatgg 61 tgttgctcac cgcggtcctc ctgctgctgg ccgcctatgc ggggccggcc cagagcctgg 121 gctccttcgt gcactgcgag ccctgcgacg agaaagccct ctccatgtgc ccccccagcc 181 ccctgggctg cgagctggtc aaggagccgg gctgcggctg ctgcatgacc tgcgccctgg 241 ccgaggggca gtcgtgcggc gtctacaccg agcgctgcgc ccaggggctg cgctgcctcc 301 cccggcagga cgaggagaag ccgctgcacg ccctgctgca cggccgcggg gtttgcctca 361 acgaaaagag ctaccgcgag caagtcaaga tcgagagaga ctcccgtgag cacgaggagc 421 ccaccacctc tgagatggcc gaggagacct actcccccaa gatcttccgg cccaaacaca 481 cccgcatctc cgagctgaag gctgaagcag tgaagaagga ccgcagaaag aagctgaccc 541 agtccaagtt tgtcggggga gccgagaaca ctgcccaccc ccggatcatc tctgcacctg 601 agatgagaca ggagtctgag cagggcccct gccgcagaca catggaggct tccctgcagg 661 agctcaaagc cagcccacgc atggtgcccc gtgctgtgta cctgcccaat tgtgaccgca 721 aaggattcta caagagaaag cagtgcaaac cttcccgtgg ccgcaagcgt ggcatctgct 781 ggtgcgtgga caagtacggg atgaagctgc caggcatgga gtacgttgac ggggactttc 841 agtgccacac cttcgacagc agcaacgttg agtgatgcgt ccccccccaa cctttccctc 901 accccctccc acccccagcc ccgactccag ccagcgcctc cctccacccc aggacgccac 961 tcatttcatc tcatttaagg gaaaaatata tatctatcta tttgaaaaaa aaaaaaaaaa 1021 ccc // LOCUS HUMIGHBO 141 bp mRNA PRI 09-NOV-1994 DEFINITION Human unproductively rearranged Ig mu-chain mRNA V-region (VD), 5' end, clone mu-3A1A. ACCESSION M21388 NID g185160 KEYWORDS C-region; D-region; J-region; V-region; immunoglobulin heavy chain; immunoglobulin mu-chain. SOURCE Human B-lymphocyte, cDNA to mRNA, clone mu-3A1A. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 141) AUTHORS Schwaber,J. and Chen,R.H. TITLE Premature termination of variable gene rearrangement in B lymphocytes from X-linked agammaglobulinemia JOURNAL J. Clin. Invest. 81 (6), 2004-2009 (1988) MEDLINE 88257474 FEATURES Location/Qualifiers source 1..141 /organism="Homo sapiens" /db_xref="taxon:9606" /map="14q32.33" gene 1..90 /gene="IGHM" CDS 1..90 /gene="IGHM" /note="Ig mu-chain V-region (V-D) precursor" /codon_start=1 /db_xref="GDB:G00-120-086" /db_xref="PID:g306932" /translation="MQPDIPFPQARGRFEVRSVSLWYYDYVWG" sig_peptide 1..66 /gene="IGHM" /note="Ig mu-chain V-region signal peptide" misc_recomb 66..67 /gene="IGHM" /organism="Homo sapiens" mat_peptide 67..90 /gene="IGHM" /note="Ig mu-chain V-region" BASE COUNT 29 a 33 c 43 g 36 t ORIGIN Chromosome 14q32.3. 1 atgcagcctg acatcccgtt tccccaggcc agaggtaggt ttgaagtgag gtctgtgtca 61 ctgtggtatt atgattacgt ttgggggtaa gactacggta tggacgtctg gggccaagga 121 accacggtca ccgtctcctc a // LOCUS HUMIGMBC 2540 bp mRNA PRI 15-FEB-1995 DEFINITION Homo sapiens M2 mitochondrial autoantigen dihydrolipoamide acetyltransferase mRNA, complete cds. ACCESSION J03866 NID g619443 KEYWORDS M2 mitochondrial autoantigen; autoantigen. SOURCE Homo sapiens (individual_isolate patient with primary biliary cirrhosis) placenta cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2540) AUTHORS Coppel,R.L., McNeilage,L.J., Surh,C.D., Van de Water,J., Spithill,T.W., Whittingham,S. and Gershwin,M.E. TITLE Primary structure of the human M2 mitochondrial autoantigen of primary biliary cirrhosis: dihydrolipoamide acetyltransferase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85 (19), 7317-7321 (1988) MEDLINE 89017189 FEATURES Location/Qualifiers source 1..2540 /organism="Homo sapiens" /isolate="patient with primary biliary cirrhosis" /db_xref="taxon:9606" /tissue_type="placenta" CDS 636..2477 /EC_number="2.3.1.12" /codon_start=1 /product="dihydrolipoamide acetyltransferase" /db_xref="PID:g619444" /translation="MSPHCSTTYLRTLGRTTMFWKTTEGRDGKMAVQEFSEFGLLLQL LGSPGRRYYSLPPHQKVPLPSLSPTMQAGTIARWEKKEGDKINEGDLIAEVETDKATV GFESLEECYMAKILVAEGTRDVPIGAIICITVGKPEDIEAFKNYTLDSSAAPTPQAAP APTPAATASPPTPSAQAPGSSYPPHMQVLLPALSPTMTMGTVQRWEKKVGEKLSEGDL LAEIETDKATIGFEVQEEGYLAKILVPEGTRDVPLGTPLCIIVEKEADISAFADYRPT EVTDLKPQVPPPTPPPVAAVPPTPQPLAPTPSTPCPATPAGPKGRVFVDPLAKKLAVE KGIDLTQVKGTGPDGRITKKDIDSFVPSKVAPAPAAVVPPTGPGMAPVPTGVFTDIPI SNIRRVIAQRLMQSKQTIPHYYLLSCKYGEVLLVRKELNKILEGRSKISVNDFIIKAS ALACLKVPEANSSWMDTVIRQNHVVDVSVAVSTPAGLITPIVFNAHIKGVETIANDVV SLATKAREGKLQPHEFQGGTFTISNLGMFGIKNFSAIINPPQACILAIGASEDKLVPA DNEKGFDVASMMSVTLSCDHRVVDGAVGAQWLAEFRKYLEKPITMLL" BASE COUNT 666 a 644 c 603 g 627 t ORIGIN 1 gaattccgcg ccagggccga ccgttcttcc tgtactacgc ttcccaccac actcactacc 61 ctcagttcag tggacaaagc ttcaccaagc gctcaggccg tgggccattt ggggactcct 121 tgatggagct ggatggagct gtaggggcct tgatgacaac tgtgggggac ctcggtctgc 181 tggaagagac actagtcatc ttcactgcag ataacggtcc tgagttgatg cgcatgtcca 241 atggcggctg ctctggcctc ttgagatgtg gaaaaggaac aacttttgaa ggtggcgtcc 301 gagagcctgc cttggtctac tggccaggtc acattactcc tggtgtaacc catgagctgg 361 ccagctctct ggacctgctg cccaccctgg cagccctgac cggggtccgc tgcccaacgt 421 caccttggat ggtgttgaca tcagcccctt gctgctaggc acaggcaaga gcccacggaa 481 gtctgtcttc ttctacccgc cctacccaga cgagatccat ggggtctttg ctgttcggaa 541 tgggaaatac aaggctcatt tcttcaccca gggctccgcc cacagtgaca ccacttcaga 601 tcctgcctgt catgctgcca accgtctgac ggctcatgag cccccactgc tctacgactt 661 atctcaggac cctggggaga actacaatgt tttggaaaac cacagaggga agagatggga 721 aaatggctgt ccaggagttt tcggaattcg ggttactgct gcagcttttg gggtcgcccg 781 gccgccgcta ttacagtctt cccccgcatc agaaggttcc attgccttct ctttccccca 841 caatgcaggc aggcaccata gcccgttggg aaaaaaaaga gggggacaaa atcaatgaag 901 gtgacctaat tgcagaggtt gaaactgata aagccactgt tggatttgag agcctggagg 961 agtgttatat ggcaaagata cttgttgctg aaggtaccag ggatgttccc atcggagcga 1021 tcatctgtat cacagttggc aagcctgagg atattgaggc ctttaaaaat tatacactgg 1081 attcctcagc agcacctacc ccacaagcgg ccccagcacc aacccctgct gccactgctt 1141 cgccacctac accttctgct caggctcctg gtagctcata tccccctcac atgcaggtac 1201 ttcttcctgc cctctctccc accatgacca tgggcacagt tcagagatgg gaaaaaaaag 1261 tgggtgagaa gctaagtgaa ggagacttac tggcagagat agaaactgac aaagccacta 1321 taggttttga agtacaggaa gaaggttatc tggcaaaaat cctggtccct gaaggcacaa 1381 gagatgtccc tctaggaacc ccactctgta tcattgtaga aaaagaggca gatatatcag 1441 catttgctga ctataggcca accgaagtaa cagatttaaa accacaagtg ccaccaccta 1501 ccccaccccc ggtggccgct gttcctccaa ctccccagcc tttagctcct acaccttcga 1561 caccctgccc agctactcct gctggaccaa agggaagggt gtttgttgac cctcttgcaa 1621 agaagttggc agtagagaaa gggattgatc ttacacaagt aaaagggaca ggaccagatg 1681 gtagaatcac caagaaggat atcgactctt ttgtgcctag taaagttgct cctgctccgg 1741 cagctgttgt gcctcccaca ggtcctggaa tggcaccagt tcctacaggt gtcttcacag 1801 atatcccaat cagcaacatt cgtcgggtta ttgcacagcg attaatgcaa tcaaagcaaa 1861 ccatacctca ttattacctt ctatcgtgta aatatggaga agttttgttg gtacggaaag 1921 aacttaataa gatattagaa gggagaagca aaatttctgt caatgacttc atcataaaag 1981 cttcagcttt ggcatgttta aaagttcccg aagcaaattc ttcttggatg gacacagtta 2041 taagacaaaa tcatgttgtt gatgtcagtg ttgcggtcag tactcctgca ggactcatca 2101 cacctattgt gtttaatgca catataaaag gagtggaaac cattgctaat gatgttgttt 2161 ctttagcaac caaagcaaga gagggtaaac tacagccaca tgaattccag ggtggcactt 2221 ttacgatctc caatttagga atgtttggaa ttaagaattt ctctgctatt attaacccac 2281 ctcaagcatg tattttggca attggtgctt cagaggataa actggtccct gcagataatg 2341 aaaaagggtt tgatgtggct agcatgatgt ctgttacact cagttgtgat caccgggtgg 2401 tggatggagc agttggagcc cagtggcttg ctgagtttag aaagtacctt gaaaaaccta 2461 tcactatgtt gttgtaacta actcaagaat ttctaaactc tcccaggtca cactgattca 2521 ttcttaacaa gcccgaattc // LOCUS HUMIIIA 1381 bp mRNA PRI 25-OCT-1996 DEFINITION Human GTF3A mRNA for Xenopus transcription factor IIIA homologue, complete cds. ACCESSION D32257 NID g1000446 KEYWORDS GTF3A; Xenopus transcription factor IIIA homologue. SOURCE Homo sapiens cDNA to mRNA, clone_lib:librarry of T.Fujiwara, S.Shin and Y.Nakamura clone:39H11. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1381) AUTHORS Arakawa,H., Nagase,H., Hayashi,N., Ogawa,M., Nagata,M., Fujiwara,T., Takahashi,E., Shin,S. and Nakamura,Y. TITLE Molecular cloning, characterization, and chromosomal mapping of a novel human gene (GTF3A) that is highly homologous to Xenopus transcription factor IIIA JOURNAL Cytogenet. Cell Genet. 70 (3-4), 235-238 (1995) MEDLINE 95309028 REFERENCE 2 (bases 1 to 1381) AUTHORS Nakamura,Y. TITLE Direct Submission JOURNAL Submitted (22-JUL-1994) to the DDBJ/EMBL/GenBank databases. Yusuke Nakamura, Cancer Institute, Department of Biochemistry; 1-37-1 Kami-Ikebukuro, Toshima-ku, Tokyo 170, Japan (E-mail:nakamura@ganvx1.jfcr.or.jp, Tel:03-3918-0111(ex.4501), Fax:03-3918-0342) FEATURES Location/Qualifiers source 1..1381 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="39H11" /clone_lib="librarry of T.Fujiwara, S.Shin and Y.Nakamura" gene 20..1291 /gene="GTF3A" CDS 20..1291 /gene="GTF3A" /codon_start=1 /product="Xenopus transcription factor IIIA homologue" /db_xref="PID:d1007565" /db_xref="PID:g1616942" /translation="MRSSGADAGRCLVTARAPGSVPASREGSAGSRGPGARFPARVSA RGSAPGPGLGGAGALDPPAVVAESVSSLTIADAFIAAGESSAPTPPRPALPRRFICSF PDCSANYSKAWKLDAHLCKHTGERPFVCDYEGCGKAFIRDYHLSRHILTHTGEKPFVC AANGCDQKFNTKSNLKKHFERKHENQQKQYICSFEDCKKTFKKHQQLKIHQCQNTNEP LFKCTQEGCGKHFASPSKLKRHAKAHEGYVCQKGCSFVAKTWTELLKHVRETHKEEIL CEVCRKTFKRKDYLKQHMKTHAPERDVCRCPREGCGRTYTTVFNLQSHILSFHEESRP FVCEHAGCGKTFAMKQSLTRHAVVHDPDKKKMKLKVKKSREKREFGLSSQWIYPPKRK QGQGLSLCQNGESPNCVEDKMLSTVAVLTLG" BASE COUNT 386 a 350 c 354 g 291 t ORIGIN 1 atgcgcgatc tcccggagca tgcgcagcag cggcgccgac gcggggcggt gcctggtgac 61 cgcgcgcgct cccggaagtg tgccggcgtc gcgcgaaggt tcagcaggga gccgtgggcc 121 gggcgcgcgg ttcccggcac gtgtctcggc acgtggcagc gcgcctggcc ctgggcttgg 181 aggcgccggc gccctggatc cgccggccgt ggtcgccgag tcggtgtcgt ccttgaccat 241 cgccgacgcg ttcattgcag ccggcgagag ctcagctccg accccgccgc gccccgcgct 301 tcccaggagg ttcatctgct ccttccctga ctgcagcgcc aattacagca aagcctggaa 361 gcttgacgcg cacctgtgca agcacacggg ggagagacca tttgtttgtg actatgaagg 421 gtgtggcaag gccttcatca gggactacca tctgagccgc cacattctga ctcacacagg 481 agaaaagccg tttgtttgtg cagccaatgg ctgtgatcaa aaattcaaca caaaatcaaa 541 cttgaagaaa cattttgaac gcaaacatga aaatcaacaa aaacaatata tatgcagttt 601 tgaagactgt aagaagacct ttaagaaaca tcagcagctg aaaatccatc agtgccagaa 661 taccaatgaa cctctattca agtgtaccca ggaaggatgt gggaaacact ttgcatcacc 721 cagcaagctg aaacgacatg ccaaggccca cgagggctat gtatgtcaaa aaggatgttc 781 ctttgtggca aaaacatgga cggaacttct gaaacatgtg agagaaaccc ataaagagga 841 aatactatgt gaagtatgcc ggaaaacatt taaacgcaaa gattacctta agcaacacat 901 gaaaactcat gccccagaaa gggatgtatg tcgctgtcca agagaaggct gtggaagaac 961 ctatacaact gtgtttaatc tccaaagcca tatcctctcc ttccatgagg aaagccgccc 1021 ttttgtgtgt gaacatgctg gctgtggcaa aacatttgca atgaaacaaa gtctcactag 1081 gcatgctgtt gtacatgatc ctgacaagaa gaaaatgaag ctcaaagtca aaaaatctcg 1141 tgaaaaacgg gagtttggcc tctcatctca gtggatatat cctcccaaaa ggaaacaagg 1201 gcaaggctta tctttgtgtc aaaacggaga gtcacccaac tgtgtggaag acaagatgct 1261 ctcgacagtt gcagtactta cccttggcta agaactgcac tgctttgttt aaaggactgc 1321 agaccaagga gtcgagcttt ctctcagagc atgcttttct ttattaaaat tactgatgca 1381 g // LOCUS HUMIIP 1032 bp mRNA PRI 11-JUN-1993 DEFINITION Human gamma-interferon-inducible protein (IP-30) mRNA, complete cds. ACCESSION J03909 NID g186264 KEYWORDS gamma-interferon-inducible protein. SOURCE Human monocytic cell line U937, cDNA to mRNA, clones p[9.0,9.2,9.3,9.7]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1032) AUTHORS Luster,A.D., Weinshank,R.L., Feinman,R. and Ravetch,J.V. TITLE Molecular and biochemical characterization of a novel gamma- interferon-inducible protein JOURNAL J. Biol. Chem. 263, 12036-12043 (1988) MEDLINE 88298888 FEATURES Location/Qualifiers source 1..1032 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 41..151 /note="gamma-interferon-inducible protein signal peptide" CDS 41..952 /note="gamma-interferon-inducible protein precursor" /codon_start=1 /db_xref="PID:g307042" /translation="MDSRHTFAPAAMTLSPLLLFLPPLLLLLDVPTAAVQASPLQALD FFGNGPPVNYKTGNLYLRGPLKKSNAPLVNVTLYYEALCGGCRAFLIRELFPTWLLVM EILNVTSVPYGNAQEQNVSGRWEFKCQLGEEECKFNKVEACVLDELDMELAFLTMSGM AWKSLRTWREVCHYACSSTPQGCRQNYHGVCNGGPRHAAHARQRPADRCSPATARVCA LGHRQWETLGRSDPAPYPCLPVVPGQEAGCLPFLNQLPPECLLRVLAGGLRRAHGRRV GTRLPAFFSDPDPRHLLLTNWKILCIP" mat_peptide 152..949 /note="gamma-interferon-inducible protein" BASE COUNT 227 a 295 c 281 g 229 t ORIGIN 1 ggagggtggg cagcactcgc tttattgtcc agcattccac atggatagtc gccacacctt 61 tgcccctgct gcgatgaccc tgtcgccact tctgctgttc ctgccaccgc tgctgctgct 121 gctggacgtc cccacggcgg cggtgcaggc gtcccctctg caagcgttag acttctttgg 181 gaatgggcca ccagttaact acaagacagg caatctatac ctgcgggggc ccctgaagaa 241 gtccaatgca ccgcttgtca atgtgaccct ctactatgaa gcactgtgcg gtggctgccg 301 agccttcctg atccgggagc tcttcccaac atggctgttg gtcatggaga tcctcaatgt 361 cacgtcggtg ccctacggaa acgcacagga acaaaatgtc agtggcaggt gggagttcaa 421 gtgccagctt ggagaagagg agtgcaaatt caacaaggtg gaggcctgcg tgttggatga 481 acttgacatg gagctagcct tcctgaccat gtctggcatg gcatggaaga gtttgaggac 541 atggagagaa gtctgccact atgcctgcag ctctacgccc cagggctgtc gccagaacta 601 tcatggagtg tgcaatgggg gaccgcggca tgcagctcat gcacgccaac gcccagcgga 661 cagatgctct ccagccaccg cacgagtatg tgccctgggt caccgtcaat gggaaaccct 721 tggaagatca gacccagctc cttacccttg tctgccagtt gtaccagggc aagaagccgg 781 atgtctgccc ttcctcaacc agctccctcc ggagtgtttg cttcgagtgt tggccggtgg 841 gctgcggaga gctcatggaa ggcgagtggg aactcggctg cctgcctttt tttctgatcc 901 agaccctcgg cacctgctac ttaccaactg gaaaatttta tgcatcccat gaagcccaga 961 tacacaaaat tccaccccta gatcaagaat cctgctccac taagaatggt gctaaagtaa 1021 aactagttta at // LOCUS HUMIL2AB 442 bp DNA PRI 06-JAN-1995 DEFINITION Human interleukin 2 gene, clone pATtacIL-2C/2TT, complete cds, clone pATtacIL-2C/2TT. ACCESSION M22005 NID g186300 KEYWORDS interleukin 2. SOURCE Human T-lymphocyte DNA, clone pATtacIL-2C/2TT. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 442) AUTHORS Weir,M.P., Chaplin,M.A., Wallace,D.M., Dykes,C.W. and Hobden,A.N. TITLE Structure-activity relationships of recombinant human interleukin 2 JOURNAL Biochemistry 27 (18), 6883-6892 (1988) MEDLINE 89062420 FEATURES Location/Qualifiers source 1..442 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T-lymphocyte" /map="4q26-q27" gene 22..426 /gene="IL2" CDS 22..426 /gene="IL2" /note="precursor" /codon_start=1 /db_xref="GDB:G00-119-344" /product="interleukin 2" /db_xref="PID:g386818" /translation="MAPTSSSTKKTQLQLEHLLLDLQMILNGINNYKNPKLTRMLTFK FYMPKKATELKHLQCLEEELKPLEEVLNLAQSKNFHLRPRDLISNINVIVLELKGSET TFMCEYADETATIVEFLNRWITFCQSIISTLT" mat_peptide 25..423 /gene="IL2" /note="G00-119-344" /product="interleukin 2" BASE COUNT 124 a 132 c 93 g 93 t ORIGIN Chromosome 4q26-q27. 1 gatcctagga ggtttggtac catggctccg acgagcagct ccaccaagaa aacccagctc 61 cagctcgaac acctgctgct ggacctgcag atgatcctga acggtatcaa caactacaag 121 aacccgaaac tgactcgtat gctgaccttc aagttctaca tgccgaagaa agctaccgaa 181 ctgaaacacc tgcaatgcct cgaggaggag ctcaaaccgc tggaagaggt tctgaacctg 241 gctcagtcca agaacttcca cctgcgtccg cgcgacctga tctccaacat caacgttatc 301 gttctggaac tgaaaggcag tgagactacc ttcatgtgcg aatacgctga cgaaaccgct 361 actatcgttg aattcctgaa ccgttggatc accttctgtc agtccatcat ctccaccctg 421 acctaataac taactaagtc ga // LOCUS HUMIL2RBC 4034 bp mRNA PRI 06-JAN-1995 DEFINITION Human interleukin 2 receptor beta chain (p70-75) mRNA, complete cds. ACCESSION M26062 NID g186322 KEYWORDS interleukin; interleukin 2 receptor beta-chain. SOURCE Human lymphoid leukemia line (YT) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4034) AUTHORS Hatakeyama,M., Tsudo,M., Minamoto,S., Kono,T., Doi,T., Miyata,T., Miyasaka,M. and Taniguchi,T. TITLE Interleukin-2 receptor beta chain gene: generation of three receptor forms by cloned human alpha and beta chain cDNA's JOURNAL Science 244 (4904), 551-556 (1989) MEDLINE 89242117 COMMENT Draft entry and computer readable copy of sequence [1] kindly provided by M.Hatakeyama 12-JUL-1989. FEATURES Location/Qualifiers source 1..4034 /organism="Homo sapiens" /db_xref="taxon:9606" /map="22q13" sig_peptide 132..209 /gene="IL2RB" /note="interleukin 2 receptor beta chain signal peptide; G00-118-822" gene 132..1787 /gene="IL2RB" CDS 132..1787 /gene="IL2RB" /note="interleukin 2 receptor beta chain precursor peptide" /codon_start=1 /db_xref="GDB:G00-118-822" /db_xref="PID:g307048" /translation="MAAPALSWRLPLLILLLPLATSWASAAVNGTSQFTCFYNSRANI SCVWSQDGALQDTSCQVHAWPDRRRWNQTCELLPVSQASWACNLILGAPDSQKLTTVD IVTLRVLCREGVRWRVMAIQDFKPFENLRLMAPISLQVVHVETHRCNISWEISQASHY FERHLEFEARTLSPGHTWEEAPLLTLKQKQEWICLETLTPDTQYEFQVRVKPLQGEFT TWSPWSQPLAFRTKPAALGKDTIPWLGHLLVGLSGAFGFIILVYLLINCRNTGPWLKK VLKCNTPDPSKFFSQLSSEHGGDVQKWLSSPFPSSSFSPGGLAPEISPLEVLERDKVT QLLLQQDKVPEPASLSSNHSLTSCFTNQGYFFFHLPDALEIEACQVYFTYDPYSEEDP DEGVAGAPTGSSPQPLQPLSGEDDAYCTFPSRDDLLLFSPSLLGGPSPPSTAPGGSGA GEERMPPSLQERVPRDWDPQPLGPPTPGVPDLVDFQPPPELVLREAGEEVPDAGPREG VSFPWSRPPGQGEFRALNARLPLNTDAYLSLQELQGQDPTHLV" BASE COUNT 814 a 1315 c 1017 g 888 t ORIGIN 1 gcagccagag ctcagcaggg ccctggagag atggccacgg tcccagcacc ggggaggact 61 ggagagcgcg cgctgccacc gccccatgtc tcagccaggg cttccttcct cggctccacc 121 ctgtggatgt aatggcggcc cctgctctgt cctggcgtct gcccctcctc atcctcctcc 181 tgcccctggc tacctcttgg gcatctgcag cggtgaatgg cacttcccag ttcacatgct 241 tctacaactc gagagccaac atctcctgtg tctggagcca agatggggct ctgcaggaca 301 cttcctgcca agtccatgcc tggccggaca gacggcggtg gaaccaaacc tgtgagctgc 361 tccccgtgag tcaagcatcc tgggcctgca acctgatcct cggagcccca gattctcaga 421 aactgaccac agttgacatc gtcaccctga gggtgctgtg ccgtgagggg gtgcgatgga 481 gggtgatggc catccaggac ttcaagccct ttgagaacct tcgcctgatg gcccccatct 541 ccctccaagt tgtccacgtg gagacccaca gatgcaacat aagctgggaa atctcccaag 601 cctcccacta ctttgaaaga cacctggagt tcgaggcccg gacgctgtcc ccaggccaca 661 cctgggagga ggcccccctg ctgactctca agcagaagca ggaatggatc tgcctggaga 721 cgctcacccc agacacccag tatgagtttc aggtgcgggt caagcctctg caaggcgagt 781 tcacgacctg gagcccctgg agccagcccc tggccttcag gacaaagcct gcagcccttg 841 ggaaggacac cattccgtgg ctcggccacc tcctcgtggg cctcagcggg gcttttggct 901 tcatcatctt agtgtacttg ctgatcaact gcaggaacac cgggccatgg ctgaagaagg 961 tcctgaagtg taacacccca gacccctcga agttcttttc ccagctgagc tcagagcatg 1021 gaggagacgt ccagaagtgg ctctcttcgc ccttcccctc atcgtccttc agccctggcg 1081 gcctggcacc tgagatctcg ccactagaag tgctggagag ggacaaggtg acgcagctgc 1141 tcctgcagca ggacaaggtg cctgagcccg catccttaag cagcaaccac tcgctgacca 1201 gctgcttcac caaccagggt tacttcttct tccacctccc ggatgccttg gagatagagg 1261 cctgccaggt gtactttact tacgacccct actcagagga agaccctgat gagggtgtgg 1321 ccggggcacc cacagggtct tccccccaac ccctgcagcc tctgtcaggg gaggacgacg 1381 cctactgcac cttcccctcc agggatgacc tgctgctctt ctcccccagt ctcctcggtg 1441 gccccagccc cccaagcact gcccctgggg gcagtggggc cggtgaagag aggatgcccc 1501 cttctttgca agaaagagtc cccagagact gggaccccca gcccctgggg cctcccaccc 1561 caggagtccc agacctggtg gattttcagc caccccctga gctggtgctg cgagaggctg 1621 gggaggaggt ccctgacgct ggccccaggg agggagtcag tttcccctgg tccaggcctc 1681 ctgggcaggg ggagttcagg gcccttaatg ctcgcctgcc cctgaacact gatgcctact 1741 tgtccctcca agaactccag ggtcaggacc caactcactt ggtgtagaca gatggccagg 1801 gtgggaggca ggcagctgcc tgctctgcgc cgagcctcag aaggaccctg ttgagggtcc 1861 tcagtccact gctgaggaca ctcagtgtcc agttgcagct ggacttctcc acccggatgg 1921 cccccaccca gtcctgcaca cttggtccat ccatttccaa acctccactg ctgctcccgg 1981 gtcctgctgc ccgagccagg aactgtgtgt gttgcagggg ggcagtaact ccccaactcc 2041 ctcgttaatc acaggatccc acgaatttag gctcagaagc atcgctcctc tccagccctg 2101 cagctattca ccaatatcag tcctcgcggc tctccagggc tccctgccct gacctcttcc 2161 ctgggttttc tgccccagcc tcctccttcc ctcccctccc cgtccacagg gcagcctgag 2221 cgtgctttcc aaaacccaaa tatggccacg ctccccctcg gttcaaaacc ttgcacaggt 2281 cccactgccc tcagccccac ttctcagcct ggtacttgta cctccggtgt cgtgtgggga 2341 catccccttc tgcaatcctc cctaccgtcc tcccgagcca ctcagagctc cctcacaccc 2401 cctctgttgc acatgctatt ccctggggct gctgtgcgct ccccctcatc taggtgacaa 2461 acttccctga ctcttcaagt gccggttttg cttctcctgg agggaagcac tgcctccctt 2521 aatctgccag aaacttctag cgtcagtgct ggagggagaa gctgtcaggg acccagggcg 2581 cctggagaaa gaggccctgt tactattcct ttgggatctc tgaggcctca gagtgcttgg 2641 ctgctgtatc tttaatgctg gggcccaagt aagggcacag atccccccac aaagtggatg 2701 cctgctgcat cttcccacag tggcttcaca gacccacaag agaagctgat ggggagtaaa 2761 ccctggagtc cgaggcccag gcagcagccc cgcctagtgg tgggccctga tgctgccagg 2821 cctgggacct cccactgccc cctccactgg aggggtctcc tctgcagctc agggactggc 2881 acactggcct ccagaagggc agctccacag ggcagggcct cattattttt cactgcccca 2941 gacacagtgc ccaacacccc gtcgtatacc ctggatgaac gaattaatta cctggcacca 3001 cctcgtctgg gctccctgcg cctgacattc acacagagag gcagagtccc gtgcccatta 3061 ggtctggcat gccccctcct gcaaggggct caacccccta ccccgacccc tccacgtatc 3121 tttcctaggc agatcacgtt gcaatggctc aaacaacatt ccaccccagc aggacagtga 3181 ccccagtccc agctaactct gacctgggag ccctcaggca cctgcactta caggccttgc 3241 tcacagctga ttgggcacct gaccacacgc ccccacaggc tctgaccagc agcctatgag 3301 ggggtttggc accaagctct gtccaatcag gtaggctggg cctgaactag ccaatcagat 3361 caactctgtc ttgggcgttt gaactcaggg agggaggccc ttgggagcag gtgcttgtgg 3421 acaaggctcc acaagcgttg agccttggaa aggtagacaa gcgttgagcc actaagcaga 3481 ggaccttggg ttcccaatac aaaaatacct actgctgaga gggctgctga ccatttggtc 3541 aggattcctg ttgcctttat atccaaaata aactcccctt tcttgaggtt gtctgagtct 3601 tgggtctatg ccttgaaaaa agctgaatta ttggacagtc tcacctcctg ccatagggtc 3661 ctgaatgttt cagaccacaa ggggctccac acctttgctg tgtgttctgg ggcaacctac 3721 taatcctctc tgcaagtcgg tctccttatc cccccaaatg gaaattgtat ttgccttctc 3781 cactttggga ggctcccact tcttgggagg gttacatttt ttaagtctta atcatttgtg 3841 acatatgtat ctatacatcc gtatctttta atgatccgtg tgtaccatct ttgtgattat 3901 ttccttaata ttttttcttt aagtcagttc attttcgttg aaatacattt ataaagaaaa 3961 atctttgtta ctctgtaaat gaaaaaaccc attttcgcta taaataaaag gtaactgtac 4021 aaaataagta caat // LOCUS HUMIL3B 1460 bp mRNA PRI 06-JAN-1995 DEFINITION Human interleukin 3 receptor (hIL-3Ra) mRNA, complete cds. ACCESSION M74782 NID g186330 KEYWORDS cytokine receptor; interleukin 3 receptor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1460) AUTHORS Kitamura,T., Sato,N., Arai,K. and Miyajima,A. TITLE Expression cloning of the human IL-3 receptor cDNA reveals a shared beta subunit for the human IL-3 and GM-CSF receptors JOURNAL Cell 66 (6), 1165-1174 (1991) MEDLINE 92005668 FEATURES Location/Qualifiers source 1..1460 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="homopoietic TF-1" sig_peptide 147..200 /gene="hIL-3Ra" CDS 147..1283 /gene="hIL-3Ra" /codon_start=1 /product="interleukin 3 receptor" /db_xref="PID:g186331" /translation="MVLLWLTLLLIALPCLLQTKEDPNPPITNLRMKAKAQQLTWDLN RNVTDIECVKDADYSMPAVNNSYCQFGAISLCEVTNYTVRVANPPFSTWILFPENSGK PWAGAENLTCWIHDVDFLSCSWAVGPGAPADVQYDLYLNVANRRQQYECLHYKTDAQG TRIGCRFDDISRLSSGSQSSHILVRGRSAAFGIPCTDKFVVFSQIEILTPPNMTAKCN KTHSFMHWKMRSHFNRKFRYELQIQKRMQPVITEQVRDRTSFQLLNPGTYTVQIRARE RVYEFLSAWSTPQRFECDQEEGANTRAWRTSLLIALGTLLALVCVFVICRRYLVMQRL FPRIPHMKDPIGDSFQNDKLVVWEAGKAGLEECLVTEVQVVQKT" gene 147..1283 /gene="hIL-3Ra" mat_peptide 201..1280 /gene="hIL-3Ra" /product="interleukin 3 receptor" BASE COUNT 358 a 396 c 391 g 315 t ORIGIN 1 gcacacggga agatatcaga aacatcctag gatcaggaca ccccagatct tctcaactgg 61 aaccacgaag gctgtttctt ccacacagca ctttgatctc catttaagca ggcacctctg 121 tcctgcgttc cggagctgcg ttcccgatgg tcctcctttg gctcacgctg ctcctgatcg 181 ccctgccctg tctcctgcaa acgaaggaag atccaaaccc accaatcacg aacctaagga 241 tgaaagcaaa ggctcagcag ttgacctggg accttaacag aaatgtgacc gatatcgagt 301 gtgttaaaga tgccgactat tctatgccgg cagtgaacaa tagctattgc cagtttggag 361 caatttcctt atgtgaagtg accaactaca ccgtccgagt ggccaaccca ccattctcca 421 cgtggatcct cttccctgag aacagtggga agccttgggc aggtgcggag aatctgacct 481 gctggattca tgacgtggat ttcttgagct gcagctgggc ggtaggcccg ggggcccccg 541 cggacgtcca gtacgacctg tacttgaacg ttgccaacag gcgtcaacag tacgagtgtc 601 ttcactacaa aacggatgct cagggaacac gtatcgggtg tcgtttcgat gacatctctc 661 gactctccag cggttctcaa agttcccaca tcctggtgcg gggcaggagc gcagccttcg 721 gtatcccctg cacagataag tttgtcgtct tttcacagat tgagatatta actccaccca 781 acatgactgc aaagtgtaat aagacacatt cctttatgca ctggaaaatg agaagtcatt 841 tcaatcgcaa atttcgctat gagcttcaga tacaaaagag aatgcagcct gtaatcacag 901 aacaggtcag agacagaacc tccttccagc tactcaatcc tggaacgtac acagtacaaa 961 taagagcccg ggaaagagtg tatgaattct tgagcgcctg gagcaccccc cagcgcttcg 1021 agtgcgacca ggaggagggc gcaaacacac gtgcctggcg gacgtcgctg ctgatcgcgc 1081 tggggacgct gctggccctg gtctgtgtct tcgtgatctg cagaaggtat ctggtgatgc 1141 agagactctt tccccgcatc cctcacatga aagaccccat cggtgacagc ttccaaaacg 1201 acaagctggt ggtctgggag gcgggcaaag ccggcctgga ggagtgtctg gtgactgaag 1261 tacaggtcgt gcagaaaact tgagactggg gttcagggct tgtgggggtc tgcctcaatc 1321 tccctggccg ggccaggcgc ctgcacagac tggctgctgg acctgcgcac gcagcccagg 1381 aatggacatt cctaacgggt ggtgggcatg ggagatgcct gtgtaatttc gtccgaagct 1441 gccaggaaga agaacagaac // LOCUS HUMIL6GP 3085 bp mRNA PRI 06-JAN-1995 DEFINITION Human membrane glycoprotein gp130 mRNA, complete cds. ACCESSION M57230 NID g186353 KEYWORDS interleukin 6; interleukin 6 receptor; interleukin 6 signal transducer; membrane glycoprotein. SOURCE Human placenta and myeloma cell line (U266), cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3085) AUTHORS Hibi,M., Murakami,M., Saito,M., Hirano,T., Taga,T. and Kishimoto,T. TITLE Molecular cloning and expression of an IL-6 signal transducer, gp130 JOURNAL Cell 63 (6), 1149-1157 (1990) MEDLINE 91084844 FEATURES Location/Qualifiers source 1..3085 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /map="7p21-p14" sig_peptide 256..321 /gene="gp130" /note="putative" CDS 256..3012 /gene="IL6" /note="putative" /codon_start=1 /db_xref="GDB:G00-120-748" /product="membrane glycoprotein 130" /db_xref="PID:g186354" /translation="MLTLQTWVVQALFIFLTTESTGELLDPCGYISPESPVVQLHSNF TAVCVLKEKCMDYFHVNANYIVWKTNHFTIPKEQYTIINRTASSVTFTDIASLNIQLT CNILTFGQLEQNVYGITIISGLPPEKPKNLSCIVNEGKKMRCEWDGGRETHLETNFTL KSEWATHKFADCKAKRDTPTSCTVDYSTVYFVNIEVWVEAENALGKVTSDHINFDPVY KVKPNPPHNLSVINSEELSSILKLTWTNPSIKSVIILKYNIQYRTKDASTWSQIPPED TASTRSSFTVQDLKPFTEYVFRIRCMKEDGKGYWSDWSEEASGITYEDRPSKAPSFWY KIDPSHTQGYRTVQLVWKTLPPFEANGKILDYEVTLTRWKSHLQNYTVNATKLTVNLT NDRYLATLTVRNLVGKSDAAVLTIPACDFQATHPVMDLKAFPKDNMLWVEWTTPRESV KKYILEWCVLSDKAPCITDWQQEDGTVHRTYLRGNLAESKCYLITVTPVYADGPGSPE SIKAYLKQAPPSKGPTVRTKKVGKNEAVLEWDQLPVDVQNGFIRNYTIFYRTIIGNET AVNVDSSHTEYTLSSLTSDTLYMVRMAAYTDEGGKDGPEFTFTTPKFAQGEIEAIVVP VCLAFLLTTLLGVLFCFNKRDLIKKHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQM YSDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPS ISSSDENESSQNTSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQL VDHVDGGDGILPRQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHI SQSCGSGQMKMFQEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGG YMPQ" gene 256..3009 /gene="gp130" gene 256..3012 /gene="IL6" mat_peptide 322..3009 /gene="gp130" /note="putative" /product="membrane glycoprotein 130" BASE COUNT 977 a 637 c 656 g 815 t ORIGIN 1 gagcagccaa aaggcccgcg gagtcgcgct gggccgcccc ggcgcagctg aaccgggggc 61 cgcgcctgcc aggccgacgg gtctggccca gcctggcgcc aaggggttcg tgcgctgtgg 121 agacgcggag ggtcgaggcg gcgcggcctg agtgaaaccc aatggaaaaa gcatgacatt 181 tagaagtaga agacttagct tcaaatccct actccttcac ttactaattt tgtgatttgg 241 aaatatccgc gcaagatgtt gacgttgcag acttgggtag tgcaagcctt gtttattttc 301 ctcaccactg aatctacagg tgaacttcta gatccatgtg gttatatcag tcctgaatct 361 ccagttgtac aacttcattc taatttcact gcagtttgtg tgctaaagga aaaatgtatg 421 gattattttc atgtaaatgc taattacatt gtctggaaaa caaaccattt tactattcct 481 aaggagcaat atactatcat aaacagaaca gcatccagtg tcacctttac agatatagct 541 tcattaaata ttcagctcac ttgcaacatt cttacattcg gacagcttga acagaatgtt 601 tatggaatca caataatttc aggcttgcct ccagaaaaac ctaaaaattt gagttgcatt 661 gtgaacgagg ggaagaaaat gaggtgtgag tgggatggtg gaagggaaac acacttggag 721 acaaacttca ctttaaaatc tgaatgggca acacacaagt ttgctgattg caaagcaaaa 781 cgtgacaccc ccacctcatg cactgttgat tattctactg tgtattttgt caacattgaa 841 gtctgggtag aagcagagaa tgcccttggg aaggttacat cagatcatat caattttgat 901 cctgtatata aagtgaagcc caatccgcca cataatttat cagtgatcaa ctcagaggaa 961 ctgtctagta tcttaaaatt gacatggacc aacccaagta ttaagagtgt tataatacta 1021 aaatataaca ttcaatatag gaccaaagat gcctcaactt ggagccagat tcctcctgaa 1081 gacacagcat ccacccgatc ttcattcact gtccaagacc ttaaaccttt tacagaatat 1141 gtgtttagga ttcgctgtat gaaggaagat ggtaagggat actggagtga ctggagtgaa 1201 gaagcaagtg ggatcaccta tgaagataga ccatctaaag caccaagttt ctggtataaa 1261 atagatccat cccatactca aggctacaga actgtacaac tcgtgtggaa gacattgcct 1321 ccttttgaag ccaatggaaa aatcttggat tatgaagtga ctctcacaag atggaaatca 1381 catttacaaa attacacagt taatgccaca aaactgacag taaatctcac aaatgatcgc 1441 tatctagcaa ccctaacagt aagaaatctt gttggcaaat cagatgcagc tgttttaact 1501 atccctgcct gtgactttca agctactcac cctgtaatgg atcttaaagc attccccaaa 1561 gataacatgc tttgggtgga atggactact ccaagggaat ctgtaaagaa atatatactt 1621 gagtggtgtg tgttatcaga taaagcaccc tgtatcacag actggcaaca agaagatggt 1681 accgtgcatc gcacctattt aagagggaac ttagcagaga gcaaatgcta tttgataaca 1741 gttactccag tatatgctga tggaccagga agccctgaat ccataaaggc ataccttaaa 1801 caagctccac cttccaaagg acctactgtt cggacaaaaa aagtagggaa aaacgaagct 1861 gtcttagagt gggaccaact tcctgttgat gttcagaatg gatttatcag aaattatact 1921 atattttata gaaccatcat tggaaatgaa actgctgtga atgtggattc ttcccacaca 1981 gaatatacat tgtcctcttt gactagtgac acattgtaca tggtacgaat ggcagcatac 2041 acagatgaag gtgggaagga tggtccagaa ttcactttta ctaccccaaa gtttgctcaa 2101 ggagaaattg aagccatagt cgtgcctgtt tgcttagcat tcctattgac aactcttctg 2161 ggagtgctgt tctgctttaa taagcgagac ctaattaaaa aacacatctg gcctaatgtt 2221 ccagatcctt caaagagtca tattgcccag tggtcacctc acactcctcc aaggcacaat 2281 tttaattcaa aagatcaaat gtattcagat ggcaatttca ctgatgtaag tgttgtggaa 2341 atagaagcaa atgacaaaaa gccttttcca gaagatctga aatcattgga cctgttcaaa 2401 aaggaaaaaa ttaatactga aggacacagc agtggtattg gggggtcttc atgcatgtca 2461 tcttctaggc caagcatttc tagcagtgat gaaaatgaat cttcacaaaa cacttcgagc 2521 actgtccagt attctaccgt ggtacacagt ggctacagac accaagttcc gtcagtccaa 2581 gtcttctcaa gatccgagtc tacccagccc ttgttagatt cagaggagcg gccagaagat 2641 ctacaattag tagatcatgt agatggcggt gatggtattt tgcccaggca acagtacttc 2701 aaacagaact gcagtcagca tgaatccagt ccagatattt cacattttga aaggtcaaag 2761 caagtttcat cagtcaatga ggaagatttt gttagactta aacagcagat ttcagatcat 2821 atttcacaat cctgtggatc tgggcaaatg aaaatgtttc aggaagtttc tgcagcagat 2881 gcttttggtc caggtactga gggacaagta gaaagatttg aaacagttgg catggaggct 2941 gcgactgatg aaggcatgcc taaaagttac ttaccacaga ctgtacggca aggcggctac 3001 atgcctcagt gaaggactag tagttcctgc tacaacttca gcagtaccta taaagtaaag 3061 ctaaaatgat tttatctgtg aattc // LOCUS HUMIL7A 1589 bp mRNA PRI 06-JAN-1995 DEFINITION Human interleukin 7 (IL-7) mRNA, complete cds. ACCESSION J04156 NID g186363 KEYWORDS interleukin; interleukin 7. SOURCE Human (SK-HEP-1 cell) liver adenocarcinoma, cDNA to mRNA, clone #3. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1589) AUTHORS Goodwin,R.G., Lupton,S., Schmierer,A., Hjerrild,K.J., Jerzy,R., Clevenger,W., Gillis,S., Cosman,D. and Namen,A.E. TITLE Human interleukin 7: molecular cloning and growth factor activity on human and murine B-lineage cells JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86 (1), 302-306 (1989) MEDLINE 89098903 COMMENT Draft entry and computer-readable sequence [1] kindly submitted by R.Goodwin, 05-JAN-1989. FEATURES Location/Qualifiers source 1..1589 /organism="Homo sapiens" /db_xref="taxon:9606" /map="8q12-q13" mRNA <1..1589 /note="interleukin 7 mRNA" sig_peptide 385..459 /gene="IL7" /note="interleukin 7 signal peptide" gene 385..918 /gene="IL7" CDS 385..918 /gene="IL7" /note="interleukin 7 precursor" /codon_start=1 /db_xref="GDB:G00-120-098" /db_xref="PID:g307064" /translation="MFHVSFRYIFGLPPLILVLLPVASSDCDIEGKDGKQYESVLMVS IDQLLDSMKEIGSNCLNNEFNFFKRHICDANKEGMFLFRAARKLRQFLKMNSTGDFDL HLLKVSEGTTILLNCTGQVKGRKPAALGEAQPTKSLEENKSLKEQKKLNDLCFLKRLL QEIKTCWNKILMGTKEH" mat_peptide 460..915 /gene="IL7" /note="interleukin 7" BASE COUNT 532 a 284 c 339 g 434 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcctct ggtcctcatc caggtgcgcg ggaagcaggt gcccaggaga gaggggataa 61 tgaagattcc atgctgatga tcccaaagat tgaacctgca gaccaagcgc aaagtagaaa 121 ctgaaagtac actgctggcg gatcctacgg aagttatgga aaaggcaaag cgcagagcca 181 cgccgtagtg tgtgccgccc cccttgggat ggatgaaact gcagtcgcgg cgtgggtaag 241 aggaaccagc tgcagagatc accctgccca acacagactc ggcaactccg cggaagacca 301 gggtcctggg agtgactatg ggcggtgaga gcttgctcct gctccagttg cggtcatcat 361 gactacgccc gcctcccgca gaccatgttc catgtttctt ttaggtatat ctttggactt 421 cctcccctga tccttgttct gttgccagta gcatcatctg attgtgatat tgaaggtaaa 481 gatggcaaac aatatgagag tgttctaatg gtcagcatcg atcaattatt ggacagcatg 541 aaagaaattg gtagcaattg cctgaataat gaatttaact tttttaaaag acatatctgt 601 gatgctaata aggaaggtat gtttttattc cgtgctgctc gcaagttgag gcaatttctt 661 aaaatgaata gcactggtga ttttgatctc cacttattaa aagtttcaga aggcacaaca 721 atactgttga actgcactgg ccaggttaaa ggaagaaaac cagctgccct gggtgaagcc 781 caaccaacaa agagtttgga agaaaataaa tctttaaagg aacagaaaaa actgaatgac 841 ttgtgtttcc taaagagact attacaagag ataaaaactt gttggaataa aattttgatg 901 ggcactaaag aacactgaaa aatatggagt ggcaatatag aaacacgaac tttagctgca 961 tcctccaaga atctatctgc ttatgcagtt tttcagagtg gaatgcttcc tagaagttac 1021 tgaatgcacc atggtcaaaa cggattaggg catttgagaa atgcatattg tattactaga 1081 agatgaatac aaacaatgga aactgaatgc tccagtcaac aaactatttc ttatatatgt 1141 gaacatttat caatcagtat aattctgtac tgatttttgt aagacaatcc atgtaaggta 1201 tcagttgcaa taatacttct caaacctgtt taaatatttc aagacattaa atctatgaag 1261 tatataatgg tttcaaagat tcaaaattga cattgcttta ctgtcaaaat aattttatgg 1321 ctcactatga atctattata ctgtattaag agtgaaaatt gtcttcttct gtgctggaga 1381 tgttttagag ttaacaatga tatatggata atgccggtga gaataagaga gtcataaacc 1441 ttaagtaagc aacagcataa caaggtccaa gatacctaaa agagatttca agagatttaa 1501 ttaatcatga atgtgtaaca cagtgccttc aataaatggt atagcaaatg ttttgacatg 1561 aaaaaaggac aatttcaaaa aaataaaat // LOCUS HUMIL7AA 1658 bp mRNA PRI 06-JAN-1995 DEFINITION Human interleukin-7 receptor (IL-7) mRNA, complete cds. ACCESSION M29696 NID g186365 KEYWORDS interleukin 7 receptor. SOURCE Human pre-B cell line IxN/2b, cDNA to mRNA, clone H20. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1658) AUTHORS Goodwin,R.G., Friend,D., Ziegler,S.F., Jerzy,R., Falk,B.A., Gimpel,S., Cosman,D., Dower,S.K., March,C.J., Namen,A.E. and Park,L.S. TITLE Cloning of the human and murine interleukin-7 receptors: demonstration of a soluble form and homology to a new receptor superfamily JOURNAL Cell 60 (6), 941-951 (1990) MEDLINE 90199875 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.G.Goodwin, 03-NOV-1989. FEATURES Location/Qualifiers source 1..1658 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Unassigned" sig_peptide 23..82 /gene="IL7R" /note="interleukin-7 receptor signal peptide" CDS 23..1402 /gene="IL7R" /note="interleukin-7 receptor precursor" /codon_start=1 /db_xref="GDB:G00-127-886" /db_xref="PID:g307065" /translation="MTILGTTFGMVFSLLQVVSGESGYAQNGDLEDAELDDYSFSCYS QLEVNGSQHSLTCAFEDPDVNTTNLEFEICGALVEVKCLNFRKLQEIYFIETKKFLLI GKSNICVKVGEKSLTCKKIDLTTIVKPEAPFDLSVIYREGANDFVVTFNTSHLQKKYV KVLMHDVAYRQEKDENKWTHVNLSSTKLTLLQRKLQPAAMYEIKVRSIPDHYFKGFWS EWSPSYYFRTPEINNSSGEMDPILLTISILSFFSVALLVILACVLWKKRIKPIVWPSL PDHKKTLEHLCKKPRKNLNVSFNPESFLDCQIHRVDDIQARDEVEGFLQDTFPQQLEE SEKQRLGGDVQSPNCPSEDVVVTPESFGRDSSLTCLAGNVSACDAPILSSSRSLDCRE SGKNGPHVYQDLLLSLGTTNSTLPPPFSLQSGILTLNPVAQGQPILTSLGSNQEEAYV TMSSFYQNQ" gene 23..1402 /gene="IL7R" mat_peptide 83..1399 /gene="IL7R" /note="interleukin-7 receptor" BASE COUNT 493 a 364 c 363 g 438 t ORIGIN 1 ctctctctct atctctctca gaatgacaat tctaggtaca acttttggca tggttttttc 61 tttacttcaa gtcgtttctg gagaaagtgg ctatgctcaa aatggagact tggaagatgc 121 agaactggat gactactcat tctcatgcta tagccagttg gaagtgaatg gatcgcagca 181 ttcactgacc tgtgcttttg aggacccaga tgtcaacacc accaatctgg aatttgaaat 241 atgtggggcc ctcgtggagg taaagtgcct gaatttcagg aaactacaag agatatattt 301 catcgagaca aagaaattct tactgattgg aaagagcaat atatgtgtga aggttggaga 361 aaagagtcta acctgcaaaa aaatagacct aaccactata gttaaacctg aggctccttt 421 tgacctgagt gtcatctatc gggaaggagc caatgacttt gtggtgacat ttaatacatc 481 acacttgcaa aagaagtatg taaaagtttt aatgcatgat gtagcttacc gccaggaaaa 541 ggatgaaaac aaatggacgc atgtgaattt atccagcaca aagctgacac tcctgcagag 601 aaagctccaa ccggcagcaa tgtatgagat taaagttcga tccatccctg atcactattt 661 taaaggcttc tggagtgaat ggagtccaag ttattacttc agaactccag agatcaataa 721 tagctcaggg gagatggatc ctatcttact aaccatcagc attttgagtt ttttctctgt 781 cgctctgttg gtcatcttgg cctgtgtgtt atggaaaaaa aggattaagc ctatcgtatg 841 gcccagtctc cccgatcata agaagactct ggaacatctt tgtaagaaac caagaaaaaa 901 tttaaatgtg agtttcaatc ctgaaagttt cctggactgc cagattcata gggtggatga 961 cattcaagct agagatgaag tggaaggttt tctgcaagat acgtttcctc agcaactaga 1021 agaatctgag aagcagaggc ttggagggga tgtgcagagc cccaactgcc catctgagga 1081 tgtagtcgtc actccagaaa gctttggaag agattcatcc ctcacatgcc tggctgggaa 1141 tgtcagtgca tgtgacgccc ctattctctc ctcttccagg tccctagact gcagggagag 1201 tggcaagaat gggcctcatg tgtaccagga cctcctgctt agccttggga ctacaaacag 1261 cacgctgccc cctccatttt ctctccaatc tggaatcctg acattgaacc cagttgctca 1321 gggtcagccc attcttactt ccctgggatc aaatcaagaa gaagcatatg tcaccatgtc 1381 cagcttctac caaaaccagt gaagtgtaag aaacccagac tgaacttacc gtgagcgaca 1441 aagatgattt aaaagggaag tctagagttc ctagtctccc tcacagcaca gagaagacaa 1501 aattagcaaa accccactac acagtctgca agattctgaa acattgcttt gaccactctt 1561 cctgagttca gtggcactca acatgagtca agagcatcct gcttctacca tgtggatttg 1621 gtcacaaggt ttaaggtgac ccaatgattc agctattt // LOCUS HUMIMMPHLN 2156 bp mRNA PRI 01-DEC-1992 DEFINITION Human immunophilin (FKBP52) mRNA, complete cds. ACCESSION M88279 NID g186389 KEYWORDS FK506 binding protein; immunophilin. SOURCE Homo sapiens placenta cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2156) AUTHORS Sanchez,E.R., Faber,L.E, Henzel,W.J. and Pratt,W.B. TITLE The 56-59-kilodalton protein identified in untransformed steroid receptor complexes is a unique protein that exists in cytosol in a complex with both the 70- and 90-kilodalton heat shock proteins JOURNAL Biochemistry 29, 5145-5152 (1990) MEDLINE 90335211 REFERENCE 2 (bases 1 to 2156) AUTHORS Yem,A.W., Tomasselli,A.G., Heinrikson,R.L., Zurcher-Neely,H.A., Ruff,V.A., Johnson,R.A. and Deibel,M.R. TITLE The hsp56 component of steroid receptor complexes binds to immobilized FK506 and shows homology to FKBP-12 and FKBP-13 JOURNAL J. Biol. Chem. 267, 2868-2871 (1992) MEDLINE 92147620 REFERENCE 3 (bases 1 to 2156) AUTHORS Benasutti,M., Harding,M.W., Fleming,M.A., DeCenzo,M.T., Lippke,J.A., Livingston,D.J. and Peattie,D.A. TITLE Expression and characterization of human FKBP52, an immunophilin that associates with the 90-kDa heat shock protein and is a component of steroid receptor complexes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89, 10974-10978 (1992) MEDLINE 93066366 FEATURES Location/Qualifiers source 1..2156 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" 5'UTR 1..99 CDS 100..1479 /note="'FKBP52; 52 kD FK506 binding protein'" /codon_start=1 /function="'binds the immunosuppressants FK506 and rapamycin'" /product="immunophilin" /db_xref="PID:g186390" /translation="MTAEEMKATESGAQSAPLPMEGVDISPKQDEGVLKVIKREGTGT EMPMIGDRVFVHYTGWLLDGTKFDSSLDRKDKFSFDLGKGEVIKAWDIAIATMKVGEV CHITCKPEYAYGSAGSPPKIPPNATLVFEVELFEFKGEDLTEEEDGGIIRRIQTRGEG YAKPNEGAIVEVALEGYYKDKLFDQRELRFEIGEGENLDLPYGLERAIQRMEKGEHSI VYLKPSYAFGSVGKEKFQIPPNAELKYELHLKSFEKAKESWEMNSEEKLEQSTIVKER GTVYFKEGKYKQALLQYKKIVSWLEYESSFSNEEAQKAQALRLASHLNLAMCHLKLQA FSAAIESCNKALELDSNNEKGLFRRGEAHLAVNDFELARADFQKVLQLYPNNKAAKTQ LAVCQQRIRRQLAREKKLYANMFERLAEEENKAKAEASSGDHPTDTEMKEEQKSNTAG SQSQVETEA" 3'UTR 1480..2156 polyA_signal 2143..2148 polyA_site 2156 BASE COUNT 530 a 551 c 606 g 469 t ORIGIN 1 cccggcctcc cgcacgcccc gcaggtagcg cccccgcccg cggcccagag tgcgctcgcg 61 ccggcaccag ctcccggata aacggcgcgc cgcgcggaga tgacagccga ggagatgaag 121 gcgaccgaga gcggggcgca gtcggcgccg ctgcccatgg agggagtgga catcagcccc 181 aaacaggacg aaggcgtgct gaaggtcatc aagagagagg gcacaggtac agagatgccc 241 atgattgggg accgagtctt tgtccactac actggctggc tattagatgg cacaaagttt 301 gactccagtc tggatcgcaa ggacaaattc tcctttgacc tgggaaaagg ggaggtcatc 361 aaggcttggg acattgccat agccaccatg aaggtggggg aggtgtgcca catcacctgc 421 aaaccagaat atgcctacgg ttcagcaggc agtcctccaa agattccccc caatgccacg 481 cttgtatttg aggtggagtt gtttgagttt aagggagaag atctgacgga agaggaagat 541 ggcggaatca ttcgcagaat acagactcgc ggtgaaggct atgctaagcc caatgagggt 601 gctatcgtgg aggttgcact ggaagggtac tacaaggaca agctctttga ccagcgggag 661 ctccgctttg agattggcga gggggagaac ctggatctgc cttatggtct ggagagggcc 721 attcagcgca tggagaaagg agaacattcc atcgtgtacc tcaagcccag ctatgctttt 781 ggcagtgttg ggaaggaaaa gttccaaatc ccaccaaatg ctgagctgaa atatgaatta 841 cacctcaaga gttttgaaaa ggccaaggag tcttgggaga tgaattcaga agagaagctg 901 gaacagagca ccatagtgaa agagcggggc actgtgtact tcaaggaagg taaatacaag 961 caagctttac tacagtataa gaagatcgtg tcttggctgg aatatgagtc tagtttttcc 1021 aatgaggaag cacagaaagc acaggccctt cgactggcct ctcacctcaa cctggccatg 1081 tgtcatctga aactacaggc cttctctgct gccattgaaa gctgtaacaa ggccctagaa 1141 ctggacagca acaacgagaa gggcctcttc cgccggggag aggcccacct ggccgtgaat 1201 gactttgaac tggcacgggc tgatttccag aaggtcctgc agctctaccc caacaacaaa 1261 gccgccaaga cccagctggc tgtgtgccag cagcggatcc gaaggcagct tgcccgggag 1321 aagaagctct atgccaatat gtttgagagg ctggctgagg aggagaacaa ggccaaggca 1381 gaggcttcct caggagacca tcccactgac acagagatga aggaggagca gaagagcaac 1441 acggcaggga gccagtctca ggtggagaca gaagcatagc ccctctccac cagccctact 1501 cctgcggctg cctgcccccc agtctcccca ctccaccctg ttagttttgt aaaaactgaa 1561 gaattttgag tgaattagac ctttattttt ctatctggtt ggatggtggc tttaggggaa 1621 gggggaaagg tgtaggctgg gggattgagg tggggaatca ttttagctgg tgtcagcccc 1681 tcttcccttc ctccattgca catgaacata tgtccatcca tatatattca tcagaatgtt 1741 aatttatttt gctccctctg ttaggtccat tttctaaggg tagaagaggc aagtggtagg 1801 gatgaggtct gataagaacc cagggtggag agggagactc ctgggcagcc gttttcctca 1861 tcctttccct ctcccagtcc atttccaaat gtggcctcca tgtgggtgct agggacatgg 1921 gaaaaaccac tgctatgcca tttcttctct ctgttccctt cctcaccccc gacggtgtgg 1981 ctgatgatgt cttctggtgt catggtgacc accccctgtt ccctgttctg gtatttcccc 2041 tgtcagtttc ccctctcggc caggttgtgt cccaaaatcc cctcagcctc ttctctgcac 2101 gttgctgaag gtccaggctt gcctcaagtt ccatgcttga gcaataaagt ggaaac // LOCUS HUMIMP90A 4212 bp mRNA PRI 05-JUL-1995 DEFINITION Homo sapiens importin beta subunit mRNA, complete cds. ACCESSION L38951 NID g893287 KEYWORDS importin beta subunit. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4212) AUTHORS Gorlich,D., Kostka,S., Kraft,R., Dingwall,C., Laskey,R.A., Hartmann,E. and Prehn,S. TITLE Two different subunits of importin cooperate to recognize nuclear localization signals and bind them to the nuclear envelope JOURNAL Curr. Biol. 5 (4), 383-392 (1995) MEDLINE 95353691 FEATURES Location/Qualifiers source 1..4212 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA 1..4212 CDS 338..2968 /codon_start=1 /function="protein import into the nucleus" /evidence=experimental /product="importin beta subunit" /db_xref="PID:g893288" /translation="MELITILEKTVSPDRLELEAAQKFLERAAVENLPTFLVELSRVL ANPGNSQVARVAAGLQIKNSLTSKDPDIKAQYQQRWLAIDANARREVKNYVLQTLGTE TYRPSSASQCVAGIACAEIPVNQWPELIPQLVANVTNPNSTEHMKESTLEAIGYICQD IDPEQLQDKSNEILTAIIQGMRKEEPSNNVKLAATNALLNSLEFTKANFDKESERHFI MQVVCEATQCPDTRVRVAALQNLVKIMSLYYQYMETYMGPALFAITIEAMKSDIDEVA LQGIEFWSNVCDEEMDLAIEASEAAEQGRPPEHTSKFYAKGALQYLVPILTQTLTKQD ENDDDDDWNPCKAAGVCLMLLATCCEDDIVPHVLPFIKEHIKNPDWRYRDAAVMAFGC ILEGPEPSQLKPLVIQAMPTLIELMKDPSVVVRDTAAWTVGRICELLPEAAINDVYLA PLLQCLIEGLSAEPRVASNVCWAFSSLAEAAYEAADVADDQEEPATYCLSSSFELIVQ KLLETTDRPDGHQNNLRSSAYESLMEIVKNSAKDCYPAVQKTTLVIMERLQQVLQMES HIQSTSDRIQFNDLQSLLCATLQNVLRKVQHQDALQISDVVMASLLRMFQSTAGSGGV QEDALMAVSTLVEVLGGEFLKYMEAFKPFLGIGLKNYAEYQVCLAAVGLVGDLCRALQ SNIIPFCDEVMQLLLENLGNENVHRSVKPQILSVFGDIALAIGGEFKKYLEVVLNTLQ QASQAQVDKSDYDMVDYLNELRESCLEAYTGIVQGLKGDQENVHPDVMLVQPRVEFIL SFIDHIAGDEDHTDGVVACAAGLIGDLCTAFGKDVLKLVEARPMIHELLTEGRRSKTN KAKTLATWATKELRKLKNQA" BASE COUNT 1219 a 940 c 1032 g 1019 t 2 others ORIGIN 1 ctccctcgct ccctccctgc gcgccgcctc tcactcacag cctcccttcc ttctttctcc 61 ctccgcctcc cgagcaccag cgcgctctga gctgccccca gggtcccctc ccccgccgcc 121 agcagcccat ttggagggag gaagtaaggg aagaggagag gaaggggagc cggaccgact 181 acccagacag agccggtgaa tgggtttgtg gtgacccccg ccccccaccc caccctccct 241 tcccacccga cccccaaccc ccatccccag ttcgagccgc cgcccgaaag gccgggccgt 301 cgtcttagga ggagtcgccg ccgccgccac ctccgccatg gagctgatca ccattctcga 361 gaagaccgtg tctcccgatc ggctggagct ggaagcggcg cagaagttcc tggagcgtgc 421 ggccgtggag aacctgccca ctttccttgt ggaactgtcc agagtgctgg caaatccagg 481 aaacagtcag gttgccagag ttgcagctgg tctacaaatc aagaactctt tgacatctaa 541 agatccagat atcaaggcac aatatcagca gaggtggctt gctattgatg ctaatgctcg 601 acgagaagtc aagaactatg ttttgcagac attgggtaca gaaacttacc ggcctagttc 661 tgcctcacag tgtgtggctg gtattgcttg tgcagagatc ccagtaaacc agtggccaga 721 actcattcct cagctggtgg ccaatgtcac aaaccccaac agcacagagc acatgaagga 781 gtcgacattg gaagccatcg gttatatttg ccaagatata gacccagagc agctacaaga 841 taaatccaat gagattctga ctgccataat ccaggggatg aggaaagaag agcctagtaa 901 taatgtgaag ctagctgcta caaatgcact cctgaactca ttggagttca ccaaagcaaa 961 ctttgataaa gagtctgaaa ggcactttat tatgcaggtg gtctgtgaag ccacacagtg 1021 tccagatacg agggtacgag tggctgcttt acagaatctg gtgaagataa tgtccttata 1081 ttatcagtac atggagacat atatgggtcc tgctcttttt gcaatcacaa tcgaagcaat 1141 gaaaagtgac attgatgagg tggctttaca agggatagaa ttctggtcca atgtctgtga 1201 tgaggaaatg gatttggcca ttgaagcttc agaggcagca gaacaaggac ggccccctga 1261 gcacaccagc aagttttatg cgaagggagc actacagtat ctggttccaa tcctcacaca 1321 gacactaact aaacaggacg aaaatgatga tgacgatgac tggaacccct gcaaagcagc 1381 aggggtgtgc ctcatgcttc tggccacctg ctgtgaagat gacattgtcc cacatgtcct 1441 ccccttcatt aaagaacaca tcaagaaccc agattggcgg taccgggatg cagcagtgat 1501 ggcttttggt tgtatcttgg aaggaccaga gcccagtcag ctcaaaccac tagttataca 1561 ggctatgccc accctaatag aattaatgaa agaccccagt gtagttgttc gagatacagc 1621 tgcatggact gtaggcagaa tttgtgagct gcttcctgaa gctgccatca atgatgtcta 1681 cttggctccc ctgctacagt gtctgattga gggtctcagt gctgaaccca gagtggcttc 1741 aaatgtgtgc tgggctttct ccagtctggc tgaagctgct tatgaagctg cagacgttgc 1801 tgatgatcag gaagaaccag ctacttactg cttatcttct tcatttgaac tcatagttca 1861 gaagctccta gagactacag acagacctga tggacaccag aacaacctga ggagttctgc 1921 atatgaatct ctgatggaaa ttgtgaaaaa cagtgccaag gattgttacc ctgctgtcca 1981 gaaaacgact ttggtcatca tggaacgact gcaacaggtt cttcagatgg agtcacatat 2041 ccagagcaca tccgatagaa tccagttcaa tgaccttcag tctttactct gtgcaactct 2101 tcagaatgtt cttcggaaag tgcaacatca agatgctttg cagatctctg atgtggttat 2161 ggcctccctg ttaaggatgt tccaaagcac agctgggtct gggggagtac aagaggatgc 2221 cctgatggca gttagcacac tggtggaagt gttgggtggt gaattcctca agtacatgga 2281 ggcctttaaa cccttcctgg gcattggatt aaaaaattat gctgaatacc aggtttgttt 2341 ggcagctgtg ggcttagtgg gagacttgtg ccgtgccctg caatccaaca tcataccttt 2401 ctgtgacgag gtgatgcagc tgcttctgga aaatttgggg aatgagaacg tccacaggtc 2461 tgtgaagccg cagattctgt cagtgtttgg tgatattgcc cttgctattg gaggagagtt 2521 taaaaaatac ttagaggttg tattgaatac tcttcagcag gcctcccaag cccaggtgga 2581 caagtcagac tatgacatgg tggattatct gaatgagcta agggaaagct gcttggaagc 2641 ctatactgga atcgtccagg gattaaaggg ggatcaggag aacgtacacc cggatgtgat 2701 gctggtacaa cccagagtag aatttattct gtctttcatt gaccacattg ctggagatga 2761 ggatcacaca gatggagtag tagcttgtgc tgctggacta ataggggact tatgtacagc 2821 atttgggaag gatgtactga aattagtaga agctaggcca atgatccatg aattgttaac 2881 tgaagggcgg agatcgaaga ctaacaaagc aaaaaccctt gctacatggg caacaaaaga 2941 actgaggaaa ctgaagaacc aagcttgatc tgttaccatt gggatgataa cctgaggacc 3001 cccactggaa atctcccatc ttttgaaaaa cctggaagtg aggagtgtgc acggatgctg 3061 aatgtttggg aatgagagga tgagtgagtg aggcttgaaa acacaccaca ttgaaaatcc 3121 tgccacagca gcagccgcag ccgccaacag cagcgctgtt agtgagctaa gtaagcactg 3181 acttcgtaga aaaccataac atcggccatc ttggaaaaga gaaaaacaat ggagttactt 3241 atttaaaaaa aaagaaagaa agttatctct tcccaggaga ggctagaagt agcttttctg 3301 tcttttggcc agtgccgagt ggaatgcctg gtttcgggga ggaggaggga ctgggttcag 3361 ctgtggtgct ttgttgtaaa aggcagcctg gcctttgcta ctgaggagaa agatggagcc 3421 tgggtctcaa gcccaccttc gctgtacctt tgccacatgg tactgtatgc ttgccagcta 3481 gaaggagggt cagggttttt tacagtctga gaatgagtgt gtgtgagtga ggcggtatcc 3541 acattctcaa cttcaagtca ttgctgtttc tttttcccag aaaacaaggg gttagatgtt 3601 gcatttcata aaactaaccg aagttctgtc tactgatgca gcacaagaga tgtaaaaaaa 3661 aaaaaaaaaa aaaaaaaaaa aaaacacaca aaaaaacaca cacacagagg aaagacgctc 3721 tttaggtttt gttttgtttt tttttttttg gttttgtttt ttgttttttt ttttactcta 3781 gggaaaacac tgacgaatgg tcagagctcc tatcctgatc ttttcatcaa ggcgcctttc 3841 ctaataatat ggttcaactg tgaatgtaga agtgggggnn aggggggaga aaaagaaaac 3901 tctggcgtta gaggatataa aaaaatataa gtacaattgt tacaaataac gcagacttca 3961 aaaacaaaaa aatcacaacc caaacaaacc aaaatttaaa tgatcagaat tggcagcaca 4021 aagaaaacgc cctctcctga cttgtattgt ggcagtctga acgcccccag aaaattgtgc 4081 caaagagttt agaaaaataa atatacaata aaagtaaaca catacacaca aaacagcaaa 4141 cttcaggtaa ctattttgga ttgcaaacag gataattaaa tgttcaaaca atctgataaa 4201 ataaccattt gg // LOCUS HUMIMPA6 1694 bp mRNA PRI 21-NOV-1997 DEFINITION Homo sapiens importin alpha 6 mRNA, complete cds. ACCESSION AF005361 NID g2343115 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1694) AUTHORS Kohler,M., Ansieau,S., Prehn,S., Leutz,A., Haller,H. and Hartmann,E. TITLE Cloning of two novel human importin-alpha subunits and analysis of the expression pattern of the importin-alpha protein family JOURNAL FEBS Lett. 417 (1), 104-108 (1997) MEDLINE 98055463 REFERENCE 2 (bases 1 to 1694) AUTHORS Kohler,M., Prehn,S. and Hartmann,E. TITLE Direct Submission JOURNAL Submitted (25-MAY-1997) Zellbiologie, Max-Delbrueck-Centrum fuer Molekulare Medizin, Robert-Roessle-Str.10, Berlin 13125, Germany FEATURES Location/Qualifiers source 1..1694 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 6..1616 /function="involved in NLS-dependent protein import into the nucleus" /note="importin alpha isoform; belongs to importin alpha protein family; most closely related to human SRP1" /codon_start=1 /product="importin alpha 6" /db_xref="PID:g2343116" /translation="MASPGKDNYRMKSYKNKALNPQEMRRRREEEGIQLRKQKREEQL FKRRNVYLPRNDESMLESPIQDSDISSTVPIPEEGVVTTDMVQMIFSNNADQQLTATQ KFRKLLSKEPNPPIDQVIQKPGVVQRFVKFLERNENCTLQFEAAWALTNIASGTFLHT KVVIETGAVPIFIKLLNSEHEDVQEQAVWALGNIAGDNAECRDFVLNCEILPPLLELL TNSNRLTTTRNAVWALSNLCRGKNPPPNFSKVSPCLNVLSRLLFSSDPDVLADVCWAL SYLSDGPNDKIQAVIDSGVCRRLVELLMHNDYKVVSPALRAVGNIVTGDDIQTQVILN CSALPCLLHLLSSPKESIRKEACWTVSNITAGNRAQIQAVIDANIFPVLIEILQKAEF RTRKEAAWAITNATSGGTPEQIRYLVALGCIKPLCDLLTVMDSKIVQVALNGLENILR LGEQESKQNGIGINPYCALIEEAYGLDKIEFLQSHENQEIYQKAFDLIEHYFGVEEDD PSIVPQVDENQQQFIFQQQEAPMDGFQL" BASE COUNT 564 a 287 c 351 g 492 t ORIGIN 1 atgccatggc tagtccaggg aaagataact atagaatgaa aagttataag aataaagccc 61 taaatcctca agagatgcgt agacgaagag aagaagaagg aatacagctt agaaaacaaa 121 aaagagaaga acagttgttc aaacgcagaa atgtctattt gcccagaaat gatgaatcta 181 tgcttgaaag tcctatacag gattcagata ttagttccac tgtacccatt ccagaggaag 241 gagttgttac tacagatatg gttcaaatga ttttttctaa taatgctgat caacagctaa 301 cagcaacaca gaaatttaga aagctgcttt ctaaagaacc taatccacca atagatcaag 361 ttatacagaa accaggagtt gtacagagat ttgtgaaatt tcttgaaaga aatgaaaatt 421 gcactttaca atttgaagct gcatgggcat taacaaatat agcatctgga acttttctgc 481 ataccaaggt agtgattgaa actggggctg ttccgatttt tatcaaactt cttaattctg 541 aacatgaaga tgttcaggaa caggctgttt gggcacttgg taatattgct ggtgacaatg 601 cagaatgcag agattttgtt ttgaattgtg aaatacttcc acctctttta gagttattaa 661 caaattcaaa cagactcaca acaacaagaa atgccgtgtg ggccctctca aatttatgta 721 gaggcaaaaa ccctcctcca aactttagta aggtttcacc ttgcttaaat gtcctgtcac 781 gactgttgtt tagcagtgac ccagatgtgt tagcagacgt gtgttgggcc ctttcttatc 841 tttccgatgg acccaatgat aaaattcaag cagtcattga ttctggagtc tgtcgaagat 901 tggtggaact tttgatgcac aatgattata aagttgtatc acctgcatta agggcagttg 961 gtaatattgt gactggtgat gatattcaaa cacaggtaat tttgaattgt tctgcattac 1021 cctgtctctt acatttattg agtagcccaa aggagtcaat tagaaaagaa gcctgctgga 1081 ctgtttctaa catcactgct ggaaatagag ctcagattca ggctgttata gatgcaaata 1141 tttttcctgt tttgattgag attcttcaga aagcagagtt tcgtaccaga aaagaagcag 1201 cttgggctat aactaatgca acatcaggag gtactccaga gcaaataagg tatttggtag 1261 ctttaggctg cattaaacca ctttgtgatc ttttgactgt tatggactcc aaaatagtcc 1321 aagtggcttt aaatggactt gaaaatattt tacgtcttgg agaacaagaa tctaagcaga 1381 atggaatagg cattaatcca tactgtgctc tcattgaaga agcatatggt ctggataaaa 1441 ttgagttttt gcaaagccat gaaaatcagg aaatttacca gaaggcattt gatctgattg 1501 aacattactt tggtgtagaa gaagatgacc ccagcattgt acctcaggtg gatgaaaacc 1561 aacaacagtt tatatttcag cagcaggaag caccaatgga tggatttcaa ctttaactta 1621 ctggaggaaa aaaaatttat ggctaaaaag ggtagcttca ggtaactcct ctttgttgcc 1681 aatgtaagga actg // LOCUS HUMIMPH 2858 bp mRNA PRI 11-JUN-1993 DEFINITION Human IMP dehydrogenase type 1 mRNA complete cds. ACCESSION J05272 NID g186393 KEYWORDS IMP dehydrogenase. SOURCE Human spleen, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2858) AUTHORS Natsumeda,Y., Ohno,S., Kawasaki,H., Konno,Y., Weber,G. and Suzuki,K. TITLE Two distinct cDNAs for human IMP dehydrogenase JOURNAL J. Biol. Chem. 265, 5292-5295 (1990) MEDLINE 90203022 COMMENT Draft entry and printed sequence for [1] kindly submitted by Y.Natsumeda, 15-FEB-1990. FEATURES Location/Qualifiers source 1..2858 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 601..2145 /note="IMP dehydrogenase type 1 (EC 1.1.1.205)" /codon_start=1 /db_xref="PID:g307067" /translation="MADYLISGGTGYVPEDGLTAQQLFASADDLTYNDFLILPGFIDF IADEVDLTSALTRKITLKTPLISSPMDTVTEADMAIAMALMGGIGFIHHNCTPEFQAN EVRKVKNFEQGFITDPVVLSPSHTVGDVLEAKMRHGFSGIPITETGTMGSKLVGIVTS RDIDFLAEKDHTTLLSEVMTPRIELVVAPAGVTLKEANEILQRSKKGKLPIVNDCDEL VAIIARTDLKKNRDYPLASKDSQKQLLCGAAVGTREDDKYRLDLLTQAGVDVIVFHSS QGNSVYQIAMVHYIKQKYPHLQVIGGNVVTAAQAKNLIDAGVDGLRVGMGCGSICITQ EVMACGRPQGTAVYKVAEYARRFGVPIIADGGIQTVGHVVKALALGASTVMMGSLLAA TTEAPGEYFFSDGVRLKKYRGMGSLDPMEKSSSSQKRYFSEGDKVKIAQGVSGSIQDK GSIQKFVPYLIAGIQHGCQDIGARSLSVLRSMMYSGELKFEKRTMSPQIEGGVHGLHS YEKRLY" polyA_signal 2843..2848 BASE COUNT 545 a 923 c 860 g 530 t ORIGIN 1 tcggaagggg ccaggagaca ctggaaggtc cggacggcag ggaaggggac ggggttcttt 61 ccagtcccac ccgtgtaggg acacctctcc ccctcatccc ccgatgtacc ctcgctgaat 121 ctgggatggg agagacgaac cgagtctagg catctgcgta gcagcgccgg ggagagcggg 181 gagcccaggc ggagcccagt cgactcccgg attcccctgc cccgcccccg gcacgaggcc 241 ccgccccggc gccccgcccc tcctcgggac tcgaccgggc tgcgctcact gcccagccgg 301 ggccccggga gcctccaggc tcgcccgccc tgagctgcgg cctccgcatg gagggccact 361 cactccacca ccgctgcagg gaggcggacg gcgctgttcc ggagcccgga gcccggcaac 421 acccgggaca cgagacggcg gcgcagggct acagcgcccg actgctgcag gccggctacg 481 agcccgagag ccctagattg gacctcgcta cacacccgac gacaccccgt tcagaactat 541 cttcagtggt cttactggca ggtgttggtg tccagatgga tcgccttcgc agggctagcc 601 atggcggact acctgatcag cggcggcacc ggctacgtgc ccgaggatgg gctcaccgcg 661 cagcagctct tcgccagcgc cgacgacctc acctacaacg acttcctgat tctcccagga 721 ttcatagact tcatagctga tgaggtggac ctgacctcag ccctgacccg gaagatcacg 781 ctgaagacgc cactcatctc ctcccccatg gacactgtga cagaggctga catggccatt 841 gccatggctc tgatgggagg tattgggttc attcaccaca actgcacccc agagttccag 901 gccaatgaag tacgcaaggt caagaacttt gaacagggct tcatcacgga ccctgtggtg 961 ctgagcccct cgcacactgt gggcgatgtg ctggaggcca agatgcggca tggcttctct 1021 ggcatcccca tcactgagac gggcaccatg ggcagcaagc tggtgggcat cgtcacctcc 1081 cgagacatcg actttcttgc tgagaaggac cacaccaccc tcctcagtga ggtgatgacg 1141 ccaaggattg aactggtggt ggctccagca ggtgtgacgt tgaaagaggc aaatgagatc 1201 ctgcagcgta gcaagaaagg gaagctgcct atcgtcaatg attgcgatga gctggtggcc 1261 atcatcgccc gcaccgacct gaagaagaat cgagactacc ctctggcctc caaggattcc 1321 cagaagcagc tgctctgtgg ggcagctgtg ggcacccgtg aggatgacaa ataccgtctg 1381 gacctgctga cccaggcggg ggtcgacgtc atagtcttcc actcgtccca agggaattcg 1441 gtgtatcaga tcgccatggt gcattacatc aaacagaagt acccccacct ccaggtgatt 1501 ggggggaacg tggtgacagc agcccaggcc aagaacctga ttgatgctgg tgtggacggg 1561 ctgcgcgtgg gcatgggctg cggctccatc tgcatcaccc aggaagtgat ggcctgtggt 1621 cggccccagg gcactgctgt gtacaaggtg gctgagtatg cccggcgctt tggtgtgccc 1681 atcatagccg atggcggcat ccagaccgtg ggacacgtgg tcaaggccct ggcccttgga 1741 gcctccacag tgatgatggg ctccctgctg gccgccacta cggaggcccc tggcgagtac 1801 ttcttctcag acggggtgcg gctcaagaag taccggggca tgggctcact ggatcccatg 1861 gagaagagca gcagcagcca gaaacgatac ttcagcgagg gggataaagt gaagatcgca 1921 cagggtgtct cgggctccat ccaggacaaa ggatccattc agaagttcgt gccctacctc 1981 atagcaggca tccaacacgg ctgccaggat atcggggccc gcagcctgtc tgtccttcgg 2041 tccatgatgt actcaggaga gctcaagttt gagaagcgga ccatgtcgcc ccagattgag 2101 ggtggtgtcc atggcctgca ctcttacgaa aagcggctgt actgaggaca gcggtggagg 2161 ccgaggtggt ggaggggatg caccccagtg tccacttttg ggcacaggct ccctccataa 2221 ctgagtggtc cacagatttg cactacgggt tctccagctc ctttccaggc agagaggagg 2281 ggaggtcctg aggggactgc tgcccctcac tcggcatccc ctgcagagtc aggactgctc 2341 ccgggggcca ggctgccctg ggaggccccc tccgagacca gccagccagg ctctcaggac 2401 ctgcgctgcc ttaggatctt tcttgctgca gcctgctcca gcctggcccc caccccaggg 2461 gcaggcggcc cctcctggct tctcctgtag ggcacctccc tgcccctagc ctcccagcaa 2521 atggtgctct cctggccctg ctctggccct tcccgggccg tgcccctcag ccatgtggca 2581 cttctgagct cctgacctag gccaagggga ggtctctgcc cccttccccg gccctgggct 2641 acccttgggt cctgctcctc aggccgctcc cctgtccctg gccatgggta ggagactgcc 2701 ctggtcatgg ccgcctgcct gtcattcctg actcaccacc gtccccaggt gaaccattcc 2761 tcccttctcc tcagctgcag tcgaaggctt taactttgca cacttgggat cacagttgcg 2821 tcattgtgta ttaaatactt ggaataaatc aagcaggt // LOCUS HUMINAE 3933 bp mRNA PRI 01-MAR-1994 DEFINITION Homo sapiens integrin alpha E mRNA, complete cds. ACCESSION L25851 NID g457244 KEYWORDS integrin alpha E. SOURCE Homo sapiens (library: lambda ZAP II) adult intestinal epithelial lining cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3933) AUTHORS Shaw,S.K., Cepek,K.L., Murphy,E.A., Russell,G.J., Brenner,M.B. and Parker,C.M. TITLE Molecular cloning of the human mucosal lymphocyte integrin alpha E subunit JOURNAL J. Biol. Chem. 269, 6016-6025 (1994) MEDLINE 94164962 FEATURES Location/Qualifiers source 1..3933 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="032891" /cell_type="TGF-beta1 induced cultured intra-epithelial lymphocytes" /dev_stage="adult" /tissue_type="intestinal epithelial lining" /tissue_lib="lambda ZAP II" sig_peptide 126..179 CDS 126..3662 /note="includes signal sequence" /codon_start=1 /function="precursor" /product="integrin alpha E" /db_xref="PID:g457245" /translation="MWLFHTLLCIASLALLAAFNVDVARPWLTPKGGAPFVLSSLLHQ DPSTNQTWLLVTSPRTKRTPGPLHRCSLVQDEILCHPVEHVPIQGEAPGSDRCPEPPR CFDMHSSAGPAPHSLSSELTGTCSLLGPDLRPQAQANFFDLENLLDPDARVDTGDCYS NKEGGGEDDVNTARQRRALEKEEEEDKEEEEDEEEEEAGTEIAIILDGSGSIDPPDFQ RAKDFISNMMRNFYEKCFECNFALVQYGGVIQTEFDLRDSQDVMASLARVQNITQVGS VTKTASAMQHVLDSIFTSSHGSRRKASKVMVVLTDGGIFEDPLNLTTVINSPKMQGVE RFAIGVGEEFKSARTARELNLIASDPDETHAFKVTNYMALDGLLSKLRYNIISMEGTV GDALHYQLAQIGFSAQILDERQVLLGAVGAFDWSGGALLYDTRSRRGRFLNQTAAAAA DAEAAQYSYLGYAVAVLHKTCSLSYVAGAPQYKHHGAVFELQKEGREASFLPVLEGEQ MGSYFGSELCPVDIDMDGSTDFLLVAAPFYHVHGEEGRVYVYRLSEQDGSFSLARILS GHPGFTNARFGFAMAAMGDLSQDKLTDVAIGAPLEGFGADDGASFGSVYIYNGHWDGL SASPSQRIRASTVAPGLQYFGMSMAGGFDISGDGLADITVGTLGQAVVFRSRPVVRLK VSMAFTPSALPIGFNGVVNVRLCFEISSVTTASESGLREALLNFTLDVDVGKQRRRLQ CSDVRSCLGCLREWSSGSQLCEDLLLMPTEGELCEEDCFSNASVKVSYQLQTPEGQTD HPQPILDRYTEPFAIFQLPYEKACKNKLFCVAELQLATTVSQQELVVGLTKELTLNIN LTNSGEDSYMTSMALNYPRNLQLKRMQKPPSPNIQCDDPQPVASVLIMNCRIGHPVLK RSSAHVSVVWQLEENAFPNRTADITVTVTNSNERRSLANETHTLQFRHGFVAVLSKPS IMYVNTGQGLSHHKEFLFHVHGENLFGAEYQLQICVPTKLRGLQVAAVKKLTRTQAST VCTWSQERACAYSSVQHVEEWHSVSCVIASDKENVTVAAEISWDHSEELLKDVTELQI LGEISFNKSLYEGLNAENHRTKITVVFLKDEKYHSLPIIIKGSVGGLLVLIVILVILF KCGFFKRKYQQLNLESIRKAQLKSENLLEEEN" mat_peptide 180..3659 /function="adhesion molecule" /evidence=experimental /product="integrin alpha E" misc_difference 1202 /replace="a" misc_difference 2188..2189 /note="deletion" /replace="c" misc_difference 3244 /replace="c" polyA_site 3858..3863 BASE COUNT 938 a 1062 c 1074 g 859 t ORIGIN 1 gaattccggc ccccgtgtct gggcgtccgc ctcctggcct cctggctgag gggaagctga 61 gtgggccacg gcccatgtgt cgcactcgcc tcggctccca cacagccgcc tctgctccag 121 caaggatgtg gctcttccac actctgctct gcatagccag cctggccctg ctggccgctt 181 tcaatgtgga tgtggcccgg ccctggctca cgcccaaggg aggtgcccct ttcgtgctca 241 gctcccttct gcaccaagac cccagcacca accagacctg gctcctggtc accagcccca 301 gaaccaagag gacaccaggg cccctccatc gatgttccct tgtccaggat gaaatccttt 361 gccatcctgt agagcatgtc cccatccaag gggaggcacc ggggagtgac cgttgtccgg 421 agccaccacg gtgttttgat atgcattcaa gtgctggtcc ggcgcctcac agcctcagct 481 cagaactcac aggcacctgt agcctcctgg gccctgacct ccgtccccag gctcaggcca 541 acttcttcga ccttgaaaat ctcctggatc cagatgcacg tgtggacact ggagactgct 601 acagcaacaa agaaggcggt ggagaagacg atgtgaacac agccaggcag cgccgggctc 661 tggagaagga ggaggaggaa gacaaggagg aggaggaaga cgaggaggag gaggaagctg 721 gcaccgagat tgccatcatc ctggatggct caggaagcat tgatccccca gactttcaga 781 gagccaaaga cttcatctcc aacatgatga ggaacttcta tgaaaagtgt tttgagtgca 841 actttgcctt ggtgcagtat ggaggagtga tccagactga gtttgacctt cgggacagcc 901 aggatgtgat ggcctccctc gccagagtcc agaacatcac tcaagtgggg agtgtcacca 961 agactgcctc agccatgcaa cacgtcttag acagcatctt cacctcaagc cacggctcca 1021 ggagaaaggc atccaaggtc atggtggtgc tcaccgatgg tggcatattc gaggaccccc 1081 tcaaccttac gacagtcatc aactccccca aaatgcaggg tgttgagcgc tttgccattg 1141 gggtgggaga agaatttaag agtgctagga ctgcgaggga actgaacctg atcgcctcag 1201 acccggatga gacccatgct ttcaaggtga ccaactacat ggcgctggat gggctgctga 1261 gcaaactgcg gtacaacatc atcagcatgg aaggcacggt tggagacgcc cttcactacc 1321 agctggcaca gattggcttc agtgctcaga tcctggatga gcggcaggtg ctgctcggcg 1381 ccgtcggggc ctttgactgg tccggagggg cgttgctcta cgacacacgc agccgccggg 1441 gccgcttcct gaaccagaca gcggcggcgg cggcagacgc ggaggctgcg cagtacagct 1501 acctgggtta cgctgtggcc gtgctgcaca agacctgcag cctctcctac gtcgcggggg 1561 ctccacagta caaacatcat ggggccgtgt ttgagctcca gaaggagggc agagaggcca 1621 gcttcctgcc agtgctggag ggagagcaga tggggtccta ttttggctct gagctgtgcc 1681 ctgtggacat tgacatggat ggaagcacgg acttcttgct ggtggctgct ccattttacc 1741 acgttcatgg agaagaaggc agagtctacg tgtaccgtct cagcgagcag gatggttctt 1801 tctccttggc acgcatactg agtgggcacc ccgggttcac caatgcccgc tttggctttg 1861 ccatggcggc tatgggggat ctcagtcagg ataagctcac agatgtggcc atcggggccc 1921 ccctggaagg ttttggggca gatgatggtg ccagcttcgg cagtgtgtat atctacaatg 1981 gacactggga cggcctctcc gccagcccct cgcagcggat cagagcctcc acggtggccc 2041 caggactcca gtacttcggc atgtccatgg ctggtggctt tgatattagt ggcgacggcc 2101 ttgccgacat caccgtgggc actctgggcc aggcggttgt gttccgctcc cggcctgtgg 2161 ttcgcctgaa ggtctccatg gccttcaccc ccagcgcact gcccatcggc ttcaacggcg 2221 tcgtgaatgt ccgtttatgt tttgaaatca gctctgtaac cacagcctct gagtcaggcc 2281 tccgtgaggc acttctcaac ttcacgctgg atgtggatgt ggggaagcag aggagacggc 2341 tgcagtgttc agacgtaaga agctgtctgg gctgcctgag ggagtggagc agcggatccc 2401 agctttgtga ggacctcctg ctcatgccca cagagggaga gctctgtgag gaggactgct 2461 tctccaatgc cagtgtcaaa gtcagctacc agctccagac ccctgaggga cagacggacc 2521 atccccagcc catcctggac cgctacactg agccctttgc catcttccag ctgccctatg 2581 agaaggcctg caagaataag ctgttttgtg tcgcagaatt acagttggcc accaccgtct 2641 ctcagcagga gttggtggtg ggtctcacaa aggagctgac cctgaacatt aacctaacta 2701 actccgggga agattcctac atgacaagca tggccttgaa ttaccccaga aacctgcagt 2761 tgaagaggat gcaaaagcct ccctctccaa acattcagtg tgatgaccct cagccggttg 2821 cttctgtcct gatcatgaac tgcaggattg gtcaccccgt cctcaagagg tcatctgctc 2881 atgtttcagt cgtttggcag ctagaggaga atgcctttcc aaacaggaca gcagacatca 2941 ctgtgactgt caccaattcc aatgaaagac ggtctttggc caacgagacc cacacccttc 3001 aattcaggca tggcttcgtt gcagttctgt ccaaaccatc cataatgtac gtgaacacag 3061 gccaggggct ttctcaccac aaagaattcc tcttccatgt acatggggag aacctctttg 3121 gagcagaata ccagttgcaa atttgcgtcc caaccaaatt acgaggtctc caggttgcag 3181 cagtgaagaa gctgacgagg actcaggcct ccacggtgtg cacctggagt caggagcgcg 3241 cttgtgcgta cagttcggtt cagcatgtgg aagaatggca ttcagtgagc tgtgtcatcg 3301 cttcagataa agaaaatgtc accgtggctg cagagatctc ctgggatcac tctgaggagt 3361 tactaaaaga tgtaactgaa ctgcagatcc ttggtgaaat atctttcaac aaatctctat 3421 atgagggact gaatgcagag aaccacagaa ctaagatcac tgtcgtcttc ctgaaagatg 3481 agaagtacca ttctttgcct atcatcatta aaggcagcgt tggtggactt ctggtgttga 3541 tcgtgattct ggtcatcctg ttcaagtgtg gcttttttaa aagaaaatat caacaactga 3601 acttggagag catcaggaag gcccagctga aatcagagaa tctgctcgaa gaagagaatt 3661 aggacctgct atccactggg agaggctatc agccagtcct gggacttgga gacccagcat 3721 cctttgcatt actttttcct tcaggatgat ctagagcagc atggagctgt tggtagaata 3781 ttagttttta accatacatt gtcccaaaag tgtctgtgca ttgtgcaaaa agtaaactta 3841 ggaaacattt ggtattaaat aaatttacac ttttctttgc aaaaaaaaaa aaaaaaaaaa 3901 aaaaaaaaaa aaaaaaaaaa aaaaccggaa ttc // LOCUS HUMINHA 1338 bp mRNA PRI 06-JAN-1995 DEFINITION Human inhibin A-subunit mRNA, complete cds. ACCESSION M13981 NID g186410 KEYWORDS inhibin. SOURCE Human term placenta, cDNA to mRNA, clone hFSA-110. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1338) AUTHORS Mayo,K.E., Cerelli,G.M., Spiess,J., Rivier,J., Rosenfeld,M.G., Evans,R.M. and Vale,W. TITLE Inhibin A-subunit cDNAs from porcine ovary and human placenta JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83 (16), 5849-5853 (1986) MEDLINE 86287350 FEATURES Location/Qualifiers source 1..1338 /organism="Homo sapiens" /db_xref="taxon:9606" /map="2q33-qter" sig_peptide 145..198 /gene="INHA" /note="inhibin A-subunit signal peptide" CDS 145..1245 /gene="INHA" /note="inhibin A-subunit precursor" /codon_start=1 /db_xref="GDB:G00-120-100" /db_xref="PID:g307068" /translation="MVLHLLLFLLLTPQGGHSCQGLELARELVLAKVRALFLDALGPP AVTREGGDPGVRRLPRRHALGGFTHRGSEPEEEEDVSQAILFPATDASCEDKSAARGL AQEAEEGLFRYMFRPSQHTRSRQVTSAQLWFHTGLDRQGTAASNSSEPLLGLLALSPG GPVAVPMSLGHAPPHWAVLHLATSALSLLTHPVLVLLLRCPLCTCSARPEATPFLVAH TRTRPPSGGERARRSTPLMSWPWSPSALRLLQRPPEEPAAHANCHRVALNISFQELGW ERWIVYPPSFIFHYCHGGCGLHIPPNLSLPVPGAPPTPAQPYSLLPGAQPCCAALPGT MRPLHVRTTSDGGYSFKYETVPNLLTQHCACI" gene 145..1245 /gene="INHA" mat_peptide 841..1242 /gene="INHA" /note="inhibin A-subunit" BASE COUNT 232 a 433 c 417 g 256 t ORIGIN 197 bp upstream of PvuII site. 1 gaaggactgg ggaagactgg atgagaaggg tagaagaggg tgggtgtggg atggggaggg 61 gagagtggaa aggccctggg cagaccctgg cagaaggggc acggggcagg gtgtgagttc 121 cccactagca gggccaggtg agctatggtg ctgcacctac tgctcttctt gctgctgacc 181 ccacagggtg ggcacagctg ccaggggctg gagctggccc gggaacttgt tctggccaag 241 gtgagggccc tgttcttgga tgccttgggg ccccccgcgg tgaccaggga aggtggggac 301 cctggagtca ggcggctgcc ccgaagacat gccctggggg gcttcacaca caggggctct 361 gagcccgagg aagaggagga tgtctcccaa gccatccttt tcccagccac agatgccagc 421 tgtgaggaca agtcagctgc cagagggctg gcccaggagg ctgaggaggg cctcttcaga 481 tacatgttcc ggccatccca gcatacacgc agccgccagg tgacttcagc ccagctgtgg 541 ttccacaccg ggctggacag gcagggcaca gcagcctcca atagctctga gcccctgcta 601 ggcctgctgg cactgtcacc gggaggaccc gtggctgtgc ccatgtcttt gggccatgct 661 ccccctcact gggccgtgct gcacctggcc acctctgctc tctctctgct gacccacccc 721 gtcctggtgc tgctgctgcg ctgtcccctc tgtacctgct cagcccggcc tgaggccacg 781 cccttcctgg tggcccacac tcggaccaga ccacccagtg gaggggagag agcccgacgc 841 tcaactcccc tgatgtcctg gccttggtct ccctctgctc tgcgcctgct gcagaggcct 901 ccggaggaac cggctgccca tgccaactgc cacagagtag cactgaacat ctccttccag 961 gagctgggct gggaacggtg gatcgtgtac cctcccagtt tcatcttcca ctactgtcat 1021 ggtggttgtg ggctgcacat cccaccaaac ctgtcccttc cagtccctgg ggctccccct 1081 accccagccc agccctactc cttgctgcca ggggcccagc cctgctgtgc tgctctccca 1141 gggaccatga ggcccctaca tgtccgcacc acctcggatg gaggttactc tttcaagtat 1201 gagacagtgc ccaaccttct cacgcagcac tgtgcttgta tctaagggtg gggggtcttc 1261 cttcttaatc ccatggctgg tggccacgcc cccaccatca tcagctggga ggaaaggcag 1321 agttgggaaa tagatggc // LOCUS HUMINOS 1705 bp mRNA PRI 12-JAN-1993 DEFINITION Human inositol polyphosphate 1-phosphatase mRNA, complete cds. ACCESSION L08488 NID g186425 KEYWORDS inositol; inositol polyphosphate 1-phosphatase; phosphatase. SOURCE Homo sapiens placenta cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS York,J.D., Veile,R.A., Donis-Keller,H. and Majerus,P.W. TITLE Cloning, heterologous expression, and chromosomal localization of human inositol polyphosphate 1-phosphatase JOURNAL J. Biol. Chem. 90, 5833-5837 (1993) FEATURES Location/Qualifiers source 1..1705 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" CDS 327..1526 /codon_start=1 /product="inositol polyphosphate 1-phosphatase" /db_xref="PID:g186426" /translation="MSDILRELLCVSEKAANIARACRQQEALFQLLIEEKKEGEKNKK FAVDFKTLADVLVQEVIKQNMENKFPGLEKNIFGEESNEFTNDWGEKITLRLCSTEEE TAELLSKVLNGNKVASEALARVVHQDVAFTDPTLDSTEINVPQDILGIWVDPIDSTYQ YIKGSADIKSNQGIFPCGLQCVTILIGVYDIQTGVPLMGVINQPFVSRDPNTLRWKGQ CYWGLSYMGTNMHSLQLTISRRNGSETHTGNTGSEAAFSPSFSAVISTSEKETIKAAL SRVCGDRIFGAAGAGYKSLCVVQGLVDIYIFSEDTTFKWDSCAAHAILRAMGGGIVDL KECLERNPETGLDLPQLVYHVENEGAAGVDRWANKGGLIAYRSRKRLETFLSLLVQNL APAETHT" polyA_site 1705 BASE COUNT 433 a 424 c 426 g 422 t ORIGIN 1 gaattcatct gtccactgct acccctgctg aggccaagct cggatccggt gccgagccaa 61 gcggggccgt gcgtcgccgg ccttcgctcg cgtgacctcc gccgtcctcc ccaaccctcg 121 tcctctgcgc ctgcggccgc agccccagcg cccctcgcct aacctcccgc cgggccgcgc 181 ctcctcctcc tcctgctccc cgccgcttcc gtttctcgag ggaaaggctg ctgcctcctg 241 ctctgtcctc atccccggct tagctgacgg cccagaggtg ggtgccaatt ccaccagcag 301 ctgcaactga aaagcaaggt tcagaaatgt cagatatcct ccgggagctg ctctgtgtct 361 ctgagaaggc tgctaacatt gcccgggcgt gcagacagca ggaagccctc ttccagctgc 421 tgatcgaaga aaagaaagag ggagaaaaga acaagaagtt tgcagttgac ttcaagactc 481 tggctgatgt actggtacag gaagttataa aacagaatat ggagaacaag tttccaggct 541 tggaaaaaaa tatttttgga gaagaatcca atgagtttac taatgactgg ggggaaaaga 601 ttaccttgag gttgtgttca acagaggaag aaacagcaga gcttcttagc aaagtcctca 661 atggtaacaa ggtagcatct gaagcattag ccagggttgt tcatcaggat gttgccttta 721 ctgacccaac tctggattcc acagagatca atgttccaca ggacattttg ggaatttggg 781 tggaccccat agattcaact tatcagtata taaaaggttc tgctgacatt aaatccaacc 841 agggaatctt cccctgtgga cttcagtgtg tcaccatttt aattggtgtc tatgacatac 901 agacaggggt tcccctgatg ggagtcatca atcaaccttt tgtgtcacga gatccaaaca 961 ccctcaggtg gaaaggacag tgctattggg gcctttctta catggggacc aacatgcatt 1021 cactacagct caccatctct agaagaaacg gcagtgaaac acacactgga aacaccggct 1081 ctgaggcagc attctccccc agtttttcag ccgtaattag tacaagtgaa aaggagacta 1141 tcaaagctgc attgtcacgt gtgtgtggag atcgcatatt tggggcagct ggggctggtt 1201 ataagagcct atgtgttgtc caaggcctcg ttgacattta catcttttca gaagatacca 1261 cattcaaatg ggactcttgt gctgctcatg ccatactgcg ggccatgggt gggggaatag 1321 tagacttgaa agaatgctta gaaagaaatc cagaaacagg gcttgatttg ccacagttgg 1381 tgtaccacgt ggaaaatgag ggtgctgctg gggtggatcg gtgggccaac aagggaggac 1441 tcattgcata cagatccagg aagcggctgg agacattcct gagcctcctg gtccaaaacc 1501 tggcacctgc agagacgcat acctagagga actctaaccc cggtgtacct gtataaactg 1561 aactgtgaaa ctgtttcggt tatctctgtc ttttgaggat ggctttgtcc tgttgctggt 1621 taacattcac cttcctcttt tgaggagtat ttttccatta tgtattcata ataatgttaa 1681 tttcaataaa tgacattcat gcagc // LOCUS HUMINSH 617 bp mRNA PRI 20-SEP-1996 DEFINITION Homo sapiens early placenta insulin-like peptide EPIL (INSL4) mRNA, complete cds. ACCESSION L34838 NID g1220314 KEYWORDS insulin family; placentin. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 617) AUTHORS Chassin,D., Laurent,A., Janneau,J.L., Berger,R. and Bellet,D. TITLE Cloning of a new member of the insulin gene superfamily (INSL4) expressed in human placenta JOURNAL Genomics 29 (2), 465-470 (1995) MEDLINE 96115599 REFERENCE 2 (bases 1 to 617) AUTHORS Koman,A. et al. TITLE Patent application, FR 2721033, 13-JUN-1994 JOURNAL Unpublished (1994) FEATURES Location/Qualifiers source 1..617 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="trophoblast" /dev_stage="6 weeks" /tissue_type="placenta" /tissue_lib="subtracted cDNA" gene 106..525 /gene="INSL4" CDS 106..525 /gene="INSL4" /note="early placenta insulin-like peptide" /codon_start=1 /product="EPIL" /db_xref="PID:g1220315" /translation="MASLFRSYLPAIWLLLSQLLRESLAAELRGCGPRFGKHLLSYCP MPEKTFTTTPGGWLLESGRPKEMVSTSNNKDGQALGTTSEFIPNLSPELKKPLSEGQP SLKKIILSRKKRSGRHRFDPFCCEVICDDGTSVKLCT" BASE COUNT 189 a 144 c 142 g 142 t ORIGIN 1 agtctggagc ccagaaggga cacaccagca cagtctggta ggctacagca gcaagtctct 61 aaagaaaggc tgagaacacc cagaacagga gagttcaggt ccaggatggc cagcctgttc 121 cggtcctatc tgccagcaat ctggctgctg ctgagccaac tccttagaga aagcctagca 181 gcagagctga ggggatgtgg tccccgattt ggaaaacact tgctgtcata ttgccccatg 241 cctgagaaga cattcaccac caccccagga gggtggctgc tggaatctgg acgtcccaaa 301 gaaatggtgt caacctccaa caacaaagat ggacaagcct taggtacgac atcagaattc 361 attcctaatt tgtcaccaga gctgaagaaa ccactgtctg aagggcagcc atcattgaag 421 aaaataatac tttcccgcaa aaagagaagt ggacgtcaca gatttgatcc attctgttgt 481 gaagtaattt gtgacgatgg aacttcagtt aaattatgta catagtagag taatcatgga 541 ctggacatct catccattct catatgtatt ctcaatgaca aattcactga tgcccaatta 601 aatgattgct gtttaaa // LOCUS HUMINTA3A 4637 bp mRNA PRI 30-OCT-1991 DEFINITION Human integrin alpha-3 chain mRNA, complete cds. ACCESSION M59911 NID g186496 KEYWORDS collagen receptor; fibronectin; integrin alpha-3 chain; laminin receptor; transmembrane protein. SOURCE Homo sapiens adult cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4637) AUTHORS Takada,Y., Murphy,E., Pil,P., Chen,C., Ginsberg,M.H. and Hemler,M.E. TITLE Molecular cloning and expression of the cDNA for alpha3 subunit of human alpha3 beta1 (VLA-3), an integrin receptor for fibronectin laminin and collagen JOURNAL J. Cell Biol. 115, 257-266 (1991) MEDLINE 92011866 FEATURES Location/Qualifiers source 1..4637 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HepG2" /cell_type="endothelial cell" /dev_stage="adult" /haplotype="diploid" CDS 74..3229 /codon_start=1 /product="integrin alpha-3 chain" /db_xref="PID:g186497" /translation="MGPGPSRAPRAPRLMLCALALMVAAGGCVVSAFNLDTRFLVVKE AGNPGSLFGYSVALHRQTERQQRYLLLAGAPRELAVPDGYTNRTGAVYLCPLTAHKDD CERMNITVKNDPGHHIIEDMWLGVTVASQGPAGRVLVCAHRYTQVLWSGSEDQRRMVG KCYVRGNDLELDSSDDWQTYHNEMCNSNTDYLETGMCQLGTSGGFTQNTVYFGAPGAY NWKGNSYMIQRKEWDLSEYSYKDPEDQGNLYIGYTMQVGSFILHPKNITIVTGAPRHR HMGAVFLLSQEAGGDLRRRQVLEGSQVGAYFGSAIALADLNNDGWQDLLVGAPYYFER KEEVGGAIYVFMNQAGTSFPAHPSLLLHGPSGSAFGLSVASIGDINQDGFQDIAVGAP FEGLGKVYIYHSSSKGLLRQPQQVIHGEKLGLPGLATFGYSLSGQMDVDENFYPDLLV GSLSDHIVLLRARPVINIVHKTLVPRPAVLDPALCTATSCVQVELCFAYNQSAGNPNY RRNITLAYTLEADRDRRPPRLRFAGSESAVFHGFFSMPEMRCQKLELLLMDNLRDKLR PIIISMNYSLPLRMPDRPRLGLRSLDAYPILNQAQALENHTEVQFQKECGPDNKCESN LQMRAAFVSEQQQKLSRLQYSRDVRKLLLSINVTNTRTSERSGEDAHEALLTLVVPPA LLLSSVRPPGACQANETIFCELGNPFKRNQRMELLIAFEVIGVTLHTRDLQVQLQLST SSHQDNLWPMILTLLVDYTLQTSLSMVNHRLQSFFGGTVMGESGMKTVEDVGSPLKYE FQVGPMGEGLVGLGTLVLGLEWPYEVSNGKWLLYPTEITVHGNGSWPCRPPGDLINPL NLTLSDPGDRPSSPQRRRRQLDPGGGQGPPPVTLAAAKKAKSETVLTCATGRAHCVWL ECPIPDAPVVTNVTVKARVWNSTFIEDYRDFDRVRVNGWATLFLRTSIPTINMENKTT WFSVDIDSELVEELPAEIELWLVLVAVGAGLLLLGLIILLLWKCGFFKRARTRALYEA KRQKAEMKSQPSETERLTDDY" BASE COUNT 962 a 1450 c 1339 g 886 t ORIGIN chromosome 17. 1 aggtgaacag gtcctcacgc ccagctccgc cccctcacgc gctctcgccg ggaccccgct 61 tccgctggca gccatgggcc ccggccccag ccgcgcgccc cgcgccccac gcctgatgct 121 ctgtgcgctc gccttgatgg tggcggccgg cggctgcgtc gtctccgcct tcaacctgga 181 tacccgattc ctggtagtga aggaggccgg gaacccgggc agcctcttcg gctactcggt 241 cgccctccat cggcagacag agcggcagca gcgctacctg ctcctggctg gtgccccccg 301 ggagctcgct gtgcccgatg gctacaccaa ccggactggt gctgtgtacc tgtgcccact 361 cactgcccac aaggatgact gtgagcggat gaacatcaca gtgaaaaatg accctggcca 421 tcacattatt gaggacatgt ggcttggagt gactgtggcc agccagggcc ctgcaggcag 481 agttctggtc tgtgcccacc gctacaccca ggtgctgtgg tcagggtcag aagaccagcg 541 gcgcatggtg ggcaagtgct acgtgcgagg caatgaccta gagctggact ccagtgatga 601 ctggcagacc taccacaacg agatgtgcaa tagcaacaca gactacctgg agacgggcat 661 gtgccagctg ggcaccagcg gtggcttcac ccagaacact gtgtacttcg gcgcccccgg 721 tgcctacaac tggaaaggaa acagctacat gattcagcgc aaggagtggg acttatctga 781 gtatagttac aaggacccag aggaccaagg aaacctctat attgggtaca cgatgcaggt 841 aggcagcttc atcctgcacc ccaaaaacat caccattgtg acaggtgccc cacggcaccg 901 acatatgggc gcggtgttct tgctgagcca ggaggcaggc ggagacctgc ggaggaggca 961 ggtgctggag ggctcgcagg tgggcgccta ttttggcagc gcaattgccc tggcagacct 1021 gaacaatgat gggtggcagg acctcctggt gggcgccccc tactacttcg agaggaaaga 1081 ggaagtaggg ggtgccatct atgtcttcat gaaccaggcg ggaacctcct tccctgctca 1141 cccctcactc cttcttcatg gccccagtgg ctctgccttt ggtttatctg tggccagcat 1201 tggtgacatc aaccaggatg gatttcagga tattgctgtg ggagctccgt ttgaaggctt 1261 gggcaaagtg tacatctatc acagtagctc taaggggctc cttagacagc cccagcaggt 1321 aatccatgga gagaagctgg gactgcctgg gttggccacc ttcggctatt ccctcagtgg 1381 gcagatggat gtggatgaga acttctaccc agaccttcta gtgggaagcc tgtcagacca 1441 cattgtgctg ctgcgggccc ggccagtcat caacatcgtc cacaagacct tggtgcccag 1501 gccagctgtg ctggaccctg cactttgcac ggccacctct tgtgtgcaag tggagctgtg 1561 ctttgcttac aaccagagtg ccgggaaccc caactacagg cgaaacatca ccctggccta 1621 cactctggag gctgacaggg accgccggcc gccccggctc cgctttgccg gcagtgagtc 1681 cgctgtcttc cacggcttct tctccatgcc cgagatgcgc tgccagaagc tggagctgct 1741 cctgatggac aacctccgtg acaaactccg ccccatcatc atctccatga actactcttt 1801 acctttgcgg atgcccgatc gcccccggct ggggctgcgg tccctggacg cctacccgat 1861 cctcaaccag gcacaggctc tggagaacca cactgaggtc cagttccaga aggagtgcgg 1921 gcctgacaac aagtgtgaga gcaacttgca gatgcgggca gccttcgtgt cagagcagca 1981 gcagaagctg agcaggctcc agtacagcag agacgtccgg aaattgctcc tgagcatcaa 2041 cgtgacgaac acccggacct cggagcgctc cggggaggac gcccacgagg cgctgctcac 2101 cctggtggtg cctcccgccc tgctgctgtc ctcagtgcgc ccccccgggg cctgccaagc 2161 taatgagacc atcttttgcg agctggggaa ccccttcaaa cggaaccaga ggatggagct 2221 gctcatcgcc tttgaggtca tcggggtgac cctgcacaca agggaccttc aggtgcagct 2281 gcagctctcc acgtcgagtc accaggacaa cctgtggccc atgatcctca ctctgctggt 2341 ggactataca ctccagacct cgcttagcat ggtaaatcac cggctacaaa gcttctttgg 2401 ggggacagtg atgggtgagt ctggcatgaa aactgtggag gatgtaggaa gccccctcaa 2461 gtatgaattc caggtgggcc caatggggga ggggctggtg ggcctgggga ccctggtcct 2521 aggtctggag tggccctacg aagtcagcaa tggcaagtgg ctgctgtatc ccacggagat 2581 caccgtccat ggcaatgggt cctggccctg ccgaccacct ggagacctta tcaaccctct 2641 caacctcact ctttctgacc ctggggacag gccatcatcc ccacagcgca ggcgccgaca 2701 gctggatcca gggggaggcc agggcccccc acctgtcact ctggctgctg ccaaaaaagc 2761 caagtctgag actgtgctga cctgtgccac agggcgtgcc cactgtgtgt ggctagagtg 2821 ccccatccct gatgcccccg ttgtcaccaa cgtgactgtg aaggcacgag tgtggaacag 2881 caccttcatc gaggattaca gagactttga ccgagtccgg gtaaatggct gggctaccct 2941 attcctccga accagcatcc ccaccatcaa catggagaac aagaccacgt ggttctctgt 3001 ggacattgac tcggagctgg tggaggagct gccggccgaa atcgagctgt ggctggtgct 3061 ggtggccgtg ggtgcagggc tgctgctgct ggggctgatc atcctcctgc tgtggaagtg 3121 cggcttcttc aagcgagccc gcactcgcgc cctgtatgaa gctaagaggc agaaggcgga 3181 gatgaagagc cagccgtcag agacagagag gctgaccgac gactactgag ggggcagccc 3241 cccgcccccg gcccacctgg tgtgacttct ttaagcggac ccgctattat cagatcatgc 3301 ccaagtacca cgcagtgcgg atccgggagg aggagcgcta cccacctcca gggagcaccc 3361 tgcccaccaa gaagcactgg gtgaccagct ggcagactcg ggaccaatac tactgacgtc 3421 ctccctgatc ccaccccctc ctcccccagt gtcccctttc ttcctattta tcataagtta 3481 tgcctctgac agtccacagg ggccaccacc tttggctggt agcagcaggc tcaggcacat 3541 acacctcgtc aagagcatgc acatgctgtc tggccctggg gatcttccca caggagggcc 3601 agcgctgtgg accttacaac gccgagtgca ctgcattcct gtgccctaga tgcacgtggg 3661 gcccactgct cgtggactgt gctggtgcat cacggatggt gcatgggctc gccgtgtctc 3721 agcctctgcc agcgccagcg ccaaaacaag ccaaagagcc tcccaccaga gccgggagga 3781 aaaggcccct gcaatgtggt gacacctccc ctttcacacc tggatccatc ttgagagcca 3841 cagtcactgg attgactttg ctgtcaaaac tactgacagg gagcagcccc cgggccgctg 3901 gctggtgggc ccccaattga cacccatgcc agagaggtgg ggatcctgcc taaggttgtc 3961 tacgggggca cttggaggac ctggcgtgct cagacccaac agcaaaggaa ctagaaagaa 4021 ggacccagaa ggcttgcttt cctgcatctc tgtgaagcct ctctccttgg ccacagactg 4081 aactcgcagg gagtgcagca ggaaggaaca aagacaggca aacggcaacg tagcctgggc 4141 tcactgtgct ggggcatggc gggatcctcc acagagagga ggggaccaat tctggacaga 4201 cagatgttgg gaggatacag aggagatgcc acttctcact caccactacc agccagcctc 4261 cagaaggccc cagagagacc ctgcaagacc acggagggag ccgacacttg aatgtagtaa 4321 taggcagggg gccctgccac cccatccagc cagaccccag ctgaaccatg cgtcaggggc 4381 ctagaggtgg agttcttagc tatccttggc tttctgtgcc agcctggctc tgcccctccc 4441 ccatgggctg tgtcctaagg cccatttgag aagctgaggc tagttccaaa aacctctcct 4501 gacccctgcc tgttggcagc ccactcccca gccccagccc cttccatggt actgtagcag 4561 gggaattccc tccccctcct tgtgccttct ttgtatatag gcttctcacc gcgaccaata 4621 aacagctccc agtttgt // LOCUS HUMINTB7 2780 bp mRNA PRI 06-JAN-1995 DEFINITION Human integrin beta-7 subunit mRNA, complete cds. ACCESSION M68892 NID g186508 KEYWORDS integrin beta-7 subunit. SOURCE Homo sapiens (tissue library: SEA-activated human T cell cDNA library) lymphoid cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2780) AUTHORS Yuan,Q.A., Jiang,W.M., Krissansen,G.W. and Watson,J.D. TITLE Cloning and sequence analysis of a novel beta 2-related integrin transcript from T lymphocytes: homology of integrin cysteine-rich repeats to domain III of laminin B chains [published erratum appears in Int Immunol 1991 Dec;3(12):1373-4] JOURNAL Int. Immunol. 2 (11), 1097-1108 (1990) MEDLINE 91190778 FEATURES Location/Qualifiers source 1..2780 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="peripheral blood T cell" /cell_type="T lymphocyte" /germline /tissue_type="lymphoid" /tissue_lib="SEA-activated human T cell cDNA library" /map="12" sig_peptide 152..202 /gene="ITGB7" /note="leader sequence" CDS 152..2548 /gene="ITGB7" /codon_start=1 /db_xref="GDB:G00-128-601" /product="integrin beta-7 subunit" /db_xref="PID:g186509" /translation="MVALPMVLVLLLVLSRGESELDAKIPSTGDATEWRNPHLSMLGS CQPAPSCQKCILSHPSCAWCKQLNFTASGEAEARRCARREELLARGCPLEELEEPRGQ QEVLQDQPLSQGARGEGATQLAPQRVRVTLRPGEPQQLQVRFLRAEGYPVDLYYLMDL SYSMKDDLERVRQLGHALLVRLQEVTHSVRIGFGSFVDKTVLPFVSTVPSKLRHPCPT RLERCQSPFSFHHVLSLTGDAQAFEREVGRQSVSGNLDSPEGGFDAILQAALCQEQIG WRNVSRLLVFTSDDTFHTAGDGKLGGIFMPSDGHCHLDSNGLYSRSTEFDYPSVGQVA QALSAANIQPIFAVTSAALPVYQELSKLIPKSAVGELSEDSSNVVQLIMDAYNSLSST VTLEHSSLPPGVHISYESQCEGPEKREGKAEDRGQCNHVRINQTVTFWVSLQATHCLP EPHLLRLRALGFSEELIVELHTLCDCNCSDTQPQAPHCSDGQGHLQCGVCSCAPGRLG RLCECSVAELSSPDLESGCRAPNGTGPLCSGKGHCQCGRCSCSGQSSGHLCECDDASC ERHEGILCGGFGRCQCGVCHCHANRTGRACECSGDMDSCISPEGGLCSGHGRCKCNRC QCLDGYYGALCDQCPGCKTPCERHRDCAECGAFRTGPLATNCSTACAHTNVTLALAPI LDDGWCKERTLDNQLFFFLVEDDARGTVVLRVRPQEKGADHTQAIVLGCVGGIVAVGL GLVLAYRLSVEIYDRREYSRFEKEQQQLNWKQDSNPLYKSAITTTINPRFQEADSPTL " gene 152..2548 /gene="ITGB7" mat_peptide 203..2545 /gene="ITGB7" /product="integrin beta-7 subunit" BASE COUNT 559 a 810 c 842 g 569 t ORIGIN 1 cgttgctgtc gctctgcacg cacctatgtg gaaactaaag cccagagaga aagtctgact 61 tgccccacag ccagtgagtg actgcagcag caccagaatc tggtctgttt cctgtttggc 121 tcttctacca ctacggcttg ggatctcggg catggtggct ttgccaatgg tccttgtttt 181 gctgctggtc ctgagcagag gtgagagtga attggacgcc aagatcccat ccacagggga 241 tgccacagaa tggcggaatc ctcacctgtc catgctgggg tcctgccagc cagccccctc 301 ctgccagaag tgcatcctct cacaccccag ctgtgcatgg tgcaagcaac tgaacttcac 361 cgcgtcggga gaggcggagg cgcggcgctg cgcccgacga gaggagctgc tggctcgagg 421 ctgcccgctg gaggagctgg aggagccccg cggccagcag gaggtgctgc aggaccagcc 481 gctcagccag ggcgcccgcg gagagggtgc cacccagctg gcgccgcagc gggtccgggt 541 cacgctgcgg cctggggagc cccagcagct ccaggtccgc ttccttcgtg ctgagggata 601 cccggtggac ctgtactacc ttatggacct gagctactcc atgaaggacg acctggaacg 661 cgtgcgccag ctcgggcacg ctctgctggt ccggctgcag gaagtcaccc attctgtgcg 721 cattggtttt ggttcctttg tggacaaaac ggtgctgccc tttgtgagca cagtaccctc 781 caaactgcgc cacccctgcc ccacccggct ggagcgctgc cagtcaccat tcagctttca 841 ccatgtgctg tccctgacgg gggacgcaca agccttcgag cgggaggtgg ggcgccagag 901 tgtgtccggc aatctggact cgcctgaagg tggcttcgat gccattctgc aggctgcact 961 ctgccaggag cagattggct ggagaaatgt gtcccggctg ctggtgttca cttcagacga 1021 cacattccat acagctgggg acgggaagtt gggcggcatt ttcatgccca gtgatgggca 1081 ctgccacttg gacagcaatg gcctctacag tcgcagcaca gagtttgact acccttctgt 1141 gggtcaggta gcccaggccc tctctgcagc aaatatccag cccatctttg ctgtcaccag 1201 tgccgcactg cctgtctacc aggagctgag taaactgatt cctaagtctg cagttgggga 1261 gctgagtgag gactccagca acgtggtaca gctcatcatg gatgcttata atagcctgtc 1321 ttccaccgtg acccttgaac actcttcact ccctcctggg gtccacattt cttacgaatc 1381 ccagtgtgag ggtcctgaga agagggaggg taaggctgag gatcgaggac agtgcaacca 1441 cgtccgaatc aaccagacgg tgactttctg ggtttctctc caagccaccc actgcctccc 1501 agagccccat ctcctgaggc tccgggccct tggcttctca gaggagctga ttgtggagtt 1561 gcacacgctg tgtgactgta attgcagtga cacccagccc caggctcccc actgcagtga 1621 tggccaggga cacctacaat gtggtgtatg cagctgtgcc cctggccgcc taggtcggct 1681 ctgtgagtgc tctgtggcag agctgtcctc cccagacctg gaatctgggt gccgggctcc 1741 caatggcaca gggcccctgt gcagtggaaa gggtcactgt caatgtggac gctgcagctg 1801 cagtggacag agctctgggc atctgtgcga gtgtgacgat gccagctgtg agcgacatga 1861 gggcatcctc tgcggaggct ttggtcgctg ccaatgtgga gtatgtcact gtcatgccaa 1921 ccgcacgggc agagcatgcg aatgcagtgg ggacatggac agttgcatca gtcccgaggg 1981 agggctctgc agtgggcatg gacgctgcaa atgcaaccgc tgccagtgct tggacggcta 2041 ctatggtgct ctatgcgacc aatgcccagg ctgcaagaca ccatgcgaga gacaccggga 2101 ctgtgcagag tgtggggcct tcaggactgg cccactggcc accaactgca gtacagcttg 2161 tgcccatacc aatgtgaccc tggccttggc ccctatcttg gatgatggct ggtgcaaaga 2221 gcggaccctg gacaaccagc tgttcttctt cttggtggag gatgacgcca gaggcacggt 2281 cgtgctcaga gtgagacccc aagaaaaggg agcagaccac acgcaggcca ttgtgctggg 2341 ctgcgtaggg ggcatcgtgg cagtggggct ggggctggtc ctggcttacc ggctctcggt 2401 ggaaatctat gaccgccggg aatacagtcg ctttgagaag gagcagcaac aactcaactg 2461 gaagcaggac agtaatcctc tctacaaaag tgccatcacg accaccatca atcctcgctt 2521 tcaagaggca gacagtccca ctctctgaag gagggaggga cacttaccca aggctcttct 2581 ccttggagga cagtgggaac tggagggtga gaggaagggt gggtctgtaa gaccttggta 2641 ggggactaat tcactggcga ggtgcggcca ccaccctact tcattttcag agtgacaccc 2701 aagagggctg cttcccatgc ctgcaacctt gcatccatct gggctacccc acccaagtat 2761 acaataaagt cttacctcag // LOCUS HUMINTERFE 588 bp mRNA PRI 03-MAY-1994 DEFINITION Human interferon mRNA, complete cds. ACCESSION L25664 NID g479010 KEYWORDS interferon. SOURCE Homo sapiens placenta cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 588) AUTHORS Whaley,A.E., Meka,C.S.R., Reddy,C.S., Harbison,L.A., Hunt,J.S. and Imakawa,K. TITLE Identification and cellular localization of unique interferon mRNA from human placenta JOURNAL J. Biol. Chem. 269, 10864-10868 (1994) MEDLINE 94193794 FEATURES Location/Qualifiers source 1..588 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="JM101" /cell_type="trophoblast" /tissue_type="placenta" CDS 1..588 /note="disulfide bond 1..99 disulfide bond 29..139; putative" /codon_start=1 /product="interferon" /db_xref="PID:g479011" /translation="MAFVLSLLMALVLVSYGPGGSLGCDLSQNHVLVGRKNLRLLDEM RRLSPHFCLQDRKDFALPQEMVEGGQLQEAQAISVLHEMLQQSFNLFHTEHSSAAWDT TLLEPCRTGLHQQLDNLDACLGQVMGEEDSALGRTGPTLALKRYFQGIHVYLKEKGYS DCAWETVRLEIMRSFSSLISLQERLRMMDGDLSSP" BASE COUNT 133 a 170 c 165 g 120 t ORIGIN 1 atggccttcg tgctctctct actcatggcc ctggtgctgg tcagctacgg cccaggagga 61 tccctgggtt gtgacctgtc tcagaaccac gtgctggttg gcaggaagaa cctcaggctc 121 ctggacgaaa tgaggagact ctcccctcac ttttgtctgc aggacagaaa agacttcgct 181 ttaccccagg aaatggtgga gggcggccag ctccaggagg cccaggccat ctctgtgctc 241 catgagatgc tccagcagag cttcaacctc ttccacacag agcactcctc tgctgcctgg 301 gacaccaccc tcctggagcc atgccgcact ggactccatc agcagctgga caacctggat 361 gcctgcctgg ggcaggtgat gggagaggaa gactctgccc tgggaaggac gggccccacc 421 ctggctctga agaggtactt ccagggcatc catgtctacc tgaaagagaa gggatacagc 481 gactgcgcct gggaaaccgt cagactggaa atcatgagat ccttctcttc attaatcagc 541 ttgcaagaaa ggttaagaat gatggatgga gacctgagct caccttga // LOCUS HUMINV2 2108 bp DNA PRI 06-JAN-1995 DEFINITION Human involucrin gene, exon 2. ACCESSION M13903 NID g186519 KEYWORDS involucrin; keratinocyte protein. SEGMENT 2 of 2 SOURCE Human keratinocyte, cDNA to mRNA; and DNA, clone lambda-1-3. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2108) AUTHORS Eckert,R.L. and Green,H. TITLE Structure and evolution of the human involucrin gene JOURNAL Cell 46 (4), 583-589 (1986) MEDLINE 86272107 FEATURES Location/Qualifiers source 1..2108 /organism="Homo sapiens" /db_xref="taxon:9606" /map="1q21-q22" gene join(M13902:785..834,1..1787) /gene="IVL" intron <1..10 /gene="IVL" /note="G00-119-355" /number=1 CDS 30..1787 /gene="IVL" /codon_start=1 /db_xref="GDB:G00-119-355" /product="involucrin" /db_xref="PID:g386834" /translation="MSQQHTLPVTLSPALSQELLKTVPPPVNTHQEQMKQPTPLPPPC QKVPVELPVEVPSKQEEKHMTAVKGLPEQECEQQQKEPQEQELQQQHWEQHEEYQKAE NPEQQLKQEKTQRDQQLNKQLEEEKKLLDQQLDQELVKRDEQLGMKKEQLLELPEQQE GHLKHLEQQEGQLKHPEQQEGQLELPEQQEGQLELPEQQEGQLELPEQQEGQLELPEQ QEGQLELPQQQEGQLELSEQQEGQLELSEQQEGQLELSEQQEGQLKHLEHQEGQLEVP EEQMGQLKYLEQQEGQLKHLDQQEKQPELPEQQMGQLKHLEQQEGQPKHLEQQEGQLE QLEEQEGQLKHLEQQEGQLEHLEHQEGQLGLPEQQVLQLKQLEKQQGQPKHLEEEEGQ LKHLVQQEGQLKHLVQQEGQLEQQERQVEHLEQQVGQLKHLEEQEGQLKHLEQQQGQL EVPEQQVGQPKNLEQEEKQLELPEQQEGQVKHLEKQEAQLELPEQQVGQPKHLEQQEK HLEHPEQQDGQLKHLEQQEGQLKDLEQQKGQLEQPVFAPAPGQVQDIQPALPTKGEVL LPVEHQQQKQEVQWPPKHK" BASE COUNT 602 a 526 c 711 g 269 t ORIGIN About 1188 bp after segment 1. 1 tgtctttcag gttgacagta gcttctaaga tgtcccagca acacacactg ccagtgaccc 61 tctcccctgc cctcagtcag gagctcctca agactgttcc tcctccagtc aatacccatc 121 aggagcaaat gaaacagcca actccactgc ctcccccatg ccagaaggtg cctgtcgagc 181 tcccagtgga ggtcccatca aagcaagagg aaaagcacat gactgctgta aagggactgc 241 ctgagcaaga atgtgagcaa cagcagaagg agccacagga gcaggagctg cagcaacagc 301 actgggaaca gcatgaggaa tatcagaaag cagaaaaccc agagcagcag cttaagcagg 361 agaaaacaca aagggatcag cagctaaaca aacagctgga agaagagaag aagctcttag 421 accagcaact ggatcaagag ctagtcaaga gagatgagca actgggaatg aagaaagagc 481 aactgttgga gctcccagag cagcaggagg ggcacctgaa gcacctagag cagcaggagg 541 gacagctgaa gcacccggag cagcaggagg ggcagctgga gctcccagag cagcaggagg 601 ggcagctgga gctcccagag cagcaggagg ggcagctgga gctcccagag cagcaggagg 661 ggcagctgga gctcccagag cagcaggagg ggcagctgga gctcccacag cagcaggagg 721 ggcagctgga gctctctgag cagcaggagg ggcagctgga gctctctgag cagcaggagg 781 ggcagctgga gctctctgag cagcaggagg gacagctgaa gcacctggag caccaggagg 841 ggcagctgga ggtcccagag gagcagatgg ggcagctgaa gtacctggaa cagcaggagg 901 ggcagctgaa gcacctggat cagcaggaga agcagccaga gctcccagag cagcagatgg 961 ggcagctgaa gcacctggag cagcaggagg ggcagcctaa gcatctggag cagcaggagg 1021 ggcaactgga gcagctggag gagcaggagg ggcagctgaa gcacctggag cagcaggagg 1081 ggcagctgga gcacctggag caccaggaag ggcagctggg gctcccagag cagcaggtgc 1141 tgcagctgaa gcagctagag aagcagcagg ggcagccaaa gcacctggag gaggaggagg 1201 ggcagctgaa gcacctggtg cagcaggagg ggcagctgaa gcatctggtg cagcaggagg 1261 ggcagctgga gcagcaggag aggcaggtgg agcacctgga gcagcaggtg gggcagctga 1321 agcacctaga ggagcaggag ggacagctga agcatctgga gcagcagcag gggcagttgg 1381 aggtcccaga gcagcaggtg gggcagccaa agaacctgga gcaggaggag aagcaactgg 1441 agctcccaga gcagcaagag ggccaggtga agcacctgga gaagcaggag gcacagctgg 1501 agctcccaga gcagcaggta ggacagccaa agcacctgga acagcaggaa aagcacctag 1561 agcacccaga gcagcaggac ggacaactaa aacatctgga gcagcaggag gggcagctga 1621 aggacctgga gcagcagaag gggcagctgg agcagcctgt gtttgcccca gctccaggcc 1681 aggtccaaga cattcaacca gccctgccca caaagggaga agtattgctt cctgtagagc 1741 accagcagca gaagcaggag gtgcagtggc cacccaaaca taaataacca cccgcagtgt 1801 ccagaggccc tcagatcgtc tcatacaagg gaagagagag ccactggctc cacttatttc 1861 gggtccgcta ggtggcccgt ctcatctgtg aacttgactc tgtccctcta catgtctctt 1921 taatggggtg agggtggggg agagagggaa ttattgtcca gtgccaaccc caatgacccc 1981 aatcccaacc tcaggtgagc ggagcctcta cttgagggac tattgttact ataggaatcc 2041 ttacttcccc agtattgaag ctgaatcagt gagtgtgtac aatgatacat aataaatctt 2101 ggaagtct // LOCUS HUMIP3R3 8833 bp mRNA PRI 07-FEB-1994 DEFINITION Human type 3 inositol 1,4,5-trisphosphate receptor (ITPR3) mRNA, complete cds. ACCESSION U01062 NID g453367 KEYWORDS Inositol trisphosphate receptor; calcium release channel. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8833) AUTHORS Maranto,A.R. TITLE Primary structure, ligand binding, and localization of the human type 3 inositol 1,4,5-trisphosphate receptor expressed in intestinal epithelium JOURNAL J. Biol. Chem. 269 (2), 1222-1230 (1994) MEDLINE 94117432 REFERENCE 2 (bases 1 to 8833) AUTHORS Maranto,A.R. TITLE Direct Submission JOURNAL Submitted (25-AUG-1993) Maranto, A.R., St. Elizabeth's Hospital, Department of Biomedical Research, 736 Cambridge Street, Boston, MA 02135, USA FEATURES Location/Qualifiers source 1..8833 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HT29" /cell_type="epithelial cell" mRNA 1..8833 /gene="ITPR3" gene 1..8833 /gene="ITPR3" CDS 37..8052 /gene="ITPR3" /codon_start=1 /function="Ca2+ release channel" /evidence=experimental /product="human type 3 inositol 1,4,5-trisphosphate receptor" /db_xref="PID:g393036" /translation="MSEMSSFLHIGDIVSLYAEGSVNGFISTLGLVDDRCVVEPAAGD LDNPPKKFRDCLFKVCPMNRYSAQKQYWKAKQTKQDKEKIADVVLLQKLQHAAQMEQK QNDTENKKVHGDVVKYGSVIQLLHMKSNKYLTVNKRLPALLEKNAMRVTLDATGNEGS WLFIQPFWKLRSNGDNVVVGDKVILNPVNAGQPLHASNYELSDNAGCKEVNSVNCNTS WKINLFMQFRDHLEEVLKGGDVVRLFHAEQEKFLTCDEYKGKLQVFLRTTLRQSATSA TSSNALWEVEVVHHDPCRGGAGHWNGLYRFKHLATGNYLAAEENPSYKGDASDPKAAG MGAQGRTGRRNAGEKIKYCLVAVPHGNDIASLFELDPTTLQKTDSFVPRNSYVRLRHL CTNTWIQSTNVPIDIEEERPIRLMLGTCPTKEDKEAFAIVSVPVSEIRDLDFANDASS MLASAVEKLNEGFISQNDRRFVIQLLEDLVFFVSDVPNNGQNVLDIMVTKPNRERQKL MREQNILKQVFGILKVPFREKGGEGPLVRLEELSDQKNAPYQHMFRLCYRVLRYSQED YRKNQEHIAKQFGMMQSQIGYDILAEDTITALLHNNRKLLEKHITKTEVETFVSLVRK NREPRFLDYLSDLCVSNHIAIPVTQELICKCVLDPKNSDILIRTELRPVKEMAQSHEY LSIEYSEEEVWLTWTDKNNEHHEKSVRQLAQEARAGNAHDENVLSYYRYQLKLFARMC LDRQYLAIDEISQQLGVDLIFLCMADEMLPFDLRASFCHLMLHVHVDRDPQELVTPVK FARLWTEIPTAITIKDYDSNLNASRDDKKNKFANTMEFVEDYLNNVVSEAVPFANEEK NKLTFEVVSLAHNLIYFGFYSFSELLRLTRTLLGIIDCVQGPPAMLQAYEDPGGKNVR RSIQGVGHMMSTMVLSRKQSVFSAPSLSAGASAAEPLDRSKFEENEDIVVMETKLKIL EILQFILNVRLDYRISYLLSVFKKEFVEVFPMQDSGADGTAPAFDSTTANMNLDRIGE QAEAMFGVGKTSSMLEVDDEGGRMFLRVLIHLTMHDYAPLVSGALQLLFKHFSQRQEA MHTFKQVQLLISAQDVENYKVIKSELDRLRTMVEKSELWVDKKGSGKGEEVEAGTAKD KKERPTDEEGFLHPPGEKSSENYQIVKGILERLNKMCGVGEQMRKKQQRLLKNMDAHK VMLDLLQIPYDKGDAKMMEILRYTHQFLQKFCAGNPGNQALLHKHLHLFLTPGLLEAE TMQHIFLNNYQLCSEISEPVLQHFVHLLATHGRHVQYLDFLHTVIKAEGKYVKKCQDM IMTELTNAGDDVVVFYNDKASLAHLLDMMKAARDGVEDHSPLMYHISLVDLLAACAEG KNVYTEIKCTSLVPLEDVVSVVTHEDCITEVKMAYVNFVNHCYVDTEVEMKEIYTSNH IWTLFENFTLDMARVCSKREKRVADPTLEKYVLSVVLDTINAFFSSPFSENSTSLQTH QPVVVQLLQSTTRLLECPWLQQQHKGSVEACIRTLAMVAKGRAILLPMDLDAHISSML SSGASCAAAAQRNASSYKATTRAFPRVTPTANQWDYKNIIEKLQDIITALEERLKPLV QAELSVLVDVLHWPELLFLEGSEAYQRCESGGFLSKLIQHTKDLMESEEKLCIKVLRT LQQMLVKKTKYGDRGNQLRKMLLQNYLQNRKSTSRGDLPDPIGTGLDPDWSAIAATQC RLDKEGATKLVCDLITSTKNEKIFQESIGLAIHLLDGGNTEIQKSFHNLMMSDKKSER FFKVLHDRMKRAQQETKSTVAVNMNDLGSQPHEDREPVDPTTKGRVASFSIPGSSSRY SLGPSLRRGHEVSERVQSSEMGTSVLIMQPILRFLQLLCENHNRDLQNFLRCQNNKTN YNLVCETLQFLDIMCGSTTGGLGLLGLYINEDNVGLVIQTLETLTEYCQGPCHENQTC IVTHESNGIDIITALILNDISPLCKYRMDLVLQLKDNASKLLLALMESRHDSENAERI LISLRPQELVDVIKKAYLQEEERENSEVSPREVGHNIYILALQLSRHNKQLQHLLKPV KRIQEEEAEGISSMLSLNNKQLSQMLKSSAPAQEEEEDPLAYYENHTSQIEIVRQDRS MEQIVFPVPGICQFLTEETKHRLFTTTEQDEQGSKVSDFFDQSSFLHNEMEWQRNVRS MPLIYWFSRRMTLWGSISFNLAVFINIIIAFFYPYMEGASTGVLDSPLISLLFWILIC FSIAALFTKRYSIRPLIVALILRSIYYLGIGPTLNILGALNLTNKIVFVVSFVGNRGT FIRGYKAMVMDMEFLYHVGYILTSVLGLFAHELFYSILLFDLIYREETLFNVIKSVTR NGRSILLTALLALILVYLFSIVGFLFLKDDFILEVDRLPNNHSTASPLGMPHGAAAFV DTCSGDKMDCVSGLSVPEVLEEDRELDSTERACDTLLMCIVTVMNHGLRNGGGVGDIL RKPSKDESLFPARVVYDLLFFFIVIIIVLNLIFGVIIDTFADLRSEKQKKEEILKTTC FICGLERDKFDNKTVSFEEHIKLEHNMWNYLYFIVLVRVKNKTDYTGPESYVAQMIKN KNLDWFPRMRAMSLVSNEGEGEQNEIRILQDKLNSTMKLVSHLTAQLNELKEQMTEQR KRRQRLGFVDVQNCISR" polyA_signal 8814..8819 /gene="ITPR3" polyA_site 8833 /gene="ITPR3" /note="18 A residues" BASE COUNT 1989 a 2601 c 2519 g 1724 t ORIGIN 1 cgccccccac gccctgggcc ccggagggcc gcagccatga gtgaaatgtc cagctttctt 61 cacatcgggg acatcgtctc cctgtacgcc gagggctccg tcaatggctt catcagcact 121 ttggggctgg tggatgaccg ctgtgtggtg gagcccgcgg ccggggacct ggacaacccc 181 cctaagaagt tccgtgactg cctcttcaag gtgtgcccca tgaaccgcta ctcggcccag 241 aagcagtact ggaaggccaa gcagactaag caggacaagg agaagatcgc tgatgtggtg 301 ttgctgcaga agctgcagca tgcggcgcag atggagcaga agcaaaatga cacggagaac 361 aagaaggtgc atggggatgt cgtgaagtat ggcagtgtga tccagctcct gcacatgaag 421 agcaacaagt acctgacagt gaacaagcgg cttccggcct tgctggagaa gaacgccatg 481 cgggtgactc tggatgccac aggcaacgag ggttcctggc tcttcatcca gcccttctgg 541 aagctgcgga gcaacgggga caacgtggtc gtgggggaca aggtgatcct gaatcctgtc 601 aatgccgggc agcctctgca tgccagcaat tacgagctca gcgacaacgc cggctgcaag 661 gaggtcaatt ctgtgaactg caacaccagc tggaagatca acctgtttat gcagtttcgg 721 gaccacctgg aggaggtgtt gaaaggggga gacgtggtgc ggctgttcca tgcggagcag 781 gagaagttcc tgacgtgtga cgagtacaag ggcaagctgc aggtgttcct gcgaactaca 841 ctgcgccagt ctgccacctc ggccaccagc tccaatgctc tctgggaggt ggaggtggtc 901 caccacgacc cctgccgtgg aggagctggg cactggaatg gcttgtaccg cttcaagcac 961 ctggctacag gcaactacct ggctgctgag gagaacccca gttacaaagg tgatgcctca 1021 gatcccaagg cagcaggaat gggggcacag ggccgcacag gccgcaggaa tgctggggag 1081 aagatcaagt actgcctggt ggctgtgcct catggcaatg acatcgcctc tctctttgag 1141 ctggacccca ccaccttgca gaaaaccgac tctttcgtgc cccggaactc gtacgtccgg 1201 ctgcggcacc tctgcaccaa cacgtggatt cagagcacca atgtgcccat tgacatcgag 1261 gaggagcggc ccatccggct catgctgggc acctgcccca ccaaggagga caaggaggcc 1321 tttgccatcg tgtcagtgcc cgtgtctgag atccgagacc tggactttgc caatgacgcc 1381 agctccatgc tggccagtgc cgtggagaaa ctcaacgagg gcttcatcag ccagaatgac 1441 cgcaggtttg tcatccagct gctggaagac ctggtgttct ttgtcagcga tgtccccaac 1501 aatgggcaga atgtcctgga catcatggtc actaagccca accgggaacg gcagaagctg 1561 atgagggagc agaacatcct caaacaggtc tttggcattc tgaaggtccc gttccgtgag 1621 aaggggggtg aaggtcccct ggtgcggctg gaggagctgt cagaccagaa gaacgccccc 1681 taccagcaca tgttccgcct gtgctaccgt gtgttgcggt attcccagga ggactaccgc 1741 aagaaccagg agcacattgc caagcagttt gggatgatgc agtcccagat tggctacgac 1801 atcctggccg aggacaccat cactgccctg ctgcacaaca accgcaagct cctggaaaag 1861 cacatcacca agaccgaggt ggagaccttc gtcagccttg tgcgcaagaa ccgggagccc 1921 aggttcctgg actacctctc tgacctgtgt gtgtccaacc acatcgccat ccccgtcacc 1981 caagagctca tctgcaagtg tgtgctggac cccaagaaca gtgacattct catccggacc 2041 gagcttcggc ccgtgaagga gatggcccaa tcccacgagt acctgagcat cgagtactca 2101 gaagaggaag tgtggctcac gtggactgac aagaataacg agcatcatga gaagagtgtg 2161 aggcagctgg cccaggaggc gcgggccggc aacgcccacg acgagaatgt gctcagctac 2221 tacaggtacc agctgaagct ctttgcccgc atgtgcttgg accgccagta cttggccatc 2281 gacgagatct cccagcagct gggcgtggac ctgattttcc tgtgcatggc agacgagatg 2341 ctgccctttg acctgcgcgc ctccttctgc cacctgatgc tgcacgtgca cgtggaccgt 2401 gacccccagg agctggtcac gccggtcaag tttgcccgtc tctggactga gatccccaca 2461 gccatcacca tcaaggacta tgattccaac ctcaacgcgt cccgagatga caagaagaac 2521 aagtttgcca acaccatgga gttcgtggag gactacctca acaatgtagt cagcgaggcc 2581 gtgccctttg ccaacgagga gaagaacaag ctcacttttg aggtggtcag cctggcgcac 2641 aatctcatct acttcggctt ctacagcttc agcgagctgc tgcggctcac tcgcacactg 2701 ctgggcatca tcgactgtgt gcaggggccc ccggccatgc tgcaggccta tgaggacccc 2761 ggtggcaaga atgtgcggcg gtccatccag ggcgtggggc acatgatgtc caccatggtg 2821 ctgagccgca agcagtccgt cttcagtgcc cccagcctgt ctgctggggc cagtgctgct 2881 gagccgctgg acagaagcaa gtttgaggag aatgaggaca ttgtggtgat ggagaccaag 2941 ctgaagatcc tggaaatcct tcagttcatc ctcaacgtcc gcctggatta ccgcatatcc 3001 tacctgctgt ctgtcttcaa gaaggagttt gtggaggtgt ttcccatgca ggacagtggg 3061 gctgatggca cagcccctgc cttcgactct accactgcca acatgaacct ggatcgcatc 3121 ggggagcagg cggaggccat gtttggagtg gggaagacaa gcagcatgct ggaggtggat 3181 gacgagggcg gccgcatgtt cctgcgcgtg ctcatccacc tcaccatgca cgactatgcg 3241 ccactggtct cgggtgccct gcagctgctc ttcaagcact tcagccagcg ccaggaggcc 3301 atgcacacct tcaagcaggt tcagctgctg atctcagcgc aggacgtgga gaactacaag 3361 gtgatcaagt cggagctgga ccggctgcgg accatggtgg agaagtcaga gctgtgggtg 3421 gacaagaagg gcagtggcaa gggtgaggag gtggaggcag gcaccgccaa ggacaagaaa 3481 gagcgtccca cggacgagga gggctttctg cacccaccag gggagaaaag cagtgagaac 3541 taccagatcg tcaagggcat cctggaaagg ctgaacaaga tgtgcggggt tggggagcaa 3601 atgaggaaga agcagcaacg gctgctgaag aacatggatg cccacaaggt catgctggac 3661 ctgctgcaga tcccctatga caagggtgat gccaagatga tggagatcct gcgctacacg 3721 caccagttcc tgcagaagtt ctgtgcaggg aaccccggca accaggccct gctgcacaaa 3781 cacctgcacc tcttcctcac gccagggctc ctggaggcag agaccatgca gcacatcttc 3841 ctgaacaact atcagctctg ctccgagatc agcgagcctg tgttgcagca cttcgtgcac 3901 ctgctggcca cgcacgggcg ccatgtgcag tacctggact tcctgcacac cgtcattaag 3961 gccgagggca agtacgtcaa gaagtgccag gacatgatca tgactgagct gaccaatgca 4021 ggtgacgatg tggtcgtgtt ctacaatgat aaggcatcgc tggcccacct gctggacatg 4081 atgaaggccg cccgcgacgg cgtggaggac cacagccccc tcatgtacca catttccctg 4141 gtggacctgc tggccgcctg tgccgagggc aaaaacgtct acactgagat caagtgcacc 4201 tccctcgtgc cgctggagga cgtggtgtct gtggtgacgc atgaggactg catcactgag 4261 gtgaaaatgg cctatgtgaa cttcgtgaac cactgctacg tggacacgga ggtggagatg 4321 aaggagatct acaccagcaa ccacatctgg acgctctttg agaacttcac cctggacatg 4381 gctcgggtct gcagcaagcg tgagaagcgc gtggctgacc ccaccttgga gaagtacgtg 4441 ctgagcgttg tgctggacac catcaacgcc ttcttcagct ccccattctc tgagaacagc 4501 acttccctgc agacacacca gccggttgtg gtgcagctgc tgcagtctac cacacgcctc 4561 ctcgagtgtc cgtggctaca gcagcagcac aagggctccg tggaggcctg catccggacc 4621 ctcgccatgg tggccaaggg ccgggccatc ttgctgccca tggacctgga tgcccacatc 4681 agctcgatgc tcagcagtgg agccagctgt gcagctgccg cccagcggaa cgcctccagc 4741 tacaaggcaa ccacgcgggc cttcccccgc gtcaccccca ccgccaacca gtgggactac 4801 aagaacatca ttgagaagct gcaggacatc atcacagccc tggaggagcg gctgaagccc 4861 ctggtacagg ctgagctgtc cgtgctggtg gatgtcctgc actggcctga gctgctcttc 4921 ctggagggca gtgaggccta ccagcgctgc gagagtgggg gcttcctgtc caagctgatc 4981 cagcacacca aggacctcat ggagtcggag gagaagctgt gcatcaaggt gctgcggacc 5041 ctgcagcaga tgctcgtcaa gaagaccaag tacggggacc ggggcaacca gctgcgcaag 5101 atgctgctgc aaaactacct ccagaaccgg aagtccacct cgcgggggga ccttcccgac 5161 cccataggca ctggcctgga cccagactgg tcggcaatcg cagccaccca gtgccggctg 5221 gacaaggagg gggccaccaa gttggtatgc gacctcatca ccagcaccaa gaacgagaag 5281 atcttccagg agagcatcgg cctggccatc cacctgctgg atggtggcaa cacagagatc 5341 cagaaatcct tccacaacct gatgatgagt gacaagaagt cagagcgctt cttcaaggtg 5401 ctgcacgacc gcatgaagcg ggcccagcag gagaccaagt ccacggtggc agtcaacatg 5461 aatgacctgg gcagccagcc acatgaggac cgcgagccag tcgaccccac caccaaaggc 5521 cgcgtggcct ccttctcgat acctggctcc tcatcccgct actcgctggg ccccagcctg 5581 cgccgggggc acgaggtgag cgaacgtgtg cagagcagtg agatgggcac atccgtgctc 5641 atcatgcagc ccatcctgcg ctttctgcag ctgctgtgtg agaaccacaa ccgggacctg 5701 cagaacttcc tgcgctgtca gaacaacaaa accaactaca acttggtatg cgagacgctg 5761 cagttcctgg acatcatgtg cggcagcacc acgggcggcc tggggctgct ggggctctac 5821 atcaatgagg acaacgtggg cctcgtcatc cagaccttgg agaccctcac tgagtactgc 5881 cagggcccct gccatgagaa ccagacttgc attgtgactc acgagtccaa tggcatagac 5941 atcatcaccg cactgatcct caatgacatc agccccctgt gcaagtaccg catggatctg 6001 gtgctgcagc tcaaggacaa tgcctccaag ctgctcctgg ctctgatgga gagccggcat 6061 gacagtgaaa atgctgagcg aatcctcatc agcctgcggc cccaggagct ggtggacgtc 6121 atcaagaagg cctacctgca ggaggaagag cgtgagaact cggaggtgag cccacgtgaa 6181 gtgggccata acatctatat cctggcgctg cagctctcca ggcacaataa acagctgcag 6241 cacctgctga agccggtgaa gcgcattcaa gaggaggagg ccgagggtat ctcttccatg 6301 ctcagcctca acaacaagca gctgtcacag atgctcaagt cctcagcgcc agcacaggag 6361 gaggaggaag accccctggc ctactatgag aaccacacgt cccagatcga gattgtgcgg 6421 caggaccgca gcatggagca gatcgtgttc ccagtgcccg gcatctgcca gttcctgacg 6481 gaggaaacca agcaccggct cttcaccact actgagcagg acgagcaggg cagcaaagtg 6541 agcgacttct tcgaccagtc ctccttcctg cacaacgaga tggagtggca gcgcaacgtc 6601 cgcagcatgc cgctgatcta ctggttctcc cgccgcatga ccctgtgggg cagcatctcc 6661 ttcaacctgg ccgtgtttat caacatcatc attgccttct tctaccctta catggagggc 6721 gcgtccacag gcgtgctgga ctcccctctc atctcattgc tcttctggat cctcatctgc 6781 ttctccatcg cggccctgtt caccaagcgc tacagcatcc gccccctcat cgtggcgctc 6841 atcctgcgct ccatctacta tctgggcatc gggcccacac tcaacatcct gggtgccctc 6901 aatctgacca acaagatcgt gtttgtggtg agcttcgtgg gcaaccgtgg caccttcatc 6961 cggggctata aggccatggt catggacatg gaattcctct accacgtggg ctacatcctg 7021 accagtgtcc tgggcctctt tgctcatgag ctgttctaca gcatcctgct ctttgacctc 7081 atctaccgcg aggagacgct gttcaacgtc atcaagagtg tgacccgcaa tggccgctcc 7141 atcctgctga cagccctgct ggccctcatc ctggtctacc tcttctccat cgtcggcttc 7201 ctcttcctca aggatgactt cattctcgag gtcgaccggc tgcccaacaa ccactccaca 7261 gccagccccc tggggatgcc acatggagct gctgcatttg tggacacctg cagtggggac 7321 aagatggact gtgtctcagg gctctcggtg cctgaggtcc tggaagagga cagggagctg 7381 gacagcacag agcgggcctg tgacactctg ttgatgtgca tcgtcactgt catgaaccat 7441 gggctacgca acggtggtgg cgtgggcgac attctccgca agccctccaa agatgagtct 7501 ctcttcccag cccgagtggt ctatgacctc ctgttcttct tcatcgtcat catcattgtg 7561 ctgaacctca tctttggggt aatcatcgac accttcgctg acctgcgtag tgagaagcag 7621 aagaaggagg agattcttaa gacgacatgc ttcatctgtg gtctggagag ggacaagttt 7681 gataacaaga cagtgtcatt tgaggaacac atcaagctgg agcacaacat gtggaactac 7741 ttgtacttca ttgtgctggt ccgcgtgaag aacaagaccg actacacggg ccctgagagc 7801 tacgtggccc agatgatcaa gaacaagaac ctggactggt tcccccggat gcgggccatg 7861 tcccttgtca gcaatgaggg cgagggggag cagaatgaga ttcggattct ccaggacaag 7921 ctcaactcca ccatgaagct ggtgtcccac ctcactgccc agctcaacga gctcaaggag 7981 cagatgacgg agcagcggaa acgcaggcaa cgcctaggct ttgtggatgt ccagaactgc 8041 attagccgct gaggagagcc accgaaggcc ccaacagggg atgctcatca ctggagactg 8101 cgactgggaa gaacactgcc ccctccctcg ggttgggtgg cccagccagc tggccagcct 8161 ccactcccac tctgccagac accctgacac ccacccaggc tttgaagagc atggaggggg 8221 agcctcagag ctgacagtcc tgcttagagc ccttaaaaag acttgaaagt tcactgggac 8281 tcagtttacc ttaatgcctt agcagaagat aaatcctacc tagagacctt tgttccttaa 8341 agcaataact gacaactctt tgtagtcctc cttgtgggta gttaagagtg gggtcacccc 8401 tttaactcca agcactacat tttggcggct gcggcctctg ggggaggtgg cagttatgct 8461 gttactagtg attttagggc tttgttattt aacttatttc aagggtgctg tgctcagccc 8521 tgcccatggc tgtgcagctc cctccgtgcc tcagatctgc tgtagccagt gcagacctca 8581 ctgtcgtgtc catgccaccc ccggcatggc tccaggtggc ctggtgactc catgatggac 8641 gatcttgctc ccaggacctg cctcttccca ggcttcctgg ggaagagttg tacgcccagg 8701 caacaagggc tgagctgcgc ttgcgtggct gtttcatgac cgcttgtttt tctccttttg 8761 gtgtaatgtt ttacaaatcc tttggcctga gaactaatat gttaattgcc ttaaataaat 8821 taatagaaat cta // LOCUS HUMIPL 2881 bp mRNA PRI 05-OCT-1995 DEFINITION Homo sapiens inducible protein mRNA, complete cds. ACCESSION L47738 NID g1009098 KEYWORDS . SOURCE Homo sapiens blood cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2881) AUTHORS Roerig,C.K. TITLE New inducible mRNA isolated from human lymphocytes JOURNAL Unpublished (1996) FEATURES Location/Qualifiers source 1..2881 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lymphocytes" /tissue_type="blood" CDS 1005..1715 /note="inducible protein" /codon_start=1 /db_xref="PID:g1009099" /translation="MAGSVLLDKRFRAECKNYGVIIPYPPSNRYETLLKQRHVQLLGR SIDLNRLITQRISAAMYKSLDQAISRFESEDLTSIVELEWLLEINRVTHRLLCKHMTL DSFDAMFTHRLLCKHMTLDSFDAMFREANHNVSAPYGRITLHVFWELNFDFLPNYCYN GSTNRFVRTATSFHPRTTTRQTCQRPALLPLWIQASQHCLQPHLQLLQEFRGATSFQD YPADSWVIRASLWSWRNC" BASE COUNT 698 a 806 c 736 g 641 t ORIGIN 1 gtcagacgag gagtatcgcg agctcttcga cctagccctg cggggtctgc agcttctatc 61 caagtggagc gcccacgtca tggaggtgta ctcttggaag ctggttcatc ccacagacaa 121 gttctgcaac aaggactgtc ctggcaccgc ggaggaatat gcgcgcgcca cacgctacaa 181 ttacaccagt gaggaaaaat ttgccttcgt tgaggtgatc gccatgatca aaggcctgca 241 ggtgctcatg ggcaggatgg agagcgtctt caaccaggcc atcaggaaca ccatctacgc 301 ggcattgcag gacttcgccc aggtgacgct gcgtgagccc ctgcggcagg cggtacggaa 361 gaagaagaat gtcctcatca gcgtcctaca ggcaattcga aagaccatct gtgactggga 421 gggagggcga gaacccccta atgacccatg cttgagaggg gagaaggacc ccaaaggtgg 481 atttgatatc aaggtgcccc ggcgtgctgt ggggccatcc agcacacagc tgtacatggt 541 gcggaccatg cttgaatcac tcatttcaga caaaagcggc tccaagaaga ccctgaggag 601 cagcctggat ggacccattg tcctccccat agaggacttt cacaaacagt ccttcttctt 661 cacacatctg ctcaacatca gtgaagccct gcagcagtgt tgtgacctct cccagctctg 721 gttccgagaa ttcttcctgg agttaaccat gggccgacga atccagttcc ccatcgagat 781 gtccatgccc tggattctaa cggaccatat ctggaaacca aagaaccttc catgatggag 841 tatgtcctct accctctgga tctctgtaca acgacagcgc ctactatgct ctgaccaagt 901 ttaaaaagca gttcctgtac gatgagatag aagctgaggt gaacctgtgt tttgatcagt 961 ttgtctacaa gctggcagac cagatctttg cttactacaa agccatggct ggcagtgtcc 1021 tgttggataa acgttttcga gctgagtgta agaattatgg cgtcatcatt ccgtatccac 1081 cgtccaatcg ctatgaaaca ctgctgaagc agagacacgt ccagctgttg ggtagatcaa 1141 ttgacttgaa cagactcatt acccagcgca tctctgccgc catgtataaa tccttggacc 1201 aagctatcag ccgctttgag agtgaggacc tgacctccat tgtggagctg gagtggctgc 1261 tggagattaa ccgcgtcacg catcggctgc tctgtaagca tatgacgctg gacagcttcg 1321 atgccatgtt cacgcatcgg ctgctctgta agcatatgac gctggacagc ttcgatgcca 1381 tgttccgaga ggccaatcac aatgtgtccg ccccctatgg ccgtatcacc ctgcatgtct 1441 tctgggaact gaactttgac tttctcccca actactgcta caatgggtcc actaaccgtt 1501 ttgtgcggac tgccacttcc tttcacccaa gaaccacaac gagacaaacc tgccaacgtc 1561 cagccttatt acctctatgg atccaagcct ctcaacattg cctacagcca catctacagc 1621 tcctacagga atttcgtggg gccacctcat ttcaagacta tcctgcagac tcctgggtta 1681 tcagggcatc gctgtggtca tggaggaact gctaaagatt gtgaagagct tgctccaagg 1741 aaccattctc cagtatgtga aaacactgat agaggtgatg cccaagatat gccgcttgcc 1801 ccgacatgag tatggctccc cagggatcct ggagttcttc caccaccagc tgaaggacat 1861 cattgagtac gcagagctca aaacagacgt gttccagagc ctgagggaag tgggcaatgc 1921 catcctcttc tgcctcctca tagagcaagc tctgtctcag gaggaggtct gcgatttgct 1981 ccatgccgca cccttccaaa acatcttgcc tagagtctac atcaaagagg gggagcgcct 2041 ggaggtccgg atgaaacgtc tggaagccaa gtatgccccg ctccacctgg tccctctgat 2101 cgagcggctg gggacccctc agcaaatcgc cattgctcgc gagggtgacc tcctgaccaa 2161 ggagcggctg tgctgtggcc tgtccatgtt cgaggtcatc ctgacccgca ttcggagcta 2221 cctgcaggac cccatctggc ggggccaacc ggccaaccaa tggcgtatgc acttcgatga 2281 gtgttggatt ccaccggctg tggagcgcca tcgacagttc gtgtactgca tccctgtggg 2341 aacaaacgag ttcacagttg agacagtgtt tcggcgatgg cttgaactgg gctggttgct 2401 ccatcattgt cctgctgggc cagcacggtc gcttgacctg ttcgacttct gttaccacct 2461 gctaaaagtg cagaggcagg acggaaggat gaaatcatta agaatgtgcc cctgaagaag 2521 atggccgacc ggatcaggaa gtatcagatc ttgaacaatg aggtttttgc catcctgaac 2581 aaatacatga agtccgtgga gacagacagt tccactgtgg agcatgtgcg ctgcttccaa 2641 ccacccatcc accagtcctt ggccaccact tgctaagcag agatcctgca gacccttatc 2701 tggaggagga agagaagcag gagagagaaa ccacagccag cctgccatag gatccaactg 2761 gacaacgtgt gggatggacc tggaaacaag cacctcccca aacacatcac cactccctag 2821 ggcggggcct gtgcatgctc tcccatgaca tctccatgct ggtttctcca tagcataaat 2881 g // LOCUS HUMIPLAS 3639 bp mRNA PRI 19-JUL-1994 DEFINITION Human I-plastin mRNA, complete cds. ACCESSION L20826 NID g405229 KEYWORDS I-plastin. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3639) AUTHORS Lin,C.-S., Shen,W., Chen,Z.P., Tu,Y.-H. and Matsudaira,P. TITLE Identification of I-plastin, a human fimbrin isoform expressed in intestine and kidney JOURNAL Mol. Cell. Biol. 14, 2457-2467 (1994) MEDLINE 94187717 FEATURES Location/Qualifiers source 1..3639 /organism="Homo sapiens" /db_xref="taxon:9606" 5'UTR 1..97 CDS 98..1987 /codon_start=1 /product="I-plastin" /db_xref="PID:g405230" /translation="MENSTTTISREELEELQEAFNKIDIDNSGYVSDYELQDLFKEAS LPLPGYKVREIVEKILSVADSNKDGKISFEEFVSLMQELKSKDISKTFRKIINKREGI TAIGGTSTISSEGTQHSYSEEEKVAFVNWINKALENDPDCKHLIPMNPNDDSLFKSLA DGILLCKMINLSEPDTIDERAINKKKLTPFTISENLNLALNSASAIGCTVVNIGASDL KEGKPHLVLGLLWQIIKVGLFADIEISRNEALIALLNEGEELEELMKLSPEELLLRWV NYHLTNAGWHTISNFSQDIKDSRAYFHLLNQIAPKGGEDGPAIAIDLSGINETNDLKR AGLMLQEADKLGCKQFVTPADVVSGNPKLNLAFVANLFNTYPCLHKPNNNDIDMNLLE GESKEERTFRNWMNSLGVNPYINHLYSDLADALVIFQLYEMIRVPVNWSHVNKPPYPA LGGNMKKIENCNYAVELGKNKAKFSLVGIAGQDLNERNSTLTLALVWQLMRRYTLNVL SDLGEGEKVNDEIIIKWVNQTLKSANKKTSISSFKDKSISTSLPVLDLIDAIAPNAVR QEMIRRENLSDEDKLNNAKYAISVARKIGARIYALPDDLVEVKPKMVMTVFACLMGKG LNRIK" 3'UTR 1988..3639 polyA_signal 3220..3225 BASE COUNT 1151 a 636 c 679 g 1173 t ORIGIN 1 aagtcttctg aattgttttt ctggacttcc aaatctcaag tgataagacc agcagaagca 61 gatataaaga cctgaagata gtcttttctg tccaaagatg gaaaacagta ctactaccat 121 ttctcgggag gagcttgaag aactacaaga ggcatttaat aaaatagata ttgacaatag 181 tgggtatgtc agtgactatg aacttcaaga cctgtttaag gaagcaagcc ttcctctgcc 241 tggctacaag gtgcgcgaga ttgtggagaa aattctatca gttgctgaca gcaacaaaga 301 tggcaaaatc agttttgaag agtttgtgtc actaatgcaa gaattaaaaa gcaaagatat 361 cagcaaaaca ttccgaaaaa taattaacaa gagggaaggg attactgcta ttggaggaac 421 ttcaactatt tccagtgagg gcacacagca ttcttattca gaggaagaaa aagtggcttt 481 tgttaactgg ataaacaaag ccctggagaa tgaccctgac tgtaagcatc ttatacccat 541 gaatcccaat gatgatagtc ttttcaagtc acttgcagat ggcatccttc tttgcaaaat 601 gatcaactta tctgaaccag atacaattga tgaaagagcc atcaataaga aaaagctcac 661 gccattcact atttctgaaa atttaaacct agctctgaat tctgcctcag ccattggttg 721 tacagtggtc aacattggtg catcagatct caaagaagga aaacctcact tggtcttggg 781 acttctctgg cagatcatca aagttggcct ttttgctgat attgagattt ccaggaatga 841 agctctgatt gcattgttaa atgaaggtga ggaactagag gagctgatga agctttctcc 901 cgaggaatta ctgctgcgat gggtgaacta ccatctgacc aatgcaggat ggcataccat 961 cagcaacttc agccaagaca ttaaggactc gagagcctat tttcatctgc ttaatcagat 1021 tgcccctaaa ggtggggaag atggacctgc cattgccatt gacctttcag gaattaatga 1081 gacaaatgac ctgaagcgtg ctggactcat gcttcaagaa gcagataaac tgggctgcaa 1141 acagtttgtt actcctgcag atgtggtttc aggcaatcct aaacttaatt tagcttttgt 1201 agctaatttg tttaacacat acccgtgcct gcacaagccg aataataatg acatcgatat 1261 gaatttactg gaaggagaga gcaaggaaga gagaacattt cggaactgga tgaattcctt 1321 gggagtcaac ccatacatta atcatttgta cagtgacctt gcagatgctt tagtgatctt 1381 tcagctctat gagatgatcc gagtgccagt caactggagc catgtcaaca aacctcctta 1441 tcctgccctt ggagggaaca tgaagaagat tgaaaactgt aactatgcag tggaacttgg 1501 gaagaacaag gccaaattct ccttggttgg cattgctggg caggacctaa atgaaaggaa 1561 ttcaacactt accctggcat tggtatggca gctgatgaga aggtacacat tgaatgtgtt 1621 atcggatctt ggagagggtg aaaaagtaaa tgatgaaatt ataattaaat gggtcaatca 1681 gactcttaaa agtgcaaaca aaaagacttc tatttccagc ttcaaggata aatctataag 1741 cacaagttta cctgtcctag atttaataga tgccattgcg ccaaatgcag ttcgtcaaga 1801 aatgatcagg agagaaaact tatctgatga ggacaagctg aacaatgcta aatacgccat 1861 ttcagttgct cgaaagatcg gtgcccggat atatgcatta cctgatgacc tcgtagaagt 1921 gaaaccaaag atggttatga cggtgtttgc atgcttaatg ggaaaaggac tgaacagaat 1981 aaaataatca tttcatatga ttttctgcca cattaaacat attgtatgcc tcacagttta 2041 caggattctg aaatgtagtg ggtgtaaaac cagagattat ttgtatgctc aaaatagtta 2101 tatattcatt aatgaattca atatcctgtt catactagtt agagctggtc agcctttttg 2161 ggtaacacag ttaatttacc aactgataca gataatagaa tatattcata atcaagctga 2221 tacttcatga ttaaattatt tttgttgctt aaaagtcgta ttagacaaga ctaaatcatt 2281 cttttttatg gttcaaaaaa gatgaataca aacgtttttg caggttctgc tgtgaaatgt 2341 ggtttgattt ttttggtgtg ttaattttga tcataaatgc attcatactc ataatccagt 2401 ttaatccttt tatttgcttc ctccaactat ttaaagtggt ccaaaaacac ttttctgtaa 2461 gtttctatac tgtctaaaac cttatggtga ccagaattgt ttattaatat caaacttttt 2521 tatatatgag aactaattct tgaataaacc ccaaagttca ctctcttgtt taagtagcag 2581 cagcttttta cttaaaattt aattttaact acattgatac tttacacatc ctagtttggt 2641 aacacagctt taactatgtc atgcaacata tatatgttgg taggatgtta ttagagagat 2701 atgtgtgcat atatattttt ttgcacctga atcacccagc ttttcataag tggtatgttt 2761 aattggtcat tcagccaacc atcagtattt tccccccacg acatgtgtaa cacttttcag 2821 tctgtggata tctgatacat taagatttct ttttataagt attcattttg aatgtgcata 2881 tagtcatttg accccttcca aatacttgta gccaaacatt ggctagaaca tcccaagata 2941 tgctgacact gtcctgttag cttcatatta tacttgctag tttaggtctc tatagaagcc 3001 ctatataatt tagaatatgc ccactgaata tctttaatag aaagtaacat aaagctagta 3061 ttcaatgtag agtattttca tatgtttttc acagcccgtt acaaattggc aatgtttggt 3121 taatgtttgt attacttgga aatcgctaca gcttggacta tttttttcta aatttttagc 3181 attagtccat ttctgctgct aacaattgaa tccagaaatc tactttctcc atcttccact 3241 gttagtgcca gtgagcaata ctgttgtgca acaaaaatgt cactttatct cagtgtgaat 3301 gagtagtcta aattcccttt ctaccattga tttaaatata tatattggta agagagactg 3361 cccatgtgtt tagaatagaa ttttttaaat gaaatgatca acaggtggaa tttgaaatat 3421 attcttctac aaaagagatt tctttccctt ttatattttg atgattgttt tcttaagatt 3481 aagatatgtt cttgctcttt tataagatta tttaaattat gtttccctct gatttttttt 3541 caccattgta tttactaagt tattggattt acatgaaatc tggcacttta gggtgttctt 3601 tttctcacag agtatattta ataaaaatgc tgtgtatat // LOCUS HUMIQGA 7573 bp mRNA PRI 06-JAN-1995 DEFINITION Homo sapiens ras GTPase-activating-like protein (IQGAP1) mRNA, complete cds. ACCESSION L33075 NID g536843 KEYWORDS ras GTPase-activating-like protein. SOURCE Homo sapiens placenta, liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7573) AUTHORS Weissbach,L., Settleman,J., Kalady,M.F., Snijders,A.J., Murthy,A.E., Yan,Y.X. and Bernards,A. TITLE Identification of a human rasGAP-related protein containing calmodulin-binding motifs JOURNAL J. Biol. Chem. 269 (32), 20517-20521 (1994) MEDLINE 94327627 FEATURES Location/Qualifiers source 1..7573 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Nalm6" /cell_type="pre-B cell" /tissue_type="placenta, liver" /map="15" 5'UTR 1..467 gene 468..5441 /gene="IQGAP1" CDS 468..5441 /gene="IQGAP1" /note="amino acid feature: IQ calmodulin-binding domains, aa 740 .. 865; amino acid feature: N-terminal repeats, aa 210 .. 680; amino acid feature: Sar1 homologous region, aa 920 .. 1657; amino acid feature: putative GAP catalytic domain, aa 1000 .. 1270" /codon_start=1 /product="ras GTPase-activating-like protein" /db_xref="PID:g536844" /translation="MSAADEVDGLGVARPHYGSVLDNERLTAEEMDERRRQNVAYEYL CHLEEAKRWMEACLGEDLPPTTELEEGLRNGVYLAKLGNFFSPKVVSLKKIYDREQTR YKATGLHFRHTDNVIQWLNAMDEIGLPKIFYPETTDIYDRKNMPRCIYCIHALSLYLF KLGLAPQIQDLYGKVDFTEEEINNMKTELEKYGIQMPAFSKIGGILANELSVDEAALH AAVIAINEAIDRRIPADTFAALKNPNAMLVNLEEPLASTYQDILYQAKQDKMTNAKNR TENSERERDVYEELLTQAEIQGNINKVNTFSALANIDLALEQGDALALFRALQSPALG LRGLQQQNSDWYLKQLLSDKQQKRQSGQTDPLQKEELQSGVDAANSAAQQYQRRLAAV ALINAAIQKGVAEKTVLELMNPEAQLPQVYPFAADLYQKELATLQRQSPEHNLTHPEL SVAVEMLSSVALINRALESGDVNTVWKQLSSSVTGLTNIEEENCQRYLDELMKLKAQA HAENNEFITWNDIQACVDHVNLVVQEEHERILAIGLINEALDEGDAQKTLQALQIPAA KLEGVLAEVAQHYQDTLIRAKREKAQEIQDESAVLWLDEIQGGIWQSNKDTQEAQKFA LGIFAINEAVESGDVGKTLSALRSPDVGLYGVIPECGETYHSDLAEAKKKKLAVGDNN SKWVKHWVKGGYYYYHNLETQEGGWDEPPNFVQNSMQLSREEIQSSISGVTAAYNREQ LWLANEGLITRLQARCRGYLVRQEFRSRMNFLKKQIPAITCIQSQWRGYKQKKAYQDR LAYLRSHKDEVVKIQSLARMHQARKRYRDRLQYFRDHINDIIKIQAFIRANKARDDYK TLINAEDPPMVVVRKFVHLLDQSDQDFQEELDLMKMREEVITLIRSNQQLENDLNLMD IKIGLLVKNKITLQDVVSHSKKLTKKNKEQLSDMMMINKQKGGLKALSKEKREKLEAY QHLFYLLQTNPTYLAKLIFQMPQNKSTKFMDSVIFTLYNYASNQREEYLLLRLFKTAL QEEIKSKVDQIQEIVTGNPTVIKMVVSFNRGARGQNALRQILAPVVKEIMDDKSLNIK TDPVDIYKSWVNQMESQTGEASKLPYDVTPEQALAHEEVKTRLDSSIRNMRAVTDKFL SAIVSSVDKIPYGMRFIAKVLKDSLHEKFPDAGEDELLKIIGNLLYYRYMNPAIVAPD AFDIIDLSAGGQLTTDQRRNLGSIAKMLQHAASNKMFLGDNAHLSIINEYLSQSYQKF RRFFQTACDVPELQDKFNVDEYSDLVTLTKPVIYISIGEIINTHTLLLDHQDAIAPEH NDPIHELLDDLGEVPTIESLIGESSGNLNDPNKEALAKTEVSLTLTNKFDVPGDENAE MDARTILLNTKRLIVDVIRFQPGETLTEILETPATSEQEAEHQRAMQRRAIRDAKTPD KMKKSKSVKEDSNLTLQEKKEKIQTGLKKLTELGTVDPKNKYQELINDIARDIRNQRR YRQRRKAELVKLQQTYAALNSKATFYGEQVDYYKSYIKTCLDNLASKGKVSKKPREMK GKKSKKISLKYTAARLHEKGVLLEIEDLQVNQFKNVIFEISPTEEVGDFEVKAKFMGV QMETFMLHYQDLLQLQYEGVAVMKLFDRAKVNVNLLIFLLNKKFYGK" 3'UTR 5442..7573 BASE COUNT 2320 a 1570 c 1680 g 2003 t ORIGIN 1 ggtattaaaa ctgatctttt gacatttttg acaatgttct tataaattac tttctttttt 61 atcatatatg gatgggatga agcacagagt aagatagagt gcacagcaaa ggggatctgc 121 ccctcctatc tgtccaatac cccacaggtt ttggtgataa tcttgggcaa tgttccagtc 181 aaacctgcct cccacttctc actaaagtta gtgaacatgt gacccacatt ccccaaataa 241 gagcctctta taaactccat tcttggcttt ttcattcata gagatagcta ttttatgaga 301 catagataaa gcatttttta gtgatgtgca cgatgccttt tttcttaatt attaacttct 361 caaaacataa acacattgga ggcacttaat aaagggagct gtacgtaccg ccgtccgcgc 421 ctccaaggtt tcacggcttc ctcagcagag actcgggctc gtccgccatg tccgccgcag 481 acgaggttga cgggctgggc gtggcccggc cgcactatgg ctctgtcctg gataatgaaa 541 gacttactgc agaggagatg gatgaaagga gacgtcagaa cgtggcttat gagtaccttt 601 gtcatttgga agaagcgaag aggtggatgg aagcatgcct aggggaagat ctgcctccca 661 ccacagaact ggaggagggg cttaggaatg gggtctacct tgccaaactg gggaacttct 721 tctctcccaa agtagtgtcc ctgaaaaaaa tctatgatcg agaacagacc agatacaagg 781 cgactggcct ccactttaga cacactgata atgtgattca gtggttgaat gccatggatg 841 agattggatt gcctaagatt ttttacccag aaactacaga tatctatgat cgaaagaaca 901 tgccaagatg tatctactgt atccatgcac tcagtttgta cctgttcaag ctaggcctgg 961 cccctcagat tcaagaccta tatggaaagg ttgacttcac agaagaagaa atcaacaaca 1021 tgaagactga gttggagaag tatggcatcc agatgcctgc ctttagcaag attgggggca 1081 tcttggctaa tgaactgtca gtggatgaag ccgcattaca tgctgctgtt attgctatta 1141 atgaagctat tgaccgtaga attccagccg acacatttgc agctttgaaa aatccgaatg 1201 ccatgcttgt aaatcttgaa gagcccttgg catccactta ccaggatata ctttaccagg 1261 ctaagcagga caaaatgaca aatgctaaaa acaggacaga aaactcagag agagaaagag 1321 atgtttatga ggagctgctc acgcaagctg aaattcaagg caatataaac aaagtcaata 1381 cattttctgc attagcaaat atcgacctgg ctttagaaca aggagatgca ctggccttgt 1441 tcagggctct gcagtcacca gccctggggc ttcgaggact gcagcaacag aatagcgact 1501 ggtacttgaa gcagctcctg agtgataaac agcagaagag acagagtggt cagactgacc 1561 ccctgcagaa ggaggagctg cagtctggag tggatgctgc aaacagtgct gcccagcaat 1621 atcagagaag attggcagca gtagcactga ttaatgctgc aatccagaag ggtgttgctg 1681 agaagactgt tttggaactg atgaatcccg aagcccagct gccccaggtg tatccatttg 1741 ccgccgatct ctatcagaag gagctggcta ccctgcagcg acaaagtcct gaacataatc 1801 tcacccaccc agagctctct gtcgcagtgg agatgttgtc atcggtggcc ctgatcaaca 1861 gggcattgga atcaggagat gtgaatacag tgtggaagca attgagcagt tcagttactg 1921 gtcttaccaa tattgaggaa gaaaactgtc agaggtatct cgatgagttg atgaaactga 1981 aggctcaggc acatgcagag aataatgaat tcattacatg gaatgatatc caagcttgcg 2041 tggaccatgt gaacctggtg gtgcaagagg aacatgagag gattttagcc attggtttaa 2101 ttaatgaagc cctggatgaa ggtgatgccc aaaagactct gcaggcccta cagattcctg 2161 cagctaaact tgagggagtc cttgcagaag tggcccagca ttaccaagac acgctgatta 2221 gagcgaagag agagaaagcc caggaaatcc aggatgagtc agctgtgtta tggttggatg 2281 aaattcaagg tggaatctgg cagtccaaca aagacaccca agaagcacag aagtttgcct 2341 taggaatctt tgccattaat gaggcagtag aaagtggtga tgttggcaaa acactgagtg 2401 cccttcgctc ccctgatgtt ggcttgtatg gagtcatccc tgagtgtggt gaaacttacc 2461 acagtgatct tgctgaagcc aagaagaaaa aactggcagt aggagataat aacagcaagt 2521 gggtgaagca ctgggtaaaa ggtggatatt attattacca caatctggag acccaggaag 2581 gaggatggga tgaacctcca aattttgtgc aaaattctat gcagctttct cgggaggaga 2641 tccagagttc tatctctggg gtgactgccg catataaccg agaacagctg tggctggcca 2701 atgaaggcct gatcaccagg ctgcaggctc gctgccgtgg atacttagtt cgacaggaat 2761 tccgatccag gatgaatttc ctgaagaaac aaatccctgc catcacctgc attcagtcac 2821 agtggagagg atacaagcag aagaaggcat atcaagatcg gttagcttac ctgcgctccc 2881 acaaagatga agttgtaaag attcagtccc tggcaaggat gcaccaagct cgaaagcgct 2941 atcgagatcg cctgcagtac ttccgggacc atataaatga cattatcaaa atccaggctt 3001 ttattcgggc aaacaaagct cgggatgact acaagactct catcaatgct gaggatcctc 3061 ctatggttgt ggtccgaaaa tttgtccacc tgctggacca aagtgaccag gattttcagg 3121 aggagcttga ccttatgaag atgcgggaag aggttatcac cctcattcgt tctaaccagc 3181 agctggagaa tgacctcaat ctcatggata tcaaaattgg actgctagtg aaaaataaga 3241 ttacgttgca ggatgtggtt tcccacagta aaaaacttac caaaaaaaat aaggaacagt 3301 tgtctgatat gatgatgata aataaacaga agggaggtct caaggctttg agcaaggaga 3361 agagagagaa gttggaagct taccagcacc tgttttattt attgcaaacc aatcccacct 3421 atctggccaa gctcattttt cagatgcccc agaacaagtc caccaagttc atggactctg 3481 taatcttcac actctacaac tacgcgtcca accagcgaga ggagtacctg ctcctgcggc 3541 tctttaagac agcactccaa gaggaaatca agtcgaaggt agatcagatt caagagattg 3601 tgacaggaaa tcctacggtt attaaaatgg ttgtaagttt caaccgtggt gcccgtggcc 3661 agaatgccct gagacagatc ttggccccag tcgtgaagga aattatggat gacaaatctc 3721 tcaacatcaa aactgaccct gtggatattt acaaatcttg ggttaatcag atggagtctc 3781 agacaggaga ggcaagcaaa ctgccctatg atgtgacccc tgagcaggcg ctagctcatg 3841 aagaagtgaa gacacggcta gacagctcca tcaggaacat gcgggctgtg acagacaagt 3901 ttctctcagc cattgtcagc tctgtggaca aaatccctta tgggatgcgc ttcattgcca 3961 aagtgctgaa ggactcgttg catgagaagt tccctgatgc tggtgaggat gagctgctga 4021 agattattgg taacttgctt tattatcgat acatgaatcc agccattgtt gctcctgatg 4081 cctttgacat cattgacctg tcagcaggag gccagcttac cacagaccaa cgccgaaatc 4141 tgggctccat tgcaaaaatg cttcagcatg ctgcttccaa taagatgttt ctgggagata 4201 atgcccactt aagcatcatt aatgaatatc tttcccagtc ctaccagaaa ttcagacggt 4261 ttttccaaac tgcttgtgat gtcccagagc ttcaggataa atttaatgtg gatgagtact 4321 ctgatttagt aaccctcacc aaaccagtaa tctacatttc cattggtgaa atcatcaaca 4381 cccacactct cctgttggat caccaggatg ccattgctcc ggagcacaat gatccaatcc 4441 acgaactgct ggacgacctc ggcgaggtgc ccaccatcga gtccctgata ggggaaagct 4501 ctggcaattt aaatgaccca aataaggagg cactggctaa gacggaagtg tctctcaccc 4561 tgaccaacaa gttcgacgtg cctggagatg agaatgcaga aatggatgct cgaaccatct 4621 tactgaatac aaaacgttta attgtggatg tcatccggtt ccagccagga gagaccttga 4681 ctgaaatcct agaaacacca gccaccagtg aacaggaagc agaacatcag agagccatgc 4741 agagacgtgc tatccgtgat gccaaaacac ctgacaagat gaaaaagtca aaatctgtaa 4801 aggaagacag caacctcact cttcaagaga agaaagagaa gatccagaca ggtttaaaga 4861 agctaacaga gcttggaacc gtggacccaa agaacaaata ccaggaactg atcaacgaca 4921 ttgccaggga tattcggaat cagcggaggt accgacagag gagaaaggcc gaactagtga 4981 aactgcaaca gacatacgct gctctgaact ctaaggccac cttttatggg gagcaggtgg 5041 attactataa aagctatatc aaaacctgct tggataactt agccagcaag ggcaaagtct 5101 ccaaaaagcc tagggaaatg aaaggaaaga aaagcaaaaa gatttctctg aaatatacag 5161 cagcaagact acatgaaaaa ggagttcttc tggaaattga ggacctgcaa gtgaatcagt 5221 ttaaaaatgt tatatttgaa atcagtccaa cagaagaagt tggagacttc gaagtgaaag 5281 ccaaattcat gggagttcaa atggagactt ttatgttaca ttatcaggac ctgctgcagc 5341 tacagtatga aggagttgca gtcatgaaat tatttgatag agctaaagta aatgtcaacc 5401 tcctgatctt ccttctcaac aaaaagttct acgggaagta attgatcgtt tgctgccagc 5461 ccagaaggat gaaggaaaga agcacctcac agctcctttc taggtccttc tttcctcatt 5521 ggaagcaaag acctagccaa caacagcacc tcaatctgat acactcccga tgccacattt 5581 ttaactcctc tcgctctgat gggacatttg ttaccctttt ttcatagtga aattgtgttt 5641 caggcttagt ctgacctttc tggtttcttc attttcttcc attacttagg aaagagtgga 5701 aactccacta aaatttctct gtgttgttac agtcttagag gttgcagtac tatattgtaa 5761 gctttggtgt ttgtttaatt agcaataggg atggtaggat tcaaatgtgt gtcatttaga 5821 agtggaagct attagcacca atgacataaa tacatacaag acacagaact aaaatgtcat 5881 gttattaaca gttattaggt tgtcatttaa aaataaagtt cctttatatt tctgtcccat 5941 caggaaaact gaaggatatg gggaatcatt ggttatcttc cattgtgttt ttctttatgg 6001 acaggagcta atggaagtga cagtcatgtt caaaggaagc atttctagaa aaaaggagat 6061 aatgttttta aatttcatta tcaaacttgg gcaattctgt ttgtgtaact ccccgactag 6121 tggatgggag agtcccattg ctaaaattca gctactcaga taaattcaga atgggtcaag 6181 gcacctgcct gtttttgttg gtgcacagag attgacttga ttcagagaga caattcactc 6241 catccctatg gcagaggaat gggttagccc taatgtagaa tgtcattgtt tttaaaactg 6301 ttttatatct taagagtgcc ttattaaagt atagatgtat gtcttaaaat gtgggtgata 6361 ggaattttaa agatttatat aatgcatcaa aagccttaga ataagaaaag ctttttttaa 6421 attgctttat ctgtatatct gaactcttga aacttatagc taaaacacta ggatttatct 6481 gcagtgttgc agggagataa ttctgcctta aattgtctaa aacaaaaaca aaaccagcca 6541 acctatgtta cacgtgagat taaaaccaat tttttcccca ttttttctcc ttttttctct 6601 tgctgcccac attgtgcctt tattttatga gccccagttt tctgggctta gtttaaaaaa 6661 aaaatcaagt ctaaacattg catttagaaa gcttttgttc ttggataaaa agtcatacac 6721 tttaaaaaaa aaaaaaaaac tttttccagg aaaatatatt gaaatcatgc tgctgagcct 6781 ctattttctt tctttgatgt tttgattcag tattctttta tcataaattt ttagcattta 6841 aaaattcact gatgtacatt aagccaataa actgctttaa tgaataacaa actatgtagt 6901 gtgtccctat tataaatgca ttggagaagt atttttatga gactctttac tcaggtgcat 6961 ggttacagcc acagggaggc atggagtgcc atggaaggat tcgccactac ccagaccttg 7021 ttttttgttg tattttggaa gacaggtttt ttaaagaaac attttcctca gattaaaaga 7081 tgatgctatt acaactagca ttgcctcaaa aactgggacc aaccaaagtg tgtcaaccct 7141 gtttccttaa aagaggctat gaatcccaaa ggccacatcc aagacaggca ataatgagca 7201 gagtttacag ctcctttaat aaaatgtgtc agtaatttta aggtttatag ttccctcaac 7261 acaattgcta atgcagaata gtgtaaaatg cgcttcaaga atgttgatga tgatgatata 7321 gaattgtggc tttagtagca cagaggatgc cccaacaaac tcatggcgtt gaaaccacac 7381 agttctcatt actgttattt attagctgta gcattctctg tctcctctct ctcctccttt 7441 gaccttctcc tcgaccagcc atcatgacat ttaccatgaa tttacttcct cccaagagtt 7501 tggactgccc gtcagattgt ttctgcacat agttgccttt gtatctctgt atgaaataaa 7561 aggtcatttg ttc // LOCUS HUMIRELA 2314 bp mRNA PRI 25-JUN-1992 DEFINITION Homo sapiens I-Rel mRNA, complete cds. ACCESSION M83221 NID g186549 KEYWORDS I-Rel; NF-kappa-B transcription factor inhibitor. SOURCE Homo sapiens (library: Jurkat T-cell in lambda ZapII) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2314) AUTHORS Ruben,S.M., Klement,J.F., Maher,M., Coleman,T.A., Chen,C.-H. and Rosen,C.A. TITLE I-Rel: A novel re1-related protein that inhibits NF-kapaB transcriptional activity JOURNAL Genes Dev. 6, 745-760 (1992) MEDLINE 92249768 FEATURES Location/Qualifiers source 1..2314 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Jurkat" /cell_type="T-lymphocyte" /tissue_lib="Jurkat T-cell in lambda ZapII" CDS 145..1884 /codon_start=1 /function="NF-kappa-B transcpription factor p50-subunit inhibitor" /product="I-Rel" /db_xref="PID:g186550" /translation="MLRSGPASGPSVPTGRAMPSRRVARPPAAPELGALGSPDLSSLS LAVSRSTDELEIIDEYIKENGFGLDGGQPGPGEGLPRLVSRGAASLSTVTLGPVAPPA TPPPWGCPLGRLVSPAPGPGPQPHLVITEQPKQRGMPFRYECEGRSAGSILGESSTEA SKTLPAIELRDCGGLREVEVTACLVWKDWPHRVHPHSLVGKDCTDGICRVRLRPHVSP RHSFNNLGIQCVRKKEIEAAIERKIQLGIDPYNAGSLKNHQEVDMNVVRICFQASYRD QQGQMRRMDPVLSEPVYDKKSTNTSELRICRINKESGPCTGGEELYLLCDKVQKEDIS VVFSRASWEGRADFSQADVHRQIAIVFKTPPYEDLEIVEPVTVNVFLQRLTDGVCSEP LPFTYLPRDHDSYGVDKKAKRGMPDVLGELNSSDPHGIESKRRKKKPAILDHFLPNHG SGPFLPPSALLPDPDFFSGTVSLPGLEPPGGPDLLDDGFAYDPTAPTLFTMLDLLPPA PPHASAVVCSGGAGAVVGETPGPEPLTLDSYQAPGPGDGGTASLVGSNMFPNHYREAA FGGGLLSPGPEAT" BASE COUNT 421 a 769 c 728 g 396 t ORIGIN 1 ggaattcccg cccggcccgg ccccgcgccc cgcagccccg ggcgccgcgc gtcctgcccg 61 gcctgcggcc cagcccttgc gccgctcgtc cgacccgcga tcgtccacca gaccgtgcct 121 cccggccgcc cgggccccgc gtgcatgctt cggtctgggc cagcctctgg gccgtccgtc 181 cccactggcc gggccatgcc gagtcgccgc gtcgccagac cgccggctgc gccggagctg 241 ggggccttag ggtcccccga cctctcctca ctctcgctcg ccgtttccag gagcacagat 301 gaattggaga tcatcgacga gtacatcaag gagaacggct tcggcctgga cgggggacag 361 ccgggcccgg gcgaggggct gccacgcctg gtgtctcgcg gggctgcgtc cctgagcacg 421 gtcaccctgg gccctgtggc gcccccagcc acgccgccgc cttggggctg ccccctgggc 481 cgactagtgt ccccagcgcc gggcccgggc ccgcagccgc acctggtcat cacggagcag 541 cccaagcagc gcggcatgcc gttccgctac gagtgcgagg gccgctcggc cggcagcatc 601 cttggggaga gcagcaccga ggccagcaag acgctgcccg ccatcgagct ccgggattgt 661 ggagggctgc gggaggtgga ggtgactgcc tgcctggtgt ggaaggactg gcctcaccga 721 gtccaccccc acagcctcgt ggggaaagac tgcaccgacg gcatctgcag ggtgcggctc 781 cggcctcacg tcagcccccg gcacagtttt aacaacctgg gcatccagtg tgtgaggaag 841 aaggagattg aggctgccat tgagcggaag attcaactgg gcattgaccc ctacaacgct 901 gggtccctga agaaccatca ggaagtagac atgaatgtgg tgaggatctg cttccaggcc 961 tcatatcggg accagcaggg acagatgcgc cggatggatc ctgtgctttc cgagcccgtc 1021 tatgacaaga aatccacaaa cacatcagag ctgcggattt gccgaattaa caaggaaagc 1081 gggccgtgca ccggtggcga ggagctctac ttgctctgcg acaaggtgca gaaagaggac 1141 atatcagtgg tgttcagcag ggcctcctgg gaaggtcggg ctgacttctc ccaggccgac 1201 gtgcaccgcc agattgccat tgtgttcaag acgccgccct acgaggacct ggagattgtc 1261 gagcccgtga cagtcaacgt cttcctgcag cggctcaccg atggggtctg cagcgagcca 1321 ttgcctttca cgtacctgcc tcgcgaccat gacagctacg gcgtggacaa gaaggcgaaa 1381 cgggggatgc ccgacgtcct tggggagctg aacagctctg acccccatgg catcgagagc 1441 aaacggcgga agaaaaagcc ggccatcctg gaccacttcc tgcccaacca cggctcaggc 1501 ccgttcctcc cgccgtcagc cctgctgcca gaccctgact tcttctctgg caccgtgtcc 1561 ctgcccggcc tggagccccc tggcgggcct gacctcctgg acgatggctt tgcctacgac 1621 cctacggccc ccacactctt caccatgctg gacctgctgc ccccggcacc gccacacgct 1681 agcgctgttg tgtgcagcgg aggtgccggg gccgtggttg gggagacccc cggccctgaa 1741 ccactgacac tggactcgta ccaggccccg ggccccgggg atggaggcac cgccagcctt 1801 gtgggcagca acatgttccc caatcattac cgcgaggcgg cctttggggg cggcctccta 1861 tccccggggc ctgaagccac gtagccccgc gatgccagag gaggggcact gggtggggag 1921 ggaggtggag gagccgtgca atcccaacca ggatgtctag cacccccatc cccttggccc 1981 ttcctcatgc ttctgaagtg gacatattca gccttggcga gaagctccgt tgcacgggtt 2041 tccccttgag cccattttac agatgaggaa actgagtccg gagaggaaaa gggacatggc 2101 tcccgtgcac tagcttgtta cagctgcctc tgtccccaca tgtgggggca ccttctccag 2161 taggattcgg aaaagattgt acatatggga ggagggggca gattcctggc cctccctccc 2221 cagacttgac ttgaaggtgg ggggtaggtt ggttgttcag agtcttccca ataaagatga 2281 gtttttgagc ctcaaaaaaa aaaaaaggaa ttcc // LOCUS HUMIRKCB 1635 bp DNA PRI 27-JAN-1997 DEFINITION Human gene for inward rectifier K channel, complete cds. ACCESSION D50582 NID g1088444 KEYWORDS . SOURCE Homo sapiens (isolate:caucasian) placenta DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Inagaki,N., Gonoi,T., Clement,J.P. IV., Namba,N., Inazawa,J., Gonzalez,G., Aguilar-Bryan,L., Seino,S. and Bryan,J. TITLE Reconstitution of IKATP: an inward rectifier subunit plus the sulfonylurea receptor JOURNAL Science 270 (5239), 1166-1170 (1995) MEDLINE 96072967 REFERENCE 2 (bases 1 to 1635) AUTHORS Inagaki,N. JOURNAL Unpublished (1995) REFERENCE 3 (bases 1 to 1635) AUTHORS Inagaki,N. TITLE Direct Submission JOURNAL Submitted (16-MAY-1995) to the DDBJ/EMBL/GenBank databases. Nobuya Inagaki, Chiba University School of Medicine, Center for Biomedical Science; 1-8-1 Inohana, Chuo-ku, Chiba, Chiba 260, Japan (Tel:043-222-7171(ex.2223), Fax:043-221-7803) FEATURES Location/Qualifiers source 1..1635 /organism="Homo sapiens" /isolate="caucasian" /db_xref="taxon:9606" /chromosome="11" /tissue_type="placenta" CDS 210..1382 /codon_start=1 /product="inward rectifier K channel" /db_xref="PID:d1009769" /db_xref="PID:g1088445" /translation="MLSRKGIIPEEYVLTRLAEDPAEPRYRARQRRARFVSKKGNCNV AHKNIREQGRFLQDVFTTLVDLKWPHTLLIFTMSFLCSWLLFAMAWWLIAFAHGDLAP SEGTAEPCVTSIHSFSSAFLFSIEVQVTIGFGGRMVTEECPLAILSLIVQNIVGLMIN AIMLGCIFMKTAQAHRRAETLIFSKHAVIALRHGRLCFMLRVGDLRKSMIISATIHMQ VVRKTTSPEGEVVPLHQVDIPMENGVGGNSIFLVAPLIIYHVIDANSPLYDLAPSDLH HHQDLEIIVILEGVVETTGITTQARTSYLADEILWGQRFVPIVAEEDGRYSVDYSKFG NTIKVPTPLCTARQLDEDHSLLEALTLASARGPLRKRSVPMAKAKPKFSISPDSLS" BASE COUNT 314 a 548 c 459 g 314 t ORIGIN Chromosome 11. 1 ctgaggctgg tattaagaag tgaagtggga cccaggtgga ggtaaggaag agtctggtgg 61 ggagttatct cagaagtgag gccagcacag gctgagtgca gccccagggt gagaaggtgc 121 ccaccgagag gactctgcag tgaggcccta ggccacgtcc gaggggtgcc tccgatgggg 181 gaagcccctc cctgggggtc accggagcca tgctgtcccg caagggcatc atccccgagg 241 aatacgtgct gacacgcctg gcagaggacc ctgccgagcc caggtaccgt gcccgccagc 301 ggagggcccg ctttgtgtcc aagaaaggca actgcaacgt ggcccacaag aacatccggg 361 agcagggccg cttcctgcag gacgtgttca ccacgctggt ggacctcaag tggccacaca 421 cattgctcat cttcaccatg tccttcctgt gcagctggct gctcttcgcc atggcctggt 481 ggctcatcgc cttcgcccac ggtgacctgg cccccagcga gggcactgct gagccctgtg 541 tcaccagcat ccactccttc tcgtctgcct tccttttctc cattgaggtc caagtgacta 601 ttggctttgg ggggcgcatg gtgactgagg agtgcccact ggccatcctg agcctcatcg 661 tgcagaacat cgtggggctc atgatcaacg ccatcatgct tggctgcatc ttcatgaaga 721 ctgcccaagc ccaccgcagg gctgagaccc tcatcttcag caagcatgcg gtgatcgctc 781 tgcgccacgg ccgcctctgc ttcatgctac gtgtgggtga cctccgcaag agcatgatca 841 tcagcgccac catccacatg caggtggtac gcaagaccac cagccccgag ggcgaggtgg 901 tgcccctcca ccaggtggac atccccatgg agaacggcgt gggtggcaac agcatcttcc 961 tggtggcccc gctgatcatc taccatgtca ttgatgccaa cagcccactc tacgacctgg 1021 cacccagcga cctgcaccac caccaggacc tcgagatcat cgtcatcctg gaaggcgtgg 1081 tggaaaccac gggcatcacc acccaggccc gcacctccta cctggccgat gagatcctgt 1141 ggggccagcg ctttgtgccc attgtagctg aggaggacgg acgttactct gtggactact 1201 ccaagtttgg caacaccatc aaagtgccca caccactctg cacggcccgc cagcttgatg 1261 aggaccacag cctactggaa gctctgaccc tcgcctcagc ccgcgggccc ctgcgcaagc 1321 gcagcgtgcc catggccaag gccaagccca agttcagcat ctctccagat tccctgtcct 1381 gagccatggt ctctcgggcc ccccacacgc gtgtgtacac acggaccatg tggtatgtag 1441 cccagccagg gcctggtgtg aggctgggcc agcctcagct cagcctcccc ctgctgctca 1501 tccagggtgt tacaaggcac ttgtcactat gctatttctg gcctcagcag gaacctgtac 1561 tgggttattt ttgtccctgc tcctcccaac ccaatttagg actggctcac ccctctcccc 1621 cgcccaaggc tgcag // LOCUS HUMISGF3A 4003 bp mRNA PRI 25-JUL-1997 DEFINITION Homo sapiens transcription factor ISGF-3 mRNA, complete cds. ACCESSION M97935 NID g2281070 KEYWORDS transcription factor. SOURCE Homo sapiens female cultured cells cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4003) AUTHORS Schindler,C., Fu,X.-Y., Improta,T., Aebersold,R.H. and Darnell,J.E. TITLE Proteins of transcription factor ISGF-3: One gene encodes the 91- and 84-kDa ISGF-3 proteins that are activated by interferon alpha JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89, 7836-7839 (1992) MEDLINE 92366557 REFERENCE 2 (bases 1 to 4003) AUTHORS Horvath,C. TITLE Direct Submission JOURNAL Submitted (25-JUL-1997) Molecular Cell Biology, The Rockefeller University, 1230 York Avenue, New York, NY 10021, USA REMARK Sequence update by submitter FEATURES Location/Qualifiers source 1..4003 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Hela S3 (ATTC)" /cell_type="Hela" /sex="female" /tissue_type="cultured cells" CDS 197..2449 /codon_start=1 /product="transcription factor ISGF-3" /db_xref="PID:g2281071" /translation="MSQWYELQQLDSKFLEQVHQLYDDSFPMEIRQYLAQWLEKQDWE HAANDVSFATIRFHDLLSQLDDQYSRFSLENNFLLQHNIRKSKRNLQDNFQEDPIQMS MIIYSCLKEERKILENAQRFNQAQSGNIQSTVMLDKQKELDSKVRNVKDKVMCIEHEI KSLEDLQDEYDFKCKTLQNREHETNGVAKSDQKQEQLLLKKMYLMLDNKRKEVVHKII ELLNVTELTQNALINDELVEWKRRQQSACIGGPPNACLDQLQNWFTIVAESLQQVRQQ LKKLEELEQKYTYEHDPITKNKQVLWDRTFSLFQQLIQSSFVVERQPCMPTHPQRPLV LKTGVQFTVKLRLLVKLQELNYNLKVKVLFDKDVNERNTVKGFRKFNILGTHTKVMNM EESTNGSLAAEFRHLQLKEQKNAGTRTNEGPLIVTEELHSLSFETQLCQPGLVIDLET TSLPVVVISNVSQLPSGWASILWYNMLVAEPRNLSFFLTPPCARWAQLSEVLSWQFSS VTKRGLNVDQLNMLGEKLLGPNASPDGLIPWTRFCKENINDKNFPFWLWIESILELIK KHLLPLWNDGCIMGFISKERERALLKDQQPGTFLLRFSESSREGAITFTWVERSQNGG EPDFHAVEPYTKKELSAVTFPDIIRNYKVMAAENIPENPLKYLYPNIDKDHAFGKYYS RPKEAPEPMELDGPKGTGYIKTELISVSEVHPSRLQTTDNLLPMSPEEFDEVSRIVGS VEFDSMMNTV" BASE COUNT 1173 a 812 c 883 g 1135 t ORIGIN 1 attaaacctc tcgccgagcc cctccgcaga ctctgcgccg gaaagtttca tttgctgtat 61 gccatcctcg agagctgtct aggttaacgt tcgcactctg tgtatataac ctcgacagtc 121 ttggcaccta acgtgctgtg cgtagctgct cctttggttg aatccccagg cccttgttgg 181 ggcacaaggt ggcaggatgt ctcagtggta cgaacttcag cagcttgact caaaattcct 241 ggagcaggtt caccagcttt atgatgacag ttttcccatg gaaatcagac agtacctggc 301 acagtggtta gaaaagcaag actgggagca cgctgccaat gatgtttcat ttgccaccat 361 ccgttttcat gacctcctgt cacagctgga tgatcaatat agtcgctttt ctttggagaa 421 taacttcttg ctacagcata acataaggaa aagcaagcgt aatcttcagg ataattttca 481 ggaagaccca atccagatgt ctatgatcat ttacagctgt ctgaaggaag aaaggaaaat 541 tctggaaaac gcccagagat ttaatcaggc tcagtcgggg aatattcaga gcacagtgat 601 gttagacaaa cagaaagagc ttgacagtaa agtcagaaat gtgaaggaca aggttatgtg 661 tatagagcat gaaatcaaga gcctggaaga tttacaagat gaatatgact tcaaatgcaa 721 aaccttgcag aacagagaac acgagaccaa tggtgtggca aagagtgatc agaaacaaga 781 acagctgtta ctcaagaaga tgtatttaat gcttgacaat aagagaaagg aagtagttca 841 caaaataata gagttgctga atgtcactga acttacccag aatgccctga ttaatgatga 901 actagtggag tggaagcgga gacagcagag cgcctgtatt ggggggccgc ccaatgcttg 961 cttggatcag ctgcagaact ggttcactat agttgcggag agtctgcagc aagttcggca 1021 gcagcttaaa aagttggagg aattggaaca gaaatacacc tacgaacatg accctatcac 1081 aaaaaacaaa caagtgttat gggaccgcac cttcagtctt ttccagcagc tcattcagag 1141 ctcgtttgtg gtggaaagac agccctgcat gccaacgcac cctcagaggc cgctggtctt 1201 gaagacaggg gtccagttca ctgtgaagtt gagactgttg gtgaaattgc aagagctgaa 1261 ttataatttg aaagtcaaag tcttatttga taaagatgtg aatgagagaa atacagtaaa 1321 aggatttagg aagttcaaca ttttgggcac gcacacaaaa gtgatgaaca tggaggagtc 1381 caccaatggc agtctggcgg ctgaatttcg gcacctgcaa ttgaaagaac agaaaaatgc 1441 tggcaccaga acgaatgagg gtcctctcat cgttactgaa gagcttcact cccttagttt 1501 tgaaacccaa ttgtgccagc ctggtttggt aattgacctc gagacgacct ctctgcccgt 1561 tgtggtgatc tccaacgtca gccagctccc gagcggttgg gcctccatcc tttggtacaa 1621 catgctggtg gcggaaccca ggaatctgtc cttcttcctg actccaccat gtgcacgatg 1681 ggctcagctt tcagaagtgc tgagttggca gttttcttct gtcaccaaaa gaggtctcaa 1741 tgtggaccag ctgaacatgt tgggagagaa gcttcttggt cctaacgcca gccccgatgg 1801 tctcattccg tggacgaggt tttgtaagga aaatataaat gataaaaatt ttcccttctg 1861 gctttggatt gaaagcatcc tagaactcat taaaaaacac ctgctccctc tctggaatga 1921 tgggtgcatc atgggcttca tcagcaagga gcgagagcgt gccctgttga aggaccagca 1981 gccggggacc ttcctgctgc ggttcagtga gagctcccgg gaaggggcca tcacattcac 2041 atgggtggag cggtcccaga acggaggcga acctgacttc catgcggttg aaccctacac 2101 gaagaaagaa ctttctgctg ttactttccc tgacatcatt cgcaattaca aagtcatggc 2161 tgctgagaat attcctgaga atcccctgaa gtatctgtat ccaaatattg acaaagacca 2221 tgcctttgga aagtattact ccaggccaaa ggaagcacca gagccaatgg aacttgatgg 2281 ccctaaagga actggatata tcaagactga gttgatttct gtgtctgaag ttcacccttc 2341 tagacttcag accacagaca acctgctccc catgtctcct gaggagtttg acgaggtgtc 2401 tcggatagtg ggctctgtag aattcgacag tatgatgaac acagtataga gcatgaattt 2461 ttttcatctt ctctggcgac agttttcctt ctcatctgtg attccctcct gctactctgt 2521 tccttcacat cctgtgtttc tagggaaatg aaagaaaggc cagcaaattc gctgcaacct 2581 gttgatagca agtgaatttt tctctaactc agaaacatca gttactctga agggcatcat 2641 gcatcttact gaaggtaaaa ttgaaaggca ttctctgaag agtgggtttc acaagtgaaa 2701 aacatccaga tacacccaaa gtatcaggac gagaatgagg gtcctttggg aaaggagaag 2761 ttaagcaaca tctagcaaat gttatgcata aagtcagtgc ccaactgtta taggttgttg 2821 gataaatcag tggttattta gggaactgct tgacgtagga acggtaaatt tctgtgggag 2881 aattcttaca tgttttcttt gctttaagtg taactggcag ttttccattg gtttacctgt 2941 gaaatagttc aaagccaagt ttatatacaa ttatatcagt cctctttcaa aggtagccat 3001 catggatctg gtagggggaa aatgtgtatt ttattacatc tttcacattg gctatttaaa 3061 gacaaagaca aattctgttt cttgagaaga gaatattagc tttactgttt gttatggctt 3121 aatgacacta gctaatatca atagaaggat gtacatttcc aaattcacaa gttgtgtttg 3181 atatccaaag ctgaatacat tctgctttca tcttggtcac atacaattat ttttacagtt 3241 ctcccaaggg agttaggcta ttcacaacca ctcattcaaa agttgaaatt aaccatagat 3301 gtagataaac tcagaaattt aattcatgtt tcttaaatgg gctactttgt cctttttgtt 3361 attagggtgg tatttagtct attagccaca aaattgggaa aggagtagaa aaagcagtaa 3421 ctgacaactt gaataataca ccagagataa tatgagaatc agatcatttc aaaactcatt 3481 tcctatgtaa ctgcattgag aactgcatat gtttcgctga tatatgtgtt tttcacattt 3541 gcgaatggtt ccattctctc tcctgtactt tttccagaca cttttttgag tggatgatgt 3601 ttcgtgaagt atactgtatt tttacctttt tccttcctta tcactgacac aaaaagtaga 3661 ttaagagatg ggtttgacaa ggttcttccc ttttacatac tgctgtctat gtggctgtat 3721 cttgtttttc cactactgct accacaacta tattatcatg caaatgctgt attcttcttt 3781 ggtggagata aagatttctt gagttttgtt ttaaaattaa agctaaagta tctgtattgc 3841 attaaatata atatcgacac agtgctttcc gtggcactgc atacaatctg aggcctcctc 3901 tctcagtttt tatatagatg gcgagaacct aagtttcagt tgattttaca attgaaatga 3961 ctaaaaaaca aagaagacaa cattaaaaac aatattgttt cta // LOCUS HUMJNK1 1418 bp mRNA PRI 25-APR-1994 DEFINITION Human protein kinase (JNK1) mRNA, complete cds. ACCESSION L26318 NID g474900 KEYWORDS protein kinase. SOURCE Homo sapiens (library: lambda ZAPII) fetus brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1418) AUTHORS Derijard,B., Hibi,M., Wu,I.-H., Barrett,T., Su,B., Deng,T., Karin,M. and Davis,R.J. TITLE JNK1: A protein kinase stimulated by UV light and Ha-Ras that binds and phosphorylates the c-Jun activation domain JOURNAL Cell 76, 1025-1037 (1994) MEDLINE 94185163 FEATURES Location/Qualifiers source 1..1418 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="brain" /tissue_lib="lambda ZAPII" 5'UTR 1..18 CDS 19..1173 /standard_name="JNK1" /codon_start=1 /product="protein kinase" /db_xref="PID:g474901" /translation="MSRSKRDNNFYSVEIGDSTFTVLKRYQNLKPIGSGAQGIVCAAY DAILERNVAIKKLSRPFQNQTHAKRAYRELVLMKCVNHKNIIGLLNVFTPQKSLEEFQ DVYIVMELMDANLCQVIQMELDHERMSYLLYQMLCGIKHLHSAGIIHRDLKPSNIVVK SDCTLKILDFGLARTAGTSFMMTPYVVTRYYRAPEVILGMGYKENVDLWSVGCIMGEM VCHKILFPGRDYIDQWNKVIEQLGTPCPEFMKKLQPTVRTYVENRPKYAGYSFEKLFP DVLFPADSEHNKLKASQARDLLSKMLVIDASKRISVDEALQHPYINVWYDPSEAEAPP PKIPDKQLDEREHTIEEWKELIYKEVMDLEERTKNGVIRGQPSPLAQVQQ" 3'UTR 1171..1418 BASE COUNT 437 a 278 c 326 g 377 t ORIGIN 1 cattaattgc ttgccatcat gagcagaagc aagcgtgaca acaattttta tagtgtagag 61 attggagatt ctacattcac agtcctgaaa cgatatcaga atttaaaacc tataggctca 121 ggagctcaag gaatagtatg cgcagcttat gatgccattc ttgaaagaaa tgttgcaatc 181 aagaagctaa gccgaccatt tcagaatcag actcatgcca agcgggccta cagagagcta 241 gttcttatga aatgtgttaa tcacaaaaat ataattggcc ttttgaatgt tttcacacca 301 cagaaatccc tagaagaatt tcaagatgtt tacatagtca tggagctcat ggatgcaaat 361 ctttgccaag tgattcagat ggagctagat catgaaagaa tgtcctacct tctctatcag 421 atgctgtgtg gaatcaagca ccttcattct gctggaatta ttcatcggga cttaaagccc 481 agtaatatag tagtaaaatc tgattgcact ttgaagattc ttgacttcgg tctggccagg 541 actgcaggaa cgagttttat gatgacgcct tatgtagtga ctcgctacta cagagcaccc 601 gaggtcatcc ttggcatggg ctacaaggaa aacgtggatt tatggtctgt ggggtgcatt 661 atgggagaaa tggtttgcca caaaatcctc tttccaggaa gggactatat tgatcagtgg 721 aataaagtta ttgaacagct tggaacacca tgtcctgaat tcatgaagaa actgcaacca 781 acagtaagga cttacgttga aaacagacct aaatatgctg gatatagctt tgagaaactc 841 ttccctgatg tccttttccc agctgactca gaacacaaca aacttaaagc cagtcaggca 901 agggatttgt tatccaaaat gctggtaata gatgcatcta aaaggatctc tgtagatgaa 961 gctctccaac acccgtacat caatgtctgg tatgatcctt ctgaagcaga agctccacca 1021 ccaaagatcc ctgacaagca gttagatgaa agggaacaca caatagaaga gtggaaagaa 1081 ttgatatata aggaagttat ggacttggag gagagaacca agaatggagt tatacggggg 1141 cagccctctc ctttagcaca ggtgcagcag tgatcaatgg ctctcagcat ccatcatcat 1201 cgtcgtctgt caatgatgtg tcttcaatgt caacagatcc gactttggcc tctgatacag 1261 acagcagtct agaagcagca gctgggcctc tgggctgctg tagatgacta cttgggccat 1321 cggggggtgg gagggatggg gagtcggtta gtcattgata gaactacttt gaaaacaatt 1381 cagtggtctt atttttgggt gatttttcaa aaaatgta // LOCUS HUMJUNA 3622 bp DNA PRI 06-JAN-1995 DEFINITION Human c-jun proto oncogene (JUN), complete cds, clone hCJ-1. ACCESSION J04111 NID g186624 KEYWORDS jun oncogene. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3622) AUTHORS Hattori,K., Angel,P., Le Beau,M.M. and Karin,M. TITLE Structure and chromosomal localization of the functional intronless human JUN protooncogene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85 (23), 9148-9152 (1988) MEDLINE 89057892 COMMENT Draft entry and computer-readable sequence [1] kindly submitted by K.Hattori, 16-NOV-1988. FEATURES Location/Qualifiers source 1..3622 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="leukocyte" /map="1p32-p31" gene 287..3622 /gene="JUN" exon 287..3622 /gene="JUN" /note="alternative mRNA start; G00-120-114" /number=1 exon 289..3622 /gene="JUN" /note="alternative mRNA start; G00-120-114" /number=1 exon 293..3622 /gene="JUN" /note="alternative mRNA start; G00-120-114" /number=1 CDS 1261..2256 /gene="JUN" /codon_start=1 /db_xref="GDB:G00-120-114" /db_xref="PID:g386839" /translation="MTAKMETTFYDDALNASFLPSESGPYGYSNPKILKQSMTLNLAD PVGSLKPHLRAKNSDLLTSPDVGLLKLASPELERLIIQSSNGHITTTPTPTQFLCPKN VTDEQEGFAEGFVRALAELHSQNTLPSVTSAAQPVNGAGMVAPAVASVAGGSGSGGFS ASLHSEPPVYANLSNFNPGALSSGGGAPSYGAAGLAFPAQPQQQQQPPHHLPQQMPVQ HPRLQALKEEPQTVPEMPGETPPLSPIDMESQERIKAERKRMRNRIAASKCRKRKLER IARLEEKVKTLKAQNSELASTANMLREQVAQLKQKVMNHVNSGCQLMLTQQLQTF" BASE COUNT 851 a 949 c 1091 g 731 t ORIGIN Chromosome 1p31-p32. 1 cccggggagg ggaccgggga acagagggcc gagaggcgtg cggcaggggg gagggtagga 61 gaaagaaggg cccgactgta ggagggcagc ggagcattac ctcatcccgt gagcctccgc 121 gggcccagag aagaatcttc tagggtggag tctccatggt gacgggcggg cccgcccccc 181 tgagagcgac gcgagccaat gggaaggcct tggggtgaca tcatgggcta tttttagggg 241 ttgactggta gcagataagt gttgagctcg ggctggataa gggctcagag ttgcactgag 301 tgtggctgaa gcagcgaggc gggagtggag gtgcgcggag tcaggcagac agacagacac 361 agccagccag ccaggtcggc agtatagtcc gaactgcaaa tcttattttc ttttcacctt 421 ctctctaact gcccagagct agcgcctgtg gctcccgggc tggtggttcg ggagtgtcca 481 gagagccttg tctccagccg gccccgggag gagagccctg ctgcccaggc gctgttgaca 541 gcggcggaaa gcagcggtac cccacgcgcc cgccggggga cgtcggcgag cggctgcagc 601 agcaaagaac tttcccggcg gggaggaccg gagacaagtg gcagagtccc ggagcgaact 661 tttgcaagcc tttcctgcgt cttaggcttc tccacggcgg taaagaccag aaggcggcgg 721 agagccacgc aagagaagaa ggacgtgcgc tcagcttcgc tcgcaccggt tgttgaactt 781 gggcgagcgc gagccgcggc tgccgggcgc cccctccccc tagcagcgga ggaggggaca 841 agtcgtcgga gtccgggcgg ccaagacccg ccgccggccg gccactgcag ggtccgcact 901 gatccgctcc gcggggagag ccgctgctct gggaagtgag ttcgcctgcg gactccgagg 961 aaccgctgcg cccgaagagc gctcagtgag tgaccgcgac ttttcaaagc cgggtagcgc 1021 gcgcgagtcg acaagtaaga gtgcgggagg catcttaatt aaccctgcgc tccctggagc 1081 gagctggtga ggagggcgca gcggggacga cagccagcgg gtgcgtgcgc tcttagagaa 1141 actttccctg tcaaaggctc cggggggcgc gggtgtcccc cgcttgccag agccctgttg 1201 cggccccgaa acttgtgcgc gcacgccaaa ctaacctcac gtgaagtgac ggactgttct 1261 atgactgcaa agatggaaac gaccttctat gacgatgccc tcaacgcctc gttcctcccg 1321 tccgagagcg gaccttatgg ctacagtaac cccaagatcc tgaaacagag catgaccctg 1381 aacctggccg acccagtggg gagcctgaag ccgcacctcc gcgccaagaa ctcggacctc 1441 ctcacctcgc ccgacgtggg gctgctcaag ctggcgtcgc ccgagctgga gcgcctgata 1501 atccagtcca gcaacgggca catcaccacc acgccgaccc ccacccagtt cctgtgcccc 1561 aagaacgtga cagatgagca ggaggggttc gccgagggct tcgtgcgcgc cctggccgaa 1621 ctgcacagcc agaacacgct gcccagcgtc acgtcggcgg cgcagccggt caacggggca 1681 ggcatggtgg ctcccgcggt agcctcggtg gcagggggca gcggcagcgg cggcttcagc 1741 gccagcctgc acagcgagcc gccggtctac gcaaacctca gcaacttcaa cccaggcgcg 1801 ctgagcagcg gcggcggggc gccctcctac ggcgcggccg gcctggcctt tcccgcgcaa 1861 ccccagcagc agcagcagcc gccgcaccac ctgccccagc agatgcccgt gcagcacccg 1921 cggctgcagg ccctgaagga ggagcctcag acagtgcccg agatgcccgg cgagacaccg 1981 cccctgtccc ccatcgacat ggagtcccag gagcggatca aggcggagag gaagcgcatg 2041 aggaaccgca tcgctgcctc caagtgccga aaaaggaagc tggagagaat cgcccggctg 2101 gaggaaaaag tgaaaacctt gaaagctcag aactcggagc tggcgtccac ggccaacatg 2161 ctcagggaac aggtggcaca gcttaaacag aaagtcatga accacgttaa cagtgggtgc 2221 caactcatgc taacgcagca gttgcaaaca ttttgaagag agaccgtcgg gggctgaggg 2281 gcaacgaaga aaaaaaataa cacagagaga cagacttgag aacttgacaa gttgcgacgg 2341 agagaaaaaa gaagtgtccg agaactaaag ccaagggtat ccaagttgga ctgggttcgg 2401 tctgacggcg cccccagtgt gcacgagtgg gaaggacttg gtcgcgccct cccttggcgt 2461 ggagccaggg agcggccgcc tgcgggctgc cccgctttgc ggacgggctg tccccgcgcg 2521 aacggaacgt tggactttcg ttaacattga ccaagaactg catggaccta acattcgatc 2581 tcattcagta ttaaaggggg gagggggagg gggttacaaa ctgcaataga gactgtagat 2641 tgcttctgta gtactcctta agaacacaaa gcggggggag ggttggggag gggcggcagg 2701 agggaggttt gtgagagcga ggctgagcct acagatgaac tctttctggc ctgctttcgt 2761 taactgtgta tgtacatata tatatttttt aatttgatta aagctgatta ctgtcaataa 2821 acagcttcat gcctttgtaa gttatttctt gtttgtttgt ttgggtatcc tgcccagtgt 2881 tgtttgtaaa taagagattt ggagcactct gagtttacca tttgtaataa agtatataat 2941 ttttttatgt tttgtttctg aaaattccag aaaggatatt taagaaaata caataaacta 3001 ttggaaagta ctcccctaac ctcttttctg catcatctgt agatcctagt ctatctaggt 3061 ggagttgaaa gagttaagaa tgctcgataa aatcactctc agtgcttctt actattaagc 3121 agtaaaaact gttctctatt agacttagaa ataaatgtac ctgatgtacc tgatgctatg 3181 tcaggcttca tactccacgc tcccccagcg tatctatatg gaattgctta ccaaaggcta 3241 gtgcgatgtt tcaggaggct ggaggaaggg gggttgcagt ggagagggac agcccactga 3301 gaagtcaaac atttcaaagt ttggattgca tcaagtggca tgtgctgtga ccatttataa 3361 tgttagaaat tttacaatag gtgcttattc tcaaagcagg aattggtggc agattttaca 3421 aaagatgtat ccttccaatt tggaatcttc tctttgacaa ttcctagata aaaagatggc 3481 ctttgtctta tgaatattta taacagcatt ctgtcacaat aaatgtattc aaataccaat 3541 aacagatctt gaattgcttc cctttactac ttttttgttc ccaagttata tactgaagtt 3601 tttattttta gttgctgagg tt // LOCUS HUMKAP1A 844 bp mRNA PRI 16-MAY-1995 DEFINITION Human protein phosphatase (KAP1) mRNA, complete cds. ACCESSION L27711 NID g808006 KEYWORDS protein phosphatase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 844) AUTHORS Hannon,G.J., Casso,D. and Beach,D. TITLE KAP: a dual specificity phosphatase that interacts with cyclin-dependent kinases JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91 (5), 1731-1735 (1994) MEDLINE 94173903 FEATURES Location/Qualifiers source 1..844 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="HeLa" /tissue_lib="pGAD-GH" gene 52..690 /gene="KAP1" CDS 52..690 /gene="KAP1" /standard_name="dual specificity protein phosphatase" /codon_start=1 /function="interaction with cyclin dependent kinases" /product="protein phosphatase" /db_xref="PID:g443669" /translation="MKPPSSIQTSEFDSSDEEPIEDEQTPIHISWLSLSRVNCSQFLG LCALPGCKFKDVRRNVQKDTEELKSCGIQDIFVFCTRGELSKYRVPNLLDLYQQCGII THHHPIADGGTPDIASCCEIMEELTTCLKNYRKTLIHCYGGLGRSCLVAACLLLYLSD TISPEQAIDSLRDLRGSGAIQTIKQYNYLHEFRDKLAAHLSSRDSQSRSVSR" BASE COUNT 285 a 170 c 171 g 218 t ORIGIN 1 gcacgagctg cagagggagg cggcactggt ctcgacgtgg ggcggccagc gatgaagccg 61 cccagttcaa tacaaacaag tgagtttgac tcatcagatg aagagcctat tgaagatgaa 121 cagactccaa ttcatatatc atggctatct ttgtcacgag tgaattgttc tcagtttctc 181 ggtttatgtg ctcttccagg ttgtaaattt aaagatgtta gaagaaatgt ccaaaaagat 241 acagaagaac taaagagctg tggtatacaa gacatatttg ttttctgcac cagaggggaa 301 ctgtcaaaat atagagtccc aaaccttctg gatctctacc agcaatgtgg aattatcacc 361 catcatcatc caatcgcaga tggagggact cctgacatag ccagctgctg tgaaataatg 421 gaagagctta caacctgcct taaaaattac cgaaaaacct taatacactg ctatggagga 481 cttgggagat cttgtcttgt agctgcttgt ctcctactat acctgtctga cacaatatca 541 ccagagcaag ccatagacag cctgcgagac ctaagaggat ccggggcaat acagaccatc 601 aagcaataca attatcttca tgagtttcgg gacaaattag ctgcacatct atcatcaaga 661 gattcacaat caagatctgt atcaagataa aggaattcaa atagcatata tatgaccatg 721 tctgaaatgt cagttctcta gcataatttg tattgaaatg aaaccaccag tgttatcaac 781 ttgaatgtaa atgtacatgt gcagatattc ctaaagtttt attgacaaaa aaaaaaaaaa 841 aaaa // LOCUS HUMKBF2 5360 bp DNA PRI 05-MAR-1996 DEFINITION Homo sapiens H2K binding factor 2 (KBF2) mRNA, complete cds. ACCESSION L08904 L09117 L09761 NID g1220319 KEYWORDS H-2K binding factor-2. SOURCE Homo sapiens (tissue library: lambda gt11) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5360) AUTHORS Tang,X., Gachelin,G., Yokoyama,K. and Israel,A. TITLE Nucleotide sequence and the chromosomal mapping of KBF-2 cDNA JOURNAL Unpublished (1993) FEATURES Location/Qualifiers source 1..5360 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /tissue_lib="lambda gt11" gene 239..5341 /gene="KBF2" CDS 239..1501 /gene="KBF2" /codon_start=1 /product="H-2K binding factor-2" /db_xref="PID:g1220320" /translation="MGSGWKKIKLQMKCDGCSEQGSHPCAFIGIGNSDQEMQQLNLEG KNYCTAKTLYISDSDKQKHFMLSVKVFYGNGDDIGVFLSKSSKPSKKKQSLKNADLCI GSGTKVALFNRLRSQTVSTRYLHVEGGNFHASSQQWGAFTLFLDDDGSEGEEFTVRDG YIHYGQTVKLVCSVTGMALPRLIIRKVDKQTTLLDADDPVSQLHKCAFDLEDTERMYL CLSQERIIQFQATPCPTEPNKEMINDGASWAIISTHKAKYTFYERMGPVLALVMPMPV VESLKLNGGGDEAMLELTGQNFTPNLRVWFGDVEAETMYRCGESMLRVVPDVLHSEKV GDSSQQPVQVSVTLVRNDGIIYSTSLTFTYTPEAGPRPHCSVAGAILKASSSHVPPNE LNTNSDGSYTNASTNSTSVTSSTPTVVS" polyA_signal 5336..5341 /gene="KBF2" BASE COUNT 1552 a 978 c 1008 g 1822 t ORIGIN 1 ttgaggtgca ttgaaatgtt ccaagctgtt acttacctta acatgttctt gaggtaccat 61 ggcatggatt aaaaggaaat ttggtaagtg gcctccactt aaacgactta ctagggaagc 121 tatgtgaaat tatttaaaag ggcgagggga tcaaatagta cttatccttc atgcaaaagt 181 tgtacagaag tcatatggaa tgaaaaaggt tttttgccct cccccttgtg tatatcttat 241 gggcagtgga tggaagaaaa taaaattaca aatgaaatgc gatggttgtt ctgaacaagg 301 ctctcatcca tgtgcattta ttgggatagg aaatagtgac caagaaatgc agcagctaaa 361 cttggaagga aagaactatt gcacagccaa aacattgtac atatctgatt cagacaagca 421 aaagcacttc atgttgtctg taaaggtgtt ctatggcaac ggtgatgaca ttggtgtgtt 481 cctcagcaag tcgtccaaac cttccaaaaa gaagcagtca ttgaaaaatg ctgacttatg 541 cattggctca ggaacaaagg tggctctgtt taatcgacta cgatcccaga cagttagtac 601 cagatacttg catgtagaag gagggaattt tcatgccagt tcacagcagt ggggagcatt 661 tacattattc ttggatgatg atggatcaga aggagaagaa ttcacagtca gagatggcta 721 cattcattat ggacaaacag tcaagcttgt gtgctcagtt actggcatgg cactcccaag 781 attgataatt aggaaagttg ataagcagac cacattattg gatgcagatg atcctgtgtc 841 acaactccat aaatgtgcat ttgaccttga ggatacagaa agaatgtact tatgcctttc 901 tcaagaaaga ataattcaat ttcaggccac tccatgccca acagaaccaa ataaagagat 961 gataaatgat ggtgcttcct gggcaatcat tagcacacat aaggcgaagt atacatttta 1021 tgagagaatg ggccctgtcc ttgccctggt catgcctatg cctgtcgtag agagccttaa 1081 gttgaatggc ggtggggacg aagcaatgct tgaacttaca ggacagaatt tcactccaaa 1141 tttacgagtg tggtttgggg atgtagaagc tgaaactatg tacaggtgtg gagagagtat 1201 gctccgtgtt gtcccagacg ttctgcattc tgagaaggtt ggagatagtt cccagcaacc 1261 agtccaggtt tcagtaactt tggtccgaaa tgatggaatc atatattcca ccagccttac 1321 ctttacctac acaccagaag cagggccgcg gccacattgc agtgtagcag gcgcaatcct 1381 taaggccagt tcaagccacg tgccccctaa tgaattaaac acaaacagcg atggaagtta 1441 cacaaatgcc agcacaaatt caaccagtgt cacatcatct acaccaacag tggtatcctg 1501 aactaccgtc tttttgctaa gactcaaacg gcttgagtgc agcaaaaagt tgacaaaaaa 1561 ggaaaaaaaa atgaacagtc ttttgtggtt tattgggaaa cttttcatac caggtgatac 1621 tattctaaaa ccccgttgtc tccctgcaag tgctgatttg aaatgcagaa gccacagtaa 1681 aaaaaaaaaa aaaaaaaaaa aaaaaagaaa aaaaaatcaa aatgtataaa tattggaaat 1741 caagtttttc agctgttttg ttggttggtt ggttggtttt tgtttggttt tgtttaaagg 1801 gacaagaagt aaataatgtg gctggaatac aagttgaaca aactagaaga cacaaatcta 1861 acatagtttt tatggaccaa ggaacttgta tattgtataa gctttagtaa aaggtacatt 1921 ttcaccatac ctttttttat atcacggtat tatagtacac cttgttacca ataggttgtt 1981 ctcttcccca ccctcctttg agctttgctc taaaatacat tctggttcca agcctgacca 2041 tccttgttta atctatcata ctcttccagg tttttttttt tggtctaagg ctggaacttt 2101 tttctttttt tttcagctga agtcttatga ctttttcatg agtcaaaatt gtttggattt 2161 cacgaagtca aatcttgcaa aggcctgcat atttttttta agattatatg aagtctgtgc 2221 aaaaagcttt aaaaaattgc ctctgccttg cctgcataca tgcaatgtat gtaacttagt 2281 ctctcttctc agacactgtt gggtagttat ttctgtgttt tcttttttta aaaaaaaata 2341 tggacttatt gtggtttatc tgagaggttc taacattcac atgcaatttg gtgtggcatt 2401 tagctattat gagttattgg cgcgaacttg tttgatattt gaagtgtctc tccccttttc 2461 ccatgacgta atacataggt gtgttccagg atttgttcag gtttttcccc ccctcctaat 2521 cttgtacata acttgtattt tgtgtaagtt aaacatttta tttgaacttg gaatgttccc 2581 agtgatttca ttcagcaggg tattttctgc cttgttggca agtagcaaaa aatatgggaa 2641 gtatttgcta ccagttgtta gatggtgccc cttattggta gaatcaggaa aatgtccgca 2701 aaagcatgtt ttattatctt tacttttttg gggggttgga gggggtagcc tagccagaca 2761 tcatgtaatc ttaaaacata agatgctttt attagatgat caactaaaaa tagctggaag 2821 acagtacttt agaaacaaaa tagttagtaa gatatataat gcaaatgtaa cttatgtttt 2881 catttttttc tctgcctttt ttttttgttt tttttctttt tttccagtac tgagcatctc 2941 cacaaatgtc tcctactcag aaaatgtttc ttttctttca gttgagattt ggtgcattca 3001 gggttgtagg ttggccttgc ttgctaaccc gccggtttta ccgtgcttta ttcctgaact 3061 ttgtttatgc ctttgtttgg ttcttctgaa attgcagcag actcattggg ctacatttag 3121 tacaggaacc acgtgtgtaa tgttatacaa cacagtcagt aatacaatca tccctcttag 3181 agtaaaaact acctctagat tgtgtaagct ttttactgtc cataaaacag gagccacagt 3241 accttatgaa tgcaaaactg taacttccta cagtgtttcc ccacagaaca ttgtctttct 3301 ggtgtctggg ctgtttttga aaaagtttcc attaatagac tttttagaaa ttattattag 3361 tagcattttt tttccagctt tgcgtcttca tcactcactc aagtgtcaga ctatgcactg 3421 taaatatctt cctaacatct ttaaatcgcc ttttcctcag ttttcaaggg gaaggtcatt 3481 tgtaaagcac gttaggtggt taaatcagtt attgcggttt tctcttacag caagcctttt 3541 taatcacccc caggctgcat tttattctat atcgcctttt ttcttcaaat ctgctccaat 3601 catccacttc tctcttataa gctattcctg cctcacacct aaatctgttt cagtgatcaa 3661 gggcagaact cattgtggcc ttatctttct ttgttgtaat tgttcactgt ctctttctta 3721 cagaccactt attctgagta gtagttattc ctccctatgg agtcatggca ggaatcatta 3781 cacagtgctt ttgttcagag catggacatg ttccaggtgc tgctttgctt taacggccac 3841 aagtttcctc cacttctcag gtttggtatt tagtaaggaa tcaattaaat taaccaataa 3901 caaaagagat acttttgaag aacaaactat tctttaccca ttttgtagct caaaaataat 3961 ttttcaagtt catgacctta ttaaaatgaa cttgtgtttt tttaacaaac gtctatttta 4021 ttttgatagt ttctttccga agataattga aatattatac tgtaaccctt ttcttttctt 4081 ttttgaaaag tccaagaatg tacttataca ggatttttcc ccacctattt ttggccattc 4141 tcataccaca gacaaaagag tgaatgattg tcattgtagc ttattgttta tcagtagttc 4201 ttttgtagct gcttacattt tttctttcat ggtttgtgaa tcatttcagt atgtaattta 4261 taggaacctt gtcctctggt atagtagact gtgtgccctc ctccaggatg gcattattag 4321 acatgctggt catttaccct cagaaagact ctcttataga atggtgagtg cttcagttat 4381 agtatgtttg aattttaaaa aattcctgtt tagaatgtat ctatgctctc atgactatgc 4441 agtttctaac atacacatag aagctgagtc tctgatccaa tatgttttta tttgttccat 4501 taatttatca catagattgg gaaggcaagc taaaagcctt aaaaatgccc tttatatttt 4561 gagtgatttc agcgttgaac acagtatact atctaaattt gctgctcact ttcttaaact 4621 gttgcaatta aaggcatgtt tatacatgac taatcgtgaa atgtttgtca ctcttactgc 4681 acagacttat ctgcaatcaa actggttagt ttttttgttt tgttttgttt tattgttttt 4741 aatgaatctg gtaccatctg tgctttcaca aaaaacttcc aatgccattt ttgagaacta 4801 acctaacagt catgctaacc agaaaatcca ctggggagga ggttcctttg aaacaaaatg 4861 ctgttcagtt agtaaccaag ttactttgat tgcaaaagca gctgtgtttc tgataagtac 4921 tgaacaaatg tgtgtaattt tctgtgccag acttatgact ttgttttcaa gcactgtaat 4981 gtgggatgga tggttagaaa caataatata tagggtttct gttaaccctt tcaggactca 5041 actgtatctc cttttgttaa ttttcccctg tgttgtgata aattgtttgc cagcattcag 5101 tactgtgttg gtgcagatga ggtttatatc tcattttagc ttatttcttg tacctttcag 5161 catgcctacg cattcagtcc ttaaggggtt tattttacaa actgtgcgcc tgaagtttat 5221 tagcaataag atagaaaatg agcaagttta taccataatt ttgagaaaaa aagaatctgc 5281 tcagttccat atttcatccg tgaaaaactt gcaatacgag cagtttcaag gaataaataa 5341 aaaggaaatg aaaccattgt // LOCUS HUMKBLOOD 2458 bp mRNA PRI 03-NOV-1993 DEFINITION Human kell blood group protein mRNA. ACCESSION M64934 NID g413776 KEYWORDS kell blood group protein. SOURCE Homo sapiens (library: bone marrow DNA) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2458) AUTHORS Lee,S., Zambas,E.D., Marsh,W.L. and Redman,C.M. TITLE Molecular cloning and primary structure of Kell blood group protein JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88, 6353-6357 (1991) MEDLINE 91296819 FEATURES Location/Qualifiers source 1..2458 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 124..2322 /codon_start=1 /product="kell blood group protein" /db_xref="PID:g413777" /translation="MEGGDQSEEEPRERSQAGGMGTLWSQESTPEERLPVEGSRPWAV ARRVLTAILILGLLLCFSVLLFYNFQNCGPRPCETSVCLDLRDHYLASGNTSVAPCTD FFSFACGRAKETNNSFQELATKNKNRLRRILEVQNSWHPGSGEEKAFQFYNSCMDTLA IEAAGTGPLRQVIEELGGWRISGKWTSLNFNRTLRLLMSQYGHFPFFRAYLGPHPASP HTPVIQIDQPEFDVPLKQDQEQKIYAQIFREYLTYLNQLGTLLGGDPSKVQEHSSLSI SITSRLFQFLRPLEQRRAQGKLFQMVTIDQLKEMAPAIDWLSCLQATFTPMSLSPSQS LVVHDVEYLKNMSQLVEEMLLKQRDFLQSHMILGLVVTLSPALDSQFQEARRKLSQKL RELTEQPPMPARPRWMKCVEETGTFFEPTLAALFVREAFGPSTRSAAMKLFTAIRDAL ITRLRNLPWMNEETQNMAQDKVAQLQVEMGASEWALKPELARQEYNDIQLGSSFLQSV LSCVRSLRARIVQSFLQPHPQHRWKVSPWDVNAYYSVSDHVVVFPAGLLQPPFFHPGY PRAVNFGAAGSIMAHELLHIFYQLLLPGGCLACDNHALQEAHLCLKRHYAAFPLPSRT SFNDSLTFLENAADVGGLAIALQAYSKRLLRHHGETVLPSLDLSPQQIFFRSYAQVMC RKPSPQDSHDTHSPPHLRVHGPLSSTPAFARYFRCARGALLNPSSRCQLW" polyA_signal 2424..2429 BASE COUNT 561 a 725 c 634 g 538 t ORIGIN 1 cgggaagtgc cccttctcca ggatcaagga actggggcgg ggggtgtttc ctggacccca 61 gtcctccgaa tcagctccta gagtggaacc aggaaggatt ctggagccac agaagataga 121 cagatggaag gtggggacca aagtgaggaa gagccgaggg aacgcagcca ggcaggtgga 181 atgggaactc tctggagcca agagagcact ccagaagaga ggctgcccgt ggaagggagc 241 aggccatggg cagtggccag gcgggtgctg acagctatcc tgattttggg cctgctcctt 301 tgtttttctg tgcttttgtt ctacaacttc cagaactgtg gccctcgccc ctgtgagaca 361 tctgtgtgtt tggatctccg ggatcattac ctggcctctg ggaacacaag tgtggccccc 421 tgcaccgact tcttcagctt tgcctgtgga agggccaaag agaccaataa ttcttttcag 481 gagcttgcca caaagaacaa aaaccgactt cggagaatac tggaggtcca gaattcctgg 541 cacccaggct ctggggagga gaaagccttc cagttctaca actcctgcat ggatacactt 601 gccattgaag ctgcagggac tggtcccctc agacaagtta ttgaggagct tggaggctgg 661 cgcatctctg gtaaatggac ttccttaaac tttaaccgaa cgctgagact tctgatgagt 721 cagtatggcc atttcccttt cttcagagcc tacctaggac ctcatcctgc ctctccacac 781 acaccagtca tccagataga ccagccagag tttgatgttc ccctcaagca agatcaagaa 841 cagaagatct atgcccagat ctttcgggaa tacctgactt acctgaatca gctgggaacc 901 ttgctgggag gagacccaag caaggtgcaa gaacactctt ccttgtcaat ctccatcact 961 tcacggctgt tccagtttct gaggcccctg gagcagcggc gggcacaggg caagctcttc 1021 cagatggtca ctatcgacca gctcaaggaa atggcccccg ccatcgactg gttgtcctgc 1081 ttgcaagcga cattcacacc gatgtccctg agcccttctc agtccctcgt ggtccatgac 1141 gtggaatatt tgaaaaacat gtcacaactg gtggaggaga tgctgctaaa gcagagggac 1201 tttctgcaga gccacatgat cttagggctg gtggtgaccc tttctccagc cctggacagt 1261 caattccagg aggcacgcag aaagctcagc cagaaactgc gggaactgac agagcaacca 1321 cccatgcctg cccgcccacg atggatgaag tgcgtggagg agacaggcac gttcttcgag 1381 cccacgctgg cggctttgtt tgttcgtgag gcctttggcc cgagcacccg aagtgctgcc 1441 atgaaattat tcactgcgat ccgggatgcc ctcatcactc gcctcagaaa ccttccctgg 1501 atgaatgagg agacccagaa catggcccag gacaaggttg ctcaactgca ggtggagatg 1561 ggggcttcag aatgggccct gaagccagag ctggcccgac aagaatacaa cgatatacag 1621 cttggatcga gcttcctgca gtctgtcctg agctgtgtcc ggtccctccg agctagaatt 1681 gtccagagct tcttgcagcc tcacccccaa cacaggtgga aggtgtcccc ttgggacgtc 1741 aatgcttact attcggtatc tgaccatgtg gtagtctttc cagctggact cctccaaccc 1801 ccattcttcc accctggcta tcccagagcc gtgaactttg gcgctgctgg cagcatcatg 1861 gcccacgagc tgttgcacat cttctaccag ctcttactgc ctgggggctg cctcgcctgt 1921 gacaaccatg ccctccagga agctcacctg tgcctgaagc gccattatgc tgcctttcca 1981 ttacctagca gaacctcctt caatgactcc ctcacattct tagagaatgc tgcagacgtt 2041 ggggggctag ccatcgcgct gcaggcatac agcaagaggc tgttacggca ccatggggag 2101 actgtcctgc ccagcctgga cctcagcccc cagcagatct tctttcgaag ctatgcccag 2161 gtgatgtgta ggaagcccag cccccaggac tctcacgaca ctcacagccc tccacacctc 2221 cgagtccacg ggcccctcag cagcacccca gcctttgcca ggtatttccg ctgtgcacgt 2281 ggtgctctct tgaacccctc cagccgctgc cagctctggt aacttggtta ccaaagatgc 2341 cacagcacag aaatatcgac caacacctcc ctggtcacat ccatggaatc agagcaagat 2401 ttcctttctg cttctgttcc aaaaataaaa gctggcactt ggcttccgcc cggaattc // LOCUS HUMKCHC 1500 bp mRNA PRI 24-SEP-1992 DEFINITION Human potassium channel mRNA, complete cds. ACCESSION L02752 NID g186668 KEYWORDS potassium channel. SOURCE Homo sapiens neuronal cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1500) AUTHORS Ramashwami,M., Gautam,M., Kamb,A.A., Rudy,B., Tanouye,M.A. and Mathew,M.K. TITLE Human potassium channel genes: molecular cloning and functional expression JOURNAL Mol. Cell. Neurosci. 1, 214-223 (1990) FEATURES Location/Qualifiers source 1..1500 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="neuronal" CDS 1..1500 /codon_start=1 /product="potassium channel" /db_xref="PID:g186669" /translation="MTVATGDPADEAAALPGHPQDTYDPEADHECCERVVINISGLRF ETQLKTLAQFPETLLGDPKKRMRYFDPLRNEYFFDRNRPSFDAILYYYQSGGRLRRPV NVPLDIFSEEIRFYELGEEAMEMFREDEGYIKEEERPLPENEFQRQVWLLFEYPESSG PARIIAIVSVMVILISIVSFCLETLPIFRDENEDMHGSGVTFHTYSNSTIGYQQSTSF TDPFFIVETLCIIWFSFEFLVRFFACPSKAGFFTNIMNIIDIVAIIPYFITLGTELAE KPEDAQQGQQAMSLAILRVIRLVRVFRIFKLSRHSKGLQILGQTLKASMRELGLLIFF LFIGVILFSSAVYFAEADERESQFPSIPDAFWWAVVSMTTVGYGDMVPTTIGGKIVGS LCAIAGVLTIALPVPVIVSNFNYFYHRETEGEEQAQYLQVTSCPKIPSSPDLKKSRSA STISKSDYMEIQEGVNNSNEDFREENLKTANCTLANTNYVNITKMLTDV" BASE COUNT 381 a 372 c 370 g 377 t ORIGIN 1 atgacagtgg ccaccggaga cccagcagac gaggctgctg ccctccctgg gcacccacag 61 gacacctatg acccagaggc agaccacgag tgctgtgaga gggtggtgat caacatctca 121 gggctgcggt ttgagaccca gctaaagacc ttagcccagt ttccagagac cctcttaggg 181 gacccaaaga aacgaatgag gtactttgac cccctccgaa atgagtactt tttcgatcgg 241 aaccgcccta gctttgatgc cattttgtac tactaccagt cagggggccg attgaggcga 301 cctgtgaatg tgcccttaga tatattctct gaagaaattc ggttttatga gctgggagaa 361 gaagcgatgg agatgtttcg ggaagatgaa ggctacatca aggaggaaga gcgtcctctg 421 cctgaaaatg agtttcagag acaagtgtgg cttctctttg aatacccaga gagctcaggg 481 cctgccagga ttatagctat tgtgtctgtc atggtgattc tgatctcaat tgtcagcttc 541 tgtctggaaa cattgcccat cttccgggat gagaatgaag acatgcatgg tagtggggtg 601 accttccaca cctattccaa cagcaccatc gggtaccagc agtccacttc cttcacagac 661 cctttcttca ttgtagagac actctgcatc atctggttct cctttgaatt cttggtgagg 721 ttctttgcct gtcccagcaa agccggcttc ttcaccaaca tcatgaacat cattgacatt 781 gtggccatca tcccctactt catcaccctg gggacagagt tggctgagaa gccagaggac 841 gctcagcaag gccagcaggc catgtcactg gccatcctcc gtgtcatccg gttggtaaga 901 gtctttagga ttttcaagtt gtccagacac tccaaaggtc tccagattct aggtcagacc 961 ctcaaagcca gcatgagaga attgggcctc ctgatattct ttctcttcat aggggtcatc 1021 cttttctcta gtgctgtgta ttttgcagag gccgatgagc gagagtccca gttccccagc 1081 atcccagatg ccttctggtg ggcagtcgtc tccatgacaa ctgtaggcta tggagacatg 1141 gttccgacta ccattggggg aaagatagtg ggttccctat gtgcgattgc aggtgtgtta 1201 actattgcct taccggtccc tgtcattgtg tccaatttca actacttcta ccaccgggag 1261 acagagggag aggagcaggc ccaatacttg caagtgacaa gctgtccaaa gatcccatcc 1321 tcccctgacc taaagaaaag tagaagtgcc tctaccatta gtaagtctga ttacatggag 1381 atccaggagg gtgtaaataa cagtaatgag gactttagag aggaaaactt gaaaacagcc 1441 aactgtacct tggctaacac aaactatgtg aatattacca aaatgttaac tgatgtctga // LOCUS HUMKG1AA 6962 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0100 gene, complete cds. ACCESSION D43947 NID g603948 KEYWORDS KIAA0100. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6962) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (16-DEC-1994) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 6962) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayashi,Y., Sato,S., Nagase,T., Seki,T., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. III. The coding sequences of 40 new genes (KIAA 0081 - KIAA 0120) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG1 JOURNAL Unpublished (1995) REFERENCE 3 (sites) AUTHORS Nagase,T., Miyajima,N., Tanaka,A., Sazuka,T., Seki,N., Sato,S., Tabata,S., Ishikawa,K.-i., Kawarabayasi,Y., Kotani,H. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. III. The coding sequences of 40 new genes (KIAA0081-KIAA0120) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (1), 37-43 (1995) MEDLINE 95308325 FEATURES Location/Qualifiers source 1..6962 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..329 gene 330..6608 /gene="KIAA0100" CDS 330..6608 /gene="KIAA0100" /note="KIAA0100 is a human counterpart of mouse e1 gene." /citation=[3] /codon_start=1 /db_xref="PID:d1008477" /db_xref="PID:g603949" /translation="MVLKVDTSESLWHIQISRSRFLLDSDGKRLICEVSLCKINSKVL KSGQLEDTCLVELSLALDLCLKVGISSRHLTAITVDVWTLHAELHEGLFQSQLLCQGP SLASKPVPCSEVTENLVEPTLPGLFLLQQLPDQVKVKMENTSVVLSMNSQKRHLTWTL KLLQFLYHRDEDQLPLRSFTANSDMAQMSTELLLEDGLLLSQSRQRIVCLNSLKASVQ VTTIDLSASLVLNTCIIHYRHQEFSHWLHLLALETQGSSSPVLKQRKKRTFPQILAPI IFSTSISNVNISIQLGDTPPFALGFNSISLDYQHLRPQSIHQRGVLTVDHLCWRVGSD SHIQRAPHPPNMHVWGEALVLDSFTLQGSYNQPLGLSSTQSDTLFLDCTIRGLQVEAS DTCAQCLSRILSLMGPQSGKSAVSRHSSFGESVSLLWKVDLKVEDMNLFTLSALVGAS EVRLDTLTILGSAETSTVGIQGLVLALVKSVTEKMQPCCKAPDIPTPVLSLSMLSITY HSSIRSLEVQCGAGLTLLWSPPDHMYLYQHVLATLQCRDLLRATVFPETVPSLALETS GTTSELEGRAPEPLPPKRLLNLTLEVSTAKLTAFVAEDKFITLAAESVSLSRHGGSLQ AYCPELAAGFDGNSIFNFKEVEVQLLPELEEMILHRNPFPALQTLRNRVWLLSFGSVS VEFPYQYDFSRTLDEAVGVQKWLKGLHQGTRAWASPSPVPLPPDLLLKVEHFSWVFLD DVFEVKLHDNYELMKDESKESAKRLQLLDAKVAALRKQHGELLPARKIEELYASLERK NIEIYIQRSRRLYGNTPMRRALLTWSLAGLELVALADASFHGPEHVVEQVQELDPGSP FPPEGLDLVIQWCRMLKCNVKSFLVRIRDYPRYLFEIRDWRLMGRLVGTEQSGQPCSR RRQILHLGLPWGNVAVERNMPPLKFYHDFHSEIFQYTVVWGPCWDPAWTLIGQCVDLL TKPSADPSPPLPWWDKSRLLFHGDWHMDIEQANLHQLATEDPYNTTENMHWEWSHLSF HWKPGQFVFKGDLDINVRTASKYDDCCFLHLPDLCMTLDLQWLCHGNPHDHHSVTLRA PEFLPEVPLGQLHDSYRAFRSENLNLSIKMDLTRHSGTISQPRILLYSSTLRWMQNFW ATWTSVTRPICRGKLFNNLKPSKKKLGQHYKQLSYTALFPQLQVHYWASFAQQRGIQI ECSQGHVFTRGTQRLIPQAGTVMRRLISDWSVTQMVSDLSQVTVHLMASPTEENADHC LDPLVTKTHLLSLSSLTYQRHSNRTAEEELSARDGDPTFHTHQLHLVDLRISWTTTNR DIAFGLYDGYKKAAVLKRNLSTEALKGLKIDPQMPAKKPKRGVPTSASAPPRVNTPSF SGQPDKGSSGGAYMLQKLIEETDRFVVFTEEESGMSDQLCGIAACQTDDIYNRNCLIE LVNCQMVLRGAETEGCVIVSAAKAQLLQCQHHPAWYGDTLKQKTSWTCLLDGMQYFAT TESSPTEQDGRQLWLEVKNIEEHRQRSLDSVQELMESGQAVGGMVTTTTDWNQPAEAQ QAQQVQRIISRCNCRMYYISYSHDIDPELATQIKPPEVLENQEKEDLLKKQEGAVDTF TLIHHELEISTNPAQYAMILDIVNNLLLHVEPKRKEHSEKKQRVRFQLEISSNPEEQR SSILHLQEAVRQHVAQIRQLEKQMYSIMKSLQDDSKNENLLDLNQKLQLQLNQEKANL QLESEELNILIRCFKDFQLQRANKMELRKQQEDVSVVRRTEFYFAQARWRLTEEDGQL GIAELELQRFLYSKVNKSDDTAEHLLELGWFTMNNLLPNAVYKVVLRPQSSCQSGRQL ALRLFSKVRPPVGGISVKEHFEVNVVPLTIQLTHQFFHRMMGFFFPGRSVEDDEVGDE EDKSKLVTTGIPVVKPRQLIATDDAVPLGPGKGVAQGLTRSSGVRRSFRKSPEHPVDD IDKMKERAAMNNSFIYIKIPQVPLCVSYKGEKNSVDWGDLNLVLPCLEYHNNTWTWLD FAMAVKRDSRKALVAQVIKEKLRLKSATGSEVRGKLETKSDLNMQQQEEEEKARLLIG LSVGDKNPGKKSIFGRRK" 3'UTR 6609..6962 BASE COUNT 1719 a 1779 c 1814 g 1650 t ORIGIN 1 cgcgatgggc acctgcagca tcggtctggg gagaacagca ttcgagctgc cgggtggctt 61 gtggtccggt tggccaccaa gtggtgtcag cggaagctgc aggcggagct aaagattggc 121 tccttccgct ttttttggat ccagaatgtc agtcttaagt ttcagcaaca ccagcaaaca 181 gtggaaattg ataacctgtg gatttccagc aaactcctta gccatgatct tccagcgctg 241 gggtggatca aaaggaactg tccttcagcc catccttatt gaagatcttc tgccaactat 301 tctccattca tgtagatgct ataaacatca tggttctcaa ggtggatacc tctgagtcct 361 tatggcatat tcagatcagt agaagcagat ttcttttgga tagtgatggg aaaaggctaa 421 tctgtgaggt gagcttatgt aagatcaaca gcaaagttct aaagagtggt cagctggagg 481 acacctgcct agtggagctt tcactggccc tggacctgtg tctaaaggtg ggcattagca 541 gtcggcatct cactgctatc actgtggatg tgtggacact ccatgctgaa ctgcatgagg 601 gcctcttcca gagccaactg ctgtgccagg gcccaagcct agcatctaag cctgttccct 661 gttcagaggt gacagaaaac ttagttgagc caactctgcc tggcctattc cttctccagc 721 agctgccaga ccaggtcaag gttaagatgg agaacacaag cgtggtattg tccatgaata 781 gtcaaaagag gcacctgact tggactctga agctgctgca gttcctgtac caccgtgatg 841 aggatcagct gccccttcga agcttcacag caaactctga tatggcacag atgagcactg 901 aactgctgct ggaagatggg ttgttgttgt cccagagtcg ccaacgcatt gtctgcctca 961 actccctcaa ggctagtgtg caggtgacca ccattgacct ctcagcctcc ctagttctga 1021 acacttgcat cattcactac cggcaccagg aattctctca ctggctgcac ctgctagcac 1081 tggaaaccca agggtctagt tcacctgttc taaagcaaag gaaaaaaaga accttccccc 1141 aaatcctggc tcccatcatc tttagcacct ccatctccaa tgtcaacatt tccattcaac 1201 ttggagatac accacctttt gccttgggat tcaattctat ctctctggat taccagcacc 1261 tcaggccaca aagcatccat cagcggggcg tcctaactgt ggaccacctc tgctggcgtg 1321 tgggcagtga ctcccacatt cagcgggcgc cacacccacc caatatgcat gtttggggtg 1381 aggcacttgt tctggactcc ttcacactac agggtagcta taaccagcct ctgggcctgt 1441 ccagcaccca gtcagatacc ctttttcttg attgtaccat tcgaggactt caggtggaag 1501 catcagatac ctgtgcccaa tgtctgtctc gtatcttatc cctgatgggt ccacaatctg 1561 ggaagtcagc tgtctctagg cactcttcat ttggggaatc tgtgtcatta ctgtggaagg 1621 tggacttgaa ggtcgaagac atgaacttgt ttaccctttc tgccttggtt ggtgcttcag 1681 aggtacgact ggacacccta actatcctgg gcagtgcaga gacgtccact gtggggattc 1741 aaggacttgt gttagcgctg gtgaaatcag tcacggagaa gatgcaaccc tgttgcaagg 1801 cccctgacat ccctacccca gtgctcagcc tttccatgct ctccatcacc tatcacagca 1861 gcatccgctc tctggaggtt cagtgtggtg cagggctgac cttactttgg agccccccag 1921 atcacatgta cctgtaccag catgtcctgg ccactctaca gtgccgagac ctactaagag 1981 ccactgtgtt tcctgagact gtaccatccc ttgcactaga gacttcagga actacttctg 2041 agctagaagg ccgtgcccct gagccattac ccccaaagcg gctgctaaac ctaaccctgg 2101 aggtgagcac agccaagctc acagcttttg tagctgagga caagttcatt accctggctg 2161 cagagagtgt gtcactgagc cggcatggag gttccctgca ggcatactgt ccagagctgg 2221 ctgctggctt tgatggcaat agtatcttca acttcaagga ggtggaggtg cagctgctac 2281 ctgagctgga agagatgatc ctccaccgga accccttccc tgcgctgcag accctccgga 2341 accgtgtttg gctcctctct ttcggctcag tctcggtgga gtttccttat cagtatgact 2401 tttctcgaac tctagatgag gctgtgggag ttcagaagtg gctgaaggga ctacatcaag 2461 ggactcgtgc ttgggcctct ccaagccctg tcccactccc acctgatcta ctcttaaagg 2521 ttgagcactt ctcatgggtt ttcttggatg atgtttttga ggtgaaactt catgataact 2581 acgagctgat gaaggatgaa agtaaggaga gtgccaaaag actacagcta ctggatgcta 2641 aagtggccgc ccttcggaag cagcatgggg agttgttgcc tgcccgcaaa attgaggagc 2701 tctatgcctc tttggaacgc aaaaacattg aaatctacat ccagcgttcc cgtcgtctct 2761 atggcaacac acccatgcgc cgggcactgc ttacttggag cttagcaggg ctagaactgg 2821 tagctctggc agatgcctcc ttccatggtc ctgagcatgt ggtagaacag gttcaagagc 2881 ttgatccagg cagccctttt ccccctgagg gattagatct tgtcattcag tggtgtcgaa 2941 tgctcaagtg caatgtcaag agctttctgg ttcggatcag ggactatcca cggtacctgt 3001 ttgagatccg tgactggcgg ctaatgggtc gacttgtggg caccgagcag agtggtcagc 3061 cttgctcccg tcggcgtcag atcttgcact tggggcttcc gtggggtaac gtggcagtgg 3121 agaggaacat gcccccactc aaattctacc atgactttca ctcggaaata ttccagtaca 3181 cagtggtgtg gggcccatgc tgggatccag cctggacact aattggccag tgtgtggacc 3241 tcttgaccaa gccctcagct gaccccagcc cacctttgcc ctggtgggac aagagccgtc 3301 ttctgttcca tggagactgg cacatggaca ttgaacaggc gaacctgcac cagctggcca 3361 ctgaggatcc atacaacaca actgaaaata tgcactggga gtggagccac ctgtcttttc 3421 attggaaacc tggtcagttt gtgttcaagg gtgacttgga tatcaacgtg agaacagcct 3481 ctaagtatga cgactgctgc ttccttcacc tgcctgacct ctgcatgaca ctggacctgc 3541 agtggctgtg ccatgggaac ccccatgatc accatagtgt cactctgcgg gccccagagt 3601 tcctgcctga ggtgcccttg ggccagcttc atgactccta ccgggccttt cgctcggaga 3661 acctcaatct ctccatcaag atggatctga ctcggcacag tggaacaata tcccagcccc 3721 gaattctgct atatagtagt accctgcgct ggatgcaaaa cttctgggca acttggacaa 3781 gtgtcacaag gcctatctgc aggggaaagc tcttcaataa cctgaaaccc agcaagaaga 3841 aacttggtca gcactacaag caactttcct atacagccct ctttccccag ctgcaggtac 3901 attattgggc ctcatttgcc cagcaacggg gcatccagat tgagtgcagt cagggccatg 3961 tcttcactcg ggggactcag cggcttatac ctcaagcagg cacagtgatg cggcgcctta 4021 tctctgattg gagtgttacc cagatggtga gtgacctaag tcaggtgacc gttcacctga 4081 tggcctcacc cactgaagag aatgctgatc actgtcttga tcccttggta acaaagaccc 4141 acctgctgag cttgtcctcc ctcacctacc aacggcatag caatcgcaca gctgaggagg 4201 agctctctgc tcgtgatggg gatcctacct ttcatacaca tcagctgcac ttagtagatt 4261 tacggatttc ctggacaact accaatcgag acattgcctt tgggttatat gatgggtaca 4321 aaaaggcagc tgtactcaaa cgtaatcttt ctactgaggc cctgaagggg ttaaagattg 4381 atccacagat gccagccaaa aagccaaagc ggggtgtccc aactagtgcc tcagccccac 4441 ctcgtgttaa cactcccagc ttcagtggac aacctgataa ggggtcatca ggaggtgctt 4501 acatgttgca gaagctaatt gaagagacag ataggtttgt agtgttcaca gaagaggaat 4561 caggcatgag tgaccagttg tgtggcattg ctgcctgcca gacggatgac atatacaacc 4621 gaaactgcct tattgaattg gtcaactgtc agatggttct tcgtggagca gagacagaag 4681 gctgtgtcat tgtgtcagct gccaaagccc aactgctgca gtgccagcac catccagcct 4741 ggtatggtga tacattgaag caaaagacat cctggacttg cctcttggat ggcatgcagt 4801 actttgccac cactgaaagc agccccacag agcaggatgg ccgacagctc tggttagagg 4861 tgaagaatat cgaggagcac cggcagcgta gtctggactc tgtgcaggag ctgatggaga 4921 gtgggcaggc agtgggcggc atggttacca caaccacaga ttggaaccag ccagctgagg 4981 cacagcaagc ccagcaagtc cagcggatca tttcgcgttg caactgccga atgtactata 5041 ttagttacag ccatgacatt gatcctgaac tagcaactca gattaagcca cctgaagttc 5101 ttgagaacca ggaaaaggaa gatctcctaa agaagcagga aggggctgtg gataccttca 5161 cccttatcca ccatgagctg gaaatttcca ccaacccagc tcagtatgcc atgatcctgg 5221 acattgtcaa caacctgctg ctccatgtag aacctaagcg gaaggaacat agtgagaaga 5281 agcaacgggt caggttccag cttgagatct ctagcaatcc agaggagcaa cgcagcagca 5341 tactgcattt gcaggaggct gtgcggcagc atgtggccca aatacgacag ctggagaagc 5401 agatgtattc tatcatgaag tctttgcagg atgacagcaa gaatgagaat ctgcttgacc 5461 tgaaccagaa gcttcagttg cagctaaacc aggagaaggc caacctgcag ctggaaagtg 5521 aagaactgaa tatcctcatc aggtgtttta aggatttcca actgcagcgg gctaacaaga 5581 tggagctgcg aaagcagcaa gaagatgtga gtgtggtccg tcgcactgag ttttactttg 5641 ctcaggcacg gtggcgcctg acagaggaag atggacagct gggaattgct gaattagaac 5701 tgcagaggtt cctctacagc aaggtgaata agtctgatga cacagcagaa catcttctgg 5761 agttgggctg gtttaccatg aacaacctcc tccccaatgc tgtctataag gtagtactgc 5821 ggccccagag ctcctgccag tctgggcgac agctagctct ccgcctcttc agcaaagttc 5881 ggccccctgt tgggggtatc tctgttaagg agcattttga ggtaaatgtg gtgcctctca 5941 ccatccagct gacacaccag ttcttccaca gaatgatggg ctttttcttt cctggccgaa 6001 gtgtggaaga tgatgaagtt ggtgatgaag aggataagtc caaactggtg actactggaa 6061 taccagtggt gaagcctcgg cagctgattg caacagatga tgcagtacca ctgggccctg 6121 ggaagggtgt ggcacagggt ttgactcgga gttctggggt cagaaggtca tttcgcaaat 6181 cgccagagca ccctgtggat gacattgaca agatgaaaga gcgagctgcc atgaacaact 6241 ccttcatcta cataaagatt ccacaggttc cactgtgtgt cagctacaag ggtgagaaga 6301 acagtgtgga ctggggtgac cttaacctgg tgctgccctg tctggagtac cacaacaaca 6361 catggacatg gctagacttt gccatggctg tcaaaaggga cagccgcaaa gccctggttg 6421 cccaggtaat caaagagaag ctaaggctga agtctgcaac aggctctgag gtccggggaa 6481 agctagaaac taaatcggac ctgaacatgc aacagcagga agaggaggag aaagcccggc 6541 tcctcattgg tttaagtgtg ggcgacaaga accctggcaa gaagtccatc tttggcaggc 6601 gcaaatgatt tggcgattcg agtggctgca gtacaggatc tgactctggc tcaggctcca 6661 gggacttgtg gggtgggagg ggcttcccgt tatccacgag gatttgtggg tgtcagagcc 6721 cataggcatc actcttcagc acctggtctg ttcgctgcag ggcatggtgg acagtaatgc 6781 tgagttctgt ctcacactga tcaggctcag ggccagagag gcaacaagag agcaagaccc 6841 agggaatggg cccagggcag gaccctattc ccttggggtc aagtgaaagg gtagggggat 6901 agtcctgatc aagtgtgata aatttttata gacatatata aatatatata tattatatat 6961 at // LOCUS HUMKG1C 6428 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0045 gene, complete cds. ACCESSION D28476 NID g460710 KEYWORDS KIAA0045. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6428) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (08-FEB-1994) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 6428) AUTHORS Miyajima,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nomura,N., Nagase,T., Miyajima,N., Sazuka,T., Tanaka,A., Sato,S., Seki,N., Kawarabayasi,Y., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. II. The coding sequences of 40 new genes (KIAA0041-KIAA0080) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 1 (5), 223-229 (1994) MEDLINE 96051398 FEATURES Location/Qualifiers source 1..6428 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..109 gene 110..6088 /gene="KIAA0045" CDS 110..6088 /gene="KIAA0045" /codon_start=1 /db_xref="PID:d1006384" /db_xref="PID:g460711" /translation="MSNRPNNNPGGSLRRSQRNTAGAQPQDDSIGGRSCSSSSAVIVP QPEDPDRANTSERQKTGQVPKKDNSRGVKRSASPDYNRTNSPSSAKKPKALQHTESPS ETNKPHSKSKKRHLDQEQQLKSAQSPSTSKAHTRKSGATGGSRSQKRKRTESSCVKSG SGSESTGAEERSAKPTKLASKSATSAKAGCSTITDSSSAASTSSSSSAVASASSTVPP GARVKQGKDQNKARRSRSASSPSPRRSSREKEQSKTGGSSKFDWAARFSPKVSLPKTK LSLPGSSKSETSKPGPSGLQAKLASLRKSTKKRSESPPAELPSLRRSTRQKTTGSCAS TSRRGSGLGKRGAAEARRQEKMADPESNQEAVNSSAARTDEAPQGAAGAVGMTTSGES ESDDSEMGRLQALLEARGLPPHLFGPLGPRMSQLFHRTIGSGASSKAQQLLQGLQASD ESQQLQAVIEMCQLLVMGNEETLGGFPVKSVVPALITLLQMEHNFDIMNHACRALTYM MEALPRSSAVVVDAIPVFLEKLQVIQCIDVAEQALTALEMLSRRHSKAILQAGGLADC LLYLEFFSINAQRNALAIAANCCQSITPDEFHFVADSLPLLTQRLTHQDKKSVESTCL CFARLVDNFQHEENLLQQVASKDLLTNVQQLLVVTPPILSSGMFIMVVRMFSLMCSNC PTLAVQLMKQNIAETLHFLLCGASNGSCQEQIDLVPRSPQELYELTSLICELMPCLPK EGIFAVDTMLKKGNAQNTDGAIWQWRDDRGLWHPYNRIDSRIIEQINEDTGTARAIQR KPNPLANSNTSGYSESKKDDARAQLMKEDPELAKSFIKTLFGVLYEVYSSSAGPAVRH KCLRAILRIIYFADAELLKDVLKNHAVSSHIASMLSSQDLKIVVGALQMAEILMQKLP DIFSVYFRREGVMHQVKHLAESESLLTSPPKACTNGSGSMGSTTSVSSGTATAATHAA ADLGSPSLQHSRDDSLDLSPQGRLSDVLKRKRLPKRGPRRPKYSPPRDDDKVDNQAKS PTTTQSPKSSFLASLNPKTWGRLSTQSNSNNIEPARTAGGSGLARAASKDTISNNREK IKGWIKEQAHKFVERYFSSENMDGSNPALNVLQRLCAATEQLNLQVDGGAECLVEIRS IVSESDVSSFEIQHSGFVKQLLLYLTSKSEKDAVSREIRLKRFLHVFFSSPLPGEEPI GRVEPVGNAPLLALVHKMNNCLSQMEQFPVKVHDFPSGNGTGGSFSLNRGSQALKFFN THQLKCQLQRHPDCANVKQWKGGPVKIDPLALVQAIERYLVVRGYGRVREDDEDSDDD GSDEEIDESLAAQFLNSGNVRHRLQFYIGEHLLPYNMTVYQAVRQFSIQAEDERESTD DESNPLGRAGIWTKTHTIWYKPVREDEESNKDCVGGKRGRAQTAPTKTSPRNAKKHDE LWHDGVCPSVSNPLEVYLIPTPPENITFEDPSLDVILLLRVLHAISRYWYYLYDNAMC KEIIPTSEFINSKLTAKANRQLQDPLVIMTGNIPTWLTELGKTCPFFFPFDTRQMLFY VTAFDRDRAMQRLLDTNPEINQSDSQDSRVAPRLDRKKRTVNREELLKQAESVMQDLG SSRAMLEIQYENEVGTGLGPTLEFYALVSQELQRADLGLWRGEEVTLSNPKGSQEGTK YIQNLQGLFALPFGRTAKPAHIAKVKMKFRFLGKLMAKAIMDFRLVDLPLGLPFYKWM LRQETSLTSHDLFDIDPVVARSVYHLEDIVRQKKRLEQDKSQTKESLQYALETLTMNG CSVEDLGLDFTLPGFPNIELKKGGKDIPVTIHNLEEYLRLVIFWALNEGVSRQFDSFR DGFESVFPLSHLQYFYPEELDQLLCGSKADTWDAKTLMECCRPDHGYTHDSRAVKFLF EILSSFDNEQQRLFLQFVTGSPRLPVGGFRSLNPPLTIVRKTFESTENPDDFLPSVMT CVNYLKLPDYSSIEIMREKLLIAAREGQQSFHLS" 3'UTR 6089..6428 BASE COUNT 1939 a 1329 c 1522 g 1638 t ORIGIN 1 gctagtggaa gttactgccg cgccaccgag tccggaccgg agactttggg gcctaactag 61 tgaatggtag tgtctagaaa gggtatgtcc cttcaagaga gaggtgccaa tgtccaaccg 121 gcctaataac aatccagggg ggtcactgcg acgttcacag aggaacactg ccggggccca 181 accacaagac gactcaatag gaggaagaag ctgcagttca tcatctgctg tgatagttcc 241 acaaccagag gatccagaca gagccaatac ttcagaaaga caaaaaacgg ggcaggtgcc 301 taagaaagac aattctcgag gagtgaagcg cagtgctagt ccagactaca acaggaccaa 361 ttctcctagc tctgcaaaaa aaccaaaagc acttcagcat actgaatctc cctcagaaac 421 aaataagcca catagtaagt caaagaagag acatttagac caggagcaac aactgaaatc 481 tgcacaatca ccatcaacaa gcaaggctca taccaggaag agtggggcca ctggcggttc 541 acggagtcag aaaagaaaaa ggacagagag ttcttgtgta aagagtggct ccgggtctga 601 atcaactggt gcagaagaga gatctgcgaa acctaccaag ctggcttcaa aatcagccac 661 ctcagccaaa gctgggtgta gcaccatcac tgattcttct tctgctgcct ctacttcctc 721 ctcgtcttct gctgtagcct cggcctcctc cactgtacca ccaggtgcca gagtgaaaca 781 aggaaaagat cagaacaagg ccaggcgttc ccgttcagcg tccagtccca gccccagaag 841 aagtagcagg gaaaaggaac agagtaaaac tggtggctct tcaaaatttg attgggctgc 901 tcgtttcagc cctaaagtta gccttcctaa aacaaaactg tctcttccag ggtcttctaa 961 gtcagagaca tcaaaacctg gaccttctgg attacaggcc aaattagcaa gtttaagaaa 1021 atctacgaag aaacgcagtg agtctccacc tgctgagctc cccagtttga ggcggagcac 1081 acgccaaaag accacgggct cctgtgctag taccagtcgg cgaggctctg gcctgggcaa 1141 aagaggagca gctgaagctc gtcgacagga gaaaatggca gaccctgaaa gcaaccagga 1201 ggcagtaaat tcttcagctg ctcggacaga tgaagctccc caaggagctg caggggctgt 1261 tggcatgacc acctctgggg agagtgaatc agatgattcc gagatgggac gtttgcaagc 1321 tttgttagag gcaaggggtc ttccccctca cctatttggt cctcttggtc ctcggatgtc 1381 acagcttttc catagaacaa ttggaagtgg agctagttct aaggcccagc agctactaca 1441 aggattgcaa gccagtgatg aaagtcaaca gcttcaggca gttattgaga tgtgtcagtt 1501 actggtcatg ggaaatgagg agacactggg agggtttcct gtcaagagtg ttgttccagc 1561 tttgattacg ttacttcaga tggagcacaa ttttgatatt atgaaccatg cttgtcgagc 1621 cttaacatac atgatggaag cacttcctcg atcttctgct gttgtagtag atgctattcc 1681 tgtcttttta gaaaagctgc aagttattca gtgtattgat gtggcagagc aggccttgac 1741 tgccttggag atgttgtcac ggagacatag taaagccatt ctacaggcgg gtggtttggc 1801 agactgcttg ctgtacctag aattcttcag cataaatgcc caaagaaatg cattagcaat 1861 tgcagctaat tgctgccaga gtatcacgcc agatgaattt cattttgtgg cagattcact 1921 cccattgcta acccaaaggc taacacatca ggataaaaag tcagtagaaa gcacttgcct 1981 ttgttttgca cgcctagtgg acaacttcca gcatgaggag aatttactcc agcaggttgc 2041 ttccaaagat ctgcttacaa atgttcaaca gctgttggta gtgactccac ccattttaag 2101 ttctgggatg tttataatgg tggttcgcat gttttctctg atgtgttcca actgtccaac 2161 tttagctgtt caacttatga aacaaaacat tgcagaaacg cttcactttc tcctgtgtgg 2221 tgcctccaat ggaagttgtc aggaacagat tgatcttgtt ccacgaagcc ctcaagagtt 2281 gtatgaactg acatctctga tttgtgaact tatgccatgt ttaccaaaag aaggcatttt 2341 tgcagttgat accatgttga agaagggaaa tgcacagaac acagatggtg cgatatggca 2401 gtggcgtgat gatcggggcc tctggcatcc atataacagg attgacagcc ggatcattga 2461 gcaaatcaat gaggacacgg gaacagcacg tgccattcag agaaaaccta acccgttagc 2521 caatagtaac actagtggat attcagagtc aaagaaggat gatgctcgag cacagcttat 2581 gaaagaggat ccggaactgg ctaagtcttt tattaagaca ttatttggtg ttctttatga 2641 agtgtatagt tcctcagcag gacctgcggt cagacataag tgccttagag caattcttag 2701 gataatttat tttgcggatg ctgaacttct gaaggatgtt ctgaaaaatc atgctgtttc 2761 aagtcacatt gcttccatgc tgtcaagcca agacctgaag atagtagtgg gagcacttca 2821 gatggcagaa attttaatgc agaagttacc tgatattttt agtgtttact tcagaagaga 2881 aggtgtaatg catcaagtaa aacacttagc agaatcagag tctttgttga caagtccacc 2941 aaaggcatgt acgaatggat cgggatccat gggatccaca acttcagtca gcagtgggac 3001 agccacagct gccactcatg ctgcagctga cttgggatca cccagcttgc agcacagcag 3061 ggatgattct ttagatctca gccctcaagg tcgattaagt gatgttctaa agagaaaacg 3121 actgccaaaa cgagggccaa gaaggccaaa gtactcacct ccaagagatg atgacaaagt 3181 agacaatcaa gctaaaagcc ccaccactac tcagtcacct aaatcttctt tcctggcaag 3241 cttgaatcca aaaacatggg gaaggttaag tacacagtcc aacagcaaca acattgagcc 3301 agcacggact gcgggaggta gtggccttgc cagggctgcc tcaaaggata ccatctccaa 3361 taatagagaa aaaattaaag gttggattaa ggagcaggca cataaatttg tagaacgtta 3421 tttcagttct gagaatatgg atggaagcaa ccctgcattg aatgtccttc agagactttg 3481 tgctgcaacc gaacaactca acctccaggt ggatggtgga gctgagtgcc ttgtagaaat 3541 ccgtagcata gtctcagagt cagatgtttc atcatttgaa atccaacata gtggatttgt 3601 gaagcagctg ttgctttatt tgacatctaa aagtgaaaag gatgctgtga gcagagagat 3661 cagattaaag cgatttcttc atgtattttt ttcttctcca cttcctggag aagagcccat 3721 tggaagagtg gaaccagtgg gtaatgcacc tttgttggca ttagttcaca agatgaacaa 3781 ctgcctcagc cagatggaac aatttccagt caaagtacat gatttcccta gtggaaatgg 3841 gacaggaggc agcttttctc tcaacagagg atcacaggct ttaaaatttt tcaacacaca 3901 tcaattaaaa tgccagttac aaaggcatcc agactgtgca aatgtgaagc agtggaaggg 3961 tggacctgtc aagattgacc ctctggcttt ggtacaagcc atcgagagat accttgtagt 4021 tagagggtat ggaagagtaa gagaagatga tgaagacagc gatgacgatg gatcagatga 4081 ggaaatagat gagtctctgg ctgctcagtt cctaaattca ggaaatgtaa gacacaggct 4141 gcagttttat attggagaac atttgctgcc gtataacatg actgtgtatc aggcagtacg 4201 gcagtttagt atacaggctg aagatgaaag agaatccaca gatgatgaga gcaatcctct 4261 aggcagagct ggtatttgga caaagactca tacaatatgg tataaacctg tgagagagga 4321 tgaagaaagt aataaagatt gtgttggtgg taaaagagga agagcccaaa cagctccaac 4381 gaaaacttcc cctagaaatg caaaaaagca tgatgagtta tggcacgatg gagtgtgccc 4441 atcagtatca aatcctttag aagtttacct cattcccaca ccacctgaaa atataacatt 4501 tgaagacccg tcattagatg tgatccttct tttaagagtt ttacatgcta tcagtcgata 4561 ctggtattac ttgtatgata atgcaatgtg caaggaaatt attccaacta gtgaatttat 4621 taacagtaag ttaacagcaa aagcaaatag gcaacttcaa gatcctttag taatcatgac 4681 aggaaacatc ccaacatggc ttactgagct aggaaaaacc tgcccatttt tctttccttt 4741 tgatacccgg caaatgcttt tttatgtaac tgcatttgat cgggaccgag caatgcaaag 4801 attacttgat accaacccag aaatcaacca gtctgattct caagatagca gagttgcacc 4861 tagattggat agaaaaaaac gtactgtgaa ccgagaggag ctgctgaaac aggcggagtc 4921 tgtgatgcag gacctcggca gctcacgggc catgttagaa atccagtatg aaaatgaggt 4981 tggtacaggt cttgggccta cactggagtt ttatgcgctt gtatctcagg aactacagag 5041 agctgacttg ggtctttgga gaggtgaaga agtaactctt agcaatccaa aagggagcca 5101 agaagggacc aagtatattc aaaacctcca gggcctgttt gcgcttccct ttggtaggac 5161 agcaaagcca gctcatatcg caaaggttaa gatgaagttt cgcttcttag gaaaattaat 5221 ggccaaggct atcatggatt tcagattggt ggaccttccc cttggcttac ccttttataa 5281 atggatgcta cggcaagaaa cttcactgac atcacacgat ttgtttgaca tcgacccagt 5341 tgtagccaga tcagtttatc acctagaaga cattgtcaga cagaagaaaa gacttgaaca 5401 agataaatcc cagaccaaag agagtctaca gtatgcatta gaaaccttga ctatgaatgg 5461 ctgctcagtt gaagatctag gactggattt cactctgcca gggtttccca atatcgaact 5521 gaagaaagga gggaaggata taccagtcac tatccacaat ttagaggagt atctaagact 5581 ggttatattc tgggcactaa atgaaggcgt ttctaggcaa tttgattcgt tcagagatgg 5641 atttgaatca gtcttcccac tcagtcatct tcagtacttc tacccggagg aactggatca 5701 gctcctttgt ggcagtaaag cagacacttg ggatgcaaag acactgatgg aatgctgtag 5761 gcctgatcat ggttatactc atgacagtcg ggctgtgaag tttttgtttg agattctcag 5821 tagttttgat aatgagcagc agaggttatt tctccagttt gtgactggta gcccaagatt 5881 gcctgttgga ggattccgga gtttgaatcc acctttgaca attgtccgaa agacgtttga 5941 atcaacagaa aacccagatg acttcttgcc ctctgtaatg acttgtgtga actatcttaa 6001 gttgccggac tattcaagca ttgagataat gcgtgaaaaa ctgttgatag cagcaagaga 6061 agggcagcag tcgttccatc tttcctgatt atagcaagaa atgcagtgtc tgcctgttac 6121 agcaaaagaa acaaatcatg atttcttttc taatgttatc acctgagtca aggaaacatg 6181 ttacgccttc ttgttgtagg aaaaacggct tgcagattat aaagagacat ttggttgata 6241 ttcattaatg gccccatgga cttaaagtga tcaggcccta aaacgttgtt gtgatgaggt 6301 ttctttagca agttcttgtt taaattatca tttatttgat gagtgaagtt tttaacatgc 6361 tttgctgtgt gaaatttaaa aaagggatgt ttttccaggc tggaacaata aatgtggctg 6421 tgcagttt // LOCUS HUMKG1EE 5319 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0099 gene, complete cds. ACCESSION D43951 NID g603956 KEYWORDS KIAA0099. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5319) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (16-DEC-1994) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 5319) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayashi,Y., Sato,S., Nagase,T., Seki,T., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. III. The coding sequences of 40 new genes (KIAA 0081 - KIAA 0120) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG1 JOURNAL Unpublished (1995) REFERENCE 3 (sites) AUTHORS Nagase,T., Miyajima,N., Tanaka,A., Sazuka,T., Seki,N., Sato,S., Tabata,S., Ishikawa,K.-i., Kawarabayasi,Y., Kotani,H. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. III. The coding sequences of 40 new genes (KIAA0081-KIAA0120) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (1), 37-43 (1995) MEDLINE 95308325 FEATURES Location/Qualifiers source 1..5319 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..56 gene 57..3617 /gene="KIAA0099" CDS 57..3617 /gene="KIAA0099" /note="KIAA0099 is related to D.melanogaster pumilio gene." /citation=[3] /codon_start=1 /db_xref="PID:d1008481" /db_xref="PID:g603957" /translation="MSVACVLKRKAVLWQDSFSPHLKHHPQEPANPNMPVVLTSGTGS QAQPQPAANQALAAGTHSSPVPGSIGVAGRSQDDAMVDYFFQRQHGEQLGGGGSGGGG YNNSKHRWPTGDNIHAEHQVRSMDELNHDFQALALEGRAMGEQLLPGKKFWETDESSK DGPKGIFLGDQWRDSAWGTSDHSVSQPIMVQRRPGQSFHVNSEVNSVLSPRSESGGLG VSMVEYVLSSSPGDSCLRKGGFGPRDADSDENDKGEKKNKGTFDGDKLGDLKEEGDVM DKTNGLPVQNGIDADVKDFSRTPGNCQNSANEVDLLGPNQNGSEGLAQLTSTNGAKPV EDFSNMESQSVPLDPMEHVGMEPLQFDYSGTQVPVDSAAATVGLFDYNSQQQLFQRPN ALAVQQLTAAQQQQYALAAAHQPHIGLAPAAFVPNPYIISAAPPGTDPYTAGLAAAAT LGPAVVPHQYYGVTPWGVYPASLFQQQAAAAAAATNSANQQTTPQAQQGQQQVLRGGA SQRPLTPNQNQQGQQTDPLVAAAAVNSALAFGQGLAAGMPGYPVLAPAAYYDQTGALV VNAGARNGLGAPVRLVAPAPVIISSSAAQAAVAAAAASANGAAGGLAGTTNGPFRPLG TQQPQPQPQQQPNNNLASSSFYGNNSLNSNSQSSSLFSQGSAQPANTSLGFGSSSSLG ATLGSALGGFGTAVANSNTGSGSRRDSLTGSSDLYKRTSSSLTPIGHSFYNGLSFSSS PGPVGMPLPSQGPGHSQTPPPSLSSHGSSSSLNLGGLTNGSGRYISAAPGAEAKYRSA SSASSLFSPSSTLFSSSRLRYGMSDVMPSGRSRLLEDFRNNRYPNLQLREIAGHIMEF SQDQHGSRFIQLKLERATPAERQLVFNEILQAAYQLMVDVFGNYVIQKFFEFGSLEQK LALAERIRGHVLSLALQMYGCRVIQKALEFIPSDQQNEMVRELDGHVLKCVKDQNGNH VVQKCIECVQPQSLQFIIDAFKGQVFALSTHPYGCRVIQRILEHCLPDQTLPILEELH QHTEQLVQDQYGNYVIQHVLEHGRPEDKSKIVAEIRGNVLVLSQHKFASNVVEKCVTH ASRTERAVLIDEVCTMNDGPHSALYTMMKDQYANYVVQKMIDVAEPGQRKIVMHKIRP HIATLRKYTYGKHILAKLEKYYMKNGVDLGPICGPPNGII" 3'UTR 3618..5319 BASE COUNT 1436 a 1224 c 1235 g 1424 t ORIGIN 1 gaagatcggg gggctgaaat ccatcttcat cctaccgctc cgcccgtgtt ggtggaatga 61 gcgttgcatg tgtcttgaag agaaaagcag tgctttggca ggactctttc agcccccacc 121 tgaaacatca ccctcaagaa ccagctaatc ccaacatgcc tgttgttttg acatctggaa 181 cagggtcgca agcgcagcca caaccagctg caaatcaggc tcttgcagct gggactcact 241 ccagccctgt cccaggatct ataggagttg caggccgttc ccaggacgac gctatggtgg 301 actacttctt tcagaggcag catggtgagc agcttggggg aggaggaagt ggaggaggcg 361 gctataataa tagcaaacat cgatggccta ctggggataa cattcatgca gaacatcagg 421 tgcgttccat ggatgaactg aatcatgatt ttcaagcact tgctctggag ggaagagcga 481 tgggagagca gctcttgcca ggtaaaaagt tttgggaaac agatgaatcc agcaaagatg 541 gaccaaaagg aatattcctg ggtgatcaat ggcgagacag tgcctgggga acatcagatc 601 attcagtttc ccagccaatc atggtgcaga gaagacctgg tcagagtttc catgtgaaca 661 gtgaggtcaa ttctgtactg tccccacgat cggagagtgg gggactaggc gttagcatgg 721 tggagtatgt gttgagctca tccccgggcg attcctgtct aagaaaagga ggatttggcc 781 caagggatgc agacagtgat gaaaacgaca aaggtgaaaa gaagaacaag ggtacgtttg 841 atggagataa gctaggagat ttgaaggagg agggtgatgt gatggacaag accaatggtt 901 taccagtgca gaatgggatt gatgcagacg tcaaagattt tagccgtacc cctggtaatt 961 gccagaactc tgctaatgaa gtggatcttc tgggtccaaa ccagaatggt tctgagggct 1021 tagcccagct gaccagcacc aatggtgcca agcctgtgga ggatttctcc aacatggagt 1081 cccagagtgt ccccttggac cccatggaac atgtgggcat ggagcctctt cagtttgatt 1141 attcaggcac gcaggtacct gtggactcag cagcagcaac tgtgggactt tttgactaca 1201 attctcaaca acagctgttc caaagaccta atgcgcttgc tgtccagcag ttgacagctg 1261 ctcagcagca gcagtatgca ctggcagctg ctcatcagcc gcacatcggt ttagctcccg 1321 ctgcgtttgt ccccaatcca tacatcatca gcgctgctcc cccagggacg gacccctaca 1381 cagctggatt ggctgcagca gcgacactag gcccagctgt ggtccctcac cagtattatg 1441 gagttactcc ctggggagtc taccctgcca gtcttttcca gcagcaagct gccgctgccg 1501 ctgcagcaac taattcagct aatcaacaga ccaccccaca ggctcagcaa ggacagcagc 1561 aggttctccg tggaggagcc agccaacgtc ctttgacccc aaaccagaac cagcagggac 1621 agcaaacgga tccccttgtg gcagctgcag cagtgaattc tgcccttgca tttggacaag 1681 gtctggcagc aggcatgcca ggttatccgg tgttggctcc tgctgcttac tatgaccaaa 1741 ctggtgccct tgtagtgaat gcaggcgcga gaaatggtct tggagctcct gttcgacttg 1801 tagctcctgc cccagtcatc attagttcct cagctgcaca agcagctgtt gcagcagccg 1861 cagcttcagc aaatggagca gctggtggtc ttgctggaac aacaaatgga ccatttcgcc 1921 ctttaggaac acagcagcct cagccccagc cccagcagca gcccaataac aacctggcat 1981 ccagttcttt ctacggcaac aactctctga acagcaattc acagagcagc tccctcttct 2041 cccagggctc tgcccagcct gccaacacat ccttgggatt cggaagtagc agttctctcg 2101 gcgccaccct gggatccgcc cttggagggt ttggaacagc agttgcaaac tccaacactg 2161 gcagtggctc ccgccgtgac tccctgactg gcagcagtga cctttataag aggacatcga 2221 gcagcttgac ccccattgga cacagttttt ataacggcct tagcttttcc tcctctcctg 2281 gacccgtggg catgcctctc cctagtcagg gaccaggaca ttcacagaca ccacctcctt 2341 ccctctcttc acatggatcc tcttcaagct taaacctggg aggactcacg aatggcagtg 2401 gaagatacat ctctgctgct ccaggcgctg aagccaagta ccgcagtgca agcagcgcct 2461 ccagcctctt cagcccgagc agcactcttt tctcttcctc tcgtttgcga tatggaatgt 2521 ctgatgtcat gccttctggc aggagcaggc ttttggaaga ttttcgaaac aaccggtacc 2581 ccaatttaca actgcgggag attgctggac atataatgga attttcccaa gaccagcatg 2641 ggtccagatt cattcagctg aaactggagc gtgccacacc agctgagcgc cagcttgtct 2701 tcaatgaaat cctccaggct gcctaccaac tcatggtgga tgtgtttggt aattacgtca 2761 ttcagaagtt ctttgaattt ggcagtcttg aacagaagct ggctttggca gaacggattc 2821 gaggccacgt cctgtcattg gcactacaga tgtatggctg ccgtgttatc cagaaagctc 2881 ttgagtttat tccttcagac cagcagaatg agatggttcg ggaactagat ggccatgtct 2941 tgaagtgtgt gaaagatcag aatggcaatc acgtggttca gaaatgcatt gaatgtgtac 3001 agccccagtc tttgcaattt atcatcgatg cgtttaaggg acaggtattt gccttatcca 3061 cacatcctta tggctgccga gtgattcaga gaatcctgga gcactgtctc cctgaccaga 3121 cactccctat tttagaggag cttcaccagc acacagagca gcttgtacag gatcaatatg 3181 gaaattatgt aatccaacat gtactggagc acggtcgtcc tgaggataaa agcaaaattg 3241 tagcagaaat ccgaggcaat gtacttgtat tgagtcagca caaatttgca agcaatgttg 3301 tggagaagtg tgttactcac gcctcacgta cggagcgcgc tgtgctcatc gatgaggtgt 3361 gcaccatgaa cgacggtccc cacagtgcct tatacaccat gatgaaggac cagtatgcca 3421 actacgtggt ccagaagatg attgacgtgg cggagccagg ccagcggaag atcgtcatgc 3481 ataagatccg gccccacatc gcaactcttc gtaagtacac ctatggcaag cacattctgg 3541 ccaagctgga gaagtactac atgaagaacg gtgttgactt agggcccatc tgtggccccc 3601 ctaatggtat catctgaggc agtgtcaccc gctgttccct cattcccgct gacctcactg 3661 gcccactggc aaatccaacc agcaaccaga aatgttctag tgtagagtct gagacgggca 3721 agtggttgct ccaggattac tccctcctcc aaaaaaggaa tcaaatccac gagtggaaaa 3781 gcctttgtaa atttaatttt attacacata acatgtacta ttttttttaa ttgactaatt 3841 gccctgctgt tttactggtg tataggatac ttgtacatag gtaaccaatg tacatgggag 3901 gccacatatt ttgttcactg ttgtatctat atttcacatg tggaaacttt cagggtggtt 3961 ggtttaacaa aaaaaaaaag ctttaaaaaa aaaagaaaaa aaggaaaagg tttttagctc 4021 atttgcctgg ccggcaagtt ttgcaaatag ctcttcccca cctcctcatt ttagtaaaaa 4081 acaaacaaaa acaaaaaaac ctgagaagtt tgaattgtag ttaaatgacc ccaaactggc 4141 atttaacact gtttataaaa aatatatata tatatatata tatataatga aaaaggtttc 4201 agagttgcta aagcttcagt ttgtgacatt aagtttatga aattctaaaa aatgcctttt 4261 ttggagacta tattatgctg aagaaggctg ttcgtgagga ggagatgcga gcacccagaa 4321 cgtcttttga ggctgggcgg gtgtgattgt ttactgccta ctggattttt ttctattaac 4381 attgaaaggt aaaatctgat tatttagcat gagaaaaaaa atccaactct gcttttggtc 4441 ttgcttctat aaatatatag tgtatacttg gtgtagactt tgcatatata caaatttgta 4501 gtattttctt gttttgatgt ctaatctgta tctataatgt accctagtag tcgaacatac 4561 ttttgattgt acaattgtac atttgtatac ctgtaatgta aatgtggaga agtttgaatc 4621 aacataaaca cgttttttgg taagaaaaga gaattagcca gccctgtgca ttcagtgtat 4681 attctcacct tttatggtcg tagcatatag tgttgtatat tgtaaattgt aatttcaacc 4741 agaagtaaat ttttttgttt tgaaggaata aatgttcttt atacagccta gttaatgttt 4801 aaaaagaaaa aaatagcttg gttttatttg tcatctagtc tcaagtatag cgagattctt 4861 tctaaatgtt attcaagatt gagttctcac tagtgttttt ttaatcctaa aaaagtaatg 4921 ttttgatttt gtgacagtca aaaggacgtg caaaagtcta gccttgcccg agctttcctt 4981 acaatcagag cccctctcac cttgtaaagt gtgaatcgcc cttccctttt gtacagaaga 5041 tgaactgtat tttgcatttt gtctacttgt aagtgaatgt aacatactgt caattttcct 5101 tgtttgaata tagaattgta acactacacg gtgtacattt ccagagcctt gtgtatattt 5161 ccaatgaact tttttgcaag cacacttgta accatatgtg tataattaac aaacctgtgt 5221 atgcttatgc ctgggcaact attttttgta actcttgtgt agattgtctc taaacaatgt 5281 gtgatcttta ttttgaaaaa tacagaactt tggaatctg // LOCUS HUMKGF 3853 bp mRNA PRI 07-MAR-1995 DEFINITION Human keratinocyte growth factor mRNA, complete cds. ACCESSION M60828 M25295 NID g186738 KEYWORDS fibroblast growth factor; keratinocyte growth factor; mitogen. SOURCE Human embryonic lung fibroblast cell line M426, cDNA to mRNA, clones pCEV9-[32,49]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3853) AUTHORS Finch,P.W., Rubin,J.S., Miki,T., Ron,D. and Aaronson,S.A. TITLE Human KGF is FGF-related with properties of a paracrine effector of epithelial cell growth JOURNAL Science 245 (4919), 752-755 (1989) MEDLINE 89368897 FEATURES Location/Qualifiers source 1..3853 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pCEV9-32" /cell_line="M426" /cell_type="fibroblast" /dev_stage="embryo" /tissue_type="lung" /tissue_lib="pCEV9" /map="Unassigned" gene 446..1030 /gene="FGF7" CDS 446..1030 /gene="FGF7" /codon_start=1 /db_xref="GDB:G00-131-444" /product="keratinocyte growth factor" /db_xref="PID:g186739" /translation="MHKWILTWILPTLLYRSCFHIICLVGTISLACNDMTPEQMATNV NCSSPERHTRSYDYMEGGDIRVRRLFCRTQWYLRIDKRGKVKGTQEMKNNYNIMEIRT VAVGIVAIKGVESEFYLAMNKEGKLYAKKECNEDCNFKELILENHYNTYASAKWTHNG GEMFVALNQKGIPVRGKKTKKEQKTAHFLPMAIT" BASE COUNT 1373 a 603 c 658 g 1219 t ORIGIN 1 acgcgctcac acacagagag aaaatccttc tgcctgttga tttatggaaa caattatgat 61 tctgctggag aacttttcag ctgagaaata gtttgtagct acagtagaaa ggctcaagtt 121 gcaccaggca gacaacagac atggaattct tatatatcca gctgttagca acaaaacaaa 181 agtcaaatag caaacagcgt cacagcaact gaacttacta cgaactgttt ttatgaggat 241 ttatcaacag agttatttaa ggaggaatcc tgtgttgtta tcaggaacta aaaggataag 301 gctaacaatt tggaaagagc aagtactctt tcttaaatca atctacaatt cacagatagg 361 aagaggtcaa tgacctagga gtaacaatca actcaagatt cattttcatt atgttattca 421 tgaacacccg gagcactaca ctataatgca caaatggata ctgacatgga tcctgccaac 481 tttgctctac agatcatgct ttcacattat ctgtctagtg ggtactatat ctttagcttg 541 caatgacatg actccagagc aaatggctac aaatgtgaac tgttccagcc ctgagcgaca 601 cacaagaagt tatgattaca tggaaggagg ggatataaga gtgagaagac tcttctgtcg 661 aacacagtgg tacctgagga tcgataaaag aggcaaagta aaagggaccc aagagatgaa 721 gaataattac aatatcatgg aaatcaggac agtggcagtt ggaattgtgg caatcaaagg 781 ggtggaaagt gaattctatc ttgcaatgaa caaggaagga aaactctatg caaagaaaga 841 atgcaatgaa gattgtaact tcaaagaact aattctggaa aaccattaca acacatatgc 901 atcagctaaa tggacacaca acggagggga aatgtttgtt gccttaaatc aaaaggggat 961 tcctgtaaga ggaaaaaaaa cgaagaaaga acaaaaaaca gcccactttc ttcctatggc 1021 aataacttaa ttgcatatgg tatataaaga acccagttcc agcagggaga tttctttaag 1081 tggactgttt tctttcttct caaaattttc tttcctttta ttttttagta atcaagaaag 1141 gctggaaaaa ctactgaaaa actgatcaag ctggacttgt gcatttatgt ttgttttaag 1201 acactgcatt aaagaaagat ttgaaaagta tacacaaaaa tcagatttag taactaaagg 1261 ttgtaaaaaa ttgtaaaact ggttgtacaa tcatgatgtt agtaacagta atttttttct 1321 taaattaatt tacccttaag agtatgttag atttgattat ctgataatga ttatttaaat 1381 attcctatct gcttataaaa tggctgctat aataataata atacagatgt tgttatataa 1441 ggtatatcag acctacaggc ttctggcagg atttgtcaga taatcaagcc acactaacta 1501 tggaaaatga gcagcatttt aaatgctttc tagtgaaaaa ttataatcta cttaaactct 1561 aatcagaaaa aaaattctca aaaaaactat tatgaaagtc aataaaatag ataatttaac 1621 aaaagtacag gattagaaca tgcttatacc tataaataag aacaaaattt ctaatgctgc 1681 tcaagtggaa agggtattgc taaaaggatg tttccaaaaa tcttgtatat aagatagcaa 1741 cagtgattga tgataatact gtacttcatc ttacttgcca caaaataaca ttttataaat 1801 cctcaaagta aaattgagaa atctttaagt ttttttcaag taacataatc tatctttgta 1861 taattcatat ttgggaatat ggcttttaat aatgttcttc ccacaaataa tcatgctttt 1921 ttcctatggt tacagcatta aactctattt taagttgttt ttgaacttta ttgttttgtt 1981 atttaagttt atgttattta taaaaaaaaa accttaataa gctgtatctg tttcatatgc 2041 ttttaatttt aaaggaataa caaaactgtc tggctcaacg gcaagtttcc ctcccttttc 2101 tgactgacac taagtctagc acacagcact tgggccagca aatcctggaa gcagacaaaa 2161 ataagagcct gaagcaatgc ttacaataga tgtctcacac agaacaatac aaatatgtaa 2221 aaactctttc accacatatt cttgccaatt aattggatca tataagtaaa atcattacaa 2281 atataagtat ttacaggatt ttaaagttag aatatatttg aatgcatggg tagaaaatat 2341 catattttaa aactatgtat atttaaattt agtaattttc taatctctag aaatctctgc 2401 tgttcaaaag gtggcagcac tgaaagttgt tttcctgtta gatggcaaga gcacaatgcc 2461 caaaatagaa gatgcagtta agaataaggg gccctgaatg tcatgaaggc ttgaggtcag 2521 cctacagata acaggattat tacaaggatg aatttccact tcaaaagtct ttcattggca 2581 gatcttggta gcactttata tgttcaccaa tgggaggtca atatttatct aatttaaaag 2641 gtatgctaac cactgtggtt ttaatttcaa aatatttgtc attcaagtcc ctttacataa 2701 atagtatttg gtaatacatt tatagatgag agttatatga aaaggctagg tcaacaaaaa 2761 caatagattc atttaatttt cctgtggttg acctatacga ccaggatgta gaaaactaga 2821 aagaactgcc cttcctcaga tatactcttg ggagagagca tgaatggtat tctgaactat 2881 cacctgattc aaggactttg ctagctaggt tttgaggtca ggcttcagta actgtagtct 2941 tgtgagcata ttgagggcag aggaggactt agtttttcat atgtgtttcc ttagtgccta 3001 gcagactatc tgttcataat cagttttcag tgtgaattca ctgaatgttt atagacaaaa 3061 gaaaatacac actaaaacta atcttcattt taaaagggta aaacatgact atacagaaat 3121 ttaaatagaa atagtgtata tacatataaa atacaagcta tgttaggacc aaatgctctt 3181 tgtctatgga gttatacttc catcaaatta catagcaatg ctgaattagg caaaaccaac 3241 atttagtggt aaatccattc ctggtagtat aagtcaccta aaaaagactt ctagaaatat 3301 gtactttaat tatttgtttt tctcctattt ttaaatttat tatgcaaatt ttagaaaata 3361 aaatttgctc tagttacaca cctttagaat tctagaatat taaaactgta aggggcctcc 3421 atccctctta ctcatttgta gtctaggaaa ttgagatttt gatacaccta aggtcacgca 3481 gctgggtaga tatacagctg tcacaagagt ctagatcagt tagcacatgc tttctactct 3541 tcgattatta gtattattag ctaatggtct ttggcatgtt tttgtttttt atttctgttg 3601 agatatagcc tttacatttg tacacaaatg tgactatgtc ttggcaatgc acttcataca 3661 caatgactaa tctatactgt gatgatttga ctcaaaagga gaaaagaaat tatgtagttt 3721 tcaattctga ttcctattca ccttttgttt atgaatggaa agctttgtgc aaaatataca 3781 tataagcaga gtaagccttt taaaaatgtt ctttgaaaga taaaattaaa tacatgagtt 3841 tctaacaatt aga // LOCUS HUMKIAAB 4283 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0087 gene, complete cds. ACCESSION D42038 NID g577288 KEYWORDS KIAA0087. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4283) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (07-NOV-1994) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 4283) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayashi,Y., Sato,S., Nagase,T., Seki,T., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. III. The coding sequences of 40 new genes (KIAA 0081 - KIAA 0120) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG1 JOURNAL Unpublished (1995) REFERENCE 3 (sites) AUTHORS Nagase,T., Miyajima,N., Tanaka,A., Sazuka,T., Seki,N., Sato,S., Tabata,S., Ishikawa,K.-i., Kawarabayasi,Y., Kotani,H. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. III. The coding sequences of 40 new genes (KIAA0081-KIAA0120) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (1), 37-43 (1995) MEDLINE 95308325 FEATURES Location/Qualifiers source 1..4283 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..234 gene 235..651 /gene="KIAA0087" CDS 235..651 /gene="KIAA0087" /note="The ha1002 gene product is novel." /citation=[3] /codon_start=1 /db_xref="PID:d1008221" /db_xref="PID:g577289" /translation="MEAWESSQPLLRCEIPCPLPGTDRDGSVSLPGEAASCDLDTLEP EHGNRRVSGNPISVCWAYKVTKVKCWSVRERGGRHIGGPRSTLKHPAHHGMGKNLATS LPTAASLGLGKGQLLVSIRFMDTTKKRGQSETFNIC" 3'UTR 652..4283 BASE COUNT 1250 a 883 c 897 g 1253 t ORIGIN 1 ctcagctcct tttcagtaat ttcagttcta ttttcttact ctatcattct ggtgttttca 61 ttgcattttc ttataaaaga gatccagatt tattttggaa atagatttga atgacgaaaa 121 catcctatag aagcaaatcc taaacaaata taaatatttg accatgatta tgcaaagaaa 181 gctcctgcct cctcctctgt gtcacagaga ccctcgggag gaactccagg gagaatggaa 241 gcttgggagt catcccaacc cttgctcagg tgcgaaatcc cctgccctct gcctggcact 301 gacagggacg gatcagtttc tctgcctgga gaggcagcct cctgtgacct ggatacattg 361 gagcctgaac acgggaacag aagggtctct gggaatccaa tctcagtctg ttgggcttac 421 aaagtgacca aagtcaaatg ctggtctgtg agagaaagag gaggcagaca catcggggga 481 ccgagaagta ctttgaagca ccccgctcac catggtatgg ggaagaattt ggccacatcc 541 ctgccaactg ctgcttctct tggactggga aagggtcagt tgcttgtttc tatcagattc 601 atggacacca ccaagaaaag aggccagtct gagacattca atatttgttg aaacacagag 661 gtttaactga tcattgaggt acacagggtg ataggaagag tatgaacttg agaatctgat 721 ggattttgtt taaagttagg actaggctac tgactaccca ccctgctaat tgtcttgggg 781 aagttattta acctctctga acctcagctt ctcatccaca aaataggaca ctagcacaca 841 gtgggctatc tatatatatt tattgaattg aaatgggagt attcaaaatt atttgggttg 901 ttggcttaaa ataagcagaa taattcaatt cctgttcatc tggaatctaa ttgtccaatt 961 tggagtttca gatgtaacaa agatcctaat gacctaatat tattggggta atcctaaaat 1021 ttattaggtg atctgcaagt tccatgatga ttgtatactg gaggaaatgc acaatgcaca 1081 cagaccttcc taggagtaaa agactgatgt gaatttgcag cactacttaa ttataagtaa 1141 ttgttgcatg aacacacaca caaacacaca cccacagaac tcacactata agataggggg 1201 gcaaataatg acatttgaca aaaatgagct cttgcagagc tcaacgagtc taagtttcct 1261 ccctaatatc ttaacaggct acagtataca caggctttta agtaatgtac ttgttatcca 1321 aacactagca gacatttaac tggaatccag gagcattttg tcaagaagga aaagaactaa 1381 ccaaagaaaa agaaaagtag agagtttccc agctgtttta ccacatcaca acttcaaaaa 1441 aaccaagcct gtccttgcct tgtccctgat ggctctgcca gcagagtgaa tgaattggag 1501 cagatttagg ttttaaaatg cctcctaata tcatttgggc agattagctt acatggctgg 1561 acattgtgct cccagccaca gggaatgttc ttttcctgtt ggaagctctg acagccactg 1621 ccacggtgaa atatcaggag cctcctaaca tttttcttga tttctttaag agtgaacatt 1681 agacatatta aaaacaacaa cccaatcatt aagccctgtg aggttttcct aggagccagc 1741 tctatcgctt taatcagatt acggacaact cattctcatc ccagccattt atcttctttg 1801 tttacacaca caccaagtgg cattcttctt agcatcaaaa aatagtaatc aaaattgaga 1861 acaacaaaag acctttcatt atttacagtg ttaagtaaca tctatttgcc ttcattagtt 1921 tctaaacaac gaagtacact ggttctgatt aaagctggca tgtgttaatt atatccgagg 1981 tctgggattg ttttgtctac aataaaggat tgctgaaagc tgaaatttga ggcgcccagg 2041 ccccagaagt cacctgcctg tatgtgtttc ctcagtgaga gtttccagcc tgagaatggc 2101 agaagcggat gtgagcttgt cacagaaagt tttttttgct ttgttttgtt ttgtttggtt 2161 tttaagccct cggttttatt gcccatttta tgggctggag ttaaatggaa gtacaatgca 2221 gtttgctctc ttactgctca gcactaacag ccatgatgag ccataattag aagtaaaacc 2281 accacataaa taacatggta gaaaaattgc cacagtgaga agccagactt tggcgacggg 2341 aaacttcagc gggtcctgag tatccatcat acttaagggg caccgcgtat tttagcccat 2401 atttgtctta ctaaagccaa aatgggagcc atttagactt tcgaacaagc ttttaaaaat 2461 agagaatgta ggcttaaaca gactgcctgc tgctaagtgt gatgaagaaa cctgtcacct 2521 ggaggctacg acagctggaa gggctgggag ggagagccct gggagcccag gggttaggaa 2581 gggaagataa acaggagagg aagaggaggc ctggactctc cacaacagca aggtctagac 2641 atttcactgc aaagcctaag agtctgcaca ccgttgagca cagcagctta gacttccgcc 2701 ggggaacgtg tgaggctggg acgccttgca gattttgctt tcatgacctg tactcagcga 2761 tgaagtggga gcaggtagac tagttttttt tttttttttt aatggagtct cattctgtca 2821 cccaggctgg agtgcagtgg cgtgatctcg gttcactgca acctctgcca cccatgttca 2881 agcgattctc ctgcctcagc cccctgagta gctgggatta caggtgcctg ccactgcgtc 2941 tggctaattt ttgtattttt agtagagatg gggtttcacc atcttggcca ggctggtctt 3001 gaactcctga ccttgtggtc cacccgcctc ggcctcccaa aatgctggga ttacatgcat 3061 gagccacgcg cccagcctag actagcattt ttgagtaccc actatctcct tggcatttta 3121 cttccatttt aattctcata acagccttag gacgccaatg aaaaggaaga gaaaataaag 3181 tgtgttctgt gcctgaaaat ggccttaaag aaacaaggac caacagctgg agcagagtgg 3241 ggctgcccat acgcaaagag acaacaagta ggaattaaca taaggcatat cacgtggatg 3301 ccatagaact atcacagcaa tttctttttt ccccaagtca gcctctcttg tacaggggat 3361 tagaaaacaa acccaagaga ataaacttga aatggaagga acgtgctttg tagtcgagtg 3421 tttttcaaag cctgcatcat tttaaaaact gatttaagta tattcatttg ctttctaaaa 3481 taagtcatat gtatgtttct actgatcggc cagaatagtt aaggtctgtc tgatcttcat 3541 atggaatgaa aggggacatt ctttggatat tctggaagtc cacacgtgtt ctttacagag 3601 tttggtttgc tttctccgtt tcttccttcc tttgcaaaat caaacaaaca ggaataaaaa 3661 ccaaaccaaa acaaaattaa tgaaaaccct tgtgttgagg actttgtttt gttttaaatg 3721 tgtctggtat taagcagtat gtttagtgaa atgactttaa acttaagtag gaaatacagc 3781 tccatttcac acgtggattt tgtctgcatt tacatctctg caggcctcac atgccgtgac 3841 accagacatt tttttaatta agatatgaag tgctcatttt ttttttgtgg aactgctgta 3901 ttctaacaaa tttactacac gtctcagaaa acacatagct ttgccaaagt tggtgagtag 3961 cccagcatgg tttatctcca gcccacagct ctttccctga attcaatact gcaggcacaa 4021 tttaatgcaa acccaattta tgacttaatg ctctagggac cgtagtggga gattgttaca 4081 ttattctttt ctaatcctat ttgtgtttta atttgctaag tctgtaatga ggtgcatgcc 4141 ttgaatattc aagatctttc acattgcttt tgactttgtt taatataata gattttacag 4201 tttgcattaa atacccaggt gcttcggtca gtattatctc tttattttct caactagagt 4261 gaataaaatt acacgatgaa agt // LOCUS HUMKIAAI 4468 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0086 gene, complete cds. ACCESSION D42045 NID g577302 KEYWORDS KIAA0086. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4468) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (07-NOV-1994) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 4468) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayashi,Y., Sato,S., Nagase,T., Seki,T., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. III. The coding sequences of 40 new genes (KIAA 0081 - KIAA 0120) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG1 JOURNAL Unpublished (1995) REFERENCE 3 (sites) AUTHORS Nagase,T., Miyajima,N., Tanaka,A., Sazuka,T., Seki,N., Sato,S., Tabata,S., Ishikawa,K.-i., Kawarabayasi,Y., Kotani,H. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. III. The coding sequences of 40 new genes (KIAA0081-KIAA0120) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (1), 37-43 (1995) MEDLINE 95308325 FEATURES Location/Qualifiers source 1..4468 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..918 gene 919..4041 /gene="KIAA0086" CDS 919..4041 /gene="KIAA0086" /note="The ha3611 gene product is related to S.cerevisiae SNM1 protein." /citation=[3] /codon_start=1 /db_xref="PID:d1008228" /db_xref="PID:g577303" /translation="MLEDISEEDIWEYKSKRKPKRVDPNNGSKNILKSVEKATDGKYQ SKRSRNRKRAAEAKEVKDHEVPLGNAGCQTSVASSQNSSCGDGIQQTQDKETTPGKLC RTQKSQHVSPKIRPVYDGYCPNCQMPFSSLIGQTPRWHVFECLDSPPRSETECPDGLL CTSTIPFHYKRYTHFLLAQSRAGDHPFSSPSPASGGSFSETKSGVLCSLEERWSSYQN QTDNSVSNDPLLMTQYFKKSPSLTEASEKISTHIQTSQQALQFTDFVENDKLVGVALR LANNSEHINLPLPENDFSDCEISYSPLQSDEDTHDIDEKPDDSQEQLFFTESSKDGSL EEDDDSCGFFKKRHGPLLKDQDESCPKVNSFLTRDKYDEGLYRFNSLNDLSQPISQNN ESTLPYDLACTGGDFVLFPPALAGKLAASVHQATKAKPDEPEFHSAQSNKQKQVIEES SVYNQVSLPLVKSLMLKPFESQVEGYLSSQPTQNTIRKLSSENLNAKNNTNSACFCRK ALEGVPVGKATILNTENLSSTPAPKYLKILPSGLKYNARHPSTKVMKQMDIGVYFGLP PKRKEEKLLGESALEGINLNPVPSPNQKRSSQCKRKAEKSLSDLEFDASTLHESQLSV ELSSERSQRQKKRCRKSNSLQEGACQKRSDHLINTESEAVNLSKVKVFTKSAHGGLQR GNKKIPESSNVGGSRKKTCPFYKKIPGTGFTVDAFQYGVVEGCTAYFLTHFHSDHYAG LSKHFTFPVYCSEITGNLLKNKLHVQEQYIHPLPLDTECIVNGVKVVLLDANHCPGAV MILFYLPNGTVILHTGDFRADPSMERSLLADQKVHMLYLDTTYCSPEYTFPSQQEVIR FAINTAFEAVTLNPHALVVCGTYSIGKEKVFLAIADVLGSKVGMSQEKYKTLQCLNIP EINSLITTDMCSSLVHLLPMMQINFKGLQSHLKKCGGKYNQILAFRPTGWTHSNKFTR IADVIPQTKGNISIYGIPYSEHSSYLEMKRFVQWLKPQKIIPTVNVGTWKSRSTMEKY FREWKLEAGY" 3'UTR 4042..4468 BASE COUNT 1389 a 844 c 925 g 1310 t ORIGIN 1 ggactctgag ggctttttgg agctcgctat gcttaacctg gagatgatta aggccccgct 61 tcctggcctc ccagcctcta atgccaaaag ataagggaga ggctggcgtg tgaccccgtt 121 ttgagtcagg tggacagagg gctggccacc ttcggaacca tgggtgcaat acggagtcag 181 acctcaatac aagcccactc tttcacatat ttgaactttt ttcacatatc aacttttttt 241 gttcactgtg cagggattgt tcattgctgc tggaggaaga tcatggactg tcgcgggaaa 301 ctgaagtggt tgagtatcca ctagtcgtgg atgagggcag tgacttcgca gttttttgcg 361 aattacacat ctctttgatt atgttgtgac tagttttgtt agatagtcat ttagtgtttg 421 ggatacctgt taagcccttt gtccagggac tgtggttgga tttatgaatt atttggacgg 481 ttgtccactt gaaagaactg acagtagctt cataacaatg ttacaaatct cgttctaaga 541 ttaagctgtt gaacctatat ttgccattag cgcttaattt ttgaagtatt atttttatga 601 atcaagccct ggaaaaggac aagatatttg aatgaaatag cacccataat ggagaacttc 661 acagttgcta ctcctgtgat aggtttatct tagtttcatt gtggtataaa tggaatagca 721 ggtgttgtca ggtacaaggt ttgtagcttg ccaatatgtt cattaccaac actgcagatt 781 cccattgagt tggtgggggt tttgttacct ttgttttttt tctcagcaaa ataattctat 841 aactttttgt ttgtgacaag aaatggactt tcagtttact taagattaat acttcttgaa 901 tgataaaatc attttgccat gttagaagac atttccgaag aagacatttg ggaatacaaa 961 tctaaaagaa aaccaaaacg agttgatcca aataatggct ctaaaaatat tctaaaatct 1021 gttgaaaaag caacagatgg aaaataccag tcaaaacgga gtagaaacag aaaaagagcc 1081 gcagaagcta aagaggtgaa ggaccatgaa gtgccccttg gaaatgcagg ttgtcagact 1141 tctgttgctt ctagtcagaa ttcaagttgt ggagatggta ttcagcagac ccaagacaag 1201 gaaactactc caggaaaact ctgtagaact caaaaaagcc aacacgtgtc cccaaagata 1261 cgtccagttt atgatggata ctgtccaaat tgccagatgc ctttttcctc attgataggg 1321 cagacacctc gatggcatgt ttttgaatgt ttggattctc caccacgctc tgaaacagag 1381 tgtcctgatg gtcttctgtg tacctcaacc attccttttc attacaagag atacactcac 1441 ttcctgctag ctcaaagcag ggctggtgat catcctttta gcagcccatc acctgcgtca 1501 ggtggcagtt tcagtgagac taagtcaggc gtcctttgta gccttgagga aagatggtct 1561 tcgtatcaga accaaactga taactcggtt tcaaatgatc ccttattgat gacacagtat 1621 tttaaaaagt ctccgtctct gactgaagcc agtgaaaaga tttctactca tatccaaaca 1681 tcccaacaag ctctacaatt tacagatttt gttgagaatg acaaactagt gggagttgct 1741 ttgcgtcttg caaacaactc agaacacata aatttgccat tgccagaaaa tgacttcagt 1801 gactgtgaaa tctcctattc tccacttcaa agtgatgaag acactcatga tatcgatgaa 1861 aaaccggatg attcacaaga acaactgttt tttaccgaaa gctcaaaaga tggcagcctc 1921 gaagaagatg atgacagctg tggttttttt aaaaaacgac atggtccctt actgaaggac 1981 caggatgaga gctgccccaa agtgaacagc ttcttaactc gggataagta tgatgaagga 2041 ttgtatagat tcaatagtct aaatgatttg tctcaaccta tttctcaaaa taatgagagt 2101 actttgcctt atgatctggc atgtactggt ggtgattttg tgttgtttcc acctgcattg 2161 gcagggaagc ttgctgcttc tgttcatcag gcaactaaag caaaacctga tgagccagaa 2221 tttcactcag ctcaatcaaa taaacagaaa caggtaattg aagaatcatc tgtttacaat 2281 caagtttctc ttccgttagt taagagttta atgttgaaac cttttgaaag tcaggtagaa 2341 gggtatcttt cttcccaacc aacccaaaat acaattagaa aattatcaag tgagaacttg 2401 aatgctaaga ataatactaa ctcagcatgt ttctgcagaa aggcattaga gggtgtgcca 2461 gttggtaaag ctacaatttt aaatacagaa aacttgtcta gtacacctgc tccgaagtat 2521 ttgaaaatat tgccttctgg tcttaagtat aatgcaagac atccttctac caaggtaatg 2581 aagcaaatgg atataggtgt gtattttgga ctacctccca aaagaaagga agagaaattg 2641 ctaggggaaa gtgcattaga agggataaac ttaaatccag ttccaagtcc taatcaaaag 2701 aggtcctcgc agtgcaagag gaaagcagaa aaatctttaa gtgatttaga atttgatgca 2761 agtactttac atgagagtca gctttctgtg gaactttcta gtgagaggtc acagcgtcaa 2821 aaaaagagat gtagaaagtc aaattcactg caggaaggag cgtgtcagaa gagatcagat 2881 caccttatta atacagaatc tgaagcagtc aatttaagta aagtcaaagt cttcacaaaa 2941 tcagctcatg gtgggctgca aaggggcaac aagaaaatcc cagagtcatc taatgtagga 3001 ggatcaagaa aaaagacatg tccattctat aagaaaatac ctggaaccgg ctttacagtt 3061 gatgcctttc agtatggcgt ggttgaaggt tgcacagcct attttctcac acattttcat 3121 tctgatcatt atgctggatt gtctaaacac ttcacatttc cagtttattg tagtgagata 3181 actggcaatt tgttgaagaa caagcttcat gtgcaagaac aatatattca cccattgcca 3241 ctggacactg aatgtattgt gaatggtgtc aaagttgttt tgcttgatgc caatcactgt 3301 ccaggtgctg tcatgatcct cttttatctt cctaatggta ctgtcatatt acacacggga 3361 gacttcagag cagatcccag catggaacgt tctcttcttg cggaccagaa agtccatatg 3421 ctgtacttag ataccacata ttgtagccca gaatacacct ttccatctca gcaagaggtt 3481 atccggtttg ccatcaacac tgcctttgag gctgtaactc taaacccaca tgctcttgtt 3541 gtctgtggca cttactctat tggaaaagag aaagtcttcc tagccattgc tgatgtttta 3601 ggttcaaaag tgggcatgtc ccaggaaaaa tataaaactc tacagtgcct caatatacca 3661 gaaattaatt cactcatcac taccgacatg tgcagttcat tggttcacct tctcccaatg 3721 atgcaaatta attttaaggg cttacagagt catttgaaga agtgtggtgg gaaatacaat 3781 cagattttgg catttcgacc tacaggatgg acacactcta acaagttcac tagaatagca 3841 gatgttattc cccagaccaa aggaaacatt tcaatatatg gaattcctta cagtgaacac 3901 agcagctacc tagaaatgaa gcgctttgtc cagtggctga agccccagaa aatcatacct 3961 actgtaaatg tgggcacctg gaaatctagg agcacaatgg agaaatattt tagagagtgg 4021 aaattggaag ctggatattg atgatacctc cgaggattca gtagtagtta agttccttgg 4081 atgtagcttg ttagtagtta aatctataga aatgtgaaat acactttgtg tggaaaaacc 4141 tcatgaagat tgttcagata ctttattttc tcatttatgt ttgaacaaca tgttcgtggt 4201 gctgaatgcc tctcagcatc atcaaggata actgaaactg ggtctccctg ggacccttaa 4261 tttcttgtcc cctgccctcc atgggcagtt atattctgca tcaagcctta gaagaggaag 4321 caaaggcaga ttcagggacc aaaaggatta atgataatta ataaagtagt ttgaagcatt 4381 atatatataa gtaattatgt gtctttaaaa ttatgagatg aaacttttat atgacgtgta 4441 tacttaaata aaattaatat aaaatttg // LOCUS HUMKIAAL 4338 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0091 gene, complete cds. ACCESSION D42053 NID g577308 KEYWORDS KIAA0091. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4338) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (08-NOV-1994) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 4338) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayashi,Y., Sato,S., Nagase,T., Seki,T., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. III. The coding sequences of 40 new genes (KIAA 0081 - KIAA 0120) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG1 JOURNAL Unpublished (1995) REFERENCE 3 (sites) AUTHORS Nagase,T., Miyajima,N., Tanaka,A., Sazuka,T., Seki,N., Sato,S., Tabata,S., Ishikawa,K.-i., Kawarabayasi,Y., Kotani,H. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. III. The coding sequences of 40 new genes (KIAA0081-KIAA0120) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (1), 37-43 (1995) MEDLINE 95308325 FEATURES Location/Qualifiers source 1..4338 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..496 gene 497..3655 /gene="KIAA0091" CDS 497..3655 /gene="KIAA0091" /note="KIAA0091 gene product is related to subtilisin." /citation=[3] /codon_start=1 /db_xref="PID:d1008235" /db_xref="PID:g577309" /translation="MKLVNIWLLLLVVLLCGKKHLGDRLEKKSFEKAPCPGCSHLTLK VEFSSTVVEYEYIVAFNGYFTAKARNSFISSALKSSEVDNWRIIPRNNPSSDYPSDFE VIQIKEKQKAGLLTLEDHPNIKRVTPQRKVFRSLKYAESDPTVPCNETRWSQKWQSSR PLRRASLSLGSGFWHATGRHSSRRLLRAIPRQVAQTLQADVLWQMGYTGANVRVAVFD TGLSEKHPHFKNVKERTNWTNERTLDDGLGHGTFVAGVIASMRECQGFAPDAELHIFR VFTNNQVSYTSWFLDAFNYAILKKIDVLNLSIGGPDFMDHPFVDKVWELTANNVIMVS AIGNDGPLYGTLNNPADQMDVIGVGGIDFEDNIARFSSRGMTTWELPGGYGRMKPDIV TYGAGVRGSGVKGGCRALSGTSVASPVVAGAVTLLVSTVQKRELVNPASMKQALIASA RRLPGVNMFEQGHGKLDLLRAYQILNSYKPQASLSPSYIDLTECPYMWPYCSQPIYYG GMPTVVNVTILNGMGVTGRIVDKPDWQPYLPQNGDNIEVAFSYSSVLWPWSGYLAISI SVTKKAASWEGIAQGHVMITVASPAETESKNGAEQTSTVKLPIKVKIIPTPPRSKRVL WDQYHNLRYPPGYFPRDNLRMKNDPLDWNGDHIHTNFRDMYQHLRSMGYFVEVLGAPF TCFDASQYGTLLMVDSEEEYFPEEIAKLRRDVDNGLSLVIFSDWYNTSVMRKVKFYDE NTRQWWMPDTGGANIPALNELLSVWNMGFSDGLYEGEFTLANHDMYYASGCSIAKFPE DGVVITQTFKDQGLEVLKQETAVVENVPILGLYQIPAEGGGRIVLYGDSNCLDDSHRQ KDCFWLLDALLQYTSYGVTPPSLSHSGNRQRPPSGAGSVTPERMEGNHLHRYSKVLEA HLGDPKPRPLPACPRLSWAKPQPLNETAPSNLWKHQKLLSIDLDKVVLPNFRSNRPQV RPLSPGESGAWDIPGGIMPGRYNQEVGQTIPVFAFLGAMVVLAFFVVQINKAKSRPKR RKPRVKRPQLMQQVHPPKTPSV" 3'UTR 3656..4338 BASE COUNT 1085 a 1051 c 1160 g 1042 t ORIGIN 1 cagggcacgc tgggtcggcg gagctgaggc tcccagctgt gggcctcgct ggcccggtcg 61 cccagtctcg cgagagttgg gagtaaacag ccccgaatgg agtgcccagg cgtgttcgcc 121 gcggaggcgc cgttatcccg ggcccgccgg ccctgagctc ccggcggcgc agattggctc 181 acagtggttg attgatcaac cccattggac gttggttctg tggtacaaat ggagtacagg 241 actcagtcgt cacggcctga gtgagagaag ccttatttcc aagatggaga agaagcggag 301 aaagaaatga aagcctctct tcaggctgaa ccacaaaagg ccatgggatt taacttttat 361 ttatgttggg caagactgta agatggctga tcagtaatgt tgcagctttt agctgaaaca 421 aaaattcact tttaatcaag aagaaaaaag tgtgatttga atatatgcaa ttttatgatc 481 atattcgctt gtgaccatga agcttgtcaa catctggctg cttctgctcg tggttttgct 541 ctgtgggaag aaacatctgg gcgacagact ggaaaagaaa tcttttgaaa aggccccatg 601 ccctggctgt tcccacctga ctttgaaggt ggaattctca tcaacagttg tggaatatga 661 atatattgtg gctttcaatg gatactttac agccaaagct agaaattcat ttatttcaag 721 tgccctgaag agcagtgaag tagacaattg gagaattata cctcgaaaca atccatccag 781 tgactaccct agtgattttg aggtgattca gataaaagaa aaacagaaag cggggctgct 841 aacacttgaa gatcatccaa acatcaaacg ggtcacgccc caacgaaaag tctttcgttc 901 cctcaagtat gctgaatctg accccacagt accctgcaat gaaacccggt ggagccagaa 961 gtggcaatca tcacgtcccc tgcgaagagc cagcctctcc ctgggctctg gcttctggca 1021 tgctacggga aggcattcga gcagacggct gctgagagcc atcccgcgcc aggttgccca 1081 gacactgcag gcagatgtgc tctggcagat gggatataca ggtgctaatg taagagttgc 1141 tgtttttgac actgggctga gcgagaagca tccccacttc aaaaatgtga aggagagaac 1201 caactggacc aacgagcgaa cgctggacga tgggttgggc catggcacat tcgtggcagg 1261 tgtgatagcc agcatgaggg agtgccaagg atttgctcca gatgcagaac ttcacatttt 1321 cagggtcttt accaataatc aggtatctta cacatcttgg tttttggacg ccttcaacta 1381 tgccatttta aagaagatcg acgtgttaaa cctcagcatc ggcggcccgg acttcatgga 1441 tcatccgttt gttgacaagg tgtgggaatt aacagctaac aatgtaatca tggtttctgc 1501 tattggcaat gacggacctc tttatggcac tctgaataac cctgctgatc aaatggatgt 1561 gattggagta ggcggcattg actttgaaga taacatcgcc cgcttttctt caaggggaat 1621 gactacctgg gagctaccag gaggctacgg tcgcatgaaa cctgacattg tcacctatgg 1681 tgctggcgtg cggggttctg gcgtgaaagg ggggtgccgg gccctctcag ggaccagtgt 1741 tgcttctcca gtggttgcag gtgctgtcac cttgttagtg agcacagtcc agaagcgtga 1801 gctggtgaat cccgccagta tgaagcaggc cctgatcgcg tcagcccgga ggctccccgg 1861 ggtcaacatg tttgagcaag gccacggcaa gctcgatctg ctcagagcct atcagatcct 1921 caacagctac aagccacagg caagtttgag ccccagctac atagatctga ctgagtgtcc 1981 ctacatgtgg ccctactgct cccagcccat ctactatgga ggaatgccga cagttgttaa 2041 tgtcaccatc ctcaacggca tgggagtcac aggaagaatt gtagataagc ctgactggca 2101 gccctatttg ccacagaacg gagacaacat tgaagttgcc ttctcctact cctcggtctt 2161 atggccttgg tcgggctacc tggccatctc catttctgtg accaagaaag cggcttcctg 2221 ggaaggcatt gctcagggcc atgtcatgat cactgtggct tccccagcag agacagagtc 2281 aaaaaatggt gcagaacaga cttcaacagt aaagctcccc attaaggtga agataattcc 2341 tactcccccg cgaagcaaga gagttctctg ggatcagtac cacaacctcc gctatccacc 2401 tggctatttc cccagggata atttaaggat gaagaatgac cctttagact ggaatggtga 2461 tcacatccac accaatttca gggatatgta ccagcatctg agaagcatgg gctactttgt 2521 agaggtcctc ggggccccct tcacgtgttt tgatgccagt cagtatggca ctttgctgat 2581 ggtggacagt gaggaggagt acttccctga agagatcgcc aagctccgga gggacgtgga 2641 caacggcctc tcgctcgtca tcttcagtga ctggtacaac acttctgtta tgagaaaagt 2701 gaagttttat gatgaaaaca caaggcagtg gtggatgccg gataccggag gagctaacat 2761 cccagctctg aatgagctgc tgtctgtgtg gaacatgggg ttcagcgatg gcctgtatga 2821 aggggagttc accctggcca accatgacat gtattatgcg tcagggtgca gcatcgcgaa 2881 gtttccagaa gatggcgtcg tgataacaca gactttcaag gaccaaggat tggaggtttt 2941 aaagcaggaa acagcagttg ttgaaaacgt ccccattttg ggactttatc agattccagc 3001 tgagggtgga ggccggattg tactgtatgg ggactccaat tgcttggatg acagtcaccg 3061 acagaaggac tgcttttggc ttctggatgc cctcctccag tacacatcgt atggggtgac 3121 accgcctagc ctcagtcact ctgggaaccg ccagcgccct cccagtggag caggctcagt 3181 cactccagag aggatggaag gaaaccatct tcatcggtac tccaaggttc tggaggccca 3241 tttgggagac ccaaaacctc ggcctctacc agcctgtcca cgcttgtctt gggccaagcc 3301 acagccttta aacgagacgg cgcccagtaa cctttggaaa catcagaagc tactctccat 3361 tgacctggac aaggtggtgt tacccaactt tcgatcgaat cgccctcaag tgaggccctt 3421 gtcccctgga gagagcggcg cctgggacat tcctggaggg atcatgcctg gccgctacaa 3481 ccaggaggtg ggccagacca ttcctgtctt tgccttcctg ggagccatgg tggtcctggc 3541 cttctttgtg gtacaaatca acaaggccaa gagcaggccg aagcggagga agcccagggt 3601 gaagcgcccg cagctcatgc agcaggttca cccgccaaag accccttcgg tgtgaccggc 3661 agcctggctg accgtgaggg ccagagagag ccttcacgga cggcgctggt gggtgagccg 3721 agctgtggtg gcggctggtt taaaagggat ccagtttcca gctgcaggtt tgttagagtc 3781 tgttctacat gggcctgccc tcctgtgatg ggcagaggct cctggtacat cgagaagatt 3841 cctgtggatc ccgtcaggag ggacttagtg gctctgccgc cagtgagact tcccgccggc 3901 agctgtgcgc accaaagact cgggagaact ggaaaggctg tctggggtct tctgactgca 3961 ggggaaggat gtactttcca aacaaatgat acaaccctga ccaagctaaa agacgcttgt 4021 taaaggctat tttctatatt tattgttggg aaaagtcact ttaaagactt gtgctatttg 4081 gaagcaaagc tatttttttt gtcagtggaa tgcagttttt ttactattcc atcatgagga 4141 acaacataga ttccatgatc tttttaatga cagtacagac tgagatttga aggaaacatg 4201 cacaaatctg taaaacatag accttcgctt tatttttgta agtatcacct gccaccatgt 4261 tttgtaattt gaggtcttga tttcaccatt gtcggtgaag aaaattttca ataaatatgt 4321 attacccgtc tgaagctt // LOCUS HUMKIAAM 2913 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0092 gene, complete cds. ACCESSION D42054 NID g577310 KEYWORDS KIAA0092. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2913) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (08-NOV-1994) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 2913) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayashi,Y., Sato,S., Nagase,T., Seki,T., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. III. The coding sequences of 40 new genes (KIAA 0081 - KIAA 0120) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG1 JOURNAL Unpublished (1995) REFERENCE 3 (sites) AUTHORS Nagase,T., Miyajima,N., Tanaka,A., Sazuka,T., Seki,N., Sato,S., Tabata,S., Ishikawa,K.-i., Kawarabayasi,Y., Kotani,H. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. III. The coding sequences of 40 new genes (KIAA0081-KIAA0120) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (1), 37-43 (1995) MEDLINE 95308325 FEATURES Location/Qualifiers source 1..2913 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..53 gene 54..1478 /gene="KIAA0092" CDS 54..1478 /gene="KIAA0092" /note="KIAA0092 gene product is distantly related to smooth muscle myosin." /citation=[3] /codon_start=1 /db_xref="PID:d1008236" /db_xref="PID:g577311" /translation="MAAASVSAASGSHLSNSFAEPSRSNGSMVRHSSSPYVVYPSDKP FLNSDLRRSPSKPTLAYPESNSRAIFSALKNLQDKIRRLELERIQAEESVKTLSRETI EYKKVLDEQIQERENSKNEESKHNQELTSQLLAAENKCNLLEKQLEYMRNMIKHAEME RTSVLEKQVSLERERQHDQTHVQSQLEKLDLLEQEYNKLTTMQALAEKKMQELEAKLH EEEQERKRMQAKAAELQTGLETNRLIFEDKATPCVPNARRIKKKKSKPPEKSTSPSHA VVANVQLVLHLMKQHSKALCNDRVINSIPLAKQVSSRGGKSKKLSVTPPSSNGINEEL SEVLQTLQDEFGQMSFDHQQLAKLIQESPTVELKDKLECELEALVGRMEAKANQITKV RKYQAQLEKQKLEKQKKELKATKKTLDEERNSSSRSGITGTTNKKDFMKQRPGEKRRK NLQLLKDMQSIQNSLQSSSLCWDY" 3'UTR 1479..2913 BASE COUNT 1008 a 506 c 576 g 823 t ORIGIN 1 acgagaacct agaccgcccc cgaagtgcgg agaccccctg ggcaggctga aagatggcgg 61 cggcgtctgt ctctgcggct tctggttctc acttgtcgaa cagctttgct gagccatcaa 121 ggtctaatgg aagcatggtt cggcattctt catctccata tgtagtatat ccttcggata 181 agcctttcct taatagtgat ctacgacgct ccccaagtaa gcctacactt gcctatccag 241 aaagcaacag cagagccata ttttctgctc ttaagaatct tcaagataag attcgacgct 301 tggaacttga gaggattcag gcagaagaaa gtgtgaaaac cttgtctaga gaaacaattg 361 aatataagaa agtactggat gaacagatac aagaaaggga gaattcaaag aatgaggaat 421 caaagcacaa tcaagaactg acatctcagt tgttagctgc agaaaataaa tgcaatctat 481 tagaaaaaca attggaatac atgcgaaata tgataaagca tgccgaaatg gagaggacat 541 ctgtcttaga gaaacaagtt tccctagaaa gagaacgaca acatgatcaa acacatgttc 601 agagccaact tgaaaaattg gatcttcttg aacaggagta taacaaactt accacaatgc 661 aggcccttgc agaaaaaaaa atgcaagagt tggaagcaaa actccatgaa gaagaacagg 721 aaaggaaacg catgcaagct aaggcagctg agttgcagac tggtctagaa acaaatagac 781 ttatctttga agataaggca actccgtgtg ttcccaatgc aagaagaatt aaaaaaaaga 841 agtcaaaacc accagaaaag tccacaagcc ctagccatgc cgtggtagcc aatgttcagc 901 ttgtcttgca tctaatgaag caacacagta aagctttgtg caatgatcga gtcatcaaca 961 gtattccttt ggcaaagcaa gtatcttcac gaggtggtaa aagtaagaag ttgtcagtaa 1021 cacctccctc ctccaacggt attaatgagg agttgtcaga agtcttacag actttacagg 1081 atgaatttgg gcaaatgagc tttgatcacc agcagcttgc aaaacttatc caggagtcgc 1141 caaccgttga actgaaagac aagttggagt gtgaattgga ggcattagtg ggaaggatgg 1201 aagcaaaagc caaccaaata actaaagttc gaaaatacca agcccagctg gagaaacaga 1261 agttagagaa gcagaagaag gaattaaaag ctaccaaaaa gactcttgat gaagaaagaa 1321 acagcagcag ccgttctgga atcacaggga ccacaaataa gaaagatttt atgaaacaga 1381 gacctggaga aaaaaggaga aaaaatcttc agttattgaa ggacatgcaa agcatacaga 1441 attcattaca aagcagtagt ttgtgttggg attactgact cataaccagg tcagaaattt 1501 tattcagata atctgtacct catcaatcag atgatgacaa tttacttccc aggtctcata 1561 ctcacttatg ttggaattaa ttaatagcag gtgttaaagg acccaggctt cattacacag 1621 gcttttcatg tatgcaggat gactcaatgt taaagcattt aaatggaaac caggggagtt 1681 ttaaagcccg agaaaccaca cataatcttt tgttgagatg agtttgctgt actgacgctg 1741 cactttgtaa acagattacc agttttttac ttgtgggtgt gattttttaa aatagttctt 1801 tatatataat taaaattagg tttaaatttt caaaatatga gactatgcta taggcagtgc 1861 ttgcttgaaa agtctcattt ttaaatctcc ctggcatgag gtgcccactt cccctttcca 1921 aagcaaccat taaacatact ttgtttctac tattggtgga gtttttctat atttaaaaat 1981 acatatatat ttcagaggat ttttattttg ctttttggca tttcagactt tgatcagtgt 2041 taagtgcact tgtattgctt tttaatctgt taatttttta aagacccaag cagtcatttt 2101 gagttatatc tataaaaatt ataaaaggat ttttgaaagt ataaacaaat tgtcagtgaa 2161 ataaatgaga ttttggaata aagtgagaat gggagaaggg atatgttgtg agcatatacc 2221 ttcacagttc ttaatacctg ttttgtaatc atgatattca gtcaaggcat tatggttttt 2281 aatcttgaaa cttagagaac cctttgaata tttgctttta ctggtgtaca gtatgagtgg 2341 aatataaact gtacacataa ttatcatgtt gatataaatc ataatttcaa ctagatcaag 2401 acatgttaac cttttataaa tttaaagtca ataaagcacc tttttaaagg aaacactgca 2461 actttcctta acagcctgtt aataaaggct ttgtgacaag ctctcaaaaa atgcatttct 2521 aaataggagg acatcaatgt attgatgaag agaaaaaact agtacttagt tgccacactc 2581 atgcttacat agaaagagag cccaagaata ttagatttcc tcatgataca agatactaca 2641 gtaacaggct ttaatttagg atccttaaga ttttggggta ttatttgtga ctctcctgaa 2701 attgtaaact tgtgcttctg tgtccagttt tctaatgagt aggttcgtag cttgattgaa 2761 ttaataattg tgagcccata gacacaaggg aagtgagaaa cagtgctctg gtgacatgat 2821 aaatatatgt gtcaaccacc atttcagcta ttaaaaactc ctgttatctc cttgtttgaa 2881 tttcaggtca ttaaattgta taaccatcat ttg // LOCUS HUMKIAAP 2681 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0095 gene, complete cds. ACCESSION D42085 NID g577316 KEYWORDS KIAA0095. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2681) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (09-NOV-1994) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 2681) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayashi,Y., Sato,S., Nagase,T., Seki,T., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. III. The coding sequences of 40 new genes (KIAA 0081 - KIAA 0120) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG1 JOURNAL Unpublished (1995) REFERENCE 3 (sites) AUTHORS Nagase,T., Miyajima,N., Tanaka,A., Sazuka,T., Seki,N., Sato,S., Tabata,S., Ishikawa,K.-i., Kawarabayasi,Y., Kotani,H. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. III. The coding sequences of 40 new genes (KIAA0081-KIAA0120) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (1), 37-43 (1995) MEDLINE 95308325 FEATURES Location/Qualifiers source 1..2681 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..66 gene 67..2526 /gene="KIAA0095" CDS 67..2526 /gene="KIAA0095" /note="KIAA0095 gene is related to S.cerevisiae NIC96 gene." /citation=[3] /codon_start=1 /db_xref="PID:d1008263" /db_xref="PID:g577317" /translation="MDTEGFGELLQQAEQLAAETEGISELPHVERNLQEIQQAGERLR SRTLTRTSQETADVKASVLLGSRGLDISHISQRLESLSAATTFEPLEPVKDTDIQGFL KNEKDNALLSAIEESRKRTFGMAEEYHRESMLVEWEQVKQRILHTLLASGEDALDFTQ ESEPSYISDVGPPGRSSLDNIEMAYARQIYIYNEKIVNGHLQPNLVDLCASVAELDDK SISDMWTMVKQMTDVLLTPATDALKNRSSVEVRMEFVRQALAYLEQSYKNYTLVTVFG NLHQAQLGGVPGTYQLVRSFLNIKLPAPLPGLQDGEVEGHPVWALIYYCMRCGDLLAA SQVVNRAQHQLGEFKTWFQEYMNSKDRRLSPATENKLRLHYRRALRNNTDPYKRAVYC IIGRCDVTDNQSEVADKTEDYLWLKLNQVCFDDDGTSSPQDRLTLSQFQKQLLEDYGE SHFTVNQQPFLYFQVLFLTAQFEAAVAFLFRMERLRCHAVHVALVLFELKLLLKSSGQ SAQLLSHEPGDPPCLRRLNFVRLLMLYTRKFESTDPREALQYFYFLRDEKDSQGENMF LRCVSELVIESREFDMILGKLENDGSRKPGVIDKFTSDTKPIINKVASVAENKGLFEE AAKLYDLAKNADKVLELMNKLLSPVVPQISAPQSNKERLKNMALSIAERYRAQGISAN KFVDSTFYLLLDLITFFDEYHSGHIDRAFDIIERLKLVPLNQESVEERVAAFRNFSDE IRHNLSEVLLATMNILFTQFKRLKGTSPSSSSRPQRVIEDRDSQLRSQARTLITFAGM IPYRTSGDTNARLVQMEVLMN" 3'UTR 2527..2681 BASE COUNT 666 a 651 c 723 g 641 t ORIGIN 1 cggccgcgtc ctcaagccgg cacctgagcg gcggagacgg ctgtagcaca aggatctgca 61 tctccaatgg atactgaggg gtttggtgag ctccttcagc aagctgaaca gcttgctgct 121 gagactgagg gcatctcaga gcttccccat gtggaacgga acttacagga gatccagcag 181 gcgggagagc gcctgcgttc ccgtacccta acacgcacgt cccaggagac ggcagatgtc 241 aaggcgtcag ttctcctcgg gtctcgggga cttgacatat cccacatctc ccagcgattg 301 gagagtctga gtgcagccac cacctttgag cctcttgagc ctgtgaagga cactgacatt 361 cagggcttcc tgaagaatga gaaggacaat gccctgctgt ctgccatcga agagtcccgg 421 aagaggacct tcggcatggc tgaggagtac catcgggagt caatgttggt tgagtgggag 481 caagtgaaac agcgaattct gcacacactg ctggcatcag gagaagacgc ccttgacttt 541 actcaagaaa gcgagccaag ctacatcagt gatgtgggac cccctggtcg aagctctctg 601 gataacatcg agatggccta tgcgcggcaa atttatatct ataatgagaa aattgtaaat 661 ggacacctgc agcctaacct ggtggacctt tgtgcttccg tcgcagagct ggatgataag 721 agcatttccg acatgtggac catggtaaaa caaatgacag acgtgttgtt gacaccggca 781 acggatgccc tgaagaaccg cagcagcgtg gaagtgcgca tggagtttgt caggcaggcc 841 ttggcgtacc ttgagcagag ttataagaat tacacccttg tgactgtctt tggaaatttg 901 catcaggccc agctgggcgg ggtgcctggg acttaccaat tggttcgaag tttcctgaac 961 attaaactgc cagctccctt gcctggacta caggatggag aggtggaagg ccatcctgtg 1021 tgggcgctaa tttactactg catgcgctgt ggagacctgc ttgccgcttc acaggtagtt 1081 aatcgagccc agcaccagct gggagagttt aaaacctggt tccaggagta catgaacagc 1141 aaggacagaa gattgtcccc agctacggaa aacaagctcc ggctgcatta ccgtagggcc 1201 ctcaggaaca atacagatcc ctacaagcgg gccgtgtact gtatcattgg cagatgtgac 1261 gtcaccgaca accagagtga agtggcggac aaaactgagg attacctgtg gctgaagttg 1321 aaccaagtgt gttttgacga cgatggcacc agctccccac aagacaggct cactctctca 1381 cagttccaga agcagttgtt ggaagactat ggcgagtccc actttacggt gaaccagcaa 1441 cccttcctct acttccaagt cctgttcctg acagcgcagt ttgaagcagc agttgccttt 1501 cttttccgca tggagcggct gcgctgccat gctgtccatg tagcactggt gctgtttgag 1561 ctgaagctgc ttttaaagtc ctctggacag agtgctcagc tcctcagcca cgagcctggt 1621 gaccctcctt gcttgcggcg gctgaacttc gtgcggctcc tcatgctgta cacccggaag 1681 tttgagtcca cggacccaag ggaggccctc cagtacttct atttcctcag ggatgagaaa 1741 gatagtcaag gagaaaacat gtttctgcgc tgtgtgagtg agcttgtgat tgaaagccga 1801 gagttcgata tgattcttgg gaaactagag aatgacggaa gtagaaagcc tggagtcata 1861 gataagttta ctagtgacac aaagcctatt atcaacaaag ttgcttctgt ggcagaaaat 1921 aaaggactgt ttgaagaggc agcaaagctg tatgaccttg ccaagaatgc tgacaaggta 1981 ctggagctga tgaacaaact gctgagccct gtcgtccccc agatcagtgc cccgcaatcc 2041 aacaaggaga ggctgaagaa catggcactc tccattgccg aacggtatag ggctcaagga 2101 ataagcgcaa ataaatttgt ggactccacg ttctatcttc ttttggactt gatcaccttt 2161 tttgacgagt atcatagtgg tcatattgat agagcttttg atatcattga gcgcttgaag 2221 ctggtgcccc tgaatcagga aagtgtggaa gagagagtgg ctgctttcag aaatttcagt 2281 gatgaaatca ggcacaacct ctcagaagtg cttcttgcca ccatgaacat cttgttcaca 2341 cagtttaaga ggctcaaggg gacaagtcca tcctcgtcat ccaggcccca gcgagtcatc 2401 gaggaccgcg actctcaact ccgaagtcaa gcccgcactc tgattacctt tgctggaatg 2461 ataccatacc gaacgtctgg ggacaccaat gcgaggctgg tgcagatgga ggtcctcatg 2521 aattaagtgc catgctttgt gggagtctgg gtcggcacac tgtcagtaca tcaggcacat 2581 gggcccacta ggctggggtt tctggttttg tttctgttgt gttttgtttt ggtttctgta 2641 ttatgtattt ttgtcaacgc caataaattt ctttgatttg t // LOCUS HUMKINESLC 2308 bp mRNA PRI 09-MAR-1994 DEFINITION Homo sapiens kinesin light chain mRNA, complete cds. ACCESSION L04733 NID g307084 KEYWORDS kinesin light chain. SOURCE Homo sapiens (human). ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2308) AUTHORS Cabeza-Arvelaiz,Y., Shih,L.C., Hardman,N., Asselbergs,F., Bilbe,G., Schmitz,A., White,B., Siciliano,M.J. and Lachman,L.B. TITLE Cloning and genetic characterization of the human kinesin light-chain (KLC) gene JOURNAL DNA Cell Biol. 12 (10), 881-892 (1993) MEDLINE 94099888 FEATURES Location/Qualifiers source 1..2308 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T-cell" /tissue_type="blood" /tissue_lib="lambda-gt11, PHA-stimulated T-cell" CDS 277..1986 /note="putative" /codon_start=1 /function="membrane-bounded organelles transport" /product="kinesin light chain" /db_xref="PID:g307085" /translation="MSTMVYIKEDKLEKLTQDEIISKTKQVIQGLEALKNEHNSILQS LLETLKCLKKDDESNLVEEKSNMIRKSLEMLELGLSEAQVMMALSNHLNAVESEKQKL RAQVRRLCQENQWLRDELANTQQKLQKSEQSVAQLEEEKKHLEFMNQLKKYDDDISPS EDKDTDSTKEPLDDLFPNDEDDPGQGIQQQHSSAAAAAQQGGYEIPARLRTLHNLVIQ YASQGRYEVAVPLCKQALEDLEKTSGHDHPDVATMLNILALVYRDQNKYKDAANLLND ALAIREKTLGKDHPAVAATLNNLAVLYGKRGKYKEAEPLCKRALEIREKVLGKDHPDV AKQLNNLALLCQNQGKYEEVEYYYQRALEIYQTKLGPDDPNVAKTKNNLASCYLKQGK FKQAETLYKEILTRAHEREFGSVDDENKPIWMHAEEREECKGKQKDGTSFGEYGGWYK ACKVDSPTVTTTLKNLGALYRRQGKFEAAETLEEAAMRSRKQGLDNVHKQRVAEVLND PENMEKRRSRESLNVDVVKYESGPDGGEEVSMSVEWNGGVSGRASFCGKRQQQQWPGR RHR" BASE COUNT 618 a 545 c 705 g 440 t ORIGIN 1 gaattcgggc gagcgggact ggctgggtcg gctgggctgc tggtgcgagg agccgcgggg 61 ctgtgctcgg cggccaaggg gacagcgcgt gggtggccga ggatgctgcg gggcggtagc 121 tccggcgccc ctagctggtg actgctgcgc cgtgcctcac acagccgagg cgggctcggc 181 gcacagtcgc tgctccgcgc gcgcgcccgg cggcgctcca ggtgctgaca gcgcgagaga 241 gcgcggccct caggagcaag gcgaatgtat gacaccatgt ccacaatggt gtacataaag 301 gaagacaagt tggagaagct tacacaggat gaaattattt ctaagacaaa gcaagtaatt 361 caggggctgg aagctttgaa gaatgagcac aattccattt tacaaagttt gctggagaca 421 ctgaagtgtt tgaagaaaga tgatgaaagt aatttggtgg aggagaaatc aaacatgatc 481 cggaagtcac tggagatgtt ggagctcggc ctgagtgagg cacaggttat gatggctttg 541 tcaaatcacc tgaatgctgt ggagtccgag aagcagaaac tgcgtgcgca ggttcgtcgt 601 ctgtgccagg agaatcagtg gctacgggat gaactggcca acacgcagca gaaactgcag 661 aagagtgagc agtctgtggc tcaactggag gaggagaaga agcatctgga gtttatgaat 721 cagctaaaaa aatatgatga cgacatttcc ccatccgagg acaaagacac tgattctacc 781 aaagagcctc tggatgacct tttccccaat gatgaagacg acccagggca aggaatccag 841 cagcagcaca gcagtgcagc cgcggctgcc cagcagggcg gctacgagat ccccgcgcgg 901 ctgcggacgc tccacaacct ggtgatccag tacgcctcgc aggggcgcta cgaggtagct 961 gtgcccctct gcaagcaggc cctggaggac ctggagaaga cttcaggaca cgaccacccg 1021 gacgtggcca ccatgctcaa catcctggcc ttggtgtaca gggatcagaa taaatacaaa 1081 gatgcagcta acctactgaa tgatgccttg gctattcgtg agaaaacttt gggcaaagat 1141 catcctgcgg tggcggcgac tttgaataac cttgcagtcc tttatggtaa aagagggaag 1201 tacaaagaag cagagccgtt gtgtaaaaga gctctggaaa tccgagaaaa ggttttgggg 1261 aaggatcacc ccgatgttgc caagcagtta aataacttgg ccttactgtg ccagaaccag 1321 ggcaagtatg aagaagtaga atattattat caaagagccc tcgagatcta ccagacaaaa 1381 ctgggacctg atgaccccaa cgtggctaag acgaaaaata acctggcatc ctgctatttg 1441 aaacaaggaa agttcaagca agcagaaaca ctgtacaaag agattctcac tcgtgcacat 1501 gaaagggagt ttggttctgt agatgatgaa aataaaccca tctggatgca tgctgaagaa 1561 agagaagaat gcaaaggaaa gcaaaaggat gggacatctt ttggagagta tggcggctgg 1621 tacaaagcct gcaaagttga tagtccaact gttacaacca ctctaaaaaa ccttggggca 1681 ctttacagac gtcaaggcaa atttgaagct gcagaaacgt tagaagaagc tgctatgagg 1741 tctcgtaaac agggtcttga caatgttcac aaacagaggg tggcagaagt gctcaatgac 1801 cctgagaaca tggagaagcg caggagccgt gagagcctca acgtggacgt ggtcaagtac 1861 gagagtggcc ctgacggagg ggaggaagtg agtatgagcg tagagtggaa cgggggcgtc 1921 tctggccgag cctctttttg tggaaaacga cagcagcagc agtggcctgg aagacgccac 1981 cgctaactga ccccgacctg gccccgctcc aggatgggac tgccgagtgt ggcccggagc 2041 tggcccggga cagccagggc ggcagggagg cccctggccg ggagcgcagc gctcactcat 2101 ttctcctgcg tctgtgtgca taggacatga tactaataac cacacggctg gcgtgacctt 2161 ggggctgggg ctgggcctaa gctggtgccc tggtgcggcg tggtctctcc caggagacct 2221 ggggcatgag ctgggcccac ggctcccttc ccatgtgtaa cttcctcacg ttgtgtgcga 2281 taacgtattt tattgtacac ccgaattc // LOCUS HUMKRASM 5775 bp mRNA PRI 04-FEB-1997 DEFINITION Human K-ras oncogene protein mRNA, complete cds. ACCESSION M54968 M38506 NID g1815608 KEYWORDS K-ras oncogene. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5775) AUTHORS Kahn,S., Yamamoto,F., Almoguera,C., Winter,E., Forrester,K., Jordano,J. and Perucho,M. TITLE The c-K-ras gene and human cancer (review) JOURNAL Anticancer Res. 7 (4A), 639-652 (1987) MEDLINE 88022525 FEATURES Location/Qualifiers source 1..5775 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="tumor" gene 193..5315 /gene="K-ras" CDS 193..759 /gene="K-ras" /codon_start=1 /product="K-ras oncogene protein" /db_xref="PID:g186764" /translation="MTEYKLVVVGACGVGKSALTIQLIQNHFVDEYDPTIEDSYRKQV VIDGETCLLDILDTAGQEEYSAMRDQYMRTGEGFLCVFAINNTKSFEDIHHYREQIKR VKDSEDVPMVLVGNKCDLPSRTVDTKQAQDLARSYGIPFIETSAKTRQGVDDAFYTLV REIRKHKEKMSKDGKKKKKKSKTKCVIM" repeat_unit 3031..3289 /gene="K-ras" /rpt_family="Alu repetitive sequence" polyA_signal 3601..3606 /gene="K-ras" polyA_signal 5297..5302 /gene="K-ras" polyA_site 5315 /gene="K-ras" BASE COUNT 1739 a 974 c 1105 g 1957 t ORIGIN 1 tcctaggcgg cggccgcggc ggcggaggca gcagcggcgg cggcagtggc ggcggcgaag 61 gtggcggcgg ctcggccagt actcccggcc cccgccattt cggactggga gcgagcgcgg 121 cgcaggcact gaaggcggcg gcggggccag aggctcagcg gctcccaggt gcgggagaga 181 ggcctgctga aaatgactga atataaactt gtggtagttg gagcttgtgg cgtaggcaag 241 agtgccttga cgatacagct aattcagaat cattttgtgg acgaatatga tccaacaata 301 gaggattcct acaggaagca agtagtaatt gatggagaaa cctgtctctt ggatattctc 361 gacacagcag gtcaagagga gtacagtgca atgagggacc agtacatgag gactggggag 421 ggctttcttt gtgtatttgc cataaataat actaaatcat ttgaagatat tcaccattat 481 agagaacaaa ttaaaagagt taaggactct gaagatgtac ctatggtcct agtaggaaat 541 aaatgtgatt tgccttctag aacagtagac acaaaacagg ctcaggactt agcaagaagt 601 tatggaattc cttttattga aacatcagca aagacaagac agggtgttga tgatgccttc 661 tatacattag ttcgagaaat tcgaaaacat aaagaaaaga tgagcaaaga tggtaaaaag 721 aagaaaaaga agtcaaagac aaagtgtgta attatgtaaa tacaatttgt acttttttct 781 taaggcatac tagtacaagt ggtaattttt gtacattaca ctaaattatt agcatttgtt 841 ttagcattac ctaatttttt tcctgctcca tgcagactgt tagcttttac cttaaatgct 901 tattttaaaa tgacagtgga agtttttttt tcctcgaagt gccagtattc ccagagtttt 961 ggtttttgaa ctagcaatgc ctgtgaaaaa gaaactgaat acctaagatt tctgtcttgg 1021 ggtttttggt gcatgcagtt gattacttct tatttttctt accaagtgtg aatgttggtg 1081 tgaaacaaat taatgaagct tttgaatcat ccctattctg tgttttatct agtcacataa 1141 atggattaat tactaatttc agttgagacc ttctaattgg tttttactga aacattgagg 1201 gacacaaatt tatgggcttc ctgatgatga ttcttctagg catcatgtcc tatagtttgt 1261 catccctgat gaatgtaaag ttacactgtt cacaaaggtt ttgtctcctt tccactgcta 1321 ttagtcatgg tcactctccc caaaatatta tattttttct ataaaaagaa aaaaatggaa 1381 aaaaattaca aggcaatgga aactattata aggccatttc cttttcacat tagataaatt 1441 actataaaga ctcctaatag ctttttcctg ttaaggcaga cccagtatga atgggattat 1501 tatagcaacc attttggggc tatatttaca tgctactaaa tttttataat aattgaaaag 1561 attttaacaa gtataaaaaa attctcatag gaattaaatg tagtctccct gtgtcagact 1621 gctctttcat agtataactt taaatctttt cttcaacttg agtctttgaa gatagtttta 1681 attctgcttg tgacattaaa agattatttg ggccagttat agcttattag gtgttgaaga 1741 gaccaaggtt gcaagccagg ccctgtgtga accttgagct ttcatagaga gtttcacagc 1801 atggactgtg tgccccacgg tcatccgagt ggttgtacga tgcattggtt agtcaaaaat 1861 ggggagggac tagggcagtt tggatagctc aacaagatac aatctcactc tgtggtggtc 1921 ctgctgacaa atcaagagca ttgcttttgt ttcttaagaa aacaaactct tttttaaaaa 1981 ttacttttaa atattaactc aaaagttgag attttggggt ggtggtgtgc caagacatta 2041 attttttttt taaacaatga agtgaaaaag ttttacaatc tctaggtttg gctagttctc 2101 ttaacactgg ttaaattaac attgcataaa cacttttcaa gtctgatcca tatttaataa 2161 tgctttaaaa taaaaataaa aacaatcctt ttgataaatt taaaatgtta cttattttaa 2221 aataaatgaa gtgagatggc atggtgaggt gaaagtatca ctggactagg ttgttggtga 2281 cttaggttct agataggtgt cttttaggac tctgattttg aggacatcac ttactatcca 2341 tttcttcatg ttaaaagaag tcatctcaaa ctcttagttt ttttttttta cactatgtga 2401 tttatattcc atttacataa ggatacactt atttgtcaag ctcagcacaa tctgtaaatt 2461 tttaacctat gttacaccat cttcagtgcc agtcttgggc aaaattgtgc aagaggtgaa 2521 gtttatattt gaatatccat tctcgtttta ggactcttct tccatattag tgtcatcttg 2581 cctccctacc ttccacatgc cccatgactt gatgcagttt taatacttgt aattccccta 2641 accataagat ttactgctgc tgtggatatc tccatgaagt tttcccactg agtcacatca 2701 gaaatgccct acatcttatt ttcctcaggg ctcaagagaa tctgacagat accataaagg 2761 gatttgacct aatcactaat tttcaggtgg tggctgatgc tttgaacatc tctttgctgc 2821 ccaatccatt agcgacagta ggatttttca accctggtat gaatagacag aaccctatcc 2881 agtggaagga gaatttaata aagatagtgc agaaagaatt ccttaggtaa tctataacta 2941 ggactactcc tggtaacagt aatacattcc attgttttag taaccagaaa tcttcatgca 3001 atgaaaaata ctttaattca tgaagcttac tttttttttt ttggtgtcag agtctcgctc 3061 ttgtcaccca ggctggaatg cagtggcgcc atctcagctc actgcaacct tccatcttcc 3121 caggttcaag cgattctcgt gcctcggcct cctgagtagc tgggattaca ggcgtgtgca 3181 ctacactcaa ctaatttttg tatttttagg agagacgggg tttcacctgt tggccaggct 3241 ggtctcgaac tcctgacctc aagtgattca cccaccttgg cctcataaac ctgttttgca 3301 gaactcattt attcagcaaa tatttattga gtgcctacca gatgccagtc accgcacaag 3361 gcactgggta tatggtatcc ccaaacaaga gacataatcc cggtccttag gtactgctag 3421 tgtggtctgt aatatcttac taaggccttt ggtatacgac ccagagataa cacgatgcgt 3481 attttagttt tgcaaagaag gggtttggtc tctgtgccag ctctataatt gttttgctac 3541 gattccactg aaactcttcg atcaagctac tttatgtaaa tcacttcatt gttttaaagg 3601 aataaacttg attatattgt ttttttattt ggcataactg tgattctttt aggacaatta 3661 ctgtacacat taaggtgtat gtcagatatt catattgacc caaatgtgta atattccagt 3721 tttctctgca taagtaatta aaatatactt aaaaattaat agttttatct gggtacaaat 3781 aaacagtgcc tgaactagtt cacagacaag ggaaacttct atgtaaaaat cactatgatt 3841 tctgaattgc tatgtgaaac tacagatctt tggaacactg tttaggtagg gtgttaagac 3901 ttgacacagt acctcgtttc tacacagaga aagaaatggc catacttcag gaactgcagt 3961 gcttatgagg ggatatttag gcctcttgaa tttttgatgt agatgggcat ttttttaagg 4021 tagtggttaa ttacctttat gtgaactttg aatggtttaa caaaagattt gtttttgtag 4081 agattttaaa gggggagaat tctagaaata aatgttacct aattattaca gccttaaaga 4141 caaaaatcct tgttgaagtt tttttaaaaa aagactaaat tacatagact taggcattaa 4201 catgtttgtg gaagaatata gcagacgtat attgtatcat ttgagtgaat gttcccaagt 4261 aggcattcta ggctctattt aactgagtca cactgcatag gaatttagaa cctaactttt 4321 ataggttatc aaaactgttg tcaccattgc acaattttgt cctaatatat acatagaaac 4381 tttgtggggc atgttaagtt acagtttgca caagttcatc tcatttgtat tccattgatt 4441 tttttttttc ttctaaacat tttttcttca aaacagtata tataactttt tttaggggat 4501 tttttttaga cagcaaaaaa ctatctgaag atttccattt gtcaaaaagt aatgatttct 4561 tgataattgt gtagtgaatg ttttttagaa cccagcagtt accttgaaag ctgaatttat 4621 atttagtaac ttctgtgtta atactggata gcatgaattc tgcattgaga aactgaatag 4681 ctgtcataaa atgctttctt tcctaaagaa agatactcac atgagttctt gaagaatagt 4741 cataactaga ttaagatctg tgttttagtt taatagtttg aagtgcctgt ttgggataat 4801 gataggtaat ttagatgaat ttaggggaaa aaaaagttat ctgcagttat gttgagggcc 4861 catctctccc cccacacccc cacagagcta actgggttac agtgttttat ccgaaagttt 4921 ccaattccac tgtcttgtgt tttcatgttg aaaatacttt tgcatttttc ctttgagtgc 4981 caatttctta ctagtactat ttcttaatgt aacatgttta cctggcctgt cttttaacta 5041 tttttgtata gtgtaaactg aaacatgcac attttgtaca ttgtgctttc ttttgtgggt 5101 catatgcagt gtgatccagt tgttttccat catttggttg cgctgaccta ggaatgttgg 5161 tcatatcaaa cattaaaaat gaccactctt ttaatgaaat taacttttaa atgtttatag 5221 gagtatgtgc tgtgaagtga tctaaaattt gtaatatttt tgtcatgaac tgtactactc 5281 ctaattattg taatgtaata aaaatagtta cagtgactat gagtgtgtat ttattcatgc 5341 aaatttgaac tgtttgcccc gaaatggata tggatacttt ataagccata gacactatag 5401 tataccagtg aatcttttat gcagcttgtt agaagtatcc ttttattttc taaaaggtgc 5461 tgtggatatt atgtaaaggc gtgtttgctt aaacaatttt ccatatttag aagtagatgc 5521 aaaacaaatc tgcctttatg acaaaaaaat aggataacat tatttattta tttcctttta 5581 tcaataaggt aattgataca caacaggtga cttggtttta ggcccaaagg tagcagcagc 5641 aacattaata atggaaataa ttgaatagtt agttatgtat gttaatgcca gtcaccagca 5701 ggctatttca aggtcagaag taatgactcc atacatatta tttatttcta taactacatt 5761 taaatcatta ccagg // LOCUS HUMKTRAN 2682 bp mRNA PRI 06-JAN-1995 DEFINITION Human protein-glutamine gamma-glutamyltransferase mRNA, complete cds. ACCESSION M55183 NID g186789 KEYWORDS keratinocyte protein; keratinocyte transglutaminase; protein-glutamine gamma-glutamyltransferase; transglutaminase. SOURCE Human epidermal cell, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2682) AUTHORS Phillips,M.A., Stewart,B.E., Qin,Q., Chakravarty,R., Floyd,E.E., Jetten,A.M. and Rice,R.H. TITLE Primary structure of keratinocyte transglutaminase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87 (23), 9333-9337 (1990) MEDLINE 91067700 FEATURES Location/Qualifiers source 1..2682 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="keratinocyte" /tissue_type="skin" gene 54..2507 /gene="protein-glutamine gamma-glutamyltransferase" CDS 54..2507 /gene="protein-glutamine gamma-glutamyltransferase" /EC_number="2.3.2.13" /codon_start=1 /product="protein-glutamine gamma-glutamyltransferase" /db_xref="PID:g186790" /translation="MMDGPRSDVGRWGGNPLQPPTTPSPEPEPEPDGRSRRGGGRSFW ARCCGCCSCRNAADDDWGPEPSDSRGRGSSSGTRRPGSRGSDSRRPVSRGSGVNAAGD GTIREGMLVVNGVDLLSSRSDQNRREHHTDEYEYDELIVRRGQPFHMLLLLSRTYESS DRITLELLIGNNPEVGKGTHVIIPVGKGGSGGWKAQVVKASGQNLNLRVHTSPNAIIG KFQFTVRTQSDAGEFQLPFDPRNEIYILFNPWCPEDIVYVDHEDWRQEYVLNESGRIY YGTEAQIGERTWNYGQFDHGVLDACLYILDRRGMPYGGRGDPVNVSRVISAMVNSLDD NGVLIGNWSGDYSRGTNPSAWVGSVEILLSYLRTGYSVPYGQCWVFAGVTTTVLRCLG LATRTVTNFNSAHDTDTSLTMDIYFDENMKPLEHLNHDSVWNFHVWNDCWMKRPDLPS GFDGWQVVDATPQETSSGIFCCGPCSVESIKNGLVYMKYDTPFIFAEVNSDKVYWQRQ DDGSFKIVYVEEKAIGTLIVTKAISSNMREDITYLYKHPEGSDAERKAVETAAAHGSK PNVYANRGSAEDVAMQVEAQDAVMGQDLMVSVMLINHSSSRRTVKLHLYLSVTFYTGV SGTIFKETKKEVELAPGASDRVTMPVAYKEYRPHLVDQGAMLLNVSGHVKESGQVLAK QHTFRLRTPDLSLTLLGAAVVGQECEVQIVFKNPLPVTLTNVVFRLEGSGLQRPKILN VGDIGGNETVTLRQSFVPVRPGPRQLIASLDSPQLSQVHGVIQVDVAPAPGDGGFFSD AGGDSHLGETIPMASRGGA" BASE COUNT 582 a 770 c 800 g 530 t ORIGIN 1 ttccatctca gccccaggac tcagtactgc ggttgccaac actgctgcca ggcatgatgg 61 atgggccacg ttccgatgtg ggccgttggg gtggcaaccc cttgcagccc cctaccacgc 121 catctccaga gccagagcca gagccagacg gacgctctcg cagaggagga ggccgttcct 181 tctgggctcg ctgctgtggc tgctgttcat gccgaaatgc ggcagatgac gactggggac 241 ctgaaccctc tgactccagg ggtcgagggt ccagctctgg cactcgaaga cctggctccc 301 ggggctcaga ctcccgccgg cctgtatccc ggggcagcgg tgtcaatgca gctggagatg 361 gcaccatccg agagggcatg ctagtagtga acggtgtgga cttgctgagc tcgcgctcgg 421 accagaaccg ccgagagcac cacacagacg agtatgagta cgacgagctg atagtgcgcc 481 gcgggcagcc tttccatatg ctcctcctcc tgtcccggac ctatgaatcc tctgatcgca 541 tcacccttga gttactcatc ggaaacaacc ccgaggtggg caagggcacg cacgtgatca 601 tcccagtggg caaggggggc agtggaggct ggaaagccca ggtggtcaag gccagtgggc 661 agaatctgaa cctgcgggtc cacacttccc ccaacgccat catcggcaag tttcagttca 721 cagtccgcac acaatcagac gctggggagt tccagttgcc ctttgacccc cgcaatgaga 781 tctacatcct cttcaacccc tggtgcccag aggacattgt gtacgtggac catgaggatt 841 ggcggcagga gtatgttctt aatgagtctg ggagaattta ctacgggacc gaagcacaga 901 ttggtgagcg gacctggaac tacggccagt ttgaccacgg ggtgctggat gcctgcttat 961 acatcctgga ccggcggggg atgccatatg gaggccgtgg agacccagtc aatgtctccc 1021 gggtcatctc tgccatggtg aactccctgg atgacaatgg agtcctgatt gggaactggt 1081 ctggtgatta ctcccgaggc accaacccat cagcgtgggt gggcagcgtg gagatcctgc 1141 ttagctacct acgcacggga tattccgtcc cctatggcca gtgctgggtc tttgctggag 1201 tgaccaccac agtgctgcgc tgcctgggtc tggccacccg tactgtcacc aacttcaact 1261 ccgcccacga cacagacaca tcccttacca tggacatcta cttcgacgag aacatgaagc 1321 ccctggagca cctgaaccat gattctgtct ggaacttcca tgtgtggaac gactgctgga 1381 tgaagaggcc ggatctgccc tcgggctttg atgggtggca ggtggtggat gccacacccc 1441 aagagactag cagtggcatc ttctgctgcg gcccctgctc tgtggagtcc atcaagaatg 1501 gcctggtcta catgaagtac gacacgcctt tcatttttgc tgaggtgaat agtgacaagg 1561 tgtactggca gcggcaggat gatggcagct tcaagattgt ttatgtggag gagaaggcca 1621 tcggcacact cattgtcaca aaggccatca gctccaacat gcgggaggac atcacctacc 1681 tctataagca cccagaaggc tcagacgcag agcggaaggc agtagagaca gcagcagccc 1741 acggcagcaa acccaatgtg tatgccaacc ggggctcagc ggaggatgtg gccatgcagg 1801 tggaggcaca ggacgcggtg atggggcagg atctgatggt ctctgtgatg ctgatcaatc 1861 acagcagcag ccgccgcaca gtgaaactgc acctctacct ctcagtcact ttctatactg 1921 gtgtcagtgg taccatcttc aaggagacca agaaggaagt ggagctggca ccaggggcct 1981 cggaccgtgt gaccatgcca gtggcctaca aggaataccg gccccatctt gtggaccagg 2041 gggccatgct gctcaatgtc tcaggccacg tcaaggagag cgggcaggtg ctggccaagc 2101 agcacacctt ccgtctgcgc accccagacc tctccctcac gttactggga gcagcagtgg 2161 ttggccagga gtgtgaagta cagattgtct tcaagaaccc ccttcccgtc accctcacca 2221 atgtcgtctt ccggctcgaa ggctctgggt tacagaggcc caagatcctc aacgttgggg 2281 acattggagg caatgaaaca gtgacactgc gccagtcgtt tgtgcctgtg cgaccaggcc 2341 cccgccagct cattgccagc ttggacagcc cacagctctc ccaggtgcac ggtgtcatcc 2401 aggtggatgt ggccccagcc cctggggatg ggggcttctt ctcagacgct ggaggtgaca 2461 gtcacttagg agagaccatc cctatggcat ctcgaggtgg agcttagccc tgtgccagga 2521 gcaatgggac tggagtcaga tgagcaagga cattgcccca agataggggc acactacaga 2581 gcagctcccc aggagctcag gtggggagtc cagggctccc ggagagggag tcagtcttca 2641 cttgcactgg gggaacagat gctaataaac tgttttttaa tg // LOCUS HUMKUANT 3052 bp mRNA PRI 06-JAN-1995 DEFINITION Human Ku autoimmune antigen gene, complete cds. ACCESSION J04977 NID g186791 KEYWORDS Ku antigen; nonhistone DNA binding protein; nuclear protein. SOURCE Human fetal liver, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3052) AUTHORS Yaneva,M., Wen,J., Ayala,A. and Cook,R. TITLE cDNA-derived amino acid sequence of the 86-kDa subunit of the Ku antigen JOURNAL J. Biol. Chem. 264 (23), 13407-13411 (1989) MEDLINE 89340410 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Yaneva, 02-JUN-1989. FEATURES Location/Qualifiers source 1..3052 /organism="Homo sapiens" /db_xref="taxon:9606" /map="22q11-q13" mRNA 1..3052 /note="Ku mRNA" gene 34..2232 /gene="G22P1" CDS 34..2232 /gene="G22P1" /note="Ku antigen" /codon_start=1 /db_xref="GDB:G00-119-963" /db_xref="PID:g307093" /translation="MVRSGNKAAVVLCMDVGFTMSNSIPGIESPFEQAKKVITMFVQR QVFAENKDEIALVLFGTDGTDNPLSGGDQYQNITVHRHLMLPDFDLLEDIESKIQPGS QQADFLDALIVSMDVIQHETIGKKFEKRHIEIFTDLSSRFSKSQLDIIIHSLKKCDIS LQFFLPFSLGKEDGSGDRGDGPFRLGGHGPSFPLKGITEQQKEGLEIVKMVMISLEGE DGLDEIYSFSESLRKLCVFKKIERHSIHWPCRLTIGSNLSIRIAAYKSILQERVKKTW TVVDAKTLKKEDIQKETVYCLNDDDETEVLKEDIIQGFRYGSDIVPFSKVDEEQMKYK SEGKCFSVLGFCKSSQVQRRFFMGNQVLKVFAARDDEAAAVALSSLIHALDDLDMVAI VRYAYDKRANPQVGVAFPHIKHNYECLVYVQLPFMEDLRQYMFSSLKNSKKYAPTEAQ LNAVDALIDSMSLAKKDEKTDTLEDLFPTTKIPNPRFQRLFQCLLHRALHPREPLPPI QQHIWNMLNPPAEVTTKSQIPLSKIKTLFPLIEAKKKDQVTAQEIFQDNHEDGPTAKK LKTEQGGAHFSVSSLAEGSVTSVGSVNPAENFRVLVKQKKASFEEASNQLINHIEQFL DTNETPYFMKSIDCIRAFREEAIKFSEEQRFNNFLKALQEKVEIKQLNHFWEIVVQDG ITLITKEEASGSSVTAEEAKKFLAPKDKPSGDTAAVFEEGGDVDDLLDMI" BASE COUNT 906 a 592 c 708 g 846 t ORIGIN 58 bp upstream of PvuII site. 1 ggcgggcgac caaagcgcct gaggaccggc aacatggtgc ggtcggggaa taaggcagct 61 gttgtgctgt gtatggacgt gggctttacc atgagtaact ccattcctgg tatagaatcc 121 ccatttgaac aagcaaagaa ggtgataacc atgtttgtac agcgacaggt gtttgctgag 181 aacaaggatg agattgcttt agtcctgttt ggtacagatg gcactgacaa tcccctttct 241 ggtggggatc agtatcagaa catcacagtg cacagacatc tgatgctacc agattttgat 301 ttgctggagg acattgaaag caaaatccaa ccaggttctc aacaggctga cttcctggat 361 gcactaatcg tgagcatgga tgtgattcaa catgaaacaa taggaaagaa gtttgagaag 421 aggcatattg aaatattcac tgacctcagc agccgattca gcaaaagtca gctggatatt 481 ataattcata gcttgaagaa atgtgacatc tccctgcaat tcttcttgcc tttctcactt 541 ggcaaggaag atggaagtgg ggacagagga gatggcccct ttcgcttagg tggccatggg 601 ccttcctttc cactaaaagg aattaccgaa cagcaaaaag aaggtcttga gatagtgaaa 661 atggtgatga tatctttaga aggtgaagat gggttggatg aaatttattc attcagtgag 721 agtctgagaa aactgtgcgt cttcaagaaa attgagaggc attccattca ctggccctgc 781 cgactgacca ttggctccaa tttgtctata aggattgcag cctataaatc gattctacag 841 gagagagtta aaaagacttg gacagttgtg gatgcaaaaa ccctaaaaaa agaagatata 901 caaaaagaaa cagtttattg cttaaatgat gatgatgaaa ctgaagtttt aaaagaggat 961 attattcaag ggttccgcta tggaagtgat atagttcctt tctctaaagt ggatgaggaa 1021 caaatgaaat ataaatcgga ggggaagtgc ttctctgttt tgggattttg taaatcttct 1081 caggttcaga gaagattctt catgggaaat caagttctaa aggtctttgc agcaagagat 1141 gatgaggcag ctgcagttgc actttcctcc ctgattcatg ctttggatga cttagacatg 1201 gtggccatag ttcgatatgc ttatgacaaa agagctaatc ctcaagtcgg cgtggctttt 1261 cctcatatca agcataacta tgagtgttta gtgtatgtgc agctgccttt catggaagac 1321 ttgcggcaat acatgttttc atccttgaaa aacagtaaga aatatgctcc caccgaggca 1381 cagttgaatg ctgttgatgc tttgattgac tccatgagct tggcaaagaa agatgagaag 1441 acagacaccc ttgaagactt gtttccaacc accaaaatcc caaatcctcg atttcagaga 1501 ttatttcagt gtctgctgca cagagcttta catccccggg agcctctacc cccaattcag 1561 cagcatattt ggaatatgct gaatcctccc gctgaggtga caacgaaaag tcagattcct 1621 ctctctaaaa taaagaccct ttttcctctg attgaagcca agaaaaagga tcaagtgact 1681 gctcaggaaa ttttccaaga caaccatgaa gatggaccta cagctaaaaa attaaagact 1741 gagcaagggg gagcccactt cagcgtctcc agtctggctg aaggcagtgt cacctctgtt 1801 ggaagtgtga atcctgctga aaacttccgt gttctagtga aacagaagaa ggccagcttt 1861 gaggaagcga gtaaccagct cataaatcac atcgaacagt ttttggatac taatgaaaca 1921 ccgtatttta tgaagagcat agactgcatc cgagccttcc gggaagaagc cattaagttt 1981 tcagaagagc agcgctttaa caacttcctg aaagcccttc aagagaaagt ggaaattaaa 2041 caattaaatc atttctggga aattgttgtc caggatggaa ttactctgat caccaaagag 2101 gaagcctctg gaagttctgt cacagctgag gaagccaaaa agtttctggc ccccaaagac 2161 aaaccaagtg gagacacagc agctgtattt gaagaaggtg gtgatgtgga cgatttattg 2221 gacatgatat aggtcgtgga tgtatgggga atctaagaga gctgccatcg ctgtgatgct 2281 gggagttcta acaaaacaag ttggatgcgg ccattcaagg ggagccaaat tctcaagaaa 2341 ttcccagcag gttacctgga ggcggatcat ctaattctct gtggaatgaa tacacacata 2401 tatattacaa gggataattt agaccccata caagtttata aagagtcatt gttattttct 2461 ggttggtgta ttattttttc tgtggtctta ctgatctttg tatattacat acatgctttg 2521 aagtttctgg aaagtagatc ttttcttgac ctagtatatc agtgacagtt gcagcccttg 2581 tgatgtgatt agtgtctcat gtggaaccat ggcatggtta ttgatgagtt tcttaaccct 2641 ttccagagtc ctcctttgcc tgatcctcca acagctgtca cagcttgtgt tgagcaagca 2701 gtagcatttg cttcctccca acaagcagct gggttaggaa aaccatgggt aaggacggac 2761 tcacttctct ttttagttga ggccttctag ttaccacatt actctgcctc tgtatatagg 2821 tggttttctt taagtggggt gggaagggga gcacaatttc ccttcatact ccttttaagc 2881 agtgagttat ggtggtggtc tcatgaagaa aagacctttt ggcccaatct ctgccatatc 2941 agtgaacctt tagaaactca aaaactgaga aatttactac agtagttaga attatatcac 3001 ttcactgttc tctacttgca agcctcaaag agagaaagtt tcgttatatt gg // LOCUS HUMKV21CH 3777 bp mRNA PRI 16-NOV-1992 DEFINITION Homo sapiens potassium channel Kv2.1 mRNA, complete cds. ACCESSION L02840 NID g186797 KEYWORDS potassium channel. SOURCE Homo sapiens male adult cerebral cortex cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3777) AUTHORS Ikeda,S.R., Soler,F., Zuhlke,R.D, Joho,R.H and Lewis,D.L. TITLE Heterologous expression of the human potassium channel Kv2.1 in clonal mammalian cells by direct cytoplasmic microinjection of cRNA JOURNAL Eur. J. Physiol. (1992) In press FEATURES Location/Qualifiers source 1..3777 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /sex="male" /tissue_type="cerebral cortex" 5'UTR 1..198 mRNA 1..3680 CDS 199..2763 /standard_name="human Kv2.1 (DRK1)" /codon_start=1 /function="delayed rectifier" /evidence=experimental /product="voltage-gated potassium channel" /db_xref="PID:g186798" /translation="MTKHGSRSTSSLPPEPMEIVRSKACSRRVRLNVGGLAHEVLWRT LDRLPRTRLGKLRDCNTHDSLLEVCDDYSLDDNEYFFDRHPGAFTSILNFYRTGRLHM MEEMCALSFSQELDYWGIDEIYLESCCQARYHQKKEQMNEELKREAETLREREGEEFD NTCCAEKRKKLWDLLEKPNSSVAAKILAIISIMFIVLSTIALSLNTLPELQSLDEFGQ STDNPQLAHVEAVCIAWFTMEYLLRFLSSPKKWKFFKGPLNAIDLLAILPYYVTIFLT ESNKSVLQFQNVRRVVQIFRIMRILRILKLARHSTGLQSLGFTLRRSYNELGLLILFL AMGIMIFSSLVFFAEKDEDDTKFKSIPASFWWATITMTTVGYGDIYPKTLLGKIVGGL CCIAGVLVIALPIPIIVNNFSEFYKEQKRQEKAIKRREALERAKRNGSIVSMNMKDAF ARSIEMMDIVVEKNGENMGKKDKVQDNHLSPNKWKWTKRTLSETSSSKSFETKEQGSP EKARSSSSPQHLNVQQLEDMYNKMAKTQSQPILNTKESAAQSKPKEELEMESIPSPVA PLPTRTEGVIDMRSMSSIDSFISCATDFPEATRFSHSPLTSLPSKTGGSTAPEVGWRG ALGASGGRFVEANPSPDASQHSSFFIESPKSSMKTNNPLKLRALKVNFMEGDPSPLLP VLGMYHDPLRNRGSAAAAVAGLECATLLDKAVLSPESSIYTTASAKTPPRSPEKHTAI AFNFEAGVHQYIDADTDDEGQLLYSVDSSPPKSLPGSTSPKFSTGTRSEKNHFESSPL PTSPKFLRQNCIYSTEALTGKGPSGQEKCKLENHISPDVRVLPGGGAHGSTRDQSI" 3'UTR 2761..3777 polyA_site 3777 BASE COUNT 956 a 1063 c 972 g 786 t ORIGIN 1 cctgcccagg agcgccgccc tggggcagtc gggatggagg tggagaacag gccgtgacgc 61 gcgcgggggc ccccctgcac ccccagcagc ccacgacgct ccctgccccc ctcccgcagc 121 agcgggcctt gccgtcgagt gacagcggcc tggggggcag ggggggcggg ggcggccgga 181 tcagcgatgc cggcgggcat gacgaagcat ggctcccgct ccaccagctc gctgccgccc 241 gagcccatgg agatcgtgcg cagcaaggcg tgctctcggc gggtccgcct caacgtcggg 301 gggctggcgc acgaggtact ctggcgtacc ctggaccgcc tgccccgcac gcggctgggc 361 aagctccgcg actgcaacac gcacgactcg ctgctcgagg tgtgcgatga ctacagcctc 421 gacgacaacg agtacttctt tgaccgccac ccgggcgcct tcacctccat cctcaacttc 481 taccgcactg ggcgactgca catgatggag gagatgtgcg cgctcagctt cagccaagag 541 ctcgactact ggggcatcga cgagatctac ctggagtcct gctgccaggc ccgctaccac 601 cagaagaaag agcagatgaa cgaggagctc aagcgtgagg ccgagactct acgggagcgg 661 gaaggcgagg agttcgataa cacgtgctgc gcagagaaga ggaaaaaact ctgggaccta 721 ctggagaagc ccaattcctc tgtggctgcc aagatccttg ccataatttc catcatgttc 781 atcgtcctct ccaccattgc cctgtccctc aacacgctgc ctgagctaca gagcctcgat 841 gagttcggcc agtccacaga caacccccag ctggcccacg tggaggccgt gtgcatcgca 901 tggttcacca tggagtacct gctgaggttc ctctcctcgc ccaagaagtg gaagttcttc 961 aagggcccac tcaatgccat tgacttgttg gccattctgc catactatgt caccattttc 1021 ctcaccgaat ccaacaagag cgtgctgcaa ttccagaatg tccgccgcgt ggtccagatc 1081 ttccgcatca tgcgaattct ccgcatcctt aagcttgcac gccactccac tggcctccag 1141 tctctgggct tcactttgcg gaggagctac aatgagttgg gcttgctcat cctcttcctt 1201 gccatgggca ttatgatctt ctccagcctt gtcttctttg ctgagaagga tgaggacgac 1261 accaagttca aaagcatccc agcctctttc tggtgggcca ccatcaccat gactactgtt 1321 gggtatggag acatctaccc caagactctc ctggggaaaa ttgttggggg actctgctgc 1381 attgcaggag tcctggtgat tgctcttccc atccccatca tcgtcaataa cttctctgag 1441 ttctataagg agcagaagag acaggagaaa gcaatcaaac ggcgagaggc tctggagaga 1501 gccaagagga atggcagcat cgtatccatg aacatgaagg atgcttttgc ccggagcatt 1561 gagatgatgg acattgtggt tgagaaaaat ggggagaata tgggtaagaa agacaaagta 1621 caagataacc acttgtctcc taacaaatgg aaatggacaa agaggacact gtctgaaacc 1681 agctcaagta agtcctttga aaccaaggaa cagggatccc ctgaaaaagc cagatcgtct 1741 tctagtcctc agcacctgaa cgttcagcag ttggaagaca tgtacaataa gatggccaag 1801 acccaatccc aacccatcct caataccaag gagtcagcag cacagagcaa accaaaggaa 1861 gaacttgaaa tggagagtat ccccagcccc gtagcccctc tgcccactcg cacagaaggg 1921 gtcattgaca tgcgaagtat gtcaagcatt gatagtttca ttagctgtgc cacagacttc 1981 cctgaggcca ccagattctc ccacagccct ttgacatcac tccccagcaa gactgggggc 2041 agcacagccc cagaagtggg ctggcgggga gctctgggtg ccagtggtgg taggtttgtg 2101 gaggccaacc ccagccctga tgccagccag cactctagtt tcttcatcga gagccccaag 2161 agttccatga aaactaacaa ccctttgaag ctccgagcac ttaaagtcaa cttcatggag 2221 ggtgacccca gtccactcct ccccgttcta gggatgtacc atgaccctct caggaaccgg 2281 gggagtgctg cggctgctgt cgctggactg gagtgtgcca cgcttttgga caaggctgtg 2341 ctgagcccag agtcctccat ctacaccaca gcaagtgcta agacaccccc ccggtctcct 2401 gagaaacaca cagcaatagc gttcaacttt gaggcgggtg tccaccagta cattgacgca 2461 gacacagatg atgagggaca gctgctctac agtgtggact ccagcccccc caaaagcctc 2521 cctgggagca ccagtccgaa gttcagcacg gggacaagat cggagaaaaa ccactttgaa 2581 agctcccctt tacccacctc ccctaagttc ttaaggcaga actgtattta ctccacagaa 2641 gcattgactg gaaaaggccc cagtggtcag gaaaagtgca aacttgagaa ccacatctcc 2701 cctgacgtcc gtgtgttgcc agggggagga gcccatggaa gcacacgaga tcagagcatc 2761 tgaactgccc tgccttggag gagagacttt tgggtgaggt ccaaagagga gagctgttca 2821 gcttacctgc cacagagctt ttctgcatga actctggaac agaaaggccc tgtaaagccc 2881 tcagagagaa gagagactcc agagaaggct ccctaagacc ttgagagcca tgacaggtcc 2941 atcagcatga agttggccaa gccatagggc acagcacctc cttgtaacaa ctctatagcc 3001 ctctttggga gatgacatga gtggaactca cagccaccac taccaccact ttagacagga 3061 ccgaggccac atactcccca ttctctcgtg gctttccatc tcagcctcgg agggcaacat 3121 tgacagtcct cctggcttca gctagagaag gatgctggaa caagcggctg gtgttgaaag 3181 agtgggttga ccaatttggt attgaatgtt gcccagccac ccctaggaac acctgtccat 3241 cacctcctgg atggattcca ctgttagaca gctacaggga atgattggtc atgggaagtc 3301 tctgcgccat aagccacgat cccagcgcaa aacccttact caaatgtctt cattgacttc 3361 ggtatttcat agtacccgag attttatttt gagataccat cagggtgagt tgcaccactt 3421 gtactcaatt ctaattgccc cctggcaatc tgggaagggt tcagaaggtg ggcacccagc 3481 caacagcatg aactcagagc attgttttag ggttggagga ggaacacgct ttctttacat 3541 cactagtgta gactcaaaag atatgcaagt gtcaaatatg caaaagaaat agtttattca 3601 aagagactgt gtgttactga agaacagcat aaaaatatga tttttttact tgcaaaaatg 3661 aaaggaaaaa aataccacgc attgaaatgc ccagttcaga ctgaataatt cctgctgcag 3721 caaggaaagt acctactata atagaaattc tgttttgttt tctgtggttt tcaagtt // LOCUS HUML12A 612 bp mRNA PRI 02-MAR-1993 DEFINITION Human ribosomal protein L12 mRNA, complete cds. ACCESSION L06505 NID g186799 KEYWORDS ribosomal protein L12. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 612) AUTHORS Chu,W., Presky,D.H., Swerlisk,R.A. and Burns,D.K. TITLE The primary structure of human ribosomal protein L12 JOURNAL Nucleic Acids Res. 21, 749-749 (1993) MEDLINE 93181279 FEATURES Location/Qualifiers source 1..612 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="dermal vascular endothelial" mRNA 1..612 CDS 69..566 /note="putative" /codon_start=1 /product="ribosomal protein L12" /db_xref="PID:g186800" /translation="MPPKFDPNEIKVVYLRCTGGEVGATSALAPKIGPLGLSPKKVGD DIAKATGDWKGLRITVKLTIQNRQAQIEVVPSASALIIKALKEPPRDRKKQKNIKHSG NITFDEIVNIARQMRHRSLARELSGTIKEILGTAQSVGCNVDGRHPHDIIDDINSGAV ECPAS" polyA_signal 586..591 BASE COUNT 173 a 163 c 154 g 122 t ORIGIN 1 ggaggccaag gtgcaacttc cttcggtcgt cccgaatccg ggttcatccg acaccagccg 61 cctccaccat gccgccgaag ttcgacccca acgagatcaa agtcgtatac ctgaggtgca 121 ccggaggtga agtcggtgcc acttctgccc tggcccccaa gatcggcccc ctgggtctgt 181 ctccaaaaaa agttggtgat gacattgcca aggcaacggg tgactggaag ggcctgagga 241 ttacagtgaa actgaccatt cagaacagac aggcccagat tgaggtggtg ccttctgcct 301 ctgccctgat catcaaagcc ctcaaggaac caccaagaga cagaaagaaa cagaaaaaca 361 ttaaacacag tgggaatatc acttttgatg agattgtcaa cattgctcga cagatgcggc 421 accgatcctt agccagagaa ctctctggaa ccattaaaga gatcctgggg actgcccagt 481 cagtgggctg taatgttgat ggccgccatc ctcatgacat catcgatgac atcaacagtg 541 gtgctgtgga atgcccagcc agttaagcac aaaggaaaac atttcaataa aggatcattt 601 gacaactggt ga // LOCUS HUML6A 1188 bp mRNA PRI 21-APR-1992 DEFINITION Human tumor antigen (L6) mRNA, complete cds. ACCESSION M90657 NID g186803 KEYWORDS tumor antigen. SOURCE Homo sapiens colon cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1188) AUTHORS Marken,J.S., Schieven,G.L., Hellstroem,I., Hellstroem,K.E. and Aruffo,A. TITLE Cloning and expression of the tumor-associated antigen L6 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89, 3503-3507 (1992) MEDLINE 92228814 FEATURES Location/Qualifiers source 1..1188 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="H3347" /tissue_type="colon" gene 109..717 /gene="L6" CDS 109..717 /gene="L6" /codon_start=1 /db_xref="PID:g186804" /translation="MCYGKCARCIGHSLVGLALLCIAANILLYFPNGETKYASENHLS RFVWFFSGIVGGGLLMLLPAFVFIGLEQDDCCGCCGHENCGKRCAMLSSVLAALIGIA GSGYCVIVAALGLAEGPLCLDSLGQWNYTFASTEGQYLLDTSTWSECTEPKHIVEWNV SLFSILLALGGIEFILCLIQVINGVLGGICGFCCSHQQQYDC" BASE COUNT 293 a 257 c 260 g 378 t ORIGIN 1 tcgagatcca ttgtgctcta aaggctcgcc ctcctgtgca tcgcggctaa tttggggtat 61 cactgagctg aagacaaaga gaagggggag aaaacctagc agaccaccat gtgctatggg 121 aagtgtgcac gatgcatcgg acattctctg gtggggctcg ccctcctgtg catcgcggct 181 aatattttgc tttactttcc caatggggaa acaaagtatg cctccgaaaa ccacctcagc 241 cgcttcgtgt ggttcttttc tggcatcgta ggaggtggcc tgctgatgct cctgccagca 301 tttgtcttca ttgggctgga acaggatgac tgctgtggct gctgtggcca tgaaaactgt 361 ggcaaacgat gtgcgatgct ttcttctgta ttggctgctc tcattggaat tgcaggatct 421 ggctactgtg tcattgtggc agcccttggc ttagcagaag gaccactatg tcttgattcc 481 ctcggccagt ggaactacac ctttgccagc accgagggcc agtaccttct ggatacctcc 541 acatggtccg agtgcactga acccaagcac attgtggaat ggaatgtatc tctgttttct 601 atcctcttgg ctcttggtgg aattgaattc atcttgtgtc ttattcaagt aataaatgga 661 gtgcttggag gcatatgtgg cttttgctgc tctcaccaac agcaatatga ctgctaaaag 721 aaccaaccca ggacagagcc acaatcttcc tctatttcat tgtaatttat atatttcact 781 tgtattcatt tgtaaaactt tgtattagtg taacatactc cccacagtct acttttacaa 841 acgcctgtaa agactggcat cttcacagga tgtcagtgtt taaatttagt aaacttcttt 901 tttgtttgtt tatttgtgta acatactccc cacagtctac ttttacaaac gcctgtaaag 961 actggcatct tcacaggatg tcagtgttta aatttagtaa acttcttttt tgtttgttta 1021 tttgtttttg ttttttttta aggaatgagg aaacaaacca ccctctgggg gtagtttaca 1081 gactgagtga cagtactcag tatatctgag ataaactcta taatgttttg gataaaaata 1141 acattccatg gcacatatat acaatagtga ttggctttag agcacaat // LOCUS HUMLAM101 5613 bp mRNA PRI 06-JAN-1995 DEFINITION Human laminin B1 chain mRNA, complete cds. ACCESSION M61916 J02778 NID g186836 KEYWORDS laminin; membrane glycoprotein. SOURCE Homo sapiens RNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5613) AUTHORS Pikkarainen,T., Eddy,R., Fukushima,Y., Byers,M., Shows,T., Pihlajaniemi,T., Saraste,M. and Tryggvason,K. TITLE Human laminin B1 chain. A multidomain protein with gene (LAMB1) locus in the q22 region of chromosome 7 JOURNAL J. Biol. Chem. 262 (22), 10454-10462 (1987) MEDLINE 87280097 COMMENT SWISS-PROT; P07942; LMB1$HUMAN. Computer-readable copy of sequence in [1] kindly provided by K.Tryggvason, 03-JUN-1987. The location of this gene was mapped to chromosome 7, band q22. Three potential polyadenylation signals (positions 5487-5592, 5555-5560, and 5604-5609), but no poly-A tail, were found. From EMBL entry HSLAM101; dated 16-FEB-1991. FEATURES Location/Qualifiers source 1..5613 /organism="Homo sapiens" /db_xref="taxon:9606" /map="7q22-q31" sig_peptide 118..180 /gene="LAMB1" /note="laminin B1 chain signal peptide" CDS 118..5478 /gene="LAMB1" /note="lamini/" /codon_start=1 /db_xref="GDB:G00-119-357" /product="laminin B1" /db_xref="PID:g186837" /translation="MGLLQLLAFSFLALCRARVRAQEPEFSYGCAEGSCYPATGDLLI GRAQKLSVTSTCGLHKPEPYCIVSHLQEDKKCFICNSQDPYHETLNPDSHLIENVVTT FAPNRLKIWWQSENGVENVTIQLDLEAEFHFTHLIMTFKTFRPAAMLIERSSDFGKTW GVYRYFAYDCEASFPGISTGPMKKVDDIICDSRYSDIEPSTEGEVIFRALDPAFKIED PYSPRIQNLLKITNLRIKFVKLHTLGDNLLDSRMEIREKYYYAVYDMVVRGNCFCYGH ASECAPVDGFNEEVEGMVHGHCMCRHNTKGLNCELCMDFYHDLPWRPAEGRNSNACKK CNCNEHSISCHFDMAVYLATGNVSGGVCDDCQHNTMGRNCEQCKPFYYQHPERDIRDP NFCERCTCDPAGSQNEGICDSYTDFSTGLIAGQCRCKLNVEGEHCDVCKEGFYDLSSE DPFGCKSCACNPLGTIPGGNPCDSETGHCYCKRLVTGQHCDQCLPEHWGLSNDLDGCR PCDCDLGGALNNSCFAESGQCSCRPHMIGRQCNEVEPGYYFATLDHYLYEAEEANLGP GVSIVERQYIQDRIPSWTGAGFVRVPEGAYLEFFIDNIPYSMEYDILIRYEPQLPDHW EKAVITVQRPGRIPTSSRCGNTIPDDDNQVVSLSPGSRYVVLPRPVCFEKGTNYTVRL ELPQYTSSDSDVESPYTLIDSLVLMPYCKSLDIFTVGGSGDGVVTNSAWETFQRYRCL ENSRSVVKTPMTDVCRNIIFSISALLHQTGLACECDPQGSLSSVCDPNGGQCQCRPNV VGRTCNRCAPGTFGFGPSGCKPCECHLQGSVNAFCNPVTGQCHCFQGVYARQCDRCLP GHWGFPSCQPCQCNGHADDCDPVTGECLNCQDYTMGHNCERCLAGYYGDPIIGSGDHC RPCPCPDGPDSGRQFARSCYQDPVTLQLACVCDPGYIGSRCDDCASGYFGNPSEVGGS CQPCQCHNNIDTTDPEACDKETGRCLKCLYHTEGEHCQFCRFGYYGDALRQDCRKCVC NYLGTVQEHCNGSDCQCDKATGQCLCLPNVIGQNCDRCAPNTWQLASGTGCDPCNCNA AHSFGPSCNEFTGQCQCMPGFGGRTCSECQELFWGDPDVECRACDCDPRGIETPQCDQ STGQCVCVEGVEGPRCDKCTRGYSGVFPDCTPCHQCFALWDVIIAELTNRTHRFLEKA KALKISGVIGPYRETVDSVERKVSEIKDILAQSPAAEPLKNIGNLFEEAEKLIKDVTE MMAQVEVKLSDTTSQSNSTAKELDSLQTEAESLDNTVKELAEQLEFIKNSDIRGALDS ITKYFQMSLEAEERVNASTTEPNSTVEQSALMRDRVEDVMMERESQFKEKQEEQARLL DELAGKLQSLDLSAAAEMTCGTPPGASCSETECGGPNCRTDEGERKCGGPGCGGLVTV AHNAWQKAMDLDQDVLSALAEVEQLSKMVSEAKLRADEAKQSAEDILLKTNATKEKMD KSNEELRNLIKQIRNFLTQDSADLDSIEAVANEVLKMEMPSTPQQLQNLTEDIRERVE SLSQVEVILQHSAADIARAEMLLEEAKRASKSATDVKVTADMVKEALEEAEKAQVAAE KAIKQADEDIQGTQNLLTSIESETAASEETLFNASQRISELERNVEELKRKAAQNSGE AEYIEKVVYTVKQSAEDVKKTLDGELDEKYKKVENLIAKKTEESADARRKAEMLQNEA KTLLAQANSKLQLLKDLERKYEDNQRYLEDKAQELARLEGEVRSLLKDISQKVAVYST CL" gene 118..5478 /gene="LAMB1" mat_peptide 181..5475 /gene="LAMB1" /note="laminin B1 chain" BASE COUNT 1556 a 1288 c 1488 g 1281 t ORIGIN 1 cccggagcag ggcgagagct cgcgtcgccg gaaaggaaga cgggaagaaa gggcaggcgg 61 ctcggcgggc gtcttctcca ctcctctgcc gcgtccccgt ggctgcaggg agccggcatg 121 gggcttctcc agttgctagc tttcagtttc ttagccctgt gcagagcccg agtgcgcgct 181 caggaacccg agttcagcta cggctgcgca gaaggcagct gctatcccgc cacgggcgac 241 cttctcatcg gccgagcaca gaagctttcg gtgacctcga cgtgcgggct gcacaagccc 301 gaaccctact gtatcgtcag ccacttgcag gaggacaaaa aatgcttcat atgcaattcc 361 caagatcctt atcatgagac cctgaatcct gacagccatc tcattgaaaa tgtggtcact 421 acatttgctc caaaccgcct taagatttgg tggcaatctg aaaatggtgt ggaaaatgta 481 actatccaac tggatttgga agcagaattc cattttactc atctcataat gactttcaag 541 acattccgtc cagctgctat gctgatagaa cgatcgtccg actttgggaa aacctggggt 601 gtgtatagat acttcgccta tgactgtgag gcctcgtttc caggcatttc aactggcccc 661 atgaaaaaag tcgatgacat aatttgtgat tctcgatatt ctgacattga accctcaact 721 gaaggagagg tgatatttcg tgctttagat cctgctttca aaatagaaga tccttatagc 781 ccaaggatac agaatttatt aaaaattacc aacttgagaa tcaagtttgt gaaactgcat 841 actttgggag ataaccttct ggattccagg atggaaatca gagaaaagta ttattatgca 901 gtttatgata tggtggttcg aggaaattgc ttctgctatg gtcatgccag cgaatgtgcc 961 cctgtggatg gattcaatga agaagtggaa ggaatggttc acggacactg catgtgcagg 1021 cataacacca agggcttaaa ctgtgaactc tgcatggatt tctaccatga tttaccttgg 1081 agacctgctg aaggccgaaa cagcaacgcc tgtaaaaaat gtaactgcaa tgaacattcc 1141 atctcttgtc actttgacat ggctgtttac ctggccacgg ggaacgtcag cggaggcgtg 1201 tgtgatgact gtcagcacaa caccatgggg cgcaactgtg agcagtgcaa gccgttttac 1261 taccagcacc cagagaggga catccgagat cctaatttct gtgaacgatg tacgtgtgac 1321 ccagctggct ctcaaaatga gggaatttgt gacagctata ctgatttttc tactggtctc 1381 attgctggcc agtgtcggtg taaattaaat gtggaaggag aacattgtga tgtttgcaaa 1441 gaaggcttct atgatttaag cagtgaagat ccatttggtt gtaaatcttg tgcttgcaat 1501 cctctgggaa caattcctgg agggaatcct tgtgattccg agacaggtca ctgctactgc 1561 aagcgtctgg tgacaggaca gcattgtgac cagtgcctgc cagagcactg gggcttaagc 1621 aatgatttgg atggatgtcg accatgtgac tgtgaccttg ggggagcctt aaacaacagt 1681 tgctttgcgg agtcaggcca gtgctcatgc cggcctcaca tgattggacg tcagtgcaac 1741 gaagtggaac ctggttacta ctttgccacc ctggatcact acctctatga agcggaggaa 1801 gccaacttgg ggcctggggt tagcatagtg gagcggcaat atatccagga ccggattccc 1861 tcctggactg gagccggctt cgtccgagtg cctgaagggg cttatttgga gtttttcatt 1921 gacaacatac catattccat ggagtacgac atcctaattc gctacgagcc acagctaccc 1981 gaccactggg aaaaagctgt catcacagtg cagcgacctg gaaggattcc aaccagcagc 2041 cgatgtggta ataccatccc cgatgatgac aaccaggtgg tgtcattatc accaggctca 2101 agatatgtcg tccttcctcg gccggtgtgc tttgagaagg gaacaaacta cacggtgagg 2161 ttggagctgc ctcagtacac ctcctctgat agcgacgtgg agagccccta cacgctgatc 2221 gattctcttg ttctcatgcc atactgtaaa tcactggaca tcttcaccgt gggaggttca 2281 ggagatgggg tggtcaccaa cagtgcctgg gaaacctttc agagataccg atgtctagag 2341 aacagcagaa gcgttgtgaa aacaccgatg acagatgttt gcagaaacat catctttagc 2401 atttctgccc tgttacacca gacaggcctg gcttgtgaat gcgaccctca gggttcgtta 2461 agttccgtgt gtgatcccaa cggaggccag tgccagtgcc ggcccaacgt ggttggaaga 2521 acctgcaaca gatgtgcacc tggaactttt ggctttggcc ccagtggatg caaaccttgt 2581 gagtgccatc tgcaaggatc tgtcaatgcc ttctgcaatc ccgtcactgg ccagtgccac 2641 tgtttccagg gagtgtatgc tcggcagtgt gatcggtgct tacctgggca ctggggcttt 2701 ccaagttgcc agccctgcca gtgcaatggc cacgccgatg actgcgaccc agtgactggg 2761 gagtgcttga actgccagga ctacaccatg ggtcataact gtgaaaggtg cttggctggt 2821 tactatggcg accccatcat tgggtcaggt gatcactgcc gcccttgccc ttgcccagat 2881 ggtcccgaca gtggacgcca gtttgccagg agctgctacc aagatcctgt tactttacag 2941 cttgcctgtg tttgtgatcc tggatacatt ggttccagat gtgacgactg tgcctcagga 3001 tactttggca atccatcaga agttgggggg tcgtgtcagc cttgccagtg tcacaacaac 3061 attgacacga cagacccaga agcctgtgac aaggagactg ggaggtgtct caagtgcctg 3121 taccacacgg aaggggaaca ctgtcagttc tgccggtttg gatactatgg tgatgccctc 3181 cggcaggact gtcgaaagtg tgtctgtaat tacctgggca ccgtgcaaga gcactgtaac 3241 ggctctgact gccagtgcga caaagccact ggtcagtgct tgtgtcttcc taatgtgatc 3301 gggcagaact gtgaccgctg tgcgcccaat acctggcagc tggccagtgg cactggctgt 3361 gacccatgca actgcaatgc tgctcattcc ttcgggccat cttgcaatga gttcacgggg 3421 cagtgccagt gcatgcctgg gtttggaggc cgcacctgca gcgagtgcca ggaactcttc 3481 tggggagacc ccgacgtgga gtgccgagcc tgtgactgtg accccagggg cattgagacg 3541 ccacagtgtg accagtccac gggccagtgt gtctgcgttg agggtgttga gggtccacgc 3601 tgtgacaagt gcacgcgagg gtactcgggg gtcttccctg actgcacacc ctgccaccag 3661 tgctttgctc tctgggatgt gatcattgcc gagctgacca acaggacaca cagattcctg 3721 gagaaagcca aggccttgaa gatcagtggt gtgatcgggc cttaccgtga gactgtggac 3781 tcggtggaga ggaaagtcag cgagataaaa gacatcctgg cgcagagccc cgcagcagag 3841 ccactgaaaa acattgggaa tctctttgag gaagcagaga aactgattaa agatgttaca 3901 gaaatgatgg ctcaagtaga agtgaaatta tctgacacaa cttcccaaag caacagcaca 3961 gccaaagaac tggattctct acagacagaa gccgaaagcc tagacaacac tgtgaaagaa 4021 cttgctgaac aactggaatt tatcaaaaac tcagatattc ggggtgcctt ggatagcatt 4081 accaagtatt tccagatgtc tcttgaggca gaggagaggg tgaatgcctc caccacagaa 4141 cccaacagca ctgtggagca gtcagccctc atgagagaca gagtagaaga cgtgatgatg 4201 gagcgagaat cccagttcaa ggaaaaacaa gaggagcagg ctcgcctcct tgatgaactg 4261 gcaggcaagc tacaaagcct agacctttca gccgctgccg aaatgacctg tggaacaccc 4321 ccaggggcct cctgttccga gactgaatgt ggcgggccaa actgcagaac tgacgaagga 4381 gagaggaagt gtggggggcc tggctgtggt ggtctggtta ctgttgcaca caacgcctgg 4441 cagaaagcca tggacttgga ccaagatgtc ctgagtgccc tggctgaagt ggaacagctc 4501 tccaagatgg tctctgaagc aaaactgagg gcagatgagg caaaacaaag tgctgaagac 4561 attctgttga agacaaatgc taccaaagaa aaaatggaca agagcaatga ggagctgaga 4621 aatctaatca agcaaatcag aaactttttg acccaggata gtgctgattt ggacagcatt 4681 gaagcagttg ctaatgaagt attgaaaatg gagatgccta gcaccccaca gcagttacag 4741 aacttgacag aagatatacg tgaacgagtt gaaagccttt ctcaagtaga ggttattctt 4801 cagcatagtg ctgctgacat tgccagagct gagatgttgt tagaagaagc taaaagagca 4861 agcaaaagtg caacagatgt taaagtcact gcagatatgg taaaggaagc tctggaagaa 4921 gcagaaaagg cccaggtcgc agcagagaag gcaattaaac aagcagatga agacattcaa 4981 ggaacccaga acctgttaac ttcgattgag tctgaaacag cagcttctga ggaaaccttg 5041 ttcaacgcgt cccagcgcat cagcgagtta gagaggaatg tggaagaact taagcggaaa 5101 gctgcccaaa actccgggga ggcagaatat attgaaaaag tagtatatac tgtgaagcaa 5161 agtgcagaag atgttaagaa gactttagat ggtgaacttg atgaaaagta taaaaaagta 5221 gaaaatttaa ttgccaaaaa aactgaagag tcagctgatg ccagaaggaa agccgaaatg 5281 ctacaaaatg aagcaaaaac tcttttagct caagcaaata gcaagctgca actgctcaaa 5341 gatttagaaa gaaaatatga agacaatcaa agatacttag aagataaagc tcaagaatta 5401 gcaagactgg aaggagaagt ccgttcactc ctaaaggata taagccagaa agttgctgtg 5461 tatagcacat gcttgtaaca gaggagaata aaaaatggct gaggtgaaca aggtaaaaca 5521 actacatttt aaaaactgac ttaatgctct tcaaaataaa acatcaccta tttaatgttt 5581 ttaatcacat tttgtatgag ttaaataaag ccc // LOCUS HUMLAMAA 5433 bp mRNA PRI 06-JAN-1995 DEFINITION Homo sapiens laminin-related protein (LamA3) mRNA, complete cds. ACCESSION L34155 NID g551596 KEYWORDS basement membrane protein; epiligrin alpha 3 subunit; extracellular matrix protein; laminin-related protein. SOURCE Homo sapiens (tissue library: random-primed, lambda gt11 (Clontech); dT-primed, plasmid [pcDNA II] (Oligo)) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5433) AUTHORS Ryan,M.C., Tizard,R., VanDevanter,D.R. and Carter,W.G. TITLE Cloning of the LamA3 gene encoding the alpha 3 chain of the adhesive ligand epiligrin. Expression in wound repair JOURNAL J. Biol. Chem. 269 (36), 22779-22787 (1994) MEDLINE 94357926 FEATURES Location/Qualifiers source 1..5433 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="keratinocytes" /tissue_lib="random-primed, lambda gt11 (Clontech); dT-primed, plasmid [pcDNA II] (Oligo)" /map="18q11.2" gene 1..5142 /gene="LamA3" sig_peptide 1..60 /gene="LamA3" CDS 1..5142 /gene="LamA3" /note="amino acid feature: G1 subdomain, aa 794 .. 970; amino acid feature: G2 subdomain, aa 971 .. 1139; amino acid feature: G3 subdomain, aa 1140 .. 1353; amino acid feature: G4 subdomain, aa 1354 .. 1529; amino acid feature: G5 subdomain, aa 1530 .. 1713" /codon_start=1 /function="cell adhesion ligand for integrins; epithelial cell expression" /product="epiligrin alpha 3 subunit" /db_xref="PID:g551597" /translation="MGWLWIFGAALGQCLGYSSQQQRVPFLQPPGQSQLQASYVEFRP SQGCSPGYYRDHKGLYTGRCVPCNCNGHSNQCQDGSGICVNCQHNTAGEHCERCQEGY YGNAVHGSCRACPCPHTNSFATGCVVNGGDVRCSCKAGYTGTQCERCAPGYFGNPQKF GGSCQPCSCNSNGQLGSCHPLTGDCINQEPKDSSPAEECDDCDSCVMTLLNDLATMGE QLRLVKSQLQGLSASAGLLEQMRHMETQAKDLRNQLLNYRSAISNHGSKIEGLERELT DLNQEFETLQEKAQVNSRKAQTLNNNVNRATQSAKELDVKIKNVIRNVHILLKQISGT DGEGNNVPSGDFSREWAEAQRMMRELRNRNFGKHLREAEADKRESQLLLNRIRTWQKT HQGENNGLANSIRDSLNEYEAKLSDLRARLQEAAAQAKQANGLNQENERALGAIQRQV KEINSLQSDFTKYLTTADSSLLQTNIALQLMEKSQKEYEKLAASLNEARQELSDKVRE LSRSAGKTSLVEEAEKHARSLQELAKQLEEIKRNASGDELVRCAVDAATAYENILNAI KAAEDAANRAASASESALQTVIKEDLPRKAKTLSSNSDKLLNEAKMTQKKLKQEVSPA LNNLQQTLNIVTVQKEVIDTNLTTLRDGLHGIQRGDIDAMISSAKSMVRKANDITDEV LDGLNPIQTDVERIKDTYGRTQNEDFKKALTDADNSVNKLTNKLPDLWRKIESINQQL LPLGNISDNMDRIRELIQQARDAASKVAVPMRFNGKSGVEVRLPNDLEDLKGYTSLSL FLQRPNSRENGGTENMFVMYLGNKDASRDYIGMAVVDGQLTCVYNLGDREAELQVDQI LTKSETKEAVMDRVKFQRIYQFARLNYTKGATSSKPETPGVYDMDGRNSNTLLNLDPE NVVFYVGGYPPDFKLPSRLSFPPYKGCIELDDLNENVLSLYNFKKTFNLNTTEVEPCR RRKEESDKNYFEGTGYARVPTQPHAPIPTFGQTIQTTVDRGLLFFAENGDRFISLNIE DGKLMVRYKLNSELPKERGVGDAINNGRDHSIQIKIGKLQKRMWINVDVQNTIIDGEV FDFSTYYLGGIPIAIRERFNISTPAFRGCMKNLKKTSGVVRLNDTVGVTKKCSEDWKL VRSASFSRGGQLSFTDLGLPPTDHLQASFGFQTFQPSGILLDHQTWTRNLQVTLEDGY IELSTSDSGGPIFKSPQTYMDGLLHYVSVISDNSGLRLLIDDQLLRNSKRLKHISSSR QSLRLGGSNFEGCISNVFVQRLSLSPEVLDLTSNSLKRDVSLGGCSLNKPPFLMLLKG STRFNKTKTFRINQLLQDTPVASPRSVKVWQDACSPLPKTQANHGALQFGDIPTSHLL FKLPQELLKPRSQFAVDMQTTSSRGLVFHTGTKNSFMALYLSKGRLVFALGTDGKKLR IKSKEKCNDGKWHTVVFGHDGEKGRLVVDGLRAREGSLPGNSTISIRAPVYLGSPPSG KPKSLPTNSFVGCLKNFQLDSKPLYTPSSSFGVSSCLGGPLEKGIYFSEEGGHVVLAH SVLLGPEFKLVFSIRPRSLTGILIHIGSQPGKHLCVYLEAGKVTASMDSGAGGTSTSV TPKQSLCDGQWHSVAVTIKQHILHLELDTDSSYTAGQIPFPPASTQEPLHLGGAPANL TTLRIPVWKSFFGCLRNIHVNHIPVPVTEALEVQGPVSLNGCPDQ" repeat_region 136..603 /note="EGF repeats, domain IIIa" repeat_region 604..2379 /note="heptad repeats, domain I/II" 3'UTR 5140..5433 polyA_site 5433 BASE COUNT 1618 a 1233 c 1338 g 1244 t ORIGIN 1 atgggatggc tgtggatctt tggggcagcc ctggggcagt gtctgggcta cagttcacag 61 cagcaaaggg tgccatttct tcagcctccc ggtcaaagtc aactgcaagc gagttatgtg 121 gagtttagac ccagccaggg ttgtagccct ggatactatc gggatcataa aggcttgtat 181 accggacggt gtgttccctg caattgcaac ggacattcaa atcaatgcca ggatggctca 241 ggcatatgtg ttaactgtca gcacaacacc gcgggagagc actgtgaacg ctgccaggag 301 ggctactatg gcaacgccgt ccacggatcc tgcagggcct gcccatgtcc tcacactaac 361 agctttgcca ctggctgtgt ggtgaatggg ggagacgtgc ggtgctcctg caaagctggg 421 tacacaggaa cacagtgtga aaggtgtgca ccgggatatt tcgggaatcc ccagaaattc 481 ggaggtagct gccaaccatg cagttgtaac agcaatggcc agctgggcag ctgtcatccc 541 ctgactggag actgcataaa ccaagaaccc aaagatagca gccctgcaga agaatgtgat 601 gattgcgaca gctgtgtgat gaccctcctg aacgacctgg ccaccatggg cgagcagctc 661 cgcctggtca agtctcagct gcagggcctg agtgccagcg cagggcttct ggagcagatg 721 aggcacatgg agacccaggc caaggacctg aggaatcagt tgctcaacta ccgttctgcc 781 atttcaaatc atggatcaaa aatagaaggc ctggaaagag aactgactga tttgaatcaa 841 gaatttgaga ctttgcaaga aaaggctcaa gtaaattcca gaaaagcaca aacattaaac 901 aacaatgtta atcgggcaac acaaagcgca aaagaactgg atgtgaagat taaaaatgtc 961 atccggaatg tgcacattct tttaaagcag atctctggga cagatggaga gggaaacaac 1021 gtgccttcag gtgacttttc cagagagtgg gctgaagccc agcgcatgat gagggaactg 1081 cggaacagga actttggaaa gcacctcaga gaagcagaag ctgataaaag ggagtcgcag 1141 ctcttgctga accggataag gacctggcag aaaacccacc agggggagaa caatgggctt 1201 gctaacagta tccgggattc tttaaatgaa tacgaagcca aactcagtga ccttcgtgct 1261 cggctgcagg aggcagctgc ccaagccaag caggcaaatg gcttgaacca agaaaacgag 1321 agagctttgg gagccattca gagacaagtg aaagaaataa attccctgca gagtgatttc 1381 accaagtatc taaccactgc agactcatct ttgttgcaaa ccaacattgc gctgcagctg 1441 atggagaaaa gccagaagga atatgaaaaa ttagctgcca gtttaaatga agcaagacaa 1501 gaactaagtg acaaagtaag agaactttcc agatctgctg gcaaaacatc ccttgtggag 1561 gaggcagaaa agcacgcgcg gtccttacaa gagctggcaa agcagctgga agagatcaag 1621 agaaacgcca gcggggatga gctggtgcgc tgtgctgtgg atgccgccac cgcctacgag 1681 aacatcctca atgccatcaa agcggccgag gacgcagcca acagggctgc cagtgcatct 1741 gaatctgccc tccagacagt gataaaggaa gatctgccaa gaaaagctaa aaccctgagt 1801 tccaacagtg ataaactgtt aaatgaagcc aagatgacac aaaagaagct aaagcaagaa 1861 gtcagtccag ctctcaacaa cctacagcaa accctgaata ttgtgacagt tcagaaagaa 1921 gtgatagaca ccaatctcac aactctccga gatggtcttc atgggataca gagaggtgat 1981 attgatgcta tgatcagtag tgcaaagagc atggtcagaa aggccaacga catcacagat 2041 gaggttctgg atgggctcaa ccccatccag acagatgtgg aaagaattaa ggacacctat 2101 gggaggacac agaacgaaga cttcaaaaag gctctgactg atgcagataa ctcggtgaat 2161 aagttaacca acaaactacc tgatctttgg cgcaagattg aaagtatcaa ccaacagctg 2221 ttgcccttgg gaaacatctc tgacaacatg gacagaatac gagaactaat tcagcaggcc 2281 agagatgctg ccagtaaggt tgctgtcccc atgaggttca atggtaaatc tggagtcgaa 2341 gtccgactgc caaatgacct ggaagatttg aaaggatata catctctgtc cttgtttctc 2401 caaaggccca actcaagaga aaatgggggt actgagaata tgtttgtgat gtaccttgga 2461 aataaagatg cctcccggga ctacatcggc atggcagttg tggatggcca gctcacctgt 2521 gtctacaacc tgggggaccg tgaggctgaa ctccaagtgg accagatctt gaccaagagt 2581 gagactaagg aggcagttat ggatcgggtg aaatttcaga gaatttatca gtttgcaagg 2641 cttaattaca ccaaaggagc cacatccagt aaaccagaaa cacccggagt ctatgacatg 2701 gatggtagaa atagcaatac actccttaat ttggatcctg aaaatgttgt attttatgtt 2761 ggaggttacc cacctgattt taaacttccc agtcgactaa gtttccctcc atacaaaggt 2821 tgtattgaat tagatgacct caatgaaaat gttctgagct tgtacaactt caaaaaaaca 2881 ttcaatctca acacaactga agtggagcct tgtagaagga ggaaggaaga gtcagacaaa 2941 aattattttg aaggtacggg ctatgctcga gttccaactc aaccacatgc tcccatccca 3001 acctttggac agacaattca gaccaccgtg gatagaggct tgctgttctt tgcagaaaac 3061 ggggatcgct tcatatctct aaatatagaa gatggcaagc tcatggtgag atacaaactg 3121 aattcagagc taccaaaaga gagaggagtt ggagacgcca taaacaacgg cagagaccat 3181 tcgattcaga tcaaaattgg aaaactccaa aagcgtatgt ggataaatgt ggacgttcaa 3241 aacactataa ttgatggtga agtatttgat ttcagcacat attatctggg aggaattcca 3301 attgcaatca gggaaagatt taacatttct acgcctgctt tccgaggctg catgaaaaat 3361 ttgaagaaaa ccagtggtgt cgttagattg aatgatactg tgggagtaac caaaaagtgc 3421 tcggaagact ggaagcttgt gcgatctgcc tcattctcca gaggaggaca attgagtttc 3481 actgatttgg gcttaccacc tactgaccac ctccaggcct catttggatt tcagaccttt 3541 caacccagtg gcatattatt agatcatcag acatggacaa ggaacctgca ggtcactctg 3601 gaagatggtt acattgaatt gagcaccagc gatagcggcg gcccaatttt taaatctcca 3661 cagacgtata tggatggttt actgcattat gtatctgtaa taagcgacaa ctctggacta 3721 cggcttctca tcgatgacca gcttctgaga aatagcaaaa ggctaaaaca catttcaagt 3781 tcccggcagt ctctgcgtct gggcgggagc aattttgagg gttgtattag caatgttttt 3841 gtccagaggt tatcactgag tcctgaagtc ctagatttga ccagtaactc tctcaagaga 3901 gatgtgtccc tgggaggctg cagtttaaac aaaccacctt ttctaatgtt gcttaaaggt 3961 tctaccaggt ttaacaagac caagactttt cgtatcaacc agctgttgca ggacacacca 4021 gtggcctccc caaggagcgt gaaggtgtgg caagatgctt gctcaccact tcccaagacc 4081 caggccaatc atggagccct ccagtttggg gacattccca ccagccactt gctattcaag 4141 cttcctcagg agctgctgaa acccaggtca cagtttgctg tggacatgca gacaacatcc 4201 tccagaggac tggtgtttca cacgggcact aagaactcct ttatggctct ttatctttca 4261 aaaggacgtc tggtctttgc actggggaca gatgggaaaa aattgaggat caaaagcaag 4321 gagaaatgca atgatgggaa atggcacacg gtggtgtttg gccatgatgg ggaaaagggg 4381 cgcttggttg tggatggact gagggcccgg gagggaagtt tgcctggaaa ctccaccatc 4441 agcatcagag cgccagttta cctgggatca cctccatcag ggaaaccaaa gagcctcccc 4501 acaaacagct ttgtgggatg cctgaagaac tttcagctgg attcaaaacc cttgtatacc 4561 ccttcttcaa gcttcggggt gtcttcctgc ttgggtggtc ctttggagaa aggcatttat 4621 ttctctgaag aaggaggtca tgtcgtcttg gctcactctg tattgttggg gccagaattt 4681 aagcttgttt tcagcatccg cccaagaagt ctcactggga tcctaataca catcggaagt 4741 cagcccggga agcacttatg tgtttacctg gaggcaggaa aggtcacggc ctctatggac 4801 agtggggcag gtgggacctc aacgtcggtc acaccaaagc agtctctgtg tgatggacag 4861 tggcactcgg tggcagtcac cataaaacaa cacatcctgc acctggaact ggacacagac 4921 agtagctaca cagctggaca gatccccttc ccacctgcca gcactcaaga gccactacac 4981 cttggaggtg ctccagccaa tttgacgaca ctgaggatcc ctgtgtggaa atcattcttt 5041 ggctgtctga ggaatattca tgtcaatcac atccctgtcc ctgtcactga agccttggaa 5101 gtccaggggc ctgtcagtct gaatggttgt cctgaccagt aacccaagcc tatttcacag 5161 caaggaaatt caccttcaaa agcactgatt acccaatgca cctccctccc cagctcgaga 5221 tcattcttca attaggacac aaaccagaca ggtttaatag cgaatctaat tttgaattct 5281 gaccatggat acccatcact ttggcattca gtgctacatg tgtattttat ataaaaatcc 5341 catttcttga agataaaaaa attgttattc aaattgttat gcacagaatg tttttggtaa 5401 tattaatttc cactaaaaaa ttaaatgtct ttt // LOCUS HUMLAMBA 2850 bp mRNA PRI 11-JUN-1993 DEFINITION Human lamin B mRNA, complete cds. ACCESSION M34458 NID g186877 KEYWORDS intermediate filament; lamin B. SOURCE Human T-cell line MOLT-4, cDNA to mRNA, clone LAM-2. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2850) AUTHORS Pollard,K.M., Chan,E.K.L., Grant,B.J., Sullivan,K.F., Tan,E.M. and Glass,C.A. TITLE In vitro posttranslational modification of lamin B cloned from a human T-cell line JOURNAL Mol. Cell. Biol. 10, 2164-2175 (1990) MEDLINE 90220602 FEATURES Location/Qualifiers source 1..2850 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <342..2850 /note="lamin B mRNA" CDS 342..2102 /note="lamin B" /codon_start=1 /db_xref="PID:g307106" /translation="MATATPVPPRMGSRAGGPTTPLSPTRLSRLQEKEELRELNDRLA VYIDKVRSLETENSALQLQVTEREEVRGRELTGLKALYETELADARRALDDTARERAK LQIELGKCKAEHDQLLLNYAKKESDLNGAQIKLREYEAALNSKDAALATALGDKKSLE GDLEDLKDQIAQLEASLAAAKKQLADETLLKVDLENRCQSLTEDLEFRKSMYEEEINE TRRKHETRLVEVDSGRQIEYEYKLAQALHEMREQHDAQVRLYKEELEQTYHAKLENAR LSSEMNTSTVNSAREELMESRMRIESLSSQLSNLQKESRACLERIQELEDLLAKEKDN SRRMLTDKEREMAEIRDQMQQQLNDYEQLLDVKLALDMEISAYRKLLEGEEERLKLSP SPSSRVTVSRASSSRSVRTTRGKRKRVDVEESEASSSVSISHSASATGNVCIEEIDVD GKFIRLKNTSEQDQPMGGWEMIRKIGDTSVSYKYTSRYVLKAGQTVTIWAANAGVTAS PPTDLIWKNQNSWGTGEDVKVILKNSQGEEVAQRSTVFKTTIPEEEEEEEEAAGVVVE EELFHQQGTPRASNRSCAIM" polyA_signal 2834..2839 BASE COUNT 776 a 614 c 748 g 712 t ORIGIN 1 cgcgagcagg agacggcggc gggcgaaccc tgctgggcct ccagtcaccc tcgtcttgca 61 ttttcccgcg tgcgtgtgtg agtgggtgtg tgtgttttct tacaaagggt atttcgcgat 121 cgatcgattg attcgtagtt cccccccgcg cgcctttgcc ctttgtgctg taatcgagct 181 cccgccatcc caggtgcttc tccgttcctc taaacgccag cgtctggacg tgagcgcagg 241 tcgccggttt gtgccttcgg tccccgcttc gccccctgcc gtcccctcct tatcacggtc 301 ccgctcgcgg cctcgccgcc ccgctgtctc cgccgcccgc catggcgact gcgacccccg 361 tgccgccgcg gatgggcagc cgcgctggcg gccccaccac gccgctgagc cccacgcgcc 421 tgtcgcggct ccaggagaag gaggagctgc gcgagctcaa tgaccggctg gcggtgtaca 481 tcgacaaggt gcgcagcctg gagacggaga acagcgcgct gcagctgcag gtgacggagc 541 gcgaggaggt gcgcggccgt gagctcaccg gcctcaaggc gctctacgag accgagctgg 601 ccgacgcgcg acgcgcgctc gacgacacgg cccgcgagcg cgccaagctg cagatcgagc 661 tgggcaagtg caaggcggaa cacgaccagc tgctcctcaa ctatgctaag aaggaatctg 721 atcttaatgg cgcccagatc aagcttcgag aatatgaagc agcactgaat tcgaaagatg 781 cagctcttgc tactgcactt ggtgacaaaa aaagtttaga gggagatttg gaggatctga 841 aggatcagat tgcccagttg gaagcctcct tagctgcagc caaaaaacag ttagcagatg 901 aaactttact taaagtagat ttggagaatc gttgtcagag ccttactgag gacttggagt 961 ttcgcaaaag catgtatgaa gaggagatta acgagaccag aaggaagcat gaaacgcgct 1021 tggtagaggt ggattctggg cgtcaaattg agtatgagta caagctggcg caagcccttc 1081 atgagatgag agagcaacat gatgcccaag tgaggctgta taaggaggag ctggagcaga 1141 cttaccatgc caaacttgag aatgccagac tgtcatcaga gatgaatact tctactgtca 1201 acagtgccag ggaagaactg atggaaagcc gcatgagaat tgagagcctt tcatcccagc 1261 tttctaatct acagaaagag tctagagcat gtttggaaag gattcaagaa ttagaggact 1321 tgcttgctaa agaaaaagac aactctcgtc gcatgctgac agacaaagag agagagatgg 1381 cggaaataag ggatcaaatg cagcaacagc tgaatgacta tgaacagctt cttgatgtaa 1441 agttagccct ggacatggaa atcagtgctt acaggaaact cttagaaggc gaagaagaga 1501 ggttgaagct gtctccaagc ccttcttccc gtgtgacagt atcccgagca tcctcaagtc 1561 gtagtgtacg tacaactaga ggaaagcgga agagggttga tgtggaagaa tcagaggcga 1621 gtagtagtgt tagcatctct cattccgcct cagccactgg aaatgtttgc atcgaagaaa 1681 ttgatgttga tgggaaattt atccgcttga agaacacttc tgaacaggat caaccaatgg 1741 gaggctggga gatgatcaga aaaattggag acacatcagt cagttataaa tatacctcaa 1801 gatatgtgct gaaggcaggc cagactgtta caatttgggc tgcaaacgct ggtgtcacag 1861 ccagcccccc aactgacctc atctggaaga accagaactc gtggggcact ggcgaagatg 1921 tgaaggttat attgaaaaat tctcagggag aggaggttgc tcaaagaagt acagtcttta 1981 aaacaaccat acctgaagaa gaggaggagg aggaagaagc agctggagtg gttgttgagg 2041 aagaactttt ccaccagcag ggaaccccaa gagcatccaa tagaagctgt gcaattatgt 2101 aaaattttca actgtcttcc tcaaaataaa gaagtatggt aatctttacc tgtatacagt 2161 gcagagcctt ctcagaagca cagaatattt ttatatttcc tttatgtgaa tttttaagct 2221 gcaaatctga tggccttaat ttcctttttg acactgaaag ttttgtaaaa gaaatcatgt 2281 ccatacactt tgttgcaaga tgtgaattat tgacactgaa cttaataact gtgtactgtt 2341 cggaaggggt tcctcaaatt ttttgacttt ttttgtatgt gtgttttttc ttttttttta 2401 agttcttatg aggaggggag ggtaaataaa ccactgtgcg tcttggtgta atttgaagat 2461 tgccccatct agactagcaa tctcttcatt attctctgct atatataaaa cggtgctgtg 2521 agggagggga aaagcatttt tcaatatatt gaacttttgt actgaatttt tttgtaataa 2581 gcaatcaagg ttataatttt ttttaaaata gaaattttgt aagaaggcaa tattaaccta 2641 atcaccatgt aagcactctg gatgatggat tccacaaaac ttggttttat ggttacttct 2701 tctcttagat tcttaattca tgaggagggt gggggaggga ggtggaggga gggaagggtt 2761 tctctattaa aatgcattcg ttgtgttttt taagatagtg taacttgctt aaatttctta 2821 tgtgacatta acaaataaaa aagctctttt // LOCUS HUMLAMBB 5306 bp mRNA PRI 06-JAN-1995 DEFINITION Human laminin B2 chain mRNA, complete cds. ACCESSION J03202 NID g186916 KEYWORDS glycoprotein; laminin. SOURCE Human placenta, cDNA to mRNA, clones HL-[205,209,210,220,237,246]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5306) AUTHORS Pikkarainen,T., Kallunki,T. and Tryggvason,K. TITLE Human laminin B2 chain. Comparison of the complete amino acid sequence with the B1 chain reveals variability in sequence homology between different structural domains JOURNAL J. Biol. Chem. 263 (14), 6751-6758 (1988) MEDLINE 88198245 COMMENT Computer-readable copy of sequence for [1] kindly provided by K.Tryggvason, 26-FEB-1988. FEATURES Location/Qualifiers source 1..5306 /organism="Homo sapiens" /db_xref="taxon:9606" /map="1q31" mRNA <1..5306 /note="LAMB2 mRNA" sig_peptide 260..358 /gene="LAMB2" /note="laminin B2 signal peptide (3' end put.); putative" gene 260..5089 /gene="LAMB2" CDS 260..5089 /gene="LAMB2" /note="laminin B2 precursor" /codon_start=1 /db_xref="GDB:G00-120-136" /db_xref="PID:g307107" /translation="MRGSHRAAPALRPRGRLWPVLAVLAAAAAAGCAQAAMDECTDEG GRPQRCMPEFVNAAFNVTVVATNTCGTPPEEYCVQTGVTGVTKSCHLCDAGQPHLQHG AAFLTDYNNQADTTWWQSQTMLAGVQYPSSINLTLHLGKAFDITYVRLKFHTSRPESF AIYKRTREDGPWIPYQYYSGSCENTYSKANRGFIRTGGDEQQALCTDEFSDISPLTGG NVAFSTLEGRPSAYNFDNSPVLQEWVTATDIRVTLNRLNTFGDEVFNDPKVLKSYYYA ISDFAVGGRCKCNGHASECMKNEFDKLVCNCKHNTYGVDCEKCLPFFNDRPWRRATAE SASECLPCDCNGRSQECYFDPELYRSTGHGGHCTNCQDNTDGAHCERCRENFFRLGNN EACSSCHCSPVGSLSTQCDSYGRCSCKPGVMGDKCDRCQPGFHSLTEAGCRPCSCDPS GSIDECNVETGRCVCKDNVEGFNCERCKPGFFNLESSNPRGCTPCFCFGHSSVCTNAV GYSVYSISSTFQIDEDGWRAEQRDGSEASLEWSSERQDIAVISDSYFPRYFIAPAKFL GKQVLSYGQNLSFSFRVDRRDTRLSAEDLVLEGAGLRVSVPLIAQGNSYPSETTVKYV FRLHEATDYPWRPALTPFEFQKLLNNLTSIKIRGTYSERSAGYLDDVTLASARPGPGV PATWVESCTCPVGYGGQFCEMCLSGYRRETPNLGPYSPCVLCACNGHSETCDPETGVC NCRDNTAGPHCEKCSDGYYGDSTAGTSSDCQPCPCPGGSSCAVVPKTKEVVCTNCPTG TTGKRCELCDDGYFGDPLGRNGPVRLCRLCQCSDNIDPNAVGNCNRLTGECLKCIYNT AGFYCDRCKDGFFGNPLAPNPADKCKACNCNPYGTMKQQSSCNPVTGQCECLPHVTGQ DCGACDPGFYNLQSGQGCERCDCHALGSTNGQCDIRTGQCECQPGITGQHCERCEVNH FGFGPEGCKPCDCHPEGSLSLQCKDDGRCECREGFVGNRCDQCEENYFYNRSWPGCQE CPACYRLVKDKVADHRVKLQELESLIANLGTGDEMVTDQAFEDRLKEAEREVMDLLRE AQDVKDVDQNLMDRLQRVNNTLSSQISRLQNIRNTIEETGNLAEQARAHVENTERLIE IASRELEKAKVAAANVSVTQPESTGDPNNMTLLAEEARKLAERHKQEADDIVRVAKTA NDTSTEAYNLLLRTLAGENQTAFEIEELNRKYEQAKNISQDLEKQAARVHEEAKRAGD KAVEIYASVAQLSPLDSETLENEANNIKMEAENLEQLIDQKLKDYEDLREDMRGKELE VKNLLEKGKTEQQTADQLLARADAAKALAEEAAKKGRDTLQEANDILNNLKDFDRRVN DNKTAAEEALRKIPAINQTITEANEKTREAQQALGSAAADATEAKNKAHEAERIASAV QKNATSTKAEAERTFAEVTDLDNEVNNMLKQLQEAEKELKRKQDDADQDMMMAGMASQ AAQEAEINARKAKNSVTSLLSIINDLLEQLGQLDTVDLNKLNEIEGTLNKAKDEMKVS DLDRKVSDLENEAKKQEAAIMDYNRDIEEIMKDIRNLEDIRKTLPSGCFNTPSIEKP" mat_peptide 359..5086 /gene="LAMB2" /note="laminin B2 (5' end put.); putative" BASE COUNT 1387 a 1310 c 1490 g 1119 t ORIGIN 681 bp upstream of HindIII site; chromosome 1q31. 1 cggggcaggc tgctcccggg gtaggtgagg gaagcgcgga ggcggcgcgc gggggcagtg 61 gtcggcgagc agcgcggtcc tcgctagggg cgcccacccg tcagtctctc cggcgcgagc 121 cgccgccacc gcccgcgccg gagtcaggcc cctgggcccc caggctcaag cagcgaagcg 181 gcctccgggg gacgccgcta ggcgagagga acgcgccggt gcccttgcct tcgccgtgac 241 ccagcgtgcg ggcggcggga tgagagggag ccatcgggcc gcgccggccc tgcggccccg 301 ggggcggctc tggcccgtgc tggccgtgct ggcggcggcc gccgcggcgg gctgtgccca 361 ggcagccatg gacgagtgca cggacgaggg cgggcggccg cagcgctgca tgcccgagtt 421 cgtcaacgcc gctttcaacg tgactgtggt ggccaccaac acgtgtggga ctccgcccga 481 ggaatactgt gtgcagaccg gggtgaccgg ggtcaccaag tcctgtcacc tgtgcgacgc 541 cgggcagccc cacctgcagc acggggcagc cttcctgacc gactacaaca accaggccga 601 caccacctgg tggcaaagcc agaccatgct ggccggggtg cagtacccca gctccatcaa 661 cctcacgctg cacctgggaa aagcttttga catcacctat gtgcgtctca agttccacac 721 cagccgcccg gagagctttg ccatttacaa gcgcacacgg gaagacgggc cctggattcc 781 ttaccagtac tacagtggtt cctgcgagaa cacctactcc aaggcaaacc gcggcttcat 841 caggacagga ggggacgagc agcaggcctt gtgtactgat gaattcagtg acatttctcc 901 cctcactggg ggcaacgtgg ccttttctac cctggaagga aggcccagcg cctataactt 961 tgacaatagc cctgtgctgc aggaatgggt aactgccact gacatcagag taactcttaa 1021 tcgcctgaac acttttggag atgaagtgtt taacgatccc aaagttctca agtcctatta 1081 ttatgccatc tctgattttg ctgtaggtgg cagatgtaaa tgtaatggac acgcaagcga 1141 gtgtatgaag aacgaatttg ataagctggt gtgtaattgc aaacataaca catatggagt 1201 agactgtgaa aagtgtcttc ctttcttcaa tgaccggccg tggaggaggg caactgcgga 1261 aagtgccagt gaatgcctgc cctgtgattg caatggtcga tcccaggaat gctacttcga 1321 ccctgaactc tatcgttcca ctggccatgg gggccactgt accaactgcc aggataacac 1381 agatggcgcc cactgtgaga ggtgccgaga gaacttcttc cgccttggca acaatgaagc 1441 ctgctcttca tgccactgta gtcctgtggg ctctctaagc acacagtgtg atagttacgg 1501 cagatgcagc tgtaagccag gagtgatggg ggacaaatgt gaccgttgcc agcctggatt 1561 ccattctctc actgaagcag gatgcaggcc atgctcttgt gatccctctg gcagcataga 1621 tgaatgtaat gttgaaacag gaagatgtgt ttgcaaagac aatgtcgaag gcttcaattg 1681 tgaaagatgc aaacctggat tttttaatct ggaatcatct aatcctcggg gttgcacacc 1741 ctgcttctgc tttgggcatt cttctgtctg tacaaacgct gttggctaca gtgtttattc 1801 tatctcctct acctttcaga ttgatgagga tgggtggcgt gcggaacaga gagatggctc 1861 tgaagcatct ctcgagtggt cctctgagag gcaagatatc gccgtgatct cagacagcta 1921 ctttcctcgg tacttcattg ctcctgcaaa gttcttgggc aagcaggtgt tgagttatgg 1981 tcagaacctc tccttctcct ttcgagtgga caggcgagat actcgcctct ctgccgaaga 2041 ccttgtgctt gagggagctg gcttaagagt atctgtaccc ttgatcgctc agggcaattc 2101 ctatccaagt gagaccactg tgaagtatgt cttcaggctc catgaagcaa cagattaccc 2161 ttggaggcct gctcttaccc cttttgaatt tcagaagctc ctaaacaact tgacctctat 2221 caagatacgt gggacataca gtgagagaag tgctggatat ttggatgatg tcaccctggc 2281 aagtgctcgt cctgggcctg gagtccctgc aacttgggtg gagtcctgca cctgtcctgt 2341 gggatatgga gggcagtttt gtgagatgtg cctctcaggt tacagaagag aaactcctaa 2401 tcttggacca tacagtccat gtgtgctttg cgcctgcaat ggacacagcg agacctgtga 2461 tcctgagaca ggtgtttgta actgcagaga caatacggct ggcccgcact gtgagaagtg 2521 cagtgatggg tactatggag attcaactgc aggcacctcc tccgattgcc aaccctgtcc 2581 gtgtcctgga ggttcaagtt gtgctgttgt tcccaagaca aaggaggtgg tgtgcaccaa 2641 ctgtcctact ggcaccactg gtaagagatg tgagctctgt gatgatggct actttggaga 2701 ccccctgggt agaaacggcc ctgtgagact ttgccgcctg tgccagtgca gtgacaacat 2761 cgatcccaac gcagttggaa attgcaatcg cttgacggga gaatgcctga agtgcatcta 2821 taacactgct ggcttctatt gtgaccggtg caaagacgga ttttttggaa atcccctggc 2881 tcccaatcca gcagacaaat gcaaagcctg caattgcaat ccgtatggga ccatgaagca 2941 gcagagcagc tgtaaccccg tgacggggca gtgtgaatgt ttgcctcacg tgactggcca 3001 ggactgtggt gcttgtgacc ctggattcta caatctgcag agtgggcaag gctgtgagag 3061 gtgtgactgc catgccttgg gctccaccaa tgggcagtgt gacatccgca ccggccagtg 3121 tgagtgccag cccggcatca ctggtcagca ctgtgagcgc tgtgaggtca accactttgg 3181 gtttggacct gaaggctgca aaccctgtga ctgtcatcct gagggatctc tttcacttca 3241 gtgcaaagat gatggtcgct gtgaatgcag agaaggcttt gtgggaaatc gctgtgacca 3301 gtgtgaagaa aactatttct acaatcggtc ttggcctggc tgccaggaat gtccagcttg 3361 ttaccggctg gtaaaggata aggttgctga tcatagagtg aagctccagg aattagagag 3421 tctcatagca aaccttggaa ctggggatga gatggtgaca gatcaagcct tcgaggatag 3481 actaaaggaa gcagagaggg aagttatgga cctccttcgt gaggcccagg atgtcaaaga 3541 tgttgaccag aatttgatgg atcgcctaca gagagtgaat aacactctgt ccagccaaat 3601 tagccgttta cagaatatcc ggaataccat tgaagagact ggaaacttgg ctgaacaagc 3661 gcgtgcccat gtagagaaca cagagcggtt gattgaaatc gcatccagag aacttgagaa 3721 agcaaaagtc gctgctgcca atgtgtcagt cactcagcca gaatctacag gggacccaaa 3781 caacatgact cttttggcag aagaggctcg aaagcttgct gaacgtcata aacaggaagc 3841 tgatgacatt gttcgagtgg caaagacagc caatgatacg tcaactgagg catacaacct 3901 gcttctgagg acactggcag gagaaaatca aacagcattt gagattgaag agcttaatag 3961 gaagtatgaa caagcgaaga acatctcaca ggatctggaa aaacaagctg cccgagtaca 4021 tgaggaggcc aaaagggccg gtgacaaagc tgtggagatc tatgccagcg tggctcagct 4081 gagccctttg gactctgaga cactggagaa tgaagcaaat aacataaaga tggaagctga 4141 gaatctggaa caactgattg accagaaatt aaaagattat gaggacctca gagaagatat 4201 gagagggaag gaacttgaag tcaagaacct tctggagaaa ggcaagactg aacagcagac 4261 cgcagaccaa ctcctagccc gagctgatgc tgccaaggcc ctcgctgaag aagctgcaaa 4321 gaagggacgg gataccttac aagaagctaa tgacattctc aacaacctga aagattttga 4381 taggcgcgtg aacgataaca agacggccgc agaggaggca ctaaggaaga ttcctgccat 4441 caaccagacc atcactgaag ccaatgaaaa gaccagagaa gcccagcagg ccctgggcag 4501 tgctgcggcg gatgccacag aggccaagaa caaggcccat gaggcggaga ggatcgcaag 4561 cgctgtccaa aagaatgcca ccagcaccaa ggcagaagct gaaagaactt ttgcagaagt 4621 tacagatctg gataatgagg tgaacaatat gttgaagcaa ctgcaggaag cagaaaaaga 4681 gctaaagaga aaacaagatg acgctgacca ggacatgatg atggcaggga tggcttcaca 4741 ggctgctcaa gaagccgaga tcaatgccag aaaagccaaa aactctgtta ctagcctcct 4801 cagcattatt aatgacctct tggagcagct ggggcagctg gatacagtgg acctgaataa 4861 gctaaacgag attgaaggca ccctaaacaa agccaaagat gaaatgaagg tcagcgatct 4921 tgataggaaa gtgtctgacc tggagaatga agccaagaag caggaggctg ccatcatgga 4981 ctataaccga gatatcgagg agatcatgaa ggacattcgc aatctggagg acatcaggaa 5041 gaccttacca tctggctgct tcaacacccc gtccattgaa aagccctagt gtctttaggg 5101 ctggaaggca gcatccctct gacagggggg cagttgtgag gccacagagt gccttgacac 5161 aaagattaca tttttcagac ccccactcct ctgctgctgt ccatcactgt ccttttgaac 5221 caggaaaagt cacagagttt aaagagaagc aaattaaaca tcctgaatcg ggaacaaagg 5281 gttttatcta ataaagtgtc tcttcc // LOCUS HUMLAMP1A 2455 bp mRNA PRI 11-JAN-1995 DEFINITION Homo sapiens lysosomal membrane glycoprotein-1 (LAMP1) mRNA, complete cds. ACCESSION J04182 NID g186927 KEYWORDS LAMP1 gene; lysosomal membrane glycoprotein-1; membrane glycoprotein. SOURCE Homo sapiens placenta cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2455) AUTHORS Fukuda,M., Viitala,J., Matteson,J. and Carlsson,S.R. TITLE Cloning of cDNAs encoding human lysosomal membrane glycoproteins, h-lamp-1 and h-lamp-2. Comparison of their deduced amino acid sequences JOURNAL J. Biol. Chem. 263 (35), 18920-18928 (1988) MEDLINE 89066687 COMMENT Computer readable copy of sequence [1] kindly submitted by M.Fukuda 24-OCT-1988. FEATURES Location/Qualifiers source 1..2455 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="P-hL1-15B" /tissue_type="placenta" /map="13q34" sig_peptide 191..271 /gene="LAMP1" /note="G00-120-137" CDS 191..1441 /gene="LAMP1" /note="precursor" /codon_start=1 /db_xref="GDB:G00-120-137" /product="lysosomal membrane glycoprotein-1" /db_xref="PID:g307109" /translation="MAPRSARRPLLLLLPVAAARPHALSSAAMFMVKNGNGTACIMAN FSAAFSVNYDTKSGPKNMTFDLPSDATVVLNRSSCGKENTSDPSLVIAFGRGHTLTLN FTRNATRYSVQLMSFVYNLSDTHLFPNASSKEIKTVESITDIRADIDKKYRCVSGTQV HMNNVTVTLHDATIQAYLSNSSFSRGETRCEQDRPSPTTAPPAPPSPSPSPVPKSPSV DKYNVSGTNGTCLLASMGLQLNLTYERKDNTTVTRLLNINPNKTSASGSCGAHLVTLE LHSEGTTVLLFQFGMNASSSRFFLQGIQLNTILPDARDPAFKAANGSLRALQATVGNS YKCNAEEHVRVTKAFSVNIFKVWVQAFKVEGGQFGSVEECLLDENSTLIPIAVGGALA GLVLIVLIAYLVGRKRSHAGYQTI" gene 191..1441 /gene="LAMP1" mat_peptide 272..1438 /gene="LAMP1" /product="lysosomal membrane glycoprotein-1" BASE COUNT 530 a 671 c 677 g 577 t ORIGIN 1 gaattcgggc gggcttcttc gctgccgacg tacgacgagt ggccgggctc ttgcgtctgg 61 taacgcgctg tctctaacgc cagcgccgtc tcgcgcgcac tgcgcacaga ccacccgcag 121 acgcccggca gtccgcaggc ccaaacgcgc acgcgacccc gctctccgca ccgtacccgg 181 ccgcctcggc atggcgcccc gcagcgcccg gcgacccctg ctgctgctac tgcctgttgc 241 tgctgctcgg cctcatgcat tgtcgtcagc agccatgttt atggtgaaaa atggcaacgg 301 gaccgcgtgc ataatggcca acttctctgc tgccttctca gtgaactacg acaccaagag 361 tggccccaag aacatgacct ttgacctgcc atcagatgcc acagtggtgc tcaaccgcag 421 ctcctgtgga aaagagaaca cttctgaccc cagtctcgtg attgcttttg gaagaggaca 481 tacactcact ctcaatttca cgagaaatgc aacacgttac agcgttcagc tcatgagttt 541 tgtttataac ttgtcagaca cacacctttt ccccaatgcg agctccaaag aaatcaagac 601 tgtggaatct ataactgaca tcagggcaga tatagataaa aaatacagat gtgttagtgg 661 cacccaggtc cacatgaaca acgtgaccgt aacgctccat gatgccacca tccaggcgta 721 cctttccaac agcagcttca gcaggggaga gacacgctgt gaacaagaca ggccttcccc 781 aaccacagcg ccccctgcgc cacccagccc ctcgccctca cccgtgccca agagcccctc 841 tgtggacaag tacaacgtga gcggcaccaa cgggacctgc ctgctggcca gcatggggct 901 gcagctgaac ctcacctatg agaggaagga caacacgacg gtgacaaggc ttctcaacat 961 caaccccaac aagacctcgg ccagcgggag ctgcggcgcc cacctggtga ctctggagct 1021 gcacagcgag ggcaccaccg tcctgctctt ccagttcggg atgaatgcaa gttctagccg 1081 gtttttccta caaggaatcc agttgaatac aattcttcct gacgccagag accctgcctt 1141 taaagctgcc aacggctccc tgcgagcgct gcaggccaca gtcggcaatt cctacaagtg 1201 caacgcggag gagcacgtcc gtgtcacgaa ggcgttttca gtcaatatat tcaaagtgtg 1261 ggtccaggct ttcaaggtgg aaggtggcca gtttggctct gtggaggagt gtctgctgga 1321 cgagaacagc acgctgatcc ccatcgctgt gggtggtgcc ctggcggggc tggtcctcat 1381 cgtcctcatc gcctacctcg tcggcaggaa gaggagtcac gcaggctacc agactatcta 1441 gcctggtgca cgcaggcaca gcagctgcag gggcctctgt tcctttctct gggcttaggg 1501 tcctgtcgaa ggggaggcac actttctgca aacgtttctc aaatctgctt catccaatgt 1561 gaagttcatc ttgcagcatt tactatgcac aacagagtaa ctatcgaaat gacggtgtta 1621 attttgctaa ctgggttaaa tattttgcta actggttaaa cattaatatt taccaaagta 1681 ggattttgag ggtgggggtg ctctctctga gggggtgggg gtgccgctgt ctctgagggg 1741 tgggggtgcc gctgtctgag gggtgggggt gccgctctct ctgagggggt gggggtgccg 1801 ctttctctga gggggtgggg gtgccgctct ctctgagggg gtgggggtgc tgctctctcc 1861 gaggggtgga atgccgctgt ctctgagggg tgggggtgcc gctctaaatt ggctccatat 1921 cattgagttt agggttctgg tgtttggttt cttcattctt tactgcactc agatttaagc 1981 cttacaaagg gaaacctctg gccgtcacac gtaggacgca tgaaggtcac tcgtgtgagg 2041 ctgacatgct cacacattac aacagtagag agggaaaatc ctaagacaga ggaactccag 2101 agatgagtgt ctggagcggc ttcagttcag ctttaaaggc caggacgcgc gacacgtggc 2161 tggcggcctc gttccagtgg cggcacgtcc ttggcgtctc taatgtctgc agctcaaggg 2221 ctggcacttt tttaaatata aaaatggtgt tatttttatt tttttttgta aagtgatttt 2281 tggtcttctg ttgacattcg ggtgatcctg ttctgcgctg tgtacaatgt gagatcggtg 2341 cgttctcctg atgttttgcc gtggcttggg gattgtacac gggaccagct cacgtaatgc 2401 attgcctgta acaatgtaat aaaaagcctc tttctttcaa aaaaaccccg aattc // LOCUS HUMLAP 2776 bp mRNA PRI 06-JAN-1995 DEFINITION Human leukocyte adhesion protein (LFA-1/Mac-1/p150,95 family) beta subunit mRNA. ACCESSION M15395 NID g186933 KEYWORDS cell adhesion molecule; cell surface glycoprotein; glycoprotein; leukocyte adhesion protein. SOURCE Human tonsil, cDNA to mRNA, clones 18.1.1, 9.1.1 and 3.1.1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2776) AUTHORS Kishimoto,T.K., O'Connor,K., Lee,A., Roberts,T.M. and Springer,T.A. TITLE Cloning of the beta subunit of the leukocyte adhesion proteins: homology to an extracellular matrix receptor defines a novel supergene family JOURNAL Cell 48 (4), 681-690 (1987) MEDLINE 87131080 COMMENT Draft entry and computer-readable copy of sequence in [1] kindly provided by T.K.Kishimoto, 22-APR-1987. FEATURES Location/Qualifiers source 1..2776 /organism="Homo sapiens" /db_xref="taxon:9606" /map="1q23-q25" sig_peptide 73..138 /gene="LYAM1" /note="leukocyte adhesion protein signal peptide; G00-120-157" CDS 73..2382 /gene="LYAM1" /note="leukocyte adhesion protein beta-subunit precursor" /codon_start=1 /db_xref="GDB:G00-120-157" /db_xref="PID:g307113" /translation="MLGLRPPLLALVGLLSLGCVLSQECTKFKVSSCRECIESGPGCT WCQKLNFTGPGDPDSIRCDTRPQLLMRGCAADDIMDPTSLAETQEDHNGGQKQLSPQK VTLYLRPGQAAAFNVTFRRAKGYPIDLYYLMDLSYSMLDDLRNVKKLGGDLLRALNEI TESGRIGFGSFVDKTVLPFVNTHPDKLRNPCPNKEKECQPPFAFRHVLKLTNNSNQFQ TEVGKQLISGNLDAPEGGLDAMMQVAACPEEIGWRNVTRLLVFATDDGFHFAGDGKLG AILTPNDGRCHLEDNLYKRSNEFDYPSVGQLAHKLAENNIQPIFAVTSRMVKTYEKLT EIIPKSAVGELSEDSSNVVHLIKNAYNKLSSRVFLDHNALPDTLKVTYDSFCSNGVTH RNQPRGDCDGVQINVPITFQVKVTATECIQEQSFVIRALGFTDIVTVQVLPQCECRCR DQSRDRSLCHGKGFLECGICRCDTGYIGKNCECQTQGRSSQELEGSCRKDNNSIICSG LGDCVCGQCLCHTSDVPGKLIYGQYCECDTINCERYNGQVCGGPGRGLCFCGKCRCHP GFEGSACQCERTTEGCLNPRRVECSGRGRCRCNVCECHSGYQLPLCQECPGCPSPCGK YISCAECLKFEKGPFGKNCSAACPGLQLSNNPVKGRTCKERDSEGCWVAYTLEQQDGM DRYLIYVDESRECVAGPNIAAIVGGTVAGIVLIGILLLVIWKALIHLSDLREYRRFEK EKLKSQWNNDNPLFKSATTTVMNPKFAES" gene 73..2382 /gene="LYAM1" mat_peptide 139..2379 /gene="LYAM1" /note="leukocyte adhesion protein beta-subunit; G00-120-157" BASE COUNT 610 a 817 c 834 g 515 t ORIGIN 194 bp upstream of ApaI site; chromosoome 21. 1 cagggcagac tggtagcaaa gcccccacgc ccagccagga gcaccgccgc ggactccagc 61 acaccgaggg acatgctggg cctgcgcccc ccactgctcg ccctggtggg gctgctctcc 121 ctcgggtgcg tcctctctca ggagtgcacg aagttcaagg tcagcagctg ccgggaatgc 181 atcgagtcgg ggcccggctg cacctggtgc cagaagctga acttcacagg gccgggggat 241 cctgactcca ttcgctgcga cacccggcca cagctgctca tgaggggctg tgcggctgac 301 gacatcatgg accccacaag cctcgctgaa acccaggaag accacaatgg gggccagaag 361 cagctgtccc cacaaaaagt gacgctttac ctgcgaccag gccaggcagc agcgttcaac 421 gtgaccttcc ggcgggccaa gggctacccc atcgacctgt actatctgat ggacctctcc 481 tactccatgc ttgatgacct caggaatgtc aagaagctag gtggcgacct gctccgggcc 541 ctcaacgaga tcaccgagtc cggccgcatt ggcttcgggt ccttcgtgga caagaccgtg 601 ctgccgttcg tgaacacgca ccctgataag ctgcgaaacc catgccccaa caaggagaaa 661 gagtgccagc ccccgtttgc cttcaggcac gtgctgaagc tgaccaacaa ctccaaccag 721 tttcagaccg aggtcgggaa gcagctgatt tccggaaacc tggatgcacc cgagggtggg 781 ctggacgcca tgatgcaggt cgccgcctgc ccggaggaaa tcggctggcg caacgtcacg 841 cggctgctgg tgtttgccac tgatgacggc ttccatttcg cgggcgacgg aaagctgggc 901 gccatcctga cccccaacga cggccgctgt cacctggagg acaacttgta caagaggagc 961 aacgaattcg actacccatc ggtgggccag ctggcgcaca agctggctga aaacaacatc 1021 cagcccatct tcgcggtgac cagtaggatg gtgaagacct acgagaaact caccgagatc 1081 atccccaagt cagccgtggg ggagctgtct gaggactcca gcaatgtggt ccatctcatt 1141 aagaatgctt acaataaact ctcctccagg gtcttcctgg atcacaacgc cctccccgac 1201 accctgaaag tcacctacga ctccttctgc agcaatggag tgacgcacag gaaccagccc 1261 agaggtgact gtgatggcgt gcagatcaat gtcccgatca ccttccaggt gaaggtcacg 1321 gccacagagt gcatccagga gcagtcgttt gtcatccggg cgctgggctt cacggacata 1381 gtgaccgtgc aggttcttcc ccagtgtgag tgccggtgcc gggaccagag cagagaccgc 1441 agcctctgcc atggcaaggg cttcttggag tgcggcatct gcaggtgtga cactggctac 1501 attgggaaaa actgtgagtg ccagacacag ggccggagca gccaggagct ggaaggaagc 1561 tgccggaagg acaacaactc catcatctgc tcagggctgg gggactgtgt ctgcgggcag 1621 tgcctgtgcc acaccagcga cgtccccggc aagctgatat acgggcagta ctgcgagtgt 1681 gacaccatca actgtgagcg ctacaacggc caggtctgcg gcggcccggg gagggggctc 1741 tgcttctgcg ggaagtgccg ctgccacccg ggctttgagg gctcagcgtg ccagtgcgag 1801 aggaccactg agggctgcct gaacccgcgg cgtgttgagt gtagtggtcg tggccggtgc 1861 cgctgcaacg tatgcgagtg ccattcaggc taccagctgc ctctgtgcca ggagtgcccc 1921 ggctgcccct caccctgtgg caagtacatc tcctgcgccg agtgcctgaa gttcgaaaag 1981 ggcccctttg ggaagaactg cagcgcggcg tgtccgggcc tgcagctgtc gaacaacccc 2041 gtgaagggca ggacctgcaa ggagagggac tcagagggct gctgggtggc ctacacgctg 2101 gagcagcagg acgggatgga ccgctacctc atctatgtgg atgagagccg agagtgtgtg 2161 gcaggcccca acatcgccgc catcgtcggg ggcaccgtgg caggcatcgt gctgatcggc 2221 attctcctgc tggtcatctg gaaggctctg atccacctga gcgacctccg ggagtacagg 2281 cgctttgaga aggagaagct caagtcccag tggaacaatg ataatcccct tttcaagagc 2341 gccaccacga cggtcatgaa ccccaagttt gctgagagtt aggagcactt ggtgaagaca 2401 aggccgtcag gacccaccat gtctgcccca tcacgcggcc gagacatggc ttggccacag 2461 ctcttgagga tgtcaccaat taaccagaaa tccagttatt ttccgccctc aaaatgacag 2521 ccatggccgg ccggtgcttc tgggggctcg tcggggggac agctccactc tgactggcac 2581 agtctttgca tggagacttg aggagggctt gaggttggtg aggttaggtg cgtgtttcct 2641 gtgcaagtca ggacatcagt ctgattaaag gtggtgccaa tttatttaca tttaaacttg 2701 tcagggtata aaatgacatc ccattaatta tattgttaat caatcacgtg tatagaaaaa 2761 aaaataaaac ttcaat // LOCUS HUMLAPA 3595 bp mRNA PRI 06-JAN-1995 DEFINITION Human CD11b (MAC-1/Mo1/CR3) leukocyte adhesion receptor alpha subunit mRNA, partial cds. ACCESSION M18044 J03270 M19664 X07421 NID g186935 KEYWORDS complement receptor 3; integrin; leukocyte adhesion glycoprotein; leukocyte adhesion receptor. SOURCE Human monocyte and peripheral blood lymphocyte, cDNA to mRNA, clone 5B. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3595) AUTHORS Arnaout,M.A., Gupta,S.K., Pierce,M.W. and Tenen,D.G. TITLE Amino acid sequence of the alpha subunit of human leukocyte adhesion receptor Mo1 (complement receptor type 3) JOURNAL J. Cell Biol. 106 (6), 2153-2158 (1988) MEDLINE 88257215 REFERENCE 2 (bases 2821 to 3208) AUTHORS Arnaout,M.A., Remold-O'Donnell,E., Pierce,M.W., Harris,P. and Tenen,D.G. TITLE Molecular cloning of the alpha subunit of human and guinea pig leukocyte adhesion glycoprotein Mo1: chromosomal localization and homology to the alpha subunits of integrins JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85 (8), 2776-2780 (1988) MEDLINE 88190151 COMMENT Draft entry and printed copy of sequence for [2],[1] kindly provided by M.A.Arnaout, 25-Apr-1988. FEATURES Location/Qualifiers source 1..3595 /organism="Homo sapiens" /db_xref="taxon:9606" /map="1q23-q25" sig_peptide 76..123 /gene="LYAM1" /note="leukocyte adhesion glycoprotein signal peptide" CDS 76..3534 /gene="LYAM1" /note="leukocyte adhesion glycoprotein precursor" /codon_start=1 /db_xref="GDB:G00-120-157" /db_xref="PID:g307114" /translation="MALRVLLLTALTLCHGFNLDTENAMTFQENARGFGQSVVQLQGS RVVVGAPQEIVAANQRGSLYQCDYSTGSCEPIRLQVPVEAVNMSLGLSLAATTSPPQL LACGPTVHQTCSENTYVKGLCFLFGSNLRQQPQKFPEALRGCPQEDSDIAFLIDGSGS IIPHDFRRMKEFVSTVMEQLKKSKTLFSLMQYSEEFRIHFTFKEFQNNPNPRSLVKPI TQLLGRTHTATGIRKVVRELFNITNGARKNAFKILVVITDGEKFGDPLGYEDVIPEAD REGVIRYVIGVGDAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTIQNQLREKIFA IEGTQTGSSSSFEHEMSQEGFSAAITSNGPLLSTVGSYDWAGGVFLYTSKEKSTFINM TRVDSDMNDAYLGYAAAIILRNRVQSLVLGAPRYQHIGLVAMFRQNTGMWESNANVKG TQIGAYFGASLCSVDVDSNGSTDLVLIGAPHYYEQTRGGQVSVCPLPRGRARWQCDAV LYGEQGQPWGRFGAALTVLGDVNGDKLTDVAIGAPGEEDNRGAVYLFHGTSGSGISPS HSQRIAGSKLSPRLQYFGQSLSGGQDLTMDGLVDLTVGAQGHVLLLRSQPVLRVKAIM EFNPREVARNVFECNDQVVKGKEAGEVRVCLHVQKSTRDRLREGQIQSVVTYDLALDS GRPHSRAVFNETKNSTRRQTQVLGLTQTCETLKLQLPNCIEDPVSPIVLRLNFSLVGT PLSAFGNLRPVLAEDAQRLFTALFPFEKNCGNDNICQDDLSITFSFMSLDCLVVGGPR EFNVTVTVRNDGEDSYRTQVTFFFPLDLSYRKVSTLQNQRSQRSWRLACESASSTEVS GALKSTSCSINHPIFPENSEVTFNITFDVDSKASLGNKLLLKANVTSENNMPRTNKTE FQLELPVKYAVYMVVTSHGVSTKYLNFTASENTSRVMQHQYQVSNLGQRSPPISLVFL VPVRLNQTVIWDRPQVTFSENLSSTCHTKERLPSHSDFLAELRKAPVVNCSIAVCQRI QCDIPFFGIQEEFNATLKGNLSFDWYIKTSHNHLLIVSTAEILFNDSVFTLLPGQGAF VRSQTETKVEPFEVPNPLPLIVGSSVGGLLLLALITAALYKLGFFKRQYKDMMSEGGP PGAEPQ" gene 76..3534 /gene="LYAM1" mat_peptide 124..3531 /gene="LYAM1" /note="leukocyte adhesion glycoprotein" BASE COUNT 809 a 1034 c 993 g 759 t ORIGIN Chromosome 16. 1 tggcttcctt gtggttcctc agtggtgcct gcaacccctg gttcacctcc ttccaggttc 61 tggctccttc cagccatggc tctcagagtc cttctgttaa cagccttgac cttatgtcat 121 gggttcaact tggacactga aaacgcaatg accttccaag agaacgcaag gggcttcggg 181 cagagcgtgg tccagcttca gggatccagg gtggtggttg gagcccccca ggagatagtg 241 gctgccaacc aaaggggcag cctctaccag tgcgactaca gcacaggctc atgcgagccc 301 atccgcctgc aggtccccgt ggaggccgtg aacatgtccc tgggcctgtc cctggcagcc 361 accaccagcc cccctcagct gctggcctgt ggtcccaccg tgcaccagac ttgcagtgag 421 aacacgtatg tgaaagggct ctgcttcctg tttggatcca acctacggca gcagccccag 481 aagttcccag aggccctccg agggtgtcct caagaggata gtgacattgc cttcttgatt 541 gatggctctg gtagcatcat cccacatgac tttcggcgga tgaaggagtt tgtctcaact 601 gtgatggagc aattaaaaaa gtccaaaacc ttgttctctt tgatgcagta ctctgaagaa 661 ttccggattc actttacctt caaagagttc cagaacaacc ctaacccaag atcactggtg 721 aagccaataa cgcagctgct tgggcggaca cacacggcca cgggcatccg caaagtggta 781 cgagagctgt ttaacatcac caacggagcc cgaaagaatg cctttaagat cctagttgtc 841 atcacggatg gagaaaagtt tggcgatccc ttgggatatg aggatgtcat ccctgaggca 901 gacagagagg gagtcattcg ctacgtcatt ggggtgggag atgccttccg cagtgagaaa 961 tcccgccaag agcttaatac catcgcatcc aagccgcctc gtgatcacgt gttccaggtg 1021 aataactttg aggctctgaa gaccattcag aaccagcttc gggagaagat ctttgcgatc 1081 gagggtactc agacaggaag tagcagctcc tttgagcatg agatgtctca ggaaggcttc 1141 agcgctgcca tcacctctaa tggccccttg ctgagcactg tggggagcta tgactgggct 1201 ggtggagtct ttctatatac atcaaaggag aaaagcacct tcatcaacat gaccagagtg 1261 gattcagaca tgaatgatgc ttacttgggt tatgctgccg ccatcatctt acggaaccgg 1321 gtgcaaagcc tggttctggg ggcacctcga tatcagcaca tcggcctggt agcgatgttc 1381 aggcagaaca ctggcatgtg ggagtccaac gctaatgtca agggcaccca gatcggcgcc 1441 tacttcgggg cctccctctg ctccgtggac gtggacagca acggcagcac cgacctggtc 1501 ctcatcgggg ccccccatta ctacgagcag acccgagggg gccaggtgtc cgtgtgcccc 1561 ttgcccaggg ggagggctcg gtggcagtgt gatgctgttc tctacgggga gcagggccaa 1621 ccctggggcc gctttggggc agccctaaca gtgctggggg acgtaaatgg ggacaagctg 1681 acggacgtgg ccattggggc cccaggagag gaggacaacc ggggtgctgt ttacctgttt 1741 cacggaacct caggatctgg catcagcccc tcccatagcc agcggatagc aggctccaag 1801 ctctctccca ggctccagta ttttggtcag tcactgagtg ggggccagga cctcacaatg 1861 gatggactgg tagacctgac tgtaggagcc caggggcacg tgctgctgct caggtcccag 1921 ccagtactga gagtcaaggc aatcatggag ttcaatccca gggaagtggc aaggaatgta 1981 tttgagtgta atgatcaggt ggtgaaaggc aaggaagccg gagaggtcag agtctgcctc 2041 catgtccaga agagcacacg ggatcggcta agagaaggac agatccagag tgttgtgact 2101 tatgacctgg ctctggactc cggccgccca cattcccgcg ccgtcttcaa tgagacaaag 2161 aacagcacac gcagacagac acaggtcttg gggctgaccc agacttgtga gaccctgaaa 2221 ctacagttgc cgaattgcat cgaggaccca gtgagcccca ttgtgctgcg cctgaacttc 2281 tctctggtgg gaacgccatt gtctgctttc gggaacctcc ggccagtgct ggcggaggat 2341 gctcagagac tcttcacagc cttgtttccc tttgagaaga attgtggcaa tgacaacatc 2401 tgccaggatg acctcagcat caccttcagt ttcatgagcc tggactgcct cgtggtgggt 2461 gggccccggg agttcaacgt gacagtgact gtgagaaatg atggtgagga ctcctacagg 2521 acacaggtca ccttcttctt cccgcttgac ctgtcctacc ggaaggtgtc cacactccag 2581 aaccagcgct cacagcgatc ctggcgcctg gcctgtgagt ctgcctcctc caccgaagtg 2641 tctggggcct tgaagagcac cagctgcagc ataaaccacc ccatcttccc ggaaaactca 2701 gaggtcacct ttaatatcac gtttgatgta gactctaagg cttcccttgg aaacaaactg 2761 ctcctcaagg ccaatgtgac cagtgagaac aacatgccca gaaccaacaa aaccgaattc 2821 caactggagc tgccggtgaa atatgctgtc tacatggtgg tcaccagcca tggggtctcc 2881 actaaatatc tcaacttcac ggcctcagag aataccagtc gggtcatgca gcatcaatat 2941 caggtcagca acctggggca gaggagcccc cccatcagcc tggtgttctt ggtgcccgtc 3001 cggctgaacc agactgtcat atgggaccgc ccccaggtca ccttctccga gaacctctcg 3061 agtacgtgcc acaccaagga gcgcttgccc tctcactccg actttctggc tgagcttcgg 3121 aaggcccccg tggtgaactg ctccatcgct gtctgccaga gaatccagtg tgacatcccg 3181 ttctttggca tccaggaaga attcaatgct accctcaaag gcaacctctc gtttgactgg 3241 tacatcaaga cctcgcataa ccacctcctg atcgtgagca cagctgagat cttgtttaac 3301 gattccgtgt tcaccctgct gccgggacag ggggcgtttg tgaggtccca gacggagacc 3361 aaagtggagc cgttcgaggt ccccaacccc ctgccgctca tcgtgggcag ctctgtcggg 3421 ggactgctgc tcctggccct catcaccgcc gcgctgtaca agctcggctt cttcaagcgg 3481 caatacaagg acatgatgag tgaagggggt cccccggggg ccgaacccca gtagcggctc 3541 cttcccgaca gagctgcctc tcggtggcca gcaggactct gcccagacca cacgt // LOCUS HUMLBR 3714 bp mRNA PRI 07-JAN-1995 DEFINITION Human lamin B receptor (LBR) mRNA, complete cds. ACCESSION L25931 NID g438638 KEYWORDS integral membrane protein; nuclear envelope protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3714) AUTHORS Ye,Q. and Worman,H.J. TITLE Primary structure analysis and lamin B and DNA binding of human LBR, an integral protein of the nuclear envelope inner membrane JOURNAL J. Biol. Chem. 269 (15), 11306-11311 (1994) MEDLINE 94209307 FEATURES Location/Qualifiers source 1..3714 /organism="Homo sapiens" /db_xref="taxon:9606" 5'UTR 1..75 /gene="LBR" gene 1..3714 /gene="LBR" CDS 76..1923 /gene="LBR" /codon_start=1 /product="lamin B receptor" /db_xref="PID:g438639" /translation="MPSRKFADGEVVRGRWPGSSLYYEVEILSHDSTSQLYTVKYKDG TELELKENDIKPLTSFRQRKGGSTSSSPSRRRGSRSRSRSRSPGRPPKSARRSASASH QADIKEARREVEVKLTPLILKPFGNSISRYNGEPEHIERNDAPHKNTQEKFSLSQESS YIATQYSLRPRREEVKLKEIDSKEEKYVAKELAVRTFEVTPIRAKDLEFGGVPGVFLI MFGLPVFLFLLLLMCKQKDPSLLNFPPPLPALYELWETRVFGVYLLWFLIQVLFYLLP IGKVVEGTPLIDGRRLKYRLNGFYPFILTSAVIGTSLFQGVEFHYVYSHFLQFALAAT VFCVVLSVYLYMRSLKAPRNDLSPASSGNAVYDFFIGRELNPRIGTFDLKYFCELRPG LIGWVVINLVMLLAEMKIQDRAVPSLAMILVNSFQLLYVVDALWNEEALLTTMDIIHD GFGFMLAFGDLVWVPFIYSFQAFYLVSHPNEVSWPMASLIIVLKLCGYVIFRGANSQK NAFRKNPSDPKLAHLKTIHTSSGKNLLVSGWWGFVRHPNYLGDLIMALAWSLPCGFNH ILPYFYIIYFTMLLVHREARDEYHCKKKYGVAWEKYCQRVPYRIFPYIY" 3'UTR 1924..3714 /gene="LBR" BASE COUNT 1046 a 644 c 759 g 1265 t ORIGIN 1 ccgggttgct gtgcgactat tctccgggag ccgttcgtgt caccgccgga acctggcgca 61 ggttaattat agaaaatgcc aagtaggaaa tttgccgatg gtgaagtggt aagaggtcga 121 tggcctggga gttcacttta ttatgaagta gaaattctga gccacgacag cacctcccag 181 ctttacactg tgaagtataa agatggaaca gagcttgaat tgaaagagaa tgatattaag 241 cctttaactt cctttaggca aaggaaaggt ggctcaactt ccagttcccc ttccagacgc 301 cgagggagtc gatcaaggtc acgctcccga tcccctggtc gaccacctaa aagtgcccgc 361 cgatctgctt ctgcttccca ccaggccgac attaaggaag caaggaggga agtggaagtt 421 aaattgactc cgctgattct gaagccattt ggaaatagca tcagcagata taatggggag 481 cctgagcata ttgagagaaa tgacgcacct cataaaaata cacaggaaaa attcagtttg 541 tcacaagaaa gcagttacat agcaacacag tatagccttc gtccaagaag agaagaagtc 601 aaattaaaag aaatagattc taaggaagaa aaatacgttg caaaagaact ggcagtgaga 661 acctttgaag tgacccccat ccgggcaaag gacttggagt ttggaggagt acctggtgtg 721 tttctcatca tgtttggcct gcctgtgttc ctcttcctgt tgctgttgat gtgtaaacag 781 aaagatccca gtcttctgaa tttccctcct cctttgccag ctttgtatga gttatgggaa 841 accagagtat ttggggtcta cctcctgtgg tttttgattc aagtcctgtt ctacctactg 901 ccaattggaa aggttgtaga aggaacgcct cttattgatg gaagaagact caagtataga 961 ttaaatggat tctatccttt tatcctgaca tctgcagtca tcggaacatc tctcttccag 1021 ggcgtagagt ttcattacgt gtacagtcat tttcttcagt ttgcacttgc ggccactgtt 1081 ttttgtgtgg tcttgagtgt gtatctctac atgcgctctt tgaaagcgcc ccggaatgac 1141 ctgtcgcctg ccagctctgg aaatgctgtc tatgatttct tcattggccg tgaattaaac 1201 cctcgaattg gtacttttga tctcaaatac ttttgtgaat tgcgccccgg attgattgga 1261 tgggtggtta ttaacttggt gatgcttttg gctgaaatga aaatacagga ccgcgctgtt 1321 ccatccttgg ccatgatttt agttaatagt ttccagcttc tctatgtggt ggatgctctc 1381 tggaatgagg aagcgttgtt gacgaccatg gacatcatcc acgatggatt tggattcatg 1441 ctggcttttg gagacttggt gtgggttccc tttatttaca gcttccaagc cttttattta 1501 gtcagtcatc caaatgaagt gtcttggcca atggcttctc taattattgt tctgaaactt 1561 tgtggttatg taatcttccg aggtgcaaat tctcagaaaa atgcattccg gaaaaatccc 1621 agtgatccaa agcttgcaca tttaaaaacc attcatactt caagtggaaa aaatcttcta 1681 gtttctggat ggtggggctt tgttcgccac cccaattact tgggtgatct catcatggcc 1741 ttggcgtggt ccctcccatg tggttttaac cacattctgc cttatttcta cataatttat 1801 ttcaccatgt tgcttgtcca ccgagaagct cgtgacgagt accactgtaa gaagaaatac 1861 ggcgtggctt gggaaaagta ctgtcagcgt gtgccctacc gtatatttcc atacatctac 1921 taatgctctt ctggcttttc tacaaaatac tcctgcaatt ccagctgcca tttgcaaaaa 1981 caggaaaaaa atccgaaact ttcttttgtt gcactgacag ggtctgtact tttttttttc 2041 tttttgagtc aggactatgg agccgagtag ttgatctttt aatatagccg tgtttacttg 2101 tattaactta cagttaacat aggaaaaata caagtaagga tgtgagaatt tgcattttaa 2161 tgggaaattt tcaaccctta atctgaaaac agaagacagt cttaatataa atgtactgtg 2221 aagaatgcta ttgatgttta tggtttctga ttacttttca aattttgatg tttttttgcc 2281 agttggcttt tcttaaatga aaacactgtt ccatttaaag tacatttatg ttttattcag 2341 taagagaata gaattttcat ttgtttttct ttaaatcctt tactaattat ataatttgaa 2401 agcaaaaaga agggcctata ttaaatgctg aaagtgaaaa gtgatgacat tattagcaga 2461 cactgcttaa aggagaccat ttgtagcagt tggcttaacc tcaacttcta aaactacatt 2521 gaaaatgtaa atacatagct tagttttttg taatatatgg tgacttcaga tttttttgta 2581 cagtattttg aatgtgagat gattgtcagg actaactgtc tttttaacaa aacattttca 2641 gtattttaaa taaaattttg taaagtaatg tgaattaaaa attttggaac aattagaatt 2701 cattcactat tgtatagaag atgctgttaa aacataggaa gggtattttt cttgatccaa 2761 agtttgtgaa tttggctttg ctacctcaat tgcaggtgtt tgtttgcctt tataaactgt 2821 tgcaaataga aaaaaaatag aataagtata tatttttgga gtaacatcaa tatttaaaca 2881 tttttacaca gatcggtgtt tgaaaatttg ccatttcagg ctaatatttt tatatatttt 2941 tgacttttta aaagttcatc agtgtttttg ctactgttaa gcttatgcag tttatactgt 3001 attttttatg tatcctttat atttaccaaa cctgactccc tgtaaaggag tgctgtctta 3061 aaaacaactg aaggggttaa agtcgtttct tttagtttaa tagatgtgca taaggtagct 3121 ttagcaatta aattctagtg aagttgatat agtctcattt ttaattgtcc tgtaatggaa 3181 cagtagcaaa ttcactaaac ttttgtgttc agagttaaat tgttctcagt actttcaatg 3241 taggggaatg taataaacat agtgtgtatg tttgggtttt aattacacat tttatatatg 3301 agccatttag atatgcagtg ttaattctat actgcatttg aagtgtatgt aacttagctt 3361 atgttaatgc agtcatgaag ttggtttgct ccagcatccg gtagtcttta aacattcttt 3421 tagtgaaatt gtcattgttt tatcagtgct aatgtgtgca agcagttttt ttattttgct 3481 tttctcctgg catcagaaag tggtggcgtt ttctgtactg gattgcacca aggaagcttt 3541 tggggaggaa ggaaggacat taaattcttt ccctggtaat gaaaagagcc ctttatcaat 3601 acagtgctgc aatttctgga tatcagctac actttgtttt taagtttgtt tttgacatgt 3661 ttatttggca aattttataa tgaagtttta agttgaaaat aaaatgtagc aaca // LOCUS HUMLCACHA 1794 bp mRNA PRI 23-JUL-1993 DEFINITION Human L-type voltage-gated calcium channel B subunit mRNA for isoform a, complete cds. ACCESSION L06110 NID g187014 KEYWORDS L-type calcium channel; L-type voltage-dependent calcium channel; calcium channel protein; cardiac calcium channel; voltage-dependent calcium channel; voltage-gated calcium channel. SOURCE Homo sapiens (library: Zap II (Stratagene)) female Cardiac muscle cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1794) AUTHORS Collin,T., Wang,J., Nargeot,J. and Schwartz,A. TITLE Molecular cloning of three isoforms of the L-type voltage-dependent calcium channel B subunit from normal human heart JOURNAL Circ. Res. 72, 1337-1344 (1993) MEDLINE 93265672 FEATURES Location/Qualifiers source 1..1794 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="female" /tissue_type="Cardiac muscle" /tissue_lib="Zap II (Stratagene)" CDS 1..1794 /standard_name="betaa" /note="isoform a; putative" /codon_start=1 /product="L-type voltage-gated calcium channel B subunit" /db_xref="PID:g187015" /translation="MVQKSGMSRGPYPPSQEIPMEVFDPSPQRKYSKRKGRFKRSDGS TSSDTTSNSFVRQGSAESYTSRPSDSDVSLEEDREALRKEAERQALAQLEKAKTKPVA FAVRTNVGYNPSPGDEVPVQGVAITFEPKDFLDIKEKYNNDWWIGRLVKEGCEVGFIP SPVKLDSLRLLQEQTVRQNRLSSSKSGDNSSSSLGDVVTGTRRPTPPASAKQKQKSSE HVPPYDVVPSMRPIILVGPSLKGYEVTDMMQKALFDFLKHRFDGRISITRVTADISLA KRSVLNNPSKHIIIERLQHTSSLAEVQSEIERIFELARTLQLVALDADTINHPAQLSK TSLAPIIVYLKITSPKVLQRLIKSRGKSQSKHLNVQIAASEKLAQCPPEMFDIILDEN QLEDACEHLAEYLEAYWKATHPPSSTPPNPLLNRTIATAALAASPAPVSNLQGPYLAS GDQPLDRATGEHASVHEYPGELGQPPGLYPSNHPPGRAGTLWALSRQDTFDADTPGSR NSAYTEPGDSCVDMETDPSEGPGPGDPAGGGTPPARQGSWEEEEDYEEEMTDNRNRGR NKARYCAEGGGPVLGRNKNELEGWGQGVYIR" BASE COUNT 425 a 561 c 502 g 306 t ORIGIN 1 atggtccaga agagcggcat gtcccggggc ccttacccac cttcccaaga gatccctatg 61 gaggtcttcg accccagccc acagcggaaa tacagcaaga ggaaagggcg gttcaaaagg 121 tcagatggga gcacctcctc agataccaca tccaacagct ttgtccgtca gggctcagca 181 gagtcctaca cgagccggcc gtcagactcc gacgtgtccc tggaggagga ccgggaagcc 241 ttaaggaagg aggcagagcg ccaggcctta gcccagctcg agaaagccaa gaccaaacca 301 gtggcctttg ctgttcggac aaatgttggc tacaatccgt ctccaggaga tgaggtgcca 361 gtgcagggag tggccatcac ctttgagccc aaggacttcc tcgacatcaa ggagaaatac 421 aataatgact ggtggatcgg gaggctggtg aaggaaggct gtgaggttgg tttcatcccc 481 agccctgtca aactggacag ccttcgcctg ctgcaggaac agaccgtgcg ccaaaaccgc 541 ctcagctcca gcaagtcagg tgacaactcc agttccagtc tgggagatgt ggtgactggc 601 acccgccgcc ccacaccccc cgccagtgcc aaacagaagc agaagtcgtc agagcacgtg 661 cccccctatg acgtggtgcc ttccatgagg cccatcatcc tggtgggacc atcgctcaag 721 ggctatgagg taactgacat gatgcagaaa gctttatttg acttcttgaa gcatcggttt 781 gatggcagga tctccatcac tcgtgtgacg gcagatattt ccctggctaa gcgctcagtt 841 ctcaacaacc ccagcaaaca catcatcatt gagcgactcc aacacacgtc cagcctggct 901 gaggtgcaga gtgaaatcga gcgaatcttc gagctggccc ggacccttca gttggttgcc 961 ctggacgccg acaccatcaa ccacccagcc cagctctcta aaacctcgct ggcccccatc 1021 attgtttacc tcaagatcac ctctcccaag gtacttcaaa ggctcatcaa gtcccgagga 1081 aagtctcagt ccaaacacct caatgtccaa atagcggcct cggaaaagct ggcacagtgc 1141 ccccctgaaa tgtttgacat catcctggat gagaaccaat tggaggatgc ctgcgagcat 1201 ctggcggagt acttggaagc ctattggaag gccacacacc cgcccagcag cacgccaccc 1261 aatccgctgc tgaaccgcac catcgctacc gccgctctgg ctgccagccc tgcccccgtc 1321 tccaacctcc agggacccta ccttgcttcc ggggaccagc cgctggaccg ggccactggg 1381 gagcatgcca gtgtgcacga gtaccccggg gagctgggcc agcccccagg cctttacccc 1441 agcaaccacc cacctggccg ggcaggcacc ctgtgggcgc tatcccgcca agacaccttt 1501 gatgctgaca cccccggcag ccgaaactct gcctacacgg agccaggaga ctcgtgtgtg 1561 gacatggaga cagacccctc agagggccca gggcctggag accctgcagg gggaggtaca 1621 ccaccagctc ggcagggctc ctgggaagag gaggaagatt atgaggagga gatgaccgac 1681 aacaggaacc ggggccggaa taaggcccgc tactgtgcgg agggtggtgg gccggttctg 1741 gggcgcaata agaatgagct ggagggctgg ggacaaggcg tctacatccg ctga // LOCUS HUMLGALS2A 429 bp mRNA PRI 07-JAN-1995 DEFINITION Human S-lac lectin L-14-II (LGALS2) mRNA, complete cds. ACCESSION M87842 M87010 NID g187129 KEYWORDS S-lac lectin; beta-galactosidase-binding protein. SOURCE Homo sapiens (tissue library: lambda gt11) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 429) AUTHORS Gitt,M.A., Massa,S.M., Leffler,H. and Barondes,S.H. TITLE Isolation and expression of a gene encoding L-14-II, a new human soluble lactose-binding lectin JOURNAL J. Biol. Chem. 267 (15), 10601-10606 (1992) MEDLINE 92268105 FEATURES Location/Qualifiers source 1..429 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HepG2" /tissue_lib="lambda gt11" /map="1p13" gene 27..425 /gene="LGALS2" CDS 27..425 /gene="LGALS2" /codon_start=1 /db_xref="GDB:G00-127-515" /product="S-lac lectin" /db_xref="PID:g187130" /translation="MTGELEVKNMDMKPGSTLKITGSIADGTDGFVINLGQGTDKLNL HFNPRFSESTIVCNSLDGSNWGQEQREDHLCFSPGSEVKFTVTFESDKFKVKLPDGHE LTFPNRLGHSHLSYLSVRGGFNMSSFKLKE" BASE COUNT 118 a 106 c 122 g 83 t ORIGIN 1 gggagctgcc gccaggagct gtcaccatga cgggggaact tgaggttaag aacatggaca 61 tgaagccggg gtcaaccctg aagatcacag gcagcatcgc cgatggcact gatggctttg 121 taattaatct gggccagggg acagacaagc tgaacctgca tttcaaccct cgcttcagcg 181 aatccaccat tgtctgcaac tcattggacg gcagcaactg ggggcaagaa caacgggaag 241 atcacctgtg cttcagccca gggtcagagg tcaagttcac agtgaccttt gagagtgaca 301 aattcaaggt gaagctgcca gatgggcacg agctgacttt tcccaacagg ctgggtcaca 361 gccacctgag ctacctgagc gtaaggggcg ggttcaacat gtcctctttc aagttaaaag 421 aataaaaga // LOCUS HUMLGTPA 3168 bp mRNA PRI 07-JAN-1995 DEFINITION Human liver glucose transporter-like protein (GLUT2), complete cds. ACCESSION J03810 NID g187133 KEYWORDS glucose transporter protein; membrane protein; transport protein. SOURCE Human liver, cDNA to mRNA, clone lambda-hHTL-3. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3168) AUTHORS Fukumoto,H., Seino,S., Imura,H., Seino,Y., Eddy,R.L., Fukushima,Y., Byers,M.G., Shows,T.B. and Bell,G.I. TITLE Sequence, tissue distribution, and chromosomal localization of mRNA encoding a human glucose transporter-like protein JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85 (15), 5434-5438 (1988) MEDLINE 88289735 COMMENT Draft entry and computer readable form of sequence [1] kindly provided by G.I.Bell 30-JUN-1988. FEATURES Location/Qualifiers source 1..3168 /organism="Homo sapiens" /db_xref="taxon:9606" /map="3q26.1-q26.3" gene 39..1613 /gene="GLUT2" CDS 39..1613 /gene="GLUT2" /note="glucose transporter-like protein" /codon_start=1 /db_xref="GDB:G00-119-995" /db_xref="PID:g307125" /translation="MTEDKVTGTLVFTVITAVLGSFQFGYDIGVINAPQQVIISHYRH VLGVPLDDRKAINNYVINSTDELPTISYSMNPKPTPWAEEETVAAAQLITMLWSLSVS SFAVGGMTASFFGGWLGDTLGRIKAMLVANILSLVGALLMGFSKLGPSHILIIAGRSI SGLYCGLISGLVPMYIGEIAPTALRGALGTFHQLAIVTGILISQIIGLEFILGNYDLW HILLGLSGVRAILQSLLLFFCPESPRYLYIKLDEEVKAKQSLKRLRGYDDVTKDINEM RKEREEASSEQKVSIIQLFTNSSYRQPILVALMLHVAQQFSGINGIFYYSTSIFQTAG ISKPVYATIGVGAVNMVFTAVSVFLVEKAGRRSLFLIGMSGMFVCAIFMSVGLVLLNK FSWMSYVSMIAIFLFVSFFEIGPGPIPWFMVAEFFSQGPRPAALAIAAFSNWTCNFIV ALCFQYIADFCGPYVFFLFAGVLLAFTLFTFFKVPETKGKSFEEIAAEFQKKSGSAHR PKAAVEMKFLGATETV" BASE COUNT 920 a 591 c 600 g 1057 t ORIGIN 30 bp upstream of SpeI site; Chromosome 3q26.1-q26.3. 1 cacaagacct ggaattgaca ggactcccaa ctagtacaat gacagaagat aaggtcactg 61 ggaccctggt tttcactgtc atcactgctg tgctgggttc cttccagttt ggatatgaca 121 ttggtgtgat caatgcacct caacaggtaa taatatctca ctatagacat gttttgggtg 181 ttccactgga tgaccgaaaa gctatcaaca actatgttat caacagtaca gatgaactgc 241 ccacaatctc atactcaatg aacccaaaac caaccccttg ggctgaggaa gagactgtgg 301 cagctgctca actaatcacc atgctctggt ccctgtctgt atccagcttt gcagttggtg 361 gaatgactgc atcattcttt ggtgggtggc ttggggacac acttggaaga atcaaagcca 421 tgttagtagc aaacattctg tcattagttg gagctctctt gatggggttt tcaaaattgg 481 gaccatctca tatacttata attgctggaa gaagcatatc aggactatat tgtgggctaa 541 tttcaggcct ggttcctatg tatatcggtg aaattgctcc aaccgctctc aggggagcac 601 ttggcacttt tcatcagctg gccatcgtca cgggcattct tattagtcag attattggtc 661 ttgaatttat cttgggcaat tatgatctgt ggcacatcct gcttggcctg tctggtgtgc 721 gagccatcct tcagtctctg ctactctttt tctgtccaga aagccccaga tacctttaca 781 tcaagttaga tgaggaagtc aaagcaaaac aaagcttgaa aagactcaga ggatatgatg 841 atgtcaccaa agatattaat gaaatgagaa aagaaagaga agaagcatcg agtgagcaga 901 aagtctctat aattcagctc ttcaccaatt ccagctaccg acagcctatt ctagtggcac 961 tgatgctgca tgtggctcag caattttccg gaatcaatgg cattttttac tactcaacca 1021 gcatttttca gacggctggt atcagcaaac ctgtttatgc aaccattgga gttggcgctg 1081 taaacatggt tttcactgct gtctctgtat tccttgtgga gaaggcaggg cgacgttctc 1141 tctttctaat tggaatgagt gggatgtttg tttgtgccat cttcatgtca gtgggacttg 1201 tgctgctgaa taagttctct tggatgagtt atgtgagcat gatagccatc ttcctctttg 1261 tcagcttctt tgaaattggg ccaggcccga tcccctggtt catggtggct gagtttttca 1321 gtcaaggacc acgtcctgct gctttagcaa tagctgcatt cagcaattgg acctgcaatt 1381 tcattgtagc tctgtgtttc cagtacattg cggacttctg tggaccttat gtgtttttcc 1441 tctttgctgg agtgctcctg gcctttaccc tgttcacatt ttttaaagtt ccagaaacca 1501 aaggaaagtc ttttgaggaa attgctgcag aattccaaaa gaagagtggc tcagcccaca 1561 ggccaaaagc tgctgtagaa atgaaattcc taggagctac agagactgtg taaaaaaaaa 1621 accctgcttt ttgacatgaa cagaaacaat aagggaaccg tctgttttta aatgatgatt 1681 ccttgagcat tttatatcca catctttaag tattgtttta tttttatgtg ctctcatcag 1741 aaatgtcatc aaatattacc aaaaaagtat ttttttaagt tagagaatat atttttgatg 1801 gtaagactgt aattaagtaa accaaaaagg ctagtttatt ttgttacact aaagggcagg 1861 tggttctaat atttttagct ctgttcttta taacaaggtt cttctaaaat tgaagagatt 1921 tcaacatatc atttttttaa cacataacta gaaacctgag gatgcaacaa atatttatat 1981 atttgaatat cattaaattg gaattttctt acccatatat cttatgttaa aggagatatg 2041 gctagtggca ataagttcca tgttaaaata gacaactctt ccatttattg cactcagctt 2101 ttttcttgag tactagaatt tgtattttgc ttaaaatttt acttttgttc tgtattttca 2161 tgtggaatgg attatagagt atactaaaaa atgtctatag agaaaaactt tcatttttgg 2221 taggcttatc aaaatctttc agcactcaga aaagaaaacc attttagttc ctttatttaa 2281 tggccaaatg gtttttgcaa gatttaacac taaaaaggtt tcacctgatc atatagcgtg 2341 ggttatcagt taacattaac atctattata aaaccatgtt gattcccttc tggtacaatc 2401 ctttgagtta tagtttgctt tgctttttaa ttgaggacag cctggttttc acatacactc 2461 aaacaatcat gagtcagaca tttggtatat tacctcaaat tcctaataag tttgatcaaa 2521 tctaatgtaa gaaaatttga agtaaaggat tgatcacttt gttaaaaata ttttctgaat 2581 tattatgtct caaaataagt tgaaaaggta gggtttgagg attcctgagt gtgggcttct 2641 gaaacttcat aaatgttcag cttcagactt ttatcaaaat ccctatttaa ttttcctgga 2701 aagactgatt gttttatggt gtgttcctaa cataaaataa tcgtctcctt tgacatttcc 2761 ttctttgtct tagctgtata cagattctag ccaaactatt ctatggccat tactaacacg 2821 cattgtacac tatctatctg cctttaccta cataggcaaa ttggaaatac acagatgatt 2881 aaacagactt tagcttacag tcaattttac aattatggaa atatagttct gatgggtccc 2941 aaaagcttag cagggtgcta acgtatctct aggctgtttt ctccaccaac tggagcactg 3001 atcaatcctt cttatgtttg ctttaatgtg tattgaagaa aagcactttt taaaaagtac 3061 tctttaagag tgaaataatt aaaaaccact gaacatttgc tttgttttct aaagttgttc 3121 acatatatgt aatttagcag tccaaagaac aagaaattgt ttcttttc // LOCUS HUMLHCGR 3019 bp mRNA PRI 19-JUL-1995 DEFINITION Homo sapiens lutropin/choriogonadotropin receptor (LHCGR) mRNA, complete cds. ACCESSION M73746 NID g903745 KEYWORDS G-protein linked receptor; lutropin; lutropin-choriogonadotropic receptor. SOURCE Homo sapiens (tissue library: lambda gt11) adult cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3019) AUTHORS Frazier,A.L., Robbins,L.S., Stork,P.J., Sprengel,R., Segaloff,D.L. and Cone,R.D. TITLE Isolation of TSH and LH/CG receptor cDNAs from human thyroid: regulation by tissue specific splicing JOURNAL Mol. Endocrinol. 4 (8), 1264-1276 (1990) MEDLINE 91155962 FEATURES Location/Qualifiers source 1..3019 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_lib="lambda gt11" /map="2p21" gene 39..2096 /gene="LHCGR" CDS 39..2096 /gene="LHCGR" /codon_start=1 /db_xref="GDB:G00-125-260" /product="lutropin/choriogonadotropin receptor" /db_xref="PID:g903746" /translation="MKQRFSPLQLLKLLLLLQAPLPRALRRLCPEPCNCVPDGALRAP APRPSTRLSLAYLPVKVIPSQSFRGLNEVIKIEISQIDSLERIEANAFDNLLNLSEIL IQNTKNLRYIEPGAFINLPRLKYLSICNTGIRKFPDVTKVFSSESNFILEICDNLHIT TIPGNAFQGMNNESVTLKLYGNGFEEVQSHAFNGTTLTSLELKENVHLEKMHNGAFRG ATGPKTLDISSTKLQALPSYGLESIQRLIATSSYSLKKLPSKQTFVNLLRATLHYPSH CCAFRNLPTKELNFSHSISENFSKQCESTVRKSELSGWDYEYGFCLPKTPRCAPEPDA FNPCEDIMGYDFLRVLIWLINILAIMGNMTVLFVLLTSRYKLTVPRFLMCNLSFADFC MGLYLLLIASVDSQTKGQYYNHAIDWQTGSGCSTAGFFTVLASELSVYTLTVITLERW HTITYAIHLDQKLRLRHAILIMLGGWLFSSLIAMLPLVGVSNYMKVSICFPMDVETTL SQVYILTILILNVVAFLIICACYIKIYFAVRNPELMATNKDTKIAKKMAILIFTDFTC MAPISFFAISAAFKVPLITVTNSKVLLVLFYPINSCANPFLYAIFTKTFQRDFFLLLS KFGCCKRRADPLYRRKDFSAYTSNCKNGFTGSNKPSQSTLKLSTLHCQGTALLDKTRY TEC" BASE COUNT 884 a 680 c 545 g 909 t 1 others ORIGIN 1 cagacactgg caagccgcag aagcccagtt cgccggccat gaagcagcgg ttctcgccgc 61 tgcagctgct gaagctgctg ctgctgctgc aggcgccgct gccacgagcg ctgcgcaggc 121 tctgccctga gccctgcaac tgcgtgcccg acggcgccct gcgtgccccg gccccacggc 181 cgtccactcg actatcactt gcctacctcc ctgtcaaagt gatcccatct caaagtttca 241 gaggacttaa tgaggtcata aaaattgaaa tctctcagat tgattccctg gaaaggatag 301 aagctaatgc ctttgacaac ctcctcaatt tgtctgaaat actgatccag aacaccaaaa 361 atctgagata cattgagccc ggagcattta taaatcttcc ccgattaaaa tacttgagca 421 tctgtaacac aggcatcaga aagtttccag atgttacgaa ggtcttctcc tctgaatcaa 481 atttcattct ggaaatttgt gataacttac acataaccac cataccagga aatgcttttc 541 aagggatgaa taatgaatct gtaacactca aactatatgg aaatggattt gaagaagtac 601 aaagtcatgc attcaatggg acgacactga cttcactgga gctaaaggaa aacgtacatc 661 tggagaagat gcacaatgga gccttccgtg gggccacagg gccgaaaacc ttggatattt 721 cttccaccaa attgcaggcc ctgccgagct atggcctaga gtccattcag aggctaattg 781 ccacgtcatc ctattctcta aaaaaattgc catcaaaaca aacatttgtc aatctcctga 841 gggccacgct tcattacccc agccactgct gtgcatttag aaacttgcca acgaaagagc 901 taaacttctc acattccatt tctgaaaact tttccaaaca atgtgaaagc acagtaagga 961 aaagtgaact gagtggctgg gactatgaat atggtttctg cttacccaag acaccccgat 1021 gtgctcctga accagatgct tttaatccct gtgaagacat tatgggctat gacttcctta 1081 gggtcctgat ttggctgatt aatattctag ccatcatggg aaacatgact gttctttttg 1141 ttctcctgac aagtcgttac aaacttacag tgcctcgttt tctcatgtgc aatctctcct 1201 ttgcagactt ttgcatgggg ctctatctgc tgctcatagc ctcagttgat tcccaaacca 1261 agggccagta ctataaccat gccatagact ggcagacagg gagtgggtgc agcactgctg 1321 gctttttcac tgtattagca agtgaacttt ctgtctacac cctcaccgtc atcactctag 1381 aaagatggca caccatcacc tatgctattc acctggacca aaagctgcga ttaagacatg 1441 ccattctgat tatgcttgga ggctggctct tttcttctct aattgctatg ttgccccttg 1501 tcggggtcag caattacatg aaggtcagta tttgcttccc catggatgtg gaaaccactc 1561 tctcacaagt ctatatatta accatcctga ttctcaatgt ggtggccttc ttaataattt 1621 gtgcttgcta cattaaaatt tattttgcag ttcgaaaccc agaattaatg gctaccaata 1681 aagatacaaa gattgctaag aaaatggcaa tcctcatctt caccgatttt acctgcatgg 1741 cacctatctc tttttttgcc atctcagctg ccttcaaagt acctcttatc acagtaacca 1801 actctaaagt tttactggtt cttttttatc ccatcaattc ttgtgccaat ccatttctgt 1861 atgcaatatt cactaagaca ttccaaagag atttctttct tttgctgagc aaatttggct 1921 gctgtaaacg tcgggctgat cctctttata gaaggaaaga tttttcagct tacacctcca 1981 actgcaaaaa tggcttcact ggatcaaata agccttctca atccaccttg aagttgtcca 2041 cattgcactg tcaaggtaca gctctcctag acaagactcg ctacacagag tgttaactgt 2101 tacatcagta actgcattat tgaattgttc ttaaacctgt aaaaaaaaat tacctgtacc 2161 agtaatttta acataaaggg ttggatttag gaaattattt atttttaggt acattaggca 2221 agagacctct accagtagaa agtgtagtct atgaccactg ccacactaaa aactatttgt 2281 cattgttaca ttggcataaa tactgaagtt gagagtgttt tttatagaaa ttttgacaca 2341 gtaattttgt ttgatgaatc ttttaaaaaa ctgaggaggt attttgcata tctttttttt 2401 cattttcgta atttgtattg cattctataa aaatattagt tcataacaga tcagaaattt 2461 aaaataactg gcctttttcc tcaggtagtt tgaaaaacac actctagaga tgcactgtcc 2521 aatccggtag ccactagcac atgtggctaa attaaaatta aataaaatga gaaatgtagt 2581 ttctcagttg cactagccac gtttcaagtt ctcaatggct acgtgtgact agtgcttacc 2641 atactggaca gcacagacac agaatatttt catcaccaca gaaagttcta tctgttctat 2701 tatagagact tttatctatg ccctatctgg attckactta tttataattt aaggtaaaca 2761 tctgaaagca catttcagcc tatttgctta gtgaaacatt aagctgtaga ctgtaaactc 2821 ctcgtgagta ggaaccctgt ctcagtgcat tttgttttcc tgcttcctac ctcaagatct 2881 tggcaatggt acactacaaa tgtgctgagt tagaattact ctgaagttat gaaacatata 2941 atgaaaacaa ttttttctag agcttatatt tttatttgaa tgaaataaaa tgtttaaata 3001 tttaaaaata aaaaaaaaa // LOCUS HUMLIC 1362 bp mRNA PRI 23-DEC-1997 DEFINITION Homo sapiens mRNA for lipocortin II, complete cds. ACCESSION D00017 M14043 N00017 NID g219909 KEYWORDS lipocortin II. SOURCE Homo sapiens placenta cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1362) AUTHORS Huang,K.S., Wallner,B.P., Mattaliano,R.J., Tizard,R., Burne,C., Frey,A., Hession,C., McGray,P., Sinclair,L.K., Chow,E.P., Browning,J.L., Ramachandran,K.L., Tang,J., Smart,J.E. and Pepinsky,R.B. TITLE Two human 35 kd inhibitors of phospholipase A2 are related to substrates of pp60v-src and of the epidermal growth factor receptor/kinase JOURNAL Cell 46 (2), 191-199 (1986) MEDLINE 86245065 COMMENT A cDNA of human lipocortin II, one of two phospholipase A2 inhibitors, was cloned and sequenced. It contains 1017 bp coding region and a polyadenylation signal, AATAAA, is identified in 3' untranslated region. The two human phospholipase A2 inhibitors, lipocortin I and II, share about 50% amino acid sequence homology and thus are derived evolutionarily from a common gene. FEATURES Location/Qualifiers source 1..1362 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" CDS 50..1069 /codon_start=1 /product="lipocortin II" /db_xref="PID:d1000439" /db_xref="PID:g219910" /translation="MSTVHEILCKLSLEGDHSTPPSAYGSVKAYTNFDAERDALNIET AIKTKGVDEVTIVNILTNRSNAQRQDIAFAYQRRTKKELASALKSALSGHLETVILGL LKTPAQYDASELKASMKGLGTDEDSLIEIICSRTNQELQEINRVYKEMYKTDLEKDII SDTSGDFRKLMVALAKGRRAEDGSVIDYELIDQDARDLYDAGVKRKGTDVPKWISIMT ERSVPHLQKVFDRYKSYSPYDMLESIRKEVKGDLENAFLNLVQCIQNKPLYFADRLYD SMKGKGTRDKVLIRIMVSRSEVDMLKIRSEFKRKYGKSLYYYIQQDTKGDYQKALLYL CGGDD" polyA_signal 1342..1347 BASE COUNT 384 a 316 c 343 g 319 t ORIGIN 1 catttgggga cgctctcagc tctcggcgca cggcccagct tccttcaaaa tgtctactgt 61 tcacgaaatc ctgtgcaagc tcagcttgga gggtgatcac tctacacccc caagtgcata 121 tgggtctgtc aaagcctata ctaactttga tgctgagcgg gatgctttga acattgaaac 181 agccatcaag accaaaggtg tggatgaggt caccattgtc aacattttga ccaaccgcag 241 caatgcacag agacaggata ttgccttcgc ctaccagaga aggaccaaaa aggaacttgc 301 atcagcactg aagtcagcct tatctggcca cctggagacg gtgattttgg gcctattgaa 361 gacacctgct cagtatgacg cttctgagct aaaagcttcc atgaaggggc tgggaaccga 421 cgaggactct ctcattgaga tcatctgctc cagaaccaac caggagctgc aggaaattaa 481 cagagtctac aaggaaatgt acaagactga tctggagaag gacattattt cggacacatc 541 tggtgacttc cgcaagctga tggttgccct ggcaaagggt agaagagcag aggatggctc 601 tgtcattgat tatgaactga ttgaccaaga tgctcgggat ctctatgacg ctggagtgaa 661 gaggaaagga actgatgttc ccaagtggat cagcatcatg accgagcgga gcgtgcccca 721 cctccagaaa gtatttgata ggtacaagag ttacagccct tatgacatgt tggaaagcat 781 caggaaagag gttaaaggag acctggaaaa tgctttcctg aacctggttc agtgcattca 841 gaacaagccc ctgtattttg ctgatcggct gtatgactcc atgaagggca aggggacgcg 901 agataaggtc ctgatcagaa tcatggtctc ccgcagtgaa gtggacatgt tgaaaattag 961 gtctgaattc aagagaaagt acggcaagtc cctgtactat tatatccagc aagacactaa 1021 gggcgactac cagaaagcgc tgctgtacct gtgtggtgga gatgactgaa gcccgacacg 1081 gcctgagcgt ccagaaatgg tgctcaccat gcttccagct aacaggtcta gaaaaccagc 1141 ttgcgaataa cagtccccgt ggccatccct gtgagggtga cgttagcatt acccccaacc 1201 tcattttagt tgcctaagca ttgcctggcc ttcctgtcta gtctctcctg taagccaaag 1261 aaatgaacat tccaaggagt tggaagtgaa gtctatgatg tgaaacactt tgcctcctgt 1321 gtactgtgtc ataaacagat gaataaactg aatttgtact tt // LOCUS HUMLIGAA 3083 bp mRNA PRI 07-JAN-1995 DEFINITION Human DNA ligase I mRNA, complete cds. ACCESSION M36067 NID g187142 KEYWORDS DNA ligase I. SOURCE Human T lymphoblast cell line Jurkat, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3083) AUTHORS Barnes,D.E., Johnston,L.H., Kodama,K., Tomkinson,A.E., Lasko,D.D. and Lindahl,T. TITLE Human DNA ligase I cDNA: cloning and functional expression in Saccharomyces cerevisiae JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87 (17), 6679-6683 (1990) MEDLINE 90370849 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Barnes, 03-JUL-1990. FEATURES Location/Qualifiers source 1..3083 /organism="Homo sapiens" /db_xref="taxon:9606" /map="19q13.3" mRNA <1..3083 /note="DNA ligase I" gene 121..2880 /gene="LIG1" CDS 121..2880 /gene="LIG1" /note="DNA ligase I" /codon_start=1 /db_xref="GDB:G00-127-274" /db_xref="PID:g187143" /translation="MQRSIMSFFHPKKEGKAKKPEKEASNSSRETEPPPKAALKEWNG VVSESDSPVKRPGRKAARVLGSEGEEEDEALSPAKGQKPALDCSQVSPPRPATSPENN ASLSDTSPMDSSPSGIPKRRTARKQLPKRTIQEVLEEQSEDEDREAKRKKEEEEEETP KESLTEAEVATEKEGEDGDQPTTPPKPLKTSKAETPTESVSEPEVATKQELQEEEEQT KPPRRAPKTLSSFFTPRKPAVKKEVKEEEPGAPGKEGAAEGPLDPSGYNPAKNNYHPV EDACWKPGQKVPYLAVARTFEKIEEVSARLRMVETLSNLLRSVVALSPPDLLPVLYLS LNHLGPPQQGLELGVGDGVLLKAVAQATGRQLESVRAEAAEKGDVGLVAENSRSTQRL MLPPPPLTASGVFSKFRDIARLTGSASTAKKIDIIKGLFVACRHSEARFIARSLSGRL RLGLAEQSVLAALSQAVSLTPPGQEFPPAMVDAGKGKTAEARKTWLEEQGMILKQTFC EVPDLDRIIPVLLEHGLERLPEHCKLSPGIPLKPMLAHPTRGISEVLKRFEEAAFTCE YKYDGQRAQIHALEGGEVKIFSRNQEDNTGKYPDIISRIPKIKLPSVTSFILDTEAVA WDREKKQIQPFQVLTTRKRKEVDASEIQVQVCLYAFDLIYLNGESLVREPLSRRRQLL RENFVETEGEFVFATSLDTKDIEQIAEFLEQSVKDSCEGLMVKTLDVDATYEIAKRSH NWLKLKKDYLDGVGDTLDLVVIGAYLGRGKRAGRYGGFLLASYDEDSEELQAICKLGT GFSDEELEEHHQSLKALVLPSPRPYVRIDGAVIPDHWLDPSAVWEVKCADLSLSPIYP AARGLVDSDKGISLRFPRFIRVREDKQPEQATTSAQVACLYRKQSQIQNQQGEDSGSD PEDTY" polyA_signal 3062..3067 BASE COUNT 716 a 882 c 962 g 523 t ORIGIN Chromosome 19. 1 cagaggcgcg cctggcggat ctgagtgtgt tgcccgggca gcggcgcgcg ggaccaacgc 61 aaggagcagc tgacagacga agaaaagtgc tggacaggaa gggagaattc tgacgccaac 121 atgcagcgaa gtatcatgtc atttttccac cccaagaaag agggtaaagc aaagaagcct 181 gagaaggagg catccaatag cagcagagag acggagcccc ctccaaaggc ggcactgaag 241 gagtggaatg gagtggtgtc cgagagtgac tctccggtga agaggccagg gaggaaggcg 301 gcccgggtcc tgggcagcga aggggaagag gaggatgaag cccttagccc tgctaaaggc 361 cagaagcctg ccctggactg ctcacaggtc tccccgcccc gtcctgccac atctcctgag 421 aacaatgctt ccctctctga cacctctccc atggacagtt ccccatcagg gattccgaag 481 cgtcgcacag ctcggaagca gctcccgaaa cggaccattc aggaagtcct ggaagagcag 541 agtgaggacg aggacagaga agccaagagg aagaaggagg aggaagaaga ggagaccccg 601 aaagaaagcc tcacagaggc tgaagtggca acagagaagg aaggagaaga cggggaccag 661 cccaccacgc ctcccaagcc cctaaagacc tccaaagcag agaccccgac ggaaagcgtt 721 tcagagcctg aggtggccac gaagcaggaa ctgcaggagg aggaagagca gaccaagcct 781 ccccgcagag ctcccaagac gctcagcagc ttcttcaccc cccggaagcc agcagtcaaa 841 aaagaagtga aggaagagga gccaggggct ccaggaaagg agggagctgc tgagggaccc 901 ctggatccat ctggttacaa tcctgccaag aacaactatc atcccgtgga agatgcctgc 961 tggaaaccgg gccagaaggt tccttacctg gctgtggccc ggacgtttga gaagatcgag 1021 gaggtgtctg ctcggctccg gatggtggag acgctgagca acttgctgcg ctccgtggtg 1081 gccctgtcgc ctccagacct cctccctgtc ctctacctca gcctcaacca ccttgggcca 1141 ccccagcagg gcctggagct tggcgtgggt gatggtgtcc ttctcaaggc agtggcccag 1201 gccacaggtc ggcagctgga gtccgtccgg gctgaggcag ccgagaaagg cgacgtgggg 1261 ctggtggccg agaacagccg cagcacccag aggctcatgc tgccaccacc tccgctcact 1321 gcctccgggg tcttcagcaa gttccgcgac atcgccaggc tcactggcag tgcttccaca 1381 gccaagaaga tagacatcat caaaggcctc tttgtggcct gccgccactc agaagcccgg 1441 ttcatcgcta ggtccctgag cggacggctg cgccttgggc tggcagagca gtcggtgctg 1501 gctgccctct cccaggcagt gagcctcacg cccccgggcc aagaattccc accagccatg 1561 gtggatgctg ggaagggcaa gacagcagag gccagaaaga cgtggctgga ggagcaaggc 1621 atgatcctga agcagacgtt ctgcgaggtt cccgacctgg accgaattat ccccgtgctg 1681 ctggagcacg gcctggaacg tctcccggag cactgcaagc tgagcccagg gattcccctg 1741 aaaccaatgt tggcccatcc cacccggggc atcagcgagg tcctgaaacg ctttgaggag 1801 gcagctttca cctgcgaata caaatatgac gggcagaggg cacagatcca cgccctggaa 1861 ggcggggagg tgaagatctt cagcaggaat caggaagaca acactgggaa gtacccggac 1921 atcatcagcc gcatccccaa gattaaactc ccatcggtca catccttcat cctggacacc 1981 gaagccgtgg cttgggaccg ggaaaagaag cagatccagc cattccaagt gctcaccacc 2041 cgcaaacgca aggaggtgga tgcgtctgag atccaggtgc aggtgtgttt gtacgccttc 2101 gacctcatct acctcaatgg agagtccctg gtacgtgagc ccctttcccg gcgccggcag 2161 ctgctccggg agaactttgt ggagacagag ggcgagtttg tcttcgccac ctccctggac 2221 accaaggaca tcgagcagat cgccgagttc ctggagcagt cagtgaaaga ctcctgcgag 2281 gggctgatgg tgaagaccct ggatgttgat gccacctacg agatcgccaa gagatcgcac 2341 aactggctca agctgaagaa ggactacctt gatggcgtgg gtgacaccct ggacctggtg 2401 gtgatcggcg cctacctggg ccgggggaag cgggccggcc ggtacggggg cttcctgctg 2461 gcctcctacg acgaggacag tgaggagctg caggccatat gcaagcttgg aactggcttc 2521 agtgatgagg agctggagga gcatcaccag agcctcaagg cgctggtgct gcccagccca 2581 cgcccttacg tgcggataga tggcgctgtg attcccgacc actggctgga ccccagcgct 2641 gtgtgggagg tgaagtgcgc tgacctctcc ctctctccca tctaccctgc tgcgcggggc 2701 ctggtggata gtgacaaggg catctccctt cgcttccctc ggtttattcg agtccgtgaa 2761 gacaagcagc cggagcaggc caccaccagt gctcaggtgg cctgtttgta ccggaagcaa 2821 agtcagattc agaaccaaca aggcgaggac tcaggctctg accctgaaga tacctactaa 2881 gccctcgccc tcctagggcc tgggtacagg gcatgagttg gacggacccc agggttatta 2941 ttgcctttgc tttttagcaa atctgctgtg gcaggctgtg gattttgaga gtcaggggag 3001 gggtgtgtgt gtgagggggt ggcttactcc ggagtctggg attcatcccg tcatttcttt 3061 caataaataa ttattggata gct // LOCUS HUMLIGAND 926 bp mRNA PRI 25-MAY-1993 DEFINITION Human CD27 ligand mRNA, complete cds. ACCESSION L08096 NID g307127 KEYWORDS CD27 ligand; antigen binding; transmembrane protein type II. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 926) AUTHORS Goodwin,R.G. TITLE Molecular cloning of a ligand for CD27 defines a new family of cytokines with homology to tumor necrosis factor JOURNAL Cell 73, 447-456 (1993) MEDLINE 93258810 FEATURES Location/Qualifiers source 1..926 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="MP-1" /cell_type="B-cell" CDS 151..732 /note="transmembrane domain (aa 21..38), potential glycosylation sites (aa 63) and (aa 170); homology to TNF and CD40 ligand; putative" /codon_start=1 /function="binds CD27" /product="CD27 ligand" /db_xref="PID:g307128" /translation="MPEEGSGCSVRRRPYGCVLRAALVPLVAGLVICLVVCIQRFAQA QQQLPLESLGWDVAELQLNHTGPQQDPRLYWQGGPALGRSFLHGPELDKGQLRIHRDG IYMVHIQVTLAICSSTTASRHHPTTLAVGICSPASRSISLLRLSFHQGCTIVSQRLTP LARGDTLCTNLTGTLLPSRNTDETFFGVQWVRP" BASE COUNT 178 a 275 c 271 g 202 t ORIGIN 1 ccagagaggg gcaggcttgt cccctgacag gttgaagcaa gtagacgccc aggagccccg 61 ggagggggct gcagtttcct tccttccttc tcggcagcgc tccgcgcccc catcgcccct 121 cctgcgctag cggaggtgat cgccgcggcg atgccggagg agggttcggg ctgctcggtg 181 cggcgcaggc cctatgggtg cgtcctgcgg gctgctttgg tcccattggt cgcgggcttg 241 gtgatctgcc tcgtggtgtg catccagcgc ttcgcacagg ctcagcagca gctgccgctc 301 gagtcacttg ggtgggacgt agctgagctg cagctgaatc acacaggacc tcagcaggac 361 cccaggctat actggcaggg gggcccagca ctgggccgct ccttcctgca tggaccagag 421 ctggacaagg ggcagctacg tatccatcgt gatggcatct acatggtaca catccaggtg 481 acgctggcca tctgctcctc cacgacggcc tccaggcacc accccaccac cctggccgtg 541 ggaatctgct ctcccgcctc ccgtagcatc agcctgctgc gtctcagctt ccaccaaggt 601 tgtaccattg tctcccagcg cctgacgccc ctggcccgag gggacacact ctgcaccaac 661 ctcactggga cacttttgcc ttcccgaaac actgatgaga ccttctttgg agtgcagtgg 721 gtgcgcccct gaccactgct gctgattagg gttttttaaa ttttatttta ttttatttaa 781 gttcaagaga aaaagtgtac acacaggggc cacccggggt tggggtggga gtgtggtggg 841 gggtagtttg tggcaggaca agagaaggca ttgagctttt tctttcattt tcctattaaa 901 aaatacaaaa atcaaaacaa aaaaaa // LOCUS HUMLIPH 1550 bp mRNA PRI 07-JAN-1995 DEFINITION Human hepatic lipase mRNA, complete cds. ACCESSION J03540 NID g187153 KEYWORDS lipase. SOURCE Human liver, cDNA to mRNA, clones lambda-HL[1,2,3]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1550) AUTHORS Datta,S., Luo,C.C., Li,W.H., VanTuinen,P., Ledbetter,D.H., Brown,M.A., Chen,S.H., Liu,S.W. and Chan,L. TITLE Human hepatic lipase. Cloned cDNA sequence, restriction fragment length polymorphisms, chromosomal localization, and evolutionary relationships with lipoprotein lipase and pancreatic lipase JOURNAL J. Biol. Chem. 263 (3), 1107-1110 (1988) MEDLINE 88087233 FEATURES Location/Qualifiers source 1..1550 /organism="Homo sapiens" /db_xref="taxon:9606" /map="15q21-q23" sig_peptide 5..73 /gene="LIPC" /note="hepatic lipase signal peptide" CDS 5..1504 /gene="LIPC" /note="hepatic lipase precursor" /codon_start=1 /db_xref="GDB:G00-119-366" /db_xref="PID:g307129" /translation="MDTSPLCFSILLVLCIFIQSSALGQSLKPEPFGRRAQAVETNKT LHEMKTRFLLFGETNQGCQIRINHPDTLQECGFNSSLPLVMIIHGWSVDGVLENWIWQ MVAALKSQPAQPVNVGLVDWITLAHDHYTIAVRNTRLVGKEVAALLRWLEESVQLSRS HVHLIGYSLGAHVSGFAGSSIGGTHKIGRITGLDAAGPLFEGSAPSNRLSPDDASFVD AIHTFTREHMGLSVGIKQPIGHYDFYPNGGSFQPGCHFLELYRHIAQHGFNAITQTIK CSHERSVHLFIDSLLHAGTQSMAYPCGDMNSFSQGLCLSCKKGRCNTLGYHVRQEPRS KSKRLFLVTRAQSPFKVYHYQLKIQFINQTETPIQTTFTMSLLGTKEKMQKIPITLGK GIASNKTYSFLITLDVDIGELIMIKFKWENSAVWANVWDTVQTIIPWSTGPRHSGLVL KTIRVKAGETQQRMTFCSENTDDLLLRPTQEKIFVKCEIKSKTSKRKIR" gene 5..1504 /gene="LIPC" mat_peptide 74..1501 /gene="LIPC" /note="hepatic lipase" BASE COUNT 417 a 419 c 382 g 332 t ORIGIN 1 agaaatggac acaagtcccc tgtgtttctc cattctgttg gttttatgca tctttatcca 61 atcaagtgcc cttggacaaa gcctgaaacc agagccattt ggaagaagag ctcaagctgt 121 tgaaacaaac aaaacgctgc atgagatgaa gaccagattc ctgctctttg gagaaaccaa 181 tcagggctgt cagattcgaa tcaatcatcc ggacacgtta caggagtgcg gcttcaactc 241 ctccctgcct ctggtgatga taatccacgg gtggtcggtg gacggcgtgc tagaaaactg 301 gatctggcag atggtggccg cgctgaagtc tcagccggcc cagccagtga acgtggggct 361 ggtggactgg atcaccctgg cccacgacca ctacaccatc gccgtccgca acacccgcct 421 tgtgggcaag gaggtcgcgg ctcttctccg gtggctggag gaatctgttc aactctctcg 481 aagccatgtt cacctaattg ggtacagcct gggtgcacac gtgtcaggat ttgccggcag 541 ttccatcggt ggaacgcaca agattgggag aatcacaggg ctggatgccg cgggaccttt 601 gtttgaggga agtgccccca gcaatcgtct ttctccagat gatgccagtt ttgtggatgc 661 cattcatacc tttacccggg agcacatggg cctgagcgtg ggcatcaaac agcccatagg 721 acactatgac ttctatccca acgggggctc cttccagcct ggctgccact tcctagagct 781 ctacagacat attgcccagc acggcttcaa tgccatcacc cagaccataa aatgctccca 841 cgagcgatcg gtgcaccttt tcatcgactc cttgctgcac gccggcacgc agagcatggc 901 ctacccgtgt ggtgacatga acagcttcag ccagggcctg tgcctgagct gcaagaaggg 961 ccgctgcaac acgctgggct accacgtccg ccaggagccg cggagcaaga gcaagaggct 1021 cttcctcgta acgcgagccc agtccccctt caaagtttat cattaccagt taaagatcca 1081 gttcatcaac caaactgaga cgccaataca aacaactttt accatgtcac tactcggaac 1141 aaaagagaaa atgcagaaaa ttcccatcac tctgggcaaa ggaattgcta gtaataaaac 1201 gtattccttt cttatcacgc tggatgtgga tatcggcgag ctgatcatga tcaagttcaa 1261 gtgggaaaac agtgcagtgt gggccaatgt ctgggacacg gtccagacca tcatcccatg 1321 gagcacaggg ccgcgccact caggcctcgt tctgaagacg atcagagtca aagcaggaga 1381 aacccagcaa agaatgacat tttgttcaga aaacacagat gacctactac ttcgcccaac 1441 ccaggaaaaa atcttcgtga aatgtgaaat aaagtctaaa acatcaaagc gaaagatcag 1501 atgagattta atgaagaccc agtgtaaaga ataaatgaat cttactcctt // LOCUS HUMLIS1A 5243 bp mRNA PRI 30-SEP-1993 DEFINITION Homo sapiens(clone 71) Miller-Dieker lissencephaly protein (LIS1) mRNA, complete cds. ACCESSION L13385 NID g349823 KEYWORDS Miller-Dieker lissencephaly protein. SOURCE Homo sapiens (library: lambda gt10) kidney cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5243) AUTHORS Reiner,O., Carrozzo,R., Shen,Y., Wehnert,M., Faustinella,F., Dobyns,W.B., Caskey,C.T. and Ledbetter,D.H. TITLE Isolation of a Miller-Dieker lissencephaly gene containing G protein beta-subunit-like repeats JOURNAL Nature 364 (6439), 717-721 (1993) MEDLINE 93361119 FEATURES Location/Qualifiers source 1..5243 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" /tissue_lib="lambda gt10" /map="17p13.3" 5'UTR 1..217 gene 218..1450 /gene="LIS1" CDS 218..1450 /gene="LIS1" /codon_start=1 /product="Miller-Dieker lissencephaly protein" /db_xref="PID:g349824" /translation="MVLSQRQRDELNRAIADYLRSNGYEEAYSVFKKEAELDVNEELD KKYAGLLEKKWTSVIRLQKKVMELESKLNEAKEEFTSGGPLGQKRDPKEWIPRPPEKY ALSGHRSPVTRVIFHPVFSVMVSASEDATIKVWDYETGDFERTLKGHTDSVQDISFDH SGKLLASCSADMTIKLWDFQGFECIRTMHGHDHNVSSVAIMPNGDHIVSASRDKTIKM WEVQTGYCVKTFTGHREWVRMVRPNQDGTLIASCSNDQTVRVWVVATKECKAELREHE HVVECISWAPESSYSSISEATGSETKKSGKPGPFLLSGSRDKTIKMWDVSTGMCLMTL VGHDNWVRGVLFHSGGKFILSCADDKTLRVWDYKNKRCMKTLNAHEHFVTSLDFHKTA PYVVTGSVDQTVKVWECR" 3'UTR 1451..5243 BASE COUNT 1495 a 963 c 1147 g 1638 t ORIGIN 1 cggcggaggc ggcggtgcag cgctccggtg gaatgaatct tacttgttga atatcttctg 61 gttactagtt ggattcattt gtgaaagaat cattttcccc tgtgtggaag acacttagtg 121 gcatatttaa attataagtc cacggatcaa aaagcttttt gatttcccaa aggagggaca 181 taccactata tcagataagc ttgacattac agccaagatg gtgctgtccc agagacaacg 241 agatgaacta aatcgagcta tagcagatta tcttcgttca aatggctatg aagaggcata 301 ttcagttttt aaaaaggaag ctgaattaga tgtgaatgaa gaattagata aaaagtatgc 361 tggtcttttg gaaaaaaaat ggacatctgt tattagatta caaaagaagg ttatggaatt 421 agaatcaaag ctaaatgaag caaaagaaga atttacgtca ggtggacctc ttggtcagaa 481 acgagaccca aaagaatgga ttccccgtcc gccagaaaaa tatgcattga gtggtcacag 541 gagtccagtc actcgagtca ttttccatcc tgtgttcagt gttatggtct ctgcttcaga 601 ggatgctaca attaaggtgt gggattatga gactggagat tttgaacgaa ctcttaaagg 661 acatacagac tctgtacagg acatttcatt cgaccacagc ggcaagcttc tggcttcctg 721 ttctgcagat atgaccatta aactatggga ttttcagggc tttgaatgca tcagaaccat 781 gcacggccat gaccacaatg tttcttcagt agccatcatg cccaatggag atcatatagt 841 gtctgcctca agggataaaa ctataaaaat gtgggaagtg caaactggct actgtgtgaa 901 gacattcaca ggacacagag aatgggtacg tatggtacgg ccaaatcaag atggcactct 961 gatagccagc tgttccaatg accagactgt gcgtgtatgg gtcgtagcaa caaaggaatg 1021 caaggctgag ctccgagagc atgagcatgt ggtagaatgc atttcctggg ctccagaaag 1081 ctcatattcc tccatctctg aagcaacagg atctgagact aaaaaaagtg gtaaacctgg 1141 gccattcttg ctgtctggat ccagagacaa gactattaag atgtgggatg tcagtactgg 1201 catgtgcctt atgaccctcg tgggtcatga taactgggta cgtggagttc tgttccattc 1261 tggggggaag tttattttga gttgtgctga tgacaagacc ctacgcgtat gggattacaa 1321 gaacaagcga tgcatgaaga ccctcaatgc gcatgaacac tttgttacct ccttggattt 1381 ccacaagacg gcaccctatg tcgtcactgg cagcgtagat caaacagtaa aagtgtggga 1441 gtgccgttga ttgtgtctcc ttcggcccct cctccctctt ttcctctgga tgcactctga 1501 tgataccatg gttaccccat tgagctctgt ttaaataaat attgtccttt catgtaaatt 1561 attctggatg tagattgagc ttattaaatg ttacacacaa agtattcatg catggtgaat 1621 ccaaattgta tactgtaaat ttacatacgt tgtctagaag taccataggg tttaaaaacc 1681 tgggctggca ttggtcacac caggcctaag aaggcagaag ttgaatcaat tgaactaggg 1741 cactaaactg aatagttgac agtgtcattt tatgttggat tattaattcc tgtttttctt 1801 tctgctatct gttggtgcct gacttgatgg cctcatttgg ggaaaagtgg tggttattag 1861 ggcttttcct gaaatgtgta tctatgtaac atcacttaag tgtgcttaat aaatctcctg 1921 taaggatttt agatgataag gctacaattc agaatcttct gaaccatcta tgtaatgaat 1981 ggggattata cattggaatt tttgtcatga cacatttgcc aaatcagtag gatatatttg 2041 ttttggcagc ctatcacgca gaggctagtg gtatatttat gtaagaaaat gactgtaaat 2101 ctcaagaaaa atctcagcag ctaatagcaa ctcatttatt tcattttggt cttaatgctt 2161 tgtaaacagg tcaaaaaata ctgtcatact ctaagcttct attttccaca ctggacatac 2221 ttctagttgt attctccata ctattagact gtgtagtgat gtgacttcca agtagaattt 2281 aatctcccca ttgagtgtgt catggtacaa atcactattc gtttttggtg ttttttaggg 2341 atgtgcaatg tgcattacat aatgacagaa atactgagaa ggttctgtgt gcccatttga 2401 aaggagtggg aggaatacag cagtttgttt ttcaacatga atctgatatt gatttaaact 2461 gtgtttcact tacaagtttt aaaaaaatga cagggtttaa tggagcgtgc ataaaaatgt 2521 actgttttca ccttttgttt atatgtaaat gtttgtaagt atatgggcct atctgtaagt 2581 gggtaagtct gtatgtgtgt atcatacaca tcaacctcca tgtccttagt cctgggtttt 2641 tgaaaaagtg ctaaaacgga caagtagaat aaatgttgct gtggaatgcc atgctttaga 2701 acaaaccctt tttgatctta atgcttctga aaactaggtc tgactctggg gatttttttc 2761 cagccgaagg aaaatcactt ccgttatgtc cccctctaat ttagccgctc gacattttac 2821 acaacccgga tatgttgtat attttgaccc aaagttacag gtaggtttaa gagaattttt 2881 agccatgact tttggagcac tattccattg tcagttatta ataaagaatt ccattgctta 2941 gctaaccaac aggttttttt tgtttccaag agagttattt gaaaagttaa cagaacaatg 3001 agataacagt gacagtttaa caaagataaa attctgaact gcgttttatt catttgtgta 3061 ctatgtgatt ttttaaatgt cccctttagt atttaatgga aaattggttc ctgcaaaaga 3121 caaagggtga gagttagcgt cctgtagata cacacagaga ctaggccgta tattaactag 3181 aagcagcttt atgtctagct tgtgtctttt tgtttgtttg cttgtttgtt tttagattcc 3241 tgagagatgt ctctggaagg gaaagttttg agaactaatg gctatttttg aggacaaaaa 3301 ttacatctta agctaattcc ttaaatacat acagtaggtg aattttcagg acaatattgc 3361 ctcacaaccc tgcttacatt gaaaagtctt tttcccttag ctcttctgac tggatttttc 3421 tacaaaacta tggaaaatat ctttgttctt gtttgctgct attttctgtc ctattttgag 3481 aaatataaat acatagaaat ggtgcatctt aacatttgtt tgtacatgta taaatgtctt 3541 gtattttaat tcatttttag catgaattgt ttaagggtaa gccacaacat ctagaaatca 3601 ctcatagata ttgaacaata aaggagaatg gtaccgatgc aggaggaagc aagcgtgtct 3661 tcccctgcag cacacagcga cttgcgttga caaaggagga ggaaacgatt actctgtaaa 3721 caaagttatc cttacttggg agattgccac agcctgctgc tgagttgagt taccagacat 3781 cctccatgtg agaagcagcg aacattgaat ctcagggatg gcccacaact gggtccacat 3841 gtaatgagcc ctgtttaata acgaaggggt gggggagagc agtccgtcta caacctggaa 3901 tcagatttgc aaaatttcct gcactgctgt ctgacactgt cctgttgatg ccctttctga 3961 ctgtgttctc tgttttctct gtctgctgtc taaccctgtg ccttgcctgg gataaggaca 4021 atgatgaggt tactggtttg gattgtaagt agaggacttt tattaattgg tttagaggtt 4081 cactgctgct ttgtcacttt ctcaatcaaa ttggccactt aagaaataaa gagctggtag 4141 aattgcatcc tcagatgatt attgactgtg tgtgtgtgtg aaaacagaca ttccagtgcc 4201 acccaaatat atatctgtaa cgtgcccaag aaatcctagc tgcgctcttg agagtgcatg 4261 ccatggagac tggtttagac accgcgtgga gcctagttgc ctgttgtcac ggcatcttgc 4321 actttaggag actaagaccg tcctggttcg tctgtgtgtg gtgtgaccaa tggtgtgccc 4381 agagcactac tctcaaaatc actagtgtta gcaagtcgtc ccgggctggg gagcgttcgc 4441 cgtagtcttt ggaagctttg gctttagatt taccaagccc cgcctccccg ctgccagtgc 4501 cctgctctcc cgttcgcctc tttctgtttc tgtgtgaact ttcccggtaa tatcactcgt 4561 taaataggtt ttctttaaac ttaattaagg aaaaactatt taaaggtaaa ggatattttg 4621 ttgacatcgg tggctcgatc atccttaagc aactgaagtt aaaattgttg aaggaaaagg 4681 cacttaaatt ggttactttc atgtccagct gtatataagt ccagtgtgtt catctagatg 4741 acgcaaagaa tctcctggta gagaagcgac atgtaaaaaa ctggtggaaa aaggttttgg 4801 attttttttc cagtggggtg gggggagggc aagctggatt tacaggtcac ggctggactg 4861 aatgggcctt tttatcttcc cactgtatca tggaagtagc tgcttgcttg tactgtccat 4921 ccttcaggca tccctaaagc tcactctgaa gatgttagag acaaacacaa actcttcgag 4981 ttaaagttga tcctgacact gacatgaagg caagccttga tttcgtatga acgttgctga 5041 agtggtaatt gaggaaaaca gttccccaga ttgttaagag ttcactgaag atattgacac 5101 aattttaaaa aatcagtaaa ggaatgtata taatattgct ctcgtgtttt acagtaagat 5161 ttgttgctct cagactgtgt aaaacaaaat ttattcatgt tttctgcata ttaaaaaatc 5221 ttattgtacc aactggtaaa ccg // LOCUS HUMLKHA 2060 bp mRNA PRI 11-JUN-1993 DEFINITION Human leukotriene A-4 hydrolase mRNA, complete cds. ACCESSION J03459 NID g187172 KEYWORDS leukotriene hydrolase. SOURCE Human spleen, cDNA to mRNA, clone LTA85. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2060) AUTHORS Minami,M., Ohno,S., Kawasaki,H., Raedmark,O., Samuelsson,B., Joernvall,H., Shimizu,T., Seyama,Y. and Suzuki,K. TITLE Molecular cloning of a cDNA coding for human leukotriene A-4 hydrolase: Complete primary structure of an enzyme involved in eicosanoid synthesis JOURNAL J. Biol. Chem. 262, 13873-13876 (1987) MEDLINE 88007621 COMMENT Draft entry and printed copy of sequence for [1] kindly provided by T.Shimizu, 09-SEP-1987. FEATURES Location/Qualifiers source 1..2060 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..>2060 /note="leukotriene A-4 hydrolase mRNA" CDS 69..1904 /note="leukotriene A-4 hydrolase precursor" /codon_start=1 /db_xref="PID:g307130" /translation="MPEIVDTCSLASPASVCRTKHLHLRCSVDFTRRTLTGTAALTVQ SQEDNLRSLVLDTKDLTIEKVVINGQEVKYALGERQSYKGSPMEISLPIALSKNQEIV IEISFETSPKSSALQWLTPEQTSGKEHPYLFSQCQAIHCRAILPCQDTPSVKLTYTAE VSVPKELVALMSAIRDGETPDPEDPSRKIYKFIQKVPIPCYLIALVVGALESRQIGPR TLVWSEKEQVEKSAYEFSETESMLKIAEDLGGPYVWGQYDLLVLPPSFPYGGMENPCL TFVTPTLLAGDKSLSNVIAHEISHSWTGNLVTNKTWDHFWLNEGHTVYLERHICGRLF GEKFRHFNALGGWGELQNSVKTFGETHPFTKLVVDLTDIDPDVAYSSVPYEKGFALLF YLEQLLGGPEIFLGFLKAYVEKFSYKSITTDDWKDFLYSYFKDKVDVLNQVDWNAWLY SPGLPPIKPNYDMTLTNACIALSQRWITAKEDDLNSFNATDLKDLSSHQLNEFLAQTL QRAPLPLGHIKRMQEVYNFNAINNSEIRFRWLRLCIQSKWEDAIPLALKMATEQGRMK FTRPLFKDLAAFDKSHDQAVRTYQEHKASMHPVTAMLVGKDLKVD" mat_peptide 72..1901 /note="leukotriene A-4 hydrolase" BASE COUNT 576 a 444 c 457 g 583 t ORIGIN 99 bp upstream of HaeIII site. 1 ctctatcgac gagtctggta gctgagcgtt gggctgtagg tcgctgtgct gtgtgatccc 61 ccagagccat gcccgagata gtggatacct gttcgttggc ctctccggct tccgtctgcc 121 ggaccaagca cctgcacctg cgctgcagcg tcgactttac tcgccggacg ctgaccggga 181 ctgctgctct cacggtccag tctcaggagg acaatctgcg cagcctggtt ttggatacaa 241 aggaccttac aatagaaaaa gtagtgatca atggacaaga agtcaaatat gctcttggag 301 aaagacaaag ttacaaggga tcgccaatgg aaatctctct tcctatcgct ttgagcaaaa 361 atcaagaaat tgttatagaa atttcttttg agacctctcc aaaatcttct gctctccagt 421 ggctcactcc tgaacagact tctgggaagg aacacccata tctctttagt cagtgccagg 481 ccatccactg cagagcaatc cttccttgtc aggacactcc ttctgtgaaa ttaacctata 541 ctgcagaggt gtctgtccct aaagaactgg tggcacttat gagtgctatt cgtgatggag 601 aaacacctga cccagaagac ccaagcagga aaatatacaa attcatccaa aaagttccaa 661 taccctgcta cctgattgct ttagttgttg gagctttaga aagcaggcaa attggcccaa 721 gaactttggt gtggtctgag aaagagcagg tggaaaagtc tgcttatgag ttttctgaga 781 ctgaatctat gcttaaaata gcagaagatc tgggaggacc gtatgtatgg ggacagtatg 841 acctattggt cctgccacca tccttccctt atggtggcat ggagaatcct tgccttactt 901 ttgtaactcc tactctactg gcaggcgaca agtcactctc caatgtcatt gcacatgaaa 961 tatctcatag ctggacaggg aatctagtga ccaacaaaac ttgggatcac ttttggttaa 1021 atgagggaca tactgtgtac ttggaacgcc acatttgcgg acgattgttt ggtgaaaagt 1081 tcagacattt taatgctctg ggaggatggg gagaactaca gaattcggta aagacatttg 1141 gggagacaca tcctttcacc aaacttgtgg ttgatctgac agatatagac cctgatgtag 1201 cttattcttc agttccctat gagaagggct ttgctttact tttttacctt gaacaactgc 1261 ttggaggacc agagattttc ctaggattct taaaagctta tgttgagaag ttttcctata 1321 agagcataac tactgatgac tggaaggatt tcctgtattc ctattttaaa gataaggttg 1381 atgttctcaa tcaagttgat tggaatgcct ggctctactc tcctggactg cctcccataa 1441 agcccaatta tgatatgact ctgacaaatg cttgtattgc cttaagtcaa agatggatta 1501 ctgccaaaga agatgattta aattcattca atgccacaga cctgaaggat ctctcttctc 1561 atcaattgaa tgagttttta gcacagacgc tccagagggc acctcttcca ttggggcaca 1621 taaagcgaat gcaagaggtg tacaacttca atgccattaa caattctgaa atacgattca 1681 gatggctgcg gctctgcatt caatccaagt gggaggacgc aattcctttg gcgctaaaga 1741 tggcaactga acaaggaaga atgaagttta cccggccctt attcaaggat cttgctgcct 1801 ttgacaaatc ccatgatcaa gctgtccgaa cctaccaaga gcacaaagca agcatgcatc 1861 ccgtgactgc aatgctggtg gggaaagact taaaagtgga ttaaagacct gcgtattgat 1921 gattttagag atttctcttt tttaaatgga attcgtaaag aaatataaaa cttcagctca 1981 caattaaaac tgtcttttta gttttggctt tttattgttt tgttggtgat tttactgaaa 2041 taaagatgag ctacttcttc // LOCUS HUMLORAA 1218 bp mRNA PRI 08-MAY-1991 DEFINITION Human loricrin mRNA, complete cds. ACCESSION M61120 NID g187184 KEYWORDS loricrin. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1218) AUTHORS Hohl,D., Mehrel,T., Lichti,U., Turner,M.L., Roop,D.R. and Steinert,P.M. TITLE Characterization of human loricrin: structure and functionof a new class of epidermal cell envelope protein JOURNAL J. Biol. Chem. 266, 6626-6636 (1991) MEDLINE 91177926 FEATURES Location/Qualifiers source 1..1218 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 34..984 /codon_start=1 /product="loricrin" /db_xref="PID:g187185" /translation="MSYQKKQPTPQPPVDCVKTSGGGGGGGGTGGGGCGFFGGGGSGG GSSGSGCGYSGGGGYSGGGCGGGSSGGGGGGGIGGCGGGSGGSVKYSGGGGSSGGGSG CFSSGGGGSGCFSSGGGGSSGGGSGCFSSGGGGSSGGGSGCFSSGGGGFSGQAVQCQS YGGVSSGGSSGGGSGCFSSGGGGGSVCGYSGGGSGGGSGCGGGSSGGSGSGYVSSQQV TQTSCAPQPSYGGGSSGGGGSGGSGCFSSGGGGGSSGCGGGSSGIGSGCIISGGGSVC GGGSSGGGGGGSSVGGSGSGKGVPICHQTQQKQAPTWPSK" BASE COUNT 138 a 382 c 467 g 231 t ORIGIN 1 ggtgctttgg gctctccttc cttctcagac aagatgtctt atcagaaaaa gcagcccacc 61 cctcagcccc cagtggactg cgtgaagacc tctggcggcg gtggcggtgg cggcggcacg 121 ggcggtggtg gctgcggctt cttcggcggc ggcggctcag ggggcggtag cagcggttct 181 ggctgtggct actccggcgg cggtggctac tctggcggcg gctgcggcgg gggctcctcc 241 ggcggcgggg gcgggggcgg cattggaggc tgcggagggg gctccggtgg gagcgtcaag 301 tactccggcg gcggcggctc ctccggcggg ggctctggct gtttctccag cggtgggggc 361 ggctccggct gcttctcctc cggtggcggc ggctcctccg ggggaggctc cggctgcttc 421 tccagcggtg ggggcggctc ctccgggggc ggctccggct gcttctcctc cggcggcggc 481 ggcttctcgg gccaggcggt ccagtgccag agctacggag gcgtctctag cggcggctcc 541 tccgggggcg gctccggctg cttctccagc ggcgggggcg gcggctctgt ctgcggctac 601 tctggcggcg gctctggcgg cggctctggc tgcggcggag gctcctctgg cggcagcggc 661 tccggctacg tctcctcgca gcaggtcact cagacctcgt gcgcgcccca gccgagttac 721 ggaggggggt cgtccggcgg cggcggcagc ggcggaagcg gctgcttctc cagcggcggg 781 ggcggcggga gctccggctg cggcggcggc tcctccggga ttggcagcgg ctgcatcatc 841 agtggcgggg gctccgtctg cggaggtggt tcctctggag gcggcggcgg cggctcctcc 901 gtgggtggct ccgggagtgg caagggcgtc ccgatctgcc accagaccca gcagaagcag 961 gcgcctacct ggccgtccaa atagatcccc cagggtacca cggaggcgaa ggagttggag 1021 gtgttttcca ggggcaccga tgggcttaga gctctcatga tgctacccga ggtttgcaaa 1081 tccttcatgt cttaacctac ctggaagaag ccattgagct ctccggctgc atctagttct 1141 gctgtttagc ctctttggtt tctgtacaac tacctcccaa ccccagtgcc tcagtcaata 1201 aatttgcaaa ttcatgag // LOCUS HUMLOX 1604 bp mRNA PRI 07-JAN-1995 DEFINITION Human lysyl oxidase (LOX) mRNA, complete cds. ACCESSION M94054 NID g187188 KEYWORDS lysyl oxidase. SOURCE Homo sapiens (tissue library: lambda gt10) adult skin cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1604) AUTHORS Mariani,T.J., Trackman,P.C., Kagan,H.M., Eddy,R.L.Jr.., Shows,T.B., Boyd,C.D. and Deak,S.B. TITLE The complete derived amino acid sequence of human lysyl oxidase JOURNAL Matrix (1992) In press FEATURES Location/Qualifiers source 1..1604 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="fibroblast" /dev_stage="adult" /tissue_type="skin" /tissue_lib="lambda gt10" /map="X" gene 78..1331 /gene="LOX" CDS 78..1331 /gene="LOX" /EC_number="1.4.3.13" /codon_start=1 /function="oxidation of peptidyl lysine" /db_xref="GDB:G00-119-367" /product="lysyl oxidase" /db_xref="PID:g187189" /translation="MRFAWTVLLLGPLQLCALVHCAPPAAGQQQPPREPPAAPGAWRQ QIQWENNGQVFSLLSLGSQYQPQRRRDPGAAVPGAANASAQQPRTPILLIRDNRTAAA RTRTAGSSGVTAGRPRPTARHWFQAGYSTSRARERGASRAENQTAPGEVPALSNLRPP SRVDGMVGDDPYNPYKYSDDNPYYNYYDTYERPRPGGRYRPGYGTGYFQYGLPDLVAD PYYIQASTYVQKMSMYNLRCAAEENCLASTAYRADVRDYDHRVLLRFPQRVKNQGTSD FLPSRPRYSWEWHSCHQHYHSMDEFSHYDLLDANTQRRVAEGHKASFCLEDTSCDYGY HRRFACTAHTQGLSPGCYDTYGADIDCQWIDITDVKPGNYILKVSVNPSYLVPESDYT NNVVRCDIRYTGHHAYASGCTISPY" BASE COUNT 397 a 456 c 403 g 347 t 1 others ORIGIN 1 gggcgtgatt tgagccccgt ttttattttc tgtgagccac gtcctcctcg agggggtcaa 61 tctggccaaa aggagtgatg cgcttcgcct ggaccgtgct cctgctcggg cctttgcagc 121 tctgcgcgct agtgcactgc gcccctcccg ccgccggcca acagcagccc ccgcgcgagc 181 cgccggcggc tccgggcgcc tggcgccagc agatccaatg ggagaacaac gggcaggtgt 241 tcagcttgct gagcctgggc tcacagtacc agcctcagcg ccgccgggac ccgggcgccg 301 ccgtccctgg tgcagccaac gcctccgccc agcagccccg cactccgatc ctgctgatcc 361 gcgacaaccg caccgccgcg gcgcgaacgc ggacggccgg ctcatctgga gtcaccgctg 421 gccgccccag gcccaccgcc cgtcactggt tccaagctgg ctactcgaca tctagagccc 481 gcgaacgtgg cgcctcgcgc gcggagaacc agacagcgcc gggagaagtt cctgcgctca 541 gtaacctgcg gccgcccagc cgcgtggacg gcatggtggg cgacgaccct tacaacccct 601 acaagtactc tgacgacaac ccttattaca actactacga tacttatgaa aggcccagac 661 ctgggggcag gtaccggccc ggatacggca ctggctactt ccagtacggt ctcccagacc 721 tggtggccga cccctactac atccaggcgt ccacgtacgt gcagaagatg tccatgtaca 781 acctgagatg cgcggcggag gaaaactgtc tggccagtac agcatacagg gcagatgtca 841 gagattatga tcacagggtg ctgctcagat ttccccaaag agtgaaaaac caagggacat 901 cagatttctt acccagccga ccaagatatt cctgggaatg gcacagttgt catcaacatt 961 accacagtat ggatgagttt agccactatg acctgcttga tgccaacacc cagaggagag 1021 tggctgaagg ccacaaagca agtttctgtc ttgaagacac atcctgtgac tatggctacc 1081 acaggcgatt tgcatgtact gcacacacac agggattgag tcctggctgt tatgatacct 1141 atggtgcaga catagactgc cagtggattg atattacaga tgtaaaacct ggaaactata 1201 tcctaaaggt cagtgtaaac cccagctacc tggttcctga atctgactat accaacaatg 1261 ttgtgcgctg tgacattcgc tacacaggac atcatgcgta tgcctcaggc tgcacaattt 1321 caccgtatta gaaggcaaag caaaactccc aatggataaa tcagtgcctg gtgttctgaa 1381 gtgggaaaaa atagactaac ttcagtagga tttatgtatt ttgaaaaaga gaacagaaaa 1441 caacaaaaga atttttgttt ggactgtttt caataacaaa gcacataact ggattttgaa 1501 cgcttaagtc aatcattact tggaaatttn taatgtttat tatttacatc aactttgtga 1561 attaacacag tgtttcaatt ctgtaatttc atatttgact cttt // LOCUS HUMLOX15A 2671 bp mRNA PRI 11-JUN-1993 DEFINITION Human 15-lipoxygenase mRNA, complete cds. ACCESSION M23892 NID g187190 KEYWORDS 15-lipoxygenase. SOURCE Human reticulocyte, cDNA to mRNA, clone 15LOX. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2671) AUTHORS Sigal,E., Craik,C.S., Highland,E., Grunberger,D., Costello,L.L., Dixon,R.A. and Nadel,J.A. TITLE Molecular cloning and primary structure of human 15-lipoxygenase JOURNAL Biochem. Biophys. Res. Commun. 157, 457-464 (1988) MEDLINE 89076270 FEATURES Location/Qualifiers source 1..2671 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..2671 /note="15-lipoxygenase mRNA" CDS 4..1992 /note="15-lipoxygenase" /codon_start=1 /db_xref="PID:g307135" /translation="MGLYRIRVSTGASLYAGSNNQVQLWLVGQHGEAALGKRLWPARG KETELKVEVPEYLGPLLFVKLRKRHLLKDDAWFCNWISVQGPGAGDEVRFPCYRWVEG NGVLSLPEGTGRTVGEDPQGLFQKHREEELEERRKLYRWGNWKDGLILNMAGAKLYDL PVDERFLEDKRVDFEVSLAKGLADLAIKDSLNVLTCWKDLDDFNRIFWCGQSKLAERV RDSWKEDALFGYQFLNGANPVVLRRSAHLPARLVFPPGMEELQAQLEKELEGGTLFEA DFSLLDGIKANVILCSQQHLAAPLVMLKLQPDGKLLPMVIQLQLPRTGSPPPPLFLPT DPPMAWLLAKCWVRSSDFQLHELQSHLLRGHLMAEVIVVATMRCLPSIHPIFKLIIPH LRYTLEINVRARTGLVSDMGIFDQIMSTGGGGHVQLLKQAGAFLTYSSFCPPDDLADR GLLGVKSSFYAQDALRLWEIIYRYVEGIVSLHYKTDVAVKDDPELQTWCREITEIGLQ GAQDRGFPVSLQARDQVCHFVTMCIFTCTGQHASVHLGQLDWYSWVPNAPCTMRLPPP TTKDATLETVMATLPNFHQASLQMSITWQLGRRQPVMVAVGQHEEEYFSGPEPKAVLK KFREELAALDKEIEIRNAKLDMPYEYLRPSVVENSVAI" BASE COUNT 580 a 743 c 718 g 630 t ORIGIN 1 aagatgggtc tctaccgcat ccgcgtgtcc actggggcct cgctctatgc cggttccaac 61 aaccaggtgc agctgtggct ggtcggccag cacggggagg cggcgctcgg gaagcgactg 121 tggcccgcac ggggcaagga gacagaactc aaggtggaag taccggagta tctggggccg 181 ctgctgtttg tgaaactgcg caaacggcac ctccttaagg acgacgcctg gttctgcaac 241 tggatctctg tgcagggccc cggagccggg gacgaggtca ggttcccttg ttaccgctgg 301 gtggagggca acggcgtcct gagcctgcct gaaggcaccg gccgcactgt gggcgaggac 361 cctcagggcc tgttccagaa acaccgggaa gaagagctgg aagagagaag gaagttgtac 421 cggtggggaa actggaagga cgggttaatt ctgaatatgg ctggggccaa actatatgac 481 ctccctgtgg atgagcgatt tctggaagac aagagagttg actttgaggt ttcgctggcc 541 aaggggctgg ccgacctcgc tatcaaagac tctctaaatg ttctgacttg ctggaaggat 601 ctagatgact tcaaccggat tttctggtgt ggtcagagca agctggctga gcgcgtgcgg 661 gactcctgga aggaagatgc cttatttggg taccagtttc ttaatggcgc caaccccgtg 721 gtgctgaggc gctctgctca ccttcctgct cgcctagtgt tccctccagg catggaggaa 781 ctgcaggccc agctggagaa ggagctggag ggaggcacac tgttcgaagc tgacttctcc 841 ctgctggatg ggatcaaggc caacgtcatt ctctgtagcc agcagcacct ggctgcccct 901 ctagtcatgc tgaaattgca gcctgatggg aaactcttgc ccatggtcat ccagctccag 961 ctgccccgca caggatcccc accacctccc cttttcttgc ctacggatcc cccaatggcc 1021 tggcttctgg ccaaatgctg ggtgcgcagc tctgacttcc agctccatga gctgcagtct 1081 catcttctga ggggacactt gatggctgag gtcattgttg tggccaccat gaggtgcctg 1141 ccgtcgatac atcctatctt caagcttata attccccacc tgcgatacac cctggaaatt 1201 aacgtccggg ccaggactgg gctggtctct gacatgggaa ttttcgacca gataatgagc 1261 actggtgggg gaggccacgt gcagctgctc aagcaagctg gagccttcct aacctacagc 1321 tccttctgtc cccctgatga cttggccgac cgggggctcc tgggagtgaa gtcttccttc 1381 tatgcccaag atgcgctgcg gctctgggaa atcatctatc ggtatgtgga aggaatcgtg 1441 agtctccact ataagacaga cgtggctgtg aaagacgacc cagagctgca gacctggtgt 1501 cgagagatca ctgaaatcgg gctgcaaggg gcccaggacc gagggtttcc tgtctcttta 1561 caggctcggg accaggtttg ccactttgtc accatgtgta tcttcacctg caccggccaa 1621 cacgcctctg tgcacctggg ccagctggac tggtactctt gggtgcctaa tgcaccctgc 1681 acgatgcggc tgcccccgcc aaccaccaag gatgcaacgc tggagacagt gatggcgaca 1741 ctgcccaact tccaccaggc ttctctccag atgtccatca cttggcagct gggcagacgc 1801 cagcccgtta tggtggctgt gggccagcat gaggaggagt atttttcggg ccctgagcct 1861 aaggctgtgc tgaagaagtt cagggaggag ctggctgccc tggataagga aattgagatc 1921 cggaatgcaa agctggacat gccctacgag tacctgcggc ccagcgtggt ggaaaacagt 1981 gtggccatct aagcgtcgcc accctttggt tatttcagcc cccatcaccc aagccacaag 2041 ctgacccctt cgtggttata gccctgccct cccaagtccc accctcttcc catgtcccac 2101 cctccctaga ggggcacctt ttcatggtct ctgcacccag tgaacacatt ttactctaga 2161 ggcatcacct gggaccttac tcctctttcc ttccttcctc ctttcctatc ttccttcctc 2221 tctctcttcc tctttcttca ttcagatcta tatggcaaat agccacaatt atataaatca 2281 tttcaagact agaatagggg gatataatac atattactcc acacctttta tgaatcaaat 2341 atgatttttt tgttgttgtt aagacagagt ctcactttga cacccaggct ggagtgcagt 2401 ggtgccatca ccacggctca ctgcagcctc agcgtcctgg gctcaaatga tcctcccacc 2461 tcagcctcct gagtagctgg gactacaggc tcatgccatc atgcccagct aatatttttt 2521 tattttcgtg gagacggggc ctcactatgt tgcctaggct ggaaatagga ttttgaaccc 2581 aaattgagtt taacaataat aaaaagttgt tttacgctaa agatggaaaa gaactaggac 2641 tgaactattt taaataaaat attggcaaaa g // LOCUS HUMLOX5 2497 bp mRNA PRI 15-MAR-1989 DEFINITION Human lipoxygenase mRNA, complete cds. ACCESSION J03600 NID g187192 KEYWORDS lipoxygenase. SOURCE Human cell line HL60, cDNA to mRNA, clone lambda-5LO6. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2497) AUTHORS Dixon,R.A., Jones,R.E., Diehl,R.E., Bennett,C.D., Kargman,S. and Rouzer,C.A. TITLE Cloning of the cDNA for human 5-lipoxygenase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 416-420 (1988) MEDLINE 88124852 COMMENT Draft entry and computer-readable copy of sequence [1] kindly provided by R.A.F.Dixon, 08-AUG-1989. FEATURES Location/Qualifiers source 1..2497 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 45..2069 /note="lipoxygenase" /codon_start=1 /db_xref="PID:g187193" /translation="MPSYTVTVATGSQWFAGTDDYIYLSLVGSAGCSEKHLLDKPFYN DFERGAVDSYDVTVDEELGEIQLVRIEKRKYWLNDDWYLKYITLKTPHGDYIEFPCYR WITGDVEVVLRDGRAKLARDDQIHILKQHRRKELETRQKQYRWMEWNPGFPLSIDAKC HKDLPRDIQFDSEKGVDFVLNYSKAMENLFINRFMHMFQSSWNDFADFEKIFVKISNT ISERVMNHWQEDLMFGYQFLNGCNPVLIRRCTELPEKLPVTTEMVECSLERQLSLEQE VQQGNIFIVDFELLDGIDANKTDPCTLQFLAAPICLLYKNLANKIVPIAIQLNQIPGD ENPIFLPSDAKYDWLLAKIWVRSSDFHVHQTITHLLRTHLVSEVFGIAMYRQLPAVHP IFKLLVAHVRFTIAINTKAREQLICECGLFDKANATGGGGHVQMVQRAMKDLTYASLC FPEAIKARGMESKEDIPYYFYRDDGLLVWEAIRTFTAEVVDIYYEGDQVVEEDPELQD FVNDVYVYGMRGRKSSGFPKSVKSREQLSEYLTVVIFTASAQHAAVNFGQYDWCSWIP NAPPTMRAPPPTAKGVVTIEQIVDTLPDRGRSCWHLGAVWALSQFQENELFLGMYPEE HFIEKPVKEAMARFRKNLEAIVSVIAERNKKKQLPYYYLSPDRIPNSVAI" BASE COUNT 580 a 721 c 667 g 529 t ORIGIN 1 gggcgccgag gctccccgcc gctcgctgct ccccggcccg cgccatgccc tcctacacgg 61 tcaccgtggc cactggcagc cagtggttcg ccggcactga cgactacatc tacctcagcc 121 tcgtgggctc ggcgggctgc agcgagaagc acctgctgga caagcccttc tacaacgact 181 tcgagcgtgg cgcggtggat tcatacgacg tgactgtgga cgaggaactg ggcgagatcc 241 agctggtcag aatcgagaag cgcaagtact ggctgaatga cgactggtac ctgaagtaca 301 tcacgctgaa gacgccccac ggggactaca tcgagttccc ctgctaccgc tggatcaccg 361 gcgatgtcga ggttgtcctg agggatggac gcgcaaagtt ggcccgagat gaccaaattc 421 acattctcaa gcaacaccga cgtaaagaac tggaaacacg gcaaaaacaa tatcgatgga 481 tggagtggaa ccctggcttc cccttgagca tcgatgccaa atgccacaag gatttacccc 541 gtgatatcca gtttgatagt gaaaaaggag tggactttgt tctgaattac tccaaagcga 601 tggagaacct gttcatcaac cgcttcatgc acatgttcca gtcttcttgg aatgacttcg 661 ccgactttga gaaaatcttt gtcaagatca gcaacactat ttctgagcgg gtcatgaatc 721 actggcagga agacctgatg tttggctacc agttcctgaa tggctgcaac cctgtgttga 781 tccggcgctg cacagagctg cccgagaagc tcccggtgac cacggagatg gtagagtgca 841 gcctggagcg gcagctcagc ttggagcagg aggtccagca agggaacatt ttcatcgtgg 901 actttgagct gctggatggc atcgatgcca acaaaacaga cccctgcaca ctccagttcc 961 tggccgctcc catctgcttg ctgtataaga acctggccaa caagattgtc cccattgcca 1021 tccagctcaa ccaaatcccg ggagatgaga accctatttt cctcccttcg gatgcaaaat 1081 acgactggct tttggccaaa atctgggtgc gttccagtga cttccacgtc caccagacca 1141 tcacccacct tctgcgaaca catctggtgt ctgaggtttt tggcattgca atgtaccgcc 1201 agctgcctgc tgtgcacccc attttcaagc tgctggtggc acacgtgaga ttcaccattg 1261 caatcaacac caaggcccgt gagcagctca tctgcgagtg tggcctcttt gacaaggcca 1321 acgccacagg gggcggtggg cacgtgcaga tggtgcagag ggccatgaag gacctgacct 1381 atgcctccct gtgctttccc gaggccatca aggcccgggg catggagagc aaagaagaca 1441 tcccctacta cttctaccgg gacgacgggc tcctggtgtg ggaagccatc aggacgttca 1501 cggccgaggt ggtagacatc tactacgagg gcgaccaggt ggtggaggag gacccggagc 1561 tgcaggactt cgtgaacgat gtctacgtgt acggcatgcg gggccgcaag tcctcaggct 1621 tccccaagtc ggtcaagagc cgggagcagc tgtcggagta cctgaccgtg gtgatcttca 1681 ccgcctccgc ccagcacgcc gcggtcaact tcggccagta cgactggtgc tcctggatcc 1741 ccaatgcgcc cccaaccatg cgagccccgc caccgactgc caagggcgtg gtgaccattg 1801 agcagatcgt ggacacgctg cccgaccgcg gccgctcctg ctggcatctg ggtgcagtgt 1861 gggcgctgag ccagttccag gaaaacgagc tgttcctggg catgtaccca gaagagcatt 1921 ttatcgagaa gcctgtgaag gaagccatgg cccgattccg caagaacctc gaggccattg 1981 tcagcgtgat tgctgagcgc aacaagaaga agcagctgcc atattactac ttgtccccag 2041 accggattcc gaacagtgtg gccatctgag cacactgcca gtctcactgt gggaaggcca 2101 gctgccccag ccagatggac tccagcctgc ctggcaggct gtctggccag gcctcttggc 2161 agtcacatct cttcctccga ggccagtacc tttccattta ttctttgatc ttcagggaac 2221 tgcatagatt gtatcaaagt gtaaacacca tagggaccca ttctacacag agcaggactg 2281 cacaggcgtc ctgtccacac ccagctcagc atttccacac caagcagcaa cagcaaatca 2341 cgaccactga tagatgtcta ttcttgttgg agacatggga tgattatttt ctgttctatt 2401 tgtgcttagt ccaattcctt gcacatagta ggtacccaat tcaattacta ttgaatgaat 2461 taagaattgg ttgccataaa aataaatcag ttcattt // LOCUS HUMLRP1P 1481 bp mRNA PRI 07-JAN-1995 DEFINITION Human pancreatic lipase related protein 1 (PLRP1) mRNA, complete cds. ACCESSION M93283 NID g187229 KEYWORDS lipase related protein 1. SOURCE Homo sapiens pancreas cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1481) AUTHORS Giller,T., Buchwald,P., Blum-Kaelin,D. and Hunziker,W. TITLE Two novel human pancreatic lipase related proteins, hPLRP1 and hPLRP2. Differences in colipase dependence and in lipase activity JOURNAL J. Biol. Chem. 267 (23), 16509-16516 (1992) MEDLINE 92355622 FEATURES Location/Qualifiers source 1..1481 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="pancreas" gene 20..1464 /gene="PLRP1" sig_peptide 20..70 /gene="PLRP1" CDS 20..1423 /gene="PLRP1" /codon_start=1 /product="lipase related protein 1" /db_xref="PID:g187230" /translation="MLIFWTITLFLLGAAKGKEVCYEDLGCFSDTEPWGGTAIRPLKI LPWSPEKIGTRFLLYTNENPNNFQILLLSDPSTIEASNFQMDRKTRFIIHGFIDKGDE SWVTDMCKKLFEVEEVNCICVDWKKGSQATYTQAANNVRVVGAQVAQMLDILLTEYSY PPSKVHLIGHSLGAHVAGEAGSKTPGLSRITGLDPVEASFESTPEEVRLDPSDADFVD VIHTDAAPLIPFLGFGTNQQMGHLDFFPNGGESMPGCKKNALSQIVDLDGIWAGTRDF VACNHLRSYKYYLESILNPDGFAAYPCTSYKSFESDKCFPCPDQGCPQMGHYADKFAG RTSEEQQKFFLNTGEASNFARWRYGVSITLSGRTATGQIKVALFGNKGNTHQYSIFRG ILKPGSTHSYEFDAKLDVGTIEKVKFLWNNNVINPTLPKVGATKITVQKGEEKTVYNF CSEDTVREDTLLTLTPC" mat_peptide 71..1420 /gene="PLRP1" /product="lipase related protein 1" polyA_signal 1458..1464 /gene="PLRP1" polyA_site 1481 /gene="PLRP1" BASE COUNT 399 a 371 c 379 g 332 t ORIGIN 1 ctctggaaca ttagacagga tgctgatctt ctggacaatc acacttttcc tgctgggagc 61 agccaaagga aaagaagttt gctatgagga cctcgggtgc ttttctgaca ctgagccctg 121 gggcgggaca gcaatcaggc ccctgaaaat tctcccctgg agccctgaga agatcggcac 181 ccgcttcctg ctgtacacca atgaaaaccc aaacaacttt caaattctcc tcctctctga 241 tccatcaaca attgaggcat caaattttca aatggacaga aagacccggt tcatcatcca 301 tggcttcata gacaaaggag atgagagctg ggtgacagac atgtgcaaga aactgttcga 361 ggtggaggag gtgaactgca tctgcgtgga ctggaagaag ggctcccaag ccacctacac 421 acaggctgcc aacaacgtgc gagtggtggg cgcccaggtg gcccagatgc tcgacatcct 481 cttgacagag tatagctacc ccccttccaa agttcacctc attggccaca gcctgggagc 541 ccacgtggct ggagaggcag gaagcaagac tccaggcctg agcaggatta cagggttgga 601 tcctgtagaa gcaagtttcg agagtactcc tgaagaggtg cgacttgatc cctctgatgc 661 tgactttgtt gatgtgattc acacggatgc agctcccctg atcccattct tgggttttgg 721 aacgaaccaa cagatgggtc atcttgactt cttccccaat ggaggagaga gcatgccggg 781 atgcaagaag aatgccctgt ctcagatcgt ggatctagat ggcatctggg cgggaacccg 841 ggactttgtg gcttgcaatc acctaagaag ctacaagtat tacttggaaa gcatcctcaa 901 tcccgatggg tttgctgcat atccctgcac ttcctacaag tcctttgagt ctgacaagtg 961 cttcccgtgt ccagatcaag gatgcccaca gatgggtcac tatgctgata aatttgctgg 1021 caggacaagt gaagagcagc agaaattctt cttgaacaca ggagaggcta gcaatttcgc 1081 tcgctggaga tatggggttt ccatcacact gtctggaaga acagccactg gtcagatcaa 1141 agttgctttg tttggaaata agggaaacac tcaccagtac agcatcttca gggggattct 1201 caaaccaggc tcaacccatt cctatgagtt tgatgcaaag ctggatgttg gaacaattga 1261 gaaagtcaag tttctttgga ataacaatgt gataaatcca accctcccca aagtgggtgc 1321 caccaagatc actgtgcaaa agggagaaga gaagacagtg tacaacttct gtagcgaaga 1381 cacagtgcgg gaagacacgc tgctcaccct cacgccctgc taagctcccg gggcgacgag 1441 gctgctgcgt tcacactaat aaaatccact ggtgcatctg t // LOCUS HUMLRP2P 1450 bp mRNA PRI 07-JAN-1995 DEFINITION Human pancreatic lipase related protein 2 (PLRP2) mRNA, complete cds. ACCESSION M93284 NID g187231 KEYWORDS lipase related protein 2. SOURCE Homo sapiens pancreas cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1450) AUTHORS Giller,T., Buchwald,P., Blum-Kaelin,D. and Hunziker,W. TITLE Two novel human pancreatic lipase related proteins, hPLRP1 and hPLRP2. Differences in colipase dependence and in lipase activity JOURNAL J. Biol. Chem. 267 (23), 16509-16516 (1992) MEDLINE 92355622 FEATURES Location/Qualifiers source 1..1450 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="pancreas" gene 1..1439 /gene="PLRP2" sig_peptide 1..51 /gene="PLRP2" CDS 1..1410 /gene="PLRP2" /codon_start=1 /product="lipase related protein 2" /db_xref="PID:g187232" /translation="MLPPWTLGLLLLATVRGKEVCYGQLGCFSDEKPWAGTLQRPVKL LPWSPEDIDTRFLLYTNENPNNFQLITGTEPDTIEASNFQLDRKTRFIIHGFLDKAED SWPSDMCKKMFEVEKVNCICVDWRHGSRAMYTQAVQNIRVVGAETAFLIQALSTQLGY SLEDVHVIGHSLGAHTAAEAGRRLGGRVGRITGLDPAGPCFQDEPEEVRLDPSDAVFV DVIHTDSSPIVPSLGFGMSQKVGHLDFFPNGGKEMPGCKKNVLSTITDIDGIWEGIGG FVSCNHLRSFEYYSSSVLNPDGFLGYPCASYDEFQESKCFPCPAEGCPKMGHYADQFK GKTSAVEQTFFLNTGESGNFTSWRYKVSVTLSGKEKVNGYIRIALYGSNENSKQYEIF KGSLKPDASHTCAIDVDFNVGKIQKVKFLWNKRGINLSEPKLGASQITVQSGEDGTEY NFCSSDTVEENVLQSLYPC" mat_peptide 52..1407 /gene="PLRP2" /product="lipase related protein 2" polyA_signal 1434..1439 /gene="PLRP2" polyA_site 1450 /gene="PLRP2" BASE COUNT 396 a 326 c 379 g 349 t ORIGIN 1 atgctgcccc cttggaccct cggccttctc ctgctggcca cagtcagagg aaaagaggtc 61 tgctacggac aacttggctg cttttctgat gaaaaaccat gggcaggaac ccttcagcga 121 cctgtaaaat tacttccctg gtcccccgag gacattgaca cccgctttct tctgtacaca 181 aatgaaaatc caaacaactt ccaactaatc actggcacgg aaccagacac cattgaggct 241 tcaaacttcc aactggaccg caagacacgc ttcatcatcc atggcttctt agacaaggcg 301 gaggacagct ggccatcgga catgtgcaag aaaatgtttg aagtggagaa ggtgaactgc 361 atctgtgtgg actggaggca cgggtcccgg gcaatgtaca cccaagccgt gcaaaacatt 421 cgggttgttg gggcggagac agctttctta atacaagcac tgtcgacgca gctagggtac 481 agccttgagg acgtgcatgt catcggccac agcctgggcg cgcacacggc cgcggaggcg 541 ggcaggaggc tggggggccg cgtgggcagg atcacagggc tggatccagc agggccgtgc 601 ttccaggatg aacctgagga ggttcggttg gatccatctg acgccgtgtt tgtggatgtg 661 attcacacag attcttctcc catagttcct tccctaggtt tcggaatgag ccaaaaggtg 721 ggccatctgg atttctttcc aaatggagga aaggaaatgc ccggatgtaa gaaaaatgtc 781 ctttcaacca ttactgatat tgatggaata tgggaaggaa ttggtggctt tgtgtcttgc 841 aatcacctaa gaagcttcga gtattactca agcagcgtcc tcaaccctga tggcttcctg 901 ggctatccct gtgcctccta cgatgagttt caggagagta agtgtttccc ttgtccagct 961 gaaggatgcc ccaaaatggg gcactatgct gaccaattta aggggaaaac aagtgctgtg 1021 gaacaaacct ttttcctgaa cacaggagag agtggtaact ttactagttg gagatataag 1081 gtatcagtca cactttctgg aaaagagaaa gtgaatgggt acatcaggat tgctttgtat 1141 ggaagtaatg aaaactcgaa acaatatgag attttcaaag gatccctcaa accagatgca 1201 agtcacacgt gtgctattga tgtggatttt aatgttggaa aaatacagaa agttaaattc 1261 ctctggaaca aacgtgggat aaatctatct gagcccaaac tgggggcttc ccaaatcaca 1321 gtgcaaagtg gtgaagatgg gactgagtat aatttttgta gcagcgacac tgtggaagaa 1381 aacgtcttgc aatctcttta cccttgttaa aaacgtggtg cggctattgc ggtaataaaa 1441 tctttaatgc // LOCUS HUMLSP1Q1 2426 bp mRNA PRI 02-APR-1991 DEFINITION Human leukocyte surface protein (CD31) mRNA, complete cds. ACCESSION M37780 NID g187239 KEYWORDS leukocyte surface protein. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2426) AUTHORS Stockinger,H., Gadd,S.J., Eher,R., Majdic,O., Schreiber,W., Kasinrerk,W., Strass,B., Schnabl,E. and Knapp,W. TITLE Molecular characterization and functional analysis of the leukocyte surface protein CD31 JOURNAL J. Immunol. 145, 3889-3897 (1990) MEDLINE 91060975 COMMENT Draft entry and computer-readable sequence for [Unpublished (1990)] kindly submitted by B.Seed, 17-AUG-1990. Mol Biol Mass. General Hospital Boston, MA 02114. FEATURES Location/Qualifiers source 1..2426 /organism="Homo sapiens" /db_xref="taxon:9606" gene 126..2342 /gene="CD31" CDS 126..2342 /gene="CD31" /codon_start=1 /product="leukocyte surface protein" /db_xref="PID:g187240" /translation="MQPRWAQGATMWLGVLLTLLLCSSLEGQENSFTINSVDMKSLPD WTVQNGKNLTLQCFADVSTTSHVKPQHQMLFYKDDVLFYNISSMKSTESYFIPEVRIY DSGTYKCTVIVNNKEKTTAEYQVLVEGVPSPRVTLDKKEAIQGGIVRVNCSVPEEKAP IHFTIEKLELNEKMVKLKREKNSRDQNFVILEFPVEEQDRVLSFRCQARIISGIHMQT SESTKSELVTVTESFSTPKFHISPTGMIMEGAQLHIKCTIQVTHLAQEFPEIIIQKDK AIVAHNRHGNKAVYSVMAMVEHSGNYTCKVESSRISKVSSIVVNITELFSKPELESSF THLDQGERLNLSCSIPGAPPANFTIQKEDTIVSQTQDFTKIASKSDSGTYICTAGIDK VVKKSNTVQIVVCEMLSQPRISYDAQFEVIKGQTIEVRCESISGTLPISYQLLKTSKV LENSTKNSNDPAVFKDNPTEDVEYQCVADNCHSHAKMLSEVLRVKVIAPVDEVQISIL SSKVVESGEDIVLQCAVNEGSGPITYKFYREKEGKPFYQMTSNATQAFWTKQKANKEQ EGEYYCTAFNRANHASSVPRSKILTVRVILAPWKKGLIAVVIIGVIIALLIIAAKCYF LRKAKAKQMPVEMSRPAVPLLNSNNEKMSDPNMEANSHYGHNDDVGNHAMKPINDNKE PLNSDVQYTEVQVSSAESHKDLGKKDTETVYSEVRKAVPDAVESRYSRTEGSLDGT" BASE COUNT 729 a 581 c 584 g 532 t ORIGIN 1 gaccagagca atttctgctt ttcacagggc gggtttctca acggtgactt gtgggcagtg 61 ccttctgctg agcgagtcat ggcccgaagg cagaactaac tgtgcctgca gtcttcactc 121 tcaggatgca gccgaggtgg gcccaagggg ccacgatgtg gcttggagtc ctgctgaccc 181 ttctgctctg ttcaagcctt gagggtcaag aaaactcttt cacaatcaac agtgttgaca 241 tgaagagcct gccggactgg acggtgcaaa atgggaagaa cctgaccctg cagtgcttcg 301 cggatgtcag caccacctct cacgtcaagc ctcagcacca gatgctgttc tataaggatg 361 acgtgctgtt ttacaacatc tcctccatga agagcacaga gagttatttt attcctgaag 421 tccggatcta tgactcaggg acatataaat gtactgtgat tgtgaacaac aaagagaaaa 481 ccactgcaga gtaccaggtg ttggtggaag gagtgcccag tcccagggtg acactggaca 541 agaaagaggc catccaaggt gggatcgtga gggtcaactg ttctgtccca gaggaaaagg 601 ccccaataca cttcacaatt gaaaaacttg aactaaatga aaaaatggtc aagctgaaaa 661 gagagaagaa ttctcgagac cagaattttg tgatactgga attccccgtt gaggaacagg 721 accgcgtttt atccttccga tgtcaagcta ggatcatttc tgggatccat atgcagacct 781 cagaatctac caagagtgaa ctggtcaccg tgacggaatc cttctctaca cccaagttcc 841 acatcagccc caccggaatg atcatggaag gagctcagct ccacattaag tgcaccattc 901 aagtgactca cctggcccag gagtttccag aaatcataat tcagaaggac aaggcgattg 961 tggcccacaa cagacatggc aacaaggctg tgtactcagt catggccatg gtggagcaca 1021 gtggcaacta cacgtgcaaa gtggagtcca gccgcatatc caaggtcagc agcatcgtgg 1081 tcaacataac agaactattt tccaagcccg aactggaatc ttccttcaca catctggacc 1141 aaggtgaaag actgaacctg tcctgctcca tcccaggagc acctccagcc aacttcacca 1201 tccagaagga agatacgatt gtgtcacaga ctcaagattt caccaagata gcctcaaagt 1261 cggacagtgg gacgtatatc tgcactgcag gtattgacaa agtggtcaag aaaagcaaca 1321 cagtccagat agtcgtatgt gaaatgctct cccagcccag gatttcttat gatgcccagt 1381 ttgaggtcat aaaaggacag accatcgaag tccgttgcga atcgatcagt ggaactttgc 1441 ctatttctta ccaactttta aaaacaagta aagttttgga gaatagtacc aagaactcaa 1501 atgatcctgc ggtattcaaa gacaacccca ctgaagacgt cgaataccag tgtgttgcag 1561 ataattgcca ttcccacgcc aaaatgttaa gtgaggttct gagggtgaag gtgatagccc 1621 cggtggatga ggtccagatt tctatcctgt caagtaaggt ggtggagtct ggagaggaca 1681 ttgtgctgca atgtgctgtg aatgaaggat ctggtcccat cacctataag ttttacagag 1741 aaaaagaggg caaacccttc tatcaaatga cctcaaatgc cacccaggca ttttggacca 1801 agcagaaggc taacaaggaa caggagggag agtattactg cacagccttc aacagagcca 1861 accacgcctc cagtgtcccc agaagcaaaa tactgacagt cagagtcatt cttgccccat 1921 ggaagaaagg acttattgca gtggttatca tcggagtgat cattgctctc ttgatcattg 1981 cggccaaatg ttattttctg aggaaagcca aggccaagca gatgccagtg gaaatgtcca 2041 ggccagcagt accacttctg aactccaaca acgagaaaat gtcagatccc aatatggaag 2101 ctaacagtca ttacggtcac aatgacgatg tcggaaacca tgcaatgaaa ccaataaatg 2161 ataataaaga gcctctgaac tcagacgtgc agtacacgga agttcaagtg tcctcagctg 2221 agtctcacaa agatctagga aagaaggaca cagagacagt gtacagtgaa gtccggaaag 2281 ctgtccctga tgccgtggaa agcagatact ctagaacgga aggctccctt gatggaactt 2341 agacagcaag gccagatgca catccctgga aggacatcca tgttccgaga agaacagatg 2401 atccctgtat ttcaagacct ctgtcc // LOCUS HUMLSPRO 2622 bp mRNA PRI 25-JAN-1993 DEFINITION Human lymphocyte surface protein exons 1-5, complete cds. ACCESSION M99578 NID g187241 KEYWORDS lymphocyte surface protein. SOURCE Homo sapiens (library: lambda gt11) neonate placenta cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2622) AUTHORS Voland,J.R., Wyzykowski,R.J., Mark,H. and Dutton,R.W. TITLE Cloning and sequencing of a trophoblast- endothelial- activated lymphocyte surface protein: cDNA sequence and genomic structure JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89, 10425-10429 (1992) MEDLINE 93066251 FEATURES Location/Qualifiers source 1..2622 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="trophoblast" /dev_stage="neonate" /tissue_type="placenta" /tissue_lib="lambda gt11" mRNA join(1..153,153..933,934..1082,1083..1323,1324..2622) exon 1..153 /note="codes the 5' UTR; intron between exon 1 and exon 2 is 2.7Kb" /number=1 5'UTR 1..171 /note="84% GC in 5'UTR" exon 153..933 /note="codes the extracellular domain; intron between exon 2 and exon 3 is 0.7Kb" /number=2 CDS 170..1822 /note="550 amino acids MW=61kDa, glycosylated=75 kDa; expressed on endothelium, activated lymphocytes and syncytiotrophoblast, contains leucine zipper and basic region homologous to myc; 721P" /codon_start=1 /db_xref="PID:g187242" /translation="MAAATIVHDTSEAVELCPAYGLYLKPITKMTISVALPQLKQPGK SISNWEVMERLKGMVQNHQFSTLRISKSTMDFIRFEGEVENKSLVKSFLACLDGKTIK LSGFSDILKVRAAEFKIDFPTRHDWDSFFRDAKDMNETLPGERPDTIHLEGLPCKWFA LKESGSEKPSEDVLVKVFEKFGEIRNVDIPMLDPYREEMTGRNFHTFSFGGHLNFEAY VQYREYMGFIQAMSALRGMKLMYKGEDGKAVACNIKVSFDSTKHLSDASIKKRQLERQ KLQELEQQREEQKRREKEAEERQRAEERKQKELEELERERKREEKLRKREQKQRDREL RRNQKKLEKLQAEEQKQLQEKIKLEERKLLLAQRNLQSIRLIAELLSRAKAVKLREQE QKEEKLRLQQQEERRRLQEAELRRVEEEKERALGLQRKERELRERLLSILQSKKPDDS HTHDELGVAHGPAAARPGHPADRVVRLCERHHAAPPRGPAPGRCPQGEPGPPRGRRRS QKRERERGRGGPMQGGSELLSCGPRGWLSREEVPGRRPLLHS" misc_feature 172..232 /note="codes for protein leader sequence" exon 934..1082 /note="codes the putative basic region; intron between exon 3 and exon 4 is 3.5Kb" /number=3 exon 1083..1323 /note="codes the leucine zipper region; the intron between exon 4 and exon 5 is 1.3Kb" /number=4 misc_feature 1199..1263 /note="codes for leucine zipper, homologous to myc" exon 1324..2622 /note="codes the serine phosphorylation site and the 3'UTR" /number=5 BASE COUNT 589 a 792 c 898 g 343 t ORIGIN 1 ggcgacggcg gtggcggcgt cggaggcgcc tccgggggac ggtggcggct cccggcggtg 61 aggccgcgcc tgtccgggga tcgtcgaggg acggcgggag cttgggccag cggcggcggc 121 ggcctgggac gcaggcggag ccccgcgcag gcccaaggtc ccggaggcta tggcagcggc 181 taccatcgtg cacgacacgt ctgaggccgt ggagctctgc cctgcttacg gcttgtacct 241 gaagcccatc accaagatga ccatcagcgt ggcactcccg cagctgaagc agccggggaa 301 gtccatctcc aactgggagg tgatggagag gctgaagggc atggtgcaga accaccagtt 361 ctccacgctg cgtatttcca agagcaccat ggacttcatc cgcttcgagg gggaggtgga 421 gaacaagagc ctggtcaagt cttttctggc ctgcctggac ggcaagacca tcaagctcag 481 cggcttctcc gacatcctga aggtgcgcgc ggccgagttc aagatcgact tccccacccg 541 ccacgactgg gactccttct tccgcgacgc caaggacatg aacgagaccc tgccggggga 601 gcggccggac accatccacc tggaggggct gccctgcaag tggttcgccc tgaaggagtc 661 gggctccgag aagcccagcg aggacgtcct ggtcaaggtg tttgagaagt tcggggagat 721 ccggaatgtg gacatcccca tgctggaccc ctaccgggag gagatgacgg gccgcaactt 781 ccacaccttc agtttcgggg ggcacttgaa cttcgaggcc tatgtgcagt accgtgagta 841 catgggcttc atccaggcca tgagcgccct gcgcgggatg aaactcatgt acaagggcga 901 ggacggcaag gccgtggcct gcaacatcaa ggtttctttt gattcgacca aacacctgag 961 tgatgcctca attaagaagc ggcagctgga gaggcagaag cttcaggaac tggagcagca 1021 aagagaagaa caaaagcgca gagagaagga agcggaggag aggcagcgag cggaggaaag 1081 gaaacaaaag gagctggaag agctggagcg agagaggaaa agagaagaga agcttcgcaa 1141 gagggagcag aagcagaggg accgtgagct gcgccggaat cagaagaagc tggagaagct 1201 gcaggcggag gagcagaagc agctgcagga gaagatcaag ctggaggagc gcaagctgct 1261 gctggcccag aggaacctgc agtccatccg gctcatcgcc gagctgctca gcagagccaa 1321 ggctgtgaag ctacgggaac aggagcagaa ggaggagaag ctgaggctcc agcagcagga 1381 ggagcggcgg cggctgcagg aggccgagct gcggcgcgtg gaggaggaga aggagcgcgc 1441 gctgggcctg cagcggaaag agcgggagct gcgcgagcgg ctgctgagca tcctgcagag 1501 caagaagccg gacgacagcc acacacacga cgagctgggc gtggcacacg gacctgctgc 1561 agcccgtcct ggacatcctg cagaccgtgt cgtccggctg tgtgagcgcc accacgctgc 1621 accccctcgg gggccagccc ccggccggtg cccccaagga gagcccggcc cccccagagg 1681 ccgacggcgc tcccaaaagc gtgaacggga gcgtggccga ggaggcccca tgcaaggagg 1741 ttcagagctc ctgtcgtgtg gtccccgagg atggctctcc agagaagagg tgcccgggcg 1801 gcgtcctctc ctgcattcct gacaacaacc aacagcccaa gggcatccct gcctgcgagc 1861 agaatgtctc cagaaaggac acccggtcag aacaggacaa gtgcaaccgg gagcccagca 1921 agggccgggg ccgggccacc ggagacgggc ttgctgaccg gcacaagcgg gagaggagcc 1981 gggccaggcg ggccagcagc agggaggacg ggaggccacg caaggagcgg cggccccaca 2041 agaagcacgc ctacaaggat gacagccccc gccggcgcag cacgagcccg gaccacaccc 2101 ggtcccggag gtcccacagc aaagacaggc accggaggga gcggagccgg gagcggaggg 2161 gcagcgccag caggaagcac agccgccacc gccgccgaag cgagcggtcg cgctcccggt 2221 ccccgagcag gcaccgcagt acctggaaca ggtaatgacg ggcacggcct ccccacggcc 2281 tgtccgggaa agaccaggac ctgctcgagc ctcctggccg ctccttggcc gctctccgtc 2341 cacccctgca aagccaagac ccttctgcag ccacgaatgt ccacggagcc cgccggcagg 2401 aaggaagaca ccatgcttta gagatccatc tttctccact caccgcagcg tacttggcac 2461 ttcagtttca aacacgtagt cctttaaaac ttgatccgat agctttaatg cggccggtcc 2521 tctctcagtc aggaaaattg cacagaccga cagtcgtgag gatggcagag ctgctgcatt 2581 cccccacacg gggatttctg tgtctgcttg gcgacctcct ac // LOCUS HUMLUCA14 36545 bp DNA PRI 08-OCT-1996 DEFINITION Human cosmid LUCA14. ACCESSION U73167 NID g1613891 KEYWORDS interferon; tumor suppressor; semaphorin V. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 36545) AUTHORS Dante,M and Wamsley,P. TITLE The sequence of H. sapiens cosmid LUCA14 JOURNAL Unpublished (1996) REFERENCE 2 (bases 1 to 36545) AUTHORS Waterston,R. TITLE Direct Submission JOURNAL Submitted (02-OCT-1996) COMMENT Submitted by: Genome Sequencing Center Department of Genetics, Washington University, St. Louis, MO 63110, USA, and e-mail: sapiens@watson.wustl.edu NOTICE: This sequence may not be the entire insert of this clone. It may be shorter because we only sequence overlapping sections once, or longer because we provide a small overlap between neighboring submissions. This sequence was finished as follows unless otherwise noted: all regions were double stranded or sequenced with an alternate chemistry; an attempt was made to resolve all sequencing problems, such as compressions and repeats; all regions were covered by sequence from more than one subclone; and the assembly was confirmed by restriction digest SOURCE INFORMATION: This clone is from a chromosome 3 specific library, VECTOR: pWE15 Clone reference:Ming-Hui Wei et al, CANCER RESEARCH, 56,1487-1492,1996. NEIGHBORING SEQUENCE INFORMATION: The left clone is LUCA13;right clone is LUCA15, 200 bp overlap. Actual start of this cosmid is at base position 1 of HUMLUCA14; actual end is at 36545 of HUMLUCA14. FEATURES Location/Qualifiers source 1..36545 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="3" /clone="LUCA14" /clone_lib="LLNL3" /map="3p21.3" repeat_region complement(18..79) /rpt_family="ALU" repeat_region complement(125..415) /rpt_family="ALU" gene 673..1267 /gene="LUCA-1" CDS join(<673..762,950..1267) /gene="LUCA-1" /note="coded for by human cDNAs T81817, W84634, R98080, T82162, W84686 and U03056; H_LUCA14.1" /codon_start=1 /product="human tumor suppressor LUCA-1" /db_xref="PID:g1613892" /translation="DELEHSLGESAAQGAAGVVLWVSWENTRTKESCQAIKEYMDTTL GPFILNVTSGALLCSQALCSGHGRCVRRTSHPKALLLLNPASFSIQLTPGGGPLSLRG ALSLEDQAQMAVEFKCRCYPGWQAPWCERKSMW" gene 4286..5145 /gene="H_LUCA14.2b" gene 4286..4726 /gene="H_LUCA14.2a" CDS 4286..4726 /gene="H_LUCA14.2a" /note="coded for by human cDNAs R20196, R50893, H19883, R44982 and R50776; weakly similar to RNA1 polyprotein, tomato ringspot virus (SP:P29150)" /codon_start=1 /db_xref="PID:g1613893" /translation="MELILSTSPAELTLDPACQPKLPLDSTCQPEMTFNPGPTELTLD PEHQPEETPAPSLAELTLEPVHRRPELLDACADLINDQWPRSRTSRLHSLGQSSDAFP LCLMLLSPHPTLEAAPVVVGHARLSRVLNQPQSLLVETVVVARA" CDS join(4286..4711,4759..5145) /gene="H_LUCA14.2b" /note="similar to C. elegans predicted protein C56G2.15 (G726412); alternatively spliced form of H_LUCA14.2a" /codon_start=1 /evidence=not_experimental /db_xref="PID:g1613894" /translation="MELILSTSPAELTLDPACQPKLPLDSTCQPEMTFNPGPTELTLD PEHQPEETPAPSLAELTLEPVHRRPELLDACADLINDQWPRSRTSRLHSLGQSSDAFP LCLMLLSPHPTLEAAPVVVGHARLSRVLNQPQSLLVETVVGLEVFARARGFRKLHLTT HDQVHFYTHLGYQLGEPVQGLVFTSRRLPATLLNAFPTAPSPRPPRKAPNLTAQAAPR GPKGPPLPPPPPLPECLTISPPVPSGPPSKSLLETQYQNVRGRPIFWMEKDI" gene 6030..8640 /gene="H_LUCA14.3" CDS join(6030..6078,6129..6172,6221..7039,8027..8167, 8233..8495,8553..8640) /gene="H_LUCA14.3" /note="similar to human tumor suppressor LUCA-1 (G532974), hyaluronidase precursor and other glyclosyl hydrolases" /codon_start=1 /evidence=not_experimental /db_xref="PID:g1613895" /translation="MGRSWGSFPGFPCHVEGFHPWGMTTQLGPALVPERPFSVLWNVP SAHCEARFGVHLPLNALGIIANRGQHFHGQNMTIFYKNQLGLYPYFGPRGTAHNGGIP QALPLDRHLALAAYQIHHSLRPGFAGPAVLDWEEWCPLWAGNWGRRRAYQAASWAWAQ QVFPDLDPQEQLYKAYTGFEQAARALMEDTLRVAQALRPHGLWGFYHYPACGNGWHSM ASNYTGRCHAATLARNTQLHWLWAASSALFPSIYLPPRLPPAHHQAFVRHRLEEAFRV ALVGHRHPLPVLAYVRLTHRRSGRFLSQDDLVQSIGVSAALGAAGVVLWGDLSLSSSE VIIAPLSLPCSTCWGPREECWHLHDYLVDTLGPYVINVTRAAMACSHQRCHGHGRCAR RDPGQMEAFLHLWPDGSLGDWKSFSCHCYWGWAGPTCQEPSLGLKKHPGTTLSHSCSI QFTVNPPKHTPRFPWNP" repeat_region 7317..7445 /rpt_family="ALU" repeat_region complement(7446..7729) /rpt_family="ALU" repeat_region 7730..7893 /rpt_family="ALU" gene 8829..13556 /gene="SM15" CDS join(8829..8854,8900..9159,9207..9265,9320..9530, 11068..11187,11275..11359,11453..11577,11661..11818, 11985..12035,12119..12300,12387..12492,12810..12947, 13036..13194,13243..13338,13476..13556) /gene="SM15" /note="coded for by human cDNAs U09585, R46154, T23833, R48311 and T90376; H_LUCA14.4" /codon_start=1 /product="human interferon-related protein SM15 (U09585); final exon similar to partial sequence of human EST R48415, but would require alternative splice" /db_xref="PID:g1613896" /translation="MGPKKSWRSHFFPLPEVWLLLLLSQALSPSHGPQTQAGVGLVWS VPLPSALRSQSCLGAPLRDASELTTVHLFPTRGWGARRALWVHSSASASSAASRRRLR AQASGISSFSLVDGGAPREDGGARGVWLPSSGQVSAQRTGRRLVGLEPTPTGSLTPRP PRPVPGMPRARKGNTLRKGGQRRGGGARSSAQADSGSSDDEAASEARSTASECPSLLS TTAEDSLGGDVVDEQGQQEDLEEKLKEYVDCLTDKSAKTRQGALESLRLALASRLLPD FLLERRLTLADALEKCLKKGKGEEQALAAAVLGLLCVQLGPGPKGEELFHSLQPLLVS VLSDSTASPAARLHCASALGLGCYVAAADIQDLVSCLACLESVFSRFYGLGGSSTSPV VPASLHGLLSAALQAWALLLTICPSTQISHILDRQLPRLPQLLSSESVNLRIAAGETI ALLFELARDLEEEFVYEDMEALCSVLRTLATDSNKYRAKADRRRQRSTFRAVLHSVEG GECEEEIVRFGFEVLYMDSWARHRIYAAFKEVLGSGMHHHLQVRGRTGRGHLNNELLR DIFGLGPVLLLDATALKACKVPRFEKHLYNAAAFKARTKARSRVRDKRADIL" repeat_region 10438..10727 /rpt_family="ALU" repeat_region complement(15895..15966) /rpt_family="ALU" repeat_region complement(16098..17299) /rpt_family="ALU" repeat_region complement(17534..17822) /rpt_family="ALU" repeat_region complement(17944..19144) /rpt_family="ALU" repeat_region 19459..20060 /rpt_family="ALU" repeat_region 20086..20683 /rpt_family="ALU" repeat_region 20798..21145 /rpt_family="ALU" repeat_region 21299..21862 /rpt_family="ALU" repeat_region complement(23770..24059) /rpt_family="ALU" repeat_region complement(24190..24475) /rpt_family="ALU" gene complement(25040..32502) /gene="semaphorin V" CDS complement(join(25040..25444,25902..26041,26131..26186, 26287..26444,26693..26734,26812..26903,27164..27383, 27689..27833,27918..27987,28098..28209,28301..28446, 30264..30383,30563..30656,30735..30854,31251..31311, 31460..31617,32395..32502)) /gene="semaphorin V" /note="coded for by human cDNAs T48905, U33920, AA031717, R74494, R35662, T59295, N26818, R74607, T48906, N32021, T59254, R36632, R35662 and AA031718; H_LUCA14.5" /codon_start=1 /product="human semaphorin V mRNA (U28369)" /db_xref="PID:g1613897" /translation="MGRAGAAAVIPGLALLWAVGLGSAAPSPHAFGSPSKSSRPGMVS RLSAWSEPAATRPCWWMRSVDACLWVPRTMWPPSTWTTSASGPRSWPGRPLWNGERSA TGQGRTLTECMNFVKLLHAYNRTHLLACGTGAFHPTCAFVEVGHRAEEPVLRLDPGRI EDGKGKSPYDPRHRAASVLVGEELYSGVAADLMGRDFTIFRSLGQRPSLRTEPHDSRW LNEPKFVKVFWIPESENPDDDKIYFFFRETAVEAAPALGRLSVSRVGQICRNDVGGQR SLVNKWTTFLKARLVCSVPGVEGDTHFDQLQDVFLLSSRDHRTPLLYAVFSTSSSIFQ GSAVCVYSMNDVRRAFLGPFAHKEGPMHQWVSYQGRVPYPRPGMCPSKTFGTFSSTKD FPDDVIQFARNHPLMYNSVLPTGGRPLFLQVGANYTFTQIAADRVAAADGHYDVLFIG TDVGTVLKVISVPKGSRPSAEGLLLEELHVFEDSAAVTSMQISSKRHQLYVASRSAVA QIALHRCAAHGRVCTECCLARDPYCAWDGVACTRFQPSAKRRFRRQDVRNGDPSTLCS GDSSRPALLEHKVFGVEGSSAFLECEPRSLQARVEWTFQRAGVTAHTQVLAEERTERT ARGLLLRRLRRRDSGVYLCAAVEQGFTQPLRRLSLHVLSATQAERLARAEEAAPAAPP GPKLWYRDFLQLVEPGGGGSANSLRMCRPQPALQSLPLESRRKGRNRRTHAPEPRAER GPRSATHW" repeat_region complement(28899..29449) /rpt_family="ALU" repeat_region complement(29476..29749) /rpt_family="ALU" gene 34254..34904 /gene="H_LUCA14.6" CDS join(34254..34462,34856..34904) /gene="H_LUCA14.6" /note="coded for by human cDNA F19257" /codon_start=1 /db_xref="PID:g1613898" /translation="MGTREGLVGTTPPILGLKAEGCPRWRSQKPGGQPRQGRKKPQLQ GLSRQRESLSGGTTRGLQYLNLSLHRLEVGGTTPTKPRAEN" BASE COUNT 7494 a 10603 c 10439 g 8009 t ORIGIN 1 gatccgggcc ccccctcgat ccatccgcct cggcctccca aagtgctggg attacaggcg 61 tgagccaccg cgcccggccc ccaaccttgg gacattttca tccattcatt catccttttt 121 tttttttttt ttttgagacg gagtcttgct ctgtcaccca ggctggagtg caggggcaag 181 atctcagctc ctgcaccctc caccttccgg attcaagtga ttctcctgcc tcagcctccc 241 aagtagttgg gattacaggc atgccatcaa catgtctggc taatttttgt atttttagta 301 gaaatggggt ttcaccatgt tggccaggct ggtctcgaac tcctgacttc aggtgatcct 361 cccacctcag cctcccaaag tgctgggatt acaggtatga gccaccgcgc ctggcgcatg 421 ggcacatcca ttgagtgtgc acttggtgcc aagttctgtg ccaggcacag gcaattcaac 481 atttattgga atgatgtagt ccctgtctgc atggaattca taggctagag gaggaagcag 541 tttgcctctg gtcccatggc cagagcagcc ccaggtgaag gttatgaatt atttgtccca 601 tctaatggtg ttccagcagt ctgccacatg gtgggaagga ggccccacag agctgtgctg 661 tctccttccc aggatgagct ggagcacagc ctgggggaga gtgcggccca gggggcagct 721 ggagtggtgc tctgggtgag ctgggaaaat acaagaacca aggtgagctt aggcctggca 781 tgagggtggg ggtgggggag gggtggggcc attaagctga cggggtagac cctgacttac 841 cctttctacc tgcaaagtcc tggctgacca gcaggtgagt gcctcagtgc cctgggtggg 901 tccatacatg gccatggtgt ccctgacgct atcctccctt cccacctagg aatcatgtca 961 ggccatcaag gagtatatgg acactacact ggggcccttc atcctgaacg tgaccagtgg 1021 ggcccttctc tgcagtcaag ccctgtgctc cggccatggc cgctgtgtcc gccgcaccag 1081 ccaccccaaa gccctcctcc tccttaaccc tgccagtttc tccatccagc tcacgcctgg 1141 tggtgggccc ctgagcctgc ggggtgccct ctcacttgaa gatcaggcac agatggctgt 1201 ggagttcaaa tgtcgatgct accctggctg gcaggcaccg tggtgtgagc ggaagagcat 1261 gtggtgattg gccacacact gagttgcaca tattgagaac ctaatgcact ctgggtctgg 1321 ccagggcttc ctcaaataca tgcacagtca tacaagtcat ggtcacagta aagagtacac 1381 tcagccactg tcacaggcat attccctgca cacacatgca tacttacaga ctggaatagt 1441 ggcataagga gttagaacca cagcagacac cattcattcc atgtccatat gcatctactt 1501 ggcaaggtca tagacaattc ctccagagac actgagccag tctttgaact gcagcaatca 1561 caaaggctga cattcactga gtgcctactc tttgccaatc cccgtgctaa gcgttttatg 1621 tggacttatt cattcctcac aatgaggcta tgaggaaact gagtcactca cattgagagt 1681 aagcacgttg cccaaggttg cacagcaaga aaagggagaa gttgagattc aaacccaggc 1741 tgtctagctc cgggggtaca gcccttgcac tcctactgag tttgtggtaa ccagccctgc 1801 acgacccctg aatctgctga gaggcaccag tccagcaaat aaagcagtca tgatttactt 1861 agctgtttac tgagcgccta aaatgtccag gccccggtat ccaaagggca caggaaaggc 1921 ttgtcactct caccagatct gacgtcagtc taagggcggc tgcttcccag ggtggggccc 1981 aaatttcaag aggaagcccc acctgatcgg accgtttcca agctgtcctg gggcaggagg 2041 ccggcgggaa ccgaagctag acttgtcaga gcctggggcg gggctggcgg aggcagagcc 2101 agctggaggc ggggcctgcg cctggctgaa gtgacgtgct gggctggggc agggcctgga 2161 agaagccaaa tcgaggcggg acctaagctg tgacgcaagg agaaggagga ggggcctgta 2221 tggggccggg tggggaggag gcggactcaa atggagatgg ggtggggctc agcaggggcg 2281 gagccagggt agagactgag tcggcgcctg gcgggtgttg tgacgcgcaa accaggggcg 2341 ggttctaagc ggcgacgcgc gggtgctggg aggccttaga acgccgtggc gtgccgcagg 2401 acgcgacggc tgcagaacat ccgccgcacc gctgggggcc aacgttgtcg gaccgaggag 2461 tccgaggtgg cgactgagtg agatacccag ctgtgcggag ctaggacgcg gaacatccca 2521 gaggccagca tcaacatgtc agtaccgcac ccttcccatt tgctgcctgt aggtccaccc 2581 cgcgcgtgcg gtccccaagt ggccatgcca ccctgcacgc cctccccgcg tcagtcgttc 2641 gtttccagac catgccgagc cacacctgcc cacagagtcg agctctcgaa gtcctcattc 2701 ccaaaagctc cacagcctgt gacccactag gagggagggg gaaaccgagg cccgggagag 2761 acggagaagc cactctctga ggaccccgga gccccgccca cagtgggtgg ggtcggcact 2821 gaaggggtta aggccgcggg gaagccccgg gctcagccta ggggcggatc ctggggcttc 2881 ctccctctag accaaagggt gcggctgctg cagaggtggc tgatgcaggt gaggggctta 2941 cagccctgtg aatggtgtgg gaggtgaggg gaggggggtg cctgttactg accagtgttt 3001 aattggcctc aagctcagga gaggtaaggg gagccaggag gccccctctg gagcaggtga 3061 gtagaaggag ggggctcagc tgcccgccca cgactagaaa acccccagca tagacctgcc 3121 ggacggcagg accttggtta agatgggaaa acagtagggt ttgggcacag gaggtctggg 3181 cccccaacct ggtgaccgtt ggacaccctg tggatatagg aaggaagctg tggtccttat 3241 ctagggtgcc tgcgaaactc aagccagtgg ggaaccaagc ctgcagaggc aatactgctg 3301 tgtgactgtg ggccaggccc agtgctggtg ctgctggggc cccatctgag gagacagttg 3361 ggtctatatc caaagtgtcc tgagatgggc agagttggga agggggccat gggctccagg 3421 tggggaaggc agtcaagaca ggccaaagct cagcctgatg ggtgcatgag tggggcccca 3481 ggatccacta aggcagtggg catagcccag gtagttgaaa aggctgggag ctggaggaga 3541 ctcggaacat agggagaaga aagggacagg aagactcccg ggcagttttt cctatggagg 3601 gaaccttggg taggtttgag cagggggtat agtgtctgct ttgcatcttc tatcccatgc 3661 cctagtggtg gactaggagg actactggaa gacagaaggc cagcaagtag attgggtcgg 3721 ctgttttata gacctggtgg ttacagaccc cctgtgaggg gaggtcctga ggaggacaaa 3781 tttgtagcca gagcaggctg gggccaggac tcctaggttc tgccccagcc ttaagcagct 3841 catccatcac tcccagcctc cctccaatgt ggggtagaca cctggatcta tctgccctgc 3901 tctatagggg atgggtctct tccaggtcac ctgtgctatc aggtgagtgt gggacaaagt 3961 gcagcccacc actccccagt tccagtatgc atggtccccc ctgcactctg tcagcccttc 4021 cctgctccaa ccccaccctc cctcctggca cagcccagca tggctgttat ctacaggagg 4081 agagccactg tggtctgaag agatgctact gcctggagac tggggtctta acctccagcc 4141 tggcactgag ccacctgcaa cctagcagca ggtgaccttg gctcccagcc tgactcagct 4201 gaacctggat cctgtgcata ggcaagagct gactctgagc cctggcccag ccaagctgac 4261 ccctacacta gaccctacac accggatgga gctgatcctg agtaccagcc cagctgagct 4321 gactctggat cctgcgtgcc agccaaagct gcccctggat tccacatgcc aaccagagat 4381 gaccttcaat cctggtccaa ctgagcttac cctggatcct gaacaccagc cagaggagac 4441 cccagctcct agcctggctg agttgaccct ggagcctgtg caccgccgac ccgagctcct 4501 ggatgcttgt gctgacctca tcaatgatca gtggccccgc agccgcacct cccgcctgca 4561 ctccctgggc cagtcctcag atgccttccc cctctgcctg atgctgctaa gcccccaccc 4621 cacacttgaa gcagcacccg ttgtggtggg ccatgcccgc ctgtcacggg tgctgaacca 4681 gccccagagc ctcttagtgg agacagtggt ggtggcccgg gcctgagggg ccgtggcttt 4741 ggccgccgcc tcatggaggg cctggaggtc tttgctcggg cccggggctt ccgcaagctg 4801 catctcacca cccatgacca ggtgcacttc tatacccacc tgggctacca gctgggtgag 4861 cctgtgcagg gcctggtctt caccagcaga cggctgcctg ccaccctgct taatgccttc 4921 cccacagccc cctctccccg gccacccagg aaggccccaa acctgactgc ccaagctgcc 4981 ccaaggggtc ccaagggacc tccattgcca ccaccccctc ccctacctga gtgcctgacc 5041 atctcacccc cagttccatc agggccccct tcaaaaagcc tgctggagac acaatatcaa 5101 aatgtgaggg ggcgccccat attctggatg gaaaaagaca tctgaggcca tccagggcaa 5161 ggaactgtct ttctggttca atagactgcc ccgacagtct acaagcctca gcccactgac 5221 catacctcag cccctagccc ctggggggca gctttaacct gggcatgttt cctgggtacc 5281 agtggggcca ggaggtggct ctggctcaga gccgtcagtg tggctgaata aaggctctct 5341 tgggtatggc tgtgacagtt tatttgttga tccccctacc ctcacctctc acctcttctg 5401 caggtccctt atccctgcag aagtctccag atccaccttg gccctgaggc cattgatggg 5461 aggatgcctg tcctttgcct ttacccccca cctggctcag gagacagggt ggctgttttc 5521 ttccccattc actcattacc attcactgag cacctactgt gtgtcaagcc ctggacggga 5581 cataggcaat gggtaactag acaaacaggc atacagtagc aggatccatg tggcacaggg 5641 gaggtacaga ggctttggga acccaagtga cttcactcca cctggggatc caggagacct 5701 cccagggcag tgatgtcaca gcagagacct gagtgctagg taggaattaa accaggcagg 5761 tgaggaggtg ggagctgact gttcttggaa gagggaacaa ggtgggcaga ggaaagaagg 5821 ggacttgtga cagttgtggg aggacacagt ggtgtattga caaagacgga acatgggagg 5881 gaatgtaccc tcagctcact gtaaagcccg ctttggtgtg cacctgccac tcgatgcccg 5941 gggcatcata gccagcttgc ggtgtggctg ctttaaaagg cccaagagac ccctggggaa 6001 acatgctttc cccagctcct cctgtaagga tggggaggag ctgggggagc tttcctggct 6061 ttccctgcca tgtggaaggt gtggccatag ctgcggactc taagcttaca cccccctctc 6121 tcctgcaggt ttccatcctt ggggaatgac cacgcaactg ggcccagccc tggtgctggg 6181 ggtggccctg tgcctgggtt gtggccagcc cctaccacag gtccctgaac gccccttctc 6241 tgtgctgtgg aatgtaccct cagcacactg tgaggcccgc tttggtgtgc acctgccact 6301 caatgctctg ggcatcatag ccaaccgtgg ccagcatttt cacggtcaga acatgaccat 6361 tttctacaag aaccaactcg gcctctatcc ctactttgga cccaggggca cagctcacaa 6421 tgggggcatc ccccaggctt tgccccttga ccgccacctg gcactggctg cctaccagat 6481 ccaccacagc ctgagacctg gctttgctgg cccagcagtg ctggattggg aggagtggtg 6541 tccactctgg gctgggaact ggggccgccg ccgagcttat caggcagcct cttgggcttg 6601 ggcacagcag gtattccctg acctggaccc tcaggagcag ctctacaagg cctatactgg 6661 ctttgagcag gcggcccgtg cactgatgga ggatacgctg cgggtggccc aggcactacg 6721 gccccatgga ctctggggct tctatcacta cccagcctgt ggcaatggct ggcatagtat 6781 ggcttccaac tataccggcc gctgccatgc agccaccctt gcccgcaaca ctcaactgca 6841 ttggctctgg gccgcctcca gtgccctctt ccccagcatc tacctcccac ccaggctgcc 6901 acctgcccac caccaggcct ttgtccgaca tcgcctggag gaggccttcc gtgtggccct 6961 tgttgggcac cgacatcccc tgcctgtcct ggcctatgtc cgcctcacac accggagatc 7021 tgggaggttc ctgtcccagg taagtggaag ctgaggtcta ggggtctgag tcaggaggta 7081 tggccctgtt ctaggaaggt cccaggagaa gaccttgcct aggggttggt cctaatctag 7141 gcgttctcag cggaaggccc tgtcctggag catctgatga gggaggcatg gctttgccct 7201 ggaagatgtt gaggggtagg gaggcccagc ctttggagtg ctggcctgag gggagatatt 7261 ctgagtggaa ggcctgagaa gccactcaga gactcgatac aaaggggccc tacttgggct 7321 gggtgagtgt ctcacaccta taatcccagt actttgggag cctgagatgg aaggattcct 7381 tgaggccaag agttcgagac cagcctgggc aacatagtga gaccccaacc tctaccaaaa 7441 aaacatttga gacagagtct tgctctgttg cccaggctgg agtgcagtgg caccatctca 7501 gctcactgcg gccttcactt cctggttcta gtgatcctcc cacctaagcc tcctcccgaa 7561 tagctggact acagatgcat accaccacac ccagctaatt tttatatttt tgtagagatg 7621 gggttttgtc atgttgccca ggctggtatt gaactcctgg gctcaagtga ttctcccgct 7681 tttgcctccc aaagtgtggg attacaggtg tgagccactg tgcccagcct aaaaaattat 7741 tagccaggtg ttatgcacct gtagtcccag ctgcttggga ggctgaggtg ggaggattac 7801 ttgatcccag gagttcaagg ctgcagtgag ctatgatcat gccactgcac tccagcctgg 7861 atgacagagt aaaaccccgc ctctaaaacc aaaccaaact agagagtccc tactgtagag 7921 ttagaatcag attgtgacag tgaaccagaa gagtttcttg aatgtggatg tgtgcccaca 7981 tggatctggc cagcccagca ctggcttagc agctctctgc ctccaggatg accttgtgca 8041 gtccattggt gtgagtgcag cactaggggc agccggcgtg gtgctctggg gggacctgag 8101 cctctccagc tctgaggtga tcattgcccc tttgagcctg ccatgtagca catgctgggg 8161 tcccagggtg ggggacggcc atgtcaagat tatagagcag gcatattgac acatatcttc 8221 ccttctcctc aggaggagtg ctggcatctc catgactacc tggtggacac cttgggcccc 8281 tatgtgatca atgtgaccag ggcagcgatg gcctgcagtc accagcggtg ccatggccac 8341 gggcgctgtg cccggcgaga tccaggacag atggaagcct ttctacacct gtggccagac 8401 ggcagccttg gagattggaa gtccttcagc tgccactgtt actggggctg ggctggcccc 8461 acctgccagg agcccagcct gggcctaaag aagcagtata aagccagggc ccctgccact 8521 gcctcttctt ttccctgctg ccacttttcc agtcctggaa ctactctgtc ccactcttgc 8581 tctattcagt ttacagtcaa ccctcccaag cacacacccc gcttcccttg gaatccctga 8641 ggggtagaag gggccagaaa aaacgcttat aaaaccagag gccctctgag atcatgtgag 8701 tcctccatgg caaggaagca gttccaggga gagtcaggtt ccagctagtt agggctgcca 8761 gcctagggct ttgtgcctac acctcactaa gcccatggag aggtcacaga tgggccgtgc 8821 acgggcagat gggccccaaa aaatcttggc gaaggtcggt aaagtgctaa gctgttgtct 8881 gcactctttc atcataaagt cacttttttc cactgcctga ggtttggctg ttgctcctgt 8941 tatcccaagc tctaagccct tcccatggtc cccagaccca ggcaggggta ggtctcgtct 9001 ggagtgtccc gctgccaagt gccctgagaa gccagtcctg ccttggtgct ccactgaggg 9061 acgcttcgga gttaaccacc gtgcacttgt tcccgacgcg gggctggggc gcgcgcaggg 9121 cattgtgggt gcatagttca gccagcgcgt cgtcggctgg tgggccccag gcgtggacta 9181 ccattcccat ggtgctctac gcgcagctag ccgccgtcgc ctgcgcgctc aggcctctgg 9241 gattagtagt tttagcctcg tggatgtggg cctgaatcgg atggcctgga actcgccttc 9301 ccggcgacct gtttggcagg gcggggcgcc tcgcgaagat ggtggcgcgc gtggcgtgtg 9361 gctcccgtcg tctggccaag tctcagcgca gcgcaccggc cggcgtctcg ttggcctgga 9421 gcccacaccc accgggtccc tgaccccgcg ccccccgcgc ccggttcccg gcatgcctcg 9481 cgcccgtaag ggcaacacgc tccggaaggg tggtcagcgc cgtggaggag gtgagtgggg 9541 tggggggcgc gggacagctg tatgagcggc gggcgggggt cctgctggat ccccttggtt 9601 gttctcgagg gccctgggtg gggaaccccc tcggccagcc ccacgcagct tcccataact 9661 ctaccgaggc tggcacgtgc cccagcagtc attggccacg tgcgtgccca acttgagctc 9721 gctgcccact gcctgttggg agtccgcgct cccagcggtg acccggccag cccctccggg 9781 cctcagttgc ctgctgggtg gacaaactca ccggacaggg caggaagcca gtcctcttgc 9841 caaaccctga tcctttcctg gactggtcaa ggaagttggc gggagtcctg gcctgctccg 9901 gtgaagagag ggagggggct ctgttgggaa gctacctggt tagggccttg gcagcttggc 9961 aggcctgact ccacaaggta gctgagtatc ctggagcctg aggcagtctt gggactcgct 10021 tatcagctta gaccaggagc ctatcttaat gcgttaaaca catatgttta ctctgctggg 10081 atggggtgcc taccacgcac aaatttattt atttttattc ttgaggggga agggaggagg 10141 gccgggcaca ggaaggagga aaatacctag gcacaacttt aaactggtca ccaccatggt 10201 gggttttggg tgggggacat ctaatagggt gtggaacatg ttcactgagt ggtgtcagcc 10261 ccagacttag gactcaggga ggcctggggg gaagtgcttc tgggtgagtc ctgagaggat 10321 ggagaagaga gcaagttgta caaagacctg gcagctggag ggaccccagc gcagtggagt 10381 gaatgaggga aggctgggct gctccggcag gtagaatagt aagataaaat gggagcaggc 10441 tgggcgtggt ggctcacacc tgtaatccca gcactttggg aggccaaggc aggcagatca 10501 cctgaggcca ggagttcgag accagcctga ccaacatgga gaaaccctat ctactaaaaa 10561 tacaaaatta gctgggtgtg gtgtcgtatg cctgtaatcc cagctactcg ggaggctgag 10621 gcaggagaat cactggaacc cgggaggcag aggtcgcagt gagccgagat ggcgtcattg 10681 cactccagcc tgggcaacaa gagcaaaact ctgtctcaaa aaaaaaaaaa aaaaaaaaag 10741 gtggcgggga gcagaggcca gtgcgcaggc tggactttgt cctgcagatg aggggtgtga 10801 gtagttctta gaagtagggc cagcctggtg atcacaccct gtccaaaact ctgccacagc 10861 agccaacctt gccaatgtca gcccttggcc ctggacagac cccacacaca ggagtgcacc 10921 agccctactg aactgaccct tggctttggt ttttcatctc tgagagggac accattctat 10981 cttttgggat tatgaggtcg agtgaggtac ccccagccat ccctcatcag tagaggcaag 11041 ttgagtgtcc tattccacct tctccaggtg cccggagcag tgcccaagct gactcgggtt 11101 ccagtgacga tgaggcagcc agtgaggccc gcagcaccgc cagtgaatgc cccagccttc 11161 tcagcaccac tgcagaggac agccttggtg agagcgggtg gaagtttgac aggggcttgg 11221 tgagggctcc atgggctgag gacaagaagc ggtgctgacc aggtggcctt gcaggggggg 11281 atgtcgtgga tgagcagggc cagcaggaag accttgagga aaagctgaag gagtatgtgg 11341 actgtctcac agacaagagg tacccctggc tgccagccaa ctcctacacc cagctccaag 11401 tgtgatcaag ggagggctgg cccatatgac cccccttctc gacctccccc agtgccaaga 11461 cccggcaggg tgctcttgag agcctgcgcc tggccctagc gtcccgccta ctccccgact 11521 tcttgctgga gcgccgcctc acgctagccg atgccctgga aaagtgcctc aagaaaggtt 11581 ggacctgggg gtgtgtggga gacttaaact gggcagacac tggcccttgc tgcatgggct 11641 gactggaaag catcccacag ggaagggcga ggaacaagcc ctggctgctg ctgtgctagg 11701 cctgctctgc gtgcagctgg gccctggacc taagggtgag gagctgtttc acagcctgca 11761 gcctctgctg gtctctgtgc tcagtgacag cacagctagc cctgctgccc ggctccacgt 11821 gagtgtgcct gtgccccatg aaacccttcc tgcaccttat ccctcagcag agtggtgggt 11881 tccccctatc ttcagcctcc tttactctga ggggagtgag ctccagggct gggaacccag 11941 gttcacccgc tgaccgtggc attgcattgc ccttctccca acagtgtgct tctgcccttg 12001 gcctgggctg ctacgtggct gccgctgaca tccaggtgag gggtctttgg gcacaggtgg 12061 tagagcatct agggctgtaa ctctgcctct gagctcccct gcctctctgt gctcctagga 12121 cctggtctct tgccttgcct gcttagaaag tgttttcagc cggttctatg gcttgggggg 12181 cagctccaca agtcctgtgg ttcctgccag cctgcacggc ctgctctctg ctgccctgca 12241 ggcctgggca ttgctgctca ccatctgccc tagcacccaa atcagccaca tccttgacag 12301 gtaggggtgg ctgtccactg ggagggggag gggatctcaa agaggccccc aagccacaca 12361 tatagctcag cctgcccctt ccctaggcag ctgccccggc tgccccagct cttgtccagt 12421 gaaagtgtga acctgcggat cgctgccggt gaaaccattg cactgctctt tgagcttgcc 12481 cgggaccttg aggtgcgagg gacaaggatg gggggtgctt ggtgacacca cctgcccatc 12541 acaggctgga tgcagggggt gccacacaaa acagaacagc tttaggtcat tatgcagagg 12601 aggtggcccc aaaacagatt tatctcctag atgtcatgat gggtgccctc agcagtggtg 12661 tcctggcctg acagaggcca aggaggggtc aaaggggcca ggcagagaag agagggtctc 12721 tcagtgaaag gaggggtttg ggcagtgccc tgttcagagc cagcagagct caagcatcta 12781 ccacacaccc tccaatgctc ccattgcagg aggagtttgt ttacgaggac atggaggccc 12841 tctgcagtgt cctgcgcact ctggccactg acagtaacaa gtaccgtgcc aaggctgatc 12901 gtcggcgcca gcgctctact ttccgcgccg tgctgcactc cgtggaggtg tgtgtgagaa 12961 catatgtgtc ctagcaaggg tgcaccccca ggcatagcag ccaagcccag ttgtgttggc 13021 acctctaccc tgcagggcgg tgaatgcgaa gaagagatag tgcgcttcgg ctttgaggtg 13081 ctctacatgg acagctgggc tcggcaccgg atctacgctg ccttcaagga agtgctgggt 13141 tcgggcatgc accaccacct ccaggtgcgg ggacggacag ggaggggaca tctggtgtgg 13201 ttgcttcagt ctggcctgag ctcactgccc tctgcccccc agaacaatga gctactccgt 13261 gacatctttg gcctgggccc tgtgctgttg ctggatgcca ctgccctgaa ggcctgcaag 13321 gttccacgct ttgagaaggt ttgcaccctt gggcaccttt ctcttccccc tattcccatt 13381 tcctggaggc ctggaattct gtagaggccg gaagaggacc cccagccctt tcccttccca 13441 gctccccagg gtgtcactct ctgtccccac tctagcacct gtacaatgct gctgccttca 13501 aagcccggac caaggctcga agccgtgtgc gggacaagcg ggcagacatc ctgtgaagca 13561 ggacctgctg aagaggagac tttctatgcc cttggtccgt atttttaaca gaagacagtg 13621 caacaactgg tctccaccag tatttgtcac tttatttttt ttaatgacaa aaccaaaaac 13681 agacatgggg tgggtagctg ggggcccgga cacttgggac cctgacccct ttgtccctgc 13741 actcagccct gtggcccctt cctgtcctgt ctcaggccag gctaaatatg tgccttcctc 13801 agggctgtgg ggcaggcact agggggcctt tcccttcctt tcctttctca ggccttgctc 13861 ccccaggatg acccactctt aggggggtgg tggcatctgg acaaatgcca ccacagcagg 13921 tggggtggca aagctacctg gaatggattt gtgtgctgat ttttaaggat tattacagat 13981 aattaaacag aacggtcagc cttctgtggt cttaacccct gggtattttt ctgttctccc 14041 tccccatcta ctatcccagg cttgggccca actggtcttc cacgatgtca cctttgccct 14101 ccaaggcagt cttccccagg tggtgcctct ccccctactc agagcccagc ctgtctttaa 14161 gagctgaggc tggaccctca ctgggagccc tggcagagtt tgggtgattg ctatgggggc 14221 agctatttct agacttcaga acctgccatc tggggtggcc agagagtgtt gacaggccac 14281 aggaggagcc aggaggctgg tgccccttcc cctgaccttg ggccacccaa agcgaggctt 14341 tggcaccaga ggcttggcta ggcctggctt gaagagatca ggagagggag gcagccatta 14401 agttaacaac aggttcttgt acaaaaatct caccaaggaa atagtgtaga tgtggcagcc 14461 agcagtaggg aaggagagac ctgcccatag ccactttatt cccccaccaa cacacacccc 14521 caggccccag atccaaatgg catctcagct gggtgcttgg gcctcactgg agttgagcct 14581 ccgaagctgg ctgaggctgg ccaagcggag tttgagtagt gtctcctcct gcgtgcggag 14641 cgtgtgtgcc aggatgcgca gggattcact ctgcagcact gtgaatggga agtgggggat 14701 catcagacca tgtgccaccc cgtgacccct gcagacccag cccaggacca ggggcagcca 14761 cgggcctccc gccccatcct tcactgctgc agcatctgca gttactcctt ccagctgcac 14821 agaagggcct cccagtcccc actgccttgc tttccattcc tactagcacc ctatgcgtcc 14881 ataccgctca ggtagacagc caggagtgcg agcactaggc aagtgagcac cagcagcgcg 14941 agcagcagca ggaaccctcc tcgtctgtac acagggctgc agccagcctg ggcacacagt 15001 gacggcggca aaacgcccag cagctcatcc cacggtcgtg cctcttcagt tagatagggg 15061 cgcagtgtgc ctgctgggtg gatgggacat tgggagggtg gtgggtgcac ctgctttcgt 15121 cccctccccc agcacatcat ccaccccact gtccccactc acctccacta tgtaggtcgc 15181 tgatggactc cacctggtgc aggcatacct catgcacgtg gttgggggcc aatggccccc 15241 tgctcctctg gctgggcatc attggcgcca cggagtctgc aggggcagcg caggctgatt 15301 tagccagtgg tgggtcaacc cccttccttc taccaggtgg cttcaggcag caaggacttg 15361 cggatgatag ggatagtggg acatttggtt ccaaatcccc taaaactgac tcaggaatta 15421 cccttgattt ggatccagga tgatcccaga cccttacagg cttattctgt atacaggagc 15481 aacaggtaca tttgatttgt ttccttccaa gcaccccagg gactcactca ggcattcctc 15541 tgcctgctta tttctagcag tttatcagcc ttgaatgcta aacctgcagc aaatggaatt 15601 cagtcctgcc tcttttgtca gttctccctg cagacccaat gcctggcatc ttctggtcag 15661 aaagccctat tgtagccact ctggtgcctt tccctcctgg ccttggttct gtgtttgtta 15721 acatactcag tgtgactctg ctgtacatgt gtgaaccctg ctgttttcat tcaacattta 15781 ctgttgggga gacccttctt tgttatactg ttgcacatgc ctctgaggtc tctcccttca 15841 cacatctccc tgaggctgcc tgggcatcta ggttgtttgc agtccccgcc cccctttttt 15901 ttttgagacg gagtttcgct cttgttgccc aggctggagc gcaatggcgc gatcttggct 15961 caccgctgca gttccctttt gaaattaacc cccaggacat ccaccgttga atatgaaggt 16021 tttttcccaa tttcctgata gtttaattcc cagaagtgga attactggat ccaagagcag 16081 ggatttttgt ttgtttgttt gttttttgag atggagtctc cctgtattgc ccaggctgga 16141 gtgctatgtc gtgatctcgc ctcactgcac cttccacctc ctgggttcaa gcgattctcc 16201 tgcatcagcc tcctgagtag ctgggattac aggcgtgtgc caccacgccc agctaatttt 16261 tgtatttttg gtagagatga ggattcacca tgttggccag gctggtctgg aactcctgac 16321 ctcaagtgat ctgcccgcct cggccttcca aagtcctggc atttacaggc atgagccact 16381 gcatttggtc gttttttgtt ttggtttggt ttttttaaga tggagtctcc ctctgtcgcc 16441 caggctggag tgcaatggca agatctcggc tcactgcaac ctctgcctcc cgggttcaat 16501 cagttctctg cctcagcctc ctgagtagct gggattacag gcgccttcca ccacacccag 16561 ctacttttta tatttttagt ggagatgggg tttcaccatc ttggccaggc tagtcttgaa 16621 cccctgacct tgtgatccac ccacctcggc ctcccaaagt gctgggattt acaggcgtga 16681 gccaccgtgc ccggcctgtt tttttgtttt tgagacagag tcttgctctg ttgcccaggc 16741 tggagtgcag tggcgcaatc ttggctcatt gcaacctcca cctcctgggt tcaagtgatt 16801 ctcctgtttc agcctcccaa gtagctggga ttacagatgt gtgccaccac gccctgctaa 16861 tttttgtatt ttcagtagaa accaggtttc accatgctgg ctaggctggt ctcaaactcc 16921 tgacctcaag tgatccgccc gcctcagact cccaaagtgc tgggattaca ggcgtgagcc 16981 atcgcgcctg gcctgagatt tttgttttgt ttttgagaca gattcttact ctttcaccca 17041 ggctggagtt cagtggagtg atcacagttc accgcagcct ccacctcctg ggctcaggtg 17101 atcctcctgc atcagccttc ccagtagctg ggactacagg catgcactac catgcccagt 17161 taattttttt tgtatttttt gtagagacag ggttttacta tgttgcccag gctggtctcg 17221 agctcctggt ctcaagagat ccatcctgct tggcctctca aaatgctggg attacaggtg 17281 tgagccacca tgcccggcct gatttttttt aaagctatta ccaaactgtc ctccagaagc 17341 actgtccaca gctcccccgc agggtataat attgccacca ttaggcatcc ccataggaaa 17401 aaaattatat ttacatgcac acgtgcacac atatatttgc taacttgaga gatgagaaat 17461 ggtctttctt attttattgg gtttcttagc ctagggagtg tgactaatac gtgtgtggcg 17521 cttttttttt tttttttttt tttgagacag tcttgctgtg ttgcccaggc tggagtgcag 17581 tggtgcgatc tcagctcact gcaacctcca cctcccaggt tcaagcaatt ctcgtgcttc 17641 agcctcccaa gtagctggga ctacaggcac ctgccaccat gcctggctaa tttttgtatt 17701 tttagtagag accaggtttt gccatgttgg ccaggctggt ctcaaactcc tgacctcaag 17761 tgatccaccc gccttggcct cccaaagtgc tgggattata ggcgcaagcc accatgccca 17821 gctgtgtgtg gcttcttaat tatcaatttg aagcctctgc ccatttagtc acttgggtct 17881 gtgtactttt cttttgattt taattatgta ctttcacact cattgaagtt tttgcttttt 17941 gtttgtttgt ttgagacaga gtctctgttg ccctggctgg agtgcagtgg cacgatctgg 18001 gctcactgca acctccgcct cccgggttca agggtttctc ctacctcacc ctccttagta 18061 gctagcacta caggtgtgca ccaccacacc ctgcaaattt tttttttttt tttttttttt 18121 gagatggagt ctcgctctgc cgcccaggct ggagtgcagt ggcacgatct cggctcactg 18181 caagctccgc ctcccaggtt cgtgccattc tcctgcctca gcctcccaag tagctgggac 18241 tacaggcgtc cgctgccatg cccggctaat tttttgtatt tttagtagag atggggtttc 18301 accatgttag ccaggatagt ctcgatctcc taaccttgtg atccgtctgc ctcagcctcc 18361 caaagttctg ggattacagg tgtgaaccac cgcgcccggc caatttttgt attttttgat 18421 aaagatgggg tttcaccttc ttggccaggc tggtcttgaa ctcctgacct caggtaatcc 18481 acccgcctca gcctcccaaa gtgctgggat tataggcgtg agccatcgca cccagccggt 18541 gtttcgtttg tttgtttgtt tttgagacag aatctccctc tcttgccatg ctggagtgca 18601 gtggcgcaat ctcagctcac tgcaacctcc gcctcccagg ttcaaccgat tctcctgcct 18661 tagcctcccg agtggctgga actacaggca cgtgccacca cgcctggcta atttttgtat 18721 ttttagtaga gacggggttt caccatgttg gccaggatgg tctcgatctc ttgacctcgt 18781 gatctgctca cctcagcctc ccaaagtgct gggattacag gcatgagcca ccatgcctgg 18841 ccttttgttt gtctgttttt tttgagacag agtcttactg tgtcacccag actggagtac 18901 agtggcatga tctcagctca ctgcaacttc tgcctcctgg gttcaagtga ttttcctgcc 18961 tcgtctcccc agtagctggg attacaggca cgtgccacca tgcccagcta atttttgcat 19021 ttttagtaca gctggggttt caccattttg gccacgctgg tcttgaactc ctgacctcaa 19081 gtcatctgcc catcttgtcc tcccaaagtg ctgggtttac aggcatgagc caccgtacct 19141 ggccaatatt taattatatt ttcttctagt tgttctttaa cttgatgtct aaaaatcctg 19201 gtccagatgc caagagctcc agatacccac ctggaagctg ataacagtag ggaagagcat 19261 tgaggggaca cctccagata ggagcaaggg tggccttgca ctctgggact gtcattctca 19321 ggacagtaac tcaacctcca tgatttactt gaaactgcct cttgacgtgc tcaaaagcaa 19381 gtacaacaaa aacaagcaag tgctgccagt cattatgtct gggtggtggg ttgaaggtca 19441 tattaaattc tctctttggg ccgggcactg tggctcatgc ctgtaatccc agcactttgg 19501 gaggccaagg caggaggatc atttgagtct aagagtttga aaccagccag ggcaacgtag 19561 ggagacccca tctctacaaa aaaatcaaag attagggccg ggcatggtgg ctcacacctg 19621 taatcccagc actttgggag gctgaggtgg gcggatcacg aggtcaggag ttcaagatca 19681 gcctggtcaa catggtgaaa ccccatctgt actaaaaata caaaaaatta gccgggcatg 19741 gtgatgggcg cctgtagtcc cagctactca ggaggctgaa ggcaggagaa tagcttgaac 19801 ccaggaggcg gagcttgcag tgatccaagc tcaagccact gcactccagc ctgggcgaca 19861 gagctagacc tcgtctcaaa aaacaaaaaa gtaattaaag attagtttgg tgtggtggca 19921 tgcttctgtg gtcccagctt ctcaggaggc tgaggtggga gggttgcttg agtccaggaa 19981 gtcaaggctg cagtgagctg tgatcatgcc attgtactcc agcctgggca acagagtgag 20041 accctatctc caaaaaaaaa aaaaaaaaaa aaaaaattcc ctgctgccgg gcgcagtggc 20101 tcacacctat aatcccagca ctttgggagg ccaaggcaag tggatcacaa ggtcaggagt 20161 ttgagaccag cctggccaat atggtgaaac cccgtctcta ccaaaaatat ttaaaaatta 20221 gccaggtatg gtggcaggcg cctgtagtcc cagctacttg ggaggctgag acaggagaat 20281 cacttgaacc tgggaggcag aggttgcagt gagcagagat cgtgccactg cactccaccc 20341 ggggcgacag agcaagactc cgtctcagaa aaaaaaaaaa aaaaaagggc cgggcgcagt 20401 ggcccatgcc tgtaatccca gcactttgga aggccgaggt gggcaggtca cgaggtcagg 20461 agatcgagac catcctggct aacacagtga aaccccgtct ctactaaaaa attcaaaaca 20521 aaaattagct gggcatggtg gctggcgcct gtagtcccag ctactcggga ggctgaggca 20581 ggagaatggc atgaacccgg gaggcagagc ttgcaatgag ccaagatcgt gccactgcac 20641 tccagcctgg gcgacagagc aagactctgt ctcaaaaaaa aaaaaaaatt tccctctttg 20701 tactttgttg tgcttttctc acactttcta aattgaatgt gaattgttta ttacaggaaa 20761 aacacacagt aaatgttatt gttaagatcc caaaagaggg taggcacagt ggcttatgcc 20821 tctaatccca gcactttgaa aggccaaggt ggctggccgg gcgcggtagc tcacacctgt 20881 aatcccagca ctttgggagg ccgaagtggg tggatcacga ggtcagaaga tcgagaccat 20941 gccggctaac acagtgaaac cccatctcta ctaaaaatac aaaaaattag ccaggcgtag 21001 tggcgggctc ctgtagtccc agctactcag gaggctgagg caggagaatg gcgtgaaccc 21061 ggaaggcgga gcttgcagtg agctgagatc gcgccactgc actccagcct gggcgacaga 21121 gcgagactcc atctcaaaaa aaaaaaaaaa agaaaagaaa ggccaaggtg ggaggattgg 21181 attggttgag gccaggagtt caagaccagg gagacccttc tctacacaca cacatgcatg 21241 aaagtaaaca tttcccccta ccagggcaag gcccctctcc tgcaatgttg aaaatgttgg 21301 cagtggctca cgcctgtaat cccaacaatt tgggaggcca aggtgggtgg atcacctgag 21361 gtcaggagtt tgagaccagc ctggccaaca cggtgaaacc ttgtctctac taaaaataca 21421 aaaattagcc gggcatggta gcacatgcct gtaatcccag ctacttggga gcctgagaca 21481 ggagaatagc ttgaatctgg gaggcagagg ttgcagtgag ccgagaccgc accactgcac 21541 tccagcctgg gtgacaaaaa aaaaaaaagt tgaggccagg cgcggtggct cacccctgta 21601 atcccaacac tttgggaggc tgaggtgggt ggcttacgag gtcaggagtt caagaccagc 21661 ctggccaaga tggtgaaccc ccgtctctac taaaaataca aaaattagct aggcatggtg 21721 gcaggcgcct gtaatcccag ctacttggga ggctgaggca gagaattgct tgaatctggg 21781 aggcggaggt tgcagtgagc cgagatcacg ccactgtact ccggcctggg tgacagagcg 21841 agattccatc tcaaaaaaaa aaaaaaagtt gaaaatgtga taagaggagc ttgctagctg 21901 ggccatgctc tgtcatgggc cataacatgg agccatggag cagacagtcc tacgtcctgg 21961 ccagtactgg acctgtagct tcctagattc ctgctgccct ggcccctctg agcatcagta 22021 cttcttatat gcagtggtgc agttaggtga gtggccacca agctcttgac tagctgagtc 22081 tctgtctgac caggccaagg gacccccaac cctaggcagt tggggatatt tagacccaag 22141 tcaggggagg ccagaggcct aactactttt cagtccatgg gacaggtacc caaatgcttt 22201 ctggaaccac tacccacccc aatcccagct tccttcctta agagctgaac cggccaggca 22261 gctgaccgga tgcccacacc cacctgagca cagcctgtag ttgacccact cctaactggg 22321 tagcttctcc catccctcct tgatgtcccc agcaggggaa actgaagcag ggcctgaggt 22381 gacaaggggc tccaggcatg gcaggctttt cctccctgca cagggggcag gtccttttac 22441 tggagctgga gcatgaaaat gggtaactaa ctactcaaga cagtgaggtc agtgggacag 22501 agggtgggtc tctccatggt ccacaaggtc acaggactga ggccttgccc tccctcatgg 22561 tcaccctctc ctaccttctt gggtctcctc aggcatggca agcagtgggc agtcgggggc 22621 caatgatggc atccagtagc aggactggac aaatgcagca gtggctcctt tgtgagccca 22681 gggaaggcct ggctgcctcc cagccttggc ctaaaatagg cctgagctca gcccactggg 22741 ctatatttag agggggcagc cctcagccat gggaaggggc agagtgatcc acgtgggcca 22801 gcctgaacta tctacctggt gagggagcca gccaggagcc tgcctccact agtccaggtg 22861 cccagggacc ttcaagggga agcacacctc ccccatacat ccagaatggc cactccaggc 22921 tcagcaaggc cccatgtggc agccaagaca gacaaaggaa gcctgtgcat ctctatttgg 22981 ccacccctct acccctgcag actcctaccc acagcccagt catctctcct cccagcaaac 23041 acagcagcct ccactgcatg acctgctagc acacaatgct attgttgtgt gtgtcttata 23101 gagggtgatg gacaactgaa tcccaatgcc atgagggctc tgatagcttc acaagtgggc 23161 agatacacca acagcaccca tgctggccag tgggtggacc taggttggag gctggttcat 23221 ttgttactaa gcttgcaacc ttaggtaaag ggtctcccct ctctgaactc agtttgctca 23281 tgtgtaaagt cggaataaca gtggctcctc cctggcagtg tacactgaga acaacatctg 23341 gagcatttag cacaatccac ggttttggtc ttcccctgct cccatctcca caagggcaga 23401 caggtcccat aaggttgtgt aaggatgcgc atcactgctt accttgaaag gaggaggtgg 23461 tccagcttcc agctttccct ctgtggttgg atccctgtgc ccttccttcc cagtgggggc 23521 aaagcaagac tgtgggctct acttcctaca cacctcaaac ctgtcactcc gttgtctcac 23581 actggcctcc ctgatgttcc tcaaagtcaa caagcttgtt accacctcag gggcttggtg 23641 gtggctgttc cctcctggaa tgctctgctc ccagatagcc ctgtggccag ccccgtcttg 23701 tcagcgaact caaatgccac cccttcagtg aggccttctg tgtacattct tttccacatc 23761 acccagtttt cttttctttg agacagagtc tcgctctgtt gcccaggctg gagtgcagtg 23821 gggcgatctc ggctcactgc aacctctgcc tccctggttc aagcaattat cttacctcag 23881 cctcctgagt agctgggatt aaaggcgcgt gccatcacac ccacctaagt tttgtatttt 23941 tagtagagac agggtttcac catgttggtc aggctggtct tgaactcctg acttcatgat 24001 ccgactgctt cggcctccca aaatgctggg attacaggcg tgagccactg cgcccggtcc 24061 agttttattt tcttcctagc acttagtcct gaagttatat tatgcacaat gtaccctgcg 24121 tgtcactatt cccaaaggcc ctacagaacc tgttcagtca cgcggaacca cctttttttt 24181 tttttttttt tttttttttg gagtctcgct ctgtcgccca ggctggagtg cagcggcgag 24241 atctaggctc attgcaacct ccacctccca ggttcaagca attctcctgc ctcagcctcc 24301 tgagtagcta ggattacagg cgcccgccac cacgcccgac taattttttg tatttttagt 24361 acagacgggg tttcaccgtg ttagccggga tggtttccat ctcctgacct cgtgatccgt 24421 ccgcctgggc ctccgaaagt gttgggatta caggcgtgag ccaccgcgcc tggccggaac 24481 cacctttaat cctcaccagg caaccttgga caggagaaac tcgggccctg acaccgctag 24541 taaggtgccc aagaccacat agcaaggccg aggactgggg ttttctgctg gggccattcc 24601 agctttggct gataatgcgt ttattgcgtt actttagtac agtcggcctg cctcgcccca 24661 cctccagcag cccagcctgt tcacagcttt ctgctcactg ctcactaccc aaggaggtgg 24721 ggtctgtctc ccagggtggg cactccaggc acctcctcgg ccctggtcgt tgctctggag 24781 ctcggcttcc ccgtcccttc cctccggctg aatcctgcgc ccgacctgtg ccgccctccc 24841 ggggctgcta cattcaaggg ttatcctgtt aataaataag cccgcgcagc tgccctcagc 24901 cagggcctgt cggttggtgt ctccatcggc cggtgacccc caatgtgccc ggccccaacc 24961 cgtccttctt ttcagggtct gtctggctcc tctctcgcct gtcgtctcct gcttggttcc 25021 cggcgtgggg acagtctggt caccagtgcg ttgcgctgcg cggcccccgc tcagcgcgag 25081 gctcaggggc gtgggtcctc cggttacggc cctttctccg cgactccagg ggcagtgact 25141 gcagcgcagg ctgcgggcgg cacatgcgca gggagttcgc gctgccacct ccgcccggct 25201 ccaccagctg cagaaagtcc cggtaccaga gtttggggcc cggcggcgcg gcgggcgcag 25261 cctcctcggc ccgcgccagt cgttcggcct gcgtagcact caacacgtgc agcgacaggc 25321 gacgcagcgg ttgcgtaaag ccctgctcga cggcggcgca caagtacacg cccgagtccc 25381 ggcgccgcag cctgcgcagc agtagtcccc gggcggtgcg ctcggtgcgc tcctctgcca 25441 gcacctgcgg gcagagggca gcgtgaggcg ggggtgtcgg gccgggaaca gggcttccgg 25501 gccctagcac gcgacggaag aagcaaagag tcattgggag ccgaggtgga gcgggaaagg 25561 ggtgcccgca ggcgcacatt ttaaggctga gtgtttggga gctggtggtc ttcaagggag 25621 aatccgaaag aggcggggtt tacatgaact tggtgggggg tggtcaggga ccttaatggg 25681 agggtcgagg gcgggtcttc tccttgattc aggggaggcg ggtcgggagc cccgttggac 25741 gcaatggggc ctcagcagct tgtggagtgc aggtgaggga ccaaactgga cagggcgggg 25801 agtgggtctg gttggacggg gcttcgttgg gtagggcggg gcctctcctg gatgcagggg 25861 gtgggacagg agcctggcgg ggagggcgga gtaaggctca cctgggtgtg ggctgtcacc 25921 cctgcgcgct ggaaagtcca ctccacgcgc gcctgcagcg agcggggctc acactccaga 25981 aaggcgctgc tgccctccac gccgaacacc ttgtgttcca gcagcgcggg acgagacgag 26041 tcttggggcg agaagaaagt cagatttagg caaggcaggg caggtggggc gtcttctggg 26101 gctgagggta ggggcagctg gggcactcac ctccggagca caacgtgctg gggtcgccat 26161 tccttacgtc ttgccgccgg aaccgcctgt ggggaacatc cgatcttctg tgagcccttc 26221 tttcagcccc gggcagtgaa agacccttcg cctccctccc ggcggcccaa ccccgacccc 26281 gcccacctct tggcactggg ctggaagcgc gtgcacgcga ccccgtccca ggcgcagtag 26341 gggtcacgcg ccagacagca ttcggtgcag acgcggccgt gggcagcgca gcggtgcaac 26401 gcgatctggg ccaccgcgct ccgcgaggct acgtacagct ggtgctaggg ggcacgaggg 26461 gctctgggct gaccgagggc gaccccacgc ctgcctccca tcggtcaggg atcccttctt 26521 ccactcgaca gatgggaaca ctgaggtctt acgcctcagg tcacacagtc taacaaagcc 26581 agagcctgta ctcaaacccg ggactgcaaa ctaccaaaag gggccagagg cttgggacgc 26641 gccaggcaca agctcagtcc atcccacccc gacccccatc ctggtcactc accctcttgg 26701 aagaaatttg catgctggtg acagcggccg agtcctgaga gggaggaggg gcgacggggt 26761 cagggcttag tggggtgggg gggtcccggg cgactggggg tgaggcctca cctcaaacac 26821 gtgcagctcc tccaggagca gcccctctgc gctgggccta ctgcccttgg ggaccgagat 26881 caccttcagc accgtgccaa cgtctgcagg gatgagaagg ggtaatgacc attggtgctc 26941 ccggacagct ggggctagtg tctcctcgcc tttgaggcgt ctaccctcag catgtttggg 27001 ttgtgaagag tgtggactgg aagagtgtgg gttgggcagc atccacaggg tggagacaca 27061 gcccaggaaa ggatgtggag atgggactga acagggagag cttccatcag cagcccacag 27121 ggcctggctg ggagcctggg ggtcgctgtg gagggaccct gacctgtgcc aatgaagagg 27181 acgtcatagt gtccgtcagc ggctgcaacc cggtccgcgg caatttgagt gaaggtgtaa 27241 ttggctccaa cttgtaggaa aagagggcgc cccccagtgg gcaggacaga gttgtacatg 27301 agggggtggt tccgcgcaaa ctggatgaca tcgtctggga agtccttggt ggaactgaag 27361 gtgccaaagg tcttgctggg gcactggggg tgggggaaag ggaggcacag cagggatata 27421 gatatggggg cttgataggc agccctccat gcccagcctc tgggaacatg gagggggatg 27481 gggacagaac cctgccctga gaattagaga ggagatcagc cttgccctag gaagtcagag 27541 gcgggaaaca agccttgtct tatgatggaa acgttttccc acccataact aaaggaagaa 27601 accacattca ccaagaagac cccaggccaa gttctcaaac ccttgaggct ttgaagtggg 27661 ttgtagcaga agtccctggg ctacgaacca tgcctggccg cgggtagggg acgcgaccct 27721 ggtatgacac ccactggtgc atgggcccct ccttgtgtgc aaagggtccc aagaaggccc 27781 ggcgcacgtc gttcatgctg tacacgcaca ccgcagagcc ctggaagatg ctgctggagg 27841 cgaagaccag ggcgaggtga ggggccgggt ggagcccagc ggcccgcccc gggctctccc 27901 tacctcctgc ccctcacctg gacgtggaga agacggcata gagcagcggg gtccggtggt 27961 cccgcgagga caacagaaac acatcctctg cggggaaggg gcctagctgg ggtagggcgc 28021 acagccgctg cggcgggggc tcccggagtc tgcgccgctg cccccctccc caaccccata 28081 cccactcccg cactcacgga gctgatcgaa gtgggtgtcg ccctcgacgc cgggcaccga 28141 gcacaccagc cgcgccttca ggaacgtcgt ccacttgttg accaggctgc gctggccgcc 28201 cacgtcgttc tgccgggacg atcaaagggg atgagggcga gaccagggca ggcaaagggg 28261 taggggcagg gtcgccgggt gtggcccagg gactcctcac ccggcagatc tggccaacgc 28321 gggacacgga caggcgtccc agtgccggcg ccgcctctac cgccgtctca cgaaagaaga 28381 agtagatttt gtcgtcgtct gggttctcgc tctccgggat ccaaaatacc ttgacaaact 28441 tgggctctga ccgcggcagg aggcatgggt cagcgggtcc tggccttgcc tcctgatgcg 28501 agacccacgc gctgcttccc cttcccgtag tgcgggcaca gcacccggga tcacagtgtc 28561 tgaccaccca acctctagca ccatgtccag ttggcgctct gcgcggaccg aatccaaagg 28621 atcagggcct gcgaggcaag aagcagccgc gagggggcag cagagaccgt ggctacctgg 28681 gggcgcaggc gtgaagcccg gctagcctcc gatctcgccc cacacgtcgc ccagcacctt 28741 aggctggggt aggagggaga tgaggtaatc tggtttcacc ttcactcatg aatcttcctc 28801 tctagcataa ccagatgctc cccactcctg cttcaaatcc ttctgctggg tcccattgct 28861 ctttattatt attattatta ttattatttg tttatttatt tattttttga gacagtctcg 28921 ctctgtcgcc caggctggag tgctcgatct cggctcactg caacctcagc ctcccaagta 28981 gctgagatta cagtcagagc ctaccaccac acctggcttt tttttttttt tttgagacga 29041 agtctcgctc tattgcccag gctggaatgc agtggcacag tctccattca ctgcaacctc 29101 cgcctcctag gttcaagcga ttctcctgcc tcagcctcct gagtagctgg gattacagat 29161 gcacaccatc atgccggact aatttttgta tatttagtag agacggggtt acaccatgtt 29221 ggccaggctg gtctcaaatg cctgaactca ggtgatccgc ccgcctcagc ctcccaaagt 29281 gctgggatta caggtgcaca ccactgcacc cggccaattt ttgtattttt tagtagagac 29341 agggtttcac catgttggcc aggctagtct caaactcctg gcctcaagcg atttgcctgc 29401 ctccgcctcc caaagtgctg agattacagg tgtgagccac tgtgcccacc agcccatagc 29461 ccttatttaa tttttatttg atttagagac agggtctgct ctgtcctgca gtggcagcat 29521 catggctcac tgcagcctcc aactcctggg ctcaagtgat ccttccatct cagcctcctg 29581 actagccagg actacaggtg tgtgcatgcc actgccccca gctaatttta ttttattttt 29641 tgtcttgctt tgtagcccag gctggtatga aactcctgtc ttcaagcaat ctgtctgcct 29701 cagcctccca aagtgctggg attacaggtg caagccattg tgcccagccc ccatgactct 29761 tagaataaat gactctggcc cctgacagtc caggccccct tttcattgct agctctgcaa 29821 gtctattcct ttgcttgttc ttgcatccag gcctttgcac tagttgctcc ctctgcctgg 29881 aatgctccct ctgcctggaa tgctcatccc taaattttgc tggctagctg tcttgtgctt 29941 gggaattagc tcatactccc ctcagcacag cggcttcctt tacccaaccc cctccagcca 30001 cccagtctcc ttttcccttc ctcccttgtc tctctctggc ctcaaggcca ttggacttga 30061 tcttatcatg tatgtgatct gtgaactcaa agaggtcacg gtcatacctg tcttgtctgt 30121 ctctgtgacg ccagtgcctg attaaggagg gtatttggta agacctggcc acacccactg 30181 ctcacttact ttatgggcct tgcttggcac ctgccctgtg ggagggtgac gacctctacc 30241 caccaacccc accagcctct caccattgag ccagcgggag tcgtgtggct ctgttcggag 30301 acttggacgt tgccctaggc tgcgaaagat ggtaaagtct cgtcccatga ggtctgctgc 30361 cacccctgag tatagctcct cccctgcaga cagagcaagg gccactcagg caaatggagg 30421 gctcttgggg caggagggcc tggggagctc tgggagtgag tgaggttatg ggggattccc 30481 tggagctctt gggggtcttg ggactttgtg aggttctaga gtttctaagg ggttctcttg 30541 gggccttagc ccttggactc acccaccagc acggaggcag cccgatgcct ggggtcataa 30601 ggactcttcc ccttgccatc ctctatcctt cctgggtcca gccggaggac gggctcctgg 30661 gaggtgtggc aagttgtgac taccaggccc ttcttaccct cctgacctcc ctccctgcct 30721 agatccggcc ttacctctgc ccggtggccc acttccacaa aggcacaggt tgggtggaag 30781 gctcccgtgc cacaggccag caaatgggtg cggttgtagg catgcagcaa cttcacgaag 30841 ttcatgcact cagtctagta gggttggagg tgtgaggctt cccccaggga cgtagcctta 30901 agaggtccca gctcaccagt ggcatcctca cccctccctc tcttcggcat cagcccaggc 30961 aagtactaac atgggctaca gggacctggg ggcccttgct gaccagccca ggcctcacga 31021 gggtgggtgt ctggggaagc acaggcctgc aaagcctccc atgactcaga ggcaccccag 31081 aagaattaca cgagccccca ccctgactca ccacccccag acctcagcca cctcgacctc 31141 cctgccctca tcccagccag aacaggcagg cctctgtttt agccctgccc tgtctctggg 31201 gtgaggggct gaccctcccc actcccggcc cgggcagctg gcactcacac caatgtcctt 31261 ccctgcccag ttgcactcct ctcgccattc cacaggggcc ggccaggcca gctatcggga 31321 ggttggggga gaggggacac aggtcaggaa cttgggatat tctctacctc agggagccat 31381 tcctctgggt ctctccaggc tggagtgggg tcccagggct ccttccctgg ggagtgctgg 31441 ggttgggagt ccctggcacc ttcttggccc gcttgctgat gttgtccagg ttgagggagg 31501 ccacatggtt ctcggcaccc acaaacaggc gtccacgctc ctcatccacc agcaaggcct 31561 ggtagcagca ggttcgctcc aggctgaaag tctggagacc atgccaggcc tggagctctg 31621 cagggcagga cactgctgat gactcgccag ccatgctgga agcctccccc aggtcctgcc 31681 tggtggtgac tctatcggcc aaggttgcca ttaggacctg gcctgtacac atgtgagttt 31741 acatgtgtgt gagtttacac acgcaaaccc aagcccaccc aggtgcacgt gtggagctgc 31801 cagggtaggt ggtacatccc cccaggaaga ggaaggcagg ttgaacttgg gcatgctcag 31861 acaggtggtg tctccaggcg ggcgcactgg gaggcaaggc ttatggacac cagagtcctg 31921 ggggagactg gcatgcaggg atggccagaa ccccaactcc caaaagtcag gctgagagtt 31981 cctttcctgc cactccacca ccagcctgag ctcagcagaa ctgcctattc actgccactg 32041 cccaggggcc accctcctat acaggaggca tgagaagggg gctctgactt ccctgttgct 32101 ctgctgaagg agtatcgcct tgttgggggt actgaacgga acagagacaa ggtctgttgc 32161 tgtggggaca ggaggggtct tcctgagaaa gcacgaagag aatgtgggca gaggctgggg 32221 gctgaggccc ggacagggtt aactggaggt ggcctgagag tgcccactaa tcctaccaga 32281 gtctggtgac aggcacgcac atggggcaac agccaacaca gagccccctg gccatacggg 32341 gaccctttcc tgcccacctc aagctgggcc ctcccgcctg ccaggtgcac ctaccttgga 32401 aggagagccg aaggcgtggg ggctgggggc ggcactcccc agccccactg cccagagcag 32461 ggccaggccc gggatcacgg cggcagcccc ggcccgcccc atctcagcag ctcagggtgc 32521 tcagggttca gcgggtgtgt gtgtggagga gccttgagct gccctggact ctgccccagg 32581 atctggaaga ggagtcagca gtgagggcga gggcgggagg ctaggggggg agtccccagg 32641 gatcagatta cagcccctag gggaaagagg ctggggggtc gcattccact tccgaaccct 32701 cccctccgag ggtgccccgc ctggccacaa tcacagacac acccacgcca ctgggcacac 32761 cctcagggtc accctgcact ggcaagcccg cctcaccacg tcacccttcc cagtaatctc 32821 ctctcttgta tacacactca cagacattaa ttcacaatgt gccacccact tcatgccagc 32881 ctagacctgg gaggagggat actgggagac atagacaagg tccccatgtg aaaggggaca 32941 gagtgccaac aaggaatcag atggatgcag agggtgccaa gtgcaatgag caaaacaagc 33001 cagtcaaaag agtgagagtg acaggagggg gtcaggcctc tggaggtggc agtgtcccca 33061 cagtcccgca gaaaaacaca gagacacttt tgaggctgga gccaagggtg tgggtgcctg 33121 tgaatgtgtt tccgagtgtc cccttgtgat gtgcgcatgt gtgtagtgtg tatgtgcatg 33181 tgtgtgcaca tctatgtgca aatatatagt gtgtcatatt gcacatgtgt ggatatgggt 33241 tggtgtgcac acacatgtgg cactgtgcaa tccatgcagt acatatgtgt gatatgtgca 33301 tattttgcac acagatgtgt tgcacatttg tgcatgtgca aacgtgtgca cccataagaa 33361 tagacatgtc tctgggcatg caagtgagca aatccaactg cagcatcaat cactccattc 33421 ctcagaggga gaagtgctac ccaggagacc caagacctgg gcagaagcct cagcctctga 33481 cttggggact gctcccaacc cagctgaagc agcctctgca tttgagcttg agggctctct 33541 ctccacaccc ctgaggaatc aaagaggggc aaggtgaggc agcccttcat ttgcacccca 33601 gcctcctggc ctcgctgagc tggccttccc tctcaacgcc tccacagcct ctctgagcct 33661 ctcccttcac ttctctcacc ccttgagttc ttccccatct ttgagcccct tcactctagg 33721 agccagacac ctccccctgg ccctcagctg gggcaaagac ctggctgcta tgcccaccct 33781 ccttcccagg ggtcagctgg gcaggtgctg gacacaggag cccagggagt gggacaaggt 33841 tccaggtggg attaagcaag tggagcagct gctgaggcag gttccaggcc cctccaccca 33901 cactcctgcc cagctcctgg ccccatccca ggccaggacc cccacctccc tccttccctc 33961 cgctgacact tgccgctgcc tcttctcagg ttctagaggt cttccaggcc aacactggct 34021 gccttctcac actccaccga ggacttccgc agcagcatct ctccactctg gccccttccc 34081 accctggggc cgctggggac cctctgacgg ctcctttaag aagcagcccc cgcccccacc 34141 aggccagcca ccgccccacg caccgcccgc cgtgccgtaa agtttagagg gcggatcggg 34201 tgaccgggca ggcagccggg accagctgga gacggcagcc aggcgggagt ggaatgggca 34261 caagggaggg gctggtaggg acgacccctc ccatattggg ccttaaagca gaagggtgcc 34321 ccaggtggcg gtcccagaag ccaggtggcc agccaaggca agggaggaag aaaccccagc 34381 tccaggggct cagcaggcaa agggaatcac tgagtggggg caccacccgt ggactccaat 34441 atctcaacct ctccctccac aggtggggag ctgtggggaa agataatggg gagctcagct 34501 gccacctcag ttcccaggga ccggctgggg tggccgggca gctggaggtc aggggagggg 34561 ctctcaactg gaggtcggat gggccctcag gacccagtcc ccatccttcc tcaacacttg 34621 ggccacttag tacttcagtg gcctggccag cagtggcttc taccatgaca gccagatagg 34681 ggagggagag gagggcagga gaggggcaga cgcaggagga gcagcaaatc tcacttctgc 34741 accacagggc ggggcccatc tggagtccgc catcctggac aggtagcttt gcatctctgc 34801 tgagaggttg gggggcagac catgtgacct cccttcctct ggcttctctt tctaggttgg 34861 aggtgggagg aacaaccccc accaaaccca gagccgaaaa ctgagggagt tttacagaca 34921 ggacggagct cctgcacctc ggagcctcag ttgggaatga cctggggtct tgtcctgaag 34981 ctgagtctgg tgaacgtgcc ccatttgtaa catgaggggt acttctctgg agggactgta 35041 tgttgacagt ggcagagtgg agccctgaag tccacctgag tgaatatacc agggcttgag 35101 aatgggcttt gatccttcca tccccgcaaa aggcattttc ctaccacctc ccaaggctga 35161 tggggcagtg tgggcatttc aagtgttgca gttctgttgc ccagccctgg atgcctccag 35221 ccaagcaagg acagaggtgg ggggcacatc tccaagtccc ctggaagtgg aagtggggct 35281 atgcccctcc ttcctgagcc tgaggacctg gtatacctgg cctggcctgg ccagctggca 35341 ggtaaataac aggggccagt gggagccgca gggcccttca ggagggcaag tggaaggaca 35401 gatccctgcc tgggctcgtg ggcttcctgg atgtggcctt ttcctactcc aggagtagct 35461 gcctctagcc ttagaggacc tgggcaggct ctcctggctc catgcaagca gacaacattc 35521 tttggctgtt tacagctttc cacccaagca gccaggatca gcaagtgcct cagaggctcc 35581 catccactcc cactgtgccc ccataggccc tcactgcctt atgtttttcc aggcccaggg 35641 gccctgtctc tgtttcatca ccccgcacct tcctccacct gtgttcactc tgggaaacgg 35701 tatcagaaac ccccccgcca ccgccattat ctttccaata gcctgggagt caacccccag 35761 ccaaaaggac tagactgtct ctgtccccat tgagtgctgt gccctgtcct ctgtcctcta 35821 gatgtgtacc ctcccttgag taggaatgga ggtctccagg gatgggaggt caagttgtcc 35881 ctgattccac atggtcccct ttctgttcct cagccccagg tccagtgtgc caggccatgg 35941 gtaggggccc ccatggaggt cagcactcca ggagcaaggt cactgcctgt gggtcacact 36001 gggggctgga actcctcgag gattctggaa tcaggcagcc tgagcctgag tctcaacaga 36061 atggggcaaa ccaggcaagg cagctggggt cccttttcct gcctgtactc ctaccctgga 36121 cctctctctt ttgagggagc ttccatgggc agagacctgc ctgggtcctt gctctggggc 36181 tgcctgtcag tggcatggcc taccacggcc ttggcttttc cttctggaaa aacctggatt 36241 gttgtgccaa atctcagcac ctgcctcccc acccctggcc accagctggg cctgcccctt 36301 tgcccctgcc tggactccgg gtggtgtggt tggggtggga cacccatctg aggaaggttg 36361 tggcaaccct cccagtgcag cctggactgg atgggatctt gggcgcccca ctcacacctg 36421 ctttagtcat cagggcttgt ggctccaacg tcacaactct tccttgttct ggccacgtag 36481 gtaccaggtc atgctgccca gaggacttag gcacagtggg ggcaggcgtg ggcggccata 36541 ggatc // LOCUS HUMLYB2 1531 bp mRNA PRI 29-JAN-1991 DEFINITION Human B cell differentiation antigen mRNA, complete cds. ACCESSION M54992 M38040 NID g187262 KEYWORDS B cell differentiation antigen; Lyb-2 protein. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1531) AUTHORS Von Hoegen,I., Nakayama,E. and Parnes,J.R. TITLE Identification of a human protein homologous to the mouse Lyb-2 B cell differentiation antigen and sequence of the corresponding cDNA JOURNAL J. Immunol. 144, 4870-4877 (1990) MEDLINE 90278102 FEATURES Location/Qualifiers source 1..1531 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 109..1188 /note="Lyb-2 protein" /codon_start=1 /product="B cell differentiation antigen" /db_xref="PID:g187263" /translation="MAEAITYADLRFVKAPLKKSISSRLGQDPGADDDGEITYENVQV PAVLGVPSSLASSVLGDKAAVKSEQPTASWRAVTSPAVGRILPCRTTCLRYLLLGLLL TCLLLGVTAICLGVRYLQVSQQLQQTNRVLEVTNSSLRQQLRLKITQLGQSAEDLQGS RRELAQSQEALQVEQRAHQAAEGQLQACQADRQKTKETLQSEEQQRRALEQKLSNMEN RLKPFFTCGSADTCCPSGWIMHQKSCFYISLTSKNWQESQKQCETLSSKLATFSEIYP QSHSYYFLNSLLPNGGSGNSYWTGLSSNKDWKLTDDTQRTRTYAQSSKCNKVHKTWSW WTLESESCRSSLPYICEMTAFRFPD" BASE COUNT 394 a 395 c 433 g 309 t ORIGIN 1 agtcacagag ggaacacaga gcctagttgt aaacggacag agacgagagg ggcaagggag 61 gacagtggat gacagggaag acgagtgggg gcagagctgc tcaggaccat ggctgaggcc 121 atcacctatg cagatctgag gtttgtgaag gctcccctga agaagagcat ctccagccgg 181 ttaggacagg acccaggggc tgatgatgat ggggaaatca cctacgagaa tgttcaagtg 241 cccgcagtcc taggggtgcc ctcaagcttg gcttcttctg tactagggga caaagcagcg 301 gtcaagtcgg agcagccaac tgcgtcctgg agagccgtga cgtcaccagc tgtcgggcgg 361 attctcccct gccgcacaac ctgcctgcga tacctcctgc tcggcctgct cctcacctgc 421 ctgctgttag gagtgaccgc catctgcctg ggagtgcgct atctgcaggt gtctcagcag 481 ctccagcaga cgaacagggt tctggaagtc actaacagca gcctgaggca gcagctccgc 541 ctcaagataa cgcagctggg acagagtgca gaggatctgc aggggtccag gagagagctg 601 gcgcagagtc aggaagcact acaggtggaa cagagggctc atcaggcggc cgaagggcag 661 ctacaggcct gccaggcaga cagacagaag acgaaggaga ccttgcaaag tgaggagcaa 721 cagaggaggg ccttggagca gaagctgagc aacatggaga acagactgaa gcccttcttc 781 acatgcggct cagcagacac ctgctgtccg tcgggatgga taatgcatca gaaaagctgc 841 ttttacatct cacttacttc aaaaaattgg caggagagcc aaaaacaatg tgaaactctg 901 tcttccaagc tggccacatt cagtgaaatt tatccacaat cacactctta ctacttctta 961 aattcactgt tgccaaatgg tggttcaggg aattcatatt ggactggcct cagctctaac 1021 aaggattgga agttgactga tgatacacaa cgcactagga cttatgctca aagctcaaaa 1081 tgtaacaagg tacataaaac ttggtcatgg tggacactgg agtcagagtc atgtagaagt 1141 tctcttccct acatctgtga gatgacagct ttcaggtttc cagattagga cagtcctttg 1201 cactgagttg acactcatgc caacaagaac ctgtgcccct ccttcctaac ctgaggcctg 1261 gggttcctca gaccatctcc ttcattctgg gcagtgccag ccaccggctg acccacacct 1321 gacacttcca gccagtctgc tgcctgctcc ctcttcctga aactggactg ttcctgggaa 1381 aagggtgaag ccacctctag aagggacttt ggcctccccc caagaacttc ccatggtaga 1441 atggggtggg ggaggagggc gcacgggctg agcggatagg ggcggcccgg agccagccag 1501 gcagttttat tgaaatcttt ttaaataatt g // LOCUS HUMLYN 2298 bp mRNA PRI 07-JAN-1995 DEFINITION Human lyn mRNA encoding a tyrosine kinase. ACCESSION M16038 NID g187268 KEYWORDS protein kinase; tyrosine kinase. SOURCE Human cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2298) AUTHORS Yamanashi,Y., Fukushige,S., Semba,K., Sukegawa,J., Miyajima,N., Matsubara,K., Yamamoto,T. and Toyoshima,K. TITLE The yes-related cellular gene lyn encodes a possible tyrosine kinase similar to p56lck JOURNAL Mol. Cell. Biol. 7 (1), 237-243 (1987) MEDLINE 87172710 FEATURES Location/Qualifiers source 1..2298 /organism="Homo sapiens" /db_xref="taxon:9606" /map="8q13-qter" gene 298..1836 /gene="LYN" CDS 298..1836 /gene="LYN" /note="lyn tyrosine kinase" /codon_start=1 /db_xref="GDB:G00-120-159" /db_xref="PID:g307144" /translation="MGCIKSKGKDSLSDDGVDLKTQPVRNTERTIYVRDPTSNKQQRP VPESQLLPGQRFQTKDPEEQGDIVVALYPYDGIHPDDLSFKKGEKMKVLEEHGEWWKA KSLLTKKEGFIPSNYVAKLNTLETEEWFFKDITRKDAERQLLAPGNSAGAFLIRESET LKGSFSLSVRDFDPVHGDVIKHYKIRSLDNGGYYISPRITFPCISDMIKHYQKQADGL CRRLEKACISPKPQKPWDKDAWEIPRESIKLVKRLGAGQFGEVWMGYYNNSTKVAVKT LKPGTMSVQAFLEEANLMKTLQHDKLVRLYAVVTREEPIYIITEYMAKGSLLDFLKSD EGGKVLLPKLIDFSAQIAEGMAYIERKNYIHRDLRAANVLVSESLMCKIADFGLARVI EDNEYTAREGAKFPIKWTAPEAINFGCFTIKSDVWSFGILLYEIVTYGKIPYPGRTNA DVMTALSQGYRMPRVENCPDELYDIMKMCWKEKAEERPTFDYLQSVLDDFYTATEGQY QQQP" BASE COUNT 645 a 564 c 576 g 513 t ORIGIN 1 tcggccgagc ccagagacag ccagttcctc tcccgccgcg ccgggccgcg tgccgctcgc 61 tccccggccg tggcgcctcc gggccagacg cgctgcagcc tccagcccgc ggcaagcggg 121 cggggcggcc gcgccacccc cggccccgcg ccagcagccc ctcgccgcgc gtccagcgtt 181 cccggccagc agcctcccca tacgcagtcc tgctggaccg ccccgtcgcg ccccccactc 241 tgaactcaag tcaccgtgga gctccgccgc cccgaaactt tcacgcgagc gggaaatatg 301 ggatgtataa aatcaaaagg gaaagacagc ttgagtgacg atggagtaga tttgaagact 361 caaccagtac gtaatactga aagaactatt tatgtgagag atccaacgtc caataaacag 421 caaaggccag ttccagaatc tcagctttta cctggacaga ggtttcaaac taaagatcca 481 gaggaacaag gagacattgt ggtagccttg tacccctatg atggcatcca cccggacgac 541 ttgtctttca agaaaggaga gaagatgaaa gtcctggagg agcatggaga atggtggaaa 601 gcaaagtccc ttttaacaaa aaaagaaggc ttcatcccca gcaactatgt ggccaaactc 661 aacaccttag aaacagaaga gtggtttttc aaggatataa ccaggaagga cgcagaaagg 721 cagcttttgg caccaggaaa tagcgctgga gctttcctta ttagagaaag tgaaacatta 781 aaaggaagct tctctctgtc tgtcagagac tttgaccctg tgcatggtga tgttattaag 841 cactacaaaa ttagaagtct ggataatggg ggctattaca tctctccacg aatcactttt 901 ccctgtatca gcgacatgat taaacattac caaaagcagg cagatggctt gtgcagaaga 961 ttggagaagg cttgtattag tcccaagcca cagaagccat gggataaaga tgcctgggag 1021 atcccccggg agtccatcaa gttggtgaaa aggcttggcg ctgggcagtt tggggaagtc 1081 tggatgggtt actataacaa cagtaccaag gtggctgtga aaaccctgaa gccaggaact 1141 atgtctgtgc aagccttcct ggaagaagcc aacctcatga agaccctgca gcatgacaag 1201 ctcgtgaggc tctacgctgt ggtcaccagg gaggagccca tttacatcat caccgagtac 1261 atggccaagg gcagtttgct ggatttcctg aagagcgatg aaggtggcaa agtgctgctt 1321 ccaaagctca ttgacttttc tgctcagatt gcagagggaa tggcatacat cgagcggaag 1381 aactacattc accgggacct gcgagcagct aatgttctgg tctccgagtc actaatgtgc 1441 aaaattgcag attttggcct tgctagagta attgaagata atgagtacac agcaagggaa 1501 ggtgctaagt tccctattaa gtggacggct ccagaagcaa tcaactttgg atgtttcact 1561 attaagtctg atgtgtggtc ctttggaatc ctcctatacg aaattgtcac ctatgggaaa 1621 attccctacc cagggagaac taatgccgac gtgatgaccg ccctgtccca gggctacagg 1681 atgccccgtg tggagaactg cccagatgag ctctatgaca ttatgaaaat gtgctggaaa 1741 gaaaaggcag aagagagacc aacgtttgac tacttacaga gcgtcctgga tgatttctac 1801 acagccacgg aagggcaata ccagcagcag ccttagagca cagggagacc cgtccatttg 1861 gcaggggtgg ctgcctcatt tagagaggaa aagtaaccat cactggttgc acttatgatt 1921 tcatgtgcgg ggatcatctg ccgtgcctgg atcctgaaat agaggctaaa ttactcagga 1981 agaacaccct ctaaatggga aagtattctg tactcttaga tggattctcc actcagttgc 2041 aacttggact tgtcctcagc agctggtaat cttgctctgc ttgacaacat ctgagtgcag 2101 ccgtttgaga agaaaacatc tattctctcc aaaaatgcac ccaactagct ctatgtttac 2161 aaatggacat aggactcaaa gtttcagaga ccattgcaat gaatccccaa taattgcaga 2221 actaaactca tttataaagc taaaataacc ggatatatac atagcatgac atttctttgt 2281 gctttggctt acttgttt // LOCUS HUMLYSOPX 598 bp mRNA PRI 20-MAY-1993 DEFINITION Human eosinophil Charcot-Leyden crystal (CLC) protein (lysophospholipase) mRNA, complete cds. ACCESSION L01664 NID g187273 KEYWORDS S-type lectin; basophil; eosinophil; lysophospholipase. SOURCE Homo sapiens adult leukemia cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Gomolin,H.I, Yamaguchi,Y., Paulpillai,A.V, Ackerman,S.J. and Tenen,D.G. TITLE Human Eosinophil Charcot-Leyden crystal protein: Characterization of a lysophospholipase gene promoter JOURNAL Unpublished (1992) REFERENCE 2 (bases 1 to 598) AUTHORS Mastrianni,D.M., Eddy,R.L, Rosenberg,H.F., Corrette,S.E., Shows,T.B, Tenen,D.G. and Ackerman,S.J. TITLE Localization of the human eosinophil Charcot-Leyden crystal protein (lysophospholipase) gene to chromosome 19 and the human ribonucleasse 2 (EDN) and ribonuclease 3 (ECP) genes to chromosome 14 JOURNAL Genomics 13, 240-242 (1992) MEDLINE 92250060 REFERENCE 3 (bases 1 to 598) AUTHORS Ackerman,S.J., Corrette,S.E., Rosenberg,H.F., Bennett,J.C., Mastrianni,D.M., Nicholson-Weller,A., Weller,P.F., Chin,D.T. and Tenen,D.G. TITLE Molecular cloning and characterization of human eosinophil Charcot-Leyden crystal protein (lysophospholipase): Similarities to IgE binding proteins and the S-type animal lectin superfamily JOURNAL J. Immunol. 150, 456-468 (1993) MEDLINE 93123746 FEATURES Location/Qualifiers source 1..598 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HL-60 3c5I" /cell_type="leukocyte (eosinophil, basophil)" /dev_stage="adult" /tissue_type="leukemia" /map="q13.1" 5'UTR 1..33 CDS 34..462 /codon_start=1 /product="lysophospholipase" /db_xref="PID:g187274" /translation="MSLLPVPYTEAASLSTGSTVTIKGRPLVCFLNEPYLQVDFHTEM KEESDIVFHFQVCFGRRVVMNSREYGAWKQQVESKNMPFQDGQEFELSISVLPDKYQV MVNGQSSYTFDHRIKPEAVKMVQVWRDISLTKFNVSYLKR" 3'UTR 460..586 polyA_signal 566..571 BASE COUNT 175 a 140 c 129 g 154 t ORIGIN 1 caattcagaa gagccaccca gaaggagaca acaatgtccc tgctacccgt gccatacaca 61 gaggctgcct ctttgtctac tggttctact gtgacaatca aagggcgacc acttgtctgt 121 ttcttgaatg aaccatatct gcaggtggat ttccacactg agatgaagga ggaatcagac 181 attgtcttcc atttccaagt gtgctttggt cgtcgtgtgg tcatgaacag ccgtgagtat 241 ggggcctgga agcagcaggt ggaatccaag aacatgccct ttcaggatgg ccaagaattt 301 gaactgagca tctcagtgct gccagataag taccaggtaa tggtcaatgg ccaatcctct 361 tacacctttg accatagaat caagcctgag gctgtgaaga tggtgcaagt gtggagagat 421 atctccctga ccaaatttaa tgtcagctat ttaaagagat aaccagactt catgttgcca 481 aggaatccct gtctctacgt gaacttggga ttccaaagcc agctaacagc atgatctttt 541 ctcacttcaa tccttactcc tgctcattaa aacttaatca aacttcaaaa aaaaaaaa // LOCUS HUMLYSOXLK 2328 bp mRNA PRI 07-OCT-1994 DEFINITION Human lysyl oxidase-like protein mRNA, complete cds. ACCESSION L21186 NID g307145 KEYWORDS lysyl oxidase-like protein. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2328) AUTHORS Kenyon,K., Modi,W.S., Contente,S. and Friedman,R.M. TITLE A novel human cDNA with a predicted protein similar to lysyl oxidase maps to chromosome 15q24-q25 JOURNAL J. Biol. Chem. 268 (25), 18435-18437 (1993) MEDLINE 93366738 FEATURES Location/Qualifiers source 1..2328 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="fibroblast" /tissue_lib="G1016 of E.H. Chang" 5'UTR 1..305 CDS 306..2030 /standard_name="lox-like protein" /note="putative" /codon_start=1 /product="lysyl oxidase-like protein" /db_xref="PID:g307146" /translation="MALARGSRQLGALVWGACLCVLVHGQQAQPGQGSDPARWRQLIQ WENNGQVYSLLNSGSEYVPAGPQRSESSSRVLLAGAPQAQQRRSHGSPRRRQAPSLPL PGRVGSDTVRGQARHPFGFGQVPDNWREVAVGDSTGMALARTSVSQQRHGGSASSVSA SAFASTYRQQPSYPQQFPYPQAPFVSQYENYDPASRTYDQGFVYYRPAGGGVGAGAAA VASAGVIYPYQPRARYEEYGGGEELPEYPPQGFYPAPERPYVPPPPPPPDGLDRRYSH SLYSEGTPGFEQAYPDPGPEAAQAHGGDPRLGWYPPYANPPPEAYGPPRALEPPYLPV RSSDTPPPGGERNGAQQGRLSVGSVYRPNQNGRGLPDLVPDPNYVQASTYVQRAHLYS LRCAAEEKCLASTAYAPEATDYDVRVLLRFPQRVKNQGTADFLPNRPRHTWEWHSCHQ HYHSMDEFSHYDLLDAATGKKVAEGHKASFCLEDSTCDFGNLKRYACTSHTQGLSPGC YDTYNADIDCQWIDITDVQPGNYILKVHVNPKYIVLESDFTNNVVRCNIHYTGRYVSA TNCKIVQS" 3'UTR 2031..2328 polyA_site 2328 BASE COUNT 407 a 835 c 726 g 360 t ORIGIN 1 gccagccgag cggccagcca gtgcggggct ggccatgtaa ggcccacagg cggtcctgcc 61 cgcccggtgc cctgcggaga gcctcgtgca gccctgggca ccgcccctgc cctgccctga 121 ccccttggcc ttgaaatgct gtcatcggag gagccgtccc gctcgggaca aggccagcat 181 ggacaaagct agagctgggg caagcaagga gccttcctgt cctcgaggcc gtgggaagag 241 aagcacgccc agggggccac tcctgagagc ctctctgtcc accaggcctc tgcagagggg 301 tcaccatggc tctggcccga ggcagccggc agctgggggc cctggtgtgg ggcgcctgcc 361 tgtgcgtgct ggtgcacggg cagcaggcgc agcccgggca gggctcggac cccgcccgct 421 ggcggcagct gatccagtgg gagaacaacg ggcaggtgta cagcttgctc aactcgggct 481 cagagtacgt gccggccgga cctcagcgct ccgagagtag ctcccgggtg ctgctggccg 541 gcgcgcccca ggcccagcag cggcgcagcc acgggagccc ccggcgtcgg caggcgccgt 601 ccctgcccct gccggggcgc gtgggctcgg acaccgtgcg cggccaggcg cggcacccat 661 tcggctttgg ccaggtgccc gacaactggc gcgaggtggc cgtcggggac agcacgggca 721 tggccctggc ccgcacctcc gtctcccagc aacggcacgg gggctccgcc tcctcggtct 781 cggcttcggc cttcgccagc acctaccgcc agcagccctc ctacccgcag cagttcccct 841 acccgcaggc gcccttcgtc agccagtacg agaactacga ccccgcgtcg cggacctacg 901 accagggttt cgtgtactac cggcccgcgg gcggcggcgt gggcgcgggg gcggcggccg 961 tggcctcggc gggggtcatc tacccctacc agccccgggc gcgctacgag gagtacggcg 1021 gcggcgaaga gctgcccgag tacccgcctc agggcttcta cccggccccc gagaggccct 1081 acgtgccgcc gccgccgccg ccccccgacg gcctggaccg ccgctactcg cacagtctgt 1141 acagcgaggg cacccccggc ttcgagcagg cctaccctga ccccggtccc gaggcggcgc 1201 aggcccatgg cggagaccca cgcctgggct ggtacccgcc ctacgccaac ccgccgcccg 1261 aggcgtacgg gccgccgcgc gcgctggagc cgccctacct gccggtgcgc agctccgaca 1321 cgcccccgcc gggtggggag cggaacggcg cgcagcaggg ccgcctcagc gtaggcagcg 1381 tgtaccggcc caaccagaac ggccgcggtc tccctgactt ggtcccagac cccaactatg 1441 tgcaagcatc cacttatgtg cagagagccc acctgtactc cctgcgctgt gctgcggagg 1501 agaagtgtct ggccagcaca gcctatgccc ctgaggccac cgactacgat gtgcgggtgc 1561 tactgcgctt cccccagcgc gtgaagaacc agggcacagc agacttcctc cccaaccggc 1621 cacggcacac ctgggagtgg cacagctgcc accagcatta ccacagcatg gacgagttca 1681 gccactacga cctactggat gcagccacag gcaagaaggt ggccgagggc cacaaggcca 1741 gtttctgcct ggaggacagc acctgtgact tcggcaacct caagcgctat gcatgcacct 1801 ctcataccca gggcctgagc ccaggctgct atgacaccta caatgcggac atcgactgcc 1861 agtggatcga cataaccgac gtgcagcctg ggaactacat cctcaaggtg cacgtgaacc 1921 caaagtatat tgttttggag tctgacttca ccaacaacgt ggtgagatgc aacattcact 1981 acacaggtcg ctacgtttct gcaacaaact gcaaaattgt ccaatcctga tctccgggag 2041 ggacagatgg ccaatctctc cccttccaaa gcaggccctg ctccccgggc agcctcccgc 2101 cgaggggccc agcccccaac ccacaggcag ggaggggcat ccctccctgc cggcctcagg 2161 gagcgaacgt ggatgaaaac cacagggatt ccggatgcca gaccccattt tatacttcac 2221 ttttctctac agtgttgttt tgttgttgtt ggtttttatt ttttatactt tggccatacc 2281 acagagctag attgcccagg tctgggctga ataaaacaag gtttttct // LOCUS HUMLZTR1 4227 bp mRNA PRI 09-AUG-1996 DEFINITION Human mRNA for LZTR-1, complete cds. ACCESSION D38496 NID g809500 KEYWORDS LZTR-1; leucine zipper; ttk protein homologue; DiGeorge syndrome-related. SOURCE Homo sapiens fetus brain cDNA to mRNA, clone:C17. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4227) AUTHORS Kurahashi,H., Akagi,K., Inazawa,J., Ohta,T., Niikawa,N., Kayatani,F., Sano,T., Okada,S. and Nishisho,I. TITLE Isolation and characterization of a novel gene deleted in DiGeorge syndrome JOURNAL Hum. Mol. Genet. 4 (4), 541-549 (1995) MEDLINE 95359956 REFERENCE 2 (bases 1 to 4227) AUTHORS Kurahashi,H. TITLE Direct Submission JOURNAL Submitted (11-OCT-1994) to the DDBJ/EMBL/GenBank databases. Hiroki Kurahashi, Biomedical Research Center, Osaka University Medical School, Department of Medical Genetics; 2-2 Yamadaoka, Suita, Osaka 565, Japan (Tel:06-879-3381, Fax:06-879-3389) FEATURES Location/Qualifiers source 1..4227 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="22" /clone="C17" /dev_stage="fetus" /map="22q11" /tissue_type="brain" 5'UTR 1..862 CDS 863..2521 /codon_start=1 /product="LZTR-1" /db_xref="PID:d1008089" /db_xref="PID:g809501" /translation="MVAFDRHLYVFGGAADNTLPNELHCYDVDFQTWEVVQPSSDSEV GGAEVPERACASEEVPTLTYEERVGFKKSRDVFGLDFGTTSAKQPTQPASELPSGRLF HAAAVISDAMYIFGGTVDNNIRSGEMYRFQFSCYPKCTLHEDYGRLWESRQFCDVEFV LGEKEECVQGHVAIVTARSRWLRRKITQARERLAQKLEQEAAPVPREAPSVAAGGARP PLLHVAIREAEARPFEVLMQFLYTDKIKYPRKGHVEDVLLIMDVYKLALSFQLCRLEQ LCRQYIEASVDLQNVLVVCESAARLQLSQLKEHCLNFVVKESHFNQVIMMKEFERLSS PLIVEIVRRKQQPPPRTPLDQPVDIGTSLIQDMKAYLEGAGAEFCDITLLLDGHPRPA HKAILAARSSYFEAMFRSFMPEDGQVNISIGEMVPSRQAFESMLRYIYYGEVNMPPED SLYLFAAPYYYGFYNNRLQAYCKQNLEMNVTVQNVLQILEAADKTQALDMKRHCLHII VHQFTKVSKLPTLRSLSQQLLLDIIDSLASHISDKQCAELGADI" variation 1960 /note="'c' in variation" /replace="" 3'UTR 2522..4227 polyA_signal 4160..4165 polyA_site 4227 BASE COUNT 854 a 1259 c 1282 g 832 t ORIGIN Chromosome 22q11. 1 ttcctggacc gggcagcacg ggggggcaga tcggggctgc ggcctggcag gcggcgcgcg 61 gtccaaggta gccccgagcg tggacttcga ccatagctgc tcggacagtg tcgagtacct 121 gacgctcaac ttcgggccct tcgaaacagt gcatcgctgg cggcgcctcc cgccctgcga 181 cgagttcgtg ggtgcccggc gcagaagcac acagtggtgg cctataaaga tgccatttat 241 gtatttggtg gagacaatgg gaagaccatg ctcaatgacc tcctgcggtt cgatgtgaaa 301 gactgctcct ggtgcagggc ctttaccact gggaccccac cggccccccg ttaccaccac 361 tcggccgtcg tctatgggag cagcatgttt gtctttgggg gttacactgg ggacatttat 421 tccaattcta acttgaagaa taaaaacgac ctctttgaat acaagtttgc aactggccag 481 tggacggagt ggaaaattga aggacggttg ccagtcgcta ggtcagccca tggggccacg 541 gtgtacagtg acaagctgtg gatctttgct ggctatgacg gcaacgccag gttgaatgac 601 atgtggacaa ttggcctcca ggaccgagag ctcacctgct gggaggaggt ggcccagagt 661 gggcgagatc cccccatctt gctgcaactt ccccgtggct gtgtgccggg acaagatgtt 721 tgtattctct gggcaaagcg gagccaaaat aaccaacaac ctcttccagt ttgaattcaa 781 ggacaagacg tggacacgca tcccaactga acacctgctc cggggctccc caccaccccc 841 gcagcggcgc tacgggcata ccatggtggc ctttgaccgc cacctctatg tgtttggggg 901 tgcggccgac aacacgctgc ccaacgagct gcactgctat gacgtggact tccagacctg 961 ggaggtcgtc cagcccagct ccgacagcga ggttggtggg gctgaagtgc ccgagcgagc 1021 ctgtgcttcc gaggaggtgc ccaccctgac ctatgaggag cgggttggct tcaagaagtc 1081 ccgagatgtg tttggcctgg actttggcac cacctcagcc aagcagccca cccagcctgc 1141 ctcggagctg cccagtggga ggctcttcca cgcggctgct gtcatctcgg acgccatgta 1201 catcttcggg ggcacggtgg acaacaacat ccgcagcggg gagatgtaca ggttccagtt 1261 ctcctgttac cctaaatgca cgctgcacga ggactacggg cggctgtggg agagccgcca 1321 gttctgcgac gtggagttcg tgctgggtga gaaggaggag tgcgtgcagg gccacgtagc 1381 cattgtcaca gcgcggagcc gctggcttcg caggaagatc acgcaggcgc gggagaggct 1441 ggcccagaag ctggagcagg aggccgcccc agttcccagg gaggccccca gcgtggctgc 1501 tggtggggcc cggccgcccc tgctgcacgt ggccatccgg gaggccgagg cccggccctt 1561 cgaggtgctc atgcagttcc tctacaccga caagatcaaa tacccacgga aaggccatgt 1621 ggaggatgtg ctgctcatca tggatgtgta caaactggca ctgagcttcc agttgtgccg 1681 cctggagcag ctgtgccgcc agtacatcga ggcctccgtg gacctgcaga acgtgctggt 1741 tgtgtgcgag agtgccgccc ggctgcagct gagccaactc aaggagcact gcctgaactt 1801 cgtggtaaag gagtcccact tcaaccaggt gatcatgatg aaggagttcg agcgcctctc 1861 ctctccactg atagtggaga ttgtgcggcg gaagcagcag ccgccccctc gcactccctt 1921 ggaccagcca gtggacattg gcacatctct gatccaggat atgaaggcat acctggaggg 1981 agcgggcgcg gaattctgtg acatcactct gttgcttgac gggcacccac ggccagccca 2041 caaggctatc ctggccgccc gctccagcta ctttgaagcc atgttccggt ccttcatgcc 2101 cgaagatggg caggtgaaca tctccatcgg ggagatggtg cccagcaggc aggccttcga 2161 gtccatgctg cgctacatct actacggcga ggtcaacatg ccgcccgagg actcgctcta 2221 cttgtttgcg gccccctact actacggctt ctacaacaac cggctgcagg cgtactgcaa 2281 gcagaacctg gagatgaacg tgacggtgca gaacgtgctg cagatcctgg aggcagctga 2341 caaaacgcag gcactggaca tgaagcggca ctgcctgcac atcattgtgc accagttcac 2401 caaggtctcc aagttgccca ccctgcggtc gctgagccag cagctgctgc tggacatcat 2461 agactccctg gcctcccaca tctcagacaa gcagtgcgca gagctgggcg ccgacatctg 2521 aggccctgtg gcgcctgccc attgtgaaga atcgccgtgc ctgcctgccc tgcctactga 2581 gaagactacc ggctatgcgc atgcctatgg cagtgggtgg cacctgccag gccaagggtc 2641 agggtgccca gagcctccaa agagagctga ggggatgtgg gggccccaaa ctcattaatt 2701 cactgaagac acaggtccac agggagcgga tgatgaagca gaccccctcc ctgtcatcac 2761 cctctcctgg tgtagtgtgg atgcgaggcc acggctcagt gatgggctca ccacccagaa 2821 gtggggagag actttgggcc ctccacccag tggggcttgg cctggcttct gtggcctggg 2881 cgtgttgtgg actcagcact ggggcctgtc acccaaggct cctccaacat gcgggaggag 2941 gcttagcaga cttgcgctgc accagcgaat ctgcctgggc tgctcctgtc cctccctccc 3001 tgagatccat gtaaggggca ccacaaccca cctggaactt gtggtgggga cccatgatgt 3061 atgggtctca cctgacttga ggtgaatttt ggagtgaagg gccctgaggt cagctcccag 3121 gtcggtcgtg ctgggccagg cctggttttc acaggggctg aaggatccca gtcacctgtg 3181 tgcatgtcaa gtggctcggg ccgggaagaa gccagcaaag tcccccgtgt cccttgctga 3241 gtattctgtc acagacaagc ctccattaaa gccacagcag tgctacccac cacacacacc 3301 ttgctggccc ggccaccact gctggcttca gccccttgag cagcccatgg cttagcagaa 3361 ccccagatgt aggtcagtgg ccttacctgt ctctatccat gctgtcaact cctgcctcca 3421 cctggggtca cccagtcaca ttgggaaggg ctgtgaaggc ctccaggctg gccttccagg 3481 gaatcctgag gcctggggtg gctcctgccc cttctgccct gccttgcccc tgcactatgc 3541 tcttggctcc tgtggaagga gggctgccct cttgccctag tgagggcccc atgtggatcc 3601 actctagtgc tggagccagc gctcccttac tgggaacagg attccaggac ccctttcttg 3661 ttgtggctgc catgaagcca cagctccttg gggaagtgac ctgctctcct ttgggtgtat 3721 gcaggtgtgt ggggggccct gagtggcaag ttgcttagct aacaggagat ccataggcag 3781 cctgcaggct aggaagtggc ctagtgcaag atgagctggg aacaagggaa gagagagcag 3841 gagctggggc agaggctgag ccgggaggcc cttgaggtga ggacacagca ggcccaggac 3901 catggctggg gaggatatgt cagcacctgg aagtggagtg caggctgcag cgcccagcca 3961 tgtgggccag gtgcattcac tcagagtggg gccacacacc catctaccca gtttccacaa 4021 gatgtggctc ctgccacacc cacagggcag cctcctccaa atccctcctg gaggggccta 4081 cccagaagcc tccttgaacc agtctgcaac cctgctctat gctgaccctt gtcactgaac 4141 cctgatctag acttatatga ataaatgaaa ttacatgcca agggccctaa aaagcaaatt 4201 ttacaaaatt gtgtgccagt ttctggg // LOCUS HUMM6PR 2428 bp mRNA PRI 07-JAN-1995 DEFINITION Human cation-dependent mannose 6-phosphate-specific receptor mRNA, complete cds. ACCESSION M16985 J02937 M19258 M19259 NID g187282 KEYWORDS mannose 6-phosphate-specific receptor. SOURCE Human placenta, cDNA to mRNA (library of Clontech), clones P[4a,29]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2428) AUTHORS Pohlmann,R., Nagel,G., Schmidt,B., Stein,M., Lorkowski,G., Krentler,C., Cully,J., Meyer,H.E., Grzeschik,K.-H., Mersmann,G., Hasilik,A. and von Figura,K. TITLE Cloning of a cDNA encoding the human cation-dependent mannose 6-phosphate-specific receptor JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84 (16), 5575-5579 (1987) MEDLINE 87289647 COMMENT The protein is a presumptive prepropeptide, though the cleavage site for the mature peptide was not determined. Draft entry and computer readable copy of sequence [1] kindly provided by K.von Figura, 01-OCT-1987. FEATURES Location/Qualifiers source 1..2428 /organism="Homo sapiens" /db_xref="taxon:9606" /map="12" mRNA <1..2428 /note="mps receptor mRNA" sig_peptide 146..223 /gene="M6PR" /note="mannose 6-phosphate-specific receptor protein signal peptide" gene 146..979 /gene="M6PR" CDS 146..979 /gene="M6PR" /note="mannose 6-phosphate-specific receptor protein precursor" /codon_start=1 /db_xref="GDB:G00-120-162" /db_xref="PID:g307147" /translation="MFPFYSCWRTGLLLLLLAVAVRESWQTEEKTCDLVGEKGKESEK ELALVKRLKPLFNKSFESTVGQGSDTYIYIFRVCREAGNHTSGAGLVQINKSNGKETV VGRLNETHIFNGSNWIMLIYKGGDEYDNHCGKEQRRAVVMISCNRHTLADNFNPVSEE RGKVQDCFYLFEMDSSLACSPEISHLSVGSILLVTFASLVAVYVVGGFLYQRLVVGAK GMEQFPHLAFWQDLGNLVADGCDFVCRSKPRNVPAAYRGVGDDQLGEESEERDDHLLP M" mat_peptide 224..976 /gene="M6PR" /note="mannose 6-phosphate-specific receptor protein" BASE COUNT 618 a 534 c 576 g 700 t ORIGIN 319 bp upstream of HindIII site; chromosome 12. 1 cgaggcgcta gggggaacgc tggcctctga aactagctct gggaccgggg tctgcggccg 61 gcccctagct ggccccgtct cccatcccca gaagggtatt cactggggat tctgagcttt 121 ggctactcca gtttcccacg acacgatgtt ccctttctac agctgctgga ggactggact 181 gctactacta ctcctggctg tggcagtgag agaatcctgg cagacagaag aaaaaacttg 241 cgacttggta ggagaaaagg gtaaagagtc agagaaagag ttggctctag tgaagaggct 301 gaaaccactg tttaataaaa gctttgagag cactgtgggc cagggttcag acacatacat 361 ctacatcttc agggtgtgcc gggaagctgg caaccacact tctggggcag gcctggtgca 421 aatcaacaaa agtaatggga aggagacagt ggtagggaga ctcaacgaga ctcacatctt 481 caacggaagt aattggatca tgctgatcta taaagggggt gatgaatatg acaaccactg 541 tggcaaggag cagcgtcgtg cagtggtgat gatctcctgc aatcgacaca ccctagcgga 601 caattttaac cctgtgtctg aggagcgtgg caaagtccaa gattgtttct acctctttga 661 gatggatagc agcctggcct gttcaccaga gatctcccac ctcagtgtgg gttccatctt 721 acttgtcacg tttgcatcac tggttgctgt ttatgttgtt ggggggttcc tataccagcg 781 actggtagtg ggagccaaag gaatggagca gtttccccac ttagccttct ggcaggatct 841 tggcaacctg gtagcagatg gctgtgactt tgtctgccgt tctaaacctc gaaatgtgcc 901 tgcagcatat cgtggtgtgg gggatgacca gctgggggag gagtcagaag aaagggatga 961 ccatttatta ccaatgtaga ttgcacttta tatgtccagc ctcttcctca gtcccccaaa 1021 ccaaagctac acagccagat ttctcaagca gtctcaactc cagtccctca tctcaccctt 1081 actattgctc ttgctttcca gtttgctttt gatttgcatc ttctcactag taaaactgcc 1141 ttccctttgt tccttatttt ctgttttttc tctagagagg tacagttgta agtcagagtt 1201 aatataatag ggcctgtgaa aacagaggct tttgcattgt ctcttgacat cagaagttac 1261 aataggcata tgggcaaaat ggtgtagcag gctcactggc cgtttgtttt ttaaacacat 1321 tttcacaagt ttttgagaca ctggatttct ttaattaaaa aaaaaatgcc aagaaacatt 1381 atttatacag ggttgattgc tttcatgttg ttattctgta ccctatagta gcctccatga 1441 gaatctggta tttcttgctg cttggaacta ctttgcagtg attacttggt tgcagtccaa 1501 gtactctcgt ttagtctgag cctggagatg ttctagactt gcttctccca cctctgagat 1561 taggacagga aaaatgtgaa atttcccaat tacaggatta tacggtacca tcacatcatt 1621 tgtggaaatt ggggtgactg tatagctggg attgggctaa ggactgtggt cttatctgtc 1681 cacatacagc caaaatgcct atccagaaat ccagttcgtt ggaaaggaaa attggtactc 1741 ctgtgccaca ggggttccag aaaagggaag tcactttacc ttgcggtggt gggatcctga 1801 tgtctttcat ccatttgtag taaaagctgg taaagctttt cttactcctg gttccctacc 1861 agtatttcta aacatgtcgc actttctcca caggcatgtg gttttgacct ttttttcaat 1921 cttctagaaa gggaacggaa gcagaagtgg gacatcgagg gctctgctgt cctctgcgct 1981 gggtgtggaa tgctgctgca cctgtccctt ctgctggctc agggaagtgt cttcttgccc 2041 acatttctgt ggggaaaggt ttttaatcct ctgatgcttc catcttcctg tttaggccat 2101 gtgcccagaa acctggactg atctttcttt aatagtgaac ccctgggcca ctgaagagta 2161 acatggctcc actggacaca aaagagggat ggaatcaaca ggcagggggc cttttataag 2221 ccttaggaaa agaaaatgaa actatttcat ctttggactt ttcaatacta ttggagtgat 2281 ttttttcttt ctaaacaggg aaaataatgt tacaaaagca tcttttttgt tatttgtttg 2341 catccctccc ccacaccctg gtgttttaaa atgaagaaaa aaaaccatca ccttttgtac 2401 aaaaactctt aatgattaaa aaacaaac // LOCUS HUMMAC25X 1115 bp mRNA PRI 02-FEB-1994 DEFINITION Human MAC25 mRNA, complete cds. ACCESSION L19182 NID g307150 KEYWORDS . SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1115) AUTHORS Murphy,M., Pykett,M.J., Harnish,P., Zang,K.D. and George,D.L. TITLE Identification and characterization of genes differentially expressed in meningiomas JOURNAL Cell Growth Differ. 4 (9), 715-722 (1993) MEDLINE 94059820 FEATURES Location/Qualifiers source 1..1115 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="arachnoid" /tissue_type="leptomeninges" gene 14..847 /gene="MAC25" CDS 14..847 /gene="MAC25" /codon_start=1 /db_xref="PID:g307151" /translation="MERASLRALLFGPAGLLLLLLPLSSSSSSDTCGPCEPASCPPLP PLGCLLGETRDACGCCPMCARGEGEPCGGGGAGRGYCAPGMECVKSRKRRRGKAGAAA GGPGVSGVCVCKSRYPVCGSDGTTYPSGCQLRAASQRAESRGEKAITQVSKGTCEQGP SIVTPPKDIWNVTGAQVYLSCEVIGIPTPVLIWNKVKRGHYGVQRTELLPGDRDNLAI QTRGGPEKHEVTGWVLVSPLSKEDAGEYECHASNSQGQASASAKITVVDALHEIASEK R" BASE COUNT 270 a 311 c 311 g 223 t ORIGIN 1 ctctaaagcc gccatggagc gcgcgtcgct gcgcgccctg ctcttcggcc ccgctgggct 61 gctgctcctg ctcctgcccc tctcctcttc ctcctcttcg gacacctgcg gcccctgcga 121 gccggcctcc tgcccgcccc tgcccccgct gggctgcctg ctgggcgaga cccgcgacgc 181 gtgcggctgc tgccctatgt gcgcccgcgg cgagggcgag ccgtgcgggg gtggcggcgc 241 cggcaggggg tactgcgcgc cgggcatgga gtgcgtgaag agccgcaaga ggcggagggg 301 taaagccggg gcagcagccg gcggtccggg tgtaagcggc gtgtgcgtgt gcaagagccg 361 ctacccggtg tgcggcagcg acggcaccac ctacccgagc ggctgccagc tgcgcgccgc 421 cagccagagg gccgagagcc gcggggagaa ggccatcacc caggtcagca agggcacctg 481 cgagcaaggt ccttccatag tgacgccccc caaggacatc tggaatgtca ctggtgccca 541 ggtgtacttg agctgtgagg tcatcggaat cccgacacct gtcctcatct ggaacaaggt 601 aaaaaggggt cactatggag ttcaaaggac agaactcctg cctggtgacc gggacaacct 661 ggccattcag acccggggtg gcccagaaaa gcatgaagta actggctggg tgctggtatc 721 tcctctaagt aaggaagatg ctggagaata tgagtgccat gcatccaatt cccaaggaca 781 ggcttcagca tcagcaaaaa ttacagtggt tgatgcctta catgaaatag ccagtgaaaa 841 aaggtgaagg tgccgagcta taaacctcca gaatattatt agtctgcatg gttaaaagta 901 gtcatggata actacattac cctgttcttg cctaataagt ttcttttaat ccaatccact 961 aacactttag ttatattcac tggttttaca cagagaaata caaaataaag atcacacatc 1021 aagactatct acaaaaattt attatatatt tacagaagaa aagcatgcat atcattaaac 1081 aaataaaata ctttttatca caaaaaactt tagag // LOCUS HUMMAC2A 2257 bp mRNA PRI 15-JUL-1993 DEFINITION Human Mac-2 binding protein mRNA, complete cds. ACCESSION L13210 NID g307152 KEYWORDS L3 antigen; Mac-2 antigen; Mac-2 binding protein; cysteine-rich domain; macrophage scavenger receptor; proteolytic cleavage. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2257) AUTHORS Koths,K., Taylor,E., Halenbeck,R., Casipit,C. and Wang,A. TITLE Cloning and characterization of a human Mac-2 binding protein, a new member of the superfamily defined by the macrophage scavenger receptor cysteine-rich domain JOURNAL J. Biol. Chem. 268, 14245-14249 (1993) MEDLINE 93300818 FEATURES Location/Qualifiers source 1..2257 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="THP-1" sig_peptide 180..233 /evidence=experimental CDS 180..1937 /note="major proteolytic cleavage site at bp 1484-1485; cysteine-rich domain, similar to human scavenger receptor cysteine-rich domain, at bp 234-554" /codon_start=1 /product="Mac-2 binding protein" /db_xref="PID:g307153" /translation="MTPPRLFWVWLLVAGTQGVNDGDMRLADGGATNQGRVEIFYRGQ WGTVCDNLWDLTDASVVCRALGFENATQALGRAAFGQGSGPIMLDEVQCTGTEASLAD CKSLGWLKSNCRHERDAGVVCTNETRSTHTLDLSRELSEALGQIFDSQRGCDLSISVN VQGEDALGFCGHTVILTANLEAQALWKEPGSNVTMSVDAECVPMVRDLLRYFYSRRID ITLSSVKCFHKLASAYGARQLQGYCASLFAILLPQDPSFQMPLDLYAYAVATGDALLE KLCLQFLAWNFEALTQAEAWPSVPTDLLQLLLPRSDLAVPSELALLKAVDTWSWGERA SHEEVEGLVEKIRFPMMLPEELFELQFNLSLYWSHEALFQKKTLQALEFHTVPFQLLA RYKGLNLTEDTYKPRIYTSPTWSAFVTDSSWSARKSQLVYQSRRGPLVKYSSDYFQAP SDYRYYPYQSFQTPQHPSFLFQDKRVSWSLVYLPTIQSCWNYGFSCSSDELPVLGLTK SGGSDRTIAYENKALMLCEGLFVADVTDFEGWKAAIPSALDTNSSKSTSSFPCPAGHF NGFRTVIRPFYLTNSSGVD" mat_peptide 234..1934 /evidence=experimental /product="Mac-2 binding protein" polyA_signal 2235..2240 /note="putative" polyA_site 2257 BASE COUNT 437 a 738 c 636 g 446 t ORIGIN 1 aatcgaaagt agactctttt ctgaagcatt tcctgggatc agcctgacca cgctccatac 61 tgggagaggc ttctgggtca aaggaccagt ctgcagaggg atcctgtggc tggaagcgag 121 gaggctccac acggccgttg cagctaccgc agccaggatc tgggcatcca ggcacggcca 181 tgacccctcc gaggctcttc tgggtgtggc tgctggttgc aggaacccaa ggcgtgaatg 241 atggtgacat gcggctggcc gatgggggcg ccaccaacca gggccgcgtg gagatcttct 301 acagaggcca gtggggcact gtgtgtgaca acctgtggga cctgactgat gccagcgtcg 361 tctgccgggc cctgggcttc gagaacgcca cccaggctct gggcagagct gccttcgggc 421 aaggatcagg ccccatcatg ctggacgagg tccagtgcac gggaaccgag gcctcactgg 481 ccgactgcaa gtccctgggc tggctgaaga gcaactgcag gcacgagaga gacgctggtg 541 tggtctgcac caatgaaacc aggagcaccc acaccctgga cctctccagg gagctctcgg 601 aggcccttgg ccagatcttt gacagccagc ggggctgcga cctgtccatc agcgtgaatg 661 tgcagggcga ggacgccctg ggcttctgtg gccacacggt catcctgact gccaacctgg 721 aggcccaggc cctgtggaag gagccgggca gcaatgtcac catgagtgtg gatgctgagt 781 gtgtgcccat ggtcagggac cttctcaggt acttctactc ccgaaggatt gacatcaccc 841 tgtcgtcagt caagtgcttc cacaagctgg cctctgccta tggggccagg cagctgcagg 901 gctactgcgc aagcctcttt gccatcctcc tcccccagga cccctcgttc cagatgcccc 961 tggacctgta tgcctatgca gtggccacag gggacgccct gctggagaag ctctgcctac 1021 agttcctggc ctggaacttc gaggccttga cgcaggccga ggcctggccc agtgtcccca 1081 cagacctgct ccaactgctg ctgcccagga gcgacctggc ggtgcccagc gagctggccc 1141 tactgaaggc cgtggacacc tggagctggg gggagcgtgc ctcccatgag gaggtggagg 1201 gcttggtgga gaagatccgc ttccccatga tgctccctga ggagctcttt gagctgcagt 1261 tcaacctgtc cctgtactgg agccacgagg ccctgttcca gaagaagact ctgcaggccc 1321 tggaattcca cactgtgccc ttccagttgc tggcccggta caaaggcctg aacctcaccg 1381 aggataccta caagccccgg atttacacct cgcccacctg gagtgccttt gtgacagaca 1441 gttcctggag tgcacggaag tcacaactgg tctatcagtc cagacggggg cctttggtca 1501 aatattcttc tgattacttc caagccccct ctgactacag atactacccc taccagtcct 1561 tccagactcc acaacacccc agcttcctct tccaggacaa gagggtgtcc tggtccctgg 1621 tctacctccc caccatccag agctgctgga actacggctt ctcctgctcc tcggacgagc 1681 tccctgtcct gggcctcacc aagtctggcg gctcagatcg caccattgcc tacgaaaaca 1741 aagccctgat gctctgcgaa gggctcttcg tggcagacgt caccgatttc gagggctgga 1801 aggctgcgat tcccagtgcc ctggacacca acagctcgaa gagcacctcc tccttcccct 1861 gcccggcagg gcacttcaac ggcttccgca cggtcatccg ccccttctac ctgaccaact 1921 cctcaggtgt ggactagacg gcgtggccca agggtggtga gaaccggaga accccaggac 1981 gccctcactg caggctcccc tcctcggctt ccttcctctc tgcaatgacc ttcaacaacc 2041 ggccaccaga tgtcgcccta ctcacctgag cgctcagctt caagaaatta ctggaaggct 2101 tccactaggg tccaccagga gttctcccac cacctcacca gtttccaggt ggtaagcacc 2161 aggacgccct cgaggttgct ctgggatccc cccacagccc ctggtcagtc tgcccttgtc 2221 actggtctga ggtcattaaa attacattga ggttcct // LOCUS HUMMACT 1518 bp mRNA PRI 17-JAN-1992 DEFINITION Human mRNA for acetoacetyl-coenzyme A thiolase (EC 2.3.1.9). ACCESSION D90228 M61117 NID g219917 KEYWORDS mitochondrial acetoacetyl-CoA thiolase. SOURCE Human liver, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1518) AUTHORS Fukao,T., Yamaguchi,S., Kano,M., Orii,T., Fujiki,Y., Osumi,T. and Hashimoto,T. TITLE Molecular cloning and sequence of the complementary DNA encoding human mitochondrial acetoacetyl-coenzyme A thiolase and study of the variant enzymes in cultured fibroblasts from patients with 3-ketothiolase deficiency JOURNAL J. Clin. Invest. 86 (6), 2086-2092 (1990) MEDLINE 91072688 COMMENT These data kindly submitted in computer readable form by: Toshiyuki Fukao Department of Pediatrics Gifu University School of Medicine 40 Tsukasa-machi Gifu 500 Japan Phone: 0582-65-1241 Fax: 0582-65-9011. FEATURES Location/Qualifiers source 1..1518 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 77..1360 /note="mitochondrial acetoacetyl-CoA thiolase precursor" /codon_start=1 /db_xref="PID:d1014983" /db_xref="PID:g219918" /translation="MAVLAALLRSGARSRSPLLRRLVQEIRYVERSYVSKPTLKEVVI VSATRTPIGSFLGSLSLLPATKLGSIAIQGAIEKAGIPKEEVKEAYMGNVLQGGEGQA PTRQAVLGAGLPISTPCTTINKVCASGMKAIMMASQSLMCGHQDVMVAGGMESMSNVP YVMNRGSTPYGGVKLEDLIVKDGLTDVYNKIHMGSCAENTAKKLNIARNEQDAYAINS YTRSKAAWEAGKFGNEVIPVTVTVKGQPDVVVKEDEEYKRVDFSKVPKLKTVFQKENG TVTAANASTLNDGAAALVLMTADAAKRLNVTPLARIVAFADAAVEPIDFPIAPVYAAS MVLKDVGLKKEDIAMWEVNEAFSLVVLANIKMLEIDPQKVNINGGAVSLGHPIGMSGA RIVGHLTHALKQGEYGLASICNGGGGASAMLIQKL" sig_peptide 77..175 /note="leader peptide" mat_peptide 176..1357 /note="mature peptide" misc_feature 452..454 /note="put. active site (Cys)" BASE COUNT 454 a 283 c 390 g 391 t ORIGIN 1 aggccgctag ggtgcggggt tggggaggag gccgctagtc tacgcctgtg gagccgatac 61 tcagccctct gcgaccatgg ctgtgctggc ggcacttctg cgcagcggcg cccgcagccg 121 cagccccctg ctccggaggc tggtgcagga aataagatat gtggaacgga gttatgtatc 181 aaaacccact ttgaaggaag tggtcatagt aagtgctaca agaacaccca ttggatcttt 241 tttaggcagc ctttccttgc tgccagccac taagcttggt tccattgcaa ttcagggagc 301 cattgaaaag gcagggattc caaaagaaga agtgaaagaa gcatacatgg gtaatgttct 361 acaaggaggt gaaggacaag ctcctacaag gcaggcagta ttgggtgcag gcttacctat 421 ttctactcca tgtaccacca taaacaaagt ttgtgcttca ggaatgaaag ccatcatgat 481 ggcctctcaa agtcttatgt gtggacatca ggatgtgatg gtggcaggtg ggatggagag 541 catgtccaat gttccatatg taatgaacag aggatcaaca ccatatggtg gggtaaagct 601 tgaagatttg attgtaaaag acgggctaac tgatgtctac aataaaattc atatgggcag 661 ctgtgctgag aatacagcaa agaagctgaa tattgcacga aatgaacagg acgcttatgc 721 tattaattct tataccagaa gtaaagcagc atgggaagct gggaaatttg gaaatgaagt 781 tattcctgtc acagttacag taaaaggtca accagatgta gtggtgaaag aagatgaaga 841 atataaacgt gttgatttta gcaaagttcc aaagctgaag acagttttcc agaaagaaaa 901 tggcacagta acagctgcca atgccagtac actgaatgat ggagcagctg ctctggttct 961 catgacggca gatgcagcga agaggctcaa tgttacacca ctggcaagaa tagtagcatt 1021 tgctgacgct gctgtagaac ctattgattt tccaattgct cctgtatatg ctgcatctat 1081 ggttcttaaa gatgtgggat tgaaaaaaga agatattgca atgtgggaag taaatgaagc 1141 ctttagtctg gttgtactag caaacattaa aatgttggag attgatcccc aaaaagtgaa 1201 tatcaatgga ggagctgttt ctctgggaca tccaattggg atgtctggag ccaggattgt 1261 tggtcatttg actcatgcct tgaagcaagg agaatacggt cttgccagta tttgcaatgg 1321 aggaggaggt gcttctgcca tgctaattca gaagctgtag acaacctctg ctatttaagg 1381 agacaaccct atgtgaccag aaggcctgct gtaatcagtg tgactactgt gggtcagctt 1441 atattcagat aagctgtttc attttttatt attttctatg ttaactttta aaaatcaaaa 1501 tgatgaaatc ccaaaaca // LOCUS HUMMAD 1002 bp mRNA PRI 05-MAR-1993 DEFINITION Homo sapiens antagonizer of myc transcriptional activity (Mad) mRNA, complete cds. ACCESSION L06895 NID g187288 KEYWORDS antagonizer of myc transcriptional activity. SOURCE Homo sapiens (library: gt10) lung cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1002) AUTHORS Ayer,D.E., Kretzner,L. and Eisenman,R.N. TITLE Mad: a heterodimeric partner for max that antigonizes myc transcriptional activity JOURNAL Cell 72, 211-222 (1993) MEDLINE 93145323 FEATURES Location/Qualifiers source 1..1002 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="WI26 VA4" /cell_type="fibroblast" /tissue_type="lung" /tissue_lib="gt10" CDS 148..813 /codon_start=1 /product="antagonizer of myc transcriptional activity" /db_xref="PID:g187289" /translation="MAAAVRMNIQMLLEAADYLERREREAEHGYASMLPYNNKDRDAL KRRNKSKKNNSSSRSTHNEMEKNRRAHLRLCLEKLKGLVPLGPESSRHTTLSLLTKAK LHIKKLEDCDRKAVHQIDQLQREQRHLKRQLEKLGIERIRMDSIGSTVSSERSDSDRE EIDVDVESTDYLTGDLDWSSSSVSDSDERGSMQSLGSDEGYSSTSIKRIKLQDSHKAC LGL" BASE COUNT 255 a 255 c 298 g 194 t ORIGIN 1 cgccagagag gctccctcag ccctgctccg cggggtccac agcgggctcc acagcgggct 61 ccatagcggg ctccacagcg gtccggcggc ggcagcgagc ccgtgggcag tgggggttgg 121 tcccgtggct ccggcccccg gtgcagaatg gcggcggcgg ttcggatgaa catccagatg 181 ctgctggagg cggccgacta tctggagcgg cgggagagag aagctgaaca tggttatgcc 241 tccatgttac catacaataa caaggacaga gatgccttaa aacggaggaa caaatccaaa 301 aagaataaca gcagtagcag atcaactcac aatgaaatgg agaagaatag acgggctcat 361 cttcgcttgt gcctggagaa gttgaagggg ctggtgccac tgggacccga atcaagtcga 421 cacactacgt tgagtttatt aacaaaagcc aaattgcaca taaagaaact tgaagattgt 481 gacagaaaag ccgttcacca aatcgaccag cttcagcgag agcagcgaca cctgaagagg 541 cagctggaga agctgggcat tgagaggatc cggatggaca gcatcggctc caccgtctcc 601 tcggagcgct ccgactccga cagggaagaa atcgacgttg acgtggagag cacggactat 661 ctcacaggtg atctggactg gagcagcagc agtgtgagcg actctgacga gcggggcagc 721 atgcagagcc tcggcagtga tgagggctat tccagcacca gcatcaagag aataaagctg 781 caggacagtc acaaggcgtg tcttggtctc taagagagtg ggcactgcgg ctgtctcctt 841 gaaggttctc cctgttggtt ctgattaggt aacgtattgg acctgcccac aactcccttg 901 cacgtaaact tcagtgtccc accttgacca aaatcagctt tgtaactgtt ttcaaggagg 961 tgcttaggat tgtgggtttc tgattgcatc actagcttct cc // LOCUS HUMMAD3A 1550 bp mRNA PRI 07-MAR-1994 DEFINITION Homo sapiens MAD-3 mRNA encoding IkB-like activity, complete cds. ACCESSION M69043 NID g187290 KEYWORDS . SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1550) AUTHORS Haskill,S., Beg,A.A., Tompkins,S.M., Morris,J.S., Yurochko,A.D., Sampson-Johannes,A., Mondal,K., Ralph,P. and Baldwin,A.S.Jr.. TITLE Characterization of an immediate-early gene induced in adherent monocytes which encodes ikB-like activity JOURNAL Cell 65, 1281-1289 (1991) MEDLINE 91292530 FEATURES Location/Qualifiers source 1..1550 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="monocyte, neutrophil" /tissue_lib="pcDNA I" gene 95..1550 /gene="MAD3" CDS 95..1048 /gene="MAD3" /codon_start=1 /function="IkB-like activity" /db_xref="PID:g187291" /translation="MFQAAERPQEWAMEGPRDGLKKERLLDDRHDSGLDSMKDEEYEQ MVKELQEIRLEPQEVPRGSEPWKQQLTEDGDSFLHLAIIHEEKALTMEVIRQVKGDLA FLNFQNNLQQTPLHLAVITNQPEIAEALLGAGCDPELRDFRGNTPLHLACEQGCLASV GVLTQSCTTPHLHSILKATNYNGHTCLHLASIHGYLGIVELLVSLGADVNAQEPCNGR TALHLAVDLQNPDLVSLLLKCGADVNRVTYQGYSPYQLTWGRPSTRIQQQLGQLTLEN LQMLPESEDEESYDTESEFTEFTEDELPYDDCVFGGQRLTL" misc_feature 1135..1139 /gene="MAD3" /note="attta motif" misc_feature 1346..1350 /gene="MAD3" /note="attta motif" misc_feature 1525..1529 /gene="MAD3" /note="attta motif" polyA_site 1550 /gene="MAD3" BASE COUNT 380 a 402 c 416 g 352 t ORIGIN 1 tgccgccgtc ccgcccgcca gcgccccagc gaggaagcag cgcgcagccc gcggcccagc 61 gcacccgcag cagcgcccgc agctcgtccg cgccatgttc caggcggccg agcgccccca 121 ggagtgggcc atggagggcc cccgcgacgg gctgaagaag gagcggctac tggacgaccg 181 ccacgacagc ggcctggact ccatgaaaga cgaggagtac gagcagatgg tcaaggagct 241 gcaggagatc cgcctcgagc cgcaggaggt gccgcgcggc tcggagccct ggaagcagca 301 gctcaccgag gacggggact cgttcctgca cttggccatc atccatgaag aaaaggcact 361 gaccatggaa gtgatccgcc aggtgaaggg agacctggct ttcctcaact tccagaacaa 421 cctgcagcag actccactcc acttggctgt gatcaccaac cagccagaaa ttgctgaggc 481 acttctggga gctggctgtg atcctgagct ccgagacttt cgaggaaata cccccctaca 541 ccttgcctgt gagcagggct gcctggccag cgtgggagtc ctgactcagt cctgcaccac 601 cccgcacctc cactccatcc tgaaggctac caactacaat ggccacacgt gtctacactt 661 agcctctatc catggctacc tgggcatcgt ggagcttttg gtgtccttgg gtgctgatgt 721 caatgctcag gagccctgta atggccggac tgcccttcac ctcgcagtgg acctgcaaaa 781 tcctgacctg gtgtcactcc tgttgaagtg tggggctgat gtcaacagag ttacctacca 841 gggctattct ccctaccagc tcacctgggg ccgcccaagc acccggatac agcagcagct 901 gggccagctg acactagaaa accttcagat gctgccagag agtgaggatg aggagagcta 961 tgacacagag tcagagttca cggagttcac agaggacgag ctgccctatg atgactgtgt 1021 gtttggaggc cagcgtctga cgttatgagt gcaaaggggc tgaaagaaca tggacttgta 1081 tatttgtaca aaaaaaaagt tttatttttc taaaaaaaga aaaaagaaga aaaaatttaa 1141 agggtgtact tatatccaca ctgcacactg cctagcccaa aacgtcttat tgtggtagga 1201 tcagccctca ttttgttgct tttgtgaact ttttgtaggg gacgagaaag atcattgaaa 1261 ttctgagaaa acttctttta aacctcacct ttgtggggtt tttggagaag gttatcaaaa 1321 atttcatgga aggaccacat tttatattta ttgtgcttcg agtgactgac cccagtggta 1381 tcctgtgaca tgtaacagcc aggagtgtta agcgttcagt gatgtggggt gaaaagttac 1441 tacctgtcaa ggtttgtgtt accctcctgt aaatggtgta cataatgtat tgttggtaat 1501 tattttggta cttttatgat gtatatttat taaagagatt tttacaaatg // LOCUS HUMMALENAD 1923 bp mRNA PRI 02-APR-1991 DEFINITION Human mitochondrial NAD(P)+ dependent malic enzyme mRNA, complete cds. ACCESSION M55905 NID g187299 KEYWORDS NAD(P)+ -dependent malic enzyme. SOURCE Human fibrosarcoma cell line HS 913T, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1923) AUTHORS Loeber,G., Infante,A.A., Maurer-Fogy,I., Krystek,E. and Dworkin,M.B. TITLE Human NAD+ -dependent mitochondrial malic enzyme: cDNA cloning, primary structure, and expression in Escherichia coli JOURNAL J. Biol. Chem. 266, 3016-3021 (1991) MEDLINE 91131600 FEATURES Location/Qualifiers source 1..1923 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HS913T" /tissue_type="fibrosarcoma" gene 90..1844 /gene="NAD(P)+ -dependent malic enzyme" CDS 90..1844 /gene="NAD(P)+ -dependent malic enzyme" /EC_number="1.1.1.39" /codon_start=1 /product="mitochondrial NAD(P)+ -dependent malic enzyme" /db_xref="PID:g187300" /translation="MLSRLRVVSTTCTLACRHLHIKEKGKPLMLNPRTNKGMAFTLQE RQMLGLQGLLPPKIETQDIQALRFHRNLKKMTSPLEKYIYIMGIQERNEKLFYRILQD DIESLMPIVYTPTVGLACSQYGHIFRRPKGLFISISDRGHVRSIVDNWPENHVKAVVV TDGERILGLGDLGVYGMGIPVGKLCLYTACAGIRPDRCLPVCIDVGTDNIALLKDPFY MGLYQKRDRTQQYDDLIDEFMKAITDRYGRNTLIQFEDFGNHNAFRFLRKYREKYCTF NDDIQGTAAVALAGLLAAQKVISKPISEHKILFLGAGEAALGIANLIVMSMVENGLSE QEAQKKIWMFDKYGLLVKGRKAKIDSYQEPFTHSAPESIPDTFEDAVNILKPSTIIGV AGAGRLFTPDVIRAMASINERPVIFALSNPTAQAECTAEEAYTLTEGRCLFASGSPFG PVKLTDGRVFTPGQGNNVYIFPGVALAVILCNTRHISDSVFLEAAKALTSQLTDEELA QGRLYPPLANIQEVSINIAIKVTEYLYANKMAFRYPEPEDKAKYVKERTWRSEYDSLL PDVYEWPESASSPPVITE" polyA_site 1923 /gene="NAD(P)+ -dependent malic enzyme" BASE COUNT 599 a 373 c 436 g 515 t ORIGIN 1 gctgagcatc gccagggcgg gcggcagggc gcggcctctc cgccgggtgt acctcctgtc 61 gcggcgcgag acctctggtg aaagaaaaga tgttgtcccg gttaagagta gtttccacca 121 cttgtacttt ggcatgtcga catttgcaca taaaagaaaa aggcaagcca cttatgctga 181 acccaagaac aaacaaggga atggcattta ctttacaaga acgacaaatg cttggtcttc 241 aaggacttct acctcccaaa atagagacac aagatattca agccttacga tttcatagaa 301 acttgaagaa aatgactagc cctttggaaa aatatatcta cataatggga atacaagaaa 361 gaaatgagaa attgttttat agaatactgc aagatgacat tgagagttta atgccaattg 421 tatatacacc gacggttggt cttgcctgct cccagtatgg acacatcttt agaagaccta 481 agggattatt tatttcgatc tcagacagag gtcatgttag atcaattgtg gataactggc 541 cagaaaatca tgttaaggct gttgtagtga ctgatggaga gagaattctg ggtcttggag 601 atctgggtgt ctatggaatg ggaattccag taggaaaact ttgtttgtat acagcttgtg 661 caggaatacg gcctgataga tgcctgccag tgtgtattga tgtgggaact gataatatcg 721 cactcttaaa agacccattt tacatgggct tgtaccagaa acgagatcgc acacaacagt 781 atgatgacct gattgatgag tttatgaaag ctattactga cagatatggc cggaacacac 841 tcattcagtt cgaagacttt ggaaatcata atgcattcag gttcttgaga aagtaccgag 901 aaaaatattg tactttcaat gatgatattc aagggacagc tgcagtagct ctagcaggtc 961 ttcttgcagc acaaaaagtt attagtaaac caatctccga acacaaaatc ttattccttg 1021 gagcaggaga ggctgctctt ggaattgcaa atcttatagt tatgtctatg gtagaaaatg 1081 gcctgtcaga acaagaggca caaaagaaaa tctggatgtt tgacaagtat ggtttattag 1141 ttaagggacg gaaagcaaaa atagatagtt atcaggaacc atttactcac tcagccccag 1201 agagcatacc tgatactttt gaagatgcag tgaatatact gaagccttca actattattg 1261 gagttgcagg tgctggccgt cttttcactc ctgatgtaat cagagccatg gcctctatca 1321 atgaaaggcc tgtaatattt gcattaagta atcctacagc acaggcagag tgcacggctg 1381 aagaagcata tacacttaca gagggcaggt gtttgtttgc cagtggcagt ccatttgggc 1441 cagtgaaact tacagatggg cgagtcttta caccaggtca aggaaacaat gtttatattt 1501 ttccaggtgt ggctttagct gttattctct gtaacacccg gcatattagt gacagtgttt 1561 tcctagaagc tgcaaaggcc ctgacaagcc aattgacaga tgaagagcta gcccaaggga 1621 gactttaccc accgcttgct aatattcagg aagtttctat taacattgct attaaagtta 1681 cagaatacct atatgctaat aaaatggctt tccgataccc agaacctgaa gacaaggcca 1741 aatatgttaa agaaagaaca tggcggagtg aatatgattc cctgctgcca gatgtgtatg 1801 aatggccaga atctgcatca agccctcctg tgataacaga atagaagcac tcccctgata 1861 aatactttct gtgctccagg gaaccccttt tttcagacaa gaagagataa tgtcttcagt 1921 ttt // LOCUS HUMMAMEPI 1245 bp mRNA PRI 07-JAN-1995 DEFINITION Human epithelial cell marker protein 1 (HMe1) mRNA, complete cds. ACCESSION M93010 NID g187301 KEYWORDS HMe1 protein; epithelial cell marker protein. SOURCE Homo sapiens female mammary gland cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1245) AUTHORS Prasad,G.L., Valverius,E.M., McDuffie,E. and Cooper,H.L. TITLE Complementary DNA cloning of a novel epithelial cell marker protein, HME1, that may be down-regulated in neoplastic mammary cells JOURNAL Cell Growth Differ. 3 (8), 507-513 (1992) MEDLINE 93002614 FEATURES Location/Qualifiers source 1..1245 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="184 (from M. Stampfer)" /cell_type="epithelial cell" /sex="female" /tissue_type="mammary gland" 5'UTR 1..10 /note="putative" gene 11..757 /gene="HME1" CDS 11..757 /gene="HME1" /standard_name="HMe1" /codon_start=1 /product="epithelial cell marker protein 1" /db_xref="PID:g187302" /translation="MERASLIQKAKLAEQAERYEDMAAFMKGAVEKGEELSCEERNLL SVAYKNVVGGQRAAWRVLSSIEQKSNEEGSEEKGPEVREYREKVETELQGVCDTVLGL LDSHLIKEAGDAESRVFHLKMKGDYYRYLAEVATGDDKKRIIDSARSAYQEAMDISKK EMPPTNPIRLGLALNFSVFHYEIANSPEEAISLAKTTFDEAMADLHTLSEDSYKDSTL IMQLLRDNLTLWTADNAGEEGGEVPQEPQS" 3'UTR 758..1245 repeat_region 1118..1153 /note="putative" /rpt_family="tg" /rpt_type=other /rpt_unit=1118..1119 polyA_signal 1223..1228 BASE COUNT 249 a 381 c 392 g 223 t ORIGIN 1 gcacgaggcc atggagagag ccagtctgat ccagaaggcc aagctggcag agcaggccga 61 acgctatgag gacatggcag ccttcatgaa aggcgccgtg gagaagggcg aggagctctc 121 ctgcgaagag cgaaacctgc tctcagtagc ctataagaac gtggtgggcg gccagagggc 181 tgcctggagg gtgctgtcca gtattgagca gaaaagcaac gaggagggct cggaggagaa 241 ggggcccgag gtgcgtgagt accgggagaa ggtggagact gagctccagg gcgtgtgcga 301 caccgtgctg ggcctgctgg acagccacct catcaaggag gccggggacg ccgagagccg 361 ggtcttccac ctgaagatga agggtgacta ctaccgctac ctggccgagg tggccaccgg 421 tgacgacaag aagcgcatca ttgactcagc ccggtcagcc taccaggagg ccatggacat 481 cagcaagaag gagatgccgc ccaccaaccc catccgcctg ggcctggccc tgaacttttc 541 cgtcttccac tacgagatcg ccaacagccc cgaggaggcc atctctctgg ccaagaccac 601 tttcgacgag gccatggctg atctgcacac cctcagcgag gactcctaca aagacagcac 661 cctcatcatg cagctgctgc gagacaacct gacactgtgg acggccgaca acgccgggga 721 agaggggggc gaggttcccc aggagcccca gagctgagtg ttgcccgcca ccgccccgcc 781 ctgccctcca gtcccccacc ctgccgagag gactagtatt gtggagggcc cacccttctc 841 ccctaggcgc tgttcttgct cccaaggctc cgtggagagg gactgcagac tgaggccacc 901 tgggctgggg atccactctt cttgcagctg ttgagcgcac ctaaccactg gtcatgcccc 961 cacccctgct ctccgcaccc gcttcctccc gaccccagga ccaggctact tctcccctcc 1021 tcttgcctcc ctcctgcccc tgctgcctct gatcgtagga attgaggagt gtccccttgt 1081 ggctgtgaac tggacagtgc aggggctgga gatggggtgt gtgtgtgtgt gtgtgtgtgt 1141 gtgtgtgtgt gtgctcgcgc gccagtgcaa gaccgagatt gagggaaagc atgtctgctg 1201 ggtgtgacca tgttttcctc tcaataaagt tggggtgtga cactc // LOCUS HUMMAOAA 1949 bp mRNA PRI 07-JAN-1995 DEFINITION Human monoamine oxidase A (MAOA) mRNA, complete cds. ACCESSION M68840 J03792 NID g187352 KEYWORDS monoamine oxidase A. SOURCE Human liver, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1949) AUTHORS Bach,A.W., Lan,N.C., Johnson,D.L., Abell,C.W., Bembenek,M.E., Kwan,S.W., Seeburg,P.H. and Shih,J.C. TITLE cDNA cloning of human liver monoamine oxidase A and B: molecular basis of differences in enzymatic properties JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85 (13), 4934-4938 (1988) MEDLINE 88263063 FEATURES Location/Qualifiers source 1..1949 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /map="Xp11.4-p11.3" gene 74..1657 /gene="MAOA" CDS 74..1657 /gene="MAOA" /EC_number="1.4.3.4" /codon_start=1 /db_xref="GDB:G00-120-164" /product="monoamine oxidase A" /db_xref="PID:g187353" /translation="MENQEKASIAGHMFDVVVIGGGISGLSAAKLLTEYGVSVLVLEA RDRVGGRTYTIRNEHVDYVDVGGAYVGPTQNRILRLSKELGIETYKVNVSERLVQYVK GKTYPFRGAFPPVWNPIAYLDYNNLWRTIDNMGKEIPTDAPWEAQHADKWDKMTMKEL IDKICWTKTARRFAYLFVNINVTSEPHEVSALWFLWYVKQCGGTTRIFSVTNGGQERK FVGGSGQVSERIMDLLGDQVKLNHPVTHVDQSSDNIIIETLNHEHYECKYVINAIPPT LTAKIHFRPELPAERNQLIQRLPMGAVIKCMMYYKEAFWKKKDYCGCMIIEDEDAPIS ITLDDTKPDGSLPAIMGFILARKADRLAKLHKEIRKKKICELYAKVLGSQEALHPVHY EEKNWCEEQYSGGCYTAYFPPGIMTQYGRVIRQPVGRIFFAGTETATKWSGYMEGAVE AGERAAREVLNGLGKVTEKDIWVQEPESKDVPAVEITHTFWERNLPSVSGLLKIIGFS TSVTALGFVLYKYKLLPRS" polyA_site 1949 /gene="MAOA" /note="G00-120-164" BASE COUNT 550 a 412 c 490 g 497 t ORIGIN 1 catagaaggg tccttcccac cctttgccgt ccccactcct gtgcctacga cccaggagcg 61 tgtcagccaa atcatggaga atcaagagaa ggcgagtatc gcgggccaca tgttcgacgt 121 agtcgtgatc ggaggtggca tttcaggact atctgctgcc aaactcttga ctgaatatgg 181 cgttagtgtt ttggttttag aagctcggga cagggttgga ggaagaacat atactataag 241 gaatgagcat gttgattacg tagatgttgg tggagcttat gtgggaccaa cccaaaacag 301 aatcttacgc ttgtctaagg agctgggcat agagacttac aaagtgaatg tcagtgagcg 361 tctcgttcaa tatgtcaagg ggaaaacata tccatttcgg ggcgcctttc caccagtatg 421 gaatcccatt gcatatttgg attacaataa tctgtggagg acaatagata acatggggaa 481 ggagattcca actgatgcac cctgggaggc tcaacatgct gacaaatggg acaaaatgac 541 catgaaagag ctcattgaca aaatctgctg gacaaagact gctaggcggt ttgcttatct 601 ttttgtgaat atcaatgtga cctctgagcc tcacgaagtg tctgccctgt ggttcttgtg 661 gtatgtgaag cagtgcgggg gcaccactcg gatattctct gtcaccaatg gtggccagga 721 acggaagttt gtaggtggat ctggtcaagt gagcgaacgg ataatggacc tcctcggaga 781 ccaagtgaag ctgaaccatc ctgtcactca cgttgaccag tcaagtgaca acatcatcat 841 agagacgctg aaccatgaac attatgagtg caaatacgta attaatgcga tccctccgac 901 cttgactgcc aagattcact tcagaccaga gcttccagca gagagaaacc agttaattca 961 gcggcttcca atgggagctg tcattaagtg catgatgtat tacaaggagg ccttctggaa 1021 gaagaaggat tactgtggct gcatgatcat tgaagatgaa gatgctccaa tttcaataac 1081 cttggatgac accaagcctg atgggtcact gcctgccatc atgggcttca ttcttgcccg 1141 gaaagctgat cgacttgcta agctacataa ggaaataagg aagaagaaaa tctgtgagct 1201 ctatgccaaa gtgctgggat cccaagaagc tttacatcca gtgcattatg aagagaagaa 1261 ctggtgtgag gagcagtact ctgggggctg ctacacggcc tacttccctc ctgggatcat 1321 gactcaatat ggaagggtga ttcgtcaacc cgtgggcagg attttctttg cgggcacaga 1381 gactgccaca aagtggagcg gctacatgga aggggcagtt gaggctggag aacgagcagc 1441 tagggaggtc ttaaatggtc tcgggaaggt gaccgagaaa gatatctggg tacaagaacc 1501 tgaatcaaag gacgttccag cggtagaaat cacccacacc ttctgggaaa ggaacctgcc 1561 ctctgtttct ggcctgctga agatcattgg attttccaca tcagtaactg ccctggggtt 1621 tgtgctgtac aaatacaagc tcctgccacg gtcttgaagt tctgttctta tgctctctgc 1681 tcactggttt tcaataccac caagaggaaa aatattgaca agtttaaagg ctgtgtcatt 1741 gggccatgtt taagtgtact ggatttaact acctttggct taattccaat cattgttaaa 1801 gtaaaaacaa ttcaaagaat cacctaatta atttcagtag atcaagctcc atcttatttg 1861 tcagtgtaga tcactcatgt taattgatag aataaagcct tgtgatcatt tctgaaattc 1921 acaagtaacg tgtatgtgct catcagaac // LOCUS HUMMAOB 2491 bp mRNA PRI 07-JAN-1995 DEFINITION Human monoamine oxidase B (MAOB) mRNA, complete cds. ACCESSION M69177 J03793 NID g187358 KEYWORDS monoamine oxidase B. SOURCE Human liver, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2491) AUTHORS Bach,A.W., Lan,N.C., Johnson,D.L., Abell,C.W., Bembenek,M.E., Kwan,S.W., Seeburg,P.H. and Shih,J.C. TITLE cDNA cloning of human liver monoamine oxidase A and B: molecular basis of differences in enzymatic properties JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85 (13), 4934-4938 (1988) MEDLINE 88263063 FEATURES Location/Qualifiers source 1..2491 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /map="Xp11.4-p11.3" gene 78..1640 /gene="MAOB" CDS 78..1640 /gene="MAOB" /codon_start=1 /db_xref="GDB:G00-119-377" /product="monoamine oxidase B" /db_xref="PID:g187359" /translation="MSNKCDVVVVGGGISGMAAAKLLHDSGLNVVVLEARDRVGGRTY TLRNQKVKYVDLGGSYVGPTQNRILRLAKELGLETYKVNEVERLIHHVKGKSYPFRGP FPPVWNPITYLDHNNFWRTMDDMGREIPSDAPWKAPLAEEWDNMTMKELLDKLCWTES AKQLATLFVNLCVTAETHEVSALWFLWYVKQCGGTTRIISTTNGGQERKFVGGSGQVS ERIMDLLGDRVKLERPVIYIDQTRENVLVETLNHEMYEAKYVISAIPPTLGMKIHFNP PLPMMRNQMITRVPLGSVIKCIVYYKEPFWRKKDYCGTMIIDGEEAPVAYTLDDTKPE GNYAAIMGFILAHKARKLARLTKEERLKKLCELYAKVLGSLEALEPVHYEEKNWCEEQ YSGGCYTTYFPPGILTQYGRVLRQPVDRIYFAGTETATHWSGYMEGAVEAGERAAREI LHAMGKIPEDEIWQSEPESVDVPAQPITTTFLERHLPSVPGLLRLIGLTTIFSATALG FLAHKRGLLVRV" polyA_site 2491 /gene="MAOB" /note="G00-119-377" BASE COUNT 647 a 556 c 630 g 658 t ORIGIN 1 ctggcaggca ggactgggat cgaggcccag aaaacggagc agcgggcacc agggaggcct 61 ggaacggggc gagcgccatg agcaacaaat gcgacgtggt cgtggtgggg ggcggcatct 121 caggtatggc agcagccaaa cttctgcatg actctggact gaatgtggtt gttctggaag 181 cccgggaccg tgtgggaggc aggacttaca ctcttaggaa ccaaaaggtt aaatatgtgg 241 accttggagg atcctatgtt ggaccaaccc agaatcgtat cttgagatta gccaaggagc 301 taggattgga gacctacaaa gtgaatgagg ttgagcgtct gatccaccat gtaaagggca 361 aatcataccc cttcaggggg ccattcccac ctgtatggaa tccaattacc tacttagatc 421 ataacaactt ttggaggaca atggatgaca tggggcgaga gattccgagt gatgccccat 481 ggaaggctcc ccttgcagaa gagtgggaca acatgacaat gaaggagcta ctggacaagc 541 tctgctggac tgaatctgca aagcagcttg ccactctctt tgtgaacctg tgtgtcactg 601 cagagaccca tgaggtctct gctctctggt tcctgtggta tgtgaagcag tgtggaggca 661 caacaagaat catctcgaca acaaatggag gacaggagag gaaatttgtg ggcggatctg 721 gtcaagtgag tgagcggata atggacctcc ttggagaccg agtgaagctg gagaggcctg 781 tgatctacat tgaccagaca agagaaaatg tccttgtgga gaccctaaac catgagatgt 841 atgaggctaa atatgtgatt agtgctattc ctcctactct gggcatgaag attcacttca 901 atccccctct gccaatgatg agaaaccaga tgatcactcg tgtgcctttg ggttcagtca 961 tcaagtgtat agtttattat aaagagcctt tctggaggaa aaaggattac tgtggaacca 1021 tgattattga tggagaagaa gctccagttg cctacacgtt ggatgatacc aaacctgaag 1081 gcaactatgc tgccataatg ggatttatcc tggcccacaa agccagaaaa ctggcacgtc 1141 ttaccaaaga ggaaaggttg aagaaacttt gtgaactcta tgccaaggtt ctgggttccc 1201 tagaagctct ggagccagtg cattatgaag aaaagaactg gtgtgaggag cagtactctg 1261 ggggctgcta cacaacttat ttcccccctg ggatcctgac tcaatatgga agggttctac 1321 gccagccagt ggacaggatt tactttgcag gcaccgagac tgccacacac tggagcggct 1381 acatggaggg ggctgtagag gccggggaga gagcagcccg agagatcctg catgccatgg 1441 ggaagattcc agaggatgaa atctggcagt cagaaccaga gtctgtggat gtccctgcac 1501 agcccatcac caccaccttt ttggagagac atttgccctc cgtgccaggc ctgctcaggc 1561 tgattggatt gaccaccatc ttttcagcaa cggctcttgg cttcctggcc cacaaaaggg 1621 ggctacttgt gagagtctaa agagagaggg tgtctgtaat cacactctct tcttactgta 1681 tttgggatat gagtttgggg aaagagttgc aagtaaagtt ccatgaagac aaatagtgtg 1741 gagtgaggcg ggggagcatg aagataaatc caactctgac tgtaaaatac aatggtatct 1801 ctttctccgt tgtggcccct gcttagtgtc ccttacctgg cttagcgttc tgtttcacca 1861 gtttccaagt ttattgccct caaatcttta gaatagttaa attggcttgt ttaaggttct 1921 tgctgcccca caacacacct tgcccatgca caggatgaat tttttcctac cattatggct 1981 ttgtgcttgt tcttcctctt acctgtatag cctcacttcc ctagttcttt gcattcgtcc 2041 ttaggtactg tattgttaca gctgaaagac agtaaagacc atttagtcct caccttctgt 2101 tttagagttg agcaaactga agcccacaga ggtggaactt aattacctaa gagccacaat 2161 aagccactgg tatctggggg actagaacac aaataattgc ttttcccacc tctttggatg 2221 ttttccccaa ttatcctcct tcactccctg tcatagttac cgatggtgtc ccgttgtgtg 2281 ggtttactct gtgctaagtt gtcttacact tctcaaatgc tactcagtat atagccttaa 2341 ctcttactgt tttgtgcggt gtgtctccag ctgattttaa cttttttgat ggtagaaatt 2401 ttatctcttc ttccttttgt atcctccatt gtatcttcat acaaaggaca gtacacactt 2461 gggtaattaa aaataaaagt tgattgacca t // LOCUS HUMMARKSG 1885 bp mRNA PRI 07-JAN-1995 DEFINITION Human myristoylated alanine-rich C-kinase substrate mRNA, complete cds. ACCESSION M68956 NID g187386 KEYWORDS myristoylated alanine-rich C-kinase substrate. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1885) AUTHORS Harlan,D.M., Graff,J.M., Stumpo,D.J., Eddy,R.L. Jr., Shows,T.B., Boyle,J.M. and Blackshear,P.J. TITLE The human myristoylated alanine-rich C kinase substrate (MARCKS) gene (MACS). Analysis of its gene product, promoter, and chromosomal localization JOURNAL J. Biol. Chem. 266 (22), 14399-14405 (1991) MEDLINE 91317795 FEATURES Location/Qualifiers source 1..1885 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="B lymphoblastid" /map="6q21-q27" gene 309..1307 /gene="MACS" CDS 309..1307 /gene="MACS" /codon_start=1 /db_xref="GDB:G00-118-835" /product="myristoylated alanine-rich C-kinase substrate" /db_xref="PID:g187387" /translation="MGAQFSKTAAKGEAAAERPGEAAVASSPSKANGQENGHVKVNGD ASPAAAESGAKEELQANGSAPAADKEEPAAAGSGAASPSAAEKGEPAAAAAPEAGASP VEKEAPAEGEAAEPGSPTAAEGEAASAASSTSSPKAEDGATPSPSNETPKKKKKRFSF KKSFKLSGFSFKKNKKEAGEGGEAEAPAAEGGKDEAAGGAAAAAAEAGAASGEQAAAP GEEAAAGEEGAAGGDSQEAKPQEAAVAPEKPPASDETKAAEEPSKVEEKKAEEAGASA AACEAPSAAGLVCPRRGGSPRGGARGRRSLNQACAAPSQEAQPECSPEAPPAEAAE" BASE COUNT 376 a 546 c 558 g 405 t ORIGIN chromosome 6, q21-qter. 1 tttattactt cttttttttt cgaactacac ttgggctcct ttttttgtgc tcgacttttc 61 cacccttttt ccctccctcc tgtgctgctg ctttttgatc tcttcgacta aaattttttt 121 atccggagtg tatttaatcg gttctgttct gtcctctcca ccacccccac ccccctccct 181 ccggtgtgtg tgccgctgcc gctgttgccg ccgccgctgc tgctgctgct cgccccgtcg 241 ttacaccaac ccgaggctct ttgtttcccc tcttggatct gttgagtttc tttgttgaag 301 aagccagcat gggtgcccag ttctccaaga ccgcagcgaa gggagaagcc gccgcggaga 361 ggcctgggga ggcggctgtg gcctcgtcgc cttccaaagc gaacggacag gagaatggcc 421 acgtgaaggt aaacggcgac gcttcgcccg cggccgccga gtcgggcgcc aaggaggagc 481 tgcaggccaa cggcagcgcc ccggccgccg acaaggagga gcccgcggcc gccgggagcg 541 gggcggcgtc gccctccgcg gccgagaaag gtgagccggc cgccgccgct gcccccgagg 601 ccggggccag cccggtagag aaggaggccc ccgcggaagg cgaggctgcc gagcccggct 661 cgcccacggc cgcggaggga gaggccgcgt cggccgcctc ctcgacttct tcgcccaagg 721 ccgaggacgg ggccacgccc tcgcccagca acgagacccc gaaaaaaaaa aagaagcgct 781 tttccttcaa gaagtctttc aagctgagcg gcttctcctt caagaagaac aagaaggagg 841 ctggagaagg cggtgaggct gaggcgcccg ctgccgaagg cggcaaggac gaggccgccg 901 ggggcgcagc tgcggccgcc gccgaggcgg gcgcggcctc cggggagcag gcagcggcgc 961 cgggcgagga ggcagcagcg ggcgaggagg gggcggcggg tggcgactcg caggaggcca 1021 agccccagga ggccgctgtc gcgccagaga agccgcccgc cagcgacgag accaaggccg 1081 ccgaggagcc cagcaaggtg gaggagaaaa aggccgagga ggccggggcc agcgccgccg 1141 cctgcgaggc cccctccgcc gccgggctgg tgtgcccccg gagaggaggc agcccccgcg 1201 gaggagcccg cggccgccgc agcctcaatc aagcctgcgc agccccctca caggaggccc 1261 agcccgagtg cagtccagaa gcccccccag cggaggcggc agagtaaaag agcaagcttt 1321 tgtgagataa tcgaagaaca ttttctcccc cgtttgtttg gttggagtgg tgccaggtac 1381 tggattttgg agaacttgtc tacaaccagg gattgatttt aaagatgtct ttttttattt 1441 tacttttttt taagcaccaa attttgttgt tttttttttc tcccctcccc acagatccca 1501 tctcaaatca ttctgttaac caccattcca acaggtcgag gagagcttaa acaccttctt 1561 cctctggcct tgtttctctt ttatttttta ttttttcgca tcagtattaa tgtttttgca 1621 tactttgcat ctttattcaa aagtgtaaac tttctttgtc aatctatgga catgcccata 1681 tatgaaggag atgggtgggt caaaaaggga tatcaaatga agtgataggg gtcacaatgg 1741 ggaaattgaa gtggtgcata acattgccaa aatagtgtgc cactagaaat ggtgtaaagg 1801 ctgtcttttt tttttttttt aaagaaaagt tattaccatg tattttgtga ggcaggttta 1861 caacactaca actcgtgccg aattc // LOCUS HUMMAS 1388 bp mRNA PRI 11-JUN-1993 DEFINITION Human mas proto-oncogene mRNA, complete cds. ACCESSION M13150 NID g187388 KEYWORDS mas oncogene; mas protein; membrane protein; proto-oncogene. SOURCE Human, cDNA to mRNA, clone pMS424; and DNA, clone pMS422. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1388) AUTHORS Young,D., Waitches,G., Birchmeier,C., Fasano,O. and Wigler,M. TITLE Isolation and characterization of a new cellular oncogene encoding a protein with multiple potential transmembrane domains JOURNAL Cell 45, 711-719 (1986) MEDLINE 86218084 COMMENT Draft entry and sequence in computer-readable form for [1] kindly provided by D.Young, 11-DEC-1986. The mas oncogene has a weak focus-inducing activity in transfected NIH 3T3 cells. A DNA rearrangement, which occurrs during transfection, is probably responsible for activation of the mas gene. The mas gene may be the first of a new functional class of oncogenes. The first 253 nucleotides are from DNA and the rest from cDNA. FEATURES Location/Qualifiers source 1..1388 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..>1388 /note="mas mRNA" CDS 268..1245 /note="mas protein" /codon_start=1 /db_xref="PID:g307158" /translation="MDGSNVTSFVVEEPTNISTGRNASVGNAHRQIPIVHWVIMSISP VGFVENGILLWFLCFRMRRNPFTVYITHLSIADISLLFCIFILSIDYALDYELSSGHY YTIVTLSVTFLFGYNTGLYLLTAISVERCLSVLYPIWYRCHRPKYQSALVCALLWALS CLVTTMEYVMCIDREEESHSRNDCRAVIIFIAILSFLVFTPLMLVSSTILVVKIRKNT WASHSSKLYIVIMVTIIIFLIFAMPMRLLYLLYYEYWSTFGNLHHISLLFSTINSSAN PFIYFFVGSSKKKRFKESLKVVLTRAFKDEMQPRRQKDNCNTVTVETVV" BASE COUNT 346 a 329 c 284 g 429 t ORIGIN 1032 bp upstream of SalI site. 1 ggatccagaa gggtcattca atcagttctc agtcttatca ggtctaagtt cctttcttat 61 caggtcctaa aggcctaatc ttatcattgt gacaaagata actgtagagt ctgttaaact 121 ttttttttaa taacatgaag attatgattt atagctgaat ttctcccttt tattccaatt 181 caacaatttt catggctttt tgtgtttgtt ttgttctgga catatttaca gaaaattacc 241 tgaagagttc caacctgagg cctcctcatg gatgggtcaa acgtgacatc atttgttgtt 301 gaggaaccca cgaacatctc aactggcagg aacgcctcag tcgggaatgc acatcggcaa 361 atccccatcg tgcactgggt cattatgagc atctccccag tggggtttgt tgagaatggg 421 attctcctct ggttcctgtg cttccggatg agaagaaatc ccttcactgt ctacatcacc 481 cacctgtcta tcgcagacat ctcactgctc ttctgtattt tcatcttgtc tatcgactat 541 gctttagatt atgagctttc ttctggccat tactacacaa ttgtcacatt atcagtgact 601 tttctgtttg gctacaacac gggcctctat ctgctgacgg ccattagtgt ggagaggtgc 661 ctgtcagtcc tttaccccat ctggtaccga tgccatcgcc ccaagtacca gtcggcattg 721 gtctgtgccc ttctgtgggc tctttcttgc ttggtgacca ccatggagta tgtcatgtgc 781 atcgacagag aagaagagag tcactctcgg aatgactgcc gagcagtcat catctttata 841 gccatcctga gcttcctggt cttcacgccc ctcatgctgg tgtccagcac catcttggtc 901 gtgaagatcc ggaagaacac gtgggcttcc cattcctcca agctttacat agtcatcatg 961 gtcaccatca ttatattcct catcttcgct atgcccatga gactccttta cctgctgtac 1021 tatgagtatt ggtcgacctt tgggaaccta caccacattt ccctgctctt ctccacaatc 1081 aacagtagcg ccaacccttt catttacttc tttgtgggaa gcagtaagaa gaagagattc 1141 aaggagtcct taaaagttgt tctgaccagg gctttcaaag atgaaatgca acctcggcgc 1201 cagaaagaca attgtaatac ggtcacagtt gagactgtcg tctaagaact gtgagggaag 1261 ttgtggataa aaatggtgga acacaggtca tttttagttt gtgcttggaa tatgacttaa 1321 gtatctccta aatgtgatac agaagaacat ctcatcccat atgcatgaga tactaattaa 1381 tgatgaaa // LOCUS HUMMAT1H 1312 bp mRNA PRI 24-APR-1996 DEFINITION Homo sapiens homolog of mouse MAT-1 oncogene mRNA, complete cds. ACCESSION L37385 NID g598186 KEYWORDS homologue; oncogene. SOURCE Homo sapiens (clone: hMAT-1) female mammary gland cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1312) AUTHORS Bandyopadhyay,G.K., Bera,T.K. and Nandi,S. TITLE Cloning of a cDNA for the human homolog of mouse MAT-1 oncogene and its expression in breast cancer cells JOURNAL Unpublished (1994) FEATURES Location/Qualifiers source 1..1312 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="hMAT-1" /sex="female" /tissue_type="mammary gland" mRNA 1..1312 CDS 546..773 /note="homolog of mouse MAT-1 oncogene" /codon_start=1 /product="unknown" /db_xref="PID:g598187" /translation="MYIKTALPCLPFFVVSSINLLSRPERGWEGNATVKGVGRIKLLH SPKQKRRAFSKNIKFTCSLRDYLDKVQVRSF" polyA_signal 1280..1285 BASE COUNT 344 a 329 c 304 g 335 t ORIGIN 1 ggatcctttc ctggtcccta agatcaaacc ccatggagca gccagcgtta gatgccccca 61 cccacctgta ctctggagag actgtgctgg gaacatgtac cactgagcct gagatgggga 121 tgagggcaga gagaggggag ccccctcttc cactcagttg ttcctactca gactgttgca 181 ctctaaacct agggaggttg aagaatgaga cccttaggtt ttaacacgaa tcctgacacc 241 accatctata gggtccaact tggttattgt aggcaacctt ccctctctcc ttggtgaaga 301 acatcccaag ccagaaagaa gttaactaca gtgttttcct ttgcaccgat ccccacccca 361 attcaatccc ggaaggactt acttaggaaa cccttcttta ctagatatcc tggccccctg 421 ggcttgtgaa cacctcctag ccacatcact acagtacagt gagtgacccc agcctcctgc 481 ctaccccaag atgcccctcc ccaccctgac cgtgctaact gtgtgtacat atatattcta 541 catatatgta tattaaaact gcactgccat gtctgccctt ttttgtggtg tctagcatta 601 acttattgtc taggccagag cgggggtggg aggggaatgc cacagtgaag ggagtgggca 661 gaatcaaatt gctacatagt ccaaagcaaa aaagaagggc tttttcaaaa aacattaaat 721 tcacatgcag tctcagagac tatttagaca aagttcaagt taggagcttt taggatgtgg 781 gagtaaaact ttaatgggag gggagggctg gctgctggaa gaaggaagaa gccagactgg 841 ttagacagta ctcttaactc ctagcccagc ctagcgtgcc ctgcccctct ggccactgct 901 gcagacacct gccttaacac acacacctct aggactccac agttttgcct taaaggacct 961 tcccaagtct ccctttccct gtctggcttc tcccttaaga agagagagat acttgtagaa 1021 ttgggtgggg ggaatgagca tgaactgtcc ttccatttgg gatatgttac attagagtga 1081 gagagagaat aaggagcctt tcttatggaa gaaatgggag aagagagaca gggttctttt 1141 cagcagagtc tagtagtttc tctgtaaggc aaaataatct aaaaagacta acctgcccac 1201 ccactcctta tattgctgtg agattgcccc tatcttgtgc tcttctgtct gcagtgtgca 1261 cggccttgtt ctaacccgga ataaaggtga ttgattgtat tgccgcggat cc // LOCUS HUMMC5R 1262 bp DNA PRI 07-JAN-1995 DEFINITION Human melanocortin 5 receptor (MC5R) gene, complete cds. ACCESSION L27080 NID g435599 KEYWORDS melanocortin 5 receptor. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1262) AUTHORS Griffon,N., Mignon,V., Facchinetti,P., Diaz,J., Schwartz,J.C. and Sokoloff,P. TITLE Molecular cloning and characterization of the rat fifth melanocortin receptor JOURNAL Biochem. Biophys. Res. Commun. 200 (2), 1007-1014 (1994) MEDLINE 94234987 FEATURES Location/Qualifiers source 1..1262 /organism="Homo sapiens" /db_xref="taxon:9606" gene 184..1161 /gene="MC5R" CDS 184..1161 /gene="MC5R" /codon_start=1 /product="melanocortin 5 receptor" /db_xref="PID:g435600" /translation="MNSSFHLHFLDLNLNATEGNLSGPNVKNKSSPCEDMGIAVEVFL TLGVISLLENILVIGAIVKNKNLHSPMYFFVCSLAVADMLVSMSSAWETITIYLLNNK HLVIADAFVRHIDNVFDSMICISVVASMCSLLAIAVDRYVTIFYALAYHHIMTARRSG AIIAGIWAFCTGCGIVFILYSESTYVILCLISMFFAMLFLLVSLYIHMFLLARTHVKR IAALPGASSARQRTSMQGAVTVTMLLGVFTVCWAPFFLHLTLMLSCPQNLYCSRFMSH FNMYLILIMCNSVMDPLIYAYRSQEMRKTFKEIICCRGFRIACSFPRRD" BASE COUNT 256 a 353 c 299 g 354 t ORIGIN 1 agactgagtg agcgagccag tcctctgatg cactgtgtat tcatcccctt tcttaggcgg 61 ctgtgttggt tctaggctag ctgctgtctt tctttggtag gctgctaacc tctttggatt 121 gtgaatttaa aacatgtttt acagtaaatt tgctgccaag acaagaggtg tatttctcca 181 gcaatgaatt cctcatttca cctgcatttc ttggatctca acctgaatgc cacagagggc 241 aacctttcag gacccaatgt caaaaacaag tcttcaccat gtgaagacat gggcattgct 301 gtggaggtgt ttctcactct gggtgtcatc agcctcttgg agaacatctt ggtcataggg 361 gccatagtga agaacaaaaa cctgcactcc cccatgtact tcttcgtgtg cagcctggca 421 gtggcggaca tgctggtgag catgtccagt gcctgggaga ccatcaccat ctacctactc 481 aacaacaagc acctagtgat agcagacgcc tttgtgcgcc acattgacaa tgtgtttgac 541 tccatgatct gcatttccgt ggtggcatcc atgtgcagct tactggccat tgcagtggat 601 aggtacgtca ccatcttcta cgccctggcc taccaccaca tcatgacggc gaggcgctca 661 ggggccatca tcgccggcat ctgggctttc tgcacgggct gcggcattgt cttcatcctg 721 tactcagaat ccacctacgt catcctgtgc ctcatctcca tgttcttcgc tatgctgttc 781 ctcctggtgt ctctgtacat acacatgttc ctcctggcgc ggactcacgt caagcggatc 841 gcggctctgc ccggggccag ctctgcgcgg cagaggacca gcatgcaggg cgcggtcacc 901 gtcaccatgc tgctgggcgt gtttaccgtg tgctgggccc cgttcttcct tcatctcact 961 ttaatgcttt cttgccctca gaacctctac tgctctcgct tcatgtctca cttcaatatg 1021 tacctcatac tcatcatgtg taattccgtg atggaccctc tcatatatgc ctaccgcagc 1081 caagagatgc ggaagacctt taaggagatt atttgctgcc gtggtttcag gatcgcctgc 1141 agctttccca gaagggatta agcacaaagt gctcctctct gtggctctgt tctcctttgt 1201 ttgctcacct atgacaaagc gacagcaagc gggtaggcta gggagtgcta gcatccattt 1261 tt // LOCUS HUMMCM 2798 bp mRNA PRI 07-JAN-1995 DEFINITION Human methylmalonyl-CoA mutase (MCM) mRNA, complete cds. ACCESSION M65131 M22990 M65022 NID g187451 KEYWORDS L-methylmalonyl-CoA mutase; adenosylcobalamin cofactor; methylmalonyl CoA mutase. SOURCE Homo sapiens liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2798) AUTHORS Jansen,R., Kalousek,F., Fenton,W.A., Rosenberg,L.E. and Ledley,F.D. TITLE Cloning of full-length methylmalonyl-CoA mutase from a cDNA library using the polymerase chain reaction JOURNAL Genomics 4 (2), 198-205 (1989) MEDLINE 89290848 REFERENCE 2 (bases 1 to 2798) AUTHORS Andrews,E., Crane,A.M., Jansen,R. and Ledley,F.D. TITLE Biochemical characteristics of recombinant human methylmalonyl CoA mutase expressed in S. cerevisiae JOURNAL Unpublished (1991) COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.D.Ledley, 17-MAR-1989. FEATURES Location/Qualifiers source 1..2798 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /map="6p21" mRNA 1..2770 /gene="MUT" /note="G00-120-204" gene 1..2770 /gene="MUT" sig_peptide 77..172 /gene="MUT" /note="mitochondrial leader; G00-120-204" CDS 77..2329 /gene="MUT" /EC_number="5.4.99.2" /codon_start=1 /db_xref="GDB:G00-120-204" /product="methylmalonyl-CoA mutase" /db_xref="PID:g187452" /translation="MLRAKNQLFLLSPHYLRQVKESSGSRLIQQRLLHQQQPLHPEWA ALAKKQLKGKNPEDLIWHTPEGISIKPLYSKRDTMDLPEELPGVKPFTRGPYPTMYTF RPWTIRQYAGFSTVEESNKFYKDNIKAGQQGLSVAFDLATHRGYDSDNPRVRGDVGMA GVAIDTVEDTKILFDGIPLEKMSVSMTMNGAVIPVLANFIVTGEEQGVPKEKLTGTIQ NDILKEFMVRNTYIFPPEPSMKIIADIFEYTAKHMPKFNSISISGYHMQEAGADAILE LAYTLADGLEYSRTGLQAGLTIDEFAPRLSFFWGIGMNFYMEIAKMRAGRRLWAHLIE KMFQPKNSKSLLLRAHCQTSGWSLTEQDPYNNIVRTAIEAMAAVFGGTQSLHTNSFDE ALGLPTVKSARIARNTQIIIQEESGIPKVADPWGGSYMMECLTNDVYDAALKLINEIE EMGGMAKAVAEGIPKLRIEECAARRQARIDSGSEVIVGVNKYQLEKEDAVEVLAIDNT SVRNRQIEKLKKIKSSRDQALAEHCLAALTECAASGDGNILALAVDASRARCTVGEIT DALKKVFGEHKANDRMVSGAYRQEFGESKEITSAIKRVHKFMEREGRRPRLLVAKMGQ DGHDRGAKVIATGFADLGFDVDIGPLFQTPREVAQQAVDADVHAVGVSTLAAGHKTLV PELIKELNSLGRPDILVMCGGVIPPQDYEFLFEVGVSNVFGPGTRIPKAAVQVLDDIE KCLEKKQQSV" mat_peptide 173..2326 /gene="MUT" /EC_number="5.4.99.2" /note="G00-120-204" /product="methylmalonyl-CoA mutase" BASE COUNT 886 a 526 c 591 g 795 t ORIGIN 374 bp upstream of AccI site. 1 gccctctccc acagcggagt ccaaaacagg cctaccagtc agttcttatt tctattgggt 61 gtttccatgc tccaccatgt taagagctaa gaatcagctt tttttacttt cacctcatta 121 cctgaggcag gtaaaagaat catcaggctc caggctcata cagcaacgac ttctacacca 181 gcaacagccc cttcacccag aatgggctgc cctggctaaa aagcagctga aaggcaaaaa 241 cccagaagac ctaatatggc acaccccgga agggatctct ataaaaccct tgtattccaa 301 gagagatact atggacttac ctgaagaact tccaggagtg aagccattca cacgtggacc 361 atatcctacc atgtatacct ttaggccctg gaccatccgc cagtatgctg gttttagtac 421 tgtggaagaa agcaataagt tctataagga caacattaag gctggtcagc agggattatc 481 agttgccttt gatctggcga cacatcgtgg ctatgattca gacaaccctc gagttcgtgg 541 tgatgttgga atggctggag ttgctattga cactgtggaa gataccaaaa ttctttttga 601 tggaattcct ttagaaaaaa tgtcagtttc catgactatg aatggagcag ttattccagt 661 tcttgcaaat tttatagtaa ctggagaaga acaaggtgta cctaaagaga aacttactgg 721 taccatccaa aatgatatac taaaggaatt tatggttcga aatacataca tttttcctcc 781 agaaccatcc atgaaaatta ttgctgacat atttgaatat acagcaaagc acatgccaaa 841 atttaattca atttcaatta gtggatacca tatgcaggaa gcaggggctg atgccattct 901 ggagctggcc tatactttag cagatggatt ggagtactct agaactggac tccaggctgg 961 cctgacaatt gatgaatttg caccaaggtt gtctttcttc tggggaattg gaatgaattt 1021 ctatatggaa atagcaaaga tgagagctgg tagaagactc tgggctcact taatagagaa 1081 aatgtttcag cctaaaaact caaaatctct tcttctaaga gcacactgtc agacatctgg 1141 atggtcactt actgagcagg atccctacaa taatattgtc cgtactgcaa tagaagcaat 1201 ggcagcagta tttggaggga ctcagtcttt gcacacaaat tcttttgatg aagctttggg 1261 tttgccaact gtgaaaagtg ctcgaattgc caggaacaca caaatcatca ttcaagaaga 1321 atctgggatt cccaaagtgg ctgatccttg gggaggttct tacatgatgg aatgtctcac 1381 aaatgatgtt tatgatgctg ctttaaagct cattaatgaa attgaagaaa tgggtggaat 1441 ggccaaagct gtagctgagg gaatacctaa acttcgaatt gaagaatgtg ctgcccgaag 1501 acaagctaga atagattctg gttctgaagt aattgttgga gtaaataagt accagttgga 1561 aaaagaagac gctgtagaag ttctggcaat tgataatact tcagtgcgaa acaggcagat 1621 tgaaaaactt aagaagatca aatccagcag ggatcaagct ttggctgaac attgtcttgc 1681 tgcactaacc gaatgtgctg ctagcggaga tggaaatatc ctggctcttg cagtggatgc 1741 atctcgggca agatgtacag tgggagaaat cacagatgcc ctgaaaaagg tatttggtga 1801 acataaagcg aatgatcgaa tggtgagtgg agcatatcgc caggaatttg gagaaagtaa 1861 agagataaca tctgctatca agagggttca taaattcatg gaacgtgaag gtcgcagacc 1921 tcgtcttctt gtagcaaaaa tgggacaaga tggccatgac agaggagcaa aagttattgc 1981 tacaggattt gctgatcttg gttttgatgt ggacataggc cctcttttcc agactcctcg 2041 tgaagtggcc cagcaggctg tggatgcgga tgtgcatgct gtgggcgtaa gcaccctcgc 2101 tgctggtcat aaaaccctag ttcctgaact catcaaagaa cttaactccc ttggacggcc 2161 agatattctt gtcatgtgtg gaggggtgat accacctcag gattatgaat ttctgtttga 2221 agttggtgtt tccaatgtat ttggtcctgg gactcgaatt ccaaaggctg ccgttcaggt 2281 gcttgatgat attgagaagt gtttggaaaa gaagcagcaa tctgtataat atcctctttt 2341 tgttttagct tttgtctaaa atattatttt agttatgatc aaagaagaga gtaaagctat 2401 gtcttcaatt taatttcaat acctgatttg tactttcctt gaaagcttta ctttaaaata 2461 ccttacttat aggcctggtg tcatgctata agtatgtaca tacagtttca cttcaaaaat 2521 aaaaaaaaat ccctaaaaac tctctatact ctctataaca atactttatc aagaactctg 2581 gacaatggta ttatttttaa aaatcatggt gatgtattta ttagaatgtt tcttataaat 2641 ctctttcatt tttatattaa gaattaaact gtacctaaaa aaactctgac tattcccatt 2701 tctcagttta gcattacatt gtcttgagca ccagaaaata aaatccatat attaattaaa 2761 acctatcttg aaaaaaaaaa aaaaaaaaaa aaaaaaaa // LOCUS HUMMCNDA 1670 bp mRNA PRI 11-JUL-1995 DEFINITION H.sapiens myeloid cell nuclear differentiation antigen mRNA, complete cds. ACCESSION M81750 NID g895928 KEYWORDS interferon response element; interferon stimulated gene; interferon-alpha; myeloid cell nuclear differentiation antigen. SOURCE Homo sapiens blood cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1670) AUTHORS Briggs,J.A., Burrus,G.R., Stickney,B.D. and Briggs,R.C. TITLE Cloning and expression of the human myeloid cell nuclear differentiation antigen: regulation by interferon alpha JOURNAL J. Cell. Biochem. 49 (1), 82-92 (1992) MEDLINE 92355667 REFERENCE 2 (bases 1 to 1670) AUTHORS Briggs,R. TITLE Direct Submission JOURNAL Submitted (20-FEB-1992) Pathology, Vanderbilt Univ., 23rd and Pierce, Nashville, TN 37232-5310, USA FEATURES Location/Qualifiers source 1..1670 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="granulocytes and monocytes and precursors" /tissue_type="blood" mRNA 1..1670 /partial /gene="MNDA" gene 1..1670 /gene="MNDA" enhancer 5..16 /gene="MNDA" /standard_name="interferon stimulated response element (consensus core)" CDS 201..1424 /gene="MNDA" /codon_start=1 /product="myeloid cell nuclear differentiation antigen" /db_xref="PID:g187454" /translation="MVNEYKKILLLKGFELMDDYHFTSIKSLLAYDLGLTTKMQEEYN RIKITDLMEKKFQGVACLDKLIELAKDMPSLKNLVNNLRKEKSKVAKKIKTQEKAPVK KINQEEVGLAAPAPTARNKLTSEARGRIPVAQKRKTPNKEKTEAKRNKVSQEQSKPPG PSGASTSAAVDHPPLPQTSSSTPSNTSFTPNQETQAQRQVDARRNVPQNDPVTVVVLK ATAPFKYESPENGKSTMFHATVASKTQYFHVKVFDINLKEKFVRKKVITISDYSECKG VMEIKEASSVSDFNQNFEVPNRIIEIANKTPKISQLYKQASGTMVYGLFMLQKKSVHK KNTIYEIQDNTGSMDVVGSGKWHNIKCEKGDKLRLFCLQLRTVDRKLKLVCGSHSFIK VIKAKKNKEGPMNVN" BASE COUNT 611 a 315 c 322 g 422 t ORIGIN 1 attgagagtg gctctaacaa gtgccatttt tccttgttag ctttcatttc tcagcccttt 61 acaagattaa aatagtctgc agtttaatct ctccaaagct ttacggacag tgattctgtc 121 ctaaacaaga cagtgactcc aggatttctg aagactattg tggaagaagc atccattaag 181 gccaagctat aacatcagaa atggtgaatg aatacaagaa aattcttttg ctgaaaggat 241 ttgagctcat ggatgattat cattttacat caattaagtc cttactggcc tatgatttag 301 gactaactac aaaaatgcaa gaggaataca acagaattaa gattacagat ttgatggaaa 361 aaaagttcca aggcgttgcc tgtctagaca aactaataga acttgccaaa gatatgccat 421 cacttaaaaa ccttgttaac aatcttcgaa aagagaagtc aaaagttgct aagaaaatta 481 aaacacaaga aaaagctcca gtgaaaaaaa taaaccagga agaagtgggt cttgcggcac 541 ctgcacccac cgcaagaaac aaactgacat cggaagcaag agggaggatt cctgtagctc 601 agaaaagaaa aactccaaac aaagaaaaga ctgaagccaa aaggaataag gtgtcccaag 661 agcagagtaa gcccccaggt ccctcaggag ccagcacatc tgcagctgtg gatcatcccc 721 cactacccca gacctcatca tcaactccat ccaacacttc gtttactccg aatcaggaaa 781 cccaggccca acggcaggtg gatgcaagaa gaaatgttcc ccaaaacgac ccagtgacag 841 tggtggtact gaaagcaaca gcgccattta aatacgagtc cccagaaaat gggaaaagca 901 caatgtttca tgctacagtg gccagtaaga ctcaatattt ccatgtgaaa gtcttcgaca 961 tcaacttgaa agagaaattt gtaaggaaga aggtcattac catatctgat tactctgaat 1021 gtaaaggagt aatggaaata aaggaagcat catctgtgtc tgactttaat caaaattttg 1081 aggtcccaaa cagaattatc gaaatagcaa ataaaactcc caagatcagt caactttaca 1141 agcaagcatc tggaacaatg gtgtatgggt tgtttatgtt acaaaagaaa agcgtacaca 1201 agaagaacac aatttatgaa atacaggata atacaggatc catggatgta gtggggagtg 1261 gaaaatggca caatatcaag tgtgagaaag gagataaact tcgactcttc tgccttcaac 1321 tgagaacagt tgaccgcaag ctgaaactgg tgtgtggaag tcacagcttc atcaaggtca 1381 tcaaggccaa gaaaaacaag gaaggaccaa tgaatgttaa ttgaaatatg aaagctgaaa 1441 tgcaacaaac aacttccgct taaaacaatt aagttgttaa taactgtgat tttgtaaatt 1501 tcagtaattc atttaaatga tgtttcagta gatatattct agcatattaa gagcttttat 1561 aactgagtta tagattagtt tgctttctgg aataaaattt tcttcttata ctcttccttt 1621 tttttagata ttacattttg cttttatgac attcacgagg caaaaaaccg // LOCUS HUMMCPGV 1221 bp mRNA PRI 07-JAN-1995 DEFINITION Homo sapiens macrophage capping protein mRNA, complete cds. ACCESSION M94345 NID g187455 KEYWORDS gelsolin; macrophage capping protein; villin. SOURCE Homo sapiens (tissue library: U937 lambda-GT10) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1221) AUTHORS Dabiri,G.A., Young,C.L., Rosenbloom,J. and Southwick,F.S. TITLE Molecular cloning of human macrophage capping protein cDNA. A unique member of the gelsolin/villin family expressed primarily in macrophages JOURNAL J. Biol. Chem. 267 (23), 16545-16552 (1992) MEDLINE 92355627 FEATURES Location/Qualifiers source 1..1221 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="U937" /cell_type="undifferentiated monocyte" /tissue_lib="U937 lambda-GT10" gene 50..1096 /gene="macrophage capping protein" CDS 50..1096 /gene="macrophage capping protein" /codon_start=1 /product="macrophage capping protein" /db_xref="PID:g187456" /translation="MYTAIPQSGSPFPGSVQDPGLHVWRVEKLKPVPVAQENQGVFFS GDSYLVLHNGPEEVSHLHLWIGQQSSRDEQGACAVLAVHLNTLLGERPVQHREVQGNE SDLFMSYFPRGLKYQEGGVESAFHKTSTGAPAAIKKLYQVKGKKNIRATERALNWDSF NTGDCFILDLGQNIFAWCGGKSNILERNKARDLALAIRDSERQGKAQVEIVTDGEEPA EMIQVLGPKPALKEGNPEEDLTADKANAQAAALYKVSDATGQMNLTKVADSSPFALEL LISDDCFVLDNGLCGKIYIWKGRKANEKERQAALQVAEGFISRMQYAPNTQVEILPQG RESPIFKQFFKDWK" polyA_site 1221 /gene="macrophage capping protein" BASE COUNT 273 a 347 c 374 g 227 t ORIGIN 1 cgcaggctgg aaggaagacg aacctacgaa gcagagatct gaagacagca tgtacacagc 61 cattccccag agtggctctc cattcccagg ctcagtgcag gatccaggcc tgcatgtgtg 121 gcgggtggag aagctgaagc cggtgcctgt ggcgcaagag aaccagggcg tcttcttctc 181 gggggactcc tacctagtgc tgcacaatgg cccagaagag gtttcccatc tgcacctgtg 241 gataggccag cagtcatccc gggatgagca gggggcctgt gccgtgctgg ctgtgcacct 301 caacacgctg ctgggagagc ggcctgtgca gcaccgcgag gtgcagggca atgagtctga 361 cctcttcatg agctacttcc cacggggcct caagtaccag gaaggtggtg tggagtcagc 421 atttcacaag acctccacag gagccccagc tgccatcaag aaactctacc aggtgaaggg 481 gaagaagaac atccgtgcca ccgagcgggc actgaactgg gacagcttca acactgggga 541 ctgcttcatc ctggacctgg gccagaacat cttcgcctgg tgtggtggaa agtccaacat 601 cctggaacgc aacaaggcga gggacctggc cctggccatc cgggacagtg agcgacaggg 661 caaggcccag gtggagattg tcactgatgg ggaggagcct gctgagatga tccaggtcct 721 gggccccaag cctgctctga aggagggcaa ccctgaggaa gacctcacag ctgacaaggc 781 aaatgcccag gccgcagctc tgtataaggt ctctgatgcc actggacaga tgaacctgac 841 caaggtggct gactccagcc cctttgccct tgaactgctg atatctgatg actgctttgt 901 gctggacaac gggctctgtg gcaagatcta tatctggaag gggcgaaaag cgaatgagaa 961 ggagcggcag gcagccctgc aggtggccga gggcttcatc tcgcgcatgc agtacgcccc 1021 gaacactcag gtggagattc tgcctcaggg ccgtgagagt cccatcttca agcaattttt 1081 caaggactgg aaatgagggt gggcgtcttc ctgccccatg ctcccctgcc ccccaccacc 1141 tgcctgcttg cttctctggc tgcctggtca gtgcagaggt gccccctgca gatgttcaat 1201 aaaggagaca agtgctttcc c // LOCUS HUMMCT 2578 bp mRNA PRI 07-JAN-1995 DEFINITION Homo sapiens monocarboxylate transporter 1 (SLC16A1) mRNA, complete cds. ACCESSION L31801 NID g561721 KEYWORDS monocarboxylate transporter 1. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2578) AUTHORS Garcia,C.K., Li,X., Luna,J. and Francke,U. TITLE cDNA cloning of the human monocarboxylate transporter 1 and chromosomal localization of the SLC16A1 locus to 1p13.2-p12 JOURNAL Genomics 23 (2), 500-503 (1994) MEDLINE 95137602 FEATURES Location/Qualifiers source 1..2578 /organism="Homo sapiens" /db_xref="taxon:9606" /map="1p13.2-p12" 5'UTR 1..12 /gene="SLC16A1" gene 1..2578 /gene="SLC16A1" CDS 13..1515 /gene="SLC16A1" /codon_start=1 /product="monocarboxylate transporter 1" /db_xref="PID:g561722" /translation="MPPAVGGPVGYTPPDGGWGWAVVIGAFISIGFSYAFPKSITVFF KEIEGIFHATTSEVSWISSIMLAVMYGGGPISSILVNKYGSRIVMIVGGCLSGCGLIA ASFCNTVQQLYVCIGVIGGLGLAFNLNPALTMIGKYFYKRRPLANGLAMAGSPVFLCT LAPLNQVFFGIFGWRGSFLILGGLLLNCCVAGALMRPIGPKPTKAGKDKSKASLEKAG KSGVKKDLHDANTDLIGRHPKQEKRSVFQTINQFLDLTLFTHRGFLLYLSGNVIMFFG LFAPLVFLSSYGKSQHYSSEKSAFLLSILAFVDMVARPSMGLVANTKPIRPRIQYFFA ASVVANGVCHMLAPLSTTYVGFCVYAGFFGFAFGWLSSVLFETLMDLVGPQRFSSAVG LVTIVECCPVLLGPPLLGRLNDMYGDYKYTYWACGVVLIISGIYLFIGMGINYRLLAK EQKANEQKKESKEEETSIDVAGKPNEVTKTAESPDQKDTEGGPKEEESPV" 3'UTR 1516..2578 /gene="SLC16A1" BASE COUNT 696 a 498 c 560 g 824 t ORIGIN 1 tctacactta aaatgccacc agcagttgga ggtccagttg gatacacccc cccagatgga 61 ggctggggct gggcagtggt aattggagct ttcatttcca tcggcttctc ttatgcattt 121 cccaaatcaa ttactgtctt cttcaaagag attgaaggta tattccatgc caccaccagc 181 gaagtgtcat ggatatcctc cataatgttg gctgtcatgt atggtggagg tcctatcagc 241 agtatcctgg tgaataaata tggaagtcgt atagtcatga ttgttggtgg ctgcttgtca 301 ggctgtggct tgattgcagc ttctttctgt aacaccgtac agcaactata cgtctgtatt 361 ggagtcattg gaggtcttgg gcttgccttc aacttgaatc cagctctgac catgattggc 421 aagtatttct acaagaggcg accattggcc aacggactgg ccatggcagg cagccctgtg 481 ttcctctgta ctctggcccc cctcaatcag gttttcttcg gtatctttgg atggagagga 541 agctttctaa ttcttggggg cttgctacta aactgctgtg ttgctggagc cctcatgcga 601 ccaatcgggc ccaagccaac caaggcaggg aaagataagt ctaaagcatc ccttgagaaa 661 gctggaaaat ctggtgtgaa aaaagatctg catgatgcaa atacagatct tattggaaga 721 caccctaaac aagagaaacg atcagtcttc caaacaatta atcagttcct ggacttaacc 781 ctattcaccc acagaggctt tttgctatac ctctctggaa atgtgatcat gttttttgga 841 ctctttgcac ctttggtgtt tcttagtagt tatgggaaga gtcagcatta ttctagtgag 901 aagtctgcct tccttctttc cattctggct tttgttgaca tggtagcccg accatctatg 961 ggacttgtag ccaacacaaa gccaataaga cctcgaattc agtatttctt tgcggcttcc 1021 gttgttgcaa atggagtgtg tcatatgcta gcacctttat ccactaccta tgttggattc 1081 tgtgtctatg cgggattctt tggatttgcc ttcgggtggc tcagctccgt attgtttgaa 1141 acattgatgg accttgttgg accccagagg ttctccagcg ctgtgggatt ggtgaccatt 1201 gtggaatgct gtcctgtcct cctggggcca ccacttttag gtcggctcaa tgacatgtat 1261 ggagactaca aatacacata ctgggcatgt ggcgtcgtcc taattatttc aggtatctat 1321 ctcttcattg gcatgggcat caattatcga cttttggcaa aagaacagaa agcaaacgag 1381 cagaaaaagg aaagtaaaga ggaagagacc agtatagatg ttgctgggaa gccaaatgaa 1441 gttaccaaaa cagcagaatc tccggaccag aaagacacag aaggagggcc caaggaggag 1501 gaaagtccag tctgaatcca tggggctgaa gggtaaattg agcagttcat gacccaggat 1561 atctgaaaat attctactgg cctgtaatct accagtggtg ctcaatgcaa atagtagaca 1621 tttgtgtgga aatcatacca gttgttcatt gatgggattt ttgtttgact ccttaccaat 1681 agcctgaatt tgaggaggga atgattggta gcaaaggatg ggggaaagaa gtaggttctg 1741 ttttgttttg ttttaatctt agcttttaat agtgtcataa agattataat atgtgcctta 1801 agttttagtc tttagaactc tagagagcct taacttctta aaccattttt gctgaattca 1861 tctatttcga gtgttgtgtt aaaaggaaaa ataacaacta acttgtttga ggcaaatcta 1921 aaatttaaaa ttaatcttgc ttcattgtta catgtaatat atttcagaca ttttcactgg 1981 aagatttatg aacagaaata ttggttgaaa gttagagatt ttacaaaatg ctgacaaaaa 2041 tattttccta gcatcagtag atttctggca tatgtttctg ctagctatat atttaggaaa 2101 ttcaaagcat aaaactttgg caacatcttg gctgttctag acacagtgta cttgtcaacc 2161 cctctcaggt accttttctt gggatgctta ttagaagcca agtaaagtgc ttaaggtttg 2221 ttttcattaa attagctatt tctgctcccc tgttcaaaga tgcattttga gtgtttatag 2281 atcactgccc tttttgaaat cacctggtat tatttttctt actggaaaag ttagtattaa 2341 aatctacaga actacatatt tgtgcctcct tggtaaatac aacacatcta attaaatgta 2401 gacagatatt tcaaacatca gctgaattca cttaagtttt tccaaaacct cagttaaact 2461 gtgaagctat tggaattttt ttttcctgga atttttcccc tttgattcac agtggtccca 2521 tttatatctg cttctagctt agtgctatgt gtgagatatg tgtgtgtttg gtgttttt // LOCUS HUMMDC 2908 bp mRNA PRI 11-MAY-1994 DEFINITION Human mRNA for MDC protein. ACCESSION D17390 NID g452188 KEYWORDS MDC protein. SOURCE Homo sapiens cerebellum, cDNA to mRNA, clone pBRcDNA1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Emi,M., Katagiri,T., Harada,Y., Saito,H., Inazawa,J., Ito,I., Kasumi,F. and Nakamura,Y. TITLE A novel metalloprotease/disintegrin-like gene at 17q21.3 is somatically rearranged in two primary breast cancers JOURNAL Nature Genet. 5 (2), 151-157 (1993) MEDLINE 94073190 REFERENCE 2 (bases 1 to 2908) AUTHORS Emi,M., Katagiri,T., Harada,Y., Saito,H., Inazawa,J., Ito,I., Kasumi,F. and Nakamura,Y. JOURNAL Unpublished (1993) REFERENCE 3 (bases 1 to 2908) AUTHORS Emi,M. TITLE Direct Submission JOURNAL Submitted (09-AUG-1993) to the DDBJ/EMBL/GenBank databases. Mitsuru Emi, Cancer Institute, Department of Biochemistry; 1-37-1 Kami-Ikebukuro, Toshima-ku, Tokyo 170, Japan (Tel:03-3918-0111(ex.4505), Fax:03-3918-0167) COMMENT Submitted (09-Aug-1993) to DDBJ by: Mitsuru Emi Cancer Institute Department of Biochemistry 1-37-1 Kami-ikebukuro, Toshima-ku Tokyo 170 Japan Phone: 03-3918-0111 x4505 Fax: 03-3918-0167. FEATURES Location/Qualifiers source 1..2908 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="human celebellar" /tissue_type="cerebellum" gene 28..1602 /gene="MDC" CDS 28..1602 /gene="MDC" /codon_start=1 /product="MDC protein" /db_xref="PID:d1004732" /db_xref="PID:g484255" /translation="MCWLSHQLLSSQYVERHFSREGTTQHSTGAGDHCYYQGKLRGNP HSFAALSTCQGLHGVFSDGNLTYIVEPQEVAGPWGAPQGPLPHLIYRTPLLPDPLGCR EPGCLFAVPAQSAPPNRPRLRRKRQVRRGHPTVHSETKYVELIVINDHQLFEQMRQSV VLTSNFAKSVVNLADVIYKEQLNTRIVLVAMETWADGDKIQVQDDLLETLARLMVYRR EGLPEPSNATHLFSGRTFQSTSSGAAYVGGICSLSHGGGVNEYGNMGAMAVTLAQTLG QNLGMMWNKHRSSAGDCKCPDIWLGCIMEDTGFYLPRKFSRCSIDEYNQFLQEGGGSC LFNKPLKLLDPPECGNGFVEAGEECDCGSVQECSRAGGNCCKKCTLTHDAMCSDGLCC RRCKYEPRGVSCREAVNECDIAETCTGDSSQCPPNLHKLDGYYCDHEQGRCYGGRCKT RDRQCQVLWGHAAADRFCYEKLNVEGTERGSCGRKGSGWVQCSKQPQQGRAVWLPPLC QHLWSSSARGPGGRHQ" BASE COUNT 592 a 893 c 887 g 536 t ORIGIN Chromosome 17. 1 gcgtttactg gcaaaccgca tttgtaaatg tgctggctga gccaccaact cctctcctcg 61 caatacgtgg agcgccactt cagccgggag gggacaaccc agcacagcac cggggctgga 121 gaccactgct actaccaggg gaagctccgg gggaacccgc actccttcgc cgccctctcc 181 acctgccagg ggctgcatgg ggtcttctct gatgggaact tgacttacat cgtggagccc 241 caagaggtgg ctggaccttg gggagcccct cagggacccc ttccccacct catttaccgg 301 acccctctcc tcccagatcc cctcggatgc agggaaccag gctgcctgtt tgctgtgcct 361 gcccagtcgg ctcctccaaa ccggccgagg ctgagaagga aaaggcaggt ccgccggggc 421 caccctacag tgcacagtga aaccaagtat gtggagctaa ttgtgatcaa cgaccaccag 481 ctgttcgagc agatgcgaca gtcggtggtc ctcaccagca actttgccaa gtccgtggtg 541 aacctggccg atgtgatata caaggagcag ctcaacactc gcatcgtcct ggttgccatg 601 gaaacatggg cagatgggga caagatccag gtgcaggatg acctcctgga gaccctggcc 661 cggctcatgg tctaccgacg ggagggtctg cctgagccca gtaatgccac ccacctcttc 721 tcgggcagga ccttccagag cacgagcagc ggggcagcct acgtgggggg catatgctcc 781 ctgtcccatg gcgggggtgt gaacgagtac ggcaacatgg gggcgatggc cgtgaccctt 841 gcccagacgc tgggacagaa cctgggcatg atgtggaaca aacaccggag ctcggcaggg 901 gactgcaagt gtccagacat ctggctgggc tgcatcatgg aggacactgg gttctacctg 961 ccccgcaagt tctctcgctg cagcatcgac gagtacaacc agtttctgca ggagggtggt 1021 ggcagctgcc tcttcaacaa gcccctcaag ctcctggacc ccccagagtg cgggaacggc 1081 ttcgtggagg caggggagga gtgcgactgc ggctcggtgc aggagtgcag ccgcgcaggt 1141 ggcaactgct gcaagaaatg caccctgact cacgacgcca tgtgcagcga cgggctctgc 1201 tgtcgccgct gcaagtacga accacggggt gtgtcctgcc gagaggccgt gaacgagtgc 1261 gacatcgcgg agacctgcac cggggactct agccagtgcc cgcctaacct gcacaagctg 1321 gacggttact actgtgacca tgagcagggc cgctgctacg gaggtcgctg caaaacccgg 1381 gaccggcagt gccaggttct ttggggccat gcggctgctg atcgcttctg ctacgagaag 1441 ctgaatgtgg aggggacgga gcgtgggagc tgtgggcgca agggatccgg ctgggtccag 1501 tgcagtaagc agccccaaca gggacgtgct gtgtggcttc ctcctctgtg tcaacatctc 1561 tggagctcct cggctagggg acctggtggg agacatcagt agtgtcacct tctaccacca 1621 gggcaaggag ctggactgca ggggaggcca cgtgcagctg gcggacggct ctgacctgag 1681 ctatgtggag gatggcacag cctgcgggcc taacatgttg tgcctggacc atcgctgcct 1741 gccagcttct gccttcaact tcagcacctg ccccggcagt ggggagcgcc ggatttgctc 1801 ccaccacggg gtctgcagca atgaagggaa gtgcatctgt cagccagact ggacaggcaa 1861 agactgcagt atccataacc ccctgcccac gtccccaccc acgggggaga cggagagata 1921 taaaggtccc agcggcacca acatcatcat tggctccatc gctggggctg tcctggttgc 1981 agccatcgtc ctgggcggca cgggctgggg atttaaaaac attcgccgag gaaggtccgg 2041 aggggcctaa gtgccaccct cctccctcca agcctggcac ccaccgtctc ggccctgaac 2101 cacgaggctg cccccatcca gccacggagg gaggcaccat gcaaatgtct tccaggtcca 2161 aacccttcaa ctcctggctc cgcaggggtt tgggtggggg ctgtggccct gcccttggca 2221 ccaccagggt ggaccaggcc tggagggcac ttcctccaca gtcccccacc cacctcctgc 2281 ggctcagcct tgcacaccca ctgccccgtg tgaatgtagc ttccacctca tggattgcca 2341 cagctcaact cgggggcacc tggagggatg cccccaggca gccaccagtg gacctagcct 2401 ggatggcccc tccttgcaac caggcagctg agaccagggt cttatctctc tgggacctag 2461 ggggacgggg ctgacatcta cattttttaa aactgaatct taatcgatga atgtaaactc 2521 gggggtgctg gggccagggc agatgtgggg atgttttgac atttacagga ggccccggag 2581 aaactgaggt atggccatgc cctagaccct ccccaaggat gaccacaccc gaagtcctgt 2641 cactgagcac agtcaggggc tgggcatccc agcttgcccc cgcttagccc cgctgagctt 2701 ggaggaagta tgagtgctga ttcaaaccaa agctgcctgt gccatgccca aggcctaggt 2761 tatgggtacg gcaaccacat gtcccagatc gtctccaatt cgaaaacaac cgtcctgctg 2821 tccctgtcag gacacatgga ttttggcagg gcgggggggg gttctagaaa atataggttc 2881 ctataataaa atggcacctt cccccttt // LOCUS HUMMDMCSF 3112 bp mRNA PRI 07-JAN-1995 DEFINITION Human methylenetetrahydrofolate dehydrogenase- methenyltetrahydrofolate cyclohydrolase-formyltetrahydrofolate synthetase mRNA, complete cds. ACCESSION J04031 NID g187464 KEYWORDS trifunctional enzyme. SOURCE Human colonic adenocarcinoma (LS180) cell line, cDNA to mRNA, clone HUFOLDCS. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3112) AUTHORS Hum,D.W., Bell,A.W., Rozen,R. and MacKenzie,R.E. TITLE Primary structure of a human trifunctional enzyme. Isolation of a cDNA encoding methylenetetrahydrofolate dehydrogenase-methenyltetrahydrofolate cyclohydrolase-formyltetrahydrofolate synthetase JOURNAL J. Biol. Chem. 263 (31), 15946-15950 (1988) MEDLINE 89034046 COMMENT Draft entry and computer-readable sequence [1] kindly submitted by R.E.MacKenzie 03-AUG-1988. FEATURES Location/Qualifiers source 1..3112 /organism="Homo sapiens" /db_xref="taxon:9606" /map="14q24" gene 54..2861 /gene="MTHFD" CDS 54..2861 /gene="MTHFD" /note="MDMCSF (EC 1.5.1.5; EC 3.5.4.9; EC 6.3.4.3)" /codon_start=1 /db_xref="GDB:G00-120-704" /db_xref="PID:g307178" /translation="MAPAEILNGKEISAQIRARLKNQVTQLKEQVPGFTPRLAILQVG NRDDSNLYINVKLKAAEEIGIKATHIKLPRTTTESEVMKYITSLNEDSTVHGFLVQLP LDSENSINTEEVINAIAPEKDVDGLTSINAGRLARGDLNDCFIPCTPKGCLELIKETG VPIAGRHAVVVGRSKIVGAPMHDLLLWNNATVTTCHSKTAHLDEEVNKGDILVVATGQ PEMVKGEWIKPGAIVIDCGINYVPDDKKPNGRKVVGDVAYDEAKERASFITPVPGGVG PMTVAMLMQSTVESAKRFLEKFKPGKWMIQYNNLNLKTPVPSDIDISRSCKPKPIGKL AREIGLLSEEVELYGETKAKVLLSALERLKHRPDGKYVVVTGITPTPLGEGKSTTTIG LVQALGAHLYQNVFACVRQPSQGPTFGIKGGAAGGGYSQVIPMEEFNLHLTGDIHAIT AANNLVAAAIDARIFHELTQTDKALFNRLVPSVNGVRRFSDIQIRRLKRLGIEKTDPT TLTDEEINRFARLDIDPETITWQRVLDTNDRFLRKITIGQAPTEKGHTRTAQFDISVA SEIMAVLALTTSLEDMRERLGKMVVASSKKGEPVSAEDLGVSGALTVLMKDAIKPNLM QTLEGTPVFVHAGPFANIAHGNSSIIADRIALKLVGPEGFVVTEAGFGADIGMEKFFN IKCRYSGLCPHVVVLVATVRALKMHGGGPTVTAGLPLPKAYIQENLELVEKGFSNLKK QIENARMFGIPVVVAVNAFKTDTESELDLISRLSREHGAFDAVKCTHWAEGGKGALAL AQAVQRAAQAPSSFQLLYDLKLPVEDKIRIIAQKIYGADDIELLPEAQHKAEVYTKQG FGNLPICMAKTHLSLSHNPEQKGVPTGFILPIRDIRASVGAGFLYPLVGTMSTMPGLP TRPCFYDIDLDPETEQVNGLF" BASE COUNT 885 a 711 c 799 g 717 t ORIGIN 57 bp upstream of AcyI site. 1 gtggaacctc gatattggtg gtgtccatcg tgggcagcgg actaataaag gccatggcgc 61 cagcagaaat cctgaacggg aaggagatct ccgcgcaaat aagggcgaga ctgaaaaatc 121 aagtcactca gttgaaggag caagtacctg gtttcacacc acgcctggca atattacagg 181 ttggcaacag agatgattcc aatctttata taaatgtgaa gctgaaggct gctgaagaga 241 ttgggatcaa agccactcac attaagttac caagaacaac cacagaatct gaggtgatga 301 agtacattac atctttgaat gaagactcta ctgtacatgg gttcttagtg cagctacctt 361 tagattcaga gaattccatt aacactgaag aagtgatcaa tgctattgca cccgagaagg 421 atgtggatgg attgactagc atcaatgctg ggagacttgc tagaggtgac ctcaatgact 481 gtttcattcc ttgtacgcct aagggatgct tggaactcat caaagagaca ggggtgccga 541 ttgccggaag gcatgctgtg gtggttgggc gcagtaaaat agttggggcc ccgatgcatg 601 acttgcttct gtggaacaat gccacagtga ccacctgcca ctccaagact gcccatctgg 661 atgaggaggt aaataaaggt gacatcctgg tggttgcaac tggtcagcct gaaatggtta 721 aaggggagtg gatcaaacct ggggcaatag tcatcgactg tggaatcaat tatgtcccag 781 atgataaaaa accaaatggg agaaaagttg tgggtgatgt ggcatacgac gaggccaaag 841 agagggcgag cttcatcact cctgttcctg gcggcgtagg gcccatgaca gttgcaatgc 901 tcatgcagag cacagtagag agtgccaagc gtttcctgga gaaatttaag ccaggaaagt 961 ggatgattca gtataacaac cttaacctca agacacctgt tccaagtgac attgatatat 1021 cacgatcttg taaaccgaag cccattggta agctggctcg agaaattggt ctgctgtctg 1081 aagaggtaga attatatggt gaaacaaagg ccaaagttct gctgtcagca ctagaacgcc 1141 tgaagcaccg gcctgatggg aaatacgtgg tggtgactgg aataactcca acacccctgg 1201 gagaagggaa aagcacaact acaatcgggc tagtgcaagc ccttggtgcc catctctacc 1261 agaatgtctt tgcgtgtgtg cgacagcctt ctcagggccc cacctttgga ataaaaggtg 1321 gcgctgcagg aggcggctac tcccaggtca ttcctatgga agagtttaat ctccacctca 1381 caggtgacat ccatgccatc actgcagcta ataacctcgt tgctgcggcc attgatgctc 1441 ggatatttca tgaactgacc cagacagaca aggctctctt taatcgtttg gtgccatcag 1501 taaatggagt gagaaggttc tctgacatcc aaatccgaag gttaaagaga ctaggcattg 1561 aaaagactga ccctaccaca ctgacagatg aagagataaa cagatttgca agattggaca 1621 ttgatccaga aaccataact tggcaaagag tgttggatac caatgataga ttcctgagga 1681 agatcacgat tggacaggct ccaacggaga agggtcacac acggacggcc cagtttgata 1741 tctctgtggc cagtgaaatt atggctgtcc tggctctcac cacttctcta gaagacatga 1801 gagagagact gggcaaaatg gtggtggcat ccagtaagaa aggagagccc gtcagtgccg 1861 aagatctggg ggtgagtggt gcactgacag tgcttatgaa ggacgcaatc aagcccaatc 1921 tcatgcagac actggagggc actccagtgt ttgtccatgc tggcccgttt gccaacatcg 1981 cacatggcaa ttcctccatc attgcagacc ggatcgcact caagcttgtt ggcccagaag 2041 ggtttgtagt gacggaagca ggatttggag cagacattgg aatggaaaag ttttttaaca 2101 tcaaatgccg gtattccggc ctctgccccc acgtggtggt gcttgttgcc actgtcaggg 2161 ctctcaagat gcacgggggc ggccccacgg tcactgctgg actgcctctt cccaaggctt 2221 acatacagga gaacctggag ctggttgaaa aaggcttcag taacttgaag aaacaaattg 2281 aaaatgccag aatgtttgga attccagtag tagtggccgt gaatgcattc aagacggata 2341 cagagtctga gctggacctc atcagccgcc tttccagaga acatggggct tttgatgccg 2401 tgaagtgcac tcactgggca gaagggggca agggtgcctt agccctggct caggccgtcc 2461 agagagcagc acaagcaccc agcagcttcc agctccttta tgacctcaag ctcccagttg 2521 aggataaaat caggatcatt gcacagaaga tctatggagc agatgacatt gaattacttc 2581 ccgaagctca acacaaagct gaagtctaca cgaagcaggg ctttgggaat ctccccatct 2641 gcatggctaa aacacacttg tctttgtctc acaacccaga gcaaaaaggt gtccctacag 2701 gcttcattct gcccattcgc gacatccgcg ccagcgttgg ggctggtttt ctgtacccct 2761 tagtaggaac gatgagcaca atgcctggac tccccacccg gccctgtttt tatgatattg 2821 atttggaccc tgaaacagaa caggtgaatg gattattcta aacagatcac catccatctt 2881 caagaagcta ctttgaaagt ctggccagtg tctattcagg cccactggga gttaggaagt 2941 ataagtaagc caagagaagt cagcccctgc ccagaagatc tgaaactaat agtaggagtt 3001 tccccagaag tcattttcag ccttaattct catcatgtat aaattaacat aaatcatgca 3061 tgtctgttta ctttagtgac gttccacaga ataaaaggaa acaagtttgc ca // LOCUS HUMMDR1 4646 bp mRNA PRI 07-JAN-1995 DEFINITION Human P-glycoprotein (MDR1) mRNA, complete cds. ACCESSION M14758 NID g187468 KEYWORDS P-glycoprotein; drug resistance protein; transport protein. SOURCE Human drug-resistant cell line KB-C2.5 cDNA to mRNA, clones lambda-HDR[10, 5, 104]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4646) AUTHORS Chen,C.J., Chin,J.E., Ueda,K., Clark,D.P., Pastan,I., Gottesman,M.M. and Roninson,I.B. TITLE Internal duplication and homology with bacterial transport proteins in the mdr1 (P-glycoprotein) gene from multidrug-resistant human cells JOURNAL Cell 47 (3), 381-389 (1986) MEDLINE 87028230 REFERENCE 2 (sites) AUTHORS Ueda,K., Clark,D.P., Chen,C.J., Roninson,I.B., Gottesman,M.M. and Pastan,I. TITLE The human multidrug resistance (mdr1) gene. cDNA cloning and transcription initiation JOURNAL J. Biol. Chem. 262 (2), 505-508 (1987) MEDLINE 87109132 REFERENCE 3 (bases 971 to 985; 3095 to 3109) AUTHORS Kioka,N., Tsubota,J., Kakehi,Y., Komano,T., Gottesman,M.M., Pastan,I. and Ueda,K. TITLE P-glycoprotein gene (MDR1) cDNA from human adrenal: normal P-glycoprotein carries Gly185 with an altered pattern of multidrug resistance JOURNAL Biochem. Biophys. Res. Commun. 162 (1), 224-231 (1989) MEDLINE 89322246 COMMENT [2] sites. Draft entry and computer-readable sequence [1] kindly submitted by I.B.Roninson, 13-AUG-1987. The sequence shown is of a cDNA clone initiating at a minor upstream transcription initiation site and containing the major site of transcription initiation. FEATURES Location/Qualifiers source 1..4646 /organism="Homo sapiens" /db_xref="taxon:9606" /map="7q21" mRNA <1..4646 /note="MDR1 mRNA (alt.)" mRNA 1..4646 /note="MDR1 mRNA (alt.)" mRNA 285..4646 /note="MDR1 mRNA (alt.)" mRNA 289..4646 /note="MDR1 mRNA (alt.)" mutation 382 /note="g in [1]; a in [2]" gene 425..4267 /gene="PGY1" CDS 425..4267 /gene="PGY1" /note="P-glycoprotein" /codon_start=1 /db_xref="GDB:G00-120-712" /db_xref="PID:g307180" /translation="MDLEGDRNGGAKKKNFFKLNNKSEKDKKEKKPTVSVFSMFRYSN WLDKLYMVVGTLAAIIHGAGLPLMMLVFGEMTDIFANAGNLEDLMSNITNRSDINDTG FFMNLEEDMTRYAYYYSGIGAGVLVAAYIQVSFWCLAAGRQIHKIRKQFFHAIMRQEI GWFDVHDVGELNTRLTDDVSKINEVIGDKIGMFFQSMATFFTGFIVGFTRGWKLTLVI LAISPVLGLSAAVWAKILSSFTDKELLAYAKAGAVAEEVLAAIRTVIAFGGQKKELER YNKNLEEAKRIGIKKAITANISIGAAFLLIYASYALAFWYGTTLVLSGEYSIGQVLTV FFSVLIGAFSVGQASPSIEAFANARGAAYEIFKIIDNKPSIDSYSKSGHKPDNIKGNL EFRNVHFSYPSRKEVKILKGLNLKVQSGQTVALVGNSGCGKSTTVQLMQRLYDPTEGM VSVDGQDIRTINVRFLREIIGVVSQEPVLFATTIAENIRYGRENVTMDEIEKAVKEAN AYDFIMKLPHKFDTLVGERGAQLSGGQKQRIAIARALVRNPKILLLDEATSALDTESE AVVQVALDKARKGRTTIVIAHRLSTVRNADVIAGFDDGVIVEKGNHDELMKEKGIYFK LVTMQTAGNEVELENAADESKSEIDALEMSSNDSRSSLIRKRSTRRSVRGSQAQDRKL STKEALDESIPPVSFWRIMKLNLTEWPYFVVGVFCAIINGGLQPAFAIIFSKIIGVFT RIDDPETKRQNSNLFSLLFLALGIISFITFFLQGFTFGKAGEILTKRLRYMVFRSMLR QDVSWFDDPKNTTGALTTRLANDAAQVKGAIGSRLAVITQNIANLGTGIIISFIYGWQ LTLLLLAIVPIIAIAGVVEMKMLSGQALKDKKELEGAGKIATEAIENFRTVVSLTQEQ KFEHMYAQSLQVPYRNSLRKAHIFGITFSFTQAMMYFSYAGCFRFGAYLVAHKLMSFE DVLLVFSAVVFGAMAVGQVSSFAPDYAKAKISAAHIIMIIEKTPLIDSYSTEGLMPNT LEGNVTFGEVVFNYPTRPDIPVLQGLSLEVKKGQTLALVGSSGCGKSTVVQLLERFYD PLAGKVLLDGKEIKRLNVQWLRAHLGIVSQEPILFDCSIAENIAYGDNSRVVSQEEIV RAAKEANIHAFIESLPNKYSTKVGDKGTQLSGGQKQRIAIARALVRQPHILLLDEATS ALDTESEKVVQEALDKAREGRTCIVIAHRLSTIQNADLIVVFQNGRVKEHGTHQQLLA QKGIYFSMVSVQAGTKRQ" mutation 964 /gene="PGY1" /note="t in [1]; c in [2]" allele 978..979 /gene="PGY1" /note="tt in [1]; ga in [2] Val->Gly" mutation 1660 /gene="PGY1" /note="c in [1]; t in [2]" mutation 2065 /gene="PGY1" /note="c in [1]; t in [2]" allele 3101 /gene="PGY1" /note="g in [1]; t in [2] Ala-> Ser" mutation 3859 /gene="PGY1" /note="c in [1]; t in [2]" mutation 4460 /note="a in [1]; g in [2]" BASE COUNT 1371 a 892 c 1129 g 1254 t ORIGIN 154 bp upstream of AvaI site; chromosome 7q21.1. 1 cctactctat tcagatattc tccagattcc taaagattag agatcatttc tcattctcct 61 aggagtactc acttcaggaa gcaaccagat aaaagagagg tgcaacggaa gccagaacat 121 tcctcctgga aattcaacct gtttcgcagt ttctcgagga atcagcattc agtcaatccg 181 ggccgggagc agtcatctgt ggtgaggctg attggctggg caggaacagc gccggggcgt 241 gggctgagca cagcgcttcg ctctctttgc cacaggaagc ctgagctcat tcgagtagcg 301 gctcttccaa gctcaaagaa gcagaggccg ctgttcgttt cctttaggtc tttccactaa 361 agtcggagta tcttcttcca agatttcacg tcttggtggc cgttccaagg agcgcgaggt 421 cgggatggat cttgaagggg accgcaatgg aggagcaaag aagaagaact tttttaaact 481 gaacaataaa agtgaaaaag ataagaagga aaagaaacca actgtcagtg tattttcaat 541 gtttcgctat tcaaattggc ttgacaagtt gtatatggtg gtgggaactt tggctgccat 601 catccatggg gctggacttc ctctcatgat gctggtgttt ggagaaatga cagatatctt 661 tgcaaatgca ggaaatttag aagatctgat gtcaaacatc actaatagaa gtgatatcaa 721 tgatacaggg ttcttcatga atctggagga agacatgacc aggtatgcct attattacag 781 tggaattggt gctggggtgc tggttgctgc ttacattcag gtttcatttt ggtgcctggc 841 agctggaaga caaatacaca aaattagaaa acagtttttt catgctataa tgcgacagga 901 gataggctgg tttgatgtgc acgatgttgg ggagcttaac acccgactta cagatgatgt 961 ctctaagatt aatgaagtta ttggtgacaa aattggaatg ttctttcagt caatggcaac 1021 atttttcact gggtttatag taggatttac acgtggttgg aagctaaccc ttgtgatttt 1081 ggccatcagt cctgttcttg gactgtcagc tgctgtctgg gcaaagatac tatcttcatt 1141 tactgataaa gaactcttag cgtatgcaaa agctggagca gtagctgaag aggtcttggc 1201 agcaattaga actgtgattg catttggagg acaaaagaaa gaacttgaaa ggtacaacaa 1261 aaatttagaa gaagctaaaa gaattgggat aaagaaagct attacagcca atatttctat 1321 aggtgctgct ttcctgctga tctatgcatc ttatgctctg gccttctggt atgggaccac 1381 cttggtcctc tcaggggaat attctattgg acaagtactc actgtattct tttctgtatt 1441 aattggggct tttagtgttg gacaggcatc tccaagcatt gaagcatttg caaatgcaag 1501 aggagcagct tatgaaatct tcaagataat tgataataag ccaagtattg acagctattc 1561 gaagagtggg cacaaaccag ataatattaa gggaaatttg gaattcagaa atgttcactt 1621 cagttaccca tctcgaaaag aagttaagat cttgaagggc ctgaacctga aggtgcagag 1681 tgggcagacg gtggccctgg ttggaaacag tggctgtggg aagagcacaa cagtccagct 1741 gatgcagagg ctctatgacc ccacagaggg gatggtcagt gttgatggac aggatattag 1801 gaccataaat gtaaggtttc tacgggaaat cattggtgtg gtgagtcagg aacctgtatt 1861 gtttgccacc acgatagctg aaaacattcg ctatggccgt gaaaatgtca ccatggatga 1921 gattgagaaa gctgtcaagg aagccaatgc ctatgacttt atcatgaaac tgcctcataa 1981 atttgacacc ctggttggag agagaggggc ccagttgagt ggtgggcaga agcagaggat 2041 cgccattgca cgtgccctgg ttcgcaaccc caagatcctc ctgctggatg aggccacgtc 2101 agccttggac acagaaagcg aagcagtggt tcaggtggct ctggataagg ccagaaaagg 2161 tcggaccacc attgtgatag ctcatcgttt gtctacagtt cgtaatgctg acgtcatcgc 2221 tggtttcgat gatggagtca ttgtggagaa aggaaatcat gatgaactca tgaaagagaa 2281 aggcatttac ttcaaacttg tcacaatgca gacagcagga aatgaagttg aattagaaaa 2341 tgcagctgat gaatccaaaa gtgaaattga tgccttggaa atgtcttcaa atgattcaag 2401 atccagtcta ataagaaaaa gatcaactcg taggagtgtc cgtggatcac aagcccaaga 2461 cagaaagctt agtaccaaag aggctctgga tgaaagtata cctccagttt ccttttggag 2521 gattatgaag ctaaatttaa ctgaatggcc ttattttgtt gttggtgtat tttgtgccat 2581 tataaatgga ggcctgcaac cagcatttgc aataatattt tcaaagatta taggggtttt 2641 tacaagaatt gatgatcctg aaacaaaacg acagaatagt aacttgtttt cactattgtt 2701 tctagccctt ggaattattt cttttattac atttttcctt cagggtttca catttggcaa 2761 agctggagag atcctcacca agcggctccg atacatggtt ttccgatcca tgctcagaca 2821 ggatgtgagt tggtttgatg accctaaaaa caccactgga gcattgacta ccaggctcgc 2881 caatgatgct gctcaagtta aaggggctat aggttccagg cttgctgtaa ttacccagaa 2941 tatagcaaat cttgggacag gaataattat atccttcatc tatggttggc aactaacact 3001 gttactctta gcaattgtac ccatcattgc aatagcagga gttgttgaaa tgaaaatgtt 3061 gtctggacaa gcactgaaag ataagaaaga actagaaggt gctgggaaga tcgctactga 3121 agcaatagaa aacttccgaa ccgttgtttc tttgactcag gagcagaagt ttgaacatat 3181 gtatgctcag agtttgcagg taccatacag aaactctttg aggaaagcac acatctttgg 3241 aattacattt tccttcaccc aggcaatgat gtatttttcc tatgctggat gtttccggtt 3301 tggagcctac ttggtggcac ataaactcat gagctttgag gatgttctgt tagtattttc 3361 agctgttgtc tttggtgcca tggccgtggg gcaagtcagt tcatttgctc ctgactatgc 3421 caaagccaaa atatcagcag cccacatcat catgatcatt gaaaaaaccc ctttgattga 3481 cagctacagc acggaaggcc taatgccgaa cacattggaa ggaaatgtca catttggtga 3541 agttgtattc aactatccca cccgaccgga catcccagtg cttcagggac tgagcctgga 3601 ggtgaagaag ggccagacgc tggctctggt gggcagcagt ggctgtggga agagcacagt 3661 ggtccagctc ctggagcggt tctacgaccc cttggcaggg aaagtgctgc ttgatggcaa 3721 agaaataaag cgactgaatg ttcagtggct ccgagcacac ctgggcatcg tgtcccagga 3781 gcccatcctg tttgactgca gcattgctga gaacattgcc tatggagaca acagccgggt 3841 ggtgtcacag gaagagatcg tgagggcagc aaaggaggcc aacatacatg ccttcatcga 3901 gtcactgcct aataaatata gcactaaagt aggagacaaa ggaactcagc tctctggtgg 3961 ccagaaacaa cgcattgcca tagctcgtgc ccttgttaga cagcctcata ttttgctttt 4021 ggatgaagcc acgtcagctc tggatacaga aagtgaaaag gttgtccaag aagccctgga 4081 caaagccaga gaaggccgca cctgcattgt gattgctcac cgcctgtcca ccatccagaa 4141 tgcagactta atagtggtgt ttcagaatgg cagagtcaag gagcatggca cgcatcagca 4201 gctgctggca cagaaaggca tctatttttc aatggtcagt gtccaggctg gaacaaagcg 4261 ccagtgaact ctgactgtat gagatgttaa atacttttta atatttgttt agatatgaca 4321 tttattcaaa gttaaaagca aacacttaca gaattatgaa gaggtatctg tttaacattt 4381 cctcagtcaa gttcagagtc ttcagagact tcgtaattaa aggaacagag tgagagacat 4441 catcaagtgg agagaaatca tagtttaaac tgcattataa attttataac agaattaaag 4501 tagattttaa aagataaaat gtgtaatttt gtttatattt tcccatttgg actgtaactg 4561 actgccttgc taaaagatta tagaagtagc aaaaagtatt gaaatgtttg cataaagtgt 4621 ctataataaa actaaacttt catgtg // LOCUS HUMMDR3 3924 bp mRNA PRI 11-JUN-1993 DEFINITION Human membrane glycoprotein P (mdr3) mRNA, complete cds. ACCESSION M23234 NID g187501 KEYWORDS P-glycoprotein; membrane glycoprotein. SOURCE Human liver, cDNA to mRNA, clone 3.27. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3924) AUTHORS van der Bliek,A.M., Kooiman,P.M., Schneider,C. and Borst,P.A. TITLE Sequence of mdr3 cDNA encoding a human P-glycoprotein JOURNAL Gene 71, 401-411 (1988) MEDLINE 89138016 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by P. Borst, 21-MAR-1989. FEATURES Location/Qualifiers source 1..3924 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..3924 /note="mdr3 mRNA" CDS 33..3872 /note="P-glycoprotein" /codon_start=1 /db_xref="PID:g307181" /translation="MDLEAAKNGTAWRPTSAEGDFELGISSKQKRKKTKTVKMIGVLT LFRYSDWQDKLFMSLGTIMAIAHGSGLPLMMIVFGEMTDKFVDTAGNFSFPVNFSLSL LNPGKILEEEMTRYAYYYSGLGAGVLVAAYIQVSFWTLAAGRQIRKIRQKFFHAILRQ EIGWFDINDTTELNTRLTDDISKISEGIGDKVGMFFQAVATFFAGFIVGFIRGWKLTL VIMAISPILGLSAAVWAKILSAFSDKELAAYAKAGAVAEEALGAIRTVIAFGGQNKEL ERYQKHLENAKEIGIKKAISANISMGIAFLLIYASYALAFWYGSTLVISKEYTIGNAM TVFFSILIGAFSVGQAAPCIDAFANARGAAYVIFDIIDNNPKIDSFSERGHKPDSIKG NLEFNDVHFSYPSRANVKILKGLNLKVQSGQTVALVGSSGCGKSTTVQLIQRLYDPDE GTINIDGQDIRNFNVNYLREIIGVVSQEPVLFSTTIAENICYGRGNVTMDEIKKAVKE ANAYEFIMKLPQKFDTLVGERGAQLSGGQKQRIAIARALVRNPKILLLDEATSALDTE SEAEVQAALDKAREGRTTIVIAHRLSTVRNADVIAGFEDGVIVEQGSHSELMKKEGVY FKLVNMQTSGSQIQSEEFELNDEKAATRMAPNGWKSRLFRHSTQKNLKNSQMCQKSLD VETDGLEANVPPVSFLKVLKLNKTEWPYFVVGTVCAIANGGLQPAFSVIFSEIIAIFG PGDDAVKQQKCNIFSLIFLFLGIISFFTFFLQGFTFGKAGEILTRRLRSMAFKAMLRQ DMSWFDDHKNSTGALSTRLATDAAQVQGATGTRLALIAQNIANLGTGIIISFIYGWQL TLLLLAVVPIIAVSGIVEMKLLAGNAKRDKKELEAAGKIATEAIENIRTVVSLTQERK FESMYVEKLYGPYRNSVQKAHIYGITFSISQAFMYFSYAGCFRFGAYLIVNGHMRFRD VILVFSAIVFGAVALGHASSFAPDYAKAKLSAAHLFMLFERQPLIDSYSEEGLKPDKF EGNITFNEVVFNYPTRANVPVLQGLSLEVKKGQTLALVGSSGCGKSTVVQLLERFYDP LAGTVLLDGQEAKKLNVQWLRAQLGIVSQEPILFDCSIAENIAYGDNSRVVSQDEIVS AAKAANIHPFIETLPHKYETRVGDKGTQLSGGQKQRIAIARALIRQPQILLLDEATSA LDTESEKVVQEALDKAREGRTCIVIAHRLSTIQNADLIVVFQNGRVKEHGTHQQLLAQ KGIYFSMVSVQAGTQNL" BASE COUNT 1145 a 790 c 977 g 1012 t ORIGIN 1 cctgccagac acgcgcgagg ttcgaggctg agatggatct tgaggcggca aagaacggaa 61 cagcctggcg ccccacgagc gcggagggcg actttgaact gggcatcagc agcaaacaaa 121 aaaggaaaaa aacgaagaca gtgaaaatga ttggagtatt aacattgttt cgatactccg 181 attggcagga taaattgttt atgtcgctgg gtaccatcat ggccatagct cacggatcag 241 gtctccccct catgatgata gtatttggag agatgactga caaatttgtt gatactgcag 301 gaaacttctc ctttccagtg aacttttcct tgtcgctgct aaatccaggc aaaattctgg 361 aagaagaaat gactagatat gcatattact actcaggatt gggtgctgga gttcttgttg 421 ctgcctatat acaagtttca ttttggactt tggcagctgg tcgacagatc aggaaaatta 481 ggcagaagtt ttttcatgct attctacgac aggaaatagg atggtttgac atcaatgaca 541 ccactgaact caatacgcgg ctaacagatg acatctccaa aatcagtgaa ggaattggtg 601 acaaggttgg aatgttcttt caagcagtag ccacgttttt tgcaggattc atagtgggat 661 tcatcagagg atggaagctc acccttgtga taatggccat cagccctatt ctaggactct 721 ctgcagccgt ttgggcaaag atactctcgg catttagtga caaagaacta gctgcttatg 781 caaaagcagg cgccgtggca gaagaggctc tgggggccat caggactgtg atagctttcg 841 ggggccagaa caaagagctg gaaaggtatc agaaacattt agaaaatgcc aaagagattg 901 gaattaaaaa agctatttca gcaaacattt ccatgggtat tgccttcctg ttaatatatg 961 catcatatgc actggccttc tggtatggat ccactctagt catatcaaaa gaatatacta 1021 ttggaaatgc aatgacagtt tttttttcaa tcctaattgg agctttcagt gttggccagg 1081 ctgccccatg tattgatgct tttgccaatg caagaggagc agcatatgtg atctttgata 1141 ttattgataa taatcctaaa attgacagtt tttcagagag aggacacaaa ccagacagca 1201 tcaaagggaa tttggagttc aatgatgttc acttttctta cccttctcga gctaacgtca 1261 agatcttgaa gggcctcaac ctgaaggtgc agagtgggca gacggtggcc ctggttggaa 1321 gtagtggctg tgggaagagc acaacggtcc agctgataca gaggctctat gaccctgatg 1381 agggcacaat taacattgat gggcaggata ttaggaactt taatgtaaac tatctgaggg 1441 aaatcattgg tgtggtgagt caggagccgg tgctgttttc caccacaatt gctgaaaata 1501 tttgttatgg ccgtggaaat gtaaccatgg atgagataaa gaaagctgtc aaagaggcca 1561 acgcctatga gtttatcatg aaattaccac agaaatttga caccctggtt ggagagagag 1621 gggcccagct gagtggtggg cagaagcaga ggatcgccat tgcacgtgcc ctggttcgca 1681 accccaagat ccttctgctg gatgaggcca cgtcagcatt ggacacagaa agtgaagctg 1741 aggtacaggc agctctggat aaggccagag aaggccggac caccattgtg atagcacacc 1801 gactgtctac ggtccgaaat gcagatgtca tcgctgggtt tgaggatgga gtaattgtgg 1861 agcaaggaag ccacagcgaa ctgatgaaga aggaaggggt gtacttcaaa cttgtcaaca 1921 tgcagacatc aggaagccag atccagtcag aagaatttga actaaatgat gaaaaggctg 1981 ccactagaat ggccccaaat ggctggaaat ctcgcctatt taggcattct actcagaaaa 2041 accttaaaaa ttcacaaatg tgtcagaaga gccttgatgt ggaaaccgat ggacttgaag 2101 caaatgtgcc accagtgtcc tttctgaagg tcctgaaact gaataaaaca gaatggccct 2161 actttgtcgt gggaacagta tgtgccattg ccaatggggg gcttcagccg gcattttcag 2221 tcatattctc agagatcata gcgatttttg gaccaggcga tgatgcagtg aagcagcaga 2281 agtgcaacat attctctttg attttcttat ttctgggaat tatttctttt tttactttct 2341 tccttcaggg tttcacgttt gggaaagctg gcgagatcct caccagaaga ctgcggtcaa 2401 tggcttttaa agcaatgcta agacaggaca tgagctggtt tgatgaccat aaaaacagta 2461 ctggtgcact ttctacaaga cttgccacag atgctgccca agtccaagga gccacaggaa 2521 ccaggttggc tttaattgca cagaatatag ctaaccttgg aactggtatt atcatatcat 2581 ttatctacgg ttggcagtta accctattgc tattagcagt tgttccaatt attgctgtgt 2641 caggaattgt tgaaatgaaa ttgttggctg gaaatgccaa aagagataaa aaagaactgg 2701 aagctgctgg aaagattgca acagaggcaa tagaaaatat taggacagtt gtgtctttga 2761 cccaggaaag aaaatttgaa tcaatgtatg ttgaaaaatt gtatggacct tacaggaatt 2821 ctgtgcagaa ggcacacatc tatggaatta cttttagtat ctcacaagca tttatgtatt 2881 tttcctatgc cggttgtttt cgatttggtg catatctcat tgtgaatgga catatgcgct 2941 tcagagatgt tattctggtg ttttctgcaa ttgtatttgg tgcagtggct ctaggacatg 3001 ccagttcatt tgctccagac tatgctaaag ctaagctgtc tgcagcccac ttattcatgc 3061 tgtttgaaag acaacctctg attgacagct acagtgaaga ggggctgaag cctgataaat 3121 ttgaaggaaa tataacattt aatgaagtcg tgttcaacta tcccacccga gcaaacgtgc 3181 cagtgcttca ggggctgagc ctggaggtga agaaaggcca gacactagcc ctggtgggca 3241 gcagtggctg tgggaagagc acggtggtcc agctcctgga gcggttctac gaccccttgg 3301 cggggacagt gcttctcgat ggtcaagaag caaagaaact caatgtccag tggctcagag 3361 ctcaactcgg aatcgtgtct caggagccta tcctatttga ctgcagcatt gccgagaata 3421 ttgcctatgg agacaacagc cgggttgtat cacaggatga aattgtgagt gcagccaaag 3481 ctgccaacat acatcctttc atcgagacgt taccccacaa atatgaaaca agagtgggag 3541 ataaggggac tcagctctca ggaggtcaaa aacagaggat tgctattgcc cgagccctca 3601 tcagacaacc tcaaatcctc ctgttggatg aagctacatc agctctggat actgaaagtg 3661 aaaaggttgt ccaagaagcc ctggacaaag ccagagaagg ccgcacctgc attgtgattg 3721 ctcaccgcct gtccaccatc cagaatgcag acttaatagt ggtgtttcag aatgggagag 3781 tcaaggagca tggcacgcat cagcagctgc tggcacagaa aggcatctat ttttcaatgg 3841 tcagtgtcca ggctgggaca cagaacttat gaacttttgc tacagtatat tttaaaaata 3901 aattcaaatt attctaccca tttt // LOCUS HUMMEA 832 bp mRNA PRI 11-JUN-1993 DEFINITION Human male-enhanced antigen mRNA (Mea), complete cds. ACCESSION M27937 NID g187507 KEYWORDS male-enhanced antigen. SOURCE Human (adult) testis, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 832) AUTHORS Lau,Y.-F.C., Chan,K. and Sparkes,R.S. TITLE Male-enhanced antigen gene is phylogenetically conserved and expressed at late stages of spermatogenesis JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 8462-8466 (1989) MEDLINE 90046817 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.-F.C.Lau, 13-SEP-1989. FEATURES Location/Qualifiers source 1..832 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="testis" CDS 35..592 /note="male-enhanced antigen mRNA" /codon_start=1 /db_xref="PID:g307182" /translation="MGPERHLSGAPARMATVVLGGDTMGPERIFPNQTEELGHQGPSE GTGDWSSEEPEEEQEETGSGPAGYSYQPLNQDPEQEEVELAPVGDGDVVADIQDRIQA LGLHLPDPPLESEDEDEEGATALNNHSSIPMDPEHVELVKRTMAGVSLPAPGVPAWAR EISDAQWEDVVQKALQARQASPAWK" polyA_signal 806..811 BASE COUNT 211 a 225 c 237 g 159 t ORIGIN Chromosome 6p21.1-21.3. 1 cgccgctgca gctggggcca tttgagggga gcccatgggg cctgaaaggc atctgtcagg 61 cgcccctgcc cggatggcaa cagtagttct aggaggagac accatgggcc ctgagcgtat 121 cttccccaat cagactgagg aactgggaca tcagggccct tcagaaggca ctggggattg 181 gagcagtgag gagcctgagg aagagcagga ggaaacgggg tcgggcccag ctggctactc 241 ctaccagccc ctgaaccaag atcctgaaca agaggaggtg gaactggcac cagtggggga 301 tggagatgta gttgctgaca tccaggatcg aatccaggcc ctggggcttc atttgccaga 361 cccaccatta gagagtgaag atgaagatga ggagggagct acagcgttga acaaccacag 421 ctctattccc atggacccag aacatgtaga gctggtgaaa aggacaatgg ctggagtaag 481 cctgcctgcg ccaggggttc ctgcctgggc tcgggagata tctgatgccc agtgggaaga 541 tgtggtacag aaagccctcc aagcccggca ggcatcccct gcctggaagt gaccacagtg 601 agagctgcct tatattccta cattccaggc cagaaccagc acaggactga acacatccct 661 ggttgtaatg tccatttcca tcttccccgt ctccctttcc acatcaaggc acatcagact 721 tctcagagac ccactttatt cagttctgta catatgggga catcggtcca agcccaacca 781 ccttagcatg tatcactctg tggagaataa agcacctatg tactgagcca aa // LOCUS HUMMECH 1277 bp mRNA PRI 11-DEC-1993 DEFINITION Human mRNA for mitochondrial short-chain enoyl-CoA hydratase, complete cds. ACCESSION D13900 NID g433412 KEYWORDS mitochondrial short-chain enoyl-CoA hydratase. SOURCE Homo sapiens liver cDNA to mRNA, clones hSCEH4, 5 and 9. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1277) AUTHORS Kanazawa,M., Ohtake,A., Abe,H., Yamamoto,S., Satoh,Y., Takayanagi,M., Niimi,H., Mori,M. and Hashimoto,T. TITLE Molecular cloning and sequence analysis of the cDNA for human mitochondrial short-chain enoyl-CoA hydratase JOURNAL Enzyme Protein 47 (1), 9-13 (1993) MEDLINE 94282213 REFERENCE 2 (bases 1 to 1277) AUTHORS Ohtake,A. TITLE Direct Submission JOURNAL Submitted (07-DEC-1992) to the DDBJ/EMBL/GenBank databases. Akira Ohtake, The Tokyo Metropolitan institute of Medical Science, Department of Clinical Genetics; 3-18-22 Honkomagome, Bunkyo-ku, Tokyo 113, Japan (E-mail:ohtake@rinshoken.or.jp, Tel:03-3823-2101, Fax:03-3823-6008) COMMENT Submitted (07-Dec-1992) to DDBJ by: Akira Ohtake Department of Clinical Genetics The Tokyo Metropolitan Institute of Medical Science 3-18-22 Honkomagome Bunkyo-ku Tokyo 113 Japan Phone: 03-3823-2101 Fax: 03-3823-6008 E-mail: ohtake@rinshoken.or.jp. FEATURES Location/Qualifiers source 1..1277 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" sig_peptide 22..108 CDS 22..894 /EC_number="4.2.1.17" /codon_start=1 /product="mitochondrial short-chain enoyl-CoA hydratase" /db_xref="PID:d1003507" /db_xref="PID:g433413" /translation="MAALRVLLSCARGPLRPPVRCPAWRPFASGANFEYIIAEKRGKN NTVGLIQLNRPKALNALCDGLIDELNQALKIFEEDPAVGGIVLTGGDKAFAAGADIKE MQNLSFQDCYSSKFLKHWGHLTQVKKPVIAAVNGYPFGGGCELAMMCDIIYAGEKAQF AQPEILIGTIPGAGGTQRLTRAVGKSLELEMVLTGDAISAQDAKQAGLVSKICPVETL VEEAIQCAEKIASNSKIVVAMAKESVNAAFEMTLTEGSKLEKKLFYSTFATDDRKEGM TAFVEKRKANFKDQ" mat_peptide 109..891 /product="mitochondrial short-chain enoyl-CoA hydratase" polyA_signal 1254..1259 polyA_site 1277 BASE COUNT 295 a 335 c 367 g 280 t ORIGIN 1 gggcgaggag tccagagagc catggccgcc ctgcgtgtcc tgctgtcctg cgcccgcggc 61 ccgctgaggc ccccggttcg ctgtcccgcc tggcgtccct tcgcctcggg tgctaacttt 121 gagtacatca tcgcagaaaa aagagggaag aataacaccg tggggttgat ccaactgaac 181 cgccccaagg ccctcaatgc actttgcgat ggcctgattg acgagctcaa ccaggccctg 241 aagatcttcg aggaggaccc ggccgttggg ggcattgtcc tcaccggcgg ggataaggcc 301 tttgcagctg gagctgatat caaggaaatg cagaacctga gtttccagga ctgttactcc 361 agcaagttct tgaagcactg gggccacctc acccaggtca agaagccagt catcgctgct 421 gtcaatggct atccgtttgg cgggggctgt gagcttgcca tgatgtgtga tatcatctat 481 gccggtgaga aggcccagtt tgcacagccg gagatcttaa taggaaccat cccaggtgca 541 ggcggcaccc agagactcac ccgtgctgtt gggaagtcgc tggagctgga gatggtcctc 601 accggtgacg cgatctcagc ccaggacgcc aagcaagcag gtcttgtcag caagatttgt 661 cctgttgaga cactggtgga agaagccatc cagtgtgcag aaaaaattgc cagcaattct 721 aaaattgtag tagcgatggc caaagaatca gtgaatgcag cttttgaaat gacattaaca 781 gaaggaagta agttggagaa gaaactcttt tattcaacct ttgccactga tgaccggaaa 841 gaagggatga ccgcgtttgt ggaaaagaga aaggccaact tcaaagacca gtgagaacca 901 gctgcccctg cttcacacct ctgcttggag aggacaagtg cagcctgtca gttttaggaa 961 gcaagtaaat catcctcttt tcaagagcag tgtccgtggt gtgcagttcc tctccaattg 1021 ctgcgtggtc gtggcccgac ctctcacggc atgacagcct tcgtcaccca gcctgtgagg 1081 gtcctgactg gagcaccttc taaatctaag attctgctga ggagcccccg ctggtccctc 1141 tgggcatgct gtgctcggac ggaaagcggg gcctgtgggt ccttgtgtcc ctgccgctga 1201 agaatggggc tgctctgagg gaaacgctgt ctgctgcctt catacagatg ctgattaaag 1261 tgatagcgat tcagatt // LOCUS HUMMECP 1669 bp mRNA PRI 31-AUG-1995 DEFINITION Homo sapiens methyl-CpG-binding protein (MeCP-2) mRNA, complete cds. ACCESSION L37298 NID g972764 KEYWORDS methyl-CpG-binding protein. SOURCE Homo sapiens (clone: pcdMP4) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1669) AUTHORS Kudo,S. and Fukuda,M. JOURNAL Unpublished (1994) FEATURES Location/Qualifiers source 1..1669 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pcdMP4" /cell_line="SK-MEL-28" /cell_type="melanoma" mRNA 1..1669 5'UTR 1..81 gene 82..1542 /gene="MeCP-2" CDS 82..1542 /gene="MeCP-2" /codon_start=1 /product="methyl-CpG-binding protein" /db_xref="PID:g972765" /translation="MVAGMLGLREEKSEDQDLQGLKDKPLKFKKVKKDKKEEKEGKHE PVQPSAHHSAEPAEAGKAETSEGSGSAPAVPEASASPKQRRSIIRDRGPMYDDPTLPE GWTRKLKQRKSGRSAGKYDVYLINPQGKAFRSKVELIAYFEKVGDTSLDPNDFDFTVT GRGSPSRREQKPPKKPKSPKAPGTGRGRGRPKGSGTTRPKAATSEGVQVKRVLEKSPG KLLVKMPFQTSPGGKAEGGGATTSTQVMVIKRPGRKRKAEADPQAIPKKRGRKPGSVV AAAAAEAKKKAVKESSIRSVQETVLPIKKRKTRETVSIEVKEVVKPLLVSTLGEKSGK GLKTCKSPGRKSKESSPKGRSSSASSPPKKEHHHHHHHSESPKAPVPLLPPLPPPPPE PESSEDPTSPPEPQDLSSSVCKEEKMPRGGSLESDGCPKEPAKTQPAVATAATAAEKY KHRGEGERKDIVSSSMPRPNREEPVDSRTPVTERVS" 3'UTR 1540..1669 polyA_signal 1645..1650 polyA_site 1669 BASE COUNT 454 a 486 c 475 g 254 t ORIGIN 1 agactacagt tcctgctttg atgtgacatg tgactcccca gaatacacct tgcttctgta 61 gaccagctcc aacaggattc catggtagct gggatgttag ggctcaggga agaaaagtca 121 gaagaccagg acctccaggg cctcaaggac aaacccctca agtttaaaaa ggtgaagaaa 181 gataagaaag aagagaaaga gggcaagcat gagcccgtgc agccatcagc ccaccactct 241 gctgagcccg cagaggcagg caaagcagag acatcagaag ggtcaggctc cgccccggct 301 gtgccggaag cttctgcctc ccccaaacag cggcgctcca tcatccgtga ccggggaccc 361 atgtatgatg accccaccct gcctgaaggc tggacacgga agcttaagca aaggaaatct 421 ggccgctctg ctgggaagta tgatgtgtat ttgatcaatc cccagggaaa agcctttcgc 481 tctaaagtgg agttgattgc gtacttcgaa aaggtaggcg acacatccct ggaccctaat 541 gattttgact tcacggtaac tgggagaggg agcccctccc ggcgagagca gaaaccacct 601 aagaagccca aatctcccaa agctccagga actggcagag gccggggacg ccccaaaggg 661 agcggcacca cgagacccaa ggcggccacg tcagagggtg tgcaggtgaa aagggtcctg 721 gagaaaagtc ctgggaagct ccttgtcaag atgccttttc aaacttcgcc agggggcaag 781 gctgaggggg gtggggccac cacatccacc caggtcatgg tgatcaaacg ccccggcagg 841 aagcgaaaag ctgaagctga ccctcaggcc attcccaaga aacggggccg aaagccgggg 901 agtgtggtgg cagccgctgc cgccgaggcc aaaaagaaag ccgtgaagga gtcttctatc 961 cgatctgtgc aggagaccgt actccccatc aagaagcgca agacccggga gacggtcagc 1021 atcgaggtca aggaagtggt gaagcccctg ctggtgtcca ccctcggtga gaagagcggg 1081 aaaggactga agacctgtaa gagccctggg cggaaaagca aggagagcag ccccaagggg 1141 cgcagcagca gcgcctcctc accccccaag aaggagcacc accaccatca ccaccactca 1201 gagtccccaa aggcccccgt gccactgctc ccacccctgc ccccacctcc acctgagccc 1261 gagagctccg aggaccccac cagcccccct gagccccagg acttgagcag cagcgtctgc 1321 aaagaggaga agatgcccag aggaggctca ctggagagcg acggctgccc caaggagcca 1381 gctaagactc agcccgcggt tgccaccgcc gccacggccg cagaaaagta caaacaccga 1441 ggggagggag agcgcaaaga cattgtttca tcctccatgc caaggccaaa cagagaggag 1501 cctgtggaca gccggacgcc cgtgaccgag agagttagct gactttacac ggagcggatt 1561 gcaaagcaaa ccaacaagaa taaaggcagc tgttgtctct tctccttatg ggtagggctc 1621 tgacaaagct tcccgattaa ctgaaataaa aaatattttt ttttctttc // LOCUS HUMMEF2C 4077 bp mRNA PRI 07-JAN-1995 DEFINITION Homo sapiens MADS/MEF2-family transcription factor (MEF2C) mRNA, complete cds. ACCESSION L08895 NID g292289 KEYWORDS myocyte-specific enhancer binding factor 2C(hMEF2C); transcription factor. SOURCE Homo sapiens Fetus cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4077) AUTHORS Leifer,D., Krainc,D., Yu,Y.T., McDermott,J., Breitbart,R.E., Heng,J., Neve,R.L., Kosofsky,B., Nadal-Ginard,B. and Lipton,S.A. TITLE MEF2C, a MADS/MEF2-family transcription factor expressed in a laminar distribution in cerebral cortex JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90 (4), 1546-1550 (1993) MEDLINE 93165732 FEATURES Location/Qualifiers source 1..4077 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="Fetus" misc_feature 316 /note="sequence derived from skeletal muscle clones starts here" gene 402..1823 /gene="MEF2C" CDS 402..1823 /gene="MEF2C" /note="myocyte-specific" /codon_start=1 /product="enhancer binding factor 2C" /db_xref="PID:g292290" /translation="MGRKKIQITRIMDERNRQVTFTKRKFGLMKKAYELSVLCDCEIA LIIFNSTNKLFQYASTDMDKVLLKYTEYNEPHESRTNSDIVETLRKKGLNGCDSPDPD ADDSVGHSPESEDKYRKINEDIDLMISRQRLCAVPPPNFEMPVSIPVSSHNSLVYSNP VSSLGNPNLLPLAHPSLQRNSMSPGVTHRPPSAGNTGGLMGGDLTSGAGTSAGNGYGN PRNSPGLLVSPGNLNKNMQAKSPPPMNLGMNNRKPDLRVLIPPGSKNTMPSVSEDVDL LLNQRINNSQSAQSLATPVVSVATPTLPGQGMGGYPSAISTTYGTEYSLSSADLSSLS GFNTASALHLGSVTGWQQQHLHNMPPSALSQLGACTSTHLSQSSNLSLPSTQSLNIKS EPVSPPRDRTTTPSRYPQHTRHEAGRSPVDSLSSCSSSYDGSDREDHRNEFHSPIGLT RPSPDERESPSVKRMRLSEGWAT" allele 825 /gene="MEF2C" /note="1 of 9 independent plasmid isolates has G replaced with T" allele 1212..1235 /gene="MEF2C" /note="nucleotides deleted from skeletal muscle clones" allele 1503..1598 /gene="MEF2C" /note="nucleotides deleted from some clones derived from both brain and muscle" allele 1843..1855 /note="number of T's varies from 10 to 14 in independent plasmidisolates" polyA_signal 2831..2836 polyA_site 2850 /note="one skeletal muscle clone polyadenylated at this nucleotide" polyA_signal 4035..4040 polyA_site 4077 BASE COUNT 1230 a 823 c 840 g 1184 t ORIGIN 1 gaattcccag ctctctgctc gctctgctcg cagtcacaga cacttgagca cacgcgtaca 61 cccagacatc ttcgggctgc tattggattg actttgaagg ttctgtgtgg gtcgccgtgg 121 ctgcatgttt gaatcaggtg gagaagcact tcaacgctgg acgaagtaaa gattattgtt 181 gttatttttt ttttctctct ctctctctct taagaaagga aaatatccca aggactaatc 241 tgatcgggtc ttccttcatc aggaacgaat gcaggaattt gggaactgag ctgtgcaagt 301 gctgaagaag gagatttgtt tggaggaaac aggaaagaga aagaaaagga aggaaaaaat 361 acataatttc agggacgaga gagagaagaa aaacggggac tatggggaga aaaaagattc 421 agattacgag gattatggat gaacgtaaca gacaggtgac atttacaaag aggaaatttg 481 ggttgatgaa gaaggcttat gagctgagcg tgctgtgtga ctgtgagatt gcgctgatca 541 tcttcaacag caccaacaag ctgttccagt atgccagcac cgacatggac aaagtgcttc 601 tcaagtacac ggagtacaac gagccgcatg agagccggac aaactcagac atcgtggaga 661 cgttgagaaa gaagggcctt aatggctgtg acagcccaga ccccgatgcg gacgattccg 721 taggtcacag ccctgagtct gaggacaagt acaggaaaat taacgaagat attgatctaa 781 tgatcagcag gcaaagattg tgtgctgttc cacctcccaa cttcgagatg ccagtctcca 841 tcccagtgtc cagccacaac agtttggtgt acagcaaccc tgtcagctca ctgggaaacc 901 ccaacctatt gccactggct cacccttctc tgcagaggaa tagtatgtct cctggtgtaa 961 cacatcgacc tccaagtgca ggtaacacag gtggtctgat gggtggagac ctcacgtctg 1021 gtgcaggcac cagtgcaggg aacgggtatg gcaatccccg aaactcacca ggtctgctgg 1081 tctcacctgg taacttgaac aagaatatgc aagcaaaatc tcctccccca atgaatttag 1141 gaatgaataa ccgtaaacca gatctccgag ttcttattcc accaggcagc aagaatacga 1201 tgccatcagt gtctgaggat gtcgacctgc ttttgaatca aaggataaat aactcccagt 1261 cggctcagtc attggctacc ccagtggttt ccgtagcaac tcctacttta ccaggacaag 1321 gaatgggagg atatccatca gccatttcaa caacatatgg taccgagtac tctctgagta 1381 gtgcagacct gtcatctctg tctgggttta acaccgccag cgctcttcac cttggttcag 1441 taactggctg gcaacagcaa cacctacata acatgccacc atctgccctc agtcagttgg 1501 gagcttgcac tagcactcat ttatctcaga gttcaaatct ctccctgcct tctactcaaa 1561 gcctcaacat caagtcagaa cctgtttctc ctcctagaga ccgtaccacc accccttcga 1621 gatacccaca acacacgcgc cacgaggcgg ggagatctcc tgttgacagc ttgagcagct 1681 gtagcagttc gtacgacggg agcgaccgag aggatcaccg gaacgaattc cactccccca 1741 ttggactcac cagaccttcg ccggacgaaa gggaaagtcc ctcagtcaag cgcatgcgac 1801 tttctgaagg atgggcaaca tgatcagatt attacttact agtttttttt tttttcttgc 1861 agtgtgtgtg tgtgctatac cttaatgggg aaggggggtc gatatgcatt atatgtgccg 1921 tgtgtggaaa aaaaaaaagt caggtactct gttttgtaaa agtactttta aattgcctca 1981 gtgatacagt ataaagataa acagaaatgc tgagataagc ttagcacttg agttgtacaa 2041 cagaacactt gtacaaaata gattttaagg ctaacttctt ttcactgttg tgctcctttg 2101 caaaatgtat gttacaatag atagtgtcat gttgcaggtt caacgttatt tacatgtaaa 2161 tagacaaaag gaaacatttg ccaaaagcgg cagatcttta ctgaaagaga gagcagctgt 2221 tatgcaacat atagaaaaat gtatagatgc ttggacagac ccggtaatgg gtggccattg 2281 gtaaatgtta ggaacacacc aggtcacctg acatcccaag aatgctcaca aacctgcagg 2341 catatcattg gcgtatggca ctcattaaaa aggatcagag accattaaaa gaggaccata 2401 cctattaaaa aaaaatgtgg agttggaggg ctaacatatt taattaaata aataaataaa 2461 tctgggtctg catctcttat taaataaaaa tataaaaata tgtacattac attttgctta 2521 ttttcatata aaaggtaaga cagagtttgc aaagcatttg tggctttttg tagtttactt 2581 aagccaaaat gtgttttttt ccccttgata gcttcgctaa tattttaaac agtcctgtaa 2641 aaaaccaaaa aggacttttt gtatagaaag cactacccta agccatgaag aactccatgc 2701 tttgctaacc aagataactg ttttctcttt gtagaagttt tgtttttgaa atgtgtattt 2761 ctaattatat aaaatattaa gaatctttta aaaaaatctg tgaaattaac atgcttgtgt 2821 atagctttct aatatatata atattatggt aatagcagaa gttttgttat cttaatagcg 2881 ggaggggggt atatttgtgc agttgcacat ttgagtaact attttctttc tgttttcttt 2941 tactctgctt acattttata agtttaaggt cagctgtcaa aaggataacc tgtggggtta 3001 gaacatatca cattgcaaca ccctaaattg tttttaatac attagcaatc tattgggtca 3061 actgacatcc attgtatata ctagtttctt tcatgctatt tttattttgt tttttgcatt 3121 tttatcaaat gcagggcccc tttctgatct caccatttca ccatgcatct tggaattcag 3181 taagtgcata tcctaacttg cccatattct aaatcatctg gttggttttc agcctagaat 3241 ttgatacgct ttttagaaat atgcccagaa tagaaaagct atgttggggc acatgtcctg 3301 caaatatggc cctagaaaca agtgatatgg aatttacttg gtgaataagt tataaattcc 3361 cacagaagaa aaatgtgaaa gactgggtgc tagacaagaa ggaagcaggt aaagggatag 3421 ttgctttgtc atccgttttt aattatttta actgaccctt gacaatcttg tcagcaatat 3481 aggactgttg aacaatcccg gtgtgtcagg acccccaaat gtcacttctg cataaagcat 3541 gtatgtcatc tattttttct tcaataaaga gatttaatag ccatttcaag aaatcccata 3601 aagaacctct ctatgtccct ttttttaatt taaaaaaatg actcttgtct aatattcgtc 3661 tataagggat taattttcag accctttaat aagtgagtgc cataagaaag tcaatatata 3721 ttgtttaaaa gatatttcag tctaggaaag attttccttc tcttggaatg tgaagatctg 3781 tcgattcatc tccaatcata tgcattgaca tacacagcaa agaagatata ggcagtaata 3841 tcaacactgc tatatcatgt gtaggacatt tcttatccat tttttctctt ttacttgcat 3901 agttgctatg tgtttctcat tgtaaaaggc tgccgctggg tggcagaagc caagagacct 3961 tattaactag gctatatttt tcttaacttg atctgaaatc cacaattaga ccacaatgca 4021 cctttggttg tatccataaa ggatgctagc ctgccttgta ctaatgtttt atatatt // LOCUS HUMMEL18 2227 bp mRNA PRI 25-NOV-1993 DEFINITION Human mRNA for Mel-18 protein, complete cds. ACCESSION D13969 NID g285932 KEYWORDS Mel-18 protein; bmi-1; polycomb group gene; zinc-finger. SOURCE Homo sapiens (isolate: HTLV transformed HUT102) T-cell, lambda gt11 library of M. Yoshida, cDNA to mRNA, clones 1 and 4. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2227) AUTHORS Ishida,A., Asano,H., Hasegawa,M., Koseki,H., Ono,T., Yoshida,M.C., Taniguchi,M. and Kanno,M. TITLE Cloning and chromosome mapping of the human Mel-18 gene which encodes a DNA-binding protein with a new 'RING-finger' motif JOURNAL Gene 129 (2), 249-255 (1993) MEDLINE 93314969 REFERENCE 2 (bases 1 to 2227) AUTHORS Ishida,A. TITLE Direct Submission JOURNAL Submitted (21-DEC-1992) to the DDBJ/EMBL/GenBank databases. Atsushi Ishida, School of Medicine, Chiba University, Division of Molecular Immunology; 1-8-1 Inohana, Chuo-ku, Chiba, Chiba 260, Japan (Tel:81-43-222-7171(ex.2156-8), Fax:81-43-227-1498) COMMENT Submitted (21-DEC-1992) to DDBJ by: Atsushi Ishida Division of Molecular Immunology School of Medicine Chiba University 1-8-1 Inohana, Chuo-ku Chiba 260 Japan Phone: 043-222-7171 x2156-8 Fax: 043-227-1498. FEATURES Location/Qualifiers source 1..2227 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T-cell" /clone_lib="lambda gt11 library of M. Yoshida" CDS 202..1236 /codon_start=1 /product="Mel-18 protein" /db_xref="PID:d1003580" /db_xref="PID:g285933" /translation="MHRTTRIKITELNPHLMCALCGGYFIDATTIVECLHSFCKTCIV RYLETNKYCPMCDVQVHKTRPLLSIRSDKTLQDIVYKLVPGLFKDEMKRRRDFYAAYP LTEVPNGSNEDRGEVLEQEKGALSDDEIVSLSIEFYEGARDRDEKKGPLENGDGDKEK TGVRFLRCPAAMTVMHLAKFLRNKMDVPSKYKVEVLYEDEPLKEYYTLMDIAYIYPWR RNGPLPLKYRVQPACKRLTLATVPTPSEGTNTSGASECESVSDKAPSPATLPATSSSL PSPATPSHGSPSSHGPPATHPTSPTPPSTASGATTAANGGSLNCLQTPSSTSRGRKMT VNGAPVPPLT" polyA_signal 1364..1369 polyA_signal 1576..1581 BASE COUNT 489 a 659 c 568 g 511 t ORIGIN Chromosome 12q22. 1 gagagcccga acaggaagag ggtacagctt tgtgcaggtc acatgcccac tgcagccctc 61 cagcctctgg tccccagagc ggactttgga agctgaactg cttttgttgc tggaagactt 121 atgttataat ttaccctggg tggaccaggg tcgtacaaaa gggcaacgct ccccagtccc 181 cccactcccg accccggaat catgcatcgg actacacgga tcaaaatcac agagctgaac 241 ccccacctca tgtgtgccct ctgcgggggg tacttcatcg acgccaccac tatcgtggag 301 tgcctgcatt ccttctgcaa aacctgcatc gtgcgctacc tggagaccaa caaatactgc 361 cccatgtgtg acgtgcaggt ccataaaacc cggccgctgc tgagcatcag gtctgacaaa 421 acacttcaag acattgtcta caaattggtc cctgggcttt ttaaagatga gatgaaacgg 481 cggcgggatt tctatgcagc gtaccccctg acggaggtcc ccaacggctc caatgaggac 541 cgcggcgagg tcttggagca ggagaagggg gctctgagtg atgatgagat tgtcagcctc 601 tccatcgaat tctacgaagg tgccagggac cgggatgaga agaagggccc cctggagaat 661 ggggatgggg acaaagagaa aacaggggtg cgcttcctgc gatgcccagc agccatgacc 721 gtcatgcatc ttgccaagtt tctccgcaac aagatggatg tgcccagcaa gtacaaggtg 781 gaggttctgt acgaggacga gccactgaag gaatactaca ccctcatgga catcgcctac 841 atctacccct ggcggcggaa cgggcctctc cccctcaagt accgtgtcca gccagcctgc 901 aagcggctca ccctagccac ggtgcccacc ccctccgagg gcaccaacac cagcggggcg 961 tccgagtgtg agtcagtcag cgacaaggct cccagccctg ccaccctgcc agccacctcc 1021 tcctccctgc ccagcccagc caccccatcc catggctctc ccagttccca tgggcctcca 1081 gccacccacc ctacctcccc cactccccct tcgacagcca gtggggccac cacagctgcc 1141 aacgggggta gcttgaactg cctgcagaca ccatcctcca ccagcagggg gcgcaagatg 1201 actgtcaacg gcgctcccgt gcccccctta acttgaggcc agggaccctc tcccttcttc 1261 cagccaagcc tctccactcc ttccactttt tctgggccct tttttccact tcttctactt 1321 tccccagctc ttcccacctt gggggtgggg ggcgggtttt ataaataaat atatatatat 1381 atgtacatag gaaaaaccaa atatacatac ttattttcta tggaccaacc agattaattt 1441 aaatgccaca ggaaacaaac tttatgtgtg tgtgtatgtg tggaaaatgg tgttcatttt 1501 ttttgggggg ggtcttgtgt aatttgctgt ttttgggggt gcctggagat gaactggatg 1561 ggccactgga gtctcaataa agctctgcac catcctcgct gtttcccaag gcaggtggtg 1621 tgttgggggc cccttcagac ccaaagcttt aggcatgatt ccaactggct gcatatagga 1681 gtcagttaga attgtttctt tctctccccg tttctctccc catcttggct gctgtcctgc 1741 ctctgaccag tggccgcccc ccgcgttgtt gaatgtccag aaattgctaa gaacagtgcc 1801 ttttacaaat gcagtttatc cctggttctg aggagcaagt gcagggtgga ggtggcacct 1861 gcatcacctc ctcctcttgc agtggaaact ttgtgcaaag aatagatagt tctgcctctt 1921 tttttttttt ttcctgtgtg tgtggccttt gcatcattta tcttgtggaa aagaagattc 1981 aggccctgag aggtctcagc tcttggagga gggctaaggc tttagcattg tgaagcgctg 2041 cacccccacc aaccttaccc tcaccgggga accctcacta gcaggactgg tggtggagtc 2101 tcacctgggg cctagagtgg aagtgggggt gggttaacct cacacaagca cagatcccag 2161 actttgccag aggcaaacag ggaattccgc cgatactgac gggctccagg agtcgtcgcc 2221 acactcg // LOCUS HUMMET 1035 bp mRNA PRI 07-JAN-1995 DEFINITION Human metalloproteinase inhibitor mRNA, complete cds. ACCESSION M32304 NID g187522 KEYWORDS metalloproteinase inhibitor. SOURCE Human fetal aorta, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1035) AUTHORS Boone,T.C., Johnson,M.J., De Clerck,Y.A. and Langley,K.E. TITLE cDNA cloning and expression of a metalloproteinase inhibitor related to tissue inhibitor of metalloproteinases JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87 (7), 2800-2804 (1990) MEDLINE 90207285 COMMENT Draft entry and computer-readable sequence [1] kindly submitted by K.E.Langley, 23-FEB-1990, for release after publication. FEATURES Location/Qualifiers source 1..1035 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Xp11.3-p11.23" gene 255..917 /gene="TIMP" CDS 255..917 /gene="TIMP" /note="metalloproteinase inhibitor precursor" /codon_start=1 /db_xref="GDB:G00-119-615" /db_xref="PID:g307195" /translation="MGAAARTLRLALGLLLLATLLRPADACSCSPVHPQQAFCNADVV IRAKAVSEKEVDSGNDIYGNPIKRIQYEIKQIKMFKGPEKDIEFIYTAPSSAVCGVSL DVGGKKEYLIAGKAEGDGKMHITLCDFIVPWDTLSTTQKKSLNHRYQMGCECKITRCP MIPCYISSPDECLWMDWVTEKNINGHQAKFFACIKRSDGSCAWYRGAAPPKQEFLDIE DP" sig_peptide 255..332 /gene="TIMP" /note="metalloproteinase inhibitor signal peptide" mat_peptide 333..914 /gene="TIMP" /note="metalloproteinase inhibitor" polyA_signal 1011..1016 BASE COUNT 216 a 361 c 306 g 152 t ORIGIN 1 gaattccggc ccgccgtccc ccaccccgcc gccccgcccg gcgaattgcg ccccgcgccc 61 ctcccctcgc gcccccgaga caaagaggag agaaagtttg cgcggccgag cggggcaggt 121 gaggagggtg agccgcgcgg gaggggcccg cctcggcccc ggctcagccc ccgcccgcgc 181 ccccagcccg ccgccgcgag cagcgcccgg accccccagc ggcggccccc gcccgcccag 241 ccccccggcc cgccatgggc gccgcggccc gcaccctgcg gctggcgctc ggcctcctgc 301 tgctggcgac gctgcttcgc ccggccgacg cctgcagctg ctccccggtg cacccgcaac 361 aggcgttttg caatgcagat gtagtgatca gggccaaagc ggtcagtgag aaggaagtgg 421 actctggaaa cgacatttat ggcaacccta tcaagaggat ccagtatgag atcaagcaga 481 taaagatgtt caaagggcct gagaaggata tagagtttat ctacacggcc ccctcctcgg 541 cagtgtgtgg ggtctcgctg gacgttggag gaaagaagga atatctcatt gcaggaaagg 601 ccgaggggga cggcaagatg cacatcaccc tctgtgactt catcgtgccc tgggacaccc 661 tgagcaccac ccagaagaag agcctgaacc acaggtacca gatgggctgc gagtgcaaga 721 tcacgcgctg ccccatgatc ccgtgctaca tctcctcccc ggacgagtgc ctctggatgg 781 actgggtcac agagaagaac atcaacgggc accaggccaa gttcttcgcc tgcatcaaga 841 gaagtgacgg ctcctgtgcg tggtaccgcg gcgcggcgcc ccccaagcag gagtttctcg 901 acatcgagga cccataagca ggcctccaac gcccctgtgg ccaactgcaa aaaaagcctc 961 caagggtttc gactggtcca gctctgacat cccttcctgg aaacagcatg aataaaacac 1021 tcatccccgg aattc // LOCUS HUMMETPOA 4626 bp mRNA PRI 07-JAN-1995 DEFINITION Human MET proto-oncogene mRNA, complete cds. ACCESSION J02958 NID g187558 KEYWORDS cell surface receptor; proto-oncogene; tyrosine kinase. SOURCE Human osteogenic sarcoma cell line (HOS) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4626) AUTHORS Park,M., Dean,M., Kaul,K., Braun,M.J., Gonda,M.A. and Vande Woude,G. TITLE Sequence of MET protooncogene cDNA has features characteristic of the tyrosine kinase family of growth-factor receptors JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84 (18), 6379-6383 (1987) MEDLINE 87317655 COMMENT Draft entry and computer-readable sequence of sequence [1] kindly provided 09-OCT-1987. FEATURES Location/Qualifiers source 1..4626 /organism="Homo sapiens" /db_xref="taxon:9606" /map="7q31" gene 195..4421 /gene="MET" CDS 195..4421 /gene="MET" /note="MET proto-oncogene protein" /codon_start=1 /db_xref="GDB:G00-120-178" /db_xref="PID:g307196" /translation="MKAPAVLAPGILVLLFTLVQRSNGECKEALAKSEMNVNMKYQLP NFTAETPIQNVILHEHHIFLGATNYIYVLNEEDLQKVAEYKTGPVLEHPDCFPCQDCS SKANLSGGVWKDNINMALVVDTYYDDQLISCGSVNRGTCQRHVFPHNHTADIQSEVHC IFSPQIEEPSQCPDCVVSALGAKVLSSVKDRFINFFVGNTINSSYFPDHPLHSISVRR LKETKDGFMFLTDQSYIDVLPEFRDSYPIKYVHAFESNNFIYFLTVQRETLDAQTFHT RIIRFCSINSGLHSYMEMPLECILTEKRKKRSTKKEVFNILQAAYVSKPGAQLARQIG ASLNDDILFGVFAQSKPDSAEPMDRSAMCAFPIKYVNDFFNKIVNKNNVRCLQHFYGP NHEHCFNRTLLRNSSGCEARRDEYRTEFTTALQRVDLFMGQFSEVLLTSISTFIKGDL TIANLGTSEGRFMQVVVSRSGPSTPHVNFLLDSHPVSPEVIVEHTLNQNGYTLVITGK KITKIPLNGLGCRHFQSCSQCLSAPPFVQCGWCHDKCVRSEECLSGTWTQQICLPAIY KVFPNSAPLEGGTRLTICGWDFGFRRNNKFDLKKTRVLLGNESCTLTLSESTMNTLKC TVGPAMNKHFNMSIIISNGHGTTQYSTFSYVDPVITSISPKYGPMAGGTLLTLTGNYL NSGNSRHISIGGKTCTLKSVSNSILECYTPAQTISTEFAVKLKIDLANRETSIFSYRE DPIVYEIHPTKSFISTWWKEPLNIVSFLFCFASGGSTITGVGKNLNSVSVPRMVINVH EAGRNFTVACQHRSNSEIICCTTPSLQQLNLQLPLKTKAFFMLDGILSKYFDLIYVHN PVFKPFEKPVMISMGNENVLEIKGNDIDPEAVKGEVLKVGNKSCENIHLHSEAVLCTV PNDLLKLNSELNIEWKQAISSTVLGKVIVQPDQNFTGLIAGVVSISTALLLLLGFFLW LKKRKQIKDLGSELVRYDARVHTPHLDRLVSARSVSPTTEMVSNESVDYRATFPEDQF PNSSQNGSCRQVQYPLTDMSPILTSGDSDISSPLLQNTVHIDLSALNPELVQAVQHVV IGPSSLIVHFNEVIGRGHFGCVYHGTLLDNDGKKIHCAVKSLNRITDIGEVSQFLTEG IIMKDFSHPNVLSLLGICLRSEGSPLVVLPYMKHGDLRNFIRNETHNPTVKDLIGFGL QVAKAMKYLASKKFVHRDLAARNCMLDEKFTVKVADFGLARDMYDKEYYSVHNKTGAK LPVKWMALESLQTQKFTTKSDVWSFGVVLWELMTRGAPPYPDVNTFDITVYLLQGRRL LQPEYCPDPLYEVMLKCWHPKAEMRPSFSELVSRISAIFSTFIGEHYVHVNATYVNVK CVAPYPSLLSSEDNADDEVDTRPASFWETS" BASE COUNT 1317 a 1034 c 1026 g 1249 t ORIGIN Chromosome 7q31-q32. 1 gaattccgcc ctcgccgccc gcggcgcccc gagcgctttg tgagcagatg cggagccgag 61 tggagggcgc gagccagatg cggggcgaca gctgacttgc tgagaggagg cggggaggcg 121 cggagcgcgc gtgtggtcct tgcgccgctg acttctccac tggttcctgg gcaccgaaag 181 ataaacctct cataatgaag gcccccgctg tgcttgcacc tggcatcctc gtgctcctgt 241 ttaccttggt gcagaggagc aatggggagt gtaaagaggc actagcaaag tccgagatga 301 atgtgaatat gaagtatcag cttcccaact tcaccgcgga aacacccatc cagaatgtca 361 ttctacatga gcatcacatt ttccttggtg ccactaacta catttatgtt ttaaatgagg 421 aagaccttca gaaggttgct gagtacaaga ctgggcctgt gctggaacac ccagattgtt 481 tcccatgtca ggactgcagc agcaaagcca atttatcagg aggtgtttgg aaagataaca 541 tcaacatggc tctagttgtc gacacctact atgatgatca actcattagc tgtggcagcg 601 tcaacagagg gacctgccag cgacatgtct ttccccacaa tcatactgct gacatacagt 661 cggaggttca ctgcatattc tccccacaga tagaagagcc cagccagtgt cctgactgtg 721 tggtgagcgc cctgggagcc aaagtccttt catctgtaaa ggaccggttc atcaacttct 781 ttgtaggcaa taccataaat tcttcttatt tcccagatca tccattgcat tcgatatcag 841 tgagaaggct aaaggaaacg aaagatggtt ttatgttttt gacggaccag tcctacattg 901 atgttttacc tgagttcaga gattcttacc ccattaagta tgtccatgcc tttgaaagca 961 acaattttat ttacttcttg acggtccaaa gggaaactct agatgctcag acttttcaca 1021 caagaataat caggttctgt tccataaact ctggattgca ttcctacatg gaaatgcctc 1081 tggagtgtat tctcacagaa aagagaaaaa agagatccac aaagaaggaa gtgtttaata 1141 tacttcaggc tgcgtatgtc agcaagcctg gggcccagct tgctagacaa ataggagcca 1201 gcctgaatga tgacattctt ttcggggtgt tcgcacaaag caagccagat tctgccgaac 1261 caatggatcg atctgccatg tgtgcattcc ctatcaaata tgtcaacgac ttcttcaaca 1321 agatcgtcaa caaaaacaat gtgagatgtc tccagcattt ttacggaccc aatcatgagc 1381 actgctttaa taggacactt ctgagaaatt catcaggctg tgaagcgcgc cgtgatgaat 1441 atcgaacaga gtttaccaca gctttgcagc gcgttgactt attcatgggt caattcagcg 1501 aagtcctctt aacatctata tccaccttca ttaaaggaga cctcaccata gctaatcttg 1561 ggacatcaga gggtcgcttc atgcaggttg tggtttctcg atcaggacca tcaacccctc 1621 atgtgaattt tctcctggac tcccatccag tgtctccaga agtgattgtg gagcatacat 1681 taaaccaaaa tggctacaca ctggttatca ctgggaagaa gatcacgaag atcccattga 1741 atggcttggg ctgcagacat ttccagtcct gcagtcaatg cctctctgcc ccaccctttg 1801 ttcagtgtgg ctggtgccac gacaaatgtg tgcgatcgga ggaatgcctg agcgggacat 1861 ggactcaaca gatctgtctg cctgcaatct acaaggtttt cccaaatagt gcaccccttg 1921 aaggagggac aaggctgacc atatgtggct gggactttgg atttcggagg aataataaat 1981 ttgatttaaa gaaaactaga gttctccttg gaaatgagag ctgcaccttg actttaagtg 2041 agagcacgat gaatacattg aaatgcacag ttggtcctgc catgaataag catttcaata 2101 tgtccataat tatttcaaat ggccacggga caacacaata cagtacattc tcctatgtgg 2161 atcctgtaat aacaagtatt tcgccgaaat acggtcctat ggctggtggc actttactta 2221 ctttaactgg aaattaccta aacagtggga attctagaca catttcaatt ggtggaaaaa 2281 catgtacttt aaaaagtgtg tcaaacagta ttcttgaatg ttatacccca gcccaaacca 2341 tttcaactga gtttgctgtt aaattgaaaa ttgacttagc caaccgagag acaagcatct 2401 tcagttaccg tgaagatccc attgtctatg aaattcatcc aaccaaatct tttattagta 2461 cttggtggaa agaacctctc aacattgtca gttttctatt ttgctttgcc agtggtggga 2521 gcacaataac aggtgttggg aaaaacctga attcagttag tgtcccgaga atggtcataa 2581 atgtgcatga agcaggaagg aactttacag tggcatgtca acatcgctct aattcagaga 2641 taatctgttg taccactcct tccctgcaac agctgaatct gcaactcccc ctgaaaacca 2701 aagccttttt catgttagat gggatccttt ccaaatactt tgatctcatt tatgtacata 2761 atcctgtgtt taagcctttt gaaaagccag tgatgatctc aatgggcaat gaaaatgtac 2821 tggaaattaa gggaaatgat attgaccctg aagcagttaa aggtgaagtg ttaaaagttg 2881 gaaataagag ctgtgagaat atacacttac attctgaagc cgttttatgc acggtcccca 2941 atgacctgct gaaattgaac agcgagctaa atatagagtg gaagcaagca atttcttcaa 3001 ccgtccttgg aaaagtaata gttcaaccag atcagaattt cacaggattg attgctggtg 3061 ttgtctcaat atcaacagca ctgttattac tacttgggtt tttcctgtgg ctgaaaaaga 3121 gaaagcaaat taaagatctg ggcagtgaat tagttcgcta cgatgcaaga gtacacactc 3181 ctcatttgga taggcttgta agtgcccgaa gtgtaagccc aactacagaa atggtttcaa 3241 atgaatctgt agactaccga gctacttttc cagaagatca gtttcctaat tcatctcaga 3301 acggttcatg ccgacaagtg cagtatcctc tgacagacat gtcccccatc ctaactagtg 3361 gggactctga tatatccagt ccattactgc aaaatactgt ccacattgac ctcagtgctc 3421 taaatccaga gctggtccag gcagtgcagc atgtagtgat tgggcccagt agcctgattg 3481 tgcatttcaa tgaagtcata ggaagagggc attttggttg tgtatatcat gggactttgt 3541 tggacaatga tggcaagaaa attcactgtg ctgtgaaatc cttgaacaga atcactgaca 3601 taggagaagt ttcccaattt ctgaccgagg gaatcatcat gaaagatttt agtcatccca 3661 atgtcctctc gctcctggga atctgcctgc gaagtgaagg gtctccgctg gtggtcctac 3721 catacatgaa acatggagat cttcgaaatt tcattcgaaa tgagactcat aatccaactg 3781 taaaagatct tattggcttt ggtcttcaag tagccaaagc gatgaaatat cttgcaagca 3841 aaaagtttgt ccacagagac ttggctgcaa gaaactgtat gctggatgaa aaattcacag 3901 tcaaggttgc tgattttggt cttgccagag acatgtatga taaagaatac tatagtgtac 3961 acaacaaaac aggtgcaaag ctgccagtga agtggatggc tttggaaagt ctgcaaactc 4021 aaaagtttac caccaagtca gatgtgtggt cctttggcgt cgtcctctgg gagctgatga 4081 caagaggagc cccaccttat cctgacgtaa acacctttga tataactgtt tacttgttgc 4141 aagggagaag actcctacaa cccgaatact gcccagaccc cttatatgaa gtaatgctaa 4201 aatgctggca ccctaaagcc gaaatgcgcc catccttttc tgaactggtg tcccggatat 4261 cagcgatctt ctctactttc attggggagc actatgtcca tgtgaacgct acttatgtga 4321 acgtaaaatg tgtcgctccg tatccttctc tgttgtcatc agaagataac gctgatgatg 4381 aggtggacac acgaccagcc tccttctggg agacatcata gtgctagtac tatgtcaaag 4441 caacagtcca cactttgtcc aatggttttt tcactgcctg acctttaaaa ggccatcgat 4501 attctttgct ccttgccata ggacttgtat tgttatttaa attactggat tctaaggaat 4561 ttcttatctg acagagcatc agaaccagag gcttggtccc acaggccagg gaccaatgcg 4621 ctgcag // LOCUS HUMMETSYN 857 bp mRNA PRI 19-JAN-1996 DEFINITION Homo sapiens 5,10-methenyltetrahydrofolate synthetase mRNA, complete cds. ACCESSION L38928 NID g886296 KEYWORDS 5,10-methenyltetrahydrofolate synthetase. SOURCE Homo sapiens (clone: MTHFS-1) (clone library: lambda-DR2) (tissue library: Clontech) adult liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 857) AUTHORS Dayan,A., Bertrand,R., Beauchemin,M., Chahla,D., Mamo,A., Filion,M., Skup,D., Massie,B. and Jolivet,J. TITLE Cloning and characterization of the human 5,10-methenyltetrahydrofolate synthetase-encoding cDNA JOURNAL Gene 165 (2), 307-311 (1995) MEDLINE 96096540 FEATURES Location/Qualifiers source 1..857 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="MTHFS-1" /clone_lib="lambda-DR2" /dev_stage="adult" /tissue_type="liver" /tissue_lib="Clontech" mRNA 1..857 CDS 14..625 /EC_number="6.3.3.2" /codon_start=1 /product="5,10-methenyltetrahydrofolate synthetase" /db_xref="PID:g886297" /translation="MAAAAVSSAKRSLRGELKQRLRAMSAEERLRQSRVLSQKVIAHS EYQKSKRISIFLSMQDEIETEEIIKDIFQRGKICFIPRYRFQSNHMDMVRIESPEEIS LLPKTSWNIPQPGEGDVREEALSTGGLDLIFMPGLGFDKHGNRLGRGKGYYDAYLKRC LQHQEVKPYTLALAFKEQICLQVPVNENDMKVDEVLYEDSSTA" BASE COUNT 255 a 174 c 217 g 211 t ORIGIN 1 gcgtgggcgt gagatggcgg cggcagcggt gagcagcgcc aagcggagcc tgcggggaga 61 gctgaagcag cgtctgcggg cgatgagtgc cgaggagcgg ctacgccagt cccgcgtact 121 gagccagaag gtgattgccc acagtgagta tcaaaagtcc aaaagaattt ccatctttct 181 gagcatgcaa gatgaaattg agacagaaga gatcatcaag gacattttcc aacgaggcaa 241 aatctgcttc atccctcggt accggttcca gagcaatcac atggatatgg tgagaataga 301 atcaccagag gaaatttctt tacttcccaa aacatcctgg aatatccctc agcctggtga 361 gggtgatgtt cgggaggagg ccttgtccac agggggactt gatctcatct tcatgccagg 421 ccttgggttt gacaaacatg gcaaccgact ggggaggggc aagggctact atgatgccta 481 tctgaagcgc tgtttgcagc atcaggaagt gaagccctac accctggcgt tggctttcaa 541 agaacagatt tgcctccagg tcccagtgaa tgaaaacgac atgaaggtag atgaagtcct 601 ttacgaagac tcgtcaacag cttaaatctg gattactaca gccaaataat cagtgtttta 661 tatgagagta aagcaaagta tgtgtatttt tcccttgtca aaaattagtt gaaattgttc 721 attaatgtga atacagactg cattttaaaa ttgtaattat gaaatacctt atataaaacc 781 atctttaaaa accaatagaa gtgtgaatag tagaatatta attaaaatgg aggctatcag 841 cctgtgattt tcagctt // LOCUS HUMMEVKIN 1967 bp mRNA PRI 03-JUN-1993 DEFINITION Homo sapiens mevalonate kinase mRNA, complete cds. ACCESSION M88468 NID g307197 KEYWORDS mevalonate kinase. SOURCE Homo sapiens young male skin cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1967) AUTHORS Schafer,B.L., Bishop,R.W., Kratunis,V.J., Kalinowski,S.S., Mosley,S.T., Gibson,K.M. and Tanaka,R.D. TITLE Molecular cloning of human mevalonate kinase and identification of a missense mutation in the genetic disease mevalonic aciduria JOURNAL J. Biol. Chem. 267, 13229-13238 (1992) MEDLINE 92317034 FEATURES Location/Qualifiers source 1..1967 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="fibroblast" /sex="young male" /tissue_type="skin" 5'UTR 1..91 CDS 92..1282 /EC_number="2.7.1.36" /codon_start=1 /product="mevalonate kinase" /db_xref="PID:g187561" /translation="MLSEVLLVSAPGKVILHGEHAVVHGKVALAVSLNLRTFLRLQPH SNGKVDLSLPNIGIKRAWDVARLQSLDTSFLEQGDVTTPTSEQVEKLKEVAGLPDDCA VTERLAVLAFLYLYLSICRKQRALPSLDIVVWSELPPGAGLGSSAAYSVCLAAALLTV CEEIPNPLKDGDCVNRWTKEDLELINKWAFQGERMIHGNPSGVDNAVSTWGGALRYHQ GKISSLKRSPALQILLTNTKVPRNTRALVAGVRNRLLKFPEIVAPLLTSIDAISLECE RVLGEMGEAPAPEQYLVLEELIDMNQHHLNALGVGHASLDQLCQVTRARGLHSKLTGA GGGGCGITLLKPGLEQPEVEATKQALTSCGFDCLETSIGAPGVSIHSATSLDSRVQQA LDGL" mutation 993 /note="'mevalonic aciduria missense mutation'; 'C' in mutation" /replace="" 3'UTR 1283..1967 polyA_signal 1946 BASE COUNT 380 a 601 c 597 g 389 t ORIGIN 1 caaaacaaaa ggtagtgggg agctgctccg gcttcggcgc ggaggggcgg cggccgggga 61 ggcggcggcg gcggcaggat tcccaggagc catgttgtca gaagtcctac tggtgtctgc 121 tccggggaaa gtcatccttc atggagaaca tgccgtggta catggcaagg tagcactggc 181 tgtatccttg aacttgagaa cattcctccg gcttcaaccc cacagcaatg ggaaagtgga 241 cctcagctta cccaacattg gtatcaagcg ggcctgggat gtggccaggc ttcagtcact 301 ggacacaagc tttctggagc aaggtgatgt cacaacaccc acctcagagc aagtggagaa 361 gctaaaggag gttgcaggct tgcctgacga ctgtgctgtc accgagcgcc tggctgtgct 421 ggcctttctt tacttatacc tgtccatctg ccggaagcag agggccctgc cgagcctgga 481 tatcgtagtg tggtcggagc tgccccccgg ggcgggcttg ggctccagcg ccgcctactc 541 ggtgtgtctg gcagcagccc tcctgactgt gtgcgaggag atcccaaacc cgctgaagga 601 cggggattgc gtcaacaggt ggaccaagga ggatttggag ctaattaaca agtgggcctt 661 ccaaggggag agaatgattc acgggaaccc ctccggagtg gacaatgctg tcagcacctg 721 gggaggagcc ctccgatacc atcaagggaa gatttcatcc ttaaagaggt cgccagctct 781 ccagatcctg ctgaccaaca ccaaagtccc tcgcaatacc agggcccttg tggctggcgt 841 cagaaacagg ctgctcaagt tcccagagat cgtggccccc ctcctgacct caatagatgc 901 catctccctg gagtgtgagc gcgtgctggg agagatgggg gaagccccag ccccggagca 961 gtacctcgtg ctggaagagc tcattgacat gaaccagcac catctgaatg ccctcggcgt 1021 gggccacgcc tctctggacc agctctgcca ggtgaccagg gcccgcggac ttcacagcaa 1081 gctgactggc gcaggcggtg gtggctgtgg catcacactc ctcaagccag ggctggagca 1141 gccagaagtg gaggccacga agcaggccct gaccagctgt ggctttgact gcttggaaac 1201 cagcatcggt gcccccggcg tctccatcca ctcagccacc tccctggaca gccgagtcca 1261 gcaagccctg gatggcctct gagaggagcc cacgacactg cagccccacc cagatgcccc 1321 tttctggatt attctggggg ctgcagttcg actctgtgct ggccagcgag cgcccagctc 1381 ctgacactgc tggagaggcc ccagccgctt ggcgatgcca gccaagctct gcagtcccag 1441 cggtgggacc tagggaggca tggtctgccc tctgcatcct ctggagccag ccgagcagga 1501 ggcctaggag ggtcctctga gactccagac ctgaggcgag aagggctgct tccctgaagc 1561 tcccacagtc ccatctgctt caggcccccg ccttggcctg tgttcttcct ggccgcctgg 1621 gtccaatgct caggtgctgg ggcctggttc ccggagaagt gtgccttctc tctccctttt 1681 cagggacggc cccctgtctc tcagggccag gcctctccct cctccaggaa gccttcccct 1741 accccttgtc gcccctccct cccagagcac ctgctgtctg ggtggctcac tcagcacttg 1801 gcccttctac ctagcgggat ggggctcccc caggggctgt cccggaggcg gtgggcctgg 1861 ttaaataagg cagtgtggcc ttggtttata tgcactttct tccgatctgt acctgagagg 1921 tttgtggaaa agatggcaaa tggggaataa aaagattttg tgtcaac // LOCUS HUMMFAP 1330 bp DNA PRI 27-APR-1995 DEFINITION Homo sapiens extracellular matrix protein (MFAP3) gene, complete cds. ACCESSION L35251 NID g786118 KEYWORDS elastic microfibrillar component; extracellular matrix protein. SOURCE Homo sapiens (tissue library: genomic) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1330) AUTHORS Abrams,W.R., Ma,R.I., Kucich,U., Bashir,M.M., Decker,S., Tsipouras,P., McPherson,J.D., Wasmuth,J.J. and Rosenbloom,J. TITLE Molecular cloning of the microfibrillar protein MFAP3 and assignment of the gene to human chromosome 5q32-q33.2 JOURNAL Genomics 26 (1), 47-54 (1995) MEDLINE 95301292 FEATURES Location/Qualifiers source 1..1330 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lung and dermal fibroblasts" /tissue_lib="genomic" /map="5q31.2-q33.3" mRNA 1..>1330 /gene="MFAP3" /note="G00-371-694" exon 1..467 /gene="MFAP3" /note="G00-371-694" /number=1 gene 1..1330 /gene="MFAP3" CDS 173..1261 /gene="MFAP3" /standard_name="microfibrillar-associated protein 3" /codon_start=1 /db_xref="GDB:G00-371-694" /product="extracellular matrix protein" /db_xref="PID:g786119" /translation="MKLHCCLFTLVASIIVPAAFVLEDVDFDQMVSLEANRSSYNASF PSSFELSASSHSDDDVIIAKEGTSVSIECLLTASHYEDVHWHNSKGQQLDGRSRGGKW LVSDNFLNITNVAFDDRGLYTCFVTSPIRASYSVTLRVIFTSGDMSVYYMIVCLIAFT ITLILNVTRLCMMSSHLRKTEKAINEFFRTEGAEKLQKAFEIAKRIPIITSAKTLELA KVTQFKTMEFARYIEELARSVPLPPLILNCRAFVEEMFEAVRVDDPDDLGERIKERPA LNAQGGIYVINPEMGRSNSPGGDSDDGSLNEQGQEIAVQVSVHLQSETKSIDTESQGS SHFSPPDDIGSAESNCNYKDGAYENCQL" exon 468..>1330 /gene="MFAP3" /note="G00-371-694" /number=2 BASE COUNT 373 a 295 c 283 g 379 t ORIGIN 1 cgtcgggttc tctactcaca tcttttaatc ttgaagacta gaaaatataa ctggatctgc 61 cacttgtttg gaaaatatct ctaccaagca ataaattacc cgctgtgctt ttgttgtagt 121 gtagaagttt ttgagttctc caaatctaaa caagattttg tcccattttc ccatgaagct 181 acattgttgc ttattcactt tagtggcaag tattattgtg ccagctgctt ttgttttgga 241 agatgtggac ttcgaccaaa tggtttcact ggaagcaaat cgtagttctt acaatgcatc 301 ctttccctca agctttgaac tctcagcaag ttcccactcg gatgatgacg tcatcatagc 361 caaagaggga actagcgttt caattgagtg tcttctcaca gccagtcact atgaagatgt 421 ccattggcac aattcaaaag gacagcaact ggatggcaga agcagaggtg gaaagtggtt 481 ggtttctgat aacttcctaa acatcaccaa tgtagctttt gatgaccgtg ggctctatac 541 ctgtttcgtc acctctccaa ttcgtgcctc ctactctgtc accctacgtg ttatcttcac 601 ctcgggagac atgagtgtct attacatgat tgtttgcctg attgccttta caatcacact 661 catcttgaat gtcacacggc tgtgcatgat gagcagccat cttcgcaaga ctgagaaggc 721 catcaatgag ttctttagaa ctgaaggggc tgagaaactt cagaaggcct ttgagattgc 781 aaaacgtatc cccatcatta cctcagccaa aactctggag ctcgccaaag tcacacaatt 841 taagaccatg gagtttgctc gttatattga agaactggca agaagtgtcc ctcttccacc 901 tcttattcta aactgtcgag cctttgttga ggagatgttt gaggctgtgc gagtggatga 961 ccctgatgac ctgggtgaaa gaattaaaga gagacctgcc ttgaatgctc aaggtggcat 1021 ctatgtcatt aacccagaga tgggacggag taattcacca ggaggagatt cagatgatgg 1081 ctctctgaat gaacaaggcc aggaaatagc agttcaggtt tctgtccacc ttcagtcaga 1141 aaccaaaagt attgatacag agtctcaagg cagcagtcat ttcagtccac ctgatgatat 1201 aggatctgca gaatctaact gtaactacaa agatggggca tatgaaaact gtcagctgta 1261 acctacaatg ctgtaaccca gtacctacaa aatcagctcg ctctcagaaa aggaacctgt 1321 ttcttagaag // LOCUS HUMMGC24 2427 bp mRNA PRI 10-FEB-1993 DEFINITION Human mRNA for MGC-24, complete cds. ACCESSION D14043 NID g219924 KEYWORDS MGC-24. SOURCE Homo sapiens (library: lambda gt11 cDNA) gastric carcinoma cell cDNA to mRNA, cell line KATO-III, clone KP10. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2427) AUTHORS Masuzawa,Y., Miyauchi,T., Hamanoue,M., Ando,S., Yoshida,J., Takao,S., Shimazu,H., Adachi,M. and Muramatsu,T. TITLE A novel core protein as well as polymorphic epithelial mucin carry peanut agglutinin binding sites in human gastric carcinoma cells: sequence analysis and examination of gene expression JOURNAL J. Biochem. 112 (5), 609-615 (1992) MEDLINE 93123189 REFERENCE 2 (bases 1 to 2427) AUTHORS Masuzawa,Y. TITLE Direct Submission JOURNAL Submitted (11-JAN-1993) to the DDBJ/EMBL/GenBank databases. Yasushi Masuzawa, Japan Immunoresearch Laboratories Co., Ltd; 351-1 Nishiyokote-cho, Takasaki, Gunma 370, Japan (Tel:0273-53-1411, Fax:0273-53-1770) COMMENT Submitted (11-JAN-1993) to DDBJ by: Yasushi Masuzawa Japan Immunoresearch Lab. Co., Ltd. 351-1 Nishiyokote-cho Takasaki, Gunma 370 Japan Phone: 0273-53-1411 Fax: 0273-53-1770. FEATURES Location/Qualifiers source 1..2427 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KATO-III" /cell_type="gastric carcinoma cell" /clone_lib="lambda gt11 cDNA" CDS 80..649 /codon_start=1 /product="MGC-24 precursor" /db_xref="PID:d1003636" /db_xref="PID:g219925" /translation="MSRLSRSLLWAATCLGVLCVLSADKNTTQHPNVTTLAPISNVTS APVTSLPLVTTPAPETCEGRNSCVSCFNVSVVNTTCFWIECKDESYCSHNSTVSDCQV GNTTDFCSVSTATPVPTANSTAKPTVQPSPSTTSKTVTTSGTTNNTVTPTSQPVRKST FDAASFIGGIVLVLEIRCHTRNYIPDLKK" sig_peptide 80..148 mat_peptide 149..646 /product="MGC-24" polyA_signal 2388..2393 BASE COUNT 697 a 466 c 443 g 821 t ORIGIN 1 gcggcgccgc aggggattga ggggttgact gagcgttgcg agccttagct ttctcccgaa 61 cgccagcgct gaggacacga tgtcgcggct ctcccgctca ctgctttggg ccgccacctg 121 cctgggcgtg ctctgcgtgc tgtccgcgga caagaacacg acccagcacc cgaacgtgac 181 gactttagcg cccatctcca acgtaacctc ggcgccggtg acgtccctcc cgctggtcac 241 cactccggca ccagaaacct gtgaaggtcg aaacagctgc gtttcctgtt ttaatgttag 301 cgttgttaat actacctgct tttggataga atgtaaagat gagagctatt gttcacataa 361 ctcaacagtt agtgattgtc aagtggggaa cacgacagac ttctgttccg tttccacggc 421 cactccagtg ccaacagcca attctacagc taaacccaca gttcagccct ccccttctac 481 aacttccaag acagttacta catcaggtac aacaaataac actgtgactc caacctcaca 541 acctgtgcga aagtctacct ttgatgcagc cagtttcatt ggaggaattg tcctggtctt 601 ggaaataaga tgccacacaa ggaactacat tccagattta aagaaatgaa aggataccat 661 tagtgtgtat aacagattat tgttcatact tgtaaagcac cttatgtcat tgagaatata 721 aagaacagtg ccttagaaga cagtgaaagg taagctctag cttaatgtct atgatttgtt 781 ctttgacatt aaggaaggta aggattggtc agaggatgta acttgatgtg agcagtagta 841 aacctgtttt agatatcata ctgttaatat tttattgaaa atttatttca gagcggagaa 901 acttaagcta aagtctgtta tacagaattg aaagccttcg tatcttgaac ctcccaacat 961 ttttcttatg gctgttgaaa agtatagagc taaattgatt taattacact ttcctttgta 1021 ctttaaaaaa aagtatgcta gcactattgt accttgaaag gatttccacc agactgtctt 1081 gagtagtgac ttctttggtg aggcaagaag gatatacatt attttagaat catttactat 1141 ttaaatgaga caatcatatt attttagaat catttatttt aaatgagaca atcattttaa 1201 gttttaagat aacagaagtg accaatgtaa tttcacaaca cctaaggatt ttttggttga 1261 tcaggttact gtagattttt actgattgtc ctggatgaat agactgtgct ttttcttttt 1321 ctctcccttc cttcttggtt tcccatagta taataagcat gcatacttta acttctatag 1381 ttttctcctt tagagggtct tcttcagttt tagaggttta cttctccctt gcctttgact 1441 cattggacta gtgcagaggc tttaagtagt ttaaaatggg cttttgcttt tctaggtcat 1501 taacgttttt tatttagttt ctttagccaa tagtggctga gtttcgcact tgattttcaa 1561 tattttatag taagaaatga caaactgctt tggttcattt cataaacaaa ctctgcattt 1621 agataactat taaaggttgt taagatgaag atttactgtt tctttgttac tcgttggtac 1681 agctgtttgt tttacttgca catttgtaga tatacttaat gttttcaagt gccttaattg 1741 tttaaaatct ctggcttcaa agtttcttgg ggaaaggtcg gtttacctca cattttttgt 1801 ttccattagt aatattctag gtacctcaca aaatgtatta tggtgccatg gctgttagtt 1861 tttagtgagt gctgtaggat taattcgaaa ataggcagaa ttccattcct cccaaggtgg 1921 caaaaattag ctatactgat gtaattgtca tttacctggg tatgaattcc ctgacacaca 1981 ttcatgtcaa catatgtagc aaattttgtg aaaacacaac aatttgaagc ttctgtaatt 2041 ttgagcactg ctctaacaac aagcataata taaaattagt tagattttgc aagtctacaa 2101 atgagctctt gcaacagaac tcacagcctt tttacttttt tcccctaact ttagcaatgt 2161 agtatcttga gccattaatt tttgggtttt tttaaaatcc agaaggtata tagaaacctt 2221 ttcagatttt tcatctgatt tgttcttgca gatgttcttc tatcaaatac cttattttac 2281 cttacagata tttgttgcac aggcagatac tgctgtattt agacatttct atttcagttc 2341 attaaaaact gcaaaaccaa tctgtatcat gtaccaaact gacttaaaat aaatctacat 2401 gtttattgaa ttaaaacaaa aaaaaaa // LOCUS HUMMGPHB 3816 bp mRNA PRI 07-JAN-1995 DEFINITION Human brain glycogen phosphorylase mRNA, complete cds. ACCESSION J03544 NID g187596 KEYWORDS glycogen phosphorylase. SOURCE Human astrocytoma cell line U251, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3816) AUTHORS Newgard,C.B., Littman,D.R., van Genderen,C., Smith,M. and Fletterick,R.J. TITLE Human brain glycogen phosphorylase. Cloning, sequence analysis, chromosomal mapping, tissue expression, and comparison with the human liver and muscle isozymes JOURNAL J. Biol. Chem. 263 (8), 3850-3857 (1988) MEDLINE 88153685 COMMENT Draft entry and printed copy of sequence for [1] kindly provided by C.B.Newgard, 04-DEC-1987. FEATURES Location/Qualifiers source 1..3816 /organism="Homo sapiens" /db_xref="taxon:9606" /map="20" mRNA <1..3816 /note="brain glycogen phosphorylase mRNA" gene 80..2671 /gene="PYGB" CDS 80..2671 /gene="PYGB" /note="brain glycogen phosphorylase" /codon_start=1 /db_xref="GDB:G00-120-326" /db_xref="PID:g307200" /translation="MGEPLTDSEKRKQISVRGLAGLGDVAEVRKSFNRHLHFTLVKDR NVATPRDYFFALAHTVRDHLVGRWIRTQQHYYERDPKRIYYLSLEFYMGRTLQNTMVN LGLQNACDEAIYQLGLDLEELEEIEEDAGLGNGGLGRLAACFLDSMATLGLAAYGYGI RYEFGIFNQKIVNGWQVEEADDWLRYGNPWEKARPEYMLPVHFYGRVEHTPDGVKWLD TQVVLAMPYDTPVPGYKNNTVNTMRLWSARAPNDFKLQDFNVGDYIEAVLDRNLAENI SRVLYPNDNFFEGKELRLKQEYFVVGATLQDIIRRFKSSKFGCRDPVRTCFETFPDKV AIQLNDTHPALSIPELMRILVDVEKVDWDKAWEITKKTCAYTNHTVLPEALERWPVSM FEKLLPRHLEIIYAINQRHLDHVAALFPGDVDRLRRMSVIEEGDCKRINMAHLCVIGS HAVNGVARIHSEIVKQSVFKDFYELEPEKFQNKTNGITPRRWLLLCNPGLADTIVEKI GEEFLTDLSQLKKLLPLVSDEVFIRDVAKVKQENKLKFSAFLEKEYKVKINPSSMFDV HVKRIHEYKRQLLNCLHVVTLYNRIKRDPAKAFVPRTVMIGGKAAPGYHMAKLIIKLV TSIGDVVNHDPVVGDRLKVIFLENYRVSLAEKVIPAADLSQQISTAGTEASGTGNMKF MLNGALTIGTMDGANVEMAEEAGAENLFIFGLRVEDVEALDRKGYNAREYYDHLPELK QAVDQISSGFFSPKEPDCFKDIVNMLMHHDRFKVFADYEAYMQCQAQVDQLYRNPKEW TKKVIRNIACSGKFSSDRTITEYAREIWGVEPSDLQLQHLPHPEWESGGATCWAPPEL CTHLAMY" BASE COUNT 795 a 1085 c 1141 g 795 t ORIGIN 282 bp upstream of BamHI site; chromosome 20. 1 gagcagctgc accatcccgg cgttcgcgtg tgccgccgct ttcctcctcc atctcttttc 61 ctccgcctcc gccggcgcga tgggcgaacc gctgacggac agcgagaagc ggaagcagat 121 cagcgtgcgc ggcctggcgg ggctaggcga cgtggccgag gtgcggaaga gcttcaaccg 181 gcacttgcac ttcacgctgg tcaaggaccg caatgtggcc acgccccgcg actacttctt 241 cgcgctggcg cacacggtgc gcgaccacct cgtgggccgc tggatccgca cgcagcagca 301 ctactacgag cgcgacccca agcgaattta ttatctttcc ctggaattct acatgggtcg 361 cacgctgcag aacacgatgg tgaacctggg ccttcagaat gcctgcgatg aagccatcta 421 tcagttgggg ttagacttgg aggaactcga ggagatagaa gaagatgctg gccttgggaa 481 tggaggcctg gggaggctgg cagcgtgttt ccttgactca atggctacct tgggcctggc 541 agcatacggc tatggaatcc gctatgaatt tgggattttt aaccagaaga ttgtcaatgg 601 ctggcaggta gaggaggccg atgactggct gcgctacggc aacccctggg agaaagcgcg 661 gcctgagtat atgcttcccg tgcacttcta cggacgcgtg gagcacaccc ccgacggcgt 721 gaagtggctg gacacacagg tggtgctggc catgccctac gacaccccag tgcccggcta 781 caagaacaac accgtcaaca ccatgcggct gtggtccgca agggctccca acgacttcaa 841 gctgcaggac ttcaacgtgg gagactacat cgaggcggtc ctggaccgga acttggctga 901 gaacatctcc agggtcctgt atccaaatga taacttcttt gaggggaagg agctgcggct 961 gaagcaggag tacttcgtgg tgggcgccac gctccaggac atcatccgcc gcttcaagtc 1021 gtccaagttc ggctgccggg accctgtgag aacctgtttc gagacgttcc cagacaaggt 1081 ggccatccag ctgaacgaca cccaccccgc cctctccatc cctgagctca tgcggatcct 1141 ggtggacgtg gagaaggtgg actgggacaa ggcctgggaa atcacgaaga agacctgtgc 1201 atacaccaac cacactgtgc tgcctgaggc cttggagcgc tggcccgtgt ccatgtttga 1261 gaagctgctg ccgcggcacc tggagataat ctatgccatc aaccagcggc acctggacca 1321 cgtggccgcg ctgtttcccg gcgatgtgga ccgcctgcgc aggatgtctg tgatcgagga 1381 gggggactgc aagcggatca acatggccca cctgtgtgtg attgggtccc atgctgtcaa 1441 tggtgtggcg aggatccact cggagatcgt gaaacagtcg gtctttaagg atttttatga 1501 actggagcca gagaagttcc agaataagac caatggcatc accccccgcc ggtggctgct 1561 gctgtgcaac ccggggctgg ccgataccat cgtggagaaa attggggagg agttcctgac 1621 tgacctgagc cagctgaaga agctgctgcc gctggtcagt gacgaggtgt tcatcaggga 1681 cgtggccaag gtcaaacagg agaacaagct caagttctcg gccttcctgg agaaggagta 1741 caaggtgaag atcaacccct cctccatgtt cgatgtgcat gtgaagagga tccacgagta 1801 caagcggcag ctgctcaact gcctgcacgt cgtcaccctg tacaatcgaa tcaagagaga 1861 cccggccaag gcttttgtgc ccaggactgt tatgattggg ggcaaggcag cgcccggtta 1921 ccacatggcc aagctgatca tcaagttggt cacctccatc ggcgacgtcg tcaatcatga 1981 cccagttgtg ggtgacaggt tgaaagtgat cttcctggag aactaccgtg tgtccttggc 2041 tgagaaagtg atcccggccg ctgatctgtc gcagcagatc tccactgcag gcaccgaggc 2101 ctcaggcaca ggcaacatga agttcatgct caacggggcc ctcaccatcg gcaccatgga 2161 cggcgccaac gtggagatgg ccgaggaggc cggggccgag aacctcttca tcttcggcct 2221 gcgggtggag gatgtcgagg ccttggaccg gaaagggtac aatgccaggg agtactacga 2281 ccacctgccc gagctgaagc aggccgtgga ccagatcagc agtggctttt tttctcccaa 2341 ggagccagac tgcttcaagg acatcgtgaa catgctgatg caccatgaca ggttcaaggt 2401 gtttgcagac tatgaagcct acatgcagtg ccaggcacag gtggaccagc tgtaccggaa 2461 ccccaaggag tggaccaaga aggtcatcag gaacatcgcc tgctcgggca agttctccag 2521 tgaccggacc atcacggagt atgcacggga gatctggggt gtggagccct ccgacctgca 2581 gcttcagcac ctgccccacc cagagtggga gtcaggtgga gccacctgct gggctccccc 2641 agaactttgc acacatcttg ctatgtatta gccgatggct ttagtgttga gcctctggat 2701 tctggggtct gcggcagtgg ccatagtgaa gcctgggaat gagtgttact gcagcatctg 2761 gctgccagcc acagggaagg gccaagcccc atgtagcccc agtcatcctg cccagccctg 2821 cctcctggcc atgccgggag gggtcggatc ctctaggcat cgcttcacag ccccctgccc 2881 cctgccctct gtcctggctc tgcacctggt atatgggtca tggaccagat ggggctttcc 2941 ctttgtagcc atccaatggg cattgtgtgg gtgcttggaa cccgggatga ctgaggggga 3001 cactggagtg ggtgcttgtg tctgctgtct cagaggcctt ggtcaggatg aagttggctg 3061 acacagctta gcttggtttt gcttattcaa aagagaaaat aactacacat ggaaatgaaa 3121 ctagtgaagc cttttcttgt tttagaatga aaattgtact tggtcacttt tgtgcttgag 3181 gaggcccatt ttctagcctg gcaggggcag gtcctgtgcc ctcccgcttg actcctgctg 3241 tgtcctgagg tgcatttcct gtttgttaca cacaagggcc aggctccatt ctccctccct 3301 ttccaccagt gccacagcct cgtctggaaa aaggaccagg ggtcccggag gaacccattt 3361 gtgctctgct tggacagcag gcctggcact gggaggtggg gtgagcccct cacagccttg 3421 cccctcccca aggctcgaac ctgcctccca ttgcccaaga gagaggggca gggaacaggc 3481 tactgtcctt ccctgtggaa ttgccgagaa atctagcacc ttgcatgctg gatctgggct 3541 gcggggaggc tctttttctc cctggcctcc agtgcccacc aggaggatct gcgcacggtg 3601 cacagcccac cagagcacta cagcctttta ttgagtgggg caagtgctgg gctgtggtcg 3661 tgccctgaca gcatcttccc caggcagcgg ctctgtggag gaggccatac tcccctagtt 3721 ggccactggg gccaccaccc tgaccaccac tgtgcccctc attgttactg ccttgtgaga 3781 taaaaactga ttaaaccttt gtggctttgg ttggtt // LOCUS HUMMGRS5A 4518 bp mRNA PRI 09-JUL-1996 DEFINITION Human mRNA for metabotropic glutamate receptor subtype 5a, complete cds. ACCESSION D28538 NID g1408051 KEYWORDS mGluR5a; metabotropic glutamate receptor. SOURCE Homo sapiens brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4518) AUTHORS Minakami,R., Katsuki,F., Yamamoto,T., Nakamura,K. and Sugiyama,H. TITLE Molecular cloning and the functional expression of two isoforms of human metabotropic glutamate receptor subtype 5 JOURNAL Biochem. Biophys. Res. Commun. 199 (3), 1136-1143 (1994) MEDLINE 94197696 REFERENCE 2 (bases 1 to 4518) AUTHORS Katsuki,F. TITLE Direct Submission JOURNAL Submitted (14-FEB-1994) to the DDBJ/EMBL/GenBank databases. Fujika Katsuki, Faculty of Science, Kyushu University, Department of Biology; 6-10-1 Hakozaki, Higashi-ku, Fukuoka, Fukuoka 812, Japan (Tel:092-642-2630, Fax:092-642-2645) COMMENT Sequence updated (03-Jul-1996) by: Fujika Katsuki. FEATURES Location/Qualifiers source 1..4518 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" 5'UTR <1..150 CDS 151..3693 /codon_start=1 /product="metabotropic glutamate receptor subtype 5a (mGluR5a)" /db_xref="PID:d1006441" /db_xref="PID:g1408052" /translation="MVLLLILSVLLLKEDVRGSAQSSERRVVAHMPGDIIIGALFSVH HQPTVDKVHERKCGAVREQYGIQRVEAMLHTLERINSDPTLLPNITLGCEIRDSCWHS AVALEQSIEFIRDSLISSEEEEGLVRCVDGSSSSFRSKKPIVGVIGPGSSSVAIQVQN LLQLFNIPQIAYSATSMDLSDKTLFKYFMRVVPSDAQQARAMVDIVKRYNWTYVSAVH TEGNYGESGMEAFKDMSAKEGICIAHSYKIYSNAGEQSFDKLLKKLTSHLPKARVVAC FCEGMTVRGLLMAMRRLGLAGEFLLLGSDGWADRYDVTDGYQREAVGGITIKLQSPDV KWFDDYYLKLRPETNHRNPWFQEFWQHRFQCRLEGFPQENSKYNKTCNSSLTLKTHHV QDSKMGFVINAIYSMAYGLHNMQMSLCPGYAGLCDAMKPIDGRKLLESLMKTNFTGVS GDTILFDENGDSPGRYEIMNFKEMGKDYFDYINVGSWDNGELKMDDDEVWSKKSNIIR SVCSEPCEKGQIKVIRKGEVSCCWTCTPCKENEYVFDEYTCKACQLGSWPTDDLTGCD LIPVQYLRWGDPEPIAAVVFACLGLLATLFVTVVFIIYRDTPVVKSSSRELCYIILAG ICLGYLCTFCLIAKPKQIYCYLQRIGIGLSPAMSYSALVTKTNRIARILAGSKKKICT KKPRFMSACAQLVIAFILICIQLGIIVALFIMEPPDIMHDYPSIREVYLICNTTNLGV VTPLGYNGLLILSCTFYAFKTRNVPANFNEAKYIAFTMYTTCIIWLAFVPIYFGSNYK IITMCFSVSLSATVALGCMFVPKVYIILAKPERNVRSAFTTSTVVRMHVGDGKSSSAA SRSSSLVNLWKRRGSSGETLSSNGKSVTWAQNEKSSRGQHLWQRLSIHINKKENPNQT AVIKPFPKSTESRGLGAGAGAGGSAGGVGATGGAGCAGAGPGGPESPDAGPKALYDVA EAEEHFPAPARPRSPSPISTLSHRAGSASRTDDDVPSLHSEPVARSSSSQGSLMEQIS SVVTRFTANISELNSMMLSTAAPSPGVGAPLCSSYLIPKEIQLPTTMTTFAEIQPLPA IEVTGGAQPAAGAQAAGDAARESPAAGPEAAAAKPDLEELVALTPPSPFRDSVDSGST TPNSPVSESALCIPSSPKYDTLIIRDYTQSSSSL" 3'UTR 3694..>4518 BASE COUNT 1114 a 1170 c 1145 g 1089 t ORIGIN 1 acaaaatggt cctttagaaa atacatctga attgctggct aatttcttga tttgcgactc 61 aacgtaggac atcgcttgtt cgtagctatc agaaccctcc tgaattttcc ccaccatgct 121 atctttattg gcttgaactc ctttcctaaa atggtccttc tgttgatcct gtcagtctta 181 cttttgaaag aagatgtccg tgggagtgca cagtccagtg agaggagggt ggtggctcac 241 atgccgggtg acatcattat tggagctctc ttttctgttc atcaccagcc tactgtggac 301 aaagttcatg agaggaagtg tggggcggtc cgtgaacagt atggcattca gagagtggag 361 gccatgctgc ataccctgga aaggatcaat tcagacccca cactcttgcc caacatcaca 421 ctgggctgtg agataaggga ctcctgctgg cattcggctg tggccctaga gcagagcatt 481 gagttcataa gagattccct catttcttca gaagaggaag aaggcttggt acgctgtgtg 541 gatggctcct cctcttcctt ccgctccaag aagcccatag taggggtcat tgggcctggc 601 tccagttctg tagccattca ggtccagaat ttgctccagc ttttcaacat acctcagatt 661 gcttactcag caaccagcat ggatctgagt gacaagactc tgttcaaata tttcatgagg 721 gttgtgcctt cagatgctca gcaggcaagg gccatggtgg acatagtgaa gaggtacaac 781 tggacctatg tatcagccgt gcacacagaa ggcaactatg gagaaagtgg gatggaagcc 841 ttcaaagata tgtcagcgaa ggaagggatt tgcatcgccc actcttacaa aatctacagt 901 aatgcagggg agcagagctt tgataagctg ctgaagaagc tcacaagtca cttgcccaag 961 gcccgggtgg tggcctgctt ctgtgagggc atgacggtga gaggtctgct gatggccatg 1021 aggcgcctgg gtctagcggg agaatttctg cttctgggca gtgatggctg ggctgacagg 1081 tatgatgtga cagatggata tcagcgagaa gctgttggtg gcatcacaat caagctccaa 1141 tctcccgatg tcaagtggtt tgatgattat tatctgaagc tccggccaga aacaaaccac 1201 cgaaaccctt ggtttcaaga attttggcag catcgttttc agtgccgact ggaagggttt 1261 ccacaggaga acagcaaata caacaagact tgcaatagtt ctctgactct gaaaacacat 1321 catgttcagg attccaaaat gggatttgtg atcaacgcca tctattcgat ggcctatggg 1381 ctccacaaca tgcagatgtc cctctgccca ggctatgcag gactctgtga tgccatgaag 1441 ccaattgatg gacggaaact tttggagtcc ctgatgaaaa ccaattttac tggggtttct 1501 ggagatacga tcctattcga tgagaatgga gactctccag gaaggtatga aataatgaat 1561 ttcaaggaaa tgggaaaaga ttactttgat tatatcaacg ttggaagttg ggacaatgga 1621 gaattaaaaa tggatgatga tgaagtatgg tccaagaaaa gcaacatcat cagatctgtg 1681 tgcagtgaac catgtgagaa aggccagatc aaggtgatcc gaaagggaga agtcagctgt 1741 tgttggacct gtacaccttg taaggagaat gagtatgtct ttgatgagta cacatgcaag 1801 gcatgccaac tggggtcttg gcccactgat gatctcacag gttgtgactt gatcccagta 1861 cagtatcttc gatggggtga ccctgaaccc attgcagctg tggtgtttgc ctgccttggc 1921 ctcctggcca ccctgtttgt tactgtagtc ttcatcattt accgtgatac accagtagtc 1981 aagtcctcaa gcagggaact ctgctacatt atccttgctg gcatctgcct gggctactta 2041 tgtaccttct gcctcattgc gaagcccaaa cagatttact gctaccttca gagaattggc 2101 attggtctct ccccagccat gagctactca gcccttgtaa caaagaccaa ccgtattgca 2161 aggatcctgg ctggcagcaa gaagaagatc tgtaccaaaa agcccagatt catgagtgcc 2221 tgtgcccagc tagtgattgc tttcattctc atatgcatcc agttgggcat catcgttgcc 2281 ctctttataa tggagcctcc tgacataatg catgactacc caagcattcg agaagtctac 2341 ctgatctgta acaccaccaa cctaggagtt gtcactccac ttggatacaa tggattgttg 2401 attttgagct gcaccttcta tgcgttcaag accagaaatg ttccagctaa cttcaacgag 2461 gccaagtata tcgccttcac aatgtacacg acctgcatta tatggctagc ttttgtgcca 2521 atctactttg gcagcaacta caaaatcatc accatgtgtt tctcggtcag cctcagtgcc 2581 acagtggccc taggctgcat gtttgtgccg aaggtgtaca tcatcctggc caaaccagag 2641 agaaacgtgc gcagcgcctt caccacatct accgtggtgc gcatgcatgt aggggatggc 2701 aagtcatcct ccgcagccag cagatccagc agcctagtca acctgtggaa gagaaggggc 2761 tcctctgggg aaaccttaag ttccaatgga aaatccgtca cgtgggccca gaatgagaag 2821 agcagccggg ggcagcacct gtggcagcgc ctgtccatcc acatcaacaa gaaagaaaac 2881 cccaaccaaa cggccgtcat caagcccttc cccaagagca cggagagccg tggcctgggc 2941 gctggcgctg gcgcaggcgg gagcgctggg ggcgtggggg ccacgggcgg tgcgggctgc 3001 gcaggcgccg gcccaggcgg gcccgagtcc ccagacgccg gccccaaggc gctgtatgat 3061 gtggccgagg ctgaggagca cttcccggcg cccgcgcggc cgcgctcacc gtcgcccatc 3121 agcacgctga gccaccgcgc gggctcggcc agccgcacgg acgacgatgt gccgtcgctg 3181 cactcggagc ctgtggcgcg cagcagctcc tcgcagggct ccctcatgga gcagatcagc 3241 agtgtggtca cccgcttcac ggccaacatc agcgagctca actccatgat gctgtccacc 3301 gcggccccca gccccggcgt cggcgccccg ctctgctcgt cctacctgat ccccaaagag 3361 atccagttgc ccacgaccat gacgaccttt gccgaaatcc agcctctgcc ggccatcgaa 3421 gtcacgggcg gcgcgcagcc cgcggcaggg gcgcaggcgg ctggggacgc ggcccgggag 3481 agccccgcgg ccggtcccga ggctgcggcc gccaagccag acctggagga gctggtggct 3541 ctcaccccgc cgtccccctt cagagactcg gtggactcgg ggagcacaac ccccaactcg 3601 ccagtgtccg agtcggccct ctgtatcccg tcgtctccca aatatgacac tcttatcata 3661 agagattaca ctcagagctc ctcgtcgttg tgaatgtccc tggaaagcac gccggcctgc 3721 gcgtgcggag cggagccccc cgtgttcaca cacacacaat ggcaagcata gtcgcctggt 3781 tacggcccag ggggaatatg ccaagggacc ccttaatgga aacacagatc agtagtgcta 3841 tctcatgaca accacaagaa accgacgaca aatcttttgc gagattttct tctagtggct 3901 tagaaacatg gcttttaaga aacacggtga tatctttgag ggtgacaagg cgtctcttca 3961 aacagttcca taccaactgc tttgctctag ggaagcagtg cgtgtgaaac agcgtaacgg 4021 agggtgaaga gcatagttaa taagcaactg taaaaagttt tatttgttta ctttaattct 4081 tttcccctgt aaaaagtttt atttgtttac tttaattctt ttcccagaaa agagtctttg 4141 attcaccaaa catgaatgta cattttctaa caaactcaaa atctgggacc aaaacatcaa 4201 cttttttctt tcttttttct ttctttttgt tttttctttc ctgtaaagac cttgaaaaga 4261 ccttgaaaag cagtaacttg ggtccagtat ttacggaggc gttgtgaatg tgtcccatgc 4321 ataacacact actggatagt gagtcgtgcg ctaatgtact acgtagggct tctaccagag 4381 attttcctct ccaattgggt tgtgaaatac tcttccaaaa gcctgcatcg gggattccac 4441 ctacttattt cagattcacc tccattaacc aagaaaacca gtggaagatt tcttgactat 4501 ttcaccatgt tgccaatc // LOCUS HUMMHBA123 1093 bp mRNA PRI 07-JAN-1995 DEFINITION Human MHC protein homologous to chicken B complex protein mRNA, complete cds. ACCESSION M24194 NID g187701 KEYWORDS G protein beta subunit; major histocompatibility complex. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1093) AUTHORS Guillemot,F., Billault,A. and Auffray,C. TITLE Physical linkage of a guanine nucleotide-binding protein-related gene to the chicken major histocompatibility complex JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86 (12), 4594-4598 (1989) MEDLINE 89282817 FEATURES Location/Qualifiers source 1..1093 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="JY" /cell_type="B lymphoblastoid" mRNA <1..1075 /note="MHC protein homologous to chicken B complex protein 12.3; putative" gene 96..1049 /gene="H12-3" CDS 96..1049 /gene="H12-3" /note="homologue; putative" /codon_start=1 /product="MHC B complex protein 12.3" /db_xref="PID:g307218" /translation="MTEQMTLRGTLKGHNGWVTQIATTPQFPDMILSASRDKTIIMWK LTRDETNYGIPQRALRGHSHFVSDVVISSDGQFALSGSWDGTLRLWDLTTGTTTRRFV GHTKDVLSVAFSSDNRQIVSGSRDKTIKLWNTLGVCKYTVQDESHSEWVSCVRFSPNS SNPIIVSCGWDKLVKVWNLANCKLKTNHIGHTGYLNTVTVSPDGSLCASGGKDGQAML WDLNEGKHLYTLDGGDIINALCFSPNRYWLCAATGPSIKIWDLEGKIIVDELKQEVIS TSSKAEPPQCTSLAWSADGQTLFAGYTDNLVRVWQVTIGTR" BASE COUNT 256 a 308 c 288 g 241 t ORIGIN 1 ctgcaaggcg gcggcaggag aggttgtggt gctagtttct ctaagccatc cagtgccatc 61 ctcgtcgctg cagcgacacc gctctcgccg ccgccatgac tgagcagatg acccttcgtg 121 gcaccctcaa gggccacaac ggctgggtaa cccagatcgc tactaccccg cagttcccgg 181 acatgatcct ctccgcctct cgagataaga ccatcatcat gtggaaactg accagggatg 241 agaccaacta tggaattcca cagcgtgctc tgcggggtca ctcccacttt gttagtgatg 301 tggttatctc ctcagatggc cagtttgccc tctcaggctc ctgggatgga accctgcgcc 361 tctgggatct cacaacgggc accaccacga ggcgatttgt gggccatacc aaggatgtgc 421 tgagtgtggc cttctcctct gacaaccggc agattgtctc tggatctcga gataaaacca 481 tcaagctatg gaataccctg ggtgtgtgca aatacactgt ccaggatgag agccactcag 541 agtgggtgtc ttgtgtccgc ttctcgccca acagcagcaa ccctatcatc gtctcctgtg 601 gctgggacaa gctggtcaag gtatggaacc tggctaactg caagctgaag accaaccaca 661 ttggccacac aggctatctg aacacggtga ctgtctctcc agatggatcc ctctgtgctt 721 ctggaggcaa ggatggccag gccatgttat gggatctcaa cgaaggcaaa cacctttaca 781 cgctagatgg tggggacatc atcaacgccc tgtgcttcag ccctaaccgc tactggctgt 841 gtgctgccac aggccccagc atcaagatct gggatttaga gggaaagatc attgtagatg 901 aactgaagca agaagttatc agtaccagca gcaaggcaga accaccccag tgcacttccc 961 tggcctggtc tgctgatggc cagactctgt ttgctggcta cacggacaac ctggtgcgag 1021 tgtggcaggt gaccattggc acacgctaga agtttatggc agagctttac aaataaaaaa 1081 aaaatggctt ttc // LOCUS HUMMHDRA 1199 bp mRNA PRI 11-JUN-1993 DEFINITION human hla-dr antigen alpha-chain mrna & ivs fragments. ACCESSION J00194 NID g188231 KEYWORDS antigen; histocompatibility antigen. SOURCE human cdna clones; source not explicitly given. genomic dna from human placental cosmid library. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1199) AUTHORS Lee,J.S., Trowsdale,J., Travers,P.J., Carey,J., Grosveld,F., Jenkins,J. and Bodmer,W.F. TITLE sequence of an hla-dr alpha-chain cdna clone and intron-exon organization of the corresponding gene JOURNAL Nature 299, 750-752 (1982) MEDLINE 83013020 COMMENT a polyadenylation signal is found between 1168-1173. ivs1 3' end: tttcttgcctttcag(aagaa...(exon 2)). ivs2 5' end: ((exon 2)...caatg)gtacctccctctctg. ivs2 3' end: tcatgtgtcccccag(tacct...(exon 3)). ivs3 5' end: ((exon 3)...ctggg)gtatggaccaacact. ivs3 3' end: tattccccag(agttt...(exon 4)). ivs4 5' end: (exon 4)...tggag)gtgagttaggtgtgg. ivs4 3' end: tgtgtcttgctatag(gtgat...(exon5)). FEATURES Location/Qualifiers source 1..1199 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..>1199 /note="signal peptide" CDS 27..791 /note="hla-dr antigen alpha chain" /codon_start=1 /db_xref="PID:g307264" /translation="MAISGVPVLGFFIIAVLMSAQESWAIKEEHVIIQAEFYLNPDQS GEFMFDFDGDEIFHVDMAKKETVWRLEEFGRFASFEAQGALANIAVDKANLEIMTKRS NYTPITNVPPEVTVLTNSPVELREPNVLICFIDKFTPPVVNVTWLRNGKPVTTGVSET VFLPREDHLFRKFHYLPFLPSTEDVYDCRVEHWGLDEPLLKHWEFDAPSPLPETTENV VCALGLTVGLVGIIIGTIFIIKGVRKSNAAERRGPL" sig_peptide 30..101 /note="signal peptide" BASE COUNT 302 a 293 c 280 g 324 t ORIGIN about 100bp upstream from the sau3a site on the cdna. 1 actcccaacg agcgcccaag aagaaaatgg ccataagtgg agtccctgtg ctaggatttt 61 tcatcatagc tgtgctgatg agcgctcagg aatcatgggc tatcaaagaa gaacatgtga 121 tcatccaggc cgagttctat ctgaatcctg accaatcagg cgagtttatg tttgactttg 181 atggtgatga gattttccat gtggatatgg caaagaagga gacggtctgg cggcttgaag 241 aatttggacg atttgccagc tttgaggctc aaggtgcatt ggccaacata gctgtggaca 301 aagccaacct ggaaatcatg acaaagcgct ccaactatac tccgatcacc aatgtacctc 361 cagaggtaac tgtgctcacg aacagccctg tggaactgag agagcccaac gtcctcatct 421 gtttcatcga caagttcacc ccaccagtgg tcaatgtcac gtggcttcga aatggaaaac 481 ctgtcaccac aggagtgtca gagacagtct tcctgcccag ggaagaccac cttttccgca 541 agttccacta tctccccttc ctgccctcaa ctgaggacgt ttacgactgc agggtggagc 601 actggggctt ggatgagcct cttctcaagc actgggagtt tgatgctcca agccctctcc 661 cagagactac agagaacgtg gtgtgtgccc tgggcctgac tgtgggtctg gtgggcatca 721 ttattgggac catcttcatc atcaagggag tgcgcaaaag caatgcagca gaacgcaggg 781 ggcctctgta aggcacatgg aggtgatgat gtttcttaga gagaagatca ctgaagaaac 841 ttctgcttta atgactttac aaagctggca atattacaat ccttgacctc agtgaaagca 901 gtcatcttca gcgttttcca gccctatagc caccccaagt gtggttatgc ctcctcgatt 961 gctccgtact ctaacatcta gctggctttc cctgtctatt gccttttcct gtatctattt 1021 tcctctattt cctatcattt tattatcacc atgcaatgcc tctggaataa aacatacagg 1081 agtctgtctc tgctatggaa tgccccatgg ggctctcttg tgtacttatt gtttaaggtt 1141 tcctcaaact gtgatttttc tgaacacaat aaactatttt gatgatcttg ggtggaaaa // LOCUS HUMMHHSPHO 3330 bp DNA PRI 07-MAR-1995 DEFINITION Human MHC class III HSP70-HOM gene (HLA), complete cds. ACCESSION M59829 M34268 NID g188491 KEYWORDS class III gene; complement system protein; heat shock-induced protein; major histocompatibility complex. SOURCE Human DNA, clone H92. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3330) AUTHORS Milner,C.M. and Campbell,R.D. TITLE Structure and expression of the three MHC-linked HSP70 genes JOURNAL Immunogenetics 32 (4), 242-251 (1990) MEDLINE 91055806 FEATURES Location/Qualifiers source 1..3330 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="H92" /haplotype="HLA:A2,B7,C2C,Bfs,C4A3,C4BQ0,DR2" gene 960..2885 /gene="HSP70-HOM" CDS 960..2885 /gene="HSP70-HOM" /codon_start=1 /product="heat shock-induced protein" /db_xref="PID:g188492" /translation="MATAKGIAIGIDLGTTYSCVGVFQHGKVEIIANDQGNRTTPSYV AFTDTERLIGDAAKNQVAMNPQNTVFDAKRLIGRKFNDPVVQADMKLWPFQVINEGGK PKVLVSYKGENKAFYPEEISSMVLTKLKETAEAFLGHPVTNAVITVPAYFNDSQRQAT KDAGVIAGLNVLRIINEPTAAAIAYGLDKGGQGERHVLIFDLGGGTFDVSILTIDDGI FEVKATAGDTHLGGEDFDNRLVSHFVEEFKRKHKKDISQNKRAVRRLRTACERAKRTL SSSTQANLEIDSLYEGIDFYTSITRARFEELCADLFRGTLEPVEKALRDAKMDKAKIH DIVLVGGSTRIPKVQRLLQDYFNGRDLNKSINPDEAVAYGAAVQAAILMGDKSEKVQD LLLLDVAPLSLGLETVGGVMTALIKRNSTIPPKQTQIFTTYSDNQPGVLIQVYEGERA MTKDNNLLGRFDLTGIPPAPRGVPQIEVTFDIDANGILNVTATDKSTGKVNKITITND KGRLSKEEIERMVLDAEKYKAEDEVQREKIAAKNALESYAFNMKSVVSDEGLKGKISE SDKNKILDKCNELLSWLEVNQLAEKDEFDHKRKELEQMCNPIITKLYQGGCTGPACGT GYVPGRPATGPTIEEVD" BASE COUNT 951 a 738 c 867 g 774 t ORIGIN Chromosome 6p21.3. 1 ggatcctatg agcctgggag gtcaggactg cagtgagcca tgattacacc actgcagtgc 61 agcctgcgtg acaaaacgag accctgtctc taaaaaatga gaaaaaaaaa tggttgttac 121 caggcgataa agggagggga aaacgggagt tacttaatga gtatacagtt tcagttttgc 181 gagatgaaca gaattctgga aattggttga acaccgctgt gattgaactc actaccaaac 241 tctacactta aaaatggtta agatggtaca atttgtatgt attttaccac aataaaaaat 301 aaaaaaaagg ctgggcgaga tgttcactcc tgtaatccca gtacttgggg aggctggggc 361 tgaaggatcg tttgagccct gaaggagttt gagaccagcc tgagcaacat aaggagaccc 421 catctgtaca caaaattaaa acattagcca ggcagagagc tggtcacggt ggctcacgta 481 tgtaatccca gcactttggg aggccgaggc gggcgggcgg atcacctgag gtcaggagtt 541 tgagaccagc ctggccaaca tagtgaaacc gtgaaacccc atctctacta aaaatacaaa 601 aattagctgg gcgtggtggt gccctcataa tcccagccac tcgggaggct gagacaggag 661 aatcgcttga actcaggagg tggaggttgc agtgagccta gatcacacca ctgcagtcca 721 aagcaagact ccgtctcaaa aaaaaaaaaa attagcccgg ctgttgtctc cagttattct 781 ggaggctaag gcaggaagat tgctggagcc taggagatca aagctgcagt gagctatgac 841 tgcgcctctg cactccaacc tgggtgacag aggaagaccc tgtctcaaaa aaataaataa 901 cattgaaaag gaactctccc aaaagtatct tattctttct ccataggcct cagagaacca 961 tggctactgc caagggaatc gccataggaa tcgacctggg caccacctac tcctgtgtgg 1021 gggtgttcca gcacggcaag gtggagatca tcgccaacga ccagggcaac cgcaccaccc 1081 ccagctacgt ggccttcaca gacaccgagc ggctcattgg ggatgcggcc aagaaccagg 1141 tagcaatgaa tccccagaac actgtttttg atgctaaacg tctgatcggc aggaaattta 1201 atgatcctgt tgtacaagca gatatgaaac tttggccttt tcaagtgatt aatgaaggag 1261 gcaagcccaa agtccttgtg tcctacaaag gggagaataa agctttctac cctgaggaaa 1321 tctcttcgat ggtattgact aagttgaagg agactgctga ggcctttttg ggccaccctg 1381 tcaccaatgc agtgattacc gtgccagcct atttcaatga ctctcaacgt caggctacta 1441 aggatgcagg tgtgattgct ggacttaatg tgctaagaat catcaatgag cccacggctg 1501 ctgccattgc ctatggttta gataaaggag gtcaaggaga acgacatgtc ctgatttttg 1561 atctgggtgg aggcacattt gatgtgtcaa ttctgaccat agatgatggg atttttgagg 1621 taaaggccac tgctggggac actcacctgg gtggggagga ctttgacaac aggcttgtga 1681 gccacttcgt ggaggagttc aagaggaaac acaaaaagga catcagccag aacaagcgag 1741 ccgtgaggcg gctgcgcacc gcctgcgaga gggccaagag gaccctgtcg tccagcaccc 1801 aggccaacct agaaattgat tcactttatg aaggcattga cttctataca tccatcacca 1861 gagctcgatt tgaagagttg tgtgcagacc tgtttagggg taccctggag cctgtagaaa 1921 aagcgcttcg ggatgccaag atggataagg ctaaaatcca tgacattgtt ttagtagggg 1981 gctccacccg catccccaag gtgcagcggc tgcttcagga ctacttcaat ggacgtgatc 2041 tcaacaagag catcaaccct gatgaggccg tagcatatgg ggctgcggta caagcagcca 2101 tcctgatggg ggacaagtct gagaaggtac aggacctgct gctgctggac gtggctcccc 2161 tgtccctggg tctggagacg gttgggggcg tgatgactgc cctgataaag cgcaactcca 2221 ccatcccacc caagcagaca cagattttca ccacctactc tgacaaccaa cccggggtgc 2281 tgatccaggt gtatgagggc gagagggcca tgacaaagga caacaacctg ctggggcggt 2341 ttgatctgac tggaatccct ccagcaccca ggggagttcc tcagatcgag gtgacgtttg 2401 acattgatgc caatggtatt ctcaatgtca cagccacgga caagagcacc ggcaaggtga 2461 acaagatcac catcaccaat gacaagggcc gcctgagcaa ggaggagatt gagcggatgg 2521 ttctggatgc tgagaaatat aaagctgaag atgaggtcca gagggagaaa attgctgcaa 2581 agaatgcctt agaatcctat gcttttaaca tgaagagtgt tgtgagtgat gaaggtttga 2641 agggcaagat tagtgagtct gataaaaata aaatattgga taaatgcaac gagctccttt 2701 cgtggctgga ggtcaatcaa ctggcagaga aagatgagtt tgatcataag agaaaggaat 2761 tggagcagat gtgtaaccct atcatcacaa aactctacca aggaggatgc actgggcctg 2821 cctgcggaac agggtatgtg cctggaaggc ctgccacagg ccccacaatt gaagaagtag 2881 attaattctt tttagaactg aagcatccta ggatgcctct acatgtattt cattcccctc 2941 atgttgaaac atcattatta ttcttgacca gacctgaatc taagttacca tcccttggaa 3001 attctggaga aggagtctca tgcaccacct atcacactcc ctcacatcct gtttctgact 3061 ttggaatgga ctcaggaaaa ctaggcccct ctttaaccgt gtgatgtatt tgaatgtctg 3121 ttatttccag ccaccctaac attcttcttc ctgtgtggat gcttatttgt caatcagtaa 3181 atttgttcgt aaagaaaatt acttctggta tttaggctgt gaatgtacct tgaaggggag 3241 agttcatgga gagagcatgt gttctctgat tgtgaggtca ctgtgaatga ttaaattggt 3301 aagggtaaag tatttgaatt ttcatgaact // LOCUS HUMMITCORA 1985 bp mRNA PRI 25-JUL-1994 DEFINITION Human ubiquinol cytochrome-c reductase core I protein mRNA, complete cds. ACCESSION L16842 NID g349472 KEYWORDS core protein I; mitochondrial respiratory chain; ubiquinol cytochrome-c reductase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1985) AUTHORS Hoffman,G.G., Lee,S., Christiano,A.M., Chung-Honet,L.C., Cheng,W., Katchman,S., Uitto,J. and Greenspan,D.S. TITLE Complete coding sequence, intron/exon organization, and chromosomal location of the gene for the core I protein of human ubiquinol-cytochrome c reductase JOURNAL J. Biol. Chem. 268, 21113-21119 (1993) MEDLINE 94012661 FEATURES Location/Qualifiers source 1..1985 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="placenta" /map="3p21.3" 5'UTR 391..417 exon 391..487 /number=1 CDS 418..1860 /codon_start=1 /product="ubiquinol-cytochrome c reductase core I protein" /db_xref="PID:g515634" /translation="MAASVVCRAATAGAQVLLRARRSPALLRTPALRSTATFAQALQF VPETQVSLLDNGLRVASEQSSQPTCTVGVWIDVGSRFETEKNNGAGYFLEHLAFKGTK NRPGSALEKEVESMGAHLNAYSTREHTAYYIKALSKDLPKAVELLGDIVQNCSLEDSQ IEKERDVILREMQENDASMRDVVFNYLHATAFQGTPLAQAVEGPSENVRKLSRADLTE YLSTHYKAPRMVLAAAGGVEHQQLLDLAQKHLGGIPWTYAEDAVPTLTPCRFTGSEIR HRDDALPFAHVAIAVEGPGWASPDSVALQVANAIIGHYDCTYGGGVHLSSPLASGAVA NKLCQSFQTFSICYAETGLLGAHFVCDRMKIDDMMFVLQGQWMRLCTSATESEVARGK NILRNALVSHLDGTTPVCEDIGRSLLTYGRRIPLAEWESRIAEVDASVVREICSKYIY DQCPAVAGYGPIEQLPDYNRIRSGMFWLRF" sig_peptide 418..519 /label=leader exon 488..627 /number=2 exon 628..714 /number=3 exon 715..844 /number=4 exon 845..1043 /number=5 exon 1044..1123 /number=6 exon 1124..1239 /number=7 exon 1240..1383 /number=8 exon 1384..1544 /number=9 exon 1545..1630 /number=10 exon 1631..1719 /number=11 exon 1720..1795 /number=12 exon 1796..>1860 /number=13 polyA_signal 1966..1971 BASE COUNT 413 a 574 c 609 g 389 t ORIGIN 1 tactgacttt cagcaccaac ttgtggtccc aggtaagttt ccacgctggt actttagccc 61 tgggctcgaa cctgcggaca cgctgtggct gcaactcccc gcccgccaga cctcagtacg 121 cagcgcggct ggtgagaaac ataatgcact ctgacttcca cgcgtggaat ggggagatgg 181 actgcacggc gggcggacct gctcgggctg atggacggca ggtggactga tgtgcgcagg 241 gactggcggc agcgcggtca gagccagtca gccaaagcca ggccagcaca atagactgtc 301 ccggttcccg ccaggaggcg gccgagcacc aactgtacgg tactgcgcct gcgccgcgac 361 cgccaacgcg cccagtctac gcttgcgcgg cgcaacaggg ccgactgcag ctggaagatg 421 gcggcgtccg tggtctgtcg ggccgctacc gccggggcac aagtgctatt gcgcgcccgc 481 cgctcgccgg ccctgctgcg gacgccagcc ttgcggagta cggcaacctt cgctcaggcg 541 ctccagttcg tgccggagac gcaggttagc ctgctggaca acggcctgcg tgtggcctcc 601 gagcagtcct ctcagcccac ttgcacggtg ggagtgtgga ttgatgttgg cagccgtttt 661 gagactgaga agaataatgg ggcaggctac tttttggagc atctggcttt caagggaaca 721 aagaatcggc ctggcagtgc cctggagaag gaggtggaga gcatgggggc ccatcttaat 781 gcctacagca cccgggagca cacagcttac tacatcaagg cgctgtccaa ggatctgccg 841 aaagctgtgg agctcctggg tgacattgtg cagaactgta gtctggaaga ctcacagatt 901 gagaaggaac gtgatgtgat cctgcgggag atgcaggaga atgatgcatc tatgcgagat 961 gtggtcttta actacctgca tgccacagca ttccagggca cacctctagc ccaggctgtg 1021 gaggggccca gtgagaatgt caggaagctg tctcgtgcag acttgaccga gtacctcagc 1081 acacattaca aggcccctcg aatggtgctg gcagcagctg gaggagtgga gcaccagcaa 1141 ctgttagacc tcgcccagaa gcacctcggt ggcatcccat ggacatatgc agaggacgct 1201 gtgcccactc ttactccatg ccgcttcact ggcagtgaga tccgccaccg tgatgatgct 1261 ctaccttttg cccacgtggc cattgcagta gagggtcctg gctgggccag cccggacagt 1321 gtggccttgc aagtggccaa tgccatcatc ggccactatg actgcactta tggtggtggc 1381 gtgcacctgt ccagcccact ggcttcaggt gctgtggcca acaagctatg ccagagtttc 1441 cagaccttca gcatctgcta tgcagagacg ggcttgctgg gtgcacactt tgtctgtgac 1501 cgaatgaaaa tcgatgacat gatgttcgtc ctgcaagggc agtggatgcg cctgtgtacc 1561 agtgccacgg agagtgaggt ggcccggggc aaaaacatcc tcagaaatgc cctggtatct 1621 catctagatg gcactactcc tgtgtgtgag gacatcggac gcagcctcct gacctatggc 1681 cgccgcatcc ccctggctga atgggaaagc cggattgcgg aggtggatgc cagtgtggta 1741 cgtgagatct gctccaagta catctatgac cagtgcccag cagtggctgg atatggcccc 1801 attgagcagc tcccagacta caaccggatc cgtagcggca tgttctggct gcgcttctag 1861 gcgggaagcc tatgtaagca agagggcagg gccggggttt gtggtccccc ccccaccaca 1921 aacacagcac ttcggctcct ctaacctgtg ccacaggtga ccaccaataa aatcctctgc 1981 tgaga // LOCUS HUMMITF1 1936 bp mRNA PRI 07-JAN-1995 DEFINITION Human mitochondrial transcription factor 1 mRNA, complete cds. ACCESSION M62810 NID g188563 KEYWORDS mitochondrial transcription factor 1. SOURCE Homo sapiens (tissue library: of S.Elledge) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1936) AUTHORS Parisi,M.A. and Clayton,D.A. TITLE Similarity of human mitochondrial transcription factor 1 to high mobility group proteins JOURNAL Science 252 (5008), 965-969 (1991) MEDLINE 91240283 FEATURES Location/Qualifiers source 1..1936 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="EBV-transformed lymphocyte" /tissue_lib="of S.Elledge" gene 1..870 /gene="mitochondrial transcription factor 1" 5'UTR 1..132 /gene="mitochondrial transcription factor 1" /note="mRNA 3' end has not been definitively mapped; the 5' UTR could span bp 1-150; putative" transit_peptide 133..258 /gene="mitochondrial transcription factor 1" /note="unsure if initial methionine starts at base 133 or 151; putative" CDS 133..873 /note="unsure if initial methionine starts at base 133 or 151; putative" /codon_start=1 /product="mitochondrial transcription factor 1" /db_xref="PID:g619859" /translation="MAFLRSMWGVLSALGRSGAELCTGCGSRLRSPFSFVYLPRWFSS VLASCPKKPVSSYLRFSKEQLPIFKAQNPDAKTTELIRRIAQRWRELPDSKKKIYQDA YRAEWQVYKEEISRFKEQLTPSQIMSLEKEIMDKHLKRKAMTKKKELTLLGKPKRPRS AYNVYVAERFQEAKGDSPQEKLKTVKENWKNLSDSEKELYIQHAKEDETRYHNEMKSW EEQMIEVGRKDLLRRTIKKQRKYGAEEC" mat_peptide 259..870 /gene="mitochondrial transcription factor 1" /note="putative" /product="mitochondrial transcription factor 1" BASE COUNT 633 a 300 c 404 g 599 t ORIGIN 1 cctcgctagt ggcgggcatg ataacacacg ccggagggtc gcacgcgggt tccagttgtg 61 attgctggag ttgtgtattg ccaggaggct ctccgagatt ggggtcgggt cactgcctca 121 tccaccggag cgatggcgtt tctccgaagc atgtggggcg tgctgagtgc cctgggaagg 181 tctggagcag agctgtgcac cggctgtgga agtcgactgc gctccccctt cagttttgtg 241 tatttaccga ggtggttttc atctgtcttg gcaagttgtc caaagaaacc tgtaagttct 301 taccttcgat tttctaaaga acaactaccc atatttaaag ctcagaaccc agatgcaaaa 361 actacagaac taattagaag aattgcccag cgttggaggg aacttcctga ttcaaagaaa 421 aaaatatatc aagatgctta tagggcggag tggcaggtat ataaagaaga gataagcaga 481 tttaaagaac agctaactcc aagtcagatt atgtctttgg aaaaagaaat catggacaaa 541 catttaaaaa ggaaagctat gacaaaaaaa aaagagttaa cactgcttgg aaaaccaaaa 601 agacctcgtt cagcttataa cgtttatgta gctgaaagat tccaagaagc taagggtgat 661 tcaccgcagg aaaagctgaa gactgtaaag gaaaactgga aaaatctgtc tgactctgaa 721 aaggaattat atattcagca tgctaaagag gacgaaactc gttatcataa tgaaatgaag 781 tcttgggaag aacaaatgat tgaagttgga cgaaaggatc ttctacgtcg cacaataaag 841 aaacaacgaa aatatggtgc tgaggagtgt taaaagtaga agattgagat gtgttcacaa 901 tggataggca caggaaacca gttaggtctc aatacctgaa gctatcgtaa aattaagaaa 961 ggataaagtt ggtaaacctt ttatatttag tatcttttta ttcagctcat ggacttctgc 1021 cagcataata cttgctttgg aaaacccaga taaaggttca tgcaaacttt attttgtgtt 1081 taggaactac tgaggatcag agtaatccaa gcaaatgtga atcattttac ctttgacaaa 1141 ggtaaatcag actatgaagt tttttttata caggatgatg actatggaaa gagtactctt 1201 gtttccttat attatggagg caggagtttc gttttcaaaa ttgttacaaa ttgtagaagc 1261 cacggtgttc tgtgatataa gtgtgtgttt ttcataaagc aggcagaact catctaggta 1321 aattacagtt cctaggtata attcacattg tattcagagt tgatggttgt acatataagt 1381 gattgctggt tttagttgca actttgtata aaagggactg agaaatttat aaactttttt 1441 cttactgtct tttttctaaa gtaaaaacaa agaaattatg tgccagattt atgcatatta 1501 ttttatgttg catagaataa aatttttaat ctttaatttt acatttccta aatatatttt 1561 aagacgaaac atttgttcta tagcttttcc ctttttttaa gtaaggaatt ttattttttt 1621 ctgaattatt ttctctcgtg agtatattga tccagaaaga aaacttgtat tatgtgtgtt 1681 ttaaaatgag aaatctaaaa aacgaaaagt ctccaaagtc tctggaattt gaaacacttt 1741 gcataacgta taaaagcctg tttaagagac agccaactat ggcctgtgga tcaaatccag 1801 cctgctgcct gctttttatg gcctgtgagc taggaattgt gtttataatt ttaaatgttt 1861 ttttttaaag acttttatga tacttgaaaa ttaacatgaa tatttagtgt tcataaataa 1921 agtttgttga aacaca // LOCUS HUMMKK 1455 bp mRNA PRI 03-FEB-1993 DEFINITION Homo sapiens MAP kinase kinase mRNA, complete cds. ACCESSION L05624 NID g188568 KEYWORDS MAP kinase kinase; kinase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1455) AUTHORS Seger,R., Seger,D., Lozeman,F.J., Ahn,N.G., Graves,L.M., Campbell,J.S., Ericsson,L.H., Harrylock,M., Jensen,A.M. and Krebs,E.G. TITLE Human T-cell mitogen-activated protein kinase kinases are related to yeast signal transduction kinases JOURNAL J. Biol. Chem. 267, 25628-25631 (1992) MEDLINE 93100262 FEATURES Location/Qualifiers source 1..1455 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T-cell" CDS 34..1215 /codon_start=1 /product="MAP kinase kinase" /db_xref="PID:g188569" /translation="MPKKKPTPIQLNPAPDGSAVNGTSSAETNLEALQKKLEELELDE QQRKRLEAFLTQKQKVGELKDDDFEKISELGAGNGGVVFKVSHKPSGLVMARKLIHLE IKPAIRNQIIRELQVLHECNSPYIVGFYGAFYSDGEISICMEHMDGGSLDQVLKKAGR IPEQILGKVSIAVIKGLTYLREKHKIMHRDVKPSNILVNSRGEIKLCDFGVSGQLIDS MANSFVGTRSYMSPERLQGTHYSVQSDIWSMGLSLVEMAVGRYPIPPPDAKELELMFG CQVEGDAAETPPRPRTPGRPLSSYGMDSRPPMAIFELLDYIVNEPPPKLPSGVFSLEF QDFVNKCLIKNPAERADLKQLMVHAFIKRSDAEEVDFAGWLCSTIGLNQPSTPTHAAG V" BASE COUNT 358 a 354 c 398 g 345 t ORIGIN 1 ggcggagttg gaagcgcgtt acccgggtcc aaaatgccca agaagaagcc gacgcccatc 61 cagctgaacc cggcccccga cggctctgca gttaacggga ccagctctgc ggagaccaac 121 ttggaggcct tgcagaagaa gctggaggag ctagagcttg atgagcagca gcgaaagcgc 181 cttgaggcct ttcttaccca gaagcagaag gtgggagaac tgaaggatga cgactttgag 241 aagatcagtg agctgggggc tggcaatggc ggtgtggtgt tcaaggtctc ccacaagcct 301 tctggcctgg tcatggccag aaagctaatt catctggaga tcaaacccgc aatccggaac 361 cagatcataa gggagctgca ggttctgcat gagtgcaact ctccgtacat cgtgggcttc 421 tatggtgcgt tctacagcga tggcgagatc agtatctgca tggagcacat ggatggaggt 481 tctctggatc aagtcctgaa gaaagctgga agaattcctg aacaaatttt aggaaaagtt 541 agcattgctg taataaaagg cctgacatat ctgagggaga agcacaagat catgcacaga 601 gatgtcaagc cctccaacat cctagtcaac tcccgtgggg agatcaagct ctgtgacttt 661 ggggtcagcg ggcagctcat cgactccatg gccaactcct tcgtgggcac aaggtcctac 721 atgtcgccag aaagactcca ggggactcat tactctgtgc agtcagacat ctggagcatg 781 ggactgtctc tggtagagat ggcggttggg aggtatccca tccctcctcc agatgccaag 841 gagctggagc tgatgtttgg gtgccaggtg gaaggagatg cggctgagac cccacccagg 901 ccaaggaccc ccgggaggcc ccttagctca tacggaatgg acagccgacc tcccatggca 961 atttttgagt tgttggatta catagtcaac gagcctcctc caaaactgcc cagtggagtg 1021 ttcagtctgg aatttcaaga ttttgtgaat aaatgcttaa taaaaaaccc cgcagagaga 1081 gcagatttga agcaactcat ggttcatgct tttatcaaga gatctgatgc tgaggaagtg 1141 gattttgcag gttggctctg ctccaccatc ggccttaacc agcccagcac accaacccat 1201 gctgctggcg tctaagtgtt tgggaagcaa caaagagcga gtcccctgcc cggtggtttg 1261 ccatgtcgct tttgggcctc cttcccatgc ctgtctctgt tcagatgtgc atttcacctg 1321 tgacaaagga tgaagaacac agcatgtgcc aagattctac tcttgtcatt tttaatatta 1381 ctgtctttat tcttattact attattgttc ccctaagtgg attggctttg tgcttggggc 1441 tatttgtgtg tatcc // LOCUS HUMMKK3A 2030 bp mRNA PRI 24-FEB-1995 DEFINITION Homo sapiens MAP kinase kinase 3 (MKK3) mRNA, complete cds. ACCESSION L36719 NID g685173 KEYWORDS MAP kinase kinase; mitogen activated protein kinase; protein kinase. SOURCE Homo sapiens brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2030) AUTHORS Derijard,B., Raingeaud,J., Barrett,T., Wu,I.H., Han,J., Ulevitch,R.J. and Davis,R.J. TITLE Independent human MAP-kinase signal transduction pathways defined by MEK and MKK isoforms JOURNAL Science 267 (5198), 682-685 (1995) MEDLINE 95141073 REMARK Erratum:[[published erratum appears in Science 1995 Jul 7;269(5220):17]] FEATURES Location/Qualifiers source 1..2030 /organism="Homo sapiens" /note="vector lamda ZAPII" /db_xref="taxon:9606" /tissue_type="brain" 5'UTR 1..337 /gene="MKK3" gene 1..2030 /gene="MKK3" mRNA 1..2030 /gene="MKK3" CDS 338..1294 /gene="MKK3" /codon_start=1 /function="protein kinase" /product="MAP kinase kinase 3" /db_xref="PID:g685174" /translation="MSKPPAPNPTPPRNLDSRTFITIGDRNFEVEADDLVTISELGRG AYGVVEKVRHAQSGTIMAVKRIRATVNSQEQKRLLMDLDINMRTVDCFYTVTFYGALF REGDVWICMELMDTSLDKFYRKVLDKNMTIPEDILGEIAVSIVRALEHLHSKLSVIHR DVKPSNVLINKEGHVKMCDFGISGYLVDSVAKTMDAGCKPYMAPERINPELNQKGYNV KSDVWSLGITMIEMAILRFPYESWGTPFQQLKQVVEEPSPQLPADRFSPEFVDFTAQC LRKNPAERMSYLELMEHPFFTLHKTKKTDIAAFVKKILGEDS" 3'UTR 1292..2030 /gene="MKK3" BASE COUNT 415 a 610 c 576 g 429 t ORIGIN 1 tggctggcaa tggccttgct gacctcgagc cgggcccacg tggggacctt tggagcacag 61 cctacgatcc tggtgcaagg ccggtggatg cagaggccag tccatatacc acccaggcct 121 gcgaggagcg tggtccccac ccatccagcc catatgtgca agtgcccttg acagagaggc 181 tggtcatatc catggtgacc atttatgggc cacaacaggt ccccatctgc gcagtgaacc 241 ctgtgctgag caccttgcag acgtgatctt gcttcgtcct gcagcactgt gcggggcagg 301 aaaatccaag aggaagaagg atctacggat atcctgcatg tccaagccac ccgcacccaa 361 ccccacaccc ccccggaacc tggactcccg gaccttcatc accattggag acagaaactt 421 tgaggtggag gctgatgact tggtgaccat ctcagaactg ggccgtggag cctatggggt 481 ggtagagaag gtgcggcacg cccagagcgg caccatcatg gccgtgaagc ggatccgggc 541 caccgtgaac tcacaggagc agaagcggct gctcatggac ctggacatca acatgcgcac 601 ggtcgactgt ttctacactg tcaccttcta cggggcacta ttcagagagg gagacgtgtg 661 gatctgcatg gagctcatgg acacatcctt ggacaagttc taccggaagg tgctggataa 721 aaacatgaca attccagagg acatccttgg ggagattgct gtgtctatcg tgcgggccct 781 ggagcatctg cacagcaagc tgtcggtgat ccacagagat gtgaagccct ccaatgtcct 841 tatcaacaag gagggccatg tgaagatgtg tgactttggc atcagtggct acttggtgga 901 ctctgtggcc aagacgatgg atgccggctg caagccctac atggcccctg agaggatcaa 961 cccagagctg aaccagaagg gctacaatgt caagtccgac gtctggagcc tgggcatcac 1021 catgattgag atggccatcc tgcggttccc ttacgagtcc tgggggaccc cgttccagca 1081 gctgaagcag gtggtggagg agccgtcccc ccagctccca gccgaccgtt tctcccccga 1141 gtttgtggac ttcactgctc agtgcctgag gaagaacccc gcagagcgta tgagctacct 1201 ggagctgatg gagcacccct tcttcacctt gcacaaaacc aagaagacgg acattgctgc 1261 cttcgtgaag aagatcctgg gagaagactc ataggggctg ggcctcggac cccactccgg 1321 ccctccagag ccccacagcc ccatctgcgg gggcagtgct cacccacacc ataagctact 1381 gccatcctgg cccagggcat ctgggaggaa ccgagggggc tgctcccacc tggctctgtg 1441 gcgagccatt tgtcccaagt gccaaagaag cagaccattg gggctcccag ccaggccctt 1501 gtcggcccca ccagtgcctc tccctgctgc tcctaggacc cgtctccagc tgctgagatc 1561 ctggactgag ggggcctgga tgccccctgt ggatgctgct gcccctgcac agcaggctgc 1621 cagtgcctgg gtggatgggc caccgccttg cccagcctgg atgccatcca agttgtatat 1681 ttttttaatc tctcgactga atggactttg cacactttgg cccagggtgg ccacacctct 1741 atcccggctt tggtgcgggg tacacaagag gggatgagtt gtgtgaatac cccaagactc 1801 ccatgaggga gatgccatga gccgcccaag gccttcccct ggcactggca aacagggcct 1861 ctgcggagca cactggctca cccagtcctg cccgccaccg ttatcggtgt cattcacctt 1921 tcgtgttttt tttaatttat cctctgttga ttttttcttt tgctttatgg gtttggcttg 1981 tttttcttgc atggtttgga gctgatcgct tctcccccac cccctagggg // LOCUS HUMMLC2A 1120 bp mRNA PRI 07-JAN-1995 DEFINITION Human 20-kDa myosin light chain (MLC-2) mRNA, complete cds. ACCESSION J02854 NID g188585 KEYWORDS myosin; myosin light chain. SOURCE Human, cDNA to mRNA, clone HuMLC-6. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1120) AUTHORS Kumar,C.C., Mohan,S.R., Zavodny,P.J., Narula,S.K. and Leibowitz,P.J. TITLE Characterization and differential expression of human vascular smooth muscle myosin light chain 2 isoform in nonmuscle cells JOURNAL Biochemistry 28 (9), 4027-4035 (1989) MEDLINE 89323116 COMMENT Draft entry and clean copy of sequence for [1] kindly provided by C.C.Kumar, 03-MAR-1989. FEATURES Location/Qualifiers source 1..1120 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Unassigned" gene 43..561 /gene="MYL2" CDS 43..561 /gene="MYL2" /codon_start=1 /db_xref="GDB:G00-128-829" /product="myosin light chain 2" /db_xref="PID:g188586" /translation="MSSKRAKAKATKKRPQRATSNVFAMFDQSQIQEFKEAFNMIDQN RDGFIDKEDLHDMLASLGKNPTDEYLEGMMSEAPGPYNFTMFLTMFGEKLNGTDPEDV IRNAFACFDEESSGFIHEDHLRKLLTTMGDRFTDEEVDEMYREAPVDKKGNFNYVEFT RILKHGAKDKHD" BASE COUNT 276 a 364 c 301 g 179 t ORIGIN 240 bp upstream of MboII site. 1 acttcttcgc accagggaag ccccacccac cagaacgcca agatgtccag caagcgggcc 61 aaagccaagg ccaccaagaa gcggccacag cgggccacat ccaatgtctt cgcaatgttt 121 gaccagtccc agatccagga gtttaaggag gctttcaaca tgattgacca gaaccgtgat 181 ggcttcattg acaaggagga cctgcacgac atgctggcct cgctggggaa gaaccccaca 241 gacgaatacc tggagggcat gatgagcgag gccccggggc catacaactt caccatgttc 301 ctcaccatgt ttggggagaa gctgaacggc acggaccccg aggatgtgat tcgcaacgcc 361 tttgcctgct tcgacgagga atcctcaggt ttcatccatg aggaccacct ccggaagctg 421 ctcaccacca tgggtgaccg cttcacagat gaggaagtgg acgagatgta ccgggaggca 481 cccgttgata agaaaggcaa cttcaactac gtggagttca cccgcatcct caaacatggc 541 gccaaggata aacacgacta ggccatcccc agccccctga cacccagccc ccgccagtca 601 cccctccccg cacacacccg tccataccag ctccctgccc atgaccctcg ctcagggatc 661 cccctttgag ggttagggtc ccagttccca gtggaagaaa caggccagga gagtgcgtgc 721 cgagctgagg cagatgttcc cacagtgacc ccagagccct gggctatagt ctctgacccc 781 tccaaggaaa gaccaccttc tggggacatg ggctggaggg caggacctag aggcaccaag 841 ggaaccgcat tccggggctg ttccccgagg aggaagggaa gcctctgtgt gccccccagg 901 aggaagaggc cctgagtcct gggatcagac accccttcac gtgtatccca cacaaatgca 961 agctcaccaa ggtcccctct cagtcccctt ccctacaccc tgacgccaga tgccgcacac 1021 ccaacgccac cagccatggg agtgtgctca ggagtcgcgg ggcagacgtg acatctgtcc 1081 agagggggca gaatctccaa tagaggactg agacaacatg // LOCUS HUMMLC2B 662 bp mRNA PRI 05-MAR-1996 DEFINITION Human (clone PWHLC2-24) myosin light chain 2 mRNA, complete cds. ACCESSION M21812 NID g1220345 KEYWORDS myosin; myosin light chain 2. SOURCE Homo sapiens (tissue library: lambda gt11) fetal skeletal muscle cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Wu,Q.L. TITLE Characterization of a full length human skeletal fast light chain 2 cDNA JOURNAL Unpublished (1991) FEATURES Location/Qualifiers source 1..662 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="skeletal muscle" /tissue_lib="lambda gt11" 5'UTR 1..58 CDS 59..571 /codon_start=1 /product="myosin light chain 2" /db_xref="PID:g1220346" /translation="MAPKRAKRRTVAEGGSSSVFSMFDQTQIQEFKEAFTVIDQNRDG IIDKEDLRDTFAAMGRLNVKNEELDAMMKEASGPINFTVFLTMFGEKLKGADPEDVIT GAFKVLDPEGKGTIKKKFLEELLTTQCDRFSQEEIKNMWAAFPPDVGGNVDYKNICYV ITHGDAKDQE" 3'UTR 572..662 stem_loop 578..628 BASE COUNT 155 a 197 c 186 g 124 t ORIGIN 1 ctgactcctt gcttctttcc agccggtgcc gctgccttgc cccccggaga ctgaagacat 61 ggcacccaag agggccaaga gaaggacagt agcagagggc ggaagctcca gcgtcttctc 121 catgttcgac cagactcaga tccaggagtt caaagaggcc ttcactgtga tcgaccagaa 181 ccgtgatggt attatagaca aggaggacct tcgggacacc ttcgcagcca tgggccgcct 241 caatgtgaag aatgaggagt tggatgccat gatgaaggaa gccagcggtc ccatcaactt 301 caccgtcttc ctgaccatgt tcggggagaa gctcaagggt gccgaccctg aggatgtgat 361 caccggagcc ttcaaggtct tggaccctga gggaaagggc accatcaaga agaagttcct 421 ggaggagctg ctgaccacgc agtgtgaccg cttctcccag gaggagatca agaacatgtg 481 ggcggccttc ccccccgacg tgggcggcaa cgtcgactac aaaaacatct gctacgtcat 541 cacgcacggc gacgccaagg accaggagta ggggcacccg cgggcctccg ctgcccgacg 601 cttctgttcg gcccgacctc caccccggct cccaataaaa tttaactgat ctttgtttct 661 ta // LOCUS HUMMLC3NM 706 bp mRNA PRI 07-JAN-1995 DEFINITION Human myosin light chain 3 non-muscle (MLC3nm) mRNA, complete cds. ACCESSION M31212 NID g188589 KEYWORDS alkali myosin light chain; contractile protein; non-muscle myosin light chain 3; structural protein. SOURCE Human adult transformed fibroblast, cDNA to mRNA, clone MLC3nm. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 706) AUTHORS Hailstones,D.L. and Gunning,P.W. TITLE Characterization of human myosin light chains 1sa and 3nm: implications for isoform evolution and function JOURNAL Mol. Cell. Biol. 10 (3), 1095-1104 (1990) MEDLINE 90158572 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.L.Hailstones, 11-JAN-1990. The smooth muscle myosin light chain 3 is generated by an alternative splicing event. Its amino acid boundaries are from bp 41 to 467 and 514 to 542. FEATURES Location/Qualifiers source 1..706 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="fibroblast" /dev_stage="adult" /map="3p21" mRNA 1..706 /note="MLC3nm mRNA" gene 41..496 /gene="MYL3" CDS 41..496 /gene="MYL3" /codon_start=1 /db_xref="GDB:G00-120-218" /product="myosin light chain 3" /db_xref="PID:g188590" /translation="MCDFTEDQTAEFKEAFQLFDRTGDGKILYSQCGDVMRALGQNPT DAEVLKVLGNPKSDEMNVKVLDFEHFLPMLQTVAKNKDQGTYEDYVEGLRVFDKEGNG TVMGAEIRHVLVTLGEKMTEEEVEMLVAGHEDSNGCINYEAFVRHILSG" BASE COUNT 165 a 163 c 213 g 165 t ORIGIN 1 attactgcag gaaaaggtcc cggagagctg agcagtcaag atgtgtgact tcaccgaaga 61 ccagaccgca gagttcaagg aggccttcca gctgtttgac cgaacaggtg atggcaagat 121 cctgtacagc cagtgtgggg atgtgatgag ggccctgggc cagaacccta ccgacgccga 181 ggtgctcaag gtcctgggga accccaagag tgatgagatg aatgtgaagg tgctggactt 241 tgagcacttt ctgcccatgc tgcagacagt ggccaagaac aaggaccagg gcacctatga 301 ggattatgtc gaaggacttc gggtgtttga caaggaagga aatggcaccg tcatgggtgc 361 tgaaatccgg catgttcttg tcacactggg tgagaagatg acagaggaag aagtagagat 421 gctggtggca gggcatgagg acagcaatgg ttgtatcaac tatgaagcgt ttgtgaggca 481 tatcctgtcg gggtgacggg cccgatgggg cggagctcgt ccgcatggtg ctgaatggct 541 gaggaccttc ccagtctccc cagagtccgt gcctttccct gtgtgaattt tgtatctagc 601 ctaaagtttc cctaggcttt cttgtctcag caactttccc atcttgtctc tcttggatga 661 tgtttgccgt cagcattcac caaaataaac ttgctctctg ggcctc // LOCUS HUMMOESIN 3879 bp mRNA PRI 28-JUL-1992 DEFINITION Human moesin mRNA, complete cds. ACCESSION M69066 NID g188625 KEYWORDS . SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3879) AUTHORS Lankes,W.T. and Furthmayr,H. TITLE Moesin: a new member of the protein 4.1 - talin - ezrin family of proteins JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1991) In press REFERENCE 2 (sites) AUTHORS Lankes,W.T. and Furthmayr,H. TITLE Moesin: A member of the protein 4.1-talin-ezrin family of proteins JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88, 8297-8301 (1991) MEDLINE 92020840 FEATURES Location/Qualifiers source 1..3879 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HL 60" CDS 101..1834 /codon_start=1 /product="moesin B" /db_xref="PID:g188626" /translation="MPKTISVRVTTMDAELEFAIQPNTTGKQLFDQVVKTIGLREVWF FGLQYQDTKGFSTWLKLNKKVTAQDVRKESPLLFKFRAKFYPEDVSEELIQDITQRLF FLQVKEGILNDDIYCPPETAVLLASYAVQSKYGDFNKEVHKSGYLAGDKLLPQRVLEQ HKLNKDQWEERIQVWHEEHRGMLREDAVLEYLKIAQDLEMYGVNYFSIKNKKGSELWL GVDALGLNIYEQNDRLTPKIGFPWSEIRNISFNDKKFVIKPIDKKAPDFVFYAPRLRI NKRILALCMGNHELYMRRRKPDTIEVQQMKAQAREEKHQKQMERAMLENEKKKREMAE KEKEKIEREKEELMERLKQIEEQTKKAQQELEEQTRRALELEQERKRAQSEAEKLAKE RQEAEEAKEALLQASRDQKKTQEQLALEMAELTARISQLEMARQKKESEAVEWQQKAQ MVQEDLEKTRAELKTAMSTPHVAEPAENEQDEQDENGAEASADLRADAMAKDRSEEER TTEAEKNERVQKHLKALTSELANARDESKKTANDMIHAENMRLGRDKYKTLRQIRQGN TKQRIDEFESM" BASE COUNT 1000 a 972 c 965 g 942 t ORIGIN 1 ggcacgaggc cagccgaatc caagccgtgt gtactgcgtg ctcagcactg cccgacagtc 61 ctagctaaac ttcgccaact ccgctgcctt tgccgccacc atgcccaaaa cgatcagtgt 121 gcgtgtgacc accatggatg cagagctgga gtttgccatc cagcccaaca ccaccgggaa 181 gcagctattt gaccaggtgg tgaaaactat tggcttgagg gaagtttggt tctttggtct 241 gcagtaccag gacactaaag gtttctccac ctggctgaaa ctcaataaga aggtgactgc 301 ccaggatgtg cggaaggaaa gccccctgct ctttaagttc cgtgccaagt tctaccctga 361 ggatgtgtcc gaggaattga ttcaggacat cactcagcgc ctgttctttc tgcaagtgaa 421 agagggcatt ctcaatgatg atatttactg cccgcctgag accgctgtgc tgctggcctc 481 gtatgctgtc cagtctaagt atggcgactt caataaggaa gtgcataagt ctggctacct 541 ggccggagac aagttgctcc cgcagagagt cctggaacag cacaaactca acaaggacca 601 gtgggaggag cggatccagg tgtggcatga ggaacaccgt ggcatgctca gggaggatgc 661 tgtcctggaa tatctgaaga ttgctcaaga tctggagatg tatggtgtga actacttcag 721 catcaagaac aagaaaggct cagagctgtg gctgggggtg gatgccctgg gtctcaacat 781 ctatgagcag aatgacagac taactcccaa gataggcttc ccctggagtg aaatcaggaa 841 catctctttc aatgataaga aatttgtcat caagcccatt gacaaaaaag ccccggactt 901 cgtcttctat gctccccggc tgcggattaa caagcggatc ttggccttgt gcatggggaa 961 ccatgaacta tacatgcgcc gtcgcaagcc tgataccatt gaggtgcagc agatgaaggc 1021 acaggcccgg gaggagaagc accagaagca gatggagcgt gctatgctgg aaaatgagaa 1081 gaagaagcgt gaaatggcag agaaggagaa agagaagatt gaacgggaga aggaggagct 1141 gatggagagg ctgaagcaga tcgaggaaca gactaagaag gctcagcaag aactggaaga 1201 acagacccgt agggctctgg aacttgagca ggaacggaag cgtgcccaga gcgaggctga 1261 aaagctggcc aaggagcgtc aagaagctga agaggccaag gaggccttgc tgcaggcctc 1321 ccgggaccag aaaaagactc aggaacagct ggccttggaa atggcagagc tgacagctcg 1381 aatctcccag ctggagatgg cccgacagaa gaaggagagt gaggctgtgg agtggcagca 1441 gaaggcccag atggtacagg aagacttgga gaagacccgt gctgagctga agactgccat 1501 gagtacacct catgtggcag agcctgctga gaatgagcag gatgagcagg atgagaatgg 1561 ggcagaggct agtgctgacc tacgggctga tgctatggcc aaggaccgca gtgaggagga 1621 acgtaccact gaggcagaga agaatgagcg tgtgcagaag cacctgaagg ccctcacttc 1681 ggagctggcc aatgccagag atgagtccaa gaagactgcc aatgacatga tccatgctga 1741 gaacatgcga ctgggccgag acaaatacaa gaccctgcgc cagatccggc agggcaacac 1801 caagcagcgc attgacgaat ttgagtctat gtaatgggca cccagcctct agggacccct 1861 cctccctttt tccttgtccc cacactccta cacctaactc acctaactca tactgtgctg 1921 gagccactaa ctagagcagc cctggagtca tgccaagcat ttaatgtagc catgggacca 1981 aacctagccc cttagccccc acccacttcc ctgggcaaat gaatggctca ctatggtgcc 2041 aatggaacct cctttctctt ctctgttcca ttgaatctgt atggctagaa tatcctactt 2101 ctccagccta gaggtacttt ccacttgatt ttgcaaatgc ccttacactt actgttgtcc 2161 tatgggagtc aagtgtggag taggttggaa gctagctccc ctcctctccc ctccactgtc 2221 ttcttcaggt cctgagatta cacggtggag tgtatgcggt ctaggaatga gacaggacct 2281 agatatcttc tccagggatg tcaactgacc taaaatttgc cctcccatcc cgtttagagt 2341 tatttaggct ttgtaacgat tgggggaata aaaagatgtt cagtcatttt tgtttctacc 2401 tcccagatcg gatctgttgc aaactcagcc tcaataagcc ttgtcgttga ctttagggac 2461 tcaatttctc cccagggtgg atgggggaaa tggtgccttc aagaccttca ccaaacatac 2521 tagaagggca ttggccattc tattgtggca aggctgagta gaagatccta ccccaattcc 2581 ttgtaggagt ataggccggt ctaaagtgag ctctatgggc agatctaccc cttacttatt 2641 attccagatc tgcagtcact tcgtgggatc tgcccctccc tgcttcaata cccaaatcct 2701 ctccagctat aacagtaggg atgagtaccc aaaagctcag ccagccccat caggactctt 2761 gtgaaaagag aggatatgtt cacacctagc gtcagtattt tccctgctag gggttttagg 2821 tctcttcccc tctcagagct acttgggcca tagctcctgc tccacagcca tcccagcctt 2881 ggcatctaga gcttgatgcc agtaggctca actagggagt gagtgcaaaa agctgagtat 2941 ggtgagagaa gcctgtgccc tgatccaagt ttactcaacc ctctcaggtg accaaaatcc 3001 ccttctcatc actcccctca aagaggtgac tgggccctgc ctctgtttga caaacctcta 3061 acccaggtct tgacaccagc tgttctgtcc cttggagctg taaaccagag agctgctggg 3121 ggattctggc ctagtccctt ccacaccccc accccttgct ctcaacccag gagcatccac 3181 ctccttctct gtctcatgtg tgctcttctt ctttctacag tattatgtac tctactgata 3241 tctaaatatt gatttctgcc ttccttgcta atgcaccatt agaagatatt agtcttgggg 3301 caggatgatt ttggcctcat tactttacca cccccacacc tggaaagcat atactatatt 3361 acaaaatgac attttgccaa aattattaat ataagaagct ttcagtatta gtgatgtcat 3421 ctgtcactat aggtcataca atccattctt aaagtacttg ttatttgttt ttattattac 3481 tgtttgtctt ctccccaggg ttcagtccct caaggggcca tcctgtccca ccatgcagtg 3541 ccccctagct tagagcctcc ctcaattccc cctggccacc accccccact ctgtgcctga 3601 ccttgaggag tcttgtgtgc attgctgtga attagctcac ttggtgatat gtcctatatt 3661 ggctaaattg aaacctggaa ttgtggggca atctattaat agctgcctta aagtcagtaa 3721 cttaccctta gggaggctgg gggaaaaggt tagattttgt attcaggggt tttttgtgta 3781 ctttttgggt ttttaaaaaa ttgtttttgg aggggtttat gctcaatcca tgttctattt 3841 cagtgccaat aaaatttagg tgacttcaaa aaaaaaaaa // LOCUS HUMMOR1X 2162 bp mRNA PRI 08-AUG-1994 DEFINITION Human Mu opiate receptor (MOR1) mRNA, complete cds. ACCESSION L25119 NID g452072 KEYWORDS Mu opiate receptor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2162) AUTHORS Wang,J.B., Johnson,P.S., Persico,A.M., Hawkins,A.L., Griffin,C.A. and Uhl,G.R. TITLE Human mu opiate receptor. cDNA and genomic clones, pharmacologic characterization and chromosomal assignment JOURNAL FEBS Lett. 338 (2), 217-222 (1994) MEDLINE 94139928 FEATURES Location/Qualifiers source 1..2162 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_lib="lambda ZAP; Stratagene" gene 213..1415 /gene="MOR1" CDS 213..1415 /gene="MOR1" /codon_start=1 /product="Mu opiate receptor" /db_xref="PID:g452073" /translation="MDSSAAPTNASNCTDALAYSSCSPAPSPGSWVNLSHLDGNLSDP CGPNRTNLGGRDSLCPPTGSPSMITAITIMALYSIVCVVGLFGNFLVMYVIVRYTKMK TATNIYIFNLALADALATSTLPFQSVNYLMGTWPFGTILCKIVISIDYYNMFTSIFTL CTMSVDRYIAVCHPVKALDFRTPRNAKIINVCNWILSSAIGLPVMFMATTKYRQGSID CTLTFSHPTWYWENLVKICVFIFAFIMPVLIITVCYGLMILRLKSVRMLSGSKEKDRN LRRITRMVLVVVAVFIVCWTPIHIYVIIKALVTIPETTFQTVSWHFCIALGYTNSCLN PVLYAFLDENFKRCFREFCIPTSSNIEQQNSTRIRQNTRDHPSTANTVDRTNHQLENL EAETAPLP" BASE COUNT 563 a 566 c 455 g 576 t 2 others ORIGIN 1 ggaattccgg ctataggcag aggagaatgt cagatgctca gctcggtccc ctccgcctga 61 cgctcctctc tgtctcagcc aggactggtt tctgtaagaa acagcaggag ctgtggcagc 121 ggcgaaagga agcggctgag gcgcttggaa cccgaaaagt ctcggtgctc ctggctacct 181 cgcacagcgg tgcccgcccg gccgtcagta ccatggacag cagcgctgcc cccacgaacg 241 ccagcaattg cactgatgcc ttggcgtact caagttgctc cccagcaccc agccccggtt 301 cctgggtcaa cttgtcccac ttagatggca acctgtccga cccatgcggt ccgaaccgca 361 ccaacctggg cgggagagac agcctgtgcc ctccgaccgg cagtccctcc atgatcacgg 421 ccatcacgat catggccctc tactccatcg tgtgcgtggt ggggctcttc ggaaacttcc 481 tggtcatgta tgtgattgtc agatacacca agatgaagac tgccaccaac atctacattt 541 tcaaccttgc tctggcagat gccttagcca ccagtaccct gcccttccag agtgtgaatt 601 acctaatggg aacatggcca tttggaacca tcctttgcaa gatagtgatc tccatagatt 661 actataacat gttcaccagc atattcaccc tctgcaccat gagtgttgat cgatacattg 721 cagtctgcca ccctgtcaag gccttagatt tccgtactcc ccgaaatgcc aaaattatca 781 atgtctgcaa ctggatcctc tcttcagcca ttggtcttcc tgtaatgttc atggctacaa 841 caaaatacag gcaaggttcc atagattgta cactaacatt ctctcatcca acctggtact 901 gggaaaacct cgtgaagatc tgtgttttca tcttcgcctt cattatgcca gtgctcatca 961 ttaccgtgtg ctatggactg atgatcttgc gcctcaagag tgtccgcatg ctctctggct 1021 ccaaagaaaa ggacaggaat cttcgaagga tcaccaggat ggtgctggtg gtggtggctg 1081 tgttcatcgt ctgctggact cccattcaca tttacgtcat cattaaagcc ttggttacaa 1141 tcccagaaac tacgttccag actgtttctt ggcacttctg cattgctcta ggttacacaa 1201 acagctgcct caacccagtc ctttatgcat ttctggatga aaacttcaaa cgatgcttca 1261 gagagttctg tatcccaacc tcttccaaca ttgagcaaca aaactccact cgaattcgtc 1321 agaacactag agaccacccc tccacggcca atacagtgga tagaactaat catcagctag 1381 aaaatctgga agcagaaact gctccgttgc cctaacaggg tctcatgcca ttccgacctt 1441 caccaagctt agaagccacc atgtatgtgg aagcaggttg cttcaagaat gtgtaggagg 1501 ctctaattct ctaggaaagt gcctactttt aggtcatcca acctctttcc tctctggcca 1561 ctctgctctg cacattagag ggacagccaa aagtaagtgg agcatttgga aggaaaggaa 1621 tataccacac cgaggagtcc agtttgtgca agacacccag tggaaccaaa acccatcgtg 1681 gtatgtgaat tgaagtcatc ataaaaggtg acccttctgt ctgtaagatt ttattttcaa 1741 gcaaatattt atgacctcaa caaagaagaa ccatcttttg ttaagttcac cgtagtaaca 1801 cataaagtaa atgctacctc tgatcaaagc accttgaatg gaaggtccga gtctttttag 1861 tgtttttgca agggaatgaa tccattattc tattttagac ttttaacttc aacttaaaat 1921 tagcatctgg ctaaggcatc attttcacct ccatttcttg gttttgtatt gtttaaaaaa 1981 aataacatct ctttcatcta gctccataat tgcaagggaa gagattagca tgaaaggtaa 2041 tctgaaacac agtcatgtgt canctgtaga aaggttgatt ctcatgcact ncaaatactt 2101 ccaaagagtc atcatggggg atttttcatt cttaggcttt cagtggtttg ttcctggaat 2161 tc // LOCUS HUMMPSI 329 bp mRNA PRI 07-JAN-1995 DEFINITION Homo sapiens metallopanstimulin (MPS1) mRNA, complete cds. ACCESSION L19739 NID g431318 KEYWORDS metallopanstimulin; nuclear zinc-finger protein; zinc finger protein. SOURCE Homo sapiens (tissue library: pcDNA-II) female adult mammary gland carcinoma cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Fernandez-Pol,J.A., Klos,D.J., Hamilton,P.D. and Schuette,V.M. TITLE A growth factor-induced gene product produced in a baculovirus expression system is a nuclear zinc-finger protein that binds to DNA JOURNAL J. Cell. Biochem. 16 C, 27-27 (1992) REFERENCE 2 (bases 1 to 329) AUTHORS Fernandez-Pol,J.A., Klos,D.J. and Hamilton,P.D. TITLE A growth factor-inducible gene encodes a novel nuclear protein with zinc finger structure JOURNAL J. Biol. Chem. 268 (28), 21198-21204 (1993) MEDLINE 94012671 FEATURES Location/Qualifiers source 1..329 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="MDA-MB-468" /dev_stage="adult" /sex="female" /tissue_type="mammary gland carcinoma" /tissue_lib="pcDNA-II" gene 21..275 /gene="MPS1" CDS 21..275 /gene="MPS1" /note="putative" /codon_start=1 /function="involved in cell growth" /product="metallopanstimulin" /db_xref="PID:g431319" /translation="MPLAKDLLHPSPEEEKRKHKKKRLVQSPNSYFMDVKCPGCYKIT TVFSHAQTVVLCVGCSTVLCQPTGGKARLTEGCSFRRKQH" BASE COUNT 102 a 84 c 76 g 67 t ORIGIN 1 cgacctacgc acacgagaac atgcctctcg caaaggatct ccttcatccc tctccagaag 61 aggagaagag gaaacacaag aagaaacgcc tggtgcagag ccccaattcc tacttcatgg 121 atgtgaaatg cccaggatgc tataaaatca ccacggtctt tagccatgca caaacggtag 181 ttttgtgtgt tggctgctcc actgtcctct gccagcctac aggaggaaaa gcaaggctta 241 cagaaggatg ttccttcagg aggaagcagc actaaaagca ctctgagtca agatgagtgg 301 gaaaccatct caacaaacac attttggat // LOCUS HUMMR 1083 bp DNA PRI 18-FEB-1993 DEFINITION Homo sapiens melanortin receptor gene, complete cds. ACCESSION L06155 NID g188673 KEYWORDS melanortin receptor. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1083) AUTHORS Gantz,I., Konda,Y., Tashiro,T., Shimoto,Y., Munzert,G., DelValle,J. and Yamada,T. TITLE Molecular cloning of a novel melanortin receptor JOURNAL Unpublished (1992) FEATURES Location/Qualifiers source 1..1083 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1..1083 /codon_start=1 /product="melanortin receptor" /db_xref="PID:g188674" /translation="MSIQKKYLEGDFVFPVSSSSFLRTLLEPQLGSALLTAMNASCCL PSVQPTLPNGSEHLQAPFFSNQSSSAFCEQVFIKPEIFLSLGIVSLLENILVILAVVR NGNLHSPMYFFLCSLAVADMLVSVSNALETIMIAIVHSDYLTFEDQFIQHMDNIFDSM ICISLVASICNLLAIAVDRYVTIFYALRYHSIMTVRKALTLIVAIWVCCGVCGVVFIV YSESKMVIVCLITMFFAMMLLMGTLYVHMFLFARLHVKRIAALPPADGVAPQQHSCMK GAVTITILLGVFIFCWAPFFLHLVLIITCPTNPYCICYTAHFNTYLVLIMCNSVIDPL IYAFRSLELRNTFREILCGCNGMNLG" BASE COUNT 208 a 361 c 251 g 263 t ORIGIN 1 atgagcatcc aaaagaagta tctggaggga gattttgtct ttcctgtgag cagcagcagc 61 ttcctacgga ccctgctgga gccccagctc ggatcagccc ttctgacagc aatgaatgct 121 tcgtgctgcc tgccctctgt tcagccaaca ctgcctaatg gctcggagca cctccaagcc 181 cctttcttca gcaaccagag cagcagcgcc ttctgtgagc aggtcttcat caagcccgag 241 attttcctgt ctctgggcat cgtcagtctg ctggaaaaca tcctggttat cctggccgtg 301 gtcaggaacg gcaacctgca ctccccgatg tacttctttc tctgcagcct ggcggtggcc 361 gacatgctgg taagtgtgtc caatgccctg gagaccatca tgatcgccat cgtccacagc 421 gactacctga ccttcgagga ccagtttatc cagcacatgg acaacatctt cgactccatg 481 atctgcatct ccctggtggc ctccatctgc aacctcctgg ccatcgccgt cgacaggtac 541 gtcaccatct tttacgcgct ccgctaccac agcatcatga ccgtgaggaa ggccctcacc 601 ttgatcgtgg ccatctgggt ctgctgcggc gtctgtggcg tggtgttcat cgtctactcg 661 gagagcaaaa tggtcattgt gtgcctcatc accatgttct tcgccatgat gctcctcatg 721 ggcaccctct acgtgcacat gttcctcttt gcgcggctgc acgtcaagcg catagcagca 781 ctgccacctg ccgacggggt ggccccacag caacactcat gcatgaaggg ggcagtcacc 841 atcaccattc tcctgggcgt gttcatcttc tgctgggccc ccttcttcct ccacctggtc 901 ctcatcatca cctgccccac caacccctac tgcatctgct acactgccca cttcaacacc 961 tacctggtcc tcatcatgtg caactccgtc atcgacccac tcatctacgc tttccggagc 1021 ctggaattgc gcaacacctt tagggagatt ctctgtggct gcaacggcat gaacttggga 1081 tag // LOCUS HUMMRA 5185 bp mRNA PRI 07-JAN-1995 DEFINITION Human mannose receptor mRNA, complete cds. ACCESSION J05550 NID g188675 KEYWORDS mannose receptor. SOURCE Human placenta, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5185) AUTHORS Taylor,M.E., Conary,J.T., Lennartz,M.R., Stahl,P.D. and Drickamer,K. TITLE Primary structure of the mannose receptor contains multiple motifs resembling carbohydrate-recognition domains JOURNAL J. Biol. Chem. 265 (21), 12156-12162 (1990) MEDLINE 90324192 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.Drickamer, 18-MAY-1990. FEATURES Location/Qualifiers source 1..5185 /organism="Homo sapiens" /db_xref="taxon:9606" /map="12" sig_peptide 104..157 /gene="M6PR" /note="mannose receptor signal peptide" CDS 104..4474 /gene="M6PR" /note="mannose receptor precursor" /codon_start=1 /db_xref="GDB:G00-120-162" /db_xref="PID:g188676" /translation="MRLPLLLVFASVIPGAVLLLDTRQFLIYNEDHKRCVDAVSPSAV QTAACNQDAESQKFRWVSESQIMSVAFKLCLGVPSKTDWVAITLYACDSKSEFQKWEC KNDTLLGIKGEDLFFNYGNRQEKNIMLYKGSGLWSRWKIYGTTDNLCSRGYEAMYTLL GNANGATCAFPFKFENKWYADCTSAGRSDGWLWCGTTTDYDTDKLFGYCPLKFEGSES LWNKDPLTSVSYQINSKSALTWHQARKSCQQQNAELLSITEIHEQTYLTGLTSSLTSG LWIGLNSLSFNSGWQWSDRSPFRYLNWLPGSPSAEPGKSCVSLNPGKNAKWENLECVQ KLGYICKKGNTTLNSFVIPSESDVPTHCPSQWWPYAGHCYKIHRDEKKIQRDALTTCR KEGGDLTSIHTIEELDFIISQLGYEPNDELWIGLNDIKIQMYFEWSDGTPVTFTKWLR GEPSHENNRQEDCVVMKGKDGYWADRGCEWPLGYICKMKSRSQGPEIVEVEKGCRKGW KKHHFYCYMIGHTLSTFAEANQTCNNENAYLTTIEDRYEQAFLTSFVGLRPEKYFWTG LSDIQTKGTFQWTIEEEVRFTHWNSDMPGRKPGCVAMRTGIAGGLWDVLKCDEKAKFV CKHWAEGVTHPPKPTTTPEPKCPEDWGASSRTSLCFKLYAKGKHEKKTWFESRDFCRA LGGDLASINNKEEQQTIWRLITASGSYHKLFWLGLTYGSPSEGFTWSDGSPVSYENWA YGEPNNYQNVEYCGELKGDPTMSWNDINCEHLNNWICQIQKGQTPKPEPTPAPQDNPP VTEDGWVIYKDYQYYFSKEKETMDNARAFCKRNFGDLVSIQSESEKKFLWKYVNRNDA QSAYFIGLLISLDKKFAWMDGSKVDYVSWATGEPNFANEDENCVTMYSNSGFWNDINC GYPNAFICQRHNSSINATTVMPTMPSVPSGCKEGWNFYSNKCFKIFGFMEEERKNWQE ARKACIGFGGNLVSIQNEKEQAFLTYHMKDSTFSAWTGLNDVNSEHTFLWTDGRGVHY TNWGKGYPGGRRSSLSYEDADCVVIIGGASNEAGKWMDDTCDSKRGYICQTRSDPSLT NPPATIQTDGFVKYGKSSYSLMRQKFQWHEAETYCKLHNSLIASILDPYSNAFAWLQM ETSNERVWIALNSNLTDNQYTWTDKWRVRYTNWAADEPKLKSACVYLDLDGYWKTAHC NESFYFLCKRSDEIPATEPPQLPGRCPESDHTAWIPFHGHCYYIESSYTRNWGQASLE CLRMGSSLVSIESAAESSFLSYRVEPLKSKTNFWIGLFRNVEGTWLWINNSPVSFVNW NTGDPSGERNDCVALHASSGFWSNIHCSSYKGYICKRPKIIDAKPTHELLTTKADTRK MDPSKPSSNVAGVVIIVILLILTGAGLAAYFFYKKRRVHLPQEGAFENTLYFNSQSSP GTSDMKDLVGNIEQNEHSVI" gene 104..4474 /gene="M6PR" mat_peptide 158..4471 /gene="M6PR" /note="mannose receptor" BASE COUNT 1602 a 999 c 1198 g 1386 t ORIGIN 1 gggaacttgg attaggtgga gaggcagttg gggggcctcg ttgttttgcg tcttagttcc 61 gccctcctgt ccatcaggag aaggaaagga taaaccctgg gccatgaggc tacccctgct 121 cctggttttt gcctctgtca ttccgggtgc tgttctccta ctggacacca ggcaattttt 181 aatctataat gaagatcaca agcgctgcgt ggatgcagtg agtcccagtg ccgtccaaac 241 cgcagcttgc aaccaggatg ccgaatcaca gaaattccga tgggtgtccg aatctcagat 301 tatgagtgtt gcatttaaat tatgcctggg agtgccatca aaaacagact gggttgctat 361 cactctctat gcctgtgact caaaaagtga atttcagaaa tgggagtgca aaaatgacac 421 acttttgggg atcaaaggag aagatttatt ttttaactac ggcaacagac aagaaaagaa 481 tattatgctc tacaagggat cgggtttatg gagcaggtgg aagatctatg gaaccacaga 541 caatctgtgc tccagaggtt atgaagccat gtatacgcta ctaggcaatg ccaatggagc 601 aacctgtgca ttcccgttca agtttgaaaa caagtggtac gcagattgca cgagtgctgg 661 gcggtcggat ggatggctct ggtgcggaac cactactgac tatgacacag acaagctatt 721 tggatattgt ccattgaaat ttgagggcag tgaaagctta tggaataaag acccgctgac 781 cagcgtttcc taccagataa actccaaatc cgctttaacg tggcaccaag cgaggaaaag 841 ctgccaacaa cagaacgctg agctcctgag catcacagag atacatgagc aaacatacct 901 gacaggatta accagttcct tgacctcagg actctggatt ggacttaaca gtctgagctt 961 caacagcggt tggcagtgga gtgaccgcag tcctttccga tatttgaact ggttaccagg 1021 aagtccatca gctgaacctg gaaaaagctg tgtgtcacta aatcctggaa aaaatgctaa 1081 atgggaaaat ctggaatgtg ttcagaaact gggctatatt tgcaaaaagg gcaacaccac 1141 tttaaattct tttgttattc cctcagaaag tgatgtgcct actcactgtc ctagtcagtg 1201 gtggccgtat gccggtcact gttacaagat tcacagagat gagaaaaaaa tccagaggga 1261 tgctctgacc acctgcagga aggaaggcgg tgacctcaca agtatccaca ccatcgagga 1321 attggacttt attatctccc agctaggata tgagccaaat gacgaattgt ggatcggctt 1381 aaatgacatt aagattcaaa tgtactttga gtggagtgat gggacccctg taacgtttac 1441 caaatggctt cgtggagaac caagccatga aaacaacaga caggaggatt gtgtggtgat 1501 gaaaggcaag gatgggtact gggcagatcg gggctgtgag tggcctcttg gctacatctg 1561 caagatgaaa tcacgaagcc aaggtccaga aatagtggaa gtcgaaaaag gctgcaggaa 1621 aggctggaaa aaacatcact tttactgcta tatgattgga catacgcttt caacatttgc 1681 agaagcaaac caaacctgta ataatgagaa tgcttattta acaactattg aagacagata 1741 tgaacaagcc ttcctgacta gtttcgttgg cttaaggcct gaaaaatatt tctggacagg 1801 actttcagat atacaaacca aagggacttt tcagtggacc atcgaggaag aggttcggtt 1861 cacccactgg aattcagata tgccagggcg aaagccaggg tgtgttgcca tgagaaccgg 1921 gattgcaggg ggcttatggg atgttttgaa atgtgatgaa aaggcaaaat ttgtgtgcaa 1981 gcactgggca gaaggagtaa cccacccacc gaagcccacg acgactcccg aacccaaatg 2041 tccggaggat tggggcgcca gcagtagaac aagcttgtgt ttcaagctgt atgcaaaagg 2101 aaaacatgag aagaaaacgt ggtttgaatc tcgagatttt tgtcgagctc tgggtggaga 2161 cttagctagc atcaataaca aagaggaaca gcaaacaata tggcgattaa taacagctag 2221 tggaagctac cacaaactgt tttggttggg attgacatat ggaagccctt cagaaggttt 2281 tacttggagt gatggttctc ctgtttcata tgaaaactgg gcttatggag aacctaataa 2341 ttatcaaaat gttgaatact gtggtgagct gaaaggtgac cctactatgt cttggaatga 2401 tattaattgt gaacacctta acaactggat ttgccagata caaaaaggac aaacaccaaa 2461 acctgagcca acaccagctc ctcaagacaa tccaccagtt actgaagatg ggtgggttat 2521 ttacaaagac taccagtatt atttcagcaa agagaaggaa accatggaca atgcgcgagc 2581 gttttgcaag aggaattttg gtgatcttgt ttctattcaa agtgaaagtg aaaagaagtt 2641 tctatggaaa tatgtaaaca gaaatgatgc acagtctgca tattttattg gtttattgat 2701 cagcttggat aaaaagtttg cttggatgga tggaagcaaa gtggattacg tgtcttgggc 2761 cacaggtgaa cccaattttg caaatgaaga tgaaaactgt gtgaccatgt attcaaattc 2821 agggttttgg aatgacatta actgtggcta tccaaacgcc ttcatttgcc agcgacataa 2881 cagtagtatc aatgctacca cagttatgcc taccatgccc tcggtcccat cagggtgcaa 2941 ggaaggttgg aatttctaca gcaacaagtg tttcaaaatc tttggattta tggaagaaga 3001 aagaaaaaat tggcaagagg cacgaaaagc ttgtataggc tttggaggga atctggtctc 3061 catacaaaat gaaaaagagc aagcatttct tacctatcac atgaaggact ccactttcag 3121 tgcctggact gggctgaatg atgtcaattc agaacacacg ttcctttgga cggatggacg 3181 aggagtccat tacacaaact gggggaaagg ttaccctggt ggaagaagaa gcagtctttc 3241 ttatgaagat gctgactgtg ttgttattat tggaggtgca tcaaatgaag caggaaaatg 3301 gatggatgat acctgcgaca gtaaacgagg ctacatatgc cagacacgat ccgacccttc 3361 cttgactaat cctccagcaa cgattcaaac agatggcttt gttaaatatg gcaaaagcag 3421 ctattcactc atgagacaaa aatttcaatg gcatgaagcg gagacatact gcaagcttca 3481 caattccctt atagccagca ttctggatcc ctacagtaat gcatttgcgt ggctgcagat 3541 ggaaacatct aatgaacgtg tgtggatcgc cctgaacagt aacttgactg ataatcaata 3601 cacttggact gataagtgga gggtgaggta cactaactgg gctgctgatg agcccaaatt 3661 gaaatcagca tgtgtttatc tggatcttga tggctactgg aagacagcac attgcaatga 3721 aagtttttac tttctctgta aaagatcaga tgaaatccct gctactgaac ccccacaact 3781 gcctggcaga tgcccggagt cagatcacac agcatggatt cctttccatg gtcactgtta 3841 ctatattgag tcctcatata caagaaactg gggccaagct tctctggaat gtcttcgaat 3901 gggttcctct ctggtttcca ttgaaagtgc tgcagaatcc agttttctgt catatcgggt 3961 tgagccactt aaaagtaaaa ccaatttttg gataggattg ttcagaaatg ttgaagggac 4021 gtggctgtgg ataaataaca gtccggtctc ctttgtcaac tggaacacag gagatccctc 4081 tggtgaacgg aatgattgtg tagctttaca tgcgtcttct gggttttgga gtaatattca 4141 ctgttcttcc tacaaaggat atatttgtaa aagaccaaaa attattgatg ctaaacctac 4201 tcatgaatta cttacaacaa aagctgacac aaggaagatg gacccttcta aaccgtcttc 4261 caacgtggcc ggagtagtca tcattgtgat cctcctgatt ttaacgggtg ctggccttgc 4321 cgcctatttc ttttataaga aaagacgtgt gcacctacct caagagggcg cctttgaaaa 4381 cactctgtat tttaacagtc agtcaagccc aggaactagt gatatgaaag atctcgtggg 4441 caatattgaa cagaatgaac actcggtcat ctagtacctc aatgcgattc tgagatattt 4501 gaatttcata aaattgtaac tgaaatttaa aatttttagt tcaatgtgat tgttttcttt 4561 aaaatgagta ctgaattgta ctggtctgtc cttttttcct ttgcctaatt gaagaaataa 4621 ttgcttgttt tctagcctgg caagatattt tcataaaaga gggataacaa tgctgattac 4681 taccttttaa aatattttag ataaatgcac agcaccacag caccacatct aagcattagt 4741 gatgggtagc tgatgtcagc ttcatgtgga ttttaagcac tctagaaaca atgaagcttc 4801 ttggcatatt ttaaggagct cccaaaatgt gttacctatt aaattgtaac tcagcaagta 4861 gaagaccatt tgaaaagtca ggtacaaatt tcctcaagtg gcataaaaat gtagtcagtt 4921 ttctctttta ccagttttta tttccactcc aattatttag aactttattt gtacatgtgc 4981 agaagaataa ggcagctgag aatcttgttt cccccaagag agttttacag gctgagtgtt 5041 gcaaatgtgt tctttgtcct gttatatgta tatcaggaat acaaggatgt gaaataaaac 5101 tgtaaatttg cataactgga tgtacttaga taatgtgaaa taaacattaa agacaaggtc 5161 tatttttaat aaaaaaaaaa aaaaa // LOCUS HUMMSS1 1478 bp mRNA PRI 24-JUL-1992 DEFINITION Human mRNA for MSS1, complete cds. ACCESSION D11094 NID g219930 KEYWORDS mammalian suppressor of sgv1; transactivation factor. SOURCE Human Hela cell cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1478) AUTHORS Shibuya,H., Irie,K., Ninomiya-Tsuji,J., Goebl,M., Taniguchi,T. and Matsumoto,K. TITLE New human gene encoding a positive modulator of HIV Tat-mediated transactivation JOURNAL Nature 357 (6380), 700-702 (1992) MEDLINE 92310549 REFERENCE 2 (bases 1 to 1478) AUTHORS Irie,K. TITLE Direct Submission JOURNAL Submitted (07-MAY-1992) to the DDBJ/EMBL/GenBank databases. Kenji Irie, Nagoya University, Faculty of Science, Dept. of Molecular Biology; Furo-cho, Chigusa-ku, Nagoya 464-01, Japan (Tel:052-782-4493, Fax:052-782-8575) COMMENT Submitted (07-MAY-1992) to DDBJ by: Kenji Irie Department of Molecular Biology Faculty of Science, Nagoya University Chikusa-ku, Nagoya 464-01 Japan Phone: 052-782-4493 Fax: 052-782-8575. FEATURES Location/Qualifiers source 1..1478 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Hela cell" gene 67..1368 /gene="MSS1" CDS 67..1368 /gene="MSS1" /function="positive modulator of HIV Tat-mediated transactivation" /codon_start=1 /product="MSS1 protein" /db_xref="PID:d1002345" /db_xref="PID:g219931" /translation="MPDYLGADQRKTKEDEKDDKPIRALDEGDIALLKTYGQSTYSRQ IKQVEDDIQQLLKKINELTGIKESDTGLAPPALWDLAADKQTLQSEQPLQVARCTKII NADSEDPKYIINVKQFAKFVVDLSDQVAPTDIEEGMRVGVDRNKYQIHIPLPPKIDPT VTMMQVEEKPDVTYSDVGGCKEQIEKLREVVETPLLHPERFVNLGIEPPKGVLLFGPP GTGKTLCARAVANRTDACFIRVIGSELVQKYVGEGARMVRELFEMARTKKACLIFFDE IDAIGGARFDDGAGGDNEVQRTMLELINQLDGFDPRGNIKVLMATNRPDTLDPALMRP GRLDRKIEFSLPDLEGRTHIFKIHARSMSVERDIRFELLARLCPNSTGAEIRSVCTEA GMFAIRARRKIATEKDFLEAVNKVIKSYAKFSATPRYMTYN" BASE COUNT 439 a 287 c 375 g 377 t ORIGIN 1 ccattgtgct ctaaagggaa ggtgctgtgt aatcattaag gagcggaggc ttttggagct 61 gctaaaatgc cggattacct cggtgccgat cagcggaaga ccaaagagga tgagaaggac 121 gacaagccca tccgagctct ggatgagggg gatattgcct tgttgaaaac ttatggtcag 181 agcacttact ctaggcagat caagcaagtt gaagatgaca ttcagcaact tctcaagaaa 241 attaatgagc tcactggtat taaagaatct gacactggcc tggccccacc agcactctgg 301 gatttggctg cagataagca gacactccag agtgaacagc ctttacaggt tgccaggtgt 361 acaaagataa tcaatgctga ttcggaggac ccaaaataca ttatcaacgt aaagcagttt 421 gccaagtttg tggtggacct tagtgatcag gtggcaccta ctgacattga agaagggatg 481 agagtgggcg tggatagaaa taaatatcaa attcacattc cattgcctcc taagattgac 541 ccaacagtta ccatgatgca ggtggaagag aaacctgatg tcacatacag tgatgttggt 601 ggctgtaagg aacagattga gaaactgcga gaagtagttg aaaccccatt acttcatcca 661 gagaggtttg tgaaccttgg cattgagcct cccaagggcg tgctgctctt tggtccaccc 721 ggtacaggca agacactctg tgcgcgggca gttgctaatc ggactgatgc gtgcttcatt 781 cgagttattg gatctgagct tgtacagaaa tacgtcggtg agggggctcg aatggttcgt 841 gaactctttg aaatggccag aacaaaaaaa gcctgcctta tcttctttga tgaaattgat 901 gctattggag gggctcgttt tgatgatggt gctggaggtg acaatgaagt gcagagaaca 961 atgttggaac tgatcaatca gcttgatggt tttgatcctc gaggcaatat taaagtgctg 1021 atggccacta acagacctga tactttggat ccagcactga tgaggccagg gagattggat 1081 agaaaaattg aatttagctt gcccgatcta gagggtcgga cccacatatt taagattcac 1141 gctcgttcaa tgagtgttga aagagatatc agatttgaac tgttagcacg actgtgtcca 1201 aatagcactg gtgctgagat tagaagcgtc tgcacagagg ctggtatgtt tgccatcaga 1261 gcacggcgaa aaattgctac cgagaaggat ttcttggaag ctgtaaataa ggtcattaag 1321 tcttatgcca aattcagtgc tactcctcgt tacatgacat acaactgaac cctgaaggct 1381 ttcaagtgaa aactttaaat tggaatccta accttatata gacttgttaa taaccaattc 1441 ataaacaaat aaatggcttc aactttagag cacaatgg // LOCUS HUMMTA 16559 bp DNA circular PRI 15-JUN-1996 DEFINITION Human mitochondrial DNA, complete sequence. ACCESSION D38112 NID g644480 KEYWORDS 12S rRNA; 16S rRNA; ATPase subunit 6; ATPase subunit 8; NADH dehydrogenase subunit 1; NADH dehydrogenase subunit 2; NADH dehydrogenase subunit 3; NADH dehydrogenase subunit 4; NADH dehydrogenase subunit 4L; NADH dehydrogenase subunit 5; NADH dehydrogenase subunit 6; cytochrome b; cytochrome c oxidase subunit I; cytochrome c oxidase subunit II; cytochrome c oxidase subunit III; tRNA-Ala; tRNA-Arg; tRNA-Asn; tRNA-Asp; tRNA-Cys; tRNA-Gln; tRNA-Glu; tRNA-Gly; tRNA-His; tRNA-Leu(CUN); tRNA-Leu(UUR); tRNA-Lys; tRNA-Met; tRNA-Phe; tRNA-Pro; tRNA-Ser(AGY); tRNA-Ser(UCN); tRNA-Thr; tRNA-Trp; tRNA-Tyr; tRNA-Val. SOURCE Homo sapiens (strain:African, isolate:SB17) placenta mitochondrion DNA. ORGANISM Mitochondrion Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Horai,S., Hayasaka,K., Kondo,R., Tsugane,K. and Takahata,N. TITLE Recent African origin of modern humans revealed by complete sequences of hominoid mitochondrial DNAs JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (2), 532-536 (1995) MEDLINE 95132634 REFERENCE 2 (bases 1 to 16559) AUTHORS Horai,S., Kazuo,T., Hayasaka,K., Naoyuki,T. and Rumi,K. JOURNAL Unpublished (1995) REFERENCE 3 (bases 1 to 16559) AUTHORS Hayasaka,K. TITLE Direct Submission JOURNAL Submitted (02-SEP-1994) to the DDBJ/EMBL/GenBank databases. Kenji Hayasaka, National Institute of Genetics, Human Genetics; 1,111 Yata, Mishima, Shizuoka 411, Japan (E-mail:khayasak@ddbj.nig.ac.jp, Tel:81-559-75-0771(ex.568)) FEATURES Location/Qualifiers source 1..16559 /organism="Homo sapiens" /mitochondrion /isolate="SB17" /strain="African" /db_xref="taxon:9606" /tissue_type="placenta" tRNA 1..71 /product="tRNA-Phe" rRNA 72..1025 /product="12S rRNA" tRNA 1026..1094 /product="tRNA-Val" rRNA 1095..2652 /product="16S rRNA" tRNA 2653..2727 /product="tRNA-Leu(UUR)" CDS 2730..3686 /codon_start=1 /product="NADH dehydrogenase subunit 1" /db_xref="PID:d1007868" /db_xref="PID:g1262579" /transl_table=2 /translation="MPMANLLLLIVPILIAMAFLMLTERKILGYMQLRKGPNVVGPYG LLQPFADAMKLFTKEPLKPATSTITLYITAPTLALTIALLLWAPLPMPNPLVNLNLGL LFILATSSLAVYSILWSGWASNSNYALIGALRAVAQTISYEVTLAIILLSTLLMSGSF NLSTLITAQEHLWLLLPSWPLAMMWFISTLAETNRTPFDLAEGESELVSGFNIEYAAG PFALFFMAEYTNIIMMNTLTTTIFLGTTYDALSPELYTTYFVTKTLLLTSLFLWIRTA YPRFRYDQLMHLLWKNFLPLTLALLMWYVSMPITISSIPPQT" tRNA 3686..3754 /product="tRNA-Ile" tRNA complement(3752..3823) /product="tRNA-Gln" tRNA 3825..3892 /product="tRNA-Met" CDS 3893..4936 /codon_start=1 /product="NADH dehydrogenase subunit 2" /db_xref="PID:d1007869" /db_xref="PID:g1262580" /transl_table=2 /translation="MNPLAQPVIYSTIFAGTLITALSSHWFFTWVGLEMNMLAFIPIL TKKMNPRSTEAAIKYFLTQATASMILLMAILFNNMLSGQWTMTNTTNQYSSLMIMMAM AMKLGMAPFHFWVPEVTQGTPLTSGLLLLTWQKLAPISIMYQISPSLNVSLLLTLSIL SIMAGSWGGLNQTQLRKILAYSSITHMGWMMAVLPYNPNMTILNLTIYIILTTTAFLL LNLNSSTTTLLLSRTWNKLTWLTPLIPSTLLSLGGLPPLTGFLPKWAIIEEFTKNNSL IIPTIMATITLLNLYFYLRLIYSTSITLLPMSNNVKMKWQFEHTKPTPLLPTLITLTT LLLPISPFMLMIL" tRNA 4935..5002 /product="tRNA-Trp" tRNA complement(5010..5078) /product="tRNA-Ala" tRNA complement(5080..5152) /product="tRNA-Asn" tRNA complement(5184..5249) /product="tRNA-Cys" tRNA complement(5249..5314) /product="tRNA-Tyr" CDS 5327..6868 /codon_start=1 /product="cytochrome c oxidase subunit 1" /db_xref="PID:d1007870" /db_xref="PID:g1262581" /transl_table=2 /translation="MFADRWLFSTNHKDIGTLYLLFGAWAGVLGTALSLLIRAELGQP GNLLGNDHIYNVIVTAHAFVMIFFMVMPIMIGGFGNWLVPLMIGAPDMAFPRMNNMSF WLLPPSLLLLLASAMVEAGAGTGWTVYPPLAGNYSHPGASVDLTIFSLHLAGVSSILG AINFITTIINMKPPAMTQYQTPLFVWSVLITAVLLLLSLPVLAAGITMLLTDRNLNTT FFDPAGGGDPILYQHLSWFFGHPEVYILILPGFGMISHIVTYYSGKKEPFGYMGMVWA MMSIGFLGFIVWAHHMFTVGMDVDTRAYFTSATMIIAIPTGVKVFSWLATLHGSNMKW SAAVLWALGFIFLFTVGGLTGIVLANSSLDIVLHDTYYVVAHFHYVLSMGAVFAIMGG FIHWFPLFSGYTLDQTYAKIHFAIMFIGVNLTFFPQHFLGLSGMPRRYSDYPDAYTTW NILSSVGSFISLTAVMLMIFMIWEAFASKRKVLMVEEPSMNLEWLYGCPPPYHTFEEP VYMKS" tRNA complement(6868..6939) /product="tRNA-Ser(UCN)" tRNA 6941..7008 /product="tRNA-Asp" CDS 7009..7692 /codon_start=1 /product="cytochrome c oxidase subunit 2" /db_xref="PID:d1007871" /db_xref="PID:g704444" /transl_table=2 /translation="MAHAAQVGLQDATSPIMEELITFHDHALMIIFLICFLVLYALFL TLTTKLTNTNISDAQEMETVWTILPAIILVLIALPSLRILYMTDEVNDPSLTIKSIGH QWYWTYEYTDYGGLIFNSYMLPPLFLEPGDLRLLDVDNRVVLPIEAPIRMMITSQDVL HSWAVPTLGLKTDAIPGRLNQTTFTATRPGVYYGQCSEICGANHSFMPIVLELIPLKI FEMGPVFTL" tRNA 7709..7778 /product="tRNA-Lys" CDS 7780..7986 /codon_start=1 /product="ATPase subunit 8" /db_xref="PID:d1007872" /db_xref="PID:g704445" /transl_table=2 /translation="MPQLNTTVWPTMITPMLLTLFLITQLKMLNTSYHLPPSPKPMKM KNYNKPWEPKWTKICSLHSLPPQS" CDS 7941..8621 /codon_start=1 /product="ATPase subunit 6" /db_xref="PID:d1007873" /db_xref="PID:g1262582" /transl_table=2 /translation="MNENLFASFIAPTVLGLPAAVLIILFPPLLIPTSKYLINNRLIT TQQWLIKLTSKQMMAMHNTKGRTWSLMLVSLIIFIATTNLLGLLPHSFTPTTQLSMNL AMAIPLWAGAVIMGFRSKIKNALAHFLPQGTPTPLIPMLVIIETISLLIQPMALAVRL TANITAGHLLMHLIGSATLAMSTINLPSTLIIFTILILLTILEIAVALIQAYVFTLLV SLYLHDNT" mat_peptide 8621..(9403.9406) /product="cytochrome c oxidase subunit 3" tRNA 9405..9472 /product="tRNA-Gly" mat_peptide 9473..9817 /product="NADH dehydrogenase subunit 3" tRNA 9819..9883 /product="tRNA-Arg" CDS 9884..10180 /codon_start=1 /product="NADH dehydrogenase subunit 4L" /db_xref="PID:d1007874" /db_xref="PID:g704446" /transl_table=2 /translation="MPLIYMNIMLAFTISLLGMLVYRSHLMSSLLCLEGMMLSLFIMA TLMTLNTHSLLANIVPIAMLVFAACEAAVGLALLVSISNTYGLDYVHNLNLLQC" mat_peptide 10174..11550 /product="NADH dehydrogenase subunit 4" tRNA 11552..11620 /product="tRNA-His" tRNA 11621..11679 /product="tRNA-Ser(AGY)" tRNA 11680..11750 /product="tRNA-Leu(CUN)" CDS 11751..13562 /codon_start=1 /product="NADH dehydrogenase subunit 5" /db_xref="PID:d1007875" /db_xref="PID:g704447" /transl_table=2 /translation="MTMHTTMTTLTLTSLIPPILTTLVNPNKKNSYPHYVKSIVASTF IISLFPTTMFMCLDQEVIISNWHWATTQTTQLSLSFKLDYFSMMFIPVALFVTWSIME FSLWYMNSDPNINQFFKYLLIFLITMLILVTANNLFQLFIGWEGVGIMSFLLISWWYA RADANTAAIQAILYNRIGDIGFILALAWFILHSNSWDPQQMALLNANPSLPPLLGLLL AAAGKSAQLGLHPWLPSAMEGPTPVSALLHSSTMVVAGVFLLIRFHPLAENSPLIQTL TLCLGAITTLFAAVCALTQNDIKKIVAFSTSSQLGLMVVTIGINQPHLAFLHICTHAF FKAMLFMCSGSIIHNLNNEQDIRKMGGLLKTMPLTSTSLTIGSLALAGMPFLTGFYSK DHIIETANMSYTNAWALSITLIATSLTSAYSTRMILLTLTGQPRFPTLTNINENNPTL LNPIKRLAAGSLFAGFLITNNISPASPFQTTIPLYLKLTALAVTFLGLLTALDLNYLT NKLKMKSPLCTFYFSNMLGFYPSITHRTIPYLGLLTSQNLPLLLLDLTWLEKLLPKTI SQHQISTSIITSTQKGMIKLYFLSFFFPLILTLLLIT" CDS complement(13563..14087) /codon_start=1 /product="NADH dehydrogenase subunit 6" /db_xref="PID:d1007876" /db_xref="PID:g704448" /transl_table=2 /translation="MMYALFLLSVGLVMGFVGFSSKPSPIYGGLVLIVSGVVGCVIIL NFGGGYMGLMVFLIYLGGMMVVFGYTTAMAIEEYPEAWGSGVEVLVSVLVGLAMEVGL VLWVKEYDGVVVVVNFNSVGSWMIYEGEGSGLIREDPIGAGALYDYGRWLVVVTGWTL FVGVYIVIEIARGN" tRNA complement(14088..14156) /product="tRNA-Glu" mat_peptide 14161..15300 /product="cytochrome b" tRNA 15302..15367 /product="tRNA-Thr" tRNA complement(15370..15437) /product="tRNA-Pro" D-loop 15438..16559 BASE COUNT 5117 a 5169 c 2177 g 4096 t ORIGIN 1 gtttatgtag cttacctcct caaagcaata cactgaaaat gtttagacgg gctcacatca 61 ccccataaac aaataggttt ggtcctagcc tttctattag ctcttagtaa gattacacat 121 gcaagcatcc ccgttccagt gagttcaccc tctaaatcac cacgatcaaa agggacaagc 181 atcaagcacg caacaatgca gctcaaaacg cttagcctag ccacaccccc acgggaaaca 241 gcagtgataa acctttagca ataaacgaaa gtttaactaa gctatactaa ccccagggtt 301 ggtcaatttc gtgccagcca ccgcggtcac acgattaacc caagtcaata gaagccggcg 361 taaagagtgt tttagatcac cccctcccca ataaagctaa aactcacctg agttgtaaaa 421 aactccagtt gacacaaaat aaactacgaa agtggcttta acatatctga atacacaata 481 gctaagaccc aaactgggat tagatacccc actatgctta gccctaaacc tcaacagtta 541 aatcaacaaa actgctcgcc agaacactac gagccacagc ttaaaactca aaggacctgg 601 cggtgcttca tatccctcta gaggagcctg ttctgtaatc gataaacccc gatcaacctc 661 accacctctt gctcagccta tataccgcca tcttcagcaa accctgatga aggctacaaa 721 gtaagcgcaa gtacccacgt aaagacgtta ggtcaaggtg tagcccatga ggtggcaaga 781 aatgggctac attttctacc ccagaaaact acgatagccc ttatgaaact taagggtcga 841 aggtggattt agcagtaaac tgagagtaga gtgcttagtt gaacagggcc ctgaagcgcg 901 tacacaccgc ccgtcaccct cctcaagtat acttcaaagg acatttaact aaaaccccta 961 cgcatttata tagaggagac aagtcgtaac atggtaagtg tactggaaag tgcacttgga 1021 cgaaccagag tgtagcttaa cacaaagcac ccaacttaca cttaggagat ttcaacttaa 1081 cttgaccgct ctgagctaaa cctagcccca aacccactcc accttactac cagacaacct 1141 tagccaaacc atttacccaa ataaagtata ggcgatagaa attgaaacct ggcgcaatag 1201 atatagtacc gcaagggaaa gatgaaaaat tataaccaag cataatatag caaggactaa 1261 cccctatacc ttctgcataa tgaattaact agaaataact ttgcaaggag agccaaagct 1321 aagacccccg aaaccagacg agctacctaa gaacagctaa aagagcacac ccgtctatgt 1381 agcaaaatag tgggaagatt tataggtaga ggcgacaaac ctaccgagcc tggtgatagc 1441 tggttgtcca agatagaatc ttagttcaac tttaaatttg cccacagaac cctctaaatc 1501 cccttgtaaa tttaactgtt agtccaaaga ggaacagctc tttggacact aggaaaaaac 1561 cttgtagaga gagtaaaaaa tttaacaccc atagtaggcc taaaagcagc caccaattaa 1621 gaaagcgttc aagctcaaca cccactacct aaaaaatccc aaacatatga ctgaactcct 1681 cacacccaat tggaccaatc tatcacccta tagaagaact aatgttagta taagtaacat 1741 gaaaacattc tcctccgcat aagcctgcgt cagattaaaa cactgaactg acaattaaca 1801 gcccaatatc tacaatcaac caacaagtca ttattaccct cactgtcaac ccaacacagg 1861 catgctcata aggaaaggtt aaaaaaagta aaaggaactc ggcaaatctt accccgcctg 1921 tttaccaaaa acatcacctc tagcatcacc agtattagag gcaccgcctg cccagtgaca 1981 catgtttaac ggccgcggta ccctaaccgt gcaaaggtag cataatcact tgttccttaa 2041 atagggacct gtatgaatgg ctccacgagg gttcagctgt ctcttacttt taaccagtga 2101 aattgacctg cccgtgaaga ggcgggcatg acacagcaag acgagaagac cctatggagc 2161 tttaatttat taatgcaaac aatacctaac aaacccacag gtcctaaact accaaacctg 2221 cattaaaaat ttcggttggg gcgacctcgg agcagaaccc aacctccgag cagtacatgc 2281 caagacttca ccagtcaaag cgaactacca tactcaattg atccaataac ttgaccaacg 2341 gaacaagtta ccctagggat aacagcgcaa tcctattcta gagtccatat caacaatagg 2401 gtttacgacc tcgatgttgg atcaggacat cccgatggtg cagccgctat taaaggttcg 2461 tttgttcaac gattaaagtc ctacgtgatc tgagttcaga ccggagtaat ccaggtcggt 2521 ttctatctac ttcaaattcc tccctgtacg aaaggacaag agaaataagg cctacttcac 2581 aaagcgcctt cccccgtaaa tgatatcatc tcaacttagt attataccca cacccaccca 2641 agaacagggt ttgttaagat ggcagagccc ggtaatcgca taaaacttaa aactttacag 2701 tcagaggttc aattcctctt cttaacaaca tacccatggc caacctccta ctcctcattg 2761 tacccattct aatcgcaatg gcattcctaa tgcttaccga acgaaaaatt ctaggctata 2821 tacaactacg caaaggcccc aacgttgtag gcccctacgg gctactacaa cccttcgctg 2881 acgccataaa actcttcacc aaagagcccc taaaacccgc cacatctacc atcaccctat 2941 acatcaccgc cccgacctta gctctcacca tcgctcttct actatgagcc cccctcccca 3001 tacccaaccc cctggttaac ctcaacctag gcctcctatt tattctagcc acctctagcc 3061 tagccgttta ctcaatcctc tgatcagggt gagcatcaaa ctcaaactac gccctgatcg 3121 gcgcactgcg agcagtagcc caaacaatct catatgaagt caccctagcc atcattctac 3181 tatcaacatt actaataagt ggctccttta acctctccac ccttatcaca gcacaagaac 3241 acctctgatt actcctgcca tcatgaccct tggccataat atgatttatc tccacactag 3301 cagagaccaa ccgaaccccc ttcgaccttg ccgaagggga gtccgaacta gtctcaggct 3361 tcaacatcga atacgccgca ggccccttcg ccctattctt catagccgaa tacacaaaca 3421 ttattataat aaacaccctc accactacaa tcttcctagg aacaacatat gacgcactct 3481 cccctgaact ctacacaaca tattttgtca ccaagaccct acttctgacc tccctgttct 3541 tatgaattcg aacagcatac ccccgattcc gctacgacca actcatacac ctcctatgaa 3601 aaaacttcct accactcacc ctagcattac ttatatgata tgtctccata cccattacaa 3661 tctccagcat tccccctcaa acctaagaaa tatgtctgat aaaagagtta ctttgataga 3721 gtaaataata ggagtttaaa cccccttatt tctaggacta tgagaatcga acccatccct 3781 gagaatccaa aattctccgt gccacctatc acaccccatc ctaaagtaag gtcagctaaa 3841 taagctatcg ggcccatacc ccgaaaatgt tggttatacc cttcccgtac taattaatcc 3901 cctggcccaa cccgtcatct actctaccat ctttgcaggc acactcatca cagcgctaag 3961 ctcgcactga ttttttacct gagtaggcct agaaataaac atgctagcct ttattccaat 4021 tctaaccaaa aaaataaacc ctcgttccac agaagctgcc atcaagtatt tcctcacgca 4081 agcaaccgca tccataatcc ttctaatagc tatcctcttc aacaatatac tctccggaca 4141 atgaaccata accaatacta ccaatcaata ctcatcatta ataatcataa tggctatagc 4201 aataaaacta ggaatagccc cctttcactt ctgagtccca gaggttaccc aaggcacccc 4261 tctgacatcc ggcctgcttc ttctcacatg acaaaaacta gcccccatct caatcatata 4321 ccaaatctct ccctcactaa acgtaagcct tctcctcact ctctcaatct tatccatcat 4381 agcaggcagt tgaggtggat taaaccaaac ccagctacgc aaaatcttag catactcctc 4441 aattacccac ataggatgaa taatagcagt tctaccgtac aaccctaaca taaccattct 4501 taatttaact atttatatta tcctaactac taccgcattc ctactactca acttaaactc 4561 cagcaccaca accctactac tatctcgcac ctgaaacaag ctaacatgac taacaccctt 4621 aattccatcc accctcctct ccctaggagg cctgcccccg ctaaccggct ttttgcccaa 4681 atgggccatt atcgaagaat tcacaaaaaa caatagcctc atcatcccca ccatcatagc 4741 caccatcacc ctccttaacc tctacttcta cctacgccta atctactcca cctcaatcac 4801 actactcccc atatctaaca acgtaaaaat aaaatgacag tttgaacata caaaacccac 4861 cccactcctc cccacactca tcacccttac cacgctactc ctacctatct ccccttttat 4921 actaataatc ttatagaaat ttaggttaaa tacagaccaa gagccttcaa agccctcagt 4981 aagttgcaat acttaatttc tgtgacagct aaggactgca aaacctcact ctgcatcaac 5041 tgaacgcaaa tcagccactt taattaagct aagcccttac tagaccaatg ggacttaaac 5101 ccacaaacac ttagttaaca gctaagcacc ctagtcaact ggcttcaatc tacttctccc 5161 gccgccggga aaaaaggcgg gagaagcccc ggcaggtttg aagctgcttc ttcgaatttg 5221 caattcaata tgaaaatcac ctcggagctg gtaaaaagag gcctaacccc tgtctttaga 5281 tttacagtcc aatgcttcac tcagccattt tacctcaccc ccactgatgt tcgccgaccg 5341 ttgactattc tctacaaacc acaaagacat tggaacacta tacctattat tcggcgcatg 5401 agctggagtc ctaggcacag ctctaagcct ccttattcga gccgagctgg gccagccagg 5461 caaccttcta ggtaacgacc acatctacaa cgttatcgtc acagcccatg catttgtaat 5521 aatcttcttc atagtaatac ccatcataat cggaggcttt ggcaactgac tagttcccct 5581 aataatcggt gcccccgata tggcgttccc ccgcataaac aacataagct tctgactctt 5641 acctccctct ctcctactcc tgctcgcatc tgctatagta gaggccggag caggaacagg 5701 ttgaacagtc taccctccct tagcagggaa ctactcccac cctggagcct ccgtagacct 5761 aaccatcttc tccttacacc tagcaggtgt ctcctctatc ttaggggcca tcaatttcat 5821 cacaacaatt atcaatataa aaccccctgc cataacccaa taccaaacgc ccctcttcgt 5881 ctgatccgtc ctaatcacag cagtcctact tctcctatct ctcccagtcc tagctgctgg 5941 catcactata ctactaacag accgcaacct caacaccacc ttcttcgacc ccgccggagg 6001 aggagacccc attctatacc aacacctatc ctgatttttc ggtcaccctg aagtttatat 6061 tcttatccta ccaggcttcg gaataatctc ccatattgta acttactact ccggaaaaaa 6121 agaaccattt ggatacatag gtatggtctg agctatgata tcaattggct tcctagggtt 6181 tatcgtgtga gcacaccata tatttacagt aggaatagac gtagacacac gagcatattt 6241 cacctccgct accataatca tcgctatccc caccggcgtc aaagtattta gctgactcgc 6301 cacactccac ggaagcaata tgaaatgatc tgctgcagtg ctctgagccc taggattcat 6361 ctttcttttc accgtaggtg gcctgactgg cattgtatta gcaaactcat cactagacat 6421 cgtactacac gacacgtact acgttgtagc tcacttccac tatgtcctat caataggagc 6481 tgtatttgcc atcataggag gcttcattca ctgatttccc ctattctcag gctacaccct 6541 agaccaaacc tacgccaaaa tccatttcgc tatcatattc atcggcgtaa atctaacttt 6601 cttcccacaa cactttctcg gcctatccgg aatgccccga cgttactcgg actaccccga 6661 tgcatacacc acatgaaata tcctatcatc tgtaggctca ttcatttctc taacagcagt 6721 aatattaata attttcatga tttgagaagc cttcgcttcg aagcgaaaag tcctaatagt 6781 agaagaaccc tccataaacc tggagtgact atatggatgc cccccaccct accacacatt 6841 cgaagaaccc gtatacataa aatctagaca aaaaaggaag gaatcgaacc ccccaaagct 6901 ggtttcaagc caaccccatg gcctccatga ctttttcaaa aagatattag aaaaaccatt 6961 tcataacttt gtcaaagtta aatcataggc taaatcctat atatcttaat ggcacatgca 7021 gcgcaagtag gtctacaaga cgctacttcc cctatcatag aagagcttat cacctttcat 7081 gatcacgccc tcataatcat tttccttatc tgcttcctag tcctgtatgc ccttttccta 7141 acactcacaa caaaactaac taatactaac atctcagacg ctcaggaaat agaaaccgtc 7201 tgaactatcc tgcccgccat catcctagtc ctcatcgccc tcccatccct acgcatcctt 7261 tacataacag acgaggtcaa cgatccctcc cttaccatca aatcaattgg ccaccaatgg 7321 tactgaacct acgagtacac cgactacggc ggactaatct tcaactccta catacttccc 7381 ccattattcc tagaaccagg cgacctgcga ctccttgacg ttgacaatcg agtagtactc 7441 ccgattgaag cccccattcg tataataatt acatcacaag acgtcttgca ctcatgagct 7501 gtccccacat taggcttaaa aacagatgca attcccggac gtctaaacca aaccactttc 7561 accgctacac gaccgggggt atactacggt caatgctctg aaatctgtgg agcaaaccac 7621 agtttcatgc ccatcgtcct agaattaatt cccctaaaaa tctttgaaat aggacccgta 7681 tttaccctat agcaccccct ctagagccca ctgtaaagct aacttagcat taacctttta 7741 agttaaagat taagagaacc aacacctctt tacagtgaaa tgccccaact aaatactacc 7801 gtatggccca ccataattac ccccatactc cttacactat ttctcatcac ccaactaaaa 7861 atattaaaca caagctacca cttacctccc tcaccaaagc ccataaaaat aaaaaattat 7921 aacaaaccct gagaaccaaa atgaacgaaa atctgttcgc ttcattcatt gcccccacag 7981 tcctaggcct acccgccgca gtactgatca ttctatttcc ccctctattg atccccacct 8041 ccaaatatct catcaacaac cgactaatta ccacccaaca atgactaatc aaactaacct 8101 caaaacaaat gatagccata cacaacacta aaggacgaac ctgatctctt atactagtat 8161 ccttaatcat ttttattgcc acaactaacc tcctcggact cctgcctcac tcatttacac 8221 caaccaccca actatctata aacctagcca tggccatccc cttatgagcg ggcgcagtga 8281 ttataggctt tcgctctaag attaaaaatg ccctagccca cttcttacca caaggcacac 8341 ctacacccct tatccccata ctagttatta tcgaaaccat cagcctactc attcaaccaa 8401 tagccctggc cgtacgccta accgctaaca ttactgcagg ccacctactc atgcatctaa 8461 ttggaagcgc caccctagca atatcaacca ttaaccttcc ctctacactt atcatcttca 8521 caattctaat tctactgact atcctagaaa tcgctgtcgc cttaatccaa gcctacgttt 8581 tcacacttct agtaagcctc tacctgcacg acaacacata atgacccacc aatcacatgc 8641 ctatcatata gtaaaaccca gcccatgacc cctaacaggg gccctctcag ccctcctaat 8701 gacctccggc ctagccatgt gatttcactt ccactccata acgctcctca tactaggcct 8761 gctaaccaac acactaacca tataccaatg atggcgcgat gtaacacgag aaagcacata 8821 ccaaggccac cacacaccac ctgtccaaaa aggccttcga tacgggataa tcctatttat 8881 tacctcagaa gtttttttct tcgcaggatt tttctgagcc ttttaccact ccagcctagc 8941 ccctaccccc caactaggag ggcactggcc cccaacaggc atcaccccgc taaatcccct 9001 agaagtccca ctcctaaaca catccgtatt actcgcatca ggagtatcaa tcacctgagc 9061 tcaccatagt ctaatagaaa acaaccgaaa ccaaataatt caagcactgc ttattacaat 9121 tttactgggt ctctatttta ccctcctaca agcctcagag tacttcgaat ctcccttcac 9181 catttccgac ggcatctacg gctcaacatt ttttgtagcc acaggcttcc atggacttca 9241 cgtcattatt ggctcaactt tcctcactat ctgcttcatc cgccaactaa tatttcactt 9301 tacatccaaa catcactttg gcttcgaagc cgccgcctga tactggcatt ttgtagatgt 9361 ggtttgacta tttctgtatg tctccatcta ttgatgaggg tcttactctt ttagtataaa 9421 tagtaccgtt aacttccaat taactagttt tgacaacatt caaaaaagag taataaactt 9481 cgccttaatt ttaataatca acaccctcct agccttacta ctaataatta ttacattttg 9541 actaccacaa ctcaacggct acatagaaaa atccacccct tacgagtgcg gcttcgaccc 9601 tatatccccc gcccgcgtcc ctttctccat aaaattcttc ttagtagcta ttaccttctt 9661 attatttgat ctagaaattg ccctcctttt acccctacca tgagccctac aaacaactaa 9721 cctgccacta atagttatgt catccctctt attaatcatc atcctagccc taagtctggc 9781 ctatgagtga ctacaaaaag gattagactg agccgaattg gtatatagtt taaacaaaac 9841 gaatgatttc gactcattaa attatgataa tcatatttac caaatgcccc tcatttacat 9901 aaatattata ctagcattta ccatctcact tctaggaata ctagtatatc gctcacacct 9961 catatcctcc ctactatgcc tagaaggaat aatactatcg ctattcatta tagctactct 10021 cataaccctc aacacccact ccctcttagc caatattgtg cctattgcca tactagtttt 10081 tgccgcctgc gaagcagcgg taggcctagc cctactagtc tcaatctcca acacatatgg 10141 cctagactac gtacataacc taaacctact ccaatgctaa aactaatcgt cccaacaatt 10201 atattactac cactgacatg actctccaaa aaacacataa tttgaatcaa cacaaccacc 10261 cacagcctaa ttattagcat catcccccta ctatttttta accaaatcaa caacaaccta 10321 tttagctgct ccccaacctt ttcctccgac cccctaacaa cccccctcct aatactaact 10381 acctgactcc tacccctgac aatcatggca agccaacgcc acttatccag tgaaccacta 10441 tcacgaaaaa aactctacct ctctatacta atctccctac aaatctcctt aattataaca 10501 ttcacagcca cagaactaat catattttat atcttcttcg aaaccacact tatccccacc 10561 ttggctatca tcacccgatg aggcagccaa ccagaacgcc tgaacgcagg cacatacttc 10621 ctattctaca ccctagtagg ctcccttccc ctactcatcg cactaattta cactcacaac 10681 accctaggct cactaaacat tctactactc actctcactg cccaagaact atcaaactcc 10741 tgagccaaca acttaatatg actagcttac acaatagctt ttatagtaaa gatacctctt 10801 tacggactcc acttatgact ccctaaagcc catgtcgaag cccccatcgc tgggtcaata 10861 gtacttgccg cagtactctt aaaactaggc ggctatggta taatacgcct cacactcatt 10921 ctcaaccccc tgacaaaaca catagcctac cccttccttg tactatccct atgaggcata 10981 attataacaa gctccatctg cctacgacaa acagacctaa aatcgctcat tgcatactct 11041 tcaatcagcc acatggccct cgtagtaaca gccattctca tccaaacccc ctgaagcttc 11101 accggcgcag tcattctcat aatcgcccac ggacttacat cctcattact attctgccta 11161 gcaaactcaa actacgaacg cactcacagt cgcatcataa tcctctctca aggacttcaa 11221 actctactcc cactaatagc tttttgatga cttctagcaa gcctcgctaa cctcgcctta 11281 ccccccacta ttaacctact gggagaactc tctgtgctag taaccacatt ctcctgatca 11341 aatatcactc tcctacttac aggactcaac atactagtca cagccctata ctccctctac 11401 atatttacca caacacaatg aggctcactc acccaccaca ttaacaacat aaaaccctca 11461 ttcacacgag aaaacaccct catgttcata cacctatccc ccattctcct cctatccctc 11521 aaccccgaca tcattaccgg gttttcctct tgtaaatata gtttaaccaa aacatcagat 11581 tgtgaatctg acaacagagg cttacgaccc cttatttacc gagaaagctc acaagaactg 11641 ctaactcatg cccccatgtc taacaacatg gctttctcaa cttttaaagg ataacagcta 11701 tccattggtc ttaggcccca aaaattttgg tgcaactcca aataaaagta ataaccatgc 11761 acactactat aaccacccta accctgactt ccctaattcc ccccatcctt accaccctcg 11821 ttaaccctaa caaaaaaaac tcataccccc attatgtaaa atccattgtc gcatccacct 11881 ttattatcag tctcttcccc acaacaatat tcatgtgcct agaccaagaa gttattatct 11941 cgaactgaca ctgagccaca acccaaacaa cccagctctc cctaagcttc aaactagact 12001 acttctccat aatattcatc cctgtagcat tgttcgttac atggtccatc atagaattct 12061 cactgtgata tataaactca gacccaaaca ttaatcagtt cttcaaatat ctactcattt 12121 tcctaattac catgctaatc ttagttaccg ctaacaacct attccaactg ttcatcggct 12181 gagagggcgt aggaattata tccttcttgc tcatcagttg atgatacgcc cgagcagatg 12241 ccaacacagc agccattcaa gcaatcctat acaaccgtat cggcgatatc ggtttcatcc 12301 tcgccttagc atgatttatc ctacactcca actcatgaga cccacaacaa atagcccttc 12361 taaacgctaa tccaagcctc cccccactac taggcctcct cctagcagca gcaggcaaat 12421 cagcccaatt aggtctccac ccctgactcc cctcagccat agaaggcccc accccagtct 12481 cagccctact ccactcaagc actatagttg tagcaggagt cttcttactc atccgcttcc 12541 accccctagc agaaaatagc ccactaatcc aaactctaac actatgctta ggcgctatca 12601 ccactctgtt cgcagcagtc tgcgccctta cacaaaatga catcaaaaaa atcgtagcct 12661 tctccacttc aagtcaacta ggactcatag tagttacaat cggcatcaac caaccacacc 12721 tagcattcct gcacatctgt acccacgcct tcttcaaagc catactattt atgtgctccg 12781 ggtccatcat ccacaacctt aacaatgaac aagatattcg aaaaatagga ggactactca 12841 aaaccatacc tctcacttca acctccctca ccattggcag cctagcatta gcaggaatac 12901 ctttcctcac aggtttctat tccaaagacc acatcatcga aaccgcaaac atatcataca 12961 caaacgcctg agccctatct attactctca tcgctacctc cctgacaagc gcctatagca 13021 ctcgaataat tcttctcacc ctaacaggtc aacctcgctt ccctaccctt actaacatta 13081 acgaaaataa ccccacccta ctaaacccca ttaaacgcct ggcagccgga agcctattcg 13141 caggatttct cattactaac aacatttccc ccgcatcccc cttccaaaca acaatccccc 13201 tctacctaaa actcacagcc ctcgctgtca ctttcctagg acttctaaca gccctagacc 13261 tcaactacct aaccaacaaa cttaaaataa aatccccact atgcacattt tatttctcca 13321 acatactcgg attctaccct agcatcacac accgcacaat cccctatcta ggccttctta 13381 cgagccaaaa cctgccccta ctcctcctag acctaacctg actagaaaag ctattaccta 13441 aaacaatttc acagcaccaa atctccacct ccatcatcac ctcaacccaa aaaggcataa 13501 ttaaacttta cttcctctct ttcttcttcc cactcatcct aaccctactc ctaatcacat 13561 aacctattcc cccgagcaat ctcaattaca atatatacac caacaaacaa tgttcaacca 13621 gtaactacta ctaatcaacg cccataatca tacaaagccc ccgcaccaat aggatcctcc 13681 cgaatcaacc ctgacccctc tccttcataa attattcagc ttcctacact attaaagttt 13741 accacaacca ccaccccatc atactctttc acccacagca ccaatcctac ctccatcgct 13801 aaccccacta aaacactcac caagacctca acccctgacc cccatgcctc aggatactcc 13861 tcaatagcca tcgctgtagt atatccaaag acaaccatca ttccccctaa ataaattaaa 13921 aaaactatta aacccatata acctccccca aaattcagaa taataacaca cccgaccaca 13981 ccgctaacaa tcaatactaa acccccataa ataggagaag gcttagaaga aaaccccaca 14041 aaccccatta ctaaacccac actcaacaga aacaaagcat acatcattat tctcgcacgg 14101 actacaacca cgaccaatga tatgaaaaac catcgttgta tttcaactac aagaacacca 14161 atgaccccaa tacgcaaaat taacccccta ataaaattaa ttaaccactc attcatcgac 14221 ctccccaccc catccaacat ctccgcatga tgaaacttcg gctcactcct tggcgcctgc 14281 ctgatcctcc aaatcaccac aggactattc ctagccatgc actactcacc agacgcctca 14341 accgcctttt catcaatcgc ccacatcact cgagacgtaa attatggctg aatcatccgc 14401 taccttcacg ccaatggcgc ctcaatattc tttatctgcc tcttcctaca catcgggcga 14461 ggcctatatt acggatcatt tctctactca gaaacctgaa acatcggcat tatcctcctg 14521 cttgcaacta tagcaacagc cttcataggt tatgtcctcc cgtgaggcca aatatcattc 14581 tgaggggcca cagtaattac aaacttacta tccgccatcc catacattgg gacagaccta 14641 gttcaatgaa tctgaggagg ctactcagta gacagtccca ccctcacacg attctttacc 14701 tttcacttca tcttgccctt cattattgca accctagcag cactccacct cctattcttg 14761 cacgaaacgg gatcaaacaa ccccctagga atcacctccc attccgataa aatcaccttc 14821 cacccttact acacaatcaa agacaccctc ggcttacttc tcttccttct ctccttaatg 14881 acattaacac tattctcacc agacctccta ggcgacccag acaattatac cctagccaac 14941 cccttaaaca cccctcccca catcaagccc gaatgatatt tcctattcgc ctacacaatt 15001 ctccgatccg tccctaacaa actaggaggc gtccttgccc tattactatc catcctcatc 15061 ctagcaataa tccccatcct ccatatatcc aaacaacaaa gcataatatt tcgcccacta 15121 agccaatcac tttattgact cctagccgca gacctcctca ttctaacctg aatcggagga 15181 caaccagtaa gctacccttt taccatcatt ggacaagtag catccgtact atacttcaca 15241 acaatcctaa tcctaatacc aactatctcc ctaattgaaa acaaaatact caaatgggcc 15301 tgtccttgta gtataaacta atacaccagt cttgtaaacc ggagatgaaa acctttttcc 15361 aaggacaaat cagagaaaaa gtctttaact ccaccattag cacccaaagc taagattcta 15421 atttaaacta ttctctgttc tttcatgggg aagcagattt gggtaccacc caagtattga 15481 ctcacccatc aacaaccgct atgtatttcg tacattactg ccagccacca tgaatattgt 15541 acggtaccat aaatacttga ctacctgtag tacataaaaa cccaacccac atcaaaatcc 15601 tacccccatg cttacaagca agtacagcaa tcaaccttca actgtcacac atcaactgca 15661 actccaaagc cacccctcac ccactaggat accaacaaac ctacccaccc ttaacagtac 15721 atagcacata aagtcattta ccgtacatag cacattacag tcaaatccct tctcgtcccc 15781 atggatgacc cccctcagat aggggtccct tgaccaccat cctccgtgaa atcaatatcc 15841 cgcacaagag tgctactctc ctcgctccgg gcccataaca cttgggggta gctaaagtga 15901 actgtatccg acatctggtt cctacttcag ggccataaag cctaaatagc ccacacgttc 15961 cccttaaata agacatcacg atggatcaca ggtctatcac cctattaacc actcacggga 16021 gctctccatg catttggtat tttcgtttgg ggggtatgca cgcgatagca tcgcgggccg 16081 ctggagccgg agcaccctat gtcgcagtat ctgtctttga ttcctgcctc atcccattat 16141 ttatcgcacc tacattcaat attacaggcg agcatactta ctaaagtgtg ttaattaatt 16201 aatgcttgta ggacataaca ataacaatta aatgtctgca cagccgcttt ccacacagac 16261 atcataacaa aaaatttcca ccaaaccccc cctcccccgc ttctggccac agcacttaaa 16321 cacatctctg ccaaacccca aaaacaaaga accctaacac cagcctaacc agatttcaaa 16381 ttttatcttt tggcggtatg cacttttaac agtcaccccc caactaacac attattttcc 16441 cctcccactc ccatactact aatctcatca atacaacccc cgcccatcct acccagcaca 16501 cacacaccgc tgctaacccc ataccccgaa ccaaccaaac cccaaagaca ccccccaca // LOCUS HUMMTALD 6074 bp DNA PRI 15-APR-1996 DEFINITION Human mitochondrial aldehyde dehydrogenase x gene, complete cds. ACCESSION M63967 NID g337184 KEYWORDS aldehyde dehydrogenase. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6074) AUTHORS Hsu,L.C. and Chang,W.C. TITLE Cloning and characterization of a new functional human aldehyde dehydrogenase gene JOURNAL J. Biol. Chem. 266 (19), 12257-12265 (1991) MEDLINE 91286241 FEATURES Location/Qualifiers source 1..6074 /organism="Homo sapiens" /db_xref="taxon:9606" exon 1301..1317 exon 2257..3966 CDS 2266..3819 /codon_start=1 /product="aldehyde dehydrogenase" /db_xref="PID:g1263008" /translation="MLRFLAPRLLSLQGRTALYSSAAALPSPILNPDIPYNQLFINNE WQDAVSKKTFPTVNPTTGEVIGHVAEGDRADVDRAVKAAREAFRLGSPWRRMDASERG RLLNLLADLVERDRVYLASLETLDNGKPFQESYALDLDEVIKVYRYFAGWADKWHGKT IPMHGQHFCFTRHEPVGVCGQIIPWNFPLVMQGWKLAPALATGNTVVMKVAEQTPLSA LYLASLIKEAGFPPGVVNIITGYGPTAGAAIAQHMDVDKVAFTGSTEVGHLIQKAAGD SNLKRVTLELGGKSPSIVLADADMEHAVEQCHEALFFNMGQCCCAGSRTFVEESIYNE FLERTVEKAKQRKVGNPFELDTQQGPQVDKEQFERVLGYIQLGQKEGAKLLCGGERFG ERGFFIKPTVFGGVQDDMRIAKEEIFGPVQPLFKFKKIEEVVERANNTRYGLAAAVFT RDLDKAMYFTQALQAGTVWVNTYNIVTCHTPFGGFKESGNGRELGEDGLKAYTEVKTV TIKVPQKNS" mutation 2448 mutation 2552 mutation 2585 repeat_region 4803..5137 repeat_region 5613..5924 BASE COUNT 1475 a 1505 c 1594 g 1500 t ORIGIN 1 ctgcagcctc cattcaaaga gcaggaagcc aggtcaagga gcggctacag ccacagcggc 61 cggtccacag ccagaaagct ggagaaggga caaggaatgc tgccgcagaa catctcccat 121 ttctgctatc ctgctagtgc caccctacag gggcaggaca ataccctcca atgcgtgccc 181 gccttccaca tccggcagag tactactaat tggcagcacc tagtccacat ccagaactct 241 aactgcaaat gaggctggga aacatggcgg ttagctttgg gacctcccta gccagaagga 301 aggtgcagcg agggtggagc gagtgagatg catgttcagt ggtccatctt ggcaaccttt 361 atagcaacaa aatggagaca attaaatgtc caacagttta cccctacaac agaattctat 421 gccaccttgg agaggatgat gtcatctgta attagtgata ggagaaaact tcaaaataca 481 gcgctaagtt tttaaatttt ttttaaagca gcacaacaga ggttactata tgattttatt 541 tttattaaaa taattttgaa agatgcagag gaaacaatac tagcaggacg aacaccgtaa 601 taatggctac cgtaggtgaa gagattacag gtgatatttt tcttcttttt tcatcctttc 661 ctgtgttttc caagctctac tgcatacctt ctttttgtcc ttaggacaaa aatgtacttt 721 gaaaaatgcc caaatataaa atcttcttga aggaactgaa gaaaggagca atgaaagtgg 781 aggacagaaa ttagtgaatc ctggacaaaa aaagtgaggg agaggcgagg cagagaaggg 841 gttaagggga gtctcaagga tggcagtcga ttaagcagcg tcttgcagcg gggagaaatg 901 caaagtgtag ttggggatcg gtctccgaaa actttgctgt gtgacctggg gaagcccctg 961 cccctctctg ggtatctctg tttccgctcc caaaccagtg ccttaggacc ctagatctga 1021 gcaacgtgga cactgggctt gcggagaccc gagggaggga gcgctgaagc ggtgcgggct 1081 gccgggggta gaggggcggg gaccgggaaa gaccgggctg ggcgagggag gagcggcctt 1141 ggccggcgac aggacgtagg agcgccccag gcgcagcgga gcctcattgg ccgctgagcc 1201 ccgggctgcg cggaggcggg acctgcggcc agccctgggc ggccatgtgg acagagctgg 1261 gagggccgga accagaaccc aagcgtgatc ctgaaccgga gcccgagcct gctgcaggta 1321 actaacgctg gtctcccctc ggctccctcg ggaagccgca gctcctggct ccgcgtgggg 1381 ggctttcctc tctgggccgc gtcgacactc ttcagagttg tagcttttct tctcagctgc 1441 ctcctgactt gctgcacctc tgggaaagct gaaaagggat tctgagacct gtggttgggg 1501 gatgctgggc taggtcaatt tctgaggcac ggccagttct gatgtcctga gtggagcttg 1561 ccagggtttt tatatgcaag acatcattca acataacccc atgaggcatt tcacatatgg 1621 gggaactgag gcagagaggt gaaatgaagc aagtccctct cagagaacaa ccatctcacc 1681 gtccctttga tttctactca agtcttaggt caattcttag acagcctttg tggcaatttc 1741 agtgttttct cagtgtatca ttcagtactg tgtgttgagc cgctgctctg tagcaggcat 1801 acctctaggc tcggggacat agcagtgaat ggaacagaaa aataagttcc tgtgctcatg 1861 gagcttgcat tttattgggg ggagacagac aaacatataa gcctgtaaag atagtatttg 1921 ggatgtgaga tagagaagga gtaaagcagg gtagcgggga tagagagtgt tggggtggga 1981 agggttgcaa ttttttaaag ggtggtcagg gaaggccttg ctgagacagt ggcttttgag 2041 acccagcaga ggtgagggag tgagctgagg ataggcggga cttgatggag ttggcccaga 2101 gagttcatca gggccctcac agctcttaca gtctgtgttt ttagaggtga cagtccttta 2161 tgctggaatc ttgaaatgtt tgagctggtg ggcccttggg taccgccacc tgccttctcc 2221 cacctgttca ccctggtttc ttttgtccct ctccagagtg tcagcatgct gcgcttcctg 2281 gcaccccggc tgcttagcct ccagggcagg accgccctct actcctcggc agcagccctc 2341 ccaagcccca ttctgaaccc agacatcccc tacaaccagc tgttcatcaa caatgaatgg 2401 caagatgcag tcagcaagaa gaccttcccg acggtcaacc ctaccaccgg ggaggtcatc 2461 gggcacgtgg ctgaaggtga ccgggctgat gtggatcggg ccgtgaaagc agcccgggaa 2521 gccttccgcc tggggtcccc atggcgccgg atggatgcct ctgagcgggg ccggctgctg 2581 aacctcctgg cagacctagt ggagcgggat cgagtctact tggcctcact cgagaccttg 2641 gacaatggga agcctttcca agagtcttac gccttggact tggatgaggt catcaaggtg 2701 tatcggtact ttgctggctg ggctgacaag tggcatggca agaccatccc catgcatggc 2761 cagcatttct gcttcacccg gcatgagccc gttggtgtct gtggccagat catcccgtgg 2821 aacttcccct tggtcatgca gggttggaaa cttgccccgg cactcgccac aggcaacact 2881 gtggttatga aggtggcaga gcagaccccc ctctctgccc tgtatttggc ctccctcatc 2941 aaggaggcag gctttccccc tggggtggtg aacatcatca cggggtatgg cccaacagca 3001 ggtgcggcca tcgcccagca catggatgtt gacaaagttg ccttcaccgg ttccaccgag 3061 gtgggccacc tgatccagaa agcagctggc gattccaacc tcaagagagt caccctggag 3121 ctgggtggta agagccccag catcgtgctg gccgatgctg acatggagca tgccgtggag 3181 cagtgccacg aagccctgtt cttcaacatg ggccagtgct gctgtgctgg ctcccggacc 3241 ttcgtggaag aatccatcta caatgagttt ctcgagagaa ccgtggagaa agcaaagcag 3301 aggaaagtgg ggaacccctt tgagctggac acccagcagg ggcctcaggt ggacaaggag 3361 cagtttgaac gagtcctagg ctacatccag cttggccaga aggagggcgc aaaactcctc 3421 tgtggcggag agcgtttcgg ggagcgtggt ttcttcatca agcctactgt ctttggtggc 3481 gtgcaggatg acatgagaat tgccaaagag gagatctttg ggcctgtgca gcccctgttc 3541 aagttcaaga agattgagga ggtggttgag agggccaaca acaccaggta tggcctggct 3601 gcggctgtgt tcacccggga tctggacaag gccatgtact tcacccaggc actccaggcc 3661 gggaccgtgt gggtaaacac ctacaacatc gtcacctgcc acacgccatt tggagggttt 3721 aaggaatctg gaaacgggag ggagctgggt gaggatgggc ttaaggccta cacagaggta 3781 aagacggtca ccatcaaggt tcctcagaag aactcgtaag gcagctgtca gggaggccca 3841 gtcacagtcc agcaattcca caaccacctt gacgaatgct tgccaagctg ttttaaagcc 3901 aagaacaccc tttctttgtt ccaaattaac tcttagaaga aaccccacaa ataaagcaat 3961 tcaatcaagg ctgttctatt taaatcagag atggggacca ggctcagagt tctacctatc 4021 taacccccaa ccacagcccc cttggtggcc catgagttgc ttccatgaaa tcttaggagt 4081 ctctggagga cagattaaaa accagtgatc tgtaatttgt agctcttcct gctgatccaa 4141 ggactttccc atgggtgcgc ttgatggttt agtggatcga ctcaactcag aacacaagct 4201 tggaaagtgt taggggtttg aactaggtgg atactaaatc tcggccccac tcttcattgg 4261 cttaacctaa aaaccagagg tgcttttcct tgtctgtgtg ccagttgctg gctgttttag 4321 ttgcttgccc ttcattttgc tactgatttt ccttaatttg tgggaaggag taggcaaaga 4381 atatgcttac atgattacac ctgtaaagta agcccaaaca tcccaaatgt ccgtcaactg 4441 atgagtggat taataaaatg tttccatgga atattccttg gattactcag ccataaaaag 4501 gaatgaagta ctgacacatg ctgtgacatc agtgaaccct gaaaacatcc ttctcagtga 4561 aacaagccag agagatgtac aaggctacag actgtatgat tccatttata tgaaatatac 4621 atactaggca aatccatgga gatggaacat agattagtgg ttgccagcgg atagaggagt 4681 aattgttagt gggcatggga tttgtttttg ggaggttttg aaaatgttct ggagttgaac 4741 aatagtaatg gttgcatgat ttggtaaaaa tactaaaaac tatggaattg tttaattatg 4801 atatgaatta atttctttct ttttttcttt tttttttttt ttttttgaca cggagtctca 4861 ctctgtcgcc caggctggag tgcagtggtg tgatctcgtc tcactcgaac ctccgcctcc 4921 cagattcaag cgattctcct gtctcagcct cctgagtagc tgggattata ggtacatgcc 4981 atcacacctg gctaattttt gtatttttag tagagatgag gtttcaccat tttggccagg 5041 ctgatcttga actcctgacc tcaggtgatc cacccgcctc agcctcccaa agtgctggga 5101 ttacaggtgg gagccacaac acccggccat gaattaactc cgttaaaaaa taaacgtata 5161 cattctgtga gcaaatccat tttgtctcca tattgtcctg ctgctgaaca ttttatagag 5221 tgtgggggat ggaaggaccc aggtggtcta gtcgaccagt caggtgcaaa atcccctccc 5281 caatgtttct gtttttgttt tctcttggat gggtgacaaa gtgcaactgt aagacccgta 5341 gagaaaactc tggttcctgc tcagaatggg cccatcttgt tggactcgtt tccacagccc 5401 cccacccctt ccaaaccata cccacccttc aagccaagcc tagcatgggg gccaccttga 5461 catctgggga tttccagagt ttcaaacctc agagttcata atgtttattg ttagtcttgc 5521 tacattcatt ccccaactga tgggagtgaa ggcaaatcca agccagccag gccacaatac 5581 agaagcagaa tgctgagcct ccctctctcg gtccatcttt ctagaccatg ctgcttacaa 5641 gccaccttac gttcaagtcc ccccattcct gtaaatgcac accccactca tctctgcttc 5701 ccaggctcaa gcaatcctcc cacctgagcc tcccaagtag ctgggactac aggcactcac 5761 caccatgcct ggctaatttt tttttttttt ttggtatttt ttgtagagat gaggtttcac 5821 catgtttccc aggcttgtct caaactcctg gtctcaagca accctcctgc cacggcctcc 5881 caaagtgctg gaattacagg catgagccac catgcctggc cccagtcaca tcctattgtc 5941 ttacctaggc aacatgatga ttttgtttat tatcataact tgggtttggc tggaaactgc 6001 tgcttgaggg ttcagagtct gtttggttcc ctcacctccc ttcctaggac ctcctttcta 6061 caccatttgt ctac // LOCUS HUMMTG8 2436 bp mRNA PRI 27-MAR-1996 DEFINITION Human mRNA for MTG8 protein, complete cds. ACCESSION D14289 NID g474987 KEYWORDS MTG8 protein. SOURCE Homo sapiens blood B-cell cDNA to mRNA, cell line Raji. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2436) AUTHORS Miyoshi,H. TITLE The t(8;21) translocation in acute myeloid leukemia results in production of an AML1-MTG8 fusion transcript JOURNAL Unpublished (1993) REFERENCE 2 (bases 1 to 2436) AUTHORS Kozu,T. TITLE Direct Submission JOURNAL Submitted (27-JAN-1993) to the DDBJ/EMBL/GenBank databases. Tomoko Kozu, Saitama Cancer Center Research Institute, Department of Immunology and Virology; 818 Komuro, Ina, Saitama 362, Japan (Tel:048-722-1111(ex.265), Fax:048-722-1739) COMMENT Submitted (27-JAN-1993) to DDBJ by: Tomoko Kozu Department of Immunology and Virology Saitama Cancer Center Research Institute 818 Komuro, Ina Saitama 362 Japan Phone: 048-722-1111x265 Fax: 048-722-1739. FEATURES Location/Qualifiers source 1..2436 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Raji" /cell_type="B-cell" /tissue_type="blood" CDS 298..2001 /codon_start=1 /product="MTG8 protein" /db_xref="PID:d1003756" /db_xref="PID:g474988" /translation="MPDSPVDVKTQSRLTPPTMPPPPTTQGAPRTSSFTPTTLTNGTS HSPTALNGAPSPPNGFSNGPSSSSSSSLANQQLPPACGARQLSKLKRFLTTLQQFGND ISPEIGERVRTLVLGLVNSTLTIEEFHSKLQEATNFPLRPFVIPFLKANLPLLQRELL HCARLAKQNPAQYLAQHEQLLLDASTTSPVDSSELLLDVNENGKRRTPDRTKENGFDR EPLHSEHPSKRPCTISPGQRYSPNNGLSYQPNGLPHPTPPPPQHYRLDDMAIAHHYRD SYRHPSHRDLRDRNRPMGLHGTRQEEMIDHRLTDREWAEEWKHLDHLLNCIMDMVEKT RRSLTVLRRCQEADREELNYWIRRYSDAEDLKKGGGSSSSHSRQQSPVNPDPVALDAH REFLHRPASGYVPEEIWKKAEEAVNEVKRQAMTELQKAVSEAERKAHDMITTERAKME RTVAEAKRQAAEDALAVINQQEDSSESCWNCGRKASETCSGCNTARYCGSFCQHKDWE KHHHICGQTLQAQQQGDTPAVSSSVTPNSGAGSPMDTPPAATPRSTTPGTPSTIETTP R" BASE COUNT 722 a 638 c 556 g 520 t ORIGIN Chromosome 8. 1 tataacacta ccaaggtagc atctgcctaa gccagaaata acaataggat ataatataac 61 ctagttacaa atgaggtcct gccatctgtg ctataccttc ttaagtgggg tttggatgat 121 aaaatcctga tttaaacgtc agtagataag tgtattttat tgatagaaga tgttgaattt 181 cttctgttca cttgctttta aaaaagataa accccacttg aaaaactgag gtgcttaagg 241 agtaaaataa tatgttcctg gtggcatcct ccagatcgta ctgagaagca ctccacaatg 301 ccagactcac ctgtggatgt gaagacgcaa tctaggctga ctcctccaac aatgccacct 361 cccccaacta ctcaaggagc tccaagaacc agttcattta caccgacaac gttaactaat 421 ggcacgagcc attctcctac agccttgaat ggcgccccct caccacccaa tggcttcagc 481 aatgggcctt cctcttcttc ctcctcctct ctggctaatc aacagctgcc cccagcctgt 541 ggtgccaggc aactcagcaa gctgaaaagg ttccttacta ccctgcagca gtttggcaat 601 gacatttcac ccgagatagg agaaagagtt cgcaccctcg ttctgggact agtgaactcc 661 actttgacaa ttgaagaatt tcattccaaa ctgcaagaag ctactaactt cccactgaga 721 ccttttgtca tcccattttt gaaggccaac ttgcccctgc tgcagcgtga gctcctccac 781 tgcgcaagac tggccaaaca gaaccctgcc cagtacctcg cccagcatga acagctgctt 841 ctggatgcca gcaccacctc acctgttgac tcctcagagc tgcttctcga tgtgaacgaa 901 aacgggaaga ggcgaactcc agacagaacc aaagaaaatg gctttgacag agagcctttg 961 cactcagaac atccaagcaa gcgaccatgc actattagcc caggccagcg gtacagtcca 1021 aataacggct tatcctacca gcccaatggc ctgcctcacc ctaccccacc tccacctcag 1081 cattaccgtt tggatgatat ggccattgcc caccactaca gggactccta tcgacacccc 1141 agccacaggg acctcaggga cagaaacaga cctatggggt tgcatggcac acgtcaagaa 1201 gaaatgattg atcacagact aacagacaga gaatgggcag aagagtggaa acatcttgac 1261 catctgttaa actgcataat ggacatggta gaaaaaacaa ggcgatctct caccgtacta 1321 aggcggtgtc aagaagcaga ccgggaagaa ttgaattact ggatccggcg gtacagtgac 1381 gccgaggact taaaaaaagg tggcggcagt agcagcagcc actctaggca gcagagtccc 1441 gtcaacccag acccagttgc actagacgcg catcgggaat tccttcacag gcctgcgtct 1501 ggatacgtgc cagaggagat ctggaagaaa gctgaggagg ccgtcaatga ggtgaagcgc 1561 caggcgatga cggagctgca gaaggccgtg tctgaggcgg agcggaaagc ccacgacatg 1621 atcacaacag agagggccaa gatggagcgc acggtcgccg aggccaaacg gcaggcggcg 1681 gaggacgcac tggcagttat caatcagcag gaggattcaa gcgagagttg ctggaattgt 1741 ggccgtaaag cgagtgaaac ctgcagtggc tgtaacacag cccgatactg tggctcattt 1801 tgccagcaca aagactggga gaagcaccat cacatctgtg gacagaccct gcaggcccag 1861 cagcagggag acacacctgc agtcagctcc tctgtcacgc ccaacagcgg ggctgggagc 1921 ccgatggaca caccaccagc agccactccg aggtcaacca ccccgggaac cccttccacc 1981 atagagacaa cccctcgcta gacgtgaact cagaactgtc ggaggaaaga caacacaacc 2041 aacgcgaaac caattcctca tcctcagatg ctcaaagttg ttttttttgt ttgtttgttt 2101 attagatgaa ttatcctatt tcagtacttc agcaagagag aacctaactg tatcttgagg 2161 tggtagtaaa acacagaggg ccagtaacgg gtcgtaatga cttattgtgg ataacaaaga 2221 tatcttttct ttagagaact gaaaagagag cagagaatat aacatgaaat gatagatttg 2281 acctcctccc tgttattttc aagtagctgg gattttaaac tagatgacct cattaaccga 2341 tgctttacca aacagcaaac caagagattg ctaattgctg ttgaaagcaa aaatgctaat 2401 attaaaagtc acaatgttct ttatatacaa taatgg // LOCUS HUMMTINFB 2481 bp mRNA PRI 22-MAY-1995 DEFINITION Human nuclear-encoded mitochondrial initiation factor 2 mRNA, complete cds. ACCESSION L34600 NID g609491 KEYWORDS initiation factor; initiation factor 2; mitochondrial protein; protein synthesis. SOURCE Homo sapiens liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2481) AUTHORS Ma,L. and Spremulli,L.L. TITLE Cloning and sequence analysis of the human mitochondrial translational initiation factor 2 cDNA JOURNAL J. Biol. Chem. 270 (4), 1859-1865 (1995) MEDLINE 95130568 FEATURES Location/Qualifiers source 1..2481 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" CDS 271..2454 /note="initiation factor 2 for mitochondrial protein synthesis" /codon_start=1 /product="initiation factor 2" /db_xref="PID:g609492" /translation="MNQKLLKLENLLRFHTIYRQLHSLCQRRALRQWRHGFSSAYPVW TAQLCAWPWPTDVLNGAALSQYRLLVTKKEEGPWKSQLSSTKSKKVVEVWIGMTIEEL ARAMEKNTDYVYEALLNTDIDIDSLEADSHLDEVWIKEVITKAGMKLKWSKLKQDKVR KNKDAVRRPQADPALLTPRSPVVTIMGHVDHGKTTLLDKFRKTQVAAVETGGITQHIG AFLVSLPSGEKITFLDTPGHAAFSAMRARGAQVTDIVVLVVAADDGVMKQTVESIQHA KDAQVPIILAVNKCDKAEADPEKVKKELLAYDVVCEDYGGDVQAVPVSALTGDNLMAL AEATVALAEMLELKADPNGPVEGTVIESFTDKGRGLVTTAIIQRGTLRKGSVLVAGKC WAKVRLMFDENGKTIDEAYPSMPVGITGWRDLPSAGEEILEVESEPRAREVVDWRKYE QEQEKGQEDLKIIEEKRKEHKEAHQKAREKYGHLLWKKRSILRFLERKEQIPLKPKEK RERDSNVLSVIIKGDVDGSVEAILNIIDTYDASHECELELVHFGVGDISANDVNLAET FDGVIYGFNVNAGNVIQQSAAKKGVKIKLHKIIYRLVEDLQEELSSRLPCAVEEHPVG EASILATFSVTEGKKKVPVAGCRVQKGQLEKQKKFKLTRNGHVIWKGSLTSLKHHKDD ISIVKTGMDCGLSLDEDNMEFQVGDRIVCYEEKQIQAKTSWDPGF" BASE COUNT 836 a 419 c 605 g 621 t ORIGIN 1 ggcacgagca gaatccaggg gcccggggct gtagattcct tgacaaggat atcctagcgg 61 cgaaacaaca ccgtactggg agtcagaacg tctgggttct agtcttgact gccattaact 121 agcggtatga cattggagaa gcttttttga cccttctgga tttccgtttc cttttctgta 181 aaatgaggag cttggaagat ccggaaaatg aggcccatag gaaacaagtg acttgctgag 241 tccagataac actgactgtc agagagaaac atgaaccaga agctactgaa gttggagaac 301 ttgctacgat ttcacactat ttataggcaa ctgcacagtc tgtgtcaaag aagagcatta 361 agacagtgga ggcatgggtt ttcatctgct taccctgtgt ggacagctca actgtgtgcc 421 tggccctggc caacagatgt gctcaatggg gctgctttat ctcagtatag gcttctagta 481 acaaaaaagg aagaaggacc atggaaatct cagttatctt caacaaaatc taaaaaggtg 541 gtagaagtat ggattggaat gactattgag gaactggcca gggcaatgga aaaaaacaca 601 gattatgtat atgaagcttt attgaacact gatattgaca tagattcact ggaagcagac 661 tcacatttag atgaagtctg gatcaaagaa gtgataacga aggcagggat gaagttaaag 721 tggagtaaat taaaacagga caaagtcaga aaaaataaag atgctgtaag aaggccccag 781 gcagatccag ctttattaac cccaaggtcc ccagttgtta ctataatggg ccatgttgat 841 cacgggaaaa cgacattact tgacaaattt cgaaaaactc aagtggcagc agtggaaact 901 ggaggcatca ctcagcacat tggtgccttt cttgtctctc tgccttctgg ggaaaagata 961 acttttcttg atactccagg acatgctgct ttctcagcaa tgagagccag aggtgctcag 1021 gtcactgaca ttgtcgtatt ggttgtagct gcagatgatg gagtgatgaa acaaactgta 1081 gaatctattc agcatgccaa agatgcacag gttcctatta tccttgccgt aaataaatgt 1141 gacaaagctg aggctgatcc tgagaaagtg aaaaaagagc tgctggctta cgatgtggta 1201 tgtgaagatt atggaggtga tgttcaagca gtgcctgtct ccgcacttac gggcgataat 1261 ctgatggctt tggcagaagc aacagttgct cttgcagaaa tgttagaatt gaaagcagat 1321 cccaatggtc cagtggaagg aacagtaata gagtctttca cagacaaagg aagaggtctt 1381 gttactacag ctataattca aagaggaact ttaagaaaag gctctgttct ggttgctgga 1441 aaatgttggg caaaagtacg cttaatgttt gatgaaaatg gaaaaacaat tgatgaggcc 1501 tatcccagca tgccagtggg aattacaggc tggagagacc ttccttctgc aggagaagaa 1561 attcttgaag tagaatctga gccaagggca cgtgaagttg ttgactggag gaaatatgaa 1621 caagaacagg agaaaggtca ggaggatctg aaaataatag aagaaaagcg aaaggaacac 1681 aaagaagcac atcagaaagc ccgtgagaag tatggccatc tactgtggaa gaagagatca 1741 attctacggt ttttagaaag aaaagaacaa atacccttaa agccaaaaga gaaaagggaa 1801 agagattcaa atgtactttc tgtgattatt aaaggtgatg ttgatggttc tgttgaggcc 1861 attttgaaca ttatagatac ctatgatgct tcacacgagt gtgaactaga attagtacat 1921 tttggagtgg gtgatataag tgcaaatgat gttaaccttg ctgaaacatt tgatggtgtt 1981 atatatggct ttaatgtgaa tgcaggcaat gttatccaac agtcagctgc aaaaaaagga 2041 gtaaaaatta aacttcacaa aataatttac cgtcttgttg aagatttgca agaggaactg 2101 agcagcagat taccctgtgc tgtggaagag cacccagtag gtgaggcatc tatactagct 2161 accttctctg taacagaagg gaagaaaaaa gttcctgtgg ctggctgcag agtccaaaag 2221 ggacagttag aaaaacaaaa aaaatttaaa ctaacccgta atggacatgt aatttggaag 2281 ggctcattaa cctcattgaa acaccataaa gatgacattt caattgtcaa aacgggaatg 2341 gattgtggtc tcagtttaga tgaagacaat atggaatttc aagtgggaga cagaattgtt 2401 tgttatgaag aaaagcaaat tcaagccaag acttcttggg atccaggatt ttaaaattac 2461 attaaaaatg taaataactc a // LOCUS HUMMTNUBA 827 bp mRNA PRI 14-SEP-1995 DEFINITION Human nuclear-encoded mitochondrial NADH-ubiquinone reductase 24Kd subunit mRNA, complete cds. ACCESSION M22538 M25484 NID g986883 KEYWORDS NADH ubiquinone reductase 24kd subunit; ubiquinone reductase. SOURCE Homo sapiens (clone: lambda-HumCI-24.1) (tissue library: HUT 78) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 827) AUTHORS Pilkington,S.J. and Walker,J.E. TITLE Mitochondrial NADH-ubiquinone reductase: complementary DNA sequences of import precursors of the bovine and human 24-kDa subunit JOURNAL Biochemistry 28 (8), 3257-3264 (1989) MEDLINE 89302922 COMMENT Draft entry and computer-readable sequence [1] kindly submitted by J.E.Walker 10-FEB-1989. FEATURES Location/Qualifiers source 1..827 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="lambda-HumCI-24.1" /tissue_lib="HUT 78" CDS 19..768 /EC_number="1.6.5.3" /note="precursor" /codon_start=1 /product="NADH-ubiquinone reductase" /db_xref="PID:g188852" /translation="MFFSAALRARAAGLTAHWGRHVRNLHKTAMQNGAGGALFVHRDT PENNPDTPFDFTPENYKRIEAIVKNYPEGHKAAAVLPVLDLAQRQNGWLPISAMNKVA EVLQVPPMRVYEVATFYTMYNRKPVGKYHIQVCTTTPCMLRNSDSILEAIQKKLGIKV GETTPDKLFTLIEVECLGACVNAPMVQINDNYYEDLTAKDIEEIIDELKAGKIPKPGP RSGRFSCEPAGGLTSLTEPPKGPGFGVQAGL" mat_peptide 19..765 /EC_number="1.6.5.3" /product="NADH-ubiquinone reductase" sig_peptide 19..114 /product="NADH-ubiquinone reductase" polyA_site 827 BASE COUNT 254 a 174 c 194 g 205 t ORIGIN 1 ggaacagtgt ggcccgccat gttcttctcc gcggcgctcc gggcccgggc ggctggcctc 61 accgcccact ggggaagaca tgtaaggaat ttgcataaga cagctatgca aaatggagct 121 ggaggagctt tatttgtgca cagagatact cctgagaata accctgatac tccatttgat 181 ttcacaccag aaaactataa gaggatagag gcaattgtaa aaaactatcc agaaggccat 241 aaagcagcag ctgttcttcc agtcctggat ttagcccaaa ggcagaatgg gtggttgccc 301 atctctgcta tgaacaaggt tgcagaagtt ttacaagtac ctccaatgag agtatatgaa 361 gtagcaactt tttatacaat gtataatcga aagccagttg gaaagtatca cattcaggtc 421 tgcactacta caccctgcat gcttcgaaac tctgacagca tactggaggc cattcagaaa 481 aagcttggaa taaaggttgg ggagactaca cctgacaaac ttttcactct tatagaagtg 541 gaatgtttag gggcctgtgt gaacgcacca atggttcaaa taaatgacaa ttactatgag 601 gatttgacag ctaaggatat tgaagaaatt attgatgagc tcaaggctgg caaaatccca 661 aaaccagggc caaggagtgg acgcttctct tgtgagccag ctggaggtct tacctctttg 721 actgaaccac ccaagggacc tggatttggt gtacaagcag gcctttaatt tatattgaac 781 tgtaaatatg tcactagaga aataaaatat ggacttccaa tctacgt // LOCUS HUMMTSSB 628 bp mRNA PRI 22-JUL-1993 DEFINITION Human mitochondrial specific single stranded DNA binding protein mRNA, complete cds. ACCESSION M94556 NID g188855 KEYWORDS single stranded DNA binding protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 628) AUTHORS Tiranti,V., Zeviani,M., Rocchi,M. and Di Donato,S. TITLE Cloning of human and rat cDNAs encoding the mitochondrial single-stranded-DNA binding protein, (SSB) JOURNAL Gene 126, 219-225 (1993) MEDLINE 93246247 FEATURES Location/Qualifiers source 1..628 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Hela" 5'UTR 1..78 sig_peptide 79..126 CDS 79..525 /standard_name="SSB" /note="possible binding domain bps. 328..368" /codon_start=1 /product="single stranded DNA binding protein" /db_xref="PID:g188856" /translation="MFRRPVLQVLRQFVRHESETTTSLVLERSLNRVHLLGRVGQDPV LRQVEGKNPVTIFSLATNEMWRSGDSEVYQLGDVSQKTTWHRISVFRPGLRDVAYQYV KKGSRIYLEGKIDYGEYMDKNNVRRQATTIIADNIIFLSDQTKEKE" mat_peptide 127..522 /product="single stranded DNA binding protein" exon 482..628 /note="last exon" 3'UTR 523..628 polyA_signal 615..621 polyA_site 628 BASE COUNT 205 a 107 c 152 g 164 t ORIGIN 1 cctgcgtggc tgggctgctc gggttagatc gtcaggaaaa gcctaaagat tagactgtaa 61 gaaaagaaaa tagaagccat gtttcgaaga cctgtattac aggtacttcg tcagtttgta 121 agacatgagt ccgaaacaac taccagtttg gttcttgaaa gatccctgaa tcgtgtgcac 181 ttacttgggc gagtgggtca ggaccctgtc ttgagacagg tggaaggaaa aaatccagtc 241 acaatatttt ctctagcaac taatgagatg tggcgatcag gggatagtga agtttaccaa 301 ctgggtgatg tcagtcaaaa gacaacatgg cacagaatat cagtattccg gccaggcctc 361 agagacgtgg catatcaata tgtgaaaaag gggtctcgaa tttatttgga agggaaaata 421 gactatggtg aatacatgga taaaaataat gtgaggcgac aagcaacaac aatcatagct 481 gataatatta tatttctgag tgaccagacg aaagagaagg agtagaaagg atgattcttc 541 tttggccatc atttggtaca gtctcatttc caagtcatgt ataatcttta tggcttccaa 601 ggacaagaat taaaatactc ttttacgt // LOCUS HUMMTTUF1M 1554 bp mRNA PRI 21-MAY-1996 DEFINITION Homo sapiens nuclear-encoded mitochondrial elongation factor Tu mRNA, complete cds. ACCESSION L38995 NID g704415 KEYWORDS elongation factor Tu; mitochondrial protein; protein synthesis. SOURCE Homo sapiens liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1554) AUTHORS Woriax,V.L., Burkhart,W. and Spremulli,L.L. TITLE Cloning, sequence analysis and expression of mammalian mitochondrial protein synthesis elongation factor Tu JOURNAL Biochim. Biophys. Acta, Gene Struct. Expr. 1264 (3), 347-356 (1995) MEDLINE 96138557 FEATURES Location/Qualifiers source 1..1554 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" CDS 8..1366 /codon_start=1 /product="elongation factor Tu" /db_xref="PID:g704416" /translation="MAAATLLRATPHFSGLAAGRTFLLQGLLRLLKAPALPLLCRGLA VEAKKTYVRDKPHVNVGTIGHVDHGKTTLTAAITKILAEGGGAKFKKYEEIDNAPEER ARGITINAAHVEYSTAARHYAHTDCPGHADYVKNMITGTAPLDGCILVVAANDGPMPQ TREHLLLARQIGVEHVVVYVNKADAVQDSEMVELVELEIRELLTEFGYKGEETPVIVG SALCALEGRDPELGLKSVQKLLDAVDTYIPVPARDLEKPFLLPVEAVYSVPGRGTVVT GTLERGILKKGDECELLGHSKNIRTVVTGIEMFHKSLERAEAGDNLGALVRGLKREDL RRGLVMVKPGSIKPHQKVEAQVYILSKEEGGRHKPFVSHFMPVMFSLTWNMACRIILP PEKELAMPGEDLKFNLILRQPMILEKGQRFTLRDGNRTIGTGLVTNTLAMTEEEKNIK WG" BASE COUNT 332 a 434 c 481 g 307 t ORIGIN 1 gaccacaatg gcggccgcca ccctgctgcg cgcgacgccc cacttcagcg gtctcgccgc 61 cggccggacc ttcctgctgc agggtctgtt gcggctgctg aaagccccgg cattgcctct 121 cttgtgccgc ggcctggccg tggaggccaa gaagacttac gtgcgcgaca agccacatgt 181 gaatgtgggt accatcggcc atgtggacca cgggaagacc acgctgactg cagccatcac 241 gaagattcta gctgagggag gtggggctaa gttcaagaag tacgaggaga ttgacaatgc 301 cccggaggag cgagctcggg gtatcaccat caatgcggct catgtggagt atagcactgc 361 cgcccgccac tacgcccaca cagactgccc gggtcatgca gattatgtta agaatatgat 421 cacaggcact gcacccctcg acggctgcat cctggtggta gcagccaatg acggccccat 481 gccccagacc cgagagcact tattactggc cagacagatt ggggtggagc atgtggtggt 541 gtatgtgaac aaggctgacg ctgtccagga ctctgagatg gtggaactgg tggaactgga 601 gatccgggag ctgctcaccg agtttggcta taaaggggag gagaccccag tcatcgtagg 661 ctctgctctc tgtgcccttg agggtcggga ccctgagtta ggcctgaagt ctgtgcagaa 721 gctactggat gctgtggaca cttacatccc agtgcccgcc cgggacctgg agaagccttt 781 cctgctgcct gtggaggcgg tgtactccgt ccctggccgt ggcaccgtgg tgacaggtac 841 actagagcgt ggcattttaa agaagggaga cgagtgtgag ctcctaggac atagcaagaa 901 catccgcact gtggtgacag gcattgagat gttccacaag agcctggaga gggccgaggc 961 cggagataac ctcggggccc tggtccgagg cttgaagcgg gaggacttgc ggcggggcct 1021 ggtcatggtc aagccaggtt ccatcaagcc ccaccagaag gtggaggccc aggtttacat 1081 cctcagcaag gaggaaggtg gccgccacaa gccctttgtg tcccacttca tgcctgtcat 1141 gttctccctg acttggaaca tggcctgtcg gattatcctg cccccagaga aggagcttgc 1201 catgcccggg gaggacctga agttcaacct aatcttgcgg cagccaatga tcttagagaa 1261 aggccagcgt ttcaccctgc gagatggcaa ccggactatt ggcaccggtc tagtcaccaa 1321 cacgctggcc atgactgagg aggagaagaa tatcaaatgg ggttgagtgt gcagatctct 1381 gctcagcttc ccttgcgttt aaggcctgcc ctagccaggg ctccctcctg cttccagtac 1441 cctctcatgg cataggctgc aacccagcag agggcagcta gatggacatt tcccctgctc 1501 ggaagggttg gcctgcctgg ctggggaggt cagtaaactt tgaatagtaa gcca // LOCUS HUMMUC18A 2943 bp mRNA PRI 16-AUG-1994 DEFINITION Human isolate JuSo MUC18 glycoprotein mRNA (3' variant), complete cds. ACCESSION M29277 NID g530047 KEYWORDS MUC18 glycoprotein; cell adhesion molecule; immunoglobulin-like sequence; integral membrane glycoprotein. SOURCE Human cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2943) AUTHORS Lehmann,J.M., Riethmuller,G. and Johnson,J.P. TITLE MUC18, a marker of tumor progression in human melanoma, shows sequence similarity to the neural cell adhesion molecules of the immunoglobulin superfamily JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86 (24), 9891-9895 (1989) MEDLINE 90099368 REFERENCE 2 (bases 1 to 2943) AUTHORS Sers,C., Kirsch,K., Rothbacher,U., Riethmuller,G. and Johnson,J.P. TITLE Genomic organization of the melanoma-associated glycoprotein MUC18: implications for the evolution of the immunoglobulin domains JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90 (18), 8514-8518 (1993) MEDLINE 93391384 REFERENCE 3 (bases 1 to 2943) AUTHORS Johnson,J.P. TITLE Direct Submission JOURNAL Submitted (06-OCT-1989) Judith P. Johnson, Institute for Immunology, University of Munich, Goethestrasse 31, Munich, Germany 80336 FEATURES Location/Qualifiers source 1..2943 /organism="Homo sapiens" /isolate="JuSo" /db_xref="taxon:9606" /tissue_type="melanoma" /cell_line="Mel JuSo" /clone="drop4.7" CDS 8..1948 /codon_start=1 /product="MUC18 glycoprotein" /db_xref="PID:g530048" /translation="MGLPRLVCAFLLAACCCCPRVAGVPGEAEQPAPELVEVEVGSTA LLKCGLSQSQGNLSHVDWFSVHKEKRTLIFRVRQGQGQSEPGEYEQRLSLQDRGATLA LTQVTPQDERIFLCQGKRPRSQEYRIQLRVYKAPEEPNIQVNPLGIPVNSKEPEEVAT CVGRNGYPIPQVIWYKNGRPLKEEKNRVHIQSSQTVESSGLYTLQSILKAQLVKEDKD AQFYCELNYRLPSGNHMKESREVTVPVFYPTEKVWLEVEPVGMLKEGDRVEIRCLADG NPPPHFSISKQNPSTREAEEETTNDNGVLVLEPARKEHSGRYECQAWNLDTMISLLSE PQELLVNYVSDVRVSPAAPERQEGSSLTLTCEAESSQDLEFQWLREETDQVLERGPVL QLHDLKREAGGGYRCVASVPSIPGLNRTQLVKLAIFGPPWMAFKERKVWVKENMVLNL SCEASGHPRPTISWNVNGTASEQDQDPQRVLSTLNVLVTPELLETGVECTASNDLGKN TSILFLELVNLTTLTPDSNTTTGLSTSTASPHTRANSTSTERKLPEPESRGVVIVAVI VCILVLAVLGAVLYFLYKKGKLPCRRSGKQEITLPPSRKTELVVEVKSDKLPEEMGLL QGSSGDKRAPGDQGEKYIDLRH" sig_peptide 8..91 /product="MUC18 glycoprotein signal peptide" mat_peptide 92..1945 /product="MUC18 glycoprotein" misc_feature 122..373 /note="immunoglobulin-like (V set) domain I" misc_feature 467..694 /note="immunoglobulin-like (V set) domain II" misc_feature 795..988 /note="immunoglobulin-like (C2 set) domain" misc_feature 1076..1249 /note="mmunoglobulin-like (C2 set) domain" misc_feature 1337..1525 /note="immunoglobulin-like (C2 set) domain" BASE COUNT 668 a 863 c 858 g 554 t ORIGIN 1 gggaagcatg gggcttccca ggctggtctg cgccttcttg ctcgccgcct gctgctgctg 61 tcctcgcgtc gcgggtgtgc ccggagaggc tgagcagcct gcgcctgagc tggtggaggt 121 ggaagtgggc agcacagccc ttctgaagtg cggcctctcc cagtcccaag gcaacctcag 181 ccatgtcgac tggttttctg tccacaagga gaagcggacg ctcatcttcc gtgtgcgcca 241 gggccagggc cagagcgaac ctggggagta cgagcagcgg ctcagcctcc aggacagagg 301 ggctactctg gccctgactc aagtcacccc ccaagacgag cgcatcttct tgtgccaggg 361 caagcgccct cggtcccagg agtaccgcat ccagctccgc gtctacaaag ctccggagga 421 gccaaacatc caggtcaacc ccctgggcat ccctgtgaac agtaaggagc ctgaggaggt 481 cgctacctgt gtagggagga acgggtaccc cattcctcaa gtcatctggt acaagaatgg 541 ccggcctctg aaggaggaga agaaccgggt ccacattcag tcgtcccaga ctgtggagtc 601 gagtggtttg tacaccttgc agagtattct gaaggcacag ctggttaaag aagacaaaga 661 tgcccagttt tactgtgagc tcaactaccg gctgcccagt gggaaccaca tgaaggagtc 721 cagggaagtc accgtccctg ttttctaccc gacagaaaaa gtgtggctgg aagtggagcc 781 cgtgggaatg ctgaaggaag gggaccgcgt ggaaatcagg tgtttggctg atggcaaccc 841 tccaccacac ttcagcatca gcaagcagaa ccccagcacc agggaggcag aggaagagac 901 aaccaacgac aacggggtcc tggtgctgga gcctgcccgg aaggaacaca gtgggcgcta 961 tgaatgtcag gcctggaact tggacaccat gatatcgctg ctgagtgaac cacaggaact 1021 actggtgaac tatgtgtctg acgtccgagt gagtcccgca gcccctgaga gacaggaagg 1081 cagcagcctc accctgacct gtgaggcaga gagtagccag gacctcgagt tccagtggct 1141 gagagaagag acagaccagg tgctggaaag ggggcctgtg cttcagttgc atgacctgaa 1201 acgggaggca ggaggcggct atcgctgcgt ggcgtctgtg cccagcatac ccggcctgaa 1261 ccgcacacag ctggtcaagc tggccatttt tggcccccct tggatggcat tcaaggagag 1321 gaaggtgtgg gtgaaagaga atatggtgtt gaatctgtct tgtgaagcgt cagggcaccc 1381 ccggcccacc atctcctgga acgtcaacgg cacggcaagt gaacaagacc aagatccaca 1441 gcgagtcctg agcaccctga atgtcctcgt gaccccggag ctgttggaga caggtgttga 1501 atgcacggcc tccaacgacc tgggcaaaaa caccagcatc ctcttcctgg agctggtcaa 1561 tttaaccacc ctcacaccag actccaacac aaccactggc ctcagcactt ccactgccag 1621 tcctcatacc agagccaaca gcacctccac agagagaaag ctgccggagc cggagagccg 1681 gggcgtggtc atcgtggctg tgattgtgtg catcctggtc ctggcggtgc tgggcgctgt 1741 cctctatttc ctctataaga agggcaagct gccgtgcagg cgctcaggga agcaggagat 1801 cacgctgccc ccgtctcgta agaccgaact tgtagttgaa gttaagtcag ataagctccc 1861 agaagagatg ggcctcctgc agggcagcag cggtgacaag agggctccgg gagaccaggg 1921 agagaaatac atcgatctga ggcattagcc ccgaatcact tcagctccct tccctgcctg 1981 gaccattccc agctccctgc tcactcttct ctcagccaaa gctcaaaggg actagagaga 2041 agcctcctgc tcccctcgcc tgcacacccc ctttcagagg gccactgggt taggacctga 2101 ggacctcact tggccctgca aggcccgctt ttcagggacc agtccaccac catctcctcc 2161 acgttgagtg aagctcatcc caagcaagga gccccagtct cccgagcggg taggagagtt 2221 tcttgcagaa cgtgtttttt ctttacacac attatgctgt aaatacgctc gtcctgccag 2281 cagctgagct gggtagcctc tctgagctgg tttcctgccc caaaggctgg cattccacca 2341 tccaggtgca ccactgaagt gaggacacac cggagccagg cgcctgctca tgttgaagtg 2401 cgctgttcac acccgctccg gagagcaccc cagcagcatc cagaagcagc tgcagtgcaa 2461 gcttgcatgc ctgcgtgttg ctgcaccacc ctcctgtctg cctcttcaaa gtctcctgtg 2521 acattttttc tttggtcaga ggccaggaac tgtgtcattc cttaaagata cgtgccgggg 2581 ccaggtgtgg ctcacgcctg taatcccagc actttgggag gccgaggcgg cggatcacaa 2641 agtcagacga gaccatcctg gctaacacgg tgaaaccctg tctctactaa aaatacaaaa 2701 aaaaattagc taggcgtagt ggttggcacc tatagtccca gctactcgga aggctgaagc 2761 aggagaatgg tatgaatcca ggaggtggag cttgcagtga gccgagaccg tgccactgca 2821 ctccagcctg ggcaacacag cgagactccg tctcgagccg gccggttgcg cgggccctcg 2881 gaccctcaga gaggcgaggg ttcgagggca cgagttcgag gccaacctgg tccacatggg 2941 ttg // LOCUS HUMMUPCAD 6972 bp mRNA PRI 24-MAR-1997 DEFINITION Human CAD mRNA for multifunctional protein CAD, complete cds. ACCESSION D78586 NID g1228048 KEYWORDS multifunctional protein CAD. SOURCE Homo sapiens fetus fetal lung fibroblast cell_line:TIG-1-20 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6972) AUTHORS Iwahana,H., Fujimura,M., Ii,S., Kondo,M., Moritani,M., Takahashi,Y., Yamaoka,T., Yoshimoto,K. and Itakura,M. TITLE Molecular cloning of a human cDNA encoding a trifunctional enzyme of carbamoyl-phosphate synthetase-aspartate transcarbamoylase-dihydroorotase in de Novo pyrimidine synthesis JOURNAL Biochem. Biophys. Res. Commun. 219 (1), 249-255 (1996) MEDLINE 96190701 REFERENCE 2 (bases 1 to 6972) AUTHORS Iwahana,H. TITLE Direct Submission JOURNAL Submitted (01-DEC-1995) to the DDBJ/EMBL/GenBank databases. Hiroyuki Iwahana, School of Medicine, The University of Tokushima, Otsuka Dept. of Clin. & Mol. Nutr.; Kuramoto 3-18-15, Tokushima, Tokushima 770, Japan (Tel:0886-33-7098, Fax:0886-31-9476) FEATURES Location/Qualifiers source 1..6972 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="TIG-1-20" /cell_type="fibroblast" /dev_stage="fetus" /tissue_type="fetal lung" 5'UTR 1..26 gene 27..6704 /gene="CAD" CDS 27..6704 /gene="CAD" /note="carbamoyl-phosphate synthetase II (EC 6.3.5.5) + aspartate carbamoyltransferase (EC 2.1.3.2) + dihydroorotase (EC 3.5.2.3)" /codon_start=1 /product="multifunctional protein CAD" /db_xref="PID:d1012088" /db_xref="PID:g1228049" /translation="MAALVLEDGSVLRGQPFGAAVSTAGEVVFQTGMVGYPEALTDPS YKAQILVLTYPLIGNYGIPPDEMDEFGLCKWFESSGIHVAALVVGECCPTPSHWSATR TLHEWLQQHGIPGLQGVDTRELTKKLREQGSLLGKLVQNGTEPSSLPFLDPNARPLVP EVSIKTPRVFNTGGAPRILALDCGLKYNQIRCLCQRGAEVTVVPWDHALDSQEYEGLF LSNGPGDPASYPSVVSTLSRVLSEPNPRPVFGICLGHQLLALAIGAKTYKMRYGNRGH NQPCLLVGSGRCFLTSQNHGFAVETDSLPADWAPLFTNANDGSNEGIVHNSLPFFSVQ FHPEHQAGPSDMELLFDIFLETVKEATAGNPGGQTVRERLTERLCPPGIPTPGSGLPP PRKVLILGSGGLSIGQAGEFDYSGSQAIKALKEENIQTLLINPNIATVQTSQGLADKV YFLPITPHYVTQVIRNERPDGVLLTFGGQTALNCGVELTKAGVLARYGVRVLGTTVET IELTEDRRAFAARMAEIGEHVAPSEAGNSLEQAQAAAERLGYPVLVRAAFAVGGLGSG FASNREELSALVAPAFAHTSQVLVDKSLKGWKEIEYEVVRDAYGNCVTVCNMENLDPL GIHTGESIVVAPSQTLNDREYQLLRQTAIKVTQHLGIVGECNVQYALNPESEQYYIIE VNARLSRSSALASKATGYPLAYVAAKLALGIPLPELRNSVTGGTAAFEPSVDYCVVKI PRWDLSKFLRVSTKIGSCMKSVGEVMGIGRSFEEAFQKALRMVDENCVGFDHTVKPVS DMELETPTDKRIFVVAAALWAGYSVDRLYELTRIDRWFLHRMKRIIAHAQLLEQHRGQ PLPPDLLQQAKCLGFSDKQIALAVLSTELAVRKLRQELGICPAVKQIDTVAAEWPAQT NYLYLTYWGTTHDLTFRTPHVLVLGSGVYRIGSSVEFDWCAVGCIQQLRKMGYKTIMV NYNPETVSTDYDMCDRLYFDEISFEVVMDIYELENPEGVILSMGGQLPNNMAMALHRQ QCRVLGTSPEAIDSAENRFKFSRLLDTIGISQPQWRELSDLESARQFCQTVGYPCVVR PSYVLSGAAMNVAYADGDLERFLSSAAAVSKEHPVVISKFIQEAKEIDVDAVASDGVV AAIAISEHVENAGVHSGDATLVTPPQDITAKTLERIKAIVHAVGQELQVTGPFNLQLI AKDDQLKVIECNVRVSRSFPFVSKTLGVDLVALATRVIMGEEVEPVGLMTGSGVVGVK VPQFSFSRLAGADVVLGVEMTSTGEVAGFGESRCEAYLKAMLSTGFKIPKKNILLTIG SYKNKSELLPTVRLLESLGYSLYASLGTADFYTEHGVKVTAVDWHFEEAVDGECPPQR SILEQLAEKNFELVINLSMRGAGGRRLSSFVTKGYRTRRLAADFSVPLIIDIKCTKLF VEALGQIGPAPPLKVHVDCMTSQKLVRLPGLIDVHVHLREPGGTHKEDFASGTAAALA GGITMVCAMPNTRPPIIDGPALALAQKLAEAGARCDFALFLGASSENAGTLGTVAGSA AGLKLYLNETFSELRLDSVVQWMEHFETWPSHLPIVAHAEQQTVAAVLMVAQLTQRSV HICHVARKEEILLIKAAKARGLPVTCEVAPHHLFLSHDDLERLGPGKGEVRPELGSRQ DVEALWEDMAVIDCFASDHAPHTLEEKCGSRPPPGFPGLETMLPLLLTAVSEGRLSLD DLLQRLHHNPRRIFHLPPQEDTYVEVDLEHEWTIPSHMPFSKAHWTPFEGQKVKGTVR RVVLRGEVAYIDGQVLVPPGYGQDVRKWPQGAVPQLPPSAPATSEMTTTPERPRRGIP GLPDGRFHLPPRIHRASDPGLPAEEPKEKSSRKVAEPELMGTPDGTCYPPPPVPRQAS PQNLGTPGLLHPQTSPLLHSLVGQHILSVQQFTKDQMSHLFNVAHTLRMMVQKERSLD ILKGKVMASMFYEVSTRTSSSFAAAMARLGGAVLSFSEATSSVQKGESLADSVQTMSC YADVVVLRHPQPGAVELAAKHCRRPVINAGDGVGEHPTQALLDIFTIREELGTVNGMT ITMVGDLKHGRTVHSLACLLTQYRVSLRYVAPPSLRMPPTVRAFVASRGTKQEEFESI EEALPDTDVLYMTRIQKERFGSTQEYEACFGQFILTPHIMTRAKKKMVVMHPMPRVNE ISVEVDSDPRAAYFRQAENGMYIRMALLATVLGRF" 3'UTR 6705..6972 polyA_signal 6949..6954 polyA_site 6972 BASE COUNT 1437 a 1998 c 2025 g 1512 t ORIGIN 1 cgcccccgcc tctgagctcc cttcccatgg cggccctagt gttggaggac gggtcggtcc 61 tgcggggcca gccctttggg gccgccgtgt cgactgccgg ggaagtggtg tttcaaaccg 121 gcatggtcgg ctaccccgag gccctcactg atccctccta caaggcacag atcttagtgc 181 tcacctatcc tctgatcggc aactatggca tccccccaga tgaaatggat gagttcggtc 241 tctgcaagtg gtttgaatcc tcgggcatcc acgtagcagc actggtagtg ggagagtgct 301 gtcctactcc cagccactgg agtgccaccc gcaccctgca tgagtggctg cagcagcatg 361 gcatccctgg cttgcaagga gtagacactc gggagctgac caagaagttg cgggaacagg 421 ggtctctgct ggggaagctg gtccagaatg gaacagaacc ttcatccctg ccattcttgg 481 accccaatgc ccgccccctg gtaccagagg tctccattaa gactccacgg gtattcaata 541 cagggggtgc ccctcggatc cttgctttgg actgtggcct caagtataat cagatccgat 601 gcctctgcca gcgtggggct gaggtcactg tggtaccctg ggaccatgca ctagacagcc 661 aagagtatga gggtctcttc ttaagtaatg ggcctggtga ccctgcctcc tatcccagtg 721 tcgtatccac actgagccgt gttttatctg agcctaatcc ccgacctgtc tttgggatct 781 gcctgggaca ccagctattg gccttagcca ttggggccaa gacttacaag atgagatatg 841 ggaaccgagg ccataaccag ccctgcttgt tggtgggctc tgggcgctgc tttctgacat 901 cccagaacca tgggtttgct gtggagacag actcactgcc agcagactgg gctcctctct 961 tcaccaacgc caatgatggt tccaatgaag gcattgtgca caacagcttg cctttcttca 1021 gtgtccagtt tcacccagag caccaagctg gcccttcaga tatggaactg cttttcgata 1081 tctttctgga aactgtgaaa gaggccacag ctgggaaccc tgggggccag acagttagag 1141 agcggctgac tgagcgcctc tgtccccctg ggattcccac tcccggctct ggacttccac 1201 caccacgaaa ggttctgatc ctgggctcag ggggcctctc cattggccaa gctggagaat 1261 ttgactactc gggctctcag gcaattaagg ccctgaagga ggaaaacatc cagacgttgc 1321 tgatcaaccc caatattgcc acagtgcaga cctcccaggg gctggccgac aaggtctatt 1381 ttcttcccat aacacctcat tatgtaaccc aggtgatacg taatgaacgc cccgatggtg 1441 tgttactgac ttttgggggc cagactgctc tgaactgtgg tgtggagctg accaaggccg 1501 gggtgctggc tcggtatggg gtccgggtcc tgggcacaac agtggagacc attgagctga 1561 ccgaggatcg acgggccttt gctgccagaa tggcagagat cggagagcat gtggccccga 1621 gcgaggcagg aaattctctt gaacaggccc aggcagccgc tgaacggctg gggtaccctg 1681 tgctagtgcg tgcagccttt gccgtgggtg gcctgggctc tggctttgcc tctaacaggg 1741 aggagctctc tgctctcgtg gccccagctt ttgcccatac cagccaagtg ctagtagaca 1801 agtctctgaa gggatggaag gagattgagt acgaggtggt gagagacgcc tatggcaact 1861 gtgtcacggt gtgtaacatg gagaacttgg acccactggg catccacact ggtgagtcca 1921 tagtggtggc ccctagccag acactgaatg acagggagta tcagctcctg aggcagacag 1981 ctatcaaggt gacccagcac ctgggaattg ttggggagtg caatgtgcag tatgccttga 2041 accctgagtc tgagcagtat tacatcattg aagtgaatgc caggctctct cgcagctctg 2101 ccctggccag taaggccaca ggttatccac tggcttatgt ggcagccaag ctagcattgg 2161 gcatcccttt gcctgagctc aggaactctg tgacaggggg tacagcagcc tttgaaccca 2221 gcgtggatta ttgtgtggtg aagattcctc gatgggacct tagcaagttc ctgcgagtca 2281 gcacaaagat tgggagctgc atgaagagcg ttggtgaagt catgggcatt gggcgttcat 2341 ttgaggaggc cttccagaag gccctgcgca tggtggatga gaactgtgtg ggctttgatc 2401 acacagtgaa accagtcagc gatatggagt tggagactcc aacagataag cggatttttg 2461 tggtggcagc tgctttgtgg gctggttatt cagtggaccg cctgtatgag ctcacacgca 2521 tcgaccgctg gttcctgcac cgaatgaagc gtatcatcgc acatgcccag ctgctagaac 2581 aacaccgtgg acagcctttg ccgccagacc tgctgcaaca ggccaagtgt cttggcttct 2641 cagacaaaca gattgccctt gcagttctga gcacagagct ggctgttcgc aagctgcgtc 2701 aggaactggg gatctgtcca gcagtgaaac agattgacac agttgcagct gagtggccag 2761 cccagacaaa ttacctatac ctaacgtatt ggggcaccac ccatgacctc acctttcgaa 2821 cacctcatgt cctagtcctt ggctctggcg tctaccgtat tggctccagt gttgagtttg 2881 actggtgtgc tgtaggctgc atccagcagc tccgaaagat gggatataag accatcatgg 2941 tgaactataa cccagagaca gtcagcaccg actatgacat gtgtgatcga ctctactttg 3001 atgagatctc ttttgaggtg gtgatggaca tctatgagct cgagaaccct gaaggtgtga 3061 tcctatccat gggtggacag ctgcccaaca acatggccat ggcgttgcat cggcagcagt 3121 gccgggtgct gggcacctcc cctgaagcca ttgactcggc tgagaaccgt ttcaagtttt 3181 cccggctcct tgacaccatt ggtatcagcc agcctcagtg gagggagctc agtgacctcg 3241 agtctgctcg ccaattctgc cagaccgtgg ggtacccctg tgtggtgcgc ccctcctatg 3301 tgctgagcgg tgctgctatg aatgtggcct acgcggatgg agacctggag cgcttcctga 3361 gcagcgcagc agccgtctcc aaagagcatc ccgtggtcat ctccaagttc atccaggagg 3421 ctaaggagat tgacgtggat gccgtggcct ctgatggtgt ggtggcagcc atcgccatct 3481 ctgagcatgt ggagaatgca ggtgtgcatt caggtgatgc gacgctggtg acccccccac 3541 aagatatcac tgccaaaacc ctggagcgga tcaaagccat tgtgcatgct gtgggccagg 3601 agctacaggt cacaggaccc ttcaatctgc agctcattgc caaggatgac cagctgaaag 3661 ttattgaatg caacgtacgt gtctctcgct ccttcccctt cgtttccaag acactgggtg 3721 tggacctagt agccttggcc acgcgggtca tcatggggga agaagtggaa cctgtggggc 3781 taatgactgg ttctggagtc gtgggagtaa aggtgcctca gttctccttc tcccgcttgg 3841 cgggtgctga cgtggtgttg ggtgtggaaa tgaccagtac tggggaggtg gccggctttg 3901 gggagagccg ctgtgaggca tacctcaagg ccatgctaag cactggcttt aagatcccca 3961 agaagaatat cctgctgacc attggcagct ataagaacaa aagcgagctg ctcccaactg 4021 tgcggctact ggagagcctg ggctacagcc tctatgccag tctcggcaca gctgacttct 4081 acactgagca tggcgtcaag gtaacagctg tggactggca ctttgaggag gctgtggatg 4141 gtgagtgccc accacagcgg agcatcctgg agcagctagc tgagaaaaac tttgagctgg 4201 tgattaacct gtcaatgcgt ggagctgggg gccggcgtct ctcctccttt gtcaccaagg 4261 gctaccgcac ccgacgcttg gccgctgact tctccgtgcc cctaatcatc gatatcaagt 4321 gcaccaaact ctttgtggag gccctaggcc agatcgggcc agcccctcct ttgaaggtgc 4381 atgttgactg tatgacctcc caaaagcttg tgcgactgcc gggattgatt gatgtccatg 4441 tgcacctgcg ggaaccaggt gggacacata aggaggactt tgcttcaggc acagccgctg 4501 ccctggctgg gggtatcacc atggtgtgtg ccatgcctaa tacccggccc cccatcattg 4561 acggccctgc tctggccctg gcccagaagc tggcagaggc tggcgcccgg tgcgactttg 4621 cgctattcct tggggcctcg tctgaaaatg caggaacctt gggcaccgtg gccgggtctg 4681 cagccgggct gaagctttac ctcaatgaga ccttctctga gctgcggctg gacagcgtgg 4741 tccagtggat ggagcatttc gagacatggc cctcccacct ccccattgtg gctcacgcag 4801 agcagcaaac cgtggctgct gtcctcatgg tggctcagct cactcagcgc tcagtgcaca 4861 tatgtcacgt ggcacggaag gaggagatcc tgctaattaa agctgcaaag gcacggggct 4921 tgccagtgac ctgcgaggtg gctccccacc acctgttcct aagccatgat gacctggagc 4981 gcctggggcc tgggaagggg gaggtccggc ctgagcttgg ctcccgccag gatgtggaag 5041 ccctgtggga ggacatggct gtcatcgact gctttgcctc agaccatgct ccccatacct 5101 tggaggagaa gtgtgggtcc aggcccccac ctgggttccc agggttagag accatgctgc 5161 cactactcct gacggctgta agcgagggcc ggctcagcct ggacgacctg ctgcagcgat 5221 tgcaccacaa tcctcggcgc atctttcacc tgcccccgca ggaggacacc tatgtggagg 5281 tggatctgga gcatgagtgg acaattccca gccacatgcc cttctccaag gcccactgga 5341 caccttttga agggcagaaa gtgaagggca ccgtccgccg tgtggtcctg cgaggggagg 5401 ttgcctatat cgatgggcag gttctggtac ccccgggcta tggacaggat gtacggaagt 5461 ggccacaggg ggctgttcct cagctcccac cctcagcccc tgccactagt gagatgacca 5521 cgacacctga aagaccccgc cgtggcatcc cagggcttcc tgatggccgc ttccatctgc 5581 cgccccgaat ccatcgagcc tccgacccag gtttgccagc tgaggagcca aaggagaagt 5641 cctctcggaa ggtagccgag ccagagctga tgggaacccc tgatggcacc tgctaccctc 5701 caccaccagt accgagacag gcatctcccc agaacctggg gacccctggc ttgctgcacc 5761 cccagacctc acccctgctg cactcattag tgggccaaca tatcctgtcc gtccagcagt 5821 tcaccaagga tcagatgtct cacctgttca atgtggcaca cacactgcgt atgatggtgc 5881 agaaggagcg gagcctcgac atcctgaagg ggaaggtcat ggcctccatg ttctatgaag 5941 tgagcacacg gaccagcagc tcctttgcag cagccatggc ccggctggga ggtgctgtgc 6001 tcagcttctc ggaagccaca tcgtccgtcc agaagggcga atccctggct gactccgtgc 6061 agaccatgag ctgctatgcc gacgtcgtcg tgctccggca cccccagcct ggagcagtgg 6121 agctggccgc caagcactgc cggaggccag tgatcaatgc tggggatggg gtcggagagc 6181 accccaccca ggccctgctg gacatcttca ccatccgtga ggagctggga actgtcaatg 6241 gcatgacgat cacgatggtg ggtgacctga agcacggacg cacagtacat tccctggcct 6301 gcctgctcac ccagtatcgt gtcagcctgc gctacgtggc acctcccagc ctgcgcatgc 6361 cacccactgt gcgggccttc gtggcctccc gcggcaccaa gcaggaggaa ttcgagagca 6421 ttgaggaggc gctgcctgac actgatgtgc tctacatgac tcgaatccag aaggaacgat 6481 ttggctctac ccaggagtac gaagcttgct ttggtcagtt catcctcact ccccacatca 6541 tgacccgggc caagaagaag atggtggtga tgcacccgat gccccgtgtc aacgagataa 6601 gcgtggaagt ggactcggat ccccgcgcag cctacttccg ccaggctgag aacggcatgt 6661 acatccgcat ggctctgtta gccaccgtgc tgggccgttt ctaggggcct ggcttcctca 6721 gcctcttctc tttaggccca gctgctgggc aaggaattcc agtgcctcct acgggggcag 6781 cacacttaga tattcctgga catccagatt gctcacatgt gctgaccaca cttcaggctc 6841 tggactggag ctctctggca tgggggtggg gcctcagatg ctggggccca gtctgcccca 6901 tcttcattcc tgcaccttaa acctgtacag tcatttttct actgacttaa taaacagccg 6961 agctgtccct tg // LOCUS HUMMXA 2651 bp mRNA PRI 26-OCT-1992 DEFINITION Human interferon-induced cellular resistance mediator protein (MxA) mRNA, complete cds. ACCESSION M30817 NID g188900 KEYWORDS interferon-induced cellular resistance mediator protein; interferon-inducible protein. SOURCE Human IFN-alpha-2 treated cell line T98G, cDNA to mRNA, clone MxA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2651) AUTHORS Aebi,M., Faeh,J., Hurt,N., Samuel,C.E., Thomis,D.C., Bazzigher,L., Pavlovic,J., Haller,O. and Staeheli,P. TITLE cDNA structures and regulation of two interferon-induced human Mx proteins JOURNAL Mol. Cell. Biol. 9, 5062-5072 (1989) MEDLINE 90097923 FEATURES Location/Qualifiers source 1..2651 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="IFN-alpha-2 treated cell line T98G" mRNA <1..2651 /note="MxA mRNA" CDS 211..2199 /codon_start=1 /product="interferon-induced Mx protein" /db_xref="PID:g188901" /translation="MVVSEVDIAKADPAAASHPLLLNGDATVAQKNPGSVAENNLCSQ YEEKVRPCIDLIDSLRALGVEQDLALPAIAVIGDQSSGKSSVLEALSGVALPRGSGIV TRCPLVLKLKKLVNEDKWRGKVSYQDYEIEISDASEVEKEINKAQNAIAGEGMGISHE LITREISSRDVPDLTLIDLPGITRVAVGNQPADIGYKIKTLIKKYIQRQETISLVVVP SNVDIATTEALSMAQEVDPEGDRTIGILTKPDLVDKGTEDKVVDVVRNLVFHLKKGYM IVKCRGQQEIQDQLSLSEALQREKIFFENHPYFRDLLEEGKATVPCLAEKLTSELITH ICKSLPLLENQIKETHQRITEELQKYGVDIPEDENEKMFFLIDKINAFNQDITALMQG EETVGEEDIRLFTRLRHEFHKWSTIIENNFQEGHKILSRKIQKFENQYRGRELPGFVN YRTFETIVKQQIKALEEPAVDMLHTVTDMVRLAFTDVSIKNFEEFFNLHRTAKSKIED IRAEQEREGEKLIRLHFQMEQIVYCQDQVYRGALQKVREKELEEEKKKKSWDFGAFQS SSATDSSMEEIFQHLMAYHQEASKRISSHIPLIIQFFMLQTYGQQLQKAMLQLLQDKD TYSWLLKERSDTSDKRKFLKERLARLTQARRRLAQFPG" BASE COUNT 732 a 646 c 704 g 569 t ORIGIN 2 bp upstream of EcoRI site. 1 ggaattctgt ggccatactg cgaggagatc ggttccgggt cggaggctac aggaagactc 61 ccactccctg aaatctggag tgaagaacgc cgccatccag ccaccattcc aaggaggtgc 121 aggagaacag ctctgtgata ccatttaact tgttgacatt acttttattt gaaggaacgt 181 atattagagc ttactttgca aagaaggaag atggttgttt ccgaagtgga catcgcaaaa 241 gctgatccag ctgctgcatc ccaccctcta ttactgaatg gagatgctac tgtggcccag 301 aaaaatccag gctcggtggc cgagaacaac ctgtgcagcc agtatgagga gaaggtgcgc 361 ccctgcatcg acctcattga ctccctgcgg gctctaggtg tggagcagga cctggccctg 421 ccagccatcg ccgtcatcgg ggaccagagc tcgggcaaga gctccgtgtt ggaggcactg 481 tcaggagttg cccttcccag aggcagcggg atcgtgacca gatgcccgct ggtgctgaaa 541 ctgaagaaac ttgtgaacga agataagtgg agaggcaagg tcagttacca ggactacgag 601 attgagattt cggatgcttc agaggtagaa aaggaaatta ataaagccca gaatgccatc 661 gccggggaag gaatgggaat cagtcatgag ctaatcaccc gtgagatcag ctcccgagat 721 gtcccggatc tgactctaat agaccttcct ggcataacca gagtggctgt gggcaatcag 781 cctgctgaca ttgggtataa gatcaagaca ctcatcaaga agtacatcca gaggcaggag 841 acaatcagcc tggtggtggt ccccagtaat gtggacattg ccaccacaga ggctctcagc 901 atggcccagg aggtggaccc cgagggagac aggaccatcg gaatcttgac gaagcctgat 961 ctggtggaca aaggaactga agacaaggtt gtggacgtgg tgcggaacct cgtgttccac 1021 ctgaagaagg gttacatgat tgtcaagtgc cggggccagc aggagatcca ggaccagctg 1081 agcctgtccg aagccctgca gagagagaag atcttctttg agaaccaccc atatttcagg 1141 gatctgctgg aggaaggaaa ggccacggtt ccctgcctgg cagaaaaact taccagcgag 1201 ctcatcacac atatctgtaa atctctgccc ctgttagaaa atcaaatcaa ggagactcac 1261 cagagaataa cagaggagct acaaaagtat ggtgtcgaca taccggaaga cgaaaatgaa 1321 aaaatgttct tcctgataga taaaattaat gcctttaatc aggacatcac tgctctcatg 1381 caaggagagg aaactgtagg ggaggaagac attcggctgt ttaccagact ccgacacgag 1441 ttccacaaat ggagtacaat aattgaaaac aattttcaag aaggccataa aattttgagt 1501 agaaaaatcc agaaatttga aaatcagtat cgtggtagag agctgccagg ctttgtgaat 1561 tacaggacat ttgagacaat cgtgaaacag caaatcaagg cactggaaga gccggctgtg 1621 gatatgctac acaccgtgac ggatatggtc cggcttgctt tcacagatgt ttcgataaaa 1681 aattttgaag agttttttaa cctccacaga accgccaagt ccaaaattga agacattaga 1741 gcagaacaag agagagaagg tgagaagctg atccgcctcc acttccagat ggaacagatt 1801 gtctactgcc aggaccaggt atacaggggt gcattgcaga aggtcagaga gaaggagctg 1861 gaagaagaaa agaagaagaa atcctgggat tttggggctt tccaatccag ctcggcaaca 1921 gactcttcca tggaggagat ctttcagcac ctgatggcct atcaccagga ggccagcaag 1981 cgcatctcca gccacatccc tttgatcatc cagttcttca tgctccagac gtacggccag 2041 cagcttcaga aggccatgct gcagctcctg caggacaagg acacctacag ctggctcctg 2101 aaggagcgga gcgacaccag cgacaagcgg aagttcctga aggagcggct tgcacggctg 2161 acgcaggctc ggcgccggct tgcccagttc cccggttaac cacactctgt ccagccccgt 2221 agacgtgcac gcacactgtc tgcccccgtt cccgggtagc cactggactg acgacttgag 2281 tgctcagtag tcagactgga tagtccgttc ctgcttatcc gttagccgtg gtgatttagc 2341 aggaagctgt gagagcagtt tggtttctag catgaagaca gagccccacc ctcagatgca 2401 catgagctgg cgggattgaa ggatgctgtc ttcgtactgg gaaagggatt ttcagccctc 2461 agaatcgctc caccttgcag ctctcccctt ctctgtattc ctagaaactg acacatgctg 2521 aacatcacag cttatttcct catttttata atgtcccttc acaaacccag tgttttagga 2581 gcatgagtgc cgtgtgtgtg cgtcctgtcg gagccctgtc tctctctctg taataaactc 2641 atttctagca g // LOCUS HUMMXI1A 2416 bp mRNA PRI 19-SEP-1995 DEFINITION Human MXI1 mRNA, complete cds. ACCESSION L07648 NID g506626 KEYWORDS DNA binding protein; heterodimer; transcription factor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2416) AUTHORS Zervos,A.S., Gyuris,J. and Brent,R. TITLE Mxi1, a protein that specifically interacts with Max to bind Myc-Max recognition sites [published erratum appears in Cell 1994 Oct 21;79(2):following 388] JOURNAL Cell 72 (2), 223-232 (1993) MEDLINE 93145324 FEATURES Location/Qualifiers source 1..2416 /organism="Homo sapiens" /db_xref="taxon:9606" gene 209..895 /gene="MXI1" CDS 209..895 /gene="MXI1" /standard_name="mxi1 protein" /codon_start=1 /function="transcription factor; forms heterodimers with Max protein" /db_xref="PID:g506627" /translation="MERVKMINVQRLLEAAEFLERRERECEHGYASSFPSMPSPRLQH SKPPRRLSRAQKHSSGTSNTSTANRSTHNELEKNRRAHLRLCLERLKVLIPLGPDCTR HTTLGLLNKAKAHIKKLEEAERKSQHQLENLEREQRFLKWRLEQLQGPQEMERIRMDS IGSTISSDRSDSEREEIEVDVESTEFSHGEVDNISTTSISDIDDHSSLPSIGSDEGYS SASVKLSFTS" BASE COUNT 731 a 518 c 502 g 665 t ORIGIN 1 agattatgat cgcctgaggc ccctctccta cccagatacc gatgttatac tgatgtgttt 61 ttcctttttt tttttttttt tttaagtaat taagggtagt taaattattt aaagtataca 121 aagtccaaac agccaggggt aaggtctcca agaggccttc ccagggtaag ggagtgcgga 181 gaggccccgg tcgccacccg cggtgcccat ggagcgggtg aagatgatca acgtgcagcg 241 tctgctggag gctgccgagt ttttggagcg ccgggagcga gagtgtgaac atggctacgc 301 ctcttcattc ccgtccatgc cgagcccccg actgcagcat tcaaagcccc cacggaggtt 361 gagccgggca cagaaacaca gcagcgggac gagcaacacc agcactgcca acagatctac 421 acacaatgag ctggaaaaga atcgacgagc tcatctgcgc ctttgtttag aacgcttaaa 481 agttctgatt ccactaggac cagactgcac ccggcacaca acacttggtt tgctcaacaa 541 agccaaagca cacatcaaga aacttgaaga agctgaaaga aaaagccagc accagctcga 601 gaatttggaa cgagaacaga gatttttaaa gtggcgactg gaacagctgc agggtcctca 661 ggagatggaa cgaatacgaa tggacagcat tggatcaact atttcttcag atcgttctga 721 ttcagagcga gaggagattg aagtggatgt tgaaagcaca gagttctccc atggagaagt 781 ggacaatata agtaccacca gcatcagtga cattgatgac cacagcagcc tgccgagtat 841 tgggagtgac gagggttact ccagtgccag tgtcaaactt tcattcactt catagaaccc 901 agcatgacat aacagtgcag ggcaaaatat tcactgggcc aattcaatac aaacaatctc 961 ttaaattggg ttcatgatgc agtctcctct ttaaaacaaa acaaaacaaa acaaaactat 1021 acttgaacaa aagggtcaga ggacctgtat ttaagcaaat acttagcaaa aagtggggca 1081 gagctcccaa ggagaacaaa tattcagaat attcatattg gaaaaatcac aatttttaat 1141 ggcagcagaa aacttgtgtg aaattttctt gatttgagtt gattgagaag aggacattgg 1201 agatgccatc ctctttctct tttctcgttt gctcatacta cattgagtag acacatttaa 1261 ggatggggtt atgaaccctt cctgagcttt atggtcctaa aagcaaaata aaaactattc 1321 gaatgaaaag acaagaaaat caggtattaa tcttggatag ctaataatga gctattaaaa 1381 ctcagcctgg gacagtttat catgaagcct gtggatgatc aatcctttat tattattttt 1441 tttttttgaa aaaagctcat ttcatgctct gcaaaaggag agactcccat gaagcctttt 1501 gaaagggatc atcatgcagc tcaactttct gttggattcc atgctaagca agctaacctt 1561 atcctgcatt gttagcacta ggcacccagc tgccacctct ccatcctgct gcccttaggc 1621 cacatgggag cagtccatgc atgacagcct ctatcctaca aggcctatga gtatggattg 1681 ggggggccaa aaggaaaaag ctccatgtgc ctctttgtct gcgtgggtca gaagagttgt 1741 gcacgcagat tagcaggcca aggtctgagc cacagcagca tttttatttc agattttgat 1801 aactgtttat atgtgttgaa aaccaaaatg acatcttttt aaagcttatc cataaaaaaa 1861 aatagatgtc ttttatagtg gaaaaacaca tggggaaaaa aatcatctat tttgatgcag 1921 catttgataa tgataaaaca cctcacacct cactctttat agtgcacaaa atgaatgagg 1981 tctgggctag gtagaaaaag ggtcaatgct atttttgttt ttagaatcat taccttttac 2041 cagcttttaa ccatctgata tctatagtag acacactatc atagttaaca tagttaagtt 2101 cagcacttgt ctcattttaa tgtaaagatt tgcttccatt ttcctacagg cagtctctct 2161 cttcctcaca gtcccactgt gcaggtgcta ttgttactct tacgaatatt ttcagtaatg 2221 ttattttctt ctaagtgaaa tttctagcct gcactttgat gtcatgtgtt ccctttgtct 2281 ttcaaactcc aaggttcccc tgtggccctc tcccttaccc tgggaaggcc tcttggagac 2341 cttacccctg gctgtttgga ctttgtatac tttaaataat ttaactaccc ttaattactt 2401 aaaaaaaaaa aaaaaa // LOCUS HUMMYC3L 7011 bp DNA PRI 07-JAN-1995 DEFINITION Human L-myc protein gene, complete cds. ACCESSION M19720 NID g188906 KEYWORDS L-myc oncogene; Myc protein; alternative splicing; myc oncogene. SOURCE Human small-cell lung cancer cell line NCI H209, cDNA to mRNA, and placenta DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7011) AUTHORS Kaye,F., Battey,J., Nau,M., Brooks,B., Seifter,E., De Greve,J., Birrer,M., Sausville,E. and Minna,J. TITLE Structure and expression of the human L-myc gene reveal a complex pattern of alternative mRNA processing JOURNAL Mol. Cell. Biol. 8 (1), 186-195 (1988) MEDLINE 88094386 COMMENT Intron A (positions 432-795) is part of an alternative mRNA. The same is true for bases 2624-4271. These different mRNAs are produced by means of alternative splicing. FEATURES Location/Qualifiers source 1..7011 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" TATA_signal 205..209 /gene="L-myc" mRNA join(224..431,796..2623) /gene="L-myc" /note="short form" /product="L-myc protein" mRNA join(224..431,796..1300,4272..6812) /gene="L-myc" /note="long form" /product="L-myc protein" gene join(224..431,796..2623) /gene="L-myc" exon 224..431 /partial /gene="L-myc" /number=1 intron 432..795 /note="L-myc intron A" exon 796..2623 /partial /gene="L-myc" /note="short form" /number=2 exon 796..1300 /gene="L-myc" /number=2 CDS join(805..1300,4272..4870) /partial /gene="L-myc" /codon_start=1 /product="L-myc protein" /db_xref="PID:g386964" /translation="MDYDSYQHYFYDYDCGEDFYRSTAPSEDIWKKFELVPSPPTSPP WGLGPGAGDPAPGIGPPEPWPGGCTGDEAESRGHSKGWGRNYASIIRRDCMWSGFSAR ERLERAVSDRLAPGAPRGNPPKASAAPDCTPSLEAGNPAPAAPCPLGEPKTQACSGSE SPSDSENEEIDVVTVEKRQSLGIRKPVTITVRADPLDPCMKHFHISIHQQQHNYAARF PPESCSQEEASERGPQEEVLERDAAGEKEDEEDEEIVSPPPVESEAAQSCHPKPVSSD TEDVTKRKNHNFLERKRRNDLRSRFLALRDQVPTLASCSKAPKVVILSKALEYLQALV GAEKRMATEKRQLRCRQQQLQKRIAYLSGY" CDS 805..1425 /gene="L-myc" /note="short form" /codon_start=1 /product="L-myc protein" /db_xref="PID:g188908" /translation="MDYDSYQHYFYDYDCGEDFYRSTAPSEDIWKKFELVPSPPTSPP WGLGPGAGDPAPGIGPPEPWPGGCTGDEAESRGHSKGWGRNYASIIRRDCMWSGFSAR ERLERAVSDRLAPGAPRGNPPKASAAPDCTPSLEAGNPAPAAPCPLGEPKTQACSGSE SPSDSGKDLPEPSKRGPPHGWPKLCPCLRSGIGSSQALGPSPPLFG" intron 1301..4271 /note="L-myc intron B" exon 4272..6812 /partial /gene="L-myc" /number=3 BASE COUNT 1561 a 1825 c 1924 g 1701 t ORIGIN 3106 bp upstream of EcoRI site. 1 cgggccgcat cagccctcct cctgtttgcg ctccccagcg tgcaatttat ttggggggct 61 accggggatt gaacggagcg ggcgagcgct gccaggaggt ggggccggcc ccacctgtcg 121 actgcccgta gtaggcaggg agagggcggg gtttgtccca tagggcccgc cccccagtcc 181 ctgggtcccg ggcgcgcgac gagatataag gcagtcagga aacaatgcgc ctgcagctcg 241 cgctcccgcg ccgatcccga gagcgtccgg gccgccgtgc gcgagcgagg gagggcgcgc 301 gcgcgggggg ggcgcgctcg tgagtgcggg ccgcgctctc ggcggcgcgc atgtgcgtgt 361 gtgctggctg ccgggctgcc ccgagccggc ggggagccgg tccgctccag gtggcgggcg 421 gctggagcga ggtgaggctg cgggtggcca gggcacgggc gcgggtcccg cggtgcgggc 481 tggctgcagg ctgccttctg ggcacggcgc gcccccgccc ggccccgccg ggccctggga 541 gctgcgctcc gggcggcgct ggcaaagttt gctttgaact cgctgcccac agtcgggtcc 601 gcgcgctgcg attggcttcc cctaccactc tgacccgggg cccggcttcc cgggacgcga 661 ggactgggcg caggctgcaa gctggtgggg ttggggagga acgagagccc ggcagccgac 721 tgtgccgagg gacccgggga cacctccttc gcccggccgg cacccggtca gcacgtcccc 781 ccttccctcc cgcagggagc ggacatggac tacgactcgt accagcacta tttctacgac 841 tatgactgcg gggaggattt ctaccgctcc acggcgccca gcgaggacat ctggaagaaa 901 ttcgagctgg tgccatcgcc ccccacgtcg ccgccctggg gcttgggtcc cggcgcaggg 961 gacccggccc ccgggattgg tcccccggag ccgtggcccg gagggtgcac cggagacgaa 1021 gcggaatccc ggggccactc gaaaggctgg ggcaggaact acgcctccat catacgccgt 1081 gactgcatgt ggagcggctt ctcggcccgg gaacggctgg agagagctgt gagcgaccgg 1141 ctcgctcctg gcgcgccccg ggggaacccg cccaaggcgt ccgccgcccc ggactgcact 1201 cccagcctcg aagccggcaa cccggcgccc gccgccccct gtccgctggg cgaacccaag 1261 acccaggcct gctccgggtc cgagagccca agcgactcgg gtaaggacct ccccgagcca 1321 tccaagaggg ggccacccca tgggtggcca aagctctgcc cctgcctgag gtcaggcatt 1381 ggctcttctc aagctcttgg gccatctccg cctctctttg gctgaagctg cccgtgtagt 1441 ccccaaccgt gtctgtctgg cacgtgggtg tgttggtaaa cagtttggaa aagtggcgtg 1501 ggagccagcc tccctttgat gattattgga gccccagggg acaagggatt tgaggtgagg 1561 gttggcgctt agagaggaca atactggggt tggactgtaa gggattgaag ggggtacctt 1621 aagagacact ccaaacctga agtttttttg ctgctgcctc tttccctagg aaactcacac 1681 tcccctaggg ggagaagaag ccgagagcct tttgtgcaaa gccaaaacct tcgtcctttt 1741 aaaaacctag gtctccagtt ggctttactt taaaatgcca ataataaatg ccctcttctc 1801 gtgcctcccc accaccactt accactcgtg catccctgag acagggaggg aagaatgaac 1861 actccccatt aacagatgga aaaactgagg cttagagata gacaatcact acaagtcagc 1921 tccagctttc tgccatctag ccagcccctc ttccccaatg ctccatccca accaggcacc 1981 tcttccttga tgtttggggt ctttgtggta gcttatctta gaagcactac accttgcctt 2041 gctgtttgtc ctgagatgga aaagtgtcct tcttgctccc cctcaataga tctccagcgt 2101 cagctgctcc ctggcattca acaaatattc actggcccct actttgtggc aatctgtggg 2161 ctacatgctg gggtcaaggc agtagaactc caggccctcc tctcccatcc ttgatgcaag 2221 tgcaacctcg ctgagggcag actggggcat cctgtgccac taaactacat tgttcttatt 2281 ctggcatctt agacctccac acccgtgaga aatcctggag agggtatttt tgtagagtgt 2341 agactgtggc tagtgacaaa taaattagga ccaagaaagc tcactgtagc ttttaggaat 2401 aacttttaca cgaccatttg atagggaact ggggaatggg gtatggaagt tttcctacac 2461 ttgagagaaa aaataggata acaaaaatta aaagtctttt tttcctggtc cactgtgtta 2521 aggtcatttt taaccagctt gctttctaca ccaagagttt atgtttgttt aatggctgga 2581 aagagaatct tgagatcaaa aaaccaataa agatgtatct ctacaacggc tggtggagtg 2641 gtagagtgga aagagcattg ctttggaagt tggaacattt tagtttgaga tccagaacgt 2701 tacaaaggtg atatgtggac ttcgctgatc tgggcctcag tttccccatt tgcacacgat 2761 ggggttggac ttgattgtcc tgctgatgac atttccttgt ctggatagag taagacacta 2821 ctctctgaaa gggagaatgg tgtgcttaaa ttatttcttt cttagataga atcttcctga 2881 gccacgaggc ttaacactga aaattaaagg tttgggatgt aggaaagcct gctgaatcat 2941 tttctaacct accctttaac ctgaacctgt ttgtgagctt ctagttcact cacaggccac 3001 atggcctgga acaaaatgca acagattgca aacaatgagg cggggggtgg ggaaagtgat 3061 tggcagcaga gctcacccaa taggggctag gggctgggta agacagaatt ccaaacacag 3121 cgtaatcagc caatcatggg ctttggggcc aggagggctg aatggtcagg tttattaatg 3181 gagaaataat gcgattgtcc acacaatgga agccttcctg acaaaggggc tcaagcttcc 3241 tgatatgcaa agaagctgag aacggagctc ttcctttgcc gaggccgaga tccattaagg 3301 tcggacttct gtgtggaggc tgcaaaatgt gtggagcagg aggagacttt tctcccaatt 3361 gcccctctcc tggttaggtt aacctaagag accttcaagc cagtgaatga gaagggcgtg 3421 tccaggtgtc tccaggtctc tggtgttatg agccccatat ctgggacatt ctgctgccca 3481 gtctctgcct ctggtgcagg tagtttggaa atggtcgctt gtacctttgt gaagttcctg 3541 cagcttcgcc gacctatgat tacaaatcta accttctagt ccagggaagg aggtggggca 3601 ggcgacctat aaatgatgga tgactttaga aacccattga acccaggagc aaaatgctcc 3661 taagggaaac cctttccctc ccctctgtgg gtgaagaggg atgggttgta gccctccctt 3721 ctctgaatct tcagctgaaa gggatggcag aatagagagg tgggggaata ataggattta 3781 taacttgtga aaagtaacaa ttccccaagt gcaggctgtg ctgggcagga acaaagggca 3841 gctctgccca cagacccctc atttacaatt ctgatggggc atgaaagagc ccgactgggg 3901 aagatcttta tagctaaact ttgtcccagg ccggtagctc tttctctcca acccctccgt 3961 gggggagggg agagcctttg cagactgggg gctgttggct tgggtctgcc ttttgttctt 4021 atctaagcct tgctgtgcaa aaggaaattg gagaatattt tccttcttgc taatgtcccc 4081 tcctttcctt cactgtgccc ttaccacatt acaaatgaat cagctttctg ctcacctcga 4141 tttgtatata tctaaattgg aaaaatgtct cctaccttcc caagcaccag cgtagacagc 4201 taaagctgta gggtctatgt ttgtgtttct catgggatgt gtttcttctc ttgatctctt 4261 ttctcggaca gagaatgaag aaattgatgt tgtgacagta gagaagaggc agtctctggg 4321 tattcggaag ccggtcacca tcacggtgcg agcagacccc ctggatccct gcatgaagca 4381 tttccacatc tccatccatc agcaacagca caactatgct gcccgttttc ctccagaaag 4441 ctgctcccaa gaagaggctt cagagagggg tccccaagaa gaggttctgg agagagatgc 4501 tgcaggggaa aaggaagatg aggaggatga agagattgtg agtcccccac ctgtagaaag 4561 tgaggctgcc cagtcctgcc accccaaacc tgtcagttct gatactgagg atgtgaccaa 4621 gaggaagaat cacaacttcc tggagcgcaa gaggcggaat gacctgcgtt cgcgattctt 4681 ggcgctgagg gaccaggtgc ccaccctggc cagctgctcc aaggccccca aagtagtgat 4741 cctaagcaag gccttggaat acttgcaagc cctggtgggg gctgagaaga ggatggctac 4801 agagaaaaga cagctccgat gccggcagca gcagttgcag aaaagaattg catacctcag 4861 tggctactaa ctgaccaaaa agcctgacag ttctgtctta cgaagacaca agtttatttt 4921 ttaacctccc tctccccttt agtaatttgc acattttggt tatggtggga cagtctggac 4981 agtagatccc agaatgcatt gcagccggtg cacacacaat aaaggcttgc attcttggaa 5041 accttgaaac ccagctctcc ctcttccctg actcatggga gtgctgtatg ttctctggcg 5101 cctttggctt cccagcaggc agctgactga ggagccttgg ggtctgccta gctcactagc 5161 tctgaagaaa aggctgacag atgctatgca acaggtggtg gatgttgtca ggggctccag 5221 cctgcatgaa atctcacact ctgcatgagc tttaggctag gaaaggatgc tcccaactgg 5281 tgtctctggg gtgatgcaag gacagctggg cctggatgct ctccctgagg ctcctttttc 5341 cagaagacac acgagctgtc ttgggtgaag acaagcttgc agacttgatc aacattgacc 5401 attacctcac tgtcagacac tttacagtag ccaaggagtt ggaaaccttt atgtattatg 5461 atgttagctg acccccttcc tcccactccc aatgctgcga ccctgggaac acttaaaaag 5521 cttggcctct agattctttg tctcagagcc ctctgggctc tctcctctga gggagggacc 5581 tttctttcct cacaagggac ttttttgttc cattatgcct tgttatgcaa tgggctctac 5641 agcacccttt cccacaggtc agaaatattt ccccaagaca cagggaaatc ggtcctagcc 5701 tggggcctgg ggatagcttg gagtcctggc ccatgaactt gatccctgcc caggtgtttt 5761 ccgaggggca cttgaggccc agtcttttct caaggcaggt gtaagacact cagagggaga 5821 actgtactgc tgcctctttc ccaccttcct catctcaatc cttgagcggc aagtttgaag 5881 ttcttctgga accatgcaaa tctgtcctcc tcatgcaatt ccaaggagct tgctggctct 5941 gcagccacct ctgggcccct tccagcctgc catgaatcag atatctttcc cagaatctgg 6001 gcgtttctga agttttgggg agagctgttg ggactcatcc agtgctccag aaggtggact 6061 tgcttctggg gggttttaaa ggagcctcca ggagatatgc ttagccaacc atgatggatt 6121 ttaccccagc tggactcggc agctccaagt ggaatccacg tgcagcttct agtctgggaa 6181 agtcacccaa cctagcagtt gtcatgtggg taacctcagg cacctctaag cctgtcctgg 6241 aagaaggacc agcagcccct ccagaactct gcccaggaca gcaggtgcct gctggctctg 6301 ggtttggaag tttggggtgg gtagggggtg gtaagtacta tatatggctc tggaaaacca 6361 gctgctactt ccaaatctat tgtccataat ggtttctttc tgaggttgct tcttggcctc 6421 agaggacccc aggggatgtt tggaaatagc ctctctaccc ttctggagca tggtttacaa 6481 aagccagctg acttctggaa ttgtctatgg aggacagttt gggtgtaggt tactgatgtc 6541 tcaactgaat agcttgtgtt ttataagctg ctgttggcta ttatgctggg ggagtctttt 6601 ttttttatat tgtatttttg tatgcctttt gcaaagtggt gttaactgtt tttgtacaag 6661 gaaaaaaact cttggggcaa tttcctgttg caagggtctg atttattttg aaaggcaagt 6721 tcacctgaaa ttttgtattt agttgtgatt actgattgcc tgattttaaa atgttgcctt 6781 ctgggacatc ttctaataaa agatttctca aacatgtcag agtgggggca gcttatgcca 6841 cctgagtcct cctcaaccac ggaaaactat ttcagggtag ccacaagtga tccagagggc 6901 tgcacttctc taaccatgtt gctaacctgg tcattccact ctgggttcct gaaatgccat 6961 ttcagacatg ttgaaacaat gtaggctcag tactcagtga acacggaatt c // LOCUS HUMMYCC 10996 bp DNA PRI 25-JUL-1994 DEFINITION Human (Lawn) c-myc proto-oncogene, complete coding sequence and flanks. ACCESSION J00120 K01908 M23541 V00501 X00364 NID g515632 KEYWORDS Alu repeat; c-myc proto-oncogene; myc oncogene; proto-oncogene; repeat region; transforming gene. SOURCE Human DNA (genomic library of Lawn et al.), clones lambda-M1 [1], and pUC9-myc [2]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 3507 to 7559) AUTHORS Colby,W.W., Chen,E.Y., Smith,D.H. and Levinson,A.D. TITLE Identification and nucleotide sequence of a human locus homologous to the v-myc oncogene of avian myelocytomatosis virus MC29 JOURNAL Nature 301 (5902), 722-725 (1983) MEDLINE 83141777 REFERENCE 2 (bases 1 to 8082) AUTHORS Gazin,C., Dupont de Dinechin,S., Hampe,A., Masson,J.M., Martin,P., Stehelin,D. and Galibert,F. TITLE Nucleotide sequence of the human c-myc locus: provocative open reading frame within the first exon JOURNAL EMBO J. 3 (2), 383-387 (1984) MEDLINE 84182501 REFERENCE 3 (bases 8083 to 10996) AUTHORS Guilhot,S., Petridou,B., Syed-Hussain,S. and Galibert,F. TITLE Nucleotide sequence 3' to the human c-myc oncogene; Presence of a long inverted repeat JOURNAL Gene 72, 105-108 (1988) MEDLINE 89211899 COMMENT The myc gene is the cellular homologue of the transforming gene carried by the avian myelocytomatosis virus MC29. Unlike the ras proto-oncogenes which obtain transforming potential through mutations within their coding exons (namely mutations within codon 12), the myc gene identified as the cause of Burkitt lymphomas acquires its transforming potential through defects of either transcriptional or translational control. Thus it is not an altered gene product that induces tumors, but a normal product that is present either in the wrong quantity or at the wrong time in the life cycle of the cell. [2] notes an open reading frame upstream of the c-myc coding exons with an 'atg' start codon at bases 2304-2306 and a 'tag' stop codon at bases 2868-2870. However other researchers have used c-myc and v-myc DNA sequences to probe for mRNA's with homology to c-myc in various human cell lines and none of them have noted any mRNA's beginning upstream of bp 2328 (see other human c-myc entries). The t(8;14) translocation site in the Burkitt lymphoma cell line BL22 occurs between bp 1316 and 1317 of this sequence. See other human c-myc entries. FEATURES Location/Qualifiers source 1..10996 /organism="Homo sapiens" /db_xref="taxon:9606" /map="8q24" CDS 2304..2870 /note="ORF1" /codon_start=1 /db_xref="PID:g515633" /translation="MRGSGRLRTPELCCSRPPPPGPGRPWLPSCLEKGRASQRLGGKK NGGRDRAEYKSRFSGLYLTRCSNSSERQRERAGGRLGWKSRASRAALRASWEGRSGAN RGLRLWPSPPADPPASGPQPLPHPRNFAHSSGRALCTGTYNTRARTRLSRRGEAILPI WGHFPAAARTRFSERLSLQLLRRWIFFG" gene 2327..7657 /gene="MYC" exon 2327..2881 /gene="MYC" /note="alternative exon" /number=1 mRNA join(2327..2881,4506..5277,6654..>7216) /gene="MYC" exon 2502..2881 /gene="MYC" /note="alternative exon" /number=1 mRNA join(2502..2881,4506..5277,6654..>7216) /gene="MYC" intron 2882..4505 /gene="MYC" /number=1 exon 4506..5277 /gene="MYC" /number=2 CDS join(4521..5277,6654..7216) /gene="MYC" /codon_start=1 /product="c-myc protein" /db_xref="PID:g386965" /translation="MPLNVSFTNRNYDLDYDSVQPYFYCDEEENFYQQQQQSELQPPA PSEDIWKKFELLPTPPLSPSRRSGLCSPSYVAVTPFSLRGDNDGGGGSFSTADQLEMV TELLGGDMVNQSFICDPDDETFIKNIIIQDCMWSGFSAAAKLVSEKLASYQAARKDSG SPNPARGHSVCSTSSLYLQDLSAAASECIDPSVVFPYPLNDSSSPKSCASQDSSAFSP SSDSLLSSTESSPQGSPEPLVLHEETPPTTSSDSEEEQEDEEEIDVVSVEKRQAPGKR SESGSPSAGGHSKPPHSPLVLKRCHVSTHQHNYAAPPSTRKDYPAAKRVKLDSVRVLR QISNNRKCTSPRSSDTEENVKRRTHNVLERQRRNELKRSFFALRDQIPELENNEKAPK VVILKKATAYILSVQAEEQKLISEEDLLRKRREQLKHKLEQLRNSCA" intron 5278..6653 /gene="MYC" /number=2 repeat_region 6299..6466 /note="Alu-like repeat" /rpt_family="Alu" exon 6654..7216 /gene="MYC" /number=3 polyA_signal 7511..7516 /gene="MYC" polyA_signal 7652..7657 /gene="MYC" BASE COUNT 2747 a 2723 c 2733 g 2793 t ORIGIN 198 bp upstream of Sau96A site, on chromosome 8 (q24). 1 agcttgtttg gccgttttag ggtttgttgg aatttttttt tcgtctatgt acttgtgaat 61 tatttcacgt ttgccattac cggttctcca tagggtgatg ttcattagca gtggtgatag 121 gttaattttc accatctctt atgcggttga atagtcacct ctgaaccact ttttcctcca 181 gtaactcctc tttcttcgga ccttctgcag ccaacctgaa agaataacaa ggaggtggct 241 ggaaacttgt tttaaggaac cgcctgtcct tcccccgctg gaaaccttgc acctcggacg 301 ctcctgctcc tgcccccacc tgacccccgc cctcgttgac atccaggcgc gatgatctct 361 gctgccagta gagggcacac ttactttact ttcgcaaacc tgaacgcggg tgctgcccag 421 agagggggcg gagggaaaga cgctttgcag caaaatccag catagcgatt ggttgctccc 481 cgcgtttgcg gcaaaggcct ggaggcagga gtaatttgca atccttaaag ctgaattgtg 541 cagtgcatcg gatttggaag ctactatatt cacttaacac ttgaacgctg agctgcaaac 601 tcaacgggta ataacccatc ttgaacagcg tacatgctat acacacaccc ctttcccccg 661 aattgttttc tcttttggag gtggtggagg gagagaaaag tttacttaaa atgcctttgg 721 gtgagggacc aaggatgaga agaatgtttt ttgtttttca tgccgtggaa taacacaaaa 781 taaaaaatcc cgagggaata tacattatat attaaatata gatcatttca gggagcaaac 841 aaatcatgtg tggggctggg caactagctg agtcgaagcg taaataaaat gtgaatacac 901 gtttgcgggt tacatacagt gcactttcac tagtattcag aaaaaattgt gagtcagtga 961 actaggaaat taatgcctgg aaggcagcca aattttaatt agctcaagac tccccccccc 1021 ccccaaaaaa aggcacggaa gtaatactcc tctcctcttc tttgatcaga atcgatgcat 1081 tttttgtgca tgaccgcatt tccaataata aaaggggaaa gaggacctgg aaaggaatta 1141 aacgtccggt ttgtccgggg aggaaagagt taacggtttt tttcacaagg gtctctgctg 1201 actcccccgg ctcggtccac aagctctcca cttgcccctt ttaggaagtc cggtcccgcg 1261 gttcgggtac cccctgcccc tcccatattc tcccgtctag cacctttgat ttctcccaaa 1321 cccggcagcc cgagactgtt gcaaaccggc gccacagggc gcaaagggga tttgtctctt 1381 ctgaaacctg gctgagaaat tgggaactcc gtgtgggagg cgtgggggtg ggacggtggg 1441 gtacagactg gcagagagca ggcaacctcc ctctcgccct agcccagctc tggaacaggc 1501 agacacatct cagggctaaa cagacgcctc ccgcacgggg ccccacggaa gcctgagcag 1561 gcggggcagg aggggcggta tctgctgctt tggcagcaaa ttgggggact cagtctgggt 1621 ggaaggtatc caatccagat agctgtgcat acataatgca taatacatga ctccccccaa 1681 caaatgcaat gggagtttat tcataacgcg ctctccaagt atacgtggca atgcgttgct 1741 gggttatttt aatcattcta ggcatcgttt tcctccttat gcctctatca ttcctcccta 1801 tctacactaa catcccacgc tctgaacgcg cgcccattaa tacccttctt tcctccactc 1861 tccctgggac tcttgatcaa agcgcggccc tttccccagc cttagcgagg cgccctgcag 1921 cctggtacgc gcgtggcgtg gcggtgggcg cgcagtgcgt tctctgtgtg gagggcagct 1981 gttccgcctg cgatgattta tactcacagg acaaggatgc ggtttgtcaa acagtactgc 2041 tacggaggag cagcagagaa agggagaggg tttgagaggg agcaaaagaa aatggtaggc 2101 gcgcgtagtt aattcatgcg gctctcttac tctgtttaca tcctagagct agagtgctcg 2161 gctgcccggc tgagtctcct ccccaccttc cccaccctcc ccaccctccc cataagcgcc 2221 cctcccgggt tcccaaagca gagggcgtgg gggaaaagaa aaaagatcct ctctcgctaa 2281 tctccgccca ccggcccttt ataatgcgag ggtctggacg gctgaggacc cccgagctgt 2341 gctgctcgcg gccgccaccg ccgggccccg gccgtccctg gctcccctcc tgcctcgaga 2401 agggcagggc ttctcagagg cttggcggga aaaagaacgg agggagggat cgcgctgagt 2461 ataaaagccg gttttcgggg ctttatctaa ctcgctgtag taattccagc gagaggcaga 2521 gggagcgagc gggcggccgg ctagggtgga agagccgggc gagcagagct gcgctgcggg 2581 cgtcctggga agggagatcc ggagcgaata gggggcttcg cctctggccc agccctcccg 2641 ctgatccccc agccagcggt ccgcaaccct tgccgcatcc acgaaacttt gcccatagca 2701 gcgggcgggc actttgcact ggaacttaca acacccgagc aaggacgcga ctctcccgac 2761 gcggggaggc tattctgccc atttggggac acttccccgc cgctgccagg acccgcttct 2821 ctgaaaggct ctccttgcag ctgcttagac gctggatttt tttcgggtag tggaaaacca 2881 ggtaagcacc gaagtccact tgccttttaa tttatttttt tatcacttta atgctgagat 2941 gagtcgaatg cctaaatagg gtgtcttttc tcccattcct gcgctattga cacttttctc 3001 agagtagtta tggtaactgg ggctggggtg gggggtaatc cagaactgga tcggggtaaa 3061 gtgacttgtc aagatgggag aggagaaggc agagggaaaa cgggaatggt ttttaagact 3121 accctttcga gatttctgcc ttatgaatat attcacgctg actcccggcc ggtcggacat 3181 tcctgcttta ttgtgttaat tgctctctgg gttttggggg gctgggggtt gctttgcggt 3241 gggcagaaag ccccttgcat cctgagctcc ttggagtagg gaccgcatat cgcctgtgtg 3301 agccagatcg ctccgcagcc gctgacttgt ccccgtctcc gggagggcat ttaaatttcg 3361 gctcaccgca tttctgacag ccggagacgg acactgcggc gcgtcccgcc cgcctgtccc 3421 cgcggcgatt ccaacccgcc ctgatccttt taagaagttg gcatttggct ttttaaaaag 3481 caataataca atttaaaacc tgggtctcta gaggtgttag gacgtggtgt tgggtaggcg 3541 caggcagggg aaaagggagg cgaggatgtg tccgattctc ctggaatcgt tgacttggaa 3601 aaaccagggc gaatctccgc acccagccct gactcccctg ccgcggccgc cctcgggtgt 3661 cctcgcgccc gagatgcgga ggaactgcga ggagcggggc tctgggcggt tccagaacag 3721 ctgctaccct tggtggggtg gctccggggg aggtatcgca gcggggtctc tggcgcagtt 3781 gcatctccgt attgagtgcg aagggaggtg cccctattat tatttgacac cccccttgta 3841 tttatggagg ggtgttaaag cccgcggctg agctcgccac tccagccggc gagagaaaga 3901 agaaaagctg gcaaaaggag tgttggacgg gggcggtact gggggtgggg acgggggcgg 3961 tggagaggga aggttgggag gggctgcggt gccggcgggg gtaggagagc ggctagggcg 4021 cgagtgggaa cagccgcagc ggaggggccc cggcgcggag cggggttcac gcagccgcta 4081 gcgcccaggc gcctctcgcc ttctccttca ggtggcgcaa aactttgtgc cttggatttt 4141 ggcaaattgt tttcctcacc gccacctccc gcggcttctt aagggcgcca gggccgattt 4201 cgattcctct gccgctgcgg ggccgactcc cgggctttgc gctccgggct cccgggggag 4261 cgggggctcg gcgggcacca agccgctggt tcactaagtg cgtctccgag atagcagggg 4321 actgtccaaa gggggtgaaa gggtgctccc tttattcccc caccaagacc acccagccgc 4381 tttaggggat agctctgcaa ggggagaggt tcgggactgt ggcgcgcact gcgcgctgcg 4441 ccaggtttcc gcaccaagac ccctttaact caagactgcc tcccgctttg tgtgccccgc 4501 tccagcagcc tcccgcgacg atgcccctca acgttagctt caccaacagg aactatgacc 4561 tcgactacga ctcggtgcag ccgtatttct actgcgacga ggaggagaac ttctaccagc 4621 agcagcagca gagcgagctg cagcccccgg cgcccagcga ggatatctgg aagaaattcg 4681 agctgctgcc caccccgccc ctgtccccta gccgccgctc cgggctctgc tcgccctcct 4741 acgttgcggt cacacccttc tcccttcggg gagacaacga cggcggtggc gggagcttct 4801 ccacggccga ccagctggag atggtgaccg agctgctggg aggagacatg gtgaaccaga 4861 gtttcatctg cgacccggac gacgagacct tcatcaaaaa catcatcatc caggactgta 4921 tgtggagcgg cttctcggcc gccgccaagc tcgtctcaga gaagctggcc tcctaccagg 4981 ctgcgcgcaa agacagcggc agcccgaacc ccgcccgcgg ccacagcgtc tgctccacct 5041 ccagcttgta cctgcaggat ctgagcgccg ccgcctcaga gtgcatcgac ccctcggtgg 5101 tcttccccta ccctctcaac gacagcagct cgcccaagtc ctgcgcctcg caagactcca 5161 gcgccttctc tccgtcctcg gattctctgc tctcctcgac ggagtcctcc ccgcagggca 5221 gccccgagcc cctggtgctc catgaggaga caccgcccac caccagcagc gactctggta 5281 agcgaagccc gcccaggcct gtcaaaagtg ggcggctgga tacctttccc attttcattg 5341 gcagcttatt taacgggcca ctcttattag gaaggagaga tagcagatct ggagagattt 5401 gggagctcat cacctctgaa accttgggct ttagcgtttc ctcccatccc ttccccttag 5461 actgcccatg tttgcagccc ccctccccgt ttgtctccca cccctcagga atttcattta 5521 ggtttttaaa ccttctggct tatcttacaa ctcaatccac ttcttcttac ctcccgttaa 5581 cattttaatt gccctggggc ggggtggcag ggagtgtatg aatgaggata agagaggatt 5641 gatctctgag agtgaatgaa ttgcttccct cttaacttcc gagaagtggt gggatttaat 5701 gaactatcta caaaaatgag gggctgtgtt tagaggctag gcagggcctg cctgagtgcg 5761 ggagccagtg aactgcctca agagtgggtg ggctgaggag ctgggatctt ctcagcctat 5821 tttgaacact gaaaagcaaa tccttgccaa agttggactt ttttttttct tttattcctt 5881 cccccgccct cttggacttt tggcaaaact gcaatttttt tttttttatt tttcatttcc 5941 agtaaaatag ggagttgcta aagtcatacc aagcaatttg cagctatcat ttgcaacacc 6001 tgaagtgttc ttggtaaagt ccctcaaaaa taggaggtgc ttgggaatgt gctttgcttt 6061 gggtgtgtcc aaagcctcat taagtcttag gtaagaattg gcatcaatgt cctatcctgg 6121 gaagttgcac ttttcttgtc catgccataa cccagctgtc tttcccttta tgagactctt 6181 accttcatgg tgagaggagt aagggtggct ggctagattg gttctttttt tttttttttc 6241 cttttttaag acggagtctc actctgtcac taggctggag tgcagtggcg caatcaacct 6301 ccaaccccct ggttcaagag attctcctgc ctcagcctcc caagtagctg ggactacagg 6361 tgcacaccac catgccaggc taatttttgt aattttagta gagatggggt ttcatcgtgt 6421 tggccaggat ggtctctcct gacctcacga tccgcccacc tcggcctccc aaagtgctgg 6481 gattacaggt gtgagccagg gcaccaggct tagatgtggc tctttgggga gataattttg 6541 tccagagacc tttctaacgt attcatgcct tgtatttgta cagcattaat ctggtaattg 6601 attattttaa tgtaaccttg ctaaaggagt gatttctatt tcctttctta aagaggagga 6661 acaagaagat gaggaagaaa tcgatgttgt ttctgtggaa aagaggcagg ctcctggcaa 6721 aaggtcagag tctggatcac cttctgctgg aggccacagc aaacctcctc acagcccact 6781 ggtcctcaag aggtgccacg tctccacaca tcagcacaac tacgcagcgc ctccctccac 6841 tcggaaggac tatcctgctg ccaagagggt caagttggac agtgtcagag tcctgagaca 6901 gatcagcaac aaccgaaaat gcaccagccc caggtcctcg gacaccgagg agaatgtcaa 6961 gaggcgaaca cacaacgtct tggagcgcca gaggaggaac gagctaaaac ggagcttttt 7021 tgccctgcgt gaccagatcc cggagttgga aaacaatgaa aaggccccca aggtagttat 7081 ccttaaaaaa gccacagcat acatcctgtc cgtccaagca gaggagcaaa agctcatttc 7141 tgaagaggac ttgttgcgga aacgacgaga acagttgaaa cacaaacttg aacagctacg 7201 gaactcttgt gcgtaaggaa aagtaaggaa aacgattcct tctaacagaa atgtcctgag 7261 caatcaccta tgaacttgtt tcaaatgcat gatcaaatgc aacctcacaa ccttggctga 7321 gtcttgagac tgaaagattt agccataatg taaactgcct caaattggac tttgggcata 7381 aaagaacttt tttatgctta ccatcttttt tttttcttta acagatttgt atttaagaat 7441 tgtttttaaa aaattttaag atttacacaa tgtttctctg taaatattgc cattaaatgt 7501 aaataacttt aataaaacgt ttatagcagt tacacagaat ttcaatccta gtatatagta 7561 cctagtatta taggtactat aaaccctaat tttttttatt taagtacatt ttgcttttta 7621 aagttgattt ttttctattg tttttagaaa aaataaaata actggcaaat atatcattga 7681 gccaaatctt aagttgtgaa tgttttgttt cgtttcttcc ccctcccaac caccaccatc 7741 cctgtttgtt ttcatcaatt gccccttcag agggcggtct taagaaaggc aagagttttc 7801 ctctgttgaa atgggtctgg gggccttaag gtctttaagt tcttggaggt tctaagatgc 7861 ttcctggaga ctatgataac agccagagtt gacagttaga aggaatggca gaaggcaggt 7921 gagaaggtga gaggtaggca aaggagatac aagaggtcaa aggtagcagt taagtacaca 7981 aagaggcata aggactgggg agttgggagg aaggtgagga agaaactcct gttactttag 8041 ttaaccagtg ccagtcccct gctcactcca aacccaggaa ttctgcccag ttgatgggga 8101 cacggtggga accagcttct gctgccttca caaccaggcg ccagtcctgt ccatgggtta 8161 tctcgcaaac cccagaggat ctctgggagg aatgctacta ttaaccctat ttcacaaaca 8221 aggaaataga agagctcaaa gaggttatgt aacttatctg tagccacgca gataatacaa 8281 agcagcaatc tggacccatt ctgttcaaaa cacttaaccc ttcgctatca tgccttggtt 8341 catctgggtc taatgtgctg agatcaagaa ggtttaggac ctaatggaca gactcaagtc 8401 ataacaatgc taagctctat ttgtgtccca agcactccta agcattttat ccctaactct 8461 acatcaaccc catgaaggag atactgttga tttccccata ttagaagtag agagggaagc 8521 tgaggcacac aaagactcat ccacatgccc aagattcact gatagggaaa agtggaagcg 8581 agatttgaac ccaggctgtt tactcctaac ctgtccaagc cacctctcag acgacggtag 8641 gaatcagctg gctgcttgtg agtacaggag ttacagtcca gtgggttatg ttttttaagt 8701 ctcaacatct aagcctggtc aggcatcagt tccccttttt ttgtgattta ttttgttttt 8761 attttgttgt tcattgttta atttttcctt ttacaatgag aaggtcacca tcttgactcc 8821 taccttagcc atttgttgaa tcagactcat gacggctcct gggaagaagc cagttcagat 8881 cataaaataa aacatattta ttctttgtca tgggagtcat tattttagaa actacaaact 8941 ctccttgctt ccatcctttt ttacatactc atgacacatg ctcatcctga gtccttgaaa 9001 aggtattttt gaacatgtgt attaattata agcctctgaa aacctatggc ccaaaccaga 9061 aatgatgttg attatatagg taaatgaagg atgctattgc tgttctaatt acctcattgt 9121 ctcagtctca aagtaggtct tcagctccct gtactttggg attttaatct accaccaccc 9181 ataaatcaat aaataattac tttctttgac tctgactcct agaataatct attcaaaacc 9241 ttaatgtctt tttcttgatc cttcttttga gtcctaagta ccgccattac agcttcaaat 9301 tggcacgtca tataggcgaa tttcaaaggg agatgcaatc cacagaagta tagtagttca 9361 aagggttaca aaagcaaggc gctcttaaac agctcagtct ttgccccttt gtggcctagg 9421 gctggagtgc agctctgggg tgactcactt gggaatcggg aaggtgttag tctgaatcac 9481 taagtccagg caagccctca gaataggaga gagtgttcct agcaaggaaa acaactctcc 9541 attccaaata atcaggaaag aactttaggg atgtggagct tggctatggg aatagaaagg 9601 aaccattcca agtgcctatt aggccgctct tacctttact gagccagaga atggctctga 9661 aaacaggaca gatgccaact tccttcccga aagtcaggct gatcttgacc acaatacaaa 9721 ttggccctta gagcctatac agggagatcc caggggtctc tgccattgtg caacctattt 9781 tgtagataat aatcaagaat cggacgtgaa ggggaggagt ttgcaacttg gtcaggaatg 9841 tataagaagg aataagctaa ttctgactat gccctttatc catgacacta tccaggaatt 9901 aatgactctc ccagaggatt cctggaatga ttttgttgag ggatggaatg tataaagagg 9961 aaggaagtgt tattttatgc tgccatttgg aagcaacaaa ggagatcaac agtatgaaaa 10021 caatcaatca aatttgaaaa tgaacaaagt tttcacaatc ccagcctaat acttagagag 10081 ctcacagctt ggatgcataa gtaaagagtt ctctgctggt ctttaagaca aactctcaca 10141 caaaacttgg gaaaaaggac aaaaatgttg cattaggggg ttttctgtgg tttgtttgca 10201 ataactataa ttggctcaat caataattat tttttagtat acacactaag ggcccctgta 10261 gcattttttc ccatcgataa ataatcctta gtctagaaaa tgccgaggga tgttctccac 10321 ccttgtctat aaatgcactt ctagatgact ttataaaagg ctcctccttc aagtttatag 10381 aatattataa gactacatta aaggagaaga gaggccggtc gaggtagctc acacctgtaa 10441 tcccagcact ttgggaggcc gaagtgggcg gatcatgagg tcaggagatg gagaccatcc 10501 tggctaaaac agtgaaaccc cgtctctact aaaaatacaa aaaattagcc gggcgtggtg 10561 gcacagcctg cagtcccagc tactcaggag gctgaggcag gagaatcgct tgaacctggg 10621 aggcagaggt tacagtgagc tgagattgtg ccactgcatt ccagcctgga tgacacagcg 10681 agactccgtc tcaaaaataa ataaataata aataaataaa taaataaagg agaaaaagta 10741 aaaacaaagc cagtaggatg ggagcaagga cttattttta aaaataaaac taaaaagact 10801 ctgctaccta cctccaaagc cttagcaaaa agtctttttt ctagctcctt caggagagaa 10861 cttacaccaa ctctccattt agaggaaaac acccagaaat gctggctttg ccaaactggt 10921 tggagacgac tgtaaatgac tgtaagttga tttcattttt aattttattt tattcactat 10981 tagcttatac taagct // LOCUS HUMMYCPOB 2041 bp mRNA PRI 14-FEB-1996 DEFINITION Human c-myc-P64 mRNA, initiating from promoter P0, (HLmyc3.1) partial cds. ACCESSION M13930 NID g188965 KEYWORDS myc-P64 protein. SOURCE Homo sapiens (clone: HLmyc3.1.) promyelocytic leukemia cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2041) AUTHORS Bentley,D.L. and Groudine,M. TITLE Novel promoter upstream of the human c-myc gene and regulation of c-myc expression in B-cell lymphomas JOURNAL Mol. Cell. Biol. 6 (10), 3481-3489 (1986) MEDLINE 87089682 COMMENT Draft entry and printed copy of sequence [1] kindly provided by D.L.Bentley, 08-DEC-1986. There are four alternate transcription initiation sites at the P0 promoter. Two are annotated in the FEATURES table. THe other two are located 3-13 bp upsteam of position 1 (this entry). The c-myc-P64 transcript is usually initiated at promoters P1 or P2 ( also annotated in FEATURES table). The polyadenylation site in clone HLmyc3.1 is located 944 bp downstream of position 2041. [1] has also sequenced another mRNA from clone HLmyc2.5 that can be found under accession number M13929. FEATURES Location/Qualifiers source 1..2041 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HLmyc3.1." /cell_line="promyelocytic leukemia cell line HL-60" /tissue_type="promyelocytic leukemia" mRNA 36..>2041 /note="P0 mRNA (alt.)" CDS 40..384 /note="ORF 114" /codon_start=1 /db_xref="PID:g188966" /translation="MRCWVILIILGIVFLLMPLSFLPIYTNIPRSERAPINTLLSSTL PGTLDQSAALSPALARRPAAWYARGVAVGAQCVLGVEGSCSACDDLYSQDKDAVCQTV LLRRSSRERERV" mRNA 55..>2041 /note="P0 mRNA (alt.)" CDS 612..1178 /note="ORF 118" /codon_start=1 /db_xref="PID:g188967" /translation="MRGSGRLRTPELCCSRPPPPGPGRPWLPSCLEKGRASQRLGGKK NGGRDRAEYKSRFSGLYLTRCSNSSERQRERAGGRLGWKSRASRAALRASWEGRSGAN RGLRLWPSPPADPPASGPQPLPHPRNFAHSSGRALCTGTYNTRARTRLSRRGEAILPI WGHFPAAARTRFSERLSLQLLRRWIFFG" mRNA 639..>2041 /note="P1 mRNA" mRNA 798..>2041 /note="P2 mRNA" CDS 1205..1978 /codon_start=1 /product="truncated c-myc-P64 protein" /db_xref="PID:g188968" /translation="MPLNVSFTNRNYDLDYDSVQPYFYCDEEENFYQQQQQSELQPPA PSEDIWKKFELLPTPPLSPSRRSGLCSPSYVAVTPFSLRGDNDGGGGSFSTADQLEMV TELLGGDMVNQSFICDPDDETFIKNIIIQDCMWSGFSAAAKLVSEKLASYQAARKDSG SPNPARGHSVCSTSSLYLQDLSAAASECIDPSVVFPYPLNDSSSPKSCASQDSSAFSP SSDSLLSSTESSPQGSPEPLVLHEETPPTTSSDSGGTRR" BASE COUNT 402 a 668 c 578 g 393 t ORIGIN 65 bp upstream of SfaNI site. 1 ggagtttatt cataacgcgc tctccaagta tacgtggcaa tgcgttgctg ggttatttta 61 atcattctag gcatcgtttt cctccttatg cctctatcat tcctccctat ctacactaac 121 atcccacgct ctgaacgcgc gcccattaat acccttcttt cctccactct ccctgggact 181 cttgatcaaa gcgcggccct ttccccagcc ttagcgaggc gccctgcagc ctggtacgcg 241 cgtggcgtgg cggtgggcgc gcagtgcgtt ctcggtgtgg agggcagctg ttccgcctgc 301 gatgatttat actcacagga caaggatgcg gtttgtcaaa cagtactgct acggaggagc 361 agcagagaaa gggagagggt ttgagaggga gcaaaagaaa atggtaggcg cgcgtagtta 421 attcatgcgg ctctcttact ctgtttacat cctagagcta gagtgctcgg ctgcccggct 481 gagtctcctc cccaccttcc ccaccctccc caccctcccc ataagcgccc tcccgggttc 541 ccaaagcaga gggcgtgggg gaaaagaaaa aagatcctct ctcgctaatc tccgcccacc 601 ggccctttat aatgcgaggg tctggacggc tgaggacccc cgagctgtgc tgctcgcggc 661 cgccaccgcc gggccccggc cgtccctggc tcccctcctg cctcgagaag ggcagggctt 721 ctcagaggct tggcgggaaa aagaacggag ggagggatcg cgctgagtat aaaagccggt 781 tttcggggct ttatctaact cgctgtagta attccagcga gaggcagagg gagcgagcgg 841 gcggccggct agggtggaag agccgggcga gcagagctgc gctgcgggcg tcctgggaag 901 ggagatccgg agcgaatagg gggcttcgcc tctggcccag ccctcccgct gatcccccag 961 ccagcggtcc gcaacccttg ccgcatccac gaaactttgc ccatagcagc gggcgggcac 1021 tttgcactgg aacttacaac acccgagcaa ggacgcgact ctcccgacgc ggggaggcta 1081 ttctgcccat ttggggacac ttccccgccg ctgccaggac ccgcttctct gaaaggctct 1141 ccttgcagct gcttagacgc tggatttttt tcgggtagtg gaaaaccagc agcctcccgc 1201 gacgatgccc ctcaacgtta gcttcaccaa caggaactat gacctcgact acgactcggt 1261 gcagccgtat ttctactgcg acgaggagga gaacttctac cagcagcagc agcagagcga 1321 gctgcagccc ccggcgccca gcgaggatat ctggaagaaa ttcgagctgc tgcccacccc 1381 gcccctgtcc cctagccgcc gctccgggct ctgctcgccc tcctacgttg cggtcacacc 1441 cttctccctt cggggagaca acgacggcgg tggcgggagc ttctccacgg ccgaccagct 1501 ggagatggtg accgagctgc tgggaggaga catggtgaac cagagtttca tctgcgaccc 1561 ggacgacgag accttcatca aaaacatcat catccaggac tgtatgtgga gcggcttctc 1621 ggccgccgcc aagctcgtct cagagaagct ggcctcctac caggctgcgc gcaaagacag 1681 cggcagcccg aaccccgccc gcggccacag cgtctgctcc acctccagct tgtacctgca 1741 ggatctgagc gccgccgcct cagagtgcat cgacccctcg gtggtcttcc cctaccctct 1801 caacgacagc agctcgccca agtcctgcgc ctcgcaagac tccagcgcct tctctccgtc 1861 ctcggattct ctgctctcct cgacggagtc ctccccgcag ggcagccccg agcccctggt 1921 gctccatgag gagacaccgc ccaccaccag cagcgactct ggaggaacaa gaagatgagg 1981 aagaaatcga tgttgtttct gtggaaaaga ggcaggctcc tggcaaaagg tcagagtctg 2041 g // LOCUS HUMMYL2AI 593 bp mRNA PRI 07-JAN-1995 DEFINITION HUMMLC2At; Homo sapiens; ; 593 base-pairs. ACCESSION M94547 NID g189010 KEYWORDS MYL2 gene; myosin light chain. SOURCE Homo sapiens adult atrial muscle cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 593) AUTHORS Hailstones,D., Barton,P., Chan-Thomas,P., Sasse,S., Sutherland,C., Hardeman,E. and Gunning,P. TITLE Differential regulation of the atrial isoforms of the myosin light chains during striated muscle development JOURNAL J. Biol. Chem. 267 (32), 23295-23300 (1992) MEDLINE 93054664 FEATURES Location/Qualifiers source 1..593 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="myocyte" /dev_stage="adult" /tissue_type="atrial muscle" /map="Unassigned" mRNA <1..593 /gene="MYL2" /note="G00-128-829" 5'UTR 1..12 /gene="MYL2" /note="G00-128-829" gene 1..593 /gene="MYL2" CDS 13..540 /gene="MYL2" /codon_start=1 /db_xref="GDB:G00-128-829" /product="atrial myosin light chain" /db_xref="PID:g189011" /translation="MASRKAGTRGKVAATKQAQRGSSNVFSMFEQAQIQEFKEAFSCI DQNRDGIICKADLRETYSQLGKVSVPEEELDAMLQEGKGPINFTVFLTLFGEKLNGTD PEEAILSAFRMFDPSGKGVVNKDEFKQLLLTQADKFSPAEVEQMFALTPMDLAGNIDY KSLCYIITHGDEKEE" 3'UTR 537..593 /gene="MYL2" /note="G00-128-829" polyA_signal 570..576 /gene="MYL2" /note="G00-128-829" BASE COUNT 152 a 158 c 182 g 101 t ORIGIN 1 ggcacgagga gaatggccag caggaaggcg gggacccggg gcaaggtggc agccaccaag 61 caggcccaac gtggttcttc caacgtcttt tccatgtttg aacaagccca gatacaggag 121 ttcaaagaag ccttcagctg tatcgaccag aatcgtgatg gcatcatctg caaggcagac 181 ctgagggaga cctactccca gctggggaag gtgagtgtcc cagaggagga gctggacgcc 241 atgctgcaag agggcaaggg ccccatcaac ttcaccgtct tcctcacgct ctttggggag 301 aagctcaatg ggacagaccc cgaggaagcc atcctgagtg ccttccgcat gtttgacccc 361 agcggcaaag gggtggtgaa caaggatgag ttcaagcagc ttctcctgac ccaggcagac 421 aagttctctc cagctgaggt ggagcagatg ttcgccctga cacccatgga cctggcgggg 481 aacatcgact acaagtcact gtgctacatc atcacccatg gagacgagaa agaggaatga 541 ggggcagggc aggccacggg ggggcacctc aataaactct gttgcaaaat tgg // LOCUS HUMMYL5 661 bp mRNA PRI 07-JAN-1995 DEFINITION Human regulatory myosin light chain (MYL5) mRNA, complete cds. ACCESSION L03785 NID g189012 KEYWORDS myosin light chain; myosin regulatory light chain. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 661) AUTHORS Collins,C., Schappert,K. and Hayden,M.R. TITLE The genomic organization of a novel regulatory myosin light chain gene (MYL5) that maps to chromosome 4p16.3 and shows different patterns of expression between primates JOURNAL Hum. Mol. Genet. 1 (9), 727-733 (1992) MEDLINE 93258315 FEATURES Location/Qualifiers source 1..661 /organism="Homo sapiens" /db_xref="taxon:9606" gene 106..627 /gene="MYL5" CDS 106..627 /gene="MYL5" /codon_start=1 /product="myosin regulatory light chain" /db_xref="PID:g189013" /translation="MASRKTKKKEGGALRAQRASSNVFSNFEQTQIQEFKEAFTLMDQ NRDGFIDKEDLKDTYASLGKTNVKDDELDAMLKEASGPINFTMFLNLFGEKLSGTDAE ETILNAFKMLDPDGKGKINKEYIKRLLMSQADKMTAEEVDQMFQFASIDVAGNLDYKA LSYVITHGEEKEE" BASE COUNT 179 a 171 c 208 g 103 t ORIGIN 1 ggagtggcag ccggagtctg aactgtcctg ggggaccaag caggagctta agatgggcaa 61 gacctggggc cctgggcaga cgcatcaaag caggcagaag caggcatggc cagcaggaag 121 accaagaaga aggaaggggg tgccctccgg gcccagagag cctcatccaa tgtcttctcc 181 aactttgagc agactcagat ccaggagttc aaggaggcat tcacactcat ggatcagaac 241 cgagatggct tcattgacaa ggaggacctg aaggacacct atgcctccct gggcaagacc 301 aacgtcaagg acgacgagct ggacgccatg ctcaaagagg cctcggggcc catcaacttc 361 accatgtttc tgaacctgtt tggggagaag ctgagcggta ccgacgccga ggagaccatt 421 cttaacgcct tcaagatgct ggacccggac gggaaaggga aaatcaacaa ggagtacatc 481 aagcgtctgc tgatgtccca ggctgacaag atgacggcgg aagaggtgga ccagatgttc 541 cagttcgcct ccatcgatgt ggcgggcaac ctggactaca aggcgctcag ctacgtgatc 601 acccacgggg aggagaagga ggagtgagac ccagccgggt caataaacct ggacgcttgg 661 a // LOCUS HUMMZF1 2678 bp mRNA PRI 07-JAN-1995 DEFINITION Human zinc finger protein 42 (MZF-1) mRNA, complete cds. ACCESSION M58297 NID g189043 KEYWORDS zinc finger protein. SOURCE Human adult blood myeloid cell from Patient S, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2678) AUTHORS Hromas,R., Collins,S.J., Hickstein,D., Raskind,W., Deaven,L.L., O'Hara,P., Hagen,F.S. and Kaushansky,K. TITLE A retinoic acid-responsive human zinc finger gene, MZF-1, preferentially expressed in myeloid cells JOURNAL J. Biol. Chem. 266 (22), 14183-14187 (1991) MEDLINE 91317761 FEATURES Location/Qualifiers source 1..2678 /organism="Homo sapiens" /isolate="patient S" /db_xref="taxon:9606" /cell_type="myeloid" /dev_stage="adult" /tissue_type="blood" /map="Unassigned" gene 1091..2548 /gene="ZNF42" CDS 1091..2548 /gene="ZNF42" /codon_start=1 /db_xref="GDB:G00-125-898" /product="zinc finger protein 42" /db_xref="PID:g189044" /translation="MNGPLVYAGFALQLGSISAGPGSVSPHLHVPWDLGMAGLSGQIQ SPSREGGFAHRVLLPSDLRSEQDPTDEDPCRGVGPALITTRWRSPRGRSRGRPSTGGG VVRGGRCDVCGKVFSQRSNLLRHQKIHTGERPFVCSECGRSFSRSSHLLRHQLTHTEE RPFVCGDCGQGFVRSARLEEHRRVHTGEQPFRCAECGQSFRQRSNLLQHQRIHGDPPG PGAKPPAPPGAPEPPGPFPCSECRESFARRAVLLEHQAVHTGDKSFGCVECGERFGRR SVLLQHRRVHSGERPFACAECGQSFRQRSNLTQHRRIHTGERPFACAECGKAFRQRPT LTQHLRVHTGEKPFACPECGQRFSQRLKLTRHQRTHTGEKPYHCGECGLGFTQVSRLT EHQRIHTGERPFACPECGQSFRQHANLTQHRRIHTGERPYACPECGKAFRQRPTLTQH LRTHRREKPFACQDCGRRFHQSTKLIQHQRVHSAE" BASE COUNT 469 a 882 c 721 g 606 t ORIGIN 19q13.2-4. 1 cggccggcca atacatagga acacttgggt ccctgcagtc agggtgtgga aatggcagat 61 gagttcagcc ctaaggtgca tttttcttac taggaggaga tggagtgtat tttatgggat 121 ataagcatta gctacatttc ctgtcctgtt cacatccttt gcccatgtgt ctatgaggtt 181 attgatcttc ttactgattt attgtagctc tttacttagg aggttaatta gccttttgcc 241 tgtggagagt tttttggttt gccatttgtc cttttttaat tttttttgtt ttttggccat 301 ttgtcttttg actccgatgt ggtttttgct gatttccttt gatgtattct agtttatctg 361 acttttcttt ggcgacttat ggactttctc tcaccactaa aagccctcac tgctctctca 421 gtcttcttga tttaacctcc tccaggcttc cgccttctcc aggccctgat tctcagttgg 481 agttgctggt gcctcctcct tcacccagcg tctgacgctg gagtgctcac agtgtggctg 541 ggacccactt ctctcctctg tagataccca cccctgtgtt gatcacttgc aggcccgggt 601 tctgtgtgcc atgtgtatgc cctagagccc ttgctcacgt ttccccacag ccttcatgaa 661 gtctgtgttc ctcagatgcc ccacagacat cacaagcaag gcacatccaa accccagacc 721 actatccagg agcctgcacc ctctttctgt tggctccacc tccagcctcc gagacccacc 781 cacttccctg catttgctga gaccatcatt ttccacctag acaatgcccc cacgcttgcc 841 ctacagccct tccaaaaacg attttttcca acttaaatca gactagaaag ctttttcaca 901 tagcccagtc ttcctccttg tgctgggttc tgtctcatta tcacctcatc agggaagtct 961 gtacagatag aatccctacc cctgcatttg tcgcctccgt ctgcctcttt ggtcagtttc 1021 aggtccctgt agttcacact gtgtccccag ggatgaagtg ggtcccggca cggtgggcat 1081 tctgtcatga atgaatggtc cccttgtgta tgcagggttc gcgctgcagc taggcagcat 1141 ctccgcaggt ccaggtagtg taagccctca cctccacgtc ccctgggacc tcggcatggc 1201 tggcctttct ggccagatcc aatcaccctc ccgcgaaggt ggctttgcgc atcgcgttct 1261 gctccccagc gatctgagga gtgaacagga ccccacggac gaggatccct gccggggtgt 1321 gggccctgct ctgatcacca cccgctggcg ctcccccagg ggccggagcc ggggccgccc 1381 cagcactggg ggcggggtgg ttaggggcgg ccgttgcgat gtatgtggca aggtgttcag 1441 ccaacgcagc aacctgctga ggcaccagaa gatccacacg ggtgagcgac cattcgtgtg 1501 cagcgagtgc ggccgcagct tcagccgcag ctcgcacctg ctgcgccacc agcttacgca 1561 caccgaggag cggccgttcg tgtgcggcga ctgtggccag ggcttcgtgc gcagcgcgcg 1621 cctggaagag catcggagag tgcacacggg cgaacagcct ttccgttgcg ctgagtgcgg 1681 ccagagcttc cggcagcgct ccaatctgct gcagcaccag cgcatccacg gcgatccccc 1741 gggccctggc gctaagcccc cggcccctcc tggtgcgccc gagcctcccg gcccctttcc 1801 gtgcagcgag tgccgcgaga gcttcgcgcg gcgcgccgtg ctgctggagc accaggcggt 1861 acacacgggc gacaagtcct ttggctgcgt cgagtgcggc gagcgcttcg gccgccgctc 1921 agtgctgctg cagcaccggc gcgtgcacag tggcgagcgg cccttcgcct gtgccgagtg 1981 cggccagagc ttccggcagc gctccaacct gacgcagcac cggcgcatcc acaccgggga 2041 gcggcccttc gcctgcgccg agtgtggcaa ggccttccgc cagcggccta cgctcacgca 2101 gcatctccgc gtacacacgg gcgagaaacc ctttgcctgc cccgagtgtg gccagcgctt 2161 cagccagcgc ctcaagctca cgcgtcatca gaggacacac accggcgaaa agccctacca 2221 ctgcggtgag tgcggcctgg gcttcacgca ggtctcgcgg ctcaccgagc accagcgcat 2281 ccacacgggc gaacggccct tcgcctgccc cgagtgcggc cagagctttc ggcagcacgc 2341 caacctcacc cagcaccggc gcatccacac gggtgaacgg ccctacgcat gccctgagtg 2401 tggcaaggcc ttccgccagc ggcccacgct cacgcagcat ctgcgcaccc accgacgaga 2461 gaagcccttc gcctgccagg actgtggccg ccgcttccac cagagcacca agctcattca 2521 gcaccagcgc gtccacagcg ccgagtagct ccagccggga cgcactgtgt ccgccatggt 2581 cctcccctgg ttattgtgag gctggcgatt acataagtat aagcaggtcg cccagggctt 2641 ggctactgta ggtgtccaat aaacagtaga tggaaacc // LOCUS HUMNACH 5389 bp mRNA PRI 07-JAN-1995 DEFINITION Human voltage-gated sodium channel mRNA, complete cds. ACCESSION M91556 NID g189046 KEYWORDS Na+ channel; sodium channel. SOURCE Homo sapiens (tissue library: of M.Tamkun) ventricular heart muscle cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5389) AUTHORS George,A.L. Jr., Knittle,T.J. and Tamkun,M.M. TITLE Molecular cloning of an atypical voltage-gated sodium channel expressed in human heart and uterus: evidence for a distinct gene family JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (11), 4893-4897 (1992) MEDLINE 92279233 FEATURES Location/Qualifiers source 1..5389 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="ventricular heart muscle" /tissue_lib="of M.Tamkun" gene 140..5188 /gene="Na+ channel" CDS 140..5188 /gene="Na+ channel" /codon_start=1 /product="Na+ channel" /db_xref="PID:g189047" /translation="MLASPEPKGLVPFTKESFELIKQHIAKTHNEDHEEEDLKPNPDL EVGKKLPFIYGNLSQGMVSEPLEDVDPYYYKKKNTFIVLNKNRTIFRFNAASILCTLS PFNCIRRTTIKVLVHPFFQLFILISVLIDCVFMSLTNLPKWRPVLENTLLGIYTFEIL VKLFARGVWAGSFSFLGDPWNWLDFSVTVFEVIIRYSPLDFIPTLQTARTLRILKIIP LNQGLKSLVGVLIHCLKQLIGVIILTLFFLSIFSLIGMGLFMGNLKHKCFRWPQENEN ETLHNRTGNPYYIRETENFYYLEGERYALLCGNRTDAGQCPEGYVCVKAGINPDQGFT NFDSFGWALFALFRLMAQDYPEVLYHLILYASGKVYMIFFVVVSFLFSFYMASLFLGI LAMAYEEEKQRVGEISKKIEPKFQQTGKELQEGNETDEAKTIQIEMKKRSPISTDTSL DVLEDATLRHKEELEKSKKICPLYWYKFAKTFLIWNCSPCWLKLKEFVHRIIMAPFTD LFLIICIILNVCFLTLEHYPMSKQTNTLLNIGNLVFIGIFTAEMIFKIIAMHPYGYFQ VGWNIFDSMIVFHGLIELCLANVAGMALLRLFRMLRIFKLGKYWPTFQILMWSLSNSW VALKDLVLLLFTFIFFSAAFGMKLFGKNYEEFVCHIDKDCQLPRWHMHDFFHSFLNVF RILCGEWVETLWDCMEVAGQSWCIPFYLMVILIGNLLVLYLFLALVSSFSSCKDVTAE ENNEAKNLQLAVARIKKGINYVLLKILCKTQNVPKDTMDHVNEVYVKEDISDHTLSEL SNTQDFLKDKEKSSGTEKNATENESQSLIPSPSVSETVPIASGESDIENLDNKEIQSK SGDGGSKEKIKQSSSSECSTVDIAISEEEEMFYGGERSKHLKNGCRRGSSLGQISGAS KKGKIWQNIRKTCCKIVENNWFKCFIGLVTLLSTGTLAFEDIYIDQRKTIKILLEYAD MIFTYIFILEMLLKWMAYGFKAYFSNGWYRLDFVVVIVFCLSLIGKTREELKPLISMK FLRPLRVLSQFERMKVVVRALIKTTLPTLNVFLVCLMIWLIFSIMGVDLFAGRFYECI DPTSGERFPSSEVMNKSRCESLLFNESMLWENAKMNFDNVGNGFLSLLQVATFNGWIT IMNSAIDSVAVNIQPHFEVNIYMYCYFINFIIFGVFLPLSMLITVIIDNFNKHKIKLG GSNIFITVKQRKQYRRLKKLMYEDSQRPVPRPLNKLQGFIFDVVTSQAFNVIVMVLIC FQAIAMMIDTDVQSLQMSIALYWINSIFVMLYTMECILKLIAFRCFYFTIAWNIFDFM VVIFSITGLCLPMTVGSYLVPPSLVQLILLSRIIHMLRLGKGPKVFHNLMLPLMLSLP ALLNIILLIFLVMFIYAVFGMYNFAYVKKEAGINDVSNFETFGNSMLCLFQVAIFAGW DGMLDAIFNSKWSDCDPDKINPGTQVRGDCGNPSVGIFYFVSYILISWLIIVNMYIVV VMEFLNIASKKKNKTLSEDDFRKFFQVWKRFDPDRTQYIDSSKLSDFAAALDPPLFMA KPNKGQLIALDLPMAVGDRIHCLDILLAFTKRVMGQDVRMEKVVSEIESGFLLANPFK ITCEPITTTLKRKQEAVSATIIQRAYKNYRLRRNDKNTSDIHMIDGDRDVHATKEGAY FDKAKEKSPIQSQI" BASE COUNT 1642 a 961 c 1074 g 1712 t ORIGIN 1 gcggccgcgc aagtcctcca ccatgtgaat gccaacatgg ccaggtcatt agagctgagg 61 gaaaactagt gcccaaagat atgaaaagag tgtggatctt ctggagaagt gctgttgttc 121 aacaggtaca aaattggaaa tgttggcttc accagaacct aagggccttg ttcccttcac 181 taaagagtct tttgaactta taaaacagca tattgctaaa acacataatg aagaccatga 241 agaagaagac ttaaagccaa atcctgattt ggaagttggc aaaaagcttc catttattta 301 tggaaacctt tctcaaggaa tggtgtcaga gcccttggaa gatgtggacc catattacta 361 caagaaaaaa aatactttca tagtattaaa taaaaataga acaatcttca gattcaatgc 421 ggcttccatc ttgtgtacat tgtctccttt caattgtatt agaagaacaa ctatcaaggt 481 tttggtacat ccctttttcc aactgtttat tctaattagt gtcctgattg attgcgtatt 541 catgtccctg actaatttgc caaaatggag accagtatta gagaatactt tgcttggaat 601 ttacacattt gaaatacttg taaaactctt tgcaagaggt gtctgggcag gatcattttc 661 cttcctcggt gatccatgga actggctcga tttcagcgta actgtgtttg aggttattat 721 aagatactca cctctggact tcattccaac gcttcaaact gcaagaactt tgagaatttt 781 aaaaattatt cctttaaatc aaggtctgaa atcccttgta ggggtcctga tccactgctt 841 gaagcagctt attggtgtca ttatcctaac tctgtttttt ctgagcatat tttctctaat 901 tgggatgggg ctcttcatgg gcaacttgaa acacaaatgt tttcgatggc cccaagagaa 961 tgaaaatgaa accctgcaca acagaactgg aaacccatat tatattcgag aaacagaaaa 1021 cttttattat ttggaaggag aaagatatgc tctcctttgt ggcaacagga cagatgctgg 1081 tcagtgtcct gaaggatatg tgtgtgtaaa agctggcata aatcctgatc aaggcttcac 1141 aaattttgac agttttggct gggccttatt tgccctattt cggttaatgg ctcaggatta 1201 ccctgaagta ctttatcacc tgatacttta tgcttctggg aaggtctaca tgatattttt 1261 tgtggtggta agttttttgt tttcctttta tatggcaagt ttgttcttag gcatacttgc 1321 catggcctat gaagaagaaa agcagagagt tggtgaaata tctaagaaga ttgaaccaaa 1381 atttcaacag actggaaaag aacttcaaga aggaaatgaa acagatgagg ccaagaccat 1441 acaaatagaa atgaagaaaa ggtcaccaat ttccacagac acatcattgg atgtgttgga 1501 agatgctact ctcagacata aggaagaact tgaaaaatcc aagaagatat gcccattata 1561 ctggtataag tttgctaaaa ctttcttgat ctggaattgt tctccctgtt ggttaaaatt 1621 gaaagagttt gtccatagga ttataatggc accatttact gatcttttcc ttatcatatg 1681 cataatttta aacgtatgtt ttctgacctt ggagcattat ccaatgagta aacaaactaa 1741 cactcttctc aacattggaa acctggtttt cattggaatt ttcacagcag aaatgatttt 1801 taaaataatt gcaatgcatc catatgggta tttccaagta ggttggaaca tttttgatag 1861 catgatagtg ttccatggtt taatagaact ttgtctagca aatgttgcag gaatggctct 1921 tcttcgatta ttcaggatgt taagaatttt caagttggga aagtattggc caacattcca 1981 gattttgatg tggtctctta gtaactcatg ggtggccctg aaagacttgg tcctgttgtt 2041 gttcacattc atcttctttt ctgctgcatt cggcatgaag ctgtttggta agaattatga 2101 agaatttgtc tgccacatag acaaagactg tcaactccca cgctggcaca tgcatgactt 2161 tttccactcc ttcctgaatg tgttccgaat tctctgtgga gagtgggtag agaccttgtg 2221 ggactgtatg gaggttgcag gccaatcctg gtgtattcct ttttacctga tggtcatttt 2281 aattggaaat ttactggtac tttacctgtt tctggcattg gtgagctcat ttagttcatg 2341 caaggatgta acagctgaag agaataatga agcaaaaaat ctccagcttg cagtggcaag 2401 aattaaaaaa ggaataaact atgtgcttct taaaatacta tgcaaaacac aaaatgtccc 2461 aaaggacaca atggaccatg taaatgaggt atatgttaaa gaagatattt ctgaccatac 2521 cctttctgaa ttgagcaaca cccaagattt tctcaaagat aaggaaaaaa gcagtggcac 2581 agagaaaaac gctactgaaa atgagagcca atcacttatc cccagtccta gtgtctcaga 2641 aactgtacca attgcttcag gagaatctga tatagaaaat ctggataata aggagattca 2701 gagtaagtct ggtgatggag gcagcaaaga gaaaataaag caatctagct catctgaatg 2761 cagtactgtt gatattgcta tctctgaaga agaagaaatg ttctatggag gtgaaagatc 2821 aaagcatctg aaaaatggtt gcagacgcgg atcttcactt ggtcaaatca gtggagcatc 2881 caagaaagga aaaatctggc agaacatcag gaaaacctgc tgcaagattg tagagaacaa 2941 ttggtttaag tgttttattg ggcttgttac tctgctcagc actggcactc tggcttttga 3001 agatatatat attgatcaga gaaagacaat taaaatttta ttagaatatg ctgacatgat 3061 ctttacttat atcttcattc tggaaatgct tctaaaatgg atggcatatg gttttaaggc 3121 ctatttctct aatggctggt acaggctgga cttcgtggtt gttattgtgt tttgtcttag 3181 cttaataggc aaaactcggg aagaactaaa acctcttatt tccatgaaat tccttcggcc 3241 cctcagagtt ctatctcaat ttgaaagaat gaaggtggtt gtgagagctt tgatcaaaac 3301 aaccttaccc actttgaatg tgtttcttgt ctgcctgatg atctggctga tttttagtat 3361 catgggagta gacttatttg ctggcagatt ctatgaatgc attgacccaa caagtggaga 3421 aaggtttcct tcatctgaag tcatgaataa gagtcggtgt gaaagccttc tgtttaacga 3481 atccatgcta tgggaaaatg caaaaatgaa ctttgataat gttggaaatg gtttcctttc 3541 tctgcttcaa gtagcaacat ttaatggatg gatcactatt atgaattcag caattgattc 3601 tgttgctgtt aatatacagc ctcattttga agtcaacatc tacatgtatt gttactttat 3661 caactttatt atatttggag tatttctccc tctgagtatg ctgattactg ttattattga 3721 taatttcaac aagcataaaa taaagctggg aggctcaaat atctttataa cggttaaaca 3781 gagaaaacag taccgcaggc tgaagaagct aatgtatgag gattctcaaa gaccagtacc 3841 tcgcccatta aacaagctcc aaggattcat ctttgatgtg gtaacaagcc aagcttttaa 3901 tgtcattgtt atggttctta tatgtttcca agcaatagcc atgatgatag acactgatgt 3961 tcagagtcta caaatgtcca ttgctctcta ctggattaac tcaatttttg ttatgctata 4021 tactatggaa tgtatactga agctcatcgc tttccgttgt ttttatttca ccattgcgtg 4081 gaacattttt gattttatgg tggttatttt ctccatcaca ggactatgtc tgcctatgac 4141 agtaggatcc taccttgtgc ctccttcact tgtgcaactg atacttctct cacggatcat 4201 tcacatgctg cgtcttggaa aaggaccaaa ggtgtttcat aatctgatgc ttcctttgat 4261 gctgtccctc ccagcattat tgaacatcat tcttctcatc ttcctggtca tgttcatcta 4321 tgccgtattt ggaatgtata attttgccta tgttaaaaaa gaagctggaa ttaatgatgt 4381 gtctaatttt gaaacctttg gcaacagtat gctctgtctt tttcaagttg caatatttgc 4441 tggttgggat gggatgcttg atgcaatttt caacagtaaa tggtctgact gtgatcctga 4501 taaaattaac cctgggactc aagttagagg agattgtggg aacccctctg ttgggatttt 4561 ttattttgtc agttatatcc tcatatcatg gctgatcatt gtaaatatgt acattgttgt 4621 tgtcatggag tttttaaata ttgcttctaa gaagaaaaac aagaccttga gtgaagatga 4681 ttttaggaaa ttctttcagg tatggaaaag gtttgatcct gataggaccc agtacataga 4741 ctctagcaag ctttcagatt ttgcagctgc tcttgatcct cctcttttca tggcaaaacc 4801 aaacaagggc cagctcattg ctttggacct ccccatggct gttggggaca gaattcattg 4861 cctcgatatc ttacttgctt ttacaaagag agttatgggt caagatgtga ggatggagaa 4921 agttgtttca gaaatagaat cagggttttt gttagccaac ccttttaaga tcacatgtga 4981 gccaattacg actactttga aacgaaaaca agaggcagtt tcagcaacca tcattcaacg 5041 tgcttataaa aattaccgct tgaggcgaaa tgacaaaaat acatcagata ttcatatgat 5101 agatggtgac agagatgttc atgctactaa agaaggtgcc tattttgaca aagctaagga 5161 aaagtcacct attcaaagcc agatctaata ccacttacca cctcttttca tatttcttca 5221 catatctgaa aaatgttgaa agcctaagcc aggaataaaa gaaaagtaga gataataatc 5281 agttctttac aaccgatggt aattaagctt gtattcacaa gacttcatgc caaattcact 5341 ttttagcatt atatctaaca aatcaagaga atccttaata ttgctgcag // LOCUS HUMNADPHO 1349 bp mRNA PRI 07-JAN-1995 DEFINITION Human 47-kD autosomal chronic granulomatous disease protein mRNA, complete cds. ACCESSION M55067 M38755 NID g189050 KEYWORDS autosomal chronic granulomatous disease protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1349) AUTHORS Rodaway,A.R., Teahan,C.G., Casimir,C.M., Segal,A.W. and Bentley,D.L. TITLE Characterization of the 47-kilodalton autosomal chronic granulomatous disease protein: tissue-specific expression and transcriptional control by retinoic acid JOURNAL Mol. Cell. Biol. 10 (10), 5388-5396 (1990) MEDLINE 90377229 FEATURES Location/Qualifiers source 1..1349 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HL60" /map="7q11.23" gene 23..1334 /gene="NCF1" CDS 23..1195 /gene="NCF1" /codon_start=1 /db_xref="GDB:G00-120-222" /product="autosomal chronic granulomatous disease protein" /db_xref="PID:g189051" /translation="MGDTFIRHIALLGFEKRFVPSQHYVYMFLVKWQDLSEKVVYRRF TEIYEFHKTLKEMFPIEAGAINPENRIIPHLPAPKWFDGQRAAENRQGTLTEYCSTLM SLPTKISRCPHLLDFFKVRPDDLKLPTDNQTKKPETYLMPKDGKSTATDITGPIILQT YRAIADYEKTSGSEMALSTGDVVEVVEKSESGWWFCQMKAKRGWIPASFLEPLDSPDE TEDPEPNYAGEPYVAIKAYTAVEGDEVSLLEGEAVEVIHKLLDGWWVIRKDDVTGYFP SMYLQKSGQDVSQAQRQIKRGAPPRRSSIRNAHSIHQRSRKRLSQDAYRRNSVRFLQQ RRRQARPGPQSPGSPLEEERQTQRSKPQPAVPPRPSADLILNRCSESTKRKLASAV" polyA_signal 1329..1334 /gene="NCF1" /note="G00-120-222" polyA_site 1349 /gene="NCF1" /note="G00-120-222" BASE COUNT 289 a 443 c 399 g 218 t ORIGIN 1 gagcactgga ggccacccag tcatggggga caccttcatc cgtcacatcg ccctgctggg 61 ctttgagaag cgcttcgtac ccagccagca ctatgtgtac atgttcctgg tgaaatggca 121 ggacctgtcg gagaaggtgg tctaccggcg cttcaccgag atctacgagt tccataaaac 181 cttaaaagaa atgttcccta ttgaggcagg ggcgatcaat ccagagaaca ggatcatccc 241 ccacctccca gctcccaagt ggtttgacgg gcagcgggcc gccgagaacc gccagggcac 301 acttaccgag tactgcagca cgctcatgag cctgcccacc aagatctccc gctgtcccca 361 cctcctcgac ttcttcaagg tgcgccctga tgacctcaag ctccccacgg acaaccagac 421 aaaaaagcca gagacatact tgatgcccaa agatggcaag agtaccgcga cagacatcac 481 cggccccatc atcctgcaga cgtaccgcgc cattgccgac tacgagaaga cctcgggctc 541 cgagatggct ctgtccacgg gggacgtggt ggaggtcgtg gagaagagcg agagcggttg 601 gtggttctgt cagatgaaag caaagcgagg ctggatccca gcatccttcc tcgagcccct 661 ggacagtcct gacgagacgg aagaccctga gcccaactat gcaggtgagc catacgtcgc 721 catcaaggcc tacactgctg tggaggggga cgaggtgtcc ctgctcgagg gtgaagctgt 781 tgaggtcatt cacaagctcc tggacggctg gtgggtcatc aggaaagacg acgtcacagg 841 ctactttccg tccatgtacc tgcaaaagtc ggggcaagac gtgtcccagg cccaacgcca 901 gatcaagcgg ggggcgccgc cccgcaggtc gtccatccgc aacgcgcaca gcatccatca 961 gcggtcgcgg aagcgcctca gccaggacgc ctatcgccgc aacagcgtcc gttttctgca 1021 gcagcgacgc cgccaggcgc ggccgggacc gcagagcccc gggagcccgc tcgaggagga 1081 gcggcagacg cagcgctcta aaccgcagcc ggcggtgccc ccgcggccga gcgccgacct 1141 catcctgaac cgctgcagcg agagcaccaa gcggaagctg gcgtctgccg tctgaggctg 1201 gagcgcagtc cccagctagc gtctcggccc ttgccgcccc gtgcctgtac atacgtgttc 1261 tatagagcct ggcgtctgga cgccgagggc agccccgacc cctgtccagc gcggctcccg 1321 ccaccctcaa taaatgttgc ttggagtgg // LOCUS HUMNAGAT 1807 bp mRNA PRI 27-NOV-1995 DEFINITION Human I beta 1-6 N-acetylglucosaminyltransferase mRNA, complete cds. ACCESSION L19659 NID g307297 KEYWORDS I beta 1-6 N-acetylglucosaminyltransferase; N-acetylglucosaminyltransferase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1807) AUTHORS Bierhuizen,M.F., Mattei,M.G. and Fukuda,M. TITLE Expression of the developmental I antigen by a cloned human cDNA encoding a member of a beta-1,6-N-acetylglucosaminyltransferase gene family JOURNAL Genes Dev. 7 (3), 468-478 (1993) MEDLINE 93194065 FEATURES Location/Qualifiers source 1..1807 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="PA-1" /dev_stage="embryo" /tissue_type="carcinoma" CDS 255..1457 /EC_number="2.4.1.150" /codon_start=1 /product="I beta 1-6 N-acetylglucosaminyltransferase" /db_xref="PID:g307298" /translation="MPLSMRYLFIISVSSVIIFIVFSVFNFGGDPSFQRLNISDPLRL TQVCTSFINGKTRFLWKNKLMIHEKSSCKEYLTQSHYITAPLSKEEADFPLAYIMVIH HHFDTFARLFRAIYMPQNIYCVHVDEKATTEFKDAVEQLLSCFPNAFLASKMEPVVYG GISRLQADLNCIRDLSAFEVSWKYVINTCGQDFPLKTNKEIVQYLKGFKGKNITPGVL PPAHAIGRTKYVHQEHLGKELSYVIRTTALKPPPPHNLTIYFGSAYVALSREFANFVL HDPRAVDLLQWSKDTFSPDEHFWVTLNRIPGVPGSMPNASWTGNLRAIKWSDMEDRHG GCHGHYVHGICIYGNGDLKWLVNSPSLFANKFELNTYPLTVECLELRHRERTLNQSET AIQPSWYF" BASE COUNT 511 a 401 c 398 g 497 t ORIGIN 1 ctgggcttca gcaacctgcc acggggattt aaacaaagga ggtttgagag aggcgggatc 61 tggctgtaat atcggcacag ggacagagac agcagctgga ctctcgggat gaaacggaat 121 cgattcccag cgtctccaac agggcaggag tgagtggagt atgttgcaaa ataagaactc 181 agagaaacga gtgagtttgg aaaaaagact tacagatttt gacggtctct tgacatttca 241 cccttctttg aggcatgcct ttatcaatgc gttacctctt cataatttct gtctctagtg 301 taattatttt tatcgtcttc tctgtgttca attttggggg agatccaagc ttccaaaggc 361 taaatatctc agaccctttg aggctgactc aagtttgcac atcttttatc aatggaaaaa 421 cacgtttcct gtggaaaaac aaactaatga tccatgagaa gtcttcttgc aaggaatact 481 tgacccagag ccactacatc acagcccctt tatctaagga agaagctgac tttcccttgg 541 catatataat ggtcatccat catcactttg acacctttgc aaggctcttc agggctattt 601 acatgcccca aaatatctac tgtgttcatg tggatgaaaa agcaacaact gaatttaaag 661 atgcggtaga gcaactatta agctgcttcc caaacgcttt tctggcttcc aagatggaac 721 ccgttgtcta tggagggatc tccaggctcc aggctgacct gaactgcatc agagatcttt 781 ctgccttcga ggtctcatgg aagtacgtta tcaacacctg tgggcaagac ttccccctga 841 aaaccaacaa ggaaatagtt cagtatctga aaggatttaa aggtaaaaat atcaccccag 901 gggtgctgcc cccagctcat gcaattggac ggactaaata tgtccaccaa gagcacctgg 961 gcaaagagct ttcctatgtg ataagaacaa cagcgttgaa accgcctccc ccccataatc 1021 tcacaattta ctttggctct gcctatgtgg ctctatcaag agagtttgcc aactttgttc 1081 tgcatgaccc acgggctgtt gatttgctcc agtggtccaa ggacactttc agtcctgatg 1141 agcatttctg ggtgacactc aataggattc caggtgttcc tggctctatg ccaaatgcat 1201 cctggactgg aaacctcaga gctataaagt ggagtgacat ggaagacaga cacggaggct 1261 gccacggcca ctatgtacat ggtatttgta tctatggaaa cggagactta aagtggctgg 1321 ttaattcacc aagcctgttt gctaacaagt ttgagcttaa tacctacccc cttactgtgg 1381 aatgcctaga actgaggcat cgcgaaagaa ccctcaatca gagtgaaact gcgatacaac 1441 ccagctggta tttttgagct attcatgagc tactcatgac tgaagggaaa ctgcagctgg 1501 gaagaggagc ctgtttttgt gagagacttt tgccttcgta atgttaaccg tttcaggacc 1561 acgtttatag cttcaggacc tggctacgta attatactta aaatatccac tggacactgt 1621 gaaatacact aacaggatgg ctgggtagag caatctgggc actttggcca attttagtct 1681 tgctgtttct tgatgctcac ctctatatta gtttattgtt aggatcaatg ataaatttaa 1741 atgacctcag atctttgcac cagatactca tcatatacaa atgttttagt aaaaaagaga 1801 attgtag // LOCUS HUMNAGT3 2087 bp mRNA PRI 10-SEP-1993 DEFINITION Human mRNA for N-acetylglucosaminyltransferase III, complete cds. ACCESSION D13789 NID g398137 KEYWORDS N-acetylglucosaminyltransferase III. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2087) AUTHORS Ihara,Y., Nishikawa,A., Tohma,T., Soejima,H., Niikawa,N. and Taniguchi,N. TITLE cDNA cloning, expression, and chromosomal localization of human N-acetylglucosaminyltransferase III (GnT-III) JOURNAL J. Biochem. 113 (6), 692-698 (1993) MEDLINE 93380894 REFERENCE 2 (bases 1 to 2087) AUTHORS Ihara,Y. TITLE Direct Submission JOURNAL Submitted (27-NOV-1992) to the DDBJ/EMBL/GenBank databases. Yoshito Ihara, Osaka University Medical School, Department of Biochemistry; 2-2 Yamadaoka, Suita city, Osaka 565, Japan (E-mail:a62520a@center.osaka-u.ac.jp, Tel:06-875-7309, Fax:06-875-7314) COMMENT Submitted (27-NOV-1992) to DDBJ by: Yoshito Ihara Department of Biochemistry Osaka University Medical School 2-2 Yamadaoka, Suita-shi Osaka 565 Japan Phone: 06-875-7309 Email: a62520a@center.osaka-u.ac.jp Fax: 06-875-7314. FEATURES Location/Qualifiers source 1..2087 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 9..1604 /EC_number="2.4.1.144" /codon_start=1 /product="N-acetylglucosaminyltransferase III" /db_xref="PID:d1003443" /db_xref="PID:g398138" /translation="MRRYKLFLMFCMAGLCLISFLHFFKTLSYVTFPRELASLSPNLV SSFFWNNAPVTPQASPEPGGPDLLRTPLYSHSPLLQPLPPSKAAEELHRVDLVLPEDT TEYFVRTKAGGVCFKPGTKMLERPPPGRPEEKPEGANGSSARRPPRYLLSARERTGGR GARRKWVECVCLPGWHGPSCGVPTVVQYSNLPTKERLVPREVPRRVINAINVNHEFDL LDVRFHELGDVVDAFVVCESNFTAYGEPRPLKFREMLTNGTFEYIRHKVLYVFLDHFP PGGRQDGWIADDYLRTFLTQDGVSRLRNLRPDDVFIIDDADEIPARDGVLFLKLYDGW TEPFAFHMRTSLYGFFWKQPGTLEVVSGCTVDMLQAVYGLDGIRLRRRQYYTMPNFRQ YENRTGHILVQWSLGSPLHFAGWHCSWCFTPEGIYFKLVSAQNGDFPRWGDYEDKRDL NYIRGLIRTGGWFDGTQQEYPPADPSEHMYAPKYLLKNYDRFHYLLDNPYQEPRSTAA GGWRHRGPEGRPPARGKLDEAEV" BASE COUNT 353 a 684 c 685 g 365 t ORIGIN 1 ggatgaagat gagacgctac aagctctttc tcatgttctg tatggccggc ctgtgcctca 61 tctccttcct gcacttcttc aagaccctgt cctatgtcac cttcccccga gaactggcct 121 ccctcagccc taacctggtg tccagctttt tctggaacaa tgccccggtc acgccccagg 181 ccagccccga gccaggaggc cctgacctgc tgcgtacccc actctactcc cactcgcccc 241 tgctgcagcc gctgccgccc agcaaggcgg ccgaggagct ccaccgggtg gacttggtgc 301 tgcccgagga caccaccgag tatttcgtgc gcaccaaggc cggcggcgtc tgcttcaaac 361 ccggcaccaa gatgctggag aggccgcccc cgggacggcc ggaggagaag cctgaggggg 421 ccaacggctc ctcggcccgg cggccacccc ggtacctcct gagcgcccgg gagcgcacgg 481 ggggccgagg cgcccggcgc aagtgggtgg agtgcgtgtg cctgcccggc tggcacggac 541 ccagctgcgg cgtgcccact gtggtgcagt actccaacct gcccaccaag gagcggctgg 601 tgcccaggga ggtgccgcgc cgcgtcatca acgccatcaa cgtcaaccac gagttcgacc 661 tgctggacgt gcgcttccac gagctgggcg acgtggtgga cgcctttgtg gtgtgcgagt 721 ccaacttcac ggcttatggg gagccgcggc cgctcaagtt ccgggagatg ctgaccaatg 781 gcaccttcga gtacatccgc cacaaggtgc tctatgtctt cctggaccac ttcccgcccg 841 gcggccggca ggacggctgg atcgccgacg actacctgcg caccttcctc acccaggacg 901 gcgtctcgcg gctgcgcaac ctgcggcccg acgacgtctt catcattgac gatgcggacg 961 agatcccggc ccgtgacggc gtccttttcc tcaagctcta cgatggctgg accgagccct 1021 tcgccttcca catgcgcacg tcgctctacg gcttcttctg gaagcagccg ggcaccctgg 1081 aggtggtgtc aggctgcacg gtggacatgc tgcaggcagt gtatgggctg gacggcatcc 1141 gcctgcgccg ccgccagtac tacaccatgc ccaacttcag acagtatgag aaccgcaccg 1201 gccacatcct ggtgcagtgg tcgctgggca gccccctgca cttcgccggc tggcactgct 1261 cctggtgctt cacgcccgag ggcatctact tcaagctcgt gtccgcccag aatggcgact 1321 tcccacgctg gggtgactac gaggacaagc gggacctgaa ctacatccgc ggcctgatcc 1381 gcaccggggg ctggttcgac ggcacgcagc aggagtaccc gcctgcagac cccagcgagc 1441 acatgtatgc gcccaagtac ctgctgaaga actacgaccg gttccactac ctgctggaca 1501 acccctacca ggagcccagg agcacggcgg cgggcgggtg gcgccacagg ggtcccgagg 1561 gaaggccgcc cgcccggggc aaactggacg aggcggaagt ctagagctgc atgatctgat 1621 agggtttgtg acagggcggg ggtggcggcg gcccctagcg ctatctccct gcctcctgcc 1681 ggctccttgg ttcttgaggg gaccaggagt gggtggggag tgggggtggg gctagggttt 1741 ccctactgaa gcccttgtga tcaagggtca ggcctttgag ctcagaaaat atccctcctg 1801 ttgggagagg gcgcaggccg tgacgtctgg gtggccctta tgactgccaa gactgctgtg 1861 gccaggaggt gccactggag tgtgcgtggt ggtccctggg tagcggggga gggtaggcag 1921 gattggggaa gagagcctgc aggatctcac caggcagcct ctggggggtg gccaggccgg 1981 aaaaagccca ccatttggca tccctgggcc ttgggctccg tgtgggagac cggcctgcca 2041 ggaggaccca gggctctgta agtagatgca tttgggtcca ggaggaa // LOCUS HUMNAP 1560 bp mRNA PRI 14-MAR-1994 DEFINITION H.sapiens NAP (nucleosome assembly protein) mRNA, complete cds. ACCESSION M86667 NID g189066 KEYWORDS nucleosome assembly protein. SOURCE Homo sapiens unknown thymus cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1560) AUTHORS Simon,H.U., Mills,G.B., Kozlowski,M., Hogg,D., Branch,D., Ishimi,Y. and Siminovitch,K.A. TITLE Molecular characterization of hNRP, a cDNA encoding a human nucleosome-assembly-protein-I-related gene product involved in the induction of cell proliferation JOURNAL Biochem. J. 297 (Pt 2), 389-397 (1994) MEDLINE 94128073 FEATURES Location/Qualifiers source 1..1560 /organism="Homo sapiens" /db_xref="taxon:9606" 5'UTR 1..75 /partial /note="putative" gene 76..1251 /gene="NAP" CDS 76..1251 /gene="NAP" /codon_start=1 /db_xref="PID:g189067" /translation="MADIDNKEQSELDQDLDDVEEVEEEETGEETKLKARQLTVQMMQ NPQILAALQERLDGLVETPTGYIESLPRVVKRRVNALKNLQVKCAQIEAKFYEEVHDL ERKYAVLYQPLFDKRFEIINAIYEPTEEECEWKPDEEDEISEELKEKAKIEDEKKDEE KEDPKGIPEFWLTVFKNVDLLSDMVQEHDEPILKHLKDIKVKFSDAGQPMSFVLEFHF EPNEYFTNEVLTKTYRMRSEPDDSDPFSFDGPEIMGCTGCQIDWKKGKNVTLKTIKKK QKHKGRGTVRTVTKTVSNDSFFNFFAPPEVPESGDLDDDAEAILAADFEIGHFLRERI IPRSVLYFTGEAIEDDDDDYDEEGEEADEEGEEEGDEENDPDYDPKKDQNPAECKQQ" 3'UTR 1252..1560 polyA_signal 1391..1396 BASE COUNT 524 a 254 c 357 g 425 t ORIGIN 1 ccctgagtca ctgcctgcgc acgtccggcc gcctggctcc ccatactagt cgccgatatt 61 tggagttctt acaacatggc agacattgac aacaaagaac agtctgaact tgatcaagat 121 ttggatgatg ttgaagaagt agaagaagag gaaactggtg aagaaacaaa actcaaagca 181 cgtcagctaa ctgttcagat gatgcaaaat cctcagattc ttgcagccct tcaagaaaga 241 cttgatggtc tggtagaaac accaacagga tacattgaaa gcctgcctag ggtagttaaa 301 agacgagtga atgctctcaa aaacctgcaa gttaaatgtg cacagataga agccaaattc 361 tatgaggaag ttcatgatct tgaaaggaag tatgctgttc tctatcagcc tctatttgat 421 aagcgatttg aaattattaa tgcaatttat gaacctacgg aagaagaatg tgaatggaaa 481 ccagatgaag aagatgagat ttcggaggaa ttgaaagaaa aggccaagat tgaagatgag 541 aaaaaggatg aagaaaaaga agaccccaaa ggaattcctg aattttggtt aactgttttt 601 aagaatgttg acttgctcag tgatatggtt caggaacacg atgaacctat tctgaagcac 661 ttgaaagata ttaaagtgaa gttctcagat gctggccagc ctatgagttt tgtcttagaa 721 tttcactttg aacccaatga atattttaca aatgaagtgc tgacaaagac atacaggatg 781 aggtcagaac cagatgattc tgatcccttt tcttttgatg gaccagaaat tatgggttgt 841 acagggtgcc agatagattg gaaaaaagga aagaatgtca ctttgaaaac tattaagaag 901 aagcagaaac acaagggacg tgggacagtt cgtactgtga ctaaaacagt ttccaatgac 961 tctttcttta acttttttgc ccctcctgaa gttcctgaga gtggagatct ggatgatgat 1021 gctgaagcta tccttgctgc agacttcgaa attggtcact ttttacgtga gcgtataatc 1081 ccaagatcag tgttatattt tactggagaa gctattgaag atgatgatga tgattatgat 1141 gaagaaggtg aagaagcgga tgaggaaggg gaagaagaag gagatgagga aaatgatcca 1201 gactatgacc caaagaagga tcaaaaccca gcagagtgca agcagcagtg aagcaggatg 1261 tatgtggcct tgaggataac ctgcactggt ctaccttctg cttccctgga aaggatgaat 1321 ttacatcatt tgacaagcct attttcaagt tatttgttgt ttgtttgctt gtttttgttt 1381 ttgcagctaa aataaaaatt tcaaatacaa ttttagttct tacaagataa tgtcttaatt 1441 ttgtaccaat tcaggtagaa gtagaggcct accttgaatt aagggttata ctcagttttt 1501 aacacattgt tgaagaaaag gtaccagctt tggaacgaga tgctatacta ataagcaagt // LOCUS HUMNAPI3X 2554 bp mRNA PRI 24-AUG-1993 DEFINITION Homo sapiens renal Na/Pi-cotransporter mRNA, complete cds. ACCESSION L13258 NID g292349 KEYWORDS Na/Pi-cotransport; renal sodium-dependent phosphate transporter. SOURCE Homo sapiens (library: Superscript, pSport1) male adult kidney cortex cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2554) AUTHORS Magagnin,S., Werner,A., Markovich,D., Sorribas,V., Stange,G., Biber,J. and Murer,H. TITLE Expression cloning of human and rat renal cortex Na/Pi cotransport JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90, 5979-5983 (1993) MEDLINE 93317607 FEATURES Location/Qualifiers source 1..2554 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /sex="male" /tissue_type="kidney cortex" /tissue_lib="Superscript, pSport1" CDS 82..2001 /standard_name="Na/Pi-cotransport" /note="obtained by expression cloning using Xenopus laevis oocytes" /codon_start=1 /function="transport of phosphate" /evidence=experimental /product="renal sodium-dependent phosphate transporter" /db_xref="PID:g292350" /translation="MLSYGERLGSPAVSPLPVRGGHVMRGTAFAYVPSPQVLHRIPGT SAYAFPSLGPVALAEHTCPCGEVLERHEPLPAKLALEEEQKPESRLVPKLRQAGAMLL KVPLMLTFLYLFVCSLDMLSSAFQLAGGKVAGDIFKDNAILSNPVAGLVVGILVTVLV QSSSTSTSIIVSMVSSGLLEVSSAIPIIMGSNIGTSVTNTIVALMQAGDRTDFRRAFA GATVHDCFNWLSVLVLLPLEAATGYLHHITRLVVASFNIHGGRDAPDLLKIITEPFTK LIIQLDESVITSIATGDESLRNHSLIQIWCHPDSLQAPTSMSRAEANSSQTLGNATME KCNHIFVDTGLPDLAVGLILLAGSLVLLCTCLILLVKMLNSLLKGQVAKVIQKVINTD FPAPFTWVTGYFAMVVGASMTFVVQSSSVFTSAITPLIGLGVISIERAYPLTLGSNIG TTTTAILAALASPREKLSSAFQIALCHFFFNISGILLWYPVPCTRLPIRMAKALGKRT AKYRWFAVLYLLVCFLLLPSLVFGISMAGWQVMVGVGTPFGALLAFVVLINVLQSRSP GHLPKWLQTWDFLPRWMHSLKPLDHLITRATLCCARPEPRSPPLPPRVFLEELPPATP SPRLALPAHHNATRL" BASE COUNT 457 a 825 c 701 g 571 t ORIGIN 1 ctgctgagca gaagctgaaa cacagaattc taagcgttgc tgagacccac tgacctgcag 61 acctcatagt gggtgcccag gatgttgtcc tacggagaga ggctggggtc ccctgctgtc 121 tccccactcc cagtccgtgg ggggcatgtg atgcgaggga cggcctttgc ctacgtgccc 181 agccctcagg tcctacacag gatcccgggg acctctgcct atgccttccc cagcctgggc 241 cctgtggccc ttgctgagca cacctgcccc tgtggggagg tcctggagcg ccatgaacca 301 ctgcctgcca agctggccct ggaggaggag cagaagccag agtccaggct ggtccccaag 361 ctgcgccagg ctggcgccat gctgctcaag gtgccactga tgctcacctt cctctacctc 421 ttcgtctgct ccctggacat gctcagctcg gccttccagc tggctggagg gaaggtggct 481 ggtgacatct tcaaggataa cgccatcctg tccaacccgg tggccgggct ggtggtgggg 541 atcctggtga ccgtgctggt gcagagctcc agcacctcca catccatcat cgtcagcatg 601 gtctcctctg gcttgctgga ggtgagctct gccatcccca tcatcatggg ctccaacatc 661 ggcacctctg tcaccaacac catcgtggcc ctgatgcagg cgggggacag gactgacttc 721 cggcgggcct tcgcgggggc cacggtgcat gactgcttta actggctgtc agtgctggtc 781 ctgctgcccc tggaggctgc cactggctac ctgcaccaca tcactcgact tgtggtggcc 841 tccttcaaca tccatggtgg ccgtgatgct cctgacctgc tcaagatcat cacagagccc 901 ttcacgaagc tcatcatcca gctggacgag tctgtgataa ccagcattgc cactggtgat 961 gagtccctga ggaaccacag tctcatccag atctggtgcc acccagactc cttacaggct 1021 cccacctcca tgtccagagc agaggccaac tccagccaga cccttggaaa tgccaccatg 1081 gagaaatgca accacatctt tgtggacact ggcctaccgg acctggctgt ggggctcatc 1141 ctgctggcag gatccctggt gctgctgtgc acctgcctca tcctcctagt caagatgctc 1201 aactccctgc tcaagggcca agtggccaag gtcatccaga aggtcatcaa tacggacttc 1261 cctgccccct tcacctgggt cacaggctac tttgccatgg tggtgggcgc cagcatgacc 1321 ttcgtggtcc agagcagttc tgtgttcacc tcggccatca ccccactcat cggtcttggt 1381 gtgatcagca ttgagagggc ctacccgctc acactgggtt ccaacatcgg caccaccacc 1441 acggccatcc tggctgccct ggccagcccc agggagaagc tgtccagcgc tttccagatt 1501 gccctctgtc acttcttctt caacatctcg ggtatccttc tgtggtaccc ggtgccctgc 1561 acacgcctgc ccatccgcat ggccaaggcg ctggggaaac gcacggccaa gtaccgctgg 1621 tttgccgtcc tctatctcct tgtctgcttc ctgctgctgc cctcactggt gtttggcatc 1681 tccatggcag gctggcaggt catggtaggt gtgggcacgc ccttcggggc cctgctggcc 1741 ttcgtggtgc tcatcaatgt cctgcagagt cggagtcccg ggcacctgcc caagtggtta 1801 cagacatggg acttcctgcc tcgctggatg cactccctga agcccctgga ccacctcatc 1861 acccgcgcca ccctatgctg tgccaggcct gagccccgct cacccccgct gccccccagg 1921 gtcttcctgg aggagctacc ccctgccaca ccctcccccc gtcttgcact gcctgctcac 1981 cacaatgcca cccgcctcta ggctgtgggc ccagactaca gcctggaatg gggaaggcct 2041 ggtgtggaaa ggcaggggag ggagggtgtg tgtaggtatg tgcatgtgcc tgtgccaccc 2101 tgggtgccag tctctccttc tgtagctccg caaagctctg ggcttgtgtg agagtgtcgg 2161 tgtgtgtgca tgtgtggggg tgagtctgca tgtgcacctg tcatgtgtag aagcttgtat 2221 ttgtgtacag gtgtgccagc ccatgcaggt gtacacagac acacctgtgg gaggctgtgt 2281 gcaggctgca ggatatctgg gtatgatttc aggtcctctg cacgtgtaca catgactagg 2341 ataggcagga gtaagggtgg gtctgggtat atgactgtgc agctgtttgt gcatagatgt 2401 tggtgcctgc gttactgaat ttgcacacct ccttgccacc ttccttcctc caagatacca 2461 tctcctcatc ctaacccagg tcttcggcca ccaccacaat taattccctt cccaacactt 2521 gcctgatgaa aaaaaaacaa aggaattaaa actc // LOCUS HUMNATV1 2421 bp mRNA PRI 18-JAN-1996 DEFINITION Human mRNA for N-acetylglucosaminyltransferase V, complete cds. ACCESSION D17716 NID g469489 KEYWORDS N-acetylglucosaminyltransferase V; glycosyltransferase. SOURCE Homo sapiens fetal liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2421) AUTHORS Nishikawa,A., Saito,H., Gu,J., Ihara,Y., Soejima,H., Wada,Y., Sekiya,C., Kangawa,K., Niikawa,N. and Taniguchi,N. TITLE cDNA cloning and chromosomal mapping of human N-acetylglucosaminyltransferase V+ JOURNAL Biochemical and Biophysical Research Communication 198, 318-327 (1994) REFERENCE 2 (bases 1 to 2421) AUTHORS Nishikawa,A. TITLE Direct Submission JOURNAL Submitted (22-SEP-1993) to the DDBJ/EMBL/GenBank databases. Atsushi Nishikawa, Osaka University Medical School, Department of Biochemistry; 2-2 Yamadaoka, Suita city, Osaka 565, Japan (E-mail:a62520a@center.osaka-u.ac.jp, Tel:06-879-3421, Fax:06-879-3429) COMMENT Submitted (22-Sep-1993) to DDBJ by: Atsushi Nishikawa Department of Biochemistry Osaka University Medical School 2-2 Yamadaoka, Suita, Osaka 565 Japan Phone: 06-879-3421 Email: a62520a@center.osaka-u.ac.jp Fax: 06-879-3429. FEATURES Location/Qualifiers source 1..2421 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="liver" CDS 146..2371 /EC_number="2.4.1.155" /codon_start=1 /product="N-acetylglucosaminyltransferase V" /db_xref="PID:d1005098" /db_xref="PID:g870752" /translation="MALFTPWKLSSQKLGFFLVTFGFIWGMMLLHFTIQQRTQPESSS MLREQILDLSKRYIKALAEENRNVVDGPYAGVMTAYDLKKTLAVLLDNILQRIGKLES KVDNLVVNGTGTNSTNSTTAVPSLVALEKINVADIINGAQEKCVLPPMDGYPHCEGKI KWMKDMWRSDPCYADYGVDGSTCSFFIYLSEVENWCPHLPWRAKNPYEEADHNSLAEI RTDFNILYSMMKKHEEFRWMRLRIRRMADAWIQAIKSLAEKQNLEKRKRKKVLVHLGL LTKESGFKIAETAFSGGPLGELVQWSDLITSLYLLGHDIRISASLAELKEIMKKVVGN RSGCPTVGDRIVELIYIDIVGLAQFKKTLGPSWVHYQCMLRVLDSFGTEPEFNHANYA QSKGHKTPWGKWNLNPQQFYTMFPHTPDNSFLGFVVEQHLNSSDIHHINEIKRQNQSL VYGKVDSFWKNKKIYLDIIHTYMEVHATVYGSSTKNIPSYVKNHGILSGRDLQFLLRE TKLFVGLGFPYEGPAPLEAIANGCAFLNPKFNPPKSSKNTDFFIGKPTLRELTSQHPY AEVFIGRPHVWTVDLNNQEEVEDAVKAILNQKIEPYMPYEFTCEGMLQRINAFIEKQD FCHGQVMWPPLSALQVKLAEPGQSCKQVCQESQLICEPSFFQHLNKDKDMLKYKVTCQ SSELAKDILVPSFDPKNKHCVFQGDLLLFSCAGAHPRHQRVCPCRDFIKGQVALCKDC L" BASE COUNT 674 a 577 c 585 g 585 t ORIGIN Chromosome 2q21. 1 catcagaatg gaagtgagga aaggcaacca gctgacacag gagccagagt gagaccagca 61 gactctcaca ctcaacctac accatgaatt tgtgtctatc ttctacgcgt taagagccaa 121 ggacaggtga agttgccaga gagcaatggc tctcttcact ccgtggaagt tgtcctctca 181 gaagctgggc tttttcctgg tgacttttgg cttcatttgg ggtatgatgc ttctgcactt 241 taccatccag cagcgaactc agcctgaaag cagctccatg ctgcgcgagc agatcctgga 301 cctcagcaaa aggtacatca aggcactggc agaagaaaac aggaatgtgg tggatgggcc 361 atacgctgga gtcatgacag cttatgatct gaagaaaacc cttgctgtgt tattagataa 421 cattttgcag cgcattggca agttggagtc gaaggtggac aatcttgttg tcaatggcac 481 cggaacaaac tcaaccaact ccactacagc tgttcccagc ttggttgcac ttgagaaaat 541 taatgtggca gatatcatta acggagctca agaaaaatgt gtattgcctc ctatggacgg 601 ctaccctcac tgtgagggaa agatcaagtg gatgaaagac atgtggcgtt cagatccctg 661 ctacgcagac tatggagtgg atggatccac ctgctctttt tttatttacc tcagtgaggt 721 tgaaaattgg tgtcctcatt taccttggag agcaaaaaat ccctacgaag aagctgatca 781 taattcattg gcggaaattc gtacagattt taatattctc tacagtatga tgaaaaagca 841 tgaagaattc cggtggatga gactacggat ccggcgaatg gctgacgcat ggatccaagc 901 aatcaagtcc ctggcagaaa agcagaacct tgaaaagaga aagcggaaga aagtcctcgt 961 tcacctggga ctcctgacca aggaatctgg atttaagatt gcagagacag ctttcagtgg 1021 tggccctctt ggtgaattag ttcaatggag tgatttaatt acatctctgt acttactggg 1081 ccatgacatt aggatttcag cttcactggc tgagctcaag gaaatcatga agaaggttgt 1141 aggaaaccga tctggctgcc caactgtagg agacagaatt gttgagctca tttacattga 1201 tattgtagga cttgctcaat tcaagaaaac tcttggacca tcctgggttc attaccagtg 1261 catgctccga gtccttgatt catttggtac tgaacccgaa tttaatcatg caaattatgc 1321 ccaatcgaaa ggccacaaga ccccttgggg aaaatggaat ctgaaccctc agcagtttta 1381 taccatgttc cctcataccc cagacaacag ctttctgggg tttgtggttg agcagcacct 1441 gaactccagt gatatccacc acattaatga aatcaaaagg cagaaccagt cccttgtgta 1501 tggcaaagtg gatagcttct ggaagaataa gaagatctac ttggacatta ttcacacata 1561 catggaagtg catgcaactg tttatggctc cagcacaaag aatattccca gttacgtgaa 1621 aaaccatggt atcctcagtg gacgggacct gcagttcctt cttcgagaaa ccaagttgtt 1681 tgttggactt gggttccctt acgagggccc agctcccctg gaagctatcg caaatggatg 1741 tgcttttctg aatcccaagt tcaacccacc caaaagcagc aaaaacacag actttttcat 1801 tggcaagcca actctgagag agctgacatc ccagcatcct tacgctgaag ttttcatcgg 1861 gcggccacat gtgtggactg ttgacctcaa caatcaggag gaagtagagg atgcagtgaa 1921 agcaatttta aatcagaaga ttgagccata catgccatat gaatttacgt gcgaggggat 1981 gctacagaga atcaatgctt tcattgaaaa acaggacttc tgccatgggc aagtgatgtg 2041 gccacccctc agcgccctac aggtcaagct tgctgagccc gggcagtcct gcaagcaggt 2101 gtgccaggag agccagctca tctgcgagcc ttctttcttc cagcacctca acaaggacaa 2161 ggacatgctg aagtacaagg tgacctgcca aagctcagag ctggccaagg acatcctggt 2221 gccctccttt gaccctaaga ataagcactg tgtgtttcaa ggtgacctcc tgctcttcag 2281 ctgtgcaggc gcccacccca ggcaccagag ggtctgcccc tgccgggact tcatcaaggg 2341 ccaggtggct ctctgcaaag actgcctata gcagctacct gctcagccct gcaccatgct 2401 gctggggaag acagtggccc c // LOCUS HUMNCA 2533 bp mRNA PRI 07-JAN-1995 DEFINITION Human nonspecific crossreacting antigen mRNA, complete cds. ACCESSION M18728 NID g189084 KEYWORDS nonspecific cross-reacting antigen. SOURCE Human lung carcinoma cell line HLC-1, cDNA to mRNA, clones lambda-NCA11 and lambda-NCA15. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2533) AUTHORS Tawaragi,Y., Oikawa,S., Matsuoka,Y., Kosaki,G. and Nakazato,H. TITLE Primary structure of nonspecific crossreacting antigen (NCA), a member of carcinoembryonic antigen (CEA) gene family, deduced from cDNA sequence JOURNAL Biochem. Biophys. Res. Commun. 150 (1), 89-96 (1988) MEDLINE 88106638 COMMENT Draft entry and printed copy of sequence for [1] kindly provided by Y.Tawaragi, 19-MAR-1988. FEATURES Location/Qualifiers source 1..2533 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HLC-1" /tissue_type="lung carcinoma" /map="19q13.2" mRNA <1..2533 /gene="NCA" /note="G00-120-221" gene 1..2533 /gene="NCA" CDS 51..1085 /gene="NCA" /codon_start=1 /db_xref="GDB:G00-120-221" /product="non-specific cross reacting antigen" /db_xref="PID:g189085" /translation="MGPPSAPPCRLHVPWKEVLLTASLLTFWNPPTTAKLTIESTPFN VAEGKEVLLLAHNLPQNRIGYSWYKGERVDGNSLIVGYVIGTQQATPGPAYSGRETIY PNASLLIQNVTQNDTGFYTLQVIKSDLVNEEATGQFHVYPELPKPSISSNNSNPVEDK DAVAFTCEPEVQNTTYLWWVNGQSLPVSPRLQLSNGNMTLTLLSVKRNDAGSYECEIQ NPASANRSDPVTLNVLYGPDVPTISPSKANYRPGENLNLSCHAASNPPAQYSWFINGT FQQSTQELFIPNITVNNSGSYMCQAHNSATGLNRTTVTMITVSGSAPVLSAVATVGIT IGVLARVALI" sig_peptide 51..152 /gene="NCA" /note="G00-120-221" mat_peptide 153..1082 /gene="NCA" /note="G00-120-221" /product="non-specific cross reacting antigen" CDS 1355..1657 /gene="NCA" /note="ORF1" /codon_start=1 /db_xref="PID:g189086" /translation="MDSFSQDVKTRLLIMIRLLPPFNLSLLMPASFAWQDDAVISISQ EVASEGNLTECQIYLVNPNVLHKIRDPLVHPVTDISSIFNTAVCSNVQWSFSELDF" CDS 2370..2501 /gene="NCA" /note="ORF2" /codon_start=1 /db_xref="PID:g189087" /translation="MLTNVFISVVLFPCSNLTKPTVLVLYCPGGAITVLVEWCCFNS" BASE COUNT 732 a 648 c 507 g 646 t ORIGIN Unreported. 1 ggagctcaag ctcctctaca aagaggtgga cagagaagac agcagagacc atgggacccc 61 cctcagcccc tccctgcaga ttgcatgtcc cctggaagga ggtcctgctc acagcctcac 121 ttctaacctt ctggaaccca cccaccactg ccaagctcac tattgaatcc acgccattca 181 atgtcgcaga ggggaaggag gttcttctac tcgcccacaa cctgccccag aatcgtattg 241 gttacagctg gtacaaaggc gaaagagtgg atggcaacag tctaattgta ggatatgtaa 301 taggaactca acaagctacc ccagggcccg catacagtgg tcgagagaca atatacccca 361 atgcatccct gctgatccag aacgtcaccc agaatgacac aggattctat accctacaag 421 tcataaagtc agatcttgtg aatgaagaag caaccggaca gttccatgta tacccggagc 481 tgcccaagcc ctccatctcc agcaacaact ccaaccccgt ggaggacaag gatgctgtgg 541 ccttcacctg tgaacctgag gttcagaaca caacctacct gtggtgggta aatggtcaga 601 gcctcccggt cagtcccagg ctgcagctgt ccaatggcaa catgaccctc actctactca 661 gcgtcaaaag gaacgatgca ggatcctatg aatgtgaaat acagaaccca gcgagtgcca 721 accgcagtga cccagtcacc ctgaatgtcc tctatggccc agatgtcccc accatttccc 781 cctcaaaggc caattaccgt ccaggggaaa atctgaacct ctcctgccac gcagcctcta 841 acccacctgc acagtactct tggtttatca atgggacgtt ccagcaatcc acacaagagc 901 tctttatccc caacatcact gtgaataata gcggatccta tatgtgccaa gcccataact 961 cagccactgg cctcaatagg accacagtca cgatgatcac agtctctgga agtgctcctg 1021 tcctctcagc tgtggccacc gtcggcatca cgattggagt gctggccagg gtggctctga 1081 tatagcagcc ctggtgtatt ttcgatattt caggaagact ggcagattgg accagaccct 1141 gaattcttct agctcctcca atcccatttt atcccatgga accactaaaa acaaggtctg 1201 ctctgctcct gaagccctat atgctggaga tggacaactc aatgaaaatt taaagggaaa 1261 accctcaggc ctgaggtgtg tgccactcag agacttcacc taactagaga cagtcaaact 1321 gcaaaccatg gtgagaaatt gacgacttca cactatggac agcttttccc aagatgtcaa 1381 aacaagactc ctcatcatga taaggctctt accccctttt aatttgtcct tgcttatgcc 1441 tgcctctttc gcttggcagg atgatgctgt cattagtatt tcacaagaag tagcttcaga 1501 gggtaactta acagagtgtc agatctatct tgtcaatccc aacgttttac ataaaataag 1561 agatccttta gtgcacccag tgactgacat tagcagcatc tttaacacag ccgtgtgttc 1621 aaatgtacag tggtcctttt cagagttgga cttctagact cacctgttct cactccctgt 1681 tttaattcaa cccagccatg caatgccaaa taatagaatt gctccctacc agctgaacag 1741 ggaggagtct gtgcagtttc tgacacttgt tgttgaacat ggctaaatac aatgggtatc 1801 gctgagacta agttgtagaa attaacaaat gtgctgcttg gttaaaatgg ctacactcat 1861 ctgactcatt ctttattcta ttttagttgg tttgtatctt gcctaaggtg cgtagtccaa 1921 ctcttggtat taccctccta atagtcatac tagtagtcat actccctggt gtagtgtatt 1981 ctctaaaagc tttaaatgtc tgcatgcagc cagccatcaa atagtgaatg gtctctcttt 2041 ggctggaatt acaaaactca gagaaatgtg tcatcaggag aacatcataa cccatgaagg 2101 ataaaagccc caaatggtgg taactgataa tagcactaat gctttaagat ttggtcacac 2161 tctcacctag gtgagcgcat tgagccagtg gtgctaaatg ctacatactc caactgaaat 2221 gttaaggaag aagatagatc caattaaaaa aaattaaaac caatttaaaa aaaaaaaaga 2281 acacaggaga ttccagtcta cttgagttag cataatacag aagtcccctc tactttaact 2341 tttacaaaaa agtaacctga actaatctga tgttaaccaa tgtatttatt tctgtggttc 2401 tgtttccttg ttccaatttg acaaaaccca ctgttcttgt attgtattgc ccagggggag 2461 ctatcactgt acttgtagag tggtgctgct ttaattcata aatcacaaat aaaagccaat 2521 tagctctata act // LOCUS HUMNCBLCA 687 bp mRNA PRI 28-FEB-1996 DEFINITION Human neutrophil cytochrome b light chain p22 phagocyte b-cytochrome mRNA, complete cds. ACCESSION M21186 J03774 NID g189105 KEYWORDS cytochrome b. SOURCE Human cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 687) AUTHORS Parkos,C.A., Dinauer,M.C., Walker,L.E., Allen,R.A., Jesaitis,A.J. and Orkin,S.H. TITLE Primary structure and unique expression of the 22-kilodalton light chain of human neutrophil cytochrome b JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85 (10), 3319-3323 (1988) MEDLINE 88217892 COMMENT Draft entry and computer-readable sequence [1] kindly submitted by M.C.Dinauer 30-JUN-1988. FEATURES Location/Qualifiers source 1..687 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pSV-HdIII" /tissue_type="promyelocytic leukemia" /map="580 bp upstream of BstEII site; Xp21" /chromosome="X" CDS 29..616 /codon_start=1 /product="p22 phagocyte b-cytochrome" /db_xref="PID:g189106" /translation="MGQIEWAMWANEQALASGLILITGGIVATAGRFTQWYFGAYSIV AGVFVCLLEYPRGKRKKGSTMERWGQKHMTAVVKLFGPFTRNYYVRAVLHLLLSVPAG FLLATILGTACLAIASGIYLLAAVRGEQWTPIEPKPRERPQIGGTIKQPPSNPPPRPP AEARKKPSEEEAAAAAGGPPGGPQVNPIPVTDEVV" BASE COUNT 116 a 235 c 232 g 104 t ORIGIN 1 gcagtgtccc agccgggttc gtgtcgccat ggggcagatc gagtgggcca tgtgggccaa 61 cgagcaggcg ctggcgtccg gcctgatcct catcaccggg ggcatcgtgg ccacagctgg 121 gcgcttcacc cagtggtact ttggtgccta ctccattgtg gcgggcgtgt ttgtgtgcct 181 gctggagtac ccccggggga agaggaagaa gggctccacc atggagcgct ggggacagaa 241 gcacatgacc gccgtggtga agctgttcgg gccctttacc aggaattact atgttcgggc 301 cgtcctgcat ctcctgctct cggtgcccgc cggcttcctg ctggccacca tccttgggac 361 cgcctgcctg gccattgcga gcggcatcta cctactggcg gctgtgcgtg gcgagcagtg 421 gacgcccatc gagcccaagc cccgggagcg gccgcagatc ggaggcacca tcaagcagcc 481 gcccagcaac cccccgccgc ggcccccggc cgaggcccgc aagaagccca gcgaggagga 541 ggctgcggcg gcggcggggg gacccccggg aggtccccag gtcaacccca tcccggtgac 601 cgacgaggtc gtgtgacctc gccccggacc tgccctccca ccaggtgcac ccacctgcaa 661 taaacgcagc gaaggccggg aaaaaaa // LOCUS HUMNEKAR 1197 bp mRNA PRI 11-JAN-1991 DEFINITION Human neurokinin A receptor (NK-2R) mRNA, complete cds. ACCESSION M57414 J05680 NID g189134 KEYWORDS neurokinin A receptor; neuropeptide receptor; plasma membrane protein; substance K receptor. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1197) AUTHORS Gerard,N.P., Eddy,R.L.Jr.., Shows,T.B.Jr.. and Gerard,C. TITLE The human neurokinin a (substance K) receptor: Molecular cloning of the gene, chromosome localization, and isolation of cDNA from tracheal and gastric tissues JOURNAL J. Biol. Chem. 265, 20455-20462 (1990) MEDLINE 91056095 FEATURES Location/Qualifiers source 1..1197 /organism="Homo sapiens" /db_xref="taxon:9606" /map="chromosome 10" gene 1..1197 /gene="NK-2R" CDS 1..1197 /gene="NK-2R" /codon_start=1 /product="neurokinin A receptor" /db_xref="PID:g189135" /translation="MGTCDIVTEANISSGPESNTTGITAFSMPSWQLALWAPAYLALV LVAVTGNAIVIWIILAHRRMRTVTNYFIVNLALADLCMAAFNAAFNFVYASHNIWYFG RAFCYFQNLFPITAMFVSIYSMTAIAADRYMAIVHPFQPRLSAPSTKAVIAGIWLVAL ALASPQCFYSTVTMDQGATKCVVAWPEDSGGKTLLLYHLVVIALIYFLPLAVMFVAYS VIGLTLWRRAVPGHQAHGANLRHLQAKKKFVKTMVLVVLTFAICWLPYHLYFILGSFQ EDIYCHKFIQQVYLALFWLAMSSTMYNPIIYCCLNHRFRSGFRLAFRCCPWVTPTKED KLELTPTTSLSTRVNRCHTKETLFMAGDTAPSEATSGEAGRPQDGSGLWFGYGLLAPT KTHVEI" BASE COUNT 221 a 397 c 303 g 276 t ORIGIN 1 atggggacct gtgacattgt gactgaagcc aatatctcat ctggccctga gagcaacacc 61 acgggcatca cagccttctc catgcccagc tggcagctgg cactgtgggc accagcctac 121 ctggccctgg tgctggtggc cgtgacgggt aatgccatcg tcatctggat catcctggcc 181 catcggagga tgcgcacagt caccaactac ttcatcgtca atctggcgct ggctgacctc 241 tgcatggctg ccttcaatgc cgccttcaac tttgtctatg ccagccacaa catctggtac 301 tttggccgtg ccttctgcta cttccagaac ctcttcccca tcacagccat gtttgtcagc 361 atctactcca tgaccgccat tgctgccgac aggtacatgg ccatcgtcca ccccttccag 421 cctcggcttt cagctcccag caccaaggcg gttattgctg gcatctggct ggtggctctc 481 gccctggcct cccctcagtg cttctactcc accgtcacca tggaccaggg tgccaccaag 541 tgcgtggtgg cctggcccga agacagcggg ggcaagacgc tcctcctgta ccacctcgtg 601 gtgatcgccc tcatctactt cctgccgctc gcggtgatgt ttgtagccta cagcgtcatc 661 ggcctcacgc tctggaggcg cgcagtgccc ggacatcagg cgcacggtgc caacctccgc 721 catctgcagg ccaagaagaa gtttgtgaag accatggtgc tggtggtgct gacgtttgcc 781 atctgctggc tgccctacca cctctacttc atcctgggca gcttccagga ggacatctac 841 tgccacaagt tcatccagca agtctacctg gcactcttct ggttggccat gagctctacc 901 atgtacaatc ccatcatcta ctgctgtctc aaccacaggt ttcgctctgg gttccggctt 961 gccttccgct gctgcccatg ggtcacaccc accaaggaag ataagctcga gctgactccc 1021 acgacctccc tctccacgag agtcaacagg tgtcacacta aggagacttt gttcatggct 1081 ggggacacag ccccctccga ggctaccagt ggggaggcgg ggcgtcccca ggatggatca 1141 gggctatggt ttgggtatgg tttgcttgcc cccaccaaaa ctcatgttga aatttga // LOCUS HUMNEUYREC 1470 bp mRNA PRI 07-JAN-1995 DEFINITION Human neuropeptide y receptor mRNA, complete cds. ACCESSION M84755 NID g189153 KEYWORDS neuropeptide Y receptor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1470) AUTHORS Herzog,H., Hort,Y.J., Ball,H.J., Hayes,G., Shine,J. and Selbie,L.A. TITLE Cloned human neuropeptide Y receptor couples to two different second messenger systems JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (13), 5794-5798 (1992) MEDLINE 92335184 FEATURES Location/Qualifiers source 1..1470 /organism="Homo sapiens" /db_xref="taxon:9606" /map="7pter-q22" gene 1..1155 /gene="NPY" CDS 1..1155 /gene="NPY" /codon_start=1 /db_xref="GDB:G00-119-456" /product="neuropeptide y receptor" /db_xref="PID:g189154" /translation="MNSTLFSQVENHSVHSNFSEKNAQLLAFENDDCHLPLAMIFTLA LAYGAVIILGVSGNLALIIIILKQKEMRNVTNILIVNLSFSDLLVAIMCLPLTFVYTL MDHWVFGEAMCKLNPFVQCVSITVSIFSLVLIAVERHQLIINPRGWRPNNRHAYVGIA VIWVLAVASSLPFLIYQVMTDEPFQNVTLDAYKDKYVCFDQFPSDSHRLSYTTLLLVL QYFGPLCFIFICYFKIYIRLKRRNNMMDKMRDNKYRSSETKRINIMLLSIVVAFAVCW LPLTIFNTVFDWNHQIIATCNHNLLFLLCHLTAMISTCVNPIFYGFLNKNFQRDLQFF FNFCDFRSRDDDYETIAMSTMHTDVSKTSLKQASPVAFKKINNNDDNEKI" BASE COUNT 404 a 299 c 272 g 495 t ORIGIN 1 atgaattcaa cattattttc ccaggttgaa aatcattcag tccactctaa tttctcagag 61 aagaatgccc agcttctggc ttttgaaaat gatgattgtc atctgccctt ggccatgata 121 tttaccttag ctcttgctta tggagctgtg atcattcttg gtgtctctgg aaacctggcc 181 ttgatcataa tcatcttgaa acaaaaggag atgagaaatg ttaccaacat cctgattgtg 241 aacctttcct tctcagactt gcttgttgcc atcatgtgtc tccctttgac atttgtctac 301 acattaatgg accactgggt ctttggtgag gcgatgtgta agttgaatcc ttttgtgcaa 361 tgtgtttcaa tcactgtgtc cattttctct ctggttctca ttgctgtgga acgacatcag 421 ctgataatca accctcgagg ttggagacca aataatagac atgcttatgt aggtattgct 481 gtgatttggg tccttgctgt ggcttcttct ttgcctttcc tgatctacca agtaatgact 541 gatgagccgt tccaaaatgt aacacttgat gcgtacaaag acaaatacgt gtgctttgat 601 caatttccat cggactctca taggttgtct tataccactc tcctcttggt gctgcagtat 661 tttggtccac tttgttttat atttatttgc tacttcaaga tatatatacg cctaaaaagg 721 agaaacaaca tgatggacaa gatgagagac aataagtaca ggtccagtga aaccaaaaga 781 atcaatatca tgctgctctc cattgtggta gcatttgcag tctgctggct ccctcttacc 841 atctttaaca ctgtgtttga ttggaatcat cagatcattg ctacctgcaa ccacaatctg 901 ttattcctgc tctgccacct cacagcaatg atatccactt gtgtcaaccc catattttat 961 gggttcctga acaaaaactt ccagagagac ttgcagttct tcttcaactt ttgtgatttc 1021 cggtctcggg atgatgatta tgaaacaata gccatgtcca cgatgcacac agatgtttcc 1081 aaaacttctt tgaagcaagc aagcccagtc gcatttaaaa aaatcaacaa caatgatgat 1141 aatgaaaaaa tctgaaacta cttatagcct atggtcccgg atgacatctg tttaaaaaca 1201 agcacaacct gcaacatact ttgattacct gttctcccaa ggaatggggt tgaaatcatt 1261 tgaaaatgac taagattttc ttgtcttgct ttttactgct tttgttgtag ttgtcataat 1321 tacatttgga acaaaaggtg tgggctttgg ggtcttctgg aaatagtttt gaccagacat 1381 ctttgaagtg ctttttgtga atttatgcat ataatataaa gacttttata ctgtacttat 1441 tggaatgaaa tttctttaaa gtattacgat // LOCUS HUMNF1AA 1483 bp mRNA PRI 07-JAN-1995 DEFINITION Human neurofibromatosis type 1 (NF1) mRNA, complete cds. ACCESSION M61213 NID g189162 KEYWORDS neurofibromatosis protein type 1. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1483) AUTHORS Martin,G.A., Viskochil,D., Bollag,G., McCabe,P.C., Crosier,W.J., Haubruck,H., Conroy,L., Clark,R., O'Connell,P., Cawthon,R.M., Innis,M.A. and McCormick,F. TITLE The GAP-related domain of the neurofibromatosis type 1 gene product interacts with ras p21 JOURNAL Cell 63 (4), 843-849 (1990) MEDLINE 91029515 FEATURES Location/Qualifiers source 1..1483 /organism="Homo sapiens" /db_xref="taxon:9606" gene 19..1467 /gene="NF1" CDS 19..1467 /gene="NF1" /codon_start=1 /product="neurofibromatosis protein type 1" /db_xref="PID:g189163" /translation="MEAKSQLFLKYFTLFMNLLNDCSEVEDESAQTGGRKRGMSRRLA SLRHCTVLAMSNLLNANVDSGLMHSIGLGYHKDLQTRATFMEVLTKILQQGTEFDTLA ETVLADRFERLVELVTMMGDQGELPIAMALANVVPCSQWDELARVLVTLFDSRHLLYQ LLWNMFSKEVELADSMQTLFRGNSLASKIMTFCFKVYGATYLQKLLDPLLRIVITSSD WQHVSFEVDPTRLEPSESLEENQRNLLQMTEKFFHAIISSSSEFPPQLRSVCHCLYQV VSQRFPQNSIGAVGSAMFLRFINPAIVSPYEAGILDKKPPPRIERGLKLMSKILQSIA NHVLFTKEEHMRPFNDFVKSNFDAARRFFLDIASDCPTSDAVNHSLSFISDGNVLALH RLLWNNQEKIGQYLSSNRDHKAVGRRPFDKMATLLAYLGPPEHKPVADTHWSSLNLTS SKFEEFMTRHQVHEKEEFKALKTLTPPPEPET" BASE COUNT 434 a 327 c 321 g 401 t ORIGIN 1 ggagatggtg tgtcgaccat ggaagccaaa tcacagttat ttcttaaata cttcacatta 61 tttatgaacc ttttgaatga ctgcagtgaa gttgaagatg aaagtgcgca aacaggtggc 121 aggaaacgtg gcatgtctcg gaggctggca tcactgaggc actgtacggt ccttgcaatg 181 tcaaacttac tcaatgccaa cgtagacagt ggtctcatgc actccatagg cttaggttac 241 cacaaggatc tccagacaag agctacattt atggaagttc tgacaaaaat ccttcaacaa 301 ggcacagaat ttgacacact tgcagaaaca gtattggctg atcggtttga gagattggtg 361 gaactggtca caatgatggg tgatcaagga gaactcccta tagcgatggc tctggccaat 421 gtggttcctt gttctcagtg ggatgaacta gctcgagttc tggttactct gtttgattct 481 cggcatttac tctaccaact gctctggaac atgttttcta aagaagtaga attggcagac 541 tccatgcaga ctctcttccg aggcaacagc ttggccagta aaataatgac attctgtttc 601 aaggtatatg gtgctaccta tctacaaaaa ctcctggatc ctttattacg aattgtgatc 661 acatcctctg attggcaaca tgttagcttt gaagtggatc ctaccaggtt agaaccatca 721 gagagccttg aggaaaacca gcggaacctc cttcagatga ctgaaaagtt cttccatgcc 781 atcatcagtt cctcctcaga attcccccct caacttcgaa gtgtgtgcca ctgtttatac 841 caggtggtta gccagcgttt ccctcagaac agcatcggtg cagtaggaag tgccatgttc 901 ctcagattta tcaatcctgc cattgtctca ccgtatgaag cagggatttt agataaaaag 961 ccaccaccta gaatcgaaag gggcttgaag ttaatgtcaa agatacttca gagtattgcc 1021 aatcatgttc tcttcacaaa agaagaacat atgcggcctt tcaatgattt tgtgaaaagc 1081 aactttgatg cagcacgcag gtttttcctt gatatagcat ctgattgtcc tacaagtgat 1141 gcagtaaatc atagtctttc cttcataagt gacggcaatg tgcttgcttt acatcgtcta 1201 ctctggaaca atcaggagaa aattgggcag tatctttcca gcaacaggga tcataaagct 1261 gttggaagac gaccttttga taagatggca acacttcttg catacctggg tcctccagag 1321 cacaaacctg tggcagatac acactggtcc agccttaacc ttaccagttc aaagtttgag 1381 gaatttatga ctaggcatca ggtacatgaa aaagaagaat tcaaggcttt gaaaacgtta 1441 acaccaccac cagaaccaga aacatgagct ctagagaatc cta // LOCUS HUMNF1AAA 2621 bp mRNA PRI 10-MAR-1997 DEFINITION Human mRNA for NF1 N-isoform-exon11, complete cds. ACCESSION D42072 NID g1060900 KEYWORDS NF1; NF1 N-isoform-exon11; neurofibromin. SOURCE Homo sapiens kidney cDNA to mRNA, clone:pHN-1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Suzuki,H., Takahashi,K., Kubota,Y. and Shibahara,S. TITLE Molecular cloning of a cDNA coding for neurofibromatosis type 1 protein isoform lacking the domain related to ras GTPase-activating protein JOURNAL Biochem. Biophys. Res. Commun. 187 (2), 984-990 (1992) MEDLINE 92412152 REFERENCE 2 (bases 1 to 2621) AUTHORS Suzuki,H., Takahashi,K. and Shibahara,S. TITLE Evidence for the presence of two amino-terminal isoforms of neurofibromin, a gene product responsible for neurofibromatosis type 1 JOURNAL Tohoku J. Exp. Med. 175 (4), 225-233 (1995) MEDLINE 96047222 REFERENCE 3 (bases 1 to 2621) AUTHORS Shibahara,S. TITLE Direct Submission JOURNAL Submitted (09-NOV-1994) to the DDBJ/EMBL/GenBank databases. Shigeki Shibahara, Tohoku University School of Medicine, Dept. of Applied Physiol. and Mol. Biol.; 2-1 Seiryomachi, Aoba-ku, Sendai, Miyagi 980, Japan (Tel:022-717-8117, Fax:022-717-8118) FEATURES Location/Qualifiers source 1..2621 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="17" /clone="pHN-1" /map="17q11.2" /tissue_type="kidney" gene 66..1847 /gene="NF1" CDS 66..1847 /gene="NF1" /note="A novel neurofibromin isoform lacking a domain related to GTPase-activating protein" /codon_start=1 /product="NF1 N-isoform-exon11" /db_xref="PID:d1008251" /db_xref="PID:g1060901" /translation="MAAHRPVEWVQAVVSRFDEQLPIKTGQQNTHTKVSTEHNKECLI NISKYKFSLVISGLTTILKNVNNMRIFGEAAEKNLYLSQLIILDTLEKCLAGQPKDTM RLDETMLVKQLLPEICHFLHTCREGNQHAAELRNSASGVLFSLSCNNFNAVFSRISTR LQELTVCSEDNVDVHDIELLQYINVDCAKLKRLLKETAFKFKALKKVAQLAVINSLEK AFWNWVENYPDEFTKLYQIPQTDMAECAEKLFDLVDGFAESTKRKAAVWPLQIILLIL CPEIIQDISKDVVDENNMNKKLFLDSLRKALAGHGGSRQLTESAAIACVKLCKASTYI NWEDNSVIFLLVQSMVVDLKNLLFNPSKPFSRGSQPADVDLMIDCLVSCFRISPHNNQ HFKICLAQNSPSTFHYVLVNSLHRIITNSALDWWPKIDAVYCHSVELRNMFGETLHKA VQGCGAHPAIRMAPSLTFKEKVTSLKFKEKPTDLETRSYKYLLLSMVKLIHADPKLLL CNPRKQGPETQGSTAELITGLVQLVPQSHMPEIAQEAMEALLVLHQLDSIDLWNPDAP VETFWEIRYMYFYFLNSTFKFYFVFLS" mutation 767 /gene="NF1" /replace="g" BASE COUNT 824 a 489 c 493 g 815 t ORIGIN 1 ccctcttccc ggcccagggc gccggcccac ccttccctcc gccgcccccc ggccgcgggg 61 aggacatggc cgcgcacagg ccggtggaat gggtccaggc cgtggtcagc cgcttcgacg 121 agcagcttcc aataaaaaca ggacagcaga acacacatac caaagtcagt actgagcaca 181 acaaggaatg tctaatcaat atttccaaat acaagttttc tttggttata agcggcctca 241 ctactatttt aaagaatgtt aacaatatga gaatatttgg agaagctgct gaaaaaaatt 301 tatatctctc tcagttgatt atattggata cactggaaaa atgtcttgct gggcaaccaa 361 aggacacaat gagattagat gaaacgatgc tggtcaaaca gttgctgcca gaaatctgcc 421 attttcttca cacctgtcgt gaaggaaacc agcatgcagc tgaacttcgg aattctgcct 481 ctggggtttt attttctctc agctgcaaca acttcaatgc agtctttagt cgcatttcta 541 ccaggttaca ggaattaact gtttgttcag aagacaatgt tgatgttcat gatatagaat 601 tgttacagta tatcaatgtg gattgtgcaa aattaaaacg actcctgaag gaaacagcat 661 ttaaatttaa agccctaaag aaggttgcgc agttagcagt tataaatagc ctggaaaagg 721 cattttggaa ctgggtagaa aattatccag atgaatttac aaaactatac cagatcccac 781 agactgatat ggctgaatgt gcagaaaagc tatttgactt ggtggatggt tttgctgaaa 841 gcaccaaacg taaagcagca gtttggccac tacaaatcat tctccttatc ttgtgtccag 901 aaataatcca ggatatatcc aaagacgtgg ttgatgaaaa caacatgaat aagaagttat 961 ttctggacag tctacgaaaa gctcttgctg gccatggagg aagtaggcag ctgacagaaa 1021 gtgctgcaat tgcctgtgtc aaactgtgta aagcaagtac ttacatcaat tgggaagata 1081 actctgtcat tttcctactt gttcagtcca tggtggttga tcttaagaac ctgcttttta 1141 atccaagtaa gccattctca agaggcagtc agcctgcaga tgtggatcta atgattgact 1201 gccttgtttc ttgctttcgt ataagccctc acaacaacca acactttaag atctgcctgg 1261 ctcagaattc accttctaca tttcactatg tgctggtaaa ttcactccat cgaatcatca 1321 ccaattccgc attggattgg tggcctaaga ttgatgctgt gtattgtcac tcggttgaac 1381 ttcgaaatat gtttggtgaa acacttcata aagcagtgca aggttgtgga gcacacccag 1441 caatacgaat ggcaccgagt cttacattta aagaaaaagt aacaagcctt aaatttaaag 1501 aaaaacctac agacctggag acaagaagct ataagtatct tctcttgtcc atggtgaaac 1561 taattcatgc agatccaaag ctcttgcttt gtaatccaag aaaacagggg cccgaaaccc 1621 aaggcagtac agcagaatta attacagggc tcgtccaact ggtccctcag tcacacatgc 1681 cagagattgc tcaggaagca atggaggctc tgctggttct tcatcagtta gatagcattg 1741 atttgtggaa tcctgatgct cctgtagaaa cattttggga gattaggtat atgtactttt 1801 attttttaaa ttcaactttt aaattttatt ttgtattttt gtcttgaaat attaactctg 1861 tagtacttag tacattgtaa aacttacact tccaaaggtt ttatggtttt gtattttatt 1921 tgacttcaaa ttattagaat ttcttgtttt aactgtaaga aaagtatcac agcaatttag 1981 aaaataaatt ttaagaatag tgctaaattt tgtcacccta acataagtac tgttgtttgg 2041 tatattactt ttttcagatt tcaatgtggt tactactgta tttttaatag attttcatag 2101 ttataagcct agaatgataa aattttgtaa caatactgtt ttttcagttt tttgaactat 2161 gatctttcat aaactttctg taataccaat gctttcgatg aatgaattaa taatggacac 2221 ctgcttagaa gaaaaaaatg tatgcagaat tttgtggtct gcttcctaga ttatacaaat 2281 cattacattt taatgagcat gaagtcacca cacggaggaa aatgtaaatg tgtaaacctc 2341 aagtttgcca ttatcttata agaatgggtg tgctaagtta cttggcagct gaattaaacc 2401 ttactctaga gtagtgctgt ccactagtaa tataatgaga accccatatg ttaaaaactt 2461 ttttacttac tacattaaag agtaaaaaga agcaggtaaa attaattgta attatgtttt 2521 gtttaacccg gtgtatatta tctataattt caacatgtaa tcaatataaa agttattaat 2581 gacatttcat tctctttttt ttaataaagt cttcaaactt t // LOCUS HUMNFKB 3625 bp mRNA PRI 20-FEB-1991 DEFINITION Human nuclear factor kappa-B DNA binding subunit (NF-kappa-B) mRNA, complete cds. ACCESSION M58603 NID g189177 KEYWORDS nuclear factor kappa-B DNA binding subunit. SOURCE Human premyeloid, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3625) AUTHORS Meyer,R., Hatada,E.N., Hohmann,H.-P., Haiker,M., Bartsch,C., Roethlisberger,U., Lahm,H.-W., Schlaeger,E.J., van Loon,A.P.G.M. and Scheidereit,C. TITLE Cloning of the DNA-binding subunit of human nuclear factor kappa-B: The level of its mRNA is strongly regulated by phorbol ester or tumor necrosis factor alpha JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88, 966-970 (1991) MEDLINE 91126115 FEATURES Location/Qualifiers source 1..3625 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HL60" /tissue_type="premyeloid" /tissue_lib="lambda ZAP" CDS 398..3304 /codon_start=1 /product="nuclear factor kappa-B DNA binding subunit" /db_xref="PID:g189178" /translation="MAEDDPYLGRPEQMFHLDPSLTHTIFNPEVFQPQMALPTDGPYL QILEQPKQRGFRFRYVCEGPSHGGLPGASSEKNKKSYPQVKICNYVGPAKVIVQLVTN GKNIHLHAHSLVGKHCEDGICTVTAGPKDMVVGFANLGILHVTKKKVFETLEARMTEA CIRGYNPGLLVHPDLAYLQAEGGGDRQLGDREKELIRQAALQQTKEMDLSVVRLMFTA FLPDSTGSFTRRLEPVVSDAIYDSKAPNASNLKIVRMDRTAGCVTGGEEIYLLCDKVQ KDDIQIRFYEEEENGGVWEGFGDFSPTDVHRQFAIVFKTPKYKDINITKPASVFVQLR RKSDLETSEPKPFLYYPEIKDKEEVQRKRQKLMPNFSDSFGGGSGAGAGGGGMFGSGG GGGGTGSTGPGYSFPHYGFPTYGGITFHPGTTKSNAGMKHGTMDTESKKDPEGCDKSD DKNTVNLFGKVIETTEQDQEPSEATVGNGEVTLTYATGTKEESAGVQDNLFLEKAMQL AKRHANALFDYAVTGDVKMLLAVQRHLTAVQDENGDSVLHLAIIHLHSQLVRDLLEVT SGLISDDIINMRNDLYQTPLHLAVITKQEDVVEDLLRAGADLSLLDRLGNSVLHLAAK EGHDKVLSILLKHKKAALLLDHPNGDGLNAIHLAMMSNSLPCLLLLVAAGADVNAQEQ KSGRTALHLAVEHDNISLAGCLLLEGDAHVDSTTYDGTTPLHIAAGRGSTRLAALLKA AGADPLVENFEPLYDLDDSWENAGEDEGVVPGTTPLDMATSWQVFDILNGKPYEPEFT SDDLLAQGDMKQLAEDVKLQLYKLLEIPDPDKNWATLAQKLGLGILNNAFRLSPAPSK TLMDNYEVSGGTVRELVEALRQMGYTEAIEVIQAASSPVKTTSQAHSLPLSPASTRQQ IDELRDSDSVCDTGVETSFRKLSFTESLTSGASLLTLNKMPHDYGQEGPLEGKI" BASE COUNT 933 a 937 c 956 g 799 t ORIGIN 1 ggccaccgga gcggcccggc gacgatcgct gacagcttcc cctgcccttc ccgtcggtcg 61 ggccgccagc cgccgcagcc ctcggcctgc acgcagccac cggccccgct cccggagccc 121 agcgccgccg aggccgcagc cgcccggcca gtaaggcggc gccgcccgcg gccaccgcgg 181 gccctgccgt tccctccgcc gcgctgcgcc atggcgcggc gctgactggc ctggcccggc 241 cccgccgcgc tcccgctcgc cccgacccgc actcgggccc gcccgggctc cggcctgccg 301 ccgcctcttc cttctccagc cggcaggccc cgccgcttag gagggagagc ccacccgcgc 361 caggaggccg aacgcggact cgccacccgg cttcagaatg gcagaagatg atccatattt 421 gggaaggcct gaacaaatgt ttcatttgga tccttctttg actcatacaa tatttaatcc 481 agaagtattt caaccacaga tggcactgcc aacagatggc ccataccttc aaatattaga 541 gcaacctaaa cagagaggat ttcgtttccg ttatgtatgt gaaggcccat cccatggtgg 601 actacctggt gcctctagtg aaaagaacaa gaagtcttac cctcaggtca aaatctgcaa 661 ctatgtggga ccagcaaagg ttattgttca gttggtcaca aatggaaaaa atatccacct 721 gcatgcccac agcctggtgg gaaaacactg tgaggatggg atctgcactg taactgctgg 781 acccaaggac atggtggtcg gcttcgcaaa cctgggtata cttcatgtga caaagaaaaa 841 agtatttgaa acactggaag cacgaatgac agaggcgtgt ataaggggct ataatcctgg 901 actcttggtg caccctgacc ttgcctattt gcaagcagaa ggtggagggg accggcagct 961 gggagatcgg gaaaaagagc taatccgcca agcagctctg cagcagacca aggagatgga 1021 cctcagcgtg gtgcggctca tgtttacagc ttttcttccg gatagcactg gcagcttcac 1081 aaggcgcctg gaacccgtgg tatcagacgc catctatgac agtaaagccc ccaatgcatc 1141 caacttgaaa attgtaagaa tggacaggac agctggatgt gtgactggag gggaggaaat 1201 ttatcttctt tgtgacaaag ttcagaaaga tgacatccag attcgatttt atgaagagga 1261 agaaaatggt ggagtctggg aaggatttgg agatttttcc cccacagatg ttcatagaca 1321 atttgccatt gtcttcaaaa ctccaaagta taaagatatt aatattacaa aaccagcctc 1381 tgtgtttgtc cagcttcgga ggaaatctga cttggaaact agtgaaccaa aacctttcct 1441 ctactatcct gaaatcaaag ataaagaaga agtgcagagg aaacgtcaga agctcatgcc 1501 caatttttcg gatagtttcg gcggtggtag tggtgccgga gctggaggcg gaggcatgtt 1561 tggtagtggc ggtggaggag ggggcactgg aagtacaggt ccagggtata gcttcccaca 1621 ctatggattt cctacttatg gtgggattac tttccatcct ggaactacta aatctaatgc 1681 tgggatgaag catggaacca tggacactga atctaaaaag gaccctgaag gttgtgacaa 1741 aagtgatgac aaaaacactg taaacctctt tgggaaagtt attgaaacca cagagcaaga 1801 tcaggagccc agcgaggcca ccgttgggaa tggtgaggtc actctaacgt atgcaacagg 1861 aacaaaagaa gagagtgctg gagttcagga taacctcttt ctagagaagg ctatgcagct 1921 tgcaaagagg catgccaatg cccttttcga ctacgcggtg acaggagacg tgaagatgct 1981 gctggccgtc cagcgccatc tcactgctgt gcaggatgag aatggggaca gtgtcttaca 2041 cttagcaatc atccaccttc attctcaact tgtgagggat ctactagaag tcacatctgg 2101 tttgatttct gatgacatta tcaacatgag aaatgatctg taccagacgc ccttgcactt 2161 ggcagtgatc actaagcagg aagatgtggt ggaggatttg ctgagggctg gggccgacct 2221 gagccttctg gaccgcttgg gtaactctgt tttgcaccta gctgccaaag aaggacatga 2281 taaagttctc agtatcttac tcaagcacaa aaaggcagca ctacttcttg accaccccaa 2341 cggggacggt ctgaatgcca ttcatctagc catgatgagc aatagcctgc catgtttgct 2401 gctgctggtg gccgctgggg ctgacgtcaa tgctcaggag cagaagtccg ggcgcacagc 2461 actgcacctg gctgtggagc acgacaacat ctcattggca ggctgcctgc tcctggaggg 2521 tgatgcccat gtggacagta ctacctacga tggaaccaca cccctgcata tagcagctgg 2581 gagagggtcc accaggctgg cagctcttct caaagcagca ggagcagatc ccctggtgga 2641 gaactttgag cctctctatg acctggatga ctcttgggaa aatgcaggag aggatgaagg 2701 agttgtgcct ggaaccacgc ctctagatat ggccaccagc tggcaggtat ttgacatatt 2761 aaatgggaaa ccatatgagc cagagtttac atctgatgat ttactagcac aaggagacat 2821 gaaacagctg gctgaagatg tgaagctgca gctgtataag ttactagaaa ttcctgatcc 2881 agacaaaaac tgggctactc tggcgcagaa attaggtctg gggatactta ataatgcctt 2941 ccggctgagt cctgctcctt ccaaaacact tatggacaac tatgaggtct ctgggggtac 3001 agtcagagag ctggtggagg ccctgagaca aatgggctac accgaagcaa ttgaagtgat 3061 ccaggcagcc tccagcccag tgaagaccac ctctcaggcc cactcgctgc ctctctcgcc 3121 tgcctccaca aggcagcaaa tagacgagct ccgagacagt gacagtgtct gcgacacggg 3181 cgtggagaca tccttccgca aactcagctt taccgagtct ctgaccagtg gtgcctcact 3241 gctaactctc aacaaaatgc cccatgatta tgggcaggaa ggacctctag aaggcaaaat 3301 ttagcctgct gacaatttcc cacaccgtgt aaaccaaagc cctaaaattc cactgcgttg 3361 tccacaagac agaagctgaa gtgcatccaa aggtgctcag agagccggcc cgcctgaatc 3421 attctcgatt taactcgaga ccttttcaac ttggcttcct ttcttggttc ataaatgaat 3481 tttagtttgg ttcacttaca gatagtatct agcaatcaca acactggctg agcggatgca 3541 tctggggatg aggttgctta ctaagctttg ccagctgctg ctggatcaca gctgctttct 3601 gttgtcattg ctgttgtccc tctgc // LOCUS HUMNFR 3683 bp mRNA PRI 07-JAN-1995 DEFINITION Human tumor necrosis factor receptor mRNA, complete cds. ACCESSION M32315 NID g189185 KEYWORDS c-myc proto-oncogene; necrosis factor receptor. SOURCE Homo sapiens lung cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3683) AUTHORS Smith,C.A., Davis,T., Anderson,D., Solam,L., Beckmann,M.P., Jerzy,R., Dower,S.K., Cosman,D. and Goodwin,R.G. TITLE A receptor for tumor necrosis factor defines an unusual family of cellular and viral proteins JOURNAL Science 248 (4958), 1019-1023 (1990) MEDLINE 90260639 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Smith, 30-MAR-1990, for release after publication. FEATURES Location/Qualifiers source 1..3683 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="WI26-VA4" /cell_type="fibroblast" /tissue_type="lung" sig_peptide 90..155 /gene="tnfr" CDS 90..1475 /codon_start=1 /product="tumor necrosis factor receptor" /db_xref="PID:g189186" /translation="MAPVAVWAALAVGLELWAAAHALPAQVAFTPYAPEPGSTCRLRE YYDQTAQMCCSKCSPGQHAKVFCTKTSDTVCDSCEDSTYTQLWNWVPECLSCGSRCSS DQVETQACTREQNRICTCRPGWYCALSKQEGCRLCAPLRKCRPGFGVARPGTETSDVV CKPCAPGTFSNTTSSTDICRPHQICNVVAIPGNASMDAVCTSTSPTRSMAPGAVHLPQ PVSTRSQHTQPTPEPSTAPSTSFLLPMGPSPPAEGSTGDFALPVGLIVGVTALGLLII GVVNCVIMTQVKKKPLCLQREAKVPHLPADKARGTQGPEQQHLLITAPSSSSSSLESS ASALDRRAPTRNQPQAPGVEASGAGEARASTGSSDSSPGGHGTQVNVTCIVNVCSSSD HSSQCSSQASSTMGDTDSSPSESPKDEQVPFSKEECAFRSQLETPETLLGSTEEKPLP LGVPDAGMKPS" gene 90..155 /gene="tnfr" mat_peptide 156..1472 /product="tumor necrosis factor receptor" BASE COUNT 781 a 1098 c 1086 g 718 t ORIGIN 1 gcgagcgcag cggagcctgg agagaaggcg ctgggctgcg agggcgcgag ggcgcgaggg 61 cagggggcaa ccggaccccg cccgcaccca tggcgcccgt cgccgtctgg gccgcgctgg 121 ccgtcggact ggagctctgg gctgcggcgc acgccttgcc cgcccaggtg gcatttacac 181 cctacgcccc ggagcccggg agcacatgcc ggctcagaga atactatgac cagacagctc 241 agatgtgctg cagcaaatgc tcgccgggcc aacatgcaaa agtcttctgt accaagacct 301 cggacaccgt gtgtgactcc tgtgaggaca gcacatacac ccagctctgg aactgggttc 361 ccgagtgctt gagctgtggc tcccgctgta gctctgacca ggtggaaact caagcctgca 421 ctcgggaaca gaaccgcatc tgcacctgca ggcccggctg gtactgcgcg ctgagcaagc 481 aggaggggtg ccggctgtgc gcgccgctgc gcaagtgccg cccgggcttc ggcgtggcca 541 gaccaggaac tgaaacatca gacgtggtgt gcaagccctg tgccccgggg acgttctcca 601 acacgacttc atccacggat atttgcaggc cccaccagat ctgtaacgtg gtggccatcc 661 ctgggaatgc aagcatggat gcagtctgca cgtccacgtc ccccacccgg agtatggccc 721 caggggcagt acacttaccc cagccagtgt ccacacgatc ccaacacacg cagccaactc 781 cagaacccag cactgctcca agcacctcct tcctgctccc aatgggcccc agccccccag 841 ctgaagggag cactggcgac ttcgctcttc cagttggact gattgtgggt gtgacagcct 901 tgggtctact aataatagga gtggtgaact gtgtcatcat gacccaggtg aaaaagaagc 961 ccttgtgcct gcagagagaa gccaaggtgc ctcacttgcc tgccgataag gcccggggta 1021 cacagggccc cgagcagcag cacctgctga tcacagcgcc gagctccagc agcagctccc 1081 tggagagctc ggccagtgcg ttggacagaa gggcgcccac tcggaaccag ccacaggcac 1141 caggcgtgga ggccagtggg gccggggagg cccgggccag caccgggagc tcagattctt 1201 cccctggtgg ccatgggacc caggtcaatg tcacctgcat cgtgaacgtc tgtagcagct 1261 ctgaccacag ctcacagtgc tcctcccaag ccagctccac aatgggagac acagattcca 1321 gcccctcgga gtccccgaag gacgagcagg tccccttctc caaggaggaa tgtgcctttc 1381 ggtcacagct ggagacgcca gagaccctgc tggggagcac cgaagagaag cccctgcccc 1441 ttggagtgcc tgatgctggg atgaagccca gttaaccagg ccggtgtggg ctgtgtcgta 1501 gccaaggtgg gctgagccct ggcaggatga ccctgcgaag gggccctggt ccttccaggc 1561 ccccaccact aggactctga ggctctttct gggccaagtt cctctagtgc cctccacagc 1621 cgcagcctcc ctctgacctg caggccaaga gcagaggcag cgagttgggg aaagcctctg 1681 ctgccatggt gtgtccctct cggaaggctg gctgggcatg gacgttcggg gcatgctggg 1741 gcaagtccct gactctctgt gacctgcccc gcccagctgc acctgccagc ctggcttctg 1801 gagcccttgg gttttttgtt tgtttgtttg tttgtttgtt tgtttctccc cctgggctct 1861 gcccagctct ggcttccaga aaaccccagc atccttttct gcagaggggc tttctggaga 1921 ggagggatgc tgcctgagtc acccatgaag acaggacagt gcttcagcct gaggctgaga 1981 ctgcgggatg gtcctggggc tctgtgtagg gaggaggtgg cagccctgta gggaacgggg 2041 tccttcaagt tagctcagga ggcttggaaa gcatcacctc aggccaggtg cagtggctca 2101 cgcctatgat cccagcactt tgggaggctg aggcgggtgg atcacctgag gttaggagtt 2161 cgagaccagc ctggccaaca tggtaaaacc ccatctctac taaaaataca gaaattagcc 2221 gggcgtggtg gcgggcacct atagtcccag ctactcagaa gcctgaggct gggaaatcgt 2281 ttgaacccgg gaagcggagg ttgcagggag ccgagatcac gccactgcac tccagcctgg 2341 gcgacagagc gagagtctgt ctcaaaagaa aaaaaaaaaa gcaccgcctc caaatgctaa 2401 cttgtccttt tgtaccatgg tgtgaaagtc agatgcccag agggcccagg caggccacca 2461 tattcagtgc tgtggcctgg gcaagataac gcacttctaa ctagaaatct gccaattttt 2521 taaaaaagta agtaccactc aggccaacaa gccaacgaca aagccaaact ctgccagcca 2581 catccaaccc cccacctgcc atttgcaccc tccgccttca ctccggtgtg cctgcagccc 2641 cgcgcctcct tccttgctgt cctaggccac accatctcct ttcagggaat ttcaggaact 2701 agagatgact gagtcctcgt agccatctct ctactcctac ctcagcctag accctcctcc 2761 tcccccagag gggtgggttc ctcttcccca ctccccacct tcaattcctg ggccccaaac 2821 gggctgccct gccactttgg tacatggcca gtgtgatccc aagtgccagt cttgtgtctg 2881 cgtctgtgtt gcgtgtcgtg ggtgtgtgta gccaaggtcg gtaagttgaa tggcctgcct 2941 tgaagccact gaagctggga ttcctcccca ttagagtcag ccttccccct cccagggcca 3001 gggccctgca gaggggaaac cagtgtagcc ttgcccggat tctgggagga agcaggttga 3061 ggggctcctg gaaaggctca gtctcaggag catggggata aaggagaagg catgaaattg 3121 tctagcagag caggggcagg gtgataaatt gttgataaat tccactggac ttgagcttgg 3181 cagctgaact attggagggt gggagagccc agccattacc atggagacaa gaagggtttt 3241 ccaccctgga atcaagatgt cagactggct ggctgcagtg acgtgcacct gtactcagga 3301 ggctgagggg aggatcactg gagcccagga gtttgaggct gcagcgagct atgatcgcgc 3361 cactacactc cagcctgagc aacagagtga gaccctgtct cttaaagaaa aaaaaagtca 3421 gactgctggg actggccagg tttctgccca cattggaccc acatgaggac atgatggagc 3481 gcacctgccc cctggtggac agtcctggga gaacctcagg cttccttggc atcacagggc 3541 agagccggga agcgatgaat ttggagactc tgtggggcct tggttccctt gtgtgtgtgt 3601 gttgatccca agacaatgaa agtttgcact gtatgctgga cggcattcct gcttatcaat 3661 aaacctgttt gttttaaaaa aaa // LOCUS HUMNGFR 3386 bp mRNA PRI 11-AUG-1995 DEFINITION Human nerve growth factor receptor mRNA, complete cds. ACCESSION M14764 NID g189204 KEYWORDS nerve growth factor receptor. SOURCE Homo sapiens (clone: E1.) melanoma cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3386) AUTHORS Johnson,D., Lanahan,A., Buck,C.R., Sehgal,A., Morgan,C., Mercer,E., Bothwell,M. and Chao,M. TITLE Expression and structure of the human NGF receptor JOURNAL Cell 47 (4), 545-554 (1986) MEDLINE 87051725 COMMENT Draft entry and clean copy sequence for [1] kindly provided by M.Bothwell, 28-APR-1987. FEATURES Location/Qualifiers source 1..3386 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="E1." /cell_line="A875" /tissue_type="melanoma" /map="17q21-q22" mRNA <1..3386 /gene="NGFR" /note="G00-120-234" gene 1..3386 /gene="NGFR" sig_peptide 114..197 /gene="NGFR" /note="G00-120-234" CDS 114..1397 /gene="NGFR" /codon_start=1 /db_xref="GDB:G00-120-234" /product="nerve growth factor receptor" /db_xref="PID:g189205" /translation="MGAGATGRAMDGPRLLLLLLLGVSLGGAKEACPTGLYTHSGECC KACNLGEGVAQPCGANQTVCEPCLDSVTFSDVVSATEPCKPCTECVGLQSMSAPCVEA DDAVCRCAYGYYQDETTGRCEACRVCEAGSGLVFSCQDKQNTVCEECPDGTYSDEANH VDPCLPCTVCEDTERQLRECTRWADAECEEIPGRWITRSTPPEGSDSTAPSTQEPEAP PEQDLIASTVAGVVTTVMGSSQPVVTRGTTDNLIPVYCSILAAVVVGLVAYIAFKRWN SCKQNKQGANSRPVNQTPPPEGEKLHSDSGISVDSQSLHDQQPHTQTASGQALKGDGG LYSSLPPAKREEVEKLLNGSAGDTWRHLAGELGYQPEHIDSFTHEACPVRALLASWAT QDSATLDALLAALRRIQRADLVESLCSESTATSPV" mat_peptide 198..1394 /gene="NGFR" /note="G00-120-234" /product="nerve growth factor receptor" variation 1105 /gene="NGFR" /note="g sometimes t" /replace="t" BASE COUNT 656 a 1104 c 1030 g 596 t ORIGIN 5 bp upstream of SacII site; 17q21-q22. 1 gccgcggcca gctccggcgg gcaggggggg cgctggagcg cagcgcagcg cagccccatc 61 agtccgcaaa gcggaccgag ctggaagtcg agcgctgccg cgggaggcgg gcgatggggg 121 caggtgccac cggccgcgcc atggacgggc cgcgcctgct gctgttgctg cttctggggg 181 tgtcccttgg aggtgccaag gaggcatgcc ccacaggcct gtacacacac agcggtgagt 241 gctgcaaagc ctgcaacctg ggcgagggtg tggcccagcc ttgtggagcc aaccagaccg 301 tgtgtgagcc ctgcctggac agcgtgacgt tctccgacgt ggtgagcgcg accgagccgt 361 gcaagccgtg caccgagtgc gtggggctcc agagcatgtc ggcgccgtgc gtggaggccg 421 acgacgccgt gtgccgctgc gcctacggct actaccagga tgagacgact gggcgctgcg 481 aggcgtgccg cgtgtgcgag gcgggctcgg gcctcgtgtt ctcctgccag gacaagcaga 541 acaccgtgtg cgaggagtgc cccgacggca cgtattccga cgaggccaac cacgtggacc 601 cgtgcctgcc ctgcaccgtg tgcgaggaca ccgagcgcca gctccgcgag tgcacacgct 661 gggccgacgc cgagtgcgag gagatccctg gccgttggat tacacggtcc acacccccag 721 agggctcgga cagcacagcc cccagcaccc aggagcctga ggcacctcca gaacaagacc 781 tcatagccag cacggtggca ggtgtggtga ccacagtgat gggcagctcc cagcccgtgg 841 tgacccgagg caccaccgac aacctcatcc ctgtctattg ctccatcctg gctgctgtgg 901 ttgtgggcct tgtggcctac atagccttca agaggtggaa cagctgcaag cagaacaagc 961 aaggagccaa cagccggcca gtgaaccaga cgcccccacc agagggagaa aaactccaca 1021 gcgacagtgg catctccgtg gacagccaga gcctgcatga ccagcagccc cacacgcaga 1081 cagcctcggg ccaggccctc aagggtgacg gaggcctcta cagcagcctg cccccagcca 1141 agcgggagga ggtggagaag cttctcaacg gctctgcggg ggacacctgg cggcacctgg 1201 cgggcgagct gggctaccag cccgagcaca tagactcctt tacccatgag gcctgccccg 1261 ttcgcgccct gcttgcaagc tgggccaccc aggacagcgc cacactggac gccctcctgg 1321 ccgccctgcg ccgcatccag cgagccgacc tcgtggagag tctgtgcagt gagtccactg 1381 ccacatcccc ggtgtgagcc caaccgggga gcccccgccc cgccccacat tccgacaacc 1441 gatgctccag ccaacccctg tggagcccgc acccccaccc tttggggggg gcccgcctgg 1501 cagaactgag ctcctctggg caggacctca gagtccaggc cccaaaacca cagccctgtc 1561 agtgcagccc gtgtggcccc ttcacttctg accacacttc ctgtccagag agagaagtgc 1621 ccctgctgcc tccccaaccc tgcccctgcc ccgtcaccat ctcaggccac ctgccccctt 1681 ctcccacact gctaggtggg ccagcccctc ccaccacagc aggtgtcata tatggggggc 1741 caacaccagg gatggtacta gggggaagtg acaaggcccc agagactcag agggaggaat 1801 cgaggaacca gagccatgga ctctacactg tgaacttggg gaacaagggt ggcatcccag 1861 tggcctcaac cctccctcag cccctcttgc cccccacccc agcctaagat gaagaggatc 1921 ggaggcttgt cagagctggg aggggttttc gaagctcagc ccacccccct cattttggat 1981 ataggtcagt gaggcccagg gagaggccat gattcgccca aagccagaca gcaacgggga 2041 ggccaagtgc aggctggcac cgccttctct aaatgagggg cctcaggttt gcctgagggc 2101 gaggggaggg tggcaggtga ccttctggga aatggcttga agccaagtca gctttgcctt 2161 ccacgctgtc tccagacccc caccccttcc ccactgcctg cccacccgtg gagatgggat 2221 gcttgcctag ggcctggtcc atgatggagt caggtttggg gttcgtggaa agggtgctgc 2281 ttccctctgc ctgtccctct caggcatgcc tgtgtgacat cagtggcatg gctccagtct 2341 gctgccctcc atcccgacat ggacccggag ctaacactgg cccctagaat cagcctaggg 2401 gtcagggacc aaggacccct caccttgcaa cacacagaca cacgcacaca cacacacagg 2461 aggagaaatc tcacttttct ccatgagttt tttctcttgg gctgagactg gatactgccc 2521 ggggcagctg ccagagaagc atcggaggga attgaggtct gctcggccgt cttcactcgc 2581 ccccgggttt ggcgggccaa ggactgccga ccgaggctgg agctggcgtc tgtcttcaag 2641 ggcttacacg tggaggaatg ctcccccatc ctccccttcc ctgcaaacat ggggttggct 2701 gggcccagaa ggttgcgatg aagaaaagcg ggccagtgtg ggaatgcggc aagaaggaat 2761 tgacttcgac tgtgacctgt ggggatttct cccagctcta gacaaccctg caaaggactg 2821 ttttttcctg agcttggcca gaagggggcc atgaggcctc agtggacttt ccaccccctc 2881 cctggcctgt tctgttttgc ctgaagttgg agtgagtgtg gctcccctct atttagcatg 2941 acaagcccca ggcaggctgt gcgctgacaa ccaccgctcc ccagcccagg gttcccccag 3001 ccctgtggaa gggactagga gcactgtagt aaatggcaat tctttgacct caacctgtga 3061 tgaggggagg aaactcacct gctggcccct cacctgggca cctggggagt gggacagagt 3121 ctgggtgtat ttattttcct ccccagcagg tggggagggg gtttggtggc ttgcaagtat 3181 gttttagcat gtgtttggtt ctggggcccc tttttactcc ccttgagctg agatggaacc 3241 cttttggccc ccagctgggg gccatgagct ccagaccccc agcaaccctc ctatcacctc 3301 ccctccttgc ctcctgtgta atcatttctt gggccctcct gaaacttaca cacaaaacgt 3361 taagtgatga acattaaata gcaaag // LOCUS HUMNGP78A 1765 bp mRNA PRI 17-OCT-1995 DEFINITION Homo sapiens autocrine motility factor receptor (AMFR) mRNA, complete cds. ACCESSION L35233 NID g521220 KEYWORDS autocrine motility factor receptor. SOURCE human cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1765) AUTHORS Huang,B., Xie,Y. and Raz,A. TITLE Identification of an upstream region that controls the transcription of the human autocrine motility factor receptor JOURNAL Biochem. Biophys. Res. Commun. 212 (3), 727-742 (1995) MEDLINE 95352090 FEATURES Location/Qualifiers source 1..1765 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="30 week old" /tissue_type="placenta" gene 669..1640 /gene="AMFR" CDS 669..1640 /gene="AMFR" /note="Ngp78; ORF; putative" /codon_start=1 /product="autocrine motility factor receptor" /db_xref="PID:g521221" /translation="MRDSACWSQRKDELLQQARKRFLNKSSEDDAASESFLPSEGASS DPVTLRRRMLAAARNGGFRSSRPPSAPLPSSAASCALCPTDWRRPVPILPLHGKAGLT ALPLYKACGLIVFGQLINLILLCNTFYVTFLFPLETLQILTVGMISSGVDWTAWGGGR SGGSEPVACLQQAASTPASCIRPTNAGVLSTTPSGKSVGEAHSVSPPPRRGVTSVIKL LSLLWKHVDCARARPTGSCTPEQQGILEKELLVRYLEQRRGKSRAIGCDEVTPFCPTT SGTDFPSLQSKAGLISVNSGAPASHECAPWVPSPLSISLSRLDLGSG" BASE COUNT 404 a 465 c 471 g 425 t ORIGIN 1 gaaactgcct gtgaacatct tttccacaac tcctgtcttc gttcctggct agaacaagac 61 acctcctgtc caacatgcag aatgtctctt aatattgccg acaatatcgt gtcagggaag 121 aacatcaagg agagaacttg gatgagaatt tggttcctga tgcagcagcc gaaggagacc 181 tcgcttaaac caacacaatc acttcttcca tttcgatggg tctcggattg cgagctggct 241 gccgagtttt tcggttgaag tgatgcacac caccaacatt cttggcatta cgcaggccag 301 caactcccag ctcaatgcaa tggctcatca gattcaagag atgtttcccc aggttccata 361 ccatctggta ctgcaggacc tccagctgac acgctcagtt aaataacaac agtcaatatt 421 ttagaaggac ggattcaact accttttcct acacagcggt cagatagcat cagacctgca 481 ttgaacagtc ctgtggaaag gccaagcagt gaccaggaag agggagaaac ttctgctcag 541 accgagcgtg gtgccactgg acctcagtcc tcgcctggag gagacgctgg acttcggcga 601 ggtggaagtg gagcccagtg aggtggaaga cttcgaggct cgtgggagcc gcttctccaa 661 gtctgctgat gagagacagc gcatgctggt cgcagcgtaa ggacgaactc ctccagcaag 721 ctcgcaaacg tttcttgaac aaaagttctg aagatgatgc ggcctcagag agcttcctcc 781 cctcggaagg tgcgtcctct gaccccgtga ccctgcgtcg aaggatgctg gctgccgcgc 841 ggaacggagg cttcagaagc agcagacctc ctagcgctcc cttgccttcc tcagctgcct 901 cctgcgccct gtgcccgact gactggagga ggcctgtccc aattctgccg ctccatggaa 961 aagcgggctt gactgcattg ccgctgtata aagcatgtgg tcttatagtg tttggacagc 1021 tgataaattt aatccttctt tgtaatactt tctatgtgac atttctcttc cccttagaaa 1081 cactgcaaat tttaactgta ggtatgatct cttctggtgt tgactggact gcttggggtg 1141 ggggacgatc aggaggaagt gagccagtcg cctgcctgca gcaggcagct tctactcctg 1201 cctcatgcat acgtcccaca aatgcaggtg tcctgagcac cacacccagt gggaagagtg 1261 tgggggaggc gcacagtgtg agcccgcccc cacgtcgtgg ggtaacatct gttatcaaac 1321 tgctgtcgtt gttgtggaag catgtagact gtgccagagc cagacccacg ggctcatgca 1381 cccctgagca gcagggcatc ttggaaaagg aactcttggt tcgatacctg gagcagagga 1441 ggggaaagtc cagggctata gggtgtgatg aagtcacccc tttctgtccc actacatctg 1501 ggactgactt tccgagcctc cagtccaaag ccggcttgat ttccgtgaac tctggtgctc 1561 ctgcatctca tgagtgtgcc ccatgggtcc cctcccctct cagcatttcc ttgtcccgtc 1621 tggacctggg gagtggttag gcagcaagct ttggcttatg gttttcattc attggtgaag 1681 taaattaggc tagtgcacgt aaagcctgtg ggtttggtcc ttgaacaaga tgtgggcctt 1741 gcaagatggg agagtaaacc ttaag // LOCUS HUMNID 4898 bp mRNA PRI 07-JAN-1995 DEFINITION Human nidogen mRNA, complete cds. ACCESSION M30269 NID g189208 KEYWORDS nidogen. SOURCE Human placenta and skin fibroblast, cDNA to mRNA, clones cHFN-[5,7,8,12,16,26]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4898) AUTHORS Nagayoshi,T., Sanborn,D., Hickok,N.J., Olsen,D.R., Fazio,M.J., Chu,M.-L., Knowlton,R., Mann,K., Deutzmann,R., Timpl,R. and Uitto,J. TITLE Human nidogen: complete amino acid sequence and structural domains deduced from cDNAs, and evidence for polymorphism of the gene JOURNAL DNA 8 (8), 581-594 (1989) MEDLINE 90091745 FEATURES Location/Qualifiers source 1..4898 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="fibroblast" /tissue_type="placenta; skin" /map="1q43" sig_peptide 91..174 /gene="NID" /note="G00-120-236" CDS 91..3834 /gene="NID" /codon_start=1 /db_xref="GDB:G00-120-236" /product="nidogen" /db_xref="PID:g189209" /translation="MLASSSRIRAAWTRALLLPLLLAGPVGCLSRQELFPFGPGQGDL ELEDGDDFVSPALELSGALRFYDRSDIDAVYVTTNGIIATSEPPAKESHPGLFPPTFG AVAPFLADLDTTDGLGKVYYREDLSPSITQRAAECVHRGFPEISFQPSSAVVVTWESV APYQGPSRDPDQKGKRNTFQAVLASSDSSSYAIFLYPEDGLQFHTTFSKKENNQVPAV VAFSQGSVGFLWKSNGAYNIFANDRESIENLAKSSNSGQQGVWVFEIGSPATTNGVVP ADVILGTEDGAEYDDEDEDYDLATTRLGLEDVGTTPFSYKALRRGGADTYSVPSVLSP RRAATERPLGPPTERTRSFQLAVETFHQQHPQVIDVDEVEETGVVFSYNTDSRQTCAN NRHQCSVHAECRDYATGFCCSCVAGYTGNGRQCVAEGSPQRVNGKVKGRIFVGSSQVP IVFENTDLHSYVVMNHGRSYTAISTIPETVGYSLLPLAPVGGIIGWMFAVEQDGFKNG FSITGGEFTRQAEVTFVGHPGNLVIKQRFSGIDEHGHLTIDTELEGRVPQIPFGSSVH IEPYTELYHYSTSVITSSSTREYTVTEPERDGASPSRIYTYQWRQTITFQECVHDDSR PALPSTQQLSVDSVFVLYNQEEKILRYAFSNSIGPVREGSPDALQNPCYIGTHGCDTN AACRPGPRTQFTCECSIGFRGDGRTCYDIDECSEQPSVCGSHTICNNHPGTFRCECVE GYQFSDEGTCVAVVDQRPINYCETGLHNCDIPQRAQCIYTGGSSYTCSCLPGFSGDGQ ACQDVDECQPSRCHPDAFCYNTPGSFTCQCKPGYQGDGFRCVPGEVEKTRCQHEREHI LGAAGATDPQRPIPPGLFVPECDAHGHYAPTQCHGSTGYCWCVDRDGREVEGTRTRPG MTPPCLSTVAPPIHQGPAVPTAVIPLPPGTHLLFAQTGKIERLPLEGNTMRKTEAKAF LHVPAKVIIGLAFDCVDKMVYWTDITEPSIGRASLHGGEPTTIIRQDLGSPEGIAVDH LGRNIFWTDSNLDRIEVAKLDGTQRRVLFETDLVNPRGIVTDSVRGNLYWTDWNRDNP KIETSYMDGTNRRILVQDDLGLPNGLHFDAFSSQLCWVDAGTNRAECLNPSQPSRRKA LEGLQYPFAVTSYGKNLYFTDWKMNSVVALDLAISKETDAFQPHKQTRLYGITTALSQ CPQGHNYCSVNNGGCTHLCLATPGSRTCRCPDNTLGVDCIERK" gene 91..3834 /gene="NID" mat_peptide 175..3831 /gene="NID" /note="G00-120-236" /product="nidogen" BASE COUNT 1096 a 1417 c 1293 g 1092 t ORIGIN 1 gaattccggc tgccaggggc gtccggttac atccccgcct tcctctgtcc tggccgcggg 61 accgggtttg cgggaccgca gttcgggaac atgttggcct cgagcagccg gatccgggct 121 gcgtggacgc gggcgctgct gctgccgctg ctgctggcgg ggcctgtggg ctgcctgagc 181 cgccaggagc tctttccctt cggccccgga cagggggacc tggagctgga ggacggggat 241 gacttcgtct ctcctgccct ggagctgagt ggggcgctcc gcttctacga cagatccgac 301 atcgacgcag tctacgtcac cacaaatggc atcattgcta cgagtgaacc cccggccaaa 361 gaatcccatc ccgggctctt cccaccaaca ttcggtgcag tcgccccttt cctggcggac 421 ttggacacga ccgatggcct ggggaaggtt tattatcgag aagacttatc cccctccatc 481 actcagcgag cagcagagtg tgtccacaga gggttcccgg agatctcttt ccagcctagt 541 agcgcggtgg ttgtcacttg ggaatccgtg gccccctacc aagggcccag cagggaccca 601 gaccagaaag gcaagagaaa cacgttccag gctgttctag cctcctctga ttccagctcc 661 tatgccattt tcctttatcc tgaggatggt ctgcagttcc atacgacatt ctcaaagaag 721 gaaaacaacc aagttcctgc cgtggttgca ttcagtcaag gttcagtggg attcttatgg 781 aagagcaacg gagcttataa catatttgct aatgacaggg aatcaattga aaatttggcc 841 aagagtagta actctgggca gcagggtgtc tgggtgtttg agattgggag tccagccacc 901 accaatggcg tggtgcctgc agacgtgatc ctcggaactg aagatggggc agagtatgat 961 gatgaggatg aagattatga cctggcgacc actcgtctgg gcctggagga tgtgggcacc 1021 acgcccttct cctacaaggc tctgagaagg ggaggtgctg acacatacag tgtgcccagc 1081 gtcctctccc cgcgccgggc agctaccgaa aggccccttg gacctcccac agagagaacc 1141 aggtctttcc agttggcagt ggagactttt caccagcagc accctcaggt catagatgtg 1201 gatgaagttg aggaaacagg agttgttttc agctataaca cggattcccg ccagacgtgt 1261 gctaacaaca gacaccagtg ctcggtgcac gcagagtgca gggactacgc cacgggcttc 1321 tgctgcagct gtgtcgctgg ctatacgggc aatggcaggc aatgtgttgc agaaggttcc 1381 ccccagcgag tcaatggcaa ggtgaaagga aggatctttg tggggagcag ccaggtcccc 1441 attgtctttg agaacactga cctccactct tacgtagtaa tgaaccacgg gcgctcctac 1501 acagccatca gcaccattcc cgagaccgtt ggatattctc tgcttccact ggccccagtt 1561 ggaggcatca ttggatggat gtttgcagtg gagcaggacg gattcaagaa tgggttcagc 1621 atcaccgggg gtgagttcac tcgccaggct gaggtgacct tcgtggggca cccgggcaat 1681 ctggtcatta agcagcggtt cagcggcatc gatgagcatg ggcacctgac catcgacacg 1741 gagctggagg gccgcgtgcc gcagattccg ttcggctcct ccgtgcacat tgagccctac 1801 acggagctgt accactactc cacctcagtg atcacttcct cctccacccg ggagtacacg 1861 gtgactgagc ccgagcgaga tggggcatct ccttcacgca tctacactta ccagtggcgc 1921 cagaccatca ccttccagga atgcgtccac gatgactccc ggccagccct gcccagcacc 1981 cagcagctct cggtggacag cgtgttcgtc ctgtacaacc aggaggagaa gatcttgcgc 2041 tacgctttca gcaactccat tgggcctgtg agggaaggct cccctgatgc tcttcagaat 2101 ccctgctaca tcggcactca tgggtgtgac accaacgcgg cctgtcgccc tggtcccagg 2161 acacagttca cctgcgagtg ctccatcggc ttccgaggag acgggcgaac ctgctatgat 2221 attgatgaat gttcagaaca accctcagtg tgtgggagcc acacaatctg caataatcac 2281 ccaggaacct tccgctgcga gtgtgtggag ggctaccagt tttcagatga gggaacgtgt 2341 gtggctgtcg tggaccagcg ccccatcaac tactgtgaaa ctggccttca taactgcgac 2401 ataccccagc gggcccagtg tatctacaca ggaggctcct cctacacctg ttcctgcttg 2461 ccaggctttt ctggggatgg ccaagcctgc caagatgtag atgaatgcca gccaagccga 2521 tgtcaccctg acgccttctg ctacaacact ccaggctctt tcacgtgcca gtgcaaacct 2581 ggttatcagg gagacggctt ccgttgcgtg cccggagagg tggagaaaac ccggtgccag 2641 cacgagcgag aacacattct cggggcagcg ggggcgacag acccacagcg acccattcct 2701 ccggggctgt tcgttcctga gtgcgatgcg cacgggcact acgcgcccac ccagtgccac 2761 ggcagcaccg gctactgctg gtgcgtggat cgcgacggcc gcgaggtgga gggcaccagg 2821 accaggcccg ggatgacgcc cccgtgtctg agtacagtgg ctcccccgat tcaccaagga 2881 cctgcggtgc ctaccgccgt gatccccttg cctcctggga cccatttact ctttgcccag 2941 actgggaaga ttgagcgcct gcccctggag ggaaatacca tgaggaagac agaagcaaag 3001 gcgttccttc atgtcccggc taaagtcatc attggactgg cctttgactg cgtggacaag 3061 atggtttact ggacggacat cactgagcct tccattggga gagctagtct acatggtgga 3121 gagccaacca ccatcattag acaagatctt ggaagtccag aaggtatcgc tgttgatcac 3181 cttggccgca acatcttctg gacagactct aacctggatc gaatagaagt ggcgaagctg 3241 gacggcacgc agcgccgggt gctctttgag actgacctgg tgaatcccag aggcattgta 3301 acggattccg tgagagggaa cctttactgg acagactgga acagagataa ccccaagatt 3361 gaaacttcct acatggacgg cacgaaccgg aggatccttg tgcaggatga cctgggcttg 3421 cccaatggac tgcacttcga tgcgttctca tctcagctct gctgggtgga tgcaggcacc 3481 aatcgggcgg aatgcctgaa ccccagtcag cccagcagac gcaaggctct cgaagggctc 3541 cagtatcctt ttgctgtgac gagctacggg aagaatctgt atttcacaga ctggaagatg 3601 aattccgtgg ttgctctcga tcttgcaatt tccaaggaga cggatgcttt ccaaccccac 3661 aagcagaccc ggctgtatgg catcaccacg gccctgtctc agtgtccgca aggccataac 3721 tactgctcag tgaacaatgg cggctgcacc cacctatgct tggccacccc agggagcagg 3781 acctgccgtt gccctgacaa caccttggga gttgactgta tcgaacggaa atgaagacaa 3841 gagtgcctta tttcctttcc aagtatttca cagcaacact ctacttgaag caacttggtc 3901 cagattgaaa agtgtcctct ggctgagtgg ccactaggcc cagacccagc ccagcctgag 3961 ccccaacaac aacttttccc tcactgttcc ccaaaacatg caccctggac ttctctaata 4021 gaaaagtctc cacccctaca caaggacaga accctccacc cctaccccca accctcagac 4081 agacttatac acccctgagt gaggattaca tgcccatccc agtgtcctag gaccttttcc 4141 caatactagc cccccagtgg tgaacagaac ctcccaaatt tgagttgcac ccttccctgt 4201 ggccttatga gctcagcctc gctttgaggt acccaccgtc ctgtcagctc cttgacctat 4261 gagctggggc ctgactagga aaagttggga gttaaggagg aaattagcat tccttaatgt 4321 tttgttttgg tgctctgaat ttcttcttta ttatagtcct atagttttac tcctcagttc 4381 ctcaccatca tcatcttgtc taagaccccc attataatat tcatgcgctg ctttttcatc 4441 aaaacctacc ctgtcctaga gatctatggg catttggtgg atgataatga gcagcccctc 4501 ccagatagaa tgtcaatatt tgagcagtag gatattggca tttgttagtt aaaggcttaa 4561 atcaaaagaa tgtccaatgg taggaatttc aaggtgtagg tcagatattt gagaataggg 4621 gatttttttg atgtgcctta aattatacca aagattacta attattcctc tttgcccaaa 4681 atacttgcat ccaaggttct agtctctgtt gctgtgctgg tctttagccc cactgctggc 4741 actgatgtcc ctcctttttc acggagacct atctgaggta caggatgggg ctggcaccag 4801 atgatgtccc accacagtcc ctcacctccg gcctccacat gacagaacca atttacactc 4861 aaccatgacc tcacccctcc ttggtttctc cctccccg // LOCUS HUMNIOXSYN 4077 bp mRNA PRI 11-SEP-1992 DEFINITION Human nitric oxide synthase mRNA, complete cds. ACCESSION M93718 NID g189211 KEYWORDS nitric oxide synthase; vasodilator. SOURCE Homo sapiens umbilical vein cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Janssens,S.P., Shimoushi,A., Quertermous,T., Bloch,D.B. and Blach,K.D. TITLE Cloning and expression of a cDNA encoding human endothelium-derived relaxing factor/nitric oxide synthase JOURNAL J. Biol. Chem. 267, 14519-14522 (1992) MEDLINE 92340475 FEATURES Location/Qualifiers source 1..4077 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="endothelial" /tissue_type="umbilical vein" CDS 47..3658 /codon_start=1 /product="nitric oxide synthase" /db_xref="PID:g189212" /translation="MGNLKSVAQEPGPPCGLGLGLGLGLCGKQGPATPAPEPSRAPAS LLPPAPEHSPPSSPLTQPPEGPKFPRVKNWEVGSITYDTLSAQAQQDGPCTPRRCLGS LVFPRKLQGRPSPGPPAPEQLLSQARDFINQYYSSIKRSGSQAHEQRLQEVEAEVAAT GTYQLRESELVFGAKQAWRNAPRCVGRIQWGKLQVFDARDCRSAQEMFTYICNHIKYA TNRGNLRSAITVFPQRCPGRGDFRIWNSQLVRYAGYRQQDGSVRGDPANVEITELCIQ HGWTPGNGRFDVLPLLLQAPDEPPELFLLPPELVLEVPLEHPTLEWFAALGLRWYALP AVSNMLLEIGGLEFPAAPFSGWYMSTEIGTRNLCDPHRYNILEDVAVCMDLDTRTTSS LWKDKAAVEINVAVLHSYQLAKVTIVDHHAATASFMKHLENEQKARGGCPADWAWIVP PISGSLTPVFHQEMVNYFLSPAFRYQPDPWKGSAAKGTGITRKKTFKEVANAVKISAS LMGTVMAKRVKATILYGSETGRAQSYAQQLGRLFRKAFDPRVLCMDEYDVVSLEHETL VLVVTSTFGNGDPPENGESFAAALMEMSGPYNSSPRPEQHKSYKIRFNSISCSDPLVS SWRRKRKESSNTDSAGALGTLRFCVFGLGSRAYPHFCAFARAVDTRLEELGGERLLQL GQGDELCGQEEAFRGWAQAAFQAACETFCVGEDAKAAARDIFSPKRSWKRQRYRLSAQ AEGLQLLPGLIHVHRRKMFQATIRSVENLQSSKSTRATILVRLDTGGQEGLQYQPGDH IGVCPPNRPGLVEALLSRVEDPPAPTEPVAVEQLEKGSPGGPPPGWVRDPRLPPCTLR QALTFFLDITSPPSPQLLRLLSTLAEEPREQQELEALSQDPRRYEEWKWFRCPTLLEV LEQFPSVALPAPLLLTQLPLLQPRYYSVSSAPSTHPGEIHLTVAVLAYRTQDGLGPLH YGVCSTWLSQLKPGDPVPCFIRGAPSFRLPPDPSLPCILVGPGTGIAPFRGFWQERLH DIESKGLQPTPMTLVFGCRCSQLDHLYRDEVQNAQQRGVFGRVLTAFSREPDNPKTYV QDILRTELAAEVHRVLCLERGHMFVCGDVTMATNVLQTVQRILATEGDMELDEAGDVI GVLRDQQRYHEDIFGLTLRTQEVTSRIRTQSFSLQERQLRGAVPWAFDPPGSDTNSP" polyA_site 4077 BASE COUNT 763 a 1371 c 1234 g 709 t ORIGIN 1 gaattcccac tctgctgcct gctccagcag acggacgcac agtaacatgg gcaacttgaa 61 gagcgtggcc caggagcctg ggccaccctg cggcctgggg ctggggctgg gccttgggct 121 gtgcggcaag cagggcccag ccaccccggc ccctgagccc agccgggccc cagcatccct 181 actcccacca gcgccagaac acagcccccc gagctccccg ctaacccagc ccccagaggg 241 gcccaagttc cctcgtgtga agaactggga ggtggggagc atcacctatg acaccctcag 301 cgcccaggcg cagcaggatg ggccctgcac cccaagacgc tgcctgggct ccctggtatt 361 tccacggaaa ctacagggcc ggccctcccc cggccccccg gcccctgagc agctgctgag 421 tcaggcccgg gacttcatca accagtacta cagctccatt aagaggagcg gctcccaggc 481 ccacgaacag cggcttcaag aggtggaagc cgaggtggca gccacaggca cctaccagct 541 tagggagagc gagctggtgt tcggggctaa gcaggcctgg cgcaacgctc cccgctgcgt 601 gggccggatc cagtggggga agctgcaggt gttcgatgcc cgggactgca ggtctgcaca 661 ggaaatgttc acctacatct gcaaccacat caagtatgcc accaaccggg gcaaccttcg 721 ctcggccatc acagtgttcc cgcagcgctg ccctggccga ggagacttcc gaatctggaa 781 cagccagctg gtgcgctacg cgggctaccg gcagcaggac ggctctgtgc ggggggaccc 841 agccaacgtg gagatcaccg agctctgcat tcagcacggc tggaccccag gaaacggtcg 901 cttcgacgtg ctgcccctgc tgctgcaggc cccagatgag cccccagaac tcttccttct 961 gccccccgag ctggtccttg aggtgcccct ggagcacccc acgctggagt ggtttgcagc 1021 cctgggcctg cgctggtacg ccctcccggc agtgtccaac atgctgctgg aaattggggg 1081 cctggagttc cccgcagccc ccttcagtgg ctggtacatg agcactgaga tcggcacgag 1141 gaacctgtgt gaccctcacc gctacaacat cctggaggat gtggctgtct gcatggacct 1201 ggatacccgg accacctcgt ccctgtggaa agacaaggca gcagtggaaa tcaacgtggc 1261 cgtgctgcac agttaccagc tagccaaagt caccatcgtg gaccaccacg ccgccacggc 1321 ctctttcatg aagcacctgg agaatgagca gaaggccagg gggggctgcc ctgcagactg 1381 ggcctggatc gtgcccccca tctcgggcag cctcactcct gttttccatc aggagatggt 1441 caactatttc ctgtccccgg ccttccgcta ccagccagac ccctggaagg ggagtgccgc 1501 caagggcacc ggcatcacca ggaagaagac ctttaaagaa gtggccaacg ccgtgaagat 1561 ctccgcctcg ctcatgggca cggtgatggc gaagcgagtg aaggcgacaa tcctgtatgg 1621 ctccgagacc ggccgggccc agagctacgc acagcagctg gggagactct tccggaaggc 1681 ttttgatccc cgggtcctgt gtatggatga gtatgacgtg gtgtccctcg aacacgagac 1741 gctggtgctg gtggtaacca gcacatttgg gaatggggat cccccggaga atggagagag 1801 ctttgcagct gccctgatgg agatgtccgg cccctacaac agctcccctc ggccggaaca 1861 gcacaagagt tataagatcc gcttcaacag catctcctgc tcagacccac tggtgtcctc 1921 ttggcggcgg aagaggaagg agtccagtaa cacagacagt gcaggggccc tgggcaccct 1981 caggttctgt gtgttcgggc tcggctcccg ggcatacccc cacttctgcg cctttgctcg 2041 tgccgtggac acacggctgg aggaactggg cggggagcgg ctgctgcagc tgggccaggg 2101 cgacgagctg tgcggccagg aggaggcctt ccgaggctgg gcccaggctg ccttccaggc 2161 cgcctgtgag accttctgtg tgggagagga tgccaaggcc gccgcccgag acatcttcag 2221 ccccaaacgg agctggaagc gccagaggta ccggctgagc gcccaggccg agggcctgca 2281 gttgctgcca ggtctgatcc acgtgcacag gcggaagatg ttccaggcta caatccgctc 2341 agtggaaaac ctgcaaagca gcaagtccac gagggccacc atcctggtgc gcctggacac 2401 cggaggccag gaggggctgc agtaccagcc gggggaccac ataggtgtct gcccgcccaa 2461 ccggcccggc cttgtggagg cgctgctgag ccgcgtggag gacccgccgg cgcccactga 2521 gcccgtggca gtagagcagc tggagaaggg cagccctggt ggccctcccc ccggctgggt 2581 gcgggacccc cggctgcccc cgtgcacgct gcgccaggct ctcaccttct tcctggacat 2641 cacctcccca cccagccctc agctcttgcg gctgctcagc accttggcag aagagcccag 2701 ggaacagcag gagctggagg ccctcagcca ggatccccga cgctacgagg agtggaagtg 2761 gttccgctgc cccacgctgc tggaggtgct ggagcagttc ccgtcggtgg cgctgcctgc 2821 cccactgctc ctcacccagc tgcctctgct ccagccccgg tactactcag tcagctcggc 2881 acccagcacc cacccaggag agatccacct cactgtagct gtgctggcat acaggactca 2941 ggatgggctg ggccccctgc actatggagt ctgctccacg tggctaagcc agctcaagcc 3001 cggagaccct gtgccctgct tcatccgggg ggctccctcc ttccggctgc cacccgatcc 3061 cagcttgccc tgcatcctgg tgggtccagg cactggcatt gcccccttcc ggggattctg 3121 gcaggagcgg ctgcatgaca ttgagagcaa agggctgcag cccactccca tgactttggt 3181 gttcggctgc cgatgctccc aacttgacca tctctaccgc gacgaggtgc agaacgccca 3241 gcagcgcggg gtgtttggcc gagtcctcac cgccttctcc cgggaacctg acaaccccaa 3301 gacctacgtg caggacatcc tgaggacgga gctggctgcg gaggtgcacc gcgtgctgtg 3361 cctcgagcgg ggccacatgt ttgtctgcgg cgatgttacc atggcaacca acgtcctgca 3421 gaccgtgcag cgcatcctgg cgacggaggg cgacatggag ctggacgagg ccggcgacgt 3481 catcggcgtg ctgcgggatc agcaacgcta ccacgaagac attttcgggc tcacgctgcg 3541 cacccaggag gtgacaagcc gcatacgcac ccagagcttt tccttgcagg agcgtcagtt 3601 gcggggcgca gtgccctggg cgttcgaccc tcccggctca gacaccaaca gcccctgaga 3661 gccgcctggc tttcccttcc agttccggga gagcggctgc ccgactcagg tccgcccgac 3721 caggatcagc cccgctcctc ccctcttgag gtggtgcctt ctcacatctg tccagaggct 3781 gcaaggattc agcattattc ctccaggaag gagcaaaacg cctcttttcc ctctctaggc 3841 ctgttgcctc gggcctgggt ccgccttaat ctggaaggcc cctcccagca gcggtacccc 3901 agggcctact gccacccgct tcctgtttct tagtccgaat gttagattcc tcttgcctct 3961 ctcaggagta tcttacctgt aaagtctaat ctctaaatca agtatttatt attgaagatt 4021 taccataagg gactgtgcca gatgttagga gaactactaa agtgcctacc ccagctc // LOCUS HUMNK1A 1466 bp mRNA PRI 07-JAN-1995 DEFINITION Human NK-1 receptor mRNA, complete cds. ACCESSION M81797 NID g189213 KEYWORDS NK-1 receptor. SOURCE Homo sapiens lung cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1466) AUTHORS Hopkins,B., Powell,S.J., Danks,P., Briggs,I. and Graham,A. TITLE Isolation and characterization of the human lung NK1-receptor cDNA JOURNAL Biochem. Biophys. Res. Commun. (1991) In press FEATURES Location/Qualifiers source 1..1466 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="lung" /map="Unassigned" gene 45..1268 /gene="TAC1R" CDS 45..1268 /gene="TAC1R" /codon_start=1 /db_xref="GDB:G00-128-977" /product="NK-1 receptor" /db_xref="PID:g189214" /translation="MDNVLPVDSDLSPNISTNTSEPNQFVQPAWQIVLWAAAYTVIVV TSVVGNVVVMWIILAHKRMRTVTNYFLVNLAFAEASMAAFNTVVNFTYAVHNEWYYGL FYCKFHNFFPIAAVFASIYSMTAVAFDRYMAIIHPLQPRLSATATKVVICVIWVLALL LAFPQGYYSTTETMPSRVVCMIEWPEHPNKIYEKVYHICVTVLIYFLPLLVIGYAYTV VGITLWASEIPGDSSDRYHEQVSAKRKVVKMMIVVVCTFAICWLPFHIFFLLPYINPD LYLKKFIQQVYLAIMWLAMSSTMYNPIIYCCLNDRFRLGFKHAFRCCPFISAGDYEGL EMKSTRYLQTQGSVYKVSRLETTISTVVGAHEEEPEDGPKATPSSLDLTSNCSSRSDS KTMTESFSFSSNVLS" BASE COUNT 324 a 457 c 342 g 343 t ORIGIN 1 gggggttgtg tacagatagg tagggcttta cgcctagctt cgaaatggat aacgtcctcc 61 cggtggactc agacctctcc ccaaacatct ccactaacac ctcggaaccc aatcagttcg 121 tgcaaccagc ctggcaaatt gtcctttggg cagctgccta cacggtcatt gtggtgacct 181 ctgtggtggg caacgtggta gtgatgtgga tcatcttagc ccacaaaaga atgaggacag 241 tgacgaacta ttttctggtg aacctggcct tcgcggaggc ctccatggct gcattcaata 301 cagtggtgaa cttcacctat gctgtccaca acgaatggta ctacggcctg ttctactgca 361 agttccacaa cttcttcccc atcgccgctg tcttcgccag tatctactcc atgacggctg 421 tggcctttga taggtacatg gccatcatac atcccctcca gccccggctg tcagccacag 481 ccaccaaagt ggtcatctgt gtcatctggg tcctggctct cctgctggcc ttcccccagg 541 gctactactc aaccacagag accatgccca gcagagtcgt gtgcatgatc gaatggccag 601 agcatccgaa caagatttat gagaaagtgt accacatctg tgtgactgtg ctgatctact 661 tcctccccct gctggtgatt ggctatgcat acaccgtagt gggaatcaca ctatgggcca 721 gtgagatccc cggggactcc tctgaccgct accacgagca agtctctgcc aagcgcaagg 781 tggtcaaaat gatgattgtc gtggtgtgca ccttcgccat ctgctggctg cccttccaca 841 tcttcttcct cctgccctac atcaacccag atctctacct gaagaagttt atccagcagg 901 tctacctggc catcatgtgg ctggccatga gctccaccat gtacaacccc atcatctact 961 gctgcctcaa tgacaggttc cgtctgggct tcaagcatgc cttccggtgc tgccccttca 1021 tcagcgccgg cgactatgag gggctggaaa tgaaatccac ccggtatctc cagacccagg 1081 gcagtgtgta caaagtcagc cgcctggaga ccaccatctc cacagtggtg ggggcccacg 1141 aggaggagcc agaggacggc cccaaggcca caccctcgtc cctggacctg acctccaact 1201 gctcttcacg aagtgactcc aagaccatga cagagagctt cagcttctcc tccaatgtgc 1261 tctcctaggc cacagggctt tggcaggtgc agcccccact gcctttgacc tgcctccctt 1321 catgcatgga aattcccttc atctggaacc atcagaaaca ccctcacact gggacttgca 1381 aaaagggtca gtatgggtta gggaaaacat tccatccttg agtcaaaaaa tctcaattct 1441 tccctatctt tgccaccctc atgctg // LOCUS HUMNK3R 1755 bp mRNA PRI 29-MAY-1992 DEFINITION Human neurokinin 3 receptor (NK3R) mRNA, complete cds. ACCESSION M89473 NID g189223 KEYWORDS neurokinin 3 receptor. SOURCE Homo sapiens brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1755) AUTHORS Huang,R.-R.C., Cheung,A.H., Mazina,K.E., Strader,C.D. and Fong,T.M. TITLE cDNA sequence and heterologous expression of the human neurokinin-3 receptor JOURNAL Biochem. Biophys. Res. Commun. 184, 966-972 (1992) MEDLINE 92246993 FEATURES Location/Qualifiers source 1..1755 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" CDS 144..1541 /standard_name="NK3 receptor" /codon_start=1 /function="binds neurokinin B" /evidence=experimental /product="neurokinin-3 receptor" /db_xref="PID:g189224" /translation="MATLPAAETWIDGGGGVGADAVNLTASLAAGAATGAVETGWLQL LDQAGNLSSSPSALGLPVASPAPSQPWANLTNQFVQPSWRIALWSLAYGVVVAVAVLG NLIVIWIILAHKRMRTVTNYFLVNLAFSDASMAAFNTLVNFIYALHSEWYFGANYCRF QNFFPITAVFASIYSMTAIAVDRYMAIIDPLKPRLSATATKIVIGSIWILAFLLAFPQ CLYSKTKVMPGRTLCFVQWPEGPKQHFTYHIIVIILVYCFPLLIMGITYTIVGITLWG GEIPGDTCDKYHEQLKAKRKVVKMMIIVVMTFAICWLPYHIYFILTAIYQQLNRWKYI QQVYLASFWLAMSSTMYNPIIYCCLNKRFRAGFKRAFRWCPFIKVSSYDELELKTTRF HPNRQSSMYTVTRMESMTVVFDPNDADTTRSSRKKRATPRDPSFNGCSRRNSKSASAT SSFISSPYTSVDEYS" BASE COUNT 432 a 469 c 403 g 451 t ORIGIN 1 ctattgcagt atctttcagc ttccagtctt atctgaagac cccggcacca aagtgaccag 61 gaggcagaga agaacttcag aggagtctcg tcttgggctg cccgtgggtg agtgggaggg 121 tccgggactg cagaccggtg gcgatggcca ctctcccagc agcagaaacc tggatagacg 181 ggggtggagg cgtgggtgca gacgccgtga acctgaccgc ctcgctagct gccggggcgg 241 ccacgggggc agttgagact gggtggctgc aactgctgga ccaagctggc aacctctcct 301 cctccccttc cgcgctggga ctgcctgtgg cttcccccgc gccctcccag ccctgggcca 361 acctcaccaa ccagttcgtg cagccgtcct ggcgcatcgc gctctggtcc ctggcgtatg 421 gtgtggtggt ggcagtggca gttttgggaa atctcatcgt catctggatc atcctggccc 481 acaagcgcat gaggactgtc accaactact tccttgtgaa cctggctttc tccgacgcct 541 ccatggccgc cttcaacacg ttggtcaatt tcatctacgc gcttcatagc gagtggtact 601 ttggcgccaa ctactgccgc ttccagaact tctttcctat cacagctgtg ttcgccagca 661 tctactccat gacggccatt gcggtggaca ggtatatggc tattattgat cccttgaaac 721 ccagactgtc tgctacagca accaagattg tcattggaag tatttggatt ctagcatttc 781 tacttgcctt ccctcagtgt ctttattcca aaaccaaagt catgccaggc cgtactctct 841 gctttgtgca atggccagaa ggtcccaaac aacatttcac ttaccatatt atcgtcatta 901 tactggtgta ctgtttccca ttgctcatca tgggtattac atacaccatt gttggaatta 961 ctctctgggg aggagaaatc ccaggagata cctgtgacaa gtatcatgag cagctaaagg 1021 ccaaaagaaa ggttgtcaaa atgatgatta ttgttgtcat gacatttgct atctgctggc 1081 tgccctatca tatttacttc attctcactg caatctatca acaactaaat agatggaaat 1141 acatccagca ggtctacctg gctagctttt ggctggcaat gagctcaacc atgtacaatc 1201 ccatcatcta ctgctgtctg aataaaagat ttcgagctgg cttcaagaga gcatttcgct 1261 ggtgtccttt catcaaagtt tccagctatg atgagctaga gctcaagacc accaggtttc 1321 atccaaaccg gcaaagcagt atgtacaccg tgaccagaat ggagtccatg acagtcgtgt 1381 ttgaccccaa cgatgcagac accaccaggt ccagtcggaa gaaaagagca acgccaagag 1441 acccaagttt caatggctgc tctcgcagga attccaaatc tgcctccgcc acttcaagtt 1501 tcataagctc accctatacc tctgtggatg aatattctta attccatttc ctgaggtaaa 1561 agattagtgt gagaccatca tggtgccagt ctaggacccc attctcctat ttatcagtcc 1621 tgtcctatat accctctaga aacagaaagc aatttttagg cagctatggt caaattgaga 1681 aaggtagtgt ataaatgtga caaagacact aataacatgt tagcctccac ccaaaataaa 1741 atgggcttta aattt // LOCUS HUMNKB 640 bp mRNA PRI 07-JAN-1995 DEFINITION Human neuromedin B mRNA, complete cds. ACCESSION M21551 J03948 NID g189227 KEYWORDS neuromedin. SOURCE Human hypothalamus, cDNA to mRNA, (library of R.Goodman). ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 640; 1 to 640) AUTHORS Krane,I.M., Naylor,S.L., Helin-Davis,D., Chin,W.W. and Spindel,E.R. TITLE Molecular cloning of cDNAs encoding the human bombesin-like peptide neuromedin B. Chromosomal localization and comparison to cDNAs encoding its amphibian homolog ranatensin [published erratum appears in J Biol Chem 1990 Apr 25;265(12):7091] JOURNAL J. Biol. Chem. 263 (26), 13317-13323 (1988) MEDLINE 88330837 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by I.M.Krane, 07-NOV-1988, and [J. Biol. Chem. (1989) In press] 12-DEC-1989. The authors [J. Biol. Chem. (1989) In press] gratefully acknowledge Jim Battey and James Way for bringing this correction to their attention. FEATURES Location/Qualifiers source 1..640 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="hypothalamus" /tissue_lib="of R.Goodman" /map="15q11-qter" mRNA 1..639 /gene="NMB" /note="G00-120-237" gene 1..639 /gene="NMB" sig_peptide 37..108 /gene="NMB" /note="G00-120-237" CDS 37..402 /gene="NMB" /codon_start=1 /db_xref="GDB:G00-120-237" /product="neuromedin B" /db_xref="PID:g189228" /translation="MARRAGGARMFGSLLLFALLAAGVAPLSWDLPEPRSRASKIRVH SRGNLWATGHFMGKKSLEPSSPSHWGQLPTPPLRDQRLQLSHDLLGILLLKKALGVSL SRPAPQIQYRRLLVQILQK" mat_peptide 109..207 /gene="NMB" /note="G00-120-237" /product="neuromedin B" BASE COUNT 131 a 204 c 176 g 129 t ORIGIN Chromosome 15 q11-qter. 1 cgcgcgcccg aacgaagccg cggcccgggc acagccatgg cccggcgggc ggggggcgct 61 cggatgttcg gcagcctcct gctcttcgcc ctgctcgctg ccggcgtcgc cccgctcagc 121 tgggatctcc cggagccccg cagccgagcc agcaagatcc gagtgcactc gcgaggcaac 181 ctctgggcca ccggtcactt catgggcaag aagagtctgg agccttccag cccatcccat 241 tggggacagc tccccacacc tcccctgagg gaccagcgac tgcagctgag tcatgatctg 301 ctcggaatcc tcctgctaaa gaaggctctg ggcgtgagcc tcagccgccc cgcaccccaa 361 atccagtaca ggaggctgct ggtacaaata ctgcagaaat gacaccaata ataggggcag 421 acacaacagc gtggcttaga ttgtgcccac ccagggaagg tgctgaatgg gaccctgttg 481 atggccccat ctggatgtaa atcctgagct caaatctctg ttactccatt actgtgattt 541 ctggctgggt caccagaaat atcgctgatg cagacacaga ttatgttcct gctgtatttc 601 ctgcttccct gttgaattgg tgaataaaac cttgctcttt // LOCUS HUMNLK 1987 bp mRNA PRI 26-OCT-1992 DEFINITION Human neuroleukin mRNA, complete cds. ACCESSION K03515 NID g189237 KEYWORDS growth factor; lymphokine; neuroleukin; neurotrophic factor. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1987) AUTHORS Gurney,M.E. JOURNAL Unpublished (1987) COMMENT Draft entry and computer-readable sequence for [1] kindly provided by M.Gurney, 09-MAR-1987. FEATURES Location/Qualifiers source 1..1987 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..1987 /note="NL mRNA" CDS 16..1692 /codon_start=1 /product="neuroleukin" /db_xref="PID:g189238" /translation="MAALTRDPQFQKLQQWYREHRSELNLRRLFDANKDRFNHFSLTL NTNHGHILVDYSKNLVTEDVMRMLVDLAKSRGVEAARERMFNGEKINYTEGRAVLHVA LRNRSNTPILVDGKDVMPEVNKVLDKMKSFCQRVRSGDWKGYTGKTITDVINIGIVGS DLGPLMVTEALKPYSSGGPRVWYVSNIDGTHIAKTLAQLNPESSLFIIASKTFTTQET ITNAETAKEWFLQAAKDPSAVAKHFVALSTNTTKVKEFGIDPQNMFEFWDWVGGRYSL WSAIGLSIALHVGFDNFEQLLSGAHWMDQHFRTTPLEKNAPVLLALLGIWYINCFGCE THAMLPYDQYLHRFAAYFQQGDMESNGKYITKSGTRVDHQTGPIVWGEPGTNGQHAFY QLIHQGTKMIPCDFLIPVQTQHPIRKGLHHKILLANFLAQTEALMRGKSTEEARKELQ AAGKSPEDLERLLPHKVFEGNRPTNSIVFTKLTPFMLGALVAMYEHKIFVQGIIWDIN SFDQWGVELGKQLAKKIEPELDGSAQVTSHDASTNGLINFIKQQREARVQ" BASE COUNT 449 a 595 c 531 g 412 t ORIGIN 1 bp upstream of XhoI site. 1 ctcgagagct ccgccatggc cgctctcacc cgggaccccc agttccagaa gctgcagcaa 61 tggtaccgcg agcaccgctc cgagctgaac ctgcgccgcc tcttcgatgc caacaaggac 121 cgcttcaacc acttcagctt gaccctcaac accaaccatg ggcatatcct ggtggattac 181 tccaagaacc tggtgacgga ggacgtgatg cggatgctgg tggacttggc caagtccagg 241 ggcgtggagg ccgcccggga gcggatgttc aatggtgaga agatcaacta caccgagggt 301 cgagccgtgc tgcacgtggc tctgcggaac cggtcaaaca cacccatcct ggtagacggc 361 aaggatgtga tgccagaggt caacaaggtt ctggacaaga tgaagtcttt ctgccagcgt 421 gtccggagcg gtgactggaa ggggtacaca ggcaagacca tcacggacgt catcaacatt 481 ggcattgtcg gctccgacct gggacccctc atggtgactg aagcccttaa gccatactct 541 tcaggaggtc cccgcgtctg gtatgtctcc aacattgatg gaactcacat tgccaaaacc 601 ctggcccagc tgaacccgga gtcctccctg ttcatcattg cctccaagac ctttactacc 661 caggagacca tcacgaatgc agagacggcg aaggagtggt ttctccaggc ggccaaggat 721 ccttctgcag tggcgaagca ctttgttgcc ctgtctacta acacaaccaa agtgaaggag 781 tttggaattg accctcaaaa catgttcgag ttctgggatt gggtgggagg acgctactcg 841 ctgtggtcgg ccatcggact ctccattgcc ctgcacgtgg gttttgacaa cttcgagcag 901 ctgctctcgg gggctcactg gatggaccag cacttccgca cgacgcccct ggagaagaac 961 gcccccgtct tgctggccct gctgggtatc tggtacatca actgctttgg gtgtgagaca 1021 cacgccatgc tgccctatga ccagtacctg caccgctttg ctgcgtactt ccagcagggc 1081 gacatggagt ccaatgggaa atacatcacc aaatctggaa cccgtgtgga ccaccagaca 1141 ggccccattg tgtgggggga gccagggacc aatggccagc atgcttttta ccagctcatc 1201 caccaaggca ccaagatgat accctgtgac ttcctcatcc cggtccagac ccagcacccc 1261 atacggaagg gtctgcatca caagatcctc ctggccaact tcttggccca gacagaggcc 1321 ctgatgaggg gaaaatcgac ggaggaggcc cgaaaggagc tccaggctgc gggcaagagt 1381 ccagaggacc ttgagaggct gctgccacat aaggtctttg aaggaaatcg cccaaccaac 1441 tctattgtgt tcaccaagct cacaccattc atgcttggag ccttggtcgc catgtatgag 1501 cacaagatct tcgttcaggg catcatctgg gacatcaaca gctttgacca gtggggagtg 1561 gagctgggaa agcagctggc taagaaaata gagcctgagc ttgatggcag tgctcaagtg 1621 acctctcacg acgcttctac caatgggctc atcaacttca tcaagcagca gcgcgaggcc 1681 agagtccaat aaactcgtgc tcatctgcag cctcctctgt gactcccctt tctcttctcg 1741 tccctcctcc ccggagccgg cactgcatgt tcctggacac cacccagagc accctctggt 1801 tgtgggcttg gaccacgagc ccttagcagg gaaggctggt ctcccccagc ctaaccccca 1861 gcccctccat gtctatgctc cctctgtgtt agaattggct gaagtgtttt tgtgcagctg 1921 acttttctga cccatgttca cgttgttcac atcccatgta gaaaaacaaa gatgccacgg 1981 aggaggt // LOCUS HUMNMBR 1352 bp mRNA PRI 07-JAN-1995 DEFINITION Human neuromedin B receptor (NMB-R) mRNA, complete cds. ACCESSION M73482 NID g189241 KEYWORDS G-protein coupled receptor; bombesin peptide receptor; growth factor receptor; neuromedin B receptor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1352) AUTHORS Corjay,M.H., Dobrzanski,D.J., Way,J.M., Viallet,J., Shapira,H., Worland,P., Sausville,E.A. and Battey,J.F. TITLE Two distinct bombesin receptor subtypes are expressed and functional in human lung carcinoma cells JOURNAL J. Biol. Chem. 266 (28), 18771-18779 (1991) MEDLINE 92011639 FEATURES Location/Qualifiers source 1..1352 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="NCI-H345" /map="6q21-qter" mRNA 1..1352 /gene="NMB-R" gene 1..1352 /gene="NMB-R" gene 140..1312 /gene="NMBR" CDS 140..1312 /gene="NMBR" /codon_start=1 /db_xref="GDB:G00-128-063" /product="neuromedin B receptor" /db_xref="PID:g189242" /translation="MPSKSLSNLSVTTGANESGSVPEGWERDFLPASDGTTTELVIRC VIPSLYLLIITVGLLGNIMLVKIFITNSAMRSVPNIFISNLAAGDLLLLLTCVPVDAS RYFFDEWMFGKVGCKLIPVIQLTSVGVSVFTLTALSADRYRAIVNPMDMQTSGALLRT CVKAMGIWVVSVLLAVPEAVFSEVARISSLDNSSFTACIPYPQTDELHPKIHSVLIFL VYFLIPLAIISIYYYHIAKTLIKSAHNLPGEYNEHTKKQMETRKRLAKIVLVFVGCFI FCWFPNHILYMYRSFNYNEIDPSLGHMIVTLVARVLSFGNSCVNPFALYLLSESFRRH FNSQLCCGRKSYQERGTSYLLSSSAVRMTSLKSNAKNMVTNSVLLNGHSMKQEMAM" BASE COUNT 318 a 351 c 332 g 351 t ORIGIN 1 gtgctgtgag gcttgcccgc ggacagtaaa cttgcagggg cgagagggag ggacatcgat 61 taaacctaaa tcgtgggcgt tcagtcctca gggcaccgag cgcgtgaaaa ctccagcgga 121 ctctgctgga aaggagatca tgccctctaa gtctctttcc aacctctcgg tgaccaccgg 181 cgcgaatgag agcggttccg ttcccgaggg gtgggaaagg gatttcctgc cggcctcgga 241 cgggaccacc acggagttgg tgatccgctg tgtgatcccg tccctctacc tgctcatcat 301 caccgtgggc ttgctgggca acatcatgct ggtgaagatc ttcatcacca acagcgccat 361 gaggagcgtc cccaacatct tcatctctaa cctggcggcc ggggacttgc tgctgctgct 421 cacctgcgtc ccggtggacg cctcgcgcta cttcttcgac gagtggatgt ttggcaaggt 481 gggctgcaaa ctgatccctg tcatccagct cacttccgtg ggggtttccg tgttcactct 541 cactgccctc agcgccgaca ggtacagagc catcgttaac cccatggaca tgcagacgtc 601 aggggcattg ctgcggacct gtgtgaaggc catgggtatc tgggtggtct ccgtgttgct 661 ggcagttccc gaagcggtgt tttcagaagt ggctcgcatc agtagcttgg ataatagcag 721 cttcacagca tgtatcccat accctcaaac agatgaatta catccaaaga ttcattcagt 781 gctcattttc ttggtctatt tcctcatacc acttgctatt attagcattt attattatca 841 tattgcaaag accttaatta aaagcgcaca caatcttcct ggagaataca atgaacatac 901 caaaaaacag atggaaacac ggaaacgcct ggctaaaatt gtgcttgtct ttgtgggctg 961 tttcatcttc tgttggtttc caaaccacat cctttacatg tatcggtctt tcaactataa 1021 tgagattgat ccatctctag gccacatgat tgtcacctta gttgcccggg ttctcagttt 1081 tggcaattct tgtgtcaacc catttgctct ttacctactc agtgaaagct tcaggaggca 1141 tttcaacagc caactctgct gtgggaggaa gtcctatcaa gagagaggaa ccagctacct 1201 actcagctct tcagcggtgc gtatgacatc tctgaaaagc aatgctaaga acatggtgac 1261 caattctgtt ttactaaatg ggcacagcat gaagcaggaa atggcaatgt gattttggcc 1321 attcaactca ctacctggag agaacttagt aa // LOCUS HUMNMDAREC 3245 bp mRNA PRI 30-AUG-1994 DEFINITION Homo sapiens NMDA receptor subunit (NR1) mRNA, complete cds. ACCESSION L05666 NID g307302 KEYWORDS NMDA receptor; transmembrane protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3245) AUTHORS Planells-Cases,R., Sun,W., Ferrer-Montiel,A.V. and Montal,M. TITLE Molecular cloning, functional expression, and pharmacological characterization of an N-methyl-D-aspartate receptor subunit from human brain JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90 (11), 5057-5061 (1993) MEDLINE 93281695 FEATURES Location/Qualifiers source 1..3245 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" sig_peptide 18..71 /gene="NR1" CDS 18..2675 /codon_start=1 /product="NMDA receptor" /db_xref="PID:g307303" /translation="MSTMRLLTLALLFSCSVARAACDPKIVNIGAVLSTRKHEQMFRE AVNQANKRHGSWKIQLNATSVTHKPNAIQMALSVCEDLISSQVYAILVSHPPTPNDHF TPTPVSYTAGFYRIPVLGLTTRMSIYSDKSIHLSFLRTVPPYSHQSSVWFEMMRVYSW NHIILLVSDDHEGRAAQKRLETLLEERESKAEKVLQFDPGTKNVTALLMEAKELEARV IILSASEDDAATVYRAAAMLNMTGSGYVWLVGEREISGNALRYAPDGILGLQLINGKN ESAHISDAVGVVAQAVHELLEKENITDPPRGCVGNTNIWKTGPLFKRVLMSSKYADGV TGRVEFNEDGDRKFANYSIMNLQNRKLVQVGIYNGTHVIPNDRKIIWPGGETEKPRGY QMSTRLKIVTIHQEPFVYVKPTLSDGTCKEEFTVNGDPVKKVICTGPNDTSPGSPRHT VPQCCYGFCIDLLIKLARTMNFTYEVHLVADGKFGTQERVNNSNKKEWNGMMGELLSG QADMIVAPLTINNERAQYIEFSKPFKYQGLTILVKKEIPRSTLDSFMQPFQSTLWLLV GLSVHVVAVMLYLLDRFSPFGRFKVNSEEEEEDALTLSSAMWFSWGVLLNSGIGEGAP RSFSARILGMVWAGFAMIIVASYTANLAAFLVLDRPEERITGINDPRLRNPSDKFIYA TVKQSSVDIYFRRQVELSTMYRHMEKHNYESAAEAIQAVRDNKLHAFIWDSAVLEFEA SQKCDLVTTGELFFRSGFGIGMRKDSPWKQNVSLSILKSHENGFMEDLDKTWVRYQEC DSRSNAPATLTFENMAGVFMLVAGGIVAGIFLIFIEIAYKRHKDARRKQMQLAFAAVN VWRKNLQQYHPTDITGPLNLSDPSVSTVV" gene 18..2672 /gene="NR1" mat_peptide 72..2672 /gene="NR1" /note="transmembrane protein" /function="NMDA selective" /function="cation channel" /function="glutamate receptor" /product="NMDA receptor" BASE COUNT 633 a 1067 c 1001 g 544 t ORIGIN 1 gcccgcggcc cgagcccatg agcaccatgc gcctgctgac gctcgccctg ctgttctcct 61 gctccgtcgc ccgtgccgcg tgcgacccca agatcgtcaa cattggcgcg gtgctgagca 121 cgcggaagca cgagcagatg ttccgcgagg ccgtgaacca ggccaacaag cggcacggct 181 cctggaagat tcagctcaat gccacctccg tcacgcacaa gcccaacgcc atccagatgg 241 ctctgtcggt gtgcgaggac ctcatctcca gccaggtcta cgccatccta gttagccatc 301 cacctacccc caacgaccac ttcactccca cccctgtctc ctacacagcc ggcttctacc 361 gcatacccgt gctggggctg accacccgca tgtccatcta ctcggacaag agcatccacc 421 tgagcttcct gcgcaccgtg ccgccctact cccaccagtc cagcgtgtgg tttgagatga 481 tgcgtgtcta cagctggaac cacatcatcc tgctggtcag cgacgaccac gagggccggg 541 cggctcagaa acgcctggag acgctgctgg aggagcgtga gtccaaggca gagaaggtgc 601 tgcagtttga cccagggacc aagaacgtga cggccctgct gatggaggcg aaagagctgg 661 aggcccgggt catcatcctt tctgccagcg aggacgatgc tgccactgta taccgcgcag 721 ccgcgatgct gaacatgacg ggctccgggt acgtgtggct ggtcggcgag cgcgagatct 781 cggggaacgc cctgcgctac gccccggacg gcatcctcgg gctgcagctc atcaacggca 841 agaacgagtc ggcccacatc agcgacgccg taggcgtggt ggcccaggcc gtgcacgagc 901 tcctcgagaa ggagaacatc accgacccgc cgcggggctg cgtgggcaac accaacatct 961 ggaagaccgg gccgctcttc aagagagtgc tgatgtcttc caagtatgcg gatggggtga 1021 ctggtcgcgt ggagttcaat gaggatgggg accggaagtt cgccaactac agcatcatga 1081 acctgcagaa ccgcaagctg gtgcaagtgg gcatctacaa tggcacccac gtcatcccta 1141 atgacaggaa gatcatctgg ccaggcggag agacagagaa gcctcgaggg taccagatgt 1201 ccaccagact gaagattgtg acgatccacc aggagccctt cgtgtacgtc aagcccacgc 1261 tgagtgatgg gacatgcaag gaggagttca cagtcaacgg cgacccagtc aagaaggtga 1321 tctgcaccgg gcccaacgac acgtcgccgg gcagcccccg ccacacggtg cctcagtgtt 1381 gctacggctt ttgcatcgac ctgctcatca agctggcacg gaccatgaac ttcacctacg 1441 aggtgcacct ggtggcagat ggcaagttcg gcacacagga gcgggtgaac aacagcaaca 1501 agaaggagtg gaatgggatg atgggcgagc tgctcagcgg gcaggcagac atgatcgtgg 1561 cgccgctaac cataaacaac gagcgcgcgc agtacatcga gttttccaag cccttcaagt 1621 accagggcct gactattctg gtcaagaagg agattccccg gagcacgctg gactcgttca 1681 tgcagccgtt ccagagcaca ctgtggctgc tggtggggct gtcggtgcac gtggtggccg 1741 tgatgctgta cctgctggac cgcttcagcc ccttcggccg gttcaaggtg aacagcgagg 1801 aggaggagga ggacgcactg accctgtcct cggccatgtg gttctcctgg ggcgtcctgc 1861 tcaactccgg catcggggaa ggcgccccca gaagcttctc agcgcgcatc ctgggcatgg 1921 tgtgggccgg ctttgccatg atcatcgtgg cctcctacac cgccaacctg gcggccttcc 1981 tggtgctgga ccggccggag gagcgcatca cgggcatcaa cgaccctcgg ctgaggaacc 2041 cctcggacaa gtttatctac gccacggtga agcagagctc cgtggatatc tacttccggc 2101 gccaggtgga gctgagcacc atgtaccggc atatggagaa gcacaactac gagagtgcgg 2161 cggaggccat ccaggccgtg agagacaaca agctgcatgc cttcatctgg gactcggcgg 2221 tgctggagtt cgaggcctcg cagaagtgcg acctggtgac gactggagag ctgtttttcc 2281 gctcgggctt cggcataggc atgcgcaaag acagcccctg gaagcagaac gtctccctgt 2341 ccatcctcaa gtcccacgag aatggcttca tggaagacct ggacaagacg tgggttcggt 2401 atcaggaatg tgactcgcgc agcaacgccc ctgcgaccct tacttttgag aacatggccg 2461 gggtcttcat gctggtagct gggggcatcg tggccgggat cttcctgatt ttcatcgaga 2521 ttgcctacaa gcggcacaag gatgctcgcc ggaagcagat gcagctggcc tttgccgccg 2581 ttaacgtgtg gcggaagaac ctgcagcagt accatcccac tgatatcacg ggcccgctca 2641 acctctcaga tccctcggtc agcaccgtgg tgtgaggccc ccggaggcgc ccacctgccc 2701 agttagcccg gccaaggaca ctgatgggtc ctgctgctcg ggaaggcctg agggaagccc 2761 acccgcccca gagactgccc accctgggcc tcccgtccgt ccgcccgccc accccgctgc 2821 ctggcgggca gcccctgctg gaccaaggtg cggaccggag cggctgagga cggggcagag 2881 ctgagtcggc tgggcagggc cgcagggcgc tccggcagag gcagggccct ggggtctctg 2941 agcagtgggg agcgggggct aactggcccc aggcggaggg gcttggagca gagacggcag 3001 ccccatcctt cccgcagcac cagcctgagc cacagtgggg cccatggccc cagctggctg 3061 ggtcgcccct cctcgggcgc ctgcgctcct ctgcagcctg agctccaccc tcccctcttc 3121 ttgcggcacc gcccacccac accccgtctg ccccttgacc ccacacgccg gggctggccc 3181 tgccctcccc cacggccgtc cctgacttcc cagctgcagc gcctcccgcc gcctcgggcc 3241 gcctc // LOCUS HUMNMOR 2447 bp mRNA PRI 07-JAN-1995 DEFINITION Human, NAD(P)H:menadione oxidoreductase mRNA, complete cds. ACCESSION J03934 NID g189245 KEYWORDS NAD(P)H:menadione oxidoreductase; flavoprotein. SOURCE Human liver, cDNA to mRNA, clones HNMOR[1a,1b,1c,1d]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2447) AUTHORS Jaiswal,A.K., McBride,O.W., Adesnik,M. and Nebert,D.W. TITLE Human dioxin-inducible cytosolic NAD(P)H:menadione oxidoreductase. cDNA sequence and localization of gene to chromosome 16 JOURNAL J. Biol. Chem. 263 (27), 13572-13578 (1988) MEDLINE 88330879 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by A.Jaiswal, 16-JUN-1988. An Alu repeat is located between the polyadenylation signals located at positions 1460-1465 and 1838-1843. FEATURES Location/Qualifiers source 1..2447 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /map="16q22.1" gene 51..875 /gene="NMOR1" CDS 51..875 /gene="NMOR1" /codon_start=1 /db_xref="GDB:G00-120-238" /product="NAD(P)H:menadione oxidoreductase" /db_xref="PID:g189246" /translation="MVGRRALIVLAHSERTSFNYAMKEAAAAALKKKGWEVVESDLYA MNFNPIISRKDITGKLKDPANFQYPAESVLAYKEGHLSPDIVAEQKKLEAADLVIFQF PLQWFGVPAILKGWFERVFIGEFAYTYAAMYDKGPFRSKKAVLSITTGGSGSMYSLQG IHGDMNVILWPIQSGILHFCGFQVLEPQLTYSIGHTPADARIQILEGWKKRLENIWDE TPLYFAPSSLFDLNFQAGFLMKKEVQDEEKNKKFGLSVGHHLGKSIPTDNQIKARK" BASE COUNT 682 a 537 c 519 g 709 t ORIGIN 131 bp upstream of PstI site; chromosome 16. 1 cccggcaacc acgagcccag ccaatcagcg ccccggactg caccagagcc atggtcggca 61 gaagagcact gatcgtactg gctcactcag agaggacctc cttcaactat gccatgaagg 121 aggctgctgc agcggctttg aagaagaaag gatgggaggt ggtggagtcg gacctctatg 181 ccatgaactt caatcccatc atttccagaa aggacatcac aggtaaactg aaggaccctg 241 cgaactttca gtatcctgcc gagtctgttc tggcttataa agaaggccat ctgagcccag 301 atattgtggc tgaacaaaag aagctggaag ccgcagacct tgtgatattc cagttccccc 361 tgcagtggtt tggagtccct gccattctga aaggctggtt tgagcgagtg ttcataggag 421 agtttgctta cacttacgct gccatgtatg acaaaggacc cttccggagt aagaaggcag 481 tgctttccat caccactggt ggcagtggct ccatgtactc tctgcaaggg atccacgggg 541 acatgaatgt cattctctgg ccaattcaga gtggcattct gcatttctgt ggcttccaag 601 tcttagaacc tcaactgaca tatagcattg ggcacactcc agcagacgcc cgaattcaaa 661 tcctggaagg atggaagaaa cgcctggaga atatttggga tgagacacca ctgtattttg 721 ctccaagcag cctctttgac ctaaacttcc aggcaggatt cttaatgaaa aaagaggtac 781 aggatgagga gaaaaacaag aaatttggcc tttctgtggg ccatcacttg ggcaagtcca 841 tcccaactga caaccagatc aaagctagaa aatgagattc cttagcctgg atttccttct 901 aacatgttat caaatctggg tatctttcca ggcttccctg acttgcttta gtttttaaga 961 tttgtgtttt tctttttcca caaggaataa atgagaggga atcgaccgta ttcgtgcatt 1021 tttggatgca tttttaactg attcttatga ttactatcat ggcatataac caaaatccga 1081 ctgggctcaa gaggccactt agggaaagat gtagaaagat gctagaaaaa tgttctttaa 1141 aggcatctac acaatttaat tcctcttttt agggtcaaag tttagggtac agttggctag 1201 gtatcattca actctccaat gttctattaa tcacctctct gtagtttatg gtcagaaggg 1261 aattgctcag agaaggtaaa agactgaatc tacctgccta agggacttaa cttgtttggt 1321 agttagccat ctaatgcttg tttatgatat ttcttgcttt caattacaaa gcagttacta 1381 atatgcctag cacaagtacc actcttggtc agcttttgtt gtttatatac agtacacaga 1441 taccttgaaa ggaagagcta ataaatctct tctttgctgc agtcatctac ttttttttta 1501 attaaaaaaa attttttttg aagcagtctt gctctgttac ccaggctgga gtgcagtggt 1561 gtgatctcgg ctcactgcaa cctctgcctc ccaggttcca gcaattctcc tgcctcagcc 1621 tccctagtag ctgggatgac aggcgcctgc catcatgcct gactaatttt tgtattttta 1681 gtagagacgg cgtttcacca tgttggccag gctggtctca aactcctgac ctcaggtgat 1741 ccgcctacct cagcctccca aagtgctggg attacaggcg tgatccacca cacctggccc 1801 ttgcaatctt ctactttaag gtttgcagag ataaaccaat aaatccacac cgtacatctg 1861 caatatgaat tccaagaaag gaaatagtac cttcaatact taaaaatagt cttccacaaa 1921 aaatacttta tttctgatct atacaaattt tcagaaggtt attttcttta tcattgctaa 1981 actgatgact taccatggga tggggtccag tcccatgacc ttggggtaca attgtaaacc 2041 tagagtttta tcaactttgg tgaacagttt tggcataata gtcaatttct acttctggaa 2101 gtcatctcat tccactgttg gtattatata attcaaggag aatatgataa aacactgccc 2161 tcttgtggtg cattgaaaga agagatgaga aatgatgaaa aggttgcctg aaaaatggga 2221 gacagcctct tacttgccaa gaaaatgaag ggattggacc gagctggaaa acctccttta 2281 ccagatgctg actggcactg gtggtttttg ctctcgacat atccacaata gctgacggct 2341 gggtgtttca gtttgcaaaa tattttgttg ccttcatctt cactgcaatt ttgtgtaaat 2401 ttctcaaaga tctaattaaa taaataaaat tcatttctac agactca // LOCUS HUMNMRE 3995 bp mRNA PRI 14-FEB-1996 DEFINITION Homo sapiens NMDA receptor mRNA, complete cds. ACCESSION L76224 NID g1196448 KEYWORDS NMDA receptor; glutamate receptor. SOURCE Homo sapiens (clone: NR2C) (clone library: hippocampal (Stratagene), cerebellar (Stratagene)) brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3995) AUTHORS Lin,Y.J., Bovetto,S., Carver,J. and Giordano,T. TITLE The cloning of the human NR2C NMDA glutamate receptor and its expression in the central nervous system and periphery JOURNAL Unpublished (1996) FEATURES Location/Qualifiers source 1..3995 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="NR2C" /clone_lib="hippocampal (Stratagene), cerebellar (Stratagene)" /tissue_type="brain" CDS 1..3702 /codon_start=1 /product="NMDA receptor" /db_xref="PID:g1196449" /translation="MGGALGPALLLTSLFGAWAGLGPGQGEQGMTVAVVFSSSGPPQA QFRARLTPQSFLDLPLEIQPLTVGVNTTNPSSLLTQICGLLGAAHVHGIVFEDNVDTE AVAQILDFISSQTHVPILSISGGSAVVLTPKEPGSAFLQLGVSLEQQLQVLFKVLEEY DWSAFAVITSLHPGHALFLEGVRAVADASHVSWRLLDVVTLELGPGGPRARTQRLLRQ LDAPVFVAYCSREEAEVLFAEAAQAGLVGPGHVWLVPNLALGSTDAPPATFPVGLISV VTESWRLSLRQKVRDGVAILALGAHSYWRQHGTLPAPAGDCRVHPGPVSPAREAFYRH LLNVTWEGRDFSFSPGGYLVQPTMVVIALNRHRLWEMVGRWEHGVLYMKYPVWPRYSA SLQPVVDSRHLTVATLEERPFVIVESPDPGTGGCVPNTVPCRRQSNHTFSSGDVAPYT KLCCKGFCIDILKKLARVVKFSYDLYLVTNGKHGKRVRGVWNGMIGEVYYKRADMAIG SLTINEERSEIVDFSVPFVETGISVMVARSNGTVSPSAFLEPYSPAVWVMMFVMCLTV VAITVFMFEYFSPVSYNQNLTRGKKSGGPAFTIGKSVWLLWALVFNNSVPIENPRGTT SKIMVLVWAFFAVIFLASYTANLAAFMIQEQYIDTVSGLSDKKFQRPQDQYPPFRFGT VPNGSTERNIRSNYRDMHTHMVKFNQRSVEDALTSLKMGKLDAFIYDAAVLNYMAGKD EGCKLVTIGSGKVFATTGYGIAMQKDSHWKRAIDLALLQFLGDGETQKLETVWLSGIC QNEKNEVMSSKLDIDNMAGVFYMLLVAMGLALLVFAWEHLVYWKLRHSVPNSSQLDFL LAFSRGIYSCFSGVQSLASPPRQASPDLTASSAQASVLKMLQAARDMVTTAGVSSSLD RATRTIENWGGGRRAPPPSPCPTPRSGPSPCLPTPDPPPEPSPTGWGPPDGGRAALVR RAPQPPGRPPTPGPPLSDVSRVSRRPAWEARWPVRTGHCGRHLSASERPLSPARCHYS SFPRADRSGRPFLPLFPELEDLPLLGKEQLARREALLHAAWARGSRPRHASLPSSVAE AFARPSSLPAGCTGPACARPDGHSACRRLAQAQSMCLPIYREACQEGEQAGAPAWQHR QHVCLHAHAHLPFCWGAVCPHLPPCASHGSWLSGAWGPLGHRGRTLGLGTGYRDSGGL DEISRVARGTQGFPGPCTWRRISSLESEV" BASE COUNT 643 a 1375 c 1265 g 712 t ORIGIN 1 atgggtgggg ccctggggcc ggccctgttg ctcacctcgc tcttcggtgc ctgggcaggg 61 ctgggtccgg ggcagggcga gcagggcatg acggtggccg tggtgtttag cagctcaggg 121 ccgccccagg cccagttccg tgcccgcctc accccccaga gcttcctgga cctacccctg 181 gagatccagc cgctcacagt tggggtcaac accaccaacc ccagcagcct cctcacccag 241 atctgcggcc tcctgggtgc tgcccacgtc cacggcattg tctttgagga caacgtggac 301 accgaggcgg tggcccagat ccttgacttc atctcctccc agacccatgt gcccatcctc 361 agcatcagcg gaggctctgc tgtggtcctc acccccaagg agccgggctc cgccttcctg 421 cagctgggcg tgtccctgga gcagcagctg caggtgctgt tcaaggtgct ggaagagtac 481 gactggagcg ccttcgccgt catcaccagc ctgcacccgg gccacgcgct cttcctggag 541 ggcgtgcgcg ccgtcgccga cgccagccac gtgagttggc ggctgctgga cgtggtcacg 601 ctggagctgg gcccgggagg gccgcgcgcg cgcacgcagc gcctgctgcg ccagctcgac 661 gcgcccgtgt ttgtggccta ctgctcgcgc gaggaggccg aggtgctctt cgccgaggcg 721 gcgcaggccg gtctggtggg gcccggccac gtgtggttgg tgcccaacct ggcgctgggc 781 agcaccgatg cgccccccgc caccttcccc gtgggcctca tcagcgtcgt caccgagagc 841 tggcgcctca gcctgcgcca gaaggtgcgc gacggcgtgg ccattctggc cctgggcgcc 901 cacagctact ggcgccagca tggaaccctg ccagccccgg ccggggactg ccgtgttcac 961 cctgggcccg tcagccctgc ccgggaggcc ttctacaggc acctactgaa tgtcacctgg 1021 gagggccgag acttctcctt cagccctggt gggtacctgg tccagcccac catggtggtg 1081 atcgccctca accggcaccg cctctgggag atggtggggc gctgggagca tggcgtccta 1141 tacatgaagt accccgtgtg gcctcgctac agtgcctctc tgcagcctgt ggtggacagt 1201 cggcacctga cggtggccac gctggaagag cggccctttg tcatcgtgga gagccctgac 1261 cctggcacag gaggctgtgt ccccaacacc gtgccctgcc gcaggcagag caaccacacc 1321 ttcagcagcg gggacgtggc cccctacacc aagctctgct gtaagggatt ctgcatcgac 1381 atcctcaaga agctggccag agtggtcaaa ttctcctacg acctgtacct ggtgaccaac 1441 ggcaagcatg gcaagcgggt gcgcggcgta tggaacggca tgattgggga ggtgtactac 1501 aagcgggcag acatggccat cggctccctc accatcaatg aggaacgctc cgagatcgta 1561 gacttctctg taccctttgt ggagacgggc atcagtgtga tggtggctcg cagcaatggc 1621 accgtctccc cctcggcctt cttggagcca tatagccctg cagtgtgggt gatgatgttt 1681 gtcatgtgcc tcactgtggt ggccatcacc gtcttcatgt tcgagtactt cagccctgtc 1741 agctacaacc agaacctcac cagaggcaag aagtccgggg gcccagcttt cactatcggc 1801 aagtccgtgt ggctgctgtg ggcgctggtc ttcaacaact cagtgcccat cgagaacccg 1861 cggggcacca ccagcaagat catggttctg gtctgggcct tctttgctgt catcttcctc 1921 gccagctaca cggccaacct ggccgccttc atgatccaag agcaatacat cgacactgtg 1981 tcgggcctca gtgacaagaa gtttcagcgg cctcaagatc agtacccacc tttccgcttc 2041 ggcacggtgc ccaacggcag cacggagcgg aacatccgca gtaactaccg tgacatgcac 2101 acccacatgg tcaagttcaa ccagcgctcg gtggaggacg cgctcaccag cctcaagatg 2161 gggaagctgg atgccttcat ctatgatgct gctgtcctca actacatggc aggcaaggac 2221 gagggctgca agctggtcac cattgggtct ggcaaggtct ttgctaccac tggctacggc 2281 atcgccatgc agaaggactc ccactggaag cgggccatag acctggcgct cttgcagttc 2341 ctgggggacg gagagacaca gaaactggag acagtgtggc tctcagggat ctgccagaat 2401 gagaagaacg aggtgatgag cagcaagctg gacatcgaca acatggcagg cgtcttctac 2461 atgctgctgg tggccatggg gctggccctg ctggtcttcg cctgggagca cctggtctac 2521 tggaagctgc gccactcggt gcccaactca tcccagctgg acttcctgct ggctttcagc 2581 aggggcatct acagctgctt cagcggggtg cagagcctcg ccagcccacc gcggcaggcc 2641 agcccggacc tcacggccag ctcggcccag gccagcgtgc tcaagatgct gcaggcagcc 2701 cgcgacatgg tgaccacggc gggcgtaagc agctccctgg accgcgccac tcgcaccatc 2761 gagaattggg gtggcggccg ccgtgcgccc ccaccgtccc cctgcccgac cccgcggtct 2821 ggccccagcc catgcctgcc cacccccgac ccgcccccag agccgagccc cacgggctgg 2881 ggaccgccag acgggggtcg cgcggcgctt gtgcgcaggg ctccgcagcc cccgggccgc 2941 cccccgacgc cggggccgcc cctgtccgac gtctcccgag tgtcgcgccg cccagcctgg 3001 gaggcgcggt ggccggtgcg gaccgggcac tgcgggaggc acctctcggc ctccgagcgg 3061 cccctgtcgc ccgcgcgctg tcactacagc tcctttcctc gagccgaccg atccggccgc 3121 cccttcctcc cgctcttccc ggagctggag gacctgccgc tgctcggtaa ggagcagctg 3181 gcccggcggg aggccctgct gcacgcggcc tgggcccggg gctcgcgccc gcgtcacgct 3241 tccctgccca gctccgtggc cgaggccttc gctcggccca gctcgctgcc cgctgggtgc 3301 accggccccg cctgcgcccg ccccgacggc cactcggcct gcaggcgctt ggcgcaggcg 3361 cagtcgatgt gcttgccgat ctaccgggag gcctgccagg agggcgagca ggcaggggcc 3421 cccgcctggc agcacagaca gcacgtctgc ctgcacgccc acgcccacct gccattttgc 3481 tggggggctg tctgtcctca ccttccaccc tgtgccagcc acggctcctg gctctccggg 3541 gcctgggggc ctctggggca caggggcagg actctggggc tgggcacagg ctacagagac 3601 agtgggggac tggacgagat cagcagggta gcccgtggga cgcaaggctt cccgggaccc 3661 tgcacctgga gacggatctc cagtctggag tcagaagtgt gagttatcag ccactcaggc 3721 tccgagccag ctggattctc tgcctgccac tgtcagggtt aagcggcagg caggattggg 3781 cttttctggc ttctaccatg aaatcctggc catgggaccc cagtgacaga tgatgtcttc 3841 catggtcatc agtgacctca gtagcctcaa atcatggtga gggctgggct tttgctgtcc 3901 tcttctcacg caaagttctg ccaggaaggt gtgctgtggg ggtcaaactc ctgaggctct 3961 cccttccctg gggctaccaa ttactggtca tggct // LOCUS HUMNORTR 1983 bp mRNA PRI 07-JAN-1995 DEFINITION Human noradrenaline transporter mRNA, complete cds. ACCESSION M65105 NID g189257 KEYWORDS noradrenaline transporter. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1983) AUTHORS Pacholczyk,T., Blakely,R.D. and Amara,S.G. TITLE Expression cloning of a cocaine- and antidepressant-sensitive human noradrenaline transporter JOURNAL Nature 350 (6316), 350-354 (1991) MEDLINE 91179515 FEATURES Location/Qualifiers source 1..1983 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="neuroblastoma" /map="Unassigned" gene 61..1914 /gene="NAT1" CDS 61..1914 /gene="NAT1" /codon_start=1 /db_xref="GDB:G00-127-367" /product="noradrenaline transporter" /db_xref="PID:g189258" /translation="MLLARMNPQVQPENNGADTGPEQPLRARKTAELLVVKERNGVQC LLAPRDGDAQPRETWGKKIDFLLSVVGFAVDLANVWRFPYLCYKNGGGAFLIPYTLFL IIAGMPLFYMELALGQYNREGAATVWKICPFFKGVGYAVILIALYVGFYYNVIIAWSL YYLFSSFTLNLPWTDCGHTWNSPNCTDPKLLNGSVLGNHTKYSKYKFTPAAEFYERGV LHLHESSGIHDIGLPQWQLLLCLMVVVIVLYFSLWKGVKTSGKVVWITATLPYFVLFV LLVHGVTLPGASNGINAYLHIDFYRLKEATVWIDAATQIFFSLGAGFGVLIAFASYNK FDNNCYRDALLTSSINCITSFVSGFAIFSILGYMAHEHKVNIEDVATEGAGLVFILYP EAISTLSGSTFWAVVFFVMLLALGLDSSMGGMEAVITGLADDFQVLKRHRKLFTFGVT FSTFLLALFCITKGGIYVLTLLDTFAAGTSILFAVLMEAIGVSWFYGVDRFSNDIQQM MGFRPGLYWRLCWKFVSPAFLLFVVVVSIINFKPLTYDDYIFPPWANWVGWGIALSSM VLVPIYVIYKFLSTQGSLWERLAYGITPENEHHLVAQRDIRQFQLQHWLAI" BASE COUNT 383 a 602 c 530 g 468 t ORIGIN 1 gccggacaca gcctcggcgt gcccccagga ccggtaaagt tcctctcgcc agccgcatcc 61 atgcttctgg cgcggatgaa cccgcaggtg cagcccgaga acaacggggc ggacacgggt 121 ccagagcagc cccttcgggc gcgcaaaact gcggagctgc tggtggtgaa ggagcgcaac 181 ggcgtccagt gcctgctggc gccccgcgac ggcgacgcgc agccccggga gacctggggc 241 aagaagatcg acttcctgct gtccgtagtc ggcttcgcag tggacctggc caacgtgtgg 301 cgcttcccct acctctgcta caagaacggc ggcggtgcct tcttgatccc gtacacactg 361 ttccttatca tcgcggggat gcccctgttc tacatggagc tggctctggg acagtacaac 421 cgggaggggg ctgccaccgt ttggaaaatc tgcccattct tcaaaggcgt tggctatgct 481 gtcatcctga tcgccctgta cgttggcttc tactacaacg tcatcatcgc ctggtcactc 541 tactacctct tctcctcctt caccctcaac ctgccctgga ccgactgtgg ccacacctgg 601 aacagcccca actgtaccga ccccaagctc ctcaatggct ccgtgcttgg caaccacacc 661 aagtactcca agtacaagtt cacgccggca gccgagtttt atgagcgtgg tgtcctgcac 721 cttcacgaga gcagcgggat tcatgacatc ggcctgcccc agtggcagct cttgctctgt 781 ctgatggtcg tcgtcatcgt cttgtatttt agcctctgga aaggggtgaa gacatcagga 841 aaggtggtgt ggatcacagc cacgctgcct tacttcgtgc tgttcgtgct cctggtccat 901 ggcgtcacgc tgcccggagc ctccaatggc atcaatgcct acctgcacat cgacttctac 961 cgcttgaaag aggccacggt atggattgat gccgcaactc agatattttt ttccttgggg 1021 gctggatttg gagtattgat tgcatttgcc agttacaaca aatttgacaa caactgttac 1081 agggatgccc tgctgaccag cagcatcaac tgtatcacca gcttcgtctc tgggttcgcc 1141 atcttctcca tccttggtta catggcccat gaacacaagg tcaacattga ggatgtggcc 1201 acagaaggag ctggcctagt gttcatcctg tatccagagg ccatttctac cctgtctgga 1261 tctacattct gggctgttgt gtttttcgtc atgctcctgg cgctgggcct tgacagctca 1321 atgggaggca tggaggctgt catcacgggc ctggcagatg acttccaggt cctgaagcga 1381 caccggaaac tcttcacatt tggcgtcacc ttcagcactt tccttctcgc cctgttctgc 1441 ataaccaagg gtggaattta cgtcttgacc ctcctggaca cctttgctgc gggcacctcc 1501 atcctttttg ctgtcctcat ggaagccatc ggagtttcct ggttttatgg agtggacagg 1561 ttcagcaacg acatccagca gatgatgggg ttcaggccgg gtctatactg gagactgtgc 1621 tggaagttcg tcagtcctgc cttcctcctg ttcgtggttg tggtcagcat catcaacttc 1681 aagccactca cctacgacga ctacatcttc ccgccctggg ccaactgggt ggggtggggc 1741 atcgccctgt cctccatggt cctggtgccc atctacgtca tctataagtt cctcagcacg 1801 cagggctctc tttgggagag actggcctat ggcatcacgc cagagaacga gcaccacctg 1861 gtggctcaga gggacatcag acagttccag ttgcaacact ggctggccat ctgagcctgc 1921 ctggaggaga aggaggaacc cccatgccaa tgtccaggtc acaggcatcc gctgctacgt 1981 caa // LOCUS HUMNOXF 2206 bp mRNA PRI 26-OCT-1992 DEFINITION Human neutrophil oxidase factor (p67-phox) mRNA, complete cds. ACCESSION M32011 NID g189267 KEYWORDS neutrophil oxidase factor. SOURCE Human promyelocytic leukemia myeloid cell line HL60, cDNA to mRNA, clone 10. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2206) AUTHORS Leto,T.L., Lomax,K.J., Volpp,B.D., Nunoi,H., Sechler,J.M.G., Nauseef,W.M., Clark,R.A., Gallin,J.I. and Malech,H.L. TITLE Cloning of a 67kD neutrophil oxidase factor with similarity to a noncatalytic region of P60-c-src JOURNAL Science 248, 727-730 (1990) MEDLINE 90239568 COMMENT Draft entry and computer-readable sequence [1] kindly submitted by H.L.Malech, 08-FEB-1990. FEATURES Location/Qualifiers source 1..2206 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HL60" /cell_type="leukemia myeloid" mRNA <1..2206 /note="p67-phox" CDS 68..1648 /codon_start=1 /product="neutrophil oxidase factor" /db_xref="PID:g189268" /translation="MSLVEAISLWNEGVLAADKKDWKGALDAFSAVQDPHSRICFNIG CMYTILKNMTEAEKAFTRSINRDKHLAVAYFQRGMLYYQTEKYDLAIKDLKEALIQLR GNQLIDYKILGLQFKLFACEVLYNIAFMYAKKEEWKKAEEQLALATSMKSEPRHSKID KAMECVWKQKLYEPVVIPVGKLFRPNERQVAQLAKKDYLGKATVVASVVDQDSFSGFA PLQPQAAEPPPRPKTPEIFRALEGEAHRVLFGFVPETKEELQVMPGNIVFVLKKGNDN WATVMFNGQKGLVPCNYLEPVELRIHPQQQPQEESSPQSDIPAPPSSKAPGKPQLSPG QKQKEEPKEVKLSVPMPYTLKVHYKYTVVMKTQPGLPYSQVRDMVSKKLELRLEHTKL SYRPRDSNELVPLSEDSMKDAWGQVKNYCLTLWCENTVGDQGFPDEPKESEKADANNQ TTEPQLKKGSQVEALFSYEATQPEDLEFQEGDIILVLSKVNEEWLEGECKGKVGIFPK VFVEDCATTDLESTRREV" polyA_signal 2030..2035 polyA_signal 2181..2186 BASE COUNT 624 a 499 c 573 g 510 t ORIGIN 1 ctagtctttc agccttcagg ctgtttttgg cttgaagctc tcttggcctc ctagtttcta 61 cctaatcatg tccctggtgg aggccatcag cctctggaat gaaggggtgc tggcagcgga 121 caagaaggac tggaagggag ccctggatgc cttcagtgcc gtccaggacc cccactcccg 181 gatttgcttc aacattggct gcatgtacac tatcctgaag aacatgactg aagcagagaa 241 ggcctttacc agaagcatta accgagacaa gcacttggca gtggcttact tccaacgagg 301 gatgctctac taccagacag agaaatatga tttggctatc aaagacctta aagaagcctt 361 gattcagctt cgagggaacc agctgataga ctataagatc ctggggctcc agttcaagct 421 gtttgcctgt gaggtgttat ataacattgc tttcatgtat gccaagaagg aggaatggaa 481 aaaagctgaa gaacagttag cattggccac gagcatgaag tctgagccca gacattccaa 541 aatcgacaag gcgatggagt gtgtctggaa gcagaagcta tatgagccag tggtgatccc 601 tgtgggcaag ctgtttcgac caaatgagag acaagtggct cagctggcca agaaggatta 661 cctaggcaag gcgacggtcg tggcatctgt ggtggatcaa gacagtttct ctgggtttgc 721 ccctctgcaa ccacaggcag ctgagcctcc acccagaccg aaaaccccag agatcttcag 781 ggctctggaa ggggaggctc accgtgtgct atttgggttt gtgcctgaga caaaagaaga 841 gctccaggtc atgccaggga acattgtctt tgtcttgaag aagggcaatg ataactgggc 901 cacggtcatg ttcaacgggc agaaggggct tgttccctgc aactaccttg aaccagttga 961 gttgcggatc caccctcagc agcagcccca ggaggaaagc tctccgcagt ccgacatccc 1021 agctcctcct agttccaaag cccctggaaa accccagctg tcaccaggcc agaaacaaaa 1081 agaagagcct aaggaagtga agctcagtgt tcccatgccc tacacactca aggtgcacta 1141 caagtacacg gtagtcatga agactcagcc cgggctcccc tacagccagg tccgggacat 1201 ggtgtctaag aaactggagc tccggctgga acacactaag ctgagctatc ggcctcggga 1261 cagcaatgag ctggtgcccc tttcagaaga cagcatgaag gatgcctggg gccaggtgaa 1321 aaactactgc ctgactctgt ggtgtgagaa cacagtgggt gaccaaggct ttccagatga 1381 acccaaggaa agtgaaaaag ctgatgctaa taaccagaca acagaacctc agcttaagaa 1441 aggcagccaa gtggaggcac tcttcagtta tgaggctacc caaccagagg acctggagtt 1501 tcaggaaggg gatataatcc tggtgttatc aaaggtgaat gaagaatggc tggaagggga 1561 gtgcaaaggg aaggtgggca ttttccccaa agtttttgtt gaagactgcg caactacaga 1621 tttggaaagc actcggagag aagtctagga tgtttcacaa actacaaagc tgaagaaaat 1681 gaagccctat tacttgtttg taagatttag cacccttctg ctgtatactg tactgagaca 1741 ttacagtttg gaagtgttaa ctatttattc cctgttaaaa tttaacctac tagacaatga 1801 tgtgagtacc caggatgatt tcctggggca cagtgggtga ggagatgggg acaggtgaat 1861 ggaggagtta ggggagagga aaagtggatg gaagtgtctg gaaagggcac gagagagtct 1921 tccaggtact gatcctgttt cttgctctga gtgctagcta gccagctgtg ttcacactgt 1981 aaacattcat caagctgtac atttggtgca cttttctgtg tcataccaca ataaaaaaaa 2041 acctatcatc atcttacaaa aacaagacac ccaagtccag gcccaaggag taagtacaaa 2101 tattcctgtt tctgaaccat tactgtaatt ggctcttaag gcttgaagta accttatagg 2161 ttactcataa ggcatataca aataaacttg tttgttttct tttttc // LOCUS HUMNP220 6571 bp mRNA PRI 30-JUL-1996 DEFINITION Human mRNA for nuclear protein, NP220, complete cds. ACCESSION D83032 NID g1374697 KEYWORDS nuclear protein, NP220. SOURCE Homo sapiens cell_line:HeLa cell cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Inagaki,H., Matsushima,Y., Nakamura,K., Ohshima,M., Kadowaki,T. and Kitagawa,Y. TITLE A large DNA-binding nuclear protein with RNA recognition motif and serine/arginine-rich domain JOURNAL J. Biol. Chem. 271 (21), 12525-12531 (1996) MEDLINE 96218178 REFERENCE 2 (bases 1 to 6571) AUTHORS Kitagawa,Y. JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 6571) AUTHORS Kitagawa,Y. TITLE Direct Submission JOURNAL Submitted (11-JAN-1996) to the DDBJ/EMBL/GenBank databases. Yasuo Kitagawa, Nagoya University Bioscience Center, Department of Animal Science; Chikusa, Nagoya, Aichi 464-01, Japan (E-mail:i45073a@nucc.cc.nagoya-u.ac.jp, Tel:052-789-5227, Fax:052-789-5228) FEATURES Location/Qualifiers source 1..6571 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa cell" CDS 316..6252 /note="unsure initial methyonine;putative" /codon_start=1 /product="nuclear protein, NP220" /db_xref="PID:d1012415" /db_xref="PID:g1374698" /translation="MSRPRFNPRGDFPLQRPRAPNPSGMRPPGPFMRPGSMGLPRFYP AGRARGIPHRFAGLESYQNMGPQRMNVQVTQHRTDPRLTKEKLDFHEAQQKKGKPHGS RWDDEPHISASVAVKQSSVTQVTEQSPKVQSRYTKESASSILASFGLSNEDLEELSRY PDEQLTPENMPLILRDIRMRKMGRRLPNLPSQSRNKETLGSEAVSSNVIDYGHASKYG YTEDPLEVRIYDPEIPTDEVENEFQSQQNISASVPNPNVICNSMFPVEDVFRQMDFPG ESSNNRSFFSVESGTKMSGLHISGGQSVLEPIKSVNQSINQTVSQTMSQSLIPPSMNQ QPFSSELISSVSQQERIPHEPVINSSNVHVGSRGSKKNYQSQADIPIRSPFGIVKASW LPKFSHADAQKMKRLPTPSMMNDYYAASPRIFPHLCSLCNVECSHLKDWIQHQNTSTH IESCRQLRQQYPDWNPEILPSRRNEGNRKENETPRRRSHSPSPRRSRRSSSSHRFRRS RSPMHYMYRPRSRSPRICHRFISRYRSRSRSRSPYRIRNPFRGSPKCFRSVSPERMSR RSVRSSDRKKALEDVVQRSGHGTEFNKQKHLEAADKGHSPAQKPKTSSGTKPSVKPTS ATKSDSNLGGHSIRCKSKNLEDDTLSECKQVSDKAVSLQRKLRKEQSLHYGSVLLITE LPEDGCTEEDVRKLFQPFGKVNDVLIVPYRKEAYLEMEFKEAITAIMKYIETTPLTIK GKSVKICVPGKKKAQNKEVKKKTLESKKVSASTLKRDADASKAVEIVTSTSAAKTGQA KACVAKVNKSTGKSASSVKSVVTVAVKGNKASIKTAKSGGKKSLEAKKTGNVKNKDSN KPVTIPENSEIKTSIEVKATENCAKEAISDAALEATENEPLNKETEEMCVMLVSNLPN KGYSVEEVYDLAKPFGGLKDILILSSHKKAYIEINRKAAESMVKFYTCFPVLMDGNQL SISMAPENMNIKDEEAIFITLVKENDPEANIDTIYDRFVHLDNLPEDGLQCVLCVGLQ FGKVDHHVFISNRNKAILQLDSPESAQSMYSFLKQNPQNIGDHMLTCSLSPKIDLPEV QIEHDPELEKESPGLKNSPIDESEVQTATDSPSVKPNELEEESTPSIQTETLVQQEEP CEEEAEKATCDSDFAVETLELETQGEEVKEEIPLVASASVSIEQFTENAEECALNQQM FNSDLEKKGAEIINPKTALLPSDSVFAEERNLKGILEESPSEAEDFISGITQTMVEAV AEVEKNETVSEILPSTCIVTLVPGIPTGDEKTVDKKNISEKKGNMDEKEEKEFNTKET RMDLQIGTEKAEKNEGRMDAEKVEKMAAMKEKPAENTLFKAYPNKGVGQANKPDETSK TSILAVSDVSSSKPSIKAVIVSSPKAKATVSKTENQKSFPKSVPRDQINAEKKLSAKE FGLLKPTSARSGLAESSSKFKPTQSSLTRGGSGRISALQGKLSKLDYRDITKQSQETE ARPSIMKRDDSNNKTLAEQNTKNPKSTTGRSSKSKEEPLFPFNLDEFVTVDEVIEEVN PSQAKQNPLKGKRKETLKNVPFSELNLKKKKGKTSTPRGVEGELSFVTLDEIGEEEDA AAHLAQALVTVDEVIDEEELNMEEMVKNSNSLFTLDELIDQDDCISHSEPKDVTVLSV AEEQDLLKQERLVTVDEIGEVEELPLNESADITFATLNTKGNEGDIVRDSIGFISSQV PEDPSTLVTVDEIQDDSSDLHLVTLDEVTEEDEDSLADFNNLKEELNFVTVDEVGEEE DGDNDLKVELAQSKNDHPTDKKGNRKKRAVDTKKTKLESLSQVGPVNENVMEEDLKTM IERHLTAKTPTKRVRIGKTLPSEKAVVTEPAKGEEAFQMSEVDEESGLKDSEPERKRK KTEDSSSGKSVASDVPEELDFLVPKAGFFCPICSLFYSGEKAMTNHCKSTRHKQNTEK FMAKQRKEKEQNEAEERSSR" polyA_signal 6472..6479 BASE COUNT 2335 a 1164 c 1415 g 1657 t ORIGIN 1 ggcatgcgtg cagctctttg gaggcggtag ctttttcggc gtcgagactg gaggctgagt 61 gctaaactgt gtggggcgcg gatgggatcc agctgttagt cgggtaggca tagctttgtg 121 ttattcttgg aaaatttcgc accacttgtg aattccttga acctgggcat tgcaaaccca 181 cttctgttgg gcccatctcc tttgcacttt gctcagatta agactcagtt ggcgcttcag 241 cagctgaatg ccgttgcctc acatggttca acaccacctt atactttatt aaatcaggct 301 ttcttgaaaa tagccatgtc gagacccagg tttaatcctc gaggagactt tccacttcaa 361 aggccacgag cacctaaccc ttctgggatg aggcctccag gaccatttat gaggcctgga 421 tctatgggtc tcccaagatt ttacccagca gggagagcac gtggaattcc acacagattt 481 gctggcctgg aatcttatca gaacatgggg ccacagagaa tgaatgttca ggtaactcaa 541 cacagaactg atccaagatt gaccaaagaa aaactggatt ttcatgaagc acaacagaag 601 aaggggaagc ctcatggtag ccggtgggat gatgagcctc atatatctgc atcagtggca 661 gtgaaacaga gttctgtaac acaggttaca gagcagagtc ccaaagtaca gagccgctat 721 acaaaagaga gtgcctcaag tatcttagca agttttggat tatctaatga agacctagaa 781 gaacttagtc gctatcctga tgaacaacta actcctgaaa atatgccatt aattttgagg 841 gatataagaa tgcgaaaaat ggggcgccga ttacctaatt taccttctca gagcagaaat 901 aaagaaacac ttggtagtga agcagtttca agtaatgtga tcgattatgg gcatgcaagc 961 aaatatggct acacagaaga tccacttgaa gtacgtattt atgatcctga aattccaact 1021 gatgaggtcg agaatgaatt tcagtcacag cagaacattt ctgcatctgt tcccaatcca 1081 aatgtgatat gtaattctat gtttcctgtt gaagacgtat ttcgccaaat ggacttcccc 1141 ggtgagtcct ccaataatcg gtcctttttc tcagttgaga gtggaaccaa gatgtcaggc 1201 ttacacattt caggaggaca gtcagtcctt gaacccataa aatccgtcaa ccaatccatt 1261 aaccaaacag ttagccagac aatgagtcaa tctctgattc ctccatctat gaaccagcaa 1321 cctttttcgt cggaattaat ttcatctgta agccagcaag agcggatccc acatgaacct 1381 gtgattaatt catctaacgt acatgttgga tcaagaggaa gtaaaaagaa ttaccagtca 1441 caggctgaca ttcccattcg gtctcccttt ggtattgtga aagcatcctg gctaccaaag 1501 ttttcacatg ctgatgccca gaagatgaag agacttccaa ctccttctat gatgaatgat 1561 tattatgcag catctccaag aatatttcca catttgtgtt ctctgtgtaa cgtagaatgt 1621 agtcatttga aggattggat tcagcatcaa aatacatcta ctcatattga gagctgtcga 1681 cagttacgtc aacagtatcc tgattggaat cctgagatcc tcccatcgag aagaaatgag 1741 ggcaatagaa aagaaaatga aactccacga agacgttctc attcccccag tcctaggcgt 1801 tctagaagat caagctcaag tcacagattc cgtcggtctc gaagcccaat gcattacatg 1861 tataggccga gaagtcgaag tccaagaatt tgccatcgtt tcatttctag atacagatcc 1921 agatccagat cccgttcacc atatcgaatt agaaatccat ttagaggtag tccaaaatgc 1981 tttcgatcag ttagccctga gaggatgtca aggagatcag tgagatcatc agatagaaaa 2041 aaagcattag aagatgtagt acaacgatct gggcatggga cagaatttaa taaacagaag 2101 catcttgaag ctgctgataa gggacattca ccagcacaaa agcctaaaac tagcagtgga 2161 acaaaaccat cagttaaacc tacaagcgct acaaagagtg attcaaatct aggaggacat 2221 tctattcgtt gtaaatcaaa gaatcttgaa gatgacactt tgtcagaatg taaacaggtg 2281 tctgataaag ctgtttctct ccagcgaaag cttcggaaag aacagtcatt gcattatggt 2341 tcggttcttc ttataactga attaccagag gatggttgta ctgaagaaga tgtgagaaaa 2401 ttatttcaac catttgggaa agtgaatgat gtcctaattg ttccatatag aaaagaggct 2461 tacctagaaa tggaatttaa agaggcaatt actgcaatta tgaagtacat tgaaacaaca 2521 cctcttacga taaaaggaaa aagtgtgaaa atatgtgttc caggaaagaa aaaagcacag 2581 aacaaagagg tgaagaaaaa gactttagag tcaaagaaag tatctgcatc taccttaaaa 2641 agagatgcag atgcttcaaa agctgttgaa attgttactt caacttctgc tgccaaaact 2701 ggacaagcca aggcatgtgt agccaaagta aacaaatcta cagggaaatc agcaagttct 2761 gtaaaatctg tggtaacggt agctgttaaa ggtaataaag cttcaatcaa aacagcaaaa 2821 tctggtggaa agaagtctct agaagccaaa aagactggga atgtcaaaaa caaagactct 2881 aacaaacctg tgactatacc agaaaactct gaaataaaga ccagtattga agtcaaagcc 2941 actgaaaact gtgctaaaga agctatttct gatgctgctt tggaggccac agagaatgaa 3001 ccacttaaca aggaaacaga agaaatgtgt gtgatgcttg tctctaattt gcctaataaa 3061 ggatattctg tagaagaagt ttatgactta gcaaaaccat ttggtggttt aaaggatatc 3121 ttgattttat catctcataa aaaggcatat atagaaataa atagaaaagc tgctgagtct 3181 atggtaaaat tttatacctg cttcccagta ttgatggatg gaaatcaact ctcaataagt 3241 atggctcctg aaaacatgaa tataaaagat gaggaagcta tatttataac cttggtaaaa 3301 gaaaatgacc cagaggcaaa catagataca atttatgatc gatttgtaca tcttgataat 3361 ttaccggaag atggacttca gtgtgtactt tgtgttggac ttcagtttgg aaaagtggat 3421 caccatgtat tcataagtaa tagaaacaag gcaattcttc agttagatag tcctgaatct 3481 gctcagtcaa tgtatagctt tctgaaacaa aatccacaaa atattggtga ccatatgttg 3541 acctgctcat tatctccaaa gatagactta ccagaggtgc aaattgagca tgacccagaa 3601 ttagaaaaag aaagccctgg cttgaaaaac agtccaattg atgaaagtga ggtgcaaaca 3661 gcaactgata gtccctctgt taaacctaat gagcttgaag aagaaagtac tcccagcatt 3721 caaacagaaa ctttggtaca gcaggaagag ccttgtgagg aagaagctga aaaagcaaca 3781 tgtgattctg actttgctgt tgaaactttg gagcttgaaa ctcaaggaga ggaggtcaaa 3841 gaagaaattc ctcttgtagc atccgcttca gtcagtattg aacaattcac tgaaaatgcc 3901 gaggagtgtg ctttaaatca gcagatgttt aacagtgact tggagaagaa aggggcagaa 3961 attattaacc ctaaaacagc attgttacca tctgacagtg tgtttgcaga agaaaggaac 4021 ctcaaaggaa ttctagaaga atctccatct gaagcagaag atttcatttc tggaattaca 4081 cagactatgg tagaagctgt agctgaagta gaaaaaaatg aaactgtttc ggaaatattg 4141 ccatcaactt gtattgtgac gttagtacca ggaattccca ctggggatga gaagacagtg 4201 gacaaaaaga atatttctga aaaaaaaggt aacatggatg aaaaggagga gaaggaattt 4261 aatactaagg aaaccagaat ggatcttcaa ataggaacag agaaggctga aaagaatgaa 4321 ggtaggatgg atgcagaaaa ggtggaaaag atggcagcaa tgaaagaaaa gcctgcagaa 4381 aacactttat tcaaggcata cccaaataaa ggagtgggtc aggctaataa gcctgatgaa 4441 actagtaaaa ctagtattct ggctgtatca gatgtatcta gcagtaaacc aagcatcaag 4501 gctgttatag tctcttctcc taaggcaaaa gctacagttt caaaaactga aaatcagaaa 4561 agttttccaa aatctgtgcc cagagatcaa ataaatgctg aaaagaaact ttcagccaag 4621 gaatttggtc tgcttaaacc cacaagtgcc aggtcaggct tggcagaaag cagcagtaaa 4681 ttcaaaccta ctcagagcag tcttaccaga ggaggcagtg gaaggatctc agccctgcaa 4741 ggcaagcttt ctaaactgga ttacagagat ataacaaaac aatctcagga aacagaggct 4801 agaccttcca tcatgaaacg ggatgacagc aacaataaga ctttggctga gcaaaacact 4861 aagaatccta aaagcactac tggtagaagt tccaaatcta aagaggagcc attatttcca 4921 tttaatttgg atgaatttgt tactgtggat gaggttatag aagaagtgaa tccttctcag 4981 gccaagcaga atccactaaa gggaaaaagg aaagaaactc tcaaaaatgt tcctttctct 5041 gaacttaact taaagaagaa aaaggggaaa acttccactc ctcgtggtgt tgagggagaa 5101 ctatcttttg tgacattgga tgagattggg gaagaggaag atgcagctgc acatctagca 5161 caagctctag tcactgtgga tgaagtaatt gatgaagaag aactaaatat ggaagaaatg 5221 gtaaaaaatt caaattcact ttttacatta gatgaattaa ttgaccaaga tgattgcatt 5281 tcccacagtg aacctaaaga tgttactgtt ctgtcagtgg ctgaagaaca agatctcctc 5341 aaacaggaac gcttggtaac tgtggatgaa attggagaag tggaagagct acctttgaat 5401 gagtcagcag acataacttt tgccacttta aatactaaag gaaatgaagg agatatcgta 5461 agggattcca ttggcttcat ttcttctcag gtgcccgaag acccttctac tttagttact 5521 gtagatgaaa tacaagatga cagcagtgat ttgcatttag tgactttgga tgaagtaact 5581 gaagaggatg aagactctct ggcggatttt aacaacctta aagaagagct taattttgtt 5641 actgttgatg aagttggaga ggaggaagat ggagataatg atttaaaagt tgagttagca 5701 caaagcaaaa atgaccatcc cacagataaa aaagggaata gaaagaagag agctgtggac 5761 acaaaaaaga caaaacttga atccttgtcc caagtgggtc cagtaaatga gaatgttatg 5821 gaagaagatc taaaaaccat gattgaaaga cacttaacag ctaaaactcc aaccaagaga 5881 gttagaattg ggaaaactct gccatcagaa aaagctgttg tgacagaacc agcaaaaggt 5941 gaagaggcct tccagatgag tgaagttgat gaggaatctg gattaaagga ttcagaacca 6001 gagcgaaaac gcaagaagac tgaagactct tcttcaggca aatcagtggc gtctgatgtc 6061 cctgaggaat tagactttct tgtacctaag gctggattct tctgtccaat ttgttccctc 6121 ttctactcag gtgaaaaagc aatgacaaat cactgcaaga gtacacgtca taagcaaaat 6181 actgagaaat ttatggccaa gcaaagaaag gaaaaggagc agaatgaggc tgaagaaaga 6241 agctctaggt gattggggga aaggaaagaa ttcactagaa atttgtttag ggtccagttg 6301 atttgtgtat ttttgttatc atttaatttg taattttcgt ttcagaagca aatattcgtg 6361 ttgtacaaat ttctgattgc cctaaatgta gagagactga tggggaaagt atgatgggtt 6421 tgatttttat atcaaatcat caggcatgga gaaatatctt ttagaagtgt taaaataaat 6481 gttcctactg tatatttaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 6541 aaaaaaaaaa aaaaaaaaaa aaaaaaacaa a // LOCUS HUMNPIIY20 1481 bp mRNA PRI 27-JAN-1998 DEFINITION Homo sapiens leukocyte platelet-activating factor receptor mRNA, complete cds. ACCESSION M76676 NID g2810988 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1481) AUTHORS Kunz,D., Gerard,N.P. and Gerard,C. TITLE The human leukocyte platelet-activating factor receptor: cDNA cloning, cell surface expression, and construction of a novel epitope-bearing analog JOURNAL J. Biol. Chem. 267, 9101-9106 (1992) MEDLINE 92250505 FEATURES Location/Qualifiers source 1..1481 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="U937, cAMP induced" /cell_type="leukocyte" CDS 89..1057 /note="putative" /codon_start=1 /product="leukocyte platelet-activating factor receptor" /db_xref="PID:g189270" /translation="MALLGSQHSGAPSAAGPPGGTSSAATAAVLSFSTVATAALGNLS DASGGGTAAAPGGGGLGGSGAAREAGAAVRRPLGPEAAPLLSHGAAVAAQALVLLLIF LLSSLGNCAVMGVIVKHRQLRTVTNAFILSLSLSDLLTALLCLPAAFLDLFTPPGGSA PALPAGPWRGFCRPSRFFSSCFGIVYAQRGAHLVGPLLRYRRPPREKIGRRRALQLLA GAWLTALGFSLPWELLGAPRELAAGQSFHGCLYRTSPDPAQLGGPFSVGLVVACYLLP FLLICFCHYHICKTVRLSDVRVRPVNTYARVLRSSARCARPPPSSS" BASE COUNT 183 a 548 c 496 g 254 t ORIGIN 1 ggccgccgcc cccggtgcgg gatgaggaga tccgcggccg ccactgggcc ccatggagga 61 gccgccgccg ccccgcccac cagcgagcat ggccttactg ggcagccagc actccggcgc 121 cccctccgcg gccggcccac ctggcgggac ttcctcagcg gccacggcgg ccgtgctctc 181 cttcagcacc gtggcgaccg cggcgctggg gaacctgagc gacgcaagcg gaggcggcac 241 agctgccgct cccggtggcg gcggccttgg cgggtccggg gcagcgcggg aggcgggggc 301 ggcggtgagg cggccgctag gcccggaggc ggcgccgctg ctgtcgcacg gagctgcagt 361 ggcggcccag gcgctcgtcc tcctgctcat cttcctgctg tctagccttg gcaactgcgc 421 ggtgatgggg gtgattgtga agcaccggca gctccgcacc gtcaccaacg ccttcatcct 481 gtcgctgtcc ctatcggatc tgctcacggc gctgctctgc ctgcccgccg ccttcctgga 541 cctcttcact ccgcccgggg gttcggcgcc tgcgctgccc gcggggccct ggcgcggctt 601 ctgccggcca agccgcttct tcagctcgtg cttcggcatc gtgtacgctc agcgtggcgc 661 tcatctcgtt ggaccgttac tgcgctatcg tcggccgccg cgggagaaga tcggccgccg 721 ccgcgcgctg cagctgctgg cgggcgcctg gctgacggcc ctgggcttct ccttgccctg 781 ggagctgctc ggggcgcccc gggaactcgc ggcgggccag agcttccacg gctgcctcta 841 ccggacctcc ccggaccccg cgcagctggg cggccccttc agcgtggggc tggtggtggc 901 ctgctacctg ctgcccttcc tgctcatctg cttctgccac taccacatct gcaagacggt 961 gcgcctgtcg gacgtgcgcg tgcggccggt gaacacctac gcgcgcgtgc tgcgttcttc 1021 agcgaggtgc gcacggccac caccgtcctc atcatgatcg tcttcgtcat ctgctgctgg 1081 gggccctact gcttcctggt gctgctggcc gccgcccggc aggcccagac catgcaggcc 1141 ccctcgctcc tcagcgtggt ggccgtctgg ctgacctggg ccaatggggc catcaaccct 1201 gtcatctacg ccatccgcaa tcccaacatt tcgatgctcc tagggcgcaa ccgcgaggag 1261 ggctaccgga ctaggaatgt ggacgctttc ctgcccagcc agggcccggg tctgcaagcc 1321 agaagccgca gtcgccttcg aaaccgctat gccaaccggc tgggggcctg caacaggatg 1381 tcctcttcca acccggccag cggagtggca ggggacgtgg ccatgtgggc cgcaaaaatc 1441 cagttgtact tttctgccga gaggaccacc agagccggtg a // LOCUS HUMNPMMLF 1116 bp mRNA PRI 16-MAY-1996 DEFINITION Homo sapiens t(3;5)(q25.1;p34) fusion gene NPM-MLF1 mRNA, complete cds. ACCESSION L49054 NID g1066391 KEYWORDS acute myeloid leukemia; fusion gene; myelodysplastic syndrome; translocation. SOURCE Homo sapiens mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1116) AUTHORS Yoneda-Kato,N., Look,A.T., Kirstein,M.N., Valentine,M.B., Raimondi,S.C., Cohen,K.J., Carroll,A.J. and Morris,S.W. TITLE The t(3;5)(q25.1;q34) of myelodysplastic syndrome and acute myeloid leukemia produces a novel fusion gene, NPM-MLF1 JOURNAL Oncogene 12 (2), 265-275 (1996) MEDLINE 96152893 FEATURES Location/Qualifiers source 1..1116 /organism="Homo sapiens" /db_xref="taxon:9606" gene 109..915 /gene="NPM-MLF1" CDS 109..915 /gene="NPM-MLF1" /note="t(3;5)(q25.1;p34) fusion gene" /codon_start=1 /db_xref="PID:g1066392" /translation="MFRMLNSSFEDDPFFSESILAHRENMRQMIRSFSEPFGRDLLSI SDGRGRAHNRRGHNDGEDSLTHTDVSSFQTMDQMVSNMRNYMQKLERNFGQLSVDPNG HSFCSSSVMTYSKIGDEPPKVFQASTQTRRAPGGIKETRKAMRDSDSGLEKMAIGHHI HDRAHVIKKSKNKKTGDEEVNQEFINMNESDAHAFDEEWQSEVLKYKPGRHNLGNTRM RSVGHENPGSRELKRREKPQQSPAIEHGRRSNVLGDKLHIKGSSVKSNKK" BASE COUNT 357 a 213 c 249 g 297 t ORIGIN 1 gttatgtgtt cccgtccgta ctggaggcta gctcttgtcg cggccgcggc gagttaacat 61 cgtttttcca atctgtccgc ggctgccgcc acccaagaca gagccagaat gttcaggatg 121 ctgaacagca gttttgagga tgaccccttc ttctctgagt ccattcttgc acaccgagaa 181 aatatgcgac agatgataag aagtttttct gaaccctttg gaagagactt gctcagtatc 241 tctgatggta gagggagagc tcataatcgt agaggacata atgatggtga agattctttg 301 actcatacag atgtcagctc tttccagacc atggaccaaa tggtgtcaaa tatgagaaac 361 tatatgcaga aattagaaag aaacttcggt caactttcag tggatccaaa tggacattca 421 ttttgttctt cctcagttat gacttattcc aaaataggag atgaaccgcc aaaggttttt 481 caggcctcaa ctcaaactcg tcgagctcca ggaggaataa aggaaaccag gaaagcaatg 541 agagattctg acagtggact agaaaaaatg gctattggtc atcatatcca tgaccgagct 601 catgtcatta aaaagtcaaa gaacaagaag actggagatg aagaggtcaa ccaggagttc 661 atcaatatga atgaaagcga tgctcatgct tttgatgagg agtggcaaag tgaggttttg 721 aagtacaaac caggacgaca caatctagga aacactagaa tgagaagtgt tggccatgag 781 aatcctggct cccgagaact taaaagaagg gagaaacctc aacaaagtcc agccattgaa 841 catggaagga gatcaaatgt tttgggggac aaactccaca tcaaaggctc atctgtgaaa 901 agcaacaaaa aataaatagc catgcatttg atttgtttag ttttgattgt tttaacagtt 961 agtaatggtg ctgggtaata agcataagac caatctcttg ctgttaaatc agttctgtcc 1021 ttggcaactt tcttctgata tctgaatgtt catgaaggtc ctagctttat attgtccctc 1081 ttttaggaat aaaattttga ttttcaacaa aaaaaa // LOCUS HUMNPY 551 bp mRNA PRI 07-JAN-1995 DEFINITION Human neuropeptide Y (NPY) mRNA, complete cds. ACCESSION K01911 NID g189273 KEYWORDS neuropeptide Y. SOURCE Human pheochromocytoma, cDNA to mRNA, clone pNPY3-75. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 551) AUTHORS Minth,C.D., Bloom,S.R., Polak,J.M. and Dixon,J.E. TITLE Cloning, characterization, and DNA sequence of a human cDNA encoding neuropeptide tyrosine JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81 (14), 4577-4581 (1984) MEDLINE 84272678 COMMENT Neuropeptide Y (NPY) is one of the most abundant peptides in the mammalian nervous system, and its extensive distribution suggests a neuro-transmitter or -modulator role. NPY is also found in some chromaffin cells of the adrenal medulla. FEATURES Location/Qualifiers source 1..551 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="pheochromocytoma" /map="7pter-q22" mRNA <1..551 /gene="NPY" /note="G00-119-456" gene 1..551 /gene="NPY" sig_peptide 87..170 /gene="NPY" /note="G00-119-456" CDS 87..380 /gene="NPY" /codon_start=1 /db_xref="GDB:G00-119-456" /product="neuropeptide Y" /db_xref="PID:g189274" /translation="MLGNKRLGLSGLTLALSLLVCLGALAEAYPSKPDNPGEDAPAED MARYYSALRHYINLITRQRYGKRSSPETLISDLLMRESTENVPRTRLEDPAMW" mat_peptide 171..278 /gene="NPY" /note="G00-119-456" /product="neuropeptide Y" BASE COUNT 131 a 171 c 129 g 120 t ORIGIN 51 bp upstream of RsaI site. 1 accccatccg ctggctctca cccctcggag acgctcgccc gacagcatag tacttgccgc 61 ccagccacgc ccgcgcgcca gccaccatgc taggtaacaa gcgactgggg ctgtccggac 121 tgaccctcgc cctgtccctg ctcgtgtgcc tgggtgcgct ggccgaggcg tacccctcca 181 agccggacaa cccgggcgag gacgcaccag cggaggacat ggccagatac tactcggcgc 241 tgcgacacta catcaacctc atcaccaggc agagatatgg aaaacgatcc agcccagaga 301 cactgatttc agacctcttg atgagagaaa gcacagaaaa tgttcccaga actcggcttg 361 aagaccctgc aatgtggtga tgggaaatga gacttgctct ctggcctttt cctattttca 421 gcccatattt catcgtgtaa aacgagaatc cacccatcct accaatgcat gcagccactg 481 tgctgaattc tgcaatgttt tcctttgtca tcattgtata tatgtgtgtt taaataaagt 541 atcatgcatt c // LOCUS HUMNRAMP 2007 bp mRNA PRI 27-DEC-1994 DEFINITION Homo sapiens integral membrane protein (NRAMP1) mRNA, complete cds. ACCESSION L32185 NID g600219 KEYWORDS integral membrane protein. SOURCE Homo sapiens (clone library: lambda gt10) adult spleen cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2007) AUTHORS Cellier,M.F., Govoni,G., Vidal,S., Kwan,T., Groulx,N., Liu,J., Sanchez,F., Skamene,E., Schurr,E. and Gros,P. TITLE Human natural resistance-associated macrophage protein: cDNA cloning, chromosomal mapping, genomic organization and tissue-specific expression JOURNAL J. Exp. Med. 180, 1741-1752 (1994) MEDLINE 95053705 FEATURES Location/Qualifiers source 1..2007 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="THP1" /dev_stage="adult" /clone_lib="lambda gt10" /tissue_type="spleen" /map="chromosome 2q35-36" gene 77..1729 /gene="NRAMP1" CDS 77..1729 /gene="NRAMP1" /standard_name="macrophage-specific integral membrane protein" /note="N-glycosylation sites (aa 324..326, 338..340); PKC phosphorylation sites (aa 40..42, 54..56, 117..119); additional putative transmembrane domains a (aa 58..75) and b (aa 137..158); binding-protein-dependent transport system inner membrane component signature (aa 373..392); transmembrane domains 1 through 10 (aa's): (84..105), (168..186), (197..217), (241..259), (288..309), (350..369), (401..418), (430..449), (469..488), (496..517)" /codon_start=1 /function="potential transporter" /product="integral membrane protein" /db_xref="PID:g600220" /translation="MTGDKGPQRLSGSSYGSISSPTSPTSPGPQQAPPRETYLSEKIP IPDTKPGTFSLRKLWAFTGPGFLMSIAFLDPGNIESDLQAGAVAGFKLLWVLLWATVL GLLCQRLAARLGVVTGKDLGEVCHLYYPKVPRTVLWLTIELAIVGSDMQEVIGTAIAF NLLSAGRIPLWGGVLITIVDTFFFLFLDNYGLRKLEAFFGLLITIMALTFGYEYVVAR PEQGALLRGLFLPSCPGCGHPELLQAVGIVGAIIMPHNIYLHSALVKSREIDRARRAD IREANMYFLIEATIALSVSFIINLFVMAVFGQAFYQKTNQAAFNICANSSLHDYAKIF PMNNATVAVDIYQGGVILGCLFGPAALYIWAIGLLAAGQSSTMTGTYAGQFVMEGFLR LRWSRFARVLLTRSCAILPTVLVAVFRDLRDLSGLNDLLNVLQSLLLPFAVLPILTFT SMPTLMQEFANGLLNKVVTSSIMVLVCAINLYFVVSYLPSLPHPAYFGLAALLAAAYL GLSTYLVWTCCLAHGATFLAHSSHHHFLYGLLEEDQKGETSG" BASE COUNT 372 a 656 c 548 g 431 t ORIGIN 1 tctgggcacg ggtgcaggct gaggagctgc ccagagcacc gctcacactc ccagagtacc 61 tgaagtcggc atttcaatga caggtgacaa gggtccccaa aggctaagcg ggtccagcta 121 tggttccatc tccagcccga ccagcccgac cagcccaggg ccacagcaag cacctcccag 181 agagacctac ctgagtgaga agatccccat cccagacaca aaaccgggca ccttcagcct 241 gcggaagcta tgggccttca cggggcctgg cttcctcatg agcattgctt tcctggaccc 301 aggaaacatc gagtcagatc ttcaggctgg cgccgtggcg ggattcaaac ttctctgggt 361 gctgctctgg gccaccgtgt tgggcttgct ctgccagcga ctggctgcac gtctgggcgt 421 ggtgacaggc aaggacttgg gcgaggtctg ccatctctac taccctaagg tgccccgcac 481 cgtcctctgg ctgaccatcg agctagccat tgtgggctcc gacatgcagg aagtcatcgg 541 cacggccatt gcattcaatc tgctctcagc tggacgaatc ccactctggg gtggcgtcct 601 catcaccatc gtggacacct tcttcttcct cttcctcgat aactacgggc tgcggaagct 661 ggaagctttt tttggactcc ttataaccat tatggccttg acctttggct atgagtatgt 721 ggtggcgcgt cctgagcagg gagcgcttct tcggggcctg ttcctgccct cgtgcccggg 781 ctgcggccac cccgagctgc tgcaggcggt gggcattgtt ggcgccatca tcatgcccca 841 caacatctac ctgcactcgg ccctggtcaa gtctcgagag atagaccggg cccgccgagc 901 ggacatcaga gaagccaaca tgtacttcct gattgaggcc accatcgccc tgtccgtctc 961 ctttatcatc aacctctttg tcatggctgt ctttgggcag gccttctacc agaaaaccaa 1021 ccaggctgcg ttcaacatct gtgccaacag cagcctccac gactacgcca agatcttccc 1081 catgaacaac gccaccgtgg ccgtggacat ttaccagggg ggcgtgatcc tgggctgcct 1141 gttcggcccc gcggccctct acatctgggc cataggtctc ctggcggctg ggcagagctc 1201 caccatgacg ggcacctacg cgggacagtt cgtgatggag ggcttcctga ggctgcggtg 1261 gtcacgcttc gcccgtgtcc tcctcacccg ctcctgcgcc atcctgccca ccgtgctcgt 1321 ggctgtcttc cgggacctga gggacttgtc gggcctcaat gatctgctca acgtgctgca 1381 gagcctgctg ctcccgttcg ccgtgctgcc catcctcacg ttcaccagca tgcccaccct 1441 catgcaggag tttgccaatg gcctgctgaa caaggtcgtc acctcttcca tcatggtgct 1501 agtctgcgcc atcaacctct acttcgtggt cagctatctg cccagcctgc cccaccctgc 1561 ctacttcggc cttgcagcct tgctggccgc agcctacctg ggcctcagca cctacctggt 1621 ctggacctgt tgccttgccc acggagccac ctttctggcc cacagctccc accaccactt 1681 cctgtatggg ctccttgaag aggaccagaa aggggagacc tctggctagg cccacaccag 1741 ggcctggctg ggagtggcat gtatgacgtg actggcctgc tggatgtgga gggggcgcgt 1801 gcaggcagca ggatagagtg ggacagttcc tgagaccagc caacctgggg gctttaggga 1861 cctgctgttt cctagcgcag ccatgtgatt accctctggg tctcagtgtc ctcatctgta 1921 aaatggagac accaccaccc ttgccatgga ggttaagcac tttaacacag tgtctggcac 1981 ttgggacaaa aacaaacaaa cgaaaaa // LOCUS HUMNSPA 3202 bp mRNA PRI 07-JAN-1995 DEFINITION Homo sapiens neuroendocrine-specific protein A (NSP) mRNA, complete cds. ACCESSION L10333 NID g307306 KEYWORDS neuroendocrine-specific protein A. SOURCE Homo sapiens carcinoid lung tumor cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3202) AUTHORS Roebroek,A.J., van de Velde,H.J., Van Bokhoven,A., Broers,J.L., Ramaekers,F.C. and Van de Ven,W.J. TITLE Cloning and expression of alternative transcripts of a novel neuroendocrine-specific gene and identification of its 135-kDa translational product JOURNAL J. Biol. Chem. 268 (18), 13439-13447 (1993) MEDLINE 93293865 COMMENT The NSP gene encodes different transcripts: L10333/3.4 kb, L10334/2.3 kb, and L10335/1.8 kb. Each NSP gene products A, B, and C have a unique 5' end but share the same 3' end. FEATURES Location/Qualifiers source 1..3202 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="carcinoid lung tumor" 5'UTR 1..122 /gene="NSP" gene 1..3202 /gene="NSP" mRNA 1..3202 /gene="NSP" /note="3.4kb transcript" CDS 123..2453 /gene="NSP" /codon_start=1 /evidence=experimental /product="nueroendocrine-specific protein A" /db_xref="PID:g307307" /translation="MAAPGDPQDELLPLAGPGSQWLRHRGEGENEAVTPKGATPAPQA GEPSPGLGARAREAASREAGSGPARQSPVAMETASTGVAGVSSAMDHTFSTTSKDGEG SCYTSLISDICYPPQEDSTYFTGILQKENGHVTISESPEELGTPGPSLPDVPGIESRG LFSSDSGIEMTPAESTEVNKILADPLDQMKAEAYKYIDITRPEEVKHQEQHHPELEDK DLDFKNKDTDISIKPEGVREPDKPAPVEGKIIKDHLLEESTFAPYIDDLSEEQRRAPQ ITTPVKITLTEIEPSVETTTQEKTPEKQDICLKPSPDTVPTVTVSEPEDDSPGSITPP SSGTEPSAAESQGKGSISEDELITAIKEAKGLSYETAENPRPVGQLADRPEVKARSGP PTIPSPLDHEASSAESGDSEIELVSEDPMAAEDALPSGYVSFGHVGGPPPSPASPSIQ YSILREEREAELDSELIIESCDASSASEESPKREQDSPPMKPSALDAIREETGVRAEE RAPSRRGLAEPGSFLDYPSTEPQPGPELPPGDGALEPETPMLPRKPEEDSSSNQSPAA TKGPGPLGPGAPPPLLFLNKQKAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSVVSVV AYLALAALSATISFRIYKSVLQAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQFY VNSTLKELRRLFLVQDLVDSLKFAVLMWLLTYVGALFNGLTLLLMAVVSMFTLPVVYV KHQAQIDQYLGLVRTHINAVVAKIQAKIPGAKRHAE" 3'UTR 2454..3202 /gene="NSP" polyA_signal 2943..2948 /gene="NSP" polyA_signal 2996..3001 /gene="NSP" polyA_signal 3000..3005 /gene="NSP" polyA_signal 3008..3013 /gene="NSP" polyA_signal 3196..3201 /gene="NSP" BASE COUNT 784 a 891 c 825 g 702 t ORIGIN 1 ctgagacacc gcagcttccc tgagcgccga gtccctccgg ggacagcagc agggagcgcc 61 cgcgcagcca ccgagcctct gcccagccaa gccgccgtcg ccgcgccggg ggaccgccag 121 ccatggccgc gccgggggat ccgcaggacg agctgctgcc gctggccggc cccgggtccc 181 agtggctcag gcaccgaggg gagggggaga acgaagcggt gacgccgaaa ggggccacgc 241 cggcgccgca ggctggggag cccagcccgg ggttgggcgc cagggcccgg gaagcggcgt 301 cgcgggaagc cggctcgggc cccgcccggc agtcgcccgt tgccatggaa actgcatcca 361 caggtgtggc aggtgtttcc agtgccatgg accacacctt ctcaacaaca tcaaaagatg 421 gggaaggatc gtgttacaca tctctcattt ctgacatctg ctatccacct caggaggatt 481 ctacatattt tactggaatt cttcagaagg aaaatggcca cgtcaccatt tcagagagcc 541 ctgaggagct gggtacaccc ggcccctcct taccagatgt gcctgggata gagtctcgtg 601 gcttatttag ttctgattct ggaatagaga tgactcctgc agagtccacg gaagtgaaca 661 agatcttagc agaccctctg gaccagatga aagcagaggc ctataaatac attgacataa 721 ccagacccga ggaggtgaag caccaagaac aacatcaccc cgagctggaa gataaagact 781 tggactttaa gaataaagac actgacatct caattaaacc tgaaggagtc cgtgaacctg 841 acaaaccagc tcctgtggag ggaaaaatca tcaaggacca tttattggaa gaatccacat 901 ttgctccata catagatgat ctctctgaag aacagcgcag ggctcctcag atcaccaccc 961 ctgtcaaaat cacactgacg gaaatagaac cttctgttga aaccactacc caagagaaga 1021 cccctgagaa gcaagatata tgtctaaagc caagtcctga cacagtcccc actgtcactg 1081 tctcggagcc tgaagacgac agcccaggat ctatcacccc tccatcttct ggaacagaac 1141 catctgctgc agaatcccag gggaaaggca gcatctccga ggatgagctg atcaccgcca 1201 tcaaagaagc aaagggatta tcgtatgaaa ccgccgagaa cccacggccg gtgggccagc 1261 tggccgacag gcccgaggtc aaggccaggt ccggaccgcc aaccatcccc agccccctgg 1321 accacgaggc cagcagcgcg gagtcggggg actcagagat cgagctggtg tccgaggacc 1381 ccatggccgc ggaggacgcg ctgccctcag gctatgtgag ctttggccac gtgggcggcc 1441 cgccgccctc gcccgcctcg ccatccatcc agtacagcat cctgagggag gagcgcgagg 1501 ccgagctgga cagcgagctc atcatcgagt cgtgcgacgc ctcctcggcc tcggaggaga 1561 gccccaagcg ggagcaggac tcacccccga tgaagcccag cgccctggat gccatccggg 1621 aggagactgg cgtccgggcc gaggagcgtg cgccaagccg gcggggcctg gccgagccgg 1681 gttccttcct cgactacccc tcaactgagc cccagcctgg ccccgagctg ccccctggag 1741 acggagccct ggagcctgag acgcccatgt tgccacggaa gcctgaagaa gactcgagtt 1801 ccaaccaaag tcctgcggcc acaaagggcc ctgggcctct aggtcctggc gccccgcccc 1861 cactgctgtt tctcaataag caaaaagcta ttgacctgtt gtattggcgg gacatcaagc 1921 agacgggcat cgtgtttggg agtttcctgc tgctgctctt ctccctgacc cagttcagcg 1981 tggtgagcgt cgtggcctac ctggccctgg ccgcactctc agccaccatc agtttccgca 2041 tctacaagtc tgttttacaa gcagtgcaga aaaccgacga aggccaccct ttcaaggcct 2101 acttggagct tgagatcacc ctttctcagg agcagattca gaagtacacg gactgcctgc 2161 agttctacgt gaacagcaca cttaaggaac tgaggaggct cttccttgtc caggacctgg 2221 tggattcctt aaaatttgca gtcctgatgt ggctcctgac ctacgttggc gctctcttca 2281 atggcctgac cctgctgctc atggctgtgg tttcaatgtt tactctacct gtagtgtatg 2341 ttaagcacca ggcacagatt gaccaatatc tgggacttgt gaggactcac ataaatgctg 2401 ttgtggcaaa gattcaggct aaaatcccag gcgctaagag gcacgctgag taaactgatt 2461 tcccaccggg gactggacac aaacaggaat gtctggagtg gtaacagctc tcttcttact 2521 cattactgca aattgattgt ctttcccccc tccctccagt accataatct tagagacaaa 2581 ccttaaaaca gctgttttta ggctgttcct tgtactctta ggatatttga gtcacttgtg 2641 tcaaccacta aagtatagag aaaagtgtat tagatgtggt ttttaatttt gtgttgctaa 2701 aaaaagtgca tgatggtgag agcccaagtt atctttccct cttcggtgtt cttcttctct 2761 tctctgcaat gcttctgtag cttctaatgt tccccgtggc taggcctttc ctgccgagtg 2821 ctctgatgca atagtggaaa tcgcttatat gtccttgggt tgctggttgg attaatcttt 2881 aataacaata tatagaattg tagactgatg ttttagcatt tttccaacac acacaacgta 2941 aaaataaaag cagtcgaccg cacttatggt aatcagtttt gtataactta aaataattaa 3001 ataaatgaat aaatccaaaa caaacatgca gtacttttgt tgtatgggat tggtgggctg 3061 atttacatgt atggttacta aaaagtacca gcatgttaac tttattacaa tttgtattac 3121 tttctctgta gttcctaatg gattcaatta cggactctgg atatttgcac ttatgtactt 3181 gatactgaat gcataaataa at // LOCUS HUMNTCP 1580 bp mRNA PRI 29-JUN-1994 DEFINITION Human Na/taurocholate cotransporting polypeptide mRNA, complete cds. ACCESSION L21893 NID g410213 KEYWORDS Na/taurocholate cotransporting polypeptide. SOURCE Homo sapiens adult liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1580) AUTHORS Hagenbuch,B. and Meier,P.J. TITLE Molecular cloning, chromosomal localization, and functional characterization of a human liver Na/bile acid cotransporter JOURNAL J. Clin. Invest. 93, 1326-1331 (1994) MEDLINE 94179485 FEATURES Location/Qualifiers source 1..1580 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="hepatocyte" /dev_stage="adult" /tissue_type="liver" /map="14" CDS 83..1132 /standard_name="NTCP" /codon_start=1 /function="mediates transport of bile acids" /evidence=experimental /product="Na/taurocholate cotransporting polypeptide" /db_xref="PID:g410214" /translation="MEAHNASAPFNFTLPPNFGKRPTDLALSVILVFMLFFIMLSLGC TMEFSKIKAHLWKPKGLAIALVAQYGIMPLTAFVLGKVFRLKNIEALAILVCGCSPGG NLSNVFSLAMKGDMNLSIVMTTCSTFCALGMMPLLLYIYSRGIYDGDLKDKVPYKGIV ISLVLVLIPCTIGIVLKSKRPQYMRYVIKGGMIIILLCSVAVTVLSAINVGKSIMFAM TPLLIATSSLMPFIGFLLGYVLSALFCLNGRCRRTVSMETGCQNVQLCSTILNVAFPP EVIGPLFFFPLLYMIFQLGEGLLLIAIFWCYEKFKTPKDKTKMIYTAATTEETIPGAL GNGTYKGEDCSPCTA" polyA_site 1580 /note="putative" BASE COUNT 400 a 434 c 341 g 405 t ORIGIN 1 aaagaaggca tccagcaaga actgcacaag aaacggagtc agccggagaa caaggagtgg 61 tcttccactg cctcacagga ggatggaggc ccacaacgcg tctgccccat tcaacttcac 121 cctgccaccc aactttggca agcgccccac agacctggca ctgagcgtca tcctggtgtt 181 catgttgttc ttcatcatgc tctcgctggg ctgcaccatg gagttcagca agatcaaggc 241 tcacttatgg aagcctaaag ggctggccat cgccctggtg gcacagtatg gcatcatgcc 301 cctcacggcc tttgtgctgg gcaaggtctt ccggctgaag aacattgagg cactggccat 361 cttggtctgt ggctgctcac ctggagggaa cctgtccaat gtcttcagtc tggccatgaa 421 gggggacatg aacctcagca ttgtgatgac cacctgctcc accttctgtg cccttggcat 481 gatgcctctc ctcctgtaca tctactccag ggggatctat gatggggacc tgaaggacaa 541 ggtgccctat aaaggcatcg tgatatcact ggtcctggtt ctcattcctt gcaccatagg 601 gatcgtcctc aaatccaaac ggccacaata catgcgctat gtcatcaagg gagggatgat 661 catcattctc ttgtgcagtg tggccgtcac agttctctct gccatcaatg tggggaagag 721 catcatgttt gccatgacac cactcttgat tgccacctcc tccctgatgc cttttattgg 781 ctttctgctg ggttatgttc tctctgctct cttctgcctc aatggacggt gcagacgcac 841 tgtcagcatg gagactggat gccaaaatgt ccaactctgt tccaccatcc tcaatgtggc 901 ctttccacct gaagtcattg gaccactttt cttctttccc ctcctctaca tgattttcca 961 gcttggagaa gggcttctcc tcattgccat attttggtgc tatgagaaat tcaagactcc 1021 caaggataaa acaaaaatga tctacacagc tgccacaact gaagaaacaa ttccaggagc 1081 tctgggaaat ggcacctaca aaggggagga ctgctcccct tgcacagcct agcccttccc 1141 ctggtggcct ggattctggt cccaaagcaa ttctgaaagc cagtgtggta aactagagag 1201 agcagcaaaa acaccagtct tgcctgagtc tttctccagc atttccagta catctatcag 1261 aatcatcaag tcttggccgg gaacacagac agggtgtcta cccaagaagc ctcacctatc 1321 cccaacttag aatttgctac ttattttaaa gacttgttca gtgactgtaa actctatgaa 1381 accagaaacc gaatctgcct cttgctggga tctctaaaag tgtctgataa gcatcttaaa 1441 gtcactcaat tcctgaacta atcaatatat atgtttaacc cattactcaa atacccaaat 1501 cccattccaa gttttgtgac ccaaaagaga aataaatgct cacaagtgct gtagaattaa 1561 acttcagaag ttctaacctt // LOCUS HUMNUCLEOB 1650 bp mRNA PRI 25-FEB-1993 DEFINITION Human nucleobindin precursor mRNA, complete cds. ACCESSION M96824 NID g189307 KEYWORDS DNA-binding protein; nucleobindin. SOURCE Homo sapiens (library: lambda gt11) pheochromacytoma cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1650) AUTHORS Miura,K., Titani,K., Kurosawa,Y. and Kanai,Y. TITLE Molecular cloning of nucleobindin, a novel DNA-binding protein that contains both a signal peptide and a leucine zipper structure JOURNAL Biochem. Biophys. Res. Commun. 187, 375-380 (1992) MEDLINE 92392352 FEATURES Location/Qualifiers source 1..1650 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="pheochromacytoma" /tissue_lib="lambda gt11" CDS 40..1422 /note="precursor nucleobindin; putative" /codon_start=1 /product="nucleobindin" /db_xref="PID:g189308" /translation="MPPSGPRGTLLLSLLLLLLLRAVLAVPLERGAPNKEETPATESP DTGLYYHRYLQEVIDVLETDGHFREKLQAANAEDIKSGKLSRELDFVSHHVRTKLDEL KRQEVSRLRMLLKAKMDAEQDPNVQVDHLNLLKQFEHLDPQNQHTFEARDLELLIQTA TRDLAQYDAAHHEEFKRYEMLKEHERRRYLESLGEEQRKEAERKLEEQQRRHREHPKV NVPGSQAQLKEVWEELDGLDPNRFNPKTFFILHDINSDGVLDEQELEALFTKELEKVY DPKNEEDDMREMEEERLRMREQLMKNVDTNQDRLVTLEEFLASTQRKEFGDTGEGWET VEMHPAYTEEELRRFEEELAAREAELNAKAQRLSQETEALGRSQGRLEAKKRELLLAV LHMEQRKQQQQQQQGHKAPAAHPEGQLKFHPDTDDVPVPAPAGDQKEVDTSEKKLLER LPEVEVPQHL" sig_peptide 40..>114 /note="3' end uncertain; putative" mat_peptide <115..1419 /note="5' end uncertain" /product="nucleobindin" BASE COUNT 380 a 473 c 528 g 269 t ORIGIN 1 ggaaaacgcc ctctgcggtg aaggagagac cacactgcca tgcctccctc tgggccccga 61 ggaaccctcc ttctgtcgct gctgctgctg ctcctgcttc gcgccgtgct ggctgtcccc 121 ctggagcgag gggcgcccaa caaggaggag acccctgcga ctgagagtcc cgacacaggc 181 ctgtactacc accggtacct ccaggaggtc atcgatgtac tggagacgga tgggcatttc 241 cgagagaagc tgcaggctgc caatgcggag gacatcaaga gcgggaagct gagccgagag 301 ctggactttg tcagccacca cgtccgcacc aagctggatg agctcaagcg acaggaggtg 361 tcacggctgc ggatgctgct caaggccaag atggacgccg agcaggatcc caatgtacag 421 gtggatcatc tgaatctcct gaaacagttt gaacacctgg accctcagaa ccagcataca 481 ttcgaggccc gcgacctgga gctgctgatc cagacggcca cccgggacct tgcccagtac 541 gacgcagccc atcatgaaga gttcaagcgc tacgagatgc ttaaggaaca cgagagacgg 601 cgttatctgg agtcactggg agaggagcag agaaaggagg cggagaggaa gctggaagag 661 caacagcgcc ggcaccgcga gcaccctaaa gtcaacgtgc ctggcagcca agcccagttg 721 aaggaggtgt gggaggagct ggatggactg gaccccaaca ggtttaaccc caagaccttc 781 ttcatactgc atgatatcaa cagtgatggt gtcctggatg agcaggagct ggaggctctc 841 ttcaccaagg agctggagaa agtgtacgac ccaaagaatg aggaggacga catgcgggag 901 atggaggagg agcgactgcg catgcgggag cagttgatga agaatgtgga caccaaccag 961 gaccgcctcg tgaccctgga ggagttcctc gcatccactc agaggaagga gtttggggac 1021 accggggagg gctgggagac agtggagatg caccctgcct acaccgagga agagctgagg 1081 cgctttgaag aggagctggc tgcccgggag gcagagctga atgccaaggc ccagcgcctc 1141 agccaggaga cagaggctct agggcggtcc cagggccgct tggaggccaa gaagagagag 1201 ctgctgctgg ctgtgctgca catggagcag cggaagcagc agcagcagca gcagcaaggc 1261 cacaaggccc cggctgccca ccctgagggg cagctcaagt tccacccaga cacagacgat 1321 gtacctgtcc cagctccagc cggtgaccag aaggaggtgg acacttcaga aaagaaactt 1381 ctcgagcggc tccctgaggt tgaggtgccc cagcatctgt gatctcggac cccagccctc 1441 aggattcctg atgctccaag gcgactgatg ggcgctggat gaagtggcac agtcagcttc 1501 cctgggggcc ggtgtcatgt tgggctcctg gggcggggca cggcctggca tttcaccgat 1561 tgctgccacc ccagatccac ctgtctccac tttcacagcc tccaagtctg tggctcttcc 1621 cttctgtcct ccgaggggct tgccttctct // LOCUS HUMNUCPHOX 835 bp mRNA PRI 07-JUN-1994 DEFINITION Human nuclear phosphoprotein mRNA, complete cds. ACCESSION L22342 NID g402204 KEYWORDS nuclear protein; phosphoprotein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 835) AUTHORS Kadereit,S., Gewert,D.R., Galabru,J., Hovanessian,A.G. and Meurs,E.F. TITLE Molecular cloning of two new interferon-induced, highly related nuclear phosphoproteins JOURNAL J. Biol. Chem. 268 (32), 24432-24441 (1993) MEDLINE 94043285 FEATURES Location/Qualifiers source 1..835 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_lib="lambda gt11; lambda gt10" CDS 1..747 /note="IFN-induced" /codon_start=1 /product="phosphoprotein" /db_xref="PID:g402205" /translation="MASSGVKNTPRWRRKAPHGRERKEKGKKRKRCIWSTPKRRHKKK SLPREIIDGTSEMNEGKRSQKMPSTPRRVTQGAASPGHGIQEKLQVVDKVTQRKDDST WNSEVMMRVQKARTKCARKSRSKEKKKEKDICSSSKRRFQKNIHRRGKPKSDTVDFHC SKSPVTCGEAKGILYKKKMKHGSSVKCIRNEDGTWLTPNEFEVEGKGRNAKNWKRNIR CEGMTLGELLKSGLLLCPPRINLKRELNSK" BASE COUNT 307 a 158 c 216 g 154 t ORIGIN 1 atggcgagca gcggagtcaa gaacacacca cgatggcgga gaaaagcccc tcatgggagg 61 gaaaggaaag agaaaggaaa gaaaagaaaa agatgtatct ggtcaactcc aaaaaggaga 121 cataagaaaa aaagcctccc aagagagatc attgatggca cttcagaaat gaatgaagga 181 aagaggtccc agaagatgcc tagtacacca cgaagggtca cacaaggggc agcctcacct 241 gggcatggca tccaagagaa gctccaagtg gtggataagg tgactcaaag gaaagacgac 301 tcaacctgga actcagaggt catgatgagg gtccaaaagg caagaactaa atgtgcccga 361 aagtccagat cgaaagaaaa gaaaaaggag aaagatatct gttcaagctc aaaaaggaga 421 tttcagaaaa atattcaccg aagaggaaaa cccaaaagtg acactgtgga ttttcactgt 481 tctaagtccc ccgtgacctg tggtgaggcg aaagggattt tatataagaa gaaaatgaaa 541 cacggatcct cagtgaagtg cattcggaat gaggatggaa cttggttaac accaaatgaa 601 tttgaagtcg aaggaaaagg aaggaacgca aagaactgga aacggaatat acgttgtgaa 661 ggaatgaccc taggagagct gctgaagagt ggacttttgc tctgtcctcc aagaataaat 721 ctcaagagag agttaaatag caagtgaatt tctactaccc tctcagtcac catgttgcag 781 actttccctg tctggaggct caccttagag cttctgagtt tccaagcccg gaatt // LOCUS HUMNUCTIAR 1401 bp mRNA PRI 09-JUN-1993 DEFINITION Homo sapiens nucleolysin TIAR mRNA, complete cds. ACCESSION M96954 NID g307313 KEYWORDS RNA-binding protein; nucleolysin TIAR. SOURCE Homo sapiens peripheral blood cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1401) AUTHORS Kawakami,A., Tian,Q., Duan,X., Streuli,M., Schlossman,S.F. and Anderson,P. TITLE Identification and functional characterization of a TIA-1-related nucleolysin JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89, 8681-8685 (1992) MEDLINE 92409580 FEATURES Location/Qualifiers source 1..1401 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lymphocyte" /tissue_type="peripheral blood" 5'UTR 1..45 CDS 46..1173 /codon_start=1 /function="RNA binding protein" /product="nucleolysin TIAR" /db_xref="PID:g189310" /translation="MMEDDGQPRTLYVGNLSRDVTEVLILQLFSQIGPCKSCKMITEH TSNDPYCFVEFYEHRDAAAALAAMNGRKILGKEVKVNWATTPSSQKKDTSNHFHVFVG DLSPEITTEDIKSAFAPFGKISDARVVKDMATGKSKGYGFVSFYNKLDAENAIVHMGG QWLGGRQIRTNWATRKPPAPKSTQENNTKQLRFEDVVNQSSPKNCTVYCGGIASGLTD QLMRQTFSPFGQIMEIRVFPEKGYSFVRFSTHESAAHAIVSVNGTTIEGHVVKCYWGK ESPDMTKNFQQVDYSQWGQWSQVYGNPQQYGQYMANGWQVPPYGVYGQPWNQQGFGVD QSPSAAWMGGFGAQPPQGQAPPPVIPPPNQAGYGMASYQTQ" 3'UTR 1171..1401 BASE COUNT 436 a 277 c 318 g 370 t ORIGIN 1 accctgccct cggccttgtc ccgggatcgc tccgtcgcac ccaccatgat ggaagacgac 61 gggcagcccc ggactctata cgtaggtaac ctttccagag atgtgacaga agtccttata 121 cttcagttgt tcagtcagat tggaccctgt aaaagctgta aaatgataac agagcataca 181 agcaatgacc catattgctt tgtggaattt tatgaacaca gagatgcagc tgctgcatta 241 gctgctatga atgggagaaa aattttggga aaggaggtca aagtaaactg ggcaaccaca 301 ccaagtagcc agaaaaaaga tacttccaat cacttccatg tgtttgttgg ggatttgagt 361 ccagaaatta caacagaaga tatcaaatca gcatttgccc cctttggtaa aatatcggat 421 gcccgggtag ttaaagacat ggcaactgga aaatccaaag gctatggttt tgtatctttt 481 tataacaaac tggatgcaga aaatgcgatt gtgcatatgg gcggtcagtg gttgggtggt 541 cgtcaaatcc gaaccaattg ggccactcgt aaaccacctg cacctaaaag tacacaagaa 601 aacaacacta agcagttgag atttgaagat gtagtaaacc agtcaagtcc aaaaaattgt 661 actgtgtact gtggaggaat tgcgtctggg ttaacagatc agcttatgag acagacattc 721 tcaccatttg gacaaattat ggaaataaga gttttcccag aaaagggcta ttcatttgtc 781 agattttcaa cccatgaaag tgcagcccat gccattgttt cggtgaacgg tactacgatt 841 gaaggacatg tggttaaatg ctattggggt aaagaatctc ctgatatgac taaaaacttc 901 caacaggttg actatagtca atggggccaa tggagccaag tgtatggaaa cccacaacag 961 tatggacagt atatggcaaa tgggtggcaa gtaccgcctt atggagtata cgggcaacca 1021 tggaatcaac aaggatttgg agtagatcaa tcaccttctg ctgcttggat gggtggattt 1081 ggtgctcagc ctccccaagg acaagctcct ccccctgtaa tacctcctcc taaccaagcc 1141 ggatatggta tggcaagtta ccaaacacag tgagccggga ctctaaaaaa aaattgtaat 1201 tcatgatagg cttcgatttc ctgtgacact ctgaagacat gaaagtagac atcggaaaat 1261 gaaaatattt attttaaaaa ttgaaatgtt tggaaccttt agcacagatt tgctttggtg 1321 aaggacacgt gtcttctagt tctgcctttt taagtttttg ttcatgatgg atatgaacat 1381 gatttttctt tatgtacaaa a // LOCUS HUMNUP358G 10677 bp DNA PRI 09-JUN-1995 DEFINITION Homo sapiens nucleoporin (NUP358) gene, complete cds. ACCESSION L41840 NID g857367 KEYWORDS nucleoporin. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 10677) AUTHORS Wu,J., Matunis,M.J., Kraemer,D., Blobel,G. and Coutavas,E. TITLE Nup358, a cytoplasmically exposed nucleoporin with peptide repeats, Ran-GTP binding sites, zinc fingers, a cyclophilin A homologous domain, and a leucine-rich region JOURNAL J. Biol. Chem. 270 (23), 14209-14213 (1995) MEDLINE 95294031 FEATURES Location/Qualifiers source 1..10677 /organism="Homo sapiens" /note="(vector lambda EXlox)" /db_xref="taxon:9606" /cell_line="HeLa" mRNA <114..>9788 /gene="NUP358" gene 114..9788 /gene="NUP358" CDS 114..9788 /gene="NUP358" /codon_start=1 /product="nucleoporin" /db_xref="PID:g857368" /translation="MRRSKADVERYIASVQGSTPSPRQKSMKGFYFAKLYYEAKEYDL AKKYICTYINVQERDPKAHRFLGLLYELEENTDKAVECYRRSVELNPTQKDLVLKIAE LLCKNDVTDGRAKYWLERAAKLFPGSPAIYKLKEQLLDCEGEDGWNKLFDLIQSELYV RPDDVHVNIRLVEVYRSTKRLKDAVAHCHEAERNIALRSSLEWNSCVVQTLKEYLESL QCLESDKSDWRATNTDLLLAYANLMLLTLSTRDVQESRELLQSFDSALQSVKSLGGND ELSATFLEMKGHFYMHAGSLLLKMGQHSSNVQWRALSELAALCYLIAFQVPRPKIKLI KGEAGQNLLEMMACDRLSQSGHMLLNLSRGKQDFLKEIVETFANKSGQSALYDALFSS QSPKDTSFLGSDDIGNIDVREPELEDLTRYDVGAIRAHNGSLQHLTWLGLQWNSLPAL PGIRKWLKQLFHHLPHETSRLETNAPESICILDLEVFLLGVVYTSHLQLKEKCNSHHS SYQPLCLPLPVCKQLCTERQKSWWDAVCTLIHRKAVPGNVAKLRLLVQHEINTLRAQE KHGLQPALLVHWAECLQKTGSGLNSFYDQREYIGRSVHYWKKVLPLLKIIKKKNSIPE PIDPLFKHFHSVDIQASEIVEYEEDAHITFAILDAVNGNIEDAVTAFESIKSVVSYWN LALIFHRKAEDIENDALSPEEQEECKNYLRKTRDYLIKIIDDSDSNLSVVKKLPVPLE SVKEMLNSVMQELEDYSEGGPLYKNGSLRNADSEIKRSTPSPTRYSLSPSKSYKYSPK TPPRWAEDQNSLLKMICQQVEAIKKEMQELKLNSSNSASPHRWPTENYGPDSVPDGYQ GSQTFHGAPLTVATTGPSVYYSQSPAYNSQYLLRPAANVTPTKGPVYGMNRLPPQQHI YAYPQQMHTPPVQSSSACMFSQEMYGPPALRFESPATGILSPRGDDYFNYNVQQTSTN PPLPEPGYFTKPPIAAHASRSAESKTIEFGKTNFVQPMPGEGLRPSLPTQAHTTQPTP FKFNSNFKSNDGDFTFSSPQVVTQPPPAAYSNSESLLGLLTSDKPLQGDGYSGAKPIP GGQTIGPRNTFNFGSKNVSGISFTENMGSSQQKNSGFRRSDDMFTFHGPGKSVFGTPT LETANKNHETDGGSAHGDDDDDGPHFEPVVPLPDKIEVKTGEEDEEEFFCNRAKLFRF DVESKEWKERGIGNVKILRHKTSGKIRLLMRREQVLKICANHYISPDMKLTPNAGSDR SFVWHALDYADELPKPEQLAIRFKTPEEAALFKCKFEEAQSILKAPGTNVAMASNQAV RIVKEPTSHDNKDICKSDAGNLNFEFQVAKKEGSWWHCNSCSLKNASTAKKCVSCQNL NPSNKELVGPPLAETVFTPKTSPENVQDRFALVTPKKEGHWDCSICLVRNEPTVSRCI ACQNTKSANKSGSSFVHQASFKFGQGDLPKPINSDFRSVFSTKEGQWDCSACLVQNEG SSTKCAACQNPRKQSLPATSIPTPASFKFGTSETSKTLKSGFEDMFAKKEGQWDCSSC LVRNEANATRCVACQNPDKPSPSTSVPAPASFKFGTSETSKAPKSGFEGMFTKKEGQW DCSVCLVRNEASATKCIACQNPGKQNQTTSAVSTPASSETSKAPKSGFEGMFTKKEGQ WDCSVCLVRNEASATKCIACQNPGKQNQTTSAVSTPASSETSKAPKSGFEGMFTKKEG QWDCSVCLVRNEASATKCIACQCPSKQNQTTAISTPASSEISKAPKSGFEGMFIRKGQ WDCSVCCVQNESSSLKCVACDASKPTHKPIAEAPSAFTLGSEMKLHDSSGSQVGTGFK SNFSEKASKFGNTEQGFKFGHVDQENSPSFMFQGSSNTEFKSTKEGFSIPVSADGFKF GISEPGNQEKKSEKPLENGTGFQAQDISGQKNGRGVIFGQTSSTFTFADLAKSTSGEG FQFGKKDPNFKGFSGAGEKLFSSQYGKMANKANTSGDFEKDDDAYKTEDSDDIHFEPV VQMPEKVELVTGEEDEKVLYSQRVKLFRFDAEVSQWKERGLGNLKILKNEVNGKLRML MRREQVLKVCANHWITTTMNLKPLSGSDRAWMWLASDFSDGDAKLEQLAAKFKTPELA EEFKQKFEECQRLLLDIPLQTPHKLVDTGRAAKLIQRAEEMKSGLKDFKTFLTNDQTK VTEEENKGSGTGAAGASDTTIKPNPENTGPTLEWDNYDLREDALDDSVSSSSVHASPL ASSPVRKNLFRFGESTTGFNFSFKSALSPSKSPAKLNQSGTSVGTDEESDVTQEEERD GQYFEPVVPLPDLVEVSSGEENEQVVFSHRAKLYRYDKDVGQWKERGIGDIKILQNYD NKQVRIVMRRDQVLKLCANHRITPDMTLQNMKGTERVWLWTACDFADGERKVEHLAVR FKLQDVADSFKKIFDEAKTAQEKDSLITPHVSRSSTPRESPCGKIAVAVLEETTRERT DVIQGDDVADATSEVEVSSTSETTPKAVVSPPKFVFGSESVKSIFSSEKSKPFAFGNS SATGSLFGFSFNAPLKSNNSETSSVAQSGSESKVEPKKCELSKNSDIEQSSDSKVKNL FASFPTEESSINYTFKTPEKAKEKKKPEDSPSDDDVLIVYELTPTAEQKALATKLKLP PTFFCYKNRPDYVSEEEEDDEDFETAVKKLNGKLYLDGSEKCRPLEENTADNEKECII VWEKKPTVEEKAKADTLKLPPTFFCGVCSDTDEDNGNGEDFQSELQKVQEAQKSQTEE ITSTTDSVYTGGTEVMVPSFCKSEEPDSITKSISSPSVSSETMDKPVDLSTRKEIDTD STSQGESKIVSFGFGSSTGLSFADLASSNSGDFAFGSKDKNFQWANTGAAVFGTQSVG TQSAGKVGEDEDGSDEEVVHNEDIHFEPIVSLPEVEVKSGEEDEEILFKERAKLYRWD RDVSQWKERGVGDIKILWHTMKNYYRILMRRDQVFKVCANHVITKTMELKPLNVSNNA LVWTASDYADGEAKVEQLAVRFKTKEVADCFKKTFEECQQNLMKLQKGHVSLAAELSK ETNPVVFFDVCADGEPLGRITMELFSNIVPRTAENFRALCTGEKGFGFKNSIFHRVIP DFVCQGGDITKHDGTGGQSIYGDKFEDENFDVKHTGPGLLSMANQGQNTNNSQFVITL KKAEHLDFKHVVFGFVKDGMDTVKKIESFGSPKGSVCRRITITECGQI" BASE COUNT 3475 a 1907 c 2331 g 2964 t ORIGIN 1 ggatcgaatc gcggccgcgt cgacggtttg caggcgcttt cctcttggaa gtggcgactg 61 ctgcgggcct gagcgctggt ctcacgcgcc tcgggagcca ggttggcggc gcgatgaggc 121 gcagcaaggc tgacgtggag cggtacatcg cctcggtgca gggctccacc ccgtcgcctc 181 gacagaagtc aatgaaagga ttctattttg caaagctgta ttatgaagct aaagaatatg 241 atcttgctaa aaaatacata tgtacttaca ttaatgtgca agagagggat cccaaagctc 301 acagatttct gggtcttctt tatgaattgg aagaaaacac agacaaagcc gttgaatgtt 361 acaggcgttc agtggaatta aacccaacac aaaaagatct tgtgttgaag attgcagaat 421 tgctttgtaa aaatgatgtt actgatggaa gagcaaaata ctggcttgaa agagcagcca 481 aacttttccc aggaagtcct gcaatttata aactaaagga acagcttcta gattgtgaag 541 gtgaagatgg atggaataaa ctttttgact tgattcagtc agaactttat gtaagacctg 601 atgacgtcca tgtgaacatc cggctagtgg aggtgtatcg ctcaactaaa agattgaagg 661 atgctgtggc ccactgccat gaggcagaga ggaacatagc tttgcgttca agtttagaat 721 ggaattcgtg tgttgtacag acccttaagg aatatctgga gtctttacag tgtttggagt 781 ctgataaaag tgactggcga gcaaccaata cagacttact gctggcctat gctaatctta 841 tgcttcttac gctttccact agagatgtgc aggaaagtag agaattactg caaagttttg 901 atagtgctct tcagtctgtg aaatctttgg gtggaaatga tgaactgtca gctactttct 961 tagaaatgaa aggacatttc tacatgcatg ctggttctct gcttttgaag atgggtcagc 1021 atagtagtaa tgttcaatgg cgagctcttt ctgagctggc tgcattgtgc tatctcatag 1081 catttcaggt tccaagacca aagattaaat taataaaagg tgaagctgga caaaatctgc 1141 tggaaatgat ggcctgtgac cgactgagcc aatcagggca catgttgcta aacttaagtc 1201 gtggcaagca agatttttta aaagagattg ttgaaacttt tgccaacaaa agcgggcagt 1261 ctgcattata tgatgctctg ttttctagtc agtcacctaa ggatacatct tttcttggta 1321 gcgatgatat tggaaacatt gatgtacgag aaccagagct tgaagatttg actagatacg 1381 atgttggtgc tattcgagca cataatggta gtcttcagca ccttacttgg cttggcttac 1441 agtggaattc attgcctgct ttacctggaa tccgaaaatg gctaaaacag cttttccatc 1501 atttgcccca tgaaacctca aggcttgaaa caaatgcacc tgaatcaata tgtattttag 1561 atcttgaagt atttctcctt ggagtagtat ataccagcca cttacaatta aaggagaaat 1621 gtaattctca ccacagctcc tatcagccgt tatgcctgcc ccttcctgtg tgtaaacagc 1681 tttgtacaga aagacaaaaa tcttggtggg atgcggtttg tactctgatt cacagaaaag 1741 cagtacctgg aaacgtagca aaattgagac ttctagttca gcatgaaata aacactctaa 1801 gagcccagga aaaacatggc cttcaacctg ctctgcttgt acattgggca gaatgccttc 1861 agaaaacggg cagcggtctt aattcttttt atgatcaacg agaatacata gggagaagtg 1921 ttcattattg gaagaaagtt ttgccattgt tgaagataat aaaaaagaag aacagtattc 1981 ctgaacctat tgatcctctg tttaaacatt ttcatagtgt agacattcag gcatcagaaa 2041 ttgttgaata tgaagaagac gcacacataa cttttgctat attggatgca gtaaatggaa 2101 atatagaaga tgctgtgact gcttttgaat ctataaaaag tgttgtttct tattggaatc 2161 ttgcactgat ttttcacagg aaggcagaag acattgaaaa tgatgccctt tctcctgaag 2221 aacaagaaga atgcaaaaat tatctgagaa agaccaggga ctacctaata aagattatag 2281 atgacagtga ttcaaatctt tcagtggtca agaaattgcc tgtgcccctg gagtctgtaa 2341 aagagatgct taattcagtc atgcaggaac tcgaagacta tagtgaagga ggtcctctct 2401 ataaaaatgg ttctttgcga aatgcagatt cagaaataaa acgttctaca ccgtctccta 2461 ccagatattc actatcacca agtaaaagtt acaagtattc tcccaaaaca ccacctcgat 2521 gggcagaaga tcagaattct ttactgaaaa tgatttgcca acaagtagag gccattaaga 2581 aagaaatgca ggagttgaaa ctaaatagca gtaactcagc atcccctcat cgttggccca 2641 cagagaatta tggaccagac tcggtgcctg atggatatca ggggtcacag acatttcatg 2701 gggctccact aacagttgca actactggcc cttcagtata ttatagtcag tcaccagcat 2761 ataattccca gtatcttctc agaccagcag ctaatgttac tcccacaaag ggcccagtct 2821 atggcatgaa taggcttcca ccccaacagc atatttatgc ctatccgcaa cagatgcaca 2881 caccgccagt gcaaagctca tctgcttgta tgttctctca ggagatgtat ggtcctcctg 2941 cattgcgttt tgagtctcct gcaacgggaa ttctatcgcc caggggtgat gattacttta 3001 attacaatgt tcaacagaca agcacaaatc cacctttgcc agaaccagga tatttcacaa 3061 aacctccgat tgcagctcat gcttcaagat ctgcagaatc taagactata gaatttggga 3121 aaactaattt tgttcagccc atgccgggtg aaggattaag gccatctttg ccaacacaag 3181 cacacacaac acagccaact ccttttaaat ttaactcaaa tttcaaatca aatgatggtg 3241 acttcacgtt ttcctcacca caggttgtga cacagccccc tcctgcagct tacagtaaca 3301 gtgaaagcct tttaggtctc ctgacttcag ataaaccctt gcaaggagat ggctatagtg 3361 gagccaaacc aattcctggt ggtcaaacca ttgggcctcg aaatacattc aattttggaa 3421 gcaaaaatgt gtctggaatt tcatttacag aaaacatggg gtcgagtcag caaaagaatt 3481 ctggttttcg gcgaagtgat gatatgttta ctttccatgg tccagggaaa tcagtatttg 3541 gaacacccac tttagagaca gcaaacaaga atcatgagac agatggagga agtgcccatg 3601 gggatgatga tgatgacggt cctcactttg agcctgtagt acctcttcct gataagattg 3661 aagtaaaaac tggtgaggaa gatgaagaag aattcttttg caaccgcgcg aaattgtttc 3721 gtttcgatgt agaatccaaa gaatggaaag aacgtgggat tggcaatgta aaaatactga 3781 ggcataaaac atctggtaaa attcgccttc taatgagacg agagcaagta ttgaaaatct 3841 gtgcaaatca ttacatcagt ccagatatga aattgacacc aaatgctgga tcagacagat 3901 cttttgtatg gcatgccctt gattatgcag atgagttgcc aaaaccagaa caacttgcta 3961 ttaggttcaa aactcctgag gaagcagcac tttttaaatg caagtttgaa gaagcccaga 4021 gcattttaaa agccccagga acaaatgtag ccatggcgtc aaatcaggct gtcagaattg 4081 taaaagaacc cacaagtcat gataacaagg atatttgcaa atctgatgct ggaaacctga 4141 attttgaatt tcaggttgca aagaaagaag ggtcttggtg gcattgtaac agctgctcat 4201 taaagaatgc ttcaactgct aagaaatgtg tatcatgcca aaatctaaac ccaagcaata 4261 aagagctcgt tggcccacca ttagctgaaa ctgtttttac tcctaaaacc agcccagaga 4321 atgttcaaga tcgatttgca ttggtgactc caaagaaaga aggtcactgg gattgtagta 4381 tttgtttagt aagaaatgaa cctactgtat ctaggtgcat tgcgtgtcag aatacaaaat 4441 ctgctaacaa aagtggatct tcatttgttc atcaagcttc atttaaattt ggccagggag 4501 atcttcctaa acctattaac agtgatttca gatctgtttt ttctacaaag gaaggacagt 4561 gggattgcag tgcatgtttg gtacaaaatg aggggagctc tacaaaatgt gctgcttgtc 4621 agaatccgag aaaacagagt ctacctgcta cttctattcc aacacctgcc tcttttaagt 4681 ttggtacttc agagacaagt aaaactctaa aaagtggatt tgaagacatg tttgctaaga 4741 aggaaggaca gtgggattgc agttcatgct tagtgcgaaa tgaagcaaat gctacaagat 4801 gtgttgcttg tcagaatccg gataaaccaa gtccatctac ttctgttcca gctcctgcct 4861 cttttaagtt tggtacttca gagacaagca aggctccaaa gagcggattt gagggaatgt 4921 tcactaagaa ggagggacag tgggattgca gtgtgtgctt agtaagaaat gaagccagtg 4981 ctaccaaatg tattgcttgt cagaatccag gtaaacaaaa tcaaactact tctgcagttt 5041 caacacctgc ctcttcagag acaagcaagg ctccaaagag cggatttgag ggaatgttca 5101 ctaagaagga gggacagtgg gattgcagtg tgtgcttagt aagaaatgaa gccagtgcta 5161 ccaaatgtat tgcttgtcag aatccaggta aacaaaatca aactacttct gcagtttcaa 5221 cacctgcctc ttcagagaca agcaaggctc caaagagcgg atttgaggga atgttcacta 5281 agaaggaagg acagtgggat tgcagtgtgt gcttagtaag aaatgaagcc agtgctacca 5341 aatgtattgc ttgtcagtgt ccaagtaaac aaaatcaaac aactgcaatt tcaacacctg 5401 cctcttcgga gataagcaag gctccaaaga gtggatttga aggaatgttc atcaggaaag 5461 gacagtggga ttgtagtgtt tgctgtgtac aaaatgagag ttcttcctta aaatgtgtgg 5521 cttgtgatgc ctctaaacca actcataaac ctattgcaga agctccttca gctttcacac 5581 tgggctcaga aatgaagttg catgactctt ctggaagtca ggtgggaaca ggatttaaaa 5641 gtaatttctc agaaaaagct tctaagtttg gcaatacaga gcaaggattc aaatttgggc 5701 atgtggatca agaaaattca ccttcattta tgtttcaggg ttcttctaat acagaattta 5761 agtcaaccaa agaaggattt tccatccctg tgtctgctga tggatttaaa tttggcattt 5821 cggaaccagg aaatcaagaa aagaaaagtg aaaagcctct tgaaaatggt actggcttcc 5881 aggctcagga tattagtggc cagaagaatg gccgtggtgt gatttttggc caaacaagta 5941 gcacttttac atttgcagat cttgcaaaat caacttcagg agaaggattt cagtttggca 6001 aaaaagaccc caatttcaag ggattttcag gtgctggaga aaaattattc tcatcacaat 6061 acggtaaaat ggccaataaa gcaaacactt ccggtgactt tgagaaagat gatgatgcct 6121 ataagactga ggacagcgat gacatccatt ttgaaccagt agttcaaatg cccgaaaaag 6181 tagaacttgt aacaggagaa gaagatgaaa aagttctgta ttcacagcgg gtaaaactat 6241 ttagatttga tgctgaggta agtcagtgga aagaaagggg cttggggaac ttaaaaattc 6301 tcaaaaacga ggtcaatggc aaactaagaa tgctgatgcg aagagaacaa gtactaaaag 6361 tgtgtgctaa tcattggata acgactacga tgaacctgaa gcctctctct ggatcagata 6421 gagcatggat gtggttagcc agtgatttct ctgatggtga tgccaaacta gagcagttgg 6481 cagcaaaatt taaaacacca gagctggctg aagaattcaa gcagaaattt gaggaatgcc 6541 agcggcttct gttagacata ccacttcaaa ctccccataa acttgtagat actggcagag 6601 ctgccaagtt aatacagaga gctgaagaaa tgaagagtgg actgaaagat ttcaaaacat 6661 ttttgacaaa tgatcaaaca aaagtcactg aggaagaaaa taagggttca ggtacaggtg 6721 cggccggtgc ctcagacaca acaataaaac ccaatcctga aaacactggg cccacattag 6781 aatgggataa ctatgattta agggaagatg ctttggatga tagtgtcagt agtagctcag 6841 tacatgcttc tccattggca agtagccctg tgagaaaaaa tcttttccgt tttggtgagt 6901 caacaacagg atttaacttc agttttaaat ctgctttgag tccatctaag tctcctgcca 6961 agttgaatca gagtgggact tcagttggca ctgatgaaga atctgatgtt actcaagaag 7021 aagagagaga tggacagtac tttgaacctg ttgttccttt acctgatcta gttgaagtat 7081 ccagtggtga ggaaaatgaa caagttgttt ttagtcacag ggcaaaactc tacagatatg 7141 ataaagatgt tggtcaatgg aaagaaaggg gcattggtga tataaagatt ttacagaatt 7201 atgataataa gcaagttcgt atagtgatga gaagggacca agtattaaaa ctttgtgcca 7261 atcacagaat aactccagac atgactttgc aaaatatgaa agggacagaa agagtatggt 7321 tgtggactgc atgtgatttt gcagatggag aaagaaaagt agagcattta gctgttcgtt 7381 ttaaactaca ggatgttgca gactcgttta agaaaatttt tgatgaagca aaaacagccc 7441 aggaaaaaga ttctttgata acacctcatg tttctcggtc aagcactccc agagagtcac 7501 catgtggcaa aattgctgta gctgtattag aagaaaccac aagagagagg acagatgtta 7561 ttcagggtga tgatgtagca gatgcaactt cagaagttga agtgtctagc acatctgaaa 7621 caacaccaaa agcagtggtt tctcctccaa agtttgtatt tggttcagag tctgttaaaa 7681 gcatttttag tagtgaaaaa tcaaaaccat ttgcattcgg caacagttca gccactgggt 7741 ctttgtttgg atttagtttt aatgcacctt tgaaaagtaa caatagtgaa actagttcag 7801 tagcccagag tggatctgaa agcaaagtgg aacctaaaaa atgtgaactg tcaaagaact 7861 ctgatatcga acagtcttca gatagcaaag tcaaaaatct ctttgcttcc tttccaacgg 7921 aagaatcttc aatcaactac acatttaaaa caccagaaaa ggcaaaagag aagaaaaaac 7981 ctgaagattc tccctcagat gatgatgttc tcattgtata tgaactaact ccaaccgctg 8041 agcagaaagc ccttgcaacc aaacttaaac ttcctccaac tttcttctgc tacaagaata 8101 gaccagatta tgttagtgaa gaagaggagg atgatgaaga tttcgaaaca gctgtcaaga 8161 aacttaatgg aaaactatat ttggatggct cagaaaaatg tagacccttg gaagaaaata 8221 cagcagataa tgagaaagaa tgtattattg tttgggaaaa gaaaccaaca gttgaagaga 8281 aggcaaaagc agatacgtta aaacttccac ctacattttt ttgtggagtc tgtagtgata 8341 ctgatgaaga caatggaaat ggggaagact ttcaatcaga gcttcaaaaa gttcaggaag 8401 ctcaaaaatc tcagacagaa gaaataacta gcacaactga cagtgtatat acaggtggga 8461 ctgaagtgat ggtaccttct ttctgtaaat ctgaagaacc tgattctatt accaaatcca 8521 ttagttcacc atctgtttcc tctgaaacta tggacaaacc tgtagatttg tcaactagaa 8581 aggaaattga tacagattct acaagccaag gggaaagcaa gatagtttca tttggatttg 8641 gaagtagcac agggctctca tttgcagact tggcttccag taattctgga gattttgctt 8701 ttggttctaa agataaaaat ttccaatggg caaatactgg agcagctgtg tttggaacac 8761 agtcagtcgg aacccagtca gccggtaaag ttggtgaaga tgaagatggt agtgatgaag 8821 aagtagttca taatgaagat atccattttg aaccaatagt gtcactacca gaggtagaag 8881 taaaatctgg agaagaagat gaagaaattt tgtttaaaga gagagccaaa ctttatagat 8941 gggatcggga tgtcagtcag tggaaggagc gcggtgttgg agatataaag attctttggc 9001 atacaatgaa gaattattac cggatcctaa tgagaagaga ccaggttttt aaagtgtgtg 9061 caaaccacgt tattactaaa acaatggaat taaagccctt aaatgtttca aataatgctt 9121 tagtttggac tgcctcagat tatgctgatg gagaagcaaa agtagaacag cttgcagtga 9181 gatttaaaac taaagaagta gctgattgtt tcaagaaaac atttgaagaa tgtcagcaga 9241 atttaatgaa actccagaaa ggacatgtat cactggcagc agaattatca aaggagacca 9301 atcctgtggt gttttttgat gtttgtgcgg acggtgaacc tctagggcgg ataactatgg 9361 aattattttc aaacattgtt cctcggactg ctgagaactt cagagcacta tgcactggag 9421 agaaaggctt tggtttcaag aattccattt ttcacagagt aattccagat tttgtttgcc 9481 aaggaggaga tatcaccaaa catgatggaa caggcggaca gtccatttat ggagacaaat 9541 ttgaagatga aaattttgat gtgaaacata ctggtcctgg tttactatcc atggccaatc 9601 aaggccagaa taccaataat tctcaatttg ttataacact gaagaaagca gaacatttgg 9661 actttaagca tgtagtattt gggtttgtta aggatggcat ggatactgtg aaaaagattg 9721 aatcatttgg ttctcccaaa gggtctgttt gtcgaagaat aactatcaca gaatgtggac 9781 agatataaaa tcattgttgt tcatagaaaa tttcatctgt ataagcagtt ggattgaagc 9841 ttagctatta caatttgata gttatgttca gcttttgaaa atggacgttt ccgatttaca 9901 aatgtaaaat tgcagcttat agctgttgtc actttttaat gtgttataat tgaccttgca 9961 tggtgtgaaa taaaagttta aacactggtg tatttcaggt gtacttgtgt ttatgtactc 10021 ctgacgtatt aaaatggaat aatactaatc ttgttaaaag caatagacct caaactattg 10081 aaggaatatg atatatgcaa tttaatttta attcctttta agatatttgg acttcctgca 10141 tggatatact taccatttga ataaagggac cacaacttgg ataatttaat tttaggtttg 10201 aaatatattt ggtaatctta actattggtg tactcattta tgcatagaga ctcgtttatg 10261 aatgggtaga gccacagaac gtatagagtt aaccaaagtg ctcttctcta gaatctttac 10321 acctcctgtg tggttacaag ttaactttgt aagtagcgta ccttccttcc ttaaaatatc 10381 tagcttcctg tgccctttca tagatattcg attaattttt acattttaaa caagttgact 10441 atttccttta ggggttttgt ttcaaacttt tctgtcatct gtctctacta cctcagaaac 10501 tgcagcttgg ttctgatgat agaaattgaa tttttccttg tagttattgt gataaagtat 10561 gaatattttt agaaagtcta taccatgttc tttcgttaaa gatttgcttt atacaagatt 10621 gttgcagtac ctttttctgg taaattttgt agcagaaata aaatgacaat tcctaag // LOCUS HUMOBCAM 1478 bp mRNA PRI 21-JUL-1994 DEFINITION Human (clone pHOM) opioid-binding cell adhesion molecule mRNA, complete cds. ACCESSION L34774 NID g514373 KEYWORDS opioid-binding cell adhesion molecule. SOURCE Homo sapiens (library: Stratagene brain) occipital cortex cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1478) AUTHORS Shark,K.B. and Lee,N.M. TITLE Cloning, sequencing and localization to chromosome 11 of a cDNA encoding a human opioid-binding cell adhesion molecule (OBCAM) JOURNAL Unpublished (1994) FEATURES Location/Qualifiers source 1..1478 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="occipital cortex" /tissue_lib="Stratagene brain" /map="chromosome 11" CDS 51..1088 /codon_start=1 /product="opioid-binding cell adhesion molecule" /db_xref="PID:g514374" /translation="MGVCGYLFLPWKCLVVVSLRLLFLVPTGVPVRSGDATFPKAMDN VTVRQGESATLRCTIDDRVTRVAWLNRSTILYAGNDKWSIDPRVIILVNTPTQYSIMI QNVDVYDEGPYTCSVQTDNHPKTSRVHLIVQVPPQIMNISSDITVNEGSSVTLLCLAI GRPEPTVTWRHLSVKEGQGFVSEDEYLEISDIKRDQSGEYECSALNDVAAPDVRKVKI TVNYPPYISKAKNTGVSVGQKGILSCEASAVPMAEFQWFKEETRLATGLDGMRIENKG RMSTLTFFNVSEKDYGNYTCVATNKLGNTNASITLYGPGAVIDGVNSASRALACLWLS GTLLAHFFIKF" BASE COUNT 351 a 373 c 371 g 383 t ORIGIN 1 gaccaggact gtgcggctgc cggagtcctg ggaagttgtg gctgtcgaga atgggggtct 61 gtgggtacct gttcctgccc tggaagtgcc tcgtggtcgt gtctctcagg ctgctgttcc 121 ttgtacccac aggagtgccc gtgcgcagcg gagatgccac cttccccaaa gctatggaca 181 acgtgacggt ccggcagggg gagagcgcca ccctcaggtg taccatagat gaccgggtaa 241 cccgggtggc ctggctaaac cgcagcacca tcctctacgc tgggaatgac aagtggtcca 301 tagaccctcg tgtgatcatc ctggtcaata caccaaccca gtacagcatc atgatccaaa 361 atgtggatgt gtatgacgaa ggtccgtaca cctgctctgt gcagacagac aatcatccca 421 aaacgtcccg ggttcaccta atagtgcaag ttcctcctca gatcatgaat atctcctcag 481 acatcactgt gaatgaggga agcagtgtga ccctgctgtg tcttgctatt ggcagaccag 541 agccaactgt gacatggaga cacctgtcag tcaaggaagg ccagggcttt gtaagtgagg 601 atgagtacct ggagatctct gacatcaagc gagaccagtc cggggagtac gaatgcagcg 661 cgttgaacga tgtcgctgcg cccgatgtgc ggaaagtaaa aatcactgta aactatcctc 721 cctatatctc aaaagccaag aacactggtg tttcagtcgg tcagaagggc atcctgagct 781 gtgaagcctc tgcagtcccc atggctgaat tccagtggtt caaggaagaa accaggttag 841 ccactggtct ggatggaatg aggattgaaa acaaaggccg catgtccact ctgactttct 901 tcaatgtttc tgaaaaggat tatgggaact atacttgtgt ggccacgaac aagcttggga 961 acaccaatgc cagcatcaca ttgtatgggc ctggagcagt cattgatggt gtaaactcgg 1021 cctccagagc actggcttgt ctctggctat cagggaccct cttagcccac ttcttcatca 1081 agttttgata agaaatccta ggtcctctga gcaacgcctg cttctcatat cacagacttt 1141 aatctacact gcggagagca aaccagcttg ggcttctttt tgtttttttc tgttattcta 1201 gatttgtttt ctttttgttt ttgtttattt gtttgtttgc ttttatttcc agcttgaatg 1261 agtggggttg ggggcggggt gggcagggtt ctaccacgtg taggataatc attcattggt 1321 gtgtccaaaa atggggtctg ctcctgctac cttgaccctt ccctttcctc tgcttctctc 1381 ctcatcatca ttcccaacaa catcctctgc cacacacaac aaaacgtaag tttcatttgg 1441 gcaaaaattg agcctcacaa taaacaccct gaagacac // LOCUS HUMOBP2A 589 bp mRNA PRI 07-JAN-1995 DEFINITION Homo sapiens oriP binding protein (OBP-2) mRNA, complete cds. ACCESSION L29096 NID g456697 KEYWORDS DNA-binding protein; oriP binding protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Zhang,S. and Nonoyama,M. TITLE The cellular proteins that bind specifically to the Epstein-Barr virus origin of plasmid DNA replication belong to a gene family JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91 (7), 2843-2847 (1994) MEDLINE 94195838 FEATURES Location/Qualifiers source 1..589 /organism="Homo sapiens" /db_xref="taxon:9606" gene 3..515 /gene="OBP-2" CDS 3..515 /gene="OBP-2" /standard_name="DNA-binding protein" /note="putative" /codon_start=1 /function="specifically bind Epstein-Barr virus origin of plasmid origin DNA replication" /product="oriP binding protein" /db_xref="PID:g456698" /translation="MLAQKAEEKENHCPTMLRPLSHRTVTGAKPLKKAVVMPLQLIQE QAASPNAEIHILKNKGRKRKLESLDALEPEEKAEDCWELQISPELLAHGRQKILDLLN EGSARDLRSLQRIGPKKAQLIVGWRELHGPFSQVEDLERVEGITGKQMESFLKANILG LAAGQRCGAS" polyA_site 589 /gene="OBP-2" /note="putative" BASE COUNT 142 a 167 c 170 g 110 t ORIGIN 1 agatgttggc ccagaaggct gaggaaaagg agaaccattg tcccacaatg ctccggcccc 61 tttcacatcg cacagtcaca ggggcaaagc ccctgaaaaa ggctgtggtg atgcccctac 121 agctaattca ggagcaggca gcatccccaa atgccgagat ccacatcctg aagaataaag 181 gccggaagag aaagctggag tccctggatg ccctagagcc tgaggagaag gctgaggact 241 gctgggagct acagatcagc ccggagctac tggctcatgg gcgccaaaaa atactggatc 301 tgctgaacga aggctcagcc cgagatctcc gcagtcttca gcgcattggc ccgaagaagg 361 cccagctaat cgtgggctgg cgggagctcc acggcccctt cagccaggtg gaggacctgg 421 aacgcgtgga gggcataacg gggaaacaga tggagtcctt cctgaaggca aacatcctgg 481 gtctcgccgc cggccagcgc tgtggcgcct cctgaccgtc gtctcctcac tccgcctttt 541 caaatttttg tataaccccg tgttgtgtaa atacagtttt tgctccggt // LOCUS HUMOCLHMB 1643 bp mRNA PRI 07-JAN-1995 DEFINITION H.sapiens oculorhombin (aniridia) mRNA, complete cds. ACCESSION M77844 NID g189352 KEYWORDS DNA-binding protein; homeobox protein; iris hypoplasia; oculorhombin; paired box; retinal abnormalities; transcription factor. SOURCE Homo sapiens fetal eye cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1643) AUTHORS Ton,C.C.T., Hirvonen,H., Miwa,H., Well,M.M., Monaghan,P., Jordan,T., van Heyningen,V., Hastle,N.D., Meijers-Heijboer,H., Drechsler,M., Royer-Pokora,B., Collins,F.S., Swaroop,A., Strong,L.C. and Saunders,G.F. TITLE Positional cloning and characterization of a paired box- and homeobox-containing gene from the aniridia region JOURNAL Cell 67 (6), 1059-1074 (1991) MEDLINE 92103673 FEATURES Location/Qualifiers source 1..1643 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="eye" /map="11 p13 (D" gene 363..1631 /gene="aniridia" CDS 363..1631 /gene="aniridia" /codon_start=1 /product="oculorhombin" /db_xref="PID:g189353" /translation="MQNSHSGVNQLGGVFVNGRPLPDSTRQKIVELAHSGARPCDISR ILQVSNGCVSKILGRYYETGSIRPRAIGGSKPRVATPEVVSKIAQYKRECPSIFAWEI RDRLLSEGVCTNDNIPSVSSINRVLRNLASEKQQMGADGMYDKLRMLNGQTGSWGTRP GWYPGTSVPGQPTQDGCQQQEGGGENTNSISSNGEDSDEAQMRLQLKRKLQRNRTSFT QEQIEALEKEFERTHYPDVFARERLAAKIDLPEARIQVWFSNRRAKWRREEKLRNQRR QASNTPSHIPISSSFSTSVYQPIPQPTTPVSSFTSGSMLGLTDTALTNTYSALPPMPS FTMANNLPMQPPVPSQTSSYSCMLPTSPSVNGRSYDTYTPPHMQTHMNSQPMGTSGTT STGLISPGVSVPVQVPGSEPDMSQYWPRLQ" CDS 363..1631 /gene="aniridia" /note="alternate start site" /codon_start=1 /product="oculorhombin" /db_xref="PID:g189354" /translation="MQNSHSGVNQLGGVFVNGRPLPDSTRQKIVELAHSGARPCDISR ILQVSNGCVSKILGRYYETGSIRPRAIGGSKPRVATPEVVSKIAQYKRECPSIFAWEI RDRLLSEGVCTNDNIPSVSSINRVLRNLASEKQQMGADGMYDKLRMLNGQTGSWGTRP GWYPGTSVPGQPTQDGCQQQEGGGENTNSISSNGEDSDEAQMRLQLKRKLQRNRTSFT QEQIEALEKEFERTHYPDVFARERLAAKIDLPEARIQVWFSNRRAKWRREEKLRNQRR QASNTPSHIPISSSFSTSVYQPIPQPTTPVSSFTSGSMLGLTDTALTNTYSALPPMPS FTMANNLPMQPPVPSQTSSYSCMLPTSPSVNGRSYDTYTPPHMQTHMNSQPMGTSGTT STGLISPGVSVPVQVPGSEPDMSQYWPRLQ" misc_feature 436..685 /gene="aniridia" /note="paired box" misc_feature 1017..1165 /gene="aniridia" /note="homeobox" BASE COUNT 487 a 444 c 388 g 324 t ORIGIN chromosome 11 p13 (D11S812E). 1 tatcgataag tttttttttt attgtcaatc tctgtctcct tcccaggaat ctgaggattg 61 ctcttacaca ccaacccagc aacatccgtg gagaaaactc tcaccagcaa ctcctttaaa 121 acaccgtcat ttcaaaccat tgtggtcttc aagcaacaac agcagcacaa aaaaccccaa 181 ccaaacaaaa ctcttgacag aagctgtgac aaccagaaag gatgcctcat aaagggggaa 241 gactttaact aggggcgcgc agatgtgtga ggccttttat tgtgagagtg gacagacatc 301 cgagatttca gagccccata ttcgagcccc gtggaatccc gcggccccca gccagagcca 361 gcatgcagaa cagtcacagc ggagtgaatc agctcggtgg tgtctttgtc aacgggcggc 421 cactgccgga ctccacccgg cagaagattg tagagctagc tcacagcggg gcccggccgt 481 gcgacatttc ccgaattctg caggtgtcca acggatgtgt gagtaaaatt ctgggcaggt 541 attacgagac tggctccatc agacccaggg caatcggtgg tagtaaaccg agagtagcga 601 ctccagaagt tgtaagcaaa atagcccagt ataagcggga gtgcccgtcc atctttgctt 661 gggaaatccg agacagatta ctgtccgagg gggtctgtac caacgataac ataccaagcg 721 tgtcatcaat aaacagagtt cttcgcaacc tggctagcga aaagcaacag atgggcgcag 781 acggcatgta tgataaacta aggatgttga acgggcagac cggaagctgg ggcacccgcc 841 ctggttggta tccggggact tcggtgccag ggcaacctac gcaagatggc tgccagcaac 901 aggaaggagg gggagagaat accaactcca tcagttccaa cggagaagat tcagatgagg 961 ctcaaatgcg acttcagctg aagcggaagc tgcaaagaaa tagaacatcc tttacccaag 1021 agcaaattga ggccctggag aaagagtttg agagaaccca ttatccagat gtgtttgccc 1081 gagaaagact agcagccaaa atagatctac ctgaagcaag aatacaggta tggttttcta 1141 atcgaagggc caaatggaga agagaagaaa aactgaggaa tcagagaaga caggccagca 1201 acacacctag tcatattcct atcagcagta gtttcagcac cagtgtctac caaccaattc 1261 cacaacccac cacaccggtt tcctccttca catctggctc catgttgggc ctaacagaca 1321 cagccctcac aaacacctac agcgctctgc cgcctatgcc cagcttcacc atggcaaata 1381 acctgcctat gcaaccccca gtccccagcc agacctcctc atactcctgc atgctgccca 1441 ccagcccttc ggtgaatggg cggagttatg atacctacac ccccccacat atgcagacac 1501 acatgaacag tcagccaatg ggcacctcgg gcaccacttc aacaggactc atttcccctg 1561 gtgtgtcagt tccagttcaa gttcccggaa gtgaacctga tatgtctcaa tactggccaa 1621 gattacagta aaaaaaaaaa aaa // LOCUS HUMOCRL 4353 bp mRNA PRI 07-JAN-1995 DEFINITION Human Lowe oculocerebrorenal syndrome (OCRL) mRNA, complete cds. ACCESSION M88162 M86675 NID g189355 KEYWORDS Lowe oculocerebrorenal syndrome; inositol polyphosphate 5-phosphatase. SOURCE Homo sapiens kidney cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4353) AUTHORS Attree,O., Olivos,I.M., Okabe,I., Bailey,L.C., Nelson,D.L., Lewis,R.A., McInnes,R.R. and Nussbaum,R.L. TITLE The Lowe's oculocerebrorenal syndrome gene encodes a protein highly homologous to inositol polyphosphate-5-phosphatase JOURNAL Nature 358 (6383), 239-242 (1992) MEDLINE 92334430 FEATURES Location/Qualifiers source 1..4353 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" /map="Xq25-q26.1" gene 16..2922 /gene="OCRL" CDS 16..2922 /gene="OCRL" /note="highly homologous to inositol polyphosphate-5-phophatase" /codon_start=1 /db_xref="GDB:G00-119-461" /db_xref="PID:g189356" /translation="MKFFVFKSFLSDCYRSLLDKSQLPAPRSRLPAPGARRGAVPQTT RSRGGWVWGRGSQCRRIGPQSAVLLSPEAAWMEPPLPVGAQPLATVEGMEMKGPLREP CALTLAQRNGQYELIIQLHEKEQHVQDIIPINSHFRCVQEAEETLLIDIASNSGCKIR VQGDWIRERRFEIPDEEHCLKFLSAVLAAQKAQSQLLVPEQKDSSSWYQKLDTKDKPS VFSGLLGFEDNFSSMNLDKKINSQNQPTGIHREPPPPPFSVNKMLPREKEASNKEQPK VTNTMRKLFVPNTQSGQREGLIKHILAKREKEYVNIQTFRFFVGTWNVNGQSPDSGLE PWLNCDPNPPDIYCIGFQELDLSTEAFFYFESVKEQEWSMAVERGLHSKAKYKKVQLV RLVGMMLLIFARKDQCRYIRDIATETVGTGIMGKMGNKGGVAVRFVFHNTTFCIVNSH LAAHVEDFERRNQDYKDICARMSFVVPNQTLPQLNIMKHEVVIWLGDLNYRLCMPDAN EVKSLINKKDLQRLLKFDQLNIQRTQKKAFVDFNEGEIKFIPTYKYDSKTDRWDSSGK CRVPAWCDRILWRGTNVNQLNYRSHMELKTSDHKPVSALFHIGVKVVDERRYRKVFED SVRIMDRMENDFLPSLELSRREFVFENVKFRQLQKGKFQISNNGQVPCHFSFIPKLND SQYCKPWLRAEPFEGYLEPNETVDISLDVYVSKDSVTILNSGEDKIEDILVLHLDRGK DYFLTISGNYLPSCFGTSLEALCRMKRPIREVPVTKLIDLEKSLLQMVPLDEGASERP LQVPKEIWLLVDHLFKYACHQEDLFQTPGMQEELQQIIDCLDTSIPETIPGSNHSVAE ALLIFLEALPEPVICYELYQRCLDSAYDPRICRQVISQLPRCHRNVFRYLMAFLRELL KFSEYNSVNANMIATLFTSLLLRPPPNLMARQTPSDRQRAIQFLLGFLLGSEED" BASE COUNT 1158 a 1035 c 970 g 1190 t ORIGIN 1 tcagcctgaa catggatgaa gttttttgtt tttaagagct tccttagtga ttgttatagg 61 agcctcctag acaagtctca gctcccagct ccccgctccc ggctcccggc gcccggcgcc 121 cggcgcggag ctgttcctca aacgacacgc agccgaggtg ggtgggtgtg gggacgcggg 181 agccagtgtc gtcggatcgg cccgcagtcc gctgtcctgc tgagcccgga ggccgcctgg 241 atggagccgc cgctcccggt cggagcccag ccgcttgcca ctgtcgaggg tatggagatg 301 aagggtcctc tccgggagcc ctgcgccctg accctagccc agaggaacgg gcaatatgag 361 ttaataatcc agttgcatga gaaggaacag catgttcaag atatcattcc tataaatagc 421 cacttcagat gtgttcaaga agcagaagaa actcttttga ttgacatagc ttctaacagt 481 ggctgcaaaa ttcgggttca gggggactgg atcagagagc gccgctttga aatccctgat 541 gaggaacact gtttgaagtt cctctcagct gtccttgctg ctcagaaagc tcagtcacag 601 cttcttgttc cagagcaaaa ggactcatct agctggtacc agaaattaga cactaaggac 661 aaaccttctg ttttttcagg gcttcttgga tttgaagaca atttttcttc tatgaatttg 721 gacaagaaaa taaattcaca aaatcagcct actgggattc atcgggaacc cccacctcca 781 cccttttcag tgaataaaat gcttccacgt gaaaaagaag cttctaacaa ggagcagccc 841 aaagtgacca acaccatgcg gaagctcttt gtaccaaata cccaatctgg gcagcgggag 901 ggtctcatca aacatatcct ggcaaagcga gagaaagaat atgtcaacat tcagactttc 961 agattttttg ttggaacttg gaatgtgaat ggccagtctc cagatagcgg gttagaacct 1021 tggctgaact gtgatcccaa tcctcctgat atctactgca ttggattcca agaactggac 1081 ttgagcacag aagccttctt ctactttgaa tctgtgaagg aacaagaatg gtccatggct 1141 gtagagagag gtttgcattc caaagccaag tataagaaag ttcaactggt gcgccttgtt 1201 gggatgatgc ttcttatatt tgccagaaag gatcagtgtc gatacattcg tgatattgct 1261 acagaaacag ttggaactgg aatcatgggg aaaatgggaa acaaaggtgg ggtagctgtg 1321 agatttgtat ttcacaacac caccttttgc attgtcaatt cccatctggc tgcacacgtg 1381 gaggactttg agagaaggaa tcaagattat aaggacattt gtgcgagaat gagttttgtg 1441 gtcccaaatc agaccctccc gcagttgaac atcatgaaac atgaggttgt catttggttg 1501 ggagatttga attatagact ttgcatgcct gatgccaatg aggtgaaaag tcttattaat 1561 aagaaagacc ttcagagact cttgaaattc gaccagctaa atattcagcg cacacagaaa 1621 aaagcttttg ttgacttcaa tgaaggggaa atcaagttca tccccactta taagtatgac 1681 tctaaaacag accggtggga ttccagtggg aaatgccggg ttccagcctg gtgtgaccga 1741 attctttgga gaggaacaaa tgttaatcag cttaattatc ggagtcacat ggaactgaaa 1801 accagcgacc acaagcctgt tagcgccctc ttccatattg gggtgaaggt tgtggatgaa 1861 cgaaggtacc ggaaagtctt tgaagatagt gtacgcatca tggacagaat ggaaaatgac 1921 ttccttcctt ccttagaact cagcaggagg gagtttgtgt ttgaaaatgt gaagtttcgg 1981 caactacaaa aggggaagtt ccagatcagc aacaatggac aggttccctg ccatttttct 2041 ttcatcccta aacttaatga cagccagtac tgcaagccat ggcttcgggc tgaacctttt 2101 gagggctact tggagccaaa tgagacagtg gacatttctc ttgatgtgta tgtcagcaaa 2161 gactctgtaa ccatcctgaa ctcgggagaa gataagattg aagatattct cgtccttcac 2221 ctggatcgag gcaaagatta cttcttgact atcagtggaa attacctccc aagttgtttt 2281 ggcacatcct tagaggctct gtgccgtatg aaaagaccaa tccgagaagt tcctgttacc 2341 aaactcatag acttggagaa atcccttctg caaatggttc ctttggatga aggtgccagt 2401 gagagacccc ttcaggttcc caaggagatc tggcttctag tagatcacct attcaaatac 2461 gcctgtcacc aggaggacct gttccagacc cctggaatgc aggaagagct ccagcagatc 2521 attgattgtc tggataccag cattcctgag acaatccctg gcagcaacca ctctgtggct 2581 gaagcactgc tcattttctt ggaagccctg ccagagccag tcatctgtta cgagctgtat 2641 cagcgatgtc ttgactctgc ttatgatccc cggatctgcc gacaggtgat ctcccagctt 2701 ccgagatgcc atagaaatgt tttccgttac ttgatggcat tccttcgaga actcttaaaa 2761 ttctctgaat acaatagcgt caatgccaac atgatcgcta ctctcttcac tagtcttctc 2821 ctgaggcctc cacccaacct tatggcaaga cagactccaa gtgaccgcca gcgtgctatt 2881 cagttccttc tgggctttct gcttgggagc gaagaagact aaggctttta ctgttctctg 2941 atattctaga agcagacgat ctcgggctcc aagtatttca gaatgattta aaaagtcatg 3001 ccacaggaag ggtctattgc agaatttcaa gttctgttta tagtaaaaag gaagagcgtt 3061 tcctaatccc tcctttacca tatcctacac agaaaaatac ttttagactt atattgccaa 3121 gccaaagtta ccatattttg gtgtttttgt gttttctctt tataaggcaa aaagatctgt 3181 atttacactc cttcacctaa ggatgtgttt gttgccctcc tacccaattg tcatgattgt 3241 ccttagtacc ctaggcctag attctgagat cttcccattc taggcctaca agcactactt 3301 gctgtagctg agacttgtct agagtccttt gttttgcact tttgacccac cccttcctgg 3361 atcactcctt tgcactccac tcccggaatt cctgtcactt tgaacgaagt ctgagtgagg 3421 ctagtgactc cttgggtgtc ctcaacagtg aattcactgt ctgcgtgcag ttattacatg 3481 catttgtgca ttctactaca atggcatctt tatgtctctg taacattggc cttttcatgg 3541 ctccacactg ggtggaagga tattctctta gatcacattt agtagcataa ctgtagggac 3601 tattagagat ggcatctcat cgatgagaga gaatcacaat cagaatggaa gcactttgag 3661 tatctgaaga gtgagagcat tcatgtttga caggtcctgc ttcccactat ccttttcctg 3721 ttattattca aattttacac aaggactaat cctgggtgtc tctgagaccc atctcctgcc 3781 tagacatcca cctccagagc aacactggcc ccacagtaaa agaggaagtc ttgtacctca 3841 ggcaggccca tctagagcta ttgctccttc ccacagcaaa ggtattgtgg atgaccctta 3901 gaatccattc tctggtcttc tgaaatacca agggcagatg tcacctcctt cctcagcagg 3961 actgactctg ggctctacaa ccagctcctt cacataaagg gtttagagac tccccttggc 4021 tcccagtcac catatccagt gttgtgtaaa gagactggcc aacaggacca accaagcacc 4081 ttacctctcc catacaagat gaccctctga gcttttcatt tattcaagct ctgtggtaca 4141 gccttttttt aaaataaatt aatctatatt ggttgacaaa caagccacca accactgact 4201 gcaaaactgc ctgatgcagt tgggttcctc ctggttttct tttgttacaa ccacccttgc 4261 ctgtttacat taattgcaag gagcataacg tacaggctgt atgtacaatc ctgggcattg 4321 actctgtgac atttctagca tatccaaggc acc // LOCUS HUMOGC 891 bp mRNA PRI 07-MAR-1995 DEFINITION Human unknown protein from clone pHGR74 mRNA, complete cds. ACCESSION M38188 X56942 NID g189378 KEYWORDS . SOURCE Human ovarian granulosa cell line, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 891) AUTHORS Rapp,G., Freudenstein,J., Klaudiny,J., Mucha,J., Wempe,F., Zimmer,M. and Scheit,K.H. TITLE Characterization of three abundant mRNAs from human ovarian granulosa cells JOURNAL DNA Cell Biol. 9 (7), 479-485 (1990) MEDLINE 91025550 COMMENT Draft entry and computer-readable sequence for [DNA 9, 479-485 (1990)] kindly submitted by K.H.Scheit, 27-AUG-1990. FEATURES Location/Qualifiers source 1..891 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pHGR74" /cell_type="granulosa" /tissue_type="ovary" mRNA <1..891 /note="protein of unknown function" CDS 312..647 /note="protein of unknown function" /codon_start=1 /db_xref="PID:g189379" /translation="MANIHQENEEMEQPMQNGEEDRPLGGGEGHQPAGNRRGQARRLA PNFRWAIPNRQINDGMGGDGDDMEIFMEEMREIRRKLRELQLRNCLRILMGELSNHHD HHDEFCLMP" CDS 361..534 /note="protein of unknown function" /codon_start=1 /db_xref="PID:g189380" /translation="MERKTALWEEVKATSLQEIDGDRLADLPLIFDGPYPIGRSMMGW VEMEMIWKYSWRR" BASE COUNT 251 a 182 c 224 g 234 t ORIGIN 1 accccatccc ccactcctat accggtcctc cattttggtg cctgcaaagc tctgggaaag 61 aatcccggga aacgaaaaat ggtgggtttg ggggaaggga ggtaagggga gaaagctgga 121 gggaggggct ttaattggag gccccgtaga ggacgcgcgg aacttctaag gtgggaaaaa 181 acgaaattaa aaaatccttt gatatcaggg ctctgaatcc tgctggtcag agcaccaagc 241 attcagtctc tctccttgcc tttgtcttac ttgtgttcaa agaaaaacaa ccagaaaaaa 301 aaaatctcat catggcaaat attcaccagg aaaacgaaga gatggagcag cctatgcaga 361 atggagagga agaccgccct ttgggaggag gtgaaggcca ccagcctgca ggaaatcgac 421 ggggacaggc tcgccgactt gcccctaatt ttcgatgggc catacccaat aggcagatca 481 atgatgggat gggtggagat ggagatgata tggaaatatt catggaggag atgagagaaa 541 tcagaagaaa acttagggag ctgcagttga ggaattgtct gcgtatcctt atgggggagc 601 tctctaatca ccatgaccat catgatgaat tttgccttat gccttgactc ctgccattta 661 tcatgagatt aatactgtga ttcccgctgt tttctttttc cttgcatttt cctaatatgc 721 ctttactgat ccgtttgctg tgaaccctat gttatttcca tgtgtcaagt gggtcttgtg 781 ttgccagctt ctatttgaag attgcctttg cactcagtgt aagtttctgt cagcagtagt 841 ttcacccatt tgcatggaaa aatttaaagc taataaagca atttaaaaag c // LOCUS HUMOP2A 1842 bp mRNA PRI 28-MAY-1996 DEFINITION Homo sapiens osteogenic protein-2 (OP-2) mRNA, complete cds. ACCESSION M97016 NID g189389 KEYWORDS TGF beta-like protein; osteogenic protein-2. SOURCE Homo sapiens embryo hippocampus cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1842) AUTHORS Ozkaynak,E., Schnegelsberg,P.N., Jin,D.F., Clifford,G.M., Warren,F.D., Drier,E.A. and Oppermann,H. TITLE Osteogenic protein-2. A new member of the transforming growth factor-beta superfamily expressed early in embryogenesis JOURNAL J. Biol. Chem. 267 (35), 25220-25227 (1992) MEDLINE 93094231 FEATURES Location/Qualifiers source 1..1842 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="embryo" /tissue_type="hippocampus" sig_peptide 497..553 /gene="OP-2" CDS 497..1705 /gene="OP-2" /standard_name="pre-pro peptide" /codon_start=1 /product="osteogenic protein-2" /db_xref="PID:g189390" /translation="MTALPGPLWLLGLALCALGGGGPGLRPPPGCPQRRLGARERRDV QREILAVLGLPGRPRPRAPPAASRLPASAPLFMLDLYHAMAGDDDEDGAPAERRLGRA DLVMSFVNMVERDRALGHQEPHWKEFRFDLTQIPAGEAVTAAEFRIYKVPSIHLLNRT LHVSMFQVVQEQSNRESDLFFLDLQTLRAGDEGWLVLDVTAASDCWLLKRHKDLGLRL YVETEDGHSVDPGLAGLLGQRAPRSQQPFVVTFFRASPSPIRTPRAVRPLRRRQPKKS NELPQANRLPGIFDDVHGSHGRQVCRRHELYVSFQDLGWLDWVIAPQGYSAYYCEGEC SFPLDSCMNATNHAILQSLVHLMKPNAVPKACCAPTKLSATSVLYYDSSNNVILRKHR NMVVKACGCH" gene 497..1705 /gene="OP-2" mat_peptide 1286..1702 /gene="OP-2" /note="putative" /product="osteogenic protein-2" BASE COUNT 290 a 664 c 605 g 283 t ORIGIN 1 ccacagtggc gccggcagag caggagtggc tggaggagct gtggttggag caggaggtgg 61 cacggcaggg ctggagggct ccctatgagt ggcggagacg gcccaggagg cgctggagca 121 acagctccca caccgcacca agcggtggct gcaggagctc gcccatcgcc cctgcgctgc 181 tcggaccgcg gccacagccg gactggcggg tacggcggcg acagacggat tggccgagag 241 tcccagtccg cagagtagcc ccggcctcga ggcggtggcg tcccggtcct ctccgtccag 301 gagccaggac aggtgtcgcg cggcggggct ccagggaccg cgcctgaggc cggctgcccg 361 cccgtcccgc cccgccccgc cgcccgccgc ccgccgagcc cagcctcctt gccgtcgggg 421 cgtccccagg ccctgggtcg gccgcggagc cgatgcgcgc ccgctgagcg ccccagctga 481 gcgcccccgg cctgccatga ccgcgctccc cggcccgctc tggctcctgg gcctggcgct 541 atgcgcgctg ggcgggggcg gccccggcct gcgacccccg cccggctgtc cccagcgacg 601 tctgggcgcg cgcgagcgcc gggacgtgca gcgcgagatc ctggcggtgc tcgggctgcc 661 tgggcggccc cggccccgcg cgccacccgc cgcctcccgg ctgcccgcgt ccgcgccgct 721 cttcatgctg gacctgtacc acgccatggc cggcgacgac gacgaggacg gcgcgcccgc 781 ggagcggcgc ctgggccgcg ccgacctggt catgagcttc gttaacatgg tggagcgaga 841 ccgtgccctg ggccaccagg agccccattg gaaggagttc cgctttgacc tgacccagat 901 cccggctggg gaggcggtca cagctgcgga gttccggatt tacaaggtgc ccagcatcca 961 cctgctcaac aggaccctcc acgtcagcat gttccaggtg gtccaggagc agtccaacag 1021 ggagtctgac ttgttctttt tggatcttca gacgctccga gctggagacg agggctggct 1081 ggtgctggat gtcacagcag ccagtgactg ctggttgctg aagcgtcaca aggacctggg 1141 actccgcctc tatgtggaga ctgaggacgg gcacagcgtg gatcctggcc tggccggcct 1201 gctgggtcaa cgggccccac gctcccaaca gcctttcgtg gtcactttct tcagggccag 1261 tccgagtccc atccgcaccc ctcgggcagt gaggccactg aggaggaggc agccgaagaa 1321 aagcaacgag ctgccgcagg ccaaccgact cccagggatc tttgatgacg tccacggctc 1381 ccacggccgg caggtctgcc gtcggcacga gctctacgtc agcttccagg acctcggctg 1441 gctggactgg gtcatcgctc cccaaggcta ctcggcctat tactgtgagg gggagtgctc 1501 cttcccactg gactcctgca tgaatgccac caaccacgcc atcctgcagt ccctggtgca 1561 cctgatgaag ccaaacgcag tccccaaggc gtgctgtgca cccaccaagc tgagcgccac 1621 ctctgtgctc tactatgaca gcagcaacaa cgtcatcctg cgcaagcacc gcaacatggt 1681 ggtcaaggcc tgcggctgcc actgagtcag cccgcccagc cctactgcag ccacccttct 1741 catctggatc gggccctgca gaggcagaaa acccttaaat gctgtcacag ctcaagcagg 1801 agtgtcaggg gccctcactc tctgtgccta cttcctgtca gg // LOCUS HUMORF 1121 bp mRNA PRI 03-JUN-1991 DEFINITION Human ORF mRNA, complete cds. ACCESSION M68864 NID g189396 KEYWORDS . SOURCE Human T-lymphocyte MOLT-4 cell line, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1121) AUTHORS Pollard,K.M. JOURNAL Unpublished (1991) FEATURES Location/Qualifiers source 1..1121 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="MOLT-4" /cell_type="T-lymphocyte" CDS 136..1032 /note="ORF" /codon_start=1 /db_xref="PID:g189397" /translation="MAELTALESLIEMGFPRGRAEKALALTGNQGIEAAMDWLMEHED DPDVDEPLETPLGHILGREPTSSEQGGLEGSASAAGEGKPALSEEERQEQTKRMLELV AQKQREREEREEREALERERQRRRQGQELSAARQRLQEDEMRRAAAEERRRENAEELA ARQRVREKIERDKAERAKKYGGSVGSQPPPVAPEPGPVPSSPSQEPPTKREYDQCRIQ VRLPDGTSLTQTFRAREQLAAVRLYVELHRGEELGGGQDPVQLLSGFPRRAFSEADME RPLQELGLVPSAVLIVAKKCPS" polyA_signal 1099..1104 /note="ORF" polyA_site 1121 /note="ORF" BASE COUNT 254 a 303 c 371 g 193 t ORIGIN 1 ggaattccct atagagccgg gtgagagagc gagcgcccgt cggcgggtgt cgagggcggg 61 ttgcctcgcg ctgacccttc ccgccctcct tctcgtcaca caccaggtcc ccgcggaagc 121 cgcggtgtcg gcgccatggc ggagctgacg gctcttgaga gtctcatcga gatgggcttc 181 cccaggggac gcgcggagaa ggctctggcc ctcacaggga accagggcat cgaggctgcg 241 atggactggc tgatggagca cgaagacgac cccgatgtgg acgagccttt agagactccc 301 cttggacata tcctgggacg ggagcccact tcctcagagc aaggcggcct tgaaggatct 361 gcttctgctg ccggagaagg caaacccgct ttgagtgaag aggaaagaca ggaacaaact 421 aagaggatgt tggagctggt ggcccagaag cagcgggagc gtgaagaaag agaggaacgg 481 gaggcattgg aacgggaacg gcagcgcagg agacaagggc aagagttgtc agcagcacga 541 cagcggctac aggaagatga gatgcgccgg gctgctgctg aggagaggcg gagggaaaat 601 gccgaggagt tagcagccag acaaagagtt agagaaaaga tcgagaggga caaagcagag 661 agagccaaga agtatggtgg cagtgtgggc tctcagccac ccccagtggc accagagcca 721 ggtcctgttc cctcttctcc cagccaggag cctcccacca agcgggagta tgaccagtgt 781 cgcatacagg tcaggctgcc agatgggacc tcactgaccc agacgttccg ggcccgggaa 841 cagctggcag ctgtgaggct ctatgtggag ctccaccgtg gggaggaact aggtgggggc 901 caggaccctg tgcaattgct cagtggcttc cccagacggg ccttctcaga agctgacatg 961 gagcggcctc tgcaggagct gggactcgtg ccttctgctg ttctcattgt ggccaagaaa 1021 tgtcccagct gagggccttt gtcccattgt ccctctgtga ccccttcatc tttgataaag 1081 cactgacatc tccttcctaa taaatagacc ctgagttctg t // LOCUS HUMORF003 5253 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0076 gene, complete cds. ACCESSION D38548 NID g559706 KEYWORDS KIAA0076. SOURCE Homo sapiens male myeloblast cell-line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5253) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (17-OCT-1994) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 5253) AUTHORS Nomura,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nomura,N., Nagase,T., Miyajima,N., Sazuka,T., Tanaka,A., Sato,S., Seki,N., Kawarabayasi,Y., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. II. The coding sequences of 40 new genes (KIAA0041-KIAA0080) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 1 (5), 223-229 (1994) MEDLINE 96051398 FEATURES Location/Qualifiers source 1..5253 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..86 gene 87..5183 /gene="KIAA0076" CDS 87..5183 /gene="KIAA0076" /note="The ha0936 gene product is novel." /citation=[3] /codon_start=1 /db_xref="PID:d1008132" /db_xref="PID:g559707" /translation="MVGELRYREFRVPLGPGLHAYPDELIRQRVGHDGHPEYQIRWLI LRRGDEGDGGSGQVDCKAEHILLWMSKDEIYANCHKMLGEDGQVIGPSQESAGEVGAL DKSVLEEMETDVKSLIQRALRQLEECVGTIPPAPLLHTVHVLSAYASIEPLTGVFKDP RVLDLLMHMLSSPDYQIRWSAGRMIQALSSHDAGTRTQILLSLSQQEAIEKHLDFDSR CALLALFAQATLSEHPMSFEGIQLPQVPGRVLFSLVKRYLHVTSLLDQLNDSAAEPGA QNTSAPEELSGERGQLELEFSMAMGTLISELVQAMRWDQASDRPRSSARSPGSIFQPQ LADVSPGLPAAQAQPSFRRSRRFRPRSEFASGNTYALYVRDTLQPGMRVRMLDDYEEI SAGDEGEFRQSNNGVPPVQVFWESTGRTYWVHWHMLEILGFEEDIEDMVEADEYQGAV ASRVLGRALPAWRWRPMTELYAVPYVLPEDEDTEECEHLTLAEWWELLFFIKKLDGPD HQEVLQILQENLDGEILDDEILAELAVPIELAQDLLLTLPQRLNDSALRDLINCHVYK KYGPEALAGNQAYPSLLEAQEDVLLLDAQAQAKDSEDAAKVEAKEPPSQSPNTPLQRL VEGYGPAGKILLDLEQALSSEGTQENKVKPLLLQLQRQPQPFLALMQSLDTPETNRTL HLTVLRILKQLVDFPEALLLPWHEAVDACMACLRSPNTDREVLQELIFFLHRLTSVSR DYAVVLNQLGARDAISKALEKHLGKLELAQELRDMVFKCEKHAHLYRKLITNILGGCI QMVLGQIEDHRRTHRPINIPFFDVFLRYLCQGSSVEVKEDKCWEKVEVSSNPHRASKL TDHNPKTYWESNGSAGSHYITLHMRRGILIRQLTLLVASEDSSYMPARVVVCGGDSTS SLHTELNSVNVMPSASRVILLENLTRFWPIIQIRIKRCQQGGIDTRIRGLEILGPKPT FWPVFREQLCRHTRLFYMVRAQAWSQDMAEDRRSLLHLSSRLNGALRQEQNFADRFLP DDEAAQALGKTCWEALVSPVVQNITSPDEDGISPLGWLLDQYLECQEAVFNPQSRGPA FFSRVRRLTHLLVHVEPCEAPPPVVATPRPKGRNRSHDWSSLATRGLPSSIMRNLTRC WRAVVEKQVNNFLTSSWRDDDFVPRYCEHFNILQNSSSELFGPRAAFLLALQNGCAGA LLKLPFLKAAHVSEQFARHIDQQIQGSRIGGAQEMERLAQLQQCLQAVLIFSGLEIAT TFEHYYQHYMADRLLGVVSSWLEGAVLEQIGPCFPNRLPQQMLQSLSTSKELQRQFHV YQLQQLDQELLKLEDTEKKIQVGLGASGKEHKSEKEEEAGAAAVVDVAEGEEEEEENE DLYYEGAMPEVSVLVLSRHSWPVASICHTLNPRTCLPSYLRGTLNRYSNFYNKSQSHP ALERGSQRRLQWTWLGWAELQFGNQTLHVSTVQMWLLLYLNDLKAVSVESLLAFSGLS ADMLNQAIGPLTSSRGPLDLHEQKDIPGGVLKIRDGSKEPRSRWDIVRLIPPQTYLQA EGEDGQNLEKRRNLLNCLIVRILKAHGDEGLHIDQLVCLVLEAWQKGPCPPRGLVSSL GKGSACSSTDVLSCILHLLGKGTLRRHDDRPQVLSYAVPVTVMEPHTESLNPGSSGPN PPLTFHTLQIRSRGVPYASCTATQSFSTFR" 3'UTR 5184..5253 BASE COUNT 1109 a 1531 c 1564 g 1049 t ORIGIN 1 gtcgccgcca gcgtctgtgc cgcgtccctt gctctgtgaa ggacaggcct cgcgccagga 61 ccccggtgga cttctgaggt gccaggatgg tgggagaact ccgctacagg gaattcaggg 121 tgcccctggg gcccggctta catgcctatc ctgatgagct gatccgccag cgcgtgggcc 181 atgatgggca tcctgagtac cagatccgtt ggctcatcct gcggcgtggc gatgaggggg 241 acgggggctc tggccaagtg gactgcaagg ctgagcacat cctgctgtgg atgtccaagg 301 atgagatcta tgccaactgc cacaagatgc tgggcgagga tggccaggtc atcgggccct 361 cccaggagtc tgcaggggag gttggggccc tggacaaatc tgtgctggag gagatggaaa 421 ccgatgtgaa gtccctcatt cagagagccc ttcggcagct ggaggagtgt gtgggcacta 481 tccctcctgc tcctctactt cacactgtcc acgtgctcag cgcctatgcc agcattgagc 541 ccctcactgg agtattcaag gacccaaggg tcctggactt gctcatgcac atgttgagta 601 gtcctgatta tcagattcgc tggagtgcag gccggatgat acaagccctg tcctcccatg 661 acgctgggac ccggactcag atccttctgt cactgagcca acaagaagcc attgagaaac 721 acctggattt tgacagccgc tgtgctctgc tagcactgtt tgcacaggcc acgctctctg 781 aacaccccat gtctttcgag ggcattcagc taccacaggt cccaggaagg gtgctcttct 841 ccctggtgaa gcggtatttg catgtcacct cgctcctgga tcagctgaac gacagtgctg 901 cggagccagg agcccagaac acctctgctc ctgaggagtt gagtggggag aggggtcaac 961 tggagctgga gttcagtatg gccatgggca ccctgatctc ggagctggtg caagccatgc 1021 gctgggacca ggcctcagac agaccaagga gctcagcacg gtcccccggt tccatcttcc 1081 agcctcagct ggcagatgtg agcccagggc tccccgctgc ccaggctcag ccctccttca 1141 ggaggtcaag acgttttcgc cctcgttctg agttcgcaag tggcaatacc tatgctttgt 1201 atgtgcggga cacactgcag ccggggatgc gagtgcggat gctggatgat tatgaggaga 1261 tcagtgccgg ggatgagggc gagtttcggc agagcaacaa cggtgtgcct cctgtgcagg 1321 tattttggga gtcaacaggc cgcacctatt gggtgcactg gcacatgctg gagatcttgg 1381 gctttgagga agacattgag gacatggttg aggctgatga gtaccaaggg gcagtggcca 1441 gtagagtcct gggtagagcc ctgcctgcct ggcgctggag gcccatgaca gaactctatg 1501 ctgtgcctta tgtgctgcct gaggatgagg acactgagga gtgtgaacac ctgaccctgg 1561 ctgagtggtg ggaactcctc ttcttcatca agaagctgga tggacctgac catcaggagg 1621 ttctccagat cctccaggag aacctagatg gggagattct ggatgatgag atcctagctg 1681 aactggccgt gcccatagaa ttggcccagg acttgctgct gactctgcca cagcgactca 1741 atgacagtgc cctcagggac ctgatcaact gccatgtcta caagaagtat gggcctgaag 1801 ccctagcagg gaaccaagcc tacccatccc ttctagaagc ccaagaagat gtcctcctgc 1861 tagacgcgca ggcccaggct aaggactcag aagatgcagc caaagtggaa gcaaaagaac 1921 ccccatctca gagtcccaac actcccctgc agcgtctggt ggagggttat ggtccagctg 1981 ggaaaatcct cctggatcta gagcaagccc tcagctcaga ggggacccag gagaacaagg 2041 tcaagccact cctgctgcag ctgcagcggc agccgcagcc cttcctggca ctgatgcaga 2101 gcctggacac tccggagact aacaggaccc tgcacctgac tgtgctgaga atcctgaagc 2161 agctggtgga cttccccgag gcactgctgc tcccctggca cgaggccgtg gatgcctgca 2221 tggcctgcct gcggtcccca aacactgatc gagaggtgct ccaggaactg attttcttcc 2281 tgcaccgcct gacctcagtg agcagggact atgccgtggt gctgaatcag ctgggagcaa 2341 gagacgctat ctccaaggcc ctggaaaagc acctgggaaa gctggagctg gctcaggagc 2401 tgcgggacat ggtgttcaag tgtgagaagc atgcccacct ctaccgcaaa ctcatcacca 2461 acatcctggg aggctgcatc cagatggtgc tgggccagat cgaagaccac agacgaaccc 2521 accggcccat caacatccct ttctttgatg tgttcctcag atacctgtgc cagggctcca 2581 gtgtggaagt gaaggaggac aagtgctggg agaaggtgga ggtgtcctcc aacccgcacc 2641 gggccagcaa gctgacggac cacaacccca agacctattg ggagtccaac ggcagcgccg 2701 gctcccacta catcaccctg cacatgcgcc ggggcatcct catcaggcaa ctgactctgc 2761 ttgtggctag tgaggactcg agttacatgc cggcccgagt ggtggtgtgc gggggtgata 2821 gcactagctc tcttcacacg gaactcaact cggtgaatgt gatgccctct gccagccggg 2881 tgatcctcct ggagaacctg acccgcttct ggcccatcat ccagatccgc ataaagcgct 2941 gccagcaggg tggcattgat acgcgcattc gggggttaga gatcctaggc cccaagccca 3001 cgttctggcc agtgttccgg gagcagctct gtcgtcacac acgcctcttc tacatggttc 3061 gggcacaggc ctggagccag gacatggcag aggaccgcag gagcctcctg cacctgagtt 3121 ctagactcaa cggtgctctg cgccaggagc agaattttgc tgaccgcttc ctccctgatg 3181 acgaggctgc ccaagctctg ggcaagacct gctgggaggc cctggtcagc cccgtggtgc 3241 agaacatcac ctcccctgat gaggatggca ttagccccct gggttggctg ctggaccagt 3301 acctggagtg tcaggaagct gtcttcaacc cccagagccg cggcccagct ttcttctcgc 3361 gggtgcgccg tctcactcac ctgctggtgc atgtcgagcc ctgtgaggca ccccctcctg 3421 tggtggccac tcctcggccc aaaggcagaa acagaagcca cgactggagc tccttggcta 3481 cccggggcct tccaagcagc atcatgagaa acctgacgcg ctgttggcgg gccgtggtgg 3541 agaagcaggt gaacaatttt ctgacctcat cctggcggga tgatgacttt gtgccacgct 3601 actgtgagca ctttaatatt ctgcagaact caagctctga actgtttggg cctcgggcag 3661 ccttcttgct ggcgctgcaa aatggctgtg cgggagcctt gctgaagctc ccttttctca 3721 aagctgccca cgtgagtgag cagttcgccc ggcacattga ccagcagatc cagggcagcc 3781 ggatcggtgg agcccaggaa atggagaggc tggcacagct gcagcaatgc ctgcaagctg 3841 tcctgatttt ctccggcttg gagatagcca ccacttttga gcattattac cagcactaca 3901 tggcggaccg tctcctgggc gtggtctcga gctggctgga gggggccgtg ctggagcaga 3961 tcggtccctg cttccccaac cgcctccccc agcagatgtt gcagagcctg agcacctcta 4021 aggagctgca gcgccagttc cacgtctacc agctccagca gctggatcag gaactcctaa 4081 agctggagga tacagagaag aaaatacagg tgggccttgg ggccagtggc aaggagcaca 4141 agagcgagaa ggaagaggaa gctggggcag cagcagtggt ggatgtggcg gagggagagg 4201 aggaagagga ggagaatgag gacctctact atgaaggggc aatgccagaa gtgtctgtgc 4261 ttgtcctgtc ccgacactcc tggcctgttg cctcaatctg ccacacactg aaccccagaa 4321 cctgcctgcc ctcctacctg aggggcactt tgaacagata ctccaacttc tacaacaaga 4381 gtcagagcca ccctgccctt gagcgaggct cacagaggcg actgcagtgg acgtggctgg 4441 gctgggctga gctgcagttt gggaaccaga ccctgcatgt gtccaccgtg cagatgtggc 4501 tactgctgta tctcaacgac ctgaaggcgg tctctgtgga gagtctgctg gcgttctcag 4561 ggctctccgc agacatgctc aatcaggcga ttgggcccct cacctcttca agaggccccc 4621 tggaccttca cgagcaaaag gatataccag gaggggtcct caagattcga gatggcagca 4681 aggaacccag gtcgagatgg gacattgtgc ggctcatccc acctcagacg tacctgcaag 4741 ctgagggtga agacggccag aacttggaga agagacggaa tcttctgaac tgcctcatcg 4801 tccgaatcct caaggcccat ggagatgagg ggctgcacat tgaccagctt gtctgtctgg 4861 tgctggaggc ttggcagaag ggcccgtgtc ctcccagggg tttggtcagc agccttggta 4921 aggggtctgc atgcagcagc actgacgtcc tctcctgcat cctacacctc ctgggcaagg 4981 gcacgctgag acgccatgac gaccggcccc aggtgctgtc ctatgcagtc cctgtgactg 5041 tcatggagcc tcacactgag tccctgaacc caggctcctc aggccccaac ccacccctca 5101 ccttccatac cctacagatt cgctcccggg gtgtgcccta tgcctcctgc actgccaccc 5161 agagcttctc taccttccgg tagccctaga cttggggtca ggggaaggta gagctggagc 5221 ttttacagaa attaaaccca agagtttgat tat // LOCUS HUMORF005 3647 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0078 gene, complete cds. ACCESSION D38551 NID g1531549 KEYWORDS KIAA0078. SOURCE Homo sapiens male myeloblast cell-line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3647) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (17-OCT-1994) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 3647) AUTHORS Nomura,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nomura,N., Nagase,T., Miyajima,N., Sazuka,T., Tanaka,A., Sato,S., Seki,N., Kawarabayasi,Y., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. II. The coding sequences of 40 new genes (KIAA0041-KIAA0080) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 1 (5), 223-229 (1994) MEDLINE 96051398 FEATURES Location/Qualifiers source 1..3647 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..184 gene 185..2080 /gene="KIAA0078" CDS 185..2080 /gene="KIAA0078" /note="The ha1237 gene product is related to S.pombe rad21 gene product." /citation=[3] /codon_start=1 /db_xref="PID:d1008135" /db_xref="PID:g1531550" /translation="MFYAHFVLSKRGPLAKIWLAAHWDKKLTKAHVFECNLESSVESI ISPKVKMALRTSGHLLLGVVRIYHRKAKYLLADCNEAFIKIKMAFRPGVVDLPEENRE AAYNAITLPEEFHDFDQPLPDLDDIDVAQQFSLNQSRVEEITMREEVGNISILQENDF GDFGMDDREIMREGSAFEDDDMLVSTTTSNLLLESEQSTSNLNEKINHLEYEDQYKDD NFGEGNDGGILDDKLISNNDGGIFDDPPALSEAGVMLPEQPAHDDMDEDDNVSMGGPD SPDSVDPVEPMPTMTDQTTLVPNEEEAFALEPIDITVKETKAKRKRKLIVDSVKELDS KTIRAQLSDYSDIVTTLDLAPPTKKLMMWKETGGVEKLFSLPAQPLWNNRLLKLFTRC LTPLVPEDLRKRRKGGEADNLDEFLKEFENPEVPREDQQQQHQQRDVIDEPIIEEPSR LQESVMEASRTNIDESAMPPPPPQGVKRKAGQIDPEPVMPPQQVEQMEIPPVELPPEE PPNICQLIPELELLPEKEKEKEKEKEDDEEEEDEDASGGDQDQEERRWNKRTQQMLHG LQRALAKTGAESISLLELCRNTNRKQAAAKFYSFLVLKKQQAIELTQEEPYSDIIATP GPRFHII" 3'UTR 2081..3647 BASE COUNT 1155 a 669 c 800 g 1023 t ORIGIN 1 agtgaatcgg aaaggaggcg ccggctgtgg cggcggcggg agctgctcgg aagctacacc 61 tcgcaagggc tccccccttt ccccaccccc tcccccgacc cttttcccct ccccgggcca 121 cccagcccgc ccaactccca gcggagagca aggttttctt ctgttttcat agccagccag 181 aacaatgttc tacgcacatt ttgttctcag taaaagaggg cctctggcca aaatttggct 241 agcggcccat tgggataaga agctaaccaa agcccatgtg ttcgagtgta atttagagag 301 cagcgtggag agtatcatct caccaaaggt gaaaatggca ttacggacat caggacatct 361 cttactggga gtagttcgaa tctatcacag gaaagccaaa taccttcttg cagactgtaa 421 tgaagcattc attaagataa agatggcttt tcggccaggt gtggttgacc tgcctgagga 481 aaatcgggaa gcagcttata atgccattac tttacctgaa gaatttcatg actttgatca 541 gccactgcct gacttagatg acatcgatgt ggcccagcag ttcagcttga atcagagtag 601 agtggaagag ataaccatga gagaagaagt tgggaacatc agtattttac aagaaaatga 661 ttttggtgat tttggaatgg atgatcgtga gataatgaga gaaggcagtg cttttgagga 721 tgacgacatg ttagtaagca ctactacttc taacctccta ttagagtctg aacagagcac 781 cagcaatctg aatgagaaaa ttaaccattt agaatatgaa gatcaatata aggatgataa 841 ttttggagaa ggaaatgatg gtggaatatt agatgacaaa cttattagta ataatgatgg 901 cggtatcttt gatgatcccc ctgccctctc tgaggcaggg gtgatgttgc cagagcagcc 961 tgcacatgac gatatggatg aggatgataa tgtatcaatg ggtgggcctg atagtcctga 1021 ttcagtggat cccgttgaac caatgccaac catgactgat caaacaacac ttgttccaaa 1081 tgaggaagaa gcatttgcat tggagcctat tgatataact gttaaagaaa caaaagccaa 1141 gaggaagagg aagctaattg ttgacagtgt caaagagttg gatagcaaga caattagagc 1201 ccaacttagt gattattcag atattgttac tactttggat ctggcaccgc ccaccaagaa 1261 attgatgatg tggaaagaga caggaggagt agaaaaactg ttttctttac ctgctcagcc 1321 tttgtggaat aacagactac tgaagctctt tacacgctgt cttacaccgc ttgtaccaga 1381 agaccttaga aaaaggagga aaggaggaga ggcagataat ttggatgaat tcctcaaaga 1441 atttgaaaat ccagaggttc ctagagagga ccagcaacag cagcatcagc agcgtgatgt 1501 tatcgatgag cccattattg aagagccaag ccgcctccag gagtcagtga tggaggccag 1561 cagaacaaac atagatgagt cagctatgcc tccaccacca cctcagggag ttaagcgaaa 1621 agctggacaa attgacccag agcctgtgat gcctcctcag caggtagagc agatggaaat 1681 accacctgta gagcttcccc cagaagaacc tccaaatatc tgtcagctaa taccagagtt 1741 agaacttctg ccagaaaaag agaaggagaa agagaaggaa aaagaagatg atgaagagga 1801 agaggatgaa gatgcatcag ggggcgatca agatcaggaa gaaagaagat ggaacaaaag 1861 gactcagcag atgcttcatg gtcttcagcg tgctcttgct aaaactggag ctgaatctat 1921 cagtttgctt gagttatgtc gaaatacgaa cagaaaacaa gctgccgcaa agttctacag 1981 cttcttggtt cttaaaaagc agcaagctat tgagctgaca caggaagaac cgtacagtga 2041 catcatcgca acacctggac caaggttcca tattatataa ggagctagaa gcattatagc 2101 tagtgtttga ttcactagtg cttacaaatt gcccccatgt gtaggggaca cagaaccctt 2161 tgagaaaact tagatttttg tctgtacaaa gtctttgcct ttttccttct tcattttttt 2221 ccagtacatt aaatttgtca atttcatctt tgagggaaac tgattagatg ggttgtgttt 2281 gtgttctgat ggagaaaaca gcaccccaag gactcagaag atgattttaa cagttcagaa 2341 cagatgtgtg caatattggt gcatgtaata atgttgagtg gcagtcaaaa gtcatgattt 2401 ttatcttagt tcttcattac tgcattgaaa aggaaaacct gtctgagaaa atgcctgaca 2461 gtttaattta aaactatggt gtaagtcttt gacaagaaaa aaaaacaaac aaacacttct 2521 ttccatcagt aacactggca atcttcctgt taaccactct ccttagggat ggtatctgaa 2581 acaacaatgg tcaccctctt gagattcgtt ttaagtgtaa ttccataatg agcagaggtg 2641 tacgcgaaat tgtgttatga ctgatagcct tcagctacaa aaagatagga ctgacctggt 2701 ttaaagtgtt ctattttgta aatcattcca tttgagtctt tctgatgaac ttggctatac 2761 tgaaatctgt tattttagtg aggctccaaa atgagcaaag ctaggcctga ttagagtaga 2821 gtgactatta aaaaacataa ctttctagga gctataaatc aaagttttaa aaagatgttt 2881 ggatatattt gagtattccg atcatgaaaa cagaaattgc cctgcctact acaaggacag 2941 actgatggga aattatgcac ctggtcaact tagcttttaa gcagacgatg ctgtaaaaac 3001 taacggcttc tctgatattt attgtaagtt ttagtactga tctccttttc cagtgctgca 3061 cactcctggt ttggaacttt aatagcgttg caacgaaatc ctatatccag tttcctgtaa 3121 tttaattgaa gaaaaataca tccaaataaa gactttatta ttaacagacc agatagcatc 3181 agaaatcatg tgactgttat gattatcaga atatgtctta actttttagg gcaaagttaa 3241 cactgaaagt tctagcttaa gtgttgaaac ttttgtggga aaaaaaaatc acttttgaaa 3301 ctcagacttc agtgtatacc caataattta aaattatgtg aaatgtttta aatttgtgaa 3361 ctcgtaatta ctgttttaat gattcagttt cttcagagtg gtaattgtat aaaattgcta 3421 ttgcagcttt atattcaata tgatgtgcct gtaaaccaag gagttttccc cgtttgtaaa 3481 aagacattgt agataattga atgtttgatt ttagaaaggt cattagtttc ttgttacaca 3541 ttttgttagt ctggtttttg ttgcttatcg ggtttaatat tgttcttgaa aatagttgat 3601 gctatgttat gtataacttt tctaataaaa gttgtgttat aagctgt // LOCUS HUMORF008 4464 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0079 gene, complete cds. ACCESSION D38555 NID g559716 KEYWORDS KIAA0079. SOURCE Homo sapiens male myeloblast cell-line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4464) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (17-OCT-1994) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 4464) AUTHORS Nomura,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nomura,N., Nagase,T., Miyajima,N., Sazuka,T., Tanaka,A., Sato,S., Seki,N., Kawarabayasi,Y., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. II. The coding sequences of 40 new genes (KIAA0041-KIAA0080) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 1 (5), 223-229 (1994) MEDLINE 96051398 FEATURES Location/Qualifiers source 1..4464 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..114 gene 115..3492 /gene="KIAA0079" CDS 115..3492 /gene="KIAA0079" /note="The ha3543 gene product is related to S.cerevisiae protein encoded in chromosome VIII." /citation=[3] /codon_start=1 /db_xref="PID:d1008139" /db_xref="PID:g559717" /translation="MNVNQSVPPVPPFGQPQPIYPGYHQSSYGGQSGSTAPAIPYGAY NGPVPGYQQTPPQGMSRAPPSSGAPPASTAQAPCGQAAYGQFGQGDVQNGPSSTVQMQ RLPGSQPFGSPLAPVGNQPPVLQPYGPPPTSAQVATQLSGMQISGAVAPAPPSSGLGF GPPTSLASASGSFPNSGLYGSYPQGQAPPLSQAQGHPGIQTPQRSAPSQASSFTPPAS GGPRLPSMTGPLLPGQSFGGPSVSQPNHVSSPPQALPPGTQMTGPLGPLPPMHSPQQP GYQPQQNGSFGPARGPQSNYGGPYPAAPTFGSQPGPPQPLPPKRLDPDAIPSPIQVIE DDRNNRGTEPFVTGVRGQVPPLVTTNFLVKDQGNASPRYIRCTSYNIPCTSDMAKQAQ VPLAAVIKPLARLPPEEASPYVVDHGESGPLRCNRCKAYMCPFMQFIEGGRRFQCCFC SCINDVPPQYFQHLDHTGKRVDAYDRPELSLGSYEFLATVDYCKNNKFPSPPAFIFMI DVSYNAIRTGLVRLLCEELKSLLDFLPREGGAEESAIRVGFVTYNKVLHFYNVKSSLA QPQMMVVSDVADMFVPLLDGFLVNVNESRAVITSLLDQIPEMFADTRETETVFVPVIQ AGMEALKAAECAGKLFLFHTSLPIAEAPGKLKNRDDRKLINTDKEKTLFQPQTGAYQT LAKECVAQGCCVDLFLFPNQYVDVATLSVVPQLTGGSVYKYASFQVENDQERFLSDLR RDVQKVVGFDAVMRVRTSTGIRAVDFFGAFYMSNTTDVELAGLDGDKTVTVEFKHDDR LNEESGALLQCALLYTSCAGQRRLRIHNLALNCCTQLADLYRNCETDTLINYMAKFAY RGVLNSPVKAVRDTLITQCAQILACYRKNCASPSSAGQLILPECMKLLPVYLNCVLKS DVLQPGAEVTTDDRAYVRQLVTSMDVTETNVFFYPRLLPLTKSPVESTTEPPAVRASE ERLSNGDIYLLENGLNLFLWVGASVQQGVVQSLFSVSSFSQITSGLSVLPVLDNPLSK KVRGLIDSLRGTEIPVHEAYRGETGRQDGDAVQALPGGRQESEWGSILCGLSLSYAQG DSAATELKQVGKWHRAQASFQKAPQDVREIGTVTYLM" 3'UTR 3493..4464 BASE COUNT 1008 a 1212 c 1159 g 1085 t ORIGIN 1 agataatctg aatgctggct ggggcagaaa attactaaga tcctgtggaa gtgtgaggat 61 tattaaactg atcactgtct gataaggtga gatcaaattg ggaatgcttt cataatgaac 121 gtcaaccagt cagttccacc tgtgccacca tttgggcagc cccagcccat ctacccaggg 181 tatcatcagt ccagctatgg tgggcaatca gggtccacag cccccgccat tccctatgga 241 gcctacaatg gcccagtacc aggctatcag caaacacctc cccaaggtat gtcaagagcc 301 ccaccttcct cgggggcacc tccagcctca acagcacagg ctccttgtgg ccaggctgca 361 tatggccagt ttggccaagg agatgtacag aatgggccaa gctccactgt tcagatgcaa 421 aggctgcctg ggtctcagcc atttgggtcc ccattggccc ctgtgggcaa ccagccacct 481 gtgcttcagc cctatggccc tcccccgaca agtgcacagg tggctacgca gctgtctgga 541 atgcagatca gcggtgctgt ggccccagcc cctccttctt cagggctggg ctttggccca 601 ccaacatcgc tggcttcagc ctcaggaagt ttccctaact ctggtctgta tggctcctat 661 cctcagggcc aggctcctcc ccttagccag gcccaaggtc atcctgggat ccagactccc 721 cagcgatctg ccccatcaca ggcctccagc ttcacacccc cagcttcagg gggtcctcgg 781 ctgccttcga tgactggtcc actcctgcct ggacagagtt ttggagggcc ctcagtgagc 841 cagcccaacc atgtgtcttc acctcctcaa gctctgcccc ctggcaccca gatgactggg 901 cccctgggac cactgccacc tatgcactcc ccgcagcagc caggctatca gccccaacaa 961 aatggttcct tcggaccagc ccggggccct cagtctaatt atggaggccc ctacccagca 1021 gcacccacct ttggcagtca gcctgggcct cctcagccac tgcctcctaa gcgcctggac 1081 cctgatgcca tcccaagccc tattcaggtc attgaagatg acaggaacaa ccggggtaca 1141 gagccatttg ttactggagt acggggccag gtgccaccct tagtcactac caacttcctg 1201 gtgaaagacc aagggaatgc aagtccccga tacatccgat gtacatccta taatatccct 1261 tgcacatctg acatggctaa gcaggctcag gtgcccctgg cagcagtcat caaaccgctg 1321 gcaaggctgc ccccagagga ggcttcaccg tatgttgtgg accatgggga atctggccct 1381 ttgcgctgca accgctgcaa agcatacatg tgtcccttca tgcagttcat tgaaggaggg 1441 aggcgtttcc agtgctgttt ttgcagctgt atcaatgatg ttccccccca gtattttcag 1501 cacctggatc ataccggcaa acgtgtggat gcttatgacc gccctgagct atccctgggc 1561 tcttatgaat tcttggccac tgtagattac tgcaagaaca ataagttccc cagccctcct 1621 gcctttatct tcatgattga cgtctcctac aatgccatca ggactggtct tgttaggctc 1681 ctctgtgagg agctcaagtc actgttagac tttctaccta gggagggtgg ggcagaagag 1741 tcagcaatcc gcgttggctt tgtcacctac aataaggtgc tccacttcta taatgtgaag 1801 agctcattgg cccagccaca gatgatggtt gtgtctgatg tggctgacat gtttgtgcca 1861 ctgctggatg gcttcctggt caacgtcaat gagtctcggg cagttatcac cagcttattg 1921 gatcagattc cagaaatgtt tgcagacaca agggaaacag agacagtatt tgtaccagtt 1981 atccaggctg gaatggaggc tctgaaggct gctgagtgtg cagggaagct ctttctattc 2041 catacatccc tgcccattgc agaggcccca gggaaactga agaacagaga tgacaggaag 2101 ctgatcaata cagacaagga gaagactctg ttccagcctc agacaggtgc ctatcagacc 2161 ctggccaaag agtgtgtggc ccaaggctgc tgtgtagatc tctttctctt ccctaaccag 2221 tatgtggatg tggccacact ctctgttgtg ccccagctca ctggtggctc tgtctacaaa 2281 tatgcttcct ttcaggtgga gaacgaccag gagcggttcc tgagtgacct gcgtcgtgat 2341 gtccagaagg ttgttggctt tgatgctgtg atgcgggtcc ggacaagcac tggtatccgt 2401 gctgtagatt tctttggagc tttctacatg agcaacacga cagatgtgga gctggctggg 2461 ctagatgggg acaaaacagt gactgtggag ttcaagcatg acgatcggct caatgaagag 2521 agcggagctc tcctgcagtg tgccctgctt tacaccagct gtgcagggca gcgtcggctc 2581 cgcatccata atctggccct gaactgctgc acccagctgg ctgatctata tcgaaactgt 2641 gagactgaca cgctcatcaa ctacatggcc aagtttgcat atcggggagt cctgaatagc 2701 cctgtgaagg ctgttcgtga cacgctcatc acccagtgtg cccagatcct ggcctgttac 2761 agaaagaact gtgctagccc ctcctctgca ggacagttga tccttcctga gtgcatgaag 2821 ctactcccag tttacctgaa ctgtgtgttg aagagtgatg tcctgcagcc tggagctgaa 2881 gtcactactg atgaccgtgc ctatgtccga cagctagtta cctccatgga tgtgactgag 2941 accaatgtct tcttctaccc tcggctctta cctttgacaa agtctcccgt tgagagtact 3001 accgaaccac cagcagttcg agcctctgaa gagcgtctaa gcaatgggga tatatattta 3061 ctggagaatg ggctcaacct cttcctctgg gtgggagcaa gcgtccaaca gggtgttgtc 3121 cagagccttt tcagcgtctc ctccttcagt cagatcacca gtggtttgag tgttctgcca 3181 gttctggata atccactgtc caagaaggtt cgaggcctca ttgatagctt acggggcaca 3241 gagatcccgg tacatgaagc ttaccgtggt gaaacaggaa gacaagatgg agatgctgtt 3301 caagcacttc ctggtggaag acaagagtct gagtggggga gcatcttatg tggactttct 3361 ctgtcatatg cacaaggaga ttcggcagct actgagctaa agcaagtggg taaatggcat 3421 agggcccagg ctagcttcca gaaagcaccc caggatgtca gagaaattgg gacagtaaca 3481 tatcttatgt aagctgacct cagtctctct ggggggaggg ggagatataa ggagacacct 3541 tctttctggg ctcaagtatc ctgccactct gtcatgtcct gctgatggaa ggtgcccctg 3601 ttccctcatt ctaccctctt tttcctgcta atcctgtcat aatgaatgta gcttctcagt 3661 tcactgtata tgattcggta ttgggggttt ggaggcaccc agaccctggc aatattatgt 3721 gtccctttgg accagtctcc caagaggaga ggggcaggca ggaaagagtg gggatcctaa 3781 ggttactaca gggggctcag tgtcatccac aacttcctat attagggata aaacatatag 3841 gtgcacaaga gctggggtat agcccatagg tggtggagag aaaagtggtc agtccttctt 3901 gggcctggag gttagcagtc aagtttctct gctttcactg ctcgctcgct ctctcctgca 3961 atgattgatg atcactccgt ggatagagag gcacactgtc agaggtgacc ggagaactga 4021 gttgcaaaat atattaagat ctggtagagg taccagcttc ctttccagct ggagaggccc 4081 caacactgga tggttctgta gggagcctag ggagcctggt catcaacttg caatacctca 4141 cagagccagt tcacatccca ctctgagctc ccacgagaaa cactgcttct ccaggcccgg 4201 ggttgttggg gagagaggca gaggcagctg gagcgccgtt ctctcctgct gggacaccgc 4261 ttgggctttg gtattgactg agtggctgac agttatcttc caaccccaac tggcttgggg 4321 gcaggacaag ggctaggctt gatggtggcc aggcttgcct gctccccacc tgggatgccc 4381 ctgctctgga cctctcattt ctcttcattg gtttattttt caatgcatct ttaatttgta 4441 aagaaataaa ataaattaag atgt // LOCUS HUMORF01 836 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0101 gene, complete cds. ACCESSION D14657 NID g285938 KEYWORDS KIAA0101. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 836) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (15-MAR-1993) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 836) AUTHORS Miyajima,N. JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nagase,T., Miyajima,N., Tanaka,A., Sazuka,T., Seki,N., Sato,S., Tabata,S., Ishikawa,K.-i., Kawarabayasi,Y., Kotani,H. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. III. The coding sequences of 40 new genes (KIAA0081-KIAA0120) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (1), 37-43 (1995) MEDLINE 95308325 FEATURES Location/Qualifiers source 1..836 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="Myeloblast" /sex="male" 5'UTR <1..61 gene 62..397 /gene="KIAA0101" CDS 62..397 /gene="KIAA0101" /codon_start=1 /db_xref="PID:d1004002" /db_xref="PID:g285939" /translation="MVRTKADSVPGTYRKVVAARAPRKVLGSSTSATNSTSVSSRKAE NKYAGGNPVCVRPTPKWQKGIGEFFRLSPKDSEKENQIPEEAGSSGLGKAKRKACPLQ PDHTNDEKE" 3'UTR 398..>836 BASE COUNT 249 a 143 c 167 g 277 t ORIGIN 1 gtgaaacacc ctcggctggg aagtcagttc gttctctcct ctcctctctt cttgtttgaa 61 catggtgcgg actaaagcag acagtgttcc aggcacttac agaaaagtgg tggctgctcg 121 agcccccaga aaggtgcttg gttcttccac ctctgccact aattcgacat cagtttcatc 181 gaggaaagct gaaaataaat atgcaggagg gaaccccgtt tgcgtgcgcc caactcccaa 241 gtggcaaaaa ggaattggag aattctttag gttgtcccct aaagattctg aaaaagagaa 301 tcagattcct gaagaggcag gaagcagtgg cttaggaaaa gcaaagagaa aagcatgtcc 361 tttgcaacct gatcacacaa atgatgaaaa agaatagaac tttctcattc atctttgaat 421 aacgtctcct tgtttaccct ggtattctag aatgtaaatt tacataaatg tgtttgttcc 481 aattagcttt gttgaacagg catttaatta aaaaatttag gtttaaattt agatgttcaa 541 aagtagttgt gaaatttgag aatttgtaag actaattatg gtaacttagc ttagtattca 601 atataatgca ttgtttggtt tcttttacca aattaagtgt ctagttcttg ctaaaatcaa 661 gtcattgcat tgtgttctaa ttacaagtat gttgtatttg agatttgctt agattgttgt 721 actgctgcca tttttattgg tgtttgatta ttggaatggt gccatattgt cactccttct 781 acttgcttta aaaagcagag ttagattttt gcacattaaa aaattcagta ttaatt // LOCUS HUMORF02 1370 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0102 gene, complete cds. ACCESSION D14658 NID g285940 KEYWORDS KIAA0102. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1370) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (15-MAR-1993) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 1370) AUTHORS Miyajima,N. JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nagase,T., Miyajima,N., Tanaka,A., Sazuka,T., Seki,N., Sato,S., Tabata,S., Ishikawa,K.-i., Kawarabayasi,Y., Kotani,H. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. III. The coding sequences of 40 new genes (KIAA0081-KIAA0120) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (1), 37-43 (1995) MEDLINE 95308325 FEATURES Location/Qualifiers source 1..1370 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="Myeloblast" /sex="male" 5'UTR <1..307 gene 308..679 /gene="KIAA0102" CDS 308..679 /gene="KIAA0102" /codon_start=1 /db_xref="PID:d1004003" /db_xref="PID:g285941" /translation="MHPFPESKPVLALCVISYFVMMGILTIYTSYKEKSIFLVAHRKD PTGMDPDDIWQLSSSLKRFDDKYTLKLTFISGRTKQQREAEFTKSIAKFFDHSGTLVM DAYEPEISRLHDSLAIERKIK" 3'UTR 680..>1370 BASE COUNT 375 a 231 c 310 g 454 t ORIGIN 1 ggcggcggca gctgtacagg gcgggagaag cggtggtagc ggaggctgta gtggggctgg 61 tggtgcttcc aactgcggga caggaagtgg ccgtagcggc ttgttggata agtggaagat 121 agatgataag cctgtaaaaa ttgacaagtg ggatggatca gctgtgaaaa actctttgga 181 tgattctgcc aaaaaggtac ttctggaaaa atacaaatat gtggagaatt ttggtctaat 241 tgatggtcgc ctcaccatct gtacaatctc ctgtttcttt gccatagtgg ctttgatttg 301 ggattatatg cacccctttc cagagtccaa acccgttttg gctttgtgtg tcatatccta 361 ttttgtgatg atggggattc tgaccattta tacctcatat aaggagaaga gcatctttct 421 cgtggcccac aggaaagatc ctacaggaat ggatcctgat gatatttggc agctgtcctc 481 cagtcttaaa aggtttgatg acaaatacac cttgaagctg accttcatca gtgggagaac 541 aaagcagcag cgggaagccg agttcacaaa gtccattgct aagttttttg accacagtgg 601 gacactggtc atggatgcat atgagcctga aatatccagg ctccatgaca gtcttgccat 661 agaaagaaaa ataaagtagc caattctaaa agtagccctc tttctcctgg atcttgctga 721 attagtggct tggggggtgg gggagataaa aagaacttaa aatgggtaaa gtaagaaatg 781 ttaaaaagtc cctgttttgt cctgaaattt tagtctattc tgggtaaata ggattttctg 841 acacagatat gagaagttgt agctctgatg tctagctgta gtctccttga tctgctgatt 901 gcattatttt aatttgcttt tctgggaaag cagttttgct aaaagctgta cagacttttt 961 cttttgtacc tagcagtact ttatatagta tagctttggg ccatgtagca ttttaagact 1021 caattttaaa aaattattaa tctgttgctg actcttaatt cctatttcaa tatgtgtttc 1081 cttgaagaat tcaggataca acttcttgtg tatgacagct ttccttcaca cactattttt 1141 gtgggtgtgt atatatctga tttgggaaga atttaaaaaa cacatagctt tttaatttgt 1201 ttgaaacaga ctttctgcct gttacatttt tgcttttaac caattaaaga agccaatggc 1261 attttagttt tatattgtgt tttccactag tatatccctg ttgatttgtt tgtgcctttt 1321 attaactgcc attttctaaa atttttttca ataaaaggaa ggaagatgtg // LOCUS HUMORF03 1219 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0103 gene, complete cds. ACCESSION D14659 NID g285942 KEYWORDS KIAA0103. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1219) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (15-MAR-1993) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 1219) AUTHORS Miyajima,N. JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nagase,T., Miyajima,N., Tanaka,A., Sazuka,T., Seki,N., Sato,S., Tabata,S., Ishikawa,K.-i., Kawarabayasi,Y., Kotani,H. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. III. The coding sequences of 40 new genes (KIAA0081-KIAA0120) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (1), 37-43 (1995) MEDLINE 95308325 FEATURES Location/Qualifiers source 1..1219 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="Myeloblast" /sex="male" 5'UTR <1..6 gene 7..900 /gene="KIAA0103" CDS 7..900 /gene="KIAA0103" /codon_start=1 /db_xref="PID:d1004004" /db_xref="PID:g285943" /translation="MAKVSELYDVTWEEMRDKMRKWREENSRNSEQIVEVGEELINEY ASKLGDDIWIIYEQVMIAALDYGRDDLALFCLQELRRQFPGSHRVKRLTGMRFEAMER YDDAIQLYDRILQEDPTNTAARKRKIAIRKAQGKNVEAIRELNEYLEQFVGDQEAWHE LAELYINEHDYAKAAFCLEELMMTNPHNHLYCQQYAEVKYTQGGLENLELSRKYFAQA LKLNNRNMRALFGLYMSASHIASNPKASAKTKKDNMKYASWAASQINRAYQFAGRSKK ETKYSLKAVEDMLETLQITQS" 3'UTR 901..>1219 BASE COUNT 432 a 189 c 266 g 332 t ORIGIN 1 gggaagatgg cgaaggtctc agagctttac gatgtcactt gggaagaaat gagagataaa 61 atgagaaaat ggagagaaga aaactcaaga aatagtgagc aaattgtgga agttggagaa 121 gaattaatta atgaatatgc ttctaagctg ggagatgata tttggatcat atatgaacag 181 gtgatgattg cagcactaga ctatggtcgg gatgacttgg cattgttttg tcttcaagag 241 ctgagaagac agttccctgg cagtcacaga gtcaagcgat taacaggcat gagatttgaa 301 gccatggaaa gatatgatga tgctatacag ctatatgata ggattttaca agaagatcca 361 actaacactg ctgcaagaaa gcgtaagatt gccattcgaa aagcccaggg gaaaaatgtg 421 gaggccattc gggagctgaa tgagtatctg gaacaatttg ttggagacca agaagcctgg 481 catgaacttg cagaacttta catcaatgaa catgactatg caaaagcagc cttttgttta 541 gaggaactaa tgatgactaa tccacacaac cacttatact gtcagcagta tgctgaagtt 601 aagtataccc aaggtggact tgaaaacctc gaactttcaa gaaagtattt tgcacaggca 661 ttgaaactga acaacagaaa tatgagagct ttgtttggac tttatatgtc ggcaagtcat 721 attgcttcta atccaaaagc aagtgcaaaa acgaaaaagg acaacatgaa atatgctagt 781 tgggcagcta gtcaaataaa cagagcttat cagtttgcag gtcgaagtaa gaaggaaacc 841 aaatattctc ttaaggctgt cgaagacatg ttggaaacat tgcagatcac ccagtcttaa 901 ggtttcaaaa actctttgac attagatttc acaactgcac aattgaactt attggcctgt 961 aacttattta ctaaatgctc agtgctattt atatactaca gtaattttct gttaagaagg 1021 cagttgtaaa gaatgtgttt atataaacct aaaaatgcct tttactgcta agtggggaga 1081 tgggggaaat ccatggaaga gagatttaag acttattgat tgtacatcag tctcttcata 1141 tcacatatac atgtatatat ataaaactct aatgtagtat aaccttgtta aataaaccat 1201 gatgatttat taaacttgc // LOCUS HUMORF04 1322 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0104 gene, complete cds. ACCESSION D14660 NID g285944 KEYWORDS KIAA00104. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1322) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (15-MAR-1993) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 1322) AUTHORS Miyajima,N. JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nagase,T., Miyajima,N., Tanaka,A., Sazuka,T., Seki,N., Sato,S., Tabata,S., Ishikawa,K.-i., Kawarabayasi,Y., Kotani,H. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. III. The coding sequences of 40 new genes (KIAA0081-KIAA0120) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (1), 37-43 (1995) MEDLINE 95308325 FEATURES Location/Qualifiers source 1..1322 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="Myeloblast" /sex="male" 5'UTR <1..34 gene 35..877 /gene="KIAA00104" CDS 35..877 /gene="KIAA00104" /codon_start=1 /db_xref="PID:d1004005" /db_xref="PID:g285945" /translation="MGLGRSFQAARTLLPPPASIACRVHAGPVRQQSTGPSEPGAFQP PPKPVIVDKHRPVEPERRFLSPEFIPRRGRTDPLKFQIERKDMLERRKVLHIPEFYVG SILRVTTADPYASGKISQFLGICIQRSGRGLGATFILRNVIEGQGVEICFELYNPRVQ EIQVVKLEKRLDDSLLYLRDALPEYSTFDVNMKPVVQEPNQKVPVNELKVKMKPKPWS KRWERPNFNIKGIRFDLCLTEQQMKEAQKWNQPWLEFDMMREYDTSKIEAAIWKEIEA SKRS" 3'UTR 878..>1322 BASE COUNT 425 a 244 c 300 g 353 t ORIGIN 1 ggcggcctgc attgcagcgg ggcactgggc tgcaatgggc ctaggccgga gtttccaagc 61 cgccaggact ctgctccccc cgccggcctc tatcgcctgc agggtccacg cggggcctgt 121 ccggcagcag agcactgggc cttccgagcc cggtgcgttc caaccgccgc cgaaaccggt 181 catcgtggac aagcaccgcc ccgtggaacc ggaacgcagg ttcttgagtc ctgaattcat 241 tcctcgaagg ggaagaacag atcctctgaa atttcaaata gaaagaaaag atatgttaga 301 aaggagaaaa gtactccaca ttccagagtt ctatgttgga agtattcttc gtgttactac 361 agctgaccca tatgccagtg gaaaaatcag ccagtttctg gggatttgca ttcagagatc 421 aggaagagga cttggagcta ctttcatcct taggaatgtt atcgaaggac aaggtgtcga 481 gatttgcttt gaactttata atcctcgggt ccaggagatt caggtggtca aattagagaa 541 acggctggat gatagcttgc tatacttacg agatgccctt cctgaatata gcacttttga 601 tgtgaatatg aagccagtag tacaagagcc taaccaaaaa gttcctgtta atgagctgaa 661 agtaaaaatg aagcctaagc cctggtctaa acgctgggaa cgtccaaatt ttaatattaa 721 aggaatcaga tttgatcttt gtttaactga acagcaaatg aaagaagctc agaagtggaa 781 tcagccatgg cttgaatttg atatgatgag ggaatatgat acttcaaaaa ttgaagctgc 841 aatatggaag gaaattgaag cgtcgaaaag gtcttgattc tgagaatgaa tttggttagt 901 tgcagaagat acattggctc taagaggata tattttgaga ccaatttaat ttcatttata 961 agaacatagt aattaagtga actaagcatt cattgtttta ttaatacttt ttttctaaaa 1021 taaaacttgt acaccagttt attactctaa aaagagaatt acacatgcca aatggaccaa 1081 tgtccatttg cttattggag gcaaagctac aatagaagtc agagcatcac cagaatggtc 1141 tttaatgagc atggaacctg agcaaaggga ataggtggga tgaatttttt ttttaattgt 1201 gaaacaattc ataagcacaa tatgatttac agaataataa acattcatgt acccactatc 1261 aggttaagaa atagaacatt tattaatatg taggaatgtt aagaaataaa acatttaata 1321 ag // LOCUS HUMORF05 1622 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0105 gene, complete cds. ACCESSION D14661 NID g285946 KEYWORDS KIAA0105. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1622) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (15-MAR-1993) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 1622) AUTHORS Miyajima,N. JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nagase,T., Miyajima,N., Tanaka,A., Sazuka,T., Seki,N., Sato,S., Tabata,S., Ishikawa,K.-i., Kawarabayasi,Y., Kotani,H. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. III. The coding sequences of 40 new genes (KIAA0081-KIAA0120) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (1), 37-43 (1995) MEDLINE 95308325 FEATURES Location/Qualifiers source 1..1622 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="Myeloblast" /sex="male" 5'UTR <1..124 gene 125..580 /gene="KIAA0105" CDS 125..580 /gene="KIAA0105" /codon_start=1 /db_xref="PID:d1004006" /db_xref="PID:g285947" /translation="MTNEEPLPKKVRLSETDFKVMARDELILRWKQYEAYVQALEGKY TDLNSNDVTGLRESEEKLKQQQQESARRENILVMRLATKEQEMQECTTQIQYLKQVQQ PSVAQLRSTMVDPAINLFFLKMKGELEQTKDKLEQAQNELSAWKFTPDR" 3'UTR 581..>1622 BASE COUNT 496 a 321 c 364 g 441 t ORIGIN 1 ggccggcggc agagctgtcc ggctgcgcgg tggcccgggg ggcccgggcg gcagggcaag 61 cagcgcggcc tcggcctatg cgaccggtgg cgccggcgcg gcttctgcct ggagaggatt 121 caagatgacc aacgaagaac ctcttcccaa gaaggttcga ttgagtgaaa cagacttcaa 181 agttatggca agagatgagt taattctaag atggaaacaa tatgaagcat atgtacaagc 241 tttggagggc aagtacacag atcttaactc taatgatgta actggcctaa gagagtctga 301 agaaaaacta aagcaacaac agcaggagtc tgcacgcagg gaaaacatcc ttgtaatgcg 361 actagcaacc aaggaacaag agatgcaaga gtgtactact caaatccagt acctcaagca 421 agtccagcag ccgagcgttg cccaactgag atcaacaatg gtagacccag cgatcaactt 481 gtttttccta aaaatgaaag gtgaactgga acagactaaa gacaaactgg aacaagccca 541 aaatgaactg agtgcctgga agtttacgcc tgataggtaa acaaatcata ctccccagtc 601 aagacttccc tgacagtccc actacgagaa agctgtggtg ggacagccaa gtactcgttt 661 ccacaccaag actcagactt tttgagccaa aaaaaagcca cattcttaca ctgtccagct 721 tgtaatggtt aatgtaaaac ttaccagatg aaccttgtgt ttcagctttt ttcttttccc 781 cttccccttg cttcagaggc ctgatggcgt cggactattc cgaagaagtg gccacctccg 841 aaaaattccc cttctagaac atgtagacac ttgagaaatg tttctgtttg aagaaaatag 901 agggagaaac agaagtctta agtctgtggc acactgtgtc ttcagacagt ttgaaggaat 961 gaaaacctag agattttaaa tcatgaattg aacatgtaaa attccagtaa aatgtaaaaa 1021 cggaatatgc atcgctctta accttgagca tagtgactta gagacactgt gtatcagttt 1081 tgccaataag actgtggact tcatgattgt tgttgaactt ctgggtcaaa actcaaatga 1141 ggtgaatttt gcctttaaag ggtttatttg ctgagaacca actttcaata gtcatgagag 1201 aatcaaataa tagatgtccg tacaagtagc gcatatattt aaccatttag tttggggctc 1261 tatattactt gcttgagcct taatcaatgt ggttttattc aatggtttgt tctttgaatg 1321 gttgcaaaaa ctgtagataa tcttactgag gactgtacaa acatgaaggt gtggtatcaa 1381 acttcaggtt gaaactgttt gaagcattat aaacattcat ttcacaacta gattgtataa 1441 ggatattagc tgtgatgaga ctcactgcat tatttttttt agtgaatttt atgaaatccc 1501 cgttccattc aacaggcaca tgtttaaaag agctttgtcg ttggtgttaa tgggggaatg 1561 tgttccttca ttgtatttgg gccttttgta ttgcactctt gatattaaat taaatgtgcc 1621 tt // LOCUS HUMORF06 1653 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0106 gene, complete cds. ACCESSION D14662 NID g285948 KEYWORDS KIAA0106. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1653) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (15-MAR-1993) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 1653) AUTHORS Miyajima,N. JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nagase,T., Miyajima,N., Tanaka,A., Sazuka,T., Seki,N., Sato,S., Tabata,S., Ishikawa,K.-i., Kawarabayasi,Y., Kotani,H. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. III. The coding sequences of 40 new genes (KIAA0081-KIAA0120) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (1), 37-43 (1995) MEDLINE 95308325 REFERENCE 4 (sites) AUTHORS Kim,T.S., Sundaresh,C.S., Feinstein,S.I., Dodia,C., Skach,W.R., Jain,M.K., Nagase,T., Seki,N., Ishikawa,K., Nomura,N. and Fisher,A.B. TITLE Identification of a human cDNA clone for lysosomal type Ca2+-independent phospholipase A2 and properties of the expressed protein JOURNAL J. Biol. Chem. 272 (4), 2542-2550 (1997) MEDLINE 97153037 REMARK Erratum:[[published erratum appears in J Biol Chem 1997 Apr 18;272(16):10981]] FEATURES Location/Qualifiers source 1..1653 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="Myeloblast" /sex="male" 5'UTR <1..43 gene 44..718 /gene="KIAA0106" CDS 44..718 /gene="KIAA0106" /codon_start=1 /db_xref="PID:d1004007" /db_xref="PID:g285949" /translation="MPGGLLLGDVAPNFEANTTVGRIRFHDFLGDSWGILFSHPRDFT PVCTTELGRAAKLAPEFAKRNVKLIALSIDSVEDHLAWSKDINAYNCEEPTEKLPFPI IDDRNRELAILLGMLDPAEKDEKGMPVTARVVFVFGPDKKLKLSILYPATTGRNFDEI LRVVISLQLTAEKRVATPVDWKDGDSVMVLPTIPEEEAKKLFPKGVFTKELPSGKKYL RYTPQP" 3'UTR 719..>1653 BASE COUNT 443 a 365 c 359 g 486 t ORIGIN 1 cggttgcttg ctgtcccagc ggcgccccct catcaccgtc gccatgcccg gaggtctgct 61 tctcggggac gtggctccca actttgaggc caataccacc gtcggccgca tccgtttcca 121 cgactttctg ggagactcat ggggcattct cttctcccac cctcgggact ttaccccagt 181 gtgcaccaca gagcttggca gagctgcaaa gctggcacca gaatttgcca agaggaatgt 241 taagttgatt gccctttcaa tagacagtgt tgaggaccat cttgcctgga gcaaggatat 301 caatgcttac aattgtgaag agcccacaga aaagttacct tttcccatca tcgatgatag 361 gaatcgggag cttgccatcc tgttgggcat gctggatcca gcagagaagg atgaaaaggg 421 catgcctgtg acagctcgtg tggtgtttgt ttttggtcct gataagaagc tgaagctgtc 481 tatcctctac ccagctacca ctggcaggaa ctttgatgag attctcaggg tagtcatctc 541 tctccagctg acagcagaaa aaagggttgc caccccagtt gattggaagg atggggatag 601 tgtgatggtc cttccaacca tccctgaaga agaagccaaa aaacttttcc cgaaaggagt 661 cttcaccaaa gagctcccat ctggcaagaa atacctccgc tacacacccc agccttaagt 721 ctcttggaga agttggtgct gtgagccaga ggatgtcagc tgccaattgt gttttcctgc 781 agcaattcca taaacacatc ctggtgtcat cacagccaag gtttttaggt tgctatacca 841 atggcttatt aaatgaaaat ggcactaaaa gtttcttgag attctttata ctctctgcct 901 tcagcaatca attccattca tacatcagca ctctgctggt tctgtttgaa atatgttctg 961 tatttaaaac tcaaatcttg ttggatctct gcagggcttg tgaccaatga agtcatattt 1021 gttgatggtt gacaaagctt gcttcactcc atcagagaat gactatcaat ttttttttaa 1081 ctgtcctatc acgtcctctc ctgtcaccca ttttgaagag tggcagaact tgaagttcaa 1141 cttcctctgt aaatatccaa gtataaagcc caggaacttc tagaataacc cagatgcgct 1201 ttaatttttt ttaatatgtt ttgatcacag aacttctaga ataacccaga tgctctttca 1261 tattctttta atacatcttg atcacagctg ggggaaaaaa agctttttaa ttctgtacct 1321 tcctagtaga taagtgaaga gcagggaaag agacctttaa atattttgct ataaaaaaat 1381 ttgtgataag tttctatcaa aatggggaga ttgcagaaaa ggcttccctt ggctcccaag 1441 gaggtgtagc aggtgtgagc aatattagtg ccatgtgcct ttcacacagg gtttgcattt 1501 atcagtctgt tttccgatga tgtgtacatg aaagagtaca ccatgtgaag agaagagaga 1561 atgattgaaa atgttttagt atagaactct tcttgcagtg ggttgctatt ttctagattt 1621 tactttttag ggaacaaaat aaaatccttt gtt // LOCUS HUMORF07 1308 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0107 gene, complete cds. ACCESSION D14663 NID g285950 KEYWORDS KIAA0107. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1308) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (15-MAR-1993) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 1308) AUTHORS Miyajima,N. JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nagase,T., Miyajima,N., Tanaka,A., Sazuka,T., Seki,N., Sato,S., Tabata,S., Ishikawa,K.-i., Kawarabayasi,Y., Kotani,H. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. III. The coding sequences of 40 new genes (KIAA0081-KIAA0120) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (1), 37-43 (1995) MEDLINE 95308325 FEATURES Location/Qualifiers source 1..1308 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="Myeloblast" /sex="male" 5'UTR <1..25 gene 26..1195 /gene="KIAA0107" CDS 26..1195 /gene="KIAA0107" /codon_start=1 /db_xref="PID:d1004008" /db_xref="PID:g285951" /translation="MPLENLEEEGLPKNPDLRIAQLRFLLSLPEHRGDAAVRDELMAA VRDNNMAPYYEALCKSLDWQIDVDLLNKMKKANEDELKRLDEELEDAEKNLGESEIRD AMMAKAEYLCRIGDKEGALTAFRKTYDKTVALGHRLDIVFYLLRIGLFYMDNDLITRN TEKAKSLIEEGGDWDRRNRLKVYQGLYCVAIRDFKQAAELFLDTVSTFTSYELMDYKT FVTYTVYVSMIALERPDLREKVIKGAEILEVLHSLPAVRQYLFSLYECRYSVFFQSLA VVEQEMKKDWLFAPHYRYYVREMRIHAYSQLLESYRSLTLGYMAEAFGVGVEFIDQEL SRFIAAGRLHCKIDKVNEIVETNRPDSKNWQYQETIKKGDLLLNRVQKLSRVINM" 3'UTR 1196..>1308 BASE COUNT 393 a 263 c 316 g 336 t ORIGIN 1 gtcagccgct gtccccttag ccgcgatgcc gctggagaac ctggaggagg agggtctgcc 61 caagaacccc gacttgcgta tcgcgcagct gcgcttcctg ctcagcctgc ccgagcaccg 121 cggagacgct gccgtgcgcg acgagctgat ggcggccgtc cgcgataaca acatggctcc 181 ttactatgaa gccttgtgca aatccctcga ctggcagata gacgtggacc tactcaataa 241 aatgaagaag gcaaatgaag atgagttgaa gcgtttggat gaggagctgg aagatgcaga 301 gaagaatcta ggagagagcg aaattcgcga tgcaatgatg gcaaaggccg agtacctctg 361 ccggataggt gacaaagagg gagctctgac agcctttcgc aagacatatg acaaaactgt 421 ggccctgggt caccgattgg atattgtatt ctatctcctt aggattggct tattttatat 481 ggataatgat ctcatcacac gaaacacaga aaaggccaaa agcttaatag aagaaggagg 541 agactgggac aggagaaacc gcctaaaagt gtatcagggt ctttattgtg tggctattcg 601 tgatttcaaa caggcagctg aactcttcct tgacactgtt tcaacattta catcctatga 661 actcatggat tataaaacat ttgtgactta tactgtctat gtcagtatga ttgccttaga 721 aagaccagat ctcagggaaa aggtcattaa aggagcagag attcttgaag tgttgcacag 781 tcttccagca gttcggcagt atctgttttc actctatgaa tgccgttact ctgttttctt 841 ccaatcatta gcggttgtgg aacaggaaat gaaaaaggac tggctttttg cccctcatta 901 tcgatactat gtaagagaaa tgagaattca tgcatacagt cagctgctgg aatcatatag 961 gtcattaacc cttggctata tggcagaagc gtttggtgtt ggtgtggaat tcattgatca 1021 ggaactgtcc aggtttattg ctgccgggag actacactgc aaaatagata aagtgaatga 1081 aatagtagaa accaacagac ctgatagcaa gaactggcag taccaagaaa ctatcaagaa 1141 aggagatctg ctactaaaca gagttcaaaa actttccaga gtaattaata tgtaaagcca 1201 tgtaactaac aaaggatttg ctttagagat aattatttgg aatttttata gcttacttca 1261 caatgtgccc aggtcagctg tataaaataa atactgcatt gttgtttc // LOCUS HUMORF08 3694 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0022 gene, complete cds. ACCESSION D14664 NID g285952 KEYWORDS KIAA0022. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3694) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (11-NOV-1992) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 3694) AUTHORS Nomura,N., Miyajima,N., Kawarabayasi,Y. and Tabata,S. TITLE Prediction of new human genes by entire sequencing of randomly sampled cDNA clones JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 JOURNAL DNA Res. 1 (1), 27-35 (1994) MEDLINE 96051387 REFERENCE 4 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 (supplement) JOURNAL DNA Res. 1 (1), 47-56 (1994) MEDLINE 96051389 FEATURES Location/Qualifiers source 1..3694 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="Myeloblast" /sex="male" 5'UTR <1..184 gene 185..697 /gene="KIAA0022" CDS 185..697 /gene="KIAA0022" /codon_start=1 /db_xref="PID:d1004009" /db_xref="PID:g285953" /translation="MISIHNEEENAFILDTLKKQWKGPDDILLGMFYDTDDASFKWFD NSNMTFDKWTDQDDDEDLVDTCAFLHIKTGEWKKGNCEVSSVEGTLCKTAIPYKRKYL SDNHILISALVIASTVILTVLGAIIWFLYKKHSDSRFTTVFSTAPQSPYNEDCVLVVG EENEYPVQFD" 3'UTR 698..>3694 BASE COUNT 1185 a 696 c 644 g 1169 t ORIGIN 1 gctccgggcc gcgctgcccg cgctcctgct gccgttgctg ggcctcgccg ctgctgccgt 61 cgcggactgt ccttcatcta cttggattca gttccaagac agttgttaca tttttctcca 121 agaagccatc aaagtagaaa gcatagagga tgtcagaaat cagtgtactg accatggagc 181 ggacatgata agcatacata atgaagaaga aaatgctttt atactggata ctttgaaaaa 241 gcaatggaaa ggcccagatg atatcctact aggcatgttt tatgacacag atgatgcgag 301 tttcaagtgg tttgataatt caaatatgac atttgataag tggacagacc aagatgatga 361 tgaggattta gttgacacct gtgcttttct gcacatcaag acaggtgaat ggaaaaaagg 421 aaattgtgaa gtttcttctg tggaaggaac actatgcaaa acagctatcc catacaaaag 481 gaaatattta tcagataacc acattttaat atcagcattg gtgattgcta gcacggtaat 541 tttgacagtt ttgggagcaa tcatttggtt cctgtacaaa aaacattctg attctcgttt 601 caccacagtt ttttcaaccg caccccaatc accttataat gaagactgtg ttttggtagt 661 tggagaagaa aatgaatatc ctgttcaatt tgactaagtt tttggtaatc ttgcactaag 721 acatcaacaa atgccctggc agagataact tgggaaagat tttaatataa aacttgacat 781 tggatattag agctttaatg gtattcctta ttccagtaac atttttatgt actcatctgc 841 tgtgaaaagt ctttaggttc attaaaaaaa caggttttag aaatgatctt agatctaata 901 tagtgatttt aagcatcccg tcaaaggcag aatctgtcac ttgaatgaag gaaagcttaa 961 agcccaagca gataaaaata aaagcccagc ctatttgtct tgcctgctgt atcttcccta 1021 tttagttgac ccactttagt ttatatgttt attagtaaac atgaaatggg gaataagtga 1081 ttttaagtac atcccataca tttaaatatc tttgataatt gttatttttt tggcagataa 1141 ttcctctaga atgtgtatct ttttatgatt tagatgaaga aaattttaca acttttaaca 1201 ccccacacca attttagttt cattactttt acacacacca ttttatcaca aatgactcaa 1261 gttttaatga atgtttataa attatttgaa acaaaatatg atcgctgtgt ccaggatggc 1321 atagagaaag ctggcaatta ggttaacact tacatattat agtgcccctt taaggatttc 1381 tctcttgcca ccataccttt tgtactttcc cctatacaag atgtatctca ttctcctcaa 1441 gcatttataa atttttcctt caatgacatg aaaactgtgc aagcaaaaac cgaagaaaaa 1501 cacttaagta caactgtagt gacagtgatc aaagttttca gtgcatttat tgtacatttt 1561 aagaaaaagg tgaaaatcat ttggggagta aaaaaatgaa aaagctgaaa cgagtaattt 1621 tcctcaccat caataaacca aaaacaggaa agataaagaa tgtataaatt tcacgtaaat 1681 tagtcacgta tcacttatca atggggatac gttctaagaa atgcatagtt agggaatctt 1741 gtgtgaaaat cagcttgtat ttacacaaac ccagatggta gagcctattt tgtcccaaac 1801 ctacacagca tgttactgtg ctgaatactg cagacaattg taacacaata tttgtgtatc 1861 taaatataga aaaggtacag taaaaatatg gtctactaag gaaacactgt tctatatgtg 1921 gtccattact gactgaagta tactgtctag aagtctgagg ctcaaagaaa agtaatccct 1981 cttctgaatc cacaccccat caattatctt actttcttct ggggagatag atagatatac 2041 tatctcacta gcttgactaa tggcaacaaa gttccagctt gtgtagtctc tttttattga 2101 ccacatgaat cgaaaacact catcacaatt aatggcacta tcattaatga gacatgagta 2161 actaaaaagt gatagaaaac tattacagtg cggctacatg gtactgaaaa tgcaggcatt 2221 acaccagctg ttacacaagc acaagcatgc tctgtaagag ctttacattt ctgagatttt 2281 gtatagtgat tgagatgtct attttattat tgatagacta ttactaatgt caatattgaa 2341 cactaccctg gaattcctgc ctggttttcc tacccaaatt gtaccactcc ttgaagaact 2401 acaggcacag taaaaaaata tggcgtatta tgtgaactaa aagagttcta aaggagttct 2461 taaaggagtg gtagaatttg ggtaggaaag tgattaagtc caacttaaaa ccaacagtct 2521 caaacgtcta caactacaat gtccaatgag ccactagcca catgaggcta tttaagtaaa 2581 tttagtttaa aatccagttt tcgaattaca ttagccacat tgtcaagtgt tcaaatcaca 2641 ggtggttagt ggctactgta ctgggcaaca tacattatag aacattttca ttataggaag 2701 ttttattggg cagtgctgct cttaaatcct accttccact caactcccat acaactttct 2761 tttgtacatt ttgatacttt ctacctaatg gcagctcttc caaaatagct gctttaaact 2821 ctgatttaat tttcaatatt tggtttcatt tttcaacagg ccaagaggcc tctggtaatg 2881 aagtgctata tatatatata tatgacggag tctcactgtg ctgcccaggc tacagtgcag 2941 tggctcgatc ttggctctct ccaatctccg ccttgcaggt tttcaagcaa ttctcctgcc 3001 tcagcctcct tagtagctgg gaccacagac atctgtcacc acacccagct aactttttgt 3061 atttttggta gagacggggt ttcgccatat tgactgggct ggtctcaaac tcctgacctc 3121 aagtgatcca cccaccttgg tctcccaaag tgctgggatt acatgcgtga gccaccacac 3181 ttggcctaca ttttttcttt atataccaga acatctataa caggcacctt atctactcat 3241 tagtgaagag ataattggat tacacaggca ggcttgttta ctacatccag aatgtagaaa 3301 ctgctttctt caacatcttg gttctagcta gtaataacaa tataattctt tggcagatat 3361 tcagaataac attttaaact acattttctt agaaaattgc attcttgtag tgagcagtgt 3421 atggtctctt ttgttcagaa tttaaaactg ataaccaatg aaagcctttt ctcttattcc 3481 tctaccgtca tttacatgat aatctgaagc taatatgaca atatttaaat actaagtggt 3541 actagggaac tacaagaata ctgtaaagct taagccattg ttatcactgt catttagcat 3601 ttaataacaa aactatacag aattatgtgc ataccaatga atgttttgta ccatctagtt 3661 aaatttttta aataaagttt tatgggttaa gcag // LOCUS HUMORF11 2504 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0024 gene, complete cds. ACCESSION D14694 NID g603801 KEYWORDS KIAA0024. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2504) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (11-NOV-1992) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (sites) AUTHORS Kuge,O., Nishijima,M. and Akamatsu,Y. TITLE A Chinese hamster cDNA encoding a protein essential for phosphatidylserine synthase I activity JOURNAL J. Biol. Chem. 266 (35), 24184-24189 (1991) MEDLINE 92084729 REFERENCE 3 (bases 1 to 2504) AUTHORS Miyajima,N. JOURNAL Unpublished (1994) REFERENCE 4 (sites) AUTHORS Nomura,N., Miyajima,N., Kawarabayasi,Y. and Tabata,S. TITLE Prediction of new human genes by entire sequencing of randomly sampled cDNA clones JOURNAL Unpublished (1994) REFERENCE 5 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 JOURNAL DNA Res. 1 (1), 27-35 (1994) MEDLINE 96051387 FEATURES Location/Qualifiers source 1..2504 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR <1..102 gene 103..1524 /gene="KIAA0024" CDS 103..1524 /gene="KIAA0024" /note="Whole ORF continues from bp19 (right after 'tag') to bp1596 ('tga').; similar to chinese hamster phosphatidylserine synthase." /citation=[2] /codon_start=1 /db_xref="PID:d1004031" /db_xref="PID:g603802" /translation="MASCVGSRTLSKDDVNYKMHFRMINEQQVEDITIDFFYRPHTIT LLSFTIVSLMYFAFTRDDSVPEDNIWRGILSVIFFFLIISVLAFPNGPFTRPHPALWR MVFGLSVLYFLFLVFLLFLNFEQVKSLMYWLDPNLRYATREADVMEYAVNCHVITWER IISHFDIFAFGHFWGWAMKALLIRSYGLCWTISITWELTELFFMHLLPNFAECWWDQV ILDILLCNGGGIWLGMVVCRFLEMRTYHWASFKDIHTTTGKIKRAVLQFTPASWTYVR WFDPKSSFQRVAGVYLFMIIWQLTELNTFFLKHIFVFQASHPLSWGRILFIGGITAPT VRQYYAYLTDTQCKRVGTQCWVFGVIGFLEAIVCIKFGQDLFSKTQILYVVLWLLCVA FTTFLCLYGMIWYAEHYGHREKTYSECEDGTYSPEISWHHRKGTKGSEDSPPKHAGNN ESHSSRRRNRHSKSKVTNGVGKK" 3'UTR 1525..>2504 BASE COUNT 573 a 612 c 614 g 705 t ORIGIN 1 ctttgccgtc cggctattag cctactgtgg ctagtcaccc ccggggtccc ggccttctcg 61 ggctggggcc gccgccaccg cggcaggacg gggaggcggg ccatggcgtc ctgcgtgggg 121 agccggaccc taagcaagga tgatgtgaac tacaaaatgc atttccggat gatcaacgag 181 cagcaagtgg aggacatcac cattgacttc ttctaccggc cgcataccat caccctgctc 241 agcttcacca tcgtcagcct catgtacttc gcctttacca gggatgactc tgttccagaa 301 gacaacatct ggagaggcat cctctctgtt attttcttct ttcttatcat cagtgtgtta 361 gctttcccca atggtccgtt cactcgacct catccagcct tatggcgaat ggtttttgga 421 ctcagtgtgc tctacttcct gttcctggta ttcctactct tcctgaattt cgagcaggtt 481 aaatctctaa tgtattggct agatccaaat cttcgatacg ccacaaggga agcagatgtc 541 atggagtatg ctgtgaactg ccatgtgatc acctgggaga ggattatcag ccactttgat 601 atttttgcat ttggacattt ctggggctgg gccatgaagg ccttgctgat ccgtagttac 661 ggtctctgct ggacaatcag tattacctgg gagctgactg agctcttctt catgcatctc 721 ctccccaatt ttgccgagtg ctggtgggat caagtcattc tggacatcct gttgtgcaat 781 ggcggtggca tttggctggg catggtcgtt tgccggtttt tagagatgag gacttaccac 841 tgggcaagct tcaaggacat tcataccacc accgggaaga tcaagagagc tgttctgcag 901 ttcactcctg ctagctggac ctatgttcga tggtttgacc ccaaatcttc ttttcagaga 961 gtagctggag tgtacctttt catgatcatc tggcagctga ctgagttgaa taccttcttc 1021 ttgaagcata tctttgtgtt ccaagccagt catccattaa gttggggtag aattctcttt 1081 attggtggca tcacagctcc cacagtgaga cagtactacg cttacctcac cgacacacag 1141 tgcaagcgcg taggaacaca atgctgggtg tttggggtca ttggtttcct ggaggccatt 1201 gtttgcataa aatttggaca agatctcttc tctaagaccc aaatactcta tgttgtgctt 1261 tggcttcttt gcgtggcttt caccactttc ctctgtctgt acggcatgat ttggtatgca 1321 gaacactatg gtcaccgaga aaagacctac tcggagtgtg aagatggcac ctacagtcca 1381 gagatctcct ggcatcacag gaaagggaca aaaggttctg aagacagccc acccaagcat 1441 gcaggcaaca acgaaagcca ttcttccagg agaaggaatc ggcattccaa gtcaaaagtc 1501 accaatggcg ttggaaagaa atgaaaaacc ctggttaatc aaagatgttc cagagtgcct 1561 agaactgaga gggaaatgga actcatttgg aactccccgt gaggaggtcg aggcgcacag 1621 ggcaagcagg aagaggcgag ggcacttggg ggtcattatt tgagatcgta agtcttgttt 1681 cccacagacc tggccgcgtc aggcagatca tcgcctgggg ggcctttgcc aacgtggggt 1741 ctcttctaac ttcagcactt gacatgcggt caccggtggc agcgcggtgt gttgaaggga 1801 aacggtagct attcattcac agttgccaag agcagctccg cgcctgctgg atcgtggatg 1861 cagcgtaaac atcttccttc agacgaggca ttaaccccat ggttaatgga ctggtcacca 1921 gtttttattt tatttttatg aatctacctt tccattgatt gatttaagtt caggccactt 1981 ttctgtcttt tatttggtta ctgttgttat ttgtttttaa gttaggatgc tttttaacag 2041 cctttagaag ccgctgctga aattgatact gggggaaggg ttccccttcc ttctagagca 2101 gaaaagggag agaagtgttg tattcctgtt tggtaacctc agtctcctgt aagacctcct 2161 accacatggc gagtatacac caatcaggag agggtagctg cctgcatagg agcctcgctt 2221 ccgattattc ccttcccaat attattcatc cagacttagc cacagtgcac aaaagcaaac 2281 ctgctagaga ggcagtgaac accacagctt ctccccagct tggtgccttt tacatcgggt 2341 ttgttctcct tccatggtgt gttgctgaca ttgtcactga gtcccatgtg aggtgctggt 2401 gagtattacc tttcatctgt gccatgctct agaaccttga ccttgatagt tcaccacgtc 2461 tgatggatcc ctgttttaaa taaaaacgat tcactttaaa gcct // LOCUS HUMORF12 1860 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0025 gene, complete cds. ACCESSION D14695 NID g285960 KEYWORDS KIAA0025. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1860) AUTHORS Miyajima,N. and Nomura,N. TITLE Direct Submission JOURNAL Submitted (11-NOV-1992) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (sites) AUTHORS Nomura,N., Miyajima,N., Kawarabayasi,Y. and Tabata,S. TITLE Prediction of new human genes by entire sequencing of randomly sampled cDNA clones JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 JOURNAL DNA Res. 1 (1), 27-35 (1994) MEDLINE 96051387 FEATURES Location/Qualifiers source 1..1860 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR <1..93 gene 94..1269 /gene="KIAA0025" CDS 94..1269 /gene="KIAA0025" /codon_start=1 /db_xref="PID:d1004032" /db_xref="PID:g285961" /translation="MESETEPEPVTLLVKSPNQRHRDLELSGDRGWSVGHLKAHLSRV YPERPRPEDQRLIYSGKLLLDHQCLRDLLPKQEKRHVLHLVCNVKSPSKMPEINAKVA ESTEEPAGSNRGQYPEDSSSDGLRQREVLRNLSSPGWENISRPEAAQQAFQGLGPGFS GYTPYGWLQLSWFQQIYARQYYMQYLAATAASGAFVPPPSAQEIPVVSAPAPAPIHNQ FPAENQPANQNAAPQVVVNPGANQNLRMNAQGGPIVEEDDEINRDWLDWTYSAATFSV FLSILYFYSSLSRFLMVMGATVVMYLHHVGWFPFRPRPVQNFPNDGPPPDVVNQDPNN NLQEGTDPETEDPNHLPPDRDVLDGEQTSPSFMSTAWLVFKTFFASLLPEGPPAIAN" 3'UTR 1270..>1860 /note="the sequence of bp 1728-1860 is homologous to PIGHEP3" BASE COUNT 464 a 448 c 456 g 492 t ORIGIN 1 cgtgaacggt cgttgcagag attgcgggcg gctgagacgc cgcctgcctg gcacctagga 61 gcgcagcgga gccccgacac cgccgccgcc gccatggagt ccgagaccga acccgagccc 121 gtcacgctcc tggtgaagag ccccaaccag cgccaccgcg acttggagct gagtggcgac 181 cgcggctgga gtgtgggcca cctcaaggcc cacctgagcc gcgtctaccc cgagcgtccg 241 cgtccagagg accagaggtt aatttattct gggaagctgt tgttggatca ccaatgtctc 301 agggacttgc ttccaaagca ggaaaaacgg catgttttgc atctggtgtg caatgtgaag 361 agtccttcaa aaatgccaga aatcaacgcc aaggtggctg aatccacaga ggagcctgct 421 ggttctaatc ggggacagta tcctgaggat tcctcaagtg atggtttaag gcaaagggaa 481 gttcttcgga acctttcttc ccctggatgg gaaaacatct caaggcctga agctgcccag 541 caggcattcc aaggcctggg tcctggtttc tccggttaca caccctatgg gtggcttcag 601 ctttcctggt tccagcagat atatgcacga cagtactaca tgcaatattt agcagccact 661 gctgcatcag gggcttttgt tccaccacca agtgcacaag agatacctgt ggtctctgca 721 cctgctccag cccctattca caaccagttt ccagctgaaa accagcctgc caatcagaat 781 gctgctcctc aagtggttgt taatcctgga gccaatcaaa atttgcggat gaatgcacaa 841 ggtggcccta ttgtggaaga agatgatgaa ataaatcgag attggttgga ttggacctat 901 tcagcagcta cattttctgt ttttctcagt atcctctact tctactcctc cctgagcaga 961 ttcctcatgg tcatgggggc caccgttgtt atgtacctgc atcacgttgg gtggtttcca 1021 tttagaccga ggccggttca gaacttccca aatgatggtc ctcctcctga cgttgtaaat 1081 caggacccca acaataactt acaggaaggc actgatcctg aaactgaaga ccccaaccac 1141 ctccctccag acagggatgt actagatggc gagcagacca gcccctcctt tatgagcaca 1201 gcatggcttg tcttcaagac tttctttgcc tctcttcttc cagaaggccc cccagccatc 1261 gcaaactgat ggtgtttgtg ctgtagctgt tggaggcttt gacaggaatg gactggatca 1321 cctgactcca gctagattgc ctctcctgga catggcaatg atgagttttt aaaaaacagt 1381 gtggatgatg atatgctttt gtgagcaagc aaaagcagaa acgtgaagcc gtgatacaaa 1441 ttggtgaaca aaaaatgccc aaggcttctc atgtgtttat tctgaagagc tttaatatat 1501 actctatgta gtttaataag cactgtacgt agaaggcctt aggtgttgca tgtctatgct 1561 tgaggaactt ttccaaatgt gtgtgtctgc atgtgtgttt gtacatagaa gtcatagatg 1621 cagaagtggt tctgctggta agatttgatt cctgttggaa tgtttaaatt acactaagtg 1681 tactacttta tataatcaat gaaattgcta gacatgtttt agcaggactt ttctaggaaa 1741 gacttatgta taattgcttt ttaaaatgca gtgctttact ttaaactaag gggaactttg 1801 cggaggtgaa aacctttgct gggttttctg ttcaataaag ttttactatg aatgaccctg // LOCUS HUMORF13 1402 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0108 gene, complete cds. ACCESSION D14696 NID g285962 KEYWORDS KIAA0108. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1402) AUTHORS Miyajima,N. and Nomura,N. TITLE Direct Submission JOURNAL Submitted (11-NOV-1992) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (sites) AUTHORS Nomura,N., Miyajima,N., Kawarabayasi,Y. and Tabata,S. TITLE Prediction of new human genes by entire sequencing of randomly sampled cDNA clones JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nagase,T., Miyajima,N., Tanaka,A., Sazuka,T., Seki,N., Sato,S., Tabata,S., Ishikawa,K.-i., Kawarabayasi,Y., Kotani,H. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. III. The coding sequences of 40 new genes (KIAA0081-KIAA0120) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (1), 37-43 (1995) MEDLINE 95308325 FEATURES Location/Qualifiers source 1..1402 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR <1..146 gene 147..848 /gene="KIAA0108" CDS 147..848 /gene="KIAA0108" /codon_start=1 /db_xref="PID:d1004033" /db_xref="PID:g285963" /translation="MVSMSFKRNRSDRFYSTRCCGCCHVRTGTIILGTWYMVVNLLMA ILLTVEVTHPNSMPAVNIQYEVIGNYYSSERMADNACVLFAVSVLMFIISSMLVYGAI SYQVGWLIPFFCYRLFDFVLSCLVAISSLTYLPRIKEYLDQLPDFPYKDDLLALDSSC LLFIVLVFFALFIIFKAYLINCVWNCYKYINNRNVPEIAVYPAFEAPPQYVLPTYEMA VKMPEKEPPPPYLPA" 3'UTR 849..>1402 /note="the sequence of bp 1262-1401 is identical to HepG2 mRNA (HUM0S12E01)" BASE COUNT 336 a 278 c 313 g 475 t ORIGIN 1 cgaagaagct ggcaggggca cgagccgggg gcgggtttga agacgcgtcg ttgggttttg 61 gaggccgtga aacagccgtt tgagtttggc tgcgggtgga gaacgtttgt caggggcccg 121 gccaagaagg aggcccgcct gttacgatgg tgtccatgag tttcaagcgg aaccgcagtg 181 accggttcta cagcacccgg tgctgcggct gttgccatgt ccgcaccggg acgatcatcc 241 tggggacctg gtacatggta gtaaacctat tgatggcaat tttgctgact gtggaagtga 301 ctcatccaaa ctccatgcca gctgtcaaca ttcagtatga agtcatcggt aattactatt 361 cgtctgagag aatggctgat aatgcctgtg ttctttttgc cgtctctgtt cttatgttta 421 taatcagttc aatgctggtt tatggagcaa tttcttatca agtgggttgg ctgattccat 481 tcttctgtta ccgacttttt gacttcgtcc tcagttgcct ggttgctatt agttctctca 541 cctatttgcc aagaatcaaa gaatatctgg atcaactacc tgattttccc tacaaagatg 601 acctcctggc cttggactcc agctgcctcc tgttcattgt tcttgtgttc tttgccttat 661 tcatcatttt taaggcttat ctaattaact gtgtttggaa ctgctataaa tacatcaaca 721 accgaaacgt gccggagatt gctgtgtacc ctgcctttga agcacctcct cagtacgttt 781 tgccaaccta tgaaatggcc gtgaaaatgc ctgaaaaaga accaccacct ccttacttac 841 ctgcctgaag aaattctgcc tttgacaata aatcctatac cagctttttg tttgtttatg 901 ttacagaatg ctgcaattca gggctcttca aacttgtttg atataaaata tgttgtcttt 961 tgtttaagca tttattttca aacactaagg agctttttga catctgttaa acgtcttttt 1021 gtttttttgt taagtctttt acattttaat agtttttgaa gacaatctag gttaagcaag 1081 agcaaagtgc cattgtttgc ctttaattgg ggggtgggaa gggaaagagg gtacttgcca 1141 catagtttcc tttttaactg cactttcttt atataatcgt ttgcattttg ttacttgcta 1201 ccctgagtac tttcaggaag actgacttaa atattcgggg tgagtaagta gttgggtata 1261 agatctgaac ttttcatctg cagaggcaag aaaaatattt gacattgtga cttgactgtg 1321 gaagatgatg gttgcatgtt tctagtttgt atatgtttcc atctttgtga taagatgatt 1381 taataaatct ctttaaatac tt // LOCUS HUMORF15 1233 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0110 gene, complete cds. ACCESSION D14811 NID g285966 KEYWORDS KIAA0110. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1233) AUTHORS Miyajima,N. and Nomura,N. TITLE Direct Submission JOURNAL Submitted (11-NOV-1992) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (sites) AUTHORS Nomura,N., Miyajima,N., Kawarabayasi,Y. and Tabata,S. TITLE Prediction of new human genes by entire sequencing of randomly sampled cDNA clones JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nagase,T., Miyajima,N., Tanaka,A., Sazuka,T., Seki,N., Sato,S., Tabata,S., Ishikawa,K.-i., Kawarabayasi,Y., Kotani,H. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. III. The coding sequences of 40 new genes (KIAA0081-KIAA0120) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (1), 37-43 (1995) MEDLINE 95308325 FEATURES Location/Qualifiers source 1..1233 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /clone="HA0666" /sex="male" 5'UTR <1..3 gene 4..828 /gene="KIAA0110" CDS 4..828 /gene="KIAA0110" /codon_start=1 /db_xref="PID:d1004063" /db_xref="PID:g285967" /translation="MAAPEAEVLSSAAVPDLEWYEKSEETHASQIELLETSSTQEPLN ASEAFCPRDCMVPVVFPGPVSQEGCCQFTCELLKHIMYQRQQLPLPYEQLKHFYRKPS PQAEEMLKKKPRATTEVSSRKCQQALAELESVLSHLEDFFARTLVPRVLILLGGNALS PKEFYELDLSLLAPYSVDQSLSTAACLRRLFRAIFMADAFSELQAPPLMGTVVMAQGH RNCGEDWFRPKLNYRVPSRGHKLTVTLSCGRPSIRTTAWEDYIWFQAPVTFKGFRE" 3'UTR 829..>1233 BASE COUNT 293 a 316 c 311 g 313 t ORIGIN 1 gtgatggcgg cgccggaggc ggaggttctg tcctcagccg cagtccctga tttggagtgg 61 tatgagaagt ccgaagaaac tcacgcctcc cagatagaac tacttgagac aagctctacg 121 caggaacctc tcaacgcttc ggaggccttt tgcccaagag actgcatggt accagtggtg 181 tttcctgggc ctgtgagcca ggaaggctgc tgtcagttta cttgtgaact tctaaagcat 241 atcatgtatc aacgccagca gctccctctg ccctatgaac agcttaagca cttttaccga 301 aaaccttctc cccaggcaga ggagatgctg aagaagaaac ctcgggccac cactgaggtg 361 agcagcagga aatgccaaca agccctggca gaactggaga gtgtcctcag ccacctggag 421 gacttctttg cacggacact agtaccgcga gtgctgattc tccttggggg caatgcccta 481 agccccaagg agttctatga actcgacttg tctctgctgg ccccctacag cgtggaccag 541 agcctgagca cagcagcttg tttgcgccgt ctcttccgag ccatattcat ggctgatgcc 601 tttagcgagc ttcaggctcc tccactcatg ggcaccgtcg tcatggcaca gggacaccgc 661 aactgtggag aagattggtt tcgacccaag ctcaactatc gagtgcccag ccggggccat 721 aaactgactg tgaccctgtc atgtggcaga ccttccatcc gaaccacggc ttgggaagac 781 tacatttggt tccaggcacc agtgacattt aaaggcttcc gcgagtgaat gagtgcttct 841 taatcctaaa aacacaatgg ctgaattatc tttctccatg tggcgctgaa tcacccatct 901 ggtttggagc tagagttgct tcctggtgag agaggaagca actctccttc tggttgtctg 961 cctcccctca gatttcctga taggctgatg gcatgtggct gtgactgtga ctgtaatcat 1021 tgctgaacaa catctctttg aatcaaaggt tgattttccc agagggtgct gggtcaggca 1081 tttctattag gagttggaaa gcaaaaatgg gtccatagac actctatgga ggtgtccctt 1141 tctgctcttt gctgtgtcct ttcagaattt ttaccaggaa cataatgtgg atgtgactta 1201 tgaacttaaa tataaaataa atagattctt att // LOCUS HUMORF16 1826 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0026 gene, complete cds. ACCESSION D14812 NID g285968 KEYWORDS KIAA0026. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1826) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (11-NOV-1992) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 1826) AUTHORS Nomura,N., Miyajima,N., Kawarabayasi,Y. and Tabata,S. TITLE Prediction of new human genes by entire sequencing of randomly sampled cDNA clones JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 JOURNAL DNA Res. 1 (1), 27-35 (1994) MEDLINE 96051387 REFERENCE 4 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 (supplement) JOURNAL DNA Res. 1 (1), 47-56 (1994) MEDLINE 96051389 FEATURES Location/Qualifiers source 1..1826 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="Myeloblast" /sex="male" 5'UTR <1..305 gene 306..1172 /gene="KIAA0026" CDS 306..1172 /gene="KIAA0026" /codon_start=1 /db_xref="PID:d1004064" /db_xref="PID:g285969" /translation="MSSRKQGSQPRGQQSAEEENFKKPTRSNMQRSKMRGASSGKKTA GPQQKNLEPALPGRWGGRSAENPPSGSVRKTRKNKQKTPGNGDGGSTSEAPQPPRKKR ARADPTVESEEAFKNRMEVKVKIPEELKPWLVEDWDLVTRQKQLFQLPAKKNVDAILE EYANCKKSQGNVDNKEYAVNEVVAGIKEYFNVMLGTQLLYKFERPQYAEILLAHPDAP MSQVYGAPHLLRLFVRIGAMLAYTPLDEKSLALLLGYLHDFLKYLAKNSASLFTASDY KVASAEYHRKAL" 3'UTR 1173..1826 BASE COUNT 537 a 352 c 402 g 535 t ORIGIN 1 cacgtcggct gctgggaaga tctggattct cgtttcaggt caccatcaga aaagctaagt 61 ttgctgtata gtgaggatca ggagatctga tcctgattgc agaaccttcc ctgattacag 121 aatcttggat gatttcacaa aagttcatct tcattgcaga tacctgcctt tctttctagg 181 ttgtatctcc cacttcaccc ttctagacca tcccagaaga tctataagat ttcatctggg 241 aaatcactag gagttcttgg aagggaaaga aggaagattg ttggttggaa taaaaacagg 301 gttgaatgag ttccagaaag cagggttctc aacctcgtgg acagcaatct gcagaagaag 361 agaacttcaa aaaaccaact agaagcaaca tgcagagaag taaaatgaga ggggcctcct 421 caggaaagaa gacagctggt ccacagcaga aaaatcttga accagctctc ccaggaagat 481 ggggtggtcg ctctgcagag aacccccctt caggatccgt gaggaagacc agaaagaaca 541 agcagaagac tcctggaaac ggagatggtg gcagtaccag cgaagcacct cagccccctc 601 ggaagaaaag ggcccgggca gaccccactg ttgaaagtga ggaggcgttt aagaatagaa 661 tggaggttaa agtgaagatt cctgaagaat taaaaccatg gcttgttgag gactgggact 721 tagttaccag gcagaagcag ctgtttcaac tccctgccaa gaaaaatgta gatgcaattc 781 tggaggagta tgcaaattgc aagaaatcgc agggaaatgt tgataataag gaatatgcgg 841 ttaatgaagt tgtggcagga ataaaagaat atttcaatgt gatgttgggc actcagctgc 901 tctacaaatt tgagaggccc cagtatgctg aaatcctctt ggctcaccct gatgctccaa 961 tgtcccaggt ttatggagca ccacacctac tgagattatt tgtaagaatt ggagcaatgt 1021 tggcctatac gccccttgat gagaaaagcc ttgcattatt gttgggctat ttgcatgatt 1081 tcctaaaata tctggcaaag aattctgcat ctctctttac tgccagtgat tacaaagtgg 1141 cttctgctga gtaccaccgc aaagccctgt gagcgtctac agacagctca ccatttttgt 1201 cctgtatctg taaacacttt ttgttcttag tctttttctt gtaaaattga tgttctttaa 1261 aatcgttaat gtataacagg gcttatgttt cagtttgttt tccgttctgt tttaaacaga 1321 aaataaaagg agtgtaagct ccttttctca tttcaaagtt gctaccagtg tatgcagtaa 1381 ttagaacaaa gaagaaacat tcagtagaac attttattgc ctagttgaca acattgcttg 1441 aatgctggtg gttcctatcc ctttgacact acacaatttt ctaatatgtg ttaatgctat 1501 gtgacaaaac gccctgattc ctagtgccaa aggttcaact taatgtatat acctgaaaac 1561 ccatgcattt gtgctctttt tttttttatg gtgcttgaag taaaacagcc catcctctgc 1621 aagtccatct atgttgttct taggcattct atctttgctc aaattgttga aggatggtga 1681 tttgtttcat ggtttttgta tttgagtcta atgcacgttc taacatgata gaggcaatgc 1741 attattgtgt agccacggtt ttctggaaaa gttgatattt taggaattgt atttcagatc 1801 ttaaataaaa tttgtttcta aatttc // LOCUS HUMORFA03 2739 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0053 gene, complete cds. ACCESSION D29642 NID g473934 KEYWORDS KIAA0053. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2739) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (22-MAR-1994) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 2739) AUTHORS Miyajima,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nomura,N., Nagase,T., Miyajima,N., Sazuka,T., Tanaka,A., Sato,S., Seki,N., Kawarabayasi,Y., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. II. The coding sequences of 40 new genes (KIAA0041-KIAA0080) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 1 (5), 223-229 (1994) MEDLINE 96051398 FEATURES Location/Qualifiers source 1..2739 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..193 gene 194..2110 /gene="KIAA0053" CDS 194..2110 /gene="KIAA0053" /citation=[3] /codon_start=1 /db_xref="PID:d1006679" /db_xref="PID:g473935" /translation="MSLGQSACLFLSIARSRSVMTGEQMAAFHPSSTPNPLERPIKMG WLKKQRSIVKNWQQRYFVLRAQQLYYYKDEEDTKPQGCMYLPGCTIKEIATNPEEAGK FVFEIIPASWDQNRMGQDSYVLMASSQAEMEEWVKFLRRVAGTPCGVFGQRLDETVAY EQKFGPHLVPILVEKCAEFILEHGRNEEGIFRLPGQDNLVKQLRDAFDAGERPSFDRD TDVHTVASLLKLYLRDLPEPVVPWSQYEGFLLCGQLTNADEAKAQQELMKQLSILPRD NYSLLSYICRFLHEIQLNCAVNKMSVDNLATVIGVNLIRSKVEDPAVIMRGTPQIQRV MTMMIRDHEVLFPKSKDIPLSPPAQKNDPKKAPVARSSVGWDATEDLRISRTDSFSSM TSDSDTTSPTGQQPSDAFPEDSSKVPREKPGDWKMQSRKRTQTLPNRKCFLTSAFQGA NSSKMEIFKNEFWSPSSEAKAGEGHRRTMSQDLRQLSDSQRTSTYDNVPSLPGSPGEE ASALSSQACDSKGDTLASPNSETGPGKKNSGEEEIDSLQRMVQELRKEIETQKQMYEE QIKNLEKENYDVWAKVVRLNEELEKEKKKSAALEISLRNMERSREDVEKRNKALEEEV KEFVKSMKEPKTEA" 3'UTR 2111..2739 BASE COUNT 727 a 700 c 721 g 591 t ORIGIN 1 cgactgcagc ctgggtttta ttcttggcct ggccctgacc gggagctggc ccctcggctg 61 cttctctggc tcggggggga ctttctctgg ctcagatccg gacccctgaa ctggacctgg 121 ttgtcgtccc cccgcttctc agccccctct ggggttcctc tctcctctcc gcccactctt 181 tgctcactgc cccatgtccc tcggtcagtc ggcctgtctg ttcctctcta tagctcggtc 241 aaggagtgtg atgactggcg agcagatggc tgccttccat ccatcgtcca cccccaaccc 301 gctggagagg cccatcaaga tgggctggct gaagaagcag aggtccatcg tgaagaactg 361 gcagcagagg tactttgtgc tgagggcgca gcagctctac tactacaagg atgaagagga 421 cacgaagccc cagggctgca tgtatctacc aggatgtaca atcaaggaga tcgccacaaa 481 cccagaagaa gctgggaagt ttgtctttga aatcattcca gcctcatggg accagaatcg 541 catgggacag gactcctatg tcctcatggc cagctctcag gcggagatgg aggagtgggt 601 taaattcctc aggagagttg ctggcacacc ctgtggagtg tttggccagc gcttggatga 661 gactgtggcc tatgaacaga aattcggccc ccatctggtg cccatcctgg tggagaaatg 721 tgcagagttc atcctggagc acggccggaa tgaagagggc atcttccgtc tgcctgggca 781 ggacaacctg gtgaagcagc tgagagacgc ttttgatgct ggggagcggc cctcctttga 841 cagagacaca gatgtgcaca ctgtggcttc cctgttaaag ctctacctcc gagacctccc 901 agagcccgtg gttccctgga gccagtacga agggttcctg ctctgtgggc agctcacgaa 961 tgcggatgag gcaaaggctc agcaggagtt gatgaagcag ctctccatcc ttcctcgtga 1021 caactatagt ctcctgagct acatctgcag gttcctacat gaaatacagc tgaactgtgc 1081 tgttaacaag atgagtgtgg acaacctggc tactgtgatt ggtgtgaatc tcatcaggtc 1141 gaaggtcgaa gaccctgccg tgatcatgag agggactcct cagatccaaa gagtgatgac 1201 tatgatgatc agagaccatg aagtcctctt ccccaagtcc aaggatatac ccctgtcacc 1261 ccctgcccag aaaaatgacc ccaagaaagc tccagtggcc cgaagctctg taggctggga 1321 tgccactgaa gacctccgaa tttctaggac agacagcttc agtagcatga caagcgactc 1381 tgatacaacc agccccaccg gacagcagcc gagcgatgcg tttccggagg acagcagcaa 1441 agtacccagg gaaaagccag gagactggaa aatgcaatct cgtaaaagga ctcaaacact 1501 ccctaaccgg aaatgtttct tgacatcagc ttttcagggt gccaacagca gcaaaatgga 1561 gatctttaaa aatgaattct ggtcgccttc ctcagaggct aaggcagggg aagggcacag 1621 gagaacgatg tctcaagact tgcgccaact ttctgactcc caacggactt ccacctacga 1681 taacgtccct tccctgccag ggtcccctgg ggaggaagcc agtgcactct cttcccaagc 1741 ctgtgactcc aagggagata ctcttgccag tccaaactct gaaactgggc ctggaaaaaa 1801 gaactctgga gaagaggaaa ttgattcttt gcagaggatg gtccaagagc tacgaaagga 1861 aatagaaaca cagaagcaaa tgtatgagga acagattaaa aaccttgaga aggaaaatta 1921 tgacgtttgg gctaaagtgg tgaggctcaa tgaagaactg gagaaggaaa agaagaagtc 1981 tgcagcccta gagatcagcc tccgcaacat ggagcgctcc cgggaggatg ttgagaagag 2041 gaacaaggcc ttggaagaag aagtcaagga atttgtcaaa tccatgaagg aacccaagac 2101 cgaggcttaa gggtcccagg agtactgcag ggacagcccc agagaggccc aactctggcc 2161 cctttctcag tgctatctga tgacggggaa acaaaattat tctctgagag ggaaaggaca 2221 tttgagggaa acatcaaatt tccccataaa taaatgaatg gagtttgcag gaaggtgagg 2281 gtgagcagag atgtgtgtgg acatctctga ccatccatcg ctgtattcaa atggattgtt 2341 ctattccatt ctggtctcag gcatgaccac gtccagtgaa gacatttgag gcagcacatc 2401 tcaggaccca ggcaatagac tggccccaac tcaggctgga ctaaggtgtg attaattctt 2461 tgttttttgt gtggaacagc tcaccttgtc agacagcctc agggcatctc tgagacacag 2521 gggcagaaaa tgacattcat cttttgagtc ctcatccatg gagtgctgtg tttggggggc 2581 tgcatctgct gaagcgagaa ccccattctg ccaccccacc aggatgccca ttctccagga 2641 cttctccaac ttactattag actaaaccag aacaagcaac aaactgtatt tatgcaagca 2701 aaattgatga gaaaattata ttcaaataaa gcaaaattt // LOCUS HUMORFA05 6274 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0054 gene, complete cds. ACCESSION D29677 D29959 NID g473938 KEYWORDS KIAA0054. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6274) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (22-MAR-1994) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 6274) AUTHORS Miyajima,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nomura,N., Nagase,T., Miyajima,N., Sazuka,T., Tanaka,A., Sato,S., Seki,N., Kawarabayasi,Y., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. II. The coding sequences of 40 new genes (KIAA0041-KIAA0080) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 1 (5), 223-229 (1994) MEDLINE 96051398 FEATURES Location/Qualifiers source 1..6274 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..145 gene 146..5974 /gene="KIAA0054" CDS 146..5974 /gene="KIAA0054" /citation=[3] /codon_start=1 /db_xref="PID:d1006709" /db_xref="PID:g473951" /translation="MEDRRAEKSCEQACESLKRQDYEMALKHCTEALLSLGQYSMADF TGPCPLEIERITIESLLYRIASFLQLKNYVQADEDCRHVLGEGLAKGEDAFRAVLCCM QLKGKLQPVSTILAKSLTGESLNGMVTKDLTRLKTLLSETETATSNALSGYHVEDLDE GSCNGWHFRPPPRGITSSEEYTLCKRFLEQGICRYGAQCTSAHSQEELAEWQKRYASR LIKLKQQNENKQLSGSYMETLIEKWMNSLSPEKVLSECIEGVKVEHNPDLSVTVSTKK SHQTWTFALTCKPARMLYRVALLYDAHRPHFSIIAISAGDSTTQVSQEVPENCQEWIG GKMAQNGLDHYVYKVGIAFNTEIFGTFRQTIVFDFGLEPVLMQRVMIDAASTEDLEYL MHAKQQLVTTAKRWDSSSKTIIDFEPNETTDLEKSLLIRYQIPLSADQLFTQSVLDKS LTKSNYQSRLHDLLYIEEIAQYKEISKFNLKVQLQILASFMLTGVSGGAKYAQNGQLF GRFKLTETLSEDTLAGRLVMTKVNAVYLLPVPKQKLVQTQGTKEKVYEATIEEKTKEY IFLRLSRECCEELNLRPDCDTQVELQFQLNRLPLCEMHYALDRIKDNGVLFPDISMTP TIPWSPNRQWDEQLDPRLNAKQKEAVLAITTPLAIQLPPVLIIGPYGTGKTFTLAQAV KHILQQQETRILICTHSNSAADLYIKDYLHPYVEAGNPQARPLRVYFRNRWVKTVHPV VHQYCLISSAHSTFQMPQKEDILKHRVVVVTLNTSQYLCQLDLEPGFFTHILLDEAAQ AMECETIMPLALATQNTRIVLAGDHMQLSPFVYSEFARERNLHVSLLDRLYEHYPAEF PCRILLCENYRSHEAIINYTSELFYEGKLMASGKQPAHKDFYPLTFFTARGEDVQEKN STAFYNNAEVFEVVERVEELRRKWPVAWGKLDDGSIGVVTPYADQVFRIRAELRKKRL SDVNVERVLNVQGKQFRVLFLSTVRTRHTCKHKQTPIKKKEQLLEDSTEDLDYGFLSN YKLLNTAITRAQSLVAVVGDPIALCSIGRCRKFWERFIALCHENSSLHGITFEQIKAQ LEALELKKTYVLNPLAPEFIPRALRLQHSGSTNKQQQSPPKGKSLHHTQNDHFQNDGI VQPNPSVLIGNPIRAYTPPPPLGPHPNLGKSPSPVQRIDPHTGTSILYVPAVYGGNVV MSVPLPVPWTGYQGRFAVDPRIITHQAAMAYNMNLLQTHGRGSPIPYGLGHHPPVTIG QPQNQHQEKDQHEQNRNGKSDTNNSGPEINKIRTPEKKPTEPKQVDLESNPQNRSPES RPSVVYPSTKFPRKDNLNPRHINLPLPAPHAQYAIPNRHFHPLPQLPRPPFPIPQQHT LLNQQQNNLPEQPNQIPPQPNQVVQQQSQLNQQPQQPPPQLSPAYQAGPNNAFFNSAV AHRPQSPPAEAVIPEQQPPPMLQEGHSPLRAIAQPGPILPSHLNSFIDENPSGLPIGE ALDRIHGSVALETLRQQQARFQQWSEHHAFLSQGSVPYPHHHHPHLQHLPQPPLGLHQ PPVRADWKLTSSAEDEVETTYSRFQDLIRELSHRDQSETRELAEMPPPQSRLLQYRQV QSRSPPAVPSPPSSTDHSSHFSNFNDNSRDIEVASNPAFPQRLPPQIFNSPFSLPSEH LAPPPLKYLAPDGAWTFANLQQNHLMGPGFPYGLPPLPHRPPQNPFVQIQNHQHAIGQ EPFHPLSSRTVSSSSLPSLEEYEPRGPGRPLYQRRISSSSVQPCSEEVSTPQDSLAQC KELQDHSNQSSFNFSSPESWVNTTSSTPYQNIPCNGSSRTAQPRELIAPPKTVKPPED QLKSENLEVSSSFNYSVLQHLGQFPPLMPNKQIAESANSSSPQSSAGGKPAMSYASAL RAPPKPRPPPEQAKKSSDPLSLFQELSLGSSSGSNGFYSYFK" 3'UTR 5975..6274 BASE COUNT 1864 a 1497 c 1300 g 1613 t ORIGIN 1 gaaagtaatg acaggatttg gatgaagtaa tggatcacta atgaactaca gtgctggtga 61 gatgtgtaag aaattataaa gagctctgat gggctatttg ggtgataccc agtgcagtga 121 actgcaggat ttttgtccct gagtcatgga agacagaaga gctgaaaagt catgtgaaca 181 agcatgtgaa tcacttaaga ggcaggacta tgaaatggcc ctcaagcact gcacagaggc 241 ccttctttct cttggccagt actccatggc agacttcaca gggccttgtc cattggaaat 301 agaacgcatc acaatcgaga gtcttctcta cagaattgcc tcatttttgc aactgaaaaa 361 ttatgtgcaa gctgatgaag attgtagaca tgtgctggga gaaggactgg ccaagggaga 421 agatgccttt cgggcagtgc tttgctgcat gcagctgaaa gggaagctcc aacctgtatc 481 caccattctt gccaagtcac tcacaggaga gtccctgaat gggatggtaa caaaggattt 541 gacaagacta aaaacacttc tctcagaaac agagacagca actagtaacg ccctctctgg 601 atatcacgtg gaagacttag atgaggggtc ttgtaatggt tggcatttcc gcccaccacc 661 taggggaatc acaagcagcg aggaatatac tttgtgtaaa agatttttag aacaaggaat 721 ctgtaggtat ggtgcccagt gtacttcagc acattcccag gaagaactag cagaatggca 781 gaaaagatat gcttcacggc tgataaaatt gaaacagcaa aatgagaata aacagctctc 841 aggcagttac atggaaacct tgatagaaaa gtggatgaat tcattgtctc ctgagaaagt 901 gcttagtgaa tgtatagaag gagtaaaggt agagcacaat cctgacctgt cagttactgt 961 cagcaccaaa aaatcccacc agacatggac ctttgctctc acttgtaagc ctgcaagaat 1021 gctgtatcgt gtagcattgc tttatgatgc tcatcgtcct cattttagta tcattgcaat 1081 atctgccgga gatagtacta cccaggtatc acaagaagtc ccagaaaact gtcaagaatg 1141 gataggagga aagatggccc aaaatggatt agatcattac gtgtataaag tcgggatagc 1201 atttaacaca gaaatatttg gaacttttcg ccaaaccata gttttcgact ttggattgga 1261 accagtactc atgcaaagag taatgattga tgcagcttct acagaagatc tcgaatacct 1321 gatgcatgca aaacagcagc tagtaaccac agctaaacgt tgggattctt cctctaagac 1381 tattatagat tttgaaccta atgaaactac tgatttggag aagagccttc ttatcagata 1441 ccaaattccc ctctctgctg accagctatt tactcagtcc gttttagaca aatcattgac 1501 caagagcaac tatcagtcac ggttacatga ccttctttat attgaggaga tagcccagta 1561 taaagaaatc agcaagttca accttaaagt gcaattgcag attctggcaa gcttcatgct 1621 cactggtgtt tctggaggtg caaagtatgc tcagaatgga caactttttg gtcgctttaa 1681 gcttactgaa acactttctg aagatacttt ggctggacga ctggtgatga ccaaagtcaa 1741 tgctgtttat ttattaccag tccctaaaca gaagttagta cagacccagg gaaccaaaga 1801 gaaggtttat gaagctacta ttgaagaaaa aacaaaggaa tatatatttt taaggctatc 1861 tagggaatgc tgtgaagaac ttaatcttcg gcctgactgt gacacacagg ttgaacttca 1921 gtttcaatta aatcgattac ccctctgtga aatgcactat gcactagaca ggatcaagga 1981 caatggggtt ttgtttccag acatcagtat gactcccacc ataccatgga gtcctaacag 2041 acaatgggat gaacagttgg atcctcgact aaatgcaaaa cagaaagagg ctgttctggc 2101 cattaccact ccacttgcaa tccagctgcc gcctgtgctt atcatcggac cctatgggac 2161 aggcaaaacg ttcactctag ctcaggctgt caaacatatt ctgcagcaac aggagactag 2221 gattctcatt tgcacccatt ctaatagtgc tgctgatctc tacataaagg attatttaca 2281 tccatatgta gaagcaggca atccccaggc aagacctctc agggtatatt tcagaaatcg 2341 ctgggtaaag actgtccacc cagttgtgca tcagtactgt ttgatctcaa gcgcacattc 2401 cacctttcag atgccccaga aagaagatat tcttaaacat cgagtggtgg ttgttacctt 2461 gaatacttcc cagtacctct gtcagttgga ccttgaacct gggtttttta cacacattct 2521 attagatgaa gctgcccagg ccatggagtg tgaaaccatt atgcctctag cattagcaac 2581 tcaaaacact cggattgtct tggctggtga tcacatgcag ctcagtcctt ttgtttacag 2641 cgagtttgcc agggagagaa accttcacgt ttcattactt gaccgactct atgagcatta 2701 ccctgctgag ttcccatgta ggattctcct gtgtgagaac taccgctccc atgaagctat 2761 catcaattat acctctgagc ttttctatga gggcaaactg atggccagtg ggaagcagcc 2821 agcacacaaa gatttctacc cactaacttt ctttacagca cgaggagaag atgtacaaga 2881 aaaaaatagc acagcttttt ataataatgc agaggtgttt gaagtggtgg aacgtgtaga 2941 agagttaaga aggaagtggc cagtagcgtg ggggaagtta gatgatggca gtattggtgt 3001 ggtgactcca tatgctgatc aagtgtttag aatacgtgct gaacttcgaa aaaagagatt 3061 atctgatgtt aatgtagaaa gggtgctaaa tgttcaagga aagcaattca gagttttgtt 3121 tcttagcaca gtacgtacaa gacatacttg taaacataaa cagacaccaa ttaaaaagaa 3181 agagcaactt ctggaagatt ccacagagga cttagattat ggttttttat ctaactacaa 3241 gcttctcaat actgccatca caagagcaca atccctggtt gctgtggtgg gtgatcccat 3301 tgctctgtgc tctattggaa gatgcaggaa attttgggaa cggtttattg ccctgtgtca 3361 tgaaaacagt agcctacatg gaatcacttt tgaacagatc aaagcccagt tagaggcttt 3421 agaactaaag aagacatatg tgttgaatcc gctggcacct gaatttatcc cccgggctct 3481 aagactgcag cattcaggaa gtaccaacaa acagcagcaa tcaccaccca aggggaaaag 3541 tcttcatcat acccagaatg atcacttcca gaatgatgga attgttcagc ccaatccttc 3601 tgtacttatt ggcaatccta ttagagcata tactcctcca ccccctcttg gacctcaccc 3661 aaatttggga aaatctccaa gccctgttca aagaatagat cctcacactg ggacaagtat 3721 tctttatgta cctgctgtct atggagggaa tgtagttatg tcggtgcctt tacctgtacc 3781 atggacagga taccagggta ggtttgcagt tgatcctcga attattacac atcaggcagc 3841 aatggcctat aacatgaacc tattacagac acatggacga ggatctccta ttccttatgg 3901 ccttggacat cacccacctg tcaccatagg ccagccacaa aatcagcatc aggagaagga 3961 tcaacatgag caaaatcgaa atggtaaaag tgatacaaat aattccggac ctgaaattaa 4021 taagattcga acaccagaga aaaagccaac agaaccaaaa caggttgatt tggaatcaaa 4081 tccacagaac agaagtcctg aatcacgtcc tagtgttgtt tatcccagta ccaaatttcc 4141 tcgcaaagat aatctcaacc caagacacat aaatcttccc cttcctgctc cccacgcaca 4201 gtatgcaatc cctaatcgcc actttcatcc ccttccccag ctaccaagac caccctttcc 4261 aattccacag cagcacacct tgttaaatca gcagcagaat aatttgcctg aacaaccaaa 4321 tcagatacca cctcagccaa atcaggtagt ccagcagcaa agtcagttga atcagcagcc 4381 tcagcagcca cctcctcagc tttctcctgc atatcaggcg ggacccaaca atgctttttt 4441 taatagtgca gttgctcatc ggccacagtc tcctcctgca gaagctgtaa ttccggagca 4501 gcagccccct cccatgctgc aagaaggcca cagtcctctg agagccattg cacaacccgg 4561 ccccattctt ccttcacatc tgaatagctt cattgatgag aacccctcgg gattacctat 4621 aggggaggct ttagatcgta tacatgggag tgtcgctctg gaaacattaa ggcagcagca 4681 ggcacggttc cagcagtgga gcgagcatca tgcctttctc agtcagggca gcgttccata 4741 cccacaccat caccatcctc acctccagca tcttcctcag ccgcccctgg gattacatca 4801 gccgccagtg agggcagact ggaagctcac cagcagtgcc gaagatgaag tggagaccac 4861 atactcaagg tttcaagact taatcagaga actgtctcat cgtgatcaaa gtgaaacacg 4921 ggaactagct gaaatgccac cacctcaatc aagacttttg caatatagac aagtacagag 4981 tagaagccca ccagcagtcc catctccccc atccagtaca gaccacagta gccacttttc 5041 taactttaat gataacagca gagacattga agtagccagc aacccagcat ttccacagcg 5101 cctcccaccc cagatattca actcaccttt ctcgttgcca tctgaacacc ttgcccctcc 5161 tcccttgaaa tacctggcac ctgatggagc atggactttt gctaacttgc aacagaatca 5221 cctaatgggg ccaggttttc cctatggcct acctccattg cctcacaggc caccgcagaa 5281 cccttttgta caaatacaga atcatcaaca tgctattggt caagagccat ttcacccatt 5341 gtcatctcga acagtatctt cttcttcgct ccctagctta gaagagtatg agcccagagg 5401 acctggtcgg cccttgtacc aaagaagaat ctcatctagc tcagttcaac cttgttctga 5461 agaagtaagc actcctcaag acagtctggc tcagtgtaaa gagcttcagg accacagtaa 5521 ccaatcttct ttcaactttt catccccgga gtcctgggta aacaccacct catctactcc 5581 ttatcagaac attccgtgca atggatccag caggacagct cagcccagag agttgatagc 5641 gccacccaag actgtcaaac cccctgagga tcaactgaag tcggagaacc tcgaggtgtc 5701 cagttccttc aactacagtg tgctgcagca tcttggccag tttccacccc ttatgcctaa 5761 caagcagatc gcggagtcgg ccaatagcag tagcccccag agctctgcgg ggggcaagcc 5821 cgccatgtcc tatgccagcg ctctgcgggc ccctccaaag cccaggcccc ctcctgagca 5881 ggccaagaag agtagcgacc ctctgtctct cttccaggaa ctgagcctag ggagctcatc 5941 tggcagcaat ggcttttact catattttaa ataatcactt ttttttccct caagggagaa 6001 tgttttaatt tctgtttgta tcagtagaat taaggtagtt ggacttcatc tatagatgca 6061 cagttccctt tgttttaata ttaaatatgt tctcacttaa ttgctttgct gctagacttg 6121 caactaattt ttttaaagta tattccatta ttttgcattt ttgatgtgtc aaaactttga 6181 cagcttttat gtagaataaa aaaattttta aatttgtgta ttgttacata tgtttgcatc 6241 aagctagcag ccaagaggtt aattgtgcaa ctat // LOCUS HUMORFA08 4359 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0055 gene, complete cds. ACCESSION D29956 NID g473944 KEYWORDS KIAA0055. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4359) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (13-APR-1994) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 4359) AUTHORS Miyajima,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nomura,N., Nagase,T., Miyajima,N., Sazuka,T., Tanaka,A., Sato,S., Seki,N., Kawarabayasi,Y., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. II. The coding sequences of 40 new genes (KIAA0041-KIAA0080) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 1 (5), 223-229 (1994) MEDLINE 96051398 FEATURES Location/Qualifiers source 1..4359 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..317 gene 318..3674 /gene="KIAA0055" CDS 318..3674 /gene="KIAA0055" /note="This gene is similar to tre oncogene(X63547)." /citation=[3] /codon_start=1 /db_xref="PID:d1006789" /db_xref="PID:g473945" /translation="MPAVASVPKELYLSSSLKDLNKKTEVKPEKISTKSYVHSALKIF KTAEECRLDRDEERAYVLYMKYVTVYNLIKKRPDFKQQQDYFHSILGPGNIKKAVEEA ERLSESLKLRYEEAEVRKKLEEKDRQEEAQRLQQKRQETGREDGGTLAKGSLENVLDS KDKTQKSNGEKNEKCETKEKGAITAKELYTMMTDKNISLIIMDARRMQDYQDSCILHS LSVPEEAISPGVTASWIEAHLPDDSKDTWKKRGNVEYVVLLDWFSSAKDLQIGTTLRS LKDALFKWESKTVLRNEPLVLEGGYENWLLCYPQYTTNAKVTPPPRRQNEEVSISLDF TYPSLEESIPSKPAAQTPPASIEVDENIELISGQNERMGPLNISTPVEPVAASKSDVS PIIQPVPSIKNVPQIDRTKKPAVKLPEEHRIKSESTNHEQQSPQSGKVIPDRSTKPVV FSPTLMLTDEEKARIHAETALLMEKNKQEKELRERQQEEQKEKLRKEEQEQKAKKKQE AEENEITEKQQKAKEEMEKKESEQAKKEDKETSAKRGKEITGVKRQSKSEHETSDAKK SVEDRGKRCPTPEIQKKSTGDVPHTSVTGDSGSGKPFKIKGQPESGILRTGTFREDTD DTERNKAQREPLTRARSEEMGRIVPGLPSGWAKFLDPITGTFRYYHSPTNTVHMYPPE MAPSSAPPSTPPTHKAKPQIPAERDREPSKLKRSYSSPDITQAIQEEEKRKPTVTPTV NRENKPTCYPKAEISRLSASQIRNLNPVFGGSGPALTGLRNLGNTCYMNSILQCLCNA PHLADYFNRNCYQDDINRSNLLGHKGEVAEEFGIIMKALWTGQYRYISPKDFKITIGK INDQFAGYSQQDSQELLLFLMDGLHEDLNKADNRKRYKEENNDHLDDFKAAEHAWQKH KQLNESIIVALFQGQFKSTVQCLTCHKKSRTFEAFMYLSLPLASTSKCTLQDCLRLFS KEEKLTDNNRFYCSHCRARRDSLKKIEIWKLPPVLLVHLKRFSYDGRWKQKLQTSVDF PLENLDLSQYVIGPKNNLKKYNLFSVSNHYGGLDGGHYTAYCKNAARQRWFKFDDHEV SDISVSSVKSSAAYILFYTSLGPRVTDVAT" 3'UTR 3675..4359 BASE COUNT 1492 a 836 c 916 g 1115 t ORIGIN 1 gtgagctggg ctggcttccg tcctggtagc caaggctaat tctccctcga gttcttggga 61 gatgggcatt tggcgagaag gctggcgtta gtgaagcgcg cccggcgtca cggtgagtgc 121 gggtcttggg ccctagcacc tgttctctgg gaagtcgtcc gctgtgaacg atgaacgcct 181 ttccttccac cagctgctgg ttaccccgga gacaagctct gtccgcggag aggagtggga 241 caactcctaa aggaaagaag cacttgtaag gaaatatagc atccattgtg aaagtggaaa 301 agtaaagata attcatcatg cctgctgtgg cttcagttcc taaagaactc tacctcagtt 361 cttcactaaa agaccttaat aagaagacag aagttaaacc agagaaaata agcactaaga 421 gttatgtgca cagtgccctg aagatcttta agacagcaga agaatgcaga ttagatcgtg 481 atgaggaaag ggcctatgta ctatatatga aatacgtgac tgtttataat cttatcaaaa 541 aaagacctga tttcaagcaa cagcaggatt atttccattc aatacttgga cctggaaaca 601 tcaaaaaagc tgtcgaagaa gctgaaagac tctctgaaag ccttaaatta agatatgaag 661 aagctgaagt ccggaaaaaa cttgaggaaa aagacaggca ggaggaagca cagcggctac 721 aacaaaaaag gcaggaaaca ggaagagagg atggtggcac attggctaaa ggctctttgg 781 agaatgtttt ggattccaaa gacaaaaccc aaaagagcaa tggtgaaaag aatgaaaaat 841 gtgagaccaa agagaaagga gcaatcacag caaaggaact atacacaatg atgacggata 901 aaaacatcag cttgattata atggatgctc gaagaatgca ggattatcag gattcctgta 961 ttttacattc tctcagtgtt cctgaagaag ccatcagtcc aggagtcact gctagttgga 1021 ttgaagcaca cctgccagat gattctaaag acacatggaa gaagaggggg aatgtggagt 1081 atgtggtact tcttgactgg tttagttctg ccaaagattt acagattgga acaactctcc 1141 ggagtctgaa agatgcactt ttcaagtggg aaagtaaaac tgtcctgcgc aatgagcctt 1201 tggttttaga gggaggctat gaaaactggc tcctttgtta tccccagtat acaacaaatg 1261 ctaaggtcac tccaccccca cgacgccaga atgaagaggt gtctatctca ttggatttta 1321 cttatccctc attggaagaa tcaattcctt ctaaacctgc tgcccagacg ccacctgcat 1381 ctatagaagt agatgaaaat atagaattga taagtggtca aaatgagaga atgggaccac 1441 tgaatatatc aactccagtt gaaccagttg ctgcttctaa atctgatgtt tcacccataa 1501 ttcagccagt gcctagtata aagaatgttc cacagattga tcgtactaaa aaaccagcag 1561 tcaaattgcc tgaagagcat agaataaaat ctgaaagtac aaaccatgag caacaatctc 1621 ctcagagtgg aaaagttatt cctgatcgtt ccaccaagcc agtagttttt tctccaactc 1681 tcatgttaac agatgaagaa aaggctcgta ttcatgcaga aactgctctt ctaatggaaa 1741 aaaacaaaca agaaaaagaa cttcgggaaa ggcagcaaga ggaacagaaa gagaaactga 1801 ggaaggaaga acaagaacaa aaagccaaaa agaaacaaga agctgaagaa aatgaaatta 1861 cagagaagca acaaaaagca aaagaagaaa tggagaagaa agaaagtgaa caggccaaga 1921 aagaagataa agaaacctca gcaaagaggg gcaaagaaat aacaggagta aaaagacaaa 1981 gtaaaagtga acatgaaact tctgatgcca agaaatctgt agaagatagg gggaaaaggt 2041 gtccaacccc agaaatacag aaaaagtcaa caggagatgt gccccataca tctgtgacag 2101 gggattcagg ttcaggcaag ccatttaaga ttaaaggaca accagaaagt ggaattctaa 2161 ggacaggaac ttttagagag gatacagacg ataccgaaag aaataaagct caacgagaac 2221 ctttgacaag agcacgaagt gaagaaatgg ggaggatcgt accaggactg ccttcaggct 2281 gggccaagtt tcttgaccca atcactggaa cctttcgtta ttatcattca cccaccaaca 2341 ctgttcatat gtacccaccg gaaatggctc cttcatctgc acctccttcc acccctccaa 2401 ctcataaagc caagccacag attcctgctg agcgggatag ggaaccttcc aaactgaagc 2461 gctcctactc ctccccagat ataacccagg ctattcaaga ggaagagaag aggaagccaa 2521 cagtaactcc aacagttaat cgggaaaaca agccaacatg ttatcctaaa gctgagatct 2581 caaggctttc tgcttctcag attcggaacc tcaatcctgt ttttggaggt tctggaccag 2641 ctcttactgg acttcgtaac ttaggaaata cttgttatat gaactcaata ttgcagtgcc 2701 tatgtaacgc tccacatttg gctgattatt tcaaccgaaa ctgttatcag gatgatatta 2761 acaggtcaaa tttgttgggg cataaaggtg aagtggcaga agaatttggt ataatcatga 2821 aagccctgtg gacaggacag tatagatata tcagtccaaa ggactttaaa atcaccattg 2881 ggaagatcaa tgaccagttt gcaggataca gtcagcaaga ttcacaagaa ttgcttctgt 2941 tcctaatgga tggtctccat gaagatctaa ataaagctga taatcggaag agatataaag 3001 aagaaaataa tgatcatctc gatgacttta aagctgcaga acatgcctgg cagaaacaca 3061 agcagctcaa tgagtctatt attgttgcac tttttcaggg tcaattcaaa tctacagtac 3121 agtgcctcac atgtcacaaa aagtctagga catttgaggc cttcatgtat ttgtctctac 3181 cactagcatc cacaagtaaa tgtacattac aggattgcct tagattattt tccaaagaag 3241 aaaaactcac agataacaac agattttact gcagtcattg cagagctcga cgggattctc 3301 taaaaaagat agaaatctgg aagttaccac ctgtgctttt agtgcatctg aaacgttttt 3361 cctacgatgg caggtggaaa caaaaattac agacatctgt ggacttcccg ttagaaaatc 3421 ttgacttgtc acagtatgtt attggtccaa agaacaattt gaagaaatat aatttgtttt 3481 ctgtttcaaa tcactacggt gggctggatg gaggccacta cacagcctat tgtaaaaatg 3541 cagcaagaca acggtggttt aagtttgatg atcatgaagt ttctgatatc tccgtttctt 3601 ctgtgaaatc ttcagcagct tatatcctct tttatacttc attgggacca cgagtaactg 3661 atgtagccac ataaggagac ataggttata aactagttat cttttaaaag gctcagcaac 3721 acaactcttg aaatgcttat caggataatg gtagctatag ctggccattt agaggaattc 3781 taggacagtg ggagctgtgt tactagcact atataattcc ggtcagtgct gacaaataac 3841 atttaacaag tattgcagta atcatcactt acaggtacca tttatttcaa aacaactttt 3901 ttagtctgct ccaaagttaa aataattaac tagctaagca ttattattca actggtctaa 3961 aaactattgt tatctttttt tttccttttc actgttatgg ccttttcaca tttctaaatc 4021 ccatcttgat atactatgaa tactctagaa tgatgtaaag cagataggaa tgtatgtgta 4081 catatttatt gcatacttgc acatcaaatc gatgtacata gtttaacacg tggtcctttt 4141 gtgaaaccta gaactcagag gattgctttt tttctttcag cctattttga gttaacttca 4201 gtcctttctt agggaaatga cagggcaaag caatttttct gttggctttg ggctgtattt 4261 gtgcactaaa tctttattct aaaaaaaaaa atggaaactt taattttttt aaaacgggaa 4321 tttcatttac agctacatta aaatcttaat gagaaaaat // LOCUS HUMORFC 2307 bp mRNA PRI 14-JUL-1995 DEFINITION Homo sapiens (clone S171) mRNA, complete cds. ACCESSION L40393 NID g887361 KEYWORDS . SOURCE Homo sapiens (clone: S171) (clone library: Bento Soares) fetus brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2307) AUTHORS Sherrington,R., Rogaev,E.I., Liang,Y., Rogaeva,E.A., Levesque,G., Ikeda,M., Chi,H., Lin,C., Li,G., Holman,K., Tsuda,T., Mar,L., Foncin,J.-F., Bruni,A.C., Montesi,M.P., Sorbl,S., Rainero,I., Pinessi,L., Nee,L., Chumakov,I., Pollen,D., Brookes,A., Sanseau,P., Polinsky,R.J., Wasco,W., Da Silva,H.A.R., Haines,J.L., Pericak-Vance,M.A., Tanzi,R.E., Roses,A.D., Fraser,P.E., Rommens,J.M. and St. George-Hyslop,P.H. TITLE Cloning of a gene bearing missense mutations in early-onset familial Alzheimer's disease JOURNAL Nature 375 (6534), 754-760 (1995) MEDLINE 95319502 FEATURES Location/Qualifiers source 1..2307 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="S171" /clone_lib="Bento Soares" /dev_stage="fetus" /germline /tissue_type="brain" /map="14q24.3" 5'UTR <1..538 /note="putative" mRNA <1..>2307 CDS 539..946 /note="ORF; putative" /codon_start=1 /db_xref="PID:g887362" /translation="MPYPAPNVPVVGITPSQMVANVFGTAGHPQAAHPHQSPSLVRQQ TFPHYEASSATTSPFFKPPAQHLNGSAAFNGVDDGRLASADRHTEVPTGTCPVDPFEA QWAALENKSKQRTNPSPTNPFSSDLQKTFEIEL" 3'UTR 947..>2307 /note="putative" BASE COUNT 626 a 532 c 488 g 655 t 6 others ORIGIN 1 agggtgcttc agtgtggctg acacagcagc atggtcttga caagttttct tcatcctacc 61 acaaaatccc agttggtaat agagacttta ctcctaccta tcaaaaccac aaaatgtccc 121 attagggggg gacatgttgt acatgttagg atcattcaaa taaccaagat tataaggtga 181 ggaaagatgc ccctaactga ttcttttgtc tctcatcttg ttggttccag ggaccgagtg 241 gggtcaatct tctggtsstg cctctccagg tctcttccag gccggtcata gacgtactcc 301 ctctgaggcc gaccgatggt tagaagaggt gtctaagagc gtccgggctc agcagcccca 361 ggcctcagct gctcctctgc agccagttct ccagcctcct ccacccactg ccatctccca 421 gccagcatca cctttccaag ggaatgcatt cctcacctct cagcctgtgc cagtgggtgt 481 ggtcccagcc ctgcaaccag cctttgtccc tgcccagtcc tatcctgtgg ccaatggaat 541 gccctatcca gcccctaatg tgcctgtggt gggcatcact ccctcccaga tggtggccaa 601 cgtwtttggc actgcaggcc accctcaggc tgcccatccc catcagtcac ccagcctggt 661 caggcagcag acattccctc actacgaggc aagcagtgct accaccagtc ccttctttaa 721 gcctcctgct cagcacctca acggttctgc agctttcaat ggtgtagatg atggcaggtt 781 ggcctcagca gacaggcata cagaggttcc tacaggcacc tgcccagtgg atccttttga 841 agcccagtgg gctgcattag aaaataagtc caagcagcgt actaatccct cccctaccaa 901 ccctttctcc agtgacttac agaagacgtt tgaaattgaa ctttaagcaa tcattatggc 961 tatgtatctt gtccatacca gacagggagc agggggtagc ggtcaaagga gcmaaacaga 1021 ytttgtctcc tgattagtac tcttttcact aatcccaaag gtcccaagga acaagtccag 1081 gcccagagta ctgtgagggg tgattttgaa agacatggga aaaagcattc ctagagaaaa 1141 gctgccttgc aattaggcta aagaagtcaa ggaaatgttg ctttctgtac tccctcttcc 1201 cttaccccct tacaaatctc tggcaacaga gaggcaaagt atctgaacaa gaatctatat 1261 tccaagcaca tttactgaaa tgtaaaacac aacaggaagc aaagcaatgt ccctttgttt 1321 ttcaggccat tcacctgcct cctgtcagta gtggcctgta ttagagatca agaagagtgg 1381 tttgtgctca ggctgggaac agagaggcac gctatgctgc cagaattccc aggagggcat 1441 atcagcaact gcccagcaga gctatatttt gggggagaag ttgagcttcc attttgagta 1501 acagaataaa tattatatat atcaaaagcc aaaatcttta tttttatgca tttagaatat 1561 tttaaatagt tctcagatat taagaagttg tatgagttgt aagtaatctt gccaaaggta 1621 aaggggctag ttgtaagaaa ttgtacatra gattgattta tcattgatgc ctactgaaat 1681 aaaaagagga aaggctggaa gcatgcagac aggatcccta gcttgttttc tgtcagtcat 1741 tcattgtaag tagcacattg caacaacaat catgcttatg accaatacag tcactaggtt 1801 gtagtttttt ttaaataaag gaaaagcagt attgtcctgg ttttaaacct atgatggaat 1861 tctaatgtca ttattttaat ggaatcaatc gaaatatgct ctatagagaa tatatctttt 1921 atatattgct gcagtttcct tatgttaatc ctttaacact aaggtaacat gacataatca 1981 taccatagaa gggaacacag gttaccatat tggtttgtaa tatgggtctt ggtgggtttt 2041 gttttatcct ttaaattttg ttcccatgag ttttgtgggg atggggattc tggttttatt 2101 agctttgtgt gtgtcctctt cccccaaacc cccttttggt gagaacatcc ccttgacagt 2161 tgcagcctct tgacctcgga taacaataag agagctcatc tcatttttac ttttgaacgt 2221 tggcgcttac aatcaaatgt aagttatata tatttgtact gatgaaaatt tataatctgc 2281 tttaacaaaa ataaatgttc atggtag // LOCUS HUMORFDA 3784 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0031 gene, complete cds. ACCESSION D21163 NID g434758 KEYWORDS KIAA0031. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3784) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (11-NOV-1992) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 3784) AUTHORS Nomura,N., Miyajima,N., Kawarabayasi,Y. and Tabata,S. TITLE Prediction of new human genes by entire sequencing of randomly sampled cDNA clones JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 JOURNAL DNA Res. 1 (1), 27-35 (1994) MEDLINE 96051387 REFERENCE 4 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 (supplement) JOURNAL DNA Res. 1 (1), 47-56 (1994) MEDLINE 96051389 FEATURES Location/Qualifiers source 1..3784 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" gene 61..2979 /gene="KIAA0031" CDS 61..2979 /gene="KIAA0031" /note="similar to human elongation factor 2 mRNA (HSEF2)." /codon_start=1 /db_xref="PID:d1005229" /db_xref="PID:g434759" /translation="MDTDLYDEFGNYIGPELDSDEDDDELGRETKDLDEMDDDDDDDD VGDHDDDHPGMEVVLHEDKKYYPTAEEVYGPEVETIVQEEDTQPLTEPIIKPVKTKKF TLMEQTLPVTVYEMDFLADLMDNSELIRNVTLCGHLHHGKTCFVDCLIEQTHPEIRKR YDQDLCYTDILFTEQERGVGIKSTPVTVVLPDTKGKSYLFNIMDTPGHVNFSDEVTAG LRISDGVVLFIDAAEGVMLNTERLIKHAVQERLAVTVCINKIDRLILELKLPPTDAYY KLRHIVDEVNGLISMYSTDENLILSPLLGNVCFSSSQYSICFTLGSFAKIYADTFGDI NYQEFAKRLWGDIYFNPKTRKFTKKAPTSSSQRSFVEFILEPLYKILAQVVGDVDTSL PRTLDELGIHLTKEELKLNIRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIE HTYTGGVDSDLGEAMSDCDPDGPLMCHTTKMYSTDDGVQFHAFGRVLSGTIHAGQPVK VLGENYTLEDEEDSQICTVGRLWISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATIT EPRGNEEAQIFRPLKFNTTSVIKIAVEPVNPSELPKMLDGLRKVNKSYPSLTTKVEES GEHVILGTGELYLDCVMHDLRKMYSEIDIKVADPVVTFCETVVETSSLKCFAETPNKK NKITMIAEPLEKGLAEDIENEVVQITWNRKKLGEFFQTKYDWDLLAARSIWAFGPDAT GPNILVDDTLPSEVDKALLGSVKDSIVQGFQWGTREGPLCDELIRNVKFKILDAVVAQ EPLHRGGGQIIPTARRVVYSAFLMATPRLMEPYYFVEVQAPADCVSAVYTVLARRRGH VTQDAPIPGSPLYTIKAFIPAIDSFGFETDLRTHTQGQAFSLSVFHHWQIVPGDPLDK SIVIRPLEPQPAPHLAREFMIKTRRRKGLSEDVSISKFFDDPMLLELAKQDVVLNYPM " 3'UTR 2980..>3784 BASE COUNT 884 a 997 c 985 g 918 t ORIGIN 1 ggcggaagca cgatctccgg cagcggcctg ggaactctta gctgagcagg cgagagcatc 61 atggataccg acttatatga tgagtttggg aattatattg gaccagagct tgattctgat 121 gaagatgatg atgaattggg tagagagacc aaagatcttg atgagatgga tgatgatgac 181 gacgacgatg acgtaggaga tcatgacgat gaccaccctg ggatggaggt ggtgctgcat 241 gaggacaaga agtactaccc aacagccgag gaggtgtatg gtcctgaggt ggagaccata 301 gttcaagagg aagacactca gcctctcaca gaacccatta ttaagccagt gaaaaccaag 361 aaattcactc tgatggagca gacattacct gttacggtgt atgagatgga tttcttggcg 421 gatctgatgg ataactcaga gctcatcaga aatgtgaccc tttgtggaca tctccaccat 481 ggcaagacat gttttgtgga ttgtttaatt gaacagactc acccggaaat cagaaagcgc 541 tatgaccaag atctgtgcta tactgacatc ctcttcacag agcaagagag aggtgtaggc 601 atcaaaagca ctcctgtgac agtggtcttg ccagacacca aaggaaaatc ttatctcttc 661 aatatcatgg acactccagg acatgtgaat ttctctgatg aggtcacagc tggcttgcgc 721 atctcagatg gagtggtcct tttcattgat gctgctgagg gggtgatgct gaacacagag 781 cggctgatca agcatgcggt gcaggagagg ctggcagtca ctgtgtgcat caacaagatt 841 gaccggctga tcctggagct gaagctgcct ccaactgatg cttattacaa gctgcgccac 901 attgtggatg aggtcaatgg attaataagc atgtattcca ctgatgagaa cctgatcctt 961 tccccactcc tgggtaacgt ctgcttctcc agctcccagt acagcatctg cttcacgctg 1021 ggctcctttg ccaagatcta tgccgacacc tttggtgaca ttaattacca agaatttgct 1081 aaaagactct ggggtgacat ctacttcaac cctaagacgc gaaagttcac caaaaaggcc 1141 ccaactagca gctcccagag aagtttcgtg gagtttatct tggagcctct ttataagatc 1201 ctcgcccagg ttgtaggtga cgtggacacc agcctcccac ggaccctaga cgagcttggc 1261 atccacctga cgaaggagga gctgaagctg aacatccgcc ccttgctcag gctggtctgc 1321 aaaaagttct ttggcgagtt cacaggcttt gtggacatgt gtgtgcagca tatcccttct 1381 ccaaaggtgg gcgccaagcc caagattgag cacacctaca ccggtggtgt ggactccgac 1441 ctcggcgagg ctatgagtga ctgtgaccct gatggccccc tgatgtgcca cactactaag 1501 atgtacagca cagatgatgg agtccagttt cacgcctttg gccgggtgct gagtggcacc 1561 attcatgctg ggcagcctgt gaaggtactg ggggagaact acaccctgga ggatgaggaa 1621 gactcccaga tatgcaccgt gggccgcctt tggatctctg tggccaggta ccacatcgag 1681 gtgaaccgtg ttcctgctgg caactgggtt ctgattgaag gtgttgatca accaattgtg 1741 aagacagcaa ccataaccga accccgaggc aatgaggagg ctcagatttt ccgacccttg 1801 aagttcaata ccacatctgt tatcaagatt gctgtggagc cagtcaaccc ctcagagctg 1861 cccaagatgc ttgatggcct gcgcaaggtc aacaagagct atccatccct caccaccaag 1921 gtggaggagt ctggcgagca tgtgatcctg ggcactgggg agctctacct ggactgtgtg 1981 atgcatgatt tgcggaagat gtactcagag atagacatca aggtggctga cccagttgtc 2041 acgttttgtg agacggtggt ggaaacatcc tccctcaagt gctttgctga aacgcctaat 2101 aagaagaaca agatcaccat gattgctgag cctcttgaga agggcctggc agaggacata 2161 gagaatgagg tggtccagat tacgtggaac aggaagaagc tgggagagtt cttccagacc 2221 aagtacgatt gggatctgct ggctgcccgt tccatctggg cttttggccc tgatgcgact 2281 ggccccaaca ttctggtgga tgatactctg ccctctgagg tggacaaggc tcttcttggt 2341 tcagtgaagg acagcatcgt tcaaggtttc cagtggggaa ccagggaggg ccccctctgt 2401 gatgaattga ttcggaatgt caagtttaag atcctggatg cggtggttgc ccaggagccc 2461 ctgcaccggg gcgggggcca gatcatcccc acagccagga gagtcgtcta ctctgccttc 2521 ctcatggcta ctcctcgtct gatggagcct tactactttg tagaggtcca ggcccctgca 2581 gattgcgtct ctgcagttta taccgtcctg gccaggcgca gggggcacgt gactcaggat 2641 gcacccatcc caggctcccc tctgtacacc atcaaagctt ttatcccggc catcgactct 2701 tttggctttg agactgatct ccggactcac acccagggac aagccttttc tctgtctgtc 2761 ttccaccact ggcagattgt gcctggtgat cccctggaca agagcattgt catccgcccc 2821 ttggagccac agccagctcc tcacctggcc cgggaattca tgatcaaaac ccgccgtagg 2881 aagggcctca gtgaagatgt gagcatcagc aaattcttcg atgatcctat gttgctggaa 2941 cttgccaaac aggatgttgt gctcaattac cccatgtgag tgcgtggact cctgggagct 3001 cctgctccct acagtgggct gcaactcctg tacttgaagc tgagacctca tatgacgtgg 3061 ccttcgtgtt gtcagagagt gtctggaagc tgctgttgcc atcttgaaca actcaccaac 3121 ctccaaccca gagccccagt gagagaggag catttggcct cctgcttcct tctgtggcct 3181 ctgccgggct ccattcccaa ggaaaagaga ggagcttggg ctcacagaaa gagaagggga 3241 tgaaacccca aggggcccta tctttgggat ttacatggaa ttttattttc tacaagtttg 3301 accttagcca tggtttgcaa gtgaacagaa cattctgacc tctgtcttgc tctgctcctt 3361 tcatcctcgt ctcccctgcc ccatctggtg cttacattct gaatatatgt catctcccaa 3421 gaggcttcac tgcctctgct tccagctgca gcctccttcc tgcctgggtc cccagggaag 3481 ccgcctgcct tttaattcag tgttcccatg agcgccaagg ccccattatt gcccccttgc 3541 tcccactcca tgctgcttct gggtgggacc taagatggct tgggagttgt tgggttcctg 3601 cgatcagaag tctaccccac cacctcctca ggaaactgct gcctccccta agaatcttcc 3661 ttgccctgga gtagggggcc agagcacttt gatttccagc catttactcc aagtcctctc 3721 cccagctacc accagtccct tactctgttc tcccccagtg aaaaagagtc tgttgatttt 3781 cctc // LOCUS HUMORFEA 6111 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0034 gene, complete cds. ACCESSION D21260 NID g434760 KEYWORDS KIAA0034. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6111) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (11-NOV-1992) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 6111) AUTHORS Nomura,N., Miyajima,N., Kawarabayasi,Y. and Tabata,S. TITLE Prediction of new human genes by entire sequencing of randomly sampled cDNA clones JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 JOURNAL DNA Res. 1 (1), 27-35 (1994) MEDLINE 96051387 REFERENCE 4 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 (supplement) JOURNAL DNA Res. 1 (1), 47-56 (1994) MEDLINE 96051389 REFERENCE 5 (sites) AUTHORS Kedra,D., Peyrard,M., Fransson,I., Collins,J.E., Dunham,I., Roe,B.A. and Dumanski,J.P. TITLE Characterization of a second human clathrin heavy chain polypeptide gene (CLH-22) from chromosome 22q11 JOURNAL Hum. Mol. Genet. 5 (5), 625-631 (1996) MEDLINE 96311557 FEATURES Location/Qualifiers source 1..6111 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..172 gene 173..5200 /gene="KIAA0034" CDS 173..5200 /gene="KIAA0034" /note="similar to rat clathrin heavy chain mRNA (ratcarhc)." /codon_start=1 /db_xref="PID:d1005334" /db_xref="PID:g434761" /translation="MAQILPIRFQEHLQLQNLGINPANIGFSTLTMESDKFICIREKV GEQAQVVIIDMNDPSNPIRRPISADSAIMNPASKVIALKAGKTLQIFNIEMKSKMKAH TMTDDVTFWKWISLNTVALVTDNAVYHWSMEGESQPVKMFDRHSSLAGCQIINYRTDA KQKWLLLTGISAQQNRVVGAMQLYSVDRKVSQPIEGHAASFAQFKMEGNAEESTLFCF AVRGQAGGKLHIIEVGTPPTGNQPFPKKAVDVFFPPEAQNDFPVAMQISEKHDVVFLI TKYGYIHLYDLETGTCIYMNRISGETIFVTAPHEATAGIIGVNRKGQVLSVCVEEENI IPYITNVLQNPDLALRMAVRNNLAGAEELFARKFNALFAQGNYSEAAKVAANAPKGIL RTPDTIRRFQSVPAQPGQTSPLLQYFGILLDQGQLNKYESLELCRPVLQQGRKQLLEK WLKEDKLECSEELGDLVKSVDPTLALSVYLRANVPNKVIQCFAETGQVQKIVLYAKKV GYTPDWIFLLRNVMRISPDQGQQFAQMLVQDEEPLADITQIVDVFMEYNLIQQCTAFL LDALKNNRPSEGPLQTRLLEMNLMHAPQVADAILGNQMFTHYDRAHIAQLCEKAGLLQ RALEHFTDLYDIKRAVVHTHLLNPEWLVNYFGSLSVEDSLECLRAMLSANIRQNLQIC VQVASKYHEQLSTQSLIELFESFKSFEGLFYFLGSIVNFSQDPDVHFKYIQAACKTGQ IKEVERICRESNCYDPERVKNFLKEAKLTDQLPLIIVCDRFDFVHDLVLYLYRNNLQK YIEIYVQKVNPSRLPVVIGGLLDVDCSEDVIKNLILVVRGQFSTDELVAEVEKRNRLK LLLPWLEARIHEGCEEPATHNALAKIYIDSNNNPERFLRENPYYDSRVVGKYCEKRDP HLACVAYERGQCDLELINVCNENSLFKSLSRYLVRRKDPELWGSVLLESNPYRRPLID QVVQTALSETQDPEEVSVTVKAFMTADLPNELIELLEKIVLDNSVFSEHRNLQNLLIL TAIKADRTRVMEYINRLDNYDAPDIANIAISNELFEEAFAIFRKFDVNTSAVQVLIEH IGNLDRAYEFAERCNEPAVWSQLAKAQLQKGMVKEAIDSYIKADDPSSYMEVVQAANT SGNWEELVKYLQMARKKARESYVETELIFALAKTNRLAELEEFINGPNNAHIQQVGDR CYDEKMYDAAKLLYNNVSNFGRLASTLVHLGEYQAAVDGARKANSTRTWKEVCFACVD GKEFRLAQMCGLHIVVHADELEELINYYQDRGYFEELITMLEAALGLERAHMGMFTEL AILYSKFKPQKMREHLELFWSRVNIPKVLRAAEQAHLWAELVFLYDKYEEYDNAIITM MNHPTDAWKEGQFKDIITKVANVELYYRAIQFYLEFKPLLLNDLLMVLSPRLDHTRAV NYFSKVKQLPLVKPYLRSVQNHNNKSVNESLNNLFITEEDYQALRTSIDAYDNFDNIS LAQRLEKHELIEFRRIAAYLFKGNNRWKQSVELCKKDSLYKDAMQYASESKDTELAEE LLQWFLQEEKRECFGACLFTCYDLLRPDVVLETAWRHNIMDFAMPYFIQVMKEYLTKV DKLDASESLRKEEEQATETQPIVYGQPQLMLTAGPSVAVPPQAPFGYGYTAPPYGQPQ PGFGYSM" 3'UTR 5201..6111 BASE COUNT 1792 a 1240 c 1329 g 1750 t ORIGIN 1 ccgcccccga cccgagctct ttcgtctgcc tgccagtttc ctgcgtcccc ggagaggatc 61 ctgctgagcc cagcctcccc cctccccttc tcctcctctc ccttggagag cccgggcagc 121 cactgccccg cagccccagt gacaggagga gaccataacc cccgacagcg ccatggccca 181 gattctgcca attcgttttc aggagcatct ccagctccag aacctgggta tcaacccagc 241 aaacattggc ttcagtaccc tgactatgga gtctgacaaa ttcatctgca ttagagaaaa 301 agtaggagag caggcccagg tggtaatcat tgatatgaat gacccaagta atccaattcg 361 aagaccaatt tcagcagaca gcgccatcat gaatccagct agcaaagtaa ttgcactgaa 421 agctgggaaa actcttcaga tttttaacat tgaaatgaaa agtaaaatga aggctcatac 481 catgactgat gatgtcacct tttggaaatg gatctctttg aatacggttg ctcttgttac 541 ggataatgca gtttatcact ggagtatgga aggagagtct cagccagtga aaatgtttga 601 tcgccattct agccttgcag ggtgccagat tatcaattac cgtacagatg caaaacaaaa 661 gtggttactt ctgactggta tatctgcaca gcaaaatcgt gtggtgggag ctatgcagct 721 atattctgta gataggaaag tgtctcagcc cattgaagga catgcagcta gctttgcaca 781 gtttaagatg gaaggaaatg cagaagaatc aacgttattt tgttttgcag ttcggggcca 841 agctggaggg aagttacata ttattgaagt tggcacacca cctacaggga accagccctt 901 tccaaagaag gcagtggatg tcttctttcc tccagaagca caaaatgatt ttcctgttgc 961 aatgcagatc agtgaaaagc atgatgtggt gttcttgata accaagtatg gttatatcca 1021 cctctatgat cttgagactg gtacctgcat ctacatgaat agaatcagtg gagaaacaat 1081 ttttgttact gcacctcatg aagccacagc tggaataatt ggagtaaaca gaaagggaca 1141 agttctgtca gtgtgtgtgg aagaagaaaa cataattcct tacatcacca atgttctaca 1201 aaatcctgat ttggctctga gaatggctgt acgtaataac ttagccggtg ctgaagaact 1261 ctttgcccgg aaatttaatg ctctttttgc ccagggaaat tactcggagg cagcaaaggt 1321 ggctgctaat gcaccaaagg gaattcttcg tactccagac actatccgtc ggttccagag 1381 tgtcccagcc cagccaggtc aaacttctcc tctacttcag tactttggta tccttttgga 1441 ccagggacag ctcaacaaat acgaatcctt agagctttgt aggcctgtac ttcagcaagg 1501 gcgaaaacag cttttggaga aatggttaaa agaagataag ctggaatgtt ctgaagaact 1561 gggtgatctt gtgaaatctg tggaccctac attggcactt agtgtgtacc taagggctaa 1621 cgtcccaaat aaagtcattc agtgctttgc agaaacaggt caagtccaaa agattgtttt 1681 atatgctaaa aaagttggat acactccaga ttggatattt ctgctgagaa atgtaatgcg 1741 aatcagtcca gatcagggac agcagtttgc ccaaatgtta gttcaagatg aagagcctct 1801 tgctgacatc acacagattg tagatgtctt tatggaatac aatctaattc agcagtgtac 1861 tgcattcttg cttgatgctc tgaagaataa tcgcccatct gaaggtcctt tacagacgcg 1921 gttacttgag atgaacctta tgcatgcgcc tcaagttgca gatgctattc taggcaatca 1981 gatgttcaca cattatgacc gggctcatat tgctcaactg tgtgaaaagg ctggcctact 2041 gcagcgtgca ttagaacatt tcactgattt atatgatata aaacgtgcag tggttcacac 2101 ccatcttctt aaccctgagt ggttagtcaa ctactttggt tccttatcag tagaagactc 2161 cctagaatgt ctcagagcca tgctgtctgc caacatccgt cagaatctgc agatttgtgt 2221 tcaggtggct tctaaatatc atgaacaact gtcaactcag tctctgattg aactttttga 2281 atctttcaag agttttgaag gtctctttta ttttctggga tccattgtta actttagcca 2341 ggacccagat gtgcacttta aatatattca ggcagcttgc aagactgggc aaatcaaaga 2401 agtagaaaga atctgtagag aaagcaactg ctacgatcct gagcgagtca agaattttct 2461 taaggaagca aaactaacag atcagctacc acttatcatt gtgtgtgatc gatttgactt 2521 tgtccatgat ttggtgctct atttatatag aaataatctt caaaagtata tagagatata 2581 tgtacagaag gtgaatccaa gtcgacttcc tgtagttatt ggaggattac ttgatgttga 2641 ctgttctgaa gatgtcataa aaaacttgat tcttgttgta agaggtcaat tctctactga 2701 tgagcttgtt gctgaggttg aaaaaagaaa cagattgaaa ctgcttctgc cttggctaga 2761 ggccagaatt catgagggct gtgaggagcc tgctactcac aatgccttag ccaaaatcta 2821 catagacagt aataacaacc cggagagatt tcttcgtgaa aatccctact atgacagtcg 2881 cgttgttgga aagtattgtg agaagagaga tccacatctg gcctgtgttg cttatgaacg 2941 tggccaatgt gatctggaac ttattaatgt ttgcaatgag aattccctct tcaaaagtct 3001 ttctcgctac ctggtacgtc gaaaggatcc agaattgtgg ggcagcgtgc tgctggaaag 3061 caatccttac aggagacccc taattgacca ggttgtacaa acagctttgt ctgagactca 3121 ggaccctgaa gaagtgtcag taactgtaaa ggctttcatg actgcagacc ttcctaatga 3181 actcattgaa ctgctggaga aaattgtcct tgataactct gtattcagtg aacacaggaa 3241 tctgcaaaac ctccttatcc tcactgcaat taaggctgac cgtacacgtg ttatggagta 3301 tattaaccgc ctggataatt atgatgcccc agatattgcc aatatcgcca tcagcaatga 3361 gctgtttgaa gaagcatttg ccattttccg gaaatttgat gtcaatactt cagcagttca 3421 ggtcttaatt gagcatattg gaaacttgga tcgggcatat gagtttgctg aacgttgcaa 3481 tgaacctgcg gtctggagtc aacttgcaaa agcccagttg cagaaaggaa tggtgaaaga 3541 agccattgat tcttatatca aagcagatga tccttcctcc tacatggaag ttgttcaggc 3601 tgccaatact agtggaaact gggaagaact ggtgaagtac ttgcagatgg cccgtaagaa 3661 ggctcgagag tcctatgtgg agacagaact gatattcgca ctggctaaaa caaaccgcct 3721 tgcagagtta gaagaattta tcaatggacc aaataatgct catatccaac aagttggtga 3781 ccgttgttat gatgaaaaaa tgtatgatgc tgctaagttg ttgtacaata atgtttccaa 3841 ttttggacgt ttggcatcta ccctggttca cctgggtgaa tatcaggcag ctgttgatgg 3901 ggctaggaaa gctaacagta ctcgaacatg gaaagaggtc tgcttcgcct gtgtagatgg 3961 gaaagaattc cgtcttgctc agatgtgtgg acttcatatt gttgtacatg cagatgaatt 4021 agaagaactt atcaactact atcaggatcg tggctatttt gaagagctga tcaccatgtt 4081 ggaagcagca ctgggacttg agcgagctca catgggaatg tttactgaat tagctattct 4141 atactctaaa tttaagcctc agaaaatgag ggagcacctg gagctgttct ggtctagagt 4201 gaatattccc aaggtgctaa gagctgcaga acaagctcat ctttgggcag aactggtgtt 4261 tttgtatgac aagtatgaag aatatgataa tgccataatt accatgatga atcatccaac 4321 tgatgcctgg aaagaagggc aattcaaaga tatcattacc aaggttgcca atgtggaact 4381 atactacaga gcaatacagt tctacttaga attcaagcct ctgttgttaa atgatttgct 4441 gatggtgctg tctccacggt tggatcacac tcgtgcagtc aattatttca gcaaggttaa 4501 acagctacca ctggtgaaac cgtatttgcg ttcagttcag aaccataaca acaaatctgt 4561 gaatgaatca ttgaacaatc tttttattac agaagaagat tatcaggctc tgcgaacatc 4621 aatagatgct tatgacaact ttgacaatat ctcgcttgct cagcgtttgg aaaaacatga 4681 actcattgag ttcaggagaa ttgctgctta tctcttcaaa ggcaacaatc gctggaaaca 4741 gagtgtagag ctgtgcaaga aagacagcct ttacaaggat gcaatgcagt atgcttctga 4801 atctaaagat actgaattgg ctgaagaact cctgcagtgg tttttgcagg aagaaaaaag 4861 agagtgcttt ggagcttgtc tgtttacctg ttacgatctt ttaaggccag atgtcgtcct 4921 agaaactgca tggaggcaca atatcatgga ttttgccatg ccctatttca tccaggtcat 4981 gaaggagtac ttgacaaagg tggataaatt agatgcttca gaatcactga gaaaagaaga 5041 agaacaagct acagagacac aacccattgt ttatggtcag ccccagttga tgctgacagc 5101 aggacccagt gttgccgtcc ctccccaggc accttttggt tatggttata ccgcaccacc 5161 gtatggacag ccacagcctg gctttgggta cagcatgtga gatgaagcgc tgatcctgta 5221 gtcacctatt ttcgtactga aacatcgtct ttacccactt ctcagtttat aatgggggaa 5281 aacaggcaac gtgttcttgt aacctttatt tcatgaagga cttctttttg tttctaacta 5341 taaacttgga tcacctatgt taaaacctta tttcacattc cacatcattt tagaatttat 5401 tttcgaaggg gaatagtttc aatgttttat tcacttgggc tttttttctt ccccctcttt 5461 ctttaaagaa ctgctcaata ttcaatctgt tgtgaagaac ctgatttgca ctctgtagtg 5521 tttaaagaaa caaagaaact ctaatattga atctcttaaa tttagtgtat gtaaacagct 5581 tacaaatacg tattgtctaa atgcatttaa atctgtttta ttcaaagaaa agctaaagca 5641 aaaacactgg catatgacca tgcaagactg tcagtgccaa caaagacaac actaatcagc 5701 acatcgtaca ctggattgca gtgcttccca gattattgaa aaatgttaca gacaacttgc 5761 ctgattttta aatgagcgta aaaggccctc taacctatgc aggtttcccc attatgcata 5821 tagaaaatgc tagtatgttt tgctcacttc atatgtaaca ggtgccctta tgttgtgctg 5881 tatcctgtgc tttttctgtg ggaccattcc attcaggagc aaagagcacc atgattccaa 5941 tcttgtgtgt gtttactaac ccttccctga ggcttgtgta tgttggatat tgtggtgttt 6001 tagatcactg agtgtacaga agagagaaat tcaaacaaaa tattgctgtt cttcagtttt 6061 gtttgtggaa tttgaaatta ctcaaattta aaataaatta ctggactgtg g // LOCUS HUMORFFA 1360 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0120 gene, complete cds. ACCESSION D21261 NID g434762 KEYWORDS KIAA0120. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1360) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (21-OCT-1993) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 1360) AUTHORS Miyajima,N. JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nagase,T., Miyajima,N., Tanaka,A., Sazuka,T., Seki,N., Sato,S., Tabata,S., Ishikawa,K.-i., Kawarabayasi,Y., Kotani,H. and Nomura,N. TITLE Prediction of the coding sequences of unidentified human genes. III. The coding sequences of 40 new genes (KIAA0081-KIAA0120) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 2 (1), 37-43 (1995) MEDLINE 95308325 FEATURES Location/Qualifiers source 1..1360 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..73 gene 74..673 /gene="KIAA0120" CDS 74..673 /gene="KIAA0120" /note="similar to human 22kDa, SM22 mRNA (HUM22SM)." /codon_start=1 /db_xref="PID:d1005335" /db_xref="PID:g434763" /translation="MANRGPAYGLSREVQQKIEKQYDADLEQILIQWITTQCRKDVGR PQPGRENFQNWLKDGTVLCELINALYPEGQAPVKKIQASTMAFKQMEQISQFLQAAER YGINTTDIFQTVDLWEGKNMACVQRTLMNLGGLAVARDDGLFSGDPNWFPKKSKENPR NFSDNQLQEGKNVIGLQMGTNRGASQAGMTGYGMPRQIL" 3'UTR 674..1360 BASE COUNT 283 a 397 c 364 g 316 t ORIGIN 1 gcccttgcct tgagtcagtg cgctgctctc cagcccgctt gaacgctccc cgcagccacc 61 gccacccatt ggaatggcca acaggggacc tgcatatggc ctgagccggg aggtgcagca 121 gaagattgag aaacaatatg atgcagatct ggagcagatc ctgatccagt ggatcaccac 181 ccagtgccga aaggatgtgg gccggcccca gcctggacgc gagaacttcc agaactggct 241 caaggatggc acggtgctat gtgagctcat taatgcactg taccccgagg ggcaggcccc 301 agtaaagaag atccaggcct ccaccatggc cttcaagcag atggagcaga tctctcagtt 361 cctgcaagca gctgagcgct atggcattaa caccactgac atcttccaaa ctgtggacct 421 ctgggaagga aagaacatgg cctgtgtgca gcggacgctg atgaatctgg gtgggctggc 481 agtagcccga gatgatgggc tcttctctgg ggatcccaac tggttcccta agaaatccaa 541 ggagaatcct cggaacttct cagataacca gctgcaagag ggcaagaacg tgatcgggtt 601 acagatgggc accaaccgcg gggcgtctca ggcaggcatg actggctacg ggatgccacg 661 ccagatcctc tgatcccacc ccaggccttg cccctgccct cccacgaatg gttaatatat 721 atgtagatat atattttagc agtgacattc ccagagagcc ccagagctct caagctcctt 781 tctgtcaggg tggggggttc agcctgtcct gtcacctctg aggtgcctgc tggcatcctc 841 tcccccatgc ttactaatac attcccttcc ccatagccat caaaactgga ccaactggcc 901 tcttcctttc ccctgggacc aaaatttagg ggcctcagtc cctcaccgcc atgccctggc 961 ctattctgtc tctccttctt ccccctggcc tgttctgtct ctgagctctg tgtcctccgt 1021 tcattccatg gctgggagtc actgatgctg cctctgcctt ctgatgctgg actggccttg 1081 cttctacaag tatgcttctc ccacagctgt ggctgcagga acttaattta tagggaggag 1141 cctgtggcag ctgctgcccc agccacagct gcactgactg tgctcaccac acatctgggg 1201 cagccttccc tggcaggggc cctcgtggct tctcattttc cattcccttc actgtggcta 1261 aggggtgggg tgaggggatg gagagggagg gctgcctacc atggtctggg gcttgaggaa 1321 gatgagtttg ttgatttaaa taaagaattt gtcatttttg // LOCUS HUMORFHA 4203 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0028 gene, partial cds. ACCESSION D21851 NID g434766 KEYWORDS KIAA0028. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4203) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (11-NOV-1992) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 4203) AUTHORS Nomura,N., Miyajima,N., Kawarabayasi,Y. and Tabata,S. TITLE Prediction of new human genes by entire sequencing of randomly sampled cDNA clones JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 JOURNAL DNA Res. 1 (1), 27-35 (1994) MEDLINE 96051387 REFERENCE 4 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 (supplement) JOURNAL DNA Res. 1 (1), 47-56 (1994) MEDLINE 96051389 FEATURES Location/Qualifiers source 1..4203 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..185 gene 186..2897 /gene="KIAA0028" CDS 186..2897 /gene="KIAA0028" /codon_start=1 /db_xref="PID:d1005413" /db_xref="PID:g2104217" /translation="MASVWQRLGFYASLLKRQLNGGPDVIKWERRVIPGCTRSIYSAT GKWTKEYTLQTRKDVEKWWHQRIKEQASKISEADKSKPKFYVLSMFPYPSGKLHMGHV RVYTISDTIARFQKMRGMQVINPMGWDAFGLPAENAAVERNLHPQSWTQSNIKHMRKQ LDRLGLCFSWDREITTCLPDYYKWTQYLFIKLYEAGLAYQKEALVNWDPVDQTVLANE QVDEHGCSWRSGAKVEQKYLRQWFIKTTAYAKAMQDALADLPEWYGIKGMQAHWIGDC VGCHLDFTLKVHGQATGEKLTAYTATPEAIYGTSHVAISPSHRLLHGHSSLKEALRMA LVPGKDCLTPVMAVNMLTQQEVPVVILAKADLEGSLDSKIGIPSTSSEDTILAQTLGL AYSEVIETLPDGTERLSSSAEFTGMTRQDAFLALTQKARGKRVGGDVTSDKLKDWLIS RQRYWGTPIPIVHCPVCGPTPVPLEDLPVTLPNIASFTGKGGPPLAMASEWVNCSCPR CKGAAKRETDTMDTFVDSAWYYFRYTDPHNPHSPFNTAVADYWMPVDLYIGGKEHAVM HLFYARFFSHFCHDQKMVKHREPFHKLLAQGLIKGQTFRLPSGQYLQREEVDLTGSVP VHAKTKEKLEVTWEKMSKSKHNGVDPEEVVEQYGIDTIRLYILFAAPPEKDILWDVKT DALPGVLRWQQRLWTLTTRFIEARASGKSPQPQLLSNKEKAEARKLWEYKNSVISQVT THFTEDFSLNSAISQLMGLSNALSQASQSVILHSPEFEDALCALMVMAAPLAPHVTSE IWAGLALVPRKLCAHYTWDASVLLQAWPAVDPEFLQQPEVVQMAVLINNKACGKIPVP QQVARDQDKVHEFVLQSELGVRLLQGRSIKKSFLSPRTALINFLVQD" 3'UTR 2898..4203 BASE COUNT 1066 a 1043 c 1137 g 957 t ORIGIN 1 tgacaacatg gcggcgccca tggtccgtgg cccggcagtg ctcgcctaaa ggtggagaac 61 gaggagtaga ggaggccgca gccagagcct gtgagcagat ccagacctac agataaaaaa 121 cattatttaa tctatctggg atttactccg gcttatgatt tgagggcctt ctcaccttct 181 gaagaatggc ttctgtttgg cagagattgg gtttttatgc ctctcttctg aaaagacagc 241 taaatggtgg gccagatgtc atcaagtggg aaaggagagt aattcccgga tgtaccagaa 301 gcatctacag tgccacggga aagtggacaa aagagtatac attgcagaca agaaaggatg 361 ttgagaaatg gtggcatcaa cgaataaaag aacaggcctc caaaatttca gaagctgata 421 aatcgaagcc aaaattttac gtgctttcca tgttccctta tccttctggt aagctgcaca 481 tgggccatgt gcgtgtctac accatcagcg acaccatagc acggttccag aagatgagag 541 ggatgcaggt catcaacccc atgggatggg atgcttttgg attgcctgct gaaaatgccg 601 cagtcgagag gaatctacat ccacaaagtt ggacacaaag taatattaaa cacatgagga 661 aacagcttga tcgtctgggc ctgtgtttca gctgggatag ggaaataact acgtgtttgc 721 cagattacta caagtggact cagtatctct ttattaaact gtatgaggct gggctggcct 781 atcaaaagga ggccctggtt aactgggacc cagtggatca aacagtgctt gccaatgagc 841 aggtggatga acatggctgt tcatggcgtt ctggagcaaa ggtggaacag aagtacctca 901 gacaatggtt tattaagaca accgcttatg caaaggccat gcaggacgcg ttggcagacc 961 ttccagaatg gtatggaata aaaggcatgc aagcccactg gattggggac tgtgtgggct 1021 gccacctgga cttcacatta aaggttcatg ggcaagccac gggcgaaaag ctgactgcct 1081 atacggccac ccctgaagcc atttatggca cctcccacgt ggccatctcg cccagccaca 1141 gactcctaca tgggcacagc tctctgaagg aagccttgag gatggccctt gtccctggca 1201 aagattgcct cacgcctgta atggctgtga acatgcttac ccagcaggag gtccctgtcg 1261 ttattttggc caaagctgac ttggaaggct ctctggattc aaaaatagga attcccagta 1321 ctagctcaga ggacaccatc ttagcccaaa ccctgggcct ggcctactct gaagtcattg 1381 aaactttgcc agatggcaca gagagactga gcagctctgc tgagttcaca ggtatgaccc 1441 ggcaggatgc ttttctagcc ctgactcaga aagcccgggg gaagagagtg ggtggagacg 1501 tgacaagtga taaactgaaa gactggctga tttcacggca gcggtactgg ggcacaccaa 1561 tccccattgt ccactgccca gtctgtggcc ccacacctgt gcccctggag gacttgcctg 1621 tgaccctgcc caacatcgcg tctttcactg gcaagggagg ccccccactg gccatggctt 1681 cagagtgggt gaactgctcc tgcccaaggt gcaagggagc agccaagaga gagacagaca 1741 cgatggatac ctttgttgat tctgcttggt actacttcag atacactgac cctcataatc 1801 cacacagccc ttttaacaca gcagtggccg attactggat gcctgtggat ttgtacattg 1861 gagggaaaga acatgccgtc atgcacttgt tctatgcaag attctttagt catttttgcc 1921 atgatcaaaa aatggttaaa catagggagc cttttcataa gctgctggcc caaggcctta 1981 tcaaggggca gacattccgc ctaccatctg gacagtatct acagagagag gaagtggatc 2041 tcacaggttc cgttcctgtt catgcaaaaa cgaaagagaa gttagaggtg acgtgggaga 2101 agatgagtaa gtccaaacac aacggggtgg acccagagga agttgtggag cagtatggga 2161 tcgacacgat tcggctctac atcctttttg ctgcccctcc tgagaaggat atcttgtggg 2221 atgtgaaaac tgatgctctc cctggggtgc tgagatggca acaacgactg tggaccttga 2281 caactcggtt tattgaggcc agggcttctg ggaagtctcc ccagcctcag ctgctgagta 2341 acaaggagaa agctgaggcc aggaagctct gggagtacaa gaactccgtc atctctcagg 2401 tgaccaccca tttcacagag gacttctcac tgaattctgc aatttctcag ctgatgggac 2461 tcagcaatgc cctctcgcaa gcctctcaga gcgtcattct ccacagcccc gagtttgagg 2521 atgctttgtg tgccctgatg gtaatggctg ctccactggc ccctcatgta acctcagaga 2581 tctgggcagg cctggcgctg gtgccgagga agctctgtgc ccactacact tgggatgcca 2641 gtgtgctgct ccaggcatgg cctgctgtgg acccggagtt cctgcagcag cctgaggttg 2701 tccagatggc agttctgatc aacaataaag cttgtggcaa aattcctgtg ccccaacaag 2761 ttgcccggga ccaggacaaa gtccacgaat ttgttcttca aagcgagctg ggtgtcaggc 2821 ttttgcaagg acgaagcatc aagaagtcct tcctttcccc gagaactgcc ctcatcaact 2881 tcctggtgca agattgacag ccaggaggct gcagctacca cgagggcctc tgaggaacct 2941 ccttccaggc ctgggatgag ggggcgatgt ctgctggccc aggggaaggg aaaagacaaa 3001 tgtcttgact gttgacctcg gtcctgtggc agactgcagt caacagtgtg cctctgtagt 3061 gtggcctggt gctggggtga aggtgagctg ggcaaaggag aaatatgagc tactgaggag 3121 ggggttggac atcctgcccc tcacccccca cccacactgc aggtagagga ggccatctga 3181 tcccatggga agccatcaga gacactgctg gtgggagcag gaaggagcag tgcccctcga 3241 gcagccagga agcctgcgga tctgggaaat ggctctgcct taggcacttc tcgggaattt 3301 gaggccagcc tgaggaactg caggactcag gtgcaatgtg ccagccactt ggaactgcta 3361 actgagcctc cagatggtag tgaatggtct ctttgccttc aggctggatg aggaagtcat 3421 ttaggaaatg ttcaaataac caatatgtgg aaatggacac agggatcttc tgaagttgct 3481 ttgaatcaaa aggcaggcag tgctggttcc tctgcctgtg tccccaccac tccccagctc 3541 tgtcatgcag gcctgtcctc cccaacccca gctggatgtg cctcccaggc ctgctgtggt 3601 tctgacacac aggatcccag gcaaggcacc acttcctcac atgaatgagg agcagcaagt 3661 cataaccact cccttgggta tacaatttgc tgtgtagtga agtggaacca ggctcaggct 3721 gctggtccca acctcagagc cccaccgcag cccagtaggg atgcagcacg ccccagaggg 3781 ctcatgtggg ccccagatgg caatgccacc attgttgatg tgactccaga gccagttatt 3841 aggaagagca agctcaccac agaggagtgg aactgaggcc ccccagatgt tgcctccggt 3901 gtccaagcca cagcggtctg gctgttggga agatggccag gaatggactc ataccattgg 3961 cacattaggc taatcctggt tttatgtgaa gtcagcaatt aagtgttccc actagaactg 4021 acctaagcca ctgattaata tttaatgagg gaaggtaggg gagaatctag ccattttata 4081 atgccagaaa tctatatatg ttatctgatg ccatttttct gaagtagcct cacatgtggt 4141 ccccctgcag ttcagcagtt aacagatgac ttttttagtg taataaaatg tttatcatct 4201 atg // LOCUS HUMORFIA 4272 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0029 gene, partial cds. ACCESSION D21852 NID g434768 KEYWORDS KIAA0029. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4272) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (11-NOV-1992) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 4272) AUTHORS Nomura,N., Miyajima,N., Kawarabayasi,Y. and Tabata,S. TITLE Prediction of new human genes by entire sequencing of randomly sampled cDNA clones JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 JOURNAL DNA Res. 1 (1), 27-35 (1994) MEDLINE 96051387 REFERENCE 4 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 (supplement) JOURNAL DNA Res. 1 (1), 47-56 (1994) MEDLINE 96051389 FEATURES Location/Qualifiers source 1..4272 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..382 gene 383..3298 /gene="KIAA0029" CDS 383..3298 /gene="KIAA0029" /codon_start=1 /db_xref="PID:d1005414" /db_xref="PID:g2104218" /translation="MRMSDTVTVKDETATMKDLEAEVKDTTRVENLIKSENYGKILVE KNEHCIENNIDLQEKIQIQLTQSFEKEEKPSKDEAEKEKASDKLPRKMLSRDSSQEYT DSTGIDLHEFLVNTLKNNPRDRMMLLKLEQEILDFIGNNESPRKKFPPMTSYHRMLLH RVAAYFGLDHNVDQSGKSVIVNKTSNTRIPDQKFNEHIKDDKGEDFQKRYILKRDNSS FDKDDNQVRIRLKDDRRSKSIEEREEEYQRARDRIFSQDSLCSQENYIIDKRLQDEDA SSTQQRRQIFRVNKDASGRSTNSHQSSTENELKYSEPRPWSSTDSDSSLRNLKPAVTK ASSFSGISVLTRGDSSGSSKSIGRLSKTGQPFINPDGSPVVYNPPMTQQPVRSQVPGP PQPPLPAPPQQPAANHIFSQQDNLGSQFSHMSLARQPSADGSDPHAAMFQSTVVLQSP QQSGYIMTAAPPPHPPPPPPPPPPPPPLPPGQPVPTAGYPASGHPVSQPVLQQQGYIQ QPSPQMPACYCAPGHYHSSQPQYRPVPSVHYNSHLNQPLPQPAQQTGYQVIPNQQQNY QGIVGVQQPQSQSLVSGQPNSIGNQIQGVVIPYTSVPTYQVSLPQGSQGIPHQTYQQP VMFPNQSNQGSMPTTGMPVYYSVIPPGQQNNLSSSVGYLQHPGSEQVQFPRTTSPCSS QQLQGHQCTAGPPPPPGGGMVMMQLSVPNNPQSCAHSPPQWKQNKYYCDHQRGQKCVE FSSVDNIVQHSPQLSSPIISPAQSPAPAQLSTLKTVRPSGPPLSIMPQFSRPFVPGQG DSRYPLLGQPLQYNPPAVLHGHIPNQQGQPGSRHGNRGRRQAKKAASTDLGAGETVVG KVLEITELPDGITRMEAEKLFGELFKIGAKIRWLRDPQSQPRRHPLCCGSGDNTANPE RSKPSDLASTYTVLATFPSISAAQNALKKQINSVNKFKLRTSKKHYDFHILERASSQ" 3'UTR 3299..4272 BASE COUNT 1309 a 1064 c 834 g 1065 t ORIGIN 1 tgccgctgga gccggtgtcc gggctggtga tggggttaat tccctttcgt aagactctta 61 cttgcaccca cccagccccg ccgtcgcccc gccgcgccgc gctccaaccg cctcctcctc 121 ctcagtaacg cgggccacgg aaaggtatga tatatttgat ccaagacagt ccattccagt 181 ccgggaatct acagtggtga caaggacatg ggactcctcc tgccagatta cagatggttc 241 actacagttg acatcctggc tgacaactgt gaaaaagaac cttggattat tttattttat 301 ttttgtggga caccacaatc ccaaatccaa aggacgcatc aggcttcaag ctccctgtag 361 aattcgaaaa taaccttttc taatgaggat gtctgatact gttactgtaa aagatgaaac 421 tgcaacaatg aaggatttgg aggcagaagt gaaagataca accagagttg aaaatcttat 481 caaatcagaa aactatggga agattttggt agagaagaat gaacattgta ttgagaacaa 541 tatagatttg caggagaaaa ttcagatcca gttaacacaa tcatttgaga aagaagagaa 601 gccctcaaaa gatgaagcag aaaaagaaaa ggccagtgat aagttgccca gaaaaatgtt 661 atcaagagat tccagtcaag aatacactga ttcaactggc atagatctac atgaattttt 721 agtaaataca ttaaaaaaca atcccaggga cagaatgatg ctgctgaaat tggaacaaga 781 aattttagat ttcattggta ataatgagtc tccacgtaaa aaattccccc caatgacatc 841 ttaccatagg atgctattac acagagtagc cgcttacttt ggattagacc acaatgttga 901 tcagagtggg aagtctgtca tagtaaacaa aactagcaat acaagaatac ctgatcagaa 961 atttaatgaa catattaagg atgataaagg tgaagacttt cagaaacgtt atatcctcaa 1021 gagagataac tctagctttg acaaagatga taaccaggtg agaatacgtt tgaaagatga 1081 cagaagaagc aaatctatag aagaaagaga agaagagtac cagagagcca gagaccgaat 1141 attttcccaa gattccctgt gttcccaaga gaattacatt attgacaaaa gactccaaga 1201 cgaggatgcc agtagtaccc agcagaggcg ccagatattt agagttaata aagatgcttc 1261 agggagatct acaaatagcc atcaaagcag cactgagaat gagttgaagt actcggaacc 1321 acgaccctgg agcagcacag attcagacag ctctcttcga aacctgaaac ctgctgtaac 1381 caaagccagc agcttcagtg gaatctcagt cctgacaaga ggtgatagtt ctggaagcag 1441 caaaagcata ggcaggcttt caaaaacagg tcagcccttc ataaacccag atgggagtcc 1501 agttgtgtat aatcctccta tgactcaaca accagttaga tcccaagtgc ctggacctcc 1561 acagccacct ctgccagccc cacctcaaca accagcagct aatcacattt tctcacagca 1621 ggataaccta gggtctcagt ttagccacat gagtcttgct cgccagccat ctgctgatgg 1681 ttctgaccct catgccgcca tgttccagtc cactgtggtt cttcagtctc cacagcagtc 1741 tggttatatc atgacagcag cccctccacc acatcctcct ccaccgccac caccaccacc 1801 tcctcctcct cccctaccac ctgggcagcc agtccctact gctggatatc ctgcctctgg 1861 tcatcctgtc agccagcctg tgctccagca gcagggatat attcagcagc catcaccaca 1921 gatgccagcc tgttattgcg ctccaggcca ctatcactcc agccaacctc agtatcgccc 1981 agtcccttct gttcattaca attcacatct aaaccaacca ctgccacaac ctgcgcagca 2041 gacaggttat caagttatac ccaaccagca gcaaaactac caaggaatag ttggagttca 2101 gcaaccccag agtcagagcc tagtcagtgg ccaacccaac agcattggaa atcagattca 2161 aggagtggtc atcccctata cttcagtgcc aacatatcag gtttcactgc ctcaaggttc 2221 tcaaggaatt ccccatcaga cttatcaaca gcctgttatg ttccctaatc agtctaatca 2281 aggatctatg cccacaacag gaatgcctgt ttactatagt gtcattccac ctggtcaaca 2341 aaacaattta agctcttcag taggttacct gcaacatcca ggatcagaac aagtacaatt 2401 tcctcgaacc acttcaccat gcagttccca gcagcttcaa ggccaccaat gtacagctgg 2461 accaccaccg ccacctggtg gggggatggt gatgatgcag ctcagtgtac caaacaatcc 2521 acaatcttgt gcccactcac ccccgcagtg gaaacaaaac aaatattact gtgatcacca 2581 gagaggacag aagtgtgtag aatttagcag tgtagacaat attgtccagc acagccctca 2641 actcagtagc cccattattt caccagctca gtcgccagca ccagctcagc tgtccaccct 2701 gaaaactgta cgtccctctg gaccaccact ttccatcatg ccccaatttt ctagaccttt 2761 tgtccccggg caaggagatt ccaggtatcc attacttggc cagccactgc agtacaatcc 2821 tcctgctgtt ctgcacggac acattccaaa ccaacagggt cagcctggca gcaggcatgg 2881 aaaccgagga aggagacaag ctaaaaaagc tgcatccaca gaccttggag caggagaaac 2941 agttgttggg aaggtcttgg aaattactga actaccagat ggaataactc gcatggaagc 3001 tgaaaagctt tttggggaac tctttaaaat tggcgccaag atccggtggc tccgggaccc 3061 ccagtcccaa ccacgtcgtc accccctctg ctgtggcagt ggggacaaca ctgccaaccc 3121 tgaacgctct aaacccagtg acttggcctc cacctacacc gtcttagcca cattcccctc 3181 catttcagct gcacagaatg cactgaagaa acaaattaac tcagttaaca agtttaagct 3241 gagaacaagc aagaagcact atgactttca cattttggaa agggcaagtt ctcagtaaca 3301 gccacctttg gacccttcgc ctttatggtt cccctgccct ctcccatctt tgattggctt 3361 ggtatttgga gcttctgtta acattataga gactcctagg atgtgtgttc atggcattat 3421 agcttttgaa gaaaggccag tgatccagca aagggggaaa aatatgcatt tcaccccaca 3481 tgactaggaa tccacatcag aatgatacag agttagcagg tttttctaag gaaatgccat 3541 tcaaatgcct cctaactttt atagttattt tgttttatat ttctaaattc ttgtatcaga 3601 tccaaagctc tattgtacag caaattattc ttcaaaatga ttataaccag ttgcaccctg 3661 tatttctttt tgcagccagc acaatgtgac ccaacttaaa atttggggga aaaagaatgc 3721 aggagtgaaa taaccaagtc aaaaccatgt actatctcct tgggggttag ggatgctaag 3781 aagagcccac aaatagagga ttactcttcc cctgaatctc taaactcaga aacaattacc 3841 aaaaaataca taactcttcc ttgtagggcc ctttccttat tcatttaggt agtgtgaaca 3901 ttaagtataa aataaattat gttcttaatg cctcttaaac cacttacatt caaaggggaa 3961 cagaaatcat tctaagcagg aaaatacttc cacttttttt ttttcaagta tctctctaat 4021 aactaaatgc cacttatttg cattctcctt gtggattttt tgtcacctaa ggaaatgcat 4081 ttgatgagtg ctggaaactt cttaagtgct ttacagtttg ttttcattgt ttgcagcgga 4141 tcactggaca tcaaagattc attgcactta tgaacaagga accttctttt caatttctgt 4201 gtaatttgca aggctgtaca atgtgtgctg atgcaagcct ttttcagttc aagagaataa 4261 atgtttacaa at // LOCUS HUMORFKA 4894 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0032 gene, complete cds. ACCESSION D25215 NID g517114 KEYWORDS KIAA0032. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4894) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (11-NOV-1992) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 4894) AUTHORS Nomura,N., Miyajima,N., Kawarabayasi,Y. and Tabata,S. TITLE Prediction of new human genes by entire sequencing of randomly sampled cDNA clones JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 JOURNAL DNA Res. 1 (1), 27-35 (1994) MEDLINE 96051387 REFERENCE 4 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 (supplement) JOURNAL DNA Res. 1 (1), 47-56 (1994) MEDLINE 96051389 FEATURES Location/Qualifiers source 1..4894 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..166 gene 167..3319 /gene="KIAA0032" CDS 167..3319 /gene="KIAA0032" /codon_start=1 /db_xref="PID:d1005483" /db_xref="PID:g517115" /translation="MLCWGYWSLGQPGISTNLQGIVAEPQVCGFISDRSVKEVACGGN HSVFLLEDGEVYTCGLNTKGQLGHEREGNKPEQIGALADQHIIHVACGESHSLALSDR GQLFSWGAGSDGQLGLMTTEDSVAVPRLIQKLNQQTILQVSCGNWHCLALAADGQFFT WGKNSHGQLGLGKEFPSQASPQRVRSLEGIPLAQVAAGGAHSFALSLSGAVFGWGMNN AGQLGLSDEKDRESPCHVKLLRTQKVVYISCGEEHTAVLTKSGGVFTFGAGSCGQLGH DSMNDEVNPRRVLELMGSEVTQIACGRQHTLAFVPSSGLIYAFGCGARGQLGTGHTCN VKCPSPVKGYWAAHSGQLSARADRFKYHIVKQIFSGGDQTFVLCSKYENYSPAVDFRT MNQAHYTSLINDETIAVWRQKLSEHNNANTINGVVQILSSAACWNGSFLEKKIDEHFK TSPKIPGIDLNSTRVLFEKLMNSQHSMILEQILNSFESCLIPQLSSSPPDVEAMRIYL ILPEFPLLQDSKYYITLTIPLAMAILRLDTNPSKVLDNWWSQVCPKYFMKLVNLYKGA VLYLLRGRKTFLIPVLFNNYITAALKLLEKLYKVNLKVKHVEYDTFYIPEISNLVDIQ EDYLMWFLHQAGMKARPSIIQDTVTLCSYPFIFDAQAKTKMLQTDAELQMQVAVNGAN LQNVFMLLTLEPLLARSPFLVLHVRRNNLVGDALRELSIHSDIDLKKPLKVIFDGEEA VDAGGVTKEFFLLLLKELLNPIYGMFTYYQDSNLLWFSDTCFVEHNWFHLIGITCGLA IYNSTVVDLHFPLALYKKLLNVKPGLEDLKELSPTEGRSLQELLDYPGEDVEETFCLN FTICRESYGVIEQKKLIPGGDNVTVCKDNRQEFVDAYVNYVFQISVHEWYTAFSSGFL KVCGGKVLELFQPSELRAMMVGNSNYNWEELEETAIYKGDYSATHPTVKLFWETFHEF PLEKKKKFLLFLTGSDRIPIYGMASLQIVIQSTASGEEYLPVAHTCYNLLDLPKYSSK EILSARLTQALDNYEGFSLA" 3'UTR 3320..4894 BASE COUNT 1302 a 1027 c 1077 g 1488 t ORIGIN 1 cgaaaacgga gaaaccccgg gtccggcgag aggggctgtg acagtcggag tcccaagctg 61 cggttcggct gctgccgaga actgcaaggt gtggaatatt tctggcttct agtccaatgc 121 caagtgtgtg acctgtggct acatgattcc ctgaaagata agaacaatgt tatgttgggg 181 atattggtct ctgggccaac ctggtatcag caccaacctg cagggaattg tggctgagcc 241 ccaggtgtgt gggttcatat ctgacagaag tgtcaaggaa gtggcctgtg ggggaaacca 301 ctctgtgttc ctgctggaag atggggaagt ttacacatgt ggtttgaaca ccaaggggca 361 actgggccat gagagggaag gaaacaagcc agaacaaatt ggagctctgg cagatcagca 421 tatcattcat gtggcatgtg gcgagtccca cagtctggcc ctcagtgacc gaggccagct 481 gttttcttgg ggtgcaggga gtgatggtca gctaggactc atgactactg aggattctgt 541 ggcagtgccc aggttaatac aaaagctgaa ccagcaaaca atattacaag tttcctgtgg 601 caactggcat tgcttggctc ttgcggctga tggccagttc ttcacctggg gaaagaacag 661 ccatgggcag cttggcttag ggaaggagtt cccctcccaa gccagcccac agagggtgag 721 gtccctggag gggatcccac tggctcaggt ggctgccgga ggggctcaca gctttgccct 781 gtctctctca ggagctgttt ttggctgggg gatgaataat gccgggcagc tagggctcag 841 tgatgaaaaa gatcgagaat ctccatgcca tgtaaaactc ttacgcacgc aaaaagttgt 901 ctatattagt tgtggagaag aacacacagc agttctcaca aagagtggag gtgtgtttac 961 ctttggcgct ggttcctgtg ggcaacttgg acacgactcc atgaatgatg aggttaaccc 1021 tagaagagtt ctagagctga tgggtagtga agtaactcaa attgcttgtg gcagacaaca 1081 taccctagcc ttcgtgcctt cttctggact catctatgca tttggttgtg gagcaagagg 1141 tcaattagga actgggcaca cttgtaatgt taagtgccca tctcctgtca agggttactg 1201 ggctgcccac agtggccagc tttcagcccg agctgatcgc tttaaatatc atatcgttaa 1261 gcagatcttc tctggaggag accagacttt tgtactttgc tccaaatacg agaattattc 1321 tcctgctgtt gacttcagga ctatgaacca agcacattat accagtttaa taaatgatga 1381 aaccatagca gtttggagac aaaaactctc agaacacaac aatgcaaata caatcaatgg 1441 tgttgttcag atattatctt ctgcagcctg ttggaatgga agttttcttg aaaaaaaaat 1501 tgatgaacat tttaaaacga gtcccaaaat ccctgggatt gacctgaact caactagggt 1561 gttatttgag aagttaatga actctcagca ctccatgatt ctagaacaga ttttgaacag 1621 ttttgaaagt tgtctgattc cccagttgtc aagctcacca ccagatgttg aagccatgag 1681 aatctattta atactacctg agtttcccct actccaggat tccaagtatt atataacatt 1741 gactattccc ttggctatgg ccattcttcg gctggataca aaccccagca aagtactaga 1801 taactggtgg tctcaggtat gcccgaaata tttcatgaag ctggtaaacc tctataaagg 1861 tgcagtcctt tatctactga ggggaagaaa gacattctta attcccgtac tgtttaacaa 1921 ttatatcaca gcagctctca aactcttgga gaagttatat aaggtaaatc ttaaagtgaa 1981 gcatgtggaa tatgatacat tttacattcc tgagatttcc aatctcgtgg acattcagga 2041 agactacctc atgtggttct tgcatcaagc agggatgaag gctagaccat caataataca 2101 ggatactgta acactttgtt cctacccttt catctttgat gcccaagcca agaccaaaat 2161 gttacagaca gatgctgaac tacagatgca ggtggcagtc aatggagcca acctgcagaa 2221 tgtcttcatg cttctcaccc tggagcctct gctggccaga agccccttcc tggtccttca 2281 cgttcgcagg aacaaccttg ttggagatgc cctaagagag ctgagcattc attctgatat 2341 tgatttgaaa aagcctctca aagtaatctt tgatggtgaa gaagcagtgg atgccggtgg 2401 tgttacaaag gaattttttc ttttgctgtt aaaagaactt ttgaatccca tctatggaat 2461 gtttacctac tatcaagatt caaatctctt gtggttttca gacacgtgtt ttgtagagca 2521 caactggttt cacttgattg gtataacctg tggactagct atctacaact ccactgtggt 2581 cgatctccac ttcccattgg ctctctacaa gaagttactc aatgtaaagc ctggcttgga 2641 agacttaaag gagttgtcac ccactgaagg aaggagtctc caagagcttt tagattaccc 2701 cggggaggat gtggaggaga ctttctgcct caacttcacg atctgccgag aaagctatgg 2761 agtgattgaa cagaagaagc tgatacctgg gggagataat gtaactgtgt gcaaggataa 2821 caggcaggaa tttgtggatg cttatgtgaa ttatgtcttc caaatctcag ttcatgaatg 2881 gtacacagcc ttctctagtg gcttcctaaa ggtgtgtggt ggcaaagtac ttgagctctt 2941 ccagccttca gaactgaggg ctatgatggt ggggaacagc aactacaact gggaagaact 3001 ggaagagact gccatctaca agggagatta ctcggccaca catcccactg taaaactatt 3061 ttgggaaaca tttcatgagt ttccattgga aaagaagaag aagtttctct tgttcctgac 3121 aggcagcgat cggattccca tctacggcat ggccagtctg cagattgtca tccagtccac 3181 agccagcggg gaggagtact tgccggtggc ccacacttgc tacaaccttc ttgacctccc 3241 caagtacagc agcaaagaga ttctgagtgc ccggctgacc caggcccttg acaactatga 3301 agggtttagt ttggcctgag gcttctcagc ttgtccagta tttcccttcg ttcctcagtg 3361 tccacattga ggcctataca gaaaatcatg gggagtgatt tctatttttt tattgtctaa 3421 gtgggttggg acttttaaat actgagcctg gttgatgtgt ttctgggatt gtatagcagt 3481 aaacaacctt tttgaaaaat tagaggttgg ggatggggtg aaaaattggc ccttgtatgg 3541 gaggtgtttt tgtttttgtt ttaaaccaaa ctacccagta ttccttgcac ttgtgaatgt 3601 gttgcactct gctggatgaa atggcagtgg atttttaaac tttaatttcc caaatgtctc 3661 tctcagccct gatgttttct cacagtgctt ccttgtcctt ctcttaactt ctcattcctc 3721 tataagaatg atttagactg acctgtcctt ttttatctgc gcatgcgaga acatcacctt 3781 cctctgtaca cttggaaatg cctctggctt gttgcagccc tcctttaacc caaaggagga 3841 aaggactgct tcagaaactc ccaattccaa aaagctgagt ctgggtccat tattttggca 3901 gaactcctaa gaatttatgg gagcctatat aaacatatct tgcttttaaa aagttcttga 3961 gggaatagca actttcccat ggctgtgcct atttcctaga ccttttaaaa gatgtgcaga 4021 gcagcttagc attcgttgca gctgagccta attttttctt gctcatcctt gtccctttga 4081 caataaggtt aattgataga cccaccacct cttgcactct cgcttttgga gcaagttgca 4141 ttaactattt tgagtctcta tattgtccaa gaaaagtaga aataataaat ttactttccc 4201 tttttctatc accttatgtc ctctaccatt ttctccttcc tcccttccct tattttctcc 4261 ttttcgtacc ctgtgtcctc cctgattttc ctttcgtttc ttctttattt tatcccattc 4321 tctgttactt gactcagtgc tcccttcctc tcctctcctt ctagtggatg catgcagcct 4381 ttttttcaat ttttatttaa attgcaaaat ttttactcag attttttttc ctcttcccta 4441 attgctaaga tttaaggacg ttctttatta tgaaacttta tcacattcga aatgtttgtt 4501 tacagtggga ttttaggggg gattgtgttt aaatcaaata tatgtatttt aaaaataatg 4561 acatgctcaa ccttcctcat catggagtaa gaaaattcta catgattaaa gaatccatgt 4621 aagtctaatt ttaaattcct agtaactaga gaaaagactt atttatataa aatgaagtat 4681 ttatgaactg tgataaagca tcaaatcttg atgaaggatt gtagattttt gctttttctt 4741 tttgttttta aaacttattc caattgctaa attggtagtt tttcagtctt tataaataca 4801 ggattaaaaa tatatataca gttatatgaa atgtttattt tctatgtgtg tgcatatagt 4861 tcaatattat gcaataaatt tggtgtttta actt // LOCUS HUMORFKG1A 6974 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0057 gene, complete cds. ACCESSION D31762 NID g498149 KEYWORDS KIAA0057. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6974) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (06-JUN-1994) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 6974) AUTHORS Miyajima,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nomura,N., Nagase,T., Miyajima,N., Sazuka,T., Tanaka,A., Sato,S., Seki,N., Kawarabayasi,Y., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. II. The coding sequences of 40 new genes (KIAA0041-KIAA0080) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 1 (5), 223-229 (1994) MEDLINE 96051398 FEATURES Location/Qualifiers source 1..6974 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..75 gene 76..1188 /gene="KIAA0057" CDS 76..1188 /gene="KIAA0057" /note="similar to human TRAMP protein." /citation=[3] /codon_start=1 /db_xref="PID:d1007111" /db_xref="PID:g498150" /translation="MAFRRRTKSYPLFSQEFVIHNHADIGFCLVLCVLIGLMFEVTAK TAFLFILPQYNISVPTADSETVHYHYGPKDLVTILFYIFITIILHAVVQEYILDKISK RLHLSKVKHSKFNESGQLVVFHFTSVIWCFYVVVTEGYLTNPRSLWEDYPHVHLPFQV KFFYLCQLAYWLHALPELYFQKVRKEEIPRQLQYICLYLVHIAGAYLLNLSRLGLILL LLQYSTEFLFHTARLFYFADENNEKLFSAWAAVFGVTRLFILTLAVLAIGFGLARMEN QAFDPEKGNFNTLFCRLCVLLLVCAAQAWLMWRFIHSQLRHWREYWNEQSAKRRVPAT PRLPARLIKRESGYHENGVVKAENGTSPRTKKLKSP" 3'UTR 1189..6974 BASE COUNT 1590 a 1800 c 1671 g 1913 t ORIGIN 1 cggtgctgga gaagtttgcg ctgcggttcg tgagcgcagg gtgcgggccc cgccggccgc 61 tgcgcgcccg ctgccatggc tttccgcagg aggacgaaaa gttacccgct cttcagccag 121 gagttcgtca tccacaacca tgcggacatc ggcttctgcc tggtgctctg cgtcctcatc 181 gggcttatgt tcgaggtcac agccaagact gcctttctat ttattttacc tcagtataac 241 attagcgtgc ctacagcaga cagtgagacc gtgcactacc actatggccc taaggacctg 301 gtcacaatct tgttctacat cttcatcacc atcatcttgc atgctgtggt tcaggagtac 361 attttagata aaatcagcaa acggcttcat ctctccaaag tcaaacacag caagttcaat 421 gaatctggac agctggtcgt ctttcatttc acctcggtga tttggtgctt ctacgtggtg 481 gtgacggaag gatacttaac aaacccaaga agcctctggg aagactaccc gcatgtgcac 541 ctccccttcc aggtgaagtt tttctaccta tgccagctgg cctactggct gcacgcactt 601 cctgagctat acttccagaa ggtacggaag gaggaaattc cccgccagct ccagtatatt 661 tgcctgtacc tggtgcatat agctggagca tacctcttaa acctgagccg cctgggcctg 721 atcttgctgc tgctgcagta ctcaactgag ttcctcttcc acacggctag actcttctac 781 tttgcagatg aaaacaacga gaaactgttc agtgcctggg ctgctgtttt tggggttacc 841 cgcctcttca tcctcaccct tgccgtgctg gccattggct ttggactggc tcgcatggaa 901 aaccaggcat ttgatcccga gaaagggaac ttcaacactt tgttttgcag gctctgcgtg 961 ctgctgctgg tgtgtgccgc ccaggcctgg ctcatgtggc gcttcatcca ctcccagctg 1021 cggcactggc gggaatactg gaatgagcag agtgcaaagc ggagagtccc agccacaccc 1081 agactaccag ccaggctcat caagagggaa tctggttacc atgaaaatgg agtggtgaag 1141 gcagagaacg gaacctcccc acggactaag aaactcaagt ctccctaagg ccaaagtgct 1201 aagaacagga atcctcttgg tgggggccga gcagggggca aggagcccag gccccctccc 1261 tgcctcctcc ttcctgcctg tgatgctccg tctcaaacag ccgaaacctg tcttgcaatg 1321 gggggagggg gcgtttcgct ttccttcttc ttggcttcct cttattcttc cacaaaccat 1381 tctcaataaa gccaaaaatc tttctctttc tccccctcag gccacctcct gtcctcactc 1441 ctgtcctgtg ctggcttttc tggaacgcca ggcgcccatg gctggcacct ttctgcttgc 1501 tctgtttctt gccttatggc tgctgctttt cctttttact tcctattttc accttatctt 1561 gcaatttttc tgtctgattt ttacaatggg aggggagcta agattgcagt cctgtccttc 1621 ggtcccccag ggcctgccgg tcagaagcct ggggctggta ggcccttggt ggtcctcatg 1681 tggatgggca agaagagagc ggccatctcg gatcataatc tccttggtgc tgattaactg 1741 acgagatata tgattccagt tctgcatgta ccatcttgag gcacagcagc cactgctcgt 1801 tgtaaatgcc aaggcatttg gctttgggac gtgacaactc aatccagaag gatggtgtga 1861 actcggttgg gtcccgtgac tcgagctcct accagtggct ggccgcggat tggaagccag 1921 cctgctgtcg ctctgtgggg aggacatgtc ttcccactgc ttagagcgag agcagagcaa 1981 actgcgcagc aggcacctcc agaaaggtaa tggtggcaga acccacagtg gagtcgacct 2041 aggcctttct ccagcagtcc cagtcgccat tgctttttca gccattcaca agcattcaaa 2101 accaaaccaa acagcagttc atatacctgc ctgagatagg ctggtcctca cctccagagc 2161 cagccagccc cgtcaggggc caaacttact accttgactt catctctagc tgcagaaaca 2221 ctaagtctca agggcttcag ccccatgctg gtcccttggt gttcagggag ggtcacttgg 2281 accgctgttc atctggccgc ccttgttgag tgttctttgg aattgtcgtt ttttgagcac 2341 aactacagca ttttagactg catgaaacca tgactgactg agagtcactc tctgggtaga 2401 tgataggcgc ctttctggcc ccttccctca cagattcttt cccctcccct ccacctgaag 2461 agaaggcctc caagtccttt tggtgccttg tgaggacttt tagaaggggc gttcagcttt 2521 aaaaagccgg tcctaattac ggccggacgc agtagcttac gcctgttatc ccagcacttt 2581 gggaggtcga ggtgggcaga tcacctgagg ttaggagttc aagaccagcc tggccaacat 2641 ggtgaaaccc catctctact aaaaatacaa aaaattaggt gtagtggcag gcacctgtaa 2701 tcccagctac tcgggaggct gaggcaggag aatcgcttga acctagaagg tggaggttgc 2761 agtgagcgga gattgtacca tggcactcca gcctggacaa caagagcgaa attctgtcta 2821 aaaaaacaaa agtcccaatt aagaacctcc gaactctgtt ttgaggcaaa ggggagtagt 2881 tcttggtagg tgcaggaata gtagtgtcat ttggaatact ggtcatcttt ctgacatcac 2941 agtagaaacc aaaccttgga tttagattca aaagggggga aatgggtctt ttcatcaagg 3001 caactcccct tctccaagtc acttacatca tagataaatt ttagcttccc agtaactgag 3061 ggatttgttt cctaacgcca ttggaggcct tcatccctct ctacgataag gttgcagaaa 3121 tgggaagagc tacccgtggt tgcttttgat tacccttagg aagtgagaca gtgtttttga 3181 aaatatgtat ttctcccatt tctccctctc cttccctgac acttctctgg gctgcacagc 3241 agaaacgttg gtaaaagggc agtttggttt caacacagca gacctgatat gggatccctt 3301 agccacttta gtcaaacagc cctgacagag tctataattg agttcaggcc ccccaccttg 3361 cctaataact gcaaatcgca tgttcagcca gcagcctcct aagcccacct tcctccccca 3421 ttagagaaca cccatcctag gtgctctcca ggctgtgtca ttggcagggc ttcacatgca 3481 ggaggcctct ctcaggtgag tccaggttaa actgttgagt tgtggcttca acagatatgt 3541 atggcatgct gggatgtgcc aggtgcctgc gttgtgccag ttgctggaga ggtagtgtga 3601 gcagagcagc tgaaatcttg ccatcaagca accctcattc tcatgcctgt aggtttccat 3661 tgctctgtcc caggacactt gcgtgccaga gacgccacaa cttcatgtcc ctgtctcttg 3721 caagctcccc gtgctgccag tacttcatgc cttggatgtg gtcccaccag cccagtggct 3781 ggggtcagct taggctctgc ttcccagtgg acgggtgtgc taagggttta ttttatgtaa 3841 aaaaaaaaaa aaaacaaaaa aaaaccctga gaccatgagt ggggctggca tcttgccagc 3901 ctgggcttca gggatgtttg gggggggtgg ttagagggta gttgtagggt actttgtcac 3961 ccccctcccc ctgccaccct ccctggcacg tttatttcac agcagagcca agtctgtggc 4021 aggttgacac agactgtgtt gccagagctg aaataattcc acttcatcct atgagcgtgt 4081 tgggctagct tgttctaatt ttggccactt tggctgtttt cttcagtttt atgcattctc 4141 tcctgcccca aagtgccaag ccatttgtga aggctctgcc agacacctcc aagcttgaga 4201 gctcagcacc atgcaccaag agcaggagaa aagacgtaaa cctaccccag caactgtggc 4261 ctctcgacag ccctggctaa ctaacttaca tttgtgggga agccaacaga cacagcagga 4321 ggagagggag gtggcgctgg tggaccaagg atctgtgcta cccgctcccc tccttggagg 4381 tgcagtgatg atgggagtta tttttaccat ccgggcgctg atagctgcac tattaataaa 4441 ttgcatgtgt tccttttgaa ggtaggggat ggttctgggt gagaggggag caggctgagc 4501 cggcggggga tctgctgtcc tcccttttga gtcagttcta atcccatgtg tgtctgggcc 4561 accagaccga aatggttgct gagaaacttg tctgttcatg tcccaaggca taacttccca 4621 acatttaaga aaccccaata gacacctctg ccctggccac gttcacagat ccttctcttg 4681 accggaaacc ctgggaccct aagaacccct gaagcttggg gtgggtgtgt gcttctgggg 4741 tctcttttgg gacctccttt gtcagtaccc cttctttttt ctaagcagct aataagaggt 4801 tgggtgaaag agtgcatctc ctcccaggat tccacaacaa aattcttatc ttccatggat 4861 gctttaattg gaagtgggtt gccgaccccc ttgtgcctag aaaaggcctt tgcttgggtt 4921 tcctttgtat gcttcagcct tcctagttgg tttttctagg cctggtgtga gaggtaggga 4981 agtctgcaca taactaattc ttttgcttaa gggcctatgg cacaagtgca caaacttcaa 5041 ttcttgatgt tctaagctct ctcctctaac agagggagtg ctgaaagctt ttgagtcaag 5101 acaatggagt gctcttcctc cctcactctg ccttccgagc ttatggttcc ttttctcagg 5161 agaggatttt caggattatt ggaggattag gtcattgtca gatgactgga aaacctaaat 5221 aggatctctc tccagctcaa ggttgtccca gtgaggaaga ctttaccaac ttctcactct 5281 accccactac tcacatgagt gttagctcca ccttgcaaag gctgaagacc agttctcccc 5341 agtgaaagct gcctcattct tttatggagt tccctggagt ggcagagcta taaagacgag 5401 cattgggatt tgcagtctcc atgtagcctt tcgtgcttgg caacccctgt agactttttg 5461 tcccaagcag attgcgtgcg tgcgcctgtg tgtgagaata agtgccttac tttgctgtgt 5521 ggttttcaac ttgtactccg tggccagccc ccagttgcca gggctcgacg gcagccaagg 5581 acaccatacc tcagtatagt tatatataaa atggacacgg attgtgacag tttcacccca 5641 tttgtttcta accccgctgc ccaggattag ggtctgtggt gtgttctgtt ttgtttttgg 5701 tttctccctt gtgtcagttc tcttctggcc cagctgggtg gctgtggaag tctgtgaggt 5761 ggcccaacca caagcatacc tattaagaga agcccagagc ttccagcccc cacttcgaaa 5821 actctcctct ggccccacat agcaaactcc ttctccgtta ttttccccac ccccagattt 5881 tttttaaaag gcccacttgc cataacctct tttggtctat tttgcttccc attcagccca 5941 aagtttatat gataaaggtg tttactttta cttcccagtc tccaagtgct aacacataaa 6001 cacatacatg tctgactgtt gcagaactgt tcgagctcct aattcagtgt taccttgttt 6061 tagtcgcagc aaccctctcc cctacccctt gcccgcccac gtttttctca ctcttccggg 6121 ttgtgcaata actctcccag ccagtggtcc tttccacagc ctttctgtcc cttaaaacac 6181 ctgcaactgg gggagaaatg ggacccatgg gagggggagt catcatccct tacacaagaa 6241 atagccactt tccttttgtt gtcattcttg tgatcctggg tgggtttctg tggcactctt 6301 ttagaacatg tagcatcatc ttagaggtct atttttaaaa aatgtgttga agaggaaaaa 6361 accattctca cgatggggct taagtcattg tccaggaata agattggcgt ggtgcccatg 6421 acatcaccgt cactctgcct aaaagcactc tagagctact tgttcacgtg gagaggaagg 6481 atattttgcg aagcaacagc cgcaggtgga gagccctgtt cacctgatag ggtctagctg 6541 tgacagtaaa tataataccg ctgtttcctt gggtacagat ttgagtgttc atgtgatgag 6601 actgtaaacc tcatttttcg gttcctctgt ttaaaaaaac atctgaagga tgaactaagg 6661 ctgctggtgc cctgagcaac tgataatgca aatgtggaca aagtgtctgt tttctactct 6721 agcctgttca tatggaccaa atttcaacaa ggaactcaag gaaaatttgt acctgccgta 6781 tttatgcttt catgtaaaaa agggttgggg ggaggggtgt ctttttgctt ttggtgaact 6841 ttttttcaaa atcatttttc cactgtttct gtctggtttt aaaacaaatt acagttttgt 6901 atggattttt taaatgtaca ttttggaaca aatgatcaaa tattttctga aataacaata 6961 aaaggcagaa aatt // LOCUS HUMORFKG1C 2043 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0064 gene, complete cds. ACCESSION D31764 NID g498153 KEYWORDS KIAA0064. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2043) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (06-JUN-1994) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 2043) AUTHORS Miyajima,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nomura,N., Nagase,T., Miyajima,N., Sazuka,T., Tanaka,A., Sato,S., Seki,N., Kawarabayasi,Y., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. II. The coding sequences of 40 new genes (KIAA0041-KIAA0080) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 1 (5), 223-229 (1994) MEDLINE 96051398 FEATURES Location/Qualifiers source 1..2043 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..222 gene 223..1635 /gene="KIAA0064" CDS 223..1635 /gene="KIAA0064" /citation=[3] /codon_start=1 /db_xref="PID:d1007113" /db_xref="PID:g498154" /translation="MHFSIPETESRSGDSGGSAYVAYNIHVNGVLHCRVRYSQLLGLH EQLRKEYGANVLPAFPPKKLFSLTPAEVEQRREQLEKYMQAVRQDPLLGSSETFNSFL RRAQQETQQVPTEEVSLEVLLSNGQKVLVNVLTSDQTEDVLEAVAAKLDLPDDLIGYF SLFLVREKEDGAFSFVRKLQEFELPYVSVTSLRSQEYKIVLRKSYWDSAYDDDVMENR VGLNLLYAQTVSDIERGWILVTKEQHRQLKSLQEKVSKKEFLRLAQTLRHYGYLRFDA CVADFPEKDCPVVVSAGNSELSLQLRLPGQQLREGSFRVTRMRCWRVTSSVPLPSGST SSPGRGRGEVRLELAFEYLMSKDRLQWVTITSPQAIMMSICLQSMVDELMVKKSGGSI RKMLRRRVGGTLRRSDSQQAVKSPPLLESPDATRESMVKLSSKLSAVSLRGIGSPSTD ASASDVHGNFAFEGIGDEDL" 3'UTR 1636..2043 BASE COUNT 427 a 561 c 594 g 461 t ORIGIN 1 ccctatccgg acaggtggct cttgcccttt agactacagt tcccagcatg cccaggcgat 61 tgcgtcccag aaccgacgtc ccaccgcctt cccacatcgg atcgcagggc tcccaaaatg 121 gcgagtgagg ctgcggggac tcgctgagca gcggaggggg agcgtgcaga gccgctgcgg 181 ccctcacagt ccggagcccg gccgtgccgt gccgtaggga acatgcactt ttccattccc 241 gaaaccgagt cccgcagcgg ggacagcggc ggctccgcct acgtggccta taacattcac 301 gtgaatggag tcctgcactg tcgggtgcgc tacagccagc tcctggggct gcacgagcag 361 cttcggaagg agtatggggc caatgtgctt cctgcattcc ccccaaagaa gcttttctct 421 ctgactcctg ctgaggtaga acagaggaga gagcagttag agaagtacat gcaagctgtt 481 cggcaagacc cattgcttgg gagcagcgag actttcaaca gtttcctgcg tcgggcacaa 541 caggagacac agcaggtccc cacagaggaa gtgtccttgg aagtgctgct cagcaacggg 601 cagaaagttc tggtcaacgt gctaacttca gatcagactg aggatgtcct ggaggctgta 661 gctgcaaagc tggatcttcc agatgacttg attggatact ttagtctatt cttagttcga 721 gaaaaagagg atggagcctt ttcttttgta cggaagttgc aagagtttga gctgccttat 781 gtgtctgtca ccagccttcg gagtcaagag tataagattg tgctaaggaa gagttattgg 841 gactctgcct atgatgacga tgtcatggag aaccgggttg gcctgaacct gctttatgct 901 cagacggtat cagatattga gcgtgggtgg atcttggtca ccaaggaaca gcaccggcaa 961 ctcaaatctc tgcaagagaa agtctccaag aaggagttcc tgagactggc ccagacgctg 1021 cggcactatg gctacttgcg ctttgatgcc tgtgtggctg acttcccaga aaaggactgt 1081 cctgtggtgg tgagcgcggg caacagtgag ctcagcctgc agctccgcct gcctggccag 1141 caactccgag aaggctcctt ccgggtcacc cgcatgcgat gctggcgggt cacctcctct 1201 gtaccattgc ccagtggaag cacgagcagc ccaggccggg gccggggtga ggtgcgcctg 1261 gaactggctt ttgaatacct catgagcaag gaccggctac agtgggtcac catcactagc 1321 ccccaggcta tcatgatgag catctgcttg cagtccatgg ttgatgaact gatggtgaag 1381 aaatctggcg gcagtatcag gaagatgctg cgccggcggg tggggggtac tctgagacgc 1441 tcagacagcc agcaagcagt gaagtcccca ccactgcttg agtcacctga tgccacccgg 1501 gagtctatgg tcaaactctc aagtaagctg agtgccgtga gcttgcgggg aattggcagt 1561 cccagcacag atgccagtgc cagtgatgtc cacggcaatt tcgccttcga gggcattgga 1621 gatgaggatc tgtaatctcc actgcttgga tgtctgccct ctaccccaga ggaatttaca 1681 gaaacttgcc ctgtgcctgt gtcccccatg ctaggggcgg aggggtcttt tccttcttct 1741 ttcctaccta ccccttttct cttggccagg ggcctcgtat cctacctttc cttgtcccct 1801 gggctggctg cacagaggat tgccccttct cttttcagag ctggccctcg atgccaaatt 1861 agcatttagt attttgcaca aagtctaagg gaccatggct gcctgccttg gggaggaacc 1921 atagctccct ctgggccgct tctggcctct tggagccatg ggccaaaggc caaggggatg 1981 ggcagaggtc tgtgtttggt ctggcccagt tccccatcat taaactcagc ctgactgctg 2041 cct // LOCUS HUMORFKG1F 1897 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0058 gene, complete cds. ACCESSION D31767 NID g505091 KEYWORDS KIAA0058. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1897) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (06-JUN-1994) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 1897) AUTHORS Miyajima,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nomura,N., Nagase,T., Miyajima,N., Sazuka,T., Tanaka,A., Sato,S., Seki,N., Kawarabayasi,Y., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. II. The coding sequences of 40 new genes (KIAA0041-KIAA0080) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 1 (5), 223-229 (1994) MEDLINE 96051398 FEATURES Location/Qualifiers source 1..1897 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..69 gene 70..576 /gene="KIAA0058" CDS 70..576 /gene="KIAA0058" /citation=[3] /codon_start=1 /db_xref="PID:d1007116" /db_xref="PID:g505092" /translation="MNSKGQYPTQPTYPVQPPGNPVYPQTLHLPQAPPYTDAPPAYSE LYRPSFVHPGAATVPTMSAAFPGASLYLPMAQSVAVGPLGSTIPMAYYPVGPIYPPGS TVLVEGGYDAGARFGAGATAGNIPPPPPGCPPNAAQLAVMQGANVLVTQRKGNFFMGG SDGGYTIW" 3'UTR 577..1897 BASE COUNT 478 a 455 c 391 g 573 t ORIGIN 1 ctccgaacag gaagaggacg aaaaaaataa ccgtccgcga cgccgagaca aaccggaccc 61 gcaaccacca tgaacagcaa aggtcaatat ccaacacagc caacctaccc tgtgcagcct 121 cctgggaatc cagtataccc tcagaccttg catcttcctc aggctccacc ctataccgat 181 gctccacctg cctactcaga gctctatcgt ccgagctttg tgcacccagg ggctgccaca 241 gtccccacca tgtcagccgc atttcctgga gcctctctgt atcttcccat ggcccagtct 301 gtggctgttg ggcctttagg ttccacaatc cccatggctt attatccagt cggtcccatc 361 tatccacctg gctccacagt gctggtggaa ggagggtatg atgcaggtgc cagatttgga 421 gctggggcta ctgctggcaa cattcctcct ccacctcctg gatgccctcc caatgctgct 481 cagcttgcag tcatgcaggg agccaacgtc ctcgtaactc agcggaaggg gaacttcttc 541 atgggtggtt cagatggtgg ctacaccatc tggtgaggaa ccaaggccac ctctgtgccg 601 ggaaagacat cacatacctt cagcacttct cacaatgtaa ctgctttagt catattaacc 661 tgaagttgca gtttagacac atgttgttgg ggtgtctttc tggtgcccaa actttcaggc 721 acttttcaaa tttaataagg aaccatgtaa tggtagcagt acctccctaa agcattttga 781 ggtaggggag gtatccattc ataaaatgaa tgtgggtgaa gccgccctaa ggattttcct 841 ttaatttctc tggagtaata ctgtaccata ctggtctttg cttttagtaa taaaacatca 901 aattaggttt ggagggaact ttgatcttcc taagaattaa agttgccaaa ttattctgat 961 tggtctttaa tctcctttaa gtctttgata tatattactt gttataaatg gaacgcatta 1021 gttgtctgcc ttttcctttc catcccttgc cccacccatc ccatctccaa ccctagtctt 1081 ccatttcctc ccgccagtct ccattgaatc aatggtgcag gacagaaagc cagtcagact 1141 aatttccttc tttcctcgca cttctcccca ctcgtcatct tttaactagt gtttcacaag 1201 gatcctctga aaccctctct gtgccccaag tacagatgcc attacttctg ctttcgtatc 1261 tcctcaggca aaagtggagg gtgccttatg ggccctcctc ataggttgtc tctgcataca 1321 cgaacctaac ccaaatttgc tttggtgcca gaaaaactga gctatgtttg aacaaagatg 1381 tcgtgcaaac tgtactgtga acaacagttg gtttaaaata tgaggggcaa ggaggaggat 1441 gcatttcaaa agcttgattg atgtgttcag agctaaatta agaggagttt tcagatcaaa 1501 aactggttac cattttttgt cagagtgtct gatgcggcca ctcattcggc tccccagaat 1561 tcctagactg ggttaatagg gtcatattgt gaatgtctca ctacaaaatg acttgagtcc 1621 agtgaaatct cattagggtt taagaatatt tcagggatcc ttaatgtttt gatttttgtt 1681 ttctgaaatt ggattttatt ttattttatc ttataatttc agttcatcta aattgtgtgt 1741 tctgtacatg tgatgtttga ctgtaccatt gactgttatg gaagttcagc gttgtatgtc 1801 tctctctaca ctgtggtgca cttaacttgt ggaattttta tactaaaaat gtagaataaa 1861 gactattttg aagatttgaa taaagtgatg aagttgc // LOCUS HUMORFKG1K 2494 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0050 gene, complete cds. ACCESSION D30758 NID g495679 KEYWORDS KIAA0050. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2494) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (19-MAY-1994) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 2494) AUTHORS Miyajima,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nomura,N., Nagase,T., Miyajima,N., Sazuka,T., Tanaka,A., Sato,S., Seki,N., Kawarabayasi,Y., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. II. The coding sequences of 40 new genes (KIAA0041-KIAA0080) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 1 (5), 223-229 (1994) MEDLINE 96051398 FEATURES Location/Qualifiers source 1..2494 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..188 gene 189..2411 /gene="KIAA0050" CDS 189..2411 /gene="KIAA0050" /note="similar to HUMORFU (D26069)" /citation=[3] /codon_start=1 /db_xref="PID:d1006988" /db_xref="PID:g488505" /translation="MTVKLDFEECLKDSPRFRASIELVEAEVSELETRLEKLLKLGTG LLESGRHYLAASRAFVVGICDLARLGPPEPMMAECLEKFTVSLNHKLDSHAELLDATQ HTLQQQIQTLVKEGLRGFREARRDFWRGAESLEAALTHNAEVPRRRAQEAEEAGAALR TARAGYRGRALDYALQINVIEDKRKFDIMEFVLRLVEAQATHFQQGHEELSRLSQYRK ELGAQLHQLVLNSAREKRDMEQRHVLLKQKELGGEEPEPSLREGPGGLVMEGHLFKRA SNAFKTWSRRWFTIQSNQLVYQKKYKDPVTVVVDDLRLCTVKLCPDSERRFCFEVVST SKSCLLQADSERLLQLWVSAVQSSIASAFSQARLDDSPRGPGQGSGHLAIGSAATLGS GGMARGREPGGVGHVVAQVQSVDGNAQCCDCREPAPEWASINLGVTLCIQCSGIHRSL GVHFSKVRSLTLDSWEPELVKLMCELGNVIINQIYEARVEAMAVKKPGPSCSRQEKEA WIHAKYVEKKFLTKLPEIRGRRGGRGRPRGQPPVPPKPSIRPRPGSLRSKPEPPSEDL GSLHPGALLFRASGHPPSLPTMADALAHGADVNWVNGGQDNATPLIQATAANSLLACE FLLQNGANVNQADSAGRGPLHHATILGHTGLACLFLKRGADLGARDSEGRDPLTIAME TANADIVTLLRLAKMREAEAAQGQAGDETYLDIFRDFSLMASDDPEKLSRRSHDLHTL " 3'UTR 2412..2494 BASE COUNT 506 a 752 c 774 g 462 t ORIGIN 1 ggggtgagag ctcctcctag gacacccctt tccccttggg gaaagaattg tgcccccagg 61 cccttccccg cggaggtccc tctcctcctt ccccctcatc tccccttcct gggacagaaa 121 gtgcctccac ctgcatcccc aggggcccgg cctccagggc ccgctggccc cacagcaggc 181 aagctgagat gacggtcaag ctggatttcg aggagtgtct caaggactca ccccgtttcc 241 gagcctctat tgagctggtg gaagccgaag tgtcagaatt ggagacccgt ctggaaaagc 301 tcctgaaact gggcactggt ctcctggaaa gtgggcgcca ttaccttgct gccagccgcg 361 ccttcgttgt cggcatttgt gacctggccc gcctgggtcc accagagccc atgatggcgg 421 agtgtctgga aaaattcacc gtgagcctga accacaagct ggacagccat gcggagcttc 481 tagatgccac ccaacacaca ctgcagcagc agatccagac cctggtcaag gaaggtctgc 541 ggggtttccg agaggctcgc cgggatttct ggcggggggc tgagagcctg gaggctgccc 601 tgacccacaa cgcagaggtt cccaggcgcc gggcccagga ggcagaagag gcaggagctg 661 ctttgaggac ggctcgagct gggtaccggg gacgggcact ggattatgcc ctgcagatca 721 acgtgattga ggacaagagg aagtttgaca tcatggagtt tgtgctgcgt ttggtggagg 781 cccaggctac ccatttccag cagggccatg aggagctgag ccggctgtcc cagtatcgaa 841 aggagctggg cgcccagttg caccagctgg tcttgaattc agcacgagag aagagggaca 901 tggagcagag acacgtgctg ctgaaacaga aggagctggg tggggaggag ccagaaccaa 961 gcttaagaga ggggcctggt ggcctggtga tggaaggaca tctcttcaaa cgggccagca 1021 acgcatttaa gacctggagc agacgctggt tcaccattca gagcaaccaa ctggtttacc 1081 agaagaagta caaggaccct gtgactgtgg tggtggatga ccttcgtctc tgcacagtga 1141 aactctgccc tgactcagaa aggcggttct gctttgaggt ggtgtccacc agcaagtcct 1201 gcctcctcca ggctgactca gagcgcctcc tgcagctgtg ggtcagtgct gtgcagagca 1261 gcattgcttc tgccttcagt caggctcgcc ttgatgacag cccccggggt ccaggccagg 1321 gctcaggaca cctggccata ggctctgctg ccaccctggg ctctggtgga atggccaggg 1381 gaagggagcc tgggggagtc gggcacgtgg tggcccaggt ccagagtgtg gatggcaatg 1441 cccagtgctg cgactgccgg gagccagccc cggagtgggc cagcatcaac cttggtgtca 1501 ccctctgcat tcagtgttcc ggcatccaca ggagccttgg tgttcacttc tccaaagtcc 1561 ggtctctgac ccttgactca tgggagccag aactagtgaa gctcatgtgt gagctgggaa 1621 atgtcatcat caaccagatc tatgaggccc gcgtggaggc catggcagtg aagaaaccag 1681 ggcccagctg ctcccggcag gagaaggagg cctggattca cgctaaatac gtggagaaga 1741 agttcctgac caagctgcct gagattcgag ggcgaagagg tggccggggg cgcccaaggg 1801 ggcagcctcc tgtgccccca aagccttcca tcaggccccg gccagggagc ttgagatcca 1861 agccagagcc cccctctgag gacctgggaa gcctgcaccc tggggcccta ctgtttcgag 1921 cgtctgggca tcctccatct cttcccacca tggctgatgc ccttgcccat ggagctgatg 1981 tcaactgggt caatgggggc caagataatg ccacaccgct gatccaggcc acagctgcta 2041 attctcttct ggcctgtgag tttctcctcc agaacggggc gaacgtgaac caagcggaca 2101 gtgcgggccg gggcccgctg caccacgcaa ccattcttgg ccacacgggg ctcgcctgcc 2161 tgttcctgaa acggggagct gatctggggg ctcgagactc tgaaggcagg gaccctctga 2221 ccatcgccat ggaaacagcc aacgctgaca tcgtcaccct gctacgactg gcaaagatga 2281 gggaggctga agcggcccag gggcaggcag gagatgagac gtatcttgac atcttccgcg 2341 acttctccct catggcgtca gacgacccgg agaagctgag ccgtcgcagt catgacctcc 2401 acacgctgtg acccgaggcc cacggggccc gcgcctgcct cccttccccg ccaccgggcc 2461 ctctgccatt aaagcctccg tgcttcgctc ttcc // LOCUS HUMORFKG1M 3168 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0063 gene, complete cds. ACCESSION D31884 NID g505095 KEYWORDS KIAA0063. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3168) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (16-JUN-1994) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 3168) AUTHORS Miyajima,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nomura,N., Nagase,T., Miyajima,N., Sazuka,T., Tanaka,A., Sato,S., Seki,N., Kawarabayasi,Y., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. II. The coding sequences of 40 new genes (KIAA0041-KIAA0080) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 1 (5), 223-229 (1994) MEDLINE 96051398 FEATURES Location/Qualifiers source 1..3168 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..279 gene 280..888 /gene="KIAA0063" CDS 280..888 /gene="KIAA0063" /citation=[3] /codon_start=1 /db_xref="PID:d1007254" /db_xref="PID:g505096" /translation="MSCVPWKGDKAKSESLELPQAAPPQIYHEKQRRELCALHALNNV FQDSNAFTRDTLQEIFQRLSPNTMVTPHKKSMLGNGNYDVNVIMAALQTKGYEAVWWD KRRDVGVIALTNVMGFIMNLPSSLCWGPLKLPLKRQHWICVREVGGAYYNLDSKLKMP EWIGGESELRKFLKHHLRGKNCELLLVVPEEVEAHQSWRTDV" 3'UTR 889..3168 BASE COUNT 791 a 774 c 773 g 830 t ORIGIN 1 gtcttacatt ttatgtgttt ttttaaattc tgcactttga cagtggtagc aaaataaaag 61 taatccaaaa gttatttttc tattgagaag agagaagagg tgggttttat ttttttcctt 121 ttgattccaa tcttttctcc cagtgacaaa gcataaatca tggacaccag agaataaggt 181 ggacctcagg aatccggaaa agactgagaa gcttacattc ttcctctaga gggaagagtg 241 gggtgtttat agccatctct ggaacctaaa acaaaaaaca tgagttgtgt gccatggaaa 301 ggagacaagg ccaaatctga atcattggag ctgccccagg cagcaccccc acaaatctac 361 catgagaaac agcgcaggga gctttgtgcc ctccacgccc tcaataacgt cttccaggac 421 agcaatgcct tcacccggga tacgctgcaa gagattttcc agaggttgtc tccaaacacc 481 atggtgacac ctcacaagaa gagcatgctg ggaaatggca actacgatgt gaatgtcatt 541 atggcagcac ttcagaccaa aggctatgaa gctgtttggt gggacaagcg cagggatgtc 601 ggtgtcattg ccctcactaa cgtcatgggc ttcatcatga atctgccctc cagcctatgc 661 tggggtccac tgaaactgcc cctcaaaagg cagcactgga tctgtgttcg agaggtggga 721 ggggcctact acaacctcga ctccaaactc aagatgcccg agtggattgg aggcgagagc 781 gagctcagga agtttctaaa acatcatttg cgaggaaaga actgtgaact cctgctggtg 841 gtaccagaag aggtagaggc tcatcagagt tggaggaccg atgtgtaaca gttctgccca 901 acctccctct cgcctcagcc ccttcagtcc tctgtgacgt gctgtggcct ctacagtggg 961 tctgcccttg ccacttcccc aaacatctca tcaagttttt ccccttcaga tctgacagtg 1021 caataggaca gacgtgtgga ctgttataag aactactcag tgttttgttc ctgggcaagg 1081 aaggtaggag ttctgtgcac ttaaggccag tggtcacaaa cccttgtttt atttaagaga 1141 cagaggagaa agtggagcgg ggagggaatc ctagcttatt ttcccttttc tatgaggact 1201 tgacacaggt tctgctgagt tgtcactgct gctccagact cacctagaga tgctgcctcc 1261 actttccatc ctgtctgggt ctgaaaacag tgggtctgca gatagtgccc acaaacccca 1321 tgtgactggt ttgaaggacc cagagcataa aggtctctca ggaaaccatg tccaaaaccc 1381 tagcagcggt acagcatgct gtctccaacc cttatcccca ggtttaaggg tggtttatgg 1441 ccatacgtgg aggttttttg ttgttgtttt tgagactgag tttcactctt gttgcccagg 1501 ctggagtgca atggcaccat ctcggctcac tgcaacctcc acctcctggt tcaagcgatt 1561 ctcaggcctc agcttcccaa gtagttggga ttacaggcgc ctgccaccac acctggctaa 1621 ttttgtattt ttagtagaga tggggtttct ccatgttggt caggctggtc tcaaactccc 1681 gacctcaagt gatctgcccg ccttggcctc ccaaagtgct gggattacag gcgtgagcca 1741 ccgcacccgg cagagtttta taatgaaaaa ttaactaata ttctagtatg aagtgaggag 1801 gatactgaac aggatgtggc taaagccaac ctgggacagc catggggtgg cttggtttct 1861 tcactccagt gttgtcccta ccatttcgca gcattgattt aggaggctct gggacaaaag 1921 agaagccaaa gagcagtttt cccagttcac tcactctggc aaaatcagga aaaaaaagtc 1981 tgcttttgac atcaaattcc actaatttgg ggcagcgttg ggtgaggaaa gtattgtgaa 2041 gacaggcttc ttggagtagg ggcagccaca attcagtaga cactctaggc tcggaggctg 2101 ccactgtagt tgccaagctc aggttgggtg gttctgtgct gtatggatgg aataggacct 2161 gggctggtca tcttcatgtc gtttcctctc tgtatcaatg gaagttcaac ccgcccctac 2221 ctcttcagat agttgtaggc cacttttctc ttgtaacttt ggaaaacaaa agaggagaaa 2281 taagtatcat accatatgcg tgtctccaaa gtggatgtgg ttgcctcaag gcaggtggca 2341 ggcaggggtg acctgctggc cctcagatca atggtcgtgg caggtctgag agctgtccca 2401 ctggccagac ttctctccag cagcaaagcc agcctggggc ttgcatgttg atcctgagca 2461 agcttaacgg ggtgaagctg ggctttctcc cccctgtgac tggagtgcat gttgacacca 2521 gcactttttc tgcacatgta tcttcaatcc aacaaggccg tttttttaat gctgagtaac 2581 aggccaccaa gcggctactg cgttatatct tctcagcaac cggccgcagt ctcttctgca 2641 ccattttcta caccagacct gcttggcacc acagggagct cttttcctgc cctgcacaat 2701 gacattccaa ccaccaccag ccagacatta cagccaacct tgctgattgt cacaagcagg 2761 accttggggc cactggcact gtcagatagt aagccatttc ttgggtagag gaggaaactc 2821 ctctccacaa atccacttgg gcctgtgcaa atggcacttg aaagagtccc catgcacttg 2881 gagtccatga gccaatggga tatgcaaaga cgcttaaaca tttcagggct ggtttctctg 2941 ttcatatcca attctggtgc ttaggaacag ggacccatgc tgatgcccaa gggcaaaaag 3001 ccccacttcc tttaaggaag tgaacaggcc tgaccctgat gcccaataac gggcaaccct 3061 aggctttttg tttttcttgc ttttattcct ttttgttgtt ggccttgtgc tgcgtttgtt 3121 tacaaaagat gtattttgtt taaccaaata ttaaaaatgg aaaactcc // LOCUS HUMORFKG1T 4333 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0067 gene, complete cds. ACCESSION D31891 NID g505109 KEYWORDS KIAA0067. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4333) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (16-JUN-1994) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 4333) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayashi,Y., Nagase,T., Ishikawa,K., Seki,T. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. II. The coding sequences of 40 new genes (KIAA 0041- KIAA 0080) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG1 JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nomura,N., Nagase,T., Miyajima,N., Sazuka,T., Tanaka,A., Sato,S., Seki,N., Kawarabayasi,Y., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. II. The coding sequences of 40 new genes (KIAA0041-KIAA0080) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 1 (5), 223-229 (1994) MEDLINE 96051398 FEATURES Location/Qualifiers source 1..4333 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..86 gene 87..3962 /gene="KIAA0067" CDS 87..3962 /gene="KIAA0067" /note="similar to G9a gene." /citation=[3] /codon_start=1 /db_xref="PID:d1007261" /db_xref="PID:g505110" /translation="MSSLPGCIGLDAATATVESEEIAELQQAVVEELGISMEELRHFI DEELEKMDCVQQRKKQLAELETWVIQKESEVAHVDQLFDDASRAVTNCESLVKDFYSK LGLQYRDSSSEDESSRPTEIIEIPDEDDDVLSIDSGDAGSRTPKDQKLREAMAALRKS AQDVQKFMDAVNKKSSSQDLHKGTLSQMSGELSKDGDLIVSMRILGKKRTKTWHKGTL IAIQTVGPGKKYKVKFDNKGKSLLSGNHIAYDYHPPADKLYVGSRVVAKYKDGNQVWL YAGIVAETPNVKNKLRFLIFFDDGYASYVTQSELYPICRPLKKTWEDIEDISCRDFIE EYVTAYPNRPMVLLKSGQLIKTEWEGTWWKSRVEEVDGSLVRILFLDDKRCEWIYRGS TRLEPMFSMKTSSASALEKKQGQLRTRPNMGAVRSKGPVVQYTQDLTGTGTQFKPVEP PQPTAPPAPPFPPAPPLSPQAGDSDLESQLAQSRKQVAKKSTSFRPGSVGSGHSSPTS PALSENVSGGKPGINQTYRSPLGSTASAPAPSALPAPPAPPVFHGMLERAPAEPSYRA PMEKLFYLPHVCSYTCLSRVRPMRNEQYRGKNPLLVPLLYDFRRMTARRRVNRKMGFH VIYKTPCGLCLRTMQEIERYLFETGCDFLFLEMFCLDPYVLVDRKFQPYKPFYYILDI TYGKEDVPLSCVNEIDTTPPPQVAYSKERIPGKGVFINTGPEFLVGCDCKDGCRDKSK CACHQLTIQATACTPGGQINPNSGYQYKRLEECLPTGVYECNKRCKCDPNMCTNRLVQ HGLQVRLQLFKTQNKGWGIRCLDDIAKGSFVCIYAGKILTDDFADKEGLEMGDEYFAN LDHIESVENFKEGYESDAPCSSDSSGVDLKDQEDGNSGTEDPEESNDDSSDDNFCKDE DFSTSSVWRSYATRRQTRGQKENGLSETTSKDSHPPDLGPPHIPVPPSIPVGGCNPPS SEETPKNKVASWLSCNSVSEGGFADSDSHSSFKTNEGGEGRAGGSRMEAEKASTSGLG IKDEGDIKQAKKEDTDDRNKMSVVTESSRNYGYNPSPVKPEGLRRPPSKTSMHQSRRL MASAQSNPDDVLTLSSSTESEGESGTSRKPTAGQTSATAVDSDDIQTISSGSEGDDFE DKKNMTGPMKRQVAVKSTRGFALKSTHGIAIKSTNMASVDKGESAPVRKNTRQFYDGE ESCYIIDAKLEGNLGRYLNHSCSPNLFVQNVFVDTHDLRFPWVAFFASKRIRAGTELT WDYNYEVGSVEGKELLCCCGAIECRGRLL" 3'UTR 3963..4333 BASE COUNT 1142 a 1085 c 1116 g 990 t ORIGIN 1 gtcgaggccg acccctgagt tgtgagtctg gggtctggtt ggtgaaaaag agcccttgaa 61 gctggaagac gggagaggac aaaagcatgt cttcccttcc tgggtgcatt ggtttggatg 121 cagcaacagc tacagtggag tctgaagaga ttgcagagct gcaacaggca gtggttgagg 181 aactgggtat ctctatggag gaacttcggc atttcatcga tgaggaactg gagaagatgg 241 attgtgtaca gcaacgcaag aagcagctag cagagttaga gacatgggta atacagaaag 301 aatctgaggt ggctcacgtt gaccaactct ttgatgatgc atccagggca gtgactaatt 361 gtgagtcttt ggtgaaggac ttctactcca agctgggact acaataccgg gacagtagct 421 ctgaggacga atcttcccgg cctacagaaa taattgagat tcctgatgaa gatgatgatg 481 tcctcagtat tgattcaggt gatgctggga gcagaactcc aaaagaccag aagctccgtg 541 aagctatggc tgccttaaga aagtcagctc aagatgttca gaagttcatg gatgctgtca 601 acaagaagag cagttcccag gatctgcata aaggaacctt gagtcagatg tctggagaac 661 taagcaaaga tggtgacctg atagtcagca tgcgaattct gggcaagaag agaactaaga 721 cttggcacaa aggcaccctt attgccatcc agacagttgg gccagggaag aaatacaagg 781 tgaaatttga caacaaagga aagagtctac tgtcggggaa ccatattgcc tatgattacc 841 accctcctgc tgacaagctg tatgtgggca gtcgggtggt cgccaaatac aaagatggga 901 atcaggtctg gctctatgct ggcattgtag ctgagacacc aaacgtcaaa aacaagctca 961 ggtttctcat tttctttgat gatggctatg cttcctatgt cacacagtcg gaactgtatc 1021 ccatttgccg gccactgaaa aagacttggg aggacataga agacatctcc tgccgtgact 1081 tcatagagga gtatgtcact gcctacccca accgccccat ggtactgctc aagagtggcc 1141 agcttatcaa gactgagtgg gaaggcacgt ggtggaagtc ccgagttgag gaggtggatg 1201 gcagcctagt caggatcctc ttcctggatg acaaaagatg tgagtggatc tatcgaggct 1261 ctacacggct ggagcccatg ttcagcatga aaacatcctc agcctctgca ctggagaaga 1321 agcaaggaca gctcaggaca cgtccaaata tgggtgctgt gaggagcaaa ggccctgttg 1381 tccagtacac acaggatctg accggtactg gaacccagtt caagccagtg gaacccccac 1441 agcctacagc tccacctgcc ccacctttcc cacctgctcc acctctatcc ccccaagcag 1501 gtgacagtga cttggaaagc cagcttgccc agtcacggaa gcaggtagcc aaaaagagca 1561 cgtcctttcg accaggatct gtgggctctg gtcattcctc ccctacatct cctgcactca 1621 gtgaaaatgt ctctggtggg aaacctggga tcaaccagac atatagatca cctttaggct 1681 ccacagcctc tgccccagca ccctcagcac tcccggcccc tccagcaccc ccagtcttcc 1741 atggcatgct ggagcgggcc ccagcagagc cctcctaccg tgctcccatg gagaagcttt 1801 tctacttacc tcatgtctgc agctatacct gtctgtctcg agtcagacct atgaggaatg 1861 agcagtaccg gggcaagaac cctctgctgg tcccgttact atatgacttc cggcggatga 1921 cagcccggcg tcgagttaac cgcaagatgg gctttcatgt tatctataag acaccttgtg 1981 gtctctgcct tcggacaatg caggagatag aacgctacct tttcgagact ggctgtgact 2041 tcctcttcct ggagatgttc tgtttggatc catatgttct tgtggaccga aagtttcagc 2101 cctataagcc tttttactat attttggaca tcacttatgg gaaggaagat gttcccctat 2161 cctgtgtcaa tgagattgac acaacccctc caccccaggt ggcctacagc aaggaacgta 2221 tcccgggcaa gggtgttttc attaacacag gccctgaatt tctggttggc tgtgactgca 2281 aggatgggtg tcgggacaag tccaagtgtg cctgccatca actaactatc caggctacag 2341 cctgtacccc aggaggccaa atcaacccta actctggcta ccagtacaag agactagaag 2401 agtgtctacc cacaggggta tatgagtgta acaaacgctg caaatgtgac ccaaacatgt 2461 gcacaaaccg gttggtgcaa catggactac aagttcggct acagctattc aagacacaga 2521 acaagggctg gggtatccgc tgcttggatg acattgccaa aggctctttt gtttgtattt 2581 atgcaggcaa aatcctgaca gatgactttg cagacaagga gggtctggaa atgggtgatg 2641 agtactttgc aaatctggac catatcgaga gcgtggagaa cttcaaagaa ggatatgaga 2701 gtgatgcccc ctgttcctct gacagcagtg gtgtagactt gaaggaccag gaagatggca 2761 acagcggtac agaggaccct gaagagtcca atgatgatag ctcagatgat aacttctgta 2821 aggatgagga cttcagcacc agttcagtgt ggcggagcta tgctacccgg aggcagaccc 2881 ggggccagaa agagaacgga ctctctgaga caacttccaa ggactcccac cccccagatc 2941 ttggaccccc acatattcct gttcctccct caatccctgt aggtggctgc aatccacctt 3001 cctccgaaga gacacccaag aacaaggtgg cctcatggtt gagctgcaat agtgtcagtg 3061 aaggtggttt tgctgactct gatagccatt catccttcaa gactaatgaa ggtggggagg 3121 gccgggctgg gggaagccga atggaggctg agaaggcctc cacctcagga ctaggcatca 3181 aggatgaggg agacatcaaa caggccaaga aagaggacac tgacgaccga aacaagatgt 3241 cagtagttac tgaaagctct cgaaattacg gttacaatcc ttctcctgtg aagcctgaag 3301 gacttcgccg cccacctagt aagactagta tgcatcaaag ccgaagactc atggcttctg 3361 ctcagtccaa ccctgatgat gtcctgacac tgtccagcag cacagaaagt gagggggaaa 3421 gtgggaccag ccgaaagccc actgctggtc agacttcggc tacagcggtt gacagtgatg 3481 atatccagac catatcctct ggctctgaag gggatgactt tgaggacaag aagaacatga 3541 ctggtccaat gaagcgtcaa gtggcagtaa aatcaacccg aggctttgct cttaaatcaa 3601 cccatgggat tgcaattaaa tcaaccaaca tggcctctgt ggacaagggg gagagcgcac 3661 ctgttcgtaa gaacacacgc caattctatg atggcgagga gtcttgctac atcattgatg 3721 ccaagcttga aggcaacctg ggccgctacc tcaaccacag ttgcagcccc aacctgtttg 3781 tccagaatgt cttcgtggat acccatgatc ttcgcttccc ctgggtggcc ttctttgcca 3841 gcaaaagaat ccgggctggg acagaactta cttgggacta caactacgag gtgggcagtg 3901 tggaaggcaa ggagctactc tgttgctgtg gggccattga atgcagagga cgtcttcttt 3961 agaggacagc cttcttccca acccttcttg aactgtcgtt tcctcaggaa ctgggtcttc 4021 ctgattgttg aaccctgacc cgaagtctct gggctagcta ctccccccag ctcctagttg 4081 atagaaatgg gggttctgga ccagatgatc ccttccaatg tggtgctagc aggcaggatc 4141 ccttctccac ctccaaaggc cctaaagggt ggggagagat caccactcta acctcggcct 4201 gacatccctc ccatcccata tttgtccaag tgttcctgct tctaacagac tttgttctta 4261 gaatggagcc tgtgtatcta ctatctccag tttgtattat ttcttgaaag tcttttaaca 4321 atatgataaa act // LOCUS HUMORFLA 5323 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0014 gene, complete cds. ACCESSION D25216 NID g434774 KEYWORDS KIAA0014. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5323) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (09-NOV-1993) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 5323) AUTHORS Miyajima,N. JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 JOURNAL DNA Res. 1 (1), 27-35 (1994) MEDLINE 96051387 FEATURES Location/Qualifiers source 1..5323 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..146 gene 147..1628 /gene="KIAA0014" CDS 147..1628 /gene="KIAA0014" /codon_start=1 /db_xref="PID:d1005484" /db_xref="PID:g434775" /translation="MHTLVFLSTRQVLQCQPAACQALPLLPRELFPLLFKVAFMDKKT VVLRELVHTWPFPLLSFQQLLQECAHCSRALLQERPSTESMQAVILGLTARLHTSEPG ASTQPLCRKHALRVLDMTGLLDDGVEQDPGTMSMWDCTAAVARTCIAQQQGGAAEPGP APIPVEVRVDLRVNRASYAFLREALRSSVGSPLRLCCRDLRAEDLPMRNTVALLQLLD AGCLRRVDLRFNNLGLRGLSVIIPHVARFQHLASLRLHYVHGDSRQPSVDGEDNFRYF LAQMGRFTCLRELSMGSSLLSGRLDQLLSTLQSPLESLELAFCALLPEDLRFLARSPH AAHLKKLDLSGNDLSGSQLAPFQGLLQASAATLLHLELTECQLADTQLLATLPILTQC ASLRYLGLYGNPLSMAGLKELLRDSVAQAELRTVVHPFPVDCYEGLPWPPPASVLLEA SINEEKFARVEAELHQLLLASGRAHVLWTTDIYGRLAADYFSL" 3'UTR 1629..5323 BASE COUNT 839 a 1764 c 1655 g 1065 t ORIGIN 1 cggggctgga ggcggtggct gcggttgcgg gaccggcact atgctgggcc ttcctaccac 61 ttgtgtgtgg cttggtagtg gcctagggtc tctcctccct gctgaagtcc ctctcctgca 121 ggtggccgtc tgcccggccc agcaccatgc acacgcttgt gttcttgagc acacggcagg 181 tgctgcagtg ccagccagct gcctgccagg ccctgcccct gctgccacgc gaactcttcc 241 ccctgctgtt caaggtggcc ttcatggaca agaagacagt ggtactgcgc gagttggtac 301 acacgtggcc cttcccgctg ctcagtttcc agcagctgct acaggagtgt gcccactgca 361 gccgtgccct cctgcaggag cggcctagca ctgagagcat gcaggctgtt atcctggggc 421 tgactgcccg gctccacacc tcagagcctg gggccagcac acagcccctc tgcaggaagc 481 atgcgctgcg ggtgctggac atgacgggcc tcttggatga tggtgtggaa caggatcctg 541 gcaccatgag catgtgggac tgtactgctg ccgtagctcg cacatgcatt gcccagcagc 601 agggtggggc cgcagagcct gggccagccc ccatccccgt ggaggtgcgc gtggacctgc 661 gggtgaaccg ggcctcctat gcgttcctgc gggaggcact ccgaagcagc gtgggcagcc 721 cgctgcggct ctgctgccgg gacctgcgag ctgaggacct gcccatgcgc aacactgtgg 781 ccctgctgca gcttctggat gcaggctgcc tgcgccgcgt ggacctgcgc ttcaacaatc 841 tgggcctgcg cggcctgtct gtgatcatcc cacacgtggc ccgcttccag cacctggcca 901 gcctgcggct ccactatgtg catggggatt caaggcagcc ctccgtggat ggcgaggaca 961 acttccgcta cttccttgcc cagatgggcc gcttcacctg tctgcgtgag ctcagcatgg 1021 gctcctctct cctttcaggg aggctggacc agctgctcag caccctgcag agccccctgg 1081 agagcctgga gttggccttc tgtgctctgc tgcctgagga cctacgcttc ctggcacgga 1141 gcccacatgc tgcccacctc aagaagttgg acctgagtgg taacgacctg tctggcagcc 1201 agctggcacc cttccagggt ctgttgcagg catcagcagc cacactgttg catctggagc 1261 tgactgagtg tcagctcgca gacacccagc tgttggccac actacccatc ctgactcagt 1321 gcgccagtct ccggtacctt ggcctctatg gcaacccact gtccatggcg ggcctcaagg 1381 agctgctgcg ggactcagtg gcacaggctg agctgcgtac tgtggtgcac cccttccctg 1441 tggactgcta tgagggcttg ccctggccgc cgcctgcctc tgtcctgctg gaggcctcca 1501 tcaatgagga gaagtttgcc cgcgtagaag ctgagttgca ccagctgctt ctagcctcag 1561 gccgtgccca tgtgctctgg accacggaca tctacgggcg actggctgcg gactacttca 1621 gcctatgatg aagtagctct gggtgagaca caggccgccc tgcagtctct ttaggtaggc 1681 agggcctttg ctgggacccc tggtggaggc cttcacaaaa gcactggtta ctggtttcct 1741 gctgggtcta ccttgcttct gggcacacct caagcctccc ctgctttctg cagtgcccca 1801 cgcggttttc cctgcacttg ctccataatt ggctgatcat ctgtgggccc cggggctgga 1861 tgtcaggcct ccattgccct gctcagtttg gctgcatttg gctgccgtct ggggtcctgg 1921 tcctttgtgc aaatgctttg ggattccagt tgtgagctga gagagatgat ggcctctctg 1981 ggcctttcct ctccctttac tgagagctca gtgcttctgg ggttgaagtt ggacagaggc 2041 ctgcttcagg gaagctggga gtccccaggc actcacgcct cttacgtgtt ccctaccctg 2101 ccacccagcc accccactgg gctgggctcg ggctggaggg ggtcatcaag gtacacatgt 2161 gcctggaagt tgaatttgtg gctgtttttt tttctgtcca cgttggtcac ccttatcctt 2221 atctctgctg tcacccccaa catggccacc gggcaacaac tgccatccag cctgtcgccc 2281 cgcccttcgc ggggcagccc cgtcggcact gccggccagt ccttgctctt cccacctttc 2341 ggaggcccaa gatcctactg tggccggcca gggccagcga gggacccccc ccatgcagag 2401 ctggaggttg gggtgatgtc ttttcggaag agcttcaagg gaggtgttgg ggcctccccg 2461 gccaccttcc attgctaccc caggattccc gagtgcaacg ttcccggctc gcgccccaca 2521 cacggctcag cgcacactgc gcggcttcca cctttactga cggagcatgc gcgaggccgc 2581 accggccaat ctccggcgcc cacgtcatcc gcgcgcccgc ggccctagca gtggatctcg 2641 taggcgaccg gcgggggcac gcggagtccc ggccccgccc cctgttccgg gccgcagtca 2701 gcgggcgcct ccgccggacc ctcggcgaag agcggcttgg agcggttgat gacgaacatc 2761 tcgtggccgc gctcgtcgcg gagctcctct agctgtgcga acgtacaggg gccgtccaag 2821 tagtcgttga cgaacagcgc tccctccccc ggaggccccc gcgccttttt tcgcctgcgg 2881 cgccggcgac agatcatggc gaccaggagc agcgccgtga gcgccagcag cgcgatggcc 2941 gccgcaatgg ccgtctgtgt ggccacgccc agggcgcgga aggccatgct gcccgcctcg 3001 ggccggggct cgctgccggc ggggcgggcg gccggaggcg gcggttgcgc gggctgctgc 3061 ggctgctgcc gggacgcgtt gaccaggagc cggaagggca cgcgggcagc gccgccggcg 3121 ttggaggcct cgcactcgta cttaccggcg tgcgccagcg tgatgttgct gaggaagagc 3181 atgccgctgc ccgtgtcgga tgccgagtgt ccgcccaggc ccagcaaccc gccttctagc 3241 tgggcctggg ctcgcggccg gccctcgcga ggctggggca cctttctcca ggtcaccaat 3301 ggctgcgggt agccggaggc ttggcaggca acccgcaggt cctcacccag gttggctgtg 3361 agctccagcg gctgcacgtg gacagagggc ggaatgcaga tgaggctgct gtgggatacg 3421 tccaggagac tctggagcgc caggcgcggg ggctctgcac acatgatctt cctgtccctg 3481 gaggtgagca gccgctggcc gccctccttg atccaggccc ccagccagtg cagggcgcag 3541 tcacagcgcc atgggttctc tgtgggagag cagcgttagg caggtggctt gagggtgctg 3601 ctaaaacagc ctgtgcagtt ggggttttgc aggccaggac agaggcctct ttcccacctc 3661 ccacagcgtt ttcacacgga gtccaaggcc ctgccacccc ttccttgacc ccaagctcct 3721 tggggcggca ggcccttcac cctcgccccc ctccccttta gctctgtgat gcctgcctgt 3781 tacagatcac ttctccgtcg gtctctgaga aagcacctgc tccttaagtc ttcctgcaac 3841 aagtgccact gtttttagga acctgggcgt ccacatagac atctcaccag cactgaaacc 3901 tcacaagtcc tctcagcttt gcctttggat gccctctctt gggaatgtcc ccagtcctgg 3961 tcagctgtct ctctcctttg caattttgtc tgcctcccct cagcctaaaa gtgtgcagaa 4021 ccctcaattc tgttaagtca ccctgtggag ttcctgtctt ctgttttccc caggcagggt 4081 gcctgagctg tatcccccag cacacccact cccgcagccc tccagtgtgg ctgcaggcgg 4141 tggtgcagcc ttccagactg ctgcccagtt gcctgatgtc agagcccctc cacacatgag 4201 cctgctccct actgccaaca ccgtggccca gacagagacg ctttccgagg aagaggtacc 4261 tgtgaggcgc aggacttgca gactggccag gggctgcagg gcctctcggc tgatggtgcc 4321 cagctggttc ctgctgaggt ccagcagtgc tagggaggac agccccgcta gagcctggtc 4381 ctccagcagc tcaatgctgt tttcttgcag gtgaagctcc tgcagtcgct gaagtaagga 4441 cagcagatcg tgaggaaaaa gggcgccgag gttgggggca tgtctctctt cttaccaagc 4501 tagactgggt tgccttttct aactattcca gccctacagg gcgaggggcc ataatggagt 4561 atcccgcccc tttagacccc aggcgctcac cggcaggtgc aagaaggtga aatccagcag 4621 ccgcgccagc tggttgcccg ccaggtagag cacgcgcagc tgggccaggc ctacgaaggc 4681 gccgctgcgc aagccgcgca gccggttgct agtgagcgcc agctccagca ggcgcggctg 4741 cgcgcggaag gcgccggcct ccagggcgcg caggctgttg ttgtgcaggt agagccggcg 4801 cagagcggcg agtggcgcca gggctcccgg ctctaggcgg gcgatgttgt tgtcctgcag 4861 gaacagtgtc tgcaggccgg ggaaagagga ggcgcttacc ccgctgcggg ggtcttcctc 4921 tccctggggg caccccgtcc tcccgcagct ccacacggtg cccacctgcg tccctggcgg 4981 gattcccagc gggacgacgc gcaaccgcag ggcgccacac tccaccgtgg cgctgtagca 5041 gcggcaggct gctgggcagc cggcggcgcg gagcggcagt agtagcagca gcagcggcag 5101 cagttcgggg gccctcaggg ccatctcccg aggcccggtt cctcaccggc ccttccgcgg 5161 ttcagccgca gacgcgtgcc ctcctgaaac acaggttggc aggccagtct cggcagtcga 5221 gagccagcca atagatggaa tggaggcctg cacctgcgtc taacttttga cactataaat 5281 aggttcaaga aactaataaa acgttctggt tttctccttt gac // LOCUS HUMORFO 2535 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0036 gene, complete cds. ACCESSION D25278 NID g434780 KEYWORDS KIAA0036. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2535) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (11-NOV-1992) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 2535) AUTHORS Nomura,N., Miyajima,N., Kawarabayasi,Y. and Tabata,S. TITLE Prediction of new human genes by entire sequencing of randomly sampled cDNA clones JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 JOURNAL DNA Res. 1 (1), 27-35 (1994) MEDLINE 96051387 REFERENCE 4 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 (supplement) JOURNAL DNA Res. 1 (1), 47-56 (1994) MEDLINE 96051389 FEATURES Location/Qualifiers source 1..2535 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..156 gene 157..1953 /gene="KIAA0036" CDS 157..1953 /gene="KIAA0036" /codon_start=1 /db_xref="PID:d1005508" /db_xref="PID:g434781" /translation="MKPTGTDPRILSIAAEVAKSPEQNVPVILLKLKEIINITPLGSS ELKKIKQDIYCYDLIQYCLLVLSQDYSRIQGGWTTISQLTQILSHCCVGLEPGEDAEE FYNELLPSAAENFLVLGRQLQTCFINAAKAEEKDELLHFFQIVTDSLFWLLGGHVELI QNVLQSDHFLHLLQADNVQIGSAVMMMLQNILQINSGDLLRIGRKALYSILDEVIFKL FSTPSPVIRSTATKLLLLMAESHQEILILLRQSTCYKGLRRLLSKQETGTEFSQELRQ LVGLLSPMVYQEVEEQKLHQAACLIQAYWKGFQTRKRLKKLPSAVIALQRSFRSKRSK MLLEINRQKEEEDLKLQLQLQRQRAMRLSRELQLSMLEIVHPGQVEKHYREMEEKSAL IIQKHWRGYRERKNFHQQRQSLIEYKAAVTLQRAALKFLAKCRKKKKLFAPWRGLQEL TDARRVELKKRVDDYVRRHLGSPMSDVVSRELHAQAQERLQHYFMGRALEERAQQHRE ALIAQISTNVEQLMKAPSLKEAEGKEPELFLSRSRPVAAKAKQAHLTTLKHIQAPWWK KLGEESGDEIDVPKDELSIELENLFIGGTKPP" 3'UTR 1954..2535 BASE COUNT 807 a 485 c 551 g 692 t ORIGIN 1 cgctgtagtg cggcgcccca ggttctttag tggaagaacg cgaagcgagg atgagtgatc 61 cgtggaggca gtaacaggcg cggcgaggga gaagtgattc ccgaagaatc aaggctgggc 121 cggacccggt ggcctggcaa cagggtaata agagaaatga agccaacagg tacagaccca 181 aggatcttat ctatagctgc tgaagttgca aaaagccctg agcagaatgt ccctgttata 241 ctgttgaagt taaaagaaat aataaacatc acacctttag gaagctcaga gttgaagaaa 301 atcaaacaag atatatattg ttatgatctc attcaatatt gcctcttggt cctcagtcaa 361 gattattctc gaatccaggg tggttggact acaatttccc agcttacaca gatattaagc 421 cattgctgtg tgggcttgga gccaggagaa gatgcagagg aattttacaa tgaattactt 481 ccatcagctg cagaaaattt tctagttttg gggagacaat tacaaacatg ttttatcaat 541 gcagctaagg ctgaagaaaa agatgaatta ctacactttt tccaaattgt gactgattct 601 ctcttctggc ttttgggagg ccatgttgaa cttattcaga atgtactaca aagtgatcat 661 ttcttacatt tactgcaagc tgacaatgtc caaataggat ctgcagtcat gatgatgcta 721 cagaatatat tacagatcaa cagtggtgat ttactcagaa taggaagaaa agccctgtat 781 tcaattttag atgaagttat tttcaagctt ttttcaactc ctagtccagt tataagaagt 841 actgctacaa aactcctact gttgatggct gaatcccatc aggaaatttt gattttactg 901 agacaaagta cctgctacaa aggactcaga cgtctactaa gtaaacagga aactgggact 961 gaattcagtc aagaacttag acagcttgtt ggccttttaa gcccaatggt ctatcaggaa 1021 gtagaagagc agaaactaca tcaagcagca tgcttgattc aagcctattg gaagggtttt 1081 cagacaagaa agagattaaa gaagcttcca tctgctgtga ttgctttgca gaggagtttc 1141 agatccaaac gatcaaagat gttgctggag ataaataggc agaaggaaga agaggacctc 1201 aaattacaat tgcaacttca aagacagaga gccatgagac tttcccgaga attgcagctg 1261 agtatgctcg aaatagttca tccaggtcag gtggagaaac actatcggga aatggaagag 1321 aaatcagcac tgattatcca gaaacattgg agagggtaca gggaaaggaa aaattttcac 1381 caacagaggc agtctctcat agagtataaa gcagctgtca cacttcaaag agcagcgctt 1441 aaattcctag cgaagtgccg taagaaaaag aaactatttg ctccttggcg aggactccaa 1501 gaactcactg atgcacgccg agttgaactg aagaaacgag tggatgacta tgtcagaaga 1561 catttgggct ctccaatgtc agatgtggtc agtagggagc tccatgccca agctcaagaa 1621 cgactgcaac actactttat gggcagggcc ctagaagagc gagcccagca gcacagagaa 1681 gctctgatag cacagatcag caccaacgtt gaacagctaa tgaaggcacc aagtctgaag 1741 gaggcagaag ggaaagaacc tgagctcttc ctaagtagat ccaggcctgt ggcagccaag 1801 gccaagcagg cccatctcac aaccctgaag cacatacaag caccctggtg gaagaagctt 1861 ggagaagaat ctggagatga gattgatgtt ccaaaggatg agcttagtat agaattagaa 1921 aatttattca ttggtggaac caaaccacct tagtgagtaa ccctaagaat tgacacaaat 1981 ctcatatttt aggagattat attggttctg cctctggcat gctggtagac tagggccatc 2041 ctaacttatt attttccaga ggttctcctc cagacaagac ctgcagtaag caaagagtta 2101 tattctacct ctctctcaat tttctttttc ttttctctgt atcctcatcc ttagccacac 2161 acagatttgt gtggctttta ttgtagaact aaacttagca tagtgttctg ttgtttacat 2221 gaagtgtgtt tttctttggt ttcttctgtt ttccaactaa atattttttt ctaaataaat 2281 attttcaaca attgatttga aaaatttgtc aggattattt caacttttca catttgttat 2341 ctgaaattcc tatttcctgt taacatagga ggtgtgtgca gactttatta atgtgaggaa 2401 aagaaatgct caattgaagg acatttccct gttttctata aagcaatggt tgaactcatt 2461 ttctattttg ttatttctaa aaggaactgc ataccaaaaa aatgcattct ttctattaaa 2521 ctgtgagaac tacat // LOCUS HUMORFQ 6196 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0037 gene, complete cds. ACCESSION D25538 NID g436217 KEYWORDS KIAA0037. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6196) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (11-NOV-1992) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 6196) AUTHORS Nomura,N., Miyajima,N., Kawarabayasi,Y. and Tabata,S. TITLE Prediction of new human genes by entire sequencing of randomly sampled cDNA clones JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 JOURNAL DNA Res. 1 (1), 27-35 (1994) MEDLINE 96051387 REFERENCE 4 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 (supplement) JOURNAL DNA Res. 1 (1), 47-56 (1994) MEDLINE 96051389 FEATURES Location/Qualifiers source 1..6196 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..265 gene 266..3508 /gene="KIAA0037" CDS 266..3508 /gene="KIAA0037" /citation=[3] /codon_start=1 /db_xref="PID:d1005562" /db_xref="PID:g436218" /translation="MPAKGRYFLNEGEEGPDQDALYEKYQLTSQHGPLLLTLLLVAAT ACVALIIIAFSQGDPSRHQAILGMAFLVLAVFAALSVLMYVECLLRRWLRALALLTWA CLVALGYVLVFDAWTKAACAWEQVPFFLFIVFVVYTLLPFSMRGAVAVGAVSTASHLL VLGSLMGGFTTPSVRVGLQLLANAVIFLCGNLTGAFHKHQMQDASRDLFTYTVKCIQI RRKLRIEKRQQENLLLSVLPAHISMGMKLAIIERLKEHGDRRCMPDNNFHSLYVKRHQ NVSILYADIVGFTQLASDCSPKELVVVLNELFGKFDQIAKANECMRIKILGDCYYCVS GLPVSLPTHARNCVKMGLDMCQAIKQVREATGVDINMRVGIHSGNVLCGVIGLRKWQY DVWSHDVSLANRMEAAGVPGRVHITEATLKHLDKAYEVEDGHGQQRDPYLKEMNIRTY LVIDPRSQQPPPPSQHLPRPKGDAALKMRASVRMTRYLESWGAARPFAHLNHRESVSS GETHVPNGRRPKSVPQRHRRTPDRSMSPKGRSEDDSYDDEMLSAIEGLSSTRPCCSKS DDFYTFGSIFLEKGFEREYRLAPIPRARHDFACASLIFVCILLVHVLLMPRTAALGVS FGLVACVLGLVLGLCFATKFSRCCPARGTLCTISERVETQPLLRLTLAVLTIGSLLTV AIINLPLMPFQVPELPVGNETGLLAASSKTRALCEPLPYYTCSCVLGFIACSVFLRMS LEPKVVLLTVALVAYLVLFNLSPCWQWDCCGQGLGNLTKPNGTTSGTPSCSWKDLKTM TNFYLVLFYITLLTLSRQIDYYCRLDCLWKKKFKKEHEEFETMENVNRLLLENVLPAH VAAHFIGDKLNEDWYHQSYDCVCVMFASVPDFKVFYTECDVNKEGLECLRLLNEIIAD FDELLLKPKFSGVEKIKTIGSTYMAAAGLSVASGHENQELERQHAHIGVMVEFSIALM SKLDGINRHSFNSFRLRVGINHGPVIAGVIGARKPQYDIWGNTVNVASRMESTGELGK IQVTEETCTILQGLGYSCECRGLINVKGKGELRTYFVCTDTAKFQGLGLN" 3'UTR 3509..6196 BASE COUNT 1405 a 1649 c 1689 g 1453 t ORIGIN 1 tgaggaactg cgtgtggagt cagcccagtc tggatgcaca ggaggatgct ggcggcacag 61 tgagtgaggc ctggtgccag agctgtgcgg accccttgtt ggccatggag cagcaggccc 121 agaggccctc tccccagccc tgcttgcctg cctcggagag gacagaggcc taggcccacg 181 ggggagggtg ttggcagaca gatgccctcc aggccctggg gcctccttaa cggcccctta 241 acgacacgcg tgccaagggt ggaggatgcc agccaagggg cgctacttcc tcaacgaggg 301 cgaggagggc cctgaccaag atgcgctcta cgagaagtac cagctcacca gccagcatgg 361 gccgctgctg ctcacgctcc tgctggtggc cgccactgcc tgcgtggccc tcatcatcat 421 tgccttcagc cagggggacc cctccagaca ccaggccatt ctgggcatgg cgttcctggt 481 gctggcggtg tttgcggccc tctctgtgct gatgtacgtc gagtgtctcc tgcggcgctg 541 gctcagggcc ttggcgctgc tcacctgggc ctgcttggtg gcgctgggct atgtgctggt 601 gttcgacgca tggacaaagg cggcctgtgc gtgggagcag gtgcccttct tcctgttcat 661 tgtcttcgtg gtgtacacac tactgccctt cagcatgcgg ggcgctgtcg ccgttggggc 721 cgtctccact gcctcccacc tcctggtgct cggttctttg atgggaggct tcacgacacc 781 cagtgtccgg gtggggctgc agctgctggc caacgcagtc atcttcctgt gtgggaacct 841 gacaggcgcc ttccacaagc accaaatgca ggatgcgtcc cgggacctct tcacctacac 901 tgtgaagtgc atccagatcc gccggaagct gcgcatcgag aagcgccagc aggagaacct 961 gctgctgtca gtgcttccgg cccacatctc catgggcatg aagctggcca tcatcgaacg 1021 gctcaaggag catggtgacc gtcgctgcat gcctgacaac aacttccaca gcctctacgt 1081 caagaggcac cagaatgtca gcatcctcta tgcggacatc gtgggcttca cgcagctggc 1141 cagcgactgt tctcccaagg agctggtggt ggtgctgaat gagctctttg gcaagttcga 1201 ccagatcgcc aaggccaacg agtgcatgcg aatcaagatc ctcggcgact gctactactg 1261 tgtatcgggc ctgcccgtgt cgctgcctac ccacgcccgg aactgcgtga agatggggct 1321 ggacatgtgc caggccatca agcaggtgcg ggaggccacg ggcgtggaca tcaacatgcg 1381 tgtgggcata cactcgggga atgtgctgtg cggggtcatc gggctgcgca agtggcagta 1441 tgacgtgtgg tcccacgacg tgtccctggc caaccggatg gaggcagccg gagtacccgg 1501 ccgggtgcac atcacggagg ccacgctaaa gcacctggac aaggcgtacg aggtggagga 1561 tgggcacggg cagcagcggg acccctacct caaggagatg aacatccgca cctacctggt 1621 catcgacccc cggagccagc agccaccccc gcccagccaa cacctcccca ggcccaaggg 1681 ggacgcggcc ctgaagatgc gggcgtcagt gcgcatgacc cggtacctcg agtcctgggg 1741 ggcggcacgg ccctttgcac atctcaacca ccgtgagagc gtgagcagtg gtgagaccca 1801 cgtccccaac gggcggaggc ctaagagcgt tccccagcgc caccgccgga ccccagacag 1861 aagcatgtcc cccaaggggc ggtcggagga tgactcgtac gatgacgaga tgctgtcagc 1921 cattgagggg ctcagctcca cgaggccctg ctgctccaag tccgatgact tctacacctt 1981 tgggtccatc ttcctggaga agggctttga gcgcgagtac cgcctggcac ccatcccccg 2041 ggcccgccac gactttgcct gcgccagcct gatcttcgtc tgcatcctgc tcgtccatgt 2101 cctgctcatg cccaggacgg cggcactggg tgtgtccttc gggctggtgg cctgtgtact 2161 ggggctggtg ctgggcctgt gctttgccac caagttctcg aggtgctgcc cagctcgggg 2221 gacgctctgc actatctctg agagggtgga gacacagccc ctgctgaggc tgaccctggc 2281 cgtcctgacc atcggcagcc tgctcactgt ggccatcatc aacctgcccc tgatgccttt 2341 ccaagttcca gagctgcctg ttggcaatga gacaggccta ctggccgcga gcagcaagac 2401 aagagccctg tgtgagcccc tcccgtacta cacctgcagc tgtgtcctgg gcttcatcgc 2461 ctgctcggtc ttcctgagga tgagcctgga gccaaaggtt gtgctgctga cagtggccct 2521 ggtggcctac ctggtgctct tcaacctctc cccatgctgg cagtgggact gctgcggcca 2581 aggcctgggc aacctcacca agcccaacgg caccaccagt ggcaccccta gctgttcctg 2641 gaaggacctg aagaccatga ccaatttcta cctggtcctg ttctacatca ccctgcttac 2701 actctccaga cagattgact attactgccg cttggactgc ctatggaaga agaagttcaa 2761 gaaggagcac gaggagtttg agaccatgga gaacgtgaac cgccttcttc tggagaacgt 2821 cctgccagcc cacgtggctg cccactttat cggtgacaag ttaaacgagg actggtacca 2881 tcagtcctat gactgcgtct gtgtcatgtt tgcctccgtg ccggacttca aagtgttcta 2941 cacagagtgc gatgtcaaca aagaagggct ggagtgccta cgcctgctca atgagatcat 3001 tgccgacttc gacgagctcc tactgaagcc caagttcagc ggcgtggaga agatcaagac 3061 catcggcagc acgtacatgg cagctgcagg gctcagcgtc gcctcagggc acgagaacca 3121 ggagctggag cggcagcatg cccacattgg tgtcatggtg gagttcagca tcgccctgat 3181 gagtaagctg gacggcatca acaggcactc cttcaactcc ttccgcctcc gcgtcggcat 3241 aaaccatggg cctgtgattg ctggagtgat tggggcccga aaacctcagt atgacatctg 3301 gggaaacact gtcaatgtgg ccagccgaat ggaaagcact ggagaacttg ggaaaatcca 3361 ggttaccgag gagacctgca ccatcctcca gggcctcggg tactcttgtg aatgccgtgg 3421 cctgatcaac gtcaaaggca aaggcgagct gaggacttac tttgtctgta cggacactgc 3481 caagtttcag gggctggggc tgaactgagg gctcctgctg gattccgaaa aggccgggaa 3541 gccagtctcc ttccctgaag caagcccagg agaagactct ccgccccacg ccaatcccaa 3601 aggcatgcag atggctgtgc atgttggctt ctttggacct gcactggagg atttctcaga 3661 cacatgcacc agattctggc tcgaagcagc cactgagcca taatgcgcag gggaggccag 3721 aagctctgtg cctggtctgt aacagtttcc aggccagctg gagaatgttc actggttcgg 3781 ggctgacttt gagatctttg ttccctgagg tgccaggcag gcaactttag cacatgatga 3841 aaacagactt ccacctcagt ggcctgtggg cacgcacaag tgaggtctgt ttttctagac 3901 accaaggggg agtaagctga gctgtctagc acggattgga gactccctct ccctggtggg 3961 cctggcaatg acagcatttc tcacagaggc attctggtaa atgaagctga aaggggtgtt 4021 ttacatctgt aaacggtttc aaacaggtag agagaaaaac accacaatta acactgttac 4081 tttttgcctt gtctggcatg tttgttttaa atgaatacat taatggggtt tttatccttt 4141 tgaatgactt ttcagacact agacataaat ctcttccctc cagtgtatgc tctgcctttt 4201 taaccactga catgtaagga ggactactgt ctagcatcag cttatggggt cagctggctg 4261 tggggataga gtcctgagga atgtggtcac agcaagaagg cggggagcag cagagccttg 4321 cctttgaatg aggcagcttg tgaggcaagc attctggaga gaggtgcttt gaaagtaagg 4381 tgcggccttt cacctcttcc ttgattactc acacatcttt gcgttctccc ctgccgtcct 4441 tcaactgtat cttacttttc ttaccagaaa ggaatggagt ctgtttagag acaacttgga 4501 caacctgtga gtgcatctct tctttccttt agtcttcaca gctaactctg gagagcttca 4561 aaactagaag gatctactcc gcatgggtgc atgcagaggc tcctggatct gggaagcccg 4621 ccccctcaca aatgctgagc cgttcttgct ctgaaactgc gtgagtcaag gcaaatgcaa 4681 aaagccaggt tttggggatg tgtcttactg tgcttcaact tcccaaggaa ttgaaagtca 4741 acctaactgt aacaacaggg tgagaaatga ccaaactgcc cgtgactttt tctgaatgga 4801 cttcataacc ggaagactta accggtggcc tcatcaccag agcatcgcca ggatttctaa 4861 tgcactcagt ttccctacat agcagggatt cttagctagg tgtccccatg aaccccgtaa 4921 agttctacac aaagtcttgc atacaggagc ctttacaaga tgattataca gggttgcaga 4981 ttgggtgact gaccagactt gttggggtcc tgggatgagt tgccccgggc tgcaaattaa 5041 gagtacagct aagtgcgggg gtggcggtgg agggaacgaa aattgaacct gtctgcctgt 5101 gctgtgtcgt gtggctttat cagcccgagg aagggcaggt gtattctaat ttgcacaaag 5161 gtgctgggta gactagtggc agctctcatg tgctgcacat aagtggaatc agtatgaata 5221 gaagaacttg ctgtataaag gaatttcatg gcaacaatgc tggtaagggc aattagcctc 5281 gcttaagttg ccttttttac acaccaaaac tttttacatg aagggctggt ttcacatgaa 5341 tactatactg aaatctgtgc cacaccaaaa ctttttacat gaagggctgg tttcacatga 5401 atactatact gaaatctgtg ctctcaagat ctagcagtga ccagggctgc ccggcggggg 5461 ctctcctggc aagtcaggaa ggtttctgtt gctaatataa catagaaaca cattagtgca 5521 ctgggcctct ctgaggtcag catatttgta ctcttggaat atttgttttt ttcttcagta 5581 acaacagaaa ccccagttgg gagtttaaca aataactgac taccactcac tcatgcattt 5641 ttatttccaa ttaaagcaaa gcactgtgct gtgctcagat aataatagtt tgtaagtaaa 5701 agtttttagt tttcagtgtt caggttatag aatataactg accataaaaa ttacctgcag 5761 gtattttctt tttatgaact tgtttttaaa ttaccaagta attactggtg tcattttgtt 5821 ttatgacaga cacacgtatc taacaaacaa acaaacagtg accttctcca tgggtcaagg 5881 acttccttac aatttctcct gagttaactt ttgtgaaaat aatacctaag gttttctggc 5941 ttattgagga aatttcctaa caaacaaaca aacaaacaaa cagaagagaa gatcattaac 6001 cactgtatac tttgtgtata taataggtca gtgtaaagaa atatgatttg aggtggtgca 6061 tgcaagtaac tagggtttat tctatataat gaatatttat agatctgtaa catttgtttc 6121 aaaatgctgt ttcattttta taaagtacca gtgtttagct gctttttata cattaaatta 6181 gcaatttgaa aaactc // LOCUS HUMORFW 6586 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0042 gene, complete cds. ACCESSION D26361 NID g452516 KEYWORDS KIAA0042. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6586) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (27-DEC-1993) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 6586) AUTHORS Miyajima,N. JOURNAL Unpublished (1996) REFERENCE 3 (sites) AUTHORS Nomura,N., Nagase,T., Miyajima,N., Sazuka,T., Tanaka,A., Sato,S., Seki,N., Kawarabayasi,Y., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. II. The coding sequences of 40 new genes (KIAA0041-KIAA0080) deduced by analysis of cDNA clones from human cell line KG-1 JOURNAL DNA Res. 1 (5), 223-229 (1994) MEDLINE 96051398 FEATURES Location/Qualifiers source 1..6586 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR 1..439 gene 440..5386 /gene="KIAA0042" CDS 440..5386 /gene="KIAA0042" /codon_start=1 /db_xref="PID:d1005934" /db_xref="PID:g452517" /translation="MSLHSTHNRNNSGDILDIPSSQNSSSLNALTHSSRLKLHLKSDM SECENDDPLLRSAGKVRDINRTYVISASRKTADMPLTPNPVGRLALQRRTTRNKESSL LVSELEDTTEKTAETRLTLQRRAKTDSAEKWKTAEIDSVKMTLNVGGETENNGVSKES RTNVRIVNNAKNSFVASSVPLDEDPQVIEMMADKKYKETFSAPSRANENVALKYSSNR PPIASLSQTEVVRSGHLTTKPTQSKLDIKVLGTGNLYHRSIGKEIAKTSNKFGSLEKR TPTKCTTEHKLTTKCSLPQLKSPAPSILKNRMSNLQVKQRPKSSFLANKQERSAENTI LPEEETVVQNTSAGKDPLKVENSQVTVAVRVRPFTKREKIEKASQVVFMSGKEITVEH PDTKQVYNFIYDVSFWSFDECHPHYASQTTVYEKLAAPLLERAFEGFNTCLFAYGQTG SGKSYTMMGFSEEPGIIPRFCEDLFSQVARKQTQEVSYHIEMSFFEVYNEKIHDLLVC KDENGQRKQPLRVREHPVYGPYVEALSMNIVSSYADIQSWLELGNKQRATAATGMNDK SSRSHSVFTLVMTQTKTEFVEGEEHDHRITSRINLIDLAGSERCSTAHTNGDRLKEGV SINKSLLTLGKVISALSEQANQRSVFIPYRESVLTWLLKESLGGNSKTAMIATISPAA SNIEETLSTLRYANQARLIVNIAKVNEDMNAKLIRELKAEIAKLKAAQRNSRNIDPER YRLCRQEITSLRMKLHQQERDMAEMQRVWKEKFEQAEKRKLQETKELQKAGIMFQMDN HLPNLVNLNEDPQLSEMLLYMIKEGTTTVGKYKPNSSHDIQLSGVLIADDHCTIKNFG GTVSIIPVGEAKTYVNGKHILEITVLRHGDRVILGGDHYFRFNHPVEVQKGKRPSGRD TPISEGPKDFEFAKNELLMAQRSQLEAEIKEAQLKAKEEMMQGIQIAKEMAQQELSSQ KAAYESKIKALEAELREESQRKKMQEINNQKANHKIEELEKAKQHLEQEIYVNKKRLE METLATKQALEDHSIRHARILEALETEKQKIAKEVQILQQNRNNRDKTFTVQTTWSSM KLSMMIQEANAISSKLKTYYVFGRHDISDKSSSDTSIRVRNLKLGISTFWSLEKFESK LAAMKELYESNGSNRGEDAFCDPEDEWEPDITDAPVSSLSRRRSRSLMKNRRISGCLH DIQVHPIKNLHSSHSSGLMDKSSTIYSNSAESFLPGICKELIGSSLDFFGQSYDEERT IADSLINSFLKIYNGLFAISKAHEEQDEESQDNLFSSDRAIQSLTIQTACAFEQLVVL MKHWLSDLLPCTNIARLEDELRQEVKKLGGYLQLFLQGCCLDISSMIKEAQKNAIQIV QQAVKYVGQLAVLKGSKLHFLENGNNKAASVQEEFMDAVCDGVGLGMKILLDSGLEKA KELQHELFRQCTKNEVTKEMKTNAMGLIRSLENIFAESKIKSFRRQVQEENFEYQDFK RMVNRAPEFLKLKHCLEKAIEIIISALKGCHSDINLLQTCVESIRNLASDFYSDFSVP STSVGSYESRVTHIVHQELESLAKSLLFCFESEESPDLLKPWETYNQNTKEEHQQSKS SGIDGSKNKGVPKRVYELHGSSPAVSSEECTPSRIQWV" 3'UTR 5387..6586 BASE COUNT 2261 a 1151 c 1424 g 1750 t ORIGIN 1 ctggggagcc ggcgctggag gtggtgagtg gcgtggggac tgtgtcgagg gggtccccaa 61 ggtgccggac cctgcggagg ggcgaagttt cggcactggg gagggcgtgc ggacgctttc 121 cctacaggcg accactgctc tgcgggcggg tggtcttagc tccagtcccc cattcagttc 181 ctcagcattc caggtcggcg gcgaaggggt ccccgaacga agggcgcaag gcagcgtctc 241 tgctgggacc gggaagccgg acttcagggc ctctcggccc gtgggcttct ccccgagtct 301 ccccgagtcg gttggcatta agagtttagc agatactttc agaaatggat acataagaaa 361 tggctggaaa tcaaatgaat gtccaaagaa gagcttaggg tcttagtaac attctttttt 421 aaaataactg tctgccaaaa tgtcattaca cagtactcat aatagaaata acagcggtga 481 tattcttgat attccttctt cccaaaatag ttcatcactg aatgccctca cccacagtag 541 ccgacttaag ctgcatttga agtcggatat gtcagaatgt gaaaatgatg atccattatt 601 gagatctgca ggtaaagtca gagacataaa tagaacttat gttatttctg ccagtagaaa 661 aacagcagac atgcccctta cccctaatcc tgtaggtaga ttggcacttc agaggagaac 721 tacaaggaac aaagaatcat ctttgcttgt tagtgagttg gaagacacaa ctgaaaaaac 781 agcagaaaca cgtcttacat tacaacgtcg tgctaaaaca gattctgcag aaaagtggaa 841 aacagctgaa atagattctg tcaaaatgac actgaatgtg ggaggtgaaa cagaaaataa 901 tggtgtttct aaggaaagta gaacaaatgt aaggattgta aataatgcta aaaactcttt 961 tgttgcctct tctgtacctt tagatgaaga tccacaggtc attgaaatga tggctgataa 1021 gaaatacaaa gaaacatttt ctgcccccag tagagcaaat gaaaatgttg cacttaagta 1081 ctcaagtaat agaccaccca ttgcttccct gagtcagact gaagttgtta gatcaggaca 1141 cttgacaacg aaacctactc agagcaagtt ggatatcaaa gtgttgggaa caggaaactt 1201 gtatcataga agtattggga aggaaattgc aaaaacttca aataaatttg ggagcttaga 1261 aaaaagaaca cctacaaaat gtacaacaga acacaaactg acaacaaagt gcagcctgcc 1321 tcagcttaag agcccagctc catcaatact gaagaataga atgtctaacc ttcaagttaa 1381 acaaagacca aaaagttcct ttcttgcaaa taaacaggaa agatccgcag aaaatacaat 1441 tcttcccgaa gaagaaactg tagttcagaa cacctctgca ggaaaagacc ccttaaaagt 1501 agagaatagt caagtgacag tggcagtacg cgtaagacct ttcaccaaga gagagaagat 1561 tgaaaaagca tcccaggtag tcttcatgag tgggaaagaa ataactgtgg aacaccctga 1621 cacgaaacaa gtttataatt ttatttatga tgtttcattc tggtcttttg atgaatgtca 1681 tcctcactac gctagccaga caactgtcta tgagaagcta gcagcaccac tcctagaaag 1741 agccttcgaa ggcttcaata cctgtctttt tgcttatggt cagactggct ctggaaaatc 1801 atatacgatg atgggattta gtgaagaacc aggaataatt ccaagatttt gtgaagatct 1861 tttttctcaa gtagccagaa aacaaaccca agaggtcagc tatcacattg aaatgagctt 1921 ctttgaagta tataatgaaa aaattcacga ccttctggtt tgtaaagatg aaaatgggca 1981 gagaaagcaa ccactgagag tgagggaaca tcctgtttat ggaccatatg ttgaagcact 2041 gtcaatgaac attgtcagtt cttacgctga tatccagagt tggctagaat tgggaaataa 2101 acaaagagct actgctgcta ctggtatgaa tgataaaagt tcccgatctc attcagtttt 2161 caccctggtg atgacccaga ccaagacaga atttgtggaa ggggaagaac acgatcacag 2221 aataacaagt cgaattaacc taatagatct ggcaggcagt gagcgctgct ctacggctca 2281 cactaatgga gatcgactaa aggaaggtgt gagtattaat aagtccttgc taactttggg 2341 aaaagttata tctgcacttt cggaacaagc aaaccaaagg agtgttttta ttccttatcg 2401 tgaatctgtt cttacatggc tgttaaaaga aagtctgggt ggaaattcaa aaactgcaat 2461 gattgctacg attagtcccg ctgccagcaa catagaagaa acattaagca cacttagata 2521 tgctaaccaa gcccgtttaa tagtcaacat tgctaaagta aatgaagata tgaacgctaa 2581 gttaattaga gaattgaagg cagaaattgc aaagctaaaa gctgctcaga gaaacagtcg 2641 gaatattgac cctgaacgat acaggctctg tcggcaagaa ataacatcct taagaatgaa 2701 actgcatcaa caggagagag acatggcaga aatgcaaaga gtgtggaaag aaaagtttga 2761 acaagctgaa aaaagaaaac ttcaagaaac aaaagagtta cagaaagcag gaattatgtt 2821 tcaaatggac aatcatttac caaaccttgt taatctgaat gaagatccac aactatctga 2881 gatgctgcta tatatgataa aagaaggaac aactacagtt ggaaagtata aaccaaactc 2941 aagccatgat attcagttat ctggggtgct gattgctgat gatcattgta ctatcaaaaa 3001 ttttggtggg acagtgagta ttatcccagt tggggaagca aagacatatg taaatggaaa 3061 acatattttg gaaatcacag tattacgtca tggtgatcga gtgattcttg gtggagatca 3121 ttattttaga tttaatcatc cagtagaagt ccagaaagga aaaaggccat ctggaagaga 3181 tactcctata agtgagggtc caaaagactt tgaatttgca aaaaatgagt tgctcatggc 3241 acagagatca caacttgaag cagaaataaa agaggctcag ttgaaggcaa aggaagaaat 3301 gatgcaagga atccagattg caaaagaaat ggctcagcaa gagctttctt ctcaaaaagc 3361 tgcatatgaa agcaaaataa aagcactgga agcagaactg agagaagagt ctcaaaggaa 3421 aaaaatgcag gaaataaata accagaaggc taatcacaaa attgaggaat tagaaaaggc 3481 aaagcagcat cttgaacagg aaatatatgt caacaaaaag cgattagaaa tggagacatt 3541 ggctacaaaa caggctttag aagaccatag catccgccat gcaagaattc tggaagcttt 3601 agaaactgaa aagcaaaaaa ttgctaaaga agtacaaatt ctacagcaga atcggaataa 3661 tagggataaa acttttacag tgcagacaac ttggagctct atgaaactct caatgatgat 3721 tcaggaagcc aatgctatca gcagcaaatt gaaaacatac tatgtttttg gcagacatga 3781 tatatcagat aaaagtagtt ctgacacttc tattcgggtt cgtaacctga aactaggaat 3841 ctcaacattc tggagtctgg aaaagtttga atctaaactt gcagcaatga aagaacttta 3901 tgagagtaat ggtagtaaca ggggtgaaga tgccttttgt gatcctgaag atgaatggga 3961 acccgacatt acagatgcac cagtttcttc actttctaga aggaggagta ggagtttgat 4021 gaagaacaga agaatttctg gttgtttaca tgacatacaa gtccatccaa ttaagaattt 4081 gcattcttca cattcatcag gtttaatgga caaatcaagc actatttact caaattcagc 4141 agagtccttt cttcctggaa tttgcaaaga attgattggt tcttcgttag atttttttgg 4201 acagagttat gatgaagaaa gaactatagc agacagccta attaatagtt ttcttaaaat 4261 ttataatggg ctatttgcca tttccaaggc tcatgaagaa caagatgaag aaagtcaaga 4321 taacttgttt tcttctgatc gagcaatcca gtcacttact attcagactg catgtgcttt 4381 tgagcagcta gtagtgctaa tgaaacactg gctgagtgat ttactgcctt gtaccaacat 4441 agcaagactt gaggatgagt tgagacaaga agttaaaaaa ctgggaggct acttacagtt 4501 atttttgcag ggatgctgtt tggatatttc atcaatgata aaagaggctc aaaagaatgc 4561 aatccaaatt gtacaacaag ctgtaaagta tgtggggcag ttagcagttc tgaaagggag 4621 caagctacat tttctagaaa acggtaacaa taaagctgcc agtgtccagg aggaattcat 4681 ggatgctgtt tgtgatggtg taggcttagg aatgaagatt ttattagatt ctggactgga 4741 aaaagcaaaa gaacttcagc atgaactctt taggcagtgt acaaaaaatg aggttaccaa 4801 agaaatgaaa actaatgcca tgggattgat tagatctctt gaaaacatct ttgctgaatc 4861 gaaaattaaa agtttcagaa ggcaagtaca agaagaaaac tttgaatacc aagatttcaa 4921 gaggatggtt aatcgtgctc cagaattctt aaagttaaaa cattgcttag agaaagctat 4981 tgaaattatt atttctgcac tgaaaggatg ccatagtgat ataaatcttc tccagacttg 5041 tgttgaaagt attcgcaact tggccagtga tttttacagt gacttcagtg tgccttctac 5101 ttctgttggc agctatgaga gtagagtaac tcacattgtc caccaggaac tagaatctct 5161 agctaagtct ctcctctttt gttttgaatc tgaagaaagc cctgatttgt tgaaaccctg 5221 ggaaacttat aatcaaaata ccaaagaaga acaccaacaa tctaaatcaa gcgggattga 5281 cggcagtaag aataaaggtg taccaaagcg tgtctatgag ctccatggct catccccagc 5341 agtgagctca gaggaatgca cacccagtag gattcagtgg gtgtgaatac tgatgtgtag 5401 gcacttttat gaccacccat gaaagaaaaa gaacacttgc tcggtaattt tctttatgca 5461 ggagagttta agagaaatca gcacagatat ttcaaaaaag tccatgtctt tttatcttta 5521 aaatatctat ttatcaaagg ccagacacag tggctcacgc ctgtaatccc agcactttgg 5581 gaggcgggca gatcacaagg tcaggagttt gagaccggcc tggccaacat ggtgaaaccc 5641 cgtctctact aaaaatacaa aaatttgctg ggcatggtgg cgcgtgcctg taatcccagc 5701 tactaggggg gctgaggcag gaggatcgct tgaacctgag aggcagaggt tgcagtgagc 5761 caagatcatg ccactttact ccagtctgag caacagaacg agacttagtc aaaataaata 5821 aataaataag taaataaata aataaataaa atatctttta tctttaaagt gtttaacatt 5881 ggtatactgt ctgtagttgg ttcattagtc gtttataaag ggttattttc tcatgagtgg 5941 aaacctgaac aatcagttac ctttgtgcct atgccttctc tctcctcaga cagctgggat 6001 gtttatggtg aaatggcctg tacaagttta actaagacaa cttaacttgc attgttaatc 6061 aaaaattctt ttctcaaagg gttaactggt tgccattttg aatagtatgt tcaagggtgt 6121 agcttcctgt ttctttccaa attataagta gctacctaaa tatagtataa ttatatatta 6181 ataatatggc ttgctggcac agtagtttac cctgttatct gtgtttcata atgggggctg 6241 tatgaatatt atttaaaact aataaaatgt tgccagaatt atactaaact gttggatgag 6301 attaggagat cagaggctgg accttctctt gataatgctt gttttgttaa aggtataatg 6361 aaataatttg tatatgattt gatgaagatt aaagaccctt attttccaca gctttaaaaa 6421 aaaaccttta tttatgatca agtaataaag ataatattct acttgtggga tcttacatta 6481 tggaaatagt ttgacgtttt tgacctcaag agtatgtata atttgaagag atactttgta 6541 actatgcttg ggtgatattg agcagttcct aaagaataat tcattt // LOCUS HUMORLMHC 1990 bp DNA PRI 25-NOV-1996 DEFINITION Human olfactory receptor-like gene, complete cds. ACCESSION L35475 NID g1041044 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1990) AUTHORS Fan,W., Cai,W., Parimoo,S., Lennon,G.G. and Weissman,S.M. TITLE Identification of seven new human MHC class I region genes around the HLA-F locus JOURNAL Immunogenetics 44 (2), 97-103 (1996) MEDLINE 96269983 REFERENCE 2 (bases 1 to 1990) AUTHORS Fan,W.-F. TITLE Direct Submission JOURNAL Submitted (27-SEP-1995) Wufang Fan, Lawrence Livermore National Laboratory, Livermore, CA 94550, USA FEATURES Location/Qualifiers source 1..1990 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="FAT11" /map="6p21" /tissue_lib="YAC A146G11" CDS 500..1450 /codon_start=1 /product="olfactory receptor-like protein" /db_xref="PID:g601919" /translation="MDNQSSTPGFLLLGFSEHPGLGRTLFVDVITSYLLTLVGNTLII LLSALDTKLHSPMYFFLSNLSFLDLCFTTSCVPQMLANLWGPKKTISFLDCSVQIFIF LSLGTTECILMKVMAFDRYVAVCQPLHYATIIHPRLCWQLASVAWVIGLVGSVVQTPS TLHLPFCPDRQVDDFVCEVPALIRLSCEDTSYNEIQVAVASVFILVVPLSLILVSYGA ITWAVLRINSATAWRKAFGTCSSHLTVVTLFYSSVIAVYLQPKNPYAQGRGKFFGLFY AVGTPSLNPLVYTLRNKEIKRALRRLLGKERDSRESWRAA" BASE COUNT 508 a 535 c 440 g 507 t ORIGIN 1 tttacagagt tctataaaat tcattcaacc aatagagcaa taattgagcc tacagagaca 61 acttatcaga aaattcattc aatatacctt acgagatcat ccaatagata agagacaact 121 ctagaacagc attcagaaca tagtggcact caataaattt cccctgaatg aatgaattaa 181 tgaattagtg catattttaa tcagcctcct ttgccctcac ccaggaagtc agaggcacca 241 gtgtgagtat ccatctgctg tccagtacat tcatggattc ctcactctca ctagacaatg 301 tttgaccagg aagaacaggg aatgagaagg agctgctggg tggtgatgag ccttggaaag 361 ggaggctggg cgagcagaga cagaagagaa acacctacct gctgtgacct cacaaacacc 421 caggctgagt tttgataaga caggttgaat cacactgggg tgacagcctc atccctccag 481 gtacaaacaa gaacaggcca tggataacca aagctccaca ccgggcttcc tccttctggg 541 cttctctgaa cacccagggc tgggaaggac tctcttcgtg gatgtcatca cttcctacct 601 cctaacccta gtgggcaaca cactcatcat cctgctgtct gcgctggaca ccaagctcca 661 ctctccaatg tactttttcc tctccaacct ctccttcttg gacctctgtt tcaccacgag 721 ttgtgttccc caaatgctgg ccaacctctg gggcccaaag aagaccatca gcttcctgga 781 ctgctctgtc cagatcttca tcttcctgtc cctggggaca actgagtgca tcctcatgaa 841 agtgatggct tttgatcgct acgtggctgt ctgccagccc ctccactatg ccaccatcat 901 ccacccccgc ctgtgctggc agctggcatc tgtggcctgg gtcattgggc tagtggggtc 961 agtggtccag acaccatcca ccctgcacct gcccttctgc cccgatcggc aggtggatga 1021 ttttgtctgt gaggtcccag ctctaattcg actctcctgt gaagacacct cctacaatga 1081 gatccaggtg gctgttgcca gtgtcttcat cttggttgtg cctctcagcc tcatccttgt 1141 ctcttacgga gccattacct gggcagtgct gaggattaac tccgccacag catggagaaa 1201 ggcctttggg acctgctcct cccatctcac tgtggtcacc ctcttctaca gctcagtcat 1261 tgctgtctac ctccagccca aaaatccgta tgcccaaggg aggggcaagt tctttggtct 1321 cttctatgca gtgggcactc cttcacttaa ccctctcgta tacaccctga ggaacaagga 1381 gataaagcga gcactcagga ggttactagg gaaggaaaga gactccaggg aaagctggag 1441 agctgcttaa tatactttcg aaagtaagaa gagtttcttc aagatttatg aacatgttaa 1501 gttttccaga ctactaccct tcccacatac aacctggagc cactgtgggg gggtcacagg 1561 gtgggtatgt tatctatgag agggagaatg agaaagagag ggacagagag ataaaagaat 1621 ttgggtgaga ggagataggt agctccataa ggcacacaaa ttcagatatt atcattccta 1681 tcactgtcca tccttaatat ttctatcctc cattctgtcc tatttactgt catcactcct 1741 atagattccc taactccacc atgcctatct ctggttatat aattgctctc caatggtcat 1801 gtcagtgtag gggaactact ccatcatagc attctggaca cctcgcatgt atctacgtag 1861 gtcatgtcag cacaggcttg aaggaacagc tactctgaga tttaggaaga atgcttctgg 1921 gatcccccct cggcaatatg agaggatcgg gaggcccctt caggaacctg cctcaaatgc 1981 caccttctca // LOCUS HUMOSBPA 2997 bp mRNA PRI 07-JAN-1995 DEFINITION Human oxysterol-binding protein (OSBP) mRNA, complete cds. ACCESSION M86917 J04757 NID g189402 KEYWORDS oxysterol-binding protein. SOURCE Homo sapiens (tissue library: oligo(dT)-primed human kidney cDNA in lambda-gt10) kidney cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2997) AUTHORS Levanon,D., Hsieh,C.L., Francke,U., Dawson,P.A., Ridgway,N.D., Brown,M.S. and Goldstein,J.L. TITLE cDNA cloning of human oxysterol-binding protein and localization of the gene to human chromosome 11 and mouse chromosome 19 JOURNAL Genomics 7 (1), 65-74 (1990) MEDLINE 90243258 FEATURES Location/Qualifiers source 1..2997 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" /tissue_lib="oligo(dT)-primed human kidney cDNA in lambda-gt10" /map="11q11-qter" gene 481..2904 /gene="OSBP" CDS 481..2904 /gene="OSBP" /codon_start=1 /db_xref="GDB:G00-120-252" /product="oxysterol-binding protein" /db_xref="PID:g189403" /translation="MAATELRGVVGPGPAAIAALGGGGAGPPVVGGGGGRGDAGPGSG AASGTVVAAAAGGPGPGAGGVAAAGPAPAPPTGGSGGSGAGGSGSAREGWLFKWTNYI KGYQRRWFVLSNGLLSYYRSKAEMRHTCRGTINLATANITVEDSCNFIISNGGAQTYH LKASSEVERQRWVTALELAKAKAVKMLAESDESGDEESVSQTDKTELQNTLRTLSSKV EDLSTCNDLIAKHGTALQRSLSELESLKLPAESNEKIKQVNERATLFRITSNAMINAC RDFLMLAQTHSKKWQKSLQYERDQRIRLEETLEQLAKQHNHLERAFRGATVLPANTPG NVGSGKDQCCSGKGDMSDEDDENEFFDAPEIITMPENLGHKRTGSNISGASSDISLDE QYKHQLEETKKEKRTRIPYKPNYSLNLWSIMKNCIGKELSKIPMPVNFNEPLSMLQRL TEDLEYHELLDRAAKCENSLEQLCYVAAFTVSSYSTTVFRTSKPFNPLLGETFELDRL EENGYRSLCEQVSHHPPAAAHHAESKNGWTLRQEIKITSKFRGKYLSIMPLGTIHCIF HATGHHYTWKKVTTTVHNIIVGKLWIDQSGEIDIVNHKTGDKCNLKFVPYSYFSRDVA RKVTGEVTDPSGKVHFALLGTWDEKMECFKVQPVIGENGGDARQRGHEAEESRVMLWK RNPLPKNAENMYYFSELALTLNAWESGTAPTDSRLRPDQRLMENGRWDEANAEKQRLE EKQRLSRKKREAEAMKATEDGTPYDPYKALWFERKKDPVTKELTHIYRGEYWECKEKQ DWSSCPDIF" BASE COUNT 799 a 736 c 879 g 583 t ORIGIN 1 agaagagcga tgggaggggg tgacgtggca gtgacaggcg cataaactga aatgaatggc 61 ggaaacagca gccaatcggc gatcagattg gccgggcatt ccgcattctc ccgtcctccc 121 tgagcgctgg aaacttcagg aaagcaccaa tgagcttccc caaaagtctg attggccttt 181 atctgaacca atcaaaggtg ggtctggcgg acgcgtccca ccccaggctc tcactgggcg 241 ctgtagaatg acaggcccga gagaaatgcc aatatccgcg gggtcttcct caagtgacac 301 tcagcggcac caatcgacgt ccttcttccc ggcggccgag ccctcctccc cgcggctggc 361 cgggcggggc ggggacgctg cgcggcggtg gctgatgcgg tagccgtgtg gggcgctccg 421 ggcggcgacg gcggctctcg taggcggttc cggtcttgta tctccaggcg gcggcggctc 481 atggcggcga cggagctgag aggagtggtg gggccaggcc cggcagccat tgcagcactt 541 ggcggcggcg gcgccggtcc cccagtggtg ggaggaggcg gcggccgcgg agatgcgggg 601 ccaggctccg gggccgcgtc agggacggtg gtcgcggcgg cggcgggagg cccgggcccg 661 ggggccgggg gagtggcggc ggctggcccg gcccctgcgc cgccgactgg gggctcgggc 721 ggctcgggcg ctgggggttc gggctcggct cgagagggct ggctcttcaa atggaccaat 781 tatatcaaag gctaccagcg gcgatggttc gtgctgagca acgggctcct gagctactac 841 agatcaaagg cagaaatgag acatacctgc cgtggtacca tcaacctcgc cacagccaac 901 atcaccgtgg aggactcctg caacttcatc atttccaatg ggggtgctca gacctaccat 961 ctgaaagcta gttcagaagt tgagcggcag cgctgggtga cggccctgga actggccaag 1021 gccaaagctg tgaagatgct ggcagagtca gatgaatcag gagatgaaga gtctgtctca 1081 caaactgaca agactgagct gcagaatacc cttcggaccc tctctagcaa agtagaggac 1141 ttgagcacgt gcaatgactt gatagctaag catggcacag ctctgcagcg ttctctcagt 1201 gagctggagt ccctgaagtt gcctgctgag agcaatgaaa agatcaaaca ggtcaacgaa 1261 cgagccacac tctttaggat aacatccaat gccatgatca acgcctgcag agatttcctc 1321 atgttagccc agacccatag taaaaaatgg caaaagtcac tacagtatga aagagaccag 1381 cgtatccgac tggaagaaac cctcgagcag ctggcgaagc agcataatca cctggagagg 1441 gccttccgag gagccacggt gctgccggca aacactcctg gcaatgtggg ttctggtaaa 1501 gatcagtgct gctctggcaa aggggacatg agcgatgaag atgatgagaa tgaatttttt 1561 gatgcacctg agatcatcac catgcctgaa aatttgggcc acaaacgtac tggcagcaat 1621 atcagtggag ccagcagtga catcagcctt gatgaacagt acaagcatca gctggaggag 1681 accaaaaagg aaaagagaac cagaatacca tacaagccaa actatagcct caatttatgg 1741 agcatcatga agaactgcat tggaaaagaa ctctctaaga tccccatgcc ggtaaacttt 1801 aatgagccct tgtccatgct tcagcgcctt actgaagatc tggaatacca tgagctgtta 1861 gaccgagctg caaaatgtga gaattctcta gaacagctct gttatgttgc agctttcacc 1921 gtgtcctcct actccactac tgtcttccgc accagtaagc cattcaaccc actgcttggg 1981 gagacctttg agctggaccg attagaggag aatgggtacc gatccctctg tgaacaggtg 2041 agtcatcatc cccctgctgc tgcgcaccat gctgagtcca aaaatggctg gacattgcgt 2101 caggaaatca aaatcaccag caagtttcga ggcaaatacc tctccattat gcccctcggt 2161 accattcatt gtattttcca tgcaactggg caccactaca cttggaagaa agttaccaca 2221 actgtacaca acattattgt gggcaagttg tggatagatc agtctggcga aattgatatt 2281 gtgaatcaca agacaggaga caagtgtaat cttaaatttg ttccttatag ctacttctct 2341 cgggatgtag caagaaaggt gacgggggaa gtgacagatc catcaggaaa agtccacttt 2401 gctcttctgg ggacgtggga tgagaaaatg gaatgtttca aagtacagcc agtcattggg 2461 gaaaatgggg gtgatgctcg acagagaggc catgaagcag aggaaagcag ggtcatgctg 2521 tggaaaagga atcctttacc gaagaatgca gaaaacatgt actacttctc agagcttgct 2581 ctgactctca atgcttggga aagtggcact gcccccacag acagccggtt acgacctgac 2641 cagagactga tggaaaatgg acgctgggat gaagcaaatg cggagaagca gcgcctggag 2701 gaaaaacaaa gactttccag aaagaagaga gaagcggaag ctatgaaagc cacagaggat 2761 ggcacaccat atgatcccta taaggcactg tggtttgagc ggaagaagga ccctgttacc 2821 aaggagttaa cccatattta taggggagaa tactgggagt gtaaagaaaa acaggactgg 2881 agctcatgcc cggacatttt ctgaaacggc agtaacaaaa aagaggagca tataatggag 2941 aagaggacag aggatgtgtg ggaaagctgg aagttgtgac tctcttacca agtgctt // LOCUS HUMOSF2OS 3213 bp mRNA PRI 02-OCT-1993 DEFINITION Homo sapiens mRNA for osteoblast specific factor 2 (OSF-2os). ACCESSION D13666 NID g393316 KEYWORDS OSF-2os; osteoblast specific factor. SOURCE Homo sapiens osteosarcoma osteoblast cDNA to mRNA, clone pKOT158. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3213) AUTHORS Takeshita,S., Kikuno,R., Tezuka,K. and Amann,E. TITLE Osteoblast-specific factor 2: cloning of a putative bone adhesion protein with homology with the insect protein fasciclin I JOURNAL Biochem. J. 294 (Pt 1), 271-278 (1993) MEDLINE 93371373 COMMENT Submitted (12-Nov-1992) to DDBJ by: Reiko Kikuno Pharma Research Labs. Hoechst Japan Ltd. 1-3-2 Minami-dai Kawagoe Saitama 350 Japan Phone: 0492-43-6149 Fax: 0492-41-6475 E-mail: rkikuno@ddbj.nig.ac.jp. FEATURES Location/Qualifiers source 1..3213 /organism="Homo sapiens" /db_xref="taxon:9606" gene 12..2522 /gene="osf-2" CDS 12..2522 /gene="osf-2" /codon_start=1 /product="OSF-2os" /db_xref="PID:g393317" /translation="MIPFLPMFSLLLLLIVNPINANNHYDKILAHSRIRGRDQGPNVC ALQQILGTKKKYFSTCKNWYKKSICGQKTTVLYECCPGYMRMEGMKGCPAVLPIDHVY GTLGIVGATTTQRYSDASKLREEIEGKGSFTYFAPSNEAWDNLDSDIRRGLESNVNVE LLNALHSHMINKRMLTKDLKNGMIIPSMYNNLGLFINHYPNGVVTVNCARIIHGNQIA TNGVVHVIDRVLTQIGTSIQDFIEAEDDLSSFRAAAITSDILEALGRDGHFTLFAPTN EAFEKLPRGVLERFMGDKVASEALMKYHILNTLQCSESIMGGAVFETLEGNTIEIGCD GDSITVNGIKMVNKKDIVTNNGVIHLIDQVLIPDSAKQVIELAGKQQTTFTDLVAQLG LASALRPDGEYTLLAPVNNAFSDDTLSMVQRLLKLILQNHILKVKVGLNELYNGQILE TIGGKQLRVFVYRTAVCIENSCMEKGSKQGRNGAIHIFREIIKPAEKSLHEKLKQDKR FSTFLSLLEAADLKELLTQPGDWTLFVPTNDAFKGMTSEEKEILIRDKNALQNIILYH LTPGVFIGKGFEPGVTNILKTTQGSKIFLKEVNDTLLVNELKSKESDIMTTNGVIHVV DKLLYPADTPVGNDQLLEILNKLIKYIQIKFVRGSTFKEIPVTVYTTKIITKVVEPKI KVIEGSLQPIIKTEGPTLTKVKIEGEPEFRLIKEGETITEVIHGEPIIKKYTKIIDGV PVEITEKETREERIITGPEIKYTRISTGGGETEETLKKLLQEEVTKVTKFIEGGDGHL FEDEEIKRLLQGDTPVRKLQANKKVQGSRRRLREGRSQ" polyA_signal 3118..3123 BASE COUNT 1096 a 590 c 654 g 873 t ORIGIN 1 agagactcaa gatgattccc tttttaccca tgttttctct actattgctg cttattgtta 61 accctataaa cgccaacaat cattatgaca agatcttggc tcatagtcgt atcaggggtc 121 gggaccaagg cccaaatgtc tgtgcccttc aacagatttt gggcaccaaa aagaaatact 181 tcagcacttg taagaactgg tataaaaagt ccatctgtgg acagaaaacg actgttttat 241 atgaatgttg ccctggttat atgagaatgg aaggaatgaa aggctgccca gcagttttgc 301 ccattgacca tgtttatggc actctgggca tcgtgggagc caccacaacg cagcgctatt 361 ctgacgcctc aaaactgagg gaggagatcg agggaaaggg atccttcact tactttgcac 421 cgagtaatga ggcttgggac aacttggatt ctgatatccg tagaggtttg gagagcaacg 481 tgaatgttga attactgaat gctttacata gtcacatgat taataagaga atgttgacca 541 aggacttaaa aaatggcatg attattcctt caatgtataa caatttgggg cttttcatta 601 accattatcc taatggggtt gtcactgtta attgtgctcg aatcatccat gggaaccaga 661 ttgcaacaaa tggtgttgtc catgtcattg accgtgtgct tacacaaatt ggtacctcaa 721 ttcaagactt cattgaagca gaagatgacc tttcatcttt tagagcagct gccatcacat 781 cggacatatt ggaggccctt ggaagagacg gtcacttcac actctttgct cccaccaatg 841 aggcttttga gaaacttcca cgaggtgtcc tagaaaggtt catgggagac aaagtggctt 901 ccgaagctct tatgaagtac cacatcttaa atactctcca gtgttctgag tctattatgg 961 gaggagcagt ctttgagacg ctggaaggaa atacaattga gataggatgt gacggtgaca 1021 gtataacagt aaatggaatc aaaatggtga acaaaaagga tattgtgaca aataatggtg 1081 tgatccattt gattgatcag gtcctaattc ctgattctgc caaacaagtt attgagctgg 1141 ctggaaaaca gcaaaccacc ttcacggatc ttgtggccca attaggcttg gcatctgctc 1201 tgaggccaga tggagaatac actttgctgg cacctgtgaa taatgcattt tctgatgata 1261 ctctcagcat ggttcagcgc ctccttaaat taattctgca gaatcacata ttgaaagtaa 1321 aagttggcct taatgagctt tacaacgggc aaatactgga aaccatcgga ggcaaacagc 1381 tcagagtctt cgtatatcgt acagctgtct gcattgaaaa ttcatgcatg gagaaaggga 1441 gtaagcaagg gagaaacggt gcgattcaca tattccgcga gatcatcaag ccagcagaga 1501 aatccctcca tgaaaagtta aaacaagata agcgctttag caccttcctc agcctacttg 1561 aagctgcaga cttgaaagag ctcctgacac aacctggaga ctggacatta tttgtgccaa 1621 ccaatgatgc ttttaaggga atgactagtg aagaaaaaga aattctgata cgggacaaaa 1681 atgctcttca aaacatcatt ctttatcacc tgacaccagg agttttcatt ggaaaaggat 1741 ttgaacctgg tgttactaac attttaaaga ccacacaagg aagcaaaatc tttctgaaag 1801 aagtaaatga tacacttctg gtgaatgaat tgaaatcaaa agaatctgac atcatgacaa 1861 caaatggtgt aattcatgtt gtagataaac tcctctatcc agcagacaca cctgttggaa 1921 atgatcaact gctggaaata cttaataaat taatcaaata catccaaatt aagtttgttc 1981 gtggtagcac cttcaaagaa atccccgtga ctgtctatac aactaaaatt ataaccaaag 2041 ttgtggaacc aaaaattaaa gtgattgaag gcagtcttca gcctattatc aaaactgaag 2101 gacccacact aacaaaagtc aaaattgaag gtgaacctga attcagactg attaaagaag 2161 gtgaaacaat aactgaagtg atccatggag agccaattat taaaaaatac accaaaatca 2221 ttgatggagt gcctgtggaa ataactgaaa aagagacacg agaagaacga atcattacag 2281 gtcctgaaat aaaatacact aggatttcta ctggaggtgg agaaacagaa gaaactctga 2341 agaaattgtt acaagaagag gtcaccaagg tcaccaaatt cattgaaggt ggtgatggtc 2401 atttatttga agatgaagaa attaaaagac tgcttcaggg agacacaccc gtgaggaagt 2461 tgcaagccaa caaaaaagtt caaggttcta gaagacgatt aagggaaggt cgttctcagt 2521 gaaaatccaa aaaccagaaa aaaatgttta tacaacccta agtcaataac ctgaccttag 2581 aaaattgtga gagccaagtt gacttcagga actgaaacat cagcacaaag aagcaatcat 2641 caaataattc tgaacacaaa tttaatattt ttttttctga atgagaaaca tgagggaaat 2701 tgtggagtta gcctcctgtg gtaaaggaat tgaagaaaat ataacacctt acaccctttt 2761 tcatcttgac attaaaagtt ctggctaact ttggaatcca ttagagaaaa atccttgtca 2821 ccagattcat tacaattcaa atcgaagagt tgtgaactgt tatcccattg aaaagaccga 2881 gccttgtatg tatgttatgg atacataaaa tgcacgcaag ccattatctc tccatgggaa 2941 gctaagttat aaaaataggt gcttggtgta caaaactttt tatatcaaaa ggctttgcac 3001 atttctatat gagtgggttt actggtaaat tatgttattt tttacaacta attttgtact 3061 ctcagaatgt ttgtcatatg cttcttgcaa tgcatatttt ttaatctcaa acgtttcaat 3121 aaaaccattt ttcagatata aagagaatta cttcaaattg agtaattcag aaaaactcaa 3181 gatttaagtt aaaaagtggt ttggacttgg gaa // LOCUS HUMOTC 1464 bp mRNA PRI 07-JAN-1995 DEFINITION Human ornithine transcarbamylase (OTC) mRNA, complete coding sequence. ACCESSION K02100 NID g189406 KEYWORDS nuclear matrix protein; ornithine transcarbamylase. SOURCE Human cDNA to adult liver mRNA, clones pHO-1, pHO-7, and pHO-31. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1464) AUTHORS Horwich,A.L., Fenton,W.A., Williams,K.R., Kalousek,F., Kraus,J.P., Doolittle,R.F., Konigsberg,W. and Rosenberg,L.E. TITLE Structure and expression of a complementary DNA for the nuclear coded precursor of human mitochondrial ornithine transcarbamylase JOURNAL Science 224 (4653), 1068-1074 (1984) MEDLINE 84196410 COMMENT The structural gene for ornithine transcarbamylase (OTC) is a nuclear-coded polypeptide encoded on the X chromosome. OTC consists of 3 identical subunits. The precursor for each subunit is synthesized in the cytoplasm and then processed posttranslationally to its mature form during mitochondrial import. Comparison of the human mature OTC subunit sequence with that of the rat revealed 90% amino acid homology, and 95% DNA sequence homology. The cDNA sequence of OTC was derived from three overlapping sequences from plasmids pHO-1, pHO-7, and pHO-31. FEATURES Location/Qualifiers source 1..1464 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="liver" /map="Xp21.1" mRNA <1..>1464 /note="OTC mRNA" sig_peptide 136..231 /gene="OTC" /note="G00-119-468" gene 136..1200 /gene="OTC" CDS 136..1200 /gene="OTC" /codon_start=1 /db_xref="GDB:G00-119-468" /product="ornithine transcarbamylase" /db_xref="PID:g189407" /translation="MLFNLRILLNNAAFRNGHNFMVRNFRCGQPLQNKVQLKGRDLLT LKNFTGEEIKYMLWLSADLKFRIKQKGEYLPLLQGKSLGMIFEKRSTRTRLSTETGFA LLGGHPCFPTTQDIHLGVNESLTDTARVLSSMADAVLARVYKQSDLDTLAKEASIPII NGLSDLYHPIQILADYLTLQEHYSSLKGLTLSCFGDGNNILHSIMMSAAKFGMHLQAA TPKGYEPDASVTKLAEQYAKENGTKLLLTNDPLEAAHGGNVLITDTWISMGREEEKKK RLQAFQGYQVTMKTAKVAASDWTFLHCLPRKPEEVDDEVFYSPRSLVFPEAENRKWTI MAVMVSLLTDYSPQLQKPKF" mat_peptide 232..1197 /gene="OTC" /note="G00-119-468" /product="ornithine transcarbamylase" BASE COUNT 445 a 297 c 323 g 399 t ORIGIN 243 bp upstream of PvuII site on X chromosome. 1 aagctgaagg gtgatattac ctttgctccc tcactgcaac tgaacacatt tcttagtttt 61 taggtggccc ccgctggcta acttgctgtg gagttttcaa gggcatagaa tcgtccttta 121 cacaattaaa agaagatgct gtttaatctg aggatcctgt taaacaatgc agcttttaga 181 aatggtcaca acttcatggt tcgaaatttt cggtgtggac aaccactaca aaataaagtg 241 cagctgaagg gccgtgacct tctcactcta aaaaacttta ccggagaaga aattaaatat 301 atgctatggc tatcagcaga tctgaaattt aggataaaac agaaaggaga gtatttgcct 361 ttattgcagg ggaagtcctt aggcatgatt tttgagaaaa gaagtactcg aacaagattg 421 tctacagaaa caggctttgc acttctggga ggacatcctt gttttcctac cacacaagat 481 attcatttgg gtgtgaatga aagtctcacg gacacggccc gtgtattgtc tagcatggca 541 gatgcagtat tggctcgagt gtataaacaa tcagatttgg acacccttgc taaagaagca 601 tccatcccaa ttatcaatgg gctgtcagat ttgtaccatc ctatccagat cctggctgat 661 tacctcacgc tccaggaaca ctatagctct ctgaaaggtc ttaccctcag ctgtttcggg 721 gatgggaaca atatcctgca ctccatcatg atgagcgcag cgaaattcgg aatgcacctt 781 caggcagcta ctccaaaggg ttatgagccg gatgctagtg taaccaagtt ggcagagcag 841 tatgccaaag agaatggtac caagctgttg ctgacaaatg atccattgga agcagcgcat 901 ggaggcaatg tattaattac agacacttgg ataagcatgg gacgagaaga ggagaagaaa 961 aagcggctcc aagctttcca aggttaccaa gttacaatga agactgctaa agttgctgcc 1021 tctgactgga catttttaca ctgcttgccc agaaagccag aagaagtgga tgatgaagtc 1081 ttttattctc ctcgatcact agtgttccca gaggcagaaa acagaaagtg gacaatcatg 1141 gctgtcatgg tgtccctgct gacagattac tcacctcagc tccagaagcc taaattttga 1201 tgttgtgtta cttgtcaaga aagaagcaat gttggtcagt aacagaatga gttggtttat 1261 ggggaaaaga gaagagaatc taaaaaataa accaatccct aacacgtggt atgggcgaat 1321 cgtacgatat gctttgccat tgtgaaactt tccttaagcc ttcaatttaa gtgctgatgc 1381 actgtaatac gtgcttaact ttgcttaaac tctctaattc ccaatttctg agttacattt 1441 agatatcata ttaactatca tata // LOCUS HUMP107B 3960 bp mRNA PRI 11-AUG-1993 DEFINITION Human retinoblastoma related protein (p107) mRNA, complete cds. ACCESSION L14812 NID g292373 KEYWORDS cell cycle regulation protein; retinoblastoma protein; tumor suppressor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Ewen,M.E., Xing,Y., Bentley-Lawrence,J. and Livingston,D.M. TITLE Molecular cloning, chromosome mapping and expression of the cDNA for p107 a retinoblastoma gene product-related protein JOURNAL Cell 66, 1155-1164 (1991) MEDLINE 92005667 REFERENCE 2 (bases 1 to 3960) AUTHORS Zhu,L., van den Heuvel,S., Helin,K., Fattaey,A., Ewen,M., Livingston,D., Dyson,N. and Harlow,E. TITLE Inhibition of cell proliferation by p107, a relative of the retinoblastoma protein JOURNAL Genes Dev. 7 (7A), 1111-1125 (1993) MEDLINE 93307648 FEATURES Location/Qualifiers source 1..3960 /organism="Homo sapiens" /db_xref="taxon:9606" /map="20q11.2" CDS 69..3275 /codon_start=1 /product="p107" /db_xref="PID:g292374" /translation="MFEDKPHAEGAAVVAAAGEALQALCQELNLDEGSAAEALDDFTA IRGNYSLEGEVTHWLACSLYVACRKSIIPTVGKGIMEGNCVSLTRILRSAKLSLIQFF SKMKKWMDMSNLPQEFRERIERLERNFEVSTVIFKKYEPIFLDIFQNPYEEPPKLPRS RKQRRIPCSVKDLFNFCWTLFVYTKGNFRMIGDDLVNSYHLLLCCLDLIFANAIMCPN RQDLLNPSFKGLPSDFHTADFTASEEPPCIIAVLCELHDGLLVEAKGIKEHYFKPYIS KLFDRKILKGECLLDLSSFTDNSKAVNKEYEEYVLTVGDFDERIFLGADAEEEIGTPR KFTRDTPLGKLTAQANVEYNLQQHFEKKRSFAPSTPLTGRRYLREKEAVITPVASATQ SVSRLQSIVAGLKNAPSDQLINIFESCVRNPVENIMKILKGIGETFCQHYTQSTDEQP GSHIDFAVNRLKLAEILYYKILETVMVQETRRLHGMDMSVLLEQDIFHRSLMACCLEI VLFAYSSPRTFPWIIEVLNLQPFYFYKVIEVVIRSEEGLSRDMVKHLNSIEEQILESL AWSHDSALWEALQVSANKVPTCEEVIFPNNFETGNGGNVQGHLPLMPMSPLMHPRVKE VRTDSGSLRRDMQPLSPISVHERYSSPTAGSAKRRLFGEDPPKEMLMDKIITEGTKLK IAPSSSITAENVSILPGQTLLTMATAPVTGTTGHKVTIPLHGVANDAGEITLIPLSMN TNQESKVKSPVSLTAHSLIGASPKQTNLTKAQEVHSTGINRPKRTGSLALFYRKVYHL ASVRLRDLCLKLDVSNELRRKIWTCFEFTLVHCPDLMKDRHLDQLLLCAFYIMAKVTK EERTFQEIMKSYRNQPQANSHVYRSVLLKSIPREVVAYNKNINDDFEMIDCDLEDATK TPDCSSGPVKEERSDLIKFYNTIYVGRVKSFALKYDLANQDHMMDAPPLSPFPHIKQQ PGSPRRISQQHSIYISPHKNGSGLTPRSALLYKFNGSPSKSLKDINNMIRQGEQRTKK RVIAIDSDAESPAKRVCQENDDVLLKRLQDVVSERANH" BASE COUNT 1257 a 771 c 869 g 1063 t ORIGIN 1 cgggtagcgc gcctgggagg gagaaagaag tcgggggccg tggcgcgcag cccgcggggc 61 ctgaagggat gttcgaggac aagccccacg ctgagggggc ggcggtggtc gccgcagccg 121 gggaggcgct acaggccctg tgccaggagc tgaacctgga cgaggggagc gcggccgaag 181 ccctggacga ctttactgcc atccgaggca actacagcct agagggagaa gttacacact 241 ggttggcatg ttcattatat gttgcatgcc gcaaaagcat tattcccacg gttggaaagg 301 gtatcatgga aggcaactgt gtttcactta ccagaatact acgttcagct aaattaagtt 361 taatacaatt ttttagtaaa atgaagaaat ggatggacat gtcaaatcta ccacaagaat 421 ttcgtgaacg tatagaaagg ctagagagaa attttgaggt gtctactgta atattcaaaa 481 aatatgagcc aattttttta gatatatttc aaaatccata tgaagaacca ccaaagttac 541 cacgaagccg gaagcagagg aggattcctt gcagtgttaa ggatctgttt aatttctgtt 601 ggacactttt tgtttatact aagggtaatt ttcggatgat tggggatgac ttagtaaact 661 cttatcattt acttctatgc tgcttggatc tgatttttgc caatgcgatt atgtgcccaa 721 atagacaaga cttgctaaat ccatcattta aaggtttacc atctgatttt catactgctg 781 actttacggc ttctgaagag ccaccctgca tcattgctgt actgtgtgaa ctgcatgatg 841 gacttctcgt agaagcaaaa ggaataaagg agcactactt taagccatat atttcaaaac 901 tctttgacag gaagatatta aaaggagaat gcctcctgga cctttcaagt tttactgata 961 atagcaaagc agtgaataag gagtatgaag agtatgttct aactgttggt gattttgatg 1021 agaggatctt tttgggagca gacgcagaag aggaaattgg aacacctcga aagttcactc 1081 gtgacacccc attagggaaa ctgacagcac aggctaatgt ggagtataac cttcaacagc 1141 actttgaaaa aaaaaggtca tttgcacctt ctaccccact gaccggacgg agatatttac 1201 gagaaaaaga agcagtcatt actcctgttg catcagccac ccaaagtgtg agccggttac 1261 agagtattgt ggctggtctg aaaaatgcac caagtgacca acttataaat atttttgaat 1321 cttgtgtgcg taatcctgtt gaaaacatta tgaaaatact aaaaggaata ggagagactt 1381 tctgtcaaca ctatactcaa tcaacagatg aacagccagg atctcacata gactttgctg 1441 taaacagact aaagctggca gaaattttgt attataaaat actagagact gtaatggttc 1501 aggaaacacg aagacttcat ggaatggaca tgtcagttct tttagagcaa gatatatttc 1561 atcgttcctt gatggcttgt tgtttggaaa ttgtgctctt tgcctatagc tcacctcgta 1621 cttttccttg gattattgaa gttctcaact tgcaaccatt ttacttttat aaggttattg 1681 aggtggtgat ccgctcagaa gaggggctct caagggacat ggtgaaacac ctaaacagca 1741 ttgaagaaca gattttggag agtttagcat ggagtcacga ttctgcactg tgggaggctc 1801 tccaggtttc tgcaaacaaa gttcctacct gtgaagaagt tatattccca aataactttg 1861 aaacaggaaa tggaggaaat gtgcagggac atcttcccct gatgccaatg tctcctctaa 1921 tgcacccaag agtcaaggaa gttcgaactg acagtgggag tcttcgaaga gatatgcaac 1981 cattgtctcc aatttctgtc catgaacgct acagttctcc taccgcaggg agtgctaaga 2041 gaagactctt tggagaggac cccccaaagg aaatgcttat ggacaagatc ataacagaag 2101 gaacaaaatt gaaaatcgct ccttcttcaa gcattactgc tgaaaatgta tcaattttac 2161 ctggtcaaac tcttctaaca atggccacag ccccagtaac aggaacaaca ggacataaag 2221 ttacaattcc attacatggt gtcgcaaatg atgctggaga gatcacactg atacctcttt 2281 ccatgaatac aaatcaggag tccaaagtca agagtcctgt atcacttact gctcattcat 2341 taattggtgc ttctccaaaa cagaccaatc tgactaaagc acaagaggta cattcaactg 2401 gaataaacag gccaaagaga actgggtcct tagcactatt ttacagaaag gtctatcatt 2461 tggcaagtgt acgcttacgt gatctatgtc taaaactgga tgtttcaaat gagttacgaa 2521 ggaagatatg gacgtgtttt gaattcactt tagttcactg tcctgatcta atgaaagaca 2581 ggcatttgga tcagctcctc ctttgtgcct tttatatcat ggcaaaggta acaaaagaag 2641 aaagaacttt tcaagaaatt atgaaaagtt ataggaatca gccccaagct aatagtcacg 2701 tatatagaag tgttctgctg aaaagtattc caagagaagt tgtggcatat aataaaaata 2761 taaatgatga ctttgaaatg atagattgtg acttagaaga tgctacaaaa acacctgact 2821 gttccagtgg accagtgaaa gaggaaagaa gtgatcttat aaaattttac aatacaatat 2881 atgtaggaag agtgaagtca tttgcactga aatacgactt ggcgaatcag gaccatatga 2941 tggatgctcc accactctct ccttttccac atattaaaca acagccaggc tcaccacgcc 3001 gcatttccca gcagcactcc atttatattt ccccgcacaa gaatgggtca ggccttacac 3061 caagaagcgc tctgctgtac aagttcaatg gcagcccttc taagagtttg aaagatatca 3121 acaacatgat aaggcaaggt gagcagagaa ccaagaagcg agtaatagcc atcgatagtg 3181 atgcagaatc ccctgccaaa cgcgtctgtc aagaaaatga tgacgtttta ctgaaacgac 3241 tacaggatgt tgtcagtgaa agagcaaatc attaatgttg ttcttgtttc tatgataaaa 3301 gcactttcag attgttctgc agaaagttgg agctctgtcc ttcaaacctt ttagccctat 3361 agatgataaa tatcactggg ttataagaaa aaattgcaca aaaattatgt gctttttaaa 3421 atatttatcc aaaatgtagt tgacagagat gtattttgag ttggattgga aaggaatatt 3481 ttaagtgcct tttaaaaata ctaatagtcc ggccaggcgc tgtggctcac gcctctaatc 3541 ccaggacttt gggaggccaa ggcgggcaga tcaccggagt caggagttcg agaccagcct 3601 gaccaacatg gagaaacccc atctctacta aaaatacaaa attagccggg tggtgtggcg 3661 catgcctata atcccagcta cttgggaggc tgaggcagaa ttgcttgaac ccaggaagcg 3721 gaggttgtgg tgagccaagg ttgcgccact gcactccagc ctgggcaaca agagtaaaac 3781 tccatctcaa aaaatatata tatatatata aatagggaat tttttttaat gtttgctcct 3841 tgagttttca agatgaaata aggagaaacc ccataacttt ttagctctct tttaaaaata 3901 aatgtctcct tctgtgttct gtaatatgag gataaataat ctgcttttga tagcaaaaaa // LOCUS HUMP1BX 480 bp mRNA PRI 07-JAN-1995 DEFINITION Human secretory protein (P1.B) mRNA, complete cds. ACCESSION L15203 NID g402482 KEYWORDS secretory protein. SOURCE Homo sapiens intestine, stomach, uterus cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 480) AUTHORS Hauser,F., Poulsom,R., Chinery,R., Rogers,L.A., Hanby,A.M., Wright,N.A. and Hoffmann,W. TITLE hP1.B, a human P-domain peptide homologous with rat intestinal trefoil factor, is expressed also in the ulcer-associated cell lineage and the uterus JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90 (15), 6961-6965 (1993) MEDLINE 93348192 FEATURES Location/Qualifiers source 1..480 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="intestine, stomach, uterus" mRNA 1..460 /gene="P1.B" /note="minor transcript" mRNA 1..480 /gene="P1.B" /note="major transcript" gene 1..480 /gene="P1.B" sig_peptide 36..98 /gene="P1.B" CDS 36..278 /gene="P1.B" /codon_start=1 /product="secretory protein" /db_xref="PID:g402483" /translation="MAARALCMLGLVLALLSSSSAEEYVGLSANQCAVPAKDRVDCGY PHVTPKECNNRGCCFDSRIPGVPWCFKPLQEAECTF" polyA_signal 437..442 /gene="P1.B" BASE COUNT 78 a 154 c 141 g 107 t ORIGIN 1 cagtcctgag ctgcgtcccg gagcccacgg tggtcatggc tgccagagcg ctctgcatgc 61 tggggctggt cctggccttg ctgtcctcca gctctgctga ggagtacgtg ggcctgtctg 121 caaaccagtg tgccgtgcca gccaaggaca gggtggactg cggctacccc catgtcaccc 181 ccaaggagtg caacaaccgg ggctgctgct ttgactccag gatccctgga gtgccttggt 241 gtttcaagcc cctgcaggaa gcagaatgca ccttctgagg cacctccagc tgcccccggc 301 cgggggatgc gaggctcgga gcacccttgc ccggctgtga ttgctgccag gcactgttca 361 tctcagcttt tctgtccctt tgctcccggc aagcgcttct gctgaaagtt catatctgga 421 gcctgatgtc ttaacgaata aaggtcccat gctccacccg aggacagttc ttcgtgcctg // LOCUS HUMP2A 2205 bp mRNA PRI 22-OCT-1992 DEFINITION Human protein phosphatase 2A regulatory subunit alpha-isotype (alpha-PR65) mRNA, complete cds. ACCESSION J02902 NID g189427 KEYWORDS protein phosphatase-2A regulatory alpha-subunit. SOURCE Human HeLa cell, cDNA to mRNA, clone lambda-HHPR65-3. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2205) AUTHORS Hemmings,B.A., Adams-Pearson,C., Maurer,F., Mueller,P., Goris,J., Merlevede,W., Hofsteenge,J. and Stone,S.R. TITLE Alpha and beta-forms of the 65-kDa subunit of protein phosphatase 2A have a similar 39 amino acid repeating structure JOURNAL Biochemistry 29, 3166-3173 (1990) MEDLINE 90241887 COMMENT Draft entry and printed sequence [1] kindly submitted by B.A.Hemmings, 23-MAR-1990, for release after publication. FEATURES Location/Qualifiers source 1..2205 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" mRNA 7..2205 /note="phosphatase 2A regulatory subunit mRNA" CDS 145..1914 /codon_start=1 /product="phosphatase 2A regulatory subunit" /db_xref="PID:g189428" /translation="MAAADGDDSLYPIAVLIDELRNEDVQLRLNSIKKLSTIALALGV ERTRSELLPFLTDTIYDEDEVLLALAEQLGTFTTLVGGPEYVHCLLPPLESLATVEET VVRDKAVESLRAISHEHSPSDLEAHFVPLVKRLAGGDWFTSRTSACGLFSVCYPRVSS AVKAELRQYFRNLCSDDTPMVRRAAASKLGEFAKVLELDNVKSEIIPMFSNLASDEQD SVRLLAVEACVNIAQLLPQEDLEALVMPTLRQAAEDKSWAVRYMVADKFTELQKAVGP EITKTDLVPAFQNLMKDCEAEVRAAASHKVKEFCENLSADCRENVIMSQILPCIKELV SDANQHVKSALASVIMGLSPILGKDNTIEHLLPLFLAQLKDECPEVRLNIISNLDCVN EVIGIRQLSQSLLPAIVELAEDAKWRVRLAIIEYMPLLAGQLGVEFFDEKLNSLCMAW LVDHVYAIREAATSNLKKLVEKFGKEWAHATIIPKVLAMSGDPNYLHRMTTLFCINVL SEVCGQDITTKHMLPTVLRMAGDPVANVRFNVAKSLQKIGPILDNSTLQSEVKPILEK LTQDQDVDVKYFAQEALTVLSLA" BASE COUNT 447 a 676 c 639 g 443 t ORIGIN 1 gaattccggt tctcactctt gacgttgtcc agctccagca ccttggcaac tcccccagct 61 tggacggccg gcccgccgct ccatggggga gtcatctgag cacagctgct ggccgcagtc 121 tgacaggaaa gggacggagc caagatggcg gcggccgacg gcgacgactc gctgtacccc 181 atcgcggtgc tcatagacga actccgcaat gaggacgttc agcttcgcct caacagcatc 241 aagaagctgt ccaccatcgc cttggccctt ggggttgaaa ggacccgaag tgagcttctg 301 cctttcctta cagataccat ctatgatgaa gatgaggtcc tcctggccct ggcagaacag 361 ctgggaacct tcactaccct ggtgggaggc ccagagtacg tgcactgcct gctgccaccg 421 ctggagtcgc tggccacagt ggaggagaca gtggtgcggg acaaggcagt ggagtcctta 481 cgggccatct cacacgagca ctcgccctct gacctggagg cgcactttgt gccgctagtg 541 aagcggctgg cgggcggcga ctggttcacc tcccgcacct cggcctgcgg cctcttctcc 601 gtctgctacc cccgagtgtc cagtgctgtg aaggcggaac ttcgacagta cttccggaac 661 ctgtgctcag atgacacccc catggtgcgg cgggccgcag cctccaagct gggggagttt 721 gccaaggtgc tggagctgga caacgtcaag agtgagatca tccccatgtt ctccaacctg 781 gcctctgacg agcaggactc ggtgcggctg ctggcggtgg aggcgtgcgt gaacatcgcc 841 cagcttctgc cccaggagga tctggaggcc ctggtgatgc ccactctgcg ccaggccgct 901 gaagacaagt cctgggccgt ccgctacatg gtggctgaca agttcacaga gctccagaaa 961 gcagtggggc ctgagatcac caagacagac ctggtccctg ccttccagaa cctgatgaaa 1021 gactgtgagg ccgaggtgag ggccgcagcc tcccacaagg tcaaagagtt ctgtgaaaac 1081 ctctcagctg actgtcggga gaatgtgatc atgtcccaga tcttgccctg catcaaggag 1141 ctggtgtccg atgccaacca acatgtcaag tctgccctgg cctcagtcat catgggtctc 1201 tctcccatct tgggcaaaga caacaccatc gagcacctct tgcccctctt cctggctcag 1261 ctgaaggatg agtgccctga ggtacggctg aacatcatct ctaacctgga ctgtgtgaac 1321 gaggtgattg gcatccggca gctgtcccag tccctgctcc ctgccattgt ggagctggct 1381 gaggacgcca agtggcgggt gcggctggcc atcattgagt acatgcccct cctggctgga 1441 cagctgggag tggagttctt tgatgagaaa cttaactcct tgtgcatggc ctggcttgtg 1501 gatcatgtat atgccatccg cgaggcagcc accagcaacc tgaagaagct agtggaaaag 1561 tttgggaagg agtgggccca tgccacaatc atccccaagg tcttggccat gtccggagac 1621 cccaactacc tgcaccgcat gactacgctc ttctgcatca atgtgctgtc tgaggtctgt 1681 gggcaggaca tcaccaccaa gcacatgcta cccacggttc tgcgcatggc tggggacccg 1741 gttgccaatg tccgcttcaa tgtggccaag tctctgcaga agatagggcc catcctggac 1801 aacagcacct tgcagagtga agtcaagccc atcctagaga agctgaccca ggaccaggat 1861 gtggacgtca aatactttgc ccaggaggct ctgactgttc tgtctctcgc ctgatgctgg 1921 aagaggagca aacactggcc tctggtgtcc accctccaac ccccacaagt ccctctttgg 1981 ggagacactg gggggccttt ggctgtcact ccctgtgcat ggtctgaccc caggcccctt 2041 cccccagcac ggttcctcct ctccccagcc tgggaagatg tctcactgtc cacctcccaa 2101 cggctagggg agcacggggt tggacaggac agtgaccttg ggaggaaggg gctactccgc 2161 catccttaaa agccatggag ccggaggtgg caattcaccg aattc // LOCUS HUMP40MOV 1602 bp mRNA PRI 01-OCT-1996 DEFINITION Human mRNA for proteasome subunit p40 / Mov34 protein, complete cds. ACCESSION D50063 NID g971269 KEYWORDS proteasome subunit p40 / Mov34 protein. SOURCE Homo sapiens cell_line HepG2 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1602) AUTHORS Tsurumi,C., DeMartino,G.N., Slaughter,C., Shimbara,N. and Tanaka,K. TITLE cDNA cloning of p40, a regulatory subunit of the human 26S proteasome, and a homolog of the Mov-34 gene product JOURNAL Biochemical and Biophysical Research Communication 210, 600-608 (1995) REFERENCE 2 (bases 1 to 1602) AUTHORS Tanaka,K. TITLE Direct Submission JOURNAL Submitted (06-APR-1995) to the DDBJ/EMBL/GenBank databases. Keiji Tanaka, The University of Tokushima, Institute for Enzyme Research; 3-18-15 Kuramoto-cho, Tokushima, Tokushima 770, Japan (Tel:0886-31-3111(ex.2563), Fax:0886-33-4223) FEATURES Location/Qualifiers source 1..1602 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HepG2" CDS 84..1058 /codon_start=1 /product="proteasome subunit p40 / Mov34 protein" /db_xref="PID:d1009401" /db_xref="PID:g971270" /translation="MPELAVQKVVVHPLVLLSVVDHFNRIVKVGNQKRVVGVLLGSWQ KKVLDVSNSFAVPFDEDDKDDSVWFLDHDYLENMYGMFKKVNARERIVGWYHTGPKLH KNDIAINELMKRYCPNSVLVIIDVKPKDLGLPTEAYISVEEDQDDGTPTSKTFEHVTS EIGAEEAEEVGVEHLLRDIKDTTVGTLSQRITNQVHGLKGLNSKLLDIRSYLEKVGTG KLPINHQIIYQLQDVFNLLPDVSLQEFVKAFYLKTNDQMVVVYLASLIRSVVALHNLI NNKIANRDAEKKEGQEKEESKKDRKEDKEKDKDKEKSDVKKEEKKEKK" BASE COUNT 482 a 322 c 401 g 397 t ORIGIN 1 ggaaaaaggg taccggtgac cgctactgct gccggtgttt gcgtgtggca gggagccagt 61 cctggcgagc ggggtgtgtc gcgatgccgg agctggcagt gcagaaggtg gtggtccacc 121 ccctggtgct gctcagtgtg gtggatcatt tcaaccgaat cgtcaaggtt ggaaaccaga 181 agcgtgtagt tggtgtgctt ttggggtcat ggcaaaagaa agtacttgat gtatcgaaca 241 gttttgcagt tccttttgat gaagatgaca aagacgattc tgtatggttt ttagaccatg 301 attatttgga aaacatgtat ggaatgttta agaaagtcaa tgccagggaa agaatagttg 361 gctggtacca cacaggccct aaactacaca agaatgacat tgccatcaac gaactcatga 421 aaagatactg tcctaattcc gtattggtca tcattgatgt gaagccgaag gacctagggc 481 tgcctacaga agcgtacatt tcagtggaag aagaccaaga tgatggaact ccaacctcga 541 aaacatttga acacgtgacc agtgaaattg gagcagagga agctgaggaa gttggagttg 601 aacacttgtt acgagatatc aaagacacga cggtgggcac tctgtcccag cggatcacaa 661 accaggtcca tggtttgaag ggactgaact ccaagcttct ggatatcagg agctacctgg 721 aaaaagtcgg cacaggcaag ctgcccatca accaccagat catctaccag ctgcaggacg 781 tcttcaacct gctgccagat gtcagcctgc aggagttcgt caaggccttt tacctgaaga 841 ccaatgacca gatggtggta gtgtacttgg cctcgctgat ccgttccgtg gtcgccctgc 901 acaacctcat caacaacaag attgccaacc gggatgcaga gaagaaagaa gggcaggaga 961 aagaagagag caaaaaggat aggaaagagg acaaggagaa agataaagat aaggaaaaga 1021 gtgatgtaaa gaaagaggag aaaaaggaga aaaagtaaaa catgtattaa atagcttttt 1081 tactttgtaa attaaaatct tacaaactaa atcagtgtgc tgctagaggg ttctttttca 1141 cttgacatgc ttattagaaa gctgacccaa caagagctct ctgcctccgg tcactcttgc 1201 tgtggtgcta cgtggaagtg aatggagact gatctcaaat ctgaactgca gctttccctc 1261 ctgtgagttg gggaaatgat agtcaactca gccttcagat tgtatgagaa aaatgaagag 1321 aagccaccaa atattttggt actcttcatt catttatctc taaaaccagg agttgaattt 1381 tcctcatctt gaaagactct tggggtctgt ttctggtatt ttacaaaatt gctaagtgga 1441 atgcatgaat tgcattatgt tctctggtaa cacgtagagt tcagaccctt ctgaactctg 1501 ttgataatac cacaccatgt tctggaccca tagctctggc atcctcaggg gttgtgatcc 1561 agctccatat attgtttacc ttcaaagata caattaaata ac // LOCUS HUMP4K 3034 bp mRNA PRI 06-DEC-1994 DEFINITION Homo sapiens phosphatidylinositol 4-kinase mRNA, complete cds. ACCESSION L36151 NID g598192 KEYWORDS phosphatidylinositol 4-kinase; phosphatidylinositol kinase. SOURCE Homo sapiens placenta cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3034) AUTHORS Wong,K. and Cantley,L. TITLE Cloning and characterization of a human phosphatidylinositol 4-kinase JOURNAL J. Biol. Chem. 269, 28878-28884 (1994) MEDLINE 95050701 FEATURES Location/Qualifiers source 1..3034 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" CDS 118..2682 /EC_number="2.7.1.68" /codon_start=1 /product="phosphatidylinositol 4-kinase" /db_xref="PID:g598193" /translation="MREMAGAWHMTVEQKFGLFSAEIKEADPLAASEASQPKPCPPEV TPHYIWIDFLVQRFEIAKYCSSDQVEIFSSLLQRSMSLNIGGAKGSMNRHVAAIGPRF KLLTLGLSLLHADVVPNATIRNVLREKIYSTAFDYFSCPPKFPTQGEKRLREDISIMI KFWTAMFSDKKYLTASQLVPPDNQDTRSNLDITVGSRQQATQGWINTYPLSSGMSTIS KKSGMSKKTNRGSQLHKYYMKRRTLLLSLLATEIERLITWYNPLSAPELELDQAGENS VANWRSKYISLSEKQWKDNVNLAWSISPYLAVQLPARFKNTEAIGNEVTRLVRLDPGA VSDVPEAIKFLVTWHTIDADAPELSHVLCWAPTDPPTGLSYFSSMYPPHPLTAQYGVK VLRSFPPDAILFYIPQIVQALRYDKMGYVREYILWAASKSQLLAHQFIWNMKTNIYLD EEGHQKDPDIGDLLDQLVEEITGSLSGPAKDFYQREFDFFNKITNVSAIIKPYPKGDE RKKACLSALSEVKVQPGCYLPSNPEAIVLDIDYKSGTPMQSAAKAPYLAKFKVKRCGV SELEKEGLRCRSDSEDECSTQEADGQKISWQAAIFKVGDDCRQDMLALQIIDLFKNIF QLVGLDLFVFPYRVVATAPGCGVIECIPDCTSRDQLGRQTDFGMYDYFTRQYGDESTL AFQQARYNFIRSMAAYSLLLFLLQIKDRHNGNIMLDKKGHIIHIDFGFMFESSPGGNL GWEPDIKLTDEMVMIMGGKMEATPFKWFMEMCVRGYLAVRPYMDAVVSLVTLMLDTGL PCFRGQTIKLLKHRFSPNMTEREAANFIMKVIQSCFLSNRSRTYDMIQYYQNDIPY" BASE COUNT 717 a 889 c 797 g 631 t ORIGIN 1 tggtttctaa atcttattca gtgagcacgt gttccttttt ttaattagag aaaacttgac 61 agtagcccct ttttgtgttt cacagaggag ctaacattcc ttctgtacct gtagttcatg 121 cgggagatgg caggggcctg gcacatgacg gtggagcaga aatttggcct gttttctgct 181 gagataaagg aagcagaccc cctggctgcc tcggaagcaa gtcaacccaa accctgtccc 241 cccgaagtga ccccccacta catctggatc gacttcctgg tgcagcggtt tgagatcgcc 301 aagtactgca gctctgacca agtggagatc ttctccagcc tgctgcagcg ctccatgtcc 361 ctgaacatcg gcggggccaa ggggagcatg aaccggcacg tggcggccat cgggccccgc 421 ttcaagctgc tgaccctggg gctgtccctc ctgcatgccg atgtggttcc aaatgcaacc 481 atccgcaatg tgcttcgcga gaagatctac tccactgcct ttgactactt cagctgtccc 541 ccaaagttcc ctactcaagg agagaagcgg ctgcgtgaag acataagcat catgattaaa 601 ttttggaccg ccatgttctc agataagaag tacctgaccg ccagccagct tgttccccca 661 gataatcagg acacccggag caacctggac ataactgtcg gctctcggca acaagccacc 721 caaggctgga tcaacacata ccccctgtcc agcggcatgt ccaccatctc caagaaatca 781 ggcatgtcta agaaaaccaa ccggggctcc cagctgcaca aatactacat gaagcgcagg 841 acgctgctgc tgtccctgct ggccactgag atcgagcgtc tcatcacatg gtacaacccg 901 ctgtcagccc cggaactgga actagaccag gccggagaga acagcgtggc caactggaga 961 tctaagtaca tcagcctgag tgagaagcag tggaaggaca acgtgaacct cgcctggagc 1021 atctctccct acctagccgt gcagctgcct gccaggttta agaacacaga agccattggg 1081 aacgaagtga cccgtctcgt tcggttggac ccgggagccg ttagtgatgt gcctgaagca 1141 atcaagttcc tggtcacctg gcacaccatc gacgccgatg ctccagagct cagccatgtg 1201 ctgtgctggg cgcccacgga cccacccaca ggcctctcct acttctccag catgtacccg 1261 ccgcaccctc tcacggcgca gtacggggtg aaagtcctgc ggtccttccc tccggacgcc 1321 atcctcttct acatccccca gattgtgcag gccctcaggt acgacaagat gggctatgtg 1381 cgggagtata ttctgtgggc agcgtctaaa tcccagcttc tggcacacca gttcatctgg 1441 aacatgaaga ctaacattta tctagatgaa gagggccacc agaaagaccc tgacatcggc 1501 gacctcctgg atcagttggt agaggagatc acaggctcct tgtccggccc agcgaaggac 1561 ttttaccagc gggagtttga tttctttaac aagatcacca acgtgtcggc tatcatcaag 1621 ccctacccta aaggcgacga gagaaagaag gcttgtctgt cggccctgtc tgaagtgaag 1681 gtgcagccgg gctgctacct gcccagcaac cctgaggcca ttgtgctgga catcgactac 1741 aagtctggga ccccgatgca gagtgctgca aaagccccat atctggccaa gttcaaggtg 1801 aagcgatgtg gagttagtga acttgaaaaa gaaggtctgc ggtgccgctc agactccgag 1861 gatgagtgca gcacgcagga ggccgacggc cagaagatct cctggcaggc agccatcttc 1921 aaggtgggag acgactgccg gcaggacatg ctggccctgc agatcatcga cctcttcaag 1981 aacatcttcc agctggtcgg cctggacctc tttgtttttc cctaccgcgt ggtggccact 2041 gcccctgggt gcggggtgat cgagtgcatc cccgactgca cctcccggga ccagctgggc 2101 cgccagacag acttcggcat gtacgactac ttcacacgcc agtacgggga tgagtccact 2161 ctggccttcc agcaggcccg ctacaacttc atccgaagca tggccgccta cagcctcctg 2221 ctgttcctgc tgcagatcaa ggacagacac aacggcaaca ttatgctgga caagaagggt 2281 catatcatcc acatcgactt tgggttcatg tttgaaagct cgccgggcgg caatctcggc 2341 tgggaacccg acatcaagct gacggatgag atggtgatga tcatgggggg caagatggag 2401 gccacaccct tcaagtggtt catggagatg tgtgtccgag gctacctggc tgtgcggccc 2461 tacatggacg cggtcgtctc cctggtcact ctcatgttgg acacgggcct gccctgtttt 2521 cgcggccaga caatcaagct cttgaagcac aggtttagcc ccaacatgac tgagcgcgag 2581 gctgcaaatt tcatcatgaa ggtcatccag agctgcttcc tcagcaacag gagccggacc 2641 tacgacatga tccagtacta tcagaatgac atcccctact gaggagggga ccttcgaggg 2701 cctctgcccc atgtgccctc aaagctgtcc cacaatcatg gagccctgcg acctccctgc 2761 cctgccgcca catgcagtgg aggagaggcc tgtggcccaa agaacctggt agcgcctcct 2821 ggggcagcac gtgggtggcg cagccttggt aacgccatgg actgcagcga caatcaatgg 2881 atggtgctgt ctatgcacag gtgtgagtcc tctgtttgca ctggacatat tccctacctg 2941 tcttatttca taggtacatg aagtattgtg tataaaaaaa gagataagat ttaaccaaca 3001 tcaacaaaat aaaaacccaa aatagtaaaa accc // LOCUS HUMP5 1882 bp mRNA PRI 17-JUN-1996 DEFINITION Human mRNA for protein disulfide isomerase-related protein P5, complete cds. ACCESSION D49489 NID g1136742 KEYWORDS P5; protein disulfide isomerase-related protein; PDI-related protein; ER resident protein-related; endoplasmic reticulum resident protein-related; thioredoxin-like domain. SOURCE Homo sapiens placenta cDNA to mRNA, clone_lib:lambda gt11. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1882) AUTHORS Hayano,T. and Kikuchi,M. TITLE Cloning and sequencing of the cDNA encoding human P5 JOURNAL Gene 164 (2), 377-378 (1995) MEDLINE 96069616 REFERENCE 2 (bases 1 to 1882) AUTHORS Hayano,T. TITLE Direct Submission JOURNAL Submitted (03-MAR-1995) to the DDBJ/EMBL/GenBank databases. Toshiya Hayano, Protein Engineering Research Institute, The Third Research Department; 6-2-3 Furuedai, Suita, Osaka 565, Japan (Tel:06-872-8200, Fax:06-872-8210) FEATURES Location/Qualifiers source 1..1882 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda gt11" /tissue_type="placenta" 5'UTR 1..94 CDS 95..1417 /function="unknown" /note="The transcript is amplified in hydroxyurea-resistant cells.; an endoplasmic reticulum-retention signal (ER-retention signal) at 1403-1414; two thioredoxin-like sequences (Trx-like motifs) at 254-271, 659-676" /codon_start=1 /product="human P5" /db_xref="PID:d1009061" /db_xref="PID:g1136743" /translation="MALLVLGLVSCTFFLAVNGLYSSSDDVIELTPSNFNREVIQSDS LWLVEFYAPWCGHCQRLTPEWKKAATALKDVVKVGAVDADKHHSLGGQYGVQGFPTIK IFGSNKNRPEDYQGGRTGEAIVDAALSALRQLVKDRLGGRSGGYSSGKQGRSDSSSKK DVIELTDDSFDKNVLDSEDVWMVEFYAPWCGHCKNLEPEWAAAASEVKEQTKGKVKLA AVDATVNQVLASRYGIRGFPTIKIFQKGESPVDYDGGRTRSDIVSRALDLFSDNAPPP ELLEIINEDIAKRTCEEHQLCVVAVLPHILDTGAAGRNSYLEVLLKLADKYKKKMWGW LWTEAGAQSELETALGIGGFGYPAMAAINARKMKFALLKGSFSEQGINEFLRELSFGR GSTAPVGGGAFPTIVEREPWDGRDGELPVEDDIDLSDVELDDLGKDEL" sig_peptide 95..151 /note="a putative signal sequence" 3'UTR 1418..1882 polyA_signal 1846..1851 polyA_site 1882 BASE COUNT 487 a 370 c 527 g 498 t ORIGIN 1 gaattcgggc gtgggcgcgg gggcgcggcg tgcggcacgc tgcagggctg aagcggcggc 61 ggcggtgggg actgcacgta gcccggcgct cggcatggct ctcctggtgc tcggtctggt 121 gagctgtacc ttctttctgg cagtgaatgg tctgtattcc tctagtgatg atgtgatcga 181 attaactcca tcaaatttca accgagaagt tattcagagt gatagtttgt ggcttgtaga 241 attctatgct ccatggtgtg gtcactgtca aagattaaca ccagaatgga agaaagcagc 301 aactgcatta aaagatgttg tcaaagttgg tgcagttgat gcagataagc atcattccct 361 aggaggtcag tatggtgttc agggatttcc taccattaag atttttggat ccaacaaaaa 421 cagaccagaa gattaccaag gtggcagaac tggtgaagcc attgtagatg ctgcgctgag 481 tgctctgcgc cagctcgtga aggatcgcct cgggggacgg agcggaggat acagttctgg 541 aaaacaaggc agaagtgata gttcaagtaa gaaggatgtg attgagctga cagacgacag 601 ctttgataag aatgttctgg acagtgaaga tgtttggatg gttgagttct atgctccttg 661 gtgtggacac tgcaaaaacc tagagccaga gtgggctgcc gcagcttcag aagtaaaaga 721 gcagacgaaa ggaaaagtga aactggcagc tgtggatgct acagtcaatc aggttctggc 781 ctcccgatac gggattagag gatttcctac aatcaagata tttcagaaag gcgagtctcc 841 tgtggattat gacggtgggc ggacaagatc cgacatcgtg tcccgggccc ttgatttgtt 901 ttctgataac gccccacctc ctgagctgct tgagattatc aacgaggaca ttgccaagag 961 gacgtgtgag gagcaccagc tctgtgttgt ggctgtgctg ccccatatcc ttgatactgg 1021 agctgcaggc agaaattctt atctggaagt tcttctgaag ttggcagaca aatacaaaaa 1081 gaaaatgtgg gggtggctgt ggacagaagc tggagcccag tctgaacttg agaccgcgtt 1141 ggggattgga gggtttgggt accccgccat ggccgccatc aatgcacgca agatgaaatt 1201 tgctctgcta aaaggctcct tcagtgagca aggcatcaac gagtttctca gggagctctc 1261 ttttgggcgt ggctccacgg cacctgtagg aggcggggct ttccctacca tcgttgagag 1321 agagccttgg gacggcaggg atggcgagct tcccgtggag gatgacattg acctcagtga 1381 tgtggagctt gatgacttag ggaaagatga gttgtgagag ccacaacaga ggcttcagac 1441 cattttcttt tcttgggagc cagtggattt ttccagcagt gaagggacat tctctacttt 1501 cttttcttgg gagccagtgg atttttccag cagtgaaggg acattctcta cactcagatg 1561 actctaccag tggcctttta accaagaagt agtacttgat tggtcatttg aaaacactgc 1621 aacagtgaac ttttgcatct caagaaaaca ttgaaaaatt ctatgaattg ttgtagccgg 1681 tgaattgagt cgtattctgt cacataatat tttgaagaaa acttggctgt cgaaacattt 1741 ttctctctga ctgctgcttg aatgttcttg gaggctgttt cttatgtatg ggtttttttt 1801 aatgtgatcc cttcatttga atattaatgg ctttttccat taaagaataa attggaaaaa 1861 agaaaaaaaa aaaaaggaat tc // LOCUS HUMP51MHR 2535 bp mRNA PRI 07-JAN-1995 DEFINITION Homo Sapiens P5-1 mRNA, complete cds. ACCESSION L06175 NID g189448 KEYWORDS . SOURCE Homo sapiens spleen cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2535) AUTHORS Vernet,C., Ribouchon,M.T., Chimini,G., Jouanolle,A.M., Sidibe,I. and Pontarotti,P. TITLE A novel coding sequence belonging to a new multicopy gene family mapping within the human MHC class I region JOURNAL Immunogenetics 38 (1), 47-53 (1993) MEDLINE 93216307 FEATURES Location/Qualifiers source 1..2535 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="spleen" /map="6p21.3" gene 305..736 /gene="P5-1" CDS 305..736 /gene="P5-1" /note="occurs in MHC class I region; ORF" /codon_start=1 /db_xref="PID:g189449" /translation="MCPRKWSCFGVPGVTHHPLEQPERSRASAPMLLRMSEHRNEALG NYLEMRLKSSFLRGLGSWKSNPLRLGGWTILLTLTMGQGEPGGPRIPGFHTNSSYPHC VTAAMPPPGDQDLLPVPGEEGTPLLTRWSLDTYCPIPLWQL" BASE COUNT 615 a 630 c 637 g 652 t 1 others ORIGIN 1 ggcccagacg ccaaggttgc gggtcatgga gtcccgaacc ctcctcctgc tgttctcggg 61 agccgtggcc ctgatccaga cctgggcaga ttacaattac aatcaaggca gaaatgatct 121 catttttaca ttacaactcc tggaaaaggc aatagactga gatgcaagtg tgcccccaag 181 tgatgggcag aaggagagaa ggatgttttg gatgcattct agaacacagg taatctaagg 241 agagttgaty caaggcacgt aggaagagtc ctccagccac tattaggcca tcaaaggagt 301 cctcatgtgt cccaggaaat ggtcctgctt tggtgtccct ggtgtgaccc atcacccgct 361 ggaacagcct gagagaagta gggcctctgc accaatgctg ctgaggatgt cagagcacag 421 gaacgaggcc ttgggaaatt acctggaaat gcgactgaaa tcttccttcc tgaggggtct 481 gggctcttgg aaatcaaacc ctctcaggtt gggtggctgg acgattctcc tcacacttac 541 aatgggacaa ggggaaccag gaggtccaag gatccctggg ttccacacga actcctccta 601 ccctcattgt gtgacagcag ccatgcctcc tcctggggat caggatctat tacctgtgcc 661 tggagaggag gggactcctc ttctcacccg ctggtctctg gacacatact gtccaattcc 721 cctgtggcag ctgtaatgtg tagttcaatg ggcactcatt tgtccctttt aagggtaccc 781 tcctttagaa tccaggacct tctaccctgc agagtgtggt ttggggagag aagtgcaaaa 841 tcccacgaca ggtgagttga aggaatggga tatggagcca catccacttc caccccttgg 901 tatctggacc cacgtgttct tcctactgag attacagaac tgtagagatg tctttgattt 961 ttaaaatgca ccatgtcctg aaagatggca ccctcccacc cgcaggagtg cttcctgcaa 1021 gctggcgttg agctgtgcct atagaagctc ttttcaacat tctttatggt ccaggagccc 1081 ttggttggtg cagatggtga taggacccag tgggtcccac agcatggcca cactgcacct 1141 ccttcgctgt caagtgggtc ccccacgaag atactgcacg gagagcagtg ccaagcctgt 1201 ggatcaggaa tatcaacagc ccccagagag tggtgctggc tgagggtctg agagcaggac 1261 aggaaaaccc acctatggaa taggtgccta tccctgtgaa gatgaacctc tggcccttcc 1321 aggatggaag gagtgcaatg tagtcaactc ctcacttagg gtctggttgg tcacctaaag 1381 aaatagagcc ctaccaggga agatcattgg gttcaaatgc tgatgagtag gacatttaga 1441 ggtggcagtg tctggatcta ccttggtagg agggagtcag tactgttgga cccataggta 1501 gcctcatccc tgccactgtg tttgctccat ttatgtaccc acctacccgg cctgggctga 1561 ccatgggaag gctggctaat ttcagtgctt gtgcttggtt gttcagggcc atttcaggtt 1621 tgggtgtttt ctggggatgt taacatggga ttcaggctca actcacaaga aacttttcat 1681 ctcatgatgg atgctgttgg gcatgtccaa tgtatgactt catgagttac acagatgcta 1741 attcgtaggg gcacttggaa tcacatggtt gttttgtgtc ccatggtcaa gcattctatc 1801 ttatcagggc ctacagtaac atgccaaaag ttgcttccaa catatttctc tgctttggat 1861 ggggcatatt tctgtgctgt ggatgacatg gccttactcc agaatcccag gccctccact 1921 gtgactctcc tactggtgct tggttcagct ccaccccaaa tcttacccca ccactggcac 1981 tttcagcacc agggggtctg aaggatggtg actgcgccat ggcctggatc tgctgcagtg 2041 tcctttcctg tggaggctcc actcaaagct ggcatcctcc tatgtcacct agagtgtggg 2101 tcaaagcaat acacctacat gtagaatgtg atgtcagaac tcaaacaggc tcaccaggca 2161 gtgtgcttct tccttgcatg aggatgcaag atgcaacagt ttgtcttcac attggaagga 2221 cacccctgga tgcccctaac cactagacct gtaaaacttc actgcagtgg ccacttctga 2281 atctctgtaa ggtttattta tcttcacccc tctggagaga agatgtttta ccaaagcctc 2341 tagtgtaccg tcctcctctt actcatccat cccagtcaac atgatgttgt caatgaaata 2401 aaggaattta atattctata gtatatccag gttctccaga tctcttaaga ctgtactata 2461 gaggcctggg gaattataat agccctgagg caaactatga attaaagtgt tgtggatccc 2521 acatgaaaaa aaaaa // LOCUS HUMP5CR 1792 bp mRNA PRI 23-JAN-1992 DEFINITION Human pyrroline 5-carboxylate reductase mRNA, complete cds. ACCESSION M77836 NID g189497 KEYWORDS cytosolic enzyme; proline synthesis; pyrroline-5-carboxylate reductase. SOURCE Homo sapiens hepatoma cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1792) AUTHORS Dougherty,K.M., Brandriss,M.C. and Valle,D. TITLE Clong human pyrroline-5-carboxylate reductase cDNA by complementation in Saccharomyces cerevisiae JOURNAL J. Biol. Chem. 267, 871-875 (1992) MEDLINE 92112821 FEATURES Location/Qualifiers source 1..1792 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HepG2" /tissue_type="hepatoma" CDS 12..971 /codon_start=1 /product="pyrroline-5-carboxylate reductase" /db_xref="PID:g189498" /translation="MSVGFIGAGQLAFALAKGFTAAGVLAAHKIMASSPDMDLATVSA LRKMGVKLTPHNKETVQHSDVLFLAVKPHIIPFILDEIGADIEDRHIVVSCAAGVTIS SIEKKLSAFRPAPRVIRCMTNTPVVVREGATVYATGTHAQVEDGRLMEQLLSTVGFCT EVEEDLIDAVTGLSGSGPAYAFTALDALADGGVKMGLPRRLAVRLGAQALLGAAKMLL HSEQHPGQLKDNVSSPGGATIHALHVLESGGFRSLLINAVEASCIRTRELQSMADQEQ VSPAAIKKTILDKVKLDSPAGTALSPSGHTKLLPRSLAPAGKD" BASE COUNT 342 a 573 c 522 g 355 t ORIGIN Chromosome 17. 1 ctccggacag catgagcgtg ggcttcatcg gcgctggcca gctggctttt gccctggcca 61 agggcttcac agcagcaggc gtcttggctg cccacaagat aatggctagc tccccagaca 121 tggacctggc cacagtttct gctctcagga agatgggggt gaagttgaca ccccacaaca 181 aggagacggt gcagcacagt gatgtgctct tcctggctgt gaagccacac atcatcccct 241 tcatcctgga tgaaataggc gccgacattg aggacagaca cattgtggtg tcctgcgcgg 301 ccggcgtcac catcagctcc attgagaaga agctgtcagc gtttcggcca gcccccaggg 361 tcatccgctg catgaccaac actccagtcg tggtgcggga gggggccacc gtgtatgcca 421 caggcacgca cgcccaggtg gaggacggga ggctcatgga gcagctgctg agcacggtgg 481 gcttctgcac ggaggtggaa gaggacctga ttgatgccgt cacggggctc agtggcagcg 541 gccccgccta cgcattcaca gccctggatg ccctggctga tgggggtgtg aagatgggac 601 ttccaaggcg cctggcagtc cgcctcgggg cccaggccct cctgggggct gccaagatgc 661 tgctgcactc agaacagcac ccaggccagc tcaaggacaa cgtcagctct cctggtgggg 721 ccaccatcca tgccttgcat gtgctggaga gtgggggctt ccgctccctg ctcatcaacg 781 ctgtggaggc ctcctgcatc cgcacacggg agctgcagtc catggctgac caggagcagg 841 tgtcaccagc cgccatcaag aagaccatcc tggacaaggt gaagctggac tcccctgcag 901 ggaccgctct gtcgccttct ggccacacca agctgctccc ccgcagcctg gccccagcgg 961 gcaaggattg acacgtcctg cctgaccacc atcctgccac caccttctct tctcttgtca 1021 ctagggggac tagggggtcc ccaaagtggc ccactttctg tggctctgat cagcgcaggg 1081 gccagccagg gacatagcca gggaggggcc acatcacttc ccactggaaa tctctgtggt 1141 ctgcaagtgc ttcccagccc agaacagggg tggattcccc aacctcaacc tcctttcttc 1201 tctgctccca aaccatgtca ggaccacctt cctctagagc tcgggagccc ggagggtctt 1261 cacccactcc tactccagta tcagctggca cgggctcctt cctgagagca aaggtcaagg 1321 accccctctg tgaaggctca gcagaggtgg gatcccacgc cccctcccgg cccctccctg 1381 ccctccattc agggagaaac ctctccttcc cgtgtgagaa gggccagagg gtccaggcat 1441 cccaagtcca gcgtgaaggg ccacagcccc tcttggctgc caagcacgca gatcccatgg 1501 acatttgggg aaagggctcc ttgggctgct ggtgaacttc tgtggccacc acctcctgct 1561 cctgacctcc ctgggagggt gctatcagtt ctgtcctggc cctttcagtt ttataagttg 1621 gtttccagcc cccagtgtcc tgacttctgt ctgccacatg aggagggagg ccctgcctgt 1681 gtgggagggt ggttactgtg ggtggaatag tggaggcctt caactgatta gacaaggccc 1741 gcccacatct tggagggcat ctgccttact gattaaaatg tcaatgtaat ct // LOCUS HUMP62 2685 bp mRNA PRI 07-JAN-1995 DEFINITION Human p62 mRNA, complete cds. ACCESSION M88108 NID g189499 KEYWORDS . SOURCE Homo sapiens (tissue library: Clonetech human placenta lambda gt11 library HL1008b) fetal brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2685) AUTHORS Wong,G., Muller,O., Clark,R., Conroy,L., Moran,M.F., Polakis,P. and McCormick,F. TITLE Molecular cloning and nucleic acid binding properties of the GAP-associated tyrosine phosphoprotein p62 JOURNAL Cell 69 (3), 551-558 (1992) MEDLINE 92257595 FEATURES Location/Qualifiers source 1..2685 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="fetal brain" /tissue_lib="Clonetech human placenta lambda gt11 library HL1008b" gene 107..1438 /gene="p62" CDS 107..1438 /gene="p62" /codon_start=1 /product="p62" /db_xref="PID:g189500" /translation="MQRRDDPAARMSRSSGRSGSMDPSGAHPSVRQTPSRQPPLPHRS RGGGGGSRGGARASPATQPPPLLPPSATGPDATVGGPAPTPLLPPSATASVKMEPENK YLPELMAEKDSLDPSFTHAMQLLTAEIEKIQKGDSKKDDEENYLDLFSHKNMKLKERV LIPVKQYPKFNFVGKILGPQGNTIKRLQEETGAKISVLGKGSMRDKAKEEELRKGGDP KYAHLNMDLHVFIEVFGPPCEAYALMAHAMEEVKKFLVPDMMDDICQEQFLELSYLNG VPEPSRGRGVPVRGRGAAPPPPPVPRGRGVGPPRGALVRGTPVRGAITRGATVTRGVP PPPTVRGAPAPRARTAGIQRIPLPPPPAPETYEEYGYDDTYAEQSYEGYEGYYSQSQG DSEYYDYGHGEVQDSYEAYGQDDWNGTRPSLKAPPARPVKGAYREHPYGRY" BASE COUNT 743 a 610 c 649 g 683 t ORIGIN 1 ggcttcggtc gctaccgctc ccgctctgcc acccccgcca accgccgctc gggcctccgt 61 cgctgccgcg tcgctttctc gctccttgga tcgcacatcc tcccagatgc agcgccggga 121 cgaccccgcc gcgcgcatga gccggtcttc gggccgtagc ggctccatgg acccctccgg 181 tgcccacccc tcggtgcgtc agacgccgtc tcggcagccg ccgctgcctc accggtcccg 241 gggaggcgga gggggatccc gcgggggcgc ccgggcctcg cccgccacgc agccgccacc 301 gctgctgccg ccctcggcca cgggtcccga cgcgacagtg ggcgggccag cgccgacccc 361 gctgctgccc ccctcggcca cagcctcggt caagatggag ccagagaaca agtacctgcc 421 cgaactcatg gccgagaagg actcgctcga cccgtccttc actcacgcca tgcagctgct 481 gacggcagaa attgagaaga ttcagaaagg agactcaaaa aaggatgatg aggagaatta 541 cttggattta ttttctcata agaacatgaa actgaaagag cgagtgctga tacctgtcaa 601 gcagtatccc aagttcaatt ttgtggggaa gattcttgga ccacaaggga atacaatcaa 661 aagactgcag gaagagactg gtgcaaagat ctctgtattg ggaaagggct caatgagaga 721 caaagccaag gaggaagagc tgcgcaaagg tggagacccc aaatatgccc acttgaatat 781 ggatctgcat gtcttcattg aagtctttgg acccccatgt gaggcttatg ctcttatggc 841 ccatgccatg gaggaagtca agaaatttct agtaccggat atgatggatg atatctgtca 901 ggagcaattt ctagagctgt cctacttgaa tggagtacct gaaccctctc gtggacgtgg 961 ggtgccagtg agaggccggg gagctgcacc tcctccacca cctgttccca ggggccgtgg 1021 tgttggacca cctcgggggg ctttggtacg tggtacacca gtaaggggag ccatcaccag 1081 aggtgccact gtgactcgag gcgtgccacc cccacctact gtgaggggtg ctccagcacc 1141 aagagcacgg acagcgggca tccagaggat acctttgcct ccacctcctg caccagaaac 1201 atatgaagaa tatggatatg atgatacata cgcagaacaa agttacgaag gctacgaagg 1261 ctattacagc cagagtcaag gggactcaga atattatgac tatggacatg gggaggttca 1321 agattcttat gaagcttatg gccaggacga ctggaatggg accaggccgt cgctgaaggc 1381 ccctcctgct aggccagtga agggagcata cagagagcac ccatatggac gttattaaaa 1441 acaaacatga ggggaaaata tcagttatga gcaaagttgt tactgatttc ttgtatctcc 1501 caggattcct gttgctttac ccacaacaga caagtaattg tctaagtgtt tttcttcgtg 1561 gtccccttct tctccccacc ttattccatt cttaactctg cattctggct tctgtatgta 1621 gtattttaaa atgagttaaa atagatttag gaatattgaa ttaatttttt aagtgtgtag 1681 atgctttttt ctttgttgtt taaatataaa cagaagtgta ccttttataa taaaaaaaag 1741 aagttgagta aaaaaaaaaa acacacaaac ctgttagttt caaaaatgac attgcttgct 1801 taaaggttct gaagtaaagg cttgttaagt ttctcttagt tttgatttga ggcatcccgt 1861 aaagttgtag ttgcagaatc ccaaactagg ctacatttca aaattcaggg ctgtttaaga 1921 tttaaaatca caaacattaa cggcagtagg caccaccatg taaaagtgag ctcagacgtc 1981 tctaaaaaat gtttccttta taaaagcaca tggcggttga atcttaaggt taaattttaa 2041 tatgaaagat cctcatgaat taaatagttg atgcaatttt taacgttaat tgatataaaa 2101 aaaaaaacaa caaaattagg cttgtaaaac tgactttttc attacgtggg ttttgaaatc 2161 tagccccaga catactgtgt tgagagatac ttagagggag ggagtaggtt ttgaagaggt 2221 tgatggtggt ggggagggaa ggcctcctga attgagtttg atgcagagct ttttagccat 2281 gaagaatctt tcagtcatag tactaataat taaattttca gtatttaaaa agacaaagta 2341 ttttgtccat ttgagattct gcactccatg aaaagttcac ttggacgctg gggccaaaag 2401 ctgttgattt tcttaagttg acggttgtca atatatcgaa ctgttcccaa gttagtcaag 2461 tatgtctcaa cactagcatg atataaaaag ggacactgca gctgaatgaa aaaggaatca 2521 aaatccactt tgtacataag ttaaagtcct aattggattt gtaccgtcct cccattttgt 2581 tctcggaaga ttaaatgcta catgtgtaag tctgcctaaa taggtagctt aaacttatgt 2641 caaaatgtct gcagcagttt gtcaataaag tttagtcctt tttta // LOCUS HUMP65 3175 bp mRNA PRI 07-MAR-1995 DEFINITION Human 65-kilodalton phosphoprotein (p65) mRNA, complete cds. ACCESSION J02923 NID g189501 KEYWORDS phosphoprotein. SOURCE Human T lymphocyte, cDNA to mRNA , clone YZ3-5. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3175) AUTHORS Zu,Y.L., Shigesada,K., Nishida,E., Kubota,I., Kohno,M., Hanaoka,M. and Namba,Y. TITLE 65-kilodalton protein phosphorylated by interleukin 2 stimulation bears two putative actin-binding sites and two calcium-binding sites JOURNAL Biochemistry 29 (36), 8319-8324 (1990) MEDLINE 91070054 COMMENT Draft entry and computer-readable sequence for [Biochemistry (1990) In press] kindly provided by Y.Namba 06-JUL-1990. FEATURES Location/Qualifiers source 1..3175 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="YZ3-5" /cell_line="KUT-2" /cell_type="T lymphocyte" CDS 75..1958 /codon_start=1 /product="phosphoprotein p65" /db_xref="PID:g189502" /translation="MARGSVSDEEMMELREAFAKVDTDGNGYISFNELNDLFKAACLP LPGYRVREITENLMATGDLDQDGRISFDEFIKIFHGLKSTDVAKTFRKAINKKEGICA IGGTSEQSSVGTQHSYSEEEKYAFVNWINKALENDPDCRHVIPMNPNTNDLFNAVGDG IVLCKMINLSVPDTIDERTINKKKLTPFTIQENLNLALNSASAIGCHVVNIGAEDLKE GKPYLVLGLLWQVIKIGLFADIELSRNEALIALLREGESLEDLMKLSPEELLLRWANY HLENAGCNKIGNFSTDIKDSKAYYHLLEQVAPKGDEEGVPAVVIDMSGLREKDDIQRA ECMLQQAERLGCRQFVTATDVVRGNPKLNLAFIANLFNRYPALHKPENQDIDWGALEG ETREERTFRNWMNSLGVNPRVNHLYSDLSDALVIFQLYEKIKVPVDWNRVNKPPYPKL GGNMKKLENCNYAVELGKNQAKFSLVGIGGQDLNEGNRTLTLALIWQLMRRYTLNILE EIGGGQKVNDDIIVNWVNETLREAEKSSSISSFKDPKISTSLPVLDLIDAIQPGSINY DLLKTENLNDDEKLNNAKYAISMARKIGARVYALPEDLVEVNPKMVMTVFACLMGKGM KRV" BASE COUNT 893 a 706 c 752 g 824 t ORIGIN 1 ggttcctggt catacaccag tactaccaag gacagctttt ttcctgcaag atctgttacc 61 taaagcaata aaaaatggcc agaggatcag tgtccgatga ggaaatgatg gagctcagag 121 aagcttttgc caaagttgat actgatggca atggatacat cagcttcaat gagttgaatg 181 acttgttcaa ggctgcttgc ttgcctttgc ctgggtatag agtacgagaa attacagaaa 241 acctgatggc tacaggtgat ctggaccaag atggaaggat cagctttgat gagtttatca 301 agattttcca tggcctaaaa agcacagatg ttgccaagac ctttagaaaa gcaatcaata 361 agaaggaagg gatttgtgca atcggtggta cttcagagca gtctagcgtt ggcacccaac 421 actcctattc agaggaagaa aagtatgcct ttgtcaactg gataaacaaa gccctggaaa 481 atgatcctga ttgtcggcat gtcatcccaa tgaacccaaa cacgaatgat ctctttaatg 541 ctgttggaga tggcattgtc ctttgtaaaa tgatcaacct gtcagtgcca gacacaattg 601 atgaaagaac aatcaacaaa aagaagctaa cccctttcac cattcaggaa aatctgaact 661 tggctctgaa ctctgcctca gccatcgggt gccatgtggt caacataggg gctgaggacc 721 tgaaggaggg gaagccttat ctggtcctgg gacttctgtg gcaagtcatc aagattgggt 781 tgtttgctga cattgaactc agcagaaatg aagctctgat tgctcttttg agagaaggtg 841 agagcctgga ggatttgatg aaactctccc ctgaagagct cttgctgagg tgggctaatt 901 accacctgga aaatgcaggc tgcaacaaaa ttggcaactt cagtactgac atcaaggact 961 caaaagctta ttaccacctg cttgagcagg tggctccaaa aggagatgaa gaaggtgttc 1021 ctgctgttgt tattgacatg tcaggactgc gggagaagga tgacatccag agggcagaat 1081 gcatgctgca gcaggcggag aggctgggct gccggcagtt tgtcacagcc acagatgttg 1141 tccgagggaa ccccaagttg aacttggctt ttattgccaa cctctttaac agataccctg 1201 ccctgcacaa accagagaac caggacattg actggggggc tcttgaaggt gagacgagag 1261 aagagcggac atttaggaac tggatgaact ccctgggtgt taaccctcga gtcaatcatt 1321 tgtacagtga cttatcagat gccctggtca tcttccagct ctatgaaaag atcaaagttc 1381 ctgttgactg gaacagagta aacaaaccgc cataccccaa actgggaggc aatatgaaga 1441 agcttgagaa ttgtaactac gcggtagaat tggggaagaa tcaagcgaag ttctccctgg 1501 ttggcatcgg tggacaagat ctcaatgaag gaaaccgcac tctcacactg gccttgattt 1561 ggcagctaat gagaaggtat acactgaata tcctcgaaga aattggtggt ggccagaagg 1621 tcaatgatga cattattgtc aactgggtga atgaaacatt gagggaagca gagaaaagtt 1681 catccatctc tagtttcaag gacccgaaga ttagtacaag tctgcctgtt ctggacctca 1741 tcgatgccat ccaaccaggt tccattaact atgaccttct gaagacagaa aatctgaatg 1801 atgatgagaa actcaacaat gcaaaatatg ccatctctat ggcccgaaaa attggagcaa 1861 gagtgtatgc cctgccagaa gacctggttg aagtgaaccc caaaatggtc atgaccgtgt 1921 ttgcctgcct catggggaaa ggaatgaaga gggtgtgagg ccaatggggc tgggtgggag 1981 gcggtgcact cactcctgac tgcccggcac agatgctcca gggatgattc aagccattcc 2041 aaagttcaac ttggtgacac tctataagat tccaaaaagc acatattagt gcagccaagt 2101 agcctctcct gtatttaaca aaaagtgctt cattctttgc aggaggccca acctcctata 2161 tataggtttc tattcttgat ttatttgctt cttcgaaaat ctagaggaaa agaaagaagt 2221 tattttccag gtacccttct cgcttttgcc attagccaag gatagaagct gcagtggtat 2281 taattttgat ataatctttc aaaccagctt gttgtggctt cccttttctt tgttcaagat 2341 gagggccagg aggggaaaca tcacacctgc cctaaaccct gttcctggag gtcagcattt 2401 gatctgttgc aagcccctct ttctgtcccc tcttcctacc ctgcctccca tgactttgct 2461 cctcacactt ttggaaccat gccttccggg ggggcccatc tcttctggcg gtccttgtct 2521 ctgggccact tggagtgtgt gataaatcag tcaagctgtt gaagtctcag gagtctctgg 2581 tagcctgcag aagtaagcct catcatcaga gcctttcctc aaaactggag tcccaaatgt 2641 catcaggttt tgtttttttt cagccactaa gaacccctct gcttttaact ctagaatttg 2701 ggcttggacc agatctaaca tcttgaatac tctgccctct agagccttca gccttaatgg 2761 aaggttggat ccaaggaggt gtaatggaat cggaatcaag ccactcggca ggcatggagc 2821 tataactaag catccttagg gttctgcctc tccaggcatt agccctcaca ttagatctag 2881 ttactgtggt atggctaata cctgtcaaca tttggaggca atcctacctt gcttttgctt 2941 ctagagctta gcatatctga ttgttgtcag gccatattat caatgtttac ttttttggta 3001 ctataaaagc tttctgccac ccctaaactc caggggggac aatatgtgcc aatcaatagc 3061 acccctactc acatacacac acacctagcc agctgtcaag ggcagaatga atctatgctg 3121 gataagaaat ggtggaactg cgttatgaag agctaattta ctggacaaag aattc // LOCUS HUMP65NFKB 1767 bp mRNA PRI 02-DEC-1991 DEFINITION Human NF-kappa-B transcription factor p65 DNA binding subunit mRNA, complete cds. ACCESSION M62399 NID g189503 KEYWORDS NF-kappa-B transcription factor. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1767) AUTHORS Ruben,S.M., Dillon,P.J., Schreck,R., Henkel,T., Chen,C.-H., Maher,M., Baeuerle,P.A. and Rosen,C.A. TITLE A novel rel-related human cDNA that potentially encodes the 65 kDa subunit of NF-kappaB JOURNAL Science 251, 1490-1493 (1991) MEDLINE 91173312 FEATURES Location/Qualifiers source 1..1767 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 80..1735 /note="p50 DNA binding subunit" /codon_start=1 /product="NF-kappa-B transcription factor" /db_xref="PID:g189504" /translation="MDELFPLIFPAEPAQASGPYVEIIEQPKQRGMRFRYKCEGRSAG SIPGERSTDTTKTHPTIKINGYTGPGTVRISLVTKDPPHRPHPHELVGKDCRDGFYEA ELCPDRCIHSFQNLGIQCVKKRDLEQAISQRIQTNNNPFQVPIEEQRGDYDLNAVRLC FQVTVRDPSGRPLRLPPVLPHPIFDNRAPNTAELKICRVNRNSGSCLGGDEIFLLCDK VQKEDIEVYFTGPGWEARGSFSQADVHRQVAIVFRTPPYADPSLQAPVRVSMQLRRPS DRELSEPMEFQYLPDTDDRHRIEEKRKRTYETFKSIMKKSPFSGPTDPRPPPRRIAVP SRSSASVPKPAPQPYPFTSSLSTINYDEFPTMVFPSGQISQASALAPAPPQVLPQAPA PAPAPAMVSALAQAPAPVPVLAPGPPQAVAPPAPKPTQAGEGTLSEALLQLQFDDEDL GALLGNSTDPAVFTDLASVDNSEFQQLLNQGIPVAPHTTEPMLMEYPEAITRLVTGAQ RPPDPAPAPLGAPGLPNGLLSGDEDFSSIADMDFSALLSQISS" BASE COUNT 354 a 626 c 470 g 317 t ORIGIN 1 gaattccggc gaatggctcg tctgtagtgc acgccgcggg cccagctgcg accccggccc 61 cgcccccggg accccggcca tggacgaact gttccccctc atcttcccgg cagagccagc 121 ccaggcctct ggcccctatg tggagatcat tgagcagccc aagcagcggg gcatgcgctt 181 ccgctacaag tgcgaggggc gctccgcggg cagcatccca ggcgagagga gcacagatac 241 caccaagacc caccccacca tcaagatcaa tggctacaca ggaccaggga cagtgcgcat 301 ctccctggtc accaaggacc ctcctcaccg gcctcacccc cacgagcttg taggaaagga 361 ctgccgggat ggcttctatg aggctgagct ctgcccggac cgctgcatcc acagtttcca 421 gaacctggga atccagtgtg tgaagaagcg ggacctggag caggctatca gtcagcgcat 481 ccagaccaac aacaacccct tccaagttcc tatagaagag cagcgtgggg actacgacct 541 gaatgctgtg cggctctgct tccaggtgac agtgcgggac ccatcaggca ggcccctccg 601 cctgccgcct gtccttcctc atcccatctt tgacaatcgt gcccccaaca ctgccgagct 661 caagatctgc cgagtgaacc gaaactctgg cagctgcctc ggtggggatg agatcttcct 721 actgtgtgac aaggtgcaga aagaggacat tgaggtgtat ttcacgggac caggctggga 781 ggcccgaggc tccttttcgc aagctgatgt gcaccgacaa gtggccattg tgttccggac 841 ccctccctac gcagacccca gcctgcaggc tcctgtgcgt gtctccatgc agctgcggcg 901 gccttccgac cgggagctca gtgagcccat ggaattccag tacctgccag atacagacga 961 tcgtcaccgg attgaggaga aacgtaaaag gacatatgag accttcaaga gcatcatgaa 1021 gaagagtcct ttcagcggac ccaccgaccc ccggcctcca cctcgacgca ttgctgtgcc 1081 ttcccgcagc tcagcttctg tccccaagcc agcaccccag ccctatccct ttacgtcatc 1141 cctgagcacc atcaactatg atgagtttcc caccatggtg tttccttctg ggcagatcag 1201 ccaggcctcg gccttggccc cggcccctcc ccaagtcctg ccccaggctc cagcccctgc 1261 ccctgctcca gccatggtat cagctctggc ccaggcccca gcccctgtcc cagtcctagc 1321 cccaggccct cctcaggctg tggccccacc tgcccccaag cccacccagg ctggggaagg 1381 aacgctgtca gaggccctgc tgcagctgca gtttgatgat gaagacctgg gggccttgct 1441 tggcaacagc acagacccag ctgtgttcac agacctggca tccgtcgaca actccgagtt 1501 tcagcagctg ctgaaccagg gcatacctgt ggccccccac acaactgagc ccatgctgat 1561 ggagtaccct gaggctataa ctcgcctagt gacaggggcc cagaggcccc ccgacccagc 1621 tcctgctcca ctgggggccc cggggctccc caatggcctc ctttcaggag atgaagactt 1681 ctcctccatt gcggacatgg acttctcagc cctgctgagt cagatcagct cctaaggggg 1741 tgacgcctgc cctccccaga gcactgg // LOCUS HUMP68A 2562 bp mRNA PRI 29-JAN-1991 DEFINITION Human p68 kinase mRNA, complete cds. ACCESSION M35663 NID g189505 KEYWORDS p68 kinase. SOURCE Human interferon-treated Daudi cell, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2562) AUTHORS Meurs,E., Chong,K., Galabru,J., Thomas,N.S.B., Kerr,I.M., Williams,B.R. G. and Hovanessian,A.G. TITLE Molecular cloning and characterization of the human double-stranded RNA-activated protein kinase induced by interferon JOURNAL Cell 62, 379-390 (1990) MEDLINE 90322433 REFERENCE 2 (bases 1 to 2562) AUTHORS Meurs,E. TITLE Revision JOURNAL Unpublished (1991) COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.Meurs, 22-JUN-1990. FEATURES Location/Qualifiers source 1..2562 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 187..1842 /note="p68 kinase" /codon_start=1 /db_xref="PID:g189506" /translation="MAGDLSAGFFMEELNTYRQKQGVVLKYQELPNSGPPHDRRFTFQ VIIDGREFPEGEGRSKKEAKNAAAKLAVEILNKEKKAVSPLLLTTTNSSEGLSMGNYI GLINRIAQKKRLTVNYEQCASGVHGPEGFHYKCKMGQKEYSIGTGSTKQEAKQLAAKL AYLQILSEETSVKSDYLSSGSFATTCESQSNSLVTSTLASESSSEGDFSADTSEINSN SDSLNSSSLLMNGLRNNQRKAKRSLAPRFDLPDMKETKYTVDKRFGMDFKEIELIGSG GFGQVFKAKHRIDGKTYVIKRVKYNNEKAEREVKALAKLDHVNIVHYNGCWDGFDYDP ETSDDSLESSDYDPENSKNSSRSKTKCLFIQMEFCDKGTLEQWIEKRRGEKLDKVLAL ELFEQITKGVDYIHSKKLIHRDLKPSNIFLVDTKQVKIGDFGLVTSLKNDGKRTRSKG TLRYMSPEQISSQDYGKEVDLYALGLILAELLHVCDTAFETSKFFTDLRDGIISDIFD KKEKTLLQKLLSKKPEDRPNTSEILRTLTVWKKSPEKNERHTC" repeat_region 1177..1215 /note="39 bp direct repeat" repeat_region 1219..1257 /note="39 bp direct repeat" polyA_signal 2236..2241 polyA_signal 2440..2445 polyA_signal 2530..2535 BASE COUNT 843 a 478 c 500 g 741 t ORIGIN 1 cagtttctgg agcaaattca gtttgccttc ctggatttgt aaattgtaat gacctcaaaa 61 ctttagcagt tcttccatct gactcaggtt tgcttctctg gcggtcttca gaatcaacat 121 ccacacttcc gtgattatct gcgtgcattt tggacaaagc ttccaaccag gatacgggaa 181 gaagaaatgg ctggtgatct ttcagcaggt ttcttcatgg aggaacttaa tacataccgt 241 cagaagcagg gagtagtact taaatatcaa gaactgccta attcaggacc tccacatgat 301 aggaggttta catttcaagt tataatagat ggaagagaat ttccagaagg tgaaggtaga 361 tcaaagaagg aagcaaaaaa tgccgcagcc aaattagctg ttgagatact taataaggaa 421 aagaaggcag ttagtccttt attattgaca acaacgaatt cttcagaagg attatccatg 481 gggaattaca taggccttat caatagaatt gcccagaaga aaagactaac tgtaaattat 541 gaacagtgtg catcgggggt gcatgggcca gaaggatttc attataaatg caaaatggga 601 cagaaagaat atagtattgg tacaggttct actaaacagg aagcaaaaca attggccgct 661 aaacttgcat atcttcagat attatcagaa gaaacctcag tgaaatctga ctacctgtcc 721 tctggttctt ttgctactac gtgtgagtcc caaagcaact ctttagtgac cagcacactc 781 gcttctgaat catcatctga aggtgacttc tcagcagata catcagagat aaattctaac 841 agtgacagtt taaacagttc ttcgttgctt atgaatggtc tcagaaataa tcaaaggaag 901 gcaaaaagat ctttggcacc cagatttgac cttcctgaca tgaaagaaac aaagtatact 961 gtggacaaga ggtttggcat ggattttaaa gaaatagaat taattggctc aggtggattt 1021 ggccaagttt tcaaagcaaa acacagaatt gacggaaaga cttacgttat taaacgtgtt 1081 aaatataata acgagaaggc ggagcgtgaa gtaaaagcat tggcaaaact tgatcatgta 1141 aatattgttc actacaatgg ctgttgggat ggatttgatt atgatcctga gaccagtgat 1201 gattctcttg agagcagtga ttatgatcct gagaacagca aaaatagttc aaggtcaaag 1261 actaagtgcc ttttcatcca aatggaattc tgtgataaag ggaccttgga acaatggatt 1321 gaaaaaagaa gaggcgagaa actagacaaa gttttggctt tggaactctt tgaacaaata 1381 acaaaagggg tggattatat acattcaaaa aaattaattc atagagatct taagccaagt 1441 aatatattct tagtagatac aaaacaagta aagattggag actttggact tgtaacatct 1501 ctgaaaaatg atggaaagcg aacaaggagt aagggaactt tgcgatacat gagcccagaa 1561 cagatttctt cgcaagacta tggaaaggaa gtggacctct acgctttggg gctaattctt 1621 gctgaacttc ttcatgtatg tgacactgct tttgaaacat caaagttttt cacagaccta 1681 cgggatggca tcatctcaga tatatttgat aaaaaagaaa aaactcttct acagaaatta 1741 ctctcaaaga aacctgagga tcgacctaac acatctgaaa tactaaggac cttgactgtg 1801 tggaagaaaa gcccagagaa aaatgaacga cacacatgtt agagcccttc tgaaaaagta 1861 tcctgcttct gatatgcagt tttccttaaa ttatctaaaa tctgctaggg aatatcaata 1921 gatatttacc ttttatttta atgtttcctt taatttttta ctatttttac taatctttct 1981 gcagaaacag aaaggttttc ttctttttgc ttcaaaaaca ttcttacatt ttactttttc 2041 ctggctcatc tctttatttt tttttttttt ttttaaagac agagtctcgc tctgttgccc 2101 aggctggagt gcaatgacac agtcttggct cactgcaact tctgcctctt gggttcaagt 2161 gattctcctg cctcagcctc ctgagtagct ggattacagg catgtgccac ccacccaact 2221 aatttttgtg tttttaataa agacagggtt tcaccatgtt ggccaggctg gtctcaaact 2281 cctgacctca agtaatccac ctgcctcggc ctcccaaagt gctgggatta cagggatgag 2341 ccaccgcgcc cagcctcatc tctttgttct aaagatggaa aaaccacccc caaattttct 2401 ttttatacta ttaatgaatc aatcaattca tatctattta ttaaatttct accgctttta 2461 ggccaaaaaa atgtaagatc gttctctgcc tcacatagct tacaagccag ctggagaaat 2521 atggtactca ttaaaaaaaa aaaaaaaaag tgatgtacaa cc // LOCUS HUMP70S6KA 2346 bp mRNA PRI 31-OCT-1991 DEFINITION Human p70 ribosomal S6 kinase alpha-I mRNA, complete cds. ACCESSION M60724 NID g189507 KEYWORDS p70 ribosomal S6 kinase alpha-I. SOURCE Human liver hepatoma, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2346) AUTHORS Grove,J.R., Banerjee,P., Balasubramanyam,A., Coffer,P.J., Price,D.J., Avruch,J. and Woodgett,J.R. TITLE Cloning and expression of two human p70 S6 kinase polypeptides differing only at their amino termini JOURNAL Mol. Cell. Biol. 11, 5541-5550 (1991) MEDLINE 92017834 FEATURES Location/Qualifiers source 1..2346 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="hepatoma" /tissue_type="liver" CDS 28..1605 /codon_start=1 /product="p70 ribosomal S6 kinase alpha-I" /db_xref="PID:g189508" /translation="MRRRRRRDGFYPAPDFRDREAEDMAGVFDIDLDQPEDAGSEDEL EEGGQLNESMDHGGVGPYELGMEHCEKFEISETSVNRGPEKIRPECFELLRVLGKGGY GKVFQVRKVTGANTGKIFAMKVLKKAMIVRNAKDTAHTKAERNILEEVKHPFIVDLIY AFQTGGKLYLILEYLSGGELFMQLEREGIFMEDTACFYLAEISMALGHLHQKGIIYRD LKPENIMLNHQGHVKLTDFGLCKESIHDGTVTHTFCGTIEYMAPEILMRSGHNRAVDW WSLGALMYDMLTGAPPFTGENRKKTIDKILKCKLNLPPYLTQEARDLLKKLLKRNAAS RLGAGPGDAGEVQAHPFFRHINWEELLARKVEPPFKPLLQSEEDVSQFDSKFTRQTPV DSPDDSTLSESANQVFLGFTYVAPSVLESVKEKFSFEPKIRSPRRFIGSPRTPVSPVK FSPGDFWGRGASASTANPQTPVEYPMETSGIEQMDVTMSGEASAPLPIRQPNSGPYKK QAFPMISKRPEHLRMNL" BASE COUNT 750 a 453 c 558 g 585 t ORIGIN 1 gcacgaggct gcggcgggtc cgggcccatg aggcgacgaa ggaggcggga cggcttttac 61 ccagccccgg acttccgaga cagggaagct gaggacatgg caggagtgtt tgacatagac 121 ctggaccagc cagaggacgc gggctctgag gatgagctgg aggagggggg tcagttaaat 181 gaaagcatgg accatggggg agttggacca tatgaacttg gcatggaaca ttgtgagaaa 241 tttgaaatct cagaaactag tgtgaacaga gggccagaaa aaatcagacc agaatgtttt 301 gagctacttc gggtacttgg taaagggggc tatggaaagg tttttcaagt acgaaaagta 361 acaggagcaa atactgggaa aatatttgcc atgaaggtgc ttaaaaaggc aatgatagta 421 agaaatgcta aagatacagc tcatacaaaa gcagaacgga atattctgga ggaagtaaag 481 catcccttca tcgtggattt aatttatgcc tttcagactg gtggaaaact ctacctcatc 541 cttgagtatc tcagtggagg agaactattt atgcagttag aaagagaggg aatatttatg 601 gaagacactg cctgctttta cttggcagaa atctccatgg ctttggggca tttacatcaa 661 aaggggatca tctacagaga cctgaagccg gagaatatca tgcttaatca ccaaggtcat 721 gtgaaactaa cagactttgg actatgcaaa gaatctattc atgatggaac agtcacacac 781 acattttgtg gaacaataga atacatggcc cctgaaatct tgatgagaag tggccacaat 841 cgtgctgtgg attggtggag tttgggagca ttaatgtatg acatgctgac tggagcaccc 901 ccattcactg gggagaatag aaagaaaaca attgacaaaa tcctcaaatg taaactcaat 961 ttgcctccct acctcacaca agaagccaga gatctgctta aaaagctgct gaaaagaaat 1021 gctgcttctc gtctgggagc tggtcctggg gacgctggag aagttcaagc tcatccattc 1081 tttagacaca ttaactggga agaacttctg gctcgaaagg tggagccccc ctttaaacct 1141 ctgttgcaat ctgaagagga tgtaagtcag tttgattcca agtttacacg tcagacacct 1201 gtcgacagcc cagatgactc aactctcagt gaaagtgcca atcaggtctt tctgggtttt 1261 acatatgtgg ctccatctgt acttgaaagt gtgaaagaaa agttttcctt tgaaccaaaa 1321 atccgatcac ctcgaagatt tattggcagc ccacgaacac ctgtcagccc agtcaaattt 1381 tctcctgggg atttctgggg aagaggtgct tcggccagca cagcaaatcc tcagacacct 1441 gtggaatacc caatggaaac aagtggcata gagcagatgg atgtgacaat gagtggggaa 1501 gcatcggcac cacttccaat acgacagccg aactctgggc catacaaaaa acaagctttt 1561 cccatgatct ccaaacggcc agagcacctg cgtatgaatc tatgacagag caatgctttt 1621 aatgaattta aggcaaaaag gtggagaggg agatgtgtga gcatcctgca aggtgaaaca 1681 agactcaaaa tgacagtttc agagagtcaa tgtcattaca tagaacactt cggacacagg 1741 aaaaataaac gtggatttta aaaaatcaat caatggtgca aaaaaaaact taaagcaaaa 1801 tagtattgct gaactcttag gcacatcaat taattgattc ctcgcgacat ctttctcaac 1861 cttatcaagg attttcatgt tgatgactcg aaactgacag tattaagggt aggatgttgc 1921 tctgaatcac tgtgagtctg atgtgtgaag aagggtatcc tttcattagg caagtacaaa 1981 ttgcctataa tacttgcaac taaggacaaa ttagcatgca agcttggtca aacttttccc 2041 aggcaaaatg ggaaggcaaa gacaaaagaa acttaccaat tgatgtttta cgtgcaaaca 2101 acctgaatct tttttttata taaatatata tttttcaaat agatttttga ttcagctcat 2161 tatgaaaaac atcccaaact ttaaaatgcg aaattattgg ttggtgtgaa gaaagccaga 2221 caacttctgt ttcttctctt ggtgaaataa taaaatgcaa atgaatcatt gttaacacag 2281 ctgtggctcg tttgagggat tggggtggac ctggggttta ttttcagtaa cccagctgcg 2341 gagcct // LOCUS HUMP78A 2914 bp mRNA PRI 07-JAN-1995 DEFINITION Human protein p78 mRNA, complete cds. ACCESSION M80359 NID g189511 KEYWORDS protein p78. SOURCE Homo sapiens pancreas cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2914) AUTHORS Maheshwari,K.K., Som,S. and Parsa,I. TITLE Sequence of a cDNA encoding 78kD marker protein lost in chemically induced transplantable carcinoma and primary carcinoma of human pancreas JOURNAL Unpublished (1991) FEATURES Location/Qualifiers source 1..2914 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="pancreas" gene 172..2895 /gene="p78" CDS 172..2313 /gene="p78" /codon_start=1 /product="protein p78" /db_xref="PID:g189512" /translation="MSTRTPLPTVNERDTENHTSHGDGRQEVTSRTSRSGARCRNSIA SCADEQPHIGNYRLLKTIGKGNFAKVKLARHILTGREVAIKIIDKTQLNPTSLQKLFR EVRIMKILNHPNIVKLFEVIETQKTLYLIMEYASGGKVFDYLVAHGRMKEKEARSKFR QIVSAVQYCHQKRIVHRDLKAENLLLDADMNIKIADFGFSNEFTVGGKLDTFCGSPPY AAPELFQGKKYDGPEVDVWSLGVILYTLVSGSLPFDGQNLKELRERVLRGKYRIPFYM STDCENLLKRFLVLNPIKRGTLEQIMKDRWINAGHEEDELKPFVEPELDISDQKRIDI MVGMGYSQEEIQESLSKMKYDEITATYLLLGRKSSEVRPSSDLNNSTGQSPHHKVQRS VSSSQKQRRYSDHAGPGIPSVVAYPKRSQTSTADSDLKEDGISSRKSTGSAVGGKGIA PASPMLGNASNPNKADIPERKKSSTVPSSNTASGGMTRRNTYVCSERTTDDRHSVIQN GKENSTIPDQRTPVASTHSISSAATPDRIRFPRGTASRSTFHGQPRERRTATYNGPPA SPSLSHEATPLSQTRSRGSTTLFSKLTSKLTRSRNVSAKQKDENKEAKPRSLRFTWSM KTTSSMDPGDMMREIRKVLDANNCDYEQRERFLLFCVHGDGHAENLVQWEMEVCKLPR LSLNGVRFKRISGTSIAFKNIASKIANELKL" polyA_signal 2889..2895 /gene="p78" polyA_site 2914 /gene="p78" BASE COUNT 891 a 636 c 686 g 701 t ORIGIN 1 gacggcccgg gccaggcccg ggatctagaa cggccgtagg gggaagggag ccgccctccc 61 cacggcgcct tttcggaact gccgtggact cgaggacgct ggtcgccggc ctcctagggc 121 tgtgctgttt tgttttgacc ctcgcattgt gcagaattaa agtgcagtaa aatgtccact 181 aggaccccat tgccaacggt gaatgaacga gacactgaaa accacacgtc acatggagat 241 gggcgtcaag aagttacctc tcgtaccagc cgctcaggag ctcggtgtag aaactctata 301 gcctcctgtg cagatgaaca acctcacatc ggaaactaca gactgttgaa aacaatcggc 361 aaggggaatt ttgcaaaagt aaaattggca agacatatcc ttacaggcag agaggttgca 421 ataaaaataa ttgacaaaac tcagttgaat ccaacaagtc tacaaaagct cttcagagaa 481 gtaagaataa tgaagatttt aaatcatccc aatatagtga agttattcga agtcattgaa 541 actcaaaaaa cactctacct aatcatggaa tatgcaagtg gaggtaaagt atttgactat 601 ttggttgcac atggcaggat gaaggaaaaa gaagcaagat ctaaatttag acagattgtg 661 tctgcagttc aatactgcca tcagaaacgg atcgtacatc gagacctcaa ggctgaaaat 721 ctattgttag atgccgatat gaacattaaa atagcagatt tcggttttag caatgaattt 781 actgttggcg gtaaactcga cacgttttgt ggcagtcctc catacgcagc acctgagctc 841 ttccagggca agaaatatga cgggccagaa gtggatgtgt ggagtctggg ggtcatttta 901 tacacactag tcagtggctc acttcccttt gatgggcaaa acctaaagga actgagagag 961 agagtattaa gagggaaata cagaattccc ttctacatgt ctacagactg tgaaaacctt 1021 ctcaaacgtt tcctggtgct aaatccaatt aaacgcggca ctctagagca aatcatgaag 1081 gacaggtgga tcaatgcagg gcatgaagaa gatgaactca aaccatttgt tgaaccagag 1141 ctagacatct cagaccaaaa aagaatagat attatggtgg gaatgggata ttcacaagaa 1201 gaaattcaag aatctcttag taagatgaaa tacgatgaaa tcacagctac atatttgtta 1261 ttggggagaa aatcttcaga ggttaggccg agcagtgatc tcaacaacag tactggccag 1321 tctcctcacc acaaagtgca gagaagtgtt tcttcaagcc aaaagcaaag acgctacagt 1381 gaccatgctg gaccaggtat tccttctgtt gtggcgtatc cgaaaaggag tcagaccagc 1441 actgcagata gtgacctcaa agaagatgga atttcctccc ggaaatcaac tggcagtgct 1501 gttggaggaa agggaattgc tccagccagt cccatgcttg ggaatgcaag taatcctaat 1561 aaggcggata ttcctgaacg caagaaaagc tccactgtcc ctagtagtaa cacagcatct 1621 ggtggaatga cacgacgaaa tacttatgtt tgcagtgaga gaactacaga tgatagacac 1681 tcagtgattc agaatggcaa agaaaacagc actattcctg atcagagaac tccagttgct 1741 tcaacacaca gtatcagtag tgcagccacc ccagatcgaa tccgcttccc aagaggcact 1801 gccagtcgta gcactttcca cggccagccc cgggaacggc gaaccgcaac atataatggc 1861 cctcctgcct ctcccagcct gtcccatgaa gccacaccat tgtcccagac tcgaagccga 1921 ggctccacta ctctctttag taaattaact tcaaaactca caaggagtcg caatgtatct 1981 gctaagcaaa aagatgaaaa caaagaagca aagcctcgat ccctacgctt cacctggagc 2041 atgaaaacca ctagttcaat ggatcccggg gacatgatgc gggaaatccg caaagtgttg 2101 gacgccaata actgcgacta tgagcagagg gagcgcttct tgctcttctg cgtccacgga 2161 gatgggcacg cggagaacct cgtgcagtgg gaaatggaag tgtgcaagct gccaagactg 2221 tctctgaacg gggtccggtt taagcggata tcggggacat ccatagcctt caaaaatatt 2281 gcttccaaaa ttgccaatga gctaaagctg taacccagtg attatgatgt aaattaagta 2341 gcaagtaaag tgttttcctg aacactgatg gaaatgtata gaataatatt taggcaataa 2401 cgtctgcatc ttctaaatca tgaaattaaa gtctgaggac gagagcacgc ctgggagcga 2461 aagctggcct tttttctacg aatgcactac attaaagatg tgcaacctat gcgccccctg 2521 ccctacttcc gttaccctga gagtcggcgt gtggccccat ctccatgtgc ctcccgtctg 2581 ggtgggtgtg agagtggacg gtatgtgtgt gaagtggtgt atatggaagc atctccctac 2641 actggcagcc agtcattact agtacctctg cgggagatca tccggtgcta aaacattaca 2701 gttgccaagg aggaaaatac tgaatgactg ctaagaatta accttaagac cagttcatag 2761 ttaatacagg tttacagttc atgcctgtgg ttttgtgttt gttgttttgt gtttttttag 2821 tgcaaaaggt ttaaatttat agttgtgaac attgcttgtg tgtgtttttc taagtagatt 2881 cacaagataa ttaaaaattc actttttctc aggt // LOCUS HUMP8789R 2697 bp mRNA PRI 11-JAN-1996 DEFINITION Homo sapiens p87/89 gene, complete cds. ACCESSION L42572 NID g1160962 KEYWORDS transmembrane protein. SOURCE Homo sapiens (clone: As64) male cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2697) AUTHORS Gieffers,C., Korioth,F., Heimann,P. and Frey,J. TITLE Characterization of p87/89, a novel transmembraneprotein of the ER, which is expressed in two isoforms generated by alternative splicing JOURNAL Unpublished (1995) FEATURES Location/Qualifiers source 1..2697 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="As64" /cell_line="MG-63" /cell_type="Osteosarcoma" /sex="male" 5'UTR 1..92 /gene="p87/89" /note="putative" gene 1..2697 /gene="p87/89" mRNA 1..2697 /gene="p87/89" CDS 93..2369 /gene="p87/89" /note="endoplasmic reticulum protein" /codon_start=1 /evidence=experimental /product="transmembrane protein" /db_xref="PID:g1160963" /translation="MLRACQLSGVTAAAQSCLCGKFVLRPLRPCRRYSTSGSSGLTTG KIAGAGLLFVGGGIGGTILYAKWDSHFRESVEKTIPYSDKLFEMVLGPAAYNVPLPKK SIQSGPLKISSVSEVMKESKQSASQLQKQKGDTPASATAPTEAAQIISAAGDTLSVPA PAVQPEESLKTDHPEIGEGKPTPALSEEASSSSIRERPPEEVAARLAQQEKQEQVKIE SLAKSLEDALRQTASVTLQAIAAQNAAVQAVNAHSNILKAAMDNSEIAGEKKSAQWRT VEGALKERRKAVDEAADALLKAKEELEKMKSVIENAKKKEVAGAKPHITAAEGKLHNM IVDLDNVVKKVQAAQSEAKVVSQYHELVVQARDDFKRELDSITPEVLPGWKGMSVSDL ADKLSTDDLNSLIAHAHRRIDQLNRELAEQKATEKQHITLALEKQKLEEKRAFDSAVA KALEHHRSEIQAEQDRKIEEVRDAMENEMRTQLRRQAAAHTDHLRDVLRVQEQELKSE FEQNLSEKLSEQELQFRRLSQEQVDNFTLDINTAYARLRGIEQAVQSHAVAEEEARKA HQLWLSVEALKYSMKTSSAETPTIPLGSAVEAIKANCSDNEFTQALTAAIPPESLTRG VYSEETLRARFYAVQKLARRVAMIDETRNSLYQYFLSYLQSLLLFPPQQLKPPPELCP EDINTFKLLSYASYCIEHGDLELAAKFVNQLKGESRRVAQDWLKEARMTLETKQIVEI LTAYASAVGIGTTQVQPE" 3'UTR 2370..2697 /gene="p87/89" polyA_signal 2681..2686 /gene="p87/89" BASE COUNT 802 a 625 c 647 g 623 t ORIGIN 1 acgcgggcac gcacacacgg aagcacgcct ccacttaact cgcgccgccg cggcagctcg 61 agtccaccag cagcgccgtc cgcttgaccg agatgctgcg ggcctgtcag ttatcgggtg 121 tgaccgccgc cgcccagagt tgtctctgtg ggaagtttgt cctccgtcca ttgcgaccat 181 gccgcagata ctctacttca ggcagctctg ggttgactac tggcaaaatt gctggagctg 241 gccttttgtt tgttggtgga ggtattggtg gcactatcct atatgccaaa tgggattccc 301 atttccggga aagtgtagag aaaaccatac cttactcaga caaactcttc gagatggttc 361 ttggtcctgc agcttataat gttccattgc caaagaaatc gattcagtcg ggtccactaa 421 aaatctctag tgtatcagaa gtaatgaaag aatctaaaca gtctgcctca caactccaaa 481 aacaaaaggg agatactcca gcttcagcaa cagcacctac agaagcggct caaattattt 541 ctgcagcagg tgataccctg tcggtcccag cccctgcagt tcagcctgag gaatctttaa 601 aaactgatca ccctgaaatt ggtgaaggaa aacccacacc tgcactttca gaagaagcat 661 cctcatcttc tataagggag cgaccacctg aagaagttgc agctcgcctt gcacaacagg 721 aaaaacaaga acaagttaaa attgagtctc tagccaagag cttagaagat gctctgaggc 781 aaactgcaag tgtcactctg caggctattg cagctcagaa tgctgcggtc caggctgtca 841 atgcacactc caacatattg aaagccgcca tggacaattc tgagattgca ggcgagaaga 901 aatctgctca gtggcgcaca gtggagggtg cattgaagga acgcagaaag gcagtagatg 961 aagctgccga tgcccttctc aaagccaaag aagagttaga gaagatgaaa agtgtgattg 1021 aaaatgcaaa gaaaaaagag gttgctgggg ccaagcctca tataactgct gcagagggta 1081 aacttcacaa catgatagtt gatctggata atgtggtcaa aaaggtccaa gcagctcagt 1141 ctgaggctaa ggttgtatct cagtatcatg agctggtggt ccaagctcgg gatgacttta 1201 aacgagagct ggacagtatt actccagaag tccttcctgg atggaaagga atgagtgttt 1261 cagacttagc tgacaagctc tctactgatg atctgaactc cctcattgct catgcacatc 1321 gtcgtattga tcagctgaac agagagctgg cagaacagaa ggccaccgaa aagcagcaca 1381 tcacgttagc cttggagaaa caaaagctgg aagaaaagcg ggcatttgac tctgcagtag 1441 caaaagcatt agaacatcac agaagtgaaa tacaggctga acaggacaga aagatagaag 1501 aagtcagaga tgccatggaa aatgaaatga gaacccagct tcgccgacag gcagctgccc 1561 acactgatca cttgcgagat gtccttaggg tacaagaaca ggaattgaag tctgaatttg 1621 agcagaacct gtctgagaaa ctctctgaac aagaattaca atttcgtcgt ctcagtcaag 1681 agcaagttga caactttact ctggatataa atactgccta tgccagactc agaggaatcg 1741 aacaggctgt tcagagccat gcagttgctg aagaggaagc cagaaaagcc caccaactct 1801 ggctttcagt ggaggcatta aagtacagca tgaagacctc atctgcagaa acacctacta 1861 tcccgctggg tagtgcagtt gaggccatca aagccaactg ttctgataat gaattcaccc 1921 aagctttaac cgcagctatc cctccagagt ccctgacccg tggggtgtac agtgaagaga 1981 cccttagagc ccgtttctat gctgttcaaa aactggcccg aagggtagca atgattgatg 2041 aaaccagaaa tagcttgtac cagtacttcc tctcctacct acagtccctg ctcctattcc 2101 cacctcagca actgaagccg cccccagagc tctgccctga ggatataaac acatttaaat 2161 tactgtcata tgcttcctat tgcattgagc atggtgatct ggagctagca gcaaagtttg 2221 tcaatcagct gaagggggaa tccagacgag tggcacagga ctggctgaag gaagcccgaa 2281 tgaccctaga aacgaaacag atagtggaaa tcctgacagc atatgccagc gccgtaggaa 2341 taggaaccac tcaggtgcag ccagagtgag gtttaggaag attttcataa agtcatattt 2401 catgtcaaag gaaatcagca gtgatagatg aagggttcgc agcgagagtc ccggacttgt 2461 ctagaaatga gcaggtttac aagtactgtt ctaaatgtta acacctgttg catttatatt 2521 ctttccattt gctatcatgt cagtgaacgc caggagtgct ttctttgcaa cttgtgtaac 2581 attttctgtt ttttcaggtt ttactgatga ggcttgtgag gccaatcaaa ataatgtttg 2641 tgatctctac tactgttgat tttgccctcg gagcaaactg aataaagcaa caagatg // LOCUS HUMP971 2368 bp mRNA PRI 07-JAN-1995 DEFINITION Human melanoma-associated antigen p97 (melanotransferrin) mRNA, complete cds. ACCESSION M12154 NID g189515 KEYWORDS antigen; antigen p97; cell surface glycoprotein; glycoprotein; melanotransferrin. SEGMENT 1 of 2 SOURCE Human SK-MEL 28 melanoma cell, cDNA to mRNA, clones p972f1,p971j1 and p9710a1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2368) AUTHORS Rose,T.M., Plowman,G.D., Teplow,D.B., Dreyer,W.J., Hellstrom,K.E. and Brown,J.P. TITLE Primary structure of the human melanoma-associated antigen p97 (melanotransferrin) deduced from the mRNA sequence JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83 (5), 1261-1265 (1986) MEDLINE 86149285 COMMENT Draft entry and sequence in computer readable form for [1] kindly provided by T.M.Rose, 29-MAY-1986. FEATURES Location/Qualifiers source 1..2368 /organism="Homo sapiens" /db_xref="taxon:9606" /map="3q28-q29" mRNA <1..>2368 /note="p97 mRNA" gene 61..2277 /gene="MFI2" sig_peptide 61..117 /gene="MFI2" /note="G00-119-387" CDS 61..2277 /gene="MFI2" /codon_start=1 /db_xref="GDB:G00-119-387" /product="melanotransferrin" /db_xref="PID:g189518" /translation="MRGPSGALWLLLALRTVLGGMEVRWCATSDPEQHKCGNMSEAFR EAGIQPSLLCVRGTSADHCVQLIAAQEADAITLDGGAIYEAGKEHGLKPVVGEVYDQE VGTSYYAVAVVRRSSHVTIDTLKGVKSCHTGINRTVGWNVPVGYLVESGRLSVMGCDV LKAVSDYFGGSCVPGAGETSYSESLCRLCRGDSSGEGVCDKSPLERYYDYSGAFRCLA EGAGDVAFVKHSTVLENTDGKTLPSWGQALLSQDFELLCRDGSRADVTEWRQCHLARV PAHAVVVRADTDGGLIFRLLNEGQRLFSHEGSSFQMFSSEAYGQKDLLFKDSTSELVP IATQTYEAWLGHEYLHAMKGLLCDPNRLPPYLRWCVLSTPEIQKCGDMAVAFRRQRLK PEIQCVSAKSPQHCMERIQAEQVDAVTLSGEDIYTAGKKYGLVPAAGEHYAPEDSSNS YYVVAVVRRDSSHAFTLDELRGKRSCHAGFGSPAGWDVPVGALIQRGFIRPKDCDVLT AVSEFFNASCVPVNNPKNYPSSLCALCVGDEQGRNKCVGNSQERYYGYRGAFRCLVEN AGDVAFVRHTTVFDNTNGHNSEPWAAELRSEDYELLCPNGARAEVSQFAACNLAQIPP HAVMVRPDTNIFTVYGLLDKAQDLFGDDHNKNGFKMFDSSNYHGQDLLFKDATVRAVP VGEKTTYRGWLGLDYVAALEGMSSQQCSGAAAPAPGAPLLPLLLPALAARLLPPAL" mat_peptide 118..2274 /gene="MFI2" /note="G00-119-387" /product="melanotransferrin" BASE COUNT 427 a 766 c 769 g 406 t ORIGIN 409 bp upstream of SstI site; chromosome 3q28-q29. 1 gcggacttcc tcggacccgg acccagcccc agcccggccc cagccagccc cgacggcgcc 61 atgcggggtc cgagcggggc tctgtggctg ctcctggctc tgcgcaccgt gctcggaggc 121 atggaggtgc ggtggtgcgc cacctcggac ccagagcagc acaagtgcgg caacatgagc 181 gaggccttcc gggaagcggg catccagccc tccctcctct gcgtccgggg cacctccgcc 241 gaccactgcg tccagctcat cgcggcccag gaggctgacg ccatcactct ggatggagga 301 gccatctatg aggcgggaaa ggagcacggc ctgaagccgg tggtgggcga agtgtacgat 361 caagaggtcg gtacctccta ttacgccgtg gctgtggtca ggaggagctc ccatgtgacc 421 attgacaccc tgaaaggcgt gaagtcctgc cacacgggca tcaatcgcac agtgggctgg 481 aacgtgcccg tgggctacct ggtggagagc ggccgcctct cggtgatggg ctgcgatgta 541 ctcaaagctg tcagcgacta ttttgggggc agctgcgtcc cgggggcagg agagaccagt 601 tactctgagt ccctctgtcg cctctgcagg ggtgacagct ctggggaagg ggtgtgtgac 661 aagagccccc tggagagata ctacgactac agcggggcct tccggtgcct ggcggaaggg 721 gcaggggacg tggcttttgt gaagcacagc acggtactgg agaacacgga tgggaagacg 781 cttccctcct ggggccaggc cctgctgtca caggacttcg agctgctgtg ccgggatggt 841 agccgggccg atgtcaccga gtggaggcag tgccatctgg cccgggtgcc tgctcacgcc 901 gtggtggtcc gggccgacac agatgggggc ctcatcttcc ggctgctcaa cgaaggccag 961 cgtctgttca gccacgaggg cagcagcttc cagatgttca gctctgaggc ctatggccag 1021 aaggatctac tcttcaaaga ctctacctcg gagcttgtgc ccatcgccac acagacctat 1081 gaggcgtggc tgggccatga gtacctgcac gccatgaagg gtctgctctg tgaccccaac 1141 cggctgcccc cctacctgcg ctggtgtgtg ctctccactc ccgagatcca gaagtgtgga 1201 gacatggccg tggccttccg ccggcagcgc ctcaagccag agatccagtg cgtgtcagcc 1261 aagtcccccc aacactgcat ggagcggatc caggctgagc aggtcgacgc tgtgacccta 1321 agtggcgagg acatttacac ggcggggaag aagtacggcc tggttcccgc agccggcgag 1381 cactatgccc cggaagacag cagcaactcg tactacgtgg tggccgtggt gagacgggac 1441 agctcccacg ccttcacctt ggatgagctt cggggcaagc gctcctgcca cgccggtttc 1501 ggcagccctg caggctggga tgtccccgtg ggtgccctta ttcagagagg cttcatccgg 1561 cccaaggact gtgacgtcct cacagcagtg agcgagttct tcaatgccag ctgcgtgccc 1621 gtgaacaacc ccaagaacta cccctcctcg ctgtgtgcac tgtgcgtggg ggacgagcag 1681 ggccgcaaca agtgtgtggg caacagccag gagcggtatt acggctaccg cggcgccttc 1741 aggtgcctgg tggagaatgc gggtgacgtt gccttcgtca ggcacacaac cgtctttgac 1801 aacacaaacg gccacaattc cgagccctgg gctgctgagc tcaggtcaga ggactatgaa 1861 ctgctgtgcc ccaacggggc ccgagccgag gtgtcccagt ttgcagcctg caacctggca 1921 cagataccac cccacgccgt gatggtccgg cccgacacca acatcttcac cgtgtatgga 1981 ctgctggaca aggcccagga cctgtttgga gacgaccaca ataagaacgg gttcaaaatg 2041 ttcgactcct ccaactatca tggccaagac ctgcttttca aggatgccac cgtccgggcg 2101 gtgcctgtcg gagagaaaac cacctaccgc ggctggctgg ggctggacta cgtggcggcg 2161 ctggaaggga tgtcgtctca gcagtgctcg ggcgcagcgg ccccggcgcc cggggcgccc 2221 ctgctcccgc tgctgctgcc cgccctcgcc gcccgcctgc tcccgcccgc cctctgagcc 2281 cggccgcccc gccccagagc tccgatgccc gcccggggag tttccgcggc ggcctctcgc 2341 gctgcggaat ccagaaggaa gctcgcga // LOCUS HUMPACE4A 4403 bp mRNA PRI 07-JAN-1995 DEFINITION Human subtilisin-like protein (PACE4) mRNA, complete cds. ACCESSION M80482 NID g189531 KEYWORDS . SOURCE Homo sapiens (tissue library: lambda ZAP II) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4403) AUTHORS Kiefer,M.C., Tucker,J.E., Joh,R., Landsberg,K.E., Saltman,D. and Barr,P.J. TITLE Identification of a second human subtilisin-like protease gene in the fes/fps region of chromosome 15 JOURNAL DNA Cell Biol. 10 (10), 757-769 (1991) MEDLINE 92075167 FEATURES Location/Qualifiers source 1..4403 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HepG2" /tissue_lib="lambda ZAP II" sig_peptide 170..358 /gene="PACE4" CDS 170..3079 /gene="PACE4" /codon_start=1 /product="subtilisin-like protease" /db_xref="PID:g189532" /translation="MPPRAPPAPGPRPPPRAAAATDTAAGAGGAGGAGGAGGPGFRPL APRPWRWLLLLALPAACSAPPPRPVYTNHWAVQVLGGPAEADRVAAAHGYLNLGQIGN LEDYYHFYHSKTFKRSTLSSRGPHTFLRMDPQVKWLQQQEVKRRVKRQVRSDPQALYF NDPIWSNMWYLHCGDKNSRCRSEMNVQAAWKRGYTGKNVVVTILDDGIERNHPDLAPN YDSYASYDVNGNDYDPSPRYDASNENKHGTRCAGEVAASANNSYCIVGIAYNAKIGGI RMLDGDVTDVVEAKSLGIRPNYIDIYSASWGPDDDGKTVDGPGRLAKQAFEYGIKKGR QGLGSIFVWASGNGGREGDYCSCDGYTNSIYTISVSSATENGYKPWYLEECASTLATT YSSGAFYERKIVTTDLRQRCTDGHTGTSVSAPMVAGIIALALEANSQLTWRDVQHLLV KTSRPAHLKASDWKVNGAGHKVSHFYGFGLVDAEALVVEAKKWTAVPSQHMCVAASDK RPRSIPLVQVLRTTALTSACAEHSDQRVVYLEHVVVRTSISHPRRGDLQIYLVSPSGT KSQLLAKRLLDLSNEGFTNWEFMTVHCWGEKAEGQWTLEIQDLPSQVRNPEKQGKLKE WSLILYGTAEHPYHTFSAHQSRSRMLELSAPELEPPKAALSPSQVEVPEDEEDYTAQS TPGSANILQTSVCHPECGDKGCDGPNADQCLNCVHFSLGSVKTSRKCVSVCPLGYFGD TAARRCRRCHKGCETCSSRAATQCLSCRRGFYHHQEMNTCVTLCPAGFYADESQKNCL KCHPSCKKCVDEPEKCTVCKEGFSLARGSCIPDCEPGTYFDSELIRCGECHHTCGTCV GPGREECIHCAKNFHFHDWKCVPACGEGFYPEEMPGLPHKVCRRCDENCLSCAGSSRN CSRCKTGFTQLGTSCITNHTCSNADETFCEMVKSNRLCERKLFIQFCCRTCLLAG" gene 170..3079 /gene="PACE4" mat_peptide 359..3076 /gene="PACE4" /product="subtilisin-like protease" BASE COUNT 1016 a 1252 c 1214 g 921 t ORIGIN Chromosome 15q25. 1 cgggaacgcg ccgcggccgc ctcctcctcc ccggctcccg cccgcggcgg tgttggcggc 61 ggcggtggcg gcggcggcgg cgcttccccg gcgcggagcg gctttaaaag gcggcactcc 121 accccccggc gcactcgcag ctcgggcgcc gcgcgagcct gtcgccgcta tgcctccgcg 181 cgcgccgcct gcgcccgggc cccggccgcc gccccgggcc gccgccgcca ccgacaccgc 241 cgcgggcgcg gggggcgcgg ggggcgcggg gggcgccggc gggcccgggt tccggccgct 301 cgcgccgcgt ccctggcgct ggctgctgct gctggcgctg cctgccgcct gctccgcgcc 361 cccgccgcgc cccgtctaca ccaaccactg ggcggtgcaa gtgctgggcg gcccggccga 421 ggcggaccgc gtggcggcgg cgcacggcta cctcaacttg ggccagattg gaaacctgga 481 agattactac catttttatc acagcaaaac ctttaaaaga tcaaccttga gtagcagagg 541 ccctcacacc ttcctcagaa tggaccccca ggtgaaatgg ctccagcaac aggaagtgaa 601 acgaagggtg aagagacagg tgcgaagtga cccgcaggcc ctttacttca acgaccccat 661 ttggtccaac atgtggtacc tgcattgtgg cgacaagaac agtcgctgcc ggtcggaaat 721 gaatgtccag gcagcgtgga agaggggcta cacaggaaaa aacgtggtgg tcaccatcct 781 tgatgatggc atagagagaa atcaccctga cctggcccca aattatgatt cctacgccag 841 ctacgacgtg aacggcaatg attatgaccc atctccacga tatgatgcca gcaatgaaaa 901 taaacacggc actcgttgtg cgggagaagt tgctgcttca gcaaacaatt cctactgcat 961 cgtgggcata gcgtacaatg ccaaaatagg aggcatccgc atgctggacg gcgatgtcac 1021 agatgtggtc gaggcaaagt cgctgggcat cagacccaac tacatcgaca tttacagtgc 1081 cagctggggg ccggacgacg acggcaagac ggtggacggg cccggccgac tggctaagca 1141 ggctttcgag tatggcatta aaaagggccg gcagggcctg ggctccattt tcgtctgggc 1201 atctgggaat ggcgggagag agggggacta ctgctcgtgc gatggctaca ccaacagcat 1261 ctacaccatc tccgtcagca gcgccaccga gaatggctac aagccctggt acctggaaga 1321 gtgtgcctcc accctggcca ccacctacag cagtggggcc ttttatgagc gaaaaatcgt 1381 caccacggat ctgcgtcagc gctgtaccga tggccacact gggacctcag tctctgcccc 1441 catggtggcg ggcatcatcg ccttggctct agaagcaaac agccagttaa cctggaggga 1501 cgtccagcac ctgctagtga agacatcccg gccggcccac ctgaaagcga gcgactggaa 1561 agtaaacggc gcgggtcata aagttagcca tttctatgga tttggtttgg tggacgcaga 1621 agctctcgtt gtggaggcaa agaagtggac agcagtgcca tcgcagcaca tgtgtgtggc 1681 cgcctcggac aagagaccca ggagcatccc cttagtgcag gtgctgcgga ctacggccct 1741 gaccagcgcc tgcgcggagc actcggacca gcgggtggtc tacttggagc acgtggtggt 1801 tcgcacctcc atctcacacc cacgccgagg agacctccag atctacctgg tttctccctc 1861 gggaaccaag tctcaacttt tggcaaagag gttgctggat ctttccaatg aagggtttac 1921 aaactgggaa ttcatgactg tccactgctg gggagaaaag gctgaagggc agtggacctt 1981 ggaaatccaa gatctgccat cccaggtccg caacccggag aagcaaggga agttgaaaga 2041 atggagcctc atactgtatg gcacagcaga gcacccgtac cacaccttca gtgcccatca 2101 gtcccgctcg cggatgctgg agctctcagc cccagagctg gagccaccca aggctgccct 2161 gtcaccctcc caggtggaag ttcctgaaga tgaggaagat tacacagctc aatccacccc 2221 aggctctgct aatattttac agaccagtgt gtgccatccg gagtgtggtg acaaaggctg 2281 tgatggcccc aatgcagacc agtgcttgaa ctgcgtccac ttcagcctgg ggagtgtcaa 2341 gaccagcagg aagtgcgtga gtgtgtgccc cttgggctac tttggggaca cagcagcaag 2401 acgctgtcgc cggtgccaca aggggtgtga gacctgctcc agcagagctg cgacgcagtg 2461 cctgtcttgc cgccgcgggt tctatcacca ccaggagatg aacacctgtg tgaccctctg 2521 tcctgcagga ttttatgctg atgaaagtca gaaaaattgc cttaaatgcc acccaagctg 2581 taaaaagtgc gtggatgaac ctgagaaatg tactgtctgt aaagaaggat tcagccttgc 2641 acggggcagc tgcattcctg actgtgagcc aggcacctac tttgactcag agctgatcag 2701 atgtggggaa tgccatcaca cctgcggaac ctgcgtgggg ccaggcagag aagagtgcat 2761 tcactgtgcg aaaaacttcc acttccacga ctggaagtgt gtgccagcct gtggtgaggg 2821 cttctaccca gaagagatgc cgggcttgcc ccacaaagtg tgtcgaaggt gtgacgagaa 2881 ctgcttgagc tgtgcaggct ccagcaggaa ctgtagcagg tgtaagacgg gcttcacaca 2941 gctggggacc tcctgcatca ccaaccacac gtgcagcaac gctgacgaga cattctgcga 3001 gatggtgaag tccaaccggc tgtgcgaacg gaagctcttc attcagttct gctgccgcac 3061 gtgcctcctg gccgggtaag ggtgcctagc tgcccacaga gggcaggcac tcccatccat 3121 ccatccgtcc accttcctcc agactgtcgg ccagagtctg tttcaggagc ggcgccctgc 3181 acctgacagc tttatctccc caggagcagc atctctgagc acccaagcca ggtgggtggt 3241 ggctcttaag gaggtgttcc taaaatggtg atatcctctc aaatgctgct tgttggctcc 3301 agtcttccga caaactaaca ggaacaaaat gaattctggg aatccacagc tctggctttg 3361 gagcagcttc tgggaccata agtttactga atcttcaaga ccaaagcaga aaagaaaggc 3421 gcttggcatc acacatcact cttctccccg tgcttttctg cggctgtgta gtaaatctcc 3481 ccggcccagc tggcgaaccc tgggccatcc tcacatgtga caaagggcca gcagtctacc 3541 tgctcgttgc ctgccactga gcagtctggg gacggtttgg tcagactata aataagatag 3601 gtttgagggc ataaaatgta tgaccactgg ggccggagta tctatttcta catagtcagc 3661 tacttctgaa actgcagcag tggcttagaa agtccaattc caaagccaga ccagaagatt 3721 ctatcccccg cagcgctctc ctttgagcaa gccgagctct ccttgttacc gtgttctgtc 3781 tgtgtcttca ggagtctcat ggcctgaacg accacctcga cctgatgcag agccttctga 3841 ggagaggcaa caggaggcat tctgtggcca gccaaaaggt accccgatgg ccaagcaatt 3901 cctctgaaca aaatgtaaag ccagccatgc attgttaatc atccatcact tcccatttta 3961 tggaattgct tttaaaatac atttggcctc tgcccttcag aagactcgtt tttaaggtgg 4021 aaactcctgt gtctgtgtat attacaagcc tacatgacac agttggattt attctgccaa 4081 acctgtgtag gcattttata agctacatgt tctaattttt accgatgtta attattttga 4141 caaatatttc atatattttc attgaaatgc acagatctgc ttgatcaatt cccttgaata 4201 gggaagtaac atttgcctta aattttttcg acctcgtctt tctccatatt gtcctgctcc 4261 cctgtttgac gacagtgcat ttgccttgtc acctgtgagc tggagagaac ccagatgttg 4321 tttattgaat ctacaactct gaaagagaaa tcaatgaagc aagtacaatg ttaaccctaa 4381 attaataaaa gagttaacat ccc // LOCUS HUMPAFAA 852 bp mRNA PRI 03-FEB-1997 DEFINITION Human mRNA for platelet activating factor acetylhydrolase IB gamma-subunit, complete cds. ACCESSION D63391 NID g1122218 KEYWORDS platelet activating factor acetylhydrolase IB gamma-subunit. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 852) AUTHORS Adachi,H., Tsujimoto,M., Hattori,M., Arai,H. and Inoue,K. TITLE cDNA cloning of human cytosolic platelet-activating factor acetylhydrolase gamma-subunit and its mRNA expression in human tissues JOURNAL Biochem. Biophys. Res. Commun. 214 (1), 180-187 (1995) MEDLINE 95398632 REFERENCE 2 (bases 1 to 852) AUTHORS Adachi,H. TITLE Direct Submission JOURNAL Submitted (11-JUL-1995) to the DDBJ/EMBL/GenBank databases. Hideki Adachi, Suntory Institute for Biomedical Research; 1-1-1 Wakayamadai, Shimamoto-cho, Mishima-gun, Osaka 618, Japan (E-mail:adachi_h@minase.suntory.co.jp, Tel:075-962-9283, Fax:075-962-6448) COMMENT Sequence updated (08-DEC-1995) by: Hideki Adachi. FEATURES Location/Qualifiers source 1..852 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 114..809 /codon_start=1 /product="platelet activating factor acetylhydrolase IB gamma-subunit" /db_xref="PID:d1010352" /db_xref="PID:g1122219" /translation="MSGEENPASKPTPVQDVQGDGRWMSLHHRFVADSKDKEPEVVFI GDSLVQLMHQCEIWRELFSPLHALNFGIGGDGTQHVLWRLENGELEHIRPKIVVVWVG TNNHGHTAEQVTGGIKAIVQLVNERQPQARVVVLGLLPRGQHPNPLREKNRQVNELVR AALAGHPRAHFLDADPGFVHSDGTISHHDMYDYLHLSRLGYTPVCRALHSLLLRLLAQ DQGQGAPLLEPAP" BASE COUNT 167 a 266 c 259 g 160 t ORIGIN 1 ggacggtcct ttgttgccgc gaggggtagg agtgggcgtg gcggagccag ctccgttcgg 61 aacactcccg ggccgacccg actcgctcat cctgcaggag ctgcggcgcc aagatgagtg 121 gagaggagaa cccagccagc aagcccacgc cggtgcagga cgtacagggc gacgggcgct 181 ggatgtccct gcaccatcgg ttcgtggctg acagcaaaga taaggaaccc gaagtcgtct 241 tcatcgggga ctccttggtc cagctcatgc accagtgcga gatctggcgc gagctcttct 301 ctcctctgca tgcacttaac tttggcattg gtggtgacgg cacacagcat gtactgtggc 361 ggctggagaa tggggagctg gaacacatcc ggcccaagat tgtggtggtc tgggtgggca 421 ccaacaacca cggacacaca gcagagcagg tgactggtgg catcaaggcc attgtgcaac 481 tggtgaatga gcgacagccc caggcccggg ttgtggtgct gggcctgctt ccgcgaggcc 541 aacatcccaa cccacttcgg gagaagaacc gacaggtgaa cgagctggta cgggcggcac 601 tggctggcca ccctcgggcc cacttcctag atgccgaccc tggctttgtg cactcagatg 661 gcaccatcag ccatcatgac atgtatgatt acctgcatct gagccgcctg ggctacacac 721 ctgtttgccg ggctctgcac tccctgcttc tgcgtctgct ggcccaagac cagggccaag 781 gtgctcccct gctggagccc gcaccctaag catcctgctg ccttcccaca acattaaact 841 ctccttcctc ag // LOCUS HUMPAFR 1551 bp mRNA PRI 07-JAN-1995 DEFINITION Human platelet activating factor receptor mRNA, complete cds. ACCESSION M80436 NID g189537 KEYWORDS platelet activating factor receptor. SOURCE Homo sapiens (tissue library: lambda gt11 phage) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1551) AUTHORS Ye,R.D., Prossnitz,E.R., Zou,A.H. and Cochrane,C.G. TITLE Characterization of a human cDNA that encodes a functional receptor for platelet activating factor JOURNAL Biochem. Biophys. Res. Commun. 180 (1), 105-111 (1991) MEDLINE 92028922 FEATURES Location/Qualifiers source 1..1551 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="granulocyte" /tissue_lib="lambda gt11 phage" /map="Unassigned" gene 267..1295 /gene="PTAFR" CDS 267..1295 /gene="PTAFR" /codon_start=1 /db_xref="GDB:G00-128-806" /product="platelet activating factor receptor" /db_xref="PID:g189538" /translation="MEPHDSSHMDSEFRYTLFPIVYSIIFVLGVIANGYVLWVFARLY PCKKFNEIKIFMVNLTMADMLFLITLPLWIVYYQNQGNWILPKFLCNVAGCLFFINTY CSVAFLGVITYNRFQAVTRPIKTAQANTRKRGISLSLVIWVAIVGAASYFLILDSTNT VPDSAGSGNVTRCFEHYEKGSVPVLIIHIFIVFSFFLVFLIILFCNLVIIRTLLMQPV QQQRNAEVKRRALWMVCTVLAVFIICFVPHHVVQLPWTLAELGFQDSKFHQAINDAHQ VTLCLLSTNCVLDPVIYCFLTKKFRKHLTEKFYSMRSSRKCSRATTDTVTEVVVPFNQ IPGNSLKN" BASE COUNT 325 a 471 c 369 g 386 t ORIGIN 1 ctggtggcct ttaatacctg gctgttgctg aaaggtcttt agaaacggcg ctaacagcag 61 gtttgtggaa tgccggatcg ctcaacggcc tgacgtgggc aaaaacctcg ccttccgcac 121 ccatcattat attgatgctc attgccgccg ccttactggt acgccggatg cgcttgctgg 181 aaatgggaca cacggtcact gcagctgaag ccgctgcccc tgctacaggc accaccagga 241 ccagctgatc attccagccc acagcaatgg agccacatga ctcctcccac atggactctg 301 agttccgata cactctcttc ccgattgttt acagcatcat ctttgtgctc ggggtcattg 361 ctaatggcta cgtgctgtgg gtctttgccc gcctgtaccc ttgcaagaaa ttcaatgaga 421 taaagatctt catggtgaac ctcaccatgg cggacatgct cttcttgatc accctgccac 481 tttggattgt ctactaccaa aaccagggca actggatact ccccaaattc ctgtgcaacg 541 tggctggctg ccttttcttc atcaacacct actgctctgt ggccttcctg ggcgtcatca 601 cttataaccg cttccaggca gtaactcggc ccatcaagac tgctcaggcc aacacccgca 661 agcgtggcat ctctttgtcc ttggtcatct gggtggccat tgtgggagct gcatcctact 721 tcctcatcct ggactccacc aacacagtgc ccgacagtgc tggctcaggc aacgtcactc 781 gctgctttga gcattacgag aagggcagcg tgccagtcct catcatccac atcttcatcg 841 tgttcagctt cttcctggtc ttcctcatca tcctcttctg caacctggtc atcatccgta 901 ccttgctcat gcagccggtg cagcagcagc gcaacgctga agtcaagcgc cgggcgctgt 961 ggatggtgtg cacggtcttg gcggtgttca tcatctgctt cgtgccccac cacgtggtgc 1021 agctgccctg gacccttgct gagctgggct tccaggacag caaattccac caggccatta 1081 atgatgcaca tcaggtcacc ctctgcctcc ttagcaccaa ctgtgtctta gaccctgtta 1141 tctactgttt cctcaccaag aagttccgca agcacctcac cgaaaagttc tacagcatgc 1201 gcagtagccg gaaatgctcc cgggccacca cggatacggt cactgaagtg gttgtgccat 1261 tcaaccagat ccctggcaat tccctcaaaa attagtccct gcttccaggc ctgaagtctt 1321 ctcctccatg aacatcatgg actgagctgg gggaagaagg gatatctact gtggtctggg 1381 caccacctct gtgggcactg gtgggccatt agatttggag gctacctcac ctgggcaggg 1441 atgatggcag agccaggctg ttggaaaatc cagaactcaa atgagcccct tcatccgcct 1501 gtggggcata ctacagtaac tgtgacttga tgactttatc tgagtcctta t // LOCUS HUMPAHQ20X 2679 bp mRNA PRI 06-NOV-1995 DEFINITION Homo sapiens phenylalanine hydroxylase (PAH) mutant Q20stop mRNA. ACCESSION L47726 NID g1009165 KEYWORDS mutation; phenylalanine hydroxylase. SOURCE Homo sapiens (individual_isolate Germany) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2679) AUTHORS Consortium,P.A.H. TITLE Human phenylalanine hydroxylase (PAH) gene JOURNAL Unpublished (1995) COMMENT The PAH mutation data base can be found on WWW address: http://www.mcgill.ca/pahdb The information for the entries was kindly provided by the team at McGill University, E-mail address: mclh@musica.mcgill.ca Phone: (514) 934-4417 Fax: (514) 934-4329 Mail can be addressed to: A-717, McGill University Montreal Children's Hospital Research Institute 2300 Tupper Street Montreal, Quebec Canada H3H 1P3. FEATURES Location/Qualifiers source 1..2679 /organism="Homo sapiens" /isolate="Germany" /db_xref="taxon:9606" /map="12q22-q24.2" gene 473..532 /gene="PAH" CDS 473..532 /gene="PAH" /codon_start=1 /db_xref="GDB:G00-119-470" /db_xref="PID:g1009166" /translation="MSTAVLENPGLGRKLSDFG" mutation 530 /gene="PAH" /note="c (Gln) in wt/t (stop) in mutant; G00-119-470" BASE COUNT 761 a 615 c 566 g 737 t ORIGIN 1 cagctggggg taaggggggc ggattattca tataattgtt ataccagacg gtcgcaggct 61 tagtccaatt gcagagaact cgcttcccag gcttctgaga gtcccggaag tgcctaaacc 121 tgtctaatcg acggggcttg ggtggcccgt cgctccctgg cttcttccct ttacccaggg 181 cgggcagcga agtggtgcct cctgcgtccc ccacaccctc cctcagcccc tcccctccgg 241 cccgtcctgg gcaggtgacc tggagcatcc ggcaggctgc cctggcctcc tgcgtcagga 301 caagcccacg aggggcgtta ctgtgcggag atgcaccacg caagagacac cctttgtaac 361 tctcttctcc tccctagtgc gaggttaaaa ccttcagccc cacgtgctgt ttgcaaacct 421 gcctgtacct gaggccctaa aaagccagag acctcactcc cggggagcca gcatgtccac 481 tgcggtcctg gaaaacccag gcttgggcag gaaactctct gactttggat aggaaacaag 541 ctatattgaa gacaactgca atcaaaatgg tgccatatca ctgatcttct cactcaaaga 601 agaagttggt gcattggcca aagtattgcg cttatttgag gagaatgatg taaacctgac 661 ccacattgaa tctagacctt ctcgtttaaa gaaagatgag tatgaatttt tcacccattt 721 ggataaacgt agcctgcctg ctctgacaaa catcatcaag atcttgaggc atgacattgg 781 tgccactgtc catgagcttt cacgagataa gaagaaagac acagtgccct ggttcccaag 841 aaccattcaa gagctggaca gatttgccaa tcagattctc agctatggag cggaactgga 901 tgctgaccac cctggtttta aagatcctgt gtaccgtgca agacggaagc agtttgctga 961 cattgcctac aactaccgcc atgggcagcc catccctcga gtggaataca tggaggaaga 1021 aaagaaaaca tggggcacag tgttcaagac tctgaagtcc ttgtataaaa cccatgcttg 1081 ctatgagtac aatcacattt ttccacttct tgaaaagtac tgtggcttcc atgaagataa 1141 cattccccag ctggaagacg tttctcaatt cctgcagact tgcactggtt tccgcctccg 1201 acctgtggct ggcctgcttt cctctcggga tttcttgggt ggcctggcct tccgagtctt 1261 ccactgcaca cagtacatca gacatggatc caagcccatg tatacccccg aacctgacat 1321 ctgccatgag ctgttgggac atgtgccctt gttttcagat cgcagctttg cccagttttc 1381 ccaggaaatt ggccttgcct ctctgggtgc acctgatgaa tacattgaaa agctcgccac 1441 aatttactgg tttactgtgg agtttgggct ctgcaaacaa ggagactcca taaaggcata 1501 tggtgctggg ctcctgtcat cctttggtga attacagtac tgcttatcag agaagccaaa 1561 gcttctcccc ctggagctgg agaagacagc catccaaaat tacactgtca cggagttcca 1621 gcccctgtat tacgtggcag agagttttaa tgatgccaag gagaaagtaa ggaactttgc 1681 tgccacaata cctcggccct tctcagttcg ctacgaccca tacacccaaa ggattgaggt 1741 cttggacaat acccagcagc ttaagatttt ggctgattcc attaacagtg aaattggaat 1801 cctttgcagt gccctccaga aaataaagta aagccatgga cagaatgtgg tctgtcagct 1861 gtgaatctgt tgatggagat ccaactattt ctttcatcag aaaaagtccg aaaagcaaac 1921 cttaatttga aataacagcc ttaaatcctt tacaagatgg agaaacaaca aataagtcaa 1981 aataatctga aatgacagga tatgagtaca tactcaagag cataatggta aatcttttgg 2041 ggtcatcttt gatttagaga tgataatccc atactctcaa ttgagttaaa tcagtaatct 2101 gtcgcatttc atcaagatta attaaaattt gggacctgct tcattcaagc ttcatatatg 2161 ctttgcagag aactcataaa ggagcatata aggctaaatg taaaacacaa gactgtcatt 2221 agaattgaat tattgggctt aatataaatc gtaacctatg aagtttattt tctattttag 2281 ttaactatga ttccaattac tactttgtta ttgtacctaa gtaaattttc tttaggtcag 2341 aagcccatta aaatagttac aagcattgaa cttctttagt attatattaa tataaaaaca 2401 tttttgtatg ttttattgta atcataaata ctgctgtata aggtaataaa actctgcacc 2461 taatccccat aacttccagt atcattttcc aattaattat caagtctgtt ttgggaaaca 2521 ctttgaggac atttatgatg cagcagatgt tgactaaagg cttggttggt agatattcag 2581 gaaatgttca ctgaataaat aagtaaatac attattgaaa agcaaatctg tataaatgtg 2641 aaatttttat ttgtattagt aataaaacat tagtagttt // LOCUS HUMPALF01 1913 bp DNA PRI 07-JAN-1995 DEFINITION Human mutant prealbumin gene directly linked to familial amyloidotic polyneuropathy (FAP), exons 1 and 2. ACCESSION M15515 NID g189589 KEYWORDS prealbumin; transthyretin. SEGMENT 1 of 3 SOURCE Homo sapiens (individual_isolate FAP patient) liver DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1913) AUTHORS Maeda,S., Mita,S., Araki,S. and Shimada,K. TITLE Structure and expression of the mutant prealbumin gene associated with familial amyloidotic polyneuropathy JOURNAL Mol. Biol. Med. 3 (4), 329-338 (1986) MEDLINE 87038739 COMMENT Draft entry and printed copy of sequence [1] kindly provided by K.Shimada, 01-JUL-1987. The mutation causing familial amyloidotic polyneuropathy at position 1679 is due to a change of amino acid from valine 'gtg' to methionine 'atg'. FEATURES Location/Qualifiers source 1..1913 /organism="Homo sapiens" /isolate="FAP patient" /db_xref="taxon:9606" /tissue_type="liver" /map="18q11.2-q12.1" sig_peptide 608..667 /gene="TTR" /note="G00-119-471" exon <608..676 /gene="TTR" /note="G00-119-471" /number=1 intron 677..1600 /gene="TTR" /note="G00-119-471" /number=1 CDS 1265..1447 /gene="TTR" /note="ORF1" /codon_start=1 /db_xref="PID:g475791" /translation="MYGTLHLFSNSTQMETFNKATVLRGPIFSLKIHYTHPWLIAVCL EAETILALETITSVLY" CDS 1269..1382 /gene="TTR" /note="ORF2" /codon_start=1 /db_xref="PID:g475792" /translation="MVHYIFSVIPLKWRLLTKQLFSGDLFSPLKFIIHIPG" exon 1601..1731 /gene="TTR" /note="G00-119-471" /number=2 mutation 1679 /gene="TTR" /note="a in FAP mutant; g in normal alb gene" intron 1732..>1913 /gene="TTR" /note="G00-119-471" /number=2 BASE COUNT 552 a 372 c 382 g 607 t ORIGIN 1 bp upstream of HindIII site; chromosome 18q11.2-q12.1. 1 aagcttccaa atgacttagt ttggctaaaa tgtaggcttt taaaaatgtg agcactgcca 61 agggtttttc cttgttgacc catggatcca tcaagtgcaa acattttcta atgcactata 121 tttaagcctg tgcagctaga tgtcattcaa catgaaatac attattacaa cttgcatctg 181 tctaaaatct tgcatctaaa atgagagaca aaaaatctat aaaaatggaa aacatgcata 241 gaaatatgtg agggaggaaa aaattacccc caagaatgtt agtgcacgca gtcacacagg 301 gagaagacta tttttgtttt gttttgattg ttttgttttg ttttggttgt tttgttttgg 361 tgacctaact ggtcaaatga cctattaaga atatttcata gaacgaatgt tccgatgctc 421 taatctctct agacaaggtt catatttgta tgggttactt attctctctt tgttgactaa 481 gtcaataatc agaatcagca ggtttgcagt cagattggca gggataagca gcctagctca 541 ggagaagtga gtataaaagc cccaggctgg gagcagccat cacagaagtc cactcattct 601 tggcaggatg gcttctcatc gtctgctcct cctctgcctt gctggactgg tatttgtgtc 661 tgaggctggc cctacggtga gtgtttctgt gacatcccat tcctacattt aagattcacg 721 ctaaatgaag tagaagtgac tccttccagc tttgccaacc agcttttatt actagggcaa 781 gggtacccag catctatttt taatataatt aattcaaact tcaaaaagaa tgaagttcca 841 ctgagcttac tgagctggga cttgaactct gagcattcta cctcattgct ttggtgcatt 901 aggtttgtaa tatctggtac ctctgtttcc tcagatagat gatagaaata aagatatgat 961 attaaggaag ctgttaatac tgaattttca gaaaagtatc cctccataaa atgtatttgg 1021 gggacaaact gcaggagatt atattctggc cctatagtta ttcaaaacgt atttattgat 1081 taatctttaa aaggcttagt gaacaatatt ctagtcagat atctaattct taaatcctct 1141 agaagaatta actaatacta taaaatgggt ctggatgtag ttctgacatt attttataac 1201 aactggtaag agggagtgac tatagcaaca actaaaatga tctcaggaaa acctgtttgg 1261 ccctatgtat ggtacattac atcttttcag taattccact caaatggaga cttttaacaa 1321 agcaactgtt ctcaggggac ctattttctc ccttaaaatt cattatacac atccctggtt 1381 gatagcagtg tgtctggagg cagaaaccat tcttgctttg gaaacaatta cgtctgtgtt 1441 atactgagta gggaagctca ttaattgtcg acacttacgt tcctgataat gggatcagtg 1501 tgtaattctt gtttcgctcc agatttctaa taccacaaag aataaatcct ttcactctga 1561 tcaattttgt taacttctca cgtgtcttct ctacacccag ggcaccggtg aatccaagtg 1621 tcctctgatg gtcaaagttc tagatgctgt ccgaggcagt cctgccatca atgtggccat 1681 gcatgtgttc agaaaggctg ctgatgacac ctgggagcca tttgcctctg ggtaagttgc 1741 caaagaaccc tcccacagga cttggtttta tcttcccgtt tgcccctcac ttggtagaga 1801 gaggctcaca tcatctgcta aagaatttac aagtagattg aaaaacgtag gcagaggtca 1861 agtatgccct ctgaaggatg ccctcttttt gttttgctta gctaggaagt gac // LOCUS HUMPALF03 1062 bp DNA PRI 07-JAN-1995 DEFINITION Human mutant prealbumin gene directly linked to familial amyloidotic polyneuropathy (FAP), exon 4. ACCESSION M15517 NID g189591 KEYWORDS prealbumin; transthyretin. SEGMENT 3 of 3 SOURCE Homo sapiens (individual_isolate FAP patient) liver DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1062) AUTHORS Maeda,S., Mita,S., Araki,S. and Shimada,K. TITLE Structure and expression of the mutant prealbumin gene associated with familial amyloidotic polyneuropathy JOURNAL Mol. Biol. Med. 3 (4), 329-338 (1986) MEDLINE 87038739 COMMENT Draft entry and printed copy of sequence [1] kindly provided by K.Shimada, 01-JUL-1987. FEATURES Location/Qualifiers source 1..1062 /organism="Homo sapiens" /isolate="FAP patient" /db_xref="taxon:9606" /tissue_type="liver" /map="18q11.2-q12.1" gene join(M15515:608..1913,M15516:1..820,1..822) /gene="TTR" CDS join(M15515:608..676,M15515:1601..1731,M15516:398..533, 715..822) /gene="TTR" /note="precursor (FAP mutant)" /codon_start=1 /db_xref="GDB:G00-119-471" /product="prealbumin" /db_xref="PID:g387000" /translation="MASHRLLLLCLAGLVFVSEAGPTGTGESKCPLMVKVLDAVRGSP AINVAMHVFRKAADDTWEPFASGKTSESGELHGLTTEEEFVEGIYKVEIDTKSYWKAL GISPFHEHAEVVFTANDSGPRRYTIAALLSPYSYSTTAVVTNPKE" mat_peptide join(M15515:668..676,M15515:1601..1731,M15516:398..533, 715..819) /gene="TTR" /note="FAP mutant; G00-119-471" /product="prealbumin" intron <1..714 /gene="TTR" /note="G00-119-471" /number=3 CDS 88..237 /gene="TTR" /note="ORF3" /codon_start=1 /db_xref="PID:g475793" /translation="MLLLRISLLVIHKNILFFFMLENAKNRRVGESLGLETGDLPSYY GSIRM" CDS 155..364 /gene="TTR" /note="ORF4" /codon_start=1 /db_xref="PID:g475794" /translation="MQRIGGWGNLWAWRQETCLPTMVPSECRLGQYNNSSLVCSSVNW EECFQLQNAKSLSLWLAATIAAALQ" exon 715..>822 /gene="TTR" /note="prealbumin (FAP mutant); G00-119-471" /number=4 BASE COUNT 290 a 238 c 202 g 332 t ORIGIN About 1.4 kb after segment 2; chromosome 18q11.2-q12.1. 1 aagcttaaat gagctctagt gcatgcatat atatttcaaa attccaccat gatcttccac 61 actctgtatt gtaaatagag ccctgtaatg cttttacttc gtatttcatt gcttgttata 121 cataaaaata tacttttctt cttcatgtta gaaaatgcaa agaataggag ggtgggggaa 181 tctctgggct tggagacagg agacttgcct tcctactatg gttccatcag aatgtagact 241 gggacaatac aataattcaa gtctggtttg ctcatctgta aattgggaag aatgtttcca 301 gctccagaat gctaaatctc taagtctgtg gttggcagcc actattgcag cagctcttca 361 atgactcaat gcagttttgc attctcccta cctttttttt ctaaaaccaa taaaatagat 421 acagccttta ggctttctgg gatttccctt agtcaagcta gggtcatcct gactttcggc 481 gtgaatttgc aaaacaagac ctgactctgt actcctgctc taaggactgt gcatggttcc 541 aaaggcttag cttgccagca tatttgagct ttttccttct gttcaaactg ttccaaaata 601 taaaagaata aaattaatta agttggcact ggacttccgg tggtcagtca tgtgtgtcat 661 ctgtcacgtt tttcgggctc tggtggaaat ggatctgtct gtcttctctc ataggtggta 721 ttcacagcca acgactccgg cccccgccgc tacaccattg ccgccctgct gagcccctac 781 tcctattcca ccacggctgt cgtcaccaat cccaaggaat gagggacttc tcctccagtg 841 gacctgaagg acgagggatg ggatttcatg taaccaagag tattccattt ttactaaagc 901 agtgttttca cctcatatgc tatgttagaa gtccaggcag agacaataaa acattcctgt 961 gaaaggcact tttcattcca ctttaacttg attttttaaa ttcccttatt gtcccttcca 1021 aaaaaaagag aatcaaaatt ttacaaagaa tcaaaggaat tc // LOCUS HUMPAM12 3748 bp mRNA PRI 11-JAN-1991 DEFINITION Human peptidylglycine alpha-amidating monooxygenase mRNA, complete cds. ACCESSION M37721 NID g189594 KEYWORDS peptidylglycine alpha-amidating monooxygenase. SOURCE Human thyroid carcinoma, cDNA to mRNA, clones lambda-PAM[1-3]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3748) AUTHORS Glauder,J., Ragg,H., Rauch,J. and Engels,J.W. TITLE Human peptidylglycine alpha-amidating monooxygenase: cDNA, cloning and functional expression of a truncated form in COS cells JOURNAL Biochem. Biophys. Res. Commun. 169, 551-558 (1990) MEDLINE 90290494 FEATURES Location/Qualifiers source 1..3748 /organism="Homo sapiens" /db_xref="taxon:9606" gene 189..3113 /gene="peptidylglycine alpha-amidating monooxygenase" sig_peptide 189..251 /gene="peptidylglycine alpha-amidating monooxygenase" CDS 189..3113 /gene="peptidylglycine alpha-amidating monooxygenase" /EC_number="1.14.17.3" /codon_start=1 /product="peptidylglycine alpha-amidating monooxygenase" /db_xref="PID:g189595" /translation="MAGRVPSLLVLLVFPSSCLAFRSPLSVFKRFKETTRPFSNECLG TTRPVVPIDSSDFALDIRMPGVTPKQSDTYFCMSMRIPVDEEAFVIDFKPRASMDTVH HMLLFGCNMPSSTGSYWFCDEGTCTDKANILYAWARNAPPTRLPKGVGFRVGGETGSK YFVLQVHYGDISAFRDNNKDCSGVSLHLTRLPQPLIAGMYLMMSVDTVIPAGEKVVNS DISCHYKNYPMHVFAYRVHTHHLGKVVSGYRVRNGQWTLIGRQSPQLPQAFYPVGHPV DVSFGDLLAARCVFTGEGRTEATHIGGTSSDEMCNLYIMYYMEAKHAVSFMTCTQNVA PDMFRTIPPEANIPIPVKSDMVMMHEHHKETEYKDKIPLLQQPKREEEEVLDQGDFYS LLSKLLGEREDVVHVHKYNPTEKAESESDLVAEIANVVQKKDLGRSDAREGAEHERGN AILVRDRIHKFHRLVSTLRPPESRVFSLQQPPPGEGTWEPEHTGDFHMEEALDWPGVY LLPGQVSGVALDPKNNLVIFHRGDHVWDGNSFDSKFVYQQIGLGPIEEDTILVIDPNN AAVLQSSEKNLFYLPHGLSIDKDGNYWVTDVALHQVFKLDPNNKEGPVLILGRSMQPG SDQNHFCQPTDVAVDPGTGAIYVSDGYCNSRIVQFSPSGKFITQWGEESSGSSPLPGQ FTVPHSLALVPLLGQLCVADRENGRIQCFKTDTKEFVREIKHSSFGRNVFAISYIPGL LFAVNGKPHFGDQEPVQGFVMNFSNGEIIDIFKPVRKHFDMPHDIVASEDGTVYIGDA HTNTVWKFTLTEKLEHRSVKKAGIEVQEIKEAEAVVETKMENKPTSSELQKMQEKQKL IKEPGSGVPVVLITTLLVIPVVVLLAIAIFIRWKKSRAFGADSEHKLETSSGRVLGRF RGKGSGGLNLGNFFASRKGYSRKGFDRLSTEGSDQEKEDDGSESEEEYSAPLPALAPS SS" mat_peptide 297..3110 /gene="peptidylglycine alpha-amidating monooxygenase" /EC_number="1.14.17.3" /product="peptidylglycine alpha-amidating monooxygenase" variation 1108 /gene="peptidylglycine alpha-amidating monooxygenase" /note="a could be c" variation 1109 /gene="peptidylglycine alpha-amidating monooxygenase" /note="t could be g" variation 1857 /gene="peptidylglycine alpha-amidating monooxygenase" /note="g could be c" variation 2564 /gene="peptidylglycine alpha-amidating monooxygenase" /note="c could be t" BASE COUNT 1038 a 797 c 881 g 1032 t ORIGIN 1 cggaccgaga cgcctcgccg cggccagctc gctgctctcg ctggcggatg gtgtgtggcc 61 gccgcaggac gcccgccgtg cccgggccat gaagtagcgg ctgctggcgg cgccgctgcc 121 caaccgccag ccccagcccc gcgctgcgct gcccggtcct ctcccggcgg ggtcgtatcg 181 gcgtggacat ggctggccgc gtccctagcc tgctagttct ccttgttttt ccaagcagct 241 gtttggcttt ccgaagccca ctttctgtct ttaagaggtt taaagaaact accagaccat 301 tttccaatga atgtcttggt accaccagac ccgtagttcc tattgattca tcagattttg 361 cattggatat tcgcatgcct ggggttacac ctaaacagtc cgatacatac ttctgcatgt 421 ctatgcgaat accagtggat gaggaagcct tcgtgattga cttcaagcct cgagccagca 481 tggatactgt ccatcacatg ttactttttg gatgcaatat gccttcatcc actggaagtt 541 actggttttg tgatgaagga acctgtacag ataaagccaa tattctgtat gcctgggcga 601 gaaatgctcc ccctacccgg ctccccaaag gtgttggatt cagagttgga ggagagactg 661 gaagtaaata ctttgtacta caggtacact atggggatat tagtgctttt agagataata 721 acaaggactg ttctggtgtg tccttacacc tcacacgtct gccacagcct ttaattgctg 781 gcatgtacct tatgatgtct gttgacactg ttatcccagc aggagaaaaa gtggtgaatt 841 ctgacatttc atgccattat aaaaattatc caatgcatgt ctttgcctat agagttcaca 901 ctcaccattt aggtaaggta gtaagtggat acagagtaag aaatggacag tggacactga 961 ttggacggca gagccctcag ctgccacagg ctttctaccc tgtggggcat ccagttgatg 1021 taagttttgg tgacctactg gctgcaagat gtgtattcac tggtgaagga aggacagaag 1081 ccacacacat tggtggcacg tctagtgatg aaatgtgcaa cttatacatt atgtattaca 1141 tggaagccaa gcatgcagtt tctttcatga cctgtaccca gaatgtagct ccagatatgt 1201 tcagaaccat accaccagag gccaacattc caattcccgt gaagtctgat atggttatga 1261 tgcatgaaca tcataaagaa acagaatata aagataagat tcctttacta cagcagccaa 1321 aacgagaaga agaagaagtg ttagaccagg gtgatttcta ttcactactt tccaagctgc 1381 taggagaaag ggaagatgtt gttcatgtgc acaaatataa tcctacagaa aaggcagaat 1441 cagagtcaga cctggtagct gagattgcaa atgtagtcca aaaaaaggat cttggtcgat 1501 ctgatgccag agagggtgca gaacatgaga ggggtaatgc tattcttgtc agagacagaa 1561 ttcacaaatt ccacagacta gtatctacct tgaggccacc agagagcaga gttttctcat 1621 tacagcagcc cccacctggt gaaggcacct gggaaccaga acacacagga gatttccaca 1681 tggaagaggc actggattgg cctggagtat acttgttacc aggccaggtt tctggggtgg 1741 ctctagaccc taagaataac ctggtgattt tccacagagg tgaccatgtc tgggatggaa 1801 actcgtttga cagcaagttt gtttaccagc aaataggact cggaccaatt gaagaagaca 1861 ctattcttgt catagatcca aataatgctg cagtactcca gtccagtgaa aaaaatctgt 1921 tttacttgcc acatggcttg agtatagata aagatgggaa ttattgggtc acagacgtgg 1981 ctctccatca ggtgttcaaa ctggatccaa acaataaaga aggccctgta ttaatcctgg 2041 gaaggagcat gcaaccaggc agtgaccaga atcacttctg tcaacccact gatgtggctg 2101 tggatccagg cactggagcc atttatgtat cagatggtta ctgcaacagc aggattgtgc 2161 agttttcacc aagtggaaag ttcatcacac agtggggaga agagtcttca gggagcagtc 2221 ctctgccagg ccagttcact gttcctcaca gcttggctct tgtgcctctt ttgggccaat 2281 tatgtgtggc agaccgggaa aatggtcgga tccagtgttt taaaactgac accaaagaat 2341 ttgtgagaga gattaagcat tcatcatttg gaagaaatgt atttgcaatt tcatatatac 2401 caggcttgct ctttgcagtg aatgggaagc ctcattttgg ggaccaagaa cctgtacaag 2461 gatttgtgat gaacttttcc aatggggaaa ttatagacat cttcaagcca gtgcgcaagc 2521 actttgatat gcctcatgat attgttgcat ctgaagatgg gaccgtgtac attggagatg 2581 ctcataccaa caccgtgtgg aagttcacct tgactgagaa attggaacat cgatcagtta 2641 aaaaggctgg cattgaggtc caggaaatca aagaagccga ggcagttgtt gaaaccaaaa 2701 tggagaacaa acccacctcc tcagaattgc agaagatgca agagaaacag aagctgatca 2761 aagagccagg ctcgggagtg cctgttgttc tcattacaac ccttctggtt attccggtgg 2821 ttgtcctgct ggccattgcc atatttattc ggtggaaaaa atcaagggcc tttggagcag 2881 attctgaaca caaactcgag acgagttcag gaagagtact gggaagattt agaggaaagg 2941 gaagtggagg cttaaacctt ggtaatttct ttgcaagccg taagggctac agtcgaaaag 3001 ggtttgaccg gcttagcact gagggcagtg accaagagaa agaggatgat ggaagtgaat 3061 cagaagagga gtattcagca cctctgcctg cgctcgcacc ttcctcctcc tgaaaaccaa 3121 gctttgattt agattgagta agatttaccc agaatgtcag attcctttcc ctttagcacg 3181 tttaaagttc tgtgtattta attgtaaact gtactagtct gtgtgggact gtacacactt 3241 tatttacttc gttttggtta agttggcttc tgtttctagt tgaggagttt cctaaaagtt 3301 cataacagtg ccattgtctt tatatgaaca tagactagag aaaccgtcct ctttttccat 3361 cataattcta atctaacaat ggaagatttg cccatttaca cttttgagac tttttggtgg 3421 atgtaaataa ccccattctt tgcttgaaca cagtatcttc ccaatagcac tttcattgcc 3481 agtgtctttc tttggtgcct ttcctgttca gcattcttag cctgtggcaa taaagagaaa 3541 ctttgtgcta catgacgaca aagctgctaa atctcctatt tttttaaaat cactaacatt 3601 atattgcaat gaaggaaata aaaaagtctc tatttaaatt cttttttaaa ttttcttcag 3661 ttggtgtgtt tttgggatgt cttattttta gatggttaca ctgttagaac actattttca 3721 gaatctgaat gtaatttgtg taataacg // LOCUS HUMPASP 1332 bp mRNA PRI 19-MAY-1995 DEFINITION Human procarboxypeptidase B mRNA, complete cds. ACCESSION M81057 NID g809194 KEYWORDS pancreatic specific protein; procarboxypeptidase B. SOURCE Homo sapiens (tissue library: lambda gt11 HL 1069b) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1332) AUTHORS Yamamoto,K.K., Pousette,A., Chow,P., Wilson,H., el Shami,S. and French,C.K. TITLE Isolation of a cDNA encoding a human serum marker for acute pancreatitis. Identification of pancreas-specific protein as pancreatic procarboxypeptidase B JOURNAL J. Biol. Chem. 267 (4), 2575-2581 (1992) MEDLINE 92129345 FEATURES Location/Qualifiers source 1..1332 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_lib="lambda gt11 HL 1069b" gene 24..1274 /gene="procarboxypeptidase B" CDS 24..1274 /gene="procarboxypeptidase B" /codon_start=1 /product="procarboxypeptidase B" /db_xref="PID:g189625" /translation="MLALLVLVTVALASAHHGGEHFEGEKVFRVNVEDENHINIIREL ASTTQIDFWKPDSVTQIKPHSTVDFRVKAEDTVTVENVLKQNELQYKVLISNLRNVVE AQFDSRVRATGHSYEKYNKWETIEAWTQQVATENPALISRSVIGTTFEGRAIYLLKVG KAGQNKPAIFMDCGFHAREWISPAFCQWFVREAVRTYGREIQVTELLDKLDFYVLPVL NIDGYIYTWTKSRFWRKTRSTHTGSSIGTDPNRNFDAGWCEIGASRNPCDETYCGPAA ESEKETKALADFIRNKLSSIKAYLTIHSYSQMMIYPYSYAYKLGENNAELNALAKATV KELASLHGTKYTYGPGATTIYPAAGGSDDWAYDQGIRYSFTFELRDTGRYGFLLPESQ IRATCEETFLAIKYVASYVLEHLY" sig_peptide 24..68 /gene="procarboxypeptidase B" /evidence=experimental misc_feature 69..533 /gene="procarboxypeptidase B" /note="propeptide" /evidence=experimental mat_peptide 534..1271 /gene="procarboxypeptidase B" /note="putative" /product="procarboxypeptidase B" BASE COUNT 357 a 321 c 311 g 343 t ORIGIN 1 cggactagac ctggtcagac acaatgttgg cactcttggt tctggtgact gtggccctgg 61 catctgctca tcatggtggt gagcactttg aaggcgagaa ggtgttccgt gttaacgttg 121 aagatgaaaa tcacattaac ataatccgcg agttggccag cacgacccag attgacttct 181 ggaagccaga ttctgtcaca caaatcaaac ctcacagtac agttgacttc cgtgttaaag 241 cagaagatac tgtcactgtg gagaatgttc taaagcagaa tgaactacaa tacaaggtac 301 tgataagcaa cctgagaaat gtggtggagg ctcagtttga tagccgggtt cgtgcaacag 361 gacacagtta tgagaagtac aacaagtggg aaacgataga ggcttggact caacaagtcg 421 ccactgagaa tccagccctc atctctcgca gtgttatcgg aaccacattt gagggacgcg 481 ctatttacct cctgaaggtt ggcaaagctg gacaaaataa gcctgccatt ttcatggact 541 gtggtttcca tgccagagag tggatttctc ctgcattctg ccagtggttt gtaagagagg 601 ctgttcgtac ctatggacgt gagatccaag tgacagagct tctcgacaag ttagactttt 661 atgtcctgcc tgtgctcaat attgatggct acatctacac ctggaccaag agccgatttt 721 ggagaaagac tcgctccacc catactggat ctagcattgg cacagacccc aacagaaatt 781 ttgatgctgg ttggtgtgaa attggagcct ctcgaaaccc ctgtgatgaa acttactgtg 841 gacctgccgc agagtctgaa aaggagacca aggccctggc tgatttcatc cgcaacaaac 901 tctcttccat caaggcatat ctgacaatcc actcgtactc ccaaatgatg atctaccctt 961 actcatatgc ttacaaactc ggtgagaaca atgctgagtt gaatgccctg gctaaagcta 1021 ctgtgaaaga acttgcctca ctgcacggca ccaagtacac atatggcccg ggagctacaa 1081 caatctatcc tgctgctggg ggctctgacg actgggctta tgaccaagga atcagatatt 1141 ccttcacctt tgaacttcga gatacaggca gatatggctt tctccttcca gaatcccaga 1201 tccgggctac ctgcgaggag accttcctgg caatcaagta tgttgccagc tacgtcctgg 1261 aacacctgta ctagttgaga aagctgatgg ccttgtttca aaattctcat ttttcatttc 1321 ttttctttct tg // LOCUS HUMPAX2A 3421 bp mRNA PRI 07-JAN-1995 DEFINITION Human paired-box protein (PAX2) mRNA, complete cds. ACCESSION M89470 NID g409138 KEYWORDS paired-box protein. SOURCE Homo sapiens (tissue library: lanbda-gt10 of Graham Bell and Clontech) kidney cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3421) AUTHORS Eccles,M.R., Wallis,L.J., Fidler,A.E., Spurr,N.K., Goodfellow,P.J. and Reeve,A.E. TITLE Expression of the PAX2 gene in human fetal kidney and Wilms' tumor JOURNAL Cell Growth Differ. 3 (5), 279-289 (1992) MEDLINE 92338102 FEATURES Location/Qualifiers source 1..3421 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" /tissue_lib="lanbda-gt10 of Graham Bell and Clontech" /map="10q22.1-q24.3" gene 544..1725 /gene="PAX2" CDS 544..1725 /gene="PAX2" /note="octapeptide sequence bp 1096..1120; paired box domain bp 589..979" /codon_start=1 /product="paired-box protein" /db_xref="PID:g409139" /translation="MDMHCKADPFSAMHPGHGGVNQLGGVFVNGRPLPDVVRQRIVEL AHQGVRPCDISRQLRVSHGCVSKILGRYYETGSIKPGVIGGSKPKVATPKVVDKIAEY KRQNPTMFAWEIRDRLLAEGICDNDTVPSVSSINRIIRTKVQQPFHPTPDGAGTGVTA PGHTIVPSTASPPVSSASNDPVGSYSINGILGIPRSNGEKRKRDEDVSEGSVPNGDSQ SGVDSLRKHLRADTFTQQQLEALDRVFERPSYPDVFQASEHIKSEQGNEYSLPALTPG LDEVKSSLSASTNPELGSNVSGTQTYPVVTGRDMASTTLPGYPPHVPPTGQGSYPTST LAGMVPGSEFSGNPYSHPQYTAYNEAWRFSNPALLSSPYYYSAAPRSAPAARAAAYDR H" BASE COUNT 593 a 1264 c 929 g 635 t ORIGIN 1 cgggggcctg gccgcgcgct cccctcccgc aggcgccacc tcggacatcc ccgggattgc 61 tacttctctg ccaacttcgc caactcgcca gcacttggag aggcccggct cccctcccgg 121 cgccctctga ccgcccccgc cccgcggcgc tctccgacca ccgcctctcg gatgaccagg 181 ttccagggga gctgagcgag tcgcctcccc cgcccagctt cagccctggc tgcagctgca 241 gcgcgagcca tgcgccccca gtgcaccccg gcccaccgcc ccggggccat tctgctgacc 301 gcccagcccc gagccccgac agtggcaagt tgcggctact gcagttgcaa gctccggcca 361 acccggagga gccccacggg gaaggcagtc gtgcgccccc cgcccccggg cgccccgcag 421 cagccgggcg ttcactcatc ctccctcccc caccgtccct cccttttctc ctcaagtcct 481 gaagttgagt ttgagaggcg acacggcggc ggcgccgcgc tgctcccgct cctctgcctc 541 cccatggata tgcactgcaa agcagacccc ttctccgcga tgcacccagg gcacgggggt 601 gtgaaccagc tcgggggggt gtttgtgaac ggccggcccc tacccgacgt ggtgaggcag 661 cgcatcgtgg agctggccca ccagggtgtg cggccctgtg acatctcccg gcagctgcgg 721 gtcagccacg gctgtgtcag caaaatcctg ggcaggtact acgagaccgg cagcatcaag 781 ccgggtgtga tcggtggctc caagcccaaa gtggcgacgc ccaaagtggt ggacaagatt 841 gctgaataca aacgacagaa cccgactatg ttcgcctggg agattcgaga ccggctcctg 901 gccgagggca tctgtgacaa tgacacagtg cccagcgtct cttccatcaa cagaatcatc 961 cggaccaaag ttcagcagcc tttccaccca acgccggatg gggctgggac aggagtgacc 1021 gcccctggcc acaccattgt tcccagcacg gcctcccctc ctgtttccag cgcctccaat 1081 gacccagtgg gatcctactc catcaatggg atcctgggga ttcctcgctc caatggtgag 1141 aagaggaaac gtgatgaaga tgtgtctgag ggctcagtcc ccaatggaga ttcccagagt 1201 ggtgtggaca gtttgcggaa gcacttgcga gctgacacct tcacccagca gcagctggaa 1261 gctttggatc gggtctttga gcgtccttcc taccctgacg tcttccaggc atcagagcac 1321 atcaaatcag aacaggggaa cgagtactcc ctcccagccc tgacccctgg gcttgatgaa 1381 gtcaagtcga gtctatctgc atccaccaac cctgagctgg gcagcaacgt gtcaggcaca 1441 cagacatacc ccgttgtgac tggtcgtgac atggcgagca ccactctgcc tggttacccc 1501 cctcacgtgc cccccactgg ccagggaagc taccccacct ccaccctggc aggaatggtg 1561 cctgggagcg agttctccgg caacccgtac agccaccccc agtacacggc ctacaacgag 1621 gcttggagat tcagcaaccc cgccttacta agttcccctt attattatag tgccgccccc 1681 cggtccgccc ctgccgctcg tgccgctgcc tatgaccgcc actagttacc gcggggacca 1741 catcaagctt caggccgaca gcttcggcct ccacatcgtc cccgtctgac cccaccccgg 1801 aggagggagg accgacgcga cgcatgcctc ccggccaccg ccccagcctc accccatccc 1861 acgacccccg caacccttca catcaccccc ctcgaaggtc ggacaggacg ggtggagccg 1921 cggggcggga ccctcaggcc cgggcccacc gcccccagcc ccgcctgccg cccctccccg 1981 cctgcctgga ctgcgcggcg ccgtgagggg gattcggccc agctcgtccc ggcctccacc 2041 aagccagccc cgaagcccgc cagccaccct gccgtactcg ggcgcgacct gctggtgcgc 2101 gccggatgtt tctgtgacac acaatcagcg cggaccgcag cgcggcccag ccccgggcac 2161 ccgcctcgga cgctcgggcg ccaggagctt cgctggaggg gctgggccaa ggagattaag 2221 aagaaaacga ctttctgcag gaggaagagc ccgctgccga atccctggga aaaattcttt 2281 tcccccagtg ccagccggac tgccctcgcc ttccgggtgt gccctgtccc agaagatgga 2341 atgggggtgt gggggtccgg ctctaggaac gggctttggg ggcgtcaggt ctttccaagg 2401 ttgggaccca aggatcgggg ggcccagcag cccgcaccga tcgagccgga ctctcggctc 2461 ttcactgctc ctcctggcct gcctagttcc ccagggcccg gcacctcctg ctgcgagacc 2521 cggctctcag ccctgccttg cccctacctc agcgtctctt ccacctgctg gcctcccagt 2581 ttcccctcct gccagtcctt cgcctgtccc ttgacgccct gcatcctcct ccctgactcg 2641 cagccccatc ggacgctctc ccgggaccgc cgcaggacca gtttccatag actgcggact 2701 ggggtcttcc tccagcagtt acttgatgcc ccctcccccg acacagactc tcaatctgcc 2761 ggtggtaaga accggttctg agctggcgtc tgagctgctg cggggtggaa gtggggggct 2821 gcccactcca ctcctcccat cccctcccag cctcctcctc cggcaggaac tgaacagaac 2881 cacaaaaagt ctacatttat ttaatatgat ggtctttgca aaaaggaaca aaacaacaca 2941 aaagcccacc aggctgctgc tttgtggaaa gacggtgtgt gtcgtgtgaa ggcgaaaccc 3001 ggtgtacata acccctcccc ctccgccccg ccccgcccgg ccccgtagag tccctgtcgc 3061 ccgccggccc tgcctgtaga tacgccccgc tgtctgtgct gtgagagtcg ccgctcgctg 3121 ggggggaagg gggggacaca gctacacgcc cattaaagca cagcacgtcc tgggggaggg 3181 gggcattttt tatgttacaa aaaaaaatta cgaagaaaga atctcatttg caaaatagcg 3241 aacatggtct gtgactcctc tggcctgttt gttggctctt tctctgtaat tccgtgtttt 3301 cgctttttcc tccctgcccc tctctccctc tgcccctctc tcctctccgc ttctctcccc 3361 ctctgtctct gtctctctcc gtctctgtcg ctcttgtctg tctgtctctg ctctttctcg 3421 c // LOCUS HUMPBX1AB 1819 bp mRNA PRI 07-JAN-1995 DEFINITION H.sapiens PBX1a and PBX1b mRNA, complete cds. ACCESSION M86546 NID g189647 KEYWORDS PBX1a; PBX1b. SOURCE Homo sapiens lymphoid cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1819) AUTHORS Monica,K.A. JOURNAL Unpublished (1992) FEATURES Location/Qualifiers source 1..1819 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /cell_type="B-cell" /tissue_type="lymphoid" /map="1q23" gene 124..1416 /gene="PBX1" CDS 124..1416 /gene="PBX1" /codon_start=1 /db_xref="GDB:G00-125-351" /product="PBX1a" /db_xref="PID:g189648" /translation="MDEQPRLMHSHAGVGMAGHPGLSQHLQDGAGGTEGEGGRKQDIG DILQQIMTITDQSLDEAQARKHALNCHRMKPALFNVLCEIKEKTVLSIRGAQEEEPTD PQLMRLDNMLLAEGVAGPEKGGGSAAAAAAAAASGGAGSDNSVEHSDYRAKLSQIRQI YHTELEKYEQACNEFTTHVMNLLREQSRTRPISPKEIERMVSIIHRKFSSIQMQLKQS TCEAVMILRSRFLDARRKRRNFNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGIT VSQVSNWFGNKRIRYKKNIGKFQEEANIYAAKTAVTATNVSAHGSQANSPSTPNSAGS SSSFNMSNSGDLFMSVQSLNGDSYQGAQVGANVQSQVDTLRHVISQTGGYSDGLAASQ MYSPQGISANGGWQDATTPSSVTSPTEGPGSVHSDTSN" misc_feature 820..990 /gene="PBX1" /note="homeodomain; G00-125-351" BASE COUNT 482 a 468 c 484 g 385 t ORIGIN 1 cttccctgtt tatcctgaaa aggatttgaa gacaagcttg aaggataaaa agccttggtg 61 cttcccagga gccgagccga ggagcagaag aggaagagcc gggggctgcc gtagcctttg 121 gagatggacg agcagcccag gctgatgcat tcccatgctg gggtcgggat ggccggacac 181 cccggcctgt cccagcactt gcaggatggg gccggaggga ccgaggggga gggcgggagg 241 aagcaggaca ttggagacat tttacagcaa attatgacca tcacagacca gagtttggat 301 gaggcgcagg ccagaaaaca tgctttaaac tgccacagaa tgaagcctgc cttgtttaat 361 gtgttgtgtg aaatcaaaga aaaaacagtt ttgagtatcc gaggagccca ggaggaggaa 421 cccacagacc cccagctgat gcggctggac aacatgctgt tagcggaagg cgtggcgggg 481 cctgagaagg gcggagggtc ggcggcagcg gcggcagcgg cggcggcttc tggaggggca 541 ggttcagaca actcagtgga gcattcagat tacagagcca aactctcaca gatcagacaa 601 atctaccata cggagctgga gaaatacgag caggcctgca acgagttcac cacccacgtg 661 atgaatctcc tgcgagagca aagccggacc aggcccatct ccccaaagga gattgagcgg 721 atggtcagca tcatccaccg caagttcagc tccatccaga tgcagctcaa gcagagcacg 781 tgcgaggcgg tgatgatcct gcgttcccga tttctggatg cgcggcggaa gagacggaat 841 ttcaacaagc aagcgacaga aatcctgaat gaatatttct attcccatct cagcaaccct 901 taccccagtg aggaagccaa agaggagtta gccaagaagt gtggcatcac agtctcccag 961 gtatcaaact ggtttggaaa taagcgaatc cggtacaaga agaacatagg taaatttcaa 1021 gaggaagcca atatttatgc tgccaaaaca gctgtcactg ctaccaatgt gtcagcccat 1081 ggaagccaag ctaactcgcc ctcaactccc aactcggctg gttcttccag ttcttttaac 1141 atgtcaaact ctggagattt gttcatgagc gtgcagtcac tcaatgggga ttcttaccaa 1201 ggggcccagg ttggagccaa cgtgcaatca caggtggata cccttcgcca tgttatcagc 1261 cagacaggag gatacagtga tggactcgca gccagtcaga tgtacagtcc gcagggcatc 1321 agtgctaatg gaggttggca ggatgctact accccttcat cagtgacctc ccctacagaa 1381 ggccctggca gtgttcactc tgatacctcc aactgatctc ccagcaatcg catcccggct 1441 gaccctctgc cccagttggg gcaggggcag gagggagggt ttctctccca agctgaagcg 1501 gtcagactgg aggtcgaagc aatcagcaaa cacaataaga gtctccttct cttctcttct 1561 ttgggatgct atttcagcca atctggacac ttctttatac tctcttccct tttttttctg 1621 ggtagaagcc acccttccct gcctccagct gtcagcctgg ttttcgtcat cttccctgcc 1681 cctgtgcctc tgtcctagac ttcccggggt ccccgccctc tctcatatca ctgaaggata 1741 ttttcaacaa ttagaggaat ttaaagagga aaaaaattac aaagaaaata ataaaagtgt 1801 ttgtacgttt tcaaaaaaa // LOCUS HUMPC1Q1 3486 bp mRNA PRI 07-MAR-1995 DEFINITION Human plasma cell membrane glycoprotein (PC-1) mRNA, complete cds. ACCESSION M57736 J05654 NID g189649 KEYWORDS plasma cell membrane glycoprotein PC-1. SOURCE Human placenta, cDNA to mRNA, clones lambda-hPC1-2 and lambda-hPC1-3; Human fetal liver, cDNA to mRNA, clones lambda-hPC1-1 and lambda-hPC1-4. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3486) AUTHORS Buckley,M.F., Loveland,K.A., McKinstry,W.J., Garson,O.M. and Goding,J.W. TITLE Plasma cell membrane glycoprotein PC-1. cDNA cloning of the human molecule, amino acid sequence, and chromosomal location JOURNAL J. Biol. Chem. 265 (29), 17506-17511 (1990) MEDLINE 91009202 FEATURES Location/Qualifiers source 1..3486 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="lambda-hPC1-4" /dev_stage="fetus" /tissue_type="liver, placenta" /tissue_lib="lambda-gt10" /map="6q22-q23" mRNA <1..3238 /gene="PC1" mRNA <1..3484 /gene="PC1" gene 1..3484 /gene="PC1" CDS 164..2785 /gene="PC1" /codon_start=1 /product="plasma cell membrane glycoprotein PC-1" /db_xref="PID:g189650" /translation="MDVGEEPLEKAARARTAKDPNTYKVLSLVLSVCVLTTILGCIFG LKPSCAKEVKSCKGRCFERTFGNCRCDAACVELGNCCLDYQETCIEPEHIWTCNKFRC GEKRLTRSLCACSDDCKDKGDCCINYSSVCQGEKSWVEEPCESINEPQCPAGFETPPT LLFSLDGFRAEYLHTWGGLLPVISKLKKCGTYTKNMRPVYPTKTFPNHYSIVTGLYPE SHGIIDNKMYDPKMNASFSLKSKEKFNPEWYKGEPIWVTAKYQGLKSGTFFWPGSDVE INGIFPDIYKMYNGSVPFEERILAVLQWLQLPKDERPHFYTLYLEEPDSSGHSYGPVS SEVIKALQRVDGMVGMLMDGLKELNLHRCLNLILISDHGMEQGSCKKYIYLNKYLGDV KNIKVIYGPAARLRPSDVPDKYYSFNYEGIARNLSCREPNQHFKPYLKHFLPKRLHFA KSDRIEPLTFYLDPQWQLALNPSERKYCGSGFHGSDNVFSNMQALFVGYGPGFKHGIE ADTFENIEVYNLMCDLLNLTPAPNNGTHGSLNHLLKNPVYTPKHPKEVHPLVQCPFTR NPRDNLGCSCNPSILPIEDFQTQFNLTVAEEKIIKHETLPYGRPRVLQKENTICLLSQ HQFMSGYSQDILMPLWTSYTVDRNDSFSTEDFSNCLYQDFRIPLSPVHKCSFYKNNTK VSYGFLSPPQLNKNSSGIYSEALLTTNIVPMYQSFQVIWRYFHDTLLRKYAEERNGVN VVSGPVFDFDYDGRCDSLENLRQKRRVIRNQEILIPTHFFIVLTSCKDTSQTPLHCEN LDTLAFILPHRTDNSESCVHGKHDSSWVEELLMLHRARITDVEHITGLSFYQQRKEPV SDILKLKTHLPTFSQED" polyA_signal 3130..3135 /gene="PC1" BASE COUNT 1022 a 720 c 756 g 988 t ORIGIN 1 ggccacgatg gagcgcgacg gctgcgcggg gggcgggagc cgcggcggcg agggcgggcg 61 cgctccccgg gagggcccgg cggggaacgg ccgcgatcgg ggccgcagcc acgctgccga 121 ggcgcccggg gacccgcagg cggccgcgtc cttgctggcc cctatggacg tgggggagga 181 gccgctggag aaggcggcgc gcgcccgcac tgccaaggac cccaacacct ataaagtact 241 ctcgctggta ttgtcagtat gtgtgttaac aacaatactt ggttgtatat ttgggttgaa 301 accaagctgt gccaaagaag ttaaaagttg caaaggtcgc tgtttcgaga gaacatttgg 361 gaactgtcgc tgtgatgctg cctgtgttga gcttggaaac tgctgtttag attaccagga 421 gacgtgcata gaaccagaac atatatggac ttgcaacaaa ttcaggtgtg gtgagaaaag 481 gttgaccaga agcctctgtg cctgttcaga tgactgcaag gacaagggcg actgctgcat 541 caactacagt tctgtgtgtc aaggtgagaa aagttgggta gaagaaccat gtgagagcat 601 taatgagcca cagtgcccag cagggtttga aacgcctcct accctcttat tttctttgga 661 tggattcagg gcagaatatt tacacacttg gggtggactt cttcctgtta ttagcaaact 721 aaaaaaatgt ggaacatata ctaaaaacat gagaccggta tatccaacaa aaactttccc 781 caatcactac agcattgtca ccggattgta tccagaatct catggcataa tcgacaataa 841 aatgtatgat cccaaaatga atgcttcctt ttcacttaaa agtaaagaga aatttaatcc 901 tgagtggtac aaaggagaac caatttgggt cacagctaag tatcaaggcc tcaagtctgg 961 cacatttttc tggccaggat cagatgtgga aattaacgga attttcccag acatctataa 1021 aatgtataat ggttcagtac catttgaaga aaggatttta gctgttcttc agtggctaca 1081 gcttcctaaa gatgaaagac cacactttta cactctgtat ttagaagaac cagattcttc 1141 aggtcattca tatggaccag tcagcagtga agtcatcaaa gccttgcaga gggttgatgg 1201 tatggttggt atgctgatgg atggtctgaa agagctgaac ttgcacagat gcctgaacct 1261 catccttatt tcagatcatg gcatggaaca aggcagttgt aagaaataca tatatctgaa 1321 taaatatttg ggggatgtta aaaatattaa agttatctat ggacctgcag ctcgattgag 1381 accctctgat gtcccagata aatactattc atttaactat gaaggcattg cccgaaatct 1441 ttcttgccgg gaaccaaacc agcacttcaa accttacctg aaacatttct tacctaagcg 1501 tttgcacttt gctaagagtg atagaattga gcccttgaca ttctatttgg accctcagtg 1561 gcaacttgca ttgaatccct cagaaaggaa atattgtgga agtggatttc atggctctga 1621 caatgtattt tcaaatatgc aagccctctt tgttggctat ggacctggat tcaagcatgg 1681 cattgaggct gacacctttg aaaacattga agtctataac ttaatgtgtg atttactgaa 1741 tttgacaccg gctcctaata acggaactca tggaagtctt aaccaccttc taaagaatcc 1801 tgtttatacg ccaaagcatc ccaaagaagt gcaccccctg gtacagtgcc ccttcacaag 1861 aaaccccaga gataaccttg gctgctcatg taacccttcg attttgccga ttgaggattt 1921 tcaaacacag ttcaatctga ctgtggcaga agagaagatt attaagcatg aaactttacc 1981 ctatggaaga cctagagttc tccagaagga aaacaccatc tgtcttcttt cccagcacca 2041 gtttatgagt ggatacagcc aagacatctt aatgcccctt tggacatcct ataccgtgga 2101 cagaaatgac agtttctcta cggaagactt ctccaactgt ctgtaccagg actttagaat 2161 tcctcttagt cctgtccata aatgttcatt ttataaaaat aacaccaaag tgagttacgg 2221 gttcctctcc ccaccacaac taaataaaaa ttcaagtgga atatattctg aagctttgct 2281 tactacaaat atagtgccaa tgtaccagag ttttcaagtt atatggcgct actttcatga 2341 caccctactg cgaaagtatg ctgaagaaag aaatggtgtc aatgtcgtca gtggtcctgt 2401 gtttgacttt gattatgatg gacgttgtga ttccttagag aatctgaggc aaaaaagaag 2461 agtcatccgt aaccaagaaa ttttgattcc aactcacttc tttattgtgc taacaagctg 2521 taaagataca tctcagacgc ctttgcactg tgaaaaccta gacaccttag ctttcatttt 2581 gcctcacagg actgataaca gcgagagctg tgtgcatggg aagcatgact cctcatgggt 2641 tgaagaattg ttaatgttac acagagcacg gatcacagat gttgagcaca tcactggact 2701 cagcttctat caacaaagaa aagagccagt ttcagacatt ttaaagttga aaacacattt 2761 gccaaccttt agccaagaag actgatatgt tttttatccc caaacaccat gaatcttttt 2821 gagagaacct tatattttat atagtcctct agctacacta ttgcattgtt cagaaactgt 2881 cgaccagagt tagaacggag ccctcggtga tgcggacatc tcagggaaac ttgcgtactc 2941 agcacagcag tggagagtgt tcctgttgaa tcttgcacat atttgaatgt gtaagcattg 3001 tatacattga tcaagttcgg gggaataaag acagaccaca cctaaaactg cctttctgct 3061 tctcttaaag gagaagtagc tgtgaacatt gtctggatac cagatatttg aatctttctt 3121 actattggta ataaaccttg atggcattgg gcaaacagta gacttatagt agggttgggg 3181 tagcccatgt tatgtgacta tctttatgag aattttaaag tggttctgga tatcttttaa 3241 cttggagttt catttctttt cattgtaatc aaaaaaaaaa ttaacagaag ccaaaatact 3301 tctgagacct tgtttcaatc tttgctgtat atcccctcaa aatccaagtt attaatctta 3361 tgtgttttct ttttaatttt ttgattggat ttctttagat ttaatggttc aaatgagttc 3421 aactttgagg gacgatcttt gaatatactt acctattata aaatcttact ttgtatttgt 3481 atttaa // LOCUS HUMPC2A 2223 bp mRNA PRI 07-JAN-1995 DEFINITION Human Kex2-like endoprotease mRNA, complete cds. ACCESSION J05252 NID g189651 KEYWORDS endoprotease; protease. SOURCE Human insulinoma, cDNA to mRNA, clone PC2. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2223) AUTHORS Smeekens,S.P. and Steiner,D.F. TITLE Identification of a human insulinoma cDNA encoding a novel mammalian protein structurally related to the yeast dibasic processing protease Kex2 JOURNAL J. Biol. Chem. 265 (6), 2997-3000 (1990) MEDLINE 90153937 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.P.Smeekens, 14-FEB-1990. FEATURES Location/Qualifiers source 1..2223 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 88..162 /gene="endoprotease" CDS 88..2004 /note="Kex2-like endoprotease" /codon_start=1 /product="endoprotease" /db_xref="PID:g189652" /translation="MKGGCVSQWKAAAGFLFCVMVFASAERPVFTNHFLVELHKGGED KARQVAAEHGFGVRKLPFAEGLYHFYHNGLAKAKRRRSLHHKQQLERDPRVKMALQQE GFDRKKRGYRDINEIDINMNDPLFTKQWYLINTGQADGTPGLDLNVAEAWELGYTGKG VTIGIMDDGIDYLHPDLASNYNAEASYDFSSNDPYPYPRYTDDWFNSHGTRCAGEVSA AANNNICGVGVAYNSKVAGIRMLDQPFMTDIIEASSISHMPQLIDIYSASWGPTDNGK TVDGPRDVTLQAMADGVNKGRGGKGSIYVWASGDGGSYDDCNCDGYASSMWTISINSA INDGRTALYDESCSSTLASTFSNGRKRNPEAGVATTDLYGNCTLRHSGTSAAAPEAAG VFALALEANLGLTWRDMQHLTVLTSKRNQLHDEVHQWRRNGVGLEFNHLFGYGVLDAG AMVKMAKDWKTVPERFHCVGGSVQDPEKIPSTGKLVLTLTTDACEGKENFVRYLEHVQ AVITVNATRRGDLNINMTSPMGTKSILLSRRPRDDDSKVGFDKWPFMTTHTWGEDARG TWTLELGFVGSAPQKGVLKEWTLMLHGTQSAPYIDQVVRDYQSKLAMSKKEELEEELD EAVERSLKSILNKN" gene 88..162 /gene="endoprotease" mat_peptide 163..2001 /note="Kex2-like endoprotease" /product="endoprotease" BASE COUNT 538 a 623 c 602 g 460 t ORIGIN 1 ggaattcttt atttgcaccc tccctccgag tcccctgctc cgccagcctg cgcgcctcct 61 agcaccactt ttcactccca aagaaggatg aagggtggtt gtgtctccca gtggaaggcg 121 gccgccgggt tcctcttctg tgtcatggtt tttgcatctg ctgagcgacc ggtcttcacg 181 aatcattttc ttgtggagtt gcataaaggg ggagaggaca aagctcgcca agttgcagca 241 gaacacggct ttggagtccg aaagcttccc tttgctgaag gtctgtacca cttttatcac 301 aatggccttg caaaggccaa gagaagacgc agcctacacc acaagcagca gctggagaga 361 gaccccaggg taaagatggc tttgcagcag gaaggatttg accgaaaaaa gcgaggttac 421 agagacatca atgagatcga catcaacatg aacgatcctc tttttacaaa gcagtggtat 481 ctgatcaata ctgggcaagc tgatggcact cctggccttg atttgaatgt ggctgaagcc 541 tgggagctgg gatacacagg gaaaggtgtt accattggaa ttatggatga tgggattgac 601 tatctccacc cggacctggc ctccaactat aatgccgaag caagttacga cttcagcagc 661 aacgacccct atccttaccc tcggtacaca gatgactggt ttaacagcca cgggacccga 721 tgtgcaggag aagtttctgc tgccgccaac aacaatatct gtggagttgg agtagcatac 781 aactccaagg ttgcaggcat ccggatgctg gaccagccat tcatgacaga catcatcgag 841 gcctcctcca tcagtcatat gccacagctg attgacatct acagcgccag ctggggcccc 901 acagacaacg gcaagacagt ggatgggccc cgggacgtca cgctgcaggc catggccgat 961 ggcgtgaaca agggccgcgg cggcaaaggc agcatctacg tgtgggcctc cggggacggc 1021 ggcagctatg acgactgcaa ctgcgacggc tacgcctcca gcatgtggac catctccatc 1081 aactcagcca tcaacgacgg caggactgcc ctgtacgacg agagctgctc ttccaccttg 1141 gcttccacct tcagcaacgg gaggaaaagg aaccccgagg ccggtgtggc aaccacagat 1201 ttgtacggca actgcactct gaggcattct gggacatctg cagctgcccc cgaggcagct 1261 ggtgtgtttg cactggctct ggaggctaac ctgggtctga cctggcggga catgcagcat 1321 ctgactgtgc tcacctccaa acggaaccag cttcacgacg aggtccatca gtggcggcgc 1381 aatggggtcg gcctggaatt taatcacctc tttggctacg gggtccttga tgcaggtgcc 1441 atggtgaaaa tggctaaaga ctggaaaacc gtgcctgaga gattccactg tgtgggaggc 1501 tccgtgcagg accctgagaa aataccatcc actggcaagt tggtgctgac actcacaacc 1561 gacgcctgtg aggggaagga aaattttgtc cgctacctgg agcatgtcca ggctgtcatc 1621 acggtcaacg caaccagaag aggagacctg aacatcaaca tgacttcccc tatgggcacc 1681 aagtccattt tgctgagccg gcgtccaagg gatgacgact ccaaggtggg ctttgacaag 1741 tggcctttca tgaccactca cacgtggggg gaagacgccc gaggcacctg gaccctggag 1801 ctgggatttg tcggcagcgc cccgcagaag ggggtgctga aggagtggac cctgatgctg 1861 catggcactc agagtgcccc gtacatcgac caggtggtgc gggattacca gtccaagttg 1921 gccatgtcca agaaagagga gctggaggaa gagctggacg aagccgtgga gagaagcctg 1981 aaaagcatcc ttaacaagaa ctagcgctgc acatccgcct ttcccaccgc cctccctccc 2041 cagctccgcc tctgtcctcg ctccacgttt caggcaggca cctagcaatt ccatcacccg 2101 tacaggcaat tccgtcttct taatctgaag cttcactcac tgtcaatgat tattttcatt 2161 acaatggaaa caatcttttt tactctatgc cccaaatata gcgttcccaa caacccggaa 2221 ttc // LOCUS HUMPC42ABB 4069 bp mRNA PRI 22-JUL-1993 DEFINITION Human protocadherin 42 mRNA, complete cds for abbreviated PC42. ACCESSION L11370 NID g387674 KEYWORDS alternative splicing; cadherin; protocadherin. SOURCE Homo sapiens (library: Stratagene) brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4069) AUTHORS Sano,K., Tanihara,H., Heimark,R.L., Obata,S., Davidson,M.K., St John,T., Taketani,S. and Suzuki,S. TITLE Protocadherins: A large family of cadherin-related molecules in the central nervous system JOURNAL EMBO J. 12, 2249-2256 (1993) MEDLINE 93285094 FEATURES Location/Qualifiers source 1..4069 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /tissue_lib="Stratagene" CDS 494..3574 /standard_name="abbreviated PC42" /codon_start=1 /product="protocadherin 42" /db_xref="PID:g387675" /translation="MEPLRHSPGPGGQRLLLPSMLLALLLLLAPSPGHATRVVYKVPE EQPPNTLIGSLAADYGFPDVGHLYKLEVGAPYLRVDGKTGDIFTTETSIDREGLRECQ NQLPGDPCILEFEVSITDLVQNASPRLLEGQIEVQDINDNTPNFASPVITLAIPENTN IGSLFPIPLASDRDAGPNGVASYELQVAEDQEEKQPQLIVMGNLDRERWDSYDLTIKV QDGGSPPRATSALLRVTVLDTNDNAPKFERPSYEAELSENSPIGHSVIQVKANDSDQG ANAEIEYTFHQAPEVVRRLLRLDRNTGLITVQGPVDREDLSTLRFSVLAKDRGTNPKS ARAQVVVTVKDMNDNAPTIEIRGIGLVTHQDGMANISEDVAEETAVALVQVSDRDEGE NAAVTCVVAGDVPFQLRQASETGSDSKKKYFLQTTTPLDYEKVKDYTIEIVAVDSGNP PLSSTNSLKVQVVDVNDNAPVFTQSVTEVAFPENNKPGEVIAEITASDADSGSNAELV YSLEPEPAAKGLFTISPETGEIQVKTSLDREQRESYELKVVAADRGSPSLQGTATVLV NVLDCNDNDPKFMLSGYNFSVMENMPALSPVGMVTVIDGDKGENAQVQLSVEQDNGDF VIQNGTGTILSSLSFDREQQSTYTFQLKAVDGGVPPRSAYVGVTINVLDENDNAPYIT APSNTSHKLLTPQTRLGETVSQVAAEDFDSGVNAELIYSIAGGNPYGLFQIGSHSGAI TLEKEIERRHHGLHRLVVKVSDRGKPPRYGTALVHLYVNETLANRTLLETLLGHSLDT PLDIDIAGDPEYERSKQRGNILFGVVAGVVAVALLIALAVLVRYCRQREAKSGYQAGK KETKDLYAPKPSGKASKGNKSKGKKSKSPKPVKPVEDEDEAGLQKSLKFNLMSDAPGD SPRIHLPLNYPPGSPDLGRHYRSNSPLPSIQLQPQSPSASKKHQVVQDLPPANTFVGT GDTTSTGSEQYSDYSYRTNPPKYPSKQVGQPFQLSTPQPLPHPYHGAIWTEVWE" polyA_site 4069 BASE COUNT 902 a 1189 c 1113 g 865 t ORIGIN 1 ctctattcga cattctcttt ggattgtttt gctataactt gaaatttggg atgtcacaaa 61 cgaaactgtc atctgtttcc gccaaactgt ggttctgcta atctcccagg ctggcagcat 121 tggagacttg ctgacttctt tcatccccca ctcttttcac ctgaaattcc tttccttggt 181 tttgctctaa gtcctatgct tcagtcaggg gccaaccaaa tctcactgcc tcctttttat 241 catgaagcct ttgatcactg atagttcttt ttatatcttg aaaaatcacc cttcccagta 301 cagttaatat ttagtatctc tactcatctt ggcacttact cacagctcca taattcagtg 361 tttctcgtac ctcttcatgg tgatggggag ccctttggag gtggtgactg tgctttatac 421 tcctcatgat gcttcacatg tggcaggcgt ggagtgcccg gaggcggccc tcctgattct 481 ggggcctccc aggatggagc ccctgaggca cagcccaggc cctggggggc aacggctact 541 gctgccctcc atgctgctag cactgctgct cctgctggct ccatccccag gccacgccac 601 tcgggtagtg tacaaggtgc cggaggaaca gccacccaac accctcattg ggagcctcgc 661 agccgactat ggttttccag atgtggggca cctgtacaag ctagaggtgg gtgccccgta 721 ccttcgcgtg gatggcaaga caggtgacat tttcaccacc gagacctcca tcgaccgtga 781 ggggctccgt gaatgccaga accagctccc tggtgatccc tgcatcctgg agtttgaggt 841 atctatcaca gacctcgtgc agaatgcgag cccccggctg ctagagggcc agatagaagt 901 acaagacatc aatgacaaca cacccaactt cgcctcacca gtcatcactc tggccatccc 961 tgagaacacc aacatcggct cactcttccc catcccgctg gcttcagacc gtgatgctgg 1021 tcccaacggt gtggcatcct atgagctgca ggtggcagag gaccaggagg agaagcaacc 1081 acagctcatt gtgatgggca acctggaccg tgagcgctgg gactcctatg acctcaccat 1141 caaggtgcag gatggcggca gccccccacg cgccacgagt gccctgctgc gtgtcaccgt 1201 gcttgacacc aatgacaacg cccccaagtt tgagcggccc tcctatgagg ccgaactatc 1261 tgagaatagc cccataggcc actcggtcat ccaggtgaag gccaatgact cagaccaagg 1321 tgccaatgca gaaatcgaat acacattcca ccaggcgccc gaagttgtga ggcgtcttct 1381 tcgactggac aggaacactg gacttatcac tgttcagggc ccggtggacc gtgaggacct 1441 aagcaccctg cgcttctcag tgcttgctaa ggaccgaggc accaacccca agagtgcccg 1501 tgcccaggtg gttgtgaccg tgaaggacat gaatgacaat gcccccacca ttgagatccg 1561 gggcataggg ctagtgactc atcaagatgg gatggctaac atctcagagg atgtggcaga 1621 ggagacagct gtggccctgg tgcaggtgtc tgaccgagat gagggagaga atgcagctgt 1681 cacctgtgtg gtggcaggtg atgtgccctt ccagctgcgc caggccagtg agacaggcag 1741 tgacagcaag aagaagtatt tcctgcagac taccaccccg ctagactacg agaaggtcaa 1801 agactacacc attgagattg tggctgtgga ctctggcaac cccccactct ccagcactaa 1861 ctccctcaag gtgcaggtgg tggacgtcaa tgacaacgca cctgtcttca ctcagagtgt 1921 cactgaggtc gccttcccgg aaaacaacaa gcctggtgaa gtgattgctg agatcactgc 1981 cagtgatgct gactctggct ctaatgctga gctggtttac tctctggagc ctgagccggc 2041 tgctaagggc ctcttcacca tctcacccga gactggagag atccaggtga agacatctct 2101 ggatcgggaa cagcgggaga gctatgagtt gaaggtggtg gcagctgacc ggggcagtcc 2161 tagcctccag ggcacagcca ctgtccttgt caatgtgctg gactgcaatg acaatgaccc 2221 caaatttatg ctgagtggct acaacttctc agtgatggag aacatgccag cactgagtcc 2281 agtgggcatg gtgactgtca ttgatggaga caagggggag aatgcccagg tgcagctctc 2341 agtggagcag gacaacggtg actttgttat ccagaatggc acaggcacca tcctatccag 2401 cctgagcttt gatcgagagc aacaaagcac ctacaccttc cagctgaagg cagtggatgg 2461 tggcgtccca cctcgctcag cttacgttgg tgtcaccatc aatgtgctgg acgagaatga 2521 caacgcaccc tatatcactg ccccttctaa cacctctcac aagctgctga ccccccagac 2581 acgtcttggt gagacggtca gccaggtggc agccgaggac tttgactctg gtgtcaatgc 2641 cgagctgatc tacagcattg caggtggcaa cccttatgga ctcttccaga ttgggtcaca 2701 ttcaggtgcc atcaccctgg agaaggagat tgagcggcgc caccatgggc tacaccgcct 2761 ggtggtgaag gtcagtgacc gcggcaagcc cccacgctat ggcacagcct tggtccatct 2821 ttatgtcaat gagactctgg ccaaccgcac gctgctggag accctcctgg gccacagcct 2881 ggacacgccg ctggatattg acattgctgg ggatccagaa tatgagcgct ccaagcagcg 2941 tggcaacatt ctctttggtg tggtggctgg tgtggtggcc gtggccttgc tcatcgccct 3001 ggcggttctt gtgcgctact gcagacagcg ggaggccaaa agtggttacc aggctggtaa 3061 gaaggagacc aaggacctgt atgcccccaa gcccagtggc aaggcctcca agggaaacaa 3121 aagcaaaggc aagaagagca agtccccaaa gcccgtgaag ccagtggagg acgaggatga 3181 ggccgggctg cagaagtccc tcaagttcaa cctgatgagc gatgcccctg gggacagtcc 3241 ccgcatccac ctgcccctca actacccacc aggcagccct gacctgggcc gccactatcg 3301 ctctaactcc ccactgcctt ccatccagct gcagccccag tcaccctcag cctccaagaa 3361 gcaccaggtg gtacaggacc tgccacctgc aaacacattc gtgggcaccg gggacaccac 3421 gtccacgggc tctgagcagt actccgacta cagctaccgc accaaccccc ccaaataccc 3481 cagcaagcag gtaggccagc cctttcagct cagcacaccc cagcccctac cccaccccta 3541 ccacggagcc atctggaccg aggtgtggga gtgatggagc aggtttactg tgcctgcccg 3601 tgttgggggc cagcctgagc cagcagtggg aggtggggcc ttagtgcctc accgggcaca 3661 cggattaggc tgagtgaaga ttaagggagg gtgtgctctg tggtctcctc cctgccctct 3721 ccccactggg gagagacctg tgatttgcca agtccctgga ccctggacca gctactgggc 3781 cttatgggtt gggggtggta ggcaggtgag cgtaagtggg gagggaaatg ggtaagaagt 3841 ctactccaaa cctaggtctc tatgtcagac cagacctagg tgcttctcta ggagggaaac 3901 agggagacct ggggtcctgt ggataactga gtggggagtc tgccagggga gggcaccttc 3961 ccattgtgcc ttctgtgtgt attgtgcatt aacctcttcc tcaccactag gcttctgggg 4021 ctgggtccca catgcccttg accctgacaa taaagttctc tatttttgg // LOCUS HUMPC43ABB 4688 bp mRNA PRI 14-SEP-1995 DEFINITION Human protocadherin 43 mRNA, complete cds for abbreviated PC43. ACCESSION L11373 NID g307328 KEYWORDS alternative splicing; cadherin; protocadherin. SOURCE Homo sapiens (tissue library: Stratagene) brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4688) AUTHORS Sano,K., Tanihara,H., Heimark,R.L., Obata,S., Davidson,M., St John,T., Taketani,S. and Suzuki,S. TITLE Protocadherins: a large family of cadherin-related molecules in central nervous system JOURNAL EMBO J. 12 (6), 2249-2256 (1993) MEDLINE 93285094 FEATURES Location/Qualifiers source 1..4688 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain" /tissue_lib="Stratagene" CDS 115..2829 /standard_name="abbreviated PC43" /codon_start=1 /product="protocadherin 43" /db_xref="PID:g307329" /translation="MVPEAWRSGLVSTGRVVGVLLLLGALNKASTVIHYEIPEEREKG FAVGNVVANLGLDLGSLSARRFPVVSGASRRFFEVNRETGEMFVNDRLDREELCGTLP SCTVTLELVVENPLELFSVEVVIQDINDNNPAFPTQEMKLEISEAVAPGTRFPLESAH DPDLGSNSLQTYELSRNEYFALRVQTREDSTKYAELVLERALDREREPSLQLVLTALD GGTPALSASLPIHIKVLDANDNAPVFNQSLYRARVPGGCTSGTRVVQVLATDLDEGPN GEIIYSFGSHNRAGVRQLFALDLVTGMLTIKGRLDFEDTKLHEIYIQAKDKGANPEGA HCKVLVEVVDVNDNAPEITVTSVYSPVPEDASGTVIALLSVTDLDAGENGLVTCEVPP GLPFSLTSSLKNYFTLKTSADLDRETVPEYNLSITARDAGTPSLSALTIVRVQVSDIN DNPPQSSQSSYDVYIEENNLPGAPILNLSVWDPDAPQNARLSFFLLEQGAETGLVGRY FTINRDNGIVSSLVPLDYEDRREFELTAHISDGGTPVLATNISVNIFVTDRNDNAPQV LYPRPGGSSVEMLPRGTSAGHLVSRVVGWDADAGHNAWLSYSLFGSPNQSLFAIGLHT GQISTARPVQDTDSPRQTLTVLIKDNGEPSLSTTATLTVSVTEDSPEARAEFPSGSAP REQKKNLTFYLLLSLILVSVGFVVTVFGVIIFKVYKWKQSRDLYRAPVSSLYRTPGPS LHADAVRGGLMSPHLYHQVYLTTDSRRSDPLLKKPGAASPLASRQNTLRSCDPVFYRQ VLGAESAPPGQQAPPNTDWRFSQAQRPGTSGSQNGDDTGTWPNNQFDTEMLQAMILAS ASEAADGSSTLGGGAGTMGLSARYGPQFTLQHVPDYRQNVYIPGSNAH" polyA_site 4688 BASE COUNT 1033 a 1393 c 1245 g 1017 t ORIGIN 1 cgaaagccat gtcggactcg tcgcccagcg cccaagcgct aacccgctga aagtttctca 61 gcgaaatctc agggacgatc tggaccccgc tgagaggaac tgcttttgag tgagatggtc 121 ccagaggcct ggaggagcgg actggtaagc accgggaggg tagtgggagt tttgcttctg 181 cttggtgcct tgaacaaggc ttccacggtc attcactatg agatcccgga ggaaagagag 241 aagggtttcg ctgtgggcaa cgtggtcgcg aaccttggtt tggatctcgg tagcctctca 301 gcccgcaggt tcccggtggt gtctggagct agccgaagat tctttgaggt gaaccgggag 361 accggagaga tgtttgtgaa cgaccgtctg gatcgagagg agctgtgtgg gacactgccc 421 tcttgcactg taactctgga gttggtagtg gagaacccgc tggagctgtt cagcgtggaa 481 gtggtgatcc aggacatcaa cgacaacaat cctgctttcc ctacccagga aatgaaattg 541 gagattagcg aggccgtggc tccggggacg cgctttccgc tcgagagcgc gcacgatccc 601 gatctgggaa gcaactcttt acaaacctat gagctgagcc gaaatgaata ctttgcgctt 661 cgcgtgcaga cgcgggagga cagcaccaag tacgcggagc tggtgttgga gcgcgccctg 721 gaccgagaac gggagcctag tctccagtta gtgctgacgg cgttggacgg agggacccca 781 gctctctccg ccagcctgcc tattcacatc aaggtgctgg acgcgaatga caatgcgcct 841 gtcttcaacc agtccttgta ccgggcgcgc gttcctggag gatgcacctc cggcacgcgc 901 gtggtacaag tccttgcaac ggatctggat gaaggcccca acggtgaaat tatttactcc 961 ttcggcagcc acaaccgcgc cggcgtgcgg caactattcg ccttagacct tgtaaccggg 1021 atgctgacaa tcaagggtcg gctggacttc gaggacacca aactccatga gatttacatc 1081 caggccaaag acaagggcgc caatcccgaa ggagcacatt gcaaagtgtt ggtggaggtt 1141 gtggatgtga atgacaacgc cccggagatc acagtcacct ccgtgtacag cccagtaccc 1201 gaggatgcct ctgggactgt catcgctttg ctcagtgtga ctgacctgga tgctggcgag 1261 aacgggctgg tgacctgcga agttccaccg ggtctccctt tcagccttac ttcttccctc 1321 aagaattact tcactttgaa aaccagtgca gacctggatc gggagactgt gccagaatac 1381 aacctcagca tcaccgcccg agacgccgga accccttccc tctcagccct tacaatagtg 1441 cgtgttcaag tgtccgacat caatgacaac cctccacaat cttctcaatc ttcctacgac 1501 gtttacattg aagaaaacaa cctccccggg gctccaatac taaacctaag tgtctgggac 1561 cccgacgccc cgcagaatgc tcggctttct ttctttctct tggagcaagg agctgaaacc 1621 gggctagtgg gtcgctattt cacaataaat cgtgacaatg gcatagtgtc atccttagtg 1681 cccctagact atgaggatcg gcgggaattt gaattaacag ctcatatcag cgatgggggc 1741 accccggtcc tagccaccaa catcagcgtg aacatatttg tcactgatcg caatgacaat 1801 gccccccagg tcctatatcc tcggccaggt gggagctcgg tggagatgct gcctcgaggt 1861 acctcagctg gccacctagt gtcacgggtg gtaggctggg acgcggatgc agggcacaat 1921 gcctggctct cctacagtct ctttggatcc cctaaccaga gcctttttgc catagggctg 1981 cacactggtc aaatcagtac tgcccgtcca gtccaagaca cagattcacc caggcagact 2041 ctcactgtct tgatcaaaga caatggggag ccttcgctct ccaccactgc taccctcact 2101 gtgtcagtaa ccgaggactc tcctgaagcc cgagccgagt tcccctctgg ctctgccccc 2161 cgggagcaga aaaaaaatct caccttttat ctacttcttt ctctaatcct ggtttctgtg 2221 ggcttcgtgg tcacagtgtt cggagtaatc atattcaaag tttacaagtg gaagcagtct 2281 agagacctat accgagcccc ggtgagctca ctgtaccgaa caccagggcc ctccttgcac 2341 gcggacgccg tgcggggagg cctgatgtcg ccgcaccttt accatcaggt gtatctcacc 2401 acggactccc gccgcagcga cccgctgctg aagaaacctg gtgcagccag tccactggcc 2461 agccgccaga acacgctgcg gagctgtgat ccggtgttct ataggcaggt gttgggtgca 2521 gagagcgccc ctcccggaca gcaagccccg cccaacacgg actggcgttt ctctcaggcc 2581 cagagacccg gcaccagcgg ctcccaaaat ggcgatgaca ccggcacctg gcccaacaac 2641 cagtttgaca cagagatgct gcaagccatg atcttggcgt ccgccagtga agctgctgat 2701 gggagctcca ccctgggagg gggtgccggc accatgggat tgagcgcccg ctacggaccc 2761 cagttcaccc tgcagcacgt gcccgactac cgccagaatg tctacatccc aggcagcaat 2821 gcacactgac caacgcagct ggcaagcgga tggcaaggcc cagcaggtgg caatggcaac 2881 aagaagaagt cggcaagaag gagaagaagt aacatggagg ccaggccaag agccacaggg 2941 cagcctctcc ccgaaccagc ccagcttctc cttacctgca cccaggcctc agagtttcag 3001 ggctaacccc cagaatactg gtaggggcca aggcatctcc cttggaaaca gaaacaagtg 3061 ccatcacacc atcccttccc caggtgtaat atccaaagca gttccgctgg gaaccccatc 3121 caatcagtgg ctgtacccat ttgggtagtg gggttcatgt agacaccaag aaccatttgc 3181 cacaccccgt ttagttacag ctgaaccctc catcttccaa atcaatcagg cccatccatc 3241 ccatgcctcc ctcctcccca ccccactcca acagttcctc tttcccgagt aaggtggttg 3301 gggtgttgaa gtaccaagta acctacaagc ctcctagttc tgaaaagttg gaagggcatc 3361 atgacctctt ggcctctcct ttgattctca atcttccccc aaagcatggt ttggtgccag 3421 ccccttcacc tccttccaga gcccaagatc aatgctcaag ttttggagga catgatcacc 3481 atccccatgg tactgatgct tgctggattt agggagggca ttttgctacc aagcctcttc 3541 ccaacgccct gggaccagtc ttctgttttg tttttcattg tttgagcttt ccactgcatg 3601 ccttgacttc ccccacctcc tcctcaaaca agagactcca ctgcatgttc caagacagta 3661 tggggtggta agataaggaa gggaagtgtg tggatgtgga tggtgggggc atggacaaag 3721 cttgacacat caagttatca aggccttgga ggaggctctg tatgtcctca ggggactgac 3781 aacatcctcc agattccagc cataaaccaa taactaggct ggacccttcc cactacataa 3841 tagggctcag ccaggcagcc agctttgggc tgagctaaca ggaccaatgg attaactggc 3901 atttcagtcc aaggaagctc gaagcaggtt taggaccagg tccccttgag aggtcagagg 3961 ggcctctgtg ggtgctgggt actccagagg tgccactggt ggaagggtca gcggagcccc 4021 agcaggaagg gtgggccagc caggccattc ttagtccctg ggttggggag gcagggagct 4081 agggcaggga ccaaatgaac agaaagtctc agcccaggat ggggcttctt caacaggccc 4141 ctgccctcct gaagcctcag tccttcacct tgccaggtgc cgtttctctt ccgtgaaggc 4201 cactgcccag gtccccagtg cgccccctag tggccatagc ctggttaaag ttccccagtg 4261 cctccttgtg atagaccttc ttctcccacc cccttctgcc cctgggtccc cggccatcca 4321 gcggggctgc cagagaaccc cagacctgcc cttacagtag tgtagcgccc cctccctctt 4381 tcggctggtg tagaatagcc agtagtgtag tgcggtgtgc ttttacgtga tggcgggtgg 4441 gcagcgggcg gcggcgtccg cgcagccgtc tgtccttgat ctgcccgcgg cggcccgtgt 4501 tgtgttttgt gctgtgtcca gcgctaaggc gaccccctcc cccgtactga cttctcctat 4561 aagcgcttct cttcgcatag tcacgtagct cccaccccac cctcttcctg tgtctcacgc 4621 aagttttata ctctaatatt tatatggctt tttttcttcg acaaaaaaat aataaaacgt 4681 ttcttctg // LOCUS HUMPCD17 1855 bp mRNA PRI 20-OCT-1992 DEFINITION Human mRNA for paraneoplastic cerebellar degeneration-associated antigen, complete cds. ACCESSION D12981 NID g219979 KEYWORDS cerebellar Purkinje cell; cytoplasmic protein; paraneoplastic cerebellar degeneration-associated antigen. SOURCE Homo sapiens cerebellum cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1855) AUTHORS Sakai,K., Mitchell,D.J., Tsukamoto,T. and Steinman,L. TITLE Isolation of a complementary DNA clone encoding an autoantigen recognized by an anti-neuronal cell antibody from a patient with paraneoplastic cerebellar degeneration JOURNAL Ann. Neurol. 28 (5), 692-698 (1990) MEDLINE 91083322 REMARK Erratum:[Ann Neurol 1991 Nov;30(5):738] REFERENCE 2 (bases 1 to 1855) AUTHORS Sakai,K. TITLE Direct Submission JOURNAL Submitted (26-AUG-1992) to the DDBJ/EMBL/GenBank databases. Koichiro Sakai, Kanazawa Medical University, Dept. of Neurology; 1-1 Daigaku, Uchinada-machi, Kahoku-gun, Ishikawa 920-02, Japan (E-mail:ksakai, Tel:0762-86-3511, Fax:0762-86-3259) COMMENT Submitted (26-AUG-1992) to DDBJ by: Koichiro Sakai Dept. of Neurology Kanazawa Medical University 1-1 Daigaku, Uchinada-machi Kahoku-gun, Ishikawa 920-02 Japan Phone: 0762-86-3511 Fax: 0762-86-3259. FEATURES Location/Qualifiers source 1..1855 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="cerebellum" gene 18..1349 /gene="PCD-17" CDS 18..1349 /gene="PCD-17" /codon_start=1 /product="paraneoplastic cerebellar degeneration-associated antigen" /db_xref="PID:d1002859" /db_xref="PID:g219980" /translation="MKEDEPWYDHQDLQQDLQLAAELGKTLLDRNTELEDSVQQMYTT NQEQLQEIEYLTKQVELLRQMNEQHAKVYEQLDVTARELEETNQKLVADSKASQQKIL SLTETIECLQTNIDHLQSQVEELKSSGQGRRSPGKCDQEKPAPSFACLKELYDLRQHF VYDHVFAEKITSLQGQPSPDEEENEHLKKTVTMLQAQLSLERQKRVTMEEEYGLVLKE NSELEQQLGATGAYRARALELEAEVAEMRQMLQSEHPFVNGVEKLVPDSLYVPFKEPS QSLLEEMFLTVPESHRKPLKRSSSETILSSLAGSDIVKGHEETCIRRAKAVKQRGISL LHEVDTQYSALKVKYEELLKKCQEEQDSLSHKAVQTSRAAAKDLTGVNAQSEPVASGW ELASVNPEPVSSPTTPPEYKALFKEIFSCIKKTKQEIDEQRTKYRSLSSHS" BASE COUNT 553 a 421 c 497 g 384 t ORIGIN 1 ggacagagga gtttgagatg aaggaggacg agccgtggta cgaccaccag gacctccagc 61 aagatcttca acttgctgct gagcttggga agacattact ggatcggaac acagagttgg 121 aggactctgt tcagcagatg tatacaacca atcaggagca gttacaggaa attgagtatc 181 tgacgaagca agtggaactt ctacggcaga tgaacgaaca acatgcaaag gtttatgaac 241 aattagacgt cacagcaagg gaactggaag aaacaaatca aaagctagtt gctgacagca 301 aggcctcaca gcaaaagatt ctgagcctga ctgaaacgat tgaatgcctg caaaccaaca 361 ttgatcacct ccagagccaa gtggaggagc tgaagtcatc tggccaaggg agaaggagcc 421 cgggaaagtg tgaccaggag aaaccggcac ccagctttgc atgtctgaag gagctgtatg 481 acctccgcca acacttcgtg tatgatcatg tgttcgctga gaagatcact tccttgcaag 541 gtcagccaag ccctgatgaa gaggaaaatg agcacttgaa aaaaacagtg acaatgttgc 601 aggcccagct gagcctggag cggcagaagc gggtgactat ggaggaggaa tatgggctcg 661 tgttaaagga gaacagtgaa ctggagcagc agctgggggc cacaggtgcc taccgagcac 721 gggcgctgga actagaggcc gaggtcgcag agatgcgaca gatgttgcag tcagagcatc 781 catttgtgaa tggagttgag aagctggtgc cagactctct gtatgttcct ttcaaagagc 841 ccagccagag cctgctggaa gagatgttcc tgactgtgcc ggaatcacat agaaagcctc 901 tcaagcgcag cagcagtgag acgatcctca gcagcttggc agggagtgac atcgtgaagg 961 gccacgagga gacctgcatc aggagggcca aggctgtgaa acagaggggc atctcccttc 1021 tgcacgaagt ggacacgcag tacagcgccc tgaaggtgaa gtatgaagag ttgctgaaga 1081 agtgccaaga ggaacaggac tccctgtcac acaaggctgt gcagacctcc agggctgcag 1141 ccaaggacct gactggagtg aacgcccagt ctgagcctgt tgccagcggc tgggaactgg 1201 cctctgtcaa cccagagccc gtgagttccc ctacaacacc tccagaatac aaagcgttgt 1261 ttaaggagat ctttagttgc atcaagaaaa ctaagcagga aatagatgaa cagagaacaa 1321 aataccgatc actctcctct cattcttaat tgaacctcta gctctactac taatttgcct 1381 attgcctatc gcctctctct cccattcaga caagtgtttg tagactctga agcctaatgt 1441 tactcatgac gtttgcctca ttgctttgct tatttagcaa atgcatacaa cgaggaaagg 1501 aggtggctag tggtatcagt tctctgatcc acttccattt aagctcccca ggaaatccca 1561 tgacaaactg gcctctggct ggcgcgctgt tagacttcag ttcctgaaaa ggaccagtgg 1621 agggaagagc tatacttctg gagaagtagg cctggagtta ctacagtatg ggggaaaagg 1681 gtcgagttag aacaaagcta aggcaattcc tattgcttcc ttgcgcaact tctcaaaacg 1741 atgaaagtca gaaggctgtc aaactcaaat atctttgcaa acactgtttg aatactgtga 1801 attcattacg aagaatgttc gagagaaagc aggggtctaa tccaaaaaaa aaaaa // LOCUS HUMPCHSUCA 1162 bp mRNA PRI 07-JAN-1995 DEFINITION Human vacuolar H+ ATPase proton channel subunit mRNA, complete cds. ACCESSION M62762 NID g189675 KEYWORDS vacuolar H(+)-ATPase. SOURCE Homo sapiens (tissue library: cDNA) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1162) AUTHORS Gillespie,G.A., Somlo,S., Germino,G.G., Weinstat-Saslow,D. and Reeders,S.T. TITLE CpG island in the region of an autosomal dominant polycystic kidney disease locus defines the 5' end of a gene encoding a putative proton channel JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88 (10), 4289-4293 (1991) MEDLINE 91239553 FEATURES Location/Qualifiers source 1..1162 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="HeLa" /tissue_lib="cDNA" /map="16p13.3" gene 231..698 /gene="ATPL" CDS 231..698 /gene="ATPL" /codon_start=1 /db_xref="GDB:G00-128-131" /product="vacuolar H+ ATPase proton channel subunit" /db_xref="PID:g189676" /translation="MSESKSGPEYASFFAVMGASAAMVFSALGAAYGTAKSGTGIAAM SVMRPEQIMKSIIPVVMAGIIAIYGLVVAVLIANSLNDDISLYKSFLQLGAGLSVGLS GLAAGFAIGIVGDAGVRGTAQQPRLFVGMILILIFAEVLGLYGLIVALILSTK" BASE COUNT 176 a 421 c 312 g 253 t ORIGIN Chromosome 16, map position p13.3. 1 acgcgtctcc cccacggtgc gaagtgggta cggctcgcag gggcggggcc aggtcatgtg 61 acgcggccgc gccgcatttt gttctgcggt gctggtattt agagcgcacg ctgacgggcc 121 ggatcgcctt cgccgccgcc cgcccgcaaa ccttcgtgcc cggcccgtcc tcgcccccgc 181 ctccgccacc gcctcggccc gcagagcttg ccccctcccc acccgcagac atgtccgagt 241 ccaagagcgg ccccgagtat gcttcgtttt tcgccgtcat gggcgcctcg gccgccatgg 301 tcttcagcgc cctgggcgct gcctatggca cagccaagag cggtaccggc attgcggcca 361 tgtctgtcat gcggccggag cagatcatga agtccatcat cccagtggtc atggctggca 421 tcatcgccat ctacggcctg gtggtggcag tcctcatcgc caactccctg aatgacgaca 481 tcagcctcta caagagcttc ctccagctgg gcgccggcct gagcgtgggc ctgagcggcc 541 tggcagccgg ctttgccatc ggcatcgtgg gggacgctgg cgtgcggggc accgcccagc 601 agccccgact attcgtgggc atgatcctga ttctcatctt cgccgaggtg ctcggcctct 661 acggtctcat cgtcgccctc atcctctcca caaagtagac cctctccgag cccaccagcc 721 acagaatatt atgtaaagac cacccctcct cattccagaa cgaacagcct gacacatacg 781 cagcgccgcc cgcccccagt agttggtctt gtacatgcgc agtatcctag tgcccatcgt 841 ctgtttcccc gccttgcccc cgcccgcccc gtgccgtgga catctgggcc cactcatcgc 901 ccctccaggc ccccggcgcc ccacccccta aagtgctcta gtatgcggat gatttagaat 961 tgtcatttct ctttactgga tgtttattat taaagatctc gcctgttcct gcgtctgcgg 1021 agccgccctt gtctcccagc tatctataac cttagcttgt gtgtcgcctt gtgggttcct 1081 gttgctgaga cttttcctgg atggagccgc cctcaccgcg cccgtggccc tgcgcggagc 1141 tgtgtccaat aaagttcttg ct // LOCUS HUMPCKD13X 2104 bp mRNA PRI 02-NOV-1993 DEFINITION Human protein kinase C-delta 13 mRNA, complete cds. ACCESSION L07860 NID g189679 KEYWORDS protein kinase C-delta. SOURCE Homo sapiens (library: Stratagene HepG2) liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2104) AUTHORS Aris,J.P., Basta,P.V., Holmes,W.D., Ballas,L.M., Moomaw,C., Rankl,N.B., Blobel,G., Loomis,C.R. and Burns,D.J. TITLE Molecular and biochemical characterization of a recombinant human PKC-delta family member JOURNAL Biochim. Biophys. Acta 1174, 171-181 (1993) MEDLINE 93363635 FEATURES Location/Qualifiers source 1..2104 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /tissue_lib="Stratagene HepG2" CDS 59..2089 /codon_start=1 /product="protein kinase C-delta 13" /db_xref="PID:g189680" /translation="MAPFLRIAFNSYELGSLQAEDEANQPFCAVKMKEALSTERGKTL VQKKPTMYPEWKSTFDAHIYEGRVIQIVLMRAAEEPVSEVTVGVSVLAERCKKNNGKA EFWLDLQPQAKVLMSVQYFLEDVDCKQSMRSEDEAKFPTMNRRGAIKQAKIHYIKNHE FIATFFGQPTFCSVCKDFVWGLNKQGYKCRQCNAAIHKKCIDKIIGRCTGTAANSRDT IFQKERFNIDMPHRFKVHNYMSPTFCDHCGSLLWGLVKQGLKCEDCGMNVHHKCREKV ANLCGINQKLLAEALNQVTQRASRRSDSASSEPVGIYQGFEKKTGVAGEDMQDNSGTY GKIWEGSSKCNINNFIFHKVLGKGSFGKVLLGELKGRGEYSAIKALKKDVVLIDDDVE CTMVEKRVLTLAAENPFLTHLICTFQTKDHLFFVMEFLNGGDLMYHIQDKGRFELYRA TFYAAEIMCGLQFLHSKGIIYRDLKLDNVLLDRDGHIKIADFGMCKENIFGESRASTF CGTPDYIAPEILQGLKYTFSVDWWSFGVLLYEMLIGQSPFHGDDEDELFESIRVDTPH YPRWITKESKDILEKLFEREPTKRLGMTGNIKIHPFFKTINWTLLEKRRLEPPFRPKV KSPRDYSNFDQEFLNEKARLSYSDKNLIDSMDQSAFAGFSFVNPKFEHLLED" BASE COUNT 524 a 581 c 582 g 417 t ORIGIN 1 tgccgccgcg acccttggcg cctgcccctg caacgggagc cccactgcag gccccaccat 61 ggcgccgttc ctgcgcatcg ccttcaactc ctatgagctg ggctccctgc aggccgagga 121 cgaggcgaac cagcccttct gtgccgtgaa gatgaaggag gcgctcagca cagagcgtgg 181 gaaaacactg gtgcagaaga agccgaccat gtatcctgag tggaagtcga cgttcgatgc 241 ccacatctat gaggggcgcg tcatccagat tgtgctaatg cgggcagcag aggagccagt 301 gtctgaggtg accgtgggtg tgtcggtgct ggccgagcgc tgcaagaaga acaatggcaa 361 ggctgagttc tggctggacc tgcagcctca ggccaaggtg ttgatgtctg ttcagtattt 421 cctggaggac gtggattgca aacaatctat gcgcagtgag gacgaggcca agttcccaac 481 gatgaaccgc cgcggagcca tcaaacaggc caaaatccac tacatcaaga accatgagtt 541 tatcgccacc ttctttgggc aacccacctt ctgttctgtg tgcaaagact ttgtctgggg 601 cctcaacaag caaggctaca aatgcaggca atgtaacgct gccatccaca agaaatgcat 661 cgacaagatc atcggcagat gcactggcac cgcggccaac agccgggaca ctatattcca 721 gaaagaacgc ttcaacatcg acatgccgca ccgcttcaag gttcacaact acatgagccc 781 caccttctgt gaccactgcg gcagcctgct ctggggactg gtgaagcagg gattaaagtg 841 tgaagactgc ggcatgaatg tgcaccataa atgccgggag aaggtggcca acctctgcgg 901 catcaaccag aagcttttgg ctgaggcctt gaaccaagtc acccagagag cctcccggag 961 atcagactca gcctcctcag agcctgttgg gatatatcag ggtttcgaga agaagaccgg 1021 agttgctggg gaggacatgc aagacaacag tgggacctac ggcaagatct gggagggcag 1081 cagcaagtgc aacatcaaca acttcatctt ccacaaggtc ctgggcaaag gcagcttcgg 1141 gaaggtgctg cttggagagc tgaagggcag aggagagtac tctgccatca aggccctcaa 1201 gaaggatgtg gtcctgatcg acgacgacgt ggagtgcacc atggttgaga agcgggtgct 1261 gacacttgcc gcagagaatc cctttctcac ccacctcatc tgcaccttcc agaccaagga 1321 ccacctgttc tttgtgatgg agttcctcaa cgggggggac ctgatgtacc acatccagga 1381 caaaggccgc tttgaactct accgtgccac gttttatgcc gctgagataa tgtgtggact 1441 gcagtttcta cacagcaagg gcatcattta cagggacctc aaactggaca atgtgctgtt 1501 ggaccgggat ggccacatca agattgccga ctttgggatg tgcaaagaga acatattcgg 1561 ggagagccgg gccagcacct tctgcggcac ccctgactat atcgcccctg agatcctaca 1621 gggcctgaag tacacattct ctgtggactg gtggtctttc ggggtccttc tgtacgagat 1681 gctcattggc cagtccccct tccatggtga tgatgaggat gaactcttcg agtccatccg 1741 tgtggacacg ccacattatc cccgctggat caccaaggag tccaaggaca tcctggagaa 1801 gctctttgaa agggaaccaa ccaagaggct gggaatgacg ggaaacatca aaatccaccc 1861 cttcttcaag accataaact ggactctgct ggaaaagcgg aggttggagc cacccttcag 1921 gcccaaagtg aagtcaccca gagactacag taactttgac caggagttcc tgaacgagaa 1981 ggcgcgcctc tcctacagcg acaagaacct catcgactcc atggaccagt ctgcattcgc 2041 tggcttctcc tttgtgaacc ccaaattcga gcacctcctg gaagattgag gttcctggac 2101 agat // LOCUS HUMPCOLCE 1480 bp mRNA PRI 03-FEB-1995 DEFINITION Human procollagen C-proteinase enhancer protein (PCOLCE) mRNA, complete cds. ACCESSION L33799 NID g642907 KEYWORDS procollagen C-proteinase enhancer protein. SOURCE Homo sapiens (tissue library: lambda gt10) placenta cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1480) AUTHORS Takahara,K., Kessler,E., Biniaminov,L., Brusel,M., Eddy,R.L., Jani-Sait,S., Shows,T.B. and Greenspan,D.S. TITLE Type I procollagen COOH-terminal proteinase enhancer protein: identification, primary structure, and chromosomal localization of the cognate human gene (PCOLCE) JOURNAL J. Biol. Chem. 269 (42), 26280-26285 (1994) MEDLINE 95014462 FEATURES Location/Qualifiers source 1..1480 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /tissue_lib="lambda gt10" /map="7q21.3-7q22" sig_peptide 61..135 /gene="PCOLCE" CDS 61..1410 /gene="PCOLCE" /note="putative N-linked glycosylation site 145..153; putative N-linked glycosylation site 1351..1359" /codon_start=1 /product="procollagen C-proteinase enhancer protein" /db_xref="PID:g642908" /translation="MLPAATASLLGPLLTACALLPFAQGQTPNYTRPVFLCGGDVKGE SGYVASEGFPNSYPPNKECIWTITVPEGQTVSLSFRVFDLELHPACRYDALEVFAGSG TSGQRLGRFCGTFRPAPLVAPGNQVTLRMTTDEGTGGRGFLLWYSGRATSGSEHQFCG GRLEKAQGTLTTPNWPESDYPPGISCSWHIIAPPDQVIALTFEKFDLEPDTYCRYDSV SVFNGAVSDDSRRLGKFCGDAVPGSISSEGNELLVQFVSDLSVTADGFSASYKTLPRG TAKEGQGPGPKRGTEPKVKLPPKSQPPEKTEESPSAPDAPTCPKQCRRTGTLQSNFCA SSLVVTATVKSMVREPGEGLAVTVSLIGAYKTGGLDLPTPPTGASLKFYVPCKQCPPM KKGVSYLLMGQVEENRGPVLPPESFVVLHRPNQDQILTNLSKRKCPSQPVRAAASQD" gene 61..1410 /gene="PCOLCE" mat_peptide 136..1407 /gene="PCOLCE" /product="procollagen C-proteinase enhancer protein" BASE COUNT 286 a 501 c 414 g 279 t ORIGIN 1 ctctgcaaaa ttcagctgct gcctctgtct tgaggacccc agcgcctttc ccccggggcc 61 atgctgcctg cagccacagc ctccctcctg gggcccctcc tcactgcctg cgccctgctg 121 ccttttgccc agggccagac ccccaactac accagacccg tgttcctgtg cggaggggat 181 gtgaaggggg aatcaggtta cgtggcaagt gaggggttcc ccaactccta cccccctaat 241 aaggagtgca tctggaccat aacggtcccc gagggccaga ctgtgtccct ctcattccga 301 gtcttcgacc tggagctgca ccccgcctgc cgctacgatg ctctggaggt cttcgctggg 361 tctgggactt ccggccagcg gctcggacgc ttttgtggga ccttccggcc tgcgccccta 421 gtcgcccccg gcaaccaggt gaccctgagg atgacgacgg atgagggcac aggaggacga 481 ggcttcctgc tctggtacag cgggcgggcc acctcgggct ctgagcacca attttgcggg 541 gggcggctgg agaaggccca gggaaccctg accacgccca actggcccga gtccgattac 601 cccccgggca tcagctgttc ctggcacatc atcgcgcccc cggaccaggt catcgcgctg 661 accttcgaga agtttgacct ggagccggac acctactgcc gctatgactc ggtcagcgtc 721 ttcaacggag ccgtgagcga cgactcccgg aggctgggga agttctgcgg cgacgcagtc 781 ccgggctcca tctcctccga agggaatgaa ctcctcgtcc agttcgtctc agatctcagt 841 gtcaccgctg atggcttctc agcctcctac aagaccctgc cgcggggcac tgccaaagaa 901 gggcaagggc ccggccccaa acggggaact gagcctaaag tcaagctgcc ccccaagtcc 961 caacctccgg agaaaacaga ggaatctcct tcagcccctg atgcacccac ctgcccaaag 1021 cagtgccgcc ggacaggcac cttgcagagc aacttctgtg ccagcagcct tgtggtgact 1081 gcgacagtga agtccatggt tcgggagcca ggggagggcc ttgccgtgac tgtcagtctt 1141 attggtgctt ataaaactgg aggactggac ctgccaactc cacccactgg tgcctccctg 1201 aagttttacg tgccttgcaa gcagtgcccc cccatgaaga aaggagtcag ttatctgctg 1261 atgggccagg tagaagagaa cagaggcccc gtccttcctc cagagagctt tgtggttctc 1321 caccggccca accaggacca gatcctcacc aacctaagca agaggaagtg cccctctcaa 1381 cctgtgcggg ctgctgcgtc ccaggactga gacgcaggcc agccccggcc cctagccctc 1441 aggcctctct tcttatccaa ataaatgttt cttaatgaaa // LOCUS HUMPCP 2060 bp mRNA PRI 14-MAY-1996 DEFINITION Human prolylcarboxypeptidase mRNA, complete cds. ACCESSION L13977 NID g431320 KEYWORDS prolylcarboxypeptidase. SOURCE Homo sapiens (tissue library: lambda gt10 of Graeme I. Bell) kidney cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2060) AUTHORS Yang,H.Y., Erdos,E.G. and Chiang,T.S. TITLE New enzymatic route for the inactivation of angiotensin JOURNAL Nature 218 (148), 1224-1226 (1968) MEDLINE 68284682 REFERENCE 2 (bases 1 to 2060) AUTHORS Odya,C.E., Marinkovic,D.V., Hammon,K.J., Stewart,T.A. and Erdos,E.G. TITLE Purification and properties of prolylcarboxypeptidase (angiotensinase C) from human kidney JOURNAL J. Biol. Chem. 253 (17), 5927-5931 (1978) MEDLINE 78242265 REFERENCE 3 (bases 1 to 2060) AUTHORS Tan,F., Morris,P.W., Skidgel,R.A. and Erdos,E.G. TITLE Sequencing and cloning of human prolylcarboxypeptidase (angiotensinase C). Similarity to both serine carboxypeptidase and prolylendopeptidase families JOURNAL J. Biol. Chem. 268 (22), 16631-16638 (1993) MEDLINE 93346415 REMARK Erratum:[J Biol Chem 1993 Dec 5;268(34):26032] FEATURES Location/Qualifiers source 1..2060 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /tissue_type="kidney" /tissue_lib="lambda gt10 of Graeme I. Bell" 5'UTR 1..29 /note="putative" /citation=[3] CDS 30..1520 /standard_name="angiotensinase C" /EC_number="3.4.16.2" /function="cleaves C-terminal amino acids linked to penultimate proline" /note="Human prolylcarboxypeptidase is a lysosomal enzyme with an acid pH optimum that cleaves C-terminal amino acids linked to proline in peptides such as angiotensin II and des-Arg9-bradykinin." /citation=[2] /citation=[3] /codon_start=1 /evidence=experimental /product="prolylcarboxypeptidase" /db_xref="PID:g431321" /translation="MGRRALLLLLLSFLAPWATIALRPALRALGSLHLPTNPTSLPAV AKNYSVLYFQQKVDHFGFNTVKTFNQRYLVADKYWKKNGGSILFYTGNEGDIIWFCNN TGFMWDVAEELKAMLVFAEHRYYGESLPFGDNSFKDSRHLNFLTSEQALADFAELIKH LKRTIPGAENQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVF MKIVTTDFRKSGPHCSESIHRSWDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLK DWISETWVNLAMVDYPYASNFLQPLPAWPIKVVCQYLKNPNVSDSLLLQNIFQALNVY YNYSGQVKCLNISETATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSD DCFQQWGVRPRPSWITTMYGGKNISSHTNIVFSNGELDPWSGGGVTKDITDTLVAVTI SEGAHHLDLRTKNALDPMSVLLARSLEVRHMKNWIRDFYDSAGKQH" sig_peptide 30..119 /note="putative" /citation=[3] mat_peptide 165..1517 /EC_number="3.4.16.2" /citation=[2] /citation=[3] /function="cleaves C-terminal amino acids linked to penultimate proline" /evidence=experimental /product="prolylcarboxypeptidase" 3'UTR 1518..2060 /note="putative" /citation=[3] polyA_signal 2038..2043 /note="putative" /citation=[3] polyA_site 2060 /note="truncated 70 bp polyA tail; putative" /citation=[3] BASE COUNT 534 a 462 c 462 g 602 t ORIGIN 1 cacccgcact gcagtctcca gcctgagcca tgggccgccg agccctcctg ctcctgcttc 61 tgtcttttct ggcgccctgg gccaccatag ccctccggcc ggccttaagg gccctcggca 121 gcctacactt gccaaccaac cccacatccc tcccggctgt agccaagaac tattcggttc 181 tctacttcca acagaaggtt gatcattttg gatttaatac tgtgaaaact tttaatcagc 241 ggtacctagt agctgataaa tactggaaga aaaatggtgg atcaatactt ttctacactg 301 gtaatgaagg ggacattatc tggttttgta ataacacggg gttcatgtgg gatgtggctg 361 aggaactgaa agctatgttg gtgtttgctg aacatcgata ctatggagag tctctcccct 421 ttggtgacaa ctcattcaag gattccagac acttgaattt cctgacatca gaacaagctc 481 tggctgattt tgcagagtta atcaaacact tgaaaagaac aatcccagga gctgaaaatc 541 aacctgtcat tgccatagga ggctcctatg gtggcatgct tgccgcctgg tttaggatga 601 aatatcctca tatggtagtt ggagctcttg cagcttctgc ccctatctgg cagtttgagg 661 atttagtacc ttgtggtgta tttatgaaga tcgtaactac agattttagg aaaagcggtc 721 cacattgttc agagagcatc cacaggtcct gggatgccat taatcgactc tcaaatactg 781 gcagtggttt gcagtggctt actggagccc ttcacttatg cagcccatta acttctcagg 841 acatccaaca tttgaaagac tggatctctg aaacctgggt gaatctggca atggtggact 901 atccttatgc ctctaacttt ttacagcctt tgcctgcttg gcctatcaag gtagtgtgcc 961 agtatttgaa aaatcccaac gtatctgatt cactgctgct gcagaatatt ttccaagctc 1021 tgaatgtata ttacaattat tcgggccagg tgaaatgcct gaatatttca gagacagcaa 1081 ctagcagtct gggaacactg ggttggagct atcaggcctg cacagaagta gtcatgccct 1141 tttgtactaa tggtgtcgat gacatgtttg aacctcactc atggaactta aaggaacttt 1201 ctgatgactg ttttcaacag tggggtgtga gaccaaggcc ctcctggatc actactatgt 1261 atggaggcaa aaacattagt tcacacacaa acattgtttt cagcaatggt gaactagacc 1321 cctggtcagg aggtggagta actaaggata tcacagacac tctggttgca gtcaccatct 1381 cagagggggc ccaccactta gatctccgca ccaagaatgc cttggatcct atgtctgtgc 1441 tgttagcccg ctccttggaa gttagacata tgaagaattg gatcagagat ttctatgaca 1501 gtgcaggaaa gcagcactga gaaacttttg attgttttca atttcttctt ttatgttcac 1561 accaccacat tcccattcac tttgattttc tacatgtaat taccttcttt tgtttatcat 1621 tagatttgat ggggccaaag ttgagataga atagagggtg atgacggtaa gagcaagtgt 1681 cccatgaatg tgatttcctg ggttctcact gtctttgcac cacgtctagg aagaatcttc 1741 ttgatagctc tcccacacca tcagtggccc tcataactgg agtagagttc ctggttgctt 1801 ttcataagag ggagagttac tttctttgta tctctgcaag cagagatttc tctttggttt 1861 tgaggttgaa gtgtctttgg cccatttgta agtccccatc cctaccctac acaaagtaaa 1921 agcagaagat agataaaaaa tgatgtaatt gcagctggta ggatgtctgg tgcccaatcc 1981 caggaagtga gagccatttc ttttgtactg gatttaatga ctttgaactg tgctgtaaat 2041 aaataataca gctggacctt // LOCUS HUMPCPBX 1728 bp mRNA PRI 07-JAN-1995 DEFINITION Human prepro-plasma carboxypeptidase B mRNA, complete cds. ACCESSION M75106 NID g189686 KEYWORDS plasma carboxypeptidase. SOURCE Homo sapiens liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1728) AUTHORS Eaton,D.L., Malloy,B.E., Tsai,S.P., Henzel,W. and Drayna,D. TITLE Isolation, molecular cloning, and partial characterization of a novel carboxypeptidase B from human plasma JOURNAL J. Biol. Chem. 266 (32), 21833-21838 (1991) MEDLINE 92042093 FEATURES Location/Qualifiers source 1..1728 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" gene 20..1291 /gene="pCPB" CDS 20..1291 /gene="pCPB" /note="prepro-plasma carboxypeptidase B" /codon_start=1 /db_xref="PID:g189687" /translation="MKLCSLAVLVPIVLFCEQHVFAFQSGQVLAALPRTSRQVQVLQN LTTTYEIVLWQPVTADLIVKKKQVHFFVNASDVDNVKAHLNVSGIPCSVLLADVEDLI QQQISNDTVSPRASASYYEQYHSLNEIYSWIEFITERHPDMLTKIHIGSSFEKYPLYV LKVSGKEQTAKNAIWIDCGIHAREWISPAFCLWFIGHITQFYGIIGQYTNLLRLVDFY VMPVVNVDGYDYSWKKNRMWRKNRSFYANNHCIGTDLNRNFASKHWCEEGASSSSCSE TYCGLYPESEPEVKAVASFLRRNINQIKAYISMHSYSQHIVFPYSYTRSKSKDHEELS LVASEAVRAIEKTSKNTRYTHGHGSETLYLAPGGGDDWIYDLGIKYSFTIELRDTGTY GFLLPERYIKPTCREAFAAVSKIAWHVIRNV" mat_peptide 86..1288 /gene="pCPB" /product="plasma carboxypeptidase B" BASE COUNT 518 a 354 c 338 g 518 t ORIGIN 1 agagaaaatt gctgttggga tgaagctttg cagccttgca gtccttgtac ccattgttct 61 cttctgtgag cagcatgtct tcgcgtttca gagtggccaa gttctagctg ctcttcctag 121 aacctctagg caagttcaag ttctacagaa tcttactaca acatatgaga ttgttctctg 181 gcagccggta acagctgacc ttattgtgaa gaaaaaacaa gtccattttt ttgtaaatgc 241 atctgatgtc gacaatgtga aagcccattt aaatgtgagc ggaattccat gcagtgtctt 301 gctggcagac gtggaagatc ttattcaaca gcagatttcc aacgacacag tcagcccccg 361 agcctccgca tcgtactatg aacagtatca ctcactaaat gaaatctatt cttggataga 421 atttataact gagaggcatc ctgatatgct tacaaaaatc cacattggat cctcatttga 481 gaagtaccca ctctatgttt taaaggtttc tggaaaagaa caaacagcca aaaatgccat 541 atggattgac tgtggaatcc atgccagaga atggatctct cctgctttct gcttgtggtt 601 cataggccat ataactcaat tctatgggat aatagggcaa tataccaatc tcctgaggct 661 tgtggatttc tatgttatgc cggtggttaa tgtggacggt tatgactact catggaaaaa 721 gaatcgaatg tggagaaaga accgttcttt ctatgcgaac aatcattgca tcggaacaga 781 cctgaatagg aactttgctt ccaaacactg gtgtgaggaa ggtgcatcca gttcctcatg 841 ctcggaaacc tactgtggac tttatcctga gtcagaacca gaagtgaagg cagtggctag 901 tttcttgaga agaaatatca accagattaa agcatacatc agcatgcatt catactccca 961 gcatatagtg tttccatatt cctatacacg aagtaaaagc aaagaccatg aggaactgtc 1021 tctagtagcc agtgaagcag ttcgtgctat tgagaaaact agtaaaaata ccaggtatac 1081 acatggccat ggctcagaaa ccttatacct agctcctgga ggtggggacg attggatcta 1141 tgatttgggc atcaaatatt cgtttacaat tgaacttcga gatacgggca catacggatt 1201 cttgctgccg gagcgttaca tcaaacccac ctgtagagaa gcttttgccg ctgtctctaa 1261 aatagcttgg catgtcatta ggaatgttta atgcccctga ttttatcatt ctgcttccgt 1321 attttaattt actgattcca gcaagaccaa atcattgtat cagattattt ttaagtttta 1381 tccgtagttt tgataaaaga ttttcctatt ccttggttct gtcagagaac ctaataagtg 1441 ctactttgcc attaaggcag actagggttc atgtcttttt accctttaaa aaaaaattgt 1501 aaaagtctag ttacctactt tttctttgat tttcgacgtt tgactagcca tctcaagcaa 1561 ctttcgacgt ttgactagcc atctcaagca agtttaatca aagatcatct cacgctgatc 1621 attggatcct actcaacaaa aggaagggtg gtcagaagta cattaaagat ttctgctcca 1681 aattttcaat aaatttcttc ttctccttta aaaaaaaaaa aaaaaaaa // LOCUS HUMPD1A 921 bp DNA PRI 27-DEC-1994 DEFINITION Human PD-1 gene, complete cds. ACCESSION L27440 NID g604540 KEYWORDS . SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 921) AUTHORS Shinohara,T., Taniwaki,M., Ishida,Y., Kawaichi,M. and Honjo,T. TITLE Structure and chromosomal localization of the human PD-1 gene (PDCD1) JOURNAL Genomics 23 (3), 704-706 (1994) MEDLINE 95154844 FEATURES Location/Qualifiers source 1..921 /organism="Homo sapiens" /db_xref="taxon:9606" gene 25..891 /gene="PD-1" CDS 25..891 /gene="PD-1" /note="Tyr-X-X-Leu motif 691..777; extracellular domain 25..525; immunoglobulin-like domain 172..405; intracellular domain 601..891; transmembrane region 526..600" /codon_start=1 /db_xref="PID:g604541" /translation="MQIPQAPWPVVWAVLQLGWRPGWFLDSPDRPWNPPTFSPALLVV TEGDNATFTCSFSNTSESFVLNWYRMSPSNQTDKLAAFPEDRSQPGQDCRFRVTQLPN GRDFHMSVVRARRNDSGTYLCGAISLAPKAQIKESLRAELRVTERRAEVPTAHPSPSP RSAGQFQTLVVGVVGGLLGSLVLLVWVLAVICSRAARGTIGARRTGQPLKEDPSAVPV FSVDYGELDFQWREKTPEPPVPCVPEQTEYATIVFPSGMGTSSPARRGSADGPRSAQP LRPEDGHCSWPL" BASE COUNT 163 a 326 c 280 g 152 t ORIGIN 1 cactctggtg gggctgctcc aggcatgcag atcccacagg cgccctggcc agtcgtctgg 61 gcggtgctac aactgggctg gcggccagga tggttcttag actccccaga caggccctgg 121 aaccccccca ccttctcccc agccctgctc gtggtgaccg aaggggacaa cgccaccttc 181 acctgcagct tctccaacac atcggagagc ttcgtgctaa actggtaccg catgagcccc 241 agcaaccaga cggacaagct ggccgccttc cccgaggacc gcagccagcc cggccaggac 301 tgccgcttcc gtgtcacaca actgcccaac gggcgtgact tccacatgag cgtggtcagg 361 gcccggcgca atgacagcgg cacctacctc tgtggggcca tctccctggc ccccaaggcg 421 cagatcaaag agagcctgcg ggcagagctc agggtgacag agagaagggc agaagtgccc 481 acagcccacc ccagcccctc acccaggtca gccggccagt tccaaaccct ggtggttggt 541 gtcgtgggcg gcctgctggg cagcctggtg ctgctagtct gggtcctggc cgtcatctgc 601 tcccgggccg cacgagggac aataggagcc aggcgcaccg gccagcccct gaaggaggac 661 ccctcagccg tgcctgtgtt ctctgtggac tatggggagc tggatttcca gtggcgagag 721 aagaccccgg agccccccgt gccctgtgtc cctgagcaga cggagtatgc caccattgtc 781 tttcctagcg gaatgggcac ctcatccccc gcccgcaggg gctcagctga cggccctcgg 841 agtgcccagc cactgaggcc tgaggatgga cactgctctt ggcccctctg accggcttcc 901 ttggccacca gtgttctgca g // LOCUS HUMPDE2A 3871 bp mRNA PRI 03-JUN-1993 DEFINITION Human rolipram-sebsitive, cAMP-specific phosphodiesterase (PDE2) mRNA, complete cds. ACCESSION M97515 NID g292387 KEYWORDS cAMP-specific phosphodiesterase. SOURCE Homo sapiens brain, frontal cortex cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3871) AUTHORS McLaughlin,M.M., Cieslinski,L.B., Burman,M., Torphy,T.J. and Livi,G.P. TITLE A Low-Km, Rolipram-sensitive, cAMP-specific Phosphodiesterase from Human Brain: Cloning and Expression of cDNA, Biochemical Characterization of Recombinant Protein, and Tissue Distribution of mRNA JOURNAL J. Biol. Chem. 268, 6470-6476 (1993) MEDLINE 93203241 FEATURES Location/Qualifiers source 1..3871 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="brain, frontal cortex" CDS 282..1976 /note="rolipram-sensitive, cAMP-specific" /codon_start=1 /product="phosphodiesterase" /db_xref="PID:g292388" /translation="MKEHGGTFSSTGISGGSGDSAMDSLQPLQPNYMPVCLFAEESYQ KLAMETLEELDWCLDQLETIQTYRSVSEMASNKFKRMLNRELTHLSEMSRSGNQVSEY ISNTFLDKQNDVEIPSPTQKDREKKKKQQLMTQISGVKKLMHSSSLNNTSISRFGVNT ENEDHLAKELEDLNKWGLNIFNVAGYSHNRPLTCIMYAIFQERDLLKTFRISSDTFIT YMMTLEDHYHSDVAYHNSLHAADVAQSTHVLLSTPALDAVFTDLEILAAIFAAAIHDV DHPGVSNQFLINTNSELALMYNDESVLENHHLAVGFKLLQEEHCDIFMNLTKKQRQTL RKMVIDMVLATDMSKHMSLLADLKTMVETKKVTSSGVLLLDNYTDRIQVLRNMVHCAD LSNPTKSLELYRQWTDRIMEEFFQQGDKERERGMEISPMCDKHTASVEKSQVGFIDYI VHPLWETWADLVQPDAQDILDTLEDNRNWYQSMIPQSPSPPLDEQNRDCQGLMEKFQF ELTLDEEDSEGPEKEGEGHSYFSSTKTLCVIDPENRDSLGETDIDIATEDKSPVDT" polyA_site 1371 BASE COUNT 1108 a 847 c 851 g 1065 t ORIGIN 1 ggcacgagcc taaagaaccc tgggatgact aaggcagaga gagtctgaga aaactctttg 61 gtgcttctgc ctttagtttt aggacacatt tatgcagatg agcttataag agaccgttcc 121 ctccgccttc ttcctcagag gaagtttctt ggtagatcac cgacacctca tccaggcggg 181 gggttggggg gaaacttggc accagccatc ccaggcagag caccactgtg atttgttctc 241 ctggtggaga gagctggaag gaaggagcca gcgtgcaaat aatgaaggag cacgggggca 301 ccttcagtag caccggaatc agcggtggta gcggtgactc tgctatggac agcctgcagc 361 cgctccagcc taactacatg cctgtgtgtt tgtttgcaga agaatcttat caaaaattag 421 caatggaaac gctggaggaa ttagactggt gtttagacca gctagagacc atacagacct 481 accggtctgt cagtgagatg gcttctaaca agttcaaaag aatgctgaac cgggagctga 541 cacacctctc agagatgagc cgatcaggga accaggtgtc tgaatacatt tcaaatactt 601 tcttagacaa gcagaatgat gtggagatcc catctcctac ccagaaagac agggagaaaa 661 agaaaaagca gcagctcatg acccagataa gtggagtgaa gaaattaatg catagttcaa 721 gcctaaacaa tacaagcatc tcacgctttg gagtcaacac tgaaaatgaa gatcacctgg 781 ccaaggagct ggaagacctg aacaaatggg gtcttaacat ctttaatgtg gctggatatt 841 ctcacaatag acccctaaca tgcatcatgt atgctatatt ccaggaaaga gacctcctaa 901 agacattcag aatctcatct gacacattta taacctacat gatgacttta gaagaccatt 961 accattctga cgtggcatat cacaacagcc tgcacgctgc tgatgtagcc cagtcgaccc 1021 atgttctcct ttctacacca gcattagacg ctgtcttcac agatttggag atcctggctg 1081 ccatttttgc agctgccatc catgacgttg atcatcctgg agtctccaat cagtttctca 1141 tcaacacaaa ttcagaactt gctttgatgt ataatgatga atctgtgttg gaaaatcatc 1201 accttgctgt gggtttcaaa ctgctgcaag aagaacactg tgacatcttc atgaatctca 1261 ccaagaagca gcgtcagaca ctcaggaaga tggttattga catggtgtta gcaactgata 1321 tgtctaaaca tatgagcctg ctggcagacc tgaagacaat ggtagaaacg aagaaagtta 1381 caagttcagg cgttcttctc ctagacaact ataccgatcg cattcaggtc cttcgcaaca 1441 tggtacactg tgcagacctg agcaacccca ccaagtcctt ggaattgtat cggcaatgga 1501 cagaccgcat catggaggaa tttttccagc agggagacaa agagcgggag aggggaatgg 1561 aaattagccc aatgtgtgat aaacacacag cttctgtgga aaaatcccag gttggtttca 1621 tcgactacat tgtccatcca ttgtgggaga catgggcaga tttggtacag cctgatgctc 1681 aggacattct cgatacctta gaagataaca ggaactggta tcagagcatg atacctcaaa 1741 gtccctcacc accactggac gagcagaaca gggactgcca gggtctgatg gagaagtttc 1801 agtttgaact gactctcgat gaggaagatt ctgaaggacc tgagaaggag ggagagggac 1861 acagctattt cagcagcaca aagacgcttt gtgtgattga tccagaaaac agagattccc 1921 tgggagagac tgacatagac attgcaacag aagacaagtc ccccgtggat acataatccc 1981 cctctccctg tggagatgaa cattctatcc ttgatgagca tgccagctat gtggtagggc 2041 cagcccacca tgggggccaa gacctgcaca ggacaagggc cacctggctt tcagttactt 2101 gagtttggag tcagaaagca agaccaggaa gcaaatagca gctcaggaaa tcccacggtt 2161 gacttgcctt gatggcaagc ttggtggaga gggctgaagc tgttgctggg ggccgattct 2221 gatcaagaca catggcttga aaatggaaga cacaaaactg agagatcatt ctgcactaag 2281 tttcgggaac ttatccccga cagtgactga actcactgac taataacttc atttatgaat 2341 cttctcactt gtccctttgt ctgccaacct gtgtgccttt tttgtaaaac attttcatgt 2401 ctttaaaatg cctgttgaat acctggagtt tagtatcaac ttctacacag ataagctttc 2461 aaagttgaca aacttttttg actctttctg gaaaagggaa agaaaatagt cttccttctt 2521 tcttgggcaa tatccttcac tttactacag ttacttttgc aaacagacag aaaggataca 2581 cttctaacca cattttactt ccttcccctg ttgtccagtc caactccaca gtcactctta 2641 aaacttctct ctgtttgcct gcctccaaca gtacttttaa ctttttgctg taaacagaat 2701 aaaattgaac aaattagggg gtagaaagga gcagtggtgt cgttcaccgt gagagtctgc 2761 atagaactca gcagtgtgcc ctgctgtgtc ttggaccctg ccccccacag gagttgtaca 2821 gtccctggcc ctgctcccta cctcctctct tcaccccgtt aggctgtttt caatgtaatg 2881 ctgccgtcct tctcttgcac tgccttctgc gctaacacct ccattcctgt ttataaccgt 2941 gtatttatta cttaatgtat ataatgtaat gttttgtaag ttattaattt atatatctaa 3001 cattgcctgc caatggtggt gttaaatttg tgtagaaaac tctgcctaag agttacgact 3061 ttttcttgta atgttttgta ttgtgtatta tataacccaa acgtcactta gtagagacat 3121 atggccccct tggcagagag gacaggggtg ggcttttgtt caaagggtct gccctttccc 3181 tgcctgagtt gctacttctg cacaacccct ttatgaacca gttttggaaa caatattcta 3241 cacattagat actaaatggt ttatactgag cttttacttt tgtatagctt gataggggca 3301 gggggcaatg gatgtagttt ttacccaggt tctatccaaa tctatgtggg catgagttgg 3361 gttataactg gatcctacta tcattgtggc tttggttcaa aaggaaacac tacatttgct 3421 cacagatgat tcttctgaat gctcccgaac tactgacttt gaagaggtag cctcctgcct 3481 gccattaagc aggaatgtca tgttccagtt cattacaaaa gaaaacaata aaacaatgtg 3541 aatttttata ataaaatgtg aactgatgta gcaaattacg caaatgtgaa gcctcttctg 3601 ataacacttg ttaggcctct tactgatgtc agtttcagtt tgtaaaatat gtttcatgct 3661 ttcagttcag cattgtgact cagtaaatac agaaaatggc acaaatgtgc atgaccaatg 3721 tatgtctatg aacactgcat tgtttcaggt ggacatttta tcgattttca aatgtttctc 3781 acaatgtatg ttatagtgtt attattatat attgtgttca aatgcattct aaagagactt 3841 ttatatgagg tgaataaaga aaagcataat t // LOCUS HUMPDGFR 5427 bp mRNA PRI 28-SEP-1992 DEFINITION Human platelet-derived growth factor (PDGF) receptor mRNA, complete cds. ACCESSION M21616 NID g189729 KEYWORDS platelet-derived growth factor. SOURCE Human foreskin fibroblast mRNA, clone lambda-HPDGFR-8. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5427) AUTHORS Claesson-Welsh,L., Eriksson,A., Moren,A., Severinsson,L., Ek,B., Oestman,A., Betsholtz,C. and Heldin,C.-H. TITLE cDNA cloning and expression of a human platelet-derived growth factor (PDGF) receptor specific for B-chain-containing PDGF molecules JOURNAL Mol. Cell. Biol. 8, 3476-3486 (1988) MEDLINE 89096941 FEATURES Location/Qualifiers source 1..5427 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="fibroblast" /tissue_type="foreskin" sig_peptide 187..282 CDS 187..3507 /codon_start=1 /product="platelet-derived growth factor receptor" /db_xref="PID:g189730" /translation="MRLPGAMPALALKGELLLLSLLLLLEPQISQGLVVTPPGPELVL NVSSTFVLTCSGSAPVVWERMSQEPPQEMAKAQDGTFSSVLTLTNLTGLDTGEYFCTH NDSRGLETDERKRLYIFVPDPTVGFLPNDAEELFIFLTEITEITIPCRVTDPQLVVTL HEKKGDVALPVPYDHQRGFSGIFEDRSYICKTTIGDREVDSDAYYVYRLQVSSINVSV NAVQTVVRQGENITLMCIVIGNDVVNFEWTYPRKESGRLVEPVTDFLLDMPYHIRSIL HIPSAELEDSGTYTCNVTESVNDHQDEKAINITVVESGYVRLLGEVGTLQFAELHRSR TLQVVFEAYPPPTVLWFKDNRTLGDSSAGEIALSTRNVSETRYVSELTLVRVKVAEAG HYTMRAFHEDAEVQLSFQLQINVPVRVLELSESHPDSGEQTVRCRGRGMPQPNIIWSA CRDLKRCPRELPPTLLGNSSEEESQLETNVTYWEEEQEFEVVSTLRLQHVDRPLSVRC TLRNAVGQDTQEVIVVPHSLPFKVVVISAILALVVLTIISLIILIMLWQKKPRYEIRW KVIESVSSDGHEYIYVDPMQLPYDSTWELPRDQLVLGRTLGSGAFGQVVEATAHGLSH SQATMKVAVKMLKSTARSSEKQALMSELKIMSHLGPHLNVVNLLGACTKGGPIYIITE YCRYGDLVDYLHRNKHTFLQHHSDKRRPPSAELYSNALPVGLPLPSHVSLTGESDGGY MDMSKDESVDYVPMLDMKGDVKYADIESSNYMAPYDNYVPSAPERTCRATLINESPVL SYMDLVGFSYQVANGMEFLASKNCVHRDLAARNVLICEGKLVKICDFGLARDIMRDSN YISKGSTFLPLKWMAPESIFNSLYTTLSDVWSFGILLWEIFTLGGTPYPELPMNEQFY NAIKRGYRMAQPAHASDEIYEIMQKCWEEKFEIRPPFSQLVLLLERLLGEGYKKKYQQ VDEEFLRSDHPAILRSQARLPGFHGLRSPLDTSSVLYTAVQPNEGDNDYIIPLPDPKP EVADEGPLEGSPSLASSTLNEVNTSSTISCDSPLEPQDEPEPEPQLELQVEPEPELEQ LPDSGCPAPRAEAEDSFL" mat_peptide 283..3504 /product="platelet-derived growth factor receptor" BASE COUNT 1178 a 1605 c 1492 g 1152 t ORIGIN 1 tgttctcctg agccttcagg agcctgcacc agtcctgcct gtccttctac tcagctgtta 61 cccactctgg gaccagcagt ctttctgata actgggagag ggcagtaagg aggacttcct 121 ggagggggtg actgtccaga gcctggaact gtgcccacac cagaagccat cagcagcaag 181 gacaccatgc ggcttccggg tgcgatgcca gctctggccc tcaaaggcga gctgctgttg 241 ctgtctctcc tgttacttct ggaaccacag atctctcagg gcctggtcgt cacacccccg 301 gggccagagc ttgtcctcaa tgtctccagc accttcgttc tgacctgctc gggttcagct 361 ccggtggtgt gggaacggat gtcccaggag cccccacagg aaatggccaa ggcccaggat 421 ggcaccttct ccagcgtgct cacactgacc aacctcactg ggctagacac gggagaatac 481 ttttgcaccc acaatgactc ccgtggactg gagaccgatg agcggaaacg gctctacatc 541 tttgtgccag atcccaccgt gggcttcctc cctaatgatg ccgaggaact attcatcttt 601 ctcacggaaa taactgagat caccattcca tgccgagtaa cagacccaca gctggtggtg 661 acactgcacg agaagaaagg ggacgttgca ctgcctgtcc cctatgatca ccaacgtggc 721 ttttctggta tctttgagga cagaagctac atctgcaaaa ccaccattgg ggacagggag 781 gtggattctg atgcctacta tgtctacaga ctccaggtgt catccatcaa cgtctctgtg 841 aacgcagtgc agactgtggt ccgccagggt gagaacatca ccctcatgtg cattgtgatc 901 gggaatgatg tggtcaactt cgagtggaca tacccccgca aagaaagtgg gcggctggtg 961 gagccggtga ctgacttcct cttggatatg ccttaccaca tccgctccat cctgcacatc 1021 cccagtgccg agttagaaga ctcggggacc tacacctgca atgtgacgga gagtgtgaat 1081 gaccatcagg atgaaaaggc catcaacatc accgtggttg agagcggcta cgtgcggctc 1141 ctgggagagg tgggcacact acaatttgct gagctgcatc ggagccggac actgcaggta 1201 gtgttcgagg cctacccacc gcccactgtc ctgtggttca aagacaaccg caccctgggc 1261 gactccagcg ctggcgaaat cgccctgtcc acgcgcaacg tgtcggagac ccggtatgtg 1321 tcagagctga cactggttcg cgtgaaggtg gcagaggctg gccactacac catgcgggcc 1381 ttccatgagg atgctgaggt ccagctctcc ttccagctac agatcaatgt ccctgtccga 1441 gtgctggagc taagtgagag ccaccctgac agtggggaac agacagtccg ctgtcgtggc 1501 cggggcatgc cgcagccgaa catcatctgg tctgcctgca gagacctcaa aaggtgtcca 1561 cgtgagctgc cgcccacgct gctggggaac agttccgaag aggagagcca gctggagact 1621 aacgtgacgt actgggagga ggagcaggag tttgaggtgg tgagcacact gcgtctgcag 1681 cacgtggatc ggccactgtc ggtgcgctgc acgctgcgca acgctgtggg ccaggacacg 1741 caggaggtca tcgtggtgcc acactccttg ccctttaagg tggtggtgat ctcagccatc 1801 ctggccctgg tggtgctcac catcatctcc cttatcatcc tcatcatgct ttggcagaag 1861 aagccacgtt acgagatccg atggaaggtg attgagtctg tgagctctga cggccatgag 1921 tacatctacg tggaccccat gcagctgccc tatgactcca cgtgggagct gccgcgggac 1981 cagcttgtgc tgggacgcac cctcggctct ggggcctttg ggcaggtggt ggaggccaca 2041 gctcatggtc tgagccattc tcaggccacg atgaaagtgg ccgtcaagat gcttaaatcc 2101 acagcccgca gcagtgagaa gcaagccctt atgtcggagc tgaagatcat gagtcacctt 2161 gggccccacc tgaacgtggt caacctgttg ggggcctgca ccaaaggagg acccatctat 2221 atcatcactg agtactgccg ctacggagac ctggtggact acctgcaccg caacaaacac 2281 accttcctgc agcaccactc cgacaagcgc cgcccgccca gcgcggagct ctacagcaat 2341 gctctgcccg ttgggctccc cctgcccagc catgtgtcct tgaccgggga gagcgacggt 2401 ggctacatgg acatgagcaa ggacgagtcg gtggactatg tgcccatgct ggacatgaaa 2461 ggagacgtca aatatgcaga catcgagtcc tccaactaca tggcccctta cgataactac 2521 gttccctctg cccctgagag gacctgccga gcaactttga tcaacgagtc tccagtgcta 2581 agctacatgg acctcgtggg cttcagctac caggtggcca atggcatgga gtttctggcc 2641 tccaagaact gcgtccacag agacctggcg gctaggaacg tgctcatctg tgaaggcaag 2701 ctggtcaaga tctgtgactt tggcctggct cgagacatca tgcgggactc gaattacatc 2761 tccaaaggca gcaccttttt gcctttaaag tggatggctc cggagagcat cttcaacagc 2821 ctctacacca ccctgagcga cgtgtggtcc ttcgggatcc tgctctggga gatcttcacc 2881 ttgggtggca ccccttaccc agagctgccc atgaacgagc agttctacaa tgccatcaaa 2941 cggggttacc gcatggccca gcctgcccat gcctccgacg agatctatga gatcatgcag 3001 aagtgctggg aagagaagtt tgagattcgg ccccccttct cccagctggt gctgcttctc 3061 gagagactgt tgggcgaagg ttacaaaaag aagtaccagc aggtggatga ggagtttctg 3121 aggagtgacc acccagccat ccttcggtcc caggcccgct tgcctgggtt ccatggcctc 3181 cgatctcccc tggacaccag ctccgtcctc tatactgccg tgcagcccaa tgagggtgac 3241 aacgactata tcatccccct gcctgacccc aaacctgagg ttgctgacga gggcccactg 3301 gagggttccc ccagcctagc cagctccacc ctgaatgaag tcaacacctc ctcaaccatc 3361 tcctgtgaca gccccctgga gccccaggac gaaccagagc cagagcccca gcttgagctc 3421 caggtggagc cggagccgga gctggaacag ttgccggatt cggggtgccc tgcgcctcgg 3481 gcggaagcag aggatagctt cctgtagggg gctggcccct accctgccct gcctgaagct 3541 cccccgctgc cagcacccag catctcctgg cctggcctgg ccgggcttcc tgtcagccag 3601 gctgccctta tcagctgtcc ccttctggaa gctttctgct cctgacgtgt tgtgccccaa 3661 accctggggc tggcttagga ggcaagaaaa ctgcaggggc cgtgaccagc cctctgcctc 3721 cagggaggcc aactgactct gagccagggt tcccccaggg aactcagttt tcccatatgt 3781 aagatgggaa agttaggctt gatgacccag aatctaggat tctctccctg gctgacaggt 3841 ggggagaccg aatccctccc tgggaagatt cttggagtta ctgaggtggt aaattaactt 3901 ttttctgttc agccagctac ccctcaagga atcatagctc tctcctcgca cttttatcca 3961 cccaggagct agggaagaga ccctagcctc cctggctgct ggctgagcta gggcctagcc 4021 ttgagcagtg ttgcctcatc cagaagaaag ccagtctcct ccctatgatg ccagtccctg 4081 cgttccctgg cccgagctgg tctggggcca ttaggcagcc taattaatgc tggaggctga 4141 gccaagtaca ggacaccccc agcctgcagc ccttgcccag ggcacttgga gcacacgcag 4201 ccatagcaag tgcctgtgtc cctgtccttc aggcccatca gtcctggggc tttttcttta 4261 tcaccctcag tcttaatcca tccaccagag tctagaaggc cagacgggcc ccgcatctgt 4321 gatgagaatg taaatgtgcc agtgtggagt ggccacgtgt gtgtgccaga tatggccctg 4381 gctctgcatt ggacctgcta tgaggctttg gaggaatccc tcaccctctc tgggcctcag 4441 tttccccttc aaaaaatgaa taagtcggac ttattaactc tgagtgcctt gccagcacta 4501 acattctaga gtatccaggt ggttgcacat ttgtccagat gaagcaaggc catataccct 4561 aaacttccat cctgggggtc agctgggctc ctgggagatt ccagatcaca catcacactc 4621 tggggactca ggaaccatgc cccttcccca ggcccccagc aagtctcaag aacacagctg 4681 cacaggcctt gacttagagt gacagccggt gtcctggaaa gcccccagca gctgccccag 4741 ggacatggga agaccacggg acctctttca ctacccacga tgacctccgg gggtatcctg 4801 ggcaaaaggg acaaagaggg caaatgagat cacctcctgc agcccaccac tccagcacct 4861 gtgccgaggt ctgcgtcgaa gacagaatgg acagtgagga cagttatgtc ttgtaaaaga 4921 caagaagctt cagatgggta ccccaagaag gatgtgagag gtgggcgctt tggaggtttg 4981 cccctcaccc accagctgcc ccatccctga ggcagcgctc catgggggta tggttttgtc 5041 actgcccaga cctagcagtg acatctcatt gtccccagcc cagtgggcat tggaggtgcc 5101 aggggagtca gggttgtagc caagacgccc ccgcacgggg agggttggga agggggtgca 5161 ggaagctcaa cccctctggg caccaaccct gcattgcagg ttggcacctt acttccctgg 5221 gatcccagag ttggtccaag gagggagagt gggttctcaa tacggtacca aagatataat 5281 cacctaggtt tacaaatatt tttaggactc acgttaactc acatttatac agcagaaatg 5341 ctattttgta tgctgttaag tttttctatc tgtgtacttt tttttaaggg aaagatttta 5401 atattaaacc tggtgcttct cactcac // LOCUS HUMPDIR 1693 bp mRNA PRI 17-JUN-1996 DEFINITION Human mRNA for protein disulfide isomerase-related protein (PDIR), complete cds. ACCESSION D49490 NID g1072306 KEYWORDS protein disulfide isomerase-related protein (PDIR). SOURCE Homo sapiens cDNA to mRNA, clone_lib:placental cDNA clone:phPDIR. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1693) AUTHORS Hayano,T. and Kikuchi,M. TITLE Molecular cloning of the cDNA encoding a novel protein disulfide isomerase-related protein (PDIR) JOURNAL FEBS Lett. 372 (2-3), 210-214 (1995) MEDLINE 96000209 REFERENCE 2 (bases 1 to 1693) AUTHORS Hayano,T. TITLE Direct Submission JOURNAL Submitted (03-MAR-1995) to the DDBJ/EMBL/GenBank databases. Toshiya Hayano, Protein Engineering Research Institute, The Third Research Department; 6-2-3 Furuedai, Suita, Osaka 565, Japan (Tel:06-872-8200, Fax:06-872-8210) FEATURES Location/Qualifiers source 1..1693 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="phPDIR" /clone_lib="placental cDNA" 5'UTR 1..56 CDS 57..1616 /note="three thioredoxin-like sequences" /codon_start=1 /product="protein disulfide isomerase-related protein (PDIR)" /db_xref="PID:d1009062" /db_xref="PID:g1072307" /translation="MARAGPAWLLLAIWVVLPSWLSSAKVSSLIERISDPKDLKKLLR TRNNVLVLYSKSEVAAENHLRLLSTVAQAVKGQGTICWVDCGDAESRKLCKKMKVDLS PKDKKVELFHYQDGAFHTEYNRAVTFKSIVAFLKDPKGPPLWEEDPGAKDVVHLDSEK DFRRLLKKEEKPLLIMFYAPWCSMCKRMMPHFQKAATQLRGHAVLAGMNVYSSEFENI KEEYSVRGFPTICYFEKGRFLFQYDNYGSTAEDIVEWLKNPQPPQPQVPETPWADEGG SVYHLTDEDFDQFVKEHSSVLVMFHAPWCGHCKKMKPEFEKAAEALHGEADSSGVLAA VDATVNKALAERFHISEFPTLKYFKNGEKYAVPVLRTKKKFLEWMQNPEAPPPPEPTW EEQQTSVLHLVGDNFRETLKKKKHTLVMFYAPWCPHCKKVIPHFTATADAFKDDRKIA CAAVDCVKDKNQDLCQQEAVKGYPTFHYYHYGKFAEKYDSDRTELGFTNYIRALREGD HERLGKKKEEL" misc_feature 597..614 /note="thioredoxin-like sequence" misc_feature 966..983 /note="thioredoxin-like sequence" misc_feature 1329..1346 /note="thioredoxin-like sequence" misc_feature 1602..1613 /note="endoplasmic reticulum-retention signal" 3'UTR 1617..1693 BASE COUNT 441 a 414 c 470 g 368 t ORIGIN 1 gaattcgggg ggcgccggag tggagaaagg agccagcggt gggcagcgct gctgggatgg 61 cgcgggccgg gccggcgtgg ctgctgctgg caatctgggt ggtcctgcca tcatggctgt 121 cctctgcaaa ggtctcctcg ctcattgaga gaatctctga ccccaaggac ttgaaaaaac 181 tgctcagaac ccggaataat gtactggtgc tttactccaa atctgaggtg gcagctgaaa 241 atcatctcag gttactgtcc acagtggccc aggcggtgaa aggacaaggg accatctgct 301 gggtggactg tggtgatgca gagagtagaa aattgtgcaa gaagatgaaa gttgacctga 361 gcccgaagga caaaaaggtt gaattattcc attaccagga tggtgcattt catactgaat 421 ataaccgagc tgtgacattt aagtccatag tggccttttt gaaggatcca aaagggcccc 481 cactgtggga ggaagatcct ggagccaaag atgttgtcca ccttgacagt gaaaaggact 541 tcagacggct cctgaagaag gaagagaagc cgctcctgat catgttttat gccccctggt 601 gcagcatgtg caagaggatg atgccgcatt tccagaaggc tgcgactcag ctgcgaggcc 661 acgccgtgct ggccgggatg aatgtctact cctctgaatt tgaaaacatc aaggaggagt 721 acagcgtgcg cggcttcccc accatctgct attttgagaa aggacggttc ttgttccagt 781 atgacaacta tgggtccaca gctgaggaca ttgtggagtg gctgaagaat ccgcagccgc 841 cacagcccca ggtccctgag actccctggg cagatgaggg cggctccgtt tatcacctga 901 ccgatgaaga ctttgaccag tttgtgaagg aacactcctc tgtcctcgtc atgttccacg 961 ccccatggtg tggccactgt aagaaaatga agccggagtt tgagaaggca gcagaagccc 1021 tccatggaga agcggatagc tctggtgtcc ttgcagctgt cgatgccact gtcaacaagg 1081 ccctggcaga aagattccac atctcagagt ttcctacgtt gaagtatttt aagaatggag 1141 agaaatacgc agtgcctgtg ctcaggacaa agaagaagtt tctcgagtgg atgcaaaacc 1201 ctgaggcccc cccgccccca gagcccacgt gggaagagca gcagacaagc gtgttgcacc 1261 tggtggggga caacttccgg gagaccctga agaagaagaa acacaccttg gtcatgttct 1321 acgccccttg gtgcccacac tgtaagaagg tcattccgca ctttactgct actgctgatg 1381 ccttcaaaga tgaccgaaag attgcctgtg ccgctgttga ctgtgtcaaa gacaagaacc 1441 aagacctgtg ccagcaggag gcggtcaagg gctaccccac tttccactac taccactatg 1501 ggaagttcgc agaaaagtat gacagcgacc gcacagaatt gggatttacc aattatattc 1561 gagccctccg ggagggagac catgaaagac tagggaaaaa gaaggaagag ttataattcc 1621 tgcctcagaa aaagcttttc cattacactg tgaatgatac ctgttttgtt gtttctgaat 1681 ttcccccgaa ttc // LOCUS HUMPDK1R 1593 bp mRNA PRI 29-NOV-1995 DEFINITION Homo sapiens pyruvate dehydrogenase kinase isoenzyme 1 (PDK1) mRNA, complete cds. ACCESSION L42450 NID g1088280 KEYWORDS isoenzyme 1; pyruvate dehydrogenase kinase. SOURCE Homo sapiens male adult liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1593) AUTHORS Gudi,R., Bowker-Kinley,M.M., Kedishvili,N.Y., Zhao,Y. and Popov,K.M. TITLE Diversity of the pyruvate dehydrogenase kinase gene family in humans JOURNAL J. Biol. Chem. 270 (48), 28989-28994 (1995) MEDLINE 96081973 REMARK Erratum:[[published erratum appears in J Biol Chem 1996 Jan 12;271(2):1250]] FEATURES Location/Qualifiers source 1..1593 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="hepatocyte" /dev_stage="adult" /sex="male" /tissue_type="liver" 5'UTR <1..125 /gene="PDK1" /evidence=experimental gene 1..1593 /gene="PDK1" mRNA <1..1593 /gene="PDK1" CDS 125..1435 /gene="PDK1" /note="isoenzyme 1" /codon_start=1 /product="pyruvate dehydrogenase kinase" /db_xref="PID:g1088281" /translation="MRLARLLRGAALAGPGPGLRAAGFSRSFSSDSGSSPASERGVPG QVDFYARFSPSPLSMKQFLDFGSVNACEKTSFMFLRQELPVRLANIMKEISLLPDNLL RTPSVQLVQSWYIQSLQELLDFKDKSAEDAKAIYDFTDTVIRIRNRHNDVIPTMAQGV IEYKESFGVDPVTSQNVQYFLDRFYMSRISIRMLLNQHSLLFGGKGKGSPSHRKHIGS INPNCNVLEVIKDGYENARRLCDLYYINSPELELEELNAKSPGQPIQVVYVPSHLYHM VFELFKNAMRATMEHHANRGVYPPIQVHVTLGNEDLTVKMSDRGGGVPLRKIDRLFNY MYSTAPRPRVETSRAVPLAGFGYGLPISRLYAQYFQGDLKLYSLEGYGTDAVIYIKAL STDSIERLPVYNKAAWKHYNTNHEADDWCVPSREPKDMTTFRSA" 3'UTR 1436..1593 /gene="PDK1" BASE COUNT 429 a 373 c 379 g 412 t ORIGIN 1 ctctagagga tcccctttat tccccacttt acctggctaa ttgaagtgta acaaaagctt 61 catccaggaa cattggcgcg ggaaacctgg cgtactggct gtggcttctc tagcgggact 121 cggcatgagg ctggcgcggc tgcttcgcgg agccgccttg gccggcccgg gcccggggct 181 gcgcgccgcc ggcttcagcc gcagcttcag ctcggactcg ggctccagcc cggcgtccga 241 gcgcggcgtt ccgggccagg tggacttcta cgcgcgcttc tcgccgtccc cgctctccat 301 gaagcagttc ctggacttcg gatcagtgaa tgcttgtgaa aagacctcat ttatgtttct 361 gcggcaagag ttgcctgtca gactggcaaa tataatgaaa gaaataagtc tccttccaga 421 taatcttctc aggacaccat ccgttcaatt ggtacaaagc tggtatatcc agagtcttca 481 ggagcttctt gattttaagg acaaaagtgc tgaggatgct aaagctattt atgactttac 541 agatactgtg atacggatca gaaaccgaca caatgatgtc attcccacaa tggcccaggg 601 tgtgattgaa tacaaggaga gctttggggt ggatcctgtc accagccaga atgttcagta 661 ctttttggat cgattctaca tgagtcgcat ttcaattaga atgttactca atcagcactc 721 tttattgttt ggtggaaaag gcaaaggaag tccatctcat cgaaaacaca ttggaagcat 781 aaatccaaac tgcaatgtac ttgaagttat taaagatggc tatgaaaatg ctaggcgtct 841 gtgtgatttg tattatatta actctcccga actagaactt gaagaactaa atgcaaaatc 901 accaggacag ccaatacaag tggtttatgt accatcccat ctctatcaca tggtgtttga 961 acttttcaag aatgcaatga gagccactat ggaacaccat gccaacagag gtgtttaccc 1021 ccctattcaa gttcatgtca cgctgggtaa tgaggatttg actgtgaaga tgagtgaccg 1081 aggaggtggc gttcctttga ggaaaattga cagacttttc aactacatgt attcaactgc 1141 accaagacct cgtgttgaga cctcccgcgc agtgcctctg gctggttttg gttatggatt 1201 gcccatatca cgtctttacg cacaatactt ccaaggagac ctgaagctgt attccctaga 1261 gggttacggg acagatgcag ttatctacat taaggctctg tcaacagact caatagaaag 1321 actcccagtg tataacaaag ctgcctggaa gcattacaac accaaccacg aggctgatga 1381 ctggtgcgtc cccagcagag aacccaaaga catgacgacg ttccgcagtg cctagacaca 1441 ctggggacat cggaaaatcc aaatgtggct tttgtattaa atttggaagg tatggtgttc 1501 agaactatat tataccaagt actttattta tcgttttcac aaaactattt gagtagaata 1561 aatggaaact gaattcgagc tcggtacccg ggg // LOCUS HUMPDK2R 1422 bp mRNA PRI 29-NOV-1995 DEFINITION Homo sapiens pyruvate dehydrogenase kinase isoenzyme 2 (PDK2) mRNA, complete cds. ACCESSION L42451 NID g1088282 KEYWORDS isoenzyme 2; pyruvate dehydrogenase kinase. SOURCE Homo sapiens male adult liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1422) AUTHORS Gudi,R., Bowker-Kinley,M.M., Kedishvili,N.Y., Zhao,Y. and Popov,K.M. TITLE Diversity of the pyruvate dehydrogenase kinase gene family in humans JOURNAL J. Biol. Chem. 270 (48), 28989-28994 (1995) MEDLINE 96081973 REMARK Erratum:[[published erratum appears in J Biol Chem 1996 Jan 12;271(2):1250]] FEATURES Location/Qualifiers source 1..1422 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="hepatocyte" /dev_stage="adult" /sex="male" /tissue_type="liver" 5'UTR 1..49 /gene="PDK2" gene 1..1422 /gene="PDK2" mRNA 1..1422 /gene="PDK2" CDS 49..1272 /gene="PDK2" /note="isoenzyme 2" /codon_start=1 /product="pyruvate dehydrogenase kinase" /db_xref="PID:g1088283" /translation="MRWVWALLKNASLAGAPKYIEHFSKFSPSPLSMKQFLDFGSSNA CEKTSFTFLRQELPVRLANIMKEINLLPDRVLSTPTVQLVQSWYVQSLLDIMEFLDKD PEDHRTLSQFTDALVTIRNRHNDVVPTMAQGVLEYKDTYGDDPVSNQNIQYFLDRFYL SRISIRMLINQHTLIFDGSTNPAHPKHIGSIDPNCNVSEVVKDAYDMAKLLCDKYYMA SPDLEIQEINAANSKQPIHMVYVPSHLYHMLFELFKNAMRATVESHESSLILPPIKVM VALGEEDLSIKMSDRGGGVPLRKIERLFSYMYSTAPTPQPGTGGTPLAGFGYGLPISR LYAKYFQGDLQLFSMEGFGTDAVIYLKALSTDSVERLPVYNKSAWRHYQTIQEAGDWC VPSTEPKNTSTYRVT" 3'UTR 1273..1422 /gene="PDK2" BASE COUNT 302 a 473 c 372 g 275 t ORIGIN 1 gtggctgccc gcgcggggac cacaaccaaa gtcgcggccg ccgcagccat gcgctgggtg 61 tgggcgctgc tgaagaatgc gtccctggca ggggcgccca agtacataga gcacttcagc 121 aagttctccc cgtccccgct gtccatgaag cagtttctgg acttcggatc cagcaatgcc 181 tgtgagaaaa cctccttcac cttcctcagg caggagctgc ctgtgcgcct ggccaacatc 241 atgaaagaga tcaacctgct tcccgaccga gtgctgagca cacccaccgt gcagctggtg 301 cagagctggt atgtccagag cctcctggac atcatggagt tcctggacaa ggatcctgag 361 gaccatcgca ccctgagcca gttcactgac gccctggtca ccatccggaa ccggcacaac 421 gacgtggtgc ccaccatggc acaaggcgtg ctcgagtaca aggacaccta cggcgatgac 481 cccgtctcca accagaacat ccagtacttc ctggaccgct tctacctcag ccgcatctcc 541 atccgcatgc tcatcaacca gcacaccctc atctttgatg gcagcaccaa cccagcccat 601 cccaaacaca tcggcagcat cgaccccaac tgcaacgtct ctgaggtggt caaagatgcc 661 tacgacatgg ctaagctcct gtgtgacaag tattacatgg cctcacctga cctggagatc 721 caggagatca atgcagccaa ctccaaacag ccgattcaca tggtctacgt cccctcccac 781 ctctaccaca tgctctttga gctcttcaag aatgccatga gggcgactgt ggaaagccat 841 gagtccagcc tcattctccc acccatcaag gtcatggtgg ccttgggtga ggaagatctg 901 tccatcaaga tgagtgaccg aggtgggggt gttcccttga ggaagattga gcgactcttc 961 agctatatgt actccacagc acccaccccc cagcctggca ccgggggaac gccgctggct 1021 ggctttggtt atgggctccc catttcccgc ctctacgcca agtacttcca gggagacctg 1081 cagctcttct ccatggaagg ctttgggacc gatgctgtca tctatctcaa ggccctgtcc 1141 acggactcgg tggagcgcct gcctgtctac aacaagtcag cctggcgcca ctaccagacc 1201 atccaggagg ccggcgactg gtgtgtgccc agcacggagc ccaagaacac gtccacgtac 1261 cgcgtcacgt aagggccgcc gtgcatctgc acctgagagg acggactgcc gcctctgggt 1321 ccccccaccg tggtgcccct caccatcctc ctgggggagc agggggtggg ttctccctga 1381 tgaccaggtt ctgtctctat ggaagtcact gcggtgaata gg // LOCUS HUMPDK3R 1599 bp mRNA PRI 29-NOV-1995 DEFINITION Homo sapiens pyruvate dehydrogenase kinase isoenzyme 3 (PDK3) mRNA, complete cds. ACCESSION L42452 NID g1088284 KEYWORDS isoenzyme 3; pyruvate dehydrogenase kinase. SOURCE Homo sapiens male adult liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1599) AUTHORS Gudi,R., Bowker-Kinley,M.M., Kedishvili,N.Y., Zhao,Y. and Popov,K.M. TITLE Diversity of the pyruvate dehydrogenase kinase gene family in humans JOURNAL J. Biol. Chem. 270 (48), 28989-28994 (1995) MEDLINE 96081973 REMARK Erratum:[[published erratum appears in J Biol Chem 1996 Jan 12;271(2):1250]] FEATURES Location/Qualifiers source 1..1599 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="hepatocyte" /dev_stage="adult" /sex="male" /tissue_type="liver" 5'UTR 1..23 /gene="PDK3" gene 1..1599 /gene="PDK3" mRNA 1..1599 /gene="PDK3" CDS 23..1243 /gene="PDK3" /note="isoenzyme 3" /codon_start=1 /product="pyruvate dehydrogenase kinase" /db_xref="PID:g1088285" /translation="MRLFRWLLKQPVPKQIERYSRFSPSPLSIKQFLDFGRDNACEKT SYMFLRKELPVRLANTMREVNLLPDNLLNRPSVGLVQSWYMQSFLELLEYENKSPEDP QVLDNFLQVLIKVRNRHNDVVPTMAQGVIEYKEKFGFDPFISTNIQYFLDRFYTNRIS FRMLINQHTLLFGGDTNPVHPKHIGSIDPTCNVADVVKDAYETAKMLCEQYYLVAPEL EVEEFNAKAPDKPIQVVYVPSHLFHMLFELFKNSMRATVELYEDRKEGYPAVKTLVTL GKEDLSIKISDLGGGVPLRKIDRLFNYMYSTAPRPSLEPTRAAPLAGFGYGLPISRLY ARYFQGDLKLYSMEGVGTDAVIYLKALSSESFERLPVFNKSAWRHYKTTPEADDWSNP SSEPRDASKYKAKQ" 3'UTR 1244..1599 /gene="PDK3" BASE COUNT 466 a 326 c 357 g 450 t ORIGIN 1 gtggtgcgcc gcccgggcga ggatgcggct gttccggtgg ctgctgaagc agccggtgcc 61 caagcagatc gagcgctact cgcgcttttc gccgtcgccg ctctccatca aacaattcct 121 ggacttcggg agagataatg catgtgagaa aacttcatat atgtttctac gaaaggaact 181 tcctgtgcgg ctggctaaca caatgagaga agttaatctt ctgccggata atttacttaa 241 ccgcccttca gtgggattgg ttcagagttg gtatatgcag agttttcttg aacttttaga 301 atatgaaaat aagagccctg aggatccaca ggtcttggat aactttctac aagttctgat 361 taaagtcaga aatagacaca atgatgtggt tcctacaatg gcacaaggag tgattgaata 421 caaggagaag tttgggtttg atcctttcat tagcactaac atccaatatt ttctggatcg 481 gttttatacc aaccgcatct ctttccgcat gcttattaat cagcacacac ttctgtttgg 541 gggtgacact aatcctgttc atcctaaaca cataggaagt atcgatccca cctgtaacgt 601 ggcggatgtg gtgaaagatg catatgaaac agccaagatg ctgtgtgaac agtattacct 661 ggtagctcca gagctggaag ttgaagaatt caatgccaaa gcgccagaca aacctattca 721 ggtggtttat gtgccctcac atctgtttca tatgctattt gagttgttca agaactcaat 781 gagagcgaca gttgaactct atgaagacag aaaagagggc taccctgctg ttaaaaccct 841 cgttactttg ggtaaagaag acttatccat taagatcagt gacctaggtg gtggtgtccc 901 acttcgaaaa atagatcgtc tttttaacta catgtattct actgctccta ggcccagcct 961 ggagcctacc agagctgccc ctttggctgg atttggttat ggtttgccaa tttcccgtct 1021 gtatgctaga tattttcaag gagatctgaa actgtattcc atggaaggag tgggtactga 1081 tgctgtcatt tatttgaagg ctctttcaag tgagtcattt gagagacttc cagtttttaa 1141 taagtccgca tggcgccatt acaagaccac gcctgaagcc gatgattgga gcaatcccag 1201 cagtgaaccc agggatgctt caaaatacaa agcaaaacag taatatacca ccttgatttc 1261 cattacaaag tatctgattt gtctgaataa aggtgtccca ctcactgttc caggaattct 1321 tgcagtgtag aggtattcac aacagcaagc agggatttgg cctgccatca attttattta 1381 aaaagcaatt aagtttgcag tttgtcctca taaacgttgt aggttgaaac tgaaaaataa 1441 gttaaaatga tcaatcaaga taaaaggtga tctatccatc caatggaata ttatttggcc 1501 ttaaaaagga aggacatctg atatgtgctg caacatgggt gaatcttgaa gtcattatgc 1561 taagtgaaat aaaccagaca taaaaggaca aatacatcg // LOCUS HUMPEMP 1989 bp mRNA PRI 07-JAN-1995 DEFINITION Human palmitoylated erythrocyte membrane protein (MPP1) mRNA, complete cds. ACCESSION M64925 NID g189785 KEYWORDS MPP1 gene; erythrocyte protein 55; palmitoylated erythrocyte membrane protein. SOURCE Human reticulocyte, cDNA to mRNA, clone 2. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1989) AUTHORS Ruff,P., Speicher,D.W. and Husain-Chishti,A. TITLE Molecular identification of a major palmitoylated erythrocyte membrane protein containing the src homology 3 motif JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88 (15), 6595-6599 (1991) MEDLINE 91319732 FEATURES Location/Qualifiers source 1..1989 /organism="Homo sapiens" /sub_species="sapiens" /db_xref="taxon:9606" /cell_type="blood" /tissue_type="erythrocyte" /map="Unassigned" gene 104..1504 /gene="MPP1" CDS 104..1504 /gene="MPP1" /codon_start=1 /db_xref="GDB:G00-131-663" /product="erythrocyte p55" /db_xref="PID:g189786" /translation="MTLKASEGESGGSMHTALSDLYLEHLLQKRSRPEAVSHPLNTVT EDMYTNGSPAPGSPAQVKGQEVRKVRLIQFEKVTEEPMGITLKLNEKQSCTVARILHG GMIHRQGSLHVGDEILEINGTNVTNHSVDQLQKAMKETKGMISLKVIPNQQSRLPALQ MFMRAQFDYDPKKDNLIPCKEAGLKFATGDIIQIINKDDSNWWQGRVEGSSKESAGLI PSPELQEWRVASMAQSAPSEAPSCSPFGKKKKYKDKYLAKHSSIFDQLDVVSYEEVVR LPAFKRKTLVLIGASGVGRSHIKNALLSQNPEKFVYPVPYTTRPPRKSEEDGKEYHFI STEEMTRNISANEFLEFGSYQGNMFGTKFETVHQIHKQNKIAILDIEPQTLKIVRTAE LSPFIVFIAPTDQGTQTEALQQLQKDSEAIRSQYAHYFDLSLVNNGVDETLKKLQEAF DQACSSPQWVPVSWVY" BASE COUNT 526 a 500 c 505 g 458 t ORIGIN 1 agccgcaccg cgtctcccgc cttctccgca gccccgcagg ccccgggccc tgtcattccc 61 agcgctgccc tgtcttgcgt tccagtgttc cagcttctgc gagatgaccc tcaaggcgag 121 cgagggcgag agtgggggca gcatgcacac ggcgctctcc gacctctacc tggagcattt 181 gctgcagaag cgtagtcggc cagaggctgt atcgcatcca ttgaatactg tgaccgagga 241 catgtacacc aacgggtctc ctgccccagg tagccctgcc caggtcaagg gacaggaggt 301 gcggaaagtg cgactcatac agtttgagaa ggtcacagaa gagcccatgg gaatcacgct 361 gaagctgaat gaaaaacagt cctgtacggt ggccagaatt cttcatggtg gcatgatcca 421 tagacaaggc tcccttcacg tgggggatga gatcctagaa atcaatggca caaatgtgac 481 aaatcattca gtggatcagc tgcagaaggc gatgaaagaa accaaaggaa tgatctcatt 541 aaaagtaatt cccaaccagc aaagccgtct tcctgcacta cagatgttca tgagagcgca 601 gtttgactat gatcccaaaa aggacaatct gatcccttgc aaggaggcgg gactgaagtt 661 tgctactggg gacattatcc agattatcaa caaggatgac agcaattggt ggcagggacg 721 ggtggaaggc tcctccaagg agtcagcagg attgatccct tcccctgagc tgcaggaatg 781 gcgagtggca agtatggctc agtcagctcc tagcgaagcc ccgagctgca gtccctttgg 841 gaagaagaag aagtacaaag acaaatatct ggccaagcac agctcgattt ttgatcagtt 901 ggatgttgtt tcctacgagg aagtcgttcg gctccctgca ttcaagagga agaccctggt 961 gctgatcgga gccagtgggg tgggtcgcag ccacattaag aatgccctgc tcagccagaa 1021 tccggagaag tttgtgtacc ctgtcccata tacaacacgg ccgccaagga agagtgagga 1081 agatgggaag gagtaccact ttatctcaac ggaggagatg acgaggaaca tctctgccaa 1141 tgagttcttg gagtttggca gctaccaagg caacatgttt ggcaccaaat ttgaaacagt 1201 gcaccagatc cataagcaga acaagattgc catccttgac attgagcccc agaccctgaa 1261 aattgttcgg acagcagaac tttcgccttt cattgtgttc attgcaccta ctgaccaggg 1321 cactcagaca gaagccctgc agcagctgca gaaggactct gaggccatcc gcagccagta 1381 cgctcactac tttgacctct cactggtcaa taatggtgtt gatgaaaccc ttaagaaatt 1441 acaagaagcc ttcgaccaag cgtgcagttc tccacagtgg gtgcctgtct cctgggttta 1501 ctaagcttgt agaatggggg aacccactgt atgcccctct ccagcatttg gaattccacc 1561 cgccttgctt taagacaaac agggctgctc caactagttt tgtgtcagct tccagctctc 1621 tgcagctatc ctaattcagc cagtaaggtt cagtcttctt gctcaggctc ctgaagggtt 1681 gattctcctg atagatgggg ccccactgat ctggatttga aaaggatttc tagaaattgg 1741 gggtaagaag tactaccaaa atgtaactgc taatcaaggg tgatgcacag caaaagcaat 1801 ggaccccatc cctctaaagc ctgccctcct ttgccttcaa ctgtatatgc tgggtatttc 1861 atttgtcttt ttattttgga gaaagcgttt ttaactgcaa ctttctataa tgccaaaatg 1921 acacatctgt gcaatagaat gatgtctgct ctagggaaac cttcaaaagc aataaaaatg 1981 ctgtgttgg // LOCUS HUMPEPD 1888 bp mRNA PRI 07-JAN-1995 DEFINITION Human prolidase (imidodipeptidase) mRNA, complete cds. ACCESSION J04605 NID g189841 KEYWORDS imidodipeptidase; peptidase; prolidase. SOURCE Human liver and placenta, cDNA to mRNA, clones PL[1,21] and PP[1,6]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1888) AUTHORS Endo,F., Tanoue,A., Nakai,H., Hata,A., Indo,Y., Titani,K. and Matsuda,I. TITLE Primary structure and gene localization of human prolidase JOURNAL J. Biol. Chem. 264 (8), 4476-4481 (1989) MEDLINE 89174701 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by F.Endo, 01-MAR-1989. FEATURES Location/Qualifiers source 1..1888 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver; placenta" /map="19q12-q13.2" mRNA <1..1888 /note="PEPD mRNA" gene 17..1498 /gene="PEPD" CDS 17..1498 /gene="PEPD" /EC_number="3.4.13.9" /codon_start=1 /db_xref="GDB:G00-120-273" /product="prolidase" /db_xref="PID:g189842" /translation="MAAATGPSFWLGNETLKVPLALFALNRQRLCERLRKNPAVQAGS IVVLQGGEETQRYCTDTGVLFLQESFFHWAFGVTEPGCYGVIDVDTGKSTLFVPRLPA SHATWMGKIHSKEHFKEKYAVDDVQYVDEIASVLTSQKPSVLLTLRGVNTDSGSVCRE ASFDGISKFEVNNTILHPEIVESRVFKTDMELEVLRYTNKISSEAHREVMKAVKVGMK EYGLESLFEHYCYSRGGMRHSSYTCICGSGENSAVLHYGHAGAPNDRTIQNGDMCLFD MGGEYYSVASDITCSFPRNGKFTADQKAVYEAVLLSSRAVMGAMKPGDWWPDIDRLAD RIHLEELAHMGILSGSVDAMVQAHLGAVFMPHGLGHFLGIDVHDVGGYPEGVERIDEP GLRSLRTARHLQPGMVLTVEPGIYFIDHLLDEALADPARASFLNREVLQRFRGFGGVR IEEDVVVIDSGIELLTCVPRTVEEIEACMAGCDKAFTPFSGPK" BASE COUNT 395 a 539 c 544 g 410 t ORIGIN Chromosome 19q12-q13.2. 1 ccggtgccgg gcgaacatgg cggcggccac cggaccctcg ttttggctgg ggaatgaaac 61 cctgaaggtg ccgctggcgc tctttgcctt gaaccggcag cgcctgtgtg agcggctgcg 121 gaagaaccct gctgtgcagg ccggctccat cgtggtcctg cagggcgggg aggagactca 181 gcgctactgc accgacaccg gggtcctctt cctccaggag tccttctttc actgggcgtt 241 cggtgtcact gagccaggct gctatggtgt catcgatgtt gacactggga agtcgaccct 301 gtttgtgccc aggcttcctg ccagccatgc cacctggatg ggaaagatcc attccaagga 361 gcacttcaag gagaagtatg ccgtggacga cgtccagtac gtagatgaga ttgccagcgt 421 cctgacgtca cagaagccct ctgtcctcct cactttgcgt ggcgtcaaca cggacagcgg 481 cagtgtctgc agggaggcct cctttgacgg catcagcaag ttcgaagtca acaataccat 541 tcttcaccca gagatcgttg agagccgagt gtttaagacg gatatggagc tggaggttct 601 gcgctatacc aataaaatct ccagcgaggc ccaccgtgag gtaatgaagg ctgtaaaagt 661 gggaatgaaa gaatatgggt tggaaagcct cttcgagcac tactgctact cccggggcgg 721 catgcgccac agctcctaca cctgcatctg cggcagtggt gagaactcag ccgtgctaca 781 ctacggacac gccggagctc ccaacgaccg aacgatccag aatggggata tgtgcctgtt 841 cgacatgggc ggtgagtatt actctgtcgc ttccgacatc acctgctcct ttccccgcaa 901 cggcaagttc actgcagacc agaaggccgt ctatgaggca gtgctgctga gctcccgtgc 961 cgtcatgggt gccatgaagc caggtgactg gtggcctgac atcgaccgcc tggctgaccg 1021 catccacctg gaggagctgg cccacatggg catcctgagc ggcagcgtgg acgccatggt 1081 ccaggctcac ctgggggccg tgtttatgcc tcacgggctt ggccacttcc tgggcattga 1141 cgtgcacgac gtgggaggct acccagaggg cgtggagcgc atcgacgagc ccggcctgcg 1201 gagcctgcgc actgcacggc acctgcagcc aggcatggtg ctcaccgtgg agccgggcat 1261 ctacttcatc gaccacctcc tggatgaggc cctggcggac ccggcccgcg cctccttcct 1321 taaccgcgag gtcctgcagc gctttcgcgg ttttggcggg gtccgcatcg aggaggacgt 1381 cgtggtgatc gacagcggca tagagctgct gacctgcgtg ccccgcactg tggaagagat 1441 tgaagcatgc atggcaggct gtgacaaggc ctttaccccc ttctctggcc ccaagtagag 1501 ccagccagaa atcccagcgc acctgggggc ctggccttgc aacctctttt cgtgatgggc 1561 agcctgctgg tcagcactcc agtagcgaga gacggcaccc agaatcagat cccagcttcg 1621 gcatttgatc agaccaaaca gtgctgtttc ccggggagga aacacttttt taattaccct 1681 tttgcaggca ccacctttaa tctgttttat accttgctta ttaaatgagc gacttaaaat 1741 gattgaaaat aatgctgtcc tttagtagca agtaaaatgt gtcttgctgt catttatatt 1801 ccttttccca ggaaagaagc atttctgata ctttctgtca aaaatcaata tgcagaatgg 1861 catttgcaat aaaaggtttc ctaaaatg // LOCUS HUMPERAF1A 1630 bp mRNA PRI 07-JAN-1995 DEFINITION H.sapiens peroxisome assembly factor-1 mRNA, complete cds. ACCESSION M86852 NID g189848 KEYWORDS peroxisome assembly factor-1. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1630) AUTHORS Shimozawa,N., Tsukamoto,T., Suzuki,Y., Orii,T., Shirayoshi,Y., Mori,T. and Fujiki,Y. TITLE A human gene responsible for Zellweger syndrome that affects peroxisome assembly JOURNAL Science 255 (5048), 1132-1134 (1992) MEDLINE 92188187 FEATURES Location/Qualifiers source 1..1630 /organism="Homo sapiens" /db_xref="taxon:9606" gene 374..1614 /gene="PAF-1" CDS 374..1291 /gene="PAF-1" /codon_start=1 /product="peroxisome assembly factor-1" /db_xref="PID:g189849" /translation="MASRKENAKSANRVLRISQLDALELNKALEQLVWSQFTQCFHGF KPGLLARFEPEVKACLWVFLWRFTIYSKNATVGQSVLNIKYKNDFSPNLRYQPPSKNQ KIWYAVCTIGGRWLEERCYDLFRNHHLASFGKVKQCVNFVIGLLKLGGLINFLIFLQR GKFATLTERLLGIHSVFCKPQNIREVGFEYMNRELLWHGFAEFLIFLLPLINVQKLKA KLSSWCIPLTGAPNSDNTLATSGKECALCGEWPTMPHTIGCEHIFCYFCAKSSFLFDV YFTCPKCGTEVHSLQPLKSGIEMSEVNAL" polyA_signal 1609..1614 /gene="PAF-1" /note="this polyA_signal is atypical" BASE COUNT 457 a 315 c 355 g 503 t ORIGIN 1 ccatccagtg ccttccgcag cgccgctaaa gcgcagttct cgttggtgta actttttctt 61 ttttttttca gccacttccg gctcctgcgt cgctccggaa gcctgcgagt tccggaagcc 121 ttggtaatcc agattcggct aggaaaagac aagctttcca gagaatgttt cagagaaagt 181 tacgtggagc gtgggcgttt cgcagactcc taagtggtct ggaacctaac ctgcagtgtc 241 tctgagcttc tggcaagtat gaggcaagga tttcacagaa gaacttggag caaaaattcc 301 cactgtcata gctataggct gtccagtctt gagggagctg caggaacagg aaaaagagac 361 cttcagagaa gacatggctt ccagaaaaga gaatgcgaag agtgcaaaca gagtgctaag 421 aataagccag ttggatgcac ttgaactaaa caaggccctg gagcagctag tttggtccca 481 gtttactcag tgctttcatg gatttaaacc tgggctgtta gctcgctttg agccagaggt 541 gaaagcgtgc ttatgggttt tcttgtggag attcaccatc tactccaaaa atgccacagt 601 gggacagtca gttttgaata ttaagtacaa aaatgatttt tcccctaacc tgagatatca 661 gccacccagt aaaaatcaaa aaatctggta tgctgtttgt acaattggtg gcaggtggtt 721 agaagaacga tgctatgatt tgtttcgaaa ccatcattta gcatcatttg ggaaagtcaa 781 gcagtgtgtg aattttgtga ttggactttt gaaattaggt gggctgatta attttttgat 841 tttccttcag aggggaaagt ttgcaacttt gacagaacgt ctcctaggta ttcattctgt 901 attttgcaag cctcaaaaca tacgtgaagt tggctttgaa tacatgaata gggaacttct 961 ctggcatggt tttgctgaat ttctgatttt tctcttacca cttatcaatg tccagaagtt 1021 gaaagccaag ctgtcttcat ggtgtattcc tcttactggt gcacctaata gtgacaatac 1081 attagccacc agtggcaaag aatgcgctct atgtggagag tggcccacca tgcctcacac 1141 cataggatgt gagcatattt tctgttattt ctgtgctaag agtagtttct tatttgacgt 1201 gtactttact tgtcctaagt gtggcacaga agtacacagt ctgcagccac tgaaatcagg 1261 aatcgagatg tcagaagtaa atgctcttta gaaactaaaa ttgcttcctt tgaggaaaaa 1321 aatgcaccgt gtttaaattc ttaatattag tcatcctaag tataccattt atgtatcctt 1381 tataaggaat gtgctcctag ccactgtctt ctcctttcca ggcatgactg aaatctaatc 1441 actggaaacc atgtgattct aaatatatta tgtaaatgtt aatgtattat gttttttaaa 1501 tcattgcatt caatttttaa tgtcaagaat aatggacagc ttttgtcagg tgactactaa 1561 caatgctcct tcattttact acttcttaaa acaaggttgg attctaaaga taaagatttt 1621 ggagactctg // LOCUS HUMPERE 1958 bp mRNA PRI 03-MAY-1994 DEFINITION Homo sapiens prostaglandin E2 receptor EP2 subtype mRNA, complete cds. ACCESSION L28175 NID g452495 KEYWORDS prostaglandin E2 receptor EP2 subtype. SOURCE Homo sapiens (library: lambda gt10) male lung cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1958) AUTHORS Bastien,L., Sawyer,N., Grygorczk,R., Metters,K.M. and Adam,M.A. TITLE Cloning, functional expression and characterization of the human Prostaglandin E2 receptor EP2 subtype JOURNAL J. Biol. Chem. 269, 11873-11877 (1994) MEDLINE 94216291 FEATURES Location/Qualifiers source 1..1958 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /tissue_type="lung" /tissue_lib="lambda gt10" 5'UTR <1..388 CDS 389..1855 /codon_start=1 /product="prostaglandin E2 receptor EP2 subtype" /db_xref="PID:g452496" /translation="MSTPGVNSSASLSPDRLNSPVTIPAVMFIFGVVGNLVAIVVLCK SRKEQKETTFYTLVCGLAVTDLLGTLLVSPVTIATYMKGQWPGGQPLCEYSTFILLFF SLSGLSIICAMSVERYLAINHAYFYSHYVDKRLAGLTLFAVYASNVLFCALPNMGLGS SRLQYPDTWCFIDWTTNVTAHAAYSYMYAGFSSFLILATVLCNVLVCGALLRMHRQFM RRTSLGTEQHHAAAAASVASRGHPAASPALPRLSDFRRRRSFRRIAGAEIQMVILLIA TSLVVLICSIPLVVRVFVNQLYQPSLEREVSKNPDLQAIRIASVNPILDPWIYILLRK TVLSKAIEKIKCLFCRIGGSRRERSGQHCSDSQRTSSAMSGHSRSFISRELKEISSTS QTLLPDLSLPDLSENGLGGRNLLPGVPGMGLAQEDTTSLRTLRISETSDSSQGQDSES VLLVDEAGGSGRAGPAPKGSSLQVTFPSETLNLSEKCI" 3'UTR 1856..>1958 BASE COUNT 411 a 615 c 521 g 411 t ORIGIN 1 cggcacagcc tcacacctga acgctgtcct cccgcagacg agaccggcgg gcactgcaaa 61 gctgggactc gtctttgaag gaaaaaaaat agcgagtaag aaatccagca ccattcttca 121 ctgacccatc ccgctgcacc tcttgtttcc caagtttttg aaagctggca actctgacct 181 cggtgtccaa aaatcgacag ccactgagac cggctttgag aagccgaaga tttggcagtt 241 tccagactga gcaggacaag gtgaaagcag gttggaggcg ggtccaggac atctgagggc 301 tgaccctggg ggctcgtgag gctgccaccg ctgctgccgc tacagaccca gccttgcact 361 ccaaggctgc gcaccgccag ccactatcat gtccactccc ggggtcaatt cgtccgcctc 421 cttgagcccc gaccggctga acagcccagt gaccatcccg gcggtgatgt tcatcttcgg 481 ggtggtgggc aacctggtgg ccatcgtggt gctgtgcaag tcgcgcaagg agcagaagga 541 gacgaccttc tacacgctgg tatgtgggct ggctgtcacc gacctgttgg gcactttgtt 601 ggtgagcccg gtgaccatcg ccacgtacat gaagggccaa tggcccgggg gccagccgct 661 gtgcgagtac agcaccttca ttctgctctt cttcagcctg tccggcctca gcatcatctg 721 cgccatgagt gtcgagcgct acctggccat caaccatgcc tatttctaca gccactacgt 781 ggacaagcga ttggcgggcc tcacgctctt tgcagtctat gcgtccaacg tgctcttttg 841 cgcgctgccc aacatgggtc tcggtagctc gcggctgcag tacccagaca cctggtgctt 901 catcgactgg accaccaacg tgacggcgca cgccgcctac tcctacatgt acgcgggctt 961 cagctccttc ctcattctcg ccaccgtcct ctgcaacgtg cttgtgtgcg gcgcgctgct 1021 ccgcatgcac cgccagttca tgcgccgcac ctcgctgggc accgagcagc accacgcggc 1081 cgcggccgcc tcggttgcct cccggggcca ccccgctgcc tccccagcct tgccgcgcct 1141 cagcgacttt cggcgccgcc ggagcttccg ccgcatcgcg ggcgccgaga tccagatggt 1201 catcttactc attgccacct ccctggtggt gctcatctgc tccatcccgc tcgtggtgcg 1261 agtattcgtc aaccagttat atcagccaag tttggagcga gaagtcagta aaaatccaga 1321 tttgcaggcc atccgaattg cttctgtgaa ccccatccta gacccctgga tatatatcct 1381 cctgagaaag acagtgctca gtaaagcaat agagaagatc aaatgcctct tctgccgcat 1441 tggcgggtcc cgcagggagc gctccggaca gcactgctca gacagtcaaa ggacatcttc 1501 tgccatgtca ggccactctc gctccttcat ctcccgggag ctgaaggaga tcagcagtac 1561 atctcagacc ctcctgccag acctctcact gccagacctc agtgaaaatg gccttggagg 1621 caggaatttg cttccaggtg tgcctggcat gggcctggcc caggaagaca ccacctcact 1681 gaggactttg cgaatatcag agacctcaga ctcttcacag ggtcaggact cagagagtgt 1741 cttactggtg gatgaggctg gtgggagcgg cagggctggg cctgccccta aggggagctc 1801 cctgcaagtc acatttccca gtgaaacact gaacttatca gaaaaatgta tataataggc 1861 aaggaaagaa atacagtact gtttctggac ccttataaaa tcctgtgcaa tagacacata 1921 catgtcacat ttagctgtgc tcagaagggc tatcatca // LOCUS HUMPF2AR 2494 bp mRNA PRI 18-APR-1994 DEFINITION Homo sapiens prostanoid FP receptor mRNA, complete cds. ACCESSION L24470 NID g456563 KEYWORDS prostaglandin F2a receptor; prostanoid FP receptor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2494) AUTHORS Abramovitz,M., Boie,Y., Nguyen,T., Rushmore,T.H., Bayne,M.A., Metters,K.M., Slipetz,D.M. and Grygorczyk,R. TITLE Cloning and expression of a cDNA for the human prostanoid FP receptor JOURNAL J. Biol. Chem. 269, 2632-2636 (1994) MEDLINE 94132028 FEATURES Location/Qualifiers source 1..2494 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="female" /tissue_type="uterus" /tissue_lib="lambda gt10; Clontech" CDS 238..1317 /codon_start=1 /product="prostanoid FP receptor" /db_xref="PID:g456564" /translation="MSMNNSKQLVSPAAALLSNTTCQTENRLSVFFSVIFMTVGILSN SLAIAILMKAYQRFRQKSKASFLLLASGLVITDFFGHLINGAIAVFVYASDKEWIRFD QSNVLCSIFGICMVFSGLCPLLLGSVMAIERCIGVTKPIFHSTKITSKHVKMMLSGVC LFAVFIALLPILGHRDYKIQASRTWCFYNTEDIKDWEDRFYLLLFSFLGLLALGVSLL CNAITGITLLRVKFKSQQHRQGRSHHLEMVIQLLAIMCVSCICWSPFLVTMANIGING NHSLETCETTLFALRMATWNQILDPWVYILLRKAVLKNLYKLASQCCGVHVISLHIWE LSSIKNSLKVAAISESPVAEKSAST" BASE COUNT 707 a 501 c 525 g 761 t ORIGIN 1 gtgcgcggag gggacgagcg gctggaccac agccggcgcc cgatcaggat ctccgcgctg 61 ggatcggtgg aacttgaggc agcggcggcg cggggcgcca tggcacaccg agcggctccg 121 tcttctgctc ctcagagagc ccggctggcg gcctgggatg acaagatgtc tggactgcaa 181 tcctgcacag ttttgagagg gagatgactt gagtggttgg cttttatctc cacaacaatg 241 tccatgaaca attccaaaca gctagtgtct cctgcagctg cgcttctttc aaacacaacc 301 tgccagacgg aaaaccggct ttccgtattt ttttcagtaa tcttcatgac agtgggaatc 361 ttgtcaaaca gccttgccat cgccattctc atgaaggcat atcagagatt tagacagaag 421 tccaaggcat cgtttctgct tttggccagc ggcctggtaa tcactgattt ctttggccat 481 ctcatcaatg gagccatagc agtatttgta tatgcttctg ataaagaatg gatccgcttt 541 gaccaatcaa atgtcctttg cagtattttt ggtatctgca tggtgttttc tggtctgtgc 601 ccacttcttc taggcagtgt gatggccatt gagcggtgta ttggagtcac aaaaccaata 661 tttcattcta cgaaaattac atccaaacat gtgaaaatga tgttaagtgg tgtgtgcttg 721 tttgctgttt tcatagcttt gctgcccatc cttggacatc gagactataa aattcaggcg 781 tcgaggacct ggtgtttcta caacacagaa gacatcaaag actgggaaga tagattttat 841 cttctacttt tttcttttct ggggctctta gcccttggtg tttcattgtt gtgcaatgca 901 atcacaggaa ttacactttt aagagttaaa tttaaaagtc agcagcacag acaaggcaga 961 tctcatcatt tggaaatggt aatccagctc ctggcgataa tgtgtgtctc ctgtatttgt 1021 tggagcccat ttctggttac aatggccaac attggaataa atggaaatca ttctctggaa 1081 acctgtgaaa caacactttt tgctctccga atggcaacat ggaatcaaat cttagatcct 1141 tgggtatata ttcttctacg aaaggctgtc cttaagaatc tctataagct tgccagtcaa 1201 tgctgtggag tgcatgtcat cagcttacat atttgggagc ttagttccat taaaaattcc 1261 ttaaaggttg ctgctatttc tgagtcacca gttgcagaga aatcagcaag cacctagctt 1321 aataggacag taaatctgtg tggggctaga acaaaaatta agacatgttt ggcaatattt 1381 cagttagtta aatacctgta gcctaactgg aaaattcagg cttcatcatg tagtttgaag 1441 atactattgt cagattcagg ttttgaaatt tgtcaaataa acaggataac tgtacatttt 1501 caacttgttt ttgccaatgg gaggtagaca caataaaata atgccatggg agtcacactg 1561 aaagcaattt tgagcttatc tgtcttattt atgctttgag tgaatcatct gttgaggtct 1621 aatgcctcta cttggcctat ttgccagaga acatcttaat gcagcctgca tagtgaaatg 1681 gttattttga gatcaccgct ctgtagctaa cccttataaa ctaggctcag taaaataaag 1741 cactcttatt ttttgatctg gcctattttg cccctcattg tgtagcctca attaacacat 1801 gcatggtcat gacacccaga attcatgatg gtttgttata acaacctctg catattccag 1861 gtctggcaga caggttgcct gaccctgcaa tcctatctag aatgggccca ttcttgtcac 1921 atttgacaaa taggactgcc tacatttatt attatgaagg tcgattgttg ttggaagtgt 1981 tttttcatgt catagattag caattttcaa ataattattt tttctctgaa aattttgtgt 2041 gtgattgcac aataaataat ttttagagaa acaaaggctc tttctcagca cattgatggg 2101 caactagaat tacagcagtt tcaaactcta ccatggataa tgcaaacaaa ccgaagctac 2161 atgccaatga taggtgcaaa gaatattggc aaaaggtgct ttaccttgag ccattatttg 2221 tgtcagagaa caaaagaaac agaatcaata tataaattca aagactatct gcagctagtg 2281 tgtttcttct ttacacacat atacacacag acatcagaaa attctgttga gagcaggttc 2341 attaaatttg taagatggca tattctaaag cctgtgctac cagtactaag aggggaagac 2401 tggcaatttg ccaagcactt ggggattatt ataacaatta actaggagat caagagataa 2461 taatctctcc ccaaattttc caataataat tgag // LOCUS HUMPGAM 1709 bp mRNA PRI 07-JAN-1995 DEFINITION Homo sapiens phosphoglycerate mutase (PGAM-B) mRNA, complete cds. ACCESSION J04173 NID g551173 KEYWORDS phosphoglycerate mutase. SOURCE Homo sapiens adult liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1709) AUTHORS Sakoda,S., Shanske,S., DiMauro,S. and Schon,E.A. TITLE Isolation of a cDNA encoding the B isozyme of human phosphoglycerate mutase (PGAM) and characterization of the PGAM gene family JOURNAL J. Biol. Chem. 263 (32), 16899-16905 (1988) MEDLINE 89034186 FEATURES Location/Qualifiers source 1..1709 /organism="Homo sapiens" /note="(vector lambda-gt11)" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="liver" /map="7p13-p12" gene 32..1694 /gene="PGAM2" CDS 32..796 /gene="PGAM2" /codon_start=1 /db_xref="GDB:G00-120-280" /product="phosphoglycerate mutase 2" /db_xref="PID:g551174" /translation="MAAYKLVLIRHGESAWNLENRFSGWYDADLSPAGHEEAKRGGQA LRDAGYEFDICFTSVQKRAIRTLWTVLDAIDQMWLPVVRTWRLNERHYGGLTGLNKAE TAAKHGEAQVKIWRRSYDVPPPPMEPDHPFYSNISKDRRYADLTEDQLPSCESLKDTI ARALPFWNEEIVPQIKEGKRVLIAAHGNSLRGIVKHLEGLSEEAIMELNLPTGIPIVY ELDKNLKPIKPMQFLGDEETVRKAMEAVAAQGKAKK" polyA_signal 1689..1694 /gene="PGAM2" /note="G00-120-280" polyA_site 1709 /gene="PGAM2" /note="G00-120-280" BASE COUNT 402 a 438 c 450 g 419 t ORIGIN 1 gggcggggtg ccgcatcccc agcccgccgc catggccgcc tacaaactgg tgctgatccg 61 gcacggcgag agcgcatgga acctggagaa ccgcttcagc ggctggtacg acgccgacct 121 gagcccggcg ggccacgagg aggcgaagcg cggcgggcag gcgctacgag atgctggcta 181 tgagtttgac atctgcttca cctcagtgca gaagagagcg atccggaccc tctggacagt 241 gctagatgcc attgatcaga tgtggctgcc agtggtgagg acttggcgcc tcaatgagcg 301 gcactatggg ggtctaaccg gtctcaataa agcagaaact gctgcaaagc atggtgaggc 361 ccaggtgaag atctggaggc gctcctatga tgtcccacca cctccgatgg agcccgacca 421 tcctttctac agcaacatca gtaaggatcg caggtatgca gacctcacag aagatcagct 481 accctcctgt gagagtctga aggatactat tgccagagct ctgcccttct ggaatgaaga 541 aatagttccc cagatcaagg aggggaaacg tgtactgatt gcagcccatg gcaacagcct 601 ccggggcatt gtcaagcatc tggagggtct ctctgaagag gctatcatgg agctgaacct 661 gccgactggt attcccattg tctatgaatt ggacaagaac ttgaagccta tcaagcccat 721 gcagtttctg ggggatgaag agacggtgcg caaagccatg gaagctgtgg ctgcccaggg 781 caaggccaag aagtgaaggc cggcggggag gatactgtcc ccaggagcac cctccctgcc 841 cgtcttgtcc ctctgcccct cccacctgca catgtcacac tgaccacatc tgtagacatc 901 ttgagttgta gctgcagacg gggaccagtg gctcccattt tcattttagc cattttgtcg 961 cctgcaccca ctcccttcat acaatctagt cagaatagca gttctagagc acaggttctc 1021 agtctaagct atggaaaagc tccccttatc caacagagtt taaaagtagt gacttgggtt 1081 tttgcgagtg ctttgtttac taaggacttt ggggaggaac catgctaagc catgaccagt 1141 gaggagaagc aacagagcct gtctgtcccc atgagcggag tctgtcctct gctcttctgc 1201 agtcaggtca ctgcctactg cctgggggct ctagtcattc cagtggaaga cgaatgtaac 1261 ctgcgtggtg atgtgacaac tgtttcctcc ctgaccccag aggatctggc tctaggttgg 1321 gatcaatcct gaatttcgtt atgtgttaat ttacttttat taaaaaagta tagtatatat 1381 aatacaaaac aataaccctt ctggggtttc ttgtggcggt tgaaatagtc ccacatgtgg 1441 tcatcagaaa tagcattcct cataccaata taggatcagc tccttgacct ctgaggggtc 1501 aggagtgctt cctggtgtgt gtattagaat cccttcctgc cttgtttcat ggcagtgaaa 1561 tgcctcttgg tcctgtccag tgtatctttc actgatttct gaatcatgtt ctagttgctt 1621 gaccctgcca catgggtcca gtgttcatct gagcataact gtactaaatc ctttttccat 1681 atcagtataa taaaggagtg atgtgcaat // LOCUS HUMPGCA 1318 bp mRNA PRI 07-JAN-1995 DEFINITION Homo sapiens pepsinogen C (PGC) mRNA, complete cds. ACCESSION J04443 NID g551175 KEYWORDS gastricsin; pepsinogen C. SOURCE Homo sapiens gastric mucosa cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1318) AUTHORS Taggart,R.T., Cass,L.G., Mohandas,T.K., Derby,P., Barr,P.J., Pals,G. and Bell,G.I. TITLE Human pepsinogen C (progastricsin). Isolation of cDNA clones, localization to chromosome 6, and sequence homology with pepsinogen A JOURNAL J. Biol. Chem. 264 (1), 375-379 (1989) MEDLINE 89079679 FEATURES Location/Qualifiers source 1..1318 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="gastric mucosa" /map="6pter-p21.1" gene 8..1306 /gene="PGC" sig_peptide 8..55 /gene="PGC" /note="G00-119-485" CDS 8..1174 /gene="PGC" /note="precursor" /codon_start=1 /db_xref="GDB:G00-119-485" /product="pepsinogen" /db_xref="PID:g551176" /translation="MKWMVVVLVCLQLLEAAVVKVPLKKFKSIRETMKEKGLLGEFLR THKYDPAWKYRFGDLSVTYEPMAYMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSV YCQSQACTSHSRFNPSESSTYSTNGQTFSLQYGSGSLTGFFGYDTLTVQSIQVPNQEF GLSENEPGTNFVYAQFDGIMGLAYPALSVDEATTAMQGMVQEGALTSPVFSVYLSNQQ GSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEEFLIGGQASGWCSEGCQAIVD TGTSLLTVPQQYMSALLQATGAQEDEYGQFLVNCNSIQNLPSLTFIINGVEFPLPPSS YILSNNGYCTVGVEPTYLSSQNGQPLWILGDVFLRSYYSVYDLGNNRVGFATAA" mat_peptide 56..1171 /gene="PGC" /note="propeptide; G00-119-485" /product="pepsinogen" polyA_signal 1297..1306 /gene="PGC" /note="G00-119-485" polyA_site 1318 /gene="PGC" /note="G00-119-485" BASE COUNT 248 a 403 c 347 g 320 t ORIGIN 1 cagcatcatg aagtggatgg tggtggtctt ggtctgcctc cagctcttgg aggcagcagt 61 ggtcaaagtg cccctgaaga aatttaagtc tatccgtgag accatgaagg agaagggctt 121 gctgggggag ttcctgagga cccacaagta tgatcctgct tggaagtacc gctttggtga 181 cctcagcgtg acctacgagc ccatggccta catggatgct gcctactttg gtgagatcag 241 catcgggact ccaccccaga acttcctggt cctttttgac accggctcct ccaacttgtg 301 ggtgccctct gtctactgcc agagccaggc ctgcaccagt cactcccgct tcaaccccag 361 cgagtcgtcc acctactcca ccaatgggca gaccttctcc ctgcagtatg gcagtggcag 421 cctcaccggc ttctttggct atgacaccct gactgtccag agcatccagg tccccaacca 481 ggagttcggc ttgagtgaga atgagcctgg taccaacttc gtctatgcgc agtttgatgg 541 catcatgggc ctggcctacc ctgctctgtc cgtggatgag gccaccacag ctatgcaggg 601 catggtgcag gagggcgccc tcaccagccc cgtcttcagc gtctacctca gcaaccagca 661 gggctccagc gggggagcgg ttgtctttgg gggtgtggat agcagcctgt acacggggca 721 gatctactgg gcgcctgtca cccaggaact ctactggcag attggcattg aagagttcct 781 catcggcggc caggcctccg gctggtgttc tgagggttgc caggccatcg tggacacagg 841 cacctctctg ctcactgtgc cccagcagta catgagtgct cttctgcagg ccacaggggc 901 ccaggaggat gagtatggac agtttctcgt gaactgtaac agcattcaga atctgcccag 961 cttgaccttc atcatcaatg gtgtggagtt ccctctgcca ccttcctcct atatcctcag 1021 taacaacggc tactgcaccg tgggagtcga gcccacctac ctgtcctccc agaacggcca 1081 gcccctgtgg atcctcgggg atgtcttcct caggtcctac tattccgtct acgacttggg 1141 caacaacaga gtaggctttg ccactgccgc ctagacttgc tgcctcgaca cgtgggtggg 1201 ctcccctctt cctcttgacc ctgcaccctc ctagggcatt gtatctgtct ttccactctg 1261 gattcagcct tctttttctg gactctggac tttctctaat aataaatagt tcttcttt // LOCUS HUMPGES 2554 bp mRNA PRI 17-DEC-1993 DEFINITION Human prostaglandin endoperoxide synthase mRNA, complete cds. ACCESSION M59979 NID g189886 KEYWORDS prostaglandin endoperoxide synthase. SOURCE Human cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2554) AUTHORS Funk,C.D., Funk,L.B., Kennedy,M.E., Pong,A.S. and Fitzgerald,G.A. TITLE Human platelet/erythroleukemia cell prostaglandin G/H synthase: cDNA cloning, expression, and gene chromosomal assignment JOURNAL FASEB J. 5 (9), 2304-2312 (1991) MEDLINE 91317397 REFERENCE 2 (bases 1 to 2554) AUTHORS Funk,C.D. TITLE Direct Submission JOURNAL Submitted (06-MAR-1991) C.D. Funk, Division of Clinical Pharmacology, Vanderbilt University, Nashville, TN 37232, USA FEATURES Location/Qualifiers source 1..2554 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="blood platelet cell line HEL" sig_peptide 6..74 CDS 6..1805 /EC_number="1.14.99.1" /codon_start=1 /evidence=experimental /product="prostaglandin endoperoxide synthase" /db_xref="PID:g189887" /translation="MSRSLLLRFLLFLLLLPPLPVLLADPGAPTPVNPCCYYPCQHQG ICVRFGLDRYQCDCTRTGYSGPNCTIPGLWTWLRNSLRPSPSFTHFLLTHGRWFWEFV NATFIREMLMRLVLTVRSNLIPSPPTYNSAHDYISWESFSNVSYYTRILPSVPKDCPT PMGTKGKKQLPDAQLLARRFLLRRKFIPDPQGTNLMFAFFAQHFTHQFFKTSGKMGPG FTKALGHGVDLGHIYGDNLERQYQLRLFKDGKLKYQVLDGEMYPPSVEEAPVLMHYPR GIPPQSQMAVGQEVFGLLPGLMLYATLWLREHNRVCDLLKAEHPTWGDEQLFQTTRLI LIGETIKIVIEEYVQQLSGYFLQLKFDPELLFGVQFQYRNRIAMEFNHLYHWHPLMPD SFKVGSQEYSYEQFLFNTSMLVDYGVEALVDAFSRQIAGRIGGGRNMDHHILHVAVDV IRESREMRLQPFNEYRKRFGMKPYTSFQELVGEKEMAAELEELYGDIDALEFYPGLLL EKCHPNSIFGESMIEIGAPFSLKGLLGNPICSPEYWKPSTFGGEVGFNIVKTATLKKL VCLNTKTCPYVSFRVPDASQDDGPAVERPSTEL" mat_peptide 75..1802 /EC_number="1.14.99.1" /product="prostaglandin endoperoxide synthase" BASE COUNT 533 a 697 c 675 g 649 t ORIGIN 1 gcgccatgag ccggagtctc ttgctccggt tcttgctgtt cctgctcctg ctcccgccgc 61 tccccgtcct gctcgcggac ccaggggcgc ccacgccagt gaatccctgt tgttactatc 121 catgccagca ccagggcatc tgtgtccgct tcggccttga ccgctaccag tgtgactgca 181 cccgcacggg ctattccggc cccaactgca ccatccctgg cctgtggacc tggctccgga 241 attcactgcg gcccagcccc tctttcaccc acttcctgct cactcacggg cgctggttct 301 gggagtttgt caatgccacc ttcatccgag agatgctcat gcgcctggta ctcacagtgc 361 gctccaacct tatccccagt ccccccacct acaactcagc acatgactac atcagctggg 421 agtctttctc caacgtgagc tattacactc gtattctgcc ctctgtgcct aaagattgcc 481 ccacacccat gggaaccaaa gggaagaagc agttgccaga tgcccagctc ctggcccgcc 541 gcttcctgct caggaggaag ttcatacctg acccccaagg caccaacctc atgtttgcct 601 tctttgcaca acacttcacc caccagttct tcaaaacttc tggcaagatg ggtcctggct 661 tcaccaaggc cttgggccat ggggtagacc tcggccacat ttatggagac aatctggagc 721 gtcagtatca actgcggctc tttaaggatg ggaaactcaa gtaccaggtg ctggatggag 781 aaatgtaccc gccctcggta gaagaggcgc ctgtgttgat gcactacccc cgaggcatcc 841 cgccccagag ccagatggct gtgggccagg aggtgtttgg gctgcttcct gggctcatgc 901 tgtatgccac gctctggcta cgtgagcaca accgtgtgtg tgacctgctg aaggctgagc 961 accccacctg gggcgatgag cagcttttcc agacgacccg cctcatcctc ataggggaga 1021 ccatcaagat tgtcatcgag gagtacgtgc agcagctgag tggctatttc ctgcagctga 1081 aatttgaccc agagctgctg ttcggtgtcc agttccaata ccgcaaccgc attgccatgg 1141 agttcaacca tctctaccac tggcaccccc tcatgcctga ctccttcaag gtgggctccc 1201 aggagtacag ctacgagcag ttcttgttca acacctccat gttggtggac tatggggttg 1261 aggccctggt ggatgccttc tctcgccaga ttgctggccg gatcggtggg ggcaggaaca 1321 tggaccacca catcctgcat gtggctgtgg atgtcatcag ggagtctcgg gagatgcggc 1381 tgcagccctt caatgagtac cgcaagaggt ttggcatgaa accctacacc tccttccagg 1441 agctcgtagg agagaaggag atggcagcag agttggagga attgtatgga gacattgatg 1501 cgttggagtt ctaccctgga ctgcttcttg aaaagtgcca tccaaactct atctttgggg 1561 agagtatgat agagattggg gctccctttt ccctcaaggg tctcctaggg aatcccatct 1621 gttctccgga gtactggaag ccgagcacat ttggcggcga ggtgggcttt aacattgtca 1681 agacggccac actgaagaag ctggtctgcc tcaacaccaa gacctgtccc tacgtttcct 1741 tccgtgtgcc ggatgccagt caggatgatg ggcctgctgt ggagcgacca tccacagagc 1801 tctgaggggc aggaaagcag cattctggag gggagagctt tgtgcttgtc attccagagt 1861 gctgaggcca gggctgatgg tcttaaatgc tcattttctg gtttggcatg gtgagtgttg 1921 gggttgacat ttagaacttt aagtctcacc cattatctgg aatattgtga ttctgtttat 1981 tcttccagaa tgctgaactc cttgttagcc cttcagattg ttaggagtgg ttctcatttg 2041 gtctgccaga atactgggtt cttagttgac aacctagaat gtcagatttc tggttgattt 2101 gtaacacagt cattctagga tgtggagcta ctgatgaaat ctgctagaaa gttagggggt 2161 tcttattttg cattccagaa tcttgacttt ctgattggtg attcaaagtg ttgtgttccc 2221 tggctgatga tccagaacag tggctcgtat cccaaatctg tcagcatctg gctgtctaga 2281 atgtggattt gattcatttt cctgttcagt gagatatcat agagacggag atcctaaggt 2341 ccaacaagaa tgcattccct gaatctgtgc ctgcactgag agggcaagga agtggggtgt 2401 tcttcttggg acccccacta agaccctggt ctgaggatgt agagagaaca ggtgggctgt 2461 attcacgcca ttggttggaa gctaccagag ctctatcccc atccaggtct tgactcatgg 2521 cagctgtttc tcatgaagct aataaaattc gccc // LOCUS HUMPGM1A 2320 bp mRNA PRI 07-JAN-1995 DEFINITION Human phosphoglucomutase 1 (PGM1) mRNA, complete cds. ACCESSION M83088 NID g189925 KEYWORDS phosphoglucomutase. SOURCE Homo sapiens Adult Muscle, skeletal cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2320) AUTHORS Whitehouse,D.B., Putt,W., Lovegrove,J.U., Morrison,K., Hollyoake,M., Fox,M.F., Hopkinson,D.A. and Edwards,Y.H. TITLE Phosphoglucomutase 1: complete human and rabbit mRNA sequences and direct mapping of this highly polymorphic marker on human chromosome 1 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (1), 411-415 (1992) MEDLINE 92108065 FEATURES Location/Qualifiers source 1..2320 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="Adult" /tissue_type="Muscle, skeletal" /map="1p22.1" gene 63..1751 /gene="PGM1" CDS 63..1751 /gene="PGM1" /EC_number="5.4.2.2" /codon_start=1 /db_xref="GDB:G00-119-489" /db_xref="PID:g189926" /translation="MVKIVTVKTQAYQDQKPGTSGLRKRVKVFQSSANYAENFIQSII STVEPAQRQEATLVVGGDGRFYMKEAIQLIARIAAANGIGRLVIGQNGILSTPAVSCI IRKIKAIGGIILTASHNPGGPNGDFGIKFNISNGGPAPEAITDKIFQISKTIEEYAVC PDLKVDLGVLGKQQFDLENKFKPFTVEIVDSVEAYATMLRSIFDFSALKELLSGPNRL KICIDAMHGVVGPYVKKILCEELGAPANSAVNCVPLEDFGGHHPDPNLTYAADLVETM KSGEHDFGAAFDGDGDRNMILGKHGFFVNPSDSVAVIAANIFSIPYFQQTGVRGFARS MPTSGALDRVASATKIALYETPTGWKFFGNLMDASKLSLCGEESFGTGSDHIREKDGL WAVLAWLSILATRKQSVEDILKDHWQKHGRNFFTRYDYEEVEAEGANKMMKDLEALMF DRSFVGKQFSANDKVYTVEKADNFEYSDPVDGSISRNQGLRLIFTDGSRIVFRLSGTG SAGATIRLYIDSYEKDVAKINQDPQVMLAPLISIALKVSQLQERTGRTAPTVIT" BASE COUNT 576 a 572 c 610 g 562 t ORIGIN 1 gggccggccg cccctccgcc agccaagtcc gccgctctga cccccggcag caagtcgcca 61 ccatggtgaa gatcgtgaca gttaagaccc aggcgtacca ggaccagaag ccgggcacga 121 gcgggctgcg gaagcgggtg aaggtgttcc agagcagcgc caactacgcg gagaacttca 181 tccagagtat catctccacc gtggagccgg cgcagcggca ggaggccacg ctggtggtgg 241 gcggggacgg ccggttctac atgaaggagg ccatccagct catcgctcgc atcgctgccg 301 ccaacgggat cggtcgcttg gttatcggac agaatggaat cctctccacc cctgctgtat 361 cctgcatcat tagaaaaatc aaagccattg gtgggatcat tctgacagcc agtcacaacc 421 cagggggccc caatggagat tttggaatca aattcaatat ttctaatgga ggtcctgctc 481 cagaagcaat aactgataaa attttccaaa tcagcaagac aattgaagaa tatgcagttt 541 gccctgacct gaaagtagac cttggtgttc tgggaaagca gcagtttgac ttggaaaata 601 agttcaaacc cttcacagtg gaaattgtgg attcggtaga agcttatgct acaatgctga 661 gaagcatctt tgatttcagt gcactgaaag aactactttc tgggccaaac cgactgaaga 721 tctgtattga tgctatgcat ggagttgtgg gaccgtatgt aaagaagatc ctctgtgaag 781 aactcggtgc ccctgcgaac tcggcagtta actgcgttcc tctggaggac tttggaggcc 841 accaccctga ccccaacctc acctatgcag ctgacctggt ggagaccatg aagtcaggag 901 agcatgattt tggggctgcc tttgatggag atggggatcg aaacatgatt ctgggcaagc 961 atgggttctt tgtgaaccct tcagactctg tggctgtcat tgctgccaac atcttcagca 1021 ttccgtattt ccagcagact ggggtccgcg gctttgcacg gagcatgccc acgagtggtg 1081 ctctggaccg ggtggctagt gctacaaaga ttgctttgta tgagacccca actggctgga 1141 agttttttgg gaatttgatg gacgcgagca aactgtccct ttgtggggag gagagcttcg 1201 ggaccggttc tgaccacatc cgtgagaaag atggactgtg ggctgtcctt gcctggctct 1261 ccatcctagc cacccgcaag cagagtgtgg aggacattct caaagatcat tggcaaaagc 1321 atggccggaa tttcttcacc aggtatgatt acgaggaggt ggaagctgag ggcgcaaaca 1381 aaatgatgaa ggacttggag gccctgatgt ttgatcgctc ctttgtgggg aagcagttct 1441 cagcaaatga caaagtttac actgtggaga aggccgataa ctttgaatac agcgacccag 1501 tggatggaag catttcaaga aatcagggct tgcgcctcat tttcacagat ggttctcgaa 1561 tcgtcttccg actgagcggc actgggagtg ccggggccac cattcggctg tacatcgata 1621 gctatgagaa ggacgttgcc aagattaacc aggaccccca ggtcatgttg gcccccctta 1681 tttccattgc tctgaaagtg tcccagctgc aggagaggac gggacgcact gcacccactg 1741 tcatcaccta agaagacagg cctgatgtgg tacgtccctc cacccccgga cccatccaag 1801 tcatctgatt gaagagcatg acagaaacaa aatgtattca ccaagcattt taggatttga 1861 ctttttcact aaccagttga cgagcagtgc atttacaagg cactgccaaa caagatgccc 1921 ttgggagctg tgagggaaag aggacctgcg ggcttagatc aatctcaatt ccttttcatg 1981 ccctcctgca ttgctgctgc gtgggtattt gtctccttag ccatcaggta cagtttacac 2041 tacaatgtaa gctataggtg gagcatcagc agtgagtgag gccattcttc atccttagga 2101 tgtggcaatg aaatgatggt gcaagttcct ttctcttttg tgaatctttc cccccatttc 2161 ctgtttacat gtaacccaac aaaatgcaat ttctagtgcc ttctgtccaa tcagttcttt 2221 cctctgagtg agacgtactt ggctacagat ttctgccttg ttttgcgaca ttgtcccatt 2281 cacacagata ttttgggata ataaaggaaa ataagctaca // LOCUS HUMPGMR 2292 bp DNA PRI 14-FEB-1996 DEFINITION Homo sapiens phosphoglucomutase-related protein (PGMRP) gene, complete cds. ACCESSION L40933 NID g1160964 KEYWORDS PGM-related protein; adherens junction; dystrophin; homologue; phosphoglucomutase-related protein; utrophin. SOURCE Homo sapiens (clone: A111;A85) (clone library: ZAPII) female uterus DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2292) AUTHORS Moiseeva,E.P., Belkin,A.M., Spurr,N.K., Koteliansky,V.E. and Critchley,D.R. TITLE A novel dystrophin/utrophin-associated protein is an enzymatically inactive member of the phosphoglucomutase superfamily JOURNAL Eur. J. Biochem. 235 (1-2), 103-113 (1996) MEDLINE 96202923 COMMENT PGMRP gene is located on 9q12-13. FEATURES Location/Qualifiers source 1..2292 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="A111;A85" /clone_lib="ZAPII" /sex="female" /tissue_type="uterus" /map="9qcen-q13" gene 92..1612 /gene="PGMRP" CDS 92..1612 /gene="PGMRP" /note="homologue to phosphoglucomutase 1 (PGM1)" /codon_start=1 /evidence=experimental /product="phosphoglucomutase-related protein" /db_xref="PID:g1160965" /translation="MVVGSDGRYFSRTAIEIVVQMAAANGIGRLIIGQNGILSTPAVS CIIRKIKAAGGIILTASHCPGGPGGEFGVKFNVANGGPAPDVVSDKIYQISKTIEEYA ICPDLRIDLSRLGRQEFDLENKFKPFRVEIVDPVDIYLNLLRTIFDFHAIKGLLTGPS QLKIRIDAMHGVMGPYVRKVLCDELGAPANSAINCVPLEDFGGQHPDPNLTYATTLLE AMKGGEYGFGAAFDADGDRYMILGQNGFFVSPSDSLAIIAANLSCIPYFRQMGVRGFG RSMPTSMALDRVAKSMKVPVYETPAGWRFFSNLMDSGRCNLCGEESFGTGSDHLREKD GLWAVLVWLSIIAARKQSVEEIVRDHWAKFGRHYYCRFDYEGLDPKTTYYIMRDLEAL VTDKSFIGQQFAVGSHVYSVAKTDSFEYVDPVDGTVTKKQGLRIIFSDASRLIFRLSS SSGVRATLRLYAESYERDPSGHDQEPQAVLSPLIAIALKISQIHERTGRRGPTVIT" BASE COUNT 595 a 566 c 583 g 548 t ORIGIN 1 cggcctcttc gagggccagc gcaactacct gcccaacttt atccagagcg tgctgtcgtc 61 catcgacctg cgcgaccgtc agggctgcac catggtggtg ggcagcgacg gcaggtactt 121 tagcaggacg gccatcgaga tcgtggtgca gatggccgcg gccaacggga ttggacgact 181 gattattgga cagaatggca tcttgtcgac acctgcggtc tcctgcatta tcaggaagat 241 caaggcagct ggtggaatca ttctaacagc cagccactgc cctggaggac cagggggaga 301 gtttggagtg aagtttaatg ttgccaatgg aggtcctgca cccgatgttg tctcagacaa 361 aatctaccaa atcagcaaaa cgattgagga atatgctata tgtcctgatc tccgaatcga 421 cctatctcga ctaggaagac aagaatttga cctagaaaac aaattcaaac cattcagagt 481 ggagatagtg gacccagtgg atatctatct taacctcctt cggaccatct ttgactttca 541 tgccatcaag ggtttgctga ctggacccag ccaactgaag attcgcattg acgcaatgca 601 cggagttatg ggaccttatg tgagaaaagt tctgtgtgat gagctggggg ccccagccaa 661 ttctgcaata aactgtgttc ccctggaaga ctttggaggg cagcaccctg acccaaacct 721 gacatatgca acgactcttc tggaagcaat gaaaggagga gaatatggat ttggagctgc 781 atttgatgct gatggggacc gttatatgat cctaggccaa aatggcttct ttgtgagccc 841 ttctgactcc ctggccatca ttgctgccaa cctctcttgc attccatatt tccgtcagat 901 gggggtccgc gggtttggga ggagtatgcc aaccagcatg gccctggaca gagtggccaa 961 atcaatgaag gtccctgtat atgagacccc agctggatgg agattcttct caaatctgat 1021 ggactcagga cgttgcaatc tgtgtgggga agagagcttt ggcactggct ctgaccacct 1081 ccgagagaag gatggcctgt gggctgtctt ggtctggctc tccattattg ctgcccggaa 1141 gcagagtgtg gaggaaattg tccgagatca ctgggccaaa tttggccgcc actactattg 1201 caggtttgac tatgaggggt tggatcccaa gacgacatat tatatcatga gggacctgga 1261 ggccctggtc acagacaaat ccttcattgg ccagcagttt gctgtgggga gccatgtcta 1321 cagcgtggcg aagacggata gttttgaata cgtggaccct gtggatggca ctgtgaccaa 1381 gaaacagggc ctaaggatca ttttctcgga tgcatcacgg ctcatcttcc ggctcagttc 1441 ctccagtggt gtgcgggcca ccctcagact gtacgcagag agctacgaga gggatcccag 1501 cggccatgac caggagccac aggcagtgct gagccctctc atagccatcg cactgaaaat 1561 atcccagatt catgagagaa ctggccggag gggacccact gtcatcacct gaatagagga 1621 aagatcactc accagggcca aagagagtgc tcagcgggag atgcttcact gatgccttct 1681 tgctacctgt ttgtgcctct tatgactttg gaaaaacaaa agatattttg cttttggggg 1741 atagagggtg ggtgggaaaa gaaaaaaaat ccatttggtt ttggttttgt cctattcctc 1801 caaatgcagc agggccttta gttgtctgtt aaagctgcac tataatttgg tatctacatt 1861 ttatcacaca aaggaacctc cccttttgac aacaactggg ctaggcagct gttaatcaca 1921 acatttgtgc atcacttgtg ccaagtgaga aaatgttcta aaatcacaag agagaacagt 1981 gccagaatga aactgaccct aagtcccagg tgcccctggg caggcagaag gagacactcc 2041 cagcatggag gagggtttat cttttcatcc taggtcaggt ctacaatggg ggaaggtttt 2101 attatagaac tcccaacagc ccacctcact cctgccaccc acccgatggc cctgcctccc 2161 ccatcccatc cccaacatcc ctgtaccacc ttctctcaca tcttctaaag ctttgtacaa 2221 atcacaatgg tgcacttcca acaaaatata tcaataggtg ttttcctctc tcaaaaaaaa 2281 aaaaaaaaaa aa // LOCUS HUMPHIDYIN 1368 bp mRNA PRI 02-AUG-1994 DEFINITION Human phosphatidylinositol transfer protein mRNA, complete cds. ACCESSION M73704 NID g189938 KEYWORDS phosphatidylinositol transfer protein. SOURCE Homo sapiens male Adult Testis cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1368) AUTHORS Dickeson,S.K., Helmkamp,G.M.Jr.. and Yarbrough,L.R. TITLE Sequence of a human cDNA encoding phosphatidylinositol transfer protein and occurrence of a related sequence in widely dibergent eukaryotes JOURNAL Gene 142, 301-305 (1994) MEDLINE 94252585 FEATURES Location/Qualifiers source 1..1368 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="Adult" /sex="male" /tissue_type="Testis" 5'UTR 1..216 CDS 217..1029 /codon_start=1 /product="phosphatidylinositol transfer protein" /db_xref="PID:g189939" /translation="MVLLKEYRVILPVSVDEYQVGQLYSVAEASKNETGGGEGVEVLV NEPYEKDGEKGQYTHKIYHLQSKVPTFVRMLAPEGALNIHEKAWNAYPYCRTVITNEY MKEDFLIKIETWHKPDLGTQENVHKLEPEAWKHVEAVYIDIADRSQVLSKDYKAEEDP AKFKSIKTGRGPLGPNWKQELVNQKDCPYMCAYKLVTVKFKWWGLQNKVENFIHKQER RLFTNFHRQLFCWLDKWVDLTMDDIRRMEEETKRQLDEMRQKDPVKGMTADD" 3'UTR 1030..1368 BASE COUNT 359 a 342 c 410 g 257 t ORIGIN 1 ccgcagctcc gggcggcgtc ggcagcggcg cgagaggcga cgaggcccgg gcggcaggag 61 ccggcgcggc gaccgcggcg agggcggcgg ggacggagca gagcacgacg aagacgcaca 121 ggcagccggg ccgggccggg ccacggggag agcggccggc gggcagcggg cgggaggccg 181 ggcgggccgc gggcagccac cgagccgcga agcgacatgg tgctgctcaa ggagtatcga 241 gtaatcctgc ctgtgtctgt agatgagtat caagtggggc agctgtattc tgtggctgag 301 gccagtaaaa atgaaacggg tggtggcgaa ggcgtggagg tcctggtgaa tgagccctac 361 gagaaggacg gtgagaaagg ccagtacaca cacaagatct accacctgca gagcaaagta 421 cccacgtttg ttcgaatgct ggccccagag ggagccctga atatacacga gaaagcctgg 481 aatgcttacc cctactgcag aaccgttatt acaaatgagt acatgaaaga agactttctg 541 attaaaattg aaacctggca caaaccagat cttggcacgc aggagaatgt gcataagctg 601 gagcctgagg cgtggaaaca cgtggaagcc gtatatatag acattgcaga tcgaagccaa 661 gtgctcagca aggattacaa ggcagaggaa gacccagcaa aatttaaatc tatcaaaaca 721 ggccgaggac ccttgggccc caattggaag caagagcttg taaaccagaa ggactgccca 781 tatatgtgtg catacaaact ggtgaccgtc aagttcaagt ggtggggcct gcagaacaaa 841 gtggagaact tcatccataa gcaagagagg cgtctgttta caaacttcca caggcagctg 901 ttctgttggc tcgataagtg ggttgacctg accatggacg acattcgaag gatggaagaa 961 gagacgaaga gacagctgga tgaaatgaga caaaaggacc cagtgaaagg aatgacagca 1021 gatgactaaa gccgcctttc ccctctgcac tttttgcaag acagtggtca acaggaagac 1081 ccagcagctc caccctccca agtgacaggc cattttcgga cgcagccgtt ctgtgcccat 1141 tcttcaggca actttcgatt ccttactgta gctaattcta gatgcccttc agtattgtga 1201 acaaatagac cgtctggtgt cacagagccc gcgtgtgcag gagcctgctc ctccgagata 1261 cgtggcgtgt aggttgctgt agttacagcg cctccgtctc tctccattgt gttccgatcc 1321 atttctgtgt gttcccccaa cctttatttc caagtgacat tttcagtc // LOCUS HUMPHK 1571 bp mRNA PRI 24-SEP-1992 DEFINITION Human phosphorylase kinase (PSK-C3) mRNA, complete cds. ACCESSION M31606 NID g189940 KEYWORDS phosphorylase kinase. SOURCE Human HeLa cell line, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1571) AUTHORS Hanks,S.K. TITLE Messenger ribonucleic acid encoding an apparent isoform of phosphorylase kinase catalytic subunit is abundant in the adult testis JOURNAL Mol. Endocrinol. 3, 110-116 (1989) MEDLINE 89127266 FEATURES Location/Qualifiers source 1..1571 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" mRNA <1..1571 /note="PSK-C3 mRNA" CDS 94..1314 /codon_start=1 /product="phosphorylase kinase" /db_xref="PID:g189941" /translation="MTLDVGPEDELPDWAAAKEFYQKYDPKDVIGRGVSSVVRRCVHR ATGHEFAVKIMEVTAERLSPEQLEEVREATRRETHILRQVAGHPHIITLIDSYESSSF MFLVFDLMRKGELFDYLTEKVALSEKETRSIMRSLLEAVSFLHANNIVHRDLKPENIL LDDNMQIRLSDFGFSCHLEPGEKLRELCGTPGYLAPEILKCSMDETHPGYGKEVDLWA CGVILFTLLAGSPPFWHRRQILMLRMIMEGQYQFSSPEWDDRSSTVKDLISRLLQVDP EARLTAEQALQHPFFERCEGSQPWNLTPRQRFRVAVWTVLAAGRVALSTHRVRPLTKN ALLRDPYALRSVRHLIDNCAFRLYGHWVKKGEQQNRAALFQHRPPGPFPIMGPEEEGD SAAITEDEAVLVLG" BASE COUNT 327 a 468 c 455 g 321 t ORIGIN 16 bp upstream of PstI site. 1 aaggtgagcg actgcaggca aacccggcga cagcgcagct cgcgtcgacc ctggctcctc 61 tgcctgcccc ctcaggcccc cgcctccttc aggatgacgc tggacgtggg gccggaggat 121 gagctgcccg actgggccgc cgccaaagag ttttaccaga agtacgaccc taaggacgtc 181 atcggcagag gagtgagctc tgtggtccgc cgttgtgttc atcgagctac tggccacgag 241 tttgcggtga agattatgga agtgacagct gagcggctga gtcctgagca gctggaggag 301 gtgcgggaag ccacacggcg agagacacac atccttcgcc aggtcgccgg ccacccccac 361 atcatcaccc tcatcgattc ctacgagtct tctagcttca tgttcctggt gtttgacctg 421 atgcggaagg gagagctgtt tgactatctc acagagaagg tggccctctc tgaaaaggaa 481 accaggtcca tcatgcggtc tctgctggaa gcagtgagct ttctccatgc caacaacatt 541 gtgcatcgag atctgaagcc cgagaatatt ctcctagatg acaatatgca gatccgactt 601 tcagatttcg ggttctcctg ccacttggaa cctggcgaga agcttcgaga gttgtgtggg 661 accccagggt atctagcgcc agagatcctt aaatgctcca tggatgaaac ccacccaggc 721 tatggcaagg aggtcgacct ctgggcctgt ggggtgatct tgttcacact cctggctggc 781 tcgccaccct tctggcaccg gcggcagatc ctgatgttac gcatgatcat ggagggccag 841 taccagttca gttcccccga gtgggatgac cgttccagca ctgtcaaaga cctgatctcc 901 aggctgctgc aggtggatcc tgaggcacgc ctgacagctg agcaggccct acagcacccc 961 ttctttgagc gttgtgaagg cagccaaccc tggaacctca ccccccgcca gcggttccgg 1021 gtggcagtgt ggacagtgct ggctgctgga cgagtggccc taagcaccca tcgtgtacgg 1081 ccactgacca agaatgcact gttgagggac ccttatgcgc tgcggtcagt gcggcacctc 1141 atcgacaact gtgccttccg gctctacggg cactgggtaa agaaagggga gcagcagaac 1201 cgggcggctc tctttcagca ccggccccct gggccttttc ccatcatggg ccctgaagag 1261 gagggagact ctgctgctat aactgaggat gaggccgtgc ttgtgctggg ctaggacctc 1321 aaccccaggg attcccagga agcagaactc tccagaagaa gggttttgat cattccagct 1381 cctctgggct ctggcctcag gcccactaat gatcctgcta ccctcttgaa gaccagcccg 1441 gtacctctct ccccactggc caggactctg agatcagagc tggggtggaa gggagccatt 1501 ctgaacgcca cgcctggccc ggtcagtgct gcatgcactg catatgaaat aaaatctgct 1561 acacgccagg g // LOCUS HUMPHKI 1002 bp mRNA PRI 24-JUL-1996 DEFINITION Homo sapiens phosphomevalonate kinase mRNA, complete cds. ACCESSION L77213 NID g1294781 KEYWORDS cholesterol biosynthesis; kinase; peroxisomal; phosphomevalonate kinase. SOURCE Homo sapiens (clone library: Stratagene) adult liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1002) AUTHORS Chambliss,K.L., Slaughter,C.A., Schreiner,R., Hoffmann,G.F. and Gibson,K.M. TITLE Molecular cloning of human phosphomevalonate kinase and identification of a consensus peroxisomal targeting sequence JOURNAL J. Biol. Chem. 271 (29), 17330-17334 (1996) MEDLINE 96291886 FEATURES Location/Qualifiers source 1..1002 /organism="Homo sapiens" /note="(vector lambda ZAPII)" /db_xref="taxon:9606" /clone_lib="Stratagene" /dev_stage="adult" /tissue_type="liver" 5'UTR 1..35 mRNA 1..1002 CDS 36..614 /codon_start=1 /evidence=experimental /product="phosphomevalonate kinase" /db_xref="PID:g1294782" /translation="MAPLGGAPRLVLLFSGKRKSGKDFVTEALQSRLGADVCAVLRLS GPLKEQYAQEHGLNFQRLLDTSTYKEAFRKDMIRWGEEKRQADPGFFCRKIVEGISQP IWLVSDTRRVSDIQWFREAYGAVTQTVRVVALEQSRQQRGWVFTPGVDDAESECGLDN FGDFDWVIENHGVEQRLEEQLENLIEFIRSRL" misc_feature 603..611 /function="encodes peroxisomal targeting sequence" 3'UTR 615..1002 polyA_signal 984..989 polyA_site 1002 BASE COUNT 208 a 251 c 339 g 204 t ORIGIN 1 ccgcgatcta gaactagtcg aggcgtggcg gccccatggc cccgctggga ggcgccccgc 61 ggctggtact gctgttcagc ggcaagagga aatccgggaa ggacttcgtg accgaggcgc 121 tgcagagcag acttggagct gatgtctgtg ctgtcctccg gctctctggt ccactcaagg 181 aacagtatgc tcaggagcat ggcttgaact tccagagact cctggacacc agcacctaca 241 aggaggcctt tcggaaggac atgatccgct ggggagagga gaaacgccag gctgacccag 301 gcttcttttg caggaagatt gtggagggca tctcccagcc catctggctg gtgagtgaca 361 cacggagagt gtctgacatc cagtggtttc gggaggccta tggggccgtg acgcagacgg 421 tccgcgttgt agcgttggag cagagccgac agcagcgggg ctgggtgttc acgccagggg 481 tggacgatgc tgagtcagaa tgtggcctgg acaacttcgg ggactttgac tgggtcatcg 541 agaaccatgg agttgaacag cgcctggagg agcagttgga gaacctgata gaatttatcc 601 gctccagact ttagtcacta ggttctagga gtgagctggg gcctgctgag gtgggggtgg 661 ggctgactct gcaaaatggg ggtgtccccc gatcctggcc gaggtgagga acagacaggg 721 ggggtctaga ttctgagggg gttggtggat attgggcaag gcaggaaacc tctggagacc 781 tcattttctc catggggaag acagccatgc tcttcaggag gagactccaa gggcaaagga 841 gggtgtcttg gctgtgcttg aaggcgaaac cctgccatat ccccagtgcc agtcccctca 901 gcctgtggtg gccttgcatc ctgactggat gttctcagcc ccttgttctg ggcaagaacc 961 cagagctccc cagtgtggat actaataaac ctcttggagc ac // LOCUS HUMPHLAM 1635 bp mRNA PRI 07-JAN-1995 DEFINITION Human phospholamban mRNA, complete cds. ACCESSION M63603 NID g189942 KEYWORDS . SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1635) AUTHORS Fujii,J., Zarain-Herzberg,A., Willard,H.F., Tada,M. and MacLennan,D.H. TITLE Structure of the rabbit phospholamban gene, cloning of the human cDNA, and assignment of the gene to human chromosome 6 JOURNAL J. Biol. Chem. 266 (18), 11669-11675 (1991) MEDLINE 91268032 FEATURES Location/Qualifiers source 1..1635 /organism="Homo sapiens" /db_xref="taxon:9606" /map="6" gene 182..340 /gene="PLB" CDS 182..340 /gene="PLB" /codon_start=1 /function="regulatory protein of sarcoplasmic reticulum Ca-ATPase" /db_xref="GDB:G00-128-300" /product="phospholamban" /db_xref="PID:g189943" /translation="MEKVQYLTRSAIRRASTIEMPQQARQKLQNLFINFCLILICLLL ICIIVMLL" BASE COUNT 567 a 309 c 231 g 528 t ORIGIN 1 cagagtcaga aaactcccca gctaaacacc cgtaagactt catacaacac aatactctat 61 actgtgatga tcacagctgc caaggctacc taaaagaaga cagttatctc atatttggct 121 gccagctttt tatctttctc tcgaccactt aaaacttcag acttcctgtc ctgctggtat 181 catggagaaa gtccaatacc tcactcgctc agctataaga agagcctcaa ccattgaaat 241 gcctcaacaa gcacgtcaaa agctacagaa tctatttatc aatttctgtc tcatcttaat 301 atgtctcttg ctgatctgta tcatcgtgat gcttctctga agttctgcta caacctctag 361 atctgcagct tgccacatca gcttaaaatc tgtcatccca tgcagacagg aaaacaatat 421 tgtataacag accacttcct gagtagaaga gtttctttgt gaaaaggtca agattaagac 481 taaaacttat tgttaccata tgtattcatc tgttggatct tgtaaacatg aaaagggctt 541 tattttcaaa aattaacttc aaaataagtg tataaaatgc aactgttgat ttcctcaaca 601 tggctcacaa atttctatcc caaatctttt ctgaagatga agagtttagt tttaaaactg 661 cactgccaac aagttcactt catatataaa gcattatttt tactcttttg aggtgaatat 721 aatttatatt acaatgtaaa agcttcttta atactaagta tttttcaggt cttcaccaag 781 tatcaaagta ataacacaaa tgaagtgtca ttattcaaaa tagtccactg actcctcaca 841 tctgttatct tattataaag aactatttgt agtaactatc agaatctaca ttctaaaaca 901 gaaattgtat tttttctatg ccacattaac atcttttaaa gttgatgaga atcaagtatg 961 gaaaagtaag gccatactct tacataataa aattcctttt aagtaatttt ttcaaagaat 1021 cacagaattc tagtacatgt aggtaaatca taaatctgtt ctaagacata tgatcaacag 1081 atgagaactg gtggttaata tgtgacagtg agattagtca tatcactaat atactaacaa 1141 cagaatctaa tcttcattta aggcactgta gtgaattatc tgagctagag ttacctagct 1201 taccatacta tatctttgga atcatgaaac cttaagactt cagaatgatt ttgcaggttg 1261 tcttccattc cagcctaaca tccaatgcag gcaaggaaaa taaaagattt ccagtgacag 1321 aaaaatatat tatctcaagt attttttaaa aatatatgaa ttctctctcc aaatattaac 1381 taattattag attatatttt gaaatgaact tgttggccca tctattacat ctacagctga 1441 cccttgaaca tgggggttag gggagctgac aattcgtggg tccgcaaaat cttaactacc 1501 taatagccta ctattgacca taaaccttac tgataacata aacagtaaat taacacatat 1561 tttgcgtgtt atatgtatta tacactatat tcctacaata aagtaagcta gagaaaatgt 1621 tatttagaaa atcat // LOCUS HUMPHOSLIP 1750 bp mRNA PRI 05-APR-1994 DEFINITION Human phospholipid transfer protein mRNA, complete cds. ACCESSION L26232 NID g468325 KEYWORDS phospholipid transfer protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1750) AUTHORS Day,J.R., Albers,J.J., Lofton-Day,C.E., Gilbert,T.L., Ching,A.F., Grant,F.J., O'Hara,P.J., Marcovina,S.M. and Adolphson,J.L. TITLE Complete cDNA encoding human phospholipid transfer protein from human endothelial cells JOURNAL J. Biol. Chem. 269, 9388-9391 (1994) MEDLINE 94179366 FEATURES Location/Qualifiers source 1..1750 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="endothelial" CDS 88..1569 /note="putative" /codon_start=1 /product="phospholipid transfer protein" /db_xref="PID:g468326" /translation="MALFGALFLALLAGAHAEFPGCKIRVTSKALELVKQEGLRFLEQ ELETITIPDLRGKEGHFYYNISEVKVTELQLTSSELDFQPQQELMLQITNASLGLRFR RQLLYWFFYDGGYINASAEGVSIRTGLELSRDPAGRMKVSNVSCQASVSRMHAAFGGT FKKVYDFLSTFITSGMRFLLNQQICPVLYHAGTVLLNSLLDTVPVRSSVDELVGIDYS LMKDPVASTSNLDMDFRGAFFPLTERNWSLPNRAVEPQLQEEERMVYVAFSEFFFDSA MESYFRAGALQLLLVGDKVPHDLDMLLRATYFGSIVLLSPAVIDSPLKLELRVLAPPR CTIKPSGTTISVTASVTIALVPPDQPEVQLSSMTMDARLSAKMALRGKALRTQLDLRR FRIYSNHSALESLALIPLQAPLKTMLQIGVMPMLNERTWRGVQIPLPEGINFVHEVVT NHAGFLTIGADLHFAKGLREVIEKNRPADVRASTAPTPSTAAV" BASE COUNT 330 a 579 c 471 g 370 t ORIGIN 1 gtggccgccg tcgcccggat cccctgagct gcccgccatc ccacgtgacc gcgccgcccc 61 ccagctccac cgctgagccc gctcgccatg gccctcttcg gggccctctt cctagcgctg 121 ctggcaggcg cacatgcaga gttcccaggc tgcaagatcc gcgtcacctc caaggcgctg 181 gagctggtga agcaggaggg gctgcgcttt ctggagcaag agctggagac tatcaccatt 241 ccggacctgc ggggcaaaga aggccacttc tactacaaca tctctgaggt gaaggtcaca 301 gagctgcaac tgacatcttc cgagctcgat ttccagccac agcaggagct gatgcttcaa 361 atcaccaatg cctccttggg gctgcgcttc cggagacagc tgctctactg gttcttctat 421 gatgggggct acatcaacgc ctcagctgag ggtgtgtcca tccgcactgg tctggagctc 481 tcccgggatc ccgctggacg gatgaaagtg tccaatgtct cctgccaggc ctctgtctcc 541 agaatgcacg cggccttcgg gggaaccttc aagaaggtgt atgattttct ctccacgttc 601 atcacctcag ggatgcgctt cctcctcaac cagcagatct gccctgtcct ctaccacgca 661 gggacggtcc tgctcaactc cctcctggac accgtgcctg tgcgcagttc tgtggacgag 721 cttgttggca ttgactattc cctcatgaag gatcctgtgg cttccaccag caacctggac 781 atggacttcc ggggggcctt cttccccctg actgagagga actggagcct ccccaaccgg 841 gcagtggagc cccagctgca ggaggaagag cggatggtgt atgtggcctt ctctgagttc 901 ttcttcgact ctgccatgga gagctacttc cgggcggggg ccctgcagct gttgctggtg 961 ggggacaagg tgccccacga cctggacatg ctgctgaggg ccacctactt tgggagcatt 1021 gtcctgctga gcccagcagt gattgactcc ccattgaagc tggagctgcg ggtcctggcc 1081 ccaccgcgct gcaccatcaa gccctctggc accaccatct ctgtcactgc tagcgtcacc 1141 attgccctgg tcccaccaga ccagcctgag gtccagctgt ccagcatgac tatggacgcc 1201 cgtctcagcg ccaagatggc tctccggggg aaggccctgc gcacgcagct ggacctgcgc 1261 aggttccgaa tctattccaa ccattctgca ctggagtcgc tggctctgat cccattacag 1321 gcccctctga agaccatgct gcagattggg gtgatgccca tgctcaatga gcggacctgg 1381 cgtggggtgc agatcccact acctgagggc atcaactttg tgcatgaggt ggtgacgaac 1441 catgcgggat tcctcaccat cggggctgat ctccactttg ccaaagggct gcgagaggtg 1501 attgagaaga accggcctgc tgatgtcagg gcgtccactg cccccacacc gtccacagca 1561 gctgtctgag ccctcaatcc ccaagctggc agctgtcatt caggacccca acccctctca 1621 gcccctcttt tcccacattc atagcctgta gtgccccctc taacccccag tgccacagag 1681 aagacgggat ttgaagctgt acccaattta attccataat caatctatca attacagtcc 1741 gtccaccacc // LOCUS HUMPHOSPDL 2893 bp mRNA PRI 01-SEP-1993 DEFINITION Human phospholipase D mRNA, complete cds. ACCESSION L11701 NID g388762 KEYWORDS phospholipase D. SOURCE Homo sapiens liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2893) AUTHORS Tsang,T.C., Fung,W.-J.C., Levine,J., Metz,C.N., Davitz,M.A., Burns,D.K., Huang,K.-S. and Kochan,J.P. TITLE Isolation and expression of two human glycosylphosphatidylinositol phospholipase D (GPI-PLD) cDNAs JOURNAL FASEB Journal 6, 1922-1922 (1992) REFERENCE 2 (bases 1 to 2893) AUTHORS Tsang,T.C., Fung,W.-J.C., Metz,C.N., Davitz,M.A., Burns,D.K., Huang,K.-S. and Kochan,J.P. TITLE Identification and functional chracterization of two human glycosyl-phosphatidylinositol specific phospholipase D cDNAs JOURNAL Unpublished (1993) FEATURES Location/Qualifiers source 1..2893 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" sig_peptide 33..104 CDS 33..2558 /note="precursor" /codon_start=1 /product="phospholipase D" /db_xref="PID:g388763" /translation="MSAFRLWPGLLMIVMASLCHRGSSCGLSTHIEIGHRALEFLHLH NGHVNYKELLLEHQDAYQAGTVFPDCFYPSLCKGGKFHDVSESTHWTPFLNASVHYIR ENYPLPWEKDTEKLVAFLFGITSHMVADVSWHSLGIEQGFLRTMGAIDFHGSYSEAHS AGDFGGDVLSQFEFNFNYLARRWYVPVKDLLGIYEKLYGREVITENVIVDCSHIQFLE MYGEMLAVSKLYPSYSTKSPFLVEQFQEYFLGGLDDMAFWSTNIYHLTSFMLENGTSD CSLPENPLFIACGGQQNHTQGSKMQKNDFHRNLTSSLTENIDRNINYTERGVFFSVNS WTPDSMSFIYKALERNVRTMFIGGSQLSQKHISSPLASYFLSFPYARLGWAMTSADLN QDGYGDLVVGAPGYSRPGRIHIGRVYLIYGNELGLPPVDLDLDKEAHGILEGFQPSGR FGSALAMLDFNMDGVPDLAVGAPSVGSEQLTYKGAVYVYFGSKQGRMSSSPNITISCQ DIYCNLGWTLLAADVNGDSEPDLVIGSPFAPGGGKQKGIVAAFYSGPSLSNKEKLNVE AANWTVRGEEDFAWFGYSLHGVTVDNRTLLLVGSPTWKNASRLGRLLHIRDEKKSLGR VYGYFPPNSQSWFTIVGDKAMGKLGTSLSSGHVLMNGTLTQVLLVGAPTRDDVSKMAF LTMTLHQGGATRMYALTSDLQPPLLSTFSGDRRFSRFGGVLHLSDLDDDGVDEIIVAA PLRIADVTSGLIGGEDGRVYVYNGKETTLGDMTGKCKSWMTPCPEEKAQYVLISPEAS SRFGSSLITVRSKAKNQVVIAAGRSSLGARLSGALHVYSFGSD" mat_peptide 105..2555 /note="putative" /product="phospholipase D" BASE COUNT 691 a 702 c 738 g 762 t ORIGIN 1 cgtcattaga ggagccggtg gggaatgaga gcatgtctgc tttcaggttg tggcccggcc 61 tgctgatgat cgtgatggct tctctctgcc atagaggttc atcgtgtggc ctttcaacgc 121 acatagaaat cggacacaga gctctggagt ttcttcatct tcacaatggg catgttaact 181 acaaagagct gttactagaa caccaggatg catatcaggc tggaaccgtg tttcctgatt 241 gtttttaccc tagcctctgc aaaggaggaa aattccatga tgtgtctgag agcactcact 301 ggactccgtt tcttaacgca agcgttcatt atatccgaga gaactatccc cttccctggg 361 agaaggacac agagaaactg gtagctttct tgtttggaat tacttctcat atggtagcag 421 atgtcagctg gcatagtctg ggcattgaac aaggattcct taggaccatg ggagctattg 481 attttcacgg ctcctattct gaggctcatt cagctggtga ttttggagga gatgtgttga 541 gccagtttga atttaatttt aattaccttg cacgacgctg gtatgtgcca gtcaaagatc 601 tgctgggaat ttatgagaaa ctctatggtc gagaagtcat cactgaaaat gtaattgttg 661 attgttcaca tatccagttc ttagaaatgt atggtgagat gctagctgtt tccaagttat 721 atccctctta ctctacaaag tccccgtttt tggtggaaca attccaagag tattttcttg 781 gaggactgga tgatatggcg ttttggtcca ctaatattta ccatctaacg agcttcatgt 841 tggagaatgg gaccagtgac tgcagcctac ctgagaaccc tctgttcatt gcatgtggtg 901 gccagcaaaa ccacacccag ggctcgaaaa tgcagaaaaa tgattttcac agaaatttga 961 cttcatccct aactgaaaac attgacagga atataaacta taccgaaaga ggagtgttct 1021 tcagtgtaaa ttcctggacc ccggattcca tgtcctttat ctacaaggct ttggaaagga 1081 acgtaaggac aatgttcata ggtggctctc agttgtcaca gaagcacatc tctagcccct 1141 tagcatctta cttcttgtca tttccttatg caaggcttgg ctgggcaatg acctcagctg 1201 acctcaacca ggatgggtac ggcgacctcg tggtgggcgc accaggctac agccgccctg 1261 gccgcatcca catcgggcgc gtgtacctca tctacggcaa tgaactgggt ctgccgcccg 1321 ttgacctgga cctggacaag gaggcccacg ggatccttga aggtttccag ccctcaggtc 1381 ggtttggctc ggccttggct atgttggact ttaacatgga tggcgtgcct gacctggccg 1441 tgggagctcc ctcggtgggc tctgagcagc tcacctacaa aggtgctgtg tatgtctact 1501 ttggttccaa acaaggaaga atgtcttctt cccctaacat caccatctct tgccaggaca 1561 tctactgtaa cttgggctgg actctcttgg ctgcagatgt gaatggagac agtgagcccg 1621 atctggtcat tggctcccct tttgcaccag gtggagggaa gcagaaggga attgtggctg 1681 cgttttattc tggccccagc ctgagcaaca aagagaaact gaacgtggag gcggccaact 1741 ggacggtgag aggcgaggaa gactttgcct ggtttggata ctcccttcac ggtgtcactg 1801 tggacaacag aaccttgctg ctggttggga gcccgacctg gaagaatgcc agcaggctgg 1861 gccgtttgtt acacatccga gatgagaaaa agagccttgg gagggtgtat ggctacttcc 1921 caccaaacag ccaaagctgg tttaccattg ttggagacaa ggcaatgggg aaactgggta 1981 cttccctgtc cagtggccac gtgctgatga atggaactct gacccaggtg ctgctggtgg 2041 gagccccgac acgtgatgat gtgtctaaga tggcattcct gaccatgacc ctgcaccaag 2101 gcggagccac tcggatgtac gcgctcacat ccgacctgca gccaccgctg ctcagcacct 2161 tcagcggaga ccgccgcttc tctcgatttg gtggcgttct gcacttgagt gacctggatg 2221 atgatggcgt agatgaaatc atcgtggcag cccccctgag gatagcagat gtaacctctg 2281 ggctgattgg gggagaagat ggccgagttt atgtatataa tggcaaagag accacccttg 2341 gtgacatgac tggcaaatgc aaatcgtgga tgactccatg tccagaagaa aaggcccaat 2401 atgtattgat ttctcctgaa gccagctcaa ggtttgggag ctccctgatc accgtgaggt 2461 ccaaggcaaa gaatcaagtc gtcattgccg ctggaaggag ctctttggga gcccgactct 2521 ccggggcact tcacgtctat agctttggct cagattgaag atttcactgc gtttccccac 2581 tctgcccacc tctctcatgc tgaatcacat ccatggtgag cattttgatg gacaaaatgg 2641 cacatccagt ggagctgtgg cagatcctaa tagatgtggg gctcctggga gtagagacac 2701 acaccaacag ccaccctttc tggaaatctg atatagtata tatatgactg caccaggagt 2761 atgtgaaata tcagacacac tctgctcatt catgtctcct tccacagttt atttcctcgc 2821 ttcctttgca tctaaacctt tcttctttcc gaactttttg cctatagtca gacctgctgt 2881 accacctatt tcc // LOCUS HUMPHPA28A 828 bp mRNA PRI 14-NOV-1996 DEFINITION Human mRNA for proteasome activator hPA28 subunit beta, complete cds. ACCESSION D45248 NID g1008914 KEYWORDS proteasome activator hPA28 subunit beta; gamma-interferon-inducible protein activator of 20S proteasome; PA28beta. SOURCE Homo sapiens cell_line:HepG2 cDNA to mRNA, clone_lib:lambda ZAPII. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 828) AUTHORS Joon Young,A., Tanahashi,N., Akiyama,K., Hisamatsu,H., Noda,C., Tanaka,K., Chin Ha,C., Shimbara,N., Willy,P.J., Mott,J.D., Clive A,S. and DeMartino,G.N. TITLE Primary structures of two homologous subunits of PA28, a gamma-interferon-inducible protein activator of the 20S proteasome JOURNAL FEBS Lett. 366 (1), 37-42 (1995) MEDLINE 95309399 REFERENCE 2 (bases 1 to 828) AUTHORS Tanaka,K. TITLE Direct Submission JOURNAL Submitted (23-JAN-1995) to the DDBJ/EMBL/GenBank databases. Keiji Tanaka, The University of Tokushima, Institute for Enzyme Research; 3-18-15 Kuramoto-cho, Tokushima, Tokushima 770, Japan (Tel:0886-31-3111(ex.2563), Fax:0886-33-4223) FEATURES Location/Qualifiers source 1..828 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HepG2" /clone_lib="lambda ZAPII" CDS 66..785 /codon_start=1 /product="proteasome activator hPA28 suunit beta" /db_xref="PID:d1008800" /db_xref="PID:g1008915" /translation="MAKPCGVRLSGEARKQVEVFRQNLFQEAEEFLYRFLPQKIIYLN QLLQEDSLNVADLTSLRAPLDIPIPDPPPKDDEMETDKQEKKEVPKCGFLPGNEKVLS LLALVKPEVWTLKEKCILVITWIQHLIPKIEDGNDFGVAIQEKVLERVNAVKTKVEAF QTTISKYFSERGDAVAKASKETHVMDYRALVHERDEAAYGELRAMVLDLRAFYAELYH IISSNLEKIVTPKGEEKPSMY" polyA_signal 804..809 polyA_site 828 BASE COUNT 230 a 192 c 236 g 170 t ORIGIN 1 gggggagtga aagcgaaagc ccgggcgact agccgggaga ccagagatct agcgactgaa 61 gcagcatggc caagccgtgt ggggtgcgcc tgagcgggga agcccgcaaa caggtggagg 121 tcttcaggca gaatcttttc caggaggctg aggaattcct ctacagattc ttgccacaga 181 aaatcatata cctgaatcag ctcttgcaag aggactccct caatgtggct gacttgactt 241 ccctccgggc cccactggac atccccatcc cagaccctcc acccaaggat gatgagatgg 301 aaacagataa gcaggagaag aaagaagtcc ctaagtgtgg atttctccct gggaatgaga 361 aagtcctgtc cctgcttgcc ctggttaagc cagaagtctg gactctcaaa gagaaatgca 421 ttctggtgat tacatggatc caacacctga tccccaagat tgaagatgga aatgattttg 481 gggtagcaat ccaggagaag gtgctggaga gggtgaatgc cgtcaagacc aaagtggaag 541 ctttccagac aaccatttcc aagtacttct cagaacgtgg ggatgctgtg gccaaggcct 601 ccaaggagac tcatgtaatg gattaccggg ccttggtgca tgagcgagat gaggcagcct 661 atggggagct cagggccatg gtgctggacc tgagggcctt ctatgctgag ctttatcata 721 tcatcagcag caacctggag aaaattgtca ccccaaaggg tgaagaaaag ccatctatgt 781 actgaacccg ggactagaag gaaaataaat gatctatatg ttgtgtgg // LOCUS HUMPHSR1 2028 bp mRNA PRI 18-JAN-1991 DEFINITION Human mRNA for scavenger receptor type I (phSR1). ACCESSION D90187 NID g219989 KEYWORDS phSR1; scavenger receptor; scavenger receptor type 1. SOURCE Human monocytic cell line THP-1 (4 days after phorbol ester treatment), cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2028) AUTHORS Matsumoto,A., Naito,M., Itakura,H., Ikemoto,S., Asaoka,H., Hayakawa,I., Kanamori,H., Aburatani,H., Takaku,F., Suzuki,H., Kobari,Y., Miyai,T., Takahashi,K., Cohen,H.E., Wydro,R., Housman,E.D. and Kodama,T. TITLE Human macrophage scavenger receptors: primary structure, expression, and localization in atherosclerotic lesions JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87 (23), 9133-9137 (1990) MEDLINE 91067661 COMMENT These data kindly submitted in computer readable form by: Akiyo Matsumoto The National Institute of Health and Nutrition 1-23-1 Toyama Shinjuku-ku, Tokyo 162 Japan Phone: 03-203-5725 Fax: 03-207-3520. FEATURES Location/Qualifiers source 1..2028 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 47..1402 /note="phSR1" /codon_start=1 /db_xref="PID:d1014913" /db_xref="PID:g219990" /translation="MEQWDHFHNQQEDTDSCSESVKFDARSMTALLPPNPKNSPSLQE KLKSFKAALIALYLLVFAVLIPLIGIVAAQLLKWETKNCSVSSTNANDITQSLTGKGN DSEEEMRFQEVFMEHMSNMEKRIQHILDMEANLMDTEHFQNFSMTTDQRFNDILLQLS TLFSSVQGHGNAIDEISKSLISLNTTLLDLQLNIENLNGKIQENTFKQQEEISKLEER VYNVSAEIMAMKEEQVHLEQEIKGEVKVLNNITNDLRLKDWEHSQTLRNITLIQGPPG PPGEKGDRGPTGESGPRGFPGPIGPPGLKGDRGAIGFPGSRGLPGYAGRPGNSGPKGQ KGEKGSGNTLTPFTKVRLVGGSGPHEGRVEILHSGQWGTICDDRWEVRVGQVVCRSLG YPGVQAVHKAAHFGQGTGPIWLNEVFCFGRESSIEECKIRQWGTRACSHSEDAGVTCT L" BASE COUNT 653 a 369 c 434 g 572 t ORIGIN 1 agagaagtgg ataaatcagt gctgctttct ttaggacgaa agaagtatgg agcagtggga 61 tcactttcac aatcaacagg aggacactga tagctgctcc gaatctgtga aatttgatgc 121 tcgctcaatg acagctttgc ttcctccgaa tcctaaaaac agcccttccc ttcaagagaa 181 actgaagtcc ttcaaagctg cactgattgc cctttacctc ctcgtgtttg cagttctcat 241 ccctctcatt ggaatagtgg cagctcaact cctgaagtgg gaaacgaaga attgctcagt 301 tagttcaact aatgcaaatg atataactca aagtctcacg ggaaaaggaa atgacagcga 361 agaggaaatg agatttcaag aagtctttat ggaacacatg agcaacatgg agaagagaat 421 ccagcatatt ttagacatgg aagccaacct catggacaca gagcatttcc aaaatttcag 481 catgacaact gatcaaagat ttaatgacat tcttctgcag ctaagtacct tgttttcctc 541 agtccaggga catgggaatg caatagatga aatctccaag tccttaataa gtttgaatac 601 cacattgctt gatttgcagc tcaacataga aaatctgaat ggcaaaatcc aagagaatac 661 cttcaaacaa caagaggaaa tcagtaaatt agaggagcgt gtttacaatg tatcagcaga 721 aattatggct atgaaagaag aacaagtgca tttggaacag gaaataaaag gagaagtgaa 781 agtactgaat aacatcacta atgatctcag actgaaagat tgggaacatt ctcagacctt 841 gagaaatatc actttaattc aaggtcctcc tggacccccg ggtgaaaaag gagatcgagg 901 tcccactgga gaaagtggtc cacgaggatt tccaggtcca ataggtcctc cgggtcttaa 961 aggtgatcgg ggagcaattg gctttcctgg aagtcgagga ctcccaggat atgccggaag 1021 gccaggaaat tctggaccaa aaggccagaa aggggaaaag gggagtggaa acacattaac 1081 tccatttacg aaagttcgac tggtcggtgg gagcggccct cacgagggga gagtggagat 1141 actccacagc ggccagtggg gtacaatttg tgacgatcgc tgggaagtgc gcgttggaca 1201 ggtcgtctgt aggagcttgg gatacccagg tgttcaagcc gtgcacaagg cagctcactt 1261 tggacaaggt actggtccaa tatggctgaa tgaagtgttt tgttttggga gagaatcatc 1321 tattgaagaa tgtaaaattc ggcaatgggg gacaagagcc tgttcacatt ctgaagatgc 1381 tggagtcact tgcactttat aatgcatcat attttcattc acaactatga aatcgctgct 1441 caaaaatgat tttattacct tgttcctgta aaatccattt aatcaatatt taagagatta 1501 agaatattgc ccaaataata ttttagatta caggattaat atattgaaca ccttcatgct 1561 tactatttta tgtctatatt taaatcattt taacttctat aggtttttaa atggaatttt 1621 ctaatataat gacttatatg ctgaattgaa cattttgaag tttatagctt ccagattaca 1681 aaggccaagg gtaatagaaa tgcataccag taattggctc caattcataa tatgttcacc 1741 aggagattac aattttttgc tcttcttgtc tttgtaatct atttagttga ttttaattac 1801 tttctgaata acggaaggga tcagaagata tcttttgtgc ctagattgca aaatctccaa 1861 tccacacata ttgttttaaa ataagaatgt tatccaacta ttaagatatc tcaatgtgca 1921 ataacttgtg tattagatat caatgttaat gatatgtctt ggccactatg gaccagggag 1981 cttatttttc ttgtcatgta ctgacaactg tttaattgaa tcatgaag // LOCUS HUMPIGA 3589 bp mRNA PRI 05-MAR-1993 DEFINITION Human mRNA for PIG-A protein, complete cds. ACCESSION D11466 NID g219993 KEYWORDS glycosyl-phosphatidylinositol-anchor. SOURCE Human Hela cell cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Miyata,T., Takeda,J., Iida,Y., Yamada,N., Inoue,N., Takahashi,M., Maeda,K., Kitani,T. and Kinoshita,T. TITLE The cloning of PIG-A, a component in the early step of GPI-anchor biosynthesis JOURNAL Science 259 (5099), 1318-1320 (1993) MEDLINE 93190103 REFERENCE 2 (bases 1 to 3589) AUTHORS Kinoshita,T. JOURNAL Unpublished (1993) REFERENCE 3 (bases 1 to 3589) AUTHORS Kinoshita,T. TITLE Direct Submission JOURNAL Submitted (19-JUN-1992) to the DDBJ/EMBL/GenBank databases. Taroh Kinoshita, Institute for Microbial Deseases, Osaka University, Department of Immunoregulation Research; 3-1 Yamadaoka, Suita, Osaka 565, Japan (Tel:06-875-5233, Fax:06-875-5233) COMMENT Submitted (19-Jun-1992) to DDBJ by: Taroh Kinoshita Department of Immunoregulation Research Institute for Microbial Diseases Osaka University 3-1 Yamadaoka Suita Osaka 565 Japan Phone: 06-875-5233 Fax: 06-875-5233. FEATURES Location/Qualifiers source 1..3589 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Hela cell" gene 86..1540 /gene="PIG-A" CDS 86..1540 /gene="PIG-A" /codon_start=1 /product="PIG-A protein" /db_xref="PID:d1002501" /db_xref="PID:g219994" /translation="MACRGGAGNGHRASATLSRVSPGSLYTCRTRTHNICMVSDFFYP NMGGVESHIYQLSQCLIERGHKVIIVTHAYGNRKGIRYLTSGLKVYYLPLKVMYNQST ATTLFHSLPLLRYIFVRERVTIIHSHSSFSAMAHDALFHAKTMGLQTVFTDHSLFGFA DVSSVLTNKLLTVSLCDTNHIICVSYTSKENTVLRAALNPEIVSVIPNAVDPTDFTPD PFRRHDSITIVVVSRLVYRKGIDLLSGIIPELCQKYPDLNFIIGGEGPKRIILEEVRE RYQLHDRVRLLGALEHKDVRNVLVQGHIFLNTSLTEAFCMAIVEAASCGLQVVSTRVG GIPEVLPENLIILCEPSVKSLCEGLEKAIFQLKSGTLPAPENIHNIVKTFYTWRNVAE RTEKVYDRVSVEAVLPMDKRLDRLISHCGPVTGYIFALLAVFNFLFLIFLRWMTPDSI IDVAIDATGPRGAWTNNYSHSKRGGENNEISETR" BASE COUNT 1050 a 649 c 710 g 1180 t ORIGIN 1 actggcggcc atggaactca ccggtaatag aggacacatc tcttaactgg gttgctctaa 61 gaactgatgt ctaaaccgtc tcagcatggc ctgtagagga ggagctggga atggccaccg 121 tgcctcagct acactctctc gggttagccc tggaagtctt tacacatgta gaacccgtac 181 ccataatata tgcatggtat ctgacttttt ctacccaaat atgggaggcg tggaaagcca 241 catttaccag ctctctcagt gcctgattga aagagggcat aaggttataa ttgtcaccca 301 tgcttatgga aatcgaaaag gcatccgtta cctcaccagt ggcctcaaag tctattactt 361 gcctctgaaa gtcatgtaca accagtctac agccacgacc ctctttcaca gtctgccatt 421 gctcaggtac atatttgttc gggagagagt cacgataatc cattcacata gttctttttc 481 tgctatggcc catgatgctc tcttccacgc caagacaatg gggcttcaga cagtcttcac 541 ggaccattcc ctttttggat ttgctgatgt cagctcggtg cttacaaaca agcttctaac 601 cgtgtctctt tgtgatacaa accacatcat ttgtgtgtct tatactagta aggaaaatac 661 tgtactaaga gcagcactga atcctgaaat agtgtccgtc attcctaatg ctgtagatcc 721 tactgacttc actccagacc catttagaag gcatgatagt ataactattg ttgttgtcag 781 cagacttgtt tacagaaaag ggatcgattt gcttagtggt ataatacctg aactctgtca 841 gaaatatcca gatttaaatt tcataattgg aggagaggga ccaaagagaa tcattttgga 901 agaagttcgg gaaagatacc agctgcatga cagggtgcgt cttttgggag ctttagaaca 961 caaggatgtt agaaatgtct tagttcaagg acatattttt ctgaatacct cccttactga 1021 agcattctgc atggcgatcg tggaagcagc cagttgtggt ttacaggttg taagtaccag 1081 agttggtgga attcctgagg tgcttccaga aaaccttatt attttatgtg agccttcagt 1141 aaaatctttg tgtgaaggat tggaaaaggc tattttccaa ctgaagtcag ggacattgcc 1201 agctccagaa aacatccata acatagtaaa gactttctac acctggagga atgttgcaga 1261 aagaactgaa aaggtatatg accgggtatc agtggaagct gtgttgccaa tggacaaacg 1321 actggacaga cttatttctc actgcggccc agtaacaggc tacatctttg ctttgttggc 1381 agttttcaac ttcctcttcc tcattttctt gagatggatg actccagatt ctatcattga 1441 tgttgcaata gatgccactg ggccacgggg tgcctggact aataactatt ctcacagtaa 1501 aagagggggt gagaataatg agatatctga aaccaggtag aaggaagcct agattgtaag 1561 attttaaaca tttgtaatag ttctataaag actatggaaa ataaccttgc ttttgggggg 1621 tttttgtttt tttagagtta atttagtaag ttatgctacc tctatatcat tcaatatttt 1681 ctgttgagga aagataaaaa tgtatgcaat tcctgagtgt agaaacttct tgcacttatt 1741 taaaatttag gagagaacat ttaagccact caggtatgca atttttcaga ctactgaaat 1801 ccctgtagca gagatgtttt aacattatat tttgagagct ttgggtgctg aagggccaaa 1861 cgttttctgg gcattttttg gccagttttt aatgtaacac cattagacac tcaccagatg 1921 tttacaagtt ttctttaggg gaactacaac aattatatga actgttttat atcatgttca 1981 tatacattta ttaggaatct aaatcatgtc tttgaacatt tattaggttc actcagtagg 2041 tgttacatgt aattaacagg ttccttgagt aagatagtcc atcagttacc agcacatttt 2101 gaacccctgc tctgtgtaga atgttgaact agatgcttcc cgccattaag gaccaggggt 2161 gcattcactc tttgtttacc attcaaatgg cttacttcat cataattgtg gttgatatga 2221 gatcaatatc caacatgcca aaaatgctca tgccagttaa tgccaggaaa aaaatcaccg 2281 acacactact agtactttgt tcctgttgta tgcattctcc taggtagagc ctccatcttc 2341 agttgtgttt gtgaaggtat tttttgcttt ttaaatactg gggaccgata tcactgttga 2401 tagtgcagag aaaccctcca catttttcag tgcataattg agttttctat aaatgccttc 2461 gtgttttctg agcagaatgt acgaggtgtg ccatcccaaa accagctgct accctgtcct 2521 tttaatgtaa gtcactcccc ttcactgtgg cctcgctgat gtctgataag tattgtcagt 2581 gtgcaaaagg ctttacttca gaatggttta tttatagcaa actaagttga aaattttaga 2641 aacagtcttt gtgggtggat gttattaact gtcattgttg ttgcccagag ccatgggttt 2701 tttaacccca aattatccac atggtgtgta ttatgaattc tttgaactct taaggttttt 2761 gtgagaaaag gactgtgaat tcaaaacaat aaggcacttg tgggtgcact acatagattc 2821 tgacagtgtt gtgattctgt ataggatttt taaaaatgac aacattcaca aaatttatta 2881 ctttttaaaa aataacatgc ctattaactg gttgcactga tataaaagaa atatatttgt 2941 gttttgtttg tactaaaatg caaaagcaag agtgcaattt ttaaaatcta gaagttaggg 3001 gttttgttgg agaaaaatgg actgatcttt aaactattca gtcttactgg gatttttatg 3061 catagaaact cacatataaa catgaaataa acagtgccag tattcatagg aaagtgagaa 3121 actgtaatat ttggccatta ttctattcaa caggttttag aggcatgcca ccattttttc 3181 cttatatttt tgcttaattt ttttaaattg tcatttaatt cttaaactgt catttatttg 3241 agatggaaat aagatctaaa gttagttgcc tttgcctgta aaacatgtga tttgcaaatt 3301 attattttcc ttttttttta acaaatggaa gtaaatttgt ttcacgtaaa tcttaatttt 3361 caacctttct ggatacctta attgtaactg tcagtttgca ctggtcggta tatggaaaca 3421 cattgctcta ccctgctact tagttgattt taaagtgaat ttacagtgat gagaaatttg 3481 tgaaaaatat attgtatttc ttttgatgtt tcaaaaggtt gcctatgaaa aactgatttg 3541 ttaaaacatg ctacatgtcc aaaaataaag accagaatga cattttgat // LOCUS HUMPIGF 917 bp mRNA PRI 19-JUN-1993 DEFINITION Human mRNA for PIG-F (phosphatidyl-inositol-glycan class F), complete cds. ACCESSION D13435 NID g303615 KEYWORDS GPI-anchor biosynthesis; PIG-F; glycosylphosphatidylinositol-anchored protein synthesis; phosphatidyl-inositol-glycan class F. SOURCE Homo sapiens (individual_isolate KT-3) (library: pCEV4 KT-3) T-lymphocyte cell line KT-3 cDNA to mRNA, clone PIG-F. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 917) AUTHORS Inoue,N., Kinoshita,T., Orii,T. and Takeda,J. TITLE Cloning of a human gene, PIG-F, a component of glycosylphosphatidylinositol anchor biosynthesis, by a novel expression cloning strategy JOURNAL J. Biol. Chem. 268 (10), 6882-6885 (1993) MEDLINE 93216618 COMMENT Submitted (20-OCT-1992) to DDBJ by: Norimitsh Inoue Department of Immunoregulation Research Institute for Microbial Diseases Osaka University 3-1 Yamadaoka, Suita Osaka 565 Japan Phone: 06-875-5233 Fax: 06-875-5233. FEATURES Location/Qualifiers source 1..917 /organism="Homo sapiens" /isolate="KT-3" /db_xref="taxon:9606" /cell_line="KT-3" /cell_type="T-lymophocyte" /clone_lib="pCEV4 KT-3" CDS 68..727 /note="Involvement of GPI-anchor biosynthesis" /codon_start=1 /product="PIG-F" /db_xref="PID:d1003202" /db_xref="PID:g303616" /translation="MKDNDIKRLLYTHLLCIFSIILSVFIPSLFLENFSILETHLTWL CICSGFVTAVNLVLYLVVKPNTSSKRSSLSHKVTGFLKCCIYFLMSCFSFHVIFVLYG APLIELALETFLFAVILSTFTTVPCLCLLGPNLKAWLRVFSRNGVTSIWENSLQITTI SSFVGAWLGALPIPLDWERPWQVWPISCTLGATFGYVAGLVISPLWIYWNRKQLTYKN N" mat_peptide 68..724 /product="PIG-F" polyA_signal 901..906 polyA_site 917 BASE COUNT 254 a 176 c 184 g 303 t ORIGIN 1 aggaggcagt agttccccgc ttcccttccg cgggagggag agttagctag ccatccaaga 61 aaacaccatg aaagataacg atatcaagag actactgtat acccatcttt tatgcatatt 121 ttcaattatc ctaagtgtct tcattccatc actcttcttg gagaacttct caatattgga 181 aacacacttg acatggttgt gcatctgttc tggttttgta actgctgtca atctagtact 241 atatttagta gtgaaaccaa atacatcctc taaaagaagt tcattatcac acaaggtaac 301 tggatttttg aaatgctgta tctactttct tatgtcttgt ttctcctttc atgtaatttt 361 tgttctgtat ggagcaccac tgatagagtt ggcattggaa acatttttat ttgcagttat 421 tttgtctact tttactactg tgccttgctt atgtttgtta ggaccaaacc tcaaagcatg 481 gctaagagtg ttcagtagaa atggagttac atccatatgg gagaatagtc tccagatcac 541 tacaatttct agctttgtag gagcatggct tggagcactt cctattccac tggattggga 601 aagaccatgg caggtatggc ccatctcctg tacgcttgga gcgacctttg gctacgtggc 661 tggccttgtt atttcaccac tctggatata ctggaataga aagcaactta catacaagaa 721 caattaactg gagcaaaggg agatatttct ttgtgcagat tctgtaaggg ctgggcagaa 781 atgtgtatgg tcaaagccaa gcagttccat ttacagctct gttttttacg tagttacaac 841 atgatgtgat tgtagctttt taaactatga aacccctgag agattgtacc ttctagttga 901 aataaagtat ttataat // LOCUS HUMPIP 565 bp mRNA PRI 07-JAN-1995 DEFINITION Human prolactin-inducible protein (PIP) mRNA, complete cds. ACCESSION J03460 NID g189963 KEYWORDS prolactin-inducible protein. SOURCE Homo sapiens breast cancer cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 565) AUTHORS Murphy,L.C., Tsuyuki,D., Myal,Y. and Shiu,R.P. TITLE Isolation and sequencing of a cDNA clone for a prolactin-inducible protein (PIP). Regulation of PIP gene expression in the human breast cancer cell line, T-47D JOURNAL J. Biol. Chem. 262 (31), 15236-15241 (1987) MEDLINE 88033111 FEATURES Location/Qualifiers source 1..565 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="T-47D" /tissue_type="breast cancer" /map="7q32-qter" mRNA <1..565 /gene="PIP" /note="G00-120-292" gene 1..565 /gene="PIP" CDS 26..466 /gene="PIP" /codon_start=1 /db_xref="GDB:G00-120-292" /product="prolactin-inducible protein" /db_xref="PID:g189964" /translation="MRLLQLLFRASPATLLLVLCLQLGANKAQDNTRKIIIKNFDIPK SVRPNDEVTAVLAVQTELKECMVVKTYLISSIPLQGAFNYKYTACLCDDNPKTFYWDF YTNRTVQIAAVVDVIRELGICPDDAAVIPIKNNRFYTIEILKVE" BASE COUNT 157 a 140 c 110 g 158 t ORIGIN 90 bp upstream of PstI site. 1 cacattgcct tttgttttct ccagcatgcg cttgctccag ctcctgttca gggccagccc 61 tgccaccctg ctcctggttc tctgcctgca gttgggggcc aacaaagctc aggacaacac 121 tcggaagatc ataataaaga attttgacat tcccaagtca gtacgtccaa atgacgaagt 181 cactgcagtg cttgcagttc aaacagaatt gaaagaatgc atggtggtta aaacttacct 241 cattagcagc atccctctac aaggtgcatt taactataag tatactgcct gcctatgtga 301 cgacaatcca aaaaccttct actgggactt ttacaccaac agaactgtgc aaattgcagc 361 cgtcgttgat gttattcggg aattaggcat ctgccctgat gatgctgctg taatccccat 421 caaaaacaac cggttttata ctattgaaat cctaaaggta gaataatgga agccctgtct 481 gtttgccaca cccaggtgat ttcctctaaa gaaacttggc tggaatttct gctgtggtct 541 ataaaataaa cttcttaaca tgctt // LOCUS HUMPIR 1417 bp mRNA PRI 27-MAY-1994 DEFINITION Homo sapiens prostanoid IP receptor, complete cds. ACCESSION L29016 NID g495042 KEYWORDS prostanoid IP receptor. SOURCE Homo sapiens (library: cDNA library) male lung cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1417) AUTHORS Boie,Y., Rushmore,T.H., Darmon-Goodwin,A., Grygorczyk,R., Slipetz,D.M., Metters,K.M. and Abramovitz,M. TITLE Cloning and expression of a cDNA for the human prostanoid IP receptor JOURNAL J. Biol. Chem. 269, 12173-12178 (1994) MEDLINE 94216334 FEATURES Location/Qualifiers source 1..1417 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /tissue_type="lung" /tissue_lib="cDNA library" CDS 57..1217 /codon_start=1 /product="prostanoid IP receptor" /db_xref="PID:g495043" /translation="MADSCRNLTYVRGSVGPATSTLMFVAGVVGNGLALGILSARRPA RPSAFAVLVTGLAATDLLGTSFLSPAVFVAYARNSSLLGLARGGPALCDAFAFAMTFF GLASMLILFAMAVERCLALSHPYLYAQLDGPRCARLALPAIYAFCVLFCALPLLGLGQ HQQYCPGSWCFLRMRWAQPGGAAFSLAYAGLVALLVAAIFLCNGSVTLSLCRMYRQQK RHQGSLGPRPRTGEDEVDHLILLALMTVVMAVCSLPLTIRCFTQAVAPDSSSEMGDLL AFRFYAFNPILDPWVFILFRKAVFQRLKLWVCCLCLGPAHGDSQTPLSQLASGRRDPR APSAPVGKEGSCVPLSAWGEGQVEPLPPTQQSSGSAVGTSSKAEASVACSLC" BASE COUNT 195 a 507 c 450 g 265 t ORIGIN 1 ggcacagacg cacgggacag gagagcctgg gcaagactgg agagcccaga cctgggatgg 61 cggattcgtg caggaacctc acctacgtgc ggggctcggt ggggccggcc accagcaccc 121 tgatgttcgt ggccggtgtg gtgggcaacg ggctggccct gggcatcctg agcgcacggc 181 gaccggcgcg cccctcggcc ttcgcggtgc tggtcaccgg actggcggcc accgacctgc 241 tgggcaccag cttcctgagc ccggccgtgt tcgtggccta tgcgcgcaac agctccctgc 301 tgggcctggc ccgaggcggc cccgccctgt gcgatgcctt cgccttcgcc atgaccttct 361 tcggcctggc gtccatgctc atcctctttg ccatggccgt ggagcgctgc ctggcgctga 421 gccaccccta cctctacgcg cagctggacg ggccccgctg cgcccgcctg gcgctgccag 481 ccatctacgc cttctgcgtc ctcttctgcg cgctgcccct gctgggcctg ggccaacacc 541 agcagtactg ccccggcagc tggtgcttcc tccgcatgcg ctgggcccag ccgggcggcg 601 ccgccttctc gctggcctac gccggcctgg tggccctgct ggtggctgcc atcttcctct 661 gcaacggctc ggtcaccctc agcctctgcc gcatgtaccg ccagcagaag cgccaccagg 721 gctctctggg tccacggccg cgcaccggag aggacgaggt ggaccacctg atcctgctgg 781 ccctcatgac agtggtcatg gccgtgtgct ccctgcctct cacgatccgc tgcttcaccc 841 aggctgtcgc ccctgacagc agcagtgaga tgggggacct ccttgccttc cgcttctacg 901 ccttcaaccc catcctggac ccctgggtct tcatcctttt ccgcaaggct gtcttccagc 961 gactcaagct ctgggtctgc tgcctgtgcc tcgggcctgc ccacggagac tcgcagacac 1021 ccctttccca gctcgcctcc gggaggaggg acccaagggc cccctctgct cctgtgggaa 1081 aggaggggag ctgcgtgcct ttgtcggctt ggggcgaggg gcaggtggag cccttgcctc 1141 ccacacagca gtccagcggc agcgccgtgg gaacgtcgtc caaagcagaa gccagcgtcg 1201 cctgctccct ctgctgacat ttcaagctga ccctgtgatc tctgccctgt cttcgggcga 1261 caggagccag aaaatcaggg acatggctga tggctgcgga tgctggaacc ttggccccca 1321 aactctgggg ccgatcagct gctgtttctc tgcggcaggg cagtcgctgc tggctctggg 1381 aagagagtga gggacagagg aaacgtttat cctggag // LOCUS HUMPITPB 1204 bp mRNA PRI 30-JAN-1997 DEFINITION Human mRNA for phosphatidylinositol transfer protein (PI-TPbeta), complete cds. ACCESSION D30037 NID g1060904 KEYWORDS phosphatidylinositol transfer protein; PI-TPbeta. SOURCE Homo sapiens male brain cDNA to mRNA, clone:HBPITB. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Tanaka,S., Yamashita,S. and Hosaka,K. TITLE Cloning and expression of human cDNA encoding phosphatidylinositol transfer protein beta JOURNAL Biochim. Biophys. Acta 1259 (3), 199-202 (1995) MEDLINE 96130253 REFERENCE 2 (bases 1 to 1204) AUTHORS Tanaka,S., Yamashita,S. and Hosaka,K. TITLE Molecular cloning and sequencing of cDNAs encoding phosphatidylinositol transfer protein alpha and beta isoforms from human JOURNAL Unpublished (1994) REFERENCE 3 (bases 1 to 1204) AUTHORS Hosaka,K. TITLE Direct Submission JOURNAL Submitted (27-APR-1994) to the DDBJ/EMBL/GenBank databases. Kohei Hosaka, Gunma University School of Medicine, Department of Biochemistry; 3-39-22 Showa-machi, Maebashi, Gunma 371, Japan (Tel:0272-20-7943, Fax:0272-20-7948) FEATURES Location/Qualifiers source 1..1204 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HBPITB" /sex="male" /tissue_type="brain" CDS 47..862 /standard_name="PI-TPbeta" /codon_start=1 /product="phosphatidylinositol transfer protein" /db_xref="PID:d1006841" /db_xref="PID:g1060905" /translation="MVLIKEFRVVLPCSVQEYQVGQLYSVAEASKNETGGGEGIEVLK NEPYEKDGEKGQYTHKIYHLKSKVPAFVRMIAPEGSLVFHEKAWNAYPYCRTIVTNEY MKDDFFIKIETWHKPDLGTLENVHGLDPNTWKTVEIVHIDIADRSQVEPADYKADEDP ALFQSVKTKRGPLGPNWKKELANSPDCPQMCAYKLVTIKFKWWGLQSKVENFIQKQEK RIFTNFHRQLFCWIDKWIDLTMEDIRRMEDETQKELETMRKRGSVRGTSAADV" BASE COUNT 382 a 223 c 306 g 293 t ORIGIN 1 tcggcggcgg tggtatcggc ggcagctgtg agggggttcc gggaagatgg tgctgatcaa 61 ggaattccgt gtggttttgc catgttctgt tcaggagtat caggttgggc agctttactc 121 tgttgcagaa gctagtaaga atgagactgg tggtggagaa ggaattgaag tcttaaagaa 181 tgaaccttat gagaaggatg gagaaaaggg acagtatacg cacaaaattt atcacctaaa 241 gagcaaagtg cctgcattcg tgaggatgat tgctcccgag ggctccttgg tgtttcatga 301 gaaagcctgg aatgcgtacc cctactgtag aacaattgta acgaatgaat atatgaaaga 361 tgatttcttc attaaaatcg aaacatggca caaaccagac ttgggaacat tagaaaatgt 421 acatggttta gatccaaaca catggaaaac tgttgaaatt gtccatatag atattgcaga 481 tagaagtcaa gttgaaccag cagactacaa agctgatgaa gacccagcat tattccagtc 541 agtcaagacc aagagaggcc ctttgggacc caactggaag aaggagctgg caaacagccc 601 tgactgtccc cagatgtgtg cctataagct ggtgaccatc aaattcaagt ggtggggact 661 gcaaagcaaa gtagaaaact tcattcaaaa gcaagaaaaa cggatattta caaacttcca 721 tcgccagctt ttttgttgga ttgacaagtg gatcgatctc acgatggaag acattaggag 781 aatggaagac gagactcaga aagaactaga aacaatgcgt aagaggggtt ccgttcgagg 841 cacgtcggct gctgatgtct agatgagtcc cctgtagggt cagagacaat gtcaaactgt 901 ttacgtaatc aaggtcaagt gaggggaaca agcgcagcca gtgatgagtg aacaacaatc 961 tgaccagtat cttgcagtgt tgacgtttcc cagatgtgtg cttgtgatga tacacacaca 1021 tgcacaggtt ctcaaccacg tgtgtatata tgtatgtgtg catatgtctg tagctgtata 1081 taaagcgcat gtagagctac agatccagat acacacactt gtgtatatat gtacatacag 1141 acatactgaa gggattagta caatttctcc aaagtactgt acctatcttc agcaagaatg 1201 caaa // LOCUS HUMPKC 2754 bp mRNA PRI 20-SEP-1995 DEFINITION Human protein kinase C-theta (PRKCT) mRNA, complete cds. ACCESSION L01087 NID g558098 KEYWORDS highly expressed gene; protein kinase C-theta. SOURCE human cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2754) AUTHORS Chang,J.D., Xu,Y., Raychowdhury,M.K. and Ware,J.A. TITLE Molecular cloning and expression of a cDNA encoding a novel isoenzyme of protein kinase C (nPKC). A new member of the nPKC family expressed in skeletal muscle, megakaryoblastic cells, and platelets [published erratum appears in J Biol Chem 1994 Dec 9;269(49):31322] JOURNAL J. Biol. Chem. 268 (19), 14208-14214 (1993) MEDLINE 93300813 FEATURES Location/Qualifiers source 1..2754 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HEL" /cell_type="erythroleukemia" /tissue_lib="HEL cell lambda-gt11 library of M. Poncz" gene 95..2215 /gene="PRKCT" CDS 95..2215 /gene="PRKCT" /codon_start=1 /product="protein kinase C-theta" /db_xref="PID:g558099" /translation="MSPFLRIGLSNFDCGSCQSCQGEAVNPYCAVLVKEYVESENGQM YIQKKPTMYPPWDSTFDAHINKGRVMQIIVKGKNVDLISETTVELYSLAERCRKNNGK TEIWLELKPQGRMLMNARYFLEMSDTKDMNEFETEGFFALHQRRGAIKQAKVHHVKCH EFTATFFPQPTFCSVCHEFVWGLNKQGYQCRQCNAAIHKKCIDKVIAKCTGSAINSRE TMFHKERFKIDMPHRFKVYNYKSPTFCEHCGTLLWGLARQGLKCDACGMNVHHRCQTK VANLCGINQKLMAEALAMIESTQQARCLRDTEQIFREGPVEIGLPCSIKNEARLPCLP TPGKREPQGISWESPLDEVDKMCHLPEPELNKERPSLQIKLKIEDFILHKMLGKGSFG KVFLAEFKKTNQFFAIKALKKDVVLMDDDVECTMVEKRVLSLAWEHPFLTHMFCTFQT KENLFFVMEYLNGGDLMYHIQSCHKFDLSRATFYAAEIILGLQFLHSKGIVYRDLKLD NILLDKDGHIKIADFGMCKENMLGDAKTNTFCGTPDYIAPEILLGQKYNHSVDWWSFG VLLYEMLIGQSPFHGQDEEELFHSIRMDNPFYPRWLEKEAKDLLVKLFVREPEKRLGV RGDIRQHPLFREINWEELERKEIDPPFRPKVKSPFDCSNFDKEFLNEKPRLSFADRAL INSMDQNMFRNFSFMNPRMERLIS" polyA_signal 2324 polyA_signal 2524 BASE COUNT 764 a 657 c 686 g 647 t ORIGIN 1 gaattccgcc agccccgcca gtccccgcgc agtccccgcg cagtcccagc gccaccgggc 61 agcagcggcg ccgtgctcgc tccagggcgc aaccatgtcg ccatttcttc ggattggctt 121 gtccaacttt gactgcgggt cctgccagtc ttgtcagggc gaggctgtta acccttactg 181 tgctgtgctc gtcaaagagt atgtcgaatc agagaacggg cagatgtata tccagaaaaa 241 gcctaccatg tacccaccct gggacagcac ttttgatgcc catatcaaca agggaagagt 301 catgcagatc attgtgaaag gcaaaaacgt ggacctcatc tctgaaacca ccgtggagct 361 ctactcgctg gctgagaggt gcaggaagaa caacgggaag acagaaatat ggttagagct 421 gaaacctcaa ggccgaatgc taatgaatgc aagatacttt ctggaaatga gtgacacaaa 481 ggacatgaat gaatttgaga cggaaggctt ctttgctttg catcagcgcc ggggtgccat 541 caagcaggca aaggtccacc acgtcaagtg ccacgagttc actgccacct tcttcccaca 601 gcccacattt tgctctgtct gccacgagtt tgtctggggc ctgaacaaac agggctacca 661 gtgccgacaa tgcaatgcag caattcacaa gaagtgtatt gataaagtta tagcaaagtg 721 cacaggatca gctatcaata gccgagaaac catgttccac aaggagagat tcaaaattga 781 catgccacac agatttaaag tctacaatta caagagcccg accttctgtg aacactgtgg 841 gaccctgctg tggggactgg cacggcaagg actcaagtgt gatgcatgtg gcatgaatgt 901 gcatcataga tgccagacaa aggtggccaa cctttgtggc ataaaccaga agctaatggc 961 tgaagcgctg gccatgattg agagcactca acaggctcgc tgcttaagag atactgaaca 1021 gatcttcaga gaaggtccgg ttgaaattgg tctcccatgc tccatcaaaa atgaagcaag 1081 gctgccatgt ttaccgacac cgggaaaaag agagcctcag ggcatttcct gggagtctcc 1141 gttggatgag gtggataaaa tgtgccatct tccagaacct gaactgaaca aagaaagacc 1201 atctctgcag attaaactaa aaattgagga ttttatcttg cacaaaatgt tggggaaagg 1261 aagttttggc aaggtcttcc tggcagaatt caagaaaacc aatcaatttt tcgcaataaa 1321 ggccttaaag aaagatgtgg tcttgatgga cgatgatgtt gagtgcacga tggtagagaa 1381 gagagttctt tccttggcct gggagcatcc gtttctgacg cacatgtttt gtacatttca 1441 gaccaaggaa aacctctttt ttgtgatgga gtacctcaac ggaggggact taatgtacca 1501 catccaaagc tgccacaagt tcgacctttc cagagcgacg ttttatgctg ctgaaatcat 1561 tcttggtctg cagttccttc attccaaagg aatagtctac agggacctga agctagataa 1621 catcctgtta gacaaagatg gacatatcaa gatcgcggat tttggaatgt gcaaggagaa 1681 catgttagga gatgccaaga cgaatacctt ctgtgggaca cctgactaca tcgccccaga 1741 gatcttgctg ggtcagaaat acaaccactc tgtggactgg tggtccttcg gggttctcct 1801 ttatgaaatg ctgattggtc agtcgccttt ccacgggcag gatgaggagg agctcttcca 1861 ctccatccgc atggacaatc ccttttaccc acggtggctg gagaaggaag caaaggacct 1921 tctggtgaag ctcttcgtgc gagaacctga gaagaggctg ggcgtgaggg gagacatccg 1981 ccagcaccct ttgtttcggg agatcaactg ggaggaactt gaacggaagg agattgaccc 2041 accgttccgg ccgaaagtga aatcaccatt tgactgcagc aatttcgaca aagaattctt 2101 aaacgagaag ccccggctgt catttgccga cagagcactg atcaacagca tggaccagaa 2161 tatgttcagg aacttttcct tcatgaaccc ccggatggag cggctgatat cctgaatctt 2221 gcccctccag agacaggaaa gaatttgcct tgtccctggg aactggttca agagacactg 2281 cttgggttcc tttttcaact tggaaaaaga aagaaacact caacaataaa gactgagacc 2341 cgttcgcccc catgtgactt ttatctgtag cagaaaccaa gtctacttca ctaatgacga 2401 tgccgtgtgt ctcgtctcct gacatgtctc acagacgctc ctgaagttag gtcattacta 2461 accatagtta tttacttgaa agatgggtct ccgcacttgg aaaggtttca agacttgata 2521 ctgcaataaa ttatggctct tcacctgggc gccaactgct gatcaacgaa atgcttgttg 2581 aatcaggggc aaacggagta cagacgtctc aagactgaaa cggccccatt gcctggtcta 2641 gtagcggatc tcactcagcc gcagacaagt aatcactaac ccgttttatt ctattcctat 2701 ctgtggatgg gtaaatgctg ggggccagcc ctggataggt ttttatggga attc // LOCUS HUMPKCAMD 3259 bp mRNA PRI 07-JAN-1995 DEFINITION Human cAMP-dependent protein kinase subunit RII-beta mRNA, complete cds. ACCESSION M31158 NID g189980 KEYWORDS cAMP-dependent protein kinase RII-beta regulatory subunit; protein kinase. SOURCE Homo sapiens testis cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3259) AUTHORS Levy,F.O., Oyen,O., Sandberg,M., Tasken,K., Eskild,W., Hansson,V. and Jahnsen,T. TITLE Molecular cloning, complementary deoxyribonucleic acid structure and predicted full-length amino acid sequence of the hormone-inducible regulatory subunit of 3'-5'-cyclic adenosine monophosphate-dependent protein kinase from human testis JOURNAL Mol. Endocrinol. 2 (12), 1364-1373 (1988) MEDLINE 89112218 FEATURES Location/Qualifiers source 1..3259 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="testis" /map="Unassigned" gene 167..1423 /gene="PRKAR2A" CDS 167..1423 /gene="PRKAR2A" /codon_start=1 /db_xref="GDB:G00-120-314" /product="cAMP-dependent protein kinase subunit RII-beta" /db_xref="PID:g189981" /translation="MSIEIPAGLTELLQGFTVEVLRHQPADLLEFALQHFTRLQQENE RKGTARFGHEGRTWGDLGAAAGGGTPSKGVNFAEEPMQSDSEDGEEEEAAPADAGAFN APVINRFTRRASVCAEAYNPDEEEDDAESRIIHPKTDDQRNRLQEACKDILLFKNLDP EQMSQVLDAMFEKLVKDGEHVIDQGDDGDNFYVIDRGTFDIYVKCDGVGRCVGNYDNR GSFGELALMYNTPRAATITATSPGALWGLDRVTFRRIIVKNNAKKRKMYESFIESLPF LKSLEFSERLKVVDVIGTKVYNDGEQIIAQGDSADSFFIVESGEVKITMKRKGKSEVE ENGAVEMPRCSRGQYFGELALVTNKPRAASAHAIGTVKCLAMDVQAFERLLGPCMEIM KRNIATYEEQLVALFGTNMDIVEPTA" BASE COUNT 968 a 600 c 723 g 968 t ORIGIN 1 gacgcgcgcc gggagccggc ggccgggcca gccggcgccg gggcccagtg cgccgcgctc 61 gcagccggta gcgcgccagc cgtaggcgtc gctcggcagc cgcggggccc taggcgtgcc 121 ggggaggggg cgagggcggc caggcgcctg ccgccccgga ggcaggatga gcatcgagat 181 cccggcggga ctgacggagc tgctgcaggg cttcacggtg gaggtgctga ggcaccagcc 241 cgcggacctg ctggagttcg cgctgcagca cttcacccgc ctgcagcagg agaacgagcg 301 caaaggcacc gcgcgcttcg gccatgaggg caggacctgg ggggacctgg gcgccgctgc 361 cgggggcggc acccccagca agggggtcaa cttcgccgag gagcccatgc agtccgactc 421 cgaggacggg gaggaggagg aggcggcgcc cgcggacgca ggggcgttca atgctccagt 481 aataaaccga ttcacaaggc gtgcctcagt atgtgcagaa gcttataatc ctgatgaaga 541 agaagatgat gcagagtcca ggattataca tccaaaaact gatgatcaaa gaaataggtt 601 gcaagaggct tgcaaagaca tcctgctgtt taagaatctg gatccggagc agatgtctca 661 agtattagat gccatgtttg aaaaattggt caaagatggg gagcatgtaa ttgatcaagg 721 tgacgatggt gacaactttt atgtaattga tagaggcaca tttgatattt atgtgaaatg 781 tgatggtgtt ggaagatgtg ttggtaacta tgataatcgt gggagtttcg gcgaactggc 841 cttaatgtac aatacaccca gagcagctac aatcactgct acctctcctg gtgctctgtg 901 gggtttggac agggtaacct tcaggagaat aattgtgaaa aacaatgcca aaaagagaaa 961 aatgtatgaa agctttattg agtcactgcc attccttaaa tctttggagt tttctgaacg 1021 cctgaaagta gtagatgtga taggcaccaa agtatacaac gatggagaac aaatcattgc 1081 tcagggagat tcggctgatt cttttttcat tgtagaatct ggagaagtga aaattactat 1141 gaaaagaaag ggtaaatcag aagtggaaga gaatggtgca gtagaaatgc ctcgatgctc 1201 gcggggacag tactttggag agcttgccct ggtaactaac aaacctcgag cagcttctgc 1261 ccacgccatt gggactgtca aatgtttagc aatggatgtg caagcatttg aaaggcttct 1321 gggaccttgc atggaaatta tgaaaaggaa catcgctacc tatgaagaac agttagttgc 1381 cctgtttgga acgaacatgg atattgttga acccactgca tgaagcaaaa gtatggagca 1441 agacctgtag tgacaaaatt acacagtagt ggttagtcca ctgagaatgt gtttgtgtag 1501 atgccaagca ttttctgtga tttcaggttt tttccttttt ttacatttac aacgtatcaa 1561 taaacagtag tgatttaata gtcaataggc tttaacatca ctttctaaag agtagttcat 1621 aaaaaaatca acatactgat aaaatgactt tgtactccac aaaattatga ctgaaaggtt 1681 tattaaaatg attgtaatat atagaaagta tctgtgttta agaagataat taaaggatgt 1741 tatcataggc tatatgtgtt ttacttattc agactgataa tcatattagt gactatcccc 1801 atgtaagagg gcacttggca attaaacatg ctacacagca tggcatcact tttttttata 1861 actcattaaa cacagtaaaa ttttaatcat ttttgtttta aagttttcta gcttgataag 1921 ttatgtgctg ccttggccta ttggtgaaat ggtataaaat atcatatgca gttttaaaac 1981 tttttatatt tttgcaataa agtacatttt gactttgttg gcataatgtc agtaacatac 2041 atattccagt ggttttatgg acaggcaatt tagtcattat gataataagg aaaacagtgt 2101 tttagatgag agatcattaa tgcatttttc cctcatcaag catatatctg ctttttttta 2161 ttttgcaatt ctctgtattc tatgtcttta aaaatttgat cttgacattt aatgtcacaa 2221 agttttgttt ttttaaaaag tgatttaaac ttaagatccg acattttttg tattctttaa 2281 gattttacac ctaaaaaatc tctcctatcc caaaaataat gtgggatcct tatcagcatg 2341 cccacagttt atttctttgt tcttcactag gcctgcataa tacagtccta tgtagacatc 2401 tgttcccttg ggtttccgtt ctttcttagg atggttgcca acccacaatc tcattgatca 2461 gcagccaata tgggtttgtt tggttttttt aattcttaaa aacatcctct agaggaatag 2521 aaacaaattt ttatgagcat aaccctatat aaagacaaaa tgaatttctg accttaccat 2581 atataccatt aggccttgcc attgctttaa tgtagactca tagttgaaat tagtgcagaa 2641 agaactcaga tgtactagat tttcattgtt cattgatatg ctcagtatgc tgccacataa 2701 gatgaattta attatattca accaaagcaa tatactctta catgatttct aggccccatg 2761 acccagtgtc tagagacatt aattctaacc agttgtttgc ttttaaatga gtgatttcat 2821 tttgggaaac aggtttcaaa tgaatatata tacatgggta aaattactct gtgctagtgt 2881 agtcttacta gagaatgttt atggtcccac ttgtatatga aaatgtggtt agaatgttaa 2941 ttggataatg tatatataag aagttaaagt atgtaaagta taacttcagc cacattttta 3001 gaacactgtt taacattttt gcaaaacctt cttgtaggaa aagagagctc tctacatgaa 3061 gatgacttgt tttatatttc agattttatt ttaaaagcca tgtctgttaa acaagaaaaa 3121 acacaaaaga actccagatt cctggttcat cattctgtat tcttactcac tttttcaagt 3181 tatctatttt gttgcataaa ctaattgtta actattcatg gaacagcaaa cgcctgttta 3241 ataaagaact ttgaccaag // LOCUS HUMPKM2L 2287 bp mRNA PRI 24-SEP-1992 DEFINITION Human M2-type pyruvate kinase mRNA, complete cds. ACCESSION M23725 NID g189997 KEYWORDS pyruvate kinase. SOURCE Human liver, cDNA to mRNA, clones pHM2PK-[D,21]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2287) AUTHORS Tani,K., Yoshida,M.C., Satoh,H., Mitamura,K., Noguchi,T., Tanaka,T., Fujii,H. and Miwa,S. TITLE Human M2-type pyruvate kinase: cDNA cloning, chromosomal assignment and expression in hepatoma JOURNAL Gene 73, 509-516 (1988) MEDLINE 89211988 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tani, 04-APR-1989. FEATURES Location/Qualifiers source 1..2287 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" CDS 110..1705 /codon_start=1 /product="M2-type pyruvate kinase" /db_xref="PID:g189998" /translation="MSKPHSEAGTAFIQTQQLHAAMADTFLEHMCRLDIDSPPITARN TGIICTIGPASRSVETLKEMIKSGMNVARLNFSHGTHEYHAETIKNVRTATESFASDP ILYRPVAVALDTKGPEIRTGLIKGSGTAEVELKKGATLKITLDNAYMEKCDENILWLD YKNICKVVEVGSKIYVDDGLISLQVKQKGADFLVTEVENGGSLGSKKGVNLPGAAVDL PAVSEKDIQDLKFGVEQDVDMVFASFIRKASDVHEVRKVLGEKGKNIKIISKIENHEG VRRFDEILEASDGIMVARGDLGIEIPAEKVFLAQKMMIGRCNRAGKPVICATQMLESM IKKPRPTRAEGSDVANAVLDGADCIMLSGETAKGDYPLEAVRMQNLIAREAEAAIYHL QLFEELRRLAPITSDPTEATAVGAVEASFKCCSGAIIVLTKSGRSAHQVARYRPRAPI IAVTRNPQTARQAHLYRGIFPVLCKDPVQEAWAEDVDLRVNFAMNVGKARGFFKKGDV VIVLTGWRPGSGFTNTMRVVPVP" BASE COUNT 501 a 648 c 654 g 484 t ORIGIN 49 bp upstream of XhoII site; chromosome 15q22.2-q22.3. 1 ggctgaggca gtggctcctt gcacagcagc tgcacgcgcc gtggctccgg atcttcttcg 61 tctttgcagc gtagcccgag tcggtcagcg ccagaggacc tcagcagcca tgtcgaagcc 121 ccatagtgaa gccgggactg ccttcattca gacccagcag ctgcacgcag ccatggctga 181 cacattcctg gagcacatgt gccgcctgga cattgattca ccacccatca cagcccggaa 241 cactggcatc atctgtacca ttggcccagc ttcccgatca gtggagacgt tgaaggagat 301 gattaagtct ggaatgaatg tggctcgtct gaacttctct catggaactc atgagtacca 361 tgcggagacc atcaagaatg tgcgcacagc cacggaaagc tttgcttctg accccatcct 421 ctaccggccc gttgctgtgg ctctagacac taaaggacct gagatccgaa ctgggctcat 481 caagggcagc ggcactgcag aggtggagct gaagaaggga gccactctca aaatcacgct 541 ggataacgcc tacatggaaa agtgtgacga gaacatcctg tggctggact acaagaacat 601 ctgcaaggtg gtggaagtgg gcagcaagat ctacgtggat gatgggctta tttctctcca 661 ggtgaagcag aaaggtgccg acttcctggt gacggaggtg gaaaatggtg gctccttggg 721 cagcaagaag ggtgtgaacc ttcctggggc tgctgtggac ttgcctgctg tgtcggagaa 781 ggacatccag gatctgaagt ttggggtcga gcaggatgtt gatatggtgt ttgcgtcatt 841 catccgcaag gcatctgatg tccatgaagt taggaaggtc ctgggagaga agggaaagaa 901 catcaagatt atcagcaaaa tcgagaatca tgagggggtt cggaggtttg atgaaatcct 961 ggaggccagt gatgggatca tggtggctcg tggtgatcta ggcattgaga ttcctgcaga 1021 gaaggtcttc cttgctcaga agatgatgat tggacggtgc aaccgagctg ggaagcctgt 1081 catctgtgct actcagatgc tggagagcat gatcaagaag ccccgcccca ctcgggctga 1141 aggcagtgat gtggccaatg cagtcctgga tggagccgac tgcatcatgc tgtctggaga 1201 aacagccaaa ggggactatc ctctggaggc tgtgcgcatg cagaacctga ttgcccgtga 1261 ggcagaggct gccatctacc acttgcaatt atttgaggaa ctccgccgcc tggcgcccat 1321 taccagcgac cccacagaag ccaccgccgt gggtgccgtg gaggcctcct tcaagtgctg 1381 cagtggggcc ataatcgtcc tcaccaagtc tggcaggtct gctcaccagg tggccagata 1441 ccgcccacgt gcccccatca ttgctgtgac ccggaatccc cagacagctc gtcaggccca 1501 cctgtaccgt ggcatcttcc ctgtgctgtg caaggaccca gtccaggagg cctgggctga 1561 ggacgtggac ctccgggtga actttgccat gaatgttggc aaggcccgag gcttcttcaa 1621 gaagggagat gtggtcattg tgctgaccgg atggcgccct ggctccggct tcaccaacac 1681 catgcgtgtt gttcctgtgc cgtgatggac cccagagccc ctcctccagc ccctgtccca 1741 cccccttccc ccagcccatc cattaggcca gcaacgcttg tagaactcac tctgggctgt 1801 aacgtggcac tggtaggttg ggacaccagg gaagaagatc aacgcctcac tgaaacatgg 1861 ctgtgtttgc agcctgctct agtgggacag cccagagcct ggctgcccca tcatgtggcc 1921 ccacccaatc aagggaagaa ggaggaatgc tggactggag gcccctggag ccagatggca 1981 agagggtgac agcttccttt cctgtgtgta ctctgtccag ttcctttaga aaaaatggat 2041 gcccagagga ctcccaaccc tggcttgggg tcaagaaaca gccagcaaga gttaggggcc 2101 ttagggcact gggctgttgt tccattgaag ccgactctgg ccctggccct tacttgcttc 2161 tctagctctc taggcctctc cagtttgcac ctgtccccac cctccactca gctgtcctgc 2221 agcaaacact ccaccctcca ccttccattt tcccccacta ctgcagcacc tccaggcctg 2281 ttgccgc // LOCUS HUMPLA2 2875 bp mRNA PRI 07-JAN-1995 DEFINITION Human phosphatidylcholine 2-acylhydrolase (cPLA2) mRNA, complete cds. ACCESSION M68874 NID g190003 KEYWORDS phosphatidylcholine 2-acylhydrolase; phospholipase A2. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2875) AUTHORS Sharp,J.D., White,D.L., Chiou,X.G., Goodson,T., Gamboa,G.C., McClure,D., Burgett,S., Hoskins,J.A., Skatrud,P.L., Sportsman,J.R., Becker,G.W., Kang,L.H., Roberts,E.F. and Kramer,R.M. TITLE Molecular cloning and expression of human Ca(2+)-sensitive cytosolic phospholipase A2 JOURNAL J. Biol. Chem. 266 (23), 14850-14853 (1991) MEDLINE 91331987 FEATURES Location/Qualifiers source 1..2875 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="U937" gene 139..2388 /gene="cPLA2" CDS 139..2388 /gene="cPLA2" /EC_number="3.1.1.4" /codon_start=1 /evidence=experimental /product="phosphatidylcholine 2-acylhydrolase" /db_xref="PID:g190004" /translation="MSFIDPYQHIIVEHQYSHKFTVVVLRATKVTKGAFGDMLDTPDP YVELFISTTPDSRKRTRHFNNDINPVWNETFEFILDPNQENVLEITLMDANYVMDETL GTATFTVSSMKVGEKKEVPFIFNQVTEMVLEMSLEVCSCPDLRFSMALCDQEKTFRQQ RKEHIRESMKKLLGPKNSEGLHSARDVPVVAILGSGGGFRAMVGFSGVMKALYESGIL DCATYVAGLSGSTWYMSTLYSHPDFPEKGPEEINEELMKNVSHNPLLLLTPQKVKRYV ESLWKKKSSGQPVTFTDIFGMLIGETLIHNRMNTTLSSLKEKVNTAQCPLPLFTCLHV KPDVSELMFADWVEFSPYEIGMAKYGTFMAPDLFGSKFFMGTVVKKYEENPLHFLMGV WGSAFSILFNRVLGVSGSQSRGSTMEEELENITTKHIVSNDSSDSDDESHEPKGTENE DAGSDYQSDNQASWIHRMIMALVSDSALFNTREGRAGKVHNFMLGLNLNTSYPLSPLS DFATQDSFDDDELDAAVADPDEFERIYEPLDVKSKKIHVVDSGLTFNLPYPLILRPQR GVDLIISFDFSARPSDSSPPFKELLLAEKWAKMNKLPFPKIDPYVFDREGLKECYVFK PKNPDMEKDCPTIIHFVLANINFRKYKAPGVPRETEEEKEIADFDIFDDPESPFSTFN FQYPNQAFKRLHDLMHFNTLNNIDVIKEAMVESIEYRRQNPSRCSVSLSNVEARRFFN KEFLSKPKA" CDS <265..>381 /gene="cPLA2" /note="homology with PKC" /codon_start=1 /db_xref="PID:g190005" /translation="DPYVELFISTTPDSRKRTRHFNNDINPVWNETFEFILDP" BASE COUNT 908 a 522 c 599 g 846 t ORIGIN 1 gaattctccg gagctgaaaa aggatcctga ctgaaagcta gaggcattga ggagcctgaa 61 gattctcagg ttttaaagac gctagagtgc caaagaagac tttgaagtgt gaaaacattt 121 cctgtaattg aaaccaaaat gtcatttata gatccttacc agcacattat agtggagcac 181 cagtattccc acaagtttac ggtagtggtg ttacgtgcca ccaaagtgac aaagggggcc 241 tttggtgaca tgcttgatac tccagatccc tatgtggaac tttttatctc tacaacccct 301 gacagcagga agagaacaag acatttcaat aatgacataa accctgtgtg gaatgagacc 361 tttgaattta ttttggatcc taatcaggaa aatgttttgg agattacgtt aatggatgcc 421 aattatgtca tggatgaaac tctagggaca gcaacattta ctgtatcttc tatgaaggtg 481 ggagaaaaga aagaagttcc ttttattttc aaccaagtca ctgaaatggt tctagaaatg 541 tctcttgaag tttgctcatg cccagaccta cgatttagta tggctctgtg tgatcaggag 601 aagactttca gacaacagag aaaagaacac ataagggaga gcatgaagaa actcttgggt 661 ccaaagaata gtgaaggatt gcattctgca cgtgatgtgc ctgtggtagc catattgggt 721 tcaggtgggg gtttccgagc catggtggga ttctctggtg tgatgaaggc attatacgaa 781 tcaggaattc tggattgtgc tacctacgtt gctggtcttt ctggctccac ctggtatatg 841 tcaaccttgt attctcaccc tgattttcca gagaaagggc cagaggagat taatgaagaa 901 ctaatgaaaa atgttagcca caatcccctt ttacttctca caccacagaa agttaaaaga 961 tatgttgagt ctttatggaa gaagaaaagc tctggacaac ctgtcacctt tactgacatc 1021 tttgggatgt taataggaga aacactaatt cataatagaa tgaatactac tctgagcagt 1081 ttgaaggaaa aagttaatac tgcacaatgc cctttacctc ttttcacctg tcttcatgtc 1141 aaacctgacg tttcagagct gatgtttgca gattgggttg aatttagtcc atacgaaatt 1201 ggcatggcta aatatggtac ttttatggct cccgacttat ttggaagcaa attttttatg 1261 ggaacagtcg ttaagaagta tgaagaaaac cccttgcatt tcttaatggg tgtctggggc 1321 agtgcctttt ccatattgtt caacagagtt ttgggcgttt ctggttcaca aagcagaggc 1381 tccacaatgg aggaagaatt agaaaatatt accacaaagc atattgtgag taatgatagc 1441 tcggacagtg atgatgaatc acacgaaccc aaaggcactg aaaatgaaga tgctggaagt 1501 gactatcaaa gtgataatca agcaagttgg attcatcgta tgataatggc cttggtgagt 1561 gattcagctt tattcaatac cagagaagga cgtgctggga aggtacacaa cttcatgctg 1621 ggcttgaatc tcaatacatc ttatccactg tctcctttga gtgactttgc cacacaggac 1681 tcctttgatg atgatgaact ggatgcagct gtagcagatc ctgatgaatt tgagcgaata 1741 tatgagcctc tggatgtcaa aagtaaaaag attcatgtag tggacagtgg gctcacattt 1801 aacctgccgt atcccttgat actgagacct cagagagggg ttgatctcat aatctccttt 1861 gacttttctg caaggccaag tgactctagt cctccgttca aggaacttct acttgcagaa 1921 aagtgggcta aaatgaacaa gctccccttt ccaaagattg atccttatgt gtttgatcgg 1981 gaagggctga aggagtgcta tgtctttaaa cccaagaatc ctgatatgga gaaagattgc 2041 ccaaccatca tccactttgt tctggccaac atcaacttca gaaagtacaa ggctccaggt 2101 gttccaaggg aaactgagga agagaaagaa atcgctgact ttgatatttt tgatgaccca 2161 gaatcaccat tttcaacctt caattttcaa tatccaaatc aagcattcaa aagactacat 2221 gatcttatgc acttcaatac tctgaacaac attgatgtga taaaagaagc catggttgaa 2281 agcattgaat atagaagaca gaatccatct cgttgctctg tttcccttag taatgttgag 2341 gcaagaagat ttttcaacaa ggagtttcta agtaaaccca aagcatagtt catgtactgg 2401 aaatggcagc agtttctgat gctgaggcag tttgcaatcc catgacaact ggatttaaaa 2461 gtacagtaca gatagtcgta ctgatcatga gagactggct gatactcaaa gttgcagtta 2521 cttagctgca tgagaataat actattataa gttaggtgac aaatgatgtt gattatgtaa 2581 ggatatactt agctacattt tcagtcagta tgaacttcct gatacaaatg tagggatata 2641 tactgtattt ttaaacattt ctcaccaact ttcttatgtg tgttcttttt aaaaattttt 2701 tttcttttaa aatatttaac agttcaatct caataagacc tcgcattatg tatgaatgtt 2761 attcactgac tagatttatt cataccatga gacaacacta tttttattta tatatgcata 2821 tatatacata catgaaataa atacatcaat ataaaaataa aaaaaaacgg aattc // LOCUS HUMPLAST 1713 bp mRNA PRI 25-JUN-1996 DEFINITION Human T-plastin polypeptide mRNA, complete cds, clone p4. ACCESSION M22299 NID g190027 KEYWORDS T-plastin. SOURCE Human cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1713) AUTHORS Lin,C.S., Aebersold,R.H., Kent,S.B., Varma,M. and Leavitt,J. TITLE Molecular cloning and characterization of plastin, a human leukocyte protein expressed in transformed human fibroblasts JOURNAL Mol. Cell. Biol. 8 (11), 4659-4668 (1988) MEDLINE 89096835 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by C.-S.Lin, 18-JAN-1989. FEATURES Location/Qualifiers source 1..1713 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HuT-14" /cell_type="fibroblast" /clone="p4" sig_peptide 1..252 CDS 1..1713 /codon_start=1 /product="T-plastin polypeptide" /db_xref="PID:g190028" /translation="MLDGDRNKDGKISFDEFVYIFQEVKSSDIAKTFRKAINRKEGIC ALGGTSELSSEGTQHSYSEEEKYAFVNWINKALENDPDCRHVIPMNPNTDDLFKAVGD GIVLCKMINLSVPDTIDERAINKKKLTPFIIQENLNLALNSASAIGCHVVNIGAEDLR AGKPHLVLGLLWQIIKIGLFADIELSRNEALAALLRDGETLEELMKLSPEELLLRWAN FHLENSGWQKINNFSADIKDSKAYFHLLNQIAPKGQKEGEPRIDINMSGFNETDDLKR AESMLQQADKLGCRQFVTPADVVSGNPKLNLAFVANLFNKYPALTKPENQDIDWTLLE GETREERTFRNWMNSLGVNPHVNHLYADLQDALVILQLYERIKVPVDWSKVNKPPYPK LGANMKKLENCNYAVELGKHPAKFSLVGIGGQDLNDGNQTLTLALVWQLMRRYTLNVL EDLGDGQKANDDIIVNWVNRTLSEAGKSTSIQSFKDKTISSSLAVVDLIDAIQPGCIN YDLVKSGNLTEDDKHNNAKYAVSMARRIGARVYALPEDLVEVKPKMVMTVFACLMGRG MKRV" mat_peptide 253..393 /product="T-plastin" mat_peptide 394..540 /product="T-plastin" mat_peptide 541..1065 /product="T-plastin" mat_peptide 1066..1245 /product="T-plastin" mat_peptide 1246..1323 /product="T-plastin" mat_peptide 1375..1464 /product="T-plastin" mat_peptide 1465..1617 /product="T-plastin" mat_peptide 1618..1710 /product="T-plastin" BASE COUNT 544 a 316 c 401 g 452 t ORIGIN 1 atgctggatg gtgacaggaa taaagatggg aaaataagtt ttgacgaatt tgtttatatt 61 tttcaagagg taaaaagtag tgatattgcc aagaccttcc gcaaagcaat caacaggaaa 121 gaaggtattt gtgctctggg tggaacttca gagttgtcca gcgaaggaac acagcattct 181 tactcagagg aagaaaaata tgcttttgtt aactggataa acaaagcttt ggaaaatgat 241 cctgattgta gacatgttat accaatgaac cctaacaccg atgacctgtt caaagctgtt 301 ggtgatggaa ttgtgctttg taaaatgatt aacctttcag ttcctgatac cattgatgaa 361 agagcaatca acaagaagaa acttacaccc ttcatcattc aggaaaactt gaacttggca 421 ctgaactctg cttctgccat tgggtgtcat gttgtgaaca ttggtgcaga agatttgagg 481 gctgggaaac ctcatctggt tttgggactg ctttggcaga tcattaagat cggtttgttc 541 gctgacattg aattaagcag gaatgaagcc ttggctgctt tactccgaga tggtgagact 601 ttggaggaac ttatgaaatt gtctccagaa gagcttctgc ttagatgggc aaactttcat 661 ttggaaaact cgggctggca aaaaattaac aactttagtg ctgacatcaa ggattccaaa 721 gcctatttcc atcttctcaa tcaaatcgca ccaaaaggac aaaaggaagg tgaaccacgg 781 atagatatta acatgtcagg tttcaatgaa acagatgatt tgaagagagc tgagagtatg 841 cttcaacaag cagataaatt aggttgcaga cagtttgtta cccctgctga tgttgtcagt 901 ggaaacccca aactcaactt agctttcgtg gctaacctgt ttaataaata cccagcacta 961 actaagccag agaaccagga tattgactgg actctattag aaggagaaac tcgtgaagaa 1021 agaaccttcc gtaactggat gaactctctt ggtgtcaatc ctcacgtaaa ccatctctat 1081 gctgacctgc aagatgccct ggtaatctta cagttatatg aacgaattaa agttcctgtt 1141 gactggagta aggttaataa acctccatac ccgaaactgg gagccaacat gaaaaagcta 1201 gaaaactgca actatgctgt tgaattaggg aagcatcctg ctaaattctc cctggttggc 1261 attggagggc aagacctgaa tgatgggaac caaaccctga ctttagcttt agtctggcag 1321 ctgatgagaa gatataccct caatgtcctg gaagatcttg gagatggtca gaaagccaat 1381 gacgacatca ttgtgaactg ggtgaacaga acgttgagtg aagctggaaa atcaacttcc 1441 attcagagtt ttaaggacaa gacgatcagc tccagtttgg cagttgtgga tttaattgat 1501 gccatccagc caggctgtat aaactatgac cttgtgaaga gtggcaatct aacagaagat 1561 gacaagcaca ataatgccaa gtatgcagtg tcaatggcta gaagaatcgg agccagagtg 1621 tatgctctcc ctgaagacct tgtggaagta aagcccaaga tggtcatgac tgtgtttgca 1681 tgtttgatgg gcaggggaat gaagagagtg taa // LOCUS HUMPLC 4242 bp mRNA PRI 07-JAN-1995 DEFINITION Human phospholipase C mRNA, complete cds. ACCESSION M37238 NID g190035 KEYWORDS phospholipase C. SOURCE Human lymphocyte, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4242) AUTHORS Ohta,S., Matsui,A., Nazawa,Y. and Kagawa,Y. TITLE Complete cDNA encoding a putative phospholipase C from transformed human lymphocytes JOURNAL FEBS Lett. 242 (1), 31-35 (1988) MEDLINE 89078616 FEATURES Location/Qualifiers source 1..4242 /organism="Homo sapiens" /db_xref="taxon:9606" /map="20q12-q13.1" mRNA <1..4242 /note="phospholipase C mRNA" gene 153..3911 /gene="PLC1" CDS 153..3911 /gene="PLC1" /note="phospholipase C" /codon_start=1 /db_xref="GDB:G00-120-299" /db_xref="PID:g190036" /translation="MSTTVNVDSLAEYEKSQIKRALELGTVMTVFSFRKSTPERRTVQ VIMETRQVAWSKTADKIEGFLDIMEIKEIRPGKNSKDFERAKAVRQKEDCCFTILYGT QFVLSTLSLAADSKEDAVNWLSGLKILHQEAMNASTPTIIESWLRKQIYSVDQTRRNS ISLRELKTILPLINFKVSSAKFLKDKFVEIGAHKDELSFEQFHLFYKKLMFEQQKSIL DEFKKDSSVFILGNTDRPDASAVYLHDFQRFLIHEQQEHWAQDLNKVRERMTKFIDDT MRETAEPFLFVDEFLTYLFSRENSIWDEKYDAVDMQDMNNPLSHYWISSSHNTYLTGD QLRSESSPEAYIRCLRMGCRCIELDCWDGPDGKPVIYHGWTRTTKIKFDDVVQAIKDH AFVTSSFPVILSIEEHCSVEQQRHMAKAFKEVFGDLLLTKPTEASADQLPSPSQLREK IIIKHKKLGPRGDVDVNMEDKKDEHKQQGELYMWDSIDQKWTRHYCAIADAKLSFSDD IEQTMEEEVPQDIPPTELHFGEKWFHKKVEKRTSAEKLLQEYCMETGGKDGTFLVRES ETFPNDYTLSFWRSGRVQHCRIRSTMEGGTLKYYLTDNLRFRRMYALIQHYRETHLPC AEFELRLTDPVPNPNPHESKPWYYDSLSRGEAEDMLMRIPRDGAFLIRKREGSDSYAI TFRARGKVKHCRINRDGRHFVLGTSAYFESLVELVSYYEKHSLYRKMRLRYPVTPELL ERYNTERDINSLYDVSRMYVDPSEINPSMPQRTVKALYDYKAKRSDELSFCRGALIHN VSKEPGGWWKGDYGTRIQQYFPSNYVEDISTADFEELEKQIIEDNPLGSLCRGILDLN TYNVVKAPQGKNQKSFVFILEPKEQGDPPVEFATDRVEELFEWFQSIREITWKIDSKE NNMKYWEKNQSIAIELSDLVVYCKPTSKTKDNLENPDFREIRSFVETKADSIIRQKPV DLLKYNQKGLTRVYPKGQRVDSSNYDPFRLWLCGSQMVALNFQTADKYMQMNHALFSL NGRTGYVLQPESMRTEKYDPMPPESQRKILMTLTVKVLGARHLPKLGRSIACPFVEVE ICGAEYGNNKFKTTVVNDNGLSPIWAPTQEKVTFEIYDPNLAFLRFVVYEEDMFSDPN FLAHATYPIKAVKSGFRSVPLKNGYSEDIELASLLVFCEMRPVLESEEELYSSCRQLR RRQEELNNQLFLYDTHQNLRNANRDALVKEFSVNENHSSCTRRNATRG" BASE COUNT 1116 a 1097 c 1127 g 902 t ORIGIN 1 gaattcggcg ctgagtgacc cgagtcggga cgcgggctgc gcgcgcggga ccccggagcc 61 caaacccggg gcaggcgggc agctgtgccc gggcggcacg gccagcttcc tgatttctcc 121 cgattccttc cttctccctg gagcggccga caatgtccac cacggtcaat gtagattccc 181 ttgcggaata tgagaagagc cagatcaaga gagccctgga gctggggacg gtgatgactg 241 tgttcagctt ccgcaagtcc acccccgagc ggagaaccgt ccaggtgatc atggagacgc 301 ggcaggtggc ctggagcaag accgccgaca agatcgaggg cttcttggat atcatggaaa 361 taaaagaaat ccgcccaggg aagaactcca aagatttcga gcgagcaaaa gcagttcgcc 421 agaaagaaga ctgctgcttc accatcctat atggcactca gttcgtcctc agcacgctca 481 gcttggcagc tgactctaaa gaggatgcag ttaactggct ctctggcttg aaaatcttac 541 accaggaagc gatgaatgcg tccacgccca ccattatcga gagttggctg agaaagcaga 601 tatattctgt ggatcaaacc agaagaaaca gcatcagtct ccgagagttg aagaccatct 661 tgcccctgat caactttaaa gtgagcagtg ccaagttcct taaagataag tttgtggaaa 721 taggagcaca caaagatgag ctcagctttg aacagttcca tctcttctat aaaaaactta 781 tgtttgaaca gcaaaaatcg attctcgatg aattcaaaaa ggattcgtcc gtgttcatcc 841 tggggaacac tgacaggccg gatgcctctg ctgtttacct gcatgacttc cagaggtttc 901 tcatacatga acagcaggag cattgggctc aggatctgaa caaagtccgt gagcggatga 961 caaagttcat tgatgacacc atgcgtgaaa ctgctgagcc tttcttgttt gtggatgagt 1021 tcctcacgta cctgttttca cgagaaaaca gcatctggga tgagaagtat gacgcggtgg 1081 acatgcagga catgaacaac cccctgtctc attactggat ctcctcgtca cataacacgt 1141 accttacagg tgaccagctg cggagcgagt cgtccccaga agcttacatc cgctgcctgc 1201 gcatgggctg tcgctgcatt gaactggact gctgggacgg gcccgatggg aagccggtca 1261 tctaccatgg ctggacgcgg actaccaaga tcaagtttga tgacgtcgtg caggccatca 1321 aagaccacgc ctttgttacc tcgagcttcc cagtgatcct gtccatcgag gagcactgca 1381 gcgtggagca acagcgtcac atggccaagg ccttcaagga agtatttggc gacctgctgt 1441 tgacgaagcc cacggaggcc agtgctgacc agctgccctc gcccagccag ctgcgggaga 1501 agatcatcat caagcataag aagctgggcc cccgaggcga tgtggatgtc aacatggagg 1561 acaagaagga cgaacacaag caacaggggg agctgtacat gtgggattcc attgaccaga 1621 aatggactcg gcactactgc gccattgctg atgccaagct gtccttcagt gatgacattg 1681 aacagactat ggaggaggaa gtgccccagg atataccccc tacagaacta cattttgggg 1741 agaaatggtt ccacaagaag gtggagaaga ggacgagtgc cgagaagttg ctgcaggaat 1801 actgcatgga gacggggggc aaggatggca ccttcctggt tcgggagagc gagaccttcc 1861 ccaatgacta caccctgtcc ttctggcggt caggccgggt ccagcactgc cggatccgct 1921 ccaccatgga gggcgggacc ctgaaatact acttgactga caacctgagg ttcaggagga 1981 tgtatgccct catccagcac taccgcgaga cgcacctgcc gtgcgccgag ttcgagctgc 2041 ggctcacgga ccctgtgccc aaccccaacc cccacgagtc caagccgtgg tactatgaca 2101 gcctgagccg cggagaggca gaggacatgc tgatgaggat tccccgggac ggggccttcc 2161 tgatccggaa gcgagagggg agcgactcct atgccatcac cttcagggct aggggcaagg 2221 taaagcattg tcgcatcaac cgggacggcc ggcactttgt gctggggacc tccgcctatt 2281 ttgagagtct ggtggagctc gtcagttact acgagaagca ttcactctac cgaaagatga 2341 gactgcgcta ccccgtgacc cccgagctcc tggagcgcta caatacggaa agagatataa 2401 actccctcta cgacgtcagc agaatgtatg tggatcccag tgaaatcaat ccgtccatgc 2461 ctcagagaac cgtgaaagct ctgtatgact acaaagccaa gcgaagcgat gagctgagct 2521 tctgccgtgg tgccctcatc cacaatgtct ccaaggagcc cgggggctgg tggaaaggag 2581 actatggaac caggatccag cagtacttcc catccaacta cgtcgaggac atctcaactg 2641 cagacttcga ggagctagaa aagcagatta ttgaagacaa tcccttaggg tctctttgca 2701 gaggaatatt ggacctcaat acctataacg tcgtgaaagc ccctcaggga aaaaaccaga 2761 agtcctttgt cttcatcctg gagcccaagg agcagggcga tcctccggtg gagtttgcca 2821 cagacagggt ggaggagctc tttgagtggt ttcagagcat ccgagagatc acgtggaaga 2881 ttgacagcaa ggagaacaac atgaagtact gggagaagaa ccagtccatc gccatcgagc 2941 tctctgacct ggttgtctac tgcaaaccaa ccagcaaaac caaggacaac ttagaaaatc 3001 ctgacttccg agaaatccgc tcctttgtgg agacgaaggc tgacagcatc atcagacaga 3061 agcccgtcga cctcctgaag tacaatcaaa agggcctgac ccgcgtctac ccaaagggac 3121 aaagagttga ctcttcaaac tacgacccct tccgcctctg gctgtgcggt tctcagatgg 3181 tggcactcaa tttccagacg gcagataagt acatgcagat gaatcacgca ttgttttctc 3241 tcaacgggcg cacgggctac gttctgcagc ctgagagcat gaggacagag aaatatgacc 3301 cgatgccacc cgagtcccag aggaagatcc tgatgacgct gacagtcaag gttctcggtg 3361 ctcgccatct ccccaaactt ggacgaagta ttgcctgtcc ctttgtagaa gtggagatct 3421 gtggagccga gtatggcaac aacaagttca agacgacggt tgtgaatgat aatggcctca 3481 gccctatctg ggctccaaca caggagaagg tgacatttga aatttatgac ccaaacctgg 3541 catttctgcg ctttgtggtt tatgaagaag atatgttcag cgatcccaac tttcttgctc 3601 atgccactta ccccattaaa gcagtcaaat caggattcag gtccgttcct ctgaagaatg 3661 ggtacagcga ggacatagag ctggcttccc tcctggtttt ctgtgagatg cggccagtcc 3721 tggagagcga agaggaactt tactcctcct gtcgccagct gaggaggcgg caagaagaac 3781 tgaacaacca gctctttctg tatgacacac accagaactt gcgcaatgcc aaccgggatg 3841 ccctggttaa agagttcagt gttaatgaga accactccag ctgtaccagg agaaatgcaa 3901 caagaggtta agagagaaga gagtcagcaa cagcaagttt tactcataga agctggggta 3961 tgtgtgtaag ggtattgtgt gtgtgcgcat gtgtgtttgc atgtaggaga acgtgcccta 4021 ttcacactct gggaagacgc taatctgtga catcttttct tcaagcctgc catcaaggac 4081 atttcttaag acccaactgg catgagttgg ggtaatttcc tattattttc atcttggaca 4141 acttctaact tatatcttta tagaggattc cccaaaatgt gctcctcatt tttggcctct 4201 catgttccaa acctcattga ataaaaagca atgaaaacct tg // LOCUS HUMPLCB2A 4519 bp mRNA PRI 31-AUG-1992 DEFINITION Homo sapiens phospholipase C-beta-2 mRNA, complete cds. ACCESSION M95678 NID g190039 KEYWORDS phospholipase C-beta2. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4519) AUTHORS Park,D., Jhon,D.-Y., Kriz,R.W., Knopf,J. and Rhee,S.G. TITLE Cloning, sequencing, expression, and G-q-independent activation of phospholipase C-beta2 JOURNAL J. Biol. Chem. 267, 16048-16055 (1992) MEDLINE 92355553 FEATURES Location/Qualifiers source 1..4519 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HL-60" /cell_type="promyelocytic" CDS 165..3710 /codon_start=1 /evidence=experimental /product="phospholipase C-beta-2" /db_xref="PID:g190040" /translation="MSLLNPVLLPPKVKAYLSQGERFIKWDDETTVASPVILRVDPKG YYLYWTYQSKEMEFLDITSIRDTRFGKFAKMPKSQKLRDVFNMDFPDNSFLLKTLTVV SGPDMVDLTFHNFVSYKENVGKAWAEDVLALVKHPLTANASRSTFLDKILVKLKMQLN SEGKIPVKNFFQMFPADRKRVEAALSACHLPKGKNDAINPEDFPEPVYKSFLMSLCPR PEIDEIFTSYHAKAKPYMTKEHLTKFINQKQRDSRLNSLLFPPARPDQVQGLIDKYEP SGINAQRGQLSPEGMVWFLCGPENSVLAQDKLLLHHDMTQPLNHYFINSSHNTYLTAG QFSGLSSAEMYRQVLLSGCRCVELDCWKGKPPDEEPIITHGFTMTTDIFFKEAIEAIA ESAFKTSPYPIILSFENHVDSPRQQAKMAEYCRTIFGDMLLTEPLEKFPLKPGVPLPS PEDLRGKILIKNKKNQFSGPTSSSKDTGGEAEGSSPPSAPAVWAGEEGTELEEEEVEE EEEEESGNLDEEEIKKMQSDEGTAGLEVTAYEEMSSLVNYIQPTKFVSFEFSAQKNRS YVISSFTELKAYDLLSKASVQFVDYNKRQMSRIYPKGTRMDSSNYMPQMFWNAGCQMV ALNFQTMDLPMQQNMAVFEFNGQSGYLLKHEFMRRPDKQFNPFSVDRIDVVVATTLSI TVISGQFLSERSVRTYVEVELFGLPGDPKRRYRTKLSPSTNSINPVWKEEPFVFEKIL MPELASLRVAVMEEGNKFLGHRIIPINALNSGYHHLCLHSESNMPLTMPALFIFLEMK DYIPGAWADLTVALANPIKFFSAHDTKSVKLKEAMGGLPEKPFPLASPVASQVNGALA PTSNGSPAARAGAREEAMKEAAEPRTASLEELRELKGVVKLQRRHEKELRELERRGAR RWEELLQRGAAQLAELGPPGVGGVGACKLGPGKGSRKKRSLPREESAGAAPGEGPEGV DGRVRELKDRLELELLRQGEEQYECVLKRKEQHVAEQISKMMELAREKQAAELKALKE TSENDTKEMKKKLETKRLERIQGMTKVTTDKMAQERLKREINNSHIQEVVQVIKQMTE NLERHQEKLEEKQAACLEQIREMEKQFQKEALAEYEARMKGLEAEVKESVRACLRTCF PSEAKDKPERACECPPELCEQDPLIAKADAQESRL" BASE COUNT 1033 a 1314 c 1301 g 871 t ORIGIN 1 cagccagggc caccccaggg gctataagag caactagatt tctggagcag ctcggggatg 61 ggtgccattt gagcccagct tggctccccc tcctggctgg cctccttcct gcccttctgc 121 ctgcctgtgt ctgctgagat tctgcaaaga ggaacgttgg caccatgtct ctgctcaacc 181 ctgtcctgct gccccccaag gtgaaggcct atctgagcca aggggagcgc ttcatcaaat 241 gggatgatga aactacagtt gcctctccag ttatcctccg tgtggatcct aagggctact 301 acttatactg gacgtatcaa agtaaggaga tggagtttct ggatatcacc agcatccggg 361 atactcgctt tgggaagttt gccaagatgc ccaagagcca gaagctccgg gacgtcttca 421 acatggactt tcctgataac agtttcctgc tgaagacact cacggtggtg tccggcccgg 481 acatggtgga cctcaccttc cacaacttcg tctcctacaa ggagaacgtg ggcaaggcct 541 gggctgagga cgtactggcc ctagtcaaac atccgctgac ggccaacgcc tcccgcagca 601 ccttcctgga caagatcctt gtgaagctca agatgcagct caactctgaa gggaagattc 661 cggtgaagaa ctttttccag atgtttcctg ctgaccgcaa gcgggtggaa gctgctctca 721 gtgcctgcca cctccccaaa ggcaaaaatg acgccatcaa tcctgaggac ttcccagaac 781 ctgtctacaa gagtttcctc atgagcctct gtcctcggcc agaaatagat gagatcttca 841 cttcttacca tgctaaggcc aaaccctaca tgacgaagga gcacctgacc aaattcatca 901 accagaaaca gcgggactcc cggcttaact ccctgctgtt cccgccagca cggcctgacc 961 aggtgcaggg cctcatcgac aagtatgagc ccagtggcat caatgcacag aggggccagc 1021 tgtcacctga aggcatggtc tggtttctct gtgggccaga gaacagcgtg ctggcccagg 1081 acaagctgct gctccaccac gacatgacgc agccactcaa tcattacttc atcaactcgt 1141 cccacaacac ctacctgaca gccggccagt tctcaggcct ctcctcggct gagatgtacc 1201 gccaggtgct gctctctggc tgccgttgcg tggagctaga ctgctggaag gggaaacccc 1261 ctgacgagga gcccattatc acccatggct tcaccatgac cacagacatc ttcttcaaag 1321 aagcaattga ggctattgca gaaagtgcct ttaagacctc cccctatccc atcatcctgt 1381 cgtttgagaa ccatgtggac tcaccccgcc agcaggctaa gatggctgag tattgccgga 1441 cgatctttgg ggatatgctg ctcacagagc ccctggaaaa gttcccacta aaaccaggtg 1501 tccccctgcc cagccctgag gatctcaggg gcaagatcct catcaagaac aagaagaacc 1561 agttttctgg ccccacctcc tccagtaagg ataccggtgg ggaggctgag ggcagcagcc 1621 cacccagtgc ccctgcagtg tgggctggcg aggaagggac tgagctggag gaggaggagg 1681 tggaagagga agaggaggag gagtcaggaa acctggatga agaagagatt aagaagatgc 1741 agtcggatga gggcacagcg ggcctggaag tgacggctta tgaggagatg tccagcctag 1801 tcaattacat ccagcccacc aagttcgtct cctttgagtt ctctgcccaa aagaaccgaa 1861 gttatgtcat ctcgtccttc acagagctca aggcatatga cctgctctcc aaggcctcgg 1921 tgcagtttgt ggactacaac aagcgccaga tgagccgcat ttaccccaag ggaacccgca 1981 tggactcctc caactacatg ccccagatgt tctggaatgc tggatgccag atggttgccc 2041 tcaacttcca gacgatggac ttgcccatgc agcagaacat ggcagtattt gagttcaacg 2101 ggcagagcgg ctacctcctc aagcatgagt tcatgcgccg gccggacaag cagttcaacc 2161 ccttctcagt ggaccgcatc gacgtggtgg tggccaccac cctttccatt acggtgatct 2221 ctgggcagtt cctgtcagaa cgcagcgtgc gcacctatgt agaagtggag ctgtttggcc 2281 ttcctgggga ccccaagagg cgctatcgaa ctaagctgtc acccagtact aactccatca 2341 atcctgtctg gaaggaggag ccctttgtct ttgagaagat cttgatgcct gagctggcct 2401 ccctcagagt ggctgtgatg gaggaaggca acaagtttct tggacaccgc atcatcccca 2461 tcaatgccct aaattctggg taccaccacc tgtgcctgca cagtgagagc aacatgcccc 2521 tcaccatgcc tgcgctcttc atcttcctgg agatgaagga ctacatacct ggtgcttggg 2581 cagatctcac tgtggccctc gccaacccca ttaagttctt cagtgcccat gacacgaagt 2641 ctgtgaagct caaggaggcc atgggaggtc tgcctgagaa gcccttccca ctggcgagtc 2701 cagttgccag ccaggtcaat ggggcgttgg ccccaacgag caatgggtca ccagcagcca 2761 gggccggggc cagggaagag gctatgaaag aagctgcgga gccgcggacc gccagcctgg 2821 aggagctccg ggagctaaag ggcgtggtga agctgcagcg gcggcacgag aaggagctgc 2881 gagagttgga gcggcgcgga gcgcggcgct gggaggagct gctgcagcgg ggcgcggcgc 2941 agctggcgga gctcgggcca ccgggcgtgg ggggcgtcgg ggcctgcaag ctcggtcccg 3001 gcaagggctc tcgcaagaag aggagcctgc cccgcgagga gagcgccgga gccgcgccgg 3061 gcgagggccc tgagggcgtg gacgggcgcg tgcgggagct gaaagacagg ctggagctgg 3121 agctgctgcg gcagggcgag gagcagtacg agtgcgttct gaagcgcaag gagcagcacg 3181 tggccgagca aatctccaaa atgatggagc tggccagaga gaaacaggcg gcagagctga 3241 aggccctgaa ggagacgtcg gagaacgaca ccaaagagat gaagaaaaag ctggagacaa 3301 agagactgga gcggatccag ggcatgacca aagtcaccac agacaagatg gcccaggaga 3361 ggttgaagag agagattaac aactcccaca tccaggaagt agtgcaggtg atcaagcaga 3421 tgacggagaa cttggagagg caccaggaga agctggagga gaagcaggcg gcttgcctgg 3481 aacagatacg ggagatggaa aagcagttcc agaaggaggc gctggcagag tacgaggcca 3541 ggatgaaggg tctggaggca gaggtgaagg agtcggtgag ggcctgcctc aggacctgct 3601 ttccctccga ggccaaggac aagcctgaga gggcctgcga gtgcccccca gagctgtgtg 3661 agcaggaccc actcatagca aaggcagatg cccaggagag ccgcctctga tgcccccatc 3721 ccactgggac atttagcaag gaggttcagc cccttctctg ggatgtggtt ctattccccc 3781 aggaaaaagg agccccagcc ttctgaggct gtgggaacct gtggctgcct tggacgctgc 3841 agccccctcc tcaacggcca ggccagagtc tgagacagga cccaggcacc ctcacggcag 3901 ggcctctctg gggcctagaa gtcttctcaa gctgacttcc tacctccccc tccatctcta 3961 gataagtgtc atatatttgt tgagggcaaa agactatgga ctggaaggca gaaagtggga 4021 tcctggcccc actctgcctt tcctattgag caaccagctc tgggctcagt ttcctcacct 4081 ggagggttgg acccactcac ctccactcta gccccgaacc ctcctgcccc aggcttccag 4141 gccccatcag gcctgccctg agttggcctt gtccactcct tgaggcagat cctggcacta 4201 cctcacagct ccctggggac ggccactccc tggctgaggg ccctcccctc ccctctgcct 4261 gtccggaacg ggaggctgaa atggaaaagc tgccttggcc ctgcttggct gagtcacaag 4321 gggcagtggg ctcttgggtg ctgttccacc ctgaccctgg ctcacccctc ttcctaggcc 4381 tgggggcagg cagttcctac catgtacccc tctcaggctg cctgcctgac aaggtcagca 4441 tcatttgctc tcctgaattt atgaggttta tttatttttc tctttcctac tcctattaaa 4501 gaacctcgtc ccagtgaaa // LOCUS HUMPLCB4M 3707 bp mRNA PRI 05-JUN-1996 DEFINITION Homo sapiens phospholipase C beta 4 (PLCB4) mRNA, complete cds. ACCESSION L41349 NID g762825 KEYWORDS phospholipase C-beta4. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3707) AUTHORS Alvarez,R.A., Ghalayini,A.J., Xu,P., Hardcastle,A., Bhattacharya,S., Rao,P.N., Pettenati,M.J., Anderson,R.E. and Baehr,W. TITLE cDNA sequence and gene locus of the human retinal phosphoinositide-specific phospholipase-C beta 4 (PLCB4) JOURNAL Genomics 29 (1), 53-61 (1995) MEDLINE 96079091 REFERENCE 2 (bases 1 to 3707) AUTHORS Alvarez,R.A. TITLE Direct Submission JOURNAL Submitted (06-APR-1995) Richard A. Alvarez, Ophthalmology, Univ. Oklahoma, Oklahoma City, OK 73104, USA FEATURES Location/Qualifiers source 1..3707 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="retina" mRNA 1..>3707 /gene="PLCB4" gene 1..3707 /gene="PLCB4" 5'UTR 1..230 /gene="PLCB4" CDS 231..3299 /gene="PLCB4" /standard_name="PLCb4" /note="putative" /codon_start=1 /function="hydrolysis of pip2" /product="phospholipase C beta 4" /db_xref="PID:g762826" /translation="MNNNWNVCFFLFCPSITRTFASGKTEKVIFQALKELGLPSGKND EIEPTAFSYEKFYELTQKICPRTDIEDLFKKINGDKTDYLTVDQLVSFLNEHQRDPRL NEILFPFYDAKRAMQIIEMYEPDEDLKKKGLISSDGFCRYLMSDENAPVFLDRLELYQ EMDHPLAHYFISSSHNTYLTGRQFGGKSSVEMYRQVLLAGCRCVELDCWDGKGEDQEP IITHGKAMCTDILFKDVIQAIKETAFVTSEYPVILSFENHCSKYQQYKMSKYCEDLFG DLLLKQALESHPLEPGRPLPSPNDLKRKILIKNKRLKPEVEKKQLEALRSMMEAGESA SPANILEDDNEEEIESADQEEEAHPEFKFGNELSADDLGHKEAVANSVKKGLVTVEDE QAWMASYKYVGATTNIHPYLSTMINYAQPVKFQGFHVAEERNIHYNMSSFNESVGLGY LKTHAIEFVNYNKRQMSRIYPKGGRVDSSNYMPQIFWNAGCQMVSLNYQTPDLAMQLN QGKFEYNGSCGYLLKPDFMRRPDRTFDPFSETPVDGVIAATCSVQVISGQFLSDKKIG TYVEVDMYGLPTDTIRKEFRTRMVMNNGLNPVYNEESLVFRKVILPDLAVLRIAVYDD NNKLIGQRIPPLDGLQAGYRHISLRNEGNKPLSLPTIFCNIVLKTYVPDGFGDIVDAL SDPKTFLSITEKRADQMRAMGIETSDIADVPSDTSKNDKKGKANTAKANVTPQSSSEL RPTTTAALPSGVEAKKGIELIPQVRIEDLKQMKAYLKHLKKQQKELNSLKKKHAKEHS TMQKLHCTQVDKIVAQYDKEKSTHEKILEKAMKKKGGSNCLEMKKETEIKIQTLTSDH KSKVKEIVAQHTKEWSEMINTHSAEEQEIRDLHLSQQCELLKKLLINAHEQQTQQLKL SHDRESKEMRAHQAKISMENSKAISQDKSIKNKAERERRVRELNSSNTKKFLEERKRL AMKQSKEMDQLKKVQLEHLEFLEKQNEQAKEMQQMVKLEAEMDRRPATVV" 3'UTR 3300..>3707 /gene="PLCB4" BASE COUNT 1263 a 757 c 793 g 894 t ORIGIN 1 gtgtgtccct ccagtgccgc ttgccccttg ttctcccaag caccaggggg acccctgtct 61 tagattcaag ttgcatcact agactaacag ctctgggaac agaggacact gaggactagt 121 ttgggaaatg aaaaacatct gcaatgagag gtaattttat tctcctcttc cttttaccct 181 agtctcactc tccatagtag gacctggatc tttgcatcag agaaaacata atgaacaata 241 actggaatgt gtgtttcttt cttttctgcc ctagtattac tagaacattt gcatcgggaa 301 aaacagaaaa ggtgatcttt caagcactca aggagttagg tcttcccagt ggaaagaatg 361 atgaaattga gcccacagca ttttcttatg aaaagttcta tgaactgaca caaaagattt 421 gtcctcggac agatatagaa gatcttttca aaaaaatcaa tggagacaaa actgattatt 481 taacggtaga ccaattagtg agctttctaa atgaacatca acgagatcct cgattgaatg 541 aaattttatt tccattttat gatgccaaaa gggcaatgca gatcattgag atgtatgaac 601 ctgatgaaga tttgaagaaa aaaggcctta tatcaagtga tgggttttgc agatatctga 661 tgtcagatga aaacgcccca gtcttcctag atcgtttaga actttaccaa gaaatggacc 721 atcctctggc tcactacttc atcagttctt cccataacac ttatctcact ggcagacagt 781 tcggcgggaa gtcttcggta gaaatgtaca gacaggttct cctggctggt tgcagatgtg 841 ttgaacttga ctgctgggat ggaaaaggtg aggaccaaga accaataata actcatggaa 901 aagcaatgtg tacagatatc ctttttaagg atgtaattca agccatcaag gaaactgcat 961 ttgtcacatc agaatatcct gtaattctct cctttgaaaa tcactgcagc aaatatcaac 1021 agtacaagat gtccaaatat tgcgaagatc tatttgggga tctcctgttg aaacaagcac 1081 ttgaatcaca tccactggaa ccaggcagac ctttgccatc ccccaatgac ctcaaaagaa 1141 aaatactcat aaaaaacaag cggctgaaac ctgaagttga aaaaaaacag ctggaagctt 1201 tgagaagcat gatggaagct ggagaatctg cctccccagc aaacatctta gaggacgata 1261 atgaagagga gatcgaaagt gctgaccaag aggaggaagc tcaccccgaa ttcaaatttg 1321 gaaatgaact ttctgctgat gacttgggtc acaaggaagc tgttgcaaat agcgtcaaga 1381 agggcctggt cactgtagaa gatgagcagg cgtggatggc atcttataaa tatgtaggtg 1441 ctaccactaa tatccatcca tatttgtcca caatgatcaa ctatgcccag cctgtaaagt 1501 ttcaaggttt ccatgtggca gaagaacgca atattcatta taacatgtct tcttttaatg 1561 aatcagtcgg tcttggctac ttgaagacac atgcaattga atttgtcaat tataacaaac 1621 ggcaaatgag tcgcatttac cccaagggag gccgagtcga ttccagtaat tacatgcctc 1681 agattttctg gaacgctggc tgccagatgg tttcactgaa ctatcaaacc ccagatttag 1741 cgatgcaatt gaatcaggga aaatttgagt ataatggatc gtgcgggtac cttctcaaac 1801 cagatttcat gaggcggcct gatcgaacat ttgacccctt ctctgaaacg cctgttgatg 1861 gggttattgc agccacttgc tcagtgcagg ttatatcagg tcaattctta tcagataaga 1921 aaattggcac ctacgtagag gtggatatgt atgggttgcc cactgacacc atacgtaagg 1981 aattccgaac tcgcatggtt atgaataatg gactcaatcc agtttacaat gaagagtcac 2041 ttgtatttcg gaaggtgatc ctgccggacc tggctgtctt gagaatagct gtgtatgatg 2101 ataacaacaa gctgattggc cagaggattc ctccgcttga tggcctccaa gccggatatc 2161 gacacatttc ccttcgaaat gagggaaata aaccattatc actaccaaca attttctgca 2221 atattgttct taaaacatat gtgcctgatg gatttggaga tatcgtggat gctttatcag 2281 atccaaagac atttctctca attacagaaa agagagcaga ccaaatgaga gctatgggca 2341 ttgagactag tgacatagcc gacgtgccca gtgacacttc caaaaatgac aagaaaggaa 2401 aggccaacac cgccaaagca aatgtgaccc ctcagagtag ctctgagctc agaccaacca 2461 ccacggctgc cctgccgtct ggtgtggaag ccaagaaagg tattgaactt atccctcaag 2521 taaggataga agacttaaag cagatgaagg cttacttgaa gcatttaaag aaacagcaga 2581 aggagctaaa ttctttaaag aagaaacatg caaaggaaca cagtaccatg cagaagttac 2641 actgcacgca agttgacaaa attgtggcac agtatgacaa agagaagtcg actcatgaga 2701 aaatcctaga gaaggcaatg aagaagaaag ggggaagtaa ttgtcttgaa atgaagaaag 2761 aaacagaaat taaaattcag acgctgacat cagatcacaa atctaaggtc aaggagattg 2821 tagcacagca cacaaaggaa tggtcagaaa tgatcaatac tcacagtgct gaggagcaag 2881 aaatccgaga cctgcacctc agccagcagt gtgagctgct gaaaaagcta ctcatcaatg 2941 cccacgagca gcaaacccag cagctgaaac tgtcccatga cagggaaagc aaggaaatgc 3001 gagcacacca ggctaagatt tctatggaaa atagcaaagc catcagccaa gataaatcta 3061 tcaagaataa agcagaacgg gaaaggcgag tcagggagtt aaacagcagc aacactaaaa 3121 agtttctgga agaaagaaag agacttgcca tgaagcagtc caaagaaatg gatcagttga 3181 aaaaagtcca gcttgaacat ctagaattcc tagagaaaca gaatgagcag gcgaaggaga 3241 tgcagcagat ggtgaaattg gaagccgaga tggaccgcag accagcaaca gtagtatgaa 3301 actccaaaat gcaaactgaa gcagcaaacc cacaaagcat caaaagactc actcacaaaa 3361 cttctgaaca caaactccat ggatgaaagc tgtttatttt gtttccttta tgtgtaaaca 3421 agatgatatc tgaaaccaga gagacttgga atgtctgact gacttctatt taacagcttg 3481 agtattgcat ttccttggcc aaacaaaaat agctacaaat ccacaaaaat ttactattcc 3541 agtaaggcag agtccaacca ttgataatac aacttaaaca tgtttgctat aaaataccat 3601 cacaagtaaa tgagcttggt gtgaacaact cttcctttgt gatgccttag gacatgttgt 3661 aactgcaggc aaaacaaaca aaacagtgca ttagcaattt catagca // LOCUS HUMPLCE 4565 bp mRNA PRI 02-SEP-1996 DEFINITION Human mRNA for phospholipase C, complete cds. ACCESSION D42108 NID g780121 KEYWORDS PLC-L (PLC-epsilon); phospholipase C. SOURCE Homo sapiens cDNA to mRNA, clone_lib:adult brain, Hela, fetal heart clone:HOP. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Kohno,T., Otsuka,T., Takano,H., Yamamoto,T., Hamaguchi,M., Terada,M. and Yokota,J. TITLE Identification of a novel phospholipase C family gene at chromosome 2q33 that is homozygously deleted in human small cell lung carcinoma JOURNAL Hum. Mol. Genet. 4 (4), 667-674 (1995) MEDLINE 95359973 REFERENCE 2 (bases 3371 to 3534) AUTHORS Otsuka,T., Kohno,T., Mori,M., Noguchi,M., Hirohashi,S. and Yokota,J. TITLE Deletion mapping of chromosome 2 in human lung carcinoma JOURNAL Genes Chromosomes Cancer 16 (2), 113-119 (1996) MEDLINE 96415751 REFERENCE 3 (bases 1 to 3370; 3235 to 4565) AUTHORS Kohno,T. JOURNAL Unpublished (1996) REFERENCE 4 (bases 1 to 4565) AUTHORS Kohno,T. TITLE Direct Submission JOURNAL Submitted (14-NOV-1994) to the DDBJ/EMBL/GenBank databases. Takashi Kohno, National Cancer Center Research Institute, Biology Division; 5-1-1, Tsukiji, Chuo-ku, Tokyo 104, Japan (E-mail:tkohno@gan.ncc.go.jp, Tel:03-3542-2511(ex.4651), Fax:03-3542-0807) FEATURES Location/Qualifiers source 1..4565 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="2" /clone="HOP" /clone_lib="adult brain, Hela, fetal heart" 5'UTR 1..203 gene 204..3197 /gene="PLC-L (PLC-epsilon)" CDS 204..3197 /gene="PLC-L (PLC-epsilon)" /codon_start=1 /product="Phospholipase C" /db_xref="PID:d1008271" /db_xref="PID:g780122" /translation="MPSEKKISSANDCISFMQAGCELKKVRPNSRIYNRFFTLDTDLQ ALRWEPSKKDLEKAKLDISAIKEIRLGKNTETFTNNGLADQICEDCAFSILHGENYES LDLVANSADVANIWVSGLRYLVSRSKQPLDFMEGNQNTPRFMWLKTVFEAADVDGNGI MLEDTSVELIKQLNPTLKEAKIRLKFKEIQKSKEKLTTRVTEEEFCEAFCELCTRPEV YFLLVQISKNKEYLDANDLMLFLEAEQGVTHITEDICLDIIRRYELSEEGRQKGFLAI DGFTQYLLSSECDIFDPEQKKVAQDMTQPLSHYYINASHNTYLIEDQFRGPADINGYI RALKMGCRSVELDVSDGSDNEPILCNRNNMTTHVSFRSVIEVINKFAFVASEYPLILC LGNHCSLPQQKVMAQQMKKVFGNKLYTEAPLPSESYLPSPEKLKRMIIVKGKKLPSDP DVLEGEVTDEDEEAQMSRRMSVDYNGEQKQIRLCRELSDLVSICKSVQYRDFELSMKS QNYWEMCSFSETEASRIANEYPEDFVNYNKKFLSRIYPSAMRIDSSNLNPQDFWNCGC QIVAMNFQTPGPMMDLHTGWFLQNGGCGYVLRPSIMRDEVSYFSANTKGILPGVSPLA LHIKIISGQNFPKPKGACAKGDVIDPYVCIEIHGIPADCSEQRTKTVQQNSDNPIFDE TFEFQVNLPELAMIRFVVLDDDYIGDEFIGQYTIPFECLQPGYRHVPLRSFVGDIMEH VTLFVHIAITNRSGGGKAQKRSLSVRMGKKVREYTMLRNIGLKTIDDIFKIAVHPLRE AIDMRENMQNAIVSIKELCGLPPIASLKQCLLTLSSRLITSDNTPSVSLVMKDSFPYL EPLGAIPDVQKKMLTAYDLMIQESRFLIEMADTVQEKIVQCQKAGMEFHEELHNLGAK EGLKGRKLNKATESFAWNITVLKGQGDLLKNAKNEAIENMKQIQLACLSCGLSKAPSS SAEAKSKRSLEAIEEKESSEENGKL" 3'UTR 3198..4565 BASE COUNT 1439 a 815 c 992 g 1319 t ORIGIN Chromosome 2. 1 gaattccggg cagcatcatc aaggcacata gattctctct ttgaacaatg cagctgctgc 61 aggggatggg gaagggatgg cactggcaat tcaagactgc ctttcctact tcttcagtgc 121 ctctttcacc aatatgaagt taaaaccagg atccttcaaa ccaaaaatgt ggtggaagaa 181 agaaaaccgt gtctttcagc agcatgccat cggaaaagaa aattagcagt gcaaatgact 241 gcatcagctt catgcaagct ggctgtgagt tgaagaaagt ccggccaaat tctcgcattt 301 acaaccgttt tttcactctg gacacagacc ttcaagctct tcgctgggaa ccttcaaaga 361 aagacctcga gaaagccaag cttgatattt ctgccataaa agagatcaga ctggggaaaa 421 acacggaaac atttacaaac aatggccttg ctgaccagat ctgtgaggac tgtgcctttt 481 ccatactcca cggggaaaac tatgagtctc tggacctagt tgccaattca gcagatgtgg 541 caaacatctg ggtgtctggg ttacggtacc tggtttctcg aagtaagcag cctcttgatt 601 ttatggaggg caaccagaac acaccacggt tcatgtggtt gaaaacagtg tttgaagcag 661 cagatgttga tgggaatggg attatgttgg aagacacctc tgtagagtta ataaaacaac 721 tcaaccctac tctgaaggaa gccaagatca ggttaaagtt taaagaaatc cagaagagca 781 aggaaaaact aaccacccgc gtgaccgaag aggaattttg tgaagctttt tgtgaacttt 841 gcaccaggcc agaagtgtat ttcttacttg tacagatatc taaaaacaaa gaatatttgg 901 atgccaatga tctcatgctc tttttagaag ctgagcaagg agtcacccat atcaccgagg 961 atatatgctt agacatcata aggagatacg aactttctga agagggacgt caaaaagggt 1021 ttcttgcaat tgatggcttt acccagtatt tattgtcatc agaatgtgac atttttgatc 1081 ctgagcaaaa gaaggttgcc caagatatga cccagccatt atctcactac tatatcaatg 1141 cctctcataa cacctatcta atagaagacc agttcagggg gccagctgac atcaatgggt 1201 acattagagc tttgaaaatg ggctgtcgaa gcgttgaact cgatgtaagt gatggttcag 1261 ataatgaacc aatcctttgt aatcgaaata acatgacaac ccatgtttcc tttcgaagtg 1321 tcatagaggt aataaataaa tttgcctttg ttgcttctga atacccactc attctttgct 1381 tgggaaatca ctgctccttg ccgcagcaga aggtaatggc tcaacagatg aaaaaggtct 1441 ttggcaataa actctatact gaagcacctt tgccctcaga atcctacctc ccatcaccag 1501 aaaaattaaa aagaatgatc attgtgaaag gaaagaagtt gccttctgat ccagatgtgt 1561 tagaaggaga agtaacagat gaagatgaag aagctcaaat gtctcgaagg atgtcggtag 1621 attacaatgg tgagcagaag caaatccgac tctgtaggga gctctctgat ttggtgtcta 1681 tttgtaaatc tgttcaatac agggattttg aactatctat gaaaagccaa aactattggg 1741 aaatgtgttc atttagtgaa acagaggcca gccgcattgc aaatgagtac ccagaggatt 1801 ttgttaatta taataagaag ttcttatcaa gaatctatcc aagtgccatg aggatcgatt 1861 ccagtaactt gaatccacag gacttttgga attgtggctg tcagattgta gcaatgaatt 1921 ttcagactcc gggtccaatg atggaccttc acacgggctg gtttcttcaa aacgggggat 1981 gtggttatgt tctaaggccg tctataatgc gagatgaagt ttcttacttc agcgcaaata 2041 caaagggcat tctacctggg gtgtctcctc tagctcttca tatcaagatc atcagtggtc 2101 agaatttccc aaagcccaag ggagcttgtg ccaaagggga tgtcatagat ccctatgttt 2161 gtatagagat acacggaatt ccagcggatt gttcggaaca aagaactaaa actgtacagc 2221 aaaacagtga taatcctatt tttgatgaaa cttttgagtt ccaagtaaac ctacctgagc 2281 tggccatgat ccgttttgtt gttctggatg atgactacat tggggatgag tttatagggc 2341 aatatacgat accatttgaa tgtttgcagc ctggatatcg gcatgttccc ctgcgttctt 2401 ttgtgggtga catcatggag cacgtaaccc tttttgtcca catagcaata actaatcgaa 2461 gtggaggagg aaaggcacag aagcgcagtc tttcagtgag aatggggaag aaagttcggg 2521 aatataccat gctcaggaat atcggtctta aaaccattga tgacatcttt aaaatagcgg 2581 ttcatccatt acgagaagcc atagatatga gagaaaatat gcagaatgca atcgtgtcta 2641 ttaaggaact atgtggactc cctccaattg ccagtctgaa gcagtgcctg ttaactctgt 2701 catctcggct catcaccagt gacaatactc cttcagtctc acttgtgatg aaagacagct 2761 ttccttacct ggagcctctg ggtgcaattc cagatgtgca gaaaaagatg ctgactgctt 2821 atgatctgat gattcaagag agccggtttc tcatagaaat ggcggacaca gtccaggaaa 2881 agattgtaca gtgtcagaaa gcagggatgg agttccatga agaacttcat aatttggggg 2941 caaaagaagg cttgaaggga agaaaactca acaaagcaac tgagagcttt gcttggaaca 3001 ttacagtatt gaagggccaa ggagatctgt tgaagaatgc caagaatgaa gctatagaaa 3061 acatgaagca gatccagctg gcatgcctgt cctgtggact gagtaaagcc cccagcagca 3121 gtgctgaggc caagagcaag cgcagcctgg aagccataga ggagaaggaa agtagtgagg 3181 agaatgggaa gctgtgactc tgggcattat cgacacgttc acccatctta tcaaggactc 3241 tggtttctca ttcttgtttt ctttctttaa atgttttata agttcacaaa atggtgccct 3301 atatggggta ttggacatag atattttcac aatgtcagta tttcagtgta gttaatttat 3361 ctaaattaaa gcctttagta tcagtgtttt aaattctgag acatgtgtca acacccctgt 3421 gtggatgcct gtggaagagt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtggca 3481 gagagagaga aagagagaga gagaaattct gttaaaatct attctgtgtt gcattattca 3541 tttagtgagt tattccttga tcattttggg acaattgttt taatctgaaa ttctaaagag 3601 cacttactgt aacctgttgc tgtgtttaat ttgacttctc tgcctttgac atttaattta 3661 gtgatcttag catagcttat tattgaagga agccaaattt atcaaagcat acatgttttg 3721 gtagattaaa tatagattag aaaaattcct aagaatcaga gtagaaataa aagtgaatga 3781 aagattaaac agatgatgag aatttctaaa aagattagca aggtcatttc ttcagtcaga 3841 aaactttaaa aaatatttat taaataaaat caatttttag gaagttttct gtagtcattt 3901 actaaacata tgatttcact agaaaagctg atcataagtg aatttatacc tacctgtgtg 3961 gctactctga aacacactga aagctctgtt gcaattagga tttgatgtga cataatattg 4021 ttgtataatt tcgagatttg taggaaggtc tcattcttcc aagctgagag tctagcactc 4081 attttctata acagatatgg cagcttagag gtgttggctt tgtttggatg taattttagg 4141 ggtactaaaa tttaaaattt aaagataatt gttcaacaat atcatatcat cacattgagc 4201 tgatataaat tctgtgggtc cgataatatc tttgtgataa tttaagagct aaccagttac 4261 cacacatcta tgatataacc ctaacacaca cagaaagcat acatgcaaaa agaaatgact 4321 aattagggta catttataat tgcatctagg taatttttac cctaatgtct tcataaagta 4381 cttgagtgta atgtttgtta cctccaacag aactaaatgt tctatggtta tgaaagaata 4441 tatttattta aagcattgct tttattttga aaagcttctt aattaatttg attaacaaat 4501 atgctaattt ggggaaacct agagaagata attgttgaaa ttttgcaaat ataaacatct 4561 cctat // LOCUS HUMPLGL 524 bp mRNA PRI 22-AUG-1996 DEFINITION Human plasminogen-like protein (PLGL) mRNA, complete cds. ACCESSION M93143 NID g1502373 KEYWORDS plasminogen. SOURCE human cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 524) AUTHORS Weissbach,L. and Treadwell,B.V. TITLE A plasminogen-related gene is expressed in cancer cells JOURNAL Biochem. Biophys. Res. Commun. 186 (2), 1108-1114 (1992) MEDLINE 92359990 FEATURES Location/Qualifiers source 1..524 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="female" /tissue_type="carcinoma" /chromosome="2" /map="2p11-q11" gene 68..358 /gene="PLGL" CDS 68..358 /gene="PLGL" /codon_start=1 /product="plasminogen-like protein" /db_xref="PID:g190072" /db_xref="GDB:G00-120-300" /translation="MEHKEVVLLLLLFLKSGQGEPLDDYVNTQGPSLFSVTKKQLGAG SREECAAKCEEDKEFTCRAFQYHSKEQQCVIMAENRKSSIIIRMRDAVLFEK" BASE COUNT 164 a 107 c 123 g 130 t ORIGIN 1 tctcatgtaa gtcaacaaca tcctgggatt gggacacact ttctgggcac tgctggccag 61 tcccaaaatg gaacataagg aagtggttct tctacttctt ttatttctga aatcaggtca 121 aggagagccc ctggatgact atgtgaatac ccaggggcct tcactgttca gtgtcactaa 181 gaagcagctg ggggcaggaa gcagagaaga atgtgcagca aaatgtgaag aggacaaaga 241 attcacctgc agggcattcc aatatcacag taaagagcaa cagtgtgtga taatggctga 301 aaacaggaag tcctccataa tcattaggat gagagatgca gttttatttg aaaagtaaat 361 gtatctttca gagtgcaaga ctgggaatgg aaagaactac agaggacgat gtccaaaaca 421 aaaaatggca tcacctgtca aaaatggagt tccacttctc cccgcagacc taggtcagac 481 tttccctttc atctttgtgt tcatctactg taaagtgtcc gtct // LOCUS HUMPLOD 3115 bp mRNA PRI 07-JAN-1995 DEFINITION Homo sapiens lysyl hydroxylase (PLOD) mRNA, complete cds. ACCESSION L06419 NID g190073 KEYWORDS lysyl hydroxylase; procollagen-lysine,2-oxoglutarate 5-dioxygenase. SOURCE Homo sapiens placenta cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3115) AUTHORS Hautala,T., Byers,M.G., Eddy,R.L., Shows,T.B., Kivirikko,K.I. and Myllyla,R. TITLE Cloning of human lysyl hydroxylase: complete cDNA-derived amino acid sequence and assignment of the gene (PLOD) to chromosome 1p36.3----p36.2 JOURNAL Genomics 13 (1), 62-69 (1992) MEDLINE 92250066 FEATURES Location/Qualifiers source 1..3115 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /map="p36.3-p36.2" gene 201..2384 /gene="PLOD" CDS 201..2384 /gene="PLOD" /standard_name="procollagen-lysine,2-oxoglutarate 5-dioxygenase" /EC_number="1.14.11.4" /codon_start=1 /function="Hydroxylates lysines in collagen like protein sequences" /db_xref="GDB:G00-127-821" /evidence=experimental /product="lysyl hydroxylase" /db_xref="PID:g190074" /translation="MRPLLLLALLGWLLLAEAKGDAKPEDNLLVLTVATKETEGFRRF KRSAQFFNYKIQALGLGEDWNVEKGTSAGGGQKVRLLKKALEKHADKEDLVILFTDSY DVLFASGPRELLKKFRQARSQVVFSAEELIYPDRRLETKYPVVSDGKRFLGSGGFIGY APNLSKLVAEWEGQDSDSDQLFYTKIFLDPEKREQINITLDHRCRIFQNLDGALDEVV LKFEMGHVRARNLAYDTLPVLIHGNGPTKLQLNYLGNYIPRFWTFETGCTVCDEGLRS LKGIGDEALPTVLVGVFIEQPTPFVSLFFQRLLRLHYPQKHMRLFIHNHEQHHKAQVE EFLAQHGSEYQSVKLVGPEVRMANADARNMGADLCRQDRSCTYYFSVDADVALTEPNS LRLLIQQNKNVIAPLMTRHGRLWSNFWGALSADGYYARSEDYVDIVQGRRVGVWNVPY ISNIYLIKGSALRGELQSSDLFHHSKLDPDMAFCANIRQQDVFMFLTNRHTLGHLLSL DSYRTTHLHNDLWEVFSNPEDWKEKYIHQNYTKALAGKLVETPCPDVYWFPIFTEVAC DELVEEMEHFGQWSLGNNKDNRIQGGYENVPTIDIHMNQIGFEREWHKFLLEYIAPMT EKLYPGYYTRAQFDLAFVVRYKPDEQPSLMPHHDASTFTINIALNRVGVDYEGGGCRF LRYNCSIRAPRKGWTLMHPGRLTHYHEGLPTTRGTRYIAVSFVDP" polyA_site 3115 /gene="PLOD" /note="G00-127-821" BASE COUNT 667 a 927 c 880 g 641 t ORIGIN 1 ccaccatatc ggtcccgtat ttcacattga taaggtcctg tttcatttct cgtgacattg 61 ggtagaatga ggatcctgtt ttcaatgggt cgctttaccc tgggactgac agggaggctc 121 tgaccattta gccaccaaat gtaggtgtag ttctcactct taggttcacc ccgcggccga 181 tcgtccccca tacctcggcc atgcggcccc tgctgctact ggccctgctg ggctggctgc 241 tgctggccga agcgaagggc gacgccaagc cggaggacaa ccttttagtc ctcacggtgg 301 ccactaagga gaccgaggga ttccgtcgct tcaagcgctc agctcagttc ttcaactaca 361 agatccaggc gcttggccta ggggaggact ggaatgtgga gaaggggacg tcggcaggtg 421 gagggcagaa ggtccggctg ctgaagaaag ctctggagaa gcacgcagac aaggaggatc 481 tggtcattct cttcacagac agctatgacg tgctgtttgc atcggggccc cgggagctcc 541 tgaagaagtt ccggcaggcc aggagccagg tggtcttctc tgctgaggag ctcatctacc 601 cagaccgcag gctggagacc aagtatccgg tggtgtccga tggcaagagg ttcctgggct 661 ctggaggctt catcggttat gcccccaacc tcagcaaact ggtggccgag tgggagggcc 721 aggacagcga cagcgatcag ctgttttaca ccaagatctt cttggacccg gagaagaggg 781 agcagatcaa tatcaccctg gaccaccgct gccgtatctt ccagaacctg gatggagcct 841 tggatgaggt cgtgctcaag tttgaaatgg gccatgtgag agcgaggaac ctggcctatg 901 acaccctccc ggtcctgatc catggcaacg ggccaaccaa gctgcagttg aactacctgg 961 gcaactacat cccgcgcttc tggaccttcg aaacaggctg caccgtgtgt gacgaaggct 1021 tgcgcagcct caagggcatt ggggatgaag ctctgcccac ggtcctggtc ggcgtgttca 1081 tcgaacagcc cacgccgttt gtgtccctgt tcttccagcg gctcctgcgg ctccactacc 1141 cccagaaaca catgcgactt ttcatccaca accacgagca gcaccacaag gctcaggtgg 1201 aagagttcct ggcacagcat ggcagcgagt accagtctgt gaagctggtg ggccctgagg 1261 tgcggatggc gaatgcagat gccaggaaca tgggcgcaga cctgtgccgg caggaccgca 1321 gctgcaccta ctacttcagc gtggatgctg acgtggccct gaccgagccc aacagcctgc 1381 ggctgctgat ccaacagaac aagaatgtca ttgccccgct gatgacccgg catgggaggc 1441 tgtggtcgaa cttctggggg gctctcagtg cagatggcta ctatgcccgt tccgaggact 1501 acgtggacat tgtgcagggg cggcgtgttg gtgtctggaa tgtgccctat atttcaaaca 1561 tctacttgat caagggcagt gccctgcggg gtgagctgca gtcctcagat ctcttccacc 1621 acagcaagct ggaccccgac atggccttct gtgccaacat ccggcagcag gatgtgttca 1681 tgttcctgac caaccggcac acccttggcc atctgctctc cctagacagc taccgcacca 1741 cccacctgca caacgacctc tgggaggtgt tcagcaaccc cgaggactgg aaggagaagt 1801 acatccacca gaactacacc aaagccctgg cagggaagct ggtggagacg ccctgcccgg 1861 atgtctattg gttccccatc ttcacggagg tggcctgtga tgagctggtg gaggagatgg 1921 agcactttgg ccagtggtct ctgggcaaca acaaggacaa ccgcatccag ggtggctacg 1981 agaacgtgcc gactattgac atccacatga accagatcgg ctttgagcgg gagtggcaca 2041 aattcctgct ggagtacatt gcgcccatga cggagaagct ctaccccggc tactacacca 2101 gggcccagtt tgacctggcc tttgtcgtcc gctacaagcc tgatgagcag ccctcactga 2161 tgccacacca tgatgcctcc accttcacca tcaacatcgc cctgaaccga gtcggggtgg 2221 attacgaggg cgggggctgt cggttcctgc gctacaactg ttccatccga gccccaagga 2281 agggctggac cctcatgcac cctggacgac tcacgcatta ccatgagggg ctccccacca 2341 ccaggggcac ccgctacatc gcagtctcct tcgtcgatcc ctaattggcc aggcctgacc 2401 ctcttggacc tttcttcttt gccgacaacc actgcccagc agcctctggg acctcggggt 2461 cccagggaac ccagtccagc ctcctggctg ttgacttccc attgctcttg gagccaccaa 2521 tcaaagagat tcaaagagat tcctgcaggc cagaggccgg aacacacctt tatggctggg 2581 gctctccgtg gtgttctgga cccagcccct ggagacacca ttcactttta ctgctttgta 2641 gtgactcgtg ctctccaacc tgtcttcctg aaaaaccaag gcccccttcc cccacctctt 2701 ccatggggtg agacttgagc agaacagggg cttccccaag ttgcccagaa agactgtctg 2761 ggtgagaagc catggccaga gcttctccca ggcacaggtg ttgcaccagg gacttctgct 2821 tcaagttttg gggtaaagac acctggatca gactccaagg gctgccctga gtctgggact 2881 tctgcctcca tggctggtca tgagagcaaa ccgtagtccc ctggagacag ccactccaga 2941 gaacctcttg ggagacagaa gaggcatctg tgcacagctc gatcttctac ttgcctgtgg 3001 ggaggggagt gacaggtcca cacaccacac tgggtcaccc tgtcctggat gcctctgaag 3061 agagggacag accgtcagaa actggagagt ttctattaaa ggtcatttaa accac // LOCUS HUMPM1AUTO 6577 bp mRNA PRI 08-JAN-1995 DEFINITION Human autoantigen pericentriol material 1 (PCM-1) mRNA, complete cds. ACCESSION L27841 NID g450276 KEYWORDS autoantigen; pericentriol material 1. SOURCE Homo sapiens fetal liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6577) AUTHORS Balczon,R., Bao,L. and Zimmer,W.E. TITLE PCM-1, A 228-kD centrosome autoantigen with a distinct cell cycle distribution JOURNAL J. Cell Biol. 124 (5), 783-793 (1994) MEDLINE 94165144 FEATURES Location/Qualifiers source 1..6577 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="hepatocyte" /dev_stage="fetal" /tissue_type="liver" gene 410..6484 /gene="PCM-1" CDS 410..6484 /gene="PCM-1" /codon_start=1 /function="unknown" /evidence=experimental /product="pericentriol material 1" /db_xref="PID:g450277" /translation="MATGGGPFEDGMNDQDLPNWSNENVDDRLNNMDWGAQQKKANRS SEKNKKKFGVESDKRVTNDISPESSPGVGRRRTKTPHTFPHSRYMSQMSVPEQAELEK LKQRINFSDLDQRSIGSDSQGRATAANNKRQLSENRKPFNFLPMQINTNKSKDASTSP PNRETIGSAQCKELFASALSNDLLQNCQVSEEDGRGEPAMESSQIVSRLVQIRDYITK ASSMREDLVEKNERSANVERLTHLIDHLKEQEKSYMKFLKKILARDPQQEPMEEIENL KKQHDLLKRMLQQQEQLRGALQGRQAALLALQHKADEAIAVMDDSVVAETAGSLSGVS ITSELNEELNDLIQRFHNQLRDSQPPAVPDNRRQAESLSLTREVSQSRKPSASERLPD EKVELFSKMRVLQEKKQKMDKLLGELHTLRDQHLNNSSSSPQRSVDQRSTSAPSACLG LAPVVNGESNSLTSSVPYPTASLVSQNESENEGHLNPSEKLQKLNEVRKRLNELRELV HYYEQTSDMMTDAVNENRKDEETEESEYDSEHENSEPVTNIRNPQVASTWNEVNSHSN AQCVSNNRDGRTVNSNCEINNRSAANIRALNVPPSLDCRYNREGEQEIHVAQGEDDEE EEEEAEEEGVSGASLSSHRSSLVDEHPEDAEFEQKINRLMAAKQKLRQLQDLVAMVQD DDAAQGVISASASNLDDFYPAEEDTKQNSNNTRGNANKTQKDTGVNEKAREKFYEAKL QQQQRELKQLQEERKKLIDIHEKIQALQTACPDLQLSAASVGNCPTKKYMPAVTSTPT VNQHETSTSKSVFEPEDSSIVDNELWSEMRRHEMLREELRQRRKQLEALMAEHQRRQG LAETASPVAVSLRSDGSENLCTPQQSRTEKTMATWGGSTQCALDEEGDEDGYLSEGIV RTDEEEEEEQDASSNDNFSVCPSNSVNHNSYNRKETKNTWKNNCPFSADENYRPLAKT RQQNISMQRQENLRWVSELSYVEEKEQWQEQIQLKKQLDFSVSICQTLMQDQQTLSCL LQTLLTGPYSVMPSNVASPQVHFIMHQLNQCYTQLTWQQNNVQRLKQMLNELMRQRNQ HPEKPGGKERGSSASHPPSPSLFCPFSFPTQPVNLFNIPGFTNFSSFAPGMNFSPLFP SNFGDFSQNISTPSEQQQPLARILSGKTEYMAFPKPFESSSSIGAEKPRNKKLPEEEV ESSRTPWLYEQEGEVEKPFIKTGFSVSVEKSTSSNRKNQLDTNGRRRQFDEESLESFS SMPDPVDPTTVTKTFKTRKASAQASLASKDKTPKSKSKKRNSTQLKSRVKNIRYESAS MSSTCEPCKSRNRHSAQTEEPLQAKVFSRKNHEQLEKIIKCNRSTEISSETGSDFSMF EALQDTIYSEVATLISQNESRPHFLIELFHELQLLNTDYLRQRALYALQDIVSRHISE SHEKGENVKSVNSGTWIASNSELTPSESLATTDDETFEKNFERETHKISEQNDADNAS VLSVSSNFEPFATDDLGNTVIHLDQALARMREYERMKTEAESNSNMRCTCRIIEDGDG AGAGTTVNNLEETPVIENRSSQQPVSEVSTIPCPRIDTQQLDRQIKAIMKEVIPFLKE HMDEVCSSQLLTSVRRMVLTLTQQNDESKEFVKFFHKQLGSILQDSLAKFAGRKLKDC GEDLLVEISEVLFNELAFFKLMQDLDNNSITVKQRCKRKIEATGVIQSCAKEAKRILE DHGSPAGEIDDEDKDKDETETVKQTQTSEVYDGPKNVRSDISDQEEDEESEGCPVSIN LSKAETQALTNYGSGEDENEDEEMEEFEEGPVDVQTSLQANTEATEENEHDEQVLQRD FKKTAESKNVHWNEKPLVKMTKNNCPVKPSYLNILEDEQPLNSAAHKESPPTVDSTQQ PNPLPLRLPEMEPLVPRVKEVKSAQETPESSLAGSPDTESPVLVNDYEAESGNISQKS DEEDFVKVEDLPLKLTIYSEADLRKKMVEEEQKNHLSGEMCEMQTEELAGNSETLKEP ETVGAQSI" misc_feature 6550..6555 BASE COUNT 2268 a 1247 c 1469 g 1593 t ORIGIN 1 tgccgcttcc gcggtcacat gactccagtc tagctcgcat tgcgctcccg ccggcgagtt 61 ctccccgcgc gccgttgcga ggagacggcg catgtccgcc gcgcttgccc ctctgcagta 121 ccccgccctc ttctccacca caatgagatc taagtgcgtg ctgcgcgtgc gtacgtgagg 181 tcgaaaagcg cactgggacg gcagccagga aacgtgtggg cctctctgct gcggtctccg 241 agggccgacc gctgccggcg cgggtcgtgc gggctgactg tcgctctgcc tttgacagga 301 gaggctgctt cttgtagagg aaacagcttt gaagtgtgga gcgggaaagg agcagtttct 361 gagctgcaaa aactagtttc taaacagaga gttaattgtt aaatccagta tggccacagg 421 aggaggtccc tttgaagatg gcatgaatga tcaggattta ccaaactgga gtaatgagaa 481 tgttgatgac aggctcaaca atatggattg gggtgcccaa cagaagaaag caaatagatc 541 atcagaaaag aataagaaaa agtttggtgt agaaagtgat aaaagagtaa ccaatgatat 601 ttctccggag tcgtcaccag gagttggaag gcgaagaaca aagactccac atacgttccc 661 acacagtaga tacatgagtc agatgtctgt cccagagcag gcagaattag agaaactgaa 721 acagcggata aacttcagtg atttagatca gagaagcatt ggaagtgatt cccaaggtag 781 agcaacagct gctaacaaca aacgtcagct tagtgaaaac cgaaagccct tcaacttttt 841 gcctatgcag attaatacta acaagagcaa agatgcatct acaagtcccc caaacagaga 901 aacgattgga tcagcacagt gtaaagagtt gtttgcttct gctttaagta atgacctctt 961 gcaaaactgt caggtgtctg aagaagatgg gaggggagaa cctgcaatgg agagcagcca 1021 gattgtaagc aggcttgttc aaattcgcga ttatattact aaagctagtt ccatgcggga 1081 agatcttgta gagaaaaatg agagatctgc taatgttgag cgccttactc atctaataga 1141 tcaccttaaa gaacaagaga agtcatatat gaaatttctt aaaaaaatcc ttgccagaga 1201 tcctcagcag gagcctatgg aagagataga aaatttgaag aaacaacatg atttattaaa 1261 aagaatgtta caacagcagg agcaactaag aggagctcta cagggacggc aggctgcact 1321 tctagctctg caacataaag cagacgaagc tattgcagtg atggatgatt ctgttgttgc 1381 agaaactgca ggtagcttat ctggcgtcag tatcacatct gaactaaatg aagaattgaa 1441 tgacttaatt cagcgttttc ataatcagct tcgtgattct cagcctccag ctgttccaga 1501 caatagaaga caggcagaaa gtctttcatt aactagggag gtttcccaga gcaggaaacc 1561 atcagcttca gaacgtttac ctgatgagaa agtcgaactt tttagcaaaa tgagagtgct 1621 acaggaaaag aaacaaaaaa tggacaaatt gcttggagaa cttcatacac ttcgagatca 1681 gcatcttaac aattcatcat cctctccaca aaggagtgtc gatcagagaa gtacttcagc 1741 tccctctgct tgtctaggct tggcaccggt tgtcaatgga gaatccaata gcctcacatc 1801 atctgttcct tatcctactg cttctctagt atctcagaat gagagtgaaa acgaaggcca 1861 cctcaatcca tctgaaaaac tccagaagtt aaatgaagtt cgaaagagat tgaatgagct 1921 aagagaatta gttcattatt atgaacaaac gtcagacatg atgacagatg ctgtgaatga 1981 aaacaggaaa gatgaagaaa ctgaagagtc agaatatgat tctgagcatg aaaattccga 2041 gcctgttact aacattcgaa atccacaagt agcttccact tggaatgaag taaatagtca 2101 tagtaatgca cagtgtgttt ctaataatag agatgggcga acagttaatt ctaattgtga 2161 aattaacaac agatctgctg ccaacataag ggctctaaac gtgcctcctt ctttagattg 2221 tcgatataat agagaagggg aacaggagat tcatgttgca caaggtgaag atgatgagga 2281 ggaggaggaa gaagcagaag aggagggagt cagtggagct tcattatcta gtcacaggag 2341 cagtctggtt gatgagcatc cagaagatgc tgaatttgaa cagaagatca accgacttat 2401 ggctgcaaaa cagaaactta gacagttaca agatcttgtt gctatggtac aggatgatga 2461 tgcagctcaa ggagttatct ctgccagtgc atcaaatttg gatgatttct acccagcaga 2521 agaagacacc aagcaaaatt caaataacac tagaggaaat gccaataaaa cacagaaaga 2581 tactggagta aatgaaaagg caagagagaa attttatgag gctaaactac agcagcaaca 2641 gagagagcta aaacaattgc aggaagaaag aaagaaactg attgacattc acgagaaaat 2701 tcaagcattg caaacggcat gccctgactt acagctgtca gctgctagtg tgggtaactg 2761 tcctaccaaa aaatatatgc cagctgttac ttcaacccca actgttaatc aacacgagac 2821 cagtacaagc aaatctgttt ttgagcctga agattcttca atagtagata atgagttgtg 2881 gtcagaaatg agaagacatg aaatgttgag ggaggagctg cgacagagaa gaaagcagct 2941 tgaagctctg atggctgaac atcagaggag gcaaggtcta gctgaaactg catctccagt 3001 ggctgtgtca ttgagaagtg atggatctga gaacctatgt actcctcagc aaagtagaac 3061 agaaaaaacg atggcaactt ggggagggtc tacccagtgt gcactagatg aagaaggaga 3121 tgaagacggt tacctttctg aaggaattgt tcggacagat gaagaggagg aagaagagca 3181 agatgccagt tccaatgata acttttctgt gtgtccttct aacagtgtga atcataactc 3241 ctacaatcga aaggaaacta aaaatacgtg gaagaacaat tgcccttttt cggcagatga 3301 aaattatcgt cctttagcca agacaaggca acagaatatc agcatgcaac ggcaagaaaa 3361 ccttcgttgg gtgtcagagc tctcttacgt agaagagaaa gaacaatggc aagaacaaat 3421 ccagctaaag aaacagcttg attttagtgt cagtatttgt cagactttga tgcaagacca 3481 gcagactcta tcttgtctgc tacaaactct tctcacgggt ccttacagtg ttatgcccag 3541 caatgttgca tctcctcaag tacacttcat aatgcaccag ttgaaccagt gctatactca 3601 gctaacatgg caacagaata atgttcagag gttgaaacaa atgctaaatg aacttatgcg 3661 ccaacgaaat cagcatccag aaaaacctgg aggcaaggaa agaggcagta gtgcatcgca 3721 ccctccttct cccagtttat tttgtccttt cagctttcca acacagcctg taaatctctt 3781 caatatacct ggatttacta acttttcatc atttgcacca ggtatgaatt tcagcccttt 3841 atttccttct aattttggag atttttctca gaatatctct acacccagtg aacagcagca 3901 acccttagcc agaattcttt caggaaaaac agaatatatg gcttttccaa aaccttttga 3961 aagcagttcc tctattggag cagagaaacc aaggaataaa aaactgcctg aagaggaggt 4021 ggaaagcagt aggacaccat ggttatatga acaagaaggt gaagtagaga aaccatttat 4081 caagactgga ttttcagtgt ctgtagaaaa atctacaagt agtaaccgca aaaatcaatt 4141 agatacaaac ggaagaagac gccagtttga tgaagaatca ctggaaagct ttagcagtat 4201 gcctgatcca gtagatccaa caacagtgac taaaacattc aagacaagaa aagcgtctgc 4261 acaggccagc ctggcatcta aagataaaac tcccaagtca aaaagtaaga agaggaattc 4321 tactcagctg aaaagcagag ttaaaaacat caggtatgaa agtgccagta tgtctagcac 4381 atgtgaacct tgcaaaagta ggaacagaca ttcagcccag actgaagagc ctcttcaagc 4441 aaaagtattc agcagaaaga atcatgagca actggaaaaa ataataaaat gtaataggtc 4501 tacagaaata tcttcagaaa ctgggagtga tttttccatg tttgaagctt tgcaggatac 4561 tatttattct gaagtagcta cattaatttc tcaaaatgaa tctcgtccac attttcttat 4621 tgaactcttc catgagctgc agctactaaa cacagactac ttgagacaga gggctttata 4681 tgcattgcag gacatagtat ccagacatat ttctgagagc catgaaaaag gagaaaatgt 4741 aaagtcagta aactctggta cttggatagc atcaaactca gaacttactc ctagtgagag 4801 ccttgctact actgatgatg aaacttttga gaagaacttt gaaagagaaa cccataaaat 4861 aagtgagcaa aatgatgctg ataatgctag tgtcctgtct gtatcatcaa attttgagcc 4921 ttttgcaaca gatgatctag gtaacaccgt gattcactta gatcaagcat tagccagaat 4981 gagagaatat gagcgtatga agactgaggc tgaaagtaac tcaaatatga gatgcacctg 5041 caggattatt gaggatggag atggtgctgg tgcaggtact acagttaata atttagaaga 5101 aactcccgtt attgaaaatc gtagttcaca acaacctgta agtgaagttt ctaccatccc 5161 atgtcctaga attgatactc agcagctgga ccggcaaatt aaagcaatta tgaaagaagt 5221 cattcctttt ttgaaggagc acatggatga agtatgctcc tcgcagcttc taacttcagt 5281 aaggcgcatg gttttgaccc ttacccagca aaatgatgag agcaaagagt ttgtaaagtt 5341 ctttcataaa caacttggaa gtatattaca ggattcactg gcaaaatttg ctggcagaaa 5401 actgaaagac tgtggagaag atcttcttgt agagatatct gaagtgttgt tcaatgaatt 5461 ggctttcttt aagcttatgc aagatttgga taataatagt ataactgtta aacagagatg 5521 caaaaggaaa atagaagcaa ctggagtgat acaatcttgt gccaaagagg ctaaaaggat 5581 tcttgaagat catggctcac ctgctggaga gattgatgat gaagacaaag acaaggatga 5641 aactgaaaca gttaagcaga ctcaaacatc tgaggtgtat gatggtccca aaaatgtaag 5701 atctgatatt tctgatcaag aggaagatga agaaagtgaa ggatgtccag tgtctattaa 5761 tttgtctaaa gctgaaactc aggctttaac taattatgga agtggagaag atgaaaatga 5821 ggatgaagaa atggaagaat ttgaagaagg ccctgtggat gtccagactt ccctccaggc 5881 taacactgaa gctactgaag aaaatgaaca tgatgaacag gtcctacaac gtgactttaa 5941 aaagacagca gaaagcaaaa atgtccattg gaacgagaag ccactagtaa aaatgaccaa 6001 aaataactgt cctgtgaaac ccagttacct caatatcttg gaagatgagc aacctttaaa 6061 tagtgctgcc cataaggagt cacctcctac tgttgattca actcaacagc ctaacccttt 6121 gccgttacgt ttacctgaaa tggaaccctt agtgcctaga gtcaaagaag ttaaatctgc 6181 tcaggaaact cctgaaagct ctctggctgg aagtcctgat actgaatctc cagtgttagt 6241 gaatgactat gaagcagaat ctggtaatat aagtcaaaag tctgatgaag aagattttgt 6301 aaaagttgaa gatttaccac tgaaactgac aatatattca gaggcagatc taagaaagaa 6361 aatggtagaa gaagaacaga aaaaccattt atctggtgaa atgtgtgaaa tgcagaccga 6421 agaattagct ggaaattctg agacactaaa agaacctgaa acggtgggag cccagagtat 6481 atgagatgtc ttcagaggct catctaactc tgtccttaca tactcaatgc atatatgaaa 6541 acaatactaa ataaacatct gatctgtata aaaatct // LOCUS HUMPNLIP 1454 bp mRNA PRI 08-JAN-1995 DEFINITION Human pancreatic lipase (PNLIP) mRNA, complete cds. ACCESSION M93285 NID g190139 KEYWORDS pancreatic lipase. SOURCE Homo sapiens pancreas cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1454) AUTHORS Giller,T., Buchwald,P., Blum-Kaelin,D. and Hunziker,W. TITLE Two novel human pancreatic lipase related proteins, hPLRP1 and hPLRP2. Differences in colipase dependence and in lipase activity JOURNAL J. Biol. Chem. 267 (23), 16509-16516 (1992) MEDLINE 92355622 FEATURES Location/Qualifiers source 1..1454 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="pancreas" /map="Unassigned" gene 1..1439 /gene="PNLIP" sig_peptide 1..48 /gene="PNLIP" /note="G00-127-916" CDS 1..1398 /gene="PNLIP" /note="precursor" /codon_start=1 /db_xref="GDB:G00-127-916" /product="lipase" /db_xref="PID:g190140" /translation="MLPLWTLSLLLGAVAGKEVCYERLGCFSDDSPWSGITERPLHIL PWSPKDVNTRFLLYTNENPNNFQEVAADSSSISGSNFKTNRKTRFIIHGFIDKGEENW LANVCKNLFKVESVNCICVDWKGGSRTGYTQASQNIRIVGAEVAYFVEFLQSAFGYSP SNVHVIGHSLGAHAAGEAGRRTNGTIGRITGLDPAEPCFQGTPELVRLDPSDAKFVDV IHTDGAPIVPNLGFGMSQVVGHLDFFPNGGVEMPGCKKNILSQIVDIDGIWEGTRDFA ACNHLRSYKYYTDSIVNPDGFAGFPCASYNVFTANKCFPCPSGGCPQMGHYADRYPGK TNDVGQKFYLDTGDASNFARWRYKVSVTLSGKKVTGHILVSLFGNKGNSKQYEIFKGT LKPDSTHSNEFDSDVDVGDLQMVKFIWYNNVINPTLPRVGASKIIVETNVGKQFNFCS PETVREEVLLTLTPC" mat_peptide 49..1395 /gene="PNLIP" /EC_number="3.1.1.3" /note="G00-127-916" /product="lipase" polyA_signal 1434..1439 /gene="PNLIP" /note="G00-127-916" polyA_site 1454 /gene="PNLIP" /note="G00-127-916" BASE COUNT 415 a 313 c 349 g 377 t ORIGIN 1 atgctgccac tttggactct ttcactgctg ctgggagcag tagcaggaaa agaagtttgc 61 tacgaaagac tcggctgctt cagtgatgac tccccatggt caggaattac ggaaagaccc 121 ctccatatat tgccttggtc tccaaaagat gtcaacaccc gcttcctcct atatactaat 181 gagaacccaa acaactttca agaagttgcc gcagattcat caagcatcag tggctccaat 241 ttcaaaacaa atagaaaaac tcgctttatt attcatggat tcatagacaa gggagaagaa 301 aactggctgg ccaatgtgtg caagaatctg ttcaaggtgg aaagtgtgaa ctgtatctgt 361 gtggactgga aaggtggctc ccgaactgga tacacacaag cctcgcagaa catcaggatc 421 gtgggagcag aagtggcata ttttgttgaa tttcttcagt cggcgttcgg ttactcacct 481 tccaacgtgc atgtcattgg ccacagcctg ggtgcccacg ctgctgggga ggctggaagg 541 agaaccaatg ggaccattgg acgcatcaca gggttggacc cagcagaacc ttgctttcag 601 ggcacacctg aattagtccg attggacccc agcgatgcca aatttgtgga tgtaattcac 661 acggatggtg cccccatagt ccccaatttg gggtttggaa tgagccaagt cgtgggccac 721 ctagatttct ttccaaatgg aggagtggaa atgcctggat gtaaaaagaa cattctctct 781 cagattgtgg acatagacgg aatctgggaa gggactcgag actttgcggc ctgtaatcac 841 ttaagaagct acaaatatta cactgatagc atcgtcaacc ctgatggctt tgctggattc 901 ccctgtgcct cttacaacgt cttcactgca aacaagtgtt tcccttgtcc aagtggaggc 961 tgcccacaga tgggtcacta tgctgataga tatcctggga aaacaaatga tgtgggccag 1021 aaattttatc tagacactgg tgatgccagt aattttgcac gttggaggta taaggtatct 1081 gtcacactgt ctggaaaaaa ggttacagga cacatactag tttctttgtt cggaaataaa 1141 ggaaactcta agcagtatga aattttcaag ggcactctca aaccagatag tactcattcc 1201 aatgaatttg actcagatgt ggatgttggg gacttgcaga tggttaaatt tatttggtat 1261 aacaatgtga tcaacccaac tttacctaga gtgggagcat ccaagattat agtggagaca 1321 aatgttggaa aacagttcaa cttctgtagt ccagaaaccg tcagggagga agttctgctc 1381 accctcacac cgtgttagga gactactgtt atttgaccaa tgaattgact tctaataaaa 1441 tctagtggtg atgc // LOCUS HUMPOLACCA 1244 bp mRNA PRI 07-OCT-1996 DEFINITION Human replication factor C, 36-kDa subunit mRNA, complete cds. ACCESSION L07540 NID g190153 KEYWORDS RFC; Activator 1. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1244) AUTHORS O'Donnell,M., Onrust,R., Dean,F.B., Chen,M. and Hurwitz,J. TITLE Homology in accessory proteins of replicative polymerases--E. coli to humans JOURNAL Nucleic Acids Res. 21 (1), 1-3 (1993) MEDLINE 93181160 REFERENCE 2 (bases 1 to 1244) AUTHORS Hurwitz,J. TITLE Direct Submission JOURNAL Submitted (21-APR-1993) Molecular Biology, Memorial Sloan-Kettering Cancer Center, 1275 York Avenue, New York, NY 10021, USA FEATURES Location/Qualifiers source 1..1244 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 10..1032 /note="replicative polymerase accessory protein; activator 1" /codon_start=1 /product="replication factor C, 36-kDa subunit" /db_xref="PID:g1498257" /translation="METSALKQQEQPAATKIRNLPWVEKYRPQTLNDLISHQDILSTI QKFINEDRLPHLLLYGPPGTGKTSTILACAKQLYKDKEFGSMVLELNASDDRGIDIIR GPILSFASTRTIFKKGFKLVILDEADAMTQDAQNALRRVIEKFTENTRFCLICNYLSK IIPALQSRCTRFRFGPLTPELMVPRLEHVVEEEKVDISEDGMKALVTLSSGDMRRALN ILQSTNMAFGKVTEETVYTCTGHPLKSDIANILDWMLNQDFTTAYRNITELKTLKGLA LHDILTEIHLFVHRVDFPSSVRIHLLTKMADIEYRLSVGTNEKIQLSSLIAAFQVTRD LIVAEA" BASE COUNT 355 a 295 c 297 g 297 t ORIGIN 1 caccccgcca tggagacctc agcactcaag cagcaggagc agcccgcggc gaccaagatc 61 aggaacctgc cctgggttga aaaataccgg ccacagaccc tgaatgatct catttctcat 121 caggacattc tgagtaccat tcagaagttt atcaatgaag accgactgcc acacttgctt 181 ctctacggtc ccccagggac aggcaagaca tctaccatcc tagcctgtgc gaaacagcta 241 tataaagaca aagaatttgg ctccatggtc ttggagctga atgcttcaga tgaccgagga 301 atagacatca ttcgaggacc gatcctgagc tttgctagca caaggacaat atttaagaaa 361 ggctttaagc tagtgatctt ggatgaagca gacgccatga ctcaggacgc ccagaatgcc 421 ttgagaagag taattgagaa attcacagaa aataccagat tctgcctcat ctgtaactat 481 ctgtcaaaga tcatccctgc cttgcagtcc cgctgcacga ggtttcggtt cggtcccctg 541 actcctgaac tcatggttcc ccgcctggaa catgtcgtgg aagaagagaa agttgatata 601 agtgaagatg gaatgaaagc actagtcact ctttccagtg gagacatgcg tagggctctg 661 aacattttgc agagcaccaa tatggccttt gggaaggtga cagaggagac tgtctacacc 721 tgcaccgggc acccgctcaa gtcagacatt gccaacatcc tggactggat gttgaatcaa 781 gatttcacca cagcctacag aaatattaca gagttgaaaa ctctgaaggg gttggcactg 841 catgatatcc tgacagagat acacttgttt gtgcatagag ttgactttcc atcttcagtt 901 cgaatacatt tattgaccaa aatggcagac attgagtaca ggctttctgt tggcaccaac 961 gagaagatcc agctgagctc cctcattgct gcatttcaag tcaccagaga cctgattgtt 1021 gcagaggcct agatgctctg agggccattc acaattctca gggctcagca gtgatgggtg 1081 ttcagaggac agttccagga taaactgctg cctggggctg tggatgaatc agtcaccccg 1141 aatcttggaa aaaccccttc caggagagga tgggcaggca tttaaaaagt accatttttg 1201 tggttgtttg gagcaggatg tacaaaataa ttttaatgta ttaa // LOCUS HUMPOLACCB 1479 bp mRNA PRI 04-SEP-1996 DEFINITION Human replication factor C, 38-kDa subunit mRNA, complete cds. ACCESSION L07541 NID g1498258 KEYWORDS RFC; Activator 1. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1479) AUTHORS O'Donnell,M., Onrust,R., Dean,F.B., Chen,M. and Hurwitz,J. TITLE Homology in accessory proteins of replicative polymerases--E. coli to humans JOURNAL Nucleic Acids Res. 21 (1), 1-3 (1993) MEDLINE 93181160 REFERENCE 2 (bases 1 to 1479) AUTHORS Hurwitz,J. TITLE Direct Submission JOURNAL Submitted (21-APR-1993) Molecular Biology, Memorial Sloan-Kettering Cancer Center, 1275 York Avenue, New York, NY 10021, USA FEATURES Location/Qualifiers source 1..1479 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 10..1080 /note="replicative polymerase accessory protein; activator 1" /codon_start=1 /product="replication factor C, 38-kDa subunit" /db_xref="PID:g1498259" /translation="MSLWVDKYRPCSLGRLDYHKEQAAQLRNLVQCGDFPHLLVYGPS GAGKKTRIMCILRELYGVGVEKLRIEHQTITTPSKKKIEISTIASNYHLEVNPSDAGN SDRVVIQEMLKTVAQSQQLETNSQRDFKVVLLTEVDKLTKDAQHALRRTMEKYMSTCR LILCCNSTSKVIPPIRSRCLAVRVPAPSIEDICHVLSTVCKKEGLNLPSQLAHRLAEK SCRNLRKALLMCEACRVQQYPFTADQEIPETDWEVYLRETANAIVSQQTPQRLLEVRG RLYELLTHCIPPEIIMKGLLSELLHNCDGQLKGEVAQMAAYYEHRLQLGSKAIYHLEA FVAKFMALYKKFMEDGLEGMMF" BASE COUNT 437 a 280 c 322 g 440 t ORIGIN 1 cgagctgcca tgagcctctg ggtggacaag tatcggccct gctccttggg acggctggac 61 tatcacaagg agcaggcggc ccagctgcgg aacctggtgc agtgtggtga ctttcctcat 121 ctgttagtgt acggaccatc aggtgctgga aaaaagacaa gaattatgtg tattttacgt 181 gaactttatg gtgttggagt ggaaaaattg agaattgaac atcagaccat cacaactcca 241 tctaaaaaaa aaattgaaat tagcaccatt gcaagtaact accaccttga agttaatcct 301 agtgatgctg gaaatagtga ccgagtagtc attcaggaga tgttgaaaac agtggcacaa 361 tcacaacaac ttgaaacaaa ctctcaaagg gattttaaag tggtattatt gacagaagtt 421 gacaaactca ccaaagatgc tcagcatgcc ttgcgaagaa ccatggaaaa atatatgtct 481 acctgcagat tgatcttgtg ctgcaattct acatctaaag tgatcccacc tattcgtagt 541 aggtgcttgg cggttcgtgt gcctgctccc agcattgaag atatttgcca cgtgttatct 601 actgtgtgta agaaggaagg tctgaatctt ccttcacaac tggctcatag acttgcagag 661 aagtcttgta gaaatctcag aaaagccctg cttatgtgtg aagcctgcag agtgcaacaa 721 tatcctttta ctgcagatca agaaatccct gagacagatt gggaggtgta tctgagggag 781 actgcaaatg ctattgtcag tcagcaaact ccacaaaggc tccttgaagt tcgtggaagg 841 ctgtatgagc ttctaactca ttgtattcct cctgagataa taatgaaggg ccttctctca 901 gaactgttac ataattgtga tggacaactg aaaggggagg tggcacaaat ggcagcttac 961 tatgagcatc gtctacagct gggtagcaaa gccatttatc acttggaagc gtttgtggcc 1021 aaattcatgg cactttataa gaagttcatg gaggatggat tggaaggcat gatgttctga 1081 cttctgtcag ttattcttgc aaagatttct cagtatcagt atttacatac agcttatatt 1141 aaaagagctg tgggtaaatt aactgaactt aatcatgtcg tatttgggtt tttttggtaa 1201 taacttctct gtgaactatt aatcatcctc tgagttaaat aattgctcct atactattga 1261 agtatgtagt tttgtacata acttagagac tttagagtct aagaaaatga tcttaattta 1321 ctttaagcat tggttattca agtattcatt gttgatcctc ctattctctt ccgtctaatc 1381 tctcacctgc taaaggagat ttacacatta gaaagcaaag attattttca tttatccaga 1441 tgaccatttt ctgccacagg taacatgatt gtttgacgg // LOCUS HUMPOLB 1259 bp mRNA PRI 08-JAN-1995 DEFINITION Human beta-polymerase mRNA, complete cds. ACCESSION M13140 NID g190155 KEYWORDS DNA polymerase; beta-polymerase; polymerase. SOURCE Human teratocarcinoma NTera2D1 cell line, cDNA to mRNA, clone lambda-pol-beta-h2. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1259; 1 to 1259) AUTHORS SenGupta,D.N., Zmudzka,B.Z., Kumar,P., Cobianchi,F., Skowronski,J. and Wilson,S.H. TITLE Sequence of human DNA polymerase beta mRNA obtained through cDNA cloning JOURNAL Biochem. Biophys. Res. Commun. 136 (1), 341-347 (1986) MEDLINE 86215196 REFERENCE 2 (bases 2 to 206) AUTHORS Abbotts,J., SenGupta,D.N., Zmudzka,B., Widen,S.G., Notario,V. and Wilson,S.H. TITLE Expression of human DNA polymerase beta in Escherichia coli and characterization of the recombinant enzyme JOURNAL Biochemistry 27 (3), 901-909 (1988) MEDLINE 88209442 COMMENT [2] revises [1]. Draft entry and clean copy sequence for [1] kindly provided by S.H.Wilson, 25-NOV-1986. A polyadenylation signal is located at positions 1250-1255, but no poly-A tail was found on the polymerase beta mRNA. FEATURES Location/Qualifiers source 1..1259 /organism="Homo sapiens" /db_xref="taxon:9606" /map="8p12-p11" mRNA <1..>1258 /note="polb mRNA" gene 114..1121 /gene="POLB" CDS 114..1121 /gene="POLB" /codon_start=1 /db_xref="GDB:G00-120-305" /product="beta-polymerase" /db_xref="PID:g190156" /translation="MSKRKAPQETLNGGITDMLTELANFEKNVSQAIHKYNAYRKAAS VIAKYPHKIKSGAEAKKLPGVGTKIAEKIDEFLATGKLRKLEKIRQDDTSSSINFLTR VSGIGPSAARKFVDEGIKTLEDLRKNEDKLNHHQRIGLKYFGDFEKRIPREEMLQMQD IVLNEVKKVDSEYIATVCGSFRRGAESSGDMDVLLTHPSFTSESTKQPKLLHQVVEQL QKVHFITDTRSKGETKFMGVCQLPSKNDEKEYPHRRIDIRLYPKDQYYCGVLYFTGSD IFNKNMRAHAKEKGFTINEYTIRPLGVTGVAGEPLPVDSEKDIFDYIQWKYREPKDRS E" BASE COUNT 396 a 252 c 291 g 320 t ORIGIN 481 bp upstream of BglII site. 1 ccggagctgg gttgctcctg ctcccgtctc caagtcctgg tacctccttc aagctgggag 61 agggctctag tccctggttc tgaacactct ggggttctcg ggtgcaggcc gccatgagca 121 aacggaaggc gccgcaggag actctcaacg ggggaatcac cgacatgctc acagaactcg 181 caaactttga gaagaacgtg agccaagcta tccacaagta caatgcttac agaaaagcag 241 catctgttat agcaaaatac ccacacaaaa taaagagtgg agctgaagct aagaaattgc 301 ctggagtagg aacaaaaatt gctgaaaaga ttgatgagtt tttagcaact ggaaaattac 361 gtaaactgga aaagattcgg caggatgata cgagttcatc catcaatttc ctgactcgag 421 ttagtggcat tggtccatct gctgcaagga agtttgtaga tgaaggaatt aaaacactag 481 aagatctcag aaaaaatgaa gataaattga accatcatca gcgaattggg ctgaaatatt 541 ttggggactt tgaaaaaaga attcctcgtg aagagatgtt acaaatgcaa gatattgttc 601 taaatgaagt taaaaaagtg gattctgaat acattgctac agtctgtggc agtttcagaa 661 gaggtgcaga gtccagtggt gacatggatg ttctcctgac ccatcccagc ttcacttcag 721 aatcaaccaa acagccaaaa ctgttacatc aggttgtgga gcagttacaa aaggttcatt 781 ttatcacaga tacccgttca aagggtgaga caaagttcat gggtgtttgc cagcttccca 841 gtaaaaatga tgaaaaagaa tatccacaca gaagaattga tatcaggtta tacccaaaag 901 atcagtatta ctgtggtgtt ctctatttca ctgggagtga tattttcaat aagaatatga 961 gagctcatgc caaggaaaag ggtttcacaa tcaatgagta caccatccgt cccttgggag 1021 tcactggagt tgcaggagaa cccctgccag tggatagtga aaaagacatc tttgattaca 1081 tccagtggaa ataccgggaa cccaaggacc ggagcgaatg aggcctgtat cctccctggc 1141 gcagacacaa cccaatgggt cttaatttat ttcttaacct ttgctatgta agggtctttg 1201 gtgtttttaa atgattgttt cttcttcatg cttttgcttg caatgtagtc aataaaacc // LOCUS HUMPOLLA 3178 bp mRNA PRI 08-JAN-1995 DEFINITION Human polyposis locus (DP1 gene) mRNA, complete cds. ACCESSION M73547 NID g190161 KEYWORDS polyposis locus. SOURCE Homo sapiens male fetus brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3178) AUTHORS Joslyn,G., Carlson,M., Thilveris,A., Albertsen,H., Gelbert,L., Samowitz,W., Groden,J., Stevens,J., Spirio,L., Robertson,M., Sargeant,L., Krapcho,K., Wolff,E., Burt,R., Hughes,J.P., Warrington,J., McPherson,J.D., Wasmuth,J.J., Le Paslier,D., Abderrahim,H., Cohen,D., Leppert,M. and White,R. TITLE Identification of deletion mutations and three new genes at the familial polyposis locus JOURNAL Cell 66 (3), 601-613 (1991) MEDLINE 91330307 REFERENCE 2 (sites) AUTHORS Spirio,L., Joslyn,G., Nelson,L., Leppert,M. and White,R. TITLE A CA repeat 30-70 KB downstream from the adenomatous polyposis coli (APC) gene [published erratum appears in Nucleic Acids Res 1992 Feb 11;20(3):642] JOURNAL Nucleic Acids Res. 19 (22), 6348 (1991) MEDLINE 92066512 FEATURES Location/Qualifiers source 1..3178 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /sex="male" /tissue_type="brain" gene 83..640 /gene="DP1" CDS 83..640 /gene="DP1" /codon_start=1 /product="polyposis locus-encoded protein" /db_xref="PID:g190162" /translation="MRERFDRFLHEKNCMTDLLAKLEAKTGVNRSFIALGVIGLVALY LVFGYGASLLCNLIGFGYPAYISIKAIESPNKEDDTQWLTYWVVYGVFSIAEFFSDIF LSWFPFYYMLKCGFLLWCMAPSPSNGAELLYKRIIRPFFLKHESQMDSVVKDLKDKSK ETADAITKEAKKATVNLLGEEKKST" BASE COUNT 963 a 615 c 616 g 935 t 49 others ORIGIN 1 agttgagcgc agtcgccgct ccagtctatc cggcactagg aacagccccg ggggcgagac 61 ggtccccgcc atgtctgcgg ccatgaggga gaggttcgac cggttcctgc acgagaagaa 121 ctgcatgact gaccttctgg ccaagctcga ggccaaaacc ggcgtgaaca ggagcttcat 181 cgctcttggt gtcatcggac tggtggcctt gtacctggtg ttcggttatg gagcctctct 241 cctctgcaac ctgataggat ttggctaccc agcctacatc tcaattaaag ctatagagag 301 tcccaacaaa gaagatgata cccagtggct gacctactgg gtagtgtatg gtgtgttcag 361 cattgctgaa ttcttctctg atatcttcct gtcatggttc cccttctact acatgctgaa 421 gtgtggcttc ctgttgtggt gcatggcccc gagcccttct aatggggctg aactgctcta 481 caagcgcatc atccgtcctt tcttcctgaa gcacgagtcc cagatggaca gtgtggtcaa 541 ggaccttaaa gacaagtcca aagagactgc agatgccatc actaaagaag cgaagaaagc 601 taccgtgaat ttactgggtg aagaaaagaa gagcacctaa accagactaa accagactgg 661 atggaaactt cctgccctct ctgtaccttc ctactggagc ttgatgttat attagggact 721 gtggtataat tattttaata atgttgcctt ggaaacattt tgagatatta aagattggaa 781 tgtgttgtaa gtttctttgc ttacttttac tgtctatata tatagggagc actttaaact 841 taatgcagtg ggcagtgtcc acgtttttgg aaaatgtatt ttgcctctgg gtaggaaaag 901 atgtatgttg ctatcctgca ggaaatataa acttaaaata aaattatata ccccacaggc 961 tgtgtacttt actgggctct ccctgcacgn attttctctg tagttacatt taggntaatc 1021 tttatggttc tacttcctnt aatgtacaat tttatataat tcngnaatgt ttttaatgta 1081 tttgtgcaca tgtacatatg gaaatgttac tgtctgacta cancatgcat catgctcatg 1141 gggagggagc aggggaaggt tgtatgtgtc atttataact tctgtacagt aagaccacct 1201 gcaacaagct ggaggaacca ttgtgctggt gtggtctact aaataatact ttaggaaata 1261 cgtgattaat atgcaagtga acaaagtgag aaatgaaatc gaatggagat tggcctggtt 1321 gtttccgtag tatatggcat atgaatacca ggatagcttt ataaagcagt tagttagtta 1381 gttactcact ctagtgataa atcgggaaat ttacacacac acacacacac acacacacac 1441 acacacacac acacacacac acacagagta ccctgtaact ctcaattccc tgaaaaacta 1501 gtaatactgt cttatctgct ataaacttta catatttgtc tattgtcaag atgctacant 1561 ggannccatt tctggtttta tcttcanagn ggaganacat gttgatttag tcttctttcc 1621 caatcttctt ttttaancca gtttnaggnn cttctgnaga tttgnccacc tctgattaca 1681 tgtatgttct ngtttgtatc atnagcaaca acatgctaat gncgacacct agctctnagn 1741 gcaattctgg gagantgana ggnngtatan agtnncccat aatctgcttg gcaatagtta 1801 agtcaatcta tcttcagttt ttctctggcc tttaaggtca aacacaagag gcttccctag 1861 tttacaagtc agagtcactt gtagtccatt taaatgccct catccgtatt ctttgtgttg 1921 ataagctgca cangactaca tagtaagtac agancagtaa agttaanncg gatgtctcca 1981 ttgatctgcc aantcgntat agagagcaat ttgtctggac tagaaaatct gagttttaca 2041 ccatactgtt aagagtcctt ttgaattaaa ctagactaaa acaagtgtat aactaaacta 2101 acaagattaa atatccagcc agtacagtat tttttaaggc aaataaagat gattagctca 2161 ccttgagnta acaatcaggt aagatcatna caatgtctca tgatgtnaan aatattaaag 2221 atatcaatac taagtgacag tatcacnnct aatataatat ggatcagagc atttattttg 2281 gggaggaaaa cagtggtgat taccggcatt ttattaaact taaaactttg tagaaagcaa 2341 acaaaattgt tcttgggaga aaatcaactt ttagattaaa aaaattttaa gtanctagga 2401 gtatttaaat ccttttccca taaataaaag tacagttttc ttggtggcag aatgaaaatc 2461 agcaacntct agcatataga ctatataatc agattgacag catatagaat atattatcag 2521 acaagatgag gaggtacaaa agttactatt gctcataatg acttacaggc taaaantagn 2581 tntaaaatac tatattaaat tctgaatgca attttttttt gttcccttga gaccaaaatt 2641 taagttaact gttgctggca gtctaagtgt aaatgttaac agcaggagaa gttaagaatt 2701 gagcagttct gttgcatgat ttcccaaatg aaatactgcc ttggctagag tttgaaaaac 2761 taattgagcc tgtgcctggc tagaaaacaa gcgtttattt gaatgtgaat agtgtttcaa 2821 aggtatgtag ttacagaatt cctaccaaac agcttaaatt cttcaagaaa gaattcctgc 2881 agcagttatt cccttacctg aaggcttcaa tcatttggat caacaactgc tactctcggg 2941 aagactcctc tactcacagc tgaagaaaat gagcacaccc ttcacactgt tatcacctat 3001 cctgaagatg tgatacactg aatggaaata aatagatgta aataaaattg agntctcatt 3061 taaaaaaaac catgtgccca atgggaaaat gacctcatgt tgtggtttaa acagcaactg 3121 cacccactag cacagcccat tgagctancc tatatataca tctctgtcag tgcccctc // LOCUS HUMPOLP 3640 bp mRNA PRI 08-JAN-1995 DEFINITION Human poly(ADP-ribose) polymerase mRNA, complete cds. ACCESSION M18112 NID g190166 KEYWORDS polymerase. SOURCE Human SV40 transformed fibroblast, cDNA to mRNA, clone pPAP. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3640) AUTHORS Uchida,K., Morita,T., Sato,T., Ogura,T., Yamashita,R., Noguchi,S., Suzuki,H., Nyunoya,H., Miwa,M. and Sugimura,T. TITLE Nucleotide sequence of a full-length cDNA for human fibroblast poly(ADP-ribose) polymerase JOURNAL Biochem. Biophys. Res. Commun. 148 (2), 617-622 (1987) MEDLINE 88076933 COMMENT Draft entry and computer readable sequence for [1] kindly provided by K.Uchida, 02-MAR-1988. FEATURES Location/Qualifiers source 1..3640 /organism="Homo sapiens" /db_xref="taxon:9606" /map="1q41-q42" mRNA <1..3640 /note="POLP mRNA" gene 140..3184 /gene="PPOL" CDS 140..3184 /gene="PPOL" /note="poly(ADP-ribose) polymerase" /codon_start=1 /db_xref="GDB:G00-119-508" /db_xref="PID:g190167" /translation="MAESSDKLYRVEYAKSGRASCKKCSESIPKDSLRMAIMVQSPMF DGKVPHWYHFSCFWKVGHSIRHPDVQVDGFSELRWDDQQKVKKTAEAGGVTGKGQDGI GSKAEKTLGDFAAEYAKSNRSTCKGCMEKIEKGQVRLSKKMVDPEKPQLGMIDRWYHP GCFVKNREELGFRPEYSASQLKGFSLLATEDKEALKKQLPGVKSEGKRKGDEVDGVDE VAKKKSKKEKDKDSKLEKALKAQNDLIWNIKDELKKVCSTNDLKELLIFNKQQVPSGE SAILDRVADGMVFGALLPCEECSGQLVFKSDAYYCTGDVTAWTKCMVKTQTPNRKEWV TPKEFREISYLKKLKVKKQDRIFPPETSASVAATPPPSTASAPAAVNSSASADKPLSN MKILTLGKLSRNKDEVKAMIEKLGGKLTGTANKASLCISTKKEVEKMNKKMEEVKEAN IRVVSEDFLQDVSASTKSLQELFLAHILSPWGAEVKAEPVEVVAPRGKSGAALSKKSK GQVKEEGINKSEKRMKLTLKGGAAVDPDSGLEHSAHVLEKGGKVFSATLGLVDIVKGT NSYYKLQLLEDDKENRYWIFRSWGRVGTVIGSNKLEQMPSKEDAIEHFMKLYEEKTGN AWHSKNFTKYPKKFYPLEIDYGQDEEAVKKLTVNPGTKSKLPKPVQDLIKMIFDVESM KKAMVEYEIDLQKMPLGKLSKRQIQAAYSILSEVQQAVSQGSSDSQILDLSNRFYTLI PHDFGMKKPPLLNNADSVQAKVEMLDNLLDIEVAYSLLRGGSDDSSKDPIDVNYEKLK TDIKVVDRDSEEAEIIRKYVKNTHATTHNAYDLEVIDIFKIEREGECQRYKPFKQLHN RRLLWHGSRTTNFAGILSQGLRIAPPEAPVTGYMFGKGIYFADMVSKSANYCHTSQGD PIGLILLGEVALGNMYELKHASHISKLPKGKHSVKGLGKTTPDPSANISLDGVDVPLG TGISSGVNDTSLLYNEYIVYDIAQVNLKYLLKLKFNFKTSLW" BASE COUNT 999 a 833 c 1008 g 800 t ORIGIN Chromosome 1p11-qter. 1 aatctatcag gaacgcgtgc gtcgcgtgtt cgtgcggctc tggccgctca gctcggctgg 61 gtgagcgcac gcgagcgcag cggcagcgtg tttctaggtc gtgcgtcggg cttccggagc 121 tttgcggcag ctagggagga tggcggagtc ttcggataag ctctatcgag tcgagtacgc 181 caagagcggg cgcgcctctt gcaagaaatg cagcgagagc atccccaagg actcgctccg 241 gatggccatc atggtgcagt cgcccatgtt tgatggaaaa gtcccacact ggtaccactt 301 ctcctgcttc tggaaggtgg gccactccat ccggcaccct gacgttcagg tggatgggtt 361 ctctgagctt cggtgggatg accagcagaa agtcaagaag acagcggaag ctggaggagt 421 gacaggcaaa ggccaggatg gaattggtag caaggcagag aagactctgg gtgactttgc 481 agcagagtat gccaagtcca acagaagtac gtgcaagggg tgtatggaga agatagaaaa 541 gggccaggtg cgcctgtcca agaagatggt ggacccggag aagccacagc taggcatgat 601 tgaccgctgg taccatccag gctgctttgt caagaacagg gaggagctgg gtttccggcc 661 cgagtacagt gcgagtcagc tcaagggctt cagcctcctt gctacagagg ataaagaagc 721 cctgaagaag cagctcccag gagtcaagag tgaaggaaag agaaaaggcg atgaggtgga 781 tggagtggat gaagtggcga agaagaaatc taaaaaagaa aaagacaagg atagtaagct 841 tgaaaaagcc ctaaaggctc agaacgacct gatctggaac atcaaggacg agctaaagaa 901 agtgtgttca actaatgacc tgaaggagct actcatcttc aacaagcagc aagtgccttc 961 tggggagtcg gcgatcttgg accgagtagc tgatggcatg gtgttcggtg ccctccttcc 1021 ctgcgaggaa tgctcgggtc agctggtctt caagagcgat gcctattact gcactgggga 1081 cgtcactgcc tggaccaagt gtatggtcaa gacacagaca cccaaccgga aggagtgggt 1141 aaccccaaag gaattccgag aaatctctta cctcaagaaa ttgaaggtta aaaagcagga 1201 ccgtatattc cccccagaaa ccagcgcctc cgtggcggcc acgcctccgc cctccacagc 1261 ctcggctcct gctgctgtga actcctctgc ttcagcagat aagccattat ccaacatgaa 1321 gatcctgact ctcgggaagc tgtcccggaa caaggatgaa gtgaaggcca tgattgagaa 1381 actcgggggg aagttgacgg ggacggccaa caaggcttcc ctgtgcatca gcaccaaaaa 1441 ggaggtggaa aagatgaata agaagatgga ggaagtaaag gaagccaaca tccgagttgt 1501 gtctgaggac ttcctccagg acgtctccgc ctccaccaag agccttcagg agttgttctt 1561 agcgcacatc ttgtcccctt ggggggcaga ggtgaaggca gagcctgttg aagttgtggc 1621 cccaagaggg aagtcagggg ctgcgctctc caaaaaaagc aagggccagg tcaaggagga 1681 aggtatcaac aaatctgaaa agagaatgaa attaactctt aaaggaggag cagctgtgga 1741 tcctgattct ggactggaac actctgcgca tgtcctggag aaaggtggga aggtcttcag 1801 tgccaccctt ggcctggtgg acatcgttaa aggaaccaac tcctactaca agctgcagct 1861 tctggaggac gacaaggaaa acaggtattg gatattcagg tcctggggcc gtgtgggtac 1921 ggtgatcggt agcaacaaac tggaacagat gccgtccaag gaggatgcca ttgagcactt 1981 catgaaatta tatgaagaaa aaaccgggaa cgcttggcac tccaaaaatt tcacgaagta 2041 tcccaaaaag ttctaccccc tggagattga ctatggccag gatgaagagg cagtgaagaa 2101 gctgacagta aatcctggca ccaagtccaa gctccccaag ccagttcagg acctcatcaa 2161 gatgatcttt gatgtggaaa gtatgaagaa agccatggtg gagtatgaga tcgaccttca 2221 gaagatgccc ttggggaagc tgagcaaaag gcagatccag gccgcatact ccatcctcag 2281 tgaggtccag caggcggtgt ctcagggcag cagcgactct cagatcctgg atctctcaaa 2341 tcgcttttac accctgatcc cccacgactt tgggatgaag aagcctccgc tcctgaacaa 2401 tgcagacagt gtgcaggcca aggtggaaat gcttgacaac ctgctggaca tcgaggtggc 2461 ctacagtctg ctcaggggag ggtctgatga tagcagcaag gatcccatcg atgtcaacta 2521 tgagaagctc aaaactgaca ttaaggtggt tgacagagat tctgaagaag ccgagatcat 2581 caggaagtat gttaagaaca ctcatgcaac cacacacaat gcgtatgact tggaagtcat 2641 cgatatcttt aagatagagc gtgaaggcga atgccagcgt tacaagccct ttaagcagct 2701 tcataaccga agattgctgt ggcacgggtc caggaccacc aactttgctg ggatcctgtc 2761 ccagggtctt cggatagccc cgcctgaagc gcccgtgaca ggctacatgt ttggtaaagg 2821 gatctatttc gctgacatgg tctccaagag tgccaactac tgccatacgt ctcagggaga 2881 cccaataggc ttaatcctgt tgggagaagt tgcccttgga aacatgtatg aactgaagca 2941 cgcttcacat atcagcaagt tacccaaggg caagcacagt gtcaaaggtt tgggcaaaac 3001 tacccctgat ccttcagcta acattagtct ggatggtgta gacgttcctc ttgggaccgg 3061 gatttcatct ggtgtgaatg acacctctct actatataac gagtacattg tctatgatat 3121 tgctcaggta aatctgaagt atctgctgaa actgaaattc aattttaaga cctccctgtg 3181 gtaattggga gaggtagccg agtcacaccc ggtggctctg gtatgaattc acccgaagcg 3241 cttctgcacc aactcacctg gccgctaagt tgctgatggg tagtacctgt actaaaccac 3301 ctcagaaagg attttacaga aacgtgttaa aggttttctc taacttctca agtcccttgt 3361 tttgtgttgt gtctgtgggg aggggttgtt ttggggttgt ttttgttttt tcttgccagg 3421 tagataaaac tgacatagag aaaaggctgg agagagattc tgttgcatag actagtccta 3481 tggaaaaaac caagcttcgt tagaatgtct gccttactgg tttccccagg gaaggaaaaa 3541 tacacttcca cccttttttc taagtgttcg tctttagttt tgattttgga aagatgttaa 3601 gcatttattt ttagttaaaa ataaaaacta atttcatact // LOCUS HUMPON2R 1542 bp mRNA PRI 24-MAY-1996 DEFINITION Homo sapiens paraoxonase 2 (PON2) mRNA, complete cds. ACCESSION L48513 NID g1333631 KEYWORDS paraoxonase 2. SOURCE Homo sapiens liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1542) AUTHORS Primo-Parmo,S.L., Sorenson,R.C., Teiber,J. and La Du,B.N. TITLE The human serum paraoxonase/arylesterase gene (PON1) is one member of a multigene family JOURNAL Genomics 33 (3), 498-507 (1996) MEDLINE 96299645 FEATURES Location/Qualifiers source 1..1542 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" 5'UTR <1..17 /gene="PON2" /note="G00-578-911" gene 1..1542 /gene="PON2" mRNA <1..1542 /gene="PON2" /note="G00-578-911" CDS 18..1082 /gene="PON2" /codon_start=1 /db_xref="GDB:G00-578-911" /product="paraoxonase 2" /db_xref="PID:g1333632" /translation="MGAWVGCGLAGDRAGFLGERLLALRNRLKASREVESVDLPHCHL IKGIEAGSEDIDILPNGLAFFSVGLKFPGLHSFAPDKPGGILMMDLKEEKPRARELRI SRGFDLASFNPHGISTFIDNDDTVYLFVVNHPEFKNTVEIFKFEEAENSLLHLKTVKH ELLPSVNDITAVGPAHFYATNDHYFSDPFLKYLGTYLNLHWANVVYYSPNEVKVVAEG FDSANGINISPDDKYIYVADILAHEIHVLEKHTNMNLTQLKVLELDTLVDNLSIDPSS GDIWVGCHPNGQKLFVYDPNNPPSSEVLRIQNILCEKPTVTTVYANNGSVLQGSSVAS VYDGKLLIGTLYHRALYCEL" 3'UTR 483..1541 /gene="PON2" /note="G00-578-911" polyA_site 1541 /gene="PON2" /note="G00-578-911" BASE COUNT 459 a 308 c 324 g 449 t 2 others ORIGIN 1 gcgcccggct cccgcgcatg ggggcctggg tgggctgtgg gcttgctggg gatcgcgctg 61 gcttcctggg cgagaggctt ctggcactca gaaatcgact taaagcctcc agagaagtag 121 aatctgtaga ccttccacac tgccacctga ttaaaggaat tgaagctggc tctgaagata 181 ttgacatact tcccaatggt ctggcttttt ttagtgtggg tctaaaattc ccaggactcc 241 acagctttgc accagataag cctggaggaa tactaatgat ggatctaaaa gaagaaaaac 301 caagggcacg ggaattaaga atcagtcgtg ggtttgattt ggcctcattc aatccacatg 361 gcatcagcac tttcatagac aacgatgaca cagtttatct ctttgttgta aaccacccag 421 aattcaagaa tacagtggaa atttttaaat ttgaagaagc agaaaattct ctgttgcatc 481 tgaaaacagt caaacatgag cttcttccaa gtgtgaatga catcacagct gttggaccgg 541 cacatttcta tgccacaaat gaccactact tctctgatcc tttcttaaag tatttaggaa 601 catacttgaa cttacactgg gcaaatgttg tttactacag tccaaatgaa gttaaagtgg 661 tagcagaagg atttgattca gcaaatggga tcaatatttc acctgatgat aagtatatct 721 atgttgctga catattggct catgaaattc atgttttgga aaaacacact aatatgaatt 781 taactcagtt gaaggtactt gagctggata cactggtgga taatttatct attgatcctt 841 cctcggggga catctgggta ggctgtcatc ctaatggcca gaagctcttc gtgtatgacc 901 cgaacaatcc tccctcgtca gaggttctcc gcatccagaa cattctatgt gagaagccta 961 cagtgactac agtttatgcc aacaatgggt ctgttctcca aggaagttct gtagcctcag 1021 tgtatgatgg gaagctgctc ataggcactt tataccacag agccttgtat tgtgaactct 1081 aaattgtact tttggcatga aagtgcgata acttaacaat taatctatga attgctaatt 1141 ctgagggaat ttacccagca acattgaccc agaaatgtat ggcatgtgta gttaatttta 1201 ttccagtaag gaacggccct tttagttcct tagagcnctt ttaacaaata aaaaaggaaa 1261 atgaacaggt tctttaaatg ccaagcaagg gacagaaaag aaagctgctt tcgaataaag 1321 tgaatacatt ttgcacaaag taagcctcac ctttgccttc caactgccag aacatggatt 1381 ccactgaaat agagtgaatt atatttcctt aaaatgtgag tgacctcact tctggcactg 1441 tgactactat ggctgtttag aactactgat aacgtatttt gatgttttgt acttacatct 1501 ttgtttncca ttaaaaagtt ggagttatat taaagactaa ct // LOCUS HUMPORIN 1464 bp mRNA PRI 08-JAN-1995 DEFINITION Homo sapiens porin (por) mRNA, complete cds and truncated cds. ACCESSION L08666 NID g190199 KEYWORDS VDAC; mitochondria; porin. SOURCE Homo sapiens blood cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1464) AUTHORS Ha,H., Hajek,P., Bedwell,D.M. and Burrows,P.D. TITLE A mitochondrial porin cDNA predicts the existence of multiple human porins JOURNAL J. Biol. Chem. 268 (16), 12143-12149 (1993) MEDLINE 93280191 FEATURES Location/Qualifiers source 1..1464 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="B-lymphocyte" /germline /tissue_type="blood" gene 1..1464 /gene="por" 5'UTR 1..247 /gene="por" CDS 248..1291 /partial /gene="por" /standard_name="VDAC" /note="Met at bp 326 also used as initiation codon in vitro" /codon_start=1 /function="mitochondrial pore-forming protein" /evidence=experimental /product="porin" /db_xref="PID:g190200" /translation="MSWCNELRLPALKQHSIGRGLESHITMCIPPSYADLGKAARDIF NKGFGFGLVKLDVKTKSCSGVEFSTSGSSNTDTGKVTGTLETKYKWCEYGLTFTEKWN TDNTLGTEIAIEDQICQGLKLTFDTTFSPNTGKKSGKIKSSYKRECINLGCDVDFDFA GPAIHGSAVFGYEGWLAGYQMTFDSAKSKLTRNNFAVGYRTGDFQLHTNVNDGTEFGG SIYQKVCEDLDTSVNLAWTSGTNCTRFGIAAKYQLDPTASISAKVNNSSLIGVGYTQT LRPGVKLTLSALVDGKSINAGGHKVGSPWSWRLNPAERNLWEWISEDLALIYFHCDQQ QAFFPPEDDQNKG" CDS 326..1291 /gene="por" /note="Met at bp 248 also used as initiation codon in vitro" /codon_start=1 /evidence=experimental /product="porin" /db_xref="PID:g190201" /translation="MCIPPSYADLGKAARDIFNKGFGFGLVKLDVKTKSCSGVEFSTS GSSNTDTGKVTGTLETKYKWCEYGLTFTEKWNTDNTLGTEIAIEDQICQGLKLTFDTT FSPNTGKKSGKIKSSYKRECINLGCDVDFDFAGPAIHGSAVFGYEGWLAGYQMTFDSA KSKLTRNNFAVGYRTGDFQLHTNVNDGTEFGGSIYQKVCEDLDTSVNLAWTSGTNCTR FGIAAKYQLDPTASISAKVNNSSLIGVGYTQTLRPGVKLTLSALVDGKSINAGGHKVG SPWSWRLNPAERNLWEWISEDLALIYFHCDQQQAFFPPEDDQNKG" 3'UTR 1292..1464 /gene="por" polyA_site 1464 /gene="por" BASE COUNT 397 a 315 c 356 g 396 t ORIGIN 1 gctgctggag ctgcagcccg accgcgagcg tgccaagcgg cttcagcagc tagcggacgt 61 ggcggcggcc cccctcagga caccaccaga ttcccctctt cccgcggcct cgccatggcg 121 acccacggac agacttgcgc gcgtcgatcg gatcaatcac ttttctagag gaagtagcag 181 tccctcttgt gagagcgcaa ggtcattact tgtgctccta agggcgtgga cgtgctttgt 241 ggaatgaatg agctggtgta atgagctcag attgcctgcc cttaagcagc acagcattgg 301 ccgaggactt gagagtcaca ttacaatgtg tattcctcca tcatatgctg accttggcaa 361 agctgccaga gatattttca acaaaggatt tggttttggg ttggtgaaac tggatgtgaa 421 aacaaagtct tgcagtggcg tggaattttc aacgtccggt tcatctaata cagacactgg 481 taaagttact gggaccttgg agaccaaata caagtggtgt gagtatggtc tgactttcac 541 agaaaagtgg aacactgata acactctggg aacagaaatc gcaattgaag accagatttg 601 tcaaggtttg aaactgacat ttgatactac cttctcacca aacacaggaa agaaaagtgg 661 taaaatcaag tcttcttaca agagggagtg tataaacctt ggttgtgatg ttgactttga 721 ttttgctgga cctgcaatcc atggttcagc tgtctttggt tatgagggct ggcttgctgg 781 ctaccagatg acctttgaca gtgccaaatc aaagctgaca aggaataact ttgcagtggg 841 ctacaggact ggggacttcc agctacacac taatgtcaat gatgggacag aatttggagg 901 atcaatttat cagaaagttt gtgaagatct tgacacttca gtaaaccttg cttggacatc 961 aggtaccaac tgcactcgtt ttggcattgc agctaaatat cagttggatc ccactgcttc 1021 catttctgca aaagtcaaca actctagctt aattggagta ggctatactc agactctgag 1081 gcctggtgtg aagcttacac tctctgctct ggtagatggg aagagcatta atgctggagg 1141 ccacaaggtt ggctcgccct ggagttggag gcttaatcca gctgaaagaa acctttggga 1201 atggatatca gaagatttgg ccttaatata tttccattgt gaccagcagc aggctttttt 1261 ccccccagaa gatgatcaaa acaaaggatg atctcaacaa gagctgtatt ttaagtattt 1321 agacagttct ttgttagctg gtttctagtt ggttatctag ttaccaatgc tgcagtcctg 1381 cagtcaccta tacattattt aaatgtattt aactgttaaa tgcgctaccc accaataatg 1441 aaatagacct ttatgaaaac tgtg // LOCUS HUMPOVRA 1254 bp mRNA PRI 15-SEP-1990 DEFINITION Human poliovirus receptor mRNA, clone H20A. ACCESSION M24407 NID g190204 KEYWORDS c-myc proto-oncogene; poliovirus receptor; transmembrane protein; tyrosine kinase; viral receptor. SOURCE Human Hela cell cDNA to mRNA, clone H20A. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1254) AUTHORS Racaniello,V.R. JOURNAL Unpublished (1989) REFERENCE 2 (bases 1 to 1254) AUTHORS Mendelsohn,C.L., Wimmer,E. and Racaniello,V.R. TITLE Cellular receptor for poliovirus: Molecular cloning, nucleotide sequence, and expression of a new member of the immunoglobulin superfamily JOURNAL Cell 56, 855-865 (1989) MEDLINE 89168426 COMMENT [1] revises [2]. Draft entry and computer readable copy of sequence kindly provided by V.Racaniello, 27-APR-1989. FEATURES Location/Qualifiers source 1..1254 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1..1254 /note="poliovirus receptor" /codon_start=1 /db_xref="PID:g190205" /translation="MARAMAAAWPLLLVALLVLSWPPPGTGDVVVQAPTQVPGFLGDS VTLPCYLQVPNMEVTHVSQLTWARHGESGSMAVFHQTQGPSYSESKRLEFVAARLGAE LRNASLRMFGLRVEDEGNYTCLFVTFPQGSRSVDIWLRVLAKPQNTAEVQKVQLTGEP VPMARCVSTGGRPPAQITWHSDLGGMPNTSQVPGFLSGTVTVTSLWILVPSSQVDGKN VTCKVEHESFEKPQLLTVNLTVYYPPEVSISGYDNNWYLGQNEATLTCDARSNPEPTG YNWSTTMGPLPPFAVAQGAQLLIRPVDKPINTTLICNVTNALGARQAELTVQVKEGPP SEHSGISRNAIIFLVLGILVFLILLGIGIYFYWSKCSREVLWHCHLCPSSTEHASASA NGHVSYSAVSRENSSSQDPQTEGTR" BASE COUNT 255 a 389 c 362 g 248 t ORIGIN 1 atggcccgag ccatggccgc cgcgtggccg ctgctgctgg tggcgctact ggtgctgtcc 61 tggccacccc caggaaccgg ggacgtcgtc gtgcaggcgc ccacccaggt gcccggcttc 121 ttgggcgact ccgtgacgct gccctgctac ctacaggtgc ccaacatgga ggtgacgcat 181 gtgtcacagc tgacttgggc gcggcatggt gaatctggca gcatggccgt cttccaccaa 241 acgcagggcc ccagctattc ggagtccaaa cggctggaat tcgtggcagc cagactgggc 301 gcggagctgc ggaatgcctc gctgaggatg ttcgggttgc gcgtagagga tgaaggcaac 361 tacacctgcc tgttcgtcac gttcccgcag ggcagcagga gcgtggatat ctggctccga 421 gtgcttgcca agccccagaa cacagctgag gttcagaagg tccagctcac tggagagcca 481 gtgcccatgg cccgctgcgt ctccacaggg ggtcgcccgc cagcccaaat cacctggcac 541 tcagacctgg gcgggatgcc caatacgagc caggtgccag ggttcctgtc tggcacagtc 601 actgtcacca gcctctggat attggtgccc tcaagccagg tggacggcaa gaatgtgacc 661 tgcaaggtgg agcacgagag ctttgagaag cctcagctgc tgactgtgaa cctcaccgtg 721 tactaccccc cagaggtatc catctctggc tatgataaca actggtacct tggccagaat 781 gaggccaccc tgacctgcga tgctcgcagc aacccagagc ccacaggcta taattggagc 841 acgaccatgg gtcccctgcc accctttgct gtggcccagg gcgcccagct cctgatccgt 901 cctgtggaca aaccaatcaa cacaacttta atctgcaacg tcaccaatgc cctaggagct 961 cgccaggcag aactgaccgt ccaggtcaaa gagggacctc ccagtgagca ctcaggcata 1021 tcccgtaacg ccatcatctt cctggttctg ggaatcctgg tttttctgat cctgctgggg 1081 atcgggattt atttctattg gtccaaatgt tcccgtgagg tcctttggca ctgtcatctg 1141 tgtccctcga gtacagagca tgccagcgcc tcagctaatg ggcatgtctc ctattcagct 1201 gtgagcagag agaacagctc ttcccaggat ccacagacag agggcacaag gtga // LOCUS HUMPP 3762 bp mRNA PRI 16-MAR-1992 DEFINITION Human glycine decarboxylase mRNA, complete cds. ACCESSION M64590 J05742 NID g190208 KEYWORDS P gene; glycine decarboxylase. SOURCE Human liver, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3762) AUTHORS Kume,A., Koyata,H., Sakakibara,T., Ishiguro,Y., Kure,S. and Hiraga,K. TITLE The glycine cleavage system: Molecular cloning of the chicken and human glycine decarboxylase cDNAs and some characteristics involved in the deduced protein structures JOURNAL J. Biol. Chem. 266, 3323-3329 (1991) MEDLINE 91131643 FEATURES Location/Qualifiers source 1..3762 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" gene 151..3748 /gene="glycine decarboxylase" CDS 151..3213 /gene="glycine decarboxylase" /codon_start=1 /product="glycine decarboxylase" /db_xref="PID:g190209" /translation="MQSCARAWGLRLGRGVGGGRRLAGGSGPCWAPRSRDSSSGGGDS AAAGASRLLERLLPRHDDFARRHIGPGDKDQREMLQTLGLASIDELIEKTVPANIRLK RPLKMEDPVCENEILATLHAISSKNQIWRSYIGMGYYNCSVPQTILRNLLENSGWITQ YTPYQPEVSQGRLESLLNYQTMVCDITGLDMANASLLDEGTAAAEALQLCYRHNKRRK FLVDPRCHPQTIAVVQTRAKYTGVLTELKLPCEMDFSGKDVSGVLFQYPDTEGKVEDF TELVERAHQSGSLACCATDLLALCILRPPGEFGVDIALGSSQRFGVPLGYGGPHAAFF AVRESLVRMMPGRMVGVTRDATGKEVYRLALQTREQHIRRDKATSNICTAQALLANMA AMFRIYHGSHGLEHIARRVHNATLILSEGLKRAGHQLQHDLFFDTLKIHCGCSVKEVL GRAAQRQINFRLFEDGTLGISLDETVNEKDLDDLLWIFGCESSAELVAESMGEECRGI PGSVFKRTSPFLTHQVFNSYHSETNIVRYMKKLENKDISLVHSMIPLGSCTMKLNSSS ELAPITWKEFANIHPFVPLDQAQGYQQLFRELEKDLCELTGYDQVCFQPNSGAQGEYA GLATIRAYLNQKGEGHRTVCLIPKSAHGTNPASAHMAGMKIQPVEVDKYGNIDAVHLK AMVDKHKENLAAIMITYPSTNGVFEENISDVCDLIHQHGGQVYLDGANMNAQVGICRP GDFGSDVSHLNLHKTFCIPHGGGGPGMGPIGVKKHLAPFLPNHPVISLKRNEDACPVG TVSAAPWGSSSILPISWAYIKMMGGKGLKQATETAILNANYMAKRLETHYRILFRGAR GYVGHEFILDTRPFKKSANIEAVDVAKRLQDYGFHAPTMSWPVAGTLMVEPTESEDKA ELDRFCDAMISIRQEIADIEEGRIDPRVNPLKMSPHSLTCVTSSHWDRPYSREVAAFP LPFMKPENKFWPTIARIDDIYGDQHLVCTCPPMEVYESPFSEQKRASS" polyA_signal 3743..3748 /gene="glycine decarboxylase" polyA_site 3762 /gene="glycine decarboxylase" BASE COUNT 929 a 899 c 1028 g 906 t ORIGIN 1 cccgcgagcg tccatccatc tgtccggccg actgtccagc gaaaggggct ccaggccggg 61 cgcacgtcga cccgggggac cgaggccagg agaggggcca agagcgcggc tgacccttgc 121 gggccggggc aggggacggt ggccgcggcc atgcagtcct gtgccagggc gtgggggctg 181 cgcctgggcc gcggggtcgg gggcggccgc cgcctggctg ggggatcggg gccgtgctgg 241 gcgccgcgga gccgggacag cagcagtggc ggcggggaca gcgccgcggc tggggcctcg 301 cgcctcctgg agcgccttct gcccagacac gacgacttcg ctcggaggca catcggccct 361 ggggacaaag accagagaga gatgctgcag accttggggc tggcgagcat tgatgaattg 421 atcgagaaga cggtccctgc caacatccgt ttgaaaagac ccttgaaaat ggaagaccct 481 gtttgtgaaa atgaaatcct tgcaactctg catgccattt caagcaaaaa ccagatctgg 541 agatcgtata ttggcatggg ctattataac tgctcagtgc cacagacgat tttgcggaac 601 ttactggaga actcaggatg gatcacccag tatactccat accagcctga ggtgtctcag 661 gggaggctgg agagtttact caactaccag accatggtgt gtgacatcac aggcctggac 721 atggccaatg catccctgct ggatgagggg actgcagccg cagaggcact gcagctgtgc 781 tacagacaca acaagaggag gaaatttctc gttgatcccc gttgccaccc acagacaata 841 gctgttgtcc agactcgagc caaatatact ggagtcctca ctgagctgaa gttaccctgt 901 gaaatggact tcagtggaaa agatgtcagt ggagtgttgt tccagtaccc agacacggag 961 gggaaggtgg aagactttac ggaactcgtg gagagagctc atcagagtgg gagcctggcc 1021 tgctgtgcta ctgacctttt agctttgtgc atcttgaggc cacctggaga atttggggta 1081 gacatcgccc tgggcagctc ccagagattt ggagtgccac tgggctatgg gggaccccat 1141 gcagcatttt ttgctgtccg agaaagcttg gtgagaatga tgcctggaag aatggtgggg 1201 gtaacaagag atgccactgg gaaagaagtg tatcgtcttg ctcttcaaac cagggagcaa 1261 cacattcgga gagacaaggc taccagcaac atctgtacag ctcaggccct cttggcgaat 1321 atggctgcca tgtttcgaat ctaccatggt tcccatgggc tggagcatat tgctaggagg 1381 gtacataatg ccactttgat tttgtcagaa ggtctcaagc gagcagggca tcaactccag 1441 catgacctgt tctttgatac cttgaagatt cattgtggct gctcagtgaa ggaggtcttg 1501 ggcagggcgg ctcagcggca gatcaatttt cggctttttg aggatggcac acttggtatt 1561 tctcttgatg aaacagtcaa tgaaaaagat ctggacgatt tgttgtggat ctttggttgt 1621 gagtcatctg cagaactggt tgctgaaagc atgggagagg agtgcagagg tattccaggg 1681 tctgtgttca agaggaccag cccgttcctc acccatcaag tgttcaacag ctaccactct 1741 gaaacaaaca ttgtccggta catgaagaaa ctggaaaata aagacatttc ccttgttcac 1801 agcatgattc cactgggatc ctgcaccatg aaactgaaca gttcgtctga actcgcacct 1861 atcacatgga aagaatttgc aaacatccac ccctttgtgc ctctggatca agctcaagga 1921 tatcagcagc ttttccgaga gcttgagaag gatttgtgtg aactcacagg ttatgaccag 1981 gtctgtttcc agccaaacag cggagcccag ggagaatatg ctggactggc cactatccga 2041 gcctacttaa accagaaagg agaggggcac agaacggttt gcctcattcc gaaatcagca 2101 catgggacca acccagcaag tgcccacatg gcaggcatga agattcagcc tgtggaggtg 2161 gataaatatg ggaatatcga tgcagttcac ctcaaggcca tggtggataa gcacaaggag 2221 aacctagcag ctatcatgat tacataccca tccaccaatg gggtgtttga agagaacatc 2281 agtgacgtgt gtgacctcat ccatcaacat ggaggacagg tctacctaga cggggcaaat 2341 atgaatgctc aggtgggaat ctgtcgccct ggagacttcg ggtctgatgt ctcgcaccta 2401 aatcttcaca agaccttctg cattccccac ggaggaggtg gtcctggcat ggggcccatc 2461 ggagtgaaga aacatctcgc cccgtttttg cccaatcatc ccgtcatttc actaaagcgg 2521 aatgaggatg cctgtcctgt gggaaccgtc agtgcggccc catggggctc cagttccatc 2581 ttgcccattt cctgggctta tatcaagatg atgggaggca agggtcttaa acaagccacg 2641 gaaactgcga tattaaatgc caactacatg gccaagcgat tagaaacaca ctacagaatt 2701 cttttcaggg gtgcaagagg ttatgtgggt catgaattta ttttggacac gagacccttc 2761 aaaaagtctg caaatattga ggctgtggat gtggccaaga gactccagga ttatggattt 2821 cacgccccta ccatgtcctg gcctgtggca gggaccctca tggtggagcc cactgagtcg 2881 gaggacaagg cagagctgga cagattctgt gatgccatga tcagcattcg gcaggaaatt 2941 gctgacattg aggagggccg catcgacccc agggtcaatc cgctgaagat gtctccacac 3001 tccctgacct gcgttacatc ttcccactgg gaccggcctt attccagaga ggtggcagca 3061 ttcccactcc ccttcatgaa accagagaac aaattctggc caacgattgc ccggattgat 3121 gacatatatg gagatcagca cctggtttgt acctgcccac ccatggaagt ttatgagtct 3181 ccattttctg aacaaaagag ggcgtcttct tagtcctctc tccctaagtt taaaggactg 3241 atttgatgcc tctccccaga gcatttgata agcaagaaag atttcatctc ccaccccagc 3301 ctcaagtagg agttttatat actgtgtata tctctgtaat ctctgtcaag gtaaatgtaa 3361 atacagtagc tggagggagt cgaagctgat ggttggaaga cggatttgct ttggtattct 3421 gcttccacat gtgccagttg cctggattgg gagccatttt gtgttttgcg tagaaagttt 3481 taggaacttt aacttttaat gtggcaagtt tgcagatgtc atagaggcta tcctggagac 3541 ttaatagaca tttttttgtt ccaaaagagt ccatgtggac tgtgccatct gtgggaaatc 3601 ccagggcaaa tgtttacatt ttgtataccc tgaagaactc tttttcctct aatatgccta 3661 atctgtaatc acatttctga gtgttttcct ctttttctgt gtgaggtttt tttttttttt 3721 aatctgcatt tattagtatt ctaataaaag cattttgatc gg // LOCUS HUMPP11A 2320 bp mRNA PRI 15-SEP-1990 DEFINITION Human placental protein (PP11) mRNA, complete cds. ACCESSION M32402 NID g190210 KEYWORDS placental protein; serine protease. SOURCE Human placenta, cDNA to mRNA, (library lambda-gt10). ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2320) AUTHORS Grundmann,U., Roemisch,J., Siebold,B., Bohn,H. and Amann,E. TITLE Cloning and expression of a cDNA encoding human placental protein (PP11), a serine protease with diagnostic significance as a tumor marker JOURNAL Unpublished (1990) COMMENT Draft entry and printed sequence for [1] kindly submitted by U.Grundmann, 02-MAR-90. Author address: U.Grundmann Department of Molecular Biololgy Research Institute Behringwerke Ag, Postfach 1140, 3550 Marburg, Germany. FEATURES Location/Qualifiers source 1..2320 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA 7..2320 /note="PP11 mRNA" CDS 155..1264 /note="placental protein 11 (PP11) precursor" /codon_start=1 /db_xref="PID:g190211" /translation="MRACISLVLAVLCGLAWAEDHKESEPLPQLEEETEEALASNLYS APTSCQGRCYEAFDKHHQCHCNARCQEFGNCCKDFESLCSDHEVSHSSDAITKEEIQS ISEKIYRADTNKAQKEDIVLNSQNCISPSETRNQVDRCPKPLFTYVNEKLFSKPTYAA FINLLNNYQRATGHGEHFSAQELAEQDAFLREIMKTAVMKELYSFLHHQNRYGSEQEF VDDLKNMWFGLYSRGNEEGDSSGFEHVFSGEVKKGKVTGFHNWIRFYLEEKEGLVDYY SHIYDGPWDSYPDVLAMQFNWDGYYKEVGSAFIGSSPEFEFALYSLCFIARPGKVCQL SLGGYPLAVRTYTWDKSTYGNGKKYIATAYIVSST" sig_peptide 155..207 /note="placental protein 11 signal peptide" mat_peptide 208..1261 /note="placental protein 11" polyA_signal 2305..2310 misc_feature 2313..2317 /note="CATTG consensus site; putative" BASE COUNT 636 a 547 c 581 g 556 t ORIGIN 1 cttcctgaaa ggatctggag acaccagctc cacaagtcct ggtgtcttta aaaggatcag 61 cttgaggaat aaggctcgtc tgagagctgt gacattcatc tgactctagt gaaagtccaa 121 cagccactcc ctttttggcc tccaactggg caccatgagg gcctgcatct ccctggtatt 181 ggccgtgctg tgtggcctgg cctgggctga ggaccacaaa gagtcagagc cattgccaca 241 gctggaggaa gagacagaag aggccctcgc cagcaacttg tactcggcac ccacctcctg 301 ccagggccgc tgctacgaag cctttgacaa gcaccaccaa tgtcactgca atgcccgctg 361 ccaagagttt gggaactgct gcaaggattt tgagagcctg tgtagtgacc acgaggtctc 421 ccacagcagt gatgccataa caaaagagga gattcagagc atctctgaga agatctacag 481 ggcagacacc aacaaagccc agaaggaaga catcgttctc aatagccaaa actgcatctc 541 cccgtcagag accagaaacc aagtggatcg ctgcccaaag ccactcttca cttatgtcaa 601 tgagaagctg ttctccaagc ccacctatgc agccttcatc aacctcctca acaactacca 661 gcgggcaaca ggccatgggg agcacttcag tgcccaggag ctggccgagc aggacgcctt 721 cctcagagag atcatgaaga cagcagtcat gaaggagctc tacagcttcc tccatcacca 781 gaatcgctat ggctcagagc aagagtttgt cgatgacttg aagaacatgt ggtttgggct 841 ctattcgaga ggcaatgaag agggggactc gagtggcttt gaacatgtct tctcaggtga 901 ggtaaaaaaa ggcaaggtta ctggcttcca taactggatc cgcttctacc tggaggagaa 961 ggagggtctg gttgactatt acagtcacat ctacgatggg ccttgggatt cttaccccga 1021 tgtgctggca atgcagttca actgggacgg ctactataag gaagtgggct ctgctttcat 1081 cggcagcagc cctgagtttg agtttgcact ctactccctg tgcttcatcg ccaggccagg 1141 caaagtgtgc cagttaagcc tgggaggata tcccttagct gtccggacat atacctggga 1201 caagtccacc tatgggaatg gcaagaagta catcgccaca gcctacatag tgtcttccac 1261 ctaatagaac ttcgagccag aaaggggcat gagggctctt gcgagactga agtgctatct 1321 tctctggact agagagaaga gggagaggac tggaagggat caccaaatct caaagcaatg 1381 agaagcattc ctaaatccca aagtgcccac atgggaaaga gataaaatgt acaaattaga 1441 aaaatgtgga taaacagtca aacctttatc ctctagaatt ttggcaatgt tgactaagaa 1501 acagagtcca agcagagaag gtaggaaccc tccatagctc tctgccctga tgtgtggggg 1561 aactaggaag aagtcctttg acctcaccag gcctcatgct tccctttaat gtaaagggaa 1621 ggggtttgcc cactttcctc tttttggggt tggtgagagg gcaaaccctg atatttttac 1681 tgtgaaggtg ttttcagttg ttcttaggaa gaacagctga tagaaattca agattactat 1741 aatggctgtt attatacaca gctctgtaaa ctaccactca gccctgtgtt ggggtcctca 1801 aagaagtaag gccacagtaa tcaagcaagg gcctttggtt ttttccagag ttagatcctc 1861 tcagaacaga gtctgggaga actccaatgc tgaatggaga agggtaatag gttggttgca 1921 gtgaatgggc tgggggtggg gtggccttct ccaggcctga gtgtttttgt gtccagctca 1981 gtatctgcaa caagaagttt cccacttgtg gatgtttagt gcagccacag acttgtattt 2041 tgatccccaa ttttttttga aagagttctc ctcataggag gatgattcag catcagaaga 2101 agaaggaacc catagcttgg tgtcattaac ataattattt taagccttat ccagcagcca 2161 taatttgaat aactctacga gaccagagag actgtagttc cctattttaa cctcaattat 2221 gcatttgtcc ccaaccccac tgagaactaa atgctgtacc acagagccgg gtgtgaacta 2281 tggtttagaa ggttcaagtt tccaattaaa gtcattgaag // LOCUS HUMPP21A 1174 bp mRNA PRI 08-JAN-1995 DEFINITION Homo sapiens (pp21) mRNA, complete cds. ACCESSION M99701 NID g521206 KEYWORDS . SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1174) AUTHORS Yeh,C.H. and Shatkin,A.J. TITLE A HeLa-cell-encoded p21 is homologous to transcription elongation factor SII JOURNAL Gene 143 (2), 285-287 (1994) MEDLINE 94266168 FEATURES Location/Qualifiers source 1..1174 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" gene 165..638 /gene="pp21" CDS 165..638 /gene="pp21" /codon_start=1 /db_xref="PID:g521207" /translation="MDKPRKENEEEPQSRPRPMRRGLRWSTLPKSSPPRSSLRRSSPR RRSSFLRSSCLSSCLRCSSRRTPSAGLSRKDLFEVRPPMEQPPCGVGKHNLEEGIFKE RLARSRPQFRGDIHGRNLSNEEMIQAADELEEMKRVRNKLMIMHWRAKRGGPYPI" BASE COUNT 351 a 233 c 270 g 320 t ORIGIN 1 gcctgtccac catctcccta ttaccctttg gtcgagaggg aaagcagaag aagtctgctg 61 gtcacacggg ggcacctcga ggagaggacg actaggagca cacggcccgg aaaggtccag 121 gtcagggaag ggaataactg tgcttgaaga agaaaattcc caacatggac aaaccacgca 181 aagaaaatga agaagagccg cagagccgcc caagaccgat gaggagaggc ctccggtgga 241 gcactctccc gaaaagcagt cccccgagga gcagtcttcg gaggagcagt cctcggagga 301 ggagttcttt cctgaggagc tcttgcctga gctcctgcct gagatgctcc tctcggagga 361 ctccctccgc aggtctttcc aggaaggacc tgtttgaggt tcgccctccc atggagcagc 421 ctccttgtgg agtaggaaaa cataaccttg aagaaggaat ctttaaagaa aggttggctc 481 gttctcgccc gcaatttaga ggggacatac atggcagaaa tttaagcaat gaggagatga 541 tacaggcagc agatgagcta gaagagatga aaagagtaag aaacaaactg atgataatgc 601 actggagggc aaaacggggc ggtccttatc ctatttaatg tgttcggcct ttaattctgt 661 tttgcctgct atagtattgc cattgccacc tggactttct gtttgcattt tcttaatgcc 721 ttttccctat ttctgaattt taactttttg tgaggcttta ttttagatgt ttagcatgta 781 actcgcttaa agttgaggtt tccccctaaa atctacaagt ttccctcttt cagtcatgag 841 ccctacacat ttgcatgaaa gatgtacata tatattgtga acgaaaaaag caattttcaa 901 atggtatata tgtatcccat tttgtaaaaa atgtatatta tatattaata tgcaaagaaa 961 aagctaaaag tatagacttc aaaggcataa cagtggttgt gtggtaagat ataggtgatt 1021 ttttaaattt ttgttttatc tgaatttctc attttttcag gacaaacgtt ttacttgtgt 1081 tgcaaaaata tataatgaaa aaatcacaca attttgaaga aaactgtcaa tcagcttata 1141 acgacaatgt ggcacttaat aaatacttgt cagg // LOCUS HUMPP2A 3120 bp mRNA PRI 09-SEP-1997 DEFINITION Homo sapiens phosphatase 2A B56-alpha (PP2A) mRNA, complete cds. ACCESSION L42373 NID g1000887 KEYWORDS B56-alpha; protein phosphatase 2A B subunit; PP2A; B'/B56. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3120) AUTHORS McCright,B. and Virshup,D.M. TITLE Identification of a new family of protein phosphatase 2A regulatory subunits JOURNAL J. Biol. Chem. 270 (44), 26123-26128 (1995) MEDLINE 96064678 REFERENCE 2 (bases 1 to 3120) AUTHORS McCright,B., Brothman,A.R. and Virshup,D.M. TITLE Assignment of human protein phosphatase 2A regulatory subunit genes b56alpha, b56beta, b56gamma, b56delta, and b56epsilon (PPP2R5A-PPP2R5E), highly expressed in muscle and brain, to chromosome regions 1q41, 11q12, 3p21, 6p21.1, and 7p11.2 --> p12 JOURNAL Genomics 36 (1), 168-170 (1996) MEDLINE 96411660 REFERENCE 3 (bases 1 to 3120) AUTHORS Virshup,D. TITLE Direct Submission JOURNAL Submitted (09-SEP-1997) HMBG, University of Utah, Bldg. 533, Rm 4480, Salt Lake City, UT 84112, USA FEATURES Location/Qualifiers source 1..3120 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" mRNA 1..3120 /gene="PP2A" gene 1..3120 /gene="PP2A" 5'UTR 1..571 /gene="PP2A" CDS 572..2032 /gene="PP2A" /note="similar to yeast RTS1" /codon_start=1 /product="protein phosphatase 2A B56-alpha" /db_xref="PID:g1000888" /translation="MSSSSPPAGAASAAISASEKVDGFTRKSVRKAQRQKRSQGSSQF RSQGSQAELHPLPQLKDATSNEQQELFCQKLQQCCILFDFMDSVSDLKSKEIKRATLN ELVEYVSTNRGVIVESAYSDIVKMISANIFRTLPPSDNPDFDPEEDEPTLEASWPHIQ LVYEFFLRFLESPDFQPSIAKRYIDQKFVQQLLELFDSEDPRERDFLKTVLHRIYGKF LGLRAFIRKQINNIFLRFIYETEHFNGVAELLEILGSIINGFALPLKAEHKQFLMKVL IPMHTAKGLALFHAQLAYCVVQFLEKDTTLTEPVIRGLLKFWPKTCSQKEVMFLGEIE EILDVIEPTQFKKIEEPLFKQISKCVSSSHFQVAERALYFWNNEYILSLIEENIDKIL PIMFASLYKISKEHWNPTIVALVYNVLKTLMEMNGKLFDDLTSSYKAERQREKKKELE REELWKKLEELKLKKALEKQNSAYNMHSILSNTSAE" 3'UTR 2033..3120 /gene="PP2A" BASE COUNT 855 a 720 c 735 g 810 t ORIGIN 1 ccgcagaggg ccggggctac ggggcagccc cgggcgatga ggggccggcg ttgaccggga 61 agagcgggca ccgcggcagt ggctccgagg ggacccgcga tggcagcgcc ctgagaggag 121 gctccaggca gggcgggctg cgctggcagc ggccgctgag gtgctggccg gccggctggc 181 tggcgacggg ggcagaagcg acgagaggcg cgctcggcac ccgcaccccc gtgcccccgc 241 ctcagttgtc taaacttcgg gctctcttcc accgtctgcg cgcccagagt caacaacttc 301 ttcacccccc tccgcccccg cccttccctc cgtcagcccc gggagctcgc cgcggcccgg 361 ggaccaggaa cctccagcgc tgagatgtgg ccgtgaggcg ttggcgggcg ccgaggagaa 421 gctcggcggc gtcccggggc cggagggccg tggggccggg gcgcaggggc gcgagcaccc 481 cgcgcctctc ccccgcctcc tcctgccgtc tccgccgctg cccgtgcctt gcaagcagca 541 gccggagctg ccaagcgtca gggccgcgga gatgtcgtcg tcgtcgccgc cggcgggggc 601 tgccagcgcc gccatctcgg cctcggagaa agtggacggc ttcacccgga aatcggtccg 661 caaggcgcag aggcagaagc gctcccaggg ctcgtcgcag tttcgcagcc agggcagcca 721 ggcagagctg cacccgctgc cccagctcaa agatgccact tcaaatgaac aacaagagct 781 tttctgtcag aagttgcagc agtgttgtat actgtttgat ttcatggact ctgtttcaga 841 cttgaagagc aaagaaatta aaagagcaac actgaatgaa ctggttgagt atgtttcaac 901 taatcgtggt gtaattgttg aatcagcgta ttctgatata gtaaaaatga tcagtgctaa 961 catcttccgt acacttcctc caagtgataa tccagatttt gatccagaag aggatgaacc 1021 cacgcttgag gcctcttggc ctcacataca gttggtatat gaattcttct tgagattttt 1081 ggagagccct gatttccagc ctagcattgc aaaacgatac attgatcaga aattcgtaca 1141 acagctcctg gagctttttg atagtgaaga tcccagagaa cgtgacttcc tgaagactgt 1201 tctgcaccga atttatggga aatttcttgg attaagagca ttcatcagaa aacaaattaa 1261 caacattttc ctcaggttta tatatgaaac agaacatttc aatggtgttg ctgaacttct 1321 tgaaatatta ggaagtatta tcaatggctt tgcattgcca ctgaaagcag aacataaaca 1381 atttctaatg aaggttctta ttcctatgca tactgcaaaa ggattagctt tgtttcatgc 1441 tcagctagca tattgtgttg tacagttcct ggagaaagat acaacactaa cagagccagt 1501 gatcagagga ctgctgaaat tttggccaaa aacctgcagt cagaaagagg tgatgttttt 1561 aggagaaatt gaagaaatct tagatgtcat tgaaccaaca cagttcaaaa aaattgaaga 1621 gccacttttc aagcagatat ccaagtgtgt atccagttct cattttcagg ttgcagaaag 1681 ggcattgtac ttctggaata acgaatatat tcttagtttg attgaggaga acattgataa 1741 aattctgcca attatgtttg ccagtttgta caaaatttcc aaagaacact ggaatccgac 1801 cattgtagca ctggtataca atgtgctgaa aaccctaatg gaaatgaatg gcaagctttt 1861 cgatgacctt actagctcat acaaagctga aagacagaga gagaaaaaga aggaattgga 1921 acgtgaagaa ttatggaaaa aattagagga gctaaagcta aagaaagctc tagaaaaaca 1981 gaatagtgct tacaacatgc acagtattct cagcaataca agtgccgaat aaaaaaaaag 2041 cctcccacct ctgccggata ggcagagttt tgtatgcttt tttgaaatat gtaaaaatta 2101 caaaacaaac ctcatcagta taatataatt aaaaggccaa ttttttctgg caactgtaaa 2161 tggaaaaata tatggactaa acgtagccct gtgctgtatc atggccatag tatattgtaa 2221 cctttgtcta atcattggat ttattgtgtc acttctgaag tttcacagaa atgaatgaat 2281 tttatcatct atgatatgag tgagataatt atgggagtgg taagaattat gacttgaatt 2341 cttctttgat tgtgttgcac atagatatgg tagtctgctc tgtatatttt tcccttttat 2401 aatgtgcttt tcacactgct gcaaacctta gttacatcct aggaaaaaat acttcctaaa 2461 ataaaactaa ggtatcatcc ttacccttct ctttgtctca cccagaaata tgatgggggg 2521 aattacctgc cctaacccct ccctcaataa atacattact gtactctgga atttaggcaa 2581 aaccttaaat ctccaggctt tttaaagcac aaaatataaa taaaagctgg gaaagtaaac 2641 caaaattctt cagattgttc ctcatgaata tcccccttcc tctgcaattc tccagagtgg 2701 taacagatgg gtagaggcag ctcaggtgaa ttacccagct tgcctctcaa ttcattcctc 2761 ctcttcctct caaaggctga aggcagggcc tttccagtcc tcacaacctg tccttcacct 2821 agtccctcct gacccaggga tggaggcttt gagtcccaca gtgtggtgat acagagcact 2881 agttgtcact gcctggcttt atttaaagga actgcagtag gcttcctctg tagagctctg 2941 aaaaggttga ctatatagag gtcttgtatg tttttacttg gtcaagtatt tctcacatct 3001 tttgttatca gagtaccatt ccaatctctt aacttgcagt tgtgtggaaa actgttttgt 3061 aatgaaagat cttcattggg ggattgagca gcatttaata aagtctatgt ttgtattttg // LOCUS HUMPP2A130 5217 bp mRNA PRI 18-JUN-1996 DEFINITION Human protein phosphatase 2A 130 kDa regulatory subunit mRNA, complete cds. ACCESSION L07590 NID g190219 KEYWORDS phosphoprotein phosphohydrolase; protein phosphatase 2A; protein phosphatase 2A 130 kDa regulatory subunit; regulatory subunit. SOURCE Homo sapiens (library: lambda ZAP; Stratagene/lambda gt10; Clonetech) fetal brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5217) AUTHORS Hendrix,P., Mayer-Jackel,R.E., Cron,P., Goris,J., Hofsteenge,J., Merlevede,W. and Hemmings,B.A. TITLE Structure and expression of a 72-kDa regulatory subunit of protein phosphatase 2A. Evidence for different size forms produced by alternative splicing JOURNAL J. Biol. Chem. 268 (20), 15267-15276 (1993) MEDLINE 93315512 FEATURES Location/Qualifiers source 1..5217 /organism="Homo sapiens" /db_xref="taxon:9606" 5'UTR 1..504 CDS 505..3957 /EC_number="3.1.3.16" /codon_start=1 /product="protein phosphatase 2A 130 kDa regulatory subunit" /db_xref="PID:g190220" /translation="MAATYRLVVSTVNHYSSVVIDRRFEQAIHYCTGTCHTFTHGIDC IVVHHSVCADLLHIPVSQFKDADLNSMFLPHENGLSSAEGDYPQQAFTGIPRVKRGST FQNTYNLKDIAGEAISFASGKIKEFSFEKLKNSNHAAYRKGRKVKSDSFNRRSVDLDL LCGHYNNDGNAPSFGLLRSSSVEEKPLSHRNSLDTNLTSMFLQNFSEEDLVTQILEKH KIDNFSSGTDIKMCLDILLKCSEDLKKCTDIIKQCIKKKSGSSISEGSGNDTISSSET VYMNVMTRLASYLKKLPFEFMQSGNNEALDLTELISNMPSLQLTPFSPVFGTEQPPKY EDVVQLSASDSGRFQTIELQNDKPNSRKMDTVQSIPNNSTNSLYNLEVNDPRTLKAVQ VQSQSLTMNPLENVSSDDLMETLYIEEESDGKKALDKGQKTENGPSHELLKVNEHRAE FPEHATHLKKCPTPMQNEIGKIFEKSFVNLPKEDCKSKVSKFEEGDQRDFTNSSSQEE IDKLLMDLESFSQKMETSLREPLAKGKNSNFLNSHSQLTGQTLVDLEPKSKVSSPIEK VSPSCLTRIIETNGHKIEEEDRALLLRILESIEDFAQELVECKSSRGSLSQEKEMMQI LQETLTTSSQANLSVCRSPVGDKAKDTTSAVLIQQTPEVIKIQNKPEKKPGTPLPPPA TSPSSPRPLSPVPHVNNVVNAPLSINIPRFYFPEGLPDTCSNHEQTLSRIETAFMDIE EQKADIYEMGKIAKVCGCPLYWKAPMFRAAGGEKTGFVTAQSFIAMWRKLLNNHHDDA SKFICLLAKPNCSSLEQEDFIPLLQDVVDTHPGLTFLKDAPEFHSRYITTVIQRIFYT VNRSWSGKITSTEIRKSNFLQTLALLEEEEDINQITDYFSYEHFYVIYCKFWELDTDH DLYISQADLSRYNDQASSSRIIERIFSGAVTRGKTIQKEGRMSYADFVWFLISEEDKR NPTSIEYWFRCMDVDGDGVLSMYELEYFYEEQCERMEAMGIEPLPFHDLLCQMLDLVK PAVDGKITLRDLKRCRMAHIFYDTFFNLEKYLDHEQRDPFAVQKDVENDGPEPSDWDR FAAEEYETLVAEESAQAQFQEGFEDYETDEPASPSEFGNKSNKILSASLPEKCGKLQS VDEE" misc_feature 2449..4438 /note="region homologous with PR72 gene alternatively spliced form" 3'UTR 3955..5217 polyA_signal 4424..4429 BASE COUNT 1697 a 1061 c 1010 g 1449 t ORIGIN 1 gaattcctgg aggcaagcgc tgcccgcgag ctgagccgcc ggaggaggag ccgcgggcaa 61 cgaggtttct ctgtcattca cagaaaaatg aatcatttaa acctttggag gactcagtta 121 tcacaatact tctctactac caactaagat ttatgatagt aaatttatga gagcaaattt 181 ccatgttata gaactgttga agaactaaaa gaggatattc tttcatcaaa ataattctgc 241 agtatcataa tattactaaa taaaattaaa aagcacaatt attaaattat taaatttgcc 301 acacatgtgc agcagctact gtatcctgat agtgaccaaa cctcaaatat aaatggtttc 361 ccttcatggg aaaagccatt atatttggaa gaaaccactg aacattgtta ttaaatatat 421 tttcagctaa ctgtcattta ggagaatttt catgaaacaa gttctagaaa gttccaagtc 481 ccaccagtaa gtggatttga tattatggca gcaacttaca gacttgtggt tagtactgtg 541 aaccactaca gcagcgtggt gatagaccgg cgttttgaac aagctataca ttattgcact 601 ggaacctgcc acaccttcac acatggaatt gactgcattg tggtacacca tagtgtttgt 661 gcagacctct tgcacatccc tgtgtctcag ttcaaagatg cagatctgaa ctctatgttt 721 ctaccccatg aaaatgggct ttcttcggct gaaggagact atccccaaca ggccttcaca 781 ggcataccca gggtcaagag aggatctaca tttcagaata cctacaactt aaaggatatt 841 gcaggagaag caatcagttt tgccagtggg aaaataaaag aattttcctt tgaaaaactc 901 aaaaactcta accatgcagc ttacagaaag ggaaggaaag ttaagtctga ctcatttaat 961 aggaggtcag ttgatttgga cttgctttgt ggccattata acaacgatgg gaacgcccca 1021 tcctttggtt tactgcggag ttcctcagtt gaggaaaaac ctttgtctca tagaaactca 1081 ctggatacga acctgacttc catgtttctt caaaactttt ctgaagaaga cttggttact 1141 cagattttgg aaaaacataa aatagataat ttttcttctg ggacagacat aaagatgtgc 1201 ttggacatct tattgaaatg ctccgaggat ttaaaaaaat gcacagacat cataaaacaa 1261 tgcataaaga aaaaatcagg gagtagcatc agtgaaggaa gtggtaatga tacaatttct 1321 agctctgaaa ctgtctatat gaatgtaatg accaggttag catcctatct gaaaaagtta 1381 ccatttgaat tcatgcagtc tgggaataat gaggctctag atttaacaga actgatcagt 1441 aatatgccta gcttacaact gactcccttc tccccagtgt ttggcactga acaaccccct 1501 aaatatgaag atgttgtcca gctctcagct tctgactctg gacgatttca aactattgaa 1561 ttgcaaaatg acaagcctaa ttctaggaag atggacactg tacaatccat tccaaacaac 1621 tccacaaatt ccttatataa cttagaggta aatgatccta gaactctaaa agctgtccag 1681 gtccaatcac agtcattaac catgaatcct ttagaaaatg tttcttctga cgacttaatg 1741 gaaactcttt atattgaaga agagtcagat ggaaagaaag cattagataa aggacaaaag 1801 acagagaatg gacctagtca tgagttatta aaggtaaatg aacatagagc agaatttcca 1861 gaacatgcta ctcatcttaa aaaatgcccc accccaatgc aaaatgaaat tggtaagata 1921 tttgagaaat catttgttaa tctacctaag gaagactgta aatcaaaagt ttctaaattt 1981 gaagagggag accagagaga ttttacaaat tccagtagcc aggaagagat agataaattg 2041 ttaatggatt tggaatcttt ttcacagaag atggagacct ctctaagaga gccacttgcg 2101 aagggtaaaa actctaattt tttaaatagt cacagtcagt tgaccggtca gacccttgta 2161 gatcttgagc ctaaatctaa agtctcttca cccatagaaa aagtctcacc ttcctgtcta 2221 acaaggatta ttgaaaccaa tggacacaaa atagaggaag aggatcgagc cctcttactg 2281 cgaatcctgg aaagcattga agactttgct caagaactag ttgaatgcaa atcaagcaga 2341 gggagcctat cacaagaaaa ggaaatgatg caaattctac aggaaacctt gacaacttcc 2401 tcccaggcca atttatcagt ctgtagaagt cctgttggtg ataaagccaa agatactact 2461 tcagcagttt tgattcagca gactccagag gtgatcaaga ttcaaaataa accagaaaag 2521 aaacctggaa caccactccc acctccagcc acctctccaa gtagtccccg acctctctcc 2581 ccggttcccc atgtgaataa tgttgtgaat gcgccattgt ccataaacat tccacggttc 2641 tactttcctg aaggactccc agatacctgt agtaatcatg aacaaactct aagcagaatt 2701 gaaactgctt tcatggatat tgaagaacag aaagcagaca tttatgaaat ggggaaaatt 2761 gcaaaggtct gtggctgtcc tctctattgg aaagccccca tgttcagggc tgcaggggga 2821 gagaagacag gatttgtgac agcacagtca ttcattgcca tgtggagaaa gttgctgaat 2881 aaccatcatg atgatgcctc taaattcatc tgtcttctag caaagcccaa ctgcagctct 2941 ctagaacagg aggatttcat ccctctactt caggatgtgg tggataccca ccctggtctc 3001 acgttcctga aagatgctcc agaattccac tcccgctaca tcaccacggt tattcagaga 3061 atattctaca cagtcaacag atcttggagt ggaaaaatta cttcgacaga gataagaaaa 3121 agcaactttt tgcaaaccct agcacttttg gaagaagagg aagatataaa ccaaattaca 3181 gattacttct cctatgaaca tttctatgtt atttattgta aattctggga actagatact 3241 gatcacgacc tctacatcag ccaggccgat ctgtctcgat acaatgacca ggcttcatca 3301 agcaggatta ttgaaaggat attctctggt gcagtaacaa ggggaaaaac aatacagaaa 3361 gagggaagaa tgagctatgc agattttgtt tggtttttga tctctgaaga agacaaaagg 3421 aatcctacca gcattgagta ttggttccgc tgcatggatg tggatggaga cggtgtactc 3481 tccatgtatg agctggagta cttctatgag gagcagtgtg aacggatgga agccatggga 3541 attgagccct tgccattcca tgatttactg tgccagatgc ttgacctagt gaagccagct 3601 gttgatggca aaataactct aagagatctg aagaggtgca gaatggctca catcttctat 3661 gacactttct ttaatctgga gaaatactta gaccatgaac agagagatcc ctttgcggtc 3721 cagaaggatg ttgagaacga tgggcctgag ccctcagact gggaccggtt tgccgctgag 3781 gagtatgaga cgcttgttgc agaggaatct gcccaagcac aattccagga aggctttgaa 3841 gattatgaaa cagatgaacc tgcctctccc tctgaatttg gaaacaaaag caataaaata 3901 ttaagtgcaa gccttccaga gaaatgtgga aagcttcaat cagtggatga agaatagctg 3961 ccggtgtcta caatgaaacg aagatgtgta ttttaaatgt ttctttcttg tgaagagatg 4021 ttctcgtttg catactgctt tttaaagact ttgatttctc caagtgtgta tcatctgcac 4081 taggaacttt gtttttaagc aataggtctg gatacacatt taacttagga ggctcctcca 4141 atttgcctca aacctcttac ggagcttctc ctcagaagtg gtaccatcgc cttccaaagt 4201 cagcactcta cactcttgaa tgtaccaagg atctcttggc gacagtacca agcacggttc 4261 tctacacagg tgactgaagt tgcctctgtg ttggctggca tccctgagtc ccctccggct 4321 cctatggagc ctagaagaaa ccttcacttg cagaaaactt gagtcagaaa attctggaac 4381 ttgaaaaagt agtaagggcc tccagaattg acttagccct agtaataaaa gcactgccaa 4441 aacatctcag agacttcttt tatgtatact ggagttcaaa gatctttaac ttacctggct 4501 tatgtaattt cacagttacc tgccaaacta ctagtcactt tacatcctca ggtattgtaa 4561 ccactgggcc ttccaacttt gctggcaagc tctggtaacc tccctgactg tggatcttat 4621 ataaaatctc aagataaaaa aacacttctt aaatgaagta tagaatttga ctcatacttg 4681 gaaaaagccc ttttaacttt tatcttttat tcattgaact attgaatgat gtatttaata 4741 tccaaattag ttgataactg ttttcacttc ctatcagaag gcctgcttga agacataaag 4801 gaataatgat acattaaaat tcttactaga tagatatatt ccttcctccc aggagtatta 4861 gactagctaa tagtaaaggc ctcaacgtta ttctttactt catgttgaaa acaattacta 4921 cagatatttc atccacctca gtatttatca aggaaatgga aaatgataca gctataaaag 4981 aaagctgtta tttactgtag ttgtagatgt attcactaca gaatcctaca tttttcagca 5041 ggccacagtc cagccaaatc ctataatatc cttgaaaaga aactattaaa aaggatagac 5101 atttctgatg taagtaaaac ccccaagcaa cttaatatcc atctgtcagt catcaacttt 5161 tcccctagat ttttttttta actagttctg aaagtgtcaa gaatagcttc ggaattc // LOCUS HUMPPAR 1854 bp mRNA PRI 23-JUL-1993 DEFINITION Human peroxisome proliferator activated receptor mRNA, complete cds. ACCESSION L02932 NID g307340 KEYWORDS peroxisome proliferator-activated receptor; steroid receptor superfamily. SOURCE Homo sapiens (library: K19 lambda gt 11) liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1854) AUTHORS Sher,T., Yi,H.-F., McBride,O.W. and Gonzales,F.Y. TITLE cDNA cloning, chromosomal mapping, and functional characterization of the human peroxisome proliferator activated receptor JOURNAL Biochemistry 32, 5598-5604 (1993) MEDLINE 93277839 FEATURES Location/Qualifiers source 1..1854 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /tissue_lib="K19 lambda gt 11" /map="22q11.2-qter" CDS 217..1623 /standard_name="PPAR" /codon_start=1 /product="peroxisome proliferator activated receptor" /db_xref="PID:g307341" /translation="MVDTESPLCPLSPLEAGDLESPLSEEFLQEMGNIQEISQSIGED SSGSFGFTEYQYLGSCPGSDGSVITDTLSPASSPSSVTYPVVPGSVDESPSGALNIEC RICGDKASGYHYGVHACEGCKGFFRRTIRLKLVYDKCDRSCKIQKKNRNKCQYCRFHK CLSVGMSHNAIRFGRMPRSEKAKLKAEILTCEHDIEDSETADLKSLAKRIYEAYLKNF NMNKVKARVILSGKASNNPPFVIHDMETLCMAEKTLVAKLVANGIQNKEVEVRIFHCC QCTSVETVTELTEFAKAIPAFANLDLNDQVTLLKYGVYEAIFAMLSSVMNKDGMLVAY GNGFITREFLKSLRKPFCDIMEPKFDFAMKFNALELDDSDISLFVAAIICCGDRPGLL NVGHIEKMQEGIVHVLRLHLQSNHPDDIFLFPKLLQKMADLRQLVTEHAQLVQIIKKT ESDAALHPLLQEIYRDMY" BASE COUNT 499 a 449 c 480 g 426 t ORIGIN 1 ggcccaggct gaagctcagg gccctgtctg ctctgtggac tcaacagttt gtggcaagac 61 aagctcagaa ctgagaagct gtcaccacag ttctggaggc tgggaagttc aagatcaaag 121 tgccagcaga ttcagtgtca tgtgaggacg tgcttcctgc ttcatagata agagtagctt 181 ggagctcggc ggcacaacca gcaccatctg gtcgcgatgg tggacacgga aagcccactc 241 tgccccctct ccccactcga ggccggcgat ctagagagcc cgttatctga agagttcctg 301 caagaaatgg gaaacatcca agagatttcg caatccatcg gcgaggatag ttctggaagc 361 tttggcttta cggaatacca gtatttagga agctgtcctg gctcagatgg ctcggtcatc 421 acggacacgc tttcaccagc ttcgagcccc tcctcggtga cttatcctgt ggtccccggc 481 agcgtggacg agtctcccag tggagcattg aacatcgaat gtagaatctg cggggacaag 541 gcctcaggct atcattacgg agtccacgcg tgtgaaggct gcaagggctt ctttcggcga 601 acgattcgac tcaagctggt gtatgacaag tgcgaccgca gctgcaagat ccagaaaaag 661 aacagaaaca aatgccagta ttgtcgattt cacaagtgcc tttctgtcgg gatgtcacac 721 aacgcgattc gttttggacg aatgccaaga tctgagaaag caaaactgaa agcagaaatt 781 cttacctgtg aacatgacat agaagattct gaaactgcag atctcaaatc tctggccaag 841 agaatctacg aggcctactt gaagaacttc aacatgaaca aggtcaaagc ccgggtcatc 901 ctctcaggaa aggccagtaa caatccacct tttgtcatac atgatatgga gacactgtgt 961 atggctgaga agacgctggt ggccaagctg gtggccaatg gcatccagaa caaggaggtg 1021 gaggtccgca tctttcactg ctgccagtgc acgtcagtgg agaccgtcac ggagctcacg 1081 gaattcgcca aggccatccc agcgttcgca aacttggacc tgaacgatca agtgacattg 1141 ctaaaatacg gagtttatga ggccatattc gccatgctgt cttctgtgat gaacaaagac 1201 gggatgctgg tagcgtatgg aaatgggttt ataactcgtg aattcctaaa aagcctaagg 1261 aaaccgttct gtgatatcat ggaacccaag tttgattttg ccatgaagtt caatgcactg 1321 gaactggatg acagtgatat ctcccttttt gtggctgcta tcatttgctg tggagatcgt 1381 cctggccttc taaacgtagg acacattgaa aaaatgcagg agggtattgt acatgtgctc 1441 agactccacc tgcagagcaa ccacccggac gatatctttc tcttcccaaa acttcttcaa 1501 aaaatggcag acctccggca gctggtgacg gagcatgcgc agctggtgca gatcatcaag 1561 aagacggagt cggatgctgc gctgcacccg ctactgcagg agatctacag ggacatgtac 1621 tgagttcctt cagatcagcc acaccttttc caggagttct gaagctgaca gcactacaaa 1681 ggagacgggg gagcagcacg attttgcaca aatatccacc actttaacct tagagcttgg 1741 acagtctgag ctgtaggtaa ccggcatatt attccatatc tttgttttaa ccagtacttc 1801 taagagcata gaactcaaat gctgggggag gtggctaatc tcaggactgg gaag // LOCUS HUMPPARGB 1811 bp mRNA PRI 01-NOV-1995 DEFINITION H. sapiens peroxisome proliferator activated receptor gamma, complete cds. ACCESSION L40904 NID g722619 KEYWORDS PPAR gene; h ppar gamma gene; nuclear receptor; peroxisome proliferator activated receptor gamma; transcription factor. SOURCE Homo sapiens (clone 14) (clone library: hu.bmr1990 unamp) (tissue library: hu.bmr1990 unamp) female adult bone marrow aspirate cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1811) AUTHORS Greene,M.E., Blumberg,B., McBride,O.W., Yi,H.F., Kronquist,K., Kwan,K., Hsieh,L., Greene,G. and Nimer,S.D. TITLE Isolation of the human peroxisome proliferator activated receptor gamma cDNA: expression in hematopoietic cells and chromosomal mapping JOURNAL Gene Expr. 4 (4-5), 281-299 (1995) MEDLINE 95307078 REFERENCE 2 (bases 1 to 1811) AUTHORS Qi,J.S., Desai-Yajnik,V., Greene,M.E., Raaka,B.M. and Samuels,H.H. TITLE The ligand-binding domains of the thyroid hormone/retinoid receptor gene subfamily function in vivo to mediate heterodimerization, gene silencing, and transactivation JOURNAL Mol. Cell. Biol. 15 (3), 1817-1825 (1995) MEDLINE 95166267 COMMENT Full length receptor cDNA first isolated and sequenced Jan 9 1991. Patent applied for. FEATURES Location/Qualifiers source 1..1811 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="14" /clone_lib="hu.bmr1990 unamp" /dev_stage="adult" /sex="female" /tissue_type="bone marrow aspirate" /tissue_lib="hu.bmr1990 unamp" /map="3p25" mRNA <1..1811 /gene="PPARG" /product="peroxisome proliferator activated receptor gamma" gene 1..1811 /gene="PPARG" CDS 173..1609 /gene="PPARG" /codon_start=1 /function="ligand activated transcription factor" /product="peroxisome proliferator activated receptor gamma" /db_xref="PID:g722620" /translation="MTMVDTEIAFWPTNFGISSVDLSVMEDHSHSFDIKPFTTVDFSS ISTPHYEDIPFTRTDPVVADYKYDLKLQEYQSAIKVEPASPPYYSEKTQLYNKPHEEP SNSLMAIECRVCGDKASGFHYGVHACEGCKGFFRRTIRLKLIYDRCDLNCRIHKKSRN KCQYCRFQKCLAVGMSHNAIRFGRIAQAEKEKLLAEISSDIDQLNPESADLRQALAKH LYDSYIKSFPLTKAKARAILTGKTTDKSPFVIYDMNSLMMGEDKIKFKHITPLQEQSK EVAIRIFQGCQFRSVEAVQEITEYAKSIPGFVNLDLNDQVTLLKYGVHEIIYTMLASL MNKDGVLISEGQGFMTREFLKSLRKPFGDFMEPKFEFAVKFNALELDDSDLAIFIAVI ILSGDRPGLLNVKPIEDIQDNLLQALELQLKLNHPESSQLFAKLLQKMTDLRQIVTEH VQLLQVIKKTETDMSLHPLLQEIYKDLY" polyA_site 1811 /gene="PPARG" BASE COUNT 510 a 433 c 422 g 446 t ORIGIN 1 ccgaccttac cccaggcggc cttgacgttg gtcttgtcgg caggagacag caccatggtg 61 ggttctctct gagtctggga attcccgagc ccgagccgca gccgccgcct ggggggcttg 121 ggtcggcctc gaggacaccg gagaggggcg ccacgccgcc gtggccgcag aaatgaccat 181 ggttgacaca gagatcgcat tctggcccac caactttggg atcagctccg tggatctctc 241 cgtaatggaa gaccactccc actcctttga tatcaagccc ttcactactg ttgacttctc 301 cagcatttct actccacatt acgaagacat tccattcaca agaacagatc cagtggttgc 361 agattacaag tatgacctga aacttcaaga gtaccaaagt gcaatcaaag tggagcctgc 421 atctccacct tattattctg agaagactca gctctacaat aagcctcatg aagagccttc 481 caactccctc atggcaattg aatgtcgtgt ctgtggagat aaagcttctg gatttcacta 541 tggagttcat gcttgtgaag gatgcaaggg tttcttccgg agaacaatca gattgaagct 601 tatctatgac agatgtgatc ttaactgtcg gatccacaaa aaaagtagaa ataaatgtca 661 gtactgtcgg tttcagaaat gccttgcagt ggggatgtct cataatgcca tcaggtttgg 721 gcggatcgca caggccgaga aggagaagct gttggcggag atctccagtg atatcgacca 781 gctgaatcca gagtccgctg acctccgtca ggccctggca aaacatttgt atgactcata 841 cataaagtcc ttcccgctga ccaaagcaaa ggcgagggcg atcttgacag gaaagacaac 901 agacaaatca ccattcgtta tctatgacat gaattcctta atgatgggag aagataaaat 961 caagttcaaa cacatcaccc ccctgcagga gcagagcaaa gaggtggcca tccgcatctt 1021 tcagggctgc cagtttcgct ccgtggaggc tgtgcaggag atcacagagt atgccaaaag 1081 cattcctggt tttgtaaatc ttgacttgaa cgaccaagta actctcctca aatatggagt 1141 ccacgagatc atttacacaa tgctggcctc cttgatgaat aaagatgggg ttctcatatc 1201 cgagggccaa ggcttcatga caagggagtt tctaaagagc ctgcgaaagc cttttggtga 1261 ctttatggag cccaagtttg agtttgctgt gaagttcaat gcactggaat tagatgacag 1321 cgacttggca atatttattg ctgtcattat tctcagtgga gaccgcccag gtttgctgaa 1381 tgtgaagccc attgaagaca ttcaagacaa cctgctacaa gccctggagc tccagctgaa 1441 gctgaaccac cctgagtcct cacagctgtt tgccaagctg ctccagaaaa tgacagacct 1501 cagacagatt gtcacggaac acgtgcagct actgcaggtg atcaagaaga cggagacaga 1561 catgagtctt cacccgctcc tgcaggagat ctacaaggac ttgtactagc agagagtcct 1621 gagccactgc caacatttcc cttcttccag ttgcactatt ctgagggaaa atctgaccat 1681 aagaaattta ctgtgaaaaa gcgttttaaa aagaaaaggg tttagaatat gatctatttt 1741 atgcatattg tttataaaga cacatttaca atttactttt aatattaaaa attaccatat 1801 tatgaaattg c // LOCUS HUMPPARP0 1097 bp mRNA PRI 01-SEP-1988 DEFINITION Human acidic ribosomal phosphoprotein P0 mRNA, complete cds. ACCESSION M17885 NID g190231 KEYWORDS acidic ribosomal phosphoprotein. SOURCE Human fibroblast, cDNA to mRNA, (library of Okayama and Berg), clone D10. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1097) AUTHORS Rich,B.E. and Steitz,J.A. TITLE Human acidic ribosomal phosphoproteins P0, P1, and P2: Analysis of cDNA clones, in vitro synthesis, and assembly JOURNAL Mol. Cell. Biol. 7, 4065-4074 (1987) MEDLINE 88122131 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by B.E.Rich, 13-JAN-1988. FEATURES Location/Qualifiers source 1..1097 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..1097 /note="ARP mRNA" CDS 78..1031 /note="acidic ribosomal phosphoprotein (P0)" /codon_start=1 /db_xref="PID:g190232" /translation="MPREDRATWKSNYFLKIIQLLDDYPKCFIVGADNVGSKQMQQIR MSLRGKAVVLMGKNTMMRKAIRGHLENNPALEKLLPHIRGNVGFVFTKEDLTEIRDML LANKVPAAARAGAIAPCEVTVPAQNTGLGPEKTSFFQALGITTKISRGTIEILSDVQL IKTGDKVGASEATLLNMLNISPFSFGLVIQQVFDNGSIYNPEVLDITEETLHSRFLEG VRNVASVCLQIGYPTVASVPHSIINGYKRVLALSVETDYTFPLAEKVKAFLADPSAFV AAAPVAAATTAAPAAAAAPAKVEAKEESEESDEDMGFGLFD" BASE COUNT 257 a 301 c 283 g 256 t ORIGIN Unreported. 1 cttctctcgc caggcgtcct cgtggaagtg acatcgtctt taaaccccct cgtggcaatc 61 cctgacgcac cgccgtgatg cccagggaag acagggcgac ctggaagtcc aactacttcc 121 ttaagatcat ccaactattg gatgattatc cgaaatgttt cattgtggga gcagacaatg 181 tgggctccaa gcagatgcag cagatccgca tgtcccttcg cgggaaggct gtggtgctga 241 tgggcaagaa caccatgatg cgcaaggcca tccgagggca cctggaaaac aacccagctc 301 tggagaaact gctgcctcat atccggggga atgtgggctt tgtgttcacc aaggaggacc 361 tcactgagat cagggacatg ttgctggcca ataaggtgcc agctgctgcc cgtgctggtg 421 ccattgcccc atgtgaagtc actgtgccag cccagaacac tggtctcggg cccgagaaga 481 cctccttttt ccaggcttta ggtatcacca ctaaaatctc caggggcacc attgaaatcc 541 tgagtgatgt gcagctgatc aagactggag acaaagtggg agccagcgaa gccacgctgc 601 tgaacatgct caacatctcc cccttctcct ttgggctggt catccagcag gtgttcgaca 661 atggcagcat ctacaaccct gaagtgcttg atatcacaga ggaaactctg cattctcgct 721 tcctggaggg tgtccgcaat gttgccagtg tctgtctgca gattggctac ccaactgttg 781 catcagtacc ccattctatc atcaacgggt acaaacgagt cctggccttg tctgtggaga 841 cggattacac cttcccactt gctgaaaagg tcaaggcctt cttggctgat ccatctgcct 901 ttgtggctgc tgcccctgtg gctgctgcca ccacagctgc tcctgctgct gctgcagccc 961 cagctaaggt tgaagccaag gaagagtcgg aggagtcgga cgaggatatg ggatttggtc 1021 tctttgacta atcaccaaaa agcaaccaac ttagccagtt ttatttgcaa aacaaggaaa 1081 taaaggctta cttcttt // LOCUS HUMPPARP1 512 bp mRNA PRI 01-SEP-1988 DEFINITION Human acidic ribosomal phosphoprotein P1 mRNA, complete cds. ACCESSION M17886 NID g190233 KEYWORDS acidic ribosomal phosphoprotein. SOURCE Human fibroblast, cDNA to mRNA, (library of Okayama and Berg). ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 512) AUTHORS Rich,B.E. and Steitz,J.A. TITLE Human acidic ribosomal phosphoproteins P0, P1, and P2: Analysis of cDNA clones, in vitro synthesis, and assembly JOURNAL Mol. Cell. Biol. 7, 4065-4074 (1987) MEDLINE 88122131 FEATURES Location/Qualifiers source 1..512 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..512 /note="ARP mRNA" CDS 130..474 /note="acidic ribosomal phosphoprotein (P1)" /codon_start=1 /db_xref="PID:g190234" /translation="MASVSELACIYSALILHDDEVTVTEDKINALIKAAGVNVEPFWP GLFAKALANVNIGSLICNVGAGGPAPAAGAAPAGGPAPSTAAAPAEEKKVEAKKEESE ESDDDMGFGLFD" BASE COUNT 110 a 147 c 138 g 117 t ORIGIN Unreported. 1 cttttcctca gctgccgcca aggtgctcgg tccttccgag gaagctaagg ctgcgttggg 61 gtgaggccct cacttcatcc ggcgactagc accgcgtccg gcagcgccag ccctacactc 121 gcccgcgcca tggcctctgt ctccgagctc gcctgcatct actcggccct cattctgcac 181 gacgatgagg tgacagtcac ggaggataag atcaatgccc tcattaaagc agccggtgta 241 aatgttgagc ctttttggcc tggcttgttt gcaaaggccc tggccaacgt caacattggg 301 agcctcatct gcaatgtagg ggccggtgga cctgctccag cagctggtgc tgcaccagca 361 ggaggtcctg ccccctccac tgctgctgct ccagctgagg agaagaaagt ggaagcaaag 421 aaagaagaat ccgaggagtc tgatgatgac atgggctttg gtctttttga ctaaacctct 481 tttataacat gttcaataaa aagctgaact tt // LOCUS HUMPPARP2 460 bp mRNA PRI 01-SEP-1988 DEFINITION Human acidic ribosomal phosphoprotein P2 mRNA, complete cds. ACCESSION M17887 NID g190235 KEYWORDS acidic ribosomal phosphoprotein. SOURCE Human fibroblast, cDNA to mRNA, (library of Okayama and Berg). ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 460) AUTHORS Rich,B.E. and Steitz,J.A. TITLE Human acidic ribosomal phosphoproteins P0, P1, and P2: Analysis of cDNA clones, in vitro synthesis, and assembly JOURNAL Mol. Cell. Biol. 7, 4065-4074 (1987) MEDLINE 88122131 FEATURES Location/Qualifiers source 1..460 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..460 /note="ARP mRNA" CDS 75..422 /note="acidic ribosomal phosphoprotein (P2)" /codon_start=1 /db_xref="PID:g190236" /translation="MRYVASYLLAALGGNSSPSAKDIKKILDSVGIEADDDRLNKVIS ELNGKNIEDVIAQGIGKLASVPAGGAVAVSAAPGSAAPAAGSAPAAAEEKKDEKKEES EESDDDMGFGLFD" BASE COUNT 99 a 135 c 128 g 98 t ORIGIN Unreported. 1 cttttcctcc catgtcgcca ccgaggtgcc acgcgtgaga cttctccgcc gcctccgccg 61 cagacgccgc cgcgatgcgc tacgtcgcct cctacctgct ggctgcccta gggggcaact 121 cctcccccag cgccaaggac atcaagaaga tcttggacag cgtgggtatc gaggcggacg 181 acgaccggct caacaaggtt atcagtgagc tgaatggaaa aaacattgaa gacgtcattg 241 cccagggtat tggcaagctt gccagtgtac ctgctggtgg ggctgtagcc gtctctgctg 301 ccccaggctc tgcagcccct gctgctggtt ctgcccctgc tgcagcagag gagaagaaag 361 atgagaagaa ggaggagtct gaagagtcag atgatgacat gggatttggc ctttttgatt 421 aaattcctgc tcccctgcaa ataaagcctt tttacacatc // LOCUS HUMPPH 2920 bp mRNA PRI 06-SEP-1994 DEFINITION Human N-benzoyl-L-tyrosyl-p-amino-benzoic acid hydrolase alpha subunit (PPH alpha) mRNA, complete cds. ACCESSION M82962 M74238 NID g535474 KEYWORDS N-benzoyl-L-tyrosyl-p-amino-benzoic acid hydrolase; astacin; metalloendopeptidase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 105) AUTHORS Eldering,J.A., Grunberg,J. and Sterchi,E.E. TITLE Cloning of the Paba-peptide hydrolase beta subunit: coexpression is required for plasma membrane localization of the alpha subunit in COS-1 cells JOURNAL Unpublished REFERENCE 2 (bases 104 to 2920) AUTHORS Dumermuth,E., Eldering,J.A., Grunberg,J., Jiang,W. and Sterchi,E.E. TITLE Cloning of the PABA peptide hydrolase alpha subunit (PPH alpha) from human small intestine and its expression in COS-1 cells JOURNAL FEBS Lett. 335 (3), 367-375 (1993) MEDLINE 94085556 REFERENCE 3 (bases 202 to 798) AUTHORS Dumermuth,E., Sterchi,E.E., Jiang,W.P., Wolz,R.L., Bond,J.S., Flannery,A.V. and Beynon,R.J. TITLE The astacin family of metalloendopeptidases JOURNAL J. Biol. Chem. 266 (32), 21381-21385 (1991) MEDLINE 92042028 REFERENCE 4 (bases 202 to 798) AUTHORS Sterchi,E.E. TITLE Direct Submission JOURNAL Submitted (02-DEC-1991) Erwin E. Sterchi, Institute of Biochemistry and Molecular Biology, University of Berne, Buehlstrasse 28, CH-3012 Berne, Switzerland REFERENCE 5 (bases 1 to 2920) AUTHORS Sterchi,E.E. TITLE Direct Submission JOURNAL Submitted (30-AUG-1994) Erwin E. Sterchi, Institute of Biochemistry and Molecular Biology, University of Berne, Buehlstrasse 28, CH-3012 Berne, Switzerland FEATURES Location/Qualifiers source 1..2920 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Unizap XR Custom (Stratagene)" /tissue_type="jejunum" /clone="PPH2-22.4, PPH alpha 5'-1" mRNA 1..2920 /gene="PPH alpha" gene 1..2920 /gene="PPH alpha" CDS 10..2250 /gene="PPH alpha" /EC_number="3.4.24.18" /codon_start=1 /product="N-benzoyl-L-tyrosyl-p-amino-benzoic acid hydrolase alpha subunit" /db_xref="PID:g535475" /translation="MAWIRSTCILFFTLLFAHIAAVPIKHLPEENVHDADFGEQKDIS EINLAAGLDLFQGDILLQKSRNGLRDPNTRWTFPIPYILADNLGLNAKGAILYAFEMF RLKSCVDFKPYEGESSYIIFQQFDGCWSEVGDQHVGQNISIGQGCAYKAIIEHEILHA LGFYHEQSRTDRDDYVNIWWDQILSGYQHNFDTYDDSLITDLNTPYDYESLMHYQPFS FNKNASVPTITAKIPEFNSIIGQRLDFSAIDLERLNRMYNCTTTHTLLDHCTFEKANI CGMIQGTRDDTDWAHQDSAQAGEVDHTLLGQCTGAGYFMQFSTSSGSAEEAALLESRI LYPKRKQQCLQFFYKMTGSPSDRLVVWVRRDDSTGNVRKLVKVQTFQGDDDHNWKIAH VVLKEEQKFRYLFQGTKGDPQNSTGGIYLDDITLTETPCPTGVWTVRNFSQVLENTSK GDKLQSPRFYNSEGYGFGVTLYPNSRESSGYLRLAFHVCSGENDAILEWPVENRQVII TILDQEPDVRNRMSSSMVFTTSKSHTSPAINDTVIWDRPSRVGTYHTDCNCFRSIDLG WSGFISHQMLKRRSFLKNDDLIIFVDFEDITHLSQTEVPSKGKRLSPQGLILQGQEQQ VSEEGSGKAMLEEALPVSLSQGQPSRQKRSVENTGPLEDHNWPQYFRDPCDPNPCQND GICVNVKGMASCRCISGHAFFYTGERCQSAEVHGSVLGMVIGGTAGVIFLTFSIIAIL SQRPRK" sig_peptide 10..70 /gene="PPH alpha" misc_feature 73..202 /gene="PPH alpha" /note="encodes propeptide" misc_feature 205 /gene="PPH alpha" /note="mature protein start" misc_feature 205..796 /gene="PPH alpha" /note="encodes astacin domain" misc_feature 472..541 /gene="PPH alpha" /note="encodes astacin signature" misc_feature 472..484 /gene="PPH alpha" /note="encodes zinc-binding site" misc_feature 799..1306 /gene="PPH alpha" /note="encodes MAM domain" misc_feature 2029..2134 /gene="PPH alpha" /note="encodes EGF-like region" misc_feature 2152..2227 /gene="PPH alpha" /note="encodes putative membrane anchor" misc_feature 2230..2245 /gene="PPH alpha" /note="encodes putative cystosolic domain" BASE COUNT 796 a 685 c 689 g 750 t ORIGIN 1 cttgcagcaa tggcttggat tagatccact tgcattctct tttttacctt gctttttgcc 61 cacatagcag ctgtaccgat taagcatctt cctgaagaaa atgtacatga tgcagatttt 121 ggtgaacaga aggatatttc agaaatcaat ttagctgcag gcttggacct ctttcaaggg 181 gacatcctct tgcagaaatc cagaaatggc ctgagagacc caaacaccag gtggacgttc 241 cccattcctt acatcttggc tgataatttg gggctgaatg ctaaaggagc cattctgtat 301 gcctttgaga tgttccgtct caagtcctgt gtggatttca agccctatga aggagagagc 361 tcatatatca tatttcaaca gtttgatggg tgctggtctg aggttggtga ccaacatgtg 421 ggacagaaca tttccattgg ccaaggatgt gcctataagg ccatcataga acacgagatc 481 ctgcatgctt tgggatttta ccacgagcag tcaaggacgg accgggatga ttatgtgaac 541 atctggtggg accaaattct ttcaggttac cagcacaact ttgacaccta tgatgatagc 601 ttaatcacag acctcaatac accctatgat tatgagtctt tgatgcacta ccagcctttc 661 tcatttaaca agaatgcaag tgttcccacc atcacagcca agatccctga gtttaactcc 721 attatcggac aacgcctgga tttcagtgcc attgatttag agaggctgaa ccgaatgtac 781 aattgcacca caactcacac tcttttggac cactgtactt ttgagaaggc aaacatctgt 841 ggaatgattc agggcaccag agatgacact gactgggccc atcaggacag tgctcaggct 901 ggagaagtgg atcacacctt gttgggacaa tgcacaggtg ccggctactt catgcagttc 961 agcaccagct cggggtccgc ggaagaggca gccctactgg agtctcggat tctttaccca 1021 aagaggaagc agcagtgcct gcaatttttc tataaaatga cgggaagtcc ttcagacaga 1081 ctcgttgtct gggtcaggag ggatgacagc acaggcaatg ttcgcaagtt ggtgaaggtg 1141 cagacttttc aaggagatga tgaccacaat tggaaaattg cccatgtggt gctcaaagag 1201 gaacagaagt ttcgctacct tttccagggc acaaaaggcg accctcagaa ctcaactggg 1261 ggaatttacc tagatgacat cactctgaca gaaaccccct gccccacagg ggtctggaca 1321 gtccggaatt tctcccaagt ccttgagaac accagcaaag gggacaagct tcagagccct 1381 cgattctaca attcggaggg atatggtttt ggggtaactt tatacccaaa tagcagagaa 1441 agctctggtt acttgagact tgcttttcat gtgtgcagtg gggagaacga tgctatcctg 1501 gagtggccgg tagaaaacag acaggtgata attaccatcc ttgaccagga gcctgatgtc 1561 cggaacagga tgtcctcaag catggtgttc actacctcga agtcgcacac atctccagcg 1621 ataaatgaca ctgtcatctg ggacaggccg tccagggtgg gaacctatca tacagactgt 1681 aattgtttta gaagcatcga cttgggctgg agtggtttca tttcccacca aatgctgaaa 1741 aggaggagtt tcctgaaaaa tgatgacctc atcatatttg tggactttga agatatcacc 1801 cacctcagcc agactgaagt tccctctaaa ggcaaaagac tgagccccca aggcctcatt 1861 ctccaaggcc aggagcagca ggtctccgaa gaaggttcgg gaaaggccat gttagaggaa 1921 gccctacctg tcagcctgag ccaggggcag cccagccgac agaagcggtc ggtggagaac 1981 acaggccccc tggaggacca taactggcca cagtacttca gagacccatg tgacccaaac 2041 ccttgccaaa atgacggcat ctgtgtgaac gtgaagggga tggcgagctg caggtgcatc 2101 tctggacatg ctttcttcta cacgggggag cgctgtcagt cggccgaggt gcacggcagt 2161 gtcctgggca tggtgatcgg aggcacggct ggcgtgatct tcttgacctt ctccatcatc 2221 gccatccttt cccaaaggcc aaggaagtga cctgcctgct ggcattggcc agaccacagc 2281 agcacctcct ccatgcaggc cttaactttc ccatgttcaa tgcagtttgg ggcagctttt 2341 ttatcagcct tgctttggat aggacctcca aggactaagc ctccagcccc atgtgtgacc 2401 cttgtcatct ctctgcccca cataattatg ttactttgct atgtgctcct aatgtatcta 2461 gtgtgtcctg tgacaacact catcacactt cattgtaaat cacttgtttt attgactgtc 2521 tttcctatag actgtaagct ccatgagggc aggcacatgt tgttctcatt gaccgtgctg 2581 gccccagtgc ctagatgcat ggctggcaca ttgttggcac tcaacaatgg ttgaatgaat 2641 aaaacaataa atgaatgaat aactaagata tagaaactct catttatatt gcagattgaa 2701 tatatatgat gaaattctta tgttgaatat gttagaatca aatactcatt tttcattaga 2761 tacagtagtg tcatcactct tttaagatct tgttaaagat ttcaaataaa ggtacttctg 2821 gcgagccagg ctgcacagca tttgctttcc tctgagattc taagagaagg cctttaataa 2881 atttaataaa tattgagtta gcaaaaaaaa aaaaaaaaaa // LOCUS HUMPPKKA 2267 bp mRNA PRI 08-JAN-1995 DEFINITION Nucleotide sequence of the cDNA insert of lambda PK129 coding for human plasma prekallikrein. ACCESSION M13143 NID g190262 KEYWORDS prekallikrein. SOURCE Human liver, cDNA to mRNA, clone lambda pK129. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2267) AUTHORS Chung,D.W., Fujikawa,K., McMullen,B.A. and Davie,E.W. TITLE Human plasma prekallikrein, a zymogen to a serine protease that contains four tandem repeats JOURNAL Biochemistry 25 (9), 2410-2417 (1986) MEDLINE 86243359 FEATURES Location/Qualifiers source 1..2267 /organism="Homo sapiens" /db_xref="taxon:9606" /map="4q34-q35" gene 94..2010 /gene="KLK3" CDS 94..2010 /gene="KLK3" /note="plasma prekallikrein" /codon_start=1 /db_xref="GDB:G00-127-575" /db_xref="PID:g190263" /translation="MILFKQATYFISLFATVSCGCLTQLYENAFFRGGDVASMYTPNA QYCQMRCTFHPRCLLFSFLPASSINDMEKRFGCFLKDSVTGTLPKVHRTGAVSGHSLK QCGHQISACHRDIYKGVDMRGVNFNVSKVSSVEECQKRCTNNIRCQFFSYATQTFHKA EYRNNCLLKYSPGGTPTAIKVLSNVESGFSLKPCALSEIGCHMNIFQHLAFSDVDVAR VLTPDAFVCRTICTYHPNCLFFTFYTNVWKIESQRNVCLLKTSESGTPSSSTPQENTI SGYSLLTCKRTLPEPCHSKIYPGVDFGGEELNVTFVKGVNVCQETCTKMIRCQFFTYS LLPEDCKEEKCKCFLRLSMDGSPTRIAYGTQGSSGYSLRLCNTGDNSVCTTKTSTRIV GGTNSSWGEWPWQVSLQVKLTAQRHLCGGSLIGHQWVLTAAHCFDGLPLQDVWRIYSG ILNLSDITKDTPFSQIKEIIIHQNYKVSEGNHDIALIKLQAPLNYTEFQKPICLPSKG DTSTIYTNCWVTGWGFSKEKGEIQNILQKVNIPLVTNEECQKRYQDYKITQRMVCAGY KEGGKDACKGDSGGPLVCKHNGMWRLVGITSWGEGCARREQPGVYTKVAEYMDWILEK TQSSDGKAQMQSPA" BASE COUNT 677 a 474 c 517 g 599 t ORIGIN 1 ggtagtagca aatattcaaa tgagaacagc ttgaagaccg ttcattttta agtgacaaga 61 gactcacctc caagaagcaa ttgtgttttc agaatgattt tattcaagca agcaacttat 121 ttcatttcct tgtttgctac agtttcctgt ggatgtctga ctcaactcta tgaaaacgcc 181 ttcttcagag gtggggatgt agcttccatg tacaccccaa atgcccaata ctgccagatg 241 aggtgcacat tccacccaag gtgtttgcta ttcagttttc ttccagcaag ttcaatcaat 301 gacatggaga aaaggtttgg ttgcttcttg aaagatagtg ttacaggaac cctgccaaaa 361 gtacatcgaa caggtgcagt ttctggacat tccttgaagc aatgtggtca tcaaataagt 421 gcttgccatc gagacattta taaaggagtt gatatgagag gagtcaattt taatgtgtct 481 aaggttagca gtgttgaaga atgccaaaaa aggtgcacca ataacattcg ctgccagttt 541 ttttcatatg ccacgcaaac atttcacaag gcagagtacc ggaacaattg cctattaaag 601 tacagtcccg gaggaacacc taccgctata aaggtgctga gtaacgtgga atctggattc 661 tcactgaagc cctgtgccct ttcagaaatt ggttgccaca tgaacatctt ccagcatctt 721 gcgttctcag atgtggatgt tgccagggtt ctcactccag atgcttttgt gtgtcggacc 781 atctgcacct atcaccccaa ctgcctcttc tttacattct atacaaatgt atggaaaatc 841 gagtcacaaa gaaatgtttg tcttcttaaa acatctgaaa gtggcacacc aagttcctct 901 actcctcaag aaaacaccat atctggatat agccttttaa cctgcaaaag aactttacct 961 gaaccctgcc attctaaaat ttacccggga gttgactttg gaggagaaga attgaatgtg 1021 acttttgtta aaggagtgaa tgtttgccaa gagacttgca caaagatgat tcgctgtcag 1081 tttttcactt attctttact cccagaagac tgtaaggaag agaagtgtaa gtgtttctta 1141 agattatcta tggatggttc tccaactagg attgcgtatg ggacacaagg gagctctggt 1201 tactctttga gattgtgtaa cactggggac aactctgtct gcacaacaaa aacaagcaca 1261 cgcattgttg gaggaacaaa ctcttcttgg ggagagtggc cctggcaggt gagcctgcag 1321 gtgaagctga cagctcagag gcacctgtgt ggagggtcac tcataggaca ccagtgggtc 1381 ctcactgctg cccactgctt tgatgggctt cccctgcagg atgtttggcg catctatagt 1441 ggcattttaa atctgtcaga cattacaaaa gatacacctt tctcacaaat aaaagagatt 1501 attattcacc aaaactataa agtctcagaa gggaatcatg atatcgcctt gataaaactc 1561 caggctcctt tgaattacac tgaattccaa aaaccaatat gcctaccttc caaaggtgac 1621 acaagcacaa tttataccaa ctgttgggta accggatggg gcttctcgaa ggagaaaggt 1681 gaaatccaaa atattctaca aaaggtaaat attcctttgg taacaaatga agaatgccag 1741 aaaagatatc aagattataa aataacccaa cggatggtct gtgctggcta taaagaaggg 1801 ggaaaagatg cttgtaaggg agattcaggt ggtcccttag tttgcaaaca caacggaatg 1861 tggcgtttgg tgggcatcac aagctggggt gaaggctgtg cccgcaggga gcaacctggt 1921 gtctacacca aagtcgctga gtacatggac tggattttag agaaaacaca gagcagtgat 1981 ggaaaagctc agatgcagtc accagcatga gaagcagtcc agagtctagg caatttttac 2041 aacctgagtt caagtcaaat tctgagcctg gggggtcctc atctgcaaag catggagagt 2101 ggcatcttct ttgcatccta aggacgaaag acacagtgca ctcagagctg ctgaggacaa 2161 tgtctgctga agcccgcttt cagcacgccg taaccagggg ctgacaatgc gaggtcgcaa 2221 ctgagatctc catgactgtg tgttgtgaaa taaaatggtg aaagatc // LOCUS HUMPPNT4P 1404 bp DNA PRI 08-JAN-1995 DEFINITION Human neurotrophin-4 (NT-4) gene, complete cds. ACCESSION M86528 NID g190264 KEYWORDS nerve growth factor; neurotrophin-4. SOURCE Homo sapiens placenta DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1404) AUTHORS Ip,N.Y., Ibanez,C.F., Nye,S.H., McClain,J., Jones,P.F., Gies,D.R., Belluscio,L., Le Beau,M.M., Espinosa,R.III., Squinto,S.P., Persson,H. and Yancopoulos,G.D. TITLE Mammalian neurotrophin-4: structure, chromosomal localization, tissue distribution, and receptor specificity JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (7), 3060-3064 (1992) MEDLINE 92212967 FEATURES Location/Qualifiers source 1..1404 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" gene 475..1107 /gene="NT-4" CDS 475..1107 /gene="NT-4" /note="pre-pro protein" /codon_start=1 /product="neurotrophin-4" /db_xref="PID:g190265" /translation="MLPLPSCSLPILLLFLLPSVPIESQPPPSTLPPFLAPEWDLLSP RVVLSRGAPAGPPLLFLLEAGAFRESAGAPANRSRRGVSETAPASRRGELAVCDAVSG WVTDRRTAVDLRGREVEVLGEVPAAGGSPLRQYFFETRCKADNAEEGGPGAGGGGCRG VDRRHWVSECKAKQSYVRALTADAQGRVGWRWIRIDTACVCTLLSRTGRA" mat_peptide 715..1104 /gene="NT-4" /function="nerve growth factor-related neurotrophic factor" /product="neurotrophin-4" BASE COUNT 273 a 405 c 386 g 340 t ORIGIN 1 cttgtcaccc aggtggcagg ggagtggtgc actctctgct cactgcaacc tcggcctcct 61 gggttcgagt gattctccta cctcagccta ctgagtagct gggattacag gcgtgcagca 121 ctatgcccgg ttaattttgg tatttttggt agagatgagg tttcaccatg ttgaccagct 181 gctctggaac tcctgacctc aagtcatcca cctgcctcag cctcccagag tgctgggatt 241 agaggtgtgg ggcacagtgc ctggcctgta gtagttgaat atttattatt aatctacaag 301 ttgcgcatta cgcaagccct agatataggg tcccccaaac ttctagaaca agggcttccc 361 cacaatcctg gcaggcaagc ctcccctggg gttcccaact tctttcccca ctgaagtttt 421 tacccccttc tctaatccca gcctccctct ttctgtctcc aggtgctccg agagatgctc 481 cctctcccct catgctccct ccccatcctc ctccttttcc tcctccccag tgtgccaatt 541 gagtcccaac ccccaccctc aacattgccc ccttttctgg cccctgagtg ggaccttctc 601 tccccccgag tagtcctgtc taggggtgcc cctgctgggc cccctctgct cttcctgctg 661 gaggctgggg cctttcggga gtcagcaggt gccccggcca accgcagccg gcgtggggtg 721 agcgaaactg caccagcgag tcgtcggggt gagctggctg tgtgcgatgc agtcagtggc 781 tgggtgacag accgccggac cgctgtggac ttgcgtgggc gcgaggtgga ggtgttgggc 841 gaggtgcctg cagctggcgg cagtcccctc cgccagtact tctttgaaac ccgctgcaag 901 gctgataacg ctgaggaagg tggcccgggg gcaggtggag ggggctgccg gggagtggac 961 aggaggcact gggtatctga gtgcaaggcc aagcagtcct atgtgcgggc attgaccgct 1021 gatgcccagg gccgtgtggg ctggcgatgg attcgaattg acactgcctg cgtctgcaca 1081 ctcctcagcc ggactggccg ggcctgagac ccatgcccag gaaaataaca gagctggatg 1141 ctgagagacc tcagggatgg cccagctgat ctaaggaccc cagtttggga actcatcaaa 1201 taatcacaaa atcacaattc tctgattttg agctcaatct ctgcaggatg ggtgaaacca 1261 catggggttt tggaggttga ataggagttc tcctggagca acttgagggt aataatgatg 1321 atgatataat aataatagcc actatttact gagtgtttac tgtttcttat ccctaataca 1381 taactcctca gatcaactct catg // LOCUS HUMPPPB1A 3215 bp mRNA PRI 08-JAN-1995 DEFINITION Human (clone lambda-16-1) non-receptor tyrosine phosphatase 1 (PTPN1) mRNA, complete cds. ACCESSION M33689 NID g190271 KEYWORDS non-receptor tyrosine phosphatase 1. SOURCE Homo sapiens (tissue library: Clontech) placenta cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3215) AUTHORS Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C., Bruskin,A., Green,N.R. and Hill,D.E. TITLE Molecular cloning and chromosome mapping of the human gene encoding protein phosphotyrosyl phosphatase 1B JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87 (13), 5148-5152 (1990) MEDLINE 90311360 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Hill, 13-APR-1990. FEATURES Location/Qualifiers source 1..3215 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /tissue_lib="Clontech" /map="20q13.1-q13.2" gene 73..1380 /gene="PTPN1" CDS 73..1380 /gene="PTPN1" /EC_number="3.1.3.48" /codon_start=1 /db_xref="GDB:G00-126-728" /product="non-receptor tyrosine phosphatase 1" /db_xref="PID:g190272" /translation="MEMEKEFEQIDKSGSWAAIYQDIRHEASDFPCRVAKLPKNKNRN RYRDVSPFDHSRIKLHQEDNDYINASLIKMEEAQRSYILTQGPLPNTCGHFWEMVWEQ KSRGVVMLNRVMEKGSLKCAQYWPQKEEKEMIFEDTNLKLTLISEDIKSYYTVRQLEL ENLTTQETREILHFHYTTWPDFGVPESPASFLNFLFKVRESGSLSPEHGPVVVHCSAG IGRSGTFCLADTCLLLMDKRKDPSSVDIKKVLLEMRKFRMGLIQTADQLRFSYLAVIE GAKFIMGDSSVQDQWKELSHEDLEPPPEHIPPPPRPPKRILEPHNGKCREFFPNHQWV KEETQEDKDCPIKEEKGSPLNAAPYGIESMSQDTEVRSRVVGGSLRGAQAASPAKGEP SLPEKDEDHALSYWKPFLVNMCVATVLTAGAYLCYRFLFNSNT" BASE COUNT 818 a 828 c 801 g 768 t ORIGIN 20q13.1-q13.2. 1 gcgcgacgcg gcctagagcg gcagacggcg cagtgggccg agaaggaggc gcagcagccg 61 ccctggcccg tcatggagat ggaaaaggag ttcgagcaga tcgacaagtc cgggagctgg 121 gcggccattt accaggatat ccgacatgaa gccagtgact tcccatgtag agtggccaag 181 cttcctaaga acaaaaaccg aaataggtac agagacgtca gtccctttga ccatagtcgg 241 attaaactac atcaagaaga taatgactat atcaacgcta gtttgataaa aatggaagaa 301 gcccaaagga gttacattct tacccagggc cctttgccta acacatgcgg tcacttttgg 361 gagatggtgt gggagcagaa aagcaggggt gtcgtcatgc tcaacagagt gatggagaaa 421 ggttcgttaa aatgcgcaca atactggcca caaaaagaag aaaaagagat gatctttgaa 481 gacacaaatt tgaaattaac attgatctct gaagatatca agtcatatta tacagtgcga 541 cagctagaat tggaaaacct tacaacccaa gaaactcgag agatcttaca tttccactat 601 accacatggc ctgactttgg agtccctgaa tcaccagcct cattcttgaa ctttcttttc 661 aaagtccgag agtcagggtc actcagcccg gagcacgggc ccgttgtggt gcactgcagt 721 gcaggcatcg gcaggtctgg aaccttctgt ctggctgata cctgcctctt gctgatggac 781 aagaggaaag acccttcttc cgttgatatc aagaaagtgc tgttagaaat gaggaagttt 841 cggatggggc tgatccagac agccgaccag ctgcgcttct cctacctggc tgtgatcgaa 901 ggtgccaaat tcatcatggg ggactcttcc gtgcaggatc agtggaagga gctttcccac 961 gaggacctgg agcccccacc cgagcatatc cccccacctc cccggccacc caaacgaatc 1021 ctggagccac acaatgggaa atgcagggag ttcttcccaa atcaccagtg ggtgaaggaa 1081 gagacccagg aggataaaga ctgccccatc aaggaagaaa aaggaagccc cttaaatgcc 1141 gcaccctacg gcatcgaaag catgagtcaa gacactgaag ttagaagtcg ggtcgtgggg 1201 ggaagtcttc gaggtgccca ggctgcctcc ccagccaaag gggagccgtc actgcccgag 1261 aaggacgagg accatgcact gagttactgg aagcccttcc tggtcaacat gtgcgtggct 1321 acggtcctca cggccggcgc ttacctctgc tacaggttcc tgttcaacag caacacatag 1381 cctgaccctc ctccactcca cctccaccca ctgtccgcct ctgcccgcag agcccacgcc 1441 cgactagcag gcatgccgcg gtaggtaagg gccgccggac cgcgtagaga gccgggcccc 1501 ggacggacgt tggttctgca ctaaaaccca tcttccccgg atgtgtgtct cacccctcat 1561 ccttttactt tttgcccctt ccactttgag taccaaatcc acaagccatt ttttgaggag 1621 agtgaaagag agtaccatgc tggcggcgca gagggaaggg gcctacaccc gtcttggggc 1681 tcgccccacc cagggctccc tcctggagca tcccaggcgg gcggcacgcc agacagcccc 1741 ccccttgaat ctgcagggag caactctcca ctccatattt atttaaacaa ttttttcccc 1801 aaaggcatcc atagtgcact agcattttct tgaaccaata atgtattaaa attttttgat 1861 gtcagccttg catcaagggc tttatcaaaa agtacaataa taaatcctca ggtagtactg 1921 ggaatggaag gctttgccat gggcctgctg cgtcagacca gtactgggaa ggaggacggt 1981 tgtaagcagt tgttatttag tgatattgtg ggtaacgtga gaagatagaa caatgctata 2041 atatataatg aacacgtggg tatttaataa gaaacatgat gtgagattac tttgtcccgc 2101 ttattctgct ccctgttatc tgctagatct agttctcaat cactgctccc ccgtgtgtat 2161 tagaatgcat gtaaggtctt cttgtgtcct gatgaaaaat atgtgcttga aatgagaaac 2221 tttgatctct gcttactaat gtgccccatg tccaagtcca acctgcctgt gcatgacctg 2281 atcattacat ggctgtggtt cctaagcctg ttgctgaagt cattgtcgct cagcaatagg 2341 gtgcagtttt ccaggaatag gcatttgcct aattcctggc atgacactct agtgacttcc 2401 tggtgaggcc cagcctgtcc tggtacagca gggtcttgct gtaactcaga cattccaagg 2461 gtatgggaag ccatattcac acctcacgct ctggacatga tttagggaag cagggacacc 2521 ccccgccccc cacctttggg atcagcctcc gccattccaa gtcgacactc ttcttgagca 2581 gaccgtgatt tggaagagag gcacctgctg gaaaccacac ttcttgaaac agcctgggtg 2641 acggtccttt aggcagcctg ccgccgtctc tgtcccggtt caccttgccg agagaggcgc 2701 gtctgcccca ccctcaaacc ctgtggggcc tgatggtgct cacgactctt cctgcaaagg 2761 gaactgaaga cctccacatt aagtggcttt ttaacatgaa aaacacggca gctgtagctc 2821 ccgagctact ctcttgccag cattttcaca ttttgccttt ctcgtggtag aagccagtac 2881 agagaaattc tgtggtggga acattcgagg tgtcaccctg cagagctatg gtgaggtgtg 2941 gataaggctt aggtgccagg ctgtaagcat tctgagctgg cttgttgttt ttaagtcctg 3001 tatatgtatg tagtagtttg ggtgtgtata tatagtagca tttcaaaatg gacgtactgg 3061 tttaacctcc tatccttgga gagcagctgg ctctccacct tgttacacat tatgttagag 3121 aggtagcgag ctgctctgct atgtccttaa gccaatattt actcatcagg tcattatttt 3181 ttacaatggc catggaataa accattttta caaaa // LOCUS HUMPPR 1815 bp mRNA PRI 15-JUN-1989 DEFINITION Human protective protein mRNA, complete cds. ACCESSION M22960 J03159 M18453 NID g190282 KEYWORDS protective protein. SOURCE Human testes, cDNA to mRNA, (library of Clontech). ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1815) AUTHORS Galjart,N.J., Gillemans,N., Harris,A., van der Horst,G.T.J., Verheijen,F.W., Galjaard,H. and d'Azzo,A. TITLE Expression of cDNA encoding the human 'protective protein' associated with lysomsomal beta-galactosidase and neuraminidase: Homology to yeast proteases JOURNAL Cell 54, 755-764 (1988) MEDLINE 88311078 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by A.d'Azzo, 13-JUL-1988. FEATURES Location/Qualifiers source 1..1815 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..1815 /note="PPR mRNA" sig_peptide 7..90 /note="protective protein signal peptide" CDS 7..1449 /note="protective protein precursor" /codon_start=1 /db_xref="PID:g190283" /translation="MIRAAPPPLFLLLLLLLLLVSWASRGEAAPDQDEIQRLPGLAKQ PSFRQYSGYLKSSGSKHLHYWFVESQKDPENSPVVLWLNGGPGCSSLDGLLTEHGPFL VQPDGVTLEYNPYSWNLIANVLYLESPAGVGFSYSDDKFYATNDTEVAQSNFEALQDF FRLFPEYKNNKLFLTGESYAGIYIPTLAVLVMQDPSMNLQGLAVGNGLSSYEQNDNSL VYFAYYHGLLGNRLWSSLQTHCCSQNKCNFYDNKDLECVTNLQEVARIVGNSGLNIYN LYAPCAGGVPSHFRYEKDTVVVQDLGNIFTRLPLKRMWHQALLRSGDKVRMDPPCTNT TAASTYLNNPYVRKALNIPEQLPQWDMCNFLVNLQYRRLYRSMNSQYLKLLSSQKYQI LLYNGDVDMACNFMGDEWFVDSLNQKMEVQRRPWLVKYGDSGEQIAGFVKEFSHIAFL TIKGAGHMVPTDKPLAAFTMFSRFLNKQPY" mat_peptide 91..984 /note="32 kd protective protein" mat_peptide 985..1446 /note="20 kd protective protein" BASE COUNT 390 a 559 c 466 g 400 t ORIGIN Unreported. 1 ggggagatga tccgagccgc gccgccgccg ctgttcctgc tgctgctgct gctgctgctg 61 ctagtgtcct gggcgtcccg aggcgaggca gcccccgacc aggacgagat ccagcgcctc 121 cccgggctgg ccaagcagcc gtctttccgc cagtactccg gctacctcaa aagctccggc 181 tccaagcacc tccactactg gtttgtggag tcccagaagg atcccgagaa cagccctgtg 241 gtgctttggc tcaatggggg tcccggctgc agctcactag atgggctcct cacagagcat 301 ggccccttcc tggtccagcc agatggtgtc accctggagt acaaccccta ttcttggaat 361 ctgattgcca atgtgttata cctggagtcc ccagctgggg tgggcttctc ctactccgat 421 gacaagtttt atgcaactaa tgacactgag gtcgcccaga gcaattttga ggcccttcaa 481 gatttcttcc gcctctttcc ggagtacaag aacaacaaac ttttcctgac cggggagagc 541 tatgctggca tctacatccc caccctggcc gtgctggtca tgcaggatcc cagcatgaac 601 cttcaggggc tggctgtggg caatggactc tcctcctatg agcagaatga caactccctg 661 gtctactttg cctactacca tggccttctg gggaacaggc tttggtcttc tctccagacc 721 cactgctgct ctcaaaacaa gtgtaacttc tatgacaaca aagacctgga atgcgtgacc 781 aatcttcagg aagtggcccg catcgtgggc aactctggcc tcaacatcta caatctctat 841 gccccgtgtg ctggaggggt gcccagccat tttaggtatg agaaggacac tgttgtggtc 901 caggatttgg gcaacatctt cactcgcctg ccactcaagc ggatgtggca tcaggcactg 961 ctgcgctcag gggataaagt gcgcatggac cccccctgca ccaacacaac agctgcttcc 1021 acctacctca acaacccgta cgtgcggaag gccctcaaca tcccggagca gctgccacaa 1081 tgggacatgt gcaactttct ggtaaactta cagtaccgcc gtctctaccg aagcatgaac 1141 tcccagtatc tgaagctgct tagctcacag aaataccaga tcctattata taatggagat 1201 gtagacatgg cctgcaattt catgggggat gagtggtttg tggattccct caaccagaag 1261 atggaggtgc agcgccggcc ctggttagtg aagtacgggg acagcgggga gcagattgcc 1321 ggcttcgtga aggagttctc ccacatcgcc tttctcacga tcaagggcgc cggccacatg 1381 gttcccaccg acaagcccct cgctgccttc accatgttct cccgcttcct gaacaagcag 1441 ccatactgat gaccacagca accagctcca cggcctgatg cagcccctcc cagcctctcc 1501 cgctaggaga gtcctcttct aagcaaagtg cccctgcagg cgggttctgc cgccaggact 1561 gcccccttcc cagagccctg tacatcccag actgggccca gggtctccca tagacagcct 1621 gggggcaagt tagcacttta ttcccgcagc agttcctgaa tggggtggcc tggccccttc 1681 tctgcttaaa gaatgccctt tatgatgcac tgattccatc ccaggaaccc aacagagctc 1741 aggacagccc acagggaggt ggtggacgga ctgtaattga tagattgatt atggaattaa 1801 attgggtaca gcttc // LOCUS HUMPPRO 3070 bp mRNA PRI 27-JAN-1993 DEFINITION Homo sapiens (clone DN10mel) P protein mRNA, complete cds. ACCESSION M99564 NID g190284 KEYWORDS P gene. SOURCE Homo sapiens (library: cDNA of R. Neve) fetal, and adult brain, and melanocyte cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3070) AUTHORS Rinchik,E.M., Bultman,S.J., Horsthemke,B., Lee,S.-T., Strunk,K.M., Spritz,R.A., Avidano,K.A., Jong,M.T.C. and Nicholls,R.D. TITLE A gene for the mouse pink-eyed dilution locus and for human type II oculocutaneous albinism JOURNAL Nature 361, 72-76 (1993) MEDLINE 93133287 FEATURES Location/Qualifiers source 1..3070 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal, and adult" /tissue_type="brain, and melanocyte" /tissue_lib="cDNA of R. Neve" 5'UTR 1..52 CDS 53..2569 /codon_start=1 /product="P protein" /db_xref="PID:g190285" /translation="MHLEGRDGRRYPGAPAVELLQTSVPSGLAELVAGKRRLPRGAGG ADPSHSCPRGAAGQSSWAPAGQEFASFLTKGRSHSSLPQMSSSRSKDSCFTENTPLLR NSLQEKGSRCIPVYHPEFITAEESWEDSSADWERRYLLSREVSGLSASASSEKGDLLD SPHIRLRLSKLRRCVQWLKVMGLFAFVVLCSILFSLYPDQGKLWQLLALSPLENYSVN LSSHVDSTLLQVDLAGALVASGPSRPGREEHIVVELTQDDALGSRWRRPQQVTHNWTV YLNPRRSEHSVMSRTFEVLTRETVSISIRASLQQTQAVPLLMAHQYLRGSVETQVTIA TAILAGVYALIIFEIVHRTLAAMLGSLAALAALAVIGDRPSLTHVVEWIDFETLALLF GMMILVAIFSETGFFDYCAVKAYRLSRGRVWAMIIMLCLIAAVLSAFLDNVTTMLLFT PVTIRLCEVLNLDPRQVLIAEVIFTNIGGAATAIGDPPNVIIVSNQELRKMGLDFAGF TAHMFIGICLVLLVCFPLLRLLYWNRKLYNKEPSEIVELKHEIHVWRLTAQRISPASR EETAVRRLLLGKVLALEHLLARRLHTFHRQISQEDKNWETNIQELQKKHRISDGILLA KCLTVLGFVIFMFFLNSFVPGIHLDLGWIAILGAIWLLILADIHDFEIILHRVEWATL LFFAALFVLMEALAHLHLIEYVGEQTALLIKMVPEEQRLIAAIVLVVWVSALASSLID NIPFTATMIPVLLNLSHDPEVGLPAPPLMYALAFGACLGGNGTLIGASANVVCAGIAE QHGYGFSFMEFFRLGFPMMVVSCTVGMCYLLVAHVVVGWN" 3'UTR 2570..3070 polyA_signal 3061..3066 BASE COUNT 694 a 790 c 808 g 778 t ORIGIN 1 cactcctgga gaaagatctg caagtgcgca gagagaagac tggcagtgga gcatgcatct 61 ggagggcaga gacggcaggc ggtaccccgg cgcgccggcg gtggagctcc tgcagacgtc 121 cgtgcccagc ggactcgctg aacttgtggc cggcaagcgc aggcttcctc ggggagccgg 181 tggagctgac ccctcgcact cctgccccag gggggctgcc gggcagagct cttgggctcc 241 tgcaggccag gagtttgctt cattcctcac aaaagggagg tctcactctt ctttgcccca 301 gatgtccagc tccaggtcta aagattcctg ctttacagaa aacactcctt tgctgaggaa 361 ttccttacag gagaaagggt cacggtgcat acctgtttac catccagagt tcatcactgc 421 tgaagagtct tgggaagaca gctctgctga ctgggagcga agatacctgc taagcaggga 481 ggtgtctggt ctgtctgcat ctgcctcctc cgagaaggga gaccttctgg acagcccgca 541 catccgactc cgtctttcca agctgaggcg ctgtgtgcag tggctgaaag tcatgggcct 601 gtttgccttt gtggtgctgt gttctatttt gttcagccta tatccggatc aaggaaagct 661 ctggcagctg ttggccttat caccgctgga gaactactcc gtgaacctta gcagccacgt 721 ggactccacg ctgctgcagg tggacctggc aggggcccta gtggccagtg ggccgagtcg 781 tcctgggagg gaagagcaca tcgtggtgga gctgacccag gatgacgctt tgggctccag 841 gtggcggcgg ccacagcagg tcactcacaa ctggacggtg tatttaaatc cgaggagaag 901 cgagcactca gtgatgagca ggacctttga ggtactgacc agagagacgg tgtccatcag 961 catccgggcc tccctgcagc agacccaggc tgtccctctt ttgatggctc atcagtacct 1021 ccgcggaagt gtagaaaccc aggtgaccat cgcgacggcc atcctcgcgg gcgtctacgc 1081 gctgatcata tttgagatcg tgcacagaac tctggcggcc atgctgggtt cccttgcagc 1141 actggcagca ctggctgtga ttggcgatag acccagcctg acccatgtgg tggagtggat 1201 tgattttgag acgctggccc tgctgtttgg catgatgatc ttagtagcca tattttcaga 1261 aacgggattt ttcgattatt gtgctgtaaa ggcataccgg ctctcccggg gacgggtgtg 1321 ggccatgatc atcatgctct gtctcatcgc ggccgtcctc tctgccttct tggacaacgt 1381 caccaccatg ctcctcttca cgcctgtgac cataaggttg tgtgaggtgc tcaaccttga 1441 tccaagacaa gtcctgattg cagaagtgat cttcacaaac attggaggag ctgccactgc 1501 catcggggac cctccaaatg tcattattgt ttccaaccaa gagctgagga agatgggcct 1561 ggactttgcc ggattcactg cacacatgtt cattgggatt tgccttgttc tcctggtctg 1621 ctttccgctc ctcagactcc tttactggaa cagaaagctt tataacaagg aacccagtga 1681 gattgttgaa ctgaagcacg agattcacgt ctggcgcctg actgctcagc gcatcagccc 1741 ggccagccgc gaggagacag ctgtgcgccg cctgctgctg gggaaggtgc tggcactgga 1801 gcacctgctc gcccggaggc tgcacacctt ccacagacag atctcacagg aggacaaaaa 1861 ttgggagacc aatatccaag aactccaaaa aaagcatagg atatctgacg ggattctgct 1921 cgccaaatgc ctgacagtgt tgggatttgt tatcttcatg tttttcctca attcgtttgt 1981 ccctggcatt catcttgatc ttggatggat tgctattctg ggtgccatct ggttgctaat 2041 tttagctgat attcatgatt ttgagataat tctacacaga gtggaatggg caacccttct 2101 gttttttgca gcgctctttg ttctgatgga ggcattggca catctccact taatagaata 2161 tgttggagaa caaactgctt tgctaataaa gatggtccca gaggagcagc gcctcatagc 2221 cgccattgtc ctggtggtgt gggtctcagc cctggcgtcg tccctgattg acaacatccc 2281 gttcactgct accatgattc ccgtgctcct gaacctgagc cacgaccctg aggttggcct 2341 gcccgcaccg ccgctcatgt atgccctggc cttcggtgct tgcctgggag gcaacgggac 2401 actgattggc gcgtcggcaa acgtcgtgtg tgcagggatt gcagaacagc atggatatgg 2461 gttctccttc atggaatttt tcaggctggg cttcccaatg atggttgtgt cctgcactgt 2521 tgggatgtgt tatctccttg tggctcatgt ggtggtggga tggaattaat agacatccat 2581 ctattgctcg aagactaaag gaaacttcat ccatcacaac ccattagtca taaaactacc 2641 ctgaccccac tgtttgaaga agaaaaggtg cttaccctgg agatgctaca gagacacagt 2701 ggaatagacc ttgacactaa cactctaatt caagcgaatg ttggaacacc atgacctcct 2761 ctgtgtgtcc tttctcccca aggacaaaat gtagaaagat gtgagataac ttactcaaga 2821 ttcccctcca gaaaaatacg tatgtttaaa aacccttcct gctatacata ggaaaagaca 2881 cacatccacc taaaattgac tgtactgttt aactgtcaat tctcctgagg ctaaacacag 2941 tttgtttttc ttgtaatcac ttttcatgtt aaaataatca gcattcaaat tgtatgcttt 3001 ctgaatatag actttctggg aaaaggttta ctgctcgtaa ggaaacattt tatgtattaa 3061 aataaactgt // LOCUS HUMPPT 2287 bp DNA PRI 11-JAN-1996 DEFINITION Homo sapiens palmitoyl-protein thioesterase gene, complete cds. ACCESSION L42809 NID g1160966 KEYWORDS CLN1 gene; infantile Batten's disease; palmitoyl-protein thioesterase; thioesterase. SOURCE Homo sapiens (clone library: Stratagene cat. no. 935205) female 2 yr old brain temportal cortex DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2287) AUTHORS Vesa,J., Hellsten,E., Verkruyse,L.A., Camp,L.A., Rapola,J., Santavuori,P., Hofmann,S.L. and Peltonen,L. TITLE Mutations in the palmitoyl protein thioesterase gene causing infantile neuronal ceroid lipofuscinosis JOURNAL Nature 376 (6541), 584-587 (1995) MEDLINE 95364950 FEATURES Location/Qualifiers source 1..2287 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="Stratagene cat. no. 935205" /dev_stage="2 yr old" /sex="female" /tissue_type="brain temportal cortex" sig_peptide 15..89 CDS 15..935 /codon_start=1 /product="palmitoyl-protein thioesterase" /db_xref="PID:g1160967" /translation="MASPGCLWLLAVALLPWTCASRALQHLDPPAPLPLVIWHGMGDS CCNPLSMGAIKKMVEKKIPGIYVLSLEIGKTLMEDVENSFFLNVNSQVTTVCQALAKD PKLQQGYNAMGFSQGGQFLRAVAQRCPSPPMINLISVGGQHQGVFGLPRCPGESSHIC DFIRKTLNAGAYSKVVQERLVQAEYWHDPIKEDVYRNHSIFLADINQERGINESYKKN LMALKKFVMVKFLNDSIVDPVDSEWFGFYRSGQAKETIPLQETSLYTQDRLGLKEMDN AGQLVFLATEGDHLQLSEEWFYAHIIPFLG" mat_peptide 90..932 /product="palmitoyl-protein thioesterase" BASE COUNT 629 a 507 c 519 g 632 t ORIGIN 1 gtgacacagc gaagatggcg tcgcccggct gcctgtggct cttggctgtg gctctcctgc 61 catggacctg cgcttctcgg gcgctgcagc atctggaccc gccggcgccg ctgccgttgg 121 tgatctggca tgggatggga gacagctgtt gcaatccctt aagcatgggt gctattaaaa 181 aaatggtgga gaagaaaata cctggaattt acgtcttatc tttagagatt gggaagaccc 241 tgatggagga cgtggagaac agcttcttct tgaatgtcaa ttcccaagta acaacagtgt 301 gtcaggcact tgctaaggat cctaaattgc agcaaggcta caatgctatg ggattctccc 361 agggaggcca atttctgagg gcagtggctc agagatgccc ttcacctccc atgatcaatc 421 tgatctcggt tgggggacaa catcaaggtg tttttggact ccctcgatgc ccaggagaga 481 gctctcacat ctgtgacttc atccgaaaaa cactgaatgc tggggcgtac tccaaagttg 541 ttcaggaacg cctcgtgcaa gccgaatact ggcatgaccc cataaaggag gatgtgtatc 601 gcaaccacag catcttcttg gcagatataa atcaggagcg gggtatcaat gagtcctaca 661 agaaaaacct gatggccctg aagaagtttg tgatggtgaa attcctcaat gattccattg 721 tggaccctgt agattcggag tggtttggat tttacagaag tggccaagcc aaggaaacca 781 ttcccttaca ggagacctcc ctgtacacac aggaccgcct ggggctaaag gaaatggaca 841 atgcaggaca gctagtgttt ctggctacag aaggggacca tcttcagttg tctgaagaat 901 ggttttatgc ccacatcata ccattccttg gatgaaaccc gtatagttca caatagagct 961 cagggagccc ctaactcttc caaaccacat gggagacagt ttccttcatg cccaagcctg 1021 agctcagatc cagcttgcaa ctaatccttc tatcatctaa catgccctac ttggaaagat 1081 ctaagatctg aatcttatcc tttgccatct tctgttacca tatggtgttg aatgcaagtt 1141 taattaccat ggagattgtt ttacaaactt ttgatgtggt caagttcagt tttagaaaag 1201 ggagtctgtt ccagatcagg gccagaactg tgcccaggcc caaaggagac aactaactaa 1261 agtagtgaga tagattctaa gggcaaacat ttttccaagt cttgccatat ttcaagcaaa 1321 gaggtgccca ggcctgaggt actcacataa atgctttgtt ttgctggtga tttaaccagt 1381 gcttggaaaa atcttgcttg gctatttctg catcatttct taaggctgcc ttcctctctg 1441 agtacgttgc cctctgtgct atcaatcatc ttatcatcaa ttattagaca aatcccactg 1501 gcctacagtc ttgcttctgc agcacccact ttgtctcctc aggtagtgat gaattagttg 1561 ctgtcacaaa aggagggaag tagcacccaa attaaattgc ttaagagagg aaatgtacat 1621 cttgtataac ttagggagcg aagaaaatgt aggcgcgaaa gtgaaaagtg aggcagctag 1681 ttcttcctat tccattctcg accaacctgc cctttcttaa tatgactagt ggtcttgatg 1741 ctagagtcaa cttactctgt tgctggcttt agcagagaat aggaggaacc atatgaaaaa 1801 gatcaggctt tctgacttcc atccccaaaa cacatttacc agcatactcc aaactgtttc 1861 tgatgtgttc catgagaaaa ggattgtttg ctcaaaaagc ttggaaaata ctacacactc 1921 cctttctcct tctggagatc aacccacatt agagtgtcta aggactcctg agaattcctg 1981 ttacagtaaa caaaactaac gtaatctacc atttcctaca ctatttgagc atggaaatca 2041 tagtccccac tctgtgaaaa cttaacgctt tttggaagac atttctgtag catgtcagtt 2101 tggagaaatg atgagctacg ccttgatgaa agaaccgtgt tggtgctgct aagtttagcc 2161 attatggttt ttcctttctc tctcttaagc cttattcttc aactaaaaga tgaggattaa 2221 gagcaagaag ttggggggga tgtgaaaata attttatgag gttgtctaaa ataaagagta 2281 gtttctt // LOCUS HUMPRA 782 bp mRNA PRI 31-MAY-1994 DEFINITION Human (p23) mRNA, complete cds. ACCESSION L24804 L24805 NID g438651 KEYWORDS progesterone receptor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 782) AUTHORS Johnson,J.L., Beito,T.G., Krco,C.J. and Toft,D.O. TITLE Characterization of a novel 23-kilodalton protein of unactive progesterone receptor complexes JOURNAL Mol. Cell. Biol. 14, 1956-1963 (1994) MEDLINE 94158868 FEATURES Location/Qualifiers source 1..782 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="testis" gene 233..715 /gene="p23" CDS 233..715 /gene="p23" /codon_start=1 /db_xref="PID:g438652" /translation="MQPASAKWYDRRDYVFIEFCVEDSKDVNVNFEKSKLTFSCLGGS DNFKHLNEIDLFHCIDPNDSKHKRTDRSILCCLRKGESGQSWPRLTKERAKLNWLSVD FNNWKDWEDDSDEDMSNFDRFSEMMNNMGGDEDVDLPEVDGADDDSQDSDDEKMPDLE " BASE COUNT 232 a 160 c 197 g 193 t ORIGIN 1 ggattcgggc tacactttcc tcttctcccc gaccggagag ccgctctttc cgcgcggtgc 61 attctggggc ccgaggtcga gcccgccgct gccgccgtcg cctgagggaa gcgagaagag 121 gccgcgaccg agagaaaaag cggagtcgca ccggagagaa gtcgactccc tagcagcagc 181 cgccgccaga gagcccgccc accagttcgc ccgtccccct gccccgttca caatgcagcc 241 tgcttctgca aagtggtacg atcgaaggga ctatgtcttc attgaatttt gtgttgaaga 301 cagtaaggat gttaatgtaa attttgaaaa atccaaactt acattcagtt gtctcggagg 361 aagtgataat tttaagcatt taaatgaaat tgatcttttt cactgtattg atccaaatga 421 ttccaagcat aaaagaacgg acagatcaat tttatgttgt ttacgaaaag gagaatctgg 481 ccagtcatgg ccaaggttaa caaaagaaag ggcaaagctt aattggctta gtgtcgactt 541 caataattgg aaagactggg aagatgattc agatgaagac atgtctaatt ttgatcgttt 601 ctctgagatg atgaacaaca tgggtggtga tgaggatgta gatttaccag aagtagatgg 661 agcagatgat gattcacaag acagtgatga tgaaaaaatg ccagatctgg agtaaggaat 721 attgtcatca cctggatttt gagaaagaaa aataacttct ctgcaagatt tcataattga 781 ga // LOCUS HUMPRECX 3221 bp mRNA PRI 08-JAN-1995 DEFINITION Homo sapiens preC gene, complete cds; ORF X, complete cds. ACCESSION L13994 NID g292401 KEYWORDS . SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3221) AUTHORS Meisel,H., Jantschak,J., Tars,K., Prosch,S., Pushko,P. and Pumpens,P. TITLE Nucleotide sequence of HBV adw variant JOURNAL Unpublished (1993) FEATURES Location/Qualifiers source 1..3221 /organism="Homo sapiens" /note="exctracted from patient with chronic hepatitis" /db_xref="taxon:9606" gene 837..2856 /gene="preS" prim_transcript 837..2856 /gene="preS" CDS 1376..1840 /gene="preS" /note="ORF X" /codon_start=1 /db_xref="PID:g292402" /translation="MGARLYCQLDPSRDVLCLRPVGAESRGRPLSGPLGTLSSPSPSA VPADHGAHLSLRGLPVCAFSSAGPCALRFTSARCMETTVNAHQILPKVLHKRTLGLPA MSTTDLEAYFKDCVFKDWEELGEEIRLKVCVLGGCRHKLVCAPAPCNFFTSA" prim_transcript complement(1625..2309) /gene="preS" /note="P" gene 1816..2460 /gene="preC" CDS 1816..2460 /gene="preC" /codon_start=1 /db_xref="PID:g292403" /translation="MQLFHLCLIISCTCPTVQASKLCLGWLWGMDIDPYKEFGATVEL LSFLPSDFFPSVRDLLDTASALYREALESPEHCSPHHTALRQAILCWGELMTLATWVG NNLEDPASRDLVVNYVNTNMGLKIRQLLWFHISCLTFGRETVLEYLVSFGVWIRTPPA YRPPNAPILSTLPETTVVRRRDRGRSPRRRTPSPRRRRSQSPRRRRSQSRESQC" BASE COUNT 742 a 864 c 713 g 902 t ORIGIN 1 aattccactg ccttccacca agctctgcag gatcccagag tcaggggtct gtattttcct 61 gctggtggct ccagttcagg aacagtaaac cctgctccga atattgcctc tcacatctcg 121 tcaatctccg cgaggactgg ggaccctgtg acgaacatgg agaacatcac atcaggattc 181 ctaggacccc tgctcgtgtt acaggcgggg tttttcttgt tgacaagaat cctcacaata 241 ccgcagagtc tagactcgtg gtggacttct ctcaattttc tagggggatc acccgtgtgt 301 cttggccaaa attcgcagtc cccaacctcc aatcactcac caacctcctg tcctccaatt 361 tgtcctggtt atcgctggat gtgtctgcgg cgttttatca tattcctctt catcctgctg 421 ctatgcctca tcttcttatt ggttcttctg gattatcaag gtatgttgcc cgtttgtcct 481 ctaattccag gatcaacaac aaccagtacg ggaccatgca aaacctgcac gactcctgct 541 caaggcaact ctatgtttcc ctcatgttgc tgtacaaaac ctacggatgg aaattgcacc 601 tgtattccca tcccatcgtc ctgggctttc gcaaaatacc tatgggagtg ggcctcagtc 661 cgtttctctt ggctcagttt actagtgcca tttgttcagt ggttcgtagg gctttccccc 721 actgtttggc tttcagctat atggatgatg tggtattggg ggccaagtct gtacagcatc 781 gtgagtccct ttataccgct gttaccaatt ttcttttgtc tctgggtata catttaaacc 841 ctaacaaaac aaaaagatgg ggttattccc taaacttcat gggttacgta attggaagtt 901 ggggaacatt gccacaggat catattgtac aaaagatcaa acactgtttt agaaaacttc 961 ctgttaacag gcctattgat tggaaagtat gtcaaagaat tgtgggtctt ttgggctttg 1021 ctgctccatt tacacaatgt ggatatcctg ccttaatgcc cttgtatgca tgtatacaag 1081 ctaaacaggc ttttactttc tcgccaactt acaaggcctt tctaagtaaa cagtacatga 1141 acctttaccc cgttgctcgg caacggcctg gtctgtgcca agtgtttgct gacgcaaccc 1201 ccactggctg gggcttggcc ataggccatc agcgcatgcg tggaaccttt gtggctcctc 1261 tgccgatcca tactgcggaa ctcctagccg cttgttttgc tcgcagccgg tctggagcaa 1321 aactcatcgg aactgacaat tctgtcgtcc tctcgcggaa atatacatcg tttccatggg 1381 tgctaggctg tactgccaac tggatccttc gcgggacgtc ctttgtttac gtcccgtcgg 1441 cgctgaatcc cgcggacgac ccctctcggg gccgcttggg actctctcgt ccccttctcc 1501 gtctgccgtt ccagccgacc acggggcgca cctctcttta cgcggtctcc ccgtctgtgc 1561 cttctcatct gccggtccgt gtgcacttcg cttcacctct gcacgttgca tggagaccac 1621 cgtgaacgcc catcagatcc tgcccaaggt cttacataag aggactcttg gactcccagc 1681 aatgtcaacg accgaccttg aggcctactt caaagactgt gtgtttaagg actgggagga 1741 gctgggggag gagattaggt taaaggtctg tgtattagga ggctgtaggc ataaattggt 1801 ctgcgcacca gcaccatgca actttttcac ctctgcctaa tcatctcttg tacatgtccc 1861 actgttcaag cctccaagct gtgccttggg tggctttggg gcatggacat tgacccttat 1921 aaagaatttg gagctactgt ggagttactc tcgtttttgc cttctgactt ctttccttcc 1981 gtaagagatc tcctagacac cgcctcagct ctgtatcgag aagccttaga gtcgcccgag 2041 cattgctcac ctcaccatac tgcactcagg caagccattc tctgctgggg ggaattgatg 2101 actctagcta cctgggtggg taataatttg gaagatccag catccaggga tctagtagtc 2161 aattatgtta atactaacat gggtttaaag atcaggcaac tattgtggtt tcatatatct 2221 tgccttactt ttggaagaga gactgtactt gaatatttgg tctctttcgg agtgtggatt 2281 cgcactcctc cagcctatag accaccaaat gcccctatct tatcaacact tccggaaact 2341 actgttgtta gacgacggga ccgaggcagg tcccctagaa gaagaactcc ctcgcctcgc 2401 agacgcagat ctcaatcgcc gcgtcgcaga agatctcaat ctcgggaatc tcaatgttag 2461 tattccttgg actcataagg tgggaaactt tactgggctt tattcctcta cagtacctgt 2521 ctttaatcct gagtggcaaa gtccttcctt tcctaagatt catttacaag aggacattat 2581 taataggtgt caacaatttg tgggccctct cactgtaaat gaaaagagaa gattgaaatt 2641 aattatgcct gctagattct atcctaccca cactaaatat tttcccttag acaaaggaat 2701 taaaccttat tatccagatc aggtagttaa tcattacttc caaaccagac attatttaca 2761 tactctttgg aaggctggta ttctatataa gagggaaacc acacgtagcg catcattttg 2821 cgggtcacca tattcttggg aacaagagct acagcatggg aggttggtca tcaaaacctc 2881 gcaaaggcat ggggacgaat ctttctgttc ccaaccctct gggattcttt cccgatcatc 2941 agttggaccc tgcattcgga gccaactcaa acaatccaga ttgggacttc aaccccatca 3001 aggaccactg gccagcagcc aaccaggtag gagtgggagc attcgggcca gggctcaccc 3061 ctccacatgg cggtattttg gggtggagcc ctcaggctca gggcatattg accacagtgt 3121 caacaattcc tcctcctgcc tccaccaatc ggcagtcagg aaggcagcct actcccatct 3181 ctccacctct aagagacagt catcctcagg ccatgcagtg g // LOCUS HUMPRKACB 2945 bp mRNA PRI 08-JAN-1995 DEFINITION Human testis-specific cAMP-dependent protein kinase catalytic subunit (C-beta isoform) mRNA, complete cds. ACCESSION M34181 NID g189982 KEYWORDS cAMP dependent; cAMP-dependent protein kinase; cAMP-dependent protein kinase catalytic subunit; cAMP-dependent protein kinase catalytic subunit-beta; protein kinase; subunit. SOURCE clones T124, T175, T31, C-beta-10. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2945) AUTHORS Beebe,S.J., Oyen,O., Sandberg,M., Froysa,A., Hansson,V. and Jahnsen,T. TITLE Molecular cloning of a tissue-specific protein kinase (C gamma) from human testis--representing a third isoform for the catalytic subunit of cAMP-dependent protein kinase JOURNAL Mol. Endocrinol. 4 (3), 465-475 (1990) MEDLINE 90258940 FEATURES Location/Qualifiers source 1..2945 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="testis" /map="1p36.1" gene 48..1103 /gene="PRKACB" CDS 48..1103 /gene="PRKACB" /EC_number="2.7.1.37" /note="C-beta isoform" /codon_start=1 /db_xref="GDB:G00-120-718" /product="cAMP-dependent protein kinase catalytic subunit" /db_xref="PID:g189983" /translation="MGNAATAKKGSEVESVKEFLAKAKEDFLKKWENPTQNNAGLEDF ERKKTLGTGSFGRVMLVKHKATEQYYAMKILDKQKVVKLKQIEHTLNEKRILQAVNFP FLVRLEYAFKDNSNLYMVMEYVPGGEMFSHLRRIGRFSEPHARFYAAQIVLTFEYLHS LDLIYRDLKPENLLIDHQGYIQVTDFGFAKRVKGRTWTLCGTPEYLAPEIILSKGYNK AVDWWALGVLIYEMAAGYPPFFADQPIQIYEKIVSGKVRFPSHFSSDLKDLLRNLLQV DLTKRFGNLKNGVSDIKTHKWFATTDWIAIYQRKVEAPFIPKFRGSGDTSNFDDYEEE DIRVSITEKCAKEFGEF" BASE COUNT 872 a 551 c 568 g 954 t ORIGIN 1 ccagcccccc ttcccttccc tgaccccttc ttgccatcgc cccagacatg gggaacgcgg 61 cgaccgccaa gaaaggcagc gaggtggaga gcgtgaaaga gtttctagcc aaagccaaag 121 aagacttttt gaaaaaatgg gagaatccaa ctcagaataa tgccggactt gaagattttg 181 aaaggaaaaa aacccttgga acaggttcat ttggaagagt catgttggta aaacacaaag 241 ccactgaaca gtattatgcc atgaagatct tagataagca gaaggttgtt aaactgaagc 301 aaatagagca tactttgaat gagaaaagaa tattacaggc agtgaatttt cctttccttg 361 ttcgactgga gtatgctttt aaggataatt ctaatttata catggttatg gaatatgtcc 421 ctgggggtga aatgttttca catctaagaa gaattggaag gttcagtgag ccccatgcac 481 ggttctatgc agctcagata gtgctaacat tcgagtacct ccattcacta gacctcatct 541 acagagatct aaaacctgaa aatctcttaa ttgaccatca aggctatatc caggtcacag 601 actttgggtt tgccaaaaga gttaaaggca gaacttggac attatgtgga actccagagt 661 atttggctcc agaaataatt ctcagcaagg gctacaataa ggcagtggat tggtgggcat 721 taggagtgct aatctatgaa atggcagctg gctatccccc attctttgca gaccaaccaa 781 ttcagattta tgaaaagatt gtttctggaa aggtccgatt cccatcccac ttcagttcag 841 atctcaagga ccttctacgg aacctgctgc aggtggattt gaccaagaga tttggaaatc 901 taaagaatgg tgtcagtgat ataaaaactc acaagtggtt tgccacgaca gattggattg 961 ctatttacca gaggaaggtt gaagctccat tcataccaaa gtttagaggc tctggagata 1021 ccagcaactt tgatgactat gaagaagaag atatccgtgt ctctataaca gaaaaatgtg 1081 caaaagaatt tggtgaattt taaagaggaa caagatgaca tctgagctca cactcagtgt 1141 ttgcactctg ttgagagata aggtagagct gagaccgtcc ttgttgaagc agttacctag 1201 ttccttcatt ccaacgactg agtgaggtct ttattgccat catccgtgtg cgcactctgc 1261 atccacctat gtaacaaggc accgctaagc aagcattgtc tgtgccataa cacagtacta 1321 gaccactttc ttacttctct ttgggttgtc tttctcctct cctacatcca tttcttcctt 1381 ttcaatttca ttggttttct ctaaacagtg ctccatttta ttttgttggt gtttcagatg 1441 ggcagtgtta tggctacgtg atatttgaag ggaaggataa gtgttgcttt cagtagttat 1501 tgccaatatt gttgttggtc aatggcttga agataaactt tctaataatt attatttctt 1561 tgagtagctc agacttggtt ttgccaaaac tcttggtaat ttttgaagat agactgtctt 1621 atcaccaagg aaatttatac aaattaagac taactttctt ggaattcact attctggcaa 1681 taaattttgg tagactaata cagtacagct agacccagaa atttggaagg ctgtagatca 1741 gaggttctag ttccctttcc ctccttttat atcctcctct ccttgagtaa tgaagtgacc 1801 agcctgtgta gtgtgacaaa cgtgtctcat tcagcaggaa aaactaatga tatggatcat 1861 cacccagatt ctctcacttg gtaccagcat ttctgtaggt attagagaag agttctaagt 1921 tttctaaacc ttaactgttc cttaaggatt ttagccagta ttttaataga acatgattaa 1981 tgaaagtgac aaattttaaa ttttctctaa tagtcctcat cataaacttt ttaaaggaaa 2041 ataagcaaac taaaaagaac attggtttag ataaatactt atactttgca aagtcaaaaa 2101 tggcttgatt tttggaaaca atatagaggt attcatattt aaatgagggt ttacatttgt 2161 tttgttttgt aaccgttaaa aagaagttgt ttccagctaa ttattgtggt gtactatatt 2221 tgtgagccta gggtaggggc actgctgcaa cttctgcttt catcccatgc ctcatcaatg 2281 aggaaaggga acaaagtgta taaaacctgc cacaattgta ttttaatttt gaggtatgat 2341 attttcagat atttcataat ttctaacctc tgttctctca gtaaacagaa tgtctgatcg 2401 atcatgcaga tacaatgttg gtatttgaga ggttagtttt tttcctacac ttttttttgc 2461 caactgactt aacaacattg ctgtcaggtg gaaatttcaa gcacttttgc acatttagtt 2521 cagtgtttgt tgagaatcca tggcttaacc cacttgtttt gctatttttt tctttgcttt 2581 taattttccc catctgattt tatctctgcg tttcagtgac ctaccttaaa acaacacacg 2641 agaagagtta aactgggttc attttaatga tcaatttacc tgcatataaa atttattttt 2701 aatcaagctg atcttaatgt atataatcat tctatttgct ttattatcgg tgcaggtagg 2761 tcattaacac cacttctttt catctgtacc acaccctggt gaaacctttg aagacataaa 2821 aaaaacctgt ctgagatgtt ctttctacca atctatatgt ctttcggtta tcaagtgttt 2881 ctgcatggta atgtcatgta aatgctgata ttgatttcac tggtccatct atatttaaaa 2941 cgtgc // LOCUS HUMPRKCI 2196 bp mRNA PRI 08-JAN-1995 DEFINITION Human protein kinase C iota isoform (PRKCI) mRNA, complete cds. ACCESSION L18964 NID g432273 KEYWORDS iota isoform; protein kinase C. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2196) AUTHORS Selbie,L.A., Schmitz-Peiffer,C., Sheng,Y. and Biden,T.J. TITLE Molecular cloning and characterization of PKC iota, an atypical isoform of protein kinase C derived from insulin-secreting cells JOURNAL J. Biol. Chem. 268 (32), 24296-24302 (1993) MEDLINE 94043266 FEATURES Location/Qualifiers source 1..2196 /organism="Homo sapiens" /db_xref="taxon:9606" gene 265..2028 /gene="PRKCI" CDS 265..2028 /gene="PRKCI" /codon_start=1 /product="protein kinase C iota" /db_xref="PID:g432274" /translation="MSHTVAGGGSGDHSHQVRVKAYYRGDIMITHFEPSISFEGLCNE VRDMCSFDNEQLFTMKWIDEEGDPCTVSSQLELEEAFRLYELNKDSELLIHVFPCVPE RPGMPCPGEDKSIYRRGARRWRKLYCANGHTFQAKRFNRRAHCAICTDRIWGLGRQGY KCINCKLLVHKKCHKLVTIECGRHSLPQEPVMPMDQSSMHSDHAQTVIPYNPSSHESL DQVGEEKEAMNTRESGKASSSLGLQDFDLLRVIGRGSYAKVLLVRLKKTDRIYAMKVV KKELVNDDEDIDWVQTEKHVFEQASNHPFLVGLHSCFQTESRLFFVIEYVNGGDLMFH MQRQRKLPEEHARFYSAEISLALNYLHERGIIYRDLKLDNVLLDSEGHIKLTDYGMCK EGLRPGDTTSTFCGTPNYIAPEILRGEDYGFSVDWWALGVLMFEMMAGRSPFDIVGSS DNPDQNTEDYLFQVILEKQIRIPRSLSVKAASVLKSFLNKDPKERLGCHPQTGFADIQ GHPFFRNVDWDMMEQKQVVPPFKPNISGEFGLDNFDSQFTNEPVQLTPDDDDIVRKID QSEFEGFEYINPLLMSAEECV" BASE COUNT 596 a 471 c 552 g 577 t ORIGIN 1 cggggtgtct tgggcccggg cggctgtaga ggcggcggcg cctacgggca gtgggaggag 61 ccgcgcggtt ccggctgctc cggcgaggcg acccttgggt cggcgctgcg ggcaggtggc 121 aggtaggtgg cggacggccg cggttctccg gcaagcgcag gcggcggagt cccccacggc 181 gcccgaagcg cccccccgca cccccggcct ccagcgttga ggcgggggag tgaggagatg 241 ccgacccaga gggacagcag caccatgtcc cacacggtcg caggcggcgg cagcggggac 301 cattcccacc aggtccgggt gaaagcctac taccgcgggg atatcatgat aacacatttt 361 gaaccttcca tctcctttga gggcctttgc aatgaggttc gagacatgtg ttcttttgac 421 aacgaacagc tcttcaccat gaaatggata gatgaggaag gagacccgtg tacagtatca 481 tctcagttgg agttagaaga agcctttaga ctttatgagc taaacaagga ttctgaactc 541 ttgattcatg tgttcccttg tgtaccagaa cgtcctggga tgccttgtcc aggagaagat 601 aaatccatct accgtagagg tgcacgccgc tggagaaagc tttattgtgc caatggccac 661 actttccaag ccaagcgttt caacaggcgt gctcactgtg ccatctgcac agaccgaata 721 tggggacttg gacgccaagg atataagtgc atcaactgca aactcttggt tcataagaag 781 tgccataaac tcgtcacaat tgaatgtggg cggcattctt tgccacagga accagtgatg 841 cccatggatc agtcatccat gcattctgac catgcacaga cagtaattcc atataatcct 901 tcaagtcatg agagtttgga tcaagttggt gaagaaaaag aggcaatgaa caccagggaa 961 agtggcaaag cttcatccag tctaggtctt caggattttg atttgctccg ggtaatagga 1021 agaggaagtt atgccaaagt actgttggtt cgattaaaaa aaacagatcg tatttatgca 1081 atgaaagttg tgaaaaaaga gcttgttaat gatgatgagg atattgattg ggtacagaca 1141 gagaagcatg tgtttgagca ggcatccaat catcctttcc ttgttgggct gcattcttgc 1201 tttcagacag aaagcagatt gttctttgtt atagagtatg taaatggagg agacctaatg 1261 tttcatatgc agcgacaaag aaaacttcct gaagaacatg ccagatttta ctctgcagaa 1321 atcagtctag cattaaatta tcttcatgag cgagggataa tttatagaga tttgaaactg 1381 gacaatgtat tactggactc tgaaggccac attaaactca ctgactacgg catgtgtaag 1441 gaaggattac ggccaggaga tacaaccagc actttctgtg gtactcctaa ttacattgct 1501 cctgaaattt taagaggaga agattatggt ttcagtgttg actggtgggc tcttggagtg 1561 ctcatgtttg agatgatggc aggaaggtct ccatttgata ttgttgggag ctccgataac 1621 cctgaccaga acacagagga ttatctcttc caagttattt tggaaaaaca aattcgcata 1681 ccacgttctc tgtctgtaaa agctgcaagt gttctgaaga gttttcttaa taaggaccct 1741 aaggaacgat tgggttgtca tcctcaaaca ggatttgctg atattcaggg acacccgttc 1801 ttccgaaatg ttgattggga tatgatggag caaaaacagg tggtacctcc ctttaaacca 1861 aatatttctg gggaatttgg tttggacaac tttgattctc agtttactaa tgaacctgtc 1921 cagctcactc cagatgacga tgacattgtg aggaagattg atcagtctga atttgaaggt 1981 tttgagtata tcaatcctct tttgatgtct gcagaagaat gtgtctgatc ctcatttttc 2041 aaccatgtat tctactcatg ttgccattta atgcatggat aaacttgctg caagcctgga 2101 tacaattaac cattttatat ttgccaccta caaaaaaaca cccaatatct tctcttgtag 2161 actatatgaa tcaattatta catctcgacc cggaat // LOCUS HUMPRLR 2723 bp mRNA PRI 08-JAN-1995 DEFINITION Human prolactin (PRL) receptor mRNA, complete cds. ACCESSION M31661 M60727 NID g190361 KEYWORDS prolactin receptor. SOURCE Human hepatoma cell line HepG2, cDNA to mRNA, (library of J.C.Edman). ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2723) AUTHORS Boutin,J.M., Edery,M., Shirota,M., Jolicoeur,C., Lesueur,L., Ali,S., Gould,D., Djiane,J. and Kelly,P.A. TITLE Identification of a cDNA encoding a long form of prolactin receptor in human hepatoma and breast cancer cells JOURNAL Mol. Endocrinol. 3 (9), 1455-1461 (1989) MEDLINE 90114212 FEATURES Location/Qualifiers source 1..2723 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Hep G2" /tissue_type="hepatoma" /map="5p14-p13" sig_peptide 285..356 /gene="PRL receptor" /product="prolactin receptor" CDS 285..2153 /gene="PRLR" /codon_start=1 /db_xref="GDB:G00-120-315" /product="prolactin receptor" /db_xref="PID:g190362" /translation="MKENVASATVFTLLLFLNTCLLNGQLPPGKPEIFKCRSPNKETF TCWWRPGTDGGLPTNYSLTYHREGETLMHECPDYITGGPNSCHFGKQYTSMWRTYIMM VNATNQMGSSFSDELYVDVTYIVQPDPPLELAVEVKQPEDRKPYLWIKWSPPTLIDLK TGWFTLLYEIRLKPEKAAEWEIHFAGQQTEFKILSLHPGQKYLVQVRCKPDHGYWSAW SPATFIQIPSDFTMNDTTVWISVAVLSAVICLIIVWAVALKGYSMVTCIFPPVPGPKI KGFDAHLLEKGKSEELLSALGCQDFPPTSDYEDLLVEYLEVDDSEDQHLMSVHSKEHP SQGMKPTYLDPDTDSGRGSCDSPSLLSEKCEEPQANPSTFYDPEVIEKPENPETTHTW DPQCISMEGKIPYFHAGGSKCSTWPLPQPSQHNPRSSYHNITDVCELAVGPAGAPATL LNEAGKDALKSSQTIKSREEGKATQQREVESFHSETDQDTPWLLPQEKTPFGSAKPLD YVEIHKVNKDGALSLLPKQRENSGKPKKPGTPENNKEYAKVSGVMDNNILVLVPDPHA KNVACFEESAKEAPPSLEQNQAEKALANFTATSSKCRLQLGGLDYLDPACFTHSFH" gene 285..2150 /gene="PRL receptor" gene 285..2153 /gene="PRLR" mat_peptide 357..2150 /gene="PRL receptor" /product="prolactin receptor" BASE COUNT 785 a 655 c 593 g 690 t ORIGIN 1 ggaggctgaa atccccagac gccggttttc tgggctgggc tttctgctta ctcactcctt 61 ctccctcttt ctggatttta ccgaccgttc gcgaaacagc tttccacaca atggagcttc 121 atgtcctcgt gcaggaagta ctcatcgact gatgtggcag actttgctcc ctgacaaaac 181 taaagaactc tcctattcat ggaggcgaac actgaggatg ctttccacat gaaccctgaa 241 gtgaacttct gatacatttc ctgcagcaag agaaggcagc caacatgaag gaaaatgtgg 301 catctgcaac cgttttcact ctgctacttt ttctcaacac ctgccttctg aatggacagt 361 tacctcctgg aaaacctgag atctttaaat gtcgttctcc caataaggaa acattcacct 421 gctggtggag gcctgggaca gatggaggac ttcctaccaa ttattcactg acttaccaca 481 gggaaggaga gacactcatg catgaatgtc cagactacat aaccggtggc cccaactcct 541 gccactttgg caagcagtac acctccatgt ggaggacata catcatgatg gtcaatgcca 601 ctaaccagat gggaagcagt ttctcggatg aactttatgt ggacgtgact tacatagttc 661 agccagaccc tcctttggag ctggctgtgg aagtaaaaca gccagaagac agaaaaccct 721 acctgtggat taaatggtct ccacctaccc tgattgactt aaaaactggt tggttcacgc 781 tcctgtatga aattcgatta aaacccgaga aagcagctga gtgggagatc cattttgctg 841 ggcagcaaac agagtttaag attctcagcc tacatccagg acagaaatac cttgtccagg 901 ttcgctgcaa accagaccat ggatactgga gtgcatggag tccagcgacc ttcattcaga 961 tacctagtga cttcaccatg aatgatacaa ccgtgtggat ctctgtggct gtcctttctg 1021 ctgtcatctg tttgattatt gtctgggcag tggctttgaa gggctatagc atggtgacct 1081 gcatctttcc gccagttcct gggccaaaaa taaaaggatt tgatgctcat ctgttggaga 1141 agggcaagtc tgaagaacta ctgagtgcct tgggatgcca agactttcct cccacttctg 1201 actatgagga cttgctggtg gagtatttag aagtagatga tagtgaggac cagcatctaa 1261 tgtcagtcca ttcaaaagaa cacccaagtc aaggtatgaa acccacatac ctggatcctg 1321 acactgactc aggccggggg agctgtgaca gcccttccct tttgtctgaa aagtgtgagg 1381 aaccccaggc caatccctcc acattctatg atcctgaggt cattgagaag ccagagaatc 1441 ctgaaacaac ccacacctgg gacccccagt gcataagcat ggaaggcaaa atcccctatt 1501 ttcatgctgg tggatccaaa tgttcaacat ggcccttacc acagcccagc cagcacaacc 1561 ccagatcctc ttaccacaat attactgatg tgtgtgagct ggctgtgggc cctgcaggtg 1621 caccggccac tctgttgaat gaagcaggta aagatgcttt aaaatcctct caaaccatta 1681 agtctagaga agagggaaag gcaacccagc agagggaggt agaaagcttc cattctgaga 1741 ctgaccagga tacgccctgg ctgctgcccc aggagaaaac cccctttggc tccgctaaac 1801 ccttggatta tgtggagatt cacaaggtca acaaagatgg tgcattatca ttgctaccaa 1861 aacagagaga gaacagcggc aagcccaaga agcccgggac tcctgagaac aataaggagt 1921 atgccaaggt gtccggggtc atggataaca acatcctggt gttggtgcca gatccacatg 1981 ctaaaaacgt ggcttgcttt gaagaatcag ccaaagaggc cccaccatca cttgaacaga 2041 atcaagctga gaaagccctg gccaacttca ctgcaacatc aagcaagtgc aggctccagc 2101 tgggtggttt ggattacctg gatcccgcat gttttacaca ctcctttcac tgatagcttg 2161 actaatggaa tgattggtta aaatgtgatt tttcttcagg taacactaca gagtacgtga 2221 aatgctcaag aatgtagtca gactgacact actaaagctc ccagctcctt tcatgctcca 2281 tttttaacca cttgcctctt tctccagcag ctgattccag aacaaatcat tatgtttcct 2341 aactgtgatt tgtagattta ctttttgctg ttagttataa aactatgtgt tcaatgaaat 2401 aaaagcacac tgcttagtat tcttgaggga caatgccaat aggtatatcc tctggaaaag 2461 gctttcatga tttggcatgg gacagacgga aatgaaattg tcaaaattgt ttaccataga 2521 aagatgacaa aagaaaattt tccacatagg aaaatgccat gaaaattgct tttgaaaaac 2581 aactgcataa cctttacact cctcgtccat tttattagga ttacccaaat ataaccattt 2641 aaagaaagaa tgcattccag aacaaattgt ttacataagt tcctatacct tactgacaca 2701 ttgctgatat gcaagtaaga aat // LOCUS HUMPRLTS 1502 bp mRNA PRI 17-SEP-1996 DEFINITION Human mRNA for PDGF receptor beta-like tumor suppressor (PRLTS), complete cds. ACCESSION D37965 NID g807818 KEYWORDS PDGF receptor beta-like tumor suppressor; PRLTS. SOURCE Homo sapiens (library: fetal lung) cDNA to mRNA, clone PRLTS. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Fujiwara,Y., Ohata,H., Kuroki,T., Koyama,K., Tsuchiya,E., Monden,M. and Nakamura,Y. TITLE Isolation of a candidate tumor suppressor gene on chromosome 8p21.3-p22 that is homologous to an extracellular domain of the PDGF receptor beta gene JOURNAL Oncogene 10 (5), 891-895 (1995) MEDLINE 95206781 REFERENCE 2 (bases 1 to 1502) AUTHORS Nakamura,Y. JOURNAL Unpublished (1994) REFERENCE 3 (bases 1 to 1502) AUTHORS Nakamura,Y. TITLE Direct Submission JOURNAL Submitted (16-AUG-1994) to the DDBJ/EMBL/GenBank databases. Yusuke Nakamura, Cancer Institute, Department of Biochemistry; 1-37-1 Kami-Ikebukuro, Toshima-ku, Tokyo 170, Japan (E-mail:nakamura@ganvx1.jfcr.or.jp, Tel:03-3918-0111(ex.4501), Fax:03-3918-0342) FEATURES Location/Qualifiers source 1..1502 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="8" /clone_lib="fetal lung" /map="8p21.3-p22" 5'UTR 1..61 exon 1..52 /number=1 exon 53..116 /number=2 CDS 62..1189 /codon_start=1 /product="PDGF receptor beta-like tumor suppressor" /db_xref="PID:d1007756" /db_xref="PID:g807819" /translation="MKVWLLLGLLLVHEALEDVTGQHLPKNKRPKEPGENRIKPTNKK VKPKIPKMKDRDSANSAPKTQSIMMQVLDKGRFQKPAATLSLLAGQTVELRCKGSRIG WSYPAYLDTFKDSRLSVKQNERYGQLTLVNSTSADTGEFSCWVQLCSGYICRKDEAKT GSTYIFFTEKGELFVPSPSYFDVVYLNPDRQAVVPCRVTVLSAKVTLHREFPAKEIPA NGTDIVYDMKRGFVYLQPHSEHQGVVYCRAEAGGRSQISVKYQLLYVAVPSGPPSTTI LASSNKVKSGDDISVLCTVLGEPDVEVEFTWIFPGQKDERPVTIQDTWRLIHRGLGHT TRISQSVITVEDFETIDAGYYICTAQNLQGQTTVATTVEFS" exon 117..414 /number=3 exon 415..566 /number=4 exon 567..860 /number=5 exon 861..1000 /number=6 exon 1001..1502 /number=7 3'UTR 1190..1502 BASE COUNT 392 a 383 c 379 g 348 t ORIGIN 1 cctgcgtccc cgccccgcgc agccgccgcg ctcctgcgct ccgaggtccg aggttcccga 61 gatgaaggtc tggctgctgc ttggtcttct gctggtgcac gaagcgctgg aggatgttac 121 tggccaacac cttcccaaga acaagcgtcc aaaagaacca ggagagaata gaatcaaacc 181 taccaacaag aaggtgaagc ccaaaattcc taaaatgaag gacagggact cagccaattc 241 agcaccaaag acgcagtcta tcatgatgca agtgctggat aaaggtcgct tccagaaacc 301 cgccgctacc ctgagtctgc tggcggggca aactgtagag cttcgatgta aagggagtag 361 aattgggtgg agctaccctg cgtatctgga cacctttaag gattctcgcc tcagcgtcaa 421 gcagaatgag cgctacggcc agttgactct ggtcaactcc acctcggcag acacaggtga 481 attcagctgc tgggtgcagc tctgcagcgg ctacatctgc aggaaggacg aggccaaaac 541 gggctccacc tacatctttt ttacagagaa aggagaactc tttgtacctt ctcccagcta 601 cttcgatgtt gtctacttga acccggacag acaggctgtg gttccttgtc gggtgaccgt 661 gctgtcggcc aaagtcacgc tccacaggga attcccagcc aaggagatcc cagccaatgg 721 aacggacatt gtttatgaca tgaagcgggg ctttgtgtat ctgcaacctc attccgagca 781 ccagggtgtg gtttactgca gggcggaggc cgggggcaga tctcagatct ccgtcaagta 841 ccagctgctc tacgtggcgg ttcccagtgg ccctccctca acaaccatct tggcttcttc 901 aaacaaagtg aaaagtgggg acgacatcag tgtgctctgc actgtcctgg gggagcccga 961 tgtggaggtg gagttcacct ggatcttccc agggcagaag gatgaaaggc ctgtgacgat 1021 ccaagacact tggaggttga tccacagagg actgggacac accacgagaa tctcccagag 1081 tgtcattaca gtggaagact tcgagacgat tgatgcagga tattacattt gcactgctca 1141 gaatcttcaa ggacagacca cagtagctac cactgttgag ttttcctgac ttggaaaagg 1201 aaatgtaatg aacttatgga aagcccattt gtgtacacag tcagctttgg ggttcctttt 1261 attagtgctt tgccagaggc tgatgtcaag caccacaccc caaccccagc gtctcgtgag 1321 tccgacccag acatccaaac taaaaggaag tcatccagtc tattcacaga agtgttaact 1381 tttctaacag aaagcatgat tttgattgct tacctacata cgtgttccta gtttttatac 1441 atgtgtaaac aattttatat aatcaatcat ttctattaaa tgagcacgtt tttgtaaaaa 1501 at // LOCUS HUMPROAF 1455 bp DNA PRI 23-JUL-1992 DEFINITION Human prothymosin-alpha pseudogene, complete sequence. ACCESSION J04801 NID g190371 KEYWORDS prothymosin-alpha; pseudogene. SOURCE Human lymphocyte cell line RPMI 8226 DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1455) AUTHORS Eschenfeldt,W.H., Manrow,R.E., Krug,M.S. and Berger,S.L. TITLE Isolation and partial sequencing of the human prothymosin alpha gene family: Evidence against export of the gene products JOURNAL J. Biol. Chem. 264, 7546-7555 (1989) MEDLINE 89214202 FEATURES Location/Qualifiers source 1..1455 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="RPMI 8226" /cell_type="lymphocyte" TATA_signal 31..36 repeat_unit 172..187 /rpt_type=direct CDS 352..705 /note="open reading frame A" /codon_start=1 /db_xref="PID:g190372" /translation="MSDAAVDTSSEITTEDLKEKKEVVEEAENGRDAPAHGNANEENG EPDDNEVDEEEEEGGEEEEEEEGDGEEEDGDEDEGAESATGKRAAEDDEDNDVDTQKQ KTDEDDQTAKKEKLN" polyA_signal 1331..1336 repeat_unit 1391..1406 /rpt_type=direct BASE COUNT 451 a 299 c 341 g 364 t ORIGIN 1 caacacgctg cccttcacag gtaagtgtaa tatatagaga aaaatttcat gttaagtgac 61 ttgccagagt atgcagttgt ggaatgtctt ttcgggggca tgcttttctc tgagagagag 121 agagaaaaaa aaaacaactc ttaccctcct ccaaatctaa ggacatttga tgaaaaacat 181 tgtcctgccc cactggctgc tctgaaaagc cgtctttgca ttgtgcgtcg tcagcctcct 241 tgctcgccgc agccgcctcg ccgccgcgga ctccggcagc tttatcgcca gagtccctga 301 actctcgctt tcttttttat cccctgcatc gcgtcaccgg cgtgccccac catgtcagac 361 gcagccgtag acaccagctc cgaaatcacc accgaggact taaaggagaa gaaggaagtt 421 gtggaagagg cggaaaatgg aagagacgcc cctgctcacg ggaatgctaa tgaggaaaat 481 ggggagccgg atgacaacga ggtagatgaa gaagaggaag aaggtgggga ggaagaggag 541 gaggaagaag gtgatggtga ggaagaggac ggagatgaag atgagggagc tgagtcagct 601 acgggcaagc gggcagctga agatgatgag gataacgatg tcgataccca gaagcagaag 661 accgacgagg atgaccagac agcaaaaaag gaaaagttaa actaaaaaaa aaaggccgcc 721 gtgacctatt caccctccac ttcccgtctc agaatctaaa cgtggtcacc ttcgagtaga 781 ggggcccgcc cgcccaccgt gggcagtgcc acccgcagat gacacgcgct ctccaccacc 841 caacccaaac catgagaatt tgcaacaggg gagggaaaaa gaaccaaaac ttccaaggcc 901 cgcttttttt ttttcttaaa agtactttaa aaaggaaact tgtatttttt atttacattt 961 tatatttttg tacatattgt tagggtcggc catttttaat gatctcggat gaccaaacca 1021 gccttcggag cgttctctgt cctacttctc actttacttg tggtgtggcc atgttcatta 1081 taatctcaaa ggagaaaaaa aaaacttgta aaaaatgcaa aaatgacaac agaaaaacca 1141 tcttattccg agcattccag taactttttt gtgtatgtac ttagctgtac tataagtagt 1201 tggtttgtat gagatggtta aaaaggccaa agataaaagg tttctttttt ttcctttttt 1261 gtctatgaag ttgctgttta tttttatttt ttggcctgtt tgatgtatgt gtgaaacaat 1321 gttgtccaac aataaacagg aattttattt tgctgagttg ttctagcaaa aaaaaaaaag 1381 aaaaaaaaaa gaaaaacatt gttctgatga aaatcacttg gaatgagcct cttagggaaa 1441 taaatagaaa gtact // LOCUS HUMPROF 793 bp mRNA PRI 15-DEC-1988 DEFINITION Human profilin mRNA, complete cds. ACCESSION J03191 NID g190385 KEYWORDS actin-monomer-binding protein; profilin. SOURCE Human HepG2 cell line, cDNA to mRNA, clone 4A3. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 793) AUTHORS Kwiatkowski,D.J. and Bruns,G.A.P. TITLE Human profilin: Molecular cloning sequence comparison, and chromosomal analysis JOURNAL J. Biol. Chem. 263, 5910-5915 (1988) MEDLINE 88186915 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by D.Kwiatkowski, 23-JAN-1988. FEATURES Location/Qualifiers source 1..793 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..793 /note="profilin mRNA" CDS 128..550 /note="profilin" /codon_start=1 /db_xref="PID:g190386" /translation="MAGWNAYIDNLMADGTCQDAAIVGYKDSPSVWAAVPGKTFVNIT PAEVGVLVGKDRSSFYVNGLTLGGQKCSVIRDSLLQDGEFSMDLRTKSTGGAPTFNVT VTKTDKTLVLLMGKEGVHGGLINKKCYEMASHLRRSQY" BASE COUNT 160 a 254 c 209 g 170 t ORIGIN Unreported. 1 ggagccgcgg tccggacggc agcgcgtgcc ccgagctctc cgcctccccc cgcccgccag 61 ccgaggcagc tcgagcccag tccgcggccc cagcagcagc gccgagagca gccccagtag 121 cagcgccatg gccgggtgga acgcctacat cgacaacctc atggcggacg ggacctgtca 181 ggacgcggcc atcgtgggct acaaggactc gccctccgtc tgggccgccg tccccgggaa 241 aacgttcgtc aacatcacgc cagctgaggt gggtgtcctg gttggcaaag accggtcaag 301 tttttacgtg aatgggctga cacttggggg ccagaaatgt tcggtgatcc gggactcact 361 gctgcaggat ggggaattta gcatggatct tcgtaccaag agcaccggtg gggcccccac 421 cttcaatgtc actgtcacca agactgacaa gacgctagtc ctgctgatgg gcaaagaagg 481 tgtccacggt ggtttgatca acaagaaatg ttatgaaatg gcctcccacc ttcggcgttc 541 ccagtactga cctcgtctgt cccttcccct tcaccgctcc ccacagcttt gcaccccttt 601 cctccccata cacacacaaa ccattttatt ttttgggcca ttaccccata ccccttattg 661 ctgccaaaac cacatgggct gggggccagg gctggatgga cagacacctc cccctaccca 721 tatccctccc gtgtgtggtt ggaaaacttt tgttttttgg ggtttttttt ttctgaataa 781 aaaagattct act // LOCUS HUMPROFII 1693 bp mRNA PRI 14-OCT-1993 DEFINITION Human profilin II mRNA, complete cds. ACCESSION L10678 NID g190387 KEYWORDS profilin; profilin II. SOURCE Homo sapiens (library: Lambda ZAPII) adult epithelial cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1693) AUTHORS Honore,B., Madsen,P.S., Andersen,A.H. and Leffers,H. TITLE Cloning and expression of a novel human profilin variant, profilin II JOURNAL FEBS Lett. 330, 151-155 (1993) MEDLINE 93374053 FEATURES Location/Qualifiers source 1..1693 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="AMA" /cell_type="amnion" /dev_stage="adult" /tissue_type="epithelial" /tissue_lib="Lambda ZAPII" CDS 14..436 /codon_start=1 /product="profilin II" /db_xref="PID:g190388" /translation="MAGWQSYVDNLMCDGCCQEAAIVGYCDAKYVWAATAGGVFQSIT PIEIDMIVGKDREGFFTNGLTLGAKKCSVIRDSLYVDGDCTMDIRTKSQGGEPTYNVA VGRAGRALVIVMGKEGVHGGTLNKKAYELALYLRRSDV" polyA_signal 1675..1681 polyA_site 1693 BASE COUNT 449 a 343 c 355 g 546 t ORIGIN 1 gaagggctcg aagatggccg gttggcagag ctacgtggat aacctgatgt gcgatggctg 61 ctgccaggag gccgccattg tcggctactg cgacgccaaa tacgtctggg cagccacggc 121 cgggggcgtc tttcagagca ttacgccaat agaaatagat atgattgtag gaaaagaccg 181 ggaaggtttc tttaccaacg gtttgactct tggcgcgaag aaatgctcag tgatcagaga 241 tagtctatac gtcgatggtg actgcacaat ggacatccgg acaaagagtc aaggtgggga 301 gccaacatac aatgtggctg tcggcagagc tggtagagca ttggttatag tcatgggaaa 361 ggaaggtgtc cacggaggca cacttaacaa gaaagcatat gaactcgctt tatacctgag 421 gaggtctgat gtgtaagcag cctctcccca tctacctagc aactgtcttc atcaacaacc 481 ctaattatgg tcacaatgct accaaactgt agatggtagc taatttttct ttacctattt 541 tctaatgtca tgattcctgt ttgcccaatg gatcatttgt atgttaacca ctgtatgtaa 601 ccaaccctta tctggcaaca taattgcagc acaataatga tttgcatgat accttgaaat 661 tggggggagg gggcatgcca agttgggcat cactttgtct tagcaattaa tgggatattg 721 attactaaaa taagttaata ttaagcaagg tgccggttgt acaatctctg atcagtgtct 781 tttcagcact ttgagcattt acttggctca tttagtcttc cttttgtagc gcatggttgg 841 gaggaaaaag tgcatgcatc attccttcac tcttctcttt ttcccgcccc cccctccctt 901 cgcacatagg catttggttt gcttccatct ttttttatgc agtgcctgtt tttttttaac 961 caattaaaat cccttttgtt gatgagctat tgagagctgc agtagtttgc ttttagtatt 1021 gttgttgcac ttgagcagag acaaaccttt attcatagtg tctacaggac atatgaagag 1081 tgcaatggca aaacaagagc aaaaagcact tcctcccatg accttacagt aaccatactg 1141 attgaatccc cagggacatt ccatcattgc aatagctcag atttttcttc ctttttcttt 1201 gcacaccagc tctactcttt agtaaaattg taaaaggctg ccattatgga cattaggtat 1261 cccaacataa ccatctggag tgtgtccagt ttgttcttca taggaccaat ttttatttgc 1321 agcttgagtt tttatatgaa gttgcattat tgtggacttg gctgtccttg aatttttttc 1381 atatgtattc tgtgccatac tattgttaaa atgaactgtt gctattgtga gatggatttt 1441 aactgaccta ttaagggttt ctttcgaatg gcactacttt agggacattc tagtatttgc 1501 ttctattgtt tgggccttgt ggataatgta cagatttaaa aacaaatctt gttgctgatt 1561 tgtccatttc tttccctgca ctttgttaca tctgggatac agtctaactc atctgattta 1621 atatgcattt aaaaaaatgc cataactatt aaacaccttg tttacagaca gatgaaataa 1681 atttattcca acc // LOCUS HUMPROP2AA 2131 bp mRNA PRI 09-MAY-1991 DEFINITION Human protein phosphatase 2A alpha subunit mRNA, complete cds. ACCESSION M64929 J05328 NID g190421 KEYWORDS protein phosphatase-2A subunit-alpha; regulatory subunit. SOURCE Human lung fibroblast cell line WI38, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2131) AUTHORS Mayer,R.E., Hendrix,P., Cron,P., Matthies,R., Stone,S.R., Goris,J., Merlevede,W., Hofsteenge,J. and Hemmings,B.A. TITLE Structure of the 55 kDa regulatory subunit of protein phosphatase 2A: Evidence for a neuronal specific isoform JOURNAL Biochemistry 30, 3589-3597 (1991) MEDLINE 91198016 FEATURES Location/Qualifiers source 1..2131 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="WI38" /cell_type="fibroblast" /tissue_type="lung" 5'UTR 1..105 CDS 106..1449 /EC_number="3.1.3.16" /note="55 kDa regulatory subunit" /codon_start=1 /product="protein phosphatase-2A subunit-alpha" /db_xref="PID:g190422" /translation="MAGAGGGNDIQWCFSQVKGAVDDDVAEADIISTVEFNHSGELLA TGDKGGRVVIFQQEQENKIQSHSRGEYNVYSTFQSHEPEFDYLKSLEIEEKINKIRWL PQKNAAQFLLSTNDKTIKLWKISERDKRPEGYNLKEEDGRYRDPTTVTTLRVPVFRPM DLMVEASPRRIFANAHTYHINSISINSDYETYLSADDLRINLWHLEITDRSFNIVDIK PANMEELTEVITAAEFHPNSCNTFVYSSSKGTIRLCDMRASALCDRHSKLFEEPEDPS NRSFFSEIISSISDVKFSHSGRYMMTRDYLSVKIWDLNMENRPVETYQVHEYLRSKLC SLYENDCIFDKFECCWNGSDSVVMTGSYNNFFRMFDRNTKRDITLEASRENNKPRTVL KPRKVCASGKRKKDEISVDSLDFNKKILHTAWHPKENIIAVATTNNLYIFQDKVN" 3'UTR 1450..2131 BASE COUNT 665 a 421 c 460 g 585 t ORIGIN 1 ccgccgccat ccgccctctc taccccccca tccccaggtg aggggggtga gttcaggaag 61 cggagacccc gaggaaccca gcagggtcac catttgcagc gcaacatggc aggagctgga 121 ggagggaatg atattcagtg gtgtttttct caggtgaaag gagcagtaga tgatgatgta 181 gcagaagcag atataatttc tacagtagaa tttaatcatt ctggagaatt actagcaaca 241 ggagataaag gtggtagagt tgtcatcttt caacaggagc aggagaacaa aatccagtct 301 catagcagag gagaatataa tgtttacagc accttccaga gccatgaacc agagtttgac 361 tacttgaaaa gtttagaaat agaagaaaag atcaataaaa ttaggtggtt accccagaaa 421 aatgctgctc agtttttatt gtctaccaat gataaaacaa taaaattatg gaaaatcagt 481 gaaagggaca aaagaccaga agggtataac ttgaaagagg aggatggaag gtatagagat 541 cctactacag ttactacact acgagtgcca gtctttaggc ctatggatct aatggttgag 601 gccagtccac gaagaatatt tgccaatgct catacatatc acatcaactc aatttctatt 661 aatagtgatt atgaaacata tttatctgca gatgatttgc ggattaatct ttggcatctg 721 gaaattacag acaggagttt taacattgtg gatatcaagc ctgccaatat ggaagagcta 781 acagaggtga ttacagcagc agaatttcat ccaaacagct gtaacacatt tgtatacagc 841 agcagtaaag gaactattcg gctatgtgac atgagggcat ctgccctctg tgatagacat 901 tctaaattgt ttgaagaacc tgaagatccc agtaacaggt catttttttc cgaaatcatc 961 tcctctattt cggatgtaaa attcagccat agtggtcgat atatgatgac tagagactat 1021 ttgtcagtca aaatttggga cttaaatatg gaaaacaggc ctgtggaaac ataccaggtg 1081 catgaatacc tcagaagtaa actctgttca ctgtatgaaa atgactgcat atttgacaaa 1141 tttgaatgtt gttggaatgg atctgacagt gttgtcatga ctggatctta caataatttc 1201 ttcagaatgt ttgacagaaa cacaaagcga gacataaccc tagaagcatc gcgggaaaac 1261 aataagcctc gcacagttct gaagcctcgc aaagtctgtg caagtggcaa gcgaaagaaa 1321 gatgaaataa gtgttgacag cctagacttc aataagaaaa tccttcacac agcctggcac 1381 cccaaggaaa atatcattgc cgtagctact acaaacaatc tgtatatatt tcaagacaaa 1441 gtgaattagg gttggcattc ctagcagaag aacccacttc ctgcttagtt gagatagttg 1501 aatctagcat tcgttcctat aaaagagaga ggtccattgt ggcgcccctt tccagtgttt 1561 gacagtgtgc cattcgacaa cacattgtta tagctacatg gagaaagctc tgtggattca 1621 tcactgtggt gttctccatg tctgctagcc atttaggtaa gggtagggca cttttaattt 1681 aaatgacttc ttgcaccatc ttgcctaatg gactagattg gactgtatca acattgattt 1741 actccacttt ttatgccttc cattgtgatg acgtcaaaca cagtgaaagc cttcagtcat 1801 gctatgggat ttaattgtgt atcctcatta ctgtatcatt tgtggggtac accccttccc 1861 ccttttttta aattaaatac agctcattct tactgtggct tgtagcattc ctcctcttct 1921 ggcctcctgg actgctcccc ttcatctctt acccttgccc cctccacccg gtcttggtgg 1981 tggtatatta aaaaaagaaa gaatgaaagc acacaaaatg agtcagtttg gggtcagtgg 2041 tataaagggg gtatatgttg caaacaaatg ttttagtaac agttggctgt aatcactcct 2101 cgccgtgtct ggcactgaaa ataaggaaaa g // LOCUS HUMPROP2AB 3441 bp mRNA PRI 09-MAY-1991 DEFINITION Human protein phosphatase 2A beta subunit mRNA, complete cds. ACCESSION M64930 J05328 NID g190423 KEYWORDS protein phosphatase-2A subunit-beta; regulatory subunit. SOURCE Human fetal brain, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3441) AUTHORS Mayer,R.E., Hendrix,P., Cron,P., Matthies,R., Stone,S.R., Goris,J., Merlevede,W., Hofsteenge,J. and Hemmings,B.A. TITLE Structure of the 55 kDa regulatory subunit of protein phosphatase 2A: Evidence for a neuronal specific isoform JOURNAL Biochemistry 30, 3589-3597 (1991) MEDLINE 91198016 FEATURES Location/Qualifiers source 1..3441 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="brain" 5'UTR 1..525 CDS 411..452 /note="upstream ORF 1" /codon_start=1 /db_xref="PID:g190424" /translation="MVQSHWLPCPPLL" CDS 474..542 /note="upstream ORF 2" /codon_start=1 /db_xref="PID:g190425" /translation="MDTCLPASGSHASKPAVNGGGH" CDS 526..1857 /EC_number="3.1.3.16" /note="55 kDa regulatory subunit" /codon_start=1 /product="protein phosphatase-2A subunit-beta" /db_xref="PID:g190426" /translation="MEEDIDTRKINNSFLRDHSYATEADIISTVEFNHTGELLATGDK GGRVVIFQREQESKNQVHRRGEYNVYSTFQSHEPEFDYLKSLEIEEKINKIRWLPQQN AAYFLLSTNDKTVKLWKVSERDKRPEGYNLKDEEGRLRDPATITTLRVPVLRPMDLMV EATPRRVFANAHTYHINSISVNSDYETYMSADDLRINLWNFEITNQSFNIVDIKPANM EELTEVITAAEFHPHHCNTFVYSSSKGTIRLCDMRASALCDRHTKFFEEPEDPSNRSF FSEIISSISDVKFSHSGRYIMTRDYLTVKVWDLNMENRPIETYQVHDYLRSKLCSLYE NDCIFDKFECVWNGSDSVIMTGSYNNFFRMFDRNTKRDVTLEASRENSKPRAILKPRK VCVGGKRRKDEISVDSLDFSKKILHTAWHPSENIIAVAATNNLYIFQDKVN" 3'UTR 1858..3441 BASE COUNT 943 a 800 c 803 g 895 t ORIGIN 1 ggccaggcaa gcctgaatcc tgtccctgcc atctcgccac tgcagctcgg gtccagaaag 61 gcaccatttt gtcgcggctg cccgctctcc cagggggagg agggatcttt tttgcatttt 121 ggagcggctg ccaaggaggg gaacctgttg ggcatctccc cagacccgct tgtgagcgcc 181 tccggggcgg gcgggcggga ccagacccct cggggcacgg cgtatcttgg cacccggagg 241 cagcggaggc aggcgcagca tcctcgctgg gaactggagc tggagtgagc gcaccgcgcg 301 ggaggagccg ccgcagcctc gcagaacccg agtggaggag gtgacagctc cattgccggg 361 tttttatttt ttttctctcc gcctccccgt ctcctcctca ggctcggacc atggtgcagt 421 cccactggct cccctgcccc cctctcctgt gagactggct gcggggaggg atcatggata 481 cttgtctgcc ggcttctggt tcccacgcaa gtaagcctgc tgtcaatgga ggaggacatt 541 gatacccgca aaatcaacaa cagtttcctg cgcgaccaca gctatgcgac cgaagctgac 601 attatctcta cggtagaatt caaccacacg ggagaattac tagcgacagg ggacaagggg 661 ggtcgggttg taatatttca acgagagcag gagagtaaaa atcaggttca tcgtaggggt 721 gaatacaatg tttacagcac attccagagc catgaacccg agttcgatta cctgaagagt 781 ttagaaatag aagaaaaaat caataaaata agatggctcc cccagcagaa tgcagcttac 841 tttcttctgt ctactaatga taaaactgtg aagctgtgga aagtcagcga gcgtgataag 901 aggccagaag gctacaatct gaaagatgag gagggccggc tccgggatcc tgccaccatc 961 acaaccctgc gggtgcctgt cctgagaccc atggacctga tggtggaggc caccccacga 1021 agagtatttg ccaacgcaca cacatatcac atcaactcca tatctgtcaa cagcgactat 1081 gaaacctaca tgtccgctga tgacctgagg attaacctat ggaactttga aataaccaat 1141 caaagtttta atattgtgga cattaagcca gccaacatgg aggagctcac ggaggtgatc 1201 acagcagccg agttccaccc ccatcattgc aacaccttcg tgtacagcag cagcaaaggg 1261 acaatccggc tgtgtgacat gcgggcatct gccctgtgtg acaggcacac caaatttttt 1321 gaagagccgg aagatccaag caacagatca tttttctctg aaattatctc ttcgatttcg 1381 gatgtgaagt tcagccacag tgggaggtat atcatgacca gggactactt gaccgtcaaa 1441 gtctgggatc tcaacatgga aaaccgcccc atcgagactt accaggttca tgactacctc 1501 cgcagcaagc tgtgttccct ctatgaaaat gactgcattt ttgataaatt tgagtgtgtg 1561 tggaatgggt cagacagtgt catcatgaca ggctcctaca acaacttctt caggatgttc 1621 gacagaaaca ccaagcgtga tgtgaccctt gaggcttcga gggaaaacag caagccccgg 1681 gctatcctca aaccccgaaa agtgtgtgtg gggggcaagc ggagaaaaga cgagatcagt 1741 gtcgacagtc tggactttag caaaaagatc ttgcatacag cttggcatcc ttcagaaaat 1801 attatagcag tggcggctac aaataaccta tatatattcc aggacaaggt taactaggtg 1861 gacaagttat tacttaataa tctcacatac tgaatactag tcaaacaagt ttttaaatgt 1921 ttctttgggt cttcatttga tgcattgact ttaatttccc tatacaggaa atgattggaa 1981 tagaattaaa aggagtccaa cattcccagc tccccagttc taagaaactt ttgtcaaacc 2041 caataggttt gggacacttc tgtttagaat tgaaagctgc cagctaacag taattcttcc 2101 atagttgact tgaacttctg atgcttttat tgcccagttt tctctggtgg gtccagtgtt 2161 ttgttcctag gtgtctgctg cgataaaatg aggttgtctg tagtatttaa ggagaaaaga 2221 gataagtttt ttttaattaa gcaattccat ttgattgaaa aaaatcaaca aaaaataaac 2281 accgtttact cttagacaaa ttcttcttgt tttgtgaaaa accagaacta gtcagtatct 2341 cctgcccctc caccattttt ttttccattt tccattttcc tttgaacaat ttcatttaag 2401 ccagagattt attgcatgaa gctgagaaga ggatgcagaa tgacaaggaa agggcacatc 2461 aaccctgcta tgctcttttt ttgtaagctc catagaaaca gcctgagaat ttggctaggg 2521 aacttgaatg cttcagggga cagaaagaga gcactttcga cacagtgctt cccagagtga 2581 gcttggcagg gccaggcggg gccaaattcc atctgctgcc ttgttactct tgctttttgt 2641 gctcttaaat ggctccatat aatcttctac ttacatgttc cttggctttt ttctcttcaa 2701 ccttttccag cttatttatt ccattgactt ctaaaggccg agtcctgggt gcttattatc 2761 tggtgttcta aatgaagcag taagttggaa gcagtgccac cacccctgag tccctgagaa 2821 aggctggtct gttctttttg ggtgtttctc ctaagcagca ccctcccctc ctcctggttt 2881 tggtaaccaa aagtaacaat ccatcaacct ccattgtacc tagaacaaaa atagccaata 2941 aaaacgctga gttgtgaagt ccaatcaggc acttctaact caccccaagc tcgccatctg 3001 gaaaaacaga ccagaaggct tctcttctac agaaatgaac tgtggggaaa tcaagcagct 3061 gtgacatgaa gtgaatgaag tccacttgaa gctgtggaag atggttcatc cttttcccca 3121 gttgaggatc cagatttata acttctagaa agccatttcc agaaggttct atgtggcaca 3181 cccctaggaa aggcactaaa tgcatgcaaa ggatttataa acttaggaaa gtagatgggt 3241 ggagtccaga aaactggttc tgggttaata tctctacatc tgtcttgatg actcatttct 3301 cctaactccc atttagtgca gggtaaatgg tttgagatga gagtttttca atgaaaggga 3361 aattttcttt cagtttacag atgtattaga agtcctgact ttcaagtgta atttgctttg 3421 gaggaggaaa aaaaaaaaaa a // LOCUS HUMPROS15 827 bp mRNA PRI 24-MAY-1993 DEFINITION Human prostaglandin D synthase gene, complete cds. ACCESSION M61900 NID g190443 KEYWORDS prostaglandin D synthase; prostaglandin-H-2 D-isomerase. SOURCE Homo sapiens RNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 827) AUTHORS Nagata,A., Suzuki,Y., Igarashi,M., Eguchi,N., Toh,B.H., Urade,Y. and Hayaishi,O. TITLE Human brain prostaglandin D synthase has been evolutionarily differentiated from lipophilic-ligand carrier proteins JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88, 4020-4024 (1991) MEDLINE 91219504 COMMENT From EMBL entry HSPROS15; dated 30-MAR-1991. FEATURES Location/Qualifiers source 1..827 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 86..151 CDS 86..658 /EC_number="5.3.99.2" /note="other name:prostaglandin-H-2 D-isomerase" /codon_start=1 /product="prostaglandin D synthase" /db_xref="PID:g190445" /translation="MATHHTLWMGLVLLGLLGGLQAAPEAQVSVQPNFQPDKFLGRWF SAGLASNSSWLQEKKAALSMCKSVVAPAADGGFNLTSTFLRKNQCETRTMLLQPGDSL GSYSYRSPHWGSTYSVSVVETDYDHYALLYSQGSKGPGEDFRMATLYSRTQTPRAELK EKFTAFCKAQGFTEDSIVFLPQTDKCMTEQ" CDS 86..658 /EC_number="5.3.99.2" /note="other name: prostaglandin-H-2 D-isomerase" /codon_start=1 /product="prostaglandin D synthase" /db_xref="PID:g190444" /translation="MATHHTLWMGLVLLGLLGGLQAAPEAQVSVQPNFQPDKFLGRWF SAGLASNSSWLQEKKAALSMCKSVVAPAADGGFNLTSTFLRKNQCETRTMLLQPGDSL GSYSYRSPHWGSTYSVSVVETDYDHYALLYSQGSKGPGEDFRMATLYSRTQTPRAELK EKFTAFCKAQGFTEDSIVFLPQTDKCMTEQ" mat_peptide 152..655 /EC_number="5.3.99.2" /note="other name:prostaglandin-H-2 D-isomerase" /product="prostaglandin D synthase" mat_peptide 152..655 /EC_number="5.3.99.2" /note="other name: prostaglandin-H-2 D-isomerase" /product="prostaglandin D synthase" polyA_signal 804..809 BASE COUNT 156 a 302 c 224 g 145 t ORIGIN 1 ctcctcctgc acaccttccg cacacctccc tcgctctccc acaccactgg caccaggccc 61 cgcacacctg ctcggctgca ggagaatggc tactcatcac acgctgtgga tgggactggt 121 cctgctgggg ctgctgggcg gcctacaggc agcacccgag gcccaggtct ccgtgcagcc 181 caacttccag ccggacaagt tcctggggcg ctggttcagc gcgggcctcg cctccaactc 241 gagctggctc caggagaaga aggcagcgct gtccatgtgc aagtcggtgg tggcccctgc 301 ggcggatggt ggcttcaacc tgacctccac cttcctcagg aaaaaccagt gtgagacccg 361 aaccatgctg ctgcagcccg gggactccct cggctcctac agctaccgga gtccccactg 421 gggcagcacc tactctgtgt cagtggtgga gactgactac gaccactacg ccctgctgta 481 cagccagggc agcaagggcc ccggcgagga cttccgcatg gccaccctct acagccgaac 541 ccagaccccc agggctgagt taaaggagaa atttaccgcc ttctgcaagg cccagggctt 601 cacagaggat tccattgtct tcctgcccca aaccgataag tgcatgacgg aacaatagga 661 ctccccagag ctgaagctgg gaccgcagcc agccaggtga cccctgcgat ctggatgttt 721 ccgctctgtt ccttccccga gcccctgccc cggctccccg ccaaagcacc cctgccccct 781 cgggcttcct cctggctctg cggaataaac tccggaagca agtctgt // LOCUS HUMPROTP 1887 bp mRNA PRI 15-DEC-1989 DEFINITION Human endomembrane proton pump subunit mRNA, complete cds. ACCESSION M25809 NID g190459 KEYWORDS ATPase; proton pump. SOURCE Human kidney, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1887) AUTHORS Suedhof,T.C., Fried,V.A., Stone,D.K., Johnston,P.A. and Xie,X.-S. TITLE The mammalian endomembrane proton pump strongly resembles the ATP- generating proton pump of archibacteria JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 6067-6071 (1989) MEDLINE 89345606 COMMENT Draft entry and clean copy of sequence [1] kindly submitted by T.C.Suedhof 24-JUN-1989. FEATURES Location/Qualifiers source 1..1887 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 56..1591 /note="proton pump 58 kDa subunit" /codon_start=1 /db_xref="PID:g190460" /translation="MEIDSRPGGLPGSSCNLGAAREHMQAVTRNYITHPRVTYRTVCS VNGPLVVLDRVKFAQYAEIVHFTLPDGTQRSGQVLEVAGTKAIVQVFEGTSGIDARKT TCEFTGDILRTPVSEDMLGRVFNGSGKPIDKGPVVMAEDFLDINGQPINPHSRIYPEE MIQTGISPIDVMNSIARGQKIPIFSAAGLPHNEIAAQICRQAGLVKKSKAVLDYHDDN FAIVFAAMGVNMETARFFKSDFEQNGTMGNVCLFLNLANDPTIERIITPRLALTTAEF LAYQCEKHVLVILTDMSSYAEALREVSAAREEVPGRRGFPGYMYTDLATIYERAGRVE GRGGSITQIPILTMPNDDITHPIPDLTGFITEGQIYVDRQLHNRQIYPPINVLPSLSR LMKSAIGEGMTRKDHGDVSNQLYACYAIGKDVQAMKAVVGEEALTSEDLLYLEFLQKF EKNFINQGPYENRSMFESLDLSWKLLRIFPKEMLKRIPQAVIDEFYSREGRLQDLAPD TAL" BASE COUNT 404 a 580 c 533 g 370 t ORIGIN 1 tcgcccaatt ccgggctcag acactgggct cccagctggg gactgctcca tggccatgga 61 gatagacagc aggcctgggg ggctccccgg cagtagctgc aacctaggtg cagcccgaga 121 acacatgcag gcggtcaccc gaaactacat cacccacccc cgtgtcacct acaggactgt 181 gtgcagcgtg aacgggcccc tggtggtgct ggaccgggtc aagtttgccc agtatgcgga 241 gatcgtccac ttcaccctcc cagatgggac tcagaggagc gggcaggtgc ttgaggtggc 301 tggcaccaag gcgattgttc aggtgtttga agggacatca gggatcgatg ccaggaagac 361 cacttgcgaa tttacagggg acatcctacg aactccggtg tcagaggaca tgctgggtcg 421 ggttttcaat ggctccggca agcccattga caaggggcca gtggtcatgg cggaggactt 481 tctggatatc aatggccagc ccatcaaccc gcactcccgc atctaccccg aggagatgat 541 tcagacgggc atttctccta ttgacgtcat gaacagcatt gcccgcggcc agaagatccc 601 catcttctca gcagccgggc tcccccacaa tgagattgcc gctcagatct gccgccaggc 661 ggggctggtg aagaagtcca aggctgtgct ggattaccat gacgacaact tcgccatcgt 721 ctttgcagcc atgggggtga acatggagac agccagattc ttcaagtctg actttgagca 781 gaatggaacc atggggaacg tctgcctctt cctgaacttg gccaatgacc ccacgatcga 841 gcggatcatc accccgcgcc tggcgctgac cactgctgaa ttccttgcct accagtgtga 901 gaagcatgtg ctggtcatac tgacggacat gagttcctat gcagaggcct tgcgggaggt 961 ctctgctgct agagaggagg tgcctgggcg ccgagggttt cctggatata tgtacacaga 1021 cctggccacc atctacgagc gggcgggccg tgtggagggt cggggaggat ccatcacaca 1081 gatccccatc ctcaccatgc ccaacgacga tatcacccac cctatcccag acttgacggg 1141 cttcatcaca gagggacaga tctacgtgga cagacagctt cacaacagac agatctaccc 1201 ccccatcaac gtgctccctt ccctgtcgcg gctgatgaag tcagccattg gggaaggcat 1261 gacaagaaag gaccatggag atgtctccaa ccagctgtac gcctgctatg ccatcgggaa 1321 ggacgtgcag gccatgaagg cagtagttgg ggaggaggcg ctcacctctg aggacctgct 1381 ctacctggaa ttcctgcaga agtttgagaa gaacttcatc aatcagggcc cctacgagaa 1441 ccgctcgatg ttcgagtcgc tggaccttag ctggaagctg ctgcgcatct tccccaagga 1501 gatgctgaag cgcattccgc aggccgtgat cgacgagttc tattcccgcg aggggcggct 1561 gcaggacctc gcgcctgaca ctgcgctcta gccccgcgcg ccgtggcacc ccaacaccgg 1621 caggaaccta ccctcggctc ccgggtctcc ccgtccctcg ccacccctaa ccagcggctt 1681 tcgcgccgcc ctccgccctc cgtggctccg aggtggtggg gggcgccgca gtcatccctt 1741 tcctcgctcg attccttttc ccgcgctcca tgcctccccc tcagctcccg gtgctgcgga 1801 agaactgaag gttcatgcct actctgacgg gagcatctgt attttttatg ttaaaagccc 1861 acaaaataaa aataaaaatg aactgag // LOCUS HUMPROTXA 2871 bp mRNA PRI 02-SEP-1997 DEFINITION Homo sapiens chromosomal protein mRNA, complete cds. ACCESSION L26953 NID g537529 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2871) AUTHORS Yeo,J.P., Alderuccio,F. and Toh,B.H. TITLE A new chromosomal protein essential for mitotic spindle assembly JOURNAL Nature 367 (6460), 288-291 (1994) MEDLINE 94166884 REMARK Erratum:[Nature 388, page 697 (1997)] REFERENCE 2 (bases 1 to 2871) AUTHORS Toh,B.H. TITLE Direct Submission JOURNAL Submitted (09-SEP-1994) Pathology and Immunology, Monash University, Commercial Road, Prahran, Melbourne, Victoria 3181, Australia FEATURES Location/Qualifiers source 1..2871 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HEp-2" /cell_type="epithelial" CDS 775..2031 /codon_start=1 /product="chromosomal protein" /db_xref="PID:g537530" /translation="MGEFCLILFNLFNFCLILSFLTCIAYPVLYFFSFLNSQTRSLKL FLYFKIEIWSQGIVRFLLRFGLTTFKERFTSLILLNNMHQMIFPMVKYISKLCIFHFW HLVLMDLVPRQQRSIITYSLVFAIISQKKKRGIYHKNNIRIILFLPQAHGRDFYVPIL PFTQSYVDWGRWLIWEAKAGESLEVRSSRPASQSRRNSVSTKNIKISPVSTKNIKISQ TWYLFGGVHLLVPTTRDAEAGELHDPGGRGCNELRSCHCTPAWVTSETVSKKKKKKKK KKTRVLTCINASTLFHVLTRIFCYKQKYPLWLLIKRLSVNLQKQQEDLQRKKKQRKQE HPELTAQNTVLKMLLQGVLINLYPTVTLNRPGVVAHACIPALWAEGGVHLNPGQPGQH GETPCLQKIKKPGVVASSYSSSYLGD" BASE COUNT 925 a 554 c 550 g 842 t ORIGIN 1 ttttttttgc tttttgcaga tttgaaattt aattaagcta ttaacttctt aatcatcaca 61 tcctccaact ctgttttaac tcccccctcc ccctcccctt tttttttaag gaataaaggg 121 attcattcct cattttaact gatgttacag tgaagatggg ttcttgaact cttggaagcc 181 tggatgagcc acctaatctc aaagataaaa accaaagacc aatgcgtatt ggggaaaaga 241 atgcttagta ctgcaagact gttgaatacc tgttgaatat tcctattgag gttttttcct 301 aaacatactt cagtaacatc ttaccggaca attgcactgg agaaatgttg atccctggct 361 ggaatgtcat accattgacc catttgaaga gttaaagctg gatttgactg ctctattcta 421 ccaggaatat tgttagggta gccttttcca gtttctaaac aattgtaatc atttattgac 481 tcagcaattc ctcagataac aggtcaaaag atgtacagat acattctgaa gttttcttgc 541 tattaaaggc acaagagttt ccttgtattt tgactgacaa tgtagcatgt ttccatttag 601 tttgttagtg aggtggtttt ccctttgaaa gccatttggt atattcacca taacaattag 661 tttaatatga ttacataaga aaactatgat aaaacccagc aattttagta gttgtgaaaa 721 tacgtttttt aaatcatgtt taagaagaat tgcaagactt gaaaccaaat cctgatgggg 781 gaattctgtt taatcctgtt taatctgttt aatttctgtt taatccttag tttcttaacc 841 tgcatagctt atcctgtatt gtactttttt tcttttttaa actcccaaac aagaagcttg 901 aaactttttc tgtattttaa aattgaaatt tggtcacagg gtatagtcag atttttatta 961 aggtttggtt tgacaacctt taaagaaagg tttacctcgc taatacttct taataacatg 1021 catcaaatga tattccctat ggtgaagtat atctcaaagt tatgtatctt tcatttttgg 1081 catttggtgc ttatggactt agtacccagg caacaaagat ctattatcac ctactctctt 1141 gtattcgcta ttatttccca aaaaaaaaaa aggggcatat atcataagaa taatattaga 1201 attattttgt ttctcccaca agcccatggt agagatttct atgttcccat tctccctttc 1261 actcagagtt atgttgactg ggggcggtgg ctcatttggg aagccaaggc gggtgaatca 1321 cttgaggtca ggagttcaag accagctagc caatcacggc gaaactccgt ctctactaaa 1381 aatataaaaa ttagtcccgt ctctactaaa aatataaaaa ttagtcagac gtggtacctg 1441 tttggtggtg tgcatctgct agtcccaact actcgggacg ctgaggcagg tgaattgcat 1501 gatccaggag gcagaggctg caatgagctg agatcatgcc actgcactcc agcctgggtg 1561 acaagtgaga ctgtctcaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaac aagggtcctt 1621 acttgcatta atgcatccac attattccat gtactcacca ggattttttg ttataaacaa 1681 aaatacccac tctggttact tattaaaaga ttatcagtca atttacaaaa acaacaggaa 1741 gatttacaaa gaaaaaaaaa acaaaggaag caagaacatc cagaactcac agcacagaat 1801 actgtgctta aaatgctgct gcaaggagta ttaattaact tataccctac tgtgacactt 1861 aacaggccag gtgtggtggc tcacgcctgt atcccagcac tttgggctga aggcggagta 1921 cacttgaacc caggacagcc tgggcaacat ggtgagaccc cgtgtctaca aaaaataaaa 1981 aagccgggag tggtggcaag ctcctatagt tccagctact tgggagactg aggtggaggg 2041 atcacttcag cctgggaggt caaggtcaca gtgagctata atcaagccac cgcactctgg 2101 cctgggcaac agagacagtc tcaaaaagaa aaaaaaaaaa gcacttaaca cactttatga 2161 atacaatatg gcacatctat ctgcccatta gagtgagact cctgagtatt ttactcatct 2221 ttccattctc aaatgcaaag agtatgtggc atataataac tgtccgaatg tacgactgac 2281 atcttaaagc gtgtcggtcg gcaaactttt tcgtaaaggg ccaggtaagt aaacattata 2341 agctttgtaa tccacacagt ctctgttcaa accattctac tgttgtagca ggaaagcagc 2401 catggataat atgtaaacaa gtcagctacg atgtgttcaa ataaaacttt ataaaaacag 2461 gcagtaggct gggtttagcc catggaccac aatttgccaa gtatttaaca agaatcaaca 2521 cttttccttt cataatttaa ttgaagttgg tacctacaaa gatatgagct cactactaca 2581 tatgactatc tgtaatggat caattttgga tatgactttg ggtgggggta aaaaaagaac 2641 cgaagacatg taatatagat gaataatgaa aataacaggg ctgggtgcag ttgcctgcac 2701 tttaggaggc tgaggcagga gcatctctgg aacctgggag gcagaggttg cagtgagcca 2761 agattgcgcc attgcactcc agccaagggg acaagagcaa aaacaaaaac aaaaacaaaa 2821 acctttaagg taccagcctg gctaagcagt taacatacct cactttatca g // LOCUS HUMPROZI 1485 bp mRNA PRI 11-JAN-1991 DEFINITION Human protein Z mRNA, complete cds. ACCESSION M55670 NID g190463 KEYWORDS plasma glycoprotein; protein Z. SOURCE Human liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1485) AUTHORS Ichinose,A., Takeya,H., Espling,E., Iwanaga,S., Kisiel,W. and Davie,E.W. TITLE Amino acid sequence of human protein Z, a vitamin K-dependent plasma glycoprotein JOURNAL Biochem. Biophys. Res. Commun. 172, 1139-1144 (1990) MEDLINE 91058548 FEATURES Location/Qualifiers source 1..1485 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="Hep G-2" /tissue_type="liver" mRNA <1..1485 /gene="human protein Z" gene 1..1485 /gene="human protein Z" sig_peptide 8..76 /gene="human protein Z" /product="protein Z" CDS 8..1210 /gene="human protein Z" /codon_start=1 /product="protein Z" /db_xref="PID:g190464" /translation="MAGCVPLLQGLVLVLALHRVEPSVFLPASKANDVLVRWKRAGSY LLEELFEGNLEKECYEEICVYEEAREVFENEVVTDEFWRRYKGGSPCISQPCLHNGSC QDSIWGYTCTCSPGYEGSNCELAKNECHPERTDGCQHFCLPGQESYTCSCAQGYRLGE DHKQCVPHDQCACGVLTSEKRAPDLQDLPWQVKLTNSEGKDFCGGVIIRENFVLTTAK CSLLHRNITVKTYFNRTSQDPLMIKITHVHVHMRYDADAGENDLSLLELEWPIQCPGA GLPVCTPEKDFAEHLLIPRTRGLLSGWARNGTDLGNSLTTRPVTLVEGEECGQVLNVT VTTRTYCERSSVAAMHWMDGSVVTREHRGSWFLTGVLGSQPVGGQAHMVLVTKVSRYS LWFKQIMN" mat_peptide 128..1207 /gene="human protein Z" /product="protein Z" BASE COUNT 373 a 398 c 411 g 303 t ORIGIN 1 ggtgggaatg gcaggctgcg tcccactgct ccagggcctg gtcctggtcc tcgccctcca 61 tcgtgtggag ccctcagtat ttctcccggc ctccaaagca aacgacgttc tggtgaggtg 121 gaagcgtgcg ggctcctatc ttctggaaga actcttcgag ggaaacttgg aaaaagaatg 181 ttatgaagaa atctgtgtct atgaagaagc aagagaagtg tttgaaaatg aagtagtcac 241 tgatgaattc tggagacgat ataagggcgg ctccccgtgc atctcccagc cctgcctcca 301 caacggctct tgccaggaca gcatctgggg ctacacctgc acctgctccc ccggctatga 361 gggcagcaac tgcgagctgg ctaaaaatga atgtcaccca gagcggactg atgggtgtca 421 acacttctgc ctcccaggac aggaatccta cacgtgcagc tgtgctcagg gctacaggct 481 tggtgaggac cacaaacagt gtgtgcccca cgaccagtgt gcctgcgggg tgctgacctc 541 tgagaagcgt gcaccggatc tacaggacct cccgtggcag gtaaagttaa caaattccga 601 aggaaaagac ttctgtggtg gtgttataat acgggaaaat tttgtactga caacagcaaa 661 atgttcactg ttacacagga atattactgt aaaaacatat tttaacagaa cgagccaaga 721 cccgctgatg atcaagataa cgcacgtcca tgtgcacatg cggtatgacg cggacgcggg 781 ggagaatgac ctgtcactgc tggagctgga gtggcccatc cagtgcccag gtgcggggct 841 ccccgtgtgc acccctgaga aagacttcgc tgagcacctc ctcatcccac gcaccagggg 901 cctcctcagc ggctgggcac gcaatggcac tgacctgggc aactcgctga ccacgcggcc 961 tgtcacactt gtggaggggg aggagtgcgg gcaggtcctg aatgtgactg tcaccaccag 1021 gacctactgt gagagaagca gcgtggcggc catgcactgg atggatggaa gtgtggtcac 1081 cagagaacac agaggctcct ggtttctcac gggggtcctg ggctcgcagc cagtaggagg 1141 gcaggctcac atggtccttg tcaccaaggt ctccaggtac tcactctggt ttaaacagat 1201 catgaactaa ctgaaactca gctagccaga atgaacaaca caaccggaag cgggattcca 1261 agctggcact gccactgtgg agggcgctga aacttcatca cacactgaga ggccgtcaca 1321 gccccagacc acccgcttgg cccacgcagc agcagagccg ccgtttgctg ggttgtttac 1381 cgagcactgt gacctttctt tccctggaac tctttatctc aatagagacc ttaaaagaaa 1441 acatgagata cgttaaataa taaaataaga taatctgtca gtcat // LOCUS HUMPRP 2415 bp mRNA PRI 08-JAN-1995 DEFINITION Human prion protein (PrP) mRNA, complete cds. ACCESSION M13899 NID g190467 KEYWORDS prion protein. SOURCE Human retina, cDNA to mRNA, library J.Nathans, clone HuPrPcDNA-[1,2]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2415) AUTHORS Kretzschmar,H.A., Stowring,L.E., Westaway,D., Stubblebine,W.H., Prusiner,S.B. and Dearmond,S.J. TITLE Molecular cloning of a human prion protein cDNA JOURNAL DNA 5 (4), 315-324 (1986) MEDLINE 86300093 FEATURES Location/Qualifiers source 1..2415 /organism="Homo sapiens" /db_xref="taxon:9606" /map="20pter-p12" mRNA <1..2415 /note="PRP mRNA" sig_peptide 50..115 /gene="PRNP" /note="prion protein signal peptide" gene 50..811 /gene="PRNP" CDS 50..811 /gene="PRNP" /codon_start=1 /db_xref="GDB:G00-120-720" /product="prion protein" /db_xref="PID:g190468" /translation="MANLGCWMLVLFVATWSDLGLCKKRPKPGGWNTGGSRYPGQGSP GGNRYPPQGGGGWGQPHGGGWGQPHGGGWGQPHGGGWGQPHGGGWGQGGGTHSQWNKP SKPKTNMKHMAGAAAAGAVVGGLGGYMLGSAMSRPIIHFGSDYEDRYYRENMHRYPNQ VYYRPMDEYSNQNNFVHDCVNITIKQHTVTTTTKGENFTETDVKMMERVVEQMCITQY ERESQAYYQRGSSMVLFSSPPVILLISFLIFLIVG" mat_peptide 116..808 /gene="PRNP" /note="prion protein (PrP 33-35-C)" unsure 1965 /note="t may be absent [1]" BASE COUNT 666 a 499 c 578 g 672 t ORIGIN Unreported. 1 cggcgccgcg agcttctcct ctcctcacga ccgaggcaga gcagtcatta tggcgaacct 61 tggctgctgg atgctggttc tctttgtggc cacatggagt gacctgggcc tctgcaagaa 121 gcgcccgaag cctggaggat ggaacactgg gggcagccga tacccggggc agggcagccc 181 tggaggcaac cgctacccac ctcagggcgg tggtggctgg gggcagcctc atggtggtgg 241 ctgggggcag cctcatggtg gtggctgggg gcagccccat ggtggtggct ggggacagcc 301 tcatggtggt ggctggggtc aaggaggtgg cacccacagt cagtggaaca agccgagtaa 361 gccaaaaacc aacatgaagc acatggctgg tgctgcagca gctggggcag tggtgggggg 421 ccttggcggc tacatgctgg gaagtgccat gagcaggccc atcatacatt tcggcagtga 481 ctatgaggac cgttactatc gtgaaaacat gcaccgttac cccaaccaag tgtactacag 541 gcccatggat gagtacagca accagaacaa ctttgtgcac gactgcgtca atatcacaat 601 caagcagcac acggtcacca caaccaccaa gggggagaac ttcaccgaga ccgacgttaa 661 gatgatggag cgcgtggttg agcagatgtg tatcacccag tacgagaggg aatctcaggc 721 ctattaccag agaggatcga gcatggtcct cttctcctct ccacctgtga tcctcctgat 781 ctctttcctc atcttcctga tagtgggatg aggaaggtct tcctgttttc accatctttc 841 taatcttttt ccagcttgag ggaggcggta tccacctgca gcccttttag tggtggtgtc 901 tcactctttc ttctctcttt gtcccggata ggctaatcaa tacccttggc actgatgggc 961 actggaaaac atagagtaga cctgagatgc tggtcaagcc ccctttgatt gagttcatca 1021 tgagccgttg ctaatgccag gccagtaaaa gtataacagc aaataaccat tggttaatct 1081 ggacttattt ttggacttag tgcaacaggt tgaggctaaa acaaatctca gaacagtctg 1141 aaataccttt gcctggatac ctctggctcc ttcagcagct agagctcagt atactaatgc 1201 cctatcttag tagagatttc atagctattt agagatattt tccattttaa gaaaacccga 1261 caacatttct gccaggtttg ttaggaggcc acatgatact tattcaaaaa aatcctagag 1321 attcttagct cttgggatgc aggctcagcc cgctggagca tgagctctgt gtgtaccgag 1381 aactggggtg atgttttact tttcacagta tgggctacac agcagctgtt caacaagagt 1441 aaatattgtc acaacactga acctctggct agaggacata ttcacagtga acataactgt 1501 aacatatatg aaaggcttct gggacttgaa atcaaatgtt tgggaatggt gcccttggag 1561 gcaacctccc attttagatg tttaaaggac cctatatgtg gcattccttt ctttaaacta 1621 taggtaatta aggcagctga aaagtaaatt gccttctaga cactgaaggc aaatctcctt 1681 tgtccattta cctggaaacc agaatgattt tgacatacag gagagctgca gttgtgaaag 1741 caccatcatc atagaggatg atgtaattaa aaaatggtca gtgtgcaaag aaaagaactg 1801 cttgcatttc tttatttctg tctcataatt gtcaaaaacc agaattaggt caagttcata 1861 gtttctgtaa ttggcttttg aatcaaagaa tagggagaca atctaaaaaa tatcttaggt 1921 tggagatgac agaaatatga ttgatttgaa gtggaaaaag aaattctgtt aatgttaatt 1981 aaagtaaaat tattccctga attgtttgat attgtcacct agcagatatg tattactttt 2041 ctgcaatgtt attattggct tgcactttgt gagtatctat gtaaaaatat atatgtatat 2101 aaaatatata ttgcatagga cagacttagg agttttgttt agagcagtta acatctgaag 2161 tgtctaatgc attaactttt gtaaggtact gaatacttaa tatgtgggaa acccttttgc 2221 gtggtcctta ggcttacaat gtgcactgaa tcgtttcatg taagaatcca aagtggacac 2281 cattaacagg tctttgaaat atgcatgtac tttatatttt ctatatttgt aactttgcat 2341 gttcttgttt tgttatataa aaaaattgta aatgtttaat atctgactga aattaaacga 2401 gcgaagatga gcacc // LOCUS HUMPRP8A 2092 bp mRNA PRI 25-NOV-1994 DEFINITION Human (clone N5-4) protein p84 mRNA, complete cds. ACCESSION L36529 NID g550057 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2092) AUTHORS Durfee,T., Mancini,M.A., Jones,D., Elledge,S.J. and Lee,W.H. TITLE The amino-terminal region of the retinoblastoma gene product binds a novel nuclear matrix protein that co-localizes to centers for RNA processing JOURNAL J. Cell Biol. 127, 609-622 (1994) MEDLINE 95050936 FEATURES Location/Qualifiers source 1..2092 /organism="Homo sapiens" /note="(vector lambda ACT)" /db_xref="taxon:9606" /cell_type="Epstein-Barr virus transformed peripheral lymphocytes" /clone="N5-4" /clone_lib="S. Elledge" 5'UTR <1..14 mRNA 1..2092 CDS 15..1988 /codon_start=1 /product="protein p84" /db_xref="PID:g550058" /translation="MSPTPPLFSLPEARTRFTKSTREALNNKNIKPLLSTFSQVPGSE NEKKCTLDQAFRGILEEEIINHSSCENVLAIISLAIGGVTEGICTASTPFVLLGDVLD CLPLDQCDTIFTFVEKNVATWKSNTFYAAGKNYLLRMCNDLLRRLSKSQNTVFCGRIQ LFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGEMG DEEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKG QVKFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWK NEGCPSFVKERTSDTKPTRIIRKRTAPEDFLGKGPTKKILTGNEELTRLWNLCPDNME ACKSETREHMPTLEEFFEEAIEQADPENMAENEYKAMNNSNYGWRALKLLARRSPHFF QPTNQQFKSLQEYLENMVIKLAKELPPPSEEIKTGEDEDEEDNDALLKENESPDVRRD KPVTGEQIEVFANKLGEQWKILAPYLEMKDSEIRQIECDSEDMKMRAKQLLVAWQDQE GVHATPENLINALNKSGLSDLAESLTNDNETNS" 3'UTR 1989..2092 polyA_signal 2075..2080 polyA_site 2092 BASE COUNT 713 a 371 c 451 g 557 t ORIGIN 1 ggccttcgtc gaagatgtct ccgacgccgc cgctcttcag tttgcccgaa gcgcggacgc 61 ggtttacgaa gtctaccaga gaggccttga acaacaaaaa catcaagcca ttgttaagta 121 ccttcagcca ggtacctggc agtgaaaatg aaaaaaaatg tacccttgac caagctttca 181 gaggtattct agaagaagaa attataaatc attcatcatg tgaaaacgtt ttagctatta 241 tttctcttgc tattggggga gtaactgaag gtatttgtac cgcatctaca ccttttgtat 301 tgttgggaga tgttttggat tgtcttcctt tggatcagtg tgacacaata ttcacttttg 361 tcgaaaaaaa tgttgctact tggaaatcaa ataccttcta tgctgctggg aaaaattact 421 tactacgtat gtgcaatgat ctcctaagaa gattgtctaa atcccagaat acagtcttct 481 gtggacggat tcagctcttt ttggccaggc ttttccctct gtctgagaaa tcaggtctta 541 acttgcagag tcagtttaat ctggaaaatg tcactgtttt caatacaaat gagcaggaaa 601 gcaccctggg tcagaagcac actgaagata gagaagaagg aatggatgta gaagaaggcg 661 aaatgggaga tgaggaagct ccaacaacgt gctctattcc aattgattac aacctgtatc 721 gaaaattctg gtcacttcag gattacttca ggaaccctgt gcaatgctat gagaagattt 781 catggaaaac ttttctcaag tattctgaag aagttttagc tgtttttaag agttataaat 841 tagatgatac tcaggcctca agaaaaaaga tggaagaatt gaaaacagga ggagaacatg 901 tatattttgc aaaattttta acaagtgaaa agctgatgga tttacaactg agtgacagta 961 actttcgtcg acacatcctg ttgcagtatc tcattttatt ccaatatctc aaggggcagg 1021 tcaaattcaa aagttcaaac tatgttttaa ctgatgagca atcactttgg attgaagata 1081 ctacaaaatc agtttatcaa ctactatctg aaaacccccc cgatggagaa agattttcaa 1141 agatggtaga gcatatatta aacactgaag aaaactggaa ctcgtggaaa aatgaaggtt 1201 gcccaagttt tgtgaaagaa agaacatcag ataccaaacc tacgagaata attcggaaga 1261 gaacagcacc cgaggacttc ctagggaaag gacccaccaa aaaaattctg acgggaaatg 1321 aggagttaac aaggctttgg aatctttgcc ctgataatat ggaagcctgt aaatcagaga 1381 caagggaaca catgcccact ttggaggaat tctttgaaga agccattgaa caggcagacc 1441 ctgaaaatat ggcggaaaat gaatataagg ctatgaacaa ttcaaattat ggttggagag 1501 ccctgaaact attagcacgg agaagccctc acttcttcca gccaaccaac cagcagttta 1561 aaagtttaca agaatatctt gaaaatatgg taataaagct agccaaggaa ttaccgcctc 1621 cttctgaaga aataaaaaca ggtgaggatg aagatgagga agataatgat gctctactga 1681 aggaaaatga aagtcctgat gttcggcgag acaaacctgt aacaggagaa caaatagagg 1741 tatttgccaa caagctgggt gaacaatgga agattctggc tccctacttg gaaatgaaag 1801 actcagaaat taggcagatt gagtgtgaca gtgaagacat gaagatgaga gctaagcagc 1861 tcctggttgc ctggcaagat caagagggag ttcatgcaac acctgagaat ctgattaatg 1921 cactgaataa gtctggatta agtgaccttg cagaaagtct aactaatgac aatgagacaa 1981 atagttagct tctttttttt ttctttttat taaaactgtg atagattttg ttaccaagca 2041 gcatttgata agaggtccac tggttttggt aaacaataaa catttttata ac // LOCUS HUMPRPC4B 2178 bp mRNA PRI 21-FEB-1991 DEFINITION Human proline-rich protein (PRP) mRNA, complete cds. ACCESSION M31452 NID g190501 KEYWORDS C4b-binding protein; proline-rich protein. SOURCE Human liver, cDNA to mRNA, clones lambda-PRP[2,4,6,7,8]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2178) AUTHORS Matsuguchi,T., Okamura,S., Aso,T., Sata,T. and Niho,Y. TITLE Molecular cloning of the cDNA coding for proline-rich protein (PRP): Identity of PRP as C4b-binding protein JOURNAL Biochem. Biophys. Res. Commun. 165, 138-144 (1989) MEDLINE 90073699 REFERENCE 2 (sites) AUTHORS Aso,T., Okamura,S., Matsuguchi,T., Sakamoto,N., Sata,T. and Niho,Y. TITLE Genomic organization of the alpha chain of the human C4b-binding protein gene JOURNAL Biochem. Biophys. Res. Commun. 174, 222-227 (1991) MEDLINE 91113199 FEATURES Location/Qualifiers source 1..2178 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /map="12p13.2" mRNA <1..2178 /gene="C4bp" /note="PRP mRNA" /product="C4b-binding protein alpha chain" gene 1..2178 /gene="C4bp" sig_peptide 139..282 /gene="C4bp" /note="proline-rich protein" CDS 139..1932 /gene="C4bp" /note="proline-rich protein" /codon_start=1 /product="C4b-binding protein alpha chain" /db_xref="PID:g190502" /translation="MHPPKTPSGALHRKRKMAAWPFSRLWKVSDPILFQMTLIAALLP AVLGNCGPPPTLSFAAPMDITLTETRFKTGTTLKYTCLPGYVRSHSTQTLTCNSDGEW VYNTFCIYKRCRHPGELRNGQVEIKTDLSFGSQIEFSCSEGFFLIGSTTSRCEVQDRG VGWSHPLPQCEIVKCKPPPDIRNGRHSGEENFYAYGFSVTYSCDPRFSLLGHASISCT VENETIGVWRPSPPTCEKITCRKPDVSHGEMVSGFGPIYNYKDTIVFKCQKGFVLRGS SVIHCDADSKWNPSPPACEPNSCINLPDIPHASWETYPRPTKEDVYVVGTVLRYRCHP GYKPTTDEPTTVICQKNLRWTPYQGCEALCCPEPKLNNGEITQHRKSRPANHCVYFYG DEISFSCHETSRFSAICQGDGTWSPRTPSCGDICNFPPKIAHGHYKQSSSYSFFKEEI IYECDKGYILVGQAKLSCSYSHWSAPAPQCKALCRKPELVNGRLSVDKDQYVEPENVT IQCDSGYGVVGPQSITCSGNRTWYPEVPKCEWETPEGCEQVLTGKRLMQCLPNPEDVK MALEVYKLSLEIEQLELQRDSARQSTLDKEL" mat_peptide 283..1929 /gene="C4bp" /note="proline-rich protein" /product="C4b-binding protein alpha chain" BASE COUNT 629 a 480 c 468 g 601 t ORIGIN Chromosome 12p13.2. 1 aaaactctga tctggggagg aaccaggact acatagatca aggcagtttt cttctttgag 61 aaactatccc agatatcatc atagagtctt ctgctcttcc tcaactacca aagaaaaaca 121 tcagcgaagc agcaggccat gcacccccca aaaactccat ctggggctct tcatagaaaa 181 aggaaaatgg cagcctggcc cttctccagg ctgtggaaag tctctgatcc aattctcttc 241 caaatgacct tgatcgctgc tctgttgcct gctgttcttg gcaattgtgg tcctccaccc 301 actttatcat ttgctgcccc gatggatatt acgttgactg agacacgctt caaaactgga 361 actactctga aatacacctg cctccctggc tacgtcagat cccattcaac tcagacgctt 421 acctgtaatt ctgatggcga atgggtgtat aacaccttct gtatctacaa acgatgcaga 481 cacccaggag agttacgtaa tgggcaagta gagattaaga cagatttatc ttttggatca 541 caaatagaat tcagctgttc agaaggattt ttcttaattg gctcaaccac tagtcgttgt 601 gaagtccaag atagaggagt tggctggagt catcctctcc cacaatgtga aattgtcaag 661 tgtaagcctc ctccagacat caggaatgga aggcacagcg gtgaagaaaa tttctacgca 721 tacggctttt ctgtcaccta cagctgtgac ccccgcttct cactcttggg ccatgcctcc 781 atttcttgca ctgtggagaa tgaaacaata ggcgtttgga gaccaagccc tcctacctgt 841 gaaaaaatca cctgtcgcaa gccagatgtt tcacatgggg aaatggtctc tggatttgga 901 cccatctata attacaaaga cactattgtg tttaagtgcc aaaaaggttt tgttctcaga 961 ggcagcagtg taattcattg tgatgctgat agcaaatgga atccttctcc tcctgcttgt 1021 gagcccaata gttgtattaa tttaccagac attccacatg cttcctggga aacatatcct 1081 aggccgacaa aagaggatgt gtatgttgtt gggactgtgt taaggtaccg ctgtcatcct 1141 ggctacaaac ccactacaga tgagcctacg actgtgattt gtcagaaaaa tttgagatgg 1201 accccatacc aaggatgtga ggcgttatgt tgccctgaac caaagctaaa taatggtgaa 1261 atcactcaac acaggaaaag tcgtcctgcc aatcactgtg tttatttcta tggagatgag 1321 atttcatttt catgtcatga gaccagtagg ttttcagcta tatgccaagg agatggcacg 1381 tggagtcccc gaacaccatc atgtggagac atttgcaatt ttcctcctaa aattgcccat 1441 gggcattata aacaatctag ttcatacagc tttttcaaag aagagattat atatgaatgt 1501 gataaaggct acattctggt cggacaggcg aaactctcct gcagttattc acactggtca 1561 gctccagccc ctcaatgtaa agctctgtgt cggaaaccag aattagtgaa tggaaggttg 1621 tctgtggata aggatcagta tgttgagcct gaaaatgtca ccatccaatg tgattctggc 1681 tatggtgtgg ttggtcccca aagtatcact tgctctggga acagaacctg gtacccagag 1741 gtgcccaagt gtgagtggga gacccccgaa ggctgtgaac aagtgctcac aggcaaaaga 1801 ctcatgcagt gtctcccaaa cccagaggat gtgaaaatgg ccctggaggt atataagctg 1861 tctctggaaa ttgaacaact ggaactacag agagacagcg caagacaatc cactttggat 1921 aaagaactat aatttttctc aaaagaagga ggaaaaggtg tcttgctggc ttgcctcttg 1981 caattcaata cagatcagtt tagcaaatct actgtcaatt tggcagtgat attcatcata 2041 ataaatatct agaaatgata atttgctaaa gtttagtgct ttgagattgt gaaattatta 2101 atcatcctct gtgtggctca tgtttttgct tttcaacaca caaagcacaa attttttttc 2161 gattaaaaat gtatgtat // LOCUS HUMPSC3 866 bp mRNA PRI 01-JUL-1991 DEFINITION Human mRNA for proteasome subunit HC3. ACCESSION D00760 NID g220023 KEYWORDS C3; component 3; mutlicatalytic proteinase complex; proteasome. SOURCE Human, cell line HepG2, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 866) AUTHORS Tamura,T., Lee,D.H., Osaka,F., Fujiwara,T., Shin,S., Chung,C.H., Tanaka,K. and Ichihara,A. TITLE Molecular cloning and sequence analysis of cDNAs for five major subunits of human proteasomes (multi-catalytic proteinase complexes) JOURNAL Biochim. Biophys. Acta 1089 (1), 95-102 (1991) MEDLINE 91223105 COMMENT These data kindly submitted in computer readable form by: Keiji Tanaka Institute for Enzyme Research The University of Tokushima 3 Kuramoto-cho Tokushima 770 Japan Phone: 0886-31-3111 x2562 Fax: 0886-33-0771. FEATURES Location/Qualifiers source 1..866 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1..705 /note="proteasome subunit C3" /codon_start=1 /db_xref="PID:d1001115" /db_xref="PID:g220024" /translation="MAERGYSFSLTTFSPSGKLVQIEYALAAVAGGAPSVGIKAANGV VLATEKKQKSILYDERSVHKVEPITKHIGLVYSGMGPDYRVLVHRARKLAQQYYLVYQ EPIPTAQLVQRVASVMQEYTQSGGVRPFGVSLLICGWNEGRPYLFQSDPSGAYFAWKA TAMGKNYVNGKTFLEKRYNEDLELEDAIHTAILTLKESFEGQMTEDNIEVGICNEAGF RRLTPTEVKDYLAAIA" BASE COUNT 268 a 157 c 194 g 247 t ORIGIN 1 atggcggagc gcgggtacag cttttcgctg actacattca gcccgtctgg taaacttgtc 61 cagattgaat atgctttggc tgctgtagct ggaggagccc cgtccgtggg aattaaagct 121 gcaaatggtg tggtattagc aactgagaaa aaacagaaat ccattctgta tgatgagcga 181 agtgtacaca aagtagaacc aattaccaag catataggtt tggtgtacag tggcatgggc 241 cccgattaca gagtgcttgt gcacagagct cgaaaactag ctcaacaata ctatcttgtg 301 taccaagaac ccattcctac agctcagctg gtacagagag tagcttctgt gatgcaagaa 361 tatactcagt caggtggtgt tcgtccattt ggagtttctt tacttatttg tggttggaat 421 gagggacgac catatttatt tcagtcagat ccatctggag cttactttgc ctggaaagct 481 acagcaatgg gaaagaacta tgtgaatggg aagactttcc ttgagaaaag atataatgaa 541 gatctggaac ttgaagatgc cattcataca gccatcttaa ccctaaagga aagctttgaa 601 gggcaaatga cagaggataa catagaagtt ggaatctgca atgaagctgg atttaggagg 661 cttactccaa ctgaagttaa ggattacttg gctgccatag cataacaatg aagtgactga 721 aaaatccaga atttcagata atctatctac ttaaacatgt ttaaagtatg ttttgttttg 781 cagacttttt gcatacttat ttctacatgg tttaaatcga ctgtttttaa aatgacactt 841 ataaatccta ataaactgtt aaaccc // LOCUS HUMPSC5 823 bp mRNA PRI 01-JUL-1991 DEFINITION Human mRNA for proteasome subunit HC5. ACCESSION D00761 NID g220025 KEYWORDS C5; component 5; multicatalytic proteinase complex; proteasome. SOURCE Human, cell line HepG2, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 823) AUTHORS Tamura,T., Lee,D.H., Osaka,F., Fujiwara,T., Shin,S., Chung,C.H., Tanaka,K. and Ichihara,A. TITLE Molecular cloning and sequence analysis of cDNAs for five major subunits of human proteasomes (multi-catalytic proteinase complexes) JOURNAL Biochim. Biophys. Acta 1089 (1), 95-102 (1991) MEDLINE 91223105 COMMENT These data kindly submitted in computer readable form by: Keiji Tanaka Institute for Enzyme Research The University of Tokushima 3 Kuramoto-cho Tokushima 770 Japan Phone: 0886-31-3111 x2562 Fax: 0886-33-0771. FEATURES Location/Qualifiers source 1..823 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 13..738 /note="proteasome subunit C5" /codon_start=1 /db_xref="PID:d1001116" /db_xref="PID:g220026" /translation="MLSSTAMYSAPGRDLGMEPHRAAGPLQLRFSPYVFNGGTILAIA GEDFAIVASDTRLSEGFSIHTRDSPKCYKLTDKTVIGCSGFHGDCLTLTKIIEARLKM YKHSNNKAMTTGAIAAMLSTILYSRRFFPYYVYNIIGGLDEEGKGAVYSFDPVGSYQR DSFKAGGSASAMLQPLLDNQVGFKNMQNVEHVPLSLDRAMRLVKDVFISAAERDVYTG DALRICIVTKEGIREETVSLRKD" BASE COUNT 211 a 173 c 214 g 225 t ORIGIN 1 cgcagccgtg cgatgttgtc ctctacagcc atgtattcgg ctcctggcag agacttgggg 61 atggaaccgc acagagccgc gggccctttg cagctgcgat tttcgcccta cgttttcaac 121 ggaggtacta tactggcaat tgctggagaa gattttgcaa ttgttgcttc tgatactcga 181 ttgagtgaag ggttttcaat tcatacgcgg gatagcccca aatgttacaa attaacagac 241 aaaacagtca ttggatgcag cggttttcat ggagactgtc ttacgctgac aaagattatt 301 gaagcaagac taaagatgta taagcattcc aataataagg ccatgactac gggggcaatt 361 gctgcaatgc tgtctacaat cctgtattca aggcgcttct ttccatacta tgtttacaac 421 atcatcggtg gacttgatga agaaggaaag ggggctgtat acagctttga tccagtaggg 481 tcttaccaga gagactcctt caaggctgga ggctcagcaa gtgccatgct acagcccctg 541 cttgacaacc aggttggttt taagaacatg cagaatgtgg agcatgttcc gctgtccttg 601 gacagagcca tgcggctggt gaaagatgtc ttcatttctg cggctgagag agatgtgtac 661 actggggacg cactccggat ctgcatagtg accaaagagg gcatcaggga ggaaactgtt 721 tccttaagga aggactgatc tgtgtgctct tatcaccaat cagttcagac ctggttgatt 781 ttgtactttg gaactgtacc ttggatggtt ttgtttatta aaa // LOCUS HUMPSC8 838 bp mRNA PRI 01-JUL-1991 DEFINITION Human mRNA for proteasome subunit HC8. ACCESSION D00762 NID g220027 KEYWORDS C8; component 8; multicatalytic proteinase complex; proteasome. SOURCE Human, cell line HepG2, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 838) AUTHORS Tamura,T., Lee,D.H., Osaka,F., Fujiwara,T., Shin,S., Chung,C.H., Tanaka,K. and Ichihara,A. TITLE Molecular cloning and sequence analysis of cDNAs for five major subunits of human proteasomes (multi-catalytic proteinase complexes) JOURNAL Biochim. Biophys. Acta 1089 (1), 95-102 (1991) MEDLINE 91223105 COMMENT These data kindly submitted in computer readable form by: Keiji Tanaka Institute for Enzyme Research The University of Tokushima 3 Kuramoto-cho Tokushima 770 Japan Phone: 0886-31-3111 x2562 Fax: 0886-33-0771. FEATURES Location/Qualifiers source 1..838 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 6..773 /note="proteasome subunit C8" /codon_start=1 /db_xref="PID:d1001117" /db_xref="PID:g220028" /translation="MSSIGTGYDLSASTFSPDGRVFQVEYAMKAVENSSTAIGIRCKD GVVFGVEKLVLSKLYEEGSNKRLFNVDRHVGMAVAGLLADARSLADIAREEASNFRSN FGYNIPLKHLADRVAMYVHAYTLYSAVRPFGCSFMLGSYSVNDGAQLYMIDPSGVSYG YWGCAIGKARQAAKTEIEKLQMKEMTCRDIVKEVAKIIYIVHDEVKDKAFELELSWVG ELTNGRHEIVPKDIREEAEKYAKESLKEEDESDDDNM" BASE COUNT 267 a 143 c 192 g 236 t ORIGIN 1 gcacgatgag ctcaatcggc actgggtatg acctgtcagc ctctacattc tctcctgacg 61 gaagagtttt tcaagttgaa tatgctatga aggctgtgga aaatagtagt acagctattg 121 gaatcagatg caaagatggt gttgtctttg gggtagaaaa attagtcctt tctaaacttt 181 atgaagaagg ttccaacaaa agacttttta atgttgatcg gcatgttgga atggcagtag 241 caggtttgtt ggcagatgct cgttctttag cagacatagc aagagaagaa gcttccaact 301 tcagatctaa ctttggctac aacattccac taaaacatct tgcagacaga gtggccatgt 361 atgtgcatgc atatacactc tacagtgctg ttagaccttt tggctgcagt ttcatgttag 421 ggtcttacag tgtgaatgac ggtgcgcaac tctacatgat tgacccatca ggtgtttcat 481 acggttattg gggctgtgcc atcggcaaag ccaggcaagc tgcaaagacg gaaatagaga 541 agcttcagat gaaagaaatg acctgccgtg atatcgttaa agaagttgca aaaataattt 601 acatagtaca tgacgaagtt aaggataaag cttttgaact agaactcagc tgggttggtg 661 aattaactaa tggaagacat gaaattgttc caaaagatat aagagaagaa gcagagaaat 721 atgctaagga atctctgaag gaagaagatg aatcagatga tgataatatg taacatttac 781 tccagcatct attgtatttt aaatttctac tccagtccaa tgtaactatt tagccctg // LOCUS HUMPSC9 1078 bp mRNA PRI 02-JUL-1991 DEFINITION Human mRNA for proteasome subunit HC9. ACCESSION D00763 NID g220029 KEYWORDS C9; component 9; multicatalytic proteinase complex; proteasome. SOURCE Human, cell line HepG2, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1078) AUTHORS Tamura,T., Lee,D.H., Osaka,F., Fujiwara,T., Shin,S., Chung,C.H., Tanaka,K. and Ichihara,A. TITLE Molecular cloning and sequence analysis of cDNAs for five major subunits of human proteasomes (multi-catalytic proteinase complexes) JOURNAL Biochim. Biophys. Acta 1089 (1), 95-102 (1991) MEDLINE 91223105 COMMENT These data kindly submitted in computer readable form by: Keiji Tanaka Institute for Enzyme Research The University of Tokushima 3 Kuramoto-cho Tokushima 770 Japan Phone: 0886-31-3111 x2562 Fax: 0886-33-0771. FEATURES Location/Qualifiers source 1..1078 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 60..845 /note="proteasome subunit C9" /codon_start=1 /db_xref="PID:d1001118" /db_xref="PID:g220030" /translation="MSRRYDSRTTIFSPEGRLYQVEYAMEAIGHAGTCLGILANDGVL LAAERRNIHKLLDEVFFSEKIYKLNEDMACSVAGITSDANVLTNELRLIAQRYLLQYQ EPIPCEQLVTALCDIKQAYTQFGGKRPFGVSLLYIGWDKHYGFQLYQSDPSGNYGGWK ATCIGNNSAAAVSMLKQDYKEGEMTLKSALALAIKVLNKTMDVSKLSAEKVEIATLTR ENGKTVIRVLKQKEVEQLIKKHEEEEAKAEREKKEKEQKEKDK" BASE COUNT 354 a 192 c 235 g 297 t ORIGIN 1 ccgtggacat ctcaggtctt cagggtcttc catctggaac tatataaagt tcagaaaaca 61 tgtctcgaag atatgactcc aggaccacta tattttctcc agaaggtcgc ttataccaag 121 ttgaatatgc catggaagct attggacatg caggcacctg tttgggaatt ttagcaaatg 181 atggtgtttt gcttgcagca gagagacgca acatccacaa gcttcttgat gaagtctttt 241 tttctgaaaa aatttataaa ctcaatgagg acatggcttg cagtgtggca ggcataactt 301 ctgatgctaa tgttctgact aatgaactaa ggctcattgc tcaaaggtat ttattacagt 361 atcaggagcc aataccttgt gagcagttgg ttacagcact gtgtgatatc aaacaagctt 421 atacacaatt tggaggaaaa cgtccctttg gtgtttcatt gctgtacatt ggctgggata 481 agcactatgg ctttcagctc tatcagagtg accctagtgg aaattacggg ggatggaagg 541 ccacatgcat tggaaataat agcgctgcag ctgtgtcaat gttgaaacaa gactataaag 601 aaggagaaat gaccttgaag tcagcacttg ctttagctat caaagtacta aataagacca 661 tggatgttag taaactctct gctgaaaaag tggaaattgc aacactaaca agagagaatg 721 gaaagacagt aatcagagtt ctcaaacaaa aagaagtgga gcagttgatc aaaaaacatg 781 aggaagaaga agccaaagct gagcgtgaga agaaagaaaa agaacagaaa gaaaaggata 841 aatagaatca gagattttat tactcatttg gggcaccatt tcagtgtaaa agcagtccta 901 ctcttccaca ctaggaaggc tttacttttt ttaactggtg cagtgggaaa ataggacatt 961 acatactgaa ttgggtcctt gtcatttctg tccaattgaa tactttattg taacgatgat 1021 ggttaccctt catggacgtc ttaatcttcc acacacatcc cctttttttg gaataaaa // LOCUS HUMPSH1 692 bp mRNA PRI 13-JUN-1996 DEFINITION Human mRNA for proteasome subunit HsC10-II, complete cds. ACCESSION D26598 NID g565646 KEYWORDS proteasome subunit HsC10-II. SOURCE Homo sapiens cell-line K562 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 692) AUTHORS Nothwang,H.G., Tamura,T., Tanaka,K. and Ichihara,A. TITLE Sequence analyses and inter-species comparisons of three novel human proteasomal subunits, HsN3, HsC7-I and HsC10-II, confine potential proteolytic active-site residues JOURNAL Biochim. Biophys. Acta 1219 (2), 361-368 (1994) MEDLINE 95002149 REFERENCE 2 (bases 1 to 692) AUTHORS Tanaka,K. TITLE Direct Submission JOURNAL Submitted (24-JAN-1994) to the DDBJ/EMBL/GenBank databases. Keiji Tanaka, The University of Tokushima, Institute for Enzyme Research; 3-18-15 Kuramoto-cho, Tokushima, Tokushima 770, Japan (Tel:0886-31-3111(ex.2563), Fax:0886-33-4223) COMMENT Submitted (24-Jan-1994) to DDBJ by: Keiji Tanaka Inst. for Enz. Res. The University of Tokushima 3-18-15 Kuramoto-cho, Tokushima Tokushima 770 Japan Phone: 0886-31-3111 x2563 Fax: 0886-33-4223. FEATURES Location/Qualifiers source 1..692 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="K562" CDS 18..635 /codon_start=1 /product="proteasome subunit HsC10-II" /db_xref="PID:d1006190" /db_xref="PID:g565647" /translation="MSIMSYNGGAVMAMKGKNCVAIAADRRFGIQAQLVTTDFQKIFP MGDRLYIGLAGLATDVQTVAQRLKFRLNLYELKEGRQIKPYTLMSMVANLLYEKRFGP YYTEPVIAGLDPKTFKPFICSLDLIGCPMVTDDFVVSGTCAEQMYGMCESLWEPNMDP DHLFETISQAMLNAVDRDAVSGMGVIVHIIEKDKITTRTLKARMD" polyA_signal 673..678 BASE COUNT 159 a 190 c 178 g 165 t ORIGIN 1 cctagtacac cgcaatcatg tctattatgt cctataacgg aggggccgtc atggccatga 61 aggggaagaa ctgtgtggcc atcgctgcag acaggcgctt cgggatccag gcccagttgg 121 tgaccacgga cttccagaag atctttccca tgggtgaccg gctgtacatc ggtctggccg 181 ggctcgccac tgacgtccag acagttgccc agcgcctcaa gttccggctg aacctgtatg 241 agttgaagga aggtcggcag atcaaacctt ataccctcat gagcatggtg gccaacctct 301 tgtatgagaa acggtttggc ccttactaca ctgagccagt cattgccggg ttggacccga 361 agacctttaa gcccttcatt tgctctctag acctcatcgg ctgccccatg gtgactgatg 421 actttgtggt cagtggcacc tgcgccgaac aaatgtacgg aatgtgtgag tccctctggg 481 agcccaacat ggatccggat cacctgtttg aaaccatctc ccaagccatg ctgaatgctg 541 tggaccggga tgcagtgtca ggcatgggag tcattgtcca catcatcgag aaggacaaaa 601 tcaccaccag gacactgaag gcccgaatgg actaaccctg ttcccagagc ccactttttt 661 ttcttttttt gaaataaaat agcctgtctt tc // LOCUS HUMPSH2 762 bp mRNA PRI 13-JUN-1996 DEFINITION Human mRNA for proteasome subunit HsC7-I, complete cds. ACCESSION D26599 NID g565648 KEYWORDS proteasome subunit HsC7-I. SOURCE Homo sapiens cell-line K562 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 762) AUTHORS Nothwang,H.G., Tamura,T., Tanaka,K. and Ichihara,A. TITLE Sequence analyses and inter-species comparisons of three novel human proteasomal subunits, HsN3, HsC7-I and HsC10-II, confine potential proteolytic active-site residues JOURNAL Biochim. Biophys. Acta 1219 (2), 361-368 (1994) MEDLINE 95002149 REFERENCE 2 (bases 1 to 762) AUTHORS Tanaka,K. TITLE Direct Submission JOURNAL Submitted (24-JAN-1994) to the DDBJ/EMBL/GenBank databases. Keiji Tanaka, The University of Tokushima, Institute for Enzyme Research; 3-18-15 Kuramoto-cho, Tokushima, Tokushima 770, Japan (Tel:0886-31-3111(ex.2563), Fax:0886-33-4223) COMMENT Submitted (24-Jan-1994) to DDBJ by: Keiji Tanaka Inst. for Enz. Res. The University of Tokushima 3-18-15 Kuramoto-cho, Tokushima Tokushima 770 Japan Phone: 0886-31-3111 x2563 Fax: 0886-33-4223. FEATURES Location/Qualifiers source 1..762 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="K562" CDS 30..635 /codon_start=1 /product="proteasome subunit HsC7-I" /db_xref="PID:d1006191" /db_xref="PID:g565649" /translation="MEYLIGIQGPDYVLVASDRVAASNIVQMKDDHDKMFKMSEKILL LCVGEAGDTVQFAEYIQKNVQLYKMRNGYELSPTAAANFTRRNLADCLRSRTPYHVNL LLAGYDEHEGPALYYMDYLAALAKAPFAAHGYGAFLTLSILDRYYTPTISRERAVELL RKCLEELQKRFILNLPTFSVRIIDKNGIHDLDNISFPKQGS" polyA_signal 738..743 BASE COUNT 190 a 201 c 171 g 200 t ORIGIN 1 cggacctgca gccctggcct tccgccacca tggagtacct catcggtatc caaggccccg 61 actatgttct tgtcgcctcc gaccgggtgg ccgccagcaa tattgtccag atgaaggacg 121 atcatgacaa gatgtttaag atgagtgaaa agatattact cctgtgtgtt ggagaggctg 181 gagacactgt acagtttgca gaatatattc agaaaaacgt gcaactttat aagatgcgaa 241 atggatatga attgtctccc acggcagcag ctaacttcac acgccgaaac ctggctgact 301 gtcttcggag tcggacccca tatcatgtga acctcctcct ggctggctat gatgagcatg 361 aagggccagc gctgtattac atggactacc tggcagcctt ggccaaggcc ccttttgcag 421 cccacggcta tggtgccttc ctgactctca gtatcctcga ccgatactac acaccgacta 481 tctcacgtga gagggcagtg gaactcctta ggaaatgtct ggaggagctc cagaaacgct 541 tcatcctgaa tctgccaacc ttcagtgttc gaatcattga caaaaatggc atccatgacc 601 tggataacat ttccttcccc aaacagggct cctaacatca tgtcctccct cccacttgcc 661 agggaacttt tttttgatgg gctcctttat ttttttctac tcttttcagg cgcactcttg 721 ataaatggtt aattcagaat aaaggtgact atggatataa tt // LOCUS HUMPSH3 925 bp mRNA PRI 13-JUN-1996 DEFINITION Human mRNA for proteasome subunit HsN3, complete cds. ACCESSION D26600 NID g565650 KEYWORDS proteasome subunit HsN3. SOURCE Homo sapiens cell-line K562 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 925) AUTHORS Nothwang,H.G., Tamura,T., Tanaka,K. and Ichihara,A. TITLE Sequence analyses and inter-species comparisons of three novel human proteasomal subunits, HsN3, HsC7-I and HsC10-II, confine potential proteolytic active-site residues JOURNAL Biochim. Biophys. Acta 1219 (2), 361-368 (1994) MEDLINE 95002149 REFERENCE 2 (bases 1 to 925) AUTHORS Tanaka,K. TITLE Direct Submission JOURNAL Submitted (24-JAN-1994) to the DDBJ/EMBL/GenBank databases. Keiji Tanaka, The University of Tokushima, Institute for Enzyme Research; 3-18-15 Kuramoto-cho, Tokushima, Tokushima 770, Japan (Tel:0886-31-3111(ex.2563), Fax:0886-33-4223) COMMENT Submitted (24-Jan-1994) to DDBJ by: Keiji Tanaka Inst. for Enz. Res. The University of Tokushima 3-18-15 Kuramoto-cho, Tokushima Tokushima 770 Japan Phone: 0886-31-3111 x2563 Fax: 0886-33-4223. FEATURES Location/Qualifiers source 1..925 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="K562" CDS 24..818 /codon_start=1 /product="proteasome subunit HsN3" /db_xref="PID:d1006192" /db_xref="PID:g565651" /translation="MEAFLGSRSGLWAGGPAPGQFYRIPSTPDSFMDPASALYRGPIT RTQNPMVTGTSVLGVKFEGGVVIAADMLGSYGSLARFRNISRIMRVNNSTMLGASGDY ADFQYLKQVLGQMVIDEELLGDGHSYSPRAIHSWLTRAMYSRRSKMNPLWNTMVIGGY ADGESFLGYVDMLGVAYEAPSLATGYGAYLAQPLLREVLEKQPVLSQTEARDLVERCM RVLYYRDARSYNRFQTATVTEKGVEIEGPLSTETNWDIAHMISGFE" polyA_signal 908..913 BASE COUNT 211 a 231 c 247 g 236 t ORIGIN 1 ttttttctgc taccgtgact aagatggaag cgtttttggg gtcgcggtcc ggactttggg 61 cggggggtcc ggccccagga cagttttacc gcattccgtc cactcccgat tccttcatgg 121 atccggcgtc tgcactttac agaggtccaa tcacgcggac ccagaacccc atggtgaccg 181 ggacctcagt cctcggcgtt aagttcgagg gcggagtggt gattgccgca gacatgctgg 241 gatcctacgg ctccttggct cgtttccgca acatctctcg cattatgcga gtcaacaaca 301 gtaccatgct gggtgcctct ggcgactacg ctgatttcca gtatttgaag caagttctcg 361 gccagatggt gattgatgag gagcttctgg gagatggaca cagctatagt cctagagcta 421 ttcattcatg gctgaccagg gccatgtaca gccggcgctc gaagatgaac cctttgtgga 481 acaccatggt catcggaggc tatgctgatg gagagagctt cctcggttat gtggacatgc 541 ttggtgtagc ctatgaagcc ccttcgctgg ccactggtta tggtgcatac ttggctcagc 601 ctctgctgcg agaagttctg gagaagcagc cagtgctaag ccagaccgag gcccgcgact 661 tagtagaacg ctgcatgcga gtgctgtact accgagatgc ccgttcttac aaccggtttc 721 aaaccgccac tgtcaccgaa aaaggtgttg aaatagaggg accattgtct acagagacca 781 actgggatat tgcccacatg atcagtggct ttgaatgaaa tacagatgca ttatccagaa 841 ctgaagttgc cctactttta actttgaact tggctagttc aaagatagac tcttcttttg 901 taaagtaaat aaattcttca aaatg // LOCUS HUMPSM 2653 bp mRNA PRI 08-JAN-1995 DEFINITION Human prostate-specific membrane antigen (PSM) mRNA, complete cds. ACCESSION M99487 NID g190663 KEYWORDS prostate-specific membrane antigen. SOURCE Homo sapiens (tissue library: LNCaP cDNA of Ron Israeli) male prostatic carcinoma metastatic lymph node cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2653) AUTHORS Israeli,R.S., Powell,C.T., Fair,W.R. and Heston,W.D. TITLE Molecular cloning of a complementary DNA encoding a prostate-specific membrane antigen JOURNAL Cancer Res. 53 (2), 227-230 (1993) MEDLINE 93113576 FEATURES Location/Qualifiers source 1..2653 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="LNCaP-ATCC" /cell_type="prostate" /germline /sex="male" /tissue_type="prostatic carcinoma metastatic lymph node" /tissue_lib="LNCaP cDNA of Ron Israeli" gene 262..2514 /gene="PSM" CDS 262..2514 /gene="PSM" /codon_start=1 /evidence=experimental /product="prostate- specific membrane antigen" /db_xref="PID:g190664" /translation="MWNLLHETDSAVATARRPRWLCAGALVLAGGFFLLGFLFGWFIK SSNEATNITPKHNMKAFLDELKAENIKKFLYNFTQIPHLAGTEQNFQLAKQIQSQWKE FGLDSVELAHYDVLLSYPNKTHPNYISIINEDGNEIFNTSLFEPPPPGYENVSDIVPP FSAFSPQGMPEGDLVYVNYARTEDFFKLERDMKINCSGKIVIARYGKVFRGNKVKNAQ LAGAKGVILYSDPADYFAPGVKSYPDGWNLPGGGVQRGNILNLNGAGDPLTPGYPANE YAYRRGIAEAVGLPSIPVHPIGYYDAQKLLEKMGGSAPPDSSWRGSLKVPYNVGPGFT GNFSTQKVKMHIHSTNEVTRIYNVIGTLRGAVEPDRYVILGGHRDSWVFGGIDPQSGA AVVHEIVRSFGTLKKEGWRPRRTILFASWDAEEFGLLGSTEWAEENSRLLQERGVAYI NADSSIEGNYTLRVDCTPLMYSLVHNLTKELKSPDEGFEGKSLYESWTKKSPSPEFSG MPRISKLGSGNDFEVFFQRLGIASGRARYTKNWETNKFSGYPLYHSVYETYELVEKFY DPMFKYHLTVAQVRGGMVFELANSIVLPFDCRDYAVVLRKYADKIYSISMKHPQEMKT YSVSFDSLFSAVKNFTEIASKFSERLQDFDKSNPIVLRMMNDQLMFLERAFIDPLGLP DRPFYRHVIYAPSSHNKYAGESFPGIYDALFDIESKVDPSKAWGEVKRQIYVAAFTVQ AAAETLSEVA" BASE COUNT 782 a 524 c 640 g 707 t ORIGIN 1 ctcaaaaggg gccggatttc cttctcctgg aggcagatgt tgcctctctc tctcgctcgg 61 attggttcag tgcactctag aaacactgct gtggtggaga aactggaccc caggtctgga 121 gcgaattcca gcctgcaggg ctgataagcg aggcattagt gagattgaga gagactttac 181 cccgccgtgg tggttggagg gcgcgcagta gagcagcagc acaggcgcgg gtcccgggag 241 gccggctctg ctcgcgccga gatgtggaat ctccttcacg aaaccgactc ggctgtggcc 301 accgcgcgcc gcccgcgctg gctgtgcgct ggggcgctgg tgctggcggg tggcttcttt 361 ctcctcggct tcctcttcgg gtggtttata aaatcctcca atgaagctac taacattact 421 ccaaagcata atatgaaagc atttttggat gaattgaaag ctgagaacat caagaagttc 481 ttatataatt ttacacagat accacattta gcaggaacag aacaaaactt tcagcttgca 541 aagcaaattc aatcccagtg gaaagaattt ggcctggatt ctgttgagct agcacattat 601 gatgtcctgt tgtcctaccc aaataagact catcccaact acatctcaat aattaatgaa 661 gatggaaatg agattttcaa cacatcatta tttgaaccac ctcctccagg atatgaaaat 721 gtttcggata ttgtaccacc tttcagtgct ttctctcctc aaggaatgcc agagggcgat 781 ctagtgtatg ttaactatgc acgaactgaa gacttcttta aattggaacg ggacatgaaa 841 atcaattgct ctgggaaaat tgtaattgcc agatatggga aagttttcag aggaaataag 901 gttaaaaatg cccagctggc aggggccaaa ggagtcattc tctactccga ccctgctgac 961 tactttgctc ctggggtgaa gtcctatcca gatggttgga atcttcctgg aggtggtgtc 1021 cagcgtggaa atatcctaaa tctgaatggt gcaggagacc ctctcacacc aggttaccca 1081 gcaaatgaat atgcttatag gcgtggaatt gcagaggctg ttggtcttcc aagtattcct 1141 gttcatccaa ttggatacta tgatgcacag aagctcctag aaaaaatggg tggctcagca 1201 ccaccagata gcagctggag aggaagtctc aaagtgccct acaatgttgg acctggcttt 1261 actggaaact tttctacaca aaaagtcaag atgcacatcc actctaccaa tgaagtgaca 1321 agaatttaca atgtgatagg tactctcaga ggagcagtgg aaccagacag atatgtcatt 1381 ctgggaggtc accgggactc atgggtgttt ggtggtattg accctcagag tggagcagct 1441 gttgttcatg aaattgtgag gagctttgga acactgaaaa aggaagggtg gagacctaga 1501 agaacaattt tgtttgcaag ctgggatgca gaagaatttg gtcttcttgg ttctactgag 1561 tgggcagagg agaattcaag actccttcaa gagcgtggcg tggcttatat taatgctgac 1621 tcatctatag aaggaaacta cactctgaga gttgattgta caccgctgat gtacagcttg 1681 gtacacaacc taacaaaaga gctgaaaagc cctgatgaag gctttgaagg caaatctctt 1741 tatgaaagtt ggactaaaaa aagtccttcc ccagagttca gtggcatgcc caggataagc 1801 aaattgggat ctggaaatga ttttgaggtg ttcttccaac gacttggaat tgcttcaggc 1861 agagcacggt atactaaaaa ttgggaaaca aacaaattca gcggctatcc actgtatcac 1921 agtgtctatg aaacatatga gttggtggaa aagttttatg atccaatgtt taaatatcac 1981 ctcactgtgg cccaggttcg aggagggatg gtgtttgagc tagccaattc catagtgctc 2041 ccttttgatt gtcgagatta tgctgtagtt ttaagaaagt atgctgacaa aatctacagt 2101 atttctatga aacatccaca ggaaatgaag acatacagtg tatcatttga ttcacttttt 2161 tctgcagtaa agaattttac agaaattgct tccaagttca gtgagagact ccaggacttt 2221 gacaaaagca acccaatagt attaagaatg atgaatgatc aactcatgtt tctggaaaga 2281 gcatttattg atccattagg gttaccagac aggccttttt ataggcatgt catctatgct 2341 ccaagcagcc acaacaagta tgcaggggag tcattcccag gaatttatga tgctctgttt 2401 gatattgaaa gcaaagtgga cccttccaag gcctggggag aagtgaagag acagatttat 2461 gttgcagcct tcacagtgca ggcagctgca gagactttga gtgaagtagc ctaagaggat 2521 tctttagaga atccgtattg aatttgtgtg gtatgtcact cagaaagaat cgtaatgggt 2581 atattgataa attttaaaat tggtatattt gaaataaagt tgaatattat atataaaaaa 2641 aaaaaaaaaa aaa // LOCUS HUMPSOR 415 bp mRNA PRI 08-JAN-1995 DEFINITION Human psoriasin mRNA, complete cds. ACCESSION M86757 NID g190667 KEYWORDS calpactin; psoriasin. SOURCE Homo sapiens (tissue library: lambda gt11) epidermis cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 415) AUTHORS Madsen,P.S., Rasmussen,H.H., Leffers,H., Honore,B., Dejgaard,K., Olsen,E., Kiil,J., Walbum,E., Andersen,A., Basse,B., Lauridsen,J.B., Ratz,G., Celis,A., Vandekerckhove,J.S. and Celis,J.E. TITLE Molecular cloning, occurrence, and expression of a novel partially secreted protein 'psoriasin' that is highly up-regulated in psoriatic skin JOURNAL J. Invest. Dermatol. 97 (4), 701-712 (1991) MEDLINE 92043866 FEATURES Location/Qualifiers source 1..415 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="keratinocytes" /tissue_type="epidermis" /tissue_lib="lambda gt11" gene 50..355 /gene="psoriasin" CDS 50..355 /gene="psoriasin" /codon_start=1 /product="psoriasin" /db_xref="PID:g190668" /translation="MSNTQAERSIIGMIDMFHKYTRRDDKIDKPSLLTMMKENFPNFL SACDKKGTNYLADVFEKKDKNEDKKIDFSEFLSLLGDIATDYHKQSHGAAPCSGGSQ" BASE COUNT 126 a 108 c 95 g 86 t ORIGIN 1 aattcttcta ctcgtgacgc ttcccagctc tggctttttg aaagcaaaga tgagcaacac 61 tcaagctgag aggtccataa taggcatgat cgacatgttt cacaaataca ccagacgtga 121 tgacaagatt gacaagccaa gcctgctgac gatgatgaag gagaacttcc ccaacttcct 181 tagtgcctgt gacaaaaagg gcacaaatta cctcgccgac gtctttgaga aaaaggacaa 241 gaatgaggat aagaagattg atttttctga gtttctgtcc ttgctgggag acatagccac 301 agactaccac aagcagagcc atggagcagc gccctgttcc gggggcagcc agtgacccag 361 ccccaccaat gggcctccag agacccagga acaataaaat gtcttctccc accag // LOCUS HUMPSP31 928 bp mRNA PRI 08-FEB-1996 DEFINITION Human mRNA for 26S proteasome subunit p31, complete cds. ACCESSION D38047 NID g1037163 KEYWORDS 26S protease subunit p31. SOURCE Homo sapiens cell-line HepG2 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 928) AUTHORS Kominami,K., DeMartino,G., Moomaw,C., Slaughter,C., Shimbara,N., Fujimuro,M., Yokosawa,H., Hisamatsu,H., Tanahashi,N., Shimizu,Y., Tanaka,K. and Toh-e,A. TITLE Nin1p, a regulatory subunit of the 26S proteasome, is necessary for activation of Cdc28p kinase of Saccharomyces cerevisiae JOURNAL EMBO J. 14 (13), 3105-3115 (1995) MEDLINE 95347337 REFERENCE 2 (bases 1 to 928) AUTHORS Tanaka,K. TITLE Direct Submission JOURNAL Submitted (25-AUG-1994) to the DDBJ/EMBL/GenBank databases. Keiji Tanaka, Inst. for Enz. Res. The Univ. of Tokushima; 3-18-15 Kuramoto-cho, Tokushima, Tokushima 770, Japan (E-mail:ketanaka@ddbj.nig.ac.jp, Tel:0886-33-7430, Fax:0886-33-7431) COMMENT Submitted (25-Aug-1994) to DDBJ by: Keiji Tanaka Inst. for Enz. Res. The Univ. of Tokushima 3-18-15 Kuramoto-cho Tokushima, Tokushima 770 Japan Phone: 0886-31-3111 x2563 Fax: 0886-33-4223 Email: ketanaka@ddbj.nig.ac.jp. FEATURES Location/Qualifiers source 1..928 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HepG2" CDS 71..844 /codon_start=1 /product="26S proteasome subunit p31" /db_xref="PID:d1007815" /db_xref="PID:g1037164" /translation="MYEQLKGEWNRKSPNLSKCGEELGRLKLVLLELNFLPTTGTKLT KQQLILARDILEIGAQWSILRKDIPSFERYMAQLKCYYFDYKEQLPESAYMHQLLGLN LLFLLSQNRVAEFHTELERLPAKDIQTNVYIKHPVSLEQYLMEGSYNKVFLAKGNIPA ESYTFFIDILLDTIRDEIAGCIEKAYEKILFTEATRILFFNTPKKMTDYAKKRGWVLG PNNYYSFASQQQKPEDTTIPSTELAKQVIEYARQLEMIV" polyA_signal 912..917 BASE COUNT 230 a 282 c 245 g 171 t ORIGIN 1 cggggcggca ggcttctcga gctccgggcc cgcggcaacc tcgggcgctg ttctgcaggc 61 cgcgaccggc atgtacgagc aactcaaggg cgagtggaac cgtaaaagcc ccaatcttag 121 caagtgcggg gaagagctgg gtcgactcaa gctagttctt ctggagctca acttcttgcc 181 aaccacaggg accaagctga ccaaacagca gctaattctg gcccgtgaca tactggagat 241 cggggcccaa tggagcatcc tacgcaagga catcccctcc ttcgagcgct acatggccca 301 gctcaaatgc tactactttg attacaagga gcagctcccc gagtcagcct atatgcacca 361 gctcttgggc ctcaacctcc tcttcctgct gtcccagaac cgggtggctg agttccacac 421 ggagttggag cggctgcctg ccaaggacat acagaccaat gtctacatca agcacccagt 481 gtccctggag caatacctga tggagggcag ctacaacaaa gtgttcctgg ccaagggtaa 541 catccccgcc gagagctaca ccttcttcat tgacatcctg ctcgacacta tcagggatga 601 gatcgctggg tgcatcgaga aggcctacga gaaaatcctt ttcactgagg ccacccggat 661 cctcttcttc aacacaccca aaaagatgac agactacgcc aagaagcgag ggtgggtcct 721 gggccccaac aactactaca gttttgccag ccagcagcag aagccggaag acaccaccat 781 tccctccaca gaactggcca aacaggtcat cgagtatgcc cggcagctgg agatgatcgt 841 ctgagccccc cgggcactgg gtggggcagg gcacgagtta tttaaaacag ttacactgca 901 gggtttcgcc caataaaggt ggactgac // LOCUS HUMPSPBQ 1690 bp mRNA PRI 16-MAY-1996 DEFINITION Human surfactant protein B mRNA, complete cds. ACCESSION L11573 L22610 NID g1220354 KEYWORDS pulmonary surfactant protein B. SOURCE Homo sapiens lung cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1690) AUTHORS Luzi,P., Anceschi,M. and Strayer,D.S. TITLE Glucocorticoid responsiveness conferred by a cloned DNA binding protein JOURNAL Receptor 5 (2), 93-103 (1995) MEDLINE 96090545 FEATURES Location/Qualifiers source 1..1690 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="lung" CDS 333..1223 /codon_start=1 /product="33.1 kDa protein" /db_xref="PID:g1220355" /translation="MMGRTHGVHAEPTTFGLKMARFHASATRAIERFDRVAAEVETGK LSGAVGTFANVPPYVEAVAMKELGLTPQPIGSQVLPRDLHADYVQTIALIGTQMEELA TEIRSLQRSEIHEVEEGFAKGQKGSSAMPHKRNPIGNENITGLARVLRGYAVTALEDV TLWHERDISHSSAERIILPDATTTLDYMLNRQTGILKNLGVFPEKMRHNMDRTYGLIY SQRLLLSLIDAGLSREQAYDTVQPLTARSWDEQLMFRDLVDADPTITAHLTKAQIDDA FDYHYHLRHVDEIFKRVGLA" CDS 1403..1690 /partial /note="ORF" /codon_start=1 /label=ORF /db_xref="PID:g1220356" /translation="MPLDVDFDFVDVKSYSGAASTGQVKVVHDVSMDLTGRDVVIVDE IIDSGRTMQWLQNYFELKGAASVTTVALADKKAARVVDFDVDYFGLDVPDEF" BASE COUNT 465 a 339 c 421 g 465 t ORIGIN 1 ggaattccgg tgctgtggat gccggctggg tcgctgcggg gcacattcct gcagaagatt 61 tggaaaaaat ccgtcaaaac gccacttttg acgttgaccg cattgcagag attgagttat 121 caacgcgcca tgatgttgtg gcatttaccc gtaacgtgtc agaatcactt ggcgaagaac 181 gtaagtggat tcactatggc ttaacgtcaa ccgatgttgt tgatacagcg caagcattac 241 gtttgcgtca agccaacgat attattaaac aagatttgca agaatggcgc gacgccatta 301 aagatttggc cttgaagtat aaagacactg tcatgatggg acgcacacac ggtgtacatg 361 ccgaaccaac cacttttggc ttgaagatgg cacgcttcca tgcgtcagca acacgcgcga 421 ttgaacgttt tgatcgggtg gctgctgaag tcgaaaccgg taagttatct ggtgccgtag 481 gcacgtttgc caatgtgcca ccttatgtcg aagccgtggc catgaaggaa ttgggcttga 541 cgccacaacc aattgggtca caagtgttac cacgtgattt gcatgctgat tacgtgcaaa 601 cgattgcgtt gattgggaca caaatggaag aattggcaac ggaaattcgc tcattgcaac 661 gctcagaaat tcatgaagtt gaagaaggct ttgctaaagg acaaaagggt tcttcagcaa 721 tgccacacaa gcgtaaccca attggtaatg aaaatattac tggtttggca cgtgtcttgc 781 gtggctatgc cgtaacagca cttgaagatg tgacattgtg gcatgaacgc gatatttcac 841 attcttcagc cgaacgcatt attttgcctg atgcaacgac aacattggat tacatgttga 901 atcgtcaaac aggtattttg aagaatttgg gtgtcttccc tgaaaaaatg cgtcacaata 961 tggatcgcac ttacggtttg atttattcac aacgtttgtt gttgagctta attgatgccg 1021 gcttgtcacg tgaacaagcc tatgatacgg tgcaaccatt gacagcacgt tcatgggatg 1081 aacaattgat gttccgtgac ttggttgatg cggatccaac aatcactgcc catttgacta 1141 aagcacaaat tgatgacgcg tttgattatc actatcactt gcgtcatgtt gatgaaattt 1201 ttaagagagt aggtttggca tgacatcact aattaaccat ccagcaatta agacagtttt 1261 agcaacggaa acagatattc aagcacaagt gcaacgtgtg gcgaatgaac ttaccagtaa 1321 atttgcgcat aatgacaagc ggccagtttt tattgcagtc ctcaagggtg gggtgatttt 1381 tgccacagat ttactccgga aaatgccatt ggatgttgac tttgactttg tcgatgtcaa 1441 aagttattca ggtgctgctt caactggcca agttaaagtg gtccatgacg tgagcatgga 1501 tttaacagga cgtgatgtcg tgatcgtcga tgaaattatt gattctggtc ggacgatgca 1561 atggttgcaa aactattttg aactcaaagg ggccgcaagt gtgacgacgg tagccttagc 1621 tgataaaaag gccgctcggg tggttgactt tgacgttgat tactttggtc ttgatgtgcc 1681 cgatgaattc // LOCUS HUMPST 1894 bp mRNA PRI 14-AUG-1995 DEFINITION Homo sapiens alpha-2,8-polysialyltransferase (PST) gene, complete cds. ACCESSION L41680 NID g945220 KEYWORDS neural cell adhesion molecule; polysialyltransferase. SOURCE Homo sapiens (clone library: pcDNA1) fetal brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1894) AUTHORS Nakayama,J., Fukuda,M.N., Fredette,B., Ranscht,B. and Fukuda,M. TITLE Expression cloning of a human polysialyltransferase that forms the polysialylated neural cell adhesion molecule present in embryonic brain JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (15), 7031-7035 (1995) MEDLINE 95350205 FEATURES Location/Qualifiers source 1..1894 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="pcDNA1" /tissue_type="fetal brain" gene 213..1292 /gene="PST" CDS 213..1292 /gene="PST" /codon_start=1 /function="polysialylation of neural cell adhesion molecule (N-CAM)" /product="alpha-2,8-polysialyltransferase" /db_xref="PID:g945221" /translation="MRSIRKRWTICTISLLLIFYKTKEIARTEEHQETQLIGDGELSL SRSLVNSSDKIIRKAGSSIFQHNVEGWKINSSLVLEIRKNILRFLDAERDVSVVKSSF KPGDVIHYVLDRRRTLNISHDLHSLLPEVSPMKNRRFKTCAVVGNSGILLDSECGKEI DSHNFVIRCNLAPVVEFAADVGTKSDFITMNPSVVQRAFGGFRNESDREKFVHRLSML NDSVLWIPAFMVKGGEKHVEWVNALILKNKLKVRTAYPSLRLIHAVRGYWLTNKVPIK RPSTGLLMYTLATRFCDEIHLYGFWPFPKDLNGKAVKYHYYDDLKYRYFSNASPHRMP LEFKTLNVLHNRGALKLTTGKCVKQ" BASE COUNT 592 a 377 c 418 g 507 t ORIGIN 1 cgcaaacagg gcgagaggtc gctgggcagc gttcgaggac cagagggagc tcggccacag 61 aagaccccag tgatctgatc ccgggatccc ggctccaagc tctcctcgca ttttacagat 121 ttcacccccg cgactatctc cccaaaacgg agcctttata tcaagagaag gtgcgggagc 181 tggggcaacc aggactttct cgggcaccca agatgcgctc cattaggaag aggtggacga 241 tctgcacaat aagtctgctc ctgatctttt ataagacaaa agaaatagca agaactgagg 301 agcaccagga gacgcaactc atcggagatg gtgaattgtc tttgagtcgg tcacttgtca 361 atagctctga taaaatcatt cgaaaggctg gctcttcaat cttccagcac aatgtagaag 421 gttggaaaat caattcctct ttggtcctag agataaggaa gaacatactt cgtttcttag 481 atgcagaacg agatgtgtca gtggtcaaga gcagttttaa gcctggtgat gtcatacact 541 atgtgcttga caggcgccgg acactaaaca tttctcatga tctacatagc ctcctacctg 601 aagtttcacc aatgaagaat cgcaggttta agacctgtgc agttgttgga aattctggca 661 ttctgttaga cagtgaatgt ggaaaggaga ttgacagtca caattttgta ataaggtgta 721 atctagctcc tgtggtggag tttgctgcag atgtgggaac taaatcagat tttattacca 781 tgaatccatc agttgtacaa agagcatttg gaggctttcg aaatgagagt gacagagaaa 841 aatttgtgca tagactttcc atgctgaatg acagtgtcct ttggattcct gctttcatgg 901 tcaaaggagg agagaagcac gtggagtggg ttaatgcatt aatccttaag aataaactga 961 aagtgcgaac tgcctatccg tcattgagac ttattcatgc tgtcagaggt tactggctga 1021 ccaacaaagt tcctatcaaa agacccagca caggtcttct catgtataca cttgccacaa 1081 gattctgtga tgaaattcac ctgtatggat tctggccctt ccctaaggat ttaaatggaa 1141 aagcggtcaa atatcattat tatgatgact taaaatatag gtacttttcc aatgcaagcc 1201 ctcacagaat gccattagaa ttcaaaacat taaatgtgct acataataga ggagctctaa 1261 aactgacaac aggaaagtgt gtaaagcaat aaagcacatt ttgaaacaaa caatatgcac 1321 ttcttttctg agatgcttcc gaagatttga aaataggatc caaaacacgg ctgggtttca 1381 gcatccacca atgaactgaa aggtgaataa aggacgttca tgagaaatcg actaccagct 1441 gatgaaatac ctgcaaagtg ctctaaaaat taaatatttt gactttaagg gtcctagtaa 1501 gtgccacttc cactaagaat acagtttgaa tgtataatca gtagtgttta caagatccaa 1561 cagtgcactc atcattagtt aacaaagcaa atatgttcat cactgtcagg ctgcccacag 1621 caacaccaag catattagaa gaggaacccc aggaacgcaa ctcagacctt gggaaattaa 1681 accatccttg tcagcagaag ccaagatgga agcagtttga gcaatgaaat ccgtaagatt 1741 aaacaactca agtaaatgct tcagtcagga ctctgagtct gatcatgaat tttatgtttt 1801 aatttatgtt tttttttttg tcttctggaa tctcttttgg tttggatatt gggatgctta 1861 gaaatccttt ctgagatgca tatgagtgag gaaa // LOCUS HUMPSY 801 bp mRNA PRI 13-JUN-1996 DEFINITION Human mRNA for proteasome subunit Y, complete cds. ACCESSION D29012 NID g558527 KEYWORDS proteasome subunit Y. SOURCE Homo sapiens cell_line:HepG2 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Akiyama,K., Yokota,K., Kagawa,S., Shimbara,N., Tamura,T., Akioka,H., Nothwang,H.G., Noda,C., Tanaka,K. and Ichihara,A. TITLE cDNA cloning and interferon gamma down-regulation of proteasomal subunits X and Y JOURNAL Science 265 (5176), 1231-1234 (1994) MEDLINE 94345396 REFERENCE 2 (bases 1 to 801) AUTHORS Tanaka,K. TITLE Direct Submission JOURNAL Submitted (17-MAR-1994) to the DDBJ/EMBL/GenBank databases. Keiji Tanaka, The University of Tokushima, Institute for Enzyme Research; 3-18-15 Kuramoto-cho, Tokushima, Tokushima 770, Japan (Tel:0886-31-3111(ex.2563), Fax:0886-33-4223) COMMENT Submitted (17-Mar-1994) to DDBJ by: Keiji Tanaka Institute for Enzyme Research The University of Tokushima 3-18-15 Kuramoto-cho, Tokushima Tokushima 770 Japan Phone: 0886-31-3111 x2563 Fax: 0886-33-4223. FEATURES Location/Qualifiers source 1..801 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HepG2" CDS 10..729 /codon_start=1 /product="proteasome subunit Y" /db_xref="PID:d1006652" /db_xref="PID:g558528" /translation="MAATLLAARGAGPAPAWGPEAFTPDWESREVSTGTTIMAVQFDG GVVLGADSRTTTGSYIANRVTDKLTPIHDRIFCCRSGSAADTQAVADAVTYQLGFHSI ELNEPPLVHTAASLFKEMCYRYREDLMAGIIIAGWDPQEGGQGYSVPMGGMMVRQSFA IGGSGSSYIYGYVDATYREGMTKEECLQFTANALALAMERDGSSGGVIRLAAIAESGV ERQVLLGDQIPKFAVATLPPA" polyA_signal 782..787 BASE COUNT 186 a 207 c 236 g 172 t ORIGIN 1 aggagaaaga tggcggctac cttactagct gctcggggag ccgggccagc accggcttgg 61 gggccggagg cattcactcc agactgggaa agccgagaag tttccactgg gaccactatc 121 atggccgtgc agtttgacgg gggcgtggtt ctgggggcgg actccagaac aaccactggg 181 tcctacatcg ccaatcgagt gactgacaag ctgacaccta ttcacgaccg cattttctgc 241 tgtcgctcag gctcagctgc tgatacccag gcagtagctg atgctgtcac ctaccagctc 301 ggtttccaca gcattgaact gaatgagcct ccactggtcc acacagcagc cagcctcttt 361 aaggagatgt gttaccgata ccgggaagac ctgatggcgg gaatcatcat cgcaggctgg 421 gaccctcaag aaggagggca ggggtactca gtgcctatgg ggggtatgat ggtaaggcag 481 tcctttgcca ttggaggctc cgggagctcc tacatctatg gctatgttga tgctacctac 541 cgggaaggca tgaccaagga agagtgtctg caattcactg ccaatgctct cgctttggcc 601 atggagcggg atggctccag tggaggagtg atccgcctgg cagccattgc agagtcaggg 661 gtagagcggc aagtactttt gggagaccag atacccaaat tcgccgttgc cactttacca 721 cccgcctgaa tcctgggatt ctagtatgca ataagagatg ccctgtactg atgcaaaatt 781 taataaagtt tgtcacagag a // LOCUS HUMPTGIS 1977 bp mRNA PRI 09-FEB-1996 DEFINITION Human mRNA for prostacyclin synthase, complete cds. ACCESSION D38145 NID g537948 KEYWORDS prostacyclin synthase; prostaglandin-I synthase. SOURCE Homo sapiens aorta endothelial cell cDNA to mRNA, clones pHPGIS36 and pHPGIS135. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1977) AUTHORS Miyata,A., Hara,S., Yokoyama,C., Inoue,H., Ullrich,V. and Tanabe,T. TITLE Molecular cloning and expression of human prostacyclin synthase JOURNAL Biochemical and Biophysical Research Communication 200, 1728-1734 (1994) REFERENCE 2 (bases 1 to 1977) AUTHORS Tanabe,T. TITLE Direct Submission JOURNAL Submitted (31-AUG-1994) to the DDBJ/EMBL/GenBank databases. Tadashi Tanabe, National Cardiovascular Center Research Institute, Pharmacology; 5-7-1 Fujishiro-dai, Suita, Osaka 565, Japan (Tel:+81-6-833-5012(ex.2514), Fax:+81-6-872-7485) COMMENT Submitted (31-Aug-1994) to DDBJ by: Tadashi Tanabe National Cardiovascular Center Research Institute Department of Pharmacology 5-7-1 Fujishiro-dai Suita, Osaka 565 Japan Phone: 06-833-5012 x2514 Fax: 06-872-7485. FEATURES Location/Qualifiers source 1..1977 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="endothelial cell" /tissue_type="aorta" 5'UTR <1..27 gene 28..1530 /gene="PTGIS" CDS 28..1530 /gene="PTGIS" /EC_number="5.3.99.4" /codon_start=1 /product="prostacyclin synthase" /db_xref="PID:d1007921" /db_xref="PID:g537949" /translation="MAWAALLGLLAALLLLLLLSRRRTRRPGEPPLDLGSIPWLGYAL DFGKDAASFLTRMKEKHGDIFTILVGGRYVTVLLDPHSYDAVVWEPRTRLDFHAYAIF LMERIFDVQLPHYSPSDEKARMKLTLLHRELQALTEAMYTNLHAVLLGDATEAGSGWH EMGLLDFSYSFLLRAGYLTLYGIEALPRTHESQAQDRVHSADVFHTFRQLDRLLPKLA RGSLSVGDKDHMCSVKSRLWKLLSPARLARRAHRSKWLESYLLHLEEMGVSEEMQARA LVLQLWATQGNMGPAAFWLLLFLLKNPEALAAVRGELESILWQAEQPVSQTTTLPQKV LDSTPVLDSVLSESLRLTAAPFITREVVVDLAMPMADGREFNLRRGDRLLLFPFLSPQ RDPEIYTDPEVFKYNRFLNPDGSEKKDFYKDGKRLKNYNMPWGAGHNHCLGRSYAVNS IKQFVFLVLVHLDLELINADVEIPEFDLSRYGFGLMQPEHDVPVRYRIRP" 3'UTR 1531..>1977 BASE COUNT 416 a 604 c 540 g 417 t ORIGIN 1 agccccgcca gccccgccag ccccgcgatg gcttgggccg cgctcctcgg cctcctggcc 61 gcactgttgc tgctgctgct actgagccgc cgccgcacgc ggcgacctgg tgagcctccc 121 ctggacctgg gcagcatccc ctggttgggg tatgccttgg actttggaaa agatgctgcc 181 agcttcctca cgaggatgaa ggagaagcac ggtgacatct ttactatact ggttgggggc 241 aggtatgtca ccgttctcct ggacccacac tcctacgacg cggtggtgtg ggagcctcgc 301 accaggctcg acttccatgc ctatgccatc ttcctcatgg agaggatttt tgatgtgcag 361 cttccacatt acagccccag tgatgaaaag gccaggatga aactgactct tctccacaga 421 gagctccagg cactcacaga agccatgtat accaacctcc atgcagtgct gttgggcgat 481 gctacagaag caggcagtgg ctggcacgag atgggtctcc tcgacttctc ctacagcttc 541 ctgctcagag ccggctacct gactctttac ggaattgagg cgctgccacg cacccatgaa 601 agccaggccc aggaccgcgt ccactcagct gatgtcttcc acacctttcg ccagctcgac 661 cggctgctcc ccaaactggc ccgtggctcc ctgtcagtgg gggacaagga ccacatgtgc 721 agtgtcaaaa gtcgcctgtg gaagctgcta tccccagcca ggctggccag gcgggcccac 781 cggagcaaat ggctggagag ttacctgctg cacctggagg agatgggtgt gtcagaggag 841 atgcaggcac gggccctggt gctgcagctg tgggccacac aggggaatat gggtcccgct 901 gccttctggc tcctgctctt ccttctcaag aatcctgaag ccctggctgc tgtccgcgga 961 gagctcgaga gtatcctttg gcaagcggag cagcctgtct cgcagacgac cactctccca 1021 cagaaggttc tagacagcac acctgtgctt gatagcgtgc tgagtgagag cctcaggctt 1081 acagctgccc ccttcatcac ccgcgaggtt gtggtggacc tggccatgcc catggcagac 1141 gggagagaat tcaacctgcg acgtggtgac cgcctcctcc tcttcccctt cctgagcccc 1201 cagagagacc cagaaatcta cacagaccca gaggtattta aatacaaccg attcctgaac 1261 cctgacggat cagagaagaa agacttttac aaggatggga aacggctgaa gaattacaac 1321 atgccctggg gggcggggca caatcactgc ctggggagga gttatgcggt caacagcatc 1381 aaacaatttg tgttccttgt gctggtgcac ttggacttgg agctgatcaa cgcagatgtg 1441 gagatccctg agtttgacct cagcaggtac ggcttcggtc tgatgcagcc ggaacacgac 1501 gtgcccgtcc gctaccgcat ccgcccatga cacagggagc agatggatcc acgtgctcgc 1561 ctctgcccag cctgccccag cctgccccag cctcccagct ttctgtgtgc acagttggcc 1621 cgggtgcagg tgctagcatt accacttccc tgcttttctc ccagaaggct gggtccaggg 1681 gagggaaaag ctaagagggt gaacaaagaa aagacattga aagctctatg gattatccac 1741 tgcaaagttt tctttccaaa atcaggcttt gtctgctccc aattcacctc gttactctca 1801 cctcgtgata tccacaaatg ctattcagat aaggcagaac taggagtctt cactgctctg 1861 cccccaactc ccggaggtgt caccttccta gttcttatga gctagcatgg cccgggcctt 1921 atccagtcaa agcggatgct ggccacagaa aggccactca ggatgtcctt tgtgtcc // LOCUS HUMPTHL 1887 bp mRNA PRI 08-JAN-1995 DEFINITION Human, parathyroid-like protein (associated with humoral hypercalcemia of malignancy) mRNA, complete cds. ACCESSION J03580 NID g190705 KEYWORDS parathyroid-like peptide. SOURCE Human renal carcinoma, cDNA to mRNA, clone lanbda-HHM8. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1887) AUTHORS Mangin,M., Webb,A.C., Dreyer,B.E., Posillico,J.T., Ikeda,K., Weir,E.C., Stewart,A.F., Bander,N.H., Milstone,L., Barton,D.E., Francke,U. and Broadus,A.E. TITLE Identification of a cDNA encoding a parathyroid hormone-like peptide from a human tumor associated with humoral hypercalcemia of malignancy JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85 (2), 597-601 (1988) MEDLINE 88124888 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by M.Mangin, 21-DEC-1987. FEATURES Location/Qualifiers source 1..1887 /organism="Homo sapiens" /db_xref="taxon:9606" /map="12p12.1-p11.2" sig_peptide 939..1046 /gene="PTHLH" /note="parathyroid-like protein signal peptide" CDS 939..1418 /gene="PTHLH" /note="parathyroid-like protein precursor" /codon_start=1 /db_xref="GDB:G00-120-323" /db_xref="PID:g190706" /translation="MQRRLVQQWSVAVFLLSYAVPSCGRSVEGLSRRLKRAVSEHQLL HDKGKSIQDLRRRFFLHHLIAEIHTAEIHPVRFGSDDEGRYLTQETNKVETYKEQPLK TPGKKKKGKPGKRKEQEKKKRRTRSAWLDSGVTGSGLEGDHLSDTSTTSLELDSRRH" gene 939..1418 /gene="PTHLH" mat_peptide 1047..1415 /gene="PTHLH" /note="parathyroid-like protein" BASE COUNT 394 a 572 c 499 g 422 t ORIGIN 24 bp upstream of BglII site; chromosome 12. 1 ccgtttttgt tcttctaagc aaaagatctc cctctctcta gccgatgctc cccactcagt 61 tcatcccggg aatgggccag ggaggaaggt tctcatgcat cgccccgagc tgccaggcga 121 gcttcgggct ccttaaattc acaggccaac agcccgcgtc ctctccgcgc aggctcccgg 181 ttgcccgcgg tccccggccc agctccttgg cctcctcctc gtcggtccgc ccctggtggt 241 cttggcgccc gctcgtccag ctcggcgcgc cggggaccgc cggctgcccg gggcagtccg 301 cacgccctcg gggatctcgg ctcccggatc cgccgcgccg gcaggagccg gccgggcctg 361 gagggagcaa gcggatgcgc ccacgccccc ggcacgggga tggcgcgaca gggcccgggc 421 tccggggtgg ggctcggcag agctcctgac agctccgggg ctcggcagcg cgggaggggg 481 gagctccgcc gctcgccgct cattcccggc tcggggctcc cctccactcg ctcgggcggc 541 gcggggcccg ttcgggccgc ccgtcgccgc ccccgccccc cgcgcgcccg cccgccagcc 601 cgcctgcgcc ctcgctcgcc ccgcgcgcgt tcctagggcg ccacctcttt gcgactagct 661 cacttctccg gcaggtttgc ctcggagcgt gtgaacattc ctccgctcgg ttttcaactc 721 gcctccaacc tgcgccgccc ggccagcatg tctccccgcc cgtgaagcgg gctgccgcct 781 ccctgccgct ccggctgcca ctaacgaccc gccctcgccg ccacctggcc ctcctgatcg 841 acgacacacg cacttgaaac ttgttctcag ggtgtgtgga atcaactttc cggaagcaac 901 cagcccacca gaggaggtcc cgagcgcgag cggagacgat gcagcggaga ctggttcagc 961 agtggagcgt cgcggtgttc ctgctgagct acgcggtgcc ctcctgcggg cgctcggtgg 1021 agggtctcag ccgccgcctc aaaagagctg tgtctgaaca tcagctcctc catgacaagg 1081 ggaagtccat ccaagattta cggcgacgat tcttccttca ccatctgatc gcagaaatcc 1141 acacagctga aatccacccc gtccgatttg ggtctgatga tgagggcaga tacctaactc 1201 aggaaactaa caaggtggag acgtacaaag agcagccgct caagacacct gggaagaaaa 1261 agaaaggcaa gcccgggaaa cgcaaggagc aggaaaagaa aaaacggcga actcgctctg 1321 cctggttaga ctctggagtg actgggagtg ggctagaagg ggaccacctg tctgacacct 1381 ccacaacgtc gctggagctc gattcacgga ggcattgaaa ttttcagcag agaccttcca 1441 aggacatatt gcaggattct gtaatagtga acatatggaa agtattagaa atatttattg 1501 tctgtaaata ctgtaaatgc attggaataa aactgtctcc cccattgctc tatgaaactg 1561 cacattggtc attgtgaata tttttttttt tgccaaggct aatccaatta ttattatcac 1621 atttaccata atttattttg tccattgatg tatttatttt gtaaatgtat cttggtgctg 1681 ctgaatttct atattttttg taacataatg cactttagat atacatatca agtatgttga 1741 taaatgacac aatgaagtgt ctctattttg tggttgattt taatgaatgc ctaaatataa 1801 ttatccaaat tgattttcct ttgtgcatgt aaaaataaca gtattttaaa tttgtaaaga 1861 atgtctaata aaatataatc taattac // LOCUS HUMPTKA 3650 bp mRNA PRI 31-JAN-1996 DEFINITION Human mRNA for Tec protein-tyrosine kinase, complete cds. ACCESSION D29767 NID g474303 KEYWORDS Tec protein-tyrosine kinase. SOURCE Homo sapiens blood T-cell cell-line PEER cDNA to mRNA, clone lambda-htec1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3650) AUTHORS Sato,K., Mano,H., Ariyama,T., Inazawa,J., Yazaki,Y. and Hirai,H. TITLE Molecular cloning and analysis of the human Tec protein-tyrosine kinase JOURNAL Leukemia 8 (10), 1663-1672 (1994) MEDLINE 95019807 REFERENCE 2 (bases 1 to 3650) AUTHORS Mano,H. TITLE Direct Submission JOURNAL Submitted (01-APR-1994) to the DDBJ/EMBL/GenBank databases. Hiroyuki Mano, Jichi Medical School, Molecular Biology; 3311-1 Yakushiji, MinamiKawachi-machi, Kawachi-gun, Tochigi-ken 329-04, Japan (E-mail:xmano@jms.jeton.or.jp, Tel:0285-44-2111(ex.3482), Fax:0285-44-8675) COMMENT Submitted (01-Apr-1994) to DDBJ by: hiroyuki Mano Department of Molecular Biology Jichi Medical School 3311-1 Yakushiji, Minamikawachi-Machi Kawachi-gun, Tochigi 329-04 Japan Phone: 0285-44-2111 x3482 Fax: 0285-44-8675. FEATURES Location/Qualifiers source 1..3650 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="PEER" /cell_type="T-cell" /tissue_type="blood" 5'UTR 1..117 CDS 118..2013 /codon_start=1 /product="Tec protein-tyrosine kinase" /db_xref="PID:d1006733" /db_xref="PID:g474304" /translation="MNFNTILEEILIKRSQQKKKTSPLNYKERLFVLTKSMLTYYEGR AEKKYRKGFIDVSKIKCVEIVKNDDGVIPCQNKYPFQVVHDANTLYIFAPSPQSRDLW VKKLKEEIKNNNNIMIKYHPKFWTDGSYQCCRQTEKLAPGCEKYNLFESSIRKALPPA PETKKRRPPPPIPLEEEDNSEEIVVAMYDFQAAEGHDLRLERGQEYLILEKNDVHWWR ARDKYGNEGYIPSNYVTGKKSNNLDQYEWYCRNMNRSKAEQLLRSEDKEGGFMVRDSS QPGLYTVSLYTKFGGEGSSGFRHYHIKETTTSPKKYYLAEKHAFGSIPEIIEYHKHNA AGLVTRLRYPVSVKGKNAPTTAGFSYEKWEINPSELTFMRELGSGLFGVVRLGKWRAQ YKVAIKAIREGAMCEEDFIEEAKVMMKLTHPKLVQLYGVCTQQKPIYIVTEFMERGCL LNFLRQRQGHFSRDVLLSMCQDVCEGMEYLERNSFIHRDLAARNCLVSEAGVVKVSDF GMARYFLDDQYTSSSGAKFPVKWCPPEVFNYSRFSSKSDVWSFGVLMWEVFTEGRMPF EKYTNYEVVTMVTRGHRLYQPKLASNYVYEVMLRCWQEKPEGRPSFEDLLRTIDELVE CEETFGR" 3'UTR 2014..3650 BASE COUNT 1083 a 726 c 884 g 957 t ORIGIN 1 cggcggccgc ggatcccggc ggcgatccga cctcgcagtc tccccaggtc cgccagcagc 61 cggttcagcc agaatactgg gatcttcagt ggcaggagga gtaatcagaa gacggagatg 121 aattttaaca ctattttgga ggagattctt attaagaggt cacagcagaa aaagaagaca 181 tcgcccttaa actacaaaga gagacttttt gtacttacaa agtccatgct aacctactat 241 gagggtcgag cagagaagaa atacagaaag gggtttattg atgtttcaaa aatcaagtgt 301 gtggaaatag tgaagaatga tgatggtgtc attccctgtc aaaataagta tccatttcag 361 gttgttcatg atgctaacac actttacatt tttgcaccta gtccacaaag cagggacctg 421 tgggtgaaga agttaaaaga agaaataaag aacaacaata atattatgat taaatatcat 481 cctaaattct ggacagatgg aagttatcag tgttgtagac aaactgaaaa attagcaccc 541 ggatgtgaaa aatacaatct ttttgagagc agtataagaa aagcactacc tccagcacca 601 gaaacaaaga agcgaaggcc tcccccacca attccactag aagaagaaga taatagtgaa 661 gaaatcgttg tagccatgta tgatttccaa gcagcagaag gacatgatct cagattagag 721 agaggccaag agtatctcat tttagaaaag aatgatgtgc attggtggag agcaagagat 781 aaatatggga atgaaggata tatcccaagt aattacgtaa cgggaaagaa atcaaacaac 841 ttagatcaat atgaatggta ttgcagaaat atgaatagaa gcaaggcaga gcaactcctc 901 cgcagtgaag ataaagaagg tggttttatg gtaagggatt ccagtcaacc aggcttgtac 961 acagtctccc tttataccaa gtttggagga gaaggttcat cgggttttag gcattatcat 1021 ataaaggaaa caacaacatc tccaaagaag tattacctag ctgaaaaaca tgcttttggc 1081 tccattcctg agattattga atatcataag cacaatgcag caggacttgt caccaggctt 1141 cggtacccag ttagtgtgaa agggaagaat gcacccacca ctgcaggatt cagctatgag 1201 aaatgggaga ttaacccttc agaactgacc tttatgaggg aattgggaag tggactgttt 1261 ggagtggtga ggcttggcaa atggcgagcc cagtacaaag tcgcaatcaa agctattcgg 1321 gaaggtgcaa tgtgcgagga ggactttata gaagaagcta aagtgatgat gaagctgaca 1381 cacccgaagt tagtgcagct ttatggtgtg tgcacccagc agaaaccaat atacattgtt 1441 actgagttca tggaaagggg ctgccttctg aatttcctcc gacagagaca aggtcatttc 1501 agtagagacg tactgctgag catgtgtcag gatgtgtgtg aagggatgga gtatctggag 1561 agaaacagct tcatccacag agatctggct gccagaaatt gtctagtaag tgaggcggga 1621 gttgtaaaag tatctgattt tggaatggcc aggtattttc tggatgatca gtacacaagt 1681 tcttctggtg ctaagtttcc tgtgaagtgg tgtccacctg aagtgtttaa ttacagccgc 1741 ttcagcagca aatcagatgt ctggtcattt ggtgttttaa tgtgggaagt attcacggaa 1801 ggcagaatgc cttttgaaaa atacaccaat tatgaagtgg taaccatggt tactcgaggc 1861 caccgactct accagccgaa gttggcgtcc aactatgtgt atgaggtgat gctgagatgt 1921 tggcaggaga aaccagaggg aaggccttct ttcgaagatc tgctgcgcac aatagatgaa 1981 ctagttgaat gtgaagaaac ttttggaaga taagtgatgt gtgaccagtg gctcccagat 2041 tcccaagcac aaggaaggat gggcattttg tggcttttaa tttattgagc acttggacat 2101 gtagatcatt ttacttatac agtggaaaca cataaataat ttgcttctag accagcctct 2161 gtctagactt gcttctagac agaatctccc agagtgtgga aatgttgcct tagaaatggt 2221 gattaaaatc actcatttct attcattcct caggcacttg agtgacagtt gtttaccagg 2281 cactgtgtgt agccccaggg tttggccatt caggggtgca cacatgggac catgttagct 2341 gatgccagtt gaaggccagg gtatttggga aggggaaggg tattagagtc atgaccaagc 2401 aacccttctt tttccctttg acttctacag aaatctgggc ctgagacatt gtctacaatt 2461 gggttctaga tacatcagga acccatcttg gataaataaa tacctatctt ttgttttgaa 2521 aacatctcag ttttcaagac tgctcttagt attacatgaa caatatttgt atgctgtata 2581 tattgtaaat atatataata tataaagtta tatatttatg agaaacacga attgtctttt 2641 aattgaaact tttaatcctg tagtatagga gttcaccttc ttaggactag agactgtgcc 2701 ttatagctgt taattcattt ccccctgaac atcaaatatg cctgaagaga agaaagtcta 2761 gattcttcta tgagtaacgc cccctcctca ctcaggtaaa tgtgtctggg gatgcctgtc 2821 cagcttaacc acgtgcattt ggcctatgta atcctgccca tggtggccgc agctaatcag 2881 aatcagatgg aaaattaaac cgggtaatct acttctaagc cttaagaata ttccctggga 2941 cacagacact ataattggaa gtgctgagct ctggggcaga aggatcaggt gaccttcgca 3001 acaaagtttg cccccacctc acataggacc cggaagcagc ctgagctgtg gcggaggatc 3061 caggaagcta cggagagaag cagccagcat ggtgttccgt gcctcccgga cgtttttcag 3121 gaggcctggt tggacttggg ttcctggatg gtgggattgt tgtacagcct ctcaggagac 3181 cctgctgtca agactgtgtg tgtggatttc ccacccttag aagctctact aagacatcaa 3241 cggaattagg gccttccttt ttgccttgtg agcgccaagg aaaagaaact atctcggtca 3301 cgtgagcgcc acgaaagaaa ctgtatcagt catccagaga ccgtttattg cccaacacgt 3361 tattcttgct gttggtgggg taactagccg aggaagacac agcgccttcc cttcaggagt 3421 tgcgtctcct ctgcaggcca cgatggtctg ctctggagca ttgggtgaac acacaggctg 3481 gctgctctgg gcagcgcctt cactctgacc ctggagaacc atttcatttc atcctggtca 3541 gtctagagtc tgtgcaccag gcagtccatc cactgaaggc tgtgtttatt cttttcctgt 3601 gcccctcata atggaagaaa gtaaactgct tatcccgagc cttaaaaaaa // LOCUS HUMPTKJAK1 3541 bp mRNA PRI 19-JAN-1993 DEFINITION Human protein-tyrosine kinase (JAK1) mRNA, complete cds. ACCESSION M64174 M35203 NID g190734 KEYWORDS protein-tyrosine kinase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3541) AUTHORS Wilks,A.F., Harpur,A.G., Kurban,R.R., Ralph,S.J., Zuercher,G. and Ziemiecki,A. TITLE Two novel protein-tyrosine kinases, each with a second phosphotransferase-related catalytic domain, define a new class of protein kinase JOURNAL Mol. Cell. Biol. 11, 2057-2065 (1991) MEDLINE 91172194 FEATURES Location/Qualifiers source 1..3541 /organism="Homo sapiens" /db_xref="taxon:9606" gene 76..3504 /gene="JAK1" CDS 76..3504 /gene="JAK1" /codon_start=1 /product="protein-tyrosine kinase" /db_xref="PID:g190735" /translation="MAFCAKMRSSKKTEVNLEAPEPGVEVIFYLSDREPLRLGSGEYT AEELCIRAAQACRISPLCHNLFALYDENTKLWYAPNRTITVDDKMSLRLHYRMRFYFT NWHGTNDNEQSVWRHSPKKQKNGYEKKKIPDATPLLDASSLEYLFAQGQYDLVKCLAP IRDPKTEQDGHDIENECLGMAVLAISHYAMMKKMQLPELPKDISYKRYIPETLNKSIR QRNLLTRMRINNVFKDFLKEFNNKTICDSSVSTHDLKVKYLATLETLTKHYGAEIFET SMLLISSENEMNWFHSNDGGNVLYYEVMVTGNLGIQWRHKPNVVSVEKEKNKLKRKKL ENKDKKDEEKNKIREEWNNFSFFPEITHIVIKESVVSINKQDNKKMELKLSSHEEALS FVSLVDGYFRLTADAHHYLCTDVAPPLIVHNIQNGCHGPICTEYAINKLRQEGSEEGM YVLRWSCTDFDNILMTVTCFEKSEQVQGAQKQFKNFQIEVQKGRYSLHGSDRSFPSLG DLMSHLKKQILRTDNISFMLKRCCQPKPREISNLLVATKKAQEWQPVYPMSQLSFDRI LKKDLVQGEHLGRGTRTHIYSGTLMDYKDDEGTSEEKKIKVILKVLDPSHRDISLAFF EAASMMRQVSHKHIVYLYGVCVRDVENIMVEEFVEGGPLDLFMHRKSDVLTTPWKFKV AKQLASALSYLEDKDLVHGNVCTKNLLLAREGIDSECGPFIKLSDPGIPITVLSRQEC IERIPWIAPECVEDSKNLSVAADKWSFGTTLWEICYNGEIPLKDKTLIEKERFYESRC RPVTPSCKELADLMTRCMNYDPNQRPFFRAIMRDINKLEEQNPDIVSRKKNQPTEVDP THFEKRFLKRIRDLGEGHFGKVELCRYDPEDNTGEQVAVKSLKPESGGNHIADLKKEI EILRNLYHENIVKYKGICTEDGGNGIKLIMEFLPSGSLKEYLPKNKNKINLKQQLKYA VQICKGMDYLGSRQYVHRDLAARNVLVESEHQVKIGDFGLTKAIETDKEYYTVKDDRD SPVFWYAPECLMQSKFYIASDVWSFGVTLHELLTYCDSDSSPMALFLKMIGPTHGQMT VTRLVNTLKEGKRLPCPPNCPDEVYQLMRKCWEFQPSNRTSFQNLIEGFEALLK" BASE COUNT 1054 a 806 c 876 g 805 t ORIGIN 1 tccagtttgc ttcttggaga acactggaca gctgaataaa tgcagtatct aaatataaaa 61 gaggactgca atgccatggc tttctgtgct aaaatgagga gctccaagaa gactgaggtg 121 aacctggagg cccctgagcc aggggtggaa gtgatcttct atctgtcgga cagggagccc 181 ctccggctgg gcagtggaga gtacacagca gaggaactgt gcatcagggc tgcacaggca 241 tgccgtatct ctcctctttg tcacaacctc tttgccctgt atgacgagaa caccaagctc 301 tggtatgctc caaatcgcac catcaccgtt gatgacaaga tgtccctccg gctccactac 361 cggatgaggt tctatttcac caattggcat ggaaccaacg acaatgagca gtcagtgtgg 421 cgtcattctc caaagaagca gaaaaatggc tacgagaaaa aaaagattcc agatgcaacc 481 cctctccttg atgccagctc actggagtat ctgtttgctc agggacagta tgatttggtg 541 aaatgcctgg ctcctattcg agaccccaag accgagcagg atggacatga tattgagaac 601 gagtgtctag ggatggctgt cctggccatc tcacactatg ccatgatgaa gaagatgcag 661 ttgccagaac tgcccaagga catcagctac aagcgatata ttccagaaac attgaataag 721 tccatcagac agaggaacct tctcaccagg atgcggataa ataatgtttt caaggatttc 781 ctaaaggaat ttaacaacaa gaccatttgt gacagcagcg tgtccacgca tgacctgaag 841 gtgaaatact tggctacctt ggaaactttg acaaaacatt acggtgctga aatatttgag 901 acttccatgt tactgatttc atcagaaaat gagatgaatt ggtttcattc gaatgacggt 961 ggaaacgttc tctactacga agtgatggtg actgggaatc ttggaatcca gtggaggcat 1021 aaaccaaatg ttgtttctgt tgaaaaggaa aaaaataaac tgaagcggaa aaaactggaa 1081 aataaagaca agaaggatga ggagaaaaac aagatccggg aagagtggaa caatttttca 1141 ttcttccctg aaatcactca cattgtaata aaggagtctg tggtcagcat taacaagcag 1201 gacaacaaga aaatggaact gaagctctct tcccacgagg aggccttgtc ctttgtgtcc 1261 ctggtagatg gctacttccg gctcacagca gatgcccatc attacctctg caccgacgtg 1321 gcccccccgt tgatcgtcca caacatacag aatggctgtc atggtccaat ctgtacagaa 1381 tacgccatca ataaattgcg gcaagaagga agcgaggagg ggatgtacgt gctgaggtgg 1441 agctgcaccg actttgacaa catcctcatg accgtcacct gctttgagaa gtctgagcag 1501 gtgcagggtg cccagaagca gttcaagaac tttcagatcg aggtgcagaa gggccgctac 1561 agtctgcacg gttcggaccg cagcttcccc agcttgggag acctcatgag ccacctcaag 1621 aagcagatcc tgcgcacgga taacatcagc ttcatgctaa aacgctgctg ccagcccaag 1681 ccccgagaaa tctccaacct gctggtggct actaagaaag cccaggagtg gcagcccgtc 1741 taccccatga gccagctgag tttcgatcgg atcctcaaga aggatctggt gcagggcgag 1801 caccttggga gaggcacgag aacacacatc tattctggga ccctgatgga ttacaaggat 1861 gacgaaggaa cttctgaaga gaagaagata aaagtgatcc tcaaagtctt agaccccagc 1921 cacagggata tttccctggc cttcttcgag gcagccagca tgatgagaca ggtctcccac 1981 aaacacatcg tgtacctcta tggcgtctgt gtccgcgacg tggagaatat catggtggaa 2041 gagtttgtgg aagggggtcc tctggatctc ttcatgcacc ggaaaagtga tgtccttacc 2101 acaccatgga aattcaaagt tgccaaacag ctggccagtg ccctgagcta cttggaggat 2161 aaagacctgg tccatggaaa tgtgtgtact aaaaacctcc tcctggcccg tgagggaatc 2221 gacagtgagt gtggcccatt catcaagctc agtgaccccg gcatccccat tacggtgctg 2281 tctaggcaag aatgcattga acgaatccca tggattgctc ctgagtgtgt tgaggactcc 2341 aagaacctga gtgtggctgc tgacaagtgg agctttggaa ccacgctctg ggaaatctgc 2401 tacaatggcg agatcccctt gaaagacaag acgctgattg agaaagagag attctatgaa 2461 agccggtgca ggccagtgac accatcatgt aaggagctgg ctgacctcat gacccgctgc 2521 atgaactatg accccaatca gaggcctttc ttccgagcca tcatgagaga cattaataag 2581 cttgaagagc agaatccaga tattgtttcc agaaaaaaaa accagccaac tgaagtggac 2641 cccacacatt ttgagaagcg cttcctaaag aggatccgtg acttgggaga gggccacttt 2701 gggaaggttg agctctgcag gtatgacccc gaagacaata caggggagca ggtggctgtt 2761 aaatctctga agcctgagag tggaggtaac cacatagctg atctgaaaaa ggaaatcgag 2821 atcttaagga acctctatca tgagaacatt gtgaagtaca aaggaatctg cacagaagac 2881 ggaggaaatg gtattaagct catcatggaa tttctgcctt cgggaagcct taaggaatat 2941 cttccaaaga ataagaacaa aataaacctc aaacagcagc taaaatatgc cgttcagatt 3001 tgtaagggga tggactattt gggttctcgg caatacgttc accgggactt ggcagcaaga 3061 aatgtccttg ttgagagtga acaccaagtg aaaattggag acttcggttt aaccaaagca 3121 attgaaaccg ataaggagta ttacaccgtc aaggatgacc gggacagccc tgtgttttgg 3181 tatgctccag aatgtttaat gcaatctaaa ttttatattg cctctgacgt ctggtctttt 3241 ggagtcactc tgcatgagct gctgacttac tgtgattcag attctagtcc catggctttg 3301 ttcctgaaaa tgataggccc aacccatggc cagatgacag tcacaagact tgtgaatacg 3361 ttaaaagaag gaaaacgcct gccgtgccca cctaactgtc cagatgaggt ttatcagctt 3421 atgagaaaat gctgggaatt ccaaccatcc aatcggacaa gctttcagaa ccttattgaa 3481 ggatttgaag cacttttaaa ataagaagca tgaataacat ttaaattcca cagattatca 3541 a // LOCUS HUMPTPASE 2287 bp mRNA PRI 09-MAY-1995 DEFINITION Human protein tyrosine phosphatase (PTPase) mRNA, complete cds. ACCESSION M25393 NID g190740 KEYWORDS c-myc proto-oncogene; protein-tyrosine phosphatase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2287) AUTHORS Cool,D.E., Tonks,N.K., Charbonneau,H., Walsh,K.A., Fischer,E.H. and Krebs,E.G. TITLE cDNA isolated from a human T-cell library encodes a member of the protein-tyrosine-phosphatase family JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86 (14), 5257-5261 (1989) MEDLINE 89315776 COMMENT Draft entry and computer-readable sequence [1] kindly submitted by D.Cool, 08-JUN-1989. FEATURES Location/Qualifiers source 1..2287 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T-cell" /map="1p34" gene 61..1308 /gene="PTPRF" CDS 61..1308 /gene="PTPRF" /note="putative" /codon_start=1 /db_xref="GDB:G00-120-138" /product="protein tyrosine phosphatase" /db_xref="PID:g804750" /translation="MPTTIEREFEELDTQRRWQPLYLEIRNESHDYPHRVAKFPENRN RNRYRDVSPYDHSRVKLQNAENDYINASLVDIEEAQRSYILTQGPLPNTCCHFWLMVW QQKTKAVVMLNRIVEKESVKCAQYWPTDDQEMLFKETGFSVKLLSEDVKSYYTVHLLQ LENINSGETRTISHFHYTTWPDFGVPESPASFLNFLFKVRESGSLNPDHGPAVIHCSA GIGRSGTFSLVDTCLVLMEKGDDINIKQVLLNMRKYRMGLIQTPDQLRFSYMAIIEGA KCIKGDSSIQKRWKELSKEDLSPAFDHSPNKIMTEKYNGNRIGLEEEKLTGDRCTGLS SKMQDTMEENSESALRKRIREDRKATTAQKVQQMKQRLNENERKRKRWLYWQPILTKM GFMSVILVGAFVGWRLFFQQNAL" BASE COUNT 754 a 410 c 464 g 659 t ORIGIN 1 ggggggcctg agcctctccg ccggcgcagg ctctgctcgc gccagctcgc tcccgcagcc 61 atgcccacca ccatcgagcg ggagttcgaa gagttggata ctcagcgtcg ctggcagccg 121 ctgtacttgg aaattcgaaa tgagtcccat gactatcctc atagagtggc caagtttcca 181 gaaaacagaa atcgaaacag atacagagat gtaagcccat atgatcacag tcgtgttaaa 241 ctgcaaaatg ctgagaatga ttatattaat gccagtttag ttgacataga agaggcacaa 301 aggagttaca tcttaacaca gggtccactt cctaacacat gctgccattt ctggcttatg 361 gtttggcagc agaagaccaa agcagttgtc atgctgaacc gcattgtgga gaaagaatcg 421 gttaaatgtg cacagtactg gccaacagat gaccaagaga tgctgtttaa agaaacagga 481 ttcagtgtga agctcttgtc agaagatgtg aagtcgtatt atacagtaca tctactacaa 541 ttagaaaata tcaatagtgg tgaaaccaga acaatatctc actttcatta tactacctgg 601 ccagattttg gagtccctga atcaccagct tcatttctca atttcttgtt taaagtgaga 661 gaatctggct ccttgaaccc tgaccatggg cctgcggtga tccactgtag tgcaggcatt 721 gggcgctctg gcaccttctc tctggtagac acttgtcttg ttttgatgga aaaaggagat 781 gatattaaca taaaacaagt gttactgaac atgagaaaat accgaatggg tcttattcag 841 accccagatc aactgagatt ctcatacatg gctataatag aaggagcaaa atgtataaag 901 ggagattcta gtatacagaa acgatggaaa gaactttcta aggaagactt atctcctgcc 961 tttgatcatt caccaaacaa aataatgact gaaaaataca atgggaacag aataggtcta 1021 gaagaagaaa aactgacagg tgaccgatgt acaggacttt cctctaaaat gcaagataca 1081 atggaggaga acagtgagag tgctctacgg aaacgtattc gagaggacag aaaggccacc 1141 acagctcaga aggtgcagca gatgaaacag aggctaaatg agaatgaacg aaaaagaaaa 1201 aggtggttat attggcaacc tattctcact aagatggggt ttatgtcagt cattttggtt 1261 ggcgcttttg ttggctggag actgtttttt cagcaaaatg ccctataaac aattaatttt 1321 gcccagcaag cttctgcact agtaactgac agtgctacat taatcatagg ggtttgtctg 1381 cagcaaacgc ctcatatccc aaaaacggtg cagtagaata gacatcaacc agataagtga 1441 tatttacagt cacaagccca acatctcagg actcttgact gcaggttcct ctgaacccca 1501 aactgtaaat ggctgtctaa aataaagaca ttcatgtttg ttaaaaactg gtaaattttg 1561 caactgtatt catacatgtc aaacacagta tttcacctga ccaacattga gatatccttt 1621 atcacaggat ttgtttttgg aggctatctg gattttaacc tgcacttgat ataagcaata 1681 aatattgtgg ttttatctac gttattggaa agaaaatgac atttaaataa tgtgtgtaat 1741 gtataatgta ctattgacat gggcatcaac acttttattc ttaagcattt cagggtaaat 1801 atattttata agtatctatt taatcttttg tagttaactg tactttttaa gagctcaatt 1861 tgaaaaatct gttactaaaa aaaaaaattg tatgtcgatt gaattgtact ggatacattt 1921 tccatttttc taaaaagaag tttgatatga gcagttagaa gttggaataa gcaatttcta 1981 ctatatattg catttctttt atgttttaca gttttcccca ttttaaaaag aaaagcaaac 2041 aaagaaacaa aagtttttcc taaaaatatc tttgaaggaa aattctcctt actgggatag 2101 tcaggtaaac agttggtcaa gactttgtaa agaaattggt ttctgtaaat cccattattg 2161 atatgtttat ttttcatgaa aatttcaatg tagttggggt agattatgat ttaggaagca 2221 aaagtaagaa gcagcatttt atgattcata atttcagttt actagactga agttttgaag 2281 taaaccc // LOCUS HUMPTPD 6263 bp mRNA PRI 09-APR-1995 DEFINITION Homo sapiens protein tyrosine phosphatase delta mRNA, complete cds. ACCESSION L38929 NID g755652 KEYWORDS protein tyrosine phosphatase delta; transmembrane protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6263) AUTHORS Pulido,R., Krueger,N.X., Serra-Pages,C., Saito,H. and Streuli,M. TITLE Molecular characterization of the human transmembrane protein-tyrosine phosphatase delta. Evidence for tissue-specific expression of alternative human transmembrane protein-tyrosine phosphatase delta isoforms JOURNAL J. Biol. Chem. 270 (12), 6722-6728 (1995) MEDLINE 95204468 FEATURES Location/Qualifiers source 1..6263 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA 1..6263 CDS 154..5892 /codon_start=1 /product="protein tyrosine phosphatase delta" /db_xref="PID:g755653" /translation="MVHVARLLLLLLTFFLRTDAETPPRFTRTPVDQTGVSGGVASFI CQATGDPRPKIVWNKKGKKVSNQRFEVIEFDDGSGSVLRIQPLRTPRDEAIYECVASN NVGEISVSTRLTVLREDQIPRGFPTIDMGPQLKVVERTRTATMLCAASGNPDPEITWF KDFLPVDTSNNNGRIKQLRSESIGGTPIRGALQIEQSEESDQGKYECVATNSAGTRYS APANLYVRELREVRRVPPRFSIPPTNHEIMPGGSVNITCVAVGSPMPYVKWMLGAEDL TPEDDMPIGRNVLELNDVRQSANYTCVAMSTLGVIEAIAQITVKALPKPPGTPVVTES TATSITLTWDSGNPEPVSYYIIQHKPKNSEELYKEIDGVATTRYSVAGLSPYSDYEFR VVAVNNIGRGPPSEPVLTQTSEQAPSSAPRDVQARMLSSTTILVQWKEPEEPNGQIQG YRVYYTMDPTQHVNNWMKHNVADSQITTIGNLVPQKTYSVKVLAFTSIGDGPLSSDIQ VITQTGVPGQPLNFKAEPESETSILLSWTPPRSDTIANYELVYKDGEHGEEQRITIEP GTSYRLQGLKPNSLYYFRLAARSPQGLGASTAEISARTMQSKPSAPPQDISCTSPSST SILVSWQPPPVEKQNGIITEYSIKYTAVDGEDDKPHEILGIPSDTTKYLLEQLEKWTE YRITVTAHTDVGPGPESLSVLIRTNEDVPSGPPRKVEVEAVNSTSVKVSWRSPVPNKQ HGQIRGYQVHYVRMENGEPKGQPMLKDVMLADAQWEFDDTTEHDMIISGLQPETSYSL TVTAYTTKGDGARSKPKLVSTTGAVPGKPRLVINHTQMNTALIQWHPPVDTFGPLQGY RLKFGRKDMEPLTTLEFSEKEDHFTATDIHKGASYVFRLSARNKVGFGEEMVKEISIP EEVPTGFPQNLHSEGTTSTSVQLSWQPPVLAERNGIITKYTLLYRDINIPLLPMEQLI VPADTTMTLTGLKPDTTYDVKVRAHTSKGPGPYSPSVQFRTLPVDQVFAKNFHVKAVM KTSVLLSWEIPENYNSAMPFKILYDDGKMVEEVDGRATQKLIVNLKPEKSYSFVLTNR GNSAGGLQHRVTAKTAPDVLRTKPAFIGKTNLDGMITVQLPEVPANENIKGYYIIIVP LKKSRGKFIKPWESPDEMELDELLKEISRKRRSIRYGREVELKPYIAAHFDVLPTEFT LGDDKHYGGFTNKQLQSGQEYVFFVLAVMEHAESKMYATSPYSDPVVSMDLDPQPITD EEEGLIWVVGPVLAVVFIICIVIAILLYKRKRAESDSRKSSIPNNKEIPSHHPTDPVE LRRLNFQTPGMASHPPIPILELADHIERLKANDNLKFSQEYESIDPGQQFTWEHSNLE VNKPKNRYANVIAYDHSRVLLSAIEGIPGSDYVNANYIDGYRKQNAYIATQGSLPETF GDFWRMIWEQRSATVVMMTKLEERSRVKCDQYWPSRGTETHGLVQVTLLDTVELATYC VRTFALYKNGSSEKREVRQFQFTAWPDHGVPEHPTPFLAFLRRVKTCNPPDAGPMVVH CSAGVGRTGCFIVIDAMLERIKHEKTVDIYGHVTLMRAQRNYMVQTEDQYIFIHDALL EAVTCGNTEVPARNLYAYIQKLTQIETGENVTGMELEFKRLASSKAHTSRFISANLPC NKFKNRLVNIMPYESTRVCLQPIRGVEGSDYINASFIDGYRQQKAYIATQGPLAETTE DFWRMLWEHNSTIVVMLTKLREMGREKCHQYWPAERSARYQYFVVDPMAEYNMPQYIL REFKVTDARDGQSRTVRQFQFTDWPEQGVPKSGEGFIDFIGQVHKTKEQFGQDGPISV HCSAGVGRTGVFITLSIVLERMRYEGVVDIFQTVKMLRTQRPAMVQTEDQYQFSYRAA LEYLGSFDHYAT" polyA_site 6263 BASE COUNT 1870 a 1449 c 1445 g 1499 t ORIGIN 1 gctaactcaa gggagacgtc tggtgaacac ccgtgggatc taaagaacaa gctctgaaag 61 tgttccagct gaaatttcag atcggacaga ctcgctgcgg ctccggaggc agtgattcca 121 agctgctcgc gcacgctgct gccaagctgc aggatggtgc acgtagccag gctgctgctg 181 ctgctcctca ctttcttcct ccgcacggat gctgagacac ctccaaggtt tacacgaaca 241 cccgttgatc agacaggggt ctctggcgga gttgcctctt tcatctgcca agctacggga 301 gacccaagac ctaaaattgt ctggaacaaa aaaggaaaga aagtcagcaa tcagagattt 361 gaggtaatag agtttgacga tgggtctgga tcagttctca gaatacaacc cttacggact 421 ccgagggatg aggccattta tgaatgtgtg gcctcaaata atgtgggaga aataagtgta 481 tccaccagac tcacagtttt gcgggaagat caaattccca ggggcttccc taccattgac 541 atgggcccac agttgaaggt ggttgagcgt actcgcacgg ccaccatgct ttgtgcagcc 601 agtggtaatc cggatccaga aatcacttgg tttaaagatt tcttacctgt ggacacaagc 661 aacaacaatg gtcgtattaa gcagttacga tcagaatcta ttggtggtac accaataaga 721 ggagcccttc agattgagca gagtgaagag tctgaccaag gaaaatatga gtgtgttgcc 781 accaacagcg cgggcactcg ctattccgct cctgccaatt tatatgtcag agagctgcga 841 gaagttcgcc gtgtcccacc aagattctct atcccaccca ctaatcatga aatcatgcca 901 ggcggaagcg ttaatatcac ctgtgtggcc gtggggtcac caatgcctta tgtaaagtgg 961 atgttggggg cagaagatct gacacctgaa gatgatatgc caataggaag aaatgtgcta 1021 gaactgaatg atgtaagaca gtcagcaaat tacacctgtg ttgctatgtc aacactgggt 1081 gtcattgaag caatagcaca gatcactgtc aaagccttac ccaaacctcc aggaactcct 1141 gtagtgaccg agagcacagc tacaagcatc acactgacgt gggactctgg gaaccctgag 1201 cctgtttctt attacataat tcagcataaa cctaaaaact ctgaggaact ttacaaagaa 1261 attgatgggg tggcgaccac acgctacagt gtcgctggac taagtcccta ctcggattat 1321 gaattcaggg ttgttgctgt caataacatt gggcgggggc ctcccagcga acctgtgcta 1381 acacaaacct cagagcaagc accatccagt gccccgaggg atgtccaggc acgaatgttg 1441 agttcgacca ccattttggt acagtggaag gaacctgaag agccaaatgg acagatccaa 1501 ggatatagag tttattatac aatggatccc actcaacatg tcaacaactg gatgaaacac 1561 aatgtagctg acagccaaat cactactatt ggcaacttag tgccccagaa aacatattct 1621 gtcaaagtcc tggcttttac ctcaattgga gatggtcccc tttcaagtga catacaagtc 1681 atcactcaga caggagtacc agggcagcca ctaaacttca aagcagaacc tgagtctgaa 1741 acaagtattt tgctctcttg gacacctcca cgttcagata ccattgccaa ctatgaactg 1801 gtctacaaag atggggagca tggagaggag caacgaatta ccattgagcc agggacatca 1861 tataggctgc aaggactgaa accaaacagc ttatactatt tccgtctggc tgcacgctcc 1921 cctcaaggcc tgggtgcttc tactgcagaa atatcagcta gaaccatgca gtcaaagccg 1981 tcagctcctc ctcaagacat tagttgcacc agcccaagtt ccactagtat tttggtaagt 2041 tggcaacctc caccagtgga aaaacagaat ggcattatca ctgaatactc catcaagtac 2101 actgcagtgg atggggaaga tgacaagcct cacgagattt tgggaattcc ttcggacact 2161 accaaatacc ttttggaaca gctggaaaaa tggactgaat accggatcac tgtgacagcc 2221 catacagatg tcggccctgg ccctgagagc ttgtccgtgt tgattcgaac caatgaagat 2281 gttcctagtg gtcctcctcg caaagtcgag gtagaggctg tcaactcaac atctgttaaa 2341 gtctcatggc gctcacccgt gcccaataaa cagcatggcc agataagagg atatcaggtg 2401 cattatgtga ggatggaaaa tggtgagccc aagggccagc ccatgctgaa agatgtcatg 2461 ctggctgatg cacagtggga atttgatgat actactgaac atgacatgat catttctggg 2521 ctccagcctg aaacttccta ctccctcacc gtcacagcct acacaaccaa aggagatggt 2581 gctcgcagca agcccaaact ggtgtccacc actggggcag ttccagggaa acctcggctt 2641 gtgattaacc acactcagat gaatactgct cttattcagt ggcaccctcc ggtggacaca 2701 tttggacctc ttcagggcta ccgtctaaaa tttggccgca aggatatgga gccacttact 2761 actcttgagt tctctgaaaa agaagatcac tttacagcta cagacatcca caagggagca 2821 tcatacgtct tcaggctctc agccagaaac aaagtgggct ttggggagga gatggtgaag 2881 gagatttcca ttccagaaga agtaccaact ggattccctc aaaaccttca ctcagaaggc 2941 accacttcaa cctccgtcca gttatcttgg caaccacctg tcctggcaga gagaaatggc 3001 attatcacca agtataccct tctttatagg gatatcaaca tcccccttct cccgatggag 3061 cagcttattg ttccagctga caccactatg acactcactg gcttaaaacc agataccaca 3121 tacgatgtaa aagtacgtgc tcatacgagc aaagggcccg ggccatatag tcccagtgtc 3181 cagttcagga cactgcctgt ggatcaagtg tttgcaaaaa attttcatgt caaagcagta 3241 atgaagactt ccgtgttgct gtcttgggag attccagaga attataactc cgccatgcct 3301 ttcaaaattc tttatgatga tgggaaaatg gtagaagaag tggatggccg agccacacag 3361 aagttaattg tcaacctgaa gcctgagaaa tcatattcat ttgtgctgac aaatcgtgga 3421 aacagtgctg gtgggctgca gcacagggtc acggcaaaga ctgcaccaga tgtattacgt 3481 accaagcctg ccttcattgg gaagaccaac ttggatggca tgattactgt gcaactgcct 3541 gaagtacctg caaatgagaa tataaaaggt tactacataa taattgtgcc tttgaagaaa 3601 tctcgcggga aatttatcaa gccatgggag agtccagatg aaatggaatt agatgagctg 3661 cttaaggaga tatctaggaa gcgcagaagc atccgttatg ggagagaagt tgaattaaag 3721 ccatatattg ccgctcactt tgatgtcctt cccactgagt tcaccctggg ggatgacaag 3781 cattatggtg gatttacaaa caagcaactc caaagtggtc aagaatatgt cttctttgtg 3841 ttagcagtaa tggaacatgc agagtctaag atgtatgcaa ccagccctta ctccgacccc 3901 gtggtgtcaa tggatctgga tccgcagcca atcacggatg aagaagaagg cttgatctgg 3961 gttgtaggtc ctgtccttgc agtggtcttt atcatctgca ttgtcattgc tattcttctt 4021 tataaaagga agagggcaga gtccgactct agaaaaagca gcataccgaa caataaggag 4081 atcccttcac accacccaac agaccctgta gaactgaggc gccttaactt tcaaacaccg 4141 ggtatggcta gccatcctcc aatacccatc ttggaacttg cagaccacat tgaaagattg 4201 aaagcaaatg acaacttgaa gttttcccag gaatatgagt caattgaccc tggccagcag 4261 ttcacttggg aacattcaaa cttggaagta aacaaaccaa agaatagata cgcgaatgta 4321 atcgcatatg atcattcccg ggttctccta tcagctatag aagggatccc aggaagtgac 4381 tatgtgaatg ccaactacat agatgggtat aggaagcaaa atgcctatat tgcaacacag 4441 ggatctctcc ccgaaacatt tggggacttt tggagaatga tatgggaaca acggagtgcc 4501 acagttgtca tgatgacaaa actagaagaa agatcaaggg tgaagtgtga ccagtattgg 4561 cctagcagag gcacagaaac ccacggactc gttcaagtaa cgctgcttga tactgtggag 4621 ctggccacat attgtgttcg aacatttgca ctttacaaga atggttcaag tgagaagaga 4681 gaagtgagac aattccagtt caccgcctgg cctgatcatg gtgttccaga acaccctaca 4741 ccttttctag ctttcttacg tagagtcaaa acctgtaacc ctcccgatgc tggtccgatg 4801 gttgtgcact gcagtgcggg agttggccgg actggttgct tcatcgtcat agatgccatg 4861 ttagaaagaa taaagcatga aaaaactgta gatatttatg gccatgtaac tttaatgaga 4921 gcccagagga actatatggt tcaaacagaa gaccaataca tctttatcca tgatgcactg 4981 ttagaagcag tgacttgtgg aaataccgaa gtgccagcta gaaacttgta tgcctacatt 5041 cagaagctga cacaaataga aacgggagag aatgtcacag gaatggagct cgaatttaag 5101 cgtctagcca gctcaaaagc tcacacctca aggtttatca gtgccaatct tccatgtaat 5161 aaattcaaaa atcgccttgt taatattatg ccatatgaat ccacaagggt atgcctgcag 5221 cctatccgtg gagtagaagg atctgattac atcaatgcca gttttattga tggatacaga 5281 caacagaaag cctacatcgc tacccagggg cccttggcag agaccactga agacttctgg 5341 cggatgctct gggaacacaa ttccaccata gttgtgatgc tcaccaagct gcgtgaaatg 5401 ggcagagaga aatgtcacca atactggcca gcagaacggt ctgcaagata ccagtacttt 5461 gttgtagatc ccatggctga gtacaacatg ccacagtata tcctaaggga attcaaggtc 5521 acagatgcca gggacggcca gtcccgaaca gtaaggcagt tccagttcac tgactggcca 5581 gagcaaggag tgccaaagtc cggagaagga tttattgact tcatcggcca agtccataaa 5641 acaaaagaac agtttggcca agatggaccc atttcagtcc attgcagcgc gggcgttgga 5701 agaactggag tcttcataac gctaagcatt gttttggaaa gaatgagata tgaaggagtt 5761 gtagatatct tccagactgt caaaatgtta agaacacaac gaccagctat ggtacagaca 5821 gaggatcaat atcagttttc ctatcgtgcc gcactagagt acctgggcag ctttgaccac 5881 tatgcaacgt agaaacccct gacccattct ggatttttac tacaggccct tcaatatcca 5941 tggagtctct tctgagccat acagggcact tgagaagtcc ttcttaactt ctagctaaca 6001 actacttagt gggactatta cacacaaaac aaattaaaaa caaattattc caggtggacc 6061 aagaattctt tgacatcgcc ccttcccacc atactgctca taataacatt ttaggggcca 6121 aggggaggga atgtttaaaa agaaagtcct tgatttagtt ttttagtatt gtaaagatac 6181 tgctgacctg tgcttcattt ctaactgtgt aaactttttt ttaacaaaat gtatcattcg 6241 ataaagtgaa ttttaaaaaa gtt // LOCUS HUMPTPFK 2591 bp mRNA PRI 27-OCT-1994 DEFINITION Human mRNA for platelet-type phosphofructokinase, complete cds. ACCESSION D25328 NID g464186 KEYWORDS phosphofructokinase; platelet-type phosphofructokinase. SOURCE Homo sapiens pancreatic islets (library: lambda gt11) cDNA to mRNA, clone F7. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Simpson,C.J. and Fothergill-Gilmore,L.A. TITLE Isolation and sequence of a cDNA encoding human platelet phosphofructokinase JOURNAL Biochemical and Biophysical Research Communication 180, 197-203 (1991) REFERENCE 2 (bases 1 to 2591) AUTHORS Eto,K., Sakura,H., Yasuda,K., Hayakawa,T., Kawasaki,E., Moriuchi,R., Nagataki,S., Yazaki,Y. and Kadowaki,T. TITLE Cloning of a complete protein-coding sequence of human platelet-type phosphofructokinase isozyme from pancreatic islet JOURNAL Cellular and Molecular Biology Research 198, 990-998 (1994) REFERENCE 3 (bases 1 to 2591) AUTHORS Eto,K. TITLE Direct Submission JOURNAL Submitted (22-NOV-1993) to the DDBJ/EMBL/GenBank databases. Kazuhiro Eto, Faculty of Medicine, University of Tokyo, The Third Dept. of Internal Medicine; 7-3-1 Hongo, Bunkyo-ku, Tokyo 113, Japan (Tel:03-3815-5411(ex.8296), Fax:03-3815-2087) COMMENT Submitted (22-Nov-1993) to DDBJ by: Kazuhiro Eto The Third Department of Internal Medicine Faculty of Medicine University of Tokyo 7-3-1 Hongo, Bunkyo-ku Tokyo 113 Japan Phone: 03-3815-5411 x8296 Fax: 03-3815-2087. FEATURES Location/Qualifiers source 1..2591 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda gt11" /tissue_type="pancreatic islets" CDS 34..2388 /EC_number="2.7.1.11" /codon_start=1 /product="platelet-type phosphofructokinase" /db_xref="PID:d1005539" /db_xref="PID:g560105" /translation="MDADDSRAPKGSLRKFLEHLSGAGKAIGVLTSGGDAQGMNAAVR AVVRMGIYVGAKVYFIYEGYQGMVDGGSNIAEADWESVSSILQVGGTIIGSARCQAFR TREGRLKAACNLLQRGITNLCVIGGDGSLTGANLFRKEWSGLLEELARNGQIDKEAVQ KYAYLNVVGMVGSIDNDFCGTDMTIGTDSALHRIIEVVDAIMTTAQSHQRTFVLEVMG RHCGYLALVSALACGADWVFLPESPPEEGWEEQMCVKLSENRARKKRLNIIIVAEGAI DTQNKPITSEKIKELVVTQLGYDTRVTILGHVQRGGTPSAFDRILASRMGVEAVIALL EATPDTPACVVSLNGNHAVRLPLMECVQMTQDVQKAMDERRFQDAVRLRGRSFAGNLN TYKRLAIKLPDDQIPKTNCNVAVINVGAPAAGMNAAVRSAVRVGIADGHRMLAIYDGF DGFAKGQIKEIGWTDVGGWTGQGGSILGTKRVLPGKYLEEIATQMRTHSINALLIIGG FEAYLGLLELSAAREKHEEFCVPMVMVPATVSNNVPGSDFSIGADTALNTITDTCDRI KQSASGTKRRVFIIETMGGYCGYLANMGGLAAGADAAYIFEEPFDIRDLQSNVEHLTE KMKTTIQRGLVLRNESCSENYTTDFIYQLYSEEGKGVFDCRKNVLGHMQQGGAPSPFD RNFGTKISARAMEWITAKLKEARGRGKKFTTDDSICVLGISKRNVIFQPVAELKKQTD FEHRIPKEQWWLKLRPLMKILAKYKASYDVSDSGQLEHVQPWSV" variation 1314 /note="replace(1314, 'a')" variation 1392 /note="replace(1392, 't')" misc_difference 1524..1526 /note="replace(1524..1526, '')" /citation=[1] variation 1827 /note="replace(1827, 't')" misc_difference 2129 /note="replace(2129, 'a')" /citation=[1] variation 2432 /note="replace(2432, 'c')" variation 2525 /note="replace(2525, 'c')" polyA_signal 2586..2591 BASE COUNT 592 a 713 c 790 g 496 t ORIGIN Chromosome 10p15.2-p15.3. 1 cccggacgtg cggctcccct cggcctcctc gccatggacg cggacgactc ccgggccccc 61 aagggctcct tgcggaagtt cctggagcac ctctccgggg ccggcaaggc catcggcgtg 121 ctgaccagcg gcggggatgc tcaaggtatg aacgctgccg tccgtgccgt ggtgcgcatg 181 ggtatctacg tgggggccaa ggtgtacttc atctacgagg gctaccaggg catggtggac 241 ggaggctcaa acatcgcaga ggccgactgg gagagtgtct ccagcatcct gcaagtgggc 301 gggacgatca ttggcagtgc gcggtgccag gccttccgca cgcgggaagg ccgcctgaag 361 gctgcttgca acctgctgca gcgcggcatc accaacctgt gtgtgatcgg cggggacggg 421 agcctcaccg gggccaacct cttccggaag gagtggagtg ggctgctgga ggagctggcc 481 aggaacggcc agatcgataa ggaggccgtg cagaagtacg cctacctcaa cgtggtgggc 541 atggtgggct ccatcgacaa tgatttctgc ggcaccgaca tgaccatcgg cacggactcc 601 gccctgcaca ggatcatcga ggtcgtcgac gccatcatga ccacggccca gagccaccag 661 aggaccttcg ttctggaggt gatgggacga cactgtgggt acctggccct ggtgagtgcc 721 ttggcctgcg gtgcggactg ggtgttcctt ccagaatctc caccagagga aggctgggag 781 gagcagatgt gtgtcaaact ctcggagaac cgtgcccgga aaaaaaggct gaatattatt 841 attgtggctg aaggagcaat tgatacccaa aataaaccca tcacctctga gaaaatcaaa 901 gagcttgtcg tcacgcagct gggctatgac acacgtgtga ccatcctcgg gcacgtgcag 961 agaggaggga ccccttcggc attcgacagg atcttggcca gccgcatggg agtggaggca 1021 gtcatcgcct tgctagaggc caccccggac accccagctt gcgtcgtgtc actgaacggg 1081 aaccacgccg tgcgcctgcc gctgatggag tgcgtgcaga tgactcagga tgtgcagaag 1141 gcgatggacg agaggagatt tcaagatgcg gttcgactcc gagggaggag ctttgcgggc 1201 aacctgaaca cctacaagcg acttgccatc aagctgccgg atgatcagat cccaaagacc 1261 aattgcaacg tagctgtcat caacgtgggg gcacccgcgg ctgggatgaa cgcggccgta 1321 cgctcagctg tgcgcgtggg cattgccgac ggccacagga tgctcgccat ctatgatggc 1381 tttgacggct tcgccaaggg ccagatcaaa gaaatcggct ggacagatgt cgggggctgg 1441 accggccaag gaggctccat tcttgggaca aaacgcgttc tcccggggaa gtacttggaa 1501 gagatcgcca cacagatgcg cacgcacagc atcaacgcgc tgctgatcat cggtggattc 1561 gaggcctacc tgggactcct ggagctgtca gccgcccggg agaagcacga ggagttctgt 1621 gtccccatgg tcatggttcc cgctactgtg tccaacaatg tgccgggttc cgatttcagc 1681 atcggggcag acaccgccct gaacactatc accgacacct gcgaccgcat caagcagtcc 1741 gccagcggaa ccaagcggcg cgtgttcatc atcgagacca tgggcggcta ctgtggctac 1801 ctggccaaca tgggggggct cgcggccgga gctgatgccg catacatttt cgaagagccc 1861 ttcgacatca gggatctgca gtccaacgtg gagcacctga cggagaaaat gaagaccacc 1921 atccagagag gccttgtgct cagaaatgag agctgcagtg aaaactacac caccgacttc 1981 atttaccagc tgtattcaga agagggcaaa ggcgtgtttg actgcaggaa gaacgtgctg 2041 ggtcacatgc agcagggtgg ggcaccctct ccatttgata gaaactttgg aaccaaaatc 2101 tctgccagag ctatggagtg gatcactgca aaactcaagg aggcccgggg cagaggaaaa 2161 aaatttacca ccgatgattc catttgtgtg ctgggaataa gcaaaagaaa cgttattttt 2221 caacctgtgg cagagctgaa gaagcaaacg gattttgagc acaggattcc caaagaacag 2281 tggtggctca agctacggcc cctcatgaaa atcctggcca agtacaaggc cagctatgac 2341 gtgtcggact caggccagct ggaacatgtg cagccctgga gtgtctgacc cagtcccgcc 2401 tgcatgtgcc tgcagccacc gtggactgtc tgtttttgta acacttaagt tattttatca 2461 gcactttatg cacgtattat tgacattaat acctaatcgg cgagtgccca tctgccccac 2521 cagctccagt gcgtgctgtc tgtggagtgt gtctcatgct ttcagatgtg catatgagca 2581 gaattaatta a // LOCUS HUMPTPPEST 3160 bp mRNA PRI 16-JUN-1993 DEFINITION Human protein tyrosine phosphatase (PTP-PEST) mRNA, complete cds. ACCESSION M93425 NID g292408 KEYWORDS protein tyrosine phosphatase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3160) AUTHORS Yang,Q.C., Tonks,N.K. and Sommercorn,J. TITLE Cloning and expression of PTP-PEST: a novel, nontransmembrane protein tyrosine phosphatase JOURNAL J. Biol. Chem. 268, 6622-6628 (1993) MEDLINE 93203262 FEATURES Location/Qualifiers source 1..3160 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 30..2372 /codon_start=1 /product="protein tyrosine phosphatase" /db_xref="PID:g292409" /translation="MEQVEILRKFIQRVQAMKSPDHNGEDNFARDFMRLRRLSTKYRT EKIYPTATGEKEENVKKNRYKDILPFDHSRVKLTLKTPSQDSDYINANFIKGVYGPKA YVATQGPLANTVIDFWRMIWEYNVVIIVMACREFEMGRKKCERYWPLYGEDPITFAPF KISCEDEQARTDYFIRTLLLEFQNESRRLYQFHYVNWPDHDVPSSFDSILDMISLMRK YQEHEDVPICIHCSAGCGRTGAICAIDYTWNLLKAGKIPEEFNVFNLIQEMRTQRHSA VQTKEQYELVHRAIAQLFEKQLQLYEIHGAQKIADGVNEINTENMISSIEPEKQDSPP PKPPRTRSCLVEGDAKEEILQPPEPHPVPPILTPSPPSAFPTVTTVWQDNDRYHPKPV LHMVSSEQHSADLNRNYSKSTELPGKNESTIEQIDKKLERNLSFEIKKVPLQEGPKSF DGNTLLNRGHAIKIKSASPCIADKISKPQELSSDLNVGDTSQNSCVDCSVTQSNKVSV TPPEESQNSDTPPRPDRLPLDEKGHVTWSFHGPENAIPIPDLSEGNSSDINYQTRKTV SLTPSPTTQVETPDLVDHDNTSPLFRTPLSFTNPLHSDDSDSDERNSDGAVTQNKTNI STASATVSAATSTESISTRKVLPMSIARHNIAGTTHSGAEKDVDVSEDSPPPLPERTP ESFVLASEHNTPVRSEWSELQSQERSEQKKSEGLITSENEKCDHPAGGIHYEMCIECP PTFSDKREQISENPTEATDIGFGNRCGKPKGPRDPPSEWT" BASE COUNT 1077 a 586 c 612 g 885 t ORIGIN 1 agcgaccgca gccgggggga cgcgggagga tggagcaagt ggagatcctg aggaaattca 61 tccagagggt ccaggccatg aagagtcctg accacaatgg ggaggacaac ttcgcccggg 121 acttcatgcg gttaagaaga ttgtctacca aatatagaac agaaaagata tatcccacag 181 ccactggaga aaaagaagaa aatgttaaaa agaacagata caaggacata ctgccatttg 241 atcacagccg agttaaattg acattaaaga ctccttcaca agattcagac tatatcaatg 301 caaattttat aaagggcgtc tatgggccaa aagcatatgt agcaactcaa ggacctttag 361 caaatacagt aatagatttt tggaggatga tatgggagta taatgttgtg atcattgtaa 421 tggcctgccg agaatttgag atgggaagga aaaaatgtga gcgctattgg cctttgtatg 481 gagaagaccc cataacgttt gcaccattta aaatttcttg tgaggatgaa caagcaagaa 541 cagactactt catcaggaca ctcttacttg aatttcaaaa tgaatctcgt aggctgtatc 601 agtttcatta tgtgaactgg ccagaccatg atgttccttc atcatttgat tctattctgg 661 acatgataag cttaatgagg aaatatcaag aacatgaaga tgttcctatt tgtattcatt 721 gcagtgcagg ctgtggaaga acaggtgcca tttgtgccat agattatacg tggaatttac 781 taaaagctgg gaaaatacca gaggaattta atgtatttaa tttaatacaa gaaatgagaa 841 cacaaaggca ttctgcagta caaacaaagg agcaatatga acttgttcat agagctattg 901 cccaactgtt tgaaaaacag ctacaactat atgaaattca tggagctcag aaaattgctg 961 atggagtgaa tgaaattaac actgaaaaca tgatcagctc catagagcct gaaaaacaag 1021 attctcctcc tccaaaacca ccaaggaccc gcagttgcct tgttgaaggg gatgctaaag 1081 aagaaatact gcagccaccg gaacctcatc cagtgccacc catcttgaca ccttctcccc 1141 cttcagcttt tccaacagtc actactgtgt ggcaggacaa tgatagatac catccaaagc 1201 cagtgttgca tatggtttca tcagaacaac attcagcaga cctcaacaga aactatagta 1261 aatcaacaga acttccaggg aaaaatgaat caacaattga acagatagat aaaaaattgg 1321 aacgaaattt aagttttgag attaagaagg tccctctcca agagggacca aaaagttttg 1381 atgggaacac acttttgaat aggggacatg caattaaaat taaatctgct tcaccttgta 1441 tagctgataa aatctctaag ccacaggaat taagttcaga tctaaatgtc ggtgatactt 1501 cccagaattc ttgtgtggac tgcagtgtaa cacaatcaaa caaagtttca gttactccac 1561 cagaagaatc ccagaattca gacacacctc caaggccaga ccgcttgcct cttgatgaga 1621 aaggacatgt aacgtggtca tttcatggac ctgaaaatgc catacccata cctgatttat 1681 ctgaaggcaa ttcctcagat atcaactatc aaactaggaa aactgtgagt ttaacaccaa 1741 gtcctacaac acaagttgaa acacctgatc ttgtggatca tgataacact tcaccactct 1801 tcagaacacc cctcagtttt actaatccac ttcactctga tgactcagac tcagatgaaa 1861 gaaactctga tggtgctgtg acccagaata aaactaatat ttcaacagca agtgccacag 1921 tttctgctgc cactagtact gaaagcattt ctactaggaa agtattgcca atgtccattg 1981 ctagacataa tatagcagga acaacacatt caggtgctga aaaagatgtt gatgttagtg 2041 aagattcacc tcctccccta cctgaaagaa ctcctgaatc gtttgtgtta gcaagtgaac 2101 ataatacacc tgtaagatcg gaatggagtg aacttcaaag tcaggaacga tctgaacaaa 2161 aaaagtctga aggcttgata acctctgaaa atgagaaatg tgatcatcca gcgggaggta 2221 ttcactatga aatgtgcata gaatgtccac ctactttcag tgacaagaga gaacaaatat 2281 cagaaaatcc aacagaagcc acagatattg gttttggtaa tcgatgtgga aaacccaaag 2341 gaccaagaga tccaccttca gaatggacat gattcaggga gctagaagac actttaagtt 2401 atactggaaa attcaggtgc cactgaaagc cagatttata gtattccatc tttaatatgt 2461 gggactaaca gcagtgtaga ttgttacctt aatatttttt gctgggacca tctacctgcc 2521 ttatactaca cttaggaaaa agtattacat atggtttatt ttgaaacttc aagtattatt 2581 gccttaatgt ctcttaaccc tgttacacgc tgcttgtaga catgttaata tagtaatacc 2641 tttatgatat attgagttta aggactactc tttttctgtt ttatcatgta tgcattattt 2701 tgtatatgta cagggcaagt aggtatataa tttgataaag ttgcaattga aatattatta 2761 acagaagatg taagaaattt ctgcatggtc taaatctttg tgtactttat ttgtaaatta 2821 tttgccctgg agttttagaa aatagtttct gaattttaaa cttgctggat tcatgcagcc 2881 agctttgcag gttatcagag atcaaagatt gtaataataa ttttgtaaat tgtaagcaaa 2941 aagttatttt tatattatat acagtctaat tgttcatcct aattgttcct gttttcatct 3001 agtcagagat tcagtaagtg ccttggaaca atattgaatt ctcttagctt gtgtgtgttt 3061 ctttaatatt tgaactcaag tgggattaga agactatcaa aatacatgta tgtttcagat 3121 atttgacctg tcattaaaaa aaacaaacag ttttacagtg // LOCUS HUMPTPRG 4707 bp mRNA PRI 08-JAN-1995 DEFINITION Human receptor-type protein tyrosine phosphatase gamma (PTPRG) mRNA, complete cds. ACCESSION L09247 NID g292410 KEYWORDS receptor-type protein tyrosine phosphatase gamma. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4707) AUTHORS Barnea,G., Silvennoinen,O., Shaanan,B., Honegger,A.M., Canoll,P.D., D'Eustachio,P., Morse,B., Levy,J.B., LaForgia,S., Huebner,K., Musacchio,J., Sap,J. and Schlessinger,J. TITLE Identification of a carbonic anhydrase-like domain in the extracellular region of RPTP gamma defines a new subfamily of receptor tyrosine phosphatases JOURNAL Mol. Cell. Biol. 13 (3), 1497-1506 (1993) MEDLINE 93180796 FEATURES Location/Qualifiers source 1..4707 /organism="Homo sapiens" /db_xref="taxon:9606" /map="3p21-p14" sig_peptide 80..136 /gene="PTPRG" /note="G00-127-351" gene 80..4417 /gene="PTPRG" CDS 80..4417 /gene="PTPRG" /codon_start=1 /db_xref="GDB:G00-127-351" /product="receptor-type protein tyrosine phosphatase gamma" /db_xref="PID:g292411" /translation="MRRLLEPCWWILFLKITSSVLHYVVCFPALTEGYVGALHENRHG SAVQIRRRKASGDPYWAYSGAYGPEHWVTSSVSCGSRHQSPIDILDQYARVGEEYQEL QLDGFDNESSNKTWMKNTGKTVAILLKDDYFVSGAGLPGRFKAEKVEFHWGHSNGSAG SEHSINGRRFPVEMQIFFYNPDDFDSFQTAISENRIIGAMAIFFQVSPRDNSALDPII HGLKGVVHHEKETFLDPFVLRDLLPASLGSYYRYTGSLTTPPCSEIVEWIVFRRPVPI SYHQLEAFYSIFTTEQQDHVKSVEYLRNNFRPQQRLHDRVVSKSAVRDSWNHDMTDFL ENPLGTEASKVCSSPPIHMKVQPLNQTALQVSWSQPETIYHPPIMNYMISYSWTKNED EKEKTFTKDSDKDLKATISHVSPDSLYLFRVQAVCRNDMRSDFSQTMLFQANTTRIFQ GTRIVKTGVPTASPASSADMAPISSGSSTWTSSGIPFSFVSMATGMGPSSSGSQATVA SVVTSTLLAGLGFGGGGISSFPSTVWPTRLPTAASASKQAARPVLATTEALASPGPDG DSSPTKDGEGTEEGEKDEKSESEDGEREHEEDGEKDSEKKEKSGVTHAAEERNQTEPS PTPSSPNRTAEGGHQTIPGHEQDHTAVPTDQTGGRRDAGPGLDPDMVTSTQVPPTATE EQYAGSDPKRPEMPSKKPMSRGDRFSEDSRFITVNPAEKNTSGMISRPAPGRMEWIIP LIVVSALTFVCLILLIAVLVYWRGCNKIKSKGFPRRFREVPSSGERGEKGSRKCFQTA HFYVEDSSSPRVVPNESIPIIPIPDDMEAIPVKQFVKHIGELYSNNQHGFSEDFEEVQ RCTADMNITAEHSNHPENKHKNRYINILAYDHSRVKLRPLPGKDSKHSDYINANYVDG YNKAKAYIATQGPLKSTFEDFWRMIWEQNTGIIVMITNLVEKGRRKCDQYWPTENSEE YGNIIVTLKSTKIHACYTVRRFSIRNTKVKKGQKGNPKGRQNERVVIQYHYTQWPDMG VPEYALPVLTFVRRSSAARMPETGPVLVHCSAGVGRTGTYIVIDSMLQQIKDKSTVNV LGFLKHIRTQRNYLVQTEEQYIFIHDALLEAILGKETEVSSNQLHSYVNSILIPGVGG KTRLEKQFKLVTQCNAKYVECFSAQKECNKEKNRNSSVVPSERARVGLAPLPGMKGTD YINASYIMGYYRSNEFIITQHPLPHTTKDFWRMIWDHNAQIIVMLPDNQSLAEDEFVY WPSREESMNCEAFTVTLISKDRLCLSNEEQIIIHDFILEATQDDYVLEVRHFQCPKWP NPDAPISSTFELINVIKEEALTRDGPTIVHDEYGAVSAGMLCALTTLSQQLENENAVD VFQVAKMINLMRPGVFTDIEQYQFIYKARLSLVSTKENGNGPMTVDKNGAVLIADESD PAESMESLV" BASE COUNT 1300 a 1145 c 1154 g 1108 t ORIGIN 1 cggaggctcg cacggaggca agaacttatt caacaagttt acctccctgc tttcctcttt 61 tcgatgtgcg ttttcggaca tgcggaggtt actggaaccg tgttggtgga ttttgttcct 121 gaaaatcacc agttccgtgc tccattatgt cgtgtgcttc cccgcgttga cagaaggcta 181 cgttggggcc ctgcacgaga atagacacgg cagcgcagtg cagatccgca ggcgcaaggc 241 ttcaggcgac ccgtactggg cctactctgg tgcctatggt cctgagcact gggtcacgtc 301 tagtgtcagc tgtgggagcc gtcaccagtc tcctattgac attttagacc agtatgcgcg 361 tgttggggaa gaataccagg aactgcaact cgatggcttc gacaatgagt cttctaacaa 421 aacctggatg aaaaacacag ggaaaacagt cgccatcctt ctgaaagacg actattttgt 481 cagtggagct ggtctacctg gcagattcaa agctgagaag gtggaatttc actggggcca 541 cagcaatggc tcagcgggct ctgaacacag catcaatggc aggaggtttc ctgttgagat 601 gcagattttc ttttacaatc cagatgactt tgacagcttt caaaccgcaa tttctgagaa 661 cagaataatc ggagccatgg ccatattttt tcaagtcagt ccgagggaca attctgcact 721 ggatcctatt atccacgggt tgaagggtgt cgtacatcat gagaaggaga cctttctgga 781 tcctttcgtc ctccgggacc tcctgcctgc atccctgggc agctattatc ggtacacagg 841 ttccttgacc acaccaccgt gtagcgaaat agtggagtgg atagtcttcc ggagacccgt 901 ccccatctct taccatcagc ttgaggcttt ttattccatc ttcaccacgg agcagcaaga 961 ccatgtcaag tcggtggagt atctgagaaa taactttcga ccacagcagc gtctgcatga 1021 cagggtggtg tccaagtccg ccgtccgtga ctcctggaac cacgacatga cagacttctt 1081 agaaaaccca ctggggacag aagcctctaa agtttgcagc tctccaccca tccacatgaa 1141 ggtgcagcct ctgaaccaga cggcactgca ggtgtcctgg agccagccgg agactatcta 1201 ccacccaccc atcatgaact acatgatctc ctacagctgg accaagaatg aggacgagaa 1261 ggagaagacg tttacaaagg acagcgacaa agacttgaaa gccaccatta gccatgtctc 1321 acccgatagc ctttacctgt tccgagtcca ggccgtgtgt cggaacgaca tgcgcagcga 1381 ctttagccag acgatgctgt ttcaagctaa taccactcga atattccaag ggaccagaat 1441 agtgaaaaca ggagtgccca cagcgtctcc tgcctcttca gccgacatgg cccccatcag 1501 ctcggggtct tctacctgga cgtcctctgg catcccattc tcatttgttt ccatggcaac 1561 tgggatgggc ccctcctcca gtggcagcca ggccacagtg gcctcggtgg tcaccagcac 1621 gctgctcgcc ggcctggggt tcggcggtgg tggcatctcc tctttcccca gcactgtgtg 1681 gcccacgcgc ctcccgacgg ccgcctcagc cagcaagcag gcggctaggc cagtcctagc 1741 caccacagag gccttggctt ctccagggcc cgatggtgat tcgtcaccaa ccaaggacgg 1801 cgagggcacc gaggaaggag agaaggatga gaaaagcgag agtgaggatg gggagcggga 1861 gcacgaggag gatggagaga aggactccga aaagaaggag aagagtgggg tgacccacgc 1921 tgccgaggag cggaatcaga cggagcccag ccccacaccc tcgtctccta acaggactgc 1981 cgagggaggg catcagacta tacctgggca tgagcaggat cacactgccg tccccacaga 2041 ccagacgggc ggaaggaggg atgccggccc aggcctggac cccgacatgg tcacctccac 2101 ccaagtgccc cccaccgcca cagaggagca gtatgcaggg agtgatccca agaggcccga 2161 aatgccatct aaaaagccta tgtcccgcgg ggaccgattt tctgaagaca gcagatttat 2221 cactgttaat ccagcggaaa aaaacacctc tggaatgata agccgccctg ctccagggag 2281 gatggagtgg atcatccctc tgattgtggt atcagccttg accttcgtgt gcctcatcct 2341 tctcattgct gtgctcgttt actggagagg gtgtaacaaa ataaagtcca agggctttcc 2401 cagacgtttc cgtgaagtgc cttcttctgg ggagagagga gagaagggga gcagaaaatg 2461 ttttcagact gctcatttct atgtggaaga cagcagttca cctcgagtgg tccctaatga 2521 aagtattcct attattccta ttccggatga catggaagcc attcctgtca aacagtttgt 2581 caaacacatc ggtgagctct attctaataa ccagcatggg ttctctgagg attttgagga 2641 agtccagcgc tgtactgctg atatgaacat cactgcagag cattccaatc atccagaaaa 2701 caagcacaaa aacagataca tcaacatttt agcatatgat cacagtaggg tgaagttaag 2761 acctttacca ggaaaagact ctaagcacag cgactacatt aatgcaaact atgttgatgg 2821 ttacaacaaa gcaaaagcct acattgccac ccaaggacct ttgaagtcta catttgaaga 2881 tttctggagg atgatttggg aacaaaacac tggaatcatt gtgatgatta cgaaccttgt 2941 ggaaaaagga agacgaaaat gtgatcagta ttggccaaca gagaacagtg aggaatatgg 3001 aaacattatt gtcacgctga agagcacaaa aatacatgcc tgctacactg ttcgtcgttt 3061 ttcaatcaga aatacaaaag tgaaaaaggg tcagaaggga aatcccaagg gtcgtcagaa 3121 tgaaagggta gtgatccagt atcactatac acagtggcct gacatgggag ttcccgagta 3181 tgcccttcca gtactgactt tcgtgaggag atcctcagca gctcggatgc cagaaacggg 3241 ccctgtgttg gtgcactgca gtgctggtgt gggcagaaca ggcacctata ttgtaataga 3301 cagcatgctg caacagataa aagacaaaag cacagttaac gtcctgggat tcctgaagca 3361 tatcaggaca cagcgtaact acctcgtcca gactgaggag cagtacattt tcatccatga 3421 tgccttgttg gaagccattc ttggaaagga gactgaagta tcttcaaatc agctgcacag 3481 ctatgttaac agcatcctta taccaggagt aggaggaaag acacgactgg aaaagcaatt 3541 caagctggtc acacagtgta atgcaaaata tgtggaatgt ttcagtgctc agaaagagtg 3601 taacaaagaa aagaacagaa actcttcagt tgtgccatct gagcgtgctc gagtgggtct 3661 tgcaccattg cctggaatga aaggaacaga ttacattaat gcttcttata tcatgggcta 3721 ttataggagc aatgaattta ttataactca gcatcctctg ccacatacta cgaaagattt 3781 ctggcgaatg atttgggatc ataacgcaca gatcattgtc atgctgccag acaaccagag 3841 cttggcagaa gatgagtttg tgtactggcc aagtcgagaa gaatccatga actgtgaggc 3901 ctttaccgtc acccttatca gcaaagacag actgtgcctc tctaatgaag aacaaattat 3961 catccatgac tttatccttg aagctacaca ggatgactat gtcttagaag ttcggcactt 4021 tcagtgtccc aaatggccta acccagatgc ccccataagt agtacctttg aacttatcaa 4081 cgtcatcaag gaagaggcct taacaaggga tggtcccacc attgttcatg atgagtatgg 4141 agcagtttca gcaggaatgt tatgtgccct taccaccctg tcccagcaac tggagaatga 4201 aaatgctgtg gatgttttcc aggttgcaaa aatgatcaat cttatgaggc ctggagtatt 4261 cacagacatt gaacaatacc agttcatcta taaagcaagg cttagcttgg tcagcactaa 4321 agaaaatgga aatggtccca tgacagtaga caaaaatggt gctgttctta ttgcagatga 4381 atcagaccct gctgagagca tggagtccct agtgtgactg gaatcctgaa agggcactta 4441 atttgtaaac ttctgaagac tgagaacttt tttgaggcct tttttgccag actctaggtt 4501 atacaataac ccagttactt ttttacactg ataaaagttt tgatatttat tttttgccat 4561 tttatgtctt aatggtatcc tactgagcat ttgcacctct gttcatttca cacagtgaaa 4621 cgcaatttta cctagtttgc actatatgat cagtgttact gcctataatc ttatacaaca 4681 gcaaaccctg atgtgacatt ccatgac // LOCUS HUMPTPRZ 7941 bp mRNA PRI 08-JAN-1995 DEFINITION Human protein tyrosine phosphatase zeta-polypeptide (PTPRZ) mRNA, complete cds. ACCESSION M93426 NID g190743 KEYWORDS carbonic anhydrase-related transmembrane protein; protein tyrosine phosphatase zeta-polypeptide. SOURCE Homo sapiens (tissue library: lambda gt10 and gt11) fetus brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7941) AUTHORS Krueger,N.X. and Saito,H. TITLE A human transmembrane protein-tyrosine-phosphatase, PTP zeta, is expressed in brain and has an N-terminal receptor domain homologous to carbonic anhydrases JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (16), 7417-7421 (1992) MEDLINE 92366472 FEATURES Location/Qualifiers source 1..7941 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="brain" /tissue_lib="lambda gt10 and gt11" /map="Unassigned" gene 148..7092 /gene="PTPRZ" CDS 148..7092 /gene="PTPRZ" /note="carbonic anhydrase-like domain" /codon_start=1 /db_xref="GDB:G00-127-353" /product="protein tyrosine phosphatase zeta-polypeptide" /db_xref="PID:g190744" /translation="MRILKRFLACIQLLCVCRLDWANGYYRQQRKLVEEIGWSYTGAL NQKNWGKKYPTCNSPKQSPINIDEDLTQVNVNLKKLKFQGWDKTSLENTFIHNTGKTV EINLTNDYRVSGGVSEMVFKASKITFHWGKCNMSSDGSEHSLEGQKFPLEMQIYCFDA DRFSSFEEAVKGKGKLRALSILFEVGTEENLDFKAIIDGVESVSRFGKQAALDPFILL NLLPNSTDKYYIYNGSLTSPPCTDTVDWIVFKDTVSISESQLAVFCEVLTMQQSGYVM LMDYLQNNFREQQYKFSRQVFSSYTGKEEIHEAVCSSEPENVQADPENYTSLLVTWER PRVVYDTMIEKFAVLYQQLDGEDQTKHEFLTDGYQDLGAILNNLLPNMSYVLQIVAIC TNGLYGKYSDQLIVDMPTDNPELDLFPELIGTEEIIKEEEEGKDIEEGAIVNPGRDSA TNQIRKKEPQISTTTHYNRIGTKYNEAKTNRSPTRGSEFSGKGDVPNTSLNSTSQPVT KLATEKDISLTSQTVTELPPHTVEGTSASLNDGSKTVLRSPHMNLSGTAESLNTVSIT EYEEESLLTSFKLDTGAEDSSGSSPATSAIPFISENISQGYIFSSENPETITYDVLIP ESARNASEDSTSSGSEESLKDPSMEGNVWFPSSTDITAQPDVGSGRESFLQTNYTEIR VDESEKTTKSFSAGPVMSQGPSVTDLEMPHYSTFAYFPTEVTPHAFTPSSRQQDLVST VNVVYSQTTQPVYNGETPLQPSYSSEVFPLVTPLLLDNQILNTTPAASSSDSALHATP VFPSVDVSFESILSSYDGAPLLPFSSASFSSELFRHLHTVSQILPQVTSATESDKVPL HASLPVAGGDLLLEPSLAQYSDVLSTTHAASETLEFGSESGVLYKTLMFSQVEPPSSD AMMHARSSGPEPSYALSDNEGSQHIFTVSYSSAIPVHDSVGVTYQGSLFSGPSHIPIP KSSLITPTASLLQPTHALSGDGEWSGASSDSEFLLPDTDGLTALNISSPVSVAEFTYT TSVFGDDNKALSKSEIIYGNETELQIPSFNEMVYPSESTVMPNMYDNVNKLNASLQET SVSISSTKGMFPGSLAHTTTKVFDHEISQVPENNFSVQPTHTVSQASGDTSLKPVLSA NSEPASSDPASSEMLSPSTQLLFYETSASFSTEVLLQPSFQASDVDTLLKTVLPAVPS DPILVETPKVDKISSTMLHLIVSNSASSENMLHSTSVPVFDVSPTSHMHSASLQGLTI SYASEKYEPVLLKSESSHQVVPSLYSNDELFQTANLEINQAHPPKGRHVFATPVLSID EPLNTLINKLIHSDEILTSTKSSVTGKVFAGIPTVASDTFVSTDHSVPIGNGHVAITA VSPHRDGSVTSTKLLFPSKATSELSHSAKSDAGLVGGGEDGDTDDDGDDDDDRDSDGL SIHKCMSCSSYRESQEKVMNDSDTHENSLMDQNNPISYSLSENSEEDNRVTSVSSDSQ TGMDRSPGKSPSANGLSQKHNDGKEENDIQTGSALLPLSPESKAWAVLTSDEESGSGQ GTSDSLNENETSTDFSFADTNEKDADGILAAGDSEITPGFPQSPTSSVTSENSEVFHV SEAEASNSSHESRIGLAEGLESEKKAVIPLVIVSALTFICLVVLVGILIYWRKCFQTA HFYLEDSTSPRVISTPPTPIFPISDDVGAIPIKHFPKHVADLHASSGFTEEFETLKEF YQEVQSCTVDLGITADSSNHPDNKHKNRYINIVAYDHSRVKLAQLAEKDGKLTDYINA NYVDGYNRPKAYIAAQGPLKSTAEDFWRMIWEHNVEVIVMITNLVEKGRRKCDQYWPA DGSEEYGNFLVTQKSVQVLAYYTVRNFTLRNTKIKKGSQKGRPSGRVVTQYHYTQWPD MGVPEYSLPVLTFVRKAAYAKRHAVGPVVVHCSAGVGRTGTYIVLDSMLQQIQHEGTV NIFGFLKHIRSQRNYLVQTEEQYVFIHDTLVEAILSKETEVLDSHIHAYVNALLIPGP AGKTKLEKQFQLLSQSNIQQSDYSAALKQCNREKNRTSSIIPVERSRVGISSLSGEGT DYINASYIMGYYQSNEFIITQHPLLHTIKDFWRMIWDHNAQLVVMIPDGQNMAEDEFV YWPNKDEPINCESFKVTLMAEEHKCLSNEEKLIIQDFILEATQDDYVLEVRHFQCPKW PNPDSPISKTFELISVIKEEAANRDGPMIVHDEHGGVTAGTFCALTTLMHQLEKENSV DVYQVAKMINLMRPGVFADIEQYQFLYKVILSLVSTRQEENPSTSLDSNGAALPDGNI AESLESLV" BASE COUNT 2406 a 1651 c 1598 g 2286 t ORIGIN 1 cacacatacg cacgcacgat ctcacttcga tctatacact ggaggattaa aacaaacaaa 61 caaaaaaaac atttccttcg ctccccctcc ctctccactc tgagaagcag aggagccgca 121 cggcgagggg ccgcagaccg tctggaaatg cgaatcctaa agcgtttcct cgcttgcatt 181 cagctcctct gtgtttgccg cctggattgg gctaatggat actacagaca acagagaaaa 241 cttgttgaag agattggctg gtcctataca ggagcactga atcaaaaaaa ttggggaaag 301 aaatatccaa catgtaatag cccaaaacaa tctcctatca atattgatga agatcttaca 361 caagtaaatg tgaatcttaa gaaacttaaa tttcagggtt gggataaaac atcattggaa 421 aacacattca ttcataacac tgggaaaaca gtggaaatta atctcactaa tgactaccgt 481 gtcagcggag gagtttcaga aatggtgttt aaagcaagca agataacttt tcactgggga 541 aaatgcaata tgtcatctga tggatcagag catagtttag aaggacaaaa atttccactt 601 gagatgcaaa tctactgctt tgatgcggac cgattttcaa gttttgagga agcagtcaaa 661 ggaaaaggga agttaagagc tttatccatt ttgtttgagg ttgggacaga agaaaatttg 721 gatttcaaag cgattattga tggagtcgaa agtgttagtc gttttgggaa gcaggctgct 781 ttagatccat tcatactgtt gaaccttctg ccaaactcaa ctgacaagta ttacatttac 841 aatggctcat tgacatctcc tccctgcaca gacacagttg actggattgt ttttaaagat 901 acagttagca tctctgaaag ccagttggct gttttttgtg aagttcttac aatgcaacaa 961 tctggttatg tcatgctgat ggactactta caaaacaatt ttcgagagca acagtacaag 1021 ttctctagac aggtgttttc ctcatacact ggaaaggaag agattcatga agcagtttgt 1081 agttcagaac cagaaaatgt tcaggctgac ccagagaatt ataccagcct tcttgttaca 1141 tgggaaagac ctcgagtcgt ttatgatacc atgattgaga agtttgcagt tttgtaccag 1201 cagttggatg gagaggacca aaccaagcat gaatttttga cagatggcta tcaagacttg 1261 ggtgctattc tcaataattt gctacccaat atgagttatg ttcttcagat agtagccata 1321 tgcactaatg gcttatatgg aaaatacagc gaccaactga ttgtcgacat gcctactgat 1381 aatcctgaac ttgatctttt ccctgaatta attggaactg aagaaataat caaggaggag 1441 gaagagggaa aagacattga agaaggcgct attgtgaatc ctggtagaga cagtgctaca 1501 aaccaaatca ggaaaaagga accccagatt tctaccacaa cacactacaa tcgcataggg 1561 acgaaataca atgaagccaa gactaaccga tccccaacaa gaggaagtga attctctgga 1621 aagggtgatg ttcccaatac atctttaaat tccacttccc aaccagtcac taaattagcc 1681 acagaaaaag atatttcctt gacttctcag actgtgactg aactgccacc tcacactgtg 1741 gaaggtactt cagcctcttt aaatgatggc tctaaaactg ttcttagatc tccacatatg 1801 aacttgtcgg ggactgcaga atccttaaat acagtttcta taacagaata tgaggaggag 1861 agtttattga ccagtttcaa gcttgatact ggagctgaag attcttcagg ctccagtccc 1921 gcaacttctg ctatcccatt catctctgag aacatatccc aagggtatat attttcctcc 1981 gaaaacccag agacaataac atatgatgtc cttataccag aatctgctag aaatgcttcc 2041 gaagattcaa cttcatcagg ttcagaagaa tcactaaagg atccttctat ggagggaaat 2101 gtgtggtttc ctagctctac agacataaca gcacagcccg atgttggatc aggcagagag 2161 agctttctcc agactaatta cactgagata cgtgttgatg aatctgagaa gacaaccaag 2221 tccttttctg caggcccagt gatgtcacag ggtccctcag ttacagatct ggaaatgcca 2281 cattattcta cctttgccta cttcccaact gaggtaacac ctcatgcttt taccccatcc 2341 tccagacaac aggatttggt ctccacggtc aacgtggtat actcgcagac aacccaaccg 2401 gtatacaatg gtgagacacc tcttcaacct tcctacagta gtgaagtctt tcctctagtc 2461 acccctttgt tgcttgacaa tcagatcctc aacactaccc ctgctgcttc aagtagtgat 2521 tcggccttgc atgctacgcc tgtatttccc agtgtcgatg tgtcatttga atccatcctg 2581 tcttcctatg atggtgcacc tttgcttcca ttttcctctg cttccttcag tagtgaattg 2641 tttcgccatc tgcatacagt ttctcaaatc cttccacaag ttacttcagc taccgagagt 2701 gataaggtgc ccttgcatgc ttctctgcca gtggctgggg gtgatttgct attagagccc 2761 agccttgctc agtattctga tgtgctgtcc actactcatg ctgcttcaga gacgctggaa 2821 tttggtagtg aatctggtgt tctttataaa acgcttatgt tttctcaagt tgaaccaccc 2881 agcagtgatg ccatgatgca tgcacgttct tcagggcctg aaccttctta tgccttgtct 2941 gataatgagg gctcccaaca catcttcact gtttcttaca gttctgcaat acctgtgcat 3001 gattctgtgg gtgtaactta tcagggttcc ttatttagcg gccctagcca tataccaata 3061 cctaagtctt cgttaataac cccaactgca tcattactgc agcctactca tgccctctct 3121 ggtgatgggg aatggtctgg agcctcttct gatagtgaat ttcttttacc tgacacagat 3181 gggctgacag cccttaacat ttcttcacct gtttctgtag ctgaatttac atatacaaca 3241 tctgtgtttg gtgatgataa taaggcgctt tctaaaagtg aaataatata tggaaatgag 3301 actgaactgc aaattccttc tttcaatgag atggtttacc cttctgaaag cacagtcatg 3361 cccaacatgt atgataatgt aaataagttg aatgcgtctt tacaagaaac ctctgtttcc 3421 atttctagca ccaagggcat gtttccaggg tcccttgctc ataccaccac taaggttttt 3481 gatcatgaga ttagtcaagt tccagaaaat aacttttcag ttcaacctac acatactgtc 3541 tctcaagcat ctggtgacac ttcgcttaaa cctgtgctta gtgcaaactc agagccagca 3601 tcctctgacc ctgcttctag tgaaatgtta tctccttcaa ctcagctctt attttatgag 3661 acctcagctt cttttagtac tgaagtattg ctacaacctt cctttcaggc ttctgatgtt 3721 gacaccttgc ttaaaactgt tcttccagct gtgcccagtg atccaatatt ggttgaaacc 3781 cccaaagttg ataaaattag ttctacaatg ttgcatctca ttgtatcaaa ttctgcttca 3841 agtgaaaaca tgctgcactc tacatctgta ccagtttttg atgtgtcgcc tacttctcat 3901 atgcactctg cttcacttca aggtttgacc atttcctatg caagtgagaa atatgaacca 3961 gttttgttaa aaagtgaaag ttcccaccaa gtggtacctt ctttgtacag taatgatgag 4021 ttgttccaaa cggccaattt ggagattaac caggcccatc ccccaaaagg aaggcatgta 4081 tttgctacac ctgttttatc aattgatgaa ccattaaata cactaataaa taagcttata 4141 cattccgatg aaattttaac ctccaccaaa agttctgtta ctggtaaggt atttgctggt 4201 attccaacag ttgcttctga tacatttgta tctactgatc attctgttcc tataggaaat 4261 gggcatgttg ccattacagc tgtttctccc cacagagatg gttctgtaac ctcaacaaag 4321 ttgctgtttc cttctaaggc aacttctgag ctgagtcata gtgccaaatc tgatgccggt 4381 ttagtgggtg gtggtgaaga tggtgacact gatgatgatg gtgatgatga tgatgacaga 4441 gatagtgatg gcttatccat tcataagtgt atgtcatgct catcctatag agaatcacag 4501 gaaaaggtaa tgaatgattc agacacccac gaaaacagtc ttatggatca gaataatcca 4561 atctcatact cactatctga gaattctgaa gaagataata gagtcacaag tgtatcctca 4621 gacagtcaaa ctggtatgga cagaagtcct ggtaaatcac catcagcaaa tgggctatcc 4681 caaaagcaca atgatggaaa agaggaaaat gacattcaga ctggtagtgc tctgcttcct 4741 ctcagccctg aatctaaagc atgggcagtt ctgacaagtg atgaagaaag tggatcaggg 4801 caaggtacct cagatagcct taatgagaat gagacttcca cagatttcag ttttgcagac 4861 actaatgaaa aagatgctga tgggatcctg gcagcaggtg actcagaaat aactcctgga 4921 ttcccacagt ccccaacatc atctgttact agcgagaact cagaagtgtt ccacgtttca 4981 gaggcagagg ccagtaatag tagccatgag tctcgtattg gtctagctga ggggttggaa 5041 tccgagaaga aggcagttat accccttgtg atcgtgtcag ccctgacttt tatctgtcta 5101 gtggttcttg tgggtattct catctactgg aggaaatgct tccagactgc acacttttac 5161 ttagaggaca gtacatcccc tagagttata tccacacctc caacacctat ctttccaatt 5221 tcagatgatg tcggagcaat tccaataaag cactttccaa agcatgttgc agatttacat 5281 gcaagtagtg ggtttactga agaatttgag acactgaaag agttttacca ggaagtgcag 5341 agctgtactg ttgacttagg tattacagca gacagctcca accacccaga caacaagcac 5401 aagaatcgat acataaatat cgttgcctat gatcatagca gggttaagct agcacagctt 5461 gctgaaaagg atggcaaact gactgattat atcaatgcca attatgttga tggctacaac 5521 agaccaaaag cttatattgc tgcccaaggc ccactgaaat ccacagctga agatttctgg 5581 agaatgatat gggaacataa tgtggaagtt attgtcatga taacaaacct cgtggagaaa 5641 ggaaggagaa aatgtgatca gtactggcct gccgatggga gtgaggagta cgggaacttt 5701 ctggtcactc agaagagtgt gcaagtgctt gcctattata ctgtgaggaa ttttactcta 5761 agaaacacaa aaataaaaaa gggctcccag aaaggaagac ccagtggacg tgtggtcaca 5821 cagtatcact acacgcagtg gcctgacatg ggagtaccag agtactccct gccagtgctg 5881 acctttgtga gaaaggcagc ctatgccaag cgccatgcag tggggcctgt tgtcgtccac 5941 tgcagtgctg gagttggaag aacaggcaca tatattgtgc tagacagtat gttgcagcag 6001 attcaacacg aaggaactgt caacatattt ggcttcttaa aacacatccg ttcacaaaga 6061 aattatttgg tacaaactga ggagcaatat gtcttcattc atgatacact ggttgaggcc 6121 atacttagta aagaaactga ggtgctggac agtcatattc atgcctatgt taatgcactc 6181 ctcattcctg gaccagcagg caaaacaaag ctagagaaac aattccagct cctgagccag 6241 tcaaatatac agcagagtga ctattctgca gccctaaagc aatgcaacag ggaaaagaat 6301 cgaacttctt ctatcatccc tgtggaaaga tcaagggttg gcatttcatc cctgagtgga 6361 gaaggcacag actacatcaa tgcctcctat atcatgggct attaccagag caatgaattc 6421 atcattaccc agcaccctct ccttcatacc atcaaggatt tctggaggat gatatgggac 6481 cataatgccc aactggtggt tatgattcct gatggccaaa acatggcaga agatgaattt 6541 gtttactggc caaataaaga tgagcctata aattgtgaga gctttaaggt cactcttatg 6601 gctgaagaac acaaatgtct atctaatgag gaaaaactta taattcagga ctttatctta 6661 gaagctacac aggatgatta tgtacttgaa gtgaggcact ttcagtgtcc taaatggcca 6721 aatccagata gccccattag taaaactttt gaacttataa gtgttataaa agaagaagct 6781 gccaataggg atgggcctat gattgttcat gatgagcatg gaggagtgac ggcaggaact 6841 ttctgtgctc tgacaaccct tatgcaccaa ctagaaaaag aaaattccgt ggatgtttac 6901 caggtagcca agatgatcaa tctgatgagg ccaggagtct ttgctgacat tgagcagtat 6961 cagtttctct acaaagtgat cctcagcctt gtgagcacaa ggcaggaaga gaatccatcc 7021 acctctctgg acagtaatgg tgcagcattg cctgatggaa atatagctga gagcttagag 7081 tctttagttt aacacagaaa ggggtggggg gactcacatc tgagcattgt tttcctcttc 7141 ctaaaattag gcaggaaaat cagtctagtt ctgttatctg ttgatttccc atcacctgac 7201 agtaactttc atgacatagg attctgccgc caaatttata tcattaacaa tgtgtgcctt 7261 tttgcaagac ttgtaattta cttattatgt ttgaactaaa atgattgaat tttacagtat 7321 ttctaagaat ggaattgtgg tatttttttc tgtattgatt ttaacagaaa atttcaattt 7381 atagaggtta ggaattccaa actacagaaa atgtttgttt ttagtgtcaa atttttagct 7441 gtatttgtag caattatcag gtttgctaga aatataactt ttaatacagt agcctgtaaa 7501 taaaacactc ttccatatga tattcaacat tttacaactg cagtattcac ctaaagtaga 7561 aataatctgt tacttattgt aaatactgcc ctagtgtctc catggaccaa atttatattt 7621 ataattgtag atttttatat tttactactg agtcaagttt tctagttctg tgtaattgtt 7681 tagtttaatg acgtagttca ttagctggtc ttactctacc agttttctga cattgtattg 7741 tgttacctaa gtcattaact ttgtttcagc atgtaatttt aacttttgtg gaaaatagaa 7801 ataccttcat tttgaaagaa gtttttatga gaataacacc ttaccaaaca ttgttcaaat 7861 ggtttttatc caaggaattg caaaaataaa tataaatatt gccattaaaa aaaaaaaaaa 7921 aaaaaaaaaa aaaaaaaaaa a // LOCUS HUMPTYPH 3643 bp mRNA PRI 17-JUL-1991 DEFINITION Human protein-tyrosine phosphatase mRNA, complete cds. ACCESSION M68941 NID g190747 KEYWORDS protein-tyrosine phosphatase. SOURCE Homo sapiens (library: cDNA, Meg-01, HUVEC) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3643) AUTHORS Gu,M., York,J.D., Warshawsky,I. and Majerus,P.W. TITLE Identification, cloning, and expression of a cytosolic megakaryocyte protein-tyrosine-phosphatase with sequence homology to cytoskeletal protein 4.1 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88, 5867-5871 (1991) MEDLINE 91288564 FEATURES Location/Qualifiers source 1..3643 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Meg-01, human platelets" /tissue_lib="cDNA, Meg-01, HUVEC" mRNA <1..3643 /product="protein-tyrosine phophatase" 5'UTR 1..771 /note="contains short ORF's, GC rich stem loop structure with 75% homology to cABL protooncogene 5'UTR; putative" stem_loop 632..673 /note="GC rich stem loop structure" CDS 772..3552 /codon_start=1 /product="protein-tyrosine phophatase" /db_xref="PID:g190748" /translation="MTSRFRLPAGRTYNVRASELARDRQHTEVVCNILLLDNTVQAFK VNKHDQGQVLLDVVFKHLDLTEQDYFGLQLADDSTDNPRWLDPNKPIRKQLKRGSPYS LNFRVKFFVSDPNKLQEEYTRYQYFLQIKQDILTGRLPCPSNTAALLASFAVQSELGD YDQSENLSGYLSDYSFIPNQPQDFEKEIAKLHQQHIGLSPAEAEFNYLNTARTLELYG VEFHYARDQSNNEIMIGVMSGGILIYKNRVRMNTFPWLKIVKISFKCKQFFIQLRKEL HESRETLLGFNMVNYRACKNLWKACVEHHTFFRLDRPLPPQKNFFAHYFTLGSKFRYC GRTEVQSVQYGKEKANKDRVFARSPSKPLARKLMDWEVVSRNSISDDRLETQSLPSRS PPGTPNHRNSTFTQEGTRLRPSSVGHLVDHMVHTSPSEVFVNQRSPSSTQANSIVLES SPSQETPGDGKPPALPPKQSKKNSWNQIHYSHSQQDLESHINETFDIPSSPEKPTPNG GIPHDNLVLIRMKPDENGRFGFNVKGGYDQKMPVIVSRVAPGTPADLCVPRLNEGDQV VLINGRDIAEHTHDQVVLFIKASCERHSGELMLLVRPNAVYDVVEEKLENEPDFQYIP EKAPLDSVHQDDHSLRESMIQLAEGLITGTVLTQFDQLYRKKPGMTMSCAKLPQNISK NRYRDISPYDATRVILKGNEDYINANYINMEIPSSSIINQYIACQGPLPHTCTDFWQM TWEQGSSMVVMLTTQVERGRVKCHQYWPEPTGSSSYGCYQVTCHSEEGNTAYIFRKMT LFNQEKNESRPLTQIQYIAWPDHGVPDDSSDFLDFVCHVRNKRAGKEEPVVVHCSAGI GRTGVLITMETAMCLIECNQPVYPLDIVRTMRDQRAMMIQTPSQYRFVCEAILKVYEE GFVKPLTTSTNK" BASE COUNT 1038 a 815 c 863 g 927 t ORIGIN 1 cctgcgtgtc cctctgcgct ccgactggtg cgacttctcc ctgcgctagc gaggcagggt 61 tttggcctcg cctctcgcga gatcgcctcc tgttgctgcc gccgccgctc ctggccactg 121 actggcggcg cctgcgcagc cgccatgttc ggttgctatg ctgcggccta ggagaggggg 181 tgtgcttgag ggaggaggaa gagatagagg aggaggaggg ggaggaagag gaggtggaga 241 aggagggggg tgactgagct cctcttgcac tctcacacac aaacgctgcc caggattacc 301 cgccagctca cgccgcgcag tgcgcttttc cgctcctcgc gccccaccac caacattgtt 361 ctctcaggac tcctgggtcc caggggtcgg aattgggcct gagcgggaga ggaaagagac 421 ttggctttgg ccgcggggtc ggaggattgg ggccaggccc cctcccccac gcacttttgg 481 gggtgtggat tatctcatcc ctgcagggag gtaggagagg tcgccggctg cccgcctccc 541 tgccacctcc ccagcggcgc cggcccgcgg ctgcccagca gcatgaggtg gtgctggcgg 601 ctccgggtcg tggcgcgacc gctgcggcgg cggctgctcg gggggcgctg aggtagcccc 661 ccggagcggc acggaggacg cgcttctcct ctgcgcgccg gggcctcgag gctttttttc 721 tccagccgag aggacgcggc tgtgatatac gaagactttg tgtggacagt aatgacctca 781 cgtttccgat tgcctgctgg cagaacctac aatgtacgag catcagagtt ggcccgagac 841 agacagcata ctgaagtggt ttgcaacatc cttcttctgg ataacactgt acaagctttc 901 aaagtcaata aacatgatca ggggcaagtc ttgttggatg tcgtcttcaa gcatctagat 961 ttgactgagc aggactattt tggtttacag ttggctgatg attccacaga taacccaagg 1021 tggctggatc caaacaaacc aataaggaag cagctaaaga gaggatctcc ttacagtttg 1081 aactttagag tcaaattttt tgtaagtgac cccaacaagt tacaagaaga atatacaagg 1141 taccagtatt ttttgcaaat taaacaagac attcttactg gaagattacc ctgtccttct 1201 aatactgctg cccttttagc ttcatttgct gttcagtctg aacttggaga ctacgatcag 1261 tcagagaact tgtcaggcta cctctcagat tattctttca ttcctaatca acctcaagat 1321 tttgaaaaag aaattgcaaa attacatcag caacacatag gcttatctcc tgcagaagca 1381 gaatttaatt acctaaacac agcacgtacc ttagaactct atggagttga attccactat 1441 gcaagggatc agagtaacaa tgaaattatg attggagtga tgtcaggagg aattctgatt 1501 tataagaaca gggtacgaat gaataccttt ccatggttga agattgtaaa aatttctttt 1561 aagtgcaaac agttttttat tcaacttaga aaagaattgc atgaatctag agaaacatta 1621 ttgggattta atatggtgaa ttacagagca tgtaaaaatt tgtggaaagc atgtgtagaa 1681 catcacacat tcttccgttt ggacagacca cttccacctc aaaagaattt ttttgcacat 1741 tattttacat taggttcaaa attccggtac tgtgggagaa ctgaagtcca atcagttcag 1801 tatggcaaag aaaaggcaaa taaagacagg gtatttgcaa gatccccaag taagcccttg 1861 gcacggaaat taatggattg ggaagtagta agcagaaatt caatatctga tgacaggtta 1921 gaaacacaaa gtcttccatc acgatctcca ccgggaactc ctaatcatcg aaattctaca 1981 ttcacgcagg aaggaacccg gttacgacca tcttcagttg gtcatttggt agaccatatg 2041 gttcatactt ccccaagcga agtgtttgta aatcagagat ctccgtcatc aacacaagct 2101 aatagcattg ttctggaatc atcaccatca caagagaccc ctggagatgg gaagcctcca 2161 gctttaccac ccaaacagtc aaagaaaaac agttggaacc aaattcatta ttcacattcg 2221 caacaagatc tagaaagtca tattaatgaa acatttgata ttccatcttc tcctgaaaaa 2281 cccactccta atggtggtat tccacatgat aatcttgtcc taatcagaat gaaacctgat 2341 gaaaatggga ggtttggatt caatgtaaag ggaggatatg atcagaagat gcctgtgatt 2401 gtgtctcgag tagcaccagg aacacctgct gacctctgtg tccctagact gaatgaaggg 2461 gaccaagttg tactgatcaa tggtcgggac attgcagaac acactcatga tcaggttgtg 2521 ctgtttatta aagctagttg tgagagacat tctggggaac tcatgcttct agttcgacct 2581 aatgctgtat atgatgtagt ggaagaaaag ctagaaaatg agccagattt ccagtatatt 2641 cctgagaaag ccccactaga tagtgtgcat caggatgacc attccctgcg ggagtcaatg 2701 atccagctag ctgaggggct tatcactgga acagtcctga cacagtttga tcaactgtat 2761 cggaaaaaac ctggaatgac aatgtcctgt gccaaattac ctcagaatat ttccaaaaat 2821 agatacagag atatttcgcc ttatgatgcc acacgggtca ttttaaaagg taatgaagac 2881 tacatcaatg cgaactatat aaatatggaa attccttctt ccagcattat aaatcagtac 2941 attgcttgtc aagggccatt accacacact tgtacagatt tttggcagat gacttgggaa 3001 caaggctcct ctatggttgt aatgttgacc acacaagttg aacgtggcag agttaaatgt 3061 caccaatatt ggccagaacc cacaggcagt tcatcttatg gatgctacca agttacctgc 3121 cactctgaag aaggaaacac tgcctatatc ttcaggaaga tgaccctatt taaccaagag 3181 aaaaatgaaa gtcgtccact cactcagatc cagtacatag cctggcctga ccatggagtc 3241 cctgatgatt cgagtgactt tctagatttt gtttgtcatg tacgaaacaa gagggctggc 3301 aaggaagaac ccgttgttgt ccattgcagt gctggaatcg gaagaactgg ggttcttatt 3361 actatggaaa cagccatgtg tctcattgaa tgcaatcagc cagtttatcc actagatatt 3421 gtaagaacaa tgagagatca gcgagccatg atgatccaaa cacctagtca atacagattt 3481 gtatgtgaag ctattttgaa agtttatgaa gaaggctttg ttaaaccctt aacaacatca 3541 acaaataaat aagaaagcaa aaagatctgg gatatgtgtt ggaaaactgc tttcccttat 3601 gttcactgtg ccataatgct gctcgcagga aatggcattt tac // LOCUS HUMPURA 1144 bp mRNA PRI 08-JAN-1995 DEFINITION H.sapiens Pur (pur-alpha) mRNA, complete cds. ACCESSION M96684 NID g190749 KEYWORDS Pur. SOURCE Homo sapiens fetus liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1144) AUTHORS Bergemann,A.D. and Johnson,E.M. TITLE The HeLa Pur factor binds single-stranded DNA at a specific element conserved in gene flanking regions and origins of DNA replication JOURNAL Mol. Cell. Biol. 12 (3), 1257-1265 (1992) MEDLINE 92186858 REFERENCE 2 (bases 1 to 1144) AUTHORS Bergemann,A.D., Ma,Z.W. and Johnson,E.M. TITLE Sequence of cDNA comprising the human pur gene and sequence-specific single-stranded-DNA-binding properties of the encoded protein JOURNAL Mol. Cell. Biol. 12 (12), 5673-5682 (1992) MEDLINE 93078769 FEATURES Location/Qualifiers source 1..1144 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="liver" gene 60..1028 /gene="pur-alpha" CDS 60..1028 /gene="pur-alpha" /citation=[2] /codon_start=1 /function="sequence-specific single-stranded DNA binding protein" /evidence=experimental /label=orf /product="Pur" /db_xref="PID:g190750" /translation="MADRDSGSEQGGAALGSGGSLGHPGSGSGSGGGGGGGGGGGGSG GGGGGAPGGLQHETQELASKRVDIQNKRFYLDVKQNAKGRFLKIAEVGAGGNKSRLTL SMSVAVEFRDYLGDFIEHYAQLGPSQPPDLAQAQDEPRRALKSEFLVRENRKYYMDLK ENQRGRFLRIRQTVNRGPGLGSTQGQTIALPAQGLIEFRDALAKLIDDYGVEEEPAEL PEGTSLTVDNKRFFFDVGSNKYGVFMRVSEVKPTYRNSITVPYKVWAKFGHTFCKYSE EMKKIQEKQREKRAACEQLHQQQQQQQEETAAATLLLQGEEEGEED" BASE COUNT 263 a 338 c 394 g 149 t ORIGIN 1 cgactgaggc ggcgggcgga gcggcaggcg gcggcggcgc ggcagcggag cgcagcatca 61 tggcggaccg agacagcggc agcgagcagg gtggtgcggc gctgggttcg ggcggctccc 121 tggggcaccc cggctcgggc tcaggctccg gcgggggcgg tggtggcggc gggggcggcg 181 gcggcagtgg cggcggcggc ggcggggccc caggggggct gcagcacgag acgcaggagc 241 tggcctccaa gcgggtggac atccagaaca agcgcttcta cctggacgtg aagcagaacg 301 ccaagggccg cttcctgaag atcgccgagg tgggcgcggg cggcaacaag agccgcctta 361 ctctctccat gtcagtggcc gtggagttcc gcgactacct gggcgacttc atcgagcact 421 acgcgcagct gggccccagc cagccgccgg acctggccca ggcgcaggac gagccgcgcc 481 gggcgctcaa aagcgagttc ctggtgcgcg agaaccgcaa gtactacatg gatctcaagg 541 agaaccagcg cggccgcttc ctgcgcatcc gccagacggt caaccggggg cctggcctgg 601 gctccacgca gggccagacc attgcgctgc ccgcgcaggg gctcatcgag ttccgtgacg 661 ctctggccaa gctcatcgac gactacggag tggaggagga gccggccgag ctgcccgagg 721 gcacctcctt gactgtggac aacaagcgct tcttcttcga tgtgggctcc aacaagtacg 781 gcgtgtttat gcgagtgagc gaggtgaagc ccacctatcg caactccatc accgtgccct 841 acaaggtgtg ggccaagttc ggacacacct tctgcaagta ctcggaggag atgaagaaga 901 ttcaagagaa gcagagggag aagcgggctg cctgtgagca gcttcaccag cagcaacagc 961 agcagcagga ggagaccgcc gctgccactc tgctactgca gggtgaggaa gaaggggaag 1021 aagattgatc aaacagaatg aaacccccac acacacacac atgcatacac acacacacac 1081 agccacacac acagaaaata tactgtaaag aaagagagaa aataaaaagt taaaaagtta 1141 aaaa // LOCUS HUMPYHBASA 2722 bp mRNA PRI 15-DEC-1989 DEFINITION Human prolyl 4-hydroxylase alpha subunit mRNA, complete cds, clone PA-11. ACCESSION M24486 NID g190785 KEYWORDS prolyl 4-hydroxylase alpha subunit. SOURCE Human placenta, cDNA to mRNA, clone PA-11. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2722) AUTHORS Helaakoski,T., Vuori,K., Myllylae,R., Kivirikko,L. and Pihlajaniemi,T. TITLE Molecular cloning of the alpha-subunit of human prolyl 4-hydroxylase: The complete cDNA-derived amino acid sequence and evidence for alternative splicing of RNA transcripts JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 4392-4396 (1989) MEDLINE 89282778 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Pihlajaniemi, 05-MAY-1989. FEATURES Location/Qualifiers source 1..2722 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..2722 /note="PYHB" CDS 119..1723 /note="prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2)" /codon_start=1 /db_xref="PID:g190786" /translation="MIWYILIIGILLPQSLAHPGFFTSIGQMTDLIHTEKDLVTSLKD YIKAEEDKLEQIKKWAEKLDRLTSTATKDPEGFVGHPVNAFKLMKRLNTEWSELENLV LKDMSDGFISNLTIQRPVLSNDEDQVGAAKALLRLQDTYNLDTDTISKGNLPGVKHKS FLTAEDCFELGKVAYTEADYYHTELWMEQALRQLDEGEISTIDKVSVLDYLSYAVYQQ GDLDKALLLTKKLLELDPEHQRANGNLKYFEYIMAKEKDVNKSASDDQSDQKTTPKKK GVAVDYLPERQKYEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDK PRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENP VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRI ATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVG NKWVSNKWLHERGQEFRRPCTLSELE" BASE COUNT 809 a 489 c 590 g 834 t ORIGIN 1 gagcgggctg agggtaggaa gtagccgctc cgagtggagg cgactggggg ctgaagagcg 61 cgccgccctc tcgtcccact ttccaggtgt gtgatcctgt aaaattaaat cttccaagat 121 gatctggtat atattaatta taggaattct gcttccccag tctttggctc atccaggctt 181 ttttacttca attggtcaga tgactgattt gatccatact gagaaagatc tggtgacttc 241 tctgaaagat tatattaagg cagaagagga caagttagaa caaataaaaa aatgggcaga 301 gaagttagat cggctaacta gtacagcgac aaaagatcca gaaggatttg ttgggcatcc 361 agtaaatgca ttcaaattaa tgaaacgtct gaatactgag tggagtgagt tggagaatct 421 ggtccttaag gatatgtcag atggctttat ctctaaccta accattcaga gaccagtact 481 ttctaatgat gaagatcagg ttggggcagc caaagctctg ttacgtctcc aggataccta 541 caatttggat acagatacca tctcaaaggg taatcttcca ggagtgaaac acaaatcttt 601 tctaacggct gaggactgct ttgagttggg caaagtggcc tatacagaag cagattatta 661 ccatacggaa ctgtggatgg aacaagccct aaggcaactg gatgaaggcg agatttctac 721 catagataaa gtctctgttc tagattattt gagctatgcg gtatatcagc agggagacct 781 ggataaggca cttttgctca caaagaagct tcttgaacta gatcctgaac atcagagagc 841 taatggtaac ttaaaatatt ttgagtatat aatggctaaa gaaaaagatg tcaataagtc 901 tgcttcagat gaccaatctg atcagaaaac tacaccaaag aaaaaagggg ttgctgtgga 961 ttacctgcca gagagacaga agtacgaaat gctgtgccgt ggggagggta tcaaaatgac 1021 ccctcggaga cagaaaaaac tcttttgccg ctaccatgat ggaaaccgta atcctaaatt 1081 tattctggct ccagctaaac aggaggatga atgggacaag cctcgtatta ttcgcttcca 1141 tgatattatt tctgatgcag aaattgaaat cgtcaaagac ctagcaaaac caaggctgag 1201 ccgagctaca gtacatgacc ctgagactgg aaaattgacc acagcacagt acagagtatc 1261 taagagtgcc tggctctctg gctatgaaaa tcctgtggtg tctcgaatta atatgagaat 1321 acaagatcta acaggactag atgtttccac agcagaggaa ttacaggtag caaattatgg 1381 agttggagga cagtatgaac cccattttga ctttgcacgg aaagatgagc cagatgcttt 1441 caaagagctg gggacaggaa atagaattgc tacatggctg ttttatatga gtgatgtgtc 1501 tgcaggagga gccactgttt ttcctgaagt tggagctagt gtttggccca aaaaaggaac 1561 tgctgttttc tggtataatc tgtttgccag tggagaagga gattatagta cacggcatgc 1621 agcctgtcca gtgctagttg gcaacaaatg ggtatccaat aaatggctcc atgaacgtgg 1681 acaagaattt cgaagacctt gtacgttgtc agaattggaa tgacaaacag gcttcccttt 1741 ttctcctatt gttgtactct tatgtgtctg atatacacat ttccatagtc ttaactttca 1801 ggagtttaca attgactaac actccatgat tgattcagtc atgaacctca tcccatgttt 1861 catctgtgga caattgctta ctttgtgggt tcttttaaaa gtaacacgaa atcatcatat 1921 tgcataaaac cttaaagttc tgttggtatc acagaagaca aggcagagtt taaagtgagg 1981 aattttatat ttaaagaact ttttggttgg ataaaaacat aatttgagca tccagtttta 2041 gtatttcact acatctcagt tggtgggtgt taagctagaa tgggctgtgt gataggaaac 2101 aaatgcctta cagatgtgcc taggtgttct gtttacctag tgtcttactc tgttttctgg 2161 atctgaagac tagtaataaa ctaggacact aactgggttc catgtgattg ccctttcata 2221 tgatcttcta agttgatttt tttcctccca agtctttttt aaagaaagta tactgtattt 2281 taccaacccc ctctcttttc ttttagctcc tctgtggtga attaaacgta cttgagttaa 2341 aatatttcga tttttttttt ttttttaatg gaaagtcctg cataacaaca ctgggccttc 2401 ttaactaaaa tgctcaccac ttagcctgtt tttttatccc ttttttaaaa tgacagatga 2461 ttttgttcag gaattttgct gtttttctta gtgctaatac cttgcctctt attcctgcta 2521 cagcagggtg gtaatattgg cattctgatt aaatactgtg ccttaggaga ctggaagttt 2581 aaaaatgtac aagtcctttc agtgatgagg gaattgattt tttttaaaag tctttttctt 2641 agaaagccaa aatgtttgtt tttttaagat tctgaaatgt gttgtgacaa caatgaccta 2701 tttatgatct taaatctttt tt // LOCUS HUMQPRTASE 894 bp mRNA PRI 03-MAR-1997 DEFINITION Human mRNA for quinolinate phosphoribosyl transferase (nicotinate mononucleotide pyrophosphorylase), complete cds. ACCESSION D78177 NID g1060906 KEYWORDS nicotinate mononucleotide pyrophosphorylase; quinolinate phosphoribosyl transferase; QPRTase. SOURCE Homo sapiens adult placenta (library: lambda Uni ZAP) cDNA to mRNA, clone QPRT115. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 894) AUTHORS Fukuoka,S. TITLE Characterization and expression of cDNA encoding human quino linate phosphoribosyl transferase JOURNAL Unpublished (1995) REFERENCE 2 (bases 1 to 894) AUTHORS Fukuoka,S.-I. TITLE Direct Submission JOURNAL Submitted (31-OCT-1995) to the DDBJ/EMBL/GenBank databases. Shin-Ichi Fukuoka, Kyoto University, Research Institute for Food Science; Gokanosho, Uji, Kyoto 611, Japan (E-mail:fukuoka@soya.food.kyoto-u.ac.jp, Tel:0774-33-6905, Fax:0774-33-3004) FEATURES Location/Qualifiers source 1..894 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda Uni ZAP" /dev_stage="adult" /tissue_type="placenta" CDS 1..894 /EC_number="2.4.2.19" /note="QPRTase" /codon_start=1 /product="quinolinate phosphoribosyl transferase (nicotinate mononucleotide pyrophosphorylase)" /db_xref="PID:d1011904" /db_xref="PID:g1060907" /translation="MDAEGLALLLPPVTLAALVDSWLREDCPGLNYAALVSGAGPSQA ALWAKSPGVLAGQPFFDAIFTQLNCQVSWFLPEGSKLVPVARVAEVRGPAHCLLLGER VALNTLARCSGIASAAAAAVEAARGAGWTGHVAGTRKTTPGFRLVEKYGLLVGGAASH RYDLGGLVMLKDNHVVPPGGVEKAVRAARQAADFALKVEVECSSLQEVVQAAEAGADL VLLDNFKPEELHPTATALKAQFPSVAVEASGGITLDNLPQFCGPHIDVISMGMLTQAV PALDFSLKLFAKEVAPVPKIH" BASE COUNT 148 a 283 c 312 g 151 t ORIGIN 1 atggacgctg aaggcctggc gctgctgctg ccgcccgtca ccctggcagc cctggtggac 61 agctggctcc gagaggactg cccagggctc aactacgcag ccttggtcag cggggcaggc 121 ccctcgcagg cggcgctgtg ggccaaatcc cctggggtac tggcagggca gcctttcttc 181 gatgccatat ttacccaact caactgccaa gtctcctggt tcctccccga gggatcgaag 241 ctggtgccgg tggccagagt ggccgaggtc cggggccctg cccactgcct gctgctgggg 301 gaacgggtgg ccctcaacac gctggcccgc tgcagtggca ttgccagtgc tgccgccgct 361 gcagtggagg ccgccagggg ggccggctgg actgggcacg tggcaggcac gaggaagacc 421 acgccaggct tccggctggt ggagaagtat gggctcctgg tgggcggggc cgcctcgcac 481 cgctacgacc tgggagggct ggtgatgttg aaggataacc atgtggtgcc ccccggtggc 541 gtggagaagg cggtgcgggc ggccagacag gcggctgact tcgctctgaa ggtggaagtg 601 gaatgcagca gcctgcagga ggtcgtccag gcagctgagg ctggcgccga ccttgtcctg 661 ctggacaact tcaagccaga ggagctgcac cccacggcca ccgcgctgaa ggcccagttc 721 ccgagtgtgg ctgtggaagc cagtgggggc atcaccctgg acaacctccc ccagttctgc 781 gggccgcaca tagacgtcat ctccatgggg atgctgaccc aggcggtccc agcccttgat 841 ttctccctca agctgtttgc caaagaggtg gctccagtgc ccaaaatcca ctag // LOCUS HUMQRE 976 bp mRNA PRI 08-JAN-1995 DEFINITION Human quinone oxidoreductase (NQO2) mRNA, complete cds. ACCESSION J02888 NID g190817 KEYWORDS quinone oxidoreductase. SOURCE Human liver, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 976) AUTHORS Jaiswal,A.K., Burnett,P., Adesnik,M. and McBride,O.W. TITLE Nucleotide and deduced amino acid sequence of a human cDNA (NQO2) corresponding to a second member of the NAD(P)H:quinone oxidoreductase gene family. Extensive polymorphism at the NQO2 gene locus on chromosome 6 JOURNAL Biochemistry 29 (7), 1899-1906 (1990) MEDLINE 90234709 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Jaiswal,A.K., 30-JAN-1990. FEATURES Location/Qualifiers source 1..976 /organism="Homo sapiens" /db_xref="taxon:9606" /map="6pter-q12" gene 176..871 /gene="NMOR2" CDS 176..871 /gene="NMOR2" /note="quinone oxidoreductase" /codon_start=1 /db_xref="GDB:G00-128-074" /db_xref="PID:g190818" /translation="MAGKKVLIVYAHQEPKSFNGSLKNVAVDELSRQGCTVTVSDLYA MNFEPRATDKDITGTLSNPEVFNYGVETHEAYKQRSLASDITDEQKKVREADLVIFQF PLYWFSVPAILKGWMDRVLCQGFAFDIPGFYDSGLLQGKLALLSVTTGGTAEMYTKTG VNGDSRYFLWPLQHGTLHFCGFKVLAPQISFAPEIASEEERKGMVAAWSQRLQTIWKE EPIPCTAHWHFGQ" BASE COUNT 249 a 249 c 270 g 208 t ORIGIN Chromosome 6pter-q12. 1 cccggaacct ggcgcaactc ctagagcggt ccttggggag acgcgggtcc cagtcctgcg 61 gctcctactg gggagtgcgc tggtcggaag attgctggac tcgctgaaga gagactacgc 121 aggaaagccc cagccaccca tcaaatcaga gagaaggaat ccaccttctt acgctatggc 181 aggtaagaaa gtactcattg tctatgcaca ccaggaaccc aagtctttca acggatcctt 241 gaagaatgtg gctgtagatg aactgagcag gcagggctgc accgtcacag tgtctgattt 301 gtatgccatg aactttgagc cgagggccac agacaaagat atcactggta ctctttctaa 361 tcctgaggtt ttcaattatg gagtggaaac ccacgaagcc tacaagcaaa ggtctctggc 421 tagcgacatc actgatgagc agaaaaaggt tcgggaggct gacctagtga tatttcagtt 481 cccgctgtac tggttcagcg tgccggccat cctgaagggc tggatggata gggtgctgtg 541 ccagggcttt gcctttgaca tcccaggatt ctacgattcc ggtttgctcc agggtaaact 601 agcgctcctt tccgtaacca cgggaggcac ggccgagatg tacacgaaga caggagtcaa 661 tggagattct cgatacttcc tgtggccact ccagcatggc acattacact tctgtggatt 721 taaagtcctt gcccctcaga tcagctttgc tcctgaaatt gcatccgaag aagaaagaaa 781 ggggatggtg gctgcgtggt cccagaggct gcagaccatc tggaaggaag agcccatccc 841 ctgcacagcc cactggcact tcgggcaata actctgtggc acgtgggcat cacgtaagca 901 gcacactagg aggcccaggc gcaggcaaag agaagatggt gctgtcatga aataaaatta 961 caacatagct acctgg // LOCUS HUMQUINZ 1796 bp mRNA PRI 25-MAR-1993 DEFINITION Homo sapiens zeta-crystallin/quinone reductase mRNA, complete cds. ACCESSION L13278 NID g292414 KEYWORDS quinone reductase; zeta-crystallin. SOURCE Homo sapiens (library: Clontech HL 1115a) male adult liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1796) AUTHORS Gonzalez,P., Rao,P.V. and Zigler,J.S. TITLE Molecular cloning and sequencing of zeta-crystallin/quinone reductase cDNA from human liver JOURNAL Biochem. Biophys. Res. Commun. (1993) In press FEATURES Location/Qualifiers source 1..1796 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /sex="male" /tissue_type="liver" /tissue_lib="Clontech HL 1115a" 5'UTR 1..10 CDS 11..1000 /standard_name="zeta-crystallin/quinone reductase" /codon_start=1 /product="zeta-crystallin" /db_xref="PID:g292415" /translation="MATGQKLMRAVRVFEFGGPEVLKLRSDIAVPIPKDHQVLIKVHA CGVNPVETYIRSGTYSRKPLLPYTPGSDVAGVIEAVGDNASAFKKGDRVFTSSTISGG YAEYALAADHTVYKLPEKLDFKQGAAIGIPYFTAYRALIHSACVKAGESVLVHGASGG VGLAACQIARAYGLKILGTAGTEEGQKIVLQNGAHEVFNHREVNYIDKIKKYVGEKGI DIIIEMLANVNLSKDLSLLSHGGRVIVVGSRGTIEINPRDTMAKESSIIGVTLFSSTK EEFQQYAAALQAGMEIGWLKPVIGSQYPLEKVAEAHENIIHGSGATGKMILLL" 3'UTR 1001..1796 polyA_signal 1491..1496 polyA_signal 1787..1792 BASE COUNT 542 a 296 c 377 g 581 t ORIGIN 1 ctagatcacc atggcgactg gacagaagtt gatgagagct gttagagttt ttgaatttgg 61 tgggccagaa gtcctgaaat tgcgatcaga tattgcagta ccgattccaa aagaccatca 121 ggttctaatc aaggtccatg catgtggtgt caaccccgtg gagacataca ttcgctctgg 181 tacttatagt agaaaaccac tcttacccta tactcctggc tcagatgtgg ctggggtgat 241 agaagctgtt ggagataatg catctgcttt caagaaaggt gacagagttt tcactagcag 301 cacgatctct gggggttatg cagagtatgc tcttgcagca gaccacactg tttacaaact 361 acctgaaaaa ctggacttta aacaaggagc tgccatcggc attccatatt ttactgctta 421 tcgagctctg atccacagtg cctgtgtgaa agctggagag agtgttctgg ttcatggggc 481 aagtggagga gttggattag cagcatgcca aattgctaga gcttatggct taaagatttt 541 gggcactgct ggtactgagg aaggacaaaa gattgttttg caaaatggag cccatgaagt 601 gttcaatcac agagaagtga attacattga taaaattaag aagtatgttg gtgagaaagg 661 aattgatata attattgaaa tgttagctaa tgtaaatctt agtaaagact tgagtcttct 721 gtcacatgga ggacgagtga tagttgttgg cagcagaggt actattgaaa taaacccacg 781 agacaccatg gcaaaggagt cgagtataat tggagttact ctcttttcct caaccaagga 841 ggaatttcag caatatgcag cagcccttca agctggaatg gaaattggct ggttgaaacc 901 tgtgataggt tctcaatatc cattggagaa ggtggccgag gctcatgaaa atatcattca 961 tggtagtggg gctactggaa aaatgattct tctcttatga tgattaattc tttcatggat 1021 ttcctatgta attagaggta ctgtctttcc cccagttgta cttaccctat cttttcttta 1081 attaacattc gattccatga gcttcttatg tgaaaaaata agatttttct ttagagagca 1141 gaagcagaag agtaaaattt attgtatagc tagcaatatt tttttatgcc atctgtctca 1201 aatcaaagag tcatcatagt aggaaataac atgttagttg tcatttggca tgagtgtgca 1261 ttccagtaat tcttaattga tatttgatta attccatacc tttgattaaa acatgctagt 1321 tcaaaataag actgctcagt ttccaagggt tttcaagcct acttaccttt ataaaggttc 1381 tctagtctct gattagccat gactgtattg gactttgaac attttctgaa ctaaaaacct 1441 ctattctaaa ctaatctcat ttggatgtgt aagtcttttg taaaggcaag aataaataat 1501 atccaggaca atttattagt tttctcagta ttttcccaaa tattagaata tttacttcat 1561 tattggttgg ctgccaatga ccccatatgt tctgtgagaa tagtagcttt atctttgata 1621 taatacatag tctccaaata ggtaatactt cgcaattgat tagattttca gagtagattt 1681 agagttatct gtttttctgg tgagggtcaa atatttttgt taattaagct cacaaatttg 1741 ataaattaag aattatctgc atttgtctgg taacataata atgtgtaata aagtct // LOCUS HUMRAB1A 723 bp mRNA PRI 08-JAN-1995 DEFINITION Homo sapiens GTP-binding protein (RAB1) mRNA, complete cds. ACCESSION M28209 J04941 NID g550059 KEYWORDS GTP-binding protein; ras oncogene. SOURCE Homo sapiens (tissue library: of J.Mallet) pheochromocytoma cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 723) AUTHORS Zahraoui,A., Touchot,N., Chardin,P. and Tavitian,A. TITLE The human Rab genes encode a family of GTP-binding proteins related to yeast YPT1 and SEC4 products involved in secretion JOURNAL J. Biol. Chem. 264 (21), 12394-12401 (1989) MEDLINE 89308668 FEATURES Location/Qualifiers source 1..723 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="pheochromocytoma" /tissue_lib="of J.Mallet" /map="Unassigned" gene 51..668 /gene="RAB1" CDS 51..668 /gene="RAB1" /codon_start=1 /db_xref="GDB:G00-118-857" /product="GTP-binding protein" /db_xref="PID:g550060" /translation="MSSMNPEYDYLFKLLLIGDSGVGKSCLLLRFADDTYTESYISTI GVDFKIRTIELDGKTIKLQIWDTAGQERFRTITSSYYRGAHGIIVVYDVTDQESFNNV KQWLQEIDRYASENVNKLLVGNKCDLTTKKVVDYTTAKEFADSLGIPFLETSAKNATN VEQSFMTMAAEIKKRMGPGATAGGAEKSNVKIQSTPVKQSGGGCC" BASE COUNT 225 a 142 c 180 g 176 t ORIGIN 1 gggcggcggt cggcagcaag gcggcggtgc gccgccgcag ctgcagtgac atgtccagca 61 tgaatcccga atatgattat ttattcaagt tacttctgat tggcgactca ggggttggaa 121 agtcttgcct tcttcttagg tttgcagatg atacatatac agaaagctac atcagcacaa 181 ttggtgtgga tttcaaaata agaactatag agttagacgg gaaaacaatc aagcttcaaa 241 tatgggacac agcaggccag gaaagatttc gaacaatcac ctccagttat tacagaggag 301 cccatggcat catagttgtg tatgatgtga cagatcagga gtccttcaat aatgttaaac 361 agtggctgca ggaaatagat cgttatgcca gtgaaaatgt caacaaattg ttggtaggga 421 acaaatgtga tctgaccaca aagaaagtag tagactacac aacagcgaag gaatttgctg 481 attcccttgg aattccgttt ttggaaacca gtgctaagaa tgcaacgaac gtagaacagt 541 ctttcatgac gatggcagct gagattaaaa agcgaatggg tcccggagca acagctggtg 601 gtgctgagaa gtccaatgtt aaaattcaga gcactccagt caagcagtca ggtggaggtt 661 gctgctaaat ttgcctccat ccttttctca cagcaatgaa tttgcaatct gaacccaagt 721 gaa // LOCUS HUMRAB3A 777 bp mRNA PRI 08-JAN-1995 DEFINITION Homo sapiens GTP-binding protein (RAB3A) mRNA, complete cds. ACCESSION M28210 J04941 NID g550063 KEYWORDS GTP-binding protein; ras oncogene. SOURCE Homo sapiens (tissue library: of J.Mallet) pheochromocytoma cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 777) AUTHORS Zahraoui,A., Touchot,N., Chardin,P. and Tavitian,A. TITLE The human Rab genes encode a family of GTP-binding proteins related to yeast YPT1 and SEC4 products involved in secretion JOURNAL J. Biol. Chem. 264 (21), 12394-12401 (1989) MEDLINE 89308668 FEATURES Location/Qualifiers source 1..777 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="pheochromocytoma" /tissue_lib="of J.Mallet" gene 40..702 /gene="RAB3A" CDS 40..702 /gene="RAB3A" /codon_start=1 /product="GTP-binding protein" /db_xref="PID:g550064" /translation="MASATDSRYGQKESSDQNFDYMFKILIIGNSSVGKTSFLFRYAD DSFTPAFVSTVGIDFKVKTIYRNDKRIKLQIWDTAGQERYRTITTAYYRGAMGFILMY DITNEESFNAVQDWSTQIKTYSWDNAQVLLVGNKCDMEDERVVSSERGRQLADHLGFE FFEASAKDNINVKQTFERLVDVICEKMSESLDTADPAVTGAKQGPQLSDQQVPPHQDC AC" BASE COUNT 176 a 243 c 212 g 146 t ORIGIN 1 gaattccgcg tagtcgccgt tgcatcggtg cagggcaaga tggcatcggc cacagactcg 61 cgctatgggc agaaggagtc ctcggatcag aacttcgact acatgttcaa gattctcatc 121 atcggcaaca gcagcgtggg caagacgtcc ttcctcttcc gctatgctga cgactcgttc 181 acgcctgcct tcgtcagcac cgtgggcatc gacttcaagg tcaagaccat ctatcgcaac 241 gacaagagga tcaagctgca gatctgggac acagcagggc aagagcggta ccggaccatc 301 accaccgcat actaccgggg cgccatgggc ttcatcctca tgtatgacat caccaacgag 361 gaatccttca atgcagtgca ggactggtcc acccagatca agacctactc atgggacaat 421 gcccaggtgc tgctggtagg aaacaagtgt gacatggagg atgagcgggt ggtgtcatca 481 gaacgtggcc ggcagctagc tgaccacctt gggttcgagt tctttgaggc aagcgccaag 541 gacaacatta acgtcaagca gacctttgag cgcctggtgg atgtcatctg cgagaagatg 601 tccgagtcgt tggacacggc ggaccctgcg gtcacaggcg ccaagcaggg cccacagctc 661 agtgaccagc aggtgccacc gcaccaggac tgcgcctgct gagagccatc ccactccctt 721 tcccctcttc cctgtcttcc ccaccttccc gcaactgacc cggcctgacc cggcccc // LOCUS HUMRAB4A 735 bp mRNA PRI 08-JAN-1995 DEFINITION Homo sapiens GTP-binding protein (RAB4) mRNA, complete cds. ACCESSION M28211 J04941 NID g550067 KEYWORDS GTP-binding protein; ras oncogene. SOURCE Homo sapiens (tissue library: of J.Mallet) pheochromocytoma cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 735) AUTHORS Zahraoui,A., Touchot,N., Chardin,P. and Tavitian,A. TITLE The human Rab genes encode a family of GTP-binding proteins related to yeast YPT1 and SEC4 products involved in secretion JOURNAL J. Biol. Chem. 264 (21), 12394-12401 (1989) MEDLINE 89308668 FEATURES Location/Qualifiers source 1..735 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="pheochromocytoma" /tissue_lib="of J.Mallet" /map="1q42-q43" gene 71..712 /gene="RAB4" CDS 71..712 /gene="RAB4" /codon_start=1 /db_xref="GDB:G00-118-859" /product="GTP-binding protein" /db_xref="PID:g550068" /translation="MSETYDFLFKFLVIGNAGTGKSCLLHQFIEKKFKDDSNHTIGVE FGSKIINVGGKYVKLQIWDTAGQERFRSVTRSYYRGAAGALLVYDITSRETYNALTNW LTDARMLASQNIVIILCGNKKDLDADREVTFLEASRFAQENELMFLETSALTGEDVEE AFVQCARKILNKIESGELDPERMGSGIQYGDAALRQLRSPRRTQAPNAQECGC" BASE COUNT 220 a 149 c 197 g 169 t ORIGIN 1 cggaccgcgg gcgagtggca cggtgacccg gcgagaggcg gcgccgctcc caagatgtcg 61 cagacggcca atgtccgaaa cctacgattt tttgtttaag ttcttggtta ttggaaatgc 121 aggaactggc aaatcttgct tacttcatca gtttattgaa aaaaaattca aagatgactc 181 aaatcataca ataggagtgg aatttggttc aaagataata aatgttggtg gtaaatatgt 241 aaagttacaa atatgggata cagcaggaca agaacgattc aggtccgtga cgagaagtta 301 ttaccgaggc gcggccgggg ctctcctcgt ctatgatatc accagccgag aaacctacaa 361 tgcgcttact aattggttaa cagatgcccg aatgctagcg agccagaaca ttgtgatcat 421 cctttgtgga aacaagaagg acctggatgc agatcgtgaa gttaccttct tagaagcctc 481 cagatttgct caagaaaatg agctgatgtt tttggaaaca agtgcgctca caggggaaga 541 tgtagaagag gcttttgtac agtgtgcaag aaaaatactt aacaaaatcg aatcaggtga 601 gctggaccca gaaagaatgg gctcaggtat tcagtacgga gatgctgcct tgagacagct 661 gaggtcaccg cggcgcaccc aggccccgaa cgctcaggag tgtggttgtt aggagagcac 721 acaggtgttc ataca // LOCUS HUMRAB5A 719 bp mRNA PRI 08-JAN-1995 DEFINITION Homo sapiens GTP-binding protein (RAB5) mRNA, complete cds. ACCESSION M28215 J04941 NID g550069 KEYWORDS GTP-binding protein; ras oncogene. SOURCE Homo sapiens (tissue library: of J.Mallet) pheochromocytoma cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 719) AUTHORS Zahraoui,A., Touchot,N., Chardin,P. and Tavitian,A. TITLE The human Rab genes encode a family of GTP-binding proteins related to yeast YPT1 and SEC4 products involved in secretion JOURNAL J. Biol. Chem. 264 (21), 12394-12401 (1989) MEDLINE 89308668 FEATURES Location/Qualifiers source 1..719 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="pheochromocytoma" /tissue_lib="of J.Mallet" /map="3p24-p22" gene 57..704 /gene="RAB5" CDS 57..704 /gene="RAB5" /codon_start=1 /db_xref="GDB:G00-118-860" /product="GTP-binding protein" /db_xref="PID:g550070" /translation="MASRGATRPNGPNTGNKICQFKLVLLGESAVGKSSLVLRFVKGQ FHEFQESTIGAAFLTQTVCLDDTTVKFEIWDTAGQEGYHSLAPMYYRGAQAAIVVYDI TNEESFARAKNWVKELQRQASPNIVIALSGNKADLANKRAVDFQEAQSYADDNSLLFM ETSAKTSMNVNEIFMAIAKKLPKNEPQNPGANSARGGGVDLTEPTQPTRNQCCSN" BASE COUNT 256 a 139 c 154 g 170 t ORIGIN 1 gaattctgga agttcattga agagtctgaa attagggact tatttcaaat ttggacatgg 61 ctagtcgagg cgcaacaaga cccaacggcc caaatactgg aaataaaata tgccagttca 121 aactagtact tctgggagag tccgctgttg gcaaatcaag cctagtgctt cgttttgtga 181 aaggccaatt tcatgaattt caagagagta ccattggggc tgcttttcta acccaaactg 241 tatgtcttga tgacactaca gtaaagtttg aaatctggga tacagctggt caagaaggat 301 accatagcct agcaccaatg tactacagag gagcacaagc agccatagtt gtatatgata 361 tcacaaatga ggagtccttt gcaagagcaa aaaattgggt taaagaactt cagaggcaag 421 caagtcctaa cattgtaata gctttatcgg gaaacaaggc cgacctagca aataaaagag 481 cagtagattt ccaggaagca cagtcctatg cagatgacaa tagtttatta ttcatggaga 541 catccgctaa aacatcaatg aatgtaaatg aaatattcat ggcaatagct aaaaaattgc 601 caaagaatga accacaaaat ccaggagcaa attctgccag aggaggagga gtagacctta 661 ccgaacccac acaaccaacc aggaatcagt gttgtagtaa ctaaacctct agtttgaac // LOCUS HUMRAB6A 740 bp mRNA PRI 08-JAN-1995 DEFINITION Homo sapiens GTP-binding protein (RAB6) mRNA, complete cds. ACCESSION M28212 J04941 NID g550071 KEYWORDS GTP-binding protein; ras oncogene. SOURCE Homo sapiens (tissue library: of J.Mallet) pheochromocytoma cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 740) AUTHORS Zahraoui,A., Touchot,N., Chardin,P. and Tavitian,A. TITLE The human Rab genes encode a family of GTP-binding proteins related to yeast YPT1 and SEC4 products involved in secretion JOURNAL J. Biol. Chem. 264 (21), 12394-12401 (1989) MEDLINE 89308668 FEATURES Location/Qualifiers source 1..740 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="pheochromocytoma" /tissue_lib="of J.Mallet" /map="2q14-q21" gene 71..697 /gene="RAB6" CDS 71..697 /gene="RAB6" /codon_start=1 /db_xref="GDB:G00-118-861" /product="GTP-binding protein" /db_xref="PID:g550072" /translation="MSTGGDFGNPLRKFKLVFLGEQSVGKTSLITRFMYDSFDNTYQA TIGIDFLSKTMYLEDRTVRLQLWDTAGQERFRSLIPSYIRDSTVAVVVYDITNVNSFQ QTTKWIDDVRTERGSDVIIMLVGNKTDLADKRQVSIEEGERKAKELNVMFIETSAKAG YNVKQLFRRVAAALPGMESTQDRSREDMIDIKLEKPQEQPVSEGGCSC" BASE COUNT 223 a 146 c 193 g 178 t ORIGIN 1 agctggctgg agcagcatcg gtccgggacg gtctctaggc tgaggcggcg gccgctcctc 61 tagttccaca atgtccacgg gcggagactt cgggaatccg ctgaggaaat tcaagctggt 121 gttcctgggg gagcaaagcg ttggaaagac atctttgatc accagattca tgtatgacag 181 ttttgacaac acctatcagg caacaattgg cattgacttt ttatcaaaaa ctatgtactt 241 ggaggatcga acagtacgat tgcaattatg ggacacagca ggtcaagagc ggttcaggag 301 cttgattcct agctacattc gtgactccac tgtggcagtt gttgtttatg atatcacaaa 361 tgttaactca ttccagcaaa ctacaaagtg gattgatgat gtcagaacag aaagaggaag 421 tgatgttatc atcatgctag taggaaataa aacagatctt gctgacaaga ggcaagtgtc 481 aattgaggag ggagagagga aagccaaaga gctgaatgtt atgtttattg aaactagtgc 541 aaaagctgga tacaatgtaa agcagctctt tcgacgtgta gcagcagctt tgccgggaat 601 ggaaagcaca caggacagaa gcagagaaga tatgattgac ataaaactgg aaaagcctca 661 ggagcaacca gtcagtgaag gaggctgttc ctgctaatgt ccctagtcat cttcaacctt 721 cttcagaagc tcactgcttt // LOCUS HUMRACA 579 bp mRNA PRI 15-MAR-1990 DEFINITION Human ras-related C3 botulinum toxin substrate (rac) mRNA, complete cds. ACCESSION M29870 J05038 NID g190823 KEYWORDS C3 botulinum toxin. SOURCE Human HL-60 cell line, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 579) AUTHORS Didsbury,J., Weber,R.F., Bokoch,G.M., Evans,T. and Snyderman,R. TITLE rac, a novel ras-related family of proteins that are botulinum toxin substrates JOURNAL J. Biol. Chem. 264, 16378-16382 (1989) MEDLINE 89380250 FEATURES Location/Qualifiers source 1..579 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1..579 /note="ras-related C3 botulinum toxin substrate" /codon_start=1 /db_xref="PID:g190824" /translation="MQAIKCVVVGDGAVGKTCLLISYTTNAFPGEYIPTVFDNYSANV MVDGKPVNLGLWDTAGQEDYDRLRPLSYPQTDVFLICFSLVSPASFENVRAKWYPEVR HHCPNTPIILVGTKLDLRDDKDTIEKLKEKKLTPITYPQGLAMAKEIGAVKYLECSAL TQRGLKTVFDEAIRAVLCPPPVKKRKRKCLLL" BASE COUNT 159 a 133 c 145 g 142 t ORIGIN 1 atgcaggcca tcaagtgtgt ggtggtggga gacggagctg taggtaaaac ttgcctactg 61 atcagttaca caaccaatgc atttcctgga gaatatatcc ctactgtctt tgacaattat 121 tctgccaatg ttatggtaga tggaaaaccg gtgaatctgg gcttatggga tacagctgga 181 caagaagatt atgacagatt acgcccccta tcctatccgc aaacagatgt gttcttaatt 241 tgcttttccc ttgtgagtcc tgcatcattt gaaaatgtcc gtgcaaagtg gtatcctgag 301 gtgcggcacc actgtcccaa cactcccatc atcctagtgg gaactaaact tgatcttagg 361 gatgataaag acacgatcga gaaactgaag gagaagaagc tgactcccat cacctatccg 421 cagggtctag ccatggctaa ggagattggt gctgtaaaat acctggagtg ctcggcgctc 481 acacagcgag gcctcaagac agtgtttgac gaagcgatcc gagcagtcct ctgcccgcct 541 cccgtgaaga agaggaagag aaaatgcctg ctgttgtaa // LOCUS HUMRACB 579 bp mRNA PRI 15-MAR-1990 DEFINITION Human ras-related C3 botulinum toxin substrate (rac) mRNA, complete cds. ACCESSION M29871 J05038 NID g190825 KEYWORDS C3 botulinum toxin. SOURCE Human HL-60 cell line, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 579) AUTHORS Didsbury,J., Weber,R.F., Bokoch,G.M., Evans,T. and Snyderman,R. TITLE rac, a novel ras-related family of proteins that are botulinum toxin substrates JOURNAL J. Biol. Chem. 264, 16378-16382 (1989) MEDLINE 89380250 FEATURES Location/Qualifiers source 1..579 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1..579 /note="ras-related C3 botulinum toxin substrate" /codon_start=1 /db_xref="PID:g190826" /translation="MQAIKCVVVGDGAVGKTCLLISYTTNAFPGEYIPTVFDNYSANV MVDSKPVNLGLWDTAGQEDYDRLRPLSYPQTDVFLICFSLVSPASYENVRAKWFPEVR HHCPSTPIILVGTKLDLRDDKDTIEKLKEKKLAPITYPQGLALAKEIDSVKYLECSAL TQRGLKTVFDEAIRAVLCPQPTRQQKRACSLL" BASE COUNT 125 a 190 c 163 g 101 t ORIGIN 1 atgcaggcca tcaagtgtgt ggtggtggga gatggggccg tgggcaagac ctgccttctc 61 atcagctaca ccaccaacgc ctttcccgga gagtacatcc ccaccgtgtt tgacaactat 121 tcagccaatg tgatggtgga cagcaagcca gtgaacctgg ggctgtggga cactgctggg 181 caggaggact acgaccgtct ccggccgctc tcctatccac agacggacgt cttcctcatc 241 tgcttctccc tcgtcagccc agcctcttat gagaacgtcc gcgccaagtg gttcccagaa 301 gtgcggcacc actgccccag cacacccatc atcctggtgg gcaccaagct ggacctgcgg 361 gacgacaagg acaccatcga gaaactgaag gagaagaagc tggctcccat cacctacccg 421 cagggcctgg cactggccaa ggagattgac tcggtgaaat acctggagtg ctcagccctc 481 acccagagag gcctgaaaac cgtgttcgac gaggccatcc gggccgtgct gtgccctcag 541 cccacgcggc agcagaagcg cgcctgcagc ctcctctag // LOCUS HUMRAD51 2229 bp mRNA PRI 16-APR-1993 DEFINITION Human mRNA for RAD51, complete cds. ACCESSION D14134 NID g285976 KEYWORDS RAD51; histone H2A. SOURCE Homo sapiens (sub_species:Caucasian) testis cDNA to mRNA, clone_lib:cDNA in pCD8. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2229) AUTHORS Yoshimura,Y., Morita,T., Yamamoto,A. and Matsushiro,A. TITLE Cloning and sequence of the human RecA-like gene cDNA JOURNAL Nucleic Acids Res. 21 (7), 1665 (1993) MEDLINE 93241950 REFERENCE 2 (bases 1 to 2229) AUTHORS Morita,T. TITLE Direct Submission JOURNAL Submitted (22-JAN-1993) to the DDBJ/EMBL/GenBank databases. Takashi Morita, Research Institute for Microbial Deseases, Dept. of Microbial Genetics, Osaka Univ.; 3-1 Yamadaoka, Suita, Osaka 565, Japan (Tel:06-877-5121(ex.3172), Fax:06-876-2678) COMMENT Submitted (22-JAN-1993) to DDBJ by: Takashi Morita Department of Microbial Genetics Research Institute for Microbial Diseases Osaka University 3-1 Yamadaoka Suita, Osaka 565 Japan Phone: 06-875-2913 Fax: 06-876-2678. FEATURES Location/Qualifiers source 1..2229 /organism="Homo sapiens" /sub_species="Caucasian" /db_xref="taxon:9606" /clone_lib="cDNA in pCD8" /tissue_type="testis" gene 233..1252 /gene="Rad51" CDS 233..1252 /gene="Rad51" /codon_start=1 /product="RAD51" /db_xref="PID:d1003698" /db_xref="PID:g285977" /translation="MAMQMQLEANADTSVEEESFGPQPISRLEQCGINANDVKKLEEA GFHTVEAVAYAPKKELINIKGISEAKADKILAEAAKLVPMGFTTATEFHQRRSEIIQI TTGSKELDKLLQGGIETGSITEMFGEFRTGKTQICHTLAVTCQLPIDRGGGEGKAMYI DTEGTFRPERLLAVAERYGLSGSDVLDNVAYARAFNTDHQTQLLYQASAMMVESRYAL LIVDSATALYRTDYSGRGELSARQMHLARFLRMLLRLADEFGVAVVITNQVVAQVDGA AMFAADPKKPIGGNIIAHASTTRLYLRKGRGETRICKIYDSPCLPEAEAMFAINADGV GDAKD" repeat_region 1862..2173 /rpt_unit=1862..1872 polyA_signal 2208..2213 BASE COUNT 593 a 472 c 602 g 562 t ORIGIN Chromosome 15. 1 ccgcgcgcag cggccagaga ccgagcccta aggagagtgc ggcgcttccc gaggcgtgca 61 gctgggaact gcaactcatc tgggttgtgc gcagaaggct ggggcaagcg agtagagaag 121 tggagcgtaa gccaggggcg ttgggggccg tgcgggtcgg gcgcgtgcca cgcccgcggg 181 gtgaagtcgg agcgcggggc ctgctggaga gaggagcgct gcggaccgag taatggcaat 241 gcagatgcag cttgaagcaa atgcagatac ttcagtggaa gaagaaagct ttggcccaca 301 acccatttca cggttagagc agtgtggcat aaatgccaac gatgtgaaga aattggaaga 361 agctggattc catactgtgg aggctgttgc ctatgcgcca aagaaggagc taataaatat 421 taagggaatt agtgaagcca aagctgataa aattctggct gaggcagcta aattagttcc 481 aatgggtttc accactgcaa ctgaattcca ccaaaggcgg tcagagatca tacagattac 541 tactggctcc aaagagcttg acaaactact tcaaggtgga attgagactg gatctatcac 601 agaaatgttt ggagaattcc gaactgggaa gacccagatc tgtcatacgc tagctgtcac 661 ctgccagctt cccattgacc ggggtggagg tgaaggaaag gccatgtaca ttgacactga 721 gggtaccttt aggccagaac ggctgctggc agtggctgag aggtatggtc tctctggcag 781 tgatgtcctg gataatgtag catatgctcg agcgttcaac acagaccacc agacccagct 841 cctttatcaa gcatcagcca tgatggtaga atctaggtat gcactgctta ttgtagacag 901 tgccaccgcc ctttacagaa cagactactc gggtcgaggt gagctttcag ccaggcagat 961 gcacttggcc aggtttctgc ggatgcttct gcgactcgct gatgagtttg gtgtagcagt 1021 ggtaatcact aatcaggtgg tagctcaagt ggatggagca gcgatgtttg ctgctgatcc 1081 caaaaaacct attggaggaa atatcatcgc ccatgcatca acaaccagat tgtatctgag 1141 gaaaggaaga ggggaaacca gaatctgcaa aatctacgac tctccctgtc ttcctgaagc 1201 tgaagctatg ttcgccatta atgcagatgg agtgggagat gccaaagact gaatcattgg 1261 gtttttcctc tgttaaaaac cttaagtgct gcagcctaat gagagtgcac tgctccctgg 1321 ggttctctac aggcctcttc ctgttgtgac tgccaggata aagcttccgg gaaaacagct 1381 attatatcag cttttctgat ggtataaaca ggagacaggt cagtagtcac aaactgatct 1441 aaaatgttta ttccttctgt agtgtattaa tctctgtgtg ttttctttgg ttttggagga 1501 ggggtatgaa gtatctttga catggtgcct taggaatgac ttgggtttaa caagctgtct 1561 actggacaat cttatgtttc caagagaact aaagctggag agacctgacc cttctctcac 1621 ttctaaatta atggtaaaat aaaatgcctc agctatgtag caaagggaat gggtctgcac 1681 agattctttt tttctgtcag taaaactctc aagcaggttt ttaagttgtc tgtctgaatg 1741 atcttgtgta agggtttggt tatggagtct tgtgccaaac ctactaggcc attagccctt 1801 caccatctac ctgcttggtc tttcattgct aagactaact caagataatc ctagagtctt 1861 aaagcatttc aggccagtgt ggtgtcttgc gcctgtactc ccagcacttt gggaggccga 1921 ggcaggtgga tcgcttgagc caggagtttt aagtccagct tggccaagat ggtgaaatcc 1981 catctctaca aaaaatgcag aacttaatct ggacacactg ttacacgtgc ctgtagtccc 2041 agctactcta tagcctgagg tgggagaatc acttaagcct ggaaggtgga agttgcagtg 2101 agtcgagatt gcactgctgc attccagcca gggtgacaga gtgagaccat gtttcaaaca 2161 agaaacattt cagagggcaa gtaaacagat ttgattgtga ggcttctaat aaagtagtta 2221 ttagtagtg // LOCUS HUMRADIXIN 2022 bp mRNA PRI 21-JUN-1993 DEFINITION Human radixin mRNA, complete cds. ACCESSION L02320 NID g307365 KEYWORDS radixin. SOURCE Homo sapiens (library: Stratagene) male liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2022) AUTHORS Wilgenbus,K.K., Milatovich,A., Francke,U. and Furthmayr,E. TITLE Molecular cloning, cDNA sequence, and chromosomal assignment of the human radixin gene and two dispersed pseudogenes JOURNAL Genomics 16, 199-206 (1993) MEDLINE 93252378 FEATURES Location/Qualifiers source 1..2022 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /tissue_type="liver" /tissue_lib="Stratagene" CDS 31..1782 /codon_start=1 /product="radixin" /db_xref="PID:g307366" /translation="MPKPINVRVTTMDAELEFAIQPNTTGKQLFDQVVKTVGLREVWF FGLQYVDSKGYSTWLKLNKKVTQQDVKKENPLQFKFRAKFFPEDVSEELIQEITQRLF FLQVKEAILNDEIYCPPETAVLLASYAVQAKYGDYNKEIHKPGYLANDRLLPQRVLEQ HKLTKEQWEERIQNWHEEHRGMLREDSMMEYLKIAQDLEMYGVNYFEIKNKKGTELWL GVDALGLNIYEHDDKLTPKIGFPWSEIRNISFNDKKFVIKPIDKKAPDFVFYAPRLRI NKRILALCMGNHELYMRRRKPDTIEVQQMKAQAREEKHQKQLERAQLENEKKKREIAE KEKERIEREKEELMERLKQIEEQTIKAQKELEEQTRKALELDQERKRAKEEAERLEKE RRAAEEAKSAIAKQAADQMKNQEQLAAELAEFTAKIALLEEAKKKKEEEATEWQHKAF AAQEDLEKTKEELKTVMSAPPPPPPPPVIPPTENEHDEHDENNAEASAELSNEGVMNH RSEEERVTETQKNERVKKQLQALSSELAQARDETKKTQNDVLHAENVKAGRDKYKTLR QIRQGNTKQRIDEFEAM" BASE COUNT 757 a 354 c 458 g 453 t ORIGIN 1 aattcggcac gagacaaaaa gagaaagaaa atgccgaaac caatcaacgt aagagtaact 61 acaatggatg ctgagctgga atttgccatt cagcccaata caactggcaa acaacttttt 121 gaccaggtgg tgaaaacagt tggtttgcgt gaggtctggt tttttgggct gcagtatgta 181 gacagcaaag gttattctac atggcttaaa ctaaataaaa aggtaacaca gcaggatgtt 241 aaaaaagaga atcctttaca gttcaagttt agagctaaat tctttcctga agatgtttct 301 gaggaattaa ttcaagaaat aacccagaga ctcttcttct tgcaagttaa agaagccatc 361 ttaaatgatg agatatattg cccgccagaa actgcagttc ttttggcttc ctatgctgtc 421 caagccaagt atggagatta caataaagag attcataagc caggctacct ggctaatgat 481 agactcctac cccagcgtgt attggaacaa cacaaactaa caaaagaaca gtgggaagaa 541 agaatacaga actggcatga agaacataga ggaatgttaa gggaggattc tatgatggaa 601 tacctgaaga ttgcacaaga tctagaaatg tatggagtca actattttga aataaaaaat 661 aaaaaaggaa ctgaattgtg gctaggtgtt gatgctttgg gtctgaatat ttatgagcat 721 gacgacaagt taacacctaa aattggtttt ccctggagtg aaatcagaaa tatttcattt 781 aatgacaaaa aatttgttat aaagccaatc gacaaaaagg cacctgattt tgtgttttat 841 gcacctcgtc tgagaatcaa taagcggatt ttggccttat gtatgggaaa ccatgaacta 901 tacatgcgaa gaaggaagcc tgatactatt gaagtacaac agatgaaggc tcaggctagg 961 gaggagaaac atcagaagca gttggaaagg gcacaattag agaatgaaaa gaagaaaaga 1021 gaaatagcag aaaaggaaaa ggaaagaata gaacgtgaaa aggaagagct aatggaacgt 1081 ctaaaacaaa ttgaagagca gacaattaaa gctcagaaag aactagaaga acagactcga 1141 aaagctctag aactggatca agaacgaaaa cgagcaaaag aagaagcaga acgacttgaa 1201 aaggagcgtc gagctgctga agaggcaaag tctgccatag caaaacaagc tgccgaccag 1261 atgaagaatc aggagcagct agcagcagaa cttgctgaat tcactgccaa gattgcactt 1321 ctagaggaag ccaagaagaa aaaggaagag gaagcaactg agtggcaaca caaagctttt 1381 gcagcccagg aagacttgga aaagaccaaa gaagagttaa aaactgtgat gtctgccccc 1441 cctccacctc caccaccacc agtcattcct ccaacagaaa acgaacatga tgaacacgat 1501 gagaataatg ctgaagctag tgctgaatta tcaaatgaag gggtaatgaa ccatagaagc 1561 gaggaagaac gtgtaaccga aacacagaaa aatgagcgtg ttaagaagca acttcaggca 1621 ttaagttcag aattagccca agccagagat gaaaccaaga aaacacaaaa tgatgttctt 1681 catgctgaga atgttaaagc aggccgtgat aagtacaaga ctctgcgaca gattcgacaa 1741 ggcaatacaa agcagcgtat cgatgagttt gaagcaatgt gagagctgtt attttgcata 1801 tatgttcttc ataagctgaa ccaccaacag agaaaagcag gcctttgcag atatgatgga 1861 atgcatccca ccttgccaaa gcacttacac cagtttgact gtgctagcta aaagacaaat 1921 ttaaggggag ctcttcaaca ttaaggcagt atgatatcat gcttggtttt cttttttctt 1981 ttggtccagg gaatggagaa tggtgttcca ttgcctcttt tt // LOCUS HUMRAG1 6545 bp mRNA PRI 08-JAN-1995 DEFINITION Human recombination activating protein (RAG-1) gene, complete cds. ACCESSION M29474 NID g190842 KEYWORDS recombination activating protein. SOURCE Human pre-B cell, line NALM6, cDNA to mRNA, clone H36. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6545) AUTHORS Schatz,D.G., Oettinger,M.A. and Baltimore,D. JOURNAL Unpublished (1989) REFERENCE 2 (bases 1 to 6545) AUTHORS Schatz,D.G., Oettinger,M.A. and Baltimore,D. TITLE The V(D)J recombination activating gene, RAG-1 JOURNAL Cell 59 (6), 1035-1048 (1989) MEDLINE 90090604 COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by D.G.Schatz 20-OCT-1989. FEATURES Location/Qualifiers source 1..6545 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Unassigned" mRNA <1..6545 /note="RAG1 mRNA" gene 113..3244 /gene="RAG1" CDS 113..3244 /gene="RAG1" /note="recombination activating protein" /codon_start=1 /db_xref="GDB:G00-120-334" /db_xref="PID:g190843" /translation="MAASFPPTLGLSSAPDEIQHPHIKFSEWKFKLFRVRSFEKTPEE AQKEKKDSFEGKPSLEQSPAVLDKADGQKPVPTQPLLKAHPKFSKKFHDNEKARGKAI HQANLRHLCRICGNSFRADEHNRRYPVHGPVDGKTLGLLRKKEKRATSWPDLIAKVFR IDVKADVDSIHPTEFCHNCWSIMHRKFSSAPCEVYFPRNVTMEWHPHTPSCDICNTAR RGLKRKSLQPNLQLSKKLKTVLDQARQARQRKRRAQARISSKDVMKKIANCSKIHLST KLLAVDFPEHFVKSISCQICEHILADPVETNCKHVFCRVCILRCLKVMGSYCPSCRYP CFPTDLESPVKSFLSVLNSLMVKCPAKECNEEVSLEKYNHHISSHKESKEIFVHINKG GRPRQHLLSLTRRAQKHRLRELKLQVKAFADKEEGGDVKSVCMTLFLLALRARNEHRQ ADELEAIMQGKGSGLQPAVCLAIRVNTFLSCSQYHKMYRTVKAITGRQIFQPLHALRN AEKVLLPGYHHFEWQPPLKNVSSSTDVGIIDGLSGLSSSVDDYPVDTIAKRFRYDSAL VSALMDMEEDILEGMRSQDLDDYLNGPFTVVVKESCDGMGDVSEKHGSGPVVPEKAVR FSFTIMKITIAHSSQNVKVFEEAKPNSELCCKPLCLMLADESDHETLTAILSPLIAER EAMKSSELMLELGGILRTFKFIFRGTGYDEKLVREVEGLEASGSVYICTLCDATRLEA SQNLVFHSITRSHAENLERYEVWRSNPYHESVEELRDRVKGVSAKPFIETVPSIDALH CDIGNAAEFYKIFQLEIGEVYKNPNASKEERKRWQATLDKHLRKKMNLKPIMRMNGNF ARKLMTKETVDAVCELIPSEERHEALRELMDLYLKMKPVWRSSCPAKECPESLCQYSF NSQRFAELLSTKFKYRYEGKITNYFHKTLAHVPEIIERDGSIGAWASEGNESGNKLFR RFRKMNARQSKCYEMEDVLKHHWLYTSKYLQKFMNAHNALKTSGFTMNPQASLGDPLG IEDSLESQDSMEF" BASE COUNT 1848 a 1314 c 1415 g 1968 t ORIGIN 1 gagagcagag aacacacttt gccttctctt tggtattgag taatatcaac caaattgcag 61 acatctcaac actttggcca ggcagcctgc tgagcaaggt acctcagcca gcatggcagc 121 ctctttccca cccaccttgg gactcagttc tgccccagat gaaattcagc acccacatat 181 taaattttca gaatggaaat ttaagctgtt ccgggtgaga tcctttgaaa agacacctga 241 agaagctcaa aaggaaaaga aggattcctt tgaggggaaa ccctctctgg agcaatctcc 301 agcagtcctg gacaaggctg atggtcagaa gccagtccca actcagccat tgttaaaagc 361 ccaccctaag ttttcaaaga aatttcacga caacgagaaa gcaagaggca aagcgatcca 421 tcaagccaac cttcgacatc tctgccgcat ctgtgggaat tcttttagag ctgatgagca 481 caacaggaga tatccagtcc atggtcctgt ggatggtaaa accctaggcc ttttacgaaa 541 gaaggaaaag agagctactt cctggccgga cctcattgcc aaggttttcc ggatcgatgt 601 gaaggcagat gttgactcga tccaccccac tgagttctgc cataactgct ggagcatcat 661 gcacaggaag tttagcagtg ccccatgtga ggtttacttc ccgaggaacg tgaccatgga 721 gtggcacccc cacacaccat cctgtgacat ctgcaacact gcccgtcggg gactcaagag 781 gaagagtctt cagccaaact tgcagctcag caaaaaactc aaaactgtgc ttgaccaagc 841 aagacaagcc cgtcagcgca agagaagagc tcaggcaagg atcagcagca aggatgtcat 901 gaagaagatc gccaactgca gtaagataca tcttagtacc aagctccttg cagtggactt 961 cccagagcac tttgtgaaat ccatctcctg ccagatctgt gaacacattc tggctgaccc 1021 tgtggagacc aactgtaagc atgtcttttg ccgggtctgc attctcagat gcctcaaagt 1081 catgggcagc tattgtccct cttgccgata tccatgcttc cctactgacc tggagagtcc 1141 agtgaagtcc tttctgagcg tcttgaattc cctgatggtg aaatgtccag caaaagagtg 1201 caatgaggag gtcagtttgg aaaaatataa tcaccacatc tcaagtcaca aggaatcaaa 1261 agagattttt gtgcacatta ataaaggggg ccggccccgc caacatcttc tgtcgctgac 1321 tcggagagct cagaagcacc ggctgaggga gctcaagctg caagtcaaag cctttgctga 1381 caaagaagaa ggtggagatg tgaagtccgt gtgcatgacc ttgttcctgc tggctctgag 1441 ggcgaggaat gagcacaggc aagctgatga gctggaggcc atcatgcagg gaaagggctc 1501 tggcctgcag ccagctgttt gcttggccat ccgtgtcaac accttcctca gctgcagtca 1561 gtaccacaag atgtacagga ctgtgaaagc catcacaggg agacagattt ttcagccttt 1621 gcatgccctt cggaatgctg agaaggtact tctgccaggc taccaccact ttgagtggca 1681 gccacctctg aagaatgtgt cttccagcac tgatgttggc attattgatg ggctgtctgg 1741 actatcatcc tctgtggatg attacccagt ggacaccatt gcaaagaggt tccgctatga 1801 ttcagctttg gtgtctgctt tgatggacat ggaagaagac atcttggaag gcatgagatc 1861 ccaagacctt gatgattacc tgaatggccc cttcactgtg gtggtgaagg agtcttgtga 1921 tggaatggga gacgtgagtg agaagcatgg gagtgggcct gtagttccag aaaaggcagt 1981 ccgtttttca ttcacaatca tgaaaattac tattgcccac agctctcaga atgtgaaagt 2041 atttgaagaa gccaaaccta actctgaact gtgttgcaag ccattgtgcc ttatgctggc 2101 agatgagtct gaccacgaga cgctgactgc catcctgagt cctctcattg ctgagaggga 2161 ggccatgaag agcagtgaat taatgcttga gctgggaggc attctccgga ctttcaagtt 2221 catcttcagg ggcaccggct atgatgaaaa acttgtgcgg gaagtggaag gcctcgaggc 2281 ttctggctca gtctacattt gtactctttg tgatgccacc cgtctggaag cctctcaaaa 2341 tcttgtcttc cactctataa ccagaagcca tgctgagaac ctggaacgtt atgaggtctg 2401 gcgttccaac ccttaccatg agtctgtgga agaactgcgg gatcgggtga aaggggtctc 2461 agctaaacct ttcattgaga cagtcccttc catagatgca ctccactgtg acattggcaa 2521 tgcagctgag ttctacaaga tcttccagct agagataggg gaagtgtata agaatcccaa 2581 tgcttccaaa gaggaaagga aaaggtggca ggccacactg gacaagcatc tccggaagaa 2641 gatgaacctc aaaccaatca tgaggatgaa tggcaacttt gccaggaagc tcatgaccaa 2701 agagactgtg gatgcagttt gtgagttaat tccttccgag gagaggcacg aggctctgag 2761 ggagctgatg gatctttacc tgaagatgaa accagtatgg cgatcatcat gccctgctaa 2821 agagtgccca gaatccctct gccagtacag tttcaattca cagcgttttg ctgagctcct 2881 ttctacgaag ttcaagtata ggtatgaggg aaaaatcacc aattattttc acaaaaccct 2941 ggcccatgtt cctgaaatta ttgagaggga tggctccatt ggggcatggg caagtgaggg 3001 aaatgagtct ggtaacaaac tgtttaggcg cttccggaaa atgaatgcca ggcagtccaa 3061 atgctatgag atggaagatg tcctgaaaca ccactggttg tacacctcca aatacctcca 3121 gaagtttatg aatgctcata atgcattaaa aacctctggg tttaccatga accctcaggc 3181 aagcttaggg gacccattag gcatagagga ctctctggaa agccaagatt caatggaatt 3241 ttaagtaggg caaccactta tgagttggtt tttgcaattg agtttccctc tgggttgcat 3301 tgagggcttc tcctagcacc ctttactgct gtgtatgggg cttcaccatc caagaggtgg 3361 taggttggag taagatgcta cagatgctct caagtcagga atagaaactg atgagctgat 3421 tgcttgaggc ttttagtgag ttccgaaaag caacaggaaa aatcagttat ctgaaagctc 3481 agtaactcag aacaggagta actgcagggg accagagatg agcaaagatc tgtgtgtgtt 3541 ggggagctgt catgtaaatc aaagccaagg ttgtcaaaga acagccagtg aggccagaaa 3601 ttggtcttgt ggttttcatt tttttccccc ttgattgatt atattttgta ttgagatatg 3661 ataagtgcct tctatttcat ttttgaataa ttcttcattt ttataatttt acatatcttg 3721 gcttgctata taagattcaa aagagctttt taaatttttc taataatatc ttacatttgt 3781 acagcatgat gacctttaca aagtgctctc aatgcattta cccattcgtt atataaatat 3841 gttacatcag gacaactttg agaaaatcag tcctttttta tgtttaaatt atgtatctat 3901 tgtaaccttc agagtttagg aggtcatctg ctgtcatgga tttttcaata atgaatttag 3961 aatacacctg ttagctacag ttagttatta aatcttctga taatatatgt ttacttagct 4021 atcagaagcc aagtatgatt ctttattttt actttttcat ttcaagaaat ttagagtttc 4081 caaatttaga gcttctgcat acagtcttaa agccacagag gcttgtaaaa atataggtta 4141 gcttgatgtc taaaaatata tttcatgtct tactgaaaca ttttgccaga ctttctccaa 4201 atgaaacctg aatcaatttt tctaaatcta ggtttcatag agtcctctcc tctgcaatgt 4261 gttattcttt ctataatgat cagtttactt tcagtggatt cagaattgtg tagcaggata 4321 accttgtatt tttccatccg ctaagtttag atggagtcca aacgcagtac agcagaagag 4381 ttaacattta cacagtgctt tttaccactg tggaatgttt tcacactcat ttttccttac 4441 aacaattctg aggagtaggt gttgttatta tctccatttg atgggggttt aatgatttgc 4501 tcaaagtcat ttaggggtaa taaatacttg gcttggaaat ttaacacagt ccttttgtct 4561 ccaaagccct tcttctttcc accacaaatt aatcactatg tttataaggt agtatcagaa 4621 tttttttagg attcacaact aatcactata gcacatgacc ttgggattac atttttatgg 4681 ggcaggggta agcggctttt aaatcatttg tgtgctctgg ctcttttgat agaagaaagc 4741 aacacaaaag ctccaaaggg ccccctaacc ctcttgtggc tccagttatt tggaaactat 4801 gatctgcatc cttaggaatc tgggatttgc cagttgctgg caatgtagag caggcatgga 4861 attttatatg ctagtgagtc ataatgatat gttagtgtta attagttttt cttcctttga 4921 ttttattggc cataattgct actcttcata cacagtatat caaagagctt gataatttag 4981 ttgtcaaaag tgcatcggcg acattatctt taattgtatg tatttggtgc ttcttcaggg 5041 attgaactca gtatctttca ttaaaaaaca cagcagtttt ccttgctttt tatatgcaga 5101 atatcaaagt catttctaat ttagttgtca aaaacatata catattttaa cattagtttt 5161 tttgaaaact cttggttttg tttttttgga aatgagtggg ccactaagcc acactttccc 5221 ttcatcctgc ttaatccttc cagcatgtct ctgcactaat aaacagctaa attcacataa 5281 tcatcctatt tactgaagca tggtcatgct ggtttataga ttttttaccc atttctactc 5341 tttttctcta ttggtggcac tgtaaatact ttccagtatt aaattatcct tttctaacac 5401 tgtaggaact attttgaatg catgtgacta agagcatgat ttatagcaca acctttccaa 5461 taatccctta atcagatcac attttgataa accctgggaa catctggctg caggaatttc 5521 aatatgtaga aacgctgcct atggtttttt gcccttactg ttgagactgc aatatcctag 5581 accctagttt tatactagag ttttattttt agcaatgcct attgcaagtg caattatata 5641 ctccagggaa attcaccaca ctgaatcgag catttgtgtg tgtatgtgtg aagtatatct 5701 gggacttcag aagtgcaatg tatttttctc ctgtgaaacc tgaatctaca agttttctgc 5761 caagccactc aggtgcattg cagggaccag tgataatggc tgatgaaaat tgatgattgg 5821 tcagtgaggt caaaaggagc cttgggatta ataaacatgc actgagaagc aagaggagga 5881 gaaaaagatg tctttttctt ccaggtgaac tggaatttag ttttgcctca gatttttttc 5941 ccacaagata cagaagaaga taaagatttt tttggttgag agtgtgggtc ttgcattaca 6001 tcaaacagag ttcaaattcc acacagataa gaggcaggat atataagcgc cagtggtagt 6061 tgggaggaat aaaccattat ttggatgcag gtggtttttg attgcaaata tgtgtgtgtc 6121 ttcagtgatt gtatgacaga tgatgtattc ttttgatgtt aaaagatttt aagtaagagt 6181 agatacattg tacccatttt acattttctt attttaacta cagtaatcta cataaatata 6241 cctcagaaat catttttggt gattattttt tgttttgtag aattgcactt cagtttattt 6301 tcttacaaat aaccttacat tttgtttaat ggcttccaag agcctttttt tttttgtatt 6361 tcagagaaaa ttcaggtacc aggatgcaat ggatttattt gattcagggg acctgtattt 6421 ccatgtcaaa tgttttcaaa taaaatgaaa tatgagtttc aatacttttt atattttaat 6481 atttccttaa tattatggtt attgtccgcc attttgttgt atattgtaaa taaagtttag 6541 attgt // LOCUS HUMRAP1GAP 3270 bp mRNA PRI 08-JAN-1995 DEFINITION Human GTPase activating protein (rap1GAP) mRNA, complete cds. ACCESSION M64788 NID g190855 KEYWORDS GTPase activating protein. SOURCE Homo sapiens (tissue library: Stratagene human fetal brain Cat#936206) fetal brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3270) AUTHORS Rubinfeld,B., Munemitsu,S., Clark,R., Conroy,L., Watt,K., Crosier,W.J., McCormick,F. and Polakis,P. TITLE Molecular cloning of a GTPase activating protein specific for the Krev-1 protein p21rap1 JOURNAL Cell 65 (6), 1033-1042 (1991) MEDLINE 91256304 FEATURES Location/Qualifiers source 1..3270 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetal" /tissue_type="brain" /tissue_lib="Stratagene human fetal brain Cat#936206" gene 202..2193 /gene="rap1GAP" CDS 202..2193 /gene="rap1GAP" /codon_start=1 /product="GTPase activating protein" /db_xref="PID:g190856" /translation="MIEKMQGSRMDEQRCSFPPPLKTEEDYIPYPSVHEVLGREGPFP LILLPQFGGYWIEGTNHEITSIPETEPLQSPTTKVKLECNPTARIYRKHFLGKEHFNY YSLDTALGHLVFSLKYDVIGDQEHLRLLLRTKCRTYHDVIPISCLTEFPNVVQMAKLV CEDVNVDRFYPVLYPKASRLIVTFDEHVISNNFKFGVIYQKLGQTSEEELFSTNEESP AFVEFLEFLGQKVKLQDFKGFRGGLDVTHGQTGTESVYCNFRNKEIMFHVSTKLPYTE GDAQQLQRKRHIGNDIVAVVFQDENTPFVPDMIASNFLHAYVVVQAEGGGPDGPLYKV SVTARDDVPFFGPPLPDPAVFRKGPEFQEFLLTKLINAEYACYKAEKFAKLEERTRAA LLETLYEELHIHSQSMMGLGGDEDKMENGSGGGGFFESFKRVIRSRSQSMDAMGLSNK KPNTVSTSHSGSFAPNNPDLAKAAGISLIVPGKSPTRKKSGPFGSRRSSAIGIENIQE VQEKRESPPAGQKTPDSGHVSQEPKSENSSTQSSPEMPTTKNRAETAAQRAEALKDFS RSSSSASSFASVVEETEGVDGEDTGLESVSSSGTPHKRDSFIYSTWLEDSVSTTSGGS SPGPSRSPHPDAGKLGDPACPEIKIQLEASEQHMPQLGC" BASE COUNT 710 a 1003 c 941 g 616 t ORIGIN 1 ggccgcgggc accagagtgc cgagcccagg acgcccccgg cccaggccct tggggtggac 61 aagtccttca cttctcgccg gagtgtgtgg aggagcgatg ggcagaacca gcacttccct 121 caggcactag acctgtcacg agtgaactta gttccctcct atactccttc actctaccct 181 aagaacacag atctatttga gatgattgag aagatgcagg gaagcaggat ggatgaacaa 241 cgctgctcct tcccgccgcc cctcaaaaca gaggaggact acattccata cccgagcgtg 301 cacgaggtct tggggcgaga aggacccttc cccctcatcc tgctgcccca gtttgggggc 361 tactggattg agggcaccaa ccacgaaatc accagcatcc ccgagacaga gccactgcag 421 tcgcccacaa ccaaggtgaa gctcgagtgc aaccccacag cccgcatcta ccggaagcac 481 tttctcggca aggagcattt caattactac tcactggaca ctgccctcgg ccaccttgtc 541 ttctcactca agtacgatgt catcggggac caagagcacc tgcggctgct gctcaggacc 601 aagtgccgga cataccatga tgtcatcccc atctcctgcc tcaccgagtt ccctaatgtt 661 gtccagatgg caaagttggt gtgtgaagac gtcaatgtgg atcggttcta tcctgtgctc 721 taccccaagg cttcccggct catcgtcacc tttgacgagc atgtcatcag caataacttc 781 aagtttggcg tcatttatca gaagcttggg cagacctccg aggaagaact cttcagcacc 841 aatgaggaaa gtcccgcttt cgtggagttc cttgaatttc ttggccagaa ggtcaaactg 901 caggacttta aggggttccg aggaggcctg gacgtgaccc acgggcagac ggggaccgaa 961 tctgtgtact gcaacttccg caacaaggag atcatgtttc acgtgtccac caagctgcca 1021 tacacggaag gggacgccca gcagttgcag cggaagcggc acatcgggaa cgacatcgtg 1081 gctgtggtct tccaggatga aaacactcct ttcgtgcccg acatgatcgc gtccaacttc 1141 ctgcatgcct acgtcgtggt gcaggctgag ggcgggggcc ctgatggccc cctctacaag 1201 gtctctgtca ctgcaagaga tgatgtgccc ttctttggac cccccctccc ggaccccgct 1261 gtgttcagga aggggcctga gttccaggaa tttttgctga caaagctgat caatgctgaa 1321 tatgcctgct acaaggcaga gaagtttgcc aaactggagg agcggacgcg ggccgccctc 1381 ctggagacgc tctatgagga actacacatc cacagccagt ccatgatggg cttgggcggc 1441 gacgaggaca agatggagaa tggcagtggg ggcggcggct tctttgagtc tttcaagcgg 1501 gtcatccgga gccgcagcca gtccatggat gccatggggc tgagcaacaa gaagcccaac 1561 accgtgtcca ccagccacag cgggagcttc gcgcccaaca accccgacct ggccaaggcg 1621 gctggaatat cactgattgt ccctgggaag agccccacga ggaagaagtc gggcccgttc 1681 ggctcccgcc gcagcagcgc cattggcatc gagaacatac aggaggtgca ggagaagagg 1741 gagagccctc cggctggtca gaagacccca gacagcgggc acgtctcaca ggagcccaag 1801 tcggagaact catccactca gagctcccca gagatgccca cgaccaagaa cagagcggag 1861 accgcagcgc agagagcaga ggcgctcaag gacttctccc gctcctcgtc cagtgccagc 1921 agcttcgcca gcgtggtgga ggagacggag ggtgtggacg gagaggacac aggcctggag 1981 agcgtgtcat cctcaggaac accccacaag cgggactcct tcatctatag cacgtggctg 2041 gaggacagtg tcagcaccac tagtgggggc agctccccag gcccctctcg atcaccccac 2101 ccagacgccg gcaagttggg ggaccctgcg tgtcccgaga tcaagatcca gctggaagca 2161 tctgagcagc acatgcccca gctgggctgt tagccgggcc accccctctg aaggtgaaac 2221 tgagcagatg aggccacaga agcacaaggg gaaggtgccg tgtcaagccc aggcagacga 2281 gacctctgcc ctgaagacca acaccagccc gtgggctgcc ccctgcctcc ccaccctccc 2341 catggcccac ccatctgggc tgtctctgca gggcagagcc gtccagacct gggatcaggg 2401 aagctgctgg catcgtcccc acccccagcc tgggggtctg cgctggggca gggattgctc 2461 agtggaagca ggactggggg tctggcttgc cccctccctg ggcctccatc acccctgagc 2521 atccctctgg actcagaggg aacaaggtgg gagagagagt ttgagacagc tccgtgtgga 2581 gagcttagcc cctggaggca gcacaaggag gatgtgatat gtgggggagt gagcactggg 2641 ttgggagccg ggtcctggtt tccaatttgg gttctgctgt gtgactctgg gcaagtcact 2701 ctccctctct gggcatgtct gctacaaatg gacaagatta tttcagaggt cactgaagac 2761 tgtgattaca tgcacctgcc ttagaaggta ggattttctt cccagggacc tcctatcacc 2821 ctaccctgct tcttgaggtc cctggagccc caggtgggct gaggggcagg gagccggctg 2881 tgcccagtat gcctcctgga ccctccagtt ctgccacagg tctgccgatg ccctgtccac 2941 tgcctacaca tgacagacaa gtaaccccct catgggggat ggggacctac ctggctcctc 3001 agccagcacc cagcttaacc cctgccatcc catgctgggc cctccaggcc aagagtctca 3061 gctggccgag agtccaggcc ttgcctcccc cgaccgccat ggagggggca gcccggcaca 3121 gctgctggga gcccttgtgt gtctggtcac actttttagg cgtcacgcca aaggccagcc 3181 tcctggcccc aatacccatt ttggaagccc ctgtggccgt gtggatgtcg gtaacagttg 3241 tataaaataa attctattta tcgctattgt // LOCUS HUMRASAB 612 bp mRNA PRI 15-JUN-1990 DEFINITION Human ras-like protein mRNA, complete cds, clone TC21. ACCESSION M31468 NID g190876 KEYWORDS ras-like protein. SOURCE Human teratocarcinoma cell line NTera2/D1, cDNA to mRNA, clone TC21. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 612) AUTHORS Drivas,G.T., Shih,A., Coutavas,E.E., Rush,M.G. and D'Eustachio,P. TITLE Characterization of four novel ras-like genes expressed in a human teratocarcinoma cell line JOURNAL Mol. Cell. Biol. 10, 1793-1798 (1990) MEDLINE 90205863 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.Drivas, 18-JAN-1990. FEATURES Location/Qualifiers source 1..612 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1..612 /note="ras-like protein" /codon_start=1 /db_xref="PID:g190877" /translation="MAAAAGGRLRQEKYRLVVVGGGGVGKSALTIQFIQSYFVTDYDP TIEDSYTKQCVIDDRAARLDILDTAGQEEFGAMREQYMRTGEGFLLVFSVTDRGSFEE IYKFQRQILRVKDRDEFPMILIGNKADLDHQRQVTQEEGQQLARQLKVTYMEASAKIR MNVDQAFHELVRVIRKFQEQECPPSPEPTRKEKDKKGCHCVIF" BASE COUNT 188 a 118 c 167 g 139 t ORIGIN 1 atggccgcgg cggctggcgg acggctccgg caggagaagt accggctcgt ggtggtcggc 61 gggggcggcg tgggcaagtc ggcgctcacc atccagttca tccagtccta ttttgtaacg 121 gattatgatc caaccattga agattcttac acaaagcagt gtgtgataga tgacagagca 181 gcccggctag atattttgga tacagcagga caagaagagt ttggagccat gagagaacag 241 tatatgagga ctggcgaagg cttcctgttg gtcttttcag tcacagatag aggcagtttt 301 gaagaaatct ataagtttca aagacagatt ctcagagtaa aggatcgtga tgagttccca 361 atgattttaa ttggtaataa agcagatctg gatcatcaaa gacaggtaac acaggaagaa 421 ggacaacagt tagcacggca gcttaaggta acatacatgg aggcatcagc aaagattagg 481 atgaatgtag atcaagcttt ccatgaactt gtccgggtta tcaggaaatt tcaagagcag 541 gaatgtcctc cttcaccaga accaacacgg aaagaaaaag acaagaaagg ctgccattgt 601 gtcattttct ag // LOCUS HUMRASAC 651 bp mRNA PRI 03-MAR-1992 DEFINITION Human ras-like protein mRNA, complete cds, clone TC4. ACCESSION M31469 NID g190878 KEYWORDS ras-like protein. SOURCE Human teratocarcinoma cell line NTera2/D1, cDNA to mRNA, clone TC4. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 651) AUTHORS Drivas,G.T., Shih,A., Coutavas,E.E., Rush,M.G. and D'Eustachio,P. TITLE Characterization of four novel ras-like genes expressed in a human teratocarcinoma cell line JOURNAL Mol. Cell. Biol. 10, 1793-1798 (1990) MEDLINE 90205863 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.Drivas, 18-JAN-1990. FEATURES Location/Qualifiers source 1..651 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1..651 /note="ras-like protein" /codon_start=1 /db_xref="PID:g190879" /translation="MAAQGEPQVQFKLVLVGDGGTGKTTFVKRHLTGEFEKKYVATLG VEVHPLVFHTNRGPIKFNVWDTAGQEKFGGLRDGYYIQAQCAIIMFDVTSRVTYKNVP NWHRDLVRVCENIPIVLCGNKVDIKDRKVKAKSIVFHRKKNLQYYDISAKSNYNFEKP FLWLARKLIGDPNLEFVAMPALAPPEVVMDPALAAQYEHDLEVAQTTALPDEDDDL" BASE COUNT 180 a 140 c 166 g 165 t ORIGIN 1 atggctgcgc agggagagcc ccaggtccag ttcaaacttg tattggttgg tgatggtggt 61 actggaaaaa cgaccttcgt gaaacgtcat ttgactggtg aatttgagaa gaagtatgta 121 gccaccttgg gtgttgaggt tcatccccta gtgttccaca ccaacagagg acctattaag 181 ttcaatgtat gggacacagc cggccaggag aaattcggtg gactgagaga tggctattat 241 atccaagccc agtgtgccat cataatgttt gatgtaacat cgagagttac ttacaagaat 301 gtgcctaact ggcatagaga tctggtacga gtgtgtgaaa acatccccat tgtgttgtgt 361 ggcaacaaag tggatattaa ggacaggaaa gtgaaggcga aatccattgt cttccaccga 421 aagaagaatc ttcagtacta cgacatttct gccaaaagta actacaactt tgaaaagccc 481 ttcctctggc ttgctaggaa gctcattgga gaccctaact tggaatttgt tgccatgcct 541 gctctcgccc caccagaagt tgtcatggac ccagctttgg cagcacagta tgagcacgac 601 ttagaggttg ctcagacaac tgctctcccg gatgaggatg atgacctgtg a // LOCUS HUMRASAD 642 bp mRNA PRI 15-JUN-1990 DEFINITION Human ras-like protein mRNA, complete cds, clone TC10. ACCESSION M31470 NID g190880 KEYWORDS ras-like protein. SOURCE Human teratocarcinoma cell line NTera2/D1, cDNA to mRNA, clone TC10. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 642) AUTHORS Drivas,G.T., Shih,A., Coutavas,E.E., Rush,M.G. and D'Eustachio,P. TITLE Characterization of four novel ras-like genes expressed in a human teratocarcinoma cell line JOURNAL Mol. Cell. Biol. 10, 1793-1798 (1990) MEDLINE 90205863 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.Drivas, 18-JAN-1990. FEATURES Location/Qualifiers source 1..642 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1..642 /note="ras-like protein" /codon_start=1 /db_xref="PID:g190881" /translation="MPGAGRSSMAHGPGALMLKCVVVGDGAVGKTCLLMSYANDAFPE EYVPTVFDHYAVSVTVGGKQYLLGLYDTAGQEDYDRLRPLSYPMTDVFLICFSVVNPA SFQNVKEEWVPELKEYAPNVPFLLIGTQIDLRDDPKTLARLNDMKEKPICVEQGQKLA KEIGACCYVECSALTQKGLKTVFDEAIIAILTPKKHTVKKRIGSRCINCCLIT" BASE COUNT 187 a 147 c 164 g 144 t ORIGIN 1 atgcccggag ccggccgcag cagcatggct cacgggcccg gcgcgctgat gctcaagtgc 61 gtggtggtcg gcgacggggc ggtgggcaag acgtgcctac tcatgagcta tgccaacgac 121 gccttcccgg aggagtacgt gcccaccgtc ttcgaccact acgcagtcag cgtcaccgtg 181 gggggcaagc agtacctcct aggactctat gacacggccg gacaggaaga ctatgaccgt 241 ctgaggcctt tatcttaccc aatgaccgat gtcttcctta tatgcttctc ggtggtaaat 301 ccagcctcat ttcaaaatgt gaaagaggag tgggtaccgg aacttaagga atacgcacca 361 aatgtaccct ttttattaat aggaactcag attgatctcc gagatgaccc caaaacttta 421 gcaagactga atgatatgaa agaaaaacct atatgtgtgg aacaaggaca gaaactagca 481 aaagagatag gagcatgctg ctatgtggaa tgttcagctt taacccagaa gggattgaag 541 actgtttttg atgaggctat catagccatt ttaactccaa agaaacacac tgtaaaaaaa 601 agaataggat caagatgtat aaactgttgt ttaattacgt ga // LOCUS HUMRASFAB 854 bp mRNA PRI 15-JUN-1989 DEFINITION Human RASF-A PLA2 mRNA, complete cds. ACCESSION M22430 J04704 NID g190888 KEYWORDS synovial phospholipase A-2; synovial phospholipase A-2-peak A. SOURCE Human, cDNA to mRNA, clone 4. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 854) AUTHORS Seilhamer,J.J., Pruzanski,W., Vadas,P., Plant,S., Miller,J.A., Kloss,J. and Johnson,L.K. TITLE Cloning and recombinant expression of phospholipase A2 present in rheumatoid arthritic synovial fluid JOURNAL J. Biol. Chem. 264, 5335-5338 (1989) MEDLINE 89174566 COMMENT Draft entry and computer-readable sequence [1] kindly submitted by J.J. Seilhamer 07-FEB-1989. FEATURES Location/Qualifiers source 1..854 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 136..570 /note="synovial phospholipase A-2 (EC 3.1.1.4)" /codon_start=1 /db_xref="PID:g190889" /translation="MKTLLLLAVIMIFGLLQAHGNLVNFHRMIKLTTGKEAALSYGFY GCHCGVGGRGSPKDATDRCCVTHDCCYKRLEKRGCGTKFLSYKFSNSGSRITCAKQDS CRSQLCECDKAAATCFARNKTTYNKKYQYYSNKHCRGSTPRC" BASE COUNT 233 a 242 c 196 g 183 t ORIGIN Unreported. 1 gaattcccaa ctctggagtc ctctgagaga gccaccaagg aggagcaggg gagcgacggc 61 cggggcagaa gttgagacca cccagcagag gagctaggcc agtccatctg catttgtcac 121 ccaagaactc ttaccatgaa gaccctccta ctgttggcag tgatcatgat ctttggccta 181 ctgcaggccc atgggaattt ggtgaatttc cacagaatga tcaagttgac gacaggaaag 241 gaagccgcac tcagttatgg cttctacggc tgccactgtg gcgtgggtgg cagaggatcc 301 cccaaggatg caacggatcg ctgctgtgtc actcatgact gttgctacaa acgtctggag 361 aaacgtggat gtggcaccaa atttctgagc tacaagttta gcaactcggg gagcagaatc 421 acctgtgcaa aacaggactc ctgcagaagt caactgtgtg agtgtgataa ggctgctgcc 481 acctgttttg ctagaaacaa gacgacctac aataaaaagt accagtacta ttccaataaa 541 cactgcagag ggagcacccc tcgttgctga gtcccctctt ccctggaaac cttccaccca 601 gtgctgaatt tccctctctc ataccctccc tccctaccct aaccaagttc cttggccatg 661 cagaaagcat ccctcaccca tcctagaggc caggcaggag cccttctata cccacccaga 721 atgagacatc cagcagattt ccagccttct actgctctcc tccacctcaa ctccgtgctt 781 aaccaaagaa gctgtactcc ggggggtctc ttctgaataa agcaattagc aaatcaaaaa 841 aaaaaaagga attc // LOCUS HUMRB1A 503 bp DNA PRI 05-JUL-1995 DEFINITION Homo sapiens (clone 104) retinoblastoma 1 gene, complete cds. ACCESSION M26460 NID g341501 KEYWORDS nuclear phosphoprotein; retinoblastoma 1; retinoblastoma protein. SOURCE Homo sapiens (tissue library: of T.Tomatsu, M.Hattori and Y.Sakaki) lymph node DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 503) AUTHORS Taya,Y., Watanabe,K. and Nishimura,S. TITLE Homology between a region of the human retinoblastoma gene and L1 family repetitive sequences JOURNAL Biochem. Biophys. Res. Commun. 160, 1061-1066 (1989) MEDLINE 89273558 FEATURES Location/Qualifiers source 1..503 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="lymph node" /tissue_lib="of T.Tomatsu, M.Hattori and Y.Sakaki" /map="13q14.2" gene 98..376 /gene="RB1" CDS 98..376 /gene="RB1" /codon_start=1 /db_xref="GDB:G00-118-734" /product="retinoblastoma 1" /db_xref="PID:g536845" /translation="MNRNEQSLQEIWDYVKRPHIHLIGVPESDRENGTKLENTLQDII QENFPNLARQANIQIQETQRTPHRYSSRRATPRHIIVRFTEVEMKKKF" BASE COUNT 204 a 93 c 108 g 98 t ORIGIN 1 ccttcaatag ccgattcgat caagtggaag aaagggtatc agtgattgaa gatcatatta 61 atgaaataaa gcaagagaca agattagaga aaaaagaatg aatagaaatg aacaaagcct 121 ccaagaaata tgggactatg tgaaaagacc acatatacat ttgattggtg taccggaaag 181 tgacagggag aatggaacca agttagaaaa cactcttcag gatattatcc aggagaactt 241 ccctaaccta gcaaggcagg ccaacattca aattcaggaa acacagagaa caccacacag 301 atactcctcg agaagagcaa ctccaagaca cataattgtc agattcactg aggttgaaat 361 gaagaaaaaa ttttaagggc atccagagag aaaggtcagg ttacccacaa agggaagtcc 421 atcagactaa tagtggatct ctcggcagaa accctacaag ccagaagaga gtgggggcca 481 atattcttca ttcttaaaca aaa // LOCUS HUMRBPB 2438 bp mRNA PRI 26-MAR-1996 DEFINITION Human (clone E5.1) RNA-binding protein mRNA, complete cds. ACCESSION L37368 NID g1236282 KEYWORDS RNA-binding protein. SOURCE Homo sapiens. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2438) AUTHORS Badolato,J., Gardiner,E., Morrison,N. and Eisman,J. TITLE Identification and characterisation of a novel human RNA-binding protein JOURNAL Gene 166 (2), 323-327 (1995) MEDLINE 96125212 FEATURES Location/Qualifiers source 1..2438 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="2 years old" /sex="female" /clone="E5.1" /clone_lib="Stratagene" /tissue_type="brain" CDS 550..1467 /note="putative" /codon_start=1 /product="RNA-binding protein" /db_xref="PID:g598231" /translation="MDLSGVKKKSLLGVKENNKKSSTRAPSPTKRKDRSDEKSKDRSK DKGATKESSEKDRGRDKTRKRRSASSGSSSTRSRSSSTSSSGSSTSTGSSSGSSSSSA SSRSGSSSTSRSSSSSSSSGSPSPSRRRHDNRRRSRSKSKPPKRDEKERKRRSPSPKP TKVHIGRLTRNVTKDHIMEIFSTYGKIKMIDMPVERMHPHLSKGYAYVEFENPDEAEK ALKHMDGGQIDGQEITATAVLAPWPRPPPRRFSPPRRMLPPPPMWRRSPPRMRRRSRS PRRRSPVRRRSRSPGRRRHRSRSSSNSSR" misc_binding 1035..1197 /note="RRM consensus binding site; contains RNP-1 and RNP-2 consensus sequences" /bound_moiety="RNA" CDS complement(1161..1697) /note="no distinctive protein motifs; ORF" /codon_start=1 /db_xref="PID:g598232" /translation="MSLSSQKTGGNLDRHTASKPTALLQPGPGWQRGAACCLQILPES RVCFPTGLPLARKVTKLSWVGYKLQGRELQWPVYREELELERLLWRRRPGDRDLRRTG DLRLGERDLLLIRGGDLRHIGGGGNILLGGLNLLGGGLGQGASTAVAVISWPSICPPS MCFSAFSASSGFSNSTYA" misc_feature 1173..1197 /note="consensus octapeptide found in the SR splicing protein family; RS-octapeptide" misc_signal 1359..1434 /note="nuclear localization signal" BASE COUNT 578 a 641 c 640 g 579 t ORIGIN 1 attccggagt ttctttgtgg gatgcggtgg aaggtcggcc gatcccccct tcggagtcat 61 taattcaaac gtgtacccgc gaacgtttcc cggactcccg tgtgcaatta tcggcgctag 121 gccgtgggga tacggcccac tatggaccga gcacggtggc cgagctcagt ttgatgggtt 181 accccagagg ggcggatacg ggaaggttaa gtttgttgga ggagaaacgt ggagtaggct 241 ttttgtcttg attccgcatg aactgtgcct gaaatgtact tttaaatggg gaaggtgctc 301 tgaagatttg agccgaaacg ccctctcctc gagatttaac taattgttct ctcctctctc 361 tggctgttgg acgcgcacct ttccggagga tgggggaggt aaccgaggtc ctgagccggt 421 acctgaactt gggctgctct ttggcgtaaa ttgcaatcga ttagggatcg tttctcagaa 481 tcaagttaga agtgagagtt cagataagtg aggccgccat tgctgctttg aacacctcag 541 aaggggagaa tggatttatc aggagtgaaa aagaagagct tgctaggagt caaagaaaat 601 aataaaaagt ccagcactag ggctccttca cctaccaaac gcaaagaccg ctcagatgag 661 aagtccaagg atcgctcaaa agataaaggg gccaccaagg agtcgagtga gaaggatcgc 721 ggccgggaca aaacccgaaa gaggcgcagc gcttccagtg gtagcagcag taccaggtct 781 cggtccagct cgacttccag ctcaggctcc agcaccagca ctggctcaag cagtggctcc 841 agctcttcct cagcatccag ccgctcagga agctccagca cctcccgcag ctccagctct 901 agcagctctt ctggctctcc aagtccttct cggcgcagac acgacaacag gaggcgctcc 961 cgctccaaat ccaaaccacc taaaagagat gaaaaggaga ggaaaaggcg gagcccatct 1021 cctaagccca ccaaagtgca cattgggaga ctcacccgga atgtgacaaa ggatcacatc 1081 atggagatat tttccaccta tgggaaaatt aaaatgattg acatgcccgt ggaaaggatg 1141 catccccatc tgtccaaagg ctatgcgtac gtagagtttg agaatccaga tgaagccgag 1201 aaggcgctga agcacatgga tggaggacaa attgatggcc aggagatcac tgccaccgcc 1261 gtgctggccc cctggcctag gccacccccc aggagattca gccctcccag gagaatgttg 1321 ccaccaccgc ctatgtggcg caggtctccc ccacggatga ggagaaggtc ccgctccccg 1381 aggcgcaggt cccccgtgcg ccggagatca cggtccccgg gccgccgccg ccacaggagc 1441 cgctccagct ccaactcctc ccgataaaca ggccactgaa gctctcgccc ctgtaactta 1501 taccccaccc agctcagttt tgtcactttt ctagccaaag gaagaccagt aggaaagcaa 1561 acccttgact ctggcaggat ttgcaggcag caggcagcac ccctctgcca gccgggcccc 1621 ggctgcagaa gtgctgttgg tttggatgct gtgtgcctgt caagattccc tccggttttc 1681 tggctagaaa ggctcatccg tttccggttt ctaagagtca gttcagtggc agagccacca 1741 gggaaaagtg aggctcttgg gggtggtttg accctcttac ctgggagcac acttttccct 1801 tccccgatga cctgggatgg tggccaggcc gtgcccttgc tgttgctggg cagtgtcctt 1861 ttggaaaggg agctgcccca ggctttagtg cagctgccaa ccctgttagg cctggcctct 1921 cgaggcctct tctaatctca agggtcacac cccctcaaag atcctctcac ccatggtagt 1981 tgctgctcgt ggttctgtct gtccgtgcac cgatgcacac accgcacccc accactgtac 2041 tctgaaattg gcgagtgagt ggagagccag ctctgcggag tcatcacgca gccatggtgt 2101 gcctgccgtt catggtggtc tttcaggtta tcttggcaac atgtacattg cttttatttt 2161 ttttcttttt tgctttcatt gtacagtcag tactataaaa tttctctttt gagttttata 2221 cctttgtagc attttagatg acattgtgtt tgtactttgt tgtgtagagt ggaagaattg 2281 tgttgaataa acccaagatc atctgagatg aatattaatt ttattctcat tttatagatg 2341 aggaaatggg agttttaaag acattcaata actttggcca aggtcattca gctattaaat 2401 tttaagacca taaaccaatt ggattcctgg aggaattc // LOCUS HUMRBPC 716 bp mRNA PRI 08-JAN-1995 DEFINITION Human cellular retinol-binding protein mRNA, complete cds. ACCESSION M11433 NID g190947 KEYWORDS . SOURCE Human liver, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 716) AUTHORS Colantuoni,V., Cortese,R., Nilsson,M., Lundvall,J., Bavik,C.O., Eriksson,U., Peterson,P.A. and Sundelin,J. TITLE Cloning and sequencing of a full length cDNA corresponding to human cellular retinol-binding protein JOURNAL Biochem. Biophys. Res. Commun. 130 (1), 431-439 (1985) MEDLINE 85279409 FEATURES Location/Qualifiers source 1..716 /organism="Homo sapiens" /db_xref="taxon:9606" /map="3q21-q22" gene 126..533 /gene="RBP1" CDS 126..533 /gene="RBP1" /codon_start=1 /db_xref="GDB:G00-120-340" /product="retinol-binding protein" /db_xref="PID:g190948" /translation="MPVDFTGYWKMLVNENFEEYLRALDVNVALRKIANLLKPDKEIV QDGDHMIIRTLSTFRNYIMDFQVGKEFEEDLTGIDDRKCMTTVSWDGDKLQCVQKGEK EGRGWTQWIEGDELHLEMRVEGVVCKQVFKKVQ" BASE COUNT 174 a 185 c 213 g 144 t ORIGIN Chromosome 3. 1 gggggggggc ggagggcgct catttccggg ccgcccacca cccgcgtagc accggcagcc 61 gctgtcccgg cagtctccag ccgtcccgcc cgcttgtggc caaactggct ccagtcactc 121 ccgaaatgcc agtcgacttc actgggtact ggaagatgtt ggtcaacgag aatttcgagg 181 agtacctgcg cgccctcgac gtcaatgtgg ccttgcgcaa aatcgccaac ttgctgaagc 241 cagacaaaga gatcgtgcag gacggtgacc atatgatcat ccgcacgctg agcactttta 301 ggaactacat catggacttc caagttggga aggagtttga ggaggatctg acaggcatag 361 atgaccgcaa gtgcatgaca acagtgagct gggacggaga caagctccag tgtgtgcaga 421 agggtgagaa ggaggggcgt ggctggaccc agtggatcga gggtgatgag ctgcacctag 481 agatgagagt ggaaggtgtg gtctgcaagc aagtattcaa gaaggtgcag tgaggcccaa 541 gcagacaacc ttgtcccaac caatcagcag gatgtgtgag ccaggatccc tctttgcaca 601 gcatgaggca aaaatgtcca gccaccccta ggcatctgtt agcagagtct gtctcttggc 661 tttgtcactt ttccttttct taaaacaaag ccatgccaat aaagtgacct gtgttc // LOCUS HUMRCK 4153 bp mRNA PRI 21-APR-1994 DEFINITION Human mRNA for RCK, complete cds. ACCESSION D17532 NID g402515 KEYWORDS ATP-binding capacity; DEAD box family; RCK; translation initiation factor-related; translocation associated gene. SOURCE Homo sapiens (individual_isolate Japanase) lung small cell cancer cell line SCLC-SA cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4153) AUTHORS Akao,Y., Seto,M., Yamamoto,K., Iida,S., Nakazawa,S., Inazawa,J., Abe,T., Takahashi,T. and Ueda,R. TITLE The RCK gene associated with t(11;14) translocation is distinct from the MLL/ALL-1 gene with t(4;11) and t(11;19) translocations JOURNAL Cancer Res. 52 (21), 6083-6087 (1992) MEDLINE 93008012 REFERENCE 2 (bases 1 to 4153) AUTHORS Seto,M. TITLE Direct Submission JOURNAL Submitted (02-SEP-1993) to the DDBJ/EMBL/GenBank databases. Masao Seto, Aichi Cancer Center Research Institute, Department of Experimental Radiology; 1-1 Kanokoden, Chikusa-ku, Nagoya, Aichi 464, Japan (E-mail:H44713u@nucc.cc.nagoya-u.ac.jp, Tel:052-762-6111, Fax:052-763-5233) COMMENT Submitted (02-Sep-1993) to DDBJ by: Masao Seto Aichi Cancer Center Research Institute 1-1 Kanokoden, Chikusa-ku Nagoya 464 Japan Phone: 052-762-6111 Fax: 052-763-5233. FEATURES Location/Qualifiers source 1..4153 /organism="Homo sapiens" /isolate="Japanese" /db_xref="taxon:9606" /cell_line="SCLC-SA" /cell_type="small cell cancer" /tissue_type="lung" gene 340..1758 /gene="RCK" CDS 340..1758 /gene="RCK" /codon_start=1 /product="RCK" /db_xref="PID:d1005007" /db_xref="PID:g458727" /translation="MGLSSQNGQLRGPVKPTGGPGGGGTQTQQQMNQLKNTNTINNGT QQQAQSMTTTIKPGDDWKKTLKLPPKDLRIKTSDVTSTKGNEFEDYCLKRELLMGIFE MGWEKPSPIQEESIPIALSGRDILARAKNGTGKSGAYLIPLLERLDLKKDNIQAMVIV PTRELALQVSQICIQVSKHMGGAKVMATTGGTNLRDDIMRLDDTVHVVIATPGRILDL IKKGVAKVDHVQMIVLDEADKLLSQDFVQIMEDIILTLPKNRQILLYSATFPLSVQKF MNSHLQKPYEINLMEELTLKGVTQYYAYVTERQKVHCLNTLFSRLQINQSIIFCNSSQ RVELLAKKISQLGYSCFYIHAKMRQEHRNRVFHDFRNGLCRNLVCTDLFTRGIDIQAV NVVINFDFPKLAETYLHRIGRSGRFGHLGLAINLITYDDRFNLKSIEEQLGTEIKPIP SNIDKSLYVAEYHSEPVEDEKP" BASE COUNT 1250 a 800 c 879 g 1223 t 1 others ORIGIN Chromosome 11q23. 1 gcgacttcgg cggcgccacg agagcgggca gcggaggaga ttgacgtgag tgaattcaga 61 tataactcaa gcttgttaga gggcttttta aaaataaaaa gttgattccg tgcaagagag 121 caagttactg ctgcttaccg ttcagagact tacaggtgct tgcctgcatt gcaataaagg 181 actcatttat tgagcaagac ttatatttat ctcttcattt tggagagcct aataaactgt 241 tattacagtt tctctactga ctttcaaaag ttttgaagtt tgaaagacct ttgcaattaa 301 aacagcatga gcacggccag aacagagaac cctgttataa tgggtctgtc cagtcaaaat 361 ggtcagctga gaggccctgt gaaacccact ggtggccctg gaggaggggg cacacagaca 421 cagcaacaga tgaaccagct gaaaaacacc aacacaatca ataatggcac tcagcagcaa 481 gcacagagta tgaccaccac tattaaacct ggtgatgact ggaaaaagac tttaaaactc 541 cctccaaagg atctaagaat caaaacttcg gatgtgacct ccacaaaagg aaatgagttt 601 gaagattact gtttgaaacg ggagttactg atgggaattt ttgaaatggg ctgggaaaag 661 ccatctccta ttcaggagga gagcattccc attgctttat ctggtaggga tatcttagct 721 agagcaaaaa atggaacagg caagagcggt gcctacctca ttcccttact tgaacggcta 781 gacctgaaga aggacaatat acaagcaatg gtgattgttc ccactagaga acttgctcta 841 caggtcagtc aaatttgcat ccaggtcagc aaacacatgg gaggggccaa agtgatggca 901 accacaggag gaaccaattt acgagatgac ataatgaggc ttgatgatac agtgcacgtg 961 gtgattgcta cccctgggag aatcctggat cttattaaga aaggagtagc aaaggttgat 1021 catgtccaga tgatagtatt ggatgaggca gataagttgc tgtcacagga ttttgtgcag 1081 ataatggagg atattattct cacgctacct aaaaacaggc agattttact atattccgct 1141 actttccctc ttagtgtaca gaagttcatg aattcccatt tgcagaaacc ctatgagatt 1201 aacctgatgg aggaactaac tctgaaggga gtaacccagt actacgcata tgtaactgag 1261 cgccaaaaag tacactgcct caacacactt ttctccaggc ttcagataaa ccagtcgatc 1321 attttctgta actcctctca gcgagttgaa ttgctagcca agaagatttc tcaactgggt 1381 tattcttgct tctatattca tgctaaaatg aggcaggaac atcgaaatcg tgtatttcat 1441 gatttccgaa atggcttatg ccgcaatctt gtttgcactg atctgtttac ccgaggtatt 1501 gatatacaag ctgtgaatgt ggtaataaac tttgatttcc caaagctggc agagacctat 1561 ctccatcgta ttggaagatc aggtcgcttt ggtcatcttg gcttagccat caacttgatc 1621 acatatgatg atcgcttcaa cctgaaaagt attgaggagc agctgggaac agaaattaaa 1681 cctattccga gcaacattga taagagcctg tatgtggcag aataccacag cgagcctgta 1741 gaagatgaga aaccttaaca agcatgcttt gacaaattac aaaaggctcg tttggatctg 1801 tgacacatcg ttttgggggg aatgctcttc tctttgtggg tttttcatct tttattttgg 1861 aactatgaag acttagagct cagacatttt ctttttttaa ctggtgaaga gaaaaaggct 1921 gaaaagaagg aatatacctt ttttgttcca cttgtttgca ctgtgtgctg actgaacatt 1981 agttgcacta actgctggtt tttaaaaaat gttttctggg gaaaggggac aaggaaggaa 2041 aagaaagaag agaaggggag aaaccctaaa aagagaagaa tcttaatgaa cacacaagct 2101 tgtcaatgat ttcaaaattc tccaacagct gactctcgtg catttcaact tctccctgat 2161 tcctcatccg ttttttaagc ctgaagagct tattacttat ttgtgcgaag tgccttatgc 2221 tatgagacca ttcagaatat catcttttag acacagcccg aggaatcaac aatagtaact 2281 ctttctttcc tttttttttc tttttctttt aaaaaatgtc tttttatttt ggtttcaggt 2341 tgaagtctct tccctttcta cccagtactc gagcccaggg ctagaagttg aaactcacta 2401 gtaatgttaa acaccatttt ttttttcttt ttggggagga gttgatatgc aactgcagtt 2461 catccgcact gtaaatacat gtatttaaaa aaacaatccc aagtaaaaat ttcttctggg 2521 ctgagtagat aaaaaacatc atcgctccca aaggaaagag cagtctatca ttgcaggagc 2581 catatgacaa gcctttgtgc tctatagcag acactaaaga ctgggttaca tatgcctcca 2641 gtagtagtat ggcacttgat gtgtagacat gtcagagcct tggccctctt tcctctgtgg 2701 caaagcgtgt cccatagaaa attgggtgtg tatacttgta taaactttgt aaataagttt 2761 tttttctggg ttcatatata tatatatata tacatatata tatatttttt ttttggatga 2821 aggttgctgg gattaagggg attagagtga ttatgggagc agctaaagat gagaggggct 2881 cagttttacg caacactaaa ttctagaaag tactttggcc tcgtgctgta gagagcagat 2941 ttctatggta ccctgtgtta gtaaaggggc ccagaaatct gggatgtact gtttgctgcc 3001 acactgtctc atctagtacc tttggagtag gtttaccaga gagagcaagg aagcttcaaa 3061 acattgataa ttcaagattt attttgagaa gctctgattt tgcttcttcc ccctttctaa 3121 aagtttgagg aatatttcaa gctctgcaac agggggcaaa gattaatcta ccttgcagtg 3181 taggaattct attgagtggc agtgtcattg agcagtatat atataaagca cagatttgca 3241 tttcagaata ttagccagta ccagctttgg taatgttagc agttctggag cttaattttc 3301 tgtggatcat ttcctgtagt gtgtaaatgt gttgccctct gcccgctttg atacataaac 3361 ttttgcagga atgggcaacc tgagagctgt taactttcat gctacagaaa gctgcttgcc 3421 attctcttgc attgtgacaa gaattgttac ctgtcatttt gcactgtaaa ttgctggcag 3481 atgctttaca gtcaatagtg ttgctttaaa ttgtgcccct cccaacatgc ttgatgtttg 3541 gcctgatctc caggcaaaag gagtgagatg aatcaaaacc agtgaacttt tttttttaat 3601 gtttttaatt cccttttaac ccagtgtact aggtcaatca ggaggcatct ggaaaggggg 3661 ggaaaaaagc aaaaaacaaa attaaaaaaa attgatttcc atattttttc aaaaccctaa 3721 aatattacaa aataagtccc gcatatactt taatgtttta actcttttgg acaaaggaat 3781 caattacttg aagttgcttt tttgactctg cctacttctg agcaaactat ttgcgactca 3841 cccacaattc cttggagatc aaaatcctgc agatgcctcc tgatgggaca tagccctaac 3901 tccttaacaa ctgtagcaag aactgcacgg tacaaaccca aaaaagaaca agctcaycta 3961 tttggaaccc aagcctcata ttttctgtcc agtcaatgtt ggttactaaa tagaaaacct 4021 aattaaggaa gtaacttttt tccagggagg ggtgggagag ggaatttaaa aggtgtcaac 4081 ttttgaccaa aaaaattgtg cccttgtacc gatgcagctc tctttctccc ccacctcttg 4141 cgagataaga ggg // LOCUS HUMRCN 2104 bp mRNA PRI 14-MAY-1996 DEFINITION Human mRNA for reticulocalbin, complete cds. ACCESSION D42073 NID g1262328 KEYWORDS rcn; calcium binding protein; ER-hand protein; ER-resident protein; reticulocalbin. SOURCE Homo sapiens transitional carcinoma cell cell_line:BOY cDNA to mRNA, clone:hR12 and hR19. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2104) AUTHORS Ozawa,M. TITLE Cloning of a human homologue of mouse reticulocalbin reveals conservation of structural domains in the novel endoplasmic reticulum resident Ca(2+)-binding protein with multiple EF-hand motifs JOURNAL J. Biochem. 117 (5), 1113-1119 (1995) MEDLINE 96172582 REFERENCE 2 (bases 1 to 2104) AUTHORS Ozawa,M. TITLE Direct Submission JOURNAL Submitted (10-NOV-1994) to the DDBJ/EMBL/GenBank databases. Masayuki Ozawa, Faculty of Medicine, Kagoshima University, Department of Biochemistry; 8-35-1 Sakuragaoka, Kagoshima, Kagoshima 890, Japan (Tel:0992-75-5246, Fax:0992-64-5618) FEATURES Location/Qualifiers source 1..2104 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="BOY" /cell_type="transitional carcinoma cell" /clone="hR12 and hR19" gene 53..1048 /gene="rcn" CDS 53..1048 /gene="rcn" /codon_start=1 /product="reticulocalbin" /db_xref="PID:d1008252" /db_xref="PID:g1262329" /translation="MARGGRGRRLGLALGLLLALVLAPRVLRAKPTVRKERVVRPDSE LGERPPEDNQSFQYDHEAFLGKEDSKTFDQLTPDESKERLGKIVDRIDNDGDGFVTTE ELKTWIKRVQKRYIFDNVAKVWKDYDRDKDDKISWEEYKQATYGYYLGNPAEFHDSSD HHTFKKMLPRDERRFKAADLNGDLTATREEFTAFLHPEEFEHMKEIVVLETLEDIDKN GDGFVDQDEYIADMFSHEENGPEPDWVLSEREQFNEFRDLNKDGKLDKDEIRHWILPQ DYDHAQAEARHLVYESDKNKDEKLTKEEILENWNMFVGSQATNYGEDLTKNHDEL" sig_peptide 53..139 /gene="rcn" polyA_signal 2085..2090 polyA_site 2104 BASE COUNT 617 a 424 c 530 g 533 t ORIGIN 1 gctgttgtcg ctcgctcagc gtctccctct cggccgccct ctcctcggga cgatggcgcg 61 cggtggccgc ggccgccgcc tggggttagc cctggggctg ctgctggcgc tggtgctggc 121 gccgcgggtt ctgcgggcca agcccacggt gcgcaaagag cgcgtggtgc ggcccgactc 181 ggagctgggc gagcggcccc ctgaggacaa ccagagcttc cagtacgacc acgaggcctt 241 cctgggcaag gaggactcca agaccttcga ccagctcacc ccggacgaga gcaaggagag 301 gctagggaag attgttgatc gaatcgacaa tgatggggat ggctttgtca ctactgagga 361 gctgaaaacc tggatcaaac gggtgcagaa aagatacatc tttgataatg tcgccaaagt 421 ctggaaggat tatgataggg acaaggatga taaaatttcc tgggaagaat acaaacaagc 481 cacctatggt tactacctag gaaaccccgc agagtttcat gattcttcag atcatcacac 541 ctttaaaaag atgctgccac gtgatgagag aagattcaaa gctgcagacc tcaatggtga 601 cctgacagct actcgggagg agttcactgc ctttctgcat cctgaagagt ttgaacatat 661 gaaggaaatt gtggttttgg aaaccctgga ggacatcgac aagaacgggg atgggtttgt 721 ggatcaggat gagtatattg cggatatgtt ttcccatgag gagaatggcc ctgagccaga 781 ctgggtttta tcagaacggg agcagtttaa cgaattccgg gatctgaaca aggacgggaa 841 gttagacaaa gatgagattc gccactggat cctccctcaa gattatgatc acgcacaggc 901 tgaggccagg catctggtat atgaatcaga caaaaacaag gatgagaagc taactaaaga 961 ggaaatattg gagaactgga acatgtttgt tggaagccaa gctaccaatt acggggaaga 1021 tctcacaaaa aatcatgatg agctttgata gacactcacc agaatatggc agactgtcat 1081 aggcattctg ttattgtctt ggattgttgc tacaattgtc taattacagc agttgtgatc 1141 ccacaaaaag caagtttata cctcagattg gggtataaaa attgtttttc gctcagtatt 1201 tactggaaaa tggacatcac tagtctttca gtaagatttc tctcaaaaca cgtgaaaacc 1261 ttggtaaatt gcaattcttt ctggggatat attggtacaa catgacttaa aacttttttt 1321 tttctattaa aacttaaagg ggaacaaaac ttgaaaaagc cctgttcttc agaaggtgag 1381 tgggttgagg gaggcagtaa tatgaagtga ctgctgtgta ttttaactac cagattttta 1441 tatttgccac tgttagatag ttggaaaggg gaaattctgt ttaagcgaaa gtggtatcat 1501 cctaggtaag cttatttcag aacaagtcta atatatcaga ttctttcttt tcgactttat 1561 actctgagtt attacttact gtaagtggtg tatatgaaac ctccatgcat tttccagtat 1621 ggatctgcta atatgcacag taaatccatg tctttgtttg tttttctatt aagaagcaat 1681 caagaaagat aatgtgaaaa agaaaggaat ttagaggtag ggaaaagatg aatgtcagac 1741 atgtgaagaa ctatagtaaa aatgataaac cacctaaata tccttgaacc taccttaaaa 1801 tgccaatgag gtaggcctga tctttgaata gtggctagga tacaatgcat ttcctcagtg 1861 atcactgatt agaatgagtt ggtgggatcc ttgggaagcc aaacggagcg gggttctgga 1921 tcatgtccca tccagtccag tgaatccccg acccgcagac ctgccccccc cgcaacagct 1981 tataccatgg aatgaggaca aggtgatact ctgagctgtg gactgaactg gcagacacaa 2041 cctgtacaga ttgaaatttc accttgtaag gaggaagtga atgaaataaa ggatccccct 2101 aagg // LOCUS HUMRDS 2985 bp mRNA PRI 08-JAN-1995 DEFINITION Human retinal degeneration slow mRNA, complete cds. ACCESSION M73531 NID g190975 KEYWORDS retinal degeneration slow protein. SOURCE Homo sapiens adult retina cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2985) AUTHORS Travis,G.H., Brennan,M.B., Danielson,P.E., Kozak,C.A. and Sutcliffe,J.G. TITLE Identification of a photoreceptor-specific mRNA encoded by the gene responsible for retinal degeneration slow (rds) JOURNAL Nature 338 (6210), 70-73 (1989) MEDLINE 89143767 REFERENCE 2 (bases 1 to 2985) AUTHORS Travis,G.H., Christerson,L., Danielson,P.E., Klisak,I., Sparkes,R.S., Hahn,L.B., Dryja,T.P. and Sutcliffe,J.G. TITLE The human retinal degeneration slow (RDS) gene: chromosome assignment and structure of the mRNA JOURNAL Genomics 10 (3), 733-739 (1991) MEDLINE 91365382 FEATURES Location/Qualifiers source 1..2985 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="retina" /map="6p21.2-cen" gene 249..1289 /gene="RDS" CDS 249..1289 /gene="RDS" /codon_start=1 /db_xref="GDB:G00-118-863" /product="retinal degeneration slow protein" /db_xref="PID:g190976" /translation="MALLKVKFDQKKRVKLAQGLWLMNWFSVLAGIIIFSLGLFLKIE LRKRSDVMNNSESHFVPNSLIGMGVLSCVFNSLAGKICYDALDPAKYARWKPWLKPYL AICVLFNIILFLVALCCFLLRGSLENTLGQGLKNGMKYYRDTDTPGRCFMKKTIDMLQ IEFKCCGNNGFRDWFEIQWISNRYLDFSSKEVKDRIKSNVDGRYLVDGVPFSCCNPSS PRPCIQYQITNNSAHYSYDHQTEELNLWVRGCRAALLSYYSSLMNSMGVVTLLIWLFE VTITIGLRYLQTSLDGVSNPEESESESEGWLLEKSVPETWKAFLESVKKLGKGNQVEA EGAGAGQAPEAG" BASE COUNT 723 a 771 c 779 g 712 t ORIGIN Chromosome VI. 1 gaattcttca gcgcccagga ccaggactat cccctgctca agctgtgatt ccgagacccc 61 tgccaccact actgcattca cggggatccc aggctagtgg gactcgacat gggtagcccc 121 cagggcagct ccctacagct tgggccatct gcacttttcc caaggcccta agtctccgcc 181 tctgggctcg ttaaggtttg gggtgggagc tgtgctgtgg gaagcaaccc ggactacact 241 tggcaagcat ggcgctactg aaagtcaagt ttgaccagaa gaagcgggtc aagttggccc 301 aagggctctg gctcatgaac tggttctccg tgttggctgg catcatcatc ttcagcctag 361 gactgttcct gaagattgaa ctccgaaaga ggagcgatgt gatgaataat tctgagagcc 421 attttgtgcc caactcattg atagggatgg gggtgctatc ctgtgtcttc aactcgctgg 481 ctgggaagat ctgctacgac gccctggacc cagccaagta tgccagatgg aagccctggc 541 tgaagccgta cctggctatc tgtgtcctct tcaacatcat cctcttcctt gtggctctct 601 gctgctttct gcttcggggc tcgctggaga acaccctggg ccaagggctc aagaacggca 661 tgaagtacta ccgggacaca gacacccctg gcaggtgttt catgaagaag accatcgaca 721 tgctgcagat cgagttcaaa tgctgcggca acaacggttt tcgggactgg tttgagattc 781 agtggatcag caatcgctac ctggactttt cctccaaaga agtcaaagat cgaatcaaga 841 gcaacgtgga tgggcggtac ctggtggacg gcgtcccttt cagctgctgc aatcctagct 901 cgccacggcc ctgcatccag tatcagatca ccaacaactc agcacactac agttacgacc 961 accagacgga ggagctcaac ctgtgggtgc gtggctgcag ggctgccctg ctgagctact 1021 acagcagcct catgaactcc atgggtgtcg tcacgctcct catttggctc ttcgaggtga 1081 ccattacaat tgggctgcgc tacctacaga cgtcgctgga tggtgtgtcc aaccccgagg 1141 aatctgagag cgagagcgag ggctggctgc tggagaagag cgtgccggag acctggaagg 1201 cctttctgga gagtgtgaag aagctgggca agggcaacca ggtggaagcc gagggcgcag 1261 gcgcaggcca ggccccagag gctggctgag ggccctgggg cccctcccct cccgaacact 1321 gagaaatagt gcactccaag aaacgtggat ctccccctca tccaactccg aaagtctgaa 1381 tctcccaagg agggcaccat cttacagaga ctctccctga cggtggaatt taagtttagg 1441 gtccctaaaa gcatttgaca cacagttgtt gaatgactga cccaaaatgt gaatgaagct 1501 aatgtgaatg tgagtgaagc tcccttcagg cccgctgccc taggatatgc cctcctggtg 1561 actcgggggc tgtctcagac gactagccca ggacccatct ttctcacacg gatttagtcc 1621 caccctatgg ccactggccg tatctgaggg ctgctcccct tttagaattt acctcttatg 1681 agctccatgt tgcttcactc tatccaaagt gtcacttggt gcataagcac agaaatctga 1741 aaaatggcca tgttgtcttt tttttttttt ttttttaatg ccaagattga caggttggcc 1801 gtttgcttaa tgccagaagt tgggggaaag ttacactttt ctaagaataa tggactctta 1861 aggcattgag ggctctaaac aggattcttt aatcatggag caagagaatt tcaaggcagg 1921 ggattttatc ccccaccaaa aacacagtga aaggcctgct tttgtgtccc attcacatgc 1981 cctcggtcac tgagtctgga gtgaaccacg ggttgaggaa gtcaggctgt tggcgtgtcc 2041 cagcaccaca ccatccctaa agtgccaggt gatctcctgt ggctcatcgg tggaagcagt 2101 ggggtaggct gctgccctgc tgtggaagag gagcaacaat cagacatgag tccacccttt 2161 ggagaccaag cctcagctct tggtgggcca agggacaccc acacaggtgg ccatcacagc 2221 cccatggaca acactaattg tccacagcaa agggcaagga atcctctggg agcttcttcc 2281 gtttcttccc ccaagatacc catcttgaaa aacactattt ctggaatgct tctgcatcaa 2341 aggagattct ttgagatagc ccatcttcct gagctagcaa atacaggagt tttcactttc 2401 tttaggaaag agaagctttc aggggaagga gagaatgatt ttgctgactt cccaagccct 2461 ggtgaccaga ccaaggcagg gcccagcata attcctccag ttggatgaac attcaagaga 2521 gctcgttcct acctggctgg agaccgaggc cagaaggcaa aaaccagaaa gggaacagtc 2581 cataacttac ctctgcttct gaccgatggt gtttgggaat aggttacttt ggactgagtt 2641 tgggttcttt gctgtcctaa gaactttagt gtagagaaaa taagacttct ggtgctgctg 2701 gggtatgttc tgggcttaat tcccccaagc agaagaccag atccaagatg tttggacacc 2761 ctgtcagacg ttggtcccaa gtttaattag atttctgaat ctcgttgagg ccaaggaatg 2821 atccatactg aaaaaatgct gagccagcca tctttggcaa aggtccctga gctcttgcta 2881 tctctcaaga gtgctgagaa ccacggtgaa agtgctgctc taggcccaca agtgtaacta 2941 tgctgttaac agctgtcaat agataattaa aattcatact gtatg // LOCUS HUMREBP 1300 bp mRNA PRI 17-APR-1992 DEFINITION Human mRNA for renin-binding protein, complete cds. ACCESSION D10232 D01085 NID g220052 KEYWORDS leucine zipper; renin; renin-binding protein. SOURCE Human Wilms' tumor cell line G-401 derived from 3-month old male caucasian (ATCC CRL 1441), cDNA to mRNA, clone lambda HRB6. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1300) AUTHORS Inoue,H., Takahashi,S., Fukui,K. and Miyake,Y. TITLE Genetic and molecular properties of human and rat renin-binding proteins with reference to the function of the leucine zipper motif JOURNAL J. Biochem. 110 (4), 493-500 (1991) MEDLINE 92138649 COMMENT Data kindly submitted in computer readable form by: Hiroyasu Inoue Department of Biochemistry National Cardiovascualr Center Research Institute 5-7-1 Fujishiro-dai Suita Osaka 565 Japan Phone: 06-833-5012 x2458. FEATURES Location/Qualifiers source 1..1300 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 31..1284 /codon_start=1 /product="renin-binding protein" /db_xref="PID:d1001551" /db_xref="PID:g220053" /translation="MEKERETLQAWKERVGQELDRVVAFWMEHSHDQEHGGFFTCLGR EGRVYDDLKYVWLQGRQVWMYCRLYRTFERFRHAQLLDAAKAGGEFLLRYARVAPPGK KCAFVLTRDGRPVKVQRTIFSECFYTMAMNELWRATGEVRYQTEAVEMMDQIVHWVQE DASGLGRPQLQGAPAAEPMAVPMMLLNLVEQLGEADEELAGKYAELGDWCARRILQHV QRDGQAVLENVSEGGKELPGCLGRQQNPGHTLEAGWFLLRHCIRKGDPELRAHVIDKF LLLPFHSGWDPDHGGLFYFQDADNFCPTQLEWAMKLWWPHSEAMIAFLMGYSDSGDPV LLRLFYQVAEYTFRQFRDPEYGEWFGYLSREGKVALSIKGGPFKGCFHVPRCLAMCEE MLGALLSRPAPAPSPAPTPACRGAE" polyA_signal 1280..1285 polyA_site 1300 BASE COUNT 241 a 372 c 444 g 243 t ORIGIN 1 cggggcaagg gtctcccagc gcgacaggac atggagaaag agcgagagac tctgcaggcc 61 tggaaggagc gcgtggggca ggagctggac cgcgtggtgg ctttctggat ggagcactcc 121 cacgaccagg agcacggggg cttcttcacg tgccttggcc gcgaggggcg ggtgtatgat 181 gacctcaagt atgtgtggct gcaggggagg caggtatgga tgtattgtcg cctgtaccgc 241 actttcgagc gcttccgcca tgctcagctt ctggacgcag caaaagcagg tggtgagttc 301 ttgctgcggt atgcccgggt ggcacctcct ggcaagaagt gtgcctttgt gctgactcgg 361 gacggccgcc cggtcaaggt gcagcgaacc atcttcagtg agtgtttcta caccatggcc 421 atgaacgagc tgtggagagc cacaggggaa gtgcggtacc agacggaagc ggtggagatg 481 atggatcaga tcgtccactg ggtgcaggag gacgcgtcgg gactgggccg gccccagctc 541 cagggggccc cggctgcgga gcccatggcg gtgcccatga tgctactgaa cctggtggag 601 cagctcgggg aggcagatga ggagctggcg ggcaaatacg cagagctggg ggactggtgc 661 gcccggagga ttctgcagca cgtgcagagg gatggacaag ctgtgctgga gaatgtgtca 721 gagggtggca aggaacttcc tggctgcctg gggagacagc agaacccagg ccacacgctg 781 gaagccggct ggtttctgct ccgtcattgc attcggaaag gcgaccccga acttcgagcc 841 cacgtgattg acaagttcct attgttgccc ttccactccg gatgggaccc tgaccacgga 901 ggcctctttt acttccagga tgctgataac ttctgcccca cccagctgga gtgggccatg 961 aagctctggt ggccacacag tgaagccatg attgccttcc tcatgggtta cagtgacagt 1021 ggggaccctg tgctgctgcg cctcttctac caagtggctg agtacacctt ccgccagttt 1081 cgcgatcccg agtacgggga atggtttggc tacctgagcc gagagggcaa ggtggccctc 1141 tccatcaagg gaggtccttt caaaggctgc ttccacgtgc cgcggtgcct agccatgtgc 1201 gaggagatgc tgggcgccct gctgagccgc cccgcccccg ccccctcccc cgcccccacc 1261 cccgcctgcc gaggcgcgga ataaaggctg agtccgctcc // LOCUS HUMRECQ 2925 bp mRNA PRI 08-JAN-1995 DEFINITION Homo sapiens (clone 1311) DNA helicase (RECQL) mRNA, complete cds. ACCESSION L36140 NID g619862 KEYWORDS DNA helicase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2925) AUTHORS Puranam,K.L. and Blackshear,P.J. TITLE Cloning and characterization of RECQL, a potential human homologue of the E. coli DNA helicase RecQ JOURNAL J. Biol. Chem. 269, 29838-29845 (1994) MEDLINE 95050841 FEATURES Location/Qualifiers source 1..2925 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /clone="1311" gene 528..2507 /gene="RECQL" CDS 528..2507 /gene="RECQL" /note="homologue of E. Coli DNA helicase reqQ" /codon_start=1 /product="DNA helicase" /db_xref="PID:g619863" /translation="MASVSALTEELDSITSELHAVEIQIQELTERQQELIQKKKVLTK KIKQCLEDSDAGASNEYDSSPAAWNKEDFPWSGKVKDILQNVFKLEKFRPLQLETINV TMAGKEVFLVMPTGGGKSLCYQLPALCSDGFTLVICPLISLMEDQLMVLKQLGISATM LNASSSKEHVKWVHDEMVNKNSELKLIYVTPEKIAKSKMFMSRLEKAYEARRFTRIAV DEVHCCSQWGHDFRPDYKALGILKRQFPNASLIGLTATATNHVLTDAQKILCIEKCFT FTASFNRPNLYYEVRQKPSNTEDFIEDIVKLINGRYKGQSGIIYCFSQKDSEQVTVSL QNLGIHAGAYHANLEPEDKTTVHRKWSANEIQVVVATVAFGMGIDKPDVRFVIHHSMS KSMENYYQESGRAGRDDMKADCILYYGFGDIFRISSMVVMENVGQQKLYEMVSYCQNI SKSRRVLMAQHFDEVWNSEACNKMCDNCCKDSAFERTNITEYCRDLIKILKQAEELNE KLTPLKLIDSWMGKGAAKLRVAGVVAPTLPREDLEKIIAHFLIQQYLKEDYSFTAYAA ISYLKIGPKANLLNNEAHAITMQVTKSTQNSFRAESSQTCHSEQGDKKNGGKKIQATS RRRLQTCFSNLVLRIQELRKEKSMMPDMNVTKFSN" BASE COUNT 934 a 486 c 641 g 864 t ORIGIN 1 cttttttttt tttttttttt tttttataag attattagta taaaatttta gataggtagg 61 agtagcgaaa agatctgctc gaggcctggg tgctttggtg tcggagatcc gagagtcgga 121 gatcggagag tcggacacag gacagtcgga caccggacag tcaaacaccg gagagttaga 181 ctgggcttct cggtggggac aggctctggg ataactactg ttacagcttt gaagggtcaa 241 gggtgtgcgc tttttctttc atccttccct ttcctgctgc aggcgaggcc ggtctgatgc 301 ggatcacttc ctttcgccca cacattggcg gaggagaaac cggaaagtta atcactgccc 361 tgctctgaga actcgggcct ttaggggcac gttcgcctgc tgaccggtct tctgatctcc 421 ccattctttt ccatgcagga ggattggcca ccaaagcctg tttattagca gctgccattt 481 gttaaagaaa tttggattat tttagaaaca atttggaaag aaaaagaatg gcgtccgttt 541 cagctctaac tgaggaactg gattctataa ccagtgagct acatgcagta gaaattcaaa 601 ttcaagaact tacggaaagg caacaagagc ttattcagaa aaaaaaagtc ctgacaaaga 661 aaataaagca gtgtttagag gattctgatg ccggggcaag caatgaatat gattcttcac 721 ctgccgcttg gaataaagaa gattttccat ggtctggtaa agttaaagat attctgcaaa 781 atgtctttaa actggaaaag ttcagaccac ttcagcttga aactattaac gtaacaatgg 841 ctggaaagga ggtatttctt gttatgccta caggaggtgg aaagagctta tgttaccagt 901 taccagcatt atgttcagat ggttttacac tcgtcatttg cccattgatc tctcttatgg 961 aagaccaatt aatggtttta aaacaattag gaatttcagc aaccatgtta aatgcttcta 1021 gttctaagga gcatgttaaa tgggttcatg atgaaatggt aaataaaaac tccgagttaa 1081 agctgattta tgtgactcca gagaaaattg caaaaagcaa aatgtttatg tcaagactag 1141 agaaagccta tgaagcaagg agatttactc gaattgctgt ggatgaagtt cactgctgta 1201 gtcagtgggg acatgatttc agacctgatt ataaggcact tggtatctta aagcggcagt 1261 tccctaacgc atcactaatt gggctgactg caactgcaac aaatcacgtt ttgacggatg 1321 ctcagaaaat tttgtgcatt gaaaagtgtt ttacttttac agcttctttt aataggccaa 1381 atctatatta tgaggttcgg cagaagccct caaacactga agattttatt gaggatattg 1441 taaagctcat taatgggaga tacaaagggc aatcaggaat catatattgt ttttctcaga 1501 aagactctga acaagttacg gttagtttgc agaatctggg aattcatgca ggtgcttacc 1561 atgccaattt ggagccagaa gataagacca cagttcatag aaaatggtca gccaatgaaa 1621 ttcaggtagt agtggcaact gttgcatttg gtatgggaat tgataagcca gatgtgaggt 1681 ttgttatcca tcattcaatg agtaaatcca tggaaaatta ttaccaagag agtggacgtg 1741 caggtcgaga tgacatgaaa gcagactgta ttttgtacta cggctttgga gatatattca 1801 gaataagttc aatggtggtg atggaaaatg tgggacagca gaagctttat gagatggtat 1861 catactgtca aaacataagc aaatctcgtc gtgtgttgat ggctcaacat tttgatgaag 1921 tatggaactc agaagcatgt aacaaaatgt gcgataactg ctgtaaagac agtgcatttg 1981 aaagaacgaa cataacagag tactgcagag atctaatcaa gatcctgaag caggcagagg 2041 aactgaatga aaaactcact ccattgaaac tgattgattc ttggatggga aagggtgcag 2101 caaaactgag agtagcaggt gttgtggctc ccacacttcc tcgtgaagat ctggagaaga 2161 ttattgcaca ctttctaata cagcagtatc ttaaagaaga ctacagtttt acagcttatg 2221 ctgccatttc gtatttgaaa ataggaccta aagctaatct tctgaacaat gaggcacatg 2281 ctattactat gcaagtgaca aagtccacgc agaactcttt cagggctgaa tcgtctcaaa 2341 cttgtcattc tgaacaaggt gataaaaaga atggaggaaa aaaaattcag gcaacttcca 2401 gaagaaggct gcaaacatgc ttcagcaatc tggttctaag aatacaggag ctaagaaaag 2461 aaaaatcgat gatgcctgat atgaatgtta ctaaattttc taattaaaga tggtttatgc 2521 atgtatatgc cattattttt gtagttagac aatagttttt aaaagaattt catagatatt 2581 ttatatgtat ggatctatat tttcagagct tatctctgaa gatctaaact tttgagaatg 2641 tttgaaaatt agagatcatg aattatataa ttttccagtg taaaacaagg gaaaaatttt 2701 tatgtaaaac cctttaaatg taaaatattt gagaataagt tcatacaatc gtcttaagtt 2761 ttttatgcct ttatatactt agctatattt tttcttttga cataactatc tttttgaaag 2821 caatattata ctgacagagg cttcactgag tgatacttta agttaaatat gtagatcaag 2881 ggatgtccaa tcttttggct tccctgagcc agcgaattgt gcaca // LOCUS HUMREPA 1512 bp mRNA PRI 15-JUN-1990 DEFINITION Human replication protein A 32-kDa subunit mRNA, complete cds. ACCESSION J05249 NID g337349 KEYWORDS replication protein A; single stranded DNA binding protein. SOURCE Human HeLa cell, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1512) AUTHORS Erdile,L.F., Wold,M.S. and Kelly,T.J. TITLE The primary structure of the 32-kDa subunit of human replication protein A JOURNAL J. Biol. Chem. 265, 3177-3182 (1990) MEDLINE 90153966 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.F.Erdile, 11-JAN-1990. FEATURES Location/Qualifiers source 1..1512 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 78..890 /note="replication protein A" /codon_start=1 /db_xref="PID:g337350" /translation="MWNSGFESYGSSSYGGAGGYTQSPGGFGSPAPSQAEKKSRARAQ HIVPCTISQLLSATLVDEVFRIGNVEISQVTIVGIIRHAEKAPTNIVYKIDDMTAAPM DVRQWVDTDDTSSENTVVPPETYVKVAGHLRSFQNKKSLVAFKIMPLEDMNEFTTHIL EVINAHMVLSKANSQPSAGRAPISNPGMSEAGNFGGNSFMPANGLTVAQNQVLNLIKA CPRPEGLNFQDLKNQLKHMSVSSIKQAVDFLSNEGHIYSTVDDDHFKSTDAE" BASE COUNT 405 a 337 c 367 g 403 t ORIGIN 1 cggccgcgtt ctgtggtttt ccgctattcc cccagacccg caccttctcg gcctctttgc 61 ggagaatcgt gaccaagatg tggaacagtg gattcgaaag ctatggcagc tcctcatacg 121 ggggagccgg cggctacacg cagtccccgg ggggctttgg atcgcccgca ccttctcaag 181 ccgaaaagaa atcaagagcc cgagcccagc acattgtgcc ctgtactata tctcagctgc 241 tttctgccac tttggttgat gaagtgttca gaattgggaa tgttgagatt tcacaggtca 301 ctattgtggg gatcatcaga catgcagaga aggctccaac caacattgtt tacaaaatag 361 atgacatgac agctgcaccc atggacgttc gccagtgggt tgacacagat gacaccagca 421 gtgaaaacac tgtggttcct ccagaaacat atgtgaaagt ggcaggccac ctgagatctt 481 ttcagaacaa aaagagcctg gtagccttta agatcatgcc cctggaggat atgaatgagt 541 tcaccacaca tattctggaa gtgatcaatg cacacatggt actaagcaaa gccaacagcc 601 agccctcagc agggagagca cctatcagca atccaggaat gagtgaagca gggaactttg 661 gtgggaatag cttcatgcca gcaaatggcc tcactgtggc ccaaaaccag gtgttgaatt 721 tgattaaggc ttgtccaaga cctgaagggt tgaactttca ggatctcaag aaccagctga 781 aacacatgtc tgtatcctca atcaagcaag ctgtggattt tctgagcaat gaggggcaca 841 tctattctac tgtggatgat gaccatttta aatccacaga tgcagaataa ctggatctaa 901 ctgggtacct gagatatttt acagctggac ctagtttcac aatctgttgt ctccagctct 961 gcatatgtct ggccaggggg cttctaggaa gtaggtttca tctatcaaat gtctcctctg 1021 acttcctttt gaaacttact gctcttctgt tttattttgt tttgtttgaa gctcagaggg 1081 agatgggcaa ttgacaggga tgcaatccag ggtgggattt cttgaggaag ttacaaataa 1141 gcttgttaca acatcaagat agatggaatt ggaaggatgc taccaggaga gtacttacat 1201 agtgctcagg agtttctctt cttaaaatgt ttactgctga aagatgagca ggaccagggc 1261 gttataggca gagccctagc cagaaacctg ctggcctctg cctgttttca tttcccactt 1321 tggttgtgtg gcattacttt cagaattgca ctttcctgct tgtcatgact ttttgacaca 1381 cttgccatga cgtgtgtttc tgtgaacatg aagttctgcg gtagtgcctc caggggcaga 1441 ggaaaagaag aagtgttact gcattttgta caaaataaat acagtcatat gtttaataaa 1501 acagttctac cg // LOCUS HUMRETAA 907 bp mRNA PRI 09-JAN-1995 DEFINITION Human retinoic acid receptor-beta associated open reading frame, complete sequence. ACCESSION M62303 NID g337351 KEYWORDS open reading frame. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 907) AUTHORS Hoffmann,B., Lehmann,J.M., Zhang,X.K., Hermann,T., Husmann,M., Graupner,G. and Pfahl,M. TITLE A retinoic acid receptor-specific element controls the retinoic acid receptor-beta promoter JOURNAL Mol. Endocrinol. 4 (11), 1727-1736 (1990) MEDLINE 91125354 FEATURES Location/Qualifiers source 1..907 /organism="Homo sapiens" /db_xref="taxon:9606" /map="3p24" gene 108..428 /gene="RARB" CDS 108..428 /gene="RARB" /note="open reading frame 1; putative" /codon_start=1 /db_xref="GDB:G00-120-338" /db_xref="PID:g337352" /translation="MNLESWIAEQTSAVNSARGLQEEKEKKTVEWKVSQPKPFPRGSH SLFYSLGLACAFSGVQKYISYNQFNRQKGAQRNLKCGLGGEAVGGRRAGAGGTPFSKL SRRK" BASE COUNT 229 a 183 c 271 g 224 t ORIGIN 1 agatctgaaa tctcattttc tgtgtggctg tgtgtttggg acaggggtaa ccaattcctg 61 actactctat atgctgcata gaacctggag aggatttttc aaagtaaatg aatctcgaaa 121 gctggattgc agagcaaacg agtgcagtca attcagccag gggcttgcaa gaggagaaag 181 agaaaaagac tgtggaatgg aaagtttccc aacccaagcc tttcccaagg ggtagccatt 241 ctctgttcta cagtttaggg cttgcatgtg ctttttctgg agtccaaaaa tacataagtt 301 ataaccaatt taacagacag aaaggcgcac agaggaattt aaagtgtggg ctggggggcg 361 aggcggtggg cgggaggcga gcgggcgcag gcggaacacc gttttccaag ctaagccgcc 421 gcaaataaaa aggcgtaaag ggagagaagt tggtgctcaa cgtgagccag gagcagcgtc 481 ccggctcctc ccctgctcat tttaaaagca cttcttcttg tattgttttt aaggtgagaa 541 ataggaaaga aaacgccggc ttgtgcgctc gctgcctgcc tctctggctg tctgcttttg 601 cagggctgct gggagttttt aagctctgtg agaatcctgg gagttggtga tgtcagacta 661 gttgggtcat ttgaaggtta gcagcccggg tagggttcac cgaaagttca ctcgcatata 721 ttaggcaatt caatctttca ttctgtgtga cagaagtagt aggaagtgag ctgttcagag 781 gcaggagggt ctattctttg ccaaaggggg gaccagaatt cccccatgcg agctgtttga 841 ggactgggat gccgagaacg cgagcgatcc gagcagggtt tgtctgggca ccgtcggggt 901 aggatcc // LOCUS HUMRETCG 3723 bp mRNA PRI 15-AUG-1995 DEFINITION Homo sapiens guanylyl cyclase (RetGC-2) mRNA, complete cds. ACCESSION L37378 NID g945224 KEYWORDS guanylyl cyclase; photoreceptor-specific membrane protein; transmembrane protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3723) AUTHORS Lowe,D.G., Dizhoor,A.M., Liu,K., Gu,Q., Spencer,M., Laura,R., Lu,L. and Hurley,J.B. TITLE Cloning and expression of a second photoreceptor-specific membrane retina guanylyl cyclase (RetGC), RetGC-2 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (12), 5535-5539 (1995) MEDLINE 95296345 FEATURES Location/Qualifiers source 1..3723 /organism="Homo sapiens" /note="cloning vector: lambda gt10" /db_xref="taxon:9606" mRNA 1..3723 /gene="RetGC-2" gene 1..3723 /gene="RetGC-2" sig_peptide 300..449 /gene="RetGC-2" CDS 300..3626 /gene="RetGC-2" /note="activator-dependent; calcium sensitive; 450..1700 extracellular domain 1701..1769 transmembrane domain 1770..3623 cytoplasmic domain" /codon_start=1 /function="recovery of photoreceptor dark state" /product="guanylyl cyclase" /db_xref="PID:g945225" /translation="MFLGLGRFSRLVLWFAAFRKLLGHHGLASAKFLWCLCLLSVMSL PQQVWTLPYKIGVVGPWACDSLFSKALPEVAARLAIERINRDPSFDLSYSFEYVILNE DCQTSRALSSFISHHQMASGFIGPTNPGYCEAASLLGNSWDKGIFSWACVNYELDNKI SYPTFSRTLPSPIRVLVTVMKYFQWAHAGVISSDEDIWVHTANRVASALRSHGLPVGV VLTTGQDSQSMRKALQRIHQADRIRIIIMCMHSALIGGETQMHLLECAHDLKMTDGTY VFVPYDALLYSLPYKHTPYQVLRNNPKLREAYDAVLTITVESQEKTFYQAFTEAAARG EIPEKLEFDQVSPLFGTIYNSIYFIAQAMNNAMKENGQAGAASLVQHSRNMQFHGFNQ LMRTDSNGNGISEYVILDTNLKEWELHSTYTVDMEMELLRFGGTPIHFPGGRPPRADA KCWFAEGKICHGGIDPAFAMMVCLTLLIALLSINGFAYFIRRRINKIQLIKGPNRILL TLEDVTFINPHFGSKRGSRASVSFQITSEVQSGRSPRLSFSSGSLTPATYENSNIAIY EGDWVWLKKFSLGDFGDLKSIKSRASDVFEMMKDLRHENINPLLGFFYDSGMFAIVTE FCSRGSLEDILTNQDVKLDWMFKSSLLLDLIKGMKYLHHREFVHGRLKSRNCVVDGRF VLKVTDYGFNDILEMLRLSEEESSMEELLWTAPELLRAPRGSRLGSFAGDVYSFAIIM QEVMVRGTPFCMMDLPAQEIINRLKKPPPVYRPVVPPEHAPPECLQLMKQCWAEAAEQ RPTFDEIFNQFKTFNKGKKTNIIDSMLRMLEQYSSNLEDLIRERTEELEIEKQKTEKL LTQMLPPSVAESLKKGCTVEPEGFDLVTLYFSDIVGFTTISAMSEPIEVVDLLNDLYT LFDAIIGSHDVYKVETIGDAYMVASGLPKRNGSRHAAEIANMSLDILSSVGTFKMRHM PEVPVRIRIGLHSGPVVAGVVGLTMPRYCLFGDTVNTASRMESTGLPYRIHVSLSTVT ILQNLSEGYEVELRGRTELKGKGTEETFWLIGKKGFMKPLPVPPPVDKDGQVGHGLQP VEIAAFQRRKAERQLVRNKP" mat_peptide 450..3623 /gene="RetGC-2" /product="guanylyl cyclase" BASE COUNT 999 a 834 c 920 g 970 t ORIGIN 1 cgatcactct tcagcatccg cagagtgatt ttcaaaacat ttactaacgc tttcaattat 61 ttaccccatt aatggaattt gtcaacagac tggagtcctt gacaggattg tgaatgcata 121 gaggatgcaa agaaatggac acactgaggt agctatgcca gtggtctgca tgcgcaggca 181 gtaaactgaa gacatttgag cggtgaatta cttggcattt ggaaacacca ccgtctgtgt 241 cgggaaagca gaataactct ccagtatctc gtcattagct tgctggaagc aggagggcta 301 tgttcctggg actcgggcgc ttttctcgcc ttgttctctg gtttgcggct ttcaggaaac 361 tgctgggaca ccatggcctt gcatctgcca agttcctgtg gtgcttgtgc cttctgtctg 421 tcatgtccct tccgcagcag gtgtggacac tcccctacaa gataggggtg gtgggccctt 481 gggcttgtga ttcgctgttt tcaaaggccc tgcctgaggt tgctgcgcga ttagccattg 541 agcgaatcaa ccgggaccca tcttttgacc tgagttattc ttttgaatac gtgattctca 601 atgaagactg ccagacttcg agggctctct ccagtttcat ttcccaccac cagatggcct 661 caggatttat tggacctacc aaccctggct actgcgaggc agcctcgctc ctgggaaaca 721 gctgggacaa aggaattttc tcttgggctt gtgtgaatta tgaattagac aataaaatta 781 gctacccgac cttttctcgg acactccctt ctcccatccg ggtgcttgta actgtcatga 841 aatatttcca gtgggctcat gctggagtca tttcctcaga tgaagacatt tgggtgcata 901 cagccaatcg agtcgcaagt gctcttcgga gccacggctt acctgtaggg gtcgtcctga 961 ccacaggaca agacagccaa agcatgcgga aagccctcca gaggattcac caggcagaca 1021 gaattcgcat aatcatcatg tgtatgcatt cagctttgat tgggggagag actcagatgc 1081 atctcttgga atgtgctcat gatctgaaaa tgactgatgg aacctacgtc tttgttcctt 1141 atgatgccct gctctacagt ttaccttata agcacacccc ctaccaggtc ctaaggaaca 1201 acccaaagct ccgggaagcc tatgatgcag tgttgaccat tacagtggag tcccaagaaa 1261 agaccttcta tcaagccttc acagaggcag cagcaagagg tgaaattcct gagaagctgg 1321 agttcgatca agtttcaccg ttgtttggaa ccatctacaa ttcaatttac tttatcgcac 1381 aagccatgaa taatgctatg aaagaaaatg gacaggctgg tgctgccagc ctggttcagc 1441 attccagaaa catgcagttc catggattca accagttgat gaggacagat tcaaatggaa 1501 atggaatttc agaatatgta atcctggaca ccaacttgaa agaatgggaa ctccatagca 1561 cctacactgt ggacatggaa atggagctgc tacgtttcgg agggacccct attcacttcc 1621 ctggtggcag gccccctaga gcagatgcaa aatgctggtt tgcagaaggg aagatctgcc 1681 atggaggcat cgaccctgcc tttgccatga tggtctgcct tactttgctt atagccctgc 1741 tgtctattaa tggatttgct tactttataa ggcgtcgtat aaataaaatc cagttgatca 1801 aaggacccaa tagaattcta ctgactttgg aggatgtaac gtttatcaat ccccactttg 1861 gcagtaagag aggaagtcgt gccagtgtaa gcttccagat tacctcagag gtccaaagtg 1921 ggaggtcccc aagactctcc ttttcttcag ggagtctaac tccagctacc tatgaaaact 1981 ccaacatagc gatttatgag ggtgattggg tgtggctgaa aaagttctcc cttggagatt 2041 ttggagacct taagtccatc aaatcaagag caagtgatgt gttcgaaatg atgaaggact 2101 tgcgtcatga gaatattaac cctttattgg gtttcttcta tgattcgggg atgtttgcca 2161 ttgtgacaga attctgttcc cgagggagcc tagaagacat actgacaaat caagatgtga 2221 aacttgactg gatgtttaaa tcatcactct tgctggatct cataaagggc atgaagtact 2281 tacaccacag agagtttgtt catgggaggc taaagtctcg aaactgtgtg gtagatgggc 2341 gttttgtact aaaagtgaca gattatggct ttaacgacat cttagaaatg ctgagactct 2401 ctgaagagga atcttctatg gaagagctgc tgtggacggc ccctgaactg ttgagagctc 2461 caagaggcag caggttaggt tcttttgcag gagatgtcta tagctttgcc atcatcatgc 2521 aagaagtgat ggtccggggt accccattct gcatgatgga tctgccagct caagaaatca 2581 taaacagact taagaagcct cctcctgtgt acagaccagt agttcctcct gagcatgccc 2641 ctccagaatg tctccagctg atgaagcagt gctgggctga ggctgcagaa caacgaccaa 2701 cttttgatga aatatttaac cagtttaaaa cttttaataa agggaagaag accaatatta 2761 ttgattctat gcttcggatg ttggagcaat attctagcaa cttggaagat ttgattcggg 2821 agcggactga agagctggaa attgaaaaac agaaaacgga aaagcttcta acacagatgc 2881 taccaccatc agttgctgaa tctctcaaaa agggctgcac agttgaacct gagggctttg 2941 acttggtcac cttgtacttc agcgacattg tgggcttcac aaccatttca gccatgagtg 3001 agcccattga ggtcgtggat cttctgaatg acctgtacac actctttgat gcaataattg 3061 gcagtcatga tgtctacaag gtagagacca ttggagatgc ctacatggtg gcttcaggcc 3121 tcccaaagag gaatggcagt aggcatgcag ctgagattgc aaacatgtcc ttagatatcc 3181 tgagctctgt gggcactttc aagatgcggc acatgccaga agtgccggtc cgaattcgaa 3241 ttggccttca ctcagggccg gttgttgctg gagtggtggg cctcaccatg cccagatact 3301 gcttgtttgg agacactgtg aacacagctt ctcggatgga atctacaggc ttaccttatc 3361 gcattcatgt cagtctcagc actgttacaa ttcttcaaaa tctgagtgag ggctatgaag 3421 tggagcttcg aggaagaaca gagctcaagg gcaaaggcac agaggaaacc ttctggctga 3481 ttgggaaaaa aggcttcatg aagccccttc ctgtgccccc accagtggac aaagatgggc 3541 aagtgggcca tggcctgcaa ccagtggaga ttgcagcctt ccaaagaaga aaagcagaaa 3601 ggcagttggt gagaaacaag ccataagggg caaagggctc caggatttga atcttctctt 3661 ggtgaggcaa tgggaaagct cacttcagtt caagaaattg ttctgcctgg gcaagaaatt 3721 acg // LOCUS HUMRFPA 1782 bp mRNA PRI 15-DEC-1989 DEFINITION Human rfp transforming protein mRNA, complete cds. ACCESSION J03407 NID g337371 KEYWORDS rfp transforming protein. SOURCE Human THP-1 monocytic leukemia cell line, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1782) AUTHORS Takahashi,M., Inaguma,Y., Hiai,H. and Hirose,F. TITLE Developmentally regulated expression of a human 'finger'- containing gene encoded by the 5' half of the ret transforming gene JOURNAL Mol. Cell. Biol. 8, 1853-1856 (1988) MEDLINE 88246464 COMMENT Draft entry and printed copy of sequence for [1] kindly provided by M.Takahashi, 18-FEB-1988. FEATURES Location/Qualifiers source 1..1782 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 235..1776 /note="rfp transforming protein" /codon_start=1 /db_xref="PID:g337372" /translation="MASGSVAECLQQETTCPVCLQYFAEPMMLDCGHNICCACLARCW GTAETNVSCPQCRETFPQRHMRPNRHLANVTQLVKQLRTERPSGPGGEMGVCEKHREP LKLYCEEDQMPICVVCDRSREHRGHSVLPLEEAVEGFKEQIQNQLDHLKRVKDLKKRR RAQGEQARAELLSLTQMEREKIVWEFEQLYHSLKEHEYRLLARLEELDLAIYNSINGA ITQFSCNISHLSSLIAQLEEKQQQPTRELLQDIGDTLSRAERIRIPEPWITPPDLQEK IHIFAQKCLFLTESLKQFTEKMQSDMEKIQELREAQLYSVDVTLDPDTAYPSLILSDN LRQVRYSYLQQDLPDNPERFNLFPCVLGSPCFIAGRHYWEVEVGDKAKWTIGVCEDSV CRKGGVTSAPQNGFWAVSLWYGKEYWALTSPMTALPLRTPLQRVGIFLDYDAGEVSFY NVTERCHTFTFSHATFCGPVRPYFSLSYSGGKSAAPLIICPMSGIDGFSGHVGNHGHS METSP" BASE COUNT 389 a 510 c 535 g 348 t ORIGIN 191 bp upstream of SacI site. 1 cggtgagccg gccgtattcc cgctctcgct tagggggcac aggcgcaggc atcggcccgg 61 ccactccaag ccttcggtgc gcgggcgcgt ctgggatacg ggcccgggag gcgccgccct 121 ccgtccgccc ggtgcctctc aggaacagcg aaccggagag agcgccggag agttgggctc 181 agtgcggagc tcggcgccgg ggcccatgcc cgtgcgcccc cgcaggccgg cgccatggcc 241 tccgggagtg tggccgagtg cctgcagcag gagaccacct gccccgtgtg cctgcagtac 301 ttcgcagagc ccatgatgct cgactgcggc cataacatct gttgcgcgtg cctcgcccgc 361 tgctggggca cggcagagac taacgtgtcg tgcccgcagt gccgggagac cttcccgcag 421 aggcacatgc ggcccaaccg gcacctggcc aacgtgaccc aactggtaaa gcagctgcgc 481 accgagcggc cgtcggggcc cggcggcgag atgggcgtgt gcgagaagca ccgcgagccc 541 ctgaagctgt actgcgagga ggaccagatg cccatctgcg tggtgtgcga ccgctcccgc 601 gagcaccgcg gccacagcgt gctgccgctc gaggaggcgg tggagggctt caaggagcaa 661 atccagaacc agctcgacca tttaaaaaga gtgaaagatt taaagaagag acgtcgggcc 721 cagggggaac aggcacgagc tgaactcttg agcctaaccc agatggagag ggagaagatt 781 gtttgggagt ttgagcagct gtatcactcc ttaaaggagc atgagtatcg cctcctggcc 841 cgccttgagg agctagactt ggccatctac aatagcatca atggtgccat cacccagttc 901 tcttgcaaca tctcccacct cagcagcctg atcgctcagc tagaagagaa gcagcagcag 961 cccaccaggg agctcctgca ggacattggg gacacattga gcagggctga aagaatcagg 1021 attcctgaac cttggatcac acctccagat ttgcaagaga aaatccacat ttttgcccaa 1081 aaatgtctat tcttgacgga gagtctaaag cagttcacag aaaaaatgca gtcagatatg 1141 gagaaaatcc aagaattaag agaggctcag ttatactcag tggacgtgac tctggaccca 1201 gacacggcct accccagcct gatcctctct gataatctgc ggcaagtgcg gtacagttac 1261 ctccaacagg acctgcctga caaccccgag aggttcaatc tgtttccctg tgtcttgggc 1321 tctccatgct tcatcgccgg gagacattat tgggaggtag aggtgggaga taaagccaag 1381 tggaccatag gtgtctgtga agactcagtg tgcagaaaag gtggagtaac ctcagccccc 1441 cagaatggat tctgggcagt gtctttgtgg tatgggaaag aatattgggc tcttacctcc 1501 ccaatgactg ccctacccct gcggaccccg ctccagcggg tggggatttt cttggactat 1561 gatgctggtg aggtctcctt ctacaacgtg acagagaggt gtcacacctt cactttctct 1621 catgctacct tttgtgggcc tgtccggccc tacttcagtc tgagttactc gggagggaaa 1681 agtgcagctc ctctgatcat ctgccccatg agtgggatag atgggttttc tggccatgtt 1741 gggaatcatg gtcattccat ggagacctcc ccttgaggag gt // LOCUS HUMRIP1R 3858 bp mRNA PRI 16-MAY-1996 DEFINITION Human RLIP76 protein mRNA, complete cds. ACCESSION L42542 NID g974142 KEYWORDS 76 kDa protein; Ral interacting protein 1. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3858) AUTHORS Jullien-Flores,V., Dorseuil,O., Romero,F., Letourneur,F., Saragosti,S., Berger,R., Tavitian,A., Gacon,G. and Camonis,J.H. TITLE Bridging Ral GTPase to Rho pathways. RLIP76, a Ral effector with CDC42/Rac GTPase-activating protein activity JOURNAL J. Biol. Chem. 270 (38), 22473-22477 (1995) MEDLINE 95403450 REFERENCE 2 (bases 1 to 3858) AUTHORS Camonis,J.H. TITLE Direct Submission JOURNAL Submitted (01-SEP-1995) Jacques H. Camonis, Inserm U-248, Section de Recherche, Institut Curie, 75231 Paris cedex 05, France FEATURES Location/Qualifiers source 1..3858 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="Jurkat cell" 5'UTR 1..223 /note="putative" mRNA 1..3858 /note="putative" CDS 224..2191 /codon_start=1 /product="RLIP76 protein" /db_xref="PID:g974143" /translation="MTECFLPPTSSPSEHRRVEHGSGLTRTPSSEEISPTKFPGLYRT GEPSPPHDILHEPPDVVSDDEKDHGKKKGKFKKKEKRTEGYAAFQEDSSGDEAESPSK MKRSKGIHVFKKPSFSKKKEKDFKIKEKPKEEKHKEEKHKEEKHKEKKSKDLTAADVV KQWKEKKKKKKPIQEPEVPQIDVPNLKPIFGIPLADAVERTMMYDGIRLPAVFRECID YVEKYGMKCEGIYRVSGIKSKVDELKAAYDREESTNLEDYEPNTVASLLKQYLRDLPE NLLTKELMPRFEEACGRTTETEKVQEFQRLLKELPECNYLLISWLIVHMDHVIAKELE TKMNIQNISIVLSPTVQISNRVLYVFFTHVQELFGNVVLKQVMKPLRWSNMATMPTLP ETQAGIKEEIRRQEFLLNCLHRDLQGGIKDLSKEERLWEVQRILTALKRKLREAKRQE CETKIAQEIASLSKEDVSKEEMNENEEVINILLAQENEILTEQEELLAMEQFLRRQIA SEKEEIERLRAEIAEIQSRQQHGRSETEEYSSESESESEDEEELQIILEDLQRQNEEL EIKNNHLNQAIHEEREAIIELRVQLRLLQMQRAKAEQQAQEDEEPEWRGGAVQPPRDG VLEPKAAKEQPKAGKEPAKPSPSRDRKETSI" 3'UTR 2192..3858 /note="putative" BASE COUNT 1092 a 879 c 986 g 901 t ORIGIN 1 agtctggttt aactggttgg aacgactaaa gcacgctggc gcaaggaaag ctctcaactt 61 cgggagctga ggcgcaggct ggccagagcg tggagaggaa agccctttcc atcctcaagg 121 ccgttgcagg agatgcccgc gagccacctt cgccagcacc acaccggggt gtaatggata 181 ggtaacagag aagacctcgt cccttcctag tcagggcatc agcatgactg agtgcttcct 241 gccccccacc agcagcccca gtgaacaccg cagggtggag catggcagcg ggcttacccg 301 gacccccagc tctgaagaga tcagccctac taagtttcct ggattgtacc gcactggcga 361 gccctcacct ccccatgaca tcctccatga gcctcctgat gtagtgtctg atgatgagaa 421 agatcatggg aagaaaaaag ggaaatttaa gaaaaaggaa aagaggactg aaggctatgc 481 agcctttcag gaagatagct ctggagatga ggcagaaagt ccttctaaaa tgaagaggtc 541 caagggaatc catgttttca agaagcccag cttttctaaa aagaaggaaa aggattttaa 601 aataaaagag aaacccaaag aagaaaagca taaagaagaa aagcacaaag aagaaaaaca 661 taaagagaag aagtcaaaag acttgacagc agctgatgtt gttaaacagt ggaaggaaaa 721 gaagaaaaag aaaaagccaa ttcaggagcc agaggtgcct cagattgatg ttccaaatct 781 caaacccatt tttggaattc ctttggctga tgcagtagag aggaccatga tgtatgatgg 841 cattcggctg ccagccgttt tccgtgaatg tatagattac gtagagaagt atggcatgaa 901 gtgtgaaggc atctacagag tatcaggaat taaatcaaag gtggatgagc taaaagcagc 961 ctatgaccgg gaggagtcta caaacttgga agactatgag cctaacactg tagccagttt 1021 gctgaagcag tatttgcgag accttccaga gaatttgctt accaaagagc ttatgcccag 1081 atttgaagag gcttgtggga ggaccacgga gactgagaaa gtgcaggaat tccagcgttt 1141 actcaaagaa ctgccagaat gtaactatct tctgatttct tggctcattg tgcacatgga 1201 ccatgtcatt gcaaaggaac tggaaacaaa aatgaatata cagaacattt ctatagtgct 1261 cagcccaact gtgcagatca gcaatcgagt cctgtatgtg tttttcacac atgtgcaaga 1321 actctttgga aatgtggtac taaagcaagt gatgaaacct ctgcgatggt ctaacatggc 1381 cacgatgccc acgctgccag agacccaggc gggcatcaag gaggagatca ggagacagga 1441 gtttcttttg aattgtttac atcgagatct gcagggtggg ataaaggatt tgtctaaaga 1501 agaaagatta tgggaagtac aaagaatttt gacagccctc aaaagaaaac tgagagaagc 1561 taaaagacag gagtgtgaaa ccaagattgc acaagagata gccagtcttt caaaagagga 1621 tgtttccaaa gaagagatga atgaaaatga agaagttata aatattctcc ttgctcagga 1681 gaatgagatc ctgactgaac aggaggagct cctggccatg gagcagtttc tgcgccggca 1741 gattgcctca gaaaaagaag agattgaacg cctcagagct gagattgctg aaattcagag 1801 tcgccagcag cacggccgaa gtgagactga ggagtactcc tccgagagcg agagcgagag 1861 tgaggatgag gaggagctgc agatcattct ggaagactta cagagacaga acgaagagct 1921 ggaaataaag aacaatcatt tgaatcaagc aattcatgag gagcgcgagg ccatcatcga 1981 gctgcgcgtg cagctgcggc tgctccagat gcagcgagcc aaggccgagc agcaggcgca 2041 ggaggacgag gagcctgagt ggcgcggggg tgccgtccag ccgcccagag acggcgtcct 2101 tgagccaaaa gcagctaaag agcagccaaa ggcaggcaag gagccggcaa agccatcgcc 2161 cagcagggat aggaaggaga cgtccatctg agcagcctgc gtggccgtct ggagtccgtg 2221 agactgaaag gacccgtgca tcttactgta acccgggggc caggccggct ctctcgctgt 2281 acattctgta aaggtgtctt ctcttctcag actcttcctc tgtcacacgt ctgactcctt 2341 cacgtcaggc tcaggttcca tgggaggacg aagcagtgga cgcattgtgg gctttaggga 2401 cagatgagtt ttccagatag tgtcagctta tttgaagatt aattttcttt gttaacttaa 2461 aataactatt ttaacccttg agtggcttct ttttaaacca aaaaccgtct ttctttgctt 2521 ttttatcaca gcagaatcag gatctctttc tcattcaagg ggggaaccac accaggtcag 2581 cgctgcgcct gctgtggccg ccgcgagcca cgccctctgg gatctctggt accgtcactc 2641 ttgcttgtgc cttccacacc ttctcggtgc agatccctat gggggagctg cctcacgttc 2701 tctgactggt cagagcagcg cctggtgggt gttccctggc ccactctcct ctctccttct 2761 gcagttctaa accacagtct ataagcccga gtcaccagga cggcctgtct ggccacagac 2821 aggggctgcc tgtggagcct gcccaccggc ccccggcagt gcagtccagc ggggaggagg 2881 ctgcccgttc ctgccagttc ctcactgcgg ggaccagcaa aggccttctc actgggttgg 2941 tcaaaggtag tcaccttggc ctggtgcatc cacagaggat gttgttcaaa ccagaaatct 3001 tttaaacgac tgaccttcct taaaaacaga atgactccga ttgcttgctt gggctagaat 3061 gtacacgtct ccttgcctga ataagccata tatatgctct taaacaaaag tttgaaatta 3121 tccatatcat ctcagtgaac ctactggtgg actcccaatt gacaagattg agcaatagaa 3181 aaaaattcct ttcctttgaa tgatagctgt gattcacccc accccatttt cttgtttctg 3241 gtccatccga tgagacggat gctctgatgc tctgaggctt ctgggaggct gggccctgga 3301 ggcaacgtgc tgcaggcgca ctctgtcaga gtgaacagca ccgcgagaca ggccaggctc 3361 gtggctcgga agacaaaccc cacacacact caaggggtcg aaaacaaacc ccacacgagg 3421 gctctcacct ccttctccta ggtagtattt attttcagca cctgtttgat gcagttttta 3481 atcctctacc tattgcactg ttgtgactcg ttggccatta tttgattttg gtacgaaaaa 3541 aagctttgtt atagaaatca gcatactatt tttttaaatc tggagagaag atattctggt 3601 gactgaaagt atggtcgggt gtcagatata aatgtgcaaa tgccttcttg ctgtcctgtc 3661 ggtctcagta cgttcacttt atagctgctg gcaatatcga aggttccttt tttgtttgtg 3721 taaactctaa tttctatcaa ggtgtcatgg atttttaaaa ttagtatttc attacaaatg 3781 tctcagcatt ggttaactaa ttttgggcag gaccattatt gatcaagcaa ataaattcaa 3841 cagccatttg ggaaaaag // LOCUS HUMRNA 1558 bp mRNA PRI 23-MAY-1995 DEFINITION Human mRNA for RNA helicase, complete cds. ACCESSION D26528 NID g473713 KEYWORDS RNA helicase. SOURCE Homo sapiens (library: lambda gt11) HeLa cell cDNA to mRNA, clone 3. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1558) AUTHORS Kitajima,Y., Yatsuki,H., Zhang,R., Matsuhashi,S. and Hori,K. TITLE A novel human homologue of a DEAD-box RNA helicase family JOURNAL Biochemical and Biophysical Research Communication 199, 748-754 (1994) REFERENCE 2 (bases 1 to 1558) AUTHORS Hori,K. TITLE Direct Submission JOURNAL Submitted (18-JAN-1994) to the DDBJ/EMBL/GenBank databases. Katsuji Hori, Saga Medical School, Department of Biochemistry; 5-1-1 Nabeshima, Saga, Saga 849, Japan (Tel:0952-31-6511(ex.2260), Fax:0952-33-2517) COMMENT Submitted (18-Jan-1994) to DDBJ by: Katsuji Hori Department of Biochemistry Saga Medical School 5-1-1 Nabeshima, Saga Saga 849 Japan Phone: 0952-31-6511 x2260 Fax: 0952-33-2517. FEATURES Location/Qualifiers source 1..1558 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa cell" /clone_lib="lambda gt11" 5'UTR <1..174 CDS 175..1494 /codon_start=1 /label=NP-52 /product="RNA helicase" /db_xref="PID:d1006078" /db_xref="PID:g473714" /translation="MPIKPALQAQFEQQFKAQTPIQAAVWEKLAQGDNIFGLAPTGTG KTLAFLLPILSRIDPKVKQTQVLILAPSQELAMQTTAVAREWGALVDVSVTSLIGGAN GRRQADKLKKDKPHIVVGTLGRVLTMLDGGALKLNGLQTVIFDEADAMLSDERRDSLQ ALAAQLPTDVQLGLFSATSGVDLKYVADTFGQEVRPVSVGTDAPAAITHEFQYVDQKA KASLLIQLARNKQQALVFFNTISGLVNMQATLRHAHASVMSIGSNDKRQVQRADALRL FKKGEVSLLLVTDVAARGLDIEDLPLVVNAQLPQRKKTYIHRAGRTGRMGKPGRVLNL GNDHDIRDLKRELGDDFVLVKAANTFADAKQATPQKSTAAVKAARATAPTDQAGHQAS GQAAKPVTQRPTTQAKTVPVVEKPKKKKRLKASKDKGKPKWAKKSAE" 3'UTR 1495..>1558 BASE COUNT 454 a 343 c 389 g 372 t ORIGIN 1 gaattccgga acgttgaagc acaagacttt gcggccatta tgaatgaccc cgttggtcac 61 catttgcagt atacgcagtg gttccaactt gctaaacaag tcaaccaaac gttgtttgac 121 ttacgttcgt cggctggaat tattttccca gctgaccaga aagaaatgta atttatgcca 181 attaaaccag cgctacaagc gcaatttgaa caacaattta aagcccaaac acccattcaa 241 gctgctgttt gggagaagtt agcgcagggc gacaatattt tcggcctagc gccaactgga 301 acagggaaaa cgttagcttt tttgctgcca attcttagcc gaattgatcc caaagttaag 361 cagacacaag tcttgatttt ggcgccaagt caagaactcg ccatgcaaac gactgctgtt 421 gcacgggaat ggggcgcttt ggtcgatgtc tcggtgacga gtttgattgg tggtgccaat 481 ggccgccgtc aagcagataa actcaaaaaa gataagccac atattgtggt ggggacatta 541 ggtcgtgtat tgacaatgct tgatggtggc gccctaaaac ttaatggctt gcaaaccgtt 601 attttcgatg aagcagacgc tatgttgtca gatgaacggc gtgacagttt gcaagcatta 661 gcagcgcaat taccaacaga cgttcaactt ggcctgtttt cagcaacttc tggggttgat 721 ttgaaatatg tcgccgacac ttttggccaa gaagtccgac ccgtatcagt tggcacggat 781 gcgccagcag cgattacgca tgaatttcaa tatgttgacc aaaaggccaa ggctagtttg 841 ttgattcaac tagcccgtaa taagcaacaa gctttggtgt tctttaatac gattagtggc 901 ttggttaata tgcaggcgac gctacgtcac gcgcatgcta gtgtgatgag tattggcagc 961 aatgacaagc gacaagtgca gcgtgcagat gcgttacgtt tattcaaaaa aggcgaagta 1021 tcgctgcttt tggtgacaga tgtcgccgca cgtggcttgg atattgaaga tctaccatta 1081 gtcgtgaacg cacagttacc acaacggaaa aaaacatata ttcatcgcgc tggccgtaca 1141 ggacgaatgg gtaaaccagg ccgcgtgtta aacttgggta atgaccatga tattcgtgat 1201 ttgaaacgcg aattagggga tgattttgtt ttggtgaaag ccgctaatac gtttgccgat 1261 gcgaagcagg cgacaccgca aaaaagtacc gctgcggtca aggctgcacg cgcaactgcg 1321 ccaactgatc aggcaggaca ccaagcgtcg ggccaagcgg ccaaaccagt gacgcaacgt 1381 ccgacaactc aggcgaagac agtaccggtt gttgaaaagc ctaagaagaa aaaacgctta 1441 aaggcatcca aagataaggg aaagccaaaa tgggctaaaa agtcggccga gtaagccaca 1501 tatcaaaagg gtcacagcta atcatcaagc tgtgaccctt ttttaatatc cggaattc // LOCUS HUMRNAPII 1212 bp mRNA PRI 23-FEB-1995 DEFINITION Human RNA polymerase II 23kD subunit (POLR2) mRNA, complete cds. ACCESSION J04965 NID g678548 KEYWORDS RNA polymerase II. SOURCE Human cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1212) AUTHORS Pati,U.K. and Weissman,S.M. TITLE Isolation and molecular characterization of a cDNA encoding the 23-kDa subunit of human RNA polymerase II [published erratum appears in J Biol Chem 1991 Jul 15;266(20):13468] JOURNAL J. Biol. Chem. 264 (22), 13114-13121 (1989) MEDLINE 89327280 REFERENCE 2 (bases 1 to 1212) AUTHORS Weissman,S.M. TITLE Direct Submission JOURNAL Submitted (02-AUG-1989) Sherman M. Weissman, Genetics, Yale University, 295 Congress Avenue, New Haven, CT 06510, USA FEATURES Location/Qualifiers source 1..1212 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /map="17p13.1" gene 25..657 /gene="POLR2" CDS 25..657 /gene="POLR2" /codon_start=1 /db_xref="GDB:G00-120-306" /product="RNA polymerase II 23kD subunit" /db_xref="PID:g678549" /translation="MDDEEETYRLWKIRKTIMQLCHDRGYLVTQDELDQTLEEFKAQF GDKPSEGRPRRTDLTVLVAHNDDPTDQMFVFFPEEPKVGIKTIKVYCQRMQEENITRA LIVVQQGMTPSAKQSLVDMAPKYILEQFLQQELLINITEHELVPEHVVMTKEEVTELL ARYKLRENQLPRIQAGDPVARYFGIKRGQVVKIIRPSETAGRYITYRLVQ" BASE COUNT 240 a 384 c 368 g 220 t ORIGIN 1 ggggcggcgg cggcggaggc tgccatggac gacgaggagg agacgtaccg gctctggaaa 61 atccgcaaga ccatcatgca gctgtgccac gaccgtggct atctggtgac ccaggacgag 121 cttgaccaga ccctggagga gttcaaagcc caatttgggg acaagccgag tgaggggcgg 181 ccgcggcgca cggacctcac cgtgctggtg gcccacaacg atgaccccac cgaccagatg 241 tttgtgttct ttccagagga gcccaaggtg ggcatcaaga ccatcaaggt gtactgccag 301 cgcatgcagg aggagaacat cacacgggct ctcatcgtgg tgcagcaggg catgacaccc 361 tccgccaagc agtccctggt cgacatggcc cccaagtaca tcctggagca gtttctgcag 421 caggagctgc tcatcaacat cacggagcac gagctagtcc ctgagcacgt cgtcatgacc 481 aaggaggagg tgacagagct gctggcccga tataagctcc gagagaacca gctgcccagg 541 atccaggcgg gggaccctgt ggcgcgctac tttgggataa agcgtgggca ggtggtgaag 601 atcatccggc ccagtgagac ggctggcagg tacatcacct accggctggt gcagtagcta 661 ccgcctgaca gcccctagag gcgacacaca gcgaccccca tccctgcagg acaaacgccc 721 ctgccctgcc agaatccggc ccccacagct ctcacggctg ctgctcctct ggactcccca 781 aggcaggtgg cctccaccac gttctcccgt cctggggtga ggcttcctgt ggcccagccg 841 ccccattcac ctgtggattt gtgcgagatg cagcctcaga aggaacaagg cccccagagg 901 gaggtcacct gggggcagct ggtgccgggt cttcacccag accacgctgg gtcccctctg 961 ttgggggttt ggggtccggg tctcccacca gccactgctt cctcctgggc cctcaattcc 1021 acccctcgtc ttccctccct cggggccctg atgcgtgccc cgccgcctcg gctctttact 1081 ccattcacag ccgtgcacgc gctcaagcac cagggtgcga gatgccagct ctggagttct 1141 cggttgttgt aggaggttgg gtgttttcaa atggtaaaga tgttttgacg aaataaattt 1201 gcttgataca gg // LOCUS HUMRNASA 1620 bp mRNA PRI 27-FEB-1996 DEFINITION Human mRNA for ribonuclease A (RNase A), complete cds. ACCESSION D26129 NID g532677 KEYWORDS RNase 1; RNase A; ribonuclease. SOURCE Homo sapiens pancreas (library: lambda gt11) cDNA to mRNA, clone pBO23. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1620) AUTHORS Seno,M., Futami,J., Kosaka,M., Seno,S. and Yamada,H. TITLE Nucleotide sequence encoding human pancreatic ribonuclease JOURNAL Biochim. Biophys. Acta 1218 (3), 466-468 (1994) MEDLINE 94325363 REFERENCE 2 (bases 1 to 1620) AUTHORS Seno,M. TITLE Direct Submission JOURNAL Submitted (11-DEC-1993) to the DDBJ/EMBL/GenBank databases. Masaharu Seno, Okayama University, Faculty of Engineering, Department of Bioengineering Science; 3-1-1 Tsushima-Naka, Okayama, Okayama 700, Japan (Tel:086-251-8216, Fax:086-253-5755) COMMENT Submitted (11-Dec-1993) to DDBJ by: Masaharu Seno Department of Bioengineering Science Faculty of Engineering, Okayama University 3-1-1 Tsushima-Naka Okayama 700 Japan Phone: 086-251-8216 Fax: 086-253-5755. FEATURES Location/Qualifiers source 1..1620 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda gt11" /tissue_type="pancreas" sig_peptide 1003..1086 CDS 1003..1473 /codon_start=1 /product="ribonuclease A precursor" /db_xref="PID:d1005666" /db_xref="PID:g641937" /translation="MALEKSLVRLLLLVLILLVLGWVQPSLGKESRAKKFQRQHMDSD SSPSSSSTYCNQMMRRRNMTQGRCKPVNTFVHEPLVDVQNVCFQEKVTCKNGQGNCYK SNSSMHITDCRLTNGSRYPNCAYRTSPKERHIIVACEGSPYVPVHFDASVEDST" mat_peptide 1087..1470 /EC_number="3.1.27.5" /product="ribonuclease A" BASE COUNT 411 a 422 c 390 g 397 t ORIGIN 1 gaattccggg tttgaaaagg agttctaggg aagaagagag ttagttagca catcaatggg 61 agcagggctc ttaccccacg tggtgttaca tatatattat tttcatacat ggtttctggc 121 tcataagttc cttagccctt gctatagtct tttgtgttcg gtcttaaggg caggactgta 181 ctcttccctc acctttctaa ttgtgcatct taagaccttc cccagagagg gtggtgccct 241 gtagttgtgg gaaggaatgc tggcatcatg aagcttccat aaaaacccga gaaacgagct 301 tctggatagc tggacacatg gaggtcctgg agggtggagc ccagggaggc atggaagctc 361 cacagccctt cccccatacc ttaccctatt tcctctgtat cctttgtaat atcctttatg 421 ataaaccagc aaatgtgtgt aaatgtttcc ctaaggtctg tggccactcc agcaaattaa 481 ttgaacctaa agagggggtc gtgggaaccc caacttgaag ccagtcagtc agaagttctg 541 gatgtccaga cttcagactg gtgtctgaaa gggtggaggc agtcttgggg accgagcccc 601 caatctatgg gatctgacac tatctccagt agtgttggaa ttgagtcacc agcgtgtcca 661 ctggttagtg tgtgagaaac tccctaccat tggtcacaga agtcttcttc tgtgttgata 721 gttgtagtgt gacagcagag gaaaaacaaa gtcagaaaga gttttcccga acacacccaa 781 tttctccatt ttactatcca tttccacaaa cactgactac aatagaagta taaaaattac 841 tccactgcat cattcagctt tccatctctc tcagacacca agctgcagat ccaggtcact 901 ttgtaggtca ccacctagag gggaggaaga cctcgctttg gagagtggga ataaaacgct 961 cgtggaaaag ggtacacgct tttctgggaa agtgaggcca ccatggctct ggagaagtct 1021 cttgtccggc tccttctgct tgtcctgata ctgctggtgc tgggctgggt ccagccttcc 1081 ctgggcaagg aatcccgggc caagaaattc cagcggcagc atatggactc agacagttcc 1141 cccagcagca gctccaccta ctgtaaccaa atgatgaggc gccggaatat gacacagggg 1201 cggtgcaaac cagtgaacac ctttgtgcac gagcccctgg tagatgtcca gaatgtctgt 1261 ttccaggaaa aggtcacctg caagaacggg cagggcaact gctacaagag caactccagc 1321 atgcacatca cagactgccg cctgacaaac ggctccaggt accccaactg tgcataccgg 1381 accagcccga aggagagaca catcattgtg gcctgtgaag ggagcccata tgtgccagtc 1441 cactttgatg cttctgtgga ggactctacc taaggtcaga gcagcgagat accccacctc 1501 cctcaacctc atcctctcca cagctgcctc ttccctcttc cttccctgct gtgaaagaag 1561 taactacagt tagggctcct attcaacaca cacatgcttc cctttcctga gccggaattc // LOCUS HUMRNASE4 996 bp mRNA PRI 09-OCT-1996 DEFINITION Human mRNA for RNase 4, complete cds. ACCESSION D37931 NID g976228 KEYWORDS RNase 4; ribonuclease type 4; pancreatic ribonuclease family. SOURCE Homo sapiens adult pancreas (with no medical abnormalities) cDNA to mRNA, clone_lib:lambda gt11 clone:pBO52. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 996) AUTHORS Seno,M., Futami,J., Tsushima,Y., Akutagawa,K., Kosaka,M., Tada,H. and Yamada,H. TITLE Molecular cloning and expression of human ribonuclease 4 cDNA JOURNAL Biochim. Biophys. Acta 1261 (3), 424-426 (1995) MEDLINE 95260866 REFERENCE 2 (bases 1 to 996) AUTHORS Seno,M. TITLE Direct Submission JOURNAL Submitted (10-AUG-1994) to the DDBJ/EMBL/GenBank databases. Masaharu Seno, Okayama University, Faculty of Engineering, Department of Bioengineering Science; 3-1-1 Tsushima-Naka, Okayama, Okayama 700, Japan (Tel:086-251-8216, Fax:086-253-5755) COMMENT Sequence updated (16-Nov-1994) by: Masaharu Seno. FEATURES Location/Qualifiers source 1..996 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pBO52" /clone_lib="lambda gt11" /dev_stage="adult" /tissue_type="pancreas (with no medical abnormalities)" sig_peptide 28..111 CDS 28..471 /codon_start=1 /product="RNase 4" /db_xref="PID:d1007727" /db_xref="PID:g976229" /translation="MALQRTHSLLLLLLLTLLGLGLVQPSYGQDGMYQRFLRQHVHPE ETGGSDRYCNLMMQRRKMTLYHCKRFNTFIHEDIWNIRSICSTTNIQCKNGKMNCHEG VVKVTDCRDTGSSRAPNCRYRAIASTRRVVIACEGNPQVPVHFDG" BASE COUNT 264 a 223 c 218 g 291 t ORIGIN 1 gaattccggc ccccctctaa gatactgatg gctctgcaga ggacccattc attgcttctg 61 cttttgctgc tgaccctgct ggggctgggg ctggtccagc cctcctatgg ccaggatggc 121 atgtaccagc gattcctgcg gcaacacgtg caccctgagg agacaggtgg cagtgatcgc 181 tactgcaact tgatgatgca aagacggaag atgactttgt atcactgcaa gcgcttcaac 241 accttcatcc atgaagatat ctggaacatt cgtagtatct gcagcaccac caatatccaa 301 tgcaagaacg gcaagatgaa ctgccatgag ggtgtagtga aggtcacaga ttgcagggac 361 acaggaagtt ccagggcacc caactgcaga tatcgggcca tagcgagcac tagacgtgtt 421 gtcattgcct gtgagggtaa cccacaggtg cctgtgcact ttgacggtta gatgccacca 481 tgtagggatt atcgcgagtg gttgacctta cacttactcc ttaaatagca gtgagtaatg 541 catttgagct gtcccaggct ctgtctcctc agctcatttc ctactctttt tctctatata 601 actcattcta ttaaatacat tgcaccaaag agatatggag acataaacct gtaatgaatg 661 aggctgggct tttctgtaat aagcttcctt ttataatact ggtcagctta gtctctcaga 721 tcctatcctg tggaatttag ttattatgtg tatttatgta gtatttcaaa catttcaaaa 781 tgctttcatc tatgtttatc acattttaat accacagact tataatgatg tcactacata 841 tagaagctca aagttaaggg atttgctgaa gactgtaaag ttaatggaag aattgagaca 901 aaaatccagt gtagctggcc acttatccag ggctttttct acttcatcac aaggaatgtt 961 ttgaaagtgt ctgctttttt tatccttccg gaattc // LOCUS HUMRNATSPY 1075 bp mRNA PRI 08-MAY-1993 DEFINITION Homo sapiens testicular protein (TSPY) mRNA, complete cds. ACCESSION M98525 NID g292428 KEYWORDS testicular protein. SOURCE Homo sapiens male adult testis cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1075) AUTHORS Zhang,J.S., Yang-Feng,T.L., Muller,U., Mohandas,T.K., de Jong,P.J. and Lau,Y.-F.C. TITLE Molecular isolation and characterization of an expressed gene from the human Y chromosome JOURNAL Hum. Mol. Genet. 1, 717-726 (1992) MEDLINE 93258314 FEATURES Location/Qualifiers source 1..1075 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /germline /sex="male" /tissue_type="testis" /map="Y" mRNA 1..1075 CDS 74..835 /codon_start=1 /product="testicular protein" /db_xref="PID:g292429" /translation="MEAVQEGAAGVESEQAALGEEAVLLLDDIMAEVEVVAEEEGLVE RREEAQPRQQAVPGPGPMTPESALEELLAVQVELEPVNAQARKAFSRQREKMERRRKP HLDRRGAVIQSVPGFWANVIANHPQMSALITDEDEDMLSYMVSLEVEEEKHPVHLCKI MLFFRSNPYFQNKVITKEYLVNITEYRASHSTPIEWYPDYEVEAYRRRHHNSSLNFFN WFSDHNFAGSNKIAESPDRSYVRTCGAIPCNTTRG" BASE COUNT 278 a 249 c 336 g 212 t ORIGIN 1 gggtttctgt ggcgtgggtc gggcagcaca ggccttggtg tgtgcgagtg ccaaggaggg 61 caccgccttc aggatggagg ctgtgcagga gggggcggcc ggggtggaga gtgagcaggc 121 ggctttgggg gaggaggcgg tgctgctgtt ggatgacata atggcggagg tggaggtggt 181 ggcggaggag gagggcctcg tggagcggcg ggaggaggcc cagccccgac agcaggctgt 241 gcctggccct gggcccatga ccccagagtc tgcactggag gagctgctgg ccgttcaggt 301 ggagctggag ccggttaatg cccaagccag gaaggccttt tctcggcagc gggaaaagat 361 ggagcggagg cgcaagcccc acctagaccg cagaggcgcc gtcatccaga gcgtccctgg 421 cttctgggcc aatgttattg caaaccaccc ccagatgtca gccctgatca ctgacgaaga 481 tgaagacatg ctgagctaca tggtcagcct ggaggtggaa gaagagaagc atcctgttca 541 tctctgcaag atcatgttgt tctttcggag taacccctac ttccagaata aagtgattac 601 caaggaatat ctggtgaaca tcacagaata cagggcttct cattccactc caattgagtg 661 gtatccggat tatgaagtgg aggcctatcg ccgcagacac cacaacagca gccttaactt 721 cttcaactgg ttctctgacc acaacttcgc aggatctaac aagattgctg agtcccctga 781 cagatcctat gtaaggacct gtggcgcaat cccctgcaat actacaagag gatgaagcca 841 cctgaagagg gaacagagac gtcaggggac tcccagttgt tgagttgaat atgatggagc 901 atcagatttt acctaataca gcagaactcc taaaaagtta cagccatatg caggacggca 961 gtactcagca tggtcttatg cacaggaact aaaggaaaaa gagatcgagt cacaaaaatt 1021 caggaagagg gggtaaatgt ggattgtatg gaatgaaaaa taaacattct caagg // LOCUS HUMRNPAB 1537 bp mRNA PRI 02-DEC-1991 DEFINITION Human hnRNP type A/B protein mRNA, complete cds. ACCESSION M65028 NID g337450 KEYWORDS RNA-binding protein; hnRNP type A/B protein. SOURCE Human liver library and human breast carcinoma MCF-7 library, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1537) AUTHORS Khan,F., Jaiswal,A.K. and Szer,W. TITLE Cloning and sequence analysis of a human type A/B hnRNP protein JOURNAL FEBS Lett. 290, 159-161 (1991) MEDLINE 92008653 FEATURES Location/Qualifiers source 1..1537 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_lib="human liver; human breast carcinoma MCF-7" CDS 143..997 /note="'High affinity for G-rich and U-rich regions of hnRNA'; putative" /codon_start=1 /function="'Single-stranded RNA binding protein, type A/B'" /product="hnRNP type A/B protein" /db_xref="PID:g337451" /translation="MSEAGEEQPMETTGATENGHEAVPEASRGRGWTGAAAGLEARPP RPRAGIRTAPRDQINASKNEEDAGKMFVGGLSWDTSKKDLKDYFTKFGEVVDCTIKMD PNTGRSRGFGFILFKDAASVEKVLDQKEHRLDGRVIDPKKAMAMKKDPVKKIFVGGLN PESPTEEKIREYFGEFGEIEAIELPMDPKLNKRRGFVFITFKEEEPVKKVLEKKFHTV SGSKCEIKVAQPKEVYQQQQYGSGGRGNRNRGNRGSGGGGGGGGQGSTNYGKSQRRGG HQNNYKPY" misc_feature 350..367 /note="putative" /function="'RNP-2 Consensus sequence'" misc_feature 467..490 /note="putative" /function="'RNP-1 Consensus sequence'" misc_feature 602..619 /note="putative" /function="'RNP-2 Consensus sequence'" misc_feature 722..745 /note="putative" /function="'RNP-1 Consensus sequence'" misc_feature 932..958 /note="putative" /function="'ATP/GTP binding site mofif A'" polyA_signal 1517..1522 /note="hnRNP A/B protein; putative" BASE COUNT 394 a 342 c 455 g 346 t ORIGIN 1 acacagttgg agcagctcgt gggctgactg ggcgaggcct cagcagcgcg agcttgagtg 61 cggccgcgtg cggcgccttc tgcgggtggg acgagcgggc gcgcggtacg tcactcgagg 121 agctcgcgcg cctcggccta gcatgtcgga agcgggcgag gagcagccca tggagacgac 181 gggcgccacc gagaacggac atgaggccgt ccccgaagcg agtcgcggcc ggggctggac 241 gggcgccgcg gcggggctgg aggcgcgacc gccgcgcccc cgagcgggaa tcagaacggc 301 gccgagggac cagatcaacg ccagcaagaa cgaggaggac gcgggaaaaa tgttcgttgg 361 tggcctgagc tgggatacta gcaaaaaaga tttaaaagac tattttacta aatttggaga 421 ggtcgttgac tgtacaataa aaatggatcc caacactgga cggtcaagag ggtttgggtt 481 tatcctgttc aaagatgcag ccagtgtgga gaaggtccta gaccagaagg agcacaggct 541 ggatggccgt gtcattgacc ctaaaaaggc catggctatg aagaaggacc cggtcaagaa 601 aatcttcgtt gggggtctga atcctgaaag tcccactgag gaaaagatca gggagtactt 661 tggcgagttt ggggagattg aggccattga attgccaatg gatccaaagt tgaacaaaag 721 acgaggtttt gtgtttatca cctttaaaga agaagaaccc gtgaagaagg ttctggagaa 781 aaagttccat actgtcagtg gaagcaagtg tgagatcaag gtggcccagc ccaaagaagt 841 ctatcagcag cagcagtatg gctctggggg ccgtggaaac cgcaaccgag ggaaccgagg 901 cagcggaggt ggtggtggag gtggaggtca gggtagtaca aactacggca agagccagcg 961 acgtggtggc catcagaata actacaagcc atactgaggc ggccaaggga gcgaccaact 1021 gatcgcacac atgctttgtt tggatatgga gtgaacacaa ttatgtacca aatttaactt 1081 ggcaaacttt ctattgcctg tcccatgtgc atcttattta aaatttcccc catggaaatc 1141 actctcctgt tgactatttc cagagctcta ggtgtttagg cagcgtgtgg tgtctgagag 1201 gccatagcgc catcatgggc tgatttttat taccaggtcc cccagaagca ggtgagaggc 1261 tctgcttctg ctgccgctct gcagcctgga cctgtggacc ctggttgtaa agagtaaatt 1321 gtatcttagg aaaccagtgt cacctttttt tcacctttta attttatatt atttgcgtca 1381 tacatttcct gtaacggaag tgttaatttt actgtacttt ttggtacccc ttttgggaat 1441 ctaatgtatt gtaaggtatt ttacacgtgt cctgattttg ccacaacctg gatattgaag 1501 ctatccaagc ttttgaaata aaatttaaaa acccccg // LOCUS HUMRODSA 1279 bp mRNA PRI 09-JAN-1995 DEFINITION Human uroporphyrinogen III synthase mRNA, complete cds. ACCESSION J03824 NID g337462 KEYWORDS uroporphyrinogen III synthase. SOURCE Human (adult) liver, cDNA to mRNA, clone pUROS-2. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1279) AUTHORS Tsai,S.F., Bishop,D.F. and Desnick,R.J. TITLE Human uroporphyrinogen III synthase: molecular cloning, nucleotide sequence, and expression of a full-length cDNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85 (19), 7049-7053 (1988) MEDLINE 89017136 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by D.F.Bishop, 10-AUG-1988. FEATURES Location/Qualifiers source 1..1279 /organism="Homo sapiens" /db_xref="taxon:9606" /map="10q25.2-q26.3" gene 197..994 /gene="UROS" CDS 197..994 /gene="UROS" /note="uroporphyrinogen III synthase (EC 4.2.1.75)" /codon_start=1 /db_xref="GDB:G00-128-112" /db_xref="PID:g337463" /translation="MKVLLLKDAKEDDCGQDPYIRELGLYGLEATLIPVLSFEFLSLP SFSEKLSHPEDYGGLIFTSPRAVEAAELCLEQNNKTEVWERSLKEKWNAKSVYVVGNA TASLVSKIGLDTEGETCGNAEKLAEYICSRESSALPLLFPCGNLKREILPKALKDKGI AMESITVYQTVAHPGIQGNLNSYYSQQGVPASITFFSPSGLTYSLKHIQELSGDNIDQ IKFAAIGPTTARALAAQGLPVSCTAESPTPQALATGIRKALQPHGCC" BASE COUNT 292 a 352 c 354 g 281 t ORIGIN 85 bp upstream of PstI site. 1 tcctggggcc cagcgcgggt ggctgccgcg gcccctcggg ctgcgtgggg agggggcttc 61 cgcccctgtt gtcattgctc ctgcagcctt ttcgctggga ctgcgcgaca ccgccccccg 121 accgggtgcc cgctgtgtgc caggccgggt gctgggcacg gtcccgcgag tgccctataa 181 ggactgccag gcaataatga aggttctttt actgaaggat gcgaaggaag atgactgtgg 241 ccaggatccg tatatcaggg aattaggatt atatggactt gaagccactt tgatccctgt 301 tttatcgttt gagtttttgt ctcttcccag tttctctgag aagctttctc atcctgaaga 361 ttacggggga ctcattttta ccagccccag agcagtggaa gcagcagagt tatgtttgga 421 gcaaaacaat aaaactgaag tctgggaaag gtctctgaaa gaaaaatgga atgccaagtc 481 agtgtatgtg gttggaaatg ctactgcttc tctagtgagt aaaattggcc tggatacaga 541 aggagaaacc tgtggaaatg cagaaaagct tgcagaatat atttgttcca gggagtcctc 601 agcactgcct cttctatttc cctgtggaaa cctcaaaaga gaaatcctgc caaaagcgct 661 caaggacaaa gggattgcca tggaaagcat aactgtgtat cagacagttg cacacccagg 721 aatccaaggg aacctgaaca gctactattc ccagcagggg gttccagcca gcatcacatt 781 ttttagtccc tctggcctca catacagtct caagcacatt caggagttat ctggtgacaa 841 tatcgatcaa attaagtttg cagccatcgg ccccactacg gctcgcgcgc tggccgccca 901 gggccttcct gtaagctgca ctgcagagag ccccacgcca caagccctgg ccactggcat 961 caggaaggct ctccagcccc atggctgctg ctgagtcagc cacctagcgc tggccccatg 1021 cagcctccct gggctgggct ggctctggat ggagccaggc atcggcaagg gctctcggga 1081 gctgctgccg tcagactcct gcctcaagcc tgagtggaag cacctgagga ccggggatcg 1141 ggacctgacc tggggctggc ctcaggccca cgtgcacgtg actgccctct gtggaagcca 1201 gcttaaaccc tagccctgtg agagcttcct gtgcccagca ggaaggaagt caaataaacc 1261 acactgacta cctgtgctt // LOCUS HUMROR2A 4092 bp mRNA PRI 09-JAN-1995 DEFINITION Human transmembrane receptor (ror2) mRNA, complete cds. ACCESSION M97639 NID g337466 KEYWORDS cell surface receptor; transmembrane receptor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4092) AUTHORS Masiakowski,P. and Carroll,R.D. TITLE A novel family of cell surface receptors with tyrosine kinase-like domain JOURNAL J. Biol. Chem. 267 (36), 26181-26190 (1992) MEDLINE 93100347 FEATURES Location/Qualifiers source 1..4092 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="SH-SY5Y" gene 200..3031 /gene="ror2" CDS 200..3031 /gene="ror2" /note="contains tyrosine kinase-like domain" /codon_start=1 /product="transmembrane receptor" /db_xref="PID:g337467" /translation="MARGSALPRRPLLCIPAVWAAAALLLSVSRTSGEVEVLDPNDPL GPLDGQDGPIPTLKGYFLNFLEPVNNITIVQGQTAILHCKVAGNPPPNVRWLKNDAPV VQEPRRIIIRKTEYGSRLRIQDLDTTDTGYYQCVATNGMKTITATGVLFVRLGPTHSP NHNFQDDYHEDGFCQPYRGIACARFIGNRTIYVDSLQMQGEIENRITAAFTMIGTSTH LSDQCSQFAIPSFCHFVFPLCDARSRAPKPRELCRDECEVLESDLCRQEYTIARSNPL ILMRLQLPKCEALPMPESPDAANCMRIGIPAERLGRYHQCYNGSGMDYRGTASTTKSG HQCQPWALQHPHSHHLSSTDFPELGGGHAYCRNPGGQMEGPWCFTQNKNVRMELCDVP SCSPRDSSKMGILYILVPSIAIPLVIACLFFLVCMCRNKQKASASTPQRRQLMASPSQ DMEMPLINQHKQAKLKEISLSAVRFMEELGEDRFGKVYKGHLFGPAPGEQTQAVAIKT LKDKAEGPLREEFRHEAMLRARLQHPNVVCLLGVVTKDQPLSMIFSYCSHGDLHEFLV MRSPHSDVGSTDDDRTVKSALEPPDFVHLVAQIAAGMEYLSSHHVVHKDLATRNVLVY DKLNVKISDLGLFREVYAADYYKLLGNSLLPIRWMAPEAIMYGKFSIDSDIWSYGVVL WEVFSYGLQPYCGYSNQDVVEMIRNRQVLPCPDDCPAWVYALMIECWNEFPSRRPRFK DIHSRLRAWGNLSNYNSSAQTSGASNTTQTSSLSTSPVSNVSNARYVGPKQKAPPFPQ PQFIPMKGQIRPMVPPPQLYVPVNGYQPVPAYGAYLPNFYPVQIPMQMAPQQVPPQMV PKPSSHHSGSGSTSTGYVTTAPSNTSMADRAALLSEGADDTQNAPEDGAQSTVQEAEE EEEGSVPETELLGDCDTLQVDEAQVQLEA" BASE COUNT 884 a 1229 c 1171 g 808 t ORIGIN 1 agccagccct tgccgtggcc ggagccgagc ggcgcatccg ggccggagaa gaggacgacg 61 acgaggtcct cgaagtggac ccgtttgcga agcgccaggg agaaggagga gcggacgcat 121 cgtagaaagg ggtggtggcg cccgaccccg cgccccggcc cgaagctctg agggcttccc 181 ggcccccact gcctgcggca tggcccgggg ctcggcgctc ccgcggcggc cgctgctgtg 241 catcccggcc gtctgggcgg ccgccgcgct tctgctctca gtgtcccgga cttcaggtga 301 agtggaggtt ctggatccga acgacccttt aggacccctt gatgggcagg acggcccgat 361 tccaactctg aaaggttact ttctgaattt tctggagcca gtaaacaata tcaccattgt 421 ccaaggccag acggcaattc tgcactgcaa ggtggcagga aacccacccc ctaacgtgcg 481 gtggctaaag aatgatgccc cggtggtgca ggagccgcgg cggatcatca tccggaagac 541 agaatatggt tcacgactgc gaatccagga cctggacacg acagacactg gctactacca 601 gtgcgtggcc accaacggga tgaagaccat taccgccact ggcgtcctgt ttgtgcggct 661 gggtccaacg cacagcccaa atcataactt tcaggatgat taccacgagg atgggttctg 721 ccagccttac cggggaattg cctgtgcacg cttcattggc aaccggacca tttatgtgga 781 ctcgcttcag atgcaggggg agattgaaaa ccgaatcaca gcggccttca ccatgatcgg 841 cacgtctacg cacctgtcgg accagtgctc acagttcgcc atcccatcct tctgccactt 901 cgtgtttcct ctgtgcgacg cgcgctcccg ggcacccaag ccgcgtgagc tgtgccgcga 961 cgagtgcgag gtgctggaga gcgacctgtg ccgccaggag tacaccatcg cccgctccaa 1021 cccgctcatc ctcatgcggc ttcagctgcc caagtgtgag gcgctgccca tgcctgagag 1081 ccccgacgct gccaactgca tgcgcattgg catcccagcc gagaggctgg gccgctacca 1141 tcagtgctat aacggctcag gcatggatta cagaggaacg gcaagcacca ccaagtcagg 1201 ccaccagtgc cagccgtggg ccctgcagca cccccacagc caccacctgt ccagcacaga 1261 cttccctgag cttggagggg ggcacgccta ctgccggaac cccggaggcc agatggaggg 1321 cccctggtgc tttacgcaga ataaaaacgt acgcatggaa ctgtgtgacg taccctcgtg 1381 tagtccccga gacagcagca agatggggat tctgtacatc ttggtcccca gcatcgcaat 1441 tccactggtc atcgcttgcc ttttcttctt ggtttgcatg tgccggaata agcagaaggc 1501 atctgcgtcc acaccgcagc ggcgacagct gatggcctcg cccagccaag acatggaaat 1561 gcccctcatt aaccagcaca aacaggccaa actcaaagag atcagcctgt ctgcggtgag 1621 gttcatggag gagctgggag aggaccggtt tgggaaagtc tacaaaggtc acctgttcgg 1681 ccctgccccg ggggagcaga cccaggctgt ggccatcaaa acgctgaagg acaaagcgga 1741 ggggcccctg cgggaggagt tccggcatga ggctatgctg cgagcacggc tgcaacaccc 1801 caacgtcgtc tgcctgctgg gcgtggtgac caaggaccag cccctgagca tgatcttcag 1861 ctactgttcg cacggcgacc tccacgaatt cctggtcatg cgctcgccgc actcggacgt 1921 gggcagcacc gatgatgacc gcacggtgaa gtccgccctg gagccccccg acttcgtgca 1981 ccttgtggca cagatcgcgg cggggatgga gtacctatcc agccaccacg tggttcacaa 2041 ggacctggcc acccgcaatg tgctagtgta cgacaagctg aacgtgaaga tctcagactt 2101 gggcctcttc cgagaggtgt atgccgccga ttactacaag ctgctgggga actcgctgct 2161 gcctatccgc tggatggccc cagaggccat catgtacggc aagttctcca tcgactcaga 2221 catctggtcc tacggtgtgg tcctgtggga ggtcttcagc tacggcctgc agccctactg 2281 cgggtactcc aaccaggatg tggtggagat gatccggaac cggcaggtgc tgccttgccc 2341 cgatgactgt cccgcctggg tgtatgccct catgatcgag tgctggaacg agttccccag 2401 ccggcggccc cgcttcaagg acatccacag ccggctccga gcctggggca acctttccaa 2461 ctacaacagc tcggcgcaga cctcgggggc cagcaacacc acgcagacca gctccctgag 2521 caccagccca gtgagcaatg tgagcaacgc ccgctacgtg gggcccaagc agaaggcccc 2581 gcccttccca cagccccagt tcatccccat gaagggccag atcagaccca tggtgccccc 2641 gccgcagctc tacgtccccg tcaacggcta ccagccggtg ccggcctatg gggcctacct 2701 gcccaacttc tacccggtgc agatcccaat gcagatggcc ccgcagcagg tgcctcctca 2761 gatggtcccc aagcccagct cacaccacag tggcagtggc tccaccagca caggctacgt 2821 caccacggcc ccctccaaca catccatggc agacagggca gccctgctct cagagggcgc 2881 tgatgacaca cagaacgccc cagaagatgg ggcccagagc accgtgcagg aagcagagga 2941 ggaggaggaa ggctctgtcc cagagactga gctgctgggg gactgtgaca ctctgcaggt 3001 ggacgaggcc caagtccagc tggaagcttg agtggcacca gggcccgggg ttcggggata 3061 gaagccccgc cgagacccca cagggacctc agtcaccttt gagaagacac catactcagc 3121 aatcacaaga gcccgccggc cagtgggctt gtttgcagac tgggtgaggt ggagccctgc 3181 tcctctctgt cctctgacac agagagctgc cctgcctagg agcacccaag ccaggcaggg 3241 ggtctggcag cacggcgtcc tggggagcag gacacatggt catccccagg gctgtataca 3301 ttgattctgg tggtagactg gtagtgagca gcaaatgcct ttcaagaaaa taggtggcag 3361 cttcactcca tgtcatatat ggagtgaata tttcaaaacg ttgggaataa gggcctgcaa 3421 aaggcagcga ggaggcacct cgggtcttga ggttcctgac aaccgatctg gtctgttggt 3481 ttgaggatga aggggctcca tttctgctgc ctccctgctg agaatattct ccctttagca 3541 gccaaagatt cgctggaacg gaggctgccc tctgctgcct gttggggtcg gaagacaagg 3601 ggcttctgaa atgggagttc ctgagataca acaaaatgtg tgccttcaaa gaaactgaca 3661 gctttgtatt tggtgaaatg gttttaatta tactccatgt gtattttgcc cacttttttt 3721 gggaattcaa gggaaagtgt ttcttgggtt tggaatgttc agaggaagca gtattgtaca 3781 gaacacggta ttgttatttt tgttaagaat catgtacaga gcttaaatgt aatttatatg 3841 tttttaatat gccattttca ttgaagtatt ttggtcttaa gatgacttta gtaatttaac 3901 tgtttatgtt acccacgttg ggatccagtt ggtcttggtt tgcttctctc tgtaccacgt 3961 gcacatgagg tccattcatt ttacagcccc tgttacacac agacccacag gcagccgtct 4021 gtgcccgcac acattgttgg tcctatttgt aaatcccaca cccggtgtat ccaataaagt 4081 gaaaccaacc cc // LOCUS HUMROSA 7375 bp mRNA PRI 09-JAN-1995 DEFINITION Human transmembrane tyrosine-specific protein kinase (ROS1) mRNA, complete cds. ACCESSION M34353 NID g337480 KEYWORDS c-myc proto-oncogene; transmembrane tyrosine-specific protein kinase. SOURCE Human glioblastoma cell line SW-1088, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7375) AUTHORS Birchmeier,C., O'Neill,K., Riggs,M. and Wigler,M. TITLE Characterization of ROS1 cDNA from a human glioblastoma cell line JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87 (12), 4799-4803 (1990) MEDLINE 90280463 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.H.Wigler, 15-MAY-1990. FEATURES Location/Qualifiers source 1..7375 /organism="Homo sapiens" /db_xref="taxon:9606" /map="6q21-q22" sig_peptide 207..314 /gene="ROS1" /note="transmembrane tyrosine-specific protein kinase signal peptide" CDS 207..7250 /gene="ROS1" /note="transmembrane tyrosine-specific protein kinase precursor" /codon_start=1 /db_xref="GDB:G00-120-351" /db_xref="PID:g337481" /translation="MKNIYCLIPKLVNFATLGCLWISVVQCTVLNSCLKSCVTNLGQQ LDLGTPHNLSEPCIQGCHFWNSVDQKNCALKCRESCEVGCSSAEGAYEEEVLENADLP TAPFASSIGSHNMTLRWKSANFSGVKYIIQWKYAQLLGSWTYTKTVSRPSYVVKPLHP FTEYIFRVVWIFTAQLQLYSPPSPSYRTHPHGVPETAPLIRNIESSSPDTVEVSWDPP QFPGGPILGYNLRLISKNQKLDAGTQRTSFQFYSTLPNTIYRFSIAAVNEVGEGPEAE SSITTSSSAVQQEEQWLFLSRKTSLRKRSLKHLVDEAHCLRLDAIYHNITGISVDVHQ QIVYFSEGTLIWAKKAANMSDVSDLRIFYRGSGLISSISIDWLYQRMYFIMDELVCVC DLENCSNIEEITPPSISAPQKIVADSYNGYVFYLLRDGIYRADLPVPSGRCAEAVRIV ESCTLKDFAIKPQAKRIIYFNDTAQVFMSTFLDGSASHLILPRIPFADVKSFACENND FLVTDGKVIFQQDALSFNEFIVGCDLSHIEEFGFGNLVIFGSSSQLHPLPGRPQELSV LFGSHQALVQWKPPALAIGANVILISDIIELFELGPSAWQNWTYEVKVSTQDPPEVTH IFLNISGTMLNVPELQSAMKYKVSVRASSPKRPGPWSEPSVGTTLVPASEPPFIMAVK EDGLWSKPLNSFGPGEFLSSDIGNVSDMDWYNNSLYYSDTKGDVFVWLLNGTDISENY HLPSIAGAGALAFEWLGHFLYWAGKTYVIQRQSVLTGHTDIVTHVKLLVNDMVVDSVG GYLYWTTLYSVESTRLNGESSLVLQTQPWFSGKKVIALTLDLSDGLLYWLVQDSQCIH LYTAVLRGQSTGDTTITEFAAWSTSEISQNALMYYSGRLFWINGFRIITTQEIGQKTS VSVLEPARFNQFTIIQTSLKPLPGNFSFTPKVIPDSVQESSFRIEGNASSFQILWNGP PAVDWGVVFYSVEFSAHSKFLASEQHSLPVFTVEGLEPYALFNLSVTPYTYWGKGPKT SLSLRAPETVPSAPENPRIFILPSGKCCNKNEVVVEFRWNKPKHENGVLTKFEIFYNI SNQSITNKTCEDWIAVNVTPSVMSFQLEGMSPRCFIAFQVRAFTSKGPGPYADVVKST TSEINPFPHLITLLGNKIVFLDMDQNQVVWTFSAERVISAVCYTADNEMGYYAEGDSL FLLHLHNRSSSELFQDSLVFDITVITIDWISRHLYFALKESQNGMQVFDVDLEHKVKY PREVKIHNRNSTIISFSVYPLLSRLYWTEVSNFGYQMFYYSIISHTLHRILQPTATNQ QNKRNQCSCNVTEFELSGAMAIDTSNLEKPLIYFAKAQEIWAMDLEGCQCWRVITVPA MLAGKTLVSLTVDGDLIYWIITAKDSTQIYQAKKGNGAIVSQVKALRSRHILAYSSVM QPFPDKAFLSLASDTVEPTILNATNTSLTIRLPLAKTNLTWYGITSPTPTYLVYYAEV NDRKNSSDLKYRILEFQDSIALIEDLQPFSTYMIQIAVKNYYSDPLEHLPPGKEIWGK TKNGVPEAVQLINTTVRSDTSLIISWRESHKPNGPKESVRYQLAISHLALIPETPLRQ SEFPNGRLTLLVTRLSGGNIYVLKVLACHSEEMWCTESHPVTVEMFNTPEKPYSLVPE NTSLQFNWKAPLNVNLIRFWVELQKWKYNEFYHVKTSCSQGPAYVCNITNLQPYTSYN VRVVVVYKTGENSTSLPESFKTKAGVPNKPGIPKLLEGSKNSIQWEKAEDNGCRITYY ILEIRKSTSNNLQNQNLRWKMTFNGSCSSVCTWKSKNLKGIFQFRVVAANNLGFGEYS GISENIILVGDDFWIPETSFILTIIVGIFLVVTIPLTFVWHRRLKNQKSAKEGVTVLI NEDKELAELRGLAAGVGLANACYAIHTLPTQEEIENLPAFPREKLTLRLLLGSGAFGE VYEGTAVDILGVGSGEIKVAVKTLKKGSTDQEKIEFLKEAHLMSKFNHPNILKQLGVC LLNEPQYIILELMEGGDLLTYLRKARMATFYGPLLTLVDLVDLCVDISKGCVYLERMH FIHRDLAARNCLVSVKDYTSPRIVKIGDFGLARDIYKNDYYRKRGEGLLPVRWMAPES LMDGIFTTQSDVWSFGILIWEILTLGHQPYPAHSNLDVLNYVQTGGRLEPPRNCPDDL WNLMTQCWAQEPDQRPTFHRIQNQLQLFRNFFLNSIYQCRDEANNSGVINESFEGEDG DVICLNSDDIMPVVLMETKNREGLNYMVLATECGQGEEKSEGPLGSQESESCGLRKEE KEPHADKDFCQEKQVAYCPSGKPEGLNYACLTHSGYGDGSD" gene 207..7250 /gene="ROS1" mat_peptide 208..7247 /gene="ROS1" /note="transmembrane tyrosine-specific protein kinase" BASE COUNT 2203 a 1496 c 1605 g 2071 t ORIGIN 1 ccgcattcaa gctttcaagc attcaaaggt ctaaatgaaa aaggctaagt attatttcaa 61 aaggcaagta tatcctaata tagcaaaaca aacaaagcaa aatccatcag ctactcctcc 121 aattgaagtg atgaagccca aataattcat atagcaaaat ggagaaaatt agaccggcca 181 tctaaaaatc tgccattggt gaagtgatga agaacattta ctgtcttatt ccgaagcttg 241 tcaattttgc aactcttggc tgcctatgga tttctgtggt gcagtgtaca gttttaaata 301 gctgcctaaa gtcgtgtgta actaatctgg gccagcagct tgaccttggc acaccacata 361 atctgagtga accgtgtatc caaggatgtc acttttggaa ctctgtagat cagaaaaact 421 gtgctttaaa gtgtcgggag tcgtgtgagg ttggctgtag cagcgcggaa ggtgcatatg 481 aagaggaagt actggaaaat gcagacctac caactgctcc ctttgcttct tccattggaa 541 gccacaatat gacattacga tggaaatctg caaacttctc tggagtaaaa tacatcattc 601 agtggaaata tgcacaactt ctgggaagct ggacttatac taagactgtg tccagaccgt 661 cctatgtggt caagcccctg caccccttca ctgagtacat tttccgagtg gtttggatct 721 tcacagcgca gctgcagctc tactcccctc caagtcccag ttacaggact catcctcatg 781 gagttcctga aactgcacct ttgattagga atattgagag ctcaagtccc gacactgtgg 841 aagtcagctg ggatccacct caattcccag gtggacctat tttgggttat aacttaaggc 901 tgatcagcaa aaatcaaaaa ttagatgcag ggacacagag aaccagtttc cagttttact 961 ccactttacc aaatactatc tacaggtttt ctattgcagc agtaaatgaa gttggtgagg 1021 gtccagaagc agaatctagt attaccactt catcttcagc agttcaacaa gaggaacagt 1081 ggctcttttt atccagaaaa acttctctaa gaaagagatc tttaaaacat ttagtagatg 1141 aagcacattg ccttcggttg gatgctatat accataatat tacaggaata tctgttgatg 1201 tccaccagca aattgtttat ttctctgaag gaactctcat atgggcgaag aaggctgcca 1261 acatgtctga tgtatctgac ctgagaattt tttacagagg ttcaggatta atttcttcta 1321 tctccataga ttggctttat caaagaatgt atttcatcat ggatgaactg gtatgtgtct 1381 gtgatttaga gaactgctca aacatcgagg aaattactcc accctctatt agtgcacctc 1441 aaaaaattgt ggctgattca tacaatgggt atgtctttta cctcctgaga gatggcattt 1501 atagagcaga ccttcctgta ccatctggcc ggtgtgcaga agctgtgcgt attgtggaga 1561 gttgcacgtt aaaggacttt gcaatcaagc cacaagccaa gcgaatcatt tacttcaatg 1621 acactgccca agtcttcatg tcaacatttc tggatggctc tgcttcccat ctcatcctac 1681 ctcgcatccc ctttgctgat gtgaaaagtt ttgcttgtga aaacaatgac tttcttgtca 1741 cagatggcaa ggtcattttc caacaggatg ctttgtcttt taatgaattc atcgtgggat 1801 gtgacctgag tcacatagaa gaatttgggt ttggtaactt ggtcatcttt ggctcatcct 1861 cccagctgca ccctctgcca ggccgcccgc aggagctttc ggtgctgttt ggctctcacc 1921 aggctcttgt tcaatggaag cctcctgccc ttgccatagg agccaatgtc atcctgatca 1981 gtgatattat tgaactcttt gaattaggcc cttctgcctg gcagaactgg acctatgagg 2041 tgaaagtatc cacccaagac cctcctgaag tcactcatat tttcttgaac ataagtggaa 2101 ccatgctgaa tgtacctgag ctgcagagtg ctatgaaata caaggtttct gtgagagcaa 2161 gttctccaaa gaggccaggc ccctggtcag agccctcagt gggtactacc ctggtgccag 2221 ctagtgaacc accatttatc atggctgtga aagaagatgg gctttggagt aaaccattaa 2281 atagctttgg cccaggagag ttcttatcct ctgatatagg aaatgtgtca gacatggatt 2341 ggtataacaa cagcctctac tacagtgaca cgaaaggcga cgtttttgtg tggctgctga 2401 atgggacgga tatctcagag aattatcacc tacccagcat tgcaggagca ggggctttag 2461 cttttgagtg gctgggtcac tttctctact gggctggaaa gacatatgtg atacaaaggc 2521 agtctgtgtt gacgggacac acagacattg ttacccacgt gaagctattg gtgaatgaca 2581 tggtggtgga ttcagttggt ggatatctct actggaccac actctattca gtggaaagca 2641 ccagactaaa tggggaaagt tcccttgtac tacagacaca gccttggttt tctgggaaaa 2701 aggtaattgc tctaacttta gacctcagtg atgggctcct gtattggttg gttcaagaca 2761 gtcaatgtat tcacctgtac acagctgttc ttcggggaca gagcactggg gataccacca 2821 tcacagaatt tgcagcctgg agtacttctg aaatttccca gaatgcactg atgtactata 2881 gtggtcggct gttctggatc aatggcttta ggattatcac aactcaagaa ataggtcaga 2941 aaaccagtgt ctctgttttg gaaccagcca gatttaatca gttcacaatt attcagacat 3001 cccttaagcc cctgccaggg aacttttcct ttacccctaa ggttattcca gattctgttc 3061 aagagtcttc atttaggatt gaaggaaatg cttcaagttt tcaaatcctg tggaatggtc 3121 cccctgcggt agactggggt gtagttttct acagtgtaga atttagtgct cattctaagt 3181 tcttggctag tgaacaacac tctttacctg tatttactgt ggaaggactg gaaccttatg 3241 ccttatttaa tctttctgtc actccttata cctactgggg aaagggcccc aaaacatctc 3301 tgtcacttcg agcacctgaa acagttccat cagcaccaga gaaccccaga atatttatat 3361 taccaagtgg aaaatgctgc aacaagaatg aagttgtggt ggaatttagg tggaacaaac 3421 ctaagcatga aaatggggtg ttaacaaaat ttgaaatttt ctacaatata tccaatcaaa 3481 gtattacaaa caaaacatgt gaagactgga ttgctgtcaa tgtcactccc tcagtgatgt 3541 cttttcaact tgaaggcatg agtcccagat gctttattgc cttccaggtt agggccttta 3601 catctaaggg gccaggacca tatgctgacg ttgtaaagtc tacaacatca gaaatcaacc 3661 catttcctca cctcataact cttcttggta acaagatagt ttttttagat atggatcaaa 3721 atcaagttgt gtggacgttt tcagcagaaa gagttatcag tgccgtttgc tacacagctg 3781 ataatgagat gggatattat gctgaagggg actcactctt tcttctgcac ttgcacaatc 3841 gctctagctc tgagcttttc caagattcac tggtttttga tatcacagtt attacaattg 3901 actggatttc aaggcacctc tactttgcac tgaaagaatc acaaaatgga atgcaagtat 3961 ttgatgttga tcttgaacac aaggtgaaat atcccagaga ggtgaagatt cacaatagga 4021 attcaacaat aatttctttt tctgtatatc ctcttttaag tcgcttgtat tggacagaag 4081 tttccaattt tggctaccag atgttctact acagtattat cagtcacacc ttgcaccgaa 4141 ttctgcaacc cacagctaca aaccaacaaa acaaaaggaa tcaatgttct tgtaatgtga 4201 ctgaatttga gttaagtgga gcaatggcta ttgatacctc taacctagag aaaccattga 4261 tatactttgc caaagcacaa gagatctggg caatggatct ggaaggctgt cagtgttgga 4321 gagttatcac agtacctgct atgctcgcag gaaaaaccct tgttagctta actgtggatg 4381 gagatcttat atactggatc atcacagcaa aggacagcac acagatttat caggcaaaga 4441 aaggaaatgg ggccatcgtt tcccaggtga aggccctaag gagtaggcat atcttggctt 4501 acagttcagt tatgcagcct tttccagata aagcgtttct gtctctagct tcagacactg 4561 tggaaccaac tatacttaat gccactaaca ctagcctcac aatcagatta cctctggcca 4621 agacaaacct cacatggtat ggcatcacca gccctactcc aacatacctg gtttattatg 4681 cagaagttaa tgacaggaaa aacagctctg acttgaaata tagaattctg gaatttcagg 4741 acagtatagc tcttattgaa gatttacaac cattttcaac atacatgata cagatagctg 4801 taaaaaatta ttattcagat cctttggaac atttaccacc aggaaaagag atttggggaa 4861 aaactaaaaa tggagtacca gaggcagtgc agctcattaa tacaactgtg cggtcagaca 4921 ccagcctcat tatatcttgg agagaatctc acaagccaaa tggacctaaa gaatcagtcc 4981 gttatcagtt ggcaatctca cacctggccc taattcctga aactcctcta agacaaagtg 5041 aatttccaaa tggaaggctc actctccttg ttactagact gtctggtgga aatatttatg 5101 tgttaaaggt tcttgcctgc cactctgagg aaatgtggtg tacagagagt catcctgtca 5161 ctgtggaaat gtttaacaca ccagagaaac cttattcctt ggttccagag aacactagtt 5221 tgcaatttaa ttggaaggct ccattgaatg ttaacctcat cagattttgg gttgagctac 5281 agaagtggaa atacaatgag ttttaccatg ttaaaacttc atgcagccaa ggtcctgctt 5341 atgtctgtaa tatcacaaat ctacaacctt atacttcata taatgtcaga gtagtggtgg 5401 tttataagac gggagaaaat agcacctcac ttccagaaag ctttaagaca aaagctggag 5461 tcccaaataa accaggcatt cccaaattac tagaagggag taaaaattca atacagtggg 5521 agaaagctga agataatgga tgtagaatta catactatat ccttgagata agaaagagca 5581 cttcaaataa tttacagaac cagaatttaa ggtggaagat gacatttaat ggatcctgca 5641 gtagtgtttg cacatggaag tccaaaaacc tgaaaggaat atttcagttc agagtagtag 5701 ctgcaaataa tctagggttt ggtgaatata gtggaatcag tgagaatatt atattagttg 5761 gagatgattt ttggatacca gaaacaagtt tcatacttac tattatagtt ggaatatttc 5821 tggttgttac aatcccactg acctttgtct ggcatagaag attaaagaat caaaaaagtg 5881 ccaaggaagg ggtgacagtg cttataaacg aagacaaaga gttggctgag ctgcgaggtc 5941 tggcagccgg agtaggcctg gctaatgcct gctatgcaat acatactctt ccaacccaag 6001 aggagattga aaatcttcct gccttccctc gggaaaaact gactctgcgt ctcttgctgg 6061 gaagtggagc ctttggagaa gtgtatgaag gaacagcagt ggacatctta ggagttggaa 6121 gtggagaaat caaagtagca gtgaagactt tgaagaaggg ttccacagac caggagaaga 6181 ttgaattcct gaaggaggca catctgatga gcaaatttaa tcatcccaac attctgaagc 6241 agcttggagt ttgtctgctg aatgaacccc aatacattat cctggaactg atggagggag 6301 gagaccttct tacttatttg cgtaaagccc ggatggcaac gttttatggt cctttactca 6361 ccttggttga ccttgtagac ctgtgtgtag atatttcaaa aggctgtgtc tacttggaac 6421 ggatgcattt cattcacagg gatctggcag ctcgaaattg ccttgtttcc gtgaaagact 6481 ataccagtcc acggatagtg aagattggag actttggact cgccagagac atctataaaa 6541 atgattacta tagaaagaga ggggaaggcc tgctcccagt tcggtggatg gctccagaaa 6601 gtttgatgga tggaatcttc actactcaat ctgatgtatg gtcttttgga attctgattt 6661 gggagatttt aactcttggt catcagcctt atccagctca ttccaacctt gatgtgttaa 6721 actatgtgca aacaggaggg agactggagc caccaagaaa ttgtcctgat gatctgtgga 6781 atttaatgac ccagtgctgg gctcaagaac ccgaccaaag acctactttt catagaattc 6841 agaaccaact tcagttattc agaaattttt tcttaaatag catttatcag tgcagagatg 6901 aagcaaacaa cagtggagtc ataaatgaaa gctttgaagg tgaagatggc gatgtgattt 6961 gtttgaattc agatgacatt atgccagttg ttttaatgga aacgaagaac cgagaagggt 7021 taaactatat ggtacttgct acagaatgtg gccaaggtga agaaaagtct gagggtcctc 7081 taggctccca ggaatctgaa tcttgtggtc tgaggaaaga agagaaggaa ccacatgcag 7141 acaaagattt ctgccaagaa aaacaagtgg cttactgccc ttctggcaag cctgaaggcc 7201 tgaactatgc ctgtctcact cacagtggat atggagatgg gtctgattaa tagcgttgtt 7261 tgggaaatag agagttgaga taaacactct cattcagtag ttactgaaag aaaactctgc 7321 tagaatgata aatgtcatgg tggtctataa ctccaaataa acaatgcaac gttcc // LOCUS HUMRP10A 635 bp mRNA PRI 05-NOV-1993 DEFINITION Human ribosomal protein L10 mRNA, complete cds. ACCESSION L25899 NID g414586 KEYWORDS ribosomal protein L10. SOURCE Homo sapiens male adult neuroblastoma cell line SMS-SMN cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 635) AUTHORS Herzog,H. TITLE cDNA encoding the human homologue of yeast ribosomal protein YL10 JOURNAL Unpublished (1992) FEATURES Location/Qualifiers source 1..635 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /sex="male" /tissue_type="neuroblastoma cell line SMS-SMN" CDS 17..634 /codon_start=1 /product="ribosomal protein L10" /db_xref="PID:g414587" /translation="MGAYKYIQELWRKKQSDVMRFLLRVRCWQYRQLSALHRAPRPTR PDKARRLGYKAKQGYVIYRIRVRRGGRKRPVPKGATYGKPVHHGVNQLKFARSLQSVA EERAGRHCGALRVLNSYWVGEDSTYKAFEVILIDPFHKAIRRNPDTQWITKPDHKHRE MRGLTSAGRKSRGLGKGHKFHHTIGGSRRAAWRRRNTPAPPLPLI" BASE COUNT 158 a 173 c 169 g 135 t ORIGIN 1 catcaggtaa gccaagatgg gtgcatacaa gtacatccag gagctatgga gaaagaagca 61 gtctgatgtc atgcgctttc ttctgagggt ccgctgctgg cagtaccgcc agctctctgc 121 tctccacagg gctccccgcc ccacccggcc tgataaagcg cgccgactgg gctacaaggc 181 caagcaaggt tacgttatat ataggattcg tgttcgccgt ggtggccgaa aacgcccagt 241 tcctaagggt gcaacttacg gcaagcctgt ccatcatggt gttaaccagc taaagtttgc 301 tcgaagcctt cagtccgttg cagaggagcg agctggacgc cactgtgggg ctctgagagt 361 cctgaattct tactgggttg gtgaagattc cacatacaaa gcttttgagg ttatcctcat 421 tgatccattc cataaagcta tcagaagaaa tcctgacacc cagtggatca ccaaaccaga 481 ccacaagcac agggagatgc gtgggctgac atctgcaggc cgaaagagcc gtggccttgg 541 aaagggccac aagttccacc acactattgg tggctctcgc cgggcagctt ggagaaggcg 601 caatactcca gctccaccgt taccgctaat ataag // LOCUS HUMRPA70KD 2393 bp mRNA PRI 02-SEP-1992 DEFINITION Human replication protein A 70kDa subunit mRNA complete cds. ACCESSION M63488 NID g337488 KEYWORDS replication protein A. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2393) AUTHORS Erdile,L.F., Heyer,W.-D., Kolodner,R. and Kelly,T.J. TITLE Characterization of a cDNA encoding the 70-kDa single-stranded DNA-binding subunit of human replication protein A and the role of the protein in DNA replication JOURNAL J. Biol. Chem. 266, 12090-12098 (1991) MEDLINE 91268092 FEATURES Location/Qualifiers source 1..2393 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 70..1920 /codon_start=1 /product="replication protein A, 70-kDa subunit" /db_xref="PID:g337489" /translation="MVGQLSEGAIAAIMQKGDTNIKPILQVINIRPITTGNSPPRYRL LMSDGLNTLSSFMLATQLNPLVEEEQLSSNCVCQIHRFIVNTLKDGRRVVILMELEVL KSAEAVGVKIGNPVPYNEGLGQPQVAPPAPAASPAASSRPQPQNGSSGMGSTVSKAYG ASKTFGKAAGPSLSHTSGGTQSKVVPIASLTPYQSKWTICARVTNKSQIRTWSNSRGE GKLFSLELVDESGEIRATAFNEQVDKFFPLIEVNKVYYFSKGTLKIANKQFTAVKNDY EMTFNNETSVMPCEDDHHLPTVQFDFTGIDDLENKSKDSLVDIIGICKSYEDATKITV RSNNREVAKRNIYLMDTSGKVVTATLWGEDADKFDGSRQPVLAIKGARVSDFGGRSLS VLSSSTIIANPDIPEAYKLRGWFDAEGQALDGVSISDLKSGGVGGSNTNWKTLYEVKS ENLGQGDKPDYFSSVATVVYLRKENCMYQACPTQDCNKKVIDQQNGLYRCEKCDTEFP NFKYRMILSVNIADFQENQWVTCFQESAEAILGQNAAYLGELKDKNEQAFEEVFQNAN FRSFIFRVRVKVETYNDESRIKATVMDVKPVDYREYGRRLVMSIRRSALM" BASE COUNT 640 a 557 c 635 g 561 t ORIGIN 1 cggcgcggga cccgggtggg gaagctggag ctgttgcggg gtccgcgggg aagtcttggc 61 ggtggagcca tggtcggcca gctgagcgag ggggccattg cggccatcat gcagaagggg 121 gatacaaaca taaagcccat cctccaagtc atcaacatcc gtcccattac tacggggaat 181 agtccgccgc gttatcgact gctcatgagt gatggattga acactctatc ctctttcatg 241 ttggcgacac agttgaaccc tctcgtggag gaagaacaat tgtccagcaa ctgtgtatgc 301 cagattcaca gatttattgt gaacactctg aaagacggaa ggagagtagt tatcttgatg 361 gaattagaag ttttgaagtc agctgaagca gttggagtga agattggcaa tccagtgccc 421 tataatgaag gactcgggca gccgcaagta gctcctccag cgccagcagc cagcccagca 481 gcaagcagca ggccccagcc gcagaatgga agctcgggaa tgggttctac tgtttctaag 541 gcttatggtg cttcaaagac atttggaaaa gctgcaggtc ccagcctgtc acacacttct 601 gggggaacac agtccaaagt ggtgcccatt gccagcctca ctccttacca gtccaagtgg 661 accatttgtg ctcgtgttac caacaaaagt cagatccgta cctggagcaa ctcccgaggg 721 gaagggaagc ttttctccct agaactggtt gacgaaagtg gtgaaatccg agctacagct 781 ttcaatgagc aagtggacaa gttctttcct cttattgaag tgaacaaggt gtattatttc 841 tcgaaaggca ccctgaagat tgctaacaag cagttcacag ctgttaaaaa tgactacgag 901 atgaccttca ataacgagac ttccgtcatg ccctgtgagg acgaccatca tttacctacg 961 gttcagtttg atttcacggg gattgatgac ctcgagaaca agtcgaaaga ctcacttgta 1021 gacatcatcg ggatctgcaa gagctatgaa gacgccacta aaatcacagt gaggtctaac 1081 aacagagaag ttgccaagag gaatatctac ttgatggaca catccgggaa ggtggtgact 1141 gctacactgt ggggggaaga tgctgataaa tttgatggtt ctagacagcc cgtgttggct 1201 atcaaaggag cccgagtctc tgatttcggt ggacggagcc tctccgtgct gtcttcaagc 1261 actatcattg cgaatcctga catcccagag gcctataagc ttcgtggatg gtttgacgca 1321 gaaggacaag ccttagatgg tgtttccatc tctgatctaa agagcggcgg agtcggaggg 1381 agtaacacca actggaaaac cttgtatgag gtcaaatccg agaacctggg ccaaggcgac 1441 aagccggact actttagttc tgtggccaca gtggtgtatc ttcgcaaaga gaactgcatg 1501 taccaagcct gcccgactca ggactgcaat aagaaagtga ttgatcaaca gaatggattg 1561 taccgctgtg agaagtgcga caccgaattt cccaatttca agtaccgcat gatcctgtca 1621 gtaaatattg cagattttca agagaatcag tgggtgactt gtttccagga gtctgctgaa 1681 gctatccttg gacaaaatgc tgcttatctt ggggaattaa aagacaagaa tgaacaggca 1741 tttgaagaag ttttccagaa tgccaacttc cgatctttca tattcagagt cagggtcaaa 1801 gtggagacct acaacgacga gtctcgaatt aaggccactg tgatggacgt gaagcccgtg 1861 gactacagag agtatggccg aaggctggtc atgagcatca ggagaagtgc attgatgtga 1921 gaggagcagt gccaatcggg cagaagtttg caaataggca gaatggaatc gatttcctcc 1981 cacctccgtg tgacgatccc atgttagcta cacagtgcag aggctcttga tggtggacta 2041 agcaattcct ccctcgtgcg catctcagaa cccatcggta ggcaaaggaa aatacgctca 2101 ggtggttgtg gtgtagactg tgtcaggcct acggagtcag ccagtggcta gcgcaagacc 2161 agtcactccc tctgccttca ggcttctgtc aatttcatta tcatcaagca ggaattatgt 2221 cgtaagtcac tgaccctaac tgcagaccat gaagtaaatt atgtaactag gtttttgctt 2281 ctccagtggt gaccaccccc ccccatcccc gctcacaact tgggttcttc tcagcggggc 2341 gagctgagaa gcggtcatga gcacctgggg attttagtaa gtgtgtcttc cta // LOCUS HUMRPIA 608 bp mRNA PRI 05-MAR-1996 DEFINITION Homo sapiens (clone mf.18) RNA polymerase II mRNA, complete cds. ACCESSION L37127 NID g1220357 KEYWORDS RNA polymerase II. SOURCE Homo sapiens (clone: mf.18) colon cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 608) AUTHORS Fanciulli,M., Bruno,T., Cerboni,C., Del Carlo,C., Frati,L., Piccoli,M., Floridi,A., Santoni,A. and Punturieri,A. TITLE Cloning of the human homologue of yeast RPB11 RNA polymerase II subunit JOURNAL Unpublished (1994) FEATURES Location/Qualifiers source 1..608 /organism="Homo sapiens" /note="(vector unizaphage)" /db_xref="taxon:9606" /clone="mf.18" /cell_line="HT29" /tissue_type="colon" mRNA 1..608 CDS 62..415 /codon_start=1 /product="RNA polymerase II" /db_xref="PID:g1220358" /translation="MNAPPAFESFLLFEGEKKITINKDTKVPNACLFTINKEDHTLGN IIKSQLLKDPQVLFAGYKVPHPLEHKIIIRVQTTPDYSPQEAFTNAITDLISELSLLE ERFRVAIKDKQEGIE" polyA_signal 587..592 BASE COUNT 149 a 186 c 159 g 113 t 1 others ORIGIN 1 tgaattcgtg ggtggcggcg gcggcggacc cttggggtct ggacgcgacg gcggcgggac 61 gatgaacgcc cctccagcct tcgagtcgtt cttgctcttc gagggcgaga agaagatcac 121 cattaacaag gacaccaagg tacccaatgc ctgtttattc accatcaaca aagaagacca 181 cacactggga aacatcatta aatcacaact cctaaaagac ccgcaagtgc tatttgctgg 241 ctacaaagtc ccccacccct tggagcacaa gatcatcatc cgagtgcaga ccacgccgga 301 ctacagcccc caggaagcct ttaccaacgc catcaccgac ctcatcagtg agctgtccct 361 gctggaggag cgctttcggg tggccataaa agacaagcag gaaggaattg agtaggggcc 421 agagggggct ctgctcggcc tgtgagcccc gttcctacct gtgcctgacc ctccgctcca 481 ggtaccacac cgaggagagc ggccggtccc agccatggcc cgccttgtgg ccacccctca 541 ccctgacacc gacgtgtcct gtacatagat taggttttat attcctaata aagtatagcg 601 gaagagan // LOCUS HUMRPIE 444 bp mRNA PRI 31-MAY-1995 DEFINITION Homo sapiens RNA polymerase II elongation factor SIII, p15 subunit mRNA, complete cds. ACCESSION L34587 NID g551605 KEYWORDS RNA polymerase II elongation factor, p15 subunit. SOURCE Homo sapiens (library: lambda zap) peripheral blood cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 444) AUTHORS Bradsher,J.N., Jackson,K.W., Conaway,R.C. and Conaway,J.W. TITLE RNA polymerase II transcription factor SIII. I. Identification, purification, and properties JOURNAL J. Biol. Chem. 268 (34), 25587-25593 (1993) MEDLINE 94064628 REFERENCE 2 (bases 1 to 444) AUTHORS Garrett,K.P., Haque,D., Conaway,R.C. and Conaway,J.W. TITLE A human cDNA encoding the small subunit of RNA polymerase II transcription factor SIII JOURNAL Gene 150 (2), 413-414 (1994) MEDLINE 95121944 FEATURES Location/Qualifiers source 1..444 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lymphocyte" /tissue_type="peripheral blood" /tissue_lib="lambda zap" 5'UTR 1..87 CDS 88..426 /note="putative" /citation=[2] /codon_start=1 /function="transcription elongation factor" /product="RNA polymerase II elongation factor SIII, p15 subunit" /db_xref="PID:g551606" /translation="MDGEEKTYGGCEGPDAMYVKLISSDGHEFIVKREHALTSGTIKA MLSGPGQFAENETNEVNFREIPSHVLSKVCMYFTYKVRYTNSSTEIPEFPIAPEIALE LLMAANFLDC" 3'UTR 428..444 BASE COUNT 150 a 79 c 103 g 112 t ORIGIN 1 agatttggca cgagtcggca cgaggcggga ctgacgagaa actactaaag ttcctgggga 61 agcaaagtag aatttcataa gaacaaaatg gatggagagg agaaaaccta tggtggctgt 121 gaaggacctg atgccatgta tgtcaaattg atatcatctg atggccatga atttattgta 181 aaaagagaac atgcattaac atcaggcacg ataaaagcca tgttgagtgg cccaggtcag 241 tttgctgaga acgaaaccaa tgaggtcaat tttagagaga taccttcaca tgtgctatcg 301 aaagtatgca tgtattttac gtacaaggtt cgctacacta acagctccac cgagattcct 361 gaattcccaa ttgcacctga aattgcactg gaactgctga tggctgcgaa cttcttagat 421 tgttaaataa aataaattat aata // LOCUS HUMRPIT 357 bp mRNA PRI 19-SEP-1995 DEFINITION Homo sapiens RNA polymerase II transcription factor SIII p18 subunit mRNA, complete cds. ACCESSION L42856 NID g992914 KEYWORDS RNA polymerase II transcription factor SIII; elongin B. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 357) AUTHORS Garrett,K.P., Aso,T., Bradsher,J.N., Foundling,S.I., Lane,W.S., Conaway,R.C. and Conaway,J.W. TITLE Positive regulation of general transcription factor SIII by a tailed ubiquitin homolog JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (16), 7172-7176 (1995) MEDLINE 95365330 FEATURES Location/Qualifiers source 1..357 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..>357 CDS 1..357 /standard_name="elongin B" /note="putative" /codon_start=1 /function="UbH (Ubiquitin homology) protein, regulatory subunit" /product="RNA polymerase II transcription factor SIII p18 subunit" /db_xref="PID:g992915" /translation="MDVFLMIRRHKTTIFTDAKESSTVFELKRIVEGILKRPPDEQRL YKDDQLLDDGKTLGECGFTSQTARPQAPATVGLAFRADDTFEALCIEPFSSPPELPDV MKPQDSGSSANEQAVQ" BASE COUNT 82 a 110 c 106 g 59 t ORIGIN 1 atggacgtgt tcctcatgat ccggcgccac aagaccacca tcttcacgga cgccaaggag 61 tccagcacgg tgttcgaact gaagcgcatc gtcgagggca tcctcaagcg gcctcctgac 121 gagcagcggc tgtacaagga tgaccaactc ttggatgatg gcaagacact gggcgagtgt 181 ggcttcacca gtcaaacagc acggccacag gccccagcca cagtggggct ggccttccgg 241 gcagatgaca cctttgaggc cctgtgcatc gagccgtttt ccagcccgcc agagctgccc 301 gatgtgatga agccccagga ctcgggaagc agtgccaatg aacaagccgt gcagtga // LOCUS HUMRPL18A 630 bp mRNA PRI 18-FEB-1994 DEFINITION Homo sapiens ribosomal protein L18 (RPL18) mRNA, complete cds. ACCESSION L11566 NID g337492 KEYWORDS ribosomal protein L18. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 630) AUTHORS Puder,M., Barnard,G.F., Staniunas,R.J., Steele,G.D. and Chen,L.B. TITLE Nucleotide and deduced amino acid sequence of human ribosomal protein L18 JOURNAL Biochim. Biophys. Acta 1216 (1), 134-136 (1993) MEDLINE 94032474 FEATURES Location/Qualifiers source 1..630 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="pcDNAII-NC" /tissue_type="colon" gene 16..582 /gene="RPL18" CDS 16..582 /gene="RPL18" /codon_start=1 /product="ribosomal protein L18" /db_xref="PID:g337493" /translation="MGVDIRHNKDRKVRRKEPKSQDIYLRLLVKLYRFLARRTNSTFN QVVLKRLFMSRTNRPPLSLSRMIRKMKLPGRENKTAVVVGTITDDVRVQEVPKLKVCA LRVTSRARSRILRAGGKILTFDQLALDSPKGCGTVLLSGPRKGREVYRHFGKAPGTPH SHTKPYVRSKGRKFERARGRRASRGYKN" polyA_site 623..>630 BASE COUNT 153 a 182 c 180 g 115 t ORIGIN 1 gcaggaggcg ccatcatggg agtggacatc cgccataaca aggaccgaaa ggttcggcgc 61 aaggagccca agagccagga tatctacctg aggctgttgg tcaagttata caggtttctg 121 gccagaagaa ccaactccac attcaaccag gttgtgttga agaggttgtt tatgagtcgc 181 accaaccggc cgcctctgtc cctttcccgg atgatccgga agatgaagct tcctggccgg 241 gaaaacaaga cggccgtggt tgtggggacc ataactgatg atgtgcgggt tcaggaggta 301 cccaaactga aggtatgtgc actgcgcgtg accagccggg cccgcagccg catcctcagg 361 gcagggggca agatcctcac tttcgaccag ctggccctgg actcccctaa gggctgtggc 421 actgtcctgc tctccggtcc tcgcaagggc cgagaggtgt accggcattt cggcaaggcc 481 ccaggaaccc cgcacagcca caccaaaccc tacgtccgct ccaagggccg gaagttcgag 541 cgtgccagag gccgacgggc cagccgaggc tacaaaaact aaccctggat cctactctct 601 tattaaaaag atttttgctg acaaaaaaaa // LOCUS HUMRPL27 476 bp mRNA PRI 18-JUL-1994 DEFINITION Homo sapiens ribosomal protein L27 (RPL27) mRNA, complete cds. ACCESSION L19527 NID g388768 KEYWORDS ribosomal protein L27. SOURCE Homo sapiens (library: lambda gt10/pBluescript-D69) 12-22 week gestation kidney cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 476) AUTHORS Gallagher,R.A., McClean,P.M. and Malik,A.N. TITLE Cloning and nucleotide sequence of a full length cDNA encoding ribosomal protein L27 from human fetal kidney JOURNAL Biochim. Biophys. Acta 1217 (3), 329-332 (1994) MEDLINE 94198298 FEATURES Location/Qualifiers source 1..476 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="12-22 week gestation" /tissue_type="kidney" /tissue_lib="lambda gt10/pBluescript-D69" gene 18..455 /gene="RPL27" CDS 18..428 /gene="RPL27" /note="100% homology to rat RPL27 and chicken RPL27; 1.0kb mRNA transcript also detected in human fetal liver, human fetal lung, human fetal brain, human fetal heart, human fetal and adult muscle, mouse kidney and mouse liver, and Wilms' tumours" /codon_start=1 /product="ribosomal protein L27" /db_xref="PID:g388769" /translation="MGKFMKPGKVVLVLAGRYSGRKAVIVKNIDDGTSDRPYSHALVA GIDRYPRKVTAAMGKKKIAKRSKIKSFVKVYNYNHLMPTRYSVDIPLDKTVVNKDVFR DPALKRKARREAKVKFEERYKTGKNKWFFQKLRF" polyA_signal 448..455 /gene="RPL27" BASE COUNT 151 a 101 c 117 g 107 t ORIGIN 1 ggttggttgc tgccgaaatg ggcaagttca tgaaacctgg gaaggtggtg cttgtcctgg 61 ctggacgcta ctccggacgc aaagctgtca tcgtgaagaa cattgatgat ggcacctcag 121 atcgccccta cagccatgct ctggtggctg gaattgaccg ctacccccgc aaagtgacag 181 ctgccatggg caagaagaag atcgccaaga gatcaaagat aaaatctttt gtgaaagtgt 241 ataactacaa tcacctaatg cccacaaggt actctgtgga tatccccttg gacaaaactg 301 tcgtcaataa ggatgtcttc agagatcctg ctcttaaacg caaggcccga cgggaggcca 361 aggtcaagtt tgaagagaga tacaagacag gcaagaacaa gtggttcttc cagaaactgc 421 ggttttagat gctttgtttt gatcattaaa aattataaag aaaaaaaaaa aaaaaa // LOCUS HUMRPL30A 556 bp mRNA PRI 23-MAR-1993 DEFINITION Human ribosomal protein L30 (homologue of yeast rpl30) mRNA, complete cds. ACCESSION M94314 NID g292436 KEYWORDS homologue; ribosomal protein L30. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 556) AUTHORS Johnson,K.R. TITLE Characterization of cDNA clones for the human homologue of saccharomyces cerevisiae robosomal protein L30 JOURNAL Gene 123, 283-285 (1993) MEDLINE 93154599 FEATURES Location/Qualifiers source 1..556 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="WI-38" CDS 40..513 /note="homologue of yeast rpL30" /codon_start=1 /product="ribosomal protein L30" /db_xref="PID:g292437" /translation="MKVELCSFSGYKIYPGHGRRYARTDGKVFQFLNAKCESAFLSKR NPRQINWTVLYRRKHKKGQSEEIQKKRTRRAVKFQRAITGASLADIMAKRNQKPEVRK AQREQAIRAAKEAKKAKQASKKTAMAAAKAPTKAAPKQKIVKPVKVSAPRVGGKR" polyA_signal 534..539 BASE COUNT 175 a 119 c 140 g 122 t ORIGIN 1 ttttttcgcc atcttttgtc tttccgtgga gctgtcgcca tgaaggtcga gctgtgcagt 61 tttagcgggt acaagatcta ccccggacac gggaggcgct acgccaggac cgacgggaag 121 gttttccagt ttcttaatgc gaaatgcgag tcggctttcc tttccaagag gaatcctcgg 181 cagataaact ggactgtcct ctacagaagg aagcacaaaa agggacagtc ggaagaaatt 241 caaaagaaaa gaacccgccg agcagtcaaa ttccagaggg ccattactgg tgcatctctt 301 gctgatataa tggccaagag gaatcagaaa cctgaagtta gaaaggctca acgagaacaa 361 gctatcaggg ctgctaagga agcaaaaaag gctaagcaag catctaaaaa gactgcaatg 421 gctgctgcta aggcacctac aaaggcagca cctaagcaaa agattgtgaa gcctgtgaaa 481 gtttcagctc cccgagttgg tggaaaacgc taaactggca gattagattt ttaaataaag 541 attggattat aactct // LOCUS HUMRPL34A 392 bp mRNA PRI 04-OCT-1995 DEFINITION Homo sapiens ribosomal protein L34 (RPL34) mRNA, complete cds. ACCESSION L38941 NID g1008855 KEYWORDS ribosomal protein L34. SOURCE Homo sapiens (clone: GT247) ovary cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 392) AUTHORS Rommens,J.M., Durocher,F., McArthur,J., Tonin,P., Leblanc,J.-F., Allen,T., Samson,C., Ferri,L., Narod,S., Morgan,K. and Simard,J. TITLE Generation of a transcription map at the HSD17B locus centromeric to BRCA1 at 17q21 JOURNAL Genomics 28 (3), 530-542 (1995) MEDLINE 96039267 FEATURES Location/Qualifiers source 1..392 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="GT247" /tissue_type="ovary" 5'UTR 1..20 /partial /gene="RPL34" /note="putative" gene 1..392 /gene="RPL34" mRNA 1..392 /partial /gene="RPL34" /note="putative" CDS 21..374 /gene="RPL34" /note="Human homolog of rat ribosomal protein L34; putative" /codon_start=1 /product="ribosomal protein L34" /db_xref="PID:g1008856" /translation="MVQRLTYRRRLSYNTASNKTRLSRTPGNRIVYLYTKKVGKAPKS ACGVCPGKLRGVRPVRPKVLMRLSKTKKHVSRAYGGSMCAKCVRDRIKRAFLIEEQKI IVKVLKAQAQSQKAK" 3'UTR 375..392 /partial /gene="RPL34" /note="putative" BASE COUNT 121 a 79 c 92 g 100 t ORIGIN 1 gatgtctgca ggcactcaga atggtccagc gtttgacata ccgacgtagg ctttcctaca 61 atacagcctc taacaaaact aggctgtccc gaacccctgg taatagaatt gtttaccttt 121 ataccaagaa ggttgggaaa gcaccaaaat ctgcatgtgg tgtgtgccca ggcaaacttc 181 gaggggttcg tcctgtaaga cctaaagttc ttatgagatt gtccaaaaca aagaaacatg 241 tcagcagggc ctatggtggt tccatgtgtg ctaaatgtgt tcgtgacagg atcaagcgtg 301 ctttccttat cgaggagcag aaaatcattg tgaaagtgtt gaaggcacaa gcacagagtc 361 agaaagctaa ataaaaaaat gaaacttttt tt // LOCUS HUMRPL37Z 349 bp mRNA PRI 09-FEB-1995 DEFINITION Homo sapiens ribosomal protein L37 mRNA, complete cds. ACCESSION L11567 NID g292440 KEYWORDS ribosomal protein L37. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 349) AUTHORS Barnard,G.F., Staniunas,R.J., Puder,M., Steele,G.D. Jr. and Chen,L.B. TITLE Human ribosomal protein L37 has motifs predicting serine/threonine phosphorylation and a zinc-finger domain JOURNAL Biochim. Biophys. Acta 1218 (3), 425-428 (1994) MEDLINE 94325352 FEATURES Location/Qualifiers source 1..349 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="colon" /tissue_lib="pcDNAIITc-Nc of R.Staniunas" CDS 7..300 /codon_start=1 /product="ribosomal protein L37" /db_xref="PID:g292441" /translation="MTKGTSSFGKRRNKTHTLCRRCGSKAYHLQKSTCGKCGYPAKRK RKYNWSAKAKRRNTTGTGRMRHLKIVYRRFRHGFREGTTPKPKRAAVAASSSS" polyA_signal 323..328 polyA_site 349 BASE COUNT 112 a 80 c 86 g 71 t ORIGIN 1 agcgagatga cgaagggaac gtcatcgttt ggaaagcgtc gcaataagac gcacacgttg 61 tgccgccgct gtggctctaa ggcctaccac cttcagaagt cgacctgtgg caaatgtggc 121 taccctgcca agcgcaagag aaagtataac tggagtgcca aggctaaaag acgaaatacc 181 accggaactg gtcgaatgag gcacctaaaa attgtatacc gcagattcag gcatggattc 241 cgtgaaggaa caacacctaa acccaagagg gcagctgttg cagcatccag ttcatcttaa 301 gaatgtcaac gattagtcat gcaataaatg ttctggtttt aaaaaatac // LOCUS HUMRPOLAA 1766 bp mRNA PRI 16-AUG-1990 DEFINITION Human RNA polymerase subunit hRPB 33, mRNA. ACCESSION J05448 NID g337496 KEYWORDS RNA polymerase. SOURCE Human cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1766) AUTHORS Pati,U.K. and Weissman,S.M. TITLE The amino acid sequence of the human RNA polymerase II 33-kDa subunit hRPB 33 is highly conserved among eukaryotes JOURNAL J. Biol. Chem. 265, 8400-8403 (1990) MEDLINE 90256750 COMMENT Draft entry and printed sequence for [1] kindly submitted by U.Pati and S.M.Weissman, 19-JUN-1990. FEATURES Location/Qualifiers source 1..1766 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 42..869 /note="RNA polymerase subunit hRPB 33" /codon_start=1 /db_xref="PID:g337497" /translation="MPYANQPTVRITELTDENVKFIIENTDLAVANSIRRVFIAEVPI IAIDWVQIDANSSVLHDEFIAHRLGLIPLISDDIVDKLQYSRDCTCEEFCPECSVEFT LDVRCNEDQTRHVTSRDLISNSPRVIPVTSRNRDNDPNDYVEQDDILIVKLRKGQELR LRAYAKKGFGKEHAKWNPTAGVAFEYDPDNALRTTVYPKPEEWPKSEYSELDEDESQA PYDPNGKPERFYYNVESCGSLRPETIVLSALSGLKKKLSDLQTQLSHEIQSDVLTIN" BASE COUNT 452 a 437 c 449 g 428 t ORIGIN 1 aagcccgcga gcagacgcgg aggctggtgg ccctgggcga gatgccgtac gccaaccagc 61 ctaccgtgcg gatcacggag ctcactgacg agaatgtcaa gttcatcatc gagaacaccg 121 acctggcggt ggccaattcg attcggaggg tcttcatcgc tgaggttccc ataatagcca 181 ttgactgggt tcagattgat gccaattcct cagtccttca tgatgaattc attgctcaca 241 ggcttggatt aattcccctc attagtgatg acattgtgga caagctgcag tactctcggg 301 actgcacatg tgaggagttc tgccccgagt gctcggtgga gttcaccctc gatgtgcggt 361 gcaatgaaga ccagacgcga catgtcacgt ctcgagacct catctccaac agcccccggg 421 tcattccggt gacatcccgg aaccgagata atgaccccaa tgactacgtg gagcaggatg 481 acatcctcat cgtcaagttg agaaagggcc aggagctgag acttcgagcc tatgccaaaa 541 agggctttgg caaggagcat gccaagtgga accctactgc aggggtggct tttgaatacg 601 atccagacaa tgccctgagg accacagtgt accccaagcc cgaggaatgg ccaaagagtg 661 agtactcgga gctggatgag gatgagtcgc aggctcccta tgaccccaac ggcaagccag 721 aaaggtttta ctacaatgtg gagtcctgtg gctctctgcg tcctgaaacc attgtcctgt 781 cagccctctc aggattgaag aagaaactga gtgatttaca aactcaatta agccacgaga 841 tccagagtga tgtgctaacc ataaattaac tgcagcttgc ctgcttcagc aaaaacggag 901 attcaggcca gcagctggat atgggggtct ctcttcagac tcttctcgtt tctgagaatc 961 tagtctactg ttggttgagc ttcttggcag gacatcagta ccaactagaa gtgggtcata 1021 gatagattac cagggatgca gtggtgttta ggcaggatag gtctttactg gccctgactg 1081 ctgttaataa ttggcagcag tgctccccag atcccagaag gtccctgctg gagtgtttcc 1141 agtgcacctg taggaacaac tagacttctc tcctggttag tccagctctt tactctaaac 1201 cctttctgtc caaaatgagt cattttcagt tgtaccttag atgtctggtg ttgaggatca 1261 agtgccatag cctttattca gggggcctat aaacccttcc agttcttgcc ccagggctgg 1321 cctgctagcg cttcaaattc ccaggtgtcc ctaatttgag aagtaacctt ttggaatagc 1381 attagaccct ggctgtcccc tccccaccaa ataaacatga tatttcattc tctgtccagc 1441 agtcatgaac cccttcacct ccaatgacct gatcatttag tttggtgggg gtgggggtgg 1501 gggtgggggt gggggtggaa gcagccgcag gagcaagggc ccctcccaca tacacaggag 1561 gagtatttca tttctcctta atgaaggctc tggccctaac ccctcagcac tgtctccaga 1621 taggaacatg cacaaagcag ttaattaggc agcctggaga aaaccagaga tccagtacag 1681 aaaggaaagg atatttattg attaacagaa gttgtctttt taaaagtgtt tatttttgca 1741 ataaagagca caacataaaa aaaaaa // LOCUS HUMRPS14 5985 bp DNA PRI 07-AUG-1995 DEFINITION Human ribosomal protein S14 gene, complete cds. ACCESSION M13934 M13641 NID g337498 KEYWORDS Alu repeat; repeat region; ribosomal protein; ribosomal protein S14. SOURCE Homo sapiens (clone: HGS14-[1,2]) placenta DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5985) AUTHORS Rhoads,D.D., Dixit,A. and Roufa,D.J. TITLE Primary structure of human ribosomal protein S14 and the gene that encodes it JOURNAL Mol. Cell. Biol. 6 (8), 2774-2783 (1986) MEDLINE 87064583 REFERENCE 2 (bases 1 to 5570) AUTHORS Chen,I.T., Dixit,A., Rhoads,D.D. and Roufa,D.J. TITLE Homologous ribosomal proteins in bacteria, yeast, and humans JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83 (18), 6907-6911 (1986) MEDLINE 86313681 COMMENT [2] exons only. Draft entry and sequence in computer-readable from for [2] kindly provided by D.D.Rhoads and D.J.Roufa, 25-NOV-1986. A potential 3' mRNA processing signal is located at positions 5662-5670. FEATURES Location/Qualifiers source 1..5985 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HGS14-[1,2]" /tissue_type="placenta" /map="5q31-q33" exon 201..254 /gene="RPS14" /note="alternative exon 1; G00-119-572" /number=1 gene 201..5615 /gene="RPS14" exon 202..254 /gene="RPS14" /note="alternative exon 1; G00-119-572" /number=1 exon 211..254 /gene="RPS14" /note="alternative exon 1; G00-119-572" /number=1 exon 217..254 /gene="RPS14" /note="alternative exon 1; G00-119-572" /number=1 intron 255..2137 /gene="RPS14" /note="G00-119-572" /number=1 CDS 673..1167 /gene="RPS14" /note="ORF; putative" /codon_start=1 /product="unknown protein" /db_xref="PID:g804762" /translation="MVCSDHSKKESRETLALCWMLLNLAILCVAESGTLQVLALRPST NVCCRLPSFDILLMTTLLVTLGMLERRNCKLAFTEWPSCRNILMLTYLRYAGAFLCFS RQNFGLLSCGAFLTIVNALGRAHWSDLKIGTFNFVHPRLTLEVLTVVTHLTESLYREG RGEI" exon 2138..2288 /partial /gene="RPS14" /note="G00-119-572" /number=2 CDS join(2140..2288,2902..3063,4167..4243,5503..5570) /gene="RPS14" /codon_start=1 /db_xref="GDB:G00-119-572" /product="ribosomal protein S14" /db_xref="PID:g337499" /translation="MAPRKGKEKKEEQVISLGPQVAEGENVFGVCHIFASFNDTFVHV TDLSGKETICRVTGGMKVKADRDESSPYAAMLAAQDVAQRCKELGITALHIKLRATGG NRTKTPGPGAQSALRALARSGMKIGRIEDVTPIPSDSTRRKGGRRGRRL" intron 2289..2901 /gene="RPS14" /note="G00-119-572" /number=2 variation 2902 /gene="RPS14" /note="a in DNA [1]; g in cDNA [2],[1]" /replace="g" exon 2902..3063 /gene="RPS14" /note="G00-119-572" /number=3 intron 3064..4166 /gene="RPS14" /note="G00-119-572" /number=3 repeat_region 3744..4004 /note="Alu repeat copy A" exon 4167..4243 /gene="RPS14" /note="G00-119-572" /number=4 intron 4244..5502 /gene="RPS14" /note="G00-119-572" /number=4 repeat_region 4814..5064 /note="Alu repeat copy B" exon 5503..5615 /gene="RPS14" /note="G00-119-572" /number=5 BASE COUNT 1389 a 1360 c 1561 g 1675 t ORIGIN 264 bp upstream of SmaI site; chromosome 5q23-5q33. 1 tcgtactgtc gcttaggccg acccgctcgc agcgacaacc agccctctac ctcttttcgc 61 tctcccttaa gtaataaacc gtctttcctt atgacgagtc ttaaactctt tgggaggaat 121 aatgccggcg tcttccggaa cccgacctcg ccccgtgacc tcagaggtat acttccggga 181 cacggaagtg acccccgtcg ctccgccctc tcccactctc tctttccggt gtggagtctg 241 gagacgacgt gcaggtagga gcccgggcgc gacaatcggg gggcatctgc ggcgagggga 301 cctgtggggc ttgggacgag agacgggggt ctttccgtgg gaaccgagct aggtgccggg 361 caagagacgc gcggctggcc cacctggatc ctggccaact cgggattgag ttcattcctg 421 gtctcagaag gcccgttttg cttcagggag gagcttgtga agtaagggtg agcgtggtcc 481 agccttttaa gcctcggccc cgcaatacgg cggcacggcc gcgtttgagc tgcacagcgt 541 agttgaggga acccgggaca gacgtgggct cccgctctac ctcgccaaac ttttttcttg 601 gtgatcgcag gcccacgcta atctcgtttg tttcctcgtc tgcaaaatag gaataacaat 661 agcaccgatc ccatggtttg tagtgatcat tcaaagaagg aaagcaggga aactctcgca 721 ctatgttgga tgcttctaaa tctggcgatt ctttgcgttg ctgagtcggg cacgttgcaa 781 gttctggcgc tcaggcccag caccaatgtt tgctgtcgct tgccctcgtt tgatatactc 841 ctgatgacca ctctgctcgt tactttaggg atgttagaac ggagaaactg caaattggca 901 tttactgaat ggccatcatg ccgaaacata ctcatgctta cgtatttgag atatgctgga 961 gcctttctat gcttctcgag gcagaacttt gggcttctct cctgtggcgc gttccttaca 1021 atagttaacg cactgggtcg tgctcattgg tctgatttga agataggaac atttaacttc 1081 gtacacccaa gacttacact tgaagtactt actgtggtca cacacttaac tgaaagttta 1141 tatagggaag gcagaggaga gatctaggaa cacctgagac agactagggt aagttatttg 1201 tggtgtagag gcatcttgcc actgtcttta ctgtgaccat aggctgagac actctctatc 1261 ttcaccatct gcaggatatc gattagtgtt agtgtcttct gaaaatgtta aattgttctg 1321 aacaccagga ggttcagaag gcctgggctt taggcggaag cagtcttatc ctagccttcc 1381 actttctcat tcattctcac ccgccttcct cctgtcagtt tctctctccc ttagatcttg 1441 tgacagtatg attgcaatta ttatgcaagg tcataatagg ttagagttat tttagtctca 1501 ggacccatca gatttctgag atgggtcttt tccattagcc tgccttttgg atatatttta 1561 gtgcctttga tatttgatgt tggtgacaaa gatttagttc taaatagttg tgggagtcaa 1621 atcagttgca tttttgaaca ttttgaaggg atcaggtgga gcacaggaaa acaggagatg 1681 agaggcttga gggctgtgtt ggtaagggac ccttgaattc taagctgagg agttcatgca 1741 ggaatcaagc agctccctcg ccccatgaag gtgttaggaa tggtaccagt acatgggtag 1801 ctgttggtct tgggtttctt cttgcctttt ggggtgcagc cttcagcgat ctgctgctag 1861 atccctaggc ttctatcttg taggtgctgt gagctgagct gttggtagtt tctggggaaa 1921 ggatacttct catggtactt tgccatatgg ccacagttga taccccccca agtgcaagga 1981 gaaagaagtt ttagtgaggc agaaatgaga gattacagta gagtattgga tggagagtac 2041 agggacataa gcaggactag ggtttcattt aatagccagg gaaagggatt tctgaaatat 2101 tttgtctgtt aaccttattt cttttccttc cactcagaaa tggcacctcg aaaggggaag 2161 gaaaagaagg aagaacaggt catcagcctc ggacctcagg tggctgaagg agagaatgta 2221 tttggtgtct gccatatctt tgcatccttc aatgacactt ttgtccatgt cactgatctt 2281 tctggcaagt gagtacctgg gtggagaggc atccagctgg caaaaggctg aggaaggtaa 2341 tggctgggac gggctagcag ttcaggggat tctctctaaa gaaatccctg ttttgtccag 2401 gtaagaaaat gtgcttgtcc atttagccca caaatattgt gatttcccag gggttacaag 2461 agaggagaca cattcttcgt ccttaacagc gtgatggtca ttgaatcctt ggtttcttgg 2521 gaattatctt ctttccctag ttccctttgc agagcagcat ggactgcaat gggctctggg 2581 tgcgggttcg acggccattt agcaagttgt gacttgtgtg aaatcactca gattctgagt 2641 tttggtttcc tcatatgtaa aattaggaca gtaatgtcta ccttgtggag taatggtaag 2701 gattaaatgc aaaagtagat atataaaatg ttaatactga gtatagctta tattggccat 2761 caaccccact gactaactcc atagagagcc catgaccatc atcctccacc atgactgagc 2821 tgtgttgctc ttcactcatt aggatatcgt aagcattcct tgttggacga ggaagaaatg 2881 accctgtgct tttgtcccca gagaaaccat ctgccgtgtg actggtggga tgaaggtaaa 2941 ggcagaccga gatgaatcct caccatatgc tgctatgttg gctgcccagg atgtggccca 3001 gaggtgcaag gagctgggta tcaccgccct acacatcaaa ctccgggcca caggaggaaa 3061 taggtacgag tcgcagaggg gatggctggg tggtagaaaa cctgctgggc ttgggtgctg 3121 gagcacctgg atttgaggtt tgggtttttt gttgtgacct tgaacaagat attttaggca 3181 tatacatact taataattgg ctcctgtgta caccagcagc tcagtgttga gcaccactgt 3241 atactaggag ctgtgtttca ggtactagaa gagagtgatg gggaaacaga catggtctct 3301 accctccttg tgtgggaccg aggctggtgg gagagtcaga ctttaaacaa cagagctcca 3361 gggagtatgt ggttttcatt tgtgacaaat gccaggaagc acaagtaagg agctggtgat 3421 aaagtgtaat ttaaactagg gtctaaatga taagtccgga gtccttccag tgaaataagg 3481 acaagaggag gtattccagc tagagacaac tgcagggaaa ggtcttgagg tgggggtgaa 3541 cctgggcact atgaaggaaa gagtggcatg aggtgcagtc agaggggaag aggtggagtc 3601 ctgtaggaga ggggaaggac ctggatttga gtgatggggg catgaaagct aaatagatga 3661 ctgttgcgtg aacaagccag acatggtcag aacataacaa caaaccatat ggccccctca 3721 gcctaaaata tttactgtct ggcttttttt ttttcctctt ttttgagaca gggtctcact 3781 ctcacccagg ctggagtgca gtggcgctat ctcagcttac tgcagtctct ggctttcagg 3841 ctctagcagt cctcccacct cagcctcccg agtagctggg actgctagta gtagtgccac 3901 cacgcctggc taaattgtgt atttttgtag agatggggtt tcaccatgtt gctcaggctg 3961 gtcttgaact cctgagctga agcagtccgc cctgccttag cctcccaagg tgctgagatt 4021 atgggtgtga accactgtgt acagccctgt ctggctcttt acagaaagtt tgcagacctc 4081 tgaactagcg gggtgttggg atcttagaaa gaagatcttc cattacttgc caaaggctct 4141 tcttaactta aaccttcatt ttctaggacc aagacccctg gacctggggc ccagtcggcc 4201 ctcagagccc ttgcccgctc gggtatgaag atcgggcgga ttggtaagtg cccccctcta 4261 gctaatgctt gggtttattt tgaagcattg gccccaaaaa gcacgtgctg tcccagtgga 4321 tgtgcagcgg ctggtctggt cacttttggc agttaagttt gttaagggag gctgcaagag 4381 gcaccttgtg acttaaaaat ctctcccctg aatatctctg tcaccttgat atgcaatgtc 4441 ctgtttcatt tgtgaatctc tgtcctacac tgcctgtgca gtaaaactga caagagatga 4501 attgtccttt cctccattct taggtcattg ctattagcat ttttgtgtgt accatctcct 4561 gcattgtagg atgttcagca tttttggttt ccgcctacta aatatcagtt gggcatgcca 4621 gtcgtgtgac agccaaaaat gaagtggggg gtaccagctt ctgttattaa taccctcagg 4681 cctgtgatct cctggctcct tacagctctt aggaagggaa cagtagtctt tccctgcaca 4741 ttgttctgtg cagttagtag cgacatacac tgacgcccca ggccaaagtt cctactgtgt 4801 cctacttgaa ggatgatttt gttttcttga gacagtgtct ggctctgttg cgtgggctgg 4861 aattcagtgg cacaatctca gctcactgta aatttgcctc gtgggcggat ctgtccaaag 4921 gatcctccca cctgcatcta ccgagtagct gggacaacag tgtgtgccac acgggctttt 4981 tttttttttt tttgtagcgg ggagtttcac catgttgccc aggttggtcc caaactcctg 5041 gcctcaggcg ttccaccagc cttagtctcc caaagcccaa agtgctagga ttacaggcat 5101 gacgcaccgg cgcccagcct taaaagattt tttcagactt tttttcagtg tctttttact 5161 taatgaaaac gagacagttg cagttaccct gtctagaagg gttttttcct ctagagaaga 5221 cggtgggcta gatgtcagaa cctggatttg tcctgtgcct ctgacatctt gcgcacagct 5281 cagcctgttt ccctttctgg aaagtgagtt aagaccacgt caggaaattc gaggaactgg 5341 cagatggggc cgcttcatgt tcacatccat ctgttaggta ctggggagtc atttggggag 5401 cagcaggatc tgttgctgat tggttaggcc attggtctag ggagtcctgg gggcccttcc 5461 catccagctc caggcaatga cgctttctct tcctccccac agaggatgtc acccccatcc 5521 cctctgacag cactcgcagg aaggggggtc gccgtggtcg ccgtctgtga acaagattcc 5581 tcaaaatatt ttctgttaat aaattgcctt catgtaaact gtttcaactc cagtcttctc 5641 tccttcatca ggggctactc aggcattttg atttcttcct cctatttggc ttcttggaga 5701 agagcctgag agagctgaga tcctggtgct gcatttgggg tcttttccat gctacttgcg 5761 gcattaagtt gccttggtca tctaaagcag accaaggcgt tggagacctc gttttgagtg 5821 gagatgctgg ttctaaatat ggaccaattc ttaaagagcc agagtgggaa ctgttgatcc 5881 aagtgtagcc tgaagcgaaa gaggagcctt ccagacccat gccatatata aacacacgtg 5941 ggtgtgcatt ctccccccac accttctgtg caaagctggg agctc // LOCUS HUMRPS20 505 bp mRNA PRI 09-JAN-1995 DEFINITION Homo sapiens ribosomal protein S20 (RPS20) mRNA, complete cds. ACCESSION L06498 NID g292442 KEYWORDS ribosomal protein; ribosomal protein S20. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 505) AUTHORS Chu,W., Presky,D.H., Swerlick,R.A. and Burns,D.K. TITLE Human ribosomal protein S20 cDNA sequence JOURNAL Nucleic Acids Res. 21 (7), 1672 (1993) MEDLINE 93241957 FEATURES Location/Qualifiers source 1..505 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="dermal vascular endothelial" mRNA 1..505 /gene="RPS20" gene 1..505 /gene="RPS20" CDS 114..473 /gene="RPS20" /codon_start=1 /product="ribosomal protein S20" /db_xref="PID:g292443" /translation="MAFKDTGKTPVEPEVAIHRIRITLTSRNVKSLEKVCADLIRGAK EKNLKVKGPVRMPTKTLRITTRKTPCGEGSKTWDRFQMRIHKRLIDLHSPSEIVKQIT SISIEPGVEVEVTIADA" polyA_site 486..491 /gene="RPS20" BASE COUNT 150 a 114 c 125 g 116 t ORIGIN 1 aagacgcggt cgtaagggct gaggattttt ggtccgcacg ctcctgctcc tgactcaccg 61 ctgttcgctc tcgccgagga acaagtcggt caggaagccc gcgcgcaaca gccatggctt 121 ttaaggatac cggaaaaaca cccgtggagc cggaggtggc aattcaccga attcgaatca 181 ccctaacaag ccgcaacgta aaatccttgg aaaaggtgtg tgctgacttg ataagaggcg 241 caaaagaaaa gaatctcaaa gtgaaaggac cagttcgaat gcctaccaag actttgagaa 301 tcactacaag aaaaactcct tgtggtgaag gttctaagac gtgggatcgt ttccagatga 361 gaattcacaa gcgactcatt gacttgcaca gtccttctga gattgttaag cagattactt 421 ccatcagtat tgagccagga gttgaggtgg aagtcaccat tgcagatgct taagtcaact 481 attttaataa attgatgacc agtta // LOCUS HUMRPS21X 343 bp mRNA PRI 13-MAY-1996 DEFINITION Human ribosomal protein S21 (RPS21) mRNA, complete cds. ACCESSION L04483 NID g292444 KEYWORDS ribosomal protein S21. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 343) AUTHORS Bhat,K.S. and Morrison,S.G. TITLE Primary structure of human ribosomal protein S21 JOURNAL Nucleic Acids Res. 21 (12), 2939 (1993) MEDLINE 93324381 FEATURES Location/Qualifiers source 1..343 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="THP-1" /cell_type="macrophage" gene 39..343 /gene="RPS21" CDS 39..290 /gene="RPS21" /note="putative" /codon_start=1 /product="ribosomal protein S21" /db_xref="PID:g292445" /translation="MQNDAGEFVDLYVPRKCSASNRIIGAKDHASIQMNVAEVDKVTG RFNGQFKTYAICGAIRRMGESDDSILRLAKADGIVSKNF" polyA_signal 326..331 /gene="RPS21" polyA_site 343 /gene="RPS21" BASE COUNT 91 a 81 c 97 g 74 t ORIGIN 1 cgcgcggtgt ggtggcagca ggcgcaccaa gcctcgaaat gcagaacgac gccggcgagt 61 tcgtggacct gtacgtgccg cggaaatgct ccgctagcaa tcgcatcatc ggtgccaagg 121 accacgcatc catccagatg aacgtggccg aggttgacaa ggtcacaggc aggtttaatg 181 gccagtttaa aacttatgct atctgcgggg ccattcgtag gatgggtgag tcagatgatt 241 ccattctccg attggccaag gccgatggca tcgtctcaaa gaacttttga ctggagagaa 301 tcacagatgt ggaatatttg tcataaataa ataatgaaaa cct // LOCUS HUMRPS24A 620 bp mRNA PRI 20-FEB-1991 DEFINITION Human ribosomal protein S24 mRNA. ACCESSION M31520 NID g337504 KEYWORDS ribosomal protein S24. SOURCE Human male lymphoblast from lymphoid tumor cell line HT1080 (ATCC 121) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 620) AUTHORS Brown,S.J. and Roufa,D.J. JOURNAL Unpublished (1990) REFERENCE 2 (bases 1 to 620) AUTHORS Brown,S.J., Jewell,A., Maki,C.G. and Roufa,D.J. TITLE A cDNA encoding human ribosomal protein S24 JOURNAL Gene 91, 293-296 (1990) MEDLINE 91007290 COMMENT Authorin Submission [1] kindly submitted by Roufa,D.J., 22-JAN-1990. FEATURES Location/Qualifiers source 1..620 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA complement(1..132) /note="unknown mRNA; 800 nt. anonymous human transcript detected on Northern blots of HT1080 and HeLa cell cytoplasmic mRNAs" /evidence=experimental misc_signal complement(20..25) /note="detected by pattern only; putative" CDS complement(51..89) /note="unknown protein" /codon_start=1 /evidence=experimental /db_xref="PID:g337505" /translation="MFLQEEAALKAL" gene 143..544 /gene="rps24" CDS 143..544 /gene="rps24" /codon_start=1 /evidence=experimental /product="ribosomal protein S24" /db_xref="PID:g337506" /translation="MNDTVTIRTRKFMTNRLLQRKQMVIDVLHPGKATVPKTEIREKL AKMYKTTPDVIFVFGFRTHFGGGKTTGFGMIYDSLDYAKKNEPKHRLARHGLYEKKKT SRKQRKERKNRMKKVRGTAKANVGAGKKPKE" misc_signal 598..603 /evidence=experimental BASE COUNT 197 a 126 c 144 g 153 t ORIGIN 1 gggtttatcg gaaaatgtgt ttattgagat ggtttcccac tcatcttgac tcagagtgct 61 tttagtgctg cttcctcctg aaggaacatc cttctgtaag ccttgctttt cctccttggc 121 tgtctgaaga tagatcgcca tcatgaacga caccgtaact atccgcacta gaaagttcat 181 gaccaaccga ctacttcaga ggaaacaaat ggtcattgat gtccttcacc ccgggaaggc 241 gacagtgcct aagacagaaa ttcgggaaaa actagccaaa atgtacaaga ccacaccgga 301 tgtcatcttt gtatttggat tcagaactca ttttggtggt ggcaagacaa ctggctttgg 361 catgatttat gattccctgg attatgcaaa gaaaaatgaa cccaaacata gacttgcaag 421 acatggcctg tatgagaaga aaaagacctc aagaaagcaa cgaaaggaac gcaagaacag 481 aatgaagaaa gtcaggggga ctgcaaaggc caatgttggt gctggcaaaa agccgaagga 541 gtaaaggtgc tgcaatgatg ttagctgtgg ccactgtgga tttttcgcaa gaacattaat 601 aaactaaaaa cttcatgtgt // LOCUS HUMRPS25 497 bp mRNA PRI 25-JAN-1994 DEFINITION Human ribosomal protein S25 mRNA, complete cds. ACCESSION M64716 NID g337507 KEYWORDS ribosomal protein S25. SOURCE Human cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 497) AUTHORS Li,M.L., Latoud,C. and Center,M.S. TITLE Cloning and sequencing a cDNA encoding human ribosomal protein S25 JOURNAL Gene 107 (2), 329-333 (1991) MEDLINE 92084127 REFERENCE 2 (bases 1 to 497) AUTHORS Li,M. and Center,M.S. TITLE Regulation of ribosomal protein S25 in HL60 cells isolated for resistance to adriamycin JOURNAL FEBS Lett. 298 (2-3), 142-144 (1992) MEDLINE 92183849 FEATURES Location/Qualifiers source 1..497 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="promyelocytic leukemic cells" mRNA <1..497 /gene="RPS25" /standard_name="ribosomal protein S25" /note="putative" /citation=[1] gene 1..497 /gene="RPS25" 5'UTR 1..71 /gene="RPS25" CDS 72..449 /gene="RPS25" /standard_name="ribosomal protein S25" /note="putative" /codon_start=1 /product="ribosomal protein" /db_xref="PID:g337508" /translation="MPPKDDKKKKDAGKSAKKDKDPVNKSGGKAKKKKWSKGKVRDKL NNLVLFDKATYDKLCKEVPNYKLITPAVVSERLKIRGSLARAALQELLSKGLIKLVSK HRAQVIYTRNTKGGDAPAAGEDA" 3'UTR 447..497 /gene="RPS25" /note="putative" polyA_signal 479..484 /gene="RPS25" BASE COUNT 157 a 107 c 117 g 116 t ORIGIN 1 tttttttttt ttttttgtcc gacatcttga gacgaggctg cggtgtctgc tgctattctc 61 cgagcttcgc aatgccgcct aaggacgaca agaagaagaa ggacgctgga aagtcggcca 121 agaaagacaa agacccagtg aacaaatccg ggggcaaggc caaaaagaag aagtggtcca 181 aaggcaaagt tcgggacaag ctcaataact tagtcttgtt tgacaaagct acctatgata 241 aactctgtaa ggaagttccc aactataaac ttataacccc agctgtggtc tctgagagac 301 tgaagattcg aggctccctg gccagggcag cccttcagga gctccttagt aaaggactta 361 tcaaactggt ttcaaagcac agagctcaag taatttacac cagaaatacc aagggtggag 421 atgctccagc tgctggtgaa gatgcatgaa taggtccaac cagctgtaca tttggaaaaa 481 taaaacttta ttaaatc // LOCUS HUMRPTKC 3107 bp mRNA PRI 10-AUG-1995 DEFINITION Homo sapiens receptor protein-tyrosine kinase (HEK8) mRNA, complete cds. ACCESSION L36645 NID g551613 KEYWORDS EPH-like receptor PTK; receptor protein-tyrosine kinase. SOURCE Homo sapiens (clone library: Stratagene premade library, cat #936206) female fetus, 17-18 weeks gestation brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3107) AUTHORS Fox,G.M., Holst,P.L., Chute,H.T., Lindberg,R.A., Janssen,A.M., Basu,R. and Welcher,A.A. TITLE cDNA cloning and tissue distribution of five human EPH-like receptor protein-tyrosine kinases JOURNAL Oncogene 10 (5), 897-905 (1995) MEDLINE 95206782 FEATURES Location/Qualifiers source 1..3107 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus, 17-18 weeks gestation" /sex="female" /clone_lib="Stratagene premade library, cat #936206" /tissue_type="brain" mRNA 1..3107 /gene="HEK8" /product="receptor protein-tyrosine kinase" gene 1..3107 /gene="HEK8" 5'UTR <1..33 /gene="HEK8" CDS 34..2994 /gene="HEK8" /codon_start=1 /product="receptor protein-tyrosine kinase" /db_xref="PID:g551614" /translation="MAGIFYFALFSCLFGICDAVTGSRVYPANEVTLLDSRSVQGELG WIASPLEGGWEEVSIMDEKNTPIRTYQVCNVMEPSQNNWLRTDWITREGAQRVYIEIK FTLRDCNSLPGVMGTCKETFNLYYYESDNDKERFIRENQFVKIDTIAADESFTQVDIG DRIMKLNTEIRDVGPLSKKGFYLAFQDVGACIALVSVRVFYKKCPLTVRNLAQFPDTI TGADTSSLVEVRGSCVNNSEEKDVPKMYCGADGEWLVPIGNCLCNAGHEERSGECQAC KIGYYKALSTDATCAKCPPHSYSVWEGATSCTCDRGFFRADNDAASMPCTRPPSAPLN LISNVNETSVNLEWSSPQNTGGRQDISYNVVCKKCGAGDPSKCRPCGSGVHYTPQQNG LKTTKVSITDLLAHTNYTFEIWAVNGVSKYNPNPDQSVSVTVTTNQAAPSSIALVQAK EVTRYSVALAWLEPDRPNGVILEYEVKYYEKDQNERSYRIVRTAARNTDIKGLNPLTS YVFHVRARTAAGYGDFSEPLEVTTNTVPSRIIGDGANSTVLLVSVSGSVVLVVILIAA FVISRRRSKYSKAKQEADEEKHLNQGVRTYVDPFTYEDPNQAVREFAKEIDASCIKIE KVIGVGEFGEVCSGRLKVPGKREICVAIKTLKAGYTDKQRRDFLSEASIMGQFDHPNI IHLEGVVTKCKPVMIITEYMENGSLDAFLRKNDGRFTVIQLVGMLRGIGSGMKYLSDM SYVHRDLAARNILVNSNLVCKVSDFGMSRVLEDDPEAAYTTRGGKIPIRWTAPEAIAY RKFTSASDVWSYGIVMWEVMSYGERPYWDMSNQDVIKAIEEGYRLPPPMDCPIALHQL MLDCWQKERSDRPKFGQIVNMLDKLIRNPNSLKRTGTESSRPNTALLDPSSPEFSAVV SVGDWLQAIKMDRYKDNFTAAGYTTLEAVVHVNQEDLARIGITAITHQNKILSSVQAM RTQMQQMHGRMVPV" 3'UTR 2995..3107 /gene="HEK8" BASE COUNT 850 a 720 c 812 g 725 t ORIGIN 1 aagcggcagg agcagcgttg gcaccggcga accatggctg ggattttcta tttcgcccta 61 ttttcgtgtc tcttcgggat ttgcgacgct gtcacaggtt ccagggtata ccccgcgaat 121 gaagttacct tattggattc cagatctgtt cagggagaac ttgggtggat agcaagccct 181 ctggaaggag ggtgggagga agtgagtatc atggatgaaa aaaatacacc aatccgaacc 241 taccaagtgt gcaatgtgat ggaacccagc cagaataact ggctacgaac tgattggatc 301 acccgagaag gggctcagag ggtgtatatt gagattaaat tcaccttgag ggactgcaat 361 agtcttccgg gcgtcatggg gacttgcaag gagacgttta acctgtacta ctatgaatca 421 gacaacgaca aagagcgttt catcagagag aaccagtttg tcaaaattga caccattgct 481 gctgatgaga gcttcaccca agtggacatt ggtgacagaa tcatgaagct gaacaccgag 541 atccgggatg tagggccatt aagcaaaaag gggttttacc tggcttttca ggatgtgggg 601 gcctgcatcg ccctggtatc agtccgtgtg ttctataaaa agtgtccact cacagtccgc 661 aatctggccc agtttcctga caccatcaca ggggctgata cgtcttccct ggtggaagtt 721 cgaggctcct gtgtcaacaa ctcagaagag aaagatgtgc caaaaatgta ctgtggggca 781 gatggtgaat ggctggtacc cattggcaac tgcctatgca acgctgggca tgaggagcgg 841 agcggagaat gccaagcttg caaaattgga tattacaagg ctctctccac ggatgccacc 901 tgtgccaagt gcccacccca cagctactct gtctgggaag gagccacctc gtgcacctgt 961 gaccgaggct ttttcagagc tgacaacgat gctgcctcta tgccctgcac ccgtccacca 1021 tctgctcccc tgaacttgat ttcaaatgtc aacgagacat ctgtgaactt ggaatggagt 1081 agccctcaga atacaggtgg ccgccaggac atttcctata atgtggtatg caagaaatgt 1141 ggagctggtg accccagcaa gtgccgaccc tgtggaagtg gggtccacta caccccacag 1201 cagaatggct tgaagaccac caaagtctcc atcactgacc tcctagctca taccaattac 1261 acctttgaaa tctgggctgt gaatggagtg tccaaatata accctaaccc agaccaatca 1321 gtttctgtca ctgtgaccac caaccaagca gcaccatcat ccattgcttt ggtccaggct 1381 aaagaagtca caagatacag tgtggcactg gcttggctgg aaccagatcg gcccaatggg 1441 gtaatcctgg aatatgaagt caagtattat gagaaggatc agaatgagcg aagctatcgt 1501 atagttcgga cagctgccag gaacacagat atcaaaggcc tgaaccctct cacttcctat 1561 gttttccacg tgcgagccag gacagcagct ggctatggag acttcagtga gcccttggag 1621 gttacaacca acacagtgcc ttcccggatc attggagatg gggctaactc cacagtcctt 1681 ctggtctctg tctcgggcag tgtggtgctg gtggtaattc tcattgcagc ttttgtcatc 1741 agccggagac ggagtaaata cagtaaagcc aaacaagaag cggatgaaga gaaacatttg 1801 aatcaaggtg taagaacata tgtggacccc tttacgtacg aagatcccaa ccaagcagtg 1861 cgagagtttg ccaaagaaat tgacgcatcc tgcattaaga ttgaaaaagt tataggagtt 1921 ggtgaatttg gtgaggtatg cagtgggcgt ctcaaagtgc ctggcaagag agagatctgt 1981 gtggctatca agactctgaa agctggttat acagacaaac agaggagaga cttcctgagt 2041 gaggccagca tcatgggaca gtttgaccat ccgaacatca ttcacttgga aggcgtggtc 2101 actaaatgta aaccagtaat gatcataaca gagtacatgg agaatggctc cttggatgca 2161 ttcctcagga aaaatgatgg cagatttaca gtcattcagc tggtgggcat gcttcgtggc 2221 attgggtctg ggatgaagta tttatctgat atgagctatg tgcatcgtga tctggccgca 2281 cggaacatcc tggtgaacag caacttggtc tgcaaagtgt ctgattttgg catgtcccga 2341 gtgcttgagg atgatccgga agcagcttac accaccaggg gtggcaagat tcctatccgg 2401 tggactgcgc cagaagcaat tgcctatcgt aaattcacat cagcaagtga tgtatggagc 2461 tatggaatcg ttatgtggga agtgatgtcg tacggggaga ggccctattg ggatatgtcc 2521 aatcaagatg tgattaaagc cattgaggaa ggctatcggt taccccctcc aatggactgc 2581 cccattgcgc tccaccagct gatgctagac tgctggcaga aggagaggag cgacaggcct 2641 aaatttgggc agattgtcaa catgttggac aaactcatcc gcaaccccaa cagcttgaag 2701 aggacaggga cggagagctc cagacctaac actgccttgt tggatccaag ctcccctgaa 2761 ttctctgctg tggtatcagt gggcgattgg ctccaggcca ttaaaatgga ccggtataag 2821 gataacttca cagctgctgg ttataccaca ctagaggctg tggtgcacgt gaaccaggag 2881 gacctggcaa gaattggtat cacagccatc acgcaccaga ataagatttt gagcagtgtc 2941 caggcaatgc gaacccaaat gcagcagatg cacggcagaa tggttcccgt ctgagccagt 3001 actgaataaa ctcaaaactc ttgaaattag tttacctcat ccatgcactt taattgaaga 3061 actgcacttt ttttacttcg tcttcgccct ctgaaattaa agaaatg // LOCUS HUMRPZH21 386 bp mRNA PRI 15-MAR-1989 DEFINITION Human ribosomal protein mRNA, complete cds. ACCESSION M15661 NID g337577 KEYWORDS ribosomal protein. SOURCE Human ZR-75-1 mammary tumor cell line, cDNA to mRNA, clone pZH-21. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 386) AUTHORS Davies,M.S., Henney,A., Ward,W.H.J. and Craig,R.K. TITLE Characterisation of an mRNA encoding a human ribosomal protein homologous to the yeast L44 ribosomal protein JOURNAL Gene 45, 183-191 (1986) MEDLINE 87106812 FEATURES Location/Qualifiers source 1..386 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..386 /note="ribosomal protein mRNA" CDS 31..351 /note="ribosomal protein" /codon_start=1 /db_xref="PID:g337578" /translation="MVNVPKTRRTFCKKCGKHQPHKVTQYKKGKDSLYAQGRRRYDRK QSGYGGQTKPIFRKKAKTTKKIVLRLECVEPNCRSKRMLAIKRCKHFELGGDKKRKGQ VIQF" BASE COUNT 133 a 71 c 104 g 78 t ORIGIN 1 tcatatagac aaaacagccc tgctgcaaag atggtcaacg tacctaaaac ccgaagaacc 61 ttctgtaaga agtgtggcaa gcatcagcct cacaaagtga cacagtataa gaagggcaag 121 gattctttgt atgcccaggg aaggaggcgc tatgatcgga agcagagtgg ctatggtggg 181 cagacaaagc caattttccg gaagaaggct aagaccacaa agaagattgt gctaaggctg 241 gaatgtgttg agcctaactg cagatccaag aggatgctgg ccattaagag atgcaagcat 301 tttgaactgg gaggagataa gaagagaaag ggccaagtga tccagttcta aactttggga 361 aaataaatac agtgatattc ttacgc // LOCUS HUMRSC1083 5160 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0010 gene, complete cds. ACCESSION D13635 NID g285982 KEYWORDS KIAA0010. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5160) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (11-NOV-1992) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 5160) AUTHORS Nomura,N., Miyajima,N., Kawarabayasi,Y. and Tabata,S. TITLE Prediction of new human genes by entire sequencing of randomly sampled cDNA clones JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 JOURNAL DNA Res. 1 (1), 27-35 (1994) MEDLINE 96051387 REFERENCE 4 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 (supplement) JOURNAL DNA Res. 1 (1), 47-56 (1994) MEDLINE 96051389 FEATURES Location/Qualifiers source 1..5160 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myoblast" /sex="male" 5'UTR <1..303 gene 304..3555 /gene="KIAA0010" CDS 304..3555 /gene="KIAA0010" /codon_start=1 /db_xref="PID:d1003304" /db_xref="PID:g285983" /translation="MFSFEGDFKTRPKVSLGGASRKEEKASLLHRTQEERRKREEERR RLKNAIIIQSFIRGYRDRKQQYSIQRSAFDRCATLSQSGGAFPIANGPNLTLLVRQLL FFYKQNEDSKRLIWLYQNLIKHSSLFVKQLDGSERLTCLFQIKRLMSLCCRLLQNCND DSLNVALPMRMLEVFSSENTYLPVLQDASYVVSVIEQILHYMIHNGYYRSLYLLINSK LPSSIEYSDLSRVPIAKILLENVLKPLHFTYNSCPEGARQQVFTAFTEEFLAAPFTDQ IFHFIIPALADAQTVFPYEPFLNALLLIESRCSRKSGGAPWLFYFVLTVGENYLGALS EEGLLVYLRVLQTFLSQLPVSPASASCHDSASDSEEESEEADKPSSPEDGRLSVSYIT EECLKKLDTKQQTNTLLNLVWRDSASEEVFTTMASVCHTLMVQHRMMVPKVRLLYSLA FNARFLRHLWFLISSMSTRMITGSMVPLLQVISRGSPMSFEDSSRIIPLFYLFSSLFS HSLISIHDNEFFGDPIEVVGQRQSSMMPFTLEELIMLSRCLRDACLGIIKLAYPETKP EVREEYITAFQSIGVTTSSEMQQCIQMEQKRWIQLFKVITNLVKMLKSRDTRRNFCPP NHWLSEQEDIKADKVTQLYVPASRHVWRFRRMGRIGPLQSTLDVGLESPPLSVSEERQ LAVLTELPFVVPFEERVKIFQRLIYADKQEVQGDGPFLDGINVTIRRNYIYEDAYDKL SPENEPDLKKRIRVHLLNAHGLDEAGIDGGGIFREFLNELLKSGFNPNQGFFKTTNEG LLYPNPAAQMLVGDSFARHYYFLARMLGKALYENMLVELPFAGFFLSKLLGTSADVDI HHLASLDPEVYKNLLFLKSYEDDVEELGLNFTVVNNDLGEAQVVELKFGGKDIPVTSA NRIAYIHLVADYRLNRQIRQHCLAFRQGLANVVSLEWLRMFDQQEIQVLISGAQVPIS LEDLKSFTNYSGGYSADHPVIKVFWRVVEGFTDEEKRKLLKFVTSCSRPPLLGFKELY PAFCIHNGGSDLERLPTASTCMNLLKLPEFYDETLLRSKLLYAIECAAGFELS" 3'UTR 3556..>5160 BASE COUNT 1267 a 1165 c 1269 g 1459 t ORIGIN 1 gttccaggtg caagcgccgg gtttgctgcc cgctgggcgc ccctgcagcg gcccgagctg 61 tggccggcgt ggatgagggg caggcgaggc agggccgccc ctccagtatt gccgcccctc 121 ccgccccagg gcagggctgg gagggtacag cccgggggcg ggctcgggtc gcctcccggc 181 cgccgcgtcc tcgctgcccc gggccgggcg ggcgggcgcc gagagcctcc cagcccgccc 241 cgtgccccgc ccgcccggct gcttccgcgg cggcgctgcc cgcacatggg ctaggctgcc 301 aggatgttca gcttcgaagg cgacttcaag acgcggccca aggtgtccct tggcggcgcg 361 agcaggaagg aggaaaaggc ttctctttta catcgtactc aggaagaaag aagaaagaga 421 gaggaagaaa ggcgaaggtt gaaaaatgca ataattatcc agtcatttat tcgaggctat 481 agagacagaa aacagcaata ttccatccaa agaagtgcat ttgatcgctg tgctaccttg 541 tcacagtccg ggggcgcttt tcccattgct aatggcccca accttaccct tttggtaagg 601 cagcttctgt ttttttacaa acaaaatgaa gactcaaaac gtttgatatg gctgtatcag 661 aacttaatta aacacagctc tctgtttgtc aagcagttgg atggatctga gagacttaca 721 tgcttatttc agataaaaag attgatgagc ctctgttgca ggttgctgca aaactgtaat 781 gatgacagtt tgaatgttgc acttccaatg agaatgcttg aagtattttc gtctgagaat 841 acttacttgc ctgttttaca agatgctagc tatgtggtgt cagtgattga acaaattttg 901 cactacatga ttcacaatgg gtattatagg tctctatatt tgttgattaa cagcaagctt 961 ccatcaagta ttgaatattc tgatttatct cgagttccta tagcaaaaat tttgctagag 1021 aatgttctaa aaccattgca ctttacttac aactcctgtc cggaaggtgc gaggcaacaa 1081 gtttttacag ccttcacaga ggagtttctg gcagcacctt ttacagatca gatttttcat 1141 ttcatcattc cggcgcttgc agatgcgcag accgttttcc cttacgagcc ctttctgaat 1201 gcactgttgt taatagagag tagatgttca agaaagagtg gtggagcacc ctggcttttc 1261 tatttcgttt taactgttgg cgaaaattat ttgggggccc tctctgagga agggctgctg 1321 gtgtatttgc gggtgctgca gaccttcctc tctcagttac cagtctctcc tgccagcgcg 1381 agctgtcacg actcagccag tgactctgag gaggagagtg aagaagccga caagccctca 1441 agcccggagg atggcagact gtcagtatca tacataacag aggaatgcct gaagaagctg 1501 gacacaaagc agcagaccaa caccctgctc aacctggtgt ggagggactc tgcgagcgag 1561 gaggtcttca ccaccatggc ctccgtctgc cacacgctga tggtgcagca ccgcatgatg 1621 gtacccaaag tcaggcttct ctacagttta gcctttaatg ccaggtttct gagacatctt 1681 tggtttctaa tatcttccat gtcaacacgg atgatcacag ggtctatggt accgttgctt 1741 caggtgatat ccaggggttc tcctatgtct tttgaagatt ctagtcgaat catcccactc 1801 ttttaccttt ttagctcctt gtttagtcat tcactaattt ccatacatga taacgaattc 1861 ttcggtgatc ccatagaagt tgtaggtcaa agacaatcat caatgatgcc ttttacttta 1921 gaagagctga taatgttgtc tcgatgcctt cgagatgcat gcctggggat catcaagttg 1981 gcttatccag aaaccaagcc agaagttcga gaagaatata ttacagcatt tcagagtatt 2041 ggagttacta ctagctctga aatgcaacaa tgcatacaga tggaacagaa aagatggatt 2101 cagttattta aggttatcac caatctagtg aaaatgttga agtccagaga cacgaggaga 2161 aatttttgtc ctccaaacca ctggctgtca gaacaagaag atattaaagc agataaggtc 2221 actcagctct atgtgccagc atccagacat gtgtggaggt tccggcggat ggggaggata 2281 ggcccgctgc agtccaccct ggacgtgggt ttggagtccc cgccgctgtc tgtgtctgag 2341 gaaagacagc ttgctgtcct gacagagttg ccttttgtgg ttccatttga ggaacgagta 2401 aagatctttc agaggttgat ttatgcagat aagcaagaag ttcaaggaga tggtccattt 2461 ctggatggaa ttaatgtcac aataagaaga aattacattt atgaagatgc ttatgacaaa 2521 ctttctccag aaaatgagcc tgatttgaaa aagcggatcc gtgtgcactt gctcaatgcc 2581 catggcctgg atgaagctgg cattgatggt ggtggtattt tcagagagtt tttaaatgaa 2641 ctactgaagt caggatttaa ccccaaccag gggttcttta agactactaa tgaagggctt 2701 ctgtacccca acccggctgc tcagatgctt gtgggagatt cttttgccag acattactac 2761 ttcctagcca gaatgcttgg aaaggctctc tatgagaaca tgctggtgga gctgcccttt 2821 gcaggcttct ttctttccaa gttgcttgga accagtgccg acgtggacat tcaccacctc 2881 gcctccctag accctgaggt gtataagaat ttgctctttc tgaagagcta cgaagacgat 2941 gtggaggagc ttgggctgaa cttcactgtg gtgaacaatg acctgggaga ggcgcaggta 3001 gttgaactaa aattcggtgg gaaagacatc cctgtcacca gcgccaaccg gattgcgtac 3061 atccacttgg tggcagacta caggctgaac aggcagatcc gccagcactg cctggctttc 3121 cgccagggcc ttgccaatgt cgtcagcctc gagtggctcc gaatgtttga tcagcaagaa 3181 attcaggtat taatttctgg tgcacaagtt cccataagcc tagaggacct aaaatccttt 3241 acaaactatt caggaggcta ttctgcagac catcctgtta ttaaggtctt ctggagagtt 3301 gtggaagggt tcactgatga agaaaagcgc aaactgctga agtttgtaac aagctgctct 3361 cgaccccctc tcttggggtt taaggagttg tatcccgcat tttgtattca caacggaggc 3421 tccgaccttg agcggctccc cacagccagc acctgcatga acctgctgaa gctccccgag 3481 ttctatgacg agacactttt gcgaagtaaa cttctctatg cgattgaatg tgccgctggc 3541 tttgagctga gctgaagctg atgctggggt cagaccccta cagagaacca gtgcttcctt 3601 cgtcagcagc gcctccccag acccacgagg atactcacac tgcacgcctg aggctctcct 3661 aagctccttc tttcattctg ccattcctcc ctcccttcct tttttaaatg atttttatta 3721 cggtgtggtc acttatttag atggacattg cttttcaaat aacttaaaat aacacgttat 3781 gtgccatgtg gctactttag taatattgcc aagaagagca cagtttttac actagtggca 3841 tctcagtgaa attaaccaaa gatgaagctt tggctttgct ggtgagatca gagccctcct 3901 gagcaggcag cgccactcca gggttcagac agggctgcac aggcggcaga gatacagggt 3961 ctgagggctg agacgccatg gggccgctgc tgcttatgtg gttggattgt ttacaagcct 4021 cattattaaa actgaaggca tttttttttt ctgctgcctt tcccaaagtg gttaggtttg 4081 gaaaagagat gatgatggta atattttatt tgtgcttttt aagccatttc cccaaatggg 4141 actagcatgc ttgttttcag tataccgtgg cctgcctcat gatggtttgg agatactgtc 4201 tgtggatgtg aggtggggac ttcattcatt gtcctatttc tatctccact ttgtgcctgg 4261 agagctttca ggggaggtgg aggaggaggg tctgccaagc tactgcaaca tctgtcaccc 4321 actataccca gttacttggg ggaggacaga cactgtggtg tcattaaagt tgtttgaacc 4381 aaagtggcgg ctgcatcttt gtcccgatgc tagccgtgcc ggtctcccat catccgctcg 4441 ccctcctttc ccctgggctg cgcccacttg tcttcctgga tatttggggg tgactcgcca 4501 tgcttggcac cctctgcttc ctggtgctgc tctgactcga agacgggaca gtccctggtg 4561 cacatccagg gaagaggagt gtcggtagtt cttgcagtag gcactttatc aggacctgac 4621 ctgttgctgg gtgattttag tctctacaaa cagaaagcgt ttcaaagcgt cagctgtggg 4681 agcagagtga ccctttgctg atgctggggg gaggggatct aaatcctcat ttatctcttc 4741 tatgtctagt attttactgt cactggaggc tctgtgggct gtcatagtta attgaccata 4801 attagcaata tacttttaaa gtgggaaagc tgaatgacac ttttaagaca atgaacatta 4861 tcaaaacaaa atgtataatt tcttaatttg aataataagc gtttaaatgc tatttgtagt 4921 cttgatatac agaaataaaa taattagggt tggtcttttt tattttaggt tttatgttga 4981 atgttctata tcttattagt taatttgtat attttattag tattttggaa atagcatatc 5041 tgagactgag gagaaattga caattcactt atttgtggtt tttttctcag ctattctgag 5101 cttatttatt tatttgtatg ttctaatggc taaacattta cattaaatat tttttttccc // LOCUS HUMRSC314 2480 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0009 gene, complete cds. ACCESSION D13634 NID g285992 KEYWORDS KIAA0009. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2480) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (11-NOV-1992) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 2480) AUTHORS Nomura,N., Miyajima,N., Kawarabayasi,Y. and Tabata,S. TITLE Prediction of new human genes by entire sequencing of randomly sampled cDNA clones JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 JOURNAL DNA Res. 1 (1), 27-35 (1994) MEDLINE 96051387 REFERENCE 4 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 (supplement) JOURNAL DNA Res. 1 (1), 47-56 (1994) MEDLINE 96051389 FEATURES Location/Qualifiers source 1..2480 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myoblast" /sex="male" 5'UTR <1..17 gene 18..962 /gene="KIAA0009" CDS 18..962 /gene="KIAA0009" /codon_start=1 /db_xref="PID:d1003303" /db_xref="PID:g285993" /translation="MQSVLWSRKPYGSSRSIVRKIGTNLSLIQCPRVQFQINSHATEW SPSHPGEDAVASFADVGWVAKEEGECSARLRTEVRSRPPLQDDLLFFEKAPSRQISLP DLSQEEPQLKTPALANEEALQKICALENELAALRAQIAKIVTQQEQQNLTAGDLDSTT FGTIPPHPPPPPPPLPPPALGLHQSTSAVDLIKERREKRANAGKTLVKNNPKKPEMPN MLEILKEMNSVKLRSVKRSEQDVKPKPVDATDPAALIAEALKKKFAYRYRSDSQDEVE KGIPKSESEATSERVLFGPHMLKPTGKMKALIENVSDS" 3'UTR 963..>2480 BASE COUNT 774 a 441 c 505 g 760 t ORIGIN 1 aacaagttgg agtaagcatg caatcggtac tttggtctag gaagccatat ggttcgtctc 61 gaagtatcgt aaggaaaatt ggtactaatt tgtctctgat tcagtgtcca agagttcagt 121 ttcagattaa cagccatgca acagaatgga gtcccagcca cccaggagag gatgcagtgg 181 cgtcttttgc tgatgttgga tgggtagcca aagaagaagg agagtgttca gcaagactaa 241 ggacagaggt cagatcaagg ccaccccttc aggatgacct tcttttcttt gagaaggccc 301 caagcagaca gatttcctta ccagacttgt ctcaagaaga gcctcagctg aagaccccag 361 cgctggcaaa tgaggaagca ctgcagaaga tttgcgctct cgaaaatgaa cttgctgctc 421 tcagagctca gattgccaaa attgtgaccc agcaggagca gcaaaatctc actgcaggtg 481 acttagattc taccacattt ggtaccatac caccacaccc tccacctccc ccaccgcccc 541 tgcctccccc tgcactgggg ctccaccaaa gtacatctgc tgttgatctg attaaagaac 601 gaagagagaa aagagccaat gctggaaaga ctttggttaa gaacaatcca aagaaacctg 661 aaatgccaaa tatgctagag atccttaaag agatgaacag tgtaaaactt cggtcagtga 721 agaggtcaga gcaagatgtg aagcccaagc cagtggatgc tactgaccct gctgccctca 781 tagcagaggc tctgaaaaag aaatttgctt atcggtatcg aagtgatagc caagatgaag 841 ttgaaaaagg aattccaaag tctgaatcag aggccacctc agagagagtg ttgtttgggc 901 cacacatgtt gaagccaaca ggaaaaatga aggctttaat tgaaaatgta tcagactcct 961 aatagacaat gagctgcgaa aagactcctg gttcccctgt tgatttgtga gggccaagtt 1021 tgctagtaga aatcgacact gtttagtaaa tacctcttta gtattcagtg gtcttctttt 1081 caggctaatt agtggattaa gcaataatga aagcactaag tttggttttg cttttgtgag 1141 atggtcagct ttggtgctct ccacaacatg tgtgttctga catgtttcta atatgtggcc 1201 agggcgttca gatttccagt tttgaaaaca attgtataga tttcacaaca caaaaaggac 1261 atttgtggat gttactgcac attttaaatt cttaacacta atttatctgt ataagtgttt 1321 tatatgcata tttttggaca taaacagttt atgtaaaatt agtaatgaat gatggcaacg 1381 agggcactgt tatcttcgtt tgttttcaat gatcatttag cattcaatga tggaacagct 1441 ggtataacat aagttgttgg catgaaatat ttgagattgg aaacttcttg ccttgaacag 1501 aacttatatc ttagattctc tctcacattt tcttggagct ggggtttgaa taggaaccag 1561 atgatgttca ctgctgaaat tccataatgc ttcccattga agggaagttg agaaccagga 1621 aagctgcttt cacgtcattg ccatccagta ctgacaggga agaaagatgt agttttccag 1681 tagtgatgaa tcaaattatt gaattaaatt tcttcttaag aagtaaaaac tcagaatgta 1741 ccatcttgtt tcctttcagt ttattaaatg gcatcataaa gatgactttg ctaagttaat 1801 agagttaaaa atttttttta atataagcca aaatattaac tttaatgaaa cattgacttg 1861 gcaaatgaat ttcctataaa ttatcattgg tcagaatgct gttttgttta ataatattac 1921 gcaacataaa ccataggtgt tattagaagt ggagaactgc ctttttcatc tggggctttt 1981 aaggacttgc tataaatgat tattttttaa atgctttata aatcttcatg gtttttctat 2041 ttctgataca ctcagctata gttaatacca gagtatccta ccaggagtaa tatttggaat 2101 atttaaatct agtaaaagaa gaaagttgta cttcctggct gggagtatta ggagatggga 2161 gtagagattc acttttaagt tcttgaaaat atatgcattc tcctaaatat taacaaaaat 2221 gatttgggga aatgacatgg cttgattgtt ctgtttaaat ttgtactgtg gcttatgtta 2281 cacatgttca tgttcacctc tcattcacct gttttatatg gtttaaaatt ctctttaaca 2341 aaattcagaa aattcacctg aaacgtattt tgacctaaaa gaaacatatt tttgtatcag 2401 tattgaattt tggacagtgc ccccatataa ggaagttact gttttaaaat aaagcaaact 2461 aactgtttta ttttccttgg // LOCUS HUMRSC338 2416 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0001 gene, complete cds. ACCESSION D13626 NID g285994 KEYWORDS KIAA0001. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2416) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (11-NOV-1992) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 2416) AUTHORS Nomura,N., Miyajima,N., Kawarabayasi,Y. and Tabata,S. TITLE Prediction of new human genes by entire sequencing of randomly sampled cDNA clones JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Gerard,C. and Gerard,N.P. TITLE C5A anaphylatoxin and its seven transmembrane-segment receptor JOURNAL Annu. Rev. Immunol. 12, 775-808 (1994) MEDLINE 94280786 REFERENCE 4 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 JOURNAL DNA Res. 1 (1), 27-35 (1994) MEDLINE 96051387 REFERENCE 5 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 (supplement) JOURNAL DNA Res. 1 (1), 47-56 (1994) MEDLINE 96051389 FEATURES Location/Qualifiers source 1..2416 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myeloblast" /sex="male" 5'UTR <1..216 gene 217..1233 /gene="KIAA0001" CDS 217..1233 /gene="KIAA0001" /codon_start=1 /db_xref="PID:d1003296" /db_xref="PID:g285995" /translation="MINSTSTQPPDESCSQNLLITQQIIPVLYCMVFIAGILLNGVSG WIFFYVPSSKSFIIYLKNIVIADFVMSLTFPFKILGDSGLGPWQLNVFVCRVSAVLFY VNMYVSIVFFGLISFDRYYKIVKPLWTSFIQSVSYSKLLSVIVWMLMLLLAVPNIILT NQSVREVTQIKCIELKSELGRKWHKASNYIFVAIFWIVFLLLIVFYTAITKKIFKSHL KSSRNSTSVKKKSSRNIFSIVFVFFVCFVPYHIARIPYTKSQTEAHYSCQSKEILRYM KEFTLLLSAANVCLDPIIYFFLCQPFREILCKKLHIPLKAQNDLDISRIKRGNTTLES TDTL" 3'UTR 1234..>2416 BASE COUNT 782 a 476 c 405 g 753 t ORIGIN 1 gaacagtgtt accttggagc ctacaatgag aggtatttca aaatgagtga agcatgactc 61 tcacagatga aggcctagac gcaggatctt taatggaaaa acacttgggc cacttcaaga 121 cgacaaacgc tcactgggca aaacaccttc actgaaaaga gacctcatat tatgcaaaaa 181 aaatcttaag aggcctctgc cttcagaagt tacaagatga tcaattcaac ctccacacag 241 cctccagatg aatcctgctc tcagaacctc ctgatcactc agcagatcat tcctgtgctg 301 tactgtatgg tcttcattgc gggaatccta ctcaatggag tgtcaggatg gatattcttt 361 tacgtgccca gctctaagag tttcatcatc tatctcaaga acattgttat tgctgacttt 421 gtgatgagcc tgacttttcc tttcaagatc cttggtgact caggccttgg tccctggcag 481 ctgaacgtgt ttgtgtgcag ggtctctgcc gtgctcttct acgtcaacat gtacgtcagc 541 attgtgttct ttgggctcat cagctttgac aggtattata aaattgtaaa gcctctttgg 601 acttctttca tccagtcagt gagttacagc aaacttctgt cagtgatagt atggatgctc 661 atgctcctcc ttgctgttcc aaatattatt ctcaccaacc agagtgttag ggaggttaca 721 caaataaaat gtatagaact gaaaagtgaa ctgggacgga agtggcacaa agcatcaaac 781 tacatcttcg tggccatctt ctggattgtg tttcttttgt taatcgtttt ctatactgct 841 atcacaaaga aaatctttaa gtcccacctt aagtcaagtc ggaattccac ttcggtcaaa 901 aagaaatcta gccgcaacat attcagcatc gtgtttgtgt tttttgtctg ttttgtacct 961 taccatattg ccagaatccc ctacacaaag agtcagaccg aagctcatta cagctgccag 1021 tcaaaagaaa tcttgcggta tatgaaagaa ttcactctgc tactatctgc tgcaaatgta 1081 tgcttggacc ctattattta tttctttcta tgccagccgt ttagggaaat cttatgtaag 1141 aaattgcaca ttccattaaa agctcagaat gacctagaca tttccagaat caaaagagga 1201 aatacaacac ttgaaagcac agatactttg tgagttccta ccctcttcca aagaaagacc 1261 acgtgtgcat gttgtcatct tcaattacat aacagaaatc aataagatat gtgccctcat 1321 cataaatatc atctctagca ctgccatcca atttagttca ataaaattca aatataagtt 1381 tccatgcttt tttgtaacat caaagaaaac atacccatca gtaatttctc taatactgac 1441 ctttctattc tctattaata aaaaattaat acatacaatt attcaattct attatattaa 1501 aataagttaa agtttataac cactagtctg gtcagttaat gtagaaattt aaatagtaaa 1561 taaaacacaa cataatcaaa gacaactcac tcaggcatct tctttctcta aataccagaa 1621 tctagtatgt aattgttttc aacactgtcc ttaaagacta acttgaaagc aggcacagtt 1681 tgatgaaggg ctagagagct gtttgcaata aaaagtcagg tttttttcct gatttgaaga 1741 agcaggaaaa gctgacaccc agacaatcac ttaagaaacc ccttattgat gtatttcatg 1801 gcactgcaaa ggaagaggaa tattaattgt atacttagca agaaaatttt ttttttctga 1861 tagcactttg aggatattag atacatgcta aatatgtttt ctacaaagac ttacgtcatt 1921 taatgagcct ggggttctgg tgttagaata tttttaagta ggctttactg agagaaacta 1981 aatattggca tacgttatca gcaacttccc ctgttcaata gtatgggaaa aataagatga 2041 ctgggaaaaa gacacaccca caccgtagaa catatattaa tctactggcg aatgggaaag 2101 gagaccattt tcttagaaag caaataaact tgattttttt aaatctaaaa tttacattaa 2161 tgagtgcaaa ataacacata aaatgaaaat tcacacatca catttttctg gaaaacagac 2221 ggattttact tctggagaca tggcatacgg ttactgactt atgagctacc aaaactaaat 2281 tctttctctg ctattaactg gctagaagac attcatctat ttttcaaatg ttctttcaaa 2341 acatttttat aagtaatgtt tgtatctatt tcatgcttta ctgtctatat actaataaag 2401 aaatgtttta atactg // LOCUS HUMRSC390 4186 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0018 gene, complete cds. ACCESSION D13643 NID g285996 KEYWORDS KIAA0018. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4186) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (11-NOV-1992) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 4186) AUTHORS Nomura,N., Miyajima,N., Kawarabayasi,Y. and Tabata,S. TITLE Prediction of new human genes by entire sequencing of randomly sampled cDNA clones JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 JOURNAL DNA Res. 1 (1), 27-35 (1994) MEDLINE 96051387 REFERENCE 4 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 (supplement) JOURNAL DNA Res. 1 (1), 47-56 (1994) MEDLINE 96051389 FEATURES Location/Qualifiers source 1..4186 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myoblast" /sex="male" 5'UTR <1..38 gene 39..1211 /gene="KIAA0018" CDS 39..1211 /gene="KIAA0018" /codon_start=1 /db_xref="PID:d1003311" /db_xref="PID:g285997" /translation="MEPAVSLAVCALLFLLWVRLKGLEFVLIHQRWVFVCLFLLPLSL IFDIYYYVRAWVVFKLSSAPRLHEQRVRDIQKQVREWKEQGSKTFMCTGRPGWLTVSL RVGKYKKTHKNIMINLMDILEVDTKKQIVRVEPLVTMGQVTALLTSIGWTLPVLPELD DLTVGGLIMGTGIESSSHKYGLFQHICTAYELVLADGSFVRCTPSENSDLFYAVPWSC GTLGFLVAAEIRIIPAKKYVKLRFEPVRGLEAICAKFTHESQRQENHFVEGLLYSLDE AVIMTGVMTDEAEPSKLNSIGNYYKPWFFKHVENYLKTNREGLEYIPLRHYYHRHTRS IFWELQDIIPFGNNPIFRYLFGWMVPPKISLLKLTQGETLRKCTSSTTWCRTCWCP" 3'UTR 1212..>4186 BASE COUNT 910 a 1153 c 1134 g 989 t ORIGIN 1 ggcgcgaacc cgcagcgctt accgcgcggc gccgcaccat ggagcccgcc gtgtcgctgg 61 ccgtgtgcgc gctgctcttc ctgctgtggg tgcgcctgaa ggggctggag ttcgtgctca 121 tccaccagcg ctgggtgttc gtgtgcctct tcctcctgcc gctctcgctt atcttcgata 181 tctactacta cgtgcgcgcc tgggtggtgt tcaagctcag cagcgctccg cgcctgcacg 241 agcagcgcgt gcgggacatc cagaagcagg tgcgggaatg gaaggagcag ggtagcaaga 301 ccttcatgtg cacggggcgc cctggctggc tcactgtctc actacgtgtc gggaagtaca 361 agaagacaca caaaaacatc atgatcaacc tgatggacat tctggaagtg gacaccaaga 421 aacagattgt ccgtgtggag cccttggtga ccatgggcca ggtgactgcc ctgctgacct 481 ccattggctg gactctcccc gtgttgcctg agcttgatga cctcacagtg gggggcttga 541 tcatgggcac aggcatcgag tcatcatccc acaagtacgg cctgttccaa cacatctgca 601 ctgcttacga gctggtcctg gctgatggca gctttgtgcg atgcactccg tccgaaaact 661 cagacctgtt ctatgccgta ccctggtcct gtgggacgct gggtttcctg gtggccgctg 721 agatccgcat catccctgcc aagaagtacg tcaagctgcg tttcgagcca gtgcggggcc 781 tggaggctat ctgtgccaag ttcacccacg agtcccagcg gcaggagaac cacttcgtgg 841 aagggctgct ctactccctg gatgaggctg tcattatgac aggggtcatg acagatgagg 901 cagagcccag caagctgaat agcattggca attactacaa gccgtggttc tttaagcatg 961 tggagaacta tctgaagaca aaccgagagg gcctggagta cattcccttg agacactact 1021 accaccgcca cacgcgcagc atcttctggg agctccagga catcatcccc tttggcaaca 1081 accccatctt ccgctacctc tttggctgga tggtgcctcc caagatctcc ctcctgaagc 1141 tgacccaggg tgagaccctg cgcaagtgta cgagcagcac cacgtggtgc aggacatgct 1201 ggtgcccatg aagtgcctgc agcaggccct gcacaccttc caaaacgaca tccacgtcta 1261 ccccatctgg ctgtgtccgt tcatcctgcc cagccagcca ggcctagtgc accccaaagg 1321 aaatgaggca gagctctaca tcgacattgg agcatatggg gagccgcgtg tgaaacactt 1381 tgaagccagg tcctgcatga ggcagctgga gaagtttgtc cgcagcgtgc atggcttcca 1441 gatgctgtat gccgactgct acatgaaccg ggaggagttc tgggagatgt ttgatggctc 1501 cttgtaccac aagctgcgag agaagctggg ttgccaggac gccttccccg aggtgtacga 1561 caagatctgc aaggccgcca ggcactgagc tggagcccgc ctggagagac agacacgtgt 1621 gagtggtcag gcatcttccc ttcactcaag cttggctgct ttcctagatc cacactttca 1681 aagagaaacc cctccagaac tcccaccctg acagcccaac accaccttcc tcctggcttc 1741 cagggggcag cccagtggaa tggaaagaat gtgggatttg gagtcagaca agcctgagtc 1801 cagttccccg tttagaactc attagctgtg tgactctggg tgagtccctt aacccctctg 1861 agcccgggtc tcttcattag ttgaaaggga tagtaatacc tacttgcagg ttgttgtcat 1921 ctgagttgag cactggtcac attgaaggtg ctgggtaagt ggtagctctt gttgcttccc 1981 gttcagcgtc acatctgcag tggagcctga aaaggctcca cattaggtca cctgtgcaca 2041 gccatggctg gaatgatgaa ggggatacgc tggagttgcc ctgccatcgc ctccatcagc 2101 cagacgaggt cctcacagga gaaggacagc tcttccccac cctgggatct caggagggca 2161 gccacggagt ggggaggccc cagatgcgct gtgccaaagc caggtccgag gccaaagttc 2221 tccctgccat ccttggtgcc gtcctgcccc ttcctccttc atgcctgggc ctgcaggccc 2281 accccagcca ccactgagtc cactcggagt gccctgtgtt cctggagaag gcattccagg 2341 gttgaatctt gtcccagcct cagcctggga cacctaggtg gagagagtgg tctccgctct 2401 gaattggatc caggggacct gggctcattc ttcttggctc accaaccctg caggcctcat 2461 ctttcccaaa acccactttg tcttggtggg agtgggtccg cgctgctctg cagcaggggc 2521 tggggagtgg acagcatcag gtgggaaagt ggagtccacc ctcatgtttc tgtaggattc 2581 tcaccgtggg gctggaagaa aagagcatcg acttgatttc tccaaccact catccctctt 2641 tttctttctt ccaccactcc ccaccccagc tgtagttaat ttcagtgcct tacaaatcct 2701 aagctcagag aaagttccat ttccgttcca gagggaaggg aacctcccta ggtccttccc 2761 tggcttgtta taacgcaaag cttggttgtt tatgcaactc tatcttaaga actgcccagc 2821 ctcagctgaa aacccgaatc tgagaaggaa ttgcgtcatg taagggaagc tggaattaag 2881 ggagctgagc cagtcatggt tgtggcgtgt gagtcaggag acctaggttt cagcccctct 2941 ctactgtcag cgagctgtgc aacgtgggca agtcattgtc ctctgagctg cagtttcctc 3001 atctgtcaca tcgctacaga caagacctcc ctggaaccct tctgattgtc ttagacactg 3061 tggttgcaaa acccacggaa agcctcattt gtgtggaaag tcagaggaaa aatgatccag 3121 tggacacttg gggattatct gtcattcaag atccttcctt caaccccaag gccagctccc 3181 atctcatttc cagaaaggct catacctggc ttgcagggaa gcatctgtct tgtcattcca 3241 ggtgccagaa tcctctcaga gtcattgaag ggtgttcacc catcccaccc aaggcttggc 3301 acactgccag tgtcttagca gggtcttgtg agggctgggg gcatccaggc actcagaagg 3361 caaaggaacc accctaccca tttggcctct ggagggggca gaagaaagaa agaaacctca 3421 tcctatattt tacaaagcat gtgaattctg gcattagctc tcataggaga cccatgtgct 3481 tccttgctca gtgcaaaact gatgattcta cttgctgtag atgaatggtt aacacgagct 3541 agttaaacag tgccattgtt ttgccagtga agcctccaac cctaagccac tgggacggtg 3601 gccagagatg ccagcagcct ctgtcgccct tagtcatata accaaaatcc agaccttatc 3661 cacaacccgg ggcttggaaa ggaaggtatt ttggaatcac accctccggt tatgttgctc 3721 cagtaaaatc ttgcctggaa agaggcagtc ttcttagcat ggtgagctga gttcatggct 3781 tttttttgta gccagtcctg tccctggcca tccatgtgat ggttttggat ggagttaaac 3841 ttgatgccag tgggcagtgc atgtggaaag tatcagagta agcctctccc ctccagagcc 3901 ctgagtttct tggctgcatg aaggttttct ttagaatcag aattgtagcc agtttctttg 3961 gccagaagga tgaatacttg gatattactg aaagggaggg gtggagatgg gtgtggcagt 4021 gtatggtgtg tgatttttat tttcttcttt ggtcatgggg gccaaggaga aaggcatgaa 4081 tcttccctgt caggctctta cagccacagg cactgtgtct actgtctgga agacatgtcc 4141 ccgtggctgt ggggccgctg cttctgttta aataaaagtg gcctgg // LOCUS HUMRSC399 4753 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0017 gene, complete cds. ACCESSION D13642 NID g285998 KEYWORDS KIAA0017. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4753) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (11-NOV-1992) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 4753) AUTHORS Nomura,N., Miyajima,N., Kawarabayasi,Y. and Tabata,S. TITLE Prediction of new human genes by entire sequencing of randomly sampled cDNA clones JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 JOURNAL DNA Res. 1 (1), 27-35 (1994) MEDLINE 96051387 REFERENCE 4 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 (supplement) JOURNAL DNA Res. 1 (1), 47-56 (1994) MEDLINE 96051389 FEATURES Location/Qualifiers source 1..4753 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myoblast" /sex="male" 5'UTR <1..136 gene 137..1336 /gene="KIAA0017" CDS 137..1336 /gene="KIAA0017" /codon_start=1 /db_xref="PID:d1003310" /db_xref="PID:g285999" /translation="MAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQW ASVIRVMNPIQGNTLDLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSV AGGFVYTYKLVNNGEKLEFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVYDLGKKKLL RKCENKHIANYISGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTA SLLDYDTVAGADKFGNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMN YHVGETVLSLQKTTLIPGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEH PPLCGRDHLSFRSYYFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLED IRTRYAF" 3'UTR 1337..>4753 BASE COUNT 1162 a 1137 c 1194 g 1260 t ORIGIN 1 cttcaatcaa gtagccttcc cactgcagta cacacccagg aaatttgtca tccaccctga 61 gagtaacaac cttattatca ttgaaacgga ccacaatgcc tacactgagg ccacgaaagc 121 tcagagaaag cagcagatgg cagaggaaat ggtggaagca gcaggggagg atgagcggga 181 gctggccgca gagatggcag cagcattcct caatgaaaac ctccctgaat ccatctttgg 241 agctcccaag gctggcaatg ggcagtgggc ctctgtgatc cgagtgatga atcccattca 301 agggaacaca ctggaccttg tccagctgga acagaatgag gcagctttta gtgtggctgt 361 gtgcaggttt tccaacactg gtgaagactg gtatgtgctg gtgggtgtgg ccaaggacct 421 gatactaaac ccccgatctg tggcaggggg cttcgtctat acttacaagc ttgtgaacaa 481 tggggaaaaa ctggagtttt tgcacaagac tcctgtggaa gaggtccctg ctgctattgc 541 cccattccag gggagggtgt tgattggtgt ggggaagctg ttgcgtgtct atgacctggg 601 aaagaagaag ttactccgaa aatgtgagaa taagcatatt gccaattata tctctgggat 661 ccagactatt ggacataggg taattgtatc tgatgtccaa gaaagtttca tctgggttcg 721 ctacaagcgt aatgaaaacc agcttatcat ctttgctgat gatacctacc cccgatgggt 781 cactacagcc agcctcctgg actatgacac tgtggctggg gcagacaagt ttggcaacat 841 atgtgtggtg aggctcccac ctaacaccaa tgatgaagta gatgaggatc ctacaggaaa 901 caaagccctg tgggaccgtg gcttgctcaa tggggcctcc cagaaggcag aggtgatcat 961 gaactaccat gtcggggaga cggtgctgtc cttgcagaag accacgctga tccctggagg 1021 ctcagaatca cttgtctata ccaccttgtc tggaggaatt ggcatccttg tgccattcac 1081 gtcccatgag gaccatgact tcttccagca tgtggaaatg cacctgcggt ctgaacatcc 1141 ccctctctgt gggcgggacc acctcagctt tcgctcctac tacttccctg tgaagaatgt 1201 gattgatgga gacctctgtg agcagttcaa ttccatggaa cccaacaaac aaaagaacgt 1261 ctctgaagaa ctggaccgaa ccccacccga agtgtccaag aaactcgagg atatccggac 1321 ccgctacgcc ttctgagccc tcctttcccg gtggggcttg ccagagactg tgtgttttgt 1381 ttcccccacc accatcactg ccacctggct tctgccatgt ggcaggaggg tgactggata 1441 attaagactg cattatgaaa gtcaacagct ctttcccctc agctcttctc ctggaatgac 1501 tggcttcccc tcaaattggc actgagattt gctacacttc tccccacctg gtacatgata 1561 catgacccca ggttccagtg tagaacctga gtcccccatt ccccaaagcc atccctgcat 1621 tgatatgtct tgactctcct gtctactttt gcacacaccc ttaattttta attggttttc 1681 ttgtaaatac agttttgtac aatgttatct ctgtgggagg aaggaggcag gctgtggtgg 1741 gactgggtag ggtatagtat cactcctgag ttccactgct ctagaatcta accagaaata 1801 gaaacctagt ttttaaggtg actggcatcc atgtgtcttg ttctggagat gaggatgtag 1861 gtgggaggtt tgaacccaag ttagagcagg aagaactgag tagactcctt ccttccagat 1921 accgacttgg acttgcggca ctctgtggct ccccaccccc aggtctgtgg tggtttcttt 1981 gttttttcct ggttcttttt gctgtgctga tgaaacatga cctcaataac catgtgtata 2041 cccacccctc ttcccactgg gtattgagga agggtggctg attcttcctc ctcttctact 2101 ctgaggatgt tagtatgggg attttagcat gaattccagc tggggagtct taacagatgc 2161 cccttttact gatagagcac ctaaagcgat ctttggctcc ataggaccat aggaagggtc 2221 agtacagaag aacctagata ctgccctgcc cctgagaact gtgtatatgt ggggcctgtc 2281 tgcagcaccc atctcaggtg ggttccagag ggcctttagg gtataatgag agcctgttag 2341 gtggaagagg cccagttcca gaaatgttcc agcccacccc tgagaattcc tcctgtttag 2401 ttgtgtggga agccctcgtc ttccaggctg tccttgcgcc ttgaacctgg agaagtgagc 2461 tcactgttct caatacttca caaatgtaaa actttctttc gtctgcatgt gctcagccat 2521 ctaaattgag caaatgatct ggtgagcact gggttagaat caggaatggt ggaatacaat 2581 ctgaacctct cagagcccag aacagagggt tcctgacact gtgacactgt ctcctggaac 2641 taagtatctc ttgaatcatg acttggtttt agatcagtca agagagaccc aggttttgcc 2701 aggaatcgaa tccctaaata acatgttttt ttctcactta gctcatgaat ttgcatagta 2761 gacagtagtt ctgaattaga ttttgaaaac ctaatttcag ggctcatttt ttcctgtggc 2821 cctaaatcca ttctatcaaa ttgtgtgata ctgacatgca gtcatctgag gaactcagcg 2881 tagatacttg agcagctcct cgcctctttt ctaactcaag tttgactaaa atacatacac 2941 tccgtacaga aggtaggggg ttatgtaaga aaggaaaacc taatctatgg aatcaggagt 3001 tgtcaccacc gagcttcctc tggaagtctg cccatcagct tgcttgttct ctgttaagag 3061 gaagggctag gacaaggatt tgggcttgaa tatgtggaaa ggaattttca tagttgttgc 3121 tgcaggacct acaaaagttt aaaattagat tggatgtgac tcaatgacaa gtcccatctg 3181 tgtaattgtt aaggggacct gattgactcc tgtggtttga ttgagcaacc aggtaaatag 3241 agacctctct ccagctttgg caaaacccat cagaggctgc tgcagaactc agacagaggg 3301 atctgccctt gggtttgctt ccatcctgtt ccattgctaa gcccttgtga cttggatcct 3361 aggactgaaa agtttttagc tgcctcagct ttcccctgac cttactggca gaggttctgc 3421 agatgtttcc tttggaagat ctcttgccaa gaatagcatt cctttggagg aggggggttc 3481 tagttggaat gttgcttttc ttggttagtg taaatgtatt gctagtgaga cagctgccgg 3541 cgctggaaaa ggctcgtctc acagggagag tgctggtccc cagaatgtgt gctgttccca 3601 cgctgctgcc tttcttgagc ttgttagagg aaagccagaa aggcattcag atgggatcag 3661 tctggctttc aaattttttt taattcctaa gttctgtttt attttttaat tttttaaaaa 3721 aaattttatt agagacagtc tctctctctt gcctagctgg gagtgcagtg gagtgatcat 3781 agctcactga ggcttgaact cctgggctcg agcaatccac ctcagcctcc agagtagggg 3841 agactacaga tgtgtgccac catactcagc tagtttttaa actttcgtag agacagggtc 3901 tccctgtgtt gcccaggctg gcctcgaact cctgacctca aaaaatcttc ctgccttggc 3961 ctcccagcgc tttgagaggc tgaggcagga ggatcccttg agcccaggag tttgagacca 4021 gcctgggcaa catgacaaaa ccccatctct ccaaaaatac aaaaattggc caggcatggt 4081 ggtgcacact tgtagtccca gtaattaggg ggctgagaca ggaggatcac ttcagcctat 4141 gagtttgagg ctgcagtgag ctgtgattgc gccactacac tccagcctgg atgacaggac 4201 gaaacctgtc tcaaaaacac caaaaaacaa aaaccggtct cctggggtca tggtagcaca 4261 aacgcacatg actgagtgct caggggttct gaggcttgtc cgctgacctg gggctctggc 4321 cctgggagat ctgggggacc tgctgtccta tatgtgatgc tttgaaagaa aggggcatca 4381 ttccaagcca agaggcccca gagagggcac cgtggggtgt tcaggcttct gtgaggcccc 4441 agtgagatcc tgtggctgtg cccccatcac ctccacccac tctgccctcc cactagctgc 4501 ccaacggatg aatcaacgcc ttggcagagt tttccagcag ggccttgcag agagtgtgtg 4561 tgacctgtgt ggccactgcc ttggggacgg gtgaggagtt agcctggaac attccagcgt 4621 gggcattatt gtcctgttgc aagttcaggg caaaaccagg aatccagttt tgtcgatcca 4681 attgagaaaa catttcatga acaactactt gtggcatgca ttggcactcg gaataaagcg 4741 cactattgtc act // LOCUS HUMRSC419 2998 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0005 gene, complete cds. ACCESSION D13630 NID g286000 KEYWORDS KIAA0005. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2998) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (11-NOV-1992) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 2998) AUTHORS Nomura,N., Miyajima,N., Kawarabayasi,Y. and Tabata,S. TITLE Prediction of new human genes by entire sequencing of randomly sampled cDNA clones JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 JOURNAL DNA Res. 1 (1), 27-35 (1994) MEDLINE 96051387 REFERENCE 4 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 (supplement) JOURNAL DNA Res. 1 (1), 47-56 (1994) MEDLINE 96051389 FEATURES Location/Qualifiers source 1..2998 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myoblast" /sex="male" 5'UTR <1..80 gene 81..1340 /gene="KIAA0005" CDS 81..1340 /gene="KIAA0005" /codon_start=1 /db_xref="PID:d1003300" /db_xref="PID:g286001" /translation="MNNQKQQKPTLSGQRFKTRKRDEKERFDPTQFQDCIIQGLTETG TDLEAVAKFLDASGAKLDYRRYAETLFDILVAGGMLAPGGTLADDMMRTDVCVFAAQE DLETMQAFAQVFNKLIRRYKYLEKGFEDEVKKLLLFLKGFSESERNKLAMLTGVLLAN GTLNASILNSLYNENLVKEGVSAAFAVKLFKSWINEKDINAVAASLRKVSMDNRLMEL FPANKQSVEHFTKYFTEAGLKELSEYVRNQQTIGARKELQKELQEQMSRGDPFKDIIL YVKEEMKKNNIPEPVVIGIVWSSVMSTVEWNKKEELVAEQAIKHLKQYSPLLAAFTTQ GQSELTLLLKIQEYCYDNIHFMKAFQKIVVLFYKAEVLSEEPILKWYKDAHVAKGKSV FLEQMKKFVEWLKNAEEESESEAEEGD" 3'UTR 1341..>2998 BASE COUNT 933 a 498 c 616 g 951 t ORIGIN 1 gagaggagac accgccgcag ttgccggtac atcggggatt tctggctctt tcctcttcgc 61 cttaaattcg ggtgtctttt atgaataatc aaaagcagca aaagccaacg ctatcaggcc 121 agcgttttaa aactagaaaa agagatgaaa aagagaggtt tgaccctact cagtttcaag 181 actgtattat tcaaggctta actgaaaccg gtactgattt ggaagcagta gctaagtttc 241 ttgatgcttc tggagcaaaa cttgattacc gtcgatatgc agaaacactc tttgacattc 301 tggtggctgg tggaatgctg gccccaggtg gtacactggc agatgacatg atgcgtacag 361 atgtctgcgt gtttgcagcc caagaagatc tagagaccat gcaagcattt gctcaggttt 421 ttaacaagtt aatcaggcgc tacaaatacc tggagaaagg ttttgaagat gaagtaaaaa 481 agctgctgct gttcttgaag ggtttttcag agtcggagag gaacaagcta gctatgttga 541 ctggtgttct tctggctaat ggaacactta atgcatccat tcttaatagc ctttataatg 601 aaaatttggt taaagaagga gtttcagcag cttttgctgt gaagctcttt aaatcatgga 661 taaatgaaaa agatatcaat gcagtagctg caagtcttcg gaaagtcagc atggataaca 721 gactgatgga actctttcct gccaataagc aaagtgttga acacttcaca aaatatttta 781 ctgaggcagg cttgaaagag ctttcagaat atgttcggaa tcagcaaacc atcggagctc 841 gtaaggagct ccagaaagaa cttcaagaac agatgtcccg tggtgatcca tttaaggata 901 taattttata tgtcaaggag gagatgaaaa aaaacaacat cccagagcca gttgtcatcg 961 gaatagtctg gtcaagtgta atgagcactg tggaatggaa caaaaaagag gagcttgtag 1021 cagagcaagc catcaagcac ttgaagcaat acagccctct acttgctgcc tttactactc 1081 aaggtcagtc tgagctgact ctgttactga agattcagga gtattgctat gacaacattc 1141 atttcatgaa agccttccag aaaatagtgg tgctttttta taaagctgaa gtcctgagcg 1201 aggagcccat tttgaagtgg tataaagatg cacatgttgc aaaggggaag agtgttttcc 1261 ttgagcaaat gaaaaagttt gtagaatggc tcaaaaatgc tgaagaagaa tctgaatctg 1321 aagctgaaga aggtgactga attttgaaac tacaccctca gtaaagcaaa caggagttgt 1381 agataaaatg tcatgtctca tgtgtcctgg ttcttacatc ttcctacctc cctgtatcaa 1441 gcatgatata agggctttca tggcaaattt tattttaact gtttctatgg ttgctggaaa 1501 tgttgggttt agtttctaaa accatgtttt aagtagctac aggagctata gatttgaatc 1561 taatgttgca ttagtctttt cagttatctt ctacctcctg tattttctac tgtaataatg 1621 taatttaagg ccttccacaa tgaacagttc actttattcc ctgggttttc tataaacagt 1681 tttaaggata tgatttggtt aaaaaataat ttgttataaa aattctgttt gcaaattaaa 1741 ctggaaaagt atccagagtc tcaaaaggca atgatttgtg agataatatg gcatgcccgg 1801 agccctgctc atcaatgaaa aacccatatg taataatcga attcatttaa catgaatctt 1861 gagtacgtgg accattgctt gcatgttaac tttttgtttt gttttgtttt gttttgtttt 1921 gcatttttaa ctccagatat cctaaagctc aattgtttgg tctctggttt tcatccttag 1981 agaagccatg gagaacagac ttgaaaagtt taggaaatca taatgtggca gaggtggtgg 2041 gaagaagaaa gttgagcttt ttccccttga gaaacttctg catttagttt ctatctttcc 2101 aggcaaaaca aatgggtatt cttttcatac aaccattttc aaatgaacct tagaaaagtc 2161 ttaacattta aggtatttta tgcacagaat acacttagat tgataggaaa gaactcgtaa 2221 tggagtttga gtaaagaaaa tgactgatgt actaaaccca gtaaaaattg ttgaaaatgt 2281 taaaggtcag catgttctaa ttgggaatct agatatagct tagatttcct attggcttag 2341 agtatttgct ataacaaatg aagtgcaatg acaattatat attcctactc ggtcatactg 2401 gactggcttc gttctcttaa tatactcagt aatgactcaa gcctctggct attaacatac 2461 cctagttgcc gttttttaat tgccatgagc caaatacttc ttggtataca attgatccat 2521 ttattttaat ggctgccttt tcattttcat cttttcttgc tgctacccat ctatgtatgt 2581 agtcattggg gggaaaatgt agccacattt tttatgggaa gactttgtgt taaaagtgaa 2641 cattttgaag gtttttaact ggtgaaacta gcctggaata atgccaccag agactgagtg 2701 gaaatcgccc cttttgaagg tgccattctt atgagccaaa agtttgtcat ttaaaagttc 2761 attttgaggg aataacatgt aatataattt gaaataaagg tatagtaacc ttaaaaagaa 2821 cattataact gattgttgtg aatggggtga atttgttaaa atgagtaact ttgataaagt 2881 ttttcatgca caggcaaaat gtattcacta gatttctacg tagtgatctg cttttacttt 2941 gtaatttgta gttctcaaaa gacttttttt taaaaaaata aagtccatac ttacactt // LOCUS HUMRSC453 3406 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0006 gene, complete cds. ACCESSION D13631 NID g442466 KEYWORDS KIAA0006. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3406) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (11-NOV-1992) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 3406) AUTHORS Nomura,N., Miyajima,N., Kawarabayasi,Y. and Tabata,S. TITLE Prediction of new human genes by entire sequencing of randomly sampled cDNA clones JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 JOURNAL DNA Res. 1 (1), 27-35 (1994) MEDLINE 96051387 REFERENCE 4 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 (supplement) JOURNAL DNA Res. 1 (1), 47-56 (1994) MEDLINE 96051389 FEATURES Location/Qualifiers source 1..3406 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myoblast" /sex="male" 5'UTR <1..9 gene 10..1371 /gene="KIAA0006" CDS 10..1371 /gene="KIAA0006" /codon_start=1 /db_xref="PID:d1003301" /db_xref="PID:g286005" /translation="MPHFKSMYLAYCANHPSAVNVLTQHSDELEQFMENQGASSPGIL ILTTNLSKPFMRLEKYVTLLQELERHMEDTHPDHQDILKAIVAFKTLMGQCQDLRKRK QLELQILSEPIQAWEGEDIKNLGNVIFMSQVMVQYGACEEKEERYLMLFSNVLIMLSA SPRMSGFIYQGKIPIAGTVVTRLDEIEGNDCTFEITGNTVERIVVHCNNNQDFQEWLE QLNRLIRGPASCSSLSKTSSSSCSAHSSFSSTGQPRGPLEPPQIIKPWSLSCLRPAPP LRPSAALGYKERMSYILKESSKSPKTMKKFLHKRKTERKPSEEEYVIRKSTAALEEDA QILKVIEAYCTSANFQQGHGSSTRKDSIPQVLLPEEEKLIIEETRSNGQTIMEEKSLV DTVYALKDEVRELKQENKRMKQCLEEELKSRRDLEKLVRRLLKQTDECIRGESSSKTS ILP" 3'UTR 1372..>3405 BASE COUNT 998 a 747 c 731 g 930 t ORIGIN 1 ctgagtctca tgcctcattt taaatctatg tatctggctt actgtgcaaa ccatccttca 61 gctgtaaatg tgctcactca gcacagtgat gagttggaac aattcatgga aaatcaaggt 121 gcatcgagcc caggtatcct cattttaaca acaaacctca gcaaaccatt catgcgactg 181 gagaaatatg ttactctctt gcaagagtta gaacggcata tggaggatac tcatccagat 241 catcaggata ttctgaaagc aatcgtagca ttcaaaactc tcatggggca atgtcaagat 301 ctgaggaaga gaaaacagct ggagttacag atactgtccg aacctattca ggcatgggaa 361 ggagaagata ttaaaaactt gggaaatgtg atttttatgt cacaagtaat ggtgcagtat 421 ggagcatgtg aggaaaaaga ggagcggtac cttatgttat tttcaaatgt cctgataatg 481 ttatctgcaa gtcctcggat gagtggcttt atctatcagg gaaaaatacc aatagcagga 541 acggtggtga ctagattaga tgaaattgaa gggaatgact gcacatttga aatcactggt 601 aacacagtgg agagaattgt ggtccattgt aacaacaacc aggacttcca ggaatggttg 661 gagcagctga acagactgat cagaggacct gcctcttgca gttcattatc caaaacctca 721 tcgtcatcat gtagtgctca ttcttctttt agctctaccg gacagccccg aggacccttg 781 gagcctcctc aaattataaa accgtggagt ttaagttgtc tacgacctgc acctccactt 841 agaccatcag cagcactagg ttataaagag aggatgtctt atatcttaaa ggagtctagt 901 aaaagcccta aaacgatgaa gaaatttctt cataaaagga agactgagag aaaaccatcg 961 gaggaggaat atgtgattag gaaaagtaca gctgctctgg aagaggatgc tcaaatcctt 1021 aaagtgatcg aagcctactg caccagcgca aattttcaac aaggccatgg ctcaagtact 1081 cgaaaagatt ccattccaca agtcctactc cctgaggaag agaaactcat cattgaagaa 1141 accagaagca acggccagac catcatggaa gaaaagagcc ttgttgatac tgtttacgcc 1201 ttgaaggacg aggtcagaga actgaagcag gaaaataaaa gaatgaagca atgcctggaa 1261 gaagaactga aatcaagaag ggacctagaa aagctggtgc ggaggctttt gaagcaaaca 1321 gatgagtgta ttcgaggcga gtccagtagc aagacctcaa ttcttccata accatcactg 1381 tgccactggg tggagtgtgc cttcagggca tcttgaaatg tcccgctgaa tgatttgact 1441 cagtttgctc acttctttgg cttttgtttt gtgtttgagt ctctctctct ctctccctct 1501 ctcttctctt tctctccccg ctgtgtgcat atgtgtgtgc gtgcacgtgc gcgcttgggc 1561 atttgctgtt gtttggttat tggttggttg ttcatttttt ttttcaacag gtgaaaaagc 1621 aggaagtggt ggtagagatg gcctcagagt cttttccatt cagtaagaaa gagaaaggga 1681 atgcaggcca gttacttaaa aggtcttcag attgccttca gttcacagtg cctctgcaac 1741 agccattgcc ctcaggtcac attctttggg ctggctgccc ttgcaaagca gctggccaag 1801 gcttattaaa tgtgaaccca acttttcccc agggctttcc tgtacctgca agcctcttag 1861 tacttaattt accgaaggca aatcttccag actatggctt gagtaggatg aaagacaaac 1921 acatctgccc aatgatccag ctgcccttcc ttagaccatc acatgcctcc cctcaatcac 1981 aggttttaaa attactgccc tatgttgtct ataagaccaa gaaaactgta gtacccttat 2041 tccttttgtg tcatgtaaat tgtaactcag gggggcagaa gctctggtca cccacaccaa 2101 caccaaatcc atcagcaaat tctactggac aatcctccct tttagtagca gagttgaccg 2161 ccttttccca ttgcatgata ggtttctttc cctcattctg cctagtgtgt aagttttcta 2221 tttccccagt acttttagcc aaccttacca agtggtgtgt aaactagtat gtatgtgctt 2281 tgcttacttt ttaaaaaaat gtagttagac tgtgaggagt tactttatgt acgttgcata 2341 ttcaaactgt gatgttttat ccttcaaaac aatctgcatt aagaagatat tgtgccctac 2401 tagacagctt tcattcactt tattactctc tcatatatgc tgtgagaagt tgtaatatta 2461 attggcctct ttgggtggga ctctaagcag tatttgcaga aatatgcttc tggtccaact 2521 tgtacatcca gaacaaaagg gcccctctag tagctgtgtg ctggattatt caggactgat 2581 taactcaagt tgctcattga atcacatcat ccactttatc ccagaaacta gaactgtgag 2641 aaaccaggtt tagaagacag tcacagaaca gcacacatcc aaattcagcg ccattgaaag 2701 ttggcctaag gctcagtgcc actcacccct ccctccccaa aagacaagag aaatttggcc 2761 agatggggaa ggtcactgga aattgaggcc aaagggctga tcagaattgc ctttataata 2821 tatttgatgg atgcctataa atgtgggttt tgcaaatatt gtttaaaaac caaacttgaa 2881 gccaggcatg gtggcccaca cctgtggtcc cagctactca ggaggctgag gtgggaggat 2941 catttgagcc caggaggtca aggctgcagt gagctgtgat gacaccactg cactccaacc 3001 taggtaacag agagacacct ctgtctctga aaaaacaaaa cttgaagttt gaaaccccca 3061 gcttgctgag gagccctttt tatttttggt actgagagtc tcaaggtccc atagctacat 3121 ggtacagggc tgttgtctgc ttgctgcaca gcagaaatta caccatccac aggaaatgga 3181 ctatttttgc agtccaacat cagccaacca cacacagccg tgtttggaag ctgaagagta 3241 aagaaatttc taggaatggc tgttgttctg ttttttagca cagccattta gatataacat 3301 ccttcactta aaaattaaaa acatcagtac taccaccacc accaacaaca tcaacacaga 3361 tcttcacatc tgcccattct gtggttagtc aatggcttgc aataaa // LOCUS HUMRSC454 5134 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0015 gene, complete cds. ACCESSION D13640 NID g286006 KEYWORDS KIAA0015. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5134) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (11-NOV-1992) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 5134) AUTHORS Nomura,N., Miyajima,N., Kawarabayasi,Y. and Tabata,S. TITLE Prediction of new human genes by entire sequencing of randomly sampled cDNA clones JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 JOURNAL DNA Res. 1 (1), 27-35 (1994) MEDLINE 96051387 REFERENCE 4 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 (supplement) JOURNAL DNA Res. 1 (1), 47-56 (1994) MEDLINE 96051389 FEATURES Location/Qualifiers source 1..5134 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myoblast" /sex="male" 5'UTR <1..106 gene 107..1471 /gene="KIAA0015" CDS 107..1471 /gene="KIAA0015" /codon_start=1 /db_xref="PID:d1003308" /db_xref="PID:g286007" /translation="MSSGAPQKSSPMASGAEETPGFLDTLLQDFPALLNPEDPLPWKA PGTVLSQEEVEGELAELAMGFLGSRKAPPPLAAALAHEAVSQLLQTDLSEFRKLPREE EEEEEDDDEEEKAPVTLLDAQSLAQSFFNRLWEVAGQWQKQVPLAARASQRQWLVSIH AIRNTRRKMEDRHVSLPSFNQLFGLSDPVNRAYFAVFDGHGGVDAARYAAVHVHTNAA RQPELPTDPEGALREAFRRTDQMFLRKAKRERLQSGTTGVCALIAGATLHVAWLGDSQ VILVQQGQVVKLMEPHRPERQDEKARIEALGGFVSHMDCWRVNGTLAVSRAIGDVFQK PYVSGEADAASRALTGSEDYLLLACDGFFDVVPHQEVVGLVQSHLTRQQGSGLRVAEE LVAAARERGSHDNITVMVVFLRDPQELLEGGNQGEGDPQAEGRRQDLPSSLPEPETQA PPRS" 3'UTR 1472..>5134 BASE COUNT 1036 a 1478 c 1552 g 1068 t ORIGIN 1 ggacacggag ccgcgaggag acagctgagg cccgcggaga ccagggggtg aagcctggag 61 accctcttgc cctggcctag ctgcaggccc ccgggatgct ttgggcatgt cctctggagc 121 cccacagaag agcagcccaa tggccagtgg agctgaggag accccaggct tcctggacac 181 gctcctgcaa gacttcccag ccctgctgaa cccagaggac cctctgccat ggaaggcccc 241 agggacggtg ctcagccagg aggaggtgga gggcgagctg gctgagctgg ccatgggctt 301 tctgggcagc aggaaggccc cgccaccact tgctgctgct ctggcccacg aagcagtttc 361 acagctgcta cagacagacc tttccgaatt caggaagttg cccagggagg aagaagaaga 421 ggaggaggac gatgacgagg aggaaaaggc ccctgtgacc ttgctggatg cccaaagcct 481 ggcacagagt ttctttaacc gcctttggga agtcgccggc cagtggcaga agcaggtgcc 541 attggctgcc cgggcctcac agcggcagtg gctggtctcc atccacgcca tccggaacac 601 tcgccgcaag atggaggacc ggcacgtgtc cctcccttcc ttcaaccagc tcttcggctt 661 gtctgaccct gtgaaccgcg cctactttgc tgtgtttgat ggtcacggag gcgtggatgc 721 tgcgaggtac gccgctgtcc acgtgcacac caacgctgcc cgccagccag agctgcccac 781 agaccctgag ggagccctca gagaagcctt ccggcgcacc gaccagatgt ttctcaggaa 841 agccaagcga gagcggctgc agagcggcac cacaggtgtg tgtgcgctca ttgcaggagc 901 gaccctgcac gtcgcctggc tcggggattc ccaggtcatt ttggtacagc agggacaggt 961 ggtgaagctg atggagccac acagaccaga acggcaggat gagaaggcgc gcattgaagc 1021 attgggtggc tttgtgtctc acatggactg ctggagagtc aacgggaccc tggccgtctc 1081 cagagccatc ggggatgtct tccagaagcc ctacgtgtct ggggaggccg atgcagcttc 1141 ccgggcgctg acgggctccg aggactacct gctgcttgcc tgtgatggct tctttgacgt 1201 cgtaccccac caggaagttg ttggcctggt ccagagccac ctgaccaggc agcagggcag 1261 cgggctccgt gtcgccgagg agctggtggc tgcggcccgg gagcggggct cccacgacaa 1321 catcacggtc atggtggtct tcctcaggga cccccaagag ctgctggagg gcgggaacca 1381 gggagaaggg gacccccagg cagaagggag gaggcaggac ttgccctcca gccttccaga 1441 acctgagacc caggctccac caagaagcta ggtggtttcc aggcccctgc cctccccttc 1501 ctcccatcct tgtccttctc tccctcagaa gcctcaggac ccaacaggtg gcaggcagtg 1561 gacagggtgc ccgccccaca gtgctttccc cagcacccca gagccagtcg ggacaccccc 1621 cgcagcccgt cctggtggct gtggaactgc actgggtggc gggcagatgg tggaaggcag 1681 cttaggagac ctcaccaaag agaagatgga ccggctcttg ctcccagctc ctattaggcc 1741 cggggtggga ccagaggtca taggtgccca acggcagcca aaccaaagac actggtgtgc 1801 atggggcagc atggttgtgc acgtgggacc ctggggcgga cccaggagcc aaactcttga 1861 agcaccccct gggtcaggcc cagcagcgga gtggccagcc ccagtttccc attgctcctc 1921 tctgcggcca gggccaggtg ggttcatatt tacagatatg cccagccagt cctggtcggc 1981 cacaccagtg tcccaaagag gagagcgcag cagagccagg ggtctgttct gtagcagcca 2041 cccccctgcc cccactccag ggcagccatg atgtgcttgg cccaccaggg ccttccgggc 2101 tgctctcttc cctgagcccg gaaccggcga cgcacatgtg tcttttgttg gtgtgtttgt 2161 ttttttccag ggaggtctaa ttccgaagca gtattccagg ttttctcttt gttttatcag 2221 tgccaagatg acctgttgtg tcatataatt taagcagagc ttagcattta ttttattctt 2281 tagaaaactt aagtatttac ttttttaaag ctatttttca aggaaccttt ttttgcagta 2341 ttattgaatt tattttctaa atcaggattg aaacaggaac ttttccaggt ggtgttaata 2401 agccattcaa gtgccttaca cagctttgaa gaaactagga ctgcagtggg ctcggatagg 2461 cccattgagg tttttagaaa agcaggattt gttttgttag ggaggcatga ttttggtgag 2521 atctttctgg aagagttttc cgcctctttg tgatgctgaa cacccccaag gttctcccct 2581 ccccccgctg cccaggtgac tggcaggagc tgcgactgcc acgtagtgtt gcctgggccc 2641 gacagcgggg ctctgggcat cccgggtgac cttggcccat ctgcctgcat tcccaccccc 2701 ttgggcctgg ctggatccca ggcagaggga ccttgctgct gtgtgattgg aacattccca 2761 aatatcttgt gaatttgtaa tcaaattggt ctcattggga aagactctta attaagaggc 2821 tcaggcaagc acagaggcag cccgtgggtc tctgtctcag tctggaggca gcagggatgc 2881 tgctgggagt ccatggcaca ggccacagcc cctcaccttg ccgcggtggc tggcagcacg 2941 cctgccttgc tctgccccat gccctgaaca ggcatgagag ctccacgtcc cctagtgcac 3001 cctgagaggg ggctcacaag tgaccgatcc tgggtgcctc agggagctca ctgagggcgt 3061 gcaaagttga aagtggcaag gctgggggag ggtgtcgggt agagggaaga gggcaggggg 3121 ctaggggagg actcagaggc catctgcagg gccaagccac aggaagggct gagctggagg 3181 tgggcagggc tgctccaggc aggtcagagc agtgcagggg gaggagagga gaaagggagg 3241 aagctgggct gtgtggtccc catgaaggca ttcagagtcc acctgcagac agcgagagcc 3301 ccaggaaggt ttgcacagct gtgccccaag caccttggcc tcctctcagc tcgccgagga 3361 ggcacgctag agccgccttc ccggtgggag ccctctgtcc cacagggagc ggggagccag 3421 ctttgctggg gccctacctg catgcccagc cttacccctc attctcacag cacagatgag 3481 gttgagacca tgcagtcaat gcattgctta aggtctctta tttacaaaaa aaaaccttaa 3541 acatagtcgc tgtcattcag acattcagag aatggttggc cacaaacaat gaccaagtat 3601 tgcttggctt aacttgaagg cctgctgtct ccttctgggg gtcagggacg cagctccacc 3661 ctcaccacta gcccaccctg cccgtgggca taaccttgac gaagagagag aatgattggc 3721 atctgctttt ctcttttctt tgctaataat tctgttcctg gctgccgaga gtgaagtttc 3781 accatgtgga ggtttggctc ctatcacctg gtggtctgat tcatacccta gcctgaggct 3841 ccactggaag atctcgcagc ctcagtgtat gggaaaccct ttccccaggc ttgtcccagc 3901 actgccgctc cccacccctg agccaggacc ccagaggatg gccatgcccc gtgcctggca 3961 gaggtctggt gccagcactg ggagctgctc cgcccttgcc ttggggccga gggagccctc 4021 gtccacccct gcacagcagc tgggcacaga ggagcgctct tccatcttga ccaggactgc 4081 accaagaagc accaggtgtc ttcagcctcc aacctccggg gcgaccttct cttccagcca 4141 cagtcccatg agggccccta gccagggaca ctggtctgta aattgtaatc ctttctccag 4201 cccagctctc cacttgttcc ttgtgtgagc tgagcaggca gtgcacctct gagtgtccct 4261 tttgtaaggc ccaggggttg cactgagtct gcagaggccg cgacctccta gaacgctgtg 4321 ggtgcaagtg agccggcgtg tcctggggag atgctgccag cacacagggg ccctcctgct 4381 gccagcaggt tggggtggtt aagtcttatt agtgtctatt cttaaaatta agtgggctgg 4441 agaagaatgg agctccacat gccagcaccg tatatggaat acaaaagctg gggaagcagg 4501 gcctgcctta caggtgtggc tgactctgag cccaggcctg caggggtgga gggcagtccc 4561 tcagaatccc agaggcagtc ccagcctcag aacccaggat aggaaatggg tgtgtttagt 4621 ggggaaaggg acggggtgca gacggcaggg ccagtatggg gccccctccc tctcctctcc 4681 tctcctatgg tgagcccagc gtgggcaccg ggccgtctca gccgtgttcc cagggctggg 4741 aggacagctc tggcccttct taggcctagc ctcgtcccaa gctaaatgta agccagttgg 4801 gctgtgttaa aggaagcagt gtttttggtt tgattctgcc tctgtagctc aaggggggca 4861 gcccccagag tcctgtgcat tctgccaagg ctccatagct ttgccaaatg cacggagctc 4921 tgccattccg gtgcagtgca ggccttgcga agggtttatc tgcgttcgtc tcggtgggct 4981 tctcctgcat gggagttgtg ttcctgtgca agggggagct ttgctcagga caggatgact 5041 gtcttcccta ttcttaggga caagtcccaa gatgccagaa aggcagtctc ccaaggaccc 5101 accatgcaga agtgtcaata aaccacaagt tctg // LOCUS HUMRSC508 2112 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0020 gene, complete cds. ACCESSION D13645 NID g286008 KEYWORDS KIAA0020. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2112) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (11-NOV-1992) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 2112) AUTHORS Nomura,N., Miyajima,N., Kawarabayasi,Y. and Tabata,S. TITLE Prediction of new human genes by entire sequencing of randomly sampled cDNA clones JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 JOURNAL DNA Res. 1 (1), 27-35 (1994) MEDLINE 96051387 REFERENCE 4 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 (supplement) JOURNAL DNA Res. 1 (1), 47-56 (1994) MEDLINE 96051389 FEATURES Location/Qualifiers source 1..2112 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myoblast" /sex="male" 5'UTR <1..418 gene 419..1945 /gene="KIAA0020" CDS 419..1945 /gene="KIAA0020" /codon_start=1 /db_xref="PID:d1003313" /db_xref="PID:g286009" /translation="MWEILRRKDCDKEKRVKLMSDLQKLIQGKIKTIAFAHDSTRVIQ CYIQYGNEEQRKQAFEELRDDLVELSKAKYSRNIVKKFLMYGSKPQIAEIIRSFKGHV RKMLRHAEASAIVEYAYNDKAILEQRNMLTEELYGNTFQLYKSADHRTLDKVLEVQPE KLELIMDEMKQILTPMAQKEAVIKHSLVHKVFLDFFTYAPPKLRSEMIEAIREAVVYL AHTHDGARVAMHCLWHGTPKDRKVIVKTMKTYVEKVANGQYSHLVLLAAFDCIDDTKL VKQIIISEIISSLPSIVNDKYGRKVLLYLLSPRDPAHTVREIIEVLQKGDGNAHSKKD TEVRRRELLESISPALLSYLQEHAQEVVLDKSACVLVSDILGSATGDVQPTMNAIASL AATGLHPGGKDGELHIAEHPAGHLVLKWLIEQDKKMKENGREGCFAKTLVEHVGMKNL KSWASVNRGAIILSSLLQSCDLEVANKVKAALKSLIPTLEKTKSTSKGIEILLEKLST " 3'UTR 1946..>2112 BASE COUNT 747 a 371 c 496 g 498 t ORIGIN 1 ggaagttaaa gggaaaaagc aattcacagg aaagagtaca aagacagcac aagaaaaaaa 61 cagatttcat aaaaatagtg attctggttc ttcaaagaca tttccaacaa ggaaagttgc 121 taaagaaggt ggacctaaag tcacatctag gaactttgag aaaagtatca caaaacttgg 181 gaaaaagggt gtaaagcagt tcaagaataa gcagcaaggg gacaaatcac caaagaacaa 241 attccagccg gcaaataaat tcaacaagaa gagaaaattc cagccagatg gtagaagcga 301 tgaatcagca gccaagaagc ccaaatggga tgacttcaaa aagaagaaga aagaactgaa 361 gcaaagcaga caactcagtg ataaaaccaa ctatgacatt gttgttcggg caaagcagat 421 gtgggagatt ttaagaagaa aagactgtga caaagaaaaa agagtaaagt taatgagtga 481 tttgcagaag ttgattcaag ggaaaattaa aactattgca tttgcacacg attcaactcg 541 tgtgatccag tgttacattc agtatggtaa tgaagaacag agaaaacagg cttttgaaga 601 attgcgagat gatttggttg agttaagtaa agccaaatat tcgagaaata ttgttaagaa 661 atttctcatg tatggaagta aaccacagat tgcagagata atcagaagtt ttaaaggcca 721 cgtgaggaag atgctgcggc atgcggaagc atcagccatc gtggagtacg catacaatga 781 caaagccatt ttggagcaga ggaacatgct gacggaagag ctctatggga acacatttca 841 gctttacaag tcagcagatc accgaactct ggacaaagtg ttagaggtac agccagaaaa 901 attagaactt attatggatg aaatgaaaca gattctaact ccaatggccc aaaaggaagc 961 tgtgattaag cactcattgg tgcataaagt attcttggac ttttttacct atgcaccccc 1021 caaactcaga tcagaaatga ttgaagccat ccgcgaagcg gtggtctacc tggcacacac 1081 acacgatggc gccagagtgg ccatgcactg cctgtggcat ggcacgccca aggacaggaa 1141 agtgattgtg aaaacaatga agacttatgt tgaaaaggtg gctaatggcc aatactccca 1201 tttggtttta ctggcggcat ttgattgtat tgatgatact aagcttgtga agcagataat 1261 catatcagaa attatcagtt cattgcctag catagtaaat gacaaatatg gaaggaaggt 1321 cctattgtac ttactaagcc ccagagatcc tgcacataca gtacgagaaa tcattgaagt 1381 tctgcaaaaa ggagatggaa atgcacacag taagaaagat acagaggtcc gcagacggga 1441 gctcctagaa tccatttctc cagctttgtt aagctacctg caagaacacg cccaagaagt 1501 ggtgctagat aagtctgcgt gtgtgttggt gtctgacatt ctgggatctg ccactggaga 1561 cgttcagcct accatgaatg ccatcgccag cttggcagca acaggactgc atcctggtgg 1621 caaggacgga gagcttcaca ttgcagaaca tcctgcagga catctagttc tgaagtggtt 1681 aatagagcaa gataaaaaga tgaaagaaaa tgggagagaa ggttgttttg caaaaacact 1741 tgtagagcat gttggtatga agaacctgaa gtcctgggct agtgtaaatc gaggtgccat 1801 tattctttct agcctcctcc agagttgtga cctggaagtt gcaaacaaag tcaaagctgc 1861 actgaaaagc ttgattccta cactggaaaa aaccaaaagc accagcaaag gaatagaaat 1921 tctacttgaa aaactgagca cataggtgga aagagttaag agcaagatgg aatgattttt 1981 tctgttctct gttctgtttc ccaatgcaga aaagaagggg tagggtccac catactggta 2041 attggggtac tctgtatatg tgtttcttct ttgtatacga atctatttat ataaattgtt 2101 tttttaaatg gt // LOCUS HUMRSC548 1821 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0002 gene, complete cds. ACCESSION D13627 NID g286010 KEYWORDS KIAA0002. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1821) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (11-NOV-1992) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 1821) AUTHORS Nomura,N., Miyajima,N., Kawarabayasi,Y. and Tabata,S. TITLE Prediction of new human genes by entire sequencing of randomly sampled cDNA clones JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 JOURNAL DNA Res. 1 (1), 27-35 (1994) MEDLINE 96051387 REFERENCE 4 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 (supplement) JOURNAL DNA Res. 1 (1), 47-56 (1994) MEDLINE 96051389 FEATURES Location/Qualifiers source 1..1821 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myoblast" /sex="male" 5'UTR <1..28 gene 29..1675 /gene="KIAA0002" CDS 29..1675 /gene="KIAA0002" /codon_start=1 /db_xref="PID:d1003297" /db_xref="PID:g286011" /translation="MALHVPKAPGFAQMLKEGAKHFSGLEEAVYRNIQACKELAQTTR TAYGPKGMNKMVINHLEKLFVTNDAATILRELEVQHPAAKMIVMASHMQEQEVGDGTN FVLVFAGALLELAEELLRIGLSVSEVIEGYEIACRKAHEILPNLVCCSAKNLRDIDEV SSLLRTSIMSKQYGNEVFLAKLIAQACVSIFPDSGHFNVDNIRVCKILGSGISSSSVL HGMVFKKETEGDVTSVKDAKIAVYSCPFDGMITETKGTVLIKTAEELMNFSKGEENLM DAQVKAIADTGANVVVTGGKVADMALHYANKYNIMLVRLNSKWDLRRLCKTVGATALP RLTPPVLEEMGHCDSVYLSEVGDTQVVVFKHEKEDGAISTIVLRGSTDNLMDDIERVV DDGVNTFKVLTRDKRLVPGGGATEIELAKQITSYGETCPGLEQYAIKKFAEAFEAIPR ALAENSGVKANEVISKLYAVHQEGNKNVGLDIEAEVPAVKDMLEAGILDTYLGKYWAI KLATNAAVTVLRVDQIIMAKPAGGPKPPSGKKDWDDDQND" 3'UTR 1676..>1821 BASE COUNT 557 a 323 c 425 g 516 t ORIGIN 1 cgcgtgaact gcttcctgca ggctggccat ggcgcttcac gttcccaagg ctccgggctt 61 tgcccagatg ctcaaggagg gagcgaaaca cttttcagga ttagaagagg ctgtgtatag 121 aaacatacaa gcttgcaagg agcttgccca aaccactcgt acagcatatg gaccaaaagg 181 aatgaacaaa atggttatca accacttgga gaagttgttt gtgacaaacg atgcagcaac 241 tattttaaga gaactagaag tacagcatcc tgctgcaaaa atgattgtaa tggcttctca 301 tatgcaagag caagaagttg gagatggcac aaactttgtt ctggtatttg ctggagctct 361 cctggaatta gctgaagaac ttctgaggat tggcctgtca gtttcagagg tcatagaagg 421 ttatgaaata gcctgcagaa aagctcatga gattcttcct aatttggtat gttgttctgc 481 aaaaaacctt cgagatattg atgaagtctc atctctactt cgtacctcca taatgagtaa 541 acaatatggt aatgaagtat ttctggccaa gcttattgct caggcatgcg tatctatttt 601 tcctgattcc ggccatttca atgttgataa catcagagtt tgtaaaattc tgggctctgg 661 tatcagttcc tcttcagtat tgcatggcat ggtttttaag aaggaaaccg aaggtgatgt 721 aacatctgtc aaagatgcaa aaatagcagt gtactcttgt ccttttgatg gcatgataac 781 agaaactaag ggaacagtgt tgataaagac tgctgaagaa ttgatgaatt ttagtaaggg 841 agaagaaaac ctcatggatg cacaagtcaa agctattgct gatactggtg caaatgtcgt 901 agtaacaggt ggcaaagtgg cagacatggc tcttcattat gcaaataaat ataatatcat 961 gttagtgagg ctaaactcaa aatgggatct ccgaagactt tgtaaaactg ttggtgctac 1021 agctcttcct agattgacac ctcctgtcct tgaagaaatg ggacactgtg acagtgttta 1081 cctctcagaa gttggagata ctcaggtggt ggtttttaag catgaaaagg aagatggcgc 1141 catttctacc atagtacttc gaggctctac agacaatctg atggatgaca tagaaagggt 1201 agtagacgat ggtgttaata ctttcaaagt tcttacaagg gataaacgtc ttgtacccgg 1261 aggtggagca acagaaattg aattagccaa acagatcaca tcatatggag agacatgtcc 1321 tggacttgaa cagtatgcta ttaagaagtt tgctgaggca tttgaagcta ttccccgcgc 1381 actggcagaa aactctggag ttaaggccaa tgaagtaatc tctaaacttt atgcagtaca 1441 tcaagaagga aataaaaacg ttggattaga tattgaggct gaagtccctg ctgtaaagga 1501 catgctggaa gctggtattc tagatactta cctgggaaaa tattgggcta tcaaactcgc 1561 tactaatgct gcagtcactg tacttagagt ggatcagatc atcatggcaa aaccagctgg 1621 tgggcccaag cctccaagtg ggaagaaaga ctgggatgat gaccaaaatg attgaaattg 1681 gcttaatttt tactgtaggt gaaggctgta tttgtagtag tactcaagaa tcacctgatg 1741 ttttcttatt ctccttaaat taagagttat tttgtgtttg tattcttggc tggatgttat 1801 aataaacata ttgttactgt c // LOCUS HUMRSC765 2640 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0008 gene, complete cds. ACCESSION D13633 NID g286012 KEYWORDS KIAA0008. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2640) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (11-NOV-1992) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 2640) AUTHORS Nomura,N., Miyajima,N., Kawarabayasi,Y. and Tabata,S. TITLE Prediction of new human genes by entire sequencing of randomly sampled cDNA clones JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 JOURNAL DNA Res. 1 (1), 27-35 (1994) MEDLINE 96051387 REFERENCE 4 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 (supplement) JOURNAL DNA Res. 1 (1), 47-56 (1994) MEDLINE 96051389 FEATURES Location/Qualifiers source 1..2640 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myoblast" /sex="male" 5'UTR <1..121 gene 122..2419 /gene="KIAA0008" CDS 122..2419 /gene="KIAA0008" /codon_start=1 /db_xref="PID:d1003302" /db_xref="PID:g286013" /translation="MKTILGDQRKQMLQKYKEEKQLQKLKEQREKAKRGIFKVGRYRP DMPCFLLSNQNAVKAEPKKAIPSSVRITRSKAKDQMEQTKIDNESDVRAIRPGPRQTS EKKVSDKEKKVVQPVMPTSLRMTRSATQAAKQVPRTVSSTTARKPVTRAANENEPEGK VPSKGRPAKNVETKPDKGISCKVDSEENTLNSQTNATSGMNPDGVLSKMENLPEINTA KIKGKNSFAPKDFMFQPLDGLKTYQVTPMTPRSANAFLTPSYTWTPLKTEVDESQATK EILAQKCKTYSTKTIQQDSNKLPCPLGPLTVWHEEHVLNKNEATTKNLNGLPIKEVPS LERNEGRIAQPHHGVPYFRNILQSETEKLTSHCFEWDRKLELDIPDDAKDLIRTAVGQ TRLLMKERFKQFEGLVDDCEYKRGIKETTCTDLDGFWDMVSFQIEDVIHKFNNLIKLE ESGWQVNNNMNHNMNKNVFRKKVVSGIASKPKQDDAGRIAARNRLAAIKNAMRERIRQ EECAETAVSVIPKEVDKIVFDAGFFRVESPVKLFSGLSVSSEGPSQRLGTPKSVNKAV SQSRNEMGIPQQTTSPENAGPQNTKSEHVKKTLFLSIPESRSSIEDAQCPGLPDLIEE NHVVNKTDLKVDCLSSERMSLPLLAGGVADDINTNKKEGISDVVEGMELNSSITSQDV LMSSPEKNTASQNSILEEGETKISQSELFDNKSLTTECHLLDSPGLNCSNPFTQLERR HQEHARHISFGGNLITFSPLQPGEF" 3'UTR 2420..>2640 BASE COUNT 962 a 455 c 529 g 694 t ORIGIN 1 aaatagacac tttggtttga aagatgtaaa cattccaacc ttggaaggta gaattcttgt 61 tgaattagat gagacatctc aagagcttgt tccagaaaag accaatgtta agccaagggc 121 aatgaaaact attctaggtg atcaacgaaa acagatgctc caaaaataca aagaagaaaa 181 gcaacttcaa aaattgaaag agcagagaga gaaagctaaa cgaggaatat ttaaagtggg 241 tcgttataga cctgatatgc cttgttttct tttatcaaac cagaatgctg tgaaagctga 301 gccaaaaaag gctattccat cttctgtacg gattacaagg tcaaaggcca aagaccaaat 361 ggagcagact aagattgata acgagagtga tgttcgagca atccgacctg gtccaagaca 421 aacttctgaa aagaaagtgt cagacaaaga gaaaaaagtt gtgcagcctg taatgcccac 481 gtcgttgaga atgactcgat cagctactca agcagcaaag caggttccca gaacagtctc 541 atctaccaca gcaagaaagc cagtcacaag agctgctaat gaaaacgaac cagaaggaaa 601 ggtgccaagt aaaggaagac ctgccaaaaa tgtagaaaca aaacccgaca agggtatttc 661 ttgtaaagtc gatagtgaag aaaatacttt gaattcacaa actaatgcaa caagtggaat 721 gaatccagat ggagtcttat caaaaatgga aaacttacct gagataaata ctgcaaaaat 781 aaaagggaag aattccttcg cacctaagga ttttatgttt cagccactgg atggtctgaa 841 gacctatcaa gtaacaccta tgactcccag aagtgccaat gcttttttga cacccagtta 901 cacctggact cctttaaaaa cagaagttga tgagtctcaa gcaacaaaag aaattttggc 961 acaaaaatgt aaaacttact ctaccaagac aatacagcaa gattcaaata aattgccatg 1021 tcctttgggt cctctaactg tttggcatga agaacatgtt ttaaataaaa atgaagctac 1081 tactaaaaat ttaaatggcc ttccaataaa agaagtccca tcacttgaaa gaaatgaagg 1141 tcgaattgct cagccccacc atggtgtgcc atatttcaga aatatcctcc agtcagaaac 1201 tgagaaatta acttcacatt gcttcgagtg ggacaggaaa cttgaattgg acattccaga 1261 tgatgctaaa gatcttattc gcacagcagt tggtcaaaca agactcctta tgaaggaaag 1321 gtttaaacag tttgaaggac tggttgatga ttgtgaatat aaacgaggta taaaggagac 1381 tacctgtaca gatctggatg gattttggga tatggttagt tttcagatag aagatgtaat 1441 ccacaaattc aacaatctga tcaaacttga ggaatctggg tggcaagtca ataataatat 1501 gaatcataat atgaacaaaa atgtctttag gaaaaaagtt gtctcaggta tagcaagtaa 1561 accaaaacag gatgatgctg gaagaattgc agcgagaaat cgcctagctg ccataaaaaa 1621 tgcaatgaga gagagaatta ggcaggaaga atgtgctgaa acagcagttt ctgtgatacc 1681 aaaggaagtt gataaaatag tgttcgatgc tggatttttc agagttgaaa gtcctgttaa 1741 attattctca ggactttctg tctcttctga aggcccttct caaagacttg gaacacctaa 1801 gtctgtcaac aaagctgtat ctcagagtag aaatgagatg ggcattccac aacaaactac 1861 atcaccagaa aatgccggtc ctcagaatac gaaaagtgaa catgtgaaga agactttgtt 1921 tttgagtatt cctgaaagca ggagcagcat agaagatgct cagtgtcctg gattaccaga 1981 tttaattgaa gaaaaccatg ttgtaaataa gacagacttg aaggtggatt gtttatccag 2041 tgagagaatg agtttgcctc ttcttgctgg tggagtagca gatgatatta atactaacaa 2101 aaaagaagga atttcagatg ttgtggaagg aatggaactg aattcttcaa ttacatcaca 2161 ggatgttttg atgagtagcc ctgaaaaaaa tacagcttca caaaatagca tcttagaaga 2221 aggggaaact aaaatttctc agtcagaact atttgataat aaaagtctca ctactgaatg 2281 ccaccttctt gattcaccag gtctaaactg cagtaatcca tttactcagc tggagaggag 2341 acatcaagaa catgccagac acatttcttt tggtggtaac ctgattactt tttcacctct 2401 acaaccagga gaattttgaa tttaaaaata aatccaaaca ttttccttca tattatcaat 2461 gcttatatat tccttagact attgaaattt tggagaaaat gtatttgtgt tcacttctat 2521 agcatataat gttttaatat tctgtgttca tcaaagtgta ttttagatat actctttctc 2581 aagggaagtg gggatatttt gtacattttc aacacagaat aaaaaatgta ctgtgccttg // LOCUS HUMRSC911 3594 bp mRNA PRI 10-JUL-1997 DEFINITION Human mRNA for KIAA0011 gene, complete cds. ACCESSION D13636 NID g286018 KEYWORDS KIAA0011. SOURCE Homo sapiens male myeloblast cell_line KG-1 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3594) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (11-NOV-1992) to the DDBJ/EMBL/GenBank databases. Nobuo Nomura, Kazusa DNA Research Institute, Gene Structure 1; 1532-3 Yana, Kisarazu, Chiba 292, Japan (E-mail:cdnainfo@kazusa.or.jp, URL:http://www.kazusa.or.jp, Tel:0438-52-3930, Fax:0438-52-3931) REFERENCE 2 (bases 1 to 3594) AUTHORS Nomura,N., Miyajima,N., Kawarabayasi,Y. and Tabata,S. TITLE Prediction of new human genes by entire sequencing of randomly sampled cDNA clones JOURNAL Unpublished (1994) REFERENCE 3 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 JOURNAL DNA Res. 1 (1), 27-35 (1994) MEDLINE 96051387 REFERENCE 4 (sites) AUTHORS Nomura,N., Miyajima,N., Sazuka,T., Tanaka,A., Kawarabayasi,Y., Sato,S., Nagase,T., Seki,N., Ishikawa,K. and Tabata,S. TITLE Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1 (supplement) JOURNAL DNA Res. 1 (1), 47-56 (1994) MEDLINE 96051389 REFERENCE 5 (sites) AUTHORS Sinn,E., Wang,Z., Kovelman,R. and Roeder,R.G. TITLE Cloning and characterization of a TFIIIC2 subunit (TFIIIC beta) whose presence correlates with activation of RNA polymerase III-mediated transcription by adenovirus E1A expression and serum factors JOURNAL Genes Dev. 9 (6), 675-685 (1995) MEDLINE 95247030 FEATURES Location/Qualifiers source 1..3594 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KG-1" /cell_type="myoblast" /sex="male" 5'UTR <1..39 gene 40..2775 /gene="KIAA0011" CDS 40..2775 /gene="KIAA0011" /codon_start=1 /db_xref="PID:d1003305" /db_xref="PID:g286019" /translation="MDTCGVGYVALGEAGPVGNMTVVDSPGQEVLNQLDVKTSSEMTS AEASVEMSLPTPLPGFEDSPDQRRLPPEQESLSRLEQPDLSSEMSKVSKPRASKPGRK RGGRTRKGPKRPQQPNPPSAPLVPGLLDQSNPLSTPMPKKRGRKSKAELLLLKLSKDL DRPESQSPKRPPEDFETPSGERPRRRAAQVALLYLQELAEELSTALPAPVSCPEGPKV SSPTKPKKIRQPAACPGGEEVDGAPRDEDFFLQVEAEDVEESEGPSESSSEPEPVVPR STPRGSTSGKQKPHCRGMAPNGLPNHIMAPVWKCLHLTKDFREQKHSYWEFAEWIPLA WKWHLLSELEAAPYLPQEEKSPLFSVQREGLPEDGTLYRINRFSSITAHPERWDVSFF TGGPLWALDWCPVPEGAGASQYVALFSSPDMNETHPLSQLHSGPGLLQLWGLGTLQQE SCPGNRAHFVYGIACDNGCIWDLKFCPSGAWELPGTPRKAPLLPRLGLLALACSDGKV LLFSLPHPEALLAQQPPDAVKPAIYKVQCVATLQVGSMQATDPSECGQCLSLAWMPTR PHQHLAAGYYNGMVVFWNLPTNSPLQRIRLSDGSLKLYPFQCFLAHDQAVRTLQWCKA NSHFLVSAGSDRKIKFWDLRRPYEPINSIKRFLSTELAWLLPYNGVTVAQDNCYASYG LCGIHYIDAGYLGFKAYFTAPRKGTVWSLSGSDWLGTIAAGDISGELIAAILPDMALN PINVKRPVERRFPIYKADLIPYQDSPEGPDHSSASSGVPNPPKARTYTETVNHHYLLF QDTDLGSFHDLLRREPMLRMQEGEGHSQLCLDRLQLEAIHKVRFSPNLDSYGWLVSGG QSGLVRIHFVRGLASPLGHRMQLESRAHFNAMFQPSSPTRRPGFSPTSHRLLPTP" 3'UTR 2776..>3594 BASE COUNT 836 a 1000 c 888 g 870 t ORIGIN 1 aggactttgg cgagggggca gccattttgg ggggtgctga tggatacctg cggggtcggc 61 tatgttgccc tgggggaggc cggccccgtg gggaacatga ctgtggtaga ctctcctgga 121 caagaggtgc taaatcagct tgatgtcaag acctcttcag aaatgaccag tgcagaggct 181 tccgtagaga tgtcattacc tacccctttg cctggatttg aggattctcc tgatcagagg 241 aggctccctc cagagcagga aagcctctcc agactggaac agccagatct ttcttcagag 301 atgtcaaagg tctcaaagcc tagggcctca aagcctggcc ggaagagagg tggtaggaca 361 cgaaaaggcc ccaaaaggcc ccaacagcct aatcctccat cagccccact ggttcctggt 421 ctcttagatc aatccaaccc tctgtccacc cccatgccta agaaacgagg tcgaaagtcc 481 aaggcagagc tgctgctgct gaagttgtca aaagacctag atcggccaga atctcaatct 541 ccaaagaggc cccctgagga ctttgagacc ccttctgggg aacgaccccg ccgaagggct 601 gcccaagtgg cacttctgta tcttcaggaa ctggctgaag agctctcaac agccctgcct 661 gcccctgtgt cctgtcctga gggccccaag gtgagcagcc ccaccaaacc gaagaagatc 721 cggcagccag cagcctgtcc aggtggagaa gaggtggatg gtgctccacg ggatgaagac 781 ttttttctcc aggttgaggc tgaagatgtg gaagaaagtg agggcccaag tgagagctca 841 tctgaacctg agcctgtagt gccccgaagc accccacgag gatctacttc agggaaacag 901 aaaccacact gccgaggaat ggctcccaat ggcttaccaa atcatatcat ggctcctgtt 961 tggaagtgcc tccatctcac caaggacttc cgagagcaga aacattcata ctgggagttt 1021 gctgagtgga ttcctttagc ctggaagtgg cacttgttat ctgagcttga ggccgctccc 1081 tacctgcccc aggaggagaa gtctccattg ttttctgtac aacgtgaagg gctacctgaa 1141 gatggcaccc tctaccgaat aaacagattt agctcgatca cagcacatcc agagcgctgg 1201 gatgtgtcct tcttcacggg gggaccgctc tgggctctgg actggtgccc agtgccagag 1261 ggggcaggag cctcgcaata tgtggctctt ttctccagcc ctgacatgaa tgagacacac 1321 ccactgagcc agcttcattc gggtcctggg ctgctccagc tctggggcct tgggaccttg 1381 cagcaagaaa gctgtcctgg caacagggcc cactttgtct atgggattgc ttgtgacaac 1441 ggctgcatct gggacctcaa gttctgcccc agtggagcat gggaacttcc aggcacccct 1501 cggaaggctc ctctcctgcc ccggttgggt ctcttggctc tggcctgctc agacgggaaa 1561 gtactgctat tcagtctacc ccatccggag gccctgctgg ctcagcaacc cccagatgca 1621 gtgaagcctg ccatatataa ggtacaatgt gtggcaactc tgcaggtggg gtctatgcaa 1681 gctacagacc cctctgagtg tggtcagtgc cttagcctgg cctggatgcc taccaggccc 1741 caccaacacc tagctgctgg atattataat ggcatggtgg ttttctggaa ccttcccact 1801 aactcacccc tgcagcggat acggctctct gatggctcct taaagctcta ccccttccag 1861 tgtttcctag cccatgacca ggctgtgcgt acccttcaat ggtgcaaagc taacagccat 1921 ttccttgtct ctgcggggag tgaccggaaa atcaaattct gggaccttcg acgtccttac 1981 gaacccataa actctatcaa gcgcttcttg agtacagaac tggcctggct gcttccctac 2041 aatggtgtca ctgtggctca ggacaactgc tatgcctctt atggactctg tgggattcat 2101 tatattgacg ctggttacct tggtttcaag gcctacttca ctgctcctcg aaaaggcacc 2161 gtttggagtc tttcaggatc cgactggctt gggacaatag ctgcaggaga tatatccggg 2221 gagctcattg ctgctatatt accagatatg gcactgaatc caataaatgt caagcgacct 2281 gtagagcgaa gatttcctat atataaagca gatctgatac cgtatcagga cagtcctgaa 2341 ggtccagacc attcttctgc ttcatctggg gtccccaacc ctcccaaggc tcgaacttac 2401 actgaaactg tcaaccatca ctacttgctc tttcaagaca cagatttggg ttcattccat 2461 gatctgctcc gtagagaacc aatgctgcgc atgcaggagg gagaggggca ttctcaactc 2521 tgcctggaca ggctgcagct ggaggctatt cataaggtac gtttcagccc aaacctggac 2581 tcctatggat ggctggtatc tggggggcag tcagggctgg ttcgaatcca ttttgtccgt 2641 ggactcgcct ccccactggg ccaccgtatg cagcttgaaa gccgagccca cttcaatgct 2701 atgttccaac catcctcccc cactagacgg cctggcttct ctccaaccag ccatcgcctt 2761 ctgcccactc cctagccttg gcccacacca gatccttgga gtgaagtcgg tcaagaacaa 2821 atggccccta tgcacagagc cataggaact gggggccttc cctggacagt gatcatgcca 2881 ggcctggacc tttaggcctg cctccccagg actccttaga tcccactctt tctacagact 2941 tctgtgatca cagccccctg cgggcagggg ggctctccct ccaccaactc tcaaggctcc 3001 tcagcctaag actatggctc atgagaaaca ctcaggcctg acctaggctt gggagtcaaa 3061 ctgctcatat tgagcatatt gttaagtggg taaagccaag taaaggtact gggtgttttt 3121 gtgaccactt gtgaatgggt gtatggagaa ctgaaaaggg tatctgcatg aaggctcctg 3181 tctgactatt ccaggatcca atattactgc cttctgaaac ttcctcttta gggtaaccat 3241 catgtatgcc cacgagggtg atagtaattc gtgagactga agttgcttag agtacttctt 3301 tgaccaagga ataccacaga caccctaccg atagaacagt ggctcagatc ttacttgctc 3361 ctgcttacga agtattccca atcactggtc atctgaccct acttgaacac tcctgaacag 3421 tcatgttttt taaaatcttc ctttatatca agtcagagag tatacttcta taaatttcac 3481 tcatggatgt taggaaatct agtcatcttc cctgtgattg ccctgttaag tatttaacca 3541 tagctatcat gtgtttccca aatcttctct agattaaata tcttcagtta cttc // LOCUS HUMRSPT 488 bp mRNA PRI 19-NOV-1995 DEFINITION Human homolog of yeast ribosomal protein S28, complete cds. ACCESSION D14530 NID g414348 KEYWORDS ribosomal protein. SOURCE Homo sapiens cell-line HepG2, cDNA to mRNA, clone ha01p. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Hori,N., Murakawa,K., Matoba,R., Fukushima,A., Okubo,K. and Matsubara,K. TITLE A new human ribosomal protein sequence, homologue of rat L9 JOURNAL Nucleic Acids Res. 21 (18), 4395 (1993) MEDLINE 94021394 REFERENCE 2 (bases 1 to 488) AUTHORS Hori,N. JOURNAL Unpublished (1994) COMMENT Submitted (26-Feb-1993) to DDBJ by: Naohiro Hori IMCB, Osaka University 1-3 Yamadaoka, Suita Osaka 565 Japan Phone: 06-877-5111 x3314 Fax: 06-877-1922 E-mail: hori@inherit.imcb.osaka-u.ac.jp. FEATURES Location/Qualifiers source 1..488 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HepG2" CDS 14..445 /codon_start=1 /product="ribosomal protein" /db_xref="PID:d1003910" /db_xref="PID:g414349" /translation="MGKCRGLRTARKLRSHRRDQKWHDKQYKKAHLGTALKANPFGGA SHAKGIVLEKVGVEAKQPNSAIRKCVRVQLIKNGKKITAFVPNDGCLNFIEENDEVLV AGFGRKGHAVGDIPGVRFKVVKVANVSLLALYKGKKERPRS" BASE COUNT 149 a 91 c 125 g 123 t ORIGIN 1 tggcgccgac aggatgggca agtgtcgtgg acttcgtact gctaggaagc tccgtagtca 61 ccgacgagac cagaagtggc atgataaaca gtataagaaa gctcatttgg gcacagccct 121 aaaggccaac ccttttggag gtgcttctca tgcaaaagga atcgtgctgg aaaaagtagg 181 agttgaagcc aaacagccaa attctgccat taggaagtgt gtaagggtcc agctgatcaa 241 gaatggcaag aaaatcacag cctttgtacc caatgacggt tgcttgaact ttattgagga 301 aaatgatgaa gttctggttg ctggatttgg tcgcaaaggt catgctgttg gtgatattcc 361 tggagtccgc tttaaggttg tcaaagtagc caatgtttct cttttggccc tatacaaagg 421 caagaaggaa agaccaagat cataaatatt aatggtgaaa acactgtagt aataaatttt 481 catatgcc // LOCUS HUMRSU1A 2194 bp mRNA PRI 09-JAN-1995 DEFINITION Human RSU-1/RSP-1 mRNA, complete cds. ACCESSION L12535 NID g434050 KEYWORDS . SOURCE Homo sapiens adult skin cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2194) AUTHORS Tsuda,T. and Cutler,M. TITLE Isolation of rsp-1, a novel cDNA capable of suppressing v-Ras transformant JOURNAL Unpublished (1993) REFERENCE 2 (bases 1 to 2194) AUTHORS Cutler,M.L., Bassin,R.H., Zanoni,L. and Talbot,N. TITLE Isolation of rsp-1, a novel cDNA cpable of suppressing v-Ras transformant JOURNAL Mol. Cell. Biol. 12, 3752-3756 (1993) FEATURES Location/Qualifiers source 1..2194 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="fibroblast" /dev_stage="adult" /tissue_type="skin" gene 828..1661 /gene="RSU-1" CDS 828..1661 /gene="RSU-1" /note="homologous to mouse Rsu-1; putative" /codon_start=1 /db_xref="PID:g434051" /translation="MSKSLKKLVEESREKNQPEVDMSDRGISNMLDVNGLFTLSHITQ LVLSHNKLTMVPPNIAELKNLEVLNFFNNQIEELPTQISSLQKLKHLNLGMNRLNTLP RGFGSLPALEVLDLTYNNLSENSLPGNFFYLTTLRALYLSDNDFEILPPDIGKLTKLQ ILSLRDNDLISLPKEIGELTQLKELHIQGNRLTVLPPELGNLDLTGQKQVFKAENNPW VTPIADQFQLGVSHVFEYIRSETYKYLYGRHMQANPEPPKKNNDKSKKISRKPLAAKN R" polyA_site 2194 /gene="RSU-1" BASE COUNT 517 a 588 c 524 g 565 t ORIGIN 1 gaattccgtt tttttttttt gacgtgacat ctctttattg gttcagtcta tgcctggccc 61 agcggcacgc ccaggtccag ggggggtctg gttggaggtc tctggacagt caggggcagc 121 atcagcagaa accctgggtg tcggggggct gtgggagtag cagcactggt ccccgcgttg 181 acaggtgccc tccttaaacc tcttgcaagg aaatgtcttg agccgggcag cccgctggca 241 acatcttccg gcggctcctt ccgcttgttc tgaagcagcc tgcgtctctc ctccatagct 301 tcgtcggccg ccttgcccct gtgcgagggg aggagtaggt gggaccccat ctgcccttta 361 ccccttcttt ccctggctga gccccacccc caccttacca ccactcaccc tggccgggca 421 cgaggccggg gaccccagtt ggcctccggg ttagcctgga agcgcttgag tcctcgctgt 481 cgggagctgg ggttggcatc ttcccggatg gtgaagttgg cctgcagcct gggctccaca 541 ctgagaatgc ggaacgctgg gctagtctct gatagtcgct cccgtttcac cggacattcc 601 ttctcccagc aacgcatgat gtcccagagg gcagaggcag gggcatccgt cttcacagcg 661 ttcttacagg cgtgggagag tgagacccgg aagtcagcgt ggaggagggc cgaccgcaac 721 tgcaggaggc ttggtgtgtt gcagtggatg gtgctgctca gctggtgtgc gttctgccga 781 agcttgtggt tgcacgccca tcgtcttagg ggctaccttc cgtgaccatg tccaagtctc 841 tgaagaagtt ggtggaggag agccgggaga agaaccagcc cgaggtggac atgagtgacc 901 ggggcatctc caacatgctg gatgtcaacg gcctctttac cttatcccat atcacacaac 961 tggtcctcag ccataacaag ctaacaatgg tgccaccgaa catcgcagaa ctgaagaatt 1021 tggaggtgct caactttttt aataaccaaa tcgaggagct gcccacacag atcagtagcc 1081 ttcagaaact caaacacctg aaccttggca tgaacaggct gaacactttg ccacgaggct 1141 tcggctccct gccagctctt gaggttctgg acttgacgta caacaacttg agcgaaaatt 1201 ctcttcctgg aaacttcttc tacctgacca ccctgcgtgc actctatcta agtgacaacg 1261 attttgaaat cctgccgcca gatattggga agctcacaaa gttgcagata ctcagcctta 1321 gggataacga cctgatctcg ctgcctaagg aaatcgggga gcttacccag cttaaagagc 1381 tccacattca ggggaaccgc ctcaccgttc tgcccccaga actaggaaac ttggatttaa 1441 ctggccagaa gcaggtattc aaagcagaga acaatccctg ggtgaccccc attgcagacc 1501 agttccagct tggcgtgtcc catgtttttg agtatatccg ttctgagaca tacaaatacc 1561 tctacggcag acacatgcag gccaacccag aaccaccgaa gaagaataat gacaaatcga 1621 aaaagatcag ccggaaaccc ctggcagcca agaacagata aggaagggat tggcatcggc 1681 tggccttcca gcaccttctc tctccaacac ttcattctct cttgccctgt ctctcaaata 1741 aacccaatgc tgcgtgtgag gcctttttta tttttctttt cactctcttt ctaatgcttc 1801 ccaccttacc ttttagattc ttttgctagg tgggagattg ttataaggtc tttaaaccat 1861 ttccatttgt ttctttaaca ttaccaaaag cagggaacaa agctcttatt caactgcgaa 1921 ttccatagtg ggctctggct tttcttgaat agatatcaca aggttgctta ttatcaaaag 1981 aataattaaa atcatgtaac catttaaatg tcactgttaa cacttttcac tctttctgtt 2041 gattcaccta actcattatt ttgctttatt aaaagtcttc cttcaccacc gagatatgct 2101 aatttaactt acaaatgatt ttaataaaat cttgagtttg tatcacatgt tacttattga 2161 ctcagaataa aagaacagtc tgatcttggg gtat // LOCUS HUMS19RP 510 bp mRNA PRI 23-FEB-1996 DEFINITION H.sapiens S19 ribosomal protein mRNA, complete cds. ACCESSION M81757 NID g337732 KEYWORDS S19 ribosomal protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 510) AUTHORS Kondoh,N., Schweinfest,C.W., Henderson,K.W. and Papas,T.S. TITLE Differential expression of S19 ribosomal protein, laminin-binding protein, and human lymphocyte antigen class I messenger RNAs associated with colon carcinoma progression and differentiation JOURNAL Cancer Res. 52 (4), 791-796 (1992) MEDLINE 92145618 FEATURES Location/Qualifiers source 1..510 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="SW620" /cell_type="adeno carcinoma" /tissue_type="colon" CDS 23..460 /codon_start=1 /product="S19 ribosomal protein" /db_xref="PID:g337733" /translation="MPGVTVKDVNQQEFVRALAAFLKKSGKLKVPEWVDTVKLAKHKE LAPYDENWFYTRAASTARHLYLRGGAGVGSMTKIYGGRQRNGVMPSHFSRGSKSVARR VLQALEGLKMVEKDQDGGRKLTPQGQRDLDRIAGQVAAANKKH" BASE COUNT 141 a 135 c 151 g 83 t ORIGIN 1 ctggcagcgc ggaggccgca cgatgcctgg agttactgta aaagacgtga accagcagga 61 gttcgtcaga gctctggcag ccttcctcaa aaagtccggg aagctgaaag tccccgaatg 121 ggtggatacc gtcaagctgg ccaagcacaa agagcttgct ccctacgatg agaactggtt 181 ctacacgcga gctgcttcca cagcgcggca cctgtacctc cggggtggcg ctggggttgg 241 ctccatgacc aagatctatg ggggacgtca gagaaacggc gtcatgccca gccacttcag 301 ccgaggctcc aagagtgtgg cccgccgggt cctccaagcc ctggaggggc tgaaaatggt 361 ggaaaaggac caagatggcg gccgcaaact gacacctcag ggacaaagag atctggacag 421 aatcgccgga caggtggcag ctgccaacaa gaagcattag aacaaaccat gctgggttaa 481 taaattgcct cattcgtaaa aaaaaaaaaa // LOCUS HUMS5HT3RA 2202 bp mRNA PRI 30-MAY-1996 DEFINITION Human mRNA for serotonin 5-HT3 receptor, complete cds. ACCESSION D49394 NID g681913 KEYWORDS serotonin 5-HT3 receptor. SOURCE Homo sapiens hippocampus cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2202) AUTHORS Miyake,A., Mochizuki,S., Takemoto,Y. and Akuzawa,S. TITLE Molecular cloning of human 5-hydroxytryptamine3 receptor: heterogeneity in distribution and function among species JOURNAL Mol. Pharmacol. 48 (3), 407-416 (1995) MEDLINE 96018832 REFERENCE 2 (bases 1 to 2202) AUTHORS Miyake,A. TITLE Direct Submission JOURNAL Submitted (17-FEB-1995) to the DDBJ/EMBL/GenBank databases. Akira Miyake, Yamanouchi Pharmaceutical Co., Ltd., Molecular Medicine Research Laboratories; 21 Miyukigaoka, Tsukuba, Ibaraki 305, Japan (E-mail:miyake@yamanouchi.co.jp, Tel:0298-52-5111(ex.3638), Fax:0298-52-5444) FEATURES Location/Qualifiers source 1..2202 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="hippocampus" CDS 220..1656 /codon_start=1 /product="serotonin 5-HT3 receptor" /db_xref="PID:d1008983" /db_xref="PID:g681914" /translation="MLLWVQQALLALLLPTLLAQGEARRSRNTTRPALLRLSDYLLTN YRKGVRPVRDWRKPTTVSIDVIVYAILNVDEKNQVLTTYIWYRQYWTDEFLQWNPEDF DNITKLSIPTDSIWVPDILINEFVDVGKSPNIPYVYIRHQGEVQNYKPLQVVTACSLD IYNFPFDVQNCSLTFTSWLHTIQDINISLWRLPEKVKSDRSVFMNQGEWELLGVLPYF REFSMESSNYYAEMKFYVVIRRRPLFYVVSLLLPSIFLMVMDIVGFYLPPNSGERVSF KITLLLGYSVFLIIVSDTLPATAIGTPLIGVYFVVCMALLVISLAETIFIVRLVHKQD LQQPVPAWLRHLVLERIAWLLCLREQSTSQRPPATSQATKTDDCSAMGNHCSHMGGPQ DFEKSPRDRCSPPPPPREASLAVCGLLQELSSIRQFLEKRDEIREVARDWLRVGSVLD KLLFHIYLLAVLAYSITLVMLWSIWQYA" polyA_signal 2183..2188 BASE COUNT 478 a 638 c 586 g 500 t ORIGIN 1 ggaaacatga tccagctgaa ggactgattg caggaaaact tggcagctcc ccaaccttgg 61 tggcccaggg agtgtgaggc tgcagcctca gaaggtgtga gcagtggcca cgagaggcag 121 gctggctggg acatgaggtt ggcagagggc aggcaagctg gcccttggtg ggcctcgccc 181 tgagcactcg gaggcactcc tatgcttgga aagctcgcta tgctgctgtg ggtccagcag 241 gcgctgctcg ccttgctcct ccccacactc ctggcacagg gagaagccag gaggagccga 301 aacaccacca ggcccgctct gctgaggctg tcggattacc ttttgaccaa ctacaggaag 361 ggtgtgcgcc ccgtgaggga ctggaggaag ccaaccaccg tatccattga cgtcattgtc 421 tatgccatcc tcaacgtgga tgagaagaat caggtgctga ccacctacat ctggtaccgg 481 cagtactgga ctgatgagtt tctccagtgg aaccctgagg actttgacaa catcaccaag 541 ttgtccatcc ccacggacag catctgggtc ccggacattc tcatcaatga gttcgtggat 601 gtggggaagt ctccaaatat cccgtacgtg tatattcggc atcaaggcga agttcagaac 661 tacaagcccc ttcaggtggt gactgcctgt agcctcgaca tctacaactt ccccttcgat 721 gtccagaact gctcgctgac cttcaccagt tggctgcaca ccatccagga catcaacatc 781 tctttgtggc gcttgccaga aaaggtgaaa tccgacagga gtgtcttcat gaaccaggga 841 gagtgggagt tgctgggggt gctgccctac tttcgggagt tcagcatgga aagcagtaac 901 tactatgcag aaatgaagtt ctatgtggtc atccgccggc ggcccctctt ctatgtggtc 961 agcctgctac tgcccagcat cttcctcatg gtcatggaca tcgtgggctt ctacctgccc 1021 cccaacagtg gcgagagggt ctctttcaag attacactcc tcctgggcta ctcggtcttc 1081 ctgatcatcg tttctgacac gctgccggcc actgccatcg gcactcctct cattggtgtc 1141 tactttgtgg tgtgcatggc tctgctggtg ataagtttgg ccgagaccat cttcattgtg 1201 cggctggtgc acaagcaaga cctgcagcag cccgtgcctg cttggctgcg tcacctggtt 1261 ctggagagaa tcgcctggct actttgcctg agggagcagt caacttccca gaggccccca 1321 gccacctccc aagccaccaa gactgatgac tgctcagcca tgggaaacca ctgcagccac 1381 atgggaggac cccaggactt cgagaagagc ccgagggaca gatgtagccc tcccccacca 1441 cctcgggagg cctcgctggc ggtgtgtggg ctgctgcagg agctgtcctc catccggcaa 1501 ttcctggaaa agcgggatga gatccgagag gtggcccgag actggctgcg cgtgggctcc 1561 gtgctggaca agctgctatt ccacatttac ctgctagcgg tgctggccta cagcatcacc 1621 ctggttatgc tctggtccat ctggcagtac gcttgagtgg gtacagccca gtggaggagg 1681 gggtacagtc ctggttaggt ggggacagag gatttctgct taggcccctc aggacccagg 1741 gaatgccagg gacattttca agacacagac aaagtcccgt gccctgtttc caatgccaat 1801 tcatctcagc aatcacaagc caaggtctga acccttccac caaaaactgg gtgttcaagg 1861 cccttacacc cttgtcccac ccccagcagc tcaccatggc tttaaaacat gctctcttag 1921 atcaggagaa actcgggcac tccctaagtc cactctagtt gtggactttt ccccattgac 1981 cctcacctga ataagggact ttggaattct gcttctcttt cacaactttg cttttaggtt 2041 gaaggcaaaa ccaactctct actacacagg cctgataact ctgtacgagg cttctctaac 2101 ccctagtgtc ttttttttct tcacctcact tgtggcagct tccctgaaca ctcatccccc 2161 atcagatgat gggagtggga agaataaaat gcagtgaaac cc // LOCUS HUMS6KINA 3061 bp mRNA PRI 18-JAN-1995 DEFINITION Homo sapiens ribosomal protein S6 kinase 2 (RPS6KA2) mRNA, complete cds. ACCESSION L07597 NID g292456 KEYWORDS RPS6KA2 gene; S6 kinase II; ribosomal protein S6 kinase II. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3061) AUTHORS Moller,D.E., Xia,C.H., Tang,W., Zhu,A.X. and Jakubowski,M. TITLE Human rsk isoforms: cloning and characterization of tissue-specific expression JOURNAL Am. J. Physiol. 266, 351-359 (1994) FEATURES Location/Qualifiers source 1..3061 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Hela" /map="6" gene 47..2254 /gene="RPS6KA2" CDS 47..2254 /gene="RPS6KA2" /codon_start=1 /db_xref="GDB:G00-365-645" /product="ribosomal protein S6 kinase 2" /db_xref="PID:g292457" /translation="MPLAQLKEPWPLMELVPLDPENGQTSGEEAGLQPSKDEGVLKEI SITHHVKAGSEKADPSHFELLKVLGQGSFGKVFLVRKVTRPDSGHLYAMKVLKKATLK VRDRVRTKMERDILADVNHPFVVKLHYAFQTEGKLYLILDFLRGGDLFTRLSKEVMFT EEDVKFYLAELALGLDHLHSLGIIYRDLKPENILLDEEGHIKLTDFGLSKEAIDHEKK AYSFCGTVEYMAPEVVNRQGHSHSADWWSYGVLMFEMLTGSLPFQGKDRKETMTLILK AKLGMPQFLSTEAQSLLRALFKRNPANRLGSGPDGAEEIKRHVFYSTIDWNKLYRREI TPPFKPAVAQPDDTFYFDTEFTSRTPKDSPGIPPSAGAHQLFRGFSFVATGLMEDDGK PRAPQAPLHSVVQQLHGKNLVFSDGYVVKETIGVGSYSECKRCVHKATNMEYAVKVID KSKRDPSEEIEILLRYGQHPNIITLKDVYDDGKHVYLVTELMRGGELLDKILRQKFFS EREASFVLHTIGKTVEYLHSQGVVHRDLKPSNILYVDESGNPECLRICDFGFAKQLRA ENGLLMTPCYTANFVAPEVLKRQGYDEGCDIWSLGILLYTMLAGYTPFANGPSDTPEE ILTRIGSGKFTLSGGNWNTVSETAKDLVSKMLHVDPHQRLTAKQVLQHPWVTQKDKLP QSQLSHQDLQLVKGAMAATYSALNSSKPTPQLKPIESSILAQRRVRKLPSTTL" BASE COUNT 681 a 872 c 875 g 633 t ORIGIN 1 cgcggcgcag ggccgccgga gagcgcgggt gacctggcgg cggcagatgc cgctcgccca 61 gctcaaggag ccctggccgc tcatggagct agtgccgctg gacccggaga atggacagac 121 ctcaggggaa gaagctggac ttcagccgtc caaggatgag ggcgtcctca aggagatctc 181 catcacgcac cacgtcaagg ctggctctga gaaggctgat ccatcccatt tcgagctcct 241 caaggttctg ggccagggat cctttggcaa agtcttcctg gtgcggaaag tcacccggcc 301 tgacagtggg cacctgtatg ctatgaaggt gctgaagaag gcaacgctga aagtacgtga 361 ccgcgtccgg accaagatgg agagagacat cctggctgat gtaaatcacc cattcgtggt 421 gaagctgcac tatgccttcc agaccgaggg caagctctat ctcattctgg acttcctgcg 481 tggtggggac ctcttcaccc ggctctcaaa agaggtgatg ttcacggagg aggatgtgaa 541 gttttacctg gccgagctgg ctctgggcct ggatcacctg cacagcctgg gtatcattta 601 cagagacctc aagcctgaga acatccttct ggatgaggag ggccacatca aactcactga 661 ctttggcctg agcaaagagg ccattgacca cgagaagaag gcctattctt tctgcgggac 721 agtggagtac atggcccctg aggtcgtcaa ccgccagggc cactcccata gtgcggactg 781 gtggtcctat ggggtgttga tgtttgagat gctgacgggc tccctgccct tccaggggaa 841 ggaccggaag gagaccatga cactgattct gaaggcgaag ctaggcatgc cccagtttct 901 gagcactgaa gcccagagcc tcttgcgggc cctgttcaag cggaatcctg ccaaccggct 961 cggctccggc cctgatgggg cagaggaaat caagcggcat gtcttctact ccaccattga 1021 ctggaataag ctataccgtc gtgagatcac gccacccttc aagccagcag tggctcagcc 1081 tgatgacacc ttctactttg acaccgagtt cacgtcccgc acacccaagg attccccagg 1141 catccccccc agcgctgggg cccatcagct gttccggggc ttcagcttcg tggccaccgg 1201 cttgatggaa gacgacggca agcctcgtgc cccgcaggca cccctgcact cggtggtaca 1261 gcaactccat gggaagaacc tggtttttag tgacggctac gtggtaaagg agacaattgg 1321 tgtgggctcc tactctgagt gcaagcgctg tgtccacaag gccaccaaca tggagtatgc 1381 tgtcaaggtc attgataaga gcaagcggga tccttcagaa gagattgaga ttcttctgcg 1441 gtatggccag caccccaaca tcatcactct gaaagatgtg tatgatgatg gcaaacacgt 1501 gtacctggtg acagagctga tgcggggtgg ggagctgctg gacaagatcc tgcggcagaa 1561 gttcttctca gagcgggagg ccagctttgt cctgcacacc attggcaaaa ctgtggagta 1621 tctgcactca cagggggttg tgcacaggga cctgaagccc agcaacatcc tgtatgtgga 1681 cgagtccggg aatcccgagt gcctgcgcat ctgtgacttt ggttttgcca aacagctgcg 1741 ggctgagaat gggctcctca tgacaccttg ctacacagcc aactttgtgg cgcctgaggt 1801 gctgaagcgc cagggctacg atgaaggctg cgacatctgg agcctgggca ttctgctgta 1861 caccatgctg gcaggatata ctccatttgc caacggtccc agtgacacac cagaggaaat 1921 cctaacccgg atcggcagtg ggaagtttac cctcagtggg ggaaattgga acacagtttc 1981 agagacagcc aaggacctgg tgtccaagat gctacacgtg gatccccacc agcgcctcac 2041 agctaagcag gttctgcagc atccatgggt cacccagaaa gacaagcttc cccaaagcca 2101 gctgtcccac caggacctac agcttgtgaa gggagccatg gctgccacgt actccgcact 2161 caacagctcc aagcccaccc cccagctgaa gcccatcgag tcatccatcc tggcccagcg 2221 gcgagtgagg aagttgccat ccaccaccct gtgaggcacc agggcattcg ggccacaggg 2281 cggtgctagc ttgacacagt cagatgcttc cagagggagc aggccggaac cacagggcca 2341 gagggagctg gaaccgaggg gccggggaag ctgccagccc agaacacccc taatgagggt 2401 gtgagaagtg ccttctcctt ccccaggatg gactcttctc ggctcaggct ctgctggtgg 2461 aaagcgattc actgtataaa ctttttttat gaaaaaaatg gcatcaacca ccatggattt 2521 ttacaagatc catttgcctt tctgggagca gaaacagcca ttgcggccca ggaggggaac 2581 tgagtcacgc tggggctctc tgagactctt tagagcagct ttgggatccc accctgggac 2641 ccccacgatt ggccacctgt agccatctgc acacacctcc gagacagtcc agtgtcacct 2701 ctctcagagc atctggctgt ttagcagaac tcattctatc cccaatcagc tccttttccg 2761 ttctgttctg ctgggagttc tagaaccact tcctgctaca ggaggggtct catgtcctgc 2821 tggcttccag cttcaggcac cagcatccac cttgctctgc cagtggatcc ctgcggtcag 2881 gctgggcagc cccagagaga ggatgtggaa agcacttttt ggctgacttc atctggggtt 2941 ggcaacagga cagagttcac aggaggccag tgggcgggcc atgagggaca gggtcttttt 3001 tcatttcttc ctcagctggt tactcagggt tcatctgtcc atggcctttc taatggaatt 3061 c // LOCUS HUMSAAP 614 bp mRNA PRI 09-JAN-1995 DEFINITION H.sapiens serum amyloid A protein mRNA, complete cds. ACCESSION M81349 M81451 NID g337749 KEYWORDS apolipoprotein; constitutive expression; high density lipoprotein binding protein; serum amyloid A protein. SOURCE Homo sapiens adult liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 614) AUTHORS Whitehead,A.S., de Beer,M.C., Steel,D.M., Rits,M., Lelias,J.M., Lane,W.S. and de Beer,F.C. TITLE Identification of novel members of the serum amyloid A protein superfamily as constitutive apolipoproteins of high density lipoprotein JOURNAL J. Biol. Chem. 267 (6), 3862-3867 (1992) MEDLINE 92156125 FEATURES Location/Qualifiers source 1..614 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="liver" sig_peptide 76..129 /gene="CSAA" gene 76..468 /gene="CSAA" CDS 76..468 /gene="CSAA" /codon_start=1 /product="serum amyloid A protein" /db_xref="PID:g337750" /translation="MRLFTGIVFCSLVMGVTSESWRSFFKEALQGVGDMGRAYWDIMI SNHQNSNRYLYARGNYDAAQRGPGGVWAAKLISRSRVYLQGLIDYYLFGNSSTVLEDS KSNEKAEEWGRSGKDPDRFRPDGLPKKY" BASE COUNT 160 a 148 c 161 g 145 t ORIGIN 1 tatagctcca cggccagaag ataccagcag ctctgccttt actgaaattt cagctggaga 61 aaggtccaca gcacaatgag gcttttcaca ggcattgttt tctgctcctt ggtcatggga 121 gtcaccagtg aaagctggcg ttcgtttttc aaggaggctc tccaaggggt tggggacatg 181 ggcagagcct attgggacat aatgatatcc aatcaccaaa attcaaacag atatctctat 241 gctcggggaa actatgatgc tgcccaaaga ggacctgggg gtgtctgggc tgctaaactc 301 atcagccgtt ccagggtcta tcttcaggga ttaatagact actatttatt tggaaacagc 361 agcactgtat tggaggactc gaagtccaac gagaaagctg aggaatgggg ccggagtggc 421 aaagaccccg accgcttcag acctgacggc ctgcctaaga aatactgagc ttcctgctcc 481 tctgctctca gggaaactgg gctgtgagcc acacacttct ccccccagac agggacacag 541 ggtcactgag ctttgtgtcc ccaggaactg gtatagggca cctagaggtg ttcaataaat 601 gtttgtcaaa ttga // LOCUS HUMSAMS 1487 bp mRNA PRI 22-FEB-1995 DEFINITION Human mRNA for S-adenosylmethionine synthetase, complete cds. ACCESSION D49357 D11332 NID g676878 KEYWORDS S-adenosylmethionine synthetase. SOURCE Human adult liver, cDNA to mRNA, clone HLSAM1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1487) AUTHORS Horikawa,S. and Tsukada,K. TITLE Molecular cloning and nucleotide sequence of cDNA encoding the human liver S-adenosylmethionine synthetase JOURNAL Biochem. Int. 25 (1), 81-90 (1991) MEDLINE 92126072 COMMENT Data kindly submitted in computer readable form by: Saburo Horikawa Dept. of Pathological Biochemistry Medical Research Institute Tokyo Medical and Dental University Kanda-surugadai, Chiyoda-ku Tokyo 101 Japan Tel: 03-5280-8076 Fax: 03-5280-8081 D11332:Submitted (26-May-1992) to DDBJ by:Saburo Horikawa. FEATURES Location/Qualifiers source 1..1487 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" CDS 19..1206 /EC_number="2.5.1.6" /codon_start=1 /product="S-adenosylmethionine synthetase" /db_xref="PID:d1008951" /db_xref="PID:g220066" /translation="MNGPVDGLCDHSLSEGVFMFTSESVGEGHPDKICDQISDAVLDA HLKQDPNAKVACETVCKTGMVLLCGEITSMAMVDYQRVVRDTIKHIGYDDSAKGFDFK TCNVLVALEQQSPDIAQCVHLDRNEEDVGAGDQGLMFGYATDETEECMPLTIILAHKL NARMADLRRSGLLPWLRPDSKTQVTVQYMQDNGAVIPVRIHTIVISVQHNEDITLEEM RRALKEQVIRAVVPAKYLDEDTVYHLQPSGRFVIGGPQGDAGVTGRKIIVDTYAAWGA HGGGAFSGKDYTKVDRSAAYAARWVAKSLVKAGLCRRVLVQVSYAIGVAEPLSISIFT YGTSQKTERELLDVVHKNFDLRPGVIVRDLDLKKPIYQKTACYGHFGRSEFPWEVPRK LVF" BASE COUNT 315 a 401 c 458 g 313 t ORIGIN 1 gtggagaagt gtgagaagat gaatggaccg gtggatggct tgtgtgacca ctctctaagt 61 gaaggagtct tcatgttcac atcggagtct gtgggagagg gacacccgga taagatctgt 121 gaccagatca gtgatgcagt gctggatgcc catctcaagc aagaccccaa tgccaaggtg 181 gcctgtgaga cagtgtgcaa gaccggcatg gtgctgctgt gtggtgagat cacctcaatg 241 gccatggtgg actaccagcg ggtggtgagg gacaccatca agcacatcgg ctacgatgac 301 tcagccaagg gctttgactt caagacttgc aacgtgctgg tggctttgga gcagcaatcc 361 ccagatattg cccagtgcgt ccatctggac agaaatgagg aggatgtggg ggcaggagat 421 cagggtttga tgttcggcta tgccaccgac gagacagagg agtgcatgcc cctcaccatc 481 atccttgctc acaagctcaa cgcccggatg gcagacctca ggcgctccgg cctcctcccc 541 tggctgcggc ctgactctaa gactcaggtg acagttcagt acatgcagga caatggcgca 601 gtcatccctg tgcgcatcca caccatcgtc atctctgtgc agcacaacga agacatcacg 661 ctggaggaga tgcgcagggc cctgaaggag caagtcatca gggccgtggt gccggccaag 721 tacctggacg aagacaccgt ctaccacctg cagcccagtg ggcggtttgt catcggaggt 781 ccccaggggg atgcgggtgt cactggccgt aagattattg tggacaccta tgcggcctgg 841 ggggctcatg gtggtggggc cttctctggg aaggactaca ccaaggtgga ccgctcagcc 901 gcatatgctg cccgctgggt ggccaagtct ctggtgaaag cagggctctg ccggagagtg 961 cttgtccagg tttcctatgc cattggtgtg gccgagccgc tgtccatttc catcttcacc 1021 tacggaacct ctcagaagac agagcgagag ctgctggatg tggtgcataa gaacttcgac 1081 ctccggccgg gcgtcattgt cagggacttg gatttgaaga agcccatcta ccagaagaca 1141 gcatgctacg gccatttcgg aagaagcgag ttcccatggg aggttcccag gaagcttgta 1201 ttttagagcc agggggagct gggcctggtc tcaccctgga ggcaactggt ggccatgctc 1261 ctcttcccca gacgcctggc tgctgatcgc cttccccacc caccaaccct cagggcaaag 1321 ccaggtccct ctcatttagc ctgtcctgtc atcatcatgg ccagctggag gcaggggctt 1381 cctggtgctg gaggttggat cttgatgtaa ggatgggcat ggtgttctcc tgctgctccc 1441 tcagactggg gcaatgttaa tttagtggaa aaggcacccc cgtcaag // LOCUS HUMSAP1A 1933 bp mRNA PRI 17-DEC-1993 DEFINITION Homo sapiens SRF accessory protein 1A (SAP-1) mRNA, complete cds. ACCESSION M85165 NID g429185 KEYWORDS serum response factor; SAP-1; Elk-1. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1933) AUTHORS Dalton,S. and Treisman,R. TITLE Characterization of SAP-1, a protein recruited by serum response factor to the c-fos serum response element JOURNAL Cell 68, 597-612 (1992) MEDLINE 92154673 REFERENCE 2 (bases 1 to 1933) AUTHORS Treisman,R. TITLE Direct Submission JOURNAL Submitted (05-MAR-1992) Richard Treisman, Transcription Laboratory, Imperial Cancer Research Fund, London, England FEATURES Location/Qualifiers source 1..1933 /organism="Homo sapiens" /db_xref="taxon:9606" gene 150..1445 /gene="SAP-1" CDS 150..1445 /gene="SAP-1" /note="Homology region A with Elk-1 protein (Ets domain) is bp 150-417, amino acids 1-89; Homology region B with Elk-1 protein, required for cooperative ternary complex formation with SRF is bp 565-617, amino acids 136-157; Sequence diverges from SAP-1B at bp 1229, amino acid 360; Homology region C with Elk-1 protein, core of regulated transcription activation domain, is bp 1203-1355, amino acids 352-402; (S/T)P motifs conserved between SAP-1A and Elk-1 are located at amino acids T354, T361, T366, S381, S387, T420, S425, corresponding to bp 1209-1214, bp 1230-1235, bp 1245-1250, bp 1290-1295, bp 1308-1313, bp 1407-1412, bp-1422-1427" /codon_start=1 /product="SAP-1A protein" /db_xref="PID:g429186" /translation="MDSAITLWQFLLQLLQKPQNKHMICWTSNDGQFKLLQAEEVARL WGIRKNKPNMNYDKLSRALRYYYVKNIIKKVNGQKFVYKFVSYPEILNMDPMTVGRIE GDCESLNFSEVSSSSKDVENGGKDKPPQPGAKTSSRNDYIHSGLYSSFTLNSLNSSNV KLFKLIKTENPAEKLAEKKSPQEPTPSVIKFVTTPSKKPPVEPVAATISIGPSISPSS EETIQALETLVSPKLPSLEAPTSASNVMTAFATTPPISSIPPLQEPPRTPSPPLSSHP DIDTDIDSVASQPMELPENLSLEPKDQDSVLLEKDKVNNSSRSKKPKGLGLAPTLVIT SSDPSPLGILSPSLPTASLTPAFFSQTPIILTPSPLLSSIHFWSTLSPVAPLSPARLQ GANTLFQFPSVLNSHGPFTLSGLDGPSTPGPFSPDLQKT" BASE COUNT 556 a 486 c 385 g 506 t ORIGIN 1 ccgccgcctt ctactccgcc gcgggggtcg cagcggctgc cgcgccgtcc tcgagtttcc 61 agcgtgagga ggaggctgag ggcggagagg cgcatcgtgt tcgaggcgga gaccgagggg 121 gagccccgcg cgcggcgtcg ctcattgcta tggacagtgc tatcaccctg tggcagttcc 181 ttcttcagct cctgcagaag cctcagaaca agcacatgat ctgttggacc tctaatgatg 241 ggcagtttaa gcttttgcag gcagaagagg tggctcgtct ctgggggatt cgcaagaaca 301 agcctaacat gaattatgac aaactcagcc gagccctcag atactattat gtaaagaata 361 tcatcaaaaa agtgaatggt cagaagtttg tgtacaagtt tgtctcttat ccagagattt 421 tgaacatgga tccaatgaca gtgggcagga ttgagggtga ctgtgaaagt ttaaacttca 481 gtgaagtcag cagcagttcc aaagatgtgg agaatggagg gaaagataaa ccacctcagc 541 ctggtgccaa gacctctagc cgcaatgact acatacactc tggcttatat tcttcattta 601 ctctcaactc tttgaactcc tccaatgtaa agcttttcaa attgataaag actgagaatc 661 cagccgagaa actggcagag aaaaaatctc ctcaggagcc cacaccatct gtcatcaaat 721 ttgtcacgac accttccaaa aagccaccag ttgaacctgt tgctgccacc atttcaattg 781 gcccaagtat ttctccatct tcagaagaaa ctatccaagc tttggagaca ttggtttccc 841 caaaactgcc ttccctggaa gccccaacct ctgcctctaa cgtaatgact gcttttgcca 901 ccacaccacc catttcgtcc ataccccctt tgcaggaacc tcccagaaca ccttcaccac 961 cactgagttc tcacccagac atcgacacag acattgattc agtggcttct cagccaatgg 1021 aacttccaga gaatttgtct ctggagccta aagaccagga ttcagtcttg ctagaaaagg 1081 acaaagtaaa taattcatca agatccaaga aacccaaagg gttaggactg gcacccaccc 1141 ttgtgatcac gagcagtgat ccaagcccac tgggaatact gagcccatct ctccctacag 1201 cttctcttac accagcattt ttttcacaga cacccatcat actgactcca agccccttgc 1261 tctccagtat ccacttctgg agtactctca gtcctgttgc tcccctaagt ccagccagac 1321 tgcaaggtgc taacacactt ttccagtttc cttctgtact gaacagtcat gggccattca 1381 ctctgtctgg gctggatgga ccttccaccc ctggcccatt ttccccagac ctacagaaga 1441 cataacctat gcacttgtgg aatgagagaa ccgaggaacg aagaaacaga cattcaacat 1501 gattgcattt gaagtgagca attgatagtt ctacaatgct gataatagac tattgtgatt 1561 tttgccattc cccattgaaa acatcttttt aggattctct ttgaatagga ctcaagttgg 1621 actatatgta taaaaatgcc ttaattggag tctaaactcc acctccctct gtcttttcct 1681 tttctttttc tttccttcct tccttttctt ttctccttta aaaatatttt gagctttgtg 1741 ctgaagaagt ttttggtggg ctttagtgac tgtgctttgc aaaagcaatt aagaacaaag 1801 ttactccttc tggctattgg gaccctttgg ccaggaaaaa ttatgcttag aatctattat 1861 ttaaagaagt atttgtgaaa tgaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1921 aaaaaaaaaa aaa // LOCUS HUMSAP1C 3900 bp mRNA PRI 06-MAY-1994 DEFINITION Human mRNA for protein tyrosine phosphatase. ACCESSION D15049 NID g475003 KEYWORDS protein tyrosine phosphatase. SOURCE Homo sapiens cell_line:KATO-III cDNA to mRNA, clone_lib:KATO-III ZAP-II. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3900) AUTHORS Matozaki,T., Suzuki,T., Uchida,T., Inazawa,J., Ariyama,T., Matsuda,K., Horita,K., Noguchi,H., Mizuno,H., Sakamoto,C. and Kasuga,M. TITLE Molecular cloning of a human transmembrane-type protein tyrosine phosphatase and its expression in gastrointestinal cancers JOURNAL J. Biol. Chem. 269 (3), 2075-2081 (1994) MEDLINE 94124561 REFERENCE 2 (bases 1 to 3900) AUTHORS Matozaki,T. TITLE Direct Submission JOURNAL Submitted (16-APR-1993) to the DDBJ/EMBL/GenBank databases. Takashi Matozaki, Kobe University School of Medicine, Second Department of Internal Medicine; Kusunoki-cho, Chuo-ku, Kobe, Hyogo 650, Japan (Tel:078-341-7451, Fax:078-382-2080) COMMENT Submitted (16-APR-1993) to DDBJ by: Takashi Matozaki 2nd Department of Int. Medicine Kobe University, School of Med. Kusunoki-cho, Chuo-ku Kobe 650 Japan Phone: 078-341-7451 Fax: 078-382-2080. FEATURES Location/Qualifiers source 1..3900 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="KATO-III" /clone_lib="KATO-III ZAP-II" gene 42..3398 /gene="SAP-1" CDS 42..3398 /gene="SAP-1" /codon_start=1 /product="protein tyrosine phosphatase precursor" /db_xref="PID:d1004159" /db_xref="PID:g475004" /translation="MAGAGGGLGVWGNLVLLGLCSWTGARAPAPNPGRNLTVETQTTS SISLSWEVPDGLDSQNSNYWVQCTGDGGTTETRNTTATNVTVDGLGPGSLYTCSVWVE KDGVNSSVGTVTTATAPNPVRNLRVEAQTNSSIALTWEVPDGPDPQNSTYGVEYTGDG GRAGTRSTAHTNITVDGLEPGCLYAFSMWVGKNGINSSRETRNATTAHNPVRKPESGG SDHQLHLPELGGPRWHRPTELDLLRTSALEMVAEQRLETQQTPESPVDGLGPGSLYTC SVWVEKDGVNSSSWRLVTSTTAPNPVRNLTVEAQTNSSIALTWEVPDGPDPQNSTYGV EYTGDGGRAGTRSTAHTNITVDRLEPGCLYVFSVWVGKNGINSSRETRNATTAPNPVR NLHMETQTNSSIALCWEVPDGPYPQDYTYWVGYTGDGGGTETRNTTNTSVTAERLEPG TLYTFSVWAEKNGARGSRQNVSISTVPNAVTSLSKQDWTNSTIALRWTAPQGPGQSSY SYWVSWVREGMTDPRTQSTSGTDITLKELEAGSLYHLTVWAERNEVRGYNSTLTAATA PNEVTDLQNETQTKNSVMLWWKAPGDPHSQLYVYWVQWASKGHPRRGQDPQANWVNQT SRTNETWYKVEALEPGTLYNFTVWAERNDVASSTQSLCASTYPDTVTITSCVSTSAGY GVNLIWSCPQGGYEAFELEVGGQRGSQDRSSCGEAVSVLGLGPARSYPATITTIWDGM KVVSHSVVCHTESAGVIAGAFVGILLFLILVGLLIFFLKRRNKKKQQKPELRDLVFSS PGDIPAEDFADHVRKNERDSNCGFADEYQQLSLVGHSQSQMVASASENNAKNRYRNVL PYDWSRVPLKPIHEEPGSDYINASFMPGLWSPQEFIATQGPLPQTVGDFWRLVWEQQS HTLVMLTNCMEAGRVKCEHYWPLDSQPCTHGHLRVTLVGEEVMENWTVRELLLLQVEE QKTLSVRQFHYQAWPDHGVPSSPDTLLAFWRMLRQWLDQTMEGGPPIVHCSAGVGRTG TLIALDVLLRQLQSEGLLGPFSFVRKMRESRPLMVQTEAQYVFLHQCICGSSNSQPRP QPRRKSRMRMSKTSSTRTWPPSRPTSWRSK" sig_peptide 42..116 /gene="SAP-1" mat_peptide 117..3395 /gene="SAP-1" /EC_number="3.1.3.48" /product="protein tyrosine phosphatase" polyA_site 3900 BASE COUNT 907 a 1100 c 1154 g 739 t ORIGIN Chromosome 19q13.4. 1 ctaggcctgg gactcctggg tccccggcag tgtctggagg catggctggg gctggcgggg 61 gcctcggggt ctgggggaac ctggtgctgc tgggcctgtg cagctggaca ggggccaggg 121 cgcctgcccc caacccaggg aggaacctga cagtggagac tcagaccacc agctccatct 181 ccctgagctg ggaggtcccc gatggcctag actcacagaa ctccaactac tgggttcagt 241 gtactggaga cggcggcaca acagagactc gaaacacaac agccaccaac gtcaccgtgg 301 atggccttgg acccgggtca ttgtatacgt gttctgtgtg ggtggagaaa gacggagtaa 361 atagctctgt ggggactgtc actactgcca cagctcccaa cccagtgagg aacctgagag 421 tggaggctca gaccaacagc tccatcgccc tgacctggga ggtccccgac ggcccagacc 481 cacagaactc cacctacggg gttgagtaca ctggagatgg tggcagagca gggactcgaa 541 gcacagcaca cactaacatc accgtggatg gacttgaacc cgggtgtttg tatgcgtttt 601 ccatgtgggt gggaaagaat ggaatcaaca gctcccggga gactcgaaat gccaccacag 661 ctcacaaccc agtgaggaaa cctgagagtg gaggctcaga ccaccagctc catctccctg 721 agctgggagg tccccgatgg cacagaccca cagaactcga cctactgcgt acgagtgcac 781 tggagatggt ggcagaacag agactcgaaa cacaacagac accagagtca ccagtggatg 841 gccttggacc cgggtcattg tatacgtgtt ctgtgtgggt ggagaaagac ggagtaaata 901 gctcctcgtg gagattggta actagtacca cagctcccaa cccagtgaga aacctgacag 961 tggaggctca gaccaacagc tccatcgccc tgacctggga ggtccccgat ggcccagacc 1021 cacagaactc cacctacggg gttgagtaca ctggagatgg tggcagagca gggactcgaa 1081 gcacagcaca caccaacatc accgtggata gacttgaacc cgggtgtttg tatgtgtttt 1141 ccgtgtgggt ggggaagaat ggaatcaaca gctcccggga gactcgaaat gccaccacag 1201 cccccaaccc agtgagaaac ctccatatgg agactcagac caacagctcc atcgccctat 1261 gctgggaagt ccccgatggc ccataccctc aggactacac ctactgggta gggtacactg 1321 gagacggtgg tggcacagag acccgaaaca caacaaatac cagtgtgaca gctgagagac 1381 ttgagcccgg aaccttgtac acattctctg tatgggcaga aaaaaatgga gcacgtggct 1441 ccaggcagaa tgtcagcatc tccacagtcc ccaacgcagt gacaagcctc agcaagcagg 1501 actggaccaa cagcaccatt gctttgcgct ggacagctcc ccagggccca ggccagtctt 1561 cctacagcta ctgggtctca tgggtcaggg aaggcatgac tgaccccagg acccaaagca 1621 cctcaggtac tgacatcacc ctaaaggaac tggaagctgg cagcctgtac cacctcaccg 1681 tctgggccga gaggaatgag gtcagaggct ataacagcac cctcactgca gccactgctc 1741 ccaatgaggt cacagatctc cagaatgaaa ctcagactaa gaactcagtc atgctgtggt 1801 ggaaggcccc tggagacccc cactctcagt tgtacgtata ctgggtccag tgggccagca 1861 agggacatcc ccggaggggg caagatcccc aagcgaattg ggtcaaccag accagcagga 1921 ccaatgagac gtggtacaaa gtggaggccc tggaacccgg gacgttgtac aatttcaccg 1981 tgtgggcaga gaggaatgac gtagccagtt ccacgcagag cctctgtgcg tccacatacc 2041 cagacacagt caccatcact tcctgtgtca gcacctcagc gggctatgga gtcaacttga 2101 tctggtcctg cccccaggga ggctacgagg cctttgagtt ggaggtggga ggacagcggg 2161 gctcccagga cagatcttca tgtggggagg ctgtgtctgt gttgggtctc gggccggctc 2221 ggtcctaccc agccaccatc acgaccatct gggacggaat gaaggtcgtg tctcactctg 2281 tggtctgcca caccgagagt gcaggggtca ttgccggagc ctttgtgggc atcctcctgt 2341 ttctcatcct cgtgggcctg ctgattttct tcctgaagag gaggaataag aagaagcagc 2401 agaaaccaga actcagggat ctggtcttta gctccccagg ggacatccca gctgaagact 2461 tcgctgacca cgtcaggaag aatgagaggg acagcaactg tggttttgca gacgagtacc 2521 agcaactctc cctggtgggc cacagccagt ctcagatggt ggcttcggct tcagagaaca 2581 acgccaagaa ccgctacaga aatgtgctgc cctatgactg gtcccgggtg cccctgaagc 2641 ccatccatga ggagccaggc tctgactaca tcaatgccag cttcatgccc ggtctctgga 2701 gcccccagga gttcattgca acccagggtc ccctgccaca gacagtgggt gacttctggc 2761 gcctggtgtg ggaacagcag agccacaccc tggtcatgct gaccaactgc atggaggccg 2821 gccgggtgaa gtgtgagcat tactggcctc tggactcgca gccctgcacc catgggcacc 2881 tgcgggtaac cctggtaggt gaggaagtga tggagaactg gacggtgcgg gaactgctgc 2941 tcctccaggt ggaggagcag aagacactgt ctgtgcgcca attccactac caggcctggc 3001 cggatcacgg cgttccctcc tccccagaca ccttgctggc tttctggagg atgcttcggc 3061 agtggctgga tcagaccatg gagggaggcc cacccattgt gcactgcagt gctggcgtgg 3121 gtcgcacagg aaccctcatt gccctggacg tcctgctccg gcagctgcag tccgagggtc 3181 tccttgggcc cttcagcttt gtaaggaaga tgagagagag tcggccgttg atggtgcaga 3241 ctgaggctca gtacgtattc ctgcatcagt gcatctgcgg ttcctccaac agtcagccca 3301 ggccccagcc gagaaggaag tcccgtatga ggatgtcgaa aacctcatct acgagaacgt 3361 ggccgccatc caggcccaca agttggaggt ctaagtgacg agggggctgg gtcggcagcc 3421 caggcatcct caagctctgg acacccactt gagcccagat tcctggaaga gcagagggct 3481 gggctcccag actcctgggt gctgtgggag gagggggctg gtatcccaaa ctctggtttc 3541 cccaggagag agtggtctgg tgggcttcag atgagtccta tgggagctgg ggatctggat 3601 tcctggttcc ctgaaggagg agagggatga tagcttggat tccctaggtc tttccaggat 3661 gcagaaagaa acaggctggg gcctggattc tgaggcagga aggaatttgg gtctggagtt 3721 ctggctactt gaggaccaaa ggcaggaagg atcctgcctt gattttactt cagaaaccaa 3781 atcagtcttc tataatctgg ggtcggaggg agtccctgtg cccaaggtct ctctgcaccc 3841 caccatccac atgtattttt ccttctatcc cataatttat taaatcactg ttctccccag // LOCUS HUMSAP49A 1275 bp DNA PRI 09-JAN-1995 DEFINITION Human spliceosomal protein (SAP 49) gene, complete cds. ACCESSION L35013 NID g556216 KEYWORDS spliceosomal protein. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1275) AUTHORS Champion-Arnaud,P. and Reed,R. TITLE The prespliceosome components SAP 49 and SAP 145 interact in a complex implicated in tethering U2 snRNP to the branch site JOURNAL Genes Dev. 8, 1974-1983 (1994) MEDLINE 95047348 FEATURES Location/Qualifiers source 1..1275 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" gene 1..1275 /gene="SAP 49" CDS 1..1275 /gene="SAP 49" /note="RNA recognition motif amino acid 15-86 and 102-174, proline-glycine rich domain amino acid 200-424" /codon_start=1 /product="spliceosomal protein" /db_xref="PID:g556217" /translation="MAAGPISERNQDATVYVGGLDEKVSEPLLWELFLQAGPVVNTHM PKDRVTGQHQGYGFVEFLSEEDADYAIKIMNMIKLYGKPIRVNKASAHNKNLDVGANI FIGNLDPEIDEKLLYDTFSAFGVILQTPKIMRDPDTGNSKGYAFINFASFDASDAAIE AMNGQYLCNRPITVSYAFKKDSKGERHGSAAERLLAAQNPLSQADRPHQLFADAPPPP SAPNPVVSSLGSGLPPPGMPPPGSFPPPVPPPGALPPGIPPAMPPPPMPPGAAGHGPP SAGTPGAGHPGHGHSHPHPFPPGGMPHPGMSQMQLAHHGPHGLGHPHAGPPGSGGQPP PRPPPGMPHPGPPPMGMPPRGPPFGSPMGHPGPMPPHGMRGPPPLMPPHGYTGPPRPP PYGYQRGPLPPPRPTPRPPVPPRGPLRGPLPQ" BASE COUNT 270 a 437 c 302 g 266 t ORIGIN 1 atggctgccg ggccgatctc cgagcggaat caggatgcca ctgtgtacgt ggggggcctg 61 gatgagaagg ttagtgaacc gctgctgtgg gaactgtttc tccaggctgg accagtagtc 121 aacacccaca tgccaaagga tagagtcact ggccagcacc aaggctatgg ctttgtggaa 181 ttcttgagtg aggaagatgc tgactatgcc attaagatca tgaacatgat caaactctat 241 gggaagccaa tacgggtgaa caaagcatca gctcacaaca aaaacctgga tgtaggggcc 301 aacattttca ttgggaacct ggaccctgag attgatgaga agttgcttta tgatactttc 361 agcgcctttg gggtcatctt acaaaccccc aaaattatgc gggaccctga cacaggcaac 421 tccaaaggtt atgcctttat taattttgct tcatttgatg cttcggatgc agcaattgaa 481 gccatgaatg ggcagtacct ctgtaaccgt cctatcaccg tatcttatgc cttcaagaag 541 gactccaagg gtgagcgcca tggctcagca gccgaacgac ttctggcagc tcagaacccg 601 ctctcccagg ctgatcgccc tcatcagctg tttgcagatg cacctcctcc accctctgct 661 cccaatcctg tggtatcatc attggggtct gggcttcctc caccaggcat gcctcctcct 721 ggctccttcc cacccccagt gccacctcct ggagccctcc cacctgggat acccccagcc 781 atgcccccac cacctatgcc tcctggggct gcaggacatg gccccccatc ggcaggaacc 841 ccaggggcag gacatcctgg tcatggacac tcacatcctc acccattccc accgggtggg 901 atgccccatc cagggatgtc tcagatgcag cttgcacacc atggccctca tggcttagga 961 catccccacg ctggaccccc aggctctggg ggccagccac cgccccgacc accacctgga 1021 atgcctcatc ctggacctcc tccaatgggc atgccccccc gagggcctcc attcggatct 1081 cccatgggtc acccaggtcc tatgcctccg catggtatgc gtggacctcc tccactgatg 1141 cccccccatg gatacactgg ccctccacga cccccaccct atggctacca gcgggggcct 1201 ctccctccac ccagacccac tccccggcca ccagttcccc ctcgaggccc acttcgaggc 1261 cctctccctc agtaa // LOCUS HUMSAP62X 1395 bp DNA PRI 09-JAN-1995 DEFINITION Human spliceosomal protein (SAP 62) gene, complete cds. ACCESSION L21990 NID g409218 KEYWORDS nuclear protein; spliceosomal protein. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1395) AUTHORS Bennett,M. and Reed,R. TITLE Correspondence between a mammalian spliceosome component and an essential yeast splicing factor JOURNAL Science 262 (5130), 105-108 (1993) MEDLINE 94023929 FEATURES Location/Qualifiers source 1..1395 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" gene 1..1395 /gene="SAP 62" CDS 1..1395 /gene="SAP 62" /codon_start=1 /product="spiceosomal protein" /db_xref="PID:g409219" /translation="MDFQHRPGGKTGSGGVASSSESNRDRREPLRQLALETIDINKDP YFMKNHLGSYECKLCLTLHNNEGSYLAHTQGKKHQTNLARRAAKEAKEAPAQPAPEKV KVEVKKFVKIGRPGYKVTKQRDSEMGQQSLLFQIDYPEIAEGIMPRHRFMSAYEQRIE PPDRRWQYLLMAAEPYETIAFKVPSREIDKAEGKFWTHWNRETKQFFLQFHFKMEKPP APPSLPAGPPGVKRPPPPLMNGLPPRPPLPESLPPPPPGGLPLPPMPPTGPAPSGPPG PPQLPPPAPGVHPPAPVVHPPASGVHPPAPGVHPPAPGVHPPAPGVHPPTSGVHPPAP GVHPPAPGVHPPAPGVHPPAPGVHPPAPGVHPPPSAGVHPQAPGVHPAAPAVHPQAPG VHPPAPGMHPQAPGVHPQPPGVHPSAPGVHPQPPGVHPSNPGVHPPTPMPPMLRPPLP SEGPGNIPPPPPTN" misc_feature 48..78 /gene="SAP 62" /standard_name="zinc finger" repeat_region 286..439 /note="proline repeat" BASE COUNT 262 a 570 c 372 g 191 t ORIGIN 1 atggacttcc agcatcgccc cgggggcaag accgggagcg ggggcgtggc ctcctcctcc 61 gagagcaacc gtgaccgcag ggagccgctc cggcagctgg ccctggagac catcgacatc 121 aacaaggacc cgtacttcat gaagaaccac ctgggctcct atgaatgcaa actctgcctg 181 acacttcaca acaatgaggg gagctacctg gcacatacgc aggggaagaa gcaccagacc 241 aacctggccc ggcgagcagc caaggaggcc aaggaggccc ctgcccagcc cgcgcctgag 301 aaggtcaagg tggaggtgaa gaagtttgtg aagatcggcc gcccgggcta caaagtgacc 361 aagcagagag actcggagat gggccagcag agcctcctct tccagattga ctaccctgag 421 atcgccgagg gcatcatgcc acgtcaccgc ttcatgtctg cgtacgagca gaggatcgag 481 cctccggacc ggcgctggca gtacctgctc atggccgccg aaccctacga gaccattgcc 541 ttcaaggtgc cgagcagaga gatcgacaag gcggagggca agttctggac acactggaac 601 cgggagacca agcagttctt cctccagttc cactttaaga tggagaagcc cccggctcca 661 cccagcctcc ctgctggccc ccctggggtg aagcggcctc cacccccgct gatgaacggt 721 ctgccccctc ggccaccgct gcctgagtct ttgccaccgc ccccgccagg aggcctgcct 781 ctgccaccca tgccccccac agggcctgcg ccctcagggc ccccgggacc accccagcta 841 cccccgccag ctccaggggt ccaccccccg gccccagtgg tgcatccccc tgcatctggg 901 gtccatcccc cagctcctgg cgtccacccc ccagctcctg gcgtccatcc cccagcccct 961 ggggtccacc caccaacctc tggggtccac cccccagctc ctggagtcca ccctccagcc 1021 cccggggttc acccaccagc ccccggagtc cacccaccag cccctggggt tcacccacca 1081 gccccagggg tccatcctcc cccatcagcg ggggttcacc cccaggcccc gggggtgcac 1141 ccagcagccc ccgccgttca ccctcaggcc ccaggggtgc acccaccagc cccagggatg 1201 caccctcagg ccccgggggt ccacccccaa cctcccgggg tccatccgtc ggctcctggg 1261 gtccaccctc agcctccggg agttcacccc tcaaatcctg gggtgcaccc cccaactccc 1321 atgcccccaa tgctgaggcc cccacttccc tccgaaggcc cagggaacat acctccccct 1381 cccccaacca actga // LOCUS HUMSATB1A 2946 bp mRNA PRI 09-JAN-1995 DEFINITION Human MAR/SAR DNA binding protein (SATB1) mRNA, complete cds. ACCESSION M97287 NID g337810 KEYWORDS MAR/SAR DNA binding protein. SOURCE Homo sapiens male adult testis cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2946) AUTHORS Dickinson,L.A., Joh,T., Kohwi,Y. and Kohwi-Shigematsu,T. TITLE A tissue-specific MAR/SAR DNA-binding protein with unusual binding site recognition JOURNAL Cell 70 (4), 631-645 (1992) MEDLINE 92370684 FEATURES Location/Qualifiers source 1..2946 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /sex="male" /tissue_type="testis" 5'UTR <1..214 /gene="SATB1" /note="putative" gene 1..2946 /gene="SATB1" CDS 215..2506 /gene="SATB1" /note="putative" /codon_start=1 /function="MAR/SAR DNA binding protein" /db_xref="PID:g337811" /translation="MDHLNEATQGKEHSEMSNNVSDPKGPPAKIARLEQNGSPLGRGR LGSTGAKMQGVPLKHSGHLMKTNLRKGTMLPVFCVVEHYENAIEYDCKEEHAEFVLVR KDMLFNQLIEMALLSLGYSHSSAAQAKGLIQVGKWNPVPLSYVTDAPDATVADMLQDV YHVVTLKIQLHSCPKLEDLPPEQWSHTTVRNALKDLLKDMNQSSLAKECPLSQSMISS IVNSTYYANVSAAKCQEFGRWYKHFKKTKDMMVEMDSLSELSQQGANHVNFGQQPVPG NTAEQPPSPAQLSHGSQPSVRTPLPNLHPGLVSTPISPQLVNQQLVMAQLLNQQYAVN RLLAQQSLNQQYLNHPPPVSRSMNKPLEQQVSTNTEVSSEIYQWVRDELKRAGISQAV FARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFLQLPEAERDRIYQDERER SLNAASAMGPAPLISTPPSRPPQVKTATIATERNGKPENNTMNINASIYDEIQQEMKR AKVSQALFAKVAATKSQGWLCELLRWKEDPSPENRTLWENLSMIRRFLSLPQPERDAI YEQESNAVHHHGDRPPHIIHVPAEQIQQQQQQQQQQQQQQQAPPPPQPQQQPQTGPRL PPRQPTVASPAESDEENRQKTRPRTKISVEALGILQSFIQDVGLYPDEEAIQTLSAQL DLPKYTIIKFFQNQRYYLKHHGKLKDNSGLEVDVAEYKEEELLKDLEESVQDKNTNTL FSVKLEEELSVEGNTDINTDLKD" 3'UTR 2504..2946 /gene="SATB1" /evidence=experimental polyA_site 2929 /gene="SATB1" /evidence=experimental BASE COUNT 883 a 698 c 666 g 699 t ORIGIN 1 ggggggaaag gaaaataata caatttcagg ggaagtcgcc ttcaggtctg ctgctttttt 61 attttttttt ttttaattaa aaaaaaaaag gacatagaaa acatcagtct tgaacttctc 121 ttcaagaacc cgggctgcaa aggaaatctc ctttgttttt gttatttatg tgctgtcaag 181 ttttgaagtg gtgatcttta gacagtgact gagtatggat catttgaacg aggcaactca 241 ggggaaagaa cattcagaaa tgtctaacaa tgtgagtgat ccgaagggtc caccagccaa 301 gattgcccgc ctggagcaga acgggagccc gctaggaaga ggaaggcttg ggagtacagg 361 tgcaaaaatg cagggagtgc ctttaaaaca ctcgggccat ctgatgaaaa ccaaccttag 421 gaaaggaacc atgctgccag ttttctgtgt ggtggaacat tatgaaaacg ccattgaata 481 tgattgcaag gaggagcatg cagaatttgt gctggtgaga aaggatatgc ttttcaacca 541 gctgatcgaa atggcattgc tgtctctagg ttattcacat agctctgctg cccaggccaa 601 agggctaatc caggttggaa agtggaatcc agttccactg tcttacgtga cagatgcccc 661 tgatgctaca gtagcagata tgcttcaaga tgtgtatcat gtggtcacat tgaaaattca 721 gttacacagt tgccccaaac tagaagactt gcctcccgaa caatggtcgc acaccacagt 781 gaggaatgct ctgaaggact tactgaaaga tatgaatcag agttcattgg ccaaggagtg 841 ccccctttca cagagtatga tttcttccat tgtgaacagt acttactatg caaatgtctc 901 agcagcaaaa tgtcaagaat ttggaaggtg gtacaaacat ttcaagaaga caaaagatat 961 gatggttgaa atggatagtc tttctgagct atcccagcaa ggcgccaatc atgtcaattt 1021 tggccagcaa ccagttccag ggaacacagc cgagcagcct ccatcccctg cgcagctctc 1081 ccatggcagc cagccctctg tccggacacc tcttccaaac ctgcaccctg ggctcgtatc 1141 aacacctatc agtcctcaat tggtcaacca gcagctggtg atggctcagc tgctgaacca 1201 gcagtatgca gtgaatagac ttttagccca gcagtcctta aaccaacaat acttgaacca 1261 ccctccccct gtcagtagat ctatgaataa gcctttggag caacaggttt cgaccaacac 1321 agaggtgtct tccgaaatct accagtgggt acgcgatgaa ctgaaacgag caggaatctc 1381 ccaggcggta tttgcacgtg tggcttttaa cagaactcag ggcttgcttt cagaaatcct 1441 ccgaaaggaa gaggacccca agactgcatc ccagtctttg ctggtaaacc ttcgggctat 1501 gcagaatttc ttgcagttac cggaagctga aagagaccga atataccagg acgaaaggga 1561 aaggagcttg aatgctgcct cggccatggg tcctgccccc ctcatcagca caccacccag 1621 ccgtcctccc caggtgaaaa cagctactat tgccactgaa aggaatggga aaccagagaa 1681 caataccatg aacattaatg cttccattta tgatgagatt cagcaggaaa tgaagcgtgc 1741 taaagtgtct caagcactgt ttgcaaaggt tgcagcaacc aaaagccagg gatggttgtg 1801 cgagctgtta cgctggaaag aagatccttc tccagaaaac agaaccctgt gggagaacct 1861 ctccatgatc cgaaggttcc tcagtcttcc tcagccagaa cgtgatgcca tttatgaaca 1921 ggagagcaac gcggtgcatc accatggcga caggccgccc cacattatcc atgttccagc 1981 agagcagatt cagcaacagc agcagcaaca gcaacagcag cagcagcagc agcaggcacc 2041 gccgcctcca cagccacagc agcagccaca gacaggccct cggctccccc cacggcaacc 2101 cacggtggcc tctccagcag agtcagatga ggaaaaccga cagaagaccc ggccacgaac 2161 aaaaatttca gtggaagcct tgggaatcct ccagagtttc atacaagacg tgggcctgta 2221 ccctgacgaa gaggccatcc agactctgtc tgcccagctc gaccttccca agtacaccat 2281 catcaagttc tttcagaacc agcggtacta tctcaagcac cacggcaaac tgaaggacaa 2341 ttccggttta gaggtcgatg tggcagaata taaagaagag gagctgctga aggatttgga 2401 agagagtgtc caagataaaa atactaacac ccttttttca gtgaaactag aagaagagct 2461 gtcagtggaa ggaaacacag acattaatac tgatttgaaa gactgagata aaagtatttg 2521 tttcgttcaa cagtgccact ggtatttact aacaaaatga aaagtccacc ttgtcttctc 2581 tcagaaaacc tttgttgttc attgtttggc caatgaatct tcaaaaactt gcacaaacag 2641 aaaagttgga aaaggataat acagactgca ctaaatgttt tcctctgttt tacaaactgc 2701 ttggcagccc caggtgaagc atcaaggatt gtttggtatt aaaatttgtg ttcacgggat 2761 gcaccaaagt gtgtaccccg taagcatgaa accagtgttt tttgtttttt ttttagttct 2821 tattccggag cctcaaacaa gcattatacc ttctgtgatt atgatttcct ctcctataat 2881 tatttctgta gcactccaca ctgatctttg gaaacttgcc ccttatttaa aaaaaaaaaa 2941 aaaaaa // LOCUS HUMSCAD 1829 bp mRNA PRI 09-JAN-1995 DEFINITION Human short chain acyl-CoA dehydrogenase mRNA, complete cds. ACCESSION M26393 NID g337927 KEYWORDS short chain acyl-Coenzyme A dehydrogenase. SOURCE Human placenta, cDNA to mRNA, clones HS-[1,12]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1829) AUTHORS Naito,E., Ozasa,H., Ikeda,Y. and Tanaka,K. TITLE Molecular cloning and nucleotide sequence of complementary DNAs encoding human short chain acyl-coenzyme A dehydrogenase and the study of the molecular basis of human short chain acyl-coenzyme A dehydrogenase deficiency JOURNAL J. Clin. Invest. 83 (5), 1605-1613 (1989) MEDLINE 89214689 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by E.Naito, 21-JUL-1989. FEATURES Location/Qualifiers source 1..1829 /organism="Homo sapiens" /db_xref="taxon:9606" /map="12q22-qter" mRNA <1..1829 /note="short chain acyl-CoA dehydrogenase mRNA" sig_peptide 33..104 /gene="ACADS" /note="short chain acyl-CoA dehydrogenase signal peptide" gene 33..1271 /gene="ACADS" CDS 33..1271 /gene="ACADS" /note="short chain acyl-CoA dehydrogenase precursor (EC 1.3.99.2)" /codon_start=1 /db_xref="GDB:G00-118-959" /db_xref="PID:g337928" /translation="MAAALLARASGPARRALCPRAWRQLHTIYQSVELPETHQMLLQT CRDFAEKELFPIAAQVDKEHLFPAAQVKKMGGLGLLAMDVPEELGGAGLDYLAYAIAM EEISRGCASTGVIMSVNNSLYLGPILKFGSKEQKQAWVTPFTSGDKIGCFALSEPGNG SDAGAASTTARAEGDSWVLNGTKAWITNAWEASAAVVFASTDRALQNKGISAFLVPMP TPGLTLGKKEDKLGIRGSSTANLIFEDCRIPKDSILGEPGMGFKIAMQTLDMGRIGIA SQALGIAQTALDCAVNYAENRMAFGAPLTKLQVIQFKLADMALALESARLLTWRAAML KDNKKPFIKEAAMAKLAASEAATAISHQAIQILGGMGYVTEMPAERHYRDARITEIYE GTSEIQRLVIAGHLLRSYRS" mat_peptide 105..1268 /gene="ACADS" /note="short chain acyl-CoA dehydrogenase" BASE COUNT 336 a 570 c 594 g 329 t ORIGIN 1 gggattcggg cctgggactg tgtctgtcgc ccatggccgc cgcgctgctc gcccgggcct 61 cgggccctgc ccgcagagct ctctgtccta gggcctggcg gcagttacac accatctacc 121 agtctgtgga actgcccgag acacaccaga tgttgctcca gacatgccgg gactttgccg 181 agaaggagtt gtttcccatt gcagcccagg tggataagga acatctcttc ccagcggctc 241 aggtgaagaa gatgggcggg cttgggcttc tggccatgga cgtgcccgag gagcttggcg 301 gtgctggcct cgattacctg gcctacgcca tcgccatgga ggagatcagc cgtggctgcg 361 cctccaccgg agtcatcatg agtgtcaaca actctctcta cctggggccc atcttgaagt 421 ttggctccaa ggagcagaag caggcgtggg tcacgccttt caccagtggt gacaaaattg 481 gctgctttgc cctcagcgaa ccagggaacg gcagtgatgc aggagctgcg tccaccaccg 541 cccgggccga gggcgactca tgggttctga atggaaccaa agcctggatc accaatgcct 601 gggaggcttc ggctgccgtg gtctttgcca gcacggacag agccctgcaa aacaagggca 661 tcagtgcctt cctggtcccc atgccaacgc ctgggctcac gttggggaag aaagaagaca 721 agctgggcat ccggggctca tccacggcca acctcatctt tgaggactgt cgcatcccca 781 aggacagcat cctgggggag ccagggatgg gcttcaagat agccatgcaa accctggaca 841 tgggccgcat cggcatcgcc tcccaggccc tgggcattgc ccagaccgcc ctcgattgtg 901 ctgtgaacta cgctgagaat cgcatggcct tcggggcgcc cctcaccaag ctccaggtca 961 tccagttcaa gttggcagac atggccctgg ccctggagag tgcccggctg ctgacctggc 1021 gcgctgccat gctgaaggat aacaagaagc ctttcatcaa ggaggcagcc atggccaagc 1081 tggccgcctc ggaggccgcg accgccatca gccaccaggc catccagatc ctgggcggca 1141 tgggctacgt gacagagatg ccggcagagc ggcactaccg cgacgcccgc atcactgaga 1201 tctacgaggg caccagcgaa atccagcggc tggtgatcgc cgggcatctg ctcaggagct 1261 accggagctg agcccgcggc ggactgcccc aggactgcgg gaaggcgcgg gagccagggg 1321 cctccacccc aaccccggct cagagactgg gcggcccggc gggggctccc tggggacccc 1381 agatgggctc agtgctgcca cccagatcag atcacatggg aatgaggccc tccgaccatt 1441 ggcagctccg cctctgggcc tttccgcctc ctcaccactg tgcctcaagt tcctcatcta 1501 agtggccctg gctcctgggg gcggggttgt gggggggctg agcgacactc agggacacct 1561 cagttgtcct cccgcgggcc ctggtgccct ggcatgaagg cccagtgcga caggcccttg 1621 gtggggtctg tcttttcctt gaggtcagag gtcaggagca gggctggggt caggatgacg 1681 aggcctgggg tcctggtgtt gggcaggtgg tggggctggg ccatggagct ggcccagagg 1741 cccctcagcc ctttgtaaag tctgatgaag gcaggggtgg tgattcatgc tgtgtgactg 1801 actgtgggta ataaacacac ctgtccccc // LOCUS HUMSCF 1405 bp mRNA PRI 19-JAN-1996 DEFINITION Human stem cell factor mRNA, complete cds. ACCESSION M59964 NID g337933 KEYWORDS stem cell factor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1405) AUTHORS Martin,F.H., Suggs,S., Langley,K.E., Lu,H.S., Ting,J., Okino,K.H., Morris,C.F., McNiece,I.K., Jacobsen,F.W., Mendiaz,E.A., Birkett,N.C., Smith,K.A., Johnson,M.J., Parker,V.P., Flores,J.C., Patel,A.C., Fisher,E.F., Erjavec,H.O., Herrera,C., Wypych,J., Sachdev,R.K., Pope,J.A., Leslie,I., Wen,D., Lin,C.-H., Cupples,R.L. and Zsebo,K.M. TITLE Primary structure and functional expression of rat and human stem cell factor DNAs JOURNAL Cell 63 (1), 203-211 (1990) MEDLINE 91004219 FEATURES Location/Qualifiers source 1..1405 /organism="Homo sapiens" /db_xref="taxon:9606" gene 184..1005 /gene="SCF" CDS 184..1005 /gene="SCF" /codon_start=1 /product="stem cell factor" /db_xref="PID:g337934" /translation="MKKTQTWILTCIYLQLLLFNPLVKTEGICRNRVTNNVKDVTKLV ANLPKDYMITLKYVPGMDVLPSHCWISEMVVQLSDSLTDLLDKFSNISEGLSNYSIID KLVNIVDDLVECVKENSSKDLKKSFKSPEPRLFTPEEFFRIFNRSIDAFKDFVVASET SDCVVSSTLSPEKDSRVSVTKPFMLPPVAASSLRNDSSSSNRKAKNPPGDSSLHWAAM ALPALFSLIIGFAFGALYWKKRQPSLTRAVENIQINEEDNEISMLQEKEREFQEV" BASE COUNT 416 a 286 c 311 g 392 t ORIGIN 1 ccgcctcgcg ccgagactag aagcgctgcg ggaagcaggg acagtggaga gggcgctgcg 61 ctcgggctac ccaatgcgtg gactatctgc cgccgctgtt cgtgcaatat gctggagctc 121 cagaacagct aaacggagtc gccacaccac tgtttgtgct ggatcgcagc gctgcctttc 181 cttatgaaga agacacaaac ttggattctc acttgcattt atcttcagct gctcctattt 241 aatcctctcg tcaaaactga agggatctgc aggaatcgtg tgactaataa tgtaaaagac 301 gtcactaaat tggtggcaaa tcttccaaaa gactacatga taaccctcaa atatgtcccc 361 gggatggatg ttttgccaag tcattgttgg ataagcgaga tggtagtaca attgtcagac 421 agcttgactg atcttctgga caagttttca aatatttctg aaggcttgag taattattcc 481 atcatagaca aacttgtgaa tatagtcgat gaccttgtgg agtgcgtcaa agaaaactca 541 tctaaggatc taaaaaaatc attcaagagc ccagaaccca ggctctttac tcctgaagaa 601 ttctttagaa tttttaatag atccattgat gccttcaagg actttgtagt ggcatctgaa 661 actagtgatt gtgtggtttc ttcaacatta agtcctgaga aagattccag agtcagtgtc 721 acaaaaccat ttatgttacc ccctgttgca gccagctccc ttaggaatga cagcagtagc 781 agtaatagga aggccaaaaa tccccctgga gactccagcc tacactgggc agccatggca 841 ttgccagcat tgttttctct tataattggc tttgcttttg gagccttata ctggaagaag 901 agacagccaa gtcttacaag ggcagttgaa aatatacaaa ttaatgaaga ggataatgag 961 ataagtatgt tgcaagagaa agagagagag tttcaagaag tgtaaattgt ggcttgtatc 1021 aacactgtta ctttcgtaca ttggctggta acagttcatg tttgcttcat aaatgaagca 1081 gctttaaaca aattcatatt ctgtctggag tgacagacca catctttatc tgttcttgct 1141 acccatgact ttatatggat gattcagaaa ttggaacaga atgttttact gtgaaactgg 1201 cactgaatta atcatctata aagaagaact tgcatggagc aggactctat tttaaggact 1261 gcgggacttg ggtctcattt agaacttgca gctgatgttg gaagagaaag cacgtgtctc 1321 agactgcatg taccatttgc atggctccag aaatgtctaa atgctgaaaa aacacctagc 1381 tttattcttc agatacaaac tgcag // LOCUS HUMSCL 4153 bp mRNA PRI 15-SEP-1990 DEFINITION Human stem cell protein (SCL) mRNA, complete cds. ACCESSION M29038 NID g337958 KEYWORDS . SOURCE Human male bone marrow, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4153) AUTHORS Begley,C.G., Aplan,P.D., Denning,S.M., Haynes,B.F., Waldmann,T.A. and Kirsch,I.R. TITLE The gene SCL is expressed during early hematopoiesis and encodes a differentiation-related DNA-binding motif JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 10128-10132 (1989) MEDLINE 90099309 COMMENT Authorin entry for [1] kindly submitted by I.R.Kirsch, 23-MAR-1990. FEATURES Location/Qualifiers source 1..4153 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..4153 /note="SCL mRNA" CDS 81..725 /note="stem cell protein (SCL)" /codon_start=1 /db_xref="PID:g337959" /translation="MVQLSPPALAAPAAPGRALLYSLSQPLASLGSGFFGEPDAFPMF TTNNRVKRRPSPYEMEITDGPHTKVVRRIFTNSRERWRQQNVNGAFAELRKLIPTHPP DKKLSKNEILRLAMKYINFLAKLLNDQEEEGTQRAKTGKDPVVGAGGGGGGGGGGAPP DDLLQDVLSPNSSCGSSLDGAASPDSYTEEPAPKHTARSLHPAMLPAADGAGPR" BASE COUNT 928 a 1028 c 1077 g 1120 t ORIGIN 1 agtcagagtc actttctgta aatggtactt aggtaggcgc gtccgcctcg gttacagcgg 61 agctgcccgg cgacggccgc atggtgcagc tgagtcctcc cgcgctggct gcccccgccg 121 cccccggccg cgcgctgctc tacagcctca gccagccgct ggcctctctc ggcagcgggt 181 tctttgggga gccggatgcc ttccctatgt tcaccaccaa caatcgagtg aagaggagac 241 cttcccccta tgagatggag attactgatg gtccccacac caaagttgtg cggcgtatct 301 tcaccaacag ccgggagcga tggcggcagc agaatgtgaa cggggccttt gccgagctcc 361 gcaagctgat ccccacacat cccccggaca agaagctcag caagaatgag atcctccgcc 421 tggccatgaa gtatatcaac ttcttggcca agctgctcaa tgaccaggag gaggagggca 481 cccagcgggc caagactggc aaggaccctg tggtgggggc tggtgggggt ggaggtgggg 541 gagggggcgg cgcgccccca gatgacctcc tgcaagacgt gctttccccc aactccagct 601 gcggcagctc cctggatggg gcagccagcc cggacagcta cacggaggag cccgcgccca 661 agcacacggc ccgcagcctc catcctgcca tgctgcctgc cgccgatgga gccggccctc 721 ggtgatgggt ctgggccacc aggatcagcc aggagggcgt tcttaggctg ctgggatggt 781 gggcttcagg gcaggtgggg tgagaattgg gcggctctga agcaaggcgg tggacttgaa 841 ctttcctgga tgtctgaact ttgggaagcc tttactgacc ctggggctgg cttttctgtt 901 tcctgtacca gtaggagatc agaaaaatgg agcaaagtgg taggtacttt ttgtgaagac 961 ggcacggtct tccctcttcc ctcagtccca aatccttccc aagtaagagg ctggagttgt 1021 cactgctttt ggcctggagt ttgggatccc tgtctttcct aagacctggg gttgtcagct 1081 ctcatctgag gcatccagca gtctctgcct tgcctttagc ccctcccaag ctggctgggg 1141 tggcctgtgt ggccacttct gtccatattt ataggtaccc aatagctgcc catttcgtga 1201 gccccatctt cacccaggcc tatgttgatc catccagctt gccagatgct gcagagtcac 1261 aagcctcgag gtgccttctt cagggcctgg ttgaagaaga tgatcagtgg acagtctgct 1321 ctagatgagc tgggccggag ggtcaggaaa cccagtcgcc cttacttctt gccctgggga 1381 tcaaagttct gctttctccc caatgagact tgccttccta agcctgtggc tgtggagacc 1441 atgtctgcag ccctgagaaa gccctgtcgg gctttgtgtg aaggcagaga aagggacaat 1501 gatagtagag tgatatggag caagagatat tttgggcatg tgggcttcaa ctcctcgaca 1561 tcactgttca tgctggcgag tgaatgccag tgtgctgatg ggcgtacgct ggtgctgagt 1621 agatgcgcag ccccatctgt gcattctcct ggatgcttag agggatttct ttgctgtaag 1681 atgtctgttt gctgatggtc tggtctatgt tccgaattga gcacaaaacc tgtcctatga 1741 atgctttgca tttggaattt ttgcttgact tcagttattg gtggaatctt tagcgctcaa 1801 taggaccagg atccagcctc acttctaggg tatgggaaat ccaatcagag accaggccct 1861 ggctaagacc caaacatatg cacattcact tagcagaacc ttaaacaccc ctcagttgtg 1921 cagcttttgg tcatcaaggg tgcgtctggg aggttggttt aatgcaatag aagtgctccc 1981 ctctgaaagt tgtacatgaa atttttgtaa atcacatcct tatccttcat cttttaaaga 2041 aataaccact gcaagtcctt ttgtaaagtg aagaatcctt ttgtagaatg aaccactgcc 2101 ccttcattga tttcctgtgt caatccagat ggtgggatgt ggttttctta aggtgaggcc 2161 tgtctgtgac ctgcatctaa gcccatggga caaattgcac agaagtcctg tatgtctgtc 2221 attgtaccct taagtcaccc tagccctctc cctctaggct ctgccttcga ggtcagagga 2281 gagatagcct gtggccctgt cctgccatgc aagaactcat cactgtggct gtctggaaag 2341 ccccccctta tagtttgggc ttcagcctag tggcttgtcc tcaccatgat ggggccctaa 2401 ttcagccatg tacagacaga gaatatgtct gctcctttcc ccttcctttt aagtaaggtc 2461 caattctcga gcttggggca acattgttca cctttgtagc actcaggctc tccattcaat 2521 ttcaggctcc ccagatcatg ttttggtgaa aattagggtt ggttcctttc caacgtttgg 2581 aagatcctgt gaggagcccc atctgtctaa agatagagtc attgctgtag gatctaaggc 2641 tgtttgcttc accgtggatt cgcttgagtt aggaatgaga agtagccaca gtatggatgg 2701 gtggatgggt tttatgagat ggatcacata ttttattaag aactcaaact tctggctccc 2761 tcttctttca gacttgccat gtgactctgg cttggcctat ctcctagggc tatggtgtgg 2821 actgaatggg atcatgaaag tagacagttt tgagaacgta aagaactttt tcttttccct 2881 caatctcaat cctgcagtgg ggtttcgcag cctgagtcca cgacctaggc agtaggccgg 2941 tgtgcctgac tgcccagcat ttgggtaatt tagattgtaa accgctttgg cctgagttat 3001 tgagattgtc ctcatttctc cagattatct atttgtgtgt gtgtgtgtgt gtgtgtgaga 3061 gacggtgtct tgttctgtca ctcaggctgg agtacagtgg tgccatcatt gctgtctgca 3121 gccttgaact ctgggctcaa gcaatcctct cacctcagcc tcccgagtag ggaggaccac 3181 aggtgtgagc caccacacct ggctaatttt tacttttttt tttttttggt agagatggag 3241 tcttgctata ttgcccaggc tggtcttgaa gtcctggctt caggcaattc tcctgccttt 3301 gcctccagaa gcactgggat cacaggtgtc agccattgca cccagcccag attgtcttaa 3361 tttctatctt gttccaaggc cagggacagt aataagaatg gaaaagagat atgggaacac 3421 tggcagactg tgtaaaatgt aatgcaacta cccaaaacaa gcctggtagg aaagggcaag 3481 tctttaggtc tttgtaagaa ctaaagaaga tctgtaattt ttattttcac cctctgtacc 3541 ccatgacctt atccttcctc tccttccttg ttacccatga aaaactggca acattccaag 3601 aatagcatct gtacaaaggg gaaagaacat aaaggtaaaa caaaacaaaa caacattttg 3661 agaacaaaga tgaccataac cactgaaggg aatcacatct tttaagacaa attcatattc 3721 ttttatttgt tatggcagat gacaagatgg tacaaccttt attcttttcc aaaataaaac 3781 aaagggcaca gcatctgtag tcagccgaca actctttcgg ccttttgggg gtgggtctgg 3841 ccgtacttgt gatttcgatg gtacgtgacc ctctgctgaa gacttgcccc ctgcccgtgt 3901 acatagtgca ttgtttctgt gggcgggccc agcactttcc gtcaacgttg tactgtatgt 3961 gatgaattgc gttggtctct gcatttttct gcagaagagg agtaaccgct ccaggtacct 4021 tgacctttgt acagcccaga ggccaacact gtgggtgtgt gactctttag caaaaaaaac 4081 ccatgtggtg atgatgtgtc tatatatgtg aggatgtatc gggaagattt ctaaataaaa 4141 gttttacaaa ggg // LOCUS HUMSCN1BA 1404 bp mRNA PRI 12-JAN-1995 DEFINITION Human sodium channel beta-1 subunit (SCN1B) mRNA, complete cds. ACCESSION L10338 NID g307414 KEYWORDS sodium channel beta-1 subunit; voltage-gated sodium channel; voltage-gated sodium channel beta-1 subunit. SOURCE Homo sapiens frontal cortex cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1404) AUTHORS McClatchey,A.I., Cannon,S.C., Slaugenhaupt,S.A. and Gusella,J.F. TITLE The cloning and expression of a sodium channel beta 1-subunit cDNA from human brain JOURNAL Hum. Mol. Genet. 2 (6), 745-749 (1993) MEDLINE 93357746 FEATURES Location/Qualifiers source 1..1404 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="frontal cortex" /map="19q13.1" gene 98..1394 /gene="SCN1B" CDS 98..754 /gene="SCN1B" /codon_start=1 /db_xref="GDB:G00-127-281" /product="sodium channel beta-1 subunit" /db_xref="PID:g307415" /translation="MGRLLALVVGAALVSSACGGCVEVDSETEAVYGMTFKILCISCK RRSETNAETFTEWTFRQKGTEEFVKILRYENEVLQLEEDERFEGRVVWNGSRGTKDLQ DLSIFITNVTYNHSGDYECHVYRLLFFENYEHNTSVVKKIHIEVVDKANRDMASIVSE IMMYVLIVVLTIWLVAEMIYCYKKIAAATETAAQENASEYLAITSESKENCTGVQVAE " polyA_signal 1389..1394 /gene="SCN1B" /note="G00-127-281" BASE COUNT 261 a 456 c 422 g 265 t ORIGIN 1 gaattccgac attctaacgc cgccaggtcc cgccgcctct cgccccgcta ttaataccgg 61 cggcccggga ggggggcgca gcacgcgccg cgcagccatg gggaggctgc tggccttagt 121 ggtcggcgcg gcactggtgt cctcagcctg cgggggctgc gtggaggtgg actcggagac 181 cgaggccgtg tatgggatga ccttcaaaat tctttgcatc tcctgcaagc gccgcagcga 241 gaccaacgct gagaccttca ccgagtggac cttccgccag aagggcactg aggagtttgt 301 caagatcctg cgctatgaga atgaggtgtt gcagctggag gaggatgagc gcttcgaggg 361 ccgcgtggtg tggaatggca gccggggcac caaagacctg caggatctgt ctatcttcat 421 caccaatgtc acctacaacc actcgggcga ctacgagtgc cacgtctacc gcctgctctt 481 cttcgaaaac tacgagcaca acaccagcgt cgtcaagaag atccacattg aggtagtgga 541 caaagccaac agagacatgg catccatcgt gtctgagatc atgatgtatg tgctcattgt 601 ggtgttgacc atatggctcg tggcagagat gatttactgc tacaagaaga tcgctgccgc 661 cacggagact gctgcacagg agaatgcctc ggaatacctg gccatcacct ctgagagcaa 721 agagaactgc acgggcgtcc aggtggccga atagccctgg ccctgggccc cgcctcaagg 781 aagagccagc cgtatgggga ccctccaggc accgcctgcc cccagcgtgg gggtggccac 841 tcctgggccc ccagaaagcc tcagagtcct gccgacggag ccactggggt gggagggggc 901 agggggcttg gctcgcaccc ccactttcgc ctcctccagc tcctgccccg ccggccgcgc 961 accgccatgc atgatgggta aagcaatact gccgctgccc ccaccctgct tctgctgcct 1021 gtttggggag gggggcggtg aggtgggggc agcggccccg cacccctcct ccttgctgat 1081 tgcacacatt ggccgcttca gacacgcact tctggggcca gcccctcccc gcctcctctc 1141 tggcgggcag gggtcgcgat gatgggctgg agcagtttgg ggcagggggt tctgggaccc 1201 actccgactc ccccctcccc ggcatcattt cccctcccgc ttcctccggc tggacctggg 1261 gtcccccctc cctgtaatgg actcctgccc cggcccaacc tcgccctctc tcaccagcct 1321 tgaactgtgg ccacctagaa aggggcccat tcagcctcgt ctctttacag aagtagtttt 1381 gttcatgaaa taaagacgga attc // LOCUS HUMSCP2A 2572 bp mRNA PRI 06-DEC-1993 DEFINITION Human sterol carrier protein X/sterol carrier protein 2 mRNA, complete cds. ACCESSION M75883 NID g432974 KEYWORDS sterol carrier protein-2, sterol carrier protein X. SOURCE Human liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2572) AUTHORS He,Z., Yamamoto,R., Furth,E.A., Schantz,L.J., Naylor,S.L., George,H., Billheimer,J.T. and Strauss,J.F.III. TITLE cDNAs encoding members of a family of proteins related to human sterol carrier protein 2 and assignment of the gene to human chromosome 1p21-pter JOURNAL DNA Cell Biol. 10, 559-569 (1991) MEDLINE 92029618 REFERENCE 2 (bases 1 to 2572) AUTHORS Vesa,J., Hellsten,E., Branoski,B.L., Emanuel,B.S., Billheimer,J.T., Mead,S., Cowell,J.K., Strauss,J.F.III. and Peltonen,L. TITLE Assignment of sterol carrier protein X/sterol carrier protein 2 to 1p32 and exclusion as the causative gene for infantile neuronal cesoid lipofusionosis JOURNAL Unpublished FEATURES Location/Qualifiers source 1..2572 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /clone_lib="Clontech HL-1001b;HL-1115b" /map="1p32" gene 22..1665 /gene="SCP-X/SCP-2" CDS 22..1665 /gene="SCP-X/SCP-2" /codon_start=1 /product="sterol carrier protein-X/sterol carrier protein-2" /db_xref="PID:g432975" /translation="MSSSPWEPATLRRVFVVGVGMTKFVKPGAENSRDYPDLAEEAGK KALADAQIPYSAVDQACVGYVFGDSTCGQRAIYHSLGMTGIPIINVNNNCATGSTALF MARQLIQGGVAECVLALGFEKMSKGSLGIKFSDRTIPTDKHVDLLINKYGLSAHPVAP QMFGYAGKEHMEKYGTKIEHFAKIGWKNHKHSVNNPYSQFQDEYSLDEVMASKEVFDF LTILQCCPTSDGAAAAILASEAFVQKYGLQSKAVEILAQEMMTDLPSSFEEKSIIKMV GFDMSKEAARKCYEKSGLTPNDIDVIELHDCFSTNELLTYEALGLCPEGQGATLVDRG DNTYGGKWVINPSGGLISKGHPLGATGLAQCAELCWQLRGEAGKRQVPGAKVALQHNL GIGGAVVVTLYKMGFPEAASSFRTHQIEAVPTSSASDGFKANLVFKEIEKKLEEEGEQ FVKKIGGIFAFKVKDGPGGKEATWVVDVKNGKGSVLPNSDKKADCTITMADSDFLALM TGKMNPQSAFFQGKLKITGNMGLAMKLQNLQLQPGNAKL" polyA_signal 1932..1937 /evidence=not_experimental polyA_signal 2213..2218 /evidence=not_experimental polyA_signal 2258..2263 /evidence=not_experimental polyA_signal 2341..2346 /evidence=experimental polyA_signal 2550..2555 /evidence=experimental polyA_site 2572 /evidence=experimental BASE COUNT 794 a 439 c 570 g 769 t ORIGIN 1 cggtcccgca ctggtgcagc catgtcctct tccccgtggg agcctgcgac cctgcgccgg 61 gtgttcgtgg tgggggttgg catgaccaag tttgtgaagc ctggagctga gaattcaaga 121 gactaccctg acttggcaga agaagcaggc aagaaggctt tagctgatgc acagatccct 181 tattcagcag tggaccaggc atgtgttggc tatgtttttg gtgactctac ctgtgggcag 241 agggctatct atcacagttt gggaatgact ggaattccta taatcaatgt caacaataac 301 tgtgctactg gttctactgc tttgtttatg gcccgccagc tgattcaggg tggtgtggca 361 gaatgtgtct tggctcttgg gtttgagaag atgagtaagg gaagccttgg aataaaattt 421 tcagatagaa ccattcccac tgataagcat gttgacctcc tgatcaataa gtatggattg 481 tctgctcacc cagttgctcc tcagatgttt gggtatgctg gaaaagaaca tatggaaaaa 541 tatggaacaa aaattgaaca ctttgcaaaa attggatgga aaaatcataa acattcagtt 601 aataacccgt attcccagtt ccaagatgaa tacagtttag atgaagtgat ggcatctaaa 661 gaagtttttg attttttgac tatcttacaa tgttgtccca cttcagatgg tgctgcagca 721 gcaattttgg ccagtgaagc atttgtacag aagtatggcc tgcaatccaa agctgtggaa 781 attttggcac aagaaatgat gactgatttg ccaagctcgt ttgaagaaaa aagcattatt 841 aaaatggttg gctttgatat gagtaaagaa gctgcaagaa aatgctatga gaaatctggc 901 ctgacaccaa atgatattga cgtaatagaa cttcacgatt gcttttctac caacgaactc 961 ctgacttatg aagcactcgg actctgtcca gaaggacaag gtgcaacgct ggttgataga 1021 ggagataata catatggagg aaagtgggtc ataaatccta gtggtggact gatttcaaag 1081 ggacacccac taggcgctac aggtcttgct cagtgtgcag aactctgctg gcagctgaga 1141 ggggaagccg gaaagaggca agttcctggt gcaaaggtgg ctctgcagca taatttaggc 1201 attggaggag ctgtggttgt aacactctac aagatgggtt ttccggaagc cgccagttct 1261 tttagaactc atcaaattga agctgttcca accagctctg caagtgatgg atttaaggca 1321 aatcttgttt ttaaggagat tgagaagaaa cttgaagagg aaggggaaca gtttgtgaag 1381 aaaatcggtg gtatttttgc cttcaaggtg aaagatggcc ctgggggtaa agaggccacc 1441 tgggtggtgg atgtgaagaa tggcaaagga tcagtgcttc ctaactcaga taagaaggct 1501 gactgcacaa tcacaatggc tgactcagac ttcctggctt taatgactgg taaaatgaat 1561 cctcagtcgg ccttctttca aggcaaattg aaaatcactg gcaacatggg tctcgctatg 1621 aagttacaaa atcttcagct tcagccaggc aacgctaagc tctgaagaac tccctttggc 1681 tacttttgaa aatcaagatg agatatatag atatatatcc atacatttta ttgtcagaat 1741 ttagactgaa actacacatt ggcaaatagc gtggatagga tttgtttctt aatgggtgtg 1801 accaatcctg tttttcctat gctctgggtg aatagagcct gatggtatac tactgctttg 1861 cggaattgca tacaactgtg cattacaaag ttaatatggt aattatggtc tggggtaaaa 1921 ttgagtttca gaataaaatt aggaacagta aaatccaaag aactatgtaa acaaaaaagc 1981 ttttgttttg cttacaaagt atatttaagg attattctgc tgaagattca gtttaagagt 2041 tttccttggg agaactaagt aagaaacaca atgccaacag ctggccagta attagtgttg 2101 tgcacttcat gtcattaatc aatttctcaa tagttcttaa aattagtgag attaaaaatc 2161 taaaaatttt gcatttcatg ctatcagaaa cagtattttc ttcccaaatc aaaataaaag 2221 aaatatgatc agagcttgaa cacaggctta tttttaaaat aaaaatattt ttaacatggg 2281 tttccttatt gaaaaatcag tgtattagtc ataaaacacc atcattaaga ataattgaac 2341 aataaagttt gctttcagat gcagttttca aattataatc tcatttcaat ttataacgtt 2401 ctcagtcctt tgttataatt ttcctttttc atgtaagttt aattatctgc atttatcttt 2461 tttcctagtt tttctaatac taatgttatt tcttaaaatt cagtgagata taggataaaa 2521 taatgctttg agaagaatgt ttaatagaaa attaaaataa ctttttctgg ca // LOCUS HUMSCR3B 1509 bp mRNA PRI 04-MAR-1997 DEFINITION Human scr3 mRNA for RNA binding protein SCR3, complete cds. ACCESSION D28483 NID g520589 KEYWORDS scr3; SCR3; RNA binding protein; multicopy suppressor. SOURCE Homo sapiens (isolate:Basinger) fibrablast cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Kanaoka,Y. and Nojima,H. TITLE SCR: novel human suppressors of cdc2/cdc13 mutants of Schizosaccharomyces pombe harbour motifs for RNA binding proteins JOURNAL Nucleic Acids Res. 22 (13), 2687-2693 (1994) MEDLINE 94316516 REFERENCE 2 (bases 1 to 1509) AUTHORS Kanaoka,Y. and Nojima,H. JOURNAL Unpublished (1994) REFERENCE 3 (bases 1 to 1509) AUTHORS Nojima,H. TITLE Direct Submission JOURNAL Submitted (07-FEB-1994) to the DDBJ/EMBL/GenBank databases. Hiroshi Nojima, Osaka University, Department of Molecular Genetics, RIMD; 3-1 Yamadaoka, Suita, Osaka 565, Japan (E-mail:h63073a@center.osaka-u.ac.jp, Tel:06-875-3980, Fax:06-875-5192) FEATURES Location/Qualifiers source 1..1509 /organism="Homo sapiens" /isolate="Basinger" /db_xref="taxon:9606" /cell_type="fibrablast" gene 17..1240 /gene="scr3" CDS 17..1240 /gene="scr3" /note="RNA-binding: AA97-104, AA175-182" /codon_start=1 /product="SCR3" /db_xref="PID:d1006389" /db_xref="PID:g558530" /translation="MLLSVTSRPGISTFGYNRNNKKPYVSLAQQMAPPSPSNSTPNSS SGSNGNDQLSKTNLYIRGLQPGTTDQDLVKLCQPYGKIVSTKAILDKTTNKCKGYGFV DFDSPSAAQKAVTALKASGVQAQMAKQQEQDPTNLYISNLPLSMDEQELEGMLKPFGQ VISTRILRDTSGTSRGVGFARMESTEKCEAIITHFNGKYIKTPPGVPAPSDPLLCKFA DGGPKKRQNQGKFVQNGRAWPRNADMGVMALTYDPTTALQNGFYPAPYNITPNRMLAQ SALSPYLSSPVSSYQRVTQTSPLQVPNPSWMHHHSYLMQPSGSVLTPGMDHPISLQPA SMMGPLTQQLGHLSLSSTGTYMPTAAAMQGAYISQYTPVPSSSVSVEESSGQQNQVAV DAPSEHGVYSFQFNK" BASE COUNT 418 a 421 c 329 g 341 t ORIGIN 1 taacattaaa gagaaaatgc tgctatccgt gacttccagg cccgggattt cgacttttgg 61 ctacaataga aacaacaaga agccatatgt gtcactggct cagcagatgg caccacctag 121 cccaagcaac agtacaccta acagcagtag tggaagcaat ggaaatgacc agctgagcaa 181 aaccaaccta tacatccgag gattgcaacc aggcactact gaccaagatc ttgtcaagct 241 gtgtcagcca tatggcaaga ttgtttccac taaggccata ctggacaaga ccacaaacaa 301 atgtaaaggc tatggctttg tagattttga cagcccttca gcagcacaga aagctgtaac 361 agcactgaag gccagcggtg tacaggcaca gatggcaaag caacaggaac aggaccccac 421 aaatttatac atctcaaacc tcccactgtc aatggatgag caggaactgg aggggatgct 481 gaagcccttt ggccaggtta tctccacccg tatccttcga gataccagtg ggaccagcag 541 aggtgttggc tttgcaagga tggagtccac agagaagtgt gaagccatca tcacccactt 601 taatggaaaa tatattaaga caccccctgg agtaccagcc ccatccgatc ccttgctttg 661 caaatttgct gatggcgggc caaagaaacg acagaaccaa ggaaaatttg tgcaaaatgg 721 acgggcttgg ccaaggaatg cagacatggg cgtcatggcc ttgacctatg accccaccac 781 agctcttcag aatgggtttt acccagcccc ctataacatc acccccaaca ggatgcttgc 841 tcagtctgca ctctccccat acctttcctc tcctgtgtct tcgtatcaga gagtgactca 901 gacatctcct ctacaagtac ctaacccatc ctggatgcac caccattcat acctcatgca 961 gccttcaggt tcagttctga caccagggat ggaccatccc atttctctcc agcctgcctc 1021 catgatggga ccccttaccc agcaactggg ccatctctcc ctcagcagca caggcacgta 1081 tatgccgacg gctgcagcta tgcaaggagc ttacatctcc cagtacaccc ctgtgccttc 1141 ttccagtgtt tcagtcgagg agagcagcgg ccaacagaac caagtggcag tggacgcacc 1201 ctcagagcat ggggtctatt ctttccagtt caacaagtaa cagtgggatt cccctcccca 1261 tctttactga atagaaatga attcttggag atactcatgc tcccagattc cagagggtta 1321 accaggaatg gagaccatcc gtcggccctg ctaaggacta acacttagcc atcgtttttc 1381 acaggcctgg gcctggaaaa agaaatctct acgttcctgc cctttactat tgctgatgga 1441 gcctggggga accatcactt tttttgtgtg ctacattcaa ggagatcaaa aaaacttttc 1501 ttcttttgc // LOCUS HUMSDHX 2277 bp mRNA PRI 15-AUG-1994 DEFINITION Human succinate dehydrogenase flavoprotein subunit (SDH) mRNA, complete cds. ACCESSION L21936 NID g347133 KEYWORDS succinate dehydrogenase flavoprotein subunit. SOURCE Homo sapiens (library: lambda ZAPII; Stratagene #936208) male heart cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2277) AUTHORS Morris,A.A., Farnsworth,L., Ackrell,B.A., Turnbull,D.M. and Birch-Machin,M.A. TITLE The cDNA sequence of the flavoprotein subunit of human heart succinate dehydrogenase JOURNAL Biochim. Biophys. Acta 1185 (1), 125-128 (1994) MEDLINE 94190953 FEATURES Location/Qualifiers source 1..2277 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /tissue_type="heart" /tissue_lib="lambda ZAPII; Stratagene #936208" 5'UTR 1..24 /gene="SDH" gene 1..2277 /gene="SDH" sig_peptide 25..153 /gene="SDH" CDS 25..2019 /gene="SDH" /EC_number="1.3.99.1" /note="putative" /codon_start=1 /function="enzyme of TCA cycle and mitochondrial respiratory chain" /product="succinate dehydrogenase flavoprotein subunit" /db_xref="PID:g347134" /translation="MSGVRGLSRLLSARRLALAKAWPTVLQTGTRGFHFTVDGNKRAS AKVSDSISAQYPVVDHEFDAVVVGAGGAGLRAAFGLSEAGFNTACVTKLFPTRSHTVA AQGGINAALGNMEEDNWRWHFYDTVKGSDWLGDQDAIHYMTEQAPAAVVELENYGMPF SRTEDGKIYQRAFGGQSLKFGKGGQAHRCCCVADRTGHSLLHTLYGRSLRYDTSYFVE YFALDLLMENGECRGVIALCIEDGSIHRIRAKNTVVATGGYGRTYFSCTSAHTSTGDG TAMITRAGLPCQDLEFVQFHPTGIYGAGCLITEGCRGEGGILINSQGERFMERYAPVA KDLASRDVVSRSMTLEIREGRGCGPEKDHVYLQLHHLPPEQLATRLPGISETAMIFAG VDVTKEPIPVLPTVHYNMGGIPTNYKGQVLRHVNGQDQIVPGLYACGEAACASVHGAN RLGANSLLDLVVFGRACALSIEESCRPGDKVPPIKPNAGEESVMNLDKLRFADGSIRT SELRLSMQKSMQNHAAVFRVGSVLQEGCGKISKLYGDLKHLKTFDRGMVWNTDLVETL ELQNLMLCALQTIYGAEARKESRGAHAREDYKVRIDEYDYSKPIQGQQKKPFEEHWRK HTLSFVDVGTGKVTLEYRPVIDKTLNEADCATIPPAIRSY" mat_peptide 154..2016 /gene="SDH" /product="succinate dehydrogenase flavoprotein subunit" 3'UTR 2017..2277 /gene="SDH" BASE COUNT 536 a 559 c 686 g 496 t ORIGIN 1 gactgcgcgg cggcaacagc agacatgtcg ggggtccggg gcctgtcgcg gctgctgagc 61 gctcggcgcc tggcgctggc caaggcgtgg ccaacagtgt tgcaaacagg aacccgaggt 121 tttcacttca ctgttgatgg gaacaagagg gcatctgcta aagtttcaga ttccatttct 181 gctcagtatc cagtagtgga tcatgaattt gatgcagtgg tggtaggcgc tggaggggca 241 ggcttgcgag ctgcatttgg cctttctgag gcagggttta atacagcatg tgttaccaag 301 ctgtttccta ccaggtcaca cactgttgca gcgcagggag gaatcaatgc tgctctgggg 361 aacatggagg aggacaactg gaggtggcat ttctacgaca ccgtgaaggg ctccgactgg 421 ctgggggacc aggatgccat ccactacatg acggagcagg cccccgccgc cgtggtcgag 481 ctagaaaatt atggcatgcc gtttagcaga actgaagatg ggaagattta tcagcgtgca 541 tttggtggac agagcctcaa gtttggaaag ggcgggcagg cccatcggtg ctgctgtgtg 601 gctgatcgga ctggccactc gctattgcac accttatatg gacggtctct gcgatatgat 661 accagctatt ttgtggagta ttttgccttg gatctcctga tggagaacgg ggagtgccgt 721 ggtgtcatcg cactgtgcat agaggacggg tccatccatc gcataagagc aaagaacact 781 gttgttgcca caggaggcta cgggcgcacc tacttcagct gcacgtctgc ccacaccagc 841 actggcgacg gcacggccat gatcaccagg gcaggccttc cttgccagga cctagagttt 901 gttcagttcc accccacagg catatatggt gctggttgtc tcattacgga aggatgtcgt 961 ggagagggag gcattctcat taacagtcaa ggcgaaaggt ttatggagcg atacgcccct 1021 gtcgcgaagg acctggcgtc tagagatgtg gtgtctcggt cgatgactct ggagatccga 1081 gaaggaagag gctgtggccc tgagaaagat cacgtctacc tgcagctgca ccacctacct 1141 ccagagcagc tggccacgcg cctgcctggc atttcagaga cagccatgat cttcgctggc 1201 gtggacgtca cgaaggagcc gatccctgtc ctccccaccg tgcattataa catgggcggc 1261 attcccacca actacaaggg gcaggtcctg aggcacgtga atggccagga tcagattgtg 1321 cccggcctgt acgcctgtgg ggaggccgcc tgtgcctcgg tacatggtgc caaccgcctc 1381 ggggcaaact cgctcttgga cctggttgtc tttggtcggg catgtgccct gagcatcgaa 1441 gagtcatgca ggcctggaga taaagtccct ccaattaaac caaacgctgg ggaagaatct 1501 gtcatgaatc ttgacaaatt gagatttgct gatggaagca taagaacatc ggaactgcga 1561 ctcagcatgc agaagtcaat gcaaaatcat gctgccgtgt tccgtgtggg aagcgtgttg 1621 caagaaggtt gtgggaaaat cagcaagctc tatggagacc taaagcacct gaagacgttc 1681 gaccggggaa tggtctggaa cacagacctg gtggagaccc tggagctgca gaacctgatg 1741 ctgtgtgcgc tgcagaccat ctacggagca gaggcgcgga aggagtcacg gggcgcgcat 1801 gccagggaag actacaaggt gcggattgat gagtacgatt actccaagcc catccagggg 1861 caacagaaga agccctttga ggagcactgg aggaagcaca ccctgtcctt tgtggacgtt 1921 ggcactggga aggtcactct ggaatataga cccgtaatcg acaaaacttt gaacgaggct 1981 gactgtgcca ccatcccgcc agccattcgc tcctactgat gagacaagat gtggtgatga 2041 cagaatcagc ttttgtaatt atgtataata gctcatgcat gtgtccatgt cataactgtc 2101 ttcatacgct tctgcactct ggggaagaag gagtacattg aagggagatt ggcacctagt 2161 ggctgggagc ttgccaggaa cccagtggcc agggagcgtg gcacttacct ttgtcccttg 2221 cttcattctt gtgagatgat aaaactgggc acagctctta aataaaatat aaatgag // LOCUS HUMSEC61B 396 bp mRNA PRI 14-JUL-1994 DEFINITION Human Sec61-complex beta-subunit mRNA, complete cds. ACCESSION L25085 NID g459833 KEYWORDS Sec61-complex beta-subunit. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 396) AUTHORS Hartmann,E., Sommer,T., Prehn,S., Gorlich,D., Jentsch,S. and Rapoport,T.A. TITLE Evolutionary conservation of components of the protein translocation complex [see comments] JOURNAL Nature 367 (6464), 654-657 (1994) MEDLINE 94150683 FEATURES Location/Qualifiers source 1..396 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 64..354 /codon_start=1 /function="protein translocation across the er-membrane" /product="Sec61-complex beta-subunit" /db_xref="PID:g459834" /translation="MPGPTPSGTNVGSSGRSPSKAVAARAAGSTVRQRKNASCGTRSA GRTTSAGTGGMWRFYTEDSPGLKVGPVPVLVMSLLFIASVFMLHIWGKYTRS" BASE COUNT 84 a 112 c 108 g 92 t ORIGIN 1 ctttcggggg ctccgtaact ttctatccgt ccgcgtcagc gccttgccac cctcatctcc 61 aatatgcctg gtccgacccc cagtggcact aacgtgggat cctcagggcg ctctcccagc 121 aaagcagtgg ccgcccgggc ggcgggatcc actgtccggc agaggaaaaa tgccagctgt 181 gggacaagga gtgcaggccg cacaacctcg gcaggcaccg gggggatgtg gcgattctac 241 acagaagatt cacctgggct caaagttggc cctgttccag tattggttat gagtcttctg 301 ttcatcgctt ctgtatttat gttgcacatt tggggcaagt acactcgttc gtagattcag 361 ttacatccat ctgtcatcta agaaggagga aaaaac // LOCUS HUMSEC7HOM 3311 bp mRNA PRI 17-SEP-1992 DEFINITION Human homologue of yeast sec7 mRNA, complete cds. ACCESSION M85169 NID g338001 KEYWORDS homologous region. SOURCE Homo sapiens (library: NK subtracted from Jurkat) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3311) AUTHORS Liu,L. and Pohajdak,B. TITLE Cloning and sequencing a human cDNA from cytolytic NK/T cells with homology to yeast SEC7 JOURNAL Biochim. Biophys. Acta 1132, 75-78 (1992) MEDLINE 92379095 FEATURES Location/Qualifiers source 1..3311 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="cytolytic NK/T" /tissue_lib="NK subtracted from Jurkat" CDS 70..1266 /note="yeast sec7 gene homologue" /codon_start=1 /db_xref="PID:g338002" /translation="MEEDDSYVPSDLTAEERQELENIRRRKQELLADIQRLKDEIAEV ANEIENLGSTEERKNMQRNKQVAMGRKKFNMDPKKGIQFLIENDLLKNTCEDIAQFLY KGEGLNKTAIGDYLGERDEFNIQVLHAFVELHEFTDLNLVQALRQFLWSFRLPGEAQK IDRMMEAFAQRYCQCNNGVFQSTDTCYVLSFAIIMLNTSLHNPNVKDKPTVERFIAMN RGINDGGDLPEELLRNLYESIKNEPFKIPEDDGNDLTHTFFNPDREGWLLKLGGGRVK TWKRRWFILTDNCLYYFEYTTDKEPRGIIPLENLSIREVEDSKKPNCFELYIPDNKDQ VIKACKTEADGRVVEGNHTVYRISAPTPEEKEEWIKCIKAAISRDPFYEMLAARKKKV SSTKRH" misc_feature 108 /function="N-linked glycosylation site" misc_feature 197 /function="N-linked glycosylation site" misc_feature 309 /function="N-linked glycosylation site" misc_feature 351 /function="N-linked glycosylation site" misc_feature 1234..1260 /function="PKC site" polyA_signal 3278 polyA_site 3301 BASE COUNT 784 a 820 c 953 g 754 t ORIGIN 1 gcgagcgggg gcgcgggtgg cgcggcggga cgcgagcggc gagccggagc gcgagcccgc 61 tcccgcacca tggaggagga cgacagctac gttcccagtg acctgacagc agaggagcgt 121 caagaactgg agaacatccg acggagaaaa caggagctgc tggctgacat tcagaggctg 181 aaggatgaga tagcagaagt agctaatgaa attgaaaacc tgggatccac agaggaaagg 241 aaaaacatgc agaggaacaa acaggtagcc atgggcagga aaaaatttaa tatggaccct 301 aaaaagggga tccagttctt aatagagaac gacctcctga agaacacttg tgaagacatt 361 gcccagttct tatataaagg cgaagggctc aacaagacag ccatcggcga ctacctaggg 421 gagagagatg agtttaatat ccaggttctt catgcatttg tggagctgca tgagttcact 481 gatcttaatc tcgtccaggc actacggcag ttcctgtgga gcttccggct acccggagag 541 gcccagaaga tcgaccggat gatggaggcg tttgcccagc gatattgtca gtgcaataat 601 ggcgtgttcc agtccacgga tacttgttac gtcctctcct ttgccatcat catgttgaac 661 accagtctgc acaaccccaa tgtcaaagat aagcccactg tggagaggtt cattgccatg 721 aaccgaggca tcaatgatgg gggagacctg ccggaggagc tcctccggaa tctctatgag 781 agcataaaaa atgaaccctt taaaatccca gaagacgacg ggaatgacct cactcacact 841 ttcttcaatc cagaccgaga aggctggcta ttgaaactcg gaggtggcag ggtaaagact 901 tggaagagac gctggttcat tctgactgac aactgccttt actactttga gtataccacg 961 gataaggagc cccgtggaat catcccttta gagaatctga gtatccggga agtggaggac 1021 tccaaaaaac caaactgctt tgagctttat atccccgaca ataaagacca agttatcaag 1081 gcctgcaaga ccgaggctga cgggcgggtg gtggagggga accacactgt ttaccggatc 1141 tcagctccga cgcccgagga gaaggaggag tggattaagt gcattaaagc agccatcagc 1201 agggaccctt tctacgaaat gctcgcagca cggaaaaaga aggtctcctc cacgaagcga 1261 cactgagcgt gcagccaagg gcgttggtct gcgggggcct tggagctcct gctcttctcc 1321 cgcacctcca tggatgcact gctgccgagc agagcgtcct ctgccaggcc ccgccctgga 1381 ttcctagaga ctagcttcag cttttgctat tttttttaag tgggagaagg gtgggcagtt 1441 atcactgggg aagagaggac cggccacctg tccagcatgg gctccagagc cttcctctct 1501 cacagggcag agctcttgtc ggcagggcag cctcctggcc agtttctctg ctcagtgttc 1561 tggtagcaga gctcagagcc aactgtttac ctcttggttg tccccgtgaa gaagccttca 1621 aaccctgcac cataaataca tgtgtccata tattattata tgttaagaga aaaaggtgga 1681 aaggaagaga agccacatac tataaagatc tatttttttt ttttaagaga gaacgtaggg 1741 ctgttcaggt gcattctgcc ctggctgcgc tggggagctt ctccctggag aagagcacct 1801 ggggctgcgg ccaaggggca tcagcctggg cccgcggcag ggcctggcct gcctctcctg 1861 tgctgtggga gctcgctgcc tggtgcttgt cttggcgaga tggacaggtg aggtcgagga 1921 cgcagagggc agaggcccag tggagcctca gacggcacag tcagagtcgg gggcctgcct 1981 ggccggggtc gcagtcggca gcagcgtgca gtccggcatc tcccgcggat gcttttccat 2041 cccaagtgcc tgcggagccc gaggagagga gagagctgac tggacgctta cgttattttc 2101 ctccttcaga atccaagttc ttgttgggct ttaaagtaga aagtcagcat tttccttgag 2161 ctaaatacct aataaccaaa actgtgagga aggttatcgg gacagaggtt ccggataacc 2221 tgtttcattt tgggttttct tcctcttccc cagactccag tcctcgttct agaggaagga 2281 gtaggacttc cccgatcccc gtagcttcag ctttttctgc ctcaaaacca gccctaactg 2341 gactactctg gatgcatttt gtggtgggcc ccctagaggg aagatgggcc tttatctgct 2401 ccgtggggtg cactggagtg aggggggtgg ccgggctgcc tctcgcatct ctgtcttccc 2461 ctgcaggcgc tgtgtgagct ggccctgccc ctcctcatta cagtatgaag ggagccgtga 2521 cacgcagcat tttcctgccg ttctctcagg gactctcagg gcagctcctg ccactccgcc 2581 agggccagca tgccagtcca ggcagagcag gtggctggct gtctggccgt ctcgccccgc 2641 ccctccacag gaccctggac cagggcggtg cagggcgcag ccccgaggag gcaggtggag 2701 gagctgcggg ttttcacagg gccgcgtcgc cacggctcct ctgatccttt agggttggcg 2761 agcatctctg gaaatagctt ttgcagagga gtggtgggag gaatagaggg ggacagtctg 2821 tcacctccct ccccgccact ttgtgtagat cctacctgga gggaatggct ttaggcactt 2881 ttgtgccaga gcttgtgagg gtgacagaag agggtccagg ctggaaacct gaactttctg 2941 ggtgggagaa ccaggtggtg cctgccgagg tctgggcgtg tttgggccgg tgctggagcc 3001 tgtccagctg gcccgggccc tggcctggtt ctcaagtgtt tcctagacag agaggcacct 3061 gggtcagtat tagtctattt atcagaggtg taaataatct atgtatagtt tttctccttt 3121 tagattattt tgtatttgtt taaaagaagt tttgtcaaaa tacaaaaata taaagaaatg 3181 actgaaagtt gttgacaggg tttttaagaa ataattattc taattgtttt tgtttgtttg 3241 tttttgcctt gtaaactagc gccaaggaac tgcagcaaat aaactccaac tctgcccaag 3301 caaaaaaaaa a // LOCUS HUMSEF21B 2500 bp mRNA PRI 09-JAN-1995 DEFINITION Human SEF2-1B protein (SEF2-1B) mRNA, complete cds. ACCESSION M74719 NID g338014 KEYWORDS SEF2-1B protein; helix-loop-helix DNA binding protein. SOURCE Homo sapiens (strain Caucasian) 3 year old thymus cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2500) AUTHORS Corneliussen,B., Thornell,A., Hallberg,B. and Grundstrom,T. TITLE Helix-loop-helix transcriptional activators bind to a sequence in glucocorticoid response elements of retrovirus enhancers JOURNAL J. Virol. 65 (11), 6084-6093 (1991) MEDLINE 92015505 FEATURES Location/Qualifiers source 1..2500 /organism="Homo sapiens" /strain="Caucasian" /db_xref="taxon:9606" /dev_stage="3 year old" /tissue_type="thymus" gene 200..2203 /gene="SEF2-1B" CDS 200..2203 /gene="SEF2-1B" /codon_start=1 /product="SEF2-1B protein" /db_xref="PID:g338015" /translation="MHHQQRMAALGTDKELSDLLDFSAMFSPPVSSGKNGPTSLASGH FTGSNVEDRSSSGSWGNGGHPSPSRNYGDGTPYDHMTSRDLGSHDNLSPPFVNSRIQS KTERGSYSSYGRESNLQGCHQQSLLGGDMDMGNPGTLSPTKPGSQYYQYSSNNPRRRP LHSSAMEVQTKKVRKVPPGLPSSVYAPSASTADYNRDSPGYPSSKPATSTFPSSFFMQ DGHHSSDPWSSSSGMNQPGYAGMLGNSSHIPQSSSYCSLHPHERLSYPSHSSADINSS LPPMSTFHRSGTNHYSTSSCTPPANGTDSIMANRGSGAAGSSQTGDALGKALASIYSP DHTNNSFSSNPSTPVGSPPSLSAGTAVWSRNGGQASSSPNYEGPLHSLQSRIEDRLER LDDAIHVLRNHAVGPSTAMPGGHGDMHGIIGPSHNGAMGGLGSGYGTGLLSANRHSLM VGTHREDGVALRGSHSLLPNQVPVPQLPVQSATSPDLNPPQDPYRGMPPGLQGQSVSS GSSEIKSDDEGDENLQDTKSSEDKKLDDDKKDIKSITSNNDDEDLTPEQKAEREKERR MANNARERLRVRDINEAFKELGRMVQLHLKSDKPQTKLLILHQAVAVILSLEQQVRER NLNPKAACLKRREEEKVSSEPPPLSLAGPHPGMGDASNHMGQM" misc_feature 1889..2065 /gene="SEF2-1B" /note="helix-loop-helix encoding sequence" BASE COUNT 669 a 664 c 636 g 531 t ORIGIN 1 cggggggatc ttggctgtgt gtctgcggat ctgtagtggc ggcggcggcg gcggcggcgg 61 ggaggcagca ggcgcgggag cgggcgcagg agcaggcggc ggcggtggcg gcggcggtta 121 gacatgaacg ccgcctcggc gccggcggtg cacggagagc cccttctcgc gcgcgggcgg 181 tttgtgtgat tttgctaaaa tgcatcacca acagcgaatg gctgccttag ggacggacaa 241 agagctgagt gatttactgg atttcagtgc gatgttttca cctcctgtga gcagtgggaa 301 aaatggacca acttctttgg caagtggaca ttttactggc tcaaatgtag aagacagaag 361 tagctcaggg tcctggggga atggaggaca tccaagcccg tccaggaact atggagatgg 421 gactccctat gaccacatga ccagcaggga ccttgggtca catgacaatc tctctccacc 481 ttttgtcaat tccagaatac aaagtaaaac agaaaggggc tcatactcat cttatgggag 541 agaatcaaac ttacagggtt gccaccagca gagtctcctt ggaggtgaca tggatatggg 601 caacccagga accctttcgc ccaccaaacc tggttcccag tactatcagt attctagcaa 661 taatccccga aggaggcctc ttcacagtag tgccatggag gtacagacaa agaaagttcg 721 aaaagttcct ccaggtttgc catcttcagt ctatgctcca tcagcaagca ctgccgacta 781 caatagggac tcgccaggct atccttcctc caaaccagca accagcactt tccctagctc 841 cttcttcatg caagatggcc atcacagcag tgacccttgg agctcctcca gtgggatgaa 901 tcagcctggc tatgcaggaa tgttgggcaa ctcttctcat attccacagt ccagcagcta 961 ctgtagcctg catccacatg aacgtttgag ctatccatca cactcctcag cagacatcaa 1021 ttccagtctt cctccgatgt ccactttcca tcgtagtggt acaaaccatt acagcacctc 1081 ttcctgtacg cctcctgcca acgggacaga cagtataatg gcaaatagag gaagcggggc 1141 agccggcagc tcccagactg gagatgctct ggggaaagca cttgcttcga tctattctcc 1201 agatcacact aacaacagct tttcatcaaa cccttcaact cctgttggct ctcctccatc 1261 tctctcagca ggcacagctg tttggtctag aaatggagga caggcctcat cgtctcctaa 1321 ttatgaagga cccttacact ctttgcaaag ccgaattgaa gatcgtttag aaagactgga 1381 tgatgctatt catgttctcc ggaaccatgc agtgggccca tccacagcta tgcctggtgg 1441 tcatggggac atgcatggaa tcattggacc ttctcataat ggagccatgg gtggtctggg 1501 ctcagggtat ggaaccggcc ttctttcagc caacagacat tcactcatgg tggggaccca 1561 tcgtgaagat ggcgtggccc tgagaggcag ccattctctt ctgccaaacc aggttccggt 1621 tccacagctt cctgtccagt ctgcgacttc ccctgacctg aacccacccc aggaccctta 1681 cagaggcatg ccaccaggac tacaggggca gagtgtctcc tctggcagct ctgagatcaa 1741 atccgatgac gagggtgatg agaacctgca agacacgaaa tcttcggagg acaagaaatt 1801 agatgacgac aagaaggata tcaaatcaat tactagcaat aatgacgatg aggacctgac 1861 accagagcag aaggcagagc gtgagaagga gcggaggatg gccaacaatg cccgagagcg 1921 tctgcgggtc cgtgacatca acgaggcttt caaagagctc ggccgcatgg tgcagctcca 1981 cctcaagagt gacaagcccc agaccaagct cctgatcctc caccaggcgg tggccgtcat 2041 cctcagtctg gagcagcaag tccgagaaag gaatctgaat ccgaaagctg cgtgtctgaa 2101 aagaagggag gaagagaagg tgtcctcgga gcctccccct ctctccttgg ccggcccaca 2161 ccctggaatg ggagacgcat cgaatcacat gggacagatg taaaagggtc caagttgcca 2221 cattgcttca ttaaaacaag agaccacttc cttaacagct gtattatctt aaacccacat 2281 aaacacttct ccttaacccc catttttgta atataagaca agtctgagta gttatgaatc 2341 gcagacgcaa gaggtttcag cattcccaat tatcaaaaaa cagaaaaaca aaaaaaagaa 2401 agaaaaaagt gcaacttgag ggacgacttt ctttaacata tcattcagaa tgtgcaaagc 2461 agtatgtaca ggctgagaca cagcccagag actgaacggc // LOCUS HUMSEPRED 833 bp mRNA PRI 09-JAN-1995 DEFINITION Human sepiapterin reductase mRNA, complete cds. ACCESSION M76231 NID g338020 KEYWORDS sepiapterin reductase; tetrahydrobiopterin. SOURCE Homo sapiens liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 833) AUTHORS Ichinose,H., Katoh,S., Sueoka,T., Titani,K., Fujita,K. and Nagatsu,T. TITLE Cloning and sequencing of cDNA encoding human sepiapterin reductase--an enzyme involved in tetrahydrobiopterin biosynthesis JOURNAL Biochem. Biophys. Res. Commun. 179 (1), 183-189 (1991) MEDLINE 91354248 FEATURES Location/Qualifiers source 1..833 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /map="Unassigned" gene 23..808 /gene="SPR" CDS 23..808 /gene="SPR" /codon_start=1 /db_xref="GDB:G00-128-778" /label=hSPR /product="sepiapterin reductase" /db_xref="PID:g338021" /translation="MEGGLGRAVCLLTGASRGFGRTLAPLLASLLSPGSVLVLSARND EALRQLEAELGAERSGLRVVRVPADLGAEAGLQQLLGALRELPRPKGLQRLLLINNAG SLGDVSKGFVDLSDSTQVNNYWALNLTSMLCLTSSVLKAFPDSPGLNRTVVNISSLCA LQPFKGWALYCAGKAARDMLFQVLALEEPNVRVLNYAPGPLDTDMQQLARETSVDPDM RKGLQELKAKGKLVDCKVSAQKLLSLLEKDEFKSGAHVDFYDK" BASE COUNT 147 a 255 c 272 g 159 t ORIGIN 1 gccgccggcg gagaacagga gcatggaggg cgggctgggg cgtgctgtgt gcttgctgac 61 cggggcctcc cgcggcttcg gccggacgct ggccccgctc ctggcctcgc tgctgtcgcc 121 cggctccgtg cttgtcctta gcgcccgcaa cgacgaggca ctgcgccagc tggaggccga 181 gctgggcgcc gagcggtctg gcctgcgcgt ggtgcgggtg cccgccgacc tgggcgccga 241 ggccggcttg cagcagctgc tcggcgccct gcgcgagctc ccccggccca aggggctgca 301 gcgactgctg cttatcaaca acgcgggctc tcttggggat gtgtccaaag gcttcgtgga 361 cctgagtgac tccactcaag tgaacaacta ctgggcactg aacttgacct ccatgctctg 421 cctgacttcc agcgtcctga aggccttccc ggacagtcct ggcctcaaca gaaccgtggt 481 taacatctcg tccctctgtg ccctgcaacc tttcaaaggc tgggcgctgt actgtgcagg 541 aaaggctgct cgtgatatgc tgttccaggt cctggcgctg gaggaaccta atgtgagggt 601 gctgaactat gccccaggtc ctctggacac agacatgcag cagttggccc gggagacctc 661 cgtggaccca gacatgcgaa aagggctgca ggagctgaag gcaaagggga agctggtgga 721 ttgcaaggtg tcagcccaga aactgctgag cttactggaa aaggacgagt tcaagtctgg 781 agcccacgtg gacttctatg acaaataagc ccatgttttt ggcttcctga acc // LOCUS HUMSER5R 2733 bp mRNA PRI 09-JAN-1995 DEFINITION Human serotonin 5-HT1C receptor mRNA, complete cds. ACCESSION M81778 NID g338027 KEYWORDS serotonin receptor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2733) AUTHORS Saltzman,A.G., Morse,B., Whitman,M.M., Ivanshchenko,Y., Jaye,M. and Felder,S. TITLE Cloning of the human serotonin 5-HT2 and 5-HT1C receptor subtypes JOURNAL Biochem. Biophys. Res. Commun. 181 (3), 1469-1478 (1991) MEDLINE 92109767 FEATURES Location/Qualifiers source 1..2733 /organism="Homo sapiens" /db_xref="taxon:9606" 5'UTR 1..787 /gene="HTR1C" gene 1..2733 /gene="HTR1C" CDS 788..2164 /gene="HTR1C" /codon_start=1 /product="serotonin receptor" /db_xref="PID:g338028" /translation="MVNLRNAVHSFLVHLIGLLVWQCDISVSPVAAIVTDIFNTSDGG RFKFPDGVQNWPALSIVIIIIMTIGGNILVIMAVSMEKKLHNATNYFLMSLAIADMLV GLLVMPLSLLAILYDYVWPLPRYLCPVWISLDVLFSTASIMHLCAISLDRYVAIRNPI EHSRFNSRTKAIMKIAIVWAISIGVSVPIPVIGLRDEEKVFVNNTTCVLNDPNFVLIG SFVAFFIPLTIMVITYCLTIYVLRRQALMLLHGHTEEPPGLSLDFLKCCKRNTAEEEN SANPNQDQNARRRKKKERRPRGTMQAINNERKASKVLGIVFFVFLIMWCPFFITNILS VLCEKSCNQKLMEKLLNVFVWIGYVCSGINPLVYTLFNKIYRRAFSNYLRCNYKVEKK PPVRQIPRVAATALSGRELNVNIYRHTNEPVIEKASDNEPGIEMQVENLELPVNPSSV VSERISSV" 3'UTR 2162..2733 /gene="HTR1C" BASE COUNT 691 a 618 c 629 g 795 t ORIGIN 1 gaattcggga gcgtcctcag atgcaccgat cttcccgata ctgcctttgg agcggctaga 61 ttgctagcct tggctgctcc attggcctgc cttgcccctt acctgccgat tgcatatgaa 121 ctcttcttct gtctgtacat cgttgtcgtc ggagtcgtcg cgatcgtcgt ggcgctcgtg 181 tgatggcctt cgtccgttta gagtagtgta gttagttagg ggccaacgaa gaagaaagaa 241 gacgcgatta gtgcagagat gctggaggtg gtcagttact aagctagagt aagatagcgg 301 agcgaaaaga gccaaaccta gccggggggc gcacggtcac ccaaaggagg tcgactcgcc 361 ggcgcttcct atcgcgccga gctccctcca ttcctctccc tccgccgagg cgcgaggttg 421 cggcgcgcag cgcagcgcag ctcagcgcac cgactgccgc gggctccgct gggcgattgc 481 agccgagtcc gtttctcgtc tagctgccgc cgcggcgacc gctgcctggt cttcctcccg 541 gacgctagtg ggttatcagc taacacccgc gagcatctat aacataggcc aactgacgcc 601 atccttcaaa aacaactgtc tgggaaaaaa agaataaaaa gtagtgtgag agcagaaaac 661 gtgattgaaa cacgaccaat ctttcttcag tgccaaaggg tggaaaagaa aggatgatat 721 gatgaaccta gcctgttaat ttcgtcttct caattttaaa ctttggttgc ttaagactga 781 agcaatcatg gtgaacctga ggaatgcggt gcattcattc cttgtgcacc taattggcct 841 attggtttgg caatgtgata tttctgtgag cccagtagca gctatagtaa ctgacatttt 901 caatacctcc gatggtggac gcttcaaatt cccagacggg gtacaaaact ggccagcact 961 ttcaatcgtc atcataataa tcatgacaat aggtggcaac atccttgtga tcatggcagt 1021 aagcatggaa aagaaactgc acaatgccac caattacttc ttaatgtccc tagccattgc 1081 tgatatgcta gtgggactac ttgtcatgcc cctgtctctc ctggcaatcc tttatgatta 1141 tgtctggcca ctacctagat atttgtgccc cgtctggatt tctttagatg ttttattttc 1201 aacagcgtcc atcatgcacc tctgcgctat atcgctggat cggtatgtag caatacgtaa 1261 tcctattgag catagccgtt tcaattcgcg gactaaggcc atcatgaaga ttgctattgt 1321 ttgggcaatt tctataggtg tatcagttcc tatccctgtg attggactga gggacgaaga 1381 aaaggtgttc gtgaacaaca cgacgtgcgt gctcaacgac ccaaatttcg ttcttattgg 1441 gtccttcgta gctttcttca taccgctgac gattatggtg attacgtatt gcctgaccat 1501 ctacgttctg cgccgacaag ctttgatgtt actgcacggc cacaccgagg aaccgcctgg 1561 actaagtctg gatttcctga agtgctgcaa gaggaatacg gccgaggaag agaactctgc 1621 aaaccctaac caagaccaga acgcacgccg aagaaagaag aaggagagac gtcctagggg 1681 caccatgcag gctatcaaca atgaaagaaa agcttcgaaa gtccttggga ttgttttctt 1741 tgtgtttctg atcatgtggt gcccattttt cattaccaat attctgtctg ttctttgtga 1801 gaagtcctgt aaccaaaagc tcatggaaaa gcttctgaat gtgtttgttt ggattggcta 1861 tgtttgttca ggaatcaatc ctctggtgta tactctgttc aacaaaattt accgaagggc 1921 attctccaac tatttgcgtt gcaattataa ggtagagaaa aagcctcctg tcaggcagat 1981 tccaagagtt gccgccactg ctttgtctgg gagggagctt aatgttaaca tttatcggca 2041 taccaatgaa ccggtgatcg agaaagccag tgacaatgag cccggtatag agatgcaagt 2101 tgagaattta gagttaccag taaatccctc cagtgtggtt agcgaaagga ttagcagtgt 2161 gtgagaaaga acagcacagt cttttctacg gtacaagcta catatgtagg aaaattttct 2221 tctttaattt ttctgttggt cttaactaat gtaaatattg ctgtctgaaa aagtgttttt 2281 acatatagct ttgcaacctt gtactttaca atcatgccta cattagtgag atttagggtt 2341 ctatatttac tgtttataat aggtggagac taacttattt tgattgtttg atgaataaaa 2401 tgtttatttt tgctctccct cccttctttc cttccttttt tcctttcttc cttcctttct 2461 ctctttcttt tgtgcatatg gcaacgttca tgttcatctc aggtggcatt tgcaggtgac 2521 cagaatgagg cacatgacag tggttatatt tcaaccacac ctaaattaac aaattcagtg 2581 gacatttgtt ctgggttaac agtaaatata cactttacat tcttgctctg ctcatctaca 2641 catataaaca cagtaagata ggttctgctt tctgatacat ctgtcagtga gtcagaggca 2701 gaacctagtc ttgttgttca tataggggaa ttc // LOCUS HUMSERDHY 1393 bp mRNA PRI 15-MAR-1990 DEFINITION Human serine dehydratase mRNA, complete cds. ACCESSION J05037 NID g338029 KEYWORDS serine dehydratase. SOURCE Human liver, cDNA to mRNA, (library of Clontech). ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1393) AUTHORS Ogawa,H., Gomi,T., Konishi,K., Date,T., Nakashima,H., Nose,K., Matsuda,Y., Peraino,C., Pitot,H.C. and Fujioka,M. TITLE Human liver serine dehydratase: cDNA cloning and sequence homology with hydroxyamino acid dehydratases from other sources JOURNAL J. Biol. Chem. 264, 15818-15823 (1989) MEDLINE 89380167 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by H.Ogawa, 13-JUL-1989. FEATURES Location/Qualifiers source 1..1393 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 90..1076 /note="serine dehydratase (EC 4.2.1.13)" /codon_start=1 /db_xref="PID:g338030" /translation="MMSGEPLHVKTPIRDSMALSKMAGTSVYLKMDSAQPSGSFKIRG IGHFCKRWAKQGCAHFVCSSAGNAGMAAAYAARQLGVPATIVVPGTTPALTIERLKNE GATCKVVGELLDEAFELAKALAKNNPGWVYIPPFDDPLIWEGHASIVKELKETLWEKP GAIALSVGGGGLLCGVVQGLQECGWGDVPVIAMETFGAHSFHAATTAGKLVSLPKITS VAKALGVKTVGSQALKLFQEHPIFSEVISDQEAVAAIEKFVDDEKILVEPAWGAALAA VYSHVIQKLQLEGNLRTPLPSLVVIVCGGSNISLAQLRALKEQLGMTNRLPK" BASE COUNT 275 a 415 c 419 g 284 t ORIGIN 1 ccttctcttc gtgggctatc tactcagttg atccctccct cgctggcttg gctctgactc 61 ctgctcagac ccatcacctt tgccggggaa tgatgtctgg agaacccctg cacgtgaaga 121 cccccatccg tgacagcatg gccctgtcca aaatggccgg caccagcgtc tacctcaaga 181 tggacagtgc ccagccctcc ggctccttca agatccgggg cattgggcac ttctgcaaga 241 ggtgggccaa gcaaggctgt gcacattttg tctgctcctc ggcgggcaac gcaggcatgg 301 cggctgcata tgcggccagg caactcggcg tccccgccac catcgtagtg cccggcacca 361 cacctgctct caccattgag cgcctcaaga atgaaggtgc cacatgcaag gtggtgggtg 421 agttattgga tgaagccttc gagctggcca aggccctagc gaagaacaac ccgggttggg 481 tctacattcc cccctttgat gaccccctca tctgggaagg ccacgcttcc atcgtgaaag 541 agctgaagga gacactgtgg gaaaagccgg gggccatcgc gctgtcagtg ggcggcgggg 601 gcctgctgtg tggagtggtc caggggctgc aggagtgtgg ctggggggac gtgcctgtca 661 tcgccatgga gacttttggt gcccacagct tccacgctgc caccaccgca ggcaaacttg 721 tctccctgcc caagatcacc agtgttgcca aggccctggg cgtgaagact gtggggtctc 781 aggccctgaa gctgtttcag gaacacccca ttttctctga agttatctcg gaccaggagg 841 ctgtggccgc cattgagaag ttcgtggatg atgagaagat cctggtggag cccgcctggg 901 gcgcagccct ggccgctgtc tatagccacg tgatccagaa gctccaactg gaggggaatc 961 tccgaacccc gctgccatcc ctcgtggtca tcgtctgcgg gggcagcaac atcagcctgg 1021 cccagctgcg ggcgctcaag gaacagctgg gcatgacaaa taggttgccc aagtgaggac 1081 ggacccctta ccgatctgtg ctctcctagc ccaagagacc cctggagggg ctggagttta 1141 tccagcgcct cgtcgtatgt ttggctgagc acctgtggcc ctgggtgcag gttaacttct 1201 tgttatcagg agcccactat gcagaggcca aaggtcggca gccagcgagg ctatgaattg 1261 gacctttttg gtatctgtgt gactgctctg tgcccatcct tagccaactt gctggcgtga 1321 caagtgccca caagtaacac accaggtacc cagagcaggg tggacaggag agacctgaat 1381 cacagcagtg agg // LOCUS HUMSERPRO 2030 bp mRNA PRI 23-JUL-1992 DEFINITION Human serum constituent protein (MSE55) mRNA, complete cds. ACCESSION M88338 NID g338032 KEYWORDS serum protein. SOURCE Homo sapiens endothelial cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2030) AUTHORS Bahou,W.F., Campbell,A.D. and Wicha,M.S. TITLE cDNA cloning and molecular characterization of MSE55, a novel human serum constituent protein that displays bone marrow stromal/endothelial cell-specific expression JOURNAL J. Biol. Chem. 267, 13986-13992 (1992) MEDLINE 92332498 FEATURES Location/Qualifiers source 1..2030 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /tissue_type="endothelial" CDS 349..1524 /standard_name="MSE55" /note="putative" /codon_start=1 /number=1 /function="unknown" /product="serum protein" /db_xref="PID:g338033" /translation="MPGPQGGRGAATMSLGKLSPVGWVSSSQGKRRLTADMISHPLGD FRHTMHVGRGGDVFGDTSFLSNHGGSSGSTHRSPRSFLAKKLQLVRRVGAPPRRMASP PAPSPAPPAISPIIKNAISLPQLNQAAYDSLVVGKLSFDSSPTSSTDGHSSYGLDSGF CTISRLPRSEKPHDRDRDGSFPSEPGLRRSDSLLSFRLDLDLGPSLLSELLGVMSLPE APAAETPAPAANPPAPTANPTGPAANPPATTANPPAPAANPSAPAATPTGPAANPPAP AASSTPHGHCPNGVTAGLGPVAEVKSSPVGGGPRGPAGPALGRHWGAGWDGGHHYPEM DARQERVEVLPQARASWESLDEEWRAPQAGSRTPVPSTVQANTFEFADAEEDDEVKV" variation 996..997 /note="replace(996..997 `c')" repeat_region 1006..1173 variation 1428..1429 /note="replace(1428..1429 `t')" BASE COUNT 370 a 760 c 590 g 310 t ORIGIN 1 ccgcgacggc cgccgccgtg cagacgacga gtccgccctc gtcccgcgcc cccggggctc 61 gcggagccag gtctccacct ctgggcagga gagttgccga ccacctcggg ggtgctttct 121 ctgcgcttga acatctatag ctgcttctga ggggctggga gccgggcccc tgggagagac 181 gagccatgaa ccccccacag cctctgcatt tggggacctc accttaggag agtgccattt 241 acagcttccg ccagggcaaa ggagctgagc agccatccca agcccagccc acctccctcc 301 cccggcccct ggtaggcatg gactagcagc tgtgagcagc cagagctgat gcccggcccc 361 caggggggca gaggcgccgc caccatgagc ctgggcaagc tctcgcctgt gggctgggtg 421 tccagttcac agggaaagag gcggctgact gcagacatga tcagccaccc actcggggac 481 ttccgccaca ccatgcatgt gggccgtggc ggggatgtct tcggggacac gtccttcctc 541 agcaaccacg gtggcagctc cgggagcacc catcgctcac cccgcagctt cctggccaag 601 aagctgcagc tggtgcggag ggtgggggcg cccccccgga ggatggcatc tccccctgca 661 ccctccccgg ctccaccggc catctccccc atcatcaaga acgccatctc cctgccccag 721 ctcaaccagg ccgcctacga cagcctcgtg gttggcaagc tcagcttcga cagcagcccc 781 accagctcca cggacggcca ctccagctac ggcctggact ctgggttctg caccatctcc 841 cgcctgcccc gctcggaaaa gccgcatgac cgagaccggg atggttcctt cccctctgag 901 cccgggcttc gccgctctga ctctctcttg tccttccgcc tggacctcga ccttgggccc 961 tcactcctca gcgagctgct aggggtcatg agcctgccag aagcccctgc agctgagact 1021 ccagcccccg ctgcaaaccc cccagcccct actgcaaacc ccacgggtcc tgctgcaaac 1081 cccccagcca ctactgcaaa ccccccagcg cctgctgcaa acccctcagc acctgccgca 1141 acccccacgg gtcctgctgc aaatccccca gcccctgccg caagctccac accccatgga 1201 cactgtccca atggggtaac agctgggttg ggcccagtgg ctgaggtgaa gtccagccca 1261 gtgggagggg gtccccgagg acctgctggc cctgccctcg gcaggcactg gggagcaggc 1321 tgggatggcg gccaccacta cccagagatg gatgcgcggc aggagcgggt ggaggtgctg 1381 ccccaagccc gggcctcctg ggagagcctg gacgaagagt ggagggcgcc ccaggcaggc 1441 agcaggaccc cagtgcccag cacagtgcaa gcaaacacct ttgaatttgc ggatgctgag 1501 gaggatgatg aggtcaaggt gtgaggggct ggggcacggt cccagggccc cacctaggtg 1561 cagagccggc ccctcaccta acagctggtt cctaccagac cggagagggg agaagtcatg 1621 ttgcccctaa acccctcccc acctctgcag gacagacatg ggagggagga cagggaaggc 1681 caggcttgct ctgggacttt tatgctccca gaggccctgc caaactgacc acctcccccg 1741 actgccactc tggacctaat agctgttcct taggccccac tccatgccac ccccaccagc 1801 tggaggaccc agcctcacag tgtgtccttt gtgccagacc aagcggcccg tggggggtgg 1861 ggggcaggga gtgtaccaca cagggccatt gtctcacctc ccaaagggac cgcctgcccc 1921 cagctcatcc cagagcgtcc ctgctgcaac cctgacagcc ggtactccca ggccggctta 1981 ccccaactac cccgccccag ccaccctcct taccccagca aaggtcaggt // LOCUS HUMSERPROT 969 bp mRNA PRI 26-AUG-1994 DEFINITION Human stratum corneum chymotryptic enzyme mRNA, complete cds. ACCESSION L33404 NID g521214 KEYWORDS serine proteinase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 969) AUTHORS Hansson,L., Stromqvist,M., Backman,A., Wallbrandt,P., Carlstein,A. and Egelrud,T. TITLE Cloning, expression, and characterization of stratum corneum chymotryptic enzyme. A skin-specific human serine proteinase JOURNAL J. Biol. Chem. 269 (30), 19420-19426 (1994) MEDLINE 94308225 FEATURES Location/Qualifiers source 1..969 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="keratinocyte" CDS 16..777 /note="SCCE; serine proteinase" /codon_start=1 /product="stratum corneum chymotryptic enzyme" /db_xref="PID:g532504" /translation="MARSLLLPLQILLLSLALETAGEEAQGDKIIDGAPCARGSHPWQ VALLSGNQLHCGGVLVNERWVLTAAHCKMNEYTVHLGSDTLGDRRAQRIKASKSFRHP GYSTQTHVNDLMLVKLNSQARLSSMVKKVRLPSRCEPPGTTCTVSGWGTTTSPDVTFP SDLMCVDVKLISPQDCTKVYKDLLENSMLCAGIPDSKKNACNGDSGGPLVCRGTLQGL VSWGTFPCGQPNDPGVYTQVCKFTKWINDTMKKHR" BASE COUNT 256 a 281 c 233 g 199 t ORIGIN 1 ggatttccgg gctccatggc aagatccctt ctcctgcccc tgcagatcct actgctatcc 61 ttagccttgg aaactgcagg agaagaagcc cagggtgaca agattattga tggcgcccca 121 tgtgcaagag gctcccaccc atggcaggtg gccctgctca gtggcaatca gctccactgc 181 ggaggcgtcc tggtcaatga gcgctgggtg ctcactgccg cccactgcaa gatgaatgag 241 tacaccgtgc acctgggcag tgatacgctg ggcgacagga gagctcagag gatcaaggcc 301 tcgaagtcat tccgccaccc cggctactcc acacagaccc atgttaatga cctcatgctc 361 gtgaagctca atagccaggc caggctgtca tccatggtga agaaagtcag gctgccctcc 421 cgctgcgaac cccctggaac cacctgtact gtctccggct ggggcactac cacgagccca 481 gatgtgacct ttccctctga cctcatgtgc gtggatgtca agctcatctc cccccaggac 541 tgcacgaagg tttacaagga cttactggaa aattccatgc tgtgcgctgg catccccgac 601 tccaagaaaa acgcctgcaa tggtgactca gggggaccgt tggtgtgcag aggtaccctg 661 caaggtctgg tgtcctgggg aactttccct tgcggccaac ccaatgaccc aggagtctac 721 actcaagtgt gcaagttcac caagtggata aatgacacca tgaaaaagca tcgctaacgc 781 cacactgagt taattaactg tgtgcttcca acagaaaatg cacaggagtg aggacgccga 841 tgacctatga agtcaaattt gactttacct ttcctcaaag atatatttaa acctcatgcc 901 ctgttgataa accaatcaaa ttggtaaaga cctaaaacca aaacaaataa agaaacacaa 961 aaccctcaa // LOCUS HUMSGBP 1283 bp mRNA PRI 20-JAN-1997 DEFINITION Human mRNA for small GTP-binding protein, S10, complete cds. ACCESSION D14889 NID g1785852 KEYWORDS GTP-binding protein; S10; small GTP-binding protein. SOURCE Homo sapiens T-lymphocyte, cell-line Jurkat, lambda gt10 library, cDNA to mRNA, clones S10-[32 and 4]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1283) AUTHORS Koda,T. and Kakinuma,M. TITLE Molecular cloning of a cDNA encoding a novel small GTP-binding protein JOURNAL FEBS Lett. 328 (1-2), 21-24 (1993) MEDLINE 93345690 REFERENCE 2 (bases 1 to 1283) AUTHORS Koda,T. TITLE Direct Submission JOURNAL Submitted (07-FEB-1993) to the DDBJ/EMBL/GenBank databases. Toshiaki Koda, Hokkaido University, Institute of Immunological Science; Kita-15,Nishi-7,Kita-ku, Sapporo, Hokkaido 060, Japan (E-mail:tkoda@med.hokudai.ac.jp, Tel:11-716-2111(ex.5521), Fax:11-707-6835) COMMENT Sequence updated (16-Jan-1997) by: Toshiaki Koda. FEATURES Location/Qualifiers source 1..1283 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Jurkat" /cell_type="T-lymphocyte" /clone_lib="lambda gt10" CDS 438..1151 /codon_start=1 /product="small GTP-binding protein, S10" /db_xref="PID:d1004118" /db_xref="PID:g1785853" /translation="MAQPILGHGSLQPASAAGLASLELDSSLDQYVQIRIFKIIVIGD SNVGKTCLTFRFCGGTFPDKTEATIGVDFREKTVEIEGEKIKVQVWDTAGQERFRKSM VEHYYRNVHAVVFVYDVTKMTSFTNLKMWIQECNGHAVPPLVPKVLVGNKCDLREQIQ VPSNLALKFADAHNMLLFETSAKDPKESQNVESIFMCLACRLKAQKSLLYRDAERQQG KVQKLEFPQEANSKTSCPC" variation 742 /note="replace(742,'c')" polyA_signal 1262..1267 polyA_site 1283 BASE COUNT 286 a 364 c 375 g 258 t ORIGIN 1 ccctcccctg cctgcattcc cgggacggac ccgagggaag aagcctcagg aggagggtgt 61 gggccgaggc gcggcggcgg ctggagcagc gcggtagggt ccttcgccag agcatccggt 121 ccgagggcgc acacaggcag aaggctcggg gctcgtccac tctcctccct ctctcctcct 181 ctccctggct ttgtgttggt gcctccgagc tgcaaggagg gtgcgctgga ggaggaggag 241 gggggcccgg agtgagaggc acccccttca cgcgcgcgcg cgcacacggt gccggcgcac 301 gcacacacgg gcggacacac acacacgcgc gcacacacac acgcacagag ctcgctcgcc 361 tcgagcgcac gaacgtggac gttctctttg tgtggagccc tcaagggggg ttggggcccc 421 ggttcggtcc gggggagatg gcgcagccca tcctgggcca tgggagcctg cagcccgcct 481 cggccgctgg cctggcgtcc ctggagctcg actcgtcgct ggaccagtac gtgcagattc 541 gcatcttcaa aataatcgtg attggggact ccaacgtggg caagacctgc ctgaccttcc 601 gcttctgcgg gggtaccttc ccagacaaga ctgaagccac catcggcgtg gacttcaggg 661 agaagaccgt ggaaatcgag ggcgagaaga tcaaggttca ggtgtgggac acagcaggtc 721 aggaacgttt ccgcaaaagc atggtcgagc attactaccg caacgtacat gccgtggtct 781 tcgtctatga cgtcaccaag atgacatctt tcaccaacct caaaatgtgg atccaagaat 841 gcaatgggca tgctgtgccc ccactagtcc ccaaagtgct tgtgggcaac aagtgtgact 901 tgagggaaca gatccaggtg ccctccaact tagccctgaa atttgctgat gcccacaaca 961 tgctcttgtt tgagacatcg gccaaggacc ccaaagagag ccagaacgtg gagtcgattt 1021 tcatgtgctt ggcttgccga ttgaaggccc agaaatccct gctgtatcgt gatgctgaga 1081 ggcagcaggg gaaggtgcag aaactggagt tcccacagga agctaacagt aaaacttcct 1141 gtccttgttg aaaccaaacg atataaatac aagataaatt atcactggag ttttttcttt 1201 cccttttttc tgtgcctgca taatgctgac acctgcttgt ttccatacaa attgatatca 1261 aaataaaatt tgtatagatt atc // LOCUS HUMSGII 2336 bp mRNA PRI 15-DEC-1989 DEFINITION Human secretogranin II gene, complete cds. ACCESSION M25756 NID g338050 KEYWORDS calcium-binding protein; chromogranin; secretogranin; tyrosine-sulfated secretory granule protein. SOURCE Human pheochromocytoma, cDNA to mRNA, clone hSgII-2/8. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2336) AUTHORS Gerdes,H.-H., Rosa,P., Phillips,E., Baeuerle,P.A., Frank,R., Argos,P. and Huttner,W.B. TITLE The primary structure of human secretogranin II, a widespread tyrosine-sulfated secretory granule protein that exhibits low ph- and calcium-induced aggregation JOURNAL J. Biol. Chem. 264, 12009-12015 (1989) MEDLINE 89308608 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by W.B.Huttner, 27-JUN-1989. FEATURES Location/Qualifiers source 1..2336 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 63..152 /note="secretogranin II signal pept (includes pot. propeptide); putative" CDS 63..1916 /note="secretogranin II" /codon_start=1 /db_xref="PID:g338051" /translation="MAEAKTHWLGAALSLIPLIFLISGAEAASFQRNQLLQKEPDLRL ENVQKFPSPEMIRALEYIENLRQQAHKEESSPDYNPYQGVSVPLQQKENGDESHLPER DSLSEEDWMRIILEALRQAENEPQSAPKENKPYALNSEKNFPMDMSDDYETQQWPERK LKHMQFPPMYEENSRDNPFKRTNEIVEEQYTPQSLATLESVFQELGKLTGPNNQKRER MDEEQKLYTDDEDDIYKANNIAYEDVVGGEDWNPVEEKIESQTQEEVRDSKENIGKNE QINDEMKRSGQLGIQEEDLRKESKDQLSDDVSKVIAYLKRLVNAAGSGRLQNGQNGER ATRLFEKPLDSQSIYQLIEISRNLQIPPEDLIEMLKTGEKPNGSVEPERELDLPVDLD DISEADLDHPDLFQNRMLSKSGYPKTPGRAGTEALPDGLSVEDILNLLGMESAANQKT SYFPNPYNQEKVLPRLPYGAGRSRSNQLPKAAWIPHVENRQMAYENLNDKDQELGEYL ARMLVKYPEIINSNQVKRVPGQGSSEDDLQEEEQIEQAIKEHLNQGSSQETDKLAPVS KRFPVGPPKNDDTPNRQYWDEDLLMKVLEYLNQEKAEKGREHIAKRAMENM" mat_peptide 153..1913 /note="secretogranin II" BASE COUNT 778 a 458 c 541 g 559 t ORIGIN 1 cggggaggaa tatgctgtgg agctcctctg ccatataaac aaaaagagga aatctttcaa 61 acatggctga agcaaagacc cactggcttg gagcagccct gtctcttatc cctttaattt 121 tcctcatctc tggggctgaa gcagcttcat ttcagagaaa ccagctgctt cagaaagaac 181 cagacctcag gttggaaaat gtccaaaagt ttcccagtcc tgaaatgatc agggctttgg 241 agtacataga aaacctccga caacaagctc ataaggaaga aagcagccca gattataatc 301 cctaccaagg tgtctctgtc ccccttcagc aaaaagaaaa tggcgatgaa agccacttgc 361 ccgagaggga ttcactgagt gaagaagact ggatgagaat aatactcgaa gctttgagac 421 aggctgaaaa tgagcctcag tctgcaccaa aagaaaataa gccctatgcc ttgaattcag 481 aaaagaactt tccaatggac atgagtgatg attatgagac acagcagtgg ccagaaagaa 541 agcttaagca catgcaattc cctcctatgt atgaagagaa ttccagggat aaccccttta 601 aacgcacaaa tgaaatagtg gaggaacaat atactcctca aagccttgct acattggaat 661 ctgtcttcca agagctgggg aaactgacag gaccaaacaa ccagaaacgt gagaggatgg 721 atgaggagca aaaactttat acggatgatg aagatgatat ctacaaggct aataacattg 781 cctatgaaga tgtggtcggg ggagaagact ggaacccagt agaggagaaa atagagagtc 841 aaacccagga agaggtgaga gacagcaaag agaatatagg aaaaaatgaa caaatcaacg 901 atgagatgaa acgctcaggg cagcttggca tccaggaaga agatcttcgg aaagagagta 961 aagaccaact ctcagatgat gtctccaaag taattgccta tttgaaaagg ttagtaaatg 1021 ctgcaggaag tgggaggtta cagaatgggc aaaatgggga aagggccacc aggctttttg 1081 agaaacctct tgattctcag tctatttatc agctgattga aatctcaagg aatttacaga 1141 tacccccaga agacttaatt gagatgctca aaactgggga gaagccgaat ggatcagtgg 1201 aaccggagcg ggagcttgac cttcctgttg acctagatga catctcagag gctgacttag 1261 accatccaga cctgttccaa aataggatgc tctccaagag tggctaccct aaaacacctg 1321 gtcgtgctgg gactgaggcc ctaccagacg ggctcagtgt tgaggatatt ttaaatcttt 1381 tagggatgga gagtgcagca aatcagaaaa cgtcgtattt tcccaatcca tataaccagg 1441 agaaagttct gccaaggctc ccttatggtg ctggaagatc tagatcgaac cagcttccca 1501 aagctgcctg gattccacat gttgaaaaca gacagatggc atatgaaaac ctgaacgaca 1561 aggatcaaga attaggtgag tacttggcca ggatgctagt taaataccct gagatcatta 1621 attcaaacca agtgaagcga gttcctggtc aaggctcatc tgaagatgac ctgcaggaag 1681 aggaacaaat tgagcaggcc atcaaagagc atttgaatca aggcagctct caggagactg 1741 acaagctggc cccggtgagc aaaaggttcc ctgtggggcc cccgaagaat gatgataccc 1801 caaataggca gtactgggat gaagatctgt taatgaaagt gctggaatac ctcaatcaag 1861 aaaaggcaga aaagggaagg gagcatattg ctaagagagc aatggaaaat atgtaagctg 1921 ctttcattaa ttaccctact ttcattcctc ccaccccaag caaatcccaa catttctctt 1981 cagtgtgttg acttctatcc tgttaacact gtaatatctt taaatgatgt acaggcagat 2041 gaaaccaggt cactggggag tctgcttcat ttcctctgag ctgttatctt gtgtatggat 2101 atgtgtaaat gttatgactc cttgataaaa aatttattat gtccattatt caagaaagat 2161 atctatgact gtgtttaata gtatatctaa tggctgtggc attgttgatg ctcacatatg 2221 ataaaaaagt gtcctataat tctattgaaa gtttttaata tttattgaat tattttgtta 2281 ctgtctgtag cgttttgtgg agtactggac caaaaaaata aagcattata aatata // LOCUS HUMSGLCT 2273 bp mRNA PRI 05-NOV-1992 DEFINITION Homo sapiens sodium/glucose cotransporter-like protein mRNA, complete cds. ACCESSION M95549 M95299 NID g338052 KEYWORDS sodium/glucose cotransporter-like protein. SOURCE Homo sapiens (library: lambda GT10) kidney cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2273) AUTHORS Wells,R.G, Pajor,A., Turk,E., Wright,E.M and Hediger,M.A. TITLE The cloning of a human kidney cDNA with similarity to the sodium/glucose cotransporter JOURNAL Am. J. Physiol. 263, 459-465 (1992) FEATURES Location/Qualifiers source 1..2273 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="kidney" /tissue_lib="lambda GT10" CDS 21..2039 /codon_start=1 /product="sodium/glucose cotransporter-like protein" /db_xref="PID:g338053" /translation="MEEHTEAGSAPEMGAQKALIDNPADILVIAAYFLLVIGVGLWSM CRTNRGTVGGYFLAGRSMVWWPVGASLFASNIGSGHFVGLAGTGAASGLAVAGFEWNA LFVVLLLGWLFAPVYLTAGVITMPQYLRKRFGGRRIRLYLSVLSLFLYIFTKISVDMF SGAVFIQQALGWNIYASVIALLGITMIYTVTGGLAALMYTDTVQTFVILGGACILMGY AFHEVGGYSGLFDKYLGAATSLTVSEDPAVGNISSFCYRPRPDSYHLLRHPVTGDLPW PALLLGLTIVSGWYWCSDQVIVQRCLAGKSLTHIKAGCILCGYLKLTPMFLMVMPGMI SRILYPDEVACVVPEVCRRVCGTEVGCSNIAYPRLVVKLMPNGLRGLMLAVMLAALMS SLASIFNSSSTLFTMDIYTRLRPRAGDRELLLVGRLWVVFIVVVSVAWLPVVQAAQGG QLFDYIQAVSSYLAPPVSAVFVLALFVPRVNEQGAFWGLIGGLLMGLARLIPEFSFGS GSCVQPSACPAFLCGVHYLYFAIVLFFCSGLLTLTVSLCTAPIPRKHLHRLVFSLRHS KEEREDLDADEQQGSSLPVQNGCPESAMEMNEPQAPAPSLFRQCLLWFCGMSRGGVGS PPPLTQEEAAAAARRLEDISEDPSWARVVNLNALLMMAVAVFLWGFYA" BASE COUNT 364 a 714 c 711 g 484 t ORIGIN 1 gggggcagat cctggggaga atggaggagc acacagaggc aggctcggca ccagagatgg 61 gggcccagaa ggccctgatt gacaatcctg ctgacatcct agtcattgct gcatatttcc 121 tgctggtcat tggcgttggc ttgtggtcca tgtgcagaac caacagaggc actgtgggcg 181 gctacttcct ggcaggacgc agcatggtgt ggtggccggt tggggcctct ctcttcgcca 241 gcaacatcgg cagtggccac tttgtgggcc tggcagggac tggcgctgca agtggcttgg 301 ctgttgctgg attcgagtgg aatgcgctct tcgtggtgct gctactgggc tggctgtttg 361 cacccgtgta cctgacagcg ggggtcatca cgatgccaca gtacctgcgc aagcgcttcg 421 gcggccgccg catccgcctc tacctgtctg tgctctccct tttcctgtac atcttcacca 481 agatctcagt ggacatgttc tccggagctg tattcatcca gcaggctctg ggctggaaca 541 tctatgcctc cgtcatcgcg cttctgggca tcaccatgat ttacacggtg acaggagggc 601 tggccgcgct gatgtacacg gacacggtac agaccttcgt cattctgggg ggcgcctgca 661 tcctcatggg ttacgccttc cacgaggtgg gcgggtattc gggtctcttc gacaaatacc 721 tgggagcagc gacttcgctg acggtgtccg aggatccagc cgtgggaaac atctccagct 781 tctgctatcg accccggccc gactcctacc acctgctccg gcaccccgtg accggggatc 841 tgccgtggcc cgcgctgctc ctcggactca caatcgtctc gggctggtac tggtgcagcg 901 accaggtcat cgtgcagcgc tgcctggccg ggaagagcct gacccacatc aaggcgggct 961 gcatcctgtg tgggtacctg aagctgacgc ccatgtttct catggtcatg ccaggcatga 1021 tcagccgcat tctgtaccca gacgaggtgg cgtgcgtggt gcctgaggtg tgcaggcgcg 1081 tgtgcggcac ggaggtgggc tgctccaaca tcgcctaccc gcggctcgtc gtgaagctca 1141 tgcccaacgg tctgcgcgga ctcatgctgg cggtcatgct ggccgcgctc atgtcctcgc 1201 tggcctccat cttcaacagc agcagcacgc tcttcaccat ggacatctac acgcgcctgc 1261 ggccacgcgc cggcgaccgc gagctgctgc tggtgggacg gctctgggtg gtgttcatcg 1321 tggtagtgtc ggtggcctgg cttcccgtgg tgcaggcggc acagggcggg cagctcttcg 1381 attacatcca ggcagtctct agctacctgg caccgcccgt gtccgccgtc ttcgtgctgg 1441 cgctcttcgt gccgcgcgtt aatgagcagg gcgccttctg gggactcatc gggggcctgc 1501 tgatgggcct ggcacgcctg attcccgagt tctccttcgg ctcgggcagc tgtgtgcagc 1561 cctcggcgtg cccagctttc ctctgcggcg tgcactacct ctacttcgcc attgtgctgt 1621 tcttctgctc tggcctcctc accctcacgg tctccctgtg caccgcgccc atccccagaa 1681 agcacctcca ccgcctggtc ttcagtctcc ggcatagcaa ggaggaacgg gaggacctgg 1741 atgctgatga gcagcaaggc tcctcactcc ctgtacagaa tgggtgccca gagagtgcca 1801 tggagatgaa tgagccccag gccccggcac caagcctctt ccgccagtgc ctgctctggt 1861 tttgtggaat gagcagaggt ggggtgggca gtcctccgcc ccttacccag gaggaggcag 1921 cggcagcagc caggcggctg gaggacatca gcgaggaccc gagctgggcc cgtgtggtca 1981 acctcaatgc cctgctcatg atggcagtgg ccgtgttcct ctggggcttc tatgcctaag 2041 accaactgcg ttggacacca taagccacag cctcacagga agtgggggtg aggagcctgc 2101 ggtgctcccc agaaaagggg aaggggcagt ggggtgagaa ggtcctggct ccccttctcc 2161 cggccttcct ctgcctgggg cccactgcat ctgattggca gtcacttccc atgagggcct 2221 ggcccacccg ctgcagttgc cctaaggaaa aataaagctg cctttcccct gta // LOCUS HUMSGLT1 2449 bp mRNA PRI 09-JAN-1995 DEFINITION Human Na+/glucose cotransporter 1 mRNA, complete cds. ACCESSION M24847 NID g338054 KEYWORDS Na+/glucose cotransporter. SOURCE Human ileum epithelium, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2449) AUTHORS Hediger,M.A., Turk,E. and Wright,E.M. TITLE Homology of the human intestinal Na+/glucose and Escherichia coli Na+/proline cotransporters JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86 (15), 5748-5752 (1989) MEDLINE 89345544 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.M.Wright, 16-MAY-1989. FEATURES Location/Qualifiers source 1..2449 /organism="Homo sapiens" /db_xref="taxon:9606" /map="22q13.1" mRNA <1..2449 /note="SGLT1 mRNA" gene 11..2005 /gene="SGLT1" CDS 11..2005 /gene="SGLT1" /note="Na+/glucose cotransporter" /codon_start=1 /db_xref="GDB:G00-120-375" /db_xref="PID:g338055" /translation="MDSSTWSPKTTAVTRPVETHELIRNAADISIIVIYFVVVMAVGL WAMFSTNRGTVGGFFLAGRSMVWWPIGASLFASNIGSGHFVGLAGTGAASGIAIGGFE WNALVLVVVLGWLFVPIYIKAGVVTMPEYLRKRFGGQRIQVYLSLLSLLLYIFTKISA DIFSGAIFINLALGLNLYLAIFLLLAITALYTITGGLAAVIYTDTLQTVIMLVGSLIL TGFAFHEVGGYDAFMEKYMKAIPTIVSDGNTTFQEKCYTPRADSFHIFRDPLTGDLPW PGFIFGMSILTLWYWCTDQVIVQRCLSAKNMSHVKGGCILCGYLKLMPMFIMVMPGMI SRILYTEKIACVVPSECEKYCGTKVGCTNIAYPTLVVELMPNGLRGLMLSVMLASLMS SLTSIFNSASTLFTMDIYAKVRKRASEKELMIAGRLFILVLIGISIAWVPIVQSAQSG QLFDYIQSITSYLGPPIAAVFLLAIFWKRVNEPGAFWGLILGLLIGISRMITEFAYGT GSCMEPSNCPTIICGVHYLYFAIILFAISFITIVVISLLTKPIPDVHLYRLCWSLRNS KEERIDLDAEEENIQEGPKETIEIETQVPEKKKGIFRRAYDLFCGLEQHGAPKMTEEE EKAMKMKMTDTSEKPLWRTVLNVNGIILVTVAVFCHAYFA" BASE COUNT 547 a 615 c 621 g 666 t ORIGIN Chromosome q11.2-qter. 1 cgctgccacc atggacagta gcacctggag ccccaagacc accgcggtca cccggcctgt 61 tgagacccac gagctcattc gcaatgcagc cgatatctcc atcatcgtta tctacttcgt 121 ggtagtgatg gccgtcggac tgtgggctat gttttccacc aatcgtggga ctgttggagg 181 cttcttcctg gcaggccgaa gtatggtgtg gtggccgatt ggagcctccc tctttgctag 241 taacattgga agtggccact ttgtggggct ggccgggact ggggcagctt caggcatcgc 301 cattggaggc tttgaatgga atgccctggt tttggtggtt gtgctgggct ggctgtttgt 361 ccccatctat attaaggctg gggtggtgac aatgccagag tacctgagga agcggtttgg 421 aggccagcgg atccaggtct acctttccct tctgtccctg ctgctctaca ttttcaccaa 481 gatctcggca gacatcttct cgggggccat attcatcaat ctggccttag gcctgaatct 541 gtatttagcc atctttctct tattggcaat cactgccctt tacacaatta cagggggcct 601 ggcggcggtg atttacacgg acaccttgca gacggtgatc atgctggtgg ggtctttaat 661 cctgactggg tttgcttttc acgaagtggg aggctatgac gccttcatgg aaaagtacat 721 gaaagccatt ccaaccatag tgtctgatgg caacaccacc tttcaggaaa aatgctacac 781 tccaagggcc gactccttcc acatcttccg agatcccctc acgggagacc tcccatggcc 841 tgggttcatc tttgggatgt ccatccttac cttgtggtac tggtgcacag atcaggtcat 901 tgtgcagcgc tgcctctcag ccaagaatat gtctcacgtg aagggtggct gcatcctgtg 961 tgggtatcta aagctgatgc ccatgttcat catggtgatg ccaggaatga tcagccgcat 1021 tctgtacaca gaaaaaattg cctgtgtcgt cccttcagaa tgtgagaaat attgcggtac 1081 caaggttggc tgtaccaaca tcgcctatcc aaccttagtg gtggagctca tgcccaatgg 1141 actgcgaggc ctgatgctat cagtcatgct ggcctccctc atgagctccc tgacctccat 1201 cttcaacagc gccagcaccc tcttcaccat ggacatctac gccaaggtcc gcaagagagc 1261 atctgagaaa gagctcatga ttgccggaag gttgtttatc ctggtgctga ttggcatcag 1321 catcgcctgg gtgcccattg tgcagtcagc acaaagtggg caactcttcg attacatcca 1381 gtccatcacc agttacttgg gaccacccat tgcggctgtc ttcctgcttg ctattttctg 1441 gaagagagtc aatgagccag gagccttttg gggactgatc ctaggacttc tgattgggat 1501 ttcacgtatg attactgagt ttgcttatgg aaccgggagc tgcatggagc ccagcaactg 1561 tcccacgatt atctgtgggg tgcactactt gtactttgcc attatcctct tcgccatttc 1621 tttcatcacc atcgtggtca tctccctcct caccaaaccc attccggatg tgcatctcta 1681 ccgtctgtgt tggagcctgc gcaacagcaa agaggagcgt attgacctgg atgcggaaga 1741 ggagaacatc caagaaggcc ctaaggagac cattgaaata gaaacacaag ttcctgagaa 1801 gaaaaaagga atcttcagga gagcctatga cctattttgt gggctagagc agcacggtgc 1861 acccaagatg actgaggaag aggagaaagc catgaagatg aagatgacgg acacctctga 1921 gaagcctttg tggaggacag tgttgaacgt caatggcatc atcctggtga ccgtggctgt 1981 cttttgccat gcatattttg cctgagtcct accttttgct gtagatttac catggctgga 2041 ctcttactca ccttccttta gtctcgtcct gtggtgttga agggaaatca gccagttgta 2101 aattttgccc aggtggataa atgtgtacat gtgtaattat aggctagctg gaagaaaacc 2161 attagtttgc tgttaattta tgcatttgaa gccagtgtga tacagccatc tgtacctact 2221 ggagccgcag aagggagtcc actcagtcac atccagaaaa aggcagacta agaatcagaa 2281 gccatgtgat tgatgtctga cgtgagtctg tctcaggtag attccgggtg tcagtgtggt 2341 ttataatcct tgaatattgt tttagaaact ttggtctccc tggttcctgc cacttttcct 2401 gtccgtcctc ctccccattt tttttttaaa agaaagctgt tttcccctc // LOCUS HUMSHH 1576 bp mRNA PRI 12-FEB-1995 DEFINITION Homo sapiens sonic hedgehog protein (SHH) mRNA, complete cds. ACCESSION L38518 NID g663156 KEYWORDS homologue; sonic hedgehog protein. SOURCE Homo sapiens (clone HHH5) (tissue library: Clontech) fetus lung cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1576) AUTHORS Marigo,V., Roberts,D.J., Lee,S.M.K., Tsukurov,O., Levi,T., Gastier,J.M., Epstein,D.J., Gilbert,D.J., Martin,G.G., Copeland,N.G., Seidman,C.E., Jenkins,N.A., Seidman,J.G., McMahon,A.P. and Tabin,C. TITLE Cloning, expression and chromosomal location of SHH and IHH, two human homologues of the Drosophila segment polarity gene Hedgehog JOURNAL Unpublished (1995) FEATURES Location/Qualifiers source 1..1576 /organism="Homo sapiens" /note="vector: lambda gt10" /db_xref="taxon:9606" /clone="HHH5" /dev_stage="fetus" /tissue_type="lung" /tissue_lib="Clontech" mRNA 1..1576 /partial /gene="SHH" gene 1..1576 /gene="SHH" CDS 152..1540 /gene="SHH" /note="homologue of the Drosophila segment polarity gene" /codon_start=1 /product="sonic hedgehog protein" /db_xref="PID:g663157" /translation="MLLLARCLLLVLVSSLLVCSGLACGPGRGFGKRRHPKKLTPLAY KQFIPNVAEKTLGASGRYEGKISRNSERFKELTPNYNPDIIFKDEENTGADRLMTQRC KDKLNALAISVMNQWPGVKLRVTEGWDEDGHHSEESLHYEGRAVDITTSDRDRSKYGM LARLAVEAGFDWVYYESKAHIHCSVKAENSVAAKSGGCFPGSATVHLEQGGTKLVKDL SPGDRVLAADDQGRLLYSDFLTFLDRDDGAKKVFYVIETREPRERLLLTAAHLLFVAP HNDSATGEPEASSGSGPPSGGALGPRALFASRVRPGQRVYVVAERDGDRRLLPAAVHS VTLSEEAAGAYAPLTAQGTILINRVLASCYAVIEEHSWAHRAFAPFRLAHALLAALAP ARTDRGGDSGGGDRGGGGGRVALTAPGAADAPGAGATAGIHWYSQLLYQIGTWLLDSE ALHPLGMAVKSS" BASE COUNT 287 a 508 c 567 g 214 t ORIGIN 1 gcgaggcagc cagcgaggga gagagcgagc gggcgagccg gagcgaggaa gggaaagcgc 61 aagagagagc gcacacgcac acacccgccg cgcgcactcg cgcccggacc cgcacgggga 121 cagctcggaa gtcatcagtt ccatgggcga gatgctgctg ctggcgagat gtctgctgct 181 agtcctcgtc tcctcgctgc tggtatgctc gggactggcg tgcggaccgg gcagggggtt 241 cgggaagagg aggcacccca aaaagctgac ccctttagcc tacaagcagt ttatccccaa 301 tgtggccgag aagaccctag gcgccagcgg aaggtatgaa gggaagatct ccagaaactc 361 cgagcgattt aaggaactca cccccaatta caaccccgac atcatattta aggatgaaga 421 aaacaccgga gcggacaggc tgatgactca gaggtgtaag gacaagttga acgctttggc 481 catctcggtg atgaaccagt ggccaggagt gaaactgcgg gtgaccgagg gctgggacga 541 agatggccac cactcagagg agtctctgca ctacgagggc cgcgcagtgg acatcaccac 601 gtctgaccgc gaccgcagca agtacggcat gctggcccgc ctggcggtgg aggccggctt 661 cgactgggtg tactacgagt ccaaggcaca tatccactgc tcggtgaaag cagagaactc 721 ggtggcggcc aaatcgggag gctgcttccc gggctcggcc acggtgcacc tggagcaggg 781 cggcaccaag ctggtgaagg acctgagccc cggggaccgc gtgctggcgg cggacgacca 841 gggccggctg ctctacagcg acttcctcac tttcctggac cgcgacgacg gcgccaagaa 901 ggtcttctac gtgatcgaga cgcgggagcc gcgcgagcgc ctgctgctca ccgccgcgca 961 cctgctcttt gtggcgccgc acaacgactc ggccaccggg gagcccgagg cgtcctcggg 1021 ctcggggccg ccttccgggg gcgcactggg gcctcgggcg ctgttcgcca gccgcgtgcg 1081 cccgggccag cgcgtgtacg tggtggccga gcgtgacggg gaccgccggc tcctgcccgc 1141 cgctgtgcac agcgtgaccc taagcgagga ggccgcgggc gcctacgcgc cgctcacggc 1201 ccagggcacc attctcatca accgggtgct ggcctcgtgc tacgcggtca tcgaggagca 1261 cagctgggcg caccgggcct tcgcgccctt ccgcctggcg cacgcgctcc tggctgcact 1321 ggcgcccgcg cgcacggacc gcggcgggga cagcggcggc ggggaccgcg ggggcggcgg 1381 cggcagagta gccctaaccg ctccaggtgc tgccgacgct ccgggtgcgg gggccaccgc 1441 gggcatccac tggtactcgc agctgctcta ccaaataggc acctggctcc tggacagcga 1501 ggccctgcac ccgctgggca tggcggtcaa gtccagctga agccgggggg ccgggggagg 1561 ggcgcgggag ggggcc // LOCUS HUMSHIIIC 1917 bp mRNA PRI 15-DEC-1994 DEFINITION Human K+ channel subunit gene, complete cds. ACCESSION M64676 NID g338076 KEYWORDS K+ channel protein; potassium channel protein. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1917) AUTHORS Vega-Saenz de Miera,E., Moreno,H., Fruhling,D., Kentros,C. and Rudy,B. TITLE Cloning of ShIII (Shaw-like) cDNAs encoding a novel high-voltage-activating, TEA-sensitive, type-A K+ channel JOURNAL Proc. R. Soc. Lond., B, Biol. Sci. 248 (1321), 9-18 (1992) MEDLINE 92396711 FEATURES Location/Qualifiers source 1..1917 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="Brain stem" mRNA <1..1917 /product="potassium channel protein" CDS 157..1905 /note="potassium channel subunit gene; TEA-sensitive" /codon_start=1 /product="potassium channel protein" /db_xref="PID:g338077" /translation="MISSVCVSSYRGRKSGNKPPSKTCLKEEMAKGEASEKIIINVGG TRHETYRSTLRTLPGTRLAWLADPDGGGRPETDGGGVGSSGTSGGGGCEFFFDRHPGV FAYVLNYYRTGKLHCPADVCGPLFEEELTFWGIDETDVEPCCWMTYRQHRDAEEALDI FESPDGGGSGAGPSDEAGDDERELALQRLGPHEGGAGHGAGSGGCRGWQPRMWALFED PYSSRAARVVAFASLFFILVSITTFCLETHEAFNIDRNVTEILRVGNITSVHFRREVE TEPILTYIEGVCVLWFTLEFLVRIVCCPDTLDFVKNLLNIIDFVAILPFYLEVGLSGL SSKAARDVLGFLRVVRIVRILRIFKLTRHFVGLRVLGHTLRASTNEFLLLIIFLALGV LIFATMIYYAERIGARPSDPRGNDHTDFKNIPIGFWWAVVTMTTLGYGDMYPKTWSGM LVGALCALAGVLTIAMPVPVIVNNFGMYYSLAMAKQKLPKKRKKHVPRPAQLESPMYC KSEETSPRDSTCSDTSPPAREEGMIERKRAGEIRGWEGKSLFPQWPREFPNGPQTLGF GMCFVWGFPKHKDVPL" BASE COUNT 340 a 612 c 610 g 355 t ORIGIN 1 gggaggtggt tggggcaagc ccaagccgca gagggggccg ccaccgcctc ctgcctcctc 61 ttcgtctcct ccccctcccc cgtctgacgc tgcctcctcg ggaagggtgt ttggagggca 121 gcggccgccc caagccggag acccgcagcg cttcttatga tcagctcggt gtgtgtctcc 181 tcctaccgcg ggcgcaagtc ggggaacaag cctccgtcca aaacatgtct gaaggaggag 241 atggccaagg gcgaggcgtc ggagaagatc atcatcaacg tgggcggcac gcgacatgag 301 acctaccgca gcaccctgcg caccctaccg ggaacccgcc tcgcctggct ggccgacccc 361 gacggcgggg gccggcccga gaccgatggc ggcggtgtgg gtagcagcgg cacgagcggc 421 ggcgggggct gcgagttctt cttcgacagg cacccgggcg tcttcgccta cgtgctcaac 481 tactaccgca ccggcaagct gcactgcccc gcggacgtgt gcgggccgct cttcgaagag 541 gagctcacct tctggggcat cgacgagacc gacgtggaac cctgctgctg gatgacctac 601 cggcagcacc gcgacgccga ggaggcgctc gacatcttcg agagcccgga cggaggcggc 661 agcggcgcgg ggcccagcga cgaggccggc gacgatgagc gggagctggc cctgcagcga 721 ctgggccccc acgagggagg cgcgggccat ggcgccgggt ctgggggctg ccgcggctgg 781 cagccccgca tgtgggcgct cttcgaggat ccctactcct cccgggccgc tagggtagtg 841 gcctttgcct ctctcttctt catcctggtc tccatcacca ctttctgcct ggagacccat 901 gaggccttta atatcgaccg caacgtgaca gagatcctcc gcgtagggaa catcaccagc 961 gtgcacttcc ggcgggaggt agagacagag cccatcctga cctacatcga gggcgtatgt 1021 gtgctgtggt tcacactgga gttcctggtg cgcatcgtgt gctgccccga cacgctggac 1081 ttcgtcaaga acctgctcaa catcatcgac tttgtggcca tcctgccctt ctacctggag 1141 gtgggactga gcggcctgtc atccaaggcg gcccgcgacg tgctgggctt cctgcgcgtg 1201 gtgcgcatcg tgcgcatcct gcgtatcttc aagctcacac gccacttcgt ggggctacgc 1261 gtgctgggcc acaccctgag ggccagcacc aatgagttcc tgctgcttat catcttcctg 1321 gccctgggtg tgctcatctt tgccaccatg atctactacg ctgagcgcat tggggccagg 1381 ccctccgacc ctcggggtaa tgaccacacc gacttcaaga acatccccat tggcttctgg 1441 tgggctgtgg tcaccatgac gacactgggc tacggagaca tgtaccccaa gacgtggtca 1501 ggcatgctgg taggggcact gtgtgcactg gctggcgtgc tcaccatcgc catgccggtg 1561 cctgtcatcg tcaacaactt cggcatgtac tactccctgg ccatggccaa gcagaagctg 1621 cccaagaaac ggaagaagca cgtgccacgg ccggcgcagc tggagtcacc catgtactgc 1681 aagtctgagg agacttcccc ccgggacagc acctgcagtg ataccagccc ccctgcccgg 1741 gaagagggta tgatcgagag gaaacgggca ggtgagatta ggggttggga aggaaaatcc 1801 cttttccccc agtggcctag ggagtttcca aatggacctc agaccttggg atttggcatg 1861 tgttttgtgt ggggcttccc taagcataaa gatgtgcctt tatgagggca aagtgtt // LOCUS HUMSHTR 1406 bp mRNA PRI 21-DEC-1993 DEFINITION Human serotonin 5-HT7 receptor mRNA, complete cds. ACCESSION L21195 NID g413865 KEYWORDS serotonin 5-HT7 receptor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1406) AUTHORS Bard,J.A., Zgombick,J., Adham,N., Vaysse,P., Branchek,T.A. and Weinshank,R.L. TITLE Cloning of a novel human serotonin receptor (5-HT7) positively linked to adenylate cyclase JOURNAL J. Biol. Chem. 268 (31), 23422-23426 (1993) MEDLINE 94043137 FEATURES Location/Qualifiers source 1..1406 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta and fetal brain" 5'UTR 1..27 CDS 28..1365 /standard_name="5-hydroxytryptamine (serotonin) receptor 7" /codon_start=1 /product="serotonin 5-HT7 receptor protein" /db_xref="PID:g413866" /translation="MMDVNSSGRPDLYGHLRSFLLPEVGRGLPDLSPDGGADPVAGSW APHLLSEVTASPAPTWDAPPDNASGCGEQINYGRVEKVVIGSILTLITLLTIAGNCLV VISVCFVKKLRQPSNYLIVSLALADLSVAVAVMPFVSVTDLIGGKWIFGHFFCNVFIA MDVMCCTASIMTLCVISIDRYLGITRPLTYPVRQNGKCMAKMILSVWLLSASITLPPL FGWAQNVNDDKVCLISQDFGYTIYSTAVAFYIPMSVMLFMYYQIYKAARKSAAKHKFP GFPRVEPDSVIALNGIVKLQKEVEECANLSRLLKHERKNISIFKREQKAATTLGIIVG AFTVCWLPFFLLSTARPFICGTSCSCIPLWVERTFLWLGYANSLINPFIYAFFNRDLR TTYRSLLQCQYRNINRKLSAAGMHEALKLAERPERPEFVLQNADYCRKKGHDS" 3'UTR 1366..1406 BASE COUNT 304 a 411 c 375 g 316 t ORIGIN 1 ccatgggcag cggcacacgg cggcgcgatg atggacgtta acagcagcgg ccgcccggac 61 ctctacgggc acctccgctc tttccttctg ccagaagtgg ggcgcgggct gcccgacttg 121 agccccgacg gtggcgccga cccggtcgcg ggctcctggg cgccgcacct gctgagcgag 181 gtgacagcca gcccggcgcc cacctgggac gcgcccccgg acaatgcctc cggctgtggg 241 gaacagatca actacggcag agtcgagaaa gttgtgatcg gctccatcct gacgctcatc 301 acgctgctga cgatcgcggg caactgcctg gtggtgatct ccgtgtgctt cgtcaagaag 361 ctccgccagc cctccaacta cctgatcgtg tccctggcgc tggccgacct ctcggtggct 421 gtggcggtca tgcccttcgt cagcgtcacc gacctcatcg ggggcaagtg gatctttgga 481 cactttttct gtaatgtctt catcgccatg gacgtcatgt gctgcacggc ctcgatcatg 541 accctgtgcg tgatcagcat tgacaggtac cttgggatca caaggcccct cacataccct 601 gtgaggcaga atgggaaatg catggcgaag atgattctct ccgtctggct tctctccgcc 661 tccatcacct tacctccact ctttggatgg gctcagaatg taaatgatga taaggtgtgc 721 ttgatcagcc aggactttgg ctatacgatt tactctaccg cagtggcatt ttatatcccc 781 atgtccgtca tgcttttcat gtactaccag atttacaagg ctgccaggaa gagtgctgcc 841 aaacacaagt ttcctggctt ccctcgagtg gagccagaca gcgtcatcgc cctgaatggc 901 atagtgaagc tccagaagga ggtggaagag tgtgcaaacc tttcgagact cctcaagcat 961 gaaaggaaaa acatctccat ctttaagcga gaacagaaag cagccaccac cctggggatc 1021 atcgtcgggg cctttaccgt gtgctggctg ccatttttcc tcctctcgac agccagaccc 1081 ttcatctgtg gcacttcctg cagctgcatc ccactgtggg tggagaggac atttctgtgg 1141 ctaggctatg caaactctct cattaaccct tttatatatg ccttcttcaa ccgggacctg 1201 aggaccacct atcgcagcct gctccagtgc cagtaccgga atatcaaccg gaagctctca 1261 gctgcaggca tgcatgaagc cctgaagctt gctgagaggc cagagagacc tgagtttgtg 1321 ctacaaaatg ctgactactg tagaaaaaaa ggtcatgatt catgattgaa agcagaacaa 1381 tggagaggaa ttcgatatca agctta // LOCUS HUMSIALO 1037 bp mRNA PRI 13-JAN-1995 DEFINITION Human sialoprotein mRNA, complete cds. ACCESSION J05213 NID g338083 KEYWORDS sialoprotein. SOURCE Human bone, cDNA to mRNA, clone B6. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1037) AUTHORS Fisher,L.W., McBride,O.W., Termine,J.D. and Young,M.F. TITLE Human bone sialoprotein. Deduced protein sequence and chromosomal localization JOURNAL J. Biol. Chem. 265 (4), 2347-2351 (1990) MEDLINE 90130496 FEATURES Location/Qualifiers source 1..1037 /organism="Homo sapiens" /db_xref="taxon:9606" /map="4q11-q21" sig_peptide 72..119 /gene="SPP1" /note="sialoprotein signal peptide" CDS 72..1025 /gene="SPP1" /note="sialoprotein precursor" /codon_start=1 /db_xref="GDB:G00-118-889" /db_xref="PID:g338084" /translation="MKTALILLSILGMACAFSMKNLHRRVKIEDSEENGVFKYRPRYY LYKHAYFYPHLKRFPVQGSSDSSEENGDDSSEEEEEEEETSNEGENNEESNEDEDSEA ENTTLSATTLGYGEDATPGTGYTGLAAIQLPKKAGDITNKATKEKESDEEEEEEEEGN ENEESEAEVDENEQGINGTSTNSTEAENGNGSSGGDNGEEGEEESVTGANAEGTTETG GQGKGTSKTTTSPNGGFEPTTPPQVYRTTSPPFGKTTTVEYEGEYEYTGVNEYDNGYE IYESENGEPRGDNYRAYEDEYSYFKGQGYDGYDGQNYYHHQ" gene 72..1025 /gene="SPP1" mat_peptide 120..1022 /gene="SPP1" /note="sialoprotein" BASE COUNT 365 a 218 c 269 g 185 t ORIGIN 1 ccttctctgc cctctcactc ccttgagcct gcttcctcac tccaggactg ccagaggaag 61 caatcaccaa aatgaagact gctttaattt tgctcagcat tttgggaatg gcctgtgctt 121 tctcaatgaa aaatttgcat cgaagagtca aaatagagga ttctgaagaa aatggggtct 181 ttaagtacag gccacgatat tatctttaca agcatgccta cttttatcct catttaaaac 241 gatttccagt tcagggcagt agtgactcat ccgaagaaaa tggagatgac agttcagaag 301 aggaggagga agaagaggag acttcaaatg aaggagaaaa caatgaagaa tcgaatgaag 361 atgaagactc tgaggctgag aataccacac tttctgctac aacactgggc tatggagagg 421 acgccacgcc tggcacaggg tatacagggt tagctgcaat ccagcttccc aagaaggctg 481 gggatataac aaacaaagct acaaaagaga aggaaagtga tgaagaagaa gaggaggaag 541 aggaaggaaa tgaaaacgaa gaaagcgaag cagaagtgga tgaaaacgaa caaggcataa 601 acggcaccag taccaacagc acagaggcag aaaacggcaa cggcagcagc ggaggagaca 661 atggagaaga aggggaagaa gaaagtgtca ctggagccaa tgcagaaggc accacagaga 721 ccggagggca gggcaagggc acctcgaaga caacaacctc tccaaatggt gggtttgaac 781 ctacaacccc accacaagtc tatagaacca cttccccacc ttttgggaaa accaccaccg 841 ttgaatacga gggggagtac gaatacacgg gcgtcaatga atacgacaat ggatatgaaa 901 tctatgaaag tgagaacggg gaacctcgtg gggacaatta ccgagcctat gaagatgagt 961 acagctactt taaaggacaa ggctacgatg gctatgatgg tcagaattac taccaccacc 1021 agtgaagctc cagcctg // LOCUS HUMSIAT 2513 bp mRNA PRI 12-JUL-1995 DEFINITION Homo sapiens beta-galactoside alpha-2,3-sialyltransferase (SIAT4A) mRNA, complete cds. ACCESSION L13972 NID g410225 KEYWORDS beta-galactoside alpha-2,3-sialyltransferase. SOURCE Human cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2513) AUTHORS Chang,M.L., Eddy,R.L., Shows,T.B. and Lau,J.T. TITLE Three genes that encode human beta-galactoside alpha 2,3-sialyltransferases. Structural analysis and chromosomal mapping studies JOURNAL Glycobiology 5 (3), 319-325 (1995) MEDLINE 95383839 REFERENCE 2 (bases 1 to 2513) AUTHORS Lau,J.T.Y. TITLE Direct Submission JOURNAL Submitted (04-NOV-1993) Joseph T.Y. Lau, Molecular and Cellular Biology, Roswell Park Cancer Institute, Buffalo, NY 14263, USA FEATURES Location/Qualifiers source 1..2513 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="submaxillary gland" gene 651..1673 /gene="SIAT4A" CDS 651..1673 /gene="SIAT4A" /EC_number="2.4.99.4" /note="putative" /codon_start=1 /product="beta-galactoside alpha-2,3-sialyltransferase" /db_xref="PID:g410226" /translation="MVTLRKRTLKVLTFLVLFIFLTSFFLNYSHTMVATTWFPKQMVL ELSENLKRLIKHRPCTCTHCIGQRKLSAWFDERFNQTMQPLLTAQNALLEDDTYRWWL RLQREKKPNNLNDTIKELFRVVPGNVDPMLEKRSVGCRRCAVVGNSGNLRESSYGPEI DSHDFVLRMNKAPTAGFEADVGTKTTHHLVYPESFRELGDNVSMILVPFKTIDLEWVV SAITTGTISHTYIPVPAKIRVKQDKILIYHPAFIKYVFDNWLQGHGRYPSTGILSVIF SMHVCDEVDLYGFGADSKGNWHHYWENNPSAGAFRKTGVHDADFESNVTATLASINKI RIFKGR" misc_feature 1065..1229 /gene="SIAT4A" /note="putative" BASE COUNT 596 a 691 c 683 g 543 t ORIGIN 1 gtcactcagc ctatggagat gaagaaactg aggttcagag aggttaagag actccactga 61 ggtcacacag ccgatgacag acaaccttct gtgccttcat caagctggtt gtgtacccac 121 catgtccctg gcgacaggat tgggaaagaa aaagccctaa ttaaggatcg tcagaaacca 181 cagttggagg aggacggcag agacagtttc cctccccgct ataccaacac ccttccttcg 241 aggtcctcgc tcctgaggga ccctggactg tcacagagat taatgacccc ttattttctt 301 tggatgtgaa aggaaatcac tggttaaagc ttgatcgaga gacattatca gctctttaag 361 gattgcagga gaataggcta ctttattttc tgaaaaggta aatatatgca agcaaagcca 421 acatgccacg aatggcgttg gtctaccaca cagccgtgtc tgggacacag ttgggggtca 481 tcccccagca ggagtgaagt cgagcttagc ggcccttgtg tcctcccttg gaattcctgc 541 catccctttt gattgagcct ccacctctgg gatttttctt ccatttttct cctctcttag 601 gagggagttc ctgctaccca tcgtgggagg ccaccatcag gactgcgaag atggtgaccc 661 tgcggaagag gaccctgaaa gtgctcacct tcctcgtgct cttcatcttc ctcacctcct 721 tcttcctgaa ctactcccac accatggtgg ccaccacctg gttccccaag cagatggtcc 781 tggagctctc cgagaacctg aagagactga tcaagcacag gccttgcacc tgcacccact 841 gcatcgggca gcgcaagctc tcggcctggt tcgatgagag gttcaaccag accatgcagc 901 cgctgctgac tgcccagaac gcgctcttgg aggacgacac ctaccgatgg tggctgaggc 961 tccagcggga gaagaagccc aataacttga atgacaccat caaggagctg ttcagagtgg 1021 tgcctgggaa tgtggaccct atgctggaga agaggtcggt gggctgccgg cgctgcgctg 1081 tggtgggcaa ctcgggcaac ctgagggagt cttcttatgg gcctgagata gacagtcacg 1141 actttgtcct caggatgaac aaggcgccca cggcagggtt tgaagctgat gttgggacca 1201 agaccaccca ccatctggtg taccctgaga gcttccggga gctgggagat aatgtcagca 1261 tgatcctggt gcccttcaag accatcgact tggagtgggt ggtgagcgcc atcaccacgg 1321 gcaccatttc ccacacctac atcccggttc ctgcaaagat cagagtgaaa caggataaga 1381 tcctgatcta ccacccagcc ttcatcaagt atgtctttga caactggctg caagggcacg 1441 ggcgataccc atctaccggc atcctctcgg tcatcttctc aatgcatgtc tgcgatgagg 1501 tggacttgta cggcttcggg gcagacagca aagggaactg gcaccactac tgggagaaca 1561 acccatccgc gggggctttt cgcaagacgg gggtgcacga tgcagacttt gagtctaacg 1621 tgacggccac cttggcctcc atcaataaaa tccggatctt caaggggaga tgacgcagtg 1681 aagggctgag gatggacgca ctgtcacacc tctgcatttc cagccccagc atcttgctgg 1741 agccgttcca tcccggagct tggaggggca gcctcaggtg tgtgcctggg caccgctcac 1801 agcctcttgc acccagccgt tgacagcatc tactcagcaa ggtcactaag ctctgccagc 1861 gtggcagagc atgtcttgga acctgtcttg agtgggacaa cgtcccccca ctgctgccct 1921 agagctgggg agacgctcag ggaaaggttc aacctccaca cactaaaatc atttgctcct 1981 gggcaagctt gggaatgaat gtggaagatc ctatattctg agagacagga cagtttccca 2041 ggaagatggg cagagacttg agtggcgatt acctccagca cagagacgtg ccaggcggtg 2101 ttggcgctcg gggcgagatg ctgcccttct tttgcacgaa gcctggcctc ttgcttggcg 2161 tgataaccct gtcatcttcc caaagctcat ttatgagcca ccagaggctc ctaccccaaa 2221 gatttcacag aaacttgagg ccaggtgccg tggctcacac ctgtaatctg aacactttgg 2281 gaggccgagg cgggaggatc acttgagccc aggagttcaa gaccagcctg ggcaacatag 2341 tgagactcct gtctctacaa aaataaaaga tttaaaaaaa ttagccaggc acggtggcac 2401 acacttgtag ccccagctac tagggaggct gaggagggag gatctcttgt gcctaggagt 2461 tcgaggctgc agtgggctgt gatcacacca ctgcactcca gcctgggaac aag // LOCUS HUMSIL 5198 bp mRNA PRI 13-JAN-1995 DEFINITION Human SIL mRNA, complete cds. ACCESSION M74558 NID g338087 KEYWORDS . SOURCE Homo sapiens Bone marrow cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5198) AUTHORS Aplan,P.D., Lombardi,D.P. and Kirsch,I.R. TITLE Structural characterization of SIL, a gene frequently disrupted in T-cell acute lymphoblastic leukemia JOURNAL Mol. Cell. Biol. 11 (11), 5462-5469 (1991) MEDLINE 92017825 FEATURES Location/Qualifiers source 1..5198 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /tissue_type="Bone marrow" gene 381..4244 /gene="SIL" CDS 381..4244 /gene="SIL" /codon_start=1 /db_xref="PID:g338088" /translation="MEPIYPFARPQMNTRFPSSRMVPFHFPPSKCALWNPTPTGDFIY LHLSYYRNPKLVVTEKTIRLAYRHANENKKNSSCFLLGSLTADEDEEGVTLTVDRFDP GREVPECLEITPTASLPGDFLIPCKVHTQELCSREMIVHSVDDFSSALKALQCHICSK DSLDCGKLLSLRVHITSRESLDSVEFDLHWAAVTLANNFKCTPVKPIPIIPTALARNL SSNLNISQVQGTYKYGYLTMDETRKLLLLLESDPKVYSLPLVGIWLSGITHIYSPQVW ACCLRYIFNSSVQERVFSESGNFIIVLYSMTHKEPEFYECFPCDGKIPDFRFQLLTSK ETLHLFKNVEPPDKNPIRCELSAESQNAETEFFSKASKNFSIKRSSQKLSSGKMPIHD HDSGVEDEDFSPRPIPSPHPVSQKISKIQPSVPELSLVLDGNFIESNPLPTPLEMVNN ENPPLINHLEHLKPLQPQLYDEKHSPEVEAGEPSLRGIPNQLNQDKPALLRHCKVRQP PAYKKGNPHTRNSIKPSSHNGPSHDIFEKLQTVSAGNVQNEEYPIRPSTLNSRQSSLA PQSQPHDFVFSPHNSGRPMELQIPTPPLPSYCSTNVCRCCQHHSHIQYSPLNSWQGAN TVGSIQDVQSEALQKHSLFHPSGCPALYCNAFCSSSSPIALRPQGDMGSCSPHSNIEP SPVARPPSHMDLCNPQPCTVCMHTPKTESDNGMMGLSPDAYRFLTEQDRQLRLLQAQI QRLLEAQSLMPCSPKTTAVEDTVQAGRQMELVSVEAQSSPGLHMRKGVSIAVSTGASL FWNAAGEDQEPDSQMKQDDTKISSEDMNFSVDINNEVTSLPGSASSLKAVDIPSFEES NIAVEEEFNQPLSVSNSSLVVRKEPDVPVFFPSGQLAESVSMCLQTGPTGGASNNSET SEEPKIEHVMQPLLHQPSDNQKIYQDLLGQVNHLLNSSSKETEQPSTKAVIISHECTR TQNVYHTKKKTHHSRLVDKDCVLNATLKQLRSLGVKIDSPTKVKKNAHNVDHASVLAC ISPEAVISGLNCMSFANVGMSGLSPNGVDLSMEANAIALKYLNENQLSQLSVTRSNQN NCDPFSLLHINTDRSTVGLSLISPNNMSFATKKYMKRYGLLQSSDNSEDEEEPPDNAD SKSEYLLNQNLRSIPEQLGGQKEPSKNDHEIINCSNCESVGTNADTPVLRNITNEVLQ TKAKQQLTEKPAFLVKNLKPSPAVNLRTGKAEFTQHPEKENEGDITIFPESLQPSETL KQMNSMNSVGTFLDVKRLRQLPKLF" BASE COUNT 1565 a 1141 c 1059 g 1433 t ORIGIN 1 ctcacttaaa cacgtagttc ccgcgacccc aacgtcccag aggcggggcc ggagtcggcg 61 gtggcgctcc ttggagccgg ctcccgctcc taccctgcaa acagacctca gctccgcgga 121 agttgcgaga cggggtttca ccatgttggt cgggctggtc tggaaatcct gacttcaggt 181 gatccacccg cctcggcctc ccaaaatgct gggattacaa gcgtgagcca ccgcccctga 241 catgagccat tgacttttaa agcaggagaa taatttggat cagatttata tggaaacact 301 cttctagcag cattatgggg acttttccat aagtctggat actgaggatt tggaattaaa 361 gaaatcattc accagacatc atggagccta tatatccttt tgcacggccc cagatgaata 421 ccaggtttcc ttcaagcagg atggtacctt tccactttcc tccatcaaaa tgtgcacttt 481 ggaacccaac gccaactgga gatttcatct acttacatct cagttactac agaaatccaa 541 agcttgtggt gactgagaag accatccgac ttgcttatcg tcatgctaac gagaataaaa 601 aaaattcgtc atgcttttta cttggttctc tgacagcaga cgaagatgaa gaaggtgtaa 661 cattgacagt agatcgcttt gatcctggtc gagaagtacc tgaatgccta gaaataaccc 721 ctactgcttc tcttcctggg gactttttga ttccatgcaa agttcatact caagaacttt 781 gttcaagaga aatgatagtt cacagtgtag atgacttcag ttcagcttta aaggctctac 841 agtgccatat atgtagcaaa gattccttgg actgtggtaa gctgctttcc ctaagagttc 901 atatcacttc cagggagagt ttggacagtg tggaatttga cttgcattgg gcagcagtaa 961 ctctagcaaa taactttaaa tgcacacctg tgaagcccat ccccattatt ccaacagctc 1021 tggcaagaaa cttgagcagt aatctgaata tttctcaagt tcaagggact tataaatatg 1081 gatatcttac catggatgaa acacgcaaat tgttactttt gttggaatct gatcccaagg 1141 tttattctct accattggtg ggaatttggc tgtctggaat tacacatatc tatagtcctc 1201 aggtatgggc ttgctgtttg cgatacatat tcaattcttc tgttcaagaa agggtttttt 1261 cagaatctgg aaatttcatc atagttctct attctatgac acataaggaa cctgagtttt 1321 atgaatgctt cccttgtgat ggcaagatac ctgactttcg gtttcagttg ctaaccagta 1381 aggaaacatt acatcttttc aaaaatgttg aacctcctga caaaaatcca atccgttgtg 1441 aactgagcgc tgaaagccaa aatgcagaaa cagagttttt cagtaaggct tccaagaatt 1501 tttcaattaa gaggtcttcc caaaagttat cttctgggaa gatgccaata catgatcacg 1561 actctggtgt tgaagatgaa gatttttctc caagaccaat tcctagtcct catccagtga 1621 gtcagaagat ttctaagatc caaccatcag ttcctgaact ttcacttgtg ttggatggca 1681 atttcataga atcaaaccct ctgcctactc cattggaaat ggtgaataat gaaaatcctc 1741 ctttgattaa ccacttggaa cacttgaagc cattgcaacc ccagctttat gatgagaaac 1801 acagtccaga agttgaagct ggagagcctt ccttgagagg aataccaaat cagttaaacc 1861 aggataaacc agctcttttg agacactgca aagtaagaca gccacctgcc tataagaaag 1921 ggaaccccca taccaggaac agtattaaac catcttctca taatgggcca tctcatgata 1981 tatttgaaaa gctccaaaca gtttctgctg gaaatgtaca aaacgaagag tatcctataa 2041 gaccctccac acttaattct aggcagtctt ctcttgcccc gcagtcccaa ccacacgatt 2101 ttgttttttc accccataat tcaggaagac caatggaact tcagatacct actcccccac 2161 tgccatctta ctgttccaca aacgtttgca ggtgttgtca gcatcatagt catattcaat 2221 atagtccgct aaattcttgg caaggagcaa acacagttgg atccattcaa gatgtccagt 2281 ctgaagccct tcaaaagcat tcattatttc acccaagtgg atgtccagcc ctgtactgta 2341 atgcattctg ttcttcaagt agtcctatag ccttgagacc tcagggagat atgggcagtt 2401 gttctcccca cagcaatatt gaaccatcgc ctgtggcaag accgccttca catatggact 2461 tatgtaaccc acagccttgc acagtgtgca tgcacacacc caagactgag tcagataatg 2521 gaatgatggg actatctcca gatgcatatc ggttcctcac agaacaagac agacagctaa 2581 gactacttca ggcacagatt cagcgtttgt tggaagcaca gtctctgatg ccctgttccc 2641 ctaagacaac tgctgttgaa gacacagtgc aagctggaag acaaatggag ttggtttctg 2701 tggaagcaca gtcttcccct ggcttgcaca tgagaaaagg tgtaagcatt gctgtgagca 2761 caggtgctag cttgttttgg aatgcagcag gtgaggatca agagcctgac tctcaaatga 2821 agcaagatga taccaaaatt tccagtgagg acatgaattt ttctgtcgat attaataatg 2881 aagtcacaag tcttccaggt agtgcatctt cattaaaagc agttgatatt cccagttttg 2941 aagagagcaa cattgctgtg gaagaagaat ttaaccagcc actttctgta tccaactctt 3001 ctctagttgt gagaaaagaa cctgatgtac ctgtgttctt tccaagtggc cagctggcag 3061 aaagtgtaag catgtgttta cagactggac caacaggggg tgccagtaac aattctgaaa 3121 catcagagga accaaaaatt gagcatgtaa tgcaaccctt gcttcatcaa ccatcagata 3181 accagaaaat ttaccaggat ttattgggtc aagtaaacca cctattaaat agttcctcca 3241 aggaaactga gcagccgtct accaaagcag taattatcag tcatgaatgc accagaaccc 3301 aaaacgttta ccatacaaag aaaaaaacac atcattcaag actggtggac aaagattgtg 3361 tccttaatgc aactcttaag caactaagaa gccttggagt aaaaattgat tctcccacta 3421 aagtgaagaa aaatgcacat aacgtggatc acgccagtgt gttggcatgc atcagcccag 3481 aagcagtgat ctctggatta aactgcatgt catttgctaa tgttggcatg agcggcttaa 3541 gccccaatgg tgtggatttg agcatggagg caaatgctat agctctgaaa tatttaaatg 3601 aaaatcagct gtcacaactg tctgtcactc gatcgaacca aaataattgt gacccattca 3661 gccttctcca tattaataca gacagaagca cagtggggct tagtttaatt tcaccaaaca 3721 acatgtcatt tgcaaccaaa aaatatatga agagatatgg actcctacaa agcagtgaca 3781 atagtgaaga tgaagaggaa cctcccgaca atgcagatag caagagtgaa tatttattga 3841 atcagaacct taggtccata cccgaacagc ttggtggtca gaaagagcct tctaagaatg 3901 accatgaaat aattaattgt tctaactgtg aatctgtggg gaccaacgca gatacgccag 3961 tattgagaaa tattacaaat gaagttttgc agacaaaagc aaaacagcag ttgactgaaa 4021 agccagcttt cttagtaaag aaccttaaac caagtcctgc agtgaacctt cgaaccggga 4081 aagcagagtt cactcaacat cctgagaaag aaaatgaagg ggacattaca atttttcctg 4141 aaagtttgca accttctgaa acgctaaagc agatgaatag catgaattca gtaggcacct 4201 tcttagatgt aaaacgtctc agacagttac caaaattatt ttaacctttt aactccctgc 4261 ccttttaata cagggacagg gtgtctcctg aagatactta gggaaaacag gagcctacca 4321 caaggctcct gatcattctg gagtcactgt ttcttggtag cagccaattg ggaagagtga 4381 cttctgtgag atggctggct ggtgatagga ctaagttctc attgttcaaa tagagctgtt 4441 caacatcact gaaaccttta agaaaagccc tgagatcagt tattcctaca agtttaagta 4501 gtagacagat actatccagc tctaagtctc aactgctctt ttatactgta cttttttttt 4561 tgagacggag ttttgctctt gtagcccagg ctggagtgca atggcaggat ctcagatcac 4621 tgcaacctct gcctcctggg ttcaagcgat tttcctgctt catcttccca ggtagctggg 4681 attacaggca tgtgccacaa cgcctggcta attttgtatt tttagtagag actggtttct 4741 ccatgttggt caggctggtc tcaaactccc gacctcaggt gatccgcccg cctcggcctc 4801 ctaaagtgct gggattacag gcgtgagcca ctgcccagct atactgtata tttaagaagg 4861 tccagcatgt tgcatctctg cattatccta tatcataaaa gaagcataag ttatcatggt 4921 gttgggtaaa ttagcgaaat caaccgcttc ctaagtttaa gggaaaagtt atttttaaaa 4981 acaacttaat aaaaacttac actcttatac aagagtgtat ttccccttaa ttaggatgca 5041 tgttgattaa actcgagata cagctttttg cagtatggtg ggttggtttt ggtgtaacat 5101 cttcaacatg tcacactggc tatcaaagaa taagaaaatt attgagtatg agtgtgtttt 5161 ataaactttc tgagtttttc agatgtctta atattttt // LOCUS HUMSKRN 2085 bp mRNA PRI 11-AUG-1993 DEFINITION Human novel serine kinase receptor mRNA, complete cds. ACCESSION L02911 NID g338218 KEYWORDS serine kinase receptor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2085) AUTHORS Matsuzaki,K. and McKeehan,W.L. TITLE Novel serine-kinase receptor type 1 JOURNAL J. Biol. Chem. 268, 12719-12723 (1993) MEDLINE 93286114 FEATURES Location/Qualifiers source 1..2085 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HepG2" /cell_type="hepatoma" CDS 341..1870 /codon_start=1 /product="serine kinase receptor" /db_xref="PID:g338219" /translation="MVDGVMILPVLIMIALPSPSMEDEKPKVNPKLYMCVCEGLSCGN EDHCEGQQCFSSLSINDGFHVYQKGCFQVYEQGKMTCKTPPSPGQAVECCQGDWCNRN ITAQLPTKGKSFPGTQNFHLEVGLIILSVVFAVCLLACLLGVALRKFKRRNQERLNPR DVEYGTIEGLITTNVGDSTLADLLDHSCTSGSGSGLPFLVQRTVARQITLLECVGKGR YGEVWRGSWQGENVAVKIFSSRDEKSWFRETELYNTVMLRHENILGFIASDMTSRHSS TQLWLITHYHEMGSLYDYLQLTTLDTVSCLRIVLSIASGLAHLHIEIFGTQGKPAIAH RDLKSKNILVKKNGQCCIADLGLAVMHSQSTNQLDVGNNPRVGTKRYMAPEVLDETIQ VDCFDSYKRVDIWAFGLVLWEVARRMVSNGIVEDYKPPFYDVVPNDPSFEDMRKVVCV DQQRPNIPNRWFSDPTLTSLAKLMKECWYQNPSARLTALRIKKTLTKIDNSLDKLKTD C" BASE COUNT 545 a 497 c 529 g 514 t ORIGIN 1 gaagcgaata gcgttttcag agatattggg cggctcaagg gtcttactct gtcgcccagt 61 ctgtaatgca gtgctgtgac catagcccac tgcagcctcc acctcccagg ctcaagcagt 121 ccttcccccc tcgccctcat gaatagctgg gactacagcc tggagcattg gtaagcgtca 181 cactgccaaa gtgagagctg ctggagaact cataatccca ggaacgcctc ttctactctc 241 cgagtacccc agtgaccaga gtgagagaag ctctgaacga gggcacgcgg cttgaaggac 301 tgtgggcaga tgtgaccaag agcctgcatt aagttgtaca atggtagatg gagtgatgat 361 tcttcctgtg cttatcatga ttgctctccc ctcccctagt atggaagatg agaagcccaa 421 ggtcaacccc aaactctaca tgtgtgtgtg tgaaggtctc tcctgcggta atgaggacca 481 ctgtgaaggc cagcagtgct tttcctcact gagcatcaac gatggcttcc acgtctacca 541 gaaaggctgc ttccaggttt atgagcaggg aaagatgacc tgtaagaccc cgccgtcccc 601 tggccaagcc gtggagtgct gccaagggga ctggtgtaac aggaacatca cggcccagct 661 gcccactaaa ggaaaatcct tccctggaac acagaatttc cacttggagg ttggcctcat 721 tattctctct gtagtgttcg cagtatgtct tttagcctgc ctgctgggag ttgctctccg 781 aaaatttaaa aggcgcaacc aagaacgcct caatccccga gacgtggagt atggcactat 841 cgaagggctc atcaccacca atgttggaga cagcacttta gcagatttat tggatcattc 901 gtgtacatca ggaagtggct ctggtcttcc ttttctggta caaagaacag tggctcgcca 961 gattacactg ttggagtgtg tcgggaaagg caggtatggt gaggtgtgga ggggcagctg 1021 gcaaggggaa aatgttgccg tgaagatctt ctcctcccgt gatgagaagt catggttcag 1081 ggaaacggaa ttgtacaaca ctgtgatgct gaggcatgaa aatatcttag gtttcattgc 1141 ttcagacatg acatcaagac actccagtac ccagctgtgg ttaattacac attatcatga 1201 aatgggatcg ttgtacgact atcttcagct tactactctg gatacagtta gctgccttcg 1261 aatagtgctg tccatagcta gtggtcttgc acatttgcac atagagatat ttgggaccca 1321 agggaaacca gccattgccc atcgagattt aaagagcaaa aatattctgg ttaagaagaa 1381 tggacagtgt tgcatagcag atttgggcct ggcagtcatg cattcccaga gcaccaatca 1441 gcttgatgtg gggaacaatc cccgtgtggg caccaagcgc tacatggccc ccgaagttct 1501 agatgaaacc atccaggtgg attgtttcga ttcttataaa agggtcgata tttgggcctt 1561 tggacttgtt ttgtgggaag tggccaggcg gatggtgagc aatggtatag tggaggatta 1621 caagccaccg ttctacgatg tggttcccaa tgacccaagt tttgaagata tgaggaaggt 1681 agtctgtgtg gatcaacaaa ggccaaacat acccaacaga tggttctcag acccgacatt 1741 aacctctctg gccaagctaa tgaaagaatg ctggtatcaa aatccatccg caagactcac 1801 agcactgcgt atcaaaaaga ctttgaccaa aattgataat tccctcgaca aattgaaaac 1861 tgactgttga cattttcata gtgtcaagaa ggaagatttg acgttgttgt cattgtccag 1921 ctgggaccta atgctggcct gactggttgt cagaatggaa tccatctgtc tccctcccca 1981 aatggctgct ttgacaaggc agacgtcgta cccagccatg tgttggggag acatcaaaac 2041 caccctaacc tcgctcgatg actgtgaact gggcatttca cgaac // LOCUS HUMSMCK 1597 bp mRNA PRI 13-JAN-1995 DEFINITION Human sarcomeric mitochondrial creatine kinase (MtCK) gene, complete cds. ACCESSION J05401 NID g338236 KEYWORDS sarcomeric mitochondrial creatine kinase. SOURCE Human heart, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1597) AUTHORS Haas,R.C. and Strauss,A.W. TITLE Separate nuclear genes encode sarcomere-specific and ubiquitous human mitochondrial creatine kinase isoenzymes JOURNAL J. Biol. Chem. 265 (12), 6921-6927 (1990) MEDLINE 90216724 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.W.Strauss, 27-JAN-1990, for release after publication. FEATURES Location/Qualifiers source 1..1597 /organism="Homo sapiens" /db_xref="taxon:9606" /map="15" sig_peptide 199..315 /gene="CKMT" /note="sarcomeric mitochondrial creatine kinase signal peptide" CDS 199..1458 /gene="CKMT" /note="sarcomeric mitochondrial creatine kinase precursor (EC 2.7.3.2)" /codon_start=1 /db_xref="GDB:G00-119-780" /db_xref="PID:g338237" /translation="MASIFSKLLTGRNASLLFATMGTSVLTTGYLLNRQKVCAEVREQ PRLFPPSADYPDLRKHNNCMAECLTPAIYSKLRNKVTPNGYTLDQCIQTGVDNPGHPF IKTVGMVAGDEESYEVFADLFDPVIKLRHNGYDPRVMKHTTDLDASKITQGQFDEHYV LSSRVRTGRSIRGLSLPPACTRAERREVENVAITALEGLKGDLAGRYYKLSEMTEQDQ QRLIDDHFLFDKPVSPLLTCAGMARDWPDARGIWHNYDKTFLIWINEEDHTRVISMEK GGNMKRVFERFCRGLKEVERLIQERGWEFMWNERLGYILTCPSNLGTGLRAGVHVRIP KLSKDPRFSKILENLRLQKRGTGGVDTAAVADVYDISNIDRIGRSEVELVQIVIDGVN YLVDCEKKLERGQDIKVPPPLPQFGKK" gene 199..1458 /gene="CKMT" mat_peptide 316..1455 /gene="CKMT" /note="sarcomeric mitochondrial creatine kinase" BASE COUNT 422 a 401 c 413 g 361 t ORIGIN 1 ggctccggct tcaagatcaa aggaaatgtt tccctttgtc ccgtttcaca ctaaacgggt 61 tggggaggaa ccaggggaga tgtcaaccgt ctgccggtga ctgggaagtt ttctgcaagt 121 cctccacagc atagccagca ggccactttt cactaacaga agtcacaagc caagtgagac 181 actcatccaa gaggaaggat ggccagtatc ttttctaagt tgctaactgg ccgcaatgct 241 tctctgctgt ttgctaccat gggcaccagt gtcctgacca ccgggtacct gctgaaccgg 301 cagaaagtgt gtgccgaggt ccgggagcag cctaggctat ttcctccaag cgcagactac 361 ccagacctgc gcaagcacaa caactgcatg gccgagtgcc tcacccccgc catttattcc 421 aagcttcgca acaaggtgac acccaacggc tacacgctgg accagtgcat ccagactgga 481 gtggacaacc ctggccaccc cttcataaag actgtgggca tggtggctgg tgacgaggag 541 tcctatgagg tgtttgctga cctttttgac cccgtcatca aactaagaca caacggctat 601 gaccccaggg tgatgaagca cacaacggat ctggatgcat caaagatcac ccaagggcag 661 ttcgacgagc attacgtgct gtcttctcgg gtgcgcactg gccgcagcat ccgtgggctg 721 agcctgcctc cagcctgcac ccgggccgag cgaagggagg tagagaacgt ggccatcact 781 gccctggagg gcctcaaggg ggacctggct ggccgctact acaagctgtc cgagatgacg 841 gagcaggacc agcagcggct catcgatgac cactttctgt ttgataagcc agtgtcccct 901 ttattaacat gtgctgggat ggcccgtgac tggccagatg ccaggggaat ctggcataat 961 tatgataaga catttctcat ctggataaat gaggaggatc acaccagggt aatctcaatg 1021 gaaaaaggag gcaatatgaa acgagtattt gagcgattct gtcgtggact aaaagaagta 1081 gaacggttaa tccaagaacg aggctgggag ttcatgtgga atgagcgcct aggatacatt 1141 ttgacctgtc cttcgaacct tggaacagga ctacgagctg gtgtccacgt taggatccca 1201 aagctcagca aggacccacg cttttctaag atcctggaaa acctaagact ccagaagcgt 1261 ggcacaggtg gtgtggacac tgccgcggtc gcagatgtgt acgacatttc caacatagat 1321 agaattggtc gatcagaggt tgagcttgtt cagatagtca tcgatggagt caattacctg 1381 gtggattgtg aaaagaagtt ggagagaggc caagatatta aggtgccacc ccctctgcct 1441 cagtttggca aaaagtaaac tttccctttc ccaatttata aataatctgt ctgctggtac 1501 aacagacata aatctctact ctgagagttt ttatacactt ggaaaaatat aaaattgtag 1561 atcctgccta tctttacaat aaaactctcc ttaatat // LOCUS HUMSMIT 2157 bp DNA PRI 16-MAR-1995 DEFINITION Homo sapiens Na+/myo-inositol cotransporter (SLC5A3) gene, complete cds. ACCESSION L38500 NID g662842 KEYWORDS Na+/myo-inositol cotransporter; osmoregulation. SOURCE Homo sapiens (clone hgSMIT) male placenta DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2157) AUTHORS Berry,G.T., Mallee,J.J., Kwon,H.M., Rim,J.S., Mulla,W.R., Muenke,M. and Spinner,N.B. TITLE The human osmoregulatory Na+/myo-inositol cotransporter gene (SLC5A3): molecular cloning and localization to chromosome 21 JOURNAL Genomics 25 (2), 507-513 (1995) MEDLINE 95309919 FEATURES Location/Qualifiers source 1..2157 /organism="Homo sapiens" /note="(vector lambda FIX II)" /db_xref="taxon:9606" /clone="hgSMIT" /germline /sex="male" /tissue_type="placenta" /map="chromosome 21" gene 1..2157 /gene="SLC5A3" CDS 1..2157 /gene="SLC5A3" /codon_start=1 /db_xref="GDB:G00-373-217" /product="Na+/myo-inositol cotransporter" /db_xref="PID:g662843" /translation="MRAVLDTADIAIVALYFILVMCIGFFAMWKCNRSTVSGYFLAGR SMTWVTIGASLFVSNIGSEHFIGLAGSGAASGFAVGAWEFNALLLLQLLGWVFIPIYI RSGVYTMPEYLSKRFGGHRIQVYFAALSLILYIFTKLSVDLYSGALFIQESLGWNLYV SVILLIGMTALLTVTGGLVAVIYTDTLQALLMIIGALTLMIISIMEIGGFEEVKRRYM LASPDVTSILLTYNLSNTNSCNVSPKKEALKMLRNPTDEDVPWPGFILGQTPASVWYW CADQVIVQRVLAAKNIAHAKGSTLMAGFLKLLPMFIIVVPGMISRILFTDDIACINPE HCMLVCGSRAGCSNIAYPRLVMKLVPVGLRGLMMAVMIAALMSDLDSIFNSASTIFTL DVYKLIRKSASSRELMIVGRIFVAFMVVISIAWVPIIVEMQGGQMYLYIQEVADYLTP PVAALFLLAIFWKRCNEQGAFYGGMAGFVLGAVRLILAFAYRAPECDQPDNRPGFIKD IHYMYVATGLFWVTGLITVIVSLLTPPPTKEQIRTTTFWSKKNLVVKENCSPKEEPYQ MQEKSILRCSENNETINHIIPNGKSEDSIKGLQPEDVNLLVTCREEGNPVASLGHSEA ETPVDAYSNGQAALMGEKERKKETDDGGRYWKFIDWFCGFKSKSLSKRSLRDLMEEEA VCLQMLEETRQVKVILNIGLFAVCSLGIFMFVYFSL" BASE COUNT 537 a 460 c 550 g 610 t ORIGIN 1 atgagagctg tactggacac agcagacatt gccatagtgg ccctgtattt tatcctggtc 61 atgtgcattg gtttttttgc catgtggaaa tgtaatagaa gcaccgtgag tggatacttc 121 ctggcggggc gctctatgac ctgggtaaca attggtgcct ctctgtttgt gagcaatatt 181 gggagtgagc acttcattgg gctggcagga tctggagctg caagtggatt tgcagtgggc 241 gcatgggaat tcaatgcctt actgctttta caacttctgg gatgggtttt catcccaatt 301 tacatccggt caggggtata taccatgcct gaatacttgt ccaagcgatt tggtggccat 361 aggattcagg tctattttgc agccttgtct ctgattctct atattttcac caagctctcg 421 gtggatctgt attcgggtgc cctttttatc caggagtctt tgggttggaa tctttatgtg 481 tctgtcatcc tgctcattgg catgactgct ttgctgactg tcaccggagg ccttgttgca 541 gtgatctaca cagacactct gcaggctctg ctcatgatca ttggggcact tacacttatg 601 attattagca taatggagat tggcgggttt gaggaagtta agagaaggta catgttggcc 661 tcacccgatg tcacttccat cttattgaca tacaaccttt ccaacacaaa ttcttgtaat 721 gtctccccta agaaagaagc cctgaaaatg ctgcggaatc caacagatga agatgttcct 781 tggcctggat tcattcttgg gcagacccca gcttcagtat ggtactggtg tgctgaccaa 841 gtcatcgtgc agagggtcct tgcagccaaa aacattgctc atgccaaagg ctctactctt 901 atggctggct tcttaaagct cctgccaatg tttatcatag ttgtcccagg aatgatttcc 961 aggatactgt ttactgatga tatagcttgc atcaacccag agcactgcat gctggtgtgt 1021 ggaagcagag ctggttgctc caatattgct tacccacgcc tggtgatgaa gctggttcct 1081 gtgggccttc ggggtttaat gatggcagtg atgattgcag ctctgatgag tgacttagac 1141 tctatcttta acagtgccag taccatattc accctcgatg tgtacaaact tatccgcaag 1201 agcgcaagct cccgggagtt aatgattgtg gggaggatat ttgtggcatt tatggtggtg 1261 atcagcatag catgggtgcc aatcatcgtg gagatgcaag gaggccagat gtacctttac 1321 attcaggagg tagcagatta cctgacaccc ccagtggcag ccttgttcct gctggcaatt 1381 ttctggaagc gctgcaatga acaaggggct ttctatggtg gaatggctgg ctttgttctt 1441 ggagcagtcc gtttgatact ggcctttgcc taccgtgccc cagaatgtga ccaacctgat 1501 aataggccgg gcttcatcaa agacatccat tatatgtatg tggccacagg attgttttgg 1561 gtcacgggac tcattactgt aattgtgagc cttctcacac cacctcccac aaaggaacag 1621 attcgaacca ccaccttttg gtctaagaag aacctggtgg tgaaggagaa ctgctcccca 1681 aaagaggaac cataccaaat gcaagaaaag agcattctga gatgcagtga gaataatgag 1741 accatcaacc acatcattcc caacgggaaa tctgaagaca gcattaaggg ccttcagcct 1801 gaagatgtta atctgttggt aacctgcaga gaggagggca acccagtggc atccttaggt 1861 cattcagagg cagaaacacc agttgacgct tactccaatg ggcaagcagc tctcatgggt 1921 gagaaagaga gaaagaaaga aacggatgat ggaggtcggt actggaagtt catagactgg 1981 ttttgtggct ttaaaagtaa gagcctcagc aagaggagtc tcagagacct gatggaagag 2041 gaggctgttt gtttacagat gctagaagag actcggcaag ttaaagtaat actaaatatt 2101 ggactttttg ctgtgtgttc acttggaatt ttcatgtttg tttatttctc cttatga // LOCUS HUMSMP30 1356 bp mRNA PRI 08-JAN-1997 DEFINITION Human mRNA for SMP-30 (senescence marker protein-30), complete cds. ACCESSION D31815 NID g1072311 KEYWORDS SMP-30; senescence marker protein-30. SOURCE Homo sapiens adult liver cDNA to mRNA, clone:pHSMP6. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1356) AUTHORS Fujita,T., Mandel,J.L., Shirasawa,T., Hino,O., Shirai,T. and Maruyama,N. TITLE Isolation of cDNA clone encoding human homologue of senescence marker protein-30 (SMP30) and its location on the X chromosome JOURNAL Biochim. Biophys. Acta 1263 (3), 249-252 (1995) MEDLINE 96004897 REFERENCE 2 (bases 1 to 1356) AUTHORS Shirasawa,T. TITLE Direct Submission JOURNAL Submitted (11-JUN-1994) to the DDBJ/EMBL/GenBank databases. Takuji Shirasawa, Tokyo Metropolitan Institute of Gerontology, Molecualr Pathology; 35-2 Sakae-cho, Itabashi-ku, Tokyo 173, JAPAN (Tel:813-3964-3241(ex.3034), Fax:813-3579-4776) FEATURES Location/Qualifiers source 1..1356 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="X" /clone="pHSMP6" /dev_stage="adult" /tissue_type="liver" 5'UTR 1..93 CDS 94..993 /codon_start=1 /product="SMP-30" /db_xref="PID:d1007173" /db_xref="PID:g1072312" /translation="MSSIKIECVLPENCRCGESPVWEEVSNSLLFVDIPAKKVCRWDS FTKQVQRVTMDAPVSSVALRQSGGYVATIGTKFCALNWKEQSAVVLATVDNDKKNNRF NDGKVDPAGRYFAGTMAEETAPAVLERHQGALYSLFPDHHVKKYFDQVDISNGLDWSL DHKIFYYIDSLSYSVDAFDYDLQTGQISNRRSVYKLEKEEQIPDGMCIDAEGKLWVAC YNGGRVIRLDPVTGKRLQTVKLPVDKTTSCCFGGKNYSEMYVTCARDGMDPEGLLRQP EAGGIFKITGLGVKGIAPYSYAG" 3'UTR 994..1356 BASE COUNT 373 a 274 c 343 g 366 t ORIGIN 1 acaaacacca aggagtggag gtcagagtgt cacttttttg ttttcttttt gaaagatcat 61 tcgagaaaca cgtcactgat ctcccctgcg accatgtctt ccattaagat tgagtgtgtt 121 ttgccagaga actgccggtg tggtgagtct ccagtatggg aggaagtgtc caactctctg 181 ctctttgtag acattcctgc aaaaaaggtt tgccggtggg attcattcac caagcaagta 241 cagcgagtga ccatggatgc cccagtcagc tccgtggctc ttcgccagtc gggaggctat 301 gttgccacca ttggaacaaa gttctgtgct ttgaactgga aagaacaatc agcagttgtc 361 ttggccacgg tggataacga caagaaaaac aatcgcttca atgatgggaa ggtggatccc 421 gccgggaggt actttgctgg caccatggct gaggaaacag ctccagcagt tcttgagcgg 481 caccaggggg ccctgtactc cctctttcct gatcaccacg tgaaaaagta ctttgaccag 541 gtggacattt ccaatggttt ggattggtcg ctagaccaca aaatcttcta ttacattgac 601 agcctgtcct actccgtgga tgcctttgac tatgacctgc agacaggaca gatctccaac 661 cgcagaagtg tttacaagct agaaaaggaa gaacaaatcc cagatggaat gtgtattgat 721 gctgagggga agctctgggt ggcctgttac aatggaggaa gagtgattcg tttagatcct 781 gtgacaggga aaagacttca aactgtgaag ttgcctgttg ataaaacaac ttcatgctgc 841 tttggaggga agaattactc tgaaatgtat gtgacctgcg cccgggatgg gatggacccc 901 gagggtcttt tgaggcaacc tgaagctggt ggaattttca agataactgg tctgggggtc 961 aaaggaattg ctccctactc ctatgcggga tgaggacagg tcttctttcc tgccagaggg 1021 agctctgaag acaactagag aattctgggc ctgaaatttc aatctagtta gaaagaaaaa 1081 tgaggcaatg attttattaa cagcgttaag ttttaattta caacttttaa aaggcagagc 1141 atttttaaca aggggtgaca ggtggttttg ataacacact tataaggctt tctgtaaaag 1201 gtactataga agggcgaaga atcgttcaac tgtcaatcag cctcttgatt ctttgtaaat 1261 tgccagggtg ggtgggtaca tatctcttct tgattctgca tttcatactt aactatatta 1321 aagcttcaag gaacaataaa tagtaacctg gtaatg // LOCUS HUMSNAP25A 923 bp mRNA PRI 25-MAR-1994 DEFINITION Human nerve-terminal protein (isoform SNAP25A) mRNA, complete cds. ACCESSION L19760 NID g307425 KEYWORDS presynaptic protein; exocytosis; synaptic transmission. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 923) AUTHORS Bark,I.C. and Wilson,M.C. TITLE Human cDNA clones encoding two different isoforms of the nerve terminal protein SNAP-25 JOURNAL Gene 139 (2), 291-292 (1994) MEDLINE 94156217 REFERENCE 2 (bases 1 to 923) AUTHORS Bark,I.C. TITLE Direct Submission JOURNAL Submitted (13-JUL-1993) I.C. Bark, Department of Neuropharmacology, The Scripps Research Institute, 10666 North Torrey Pines Road, La Jolla, CA 92037, USA FEATURES Location/Qualifiers source 1..923 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="17-18 week late gestation" /tissue_type="fetal brain" /tissue_lib="lambda-ZAP; Stratagene 936206" gene 89..709 /gene="SNAP" CDS 89..709 /gene="SNAP" /note="isoform 25a" /codon_start=1 /product="nerve terminal protein" /db_xref="PID:g307426" /translation="MAEDADMRNELEEMQRRADQLADESLESTRRMLQLVEESKDAGI RTLVMLDEQGEQLDRVEEGMNHINQDMKEAEKNLKDLGKCCGLFICPCNKLKSSDAYK KAWGNNQDGVVASQPARVVDEREQMAISGGFIRRVTNDARENEMDENLEQVSGIIGNL RHMALDMGNEIDTQNRQIDRIMEKADSNKTRIDEANQRATKMLGSG" BASE COUNT 260 a 223 c 237 g 203 t ORIGIN 1 aacacaaccc tcccgagaag cccaggtcca gagccaaacc cgtcactgac cccccagccc 61 aggcgcccag ccactcccca ccgctaccat ggccgaagac gcagacatgc gcaatgagct 121 ggaggagatg cagcgaaggg ctgaccagtt ggctgatgag tcgctggaaa gcacccgtcg 181 tatgctgcaa ctggttgaag agagtaaaga tgctggtatc aggactttgg ttatgttgga 241 tgaacaagga gaacaactcg atcgtgtcga agaaggcatg aaccatatca accaagacat 301 gaaggaggct gagaaaaatt taaaagattt agggaaatgc tgtggccttt tcatatgtcc 361 ttgtaacaag cttaaatcaa gtgatgctta caaaaaagcc tggggcaata atcaggatgg 421 agtggtggcc agccagcctg ctcgtgtagt ggacgaacgg gagcagatgg ccatcagtgg 481 cggcttcatc cgcagggtaa caaatgatgc ccgagaaaat gaaatggatg aaaacctaga 541 gcaggtgagc ggcatcatcg ggaacctccg tcacatggcc ctggatatgg gcaatgagat 601 cgatacacag aatcgccaga tcgacaggat catggagaag gctgattcca acaaaaccag 661 aattgatgag gccaaccaac gtgcaacaaa gatgctggga agtggttaag tgtgcccacc 721 cgtgttctcc tccaaatgct gtcgggcaag atagctcctt catgcttttc tcatggtatt 781 atctagtagg tctgcacaca taacacacat cagtccaccc ccattgtgaa tgttgtcctg 841 tgtcatctgt cagcttccca acaatacttt gtgtcttttg ttctctcttg gtctctttct 901 ttccaaaggt tgtacatagt ggt // LOCUS HUMSNEXIN 1774 bp mRNA PRI 19-APR-1991 DEFINITION Human synexin mRNA, complete cds. ACCESSION J04543 NID g338243 KEYWORDS synexin. SOURCE Human liver, lung, and retina, cDNA to mRNA, clones L4A, L2B, and Lu10. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1774) AUTHORS Burns,A.L., Magendzo,K., Shirvan,A., Srivastava,M., Rojas,E.R., Alijani,M.R. and Pollard,H.B. TITLE Calcium channel activity of purified human synexin and structure of the human synexin gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 3798-3802 (1989) MEDLINE 89264510 REFERENCE 2 (sites) AUTHORS Magendzo,K., Shirvan,A., Cultraro,C., Srivastava,M., Pollard,H.B. and Burns,A.L. TITLE Alternative splicing of human Synexin mRNA in brain, cardiac, and Skeletal muscle alters the unique N-terminal domain JOURNAL J. Biol. Chem. 266, 3228-3232 (1991) MEDLINE 91131630 COMMENT Draft entry and computer readable copy of sequence [1] kindly submitted by A.L.Burns, 29-JUN-1989. FEATURES Location/Qualifiers source 1..1774 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 61..1461 /note="synexin" /codon_start=1 /db_xref="PID:g338244" /translation="MSYPGYPPTGYPPFPGYPPAGQESSFPPSGQYPYPSGFPPMGGG AYPQVPSSGYPGAGGYPAPGGYPAPGGYPGAPQPGGAPSYPGVPPGQGFGVPPGGAGF SGYPQPPSQSYGGGPAQVPLPGGFPGGQMPSQYPGGQPTYPSQPATVTQVTQGTIRPA ANFDAIRDAEILRKAMKGFGTDEQAIVDVVANRSNDQRQKIKAAFKTSYGKDLIKDLK SELSGNMEELILALFMPPTYYDAWSLRKAMQGAGTQERVLIEILCTRTNQEIREIVRC YQSEFGRDLEKDIRSDTSGHFERLLVSMCQGNRDENQSINHQMAQEDAQRLYQAGEGR LGTDESCFNMILATRSFPQLRATMEAYSRMANRDLLSSVSREFSGYVESGLKTILQCA LNRPAFFAERLYYAMKGAGTDDSTLVRIVVTRSEIDLVQIKQMFAQMYQKTLGTMIAG DTSGDYRRLLLAIVGQ" BASE COUNT 476 a 406 c 426 g 466 t ORIGIN Unreported. 1 gaacccggtc tcccgcaaga tggagccggg ttgggctgtg acgctgctgc tggggtcaga 61 atgtcatacc caggctatcc cccaacaggc tacccacctt tccctggata tcctcctgca 121 ggtcaggagt catcttttcc cccttctggt cagtatcctt atcctagtgg ctttcctcca 181 atgggaggag gtgcctaccc acaagtgcca agtagtggct acccaggagc tggaggctac 241 cctgcgcctg gaggttatcc agcccctgga ggctatcctg gtgccccaca gccaggggga 301 gctccatcct atcccggagt tcctccaggc caaggatttg gagtcccacc aggtggagca 361 ggcttttctg ggtatccaca gccaccttca cagtcttatg gaggtggtcc agcacaggtt 421 ccactacctg gtggctttcc tggaggacag atgccttctc agtatcctgg aggacaacct 481 acttacccta gtcagcctgc cacagtgact caggtcactc aaggaactat ccgaccagct 541 gccaacttcg atgctataag agatgcagaa attcttcgta aggcaatgaa gggttttggg 601 acagatgagc aggcaattgt ggatgtggtg gccaaccgtt ccaatgatca gaggcaaaaa 661 attaaagcag catttaagac ctcctatggc aaggatttaa tcaaagatct caaatcagag 721 ttaagtggaa atatggaaga actgatcctg gccctcttca tgcctcctac gtattacgat 781 gcctggagct tacggaaagc aatgcaggga gcaggaactc aggaacgtgt attgattgag 841 attttgtgca caagaacaaa tcaggaaatc cgagaaattg tcagatgtta tcagtcagaa 901 tttggacgag accttgaaaa ggacattagg tcagatacat caggacattt tgaacgttta 961 cttgtgtcca tgtgccaggg aaatcgtgat gagaaccaga gtataaacca ccaaatggct 1021 caggaagatg ctcagcgtct ctatcaagct ggtgagggga gactagggac cgatgaatct 1081 tgctttaaca tgatccttgc cacaagaagc tttcctcagc tgagagctac catggaggct 1141 tattctagga tggctaatcg agacttgtta agcagtgtga gccgtgagtt ttccggatat 1201 gtagaaagtg gtttgaagac catcttgcag tgtgccctga accgccctgc cttctttgct 1261 gagaggctct actatgctat gaaaggtgct ggcacagatg actccaccct ggtccggatt 1321 gtggtcactc gaagtgagat tgaccttgta caaataaaac agatgttcgc tcagatgtat 1381 cagaagactc tgggcacaat gattgcaggt gacacgagtg gagattaccg aagacttctt 1441 ctggctattg tgggccagta ggagggattt tttttttttt aatgaaaaaa aaatttctat 1501 tcatagctta tccttcagag caatgacctg catgcagcaa tatcaaacat cagctaaccg 1561 aaagagcttt ctgtcaagga ccgtatcagg gtaatgtgct tggtttgcac atgttgttat 1621 tgccttaatt ctaattttat tttgttctct acatacaatc aatgtaaagc catatcacaa 1681 tgatacagta atattgcaat gtttgtaaac cttcattctt actagtttca ttctaatcaa 1741 gatgtcaaat tgaataaaaa tcacagcaat ctct // LOCUS HUMSNRNPAF 904 bp mRNA PRI 29-SEP-1992 DEFINITION Homo sapiens U2 snRNP auxiliary factor small subunit, complete cds. ACCESSION M96982 NID g338262 KEYWORDS U2 snRNP auxiliary factor small subunit; ribonucleoprotein. SOURCE Homo sapiens fetus brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 904) AUTHORS Zhang,M., Zamore,P.D., Carmo-Fonseca,M., Lamond,A.I. and Green,M.R. TITLE Cloning and intracellular localization of the U2 small nuclear ribonucleoprotein auxiliary factor small subunit JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89, 8769-8773 (1992) MEDLINE 92409598 FEATURES Location/Qualifiers source 1..904 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="brain" CDS 39..761 /codon_start=1 /product="U2 snRNP auxiliary factor small subunit" /db_xref="PID:g338263" /translation="MAEYLASIFGTEKDKVNCSFYFKIGACRHGDRCSRLHNKPTFSQ TIALLNIYRNPQNSSQSADGLRCAVSDVEMQEHYDEFFEEVFTEMEEKYGEVEEMNVC DNLGDHLVGNVYVKFRREEDAEKAVIDLNNRWFNGQPIHAELSPVTDFREACCRQYEM GECTRGGFCNFMHLKPISRELRRELYGRRRKKHRSRSRSRERRSRSRDRGRGGGGGGG GGGGGRERDRRRSRDRERSGRF" polyA_site 904 BASE COUNT 234 a 185 c 276 g 209 t ORIGIN 1 ggaattccgt cgacggcagc ggcggcggcg ggtgggaaat ggcggagtat ctggcctcca 61 tcttcggcac cgagaaagac aaagtcaact gttcatttta tttcaaaatt ggagcatgtc 121 gtcatggaga caggtgctct cggttgcaca ataaaccgac gtttagccag accattgccc 181 tcttgaacat ttaccgtaac cctcaaaact cttcccagtc tgctgacggt ttgcgctgtg 241 ccgtgagcga tgtggagatg caggaacact atgatgagtt ttttgaggag gtttttacag 301 aaatggagga gaagtatggg gaagtagagg agatgaacgt ctgtgacaac ctgggagacc 361 acctggtggg gaacgtgtac gtcaagtttc gccgtgagga agatgcggaa aaggctgtga 421 ttgacttgaa taaccgttgg tttaatggac agccgatcca cgccgagctg tcacccgtga 481 cggacttcag agaagcctgc tgccgtcagt atgagatggg agaatgcaca cgaggcggct 541 tctgcaactt catgcatttg aagcccattt ccagagagct gcggcgggag ctgtatggcc 601 gccgtcgcaa gaagcataga tcaagatccc gatcccggga gcgtcgttct cggtctagag 661 accgtggtcg tggcggtggc ggtggcggtg gtggaggtgg cggcggacgg gagcgtgaca 721 ggaggcggtc gagagatcgt gaaagatctg ggcgattctg agccatgcca tttttacctt 781 atgtctgcta gaaagtgttg tagttgattg accaaaccag ttcataaggg gaatttttta 841 aaaaacaaca aaaaaaaaac atacaaagat gggtttctga ataaaaattt gtagtgataa 901 cagt // LOCUS HUMSNRNPD 1633 bp mRNA PRI 15-DEC-1988 DEFINITION Human autoantigen small nuclear ribonucleoprotein Sm-D mRNA, complete cds. ACCESSION J03798 NID g338264 KEYWORDS autoantigen; ribonucleoprotein. SOURCE Human B-lymphocyte, cDNA to mRNA, clone D45-2. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1633) AUTHORS Rokeach,L.A., Haselby,J.A. and Hoch,S.O. TITLE Molecular cloning of a cDNA encoding the human Sm-D autoantigen JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 4832-4836 (1988) MEDLINE 88263041 COMMENT Draft entry and computer-readable sequence [1] kindly submitted by L.Rokeach 20-JUL-1988. The Sm-D protein coded by cDNA D45-2, being a snRNP, is evidently involved in the mRNA splicing of highter eukaryotes; in the autoimmune disease systemic lupus erythematosus, antinuclear antibodies are developed with Sm specificity. FEATURES Location/Qualifiers source 1..1633 /organism="Homo sapiens" /db_xref="taxon:9606" RBS 143..149 /note="ribosome binding site" CDS 151..510 /note="small nuclear riboprotein Sm-D" /codon_start=1 /db_xref="PID:g338265" /translation="MKLVRFLMKLSHETVTIELKNGTQVHGTITGVDVSMNTHLKAVK MTLKNREPVQLETLSIRGNNIRYFILPDSLPLDTLLVDVEPKVKSKKREAVAGRGRGR GRGRGRGRGRGRGGPRR" BASE COUNT 474 a 278 c 373 g 508 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattccccc ccccccccca gtgctccgcg cgctcttgac gtccggagcc cctggagtag 61 gcgcttccgg ccattcatac tgcagtcggt cagtgttcgg ttgaaggatt ctgtgtgctg 121 tcggacccag agggtgacgg cgccgctagg atgaagctcg tgagattttt gatgaaattg 181 agtcatgaaa ctgtaaccat tgaattgaag aacggaacac aggtccatgg aacaatcaca 241 ggtgtggatg tcagcatgaa tacacatctt aaagctgtga aaatgaccct gaagaacaga 301 gaacctgtac agctggaaac gctgagtatt cgaggaaata acattcggta ttttattcta 361 ccagacagtt tacctctgga tacactactt gtggatgttg aacctaaggt gaaatctaag 421 aaaagggaag ctgttgcagg aagaggcaga ggaagaggaa gaggaagagg acgtggccgt 481 ggcagaggaa gagggggtcc taggcgataa tgtctctcaa gatttcaaag tcatatgaga 541 tttgggatat tttttgtaca ggttgtgttt gtttatgtca gtttttaata aacataaatg 601 tgggacagag ctgtctattt agtatatcaa agttttagta gtttcctcca cattcacgaa 661 attaccacag tgagagctaa gcatttctac tgggcagttt catttttagt tgatcaggtt 721 ttaagttttt gaactaaaat ttttcttttt ctttttatga tgaataaggt taaaataaaa 781 gccttagaca aattaaattt ggcagagttt aattgagcaa aggacaattc acaaatcagg 841 tagcccctga accataatag gctcagaggc ttcagcccag ctgcatagtt gaagatttat 901 ggacagaagg aaagtgatgt atggaaaatg gaagtgagat acagcaacag ccggattagt 961 tacagttcag cgtttgcctt atttgaatat ggtttgaaca gttcgctgtc tttggttggc 1021 tgaaacttag tgattgccac aagagtaggg taccgtctgt ttacacgtcc agttaggcta 1081 cagttctatg tactgagaaa cctttaagct gaacttgaga tatgtaaaga gactttaggc 1141 taaacttaac aatatatata ggaatatatc ccttctactt cacatgcact gaatatgcat 1201 tttattgctt tactcttcat tctgtggcac ctacccacag gggaagtaag aagtttgttt 1261 tggtatttcg gaaactaaag tccttatggg atggggtcta gaattgattc tcctttcctg 1321 agttttactc cacggagtct taggtacctg gtaaaaagtt gtcttctaaa ttaagggtca 1381 ttgctttgtt gtctagctgc taatgtctta cttttgtttc ttttgctttt taatcagttc 1441 ttaataggat atagttttat gttttccaag ttataacttg gagttaatgg tcactagatt 1501 atcagttatg agcagtgtta aaatctccta ttaatgtgta atgtacctgt cagtgcctcc 1561 tttattaagg ggttctttga gaataaaaga gaaaagacct actttatttg acagcaaaaa 1621 aaaaaaggaa ttc // LOCUS HUMSOMAT 1427 bp DNA PRI 03-AUG-1993 DEFINITION Human somatostatin receptor gene, complete cds. ACCESSION L14856 NID g292499 KEYWORDS G protein; G-protein coupled receptor; plasma membrane; receptor; somatostatin; somatostatin receptor; transmembrane protein. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1427) AUTHORS Xu,Y., Song,J., Bruno,J.F. and Berelowitz,M. TITLE Molecular cloning and sequencing of a human somatostatin receptor, hSSTR4 JOURNAL Biochem. Biophys. Res. Commun. 193, 648-652 (1993) MEDLINE 93290656 FEATURES Location/Qualifiers source 1..1427 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 99..1265 /standard_name="hSSTR4" /note="intronless gene" /codon_start=1 /evidence=experimental /product="somatostatin receptor" /db_xref="PID:g292500" /translation="MSAPSTLPPGGEEGLGTAWPSAANASSAPAEAEEAVAGPGDARA AGMVAIQCIYALVCLVGLVGNALVIFVILRYAKMKTATNIYLLNLAVADELFMLSVPF VASSAALRHWPFGSVLCRAVLSVDGLNMFTSVFCLTVLSVDRYVAVVHPLRAATYRRP SVAKLINLGVWLASLLVTLPIAIFADTRPARGGQAVACNLQWPHPAWSAVFVVYTFLL GFLLPVLAIGLCYLLIVGKMRAVALRAGWQQRRRSEKKITRLVLMVVVVFVLCWMPFY VVQLLNLVVTSLDATVNHVSLILSYANSCANPILYGFLSDNFRRSFQRVLCLRCCLLE GAGGAEEEPLDYYATALKSKGGAGCMCPPLPCQQEALQPEPGRKRIPLTRTTTF" BASE COUNT 201 a 525 c 425 g 276 t ORIGIN 1 gtctgggcgc cagcccccgc cctgggcccg ccgcccgcgc tctctggcgc agcgctagct 61 ccgccgcgct cagctgccct gcgccggcac ccctggtcat gagcgccccc tcgacgctgc 121 cccccggggg cgaggaaggg ctggggacgg cctggccctc tgcagccaat gccagtagcg 181 ctccggcgga ggcggaggag gcggtggcgg ggcccgggga cgcgcgggcg gcgggcatgg 241 tcgctatcca gtgcatctac gcgctggtgt gcctggtggg gctggtgggc aacgccctgg 301 tcatcttcgt gatccttcgc tacgccaaga tgaagacggc taccaacatc tacctgctca 361 acctggccgt agccgacgag ctcttcatgc tgagcgtgcc cttcgtggcc tcgtcggccg 421 ccctgcgcca ctggcccttc ggctccgtgc tgtgccgcgc ggtgctcagc gtcgacggcc 481 tcaacatgtt caccagcgtc ttctgtctca ccgtgctcag cgtggaccgc tacgtggccg 541 tggtgcaccc tctgcgcgcg gcgacctacc ggcggcccag cgtggccaag ctcatcaacc 601 tgggcgtgtg gctggcatcc ctgttggtca ctctccccat cgccatcttc gcagacacca 661 gaccggctcg cggcggccag gccgtggcct gcaacctgca gtggccacac ccggcctggt 721 cggcagtctt cgtggtctac actttcctgc tgggcttcct gctgcccgtg ctggccattg 781 gcctgtgcta cctgctcatc gtgggcaaga tgcgcgccgt ggccctgcgc gctggctggc 841 agcagcgcag gcgctcggag aagaaaatca ccaggctggt gctgatggtc gtggtcgtct 901 ttgtgctctg ctggatgcct ttctacgtgg tgcagctgct gaacctcgtc gtgaccagcc 961 ttgatgccac cgtcaaccac gtgtccctta tcctcagcta tgccaacagc tgcgccaacc 1021 ctattctcta tggcttcctc tccgacaact tccgccgatc cttccagcgg gttctctgcc 1081 tgcgctgctg cctcctggaa ggtgctggag gtgctgagga ggagcccctg gactactatg 1141 ccactgctct caagagcaaa ggtggggcag ggtgcatgtg ccccccactc ccctgccagc 1201 aggaagccct gcaaccagaa cccggccgca agcgcatccc cctcaccagg accaccacct 1261 tctgaggagc ccttccccta cccaccctgc gtggccacct cccaaggggt gggcaccatt 1321 cctacagccc cgaagactgc atctcctgaa tgctcaccta agctccacca cctgttcctt 1381 ccagcagccc atgtacctgc cggagagtgt cagaactctt ctgctgc // LOCUS HUMSP10A 1043 bp mRNA PRI 10-JAN-1992 DEFINITION Human sperm protein 10 mRNA, complete cds. ACCESSION M82967 NID g338291 KEYWORDS sperm protein 10. SOURCE Homo sapiens male adult testis cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1043) AUTHORS Wright,R.M., John,E., Klotz,K., Flickinger,C.J. and Herr,J.C. TITLE Cloning and sequencing of cDNAs coding for the human intra-acrosomal antigen SP-10 JOURNAL Biol. Reprod. 42, 693-701 (1990) MEDLINE 90268085 FEATURES Location/Qualifiers source 1..1043 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /sex="male" /tissue_type="testis" 5'UTR <1..52 /note="putative" mRNA 1..1043 CDS 53..793 /note="putative" /codon_start=1 /product="sperm protein 10" /db_xref="PID:g338292" /translation="MNRFLLLMSLYLLGSARGTSSQPNELSGSIDHQTSVQQLPGEFF SLENPSDAEALYETSSGLNTLSEHGSSEHGSSKHTVAEHTSGEHAESEHASGEPAATE HAEGEHTVGEQPSGEQPSGEHLSGEQPLSELESGEQPSDEQPSGEHGSGEQPSGEQAS GEQPSGTILNCYTCAYMNDQGKCLRGEGTCITQNSQQCMLKKIFEGGKLQFMVQGCEN MCPSMNLFSHGTRMQIICCRNQSFCNKI" mat_peptide 53..790 /product="sperm protein 10" 3'UTR 794..>1043 /note="putative" polyA_signal 1029..1034 /note="putative" BASE COUNT 264 a 254 c 222 g 303 t ORIGIN 1 gctatgaagc agctgtggcc cacactgggg tcccctcttt tcctaaatcc agatgaacag 61 gtttctcttg ctaatgagtc tttatctgct tggatctgcc agaggaacat caagtcagcc 121 taatgagctt tctggctcca tagatcatca aacttcagtt cagcaacttc caggtgagtt 181 cttttcactt gaaaaccctt ctgatgctga ggctttatat gagacttctt caggcctgaa 241 cactttaagt gagcatggtt ccagtgagca tggttcaagc aagcacactg tggccgagca 301 cacttctgga gaacatgctg agagtgagca tgcttcaggt gagcccgctg cgactgaaca 361 tgctgaaggt gagcatactg taggtgagca gccttcagga gaacagcctt caggtgaaca 421 cctctccgga gaacagcctt tgagtgagct tgagtcaggt gaacagcctt cagatgaaca 481 gccttcaggt gaacatggct ccggtgaaca gccttctggt gagcaggcct cgggtgaaca 541 gccttcaggc acaatattaa attgctacac atgtgcttat atgaatgatc aaggaaaatg 601 tcttcgtgga gagggaacct gcatcactca gaattcccag cagtgcatgt taaagaagat 661 ctttgaaggt ggaaaactcc aattcatggt tcaagggtgt gagaacatgt gcccatctat 721 gaacctcttc tcccatggaa cgaggatgca aattatatgc tgtcgaaatc aatctttctg 781 caataagatc tagaagcctg ggcccttgct tgttttgact caggcagtaa aaagcctcca 841 tcactctatt tggctcattt tatatttagt tccttcccca gtcaacaact gaccacatct 901 gcctctgcct gagcattagg atgctcaaac atcctatctt tcttcttcta ttcatgcttt 961 tatccattct tctctgtcct gtcttccctg ctccaactct ttctctcaat attcctgatt 1021 tttttttcaa taaatttcac atg // LOCUS HUMSP2A 2063 bp mRNA PRI 14-OCT-1992 DEFINITION Human Sp2 protein mRNA, complete cds. ACCESSION M97190 NID g338300 KEYWORDS Sp2 protein. SOURCE Homo sapiens (library: Molt13 cDNA) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2063) AUTHORS Kingsley,C. and Winoto,A. TITLE Cloning of GT box-binding proteins: A novel Sp1 multigene family regulating T-cell receptor gene expression JOURNAL Mol. Cell. Biol. 12, 4251-4261 (1992) MEDLINE 93024366 FEATURES Location/Qualifiers source 1..2063 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Molt13" /cell_type="T cells" /tissue_lib="Molt13 cDNA" CDS 367..1854 /codon_start=1 /product="Sp2 protein" /db_xref="PID:g338301" /translation="MINKGTRSNANIQYQAVPQIQASNSQTIQVQPNLTNQIQIIPGT NQAIITPSPSSHKPVPIKPAPIQKSSTTTTPVQSGANVVKLTGGGGNVTLTLPVNNLV NASDTGAPTQLLTESPPTPLSKTNKKARKKSLPASQPPVAVAEQVETVLIETTADNII QAGNNLLIVQSPGGGQPAVVQQVQVVPPKAEQQQVVQIPQQALRVVQAASATLPTVPQ KPSQNFQIQAAEPTPTQVYIRTPSGEVQTVLVQDSPPATAAATSNTTCSSPASRAPHL SGTSKKHSAAILRKERPLPKIAPAGSIISLNAAQLAAAAQAMQTININGVQVQGVPVT ITNTGGQQQLTVQNVSGNNLTISGLSPTQIQLQMEQALAGETQPGEKRRRMACTCPNC KDGEKRSGEQGKKKHVCHIPDCGKTFRKTSLLRAHVRLHTGERPFVCNWFFCGKRFTR SDELQRHARTHTGDKRFECAQCQKRFMRSDHLTKHYKTHLVTKNL" BASE COUNT 469 a 717 c 521 g 356 t ORIGIN 1 gcggccgccg taatgagcga tccacagacc agcatggctg ccactgctgc tgtgagtccc 61 agtgactacc tgcagcctgc cgcctccacc acccaggact cccagccatc tcccttagcc 121 ctgcttgctg caacatgtag caaaattggc cctccagcag ttgaagctgc tgtgacacct 181 cctgctcccc cacagcccac accgcggaaa cttgtcccta tcaaacctgc ccctctccct 241 ctcagccccg gcaagaatag ctttggaatc ttgtcctcca aaggaaatat acttcagatt 301 caggggtcac aactgagcgc ctcctatcct ggagggcagc tggtgttcgc tatccagaat 361 cccaccatga tcaacaaagg gacccgatca aatgccaata tccagtacca ggcggtccct 421 cagattcagg caagcaattc ccaaaccatc caagtacagc ccaatctcac caaccagatc 481 cagatcatcc ctggcaccaa ccaagccatc atcaccccct caccgtccag tcacaagcct 541 gtccccatca agccagcccc catccagaag tcgagtacga ccaccacccc cgtgcagagc 601 ggggccaatg tggtgaagct gacaggtggg ggcggcaatg tgacgctcac tctgcccgtc 661 aacaaccttg tgaacgccag tgacaccggg gcccctactc agctcctcac tgaaagcccc 721 ccaaccccgc tgtctaagac taacaagaaa gcaaggaaga agagccttcc tgcctcccag 781 ccccctgtgg ctgtggctga gcaggtggag acggtgctga tcgagaccac cgcggacaac 841 atcatccagg caggaaataa cctgctcatt gttcagagcc ctggtggggg ccagccagct 901 gtggtccagc aggtccaggt ggtgcccccc aaggccgagc agcagcaggt ggtacagatc 961 ccccagcagg ctctgcgggt ggtgcaggcg gcatctgcca ccctccccac tgtaccccag 1021 aagccctccc agaactttca gatccaggca gctgagccga cacctactca ggtctacatc 1081 cgcacgcctt ccggtgaggt gcagacagtc cttgtccagg acagcccccc agcaacagct 1141 gcagccacct ctaacaccac ctgtagcagc cctgcatccc gtgctcccca tctgagtggg 1201 accagcaaaa agcactcagc tgcaattctc cgaaaagagc gtcccctgcc aaagattgcc 1261 ccagccggga gcatcatcag cctgaatgca gcccagttgg cggcagctgc ccaggcaatg 1321 cagaccatca acatcaatgg tgtccaggtc cagggcgtgc ctgtcaccat caccaacaca 1381 ggcgggcagc agcagctgac agtgcagaat gtttctggga acaacctgac catcagtggg 1441 ctgagcccca cccagatcca gctgcaaatg gaacaagccc tggccggaga gacccagccc 1501 ggggagaagc ggcgccgcat ggcctgcacg tgtcccaact gcaaggatgg ggagaagagg 1561 tctggagagc agggcaagaa gaagcacgtt tgccacatcc ccgactgtgg caagacgttc 1621 cgtaagacgt ccttgctgcg tgcccatgtg cgcctgcaca ctggcgagcg gccctttgtc 1681 tgcaactggt tcttctgtgg gaagaggttc acacggagtg acgagctcca acggcatgct 1741 cgcacccaca caggggacaa acgcttcgag tgcgcccagt gtcagaagcg cttcatgagg 1801 agtgaccacc tcaccaagca ttacaagacc cacctggtca cgaagaactt gtaaggccaa 1861 ctgcggcggg aggcctgaag atgcagtccc ccacctgtgt cctccctggg cccctggtgg 1921 aaaggagccc tgtggctgcc ttggcctgcc ctcagcccca ctcctgttct gcaactgtcc 1981 ccacaggaag gggctctgtt ccctgtattg tcctccttct gaagcccctt gtctgccttg 2041 gcccttcccc tcaccacgag ctc // LOCUS HUMSP5 991 bp mRNA PRI 15-JUN-1988 DEFINITION Human pulmonary surfactant protein (SP5) mRNA, complete cds. ACCESSION J03553 NID g338306 KEYWORDS alternative splicing; pulmonary surfactant protein. SOURCE Human bronchoalveolar lavage fluid from patients with alveolar proteinosis, cDNA to mRNA, clone h5k-18. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 991) AUTHORS Warr,R.G., Hawgood,S., Buckley,D.I., Crisp,T.M., Schilling,J., Benson,B.J., Ballard,P.L., Clements,J.A. and White,R.T. TITLE Low molecular weight human pulmonary surfactant protein (SP5): Isolation, characterization, and cDNA and amino acid sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 7915-7919 (1987) MEDLINE 88068508 COMMENT Draft entry and printed copy of sequence for [1] kindly provided by R.G.Warr, 09-FEB-1988. Two nucleotide regions of clone h5k-18 were not found in the other sp5 cDNA clones studied. These inserts (positions 440-458 and 629-636) are due to alternative splicing. The first 144 base pairs are from a genomic library. FEATURES Location/Qualifiers source 1..991 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA 29..991 /note="SP5 mRNA (minar alt.)" mRNA 30..991 /note="SP5 mRNA (minar alt.)" mRNA 31..991 /note="SP5 mRNA (minar alt.)" mRNA 32..991 /note="SP5 mRNA (minar alt.)" mRNA 72..991 /note="SP5 mRNA (minar alt.)" mRNA 73..991 /note="SP5 mRNA (minar alt.)" mRNA 145..991 /note="SP5 mRNA (major alt.; 5' end +/- 4 bp)" mRNA 161..991 /note="SP5 mRNA (minar alt.)" sig_peptide 175..243 /note="pulmonary surfactant protein signal peptide" CDS 175..768 /note="pulmonary surfactant protein (SP5) precursor" /codon_start=1 /db_xref="PID:g338307" /translation="MDVGSKEVLMESPPDYSAAPRGRFGIPCCPVHLKRLLIVVVVVV LIVVVIVGALLMGLHMSQKHTEMVLEMSIGAPEAQQRLALSEHLVTTATFSIGSTGLV VYDYQQLLIAYKPAPGTCCYIMKIAPESIPSLEALNRKVHNFQMECSLQAKPAVPTSK LGQAEGRDAGSAPSGGDPAFLGMAVNTLCGEVPLYYI" mat_peptide 244..765 /note="pulmonary surfactant protein (SP5)" BASE COUNT 215 a 290 c 305 g 181 t ORIGIN Unreported. 1 cttatctcgg cttcgtttct ggagggccag gaacaaacag gcttcaaagc caagggcttg 61 gctggcacac agggggcttg gtccttcacc tctgtcccct ctccctacgg acacatataa 121 gaccctggtc acacctggga gaggaggaga ggagagcata gcacctgcag caagatggat 181 gtgggcagca aagaggtcct gatggagagc ccgccggact actccgcagc tccccggggc 241 cgatttggca ttccctgctg cccagtgcac ctgaaacgcc ttcttatcgt ggtggtggtg 301 gtggtcctca tcgtcgtggt gattgtggga gccctgctca tgggtctcca catgagccag 361 aaacacacgg agatggttct ggagatgagc attggggcgc cggaagccca gcaacgcctg 421 gccctgagtg agcacctggt taccactgcc accttctcca tcggctccac tggcctcgtg 481 gtgtatgact accagcagct gctgatcgcc tacaagccag cccctggcac ctgctgctac 541 atcatgaaga tagctccaga gagcatcccc agtcttgagg ctctcaatag aaaagtccac 601 aacttccaga tggaatgctc tctgcaggcc aagcccgcag tgcctacgtc taagctgggc 661 caggcagagg ggcgagatgc aggctcagca ccctccggag gggacccggc cttcctgggc 721 atggccgtga acaccctgtg tggcgaggtg ccgctctact acatctagga cgcctccggt 781 gagcagggtc agtggaagcc ccaacgggaa aggaaacgcc ccgggcaaag ggtcttttgc 841 agcttttgca gacgggcaag aagctgcttc tgcccacacc gcagggacaa accctggaga 901 aatgggagct tggggagagg atgggagtgg gcagaggtgg cacccagggg cccgggaact 961 cctgccacaa cagaataaag cagcctgatt g // LOCUS HUMSPABIND 4161 bp mRNA PRI 17-SEP-1993 DEFINITION Homo sapiens surfactant protein A mRNA, complete cds. ACCESSION L10123 NID g402149 KEYWORDS membrane protein; surfactant protein A. SOURCE Homo sapiens adult lung cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4161) AUTHORS Strayer,D.S. TITLE Surfactant protein A binding proteins: characterization and structure JOURNAL J. Biol. Chem. 268, 18679-18684 (1993) MEDLINE 93366778 FEATURES Location/Qualifiers source 1..4161 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="lung" CDS 2058..2894 /codon_start=1 /product="surfactant protein A" /db_xref="PID:g402150" /translation="MTFKNNTTLSLFRLFNSFPLHKGYITTRELHQLALANLIFMFNC ADLHFPPQMRKCFPIPGTSCLFCPKTKTLFPVLHLANQYLCFLCQHKHCFPQTRFPDS QPSQATSRCVLSIKFTSSIYINSHNYTLILTCIIVRVLCPPVGCKSYKDRGHIHLAHP SVPSTCHGSYQLTSTQYIVWISKYDSNFQHPKLVYSPTEPAQVEHMQCFLCMCLQREE REALLLLPRVTILTRLSAESTDERDGDSEPVNAVCRTALAFVPHESNVMLGIHNLLIW LL" BASE COUNT 1223 a 848 c 803 g 1287 t ORIGIN 1 gaattccggt tatcactgac actatataac tctgccactt agggccatgt gaccttaagg 61 aaggcacttc acctgtttgt gtctcatctg taaaatggga attatgatga tacctacctt 121 aaaaagggtt gtatcaaggg agttaatgca tgtaaaacaa tgagaagaat ctttagccaa 181 ttgtaagcat aagtgtcaat cattattatt agtagtagta gtacatttat ttgtttgggg 241 tttattatct gtctcctaca caagaagata accttcttaa cgacaaagac tttgttttgt 301 tgactactat attctcattg caagaagagt gcctagcaca tagtaagggc ttaataatta 361 tttattcaat gaatgaatga atgcataaat gacttcaagt cttcatcatc ctttttccgt 421 gacttatggg atttggctgc taaagttcat gagtgggtga acatagtgtt tgatgtcaca 481 gctgcacaca cttggtccag cacccccata tgcctagata caaagtgggc atcacttgtg 541 ttctttgttg tgcctcattc tgatgacctt ttcgcctgac attttcttcc cacaggcctc 601 atgaatggcc aggctatgct ccacttcctt caagttgttc atttctctta atctcatgaa 661 cgtgaacctg tcctcaattt cattgccata agatctcctc ttagtaggtg atgcctaatt 721 actctttcca attaacatgc tattcaaatt tgtctaatgt ttcttccagt tctaaaattt 781 gatgaccact gatctgatgt gaccgtgtga ttgtttcaca atttgatgtg tgtgatgata 841 ttagtatgta caatacatgc tgtttacctc tatctagtaa caaatgttta catttgttac 901 tagagtaagc tttaggggta gaacttgagt ctcaaagctc acgtcttggg aaaaaaaaaa 961 gatctggtat tgatagaacc ttgataaatc ccttgttata gcacatgttt ttgtttccca 1021 gaagtagggc aagagctctc tggcttatcc aatcagcttc ttctcctgac tttagtagag 1081 acacgggctt cattactgag ctgggaattt aacttccctc tgtcctatga aaagtagaag 1141 attttcctaa gagaaacctt cagtgagagc agaacagaaa aaagagcaaa gaactaactg 1201 aaggagaggg caagaaatca gtgtagtgtg gggcagcaga agggaagtga caggtcaaag 1261 aaaataggag agaaaaggag agagagctac ccaagtaaac aggaaaaggg gaggaaatac 1321 agggcaaaag cagagagcag gggaattggc atgacaagaa agtggtgaaa cacagaggat 1381 tccaccagac aacagagagc ccggtagaga agagtgggct aggataaaga gacaggaaac 1441 aaatggatta agacatagag tagccattgt cagttacctg ggtaagaaaa gattggtggt 1501 ctaaataaaa tgcagattgt tcatttacaa ttctgactgt ttaaattatt tatggtacac 1561 cttggtgctt cacttaatca tgttaagtta cagttgtaat cttattaatt acttgcaacc 1621 agctttctca atgtaccatg tatagaggac agcttgtcaa ggggtagaag tactatagct 1681 tcacttttaa tccagtatca tccagtacac aaattagcct tacactgtta gaatcttcct 1741 gatgatgact ggctgaacta agcattagtc ttgacaatac aagtactgct gtgcccattt 1801 acagaaaggg aagcaattct ttcccagaaa gtaagatatg atttaaatgc tttggcctaa 1861 ttagctgtga aaaaggacat ggcccaggta aggcttcagg aaaagccaat taacacatgt 1921 gcatatgtct ctctccattc acatcaccca gcctagtcca agtcaccacc atctcttcac 1981 ttgcattgtt acagtagtct cttagccgct tttcccacct ccacctttga tatctctagt 2041 tattcatgag gtccaaaatg acttttaaaa ataacaccac actatctcta tttagacttt 2101 tcaacagttt cccactgcac aagggataca ttaccacaag ggaattacac caacttgccc 2161 ttgcaaatct catctttatg ttcaactgtg ctgatcttca ttttcctcct cagatgcgca 2221 aatgctttcc catcccaggg acttcatgtt tgttctgccc caaaactaaa acactcttcc 2281 ctgttcttca tctggctaac caatatttgt gctttctgtg tcagcataaa cactgttttc 2341 ctcagacaag atttcctgac tcccagccta gccaggccac ttcaaggtgt gtgctctcca 2401 taaagtttac ttcttctata tacatcaatt cacataatta tactttaata cttacttgta 2461 taattgttcg tgtgctatgc cccccagttg gctgtaagtc ctataaagac agaggccaca 2521 tccatcttgc acatccctct gttcctagca cctgccatgg gtcctatcag ctgacaagca 2581 ctcaatatat tgtatggata agtaaatatg attctaattt ccaacatcct aaacttgttt 2641 acagtcccac ggagcctgct caggtagagc acatgcaatg cttcctatgc atgtgtttgc 2701 agagagaaga gagagaagcc cttctgcttc tcccacgtgt taccattttg accagactca 2761 gtgctgaaag cactgatgaa agggatggag acagtgagcc tgtgaatgcc gtgtgcagaa 2821 cagcacttgc ctttgtcccc catgaaagta atgtgatgct cggtatccac aatcttttaa 2881 tctggctgct gtagggtgag aatttgtggt ccttttctgt taagatggaa tcttgctgaa 2941 ggtcagaatg tggttagagt caaagagtgg catcactagc actgtgtata aaagaaagaa 3001 tacaactcgg atctttggtt ctctaccaac ttgactcact ttctccacag caaggctcat 3061 attatccttt cctaatggag tttaggatgc aattgcagtc ctcttcagac aagtctcctg 3121 tcccttttgc tttattctga aacaatggag tccaaaatct gtacttctta catagtacag 3181 tgactagctt ctaacattcc agcatggcta tctcaaggac ttttgactaa atgctgccag 3241 ttcatttgga gtgatctctg gcagggccaa attcagcata atgtagcttc ctgttgtctt 3301 cctatattcc tgtatgttat tgtaaatata ttgacgtaag ctgacctatt atacatgctt 3361 gttcccttct atggtgatag ggactaggaa agacaatgca ctgttcactc taatgtaagg 3421 gatcacatat aattcttatg atttgatgat tctcttctgg cactcaaacc tgcttgtagc 3481 ttttaggatt catggccact ggtggcatta aacacttagg gcctgtcatc acaaaaggac 3541 actttgtaca caattatctt tccaaaggct agatcagccc atgtcaatgt catgtttaat 3601 atttaaacaa aacacacttt actgtcattt gtatctattt actcagtcac catctgcaaa 3661 accactgtga ataaaaatgg aacttgtggt gcaagaaaaa tatattgaga atcattgagt 3721 tggacagacc tagtgagaag gttgtgtcta ttcaaacacc tgtcttccac cctcccacca 3781 tctgtgcaat cacttcaccc ttcagcctca ctagtccccc taacaattac cctgtcaaga 3841 ggagagtgca gctcaggtgg atttaatgtg ggtttaatat ggcctgttga gtttaatgtt 3901 taatgttgat tttctttaag taaccatttc tgttcttgct ataaatctat gtctatatgt 3961 ctatgcttaa tttggatgat gaaggcaact tggatttaag gaaagagcca gtttatatgt 4021 tttatgaaga gattatagat caaataagca gacgatagac cttccaaagc tataagtgaa 4081 aacagaaaaa tgacgcgtaa ttcacaggtt acagtgaagt taaaagagag ggttaatatt 4141 ttaatgtgtt gttaagaatt c // LOCUS HUMSPARC 2133 bp mRNA PRI 13-JAN-1995 DEFINITION Human SPARC/osteonectin mRNA, complete cds. ACCESSION J03040 NID g338312 KEYWORDS calcium-binding protein; glycoprotein; osteonectin. SOURCE Human 34-week-old placenta, cDNA to mRNA, clones PSC[4, 5]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2133) AUTHORS Swaroop,A., Hogan,B.L. and Francke,U. TITLE Molecular analysis of the cDNA for human SPARC/osteonectin/BM-40: sequence, expression, and localization of the gene to chromosome 5q31-q33 JOURNAL Genomics 2 (1), 37-47 (1988) MEDLINE 88256150 COMMENT Draft entry and printed copy of sequence for [1] kindly provided by U.Francke 26-FEB-1988. FEATURES Location/Qualifiers source 1..2133 /organism="Homo sapiens" /db_xref="taxon:9606" /map="5q31-q33" gene 58..969 /gene="SPARC" CDS 58..969 /gene="SPARC" /note="osteonectin" /codon_start=1 /db_xref="GDB:G00-118-733" /db_xref="PID:g338313" /translation="MRAWIFFLLCLAGRALAAPQQEALPDETEVVEETVAEVTEVSVG ANPVQVEVGEFDDGAEETEEEVVAENPCQNHHCKHGKVCELDENNTPMCVCQDPTSCP APIGEFEKVCSNDNKTFDSSCHFFATKCTLEGTKKGHKLHLDYIGPCKYIPPCLDSEL TEFPLRMRDWLKNVLVTLYERDEDNNLLTEKQKLRVKKIHENEKRLEAGDHPVELLAR DFEKNYNMYIFPVHWQFGQLDQHPIDGYLSHTELAPLRAPLIPMEHCTTRFFETCDLD NDKYIALDEWAGCFGIKQKDIDKDLVI" BASE COUNT 543 a 533 c 521 g 536 t ORIGIN Chromosome 5q31-q33. 1 cgggagagcg cgctctgcct gccgcctgcc tgcctgccac tgagggttcc cagcaccatg 61 agggcctgga tcttctttct cctttgcctg gccgggaggg ccttggcagc ccctcagcaa 121 gaagccctgc ctgatgagac agaggtggtg gaagaaactg tggcagaggt gactgaggta 181 tctgtgggag ctaatcctgt ccaggtggaa gtaggagaat ttgatgatgg tgcagaggaa 241 accgaagagg aggtggtggc ggaaaatccc tgccagaacc accactgcaa acacggcaag 301 gtgtgcgagc tggatgagaa caacaccccc atgtgcgtgt gccaggaccc caccagctgc 361 ccagccccca ttggcgagtt tgagaaggtg tgcagcaatg acaacaagac cttcgactct 421 tcctgccact tctttgccac aaagtgcacc ctggagggca ccaagaaggg ccacaagctc 481 cacctggact acatcgggcc ttgcaaatac atcccccctt gcctggactc tgagctgacc 541 gaattccccc tgcgcatgcg ggactggctc aagaacgtcc tggtcaccct gtatgagagg 601 gatgaggaca acaaccttct gactgagaag cagaagctgc gggtgaagaa gatccatgag 661 aatgagaagc gcctggaggc aggagaccac cccgtggagc tgctggcccg ggacttcgag 721 aagaactata acatgtacat cttccctgta cactggcagt tcggccagct ggaccagcac 781 cccattgacg ggtacctctc ccacaccgag ctggctccac tgcgtgctcc cctcatcccc 841 atggagcatt gcaccacccg ctttttcgag acctgtgacc tggacaatga caagtacatc 901 gccctggatg agtgggccgg ctgcttcggc atcaagcaga aggatatcga caaggatctt 961 gtgatctaaa tccactcctt ccacagtacc ggattctctc tttaaccctc cccttcgtgt 1021 ttcccccaat gtttaaaatg tttggatggt ttgttgttct gcctggagac aaggtgctaa 1081 catagattta agtgaataca ttaacggtgc taaaaatgaa aattctaacc caagacatga 1141 cattcttagc tgtaacttaa ctattaaggc cttttccaca cgcattaata gtcccatttt 1201 tctcttgcca tttgtagctt tgcccattgt cttattggca catgggtgga cacggatctg 1261 ctgggctctg ccttaaacac acattgcagc ttcaactttt ctctttagtg ttctgtttga 1321 aactaatact taccgagtca gactttgtgt tcatttcatt tcagggtctt ggctgcctgt 1381 gggcttcccc aggtggcctg gaggtgggca aagggaagta acagacacac gatgttgtca 1441 aggatggttt tgggactaga ggctcagtgg tgggagagat ccctgcagaa tccaccaacc 1501 agaacgtggt ttgcctgagg ctgtaactga gagaaagatt ctggggctgt cttatgaaaa 1561 tatagacatt ctcacataag cccagttcat caccatttcc tcctttacct ttcagtgcag 1621 tttcttttca cattaggctg ttggttcaaa cttttgggag cacggactgt cagttctctg 1681 ggaagtggtc agcgcatcct gcagggcttc tcctcctctg tcttttggag aaccagggct 1741 cttctcaggg gctctaggga ctgccaggct gtttcagcca ggaaggccaa aatcaagagt 1801 gagatgtaga aagttgtaaa atagaaaaag tggagttggt gaatcggttg ttctttcctc 1861 acatttggat gattgtcata aggtttttag catgttcctc cttttcttca ccctcccctt 1921 tgttcttcta ttaatcaaga gaaacttcaa agttaatggg atggtcggat ctcacaggct 1981 gagaactcgt tcacctccaa gcatttcatg aaaaagctgc ttcttattaa tcatacaaac 2041 tctcaccatg atgtgaagag tttcacaaat ctttcaaaat aaaaagtaat gacttagaaa 2101 ctgaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa // LOCUS HUMSPR1A 623 bp mRNA PRI 15-DEC-1988 DEFINITION Human small proline rich protein (sprI) mRNA, clone 128. ACCESSION M19888 NID g338416 KEYWORDS small proline-rich protein. SOURCE Human epidermal keratinocytes, cDNA to mRNA, clone 128. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 623) AUTHORS Kartasova,T. and van de Putte,P. TITLE Isolation, characterization, and UV-stimulation expression of two families of genes encoding polypeptides of related structure in human epidermal keratinocytes JOURNAL Mol. Cell. Biol. 8, 2195-2203 (1988) MEDLINE 88261298 COMMENT Draft entry and computer readable copy of sequence [1] kindly submitted by T.Kartasova 19-JUL-1988. FEATURES Location/Qualifiers source 1..623 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 69..338 /note="small proline rich protein" /codon_start=1 /db_xref="PID:g338417" /translation="MSSQQQKQPCIPPPQLQQQQVKQPCQPPPQEPCIPKTKEPCHPK VPEPCHPKVPEPCQPKVPEPCHPKVPEPCPSIVTPAPAQQKTKQK" BASE COUNT 165 a 207 c 121 g 130 t ORIGIN Unreported. 1 gaccaccagt tctaagggac catacagagt attcctctct tcacaccagg accagccact 61 gttgcagcat gagttcccag cagcagaagc agccctgcat cccaccccct cagcttcagc 121 agcagcaggt gaaacagcct tgccagcctc cacctcagga accatgcatc cccaaaacca 181 aggagccctg ccaccccaag gtgcctgagc cctgccaccc caaagtgcct gagccctgcc 241 agcccaaggt tccagagcca tgccacccca aggtgcctga gccctgccct tcaatagtca 301 ctccagcacc agcccagcag aagaccaagc agaagtaatg tggtccacag ccatgccctt 361 gaggagccgg ccaccagatg ctgaatcccc tatcccattc tgtgtatgag tcccatttgc 421 cttgcaatta gcattctgtc tcccccaaaa aagaatgtgc tatgaagctt tctttcctac 481 acactctgag tctctgaatg aagctgaagg tcttagtacc agagctagtt ttcagctgct 541 cagaattcat ctgaagagag acttaagatg aaagcaaatg attcagctcc cttatacccc 601 cattaaattc actttcaatt cca // LOCUS HUMSPR2A 680 bp mRNA PRI 15-DEC-1988 DEFINITION Human small proline rich protein (sprII) mRNA, clone 930. ACCESSION M20030 NID g338422 KEYWORDS small proline-rich protein. SOURCE Human epidermal keratinocytes, cDNA to mRNA, clone 930. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 680) AUTHORS Kartasova,T. and van de Putte,P. TITLE Isolation, characterization, and UV-stimulation expression of two families of genes encoding polypeptides of related structure in human epidermal keratinocytes JOURNAL Mol. Cell. Biol. 8, 2195-2203 (1988) MEDLINE 88261298 COMMENT Draft entry and computer readable copy of sequence [1] kindly submitted by T.Kartasova 19-JUL-1988. FEATURES Location/Qualifiers source 1..680 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 64..282 /note="small proline rich protein" /codon_start=1 /db_xref="PID:g338423" /translation="MSYQQQQCKQPCQPPPVCPTPKCPEPCPPPKCPEPCPPPKCPQP SPPQQCQQKCPPVTPSPPCQPKCPPKSK" BASE COUNT 169 a 206 c 138 g 167 t ORIGIN Unreported. 1 aaaaactcct ggtacttgag cactgatctg ctttggagaa cctgattctg agactccagc 61 aggatgtctt atcaacagca gcagtgcaag cagccctgcc agccacctcc tgtgtgcccc 121 acgccaaagt gcccagagcc atgtccaccc ccgaagtgcc ctgagccctg cccaccacca 181 aagtgtccac agccctcccc acctcagcag tgccagcaaa aatgtcctcc tgtgacacct 241 tccccaccct gccagccaaa gtgtccaccc aagagcaagt aacagcttca gaattcatca 301 ggagcatgaa aggataagga taattggctc accttgttcc acagcttcac ctgcatcttc 361 tcatcaaagc ctaccatgga tacacagtta gcttctttcc tcttagccag tgatctgccc 421 atgatgatcc ctgatagcaa aaggtttcct ttctgaggct gccatattgc cactgtccag 481 gtggatactg agaaaggaag tcctcagcag tgtcagttcc cagagctttg gaagaaggac 541 cagcagctct gtccctggga accatcaaaa aatgctgttg atgttttctg tgtctgtctg 601 tcacctgggc atgggcttct aacacctgtg caattgtcac ttttctttca cttccctgaa 661 taaatatctt tgcatacgta // LOCUS HUMSPRMTK 1872 bp mRNA PRI 19-JUL-1995 DEFINITION Homo sapiens transmembrane tyrosine kinase mRNA, complete cds. ACCESSION L08961 NID g897614 KEYWORDS transmembrane protein; tyrosine kinase. SOURCE Homo sapiens (tissue library: lambda gt11) male testis cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1872) AUTHORS Burks,D.J., Carballada,R., Moore,H.D. and Saling,P.M. TITLE Interaction of a tyrosine kinase from human sperm with the zona pellucida at fertilization JOURNAL Science 269 (5220), 83-86 (1995) MEDLINE 95327955 FEATURES Location/Qualifiers source 1..1872 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /tissue_type="testis" /tissue_lib="lambda gt11" CDS 70..1872 /codon_start=1 /product="tyrosine kinase" /db_xref="PID:g897615" /translation="MKPITKQQGELVGSRISHVWQSAGISKELLEEVGQNGSRARISV QVHNATCTVRIAAVTKGGVGPFSDPVCKYYTGGNTGYFCANVSRMSTHRSFKLNNTLH IPCRGRPQPNVTCRDLKRCNVSDEVQRGMPGNVTPCTRLGRLCPLFNSGAWQRRSCAH HLWLLLWIILIGLVLYISLAIRKRVQETKFGNAFTEEDSELVVNYIAKKSFCRRAIEL THSLGVSEELQNKLEDVVIDRNLLILGKILGEGEKGTVYEGLWNIPEGKEVKIPVAIK TLKLDTMANKEILDEASVMKGFGNPHVVRLLGICMTSTIYVITEYCLLVYRRNKDKAE QHRSNCAELNPPLQTLLKFMVDIALGMEYLSNRNFLHRDLAARNCMLRDDMTVCVADF GLSKKIYSGDYYRQGRIAKMPVKWIAIESLADRVYTKSDVWAFGVTMWEIATTLRGMT PYPGVQNHEMYDYLLHGHRLKQPRTAWNCTEIRIRLLKLPILGSRTMRPMTIFSMATR LSSPKTAWMNCMKKCTLAGEPIPKTGPTFSVLRLQLEKLLESLPDVRNQADVIYVNTQ LLESEGLARVHPCSTGLEHHPCSEHRPRPHLYNC" BASE COUNT 552 a 423 c 449 g 448 t ORIGIN 1 cttggattct agccagcacg actgaaggag ccccatcagt agcaccttta aatgtcactg 61 tgtttctgaa tgaagccaat tacaaagcaa caaggtgagc tggttggctc aaggatctcc 121 catgtgtggc aaagcgctgg catttctaaa gaattattag aagaagttgg gcaaaatggg 181 tctcgtgcgc gtatttctgt tcaagtccat aacgccacct gcaccgtgag gattgccgct 241 gttaccaagg ggggagtcgg ccctttctct gaccccgtgt gtaaatatta tacaggaggg 301 aataccgggt atttctgcgc taacgtatcc cgcatgtcca cacatagaag ctttaaacta 361 aataacacct tacacatccc ctgtcgaggg cgaccacaac caaacgtgac ctgccgagac 421 ctaaagcggt gcaatgtgtc cgacgaagtt caaaggggca tgccagggaa cgtcacaccc 481 tgcacacgac taggccggct ctgtccgcta tttaactcag gcgcctggca acgcagatcc 541 tgtgctcatc atctttggct gcttttgtgg attattttga ttgggttggt tttatacatc 601 tccttggcca tcagaaaaag agtccaggag acaaagtttg ggaatgcatt cacagaggag 661 gattctgaat tagtggtgaa ttatatagca aagaaatcct tctgtcggcg agccattgaa 721 cttacccata gcttgggagt cagtgaggaa ctacaaaata aactagaaga tgttgtgatt 781 gacaggaatc ttctaattct tggaaaaatt ctgggtgaag gagagaaagg gaccgtgtat 841 gaaggactgt ggaatatccc cgaaggaaag gaagtaaaaa ttccagtagc aatcaagacc 901 ctaaaactgg acactatggc taataaagaa atacttgatg aagcaagtgt catgaaaggc 961 tttgggaacc cccacgtagt gcgactcctt gggatatgta tgacatctac aatatatgta 1021 attactgaat actgtctact ggtttataga aggaataaag ataaagctga acaacaccgg 1081 tcaaattgtg ccgagctaaa cccaccgctg cagacactat tgaagttcat ggtggatatt 1141 gccctgggaa tggagtatct gagcaacagg aattttcttc atcgagattt agctgctcga 1201 aactgcatgt tgcgagatga catgactgtc tgtgttgcgg acttcggcct ctctaagaag 1261 atttacagtg gcgattatta ccgccaaggc cgcattgcta agatgcctgt taaatggatc 1321 gccatagaaa gtcttgcaga ccgagtctac acaaaaagtg atgtgtgggc atttggcgtg 1381 accatgtggg aaatagctac gacgctgcgg ggaatgactc cctatcccgg agttcagaac 1441 catgagatgt acgactacct tctccacggc cacaggctga agcagcctcg aaccgcttgg 1501 aactgtacag aaatccgaat ccgactccta aaactaccaa tcctggggag tcgaacaatg 1561 cgtccaatga caatattttc aatggccacg cgactatcat cacccaaaac agcttggatg 1621 aactgtatga aaaaatgtac actagcaggg gaacccatac caaagactgg accgacattc 1681 tcggtactaa gactacagct ggaaaaattg ctagagagct taccggacgt tagaaatcaa 1741 gcagatgtaa tttatgttaa cactcaattg ctagagagcg aaggccttgc cagggttcat 1801 ccctgtagta ccgggctcga acaccatcca tgcagcgaac atcgaccgcg tccgcacctc 1861 tataattgct aa // LOCUS HUMSPROT 2530 bp mRNA PRI 02-SEP-1994 DEFINITION Human S protein mRNA, complete cds. ACCESSION L20815 NID g414809 KEYWORDS S gene; S protein; cell differentiation; keratin; keratinocyte differentiation; loricrin. SOURCE Homo sapiens (library: lambda gt10 (from Clontech)) neonate foreskin cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2530) AUTHORS Zhou,Y. and Chaplin,D.D. TITLE Identification in the HLA class I region of a gene expressed late in keratinocyte differentiation JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90 (20), 9470-9474 (1993) MEDLINE 94022396 FEATURES Location/Qualifiers source 1..2530 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="keratinocyte" /dev_stage="neonate" /germline /tissue_type="foreskin" /tissue_lib="lambda gt10 (from Clontech)" /map="6p21.3" CDS 63..1523 /standard_name="S protein" /note="epidermal granular cell layer-specific" /codon_start=1 /db_xref="PID:g414810" /translation="MLALLLAGLLLPGTLAKSIGTFSDPCKDPTRITSPNDPCLTGKG DSSGFSSYSGSSSSGSSISSARSSGGGSSGSSSGSSIAQGGSAGSFKPGTGYSQVSYS SGSGSSLQGASGSSQLGSSSSHSGSSGSHSGSSSSHSSSSSSFQFSSSSFQVGNGSAL PTNDNSYRGILNPSQPGQSSSSSQTSGVSSSGQSVSSNQRPCSSDIPDSPCSGGPIVS HSGPYIPSSHSVSGGQRPVVVVVDQHGSGAPGVVQGPPCSNGGLPGKPCPPITSVDKS YGGYEVVGGSSDSYLVPGMTYSKGKIYPVGYFTKENPVKGSPGVPSFAAGPPISEGKY FSSNPIIPSQSAASSAIAFQPVGTGGVQLCGGGSTGSKGPCSPSSSRVPSSSSISSSS GSPYHPCGSASQSPCSPPGTGSFSSSSSSQSSGKIILQPCGSKSSSSGHPCMSVSSLT LTGGPDGSPHPDPSAGAKPCGSSSAGKIPCRSIRIS" intron 99^100 /number=1 polyA_signal 2511..2516 polyA_site 2530 BASE COUNT 496 a 840 c 593 g 601 t ORIGIN 1 ccgtgcagtc cgagatgggc tcgtctcggg caccctggat ggggcgtgtg ggtgggcacg 61 ggatgttggc actgctgctg gctggtctcc tcctgccagg gaccttggct aagagcattg 121 gcaccttctc agacccctgt aaggacccca cgcgtatcac ctcccctaac gacccctgcc 181 tcactgggaa gggtgactcc agcggcttca gtagctacag tggctccagc agttctggca 241 gctccatttc cagtgccaga agctctggtg gtggctccag tggtagctcc agcggatcca 301 gcattgccca gggtggttct gcaggatctt ttaagccagg aacggggtat tcccaggtca 361 gctactcctc cggatctggc tctagtctac aaggtgcatc cggttcctcc cagctgggga 421 gcagcagctc tcactcggga agcagcggct ctcactcggg aagcagcagc tctcattcga 481 gcagcagcag cagctttcag ttcagcagca gcagcttcca agtagggaat ggctctgctc 541 tgccaaccaa tgacaactct taccgcggaa tactaaaccc ttcccagcct ggacaaagct 601 cttcctcttc ccaaacctct ggggtatcca gcagtggcca aagcgtcagc tccaaccagc 661 gtccctgtag ttcggacatc cccgactctc cctgcagtgg agggcccatc gtctcgcact 721 ctggccccta catccccagc tcccactctg tgtcaggggg tcagaggcct gtggtggtgg 781 tggtggacca gcacggttct ggtgcccctg gagtggttca aggtcccccc tgtagcaatg 841 gtggccttcc aggcaagccc tgtcccccaa tcacctctgt agacaaatcc tatggtggct 901 acgaggtggt gggtggctcc tctgacagtt atctggttcc aggcatgacc tacagtaagg 961 gtaaaatcta tcctgtgggc tacttcacca aagagaaccc tgtgaaaggc tctccagggg 1021 tcccttcctt tgcagctggg ccccccatct ctgagggcaa atacttctcc agcaacccca 1081 tcatccccag ccagtcggca gcttcctcgg ccattgcatt ccagccagtg gggactggtg 1141 gggtccagct ctgtggaggc ggctccacgg gctccaaggg accctgctct ccctccagtt 1201 ctcgagtccc cagcagttct agcatttcca gcagctccgg ttcaccctac catccctgcg 1261 gcagtgcttc ccagagcccc tgctccccac caggcaccgg ctccttcagc agcagctcca 1321 gttcccaatc gagtggcaaa atcatccttc agccttgtgg cagcaagtcc agctcttctg 1381 gtcacccttg catgtctgtc tcctccttga cactgactgg gggccccgat ggctctcccc 1441 atcctgatcc ctccgctggt gccaagccct gtggctccag cagtgctgga aagatcccct 1501 gccgctccat ccggatatcc tagcccaagt gaagcctctg gggccccagc tagctgaccc 1561 tgaagttttc ctaccccaag gagagttact cgacagtcca taagtcaact gttgtgtgtg 1621 tgcatgcctt gggcacaaac aagcacatac actatatccc atatgggaga aggccagtgc 1681 ccaggcatag ggttagctca gtttccctcc ttcccaaaag agtggttctg ctttctctac 1741 taccctaagg ttgcagactc tctcttatca ccccttcctc cttcctcttc tcaaaatggt 1801 agattcaaag ctcctctctt gattctctcc tactgtttaa attcccattc caccacagtg 1861 cccctcagcc agatcaccac cccttacaat tccctctact gtgttgaaat ggtccattga 1921 gtaacacccc catcaccttc tcaactggga aacccctgaa atgctctcag agcacctctg 1981 acgcctgaag aagttatacc ttcctcttcc cctttaccaa ataaagcaaa gtcaaaccat 2041 catctggaaa cagtggccac ttttcactga cctctcttcg acatctagtc aacccaccca 2101 atatgccact gggtttcgct cccaattcca ccccaccctc cattacagag ctcaccacgc 2161 cctcctagat caccgtcccc aacacaccca ttgcctctca aggcccttat ctcagcccct 2221 tcctgtggcc atttccctca gtgcccagat gattccctgg gtgagggaga cactggggca 2281 ccctcagagg ttggagcagg ctccctgctg tccctggatc ctggacagat ggctcagtaa 2341 actgtgggac taggtgcaga cttttgcctt cttggagtcc tgggtctcct ctgagaggtc 2401 tgggtggtgc tcctcctacg cctctagagg tctctgtgtt cctcattttc cttcaaaagc 2461 gggctgtatt tctcttctac cttccagctc ctcccacaga ggaggaagac aataaatatt 2521 tgttgaactg // LOCUS HUMSPTCS 1514 bp mRNA PRI 13-JAN-1995 DEFINITION Human T-cell-specific homodimer surface protein CD28 mRNA, complete cds. ACCESSION J02988 NID g338444 KEYWORDS T-cell-specific homodimer surface protein. SOURCE Human T-cell tumor line HPB-ALL, cDNA to mRNA, clone lambda-H3M. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1514) AUTHORS Aruffo,A. and Seed,B. TITLE Molecular cloning of a CD28 cDNA by a high-efficiency COS cell expression system JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84 (23), 8573-8577 (1987) MEDLINE 88068631 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by B.Seed, 11-AUG-1987. FEATURES Location/Qualifiers source 1..1514 /organism="Homo sapiens" /db_xref="taxon:9606" /map="2q33-q34" sig_peptide 100..153 /gene="CD28" /note="T-cell-specific homodimer surface protein signal peptide" CDS 100..762 /gene="CD28" /note="T-cell-specific homodimer surface protein precursor" /codon_start=1 /db_xref="GDB:G00-118-765" /db_xref="PID:g338445" /translation="MLRLLLALNLFPSIQVTGNKILVKQSPMLVAYDNAVNLSCKYSY NLFSREFRASLHKGLDSAVEVCVVYGNYSQQLQVYSKTGFNCDGKLGNESVTFYLQNL YVNQTDIYFCKIEVMYPPPYLDNEKSNGTIIHVKGKHLCPSPLFPGPSKPFWVLVVVG GVLACYSLLVTVAFIIFWVRSKRSRLLHSDYMNMTPRRPGPTRKHYQPYAPPRDFAAY RS" gene 100..762 /gene="CD28" mat_peptide 154..759 /gene="CD28" /note="T-cell-specific homodimer surface protein" BASE COUNT 404 a 360 c 337 g 413 t ORIGIN Unreported. 1 agactctcag gccttggcag gtgcgtcttt cagttcccct cacacttcgg gttcctcggg 61 gaggaggggc tggaacccta gcccatcgtc aggacaaaga tgctcaggct gctcttggct 121 ctcaacttat tcccttcaat tcaagtaaca ggaaacaaga ttttggtgaa gcagtcgccc 181 atgcttgtag cgtacgacaa tgcggtcaac cttagctgca agtattccta caatctcttc 241 tcaagggagt tccgggcatc ccttcacaaa ggactggata gtgctgtgga agtctgtgtt 301 gtatatggga attactccca gcagcttcag gtttactcaa aaacggggtt caactgtgat 361 gggaaattgg gcaatgaatc agtgacattc tacctccaga atttgtatgt taaccaaaca 421 gatatttact tctgcaaaat tgaagttatg tatcctcctc cttacctaga caatgagaag 481 agcaatggaa ccattatcca tgtgaaaggg aaacaccttt gtccaagtcc cctatttccc 541 ggaccttcta agcccttttg ggtgctggtg gtggttggtg gagtcctggc ttgctatagc 601 ttgctagtaa cagtggcctt tattattttc tgggtgagga gtaagaggag caggctcctg 661 cacagtgact acatgaacat gactccccgc cgccccgggc ccacccgcaa gcattaccag 721 ccctatgccc caccacgcga cttcgcagcc tatcgctcct gacacggacg cctatccaga 781 agccagccgg ctggcagccc ccatctgctc aatatcactg ctctggatag gaaatgaccg 841 ccatctccag ccggccacct cagcccctgt tgggccacca atgccaattt ttctcgagtg 901 actagaccaa atatcaagat cattttgaga ctctgaaatg aagtaaaaga gatttcctgt 961 gacaggccaa gtcttacagt gccatggccc acattccaac ttaccatgta cttagtgact 1021 tgactgagaa gttagggtag aaaacaaaaa gggagtggat tctgggagcc tcttcccttt 1081 ctcactcacc tgcacatctc agtcaagcaa agtgtggtat ccacagacat tttagttgca 1141 gaagaaaggc taggaaatca ttccttttgg ttaaatgggt gtttaatctt ttggttagtg 1201 ggttaaacgg ggtaagttag agtaggggga gggataggaa gacatattta aaaaccatta 1261 aaacactgtc tcccactcat gaaatgagcc acgtagttcc tatttaatgc tgttttcctt 1321 tagtttagaa atacatagac attgtctttt atgaattctg atcatattta gtcattttga 1381 ccaaatgagg gatttggtca aatgagggat tccctcaaag caatatcagg taaaccaagt 1441 tgctttcctc actccctgtc atgagacttc agtgttaatg ttcacaatat actttcgaaa 1501 gaataaaata gttc // LOCUS HUMSRAA 538 bp mRNA PRI 13-JAN-1995 DEFINITION Human ribosomal protein S16 mRNA, complete cds. ACCESSION M60854 NID g338446 KEYWORDS ribosomal protein S16. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 538) AUTHORS Batra,S.K., Metzgar,R.S. and Hollingsworth,M.A. TITLE Molecular cloning and sequence analysis of the human ribosomal protein S16 JOURNAL J. Biol. Chem. 266 (11), 6830-6833 (1991) MEDLINE 91201326 FEATURES Location/Qualifiers source 1..538 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Unassigned" gene 38..478 /gene="RPS16" CDS 38..478 /gene="RPS16" /codon_start=1 /db_xref="GDB:G00-127-871" /db_xref="PID:g338447" /translation="MPSKGPLQSVQVFGRKKTATAVAHCKRGNGLIKVNGRPLEMIEP RTLQYKLLEPVLLLGKERFAGVDIRVRVKGGGHVAQIYAIRQSISKALVAYYQKYVDE ASKKEIKDILIQYDRTLLVADPRRCESKKFGGPGARARYQKSYR" BASE COUNT 123 a 143 c 153 g 119 t ORIGIN 1 gcgccgcggt gaggttgtct agtccacgct cggagccatg ccgtccaagg gtccgctgca 61 gtcggtgcag gtcttcggac gcaagaagac agcgacagct gtggcgcact gcaaacgcgg 121 caatggtctc atcaaggtga acgggcggcc cctggagatg attgagccgc gcacgctaca 181 gtacaagctg ctggagccag ttctgcttct cggcaaggag cgatttgctg gtgtagacat 241 ccgtgtccgt gtaaagggtg gtggtcacgt ggcccagatt tatgctatcc gtcagtccat 301 ctccaaagcc ctggtggcct attaccagaa atatgtggat gaggcttcca agaaggagat 361 caaagacatc ctcatccagt atgaccggac cctgctggta gctgaccctc gtcgctgcga 421 gtccaaaaag tttggaggcc ctggtgcccg cgctcgctac cagaaatcct accgataagc 481 ccatcgtgac tcaaaactca cttgtataat aaacagtttt tgagggattt taaagttt // LOCUS HUMSRCPT1F 1141 bp DNA PRI 26-FEB-1993 DEFINITION Homo sapiens serotonin receptor (HTR1F) gene, complete cds. ACCESSION L04962 NID g338464 KEYWORDS . SOURCE Homo sapiens (library: Stratagene; lambda DASHII) lymphocyte DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1141) AUTHORS Adham,N., Kao,H.-T., Schechter,L.E., Bard,J.A., Olsen,M., Urquhart,D., Durkin,M., Hartig,P.R., Weinshank,R.L. and Branchek,T.A. TITLE Cloning of another human serotonin receptor (5-HT+(sub-1F): A fifth 5-HT1 receptor subtype coupled to the inhibition of adenylate cyclase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90, 408-412 (1993) MEDLINE 93133800 FEATURES Location/Qualifiers source 1..1141 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /tissue_type="lymphocyte" /tissue_lib="Stratagene; lambda DASHII" 5'UTR 1..26 CDS 27..1127 /standard_name="5-HT1F" /codon_start=1 /product="serotonin receptor" /db_xref="PID:g338465" /translation="MDFLNSSDQNLTSEELLNRMPSKILVSLTLSGLALMTTTINSLV IAAIIVTRKLHHPANYLICSLAVTDFLVAVLVMPFSIVYIVRESWIMGQVVCDIWLSV DITCCTCSILHLSAIALDRYRAITDAVEYARKRTPKHAGIMITIVWIISVFISMPPLF WRHQGTSRDDECIIKHDHIVSTIYSTFGAFYIPLALILILYYKIYRAAKTLYHKRQAS RIAKEEVNGQVLLESGEKSTKSVSTSYVLEKSLSDPSTDFDKIHSTVRSLRSEFKHEK SWRRQKISGTRERKAATTLGLILGAFVICWLPFFVKELVVNVCDKCKISEEMSNFLAW LGYLNSLINPLIYTIFNEDFKKAFQKLVRCRC" 3'UTR 1128..1141 BASE COUNT 340 a 223 c 231 g 347 t ORIGIN 1 tatattaatc ttttaaaaca aagaaaatgg atttcttaaa ttcatctgat caaaacttga 61 cctcagagga actgttaaac agaatgccat ccaaaattct ggtgtccctc actctgtctg 121 ggctggcact gatgacaaca actatcaact cccttgtgat cgctgcaatt attgtgaccc 181 ggaagctgca ccatccagcc aattatttaa tttgttccct tgcagtcaca gattttcttg 241 tggctgtcct ggtgatgccc ttcagcattg tgtatattgt gagagagagc tggattatgg 301 ggcaagtggt ctgtgacatt tggctgagtg ttgacattac ctgctgcacg tgctccatct 361 tgcatctctc agctatagct ttggatcggt atcgagcaat cacagatgct gttgagtatg 421 ccaggaaaag gactccaaag catgctggca ttatgattac aatagtttgg attatatctg 481 tttttatctc tatgcctcct ctattctgga ggcaccaagg aactagcaga gatgatgaat 541 gcatcatcaa gcacgaccac attgtttcca ccatttactc aacatttgga gctttctaca 601 tcccactggc attgattttg atcctttact acaaaatata tagagcagca aagacattat 661 accacaagag acaagcaagt aggattgcaa aggaggaggt gaatggccaa gtccttttgg 721 agagtggtga gaaaagcact aaatcagttt ccacatccta tgtactagaa aagtctttat 781 ctgacccatc aacagacttt gataaaattc atagcacagt gagaagtctc aggtctgaat 841 tcaagcatga gaaatcttgg agaaggcaaa agatctcagg tacaagagaa cggaaagcag 901 ccactaccct gggattaatc ttgggtgcat ttgtaatatg ttggcttcct ttttttgtaa 961 aagaattagt tgttaatgtc tgtgacaaat gtaaaatttc tgaagaaatg tccaattttt 1021 tggcatggct tgggtatctc aattccctta taaatccact gatttacaca atctttaatg 1081 aagacttcaa gaaagcattc caaaagcttg tgcgatgtcg atgttagttt taaaaatgtt 1141 t // LOCUS HUMSRDA 2437 bp mRNA PRI 13-JAN-1995 DEFINITION Human steroid 5-alpha-reductase 2 (SRD5A2) mRNA, complete cds. ACCESSION M74047 NID g338468 KEYWORDS androgen; dihydrotestosterone; steroid 5-alpha-reductase 2. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2437) AUTHORS Andersson,S., Berman,D.M., Jenkins,E.P. and Russell,D.W. TITLE Deletion of steroid 5 alpha-reductase 2 gene in male pseudohermaphroditism JOURNAL Nature 354 (6349), 159-161 (1991) MEDLINE 92049782 FEATURES Location/Qualifiers source 1..2437 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Unassigned" gene 28..792 /gene="SRD5A2" CDS 28..792 /gene="SRD5A2" /EC_number="1.3.99.5" /codon_start=1 /db_xref="GDB:G00-127-343" /product="steroid 5-alpha-reductase 2" /db_xref="PID:g338469" /translation="MQVQCQQSPVLAGSATLVALGALALYVAKPSGYGKHTESLKPAA TRLPARAAWFLQELPSFAVPAGILARQPLSLFGPPGTVLLGLFCVHYFHRTFVYSLLN RGRPYPAILILRGTAFCTGNGVLQGYYLIYCAEYPDGWYTDIRFSLGVFLFILGMGIN IHSDYILRQLRKPGEISYRIPQGGLFTYVSGANFLGEIIEWIGYALATWSLPALAFAF FSLCFLGLRAFHHHRFYLKMFEDYPKSRKALIPFIF" polyA_site 2437 /gene="SRD5A2" /note="G00-127-343" BASE COUNT 673 a 557 c 531 g 676 t ORIGIN 1 gcggccaccg gcgaggaaca cggcgcgatg caggttcagt gccagcagag cccagtgctg 61 gcaggcagcg ccactttggt cgcccttggg gcactggcct tgtacgtcgc gaagccctcc 121 ggctacggga agcacacgga gagcctgaag ccggcggcta cccgcctgcc agcccgcgcc 181 gcctggttcc tgcaggagct gccttccttc gcggtgcccg cggggatcct cgcccggcag 241 cccctctccc tcttcgggcc acctgggacg gtacttctgg gcctcttctg cgtacattac 301 ttccacagga catttgtgta ctcactgctc aatcgaggga ggccttatcc agctatactc 361 attctcagag gcactgcctt ctgcactgga aatggagtcc ttcaaggcta ctatctgatt 421 tactgtgctg aataccctga tgggtggtac acagacatac ggtttagctt gggtgtcttc 481 ttatttattt tgggaatggg aataaacatt catagtgact atatattgcg ccagctcagg 541 aagcctggag aaatcagcta caggattcca caaggtggct tgtttacgta tgtttctgga 601 gccaatttcc tcggtgagat cattgaatgg atcggctatg ccctggccac ttggtccctc 661 ccagcacttg catttgcatt tttctcactt tgtttccttg ggctgcgagc ttttcaccac 721 cataggttct acctcaagat gtttgaggac taccccaaat ctcggaaagc ccttattcca 781 ttcatctttt aaaggaacca aattaaaaag gagcagagct cccacaatgc tgatgaaaac 841 tgtcaagctg ctgaaactgt aattttcatg atataatagt catatatata tatatatata 901 tatatatata tatatatatg tatatatgta atagtaggtc tcctggcgtt ctgccagctg 961 gcctggggat tctgagtggt gtctgcttag agtttactcc tacccttcca gggaccccta 1021 tcctgatccc caactgaagc ttcaaaaagc cacttttcca aatggcgaca gttgcttctt 1081 agctattgct ctgagaaagt acaaacttct cctatgtctt tcaccgggca atccaagtac 1141 atgtggcttc atacccactc cctgtcaatg caggacaact ctgtaatcaa gaattttttg 1201 acttgaaggc agtacttata gaccttatta aaggtatgca ttttatacat gtaacagagt 1261 agcagaaatt taaactctga agccacaaag acccagagca aacccactcc caaatgaaaa 1321 ccccagtcat ggcttccttt ttcttggtta attaggaaag atgagaaatt attaggtaga 1381 ccttgaatac aggagccctc tcctcatagt gctgaaaaga tactgatgca ttgacctcat 1441 ttcaaatttg tgcagtgtct tagttgatga gtgcctctgt tttccagaag atttcacaat 1501 ccccggaaaa ctggtatggc tattcttgaa ggccaggttt taataaccac aaacaaaaag 1561 gcatgaacct gggtggctta tgagagagta gagaacaaca tgaccctgga tggctactaa 1621 gaggatagag aacagtttta caatagacat tgcaaactct catgtttttg gaaactggtg 1681 gcaatatcca aataatgagt agtgtaaaac aaagagaatt aatgatgagg ttacatgctg 1741 cttgcctcca ccagatgtcc acaacaatat gaagtacagc agaagcccca agcaactttc 1801 ctttcctgga gcttcttcct tgtagttctc aggacctgtt caagaaggtg tctcctaggg 1861 gcagcctgaa tgcctccctc aaaggacctg caggcagaga ctgaaaattg cagacagagg 1921 ggcacgtctg ggcagaaaac ctgttttgtt tggctcagac atatagtttt ttttttttta 1981 caaagtttca aaaacttaaa aatcaggaga ttccttcata aaactctagc attctagttt 2041 catttaaaaa gttggaggat ctgaacatac agagcccaca tttccacacc agaactggaa 2101 ctacgtagct agtaagcatt tgagtttgca aactcttgtg aaggggtcac cccagcatga 2161 gtgctgagat atggactctc taaggaaggg gccgaacgct tgtaattgga atacatggaa 2221 atatttgtct tctcaggcct atgtttgcgg aatgcattgt caatatttag caaactgttt 2281 tgacaaatga gcaccagtgg tactaagcac agaaactcac tatataagtc acataggaaa 2341 cttgaaaggt ctgaggatga tgtagattac tgaaaaatac aaattgcaat catataaata 2401 agtgtttttg ttgttcatta aataccttta aatcatg // LOCUS HUMSRF 4201 bp mRNA PRI 15-JUN-1989 DEFINITION Human serum response factor (SRF) mRNA, complete cds. ACCESSION J03161 NID g338479 KEYWORDS serum response factor. SOURCE Human HeLa cell line, cDNA to mRNA, clones lambda[2.9,451.25, 454.9,H9]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4201) AUTHORS Norman,C., Runswick,M., Pollock,W.B.R. and Treisman,R. TITLE Isolation and properties of cDNA clones encoding SRF, a transcription factor that binds to the c-fos serum response element JOURNAL Cell 55, 989-1003 (1988) MEDLINE 89077555 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by R.Treisman, 23-JAN-1989. FEATURES Location/Qualifiers source 1..4201 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..4201 /note="SRF mRNA" CDS 359..1885 /note="serum response factor" /codon_start=1 /db_xref="PID:g338480" /translation="MLPTQAGAAAALGRGSALGGSLNRTPTGRPGGGGGTRGANGGRV PGNGAGLGPGRLEREAAAAAATTPAPTAGALYSGSEGDSESGEEEELGAERRGLKRSL SEMEIGMVVGGPEASAAATGGYGPVSGAVSGAKPGKKTRGRVKIKMEFIDNKLRRYTT FSKRKTGIMKKAYELSTLTGTQVLLLVASETGHVYTFATRKLQPMITSETGKALIQTC LNSPDSPPRSDPTTDQRMSATGFEETDLTYQVSESDSSGETKDTLKPAFTVTNLPGTT STIQTAPSTSTTMQVSSGPSFPITNYLAPVSASVSPSAVSSANGTVLKSTGSGPVSSG GLMQLPTSFTLMPGGAVAQQVPVQAIQVHQAPQQASPSRDSSTDLTQTSSSGTVTLPA TIMTSSVPTTVGGHMMYPSPHAVMYAPTSGLGDGSLTVLNAFSQAPSTMQVSHSQVQE PGGVPQVFLTASSGTVQIPVSAVQLHQMAVIGQQAGSSSNLTELQVVNLDTAHSTKSE " BASE COUNT 794 a 1264 c 1296 g 847 t ORIGIN 8 bp upstream of BamHI site. 1 ggtcggggga tccctccgcc gccagcgcgt ggtcccggcc ccctccaccc gccgtctcgg 61 ccgcggccag cagcccctgc cccccggggg acgctgacgg ccgcccggcg cgccgcccta 121 gcagacggac agggggcgct gcgcgcggcc tggggcaacc cgggccacag gggcaggaaa 181 gtgagggccc aggtcggccc gggcgtgcag gggccccggg ttcgcagcgg cggccgcggc 241 agcgatagcg gcactagcag cagcgggagt gccgggttga gccgggaagc cgatggcggc 301 ggctgcggcg gctccgattc ctcgctgact gcccgtccgc cctcctgcat cgagcgccat 361 gttaccgacc caagctgggg ccgcggcggc tctgggccgg ggctcggccc tggggggcag 421 cctgaaccgg accccgacgg ggcggccggg cggcggcggc gggacacgcg gggctaacgg 481 gggccgggtc cccgggaatg gcgcggggct cgggcccggc cgcctggagc gggaggctgc 541 ggcagcggcg gcaaccaccc cggcgcccac cgcgggggcc ctctacagcg gcagcgaggg 601 cgactcggag tcgggcgagg aggaggagct gggcgccgag cggcgcggcc tgaagcggag 661 cctgagcgag atggagatcg gtatggtggt cggtgggccc gaggcgtcgg cagcggccac 721 cgggggctac gggccggtga gcggcgcggt gagcggggcc aagccgggta agaagacccg 781 gggccgcgtg aagatcaaga tggagttcat cgacaacaag ctgcggcgct acacgacctt 841 cagcaagagg aagacgggca tcatgaagaa ggcctatgag ctgtccacgc tgacagggac 901 acaggtgctg ttgctggtgg ccagtgagac aggccatgtg tatacctttg ccacccgaaa 961 actgcagccc atgatcacca gtgagaccgg caaggcactg attcagacct gcctcaactc 1021 gccagactct ccaccccgtt cagaccccac aacagaccag agaatgagtg ccactggctt 1081 tgaagagaca gatctcacct accaggtgtc ggagtctgac agcagtgggg agaccaagga 1141 cacactgaag ccggcgttca cagtcaccaa cctgccgggt acaacctcca ccatccaaac 1201 agcacctagc acctctacca ccatgcaagt cagcagcggc ccctcctttc ccatcaccaa 1261 ctacctggca ccagtgtctg ctagtgtcag ccccagtgct gtcagcagtg ccaatgggac 1321 tgtgctgaag agtacaggca gcggccctgt ctcctctggg ggccttatgc agctgcctac 1381 cagcttcacc ctcatgcctg gtggggcagt ggcccagcag gtcccagtgc aggccattca 1441 agtgcaccag gccccacagc aagcgtctcc ctcccgtgac agcagcacag acctcacgca 1501 gacctcctcc agcgggacag tgacgctgcc cgccaccatc atgacgtcat ccgtgcccac 1561 aactgtgggt ggccacatga tgtaccctag cccgcatgcg gtgatgtatg cccccacctc 1621 gggcctgggt gatggcagcc tcaccgtgct gaatgccttc tcccaggcac catccaccat 1681 gcaggtgtca cacagccagg tccaggagcc aggtggcgtc ccccaggtgt tcctgacagc 1741 atcatctggg acagtgcaga tccctgtttc agcagttcag ctccaccaga tggctgtgat 1801 agggcagcag gccgggagca gcagcaacct caccgagcta caggtggtga acctggacac 1861 cgcccacagc accaagagtg aatgatccgc ccgccgccct ggacagatgg cccaagggat 1921 ggcaccactt atttattgtt gccttttcac gttttcttta cacacacgtt gacgggccgc 1981 aggagggagg cggggaggag gaacgggcag ccacaggact gagccctctc actccagcca 2041 aagaaatggg cctgcctgcc tccacccgtc ctccctcagc ctccccttct tcccgcccca 2101 cctcccattt ctgttgctgg aggggctgtc ctccttcctg ggaccccctc gccagcttgg 2161 ctcgatgttt gccatgagta ttagcttacc caatgggacc gtgccccacc tccccacaca 2221 caggccttct gtggggctgg gcaccgtgtc ctcctctgag gaagcagttg gggccctctt 2281 gccagcctcc ttgctgaccc caggtcagcc ctgtgtctgt cacaggctgg gtcaaaagag 2341 ccctggctct gcccctcagg gggccagctg gggagatggg ggcttcttcc tcacactgct 2401 gtcctctccc ccttcagctc ctgagtagct gggcctgtgc actgggcagg ttcctggggc 2461 cgcctgccct gccttgccgc tccccttgga cctccagggg ctcctgggtt ggagggaacc 2521 accagcgttc ccttctcccc cttgtcttcc cccctctcct cccagctgct ttacttaaag 2581 ttgattttga actttttatt tgaggagacg aagtgaaaac aaatctataa atatatattt 2641 ttaaaatatt taactttttt ttatggcgtt tttctcgtcc ccctccctgc ccaaactccc 2701 cttccctggg gagccctcag gctccccaga actggctggg cccctgggga cagagccacc 2761 ccatgagctc ggggtccacc agtgtgtggg ggagattctg ggtttgccca gtcctggatt 2821 gtttccagga gaaagccggg ggaggggccc tcaggccatt ccccaacggg gtggggaggg 2881 tgacccacag ctctgggcct ctttttgccc tttagggctg ttgctaggga gagggaagag 2941 ggagaccaaa tgtcggggtt ggggtgggag ggcgtcaggc agaggcaact gacttcattt 3001 gtgccacacg catgggcatt gcagccttgc gctgtcccag gcatgcagct gcctggggcc 3061 caagttgcag tgagcagggt ggggtctggg agggggtgag aggcaggaat gggggtcaga 3121 agaagtggga gcagcttctt gggctgagtg cagccaaagg ggagccagaa atgggcagtt 3181 ctcccaggga gtgagcagct actgtaactt ttttaaatta agacaaaaag ccttgaagaa 3241 aatgacttta tttttctaag tgtaacctca gtatttatgt aatttgtaca ggggccatgc 3301 cccacccccc tcctccccct ttggggtaga ccttgagggt gggccagcat aggggggagg 3361 gtcttttacc ctgtgtcaga gcctaccttc accacctata tccagaaggg gagctttttc 3421 agaaacaggg cagcagtggg gtgaaatttt cttaacccct aagactgcct tcagtaggaa 3481 caagctggct tctgtgatta ggtgaaggga tgggggaaga ttttatgcac agcctagtta 3541 tcaaggggat gatttgccga catgtttgag aaccccctaa cctctaaccc tcattgctgt 3601 cttgccccag tttggggtgc caagatggaa gtcacctttc tgggctttct cctggagact 3661 agctggggct tatgggtggc tttcaaggct ggggcatggc aaatcagggg ccagagagca 3721 ggggagcttg ggactcaggt ctgtaactgc ccagcccctt ttctctgctc ttgtttcact 3781 ccaccatcac tcactcactc cccactcccc cacccatggg gaggagacct ttgatgaatt 3841 cttcctctcc ttcccacaaa agacagaccc agtgagtgaa tcaggcaaag tgcttataat 3901 gtgtgttgtg tgagcgtggc cttgggagga catgcgtgtg tcagggatga gttgaggtga 3961 tatttttatg tgcagcgacc cttggtgttt cccttcctcg gtggctctgg ggtatgtgtg 4021 tgtgggtgtg tgcgcctgag tgagtgtgtg tgcttgaatg tgagtgtgta tgtcagtggt 4081 ttctacttcc cctgggatgc tgacccagga atagtggaca tggtcacagt cctatgtaca 4141 gagctttctt ttgtattaaa aaaaaatact ctttcaataa atgtatcatt tttgtgcaca 4201 g // LOCUS HUMSRI1A 1634 bp DNA PRI 29-DEC-1994 DEFINITION Human somatostatin receptor isoform 1 gene, complete cds. ACCESSION M81829 NID g307433 KEYWORDS somatostatin receptor. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1634) AUTHORS Yamada,Y., Post,S.R., Wang,K., Tager,H.S., Bell,G.I. and Seino,S. TITLE Cloning and functional characterization of a family of human and mouse somatostatin receptors expressed in brain, gastrointestinal tract, and kidney JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (1), 251-255 (1992) MEDLINE 92108031 COMMENT genomic sequence; gene lacks introns. FEATURES Location/Qualifiers source 1..1634 /organism="Homo sapiens" /db_xref="taxon:9606" gene 100..1275 /gene="SSTR2" CDS 100..1275 /gene="SSTR2" /codon_start=1 /db_xref="GDB:G00-134-186" /product="somatostatin receptor isoform 1" /db_xref="PID:g307434" /translation="MFPNGTASSPSSSPSPSPGSCGEGGGSRGPGAGAADGMEEPGRN ASQNGTLSEGQGSAILISFIYSVVCLVGLCGNSMVIYVILRYAKMKTATNIYILNLAI ADELLMLSVPFLVTSTLLRHWPFGALLCRLVLSVDAVNMFTSIYCLTVLSVDRYVAVV HPIKAARYRRPTVAKVVNLGVWVLSLLVILPIVVFSRTAANSDGTVACNMLMPEPAQR WLVGFVLYTFLMGFLLPVGAICLCYVLIIAKMRMVALKAGWQQRKRSERKITLMVMMV VMVFVICWMPFYVVQLVNVFAEQDDATVSQLSVILGYANSCANPILYGFLSDNFKRSF QRILCLSWMDNAAEEPVDYYATALKSRAYSVEDFQPENLESGGVFRNGTCTSRITTL" BASE COUNT 283 a 513 c 495 g 343 t ORIGIN 1 ctgcaggcaa gcggtcgggt ggggagggag ggcgcaggcg gcgggtgcgc gaggagaaag 61 ccccagccct ggcagcccca ctggcccccc tcagctggga tgttccccaa tggcaccgcc 121 tcctctcctt cctcctctcc tagccccagc ccgggcagct gcggcgaagg cggcggcagc 181 aggggccccg gggccggcgc tgcggacggc atggaggagc cagggcgaaa tgcgtcccag 241 aacgggacct tgagcgaggg ccagggcagc gccatcctga tctctttcat ctactccgtg 301 gtgtgcctgg tggggctgtg tgggaactct atggtcatct acgtgatcct gcgctatgcc 361 aagatgaaga cggccaccaa catctacatc ctaaatctgg ccattgctga tgagctgctc 421 atgctcagcg tgcccttcct agtcacctcc acgttgttgc gccactggcc cttcggtgcg 481 ctgctctgcc gcctcgtgct cagcgtggac gcggtcaaca tgttcaccag catctactgt 541 ctgactgtgc tcagcgtgga ccgctacgtg gccgtggtgc atcccatcaa ggcggcccgc 601 taccgccggc ccaccgtggc caaggtagta aacctgggcg tgtgggtgct atcgctgctc 661 gtcatcctgc ccatcgtggt cttctctcgc accgcggcca acagcgacgg cacggtggct 721 tgcaacatgc tcatgccaga gcccgctcaa cgctggctgg tgggcttcgt gttgtacaca 781 tttctcatgg gcttcctgct gcccgtgggg gctatctgcc tgtgctacgt gctcatcatt 841 gctaagatgc gcatggtggc cctcaaggcc ggctggcagc agcgcaagcg ctcggagcgc 901 aagatcacct taatggtgat gatggtggtg atggtgtttg tcatctgctg gatgcctttc 961 tacgtggtgc agctggttaa cgtgtttgct gagcaggacg acgccacggt gagtcagctg 1021 tcggtcatcc tcggctatgc caacagctgc gccaacccca tcctctatgg ctttctctca 1081 gacaacttca agcgctcttt ccaacgcatc ctatgcctca gctggatgga caacgccgcg 1141 gaggagccgg ttgactatta cgccaccgcg ctcaagagcc gtgcctacag tgtggaagac 1201 ttccaacctg agaacctgga gtccggcggc gtcttccgta atggcacctg cacgtcccgg 1261 atcacgacgc tctgagcccg ggccacgcag gggctctgag cccgggccac gcaggggccc 1321 tgagccaaaa gagggggaga atgagaaggg aaggccgggt gcgaaaggga cggtatccag 1381 ggcgccaggg tgctgtcggg ataacgtggg gctaggacac tgacagcctt tgatggagga 1441 acccaagaaa ggcgcgcgac aatggtagaa gtgagagctt tgcttataaa ctgggaaggc 1501 tttcaggcta cctttttctg ggtctcccac tttctgttcc ttcctccact gcgcttgctc 1561 ctctgaccct ccttctattt tccccaccct gcaacttcta tcctttcttc cgcaccgtcc 1621 cgccagtgca gatc // LOCUS HUMSRI2A 1351 bp DNA PRI 29-DEC-1994 DEFINITION Human somatostatin receptor isoform 2 (SSTR2) gene, complete cds. ACCESSION M81830 NID g307435 KEYWORDS somatostatin receptor. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1351) AUTHORS Yamada,Y., Post,S.R., Wang,K., Tager,H.S., Bell,G.I. and Seino,S. TITLE Cloning and functional characterization of a family of human and mouse somatostatin receptors expressed in brain, gastrointestinal tract, and kidney JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (1), 251-255 (1992) MEDLINE 92108031 COMMENT genomic sequence; gene lacks introns. FEATURES Location/Qualifiers source 1..1351 /organism="Homo sapiens" /db_xref="taxon:9606" /map="17q24" gene 83..1192 /gene="SSTR2" CDS 83..1192 /gene="SSTR2" /codon_start=1 /db_xref="GDB:G00-134-186" /product="somatostatin receptor isoform 2" /db_xref="PID:g307436" /translation="MDMADEPLNGSHTWLSIPFDLNGSVVSTNTSNQTEPYYDLTSNA VLTFIYFVVCIIGLCGNTLVIYVILRYAKMKTITNIYILNLAIADELFMLGLPFLAMQ VALVHWPFGKAICRVVMTVDGINQFTSIFCLTVMSIDRYLAVVHPIKSAKWRRPRTAK MITMAVWGVSLLVILPIMIYAGLRSNQWGRSSCTINWPGESGAWYTGFIIYTFILGFL VPLTIICLCYLFIIIKVKSSGIRVGSSKRKKSEKKVTRMVSIVVAVFIFCWLPFYIFN VSSVSMAISPTPALKGMFDFVVVLTYANSCANPILYAFLSDNFKKSFQNVLCLVKVSG TDDGERSDSKQDKSRLNETTETQRTLLNGDLQTSI" BASE COUNT 307 a 375 c 333 g 336 t ORIGIN 1 ggatccttgg cctccagggt ccattaaggt gagaataaga tctctgggct ggctggaact 61 agcctaagac tgaaaagcag ccatggacat ggcggatgag ccactcaatg gaagccacac 121 atggctatcc attccatttg acctcaatgg ctctgtggtg tcaaccaaca cctcaaacca 181 gacagagccg tactatgacc tgacaagcaa tgcagtcctc acattcatct attttgtggt 241 ctgcatcatt gggttgtgtg gcaacacact tgtcatttat gtcatcctcc gctatgccaa 301 gatgaagacc atcaccaaca tttacatcct caacctggcc atcgcagatg agctcttcat 361 gctgggtctg cctttcttgg ctatgcaggt ggctctggtc cactggccct ttggcaaggc 421 catttgccgg gtggtcatga ctgtggatgg catcaatcag ttcaccagca tcttctgcct 481 gacagtcatg agcatcgacc gatacctggc tgtggtccac cccatcaagt cggccaagtg 541 gaggagaccc cggacggcca agatgatcac catggctgtg tggggagtct ctctgctggt 601 catcttgccc atcatgatat atgctgggct ccggagcaac cagtggggga gaagcagctg 661 caccatcaac tggccaggtg aatctggggc ttggtacaca gggttcatca tctacacttt 721 cattctgggg ttcctggtac ccctcaccat catctgtctt tgctacctgt tcattatcat 781 caaggtgaag tcctctggaa tccgagtggg ctcctctaag aggaagaagt ctgagaagaa 841 ggtcacccga atggtgtcca tcgtggtggc tgtcttcatc ttctgctggc ttcccttcta 901 catattcaac gtttcttccg tctccatggc catcagcccc accccagccc ttaaaggcat 961 gtttgacttt gtggtggtcc tcacctatgc taacagctgt gccaacccta tcctatatgc 1021 cttcttgtct gacaacttca agaagagctt ccagaatgtc ctctgcttgg tcaaggtgag 1081 cggcacagat gatggggagc ggagtgacag taagcaggac aaatcccggc tgaatgagac 1141 cacggagacc cagaggaccc tcctcaatgg agacctccaa accagtatct gaactgcttg 1201 gggggtggga aagaaccaag ccatgctctg tctactggca atgggctccc tacccacact 1261 ggcttcctgc ctcccacccc tcacacctgg cttctagaat agaggattgc tcagcatgag 1321 tccaattaga gaacggtgtt tgagtcagct t // LOCUS HUMSRICBP 793 bp mRNA PRI 14-MAR-1996 DEFINITION Human sorcin (SRI) mRNA, complete cds. ACCESSION L12387 NID g459835 KEYWORDS calcium-binding protein; sorcin. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 793) AUTHORS Wang,S.L., Tam,M.F., Ho,Y.S., Pai,S.H. and Kao,M.C. TITLE Isolation and molecular cloning of human sorcin a calcium-binding protein in vincristine-resistant HOB1 lymphoma cells JOURNAL Biochim. Biophys. Acta 1260 (3), 285-293 (1995) MEDLINE 95178548 FEATURES Location/Qualifiers source 1..793 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HOB1/VCR1.0" /cell_type="vincristine-resistant lymphoma cell" gene 5..601 /gene="SRI" CDS 5..601 /gene="SRI" /codon_start=1 /product="sorcin" /db_xref="PID:g459836" /db_xref="GDB:G00-118-880" /translation="MAYPGHPGAGGGYYPGGYGGAPGGPAFPGQTQDPLYGYFAAVAG QDGQIDADELQRCLTQSGIAGGYKPFNLETCRLMVSMLDRDMSGTMGFNEFKELWAVL NGWRQHFISFDTDRSGTVDPQELQKALTTMGFRLSPQAVNSIAKRYSTNGKITFDDYI ACCVKLRALTDSFRRRDTAQQGVVNFPYDDFIQCVMSV" BASE COUNT 211 a 161 c 197 g 224 t ORIGIN 1 tagcatggcg tacccggggc atcctggcgc cggcggcggg tactacccag gcgggtatgg 61 aggggctccc ggagggcctg cgtttcccgg acaaactcag gatccgctgt atggttactt 121 tgctgctgta gctggacagg atgggcagat agatgctgat gaattgcaga gatgtctgac 181 acagtctggc attgctggag gatacaaacc ttttaacctg gagacttgcc ggcttatggt 241 ttcaatgctg gatagagata tgtctggcac aatgggtttc aatgaattta aagaactctg 301 ggctgtactg aatggctgga gacaacactt tatcagtttt gatactgaca ggagtggaac 361 agtagaccca caagaattgc agaaggccct gacaacaatg ggatttaggt tgagtcccca 421 ggctgtgaat tcaattgcaa aacgatacag caccaatgga aagatcacct tcgacgacta 481 catcgcctgc tgcgtcaaac tgagggctct tacagacagc tttcgaagac gggatactgc 541 tcagcaaggt gttgtgaatt tcccatatga tgatttcatt caatgtgtca tgagtgttta 601 aatcaagagg aagctgcatg aatgtaatca acattccaac tggagctctc ctttgcttgt 661 cctctttgcc ttcggtaata tgtataaact tacatcacga ctttctctta acagctgttg 721 taaagtttat tactttatgt acaactgaag ttttgtttta gttttgataa taaattcttt 781 ggaactttaa aaa // LOCUS HUMSRP20 528 bp mRNA PRI 26-FEB-1993 DEFINITION Homo sapiens SR protein family, pre-mRNA splicing factor (SRp20) mRNA, complete cds. ACCESSION L10838 NID g338483 KEYWORDS RNA-binding protein; pre-mRNA splicing factor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 528) AUTHORS Zahler,A.M., Lane,W.S., Stolk,J.A. and Roth,M.B. TITLE SR proteins: a conserved family of pre-mRNA splicing factors JOURNAL Genes Dev. 6, 837-847 (1992) MEDLINE 92249775 FEATURES Location/Qualifiers source 1..528 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" CDS 1..495 /standard_name="SRp20" /note="SR protein family member; SR domain: (bp. 313..495); RNA binding domains: RNP-1 (bp. 34..52) and RNP-2 (bp. 136..162)" /codon_start=1 /product="pre-mRNA splicing factor" /db_xref="PID:g338484" /translation="MHRDSCPLDCKVYVGNLGNNGNKTELERAFGYYGPLRSVWVARN PPGFAFVEFEDPRDAADAVRELDGRTLCGCRVRVELSNGEKRSRNRGPPPSWGRRPRD DYRRRSPPPRRRSPRRRSFSRSRSRSLSRDRRRERSLSRERNHKPSRSFSRSRSRSRS NERK" BASE COUNT 141 a 114 c 149 g 124 t ORIGIN 1 atgcatcgtg attcctgtcc attggactgt aaggtttatg taggcaatct tggaaacaat 61 ggcaacaaga cggaattgga acgggctttt ggctactatg gaccactccg aagtgtgtgg 121 gttgctagaa acccacccgg ctttgctttt gttgaatttg aagatccccg agatgcagct 181 gatgcagtcc gagagctaga tggaagaaca ctatgtggct gccgtgtaag agtggaactg 241 tcgaatggtg aaaaaagaag tagaaatcgt ggcccacctc cctcttgggg tcgtcgccct 301 cgagatgatt atcgtaggag gagtcctcca cctcgtcgca gatctccaag aaggagaagc 361 ttctctcgca gccggagcag gtccctttct agagatagga gaagagagag atcgctgtct 421 cgggagagaa atcacaagcc gtcccgatcc ttctctaggt ctcgtagtcg atctaggtca 481 aatgaaagga aatagaagac cagtttgcaa aagtggtgta gaggatcc // LOCUS HUMSRYA 845 bp mRNA PRI 13-JAN-1995 DEFINITION Homo sapiens sex-determining region Y (SRY) mRNA, complete cds. ACCESSION L10101 NID g292511 KEYWORDS sex-determining protein; sex-determining region Y. SOURCE Homo sapiens male cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 845) AUTHORS Su,H. and Lau,Y.F. TITLE Identification of the transcriptional unit, structural organization, and promoter sequence of the human sex-determining region Y (SRY) gene, using a reverse genetic approach JOURNAL Am. J. Hum. Genet. 52 (1), 24-38 (1993) MEDLINE 93167288 FEATURES Location/Qualifiers source 1..845 /organism="Homo sapiens" /note="mouse LTK cells transfected with human cosmid hcosSRY" /db_xref="taxon:9606" /germline /sex="male" /map="Yp11.3" gene 97..711 /gene="SRY" CDS 97..711 /gene="SRY" /codon_start=1 /db_xref="GDB:G00-125-556" /product="sex-determining region Y" /db_xref="PID:g292512" /translation="MQSYASAMLSVFNSDDYSPAVQENIPALRRSSSFLCTESCNSKY QCETGENSKGNVQDRVKRPMNAFIVWSRDQRRKMALENPRMRNSEISKQLGYQWKMLT EAEKWPFFQEAQKLQAMHREKYPNYKYRPRRKAKMLPKNCSLLPADPASVLCSEVQLD NRLYRDDCTKATHSRMEHQLGHLPPINAASSPQQRDRYSHWTKL" BASE COUNT 248 a 202 c 190 g 205 t ORIGIN 1 gtaacaaaga atctggtaga agtgagtttt ggatagtaaa ataagtttcg aactctggca 61 cctttcaatt ttgtcgcact ctccttgttt ttgacaatgc aatcatatgc ttctgctatg 121 ttaagcgtat tcaacagcga tgattacagt ccagctgtgc aagagaatat tcccgctctc 181 cggagaagct cttccttcct ttgcactgaa agctgtaact ctaagtatca gtgtgaaacg 241 ggagaaaaca gtaaaggcaa cgtccaggat agagtgaagc gacccatgaa cgcattcatc 301 gtgtggtctc gcgatcagag gcgcaagatg gctctagaga atcccagaat gcgaaactca 361 gagatcagca agcagctggg ataccagtgg aaaatgctta ctgaagccga aaaatggcca 421 ttcttccagg aggcacagaa attacaggcc atgcacagag agaaataccc gaattataag 481 tatcgacctc gtcggaaggc gaagatgctg ccgaagaatt gcagtttgct tcccgcagat 541 cccgcttcgg tactctgcag cgaagtgcaa ctggacaaca ggttgtacag ggatgactgt 601 acgaaagcca cacactcaag aatggagcac cagctaggcc acttaccgcc catcaacgca 661 gccagctcac cgcagcaacg ggaccgctac agccactgga caaagctgta ggacaatcgg 721 gtaacattgg ctacaaagac ctacctagat gctccttttt acgataactt acagccctca 781 ctttcttatg tttagtttca atattgtttt cttttctctg gctaataaag gccttattca 841 tttca // LOCUS HUMSST28A 1285 bp DNA PRI 16-AUG-1994 DEFINITION Human somatostatin receptor (SST) gene, complete cds. ACCESSION L14865 NID g431094 KEYWORDS somatostatin receptor. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1285) AUTHORS Panetta,R., Greenwood,M.T., Warszynska,A., Demchyshyn,L.L., Day,R., Niznik,H.B., Srikant,C.B. and Patel,Y.C. TITLE Molecular cloning, functional characterization, and chromosomal localization of a human somatostatin receptor (somatostatin receptor type 5) with preferential affinity for somatostatin-28 JOURNAL Mol. Pharmacol. 45 (3), 417-427 (1994) MEDLINE 94195267 FEATURES Location/Qualifiers source 1..1285 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /map="3q28" gene 130..1221 /gene="SST" CDS 130..1221 /gene="SST" /codon_start=1 /product="somatostatin receptor" /db_xref="PID:g431095" /translation="MEPLFPASTPSWNASSPGAASGGGDNRTLVGPAPSAGARAVLVP VLYLLVCAAGLGGNTLVIYVVLRFAKMKTVTNIYILNLAVADVLYMLGLPFLATQNAA SFWPFGPVLCRLVMTLDGVNQFTSVFCLTVMSVDRYLAVVHPLSSARWRRPRVAKLAS AAAWVLSLCMSLPLLVFADVQEGGTCNASWPEPVGLWGAVFIIYTAVLGFFAPLLVIC LCYLLIVVKVRAAGVRVGCVRRRSERKVTRMVLVVVLVFAGCWLPFFTVNIVNLAVAL PQEPASAGLYFFVVILSYANSCANPVLYGFLSDNFRQSFQKVLCLRKGSGAKDADATE PRPDRIRQQQEATRPRTAAANGLMQTSKL" BASE COUNT 167 a 436 c 432 g 250 t ORIGIN 1 ttaccggtga tcggctctgg caccgccctg ggccagagaa ggaatgcctg cagtgtctgg 61 ttcaggactc accaccctgg cgtcctccct tcttctcttg cagagcctga cgcaccccag 121 gctgccgcca tggagcccct gttcccagcc tccacgccca gctggaacgc ctcctccccg 181 ggggctgcct ctggaggcgg tgacaacagg acgctggtgg ggccggcgcc ctcggcaggg 241 gcccgggcgg tgctggtgcc cgtgctgtac ctgctggtgt gtgcggccgg gctgggcggg 301 aacacgctgg tcatctacgt ggtgctgcgg ttcgccaaga tgaagaccgt caccaacatc 361 tacattctca acctggcagt ggccgacgtc ctgtacatgc tggggctgcc tttcctggcc 421 acgcagaacg ccgcgtcctt ctggcccttc ggccccgtcc tgtgccgcct ggtcatgacg 481 ctggacggcg tcaaccagtt caccagtgtc ttctgcctga cagtcatgag cgtggaccgc 541 tacctggcag tggtgcaccc gctgagctcg gcccgctggc gccgcccgcg tgtggccaag 601 ctggcgagcg ccgccgcctg ggtcctgtct ctgtgcatgt cgctgccgct cttggtgttc 661 gcggacgtgc aggagggcgg tacctgcaac gccagctggc cggagcccgt ggggctgtgg 721 ggcgccgtct tcatcatcta cacggccgtg ctgggcttct tcgcgccgct gctggtcatc 781 tgcctgtgct acctgctcat cgtggtgaag gtgagggcgg cgggcgtgcg cgtgggctgc 841 gtgcggcggc gctcggagcg gaaggtgacg cgcatggtgt tggtggtggt gctggtgttt 901 gcgggatgtt ggctgccctt cttcaccgtc aacatcgtca acctggcggt tgcgctgccc 961 caggagcccg cctccgccgg cctctacttc ttcgtggtca tcctctccta cgccaacagc 1021 tgtgccaacc ccgtcctcta cggcttcctc tcggacaact tccgccagag cttccagaag 1081 gttctgtgcc tccgcaaggg ctctggtgcc aaggacgctg acgccacgga gccgcgtcca 1141 gacaggatcc ggcagcagca ggaggccacg cgcccgcgca ccgccgcagc caacgggctt 1201 atgcagacca gcaagctgtg agagtgcagg cggggggtgg gcggccccgt gtcaccccca 1261 ggagtcggag gttgcactgc ggtga // LOCUS HUMSSTR3X 1413 bp DNA PRI 13-JAN-1995 DEFINITION Human somatostatin receptor subtype 3 (SSTR3) gene, complete cds. ACCESSION M96738 NID g338498 KEYWORDS somatostatin receptor. SOURCE Homo sapiens (tissue library: Stratagene #946203) male placenta DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1413) AUTHORS Yamada,Y., Reisine,T., Law,S.F., Ihara,Y., Kubota,A., Kagimoto,S., Seino,M., Seino,Y., Bell,G.I. and Seino,S. TITLE Somatostatin receptors, an expanding gene family: cloning and functional characterization of human SSTR3, a protein coupled to adenylyl cyclase JOURNAL Mol. Endocrinol. 6 (12), 2136-2142 (1992) MEDLINE 93149123 FEATURES Location/Qualifiers source 1..1413 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /tissue_type="placenta" /tissue_lib="Stratagene #946203" gene 98..1354 /gene="SSTR3" CDS 98..1354 /gene="SSTR3" /codon_start=1 /product="somatostatin receptor subtype 3" /db_xref="PID:g338499" /translation="MDMLHPSSVSTTSEPENASSAWPPDATLGNVSAGPSPAGLAVSG VLIPLVYLVVCVVGLLGNSLVIYVVLRHTASPSVTNVYILNLALADELFMLGLPFLAA QNALSYWPFGSLMCRLVMAVDGINQFTSIFCLTVMSVDRYLAVVHPTRSARWRTAPVA RTVSAAVWVASAVVVLPVVVFSGVPRGMSTCHMQWPEPAAAWRAGFIIYTAALGFFGP LLVICLCYLLIVVKVRSAGRRVWAPSCQRRRRSERRVTRMVVAVVALFVLCWMPFYVL NIVNVVCPLPEEPAFFGLYFLVVALPYANSCANPILYGFLSYRFKQGFRRVLLRPSRR VRSQEPTVGPPEKTEEEDEEEEDGEESREGGKGKEMNGRVSQITQPGTSGQERPPSRV ASKEQQLLPQEASTGEKSSTMRISYL" BASE COUNT 218 a 464 c 467 g 264 t ORIGIN 1 atgggagggg gcagcacaga gaaagccatt ctctgctgtg accgagctgt ttttccttcc 61 cccaggcaaa tgactgctga ccaccctccc ctcagccatg gacatgcttc atccatcatc 121 ggtgtccacg acctcagaac ctgagaatgc ctcctcggcc tggcccccag atgccaccct 181 gggcaacgtg tcggcgggcc caagcccggc agggctggcc gtcagtggcg ttctgatccc 241 cctggtctac ctggtggtgt gcgtggtggg cctgctgggt aactcgctgg tcatctatgt 301 ggtcctgcgg cacacggcca gcccttcagt caccaacgtc tacatcctca acctggcgct 361 ggccgacgag ctcttcatgc tggggctgcc cttcctggcc gcccagaacg ccctgtccta 421 ctggcccttc ggctccctca tgtgccgcct ggtcatggcg gtggatggca tcaaccagtt 481 caccagcata ttctgcctga ctgtcatgag cgtggaccgc tacctggccg tggtacatcc 541 cacccgctcg gcccgctggc gcacagctcc ggtggcccgc acggtcagcg cggctgtgtg 601 ggtggcctca gccgtggtgg tgctgcccgt ggtggtcttc tcgggagtgc cccgcggcat 661 gagcacctgc cacatgcagt ggcccgagcc ggcggcggcc tggcgagccg gcttcatcat 721 ctacacggcc gcactgggct tcttcgggcc gctgctggtc atctgcctct gctacctgct 781 catcgtggtg aaggtgcgct cagctgggcg ccgggtgtgg gcaccctcgt gccagcggcg 841 ccggcgctcc gaacgcaggg tcacgcgcat ggtggtggcc gtggtggcgc tcttcgtgct 901 ctgctggatg cccttctacg tgctcaacat cgtcaacgtg gtgtgcccac tgcccgagga 961 gcctgccttc tttgggctct acttcctggt ggtggcgctg ccctatgcca acagctgtgc 1021 caaccccatc ctttatggct tcctctccta ccgcttcaag cagggcttcc gcagggtcct 1081 gctgcggccc tcccgccgtg tgcgcagcca ggagcccact gtggggcccc cggagaagac 1141 tgaggaggag gatgaggagg aggaggatgg ggaggagagc agggaggggg gcaaggggaa 1201 ggagatgaac ggccgggtca gccagatcac gcagcctggc accagcgggc aggagcggcc 1261 gcccagcaga gtggccagca aggagcagca gctcctaccc caagaggctt ccactgggga 1321 gaagtccagc acgatgcgca tcagctacct gtaggggcct ggggaaagcc aggatggccc 1381 gaggaagagg cagaagccgt gggtgtgcct agg // LOCUS HUMST2M 1357 bp mRNA PRI 20-MAY-1996 DEFINITION Homo sapiens mRNA for ST2 protein. ACCESSION D12763 NID g220076 KEYWORDS immunoglobulin superfamily. SOURCE Homo sapiens helper T cell cell_line:5C10 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1357) AUTHORS Tominaga,S., Yokota,T., Yanagisawa,K., Tsukamoto,T., Takagi,T. and Tetsuka,T. TITLE Nucleotide sequence of a complementary DNA for human ST2 JOURNAL Biochim. Biophys. Acta 1171 (2), 215-218 (1992) MEDLINE 93129624 FEATURES Location/Qualifiers source 1..1357 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="5C10" /cell_type="helper T cell" sig_peptide 47..97 /product="ST2 protein" CDS 47..1033 /codon_start=1 /product="ST2 protein" /db_xref="PID:d1002728" /db_xref="PID:g220077" /translation="MGFWILAILTILMYSTAAKFSKQSWGLENEALIVRCPRQGKPSY TVDWYYSQTNKSIPTQERNRVFASGQLLKFLPAEVADSGIYTCIVRSPTFNRTGYANV TIYKKQSDCNVPDYLMYSTVSGSEKNSKIYCPTIDLYNWTAPLEWFKNCQALQGSRYR AHKSFLVIDNVMTEDAGDYTCKFIHNENGANYSVTATRSFTVKDEQGFSLFPVIGAPA QNEIKEVEIGKNANLTCSACFGKGTQFLAAVLWQLNGTKITDFGEPRIQQEEGQNQSF SNGLACLDMVLRIADVKEEDLLLQYDCLALNLHGLRRHTVRLSRKNPSKECF" mat_peptide 98..1030 /product="ST2 protein" BASE COUNT 401 a 271 c 294 g 391 t ORIGIN 1 atctcaacaa cgagttacca atacttgctc ttgattgata aacagaatgg ggttttggat 61 cttagcaatt ctcacaattc tcatgtattc cacagcagca aagtttagta aacaatcatg 121 gggcctggaa aatgaggctt taattgtaag atgtcctaga caaggaaaac ctagttacac 181 cgtggattgg tattactcac aaacaaacaa aagtattccc actcaggaaa gaaatcgtgt 241 gtttgcctca ggccaacttc tgaagtttct accagctgaa gttgctgatt ctggtattta 301 tacctgtatt gtcagaagtc ccacattcaa taggactgga tatgcgaatg tcaccatata 361 taaaaaacaa tcagattgca atgttccaga ttatttgatg tattcaacag tatctggatc 421 agaaaaaaat tccaaaattt attgtcctac cattgacctc tacaactgga cagcacctct 481 tgagtggttt aagaattgtc aggctcttca aggatcaagg tacagggcgc acaagtcatt 541 tttggtcatt gataatgtga tgactgagga cgcaggtgat tacacctgta aatttataca 601 caatgaaaat ggagccaatt atagtgtgac ggcgaccagg tccttcacgg tcaaggatga 661 gcaaggcttt tctctgtttc cagtaatcgg agcccctgca caaaatgaaa taaaggaagt 721 ggaaattgga aaaaacgcaa acctaacttg ctctgcttgt tttggaaaag gcactcagtt 781 cttggctgcc gtcctgtggc agcttaatgg aacaaaaatt acagactttg gtgaaccaag 841 aattcaacaa gaggaagggc aaaatcaaag tttcagcaat gggctggctt gtctagacat 901 ggttttaaga atagctgacg tgaaggaaga ggatttattg ctgcagtacg actgtctggc 961 cctgaatttg catggcttga gaaggcacac cgtaagacta agtaggaaaa atccaagtaa 1021 ggagtgtttc tgagactttg atcacctgaa ctttctctag caagtgtaag cagaatggag 1081 tgtggttcca agagatccat caagacaatg ggaatggcct gtgccataaa atgtgcttct 1141 cttcttcggg atgttgtttg ctgtctgatc tttgtagact gttcctgttt gctgggagct 1201 tctctgctgc ttaaattgtt cgtcctcccc cactccctcc tatcgttggt ttgtctagaa 1261 cactcagctg cttctttggt catccttgtt ttctaacttt atgaactccc tctgtgtcac 1321 tgtatgtgaa aggaaatgca ccaacaaccg aaaactg // LOCUS HUMSTAR 3745 bp mRNA PRI 17-OCT-1991 DEFINITION Human heat-stable enterotoxin receptor mRNA, complete cds. ACCESSION M73489 NID g338501 KEYWORDS guanylyl cyclase; heat-stable enterotoxin receptor; transmembrane protein. SOURCE Homo sapiens (library: lambda gt10) adult terminal ileum cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3745) AUTHORS de Sauvage,F.J., Camerato,T.R. and Goeddel,D.V. TITLE Primary structure and functional expression of the human receptor for Escherichia coli heat-stable enterotoxin JOURNAL J. Biol. Chem. 266, 17912-17918 (1991) MEDLINE 92011512 FEATURES Location/Qualifiers source 1..3745 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="terminal ileum" /tissue_lib="lambda gt10" sig_peptide 49..117 CDS 49..3270 /codon_start=1 /product="heat-stable enterotoxin receptor" /db_xref="PID:g338502" /translation="MKTLLLDLALWSLLFQPGWLSFSSQVSQNCHNGSYEISVLMMGN SAFAEPLKNLEDAVNEGLEIVRGRLQNAGLNVTVNATFMYSDGLIHNSGDCRSSTCEG LDLLRKISNAQRMGCVLIGPSCTYSTFQMYLDTELSYPMISAGSFGLSCDYKETLTRL MSPARKLMYFLVNFWKTNDLPFKTYSWSTSYVYKNGTETEDCFWYLNALEASVSYFSH ELGFKVVLRQDKEFQDILMDHNRKSNVIIMCGGPEFLYKLKGDRAVAEDIVIILVDLF NDQYLEDNVTAPDYMKNVLVLTLSPGNSLLNSSFSRNLSPTKRDFALAYLNGILLFGH MLKIFLENGENITTPKFAHAFRNLTFEGYDGPVTLDDWGDVDSTMVLLYTSVDTKKYK VLLTYDTHVNKTYPVDMSPTFTWKNSKLPNDITGRGPQILMIAVFTLTGAVVLLLLVA LLMLRKYRKDYELRQKKWSHIPPENIFPLETNETNHVSLKIDDDKRRDTIQRLRQCKY DKKRVILKDLKHNDGNFTEKQKIELNKLLQIDYYNLTKFYGTVKLDTMIFGVIEYCER GSLREVLNDTISYPDGTFMDWEFKISVLYDIAKGMSYLHSSKTEVHGRLKSTNCVVDS RMVVKITDFGCNSILPPKKDLWTAPEHLRQANISQKGDVYSYGIIAQEIILRKETFYT LSCRDRNEKIFRVENSNGMKPFRPDLFLETAEEKELEVYLLVKNCWEEDPEKRPDFKK IETTLAKIFGLFHDQKNESYMDTLIRRLQLYSRNLEHLVEERTQLYKAERDRADRLNF MLLPRLVVKSLKEKGFVEPELYEEVTIYFSDIVGFTTICKYSTPMEVVDMLNDIYKSF DHIVDHHDVYKVETIGDAYMVASGLPKRNGNRHAIDIAKMALEILSFMGTFELEHLPG LPIWIRIGVHSGPCAAGVVGIKMPRYCLFGDTVNTASRMESTGLPLRIHVSGSTIAIL KRTECQFLYEVRGETYLKGRGNETTYWLTGMKDQKFNLPTPPTVENQQRLQAEFSDMI ANSLQKRQAAGIRSQKPRRVASYKKGTLEYLQLNTTDKESTYF" mat_peptide 118..3267 /product="heat-stable enterotoxin receptor" BASE COUNT 1110 a 782 c 842 g 1011 t ORIGIN 1 cgcaaagcaa gtgggcacaa ggagtatggt tctaacgtga ttggggtcat gaagacgttg 61 ctgttggact tggctttgtg gtcactgctc ttccagcccg ggtggctgtc ctttagttcc 121 caggtgagtc agaactgcca caatggcagc tatgaaatca gcgtcctgat gatgggcaac 181 tcagcctttg cagagcccct gaaaaacttg gaagatgcgg tgaatgaggg gctggaaata 241 gtgagaggac gtctgcaaaa tgctggccta aatgtgactg tgaacgctac tttcatgtat 301 tcggatggtc tgattcataa ctcaggcgac tgccggagta gcacctgtga aggcctcgac 361 ctactcagga aaatttcaaa tgcacaacgg atgggctgtg tcctcatagg gccctcatgt 421 acatactcca ccttccagat gtaccttgac acagaattga gctaccccat gatctcagct 481 ggaagttttg gattgtcatg tgactataaa gaaaccttaa ccaggctgat gtctccagct 541 agaaagttga tgtacttctt ggttaacttt tggaaaacca acgatctgcc cttcaaaact 601 tattcctgga gcacttcgta tgtttacaag aatggtacag aaactgagga ctgtttctgg 661 taccttaatg ctctggaggc tagcgtttcc tatttctccc acgaactcgg ctttaaggtg 721 gtgttaagac aagataagga gtttcaggat atcttaatgg accacaacag gaaaagcaat 781 gtgattatta tgtgtggtgg tccagagttc ctctacaagc tgaagggtga ccgagcagtg 841 gctgaagaca ttgtcattat tctagtggat cttttcaatg accagtactt ggaggacaat 901 gtcacagccc ctgactatat gaaaaatgtc cttgttctga cgctgtctcc tgggaattcc 961 cttctaaata gctctttctc caggaatcta tcaccaacaa aacgagactt tgctcttgcc 1021 tatttgaatg gaatcctgct ctttggacat atgctgaaga tatttcttga aaatggagaa 1081 aatattacca cccccaaatt tgctcatgct ttcaggaatc tcacttttga agggtatgac 1141 ggtccagtga ccttggatga ctggggggat gttgacagta ccatggtgct tctgtatacc 1201 tctgtggaca ccaagaaata caaggttctt ttgacctatg atacccacgt aaataagacc 1261 tatcctgtgg atatgagccc cacattcact tggaagaact ctaaacttcc taatgatatt 1321 acaggccggg gccctcagat cctgatgatt gcagtcttca ccctcactgg agctgtggtg 1381 ctgctcctgc tcgtcgctct cctgatgctc agaaaatata gaaaagatta tgaacttcgt 1441 cagaaaaaat ggtcccacat tcctcctgaa aatatctttc ctctggagac caatgagacc 1501 aatcatgtta gcctcaagat cgatgatgac aaaagacgag atacaatcca gagactacga 1561 cagtgcaaat acgacaaaaa gcgagtgatt ctcaaagatc tcaagcacaa tgatggtaat 1621 ttcactgaaa aacagaagat agaattgaac aagttgcttc agattgacta ttacaacctg 1681 accaagttct acggcacagt gaaacttgat accatgatct tcggggtgat agaatactgt 1741 gagagaggat ccctccggga agttttaaat gacacaattt cctaccctga tggcacattc 1801 atggattggg agtttaagat ctctgtcttg tatgacattg ctaagggaat gtcatatctg 1861 cactccagta agacagaagt ccatggtcgt ctgaaatcta ccaactgcgt agtggacagt 1921 agaatggtgg tgaagatcac tgattttggc tgcaattcca ttttacctcc aaaaaaggac 1981 ctgtggacag ctccagagca cctccgccaa gccaacatct ctcagaaagg agatgtgtac 2041 agctatggga tcatcgcaca ggagatcatt ctgcggaaag aaaccttcta cactttgagc 2101 tgtcgggacc ggaatgagaa gattttcaga gtggaaaatt ccaatggaat gaaacccttc 2161 cgcccagatt tattcttgga aacagcagag gaaaaagagc tagaagtgta cctacttgta 2221 aaaaactgtt gggaggaaga tccagaaaag agaccagatt tcaaaaaaat tgagactaca 2281 cttgccaaga tatttggact ttttcatgac caaaaaaatg aaagctatat ggataccttg 2341 atccgacgtc tacagctata ttctcgaaac ctggaacatc tggtagagga aaggacacag 2401 ctgtacaagg cagagaggga cagggctgac agacttaact ttatgttgct tccaaggcta 2461 gtggtaaagt ctctgaagga gaaaggcttt gtggagccgg aactatatga ggaagttaca 2521 atctacttca gtgacattgt aggtttcact actatctgca aatacagcac ccccatggaa 2581 gtggtggaca tgcttaatga catctataag agttttgacc acattgttga tcatcatgat 2641 gtctacaagg tggaaaccat cggtgatgcg tacatggtgg ctagtggttt gcctaagaga 2701 aatggcaatc ggcatgcaat agacattgcc aagatggcct tggaaatcct cagcttcatg 2761 gggacctttg agctggagca tcttcctggc ctcccaatat ggattcgcat tggagttcac 2821 tctggtccct gtgctgctgg agttgtggga atcaagatgc ctcgttattg tctatttgga 2881 gatacggtca acacagcctc taggatggaa tccactggcc tccctttgag aattcacgtg 2941 agtggctcca ccatagccat cctgaagaga actgagtgcc agttccttta tgaagtgaga 3001 ggagaaacat acttaaaggg aagaggaaat gagactacct actggctgac tgggatgaag 3061 gaccagaaat tcaacctgcc aacccctcct actgtggaga atcaacagcg tttgcaagca 3121 gaattttcag acatgattgc caactcttta cagaaaagac aggcagcagg gataagaagc 3181 caaaaaccca gacgggtagc cagctataaa aaaggcactc tggaatactt gcagctgaat 3241 accacagaca aggagagcac ctatttttaa acctaaatga ggtataagga ctcacacaaa 3301 ttaaaataca gctgcactga ggcagcgacc tcaagtgtcc tgaaagctta cattttcctg 3361 agacctcaat gaagcagaaa tgtacttagg cttggctgcc ctgtctggaa catggacttt 3421 cttgcatgaa tcagatgtgt gttctcagtg aaataactac cttccactct ggaaccttat 3481 tccagcagtt gttccaggga gcttctacct ggaaaagaaa agaaatgaat agactatcta 3541 gaacttgaga agattttatt cttatttcat ttattttttg tttgtttatt tttatcgttt 3601 ttgtttactg gctttccttc tgtattcata agatttttta aattgtcata attatatttt 3661 aaatacccat cttcattaaa gtatatttaa ctcataattt ttgcagaaaa tatgctatat 3721 attaggcaag aataaaagct aaagg // LOCUS HUMSTAT4R 2588 bp mRNA PRI 01-AUG-1996 DEFINITION Homo sapiens STAT4 mRNA, complete cds. ACCESSION L78440 NID g1479978 KEYWORDS STAT4 gene; signal transducer and activator of transcription 4. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2588) AUTHORS Xu,X., Sun,Y.-L. and Hoey,T. TITLE The STAT amino-terminal domain mediates cooperative DNA binding and confers selective sequence recognition JOURNAL Unpublished (1996) FEATURES Location/Qualifiers source 1..2588 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Jurkat" /cell_type="T cell" 5'UTR 1..81 /gene="STAT4" /note="putative" gene 1..2588 /gene="STAT4" mRNA 1..2588 /gene="STAT4" /note="GDB:682054" CDS 82..2328 /gene="STAT4" /note="putative" /codon_start=1 /db_xref="GDB:GDB:682054" /product="signal transducer and activator of transcription 4" /db_xref="PID:g1479979" /translation="MSQWNQVQQLEIKFLEQVDQFYDDNFPMEIRHLLAQWIENQDWE AASNNETMATILLQNLLIQLDEQLGRVSKEKNLLLIHNLKRIRKVLQGKFHGNPMHVA VVISNCLREERRILAAANMPVQGPLEKSLQSSSVSERQRNVEHKVAAIKNSVQMTEQD TKYLEDLQDEFDYRYKTIQTMDQSDKNSAMVNQEVLTLQEMLNSLDFKRKEALSKMTQ IIHETDLLMNTMLIEELQDWKRRQQIACIGGPLHNGLDQLQNCFTLLAESLFQLRRQL EKLEEQSTKMTYEGDPIPMQRTHMLERVTFLIYNLFKNSFVVERQPCMPTHPQRPLVL KTLIQFTVKLRLLIKLPELNYQVKVKASIDKNVSTLSNRRFVLCGTNVKAMSIEESSN GSLSVEFRHLQPKEMKSSAGGKGNEGCHMVTEELHSITFETQICLYGLTIDLETSSLP VVMISNVSQLPNAWASIIWYNVSTNDSQNLVFFNNPPPATLSQLLEVMSWQFSSYVGR GLNSDQLHMLAEKLTVQSSYSDGHLTWAKFCKEHLPGKSFTFWTWLEAILDLIKKHIL PLWIDGYVMGFVSKEKERLLLKDKMPGTFLLRFSESHLGGITFTWVDHSESGEVRFHS VEPYNKGRLSALPFADILRDYKVIMAENIPENPLKYLYPDIPKDKAFGKHYSSQPCEV SRPTERGDKGYVPSVFIPISTIRSDSTEPHSPSDLLPMSPSVYAVLRENLSPTTIETA MKSPYSAE" 3'UTR 2329..2588 /gene="STAT4" /note="GDB:682054; putative" BASE COUNT 812 a 553 c 552 g 671 t ORIGIN 1 gctttctcct agggactgtg aggggcgctt ctgactttgg acttgagcac tgcctgggac 61 ctgtgctgag agagcgctag catgtctcag tggaatcaag tccaacagtt agaaatcaag 121 tttttggagc aggtggatca attctatgat gacaactttc ccatggaaat tcggcatctg 181 ttggcccaat ggattgaaaa tcaagactgg gaggcagctt ctaacaatga aaccatggca 241 acgattcttc ttcaaaactt gttaatacaa ctggatgaac agttaggtcg tgtttccaaa 301 gagaaaaacc tactcttgat acacaatcta aaaagaatta ggaaggtcct tcagggaaaa 361 tttcatggaa atccaatgca tgtagctgtg gttatttcaa actgtttaag ggaagagagg 421 agaatattgg ctgcagccaa catgcctgtc caggggcctc tagagaaatc cttacaaagt 481 tcttcagttt cagaaagaca gaggaatgtg gagcacaaag tggctgccat taaaaacagt 541 gtgcagatga cagaacaaga taccaaatac ttagaagatc tgcaagacga atttgactac 601 aggtataaaa caattcagac aatggatcag agtgacaaga atagtgccat ggtgaatcag 661 gaagttttga cactgcagga aatgcttaac agcctcgatt tcaagagaaa ggaggctctc 721 agtaaaatga cccaaatcat ccatgagaca gacctgttaa tgaacaccat gctcatagaa 781 gagctgcaag actggaagcg gcggcagcaa atcgcctgca tcgggggtcc actccacaat 841 gggctcgacc agcttcagaa ctgctttaca ctattggcag aaagtctttt ccaactgaga 901 aggcaattgg agaaactaga ggagcaatct accaaaatga catatgaagg tgatcccatt 961 ccaatgcaaa gaactcacat gctagaaaga gtcaccttct tgatctacaa ccttttcaag 1021 aactcatttg tggttgagcg acagccatgt atgccaaccc accctcagag gccgttggta 1081 cttaaaaccc taattcagtt cactgtaaaa ctaaggctac taataaaatt gccagaacta 1141 aactatcagg taaaggttaa ggcatcaatt gacaagaatg tttcaactct aagcaaccga 1201 agatttgtac tttgtggaac taatgtcaaa gccatgtcta ttgaagaatc ttccaatggg 1261 agtctctcag tagaatttcg acatttgcaa ccaaaggaaa tgaagtccag tgctggaggt 1321 aaaggaaatg agggctgtca catggtgact gaagaacttc attccataac gtttgaaaca 1381 cagatctgcc tctatggcct gaccatagat ttggagacca gctcattgcc tgtggtgatg 1441 atttccaatg tcagtcagtt acctaatgct tgggcatcca tcatttggta caacgtgtca 1501 accaacgatt cccagaactt ggttttcttt aataatcctc cacctgccac attgagtcaa 1561 ctactggagg tgatgagctg gcagttttca tcgtacgttg gtcgtggtct taactcagat 1621 caactccata tgctggcaga gaagcttaca gtccaatcta gctacagtga tggtcacctc 1681 acctgggcca agttctgcaa ggaacattta cctggtaaat catttacctt ttggacatgg 1741 cttgaagcaa tattggatct aattaagaaa cacattcttc ccctttggat tgatgggtat 1801 gtcatgggct ttgttagcaa agagaaggaa cggctgttgc taaaggataa aatgcctggc 1861 acctttttat taagattcag tgaaagccat ctcggaggaa taactttcac ctgggtggac 1921 cattctgaaa gtggggaagt gagattccac tctgtagaac cctacaataa aggccggttg 1981 tctgctctgc cattcgctga catcctgcga gactacaaag ttattatggc tgaaaacatt 2041 cctgaaaacc ctctgaagta cctatatcct gacattccca aagacaaagc cttcggtaaa 2101 cactacagct ctcagccttg cgaagtttca agaccaacag aaaggggtga caaaggttat 2161 gttccttctg tttttatccc catctcaaca atccgaagtg attcaacaga gccacattct 2221 ccatcagacc ttcttcccat gtctccaagt gtgtatgcgg tgttgagaga aaacctgagt 2281 cccacaacaa ttgaaactgc aatgaagtct ccttattctg ctgaatgaca ggataaactc 2341 tgacgcacca agaaaggaag caaatgaaaa agtttaaaga ctgttctttg cccaataacc 2401 acattttatt tcttcagctt tgtaaatacc aggttctagg aaatgtttga catctgaagc 2461 tctcttcaca ctcccgtggc actcctcaat tgggagtgtt gtgactgaaa tgcttgaaac 2521 caaagcttca gataaacttg caagataaga caactttaag aaaccagtgt taataacaat 2581 attaacag // LOCUS HUMSTS 6520 bp mRNA PRI 13-JAN-1995 DEFINITION Human steroid sulfatase (STS) mRNA, complete cds. ACCESSION M16505 NID g338513 KEYWORDS steroid sulfatase. SOURCE Human placenta, cDNA to mRNA, clones M13mp18 and M13mp19. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2429) AUTHORS Yen,P.H., Allen,E., Marsh,B., Mohandas,T., Wang,N., Taggart,R.T. and Shapiro,L.J. TITLE Cloning and expression of steroid sulfatase cDNA and the frequent occurrence of deletions in STS deficiency: implications for X-Y interchange JOURNAL Cell 49 (4), 443-454 (1987) MEDLINE 87187642 REFERENCE 2 (bases 2430 to 6520) AUTHORS Yen,P.H. JOURNAL Unpublished (1988) COMMENT There is a steroid sulfatase (STS) pseudogene on Yq. FEATURES Location/Qualifiers source 1..6520 /organism="Homo sapiens" /db_xref="taxon:9606" /map="Xp22.32" mRNA <1..2429 /note="steroid sulfatase mRNA" sig_peptide 221..286 /gene="STS" /note="steroid sulfatase signal peptide" gene 221..1972 /gene="STS" CDS 221..1972 /gene="STS" /note="steroid sulfatase precursor (EC 3.1.6.2)" /codon_start=1 /db_xref="GDB:G00-120-393" /db_xref="PID:g338514" /translation="MPLRKMKIPFLLLFFLWEAESHEASRPNIILVMADDLGIGDPGC YGNKTIRTPNIDRLASGGVKLTQHLAASPLCTPSRAAFMTGRYPVRSGMASWSRTGVF LFTASSGGLPTDEITFAKLLKDQGYSTALIGKWHLGMSCHSKTDFCHHPLHHGFNYFY GISLTNLRDCKPGEGSVFTTGFKRLVFLPLQIVGVTLLTLAALNCLGLLHVPLGVFFS LLFLAALILTLFLGFLHYFRPLNCFMMRNYEIIQQPMSYDNLTQRLTVEAAQFIQRNT ETPFLLVLSYLHVHTALFSSKDFAGKSQHGVYGDAVEEMDWSVGQILNLLDELRLAND TLIYFTSDQGAHVEEVSSKGEIHGGSNGIYKGGKANNWEGGIRVPGILRWPRVIQAGQ KIDEPTSNMDIFPTVAKLAGAPLPEDRIIDGRDLMPLLEGKSQRSDHEFLFHYCNAYL NAVRWHPQNSTSIWKAFFFTPNFNPVGSNGCFATHVCFCFGSYVTHHDPPLLFDISKD PRERNPLTPASEPRFYEILKVMQEAADRHTQTLPEVPDQFSWNNFLWKPWLQLCCPST GLSCQCDREKQDKRLSR" mat_peptide 287..1969 /gene="STS" /note="steroid sulfatase" BASE COUNT 1881 a 1440 c 1363 g 1835 t 1 others ORIGIN Chromosome Xp22.32. 1 acagtgctgt tggccaagcc tccagcagct gacgggaccc agctgtagtg aggttgcagt 61 gattgagtag gattggcctg cttcaaagca gaggtttctc atgggaatat gcttattaaa 121 ctcccactgg tgcagaaacc atgaacagag gatgaacaag tgaagttgca atctcctcca 181 tcacagctca gttccccaac aacaggatca caagctggag atgcctttaa ggaagatgaa 241 gatccctttc ctcctactgt tctttctgtg ggaagccgag agccacgaag catcaaggcc 301 gaacatcatc ctggtgatgg ctgacgacct cggcattgga gatcctgggt gctatgggaa 361 caaaactatc aggactccca atatcgaccg gttggccagt gggggagtga aactcactca 421 gcacctggca gcatcaccgc tgtgcacacc aagcagggca gccttcatga ctggccggta 481 ccctgtccga tcaggaatgg catcttggtc ccgcactgga gttttcctct tcacagcctc 541 ttcgggagga cttcccaccg atgagattac ctttgctaag cttctgaagg atcaaggtta 601 ttcaacagca ctgataggga aatggcacct tgggatgagc tgtcacagca agactgactt 661 ctgtcaccac cctttacatc acggcttcaa ttatttctat gggatctctt tgaccaatct 721 gagagactgc aagcccggag agggcagtgt cttcaccacg ggcttcaaga ggctggtctt 781 cctccccctg cagatcgtcg gggtcaccct ccttaccctt gctgcactca attgtctggg 841 gctactccac gtgcctctag gcgttttttt cagccttctc ttcctagcag ccctaatcct 901 gacccttttc ttgggcttcc ttcattactt ccggcccctg aactgcttca tgatgaggaa 961 ctacgagatc attcagcagc ccatgtccta tgacaatctc acccagaggc taacggtgga 1021 ggcggcccag ttcatacagc ggaacactga gactccgttc ctgcttgtct tgtcctacct 1081 ccacgtgcac acagccctgt tctccagcaa agactttgct ggcaaaagtc aacacggagt 1141 ctacggggat gctgttgagg aaatggactg gagtgtgggg cagatcttga accttctgga 1201 tgagctgaga ttggctaatg ataccctcat ctacttcaca tcggaccagg gagcacatgt 1261 agaagaagtg tcttccaaag gagaaattca tggcggaagt aatgggatct ataaaggagg 1321 aaaagcaaac aactgggaag gaggtatccg ggttccaggc atccttcgtt ggcccagggt 1381 gatacaggct ggccagaaga ttgatgagcc cactagcaac atggacatat ttcctacagt 1441 agccaagctg gctggagctc ccttgcctga ggacaggatc attgatggac gtgatctgat 1501 gcccctgctt gaaggaaaaa gccaacgctc cgatcatgag tttctcttcc attactgcaa 1561 cgcctactta aatgctgtgc gctggcaccc tcagaacagc acatccatct ggaaggcctt 1621 tttcttcacc cccaacttca accccgtggg ttccaacgga tgctttgcca cacacgtgtg 1681 cttctgtttc gggagttatg tcacccatca cgacccacct ttactctttg atatttccaa 1741 agatcccaga gagagaaacc cacttactcc agcatccgag ccccggtttt atgaaatcct 1801 caaagtcatg caggaagctg cggacagaca cacccagacc ctgccagagg tgcccgatca 1861 gttttcatgg aacaactttc tttggaagcc ctggcttcag ctgtgctgtc cttccaccgg 1921 cctgtcttgc cagtgtgata gagaaaaaca ggataagaga ctgagccgct agcagcgcct 1981 ggggaccaga cagacgcatg tggcaaagct caccatcttc actacaaaca cgcctgagag 2041 tggcactggg gaaacataac tccatctaca ccttggattt ggactgattc tccattttat 2101 cacctgaagg cttgggccag agctcaacag ctactcaact ggaggggtga gggggataag 2161 gtctgtagta tacagacagg aagatggtag gtttatgcct tctgtggcca gagtcttgga 2221 ctcatggaaa tagaatgaat agaggggcat tcacaaggca caccagtgca agcagatgac 2281 aaaaaggtgc agaaggcaat cttaaaacag aaaggtgcag gaggtacctt aactcacccc 2341 tcagcaaata cctatgtcaa cagtataagt taccatttac tctataatct gcagtgatgc 2401 aataaccagc ataataaaaa ggcaatcaca taaaaaagag tttagtcgtc taaacataag 2461 taactttaag gtgaatgaaa gatcttcttt aggaataata gatgatggta agttccactn 2521 tggttattgg aaggcaagtc attattattg gtattagtta aaacacatat caaatgcttg 2581 ctcttcatca tatatatagt tatgcataca tacacacaca cacatacagt atattctttc 2641 ctcaaaaggg ttaagatgtc taaaataggg acctagaagc ttaacactat ttaagtaaat 2701 acagtagaag ctcacaaata gatttctttg cacaatgatt ttttgcaaaa ttttacagta 2761 ataataatcc caaggcaaat ctctcctgaa ctgctttcca ttccataatt tgtagtataa 2821 ttcttggatt ccactgtttt ctttggggaa tggaagttct gaattaaaag cccactgtgg 2881 agatgctgtg gttcatggaa tctcttccag tgtaattcag aatcatggcc tagaaagtct 2941 ctgatatttg gaggggaaca aaaatcactc acaagcaatc catgatctat acacataagc 3001 ataatttcct ttagttctag ttagtcatca gagaacagtc atgtatgcaa gttttgtgac 3061 tgagaaattt ctgtgcttcc aatccacaat gagatgcatg attttgtttt catcccattt 3121 cccccaagcc cctgtaaatc agggaaaatg cgcaactgat cgcctaggag agggcctcgt 3181 agtggcacag ctggagatag tttcaaagtc taaaccacca gcccatcctg aggaaagcct 3241 cctatggaat gtaaagtgca atcatttctt cagatataag actttcccca acaatgtgat 3301 tggattcctt tatgctcaaa atcgagagaa gctgccatcc acctgcttat gcatttatct 3361 cttttgtgga cttgtctgac caccttctat ttgcccagag tttgctcaat tccaagacag 3421 tgcccatgaa tgggacacct gtaatgtaac ccacacagcg gtttgcagag aatgttagcc 3481 atgacttggg ctttgtaaag ttggctataa tttctctatc cctacccaca accctgggaa 3541 gttggagcaa gaggggcata ctattgggct gggaggattt gacagcattt ccccagttgc 3601 cctttaagtt cttctatttc aaacgttaat tttgcttctc tttctaaaaa aaaaaaaaaa 3661 aaagaaagag aagaaagaag tgattcctac ccctacctcc agagttgttg aaagctgaaa 3721 agcatacaag attcttcctt ttaacttgga tttctcgttc cagaaattgt gggataatct 3781 gtattcttgc tttagaaaac aattcttaga gagggtacta gcttactgat gatgtgttag 3841 gattgctact gatgctgtca tgtggaaact atttaaaggc actattataa atttatccta 3901 taagatgaca atgtttactc aaagtctaac atattcaatg caagtaagac tttctgaaaa 3961 cacttgatga tgtggaaatg ctgcaggacc taaataactt gaagagcctt tatagattat 4021 atgaatgcct atttgtgtct agaaccagtt atttaacctg taaaatgtca atagcaaatg 4081 aaggatgaag tatatctcta gatgcaaata cattgagttt aaaagtgcct caaaataatt 4141 gagatcacat ttcaggacat ttggaaatca ggtcgatttg tggtaactgt agtcatctta 4201 aatttcaaac catttaccat ctgaaagttt tgatttgaat gtaaaacagg aaattggaat 4261 tcctttgtcc aggagaaacc tcacaaacct tctttaaggc atagttttgt tgtttgtttg 4321 cttgtttgtt gcaggctgta aggcatggct gcttgtttac aaagcatctc attcatatta 4381 cctgtggagt tgcatatcca aaccttagtg agttttgaag ctttaagcaa attcttttaa 4441 aaaattcttg tatttctagc attactagat attaaaagtt aagcaaatag attaatgacg 4501 tatacatagg catcatttca caaggtcagt aatgctgcag gaaaagcaaa attgcaatct 4561 acgtatctat ggtactaagg aagtcctgtt ttcaaaaatg gaagcccact tctcagattt 4621 ttctgaaggg catacaatga aaagtgaagg ggaaacacac acacacaaaa aaaacaagta 4681 tttggcttgt cacaggaatc tgattgcatt aagtgaagga ttatttagaa tatgttaatg 4741 caaagctaaa ataaaatttt ccttggcaat taaaaatgct gtgcgctaat accctgcttt 4801 ctatcgtgac tcaattcaac aatgtgggga atgtttacta catttccaac ttgatgtcaa 4861 gcaatgggga atacaagttc cagttctgca aagattcgtc aactttctta gctcaagaga 4921 gaggctgaga aatgcagaga agaataagac ataaaatagc tccgacctcc atgatccgag 4981 agtgggaaaa ggcccgatta ttacccataa ggcacactct ctaaggcctt ttaaggggcc 5041 tacaaaaatg ttttatttta taatcagaag aaaaggaaat gaacattggg gattgaaaat 5101 catattggta tttgcaccaa catagtcata aaatagtatg ttaatatgtt tttactttat 5161 atatttatat attaaaatat atttaatatg ttttgccttt gtggcccatg aaagtcttac 5221 tgggccctgg ggaaggtatc ctaccctggt gaagcagctg ctttgctcta caaatacctg 5281 gggcagaaat ttgatttgaa aagtattatt ctctcttctc tttgtttcaa ctggattcct 5341 ttggaaaacc aaactagtat cagaacaaac cccgaaacag taagaaattg gagtgagaag 5401 ggcatggtat tgggactagg atcggctctc attcgatcga gctattctct taaaatgaca 5461 aaaagtgtcc ataaagaggc tgctggagag tgcgtggcca tagggagccg acatgcccgg 5521 gaggaaaggt gttgattaca tggatacttc taaaagctaa agccttgttg ccttctcttt 5581 aatgcctaga gaatgggatg tgtgatgcaa atgctcaaaa cctcttaaat catagctgtc 5641 tgacctctac ggacctcaca tccatctgag gcttcatgga caaagattct ccacttggcc 5701 aaactttagc caagctcctc aaccttctcc caggcccaat ctgggcactt ccttgtaaaa 5761 tctagttttg gcaagaagtc tattaggtca gtttagcaag aacacctaac ccccccatat 5821 ctgttcaacc tcagtatctg atcaggctcc tcagcctcca ccatccccca ggtgatgtct 5881 ggtcaactgg cctaccttca gctagaatcc tgttaggtcg gtttagatga atgctccctg 5941 atatttcctc ttggtaatct tccatccact gcccatgacc ctgttccctg tctataaatc 6001 cccagttttc catggtatat tcagagctga gtccagtctc tctcccctac tacaagtccc 6061 cattgctgtg gtccccgtac ctgtcatgat ggtcctaaat aaagtcttac tgtgctttaa 6121 taggtagcat tgaaaaattt ttttctttga catcattcat gacaacatga aacctattgg 6181 gacagcatga ctgtgcaggg tctttagagc tcagctttct gaggccctga gcattcttgg 6241 ttttccgaca tcgtgaacct gttctgtgtt gcaagatatc actcaagcca gtgttgctta 6301 ataccatctc tttgtgtaat agatctgaat aaagtaattg taatacacca atattttact 6361 ttgttgagta ttattttaat gggagccttt ccaatgaagg aaacatcagc tttccttgtt 6421 agaatgcaag cagcttaaaa actacccatg tctttctgga aaggaaacat tgccatgtaa 6481 ctatgtatca gttttctaag aacttcatgg gccttccata // LOCUS HUMSUIISO 660 bp mRNA PRI 13-JAN-1995 DEFINITION Homo sapiens sui1iso1 mRNA, complete cds. ACCESSION L26247 NID g450280 KEYWORDS . SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 660) AUTHORS Fields,C. and Adams,M.D. TITLE Expressed sequence tags identify a human isolog of the suil translation initiation factor JOURNAL Biochem. Biophys. Res. Commun. 198 (1), 288-291 (1994) MEDLINE 94121644 FEATURES Location/Qualifiers source 1..660 /organism="Homo sapiens" /db_xref="taxon:9606" gene 140..481 /gene="sui1iso1" CDS 140..481 /gene="sui1iso1" /note="isolog of yeast sui1 and rice gos2; putative" /codon_start=1 /db_xref="PID:g450281" /translation="MSAIQNLHSFDPFADASKGDDLLPAGTEDYIHIRIQQRNGRKTL TTVQGIADDYDKKKLVKAFKKKFACNGTVIEHPEYGEVIQLQGDQRKNICQFLVEIGL AKDDQLKVHGF" BASE COUNT 171 a 174 c 149 g 165 t 1 others ORIGIN 1 gccgccgycg aggattcagc agcctccccc ttgagccccc tcgcttcccg acgttccgtt 61 cccccctgcc cgccttctcc cgccaccgcc gccgccgcct tccgcagccg tttccaccga 121 ggaaaaggaa tcgtatcgta tgtccgctat ccagaacctc cactctttcg acccctttgc 181 tgatgcaagt aagggtgatg acctgcttcc tgctggcact gaggattata tccatataag 241 aattcaacag agaaacggca ggaagaccct tactactgtc caagggatcg ctgatgatta 301 cgataaaaag aaactagtga aggcgtttaa gaaaaagttt gcctgcaatg gtactgtaat 361 tgagcatccg gaatatggag aagtaattca gctacagggt gaccaacgca agaacatatg 421 ccagttcctc gtagagattg gactggctaa ggacgatcag ctgaaggttc atgggtttta 481 agtgcttgtg gctcactgaa gcttaagtga ggatttcctt gcaatgagta gaatttccct 541 tctctccctt gtcacaggtt taaaaacctc acagcttgta taatgtaacc atttggggtc 601 cgcttttaac ttggactagt gtaactcctt catgcaataa actgaaaaga gccatgcaaa // LOCUS HUMSULOXI 2408 bp mRNA PRI 25-AUG-1995 DEFINITION Human sulfite oxidase mRNA, complete cds. ACCESSION L31573 NID g508501 KEYWORDS sulfite oxidase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2408) AUTHORS Garrett,R.M., Bellissimo,D.B. and Rajagopalan,K.V. TITLE Molecular cloning of human liver sulfite oxidase JOURNAL Biochim. Biophys. Acta 1262 (2-3), 147-149 (1995) MEDLINE 95322455 FEATURES Location/Qualifiers source 1..2408 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="liver" /tissue_lib="lambda gt10; Clontech" transit_peptide 904..969 CDS 904..2370 /codon_start=1 /product="sulfite oxidase" /db_xref="PID:g508502" /translation="MGTLLGLGAVLAYQDHRCRAAQESTHIYTKEEVSSHTSPETGIW VTLGSEVFDVTEFVDLHPGGPSKLMLAAGGPLEPFWALYAVHNQSHVRELLAQYKIGE LNPEDKVAPTVETSDPYADDPVRHPALKVNSQRPFNAEPPPELLTENYITPNPIFFTR NHLPVPNLDPDTYRLHVVGAPGGQSLSLSLDDLHNFPRYEITVTLQCAGNRRSEMTQV KEVKGLEWRTGAISTARWAGARLCDVLAQAGHQLCETEAHVCFEGLDSDPTGTAYGAS IPLARAMDPEAEVLLAYEMNGQPLPRDHGFPVRVVVPGVVGARHVKWLGRVSVQPEES YSHWQRRDYKGFSPSVDWETVDFDSAPSIQELPVQSAITEPRDGETVESGEVTIKGYA WSGGGRAVIRVDVSLDGGLTWQVAKLDGEEQRPRKAWAWRLWQLKAPVPAGQKELNIV CKAVDDGYNVQPDTVAPIWNLRGVLSNAWHRVHVYVSP" mat_peptide 970..2367 /product="sulfite oxidase" BASE COUNT 530 a 670 c 626 g 582 t ORIGIN 1 ggaattccgg gtaatatagg gtttggtgca atctgtggtt tcaaacatct actgggggtc 61 gtagaacata tccccagtgg ataagggggt actactgtac ttgttggctc ttcatgttag 121 ctctgctagg cagatgtcat ttcagagaag aggaagcaag ttcagaacgg cttggaatct 181 tgctcaggaa atcgggctgg ttaatgaata acaagagatc cagtttcaca aacccaaggc 241 attttgcccc cagaatctgg gcttcttcac ccatttaggc tgtcactact ttttttcact 301 tttttatccc tgtttaagtc agtctgaccc acagttgtcc tctgctgact tcagaaataa 361 taatctggcc agtagacatt tggtttcggt cctttaggcc cttcgcccca ggcatcgttc 421 tctatggtgg acaaagttca gaatggaaga tgggagaaag gtgattctga ttctagaagc 481 acccatccct cctaccccat tccccacccg cattacctgc catcctgtca gcacagtctt 541 ctctgaagtg ctccaagttt tctctaaggg cccatttgga ctcccactct caagactcct 601 cacttgccca gaaagctcct tgctgacctt ctctgtgtct tcctctcacc cattccctta 661 ggcctcccta atatcccctc ccagggtctc cctatcttga tcccagaatc ttccttctac 721 aggtctgcta caatgctgct gctgcacaga gctgtggtcc tcaggctcca acaggcctgc 781 agactcaagt caatcccctc aaggatctgc attcaggcct gctccacaaa tgattcattt 841 cagccccagc gccccagcct caccttctct ggtgataact ccagcaccca gggatggaga 901 gtcatgggga ccctattagg tctcggtgca gtgttggcct atcaggacca tcggtgtagg 961 gctgctcagg agtcaacaca catatacact aaggaggaag tgagttccca caccagccct 1021 gagactggga tctgggtgac tctgggctct gaggtctttg atgtcacaga atttgtggac 1081 ctacatccag gggggccttc aaagctgatg ctagcagctg ggggtcccct agagcccttc 1141 tgggccctct atgctgttca caaccagtcc catgtgcgtg agttactggc tcagtacaag 1201 attggggagc tgaatcctga agacaaggta gcccccaccg tggagacctc tgacccttat 1261 gctgatgatc ctgtacgtca cccagccctg aaggtcaaca gccagcggcc ctttaatgca 1321 gagcctcccc ctgagctgct gacagaaaac tacatcacac ccaaccctat cttcttcacc 1381 cggaaccatc tgcctgtacc taacctggat ccagacacct atcgcttaca cgtagtagga 1441 gcacctgggg gtcagtcact gtctctttcc ctggatgact tgcacaactt tcccaggtac 1501 gagatcacag tcactctgca gtgtgccggc aaccgacgct ctgagatgac tcaggtcaaa 1561 gaagtaaaag gtctggagtg gagaacagga gccatcagca ctgcacgctg ggctggggca 1621 cggctctgtg atgtgttagc ccaggctggc caccaactct gtgaaactga ggcccacgtc 1681 tgctttgagg gactggactc agaccctact gggactgcct atggagcatc catccctctg 1741 gctcgggcca tggaccctga agctgaggtc ctgctggcat atgagatgaa tgggcagcct 1801 ctgccacgtg accacggctt ccctgtgcgt gtggtggttc ctggagtggt gggtgcccgc 1861 catgtcaaat ggctgggcag agtgagtgtg cagccagagg aaagttacag ccactggcaa 1921 cggcgggatt acaaaggctt ctctccatct gtggactggg agactgtaga ttttgactct 1981 gctccatcca ttcaggaact tcctgtccag tccgccatca cagagccccg ggatggagag 2041 actgtagaat caggggaggt gaccatcaag ggctatgcat ggagtggtgg tggcagggct 2101 gtgatccggg tggatgtgtc tctggatggg ggcctaacct ggcaggtggc taagctggat 2161 ggagaggaac agcgccccag gaaggcctgg gcatggcgtc tgtggcagtt gaaagcccct 2221 gtgccagctg gacaaaagga actgaacatt gtttgtaagg ctgtggatga tggttacaat 2281 gtgcagccag acaccgtggc cccaatctgg aacctgcgag gtgttctcag caatgcctgg 2341 catcgtgtcc atgtctatgt ctccccatga gcatggaaag gagccacctc cacccctttc 2401 cggaattc // LOCUS HUMSYN 2402 bp mRNA PRI 13-JAN-1995 DEFINITION Human syndecan mRNA, complete cds. ACCESSION J05392 NID g338633 KEYWORDS integral membrane protein; syndecan. SOURCE Human breast cell line HBL-100, cDNA to mRNA, clones hsyn(4,pr7). ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2402) AUTHORS Mali,M., Jaakkola,P., Arvilommi,A.M. and Jalkanen,M. TITLE Sequence of human syndecan indicates a novel gene family of integral membrane proteoglycans JOURNAL J. Biol. Chem. 265 (12), 6884-6889 (1990) MEDLINE 90216719 COMMENT Draft entry and printed sequence for [1] kindly submitted by M.Mali, 13-FEB-1990, for release after publication. FEATURES Location/Qualifiers source 1..2402 /organism="Homo sapiens" /db_xref="taxon:9606" /map="2p" mRNA <1..2402 /note="syndecan mRNA" gene 206..1138 /gene="SDC" CDS 206..1138 /gene="SDC" /note="syndecan" /codon_start=1 /db_xref="GDB:G00-126-375" /db_xref="PID:g338634" /translation="MRRAALWLWLCALALSLQLALPQIVATNLPPEDQDGSGDDSDNF SGSGAGALQDITLSQQTPSTWKDTQLLTAIPTSPEPTGLEATAASTSTLPAGEGPKEG EAVVLPEVEPGLTAREQEATPRPRETTQLPTTHQASTTTATTAQEPATSHPHRDMQPG HHETSTPAGPSQADLHTPHTEDGGPSATERAAEDGASSQLPAAEGSGEQDFTFETSGE NTAVVAVEPDRRNQSPVDQGATGASQGLLDRKEVLGGVIAGGLVGLIFAVCLVGFMLY RMKKKDEGSYSLEEPKQANGGAYQKPTKQEEFYA" BASE COUNT 462 a 735 c 712 g 492 t 1 others ORIGIN 1 ggagaggtgc gggccgaatc cgagccgagc gagaggaatc cggcagtaga gagcggactc 61 cagccggcgg accctgcagc cctcgcctgg gacagcggcg cgctgggcag gcgcccaaga 121 gagcatcgag cagcggaacc cgcgaagccg gcccgcagcc gcgacccgcg cagcctgccg 181 ctctcccgcc gccggtccgg gcagcatgag gcgcgcggcg ctctggctct ggctgtgcgc 241 gctggcgctg agcctgcagc tggccctgcc gcaaattgtg gctactaatt tgccccctga 301 agatcaagat ggctctgggg atgactctga caacttctcc ggctcaggtg caggtgcttt 361 gcaagatatc accttgtcac agcagacccc ctccacttgg aaggacacgc agctcctgac 421 ggctattccc acgtctccag aacccaccgg cctggaggct acagctgcct ccacctccac 481 cctgccggct ggagaggggc ccaaggaggg agaggctgta gtcctgccag aagtggagcc 541 tggcctcacc gcccgggagc aggaggccac cccccgaccc agggagacca cacagctccc 601 gaccactcat caggcctcaa cgaccacagc caccacggcc caggagcccg ccacctccca 661 cccccacagg gacatgcagc ctggccacca tgagacctca acccctgcag gacccagcca 721 agctgacctt cacactcccc acacagagga tggaggtcct tctgccaccg agagggctgc 781 tgaggatgga gcctccagtc agctcccagc agcagagggc tctggggagc aggacttcac 841 ctttgaaacc tcgggggaga atacggctgt agtggccgtg gagcctgacc gccggaacca 901 gtccccagtg gatcaggggg ccacgggggc ctcacagggc ctcctggaca ggaaagaggt 961 gctgggaggg gtcattgccg gaggcctcgt ggggctcatc tttgctgtgt gcctggtggg 1021 tttcatgctg taccgcatga agaagaagga cgaaggcagc tactccttgg aggagccgaa 1081 acaagccaac ggcggggcct accagaagcc caccaaacag gaggaattct atgcctgacg 1141 cgggagccat gcgccccctc cgccctgcca ctcactaggc ccccacttgc ctcttccttg 1201 aagaactgca ggccctggcc tcccctgcca ccaggccacc tccccagcat tccagcccct 1261 ctggtcgctc ctgcccacgg agtcgtgggt gtgctgggag ctccactctg cttctctgac 1321 ttctgcctgg agacttaggg caccaggggt ttctcgcata ggacctttcc accacagcca 1381 gcacctggca tcgcaccatt ctgactcggt ttctccaaac tgaagcagcc tctccccagg 1441 tccagctctg gaggggaggg ggatccgact gctttggacc taaatggcct catgtggctg 1501 gaagatctgc gggtggggct tggggctcac acacctgtag cacttactgg taggaccaag 1561 catcttgggg gggtggccgc tgagtggcag ggacaggagt cactttgttt cgtggggagg 1621 tctaatctag atatcgactt gtttttgcac atgtttcctc tagttctttg ttcatagccc 1681 agtagacctt gttacttctg aggtaagtta agtaagttga ttcggtatcc ccccatcttg 1741 cttccctaat ctatggtcgg gagacagcat cagggttaag aagacttttt tttttttttt 1801 ttaaactagg agaaccaaat ctggaagcca aaatgtaggc ttagtttgtg tgttgtctct 1861 tgagtttgtc gctcatgtgt gcaacagggt atggactatc tgtctggtgg ccccgtttct 1921 ggtggtctgt tggcaggctg gccagtccag gctgccgtgg ggccgccgcc tctttcaagc 1981 agtcgtgcct gtgtccatgc gctcagggcc atgctgaggc ctgggccgct gccacgttgg 2041 agaagcccgt gtgagaagtg aatgctggga ctcagccttc agacagagag gactgtaggg 2101 agggcggcag gggcctggag atcctcctgc agaccacncc cgtcctgcct gtgcgccgtc 2161 tccaggggct gcttcctcct ggaaattgac gaggggtgtc ttgggcagag ctggctctga 2221 gcgcctccat ccaaggccag gttctccgtt agctcctgtg gccccaccct gggccctggg 2281 ctggaatcag gaatattttc caaagagtga tagtcttttg cttttggcaa aactctactt 2341 aatccaatgg gtttttccct gtacagtaga ttttccaaat gtaataaact ttaatataaa 2401 gt // LOCUS HUMSYN69KD 3068 bp mRNA PRI 13-JAN-1995 DEFINITION Human 69 kDa 2'5' oligoadenylate synthetase (P69 2-5A synthetase) mRNA, complete cds. ACCESSION M87284 NID g338651 KEYWORDS 2'5' oligoadenylate synthetase; P69 2-5A synthetase. SOURCE Homo sapiens (tissue library: lanbda-gt11) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3068) AUTHORS Marie,I. and Hovanessian,A.G. TITLE The 69-kDa 2-5A synthetase is composed of two homologous and adjacent functional domains JOURNAL J. Biol. Chem. 267 (14), 9933-9939 (1992) MEDLINE 92250658 FEATURES Location/Qualifiers source 1..3068 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="daudi" /cell_type="lymphoblast" /tissue_lib="lanbda-gt11" gene 20..3055 /gene="P69 2-5A synthetase" CDS 20..2083 /gene="P69 2-5A synthetase" /codon_start=1 /product="2'5' oligoadenylate synthetase" /db_xref="PID:g338652" /translation="MGNGESQLSSVPAQKLGWFIQEYLKPYEECQTLIDEMVNTICDV CRNPEQFPLVQGVAIGGSYGRKTVLRGNSDGTLVLFFSDLKQFQDQKRSQRDILDKTG DKLKFCLFTKWLKNNFEIQKSLDGSTIQVFTKNQRISFEVLAAFNALSLNDNPSPWIY RELKRSLDKTNASPGEFAVCFTELQQKFFDNRPGKLKDLILLIKHWHQQCQKKIKDLP SLSPYALELLTVYAWEQGCRKDNFDIAEGVRTVLELIKCQEKLCIYWMVNYNFEDETI RNILLHQLQSARPVILDPVDPTNNVSGDKICWQWLKKEAQTWLTSPNLDNELPAPSWN VLPAPLFTTPGHLLDKFIKEFLQPNKCFLEQIDSAVNIIRTFLKENCFRQSTAKIQIV RGGSTAKGTALKTGSDADLVVFHNSLKSYTSQKNERHKIVKEIHEQLKAFWREKEEEL EVSFEPPKWKAPRVLSFSLKSKVLNESVSFDVLPAFNALGQLSSGSTPSPEVYAGLID LYKSSDLPGGEFSTCFTVLQRNFIRSRPTKLKDLIRLVKHWYKECERKLKPKGSLPPK YALELLTIYAWEQGSGVPDFDTAEGFRTVLELVTQYQQLGIFWKVNYNFEDETVRKFL LSQLQKTRPVILDPGEPTGDVGGGDRWCWHLLDKEAKVRLSSPCFKDGTGNPIPPWKV PVKVI" polyA_signal 2171..2176 /gene="P69 2-5A synthetase" polyA_signal 2897..2902 /gene="P69 2-5A synthetase" polyA_signal 3050..3055 /gene="P69 2-5A synthetase" polyA_site 3068 /gene="P69 2-5A synthetase" BASE COUNT 839 a 766 c 725 g 738 t ORIGIN 1 cggcagccag ctgagagcaa tgggaaatgg ggagtcccag ctgtcctcgg tgcctgctca 61 gaagctgggt tggtttatcc aggaatacct gaagccctac gaagaatgtc agacactgat 121 cgacgagatg gtgaacacca tctgtgacgt ctgcaggaac cccgaacagt tccccctggt 181 gcagggagtg gccataggtg gctcctatgg acggaaaaca gtcttaagag gcaactccga 241 tggtaccctt gtccttttct tcagtgactt aaaacaattc caggatcaga agagaagcca 301 acgtgacatc ctcgataaaa ctggggataa gctgaagttc tgtctgttca cgaagtggtt 361 gaaaaacaat ttcgagatcc agaagtccct tgatgggtcc accatccagg tgttcacaaa 421 aaatcagaga atctctttcg aggtgctggc cgccttcaac gctctgagct taaatgataa 481 tcccagcccc tggatctatc gagagctcaa aagatccttg gataagacaa atgccagtcc 541 tggtgagttt gcagtctgct tcactgaact ccagcagaag ttttttgaca accgtcctgg 601 aaaactaaag gatttgatcc tcttgataaa gcactggcat caacagtgcc agaaaaaaat 661 caaggattta ccctcgctgt ctccgtatgc cctggagctg cttacggtgt atgcctggga 721 acaggggtgc agaaaagaca actttgacat tgctgaaggc gtcagaacgg ttctggagct 781 gatcaaatgc caggagaagc tgtgtatcta ttggatggtc aactacaact ttgaagatga 841 gaccatcagg aacatcctgc tgcaccagct ccaatcagcg aggccagtaa tcttggatcc 901 agttgaccca accaataatg tgagtggaga taaaatatgc tggcaatggc tgaaaaaaga 961 agctcaaacc tggttgactt ctcccaacct ggataatgag ttacctgcac catcttggaa 1021 tgtcctgcct gcaccactct tcacgacccc aggccacctt ctggataagt tcatcaagga 1081 gtttctccag cccaacaaat gcttcctaga gcagattgac agtgctgtta acatcatccg 1141 tacattcctt aaagaaaact gcttccgaca atcaacagcc aagatccaga ttgtccgggg 1201 aggatcaacc gccaaaggca cagctctgaa gactggctct gatgccgatc tcgtcgtgtt 1261 ccataactca cttaaaagct acacctccca aaaaaacgag cggcacaaaa tcgtcaagga 1321 aatccatgaa cagctgaaag ccttttggag ggagaaggag gaggagcttg aagtcagctt 1381 tgagcctccc aagtggaagg ctcccagggt gctgagcttc tctctgaaat ccaaagtcct 1441 caacgaaagt gtcagctttg atgtgcttcc tgcctttaat gcactgggtc agctgagttc 1501 tggctccaca cccagccccg aggtttatgc agggctcatt gatctgtata aatcctcgga 1561 cctcccggga ggagagtttt ctacctgttt cacagtcctg cagcgaaact tcattcgctc 1621 ccggcccacc aaactaaagg atttaattcg cctggtgaag cactggtaca aagagtgtga 1681 aaggaaactg aagccaaagg ggtctttgcc cccaaagtat gccttggagc tgctcaccat 1741 ctatgcctgg gagcagggga gtggagtgcc ggattttgac actgcagaag gtttccggac 1801 agtcctggag ctggtcacac aatatcagca gctcggcatc ttctggaagg tcaattacaa 1861 ctttgaagat gagaccgtga ggaagtttct actgagccag ttgcagaaaa ccaggcctgt 1921 gatcttggac ccaggcgaac ccacaggtga cgtgggtgga ggggaccgtt ggtgttggca 1981 tcttctggac aaagaagcaa aggttaggtt atcctctccc tgcttcaagg atgggactgg 2041 aaacccaata ccaccttgga aagtgccggt aaaagtcatc taaaggaggc gttgtctgga 2101 aatagccctg taacaggctt gaatcaaaga acttctccta ctgtagcaac ctgaaattaa 2161 ctcagacaca aataaaggaa acccagctca caggagctta aacagctggt cagcccccct 2221 aagcccccac tacaagtgat cctcaggcag gtaaccccag attcatgcac tgtagggctg 2281 ggcgcagcat ccctaggtct ctacccagta gatgccacta gccctcctct cccagtgaca 2341 accaaaagtc ttcacatgtt caaacgttcc cctgggttca cagatctttc tgcctttggc 2401 ttttggctcc accctcttta gctgttaatt tgagtactta tggccctgaa agcggccacg 2461 gtgcctccag atggcaggtt tgcaatccaa gcaggaagaa ggaaaagata cccaaaggtc 2521 aagaacacag tgattttatt agaagtttca tccgcaaatt ttcttccatt tcattgctca 2581 gaatgtcatg tggttacctg taacttgaag gtggctacaa agatgactgt ggaggtggtt 2641 gcacttgcca cccaaggatg tctgccacac ctctccaagc cctcctacct accaagatat 2701 acctgatata tccaccagat atctcctcag atatacttgg ttctctccac caggttcttt 2761 ctttaaagca ggattctcaa ctttgatact tactcacatt gggctagaca gttctttgtt 2821 tggaggctct cttgtgcatg taggatgttg agcagcatgt gtggcctgta cccagtacat 2881 gccacccagt tgtgacaatt aaaagtgtct tgagacttta tcatgtgtct tctgccctag 2941 gtgagaaccc ttgcactaca ggaaccctac acccaacctg gggggaatgt agggaagagg 3001 tgccaagcca accgtggggt tagctctaat tattaagtta tgcattataa ataaatacca 3061 aaaaattg // LOCUS HUMSYNTAXB 1175 bp mRNA PRI 20-FEB-1996 DEFINITION Human mRNA for SYNTAXIN1B, complete cds. ACCESSION D37933 NID g531249 KEYWORDS SYNTAXIN1B. SOURCE Homo sapiens adult brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1175) AUTHORS Fujiwara,T., Genda,M. and Akagawa,K. TITLE Molecular cloning of human HPC-1 cDNA JOURNAL Unpublished (1994) REFERENCE 2 (bases 1 to 1175) AUTHORS Fujiwara,T. TITLE Direct Submission JOURNAL Submitted (11-AUG-1994) to the DDBJ/EMBL/GenBank databases. Tomonori Fujiwara, Kyorin University School of Medicine, Department of Physiology; Sinkawa 6-20-2, Mitaka, Tokyo 181, Japan (Tel:0422-47-5511(ex.3444), Fax:0422-47-4801) COMMENT Submitted (11-Aug-1994) to DDBJ by: Tomonori Fujiwara Kyourin Medical School of University Department of Physiology Sinkawa 6-20-2 Mitaka, Tokyo 181 Japan Phone: 0422-47-5511 x3444 Fax: 0422-47-4801. FEATURES Location/Qualifiers source 1..1175 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="brain" CDS 18..884 /codon_start=1 /product="SYNTAXIN1B" /db_xref="PID:d1007729" /db_xref="PID:g531250" /translation="MKDRTQVLRTRRNSDDKEEVVHVDRDHFMDEFFEQEEEIRGCIE KLSEDVEQVKKQHSAILAAPNPDERTKQELEDLTADIKKTANKVRSKLKAIEQSIEQE EGSTAPRPILRIRKTQHSTLSRKFVEVMTEYNATQSKYRDRCKDRIQRQLEITGRTTT NEELEDMLESGKLPIFTDDIKMDSQMTKQALNEIETRHNEIIKLETSIRELHDMFVDM AMLVESQGEMIDRIEYNVEHSVDYVERAVSDTKKAVKYQSKARRKKIIIIICCVVLGV VLASSIGCTLGL" BASE COUNT 307 a 340 c 311 g 217 t ORIGIN 1 ccactacgtg ttcggccatg aaggaccgca cccaggtgct gcgcacgcga aggaacagtg 61 acgataagga ggaggtggtg cacgtcgacc gcgaccactt catggatgag ttcttcgagc 121 aggaggagga gatccggggc tgcattgaga agctgtcgga ggacgtggag caggtgaaga 181 agcagcacag cgccatcttg gccgccccca accccgatga gaggacgaag caggagctgg 241 aggacctgac ggccgacatc aagaagacgg ccaacaaagt gcgctccaag ttgaaagcca 301 tagagcagag cattgagcag gaggagggct caaccgctcc tcggccgatc ttgcgcatcc 361 gcaaaacgca gcattccacc ctatcccgaa agttcgtgga ggtgatgacg gaatacaacg 421 ccacgcagtc caaataccgc gaccgctgca aggaccgcat ccagcggcag ctcgagatca 481 ccggccgcac caccacgaat gaagagctgg aggacatgtt ggagagcggg aagttgccca 541 tcttcaccga tgatatcaaa atggactcgc agatgaccaa gcaggccctg aacgaaatcg 601 agacgcggca taacgagatc attaagctgg aaacgagcat ccgagagctg cacgacatgt 661 tcgtggacat ggccatgctt gtggagagcc agggggagat gatcgaccgc atcgagtaca 721 acgtggagca ctcggtggat tacgtggaac gcgccgtatc cgacaccaag aaagccgtta 781 agtaccaaag caaagccagg aggaagaaga ttatcatcat catttgctgc gtggtgctgg 841 gggtggtttt ggcctcctcc attgggtgca ctttgggcct ataggggtgt ccgtccccct 901 cccccattaa tgcactaaat ttaatcgacg ggccccccat tatttcacta attaaatgac 961 cgtttccccc caatatcccc cattttgcgc taatttatga cggacccctc ccaagacccc 1021 ccattttgcc ctaatttata aaaaaacccc ccccccccca gccccatagc cgccccccag 1081 ctctgtaacc cccccccccc cggtgtggtc ctgtgctgta tttgctacga aacccattgg 1141 cacaaaaaat ttttttttaa tttaaaaacg aattc // LOCUS HUMSYTA 3244 bp mRNA PRI 13-JAN-1995 DEFINITION Human synaptotagmin mRNA, complete cds. ACCESSION M55047 J05710 NID g338657 KEYWORDS synaptotagmin. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3244) AUTHORS Perin,M.S., Johnston,P.A., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C. TITLE Structural and functional conservation of synaptotagmin (p65) in Drosophila and humans JOURNAL J. Biol. Chem. 266 (1), 615-622 (1991) MEDLINE 91093190 FEATURES Location/Qualifiers source 1..3244 /organism="Homo sapiens" /db_xref="taxon:9606" /map="12cen-q21" gene 28..1296 /gene="SYT" CDS 28..1296 /gene="SYT" /codon_start=1 /db_xref="GDB:G00-125-296" /product="synaptotagmin p65" /db_xref="PID:g338658" /translation="MVSESHHEALAAPPVTTVATVLPSNATEPASPGEGKEDAFSKLK EKFMNELHKIPLPPWALIAIAIVAVLLVLTCCFCICKKCLFKKKNKKKGKEKGGKNAI NMKDVKDLGKTMKDQALKDDDAETGLTDGEEKEEPKEEEKLGKLQYSLDYDFQNNQLL VGIIQAAELPALDMGGTSDPYVKVFLLPDKKKKFETKVHRKTLNPVFNEQFTFKVPYS ELGGKTLVMAVYDFDRFSKHDIIGEFKVPMNTVDFGHVTEEWRDLQSAEKEEQEKLGD ICFSLRYVPTAGKLTVVILEAKNLKKMDVGGLSDPYVKIHLMQNGKRLKKKKTTIKKN TLNPYYNESFSFEVPFEQIQKVQVVVTVLDYDKIGKNDAIGKVFVGYNSTGAELRHWS DMLANPRRPIAQWHTLQVEEEVDAMLAVKK" BASE COUNT 1054 a 615 c 670 g 905 t ORIGIN 1 taatagaaca cttcacctga acctaaaatg gtgagcgaga gtcaccatga ggccctggca 61 gccccgcctg tcaccactgt cgcgactgtt ctgccaagca atgccacaga gccagccagt 121 cctggagaag gaaaggaaga tgcattttct aagctgaagg agaagtttat gaatgagttg 181 cataaaattc cattgccacc gtgggcctta attgcaatag ccatagtcgc agtcctttta 241 gtcctgacct gctgcttttg tatctgtaag aaatgtttgt tcaaaaagaa aaacaagaag 301 aagggaaagg aaaaaggagg gaagaatgcc attaacatga aagatgtaaa agacttaggg 361 aagacgatga aagatcaggc cctcaaggat gatgatgctg aaactggatt gacagatgga 421 gaagaaaaag aagaacccaa agaagaggag aaactgggaa aacttcagta ttcactggat 481 tatgatttcc aaaataacca gctgctggta gggatcattc aggctgctga actgcccgcc 541 ttggacatgg ggggcacatc tgatccttac gtgaaagtgt ttctgctacc tgataagaag 601 aagaaatttg agacaaaagt ccaccgaaaa acccttaatc ctgtcttcaa tgagcaattt 661 actttcaagg taccatactc ggaattgggt ggcaaaaccc tagtgatggc tgtatatgat 721 tttgatcgtt tctctaagca tgacatcatt ggagaattta aagtccctat gaacacagtg 781 gattttggcc atgtaactga ggaatggcgt gacctgcaaa gtgctgagaa ggaagagcaa 841 gagaaattgg gtgatatctg cttctccctt cgctacgtac ctactgctgg taagctgact 901 gttgtcattc tggaggcaaa gaacctgaag aagatggatg tgggtggctt atccgatcct 961 tatgtgaaga ttcatctgat gcagaatggt aagaggctga agaagaaaaa gacaacaatt 1021 aaaaagaaca cacttaaccc ctactacaat gagtcattca gctttgaagt accttttgaa 1081 caaatccaga aagtgcaggt ggtggtaact gttttggact atgacaagat tggcaagaac 1141 gatgccatcg gcaaagtctt tgtgggctac aacagcaccg gcgcggagct gcgacactgg 1201 tcagacatgc tggccaaccc caggcgacct attgcccagt ggcacaccct gcaggtagag 1261 gaggaagttg atgccatgct ggccgtcaag aagtaaagga aagaagaagc ctttctgcat 1321 ttgcccatat agtgctcttt agccagtatc tgtaaatacc tcagtaatat gggtcctttc 1381 atttttccag ccatgcattc ctaacacaat tcagtggtac ttggaatcct gttttaattt 1441 gcacaaattt aaatgtagag agcccctaag tccttcatca taccactgcc ctccaaatct 1501 actcttcttt taagcaatat gatgtgtaga tagagcatga atgaaattat ttattgtatc 1561 acactgttgt atataccagt atgctaaaga tttatttcta gtttgtgtat ttgtatgttg 1621 taagcgtttc ctaatctgtg tatatctaga tgtttttaat aagatgttct attttaaact 1681 atgtaaattg actgagatat aggagagctg ataatatatt atacggtaaa tatagtatcg 1741 tctgcattcc agcaaaaata tcaactcgta aggcactagt acagttaaac tgacatctta 1801 aaggacaact taaacctgag ctttctattg aatcatttga gtaccaagat aaacttacac 1861 cacatacttg gtgggtgaat ccaattttgt agaattccta cacaggcaaa atagcatgat 1921 ctgagcagca gcatccaggc tgacctcaag gaagcatagc cacaaaacag aatagcacct 1981 gtctgtacat atttacaaag ctaaaataat ggcttcactc ttatatttga ggaagcaact 2041 gaacaggagt caatgatttc atattactgc atatagaata acaacaaggt gttccgtgtg 2101 tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg cacatttgtt tggggatggg 2161 ggagaagaag ctaaggggag aagtcaacat ttatgaaata ttgcctgact atttaaaaag 2221 aaaaaagtag ctctccatta tcacctttat acaaaatgta catcctgtga attctgttcc 2281 agatttcaca cctacaataa ttccaaaagg tttgcacatt agagtttgta acaaaatatt 2341 ttattatata aaaccaggtt agaaggaatg caggatattt ttaacacaac aatctgtgct 2401 tattacacaa aattactttg tggtaaacag acagtattgt aatcccatca aaagatgaaa 2461 gaaaaacaaa aacaaaaacc aacaacaatt agccatagtt ctgaatgcac ttcaattaag 2521 ccaaaacaga cagctagtga tctttttata tgctcttttt acttaagttt taatttgtcc 2581 tttaaaaaaa ggtgaaacaa accaagaaca agttctagaa aactgaagca acctcttatg 2641 tatactagat gcttgattta ggaggagttt ttaaacgttt tcaatgttat tatgtagtaa 2701 atgacactat tatgaagcta ctagtcattc cataagagtc ttaaaggact gctctgtgta 2761 cactgtgact gccgtgtgtg cttagacccg tagtttcctc agtggatagc actcaattta 2821 ttccgtagtg atattgtaac aatactgcca ttcccttcta ctgcactgcc caaggtgtgt 2881 gtagcacaaa cagttctcat tacaaaggac caattcagaa ctgaaaagct atgcatagga 2941 caaggaagat acatagaatg gggtggaaca cagcattttg tcaagcactg tgcaatattc 3001 catatttttc cccactatgg tagacaacca tttcgtggaa gggcagccta ttatcccaca 3061 ctgcatctag ccttttgtcc cattcacttc tgtgatccat tttaatttcc aggccacaag 3121 acagtagtga tgctctgaaa tgaaagtttg tcttcacaaa tatcaaaaca aaatggagga 3181 aaactaagca ttggcctcat gttcagtctt caggatatca caccacgtct tttcaaaaac 3241 taaa // LOCUS HUMTACTILE 5205 bp mRNA PRI 22-APR-1992 DEFINITION Human tactile protein mRNA, complete cds. ACCESSION M88282 NID g338671 KEYWORDS T-cell activation antigen; tactile protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5205) AUTHORS Wang,P.L., O'Farrell,S., Clayberger,C. and Krensky,A.M. TITLE Identification and molecular cloning of tactle: A novel human T cell activation antigen that is a member of the Ig gene superfamily JOURNAL J. Immunol. 148, 2600-2608 (1992) MEDLINE 92218864 FEATURES Location/Qualifiers source 1..5205 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="CCRF-HSB-2" /cell_type="T cell leukemia" mRNA 1..5205 5'UTR 1..928 sig_peptide 929..991 CDS 929..2638 /note="'T cell surface antigen, increased during activation'" /codon_start=1 /product="tactile protein" /db_xref="PID:g338672" /translation="MEKKWKYCAVYYIIQIHFVKGVWEKTVNTEENVYATLGSDVNLT CQTQTVGFFVQMQWSKVTNKIDLIAVYHPQYGFYCAYGRPCESLVTFTETPENGSKWT LHLRNMSCSVSGRYECMLVLYPEGIQTKIYNLLIQTHVTADEWNSNHTIEIEINQTLE IPCFQNSSSKISSEFTYAWSVEDNGTQETLISQNHLISNSTLLKDRVKLGTDYRLHLS PVQIFDDGRKFSCHIRVGPNKILRSSTTVKVFAKPEIPVIVENNSTDVLVERRFTCLL KNVFPKANITWFIDGSFLHDEKEGIYITNEERKGKDGFLELKSVLTRVHSNKPAQSDN LTIWCMALSPVPGNKVWNISSEKITFLLGSEISSTDPPLSVTESTLDTQPSPASSVSP ARYPATSSVTLVDVSALRPNTTPQPSNSSMTTRGFNYPWTSSGTDTKKSVSRIPSETY SSSPSGAGSTLHDNVFTSTARAFSEVPTTANGSTKTNHVHITGIVVNKPKDGMSWPVI VAALLFCCMILFGLGVRKWCQYQKEIMERPPPFKPPPPPIKYTCIQEPNESDLPYHEM ETL" mat_peptide 992..2635 /note="T cell activation antigen" /product="tactile protein" 3'UTR 2639..5205 repeat_region 4435..4759 /rpt_family="'Alu'" polyA_site 5205 BASE COUNT 1568 a 1160 c 1036 g 1441 t ORIGIN 1 tcttgagagt gttcaaccat agaaatctaa tggttggttg gtgtcttttc agtcaaatgc 61 tacatgtttt tcactccaat atttggcatg catatagaat tttgtgcgtc aagttggcct 121 tttccttttt ctaattggtt ctggtctttt ttggagaggt tagagacaca atacaacttt 181 aaaaagagtg catgtgtaca cacagccacc cacccaccca caccctcttc ccggattgcc 241 ctccaccccc atgattataa agatgcttat ggtgctctta aaatctctct catgtcagtt 301 gccatttgca ttgagctcaa aaaataatat taaggctttt ggcttcacta aacaaaagca 361 aatacaattt ctgcttgcaa ttctatcctt tacccaccct gcagtgcagg gcccagtcaa 421 gtgtgagata agcaggggag gtccagagag aggccggtag aggcactgca caagccttgc 481 cagcgtcacc tcccacctcc tggtcccttt cactgctaag acatgcaact gctccatgac 541 agccttgtgc cttttgccaa atgtccagaa attttgcgaa tctaagtatt aaaaacacac 601 tgatttggtg gtgctaataa gaatagaaaa aggtcatttg aacagatcta ttttatgaat 661 gaatacagac ctaaaaatcc ttaagaaaac acaaatttca gagaacaatt tcaacattgt 721 tctgtcgaac gttatactca gtcctgaacc acattacttt cctgtctacg tttcatttcc 781 tgggggcttg ccaagtgata aacagactca ggcgtgtgtg gtagagttcg ggttttttag 841 cacgaagtgg gtggctggag tttgcttgaa aacatcaatt gactttgtga tcattacaga 901 aatgctggtg taaggtgttc agaagacaat ggagaaaaaa tggaaatact gtgctgtcta 961 ttacatcatc cagatacatt ttgtcaaggg agtttgggaa aaaacagtca acacagaaga 1021 aaatgtttat gctacacttg gctctgatgt caacctgacc tgccaaacac agacagtagg 1081 cttcttcgtg cagatgcaat ggtccaaggt caccaataag atagacctga ttgctgtcta 1141 tcatccccaa tacggcttct actgtgccta tgggagaccc tgtgagtcac ttgtgacttt 1201 cacagaaact cctgagaatg ggtcaaaatg gactctgcac ttaaggaata tgtcttgttc 1261 agtcagtgga aggtacgagt gtatgcttgt tctgtatcca gagggcattc agactaaaat 1321 ctacaacctt ctcattcaga cacacgttac agcagatgaa tggaacagca accatacgat 1381 agaaatagag ataaatcaga ctctggaaat accatgcttt caaaatagct cctcaaaaat 1441 ttcatctgag ttcacctatg catggtcggt ggaggataat ggaactcagg aaacacttat 1501 ctcccaaaat cacctcatca gcaattccac attacttaaa gatagagtca agcttggtac 1561 agactacaga ctccacctct ctccagtcca aatcttcgat gatgggcgga agttctcttg 1621 ccacattaga gtcggtccta acaaaatctt gaggagctcc accacagtca aggtttttgc 1681 taaaccagaa atccctgtga ttgtggaaaa taactccacg gatgtcttgg tagagagaag 1741 atttacctgc ttactaaaga atgtatttcc caaagcaaat atcacatggt ttatagatgg 1801 aagttttctt catgatgaaa aagaaggaat atatattact aatgaagaga gaaaaggcaa 1861 agatggattt ttggaactga agtctgtttt aacaagggta catagtaata aaccagccca 1921 atcagacaac ttgaccattt ggtgtatggc tctgtctcca gtcccaggaa ataaagtgtg 1981 gaacatctca tcagaaaaga tcacttttct cttaggttct gaaatttcct caacagaccc 2041 tccactgagt gttacagaat ctacccttga cacccaacct tctccagcca gcagtgtatc 2101 tcctgcaaga tatccagcta catcttcagt gacccttgta gatgtgagtg ccttgaggcc 2161 aaacaccact cctcaaccca gcaattccag tatgactacc cgaggcttca actatccctg 2221 gacctccagt gggacagata ccaaaaaatc agtttcacgg atacctagtg aaacatacag 2281 ttcatccccc tcaggtgcag gctcaacact tcatgacaat gtctttacca gcacagccag 2341 agcattttca gaagtcccca caactgccaa tggatctacg aaaactaatc acgtccatat 2401 cactggtatt gtggtcaata agcccaaaga tggaatgtcc tggccagtga ttgtagcagc 2461 tttactcttt tgctgcatga tattgtttgg tcttggagtg agaaaatggt gtcagtacca 2521 aaaagaaata atggaaagac ctccaccttt caagccacca ccacctccca tcaagtacac 2581 ttgcattcaa gagcccaacg aaagtgatct gccttatcat gagatggaga ccctctagtc 2641 tcgtgagact ttgccccatg gcagaactct gctggaatcc tattgagaag gtagacattg 2701 tgctttatta atatagtcgc tcttcagcca tgcctttgct gcagctgaaa tggaagtcag 2761 aagtgagtga cctgttttcc cagcaactca ccctctttca tctccaaacg cctgaagctt 2821 aaccaagagt gagaggatat gtcatgttca cactcaatgc aattcgtagt ggttttcttg 2881 cttattgtaa gaagtacata ttagtctgcc atctttaaaa aaaatacagt attttcattt 2941 aaattctctg atggagggac aacaatggtt tcaactgtat gcccatgcct gatcctctta 3001 tttgaacatc tatcaacatt gtaaactctt tgccaaaatc ctggggcttt gctgcattcc 3061 ctaagataat tataggaaaa agaaaatgta aaagtgctaa caaggctgcc aagtaatgga 3121 gaagtatggt tagccttcat attgaaattc tgttgcttat tttcatggaa ggaaacagaa 3181 tactttgcac aggaaccaca ttttcaatcc tccttcactg tcttcctacc atgttcagcc 3241 cagactcctg ccacatggac caggatgaag agggatcaaa gagataatta gccaaaaacc 3301 cagtagccta gaagatacaa aactccactg gcctctaaaa ttatattagc caagagtggt 3361 ttcatttgag tgccttcgtg tgtatgtcca tcaaactgga accaaactgt tttgtaagta 3421 aacaggcagc ctaagcccaa ccctactttc taattccggt tattctcttt ttcatctggg 3481 gatttacctg ttcatttaat ctgcctgttt tgatctgttt tgaaaaagat aaagagcctc 3541 aaatcagacc agcactgatt aattaaccct gctcctacca atctttttta aagcagttga 3601 agcagaatgt ataggtgtca gagaagaaac ctagtcagcc agacgtgctc tgtattcagc 3661 aatagtttgt gaatgaataa attactaatc ctccttgtcg cttgaaacct tcccacactc 3721 cctgctccag gagggaaaaa cagatgttgt tgacagatag agtgataggc aaattctgtg 3781 tggactttag tcccaaaagg aaactttagt tcacttgcag tatgcttatc cttgactgca 3841 catgagaatg ccttgtgcag agttatttgg agattatgtc tttttcttaa acaccatggc 3901 tgtcacactt cagttcaatt aaatcagaat gtctgaggag tgagacacag gcatcaacac 3961 tctcaaatga ttcacatgtt cagccaaagt tgagaaccat cgagcctgtg gaagttcttt 4021 ctcatggctc agaatcttag gtaggtgctt aactcttgtg gtggccagcc tccaagatga 4081 gccccagtgt tcttgcctcc tactattcac atctttatgt ggtcccctcc aatgctgaat 4141 acagatgatt tgtgtaacct gaggccagga ttaaggtgag gcaatcaatg tacctaggga 4201 aaaaatttaa ggaggtattc acactcaggg tcatgcactt gcacaatgtt gagaatgagt 4261 accactctca ccattggtat agccaaaaaa agcttggaag tgaccaaggc taggtcacaa 4321 aatacactgt ggcttcttct ttgatctctc tttgaccata ctgacactgg gaaaagccca 4381 ttcccatgcc atgaagacac caaggcagcc ctattgagaa atctacctgt cgtggccggg 4441 cgcagtggct cacgcctgta atcccagcac tttgggaggc cgaggtgggt ggatcacgag 4501 gtcaggagat cgagaccatc ctggctaaca cagtgaaacc ccgtctctac taaaaataca 4561 aaaattagcc gggtgtggtg tcgggcacct gtagtcccag ctactcagga ggctgaggca 4621 ggagaagggt gggaacccgg gaggcagagc ttgcagtgag ccgagattgt gccactgcac 4681 actccaatct gggtgaaaga ccgagactcc gcctcaaaaa aaaaaaaaaa agaaagaaag 4741 aaagaaagaa agaaagaaat ctacctgtca aggaactaag gtattttgct aacaagcacc 4801 aacttgccag ccatgtaagg gagccatctt ggaagcagat cctccagcct ccagtcaagt 4861 cttcagataa ttgcaacttc agttgatctt ttgaccaaga cctcaagaga gccagaacta 4921 cccagctaag ccttttacta aatttctgaa cttctaacac tattagataa taagtgctta 4981 ttgtttaaca ccattaattt tgagtataat ttgttacata gcgacagata actatacagc 5041 tcaacaacta gaaaaataaa ctgtttacct gccttaatta tttatcttta gttccttatt 5101 agttctcaag aaacaaatgc tagcttcata tgtatggctg ttgctttgct tcatgtgtat 5161 ggctatttgt atttaacaag acttaatcat cagtaatttg tatac // LOCUS HUMTAPA1 1496 bp mRNA PRI 15-SEP-1990 DEFINITION Human 26-kDa cell surface protein TAPA-1 mRNA, complete cds. ACCESSION M33680 NID g338677 KEYWORDS 26-kDa cell surface protein TAPA-1; target of antiproliferative antibody. SOURCE Human cell line OCI-LY8, cDNA to mRNA, clones 7-3 and 8-1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1496) AUTHORS Oren,R., Takahashi,S., Doss,C., Levy,R. and Levy,S. TITLE TAPA-1, the target of an anti-proliferative antibody, defines a new family of transmembrane proteins JOURNAL Mol. Cell. Biol. 10, 4007-4015 (1990) MEDLINE 90318365 COMMENT Draft entry and computer readable sequence for [1] kindly submitted by S.Levy, 10-APR-1990, for release after publication. FEATURES Location/Qualifiers source 1..1496 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 239..949 /note="26-kDa cell surface protein TAPA-1" /codon_start=1 /db_xref="PID:g338678" /translation="MGVEGCTKCIKYLLFVFNFVFWLAGGVILGVALWLRHDPQTTNL LYLELGDKPAPNTFYVGIYILIAVGAVMMFVGFLGCYGAIQESQCLLGTFFTCLVILF ACEVAAGIWGFVNKDQIAKDVKQFYDQALQQAVVDDDANNAKAVVKTFHETLDCCGSS TLTALTTSVLKNNLCPSGSNIISNLFKEDCHQKIDDLFSGKLYLIGIAAIVVAVIMIF EMILSMVLCCGIRNSSVY" polyA_signal 1455..1460 BASE COUNT 257 a 504 c 413 g 322 t ORIGIN 1 ccattgtgct ggaaaggcgc gcaacggcgg cgacggcggc gaccccaccg cgcatcctgc 61 caggcctccg cgcccagccg cccacgcgcc cccgcgcccc gcgccccgac cctttcttcg 121 cgcccccgcc cctcggcccg ccaggccccc ttgccggcca cccgccaggc cccgcgccgg 181 cccgcccgcc gcccaggacc ggcccgcgcc ccgcaggccg cccgccgccc gcgccgccat 241 gggagtggag ggctgcacca agtgcatcaa gtacctgctc ttcgtcttca atttcgtctt 301 ctggctggct ggaggcgtga tcctgggtgt ggccctgtgg ctccgccatg acccgcagac 361 caccaacctc ctgtatctgg agctgggaga caagcccgcg cccaacacct tctatgtagg 421 catctacatc ctcatcgctg tgggcgctgt catgatgttc gttggcttcc tgggctgcta 481 cggggccatc caggaatccc agtgcctgct ggggacgttc ttcacctgcc tggtcatcct 541 gtttgcctgt gaggtggccg ccggcatctg gggctttgtc aacaaggacc agatcgccaa 601 ggatgtgaag cagttctatg accaggccct acagcaggcc gtggtggatg atgacgccaa 661 caacgccaag gctgtggtga agaccttcca cgagacgctt gactgctgtg gctccagcac 721 actgactgct ttgaccacct cagtgctcaa gaacaatttg tgtccctcgg gcagcaacat 781 catcagcaac ctcttcaagg aggactgcca ccagaagatc gatgacctct tctccgggaa 841 gctgtacctc atcggcattg ctgccatcgt ggtcgctgtg atcatgatct tcgagatgat 901 cctgagcatg gtgctgtgct gtggcatccg gaacagctcc gtgtactgag gccccgcagc 961 tctggccaca gggacctctg cagtgccccc taagtgaccc ggacacttcc gagggggcca 1021 tcaccgcctg tgtatataac gtttccggta ttactctgct acacgtagcc tttttacttt 1081 tggggttttg tttttgttct gaactttcct gttacctttt cagggctgat gtcacatgta 1141 ggtggcgtgt atgagtggag acgggcctgg gtcttgggga ctggagggca ggggtccttc 1201 tgcccctggg gtcccagggt gctctgcctg ctcagccagg cctctcctgg gagccactcg 1261 cccagagact cagcttggcc aacttggggg gctgtgtcca cccagcccgc ccgtcctgtg 1321 ggctgcacag ctcaccttgt tccctcctgc cccggttcga gagccgagtc tgtgggcact 1381 ctctgccttc atgcacctgt cctttctaac acgtcgcctt caactgtaat cacaacatcc 1441 tgactccgtc atttaataaa gaaggaacat caggcatgct aaaaaaaaaa aaaaaa // LOCUS HUMTB 2378 bp mRNA PRI 31-AUG-1993 DEFINITION Human CACCC box-binding protein mRNA, complete cds. ACCESSION L04282 NID g388318 KEYWORDS CACCC box-binding protein; zinc finger. SOURCE Homo sapiens peripheral blood cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2378) AUTHORS Wang,Y., Kobori,J.A. and Hood,L. TITLE The ht-beta gene encodes a novel CACCC box-binding protein that regulates T-cell receptor gene expression JOURNAL Mol. Cell. Biol. 13, 5691-5701 (1993) MEDLINE 93361003 FEATURES Location/Qualifiers source 1..2378 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="peripheral blood" CDS 391..1755 /standard_name="tb protein" /codon_start=1 /function="regulates T-cell receptor gene expression" /product="CACCC box-binding protein" /db_xref="PID:g388319" /translation="MNIDDKLEGLFLKCGGIDEMQSSRTMVVMGGVSGQSTVSGELQD SVLQDRSMPHQEILAADEVLQESEMRQQDMISHDELMVHEETVKNDEEQMETHERLPQ GLQYALNVPISVKQEITFTDVSEQLMRDKKQIREPVDLQKKKKRKQRSPAKILTINED GSLGLKTPKSHVCEHCNAAFRTNYHLQRHVFIHTGEKPFQCSQCDMRFIQKYLLQRHE KIHTGEKPFRCDECGMRSIQKYHMERHKRTHSGEKPYQCEYCLQYFSRTDRVLKHKRI GHENHDKKLNTCAMKGGLLRSEEDSGFSTSPKDNSLPKKKRQKTEKKSSGMDKESALD KSDLKKDKNDYLPVYSSSTKVKDEYMVAEYAVEMPHSSVGGSHLEDASGEIHPPKLVL KKINSKRSLKQPLEQNHTISPLSTYEERKFQSMLLNLWINRLYWTQKAMLTLIRLIIC RRAQ" misc_feature 907..1224 /standard_name="zinc finger" BASE COUNT 777 a 481 c 530 g 590 t ORIGIN 1 gaattccgga gaaaggcgca ggggtgggag ctgttgccga agctgccaca gcaaaagttc 61 tcccccctcc ccccttcccc tcctctcaag gcccctagaa aggttggagc tgccggccct 121 gcagtcggtg accgctgacg acttcggccg cgcccgcgga tagcgggagg aatcagcagc 181 ttggaaattc aagcacgtga tctggcggga tggcgtttgc ctaacgtatt taatggagga 241 atcggatggc ataagtgatt aaggtggtat tgaggatttc tgaagcctat gaaaggtaga 301 aactcaacca tgatttcttt ttcaactcta cagcattcct ttccttgaag tcttcgtttt 361 taccttagtc tcgggcagtt atacttaagc atgaacattg acgacaaact ggaaggattg 421 tttcttaaat gtggcggcat agacgaaatg cagtcttcca ggacaatggt tgtaatgggt 481 ggagtgtctg gccagtctac tgtgtctgga gagctacagg attcagtact tcaagatcga 541 agtatgcctc accaggagat ccttgctgca gatgaagtgt tacaagaaag tgaaatgaga 601 caacaggata tgatatcaca tgatgaactc atggtccatg aggagacagt gaaaaatgat 661 gaagagcaga tggaaacaca tgaaagactt cctcaaggac tacagtatgc acttaatgtc 721 cctataagcg taaagcagga aattactttt actgatgtat ctgagcaact gatgagagac 781 aaaaaacaaa tcagagagcc agtagactta cagaaaaaga agaagcggaa acaacgttct 841 cccgcaaaaa tccttacaat aaatgaggat ggatcacttg gtttgaaaac ccctaaatct 901 cacgtttgtg agcactgcaa tgctgccttt agaacgaact atcacttaca gagacatgtc 961 ttcattcata caggtgaaaa accatttcaa tgtagtcaat gtgacatgcg tttcatacag 1021 aagtacctgc ttcagagaca tgagaagatt catactggtg aaaaaccatt tcgctgtgat 1081 gaatgtggta tgagatccat acaaaaatat catatggaaa ggcataagag aactcatagt 1141 ggagaaaaac cttaccagtg tgaatactgt ttacagtatt tttccagaac agatcgtgta 1201 ttgaaacata aacgtattgg ccatgaaaat catgacaaaa aactaaatac atgtgccatg 1261 aaaggtggcc ttctgcgctc tgaggaagat tctggctttt ctacatcacc aaaagacaac 1321 tcactgccaa aaaagaaaag gcagaaaacg gagaaaaaat catctggaat ggacaaagag 1381 agtgctttgg acaaatctga cctgaaaaaa gacaaaaatg attacttgcc tgtttattct 1441 tcaagtacta aagtaaaaga tgagtatatg gttgcagaat atgctgttga aatgccacat 1501 tcgtcagttg ggggctcgca tttagaagat gcgtcaggag aaatacaccc acctaagtta 1561 gttctcaaaa aaattaatag taagagaagt ctgaaacagc cactggagca aaatcacaca 1621 atttcacctt tatccacata tgaagagcga aagtttcaaa gtatgctttt gaacttgtgg 1681 ataaacaggc tttactggac tcagaaggca atgctgacat tgatcaggtt gataatttgc 1741 aggagggccc agtaaacctg tgcatagtag tactaattat gatgatgcca tgcagttttt 1801 gaagaagaag cggtatcttc aaagcaagta acaacagcag ggaatatgcg ctgaatgtgg 1861 gtaccatacg ttctcagcct tctgtaacac aagcagctgt ggcaagtgtc attgatgaaa 1921 gatccacggc atccatatta gagtcacagg cactgaatgt ggagattaag agtaatcatg 1981 acaaaaatgt gttattccag atgaggtact gcagactctg ttggatcatt attcccacaa 2041 agctaatgga cagcatgaga tatccttcag tgttgcagat actgaagtga ctctagcata 2101 tcaataaatt cttcagaagt ccagagtcac cccgtcagag aatgttgatc aagctcccaa 2161 gcatcctcat cagataaagc caacatgttg caggaatact ccaagtttct gcagcaggct 2221 ttggacagaa ctagccaaaa tgatgcctat ttgaatagcc cgagccttaa ctttgtgact 2281 gataaccaga ccctcccaaa tcagccagca ttctcttcca tagacaagca ggtctatgcc 2341 accatgccca tcaatagctt tcgatcagga atgaattc // LOCUS HUMTB31A 2261 bp mRNA PRI 16-MAR-1992 DEFINITION Human TB3-1 mRNA, complete cds. ACCESSION M75715 NID g338686 KEYWORDS . SOURCE Homo sapiens (library: T-84) epithelial cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2261) AUTHORS Grenett,H.E., Fuller,G.M. and Bounelis,P. TITLE Identification of a human cDNA with high homology to yeast omnipotent suppressor 45 JOURNAL Gene 110, 239-243 (1992) MEDLINE 92165066 FEATURES Location/Qualifiers source 1..2261 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="T-84" /tissue_type="epithelial" /tissue_lib="T-84" CDS 56..1342 /note="high homology to yeast omnipotent supressor 45; putative" /codon_start=1 /product="TB3-1" /db_xref="PID:g338687" /translation="MADDPSAADRNVEIWKIKKLIKSLEAARGNGTSMISLIIPPKDQ ISRVAKMLADEFGTASNIKSRVNRLSVLGAITSVQQRLKLYNKVPPNGLEVYCGTIVT EEGKEKKVNIDFEPFKPINTSLYLCDNKFHTEALTALLSDDSKFGFIVIDGSGALFGT LQGNTREVLHKFTVDLPKKHGRGGQSALRFARLRMEKRHNYVRKVAETAVQLFISGDK VNVAGLVLAGSADFKTELSQSDMFDQRLQSKVLKLVDISYGGENGFNQAIELSTEVLS NVKFIQEKKLIGRYFDEISQDSLASTVLALKIHIGFGMGAVENSNSHMKSGYNEICSS LPRHRRRNLYLTPEQEKDKSHFTDKETGQEHELIESMPLLEWFANNYKNLELRWKLSQ INHKKGLSLMKGFGGIGGILRYQSRFPGNGIPRRRR" BASE COUNT 714 a 432 c 505 g 609 t 1 others ORIGIN 1 gaattccgca gccgctgccg ccaggactgg gccttaggga ggaggaggcg agaagatggc 61 ggacgacccc agtgctgccg acaggaacgt ggagatctgg aagatcaaga agctcattaa 121 gagcttggag gcggcccgcg gcaatggcac cagcatgata tcattgatca ttcctcccaa 181 agaccagatt tcacgagtgg caaaaatgtt agcggatgag tttggaactg catctaacat 241 taagtctcga gtaaaccgcc tttcagtcct gggagccatt acatctgtac aacaaagact 301 caaactttat aacaaagtac ctccaaatgg tctggaagta tactgtggaa caattgtaac 361 agaagaagga aaggaaaaga aagtcaacat tgactttgaa cctttcaaac caattaatac 421 gtcattgtat ttgtgtgaca acaaattcca tacagaggct cttacagcac tactttcaga 481 tgatagcaag tttggattca ttgtaataga tggtagtggt gcactttttg gcacactcca 541 aggaaacaca agagaagtcc tgcacaaatt cactgtggat ctcccaaaga aacacggtag 601 aggaggtcag tcagccttgc gttttgcccg tttaagaatg gaaaagcgac ataactatgt 661 tcggaaagta gcagagactg ctgtgcagct gtttatttct ggggacaaag tgaatgtggc 721 tggtctagtt ttagctggat ccgctgactt taaaactgaa ctaagtcaat ctgatatgtt 781 tgatcagagg ttacaatcaa aagttttaaa attagttgat atatcctatg gtggtgaaaa 841 tggattcaac caagctattg agttatctac tgaagtcctc tccaacgtga aattcattca 901 agagaagaaa ttaataggac gatactttga tgaaatcagc caggactcac tggcaagtac 961 tgttttggcg ttgaagatac acataggctt tggaatggga gctgtagaaa attctaatag 1021 tcatatgaaa tctggatata atgagatatg ttcttcattg ccaaggcaca gaagaagaaa 1081 tctctatcta actccagagc aagaaaagga taaatctcat ttcacagaca aagagaccgg 1141 acaggaacat gagcttatcg agagcatgcc cctgttggaa tggtttgcta acaactataa 1201 aaatttggag ctacgttgga aattgtcaca gataaatcac aagaagggtc tcagtttgat 1261 gaaaggattt ggtggaattg gaggtatctt gcggtaccag agtagatttc cagggaatgg 1321 aataccaagg aggagacgat gaattttttg accttgatga ctactaggta gtcgacatgg 1381 gtccggcaaa acgtgcctca ccctccagca tccaacccaa ggagcatacc catggtggaa 1441 tccaaacaga tccctgtcct tacaattgga acatttccag aacttaatcc atgagcattg 1501 gatattgaaa gaaaccgaaa caaaaccaga cccagcccta cactttggtt tgtcatggtg 1561 cagcgcagca gcctaactaa ctaatgttcc ttcaaaagcc actttggacg taatttaaaa 1621 aagaatccca gtttttactt ttactggatg gtgaaattgg ttgctcttgt attttatgaa 1681 aaaaaatgat ttttttaacc ttcatacata gaagcaaaaa tactttaact gctgtaaacc 1741 ttcaaaagtt aatagaagtg agatcatact ggtttgtttc ttatttttga ttggagaaaa 1801 attaaattgc tgcatttcgc ctaggtgacc atttacatgg attctcagtt agactgcgta 1861 agaagaaata tatgtggtga aatgttggaa ccatttctct cttggtctct gtttaatgtt 1921 gaagggtgag ctattaggag gcactttcaa cttcactccc tcacgctacc ccgtccccct 1981 ccagactggc agtttcaagg atgcaaattg cattgcaaaa tcaaactgac tcatgaagca 2041 tttgggccag tgcactgttt acttccatct gtttgcagac acatttgtgc ccggcgtttg 2101 ggagcccttt gtatcaatgt tctgacaagg gtccctataa ccttaaccta ctcgaaaccg 2161 gtttgggatg gatatgatgg ggcttctgtg ctattgctgg gattgggaga aataaaacat 2221 gcaatttaag tggaagcgaa gaaatttaaa aaaaaaaaaa n // LOCUS HUMTBP1 1341 bp mRNA PRI 14-MAY-1990 DEFINITION Human immunodeficiency virus tat transactivator binding protein-1 (tbp-1) mRNA, complete cds. ACCESSION M34079 NID g338699 KEYWORDS Tat-binding protein. SOURCE Human Jurkat T-cell, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1341) AUTHORS Nelbock,P., Dillion,P.J., Perkins,A. and Rosen,C.A. TITLE A cDNA corresponding to a protein that interacts with the human immunodeficiency virus Tat transactivator JOURNAL Science 248, 1650-1653 (1990) MEDLINE 90302011 COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by C.A.Rosen, 08-MAY-1990. FEATURES Location/Qualifiers source 1..1341 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 63..1277 /note="tat binding protein-1 (tbp-1)" /codon_start=1 /db_xref="PID:g338700" /translation="MSTEEIIQRTRLLDSEIKIMKSEVLRVTHELQAMKDKIKENSEK IKVNKTLPYLVSNVIELLDVDPNDQEEDGANIDLDSQRKGKCAVIKTSTRQTYFLPVI GLVDAEKLKPGDLVGVNKDSYLILETLPTEYDSRVKAMEVDERPTEQYSDIGGLDKQI QELVEAIVLPMNHKEKFENLGIQPPKGVLMYGPPGTGKTLLARACAAQTKATFLKLAG PQLVQMFIGDGAKLVRDAFALAKEKAPSIIFIDELDAIGTKRFDSEKAGDREVQRTML ELLNQLDGFQPNTQVKVIAATNRVDILDPALLRSGRLDRKIEFPMPNEEARARIMQIH SRKMNVSPDVNYEELARCTDDFNGAQCKAVCVEAGMIALARGATELTHEDYMEGILEV QAKKKANLQYYA" BASE COUNT 350 a 337 c 412 g 242 t ORIGIN 1 gaattccggc gaccgtgtgg gatgaggccg agcaagatgg aattggggag gaggtgctca 61 agatgtccac ggaggagatc atccagcgca cacggctgct ggacagtgag atcaagatca 121 tgaagagtga agtgttgaga gtcacccatg agctccaagc catgaaggac aagataaaag 181 agaacagtga gaaaatcaaa gtgaacaaga ccctgccgta ccttgtctcc aacgtcatcg 241 agctcctgga tgttgatcct aatgaccaag aggaggatgg tgccaatatt gacctggact 301 cccagaggaa gggcaagtgt gctgtgatca aaacctctac acgacagacg tacttccttc 361 ctgtgattgg gttggtggat gctgaaaagc taaagccagg agacctggtg ggtgtgaaca 421 aagactccta tctgatcctg gagacgctgc ccacagagta tgactcgcgg gtgaaggcca 481 tggaggtaga cgagaggccc acggagcaat acagtgacat tgggggtttg gacaagcaga 541 tccaggagct ggtggaggcc attgtcttgc caatgaacca caaggagaag tttgagaact 601 tggggatcca acctccaaaa ggggtgctga tgtatgggcc cccagggacg gggaagaccc 661 tcctggcccg ggcctgtgcc gcacagacta aggccacctt cctaaagctg gctggccccc 721 agctggtgca gatgttcatt ggagatggtg ccaagctagt ccgggatgcc tttgccctgg 781 ccaaggagaa agcgccctct atcatcttca ttgatgagtt ggatgccatc ggcaccaagc 841 gctttgacag tgagaaggct ggggaccggg aggtgcagag gacaatgctg gagcttctga 901 accagctgga tggcttccag cccaacaccc aagttaaggt aattgcagcc acaaacaggg 961 tggacatcct ggaccccgcc ctcctccgct cgggccgcct tgaccgcaag atagagttcc 1021 cgatgcccaa tgaggaggcc cgggccagaa tcatgcagat ccactcccga aagatgaatg 1081 tcagtcctga cgtgaactac gaggagctgg cccgctgcac agatgacttc aatggggccc 1141 agtgcaaggc tgtgtgtgtg gaggcgggca tgatcgcact ggccaggggt gccacggagc 1201 tcacccacga ggactacatg gaaggcatcc tggaggtgca ggccaagaag aaagccaacc 1261 tacaatacta cgcctaggca cacaggccag ccccagtctc acggctgaag tgcgcaataa 1321 aagatggttt agggggaatt c // LOCUS HUMTBSA 1719 bp mRNA PRI 13-JAN-1995 DEFINITION Homo sapiens thromboxane synthase mRNA, complete cds. ACCESSION M80646 NID g338701 KEYWORDS thromboxane synthase. SOURCE Homo sapiens adult lung cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1719) AUTHORS Ohashi,K., Ruan,K.H., Kulmacz,R.J., Wu,K.K. and Wang,L.H. TITLE Primary structure of human thromboxane synthase determined from the cDNA sequence JOURNAL J. Biol. Chem. 267 (2), 789-793 (1992) MEDLINE 92112810 FEATURES Location/Qualifiers source 1..1719 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="lung" gene 172..1554 /gene="thromboxane synthase" CDS 172..1554 /gene="thromboxane synthase" /EC_number="5.3.99.5" /note="member of cytochrome P450 superfamily" /codon_start=1 /function="biosynthesis of thromboxane A2" /evidence=experimental /product="thromboxane synthase" /db_xref="PID:g338702" /translation="MMEALGFLKLEVNGPMVTVALSVALLALLKWYSTSAFSRLEKLG LRHPKPSPFIGNLTFFRQGFWESQMELRKLYGPLCGYYLGRRMFIVISEPDMIKQVLV ENFSNFTNRMASGLEFKSVADSVLFLRDKRWEEVRGALMSAFSPEKLNEMVPLISQAC DLLLAHLKRYAESGDAFDIQRCYCNYTTDVVASVPFGTPVDSWQAPEDPFVKHCKRFF EFCIPRPILVLLLSFPSIMVPLARILPNKNRDELNGFFNKLIRNVIALRDQQAAEERR RDFLQMVLDARHSASPMGVQDFDIVRDVFSSTGCKPNPSRQHQPSPMARPLTVDEIVG QAFIFLIAGYEIITNTLSFATYLLATNPDCQEKLLREVDVFKEKHMAPEFCSLEEGLP YLDMVIAETLRMYPPAFRFTREAAQDCEVLGQRIPAGAVLEMAVGALHHDPEHWPSPE TFNPERYRCS" allele complement(1540..1541) /gene="thromboxane synthase" /note="replace (1540..1541)" BASE COUNT 394 a 451 c 445 g 429 t ORIGIN 1 tggcatcctg gtcctctcac atggctcctt aaaattggct gtggcttgtt ttttgtgatg 61 tttgcttggt tgcctgttcc cttttctacc tgcagagcac ggttcccata agggcggcga 121 gatcagcctc ctgtctcatc tggaagacca ccactctggg gtctcagagg aatgatggaa 181 gccttggggt ttctaaaatt ggaagtgaat ggccccatgg tgacggtggc cctgtcagtg 241 gctctcttgg ccctcctgaa atggtactcc acatcagcat tctcaagact ggagaagtta 301 ggcctcagac atcccaagcc ttctcctttc attggaaact tgacattttt ccgccagggt 361 ttttgggaaa gccaaatgga gctcagaaag ctgtatggac ctctgtgtgg gtactatctt 421 ggtcgtcgga tgtttattgt tatttctgag ccagacatga tcaagcaggt gttggttgag 481 aacttcagta actttaccaa cagaatggcg tcgggtttgg agttcaagtc ggtagccgac 541 agcgttctgt ttttacgtga caaaagatgg gaagaggtca gaggtgccct gatgtctgct 601 ttcagtcctg aaaagctgaa cgagatggtt cccctcatca gccaagcctg cgaccttctc 661 ctggctcatt taaaacgcta tgcggaatct ggggacgcat ttgacatcca gaggtgctac 721 tgcaattaca ccacagatgt ggttgccagc gtcccgtttg gcaccccggt ggactcctgg 781 caggcccctg aggatccctt tgtgaaacac tgcaagcgtt tcttcgaatt ctgcatcccc 841 agacctatcc tggttttact cttatcattt ccatccataa tggtcccact ggcccggatt 901 ttgcccaata agaaccgaga cgaactgaat ggctttttta acaaactcat taggaatgtg 961 attgccttgc gggaccagca agctgccgaa gagaggcgga gagacttcct ccaaatggtc 1021 ctggatgccc gacattctgc aagtcccatg ggcgtgcaag actttgacat cgtcagagac 1081 gttttctcct ctactgggtg caagccgaac ccttcccggc aacaccagcc cagccctatg 1141 gccaggcctt tgactgtgga tgagattgtg ggccaggcct tcatcttcct catcgctggc 1201 tatgaaatca tcaccaacac actttctttt gccacctacc tactggccac caaccctgac 1261 tgccaagaga agcttctgag agaggtagac gtttttaagg agaaacacat ggcccctgag 1321 ttctgcagcc tcgaggaagg cctgccctat ctggacatgg tgattgcaga gacgctgagg 1381 atgtacccgc cagctttcag attcacacgg gaggcagctc aggactgcga ggtgctgggg 1441 cagcgcatcc ccgcaggcgc tgtgctagag atggccgtgg gtgccctgca ccatgaccct 1501 gagcactggc caagcccgga gaccttcaac cctgaaaggt accgctgcag ctagaatcca 1561 aatctgccct aggtccaaaa aatggtgtct atatcaagat cgtatcccgc tgacacagaa 1621 ggctgccggg tggggggagg gcacccccaa attcaaagaa aaccctaagt gtggatgttc 1681 agaattttgg aaaaatgtca ctgaagtgat tgaaagccg // LOCUS HUMTCII 1866 bp mRNA PRI 13-JAN-1995 DEFINITION Human transcobalamin II (TCII) mRNA, complete cds. ACCESSION M60396 NID g339195 KEYWORDS transcobalamin II. SOURCE Human umbilical vein endothelial cell, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1866) AUTHORS Platica,O., Janeczko,R., Quadros,E.V., Regec,A., Romain,R. and Rothenberg,S.P. TITLE The cDNA sequence and the deduced amino acid sequence of human transcobalamin II show homology with rat intrinsic factor and human transcobalamin I JOURNAL J. Biol. Chem. 266 (12), 7860-7863 (1991) MEDLINE 91210312 FEATURES Location/Qualifiers source 1..1866 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="endothelial cell" /tissue_type="umbilical vein" /map="22q" sig_peptide 38..91 /gene="TCN2" /note="G00-119-608" CDS 38..1321 /gene="TCN2" /codon_start=1 /db_xref="GDB:G00-119-608" /product="transcobalamin II" /db_xref="PID:g339196" /translation="MRHLGAFLFLLGVLGALTEMCEIPEMDSHLVEKLGQHLLPWMDR LSLEHLNPSIYVGLRLSSLQAGTKEDLYLHSLKLGYQQCLLGSAFSEDDGDCQGKPSM GQLALYLLALRANCEFVRGHKGDRLVSQLKWFLEDEKRAIGHDHKGHPHTSYYQYGLG ILALCLHQKRVHDSVVDKLLYAVEPFHQGHHSVDTAAMAGLAFTCLKRSNFNPGRRQR ITMAIRTVREEILKAQTPEGHFGNVYSTPLALQFLMTSPMPGAELGTACLKARVALLA SLQDGAFQNALMISQLLPVLNHKTYIDLIFPDCLAPRVMLEPAAETIPQTQEIISVTL QVLSLLPPYRQSISVLAGSTVEDVLKKAHELGGFTYETQASSSGPYLTSVMGKAAGER EFWQLLRDPNTPLLQGIADYRPKDGETIELRLVSW" gene 38..1321 /gene="TCN2" mat_peptide 92..1318 /gene="TCN2" /note="G00-119-608" /evidence=experimental /product="transcobalamin II" BASE COUNT 395 a 575 c 500 g 396 t ORIGIN 1 ccgattcttg ctcactgctc acccacctgc tgctgccatg aggcaccttg gggccttcct 61 cttccttctg ggggtcctgg gggccctcac tgagatgtgt gaaataccag agatggacag 121 ccatctggta gagaagttgg gccagcacct cttaccttgg atggaccggc tttccctgga 181 gcacttgaac cccagcatct atgtgggcct acgcctctcc agtctgcagg ctgggaccaa 241 ggaagacctc tacctgcaca gcctcaagct tggttaccag cagtgcctcc tagggtctgc 301 cttcagcgag gatgacggtg actgccaggg caagccttcc atgggccagc tggccctcta 361 cctgctcgct ctcagagcca actgtgagtt tgtcaggggc cacaaggggg acaggctggt 421 ctcacagctc aaatggttcc tggaggatga gaagagagcc attgggcatg atcacaaggg 481 ccacccccac actagctact accagtatgg cctgggcatt ctggccctgt gtctccacca 541 gaagcgggtc catgacagcg tggtggacaa acttctgtat gctgtggaac ctttccacca 601 gggccaccat tctgtggaca cagcagccat ggcaggcttg gcattcacct gtctgaagcg 661 ctcaaacttc aaccctggtc ggagacaacg gatcaccatg gccatcagaa cagtgcgaga 721 ggagatcttg aaggcccaga cccccgaggg ccactttggg aatgtctaca gcaccccatt 781 ggcattacag ttcctcatga cttcccccat gcctggggca gaactgggaa cagcatgtct 841 caaggcgagg gttgctttgc tggccagtct gcaggatgga gccttccaga atgctctcat 901 gatttcccag ctgctgcccg ttctgaacca caagacctac attgatctga tcttcccaga 961 ctgtctggca ccacgagtca tgttggaacc agctgctgag accattcctc agacccaaga 1021 gatcatcagt gtcacgctgc aggtgcttag tctcttgccg ccgtacagac agtccatctc 1081 tgttctggcc gggtccaccg tggaagatgt cctgaagaag gcccatgagt taggaggatt 1141 cacatatgaa acacaggcct cctcgtcagg cccctactta acctccgtga tggggaaagc 1201 ggccggagaa agggagttct ggcagcttct ccgagacccc aacaccccac tgttgcaagg 1261 tattgctgac tacagaccca aggatggaga aaccattgag ctgaggctgg ttagctggta 1321 gcccctgagc tccctcatcc cagcagcctc gcacactccc taggcttcta ccctccctcc 1381 tgatgtccct ggaacaggaa ctcgcctgac cctgctgcca cctcctgtgc actttgagca 1441 atgccccctg ggatcacccc agccacaagc ccttcgaggg ccctatacca tggcccacct 1501 tggagcagag agccaagcat cttccctggg aagtctttct ggccaagtct ggccagcctg 1561 gccctgcagg tctcccatga aggccacccc atggtctgat gggcatgaag catctcagac 1621 tccttggcaa aaaacggagt ccgcaggccg caggtgttgt gaagaccact cgttctgtgg 1681 ttggggtcct gcaagaaggc ctcctcagcc cgggggctat ggccctgacc ccagctctcc 1741 actctgctgt tagagtggca gctctgagct ggttgtggca cagtagctgg ggagacctca 1801 gcagggctgc tcagtgcctg cctctgacaa aattaaagca ttgatggcct gtggacctgc 1861 aaaaaa // LOCUS HUMTCOBI 1537 bp mRNA PRI 13-JAN-1995 DEFINITION human transcobalamin I mRNA, complete cds. ACCESSION J05068 NID g307478 KEYWORDS granule protein; transcobalamin. SOURCE Human blood neutrophil, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1537) AUTHORS Johnston,J., Bollekens,J., Allen,R.H. and Berliner,N. TITLE Structure of the cDNA encoding transcobalamin I, a neutrophil granule protein JOURNAL J. Biol. Chem. 264 (27), 15754-15757 (1989) MEDLINE 89380156 COMMENT Draft entry and computer readable sequence for [J. Biol. Chem. (1989) In press] kindly provided by J.Johnston on 21-AUG-1989. FEATURES Location/Qualifiers source 1..1537 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="neutrophyl" /tissue_type="blood" /map="11q11-q12" sig_peptide 76..144 /gene="TCN1" /note="transcobalamin signal peptide" CDS 76..1377 /gene="TCN1" /note="transcobalamin I precursor" /codon_start=1 /db_xref="GDB:G00-118-882" /db_xref="PID:g307479" /translation="MRQSHQLPLVGLLLFSFIPSQLCEICEVSEENYIRLKPLLNTMI QSNYNRGTSAVNVVLSLKLVGIQIQTLMQKMIQQIKYNVKSRLSDVSSGELALIILAL GVCRNAEENLIYDYHLTDKLENKFQAEIENMEAHNGTPLTNYYQLSLDVLALCLFNGN YSTAEVVNHFTPENKNYYFGSQFSVDTGAMAVLALTCVKKSLINGQIKADEGSLKNIS IYTKSLVEKILSEKKENGLIGNTFSTGEAMQALFVSSDYYNENDWNCQQTLNTVLTEI SQGAFSNPNAAAQVLPALMGKTFLDINKDSSCVSASGNFNISADEPITVTPPDSQSYI SVNYSVRINETYFTNVTVLNGSVFLSVMEKAQKMNDTIFGFTMEERSWGPYITCIQGL CANNNDRTYWELLSGGEPLSQGAGSYVVRNGENLEVRWSKY" gene 76..1377 /gene="TCN1" mat_peptide 145..1374 /gene="TCN1" /note="transcobalamin I" BASE COUNT 472 a 342 c 309 g 414 t ORIGIN 1 gctctcatta ccttctgccc atcacttaat aaatagccag ccaattcatc aacattctgg 61 tacactgttg gagagatgag acagtcacac cagctgcccc tagtggggct cttactgttt 121 tcttttattc caagccaact atgcgagatt tgtgaggtaa gtgaagaaaa ctacatccgc 181 ctaaaacctc tgttgaatac aatgatccag tcaaactata acaggggaac cagcgctgtc 241 aatgttgtgt tgtccctcaa acttgttgga atccagatcc aaaccctgat gcaaaagatg 301 atccaacaaa tcaaatacaa tgtgaaaagc agattgtcag atgtaagctc gggagagctt 361 gccttgatta tactggcttt gggagtatgt cgtaacgctg aggaaaactt aatatatgat 421 taccacctga ctgacaagct agaaaataaa ttccaagcag aaattgaaaa tatggaagca 481 cacaatggca ctcccctgac taactactac cagctcagcc tggacgtttt ggccttgtgt 541 ctgttcaatg ggaactactc aaccgccgaa gttgtcaacc acttcactcc tgaaaataaa 601 aactattatt ttggtagcca gttctcagta gatactggtg caatggctgt cctggctctg 661 acctgtgtga agaagagtct aataaatggg cagatcaaag cagatgaagg cagtttaaag 721 aacatcagta tttatacaaa gtcactggta gaaaagattc tgtctgagaa aaaagaaaat 781 ggtctcattg gaaacacatt tagcacagga gaagccatgc aggccctctt tgtatcatca 841 gactattata atgaaaatga ctggaattgc caacaaactc tgaatacagt gctcacggaa 901 atttctcaag gagcattcag taatccaaac gctgcagccc aggtcttacc tgccctgatg 961 ggaaagacct tcttggatat taacaaagac tcttcttgcg tctctgcttc aggtaacttc 1021 aacatctccg ctgatgagcc tataactgtg acacctcctg actcacaatc atatatctcc 1081 gtcaattact ctgtgagaat caatgaaaca tatttcacca atgtcactgt gctaaatggt 1141 tctgtcttcc tcagtgtgat ggagaaagcc cagaaaatga atgatactat atttggtttc 1201 acaatggagg agcgctcatg ggggccctat atcacctgta ttcagggcct atgtgccaac 1261 aataatgaca gaacctactg ggaacttctg agtggaggcg aaccactgag ccaaggagct 1321 ggtagttacg ttgtccgcaa tggagaaaac ttggaggttc gctggagcaa atactaataa 1381 gcccaaactt tcctcagctg cataaaatcc atttgcagtg gagttccatg tttattgtcc 1441 ttatgccttc ttcttcattt atcccagtac gagcaggaga gttaataacc tccccttctc 1501 tctctacatg ttcaataaaa gttgttgaaa gattaac // LOCUS HUMTCRBAP 800 bp DNA PRI 07-NOV-1995 DEFINITION Homo sapiens T cell receptor beta (TCRBV10S1) gene, complete cds. ACCESSION L48728 NID g1054550 KEYWORDS T cell receptor. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 800) AUTHORS Currier,J.R., Yassai,M., Robinson,M.A. and Gorski,J. TITLE Molecular defects in TCRBV genes preclude thymic selection and limit the expressed TCR repertoire JOURNAL Unpublished (1995) FEATURES Location/Qualifiers source 1..800 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="G27" /cell_type="fibroblast cell line" /germline exon 219..268 /gene="TCRBV10S1" /pseudo sig_peptide 219..266 /gene="TCRBV10S1" gene 219..679 /gene="TCRBV10S1" CDS join(219..268,540..570) /gene="TCRBV10S1" /note="alternative splice product" /codon_start=1 /db_xref="PID:g1054552" /translation="MCLRLLCCVAISFWGARMKNLFRKQK" CDS join(219..268,382..415) /gene="TCRBV10S1" /note="consensus splice product" /codon_start=1 /db_xref="PID:g1054551" /translation="MCLRLLCCVAISFWGARLHGHQGHPET" intron 269..539 /gene="TCRBV10S1" /note="alternative 3' splice site" intron 269..381 /gene="TCRBV10S1" /note="consensus 3' splice site" allele 361 /gene="TCRBV10S1" /note="'A' in allele" /replace="" exon 382..679 /gene="TCRBV10S1" /note="consensus splice product" /pseudo exon 540..679 /gene="TCRBV10S1" /note="alternative splice product" /pseudo misc_signal 680..686 /note="RSS_heptamer - awaiting approval of new feature key" misc_signal 687..709 /note="RSS_spacer - awaiting approval of new feature key" misc_signal 710..718 /note="RSS_nonamer - awaiting approval of new feature key" BASE COUNT 237 a 188 c 174 g 201 t ORIGIN 1 ctaaaagcag gctgaaactt acactatttt taacagctaa aacaaagtct gccctcctac 61 attctttgaa attgaaatga ctctgtccca tgtacagagc atctacaaag gctatggaaa 121 gtgatcacgt cacaaggagg ttgctgacgg gggctaggag agtgggtgaa gaagccttac 181 agaaaaagct accactacga tttctttctg agccaaccat gtgcctcaga cttctctgct 241 gtgtggccat ttctttctgg ggagccaggt aaggccctgt tctgaactgg ttgaatttca 301 gttccaagac tctctccggt ggctgcagta tcaagctccc tcctgcgctt tttctacagc 361 tgctctctcc ttcctccaca ggctccacgg acaccaaggt cacccagaga cctagacttc 421 tggtcaaagc aagtgaacag aaagcaaaga tggattgtgt tcctataaaa gcacatagtt 481 atgtttactg gtatcgtaag aagctggaag aagagctcaa gtttttggtt tactttcaga 541 atgaagaact tattcagaaa gcagaaataa tcaatgagcg atttttagcc caatgctcca 601 aaaactcatc ctgtaccttg gagatccagt ccacggagtc aggggacaca gcactgtatt 661 tctgtgccag cagcaaagcc acagtgccga atgttagccc ttcttagaac acaaactcat 721 tatggaccca gctcaggaaa taagtgtgta gcaggttggt aggcactacg taacagaaac 781 ccaacttgaa agacaataaa // LOCUS HUMTCRZCN 1472 bp mRNA PRI 12-JAN-1995 DEFINITION Human T cell receptor zeta-chain mRNA, complete cds. ACCESSION J04132 NID g623041 KEYWORDS T cell receptor zeta-chain. SOURCE Homo sapiens (clone library: lambda-gt10) (tissue library: Clontech) tumor cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1472) AUTHORS Weissman,A.M., Hou,D., Orloff,D.G., Modi,W.S., Seuanez,H., O'Brien,S.J. and Klausner,R.D. TITLE Molecular cloning and chromosomal localization of the human T-cell receptor zeta chain: distinction from the molecular CD3 complex JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85 (24), 9709-9713 (1988) MEDLINE 89071765 FEATURES Location/Qualifiers source 1..1472 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Jurkat" /clone_lib="lambda-gt10" /tissue_type="tumor" /tissue_lib="Clontech" sig_peptide 75..137 /product="T-cell receptor zeta chain" CDS 75..566 /codon_start=1 /product="T-cell receptor zeta chain" /db_xref="PID:g623042" /translation="MKWKALFTAAILQAQLPITEAQSFGLLDPKLCYLLDGILFIYGV ILTALFLRVKFSRSAEPPAYQQGQNQLYNELNLGRREEYDVLDKRRGRDPEMGGKPRR KNPQEGLYNELQKDKMAEAYSEIGMKGERRRGKGHDGLYQGLSTATKDTYDALHMQAL PPR" mat_peptide 138..563 /product="T-cell receptor zeta chain" BASE COUNT 344 a 396 c 424 g 308 t ORIGIN 1 cttttctcct aaccgtcccg gccaccgctg cctcagcctc tgcctcccag cctctttctg 61 agggaaagga caagatgaag tggaaggcgc ttttcaccgc ggccatcctg caggcacagt 121 tgccgattac agaggcacag agctttggcc tgctggatcc caaactctgc tacctgctgg 181 atggaatcct cttcatctat ggtgtcattc tcactgcctt gttcctgaga gtgaagttca 241 gcaggagcgc agagcccccc gcgtaccagc agggccagaa ccagctctat aacgagctca 301 atctaggacg aagagaggag tacgatgttt tggacaagag acgtggccgg gaccctgaga 361 tggggggaaa gccgagaagg aagaaccctc aggaaggcct gtacaatgaa ctgcagaaag 421 ataagatggc ggaggcctac agtgagattg ggatgaaagg cgagcgccgg aggggcaagg 481 ggcacgatgg cctttaccag ggtctcagta cagccaccaa ggacacctac gacgcccttc 541 acatgcaggc cctgccccct cgctaacagc caggggattt caccactcaa aggccagacc 601 tgcagacgcc cagattatga gacacaggat gaagcattta caacccggtt cactcttctc 661 agccactgaa gtattcccct ttatgtacag gatgctttgg ttatatttag ctccaaacct 721 tcacacacag actgttgtcc ctgcactctt taagggagtg tactcccagg gcttacggcc 781 ctgccttggg ccctctggtt tgccggtggt gcaggtagac ctgtctcctg gcggttcctc 841 gttctccctg ggaggcgggc gcactgcctc tcacagctga gttgttgagt ctgttttgta 901 aagtccccag agaaagcgca gatgctagca catgccctaa tgtctgtatc actctgtgtc 961 tgagtggctt cactcctgct gtaaatttgg cttctgttgt caccttcacc tcctttcaag 1021 gtaactgtac tgggccatgt tgtgcctccc tggtgagagg gccgggcaga ggggcagatg 1081 gaaaggagcc taggccaggt gcaaccaggg agctgcaggg gcatgggaag gtgggcgggc 1141 aggggagggt cagccagggc ctgcgagggc agcgggagcc tccctgcctc aggcctctgt 1201 gccgcaccat tgaactgtac catgtgctac aggggccaga agatgaacag actgaccttg 1261 atgagctgtg cacaaagtgg cataaaaaac agtgtggtta cacagtgtga ataaagtgct 1321 gcggagcaag aggaggccgt tgattcactt cacgctttca gcgaatgaca aaatcatctt 1381 tgtgaaggcc tcgcaggaag acgcaacaca tgggacctat aactgcccag cggacagtgg 1441 caggacagga aaaacccgtc aatgtactag gg // LOCUS HUMTCSM 1160 bp mRNA PRI 15-JUN-1989 DEFINITION Human T cell-specific protein (RANTES) mRNA, complete cds. ACCESSION M21121 NID g339420 KEYWORDS Alu repeat; T-cell-specific protein. SOURCE Human peripheral blood (T lymphocyte) cell line AH2, cDNA to mRNA, clone 228. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1160) AUTHORS Schall,T.J., Jongstra,J., Dyer,B.J., Jorgensen,J., Clayberger,C., Davis,M.M. and Krensky,A.M. TITLE A human T cell-specific molecule is a member of a new gene family JOURNAL J. Immunol. 141, 1018-1025 (1988) MEDLINE 88285659 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by A.M.Krensky, 24-OCT-1988. FEATURES Location/Qualifiers source 1..1160 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 27..302 /note="T cell-specific protein precursor" /codon_start=1 /db_xref="PID:g339421" /translation="MKVSAARLAVILIATALCAPASASPYSSDTTPCCFAYIARPLPR AHIKEYFYTSGKCSNPAVVFVTRKNRQVCANPEKKWVREYINSLEMS" sig_peptide 27..95 /note="T cell-specific protein signal peptide" mat_peptide 96..299 /note="T cell-specific protein" repeat_region 450..950 /note="Alu-related repeats" BASE COUNT 298 a 332 c 295 g 235 t ORIGIN 276 bp upstream of RsaI site. 1 cctccgacag cctctccaca ggtaccatga aggtctccgc ggcacgcctc gctgtcatcc 61 tcattgctac tgccctctgc gctcctgcat ctgcctcccc atattcctcg gacaccacac 121 cctgctgctt tgcctacatt gcccgcccac tgccccgtgc ccacatcaag gagtatttct 181 acaccagtgg caagtgctcc aacccagcag tcgtctttgt cacccgaaag aaccgccaag 241 tgtgtgccaa cccagagaag aaatgggttc gggagtacat caactctttg gagatgagct 301 aggatggaga gtccttgaac ctgaacttac acaaatttgc ctgtttctgc ttgctcttgt 361 cctagcttgg gaggcttccc ctcactatcc taccccaccc gctccttgaa gggcccagat 421 tctgaccacg acgagcagca gttacaaaaa ccttccccag gctggacgtg gtggctcagc 481 cttgtaatcc cagcactttg ggaggccaag gtgggtggat cacttgaggt caggagttcg 541 agacagcctg gccaacatga tgaaacccca tgtgtactaa aaatacaaaa aattagccgg 601 gcgtggtagc gggcgcctgt agtcccagct actcgggagg ctgaggcagg agaatggcgt 661 gaacccggga gcggagcttg cagtgagccg agatcgcgcc actgcactcc agcctgggcg 721 acagagcgag actccgtctc aaaaaaaaaa aaaaaaaaaa aaaaaataca aaaattagcc 781 gcgtggtggc ccacgcctgt aatcccagct actcgggagg ctaaggcagg aaaattgttt 841 gaacccagga ggtggaggct gcagtgagct gagattgtgc cacttcactc cagcctgggt 901 gacaaagtga gactccgtca caacaacaac aacaaaaagc ttccccaact aaagcctaga 961 agagcttctg aggcgctgct ttgtcaaaag gaagtctcta ggttctgagc tctggctttg 1021 ccttggcttt gcaagggctc tgtgacaagg aaggaagtca gcatgcctct agaggcaagg 1081 aagggaggaa cactgcactc ttaagcttcc gccgtctcaa cccctcacag gagcttactg 1141 gcaaacatga aaaatcgggg // LOCUS HUMTCTA 2146 bp mRNA PRI 25-APR-1996 DEFINITION Homo sapiens expressed pseudo TCTA mRNA at t(1;3) translocation site, complete cds. ACCESSION L41143 NID g736684 KEYWORDS translocation. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2146) AUTHORS Aplan,P.D., Johnson,B.E., Russell,E., Chervinsky,D.S. and Kirsch,I.R. TITLE Cloning and characterization of TCTA, a gene located at the site of a t(1;3) translocation JOURNAL Cancer Res. 55 (9), 1917-1921 (1995) MEDLINE 95246031 FEATURES Location/Qualifiers source 1..2146 /organism="Homo sapiens" /db_xref="taxon:9606" /map="3" gene 222..533 /gene="TCTA" CDS 222..533 /gene="TCTA" /codon_start=1 /db_xref="PID:g1019105" /translation="MAESWSGQALQALPATVLGALGSEFLREWEAQDMRVTLFKLLLL WLVLSLLGIQLAWGFYGNTVTGLYHRPGLGGQNGSTPDGSTHFPSWEMAANEPLKTHR E" polyA_site 2146 BASE COUNT 457 a 598 c 581 g 510 t ORIGIN 1 tactcccgga gtcactcatc ccttaagcaa gcagggtggg gttaggtgcg cgtgcgcggt 61 tttaatactc ctccccgaac tgccaactct tcacgcacgc gaagtaggcc ccaccctggc 121 tgggtttacg cgtgcgcact aacgggcctg gtcccggaag accacacgcg tgcgtggtgg 181 ggactacggt gacagtaccc cgggtggggc gagggccagt catggcggag tcctggtctg 241 ggcaggcctt gcaggctctg ccggccacgg tgctgggcgc gctgggcagc gagttcttgc 301 gggagtggga ggcgcaggac atgcgcgtga ccctcttcaa gctgctgctg ctgtggttgg 361 tgttaagtct cctgggcatc cagctggcgt gggggttcta cgggaataca gtgaccgggt 421 tgtatcaccg tccaggtctg ggtggtcaga atggatccac gcctgatggc tccacgcatt 481 tcccttcgtg ggaaatggca gcaaacgaac ctctcaaaac ccacagagaa taagggaagg 541 cagcagaggg tctccaaggg catcactggg tctgctggct tctacactgg gttctgctac 601 tccccagacc tcagggacaa ctgccggggg ttcagggttg gtagcaggga gtacccagtg 661 cctacagggc tgggcctctt ctgcctctta agcctgctcc ctcacccagg cactgggcaa 721 gtgaagagtt tgcctgtact cttatctggg tgccttaagg agagagattg tgttcttcct 781 ctctcagggg tgataactca ggaagcctct gggttgggaa gaccatcagt tcttttgtct 841 taggtttctt ttcctgtccc tcttccatcc ccaagatgtg accccataaa aatttttcct 901 gagttggcca ggcatggtgg ctcacgcctg taatcccaac actttgggag gctgaggcag 961 gcagatcacg aggtcaggag ttcgagacca gcctgaccaa catggtgaaa accccatctc 1021 tactaaaaat acaaaaatta gccgggtgtg gtggcacaca ccagtaatcc cagctactcg 1081 ggaggctgaa gcaggagatt tgcttgaacc tgggaggcag aggttgcagt gagccaagat 1141 tgcgccgttg tactccagcc tgggcaacag agcaagaccc atctcaaaaa aaaaattttt 1201 ttcctgagag gaagcctgag gttgaccagc tctggggttt gtaaggcagg tctgttttct 1261 cctaggccct gagttttctg aatctctggt tttgctttgt tggcaaggag ccagggaatc 1321 ctgacctgag ccagacctta agctctatgg ttatttagct ggccattcag gtataaggca 1381 gggtggtgta cctgctggca ctatccagat ggaggcacca aacacccaca tacctggccc 1441 aaccagactt ctcccgtgag ccaggcaaag gaaattgtca tctgccaact gtcctactca 1501 tattcctctc agtccttctt gggggtaagc tgattacctg aaggacagct gaacccctgg 1561 ggtagcctcc tatccaccac tgcttaagtg cctatgggaa tgtgggtctg caccttgtcc 1621 cctcatagga tggtaccaag catttagtgc acagtggccc catcatagcc tgcagcctca 1681 tcatttccca tctggacctg gtacaaatgc acgtcacagg ctcagctcct ccccactagc 1741 atcttctcta ccttcaagaa ccaggcagcc ctgccatgtc acaataggcc aggggagttt 1801 ccaaagatgt gggtggcaaa tgcccctata gaaacaccag tacctgaaag cactgtagcc 1861 ctggacctgc ctccttccct cggggccata cttctgtttc catctgctgg gccaccagcc 1921 actttagtga cccctgccta cttccttcct gttggatatc atacttccat ctggctgcct 1981 ttgcttaagc catctttgtg gtagaggggc cctggaattg cagctgtact gaggatgatg 2041 ttattcacag cccctggccc acccactaat actactgcac agagtcagga tctcacattt 2101 caccccaggc tcaactgagg atgtggctta ttaaacacgg aagtgc // LOCUS HUMTCVBD 852 bp mRNA PRI 23-OCT-1991 DEFINITION Homo sapiens T cell receptor V chain mRNA,complete cds. ACCESSION M77498 NID g339424 KEYWORDS T-cell receptor; T-cell receptor beta-chain. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 852) AUTHORS Novonty,J., Ganju,R.K., Smiley,S.T., Hussey,R.E., Luther,M.A., Recny,M.A., Siliciano,R.F. and Reinherz,E.L. TITLE A soluble single chain T cell receptor fragment endowed with antigen combining properties JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1991) In press FEATURES Location/Qualifiers source 1..852 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 40..105 CDS 40..846 /codon_start=1 /db_xref="PID:g339425" /translation="MKYLLPTAAAGLLLLAAQPAMANAGVTQTPKFRVLKTGQSMTLL CAQDMNHEYMYWYRQDPGMGLRLIHYSVGEGTTAKGEVPDGYNVSRLKKQNFLLGLES AAPSQTSVYFCASRTATQPQHFGDGTRLSILPGGGGSGGGGSGGGGSGGGGSGAQQQV KQSPQSLIVQKGGISIINCAYENTAFDYFPWYQQFPGKGPALLIAIRPDVSEKKEGRF TISFNKSAKQFSLHIMDSQPGDSATYFCAASFSGNTPLVFGKGTRLSVIA" mat_peptide 106..438 /product="T-cell receptor beta chain" misc_feature 444..501 /note="homology to cloning vector M13mp18; putative" mat_peptide 508..843 /product="T-cell receptor alpha chain" BASE COUNT 217 a 203 c 220 g 212 t ORIGIN 1 gagctcgaat tcaaattcta tttcaaggag acagtcataa tgaaatacct attgcctacg 61 gcagccgctg gattgttatt actcgcggcc cagccggcca tggccaatgc tggtgtcact 121 cagaccccaa aattccgggt cctgaagaca ggacagagca tgacactgct gtgtgcccag 181 gatatgaacc atgaatacat gtactggtat cgacaagacc caggcatggg gctgaggctg 241 attcattact cagttggtga gggtacaact gccaaaggag aggtccctga tggctacaat 301 gtctccagat taaaaaaaca gaatttcctg ctggggttgg agtcggctgc tccctcccaa 361 acatctgtgt acttctgtgc cagcaggacg gccacgcagc cccagcattt tggtgatggg 421 actcgactct ccatcctacc cgggggcggt ggttctggtg gtggtggttc tggtggtggt 481 ggttctggtg gtggtggttc tggcgcccag cagcaggtga aacaaagtcc tcaatctttg 541 atagtccaga aaggagggat ttcaattata aactgtgctt atgagaacac tgcgtttgac 601 tactttccat ggtaccaaca attccctggg aaaggccctg cattattgat agccatacgt 661 ccagatgtga gtgaaaagaa agaaggaaga ttcacaatct ccttcaataa aagtgccaag 721 cagttctcat tgcatatcat ggattcccag cctggagact cagccaccta cttctgtgca 781 gcaagctttt caggaaacac acctcttgtc tttggaaagg gcacaagact ttctgtgatt 841 gcatgactgc ag // LOCUS HUMTDTA 2068 bp mRNA PRI 22-MAY-1991 DEFINITION Human terminal transferase mRNA, complete cds. ACCESSION M11722 M28451 NID g339436 KEYWORDS . SOURCE Human lymphoblastoid KM-3, cDNA to mRNA, clones pT711 and pT106. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2068) AUTHORS Peterson,R.C., Cheung,L.C., Mattaliano,R.J., White,S.T., Chang,L.M.S. and Bollum,F.J. TITLE Expression of human terminal deoxynucleotidyl transferase in Escherichia coli JOURNAL J. Biol. Chem. 260, 10495-10502 (1985) MEDLINE 85289229 REFERENCE 2 (bases 1 to 2068) AUTHORS Chang,L.M.S. and Bollum,F.J. TITLE Molecular biology of terminal transferase JOURNAL CRC Crit. Rev. Biochem. 21, 27-52 (1986) MEDLINE 86273291 FEATURES Location/Qualifiers source 1..2068 /organism="Homo sapiens" /db_xref="taxon:9606" /map="10q23-q24" gene 329..1855 /gene="LA0024A" CDS 329..1855 /gene="LA0024A" /codon_start=1 /product="terminal transferase" /db_xref="PID:g339437" /translation="MDPPRASHLSPRKKRPRQTGALMASSPQDIKFQDLVVFILEKKM GTTRRAFLMELARRKGFRVENELSDSVTHIVAENNSGSDVLEWLQAQKVQVSSQPELL DVSWLIECIGAGKPVEMTGKHQLVVRRDYSDSTNPGPPKTPPIAVQKISQYACQRRTT LNNCNQIFTDAFDILAENCEFRENEDSCVTFMRAASVLKSLPFTIISMKDTEGIPCLG SKVKGIIEEIIEDGESSEVKAVLNDERYQSFKLFTSVFGVGLKTSEKWFRMGFRTLSK VRSDKSLKFTRMQKAGFLYYEDLVSCVTRAEAEAVSVLVKEAVWAFLPDAFVTMTGGF RRGKKMGHDVDFLITSPGSTEDEEQLLQKVMNLWEKKGLLLYYDLVESTFEKLRLPSR KVDALDHFQKCFLIFKLPRQRVDSDQSSWQEGKTWKAIRVDLVLCPYERRAFALLGWT GSRFERDLRRYATHERKMILDNHALYDKTKRIFLKAESEEEIFAHLGLDYIEPWERNA " BASE COUNT 595 a 405 c 542 g 526 t ORIGIN 1 tcattgggtg attgatttct atgctccttg gtgtggacct tgccagaatt ttgctccaga 61 atttgagctc ttggctagga tgattaaagg aaaagtgaaa gctggaaaag tagactgtca 121 ggcttatgct cagacatgcc agaaagctgg gatcagggcc tatccaactg ttaagtttta 181 tttctacgaa agagcaaaga gaaattttca agaagagggg gggggggggg ccccccccaa 241 aaacccttcg tgtaggaggg tggcagtctc cctcccttct ggagacacca ccagatgggc 301 cagccagagg cagcagcagc ctcttcccat ggatccacca cgagcgtccc acttgagccc 361 tcggaagaag agaccccggc agacgggtgc cttgatggcc tcctctcctc aagacatcaa 421 atttcaagat ttggtcgtct tcattttgga gaagaaaatg ggaaccaccc gcagagcgtt 481 cctcatggag ctggcccgca ggaaagggtt cagggttgaa aatgagctca gtgattctgt 541 cacccacatt gtagcagaga acaactcggg ttcggatgtt ctggagtggc ttcaagcaca 601 gaaagtacaa gtcagctcac aaccagagct cctcgatgtc tcctggctga tcgaatgcat 661 aggagcaggg aaaccggtgg aaatgacagg aaaacaccag cttgttgtga gaagagacta 721 ttcagatagc accaacccag gccccccgaa gactccacca attgctgtac aaaagatctc 781 ccagtatgcg tgtcagagaa gaaccacttt aaacaactgt aaccagatat tcacggatgc 841 ctttgatata ctggctgaaa actgtgagtt tagagaaaat gaagactcct gtgtgacatt 901 tatgagagca gcttctgtat tgaaatctct gccattcaca atcatcagta tgaaggacac 961 agaaggaatt ccctgcctgg ggtccaaggt gaagggtatc atagaggaga ttattgaaga 1021 tggagaaagt tctgaagtta aagctgtgtt aaatgatgaa cgatatcaat ccttcaaact 1081 ctttacttct gtatttggag tggggctgaa gacttctgag aagtggttca ggatgggttt 1141 cagaactctg agtaaagtaa ggtcggacaa aagcctgaaa tttacacgaa tgcagaaagc 1201 aggatttctg tattatgaag accttgtcag ctgtgtgacc agggcagaag cagaggccgt 1261 cagtgtgctg gttaaagagg ctgtctgggc atttcttccg gatgctttcg tcaccatgac 1321 aggagggttc cggaggggta agaagatggg gcatgatgta gattttttaa ttaccagccc 1381 aggatcaaca gaggatgaag agcaactttt acagaaagtg atgaacttat gggaaaagaa 1441 gggattactt ttatattatg accttgtgga gtcaacattt gaaaagctca ggttgcctag 1501 caggaaggtt gatgctttgg atcattttca aaagtgcttt ctgattttca aattgcctcg 1561 tcaaagagtg gacagtgacc agtccagctg gcaggaagga aagacctgga aggccatccg 1621 tgtggattta gttctgtgcc cctacgagcg tcgtgccttt gccctgttgg gatggactgg 1681 ctcccggttt gagagagacc tccggcgcta tgccacacat gagcggaaga tgattctgga 1741 taaccatgct ttatatgaca agaccaagag gatattcctc aaagcagaaa gtgaagaaga 1801 aatttttgcg catctgggat tggattatat tgaaccgtgg gaaagaaatg cctaggaaag 1861 tgttgtcaac attttttcct attcttttca agttaaataa attatgcttc atattagtaa 1921 aagatgccat aggagagttt ggggttattt aggtcttatt gaaatgcaga ttgctactag 1981 aaataaataa ctttggaaac atgggaaggt gccactggta atgggtaagg ttctaatagg 2041 ccatgtttat gactgttgca tagaattc // LOCUS HUMTEF 1076 bp mRNA PRI 14-MAR-1997 DEFINITION Human mRNA for transcription elongation factor S-II, hS-II-T1, complete cds. ACCESSION D50495 NID g1217590 KEYWORDS transcription elongation factor S-II, hS-II-T1. SOURCE Homo sapiens lymphocyte cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Umehara,T., Kida,S., Yamamoto,T. and Horikoshi,M. TITLE Isolation and characterization of a cDNA encoding a new type of human transcription elongation factor S-II JOURNAL Gene 167 (1-2), 297-302 (1995) MEDLINE 96144291 REFERENCE 2 (bases 1 to 1076) AUTHORS Horikoshi,M. JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 1076) AUTHORS Horikoshi,M. TITLE Direct Submission JOURNAL Submitted (04-MAY-1995) to the DDBJ/EMBL/GenBank databases. Masami Horikoshi, The University of Tokyo, Institute of Molecular and Cellular Biosciences; Yayoi 1-1-1, Bunkyo-ku, Tokyo 113, Japan (E-mail:horikosh@imcbns.iam.u-tokyo.ac.jp, Tel:03-5802-3388, Fax:03-5684-8341) FEATURES Location/Qualifiers source 1..1076 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="lymphocyte" mRNA 1..1076 /note="human transcription elongation factor S-II, phS-II-T1" CDS 19..918 /codon_start=1 /product="transcription elongation factor S-II, hS-II-T1" /db_xref="PID:d1009724" /db_xref="PID:g1217591" /translation="MMGKEEEIARIARRLDKMVTKKSAEGAMDLLRELKAMPITLHLL QSTRVGMSVNALRKQSSDEEVIALAKSLIKSWKKLLDASDAKARERGRGMPLPTSSRD ASEAPDPSRKRPELPRAPSTPRITTFPPVPVTCDAVRNKCREMLTAALQTDHDHVAIG ADCERLSAQIEECIFRDVGNTDMKYKNRVRSRISNLKDAKNPDLRRNVLCGAITPQQI AVMTSEEMASDELKEIRKAMTKEAIREHQMARTGGTQTDLFTCGKCRKKNCTYTQVQT RSSDEPMTTFVVCNECGNRWKFC" BASE COUNT 233 a 323 c 324 g 196 t ORIGIN 1 gtcgctgctc ctgaggcgat gatgggcaag gaagaggaga ttgcgcggat cgcccggagg 61 ctggacaaga tggtgaccaa gaagagcgcg gagggagcca tggatttgct gcgggagctg 121 aaggccatgc ctatcacgct gcacctgctc cagtccaccc gagtcgggat gtctgtcaac 181 gcccttcgga agcagagctc ggatgaggag gtcattgcac tggccaagtc tctcatcaag 241 tcctggaaga agctcctgga tgcttccgat gccaaagcca gggagcgggg gaggggcatg 301 cctctgccca cgtcctcgag ggatgcctca gaggccccgg atcccagccg caagaggccg 361 gagctgccca gggcaccgtc gactccgagg atcaccacat ttcctccggt gcctgtcacc 421 tgtgatgccg tgcgcaacaa gtgccgcgag atgctgaccg ctgccctgca gacggaccat 481 gaccacgtgg ccatcggtgc ggactgcgag cgcctgtcgg ctcagatcga ggaatgcatc 541 ttccgggacg ttggaaacac agacatgaag tataagaacc gtgtacggag tcgtatctcc 601 aacctgaagg atgccaagaa ccctgacctg cggcggaatg tgctgtgtgg ggccataaca 661 ccccagcaga tcgctgtgat gacctcagag gagatggcca gtgatgagct gaaggagatc 721 cgtaaggcca tgaccaagga ggccatccga gagcaccaga tggcccgcac tggcggcacg 781 cagacagacc tgttcacctg cggcaagtgc aggaaaaaga actgcaccta cacacaggtg 841 cagacccgca gctctgatga gcccatgacc acctttgttg tctgcaacga gtgtggaaac 901 cgctggaagt tctgctgacc cctcgtgtag atgtgctgca gccttgggcc ctccccggcc 961 cacgtcctcc gttgacacag cttctctgga gaccctagaa ggcggcatgt cctgccctca 1021 acctgcctgc ctggattgca cctttctgcc ctttccccct cattattaaa tgtttc // LOCUS HUMTEF1 4443 bp DNA PRI 23-MAY-1996 DEFINITION Homo sapiens transcriptional enhancer factor (TEF1) DNA, complete CDS. ACCESSION M63896 NID g339440 KEYWORDS trans-acting transcriptional activator; transcription enhancer. SOURCE Homo sapiens (tissue library: ZAP-II random primed cDNA) DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4443) AUTHORS Xiao,J.H., Davidson,I., Matthes,H., Garnier,J.M. and Chambon,P. TITLE Cloning, expression, and transcriptional properties of the human enhancer factor TEF-1 JOURNAL Cell 65 (4), 551-568 (1991) MEDLINE 91235292 FEATURES Location/Qualifiers source 1..4443 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /cell_type="HeLa" /tissue_lib="ZAP-II random primed cDNA" 5'UTR 541..585 /gene="TEF-1" gene 541..3528 /gene="TEF-1" CDS 586..1821 /gene="TEF-1" /codon_start=1 /evidence=experimental /product="transcription enhancer factor" /db_xref="PID:g339441" /translation="MERMSDSADKPIDNDAEGVWSPDIEQSFQEALAIYPPCGRRKII LSDEGKMYGRNELIARYIKLRTGKTRTRKQVSSHIQVLARRKSRDFHSKLKDQTAKDK ALQHMAAMSSAQIVSATAIHNKLGLPGIPRPTFPGAPGFWPGMIQTGQPGSSQDVKPF VQQAYPIQPAVTAPIPGFEPASAPAPSVPAWQGRSIGTTKLRLVEFSAFLEQQRDPDS YNKHLFVHIGHANHSYSDPLLESVDIRQIYDKFPEKKGGLKELFGKGPQNAFFLVKFW ADLNCNIQDDAGAFYGVTSQYESSENMTVTCSTKVCSFGKQVVEKVETEYARFENGRF VYRINRSPMCEYMINFIHKLKHLPEKYMMNSVLENFTILLVVTNRDTQETLLCMACVF EVSNSEHGAQHHIYRLVKD" polyA_signal 2262..2268 /gene="TEF-1" /note="potential; putative" polyA_signal 2514..2519 /gene="TEF-1" /note="potential; putative" polyA_signal 2581..2586 /gene="TEF-1" /note="potential; putative" polyA_signal 2759..2764 /gene="TEF-1" /note="potential; putative" polyA_signal 2884..2890 /gene="TEF-1" /note="potential; putative" polyA_signal 3523..3528 /gene="TEF-1" /note="potential; putative" BASE COUNT 1226 a 1040 c 969 g 1208 t ORIGIN 1 cgccgcccgc cgcgggcgcc caccaagcac tttgcagact cgcttccacc ctgcgggcca 61 ttccgcgcgg cggggcccgg gcccggggcg gccgcgtcca ggcacaggcc atgcagtgac 121 gcccccccac ccctccacct ttgcccggac ggcgggcagc agcccagcgc gccagccggc 181 cccggggcag gagcggtgct aggcaggggt ggggtggccg ggcccaggga ccgggagccg 241 gggagggagc cgggcaccga gcagagggcg ggggaagcgg cgccgaagtt tgcctcggac 301 tcgccgggcg ctgcggtggc tccctgggcc gaggactgtt gctgccgctg ccgccgccgc 361 ttcattgcac attcaagtgg aaaattttca ggagtcagca gaaacattgt gtccaaaaaa 421 gactgagtcg cagttaccac caaacccagg aggagactct ccctggaaaa cttcccttcc 481 ctttcggttt attttcttga aaaggctcca ggcttcggct tggaaaatcc caccgccaaa 541 attgagccca gcagctggag cggcagtgag agccctgccg aaaacatgga aaggatgagt 601 gactctgcag ataagccaat tgacaatgat gcagaagggg tctggagccc cgacatcgag 661 caaagctttc aggaggccct ggctatctat ccaccatgtg ggaggaggaa aatcatctta 721 tcagacgaag gcaaaatgta tggtaggaat gaattgatag ccagatacat caaactcagg 781 acaggcaaga cgaggaccag aaaacaggtg tctagtcaca ttcaggttct tgccagaagg 841 aaatctcgtg attttcattc caagctaaag gatcagactg caaaggataa ggccctgcag 901 cacatggcgg ccatgtcctc agcccagatc gtctcggcca ctgccattca taacaagctg 961 gggctgcctg ggattccacg cccgaccttc ccaggggcgc cggggttctg gccgggaatg 1021 attcaaacag ggcagccagg atcctcacaa gacgtcaagc cttttgtgca gcaggcctac 1081 cccatccagc cagcggtcac agcccccatt ccagggtttg agcctgcatc ggccccagct 1141 ccctcagtcc ctgcctggca aggtcgctcc attggcacaa ccaagcttcg cctggtggaa 1201 ttttcagctt ttctcgagca gcagcgagac ccagactcgt acaacaaaca cctcttcgtg 1261 cacattgggc atgccaacca ttcttacagt gacccattgc ttgaatcagt ggacattcgt 1321 cagatttatg acaaatttcc tgaaaagaaa ggtggcttaa aggaactgtt tggaaagggc 1381 cctcaaaatg ccttcttcct cgtaaaattc tgggctgatt taaactgcaa tattcaagat 1441 gatgctgggg ctttttatgg tgtaaccagt cagtacgaga gttctgaaaa tatgacagtc 1501 acctgttcca ccaaagtttg ctcctttggg aagcaagtag tagaaaaagt agagacggag 1561 tatgcaaggt ttgagaatgg ccgatttgta taccgaataa accgctcccc aatgtgtgaa 1621 tatatgatca acttcatcca caagctcaaa cacttaccag agaaatatat gatgaacagt 1681 gttttggaaa acttcacaat tttattggtg gtaacaaaca gggatacaca agaaactcta 1741 ctctgcatgg cctgtgtgtt tgaagtttca aatagtgaac acggagcaca acatcatatt 1801 tacaggcttg taaaggactg aacatggtta tttatatata tagatatctg tatatacaca 1861 cacacatatg tgcacacaca cactctctct ccattatcga acgactgact gtaaacctca 1921 ccacacaggg tggtgccctg gccccgaggt caccccgact tttctaaatc ttgtttgagt 1981 gaagtcattt tttcatgtgt tcatactatc attgtagctg tgaagttctg gtacagttgt 2041 aaaaagagaa attgagttgt ttctctatgt tcttcagatg tgcagcccac aattcctcgg 2101 gaaaggtgaa cctgaacaac ccaagtctct ctctgcagag ccctgtttct aattgtggta 2161 gaaaatattg agacagagca tttgccatgg gacatttaca gcctttatac aaatgtattt 2221 agttctcttt tttccaacat aaaattcttg ttttaagata caagtaaaat taatctttaa 2281 atataaatgt aaattagtac acaaaactaa gaatctttag acttatcttt gtaactaatt 2341 agggtggaag ttatgaaaga atgtaattca ctaaattatt ttttaaatga aacctttttt 2401 tttctttttg aaaccaaatg ttaaactata gccttaagaa atgcttggta gaagtgtcct 2461 aatgagacaa atttgtactt ttatcctcaa ggttaacact aatctcctaa tccattaaac 2521 tcttgaacag gtattacaaa ggaagaaaac ttcacccctt atccttaaca tatatagtat 2581 atttaaaaaa tataaaattg tattgtacta atgtgatgat ggattattta atgaaaaaga 2641 aaaaatggct ctttttgcaa taagtagata catactgaaa aaatctaaac ttacaatgtt 2701 tatagtcttg tgtgtgcagt tatattttat atggacgacc aaatttttta ttaagatgag 2761 taaatatttg aaccactgaa ttttaataac aaaattttaa aattggcatg aatacggaat 2821 actgcactgt gagatgcaaa gtatacagaa tctgtggctg ggagaaaatt tcatcaaata 2881 gacaagtaaa aggctcatca gttttagcat ctctgctccc cagaaaattg taagcatcct 2941 caccagcctg tggatacatt ctttatttct agtgacccaa tatgcatatt aacctgctat 3001 aactagggct atatgtgtag gtatgtgtat acatatacac aaatgcacat atagagttaa 3061 cacatttagt gaacacttgt ttagtgtcac tcagtttgct aggtgctgat atgtacgtat 3121 atctcaatgt gtctgtagac ttagatacat cctcttgaag cacatccatt tctttagcgt 3181 ctctcagtaa gttacagtac ttgtttgact taggtttaag aggcccagct acctatctct 3241 gaccttttca aataggctca tttgggagat tcttttgcca ggagagattc aactttccaa 3301 tctaagtatt ccagagcatt gcccaggcag agttggtttg atgtggccag atgttttgag 3361 ttatttccct taagtgtttc actggggaga gaacagggag tgctcctcca gcttcccaaa 3421 gaaatatgtt tttgtaagtg gtaggaacat gtgcacacaa tagaacatga aataagtttt 3481 ttaacttgta aaacatgtca agatttttcc accaagctag aaaataaaaa acttagttct 3541 accacatcca attaacttac acaccccctt ccctgtctca acacctgctt tgaccctgct 3601 tttctattat tacatcagtc agcatcttgt ggtccctaac atgaggatgt ggctggctcg 3661 tgggaaacag caaaacacta agcctgacct ctcccaaatt gggaagacca gaggagaaag 3721 tgcaaaactg tccccatttg gaatgcccat tccttctaga aaccagttgg acagtgctcc 3781 tctgcccttc ataaacagac tactgttggg tccctgattc caggctggcc tgtgaaggat 3841 tgccccaggt gtcccctttc acggttgtca catttacagt gacttctgtt gaacacccct 3901 cttagggatg tttcttttgc tcttatttcc tgcatctttc caattgggaa gccccatcct 3961 ctcccaggac caggagttta tgaccaggcg agcacaaatg gctaaaagca agctgtccta 4021 gaacttcagt gggagagctg tctggttcat attctaccca ggaatggtac ttttcagtgc 4081 agccaggagg gctcttggga tttcctttcc aaagcacaaa aatactggga cccaagaaga 4141 acagctagag gacaactctg ttggcacaga gacggggaca gcccagtctg ctgacctcac 4201 agggtcagtg ggcccccctg gtgcttcacc acctgcatcc tcttgctcag aatgcctttg 4261 cagttgagtt ttctgggttt ctatgattga ccttgaggtt tactccttgc tcttacaaca 4321 tttctaagga tttttaaaag tttacttctt gtcttgttct tctaaagctt tctccaggac 4381 agatattttc cctgtcttaa ccactggtcc agtcatccca gtgggcttct ctttgtctct 4441 ccc // LOCUS HUMTEKRPTK 4138 bp mRNA PRI 14-JAN-1995 DEFINITION Homo sapiens receptor protein-tyrosine kinase (TEK) mRNA, complete cds. ACCESSION L06139 NID g292823 KEYWORDS receptor protein-tyrosine kinase; transmembrane protein; tyrosine kinase. SOURCE Homo sapiens placenta cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4138) AUTHORS Ziegler,S.F., Bird,T.A., Schneringer,J.A., Schooley,K.A. and Baum,P.R. TITLE Molecular cloning and characterization of a novel receptor protein tyrosine kinase from human placenta JOURNAL Oncogene 8 (3), 663-670 (1993) MEDLINE 93173509 FEATURES Location/Qualifiers source 1..4138 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" sig_peptide 149..202 /gene="TEK" CDS 149..3523 /gene="TEK" /codon_start=1 /product="receptor protein-tyrosine kinase" /db_xref="PID:g292824" /translation="MDSLASLVLCGVSLLLSGTVEGAMDLILINSLPLVSDAETSLTC IASGWRPHEPITIGRDFEALMNQHQDPLEVTQDVTREWAKKVVWKREKASKINGAYFC EGRVRGEAIRIRTMKMRQQASFLPATLTMTVDKGDNVNISFKKVLIKEEDAVIYKNGS FIHSVPRHEVPDILEVHLPHAQPQDAGVYSARYIGGNLFTSAFTRLIVRRCEAQKWGP ECNHLCTACMNNGVCHEDTGECICPPGFMGRTCEKACELHTFGRTCKERCSGQEGCKS YVFCLPDPYGCSCATGWKGLQCNEACHPGFYGPDCKLRCSCNNGEMCDRFQGCLCSPG WQGLQCEREGIPRMTPKIVDLPDHIEVNSGKFNPICKASGWPLPTNEEMTLVKPDGTV LHPKDFNHTDHFSVAIFTIHRILPPDSGVWVCSVNTVAGMVEKPFNISVKVLPKPLNA PNVIDTGHNFAVINISSEPYFGDGPIKSKKLLYKPVNHYEAWQHIQVTNEIVTLNYLE PRTEYELCVQLVRRGEGGEGHPGPVRRFTTASIGLPPPRGLNLLPKSQTTLNLTWQPI FPSSEDDFYVEVERRSVQKSDQQNIKVPGNLTSVLLNNLHPREQYVVRARVNTKAQGE WSEDLTAWTLSDILPPQPENIKISNITHSSAVISWTILDGYSISSITIRYKVQGKNED QHVDVKIKNATIIQYQLKGLEPETAYQVDIFAENNIGSSNPAFSHELVTLPESQAPAD LGGGKMLLIAILGSAGMTCLTVLLAFLIILQLKRANVQRRMAQAFQNVREEPAVQFNS GTLALNRKVKNNPDPTIYPVLDWNDIKFQDVIGEGNFGQVLKARIKKDGLRMDAAIKR MKEYASKDDHRDFAGELEVLCKLGHHPNIINLLGACEHRGYLYLAIEYAPHGNLLDFL RKSRVLETDPAFAIANSTASTLSSQQLLHFAADVARGMDYLSQKQFIHRDLAARNILV GENYVAKIADFGLSRGQEVYVKKTMGRLPVRWMAIESLNYSVYTTNSDVWSYGVLLWE IVSLGGTPYCGMTCAELYEKLPQGYRLEKPLNCDDEVYDLMRQCWREKPYERPSFAQI LVSLNRMLEERKTYVNTTLYEKFTYAGIDCSAEEAA" gene 149..3523 /gene="TEK" mat_peptide 203..3520 /gene="TEK" /note="a.a. domains: 746..742, transmembrane; 211..340, EGF-like repeats; 440..733, fibronectin type 3 repeats" /product="receptor protein-tyrosine kinase" BASE COUNT 1170 a 910 c 988 g 1070 t ORIGIN 1 cttctgtgct gttccttctt gcctctaact tgtaaacaag acgtactagg acgatgctaa 61 tggaaagtca caaaccgctg ggtttttgaa aggatccttg ggacctcatg cacatttgtg 121 gaaactggat ggagagattt ggggaagcat ggactcttta gccagcttag ttctctgtgg 181 agtcagcttg ctcctttctg gaactgtgga aggtgccatg gacttgatct tgatcaattc 241 cctacctctt gtatctgatg ctgaaacatc tctcacctgc attgcctctg ggtggcgccc 301 ccatgagccc atcaccatag gaagggactt tgaagcctta atgaaccagc accaggatcc 361 gctggaagtt actcaagatg tgaccagaga atgggctaaa aaagttgttt ggaagagaga 421 aaaggctagt aagatcaatg gtgcttattt ctgtgaaggg cgagttcgag gagaggcaat 481 caggatacga accatgaaga tgcgtcaaca agcttccttc ctaccagcta ctttaactat 541 gactgtggac aagggagata acgtgaacat atctttcaaa aaggtattga ttaaagaaga 601 agatgcagtg atttacaaaa atggttcctt catccattca gtgccccggc atgaagtacc 661 tgatattcta gaagtacacc tgcctcatgc tcagccccag gatgctggag tgtactcggc 721 caggtatata ggaggaaacc tcttcacctc ggccttcacc aggctgatag tccggagatg 781 tgaagcccag aagtggggac ctgaatgcaa ccatctctgt actgcttgta tgaacaatgg 841 tgtctgccat gaagatactg gagaatgcat ttgccctcct gggtttatgg gaaggacgtg 901 tgagaaggct tgtgaactgc acacgtttgg cagaacttgt aaagaaaggt gcagtggaca 961 agagggatgc aagtcttatg tgttctgtct ccctgacccc tatgggtgtt cctgtgccac 1021 aggctggaag ggtctgcagt gcaatgaagc atgccaccct ggtttttacg ggccagattg 1081 taagcttagg tgcagctgca acaatgggga gatgtgtgat cgcttccaag gatgtctctg 1141 ctctccagga tggcaggggc tccagtgtga gagagaaggc ataccgagga tgaccccaaa 1201 gatagtggat ttgccagatc atatagaagt aaacagtggt aaatttaatc ccatttgcaa 1261 agcttctggc tggccgctac ctactaatga agaaatgacc ctggtgaagc cggatgggac 1321 agtgctccat ccaaaagact ttaaccatac ggatcatttc tcagtagcca tattcaccat 1381 ccaccggatc ctcccccctg actcaggagt ttgggtctgc agtgtgaaca cagtggctgg 1441 gatggtggaa aagcccttca acatttctgt taaagttctt ccaaagcccc tgaatgcccc 1501 aaacgtgatt gacactggac ataactttgc tgtcatcaac atcagctctg agccttactt 1561 tggggatgga ccaatcaaat ccaagaagct tctatacaaa cccgttaatc actatgaggc 1621 ttggcaacat attcaagtga caaatgagat tgttacactc aactatttgg aacctcggac 1681 agaatatgaa ctctgtgtgc aactggtccg tcgtggagag ggtggggaag ggcatcctgg 1741 acctgtgaga cgcttcacaa cagcttctat cggactccct cctccaagag gtctaaatct 1801 cctgcctaaa agtcagacca ctctaaattt gacctggcaa ccaatatttc caagctcgga 1861 agatgacttt tatgttgaag tggagagaag gtctgtgcaa aaaagtgatc agcagaatat 1921 taaagttcca ggcaacttga cttcggtgct acttaacaac ttacatccca gggagcagta 1981 cgtggtccga gctagagtca acaccaaggc ccagggggaa tggagtgaag atctcactgc 2041 ttggaccctt agtgacattc ttcctcctca accagaaaac atcaagattt ccaacattac 2101 acactcctcg gctgtgattt cttggacaat attggatggc tattctattt cttctattac 2161 tatccgttac aaggttcaag gcaagaatga agaccagcac gttgatgtga agataaagaa 2221 tgccaccatc attcagtatc agctcaaggg cctagagcct gaaacagcat accaggtgga 2281 catttttgca gagaacaaca tagggtcaag caacccagcc ttttctcatg aactggtgac 2341 cctcccagaa tctcaagcac cagcggacct cggagggggg aagatgctgc ttatagccat 2401 ccttggctct gctggaatga cctgcctgac tgtgctgttg gcctttctga tcatattgca 2461 attgaagagg gcaaatgtgc aaaggagaat ggcccaagcc ttccaaaacg tgagggaaga 2521 accagctgtg cagttcaact cagggactct ggccctaaac aggaaggtca aaaacaaccc 2581 agatcctaca atttatccag tgcttgactg gaatgacatc aaatttcaag atgtgattgg 2641 ggagggcaat tttggccaag ttcttaaggc gcgcatcaag aaggatgggt tacggatgga 2701 tgctgccatc aaaagaatga aagaatatgc ctccaaagat gatcacaggg actttgcagg 2761 agaactggaa gttctttgta aacttggaca ccatccaaac atcatcaatc tcttaggagc 2821 atgtgaacat cgaggctact tgtacctggc cattgagtac gcgccccatg gaaaccttct 2881 ggacttcctt cgcaagagcc gtgtgctgga gacggaccca gcatttgcca ttgccaatag 2941 caccgcgtcc acactgtcct cccagcagct ccttcacttc gctgccgacg tggcccgggg 3001 catggactac ttgagccaaa aacagtttat ccacagggat ctggctgcca gaaacatttt 3061 agttggtgaa aactatgtgg caaaaatagc agattttgga ttgtcccgag gtcaagaggt 3121 gtacgtgaaa aagacaatgg gaaggctccc agtgcgctgg atggccatcg agtcactgaa 3181 ttacagtgtg tacacaacca acagtgatgt atggtcctat ggtgtgttac tatgggagat 3241 tgttagctta ggaggcacac cctactgcgg gatgacttgt gcagaactct acgagaagct 3301 gccccagggc tacagactgg agaagcccct gaactgtgat gatgaggtgt atgatctaat 3361 gagacaatgc tggcgggaga agccttatga gaggccatca tttgcccaga tattggtgtc 3421 cttaaacaga atgttagagg agcgaaagac ctacgtgaat accacgcttt atgagaagtt 3481 tacttatgca ggaattgact gttctgctga agaagcggcc taggacagaa catctgtata 3541 ccctctgttt ccctttcact ggcatgggag acccttgaca actgctgaga aaacatgcct 3601 ctgccaaagg atgtgatata taagtgtaca tatgtgctgg aattctaaca agtcataggt 3661 taatatttaa gacactgaaa aatctaagtg atataaatca gattcttctc tctcatttta 3721 tccctcacct gtagcatgcc agtcccgttt catttagtca tgtgaccact ctgtcttgtg 3781 tttccacagc ctgcaagttc agtccaggat gctaacatct aaaaatagac ttaaatctca 3841 ttgcttacaa gcctaagaat ctttagagaa gtatacataa gtttaggata aaataatggg 3901 attttctttt cttttctctg gtaatattga cttgtatatt ttaagaaata acagaaagcc 3961 tgggtgacat ttgggagaca tgtgacattt atatattgaa ttaatatccc tacatgtatt 4021 gcacattgta aaaagtttta gttttgatga gttgtgagtt taccttgtat actgtaggca 4081 cactttgcac tgatatatca tgagtgaata aatgtcttgc ctactcaaaa aaaaaaaa // LOCUS HUMTETTRAN 1758 bp mRNA PRI 24-JUN-1993 DEFINITION Human tetracycline transporter-like protein mRNA, complete cds. ACCESSION L11669 NID g307501 KEYWORDS tetracycline transporter-like protein. SOURCE Homo sapiens frontal cortex cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1758) AUTHORS Duyao,M.P., Taylor,S.M., Buckler,A.J., Ambrose,C.M., Lin,C., Groot,N., Church,D., Barnes,G., Wasmuth,J.J., Housman,D.E., MacDonald,M.E. and Gusella,J.F. TITLE A gene from chromosome 4p16.3 with similarity to a superfamily of transporter proteins JOURNAL Hum. Mol. Genet. 2, 673-676 (1993) MEDLINE 93357734 FEATURES Location/Qualifiers source 1..1758 /organism="Homo sapiens" /db_xref="taxon:9606" /germline /tissue_type="frontal cortex" /map="Chromosome 4p16.3" CDS 121..1488 /codon_start=1 /product="tetracycline transporter-like protein" /db_xref="PID:g307502" /translation="MGWGGGGGCTPRPPIHQQPPERRVVIVVFLGLLLDLLAFTLLLP LLPGLLESHGRAHDPLYGSWQGGVDWFATAIGMPVEKRYNSVLFGGLIGSAFSVLQFL CAPLTGATSDCLGRRPVMLLCLMGVATSYAVWATSRSFAAFLASRLIGGISKGNVSLS TAIVADLGSPLARSQGMAVIGVAFSLGFTLGPMLGASLPLEMAPWFALLFAASDLLFI FCFLPETLPLEKRAPSIALGFRDAADLLSPLALLRFSAVARGQDPPSGDRLSSLRRLG LVYFLYLFLFSGLEYTLSFLTHQRFQFSSLQQGKMFFLIGLTMATIQGAYARRIHPGG EVAAVKRALLLLVPAFLLIGWGRSLPVLGLGLLLYSFAAAVVVPCLSSVVAGYGSPGQ KGTVMGTLRSLGALARAAGPLVAASVYWLAGAQACFTTWSGLFLLPFFLLQKLSYPAQ TLKAE" polyA_site 1758 BASE COUNT 232 a 611 c 531 g 384 t ORIGIN 1 cgccccttta gggtgctcgc cggctgtcgg gtgtgggggt atgccaggcc ccggaggact 61 cggcttcccc gctaacccga cccgccgcac cccacccagg ccaggtcaga gcagcccacc 121 atgggatggg gagggggtgg aggctgcacc ccccgcccac ccatccacca gcagccgccg 181 gagcgccgcg tggtcatcgt tgtctttctc ggcctcctgc tggacctcct ggccttcacg 241 ctgctgctgc ccctgctgcc cgggctgttg gagagccacg gccgtgccca cgaccccctc 301 tatggctcct ggcagggcgg ggtggactgg tttgccaccg ccatcgggat gccagtggag 361 aagaggtaca acagtgtcct gttcggaggt ctcattggct cggcattctc tgtcctgcag 421 tttctgtgtg cgccactcac tggggccacc tctgactgct tggggaggcg cccggtgatg 481 ctgctgtgcc tgatgggtgt ggccacctca tatgcagtct gggccacctc tcggagcttt 541 gcggccttcc tggcctccag gctgattggg ggcatcagca aagggaacgt cagcctctcc 601 acggccatcg ttgctgacct gggctcgcct ctggcccgca gtcaaggcat ggcggtcatt 661 ggggtggcct tctcactggg cttcaccctg ggccctatgc tcggagcctc cctgcccctg 721 gaaatggcac cctggtttgc cctgctcttc gcagcctccg acctgctgtt catcttctgc 781 ttcctgccag agacgctgcc cctggagaaa cgggcgccct ctatcgccct ggggttccgt 841 gatgcggctg atctgctcag ccccctggcc ctgctgcgct tctcggctgt cgctcgtggc 901 caggacccac cctctggaga caggctcagc agcctgcgcc gcctgggcct agtctacttc 961 ctctacctct tcctgttctc gggcctggag tacacgctga gcttcctcac acaccagcgc 1021 ttccagttca gtagcctaca gcaggggaag atgtttttcc tcatcggcct caccatggcc 1081 accatccagg gtgcctatgc ccggcggatc caccctggcg gggaagttgc tgccgtgaag 1141 cgggccctcc tgctgctggt gcccgccttc ctcctcatcg gctggggacg ttctctgccc 1201 gtgctgggcc tggggctgct gctctactcc tttgccgccg ccgttgtggt gccctgcctg 1261 tcctccgtgg tcgctggcta tggctcacca gggcagaagg gcacggtcat gggtacactg 1321 cgcagcctag gtgctctggc cagggccgcg gggcccctgg tggccgcttc agtgtactgg 1381 ctggccgggg cccaggcctg cttcaccacg tggtccgggc tctttttgct ccccttcttc 1441 ctcctgcaga agctgagtta cccggcacag acgctcaagg ctgagtagct gagccactgt 1501 gcccaggctg tgggcaccag gcagagtggg agcctaggtc aggcccctgc ccactgcctg 1561 acccccaccc cccgccagtc cagggagacc ctgtgggtgg gggccggccc ctaagcagga 1621 agctcaggca gctcctccag acttacttac tccttcagtg actccgagct gcagcactcc 1681 aaggctgtca gggcttctgt ttgtttttta aactatgcac caggtttctg atgatgaaat 1741 aaagcacctg tttgtttt // LOCUS HUMTFSL1A 3880 bp mRNA PRI 07-MAR-1995 DEFINITION Homo sapiens transcription factor SL1 mRNA, complete cds. ACCESSION L39059 NID g632995 KEYWORDS transcription factor; transcription factor SL1. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3880) AUTHORS Comai,L., Zomerdijk,J.C., Beckmann,H., Zhou,S., Admon,A. and Tjian,R. TITLE Reconstitution of transcription factor SL1: exclusive binding of TBP by SL1 or TFIID subunits JOURNAL Science 266 (5193), 1966-1972 (1994) MEDLINE 95099321 FEATURES Location/Qualifiers source 1..3880 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa" /dev_stage="adult" /clone="phTAF I 110" mRNA 1..3880 /product="transcription factor SL1" CDS 185..2794 /codon_start=1 /product="transcription factor SL1" /db_xref="PID:g695800" /translation="MDFPSSLRPALFLTGPLGLSDVPDLSFMCSWRDALTLPEAQPQN SENGALHVTKDLLWEPATPGPLPMLPPLIDPWDPGLTARDLLFRGGYRYRKRPRVVLD VTEQISRFLLDHGDVAFAPLGKLMLENFKLEGAGSRTKKKTVVSVKKLLQDLGGHQPW GCPWAYLSNRQRRFSILGGPILGTSVASHLAELLHEELVLRWEQLLLDEACTGGALAW VPGRTPQFGQLVYPAGGAQDRLHFQEVVLTPGDNPQFLGKPGRIQLQGPVRQVVTCTV QGESKALIYTFLPHWLTCYLTPGPFHPSSALLAVRSDYHCAVWKFGKQWQPTLLQAMQ VEKGATGISLSPHLPGELAICSRSGAVCLWSPEDGLRQIYRDPETLVFRDSSSWRWAD FTAHPRVLTVGDRTGVKMLDTQGPPGCGLLLFRLGAEASCQKGERVLLTQYLGHSSPK CLPPTLHLVCTQFSLYLVDERLPLVPMLKWNHGLPSPLLLARLLPPPRPSCVQPLLLG GQGGQLQLLHLAGEGASVPRLAGPPQSLPSRIDSLPAFPLLEPKIQWRLQERLKAPTI GLAAVVPPLPSAPTPGLVLFQLSAAGDVFYQQLRPQVDSSLRRDAGPPGDTQPDCHAP TASWTSQDTAGCSQWLKALLKVPLAPPVWTAPTFTHRQMLGSTELRREEEEGQRLGVL RKAMARGQLLLQRDLGSLPAAEPPPAPESGLEDKLSERLGEAWAGRGAAWWERQQGRT SEPGRQTRRPKRRTQLSSSFSLSGHVDPSEDTSSPHSPEWPPADALPLPPTTPPSQEL TPDACAQGVPSEQRQMLRDYMAKLPPQRDTPGCATTPPHSQASSVRATRSQQHTPVLS SSQPLRKKPRMGF" BASE COUNT 683 a 1233 c 1155 g 809 t ORIGIN 1 gctcgagtgc caaagctggg gttctacttg agatttccct cgtggtgcca gggtccggcg 61 agcatcacgc cgaggcccat tttccagacg accacgacga ggccggggtc acgaactctg 121 gcgcccctta ccagcttcca gtctctcgag gtggccagtg tggtgcttgg tccttgtttc 181 caggatggac ttccccagct ccctccgccc tgcgttgttt ctgaccggcc cccttggtct 241 gagcgacgtc cctgacctct ctttcatgtg cagctggcga gacgcactga ctctgccaga 301 ggcccagccc cagaactcag agaatggggc actgcatgtg accaaggacc tgctgtggga 361 gccggcaacc cctgggcctc tccccatgct gcctcccctc atcgatccct gggaccctgg 421 cctgactgcc cgggacctgc ttttccgcgg agggtaccgg tatcggaagc ggccccgagt 481 cgtgctggat gtgactgagc agatcagccg gttcctcttg gatcatggag acgtagcctt 541 tgcgcccctg gggaagctga tgctggagaa tttcaagctg gagggagcgg ggagccgcac 601 taagaagaag acagtggtca gtgtgaagaa gctgctccag gacctcggtg gacaccagcc 661 ctgggggtgt ccctgggctt acctcagcaa ccgacagcgc cgcttctcta tcctcggggg 721 ccccatcctg ggcacgtcgg tggcgagcca cttggcagag ctgctgcacg aggagctggt 781 gctgcggtgg gagcagctgc ttctggatga ggcctgcact gggggcgcgc tggcctgggt 841 tcctggaagg acaccccagt tcgggcagct ggtctaccct gctggaggcg cccaggacag 901 gctgcatttc caagaggtcg ttctgacccc aggtgacaat ccccaattcc ttgggaaacc 961 tggacgcatc cagctccagg gacctgtccg gcaagtggtg acatgcaccg tccagggaga 1021 aagtaaggcc cttatataca ctttcctccc tcactggctg acctgctacc tgacccctgg 1081 ccctttccat ccctcctcag ctctgctggc cgtccgctct gactaccact gtgccgtgtg 1141 gaagtttggt aaacagtggc agccaaccct tctgcaggcg atgcaggtgg agaaaggggc 1201 cacggggatc agcctcagcc ctcacctgcc cggggagctg gccatctgca gccgctcggg 1261 agccgtctgc ctgtggagcc ctgaggatgg gctgcggcaa atctacaggg accctgagac 1321 cctcgtgttc cgggactcct cttcgtggcg ttgggcagac ttcactgcgc accctcgggt 1381 gctgaccgtg ggtgaccgca ccggagtgaa gatgctggac actcagggcc cgccgggctg 1441 tggtctgttg ctttttcgtt tgggggcaga ggcttcgtgc cagaaagggg aacgtgtcct 1501 gcttacccag tacctggggc actccagccc caaatgcctc ccccctactc ttcatctcgt 1561 ctgtacccag ttctctctct acctagtgga cgagcgcctt cccctggtgc cgatgctgaa 1621 gtggaaccat ggcctcccct ccccgctcct gctggcccga ctgctgcctc cgccccggcc 1681 cagctgcgtg cagcccctgc tcctcggagg ccagggtggg cagctgcagc tgctgcacct 1741 ggcaggagaa ggggcgtcgg tgccccgcct ggcaggcccc ccccagtctc ttccttccag 1801 gatcgactcc ctccctgcat ttcctctgct ggagcctaag atccagtggc ggctgcagga 1861 gcgcctgaaa gcaccgacca taggtctggc tgccgtcgtc ccgcccttgc cctcagcgcc 1921 cacaccaggc ctggtgctct tccagctctc ggcggcggga gatgtcttct accagcagct 1981 ccgcccccag gtggactcca gcctccgcag agatgctggg cctcctggcg acacccaacc 2041 tgactgccat gcccccacag cttcctggac ctcccaggac actgccggct gcagccagtg 2101 gctgaaggcc ctgctaaaag tgcccctggc tcctcctgtg tggacagcac ccaccttcac 2161 ccaccgccag atgctgggca gcacagagct gcggagggag gaagaggaag ggcagcggct 2221 gggtgtgctc cgcaaggcca tggcccgagg gcagctcctg ctgcagagag acctgggctc 2281 cctccctgcg gcagagccac cccctgcacc cgagtcaggc ctagaggaca agctcagtga 2341 gcgcctgggg gaagcctggg caggccgagg ggctgcctgg tgggagaggc agcagggcag 2401 gacctcggag cccgggagac agaccaggcg gcccaagcgc cggacccagc tgtccagcag 2461 cttttcgctc agtggccatg tggatccgtc agaggacacc agctcccctc atagccctga 2521 gtggccacct gctgatgctc tgcccctgcc ccccacgacc ccgccctccc aggagttgac 2581 tccggatgca tgcgcccagg gcgtcccatc agagcagcgg cagatgctcc gtgactacat 2641 ggccaagcta ccaccccaga gggacacccc aggctgtgcc accacacctc cccactccca 2701 ggcctccagc gtccgggcca ctcgctccca gcagcacaca cccgtcctct ctagctctca 2761 gcccctccgg aagaagcctc gaatgggctt ctgaggacac aaggtgggct gccctcaagc 2821 cccagagagc ccctcatcct tcctctggga ccagatgtgc cttccacagt tgaaacttga 2881 gaagcagagc tcgccacctt ctggaggcca ctgtgatgat gagccaagca atttggagcc 2941 aagttgaagg gacagggcaa caaaatacag tagtagtttc ttttgtattt tgtatattcg 3001 cctgaagatc atcccgcaag gcaggctgga ggtgccggtg ggcctgtgtt gctgggattt 3061 tagtctgtgc tgggaggcag ggctccgtgc gcctcagctg tgggggcctc aggcaggtcc 3121 ctcagttctc acgccttcct gtccagtgga atgggggcca ggagtgctgg ctcctcgtgt 3181 ttggtgaggg tggagtgagg cccctgcaga gctgctgatg aggtgggcac agcggccgtt 3241 ggcagctgct gttgtgggtt gctttgtcaa tctctgcccc ggtctgatgt ttcctacagg 3301 gagatgccgt ggatccaggt tcagggacta aatacacttg gcagctgaag atgaattgga 3361 atggtcacgt tttttaggct ggacagcgtc ccgccacagc tactacctga cactgagctc 3421 atgcagagag atgatggctg atgttccttc tcccttggga catgggtctg gcacctgtgg 3481 gctgtcgata gtgccctctg agcagagggt cacggtcatg tcagtttggg ggaattctct 3541 gttgtgcctc agagactccc ccctttcttt cctccctccc cttctcattt tgatgtctaa 3601 agcatcaagt ccctcttcct cagagtttct ctagctgcag tggaagattc tgttttcctg 3661 tggggaaaat gctcacttga gattttgcag ggacccgggt ctgtctggtt tctgatgaca 3721 tagtaagaga aaggtctttt ttcaggttgg ctggtgaaag gaattgcatg tgactcacac 3781 aaacaggagc tagcccaatc atacactgac tcgcgtgggt gtttaaatgt ttatcatgcc 3841 taagggagac atttataatt aaaccattta tgctacataa // LOCUS HUMTFSL1B 1578 bp mRNA PRI 07-MAR-1995 DEFINITION Homo sapiens transcription factor SL1 mRNA, complete cds. ACCESSION L39060 NID g632996 KEYWORDS transcription factor; transcription factor SL1. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1578) AUTHORS Comai,L., Zomerdijk,J.C., Beckmann,H., Zhou,S., Admon,A. and Tjian,R. TITLE Reconstitution of transcription factor SL1: exclusive binding of TBP by SL1 or TFIID subunits JOURNAL Science 266 (5193), 1966-1972 (1994) MEDLINE 95099321 FEATURES Location/Qualifiers source 1..1578 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="teratocarcinoma" /dev_stage="adult" /clone="phTAF I 48" mRNA 1..1578 /product="transcription factor SL1" CDS 25..1377 /codon_start=1 /product="transcription factor SL1" /db_xref="PID:g695801" /translation="MSDFSEELKGPVTDDEEVETSVLSGAGMHFPWLQTYVETVAIGG KRRKDFAQTTSACLSFIQEALLKHQWQQAAEYMYSYFQTLEDSDSYKRQAAPEIIWKL GSEILFYHPKSNMESFNTFANRMKNIGVMNYLKISLQHALYLLHHGMLKDAKRNLSEA ETWRHGENTSSREILINLIQAYKGLLQYYTWSEKKMELSKLDKDDYAYNAVAQDVFNH SWKTSANISALIKIPGVWDPFVKSYVEMLEFYGDRDGAQEVLTNYAYDEKFPSNPNAH IYLYNFLKRQKAPRSKLISVLKILYQIVPSHKLMLEFHTLLRKSEKEEHRKLGLEVLF GVLDFAGCTKNITAWKYLAKYLKNILMGNHLAWVQEEWNSRKNWWPGFHFSYFWAKSD WKEDTALACEKAFVAGLLLGKGCRYFRYILKQDHQILGKKIKRMKRSVKKYSIVNPRL " BASE COUNT 522 a 239 c 342 g 475 t ORIGIN 1 attccaagct aaatttaggc gggtatgagt gatttcagtg aagaattaaa agggcctgtg 61 acagatgatg aagaagtgga aacatctgtg ctcagtggtg caggaatgca ttttccttgg 121 cttcaaacat acgtagaaac tgtggccatt ggagggaaaa ggaggaagga ttttgctcag 181 acaacaagtg cttgtttaag ttttatccaa gaagctctgc tgaagcacca atggcagcaa 241 gctgcagaat acatgtacag ttattttcag accttggaag attcagatag ctacaaaagg 301 caggctgcac ctgagattat ttggaagctc ggaagtgaaa ttctatttta tcatcccaaa 361 agcaacatgg agagtttcaa tacttttgct aaccggatga aaaatattgg cgtcatgaat 421 tatttaaaga tctccttaca acatgcatta taccttctgc atcatggaat gcttaaagat 481 gctaagagaa atctgagtga ggcagagaca tggagacatg gtgaaaatac gtcttcccgg 541 gaaatattaa tcaaccttat tcaggcctat aaagggcttt tacagtatta tacctggtct 601 gaaaagaaga tggaattgtc aaagcttgat aaggatgatt atgcttacaa tgcagtagcc 661 caggatgtgt tcaaccacag ctggaagaca tctgcaaata tttctgcatt gattaaaatt 721 cctggagttt gggacccttt tgtgaagagt tatgtagaaa tgctggaatt ctatggggat 781 cgagatggag cccaagaggt actcaccaat tatgcatatg atgaaaagtt tccatcaaat 841 ccaaatgccc atatctactt atacaacttt ctaaagagac agaaggcacc aagatcaaaa 901 ttgataagtg tgcttaagat tttgtatcag attgtaccat ctcataaatt gatgttggaa 961 ttccatacat tacttagaaa atcagaaaaa gaagaacacc gtaaactggg gttggaggta 1021 ttatttggag tcttagattt tgccggatgc actaagaata taactgcttg gaaatacttg 1081 gcaaaatatc tgaaaaatat cttaatggga aaccaccttg cgtgggttca agaagagtgg 1141 aactccagga aaaactggtg gccagggttt catttcagct acttttgggc aaaaagtgat 1201 tggaaggaag atacagcttt ggcctgtgag aaagcttttg tggctggttt actgttagga 1261 aaaggttgta gatatttccg gtatatttta aagcaagatc accaaatctt agggaagaaa 1321 attaagcgga tgaagagatc tgtgaaaaaa tacagtattg taaatccaag actctgatac 1381 tgaattttag ttatttcaca gttgtagcta cacagtaagt agcttggtag atagttattg 1441 aatgtattta tgtagtgtat taagaagctt atattactac aaaaaactta tttttatata 1501 tttttatatt tttgtattat ttatagctag agaaacaata ttactgcctt tgctctttgt 1561 aactatgtct gttttctt // LOCUS HUMTGASE3A 2619 bp mRNA PRI 14-JAN-1995 DEFINITION Homo sapiens transglutaminase E3 (TGASE3) mRNA, complete cds. ACCESSION L10386 NID g307503 KEYWORDS transglutaminase E3. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2619) AUTHORS Kim,I.G., Gorman,J.J., Park,S.C., Chung,S.I. and Steinert,P.M. TITLE The deduced sequence of the novel protransglutaminase E (TGase3) of human and mouse JOURNAL J. Biol. Chem. 268 (17), 12682-12690 (1993) MEDLINE 93286109 FEATURES Location/Qualifiers source 1..2619 /organism="Homo sapiens" /db_xref="taxon:9606" /map="20" gene 42..2123 /gene="TGM3" CDS 42..2123 /gene="TGM3" /codon_start=1 /db_xref="GDB:G00-128-014" /product="transglutaminase E3" /db_xref="PID:g307504" /translation="MAALGVQSINWQKAFNRQAHHTDKFSSQELILRRGQNFQVLMIM NKGLGSNERLEFIDTTGPYPSESAMTKAVFPLSNGSSGGWSAVLQASNGNTLTISISS PASAPIGRYTMALQIFSQGGISSVKLGTFILLFNPWLNVDSVFMGNHAEREEYVQEDA GIIFVGSTNRIGMIGWNFGQFEEDILSICLSILDRSLNFRRDAATDVASRNDPKYVGR VLSAMINSNDDNGVLAGNWSGTYTGGRDPRSWDGSVEILKNWKKSGFSPVRYGQCWVF AGTLNTALRSLGIPSRVITNFNSAHDTDRNLSVDVYYDPMGNPLDKGSDSVWNFHVWN EGWFVRSDLGPPYGGWQVLDATPQERSQGVFQCGPASVIGVREGDVQLNFDMPFIFAE VNADRITWLYDNTTGKQWKNSVNSHTIGRYISTKAVGSNARMDVTDKYKYPEGSDQER QVFQKALGKLKPNTPFAATSSMGLETEEQEPSIIGKLKVAGMLAVGKEVNLVLLLKNL SRDTKTVTVNMTAWTIIYNGTLVHEVWKDSATMSLDPEEEAEHPIKISYAQYERYLKS DNMIRITAVCKVPDESEVVVERDIILDNPTLTLEVLNEARVRKPVNVQMLFSNPLDEP VRDCVLMVEGSGLLLGNLKIDVPTLGPKERSRVRFDILPSRSGTKQLLADFSCNKFPA IKAMLSIDVAE" polyA_signal 2592 /gene="TGM3" /note="G00-128-014" BASE COUNT 632 a 718 c 726 g 543 t ORIGIN 1 cctttagagg agcctgagaa gaggcagagg aagggcgaaa catggctgct ctaggagtcc 61 agagtatcaa ctggcagaag gccttcaacc gacaagcgca tcacacagac aagttctcca 121 gccaggagct catcttgcgg agaggccaaa acttccaggt cttaatgatc atgaacaaag 181 gccttggctc taacgaaaga ctggagttca ttgacaccac agggccttac ccctcagagt 241 cggccatgac gaaggctgtg tttccactct ccaatggcag tagtggtggc tggagtgcgg 301 tgcttcaggc cagcaatggc aatactctga ctatcagcat ctccagtcct gccagcgcac 361 ccataggacg gtacacaatg gccctccaga tcttctccca gggcggcatc tcctctgtga 421 aacttgggac gttcatactg ctttttaacc cctggctgaa tgtggatagc gtctttatgg 481 gtaaccatgc tgagagagaa gagtatgttc aggaagatgc cggcatcatc tttgtgggaa 541 gcacaaaccg aattggcatg attggctgga actttggaca gtttgaagaa gacattctca 601 gcatctgcct ctcaatcttg gataggagtc tgaatttccg ccgtgacgct gctactgatg 661 tggccagcag aaatgacccc aaatacgttg gccgggtgct gagtgccatg atcaatagca 721 atgatgacaa tggtgtgctt gctgggaatt ggagcggcac ttacaccggt ggccgggacc 781 caaggagctg ggacggcagc gtggagatcc tcaaaaattg gaaaaaatct ggcttcagcc 841 cagtccgata tggccagtgc tgggtctttg ctgggaccct caacacagcg ctgcggtctt 901 tggggattcc ttcccgggtg atcaccaact tcaactcagc tcatgacaca gaccgaaatc 961 tcagtgtgga tgtgtactac gaccccatgg gaaaccccct ggacaagggt agtgatagcg 1021 tatggaattt ccatgtctgg aatgaaggct ggtttgtgag gtctgacctg ggccccccgt 1081 acggtggatg gcaggtgttg gatgctaccc cgcaggaaag aagccaaggg gtgttccagt 1141 gcggccccgc ttcggtcatt ggtgttcgag agggtgatgt gcagctgaac ttcgacatgc 1201 cctttatctt cgcggaggtt aatgccgacc gcatcacctg gctgtacgac aacaccactg 1261 gcaaacagtg gaagaattcc gtgaacagtc acaccattgg caggtacatc agcaccaagg 1321 cggtgggcag caatgctcgc atggacgtca cggacaagta caagtaccca gaaggctctg 1381 accaggaaag acaagtgttc caaaaggctt tggggaaact taaacccaac acgccatttg 1441 ccgcgacgtc ttcgatgggt ttggaaacag aggaacagga gcccagcatc atcgggaagc 1501 tgaaggtcgc tggcatgctg gcagtaggca aagaagtcaa cctggtccta ctgctcaaaa 1561 acctgagcag ggatacgaag acagtgacag tgaacatgac agcctggacc atcatctaca 1621 acggcacgct tgtacatgaa gtgtggaagg actctgccac aatgtccctg gaccctgagg 1681 aagaggcaga acatcccata aagatctcgt acgctcagta tgagaggtac ctgaagtcag 1741 acaacatgat ccggatcaca gcggtgtgca aggtcccaga tgagtctgag gtggtggtgg 1801 agcgggacat catcctggac aaccccacct tgaccctgga ggtgctgaac gaggctcgtg 1861 tgcggaagcc tgtgaacgtg cagatgctct tctccaatcc actggatgag ccggtgaggg 1921 actgcgtgct gatggtggag ggaagcggcc tgctgttggg taacctgaag atcgacgtgc 1981 cgaccctagg gcccaaggag cggtcccggg tccgttttga tatcctgccc tcccggagtg 2041 gcaccaagca actgctcgcc gacttctcct gcaacaagtt ccctgcaatc aaggccatgt 2101 tgtccatcga cgtagccgaa tgaagggcgc tggtggcctc ccgtacaaac ttggacaaca 2161 cggagcaggg agagctcacc atggaatgaa ccccccgccc atgctgtccg gcctgggaaa 2221 ccctctccat ctcccaaggc tgccagacat ggactccggg ctccagcaca tccccctctc 2281 ctctccccca ggttggggct gggtccaccc tgtcctatga cttgatcact tttgcacatt 2341 ccctggccgt ttctccccag agctgcctgc tctgtgagcc ccacagccct gctcattcct 2401 cacgcccttc aatgctgcag gatggactgg cccctgaccc agggactctc caaacgggat 2461 acaggagaga agctggtcta gactgtttgc tgatccccaa cctgcacggg gcattcctgc 2521 ttctctctca ggccaccaca gagggcaggg gatggttagt cacctgcccc agcactcaca 2581 ccctaactca aaataaatgt taaataagtg cgatcacac // LOCUS HUMTGFB3C 4208 bp mRNA PRI 22-MAY-1995 DEFINITION Human transforming growth factor-beta type III receptor (TGF-beta) mRNA, complete cds. ACCESSION L07594 NID g818001 KEYWORDS betaglycan; endoglin; proteoglycan; transforming growth factor-beta type III receptor; transmembrane protein. SOURCE Homo sapiens (tissue library: ZAPII) placenta cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Lopez-Casillas,F., Cheifetz,S., Doody,J., Andres,J.L., Lane,W.S. and Massague,J. TITLE Structure and expression of the membrane proteoglycan betaglycan, a component of the TGF-beta receptor system JOURNAL Cell 67 (4), 785-795 (1991) MEDLINE 92034999 REFERENCE 2 (sites) AUTHORS Wang,X.F., Lin,H.Y., Ng-Eaton,E., Downward,J., Lodish,H.F. and Weinberg,R.A. TITLE Expression cloning and characterization of the TGF-beta type III receptor JOURNAL Cell 67 (4), 797-805 (1991) MEDLINE 92035000 REFERENCE 3 (bases 1 to 4208) AUTHORS Moren,A., Ichijo,H. and Miyazono,K. TITLE Molecular cloning and characterization of the human and porcine transforming growth factor-beta type III receptors JOURNAL Biochem. Biophys. Res. Commun. 189 (1), 356-362 (1992) MEDLINE 93080582 FEATURES Location/Qualifiers source 1..4208 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="placenta" /tissue_lib="ZAPII" gene 349..2898 /gene="TGFB type III receptor" CDS 349..2898 /gene="TGFB type III receptor" /codon_start=1 /product="transforming growth factor-beta type III receptor" /db_xref="PID:g339556" /translation="MTSHYVIAIFALMSFCLATAGPEPGALCELSPVSASHPVQALME SFTVLSGCASRGTTGLPQEVHVLNLALRQGPGQLQREVTLHLNPISSVHIHHKSVVFL LNSPHPLVWHLKTERLATGVSRLFLVSEGSVVQFSSANFSLTAETEERNFPHGNEHLL NWARKEYGAVTSFTELKIARNIYIKVGEDQVFPPKCNIGKNFLSLNYLAEYLQPKAAE GCVMSSQPQNEEVHIIELITPNSNPYSAFQVDITIDIRPSQEDLEVVKNLILILKCKK SVNWVIKSFDVKGSLKIIAPNSIGFGKESERSMTMTKSIRDDIPSTQGNLVKWALDNG YSPITSYTMAPVAIVFHLRLENNEEMGDEEVHTIPPELRILLDPGALPALQNPPIRGG EGQNGGLPFPFPDISRRVWNEEGEDGLPRPKDPVIPSIQLFPGLREPEEVQGSVDIAL SVKCDNEKMIVAVEKDSFQASGYSGMDVTLLDPTCKAKMNGTHFVLESPLNGCGTRPR WSALDGVVYYNSIVIQVPALGDSSGWPDGYEDLESGDNGFPGDMDEGDASLFTRPEIV VFNCSLQQVRNPSSFQEQPHGNITFNMELYNTDLFLVPSQGVFSVPENGHVYVEVSVT KAEQELGFAIQTCFISPYSNPDRMSHYTIIENICPKDESVKFYSPKRVHFPIPQADMD KKRFSFVFKPVFNTSLLFLQCELTLCTKMEKHPQKLPKCVPPDEACTSLDASIIWAMM QNKKTFTKPLAVIHHEAESKEKGPSMKEPNPISPPIFHGLDTLTVMGIAFAAFVIGAL LTGALWYIYSHTGETAGRQQVPTSPPASENSSAAHSIGSTQSTPCSSSSTA" sig_peptide 364..411 /gene="TGFB type III receptor" mat_peptide 412..2895 /gene="TGFB type III receptor" /product="transforming growth factor-beta type III receptor" misc_binding 1942..1947 /gene="TGFB type III receptor" /note="putative" /bound_moiety="glycosaminoglycan" misc_binding 1975..1980 /gene="TGFB type III receptor" /note="putative" /bound_moiety="glycosaminoglycan" BASE COUNT 1150 a 1003 c 981 g 1074 t ORIGIN 1 tctttaagat ttgtagctac taagaaagaa aggagctttt tttccttggg ccttcaaact 61 gaaagaaccg catgagcctg acggcgcatg gtcttaacat caggctgtgc aggaagaagc 121 tatctgcaga tggatgccag cacacacaag gaagcagagc tctggcaaca ttgagtcaaa 181 gcaaggacac aacatcagag ggacggcaga gaatccttgt gtgtagtctt tggtggcagt 241 ttgaaaattg caaggaggga ctttaagact acttctgatt tgcaaagatg gtctgtgctc 301 cgagcaggct aaagtgactg gacgagacgc actgttggag aaataaaaat gacttcccat 361 tatgtgattg ccatctttgc cctgatgagc ttctgtttag ccactgcagg tccagagcct 421 ggtgcactgt gtgaactgtc acctgtcagt gcctcccatc ctgtccaggc cttgatggag 481 agcttcactg ttttgtcagg ctgtgccagc agaggcacaa ctgggctgcc acaggaggtg 541 catgtcctga atctcgcact gcgccagggg cctggccagc tacagagaga ggtcacactt 601 cacctgaatc ccatctcctc agtccacatc caccacaagt ctgttgtgtt cctgctcaac 661 tccccacacc ccctggtgtg gcatctgaag acagagagac ttgccactgg ggtctccaga 721 ctgtttttgg tgtctgaggg ttctgtggtc cagttttcat cagcaaactt ctccttgaca 781 gcagaaacag aagaaaggaa cttcccccat ggaaatgaac atctgttaaa ttgggcccga 841 aaagagtatg gagcagttac ttcattcacc gaactcaaga tagcaagaaa catttatatt 901 aaagtggggg aagatcaagt gttccctcca aagtgcaaca tagggaagaa ttttctctca 961 ctcaattacc ttgctgagta ccttcaaccc aaagcagcag aagggtgtgt gatgtccagc 1021 cagccccaga atgaggaagt acacatcatc gagctaatca cccccaactc taacccctac 1081 agtgctttcc aggtggatat aacaattgat ataagacctt ctcaagagga tcttgaagtg 1141 gtcaaaaatc tcatcctgat cttgaagtgc aaaaagtctg tcaactgggt gatcaaatct 1201 tttgatgtta agggaagcct gaaaattatt gctcctaaca gtattggctt tggaaaagag 1261 agtgaaagat ctatgacaat gaccaaatca ataagagatg acattccttc aacccaaggg 1321 aatctggtga agtgggcttt ggacaatggc tatagtccaa taacttcata cacaatggct 1381 cctgtggcaa tagtatttca tcttcggctt gaaaataatg aggagatggg agatgaggaa 1441 gtccacacta ttcctcctga gctacggatc ctgctggacc ctggtgccct gcctgccctg 1501 cagaacccgc ccatccgggg aggggaaggc caaaatggag gccttccgtt tcctttccca 1561 gatatttcca ggagagtctg gaatgaagag ggagaagatg ggctccctcg gccaaaggac 1621 cctgtcattc ccagcataca actgtttcct ggtctcagag agccagaaga ggtgcaaggg 1681 agcgtggata ttgccctgtc tgtcaaatgt gacaatgaga agatgatcgt ggctgtagaa 1741 aaagattctt ttcaggccag tggctactcg gggatggacg tcaccctgtt ggatcctacc 1801 tgcaaggcca agatgaatgg cacacacttt gttttggagt ctcctctgaa tggctgcggt 1861 actcggcccc ggtggtcagc ccttgatggt gtggtctact ataactccat tgtgatacag 1921 gttccagccc ttggggacag tagtggttgg ccagatggtt atgaagatct ggagtcaggt 1981 gataatggat ttccgggaga tatggatgaa ggagatgctt ccctgttcac ccgacctgaa 2041 atcgtggtgt ttaattgcag ccttcagcag gtgaggaacc ccagcagctt ccaggaacag 2101 ccccacggaa acatcacctt caacatggag ctatacaaca ctgacctctt tttggtgccc 2161 tcccagggcg tcttctctgt gccagagaat ggacacgttt atgttgaggt atctgttact 2221 aaggctgaac aagaactggg atttgccatc caaacgtgct ttatctctcc atattcgaac 2281 cctgatagga tgtctcatta caccattatt gagaatattt gtcctaaaga tgaatctgtg 2341 aaattctaca gtcccaagag agtgcacttc cctatcccgc aagctgacat ggataagaag 2401 cgattcagct ttgtcttcaa gcctgtcttc aacacctcac tgctctttct acagtgtgag 2461 ctgacgctgt gtacgaagat ggagaagcac ccccagaagt tgcctaagtg tgtgcctcct 2521 gacgaagcct gcacctcgct ggacgcctcg ataatctggg ccatgatgca gaataagaag 2581 acgttcacca agccccttgc tgtgatccac catgaagcag aatctaaaga aaaaggtcca 2641 agcatgaagg aaccaaatcc aatttctcca ccaattttcc atggtctgga caccctaacc 2701 gtgatgggca ttgcgtttgc agcctttgtg atcggagcac tcctgacggg ggccttgtgg 2761 tacatctatt ctcacacagg ggagacagca ggaaggcagc aagtccccac ctccccgcca 2821 gcctcggaaa acagcagtgc tgcccacagc atcggcagca cgcagagcac gccttgctcc 2881 agcagcagca cggcctagcc caacccaggc ccaacccggc ccaacccagc ccagcccagc 2941 tcagctcagc tactccaagg gcaggaccaa tggctgagcc tcgtgtccag actcagaggg 3001 ctggattttg gttcccttgt aaagacagag tgaatttcag tataaagatc acccgttgta 3061 ttcaccccac acccagggct agtataaaca tgaccctggg cttctgtacc acactagaat 3121 tcatgtgaga aagctaaaat ggtggtcttc tccaccagcc cctcacaggc ttgggggttt 3181 tctatgtgaa acacatgcca gtttttaaaa tgctgctttg tccaggtgag aacatccata 3241 atttggggcc ctgagtttta cccagactca aggagttggt aaagggttaa tagccagata 3301 gtagaaccag tgaggagatg cggccaaaga ttctttatat ctgaaccaag atgtaaaaca 3361 agaaatgctt tgaggctttc taagcgatcc tcctgtctaa tttgcacctt tgtctggatg 3421 cactcttctg accttgctgc cacaacctgt ggggtctgat gtgtcccaag atgggtgctg 3481 ccctcaggga ctgcaccctg acaagtgtta aggcaacatt ccttgcttgt gccctgggcc 3541 aaaaccaatg ctgatgacct tatcagcttc ctgtttcttc ccatactgca tacaccactg 3601 caaaatgtct taatgcaaat tttgtatttc ttacaggcct acagaaattg aaaatgacca 3661 aaatcaggaa ccacagattt gtgcccattc ctaatatttt gttctgcaaa ttaatgtata 3721 atttgaggtg aaattcagtt ataaagtcaa ggacgaattt gcacagtgat atatttctat 3781 gtgtatgcaa gtacaagtat ataatatgtc acctggcaca ttcattttct cagttgaaga 3841 agagaaaatt tgaaaatgtc cttatgcttt tagagttgca acttaagtat atttggtagg 3901 gtgagtgttt ccactcaaaa tatgtcaact taaaaaaaaa taggcccttt cataaaaacc 3961 aaactgtagc aagatgcaaa tgcatggcaa atctgtcggt ctccagttgg ttatctgaat 4021 agtgtcacca attccaccaa gacagtgctg agattggaaa gggcactcat ttggattgcc 4081 ttacttctct tgccttaaat atatcccata tatttaatat gtcaaaaagg gcttgaggtg 4141 aatttcatta aatggaataa tatgatgcca ctttgcagct aaaataagct cagtgatacc 4201 tccttgtt // LOCUS HUMTGFBB 2153 bp mRNA PRI 15-JAN-1991 DEFINITION Human transforming growth factor-beta (tgf-beta) mRNA, complete cds. ACCESSION M60314 M38693 NID g339559 KEYWORDS transforming growth factor-beta. SOURCE Human cell line U-2 OS, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2153) AUTHORS Celeste,A.J., Iannazzi,J.A., Taylor,R.C., Hewick,R.M., Rosen,V., Wang,E.A. and Wozney,J.M. TITLE Identification of new tgf-beta family members present in bone-inductive protein purified from bovine bone JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 9843-9847 (1990) MEDLINE 91088608 FEATURES Location/Qualifiers source 1..2153 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="U-2 OS" gene 699..2063 /gene="tgf-beta" CDS 699..2063 /gene="tgf-beta" /codon_start=1 /product="transforming growth factor-beta" /db_xref="PID:g339560" /translation="MHLTVFLLKGIVGFLWSCWVLVGYAKGGLGDNHVHSSFIYRRLR NHERREIQREILSILGLPHRPRPFSPGKQASSAPLFMLDLYNAMTNEENPEESEYSVR ASLAEETRGARKGYPASPNGYPRRIQLSRTTPLTTQSPPLASLHDTNFLNDADMVMSF VNLVERDKDFSHQRRHYKEFRFDLTQIPHGEAVTAAEFRIYKDRSNNRFENETIKISI YQIIKEYTNRDADLFLLDTRKAQALDVGWLVFDITVTSNHWVINPQNNLGLQLCAETG DGRSINVKSAGLVGRQGPQSKQPFMVAFFKASEVLLRSVRAANKRKNQNRNKSSSHQD SSRMSSVGDYNTSEQKQACKKHELYVSFRDLGWQDWIIAPEGYAAFYCDGECSFPLNA HMNATNHAIVQTLVHLMFPDHVPKPCCAPTKLNAISVLYFDDSSNVILKKYRNMVVRS CGCH" BASE COUNT 703 a 420 c 450 g 580 t ORIGIN 1 ctggtatatt tgtgcctgct ggaggtggaa ttaacagtaa gaaggagaaa gggattgaat 61 ggacttacag gaaggatttc aagtaaattc agggaaacac atttacttga atagtacaac 121 ctagagtatt attttacact aagacgacac aaaagatgtt aaagttatca ccaagctgcc 181 ggacagatat atattccaac accaaggtgc agatcagcat agatctgtga ttcagaaatc 241 aggatttgtt ttggaaagag ctcaagggtt gagaagaact caaaagcaag tgaagattac 301 tttgggaact acagtttatc agaagatcaa cttttgctaa ttcaaatacc aaaggcctga 361 ttatcataaa ttcatatagg aatgcatagg tcatctgatc aaataatatt agccgtcttc 421 tgctacatca atgcagcaaa aactcttaac aactgtggat aattggaaat ctgagtttca 481 gctttcttag aaataactac tcttgacata ttccaaaata tttaaaatag gacaggaaaa 541 tcggtgagga tgttgtgctc agaaatgtca ctgtcatgaa aaataggtaa atttgttttt 601 tcagctactg ggaaactgta cctcctagaa ccttaggttt tttttttttt aagaggacaa 661 gaaggactaa aaatatcaac ttttgctttt ggacaaaaat gcatctgact gtatttttac 721 ttaagggtat tgtgggtttc ctctggagct gctgggttct agtgggttat gcaaaaggag 781 gtttgggaga caatcatgtt cactccagtt ttatttatag aagactacgg aaccacgaaa 841 gacgggaaat acaaagggaa attctctcta tcttgggttt gcctcacaga cccagaccat 901 tttcacctgg aaaacaagcg tcctctgcac ctctctttat gctggatctc tacaatgcca 961 tgaccaatga agaaaatcct gaagagtcgg agtactcagt aagggcatcc ttggcagaag 1021 agaccagagg ggcaagaaag ggatacccag cctctcccaa tgggtatcct cgtcgcatac 1081 agttatctcg gacgactcct ctgaccaccc agagtcctcc tctagccagc ctccatgata 1141 ccaactttct gaatgatgct gacatggtca tgagctttgt caacttagtt gaaagagaca 1201 aggatttttc tcaccagcga aggcattaca aagaatttcg atttgatctt acccaaattc 1261 ctcatggaga ggcagtgaca gcagctgaat tccggatata caaggaccgg agcaacaacc 1321 gatttgaaaa tgaaacaatt aagattagca tatatcaaat catcaaggaa tacacaaata 1381 gggatgcaga tctgttcttg ttagacacaa gaaaggccca agctttagat gtgggttggc 1441 ttgtctttga tatcactgtg accagcaatc attgggtgat taatccccag aataatttgg 1501 gcttacagct ctgtgcagaa acaggggatg gacgcagtat caacgtaaaa tctgctggtc 1561 ttgtgggaag acagggacct cagtcaaaac aaccattcat ggtggccttc ttcaaggcga 1621 gtgaggtact tcttcgatcc gtgagagcag ccaacaaacg aaaaaatcaa aaccgcaata 1681 aatccagctc tcatcaggac tcctccagaa tgtccagtgt tggagattat aacacaagtg 1741 agcaaaaaca agcctgtaag aagcacgaac tctatgtgag cttccgggat ctgggatggc 1801 aggactggat tatagcacca gaaggatacg ctgcatttta ttgtgatgga gaatgttctt 1861 ttccacttaa cgcccatatg aatgccacca accacgctat agttcagact ctggttcatc 1921 tgatgtttcc tgaccacgta ccaaagcctt gttgtgctcc aaccaaatta aatgccatct 1981 ctgttctgta ctttgatgac agctccaatg tcattttgaa aaaatataga aatatggtag 2041 tacgctcatg tggctgccac taatattaaa taatattgat aataacaaaa agatctgtat 2101 taaggtttat ggctgcaata aaaagcatac tttcagacaa acagaaaaaa aaa // LOCUS HUMTGFBIG 2691 bp mRNA PRI 14-JAN-1995 DEFINITION Human transforming growth factor-beta induced gene product (BIGH3) mRNA, complete cds. ACCESSION M77349 NID g339567 KEYWORDS transforming growth factor-beta induced protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2691) AUTHORS Skonier,J., Neubauer,M., Madisen,L., Bennett,K., Plowman,G.D. and Purchio,A.F. TITLE cDNA cloning and sequence analysis of beta ig-h3, a novel gene induced in a human adenocarcinoma cell line after treatment with transforming growth factor-beta JOURNAL DNA Cell Biol. 11 (7), 511-522 (1992) MEDLINE 93000472 FEATURES Location/Qualifiers source 1..2691 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="A549" /cell_type="adenocarcinoma" sig_peptide 48..92 /gene="BIGH3" /note="putative" CDS 48..2099 /gene="BIGH3" /note="putative" /codon_start=1 /product="transforming growth factor induced protein" /db_xref="PID:g339568" /translation="MALFVRLLALALALALGPAATLAGPAKSPYQLVLQHSRLRGRQH GPNVCAVQKVIGTNRKYFTNCKQWYQRKICGKSTVISYECCPGYEKVPGEKGCPAALP LSNLYETLGVVGSTTTQLYTDRTEKLRPEMEGPGSFTIFAPSNEAWASLPAEVLDSLV SNVNIELLNALRYHMVGRRVLTDELKHGMTLTSMYQNSNIQIHHYPNGIVTVNCARLL KADHHATNGVVHLIDKVISTITNNIQQIIEIEDTFETLRAAVAASGLNTMLEGNGQYT LLAPTNEAFEKIPSETLNRILGDPEALRDLLNNHILKSAMCAEAIVAGLSVETLEGTT LEVGCSGDMLTINGKAIISNKDILATNGVIHYIDELLIPDSAKTLFELAAESDVSTAI DLFRQAGLGNHLSGSERLTLLAPLNSVFKDGTPPIDAHTRNLLRNHIIKDQLASKYLY HGQTLETLGGKKLRVFVYRNSLCIENSCIAAHDKRGRYGTLFTMDRVLTPPMGTVMDV LKGDNRFSMLVAAIQSAGLTETLNREGVYTVFAPTNEAFRALPPRERSRLLGDAKELA NILKYHIGDEILVSGGIGALVRLKSLQGDKLEVSLKNNVVSVNKEPVAEPDIMATNGV VHVITNVLQPPANRPQERGDELADSALEIFKQASAFSRASQRSVRLAPVYQKLLERMK H" gene 48..2099 /gene="BIGH3" mat_peptide 93..2096 /gene="BIGH3" /product="transforming growth factor-beta induced protein" BASE COUNT 679 a 729 c 695 g 588 t ORIGIN 1 gcttgcccgt cggtcgctag ctcgctcggt gcgcgtcgtc ccgctccatg gcgctcttcg 61 tgcggctgct ggctctcgcc ctggctctgg ccctgggccc cgccgcgacc ctggcgggtc 121 ccgccaagtc gccctaccag ctggtgctgc agcacagcag gctccggggc cgccagcacg 181 gccccaacgt gtgtgctgtg cagaaggtta ttggcactaa taggaagtac ttcaccaact 241 gcaagcagtg gtaccaaagg aaaatctgtg gcaaatcaac agtcatcagc tacgagtgct 301 gtcctggata tgaaaaggtc cctggggaga agggctgtcc agcagcccta ccactctcaa 361 acctttacga gaccctggga gtcgttggat ccaccaccac tcagctgtac acggaccgca 421 cggagaagct gaggcctgag atggaggggc ccggcagctt caccatcttc gcccctagca 481 acgaggcctg ggcctccttg ccagctgaag tgctggactc cctggtcagc aatgtcaaca 541 ttgagctgct caatgccctc cgctaccata tggtgggcag gcgagtcctg actgatgagc 601 tgaaacacgg catgaccctc acctctatgt accagaattc caacatccag atccaccact 661 atcctaatgg gattgtaact gtgaactgtg cccggctcct gaaagccgac caccatgcaa 721 ccaacggggt ggtgcacctc atcgataagg tcatctccac catcaccaac aacatccagc 781 agatcattga gatcgaggac acctttgaga cccttcgggc tgctgtggct gcatcagggc 841 tcaacacgat gcttgaaggt aacggccagt acacgctttt ggccccgacc aatgaggcct 901 tcgagaagat ccctagtgag actttgaacc gtatcctggg cgacccagaa gccctgagag 961 acctgctgaa caaccacatc ttgaagtcag ctatgtgtgc tgaagccatc gttgcggggc 1021 tgtctgtaga gaccctggag ggcacgacac tggaggtggg ctgcagcggg gacatgctca 1081 ctatcaacgg gaaggcgatc atctccaata aagacatcct agccaccaac ggggtgatcc 1141 actacattga tgagctactc atcccagact cagccaagac actatttgaa ttggctgcag 1201 agtctgatgt gtccacagcc attgaccttt tcagacaagc cggcctcggc aatcatctct 1261 ctggaagtga gcggttgacc ctcctggctc ccctgaattc tgtattcaaa gatggaaccc 1321 ctccaattga tgcccataca aggaatttgc ttcggaacca cataattaaa gaccagctgg 1381 cctctaagta tctgtaccat ggacagaccc tggaaactct gggcggcaaa aaactgagag 1441 tttttgttta tcgtaatagc ctctgcattg agaacagctg catcgcggcc cacgacaaga 1501 gggggaggta cgggaccctg ttcacgatgg accgggtgct gaccccccca atggggactg 1561 tcatggatgt cctgaaggga gacaatcgct ttagcatgct ggtagctgcc atccagtctg 1621 caggactgac ggagaccctc aaccgggaag gagtctacac agtctttgct cccacaaatg 1681 aagccttccg agccctgcca ccaagagaac ggagcagact cttgggagat gccaaggaac 1741 ttgccaacat cctgaaatac cacattggtg atgaaatcct ggttagcgga ggcatcgggg 1801 ccctggtgcg gctaaagtct ctccaaggtg acaagctgga agtcagcttg aaaaacaatg 1861 tggtgagtgt caacaaggag cctgttgccg agcctgacat catggccaca aatggcgtgg 1921 tccatgtcat caccaatgtt ctgcagcctc cagccaacag acctcaggaa agaggggatg 1981 aacttgcaga ctctgcgctt gagatcttca aacaagcatc agcgttttcc agggcttccc 2041 agaggtctgt gcgactagcc cctgtctatc aaaagttatt agagaggatg aagcattagc 2101 ttgaagcact acaggaggaa tgcaccacgg cagctctccg ccaatttctc tcagatttcc 2161 acagagactg tttgaatgtt ttcaaaacca agtatcacac tttaatgtac atgggccgca 2221 ccataatgag atgtgagcct tgtgcatgtg ggggaggagg gagagagatg tactttttaa 2281 atcatgttcc ccctaaacat ggctgttaac ccactgcatg cagaaacttg gatgtcactg 2341 cctgacattc acttccagag aggacctatc ccaaatgtgg aattgactgc ctatgccaag 2401 tccctggaaa aggagcttca gtattgtggg gctcataaaa catgaatcaa gcaatccagc 2461 ctcatgggaa gtcctggcac agtttttgta aagcccttgc acagctggag aaatggcatc 2521 attataagct atgagttgaa atgttctgtc aaatgtgtct cacatctaca cgtggcttgg 2581 aggcttttat ggggccctgt ccaggtagaa aagaaatggt atgtagagct tagatttccc 2641 tattgtgaca gagccatggt gtgtttgtaa taataaaacc aaagaaacat a // LOCUS HUMTGFBIIR 2090 bp mRNA PRI 14-JAN-1995 DEFINITION Human TGF-beta type II receptor mRNA, complete cds. ACCESSION M85079 NID g339569 KEYWORDS transforming growth factor-beta type II receptor. SOURCE Homo sapiens (tissue library: lambda zapII) cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2090) AUTHORS Lin,H.Y., Wang,X.F., Ng-Eaton,E., Weinberg,R.A. and Lodish,H.F. TITLE Expression cloning of the TGF-beta type II receptor, a functional transmembrane serine/threonine kinase [published erratum appears in Cell 1992 Sep 18;70(6):following 1068] JOURNAL Cell 68 (4), 775-785 (1992) MEDLINE 92154690 FEATURES Location/Qualifiers source 1..2090 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Hep G2" /tissue_lib="lambda zapII" /map="1q41" gene 336..2039 /gene="TGFB2" CDS 336..2039 /gene="TGFB2" /codon_start=1 /db_xref="GDB:G00-120-436" /product="TGF-beta type II receptor" /db_xref="PID:g339570" /translation="MGRGLLRGLWPLHIVLWTRIASTIPPHVQKSVNNDMIVTDNNGA VKFPQLCKFCDVRFSTCDNQKSCMSNCSITSICEKPQEVCVAVWRKNDENITLETVCH DPKLPYHDFILEDAASPKCIMKEKKKPGETFFMCSCSSDECNDNIIFSEEYNTSNPDL LLVIFQVTGISLLPPLGVAISVIIIFYCYRVNRQQKLSSTWETGKTRKLMEFSEHCAI ILEDDRSDISSTCANNINHNTELLPIELDTLVGKGRFAEVYKAKLKQNTSEQFETVAV KIFPYEEYASWKTEKDIFSDINLKHENILQFLTAEERKTELGKQYWLITAFHAKGNLQ EYLTRHVISWEDLRKLGSSLARGIAHLHSDHTPCGRPKMPIVHRDLKSSNILVKNDLT CCLCDFGLSLRLDPTLSVDDLANSGQVGTARYMAPEVLESRMNLENAESFKQTDVYSM ALVLWEMTSRCNAVGEVKDYEPPFGSKVREHPCVESMKDNVLRDRGRPEIPSFWLNHQ GIQMVCETLTECWDHDPEARLTAQCVAERFSELEHLDRLSGRSCSEEKIPEDGSLNTT K" BASE COUNT 492 a 581 c 594 g 423 t ORIGIN 1 gttggcgagg agtttcctgt ttcccccgca gcgctgagtt gaagttgagt gagtcactcg 61 cgcgcacgga gcgacgacac ccccgcgcgt gcacccgctc gggacaggag ccggactcct 121 gtgcagcttc cctcggccgc cgggggcctc cccgcgcctc gccggcctcc aggcccctcc 181 tggctggcga gcgggcgcca catctggccc gcacatctgc gctgccggcc cggcgcgggg 241 tccggagagg gcgcggcgcg gagcgcagcc aggggtccgg gaaggcgccg tccgtgcgct 301 gggggctcgg tctatgacga gcagcggggt ctgccatggg tcgggggctg ctcaggggcc 361 tgtggccgct gcacatcgtc ctgtggacgc gtatcgccag cacgatccca ccgcacgttc 421 agaagtcggt taataacgac atgatagtca ctgacaacaa cggtgcagtc aagtttccac 481 aactgtgtaa attttgtgat gtgagatttt ccacctgtga caaccagaaa tcctgcatga 541 gcaactgcag catcacctcc atctgtgaga agccacagga agtctgtgtg gctgtatgga 601 gaaagaatga cgagaacata acactagaga cagtttgcca tgaccccaag ctcccctacc 661 atgactttat tctggaagat gctgcttctc caaagtgcat tatgaaggaa aaaaaaaagc 721 ctggtgagac tttcttcatg tgttcctgta gctctgatga gtgcaatgac aacatcatct 781 tctcagaaga atataacacc agcaatcctg acttgttgct agtcatattt caagtgacag 841 gcatcagcct cctgccacca ctgggagttg ccatatctgt catcatcatc ttctactgct 901 accgcgttaa ccggcagcag aagctgagtt caacctggga aaccggcaag acgcggaagc 961 tcatggagtt cagcgagcac tgtgccatca tcctggaaga tgaccgctct gacatcagct 1021 ccacgtgtgc caacaacatc aaccacaaca cagagctgct gcccattgag ctggacaccc 1081 tggtggggaa aggtcgcttt gctgaggtct ataaggccaa gctgaagcag aacacttcag 1141 agcagtttga gacagtggca gtcaagatct ttccctatga ggagtatgcc tcttggaaga 1201 cagagaagga catcttctca gacatcaatc tgaagcatga gaacatactc cagttcctga 1261 cggctgagga gcggaagacg gagttgggga aacaatactg gctgatcacc gccttccacg 1321 ccaagggcaa cctacaggag tacctgacgc ggcatgtcat cagctgggag gacctgcgca 1381 agctgggcag ctccctcgcc cgggggattg ctcacctcca cagtgatcac actccatgtg 1441 ggaggcccaa gatgcccatc gtgcacaggg acctcaagag ctccaatatc ctcgtgaaga 1501 acgacctaac ctgctgcctg tgtgactttg ggctttccct gcgtctggac cctactctgt 1561 ctgtggatga cctggctaac agtgggcagg tgggaactgc aagatacatg gctccagaag 1621 tcctagaatc caggatgaat ttggagaatg ctgagtcctt caagcagacc gatgtctact 1681 ccatggctct ggtgctctgg gaaatgacat ctcgctgtaa tgcagtggga gaagtaaaag 1741 attatgagcc tccatttggt tccaaggtgc gggagcaccc ctgtgtcgaa agcatgaagg 1801 acaacgtgtt gagagatcga gggcgaccag aaattcccag cttctggctc aaccaccagg 1861 gcatccagat ggtgtgtgag acgttgactg agtgctggga ccacgaccca gaggcccgtc 1921 tcacagccca gtgtgtggca gaacgcttca gtgagctgga gcatctggac aggctctcgg 1981 ggaggagctg ctcggaggag aagattcctg aagacggctc cctaaacact accaaatagc 2041 tcttatgggg caggctgggc atgtccaaag aggctgcccc tctcaccaaa // LOCUS HUMTGFBRS 1733 bp mRNA PRI 28-JAN-1994 DEFINITION Human TGF-b superfamily receptor type I mRNA, complete cds. ACCESSION L17075 NID g425147 KEYWORDS heteromeric receptor; serine/threonine kinase; type I receptor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1733) AUTHORS Attisano,L., Carcamo,J., Ventura,F., Weis,F.M., Massague,J. and Wrana,J.L. TITLE Identification of human activin and TGF beta type I receptors that form heteromeric kinase complexes with type II receptors JOURNAL Cell 75 (4), 671-680 (1993) MEDLINE 94061985 FEATURES Location/Qualifiers source 1..1733 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HUVEC" CDS 92..1603 /codon_start=1 /product="TGF-b superfamily receptor type I" /db_xref="PID:g425148" /translation="MTLGSPRKGLLMLLMALVTQGDPVKPSRGPLVTCTCESPHCKGP TCRGAWCTVVLVREEGRHPQEHRGCGNLHRELCRGRPTEFVNHYCCDSHLCNHNVSLV LEATQPPSEQPGTDGQLALILGPVLALLALVALGVLGLWHVRRRQEKQRGLHSELGES SLILKASEQGDSMLGDLLDSDCTTGSGSGLPFLVQRTVARQVALVECVGKGRYGEVWR GLWHGESVAVKIFSSRDEQSWFRETEIYNTVLLRHDNILGFIASDMTSRNSSTQLWLI THYHEHGSLYDFLQRQTLEPHLALRLAVSAACGLAHLHVEIFGTQGKPAIAHRDFKSR NVLVKSNLQCCIADLGLAVMHSQGSDYLDIGNNPRVGTKRYMAPEVLDEQIRTDCFES YKWTDIWAFGLVLWEIARRTIVNGIVEDYRPPFYDVVPNDPSFEDMKKVVCVDQQTPT IPNRLAADPVLSGLAQMMRECWYPNPSARLTALRIKKTLQKISNSPEKPKVIQ" BASE COUNT 336 a 510 c 558 g 329 t ORIGIN 1 tgatttcctc tgggcaggag ggagccacgg ccagcgcgtg tcacacttca tggctcttac 61 tccacctctc ttgctcctct ctgcagggac catgaccttg ggctccccca ggaaaggcct 121 tctgatgctg ctgatggcct tggtgaccca gggagaccct gtgaagccgt ctcggggccc 181 gctggtgacc tgcacgtgtg agagcccaca ttgcaagggg cctacctgcc ggggggcctg 241 gtgcacagta gtgctggtgc gggaggaggg gaggcacccc caggaacatc ggggctgcgg 301 gaacttgcac agggagctct gcagggggcg ccccaccgag ttcgtcaacc actactgctg 361 cgacagccac ctctgcaacc acaacgtgtc cctggtgctg gaggccaccc aacctccttc 421 ggagcagccg ggaacagatg gccagctggc cctgatcctg ggccccgtgc tggccttgct 481 ggccctggtg gccctgggtg tcctgggcct gtggcatgtc cgacggaggc aggagaagca 541 gcgtggcctg cacagcgagc tgggagagtc cagtctcatc ctgaaagcat ctgagcaggg 601 cgacagcatg ttgggggacc tcctggacag tgactgcacc acagggagtg gctcagggct 661 ccccttcctg gtgcagagga cagtggcacg gcaggttgcc ttggtggagt gtgtgggaaa 721 aggccgctat ggcgaagtgt ggcggggctt gtggcacggt gagagtgtgg ccgtcaagat 781 cttctcctcg agggatgaac agtcctggtt ccgggagact gagatctata acacagtgtt 841 gctcagacac gacaacatcc taggcttcat cgcctcagac atgacctccc gcaactcgag 901 cacgcagctg tggctcatca cgcactacca cgagcacggc tccctctacg actttctgca 961 gagacagacg ctggagcccc atctggctct gaggctagct gtgtccgcgg catgcggcct 1021 ggcgcacctg cacgtggaga tcttcggtac acagggcaaa ccagccattg cccaccgcga 1081 cttcaagagc cgcaatgtgc tggtcaagag caacctgcag tgttgcatcg ccgacctggg 1141 cctggctgtg atgcactcac agggcagcga ttacctggac atcggcaaca acccgagagt 1201 gggcaccaag cggtacatgg cacccgaggt gctggacgag cagatccgca cggactgctt 1261 tgagtcctac aagtggactg acatctgggc ctttggcctg gtgctgtggg agattgcccg 1321 ccggaccatc gtgaatggca tcgtggagga ctatagacca cccttctatg atgtggtgcc 1381 caatgacccc agctttgagg acatgaagaa ggtggtgtgt gtggatcagc agacccccac 1441 catccctaac cggctggctg cagacccggt cctctcaggc ctagctcaga tgatgcggga 1501 gtgctggtac ccaaacccct ctgcccgact caccgcgctg cggatcaaga agacactaca 1561 aaaaattagc aacagtccag agaagcctaa agtgattcaa tagcccagga gcacctgatt 1621 cctttctgcc tgcagggggc tgggggggtg gggggcagcg gatggtgcct atctgggtag 1681 aggtagtgtg agtgtgtgtg tgctgggatg gcagctgcgc ctgctgctcg ccc // LOCUS HUMTHBP 2514 bp mRNA PRI 14-JAN-1995 DEFINITION Human thyroid hormone binding protein (p55) mRNA, complete cds. ACCESSION J02783 NID g339646 KEYWORDS membrane-associated protein; thyroid hormone binding protein. SOURCE Human epidermoid carcinoma cell line A431, cDNA to mRNA, clone p5A5. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2145) AUTHORS Cheng,S.Y., Gong,Q.H., Parkison,C., Robinson,E.A., Appella,E., Merlino,G.T. and Pastan,I. TITLE The nucleotide sequence of a human cellular thyroid hormone binding protein present in endoplasmic reticulum JOURNAL J. Biol. Chem. 262 (23), 11221-11227 (1987) MEDLINE 87280213 REFERENCE 2 (bases 2146 to 2514) AUTHORS Cheng,S.-Y. JOURNAL Unpublished (1989) COMMENT [2] revises [1]. Draft entry and computer-readable sequence for [1] kindly provided by S.-y.Cheng, 15-DEC-1988. FEATURES Location/Qualifiers source 1..2514 /organism="Homo sapiens" /db_xref="taxon:9606" /map="17q25" mRNA <1..2514 /note="P4HB mRNA" sig_peptide 64..114 /gene="P4HB" /note="pot. thyroid hormone binding protein signal peptide; putative" gene 64..1590 /gene="P4HB" CDS 64..1590 /gene="P4HB" /note="thyroid hormone binding protein precursor" /codon_start=1 /db_xref="GDB:G00-120-708" /db_xref="PID:g339647" /translation="MLRRALLCLAVAALVRADAPEEEDHVLVLRKSNFAEALAAHKYL LVEFYAPWCGHCKALAPEYAKAAGKLKAEGSEIRLAKVDATEESDLAQQYGVRGYPTI KFFRNGDTASPKEYTAGREADDIVNWLKKRTGPAATTLRDGAAAESLVESSEVAVIGF FKDVESDSAKQFLQAAEAIDDIPFGITSNSDVFSKYQLDKDGVVLFKKFDEGRNNFEG EVTKENLLDFIKHNQLPLVIEFTEQTAPKIFGGEIKTHILLFLPKSVSDYDGKLSNFK TAAESFKGKILFIFIDSDHTDNQRILEFFGLKKEECPAVRLITLEEEMTKYKPESEEL TAERITEFCHRFLEGKIKPHLMSQERAGDWDKQPVKVPVGKNFEDVAFDEKKNVFVEF YAPWCGHCKQLAPIWDKLGETYKDHENIVIAKMDSTANEVEAVKVHSFPTLKFFPASA DRTVIDYNGERTLDGFKKFLESGGQDGAGDDDDLEDLEEAEEPDMEEDDDQKAVKDEL " mat_peptide 115..1587 /gene="P4HB" /note="thyroid hormone binding protein" BASE COUNT 557 a 719 c 723 g 515 t ORIGIN 1 ctctcgtcgc ccccgctgtc ccggcggcgc caaccgaagc gccccgcctg atccgtgtcc 61 gacatgctgc gccgcgctct gctgtgcctg gccgtggccg ccctggtgcg cgccgacgcc 121 cccgaggagg aggaccacgt cctggtgctg cggaaaagca acttcgcgga ggcgctggcg 181 gcccacaagt acctgctggt ggagttctat gccccttggt gtggccactg caaggctctg 241 gcccctgagt atgccaaagc cgctgggaag ctgaaggcag aaggttccga gatcaggttg 301 gccaaggtgg acgccacgga ggagtctgac ctggcccagc agtacggcgt gcgcggctat 361 cccaccatca agttcttcag gaatggagac acggcttccc ccaaggaata tacagctggc 421 agagaggctg atgacatcgt gaactggctg aagaagcgca cgggcccggc tgccaccacc 481 ctccgtgacg gcgcagctgc agagtccttg gtggagtcca gcgaggtggc tgtcatcggc 541 ttcttcaagg acgtggagtc ggactctgcc aagcagtttt tgcaggcagc agaggccatc 601 gatgacatac catttgggat cacttccaac agtgacgtgt tctccaaata ccagctcgac 661 aaagatgggg ttgtcctctt taagaagttt gatgaaggcc ggaacaactt tgaaggggag 721 gtcaccaagg agaacctgct ggactttatc aaacacaacc agctgcccct tgtcatcgag 781 ttcaccgagc agacagcccc gaagattttt ggaggtgaaa tcaagactca catcctgctg 841 ttcttgccca agagtgtgtc tgactatgac ggcaaactga gcaacttcaa aacagcagcc 901 gagagcttca agggcaagat cctgttcatc ttcatcgaca gcgaccacac cgacaaccag 961 cgcatcctcg agttctttgg cctgaagaag gaagagtgcc cggccgtgcg cctcatcacc 1021 ctggaggagg agatgaccaa gtacaagccc gaatcggagg agctgacggc agagaggatc 1081 acagagttct gccaccgctt cctggagggc aaaatcaagc cccacctgat gagccaggag 1141 cgtgccggag actgggacaa gcagcctgtc aaggtgcctg ttgggaagaa ctttgaagac 1201 gtggcttttg atgagaaaaa aaacgtcttt gtggagttct atgccccatg gtgtggtcac 1261 tgcaaacagt tggctcccat ttgggataaa ctgggagaga cgtacaagga ccatgagaac 1321 atcgtcatcg ccaagatgga ctcgactgcc aacgaggtgg aggccgtcaa agtgcacagc 1381 ttccccacac tcaagttctt tcctgccagt gccgacagga cggtcattga ttacaacggg 1441 gaacgcacgc tggatggttt taagaaattc ctggagagcg gtggccagga tggggcaggg 1501 gatgatgacg atctcgagga cctggaagaa gcagaggagc cagacatgga ggaagacgat 1561 gatcagaaag ctgtgaaaga tgaactgtaa tacgcaaagc cagacccggg cgctgccgag 1621 acccctcggg gctgcacacc cagcagcagc gcacgcctcc gaagcctgcg gcctcgcttg 1681 aaggaggcgt cgccggaaac ccagggaacc tctctgaagt gacacctcac ccctacacac 1741 cgtccgttca cccccgtctc ttccttctgc ttttcggttt ttggaaaggg atccatctcc 1801 aggcagccca ccctggtggc ttgtttcctg aaaccatgat gtactttttc atacatgagt 1861 ctgtccagag tgcttgctac cgtgttcgga gtctcgctgc ctccctcccg cgggaggttt 1921 ctcctctttt tgaaaattcc gtctgtggga tttttagaca tttttcgaca tcagggtatt 1981 tgttccacct tggccaggcc tcctcggaga agcttgtccc ccgtgtggga gggacggagc 2041 cggactggac atggtcactc agtaccgcct gcagtgtcgc catgactgat catggctctt 2101 gcatttttgg gtaaatggag acttccggat cctgtcaggg tgtcccccat gcctggaaga 2161 ggagctggtg gctgccagcc ctggcggcgg cacagcctgg gcctcccctt ccctcaagcc 2221 agggctcctc ctcctgtcgt gggctcattt gccaggctca ggccaggtct ggacagctgt 2281 gactctcctc aagccaggac taccgaccag ccggctatgg gcacattacg tgaccactgg 2341 cctctctaca gcacggcctg tggcctgttc aaggcagaac cacgaccctt gactcccggg 2401 tggggaggtg gccaaggatg ctggagctga atcagacgct gacagttctt caggcatttc 2461 tatttcacaa tcgaattgaa cacattggcc aaataaagtt gaaattttac cacc // LOCUS HUMTHD 501 bp mRNA PRI 23-AUG-1995 DEFINITION Human thioredoxin (TXN) mRNA, complete cds. ACCESSION J04026 NID g339648 KEYWORDS thioredoxin. SOURCE Human cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 501) AUTHORS Wollman,E.E., d'Auriol,L., Rimsky,L., Shaw,A., Jacquot,J.P., Wingfield,P., Graber,P., Dessarps,F., Robin,P., Galibert,F., Bertoglio,J. and Fradeliz,D. TITLE Cloning and expression of a cDNA for human thioredoxin JOURNAL J. Biol. Chem. 263 (30), 15506-15512 (1988) MEDLINE 89008454 REFERENCE 2 (bases 1 to 501) AUTHORS Wollman,E.E. TITLE Direct Submission JOURNAL Submitted (24-AUG-1988) E.E. Wollman, Centre National de la Recherche Scientifique, UA 1156/Institut National de la Sante et de la Recherche Medicale/Institut Gustave Roussy, Villejuif, France FEATURES Location/Qualifiers source 1..501 /organism="Homo sapiens" /db_xref="taxon:9606" /map="3p12-p11" /cell_line="3B6" /tissue_type="EBV transformed lymphocytes" gene 64..381 /gene="TXN" CDS 64..381 /gene="TXN" /codon_start=1 /db_xref="GDB:G00-120-475" /product="thioredoxin" /db_xref="PID:g339649" /translation="MVKQIESKTAFQEALDAAGDKLVVVDFSATWCGPCKMINPFFHS LSEKYSNVIFLEVDVDDCQDVASECEVKCTPTFQFFKKGQKVGEFSGANKEKLEATIN ELV" BASE COUNT 149 a 100 c 105 g 147 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcgctt tggatccatt tccatcggtc cttacagccg ctcgtcagac tccagcagcc 61 aagatggtga agcagatcga gagcaagact gcttttcagg aagccttgga cgctgcaggt 121 gataaacttg tagtagttga cttctcagcc acgtggtgtg ggccttgcaa aatgatcaac 181 cctttctttc attccctctc tgaaaagtat tccaacgtga tattccttga agtagatgtg 241 gatgactgtc aggatgttgc ttcagagtgt gaagtcaaat gcacgccaac attccagttt 301 tttaagaagg gacaaaaggt gggtgaattt tctggagcca ataaggaaaa gcttgaagcc 361 accattaatg aattagtcta atcatgtttt ctgaaaacat aaccagccat tggctattta 421 aacttgtatt tttttattta caaaatataa atatgaagac ataaccagtt gccatctgcg 481 tgacaataaa cattatgcta a // LOCUS HUMTHMBX 400 bp mRNA PRI 29-APR-1992 DEFINITION Human thymosin beta 10 mRNA, complete cds. ACCESSION M92381 NID g339660 KEYWORDS thymosin beta 10. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 400) AUTHORS Hall,A.K., Hempstead,J. and Morgan,J.I. TITLE Thymosin beta 10 levels in developing human brain and its regulation by retinoic acid in the HTB-10 neuroblastoma JOURNAL Brain Res. Mol. Brain Res. 8 (2), 129-135 (1990) MEDLINE 90384336 FEATURES Location/Qualifiers source 1..400 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 36..152 /codon_start=1 /product="thymosin beta 10" /db_xref="PID:g339661" /translation="MGEIASFDKAKLKKTETQEKNTLPTKETIEQEKRSEIS" BASE COUNT 118 a 106 c 105 g 71 t ORIGIN 1 cggattgttt taagaaaatg gcagacaaac cagacatggg ggaaatcgcc agcttcgata 61 aggccaagct gaagaaaacg gagacgcagg agaagaacac cctgccgacc aaagagacca 121 ttgagcagga gaagcggagt gaaatttcct aagatcctgg agggatttcc tacccccgtc 181 ctcttcgaga ccccagtcgt gatgtggagg aagagccacc tgcaagatgg acacgagcca 241 caagctgcac tgtgaacctg ggcactccgc gccgatgcca ccggcctgcg ggtctctgaa 301 gggacccccc cccaatcgga ctgccaaatt ctccggtttg ccccgggata ttatagaaaa 361 ttatttgtat gaataatgaa aataaaacac acctcgtggc // LOCUS HUMTHRA1A 1876 bp DNA PRI 09-MAY-1995 DEFINITION Human thyroid hormone receptor alpha 1 (TR-alpha-1) gene, complete cds. ACCESSION M24748 NID g339662 KEYWORDS thryoid hormone receptor. SOURCE Homo sapiens (clone: lambda-Me2.) foetus skeletal muscle DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1876) AUTHORS Nakai,A., Sakurai,A., Bell,G.I. and DeGroot,L.J. TITLE Characterization of a third human thyroid hormone receptor coexpressed with other thyroid hormone receptors in several tissues JOURNAL Mol. Endocrinol. 2 (11), 1087-1092 (1988) MEDLINE 89127255 COMMENT Draft entry and computer readable sequence for [1] kindly submitted by A. Nakai, 15-MAY-1989. FEATURES Location/Qualifiers source 1..1876 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="lambda-Me2." /dev_stage="foetus" /tissue_type="skeletal muscle" /map="17q11.2-q12" CDS 58..72 /note="putative" /codon_start=1 /product="unknown protein" /db_xref="PID:g804785" /translation="MELK" gene 73..1305 /gene="THRA1" CDS 73..1305 /gene="THRA1" /codon_start=1 /db_xref="GDB:G00-120-730" /product="thyroid receptor alpha-1" /db_xref="PID:g339663" /translation="MEQKPSKVECGSDPEENSARSPDGKRKRKNGQCSLKTSMSGYIP SYLDKDEQCVVCGDKATGYHYRCITCEGCKGFFRRTIQKNLHPTYSCKYDSCCVIDKI TRNQCQLCRFKKCIAVGMAMDLVLDDSKRVAKRKLIEQNRERRRKEEMIRSLQQRPEP TPEEWDLIHIATEAHRSTNAQGSHWKQRRKFLPDDIGQSPIVSMPDGDKVDLEAFSEF TKIITPAITRVVDFAKKLPMFSELPCEDQIILLKGCCMEIMSLRAAVRYDPESDTLTL SGEMAVKREQLKNGGLGVVSDAIFELGKSLSAFNLDDTEVALLQAVLLMSTDRSGLLC VDKIEKSQEAYLLAFEHYVNHRKHNIPHFWPKLLMKVTDLRMIGACHASRFLHMKVEC PTELFPPLFLEVFEDQEV" BASE COUNT 444 a 531 c 551 g 350 t ORIGIN 356 bp upstream of PvuII site. 1 tgccgggggg gccagtgtgc ccaccccagt ctcttggcgt gctggagggc atcctggatg 61 gaattgaagt gaatggaaca gaagccaagc aaggtggagt gtgggtcaga cccagaggag 121 aacagtgcca ggtcaccaga tggaaagcga aaaagaaaga acggccaatg ttccctgaaa 181 accagcatgt cagggtatat ccctagttac ctggacaaag acgagcagtg tgtcgtgtgt 241 ggggacaagg caactggtta tcactaccgc tgtatcactt gtgagggctg caagggcttc 301 tttcgccgca caatccagaa gaacctccat cccacctatt cctgcaaata tgacagctgc 361 tgtgtcattg acaagatcac ccgcaatcag tgccagctgt gccgcttcaa gaagtgcatc 421 gccgtgggca tggccatgga cttggttcta gatgactcga agcgggtggc caagcgtaag 481 ctgattgagc agaaccggga gcggcggcgg aaggaggaga tgatccgatc actgcagcag 541 cgaccagagc ccactcctga agagtgggat ctgatccaca ttgccacaga ggcccatcgc 601 agcaccaatg cccagggcag ccattggaaa cagaggcgga aattcctgcc cgatgacatt 661 ggccagtcac ccattgtctc catgccggac ggagacaagg tggacctgga agccttcagc 721 gagtttacca agatcatcac cccggccatc acccgtgtgg tggactttgc caaaaaactg 781 cccatgttct ccgagctgcc ttgcgaagac cagatcatcc tcctgaaggg gtgctgcatg 841 gagatcatgt ccctgcgggc ggctgtccgc tacgaccctg agagcgacac cctgacgctg 901 agtggggaga tggctgtcaa gcgggagcag ctcaagaatg gcggcctggg cgtagtctcc 961 gacgccatct ttgaactggg caagtcactc tctgccttta acctggatga cacggaagtg 1021 gctctgctgc aggctgtgct gctaatgtca acagaccgct cgggcctgct gtgtgtggac 1081 aagatcgaga agagtcagga ggcgtacctg ctggcgttcg agcactacgt caaccaccgc 1141 aaacacaaca ttccgcactt ctggcccaag ctgctgatga aggtgactga cctccgcatg 1201 atcggggcct gccacgccag ccgcttcctc cacatgaaag tcgagtgccc caccgaactc 1261 ttccccccac tcttcctcga ggtctttgag gatcaggaag tctaaagcct caggcggcca 1321 gagggtgtgc ggagctggtg gggaggagcc tggagagaag gggcagagct gggggctgag 1381 ggagaccccc ccacacccct tctctccttc ctctcgtcct tggatagatt cagctcccac 1441 acacacaccc cgcactgccc aggtccctcc tcagacctcc agccctggga cagggcaaac 1501 aactgaactt gctatggaaa ggacagtgtg ggaggctggg ggagctgtgt cctgcagttc 1561 ccaggacccc atcctctcag aaggtagggg aagggcggga ggattgagaa gggacaagcc 1621 accttgaccg taggggaagg aggaatgtgg gctgggggaa gatgccctca actcaccccc 1681 tcacacacat gagagagagc ccccacccag ttccttggcc taggtctccc ctccaggctg 1741 agggcctctc tacttcccca gatgcctggg tgcaaagaac ggcttggctt ggctcctcct 1801 ctggaggtta aaatttatag tcattctaac tgcacttgga aaccaagcaa ggggagaaga 1861 caaatgaaga aaaact // LOCUS HUMTHRR 3472 bp mRNA PRI 10-OCT-1991 DEFINITION Human thrombin receptor mRNA, complete cds. ACCESSION M62424 NID g339676 KEYWORDS thrombin receptor. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3472) AUTHORS Vu,T.H., Hung,D.T., Wheaton,V.I. and Coughlin,S.R. TITLE Molecular cloning of a functional thrombin receptor reveals a novel proteolytic mechanism of receptor activation JOURNAL Cell 64, 1057-1068 (1991) MEDLINE 91168254 FEATURES Location/Qualifiers source 1..3472 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 225..1502 /codon_start=1 /product="thrombin receptor" /db_xref="PID:g339677" /translation="MGPRRLLLVAACFSLCGPLLSARTRARRPESKATNATLDPRSFL LRNPNDKYEPFWEDEEKNESGLTEYRLVSINKSSPLQKQLPAFISEDASGYLTSSWLT LFVPSVYTGVFVVSLPLNIMAIVVFILKMKVKKPAVVYMLHLATADVLFVSVLPFKIS YYFSGSDWQFGSELCRFVTAAFYCNMYASILLMTVISIDRFLAVVYPMQSLSWRTLGR ASFTCLAIWALAIAGVVPLVLKEQTIQVPGLNITTCHDVLNETLLEGYYAYYFSAFSA VFFFVPLIISTVCYVSIIRCLSSSAVANRSKKSRALFLSAAVFCIFIICFGPTNVLLI AHYSFLSHTSTTEAAYFAYLLCVCVSSISSCIDPLIYYYASSECQRYVYSILCCKESS DPSSYNSSGQLMASKMDTCSSNLNNSIYKKLLT" BASE COUNT 933 a 817 c 785 g 937 t ORIGIN 1 gcgcccgcgc gaccgcgcgc cccagtcccg ccccgccccg ctaaccgccc cagacacagc 61 gctcgccgag ggtcgcttgg accctgatct tacccgtggg caccctgcgc tctgcctgcc 121 gcgaagaccg gctccccgac ccgcagaagt caggagagag ggtgaagcgg agcagcccga 181 ggcggggcag cctcccggag cagcgccgcg cagagcccgg gacaatgggg ccgcggcggc 241 tgctgctggt ggccgcctgc ttcagtctgt gcggcccgct gttgtctgcc cgcacccggg 301 cccgcaggcc agaatcaaaa gcaacaaatg ccaccttaga tccccggtca tttcttctca 361 ggaaccccaa tgataaatat gaaccatttt gggaggatga ggagaaaaat gaaagtgggt 421 taactgaata cagattagtc tccatcaata aaagcagtcc tcttcaaaaa caacttcctg 481 cattcatctc agaagatgcc tccggatatt tgaccagctc ctggctgaca ctctttgtcc 541 catctgtgta caccggagtg tttgtagtca gcctcccact aaacatcatg gccatcgttg 601 tgttcatcct gaaaatgaag gtcaagaagc cggcggtggt gtacatgctg cacctggcca 661 cggcagatgt gctgtttgtg tctgtgctcc cctttaagat cagctattac ttttccggca 721 gtgattggca gtttgggtct gaattgtgtc gcttcgtcac tgcagcattt tactgtaaca 781 tgtacgcctc tatcttgctc atgacagtca taagcattga ccggtttctg gctgtggtgt 841 atcccatgca gtccctctcc tggcgtactc tgggaagggc ttccttcact tgtctggcca 901 tctgggcttt ggccatcgca ggggtagtgc ctctcgtcct caaggagcaa accatccagg 961 tgcccgggct caacatcact acctgtcatg atgtgctcaa tgaaaccctg ctcgaaggct 1021 actatgccta ctacttctca gccttctctg ctgtcttctt ttttgtgccg ctgatcattt 1081 ccacggtctg ttatgtgtct atcattcgat gtcttagctc ttccgcagtt gccaaccgca 1141 gcaagaagtc ccgggctttg ttcctgtcag ctgctgtttt ctgcatcttc atcatttgct 1201 tcggacccac aaacgtcctc ctgattgcgc attactcatt cctttctcac acttccacca 1261 cagaggctgc ctactttgcc tacctcctct gtgtctgtgt cagcagcata agctcgtgca 1321 tcgaccccct aatttactat tacgcttcct ctgagtgcca gaggtacgtc tacagtatct 1381 tatgctgcaa agaaagttcc gatcccagca gttataacag cagtgggcag ttgatggcaa 1441 gtaaaatgga tacctgctct agtaacctga ataacagcat atacaaaaag ctgttaactt 1501 aggaaaaggg actgctggga ggttaaaaag aaaagtttat aaaagtgaat aacctgagga 1561 ttctattagt ccccacccaa actttattga ttcacctcct aaaacaacag atgtacgact 1621 tgcatacctg ctttttatgg gagctgtcaa gcatgtattt ttgtcaatta ccagaaagat 1681 aacaggacga gatgacggtg ttattccaag ggaatattgc caatgctaca gtaataaatg 1741 aatgtcactt ctggatatag ctaggtgaca tatacatact tacatgtgtg tatatgtaga 1801 tgtatgcaca cacatatatt atttgcagtg cagtatagaa taggcacttt aaaacactct 1861 ttccccgcac cccagcaatt atgaaaataa tctctgattc cctgatttaa tatgcaaagt 1921 ctaggttggt agagtttagc cctgaacatt tcatggtgtt catcaacagt gagagactcc 1981 atagtttggg cttgtaccac ttttgcaaat aagtgtattt tgaaattgtt tgacggcaag 2041 gtttaagtta ttaagaggta agacttagta ctatctgtgc gtagaagttc tagtgttttc 2101 aattttaaac atatccaagt ttgaattcct aaaattatgg aaacagatga aaagcctctg 2161 ttttgatatg ggtagtattt tttacatttt acacactgta cacataagcc aaaactgagc 2221 ataagtcctc tagtgaatgt aggctggctt tcagagtagg ctattcctga gagctgcatg 2281 tgtccgcccc cgatggagga ctccaggcag cagacacatg ccagggccat gtcagacaca 2341 gattggccag aaaccttcct gctgagcctc acagcagtga gactggggcc actacatttg 2401 ctccatcctc ctgggattgg ctgtgaactg atcatgttta tgagaaactg gcaaagcaga 2461 atgtgatatc ctaggaggta atgaccatga aagacttctc tacccatctt aaaaacaacg 2521 aaagaaggca tggacttctg gatgcccatc cactgggtgt aaacacatct agtagttgtt 2581 ctgaaatgtc agttctgata tggaagcacc cattatgcgc tgtggccact ccaataggtg 2641 ctgagtgtac agagtggaat aagacagaga cctgccctca agagcaaagt agatcatgca 2701 tagagtgtga tgtatgtgta ataaatatgt ttcacacaaa caaggcctgt cagctaaaga 2761 agtttgaaca tttgggttac tatttcttgt ggttataact taatgaaaac aatgcagtac 2821 aggacatata ttttttaaaa taagtctgat ttaattgggc actatttatt tacaaatgtt 2881 ttgctcaata gattgctcaa atcaggtttt cttttaagaa tcaatcatgt cagtctgctt 2941 agaaataaca gaagaaaata gaattgacat tgaaatctag gaaaattatt ctataatttc 3001 catttactta agacttaatg agactttaaa agcatttttt aacctcctaa gtatcaagta 3061 tagaaaatct tcatggaatt cacaaagtaa tttggaaatt aggttgaaac atatctctta 3121 tcttacgaaa aaatggtagc attttaaaca aaatagaaag ttgcaaggca aatgtttatt 3181 taaaagagca ggccaggcgc ggtggctcac gcctgtaatc ccagcacttt gggaggctga 3241 ggcgggtgga tcacgaggtc aggagatcga gaccatcctg gctaacacgg tgaaacccgt 3301 ctctactaaa aatgcaaaaa aaattagccg ggcgtggtgg caggcacctg tagtcccagc 3361 tactcgggag gctgaggcag gagactggcg tgaacccagg aggcggacct tgtagtgagc 3421 cgagatcgcg ccactgtgct ccagcctggg caacagagca agactccatc tc // LOCUS HUMTHRSPO 5784 bp mRNA PRI 30-DEC-1993 DEFINITION Human thrombospondin 2 (THBS2) mRNA, complete cds. ACCESSION L12350 NID g307505 KEYWORDS thrombospondin 2. SOURCE Homo sapiens adult connective cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5784) AUTHORS LaBell,T.L., Milewicz,D.J., Disteche,C.M. and Byers,P.H. TITLE Thrombospondin II: partial cDNA sequence, chromosome location, and expression of a second member of the thrombospondin gene family in humans JOURNAL Genomics 12 (3), 421-429 (1992) MEDLINE 92217961 REFERENCE 2 (bases 1 to 5784) AUTHORS LaBell,T.L. and Byers,P.H. TITLE Sequence and characterization of the complete human thrombospondin 2 cDNA: potential regulatory role for the 3' untranslated region JOURNAL Genomics 17 (1), 225-229 (1993) MEDLINE 94010892 FEATURES Location/Qualifiers source 1..5784 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="fibroblast" /dev_stage="adult" /tissue_type="connective" mRNA 1..5784 /gene="THBS2" /citation=[2] /evidence=experimental gene 1..5784 /gene="THBS2" 5'UTR 1..239 /gene="THBS2" /note="putative" /citation=[2] sig_peptide 240..293 /gene="THBS2" /note="putative" /citation=[2] CDS 240..3758 /gene="THBS2" /standard_name="TSP2" /citation=[2] /codon_start=1 /evidence=experimental /product="thrombospondin 2" /db_xref="PID:g307506" /translation="MVWRLVLLALWVWPSTQAGHQDKDTTFDLFSISNINRKTIGAKQ FRGPDPGVPAYRFVRFDYIPPVNADDLSKITKIMRQKEGFFLTAQLKQDGKSRGTLLA LEGPGLSQRQFEIVSNGPADTLDLTYWIDGTRHVVSLEDVGLADSQWKNVTVQVAGET YSLHVGCDLIGPVALDEPFYEHLQAEKSRMYVAKGSARESHFRGLLQNVHLVFENSVE DILSKKGCQQGQGAEINAISENTETLRLGPHVTTEYVGPSSERRPEVCERSCEELGNM VQELSGLHVLVNQLSENLKRVSNDNQFLWELIGGPPKTRNMSACWQDGRFFAENETWV VDSCTTCTCKKFKTICHQITCPPATCASPSFVEGECCPSCLHSVDGEEGWSPWAEWTQ CSVTCGSGTQQRGRSCDVTSNTCLGPSIQTRACSLSKCDTRIRQDGGWSHWSPWSSCS VTCGVGNITRIRLCNSPVPQMGGKNCKGSGRETKACQGAPCPIDGRWSPWSPWSACTV TCAGGIRERTRVCNSPEPQYGGKACVGDVQERQMCNKRSCPVDGCLSNPCFPGAQCSS FPDGSWSCGFCPVGFLGNGTHCEDLDECALVPDICFSTSKVPRCVNTQPGFHCLPCPP RYRGNQPVGVGLEAAKTEKQVCEPENPCKDKTHNCHKHAECIYLGHFSDPMYKCECQT GYAGDGLICGEDSDLDGWPNLNLVCATNATYHCIKDNCPHLPNSGQEDFDKDGIGDAC DDDDDNDGVTDEKDNCQLLFNPRQADYDKDEVGDRCDNCPYVHNPAQIDTDNNGEGDA CSVDIDGDDVFNERDNCPYVYNTDQRDTDGDGVGDHCDNCPLVHNPDQTDVDNDLVGD QCDNNEDIDDDGHQNNQDNCPYISNANQADHDRDGQGDACDPDDDNDGVPDDRDNCRL VFNPDQEDLDGDGRGDICKDDFDNDNIPDIDDVCPENNAISETDFRNFQMVPLDPKGT TQIDPNWVIRHQGKELVQTANSDPGIAVGFDEFGSVDFSGTFYVNTDRDDDYAGFVFG YQSSSRFYVVMWKQVTQTYWEDQPTRAYGYSGVSLKVVNSTTGTGEHLRNALWHTGNT PGQVRTLWHDPRNIGWKDYTAYRWHLTHRPKTGYIRVLVHEGKQVMADSGPIYDQTYA GGRLGLFVFSQEMVYFSDLKYECRDI" mat_peptide 294..3755 /gene="THBS2" /citation=[2] /evidence=experimental /product="thrombospondin 2" 3'UTR 3759..5784 /gene="THBS2" /note="putative" /citation=[2] polyA_signal 5761..5766 /gene="THBS2" /note="putative" /citation=[2] polyA_site 5784 /gene="THBS2" BASE COUNT 1447 a 1460 c 1518 g 1359 t ORIGIN 1 acggcatcca gtacagaggg gctggacttg gacccctgca gcagccctgc acaggagaag 61 cggcatataa agccgcgctg cccgggagcc gctcggccac gtccaccgga gcatcctgca 121 ctgcagggcc ggtctctcgc tccagcagag cctgcgcctt tctgactcgg tccggaacac 181 tgaaaccagt catcactgca tctttttggc aaaccaggag ctcagctgca ggaggcagga 241 tggtctggag gctggtcctg ctggctctgt gggtgtggcc cagcacgcaa gctggtcacc 301 aggacaaaga cacgaccttc gaccttttca gtatcagcaa catcaaccgc aagaccattg 361 gcgccaagca gttccgcggg cccgaccccg gcgtgccggc ttaccgcttc gtgcgctttg 421 actacatccc accggtgaac gcagatgacc tcagcaagat caccaagatc atgcggcaga 481 aggagggctt cttcctcacg gcccagctca agcaggacgg caagtccagg ggcacgctgt 541 tggctctgga gggccccggt ctctcccaga ggcagttcga gatcgtctcc aacggccccg 601 cggacacgct ggatctcacc tactggattg acggcacccg gcatgtggtc tccctggagg 661 acgtcggcct ggctgactcg cagtggaaga acgtcaccgt gcaggtggct ggcgagacct 721 acagcttgca cgtgggctgc gacctcatag gaccagttgc tctggacgag cccttctacg 781 agcacctgca ggcggaaaag agccggatgt acgtggccaa aggctctgcc agagagagtc 841 acttcagggg tttgcttcag aacgtccacc tagtgtttga aaactctgtg gaagatattc 901 taagcaagaa gggttgccag caaggccagg gagctgagat caacgccatc agtgagaaca 961 cagagacgct gcgcctgggt ccgcatgtca ccaccgagta cgtgggcccc agctcggaga 1021 ggaggcccga ggtgtgcgaa cgctcgtgcg aggagctggg aaacatggtc caggagctct 1081 cggggctcca cgtcctcgtg aaccagctca gcgagaacct caagagagtg tcgaatgata 1141 accagtttct ctgggagctc attggtggcc ctcctaagac aaggaacatg tcagcttgct 1201 ggcaggatgg ccggttcttt gcggaaaatg aaacgtgggt ggtggacagc tgcaccacgt 1261 gtacctgcaa gaaatttaaa accatttgcc accaaatcac ctgcccgcct gcaacctgcg 1321 ccagtccatc ctttgtggaa ggcgaatgct gcccttcctg cctccactcg gtggacggtg 1381 aggagggctg gtctccgtgg gcagagtgga cccagtgctc cgtgacgtgt ggctctggga 1441 cccagcagag aggccggtcc tgtgacgtca ccagcaacac ctgcttgggg ccctcgatcc 1501 agacacgggc ttgcagtctg agcaagtgtg acacccgcat ccggcaggac ggcggctgga 1561 gccactggtc accttggtct tcatgctctg tgacctgtgg agttggcaat atcacacgca 1621 tccgtctctg caactcccca gtgccccaga tggggggcaa gaattgcaaa gggagtggcc 1681 gggagaccaa agcctgccag ggcgccccat gcccaatcga tggccgctgg agcccctggt 1741 ccccgtggtc ggcctgcact gtcacctgtg ccggtgggat ccgggagcgc acccgggtct 1801 gcaacagccc tgagcctcag tacggaggga aggcctgcgt gggggatgtg caggagcgtc 1861 agatgtgcaa caagaggagc tgccccgtgg atggctgttt atccaacccc tgcttcccgg 1921 gagcccagtg cagcagcttc cccgatgggt cctggtcatg cggcttctgc cctgtgggct 1981 tcttgggcaa tggcacccac tgtgaggacc tggacgagtg tgccctggtc cccgacatct 2041 gcttctccac cagcaaggtg cctcgctgtg tcaacactca gcctggcttc cactgcctgc 2101 cctgcccgcc ccgatacaga gggaaccagc ccgtcggggt cggcctggaa gcagccaaga 2161 cggaaaagca agtgtgtgag cccgaaaacc catgcaagga caagacacac aactgccaca 2221 agcacgcgga gtgcatctac ctgggtcact tcagcgaccc catgtacaag tgcgagtgcc 2281 agacaggcta cgcgggcgac gggctcatct gcggggagga ctcggacctg gacggctggc 2341 ccaacctcaa tctggtctgc gccaccaacg ccacctacca ctgcatcaag gataactgcc 2401 cccatctgcc aaattctggg caggaagact ttgacaagga cgggattggc gatgcctgtg 2461 atgatgacga tgacaatgac ggtgtgaccg atgagaagga caactgccag ctcctcttca 2521 atccccgcca ggctgactat gacaaggatg aggttgggga ccgctgtgac aactgccctt 2581 acgtgcacaa ccctgcccag atcgacacag acaacaatgg agagggtgac gcctgctccg 2641 tggacattga tggggacgat gtcttcaatg aacgagacaa ttgtccctac gtctacaaca 2701 ctgaccagag ggacacggat ggtgacggtg tgggggatca ctgtgacaac tgccccctgg 2761 tgcacaaccc tgaccagacc gacgtggaca atgaccttgt tggggaccag tgtgacaaca 2821 acgaggacat agatgacgac ggccaccaga acaaccagga caactgcccc tacatctcca 2881 acgccaacca ggctgaccat gacagagacg gccagggcga cgcctgtgac cctgatgatg 2941 acaacgatgg cgtccccgat gacagggaca actgccggct tgtgttcaac ccagaccagg 3001 aggacttgga cggtgatgga cggggtgata tttgtaaaga tgattttgac aatgacaaca 3061 tcccagatat tgatgatgtg tgtcctgaaa acaatgccat cagtgagaca gacttcagga 3121 acttccagat ggtccccttg gatcccaaag ggaccaccca aattgatccc aactgggtca 3181 ttcgccatca aggcaaggag ctggttcaga cagccaactc ggaccccggc atcgctgtag 3241 gttttgacga gtttgggtct gtggacttca gtggcacatt ctacgtaaac actgaccggg 3301 acgacgacta tgctggcttc gtctttggtt accagtcaag cagccgcttc tatgtggtga 3361 tgtggaagca ggtgacgcag acctactggg aggaccagcc cacgcgggcc tatggctact 3421 ccggcgtgtc cctcaaggtg gtgaactcca ccacggggac gggcgagcac ctgaggaacg 3481 cgctgtggca cacggggaac acgccggggc aggtgcgaac cttatggcac gaccccagga 3541 acattggctg gaaggactac acggcctata ggtggcacct gactcacagg cccaagaccg 3601 gctacatcag agtcttagtg catgaaggaa aacaggtcat ggcagactca ggacctatct 3661 atgaccaaac ctacgctggc gggcggctgg gtctatttgt cttctctcaa gaaatggtct 3721 atttctcaga cctcaagtac gaatgcagag atatttaaac aagatttgct gcatttccgg 3781 caatgccctg tgcatgccat ggtccctaga cacctcagtt cattgtggtc cttgcggctt 3841 ctctctctag cagcacctcc tgtcccttga ccttaactct gatggttctt cacctcctgc 3901 cagcaacccc aaacccaagt gccttcagag gataaatatc aatggaactc agagatgaac 3961 atctaaccca ctagaggaaa ccagtttggt gatatatgag actttatgtg gagtgaaaat 4021 tgggcatgcc attacattgc tttttcttgt ttgtttaaaa agaatgacgt ttacatataa 4081 aatgtaatta cttattgtat ttatgtgtat atggagttga agggaatact gtgcataagc 4141 cattatgata aattaagcat gaaaaatatt gctgaactac ttttggtgct taaagttgtc 4201 actattcttg aattagagtt gctctacaat gacacacaaa tcccgctaaa taaattataa 4261 acaagggtca attcaaattt gaagtaatgt tttagtaagg agagattaga agacaacagg 4321 catagcaaat gacataagct accgattaac taatcggaac atgtaaaaca gttacaaaaa 4381 taaacgaact ctcctcttgt cctacaatga aagccctcat gtgcagtaga gatgcagttt 4441 catcaaagaa caaacatcct tgcaaatggg tgtgacgcgg ttccagatgt ggatttggca 4501 aaacctcatt taagtaaaag gttagcagag caaagtgcgg tgctttagct gctgcttgtg 4561 ccgttgtggc gtcggggagg ctcctgcctg agcttccttc cccagctttg ctgcctgaga 4621 ggaaccagag cagacgcaca ggccggaaaa ggcgcatcta acgcgtatct aggctttggt 4681 aactgcggac aagttgcttt tacctgattt gatgatacat ttcattaagg ttccagttat 4741 aaatattttg ttaatattta ttaagtgact atagaatgca actccattta ccagtaactt 4801 attttaaata tgcctagtaa cacatatgta gtataatttc tagaaacaaa catctaataa 4861 gtatataatc ctgtgaaaat atgaggcttg ataatattag gttgtcacga tgaagcatgc 4921 tagaagctgt aacagaatac atagagaata atgaggagtt tatgatggaa ccttaatata 4981 taatgttgcc agcgatttta gttcaatatt tgttactgtt atctatctgc tgtatatgga 5041 attcttttaa ttcaaacgct gaaaacgaat cagcatttag tcttgccagg cacacccaat 5101 aatcagtcat gtgtaatatg cacaagtttg tttttgtttt tgtttttttt gttggttggt 5161 ttttttgctt taagttgcat gatctttctg caggaaatag tcactcatcc cactccacat 5221 aaggggttta gtaagagaag tctgtctgtc tgatgatgga tagggggcaa atctttttcc 5281 cctttctgtt aatagtcatc acatttctat gccaaacagg aacgatccat aactttagtc 5341 ttaatgtaca cattgcattt tgataaaatt aattttgttg tttcctttga ggttgatcgt 5401 tgtgttgttt tgctgcactt tttacttttt tgcgtgtgga gctgtattcc cgagacaacg 5461 aagcgttggg atacttcatt aaatgtagcg actgtcaaca gcgtgcaggt tttctgtttc 5521 tgtgttgtgg ggtcaaccgt acaatggtgt gggaatgacg atgatgtgaa tatttagaat 5581 gtaccatatt ttttgtaaat tatttatgtt tttctaaaca aatttatcgt ataggttgat 5641 gaaacgtcat gtgttttgcc aaagactgta aatatttatt tatgtgttca catggtcaaa 5701 atttcaccac tgaaaccctg cacttagcta gaacctcatt tttaaagatt aacaacagga 5761 aataaattgt aaaaaaggtt ttct // LOCUS HUMTHRSYNT 2644 bp mRNA PRI 26-JUL-1996 DEFINITION Human threonyl-tRNA synthetase mRNA, complete cds. ACCESSION M63180 NID g339679 KEYWORDS transfer RNA-Thr synthetase. SOURCE Human cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2644) AUTHORS Cruzen,M.E. and Arfin,S.M. TITLE Nucleotide and deduced amino acid sequence of human threonyl-tRNA synthetase reveals extensive homology to the Escherichia coli and yeast enzymes JOURNAL J. Biol. Chem. 266 (15), 9919-9923 (1991) MEDLINE 91236775 FEATURES Location/Qualifiers source 1..2644 /organism="Homo sapiens" /db_xref="taxon:9606" /map="5p13-cen" CDS 139..2277 /codon_start=1 /product="threonyl-tRNA synthetase" /db_xref="PID:g1464742" /translation="MGGEEKPIGAGEEKQKEGGKKKNKEGSGDGGRAELNPWPEYIYT RLEMYNILKAEHDSILAEKAEKDSKPIKVTLPDGKQVDAESWKTTPYQIACGISQGLA DNTVIAKVNNVVWDLDRPLEEDCTLELLKFEDEEAQAVYWHSSAHIMGEGMERVYGGC LCYGPPIENGFYYDMYLEEGGVSSNDFSSLEALCKKIIKEKQAFERLEVKKETLLAMF KYNKFKCRILNEKVNTPTTTVYRCGPLIDLCRGPHVRHTGKIKALKIHKNSSTYWEGK ADMETLQRIYGISFPDPKMLKEWEKFQEEAKNRDHRKIGRDQELYFFHELSPGSCFFL PKGVYIYNALIEFIRSEYRKRGFQEVVTPNIFNSRLWMTSGHWQHYSENMFSFEVEKE LFALKPMNCPGHSLMFDHRPRSWRELPLRLADFGGLHRNELSGALTGLTRVRRFQQDD AHIFCAMEQIEDEIKGCLDFLRTVYSVFGFSFKLNLSTRPEKFLGDIEVWDQAEKQLE NSLNEFGEKWELNSGDGAFYGPKIDIQIKDAIGRYHQCATIQLDFQLPIRFNLTYVSH DGEDKKRPVIVHRAILGSVERMIAILTENYGGKLAPFWLSPRQVMVVPVGPTCDEYAQ NVRQQFHDAKFMADIDLDPGCTLNKKIRNAQLAQYNFILVVGEKEKITGTVNIRTRDN KVHGERTISETIERLQQLKEFRSKQAEEEF" BASE COUNT 826 a 506 c 635 g 677 t ORIGIN 1 ccgaggccaa gtcccgggcg ctagcccacc tcccacccgc ctcttggctc ctctcctcta 61 ggccgtcgct ttcgggttct ctcatcgctt cgtcgttcgc caatgtttga ggagaaggcc 121 agcagtcctt cagggaagat gggaggcgag gagaagccga ttggtgctgg tgaagagaag 181 caaaaggaag gaggcaaaaa gaagaacaaa gaaggatctg gagatggagg tcgagctgag 241 ttgaatcctt ggcctgaata tatttacaca cgtcttgaga tgtataatat actaaaagca 301 gaacatgatt ccattctggc agaaaaggca gaaaaagata gcaagccaat taaagtcact 361 ttgcctgatg gtaaacaggt tgatgcggaa tcttggaaaa ctacaccata tcaaattgcc 421 tgtggaatta gtcaaggcct ggccgacaac accgttattg ctaaagtaaa taatgttgtg 481 tgggacctgg accgccctct ggaagaagat tgtaccttgg agcttctcaa gtttgaggat 541 gaggaagctc aggcagtgta ttggcactct agtgctcaca taatgggtga aggcatggaa 601 agagtctatg gtggatgttt atgctacggt ccgccaatag aaaatggatt ctattatgac 661 atgtacctcg aagaaggggg tgtgtctagc aatgatttct cttctctgga ggctttgtgt 721 aagaaaatca ttaaagaaaa acaagctttt gaaagactgg aagttaagaa agaaacttta 781 ctggcaatgt ttaagtacaa caagttcaaa tgccggatat tgaatgaaaa ggtgaatact 841 ccaactacca cagtctatag atgtggccct ttgatagatc tctgccgggg tcctcatgtt 901 agacacacgg gcaaaattaa ggctttaaaa atacacaaaa attcctccac gtactgggaa 961 ggcaaagcag atatggagac tctccagaga atttatggca tttcattccc agatcctaaa 1021 atgttgaaag agtgggagaa gttccaagag gaagctaaaa accgagatca taggaaaatt 1081 ggcagggacc aagaactata tttctttcat gaactcagcc ctggaagttg cttttttctg 1141 ccaaaaggag tctatattta taatgcactt attgaattca ttaggagcga atataggaaa 1201 agaggattcc aggaggtagt caccccaaac atcttcaaca gccgactctg gatgacctcg 1261 ggccactggc agcactacag cgagaacatg ttctcctttg aggtggagaa ggagctgttt 1321 gccctgaaac ccatgaactg cccaggacac tcccttatgt ttgatcatcg gccaaggtcc 1381 tggcgagaac tgcctctgcg gctagctgat tttgggggtc ttcataggaa cgagctgtct 1441 ggagcactca caggactcac ccgggtacga agattccaac aggatgatgc tcacatattc 1501 tgtgccatgg agcagattga agatgaaata aaaggttgtt tggattttct acgtacggta 1561 tatagcgtat ttggattttc ttttaaacta aacctttcta ctcgcccgga aaaattcctt 1621 ggagatatcg aagtatggga tcaagctgag aaacaacttg aaaacagtct gaatgaattt 1681 ggtgaaaagt gggagttaaa ctctggagat ggagctttct atggcccaaa gattgacata 1741 cagattaaag atgcgattgg gcggtaccac cagtgtgcaa ccatccagct ggatttccag 1801 ttgcccatca gatttaatct tacttatgta agccatgatg gtgaggataa gaaaaggcca 1861 gtgattgttc atcgagccat cttgggatca gtggaaagaa tgattgctat cctcacagaa 1921 aactatgggg gcaaattggc ccccttttgg ctgtcccctc gccaggtaat ggtagttcca 1981 gtgggaccaa cctgtgatga atatgcccaa aacgtacgac aacaattcca cgatgccaaa 2041 ttcatggcag acattgatct ggatccaggc tgtacattga ataaaaagat tcgaaatgca 2101 cagttagcac agtataactt cattttagtt gttggtgaaa aagagaaaat cactggcact 2161 gttaatatcc gcacaagaga caataaggtc cacggggaac gcaccatttc tgaaactatc 2221 gagcggctac agcagctcaa agagttccgc agcaaacagg cagaagaaga attttaatga 2281 aaaaattacc cagattggct ccatggaaaa ggaggaacag cgtttccgta aaattgactt 2341 tgtactcgaa aacgtcaatt tatattgaac ttggaggagg agtttggcaa agtctgaaat 2401 aggtcaacct gcaggcgtaa ctatttttga cctagtcagt ttttaaacaa tgtgcatttg 2461 aaggagttaa ttaaaagaga gccaataaaa tgattttact cattcagtat ctgagtactg 2521 gaagtgaaac atgaggaatg ctttagtgta atgtgggaga acttttttgt aaatttaatg 2581 caattgaaaa agttttcaaa ttcaattaag ataactagaa ttggttatgg tgtaacccga 2641 attc // LOCUS HUMTHYB4 556 bp mRNA PRI 30-SEP-1988 DEFINITION Human thymosin beta-4 mRNA, complete cds. ACCESSION M17733 NID g339688 KEYWORDS thymosin; thymosin beta-4. SOURCE Human (patient with acute lymphocytic leukemia) lymphocyte, cDNA to mRNA, clone pGKS1331. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 556) AUTHORS Gondo,H., Kudo,J., White,J.W., Barr,C.L., Selvanayagam,P. and Saunders,G.F. TITLE Differential expression of the human thymosin beta-4 gene in lymphocytes, macrophages, and granulocytes JOURNAL J. Immunol. 139, 3840-3848 (1987) MEDLINE 88060494 COMMENT Draft entry and printed copy of sequence for [1] kindly provided by G.F.Saunders, 23-NOV-1987. FEATURES Location/Qualifiers source 1..556 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..556 /note="thymosin beta-4 mRNA" CDS 78..212 /note="thymosin beta-4" /codon_start=1 /db_xref="PID:g339689" /translation="MSDKPDMAEIEKFDKSKLKKTETQEKNPLPSKETIEQEKQAGES " BASE COUNT 157 a 125 c 139 g 135 t ORIGIN 15 bp upstream of HaeIII site. 1 acaactcggt ggtggccact gcgcagacca gacttcgctc gtactcgtgc gcctcgcttc 61 gcttttcctc cgcaaccatg tctgacaaac ccgatatggc tgagatcgag aaattcgata 121 agtcgaaact gaagaagaca gagacgcaag agaaaaatcc actgccttcc aaagaaacga 181 ttgaacagga gaagcaagca ggcgaatcgt aatgaggcgt gcgccgccaa tatgcactgt 241 acattccaca agcattgcct tcttatttta cttcttttag ctgtttaact ttgtaagatg 301 caaagaggtt ggatcaagtt taaatgactg tgctgcccct ttcacatcaa agaactactg 361 acaacgaagg ccgcgctgcc tttcccatct gtctatctat ctggctggca gggaaggaaa 421 gaacttgcat gttggtgaag gaagaagtgg ggtggaagaa gtggggtggg acgacagtga 481 aatctagagt aaaaccaagc tggcccaagt gtcctgcagg ctgtaatgca gtttaatcag 541 agtgccattt tttttt // LOCUS HUMTHYP 1109 bp mRNA PRI 14-JAN-1995 DEFINITION Human parathymosin mRNA, complete cds. ACCESSION M24398 NID g339698 KEYWORDS parathymosin. SOURCE Human kidney, cDNA to mRNA, clone lambda-HKpara41188. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1109) AUTHORS Clinton,M., Frangou-Lazaridis,M., Panneerselvam,C. and Horecker,B.L. TITLE The sequence of human parathymosin deduced from a cloned human kidney cDNA JOURNAL Biochem. Biophys. Res. Commun. 158 (3), 855-862 (1989) MEDLINE 89149806 COMMENT Draft entry and printed copy of sequence for [1] kindly provided by M.Clinton, 03-JUN-1989. FEATURES Location/Qualifiers source 1..1109 /organism="Homo sapiens" /db_xref="taxon:9606" /map="17q12-q22" mRNA <1..1109 /note="THYP" gene 301..609 /gene="PTMS" CDS 301..609 /gene="PTMS" /note="parathymosin" /codon_start=1 /db_xref="GDB:G00-125-555" /db_xref="PID:g339699" /translation="MSEKSVEAAAELSAKDLKEKKEKVEEKASRKERKKEVVEEEENG AEEEEEETAEDGEEEDEGEEEDEEEEEEDDEGPALKRAAEEEDEADPKRQKTENGASA " BASE COUNT 204 a 403 c 321 g 181 t ORIGIN 19 bp upstream of SacI site. 1 ggcggcgacg gatcgagctc accgcgccga gcgcgccggc accgcctgca ccgcccttcc 61 gcccgccctc cggacggccg cagcctgcgg tctccgtcca gacccacccc cgccccaccc 121 cgcgcgcctc tgccgcctct tccagagacc cagcttgccg agcggccgcc gctgccgtgt 181 cgccgccgcc gccgccaccg cgccaggttc cggccgcggc caccctccgc cgtccagggc 241 ctctccgtct cggccccggg accccgcctc cccgccagcc ccggccccgg ccccggcacc 301 atgtcggaga aaagcgtgga ggcagcggcc gagttgagcg ccaaggacct gaaggagaag 361 aaggagaagg tggaggagaa ggcaagccgg aaagagcgaa agaaagaagt ggtggaggag 421 gaggagaacg gggctgagga ggaagaagaa gaaactgccg aggatggaga ggaggaagat 481 gaaggggaag aagaagatga ggaagaagaa gaagaggatg atgaagggcc cgcgctgaag 541 agagctgccg aagaggagga tgaagcggat cccaaacggc agaagacaga aaatggggca 601 tcggcgtgac gcctgccaac aggctgggtt gggaggcctc tctgggctgg aggtgggggt 661 gggggcagcc aagtccagcc actcttcacc tggctccctg ctctgggccc tgcaccgaga 721 gctgccaccc tcttctttct ccccagcctt ctcatttccg cctctccaga cactgcgccc 781 tccaccctca ctctgccatt gttccacctc ctgacctgct ccatctgagc tctccagctg 841 gcccccaatt gctcctctct ctctttgctc tctttctccc tcccctacca gcctcattct 901 tctccggtag cctctcccac ctaacctctg catcccccag cgtcatgtcc tgccccatcc 961 ctatcctgcc tgatccctgg atctccctca gatcccctct tctcagacag cgccaggccg 1021 gggtggggcc ggggttgccg agccccacag ctgcccccct cccctccctt tttgtataat 1081 ttaataaaga aatggtcgcg cttctgttt // LOCUS HUMTKFER 2950 bp mRNA PRI 14-JAN-1995 DEFINITION Human tyrosine kinase (FER) mRNA, complete cds. ACCESSION J03358 NID g339714 KEYWORDS c-myc proto-oncogene; tyrosine kinase. SOURCE Human fibroblast, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2950) AUTHORS Hao,Q.-L. and Groffen,J. TITLE Isolation and sequence analysis of a novel human tyrosine kinase gene JOURNAL Mol. Cell. Biol. 4, 1587-1593 (1989) COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Groffen, 31-MAR-1989. The FER gene contains regions homologous to the highly conserved tyrosine protein kinase domain of the other oncogenes and growth factor receptors, but lacks a clear transmembrane region, indicating that it encodes a tyrosine kinase of the nonreceptor type. FEATURES Location/Qualifiers source 1..2950 /organism="Homo sapiens" /db_xref="taxon:9606" /map="5q21" mRNA <1..2950 /note="FER mRNA" gene 385..2853 /gene="FER" CDS 385..2853 /gene="FER" /note="tyrosine kinase (FER)" /codon_start=1 /db_xref="GDB:G00-125-243" /db_xref="PID:g339715" /translation="MGFGSDLKNSHEAVLKLQDWELRLLETVKKFMALRIKSDKEYAS TLQNLCNQVDKESTVQMNYVSNVSKSWLLMIQQTEQLSRIMKTHAEDLNSGPLHRLTM MIKDKQQVKKSYIGVHQQIEAEMIKVTKTELEKLKCSYRQLIKEMNSAKEKYKEALAK GKETEKAKERYDKATMKLHMLHNQYVLALKGAQLHQNQYYDITLPLLLDSLQKMQEEM IKALKGIFDEYSQITSLVTEEIVNVHKEIQMSVEQIDPSTEYNNFIDVHRTTAAKEQE IEFDTSLLEENENLQANEIMWNNLTAESLQVMLKTLAEELMQTQQMLLNKEEAVLELE KRIEESSETCEKKSDIVLLLSQKQALEELKQSVQQLRCTEAKFSAQKELLEQKVQEND GKEPPPVVNYEEDARSVTSMERKERLSKFESIRHSIAGIIRSPKSAVGSSALSDMISI SEKPLAEQDWYHGAIPRIEAQELLKKQGDFLVRESHGKPGEYVLSVYSDGQRRHFIIQ YVDNMYRFEGTGFSNIPQLIDHHYTTKQVITKKSGVVLLNPIPKDKKWILSHEDVILG ELLGKGNFGEVYKGTLKDKTSVAVKTCKEDLPQELKIKFLQEAKILKQYDHPNIVKLI GVCTQRQPVYIIMELVSGGDFLTFLRRKKDELKLKQLVKFSLDAAAGMLYLESKNCIH RDLAARNCLVGENNVLKISDFGMSRQEDGGVYSSSGLKQIPIKWTAPEALNYGRYSSE SDVWSFGILLWETFSLGVCPYPGMTNQQAREQVERGYRMSAPQHCPEDISKIMMKCWD YKPENRPKFSELQKELTIIKRKLT" BASE COUNT 1017 a 535 c 636 g 762 t ORIGIN 1 gtcacaccct cgaataatga cgcataccta tcctactgtt actgatcacc tagtaataat 61 cttgtagttc acattactca tttttcacca aattcttttg gtgaaggacg cttcagaaac 121 ggccatcact gaagagcaga cccgtttggg ttctccacgc attctagact cccgaagagc 181 tcatgttttt tggctagacc tatgaccatt ttcgctagac ttcactgcac gttttctcaa 241 gtatcttctt tgtccctaat gtgtgacacc tcatcatgga cacgctactt tagctaaggc 301 atgaccagca atgaacagta gtaagatatg tgctgattag aaggctcact tgtgcagtgt 361 ggaggataac cagtgcctta caaaatgggg tttgggagtg acctgaagaa ttcacatgaa 421 gcagtgttaa aattgcaaga ctgggaatta cggttactgg aaacagtaaa gaaatttatg 481 gccctgagaa taaaaagtga taaagaatat gcatctactt tacagaacct ttgtaatcaa 541 gttgataagg aaagtactgt ccaaatgaat tatgtcagca acgtatccaa gtcttggcta 601 cttatgattc agcagacaga acaacttagt aggataatga agacacatgc agaggacttg 661 aactctggac ctttacacag gctcaccatg atgattaagg acaagcagca ggtgaagaaa 721 agttacatag gtgttcatca gcagatagag gcagagatga tcaaggttac caaaacagaa 781 ttggagaagt taaaatgcag ctatagacaa ttaataaaag aaatgaattc tgccaaagag 841 aaatataaag aagctttagc taaagggaag gaaactgaaa aggccaagga acgatacgac 901 aaagccacaa tgaaacttca tatgttgcac aatcagtatg tattggcgtt gaaaggggca 961 cagctccatc agaatcagta ttatgatatc acacttcccc tgcttctgga ctccttacaa 1021 aagatgcaag aagaaatgat aaaagcactc aaaggtatat ttgatgaata cagccagata 1081 accagtcttg tcacagagga aatagtgaat gtccataaag agattcaaat gtcggttgaa 1141 cagatagatc ctagtacaga atacaataat ttcatagatg ttcacagaac aacggctgct 1201 aaagaacaag aaatagagtt tgatacttcc ttactggaag aaaatgaaaa tcttcaggca 1261 aatgagatca tgtggaataa cttaacagca gaaagtttgc aagtaatgtt gaaaacgtta 1321 gcggaagaac ttatgcaaac acagcagatg cttttaaaca aggaggaggc tgttttggag 1381 ttagagaaga gaattgaaga atcttctgaa acttgtgaga agaagtctga tattgtgctt 1441 ctgctaagcc aaaaacaggc acttgaagaa ctgaaacagt cagtccagca gctgagatgc 1501 actgaagcaa agttttcagc acagaaggaa ttactagagc aaaaagtgca agaaaatgat 1561 gggaaagagc cacctccagt agtaaattat gaagaagatg cacgatcagt tacatctatg 1621 gaaagaaagg agaggctatc caaatttgaa tctattcgtc attcaattgc tggaataatt 1681 aggtctccaa aatctgcagt gggctcttca gcactttctg atatgatctc catcagtgag 1741 aagcctttgg cagaacagga ctggtaccat ggtgcaattc ccagaataga agctcaagaa 1801 ctgttaaaaa aacaaggaga ctttttggtg cgagagagtc atgggaaacc tggtgaatat 1861 gtcctttctg tatattctga tggacagagg agacatttta tcatacaata tgttgataac 1921 atgtatcgat tcgagggcac tgggttttca aacattcctc aacttataga tcatcactat 1981 acaacaaaac aggtcatcac taagaaatca ggtgtagttc tgctgaatcc tattcctaag 2041 gacaagaaat ggattctcag tcatgaagat gtcatattgg gagaattact gggcaaggga 2101 aattttggtg aagtatataa gggcacatta aaggataaaa cttctgttgc tgttaaaaca 2161 tgtaaagaag atcttcctca ggaattgaaa ataaaatttt tacaagaagc caaaattctc 2221 aagcaatatg atcatcccaa tattgtcaaa cttataggag tttgcacaca aagacagcct 2281 gtctacatca ttatggaact ggtttcagga ggtgatttcc tcacctttct gagaaggaag 2341 aaggatgaac taaaactcaa acagttagtg aaattttcat tagacgctgc tgctggtatg 2401 ttgtatctcg agagtaaaaa ctgtatacac agggaccttg ctgcaagaaa ctgcctggta 2461 ggtgaaaata atgttctgaa aatcagtgac tttggaatgt ctcgtcaaga ggatggtgga 2521 gtgtattcat cttctggctt aaagcagatt cccattaaat ggacagcacc ggaagctctt 2581 aattatggga gatacagttc agagagtgac gtgtggagct ttggcatcct tctctgggag 2641 accttcagct taggggtttg tccgtaccct ggaatgacaa atcagcaagc aagagagcaa 2701 gtagaaagag gataccggat gtcagctccc cagcactgtc cagaggatat ttccaaaatc 2761 atgatgaagt gttgggatta taaacctgaa aatcgcccta agttcagtga acttcagaaa 2821 gagctcacta tcatcaagag aaaactcaca tagtgacagg atggcgccaa actcagcctt 2881 caggactctg tcctccagca gagtaacatt attgttctca ttaacaatga atttatacca 2941 cattaccttc // LOCUS HUMTKTCS 6383 bp mRNA PRI 07-JUN-1993 DEFINITION Homo sapiens T cell-specific tyrosine kinase mRNA, complete cds. ACCESSION L10717 NID g307507 KEYWORDS tyrosine kinase. SOURCE Homo sapiens thymus cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6383) AUTHORS Gibson,S., Leung,B., Squire,J.A., Hill,M., Arima,N., Goss,P., Hogg,D. and Mills,G.B. TITLE Identification, cloning, and characterization of a novel human T cell specific tyrosine kinase located at the hematopoietin complex on chromosome 5q JOURNAL Blood (1993) In press FEATURES Location/Qualifiers source 1..6383 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="thymus" CDS 2024..3886 /note="2024-2555 unique domain; 2556-2708 SH3 domain; 2750-3044 Sh2 domain (binds phosphotyrosine-containing proteins); 3095-3884 kiase domain (phophorylation of tyrosine residues).; putative" /codon_start=1 /product="tyrosine kinase" /db_xref="PID:g307508" /translation="MNNFILLEEQLIKKSQQKRRTSPSNFKVRFFVLTKASLAYFEDR HGKKRTLKGSIELSRIKCVEIVKSDISIPCHYKYPFQVVHDNYLLYVFAPDRESRQRW VLALKEETRNNNSLVPKYHPNFWMDGKWRCCSQLEKLATGCAQYDPTKNASKKPLPPT PEDNRRPLWEPEETVVIALYDYQTNDPQELALRRNEEYCLLDSSEIHWWRVQDRNGHE GYVPSSYLVEKSPNNLETYEWYNKSISRDKAEKLLLDTGKEGAFMVRDSRTAGTYTVS VFTKAVVSENNPCIKHYHIKETNDNPKRYYVAEKYVFDSIPLLINYHQHNGGGLVTRL RYPVCFGRQKAPVTAGLRYGKWVIDPSELTFVQEIGSGQFGLVHLGYWLNKDKVAIKT IREGAMSEEDFIEEAEVMMKLSHPKLVQLYGVCLEQAPICLVFEFMEHGCLSDYLRTQ RGLFAAETLLGMCLDVCEGMAYLEEACVIHRDLAARNCLVGENQVIKVSDFGMTRFVL DDQYTSSTGTKFPVKWASPEVFSFSRYSSKSDVWSFGVLMWEVFSEGKIPYENRSNSE VVEDISTGFRLYKPRLASTHVYQIMNHCWKERPEDRPAFSRLLRQLAEIAESGL" BASE COUNT 1945 a 1317 c 1364 g 1757 t ORIGIN 1 cgcggccgct atatataatg cagcatcaca ccatgtaggg catttactct tattttatac 61 attcagatat gtttgaaaca ttcttaaggc tacaaaacag aacatagaaa aataaacagg 121 aatatattca acacttacaa aaagtgatat gataaagaat ataaagtact agtttccttt 181 taacacttca aaagatatgt atatatactt ttttttacaa gtaacatcac aaatgctcac 241 atcttcacat gctcttaaag tattatttgt actcagtgta aggctattat cgtttttcat 301 acataaaatt ttctagctct gtaacacaat gcaattttta atccattcaa gtaagttcaa 361 ccccaaagtt gccgcttccc agcattaaga catgcaccca cccctcttct aagattttct 421 aaacttgtat ttcggggaga aagacctctt ttaaaaaata atccaattag tgggagagta 481 aatggctgac attagtagca aaaccttagt tatctgaaaa taacatattg gaaatgagac 541 attattagga ttttaaacaa acaatagcat ttagacataa agtaggaagc aaaatacagt 601 aaacagaaat agtgtagcca aatatcattc tcttcagcta ccttaagtaa aagacaaaac 661 atttacctca tctaaaaatg aaggtaaaac gaaagaggca aaaataaata ttgctagttt 721 ctaggatggc tgaatgtttt ctaaaccaga aatggttaga aaggaacttt attgcaccaa 781 gtcaatcata agcaagtttg cagttcacag gcattttaat tcaaccttga gtcacaaagg 841 agaacaacac gctgcgagaa tacagtctac agtctgcatt aaataagaat atatcagcat 901 tgtggtctgg gaaaacctat gcttgccagg acaaggcagg gtgctgagct taggtcatgc 961 catgaaaatg aatttgtggg ttatcagtaa acagtatgag gactacacag atgccagcat 1021 cctgctgcca aggagacatg gggcaagagt tgaagatttg agaggaaatg aagagacata 1081 cacaacacca aaggaaaagg gggctggaat caagttcagc caaagcacct aacacaaaaa 1141 acaggtgagc tttggtcagt ctgttcttca aaatatgtat gatcatatgg taatgaagtt 1201 tcataatttc caactcaaaa atacaaatga tcctcagttc tatacttttg cctctattct 1261 cttataaaga aatatgtcaa cataacagta tgacataaca gttaaaataa ggacaaaagc 1321 ttgcttatct tagtttgacc tcagcataag gcaaaatccc ctggagaata catttaaaaa 1381 caaacttaaa aggaaaaaaa gcgaaaccaa cttcatgcaa agattccttt taaaactatc 1441 aaaagtcagt tcttttattc cagaggtcac tgagaaaagt accatctgct aaaattctct 1501 ttcaagcact tcttccatca tatcctagag gtgagatatg ggaaacagaa agcaaatcag 1561 tgttcctcag gagctatatc tgttactcaa ttgagggtaa gacaaagtga caatgaagat 1621 atgagtagta tttccttcca atttttaaag attttcagaa gctgagatca aaccccactc 1681 aataaaatgc aggagactag aagcaacaac ttattttgga ctcctgagat caaacacatt 1741 gaactttcaa atctgggtgt ttctatcaaa atgtgatttt cattaaaatc agtaagctag 1801 tcctacataa aaaagcatga gctgaaagtg gaggaccctc tatcttctca ttccttaact 1861 gagccaccga tgttaagaaa aaaatggctt aagcggtacc ttcaacaact attctagtta 1921 agaaggtgac aacaaattga ggccgcgaat tcggcgaaaa ctctttcctt tggttgtgct 1981 aagaggtgat gcccaaggtg caccaccttt caagaactgg atcatgaaca actttatcct 2041 cctggaagaa cagctcatca agaaatccca acaaaagaga agaacttctc cctcgaactt 2101 taaagtccgc ttctttgtgt taaccaaagc cagcctggca tactttgaag atcgtcatgg 2161 gaagaagcgc acgctgaagg ggtccattga gctctcccga atcaaatgtg ttgagattgt 2221 gaaaagtgac atcagcatcc catgccacta taaatacccg tttcaggtgg tgcatgacaa 2281 ctacctccta tatgtgtttg ctccagatcg tgagagccgg cagcgctggg tgctggccct 2341 taaagaagaa acgaggaata ataacagttt ggtgcctaaa tatcatccta atttctggat 2401 ggatgggaag tggaggtgct gttctcagct ggagaagctt gcaacaggct gtgcccaata 2461 tgatccaacc aagaatgctt caaagaagcc tcttcctcct actcctgaag acaacaggcg 2521 accactttgg gaacctgaag aaactgtggt cattgcctta tatgactacc aaaccaatga 2581 tcctcaggaa ctcgcactgc ggcgcaacga agagtactgc ctgctggaca gttctgagat 2641 tcactggtgg agagtccagg acaggaatgg gcatgaagga tatgtaccaa gcagttatct 2701 ggtggaaaaa tctccaaata atctggaaac ctatgagtgg tacaataaga gtatcagccg 2761 agacaaagct gaaaaacttc ttttggacac aggcaaagaa ggagccttca tggtaaggga 2821 ttccaggact gcaggaacat acaccgtgtc tgttttcacc aaggctgttg taagtgagaa 2881 caatccctgt ataaagcatt atcacatcaa ggaaacaaat gacaatccta agcgatacta 2941 tgtggctgaa aagtatgtgt tcgattccat ccctcttctc atcaactatc accaacataa 3001 tggaggaggc ctggtgactc gactccggta tccagtttgt tttgggaggc agaaagcccc 3061 agttacagca gggctgagat acgggaaatg ggtgatcgac ccctcagagc tcacttttgt 3121 gcaagagatt ggcagtgggc aatttgggtt ggtgcatctg ggctactggc tcaacaagga 3181 caaggtggct atcaaaacca ttcgggaagg ggctatgtca gaagaggact tcatagagga 3241 ggctgaagta atgatgaaac tctctcatcc caaactggtg cagctgtatg gggtgtgcct 3301 ggagcaggcc cccatctgcc tggtgtttga gttcatggag cacggctgcc tgtcagatta 3361 tctacgcacc cagcggggac tttttgctgc agagaccctg ctgggcatgt gtctggatgt 3421 gtgtgagggc atggcctacc tggaagaggc atgtgtcatc cacagagact tggctgccag 3481 aaattgtttg gtgggagaaa accaagtcat caaggtgtct gactttggga tgacaaggtt 3541 cgttctggat gatcagtaca ccagttccac aggcaccaaa ttcccggtga agtgggcatc 3601 cccagaggtt ttctctttca gtcgctatag cagcaagtcc gatgtgtggt catttggtgt 3661 gctgatgtgg gaagttttca gtgaaggcaa aatcccgtat gaaaaccgaa gcaactcaga 3721 ggtggtggaa gacatcagta ccggatttcg gttgtacaag ccccggctgg cctccacaca 3781 cgtctaccag attatgaatc actgctggaa agagagacca gaagatcggc cagccttctc 3841 cagactgctg cgtcaactgg ctgaaattgc agaatcagga ctttagtaga gactgagtac 3901 caggccacgg gctcagatcc tgaatggagg aaggatatgt cctcattcca tagagcatta 3961 gaagctgcca ccagcccagg accctccaga ggcagcctgg cctgtactca gtccctgagt 4021 caccatggaa gcagcatcct gaccacagct ggcagtcaag ccacagctgg agggtcagcc 4081 accaagctgg gagctgagcc agaacaggag tgatgtctct gcccttcctc tagcctcttg 4141 tcacatgtgg tgcacaaacc tcaacctgac agctttcaga cagcattctt gcacttctta 4201 gcaacagaga gagacatgac gtaagaccca gattgctatt tttattgtta tttttcaaca 4261 gtgaatctaa agtttatggt tccagggact ttttatttga cccaacaaca cagtatccca 4321 ggatatggag gcaaggggaa caagagcatg agtgtttttc caagaaactg gtgagttaag 4381 taagattaga gtgagtgtgc tctgttgctg tgatgctgtc agccacagct tcctgccgta 4441 gagaatgata gagcagctgc tcacacagga ggccggatat ctgataagca gctttatgag 4501 gttttacaga gtatgctgct acctctctcc ttgaagggag catggcagac ccattggatg 4561 gattggggtg aacagttcag gtcccatgct tggagcattg ggtatctgat gtctgcacca 4621 gaacaagaga acctctgacg gtggagaacc atgtggtgta agaagagatc ttaggtctct 4681 tctttatacc aagctcatgt tttataccaa gctcatcttt tataccaagc tgtgcaggtg 4741 actatgcctc ctcttctgca cagaatgctt ccaccagcat cctgagaaga aatgattact 4801 tctgtaaaac atcctttttt ccagcctctg ggaatcagcc cccccctctc tgcactatcc 4861 gatcctcatc aacagagggc agcattgtgt tggtcagtgt tcccttggcg agcaattgaa 4921 acttgtttag gccctagggt tgagcaattt taaggttgag actccaagtc tcctaaaatt 4981 ctaggagaga aataaagagt ctgtttttgc tcaaaccatc aggatggaaa cagtcaggca 5041 ctgactgggg tgcttccaag aggcatgaga gtgcctactc tggcttgagc acttctatat 5101 gcaaggtgaa tatgtactga gctaggagac ttccctgcaa aatctctgtt caccctgggt 5161 tcacatcccc atgaggtaat attattattc ccattttaca aataatgtaa ctgaggcttt 5221 aaaaagccaa gacatctgcc caaagtgatg gaactagaaa gtctagagct ggtattctag 5281 cccaaatctg tctgaccgca atacacagat tatttattcc tattagacac tggcttctac 5341 tgaaaatgaa acttattgca gagggaataa atacaaagat ggaaagccag taaagaagtc 5401 agtatagaac cactagcgat agtgttgctc tggcacagac cactgtggtt gatgcatggc 5461 cctccaactt ggaataggat tttccttttc ctattctgta tccttacctt ggtcatgtta 5521 atgactttgg agttattcag ttcctgaccc tttaattctc acaaccaacc agtcatgttg 5581 cttgaagcca ttatagacga gcttcaaagc aactttaaaa gattgttatg tagaagtatg 5641 agttcttcct ttaattatca ttccaacttt cagctgtagt cttcttgaac acttatgagg 5701 agggaggaca ttccctgata taagagagga tggtgttgca attggctctt tctaaatcat 5761 gtgacgtttt gactggcttg agattcagat gcataatttt taattattgt gaagtggaga 5821 gcctcaagat aaaactctgt cattacgaag atgattttac tcagcttatc caaaattatc 5881 tctgtttact ttttagaatt ttgtacatta tcttttggga tccttaatta gagatgattt 5941 ctggaacatt cagtctagaa agaaaacatt ggaattgact gatctctgtg gtttggttta 6001 gaaaattccc ctgtgcatgg tattaccttt ttcaagctca gattcatcta atcctcaact 6061 gtacatgtgt acattcttca cctcctggtg ccctatcccg caaaatgggc ttcctgcctg 6121 ggtttttctc ttctcacatt ttttaaatgg tcccctgtgt ttgtagagaa ctcccttata 6181 cagagttttg gttctagttt tatttcgtag attttgcatt ttgtaccttt tgagactatg 6241 tatttatatt tggatcagat gcatatttat taatgtacag tcactgctag tgttcaaaat 6301 aaaaatgtta caaatacctg ttatcctttg tagagcacac agagttaaaa gttgaatata 6361 gcaatattaa agctgcattt taa // LOCUS HUMTLEI 2352 bp mRNA PRI 14-JAN-1995 DEFINITION Human transducin-like enhancer protein (TLE1) mRNA, complete cds. ACCESSION M99435 NID g307509 KEYWORDS transducin-like enhancer of split protein. SOURCE Homo sapiens (tissue library: Stratagene) fetus brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2352) AUTHORS Stifani,S., Blaumueller,C.M., Redhead,N.J., Hill,R.E. and Artavanis-Tsakonas,S. TITLE Human homologs of a Drosophila Enhancer of split gene product define a novel family of nuclear proteins [published erratum appears in Nat Genet 1992 Dec;2(4):343] JOURNAL Nature Genet. 2 (2), 119-127 (1992) MEDLINE 93265135 FEATURES Location/Qualifiers source 1..2352 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="brain" /tissue_lib="Stratagene" gene 26..2338 /gene="TLE1" CDS 26..2338 /gene="TLE1" /standard_name="TLE1 protein" /codon_start=1 /product="transducin-like enhancer protein" /db_xref="PID:g307510" /translation="MFPQSRHPTPHQAAGQPFKFTIPESLDRIKEEFQFLQAQYHSLK LECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNTICAQVIPFLSQEHQQ QVAQAVERAKQVTMAELNAIIGQQQLQAQHLSHGHGPPVPLTPHPSGLQPPGIPPLGG SAGLLALSSALSGQSHLAIKDDKKHHDAEHHRDREPGTSNSLLVPDSLRGTDKRRNGP EFSNDIKKRKVDDKDSSHYDSDGDKSDDNLVVDVSNEDPSSPRASPAHSPRENGIDKN RLLKKDASSSPASTASSASSTSLKSKEMSLHEKASTPVLKSSTPTPRSDMPTPGTSAT PGLRPGLGKPPAIDPLVNQAAAGLRTPLAVPGPYPAPFGMVPHAGMNGELTSPGAAYA SLHNMSPQMSAAAARGRRGRYGRSPMVGFDPPPHMRVPTIPPNLAGIPGGKPAYSFHV TADGQMQPVPFPPTPLIGPGIPRHARQINTLNHGEVVCAVTISNPTRHVYTGGKGCVK VWDISHPGNKSPVSQLDCLNRDNYIRSCKLLPDGCTLIVGGEASTLSIWDLAAPTPRI KAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDI SNDGTKLWTGGLDNTVRSWDLREGRQLQQHDFTSQIFSLGYCPTGEWLAVGMESSNVE VLHVNKPDKYQLHLHESCVLSLKFAYCGKWFVSTGKDNLLNAWRTPYGASIFQSKESS SVLSCDISVDDKYIVTGSGDKKATVYEVIY" BASE COUNT 571 a 707 c 614 g 460 t ORIGIN 1 acagagcccc gccgccgcca gagcgatgtt cccgcagagc cggcacccga cgccgcacca 61 ggctgcaggc cagcccttca agttcactat cccggagtcc ctggaccgga ttaaagagga 121 attccagttc ctgcaggcgc agtatcacag ccttaaattg gaatgtgaga aactggcaag 181 tgaaaagaca gaaatgcaga ggcactatgt gatgtattat gaaatgtcat atggattaaa 241 cattgaaatg cacaaacaga ctgaaatcgc caagagattg aatacgattt gtgcacaagt 301 catcccattt ctgtctcagg aacatcaaca acaggtggcc caggctgttg aacgtgccaa 361 acaggtgacc atggcagagt tgaatgccat catcgggcag cagcagttgc aagctcagca 421 tctttctcat ggccacggac ccccagttcc ccttacgcct cacccttcgg gacttcagcc 481 tcctggaatc ccgcccctcg ggggcagtgc cggccttctt gcgctgtcta gtgctctgag 541 tgggcagtct cacttggcaa taaaagatga caagaagcac cacgatgcag agcaccacag 601 agacagagag ccgggcacaa gtaattccct cctggtccca gacagtctaa gaggcacaga 661 taaacgcaga aatggacctg aattttccaa tgacatcaag aaaaggaagg tggatgataa 721 ggactccagc cactatgaca gtgatggtga caaaagcgat gacaacttag ttgtggatgt 781 gtctaatgag gacccttctt ctccgcgagc aagccctgcc cactcgcccc gggaaaatgg 841 aatcgacaaa aatcgcctgc taaagaagga tgcttctagc agtccagctt ccacggcctc 901 ctcggcaagt tccacttctt tgaaatccaa agaaatgagc ttgcatgaaa aagccagcac 961 gcctgttctg aaatccagca caccaacgcc tcggagcgac atgccaacgc cgggcaccag 1021 cgccactcca ggcctccgtc caggtctcgg caagcctcca gccatagacc ccctcgttaa 1081 ccaagcggca gctggcttga ggacacccct ggcagtgccc ggcccatatc ctgctccttt 1141 tgggatggtc ccccacgctg gcatgaacgg cgagctgacc agcccaggcg ctgcctacgc 1201 cagtttacac aacatgtcgc cccagatgag cgccgcagcc gcccgcggcc gccgtggtcg 1261 gtacgggcgc tcccccatgg tggggtttga tcctccccct cacatgagag tacctaccat 1321 tcctccaaac ctggcaggaa tccctggggg gaaacctgca tactccttcc acgttactgc 1381 agacggtcag atgcagcctg tcccttttcc cccgacgccc ctcatcggac ccggaatccc 1441 ccggcatgct cgccagatca acaccctcaa ccacggggag gtggtgtgcg ctgtgaccat 1501 cagcaacccc acgagacacg tgtacacagg cgggaagggc tgcgtcaagg tctgggacat 1561 cagccaccct ggcaataaga gccctgtctc ccagctcgac tgtctgaaca gagacaatta 1621 tatccgttcc tgtaaattgc tacccgatgg ctgcactctc atagtgggag gggaagccag 1681 tactttgtcc atttgggacc tggcggctcc aaccccgcgc atcaaggcgg agctgacgtc 1741 ctcggccccc gcctgctacg ccctggccat cagccccgat tccaaggtct gcttctcatg 1801 ctgcagcgac ggcaacatcg ctgtgtggga tctgcacaac cagacactag tgaggcaatt 1861 ccagggccac acagacggag ccagctgtat tgacatttct aatgatggca ccaagctctg 1921 gacgggtggt ttggacaaca cagtcaggtc ctgggacctg cgcgaggggc ggcagctgca 1981 gcagcacgac ttcacctccc agatcttctc cctggggtac tgccccaccg gggagtggct 2041 ggcagtgggc atggagagca gcaatgtgga ggtgctgcac gtgaacaagc ctgacaagta 2101 ccagctgcac ctgcatgaga gctgcgtgct gtccctgaaa tttgcttact gtggtaaatg 2161 gtttgtgagt actggaaaag ataacctcct caatgcttgg cggaccccct atggagccag 2221 catattccag tccaaagagt cctcgtcagt gcttagctgt gacatctctg tggatgataa 2281 gtacatagtc actggctcgg gggacaagaa ggctacagtc tatgaagtca tctactgaaa 2341 acattatgtg gt // LOCUS HUMTLEII 2271 bp mRNA PRI 14-JAN-1995 DEFINITION Human transducin-like enhancer protein (TLE2) mRNA, complete cds. ACCESSION M99436 NID g307511 KEYWORDS transducin-like enhancer of split protein. SOURCE Homo sapiens (tissue library: Stratagene) fetus brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2271) AUTHORS Stifani,S., Blaumueller,C.M., Redhead,N.J., Hill,R.E. and Artavanis-Tsakonas,S. TITLE Human homologs of a Drosophila Enhancer of split gene product define a novel family of nuclear proteins [published erratum appears in Nat Genet 1992 Dec;2(4):343] JOURNAL Nature Genet. 2 (2), 119-127 (1992) MEDLINE 93265135 FEATURES Location/Qualifiers source 1..2271 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="brain" /tissue_lib="Stratagene" gene 26..2257 /gene="TLE2" CDS 26..2257 /gene="TLE2" /standard_name="TLE 2 protein" /codon_start=1 /product="transducin-like enhancer protein" /db_xref="PID:g307512" /translation="MYPQGRHPTPLQSGQPFKFSILEICDRIKEEFQFLQAQYHSLKL ECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQAEIVKRLSGICAQIIPFLTQEHQQQ VLQAVERAKQVTVGELNSLIGQQLQPLSHHAPPVPLTPRPAGLVGGSATGLLALSGAL AAQAQLAAAVKEDRAGVEAEGSRVERAPSRSASPSPPESLVEEERPSGPGGGGKQRAD EKEPSGPYESDEDKSDYNLVVDEDQPSEPPSPATTPCGKVPICIPARRDLVDSPASLA SSLRSPLPRAKELILNDLPASTPASKSCDSSPPQDASTPGPSSASHLCQLALKPAPST DSVALRSPLTLSSPFTTSFSLGSHSTLNGDLSVPSSYVSLHLSPQVSSSVVYGRSPVM AFESHPHLRGSSVSSSLPSIPGGKPAYSFHVSADGQMQPVPFPSDALVDAGIPRHARQ LHTLAHGEVVCAVTISGSTQHVYTGGKGCVKVWDVGQPGAKTPVRQLDCLNRDNYIRS CKLLPDGRSLIVGGEASTLSIWDLAAPTPRIKAELTSSAPACYALAVSPDAKVCFSCC SDGNIVVWDLQNQTMVRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWDLREGRQL QQHDFSSQIFSPCHCPNQDWLAVGMESSNVEILHVGKPEKYQLHLHESCVLSLKFAPC GRWFVSTGKDNLLNAWRTPYGASIFQSKESSSVLSCDISRNNKYIVTGSGDKKATVYE VVY" BASE COUNT 437 a 753 c 670 g 411 t ORIGIN 1 ctggggggct tttcgaatcg gcaggatgta cccccaggga aggcacccga ccccgctcca 61 gtccggccag cccttcaagt tctcgatctt ggagatctgc gaccgcatca aagaagaatt 121 ccagtttctt caggctcaat accacagcct caagctagaa tgtgagaagc tggccagcga 181 gaagacggaa atgcagcgac attatgtcat gtattatgag atgtcgtacg ggctcaacat 241 tgaaatgcat aagcaggcgg agattgtgaa gcgtctgagc ggtatctgcg ctcagattat 301 ccccttcctg acccaggagc atcagcagca ggtgctccag gccgtagaac gcgccaagca 361 ggtcaccgtg ggggagctga acagcctcat cgggcagcag ctccagccgc tgtcccacca 421 cgcaccccct gtgcccctca ccccccgccc agccgggctg gtgggcggca gtgctacggg 481 gctgcttgct ctgtctggag ccctggctgc ccaggctcag ctggcggcgg ctgtcaagga 541 ggaccgtgcg ggcgtggagg ccgaggggtc cagagtggag agagccccga gcaggagtgc 601 atctccctcg ccccctgaga gtctcgtgga ggaggagcga ccgagtggcc ctggtggtgg 661 cgggaagcag agagcagatg agaaggagcc atcaggacct tatgaaagcg acgaagacaa 721 gagtgattac aatctggtgg tggacgagga ccaaccctca gagcccccca gcccggctac 781 caccccctgc ggaaaggtac ccatctgcat tcctgcccgt cgggacctgg tggacagtcc 841 agcctccttg gcctctagct tgcggtcacc gctgcctaga gccaaggagc tcatcctgaa 901 tgaccttccc gccagcactc ctgcctccaa atcctgtgac tcctccccgc cccaggacgc 961 ttccaccccc gggcccagct cggccagtca cctctgccag cttgcgctca agccagcacc 1021 ttccacggac agcgtcgccc tgaggagccc cctgactctg tccagtccct tcaccacgtc 1081 cttcagcctg ggctcccaca gcactctcaa cggagacctc tccgtgccca gctcctacgt 1141 cagcctccac ctgtcccccc aggtcagcag ctctgtggtg tacggacgct cccccgtgat 1201 ggcatttgag tctcatcccc atctccgagg gtcatccgtc tcttcctccc tacccagcat 1261 ccctggggga aagccggcct actccttcca cgtgtctgcg gacgggcaga tgcagccggt 1321 tcccttcccc tcggatgcac tggtagacgc gggcatcccg cggcacgccc ggcagctgca 1381 cacgctggcc catggcgagg tggtctgcgc ggtcaccatc agcggctcca cacagcatgt 1441 gtacacgggc ggcaagggct gtgtgaaggt gtgggacgtg ggccagcctg gggccaagac 1501 gcccgtgcgc cagctcgact gcctgaaccg agacaactac attcgttcct gcaagttgct 1561 gccggatggc cggagtctga tcgtgggcgg tgaggccagc accttgtcca tttgggacct 1621 ggcggcgccc accccccgta tcaaggccga gctgacttcc tcagccccag cctgctacgc 1681 cctggccgtc agccccgacg ccaaggtttg cttctcctgc tgcagcgatg gcaacattgt 1741 ggtctgggac ctgcagaatc agactatggt caggcagttc cagggccaca cggacggcgc 1801 cagctgcatt gatatttccg attacggcac tcggctctgg acagggggcc tggacaacac 1861 ggtgcgctgc tgggacctgc gggagggccg ccagctgcag cagcatgact tcagctccca 1921 gattttctcc ccctgccact gccctaacca ggactggctg gcggtcggaa tggagagtag 1981 caacgtggag atcctgcacg tcggcaagcc ggagaaatac cagctgcacc tccacgagag 2041 ctgcgtgctg tccctgaagt ttgccccttg cggacggtgg tttgtgagca ccgggaagga 2101 caacctgctc aacgcctgga ggacgccgta cggggccagc attttccagt ccaaggagtc 2161 gtcctcagtc ctgagttgtg acatctccag aaataacaaa tacattgtga caggctcggg 2221 ggacaagaag gccaccgtgt atgaggtggt ctactgaaga catgaccccc c // LOCUS HUMTNFRRP 2136 bp mRNA PRI 22-JUL-1993 DEFINITION Homo sapiens (clone CD18) tumor necrosis factor receptor 2 related protein mRNA, complete cds. ACCESSION L04270 NID g339761 KEYWORDS tumor necrosis factor receptor 2 related protein. SOURCE Homo sapiens (library: liver cDNA of P.M.) liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2136) AUTHORS Baens,M., Chaffanet,M., Cassiman,J.J., Van den Berghe,H. and Marynen,P. TITLE Construction and evaluation of a hncDNA library of human 12p transcribed sequences derived from a somatic cell hybrid JOURNAL Genomics 16, 214-218 (1993) MEDLINE 93252381 FEATURES Location/Qualifiers source 1..2136 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" /tissue_lib="liver cDNA of P.M." CDS 169..1476 /note="putative" /codon_start=1 /product="tumor necrosis factor receptor 2 related protein" /db_xref="PID:g339762" /translation="MLLPWATSAPGLAWGPLVLGLFGLLAASQPQAVPPYASENQTCR DQEKEYYEPQHRICCSRCPPGTYVSAKCSRIRDTVCATCAENSYNEHWNYLTICQLCR PCDPVMGLEEIAPCTSKRKTQCRCQPGMFCAAWALECTHCELLSDCPPGTEAELKDEV GKGNNHCVPCKAGHFQNTSSPSARCQPHTRCENQGLVEAAPGTAQSDTTCKNPLEPLP PEMSGTMLMLAVLLPLAFFLLLATVFSCIWKSHPSLCRKLGSLLKRRPQGEGPNPVAG SWEPPKAHPYFPDLVQPLLPISGDVSPVSTGLPAAPVLEAGVPQQQSPLDLTREPQLE PGEQSQVAHGTNGIHVTGGSMTITGNIYIYNGPVLGGPPGPGDLPATPEPPYPIPEEG DPGPPGLSTPHQEDGKAWHLAETEHCGATPSNRGPRNQFITHD" BASE COUNT 446 a 706 c 608 g 376 t ORIGIN 1 gccctggagg cccggcctgg ccgctcccgg ccctggggtg cacatcggcc ctgagtcccg 61 tcccaggctc tgggctcggg cagccgccgc caccgctgcc caggacgtcg ggcctcctgc 121 cttcctccca ggcccccacg ttgctggccg cctggccgag tggccgccat gctcctgcct 181 tgggccacct ctgcccccgg cctggcctgg gggcctctgg tgctgggcct cttcgggctc 241 ctggcagcat cgcagcccca ggcggtgcct ccatatgcgt cggagaacca gacctgcagg 301 gaccaggaaa aggaatacta tgagccccag caccgcatct gctgctcccg ctgcccgcca 361 ggcacctatg tctcagctaa atgtagccgc atccgggaca cagtttgtgc cacatgtgcc 421 gagaattcct acaacgagca ctggaactac ctgaccatct gccagctgtg ccgcccctgt 481 gacccagtga tgggcctcga ggagattgcc ccctgcacaa gcaaacggaa gacccagtgc 541 cgctgccagc cgggaatgtt ctgtgctgcc tgggccctcg agtgtacaca ctgcgagcta 601 ctttctgact gcccgcctgg cactgaagcc gagctcaaag atgaagttgg gaagggtaac 661 aaccactgcg tcccctgcaa ggcagggcac ttccagaata cctcctcccc cagcgcccgc 721 tgccagcccc acaccaggtg tgagaaccaa ggtctggtgg aggcagctcc aggcactgcc 781 cagtccgaca caacctgcaa aaatccatta gagccactgc ccccagagat gtcaggaacc 841 atgctgatgc tggccgttct gctgccactg gccttctttc tgctccttgc caccgtcttc 901 tcctgcatct ggaagagcca cccttctctc tgcaggaaac tgggatcgct gctcaagagg 961 cgtccgcagg gagagggacc caatcctgta gctggaagct gggagcctcc gaaggcccat 1021 ccatacttcc ctgacttggt acagccactg ctacccattt ctggagatgt ttccccagta 1081 tccactgggc tccccgcagc cccagttttg gaggcagggg tgccgcaaca gcagagtcct 1141 ctggacctga ccagggagcc gcagttggaa cccggggagc agagccaggt ggcccacggt 1201 accaatggca ttcatgtcac cggcgggtct atgactatca ctggcaacat ctacatctac 1261 aatggaccag tactgggggg accaccgggt cctggagacc tcccagctac ccccgaacct 1321 ccatacccca ttcccgaaga gggggaccct ggccctcccg ggctctctac accccaccag 1381 gaagatggca aggcttggca cctagcggag acagagcact gtggtgccac accctctaac 1441 aggggcccaa ggaaccaatt tatcacccat gactgacgga gtctgagaaa aggcagaaga 1501 aggggggcac aagggcactt tctcccttga ggctgccctg cccacgtggg attcacaggg 1561 gcctgagtag ggcccgggga agcagagccc taagggatta aggctcagac acctctgaga 1621 gcaggtgggc actggctggg tacggtgccc tccacaggac tctccctact gcctgagcaa 1681 acctgaggcc tcccggcaga cccacccacc ccctggggct gctcagcctc aggcacggac 1741 agggcacatg ataccaactg ctgcccacta cggcacgccg caccggagca cggcaccgag 1801 ggagccgcca cacggtcacc tgcaaggacg tcacgggccc ctctaaagga ttcgtggtgc 1861 tcatccccaa gcttcagaga ccctttgggg ttccacactt cacgtggact gaggtagacc 1921 ctgcatgaag atgaaattat agggaggacg ctccttccct cccctcctag aggagaggaa 1981 agggagtcat taacaactag ggggttgggt aggattccta ggtatgggga agagttttgg 2041 aaggggagga aaatggcaag tgtatttata ttgtaaccac atgcaaataa aaagaatggg 2101 acctaaactc gtgccgctcg tgccgaattc ctgcag // LOCUS HUMTNTS 980 bp mRNA PRI 14-JAN-1995 DEFINITION Human slow skeletal muscle troponin T mRNA, clone H22h. ACCESSION M19309 J03476 NID g339780 KEYWORDS alternative splicing; sarcomeric protein; troponin. SOURCE Human skeletal muscle, cDNA to mRNA, clone H22h. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 980) AUTHORS Gahlmann,R., Troutt,A.B., Wade,R.P., Gunning,P. and Kedes,L. TITLE Alternative splicing generates variants in important functional domains of human slow skeletal troponin T JOURNAL J. Biol. Chem. 262 (33), 16122-16126 (1987) MEDLINE 88058976 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by R.Gahlmann, 23-OCT-1987. FEATURES Location/Qualifiers source 1..980 /organism="Homo sapiens" /db_xref="taxon:9606" /map="19q13.4" mRNA <1..980 /note="TNS mRNA" gene 58..894 /gene="TNNT1" CDS 58..894 /gene="TNNT1" /note="slow skeletal muscle troponin T" /codon_start=1 /db_xref="GDB:G00-125-310" /db_xref="PID:g339781" /translation="MSDTEEQEYEEEQPEEEAADEEEEAPEEPEPVAEPEEERPKPSR PVVPPLIPPKIPEGERVDFDDIHRKRMEKDLLELQTLIDVHFEQRKKEEEELVALKER IERRRSERAEQQRFRTEKERERQAKLAEEKMRKEEEEAKKRAEDDAKKKKVLSNMGAH FGGYLVKAEQKRGKRQTGREMKVRILSERKKPLDIDYMGEEQLRARSAWLPPSQPSCP AREKAQELSDWIHQLESEKFDLMAKLKQQKYEINVLYNRISHAQKFRKGAGKGRVGGR WK" BASE COUNT 261 a 253 c 335 g 131 t ORIGIN 42 bp upstream of StyI site. 1 agcaaggctc agcctcaaga ttcacagcat ctcagacgca gcctaggccg caccaggatg 61 tcggacaccg aggagcagga atatgaggag gagcagccgg aagaggaggc tgcggacgag 121 gaggaggaag cccccgaaga gccggagccg gtggcagagc cagaagagga acgccccaaa 181 ccaagccgcc ccgtggtgcc tcctttgatc ccgccaaaga tcccagaagg ggagcgcgtt 241 gacttcgatg acatccaccg caagcgcatg gagaaagacc tgctggagct gcagacactc 301 atcgatgtac atttcgagca gcggaagaag gaggaagagg agctggttgc cttgaaggag 361 cgcattgagc ggcgccggtc agagagagcc gagcaacagc gcttcagaac tgagaaggaa 421 cgcgaacgtc aggctaagct ggcggaggag aagatgagga aggaagagga agaggccaag 481 aagcgggcag aggatgatgc caagaaaaag aaggtgctgt ccaacatggg ggcccatttt 541 ggcggctacc tggtcaaggc agaacagaag cgtggtaagc ggcagacggg gcgggagatg 601 aaggtgcgca tcctctccga gcgtaagaag cctctggaca ttgactacat gggggaggaa 661 cagctccggg cccggtctgc ctggctgcct ccatcacagc cctcctgccc tgccagggag 721 aaagcccagg agctgtcgga ctggatccac cagctggagt ctgagaagtt cgacctgatg 781 gcgaagctga aacagcagaa atatgagatc aacgtgctgt acaaccgcat cagccacgcc 841 cagaagttcc ggaagggggc agggaagggc cgcgttggag gccgctggaa gtgaggatgc 901 cgccccggac agtggcacct gggaagcctg ggagtgtttg tcccatcggt agcttgaaat 961 aaacgctccc ctcagacacc // LOCUS HUMTOFA 1210 bp mRNA PRI 29-JUL-1996 DEFINITION Human mRNA for tob family, complete cds. ACCESSION D64110 NID g1469889 KEYWORDS tob family. SOURCE Homo sapiens cDNA to mRNA, clone_lib:Dauji clone:tob5. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1210) AUTHORS Yoshida,Y., Matuda,S. and Yamamoto,T. TITLE Cloning of the human tob5 JOURNAL Unpublished (1995) REFERENCE 2 (bases 1 to 1210) AUTHORS Yoshida,Y. TITLE Direct Submission JOURNAL Submitted (09-SEP-1995) to the DDBJ/EMBL/GenBank databases. Yutaka Yoshida, The University of Tokyo, Department of Oncology; 4-6-1 Shirokanedai, Minato-ku, Tokyo 108, Japan (Tel:03-5449-5303, Fax:03-5449-5413) COMMENT Submitted (9-Sep-1995) to DDBJ by: Yutaka Yoshida Dept. of Oncology The University of Tokyo 4-6-1 Shirokanedai Minato-ku, Tokyo 108 Japan Phone: 03-5449-5303 Email: michino@hgcdb.ims.u-tokyo.ac.jp Fax: 03-5449-5413. FEATURES Location/Qualifiers source 1..1210 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="tob5" /clone_lib="Dauji" CDS 95..853 /codon_start=1 /product="tob family" /db_xref="PID:d1011628" /db_xref="PID:g1469890" /translation="MKNEIAAVVFFFTRLVRKHDKLKKRQVERFAEKLTLILQGKYKN PWYPEKPSKGQAYRCIRVNKFQRVDPDVLKACENSCILYSDLGLPKELTLWVDPCEVC CRYGEKNNAFIVASFENKDENKDEISRKVTRALDKVTSDYHSGSSSSDEETSKEMEVK PSSVTAAASPVYQISELIFPPLPMWHPLPRKKPGMYRGNGHQNHYPPPVPFGYPNQGR KNKPYRPIPVTWVPPPGMHCDRNHWINPHMLAPH" BASE COUNT 375 a 226 c 271 g 338 t ORIGIN 1 gggcccgttg aggcggagct cagttcccgg ccaggacacg gtctgggccg ccgaatctcc 61 ggccgaagag cggcggcggc agggcgggaa aaaaatgaag aatgaaattg ctgccgttgt 121 cttctttttc acaaggctag ttcgaaaaca tgataagttg aaaaagaggc aagttgagag 181 gtttgctgag aaattgaccc taatacttca aggaaaatat aaaaaccctt ggtatccaga 241 aaaaccatcg aaaggacagg cctacagatg tattcgtgtc aataaatttc agagagttga 301 tcctgatgtc ctgaaagcct gtgaaaacag ctgcatcttg tatagtgacc tgggcttgcc 361 aaaggagctc actctctggg tggacccatg tgaggtgtgc tgtcggtatg gagagaaaaa 421 caatgcattc attgttgcca gctttgaaaa taaagatgag aacaaggatg agatctccag 481 gaaagttacc agggcccttg ataaggttac ctctgattat cattcaggat cctcttcttc 541 agatgaagaa acaagtaagg aaatggaagt gaaacccagt tcggtgactg cagccgcaag 601 tcctgtgtac cagatttcag aacttatatt tccacctctt ccaatgtggc accctttgcc 661 cagaaaaaag ccaggaatgt atcgagggaa tggccatcag aatcactatc ctcctcctgt 721 tccatttggt tatccaaatc agggaagaaa aaataaacca tatcgcccaa ttccagtgac 781 atgggtacct cctcctggaa tgcattgtga ccggaatcac tggattaatc ctcacatgtt 841 agcacctcac taacttcgtt tttgattgtg ttggtgtcat gttgagaaaa aggtagaata 901 aaccttacta cacattaaaa gttaaaagtt cttactaata gtagtgaagt tagatgggcc 961 aaaccatcaa acttattttt atagaagtta ttgagaataa tctttcttaa aaaatatatg 1021 cactttagat attgatatag tttgagaaat tttattaaag ttagtcaagt gccgaagttt 1081 ttaatattgg acttgagtat ttatatattg tgcatcaact ctgttggata cgagaacact 1141 gtagaagtgg acgatttgtt ctagcacctt tgagaattta ctttatggag cgtatgtaag 1201 ttatttatat // LOCUS HUMTOPI 3645 bp mRNA PRI 14-JAN-1995 DEFINITION Human topoisomerase I mRNA, complete cds. ACCESSION J03250 NID g339805 KEYWORDS topoisomerase. SOURCE Human placenta, cDNA to mRNA, clones T1[A,B]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3645) AUTHORS D'Arpa,P., Machlin,P.S., Ratrie,H. III., Rothfield,N.F., Cleveland,D.W. and Earnshaw,W.C. TITLE cDNA cloning of human DNA topoisomerase I: catalytic activity of a 67.7-kDa carboxyl-terminal fragment JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85 (8), 2543-2547 (1988) MEDLINE 88190108 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by H.Ratrie, 02-MAY-1988. FEATURES Location/Qualifiers source 1..3645 /organism="Homo sapiens" /db_xref="taxon:9606" /map="20q12-q13.1" gene 212..2509 /gene="TOP1" CDS 212..2509 /gene="TOP1" /note="topoisomerase I" /codon_start=1 /db_xref="GDB:G00-120-444" /db_xref="PID:g339806" /translation="MSGDHLHNDSQIEADFRLNDSHKHKDKHKDREHRHKEHKKEKDR EKSKHSNSEHKDSEKKHKEKEKTKHKDGSSEKHKDKHKDRDKEKRKEEKVRASGDAKI KKEKENGFSSPPQIKDEPEDDGYFVPPKEDIKPLKRPRDEDDVDYKPKKIKTEDTKKE KKRKLEEEEDGKLKKPKNKDKDKKVPEPDNKKKKPKKEEEQKWKWWEEERYPEGIKWK FLEHKGPVFAPPYEPLPENVKFYYDGKVMKLSPKAEEVATFFAKMLDHEYTTKEIFRK NFFKDWRKEMTNEEKNIITNLSKCDFTQMSQYFKAQTEARKQMSKEEKLKIKEENEKL LKEYGFCIMDNHKERIANFKIEPPGLFRGRGNHPKMGMLKRRIMPEDIIINCSKDAKV PSPPPGHKWKEVRHDNKVTWLVSWTENIQGSIKYIMLNPSSRIKGEKDWQKYETARRL KKCVDKIRNQYREDWKSKEMKVRQRAVALYFIDKLALRAGNEKEEGETADTVGCCSLR VEHINLHPELDGQEYVVEFDFLGKDSIRYYNKVPVEKRVFKNLQLFMENKQPEDDLFD RLNTGILNKHLQDLMEGLTAKVFRTYNASITLQQQLKELTAPDENIPAKILSYNRANR AVAILCNHQRAPPKTFEKSMMNLQTKIDAKKEQLADARRDLKSAKADAKVMKDAKTKK VVESKKKAVQRLEEQLMKLEVQATDREENKQIALGTSKLNYLDPRITVAWCKKWGVPI EKIYNKTQREKFAWAIDMADEDYEF" BASE COUNT 1221 a 706 c 837 g 881 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcgggc gccgcccgcc cggcagtcag gcagcgtcgc cgccgtggta gcagcctcag 61 ccgtttctgg agtctcgggc ccacagtcac cgccgcttac ctgcgcctcc tcgagcctcc 121 ggagtccccg tccgcccgca caggccggtt cgccgtctgc gtctccccca cgccgcctcg 181 cctgccgccg cgctcgtccc tccgggccga catgagtggg gaccacctcc acaacgattc 241 ccagatcgaa gcggatttcc gattgaatga ttctcataaa cacaaagata aacacaaaga 301 tcgagaacac cggcacaaag aacacaagaa ggagaaggac cgggaaaagt ccaagcatag 361 caacagtgaa cataaagatt ctgaaaagaa acacaaagag aaggagaaga ccaaacacaa 421 agatggaagc tcagaaaagc ataaagacaa acataaagac agagacaagg aaaaacgaaa 481 agaggaaaag gttcgagcct ctggggatgc aaaaataaag aaggagaagg aaaatggctt 541 ctctagtcca ccacaaatta aagatgaacc tgaagatgat ggctattttg ttcctcctaa 601 agaggatata aagccattaa agagacctcg agatgaggat gatgttgatt ataaacctaa 661 gaaaattaaa acagaagata ccaagaagga gaagaaaaga aaactagaag aagaagagga 721 tggtaaattg aaaaaaccca agaataaaga taaagataaa aaagttcctg agccagataa 781 caagaaaaag aagccgaaga aagaagagga acagaagtgg aaatggtggg aagaagagcg 841 ctatcctgaa ggcatcaagt ggaaattcct agaacataaa ggtccagtat ttgccccacc 901 atatgagcct cttccagaga atgtcaagtt ttattatgat ggtaaagtca tgaagctgag 961 ccccaaagca gaggaagtag ctacgttctt tgcaaaaatg ctcgaccatg aatatactac 1021 caaggaaata tttaggaaaa atttctttaa agactggaga aaggaaatga ctaatgaaga 1081 gaagaatatt atcaccaacc taagcaaatg tgattttacc cagatgagcc agtatttcaa 1141 agcccagacg gaagctcgga aacagatgag caaggaagag aaactgaaaa tcaaagagga 1201 gaatgaaaaa ttactgaaag aatatggatt ctgtattatg gataaccaca aagagaggat 1261 tgctaacttc aagatagagc ctcctggact tttccgtggc cgcggcaacc accccaagat 1321 gggcatgctg aagagacgaa tcatgcccga ggatataatc atcaactgta gcaaagatgc 1381 caaggttcct tctcctcctc caggacataa gtggaaagaa gtccggcatg ataacaaggt 1441 tacttggctg gtttcctgga cagagaacat ccaaggttcc attaaataca tcatgcttaa 1501 ccctagttca cgaatcaagg gtgagaagga ctggcagaaa tacgagactg ctcggcggct 1561 gaaaaaatgt gtggacaaga tccggaacca gtatcgagaa gactggaagt ccaaagagat 1621 gaaagtccgg cagagagctg tagccctgta cttcatcgac aagcttgctc tgagagcagg 1681 caatgaaaag gaggaaggag aaacagcgga cactgtgggc tgctgctcac ttcgtgtgga 1741 gcacatcaat ctacacccag agttggatgg tcaggaatat gtggtagagt ttgacttcct 1801 cgggaaggac tccatcagat actataacaa ggtccctgtt gagaaacgag tttttaagaa 1861 cctacaacta tttatggaga acaagcagcc cgaggatgat ctttttgata gactcaatac 1921 tggtattctg aataagcatc ttcaggatct catggagggc ttgacagcca aggtattccg 1981 tacgtacaat gcctccatca cgctacagca gcagctaaaa gaactgacag ccccggatga 2041 gaacatccca gcgaagatcc tttcttataa ccgtgccaat cgagctgttg caattctttg 2101 taaccatcag agggcaccac caaaaacttt tgagaagtct atgatgaact tgcaaactaa 2161 gattgatgcc aagaaggaac agctagcaga tgcccggaga gacctgaaaa gtgctaaggc 2221 tgatgccaag gtcatgaagg atgcaaagac gaagaaggta gtagagtcaa agaagaaggc 2281 tgttcagaga ctggaggaac agttgatgaa gctggaagtt caagccacag accgagagga 2341 aaataaacag attgccctgg gaacctccaa actcaattat ctggacccta ggatcacagt 2401 ggcttggtgc aagaagtggg gtgtcccaat tgagaagatt tacaacaaaa cccagcggga 2461 gaagtttgcc tgggccattg acatggctga tgaagactat gagttttagc cagtctcaag 2521 aggcagagtt ctgtgaagag gaacagtgtg gtttgggaaa gatggataaa ctgagcctca 2581 cttgccctcg tgcctggggg agagaggcag caagtcttaa caaaccaaca tctttgcgaa 2641 aagataaacc tggagatatt ataagggaga gctgagccag ttgtcctatg gacaacttat 2701 ttaaaaatat ttcagatatc aaaattctag ctgtatgatt tgttttgaat tttgttttta 2761 ttttcaagag ggcaagtgga tgggaatttg tcagcgttct accaggcaaa ttcactgttt 2821 cactgaaatg tttggattct cttagctact gtatgcaaag tccgattata ttggtgcgtt 2881 tttacagtta gggttttgca ataacttcta tattttaata gaaataaatt cctaaactcc 2941 cttccctctc tcccatttca ggaatttaaa attaagtaga acaaaaaacc cagcgcacct 3001 gttagagtcg tcactctcta ttgtcatggg gatcaatttt cattaaactt gaagcagtcg 3061 tggctttggc agtgttttgg ttcagacacc tgttcacaga aaaagcatga tgggaaaata 3121 tttcctgact tgagtgttcc tttttaaatg tgaatttttt ttttttttaa ttattttaaa 3181 atatttaaac ctttttcttg atcttaaaga tcgtgtagat tggggttggg gagggatgaa 3241 gggcgagtga atctaaggat aatgaaataa tcagtgactg aaaccatttt cccatcatcc 3301 tttgttctga gcattcgctg taccctttaa gatatccatc tttttctttt taaccctaat 3361 ctttcacttg aaagatttta ttgtataaaa agtttcacag gtcaataaac ttagaggaaa 3421 atgagtattt ggtccaaaaa aaggaaaaat aatcaagatt ttagggcttt tattttttct 3481 tttgtaattg tgtaaaaaat ggaaaaaaac ataaaaagca gaattttaat gtgaagacat 3541 tttttgctat aatcattagt tttagaggca ttgttagttt agtgtgtgtg cagagtccat 3601 ttcccacatc tttcctcaag tatcttctat ttttatcatg aattc // LOCUS HUMTOPII 4792 bp mRNA PRI 14-JAN-1995 DEFINITION Human DNA topoisomerase II (top2) mRNA, complete cds. ACCESSION J04088 NID g292829 KEYWORDS DNA topoisomerase. SOURCE Human cell line HeLa, cDNA to mRNA, clone lambda-gt10. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4792) AUTHORS Tsai-Pflugfelder,M., Liu,L.F., Liu,A.A., Tewey,K.M., Whang-Peng,J., Knutsen,T., Huebner,K., Croce,C.M. and Wang,J.C. TITLE Cloning and sequencing of cDNA encoding human DNA topoisomerase II and localization of the gene to chromosome region 17q21-22 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85 (19), 7177-7181 (1988) MEDLINE 89017161 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by M.Tsai-Pflugfelder, 24-OCT-1988. FEATURES Location/Qualifiers source 1..4792 /organism="Homo sapiens" /db_xref="taxon:9606" /map="17q21-q22" gene 37..4632 /gene="TOP2A" CDS 37..4632 /gene="TOP2A" /note="DNA topoisomerase II (EC 5.99.1.3)" /codon_start=1 /db_xref="GDB:G00-118-884" /db_xref="PID:g292830" /translation="MEVSPLQPVNENMQVNKIKKNEDAKKRLSVERIYQKKTQLEHIL LRPDTYIGSVELVTQQMWVYDEDVGINYREVTFVPGLYKIFDEILVNAADNKQRDPKM SCIRVTIDPENNLISIWNNGKGIPVVEHKVEKMYVPALIFGQLLTSSNYDDDEKKVTG GRNGYGAKLCNIFSTKFTVETASREYKKMFKQTWMDNMGRAGEMELKPFNGEDYTCIT FQPDLSKFKMQSLDKDIVALMVRRAYDIAGSTKDVKVFLNGNKLPVKGFRSYVDMYLK DKLDETGNSLKVIHEQVNHRWEVCLTMSEKGFQQISFVNSIATSKGGRHVDYVADQIV TKLVDVVKKKNKGGVAVKAHQVKNHMWIFVNALIENPTFDSQTKENMTLQPKSFGSTC QLSEKFIKAAIGCGIVESILNWVKFKAQVQLNKKCSAVKHNRIKGIPKLDDANDAGGR NSTECTLILTEGDSAKTLAVSGLGVVGRDKYGVFPLRGKILNVREASHKQIMENAEIN NIIKIVGLQYKKNYEDEDSLKTLRYGKIMIMTDQDQDGSHIKGLLINFIHHNWPSLLR HRFLEEFITPIVKVSKNKQEMAFYSLPEFEEWKSSTPNHKKWKVKYYKGLGTSTSKEA KEYFADMKRHRIQFKYSGPEDDAAISLAFSKKQIDDRKEWLTNFMEDRRQRKLLGLPE DYLYGQTTTYLTYNDFINKELILFSNSDNERSIPSMVDGLKPGQRKVLFTCFKRNDKR EVKVAQLAGSVAEMSSYHHGEMSLMMTIINLAQNFVGSNNLNLLQPIGQFGTRLHGGK DSASPRYIFTMLSSLARLLFPPKDDHTLKFLYDDNQRVEPEWYIPIIPMVLINGAEGI GTGWSCKIPNFDVREIVNNIRRLMDGEEPLPMLPSYKNFKGTIEELAPNQYVISGEVA ILNSTTIEISELPVRTWTQTYKEQVLEPMLNGTEKTPPLITDYREYHTDTTVKFVVKM TEEKLAEAERVGLHKVFKLQTSLTCNSMVLFDHVGCLKKYDTVLDILRDFFELRLKYY GLRKEWLLGMLGAESAKLNNQARFILEKIDGKIIIENKPKKELIKVLIQRGYDSDPVK AWKEAQQKVPDEEENEESDNEKETEKSDSVTDSGPTFNYLLDMPLWYLTKEKKDELCR LRNEKEQELDTLKRKSPSDLWKEDLATFIEELEAVEAKEKQDEQVGLPGKGGKAKGKK TQMAEVLPSPRGQRVIPRITIEMKAEAEKKNKKKIKNENTEGSPQEDGVELEGLKQRL EKKQKREPGTKTKKQTTLAFKPIKKGKKRNPWPDSESDRSSDESNFDVPPRETEPRRA ATKTKFTMDLDSDEDFSDFDEKTDDEDFVPSDASPPKTKTSPKLSNKELKPQKSVVSD LEADDVKGSVPLSSSPPATHFPDETEITNPVPKKNVTVKKTAAKSQSSTSTTGAKKRA APKGTKRDPALNSGVSQKPDPAKTKNRRKRKPSTSDDSDSNFEKIVSKAVTSKKSKGE SDDFHMDFDSAVAPRAKSVRAKKPIKYLEESDEDDLF" BASE COUNT 1682 a 857 c 1038 g 1215 t ORIGIN 35 bp upstream of NcoI site; chromosome 17q21-22. 1 ggaccaccca gtaccgatcc cttcacgacc gtcaccatgg aagtgtcacc attgcagcct 61 gtaaatgaaa atatgcaagt caacaaaata aagaaaaatg aagatgctaa gaaaagactg 121 tctgttgaaa gaatctatca aaagaaaaca caattggaac atattttgct ccgcccagac 181 acctacattg gttctgtgga attagtgacc cagcaaatgt gggtttacga tgaagatgtt 241 ggcattaact atagggaagt cacttttgtt cctggtttgt acaaaatctt tgatgagatt 301 ctagttaatg ctgcggacaa caaacaaagg gacccaaaaa tgtcttgtat tagagtcaca 361 attgatccgg aaaacaattt aattagtata tggaataatg gaaaaggtat tcctgttgtt 421 gaacacaaag ttgaaaagat gtatgtccca gctctcatat ttggacagct cctaacttct 481 agtaactatg atgatgatga aaagaaagtg acaggtggtc gaaatggcta tggagccaaa 541 ttgtgtaaca tattcagtac caaatttact gtggaaacag ccagtagaga atacaagaaa 601 atgttcaaac agacatggat ggataatatg ggaagagctg gtgagatgga actcaagccc 661 ttcaatggag aagattatac atgtatcacc tttcagcctg atttgtctaa gtttaaaatg 721 caaagcctgg acaaagatat tgttgcacta atggtcagaa gagcatatga tattgctgga 781 tccaccaaag atgtcaaagt ctttcttaat ggaaataaac tgccagtaaa aggatttcgt 841 agttatgtgg acatgtattt gaaggacaag ttggatgaaa ctggtaactc cttgaaagta 901 atacatgaac aagtaaacca caggtgggaa gtgtgtttaa ctatgagtga aaaaggcttt 961 cagcaaatta gctttgtcaa cagcattgct acatccaagg gtggcagaca tgttgattat 1021 gtagctgatc agattgtgac taaacttgtt gatgttgtga agaagaagaa caagggtggt 1081 gttgcagtaa aagcacatca ggtgaaaaat cacatgtgga tttttgtaaa tgccttaatt 1141 gaaaacccaa cctttgactc tcagacaaaa gaaaacatga ctttacaacc caagagcttt 1201 ggatcaacat gccaattgag tgaaaaattt atcaaagctg ccattggctg tggtattgta 1261 gaaagcatac taaactgggt gaagtttaag gcccaagtcc agttaaacaa gaagtgttca 1321 gctgtaaaac ataatagaat caagggaatt cccaaactcg atgatgccaa tgatgcaggg 1381 ggccgaaact ccactgagtg tacgcttatc ctgactgagg gagattcagc caaaactttg 1441 gctgtttcag gccttggtgt ggttgggaga gacaaatatg gggttttccc tcttagagga 1501 aaaatactca atgttcgaga agcttctcat aagcagatca tggaaaatgc tgagattaac 1561 aatatcatca agattgtggg tcttcagtac aagaaaaact atgaagatga agattcattg 1621 aagacgcttc gttatgggaa gataatgatt atgacagatc aggaccaaga tggttcccac 1681 atcaaaggct tgctgattaa ttttatccat cacaactggc cctctcttct gcgacatcgt 1741 tttctggagg aatttatcac tcccattgta aaggtatcta aaaacaagca agaaatggca 1801 ttttacagcc ttcctgaatt tgaagagtgg aagagttcta ctccaaatca taaaaaatgg 1861 aaagtcaaat attacaaagg tttgggcacc agcacatcaa aggaagctaa agaatacttt 1921 gcagatatga aaagacatcg tatccagttc aaatattctg gtcctgaaga tgatgctgct 1981 atcagcctgg cctttagcaa aaaacagata gatgatcgaa aggaatggtt aactaatttc 2041 atggaggata gaagacaacg aaagttactt gggcttcctg aggattactt gtatggacaa 2101 actaccacat atctgacata taatgacttc atcaacaagg aacttatctt gttctcaaat 2161 tctgataacg agagatctat cccttctatg gtggatggtt tgaaaccagg tcagagaaag 2221 gttttgttta cttgcttcaa acggaatgac aagcgagaag taaaggttgc ccaattagct 2281 ggatcagtgg ctgaaatgtc ttcttatcat catggtgaga tgtcactaat gatgaccatt 2341 atcaatttgg ctcagaattt tgtgggtagc aataatctaa acctcttgca gcccattggt 2401 cagtttggta ccaggctaca tggtggcaag gattctgcta gtccacgata catctttaca 2461 atgctcagct ctttggctcg attgttattt ccaccaaaag atgatcacac gttgaagttt 2521 ttatatgatg acaaccagcg tgttgagcct gaatggtaca ttcctattat tcccatggtg 2581 ctgataaatg gtgctgaagg aatcggtact gggtggtcct gcaaaatccc caactttgat 2641 gtgcgtgaaa ttgtaaataa catcaggcgt ttgatggatg gagaagaacc tttgccaatg 2701 cttccaagtt acaagaactt caagggtact attgaagaac tggctccaaa tcaatatgtg 2761 attagtggtg aagtagctat tcttaattct acaaccattg aaatctcaga gcttcccgtc 2821 agaacatgga cccagacata caaagaacaa gttctagaac ccatgttgaa tggcaccgag 2881 aagacacctc ctctcataac agactatagg gaataccata cagataccac tgtgaaattt 2941 gttgtgaaga tgactgaaga aaaactggca gaggcagaga gagttggact acacaaagtc 3001 ttcaaactcc aaactagtct cacatgcaac tctatggtgc tttttgacca cgtaggctgt 3061 ttaaagaaat atgacacggt gttggatatt ctaagagact tttttgaact cagacttaaa 3121 tattatggat taagaaaaga atggctccta ggaatgcttg gtgctgaatc tgctaaactg 3181 aataatcagg ctcgctttat cttagagaaa atagatggca aaataatcat tgaaaataag 3241 cctaagaaag aattaattaa agttctgatt cagaggggat atgattcgga tcctgtgaag 3301 gcctggaaag aagcccagca aaaggttcca gatgaagaag aaaatgaaga gagtgacaac 3361 gaaaaggaaa ctgaaaagag tgactccgta acagattctg gaccaacctt caactatctt 3421 cttgatatgc ccctttggta tttaaccaag gaaaagaaag atgaactctg caggctaaga 3481 aatgaaaaag aacaagagct ggacacatta aaaagaaaga gtccatcaga tttgtggaaa 3541 gaagacttgg ctacatttat tgaagaattg gaggctgttg aagccaagga aaaacaagat 3601 gaacaagtcg gacttcctgg gaaagggggg aaggccaagg ggaaaaaaac acaaatggct 3661 gaagttttgc cttctccgcg tggtcaaaga gtcattccac gaataaccat agaaatgaaa 3721 gcagaggcag aaaagaaaaa taaaaagaaa attaagaatg aaaatactga aggaagccct 3781 caagaagatg gtgtggaact agaaggccta aaacaaagat tagaaaagaa acagaaaaga 3841 gaaccaggta caaagacaaa gaaacaaact acattggcat ttaagccaat caaaaaagga 3901 aagaagagaa atccctggcc tgattcagaa tcagatagga gcagtgacga aagtaatttt 3961 gatgtccctc cacgagaaac agagccacgg agagcagcaa caaaaacaaa attcacaatg 4021 gatttggatt cagatgaaga tttctcagat tttgatgaaa aaactgatga tgaagatttt 4081 gtcccatcag atgctagtcc acctaagacc aaaacttccc caaaacttag taacaaagaa 4141 ctgaaaccac agaaaagtgt cgtgtcagac cttgaagctg atgatgttaa gggcagtgta 4201 ccactgtctt caagccctcc tgctacacat ttcccagatg aaactgaaat tacaaaccca 4261 gttcctaaaa agaatgtgac agtgaagaag acagcagcaa aaagtcagtc ttccacctcc 4321 actaccggtg ccaaaaaaag ggctgcccca aaaggaacta aaagggatcc agctttgaat 4381 tctggtgtct ctcaaaagcc tgatcctgcc aaaaccaaga atcgccgcaa aaggaagcca 4441 tccacttctg atgattctga ctctaatttt gagaaaattg tttcgaaagc agtcacaagc 4501 aagaaatcca agggggagag tgatgacttc catatggact ttgactcagc tgtggctcct 4561 cgggcaaaat ctgtacgggc aaagaaacct ataaagtacc tggaagagtc agatgaagat 4621 gatctgtttt aaaatgtgag gcgattattt taagtaatta tcttaccaag cccaagactg 4681 gttttaaagt tacctgaagc tcttaacttc ctcccctctg aatttagttt ggggaaggtg 4741 tttttagtac aagacatcaa agtgaagtaa agcccaagtg ttctttagct tt // LOCUS HUMTPPIIA 4626 bp mRNA PRI 07-AUG-1992 DEFINITION Homo sapiens tripeptidyl peptidase II mRNA, complete cds. ACCESSION M73047 J05299 M55445 M72378 NID g339879 KEYWORDS tripeptidyl peptidase II. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4626; 889 to 4626) AUTHORS Tomkinson,B. and Jonsson,A.-K. TITLE Characterization of cDNA for human tripeptidyl peptidase II: The N-terminal part of the enzyme is similar to subtilisin JOURNAL Biochemistry 30, 168-174 (1991) MEDLINE 91105077 REFERENCE 2 (bases 1 to 4626; 1 to 888) AUTHORS Tomkinson,B. TITLE Nucleotide sequence of cDNA covering the N-terminus of human tripeptidyl peptidase II JOURNAL Biomed. Biochim. Acta 50, 727-729 (1991) MEDLINE 92198394 FEATURES Location/Qualifiers source 1..4626 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="SPL" /cell_type="B-lymphocyte" /tissue_lib="lambda gt10" CDS 24..3773 /codon_start=1 /product="tripeptidyl peptidase II" /db_xref="PID:g339880" /translation="MATAATEEPFPFHGLLPKKETGAASFLCRYPEYDGRGVLIAVLD TGVDPGAPGMQVTTDGKPKIVDIIDTTGSGDVNTATEVEPKDGEIVGLSGRVLKIPAS WTNPSGKYHIGIKNGYDFYPKALKERIQKERKEKIWDPVHRVALAEACRKQEEFDVAN NGSSQANKLIKEELQSQVELLNSFEKKYSDPGPVYDCLVWHDGEVWRACIDSNEDGDL SKSTVLRNYKEAQEYGSFGTAEMLNYSVNIYDDRNLLSIVTSGGAHGTHVASIAAGHF PEEPERNGVAPGAQILSIKIGDTRLSTMETGTGLIRAMIEVINHKCDLVNYSYGEATH WPNSGRICEVINEAVWKHNIIYVSSAGNNGPCLSTVGCPGGTTSSVIGVGAYVSPDMM VAEYSLREKLPANQYTWSSRGPSADGALGVSISAPGGAIASVPNWTLRGTQLMNGTSM SSPNACGGIALILSGLKANNIDYTVHSVRRALENTAVKADNIEVFAQGHGIIQVDKAY DYLVQNTSFANKLGFTVTVGNNRGIYLRDPVQVAAPSDHGVGIEPVFPENTENSEKIS LQLHLALTSNSSWVQCPSHLELMNQCRHINIRVDPRGLREGLHYTEVCGYDIASPNAG PLFRVPITAVIAAKVNESSHYDLAFTDVHFKPGQIRRHFIEVPEGATWAEVTVCSCSS EVSAKFVLHAVQLVKQRAYRSHEFYKFCSLPEKGTLTEAFPVLGGKAIEFCIARWWAS LSDVNIDYTISFHGIVCTAPQLNIHASEGINRFDVQSSLKYEDLAPCITLKNWVQTLR PVSAKTKPLGSRDVLPNNRQLYEMVLTYNFHQPKSGEVTPSCPLLCELLYESEFDSQL WIIFDQNKRQMGSGDAYPHQYSLKLEKGDYTIRLQIRHEQISDLERLKDLPFIVSHRL SNTLSLDIHENHSFALLGKKKSSNLTLPPKYNQPFFVTSLPDDKIPKGAGPGCYLAGS LTLSKTELGKKADVIPVHYYLIPPPTKTKNGSKDKEKDSEKEKDLKEEFTEALRDLKI QWMTKLDSSDIYNELKETYPNYLPLYVARLHQLDAEKERMKRLNEIVDAANAVISHID QTALAVYIAMKTDPRPDAATIKNDMDKQKSTLVDALCRKGCALADHLLHTQAQDGAIS TDAEGKEEEGESPLDSLAETFWETTKWTDLFDNKVLTFAYKHALVNKMYGRGLKFATK LVEEKPTKENWKNCIQLMKLLGWTHCASFTENWLPIMYPPDYCVF" BASE COUNT 1452 a 885 c 983 g 1306 t ORIGIN 1 gaattcccct ccatcctgcg tccatggcca ccgctgcgac tgaggagccc ttcccttttc 61 acggtctcct gccgaagaag gagaccggag ccgcctcctt cctctgccgc tacccggagt 121 atgatgggcg gggggtgctc atcgcagtcc tggacacggg ggtcgacccg ggggctccgg 181 gcatgcaggt tacaactgat ggaaaaccaa aaatcgttga tatcattgat acaacaggaa 241 gtggcgatgt gaatactgct acagaagtag agccaaagga tggtgagatt gttggccttt 301 caggaagagt gcttaagatt cctgcaagct ggacaaatcc ctcaggcaaa tatcatattg 361 gcataaaaaa tggctatgac ttctatccta aggcactcaa ggaaaggata cagaaagaac 421 ggaaggaaaa aatctgggac cctgttcaca gagtggccct tgcagaagcc tgtagaaaac 481 aggaagaatt tgatgttgcc aacaacggct cttctcaagc aaataaacta atcaaggagg 541 aacttcaaag tcaagtggaa ttgctaaatt cttttgagaa gaaatacagc gatcctggcc 601 ctgtatatga ctgcttggta tggcatgatg gcgaagtctg gagagcctgc attgattcta 661 atgaagatgg ggacttgagt aaatctaccg tgttgagaaa ctacaaagaa gcccaagaat 721 atggctcttt tggcacagct gagatgttga attactccgt taatatatac gatgatagaa 781 acctgctctc cattgtgacc agtggaggag ctcatgggac acatgtagct agtatagctg 841 ctggacactt tccagaagaa cctgaacgga atggggtagc tcctggtgct caaattcttt 901 ccatcaagat tggtgataca agactaagca caatggaaac aggcacaggc ctcataagag 961 ctatgataga agttataaat cataagtgtg atcttgtcaa ctacagttac ggagaagcaa 1021 ctcactggcc aaattctggg agaatttgtg aagtaattaa tgaagcagta tggaagcata 1081 atataattta tgtttcaagt gctggaaata atggtccatg cctgtctaca gttggttgtc 1141 caggtggaac tacatcaagt gtgataggtg ttggtgctta tgtttctcct gatatgatgg 1201 ttgctgagta ttcactgaga gagaaattac ctgcaaatca atatacttgg tcttctagag 1261 gacctagtgc tgacggggcc cttggtgtga gtatcagtgc gccaggagga gccattgctt 1321 ctgttcctaa ctggacactg agagggacgc agctgatgaa tggaacatct atgtcttccc 1381 ccaatgcatg tggaggcatt gccctgatcc tttcaggtct gaaagctaat aacattgact 1441 acacagttca ttcagtcaga agagctctag aaaacactgc agtgaaggct gacaatatag 1501 aagtatttgc tcaaggacat ggtattattc aggttgataa agcctatgac tacctcgttc 1561 agaatacatc atttgctaat aaattaggtt ttactgttac tgttggaaat aaccgtggca 1621 tctacctccg agatcctgtt caggtggctg caccttcaga tcatggcgtt ggcattgaac 1681 ctgtatttcc ggagaacaca gaaaactctg aaaaaatatc ccttcagctt catttagctc 1741 tgacttcaaa ttcatcttgg gttcagtgtc ccagccattt ggaactcatg aatcaatgta 1801 gacacataaa catacgtgtg gatcccaggg gcttaagaga aggattgcat tatacagagg 1861 tatgtggcta tgatatagca tcccctaacg caggtccgct cttcagagtt ccgatcactg 1921 cagttatagc agcaaaagta aatgaatcat cacattatga tctagccttt acagatgtac 1981 actttaaacc tggtcaaatt cgaaggcatt ttattgaggt tcctgagggt gcaacatggg 2041 ctgaagtgac agtgtgttcg tgttcttctg aggtgtcagc aaagtttgtt ctacatgcag 2101 tccagcttgt gaagcaaaga gcatatcgaa gccatgaatt ctataagttt tgttctcttc 2161 cagagaaagg aacactgact gaagcttttc ctgtcctagg tggaaaagca attgaatttt 2221 gcattgctcg ttggtgggca agtctcagtg atgtcaacat tgattatacc atttctttcc 2281 atgggatagt gtgtactgct cctcagttaa acattcatgc atcggaagga atcaaccgct 2341 ttgatgttca gtcctccttg aaatacgaag atctggctcc ctgcataact ttgaagaact 2401 gggtccaaac actgcgccca gtgagtgcaa aaacaaaacc tttaggatca agagatgttt 2461 tgccaaataa ccgtcaactt tatgagatgg tcctgacata taactttcat caacccaaga 2521 gtggggaagt aactccaagc tgcccactac tttgtgaact attatatgaa tctgaatttg 2581 acagccaact gtggattatt tttgaccaga acaaaagaca gatgggttca ggcgatgcct 2641 atccacatca gtattctttg aaactggaga aaggagatta tacaattcga ctacagattc 2701 gccatgagca aatcagtgat ttggaacgcc ttaaagacct tccatttatt gtttctcata 2761 gattgtctaa taccttgagc ttagatattc atgaaaatca tagttttgca cttctaggga 2821 agaagaaatc aagcaatttg acattaccac ccaaatataa ccagccattc tttgttactt 2881 ccttacctga tgataaaata cctaaagggg caggacctgg atgctatctt gcaggatcct 2941 taacattgtc aaagactgaa ctaggaaaga aagctgatgt aatccctgtt cattactact 3001 taatacctcc accaacaaag actaagaatg gcagcaaaga taaggaaaaa gattcagaaa 3061 aagagaaaga tttaaaagaa gagtttactg aagcattacg agatcttaaa attcagtgga 3121 tgacaaagct ggattctagt gacatttata acgaattgaa agaaacatat cctaattatc 3181 ttcctctgta cgttgcacga cttcatcaat tggatgctga aaaggaacga atgaaaagac 3241 ttaatgaaat tgttgatgcg gcaaatgctg ttatttctca tatagatcaa acagccctag 3301 cagtttatat tgcaatgaag actgatccca ggcctgatgc agctactata aaaaatgaca 3361 tggacaaaca aaaatccacc ctcgtagatg ccctttgtag gaaaggttgt gccctggcag 3421 accatcttct tcacacccag gctcaagacg gagccatttc cactgatgca gaaggaaagg 3481 aggaggaagg agaaagtcct ttggattctc tggcagaaac attttgggaa actactaaat 3541 ggactgatct ctttgacaat aaggttttga catttgcata taaacatgca ttagtaaata 3601 aaatgtatgg gagaggcctt aaatttgcaa ctaaacttgt ggaagaaaaa ccaacaaaag 3661 aaaactggaa aaattgtatt caactgatga agttacttgg atggacccat tgtgcatctt 3721 ttactgaaaa ctggctcccc atcatgtatc ctcccgatta ttgcgtattc taaaatagga 3781 aacaagactt taaattttaa aaaaggaagt tttatagtga atgggtataa aaacaaattt 3841 gtggcatttt tagtctaatg catgttttca tccactatcc agtactgatt attaaaatga 3901 catgtattta tcagagaatt cactgacgtg tggcttaata catgtaaatc tagacctctg 3961 acatcatggt gttttcttaa tgcctcacat tgctggcacg gggatgtgcc ctgcctgcca 4021 gcacctagga cttcgagttg ggttgcagct tatgacatgc atgataggtt ttggaaggta 4081 acttttaact gcaaacctat aaagtactat tttttatttt ataaatgaac agggttttaa 4141 cgtgctcaac tttaattttt ttcaattgta tgaaggcctt aaaaaagcta cattaagcgt 4201 agctaaaatt atttattgga ctaaaaacta acagaacttc atttccagaa tttttttttt 4261 tttttttttt ttggcaaatg tttacattca attaagggga aaaagtagaa ccagcacaaa 4321 tgagtggcag ttgctggagc ataactgctt caataaatct tcatcttggg gtaattacag 4381 gcaagtcatt ttcacatcct cttgaggttc agagcatcag aatgaactct atgaatacat 4441 gtgtaagtgc cagacagctg aatctttatc aggtattgta aagatacaca tatgatatgt 4501 ttattaaaat tgaaataatg taaaacacat gaataaattt gcaaaaccaa gatcacagta 4561 caccatatgc actctggtac cttaattttt ttttataaat aataaaagtg aatattgaag 4621 cttctt // LOCUS HUMTRBP 1368 bp mRNA PRI 03-MAY-1991 DEFINITION Human TAR RNA binding protein (TRBP) mRNA, complete cds. ACCESSION M60801 NID g339908 KEYWORDS TAR RNA binding protein. SOURCE Human HeLa D98/AH-2 cell line, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1368) AUTHORS Gatignol,A., Buckler-White,A.J., Berkhout,B. and Jeang,K.-T. TITLE Characterization of a human TAR RNA-binding protein that activates the HIV-1 LTR JOURNAL Science 251, 1597-1600 (1991) MEDLINE 91188258 FEATURES Location/Qualifiers source 1..1368 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa D98/AH-2" gene 55..1092 /gene="TRBP" CDS 55..1092 /gene="TRBP" /codon_start=1 /product="TAR RNA binding protein" /db_xref="PID:g339909" /translation="MLAANPGKTPISLLQEYGTRIGKTPVYDLLKAEGQAHQPNFTFR VTVGDTSCTGQGPSKKAAKHKAAEVALKHLKGGSMLEPALEDSSSFSPLDSSLPEDIP VFTAAAAATPVPSVVLTRSPAMELQPPVSPQQSECNPVGALQELVVQKGWRLPEYTVT QESGPAHRKEFTMTCRVERFIEIGSGTSKKLAKRNAAAKMLLRVHTVPLDARDGNEVE PDDDHFSIGVGFRLDGLRNRGPGCTWDSLRNSVGEKILSLRSCSLGSLGALGPACCRV LSELSEEQAFHVSYLDIEELSLSGLCQCLVELSTQPATVCHGSATTREAARGEAARRA LQYLKIMAGSK" BASE COUNT 279 a 405 c 395 g 289 t ORIGIN 1 gctcttgggt tctgtagttt tctcgcgatc caaaaggctc cgtgcccaaa gcaaatgctg 61 gccgccaacc caggcaagac cccgatcagc cttctgcagg agtatgggac cagaataggg 121 aagacgcctg tgtacgacct tctcaaagcc gagggccaag cccaccagcc taatttcacc 181 ttccgggtca ccgttggcga caccagctgc actggtcagg gccccagcaa gaaggcagcc 241 aagcacaagg cagctgaggt ggccctcaaa cacctcaaag gggggagcat gctggagccg 301 gccctggagg acagcagttc tttttctccc ctagactctt cactgcctga ggacattccg 361 gtttttactg ctgcagcagc tgctacccca gttccatctg tagtcctaac caggagcccc 421 gccatggaac tgcagccccc tgtctcccct cagcagtctg agtgcaaccc cgttggtgct 481 ctgcaggagc tggtggtgca gaaaggctgg cggttgccgg agtacacagt gacccaggag 541 tctgggccag cccaccgcaa agaattcacc atgacctgtc gagtggagcg tttcattgag 601 attgggagtg gcacttccaa aaaattggca aagcggaatg cggcggccaa aatgctgctt 661 cgagtgcaca cggtgcctct ggatgcccgg gatggcaatg aggtggagcc tgatgatgac 721 cacttctcca ttggtgtggg cttccgcctg gatggtcttc gaaaccgggg cccaggttgc 781 acctgggatt ctctacgaaa ttcagtagga gagaagatcc tgtccctccg cagttgctcc 841 ctgggctccc tgggtgccct gggccctgcc tgctgccgtg tcctcagtga gctctctgag 901 gagcaggcct ttcacgtcag ctacctggat attgaggagc tgagcctgag tggactctgc 961 cagtgcctgg tggaactgtc cacccagccg gccactgtgt gtcatggctc tgcaaccacc 1021 agggaggcag cccgtggtga ggctgcccgc cgtgccctgc agtacctcaa gatcatggca 1081 ggcagcaagt gaagccccag ctggactcat ggatgtgcac cctttgctcc ctgctctttc 1141 tgcctctggg ctcatgtatc tgcgcagctc tggtaccctc tgtgggtgcc atctctacct 1201 ctgacacaga ctgcctgcct tgaagctgag aaggcacagg gcaaggagcc aaggaccaca 1261 gagcctcagc cagcccagga tccgtcctca ttttattggt gatgatgaat gggaatgaaa 1321 tcagggggct gtctactaga gcctggaata aatatgctgc tttgtgga // LOCUS HUMTRIP1M 1289 bp mRNA PRI 15-MAR-1995 DEFINITION Homo sapiens thyroid receptor interactor (TRIP1) mRNA, complete cds. ACCESSION L38810 NID g695369 KEYWORDS ATPase; TRIP1 gene; homologue; sug1 gene; thyroid receptor interactor. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1289) AUTHORS Lee,J.W., Choi,H.S., Gyuris,J., Brent,R. and Moore,D.D. TITLE Two classes of proteins dependent on either the presence or absence of thyroid hormone for interaction with the thyroid hormone receptor JOURNAL Mol. Endocrinol. 9 (2), 243-254 (1995) MEDLINE 95295737 REFERENCE 2 (sites) AUTHORS Lee,J.W., Ryan,F., Swaffield,J.C., Johnston,S.A. and Moore,D.D. TITLE Interaction of thyroid-hormone receptor with a conserved transcriptional mediator JOURNAL Nature 374 (6517), 91-94 (1995) MEDLINE 95174891 COMMENT Trip1 was isolated as interacting with the thyroid hormone receptor in the yeast two hybrid system. Interaction is dependent on the presence of thyroid hormone. Trip1 shares strong sequence similarity with the yeast transcriptional mediator SUG1 and contains a CAD region. Submitted sequence is full length or nearly full length. FEATURES Location/Qualifiers source 1..1289 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <19..1289 /gene="TRIP1" /product="thyroid receptor interactor" gene 19..1289 /gene="TRIP1" CDS 19..1239 /gene="TRIP1" /codon_start=1 /product="thyroid receptor interactor" /db_xref="PID:g695370" /translation="MALDGPEQMELEEGKAGSGLRQYYLSKIEELQLIVNDKSQNLRR LQAQRNELNAKVRLLREELQLLQEQGSYVGEVVRAMDKKKVLVKVHPEGKFVVDVDKN IDINDVTPNCRVALRNDSYTLHKILPNKVDPLVSLMMVEKVPDSTYEMIGGLDKQIKE IKEVIELPVKHPELFEALGIAQPKGVLLYGPPGTGKTLLARAVAHHTDCTFIRVSGSE LVQKFIGEGARMVRELFVMAREHAPSIIFMDEIDSIGSSRLEGGSGGSSEVQRQMLEL LNQLDGFEATKNIKVIMATNRIDMLDSALLRPGRIDRKIEFPPPNEEARLDILKIHSR KMNLTRGINLRKIAELMPGASGAEVKGVCTEAGMYALRERRVHVTQEDFEMAVAKVMQ KDSEKNMSIKKLWK" 3'UTR 1240..1289 /gene="TRIP1" polyA_signal 1267..1272 /gene="TRIP1" /note="putative" polyA_site 1289 /gene="TRIP1" BASE COUNT 353 a 278 c 386 g 272 t ORIGIN 1 tgctgctgaa gagagaagat ggcgcttgac ggaccagagc agatggagct ggaggagggg 61 aaggcaggca gcggactccg ccaatattat ctgtccaaga ttgaagaact ccagctgatt 121 gtgaatgata agagccaaaa cctccggagg ctgcaggcac agaggaacga actaaatgct 181 aaagttcgcc tattgcggga ggagctacag ctgctgcagg agcagggctc ctatgtgggg 241 gaagtagtcc gggccatgga taagaagaaa gtgttggtca aggtacatcc tgaaggtaaa 301 tttgttgtag acgtggacaa aaacattgac atcaatgatg tgacacccaa ttgccgggtg 361 gctctaagga atgacagcta cactctgcac aagatcctgc ccaacaaggt agacccatta 421 gtgtcactga tgatggtgga gaaagtacca gattcaactt atgagatgat tggtggactg 481 gacaaacaga tcaaggagat caaagaagtg atcgagctgc ctgttaagca tcctgagctc 541 ttcgaagcac tgggcattgc tcagcccaag ggagtgctgc tgtatggacc tccaggcact 601 gggaagacac tgttggcccg ggctgtggct catcatacgg actgtacctt tattcgtgtc 661 tctggctctg aattggtaca gaaattcata ggggaagggg caagaatggt gagggagctg 721 tttgtcatgg cacgggaaca tgctccatct atcatcttca tggacgaaat cgactccatc 781 ggctcctcgc ggctggaggg gggttctgga gggagcagtg aagtgcagcg ccagatgctg 841 gagttgctca accagctcga cggctttgag gccaccaaga acatcaaggt tatcatggct 901 actaatagga ttgatatgct ggactcggca ctgcttcgcc cagggcgcat tgacagaaaa 961 attgaattcc caccccccaa tgaggaggcc cggctggaca ttttgaagat tcattctcgg 1021 aagatgaacc tgacccgggg gatcaacctg agaaaaattg ctgagctcat gccaggagca 1081 tcaggggctg aagtgaaggg cgtgtgcacg gaagctggca tgtatgccct gcgagaacgg 1141 cgagtccatg tcactcagga ggactttgag atggcagtag ccaaggtcat gcagaaggac 1201 agtgagaaaa acatgtccat caagaaatta tggaagtgag tggacagcct ttgtgtgtat 1261 ctctccaata aagctctgtg ggccaagtc // LOCUS HUMTRIP9G 1940 bp DNA PRI 15-MAR-1995 DEFINITION Homo sapiens thyroid receptor interactor (TRIP9) gene, complete cds. ACCESSION L40407 NID g703117 KEYWORDS TRIP9 gene; thyroid receptor interactor. SOURCE Homo sapiens DNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1940) AUTHORS Lee,J.W., Choi,H.S., Gyuris,J., Brent,R. and Moore,D.D. TITLE Two classes of proteins dependent on either the presence or absence of thyroid hormone for interaction with the thyroid hormone receptor JOURNAL Mol. Endocrinol. 9 (2), 243-254 (1995) MEDLINE 95295737 COMMENT Trip9 was isolated as interacting with the thyroid hormone receptor in the yeast 2-hybrid system. Interaction is dependent on the presence of hormone. Submitted sequence is full length or nearly full length at the 5' end and includes the apparently complete protein coding region. The fusion junction in the original yeast 2-hybrid isolate is at position 569. Trip9 includes 6 copies of the motif commonly referred to as the ankyrin repeat. Two mRNAs of approximate 1.8 and 2.8 kb are detected by Northern blotting. FEATURES Location/Qualifiers source 1..1940 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="Hela" gene 53..1069 /gene="TRIP9" CDS 53..1069 /gene="TRIP9" /codon_start=1 /product="thyroid receptor interactor" /db_xref="PID:g703118" /translation="MAGVACLGKAADADEWCDTGLGSLGPDAAAPGGPGLGAELGPGL SWAPLVFGYVTEDGDTALHLAVIHQHEPFLDFLLGFSAGTEYMDLQNDLGQTALHLAA ILGETSTVEKLYAAGAGLCVAERRGHTALHLACRVGAHACARALLQPRPRRPREAPDT YLAQGPDRTPDTNHTPVALYPDSDLEKEEEESEEDWKLQLEAENYEGHTPLHVAVIHK DVEMVRLLRDAGADLDKPEPTCGRSPLHLAVEAQAADVLELLLRAGANPAARMYGGRT PLGSAMLRPNPILARLLRAHGAPEPEGEDEKSGPCSSSSDSDGGDEGVSQEERQGSPA GGSG" BASE COUNT 420 a 526 c 678 g 316 t ORIGIN 1 aaagccagct acaggcgggc gactgcgggg ggcccctgag gcggcggggg ccatggctgg 61 ggtcgcgtgc ttgggaaaag ctgccgacgc agatgaatgg tgcgacacgg gcctgggctc 121 cctgggtccg gacgcagcgg cccccggagg acctgggttg ggcgcggagt tgggcccggg 181 gctgtcgtgg gctcccctcg tcttcggcta cgtcactgag gatggggaca cggcactgca 241 cttggctgtg attcatcagc atgaaccctt cctggatttt cttctaggct tctcggccgg 301 cactgagtac atggacctgc agaatgacct aggccagaca gccctgcacc tggcagccat 361 cctgggggag acatccacgg tggagaagct gtacgcagca ggcgccgggc tgtgtgtggc 421 ggagcgtagg ggccacacgg cgctgcacct ggcctgccgt gtgggggcac acgcctgtgc 481 ccgtgccctg cttcagcccc gcccccggcg ccccagggaa gcccccgaca cctacctcgc 541 tcagggccct gaccgtactc ccgacaccaa ccatacccct gtcgccttgt accccgattc 601 cgacttggag aaggaagaag aggagagtga ggaggactgg aagctgcagc tggaggctga 661 aaactacgag ggccacaccc cactccacgt ggccgttatc cacaaagatg tggagatggt 721 ccggctgctc cgagatgctg gagctgacct tgacaaaccg gagcccacgt gcggccggag 781 cccccttcat ttggcagtgg aggcccaggc agccgatgtg ctggagcttc tcctgagggc 841 aggcgcgaac cctgctgccc gcatgtacgg tggccgcacc ccactcggca gtgccatgct 901 ccggcccaac cccatcctcg cccgcctcct ccgtgcacac ggagcccctg agcccgaggg 961 cgaggatgag aaatccggcc cctgcagcag cagtagcgac agcgacggcg gagacgaggg 1021 cgtgagtcag gaggagagac agggcagccc agctgggggg tcaggataga ccggcagcaa 1081 gaagcccaag aagataatta ggcaccgacc ttgggctgct gttagagaac tcaggcggca 1141 cgccagtgac acggggcact agtcaggaga gacctggaca ggggtggtgg gaagagcttg 1201 ggcagaagtg gctgaaaaac taaggcagtg gcaaaggtag aactcaggca ggggtggaga 1261 aaagcgttgg tcgcagtgat tggtgaacac agcgggggtg ggtggtagcg ctgggggtga 1321 ttttaggcag caagaattgg agaactcaca ctgcgaaaag aaaaccttgg gtggcagtga 1381 tttgaacacc ggcagtgctg gggcaggacc cgagccagcg gtggggagag atatagtcag 1441 agaacccagc aatacagatc cgtccttggg caaggcgcgg tgctggatga tgggtgcgga 1501 ggaatttggg taaaggcaga gggaaggggt ggaggagggc cagctcagtt gccgaaactc 1561 tggagtggcg gctggctaga aattggtctg tagaaatgac cttgaaaatg gagttctggc 1621 caggtgcggt ggctcacgcc tgtaatccca gcactttggg aggccgaggc aggcagatca 1681 cgaggtcagg agttcgagac cagcctggcc aacatggcaa gaccctgtct ctactaaaaa 1741 tacaaaaatt agctgggcgt ggtggcgcat gcctataatc ccagctactt gggaggctga 1801 ggtaggagaa ttgcttgaac ctgggaggtg gaggttgcag tgaacctaga tcacgccact 1861 gcattccagc ctgggcaaca gagcgcatga tcagtcaacc gctcgaggga tcttccatag 1921 gatggtcaag acgcggacgt // LOCUS HUMTRNB 1437 bp mRNA PRI 07-MAR-1995 DEFINITION Human transducin beta-2 subunit mRNA, complete cds. ACCESSION M36429 NID g339934 KEYWORDS transducin. SOURCE Human myeloid leukemia cell line HL-60, cDNA to mRNA, clones lambda-[115.1,4C4,123]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1437) AUTHORS Amatruda,T.T.III., Fong,H.K.W., Birren,B.W. and Simon,M.I. TITLE Signal transduction in cytoplasmic organization and cell moltility: molecular cloning of a distinct form of the beta subunit of GTP-binding regulatory proteins JOURNAL (in) Satir,P., Condeelis,J.S. and Lazarides,E. (Eds.); UCLA SYMP. MOL. CELL. BIOL. NEW SER., Vol. 77: 339-352; Alan R. Liss, Publisher, NY (1988) FEATURES Location/Qualifiers source 1..1437 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="lambda-115.1" /cell_line="HL-60" /cell_type="myeloid leukemia cell" /clone="lambda-4C4" /clone="lambda-123" CDS 58..1080 /note="transducin beta-2 subunit" /codon_start=1 /db_xref="PID:g339935" /translation="MSELEQLRQEAEQLRNQIRDARKACGDSTLTQITAGLDPVGRIQ MRTRRTLRGHLAKIYAMHWGTDSRLLVSASQDGKLIIWDSYTTNKVHAIPLRSSWVMT CAYAPSGNFVACGGLDNICSIYSLKTREGNVRVSRELPGHTGYLSCCRFLDDNQIITS SGDTTCALWDIETGQQTVGFAGHSGDVMSLSLAPDGRTFVSGACDASIKLWDVRDSMC RQTFIGHESDINAVAFFPNGYAFTTGSDDATCRLFDLRADQELLMYSHDNIICGITSV AFSRSGRLLLAGYDDFNCNIWDAMKGDRAGVLAGHDNRVSCLGVTDDGMAVATGSWDS FLKIWN" BASE COUNT 272 a 467 c 425 g 273 t ORIGIN 1 ggcccccgtc ccgcggcccc cagccgcccc caaccctgcc ccacggggcc cggcgccatg 61 agtgagctgg agcaactgag acaggaggcc gagcagctcc ggaaccagat ccgggatgcc 121 cgaaaagcat gtggggactc aacactgacc cagatcacag ctgggctgga cccagtgggg 181 agaatccaga tgaggacccg gaggaccctc cgtgggcacc tggcaaagat ctatgccatg 241 cactggggga ccgactcaag gctgctggtc agcgcctccc aggatgggaa gctcatcatc 301 tgggacagct acaccaccaa caaggtccac gccatcccgc tgcgctcctc ctgggtaatg 361 acctgtgcct acgcgccctc agggaacttt gtggcctgtg gggggttgga caacatctgc 421 tccatctaca gcctcaagac ccgcgagggc aacgtcaggg tcagccggga gctgcctggc 481 cacactgggt acctgtcgtg ttgccgcttc ctggatgaca accaaatcat caccagctct 541 ggggatacca cctgtgccct gtgggacatt gagacaggcc agcagacagt gggttttgct 601 ggacacagtg gggatgtgat gtccctgtcc ctggcccccg atggccgcac gtttgtgtca 661 ggcgcctgtg atgcctctat caagctgtgg gacgtgcggg attccatgtg ccgacagacc 721 ttcatcggcc atgaatccga catcaatgca gtggctttct tccccaacgg ctacgccttc 781 accaccggct ctgacgacgc cacgtgccgc ctcttcgacc tgcgggccga tcaggagctc 841 ctcatgtact cccatgacaa catcatctgt ggcatcacct ctgttgcctt ctcgcgcagc 901 ggacggctgc tgctcgctgg ctacgacgac ttcaactgca acatctggga tgccatgaag 961 ggcgaccgtg caggagtcct cgctggccac gacaaccgcg tgagctgcct cggggtcacc 1021 gacgatggca tggctgtggc cacgggctcc tgggactcct tcctcaagat ctggaactaa 1081 tggccccacc cccactggcc caggccagga ggggccctgc ccatgcccac actacaggcc 1141 agggctgcgg gctggcgcaa tcccagcccc cttccccggg ccacgggcct tgggtccctg 1201 ccctcccacc caggtttggt tcctcccggg gcccccactg tggagataag aaggggatgg 1261 aatgggggaa gaggaggagc aggaggccct atcttctgct gccctggggt tggggcctca 1321 cccctctgga gggccggagg caggaggtgg aaaccccagg ggctggcttt tttaaaactg 1381 gttttatttt aatttttatt atattttcag tttttccata aaggagccaa ttccaac // LOCUS HUMTRNSAL 1332 bp mRNA PRI 09-MAY-1997 DEFINITION Human transaldolase mRNA containing transposable element, complete cds. ACCESSION L19437 NID g2073540 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1332) AUTHORS Banki,K., Halladay,D. and Perl,A. TITLE Cloning and expression of the human gene for transaldolase. A novel highly repetitive element constitutes an integral part of the coding sequence JOURNAL J. Biol. Chem. 269 (4), 2847-2851 (1994) MEDLINE 94132057 REFERENCE 2 (bases 1 to 1332) AUTHORS Perl,A. TITLE Direct Submission JOURNAL Submitted (08-MAY-1997) Medicine, SUNY HSC, 750 E. Adams Street, Syracuse, NY 13210, USA REMARK Sequence update by submitter FEATURES Location/Qualifiers source 1..1332 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HL-60" /germline misc_feature 1..470 /standard_name="transposable element" 5'UTR 1..56 CDS 57..1070 /codon_start=1 /product="transaldolase" /db_xref="PID:g2073541" /translation="MSSSPVKRQRMESALDQLKQFTTVVADTGDFHAIDEYKPQDATT NPSLILAAAQMPAYQELVEEAIAYGRKLGGSQEDQIKNAIDKLFVLFGAEILKKIPGR VSTEVDARLSFDKDAMVARARRLIELYKEAGISKDRILIKLSSTWEGIQAGKELEEQH GIHCNMTLLFSFAQAVACAEAGVTLISPFVGRILDWHVANTDKKSYEPLEDPGVKSVT KIYNYYKKFSYKTIVMGASFRNTGEIKALAGCDFLTISPKLLGELLQDNAKLVPVLSA KAAQASDLEKIHLDEKSFRWLHNEDQMAVEKLSDGIRKFAADAVKLERMLTERMFNAE NGK" 3'UTR 1068..1332 BASE COUNT 384 a 340 c 343 g 265 t ORIGIN 1 gaattccgcg cccgtcccgt cgccgccgcc gccgccgcag acccctcggt cttgctatgt 61 cgagctcacc cgtgaagcgt cagaggatgg agtccgcgct ggaccagctc aagcagttca 121 ccaccgtggt ggccgacacg ggcgacttcc acgccatcga cgagtacaag ccccaggatg 181 ctaccaccaa cccgtccctg atcctggccg cagcacagat gcccgcttac caggagctgg 241 tggaggaggc gattgcctat ggccggaagc tgggcgggtc acaagaggac cagattaaaa 301 atgctattga taaacttttt gtgttgtttg gagcagaaat actaaagaag attccgggcc 361 gagtatccac agaagtagac gcaaggctct cctttgataa agatgcgatg gtggccagag 421 ccaggcggct catcgagctc tacaaggaag ctgggatcag caaggaccga attcttataa 481 agctgtcatc aacctgggaa ggaattcagg ctggaaagga gctcgaggag cagcacggca 541 tccactgcaa catgacgtta ctcttctcct tcgcccaggc tgtggcctgt gccgaggcgg 601 gtgtgaccct catctcccca tttgttgggc gcatccttga ttggcatgtg gcaaacaccg 661 acaagaaatc ctatgagccc ctggaagacc ctggggtaaa gagtgtcact aaaatctaca 721 actactacaa gaagtttagc tacaaaacca ttgtcatggg cgcctccttc cgcaacacgg 781 gcgagatcaa agcactggcc ggctgtgact tcctcaccat ctcacccaag ctcctgggag 841 agctgctgca ggacaacgcc aagctggtgc ctgtgctctc agccaaggcg gcccaagcca 901 gtgacctgga aaaaatccac ctggatgaga agtctttccg ttggttgcac aacgaggacc 961 agatggctgt ggagaagctc tctgacggga tccgcaagtt tgccgctgat gcagtgaagc 1021 tggagcggat gctgacagaa cgaatgttca atgcagagaa tggaaagtag cgcatccctg 1081 aggctggact ccagatctgc accgccggcc agctgggatc tgactgcacg tggcttctga 1141 tgaatcttgc gttttttaca aattggagca gggacagatc atagatttct gattttatgt 1201 aaaattttgc ctaatacatt aaagcagtca cttttcctgt gctgtttcaa aaaaaaaaaa 1261 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1321 aaaaaggaat tc // LOCUS HUMTROMOD 2665 bp mRNA PRI 14-JAN-1995 DEFINITION Human tropomodulin mRNA, complete cds. ACCESSION M77016 NID g339947 KEYWORDS erythrocyte membrane skeletal protein; tropomodulin. SOURCE Homo sapiens (tissue library: fetal liver cDNA; fetal reticulocyte cDNA) fetus liver; blood reticulocyte cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2665) AUTHORS Sung,L.A., Fowler,V.M., Lambert,K., Sussman,M.A., Karr,D. and Chien,S. TITLE Molecular cloning and characterization of human fetal liver tropomodulin. A tropomyosin-binding protein JOURNAL J. Biol. Chem. 267 (4), 2616-2621 (1992) MEDLINE 92129352 FEATURES Location/Qualifiers source 1..2665 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="liver; blood reticulocyte" /tissue_lib="fetal liver cDNA; fetal reticulocyte cDNA" /map="9q22" gene 35..1114 /gene="TMOD" CDS 35..1114 /gene="TMOD" /codon_start=1 /db_xref="GDB:G00-127-386" /product="tropomodulin" /db_xref="PID:g339948" /translation="MSYRRELEKYRDLDEDEILGALTEEELRTLENELDELDPDNALL PAGLRQKDQTTKAPTGPFKREELLDHLEKQAKEFKDREDLVPYTGEKRGKVWVPKQKP LDPVLESVTLEPELEEALANASDAELCDIAAILGMHTLMSNQQYYQALSSSSIMNKEG LNSVIKPTQYKPVPDEEPNSTDVEETLERIKNNDPKLEEVNLNNIRNIPIPTLKAYAE ALKENSYVKKFSIVGTRSNDPVAYALAEMLKENKVLKTLNVESNFISGAGILRLVEAL PYNTSLVEMKIDNQSQPLGNKVEMEIVSMLEKNATLLKFGYHFTQQGPRLRASNAMMN NNDLVRKRRLADLTGPIIPKCRSGV" BASE COUNT 811 a 582 c 626 g 646 t ORIGIN q22 chromosome 9. 1 agaaattcag gagacacaga caagttcttc cacgatgtcg tacagacgag aactagagaa 61 ataccgtgac ctggatgaag atgaaatcct tggagcccta acagaggaag agctgaggac 121 cctggaaaat gagctggatg agctggaccc tgataatgca ctgctgcctg caggcctgag 181 gcagaaggat cagaccacca aggcgcccac gggccccttt aaaagagagg agctcttgga 241 tcacttggaa aagcaagcaa aggagtttaa ggaccgagaa gatctggtcc cctacacagg 301 ggaaaaacga ggaaaggtct gggttcctaa gcagaagcca ctggatcctg tgctggaaag 361 tgtgacgctg gaaccggagc tggaggaagc cttggcaaat gcttcagatg cagaactctg 421 tgacattgca gcgatcctgg gcatgcacac gctcatgagt aaccagcagt actaccaggc 481 cctgagcagc agctccatca tgaacaagga ggggctcaac agcgtgatta aacccacaca 541 atacaagcct gtgcccgacg aagaaccaaa ttcaacagac gtagaggaaa cgctggaacg 601 gataaagaac aacgacccaa aacttgaaga agttaacctc aataatatcc ggaatatccc 661 catccccacc ctcaaggcat atgcagaagc cctgaaagaa aactcatatg tgaagaagtt 721 cagcatcgtg gggacacgga gtaatgaccc cgtggcgtat gcccttgctg agatgctcaa 781 ggagaacaag gtgttgaaga cactgaatgt ggaatccaac ttcatttctg gagctgggat 841 tctgcgcctg gtagaagccc tcccatacaa cacttctctg gtggaaatga aaattgacaa 901 ccagagccag cccctgggca acaaagtgga aatggagatt gtgagcatgt tggaaaaaaa 961 cgcaacactt ctcaaattcg gctaccactt tacccagcaa ggaccccggc ttcgggcatc 1021 caacgcaatg atgaacaaca atgaccttgt gaggaagagg aggcttgcgg acctgactgg 1081 gcccatcatt cccaagtgcc ggagtggtgt ctagtgtgtg gcggtggagt ccatgccttt 1141 gaactggatg tgttctattg atgacctgtg ctctgcaggg gaaaccagaa ggcaaaatgc 1201 tggcagcatg aaaccctttt gtggttcagt tctttatgca ctaaggtttt aggttgacta 1261 gtggttgtag ttgaaaattt tataaaatac cgttaatgtg aagtttttct ttagtcacag 1321 aagttgaatc tggttattat ttaaaaacta gaagccccca aaccagcaga tcttactgaa 1381 gatgatgttc cagcagcagc gacttagccc caggagccca gtttcaatgg ccttgctgtg 1441 tggtgtttca agtgcattta aaatgtgtga cacagaaacg gcacactctt ccacatgctt 1501 ttgaagtatt ataaaacact ttattacaaa tttgtcttag ctattagcaa ataaaactga 1561 ttatcattct ttattaaccc tccttggaat tttgaaaacc tcgattaaag ttgccaaatt 1621 gattactgga tccagaacac aattttcccc tcagaacaga tagacagact gaagccactg 1681 aactctgcca ggagtcaaca tgagattcct tttgctggat atgcagaaat gataggaaaa 1741 aaaccaatgg tgaaatttca agtttcaaaa accaaccttt cattaccaat cccaggcaac 1801 aaacatgtcc ctgagtgttc tttaagaaca tttgggattt atgtacaatt taatactgga 1861 gttagaactt tttccttatt gaatgccaac cttatgatgg atgtgaaaat ctacggccaa 1921 atacttttga aaacaccttt ctatattgca cagtgggcaa atggcttatg tgaggtaaga 1981 cactagaggg ataaatttcc agatcaacat ggctatggta tttagtaatg gcccagctta 2041 gagacttcag ctactgatct catcacttat tagacaaatt gctgctgacc ttacgcctgt 2101 atattaagcc tccgcaggat gccggacaat ggtgaagaaa ctccagatat caaggaattg 2161 ggaaatcctg gccaaaccac cccaagatga ttacactgaa atgtagtatt agtactgctg 2221 ccagatctct ttttaacatc atgtgcgtct cttgggatcc agcaaaagtg ttaagccaca 2281 atgcccttgt gccttttaat ataccacagt gccagttaaa ctaatatttt tgtttgttgc 2341 ttttgggagt tattttcatt agtgatttca gcaaatctca tgataaagga caaggtcaag 2401 aactccagag cactgagcag agaggctggt gatgaaaagg tgaaggcctg cgcactgaac 2461 tgtaaggcag tgggcagtac agggtaactg gaggcggggc cagggcctca gcgctatgga 2521 agagtgtcca ctgaggctgc acatggccca ggagtggcac catgttgcag ggacaaccat 2581 ccccatttgg cttctcctta aaacacaatt gcagctgcat tctgcatcgc tgaaaactgc 2641 aatataatat taaatctgtt ggtcg // LOCUS HUMTROPI 816 bp mRNA PRI 14-JAN-1995 DEFINITION Human slow-twitch skeletal troponin I (TNN1) mRNA, complete cds. ACCESSION J04760 NID g339964 KEYWORDS slow-twitch skeletal troponin I; troponin I. SOURCE Human adult slow-twitch skeletal muscle, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 816) AUTHORS Wade,R., Eddy,R., Shows,T.B. and Kedes,L. TITLE cDNA sequence, tissue-specific expression, and chromosomal mapping of the human slow-twitch skeletal muscle isoform of troponin I JOURNAL Genomics 7 (3), 346-357 (1990) MEDLINE 90307007 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Wade, 15-MAR-1990. FEATURES Location/Qualifiers source 1..816 /organism="Homo sapiens" /db_xref="taxon:9606" mRNA <1..816 /note="TNN1 mRNA" gene 77..640 /gene="TNNI1" CDS 77..640 /gene="TNNI1" /note="slow-twitch skeletal troponin I (TNN1)" /codon_start=1 /db_xref="GDB:G00-120-443" /db_xref="PID:g339965" /translation="MPEVERKPKITASRKLLLKSLMLAKAKECWEQEHEEREAEKVRY LAERIPTLQTRGLSLSALQDLCRELHAKVEVVDEERYDIEAKCLHNTREIKDLKLKVM DLRGKFKRPPLRRVRVSADAMLRALLGSKHKVSMDLRANLKSVKKEDTEKERPVEVGD WRKNVEAMSGMEGRKKMFDAANAPTSQ" BASE COUNT 180 a 247 c 244 g 145 t ORIGIN Chromosome 1q12-qter. 1 tagtctgcag tctacggcga ggcacaggcc agcccagctc cacgaggact gaacaaggtg 61 ctgtctcact gccaccatgc cggaagtcga gagaaaaccc aagatcactg cctcccgcaa 121 actcttgctg aagagcctga tgctggccaa ggccaaggaa tgctgggagc aggagcacga 181 ggagcgcgag gctgagaagg tgcgctacct ggcagagcgc atccccacgc tgcagacccg 241 tggcctgtcc ctcagtgccc tgcaggacct gtgccgggag ctgcacgcca aggtggaggt 301 ggtggatgag gagcgatacg acattgaggc caaatgcctc cacaacacca gggagattaa 361 ggacctgaag ctgaaggtga tggacctccg tgggaagttc aagcgcccgc ccctgcgtcg 421 agtccgtgtc tcggctgacg ccatgctccg ggccctgctg ggctccaagc acaaggtgtc 481 catggatctg cgggccaacc tcaagtctgt gaagaaggaa gacacagaga aggagcggcc 541 tgtggaggtg ggtgactgga ggaagaacgt ggaggccatg tctggcatgg aaggccggaa 601 gaagatgttt gatgccgcca atgctccgac ctcacaatag aggccagctt gctgtgctgc 661 gctctgagct cctgcttcat gcttcttctc caacccagct cactcacctc tctgcctgtg 721 tctggagcat cccttcccac ctctccccca cttcttccct ccagcctgca atgccctcct 781 ctggaactgg gattaaacag atacccaaga ggcagg // LOCUS HUMTRP2A 2270 bp mRNA PRI 17-AUG-1994 DEFINITION Homo sapiens TRP-2/dopachrome tautomerase (Tyrp-2) mRNA, complete cds. ACCESSION L18967 NID g399581 KEYWORDS . SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2270) AUTHORS Cassady,J.L. and Sturm,R.A. TITLE Sequence of the human dopachrome tautomerase-encoding TRP-2 cDNA JOURNAL Gene 143 (2), 295-298 (1994) MEDLINE 94266170 FEATURES Location/Qualifiers source 1..2270 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="A2058" /cell_type="melanoma" /tissue_lib="Uni-ZAP-XR from Stratagene" sig_peptide 385..453 /gene="Tyrp-2" CDS 385..1944 /gene="Tyrp-2" /codon_start=1 /product="tyrosine-related protein 2" /db_xref="PID:g399582" /translation="MSPLWWGFLLSCLGCKILPGAQGQFPRVCMTVDSLVNKECCPRL GAESANVCGSQQGRGQCTEVRADTRPWSGPYILRNQDDRELWPRKFFHRTCKCTGNFA GYNCGDCKFGWTGPNCERKKPPVIRQNIHSLSPQEREQFLGALDLAKKRVHPDYVITT QHWLGLLGPNGTQPQFANCSVYDFFVWLHYYSVRDTLLGPGRPYRAIDFSHQGPAFVT WHRYHLLCLERDLQRLIGNESFALPYWNFATGRNECDVCTDQLFGAARPDDPTLISRN SRFSSWETVCDSLDDYNHLVTLCNGTYEGLLRRNQMGRNSMKLPTLKDIRDCLSLQKF DNPPFFQNSTFSFRNALEGFDKADGTLDSQVMSLHNLVHSFLNGTNALPHSAANDPIF VVLHSFTDAIFDEWMKRFNPPADAWPQELAPIGHNRMYNMVPFFPPVTNEELFLTSDQ LGYSYAIDLPVSVEETPGWPTTLLVVMGTLVALVGLFVLLAFLQYRRLRKGYTPLMET HLSSKRYTEEA" gene 385..1944 /gene="Tyrp-2" mat_peptide 454..1941 /gene="Tyrp-2" /product="tyrosine-related protein 2" BASE COUNT 638 a 506 c 534 g 592 t ORIGIN 1 gagagggttt agaaatacca gcataataag tagtatgact gggtgctctg taaattaact 61 caattagaca aagcctgact taacggggga agatggtgag aagcgctacc ctcattaaat 121 ttggttgtta gaggcgcttc taaggaaatt aagtctgtta gttgtttgaa tcacataaaa 181 ttgtgtgtgc acgttcatgt acacatgtgc acacatgtaa cctctgtgat tcttgtgggt 241 atttttttaa gaagaaagga atagaaagca aagaaaaata aaaaatactg aaaagaaaag 301 actgaaagag tagaagataa ggagaaaagt acgacagaga caaggaaagt aagagagaga 361 gagagctctc ccaattataa agccatgagc cccctttggt gggggtttct gctcagttgc 421 ttgggctgca aaatcctgcc aggagcccag ggtcagttcc cccgagtctg catgacggtg 481 gacagcctag tgaacaagga gtgctgccca cgcctgggtg cagagtcggc caatgtctgt 541 ggctctcagc aaggccgggg gcagtgcaca gaggtgcgag ccgacacaag gccctggagt 601 ggtccctaca tcctacgaaa ccaggatgac cgtgagctgt ggccaagaaa attcttccac 661 cggacctgca agtgcacagg aaactttgcc ggctataatt gtggagactg caagtttggc 721 tggaccggtc ccaactgcga gcggaagaaa ccaccagtga ttcggcagaa catccattcc 781 ttgagtcctc aggaaagaga gcagttcttg ggcgccttag atctcgcgaa gaagagagta 841 caccccgact acgtgatcac cacacaacac tggctgggcc tgcttgggcc caatggaacc 901 cagccgcagt ttgccaactg cagtgtttat gatttttttg tgtggctcca ttattattct 961 gttagagata cattattagg accaggacgc ccctacaggg ccatagattt ctcacatcaa 1021 ggacctgcat ttgttacctg gcaccggtac catttgttgt gtctggaaag agatctccag 1081 cgactcattg gcaatgagtc ttttgctttg ccctactgga actttgccac tgggaggaac 1141 gagtgtgatg tgtgtacaga ccagctgttt ggggcagcga gaccagacga tccgactctg 1201 attagtcgga actcaagatt ctccagctgg gaaactgtct gtgatagctt ggatgactac 1261 aaccacctgg tcaccttgtg caatggaacc tatgaaggtt tgctgagaag aaatcaaatg 1321 ggaagaaaca gcatgaaatt gccaacctta aaagacatac gagattgcct gtctctccag 1381 aagtttgaca atcctccctt cttccagaac tctaccttca gtttcaggaa tgctttggaa 1441 gggtttgata aagcagatgg gactctggat tctcaagtga tgagccttca taatttggtt 1501 cattccttcc tgaacgggac aaacgctttg ccacattcag ccgccaatga tcccattttt 1561 gtggttcttc attcctttac tgatgccatc tttgatgagt ggatgaaaag atttaatcct 1621 cctgcagatg cctggcctca ggagctggcc cctattggtc acaatcggat gtacaacatg 1681 gttcctttct tccctccagt gactaatgaa gaactctttt taacctcaga ccaacttggc 1741 tacagctatg ccatcgatct gccagtttca gttgaagaaa ctccaggttg gcccacaact 1801 ctcttagtag tcatgggaac actggtggct ttggttggtc tttttgtgct gttggctttt 1861 cttcaatata gaagacttcg aaaaggatat acacccctaa tggagacaca tttaagcagc 1921 aagagataca cagaagaagc ctagggtgct catgccttac ctaagagaag aggttggcca 1981 agccacagtt ctgacgctga caataaagga actaatcctc actgttcctt cttgagttga 2041 agatctttga cataggttct tctatagtga tgatgatctc attcagaaga tgcttagctg 2101 tagtttccgc tttgcttgct tgtttaacaa acccaactaa agtgcttgag gctacctcta 2161 ccttcaaata aagatagacc tgacaatttg tgatatctaa taataacccc ccccccaata 2221 ttgattaagc ctcctccttt tctgaaagca tttaaaaaaa aaaaaaaaaa // LOCUS HUMTSHR 2413 bp mRNA PRI 19-JUL-1995 DEFINITION Homo sapiens thyroid stimulating hormone receptor (TSHR) mRNA, complete cds. ACCESSION M73747 NID g903759 KEYWORDS G-protein linked receptor; thyroid stimulating hormone receptor. SOURCE Homo sapiens (clone library: lambda gt11) adult thyroid cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2413) AUTHORS Frazier,A.L., Robbins,L.S., Stork,P.J., Sprengel,R., Segaloff,D.L. and Cone,R.D. TITLE Isolation of TSH and LH/CG receptor cDNAs from human thyroid: regulation by tissue specific splicing JOURNAL Mol. Endocrinol. 4 (8), 1264-1276 (1990) MEDLINE 91155962 FEATURES Location/Qualifiers source 1..2413 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda gt11" /cell_type="follicular cell" /dev_stage="adult" /tissue_type="thyroid" /map="14q31" gene 71..2362 /gene="TSHR" CDS 71..2362 /gene="TSHR" /codon_start=1 /db_xref="GDB:G00-125-313" /product="thyroid stimulating hormone receptor" /db_xref="PID:g903760" /translation="MRPADLLQLVLLLDLPRDLGGMGCSSPPCECHQEEDFRVTCKDI QRIPSLPPSTQTLKLIETHLRTIPSHAFSNLPNISRIYVSIDVTLQQLESHSFYNLSK VTHIEIRNTRNLTYIDPDALKELPLLKSLAFSNTGLKMFPDLTKVYSTDIFFILEITD NPYMTSIPVNAFQGLCNETLTLKLYNNGFTSVQGYDFFGTKLDAVYLNKNKYLTVIDK DAFGGVYSGPSLLDVSQTSVTALPSKGLEHLKELIARNSWTLKKLALSLSFLHLTRAD LSYPSHCCAFKNQKKIRGILESLMCNESSIETLRQRKSVNALNSPLHQEYEENLGDSI VGYKEKSKFQDTHNNAHYYVFFEEQEDEIIGFGQELKNPQEETLQAFDSHYDYTICGD SEDMVCTPKSDEFNPCEDIMGYKFLRIVVWFVSLLALLGNVFVLLILLTSHYKLNVPR FLMCNLAFADFCMGMYLLLIASVDLYTHSEYYNHAIDWQTGPGCNTAGFFTVFASELS VYTLTVITLERWYAITFAMALDRKIRLRHACAIMVGGWVCCFLLALLPLVGISSYAKV SICLPMDTETPLALAYIVFVLTLNIVAFVIVCCCYVKIYITVRNPHNPGDKDTKIAKR MAVLIFTDFTCMAPISFYAVSAILNKPLITVSNSKILLVLFYPINSCANPFLYAIFTK AFQRDVFILLSKFGICKRQAQAYRGQRVPPKNSTDIQVQKVTHDMRQGLHNMEDVYEL IENSHLTPKKQGQISEEYMQTVL" BASE COUNT 641 a 618 c 528 g 626 t ORIGIN 1 taatacgact cactataggc gaattaagcg atttcggagg atggagaaat agccccgagt 61 cccgtggaaa atgaggccgg cggacttgct gcagctggtg ctgctgctcg acctgcccag 121 ggacctgggc ggaatggggt gttcgtctcc accctgcgag tgccatcagg aggaggactt 181 cagagtcacc tgcaaggata ttcaacgcat ccccagctta ccgcccagta cgcagactct 241 gaagcttatt gagactcacc tgagaactat tccaagtcat gcattttcta atctgcccaa 301 tatttccaga atctacgtat ctatagatgt gactctgcag cagctggaat cacactcctt 361 ctacaatttg agtaaagtga ctcacataga aattcggaat accaggaact taacttacat 421 agaccctgat gccctcaaag agctccccct cctaaagtcc ttggcatttt caaacactgg 481 acttaaaatg ttccctgacc tgaccaaagt ttattccact gatatattct ttatacttga 541 aattacagac aacccttaca tgacgtcaat ccctgtgaat gcttttcagg gactatgcaa 601 tgaaaccttg acactgaagc tgtacaacaa tggctttact tcagtccaag gatatgattt 661 ctttgggaca aagctggatg ctgtttacct aaacaagaat aaatacctga cagttattga 721 caaagatgca tttggaggag tatacagtgg accaagcttg ctggacgtgt ctcaaaccag 781 tgtcactgcc cttccatcca aaggcctgga gcacctgaag gaactgatag caagaaacag 841 ctggactctt aagaaacttg cactttcctt gagtttcctt cacctcacac gggctgacct 901 ttcttaccca agccactgct gtgcttttaa gaatcagaag aaaatcagag gaatccttga 961 gtccttgatg tgtaatgaga gcagtatcga gacgttgcgc cagagaaaat ctgtgaatgc 1021 cttgaatagc cccctccacc aggaatatga agagaatctg ggtgacagca ttgttgggta 1081 caaggaaaag tccaagttcc aggatactca taacaacgct cattattacg tcttctttga 1141 agaacaagag gatgagatca ttggttttgg ccaggagctc aaaaaccccc aggaagagac 1201 tctacaagct tttgacagcc attatgacta caccatatgt ggggacagtg aagacatggt 1261 gtgtaccccc aagtccgatg agttcaaccc gtgtgaagac ataatgggct acaagttcct 1321 gagaattgtg gtgtggttcg ttagtctgct ggctctcctg ggcaatgtct ttgtcctgct 1381 tattctcctc accagccact acaaactgaa cgtcccccgc tttctcatgt gcaacctggc 1441 ctttgcggat ttctgcatgg ggatgtacct gctcctcatc gcctctgtag acctctacac 1501 tcactctgag tactacaacc atgccatcga ctggcagaca ggccctgggt gcaacacggc 1561 tggtttcttc actgtctttg caagcgagtt atcggtgtat acgctgacgg tcatcaccct 1621 ggagcgctgg tatgccatca ccttcgccat ggccctggac cggaagatcc gcctcaggca 1681 cgcatgtgcc atcatggttg ggggctgggt ttgctgcttc cttctcgccc tgcttccttt 1741 ggtgggaata agtagctatg ccaaagtcag tatctgcctg cccatggaca ccgagacccc 1801 tcttgctctg gcatatattg tttttgttct gacgctcaac atagttgcct tcgtcatcgt 1861 ctgctgctgt tatgtgaaga tctacatcac agtccgaaat ccgcacaacc caggggacaa 1921 agataccaaa attgccaaga ggatggctgt gttgatcttc accgacttca cgtgcatggc 1981 cccaatctca ttctatgctg tgtcagcaat tctgaacaag cctctcatca ctgttagcaa 2041 ctccaaaatc ttgctggtac tcttctatcc aattaactcc tgtgccaatc cattcctcta 2101 tgctattttc accaaggcct tccagaggga tgtgttcatc ctactcagca agtttggcat 2161 ctgtaaacgc caggctcagg cataccgggg gcagagggtt cctccaaaga acagcactga 2221 tattcaggtt caaaaggtta cccacgacat gaggcagggt ctccacaaca tggaagatgt 2281 ctatgaactg attgaaaact cccatctaac cccaaagaag caaggccaaa tctcagaaga 2341 gtatatgcaa acggttttgt aagttaacac tacactactc acaatggtag gggaacttac 2401 aaaataatag ttt // LOCUS HUMTTK 3866 bp mRNA PRI 14-JAN-1995 DEFINITION Human kinase (TTK) mRNA, complete cds. ACCESSION M86699 NID g340010 KEYWORDS binding protein; kinase; regulatory protein. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3866) AUTHORS Mills,G.B., Schmandt,R., McGill,M., Amendola,A., Hill,M., Jacobs,K., May,C., Rodricks,A.M., Campbell,S. and Hogg,D. TITLE Expression of TTK, a novel human protein kinase, is associated with cell proliferation JOURNAL J. Biol. Chem. 267 (22), 16000-16006 (1992) MEDLINE 92348472 FEATURES Location/Qualifiers source 1..3866 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="YT2C2" /cell_type="T cell" /germline 5'UTR 1..979 /note="putative" gene 1026..3551 /gene="TTK" CDS 1026..3551 /gene="TTK" /codon_start=1 /product="kinase" /db_xref="PID:g340011" /translation="MNKVRDIKNKFKNEDLTDELSLNKISADTTDNSGTVNQIMMMAN NPEDWLSLLLKLEKNSVPLSDALLNKLIGRYSQAIEALPPDKYGQNESFARIQVRFAE LKAIQEPDDARDYFQMARANCKKFAFVHISFAQFELSQGNVKKSKQLLQKAVERGAVP LEMLEIALRNLNLQKKQLLSEEEKKNLSASTVLTAQESFSGSLGHLQNRNNSCDSRGQ TTKARFLYGENMPPQDAEIGYRNSLRQTNKTKQSCPFGRVPVNLLNSPDCDVKTDDSV VPCFMKRQTSRSECRDLVVPGSKPSGNDSCELRNLKSVQNSHFKEPLVSDEKSSELII TDSITLKNKTESSLLAKLEETKEYQEPEVPESNQKQWQAKRKSECINQNPAASSNHWQ IPELARKVNTEQKHTTFEQPVFSVSKQSPPISTSKWFDPKSICKTPSSNTLDDYMSCF RTPVVKNDFPPACQLSTPYGQPACFQQQQHQILATPLQNLQVLASSSANECISVKGRI YSILKQIGSGGSSKVFQVLNEKKQIYAIKYVNLEEADNQTLDSYRNEIAYLNKLQQHS DKIIRLYDYEITDQYIYMVMECGNIDLNSWLKKKKSIDPWERKSYWKNMLEAVHTIHQ HGIVHSDLKPANFLIVDGMLKLIDFGIANQMQPDTTSVVKDSQVGTVNYMPPEAIKDM SSSRENGKSKSKISPKSDVWSLGCILYYMTYGKTPFQQIINQISKLHAIIDPNHEIEF PDIPEKDLQDVLKCCLKRDPKQRISIPELLAHPYVQIQTHPVNQMAKGTTEEMKYVLG QLVGLNSPNSILKAAKTLYEHYSGGESHNSSSSKTFEKKRGKK" misc_feature 1026..2550 /gene="TTK" /note="regulatory or binding region; putative" misc_feature 2551..3351 /gene="TTK" /note="kinase domain; putative" 3'UTR 3553..3866 /note="putative" polyA_signal 3843..3848 /note="putative" BASE COUNT 1268 a 731 c 752 g 1115 t ORIGIN 1 ggaattcctt tttttttttt tttgagatgg agtttcactc ttgttggcca ggctggagtg 61 caatggcaca atctcagctt actgcaacct ccgcctcccg ggttcaagcg attctcctgc 121 ctcagcctct caagtagctg ggattacagg catgtgccac cacccctggc taactaattt 181 cttttctatt tagtagagat ggggtttcac catgttggtc aggctggtct tgaactcctg 241 acctcaggtg atccacttgc cttggcctcc caaagtgcta ggattacagc cgtgaaactg 301 tgcctggctg attctttttt tgttgttgga tttttgaaac agggtctccc ttggtcgccc 361 aggctggagt gcagtggtgc gatcttggct cactataacc tccacctcct ggtttcaagt 421 gatcctccca ctttagcctc ctgagtagct gtgattacag gcgtgcacca ccacacccgg 481 ctaatttttg tatttttatt agagacaggg tttcaccatg ttggccaggc tgttctcaaa 541 ctcctggact caagggatcc gcctgcctcc acttcccaaa gtcccgagat tacaggtgtg 601 agtcaccatg cctgacctta taattcttaa gtcatttttt ctggtccatt tcttccttag 661 ggtcctcaca acaaatctgc attaggcggt acaataatcc ttaacttcat gattcacaaa 721 aggaagatga agtgattcat gatttagaaa ggggaagtag taagcccact gcacactcct 781 ggatgatgat cctaaatcca gatacagtaa aaatggggta tgggaaggta gaatacaaaa 841 tttggtttaa attaattatc taaatatcta aaaacatttt tggatacatt gttgatgtga 901 atgtaagact gtacagactt cctagaaaac agtttgggtt ccatcttttc atttccccag 961 tgcagttttc tgtagaaatg gaatccgagg atttaagtgg cagagaattg acaattgatt 1021 ccataatgaa caaagtgaga gacattaaaa ataagtttaa aaatgaagac cttactgatg 1081 aactaagctt gaataaaatt tctgctgata ctacagataa ctcgggaact gttaaccaaa 1141 ttatgatgat ggcaaacaac ccagaggact ggttgagttt gttgctcaaa ctagagaaaa 1201 acagtgttcc gctaagtgat gctcttttaa ataaattgat tggtcgttac agtcaagcaa 1261 ttgaagcgct tcccccagat aaatatggcc aaaatgagag ttttgctaga attcaagtga 1321 gatttgctga attaaaagct attcaagagc cagatgatgc acgtgactac tttcaaatgg 1381 ccagagcaaa ctgcaagaaa tttgcttttg ttcatatatc ttttgcacaa tttgaactgt 1441 cacaaggtaa tgtcaaaaaa agtaaacaac ttcttcaaaa agctgtagaa cgtggagcag 1501 taccactaga aatgctggaa attgccctgc ggaatttaaa cctccaaaaa aagcagctgc 1561 tttcagagga ggaaaagaag aatttatcag catctacggt attaactgcc caagaatcat 1621 tttccggttc acttgggcat ttacagaata ggaacaacag ttgtgattcc agaggacaga 1681 ctactaaagc caggttttta tatggagaga acatgccacc acaagatgca gaaataggtt 1741 accggaattc attgagacaa actaacaaaa ctaaacagtc atgcccattt ggaagagtcc 1801 cagttaacct tctaaatagc ccagattgtg atgtgaagac agatgattca gttgtacctt 1861 gttttatgaa aagacaaacc tctagatcag aatgccgaga tttggttgtg cctggatcta 1921 aaccaagtgg aaatgattcc tgtgaattaa gaaatttaaa gtctgttcaa aatagtcatt 1981 tcaaggaacc tctggtgtca gatgaaaaga gttctgaact tattattact gattcaataa 2041 ccctgaagaa taaaacggaa tcaagtcttc tagctaaatt agaagaaact aaagagtatc 2101 aagaaccaga ggttccagag agtaaccaga aacagtggca agctaagaga aagtcagagt 2161 gtattaacca gaatcctgct gcatcttcaa atcactggca gattccggag ttagcccgaa 2221 aagttaatac agagcagaaa cataccactt ttgagcaacc tgtcttttca gtttcaaaac 2281 agtcaccacc aatatcaaca tctaaatggt ttgacccaaa atctatttgt aagacaccaa 2341 gcagcaatac cttggatgat tacatgagct gttttagaac tccagttgta aagaatgact 2401 ttccacctgc ttgtcagttg tcaacacctt atggccaacc tgcctgtttc cagcagcaac 2461 agcatcaaat acttgccact ccacttcaaa atttacaggt tttagcatct tcttcagcaa 2521 atgaatgcat ttcggttaaa ggaagaattt attccatatt aaagcagata ggaagtggag 2581 gttcaagcaa ggtatttcag gtgttaaatg aaaagaaaca gatatatgct ataaaatatg 2641 tgaacttaga agaagcagat aaccaaactc ttgatagtta ccggaacgaa atagcttatt 2701 tgaataaact acaacaacac agtgataaga tcatccgact ttatgattat gaaatcacgg 2761 accagtacat ctacatggta atggagtgtg gaaatattga tcttaatagt tggcttaaaa 2821 agaaaaaatc cattgatcca tgggaacgca agagttactg gaaaaatatg ttagaggcag 2881 ttcacacaat ccatcaacat ggcattgttc acagtgatct taaaccagct aactttctga 2941 tagttgatgg aatgctaaag ctaattgatt ttgggattgc aaaccaaatg caaccagata 3001 caacaagtgt tgttaaagat tctcaggttg gcacagttaa ttatatgcca ccagaagcaa 3061 tcaaagatat gtcttcctcc agagagaatg ggaaatctaa gtcaaagata agccccaaaa 3121 gtgatgtttg gtccttagga tgtattttgt actatatgac ttacgggaaa acaccatttc 3181 agcagataat taatcagatt tctaaattac atgccataat tgatcctaat catgaaattg 3241 aatttcccga tattccagag aaagatcttc aagatgtgtt aaagtgttgt ttaaaaaggg 3301 acccaaaaca gaggatatcc attcctgagc tcctggctca tccatatgtt caaattcaaa 3361 ctcatccagt taaccaaatg gccaagggaa ccactgaaga aatgaaatat gttctgggcc 3421 aacttgttgg tctgaattct cctaactcca ttttgaaagc tgctaaaact ttatatgaac 3481 actatagtgg tggtgaaagt cataattctt catcctccaa gacttttgaa aaaaaaaggg 3541 gaaaaaaatg atttgcagtt attcgtaatg tcagatagga ggtataaaat atattggact 3601 gttatactct tgaatccctg tggaaatcta catttgaaga caacatcact ctgaagtgtt 3661 atcagcaaaa aaaattcagt gagattatct ttaaaagaaa actgtaaaaa tagcaaccac 3721 ttatggcact gtatatattg tagacttgtt ttctctgttt tatgctcttg tgtaatctac 3781 ttgacatcat tttactcttg gaatagtggg tggatagcaa gtatattcta aaaaactttg 3841 taaataaagt tttgtggcta aaatga // LOCUS HUMTYRKINA 2564 bp mRNA PRI 22-AUG-1995 DEFINITION Human tyrosine kinase (TXK) mRNA, complete cds. ACCESSION L27071 NID g951045 KEYWORDS cytoplasmic protein; tyrosine kinase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2564) AUTHORS Haire,R.N., Ohta,Y., Lewis,J.E., Fu,S.M., Kroisel,P. and Litman,G.W. TITLE TXK, a novel human tyrosine kinase expressed in T cells shares sequence identity with Tec family kinases and maps to 4p12 JOURNAL Hum. Mol. Genet. 3 (6), 897-901 (1994) MEDLINE 95038742 REFERENCE 2 (bases 1 to 2564) AUTHORS Litman,G.W. TITLE Direct Submission JOURNAL Submitted (08-JUN-1994) Gary W. Litman, All Children's Hospital, 801 6th Street South, St. Petersburg, FL 33701, USA FEATURES Location/Qualifiers source 1..2564 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="mononuclear" /dev_stage="adult" /tissue_type="blood" /map="4p12" /chromosome="4" 5'UTR 1..86 gene 87..1670 /gene="TXK" CDS 87..1670 /gene="TXK" /codon_start=1 /product="tyrosine kinase" /db_xref="PID:g684986" /translation="MILSSYNTIQSVFCCCCCCSVQKRQMRTQISLSTDEELPEKYTQ HRRPWLSQLSNKKQSNTGRVQPSKRKPLPPLPPSEVAEEKIQVKALYDFLPREPCNLA LRRAEEYLILEKYNPHWWKARDRLGNEGLIPSNYVTENKITNLEIYEWYHRNITRNQA EHLLRQESKEGAFIVRDSRHLGSYTISVFMGARRSTEAAIKHYQIKKNDSGQWYVAER HAFQSIPELIWYHQHNAAGLMTRLRYPVGLMGSCLPATAGFSYEKWEIDPSELAFIKE IGSGQFGVVHLGEWRSHIQVAIKAINEGSMSEEDFIEEAKVMMKLSHSKLVQLYGVCI QRKPLYIVTEFMENGCLLNYLRENKGKLRKEMLLSVCQDICEGMEYLERNGYIHRDLA ARNCLVSSTCIVKISDFGMTRYVLDDEYVSSFGAKFPIKWSPPEVFLFNKYSSKSDVW SFGVLMWEVFTEGKMPFENKSNLQVVEAISEGFRLYRPHLAPMSIYEVMYSCWHEKPE GRPTFAELLRAVTEIAETW" 3'UTR 1671..2564 BASE COUNT 775 a 486 c 570 g 733 t ORIGIN 1 gatttcagtt gaaagatgtg tttttgtgag tagagcaccg cagaagaact gaagactgtt 61 gtgtgctccc cgcagaaggg gctaccatga tcctttcctc ctataacacc atccagtcgg 121 ttttctgttg ctgctgttgc tgttcagtgc agaagcgaca aatgagaaca cagataagcc 181 tgagcacaga tgaagagctt ccagaaaaat acacccagca tcgcaggccg tggctcagcc 241 aattgtcaaa taagaagcaa tccaacacgg gccgtgtgca gccgtcaaaa cgaaagccac 301 tgcctcccct cccaccctct gaggttgctg aagagaagat ccaagtcaag gcactttatg 361 attttctgcc cagagaaccc tgtaatttag ccttaaggag agcagaagaa tacctgatac 421 tggagaaata caatcctcac tggtggaagg caagagaccg tttggggaat gaaggcttaa 481 tcccaagcaa ctatgtgact gaaaacaaaa taactaattt agaaatatat gagtggtacc 541 atagaaacat taccagaaat caggcagaac atctattgag acaagagtct aaagaaggtg 601 catttattgt cagagattca agacatttag gatcctacac aatttccgta tttatgggag 661 ctagaagaag tacggaggct gccataaaac attatcagat aaaaaagaat gactcaggac 721 agtggtatgt ggctgaaaga cacgcctttc aatcaatccc tgagttaatc tggtatcacc 781 agcacaatgc agccggtctc atgactcgtc tccgatatcc agttgggctg atgggcagtt 841 gtttaccagc cacagctggg tttagctacg aaaagtggga gatagatcca tctgagttgg 901 cttttataaa ggagattgga agcggtcagt ttggagtggt ccatttaggt gaatggcggt 961 cacatatcca ggtagctatc aaggccatca atgaaggctc catgtctgaa gaggatttca 1021 ttgaagaggc caaagtgatg atgaaattat ctcattcaaa gctagtgcaa ctttatggag 1081 tctgtataca gcggaagccc ctttacattg tgacagagtt catggaaaat ggctgcctgc 1141 ttaactatct cagggagaat aaaggaaagc ttaggaagga aatgctactg agtgtatgcc 1201 aggatatatg tgaaggaatg gaatatctgg agaggaatgg ctatattcat agggatttgg 1261 cggcaaggaa ttgtttggtc agttcaacat gcatagtaaa aatttcagac tttggaatga 1321 caaggtacgt tttggatgat gagtatgtca gttcttttgg agccaagttc ccaatcaagt 1381 ggtcccctcc tgaagttttt cttttcaata agtacagcag taaatctgat gtctggtcat 1441 ttggagtttt aatgtgggaa gtttttacag aaggaaaaat gccttttgaa aataagtcaa 1501 atttgcaagt cgtggaagct atttctgaag gcttcaggct atatcgccct cacctggcac 1561 caatgtccat atatgaagtc atgtacagct gctggcatga gaaacctgaa ggccgcccta 1621 catttgcgga gctgctgcgg gctgtcacag agattgcgga aacctggtga ccggaaacag 1681 aatgccaacc caaagagtca tcttgcaaaa ctgtcattta ttgtgaatat cttcaccata 1741 tggggtcact tatggtgaat atctttcttc agagttgctg actcttgaaa acagtgcaaa 1801 gatcacagtt tttaaaagtt ttaaaaattt aagaatattc acacaatcgt ttttctatgt 1861 gtgagaggga tttgcacact cttatttttc tgtaaaatat ttcacatccc aaatgtgaag 1921 aagtgaaaaa gacttcgcag cagtcttcat tgtggtgctc ttcatgatca tagccccagg 1981 aacccttgag gttcttcttc acaaggctga gagtgcttcc ttcttgaaga cgagtgtcat 2041 tcatcacttc agtgatccat gcatagaata tgaaaataaa ttcttccaac tcatgggata 2101 aaggggactc ccttgaagaa tttcatgttt ttgggctgta tagctcttta cagaaaatgc 2161 acctttataa atcacatgaa tgttagtatt ctggaaatgt cttttgttaa tataatcttc 2221 ccatgttatt taacaaattg tttttgcaca tatctgatta tattgaaagc agtttttttg 2281 cattcgagtt ttaaacactg ttataaaatg tagccaaagc tcacctttga acagatcccg 2341 gtgacattct atttccagga aaatccggaa cctgatttta gttctgtgat tttacacttt 2401 ttacatgtga gattggacag tttcagaggc cttattttgt catactaagt gtctcctgta 2461 attttcagga agatgatttg ttctttccag aagaggagac aaaagcaaga tagccaaatg 2521 tgacatcaag ctccattgtt tcggaaatcc aggattttga attc // LOCUS HUMTYRM 1929 bp mRNA PRI 14-JAN-1995 DEFINITION H.sapien tyrosinase and mutant tyrosinase, complete cds. ACCESSION M74314 NID g340039 KEYWORDS tyrosinase. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1929) AUTHORS Chintamaneni,C.D., Halaban,R., Kobayashi,Y., Witkop,C.J. Jr. and Kwon,B.S. TITLE A single base insertion in the putative transmembrane domain of the tyrosinase gene as a cause for tyrosinase-negative oculocutaneous albinism JOURNAL Proc. Natl. Acad. Sci. U.S.A. 88 (12), 5272-5276 (1991) MEDLINE 91271371 FEATURES Location/Qualifiers source 1..1929 /organism="Homo sapiens" /db_xref="taxon:9606" /map="11q14-q21" mRNA 1..1929 /partial gene 32..1636 /gene="TYR" CDS 32..1636 /gene="TYR" /codon_start=1 /db_xref="GDB:G00-120-476" /product="tyrosinase" /db_xref="PID:g340040" /translation="MLLAVLYCLLWSFQTSAGHFPRACVSSKNLMEKECCPPWSGDRS PCGQLSGRGSCQNILLSNAPLGPQFPFTGVDDRESWPSVFYNRTCQCSGNFMGFNCGN CKFGFWGPNCTERRLLVRRNIFDLSAPEKDKFFAYLTLAKHTISSDYVIPIGTYGQMK NGSTPMFNDINIYDLFVWMHIYYVSMDALLGGYEIWRDIDFSAHEAPAFLPWHRLFLL RWEQEIQKLTGDENFTIPYWDWRDAEKCDICTDEYMGGQHPTNPNLLSPASFFSSWQI VCSRLEEYNSHQSLCNGTPEGPLRRNPGNHDKSTTPRLPSSADVEFRCLSLTQYESGS MDKAANFSFRNTLEGFASPLTGIADASQSSMHNALHIYMNGTMSQVQGSANDPIFLLH HAFVDSIFEQWLQRHRPLQEVYPEANAPIGHRNRESYMVPFIPLYRNGDFFISSKDLG YDYSYLQDSDPDSFQDYIKSYLEQASRIWSWLLGAAMVGAVLTALLAGPVSLLCLRHK RKQLPEEKQPLLMEKEDYHSLYQSHL" mutation 1511..1512 /gene="TYR" /note="G00-120-476" BASE COUNT 512 a 457 c 433 g 527 t ORIGIN 1 tcctgcagac cttgtgagga ctagaggaag aatgctcctg gctgttttgt actgcctgct 61 gtggagtttc cagacctccg ctggccattt ccctagagcc tgtgtctcct ctaagaacct 121 gatggagaag gaatgctgtc caccgtggag cggggacagg agtccctgtg gccagctttc 181 aggcagaggt tcctgtcaga atatccttct gtccaatgca ccacttgggc ctcaatttcc 241 cttcacaggg gtggatgacc gggagtcgtg gccttccgtc ttttataata ggacctgcca 301 gtgctctggc aacttcatgg gattcaactg tggaaactgc aagtttggct tttggggacc 361 aaactgcaca gagagacgac tcttggtgag aagaaacatc ttcgatttga gtgccccaga 421 gaaggacaaa ttttttgcct acctcacttt agcaaagcat accatcagct cagactatgt 481 catccccata gggacctatg gccaaatgaa aaatggatca acacccatgt ttaacgacat 541 caatatttat gacctctttg tctggatgca tatatattat gtgtcaatgg atgcactgct 601 tgggggatat gaaatctgga gagacattga tttttctgcc catgaagcac cagcttttct 661 gccttggcat agactcttct tgttgcggtg ggaacaagaa atccagaagc tgacaggaga 721 tgaaaacttc actattccat attgggactg gcgggatgca gaaaagtgtg acatttgcac 781 agatgagtac atgggaggtc agcaccccac aaatcctaac ttactcagcc cagcatcatt 841 cttctcctct tggcagattg tctgtagccg attggaggag tacaacagcc atcagtcttt 901 atgcaatgga acgcccgagg gacctttacg gcgtaatcct ggaaaccatg acaaatccac 961 aaccccaagg ctcccctctt cagctgatgt agaatttaga tgcctgagtt tgacccaata 1021 tgaatctggt tccatggata aagctgccaa tttcagcttt agaaatacac tggaaggatt 1081 tgctagtcca cttactggga tagcggatgc ctctcaaagc agcatgcaca atgccttgca 1141 catctatatg aatggaacaa tgtcccaggt acagggatct gccaacgatc ctatcttcct 1201 tcttcaccat gcatttgttg acagtatttt tgagcagtgg ctccaaaggc accgtcctct 1261 tcaagaagtt tatccagaag ccaatgcacc cattggacat cgaaaccggg aatcctacat 1321 ggttcctttt ataccactgt acagaaatgg tgatttcttt atttcatcca aagatctggg 1381 ctatgactat agctatctac aagattcaga cccagactct tttcaagact acattaagtc 1441 ctatttggaa caagcgagtc ggatctggtc atggctcctt ggggcggcga tggtaggggc 1501 cgtcctcact gccctgctgg cagggcctgt gagcttgctg tgtcttcgtc acaagagaaa 1561 gcagcttcct gaagaaaagc agccactcct catggagaaa gaggattacc acagcttgta 1621 tcagagccat ttataaaaag gcttaggcaa tagagtaggg ccaaaaagcc tgacctcact 1681 ctaactcaaa gtaatgtcca ggttcccaga gaatatctgc tggtattttc tgtaaagacc 1741 atttgcaaaa ttgtaaccta atacaaagtg tagccttctt ccaactcagg tagaacacac 1801 ctgtctttgt cttgctgttt tcactcagcc cttttaacat tttcccctaa gcccatatgt 1861 ctaaggaaag gatgctattt ggtaatgagg aactgttatt tgtatgtgaa ttaaaagtgc 1921 tcttatttt // LOCUS HUMUCHL3A 802 bp mRNA PRI 15-MAR-1990 DEFINITION Human ubiquitin carboxyl-terminal hydrolase (PGP 9.5, UCH-L3) isozyme L3 mRNA, complete cds. ACCESSION M30496 NID g340073 KEYWORDS neuron-specific protein; ubiquitin carboxyl-terminal hydrolase. SOURCE Human B cell, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 802) AUTHORS Wilkinson,K.D., Lee,K., Deshpande,S., Duerksen-Hughes,P., Boss,J.M. and Pohl,J. TITLE The neuron-specific protein PGP 9.5 is a ubiquitin carboxyl-terminal hydrolase JOURNAL Science 246, 670-673 (1989) MEDLINE 90049185 FEATURES Location/Qualifiers source 1..802 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 21..713 /note="ubiquitin carboxyl-terminal hydrolase" /codon_start=1 /db_xref="PID:g340074" /translation="MEGQRWLPLEANPEVTNQFLKQLGLHPNWQFVDVYGMDPELLSM VPRPVCAVLLLFPITEKYEVFRTEEEEKIKSQGQDVTSSVYFMKQTISNACGTIGLIH AIANNKDKMHFESGSTLKKFLEESVSMSPEERARYLENYDAIRVTHETSAHEGQTEAP SIDEKVDLHFIALVHVDGHLYELDGRKPFPINHGETSDETLLEDAIEVCKKFMERDPD ELRFNAIALSAA" BASE COUNT 259 a 159 c 172 g 212 t ORIGIN 1 ggagggccgg gcaccgcggc atggagggtc aacgctggct gccgctggag gccaatcccg 61 aggtcaccaa ccagtttctt aaacaattag gtctacatcc taactggcaa ttcgttgatg 121 tatatggaat ggatcctgaa ctccttagca tggtaccaag accagtctgt gcagtcttac 181 ttctctttcc tattacagaa aagtatgaag tattcagaac agaagaggaa gaaaaaataa 241 aatctcaggg acaagatgtt acatcatcag tatatttcat gaagcaaaca atcagcaatg 301 cctgtggaac aattggactg attcatgcta ttgcaaacaa taaagacaag atgcactttg 361 aatctggatc aaccttgaaa aaattcctgg aggaatctgt gtcaatgagc cctgaagaac 421 gagccagata cctggagaac tatgatgcca tccgagttac tcatgagacc agtgcccatg 481 aaggtcagac tgaggcacca agtatagatg agaaagtaga tcttcatttt attgcattag 541 ttcatgtaga tgggcatctc tatgaattag atgggcggaa gccatttcca attaaccatg 601 gtgaaactag tgatgaaact ttattagagg atgccataga agtttgcaag aagtttatgg 661 agcgcgaccc tgatgaacta agatttaatg cgattgctct ttctgcagca tagcttgtca 721 ataatggaaa caccaaaaac tgtattattt gcaactaaat tttctctgcc catacactaa 781 ctcaaaaatt ttgatatttt cc // LOCUS HUMUDPG 1062 bp mRNA PRI 15-JUN-1990 DEFINITION Human histo-blood group A transferase (UDP-GalNAc) mRNA, complete cds. ACCESSION J05175 NID g340077 KEYWORDS histo-blood group A transferase. SOURCE Human stomach endothelium cell line MKN45, cDNA to mRNA, clone FY-59-5. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1062) AUTHORS Yamamoto,F.-I., Marken,J.S., Tsuji,T., White,T., Clausen,H. and Hakomori,S.-I. TITLE Cloning and characterization of DNA complementary to human UDP-GalNAc: Fuc-alpha-1-->2Gal-alpha-1-->3GalNAc transferase (histo-blood group A transferase) mRNA JOURNAL J. Biol. Chem. 265, 1146-1151 (1989) MEDLINE 90110098 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.-i.Yamamoto, 20-NOV-1989. FEATURES Location/Qualifiers source 1..1062 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 1..1062 /note="histo-blood group A transferase (UDP-GalNAc)" /codon_start=1 /db_xref="PID:g340078" /translation="MAEVLRTLAGKPKCHALRPMILFLIMLVLVLFGYGVLSPRSLMP GSLERGFCMAVREPDHLQRVSLPRMVYPQPKVLTPWKDVLVVTPWLAPIVWEGTFNID ILNEQFRLQNTTIGLTVFAIKKYVAFLKLFLETAEKHFMVGHRVHYYVFTDQLAAVPR VTLGTGRQLSVLEVRAYKRWQDVSMRRMEMISDFCERRFLSEVDYLVCVDVDMEFRDH VGVEILTPLFGTLHPGFYGSSREAFTYERRPQSQAYIPKDEGDFYYLGGFFGGSVQEV QRLTRACHQAMMVDQANGIEAVWHDESHLNKYLLRHKPTKVLSPEYLWDQQLLGWPAV LRKLRFTAVPKNHQAVRNP" BASE COUNT 203 a 320 c 331 g 208 t ORIGIN Chromosome 9q34. 1 atggccgagg tgttgcggac gctggccgga aaaccaaaat gccacgcact tcgacctatg 61 atccttttcc taataatgct tgtcttggtc ttgtttggtt acggggtcct aagccccaga 121 agtctaatgc caggaagcct ggaacggggg ttctgcatgg ctgttaggga acctgaccat 181 ctgcagcgcg tctcgttgcc aaggatggtc tacccccagc caaaggtgct gacaccgtgg 241 aaggatgtcc tcgtggtgac cccttggctg gctcccattg tctgggaggg cacattcaac 301 atcgacatcc tcaacgagca gttcaggctc cagaacacca ccattgggtt aactgtgttt 361 gccatcaaga aatacgtggc tttcctgaag ctgttcctgg agacggcgga gaagcacttc 421 atggtgggcc accgtgtcca ctactatgtc ttcaccgacc agctggccgc ggtgccccgc 481 gtgacgctgg ggaccggtcg gcagctgtca gtgctggagg tgcgcgccta caagcgctgg 541 caggacgtgt ccatgcgccg catggagatg atcagtgact tctgcgagcg gcgcttcctc 601 agcgaggtgg attacctggt gtgcgtggac gtggacatgg agttccgcga ccacgtgggc 661 gtggagatcc tgactccgct gttcggcacc ctgcaccccg gcttctacgg aagcagccgg 721 gaggccttca cctacgagcg ccggccccag tcccaggcct acatccccaa ggacgagggc 781 gatttctact acctgggggg gttcttcggg gggtcggtgc aagaggtgca gcggctcacc 841 agggcctgcc accaggccat gatggtcgac caggccaacg gcatcgaggc cgtgtggcac 901 gacgagagcc acctgaacaa gtacctgctg cgccacaaac ccaccaaggt gctctccccc 961 gagtacttgt gggaccagca gctgctgggc tggcccgccg tcctgaggaa gctgaggttc 1021 actgcggtgc ccaagaacca ccaggcggtc cggaacccgt ga // LOCUS HUMUDPGPYR 1992 bp mRNA PRI 24-SEP-1993 DEFINITION Human UDP-glucose pyrophosphorylase mRNA, complete cds and flanking regions. ACCESSION L14430 NID g292873 KEYWORDS UDP-glucose pyrophosphorylase. SOURCE Homo sapiens liver cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1992) AUTHORS Peng,H.L. and Chang,H.Y. TITLE Cloning of a human liver UDP-glucose pyrophosphorylase cDNA by complementation of the bacterial galU mutation JOURNAL Febs 329, 153-158 (1993) FEATURES Location/Qualifiers source 1..1992 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" 5'UTR 1..145 CDS 146..1672 /EC_number="2.7.7.9" /codon_start=1 /function="synthesis of UDPglc from glc-1-p" /product="UDP-glucose pyrophosphorylase" /db_xref="PID:g292874" /translation="MSRFVQDLSKAMSQDGASQFQEVILQELELSVKKELEKILTTAT SHEYEHTKKDLDGFRKLYHRFLQEKGPSVDWGKIQRPPEDSIQPYEKIKARGLPDNIS SVLNKLVVVKLNGGLGTSMGCKGPKSLIGVRNENTFLDLTVQQIEHLNKSYNTDVPLV LMNSFNTDEDTKKILQKYNHCRVKIYTFNQSRYPRINKESLRPVAKDVSSSGESTEAW YPPGHGDIYASFYNSGLLDTFLEEGKEYIFVSNIDNLGATVDLYILNHLINPPNGKRC EFVMEVTNKTRADVKGGTLTQYEGKLRLVEIAQVPKAHVDEFKSVSKFKIFNTNNLWI SLAAVKRLQEQNAIDMEIIVNPKTLDGGLNVIQLETAVGAAIKSFENSLGINVPRSRF LPVKTTSDLLLVMSNLYSLNAGSLTMSEKREFPTVPLVKLGSSFTKVQDYLRRFESIP DMLELDHLTVSGDVTFGKNVSLKGTVIIIANHGDRIDIPPGAVLENKIVSGNLRILDH " 3'UTR 1673..1992 BASE COUNT 654 a 352 c 424 g 562 t ORIGIN 1 ggcacgagaa tagttcctta agtagctaac catcggagga gaaagaacac atcggttgtt 61 gcttgaaagg aagggagacc ttccttataa cgcaactccg agaaattgaa aattataaac 121 cagattattt taatataacc cgaaaatgtc gagatttgtg caagacctta gcaaagctat 181 gtctcaagat ggtgcttctc agttccaaga agtcattctc caagaactag aattatctgt 241 gaagaaagaa ttagaaaaaa tacttacaac agcaacctca catgagtatg agcatactaa 301 aaaagatctt gatggatttc ggaagctata tcatagattt ttgcaagaaa agggaccttc 361 tgtagactgg ggtaaaatcc agagacctcc agaagattcg attcaaccct atgaaaagat 421 aaaggccaga ggcctgcctg ataatatatc ttctgtgttg aacaaactgg tggtggtgaa 481 actcaatggt ggtttgggaa ccagtatggg ctgcaaaggc cctaaaagtc tgattggtgt 541 aagaaatgag aatacctttt tggatctaac tgtgcagcaa attgaacatt tgaacaaaag 601 ctacaataca gatgtccctc ttgttttaat gaactctttt aacacggatg aagacacaaa 661 aaaaatactt cagaagtaca atcattgccg tgtgaaaatc tacaccttca atcaaagcag 721 gtatccgagg attaataagg aatctttacg gcctgtagca aaggatgtgt cttcctcagg 781 ggaaagtaca gaagcttggt accccccagg acatggagat atctatgcta gtttctacaa 841 ctcgggcttg ctcgacacct ttctagaaga aggcaaagag tatatttttg tttctaacat 901 agataacctg ggtgccacag tggatcttta tattcttaat catctaatca acccacccaa 961 tgggaaacgc tgtgaatttg tcatggaagt cacaaataaa acacgagcag atgtaaaggg 1021 tggaacactc actcaatatg aaggcaaact gagactggtg gagattgctc aagtgccaaa 1081 agcacatgtt gatgaattca agtctgtgtc aaaatttaaa atatttaaca caaacaacct 1141 atggatctct cttgcagcag ttaaaagact gcaggagcag aatgccattg acatggaaat 1201 cattgtgaat ccaaagactt tggatggagg cctgaatgtt attcagttag aaactgcagt 1261 tggagctgca attaaaagtt tcgagaattc tttaggtatt aatgttccaa ggagtcgttt 1321 tctgcctgtg aagaccacgt cagatctctt gcttgtgatg tcaaacctct atagccttaa 1381 tgcaggatct ttgaccatga gtgaaaagcg ggagtttcct acagtaccct tggttaaatt 1441 aggaagttcc tttaccaagg ttcaagatta cctaaggagg tttgaaagta tacctgacat 1501 gctggaattg gatcacctca ctgtttctgg agatgtgaca tttggaaaaa atgtttcatt 1561 aaagggaaca gttatcatta ttgcaaatca tggtgacaga attgacatcc cccctggagc 1621 agtgttagag aacaagatcg tatctggaaa tcttcgaatc ttggatcact aagataagcg 1681 ctgccggata cactttatac taattatggg ttaaacagtt tcttgtaata aaatgtcctc 1741 taagattctc aaatgagcaa gtacttttac tgtgcagtgt tgatttttta gagttttctg 1801 caatgtgctt ttagcctaaa gagaagatag atggagcagt actgtccttt tttgatgaag 1861 agcctagaga tgagttcatc taaaagtgca atattattta atcgtagaac tgggccagct 1921 cggcaatctt ttaacagaag ctggagtgat ggtctttgat tgcctttgat tttaaaaata 1981 aagtgataca ac // LOCUS HUMUG2BA 1015 bp mRNA PRI 31-AUG-1987 DEFINITION Human U2 small nuclear RNA-associated B'' antigen mRNA, complete cds. ACCESSION M15841 NID g340104 KEYWORDS U2 small nuclear RNA; antigen; ribonucleoprotein; small nuclear RNA; small nuclear RNA protein. SOURCE Human cDNA to mRNA clone lambda-HB''-1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1015) AUTHORS Habets,W.J., Sillekens,P.T.G., Hoet,M.H., Schalken,J.A., Roebroek,A.J.M., Leunissen,J.A.M., Van de Ven,W.J. and van Venrooij,W.J. TITLE Analysis of a cDNA clone expressing a human autoimmune antigen: Full-length sequence of the U2 small nuclear RNA-associated B'' antigen JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 2421-2425 (1987) MEDLINE 87175685 COMMENT Draft entry and computer-readable copy of sequence in [1] kindly provided by W.J.Habets, 26-MAY-1987. There are two possible peptide intiation sites at 126-128 and 165-167. Although the exact point of initiation was not confirmed, 'the first ATG codon has a hihger probability to be used'. A polyadenylation signal is located at nucleotides 992-997. FEATURES Location/Qualifiers source 1..1015 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 126..803 /note="U2 small nuclear ribonucleoprotein B''" /codon_start=1 /db_xref="PID:g340105" /translation="MDIRPNHTIYINNMNDKIKKEELKRSLYALFSQFGHVVDIVALK TMKMRGQAFVIFKELGSSTNALRQLQGFPFYGKPMRIQYAKTDSDIISKMRGTFADKE KKKEKKKAKTVEQTATTTNKKPGQGTPNSANTQGNSTPNPQVPDYPPNYILFLNNLPE ETNEMMLSMLFNQFPGFKEVRLVPGRHDIAFVEFENDGQAGAARDALQGFKITPSHAM KITYAKK" BASE COUNT 323 a 189 c 212 g 291 t ORIGIN 1 gcgccttcta cctcgctgtt tcggttttcc tggctcctcg gcccttttct cccctgttgc 61 agctgggagc ggacgaagcg cgaagctggg attttttact gtctcctgaa gaatttaaca 121 caaacatgga tatcagacca aatcatacaa tttatatcaa caatatgaat gacaaaatta 181 aaaaggaaga attgaagaga tccctatatg ccctgttttc tcagtttggt catgtggtgg 241 acattgtggc tttaaagacc atgaagatga gggggcaggc ctttgtcata tttaaggaac 301 tgggctcatc cacaaatgcc ttgagacagc tacaaggatt tccattttat ggtaaaccaa 361 tgcgaataca gtatgcaaaa acagattcgg atataatatc aaaaatgcgt ggaacttttg 421 ctgacaaaga aaagaaaaaa gaaaagaaaa aagccaaaac tgtggaacag actgcaacaa 481 ccacaaacaa aaagcctggc cagggaactc caaattcagc taatacccaa ggaaattcaa 541 caccaaatcc tcaggtccct gattaccctc caaactatat tttattcctt aataacttac 601 cagaagagac taatgagatg atgttatcca tgctgtttaa tcagttccct ggcttcaagg 661 aagtacgtct ggtaccaggg aggcatgaca ttgcttttgt tgaatttgaa aatgatgggc 721 aggctggagc tgccagggat gctttacagg gatttaagat cacaccgtcc catgctatga 781 agatcaccta tgccaagaaa taacatttgg gatagtcgtc tttaaaagac ttggtgttat 841 ttacagtgtt tgttttgata acatttggct gggtcatttt aatagttaga gatgaggagg 901 agtaaaagtg aaatttttgt gaaggactta aattatccag tgtttcttta gccttggtga 961 actatgaaat acgaaggcct taattttgta caataaactt ttatttgtat tctgt // LOCUS HUMUKATP1A 1747 bp mRNA PRI 27-MAR-1997 DEFINITION Human mRNA for uKATP-1, complete cds. ACCESSION D50312 NID g1109633 KEYWORDS KCNJ8; uKATP-1. SOURCE Homo sapiens (isolate:caucasian) lung cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1747) AUTHORS Inagaki,N. TITLE Direct Submission JOURNAL Submitted (18-APR-1995) to the DDBJ/EMBL/GenBank databases. Nobuya Inagaki, Chiba University School of Medicine, Center for Biomedical Science; 1-8-1 Inohana, Chuo-ku, Chiba, Chiba 260, Japan (E-mail:inagaki@med.m.chiba-u.ac.jp, Tel:043-222-7171(ex.5512), Fax:043-226-2191) REFERENCE 2 (bases 1 to 1747) AUTHORS Inagaki,N., Inazawa,J. and Seino,S. TITLE cDNA sequence, gene structure, and chromosomal localization of the human ATP-sensitive potassium channel, uKATP-1, gene (KCNJ8) JOURNAL Genomics 30 (1), 102-104 (1995) MEDLINE 96129311 FEATURES Location/Qualifiers source 1..1747 /organism="Homo sapiens" /isolate="caucasian" /db_xref="taxon:9606" /tissue_type="lung" gene 271..1545 /gene="KCNJ8" CDS 271..1545 /gene="KCNJ8" /codon_start=1 /product="uKATP-1" /db_xref="PID:d1009484" /db_xref="PID:g1109634" /translation="MLARKSIIPEEYVLARIAAENLRKPRIRDRLPKARFIAKSGACN LAHKNIREQGRFLQDIFTTLVDLKWRHTLVIFTMSFLCSWLLFAIMWWLVAFAHGDIY AYMEKSGMEKSGLESTVCVTNVRSFTSAFLFSIEVQVTIGFGGRMMTEECPLAITVLI LQNIVGLIINAVMLGCIFMKTAQAHRRAETLIFSRHAVIAVRNGKLCFMFRVGDLRKS MIISASVRIQVVKKTTTPEGEVVPIHQLDIPVDNPIESNNIFLVAPLIICHVIDKRSP LYDISATDLANQDLEVIVILEGVVETTGITTQARTSYIAEEIQWGHRFVSIVTEEEGV YSVDYSKFGNTVKVAAPRCSARELDEKPSILIQTLQKSELSHQNSLRKRNSMRRNNSM RRNNSIRRNNSSLMVPKVQFMTPEGNQNTSES" BASE COUNT 438 a 438 c 442 g 429 t ORIGIN 1 cctctagccc tccctgcgtt tagattcagt tgcacctttt attattttaa ctcttctcct 61 taggacacgc agcccccaat ttgtccctcc gcctgggcgg cccctggtcc cgcgcgccag 121 catgggagag cgagggacct gcccgcggcc cgccggcgtg tgcaaggagg tccagccgcc 181 gcgcccgcta cccggagtct gaggacgggt gtccagggac ggagaggcag gtgagaggga 241 ggtggctaag ctggctatgg tgacaggacg atgttggcca gaaagagtat catcccggag 301 gagtatgtgc tggcgcgcat cgccgcagag aacctgcgca agccgcgcat ccgagaccgc 361 ctccccaaag cccgcttcat cgccaagagc ggggcctgca acctggcgca taagaacatc 421 cgtgagcaag gacgctttct acaggacatc ttcaccacct tggtggacct gaaatggcgc 481 cacacgctgg tcatctttac catgtccttc ctctgcagct ggctgctctt cgctatcatg 541 tggtggctgg tggcctttgc ccatggggac atctatgctt acatggagaa aagtggaatg 601 gagaaaagtg gtttggagtc cactgtgtgt gtgactaatg tcaggtcttt cacttctgct 661 tttctcttct ccattgaagt tcaagttacc attgggtttg gagggaggat gatgacagag 721 gaatgccctt tggccatcac ggttttgatt ctccagaata ttgtgggttt gatcatcaat 781 gcagtcatgt taggctgcat tttcatgaaa acagctcagg ctcacagaag ggcagaaact 841 ttgattttca gccgccatgc tgtgattgcc gtccgaaatg gcaagctgtg cttcatgttc 901 cgagtgggtg acctgaggaa aagcatgatc attagtgcct ctgtgcgcat ccaggtggtc 961 aagaaaacaa ctacacctga aggggaggtg gttcctattc accaactgga cattcctgtt 1021 gataacccaa tcgagagcaa taacattttt ctggtggccc ctttgatcat ctgccacgtg 1081 attgacaagc gcagtcccct gtatgacatc tcagcaactg acctggccaa ccaagacttg 1141 gaggtcatag ttattctgga aggagtggtt gaaactactg gcatcaccac acaagcacga 1201 acctcctaca ttgctgagga gatccaatgg ggccaccgct ttgtgtccat tgtgactgag 1261 gaagaaggag tgtattctgt ggattactcc aaatttggca acactgttaa agtagctgct 1321 ccacggtgca gtgcccgaga gctggatgag aaaccttcca tccttattca gaccctccaa 1381 aagagtgaac tgtctcatca aaattctctg aggaagcgca actccatgag aagaaacaat 1441 tccatgagga ggaacaattc tatccgaagg aacaattctt ccctcatggt accaaaggtg 1501 caatttatga ctccagaagg aaatcaaaac acatcggaat catgacagca agataacccc 1561 aagacagtct tttatcaagt tttgacggtt tatgctgggc actgccagac tgaaccagag 1621 ctggaacaca ataatgtgtc cttctacatt ttattacact aaatgatatt catattcaag 1681 caacagcact ttctgtagta ataaaaagta acacaccaag tggaggattc atggcttatg 1741 cttgttt // LOCUS HUMULP 605 bp mRNA PRI 18-MAR-1994 DEFINITION Human mRNA for ubiquitin-like protein, complete cds. ACCESSION D23662 NID g432362 KEYWORDS ubiquitin-like protein. SOURCE Homo sapiens cDNA to mRNA, clone HP00346. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 605) AUTHORS Kato,S. TITLE Cloning of human ubiquitin-like protein cDNA JOURNAL Unpublished (1994) REFERENCE 2 (bases 1 to 605) AUTHORS Kato,S. TITLE Direct Submission JOURNAL Submitted (02-NOV-1993) to the DDBJ/EMBL/GenBank databases. Seishi Kato, Sagami Chemical Research Center, Genetic Engineering Section; 4-4-1 Nishi-Ohnuma, Sagamihara, Kanagawa 229, Japan (E-mail:btn00121@biotechnet.com, Tel:0427-42-4791(ex.415), Fax:0427-49-7631) COMMENT Submitted (02-Nov-1993) to DDBJ by: Seishi Kato Genetic Engineering Section Sagami Chemical Research Center 4-4-1 Nishi-Ohnuma, Sagamihara Kanagawa 229 Japan Phone: 0427-42-4791 x415 Fax: 0427-49-7631. FEATURES Location/Qualifiers source 1..605 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HT-1080" /clone_lib="HT-1080/pKA1" /tissue_type="fibrosarcoma" 5'UTR 1..99 CDS 100..345 /codon_start=1 /product="ubiquitin-like protein" /db_xref="PID:d1005427" /db_xref="PID:g461287" /translation="MLIKVKTLTGKEIEIDIEPTDKVERIKERVEEKEGIPPQQQRLI YSGKQMNDEKTAADYKILGGSVLHLVLALRGGGGLRQ" 3'UTR 346..605 polyA_signal 588..593 BASE COUNT 148 a 140 c 174 g 143 t ORIGIN 1 gaagtggccc ttgcaggcaa gagtgctgga gggcggcagc ggcgaccgga gcggtaggag 61 cagcaattta tccgtgtgca gccccaaact ggaaagaaga tgctaattaa agtgaagacg 121 ctgaccggaa aggagattga gattgacatt gaacctacag acaaggtgga gcgaatcaag 181 gagcgtgtgg aggagaaaga gggaatcccc ccacaacagc agaggctcat ctacagtggc 241 aagcagatga atgatgagaa gacagcagct gattacaaga ttttaggtgg ttcagtcctt 301 cacctggtgt tggctctgag aggaggaggt ggtcttaggc agtgatggac cctccatttt 361 acctctttac cctgtcgctc ataatgaggc atcatatatc ctctcactct ctgggacacc 421 atagccactg ccccctcccc tggatgccca gtaatgtatg tctactggtg ggagactgtg 481 aggatcccag gattcagtat tcctggccca gagggccttg ctggctactg ggtgttagtt 541 tgcagtcctg tgtgcttccc tctcttatga ctgtgtccct ggttgtcaat aaaatatttc 601 ctggc // LOCUS HUMUMOD 2353 bp mRNA PRI 31-AUG-1987 DEFINITION Human uromodulin (Tamm-Horsfall glycoprotein) mRNA, complete cds. ACCESSION M15881 NID g340163 KEYWORDS Tamm-Horsfall glycoprotein; glycoprotein; uromodulin. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2353) AUTHORS Pennica,D., Kohr,W.J., Kuang,W.-J., Glaister,D., Aggarwal,B.B., Chen,E.Y. and Goeddel,D.V. TITLE Identification of human uromodulin as the Tamm-Horsfall urinary glycoprotein JOURNAL Science 236, 83-87 (1987) MEDLINE 87177970 FEATURES Location/Qualifiers source 1..2353 /organism="Homo sapiens" /db_xref="taxon:9606" sig_peptide 169..240 /note="uromodulin signal peptide" CDS 169..2091 /note="uromodulin precursor" /codon_start=1 /db_xref="PID:g340164" /translation="MGQPSLTWMLMVVVASWFITTAATDTSEARWCSECHSNATCTED EAVTTCTCQEGFTGDGLTCVDLDECAIPGAHNCSANSSCVNTPGSFSCVCPEGFRLSP GLGCTDVDECAEPGLSHCHALATCVNVVGSYLCVCPAGYRGDGWHCECSPGSCGPGLD CVPEGDALVCADPCQAHRTLDEYWRSTEYGEGYACDTDLRGWYRFVGQGGARMAETCV PVLRCNTAAPMWLNGTHPSSDEGIVSRKACAHWSGHCCLWDASVQVKACAGGYYVYNL TAPPECHLAYCTDPSSVEGTCEECSIDEDCKSNNGRWHCQCKQDFNITDISLLEHRLE CGANDMKVSLGKCQLKSLGFDKVFMYLSDSRCSGFNDRDNRDWVSVVTPARDGPCGTV LTRNETHATYSNTLYLADEIIIRDLNIKINFACSYPLDMKVSLKTALQPMVSALNIRV GGTGMFTVRMALFQTPSYTQPYQGSSVTLSTEAFLYVGTMLDGGDLSRFALLMTNCYA TPSSNATDPLKYFIIQDRCPHTRDSTIQVVENGESSQGRFSVQMFRFAGNYDLVYLHC EVYLCDTMNEKCKPTCSGTRFRSGSVIDQSRVLNLGPITRKGVQATVSRAFSSLGLLK VWLPLLLSATLTLTFQ" mat_peptide 241..2088 /note="uromodulin" BASE COUNT 493 a 692 c 673 g 495 t ORIGIN 1 actaactcta cctttctggc ttcaggggga ggagagttag atcatgcatt tgtccgatcc 61 atctctgttc acaggacacc agacatcaga gacagagaga aaaattcaaa gggccaaccc 121 gtctttcctt tgggcagtag acctgaagta gcgggaagag cagaaaggat ggggcagcca 181 tctctgactt ggatgctgat ggtggtggtg gcctcttggt tcatcacaac tgcagccact 241 gacacctcag aagcaagatg gtgctctgaa tgtcacagca atgccacctg cacggaggat 301 gaggccgtta cgacgtgcac ctgtcaggag ggcttcaccg gcgatggcct gacctgcgtg 361 gacctggatg agtgcgccat tcctggagct cacaactgct ccgccaacag cagctgcgta 421 aacacgccag gctccttctc ctgcgtctgc cccgaaggct tccgcctgtc gcccggtctc 481 ggctgcacag acgtggatga gtgcgctgag cctgggctta gccactgcca cgccctggcc 541 acatgtgtca atgtggtggg cagctacttg tgcgtatgcc ccgcgggcta ccggggggat 601 ggatggcact gtgagtgctc cccgggctcc tgcgggccgg ggttggactg cgtgcccgag 661 ggcgacgcgc tcgtgtgcgc ggatccgtgt caggcgcacc gcaccctgga cgagtactgg 721 cgcagcaccg agtacgggga gggctacgcc tgcgacacgg acctgcgcgg ctggtaccgc 781 ttcgtgggcc agggcggtgc gcgcatggcc gagacctgcg tgccagtcct gcgctgcaac 841 acggccgccc ccatgtggct caatggcacg catccgtcca gcgacgaggg catcgtgagc 901 cgcaaggcct gcgcgcactg gagcggccac tgctgcctgt gggatgcgtc cgtccaggtg 961 aaggcctgtg ccggcggcta ctacgtctac aacctgacag cgccccccga gtgtcacctg 1021 gcgtactgca cagaccccag ctccgtggag gggacgtgtg aggagtgcag tatagacgag 1081 gactgcaaat cgaataatgg cagatggcac tgccagtgca aacaggactt caacatcact 1141 gatatctccc tcctggagca caggctggaa tgtggggcca atgacatgaa ggtgtcgctg 1201 ggcaagtgcc agctgaagag tctgggcttc gacaaggtct tcatgtacct gagtgacagc 1261 cggtgctcgg gcttcaatga cagagacaac cgggactggg tgtctgtagt gaccccagcc 1321 cgggatggcc cctgtgggac agtgttgacg aggaatgaaa cccatgccac ttacagcaac 1381 accctctacc tggcagatga gatcatcatc cgtgacctca acatcaaaat caactttgca 1441 tgctcctacc ccctggacat gaaagtcagc ctgaagaccg ccctacagcc aatggtcagt 1501 gctctaaaca tcagagtggg cgggaccggc atgttcaccg tgcggatggc gctcttccag 1561 accccttcct acacgcagcc ctaccaaggc tcctccgtga cactgtccac tgaggctttt 1621 ctctacgtgg gcaccatgtt ggatgggggc gacctgtccc gatttgcact gctcatgacc 1681 aactgctatg ccacacccag tagcaatgcc acggaccccc tgaagtactt catcatccag 1741 gacagatgcc cacacactag agactcaact atccaagtgg tggagaatgg ggagtcctcc 1801 cagggccgat tttccgtcca gatgttccgg tttgctggaa actatgacct agtctacctg 1861 cactgtgaag tctatctctg tgacaccatg aatgaaaagt gcaagcctac ctgctctggg 1921 accagattcc gaagtgggag tgtcatagat caatcccgtg tcctgaactt gggtcccatc 1981 acacggaaag gtgtccaggc cacagtctca agggctttta gcagcttggg gctcctgaaa 2041 gtctggctgc ctctgcttct ctcggccacc ttgaccctga cttttcagtg actgacagcg 2101 gaaagccctg tgctccatgg ctgccatctc acctcctgct gggcaggggg catgatgcgg 2161 gccagtgctc cagccacaga aaagaaagtt catgctttgt tcagcctgcc ttcttttctc 2221 ccttttaatc ctggctgtcg agaaacagcc tgtgtcttta aatgctgctt tttctcaaaa 2281 tgggacttgt gacggtgtac ctgaggcccc catctcctta aagagtgtgg caaaataatg 2341 atttttaaat ctc // LOCUS HUMUMPS 2244 bp mRNA PRI 14-JAN-1995 DEFINITION Human UMP synthase mRNA, complete cds. ACCESSION J03626 NID g340167 KEYWORDS UMP synthase; orotate phosphoribosyltransferase; orotidine-5'-monophosphate decarboxylase. SOURCE Human peripheral blood acute lymphoblastic leukemia cell (HPB-ALL), cDNA to mRNA, clones HUSc[33,39,1-5]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2244) AUTHORS Suttle,D.P., Bugg,B.Y., Winkler,J.K. and Kanalas,J.J. TITLE Molecular cloning and nucleotide sequence for the complete coding region of human UMP synthase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85 (6), 1754-1758 (1988) MEDLINE 88158071 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by D.P.Suttle, 19-FEB-1988. FEATURES Location/Qualifiers source 1..2244 /organism="Homo sapiens" /db_xref="taxon:9606" /map="3q13" mRNA <1..1759 /note="UMP mRNA (alt.)" mRNA <1..>2244 /note="UMP mRNA (alt.)" gene 105..1547 /gene="UMPS" CDS 105..1547 /gene="UMPS" /note="UMP synthase" /codon_start=1 /db_xref="GDB:G00-120-482" /db_xref="PID:g340168" /translation="MAVARAALGPLVTGLYDVQAFKFGDFVLKSGLSSPIYIDLRGIV SRPRLLSQVADILFQTAQNAGISFDTVCGVPYTALPLATVICSTNQIPMLIRRKETKD YGTKRLVEGTINPGETCLIIEDVVTSGSSVLETVEVLQKEGLKVTDAIVLLDREQGGK DKLQAHGIRLHSVCTLSKMLEILEQQKKVDAETVGRVKRFIQENVFVAANHNGSPLSI KEAPKELSFGARAELPRIHPVASKLLRLMQKKETNLCLSADVSLARELLQLADALGPS ICMLKTHVDILNDFTLDVMKELITLAKCHEFLIFEDRKFADIGNTVKKQYEGGIFKIA SWADLVNAHVVPGSGVVKGLQEVGLPLHRGCLLIAEMSSTGSLATGDYTRAAVRMAEE HSEFVVGFISGSRVSMKPEFLHLTPGVQLEAGGDNLGQQYNSPQEVIGKRGSDIIIVG RGIISAADRLEAAEMYRKAAWEAYLSRLGV" BASE COUNT 562 a 473 c 592 g 617 t ORIGIN 5 bp upstream of PstI site; chromosome 3cen-q21. 1 ctgcagacga ggcagggaga ggcgggactt cgcgggcgag acgtcatcgg ggcgccggac 61 gccggggcgc ctgggagttt gaagcaaaca ggcagcgcgc gacaatggcg gtcgctcgtg 121 cagctttggg gccattggtg acgggtctgt acgacgtgca ggctttcaag tttggggact 181 tcgtgctgaa gagcgggctt tcctccccca tctacatcga tctgcggggc atcgtgtctc 241 gaccgcgtct tctgagtcag gttgcagata ttttattcca aactgcccaa aatgcaggca 301 tcagttttga caccgtgtgt ggagtgcctt atacagcttt gccattggct acagttatct 361 gttcaaccaa tcaaattcca atgcttatta gaaggaaaga aacaaaggat tatggaacta 421 agcgtcttgt agaaggaact attaatccag gagaaacctg tttaatcatt gaagatgttg 481 tcaccagtgg atctagtgtt ttggaaactg ttgaggttct tcagaaggag ggcttgaagg 541 tcactgatgc catagtgctg ttggacagag agcagggagg caaggacaag ttgcaggcgc 601 acgggatccg cctccactca gtgtgtacat tgtccaaaat gctggagatt ctcgagcagc 661 agaaaaaagt tgatgctgag acagttggga gagtgaagag gtttattcag gagaatgtct 721 ttgtggcagc gaatcataat ggttctcccc tttctataaa ggaagcaccc aaagaactca 781 gcttcggtgc acgtgcagag ctgcccagga tccacccagt tgcatcgaag cttctcaggc 841 ttatgcaaaa gaaggagacc aatctgtgtc tatctgctga tgtttcactg gccagagagc 901 tgttgcagct agcagatgct ttaggaccta gtatctgcat gctgaagact catgtagata 961 ttttgaatga ttttactctg gatgtgatga aggagttgat aactctggca aaatgccatg 1021 agttcttgat atttgaagac cggaagtttg cagatatagg aaacacagtg aaaaagcagt 1081 atgaaggagg tatctttaaa atagcttcct gggcagatct agtaaatgct cacgtggtgc 1141 caggctcagg agttgtgaaa ggcctgcaag aagtgggcct gcctttgcat cgggggtgcc 1201 tccttattgc ggaaatgagc tccaccggct ccctggccac tggggactac actagagcag 1261 cggttagaat ggctgaggag cactctgaat ttgttgttgg ttttatttct ggctcccgag 1321 taagcatgaa accagaattt cttcacttga ctccaggagt tcagttggaa gcaggaggag 1381 ataatcttgg ccaacagtac aatagcccac aagaagttat tggcaaacga ggttccgata 1441 tcatcattgt aggtcgtggc ataatctcag cagctgatcg tctggaagca gcagagatgt 1501 acagaaaagc tgcttgggaa gcgtatttga gtagacttgg tgtttgagtg cttcagatac 1561 atttttcaga tacaatgtga agacattgaa gatatgtggt cctcctgaaa gtcactggct 1621 ggaaataatc caattattcc tgcttggatt cttccacagg gcctgtgtaa gaatgggttc 1681 tggagttctc atggtcttta ggaaatattg agtaatttgt aatcaccgca ttgatactat 1741 aataagttca ttcttaagct tgcttttttt gagactggtg tttgttagac agccacagtc 1801 ctgtctgggt tagggtcttc cacatttgag gatccttcct atctctccat gggactagac 1861 tgctttgtta ttctatttat tttttaattt ttttcgagac aggatctcac tctgttgccc 1921 aggatggagt gcagtggtga gatcacggct cattgcagcc tcgacctccc aggtgatcct 1981 cccacctcag cttccagatt agctggtgct ataggcatgc accaccacgt ccatctaaat 2041 ttctttatta tttgtagaga tgaggtcttg ccatgttacc caggctggtc tcaactcctg 2101 ggctcaagcg atcctcctgc ctcagtctct caaagtgctg ggattacagg tgtgagccac 2161 tgtgcccagc ctaattgcag taagacaaaa attctagggc accaagaggc taaagtcagc 2221 acagcttttc ttgtgtcctg tatt // LOCUS HUMUPSRP 1737 bp mRNA PRI 19-FEB-1997 DEFINITION Human mRNA for rod photoreceptor protein, complete cds. ACCESSION D63813 NID g961456 KEYWORDS rod photoreceptor protein. SOURCE Homo sapiens adult neural retina cDNA to mRNA, clones RA337M and GS4642. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1737) AUTHORS Shimizu,A., Nishida,K., Kinoshita,S., Inazawa,J., Okubo,K. and Matsubara,K. TITLE Expression profile of active genes in human retina JOURNAL Unpublished (1995) REFERENCE 2 (bases 1 to 1737) AUTHORS Shimizu,A. TITLE Direct Submission JOURNAL Submitted (08-AUG-1995) to the DDBJ/EMBL/GenBank databases. Akiyo Shimizu, Institute for Molecular and Cellular Biology, Osaka Univ.; Yamada-oka 1-3, Suita, Osaka 565, Japan (E-mail:kousaku@imcb.osaka-u.ac.jp., Tel:06-877-5111(ex.3910), Fax:06-877-1922) FEATURES Location/Qualifiers source 1..1737 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="RA337M, GS4642" /dev_stage="adult" /tissue_type="neural retina" variation 38..39 /note="replace(38..39" sig_peptide 52..111 CDS 52..1452 /codon_start=1 /product="unknown prepropeptide specific to rod photoreceptor" /db_xref="PID:d1010530" /db_xref="PID:g961457" /translation="MKPPLLVFIVCLLWLKDSHCAPTWKDKTAISENLKSFSEVGEID ADEEVKKALTGIKQMKIMMERKEKEHTNLMSTLKKCREEKQEALKLLNEVQEHLEEEE RLCRESLADSWGECRSCLENNCMRIYTTCQPSWSSVKNKIERFFRKIYQFLFPFHEDN EKDLPISEKLIEEDAQLTQMEDVFSQLTVDVNSLFNRSFNVFRQMQQEFDQTFQSHFI SDTDLTEPYFFPAFSKEPMTKADLEQCWDIPNFFQLFCNFSVSIYESVSETITKMLKA IEDLPKQDKAPDHGGLISKMLPGQDRGLCGELDQNLSRCFKFHEKCQKCQAHLSEDCP DVPALHTELDEAIRLVNVSNQQYGQILQMTRKHLEDTAYLVEKMRGQFGWVSELANQA PETEIIFNSIQVVPRIHEGNISKQDETMMTDLSILPSSNFTLKIPLEESAESSNFIGY VVAKALQHFKEHFKTW" mat_peptide 112..1449 /product="rod photoreceptor protein" polyA_signal 1716..1721 BASE COUNT 577 a 320 c 379 g 461 t ORIGIN 1 agaagctggt ggcaacttca ctggggagat attgcaaata acagcgggaa catgaagccg 61 ccactcttgg tgtttattgt gtgtctgctg tggttgaaag acagtcactg cgcacccact 121 tggaaggaca aaactgctat cagtgaaaac ctgaagagtt tttctgaggt gggggagata 181 gatgcagatg aagaggtgaa gaaggctttg actggtatta agcaaatgaa aatcatgatg 241 gaaagaaaag agaaggaaca caccaatcta atgagcaccc tgaagaaatg cagagaagaa 301 aagcaggagg ccctgaaact tctgaatgaa gttcaagaac atctggagga agaagaaagg 361 ctatgccggg agtctttggc agattcctgg ggtgaatgca ggtcttgcct ggaaaataac 421 tgcatgagaa tttatacaac ctgccaacct agctggtcct ctgtgaaaaa taagattgaa 481 cggtttttca ggaagatata tcaatttcta tttcctttcc atgaagataa tgaaaaagat 541 ctccccatca gtgaaaagct cattgaggaa gatgcacaat tgacccaaat ggaggatgtg 601 ttcagccagt tgactgtgga tgtgaattct ctctttaaca ggagttttaa cgtcttcaga 661 cagatgcagc aagagtttga ccagactttt caatcacatt tcatatcaga tacagaccta 721 actgagcctt acttttttcc agctttctct aaagagccga tgacaaaagc agatcttgag 781 caatgttggg acattcccaa cttcttccag ctgttttgta atttcagtgt ctctatttat 841 gaaagtgtca gtgaaacaat tactaagatg ctgaaggcaa tagaagattt accaaaacaa 901 gacaaagctc ctgaccacgg aggcctgatt tcaaagatgt tacctgggca ggacagagga 961 ctgtgtgggg aacttgacca gaatttgtca agatgtttca aatttcatga aaaatgccaa 1021 aaatgtcagg ctcacctatc tgaagactgt cctgatgtac ctgctctgca cacagaatta 1081 gacgaggcga tcaggttggt caatgtatcc aatcagcagt atggccagat tctccagatg 1141 acccggaagc acttggagga caccgcctat ctggtggaga agatgagagg gcaatttggc 1201 tgggtgtctg aactggcaaa ccaggcccca gaaacagaga tcatctttaa ttcaatacag 1261 gtagttccaa ggattcatga aggaaatatt tccaaacaag atgaaacaat gatgacagac 1321 ttaagcattc tgccttcctc taatttcaca ctcaagatcc ctcttgaaga aagtgctgag 1381 agttctaact tcattggcta cgtagtggca aaagctctac agcattttaa ggaacatttt 1441 aaaacctggt aagaagatct aatgcatcct atatccagta agtagaatta tctcttcatc 1501 tgggacctgg aaatcctgaa ataaaaaagg ataatgcaat aaacacagtt gcaggaaagt 1561 atgttagcta tatactatga agtactctta gtttacttat gttgaatggc ttagctatta 1621 atactcaaat tgagttaaaa tgaaaattcc tccttaaaaa atcaaacgta atatgtatta 1681 catttcatgg tacattagta gttctttgta tattgaataa atactaaatc acctaaa // LOCUS HUMUPST1 419 bp mRNA PRI 07-SEP-1996 DEFINITION Human apM2 mRNA for GS2374 (unknown product specific to adipose tissue), complete cds. ACCESSION D45370 NID g871884 KEYWORDS apM2; adipose specific collagen-like factor; adipose most abundant gene transcript 2; GS2374. SOURCE Homo sapiens adult adipose tissue cDNA to mRNA, clone:apM2. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Maeda,K., Okubo,K., Shimomura,I., Funahashi,T., Matsuzawa,Y. and Matsubara,K. TITLE cDNA cloning and expression of a novel adipose specific collagen-like factor, apM1 (AdiPose Most abundant Gene transcript 1) JOURNAL Biochem. Biophys. Res. Commun. 221 (2), 286-289 (1996) MEDLINE 96224171 REFERENCE 2 (bases 1 to 419) AUTHORS Maeda,K. JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 419) AUTHORS Maeda,K. TITLE Direct Submission JOURNAL Submitted (27-JAN-1995) to the DDBJ/EMBL/GenBank databases. Kazuhisa Maeda, Osaka University, Institute for Molecular and Cellular Biology; Yamada-oka 1-3, Suita, Osaka 565, Japan (E-mail:kmaeda@imed2.med.osaka-u.ac.jp, Tel:06-877-5111(ex.3910), Fax:06-877-1922) FEATURES Location/Qualifiers source 1..419 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="apM2" /dev_stage="adult" /tissue_type="adipose tissue" mRNA 1..419 /note="mRMA specific to adipose tissue and cornea" gene 32..262 /gene="apM2" CDS 32..262 /gene="apM2" /codon_start=1 /product="unknown product specific to adipose tissue" /db_xref="PID:d1008821" /db_xref="PID:g871885" /translation="MASKGLQDLKQQVEGTAQEAVSAAGAAAQQVVDQATEAGQKAMD QLAKTTQETIDKTANQASDTFSGIGKKFGLLK" BASE COUNT 104 a 135 c 118 g 62 t ORIGIN 1 ctcttgacga ctccacagat accccgaagc catggcaagc aagggcttgc aggacctgaa 61 gcaacaggtg gaggggaccg cccaggaagc cgtgtcagcg gccggagcgg cagctcagca 121 agtggtggac caggccacag aggcggggca gaaagccatg gaccagctgg ccaagaccac 181 ccaggaaacc atcgacaaga ctgctaacca ggcctctgac accttctctg ggatcgggaa 241 aaaattcggc ctcctgaaat gacagcaggg agacttgggt cggcctcctg aaatgatagc 301 agggagactt gggtgacccc ccttccaggc gccatctagc acagcctggc cctgatctcc 361 gggcagccac cacctcctcg gtctgccccc tcattaaaat tcacgttccc accctgaaa // LOCUS HUMUPST2 4517 bp mRNA PRI 07-SEP-1996 DEFINITION Human apM1 mRNA for GS3109 (novel adipose specific collagen-like factor), complete cds. ACCESSION D45371 NID g871886 KEYWORDS apM1; adipose most abundant gene transcript 1; adipose specific collagen-like factor; GS3109. SOURCE Homo sapiens adult adipose tissue cDNA to mRNA, clone:apM1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Maeda,K., Okubo,K., Shimomura,I., Funahashi,T., Matsuzawa,Y. and Matsubara,K. TITLE cDNA cloning and expression of a novel adipose specific collagen-like factor, apM1 (AdiPose Most abundant Gene transcript 1) JOURNAL Biochem. Biophys. Res. Commun. 221 (2), 286-289 (1996) MEDLINE 96224171 REFERENCE 2 (bases 1 to 4517) AUTHORS Maeda,K. JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 4517) AUTHORS Maeda,K. TITLE Direct Submission JOURNAL Submitted (27-JAN-1995) to the DDBJ/EMBL/GenBank databases. Kazuhisa Maeda, Osaka University, Institute for Molecular and Cellular Biology; Yamada-oka 1-3, Suita, Osaka 565, Japan (E-mail:kmaeda@imed2.med.osaka-u.ac.jp, Tel:06-877-5111(ex.3910), Fax:06-877-1922) FEATURES Location/Qualifiers source 1..4517 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="apM1" /dev_stage="adult" /tissue_type="adipose tissue" mRNA 1..4517 /note="mRNA specific and most abundant to adipose tissue" /evidence=experimental gene 27..761 /gene="apM1" CDS 27..761 /gene="apM1" /codon_start=1 /product="a novel adipose specific collagen-like factor, apM1 (adipose most abundant gene transcript 1)" /db_xref="PID:d1008822" /db_xref="PID:g871887" /translation="MLLLGAVLLLLALPGHDQETTTQGPGVLLPLPKGACTGWMAGIP GHPGHNGAPGRDGRDGTPGEKGEKGDPGLIGPKGDIGETGVPGAEGPRGFPGIQGRKG EPGEGAYVYRSAFSVGLETYVTIPNMPIRFTKIFYNQQNHYDGSTGKFHCNIPGLYYF AYHITVYMKDVKVSLFKKDKAMLFTYDQYQENNVDQASGSVLLHLEVGDQVWLQVYGE GERNGLYADNDNDSTFTGFLLYHDTN" sig_peptide 27..68 /gene="apM1" mat_peptide 69..758 /gene="apM1" /note="nt69-149: nonhelical region, nt150-347: triple-helical region, nt348-758: nonhelical region." polyA_signal 4497..4502 BASE COUNT 1135 a 1048 c 940 g 1394 t ORIGIN 1 ctgattccat accagagggg ctcaggatgc tgttgctggg agctgttcta ctgctattag 61 ctctgcccgg gcatgaccag gaaaccacga ctcaagggcc cggagtcctg cttcccctgc 121 ccaagggggc ctgcacaggt tggatggcgg gcatcccagg gcatccgggc cataatgggg 181 ccccaggccg tgatggcaga gatggcaccc ctggtgagaa gggtgagaaa ggagatccag 241 gtcttattgg tcctaaggga gacatcggtg aaaccggagt acccggggct gaaggtcccc 301 gaggctttcc gggaatccaa ggcaggaaag gagaacctgg agaaggtgcc tatgtatacc 361 gctcagcatt cagtgtggga ttggagactt acgttactat ccccaacatg cccattcgct 421 ttaccaagat cttctacaat cagcaaaacc actatgatgg ctccactggt aaattccact 481 gcaacattcc tgggctgtac tactttgcct accacatcac agtctatatg aaggatgtga 541 aggtcagcct cttcaagaag gacaaggcta tgctcttcac ctatgatcag taccaggaaa 601 ataatgtgga ccaggcctcc ggctctgtgc tcctgcatct ggaggtgggc gaccaagtct 661 ggctccaggt gtatggggaa ggagagcgta atggactcta tgctgataat gacaatgact 721 ccaccttcac aggctttctt ctctaccatg acaccaactg atcaccacta actcagagcc 781 tcctccaggc caaacagccc caaagtcaat taaaggcttt cagtacggtt aggaagttga 841 ttattattta gttggaggcc tttagatatt attcattcat ttactcattc atttattcat 901 tcattcatca agtaacttta aaaaaatcat atgctatgtt cccagtcctg gggagcttca 961 caaacatgac cagataactg actagaaaga agtagttgac agtgctattt tgtgcccact 1021 gtctctcctg atgctcatat caatcctata aggcacaggg aacaagcatt ctcctgtttt 1081 tacagattgt atcctgaggc tgagagagtt aagtgaatgt ctaaggtcac acagtattaa 1141 gtgacagtgc tagaaatcaa acccagagct gtggactttg ttcactagac tgtgcccttt 1201 tatagaggta catgttctct ttggagtgtt ggtaggtgtc tgtttcccac ctcacctgag 1261 agccattgaa tttgccttcc tcatgaatta aaacctcccc caagcagagc ttcctcagag 1321 aaagtggttc tatgatgaag tcctgtcttg gaaggactac tactcaatgg cccctgcact 1381 actctacttc ctcttaccta tgtcccttct catgcctttc cctccaacgg ggaaagccaa 1441 ctccatctct aagtgctgaa ctcatccctg ttcctcaagg ccacctggcc aggagcttct 1501 ctgatgtgat atccactttt tttttttttt gagatggagt ctcactctgt cacccaggct 1561 ggagtacagt gacacgacct cggctcactg cagcctcctt ctcctgggtc caagcaatta 1621 ttgtgcctca gcctcccgag tagctgagac ttcaggtgca ttccaccaca catggctaat 1681 ttttgtattt ttagtagaaa tggggtttcg tcatgttggc caggctggtc tcgaactcct 1741 ggcctaggtg atccacccgc ctcgacctcc caaagtgctg ggattacagg catgagccac 1801 catgcccagt cgatatctca ctttttattt tgccatggat gagagtcctg ggtgtgagga 1861 acacctccca ccaggctaga ggcaactgcc caggaaggac tgtgcttccg tcacctctaa 1921 atcccttgca gatccttgat aaatgcctca tgaagaccaa tctcttgaat cccatatcta 1981 cccagaatta actccattcc agtctctgca tgtaatcagt tttatccaca gaaacatttt 2041 cattttagga aatccctggt ttaagtatca atccttgttc agctggacaa tatgaatctt 2101 ttccactgaa gttagggatg actgtgattt tcagaacacg tccagaattt ttcatcaaga 2161 aggtagcttg agcctgaaat gcaaaaccca tggaggaatt ctgaagccat tgtctccttg 2221 agtaccaaca gggtcaggga agactgggcc tcctgaattt attattgttc tttaagaatt 2281 acaggttgag gtagttgatg gtggtaaaca ttctctcagg agacaataac tccagtgatg 2341 tttttcaaag attttagcaa aaacagagta aatagcattc tctatcaata tataaattta 2401 aaaaactatc tttttgctta cagttttaaa ttctgaacaa tttctcttat atgtgtattg 2461 ctaatcatta aggtattatt ttttccacat ataaagcttt gtctttttgt tgttgttgtt 2521 gtttttaaga tggagtttcc ctctgttgcc aggctagagt gcagtggcat gatctcggct 2581 tactgcaacc tttgcctccc aggtttaagc gattcttctg cctcagcctc ccgagtagct 2641 gggaccacag gtgcctacca ccatgccagg ctaatttttg tatttttagt aaagacaggg 2701 tttcaccata ttggccaggc tggtctcgaa ctcctgacct tgtgatctgc ccgcctccat 2761 tgtgttgtta tttgtgagaa agatagatat gaggtttaga gagggatgaa gaggtgagag 2821 taagccttgt gttagtcaga actctgtgtt gtgaatgtca ttcacaacag aaaacccaaa 2881 atattatgca aactactgta agcaagaaaa ataaaggaaa aatggaaaca tttattcctt 2941 tgcataatag aaattaccag agttgttctg tctttagata aggtttgaac caaagctcaa 3001 aacaatcaag acccttttct gtatgtcctt ctgttctgcc ttccgcagtg taggctttac 3061 cctcaggtgc tacacagtat agttctaggg tttccctccc gatatcaaaa agactgtggc 3121 ctgcccagct ctcgtatccc caagccacac catctggcta aatggacatc atgttttctg 3181 gtgatgccca aagaggagag aggaagctct ctttcccaga tgccccagca agtgtaacct 3241 tgcatctcat tgctctggct gagttgtgtg cctgtttctg accaatcact gagtcaggag 3301 gatgaaatat tcatattgac ttaattgcag cttaagttag gggtatgtag aggtattttc 3361 cctaaagcaa aattgggaca ctgttatcag aaataggaga gtggatgata gatgcaaaat 3421 aatacctgtc cacaacaaac tcttaatgct gtgtttgagc tttcatgagt ttcccagaga 3481 gacatagctg gaaaattcct attgattttc tctaaaattt caacaagtag ctaaagtctg 3541 gctatgctca cagtctcaca tctggtgggg gtgggctcct tacagaacac gctttcacag 3601 ttaccctaaa ctctctgggg cagggttatt cctttgtgga accagaggca cagagacagt 3661 caactgaggc ccaacagagg cctgagagaa actgaggtca agatttcagg attaatggtc 3721 ctgtgatgct ttgaagtaca attgtggatt tgtccaattc tctttagttc tgtcagcttt 3781 tgcttcatat attttagcgc tctattatta gatatataca tgtttagtat tatgtcttat 3841 tggtgcattt actctcttat cattatgtaa tgtccttctt tatctgtgat aattttctgt 3901 gttctgaagt ctactttgtc taaaaataac atacgcactc aacttccttt tctttcttcc 3961 ttcctttctt tcttccttcc tttctttctc tctctctctt tccttccttc cttcctcctt 4021 ttctctctct ctctctctct ctctcttttc ttgacagact ctcgttctgt ggccctggct 4081 ggagttcagt ggtgtgatct tggctcactg ctacctctac catgagcaat tctcctgcct 4141 cagcctccca agtagctgga actacaggct catgccactg cgcccagcta atttttgtat 4201 ttttcgtaga gacggggttt caccacattc gtcaggttgg tttcaaactc ctgactttgt 4261 gatccacccg cctcggcctc ccaaagtgct gggattacag gcatgagcca tcacacctgg 4321 tcaactttct tttgattagt gtttttgtgg tatatctttt tccatcatgt tactttaaat 4381 atatctatat tattgtattt aaaatgtgtt tcttacagac tgcatgtagt tgggtataat 4441 ttttatccag tctaaaaata tctgtctttt aattggtgtt tagacaattt atatttaata 4501 aaatggtgga atttaaa // LOCUS HUMVATPASE 680 bp mRNA PRI 17-SEP-1996 DEFINITION Human fetus brain mRNA for vacuolar ATPase, complete cds. ACCESSION D49400 NID g1395161 KEYWORDS vacuolar ATPase. SOURCE Homo sapiens fetus brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 680) AUTHORS Fujiwara,T., Kawai,A., Shimizu,F., Hirano,H., Okuno,S., Takeda,S., Ozaki,K., Shimada,Y., Nagata,M., Watanabe,T., Takaichi,A., Nakamura,Y. and Shin,S. TITLE Cloning, sequencing and expression of a novel cDNA encoding human vacuolar ATPase (14-kDa subunit) JOURNAL DNA Res. 2 (3), 107-111 (1995) MEDLINE 96038262 REFERENCE 2 (bases 1 to 680) AUTHORS Fujiwara,T. TITLE Direct Submission JOURNAL Submitted (18-FEB-1995) to the DDBJ/EMBL/GenBank databases. Tsutomu Fujiwara, OTSUKA GEN Research Institute, Otsuka Pharmaceutical Co.,Ltd; 463-10 Kagasuno Kawauchi-cho, Tokushima, Tokushima 771-01, Japan (Tel:0886-65-2126(ex.2411), Fax:0886-37-1035) FEATURES Location/Qualifiers source 1..680 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="brain" CDS 51..410 /EC_number="3.6.1.3" /codon_start=1 /product="vacuolar ATPase" /db_xref="PID:d1008988" /db_xref="PID:g1395162" /translation="MAGRGKLTAVIGDEDTVTGFLLGGIGELNKNRHPNFLVVEKDTT INEIEDTFRQFLNRDDIGIILINQYIAEMVRHALDAHQQSIPAVLEIPSKEHPYDAAK DSILRRARGMFTAEDLR" polyA_signal 652..657 polyA_site 680 BASE COUNT 138 a 204 c 174 g 164 t ORIGIN 1 agtttcagtg gcttctggtg ctctagggtg agctctgccc ggctgcaggg atggcgggga 61 ggggtaagct caccgcagtg atcggagacg aggacacggt gactggtttc ctgctgggcg 121 gcatagggga gcttaacaag aaccgccatc ccaatttcct ggtggtggag aaggatacaa 181 ccatcaatga gatcgaagac actttccggc aatttctaaa ccgggatgac attggcatca 241 tcctcatcaa ccagtacatc gcagagatgg tgcggcatgc cctggacgcc caccagcagt 301 ccatccccgc tgtcctggag atcccctcca aggagcaccc atatgacgcc gccaaggact 361 ccatcctgcg cagggccagg ggcatgttca ctgccgaaga cctgcgctag gggactcctc 421 atagccctca gcccttccct cgtttccagg cctctcccca ggcttgccat cagccttctt 481 tactttttga gcctctgatt tccaattccc tgctccttcc cactccatta agaggctagg 541 tgaggcgctt ctaggttgct ggggctctgc tggttaagga acaggaagcc tgaccatctc 601 cctccactac ctcttccctg tgctgttaca cagtgtcatt gttgatgtta aattaaagtc 661 atattcttgc ttctctccag // LOCUS HUMVDAC1X 1806 bp mRNA PRI 14-JAN-1995 DEFINITION Human voltage-dependent anion channel isoform 1 (VDAC) mRNA, complete cds. ACCESSION L06132 NID g340198 KEYWORDS isoform 1; outer membrane channel; voltage-dependent anion channel. SOURCE Homo sapiens pituitary gland cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1806) AUTHORS Blachly-Dyson,E., Zambronicz,E.B., Yu,W.H., Adams,V., McCabe,E.R., Adelman,J., Colombini,M. and Forte,M. TITLE Cloning and functional expression in yeast of two human isoforms of the outer mitochondrial membrane channel, the voltage-dependent anion channel JOURNAL J. Biol. Chem. 268 (3), 1835-1841 (1993) MEDLINE 93131931 FEATURES Location/Qualifiers source 1..1806 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="pituitary gland" gene 100..1793 /gene="VDAC" CDS 100..951 /gene="VDAC" /codon_start=1 /product="voltage-dependent anion channel" /db_xref="PID:g340199" /translation="MAVPPTYADLGKSARDVFTKGYGFGLIKLDLKTKSENGLEFTSS GSANTETTKVTGSLETKYRWTEYGLTFTEKWNTDNTLGTEITVEDQLARGLKLTFDSS FSPNTGKKNAKIKTGYKREHINLGCDMDFDIAGPSIRGALVLGYEGWLAGYQMNFETA KSRVTQSNFAVGYKTDEFQLHTNVNDGTEFGGSIYQKVNKKLETAVNLAWTAGNSNTR FGIAAKYQIDPDACFSAKVNNSSLIGLGYTQTLKPGIKLTLSALLDGKNVNAGGHKLG LGLEFQA" polyA_signal 1788..1793 /gene="VDAC" polyA_site 1806 /gene="VDAC" BASE COUNT 488 a 401 c 417 g 500 t ORIGIN 1 gccgctcgct cggctccgct ccctggctcg gctccctgcc tccgcgtcgc agcccccgcc 61 gtagccgcct ccgagcccgc cgccacatcc tctgagaaga tggctgtgcc acccacgtat 121 gccgatcttg gcaaatctgc cagggatgtc ttcaccaagg gctatggatt tggcttaata 181 aagcttgatt tgaaaacaaa atctgagaat ggattggaat ttacaagctc aggctcagcc 241 aacactgaga ccaccaaagt gacgggcagt ctggaaacca agtacagatg gactgagtac 301 ggcctgacgt ttacagagaa atggaatacc gacaatacac taggcaccga gattactgtg 361 gaagatcagc ttgcacgtgg actgaagctg accttcgatt catccttctc acctaacact 421 gggaaaaaaa atgctaaaat caagacaggg tacaagcggg agcacattaa cctgggctgc 481 gacatggatt tcgacattgc tgggccttcc atccggggtg ctctggtgct aggttacgag 541 ggctggctgg ccggctacca gatgaatttt gagactgcaa aatcccgagt gacccagagc 601 aactttgcag ttggctacaa gactgatgaa ttccagcttc acactaatgt gaatgacggg 661 acagagtttg gcggctccat ttaccagaaa gtgaacaaga agttggagac cgctgtcaat 721 cttgcctgga cagcaggaaa cagtaacacg cgcttcggaa tagcagccaa gtatcagatt 781 gaccctgacg cctgcttctc ggctaaagtg aacaactcca gcctgatagg tttaggatac 841 actcagactc taaagccagg tattaaactg acactgtcag ctcttctgga tggcaagaac 901 gtcaatgctg gtggccacaa gcttggtcta ggactggaat ttcaagcata aatgaatact 961 gtacaattgt ttaattttaa actattttgc agcatagcta ccttcagaat ttagtgtatc 1021 ttttaatgtt gtatgtctgg gatgcaagta ttgctaaata tgttagccct ccaggttaaa 1081 gttgattcag ctttaagatg ttacccttcc agaggtacag aagaaaccta tttccaaaaa 1141 aggtcctttc agtggtagac tcggggagaa cttggtggcc cctttgagat gccaggtttc 1201 ttttttatct agaaatggct gcaagtggaa gcggataata tgtaggcact ttgtaaattc 1261 atattgagta aatgaatgaa attgtgattt cctgagaatc gaaccttggt tccctaaccc 1321 taattgatga gaggctcgct gcttgatggt gtgtacaaac tcacctgaat gggacttttt 1381 tagacagatc ttcatgacct gttcccaccc cagttcatca tcatctcttt tacaccaaaa 1441 ggtctgcagg gtgtggtaac tgtttctttt gtgccatttt ggggtggaga aggtggatgt 1501 gatgaagcca ataattcagg acttattcct tcttgtgttg tgtttttttt tggcccttgc 1561 accagagtat gaaatagctt ccaggagctc cagctataag cttggaagtg tctgtgtgat 1621 tgtaatcaca tggtgacaac actcagaatc taaattggac ttctgttgta ttctcaccac 1681 tcaatttgtt ttttagcagt ttaatgggta cattttagag tcttccattt tgttggaatt 1741 agatcctccc cttcaaatgc tgtaattaac aacacttaaa aaacttgaat aaaatattga 1801 aacctc // LOCUS HUMVINC 5102 bp mRNA PRI 14-JAN-1995 DEFINITION Human vinculin mRNA, complete cds. ACCESSION M33308 NID g340236 KEYWORDS cytoskeletal protein; vinculin. SOURCE Human endothelial cells, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5102) AUTHORS Weller,P.A., Ogryzko,E.P., Corben,E.B., Zhidkova,N.I., Patel,B., Price,G.J., Spurr,N.K., Koteliansky,V.E. and Critchley,D.R. TITLE Complete sequence of human vinculin and assignment of the gene to chromosome 10 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87 (15), 5667-5671 (1990) MEDLINE 90332642 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.A.Weller, 28-MAR-1990. FEATURES Location/Qualifiers source 1..5102 /organism="Homo sapiens" /db_xref="taxon:9606" /map="10q22-q23" gene 51..3251 /gene="VCL" CDS 51..3251 /gene="VCL" /note="vinculin" /codon_start=1 /db_xref="GDB:G00-125-348" /db_xref="PID:g340237" /translation="MPVFHTRTIESILEPVAQQISHLVIMHEEGEVDGKAIPDLTAPV AAVQAAVSNLVRVGKETVQTTEDQILKRDMPPAFIKVENACTKLVQAAQMLQSDPYSV PARDYLIDGSRGILSGTSDLLLTFDEAEVRKIIRVCKGILEYLTVAEVVETMEDLVTY TKNLGPGMTKMAKMIDERQQELTHQEHRVMLVNSMNTVKELLPVLISAMKIFVTTKNS KNQGIEEALKNRNFTVEKMSAEINEIIRVLQLTSWDEDAWASKDTEAMKRALASIDSK LNQAKGWLRDPSASPGDAGEQAIRQILDEAGKVGELCAGKERREILGTCKMLGQMTDQ VADLRARGQGSSPVAMQKAQQVSQGLDVLTAKVENAARKLEAMTNSKQSIAKKIDAAQ NWLADPNGGPEGEEQIRGALAEARKIAELCDDPKERDDILRSLGEISALTSKLADLRR QGKGDSPEARALAKQVATALQNLQTKTNRAVANSRPAKAAVHLEGKIEQAQRWIDNPT VDDRGVGQAAIRGLVAEGHRLANVMMGPYRQDLLAKCDRVDQLTAQLADLAARGEGES PQARALASQLQDSLKDLKARMQEAMTQEVSDVFSDTTTPIKLLAVAATAPPDAPNREE VFDERAANFENHSGKLGATAEKAAAVGTANKSTVEGIQASVKTARELTPQVVSAARIL LRNPGNQAAYEHFETMKNQWIDNVEKMTGLVDEAIDTKSLLDASEEAIKKDLDKCKVA MANIQPQMLVAGATSIARRANRILLVAKREVENSEDPKFREAVKAASDELSKTISPMV MDAKAVAGNISDPGLQKSFLDSGYRILGAVAKVREAFQPQEPDFPPPPPDLEQLRLTD ELAPPKPPLPEGEVPPPRPPPPEEKDEEFPEQKAGEVINQPMMMAARQLHDEARKWSS KGNDIIAAAKRMALLMAEMSRLVRGGSGTKRALIQCAKDIAKASDEVTRLAKEVAKQC TDKRIRTNLLQVCERIPTISTQLKILSTVKATMLGRTNISDEESEQATEMLVHNAQNL MQSVKETVREAEAASIKIRTDAGFTLRWVRKTPWYQ" BASE COUNT 1379 a 1248 c 1273 g 1202 t ORIGIN Chromosome 10. 1 gaattccact tctctgtcgc ccgcggttcg ccgccccgct cgccgccgcg atgccagtgt 61 ttcatacgcg cacgatcgag agcatcctgg agccggtggc acagcagatc tcccacctgg 121 tgataatgca cgaggagggc gaggtggacg gcaaagccat tcctgacctc accgcgcccg 181 tggccgccgt gcaggcggcc gtcagcaacc tcgtccgggt tggaaaagag actgttcaaa 241 ccactgagga tcagattttg aagagagata tgccaccagc atttattaag gttgagaatg 301 cttgcaccaa gcttgtccag gcagctcaga tgcttcagtc agacccttac tcagtgcctg 361 ctcgagatta tctaattgat gggtcaaggg gcatcctctc tggaacatca gacctgctcc 421 ttaccttcga tgaggctgag gtccgtaaaa ttattagagt ttgcaaagga attttggaat 481 atcttacagt ggcagaggtg gtggagacta tggaagattt ggtcacttac acaaagaatc 541 ttgggccagg aatgactaag atggccaaga tgattgacga gagacagcag gagctcactc 601 accaggagca ccgagtgatg ttggtgaact cgatgaacac cgtgaaagag ttgctgccag 661 ttctcatttc agctatgaag atttttgtaa caactaaaaa ctcaaaaaac caaggcatag 721 aggaagcttt aaaaaatcgc aattttactg tagaaaaaat gagtgctgaa attaatgaga 781 taattcgtgt gttacaactc acctcttggg atgaagatgc ctgggccagc aaggacactg 841 aagccatgaa gagagcattg gcctccatag actccaaact gaaccaggcc aaaggttggc 901 tccgtgaccc tagtgcctcc ccaggggatg ctggtgagca ggccatcaga cagatcttag 961 atgaagctgg aaaagttggt gaactctgtg caggcaaaga acgcagggag attctgggaa 1021 cttgcaaaat gctagggcag atgactgatc aagtggctga cctccgtgcc agaggacaag 1081 gatcctcacc ggtggccatg cagaaagctc agcaggtatc tcagggtctg gatgtgctca 1141 cagcaaaagt ggaaaatgca gctcgcaagc tggaagccat gaccaactca aagcagagca 1201 ttgcaaagaa gatcgatgct gctcagaact ggcttgcaga tccaaatggt ggaccggaag 1261 gagaagagca gattcgaggt gctttggctg aagctcggaa aatagcagaa ttatgtgatg 1321 atcctaaaga aagagatgac attctacgtt cccttgggga aatatctgct ctgacttcta 1381 aattagcaga tctacgaaga caggggaaag gagattctcc agaggctcga gccttggcca 1441 aacaggtggc cacggccctg cagaacctgc agaccaaaac caaccgggct gtggccaaca 1501 gcagaccggc caaagcagct gtacaccttg agggcaagat tgagcaagca cagcggtgga 1561 ttgataatcc cacagtggat gaccgtggag tcggtcaggc tgccatccgg gggcttgtgg 1621 ccgaagggca tcgtctggct aatgttatga tggggcctta tcggcaagat cttctcgcca 1681 agtgtgaccg agtggaccag ctgacagccc agctggctga cctggctgcc agaggggaag 1741 gggagagtcc tcaggcacga gcacttgcat ctcagctcca agactcctta aaggatctaa 1801 aagctcggat gcaggaggcc atgactcagg aagtgtcaga tgttttcagc gataccacaa 1861 ctcccatcaa gctgttggca gtggcagcca cggcgcctcc tgatgcgcct aacagggaag 1921 aggtatttga tgagagggca gctaactttg aaaaccattc aggaaagctt ggtgctacgg 1981 ccgagaaggc ggctgcggtt ggtactgcta ataaatcaac agtggaaggc attcaggcct 2041 cagtgaagac ggcccgagaa ctcacacccc aggtggtctc ggctgctcgt atcttactta 2101 ggaaccctgg aaatcaagct gcttatgaac attttgagac catgaagaac cagtggatcg 2161 ataatgttga aaaaatgaca gggctggtgg acgaagccat tgataccaaa tctctgttgg 2221 atgcttcaga agaagcaatt aaaaaagacc tggacaagtg caaggtagct atggccaaca 2281 ttcagcctca gatgctggtt gctggggcaa ccagtattgc tcgtcgggcc aaccggatcc 2341 tgctggtggc taagagggag gtggagaatt ccgaggatcc caagttccgt gaggctgtga 2401 aagctgcctc tgatgaattg agcaaaacca tctccccaat ggtgatggat gcaaaagctg 2461 tggctggaaa catttccgac cctggactgc aaaagagctt cctggactca ggatatcgga 2521 tcctgggagc tgtggccaag gtcagagaag ccttccaacc tcaggagcct gacttcccgc 2581 cgcctccacc agaccttgaa caactccgac taacagatga gcttgctcct cccaaaccac 2641 ctctgcctga aggtgaggtc cctccaccta ggcctccacc accagaggaa aaggatgaag 2701 agttccctga gcagaaggcc ggggaggtga ttaaccagcc aatgatgatg gctgccagac 2761 agctccatga tgaagctcgc aaatggtcca gcaagggcaa tgacatcatt gcagcagcca 2821 agcgcatggc tctgctgatg gctgagatgt ctcggctggt aagagggggc agtggtacca 2881 agcgggcact cattcagtgt gccaaggaca tcgccaaggc ctcagatgag gtgactcggt 2941 tggccaagga ggttgccaag cagtgcacag ataaacggat tagaaccaac ctcttacagg 3001 tatgtgagcg aatcccaacc ataagcaccc agctcaaaat cctgtccaca gtgaaggcca 3061 ccatgctggg ccggaccaac atcagtgatg aggagtctga gcaggccaca gagatgctgg 3121 ttcacaatgc ccagaacctc atgcagtctg tgaaggagac tgtgcgggaa gctgaagctg 3181 cttcaatcaa aattcgaaca gatgctggat ttacactgcg ctgggttaga aagactccct 3241 ggtaccagta ggcacctggc tgagcctggc tggcacagaa acctctacta aaaagaagga 3301 aaatgatctg agtcccagga gctgcccaga gttgctggga gctgaaaaat cacatcctgg 3361 cctggcacat cagaaaggaa tgggggcctc ttcaaattag aagacattta tactcttttt 3421 tcatggacac tttgaaatgt gtttctgtat aaagcctgta ttctcaaaca cagttacact 3481 tgtgcaccct ctatcccaat aggcagactg ggtttctagc ccatggactt cacataagct 3541 cagaatccaa gtgaacacta gccagacact ctgctctgcc cttgttccct aggggacact 3601 tccctctgtt tctctttcct tggctcccat tcactcttcc agaatcccaa gacccagggc 3661 ccaggcaaat cagttactaa gaagaaaatt gctgtgcctc ccaaaattgt tttgagcttt 3721 ccatgttgct gccaaccata ccttccttcc ctgggctgtg ctacctgggt ccttttcaga 3781 agtgagcttt gctgctacag gggaaggtgg cctctgtgga gccccagcat atgggggcct 3841 ggattcattt cctgcccttc ctcagtttaa tccttctagt ttcccacaat ataaaactgt 3901 acttcactgt caggaagaaa tcacagaatc atatgattct gcttttacca tgcccctgag 3961 caatgtctgt gctagggaaa ctccccgtcc catatcctgc ctcagcccgc caaggtagcc 4021 atcccatgaa cacactgtgt cctggtgctc tctgccactg gaagggcaga gtagccaggg 4081 tgtggccctg ccatcttccc agcagggcca ctcccggcac tccatgctta gtcactgcct 4141 gcagaggtct gtgctgaggc cttatcattc attcttagct cttaattgtt cattttgagc 4201 tgaaatgctg cattttaatt ttaaccaaaa catgtctcct atatcctggt ttttgtagcc 4261 ttcctccaca tcctttctaa acaagatttt aaagacatgt aggtgtttgt tcatctgtaa 4321 ctctaaaaga tcctttttaa attcagtcct aagaaagagg agtgcttgtc ccctaagagt 4381 gtttaatggc aaggcagccc tgtctgaagg acacttcctg cctaagggag agtggtattt 4441 gcagactaga attctagtgc tgctgaagat gaatcaatgg gaaatactac tcctgtaatt 4501 cctacctccc tgcaaccaac tacaaccaag ctctctgcat ctactcccaa gtatggggtt 4561 caagagagta atgggtttca tatttcttat caccacagta agttcctact aggcaaaatg 4621 agagggcagt gtttcctttt tggtacttat tactgctaag tatttcccag cacatgaaac 4681 cttatttttt ccaaagccag aaccagatga gtaaaggagt aagaaccttg cctgaacatc 4741 cttccttccc acccatcgct gtgtgttagt tcccaacatc gaatgtgtac aacttaagtt 4801 ggtcctttac actcaggctt tcactatttc ctttaaaatg aggatgatta ttttcaaggc 4861 cctcagcata tttgtatagt tgcttgcctg atataaatgc aatattaatg cctttaaagt 4921 atgaatctat gccaaagatc acttgttgtt ttactaaaga aagattactt agaggaaata 4981 agaaaaatca tgtttgctct cccggttctt ccagtggttt gagacactgg tttacacttt 5041 atgccggatg tgcttttctc caatatcagt gctcgagaca cagtgaagca aattaaaaaa 5101 aa // LOCUS HUMVLDLR 3656 bp mRNA PRI 28-NOV-1994 DEFINITION Human very low density lipoprotein receptor mRNA, complete cds. ACCESSION L20470 NID g409425 KEYWORDS very low density lipoprotein receptor. SOURCE Homo sapiens (library: lambda ZAP) adult skeletal muscle cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3656) AUTHORS Gafvels,M.E., Caird,M., Britt,D., Jackson,C.L., Patterson,D. and Strauss,J.F. III. TITLE Cloning of a cDNA encoding a putative human very low density lipoprotein/apolipoprotein E receptor and assignment of the gene to chromosome 9pter-p23 JOURNAL Somat. Cell Mol. Genet. 19 (6), 557-569 (1993) MEDLINE 94174378 FEATURES Location/Qualifiers source 1..3656 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="adult" /tissue_type="skeletal muscle" /tissue_lib="lambda ZAP" /map="9pter-p23" 5'UTR 1..391 sig_peptide 392..472 CDS 392..3013 /codon_start=1 /product="very low density lipoprotein receptor" /db_xref="PID:g409426" /translation="MGTSALWAVWLLLALCWAPRESGATGTGRKAKCEPSQFQCTNGR CITLLWKCDGDEDCVDGSDEKNCVKKTCAESDFVCNNGQCVPSRWKCDGDPDCEDGSD ESPEQCHMRTCRIHEISCGAHSTQCIPVSWRCDGENDCDSGEDEENCGNITCSPDEFT CSSGRCISRNFVCNGQDDCSDGSDELDCAPPTCGAHEFQCSTSSCIPISWVCDDDADC SDQSDESLEQCGRQPVIHTKCPASEIQCGSGECIHKKWRCDGDPDCKDGSDEVNCPSR TCRPDQFECEDGSCIHGSRQCNGIRDCVDGSDEVNCKNVNQCLGPGKFKCRSGECIDI SKVCNQEQDCRDWSDEPLKECHINECLVNNGGCSHICKDLVIGYECDCAAGFELIDRK TCGDIDECQNPGICSQICINLKGGYKCECSRAYQMDLATGVCKAVGKEPSLIFTNRRD IRKIGLERKEYIQLVEQLRNTVALDADIAAQKLFWADLSQKAIFSASIDDKVGRHVKM IDNVYNPAAIAVDWVYKTIYWTDAASKTISVATLDGTKRKFLFNSDLREPASIAVDPL SGFVYWSDWGEPAKIEKAGMNGFDRRPLVTADIQWPNGITLDLIKSRLYWLDSKLHML SSVDLNGQDRRIVLKSLEFLAHPLALTIFEDRVYWIDGENEAVYGANKFTGSEHATLV NNLNDAQDIIVYHELVQPSGKNWCEEDMENGGCEYLCLPAPQINDHSPKYTCSCPSGY NVEENGRDCQSTATTVTYSETKDTNTTEISATSGLVPGGINVTTAVSEVSVPPKGTSA AWAILPLLLLVMAAVGGYLMWRNWQHKNMKSMNFDNPVYLKTTEEDLSIDIGRHSASV GHTYPAISVVSTDDDLA" 3'UTR 3014..3614 polyA_signal 3609..3614 BASE COUNT 990 a 843 c 907 g 916 t ORIGIN 1 ctctgcgggc cgcgggtgcg ggtcgtcgct accggctctc tccgttctgt gctctcttct 61 gctctcggct ccccaccccc tctcccttcc ctcctctccc cttgcctccc ctcctctgca 121 gcgcctgcat tattttctgc ccgcagctcg gcttgcactg ctgctgcagc ccggggaggt 181 ggctgggtgg gtggggagga gactgtgcaa gttgtagggg agggggtgcc ctcttcttcc 241 ccgctccctt ccccagccaa gtggttcccc tccttctccc cctttcccct cccagccccc 301 accttcttcc tctttcggaa gggctggtaa cttgtcgtgc ggagcgaacg gcggcggcgg 361 cggcggcggc ggcaccatcc aggcgggcac catgggcacg tccgcgctct gggccgtctg 421 gctgctgctc gcgctgtgct gggcgccccg ggagagcggc gccaccggaa ccgggagaaa 481 agccaaatgt gaaccctccc aattccagtg cacaaatggt cgctgtatta cgctgttgtg 541 gaaatgtgat ggggatgaag actgtgttga cggcagtgat gaaaagaact gtgtaaagaa 601 gacgtgtgct gaatctgact tcgtgtgcaa caatggccag tgtgttccca gccgatggaa 661 gtgtgatgga gatcctgact gcgaagatgg ttcagatgaa agcccagaac agtgccatat 721 gagaacatgc cgcatacatg aaatcagctg tggcgcccat tctactcagt gtatcccagt 781 gtcctggaga tgtgatggtg aaaatgattg tgacagtgga gaagatgaag aaaactgtgg 841 caatataaca tgtagtcccg acgagttcac ctgctccagt ggccgctgca tctccaggaa 901 ctttgtatgc aatggccagg atgactgcag cgatggcagt gatgagctgg actgtgcccc 961 gccaacctgt ggcgcccatg agttccagtg cagcacctcc tcctgcatcc ccatcagctg 1021 ggtatgcgac gatgatgcag actgctccga ccaatctgat gagtccctgg agcagtgtgg 1081 ccgtcagcca gtcatacaca ccaagtgtcc agccagcgaa atccagtgcg gctctggcga 1141 gtgcatccat aagaagtggc gatgtgatgg ggaccctgac tgcaaggatg gcagtgatga 1201 ggtcaactgt ccctctcgaa cttgccgacc tgaccaattt gaatgtgagg atggcagctg 1261 catccatggc agcaggcagt gtaatggtat ccgagactgt gtcgatggtt ccgatgaagt 1321 caactgcaaa aatgtcaatc agtgcttggg ccctggaaaa ttcaagtgca gaagtggaga 1381 atgcatagat atcagcaaag tatgtaacca ggagcaggac tgcagggact ggagtgatga 1441 gcccctgaaa gagtgtcata taaacgaatg cttggtaaat aatggtggat gttctcatat 1501 ctgcaaagac ctagttatag gctacgagtg tgactgtgca gctgggtttg aactgataga 1561 taggaaaacc tgtggagata ttgatgaatg ccaaaatcca ggaatctgca gtcaaatttg 1621 tatcaactta aaaggcggtt acaagtgtga atgtagtcgt gcctatcaaa tggatcttgc 1681 tactggcgtg tgcaaggcag taggcaaaga gccaagtctg atcttcacta atcgaagaga 1741 catcaggaag attggcttag agaggaaaga atatatccaa ctagttgaac agctaagaaa 1801 cactgtggct ctcgatgctg acattgctgc ccagaaacta ttctgggccg atctaagcca 1861 aaaggctatc ttcagtgcct caattgatga caaggttggt agacatgtta aaatgatcga 1921 caatgtctat aatcctgcag ccattgctgt tgattgggtg tacaagacca tctactggac 1981 tgatgcggct tctaagacta tttcagtagc taccctagat ggaaccaaga ggaagttcct 2041 gtttaactct gacttgcgag agcctgcctc catagctgtg gacccactgt ctggctttgt 2101 ttactggtca gactggggtg aaccagctaa aatagaaaaa gcaggaatga atggattcga 2161 tagacgtcca ctggtgacag cggatatcca gtggcctaac ggaattacac ttgaccttat 2221 aaaaagtcgc ctctattggc ttgattctaa gttgcacatg ttatccagcg tggacttgaa 2281 tggccaagat cgtaggatag tactaaagtc tctggagttc ctagctcatc ctcttgcact 2341 aacaatattt gaggatcgtg tctactggat agatggggaa aatgaagcag tctatggtgc 2401 caataaattc actggatcag agcatgccac tctagtcaac aacctgaatg atgcccaaga 2461 catcattgtc tatcatgaac ttgtacagcc atcaggtaaa aattggtgtg aagaagacat 2521 ggagaatgga ggatgtgaat acctatgcct gccagcacca cagattaatg atcactctcc 2581 aaaatatacc tgttcctgtc ccagtgggta caatgtagag gaaaatggcc gagactgtca 2641 aagtactgca actactgtga cttacagtga gacaaaagat acgaacacaa cagaaatttc 2701 agcaactagt ggactagttc ctggagggat caatgtgacc acagcagtat cagaggtcag 2761 tgttccccca aaagggactt ctgccgcatg ggccattctt cctctcttgc tcttagtgat 2821 ggcagcagta ggtggctact tgatgtggcg gaattggcaa cacaagaaca tgaaaagcat 2881 gaactttgac aatcctgtgt acttgaaaac cactgaagag gacctctcca tagacattgg 2941 tagacacagt gcttctgttg gacacacgta cccagcaata tcagttgtaa gcacagatga 3001 tgatctagct tgacttctgt gacaaatgtt gacctttgag gtctaaacaa ataatacccc 3061 cgtcggaatg gtaaccgagc cagcagctga agtctctttt tcttcctctc ggctggaaga 3121 acatcaagat acctttgcgt ggatcaagct tgctgtactt gaccgttttt atattacttt 3181 tgtaaatatt cttgtccaca ttctacttca gctttggatg tggttaccga gtatctgtaa 3241 cccttgaatt tctagacagt attgccacct ctggccaaat atgcactttc cctagaaagc 3301 catattccag cagtgaaact tgtgctatag tgtataccac ctgtacatac attgtatagg 3361 ccatctgtaa atatcccaga gaacaatcac tattcttaag cactttgaaa atatttctat 3421 gtaaattatt gtaaactttt tcaatggttg ggacaatggc aataggacaa aacgggttac 3481 taagatgaaa ttgccaaaaa aatttataaa ctaattttgg tacgtatgaa tgatatcttt 3541 gacctcaatg gaggtttgca aagactgagt gttcaaacta ctgtacattt tttttcaagt 3601 gctaaaaaat taaaccaagc agcttaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa // LOCUS HUMVTNR 5717 bp mRNA PRI 15-MAR-1989 DEFINITION Human cell adhesion protein (vitronectin) receptor alpha subunit mRNA, complete cds. ACCESSION M14648 J02826 M18365 NID g340306 KEYWORDS cell adhesion protein receptor; glycoprotein; vitronectin receptor. SOURCE Human fibroblast cell line IMR-90, cDNA to mRNA, clones lambda-VNR[10,11]; clones lambda VRN[21,26] [3]; HUVE, cDNA to mRNA (see comment). ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1276 to 5717) AUTHORS Suzuki,S., Argraves,W.S., Pytela,R., Arai,H., Krusius,T., Pierschbacher,M.D. and Ruoslahti,E. TITLE cDNA and amino acid sequences of the cell adhesion protein receptor recognizing vitronectin reveal a transmembrane domain and homologies with other adhesion protein receptors JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 8614-8618 (1986) MEDLINE 87041504 REFERENCE 2 (bases 20 to 1343) AUTHORS Fitzgerald,L.A., Poncz,M., Steiner,B., Rall,S.C.Jr., Bennett,J.S. and Phillips,D.R. TITLE Comparison of cDNA-derived protein sequences of the human fibronectin and vitronectin receptor alpha-subunits and platelet glycoprotein IIb JOURNAL Biochemistry 26, 8158-8165 (1987) MEDLINE 88163472 REFERENCE 3 (bases 1 to 5717) AUTHORS Suzuki,S., Argraves,W.S., Arai,H., Languino,L.R., Pierschbacher,M.D. and Ruoslahti,E. TITLE Amino acid sequence of the vitronectin receptor alpha subunit and comparative expression of adhesion receptor mRNAs JOURNAL J. Biol. Chem. 262, 14080-14085 (1987) MEDLINE 88007656 COMMENT Complete source information: Human fibroblast cell line IMR-90, cDNA to mRNA, clones lambda-VNR[10,11] [1]; clones lambda VRN[21,26] [3]; HUVE, cDNA to mRNA [2]. FEATURES Location/Qualifiers source 1..5717 /organism="Homo sapiens" /db_xref="taxon:9606" CDS 42..3188 /note="vitronectin alpha subunit precursor" /codon_start=1 /db_xref="PID:g340307" /translation="MAFPPRRRLRLGPRGLPLLLSGLLLPLCRAFNLDVDSPAEYSGP EGSYFGFAVDFFVPSASSRMFLLVGAPKANTTQPGIVEGGQVLKCDWSSTRRCQPIEF DATGNRDYAKDDPLEFKSHQWFGASVRSKQDKILACAPLYHWRTEMKQEREPVGTCFL QDGTKTVEYAPCRSQDIDADGQGFCQGGFSIDFTKADRVLLGGPGSFYWQGQLISDQV AEIVSKYDPNVYSIKYNNQLATRTAQAIFDDSYLGYSVAVGDFNGDGIDDFVSGVPRA ARTLGMVYIYDGKNMSSLYNFTGEQMAAYFGFSVAATDINGDDYADVFIGAPLFMDRG SDGKLQEVGQVSVSLQRASGDFQTTKLNGFEVFARFGSAIAPLGDLDQDGFNDIAIAA PYGGEDKKGIVYIFNGRSTGLNAVPSQILEGQWAARSMPPSFGYSMKGATDIDKNGYP DLIVGAFGVDRAILYRARPVITVNAGLEVYPSILNQDNKTCSLPGTALKVSCFNVRFC LKADGKGVLPRKLNFQVELLLDKLKQKGAIRRALFLYSRSPSHSKNMTISRGGLMQCE ELIAYLRDESEFRDKLTPITIFMEYRLDYRTAADTTGLQPILNQFTPANISRQAHILL DCGEDNVCKPKLEVSVDSDQKKIYIGDDNPLTLIVKAQNQGEGAYEAELIVSIPLQAD FIGVVRNNEALARLSCAFKTENQTRQVVCDLGNPMKAGTQLLAGLRFSVHQQSEMDTS VKFDLQIQSSNLFDKVSPVVSHKVDLAVLAAVEIRGVSSPDHIFLPIPNWEHKENPET EEDVGPVVQHIYELRNNGPSSFSKAMLHLQWPYKYNNNTLLYILHYDIDGPMNCTSDM EINPLRIKISSLQTTEKNDTVAGQGERDHLITKRDLALSEGDIHTLGCGVAQCLKIVC QVGRLDRGKSAILYVKSLLWTETFMNKENQNHSYSLKSSASFNVIEFPYKNLPIEDIT NSTLVTTNVTWGIQPAPMPVPVWVIILAVLAGLLLLAVLVFVMYRMGFFKRVRPPQEE QEREQLQPHENGEGNSET" sig_peptide 42..131 /note="vitronectin alpha subunit signal peptide" mat_peptide 132..2711 /note="vitronectin alpha subunit heavy chain" mat_peptide 2712..3185 /note="vitronectin alpha subunit light chain" BASE COUNT 1704 a 1025 c 1149 g 1839 t ORIGIN Unreported. 1 ggctaccgct cccggcttgg cgtcccgcgc gcacttcggc gatggctttt ccgccgcggc 61 gacggctgcg cctcggtccc cgcggcctcc cgcttcttct ctcgggactc ctgctacctc 121 tgtgccgcgc cttcaaccta gacgtggaca gtcctgccga gtactctggc cccgagggaa 181 gttacttcgg cttcgccgtg gatttcttcg tgcccagcgc gtcttcccgg atgtttcttc 241 tcgtgggagc tcccaaagca aacaccaccc agcctgggat tgtggaagga gggcaggtcc 301 tcaaatgtga ctggtcttct acccgccggt gccagccaat tgaatttgat gcaacaggca 361 atagagatta tgccaaggat gatccattgg aatttaagtc ccatcagtgg tttggagcat 421 ctgtgaggtc gaaacaggat aaaattttgg cctgtgcccc attgtaccat tggagaactg 481 agatgaaaca ggagcgagag cctgttggaa catgctttct tcaagatgga acaaagactg 541 ttgagtatgc tccatgtaga tcacaagata ttgatgctga tggacaggga ttttgtcaag 601 gaggattcag cattgatttt actaaagctg acagagtact tcttggtggt cctggtagct 661 tttattggca aggtcagctt atttcggatc aagtggcaga aatcgtatct aaatacgacc 721 ccaatgttta cagcatcaag tataataacc aattagcaac tcggactgca caagctattt 781 ttgatgacag ctatttgggt tattctgtgg ctgtcggaga tttcaatggt gatggcatag 841 atgactttgt ttcaggagtt ccaagagcag caaggacttt gggaatggtt tatatttatg 901 atgggaagaa catgtcctcc ttatacaatt ttactggcga gcagatggct gcatatttcg 961 gattttctgt agctgccact gacattaatg gagatgatta tgcagatgtg tttattggag 1021 cacctctctt catggatcgt ggctctgatg gcaaactcca agaggtgggg caggtctcag 1081 tgtctctaca gagagcttca ggagacttcc agacgacaaa gctgaatgga tttgaggtct 1141 ttgcacggtt tggcagtgcc atagctcctt tgggagatct ggaccaggat ggtttcaatg 1201 atattgcaat tgctgctcca tatgggggtg aagataaaaa aggaattgtt tatatcttca 1261 atggaagatc aacaggcttg aacgcagtcc catctcaaat ccttgaaggg cagtgggctg 1321 ctcgaagcat gccaccaagc tttggctatt caatgaaagg agccacagat atagacaaaa 1381 atggatatcc agacttaatt gtaggagctt ttggtgtaga tcgagctatc ttatacaggg 1441 ccagaccagt tatcactgta aatgctggtc ttgaagtgta ccctagcatt ttaaatcaag 1501 acaataaaac ctgctcactg cctggaacag ctctcaaagt ttcctgtttt aatgttaggt 1561 tctgcttaaa ggcagatggc aaaggagtac ttcccaggaa acttaatttc caggtggaac 1621 ttcttttgga taaactcaag caaaagggag caattcgacg agcactgttt ctctacagca 1681 ggtccccaag tcactccaag aacatgacta tttcaagggg gggactgatg cagtgtgagg 1741 aattgatagc gtatctgcgg gatgaatctg aatttagaga caaactcact ccaattacta 1801 tttttatgga atatcggttg gattatagaa cagctgctga tacaacaggc ttgcaaccca 1861 ttcttaacca gttcacgcct gctaacatta gtcgacaggc tcacattcta cttgactgtg 1921 gtgaagacaa tgtctgtaaa cccaagctgg aagtttctgt agatagtgat caaaagaaga 1981 tctatattgg ggatgacaac cctctgacat tgattgttaa ggctcagaat caaggagaag 2041 gtgcctacga agctgagctc atcgtttcca ttccactgca ggctgatttc atcggggttg 2101 tccgaaacaa tgaagcctta gcaagacttt cctgtgcatt taagacagaa aaccaaactc 2161 gccaggtggt atgtgacctt ggaaacccaa tgaaggctgg aactcaactc ttagctggtc 2221 ttcgtttcag tgtgcaccag cagtcagaga tggatacttc tgtgaaattt gacttacaaa 2281 tccaaagctc aaatctattt gacaaagtaa gcccagttgt atctcacaaa gttgatcttg 2341 ctgttttagc tgcagttgag ataagaggag tctcgagtcc tgatcatatc tttcttccga 2401 ttccaaactg ggagcacaag gagaaccctg agactgaaga agatgttggg ccagttgttc 2461 agcacatcta tgagctgaga aacaatggtc caagttcatt cagcaaggca atgctccatc 2521 ttcagtggcc ttacaaatat aataataaca ctctgttgta tatccttcat tatgatattg 2581 atggaccaat gaactgcact tcagatatgg agatcaaccc tttgagaatt aagatctcat 2641 ctttgcaaac aactgaaaag aatgacacgg ttgccgggca aggtgagcgg gaccatctca 2701 tcactaagcg ggatcttgcc ctcagtgaag gagatattca cactttgggt tgtggagttg 2761 ctcagtgctt gaagattgtc tgccaagttg ggagattaga cagaggaaag agtgcaatct 2821 tgtacgtaaa gtcattactg tggactgaga cttttatgaa taaagaaaat cagaatcatt 2881 cctattctct gaagtcgtct gcttcattta atgtcataga gtttccttat aagaatcttc 2941 caattgagga tatcaccaac tccacattgg ttaccactaa tgtcacctgg ggcattcagc 3001 cagcgcccat gcctgtgcct gtgtgggtga tcattttagc agttctagca ggattgttgc 3061 tactggctgt tttggtattt gtaatgtaca ggatgggctt ttttaaacgg gtccggccac 3121 ctcaagaaga acaagaaagg gagcagcttc aacctcatga aaatggtgaa ggaaactcag 3181 aaacttaact gcagttttta agttatgcta catcttgacc cactagaatt agcaacttta 3241 ttatagattt aaactttctt catgaggagt aaaaatccaa ggctttactg ctgatagtgc 3301 taattggcat taaccacaaa atgagaatta tatttgtcaa ccttctcctt ataaataagt 3361 tcagacatac atttaataac atagggtgac ttgtgttttt aggtatttaa ataataaaat 3421 ttcaagggat agtttttatt caatgtatat aagacaggta gtgcctgatt tactacttta 3481 tataaaatag tacctccttc agttactgtt tctgatttaa tgtacggaac tttatttgtt 3541 gttgttgttg ttgttgttgt tgttgtttta aagcagtcca aatttggacc ttagcaatca 3601 tgtcttttgt ataggtactt aatgttaata catattacac tacagtttac ttttcagaat 3661 actaaagact ttataactgc atgaacttgg atttttttaa tcactcatat ggtagaattt 3721 tataaacaca tacatgatac catccaaatt cttgctttta ataacaaagg tacaatattt 3781 tgttttagta tgaaaatctg gtagatccta ttacacttct gtttatatta aatccacaat 3841 attttattac atttttaact tgtataaatt ttaggtcaaa tccttcaagc caacctatac 3901 taaaaattag ttccataatc acaaatggct cttttgtgta attgtttaat ttcacctgaa 3961 tatcataatg cttaaagcca tatggagttg gaaattattt ccaaagcata tttattccat 4021 tgttttagtc tggctattta cagtataaaa aaagcatttt attaaaatac tgtgtagttc 4081 tttgagatag ttgcttatgc atatagtaag tattacattc ttagagtaga gcagagtttt 4141 tagttagtat taatttattt tcctccattc atgtactttt ccttatattt ccaaaactgt 4201 tactgagaat gggtcaagat cagtgagaaa tctttacagt tgacaggaac ctggacccct 4261 taccccaact ttatgagtaa tgcttggaat aaaaaactct taaggcaact cactgattta 4321 cttctagcaa tagcatgatg ttacaggaat attacctctg tttaagcaag gtaatgtgta 4381 aaatcagtct cggctgtcag aataacttct aaaaggtatt tttataagca gttcaagtta 4441 ctgaaaacct tttaaacctt tctgaagttc gttagtataa attacttttc taggattatt 4501 aataaaagcc acataggtgg caagttgtag ttttatatgg ctctgtagag tggtgaacct 4561 tctagaggaa tatatgattt attcacagtt cctcaaggcc tggggatgat gatcagttat 4621 acctattttt gtgcaattac atcatgttgt acattagaaa tggagagttt aatagctctt 4681 taactgctgt cctcattagg taatgataaa tatttccctt aaataattga ctattttgct 4741 gtgttttaaa aatgattgaa atttatcttg ccatatctca taatttcatg cacaagttga 4801 ctgagctaat cttgagaata tattcgtaaa ataggagcac atttagttga ggtatacaag 4861 gtaggactct agacaaaacc ttctatttta gctttagtga atttcaaaag taatgggtct 4921 tggagtatag atttttatta gtagcttgaa agagcttaat catatgcagt aagtattttt 4981 attaccaata aatttaaaat tttttaagaa aaatattttt atcctagggc caagtgttgc 5041 ctgccaccaa tcagtaagtt agtctataac aaattttacc ctaacagttt taccacctag 5101 caacagtcat ttctgaaaat atgttggata gaaagtcact ctttggcaaa agtgttagaa 5161 tttgcttttg tgccatctat tccttttatg gcatctatct tgaaagtaat cttgtattgg 5221 agattgaaag atgctgtaat ttagaaatta acatgatatc ttaaattacc tttatgaaat 5281 atagttttgt ataatagcat agattttcct tcaaaaaatg aacatttata tatctacaaa 5341 aatatggaga agagcaattt gaaagcctac tttctgaaga aaatggtggg attttttttt 5401 atcatgatta aatatcaaaa aattgcccta tgaaaacttt aaatctctaa aacatttgaa 5461 atactaccat atttgtgatt tattgagaat aaaaatccat tttgaaatgt aaaattttta 5521 tgatctgatt cagttttaag aaaacatgaa tgaactagaa gatattaaaa acatttgaca 5581 ttggtaagaa atattgatac tgatattgat ttttatatag gtatttattt cagaattgat 5641 attttgagaa aaatacatgt gagtcatttt ttctgtttct cttttctctt aacgattatc 5701 actgtaattc tgaatct // LOCUS HUMWITA 2139 bp mRNA PRI 07-MAR-1995 DEFINITION Human Wilms' tumor (WIT-1) associated protein mRNA, complete cds. ACCESSION M60614 M37983 NID g340365 KEYWORDS Wilms' tumor. SOURCE Human fetal kidney, cDNA to mRNA, and DNA, clones GB[16,22,36,2b.1,4b.1,4b.3]. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2139) AUTHORS Huang,A., Campbell,C.E., Bonetta,L., McAndrews-Hill,M.S., Chilton-MacNeill,S., Coppes,M.J., Law,D.J., Feinberg,A.P., Yeger,H. and Williams,B.R. TITLE Tissue, developmental, and tumor-specific expression of divergent transcripts in Wilms tumor JOURNAL Science 250 (4983), 991-994 (1990) MEDLINE 91048012 FEATURES Location/Qualifiers source 1..2139 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="GB22" /dev_stage="fetus" /tissue_type="kidney" /tissue_lib="lambda-gt-10" /map="11p13" gene 952..1230 /gene="WIT-11" CDS 952..1230 /gene="WIT-11" /codon_start=1 /product="Wilms' tumor assocated protein" /db_xref="PID:g340366" /translation="MQRRGQPLENHVALIHWQSAGIPASKVHNYCNMKKSRLGRSRAV RISQPLLSPRRCPLHLTERGAGLLQPQPQGPVRTPGPPSGSHPAAADN" BASE COUNT 505 a 611 c 582 g 441 t ORIGIN 1 gaggtgggag gagttccctg cagcctgcgg ggccaagaag aaagtgctaa gctgaggagc 61 cgcagtcgga ggaccccagg gacccgacag gaagtctccg tacgacccca acggatccac 121 atgcccggaa gcccaggcga cactaagcca gcgctgggga actgacgtac tcctgcagtc 181 gcagggcgct ccatgcctct ctgtcctctt ctttgttgtg ggtaacgttc tctgcgggag 241 acctgaggtt ttccaagggg gacatgccag ctacactggc cttggcgccc tgagctaagc 301 accaggactc acagcctagc acgaaggcag gtagcacctt cacccgccgc ggacgatagg 361 cgctctcctg tacctccttt agctcgcgag gcccgccctt cgcgaggtcc cagagaaaag 421 caggctgtgg aaaactggcg cccccttctt tcacccacct tcttacccct gtcagcgccg 481 agatctgtag cagaggttcc tggtctgaac caccgattgg caaagaaagc tgcagattaa 541 acttctcgtt ttacagagaa ggaaactgag gcccagacag ccgaaggaga ggcagtctat 601 ggagcgcagc ggtaaagagc aaggggttgg tggccccaga atggaacctc agctctgcca 661 ggtaaccaac agctgtgtga ccctagacga gttccgcagt ctctctgagc ctcagtttcc 721 ttactggtca aacgggataa tgggatacta gcgcccacct catagagttg ttctgaggat 781 tagataggag agcagtgtgg gcctttgact caatcaactt tacagttttt gttactacta 841 ctgtttaagg acaggaaaca agttagtggc aacgcgagct gaaacccggg tctctcaacg 901 tccagttgga ggttttaccc accacccctc tacctgttca gagctaatgg gatgcagagg 961 cgaggacagc ccctggaaaa ccatgtggcg ttgatacact ggcaaagcgc aggcatcccg 1021 gcctcgaagg tgcataatta ttgcaatatg aaaaaatcga ggctgggtag gagcagggca 1081 gtgaggattt ctcaaccctt actttcaccc cggcgctgtc cactgcatct gacagagcgc 1141 ggagctgggc tgctacagcc gcaaccccag ggaccagtgc gcacgcctgg gccgccctcc 1201 gggagtcacc cagcggccgc ggacaactga ataaacaccc caaagcgctg cggtcggtca 1261 agggcgggga cagccacagt gcgcgcgggg cccgcaggcc gtaataaaga gtggctcgac 1321 ctcgctgctg gcctgtcgcg ggagaggaac aagctttcga ctagcgcctc tccccggggc 1381 ccgcgccccg agccccacgc caagacagcc caacagctgt tcctcccctc cccgccgact 1441 ccaactcttc ggaatctgcc cactcggggg ctgcagggca agtgtttagg atggttccca 1501 gccccgcgct gcgcggtgaa aatttcaacg tcattccttc aattaaaaaa aggggggggc 1561 aagggagggg ctttgtgata actactccca gcttcttctg atcatttcaa aattaagtcg 1621 atttttttta accagtcccc acttactgtc ctaactctcc tcgctgaccc tatctgggag 1681 ccggaaccgt taggtactgc cgaatgcggt gcaaatttcc cctctccccc agttcgcagt 1741 gcctggagcc gctggggtta ctcgtctgtt ctgatgccac cgcgagatgg tccccgagct 1801 ccccgagagt cctcagtgaa aggattccgc ggcactgcct ctattattat accgtaaatc 1861 tttttaaatt ctggaactaa ttatatagag gatatgtctc aatttgttct gcattaatgc 1921 cacagtgggg atggaggcca ggccgtggcc agagcagata cgtaggccca tgaaattgat 1981 gaactgagag ttgtcttcca gtcctgagcg caccatctgg aattccagtc tgaggttaca 2041 attagaactc ctgaccccag attcaacttt gtaaacaaca ggggaaaaaa atggggaaaa 2101 gaaaaaccac gagcgaggaa aaaaattatc tgcacctgt // LOCUS HUMWNT5A 4114 bp mRNA PRI 17-MAR-1994 DEFINITION Homo sapiens proto-oncogene (Wnt-5a) mRNA, complete cds. ACCESSION L20861 NID g348917 KEYWORDS proto-oncogene. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4114) AUTHORS Clark,C.C., Cohen,I., Eichstetter,I., Cannizzaro,L.A., McPherson,J.D., Wasmuth,J.J. and Iozzo,R.V. TITLE Molecular cloning of the human proto-oncogene Wnt-5A and mapping of the gene (WNT5A) to chromosome 3p14-p21 JOURNAL Genomics 18, 249-260 (1993) MEDLINE 94116991 REFERENCE 2 (bases 1 to 4114) AUTHORS Iozzo,R.V. TITLE Direct Submission JOURNAL Submitted (07-JUL-1993) R.V. Iozzo, Department of Pathology and Cell Biology, Thomas Jefferson University, Philadelphia, PA 19107, USA FEATURES Location/Qualifiers source 1..4114 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="CRL-1262" /cell_type="fibroblasts" /dev_stage="fetal" /sex="female" /map="3p14-21" 5'UTR <1..483 /gene="Wnt-5a" /note="putative" gene 484..1581 /gene="Wnt-5a" sig_peptide 484..549 /gene="Wnt-5a" /note="putative" CDS 484..1581 /gene="Wnt-5a" /standard_name="hWNT5A" /note="putative" /codon_start=1 /db_xref="PID:g348918" /translation="MAGSAMSSKFFLVALAIFFSFAQVVIEANSWWSLGMNNPVQMSE VYIIGAQPLCSQLAGLSQGQKKLCHLYQDHMQYIGEGAKTGIKECQYQFRHRRWNCST VDNTSVFGRVMQIGSRETAFTYAVSAAGVVNAMSRACREGELSTCGCSRAARPKDLPR DWLWGGCGDNIDYGYRFAKEFVDARERERIHAKGSYESARILMNLHNNEAGRRTVYNL ADVACKCHGVSGSCSLKTCWLQLADFRKVGDALKEKYDSAAAMRLNSRGKLVQVNSRF NSPTTQDLVYIDPSPDYCVRNESTGSLGTQGRLCNKTSEGMDGCELMCCGRGYDQFKT VQTERCHCKFHWCCYVKCKKCTEIVDQFVCK" misc_RNA 1150..1179 /gene="Wnt-5a" /standard_name="Wnt signature" /note="putative" /function="recognition peptide for Wnt family" 3'UTR 1582..4114 /gene="Wnt-5a" /note="putative" BASE COUNT 1112 a 884 c 946 g 1172 t ORIGIN 1 attaattctg gctccacttg ttgctcggcc caggttgggg agaggacgga gggtggccgc 61 agcgggttcc tgagtgaatt acccaggagg gactgagcac agcaccaact agagaggggt 121 cagggggtgc gggactcgag cgagcaggaa ggaggcagcg cctggcacca gggctttgac 181 tcaacagaat tgagacacgt ttgtaatcgc tggcgtgccc cgcgcacagg atcccagcga 241 aaatcagatt tcctggtgag gttgcgtggg tggattaatt tggaaaaaga aactgcctat 301 atcttgccat caaaaaactc acggaggaga agcgcagtca atcaacagta aacttaagag 361 acccccgatg ctcccctggt ttaacttgta tgcttgaaaa ttatctgaga gggaataaac 421 atcttttcct tcttccctct ccagaagtcc attggaatat taagcccagg agttgctttg 481 gggatggctg gaagtgcaat gtcttccaag ttcttcctag tggctttggc catatttttc 541 tccttcgccc aggttgtaat tgaagccaat tcttggtggt cgctaggtat gaataaccct 601 gttcagatgt cagaagtata tattatagga gcacagcctc tctgcagcca actggcagga 661 ctttctcaag gacagaagaa actgtgccac ttgtatcagg accacatgca gtacatcgga 721 gaaggcgcga agacaggcat caaagaatgc cagtatcaat tccgacatcg acggtggaac 781 tgcagcactg tggataacac ctctgttttt ggcagggtga tgcagatagg cagccgcgag 841 acggccttca catacgccgt gagcgcagca ggggtggtga acgccatgag ccgggcgtgc 901 cgcgagggcg agctgtccac ctgcggctgc agccgcgccg cgcgccccaa ggacctgccg 961 cgggactggc tctggggcgg ctgcggcgac aacatcgact atggctaccg ctttgccaag 1021 gagttcgtgg acgcccgcga gcgggagcgc atccacgcca agggctccta cgagagtgct 1081 cgcatcctca tgaacctgca caacaacgag gccggccgca ggacggtgta caacctggct 1141 gatgtggcct gcaagtgcca tggggtgtcc ggctcatgta gcctgaagac atgctggctg 1201 cagctggcag acttccgcaa ggtgggtgat gccctgaagg agaagtacga cagcgcggcg 1261 gccatgcggc tcaacagccg gggcaagttg gtacaggtca acagccgctt caactcgccc 1321 accacacaag acctggtcta catcgacccc agccctgact actgcgtgcg caatgagagc 1381 accggctcgc tgggcacgca gggccgcctg tgcaacaaga cgtcggaggg catggatggc 1441 tgcgagctca tgtgctgcgg ccgtgggtac gaccagttca agaccgtgca gacggagcgc 1501 tgccactgca agttccactg gtgctgctac gtcaagtgca agaagtgcac ggagatcgtg 1561 gaccagtttg tgtgcaagta gtgggtgcca cccagcactc agccccgctc ccaggacccg 1621 cttatttata gaaagtacag tgattctggt ttttggtttt tagaaatatt ttttattttt 1681 ccccaagaat tgcaaccgga accatttttt ttcctgttac catctaagaa ctctgtggtt 1741 tattattaat attataatta ttatttggca ataatggggg tgggaaccac gaaaaatatt 1801 tattttgtgg atctttgaaa aggtaataca agacttcttt tggatagtat agaatgaagg 1861 gggaaataac acatacccta acttagctgt gtgggacatg gtacacatcc agaaggtaaa 1921 gaaatacatt ttctttttct caaatatgcc atcatatggg atgggtaggt tccagttgaa 1981 agagggtggt agaaatctat tcacaattca gcttctatga ccaaaatgag ttgtaaattc 2041 tctggtgcaa gataaaaggt cttgggaaaa caaaacaaaa caaaacaaac ctcccttccc 2101 cagcagggct gctagcttgc tttctgcatt ttcaaaatga taatttacaa tggaaggaca 2161 agaatgtcat attctcaagg aaaaaaggta tatcacatgt ctcattctcc tcaaatattc 2221 catttgcaga cagaccgtca tattctaata gctcatgaaa tttgggcagc agggaggaaa 2281 gtccccagaa attaaaaaat ttaaaactct tatgtcaaga tgttgatttg aagctgttat 2341 aagaattggg attccagatt tgtaaaaaga cccccaatga ttctggacac tagatttttt 2401 gtttggggag gttggcttga acataaatga aatatcctgt attttcttag ggatacttgg 2461 ttagtaaatt ataatagtag aaataataca tgaatcccat tcacaggttt ctcagcccaa 2521 gcaacaaggt aattgcgtgc cattcagcac tgcaccagag cagacaacct atttgaggaa 2581 aaacagtgaa atccaccttc ctcttcacac tgagccctct ctgattcctc cgtgttgtga 2641 tgtgatgctg gccacgtttc caaacggcag ctccactggg tcccctttgg ttgtaggaca 2701 ggaaatgaaa cattaggagc tctgcttgga aaacagttca ctacttaggg atttttgttt 2761 cctaaaactt ttattttgag gagcagtagt tttctatgtt ttaatgacag aacttggcta 2821 atggaattca cagaggtgtt gcagcgtatc actgttatga tcctgtgttt agattatcca 2881 ctcatgcttc tcctattgta ctgcaggtgt accttaaaac tgttcccagt gtacttgaac 2941 agttgcattt ataagggggg aaatgtggtt taatggtgcc tgatatctca aagtcttttg 3001 tacataacat atatatatat atacatatat ataaatataa atataaatat atctcattgc 3061 agccagtgat ttagatttac agcttactct ggggttatct ctctgtctag agcattgttg 3121 tccttcactg cagtccagtt gggattattc caaaagtttt ttgagtcttg agcttgggct 3181 gtggccccgc tgtgatcata ccctgagcac gacgaagcaa cctcgtttct gaggaagaag 3241 cttgagttct gactcactga aatgcgtgtt gggttgaaga tatctttttt tcttttctgc 3301 ctcacccctt tgtctccaac ctccatttct gttcactttg tggagagggc attacttgtt 3361 cgttatagac atggacgtta agagatattc aaaactcaga agcatcagca atgtttctct 3421 tttcttagtt cattctgcag aatggaaacc catgcctatt agaaatgaca gtacttatta 3481 attgagtccc taaggaatat tcagcccact acatagatag cttttttttt tttttttttt 3541 ttttaataag gacacctctt tccaaacagg ccatcaaata tgttcttatc tcagacttac 3601 gttgttttaa aagtttggaa agatacacat cttttcatac ccccccttag gaggttgggc 3661 tttcatatca cctcagccaa ctgtggctct taatttattg cataatgata tccacatcag 3721 ccaactgtgg ctctttaatt tattgcataa tgatattcac atcccctcag ttgcagtgaa 3781 ttgtgagcaa aagatcttga aagcaaaaag cactaattag tttaaaatgt cacttttttg 3841 gtttttatta tacaaaaacc atgaagtact ttttttattt gctaaatcag attgttcctt 3901 tttagtgact catgtttatg aagagagttg agtttaacaa tcctagcttt taaaagaaac 3961 tatttaatgt aaaatattct acatgtcatt cagatattat gtatatcttc tagcctttat 4021 tctgtacttt taatgtacat atttctgtct tgcgtgattt gtatatttca ctggtttaaa 4081 aaacaaacat cgaaaggctt attccaaatg gaag // LOCUS HUMX104A 4484 bp mRNA PRI 14-JAN-1995 DEFINITION Human X104 mRNA, complete cds. ACCESSION L27476 NID g498012 KEYWORDS . SOURCE Homo sapiens fetus brain cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4484) AUTHORS Duclos,F., Rodius,F., Wrogemann,K., Mandel,J.-L. and Koenig,M. TITLE The Friedrich ataxia region: characterization of two novel genes and reduction of the critical region to 300kb JOURNAL Hum. Mol. Genet. 3, 909-914 (1994) MEDLINE 95038744 FEATURES Location/Qualifiers source 1..4484 /organism="Homo sapiens" /db_xref="taxon:9606" /dev_stage="fetus" /tissue_type="brain" /map="9q13-q21" gene 80..3430 /gene="X104" CDS 80..3430 /gene="X104" /codon_start=1 /db_xref="PID:g498013" /translation="MPVRGDRGFPPRRELSGWLRAPGMEELIWEQYTVTLQKDSKRGF GIAVSGGRDNPHFENGETSIVISDVLPGGPADGLLQENDRVVMVNGTPMEDVLHSFAV QQLRKSGKVAAIVVKRPRKVQVAALQASPPLDQDDRAFEVMDEFDGRSFRSGYSERSR LNSHGGRSRSWEDSPERGRPHERARSRERDLSRDRSRGRSLERGLDQDHARTRDRSRG RSLERGLDHDFGPSRDRDRDRSRGRSIDQDYERAYHRAYDPDYERAYSPEYRRGARHD ARSRGPRSRSREHPHSRSPSPEPRGRPGPIGVLLMKSRANEEYGLRLGSQIFVKEMTR TGLATKDGNLHEGDIILKINGTVTENMSLTDARKLIEKSRGKLQLVVLRDSQQTLINI PSLNDSDSEIEDISEIESTRSFSPEERRHQYSDYDYHSSSEKLKERPSSREDTPSRLS RMGATPTPFKSTGDIAGTVVPETNKEPRYQEEPPAPQPKAAPRTFLRPSPEDEAIYGP NTKMVRFKKGDSVGLRLAGGNDVGIFVAGIQEGTSAEQEGLQEGDQILKVNTQDFRGL VREDAVLYLLEIPKGEMVTILAQSRADVYRDILACGRGDSFFIRSHFECEKETPQSLA FTRGEVFRVVDTLYDGKLGNWLAVRIGNELEKGLIPNKSRAEQMASVQNAQRDNAGDR ADFWRMRGQRSGVKKNLRKSREDLTAVVSVSTKFPAYERVLLREAGFKRPVVLFGPIA DIAMEKLANELPDWFQTAKTEPKDAGSEKSTGVVRLNTVRQVIEQDKHALLDVTPKAV DLLNYTQWFSIVISFTPDSRQGVNTMRQRLDPTSNNSSRKLFDHANKLKKTCAHLFTA TINLNSANDSWFGSLKDTIQHQQGEAVWVSEGKMEGMDDDPEDRMSYLTAMGADYLSC DSRLISDFEDTDGEGGAYTDNELDEPAEEPLVSSITRSSEPVQHEESIRKPSPEPRAQ MRRAASSDQLRDNSPPPAFKPEPSKAKTQNKEESYDFSKSYEYKSNPSAVAGNETPGA STKGYPPPVAAKPTFGRSILKPSTPIPPQEGEEVGESSEEQDNAPKSVLGKVKIFGED GSQGPGLQENAGAPGSTECKDRNCPEAS" polyA_site 4484 BASE COUNT 1234 a 1075 c 1239 g 936 t ORIGIN 1 tgcccaggag gagtaggagc aggagcagaa gcagaagcgg ggtccggagc tgcgcgccta 61 cgcgggacct gtgtccgaaa tgccggtgcg aggagaccgc gggtttccac cccggcggga 121 gctgtcaggt tggctccgcg ccccaggcat ggaagagctg atatgggaac agtacactgt 181 gaccctacaa aaggattcca aaagaggatt tggaattgca gtgtccggag gcagagacaa 241 cccccacttt gaaaatggag aaacgtcaat tgtcatttct gatgtgctcc cgggtgggcc 301 tgctgatggg ctgctccaag aaaatgacag agtggtcatg gtcaatggca cccccatgga 361 ggatgtgctt cattcgtttg cagttcagca gctcagaaaa agtgggaagg tcgctgctat 421 tgtggtcaag aggccccgga aggtccaggt ggccgcactt caggccagcc ctcccctgga 481 tcaggatgac cgggcttttg aggtgatgga cgagtttgat ggcagaagtt tccggagtgg 541 ctacagcgag aggagccggc tgaacagcca tggggggcgc agccgcagct gggaggacag 601 cccggaaagg gggcgtcccc atgagcgggc ccggagccgg gagcgggacc tcagccggga 661 ccggagccgt ggccggagcc tggagcgggg cctggaccaa gaccatgcgc gcacccgaga 721 ccgcagccgt ggccggagcc tggagcgggg cctggaccac gactttgggc catcccggga 781 ccgggaccgt gaccgcagcc gcggccggag cattgaccag gactacgagc gagcctatca 841 ccgggcctac gacccagact acgagcgggc ctacagcccg gagtacaggc gcggggcccg 901 ccacgatgcc cgctctcggg gaccccgaag ccgcagccgc gagcacccgc actcacggag 961 ccccagcccc gagcctaggg ggcggccggg gcccatcggg gtcctcctga tgaaaagcag 1021 agcgaacgaa gagtatggtc tccggcttgg gagtcagatc ttcgtaaagg aaatgacccg 1081 aacgggtctg gcaactaaag atggcaacct tcacgaagga gacataattc tcaagatcaa 1141 tgggactgta actgagaaca tgtctttaac ggatgctcga aaattgatag aaaagtcaag 1201 aggaaaacta cagctagtgg tgttgagaga cagccagcag accctcatca acatcccgtc 1261 attaaatgac agtgactcag aaatagaaga tatttcagaa atagagtcaa cccgatcatt 1321 ttctccagag gagagacgtc atcagtattc tgattatgat tatcattcct caagtgagaa 1381 gctgaaggaa aggccaagtt ccagagagga cacgccgagc agattgtcca ggatgggtgc 1441 gacacccact ccctttaagt ccacagggga tattgcaggc acagttgtcc cagagaccaa 1501 caaggaaccc agataccaag aggaaccccc agctcctcaa ccaaaagcag ccccgagaac 1561 ttttcttcgt cctagtcctg aagatgaagc aatatatggc cctaatacca aaatggtaag 1621 gttcaagaag ggagacagcg tgggcctccg gttggctggt ggcaatgatg tcgggatatt 1681 tgttgctggc attcaagaag ggacctcggc ggagcaggag ggccttcaag aaggagacca 1741 gattctgaag gtgaacacac aggatttcag aggattagtg cgggaggatg ccgttctcta 1801 cctgttagaa atccctaaag gtgaaatggt gaccatttta gctcagagcc gagccgatgt 1861 gtatagagac atcctggctt gtggcagagg ggattcgttt tttataagaa gccactttga 1921 atgtgagaag gaaactccac agagcctggc cttcaccaga ggggaggtct tccgagtggt 1981 agacacactg tatgacggca agctgggcaa ctggctggct gtgaggattg ggaacgagtt 2041 ggagaaaggc ttaatcccca acaagagcag agctgaacaa atggccagtg ttcaaaatgc 2101 ccagagagac aacgctgggg accgggcaga tttctggaga atgcgtggcc agaggtctgg 2161 ggtgaagaag aacctgagga aaagtcggga agacctcaca gctgttgtgt ctgtcagcac 2221 caagttccca gcttatgaga gggttttgct gcgagaagct ggtttcaaga gacctgtggt 2281 cttattcggc cccatagctg atatagcaat ggaaaaattg gctaatgagt tacctgactg 2341 gtttcaaact gctaaaacgg aaccaaaaga tgcaggatct gagaaatcca ctggagtggt 2401 ccggttaaat accgtgaggc aagttattga acaggataag catgcactac tggatgtgac 2461 tccgaaagct gtggacctgt tgaattacac ccagtggttc tcaattgtga tttctttcac 2521 gccagactcc agacaaggtg tcaacaccat gagacaaagg ttagacccaa cgtccaacaa 2581 tagttctcga aagttatttg atcacgccaa caagcttaaa aaaacgtgtg cacacctttt 2641 tacagctaca atcaacctaa attcagccaa tgatagctgg tttggcagct taaaggacac 2701 tattcagcat cagcaaggag aagcggtttg ggtctctgaa ggaaagatgg aagggatgga 2761 tgatgacccc gaagaccgca tgtcctactt aactgccatg ggcgcagact atctgagttg 2821 cgacagccgc ctcatcagtg actttgaaga cacggacggt gaaggaggcg cctacactga 2881 caatgagctg gatgagccag ccgaggagcc gctggtgtcg tccatcaccc gctcctcgga 2941 gccggtgcag cacgaggaga gcataaggaa acccagccca gagccacgag ctcagatgag 3001 gagggctgct agcagcgatc aacttaggga caatagcccg cccccagcat tcaagccaga 3061 gccgtccaag gccaaaaccc agaacaaaga agaatcctat gacttctcca aatcctatga 3121 atataagtca aacccctctg ccgttgctgg taatgaaact cctggggcat ctaccaaagg 3181 ttatcctcct cctgttgcag caaaacctac ctttgggcgg tctatactga agccctccac 3241 tcccatccct cctcaagagg gtgaggaggt gggagagagc agtgaggagc aagataatgc 3301 tcccaaatca gtcctgggca aagtcaaaat atttggagaa gatggatcac aagggccagg 3361 gttacaagag aatgcaggag ctccaggaag cacagaatgc aaggatcgaa attgcccaga 3421 agcatcctga tatctatgca gttccaatca aaacgcacaa gccagaccct ggcacgcccc 3481 agcacacgag ttccagaccc cctgagccac agaaagctcc ttccagacct tatcaggata 3541 ccagaggaag ttatggcagt gatgccgagg aggaggagta ccgccagcag ctgtcagaac 3601 actccaagcg cggttactat ggccagtctg cccgataccg ggacacagaa ttatagatgt 3661 ctgagcacgg actctcccag gcctgcctgc atggcatcag actagccact cctgccaggc 3721 cgccgggatg gttcttctcc agttagaatg caccatggag acgtggtggg actccagctc 3781 gtgtgtcctc atggagaacc caggggacag ctggtgcaaa ttcagaactg agggctctgt 3841 ttgtgggact gggttagagg agtctgtggc tttttgttca gaattaagca gaacactgca 3901 gtcagatcct gttacttgct tcagtggacc gaaatctgta ttctgtttgc gtacttgtaa 3961 tatgtatatt aagaagcaat aactattttt cctcattaat agctgccttc aaggactgtt 4021 tcagtgtgag tcagaatgtg aaaaaggaat aaaaaatact gttgggctca aactaaattc 4081 aaagaagtac tttattgcaa ctcttttaag tgccttggat gagaagtgtc ttaaattttc 4141 ttcctttgaa gctttaggca gagccataat ggactaaaac attttgacta agtttttata 4201 ccagcttaat agctgtagtt ttccctgcac tgtgtcatct tttcaaggca tttgtctttg 4261 taatattttc cataaatttg gactgtctat atcataacta tacttgatag tttggctata 4321 agtgctcaat agcttgaagc ccaagaagtt ggtatcgaaa tttgttgttt gtttaaaccc 4381 aagtgctgca caaaagcaga tacttgagga aaacactatt tccaaaagca catgtattga 4441 caacagtttt ataatttaat aaaaaggaat acattgcaat ccgt // LOCUS HUMXE169A 5911 bp mRNA PRI 14-JAN-1995 DEFINITION Human XE169 mRNA, complete cds. ACCESSION L25270 NID g457136 KEYWORDS . SOURCE Homo sapiens male and female cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5911) AUTHORS Wu,J., Ellison,J., Salido,E., Yen,P., Mohandas,T. and Shapiro,L.J. TITLE Isolation and characterization of XE169, a novel human gene that escapes X-inactivation JOURNAL Hum. Mol. Genet. 3 (1), 153-160 (1994) MEDLINE 94214434 FEATURES Location/Qualifiers source 1..5911 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="IMR91 and CF150-6" /sex="male and female" /map="X short arm" gene 532..5214 /gene="XE169" CDS 532..5214 /gene="XE169" /note="escapes X-chromosome inactivation" /codon_start=1 /db_xref="PID:g457137" /translation="MEPGSDDFLPPPECPVFEPSWAEFRDPLGYIAKIRPIAEKSGIC KIRPPADWQPPFAVEVDNFRFTPRIQRLNELEAQTRVKLNYLDQIAKFWEIQGSSLKI PNVERRILDLYSLSKIVVEEGGYEAICKDRRWARVAQRLNYPPGKNIGSLLRSHYERI VYPYEMYQSGANLVQCNTRPFDNEEKDKEYKPHSIPLRQSVQPSKFNSYGRRAKRLQP DPEPTEEDIEKNPELKKLQIYGAGPKMMGLGLMAKDKTLRKKDKEGPECPPTVVVKEE LGGDVKVESTSPKTFLESKEELSHSPEPCTKMTMRLRRNHSNAQFIESYVCRMCSRGD EDDKLLLCDGCDDNYHIFCLLPPLPEIPKGVWRCPKCVMAECKRPPEAFGFEQATREY TLQSFGEMADSFKADYFNMPVHMVPTELVEKEFWRLVNSIEEDVTVEYGADIHSKEFG SGFPVSDSKRHLTPEEEEYATSGWNLNVMPVLEQSVLCHINADISGMKVPWLYVGMVF SAFCWHIEDHWSYSINYLHWGEPKTWYGVPSLAAEHLEEVMKKLTPELFDSQPDLLHQ LVTLMNPNTLMSHGVPVVRTNQCAGEFVITFPRAYHSGFNQGYNFAEAVNFCTADWLP AGRQCIEHYRRLRRYCVFSHEELICKMAACPEKLDLNLAAAVHKEMFIMVQEERRLRK ALLEKGITEAEREAFELLPDDERQCIKCKTTCFLSALACYDCPDGLVCLSHINDLCKC SSSRQYLRYRYTLDELPAMLHKLKVRAESFDTWANKVRVALEVEDGRKRSLEELRALE SEARERRFPNSELLQQLKNCLSEAEACVSRALGLVSGQEAGPHRVAGLQMTLTELRAF LDQMNNLPCAMHQIGDVKGVLEQVEAYQAEAREALASLPSSPGLLQSLLERGRQLGVE VPEAQQLQRQVEQARWLDEVKRTLAPSARRGTLAVMRGLLVAGASVAPSPAVDKAQAE LQELLTIAERWEEKAHLCLEARQKHPPATLEAIIREAENIPVHLPNIQALKEALAKAR AWIADVDEIQNGDHYPCLDDLEGLVAVGRDLPVGLEELRQLELQVLTAHSWREKASKT FLKKNSCYTLLEVLCPCADAGSDSTKRSRWMEKELGLYKSDTELLGLSAQDLRDPGSV IVAFKEGEQKEKEGILQLRRTNSAKPSPLASSSTASSTTSICVCGQVLAGAGRLQCDL CQDWFHGRCVSVPRLLSSPRPNPTSSPLLAWWEWDTKFLCPLCMRSRRPRLETILALL VALQRLPVRLPEGEALQCLTERAISWQGRARQALASEDVTALLGRLAELRQRLQAEPR PEEPPNYPAAPASDPLREGSGKDMPKVQGLLENGDSVTSPEKVAPEEGSGKRDLELLS SLLPQLTGPVLELPEATRAPLEELMMEGDLLEVTLDENHSIWQLLQAGQPPDLERIRT LLELEKAERHGSRARGRALERRRRRKVDRGGEGDDPAREELEPKRVRSSGPEAEEVQE EEELEEETGGEGPPAPIPTTGSPSTQENQNGLEPAEGTTSGPSAPFSTLTPRLHLPCP QQPPQQQL" polyA_site 5911 BASE COUNT 1267 a 1670 c 1780 g 1194 t ORIGIN 1 cttgttcctc cgccgttgca atgaactatt ttctctcagt ccggggtggt acttttgagt 61 aaccccttcc aaatagaaac ccataccagc ctatttagct cggtctccac tatatgaagg 121 tttcctaggg cttggtgtga cgcaacgtat acgaggctcg gaaggacacc ccgcggaagg 181 atccggtttg ttgtggtgtg gggaggggag acgctgacaa accaagatgg cggcggcggc 241 gctgaaggcc gcggcgtttg ggaggtaact gtggtggcga aggctgcggt agtggggagg 301 caaccacaca gtttggaaga aacggagcgg gacgaagagg cggtagcagt agagtcagcc 361 tgagactctc agagcacgac ggccacacgc ccccttaggc cctcggcggg cggcggctgc 421 ccgcttaggc ctagcctccg agcattgcct cggcttcaaa cagcggcggc gccatgagtc 481 cttaagggcg gtccaagcct cccgatccct ggcccagacc tcgggcccac catggagccg 541 gggtccgacg atttcctacc gccaccggag tgcccggtgt tcgagcctag ctgggccgag 601 ttccgagacc ctcttggcta catcgcgaaa atcaggccca tcgcagagaa atcgggcatt 661 tgcaagatcc gcccacccgc ggactggcag ccaccctttg ctgtggaagt ggacaacttc 721 aggtttaccc cccgaatcca gaggctgaat gagctagagg cccagacgag agtgaaactg 781 aactacttgg accagattgc caaattctgg gaaatccagg gctcctcctt aaagattccc 841 aatgtagaac ggcggatctt ggacctctac agtctcagca aaattgtggt ggaggaaggt 901 ggttatgaag ctatctgcaa ggaccgtcgg tgggctcggg tagcccagcg cctcaactat 961 ccaccaggca aaaatattgg ctccttgcta cgctcccact acgaacgcat tgtttatccc 1021 tatgaaatgt accagtctgg agccaacctt gtgcagtgta acacacgtcc atttgataat 1081 gaggagaagg acaaggaata caaaccccac agcatccccc tacgacagtc tgtgcagcct 1141 tccaagttca acagctatgg ccggcgggcc aagagactgc agcctgatcc ggaacccaca 1201 gaggaagaca ttgagaagaa tccagagctg aaaaagctac agatctatgg ggcaggcccc 1261 aagatgatgg gcctgggcct catggccaaa gacaagactc tgcggaagaa agataaggag 1321 gggcctgagt gtccccccac agtagtggtg aaggaggagt taggtgggga tgtgaaggtg 1381 gagtcaacat cgcctaagac cttcctggag agcaaggagg agctgagtca cagcccagaa 1441 ccctgcacca agatgaccat gaggctacgg aggaaccaca gcaatgccca gtttattgag 1501 tcatatgtct gccggatgtg ttctcgaggg gatgaggatg acaagctcct gctgtgtgat 1561 ggctgtgatg acaactacca catcttctgc ctgctgcctc ctctgcctga gatccccaag 1621 ggtgtctggc ggtgcccaaa gtgtgtcatg gcggagtgta agcggccccc agaagccttt 1681 ggctttgagc aggctacccg ggaatacact ctgcagagct ttggcgagat ggccgactcc 1741 tttaaagctg actacttcaa catgcccgtg catatggtgc ccacagaact tgtggagaag 1801 gagttctgga ggctggtaaa tagcattgag gaagatgtga ctgttgagta tggagctgac 1861 atccattcca aagaatttgg cagcggtttc cctgtcagtg acagtaaacg gcacctaacc 1921 cccgaagagg aggagtatgc taccagtggt tggaacctaa atgtgatgcc ggtgttggaa 1981 cagtctgtac tgtgccacat caatgcagat atctctggca tgaaggtgcc ctggctctac 2041 gtgggcatgg tcttctcagc cttttgctgg catattgagg atcactggag ttactccatt 2101 aactacctcc actggggtga gccgaagacc tggtatgggg tgccctcact tgcagcagaa 2161 catttggaag aagtgatgaa gaagctgaca cctgaactat ttgatagcca gcctgacctc 2221 ctgcaccaac ttgtcaccct catgaatccc aacaccctca tgtcccatgg tgtgccagtt 2281 gtccgcacaa accagtgtgc aggagagttt gtcatcacct tcccccgtgc ttaccacagc 2341 ggcttcaacc aaggctacaa ctttgccgag gctgtcaact tttgcactgc tgactggttg 2401 cctgctgggc gccagtgcat tgagcactac cgccggctcc ggagatactg cgtcttctcc 2461 catgaggagc ttatctgcaa gatggctgcc tgcccagaga agctagacct gaacctggcg 2521 gcagctgtgc ataaggagat gttcatcatg gtgcaagaag agcggcgtct acgaaaggcc 2581 ctgctggaga agggtatcac agaggctgag cgagaggctt tcgagctgct cccagatgat 2641 gagcgccagt gtatcaagtg caagactacg tgtttcctgt cagccctggc ctgctacgac 2701 tgcccagacg gccttgtctg cctttcccac atcaatgatc tctgcaagtg ctccagtagc 2761 cggcagtacc tgcggtatcg gtataccttg gatgagcttc ctgccatgct gcataagctg 2821 aaggttcggg ctgagtcctt tgacacctgg gccaacaaag tgcgagtggc cctggaggtg 2881 gaggatgggc ggaagcgcag ccttgaagaa ctgagggcac tagagtctga agcccgtgag 2941 cggaggtttc ctaatagtga gctgctgcag caactaaaga actgcctgag tgaggcagag 3001 gcttgcgtgt cccgagctct gggactggtc agcggccagg aagctggccc ccacagggtg 3061 gctggtctac agatgaccct gactgagctc cgggcctttc tggaccagat gaacaacctg 3121 ccttgcgcca tgcaccagat tggggatgtc aagggtgttc tggaacaggt ggaggcctac 3181 caggctgagg ctcgtgaggc cctggcctca ctgccctcca gtccagggct actgcagtcc 3241 ctgttggaga gggggcggca gctgggggtg gaggtgcctg aggcccagca gctccagcgg 3301 caggtggaac aggcgcgatg gctggatgag gtgaaacgca cactggcccc ctcagcccga 3361 aggggcacct tggctgtcat gcgaggactg ttggtcgcgg gtgccagtgt agcccctagc 3421 cctgctgtgg ataaagccca ggccgagctg caggaactgc tgaccattgc tgaacgctgg 3481 gaggagaaag cccacctctg cctggaggcc aggcagaagc atccaccagc cacacttgag 3541 gccataatcc gtgaagcgga aaacatccct gttcacctgc ccaacatcca ggctctcaag 3601 gaggctcttg ctaaggcccg ggcctggatt gctgatgttg atgagatcca aaatggtgac 3661 cactacccct gcctggatga cttggagggc ctagtagctg tgggccggga cctacctgtg 3721 gggctggagg agctgagaca gctagagcta caggtactga cagcgcactc ctggagggag 3781 aaggcctcca agaccttcct caagaaaaat tcttgctaca cgctgctgga ggttctctgc 3841 ccatgtgcag atgccggctc agacagcacc aagcgcagcc ggtggatgga gaaggagctg 3901 gggttgtaca aatctgacac agagctgctg gggctgtctg cgcaggacct cagggaccca 3961 ggctctgtga tcgtggcctt caaggagggg gaacagaagg agaaggaggg tatcctgcag 4021 ctgcgtcgca ccaattcggc caagcccagt ccactggcat catcgagcac ggcctcctct 4081 acaacctcta tctgtgtgtg tgggcaggtg ctggctgggg cgggacgtct gcagtgtgac 4141 ctgtgtcagg actggttcca tgggcggtgt gtgtcagtgc ctcgcctcct cagctctccg 4201 aggcccaatc ccacctcatc cccactgctg gcctggtggg aatgggacac caaattcctg 4261 tgtccactgt gtatgcgctc aaggcgcccg cgcctggaga ccatcctggc actgctggta 4321 gccctgcaga gactgcctgt gcggctgccc gagggcgagg ccctgcagtg cctcacagag 4381 agggccatca gctggcaagg ccgcgccagg caggctctgg cctctgaaga tgtgactgct 4441 cttttgggac ggctggctga gctccgccaa cggctacagg ctgaacctag acctgaggag 4501 cctcctaact accctgcagc ccctgcttct gaccccctca gagagggcag tggcaaggat 4561 atgcctaagg tccagggctt actggagaat ggagacagtg tgaccagtcc tgagaaggta 4621 gccccggagg agggctcagg taagagagat ctggagctgc tgtcctcgct gttgccacag 4681 ttgactggcc ctgtgttgga actgcctgag gcaacccggg cccccttgga ggagctcatg 4741 atggaggggg acctgctcga ggtgaccctg gatgagaacc acagcatatg gcagctgctg 4801 caggctggac agcccccaga cctggagagg atccgcacac ttctggagct ggagaaggca 4861 gagcgtcacg ggagtcgggc tcggggccgg gccctggaga ggcggcggcg gcggaaggtg 4921 gatcggggtg gggagggcga tgacccagcc cgagaggagc tagagccaaa gagggtacgg 4981 agctcagggc cagaggctga ggaggtccag gaggaggaag agctggagga ggagactggg 5041 ggtgagggcc cccctgcacc catccccacc actggcagcc ccagcaccca ggagaaccag 5101 aatggcttgg aaccggcgga agggaccact tcaggcccct cggccccttt ctccactctg 5161 actccccggc tgcatctgcc ctgcccacag cagccgcctc agcaacagtt gtgacagtgg 5221 ctgagcctag cacagaccct gacagagacc cccctcggcc tcaaggatcc tctttctgac 5281 catcaagcct gcttcttggg gggtgggcgg gtaggggggg tggccatccc tgctacccgc 5341 ccacccctga gtcccttgac ttttgtattc tgactccaag gtattgttca gacctcagct 5401 cctgggggcc ggcccctgga gtcttccctc cctggtagcc tctaaccagc attcccagac 5461 acctgaggca gatagatgga tgggctggtg ggcagggggg tggctggggc tgggccatca 5521 ccattccaga gacaaggcca gtgtatatgc aaactggggg actctcctcc cttctctccc 5581 cagttctggt cctggccagg ccatgctaca ctaacccctg cccccactct cctcccctct 5641 tttccttcct tcctaccccc ttctccctct cccttcccct gactgttcca cccaggagga 5701 ggaaacttca catagccgtg ctcacagttt tttattttaa aggaatttgg ctggggagct 5761 gaacagggct ccctgtgatc tgaagaaagc ttttggtgct tgtcctcaca accacctcag 5821 tcctccctcc ctgtcctccc ctgtctcctt tcctcctcct gggttcatgt tgtaataaaa 5881 gaagattgtt ggtgtgtaat taatttgttc a // LOCUS HUMXPAC 1377 bp mRNA PRI 24-MAR-1994 DEFINITION Human mRNA for XPAC protein. ACCESSION D14533 NID g286028 KEYWORDS XPAC protein. SOURCE Homo sapiens (library: pcD2Basinger) cDNA to mRNA, clones pcD2h19 and pcD2h29. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1377) AUTHORS Tanaka,K., Miura,N., Satokata,I., Miyamoto,I., Yoshida,M.C., Satoh,Y., Kondo,S., Yasui,A., Okayama,H. and Okada,Y. TITLE Analysis of a human DNA excision repair gene involved in group A xeroderma pigmentosum and containing a zinc-finger domain JOURNAL Nature 348 (6296), 73-76 (1990) MEDLINE 91043046 REFERENCE 2 (sites) AUTHORS Satokata,I., Iwai,K., Matsuda,T., Okada,Y. and Tanaka,K. TITLE Genomic characterization of the human DNA excision repair-controlling gene XPAC JOURNAL Gene 136 (1-2), 345-348 (1993) MEDLINE 94124028 REFERENCE 3 (bases 1 to 1377) AUTHORS Tanaka,K. TITLE Direct Submission JOURNAL Submitted (26-FEB-1993) to the DDBJ/EMBL/GenBank databases. Kiyoji Tanaka, Osaka University, Inst. for Molecular and Cellular Biology; 1-3 Yamadaoka, Suita, Osaka 565, Japan (Tel:06-877-5238, Fax:06-877-9136) COMMENT Submitted (26-Feb-1993) to DDBJ by: Kiyoji Tanaka Insitute for Molecular and Cellular Biology Osaka University 1-3 Yamadaoka, Suita Osaka 565 Japan Phone: 06-877-5238 Fax: 06-877-9136. FEATURES Location/Qualifiers source 1..1377 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="human primar" /clone_lib="pcD2Basinger" /haplotype="y fibroblast" gene 27..848 /gene="XPAC" CDS 27..848 /gene="XPAC" /codon_start=1 /product="XPAC protein" /db_xref="PID:d1003913" /db_xref="PID:g286029" /translation="MAAADGALPEAAALEQPAELPASVRASIERKRQRALMLRQARLA ARPYSATAAAATGGMANVKAAPKIIDTGGGFILEEEEEEEQKIGKVVHQPGPVMEFDY VICEECGKEFMDSYLMNHFDLPTCDNCRDADDKHKLITKTEAKQEYLLKDCDLEKREP PLKFIVKKNPHHSQWGDMKLYLKLQIVKRSLEVWGSQEALEEAKEVRQENREKMKQKK FDKKVKELRRAVRSSVWKRETIVHQHEYGPEENLEDDMYRKTCTMCGHELTYEKM" BASE COUNT 458 a 232 c 358 g 329 t ORIGIN Chromosome 9. 1 agctaggtcc tcggagtggg ccagagatgg cggcggccga cggggctttg ccggaggcgg 61 cggctttaga gcaacccgcg gagctgcctg cctcggtgcg ggcgagtatc gagcggaagc 121 ggcagcgggc actgatgctg cgccaggccc ggctggctgc ccggccctac tcggcgacgg 181 cggctgcggc tactggaggc atggctaatg taaaagcagc cccaaagata attgacacag 241 gaggaggctt cattttagaa gaggaagaag aagaagaaca gaaaattgga aaagttgttc 301 atcaaccagg acctgttatg gaatttgatt atgtaatatg cgaagaatgt gggaaagaat 361 ttatggattc ttatcttatg aaccactttg atttgccaac ttgtgataac tgcagagatg 421 ctgatgataa acacaagctt ataaccaaaa cagaggcaaa acaagaatat cttctgaaag 481 actgtgattt agaaaaaaga gagccacctc ttaaatttat tgtgaagaag aatccacatc 541 attcacaatg gggtgatatg aaactctact taaagttaca gattgtgaag aggtctcttg 601 aagtttgggg tagtcaagaa gcattagaag aagcaaagga agtccgacag gaaaaccgag 661 aaaaaatgaa acagaagaaa tttgataaaa aagtaaaaga attgcggcga gcagtaagaa 721 gcagcgtgtg gaaaagggag acgattgttc atcaacatga gtatggacca gaagaaaacc 781 tagaagatga catgtaccgt aagacttgta ctatgtgtgg ccatgaactg acatatgaaa 841 aaatgtgatt ttttagttca gtgacctgtt ttatagaatt ttatatttaa ataaaggaaa 901 tttagattgg tccttttcaa aattcaaaaa aaaaagcaac atcttcatag atgaatgaaa 961 cccttgtata agtaatactt cagtaataat tatgtatgtt atggcttaaa agcaagtttc 1021 agtgaaggtc acctggcctg gttgtgtgca caatgtcatg tctgtgattg ccttcttaca 1081 acagagatgg gagctgagtg ctagagtagg tgcagaagtg gtaggtcagc tacaaatttg 1141 aggacaagat accaaggcaa accctagatt ggggtagagg gaaaagggtt caacaaaggc 1201 tgaactggat tcttaaccaa gaaacaaata atagcaatgg tggtgcacca ctgtacccca 1261 ggttctagtc atgtgttttt taggacgatt tctgtctcca cgatggtgga aacagtgggg 1321 aactactgct ggaaaaagcc ctaatagcag aaataaacat tgagttgtac gagtctg // LOCUS HUMXQC 2823 bp mRNA PRI 27-MAR-1996 DEFINITION Human mRNA for ORF, Xq terminal portion. ACCESSION D16469 NID g758583 KEYWORDS ORF. SOURCE Homo sapiens fetus female brain cDNA to mRNA, clone_lib:#936206. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2823) AUTHORS Yokoi,H., Hadano,S., Kogi,M., Kang,X., Wakasa,K. and Ikeda,J.E. TITLE Isolation of expressed sequences encoded by the human Xq terminal portion using microclone probes generated by laser microdissection JOURNAL Genomics 20 (3), 404-411 (1994) MEDLINE 94307726 REFERENCE 2 (bases 1 to 2823) AUTHORS Yokoi,H. TITLE Direct Submission JOURNAL Submitted (17-JUN-1993) to the DDBJ/EMBL/GenBank databases. Haruhiko Yokoi, Ikeda Genosphere Project, ERATO, JRDC; Tokai University School of Medicine Boseidai, Isehara, Kanagawa 259-11, Japan (Tel:81-463-91-4056, Fax:81-463-91-4110) COMMENT Submitted (17-Jun-1993) to DDBJ by: Haruhiko Yokoi Ikeda Genosphere Project, ERATO, JRDC Tokai University School of Medicine, Boseidai, Isehara Kanagawa 259-11 Japan Phone: 0463-91-4056 Fax: 0463-91-4110. FEATURES Location/Qualifiers source 1..2823 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="#936206" /dev_stage="fetus" /sex="female" /tissue_type="brain" CDS 1354..2199 /codon_start=1 /product="ORF" /db_xref="PID:d1004454" /db_xref="PID:g495097" /translation="MAPREVLTGNDEVIGQVLSTLKSEDVPYTAALTAVRPSRVARDV AVVAGGLGRQLLQKQPVSPVIHPPVSYNDTAPRILFWAQNFSVAYKDQWEDLTPLTFG VQELNLTGSFWNDSFARLSLTYERLFGTTVTFKFILANRLYPVSARHWFTMERLEVHS NGSVAYFNASQVTGPSIYSFHCEYVSSLSKKGSLLVARTQPSPWQMMLQDFQIQAFNV MGEQFSYASDCASFFSPGIWMGLLTSLFMLFIFTYGLHMILSLKTMDRFDDHKGPTIS LTQIV" polyA_signal 2799..2804 polyA_signal 2804..2809 polyA_site 2823 BASE COUNT 580 a 775 c 808 g 660 t ORIGIN Chromosome X. 1 cctcaggaag ctgacattct tgtgagggtg gccagtaaat actaagtcgg tataggatat 61 gatgtcagtg ttacggcctt tagggaagga agcaggctaa ggggacagag tgactgggat 121 gctattttag ataagggtgg tcaggccagg cctcattgag gaggtgatat atgagcagcg 181 atcttaatgg agggagggag ccgtttgggg aaggacattc ctggaacagc aaatgcgagg 241 gtcctgggcg aggtgtgctc ttggccagct caaggaagag ctggtgtggc tggagcacag 301 tgagtgagag aaaggggtag gagatgtagg agatggtttg caggtggcta ggcgatgaag 361 taggaccttg tagcccatga gggggaaggt cagttgcagt tcgaaatgtg tgagaaagcc 421 ttgggtaacg tggagcaggg agtggtgtga tttctcaatt aaagaaagcc cacaggaccc 481 accagcagtt cctgggcttt ccgactgaga accccggtgg ccaaacagca gcaaggtcct 541 gcccacaagg gaggggaggc tgggcgagtg tgtataccaa acaggttagg gaagctgatt 601 aaaatctcct cagggcctaa ctgggaaggg ccagaggaag ctggggatgg gagtggaggg 661 taggagcaaa aggacaaagg acatctgtag gttgtggaga aaaagggatg gggtcggggc 721 cactgtggtc ctaagagctc aaaagacttc aatgctcgat gcttcctcca gcatgttctg 781 agatcctcac ctctcccctt ccgccaaaag caggtggggg gagggtcccg tccagactgg 841 acatagccga ctctcctttt ctctggctgg gaggcctgcc acaaatgctc ttggctgccc 901 caccccctcc ccgcagcttc cctgttccct ccccagttcc tcttgtctgt agggtgggca 961 aggcggctga ctcctactcc tgagttacca caagtcagct gcctgcagat ctccccaccc 1021 catgactgcc ttccatgtct tctcaccctg ccctgagagt gctggaggga agagctgagc 1081 attgaggatt tcacagcata tggcggtgtg tttggaaaca agcaggacag cgccttttct 1141 aacctagaga atgccctgga cctggccccc tcctcactgg tgcttcctgc cgtcgactgg 1201 tatgcagtca gcactctgac cacttacctg caggagaagc tcggggccag ccccttgcat 1261 gtggacctgg ccaccctgcg ggagctgaag ctcaatgcca gcctccctgc tctgctgctc 1321 attcgcctgc cctacacagc cagctctggt ctgatggcac ccagggaagt cctcacaggc 1381 aacgatgagg tcatcgggca ggtcctgagc acactcaagt ccgaagatgt cccatacaca 1441 gcggccctca cagcggtccg cccttccagg gtggcccgtg atgtagccgt ggtggccgga 1501 gggctaggtc gccagctgct acaaaaacag ccagtatcac ctgtgatcca tcctcctgtg 1561 agttacaatg acaccgctcc ccggatcctg ttctgggccc aaaacttctc tgtggcgtac 1621 aaggaccagt gggaggacct gactcccctc acctttgggg tgcaggaact caacctgact 1681 ggctccttct ggaatgactc ctttgccagg ctctcactga cctatgaacg actctttggt 1741 accacagtga cattcaagtt cattctggcc aaccgcctct acccagtgtc tgcccggcac 1801 tggtttacca tggagcgcct cgaagtccac agcaatggct ccgtcgccta cttcaatgct 1861 tcccaggtca cagggcccag catctactcc ttccactgcg agtatgtcag cagcctgagc 1921 aagaagggta gtctcctcgt ggcccgcacg cagccctctc cctggcagat gatgcttcag 1981 gacttccaga tccaggcttt caacgtaatg ggggagcagt tctcctacgc cagcgactgt 2041 gccagcttct tctcccccgg catctggatg gggctgctca cctccctgtt catgctcttc 2101 atcttcacct atggcctgca catgatcctc agcctcaaga ccatggatcg ctttgatgac 2161 cacaagggcc ccactatttc tttgacccag attgtgtgac cctgtgccag tgggggggtt 2221 gagggtggga cggtgtccgt gttgttgctt tcccaccctg cagcgcactg gactgaagag 2281 cttccctctt cctactgcag catgaactgc aagctcccct cagcccatct tgctccctct 2341 tcagcccgct gaggagcttt cttgggctgc ccccatctct cccaacaagg tgtacatatt 2401 ctgcgtagat gctagaccaa ccagcttccc agggttcgtc gctgtgaggc gtaagggaca 2461 tgaattctag ggtctccttt ctccttattt attcttgtgg ctacatcatc cctggctgtg 2521 gatagtgctt ttgtgtagca aatgctccct ccttaaggtt atagggctcc ctgagtttgg 2581 gagtgtggaa gtactactta actgtctgtc ctgcttggct gtcgttatcg ttttctggtg 2641 atgttgtgct aacaataagc agtacacggg tttatttctg tggcctgaga aggaagggac 2701 ctccacgaca ggtgggctgg gtgcgatcgc cggctgtttg gcatgttccc accgggagtg 2761 ccgggcagga gcatggggtg cttggttgtt tccttcctaa taaaataaac gcgggtcgcc 2821 atg // LOCUS HUMXRCC1 2797 bp mRNA PRI 07-MAR-1995 DEFINITION Human DNA-repair protein (XRCC1) mRNA, complete cds. ACCESSION M36089 NID g340396 KEYWORDS DNA repair protein. SOURCE Human fibroblast (cell line GM637), cDNA to mRNA, clone pXR1-30. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2797) AUTHORS Thompson,L.H., Brookman,K.W., Jones,N.J., Allen,S.A. and Carrano,A.V. TITLE Molecular cloning of the human XRCC1 gene, which corrects defective DNA strand break repair and sister chromatid exchange JOURNAL Mol. Cell. Biol. 10 (12), 6160-6171 (1990) MEDLINE 91061722 COMMENT Draft entry and computer-readable sequence for [Unpublished (1990)] kindly submitted by L.H.Thompson, 06-JUL-1990. Author Address: L.H.Thompson Biomedical Sciences Division Lawrence Livermore National Laboratory P.O. Box 5507 Livermore, CA 94550. FEATURES Location/Qualifiers source 1..2797 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pXR1-30" /cell_line="GM637" /cell_type="fibroblast" /map="19q13.2" misc_feature 91..124 /note="AT tract" misc_feature 158..163 /note="GC box" CAAT_signal 206..211 CAAT_signal complement(262..266) /note="CAAT box (reverse orientation)" mRNA 444..2526 /note="DNA-repair protein mRNA" gene 549..2450 /gene="XRCC1" CDS 549..2450 /gene="XRCC1" /note="DNA-repair protein" /codon_start=1 /db_xref="GDB:G00-120-737" /db_xref="PID:g340397" /translation="MPEIRLRHVVSCSSQDSTHCAENLLKADTYRKWRAAKAGEKTIS VVLQLEKEEQIHSVDIGNDGSAFVEVLVGSSAGGAGEQDYEVLLVTSSFMSPSESRSG SNPNRVRMFGPDKLVRAAAEKRWDRVKIVCSQPYSKDSPFGLSFVRFHSPPDKDEAEA PSQKVTVTKLGQFRVKEEDESANSLRPGALFFSRINKTSPVTASDPAGPSYAAATLQA SSAASSASPVSRAIGSTSKPQESPKGKRKLDLNQEEKKTPSKPPAQLSPSVPKRPKLP APTRTPATAPVPARAQGAVTGKPRGEGTEPRRPRAGPEELGKILQGVVVVLSGFQNPF RSELRDKALELGAKYRPDWTRDSTHLICAFANTPKYSQVLGLGGRIVRKEWVLDCHRM RRRLPSRRYLMAGPGSSSEEDEASHSGGSGDEAPKLPQKQPQTKTKPTQAAGPSSPQK PPTPEETKAASPVLQEDIDIEGVQSEGQDNGAEDSGDTEDELRRVAEQKEHRLPPGQE ENGEDPYAGSTDENTDSEEHQEPPDLPVPELPDFFQGKHFFLYGEFPGDERRKLIRYV TAFNGELEDYMSDRVQFVITAQEWDPSFEEALMDNPSLAFVRPRWIYSCNEKQKLLPH QLYGVVPQA" misc_feature 574..575 /gene="XRCC1" /note="genomic DNA end/cDNA start" misc_signal 1359..1376 /gene="XRCC1" /note="nuclear location signal" repeat_region 2463..2496 /note="AC repeat" polyA_signal 2506..2511 misc_feature 2526..2527 /note="cDNA end/genomic DNA start" BASE COUNT 651 a 827 c 801 g 515 t 3 others ORIGIN 1 tcccttggcc ccaggagaca ggggttgcag aaagccgaga tcgtgccact gcactccatc 61 ctgggtgaga gagcaagacc ctgtctcaac aaaaaatttt taaaaaataa aataaataat 121 aatacagcaa aaagatttgc tttctcggct tcagtgtggg cggtaactcc atcgtgcaat 181 gagaaaggcg aatttcttcc agacaccaat cccggaggtc gcttctgttg ctaggctccc 241 agaaagcagg gttcggacgt cattgggagg cgaggctaga gcggggttgt gtgtggcgga 301 gggaggcggg gctggaggaa acgctcgttg ctaaggaacg cagcgctctt cccgctctgg 361 agaggcgcga ctgggcttgc gcagtgtcga cgccggcgcc ggcgcgccgg ggtttgaaag 421 gcccgagcct cgcgcgcttg cgcactttag ccagcgcagg gcgcaccccg ctccctccca 481 ctctccctgc ccctcggacc ccatactcta cctcatcctt ctggccaggc gaagcccacg 541 acgttgacat gccggagatc cgcctccgcc atgtcgtgtc ctgcagcagc caggactcga 601 ctcactgtgc agaaaatctt ctcaaggcag acacttaccg aaaatggcgg gcagccaagg 661 caggcgagaa gaccatctct gtggtcctac agttggagaa ggaggagcag atacacagtg 721 tggacattgg gaatgatggc tcagctttcg tggaggtgct ggtgggcagt tcagctggag 781 gcgctgggga gcaagactat gaggtccttc tggtcacctc atctttcatg tccccttccg 841 agagccgcag tggctcaaac cccaaccgcg ttcgcatgtt tgggcctgac aagctggtcc 901 gggcagccgc cgagaagcgc tgggaccggg tcaaaattgt ttgcagccag ccctacagca 961 aggactcccc ctttggcttg agttttgtac ggtttcatag ccccccagac aaagatgagg 1021 cagaggcccc gtcccagaag gtgacagtga ccaagcttgg ccagttccgt gtgaaggagg 1081 aggatgagag cgccaactct ctgaggccgg gggctctctt cttcagccgg atcaacaaga 1141 catccccagt cacagccagc gacccggcag gacctagcta tgcagctgct accctccagg 1201 cttctagtgc tgcctcctca gcctctccag tctccagggc cataggcagc acctccaagc 1261 cccaggagtc tcccaaaggg aagaggaagt tggatttgaa ccaagaagaa aagaagaccc 1321 ccagcaaacc accagcccag ctgtcgccat ctgttcccaa gagacctaaa ttgccagctc 1381 caactcgtac cccagccaca gccccagtcc ctgcccgagc acagggggca gtgacaggca 1441 aaccccgagg agaaggcacc gagcccagac gaccccgagc tggcccagag gagctgggga 1501 agatccttca gggtgtggta gtggtgctga gtggcttcca gaaccccttc cgctccgagc 1561 tgcgagataa ggccctagag cttggggcca agtatcggcc agactggacc cgggacagca 1621 cgcacctcat ctgtgccttt gccaacaccc ccaagtacag ccaggtccta ggcctgggag 1681 gccgcatcgt gcgtaaggag tgggtgctgg actgtcaccg catgcgtcgg cggctgccct 1741 cccggaggta cctcatggca gggccaggtt ccagcagtga ggaggatgag gcctctcaca 1801 gcggtggcag cggagatgaa gcccccaagc ttcctcagaa gcaaccccag accaaaacca 1861 agcccactca ggcagctgga cccagctcac cccagaagcc cccaacccct gaagagacca 1921 aagcagcctc accagtgctc caggaagata tagacattga gggggtacag tcagaaggac 1981 aggacaatgg ggcggaagat tctggggaca cagaggatga gctgaggagg gtggcagagc 2041 agaaggaaca cagactgccc cctggccagg aggagaatgg ggaagacccg tatgcaggct 2101 ccacggatga gaacacggac agtgaggaac accaggagcc tcctgatctg ccagtccctg 2161 agctcccaga tttcttccag ggcaagcact tctttcttta cggggagttc cctggggacg 2221 agcggcggaa actcatccga tacgtcacag ccttcaatgg ggagctcgag gactatatga 2281 gtgaccgggt tcagtttgtg atcacagcac aggaatggga tcccagcttt gaggaggccc 2341 tgatggacaa cccctccctg gcattcgttc gtccccgatg gatctacagt tgcaatgaga 2401 agcagaagtt acttcctcac cagctctatg gggtggtgcc gcaagcctga agtatgtgct 2461 atacacacac acacacacac acacacacac acacacgatg catttaataa agatgagttg 2521 gttctcatcc aagagtctcc caaaactcta agaggctccc tgggacctgg ggaagaatgc 2581 tgggcacctc cgtcagagat ctggtacaca aggaactctt tgtctcttct gcttggcccc 2641 ttatccctgt gttggcaaga ggcagggaac tgggaatctg accctcagca ctgcccctca 2701 actttttctg gccctctgag ccacacctgt atcttggctg tccctttgtg gctggassst 2761 gggtacccat gaggcttgtc tctctcctga agcctca // LOCUS HUMYL1PA 1324 bp mRNA PRI 28-JAN-1997 DEFINITION Human YL-1 mRNA for YL-1 protein (nuclear protein with DNA-binding ability), complete cds. ACCESSION D43642 NID g806519 KEYWORDS YL-1 protein; YL-1; DNA-binding; nuclear protein. SOURCE Homo sapiens fibroblast cell_line:MRC-5 cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1324) AUTHORS Horikawa,I., Tanaka,H., Yuasa,Y., Suzuki,M. and Oshimura,M. TITLE Molecular cloning of a novel human cDNA on chromosome 1q21 and its mouse homolog encoding a nuclear protein with DNA-binding ability JOURNAL Cell. Mol. Biol. Res. 208, 999-1007 (1995) REFERENCE 2 (bases 1 to 1324) AUTHORS Horikawa,I. TITLE Direct Submission JOURNAL Submitted (29-NOV-1994) to the DDBJ/EMBL/GenBank databases. Izumi Horikawa, Tottori University School of Life Sciences, Department of Molecular & Cell Genetics; Nishimachi 86, Yonago, Tottori 683, Japan (Tel:0859-34-8261, Fax:0859-34-8134) FEATURES Location/Qualifiers source 1..1324 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="MRC-5" /cell_type="fibroblast" /chromosome="1" /map="1q21" 5'UTR <1..43 /gene="YL-1" mRNA <1..1324 /gene="YL-1" gene 1..1324 /gene="YL-1" CDS 44..1138 /gene="YL-1" /note="Nuclear protein with DNA-binding ability" /codon_start=1 /product="YL-1 protein" /db_xref="PID:d1008342" /db_xref="PID:g806520" /translation="MSLAGGRAPRKTAGNRLSGLLEAEEEDEFYQTTYGGFTEESGDD EYQGDQSDTEDEVDSDFDIDEGDEPSSDGEAEEPRRKRRVVTKAYKEPLKSLRPRKVN TPAGSSQKAREEKALLPLELQDDGSDSRKSMRQSTAEHTRQTFLRVQERQGQSRRRKG PHCERPLTQEELLREAKITEELNLRSLETYERLEADKKKQVHKKRKCPGPIITYHSVT VPLVGEPGPKEENVDIEGLDPAPSVSALTPHAGTGPVNPPARCSRTFITFSDDATFEE WFPQGRPPKVPVREVCPVTHRPALYRDPVTDIPYATARAFKIIREAYKKYITAHGLPP TASALGPGPPPPEPLPGSGPRALRQKIVIK" 3'UTR 1139..1324 /gene="YL-1" polyA_signal 1302..1307 /gene="YL-1" polyA_site 1324 /gene="YL-1" BASE COUNT 313 a 357 c 357 g 297 t ORIGIN Chromosome 1q21. 1 ctggtgaggg gctgcaggtg gcggcgcagt ctcggtaggc ggtatgagtt tggctggggg 61 ccgggcaccc cggaagaccg ctgggaaccg gctttctggg cttttggagg cagaggagga 121 agatgagttc taccagacga cttatggggg tttcacagag gaatccggag atgatgagta 181 tcaaggggac cagtcagaca cagaggacga agtggactct gactttgaca ttgatgaagg 241 ggatgaacca tccagtgatg gagaagcaga agagccaaga aggaagcgcc gagtagtcac 301 caaggcctat aaggaacctc tcaagagctt aaggcctcga aaggtcaaca ccccggctgg 361 tagctctcag aaggcgcgag aagagaaggc actactgcca ttagaactac aagatgacgg 421 ctctgacagt cggaagtcta tgcgtcagtc tacagctgag catacacgac aaacgttcct 481 tcgggtacag gagaggcagg gccagtcaag acggcgaaag gggccccact gtgagcggcc 541 actaacccag gaggaactgc tccgggaggc caagatcaca gaagagctta atttacggtc 601 actggagaca tatgagcggc tcgaggctga taaaaagaag caggttcata agaagcggaa 661 gtgccccggg cccataatca cctatcattc agtgacagtg ccacttgttg gggagccagg 721 ccccaaggaa gagaacgttg acatagaagg acttgatcct gctccctcgg tgtctgcatt 781 gactcctcat gctgggactg gacccgtcaa cccccctgct cgctgctcac gtaccttcat 841 cacttttagt gatgatgcaa ctttcgagga atggttcccc caagggcggc ccccaaaagt 901 ccctgttcgt gaggtctgtc cagtgaccca tcgtccagcc ctataccggg accctgttac 961 agacataccc tatgccactg ctcgagcctt caagatcatt cgtgaggctt acaagaagta 1021 cattactgcc catggactgc cgcccactgc ctcagccctg ggccccggcc cgccacctcc 1081 tgagcccctc cctggctctg ggccccgagc cttgcgccag aaaattgtca ttaaatgaag 1141 agatgtctag tcctcagaaa cttctttcct gccctgattg gggctcttgc tgttccgttt 1201 cttctccctg cttctcccct ttgtcatctc tgatctttgc ctaatctgtt tctttttcct 1261 tttcccctag ttcttacagg tttcgttgtg ttttttaatc taataaaata gaaagatccc 1321 tttt // LOCUS HUMZAKI4 3184 bp mRNA PRI 17-JUL-1996 DEFINITION ZAKI-4 mRNA in human skin fibroblast, complete cds. ACCESSION D83407 NID g1435039 KEYWORDS ZAKI-4; thyroid hormone responsive. SOURCE Homo sapiens skin fibroblast cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3184) AUTHORS Miyazaki,T., Kanou,Y., Murata,Y., Ohmori,S., Niwa,T., Maeda,K., Yamamura,H. and Seo,H. TITLE Molecular cloning of a novel thyroid hormone-responsive gene, ZAKI-4, in human skin fibroblasts JOURNAL J. Biol. Chem. 271 (24), 14567-14571 (1996) MEDLINE 96278928 REFERENCE 2 (bases 1 to 3184) AUTHORS Miyazaki,T. JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 3184) AUTHORS Miyazaki,T. TITLE Direct Submission JOURNAL Submitted (06-FEB-1996) to the DDBJ/EMBL/GenBank databases. Takashi Miyazaki, Research Institute of Environmental Medicine, Nagoya Univ., Department of Endocrinology and Metabolism,; Furo-chou, Chikusa-ku, Nagoya, Aichi 464-01, Japan (E-mail:tama@riem.nagoya-u.ac.jp, Tel:052-789-3867, Fax:052-789-3887) FEATURES Location/Qualifiers source 1..3184 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_type="skin fibroblast" gene 205..783 /gene="ZAKI-4" CDS 205..783 /gene="ZAKI-4" /note="a thyroid hormone responsive gene in human skin fibroblasts" /codon_start=1 /db_xref="PID:d1012579" /db_xref="PID:g1435040" /translation="MDCDVSTLVACVVDVEVFTNQEVKEKFGGLFRTYDDCVTFQLFK SFRRVRINFSNPKSAARARIELHETQFRGKKLKLYFAQVQTPETDGDKLHLAPPQPAK QFLISPPSSPPVSWQPINDATPVLNYDLLYAVAKLGPGEKYELHAGTESTPSVVVHVC DSDIEEEEDPKTSPKPKIIQTRRPGLPPSVSN" polyA_signal 1023..1028 polyA_site 1044 polyA_signal 3168..3173 BASE COUNT 921 a 681 c 657 g 925 t ORIGIN 1 ctctgctgtg ctgcctcaaa cgcggagggc tgcgtgcagt gggagcgggc tccaggagcc 61 cgagcctcca gccgtcctca gagcaaggca gcaccgaggc ctggccacag caatatccat 121 ctggaagctc ttcccttcac tcccaactct gaggttgcct aactctttat taaaaattca 181 gaagggggaa tgccagcccc tagcatggac tgtgatgttt ccactctggt tgcctgtgtg 241 gtggatgtcg aggtctttac caatcaggag gttaaggaaa aatttggggg actgtttcgg 301 acttatgatg actgtgtgac gttccagcta tttaagagtt tcagacgtgt ccgtataaac 361 ttcagcaatc ctaaatctgc agcccgagct aggatagagc ttcatgaaac ccaattcaga 421 gggaaaaaat taaagctcta ctttgcacag gttcagactc cagagacaga tggagacaaa 481 ctgcacttgg ctccacccca gcctgccaaa cagtttctca tctcgccccc ttcctcccca 541 cctgttagct ggcagcccat caacgatgcc acgccagtcc tcaactatga cctcctctat 601 gctgtggcca aactaggacc aggagagaag tatgagctcc atgcagggac tgagtccacc 661 ccaagtgtcg tcgtgcacgt gtgcgacagt gacatagagg aagaagagga cccaaagact 721 tccccaaagc caaaaatcat ccaaactcgg cgtcctggcc tgccaccctc cgtgtccaac 781 tgagctgcct gctccttctc gataatagcc gtctcctctt tatcatgctt tttccccctg 841 ttgtttgtca aaaaaaattg cctttaaatt cctgggtgtt tggttgtttg agattccttc 901 cttgttatca agcctctcgg acaaaagggc taggaaaagg tgatatgtct cctgatcata 961 tcatacccat taagtataac ccattattta gaaggttcta gggaaaaaag tagtattttc 1021 ttattaaaca atcagcacag cctatatctt tgttctctca tgttgatcca agccagagac 1081 atcggtaaca aatagcacct gtgttgtttg tgaggtgttt cagtcccagt cctgatgtgt 1141 gtgcgttgtt ctctcctggc cacttaaata ggaccatatg taaacttgac tttgactgca 1201 tgagatatcc ctatctggtc tcactcagtc ctctgcatcc caacattccc aggacatgca 1261 tgatcaccag catttatttt cattatttga ggatatctta taactcacag attgtcagca 1321 tccagccatg tcctatctag attaggaaaa tgatcagaat attccagctc aacaagtctg 1381 ggtatactca ctattgtgag tcaatacacc atagctctgt tgaaattcct ggaggcaaaa 1441 ttgaccttgg ccccaaagat attcctcaat agatttcaaa caccactccc ctgtagaact 1501 ctcccagcct cgttggggag gcttgtccag ggtgatagag actgatttca gacaaaccta 1561 tttattacaa aagtttcatg gtgtctgaat gattgttttc tctctttgta tatttgtaca 1621 aatgtttcag ctgtgctttt aaaaaatctg gatgtttttt atttagtgat tgttcgacaa 1681 ttagctgctt caaaacataa tgtgcattgc ttatgaatgc cttcatatac taatacagat 1741 actctgataa tattacactc taataaggat aatgctgaat tttgaaagga cacaaaacat 1801 ctaatgccaa tatatacatg gttagccaac atctttgcta tcaagaccac ttgttttaaa 1861 taaagatgca agtgtcagtt gtagattatt gggatgaagc taaatcccca gaatgcagca 1921 gcagctgagc atgttaaaat ggggaaggat gatagctaca tgtatgccgg tcctactcac 1981 gcgacacccg tgtgctcaaa aaagttactt gtttttgtta cgtgtgattt tcctatttct 2041 ctagcccaaa gtgcattaca gaagatacac ctatagaacc attaccttct gctatgtgtg 2101 ccagggctca tctactcctg tacattaatg gattacttta gatgcaaatg cagattacaa 2161 tggagtgggg aagtactttc attacccaag cctcagaaaa acacacaaga acaataacac 2221 agcaaacaga ttgagggatt gttgtggttt ttgactaagg tgtatgttag tttcatcaga 2281 aacttaaaac atagactgat cactcagaaa ttaaagtccg ttttactgtg aatatagcaa 2341 tatagtactg gacacagtac tggtgaaact gaggagagca ttgcttgtaa aatcctgagt 2401 ttccataagg aaaatgaaaa ctccttttaa aaataaaatc tgaggagtgt acaataagca 2461 tatgctttga ctttcctttg ctgtggaggt ttttggtttt tcattgatga taaacgacta 2521 cagacttagt agtggagaaa tggtgtcctc tagtggaaga aatagtagct ccgctattca 2581 gatgcagagc actgcagcat ccagcctttc aaagctgact cttctcaatc atctgtgggt 2641 catttgactt gattttttaa gctaccctga atttccagaa tgcaggttct aaagaaatct 2701 agatgagaga aagtatttga aaatgatttt taaatgtttt ttaaaagaca catctgacat 2761 ttttaacaac ttagtaaaag ttgaaatgac cattctgtgt agtcataaaa gaaacacaat 2821 gaagtgtatg gcctctggag ttagtcttag taaaacttat tgctctgtgt caatgttaac 2881 ctgtctcaga tcaagtaatt ccttcactag gttgggtttg gggagggggg aaaagagggg 2941 cttttcctag gagaacgata agaaatggaa agactccttg aagtgttgca agggaacctc 3001 ctagcactgt gaaagtcaga atcgcctcag catttccatg acgcacatta tgcaaatctc 3061 tttagcacta ttttaaggtt gaaaacttta acaatgaagg ggaaggggaa gatttccacc 3121 aactgaatca tttgtgcacg tgtatagctc aaagagctta gacttcaaat atatctggtg 3181 aatg // LOCUS HUMZAPII 1929 bp mRNA PRI 13-JUN-1996 DEFINITION Human mRNA for unknown product, complete cds. ACCESSION D28124 NID g641821 KEYWORDS . SOURCE Homo sapiens cDNA to mRNA, clone_lib:ZAP II. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1929) AUTHORS Enomoto,H., Ozaki,T., Takahashi,E., Nomura,N., Tabata,S., Takahashi,H., Ohnuma,N., Tanabe,M., Iwai,J., Yoshida,H., Matsunaga,T. and Sakiyama,S. TITLE Identification of human DAN gene, mapping to the putative neuroblastoma tumor suppressor locus JOURNAL Oncogene 9 (10), 2785-2791 (1994) MEDLINE 94366724 REFERENCE 2 (bases 1 to 1929) AUTHORS Enomoto,H. TITLE Direct Submission JOURNAL Submitted (27-JAN-1994) to the DDBJ/EMBL/GenBank databases. Hideki Enomoto, Chiba Cancer Center Research Institute, Department of Biochemistry; 666-2, Nitona, Chuou-ku, Chiba, Chiba 260, Japan (Tel:043-264-5431(ex.5202), Fax:043-262-8680) FEATURES Location/Qualifiers source 1..1929 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="ZAP II" CDS 62..604 /codon_start=1 /product="unknown" /db_xref="PID:d1006216" /db_xref="PID:g641822" /translation="MLRVLVGAVLPAMLLAAPPPINKLALFPDKSAWCEAKNITQIVG HSGCEAKSIQNRACLGQCFSYSVPNTFPQSTESLVHCDSCMPAQSMWEIVTLECPGHE EVPRVDKLVEKILHCSCQACGKEPSHEGLSVYVQGEDGPGSQPGTHPHPHPHPHPGGQ TPEPEDPPGAPHTEEEGAED" polyA_signal 1912..1917 BASE COUNT 351 a 589 c 621 g 368 t ORIGIN 1 acccccgcac ccagctccgc agaccggcgg gcgcgcgcgg gctctggagg ccacgggcat 61 gatgcttcgg gtcctggtgg gggctgtcct ccctgccatg ctactggctg ccccaccacc 121 catcaacaag ctggcactgt tcccagataa gagtgcctgg tgcgaagcca agaacatcac 181 ccagatcgtg ggccacagcg gctgtgaggc caagtccatc cagaacaggg cgtgcctagg 241 acagtgcttc agctacagcg tccccaacac cttcccacag tccacagagt ccctggttca 301 ctgtgactcc tgcatgccag cccagtccat gtgggagatt gtgacgctgg agtgcccggg 361 ccacgaggag gtgcccaggg tggacaagct ggtggagaag atcctgcact gtagctgcca 421 ggcctgcggc aaggagccta gtcacgaggg gctgagcgtc tatgtgcagg gcgaggacgg 481 gccgggatcc cagcccggca cccaccctca cccccatccc cacccccatc ctggcgggca 541 gacccctgag cccgaggacc cccctggggc cccccacaca gaggaagagg gggctgagga 601 ctgaggcccc cccaactctt cctcccctct catccccctg tggaatgttg ggtctcactc 661 tctggggaag tcaggggaga agctgaagcc cccctttggc actggatgga cttggcttca 721 gactcggact tgaatgctgc ccggttgcca tggagatctg aaggggcggg gttagagcca 781 agctgcacaa tttaatatat tcaagagtgg ggggaggaag cagaggtctt cagggctctt 841 tttttggggg gggggtggtc tcttcctgtc tggcttctag agatgtgcct gtgggagggg 901 gaggaagttg gctgagccat tgagtgctgg gggaggccat ccaagatggc atgaatcggg 961 ctaaggtccc tgggggtgca gatggtactg ctgaggtccc gggcttagtg tgagcatctt 1021 gccagcctca ggcttgaggg agggctgggc tagaaagacc actggcagaa acaggaggct 1081 ccggccccac aggtttcccc aaggcctctc accccacttc ccatctccag ggaagcgtcg 1141 ccccagtggc actgaagtgg ccctccctca gcggaggggt ttgggagtca ggcctgggca 1201 ggaccctgct gactcgtggc gcgggagctg ggagccaggc tctccgggcc tttctctggc 1261 ttccttggct tgcctggtgg gggaagggga ggaggggaag aaggaaaggg aagagtcttc 1321 caaggccaga gggaggggga caacccccca agaccatccc tgaagacgag catccccctc 1381 ctctccctgt tagaaatgtt agtgccccgc actgtgcccc aagttctagg ccccccagaa 1441 agctgtcaga gccggccgcc ttctcccctc tcccagggat gctctttgta aatatcggat 1501 gggtgtggga gtgaggggtt acctccctcg ccccaaggtt ccagaggccc taggcgggat 1561 gggctcgctg aacctcgagg aactccagga cgaggaggac atgggacttg cgtggacagt 1621 cagggttcac ttgggctctc tctagctccc caattctgcc tgcctcctcc ctcccagctg 1681 cactttaacc ctagaaggtg gggacctggg gggagggaca gggcaggcgg gcccatgaag 1741 aaagcccctc gttgcccagc actgtctgcg tctgctcttc tgtgcccagg gtggctgcca 1801 gcccactgcc tcctgcctgg ggtggcctgg ccctcctggc tgttgcgacg cgggcttctg 1861 gagcttgtca ccattggaca gtctccctga tggaccctca gtcttctcat gaataaattc 1921 ttcggaatt // LOCUS HUMZICP 3138 bp mRNA PRI 16-FEB-1997 DEFINITION Human mRNA for Zic protein, complete cds. ACCESSION D76435 NID g1208428 KEYWORDS Zic protein; Zic. SOURCE Homo sapiens cerebellum cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Yokota,N., Aruga,J., Takai,S., Yamada,K., Hamazaki,M., Iwase,T., Sugimura,H. and Mikoshiba,K. TITLE Predominant expression of human zic in cerebellar granule cell lineage and medulloblastoma JOURNAL Cancer Res. 56 (2), 377-383 (1996) MEDLINE 96134385 REFERENCE 2 (bases 1 to 3138) AUTHORS Yokota,N. JOURNAL Unpublished (1996) REFERENCE 3 (bases 1 to 3138) AUTHORS Yokota,N. TITLE Direct Submission JOURNAL Submitted (16-OCT-1995) to the DDBJ/EMBL/GenBank databases. Naoki Yokota, Institute of Physical and Chemical Research (RIKEN), Molecular Neurobiology Laboratory; 3-1-1 Koyadai, Tsukuba, Ibaraki 305, Japan (Tel:0298-36-9170, Fax:0298-36-9040) FEATURES Location/Qualifiers source 1..3138 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="3" /map="3q24" /tissue_type="cerebellum" gene 781..2124 /gene="Zic" CDS 781..2124 /gene="Zic" /codon_start=1 /product="Zic protein" /db_xref="PID:d1011840" /db_xref="PID:g1208429" /translation="MLLDAGPQYPAIGVTTFGASRHHSAGDVAERDVGLGINPFADGM GAFKLNPSSHELASAGQTAFTSQAPGYAAAAALGHHHHPGHVGSYSSAAFNSTRDFLF RNRGFGDAAAAASAQHSLFAASAGGFGGPHGHTDAAGHLLFPGLHEQAAGHASPNVVN GQMRLGFSGDMYPRPEQYGQVTSPRSEHYAAPQLHGYGPMNVNMAAHHGAGAFFRYMR QPIKQELICKWIEPEQLANPKKSCNKTFSTMHELVTHVTVEHVGGPEQSNHICFWEEC PREGKPFKAKYKLVNHIRVHTGEKPFPCPFPGCGKVFARSENLKIHKRTHTGEKPFKC EFEGCDRRFANSSDRKKHMHVHTSDKPYLCKMCDKSYTHPSSVRKHMKVHESSSQGSQ PSPAASSGYESSTPPTIVSPSTDNPTTSSLSPSSSAVHHTAGHSALSSNFNEWYV" BASE COUNT 701 a 898 c 840 g 699 t ORIGIN Chromosome 3q24. 1 cgggtgccat gcagctttct ctaatttgct ctcagttcct ggctatgaat tgctaaacta 61 tcagtctcgc gctcaccgcc cggctgagga ggtgaaagtt tctccccagg aagataaacc 121 gcaaaagaca tatattgtgc atgatttgcg ccttttcttt ggctttttct ttctttcttc 181 acccccccac ccactttttt tttttttttt ttcaaaaagc agagagggaa aaacggagag 241 tgaaggagcg aggaggcgag cgtgagagaa aggagagaga gagaaaagaa agggcgaggg 301 gctagtggag gaaggaagga ggggcggctg cgcgaggcgg agagagggcg aagcagtcgc 361 ggcactggcg ctcacattcc tctatgctac aaatccagga ggaagttttt ttttaggggg 421 ctgagatgct ccatgccttt aaaagggcag ccttgacgcg cggccctctc ggcagagact 481 gagcggcgag aaagtgcgag ccgggccggc agaatctgcc tggcgggcgc tggagcctgc 541 gttactcgcg gcccgcagcc gtccggctac tttgcgtttg gcccggccag cgccgcgcgg 601 cgcgcgcgcg ccattgcctg caggctagga cttcgcgagg tgggtcgact caccctccct 661 cctcctcttc ttcctcctct tcctcctcct cttgttcctc ctcctcctcc cgattttccc 721 tcctcggctg gcgagggtgg ggggggcggg ggaggccggg gctcgccccg agcagccacg 781 atgctcctgg acgccggccc ccagtaccca gcgatcggcg tgaccacctt tggcgcgtcc 841 cgccaccact ccgcgggcga cgtggccgaa cgagacgtgg gcctgggcat caacccgttc 901 gccgacggca tgggcgcctt caagctcaac cccagttcgc acgagctggc ttcggccggc 961 cagacagcct tcacgtcgca ggcgccaggc tacgcggctg ctgcggccct gggccatcac 1021 catcacccgg gccacgtcgg ctcctattcc agcgcagcct tcaactccac gcgggacttt 1081 ctgttccgca accggggttt tggcgacgcg gcggcggcag ccagcgcaca gcacagcctc 1141 tttgctgcat cggccggggg cttcgggggc ccacacggcc acacggacgc cgcgggccac 1201 ctcctcttcc ccgggcttca cgagcaggct gccggccacg cgtcgcctaa cgtggtcaac 1261 gggcagatga ggctcggctt ctcgggggac atgtacccgc gaccggagca gtacggccag 1321 gtgaccagcc cgcgttcgga gcactatgct gcgccgcagc tgcacggcta cgggcccatg 1381 aacgtgaaca tggccgcgca tcacggcgcc ggcgccttct tccgctacat gcgccaaccc 1441 atcaagcaag agctcatctg caagtggatc gagcccgagc agctggccaa ccccaaaaag 1501 tcgtgcaaca aaactttcag caccatgcac gagctagtta cgcacgtcac cgtggagcac 1561 gtaggtggcc cggagcagag taatcacatc tgcttctggg aggagtgtcc gcgcgagggc 1621 aagcccttca aagccaaata caaactggtt aaccacatcc gcgtgcacac gggcgagaag 1681 ccctttccct gccccttccc tggctgtggc aaggtcttcg cgcgctccga gaatttaaag 1741 atccacaaaa ggacgcacac aggggagaag cccttcaagt gcgagtttga gggctgtgac 1801 cggcgcttcg ctaacagcag cgaccgcaag aagcacatgc acgtgcacac gagcgacaag 1861 ccctatcttt gcaagatgtg cgacaagtcc tacacgcatc ccagttccgt gcgcaaacac 1921 atgaaggtcc acgaatcctc ctcgcagggc tcgcagcctt cgccggccgc cagctctggc 1981 tacgaatcct ccacgcctcc caccatcgtg tctccctcca cagacaaccc gaccacaagc 2041 tccttatcgc cctcctcctc cgcagtccac cacacagccg gccacagtgc gctctcttcc 2101 aattttaacg aatggtacgt ttaaaatcag aaacaaaaca tcgaacaaaa ccctatttaa 2161 gagacttgat cacacacgta tacacaacat tactgaaaga accctgcgaa tcaaaacaac 2221 ccccacacag accccgcaat cctcttttaa aaaatctgcc aatagaccca ggacgagtaa 2281 gagaggaagc atcaaccttt taaaaatttc ctttcgcttt cattattttt ctttttttgg 2341 caaaggcttg gtacccaagg tgcggtaggg ggtcgagggg gaggaggcca cctgaccaaa 2401 tgccgccaac cccgagggcc agtttcttgt cgaattggta cgggctctct ggggcttcgg 2461 cttctttttt tctttgtttt cttgtaaata cagaattatt agcttaaaac tgtactgttg 2521 aattctgtaa atagttatat ctcggttgga gcgggtgggt gggattgtgg cgttgtggtc 2581 tttgcattgg gggagggggg agggaccgga tgggcggggg gagggggagg gggaggggtg 2641 ggcggccgaa agccaactgt ttgtactgaa tggcaagaat gttctagtaa atgtgtacca 2701 aaatgtgaat tactttgtac gattacagtc tccacgtcga cctaacccaa tattattggt 2761 attaatgtgc tttttttgta taaagtgcaa acatttcgtc ccaaagtcta agtactttag 2821 tgcagtaaaa tgttgtttca tgtcctgtca agaattcgta tagtacgagc ctggatctgc 2881 gtgtcaaact gttccatttg tttatgtaaa gtgatattaa aaaagatata aactataact 2941 gtccgttact tttggcaaaa gatacaacca cataatgtat ataattccta gtttccatat 3001 ttatccgcat gtaaagggcc ggtttatcca tgttacagct cttcaatatt tatggctaga 3061 agaactcgta tgtacacttt agtttccaga actgtttggt aacctttcgt accttattaa 3121 agattcttaa atctcaaa // LOCUS HUMZINC 2457 bp mRNA PRI 02-MAY-1996 DEFINITION Human zinc-finger protein (ZNF76) gene, partial cds. ACCESSION M91592 NID g1293897 KEYWORDS zinc-finger protein. SOURCE Homo sapiens mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2457) AUTHORS Ragoussis,J., Senger,G., Mockridge,I., Sanseau,P., Ruddy,S., Dudley,K., Sheer,D. and Trowsdale,J. TITLE A testis-expressed Zn finger gene (ZNF76) in human 6p21.3 centromeric to the MHC is closely linked to the human homolog of the t-complex gene tcp-11 JOURNAL Genomics 14 (3), 673-679 (1992) MEDLINE 93052398 FEATURES Location/Qualifiers source 1..2457 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="6" /map="6p21.3" gene 164..1711 /gene="ZNF76" CDS 164..1711 /gene="ZNF76" /codon_start=1 /product="zinc-finger protein" /db_xref="PID:g1293898" /translation="MESLGLHTVTLSDGTTAYVQQAVKGEKLLEGQVIQLEDGTTAYI HQVTVQKEALSFEDGQPVQLEDGSMAYIHRTPREGYDPSTLEAVQLEDGSTAYIHHPV AVPSESTILAVQTEVGLEDLAAEDDEGFSADAVVALEQYASKVLHDSQIPRNGKGQQV GDRAFRCGYKGCGRLYTTAHHLKVHERAHTGDRPYRCDFPSCGKAFATGYGLKSHVRT HTGEKPYKCPEELCSKAFKTSGDLQKHVRTHTGERPFQCPFEGCGRSFTTSNIRKVHV RTHTGERPYTCPEPHCGRGFTSATNYKNHVRIHTGEKPYVCTVPGCGKRFTEYSSLYN DHVVHTHCKPYTCSTCGKTYRQTSTLAMHKRSAHGELEATEESEQALYEQQQLEAASA AEESPPPKRPRIAYLSEVKEERDDIPAQVAMVTEEDGAPQVALITQDGAQQVTIITSG AVVAEDSSVASLRHQQVALLATANGTHIAVQLEEQQTLEEAINVATAAMQQGAVTLET TVSESGC" BASE COUNT 578 a 701 c 716 g 462 t ORIGIN 1 ggccggcgct ggtggcgggg ccggtgcatg gcggtctgtc ggcggagtcg ccctcggggg 61 ctgtcagatt tgtgacccag aaggaaatct ctgacctcag ctgtggctct tggtgctggc 121 cagaagccaa cttcatgtct gagtgcacga gcagcagttt gccatggaga gcttgggcct 181 gcacacggtg acccttagtg atgggacaac agcctacgtc cagcaagctg tcaaaggaga 241 gaagcttctt gaagggcagg tgatccagct cgaggatggg accaccgcat acattcacca 301 ggtgacggta cagaaagaag ctctctcctt tgaggatggt cagcctgtgc agctggaaga 361 tggcagcatg gcttacatac accgcacacc cagagaaggc tatgacccca gcaccctgga 421 agccgtccaa ctggaagatg gctccactgc ctacattcac caccctgtgg ctgtgccatc 481 ggagagcacc atcctggccg tacagacaga ggtgggcttg gaggacctgg cagcagagga 541 tgatgagggc ttcagtgcag acgcagtggt ggccctggag cagtatgcca gcaaggttct 601 tcatgacagc cagattcccc gtaatggaaa agggcagcaa gttggagaca gagcattccg 661 ctgtggctac aagggctgtg ggcgtctcta caccaccgct catcacttaa aggtgcatga 721 acgagctcat acaggtgacc gtccatacag atgtgacttc cccagctgtg gaaaggcctt 781 tgccacaggc tatggactga agagccacgt tcgtacccac actggtgaga aaccatacaa 841 gtgcccagag gagctgtgca gcaaggcctt caagacctca ggagacctgc agaagcatgt 901 ccgtacccac actggtgaac gcccgttcca gtgccctttt gagggctgtg gccgctcctt 961 caccacatct aacatccgca aggtacatgt gcgcacccac acaggcgaga ggccctacac 1021 ctgcccggag ccccactgtg gccgcggctt caccagcgcc accaactata agaatcacgt 1081 gcgcatccac acaggggaga agccatacgt ttgcacggtg ccaggctgcg ggaaacgctt 1141 caccgagtac tcgagcttgt ataacgacca cgtggtgcac acacactgca agccctacac 1201 ctgcagcacc tgcggcaaga cctaccggca gacctccacc ttggccatgc acaagcgcag 1261 tgcccacggc gagctggagg ccacggagga gagcgagcag gccctctatg agcagcagca 1321 acttgaggcc gcctctgcag ccgaggagag tccgccaccc aaacgacccc ggatagctta 1381 cctttcggag gtgaaggaag agagagatga catcccagcc caggtggcga tggtgactga 1441 agaagatggg gccccccagg tggctctgat cactcaggat ggtgcccagc aggtcacaat 1501 cattacctct ggggctgtgg tggctgagga ctcaagtgta gcatctcttc gtcatcaaca 1561 ggtggcactg ttggccacag ccaacggaac gcacattgca gtgcagctgg aggaacagca 1621 gaccttagag gaggccatca atgtggccac tgcggccatg cagcaagggg ctgtgaccct 1681 ggagacaaca gtgtcggaga gtggctgctg agtccaagag ggctgggtcc cacaccatgc 1741 tggaggaagt gccatctgca tggccactct tgcccccaag ggcccaggct gtggctgaca 1801 catagaaggt ggccacatag gtctctgggg tgagaagaca gcaagaaaac tgcctaactg 1861 aagggaatgg gggccctgct caagaagggg gccaggcagc tggaatcagg ggagtgcatc 1921 atcctcggga gctgacaaca gccaggctac accagggcac ccgcctctca aaatcagctg 1981 ggcgcccact ctcctcctaa agaacaccct tctggccctc agtctgttcc ccttctcagg 2041 tagagattgg ggctgctatg gggactggcc ctgtagggtt gagccacaga cagctcttca 2101 gcccagtagc agtggagcag gccctgtcct gccctcccag caataaccac ctccctggag 2161 gccagctgag atgcctggct acttggcacc agggacttcc tgacaccaca gtcaattaat 2221 tcctcagggg cctgtggctg aagaaaaggt gcccagcccc caccactcct cagctgcccc 2281 ccaacctctc tatggcagag aggcagggag tggccctgta catagactgc tggggattgg 2341 gtttgatttg ttttgttttt tttaattcca ttttgataat ttttttcctg ctctgggtgg 2401 tggtggcaca tggatgaatg agtatttaat aaaagttcca aatttcaaaa aaaaaaa // LOCUS HUMZNF7 2351 bp mRNA PRI 14-JAN-1995 DEFINITION Human zinc-finger protein 7 (ZFP7) mRNA, complete cds. ACCESSION M29580 J04751 NID g340445 KEYWORDS zinc finger protein. SOURCE Human placenta, cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2351) AUTHORS Lania,L., Donti,E., Pannuti,A., Pascucci,A., Pengue,G., Feliciello,I., La Mantia,G., Lanfrancone,L. and Pelicci,P.G. TITLE cDNA isolation, expression analysis, and chromosomal localization of two human zinc finger genes JOURNAL Genomics 6 (2), 333-340 (1990) MEDLINE 90169993 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Lania, 19-OCT-1989, for release after publication. FEATURES Location/Qualifiers source 1..2351 /organism="Homo sapiens" /db_xref="taxon:9606" /map="8q24" mRNA <1..2351 /note="ZFP7 mRNA" gene 239..2299 /gene="ZNF7" CDS 239..2299 /gene="ZNF7" /note="zinc finger protein 7 (ZFP7)" /codon_start=1 /db_xref="GDB:G00-120-509" /db_xref="PID:g340446" /translation="MEVVTFGDVAVHFSREEWQCLDPGQRALYREVMLENHSSVAGLA GFLVFKPELISRLEQGEEPWVLDLQGAEGTEAPRTSKTDSTIRTENEQACEDMDILKS ESYGTVVRISPQDFPQNPGFGDVSDSEVWLDSHLGSPGLKVTGFTFQNNCLNEETVVP KTFTKDAPQGCKELGSSGLDCQPLESQGESAEGMSQRCEECGKGIRATSDIALHWEIN TQKISRCQECQKKLSDCLQGKHTNNCHGEKPYECAECGKVFRLCSQLNQHQRIHTGEK PFKCTECGKAFRLSSKLIQHQRIHTGEKPYRCEECGKAFGQSSSLIHHQRIHTGERPY GCRECGKAFSQQSQLVRHQRTHTGERPYPCKECGKAFSQSSTLAQHQRMHTGEKAQIL KASDSPSLVAHQRIHAVEKPFKCDECGKAFRWISRLSQHQLIHTGEKPYKCNKCTKAF GCSSRLIRHQRTHTGEKPFKCDECGKGFVQGSHLIQHQRIHTGEKPYVCNDCGKAFSQ SSSLIYHQRIHKGEKPYECLQCGKAFSMSTQLTIHQRVHTGERPYKCNECGKAFSQNS TLFQHQIIHAGVKPYECSECGKAFSRSSYLIEHQRIHTRAQWFYEYGNALEGSTFVSR KKVNTIKKLHQCEDCEKIFRWRSHLIIHQRIHTGEKPYKCNDCGKAFNRSSRLTQHQK IHMG" BASE COUNT 658 a 556 c 635 g 502 t ORIGIN Chromosome 8q24. 1 gcgcggcggc ggacctcggg ttgccctcgg tccgagtgat ccctggtcgc ttccttagcc 61 ctcccgcctt cggcattggg gtccccgcgt cccccgggcc tccaggcggg aaagcgcggg 121 ggctttgcgg ggccttgagc gcctggtgtg ggaggtggtc gagcccagcc accctccccc 181 gcggcggcgc gaggtctctc ggccagaaca cgtggatgcc cacccaccac tgagcctcat 241 ggaggtggta acatttggcg atgtggctgt gcacttctct cgggaggagt ggcagtgtct 301 ggaccctggc cagagggccc tctacaggga agtgatgctg gagaaccaca gcagtgtggc 361 tggactagca ggattcctgg ttttcaagcc tgagctgatc tctcggctgg agcagggaga 421 agagccatgg gtcctcgacc tgcagggagc agaggggaca gaggcaccaa ggacctccaa 481 gacagattct acgattagga ctgaaaatga gcaggcctgt gaggacatgg acatcctaaa 541 atcagaatcc tatgggacag tggtcagaat ctccccacag gactttcctc agaatcctgg 601 ctttggagac gtttctgatt ctgaggtctg gttagacagt catctgggca gtcccgggct 661 gaaagtgaca ggctttacct tccaaaataa ctgtttgaat gaggagactg tggttcccaa 721 gaccttcacc aaggacgcac cccagggatg taaggagctg ggaagcagcg gcctggattg 781 tcagcctctt gaaagtcagg gagagagtgc ggaagggatg tcccagagat gcgaggagtg 841 tggcaaaggc atcagagcca cttcagatat cgctctgcat tgggaaatta atacacagaa 901 aattagcaga tgtcaagaat gccaaaaaaa gttatctgac tgcttgcagg ggaaacatac 961 aaataactgc catggagaga agccgtacga atgtgcagag tgtgggaaag tcttcaggct 1021 ctgctcgcag cttaatcagc atcagagaat ccacacggga gagaaaccct ttaaatgcac 1081 tgagtgtgga aaagccttcc gcctgagctc aaaacttatt cagcatcaaa gaatccacac 1141 tggggagaag ccctacagat gtgaggaatg tggaaaagct tttggtcaga gctcaagcct 1201 catccaccat cagagaatcc acacaggaga gaggccctat ggttgtcgtg agtgtgggaa 1261 agccttcagc cagcagtcgc agctggttag acaccagaga actcacactg gggagaggcc 1321 ctacccttgc aaggagtgtg ggaaggcctt cagccagagc tccaccctag cccagcatca 1381 aaggatgcat actggggaga aagctcaaat tctaaaagcc tcagacagtc caagccttgt 1441 tgcacatcag agaattcacg ctgtagagaa accatttaag tgtgatgagt gtgggaaagc 1501 ttttaggtgg atctctcgcc tgagtcagca tcagctgatt cacactggag agaagcctta 1561 taaatgcaac aagtgtacaa aagcctttgg ttgtagttca cggcttattc gccatcagag 1621 aactcacact ggagaaaaac catttaaatg tgatgagtgt ggcaaaggct ttgttcaggg 1681 ctcacacctt attcagcatc agcgaatcca cactggagag aaaccctatg tgtgtaatga 1741 ctgtggaaaa gccttcagtc agagttccag ccttatttac catcagagaa tccataaagg 1801 agagaagccc tacgaatgcc tccaatgcgg aaaagccttc agtatgagca cacagcttac 1861 aatacatcaa agggttcaca ctggagagag gccctataaa tgtaatgaat gtgggaaagc 1921 cttcagtcaa aactcaaccc ttttccaaca ccagataatt catgcagggg tgaagcccta 1981 tgagtgcagt gagtgtggaa aagccttcag ccggagctca tatcttattg aacaccagag 2041 aatacacact agggcccagt ggttttacga atatgggaat gccctggaag ggtccacctt 2101 tgtgagccgt aaaaaggtta atactataaa gaaactgcat cagtgtgaag actgtgagaa 2161 gatatttagg tggcgttcac acctaattat acaccagaga attcacaccg gggagaagcc 2221 ttataaatgc aatgactgtg gcaaagcttt taatcgtagc tcaaggctta cccagcatca 2281 aaaaattcac atgggataga ccacttacat ataaatgtgt atatatgtga ataaacctat 2341 agccttaact t // LOCUS HUMZNFBPAA 1274 bp mRNA PRI 15-SEP-1989 DEFINITION Human T-cell translocation gene 1 (Ttg-1) mRNA, complete cds. ACCESSION M26682 NID g340453 KEYWORDS breakpoint region; zinc-finger protein. SOURCE Human cell line RPMI 8402, clone 11B1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1274) AUTHORS McGuire,E.A., Hockett,R.D., Pollock,K.M., Bartholdi,M.F., O'Brien,S.J and Korsmeyer,S.J. TITLE The t(11;14)(p15;q11) in a T-cell acute lymphoblastic leukemia cell line activates multiple transcripts, including Ttg-1, a gene encoding a potential zinc finger protein JOURNAL Mol. Cell. Biol. 9, 2124-2132 (1989) MEDLINE 89313759 COMMENT Draft entry and computer-readable copy of sequence [1] kindly provided by E.A.McGuire, 08-DEC-1989. FEATURES Location/Qualifiers source 1..1274 /organism="Homo sapiens" /db_xref="taxon:9606" misc_feature 24..51 /note="potential zinc finger; putative" misc_feature 88..116 /note="potential zinc finger; putative" CDS 498..968 /note="T-cell translocation protein" /codon_start=1 /db_xref="PID:g340454" /translation="MMVLDKEDGVPMLSVQPKGKQKGCAGCNRKIKDRYLLKALDKYW HEDCLKCACCDCRLGEVGSTLYTKANLILCRRDYLRLFGTTGNCAACSKLIPAFEMVM RARDNVYHLDCFACQLCNQRFCVGDKFFLKNNMILCQMDYEEGQLNGTFESQVQ" misc_signal 1207..1216 /note="potential short half-life signal; putative" misc_signal 1233..1241 /note="potential short half-life signal; putative" BASE COUNT 282 a 361 c 338 g 293 t ORIGIN Chromosome 11p15. 1 cagcgggaga cggccacgag attcccccat ctctttgaat ataattttag attgagattc 61 agattaaatc cgaggggaaa acactttatg aggctgaaag ctgtgtcgtt gccagagaca 121 gggttatgag ctatcaaatg caattacatt aagacagatt atactgggca aattgagcca 181 tttagaaggt gagaatcaaa gaaacggctc tgatcctctt ttcccccttc tctctccctc 241 tccctctctc tctaaattgc agttcgtagt tccttccaat tcggaggcac aaaagtaggt 301 gagactgctt ttgtatctgc gaagtgcttc actcctgaat gtaattctag ctgagtgcaa 361 tctaggttaa gagccggaca agcgggtaat tagagcccgc tagctgcccg aggaccggcc 421 gccccgccaa agcgcgcccc gagtcggcgc ccttctcccg gccgagccta gctgcggctg 481 gacacggagc gcccgagatg atggtgctgg acaaggagga cggcgtgccg atgctctccg 541 tccagcccaa agggaagcag aagggctgtg cgggctgtaa ccgcaagatc aaggaccgct 601 atctgctgaa ggcattggac aagtactggc acgaagactg cctcaagtgt gcgtgctgtg 661 actgccgcct gggcgaggtg ggctccaccc tctacaccaa ggccaacctc atcctgtgcc 721 gacgcgacta cctgaggctc tttggcacca cagggaactg tgctgcttgc agcaagctga 781 tcccagcctt cgagatggtg atgcgggccc gggacaacgt gtatcacctc gactgcttcg 841 cctgccagct ctgcaaccag agattttgtg tgggagacaa attcttcctg aagaacaaca 901 tgatcttgtg tcagatggac tatgaggaag ggcagctcaa tggcaccttt gaatcccaag 961 ttcagtaacg cccggcgcct ggcctccagg cccgtctgtc catctgcccg cctgcccacc 1021 tgcctggccg gccagccagc cactctacca gtgcaggctg gccagccgct ctcctgccac 1081 attagaactt ctccgtcctc gatgggaggg atggcccttc ctcctccacc accgcccgtc 1141 tgtgtgtgac ccctcctggg gccaggccgg gcctgtacag tctgtcttct gtatataaat 1201 gggaacattt attttatgag aaatgtaatg cgattttatt actggcgtgg attaaactta 1261 tgaatgtttc cggg // LOCUS HUMZO1A 7888 bp mRNA PRI 01-OCT-1993 DEFINITION Human tight junction (zonula occludens) protein ZO-1 mRNA, complete cds. ACCESSION L14837 NID g292937 KEYWORDS discs-large tumor suppressor gene; tight junction protein; zonula occludens. SOURCE Homo sapiens cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 7888) AUTHORS Willott,E., Balda,M.S., Heintzelman,M.B., Jameson,B. and Anderson,J. TITLE Localization and differential expression of two isoforms of the tight junction protein ZO-1 JOURNAL Am. J. Physiol. 262, 1119-1124 (1992) REFERENCE 2 (bases 1 to 7888) AUTHORS Willott,E., Balda,M.S., Fanning,A.S., Jameson,B., Van Itallie,C. and Anderson,J.M. TITLE The tight junction protein ZO-1 is homologous to the Drosophila discs-large tumor suppressor protein of septate junctions JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90 (16), 7834-7838 (1993) MEDLINE 93361541 REFERENCE 3 (sites) AUTHORS Balda,M.S. and Anderson,J. TITLE Two classes of tight junctions are revealed by ZO-1 isoforms JOURNAL Am. J. Physiol. (1993) In press FEATURES Location/Qualifiers source 1..7888 /organism="Homo sapiens" /db_xref="taxon:9606" mat_peptide 1227..6434 /standard_name="ZO-1" /note="putative" /citation=[2] /product="tight junction (zonula occludens) protein ZO-1" CDS 1227..6437 /standard_name="ZO-1" /note="possible translation initiation codons at bp 1191 and bp 1227; alpha domain at bp 3954..4193 (results from alternative splicing of ZO-1 primary transcript, which yields at least two isoforms); putative" /citation=[2] /citation=[1] /citation=[3] /codon_start=1 /product="tight junction (zonula occludens) protein ZO-1" /db_xref="PID:g292938" /translation="MEETAIWEQHTVTLHRAPGFGFGIAISGGRDNPHFQSGETSIVI SDVLKGGPAEGQLQENDRVAMVNGVSMDNVEHAFAVQQLRKSGKNAKITIRRKKKVQI PVSRPDPEPVSDNEEDSYDEEIHDPRSGRSGVVNRRSEKIWPRDRSASRERSLSPRSD RRSVASSQPAKPTKVTLVKSRKNEEYGLRLASHIFVKEISQDSLAARDGNIQEGDVVL KINGTVTENMSLTDAKTLIERSKGKLKMVVQRDERATLLNVPDLSDSIHSANASERDD ISEIQSLASDHSGRSHDRPPRRSRSRSPDQRSEPSDHSRHSPQQPSNGSLRSRDEERI SKPGAVSTPVKHADDHTPKTVEEVTVERNEKQTPSLPEPKPVYAQVGNQMWIYLSVHL MVSYLIQLMKMGFLRPSMKLVKFRKGDSVGLRLAGGNDVGIFVAGVLEDSPAAKEGLE EGDQILRVNNVDFTNIIREEAVLFLLDLPKGEEVTILAQKKKDVYRRIVESDVGDSFY IRTHFEYEKESPYGLSFNKGEVFRAVDTLYNGKLGSWLAIRIGKNHKEVERGIIPNKN RAEQLASVQYTLPKTAGGDRADFWRFRGLRSSKRNLRKSREDLSAQPVQTKFPAYERV VLREAGFLRPVTIFGPIADVAREKLAREEPDIYQIAKSEPRDAGTDQRSSGYIRLHTI KQIIDQDKHALLDVTPNAVDRLNYAQWYPIVVFLNPDSKQGVKTMRMRLCPESRKSAR KLYERSHKLAKNNHHLFTTTINLNSMNDGWYGALKEAVQQQQNQLVWVSEGKADGATS DDLDLHDDRLSYLSAPGSEYSMYSTDSRHTSDYEDTDTEGGAYTDQELDETLNDEVGT PPESAITRSSEPVREDSSGMHHENQTYPPYSPQAQPQPIHRIDSPGFKPASQQKAEAS SPVPYLSPETNPASSTSAVNHNVNLTNVRLEEPTPAPSTSYSPQADSLRTPSTEAAHI MLRDQEPSLSSHVDPTKVYRKDPYPEEMMRQNHVLKQPAVSHPGHRPDKEPNLTYEPQ LPYVEKQASRDLEQPTYRYESSSYTDQFSRNYEHRLRYEDRVPMYEEQWSYYDDKQPY PSRPPFDNQHSQDLDSRQHPEESSERGYFPRFEEPAPLSYDSRPRYEQAPRASALRHE EQPAPGYDTHGRLRPEAQPHPSAGPKPAESKQYFEQYSRSYEQVPPQGFTSRAGHFEP LHGAAAVPPLIPSSQHKPEALPSNTKPLPPPPTQTEEEEDPAMKPQSVLTRVKMFENK RSASLETKKDVNDTGSFKPPEVASKPSGAPIIGPKPTSQNQFSEHDKTLYRIPEPQKP QLKPPEDIVRSNHYDPEEDEEYYRKQLSYFDRRSFENKPPAHIAASHLSEPAKPAHSQ NQSNFSSYSSKGKPPEADGVDRSFGEKRYEPIQATPPPPPLPSQYAQPSQPVTSASLH IHSKGAHGEGNSVSLDFQNSLVSKPDPPPSQNKPATFRPPNREDTAQAAFYPQKSFPD KAPVNGTEQTQKTVTPAYNRFTPKPYTSSARPFERKFESPKFNHNLLPSETAHKPDLS SKTPTSPKTLVKSHSLAQPPEFDSGVETFSIHAEKPKYQINNISTVPKAIPVSPSAVE EDEDEDGHTVVATARGIFNSNGGVLSSIETGVSIIIPQGAIPEGVEQEIYFKVCRDNS ILPPLDKEKGETLLSPLVMCGPHGLKFLKPVELRLPHCDPKTWQNKCLPGDPNYLVGA NCVSVLIDHF" BASE COUNT 2272 a 1838 c 1820 g 1958 t ORIGIN 1 tccgggtatg gatgtcaatc ttttgtctac aatgtgaata catttatcct tcggggacca 61 tcaagacttt caggaaaggc cccgcctgtc tctgcgcggc cactttgctg ggacaaaggt 121 caactgaaga agtgggcagg cccgaggcag gagagatgct gaggagtcca tgtgcagggg 181 agggaaaggg agaggcagtc agggagagga ggaggaggta ccgccagaag gggatcctcc 241 cgctccgaaa accagacacc gggtcttgcc ctgtggtcca ggcaggagtg cagtggtgca 301 acctcagctc actgcagcct tgacctcccc gggctcaagc gatcctccgg ccacagcact 361 tggctgttca gcggctggag gagcagggcc ccaggtcctc cccaccctca cctgctgctc 421 ccaggtcgtg gccgtcttgc tcttccaggt ccttctctag ggatgcaata ttcacattgc 481 taagatgcag gtctaacgca gaacctgtca acagagcccc ccatcatcca cagcccaccc 541 agcgctgcag agctcaggaa gcctagctga ggaggacgac cgtcccacct gggcttagag 601 tgagaccaag ggcagaaggc gtgggagttg ctggggcagc cagggaagga cacccccagc 661 ccgtcctcgc agccccccac aggcagtggg aggcttggct gttcctccgg caaaacgggc 721 atgctcagtg ggccgggccg gcaggtttgc gtggccgctg agttgccggc gccggctgag 781 ccagcggacg ccgcgttcct tggcggccgc cggttcccgg gaagttacgt ggcgaagccg 841 gcttccgagg agacgccggg aggccacggg tgctgctgac gggcgggcga ccgggcgagg 901 ccgacgtggc cgggctgcga aagctgcggg aggccgagtg ggtgaccgcg ctcggaggga 961 ggtgccggtc gggcgcgccc cgtggagaag acccgggcgg ggcgggcgct tcccggactt 1021 ttgtccgagt tgaattccct ccccctgggc cgggcccttc cgtccgcccc cgcccgtgcc 1081 ccgctcgctc tcgggagatg tttatttggg ctgtggcgtg aggagcgggc gggccagcgc 1141 cgcggagttt cgggtccgag gagcctcgcg cggcgctgga gagagacaag atgtccgcca 1201 gagctgcggc cgccaagagc acagcaatgg aggaaacagc tatatgggaa caacatacag 1261 tgacgcttca cagggctcct ggatttggat ttggaattgc aatatctggt ggacgagata 1321 atcctcattt tcagagtggg gaaacgtcaa tagtgatttc agatgtgctg aaaggaggac 1381 cagctgaagg acagctacag gaaaatgacc gagttgcaat ggttaacgga gtttcaatgg 1441 ataatgttga acatgctttt gctgttcagc aactaaggaa aagtgggaaa aatgcaaaaa 1501 ttacaattag aaggaagaag aaagttcaaa taccagtaag tcgtcctgat cctgaaccag 1561 tatctgataa tgaagaagat agttatgatg aggaaataca tgatccaaga agtggccgga 1621 gtggtgtggt taacagaagg agtgagaaga tttggccgag ggatagaagt gcaagtagag 1681 agaggagctt gtccccgcgg tcagacaggc ggtcagtggc ttccagccag cctgctaaac 1741 ctactaaagt cacactggtg aaatcccgga aaaatgaaga atatggtctt cgattggcaa 1801 gccatatatt tgttaaggaa atttcacaag atagtttggc agcaagagat ggcaatattc 1861 aagaaggtga tgttgtattg aagataaatg gtactgtgac agaaaatatg tcattgacag 1921 atgcaaagac attgatagaa aggtctaaag gcaaattaaa aatggtagtt caaagagatg 1981 aacgggctac gctattgaat gtccctgatc tttctgacag catccactct gctaatgcct 2041 ctgagagaga cgacatttca gaaattcagt cactggcatc agatcattct ggtcgatcac 2101 acgataggcc tccccgccgc agccggtcac gatctcctga ccagcggtca gagccttctg 2161 atcattccag gcactcgccg cagcagccaa gcaatggcag tctccggagt agagatgaag 2221 agagaatttc taaacctggg gctgtctcaa ctcctgtaaa gcatgctgat gatcacacac 2281 ctaaaacagt ggaagaagtt acagttgaaa gaaatgagaa acaaacacct tctcttccag 2341 aaccaaagcc tgtgtatgcc caagttggca accagatgtg gatttacctg tcagtccatc 2401 tgatggtgtc ctacctaatt caactcatga agatgggatt tcttcggccc agcatgaaat 2461 tggtaaaatt cagaaaagga gatagtgtgg gtttgcggct ggctggtgga aatgatgttg 2521 gaatatttgt agctggcgtt ctagaagata gccctgcagc caaggaaggc ttagaggaag 2581 gtgatcaaat tctcagggta aacaacgtag attttacaaa tatcataaga gaagaagccg 2641 tccttttcct gcttgacctc cctaaaggag aagaagtgac catattggct cagaagaaga 2701 aggatgttta tcgtcgcatt gtagaatcag atgtaggaga ttctttctat attagaaccc 2761 attttgaata tgaaaaggaa tctccctatg gacttagttt taacaaagga gaggtgttcc 2821 gtgctgtgga taccttgtac aatggaaaac tgggctcttg gcttgctatt cgaattggta 2881 aaaatcataa ggaggtagaa cgaggcatca tccctaataa gaacagagct gagcagctag 2941 ccagtgtaca gtatacactt ccaaaaacag caggcggaga ccgtgctgac ttctggagat 3001 tcagaggtct tcgcagctcc aagagaaatc ttcgaaaaag cagagaggat ttgtccgctc 3061 agcctgttca aacaaagttt ccagcttatg aaagagtggt tcttcgagaa gctggatttc 3121 tgaggcctgt aaccattttt ggaccaatag ctgatgttgc cagagaaaag ctggcaagag 3181 aagaaccaga tatttatcaa attgcaaaga gtgaaccacg agacgctgga actgaccaac 3241 gtagctctgg ctatattcgc ctgcatacaa taaagcaaat catagatcaa gacaaacatg 3301 ctttattaga tgtaacacca aatgcagttg atcgtcttaa ctatgcccag tggtatccaa 3361 ttgttgtatt tcttaaccct gattctaagc aaggagtaaa aacaatgaga atgaggttat 3421 gtccagaatc tcggaaaagt gccaggaagt tatacgagcg atctcataaa cttgctaaaa 3481 ataatcacca tctttttaca actacaatta acttaaattc aatgaatgat ggttggtatg 3541 gtgcgctgaa agaagcagtt caacaacagc aaaaccagct ggtatgggtt tccgagggaa 3601 aggcggatgg tgctacaagt gatgaccttg atttgcatga tgatcgtctg tcctacctgt 3661 cagctccagg tagtgaatac tcaatgtata gcacggacag tagacacact tctgactatg 3721 aagacacaga cacagaaggc ggggcctaca ctgatcaaga actagatgaa actcttaatg 3781 atgaggttgg gactccaccg gagtctgcca ttacacggtc ctctgagcct gtaagagagg 3841 actcctctgg aatgcatcat gaaaaccaaa catatcctcc ttactcacca caagcgcagc 3901 cacaaccaat tcatagaata gactcccctg gatttaagcc agcctctcaa cagaaagcag 3961 aagcttcatc tccagtccct tacctttcgc ctgaaacaaa cccagcatca tcaacctctg 4021 ctgttaatca taatgtaaat ttaactaatg tcagactgga ggagcccacc ccagctcctt 4081 ccacctctta ctcaccacaa gctgattctt taagaacacc aagtactgag gcagctcaca 4141 taatgctaag agatcaagaa ccatcattgt cgtcgcatgt agatccaaca aaggtgtata 4201 gaaaggatcc atatcccgag gaaatgatga ggcagaacca tgttttgaaa cagccagccg 4261 ttagtcaccc agggcacagg ccagacaaag agcctaatct gacctatgaa ccccaactcc 4321 catacgtaga gaaacaagcc agcagagacc tcgagcagcc cacatacaga tacgagtcct 4381 caagctatac ggaccagttt tctcgaaact atgaacatcg tctgcgatac gaagatcgcg 4441 tccccatgta tgaagaacag tggtcatatt atgatgacaa acagccctac ccatctcggc 4501 caccttttga taatcagcac tctcaagacc ttgactccag acagcatccc gaagagtcct 4561 cagaacgagg gtactttcca cgttttgaag agccagcccc tctgtcttac gacagcagac 4621 cacgttacga acaggcacct agagcatccg ccctgcggca cgaagagcag ccagctcctg 4681 ggtatgacac acatggtaga ctcagaccgg aagcccagcc ccacccttca gcagggccca 4741 agcctgcaga gtccaagcag tattttgagc aatattcacg cagttacgag caagtaccac 4801 cccaaggatt tacctctaga gcaggtcatt ttgagcctct ccatggtgct gcagctgtcc 4861 ctccgctgat accttcatct cagcataagc cagaagctct gccttcaaac accaaaccac 4921 tgcctccacc cccaactcaa accgaagaag aggaagatcc agcaatgaag ccacagtctg 4981 tactcaccag agttaagatg tttgaaaaca aaagatctgc atccttagag accaagaagg 5041 atgtaaatga cactggcagt tttaagcctc cagaagtagc atctaaacct tcaggtgctc 5101 ccatcattgg tcccaaaccc acttctcaga atcaattcag tgaacatgac aaaactctgt 5161 acaggatccc agaacctcaa aaacctcaac tgaagccacc tgaagatatt gttcggtcca 5221 atcattatga ccctgaagaa gatgaagaat attatcgaaa acagctgtca tactttgacc 5281 gaagaagttt tgagaataag cctcctgcac acattgctgc cagccatctc tccgagcctg 5341 caaagccagc tcattctcag aatcaatcaa atttttctag ttattcttca aagggaaagc 5401 ctcctgaagc tgatggtgtg gatagatcat ttggcgagaa acgctatgaa cccatccagg 5461 ccactccccc tcctcctcca ttgccctcgc agtatgccca gccatctcag cctgtcacca 5521 gcgcgtctct ccacatacat tctaagggag cacatggtga aggtaattca gtgtcattgg 5581 attttcagaa ttccttagtg tccaaaccag acccacctcc atctcagaat aagccagcaa 5641 ctttcagacc accaaaccga gaagatactg ctcaggcagc tttctatccc cagaaaagtt 5701 ttccagataa agccccagtt aatggaactg aacagactca gaaaacagtc actccagcat 5761 acaatcgatt cacaccaaaa ccatatacaa gttctgcccg accatttgaa cgcaagtttg 5821 aaagtcctaa attcaatcac aatcttctgc caagtgaaac tgcacataaa cctgacttgt 5881 cttcaaaaac tcccacttct ccaaaaactc ttgtgaaatc gcacagtttg gcacagcctc 5941 ctgagtttga cagtggagtt gaaactttct ctatccatgc agagaagcct aaatatcaaa 6001 taaataatat cagcacagtg cctaaagcta ttcctgtgag tccttcagct gtggaagagg 6061 atgaagatga agatggtcat actgtggtgg ccacagcccg aggcatattt aacagcaatg 6121 ggggcgtgct gagttccata gaaactggtg ttagtataat tatccctcaa ggagccattc 6181 ccgaaggagt tgagcaggaa atctatttca aggtctgccg ggacaacagc atccttccac 6241 ctttagataa agagaaaggt gaaacactgc tgagtccttt ggtgatgtgt ggtccccatg 6301 gcctcaagtt cctgaagcct gtggagctgc gcttaccaca ctgtgatcct aaaacctggc 6361 aaaacaagtg tcttcccgga gatccaaatt atctcgttgg agcaaactgt gtttctgtcc 6421 ttattgacca cttttaactc ttgaaatata ggaacttaaa taatgtgaaa ctggattaaa 6481 cttaatctaa atggaaccac tctatcaagt attatacctt ttttagagtt gatactacag 6541 tttgttagta tgaggcattt gtttgaactg ataaagatga gtgagcatgc ccctgaacca 6601 tggtcggaaa acatgctaca cactgcatgt ttgtgattga cgggactgtt ggtattggct 6661 agaggttcaa agatattttg ctttgtgatt tttgtaattt ttttatcgtc actgcttaac 6721 ttcacatatt gatttccgtt aaaataccag ccagtaaatg ggggtgcatt tgaggtctgt 6781 tctttccaaa gtacactgtt tcaaacttta ctatggccct ggcctagcat acgtacacat 6841 tttattttat tatgcatgaa gtaatatgca cacatttttt aaatgcacct ggaatatata 6901 accagtgttg tggatttaac agaaatgtac agcaaggaga tttacaactg ggggagggtg 6961 aagtgaagac aatgacttac tgtacatgaa aacacatttt tcttagggaa ggatacaaaa 7021 gcatgtgaga ctggttccat ggcctcttca gatctctaac ttcaccatat taccacagac 7081 atactaacca gcagaaatgc cttaccctca tgttcttaat tcttagctca ttctccttgt 7141 gttactaagt ttttatggct tttgtgcatt atctagatac tgtatcatga caaagactga 7201 gtacgttgtg catttggtgg tttcagaaat gtgttatcac ccagaagaaa atagtggtgt 7261 gatttgggga tatttttttc ttttcttttc ttttcttttt tttttttttt tgacaagggg 7321 cagtggtggt tttctgttct ttctggctat gcatttgaaa attttgatgt tttaaggatg 7381 cttgtacata atgcgtgcat accacttttg ttcttggttt gtaaattaac ttttataaac 7441 tttacctttt ttatacataa acaagaccac gtttctaaag gctacctttg tattctctcc 7501 tgtacctctt gagccttgaa ctttgacctc tgcagcaata aagcagcgtt tctatgacac 7561 atgcaaggtc atttttttta agaaaaagga tgcacagagt tgttacattt ttaagtgctg 7621 catttaaaag atacagttac tcagaattct ctagtttgat taaattcttg caaagtatcc 7681 ctactgtaat ttgtgataca atgctgtgcc ctaaagtgta tttttttact aatagacaat 7741 ttattatgac acatcagcac gatttctgtt taaataatac accactacat tctgttaatc 7801 attaggtgtg actgaatttc ttttgccgtt attaaaaatc tcaaatttct aaatctccaa 7861 aataaaactt tttaaaataa aaaaaaat // LOCUS HUMZPBPB 1177 bp mRNA PRI 30-MAY-1996 DEFINITION Human mRNA for zona-pellucida-binding protein (sp38), complete cds. ACCESSION D17570 NID g498161 KEYWORDS sp38; spermatogenesis; zona-pellucida-binding protein. SOURCE Homo sapiens male testis cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1177) AUTHORS Baba,T., Mori,E., Mori,T., Kashiwabara,S. and Tanaka,K. TITLE Expression of the gene encoding zona-pellucida-binding protein, sp38, during spermatogenesis JOURNAL Unpublished (1993) REFERENCE 2 (bases 1 to 1177) AUTHORS Baba,T. TITLE Direct Submission JOURNAL Submitted (10-SEP-1993) to the DDBJ/EMBL/GenBank databases. Tadashi Baba, University of Tsukuba, Institute of Applied Biochemistry; 1-1-1 Tennohdai, Tsukuba, Ibaraki 305, Japan (Tel:0298-53-6632, Fax:0298-53-6632) COMMENT Submitted (10-Sep-1993) to DDBJ by: Tadashi Baba Institute of Applied Biochemistry University of Tsukuba 1-1-1 Tennohdai, Tsukuba Ibaraki 305 Japan Phone: 0298-53-6632 Fax: 0298-53-6632. FEATURES Location/Qualifiers source 1..1177 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="male" /tissue_type="testis" 5'UTR 1..36 CDS 37..1092 /codon_start=1 /product="zona-pellucida-binding protein (sp38)" /db_xref="PID:d1005021" /db_xref="PID:g498162" /translation="MEAFALGPARRGRRRTRAAGSLLSRAAILLFISAFLVRVPSSVG HLVRLPRAFRLTKDSVKIVGSTSFPVKAYVMLHQKSPHVLCVTQQLRNAELIDPSFQW YGPKGKVVSVENRTAQITSTGSLVFQNFEESMSGIYTCFLEYKPTVEEIVKRLQLKYA IYAYREPHYYYQFTARYHAVPCNSIYNISFEKKLLQILSKLLLDLSCEISLLKSECHR VKMQRAGLQNELFFAFSVSSLDTEKGPKRCTDHNCEPYKRLFKAKNLIERFFNQQVEI LGRRAEQLPQIYYIEGTLQMVWINRCFPGYGMNVQQHPKCPECCVICSPGSYNPRDGI HCLQCNSSLVYGAKTCL" mat_peptide 160..1089 /product="Zona-pellucida-binding protein (sp38)" 3'UTR 1093..1177 BASE COUNT 344 a 252 c 243 g 338 t ORIGIN 1 ccgcgcggac ggtgggcagg cgacggcggc gtgtggatgg aggccttcgc ccttggccca 61 gcgcggcggg gcaggcggcg gacccgggcc gccggctccc tgctctctcg ggccgccatc 121 ctcctcttta tctccgcctt cctggtgcgg gtgccctcat cagttggaca cttggttcga 181 ttaccaagag cttttcgctt gaccaaagat tcagtgaaaa tagtgggatc aacaagtttt 241 ccagtgaaag cgtatgtcat gctccatcaa aagagtccac acgtgttatg tgtaacgcaa 301 caactgcgaa atgctgaact gatagaccca tcattccaat ggtatgggcc taaaggaaaa 361 gttgtttcag tagaaaaccg cactgcacaa ataacatcca caggaagcct tgtattccaa 421 aattttgagg agagtatgag tggaatttat acatgtttcc tcgaatataa acctactgtg 481 gaagaaattg ttaaacgtct tcaactaaaa tatgctatat atgcttatcg tgagcctcat 541 tattattatc agttcacagc tcgatatcat gcagtcccct gcaatagcat ttataatatt 601 tcttttgaga agaaacttct tcagatttta agcaaactgc ttcttgacct ttcatgtgaa 661 atttccttac ttaagtctga atgccatcgc gttaaaatgc aaagagctgg tttgcaaaat 721 gaattgttct ttgcattttc agtttcatct ctagacactg aaaaaggacc caagcgatgt 781 acagaccata actgtgaacc ttacaaaaga ctttttaagg ctaaaaatct catagagaga 841 ttttttaatc aacaagtaga aattcttggc agacgtgcag aacaattacc tcaaatatac 901 tatattgaag gtactctcca aatggtttgg attaatcgct gctttccagg atatggaatg 961 aatgtccagc aacatccaaa atgtcctgag tgctgtgtga tctgcagccc tggatcatat 1021 aacccccgtg atggaattca ttgccttcaa tgcaatagca gcctggtgta tggagcaaaa 1081 acgtgcttat aagccattat cttcagttat tcagtggttt attaaatgca aagtatatat 1141 ttgaaatatt aacataataa attattatgc caaaatt // LOCUS MIHS75KDA 2525 bp RNA PRI 05-NOV-1991 DEFINITION Human mRNA for mitochondrial 75 kDa iron sulphur protein. ACCESSION X61100 NID g38078 KEYWORDS coenzyme Q reductase; iron-sulfur protein; NADH dehydrogenase subunit. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2525) AUTHORS Robinson,B.H. TITLE Direct Submission JOURNAL Submitted (24-JUL-1991) B.H. Robinson, Research Institute The Hospital for, Sick Children, 555 University Avenue, Room 9107, Toronto Ontario M5G 1X8, CANADA REFERENCE 2 (bases 1 to 2525) AUTHORS Chow,W., Ragan,I. and Robinson,B.H. TITLE Determination of the cDNA sequence for the human mitochondrial 75-kDa Fe-S protein of NADH-coenzyme Q reductase JOURNAL Eur. J. Biochem. 201 (3), 547-550 (1991) MEDLINE 92037608 FEATURES Location/Qualifiers source 1..2525 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambda gt10" /chromosome="2" /map="2q33-34" mRNA 1..2506 /evidence=experimental CDS 47..2230 /codon_start=1 /product="75 kDa subunit NADH dehydrogenase precursor" /db_xref="PID:g38079" /db_xref="SWISS-PROT:P28331" /translation="MLRIPVRRALVGLSKSPKGCVRTTATAASNLIEVFVDGQSVMVE PGTTVLQACEKVGMQIPRFCYHERLSVAGNCRMCLVEIEKAPKVVAACAMPVMKGWNI LTNSEKSKKAREGVMEFLLANHPLDCPICDQGGECDLQDQSMMFGNDRSRFLEGKRAV EDKNIGPLVKTIMTRCIQCTRCIRFASEIAGVDDLGTTGRGNDMQVGTYIEKMFMSEL SGNIIDICPVGALTSKPYAFTARPWETRKTESIDVMDAVGSNIVVSTRTGEVMRILPR MHEDINEEWISDKTRFAYDGLKRQRLTEPMVRNEKGLLTYTSWEDALSRVAGMLQSFQ GKDVAAIAGGLVDAEALVALKDLLNRVDSDTLCTEEVFPTAGAGTDLRSNYLLNTTIA GVEEADVVLLVGTNPRFEAPLFNAWIRKSWLHNDLKVALIGSPVDLTYTYDHLGDSPK ILQDIASGSHPFSQVLKEAKKPMVVLGSSALQRNDGAAILAAVSSIAQKIRMTSGVTG DWKVMNILHRIASQVAALDLGYKPGVEAIRKNPPKVLFLLGADGGCITRQDLPKDCFI IYQGHHGDVGAPIADVILPGAAYTEKSATYVNTEGRAQQTKVAVTPPGLAREDWKIIR ALSEIAGMTLPYDTLDQVRNRLEEFSPNLVRYDDIEGANYFQQANELSKLVNQQLLAD PLVPPQLTLKDFYMTDSISRASQTMAKCVKAVTEGAQAVEEPSIC" misc_feature 47..115 /note="presequence" mat_peptide 116..2227 /EC_number="1.6.5.3" /note="75 kDa iron sulphur protein" /product="NADH dehydrogenase (ubiquinone)" BASE COUNT 755 a 480 c 615 g 675 t ORIGIN 1 cggacagttt agcagaacag cctccgcggc tccggggaga agcaatatgt taaggatacc 61 tgtaagaagg gccttagtag gcctttctaa gtctcctaaa ggatgtgttc gaacaactgc 121 cacagcagca agcaacttga ttgaagtatt tgttgatggt cagtctgtca tggtggaacc 181 gggaacgacc gtcctccaag cttgtgagaa ggttggcatg cagatccctc gattctgtta 241 tcatgaaagg ttgtctgttg ctggaaactg caggatgtgc cttgttgaaa ttgagaaagc 301 ccctaaggtt gtagctgctt gtgccatgcc agtaatgaag ggttggaata tcctaacaaa 361 ctcagaaaaa tccaaaaagg ccagggaagg tgtgatggag ttcttattag caaatcaccc 421 attggactgt cctatttgtg accagggagg tgaatgtgat ctgcaggacc agtccatgat 481 gtttggaaat gataggagcc gatttttaga ggggaagcgt gctgtggaag acaagaacat 541 tgggccattg gtaaagacca tcatgacaag atgtatacag tgtactcgct gcatcaggtt 601 tgcaagtgag attgcaggag tagatgattt gggaacaaca ggcagaggaa atgatatgca 661 agttggcaca tacattgaaa agatgttcat gtctgaactg tctgggaata tcattgatat 721 ctgccctgta ggtgccctaa cctctaagcc ctatgccttt actgcccggc cttgggaaac 781 aagaaagaca gaatccattg atgtaatgga tgcggttgga agtaatattg tggttagcac 841 aagaactgga gaagtgatga ggattttgcc acgtatgcat gaggacatca atgaagagtg 901 gatctctgat aaaaccagat ttgcctatga tgggctaaaa cgtcaaagac ttaccgagcc 961 aatggtcaga aatgaaaaag ggcttttaac ctatacttct tgggaggatg ctctctctcg 1021 cgtagctgga atgttgcaga gttttcaagg caaagatgtg gcagcaattg caggtggctt 1081 ggtggatgct gaagccctgg tagctctcaa agatttgctt aatagagtgg actctgacac 1141 cttatgcact gaagaggtct tccccactgc aggagctggc acagatttgc gttccaatta 1201 tcttcttaat actacaattg ctggtgtgga agaggcagat gttgttcttc tggttggtac 1261 aaacccacgt tttgaggcac cactgtttaa tgcatggatt cgaaagagct ggctgcataa 1321 tgacttaaaa gtggccctta taggcagtcc agtggacctc acttacacat atgaccacct 1381 gggagactcc cccaaaattc ttcaagacat tgcttcggga agccatccat ttagccaggt 1441 cctaaaggaa gctaaaaaac caatggtggt tttaggcagt tctgcactcc aaagaaatga 1501 tggagcagca attcttgcag ctgtttctag cattgcacaa aagattcgga tgactagtgg 1561 tgttactggt gattggaaag ttatgaatat ccttcatagg attgcaagtc aagtagctgc 1621 tttggacctt ggctataagc ctggggtgga agcaattcgg aagaaccctc ccaaggtgct 1681 gtttctcctg ggagcagatg gaggttgtat cacacgacag gatttgccaa aggattgttt 1741 cattatttat caaggacatc atggtgatgt tggggctccc atagctgatg ttattctccc 1801 aggagctgct tacacagaga agtctgctac atatgtcaac actgagggta gagctcagca 1861 gactaaggta gcagtgacac ctcctggctt ggcaagagaa gactggaaaa ttataagagc 1921 actctctgag attgctggaa tgactcttcc atatgatact ctggatcaag taaggaacag 1981 attggaagaa ttctctccta atcttgttcg atatgatgat attgaagggg ctaattactt 2041 ccagcaagca aatgagctct caaagctagt gaaccagcag cttcttgctg acccacttgt 2101 tccacctcag ctaactctaa aagacttcta catgacagat tcgattagca gagcctcaca 2161 gacaatggcc aaatgtgtca aagctgtcac agagggtgcc caggcagtag aggaaccatc 2221 catatgctga agcttctact aggatcccag ttttgccgca gataattaat ggacaactgt 2281 agtgcagtga tcctttacag gtttatttct ttgtaaaaaa aaaataataa taatttgaat 2341 catgtaatat ttaaggttat actatgccta tttgaaaatg atattagtta tcaactttgc 2401 agtttgaaaa acatgtattg tgtgtaaagg ttaaataaca aaactatgca gatgctctta 2461 aaagcattga taacctttgt gacgaacata aagagatcct taaattaaaa aaaaaaaaaa 2521 aaaaa // LOCUS MMU47007 2389 bp mRNA PRI 16-JUL-1996 DEFINITION Human transcriptional repressor (NAB1) NAB1 mRNA, complete cds. ACCESSION U47007 NID g1197668 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2389) AUTHORS Russo,M.W., Sevetson,B.R. and Milbrandt,J. TITLE Identification of NAB1, a repressor of NGFI-A- and Krox20-mediated transcription JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (15), 6873-6877 (1995) MEDLINE 95350172 REFERENCE 2 (bases 1 to 2389) AUTHORS Svaren,J., Sevetson,B.R., Apel,E.D., Zimonjic,D.B., Popescu,N.C. and Milbrandt,J. TITLE NAB2, a corepressor of NGFI-A (Egr-1) and Krox20, is induced by proliferative and differentiative stimuli JOURNAL Mol. Cell. Biol. 16 (7), 3545-3553 (1996) MEDLINE 96251303 REFERENCE 3 (bases 1 to 2389) AUTHORS Svaren,J. and Milbrandt,J. TITLE Direct Submission JOURNAL Submitted (24-JAN-1996) Jeffrey Milbrandt, Pathology, Washington University School of Medicine, 660 S. Euclid Avenue, St. Louis, MO 63110, USA FEATURES Location/Qualifiers source 1..2389 /organism="Homo sapiens" /db_xref="taxon:9606" /chromosome="2" /map="2q31-32" /tissue_type="brain" gene 20..1480 /gene="NAB1" CDS 20..1480 /gene="NAB1" /codon_start=1 /function="transcriptional co-repressor of NGFI-A/Egr-1 and Krox20; NGFI-A binding protein" /product="NAB1" /db_xref="PID:g1197669" /translation="MAAALPRTLGELQLYRILQKANLLSYFDAFIQQGGDDVQQLCEA GEEEFLEIMALVGMASKPLHVRRLQKALRDWVTNPGLFNQPLTSLPVSSIPIYKLPEG SPTWLGISCSSYERSSNAREPHLKIPNCAATTCVQSLGQGKSDVVGSLALQSVGESRL WQGHHATESEHSLSPADLGSPASPKESSEALDAAAALSVAECVERMAPTLPKSDLNEV KELLKTNKKLAKMIGHIFEMNDDDPHKEEEIRKYSAIYGRFDSKRKDGKHLTLHELTV NEAAAQLCVKDNALLTRRDELFALARQISREVTYKYTYRTTKSKCGERDELSPKRIKV EDGFPDFQDSVQTLFQQARAKSEELAALSSQPEKVMAKQMEFLCNQAGYERLQHAERR LSAGLYRQSSEEHSPNGLTSDNSDGQGERPLNLRMPNLQNRQPHHFVVDGELSRLYPS EAKSHSSESLGILKDYPHSAFTLEKKVIKTEPEDSR" BASE COUNT 743 a 455 c 561 g 624 t 6 others ORIGIN 1 gttaaaccca tccagagtaa tggctgcggc cttacccagg accctggggg agttgcagct 61 gtatagaata ttacaaaaag ccaatctact ttcttatttt gatgccttta tccaacaagg 121 tggtgatgat gtccagcaac tctgtgaagc aggagaagag gagtttttgg aaatcatggc 181 actcgtgggc atggctagca agccccttca tgttagaagg ctgcagaagg ctttgagaga 241 ctgggtcaca aaccctgggc ttttcaatca gccactgact tcccttcctg tcagtagcat 301 acccatctat aaattaccag agggatcacc aacatggctg ggaatatcct gcagtagtta 361 tgaaaggagt agcaatgccc gggaacctca tttaaaaatc cccaattgtg ctgccaccac 421 ctgtgtgcag agcttgggac aggggaagtc agatgtggtt gggagcctag cactgcagag 481 tgttggtgag tccagactct ggcaaggcca ccatgccact gagagcgagc acagcctctc 541 cccagcagac ctgggctccc ccgcgtcccc aaaggagagc agtgaggcgc tggatgctgc 601 tgctgcgctc tctgtggctg agtgtgtgga gcggatggcc cccacactgc caaaaagtga 661 cttgaatgaa gtgaaagagc tgctaaaaac caacaagaag ttggccaaaa tgattggtca 721 catctttgag atgaacgatg atgatccaca caaagaggag gaaattcgga aatacagtgc 781 aatatatggc agatttgact caaagaggaa ggatgggaaa catctcacac ttcatgagct 841 cactgttaat gaagcggctg ctcaactctg tgtgaaggat aatgccctgc tgacaagaag 901 agatgagctt tttgccttgg ctcgacagat ttctcgagaa gtcacctata aatatactta 961 cagaaccacc aagtcaaaat gtggagaaag agatgaatta tccccaaaga gaattaaagt 1021 ggaggatggg tttccagatt tccaggattc tgtgcaaaca ctcttccagc aggctagagc 1081 taagagtgaa gaacttgcag ctcttagttc acagcctgaa aaggtgatgg caaagcagat 1141 ggagttcctt tgcaaccaag ctggctatga gagactgcag catgccgaga ggaggttgtc 1201 tgcagggctt tacaggcaga gctcagaaga gcacagtcct aacggcttga cttccgataa 1261 ctcagatgga caaggagaaa gacctttgaa tctccgaatg cctaatttac agaacagaca 1321 accccatcat tttgtggtgg atggggagct gagcagactt taccccagtg aggcaaagtc 1381 ccactcatca gagagccttg ggattttaaa agactaccct cattcagctt ttaccttaga 1441 aaagaaagtc atcaaaacag agcctgaaga ttcaagatag ctgtgatttc tctcaccgtt 1501 ctctggaaat ggcatcagat ttaaggataa tactccatca tagaaataag ccttaataac 1561 cagtgttgcc tcattcagct caaacagatt tcatagccaa agcaaaagga ctggtacggt 1621 agtctgtgga aaccaggaag ataaaacaac agccacaaaa gagaaaatca agagtgttgc 1681 aatctataac agtaatattg attcattcac attcctgtgt taagtcattt tatatggaaa 1741 ggcttacaaa tcaatattgt aagcattcat tatttaagaa tgtacaatgt atttgtgtaa 1801 tttatagaag taaaatctag atgttgagac ctgtttggtc taatagatgt ggatacagtt 1861 tatyttactt gaaadttkgt tgtctacttt gtgtgtttaa cgtaaatata tgtcagagtt 1921 tagaatctgc ctgcagttgt gaaaaagaaa gcttaagtga tgcagttatt ggcaagattg 1981 caatgattat ggaaaaatag aaagcgaata ctcagtttaa gccaaggaaa atattgtgga 2041 tttaatattt gataaaactg attttgttta acaggaaatt tttagcattc agtcatataa 2101 catctggtta tcaatgcacg tttacacaat aaatacttga gtggaggaaa gttaaaaaga 2161 tgagcaatag agtagaaaat atatcttaaa ctagttgacc tagattgtat taatagctac 2221 ttaagatgtt tcaaagatag gaagctattg cttgggacag gggaacttgg aaataagtgg 2281 ggcccctgta ttaaaagctt ttgactttaa accnttggtt ttttccngga tgtggtttaa 2341 ataggtttag ggccccggta ggtttacccc tncctgtttt ttaaggtgg // LOCUS MUSFGF5A 1123 bp mRNA PRI 22-AUG-1996 DEFINITION Human fibroblast growth factor-5 (FGF-5) mRNA, complete cds. ACCESSION M37825 NID g182543 KEYWORDS fibroblast growth factor-5. SOURCE human cDNA to mRNA. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1123) AUTHORS Haub,O., Drucker,B. and Goldfarb,M. TITLE Expression of the murine fibroblast growth factor 5 gene in the adult central nervous system JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87 (20), 8022-8026 (1990) MEDLINE 91045929 REFERENCE 2 (bases 1 to 1123) AUTHORS Bates,B., Hardin,J., Zhan,X., Drickamer,K. and Goldfarb,M. TITLE Biosynthesis of human fibroblast growth factor-5 JOURNAL Mol. Cell. Biol. 11 (4), 1840-1845 (1991) MEDLINE 91172167 COMMENT Draft entry and computer-readable sequence kindly submitted by M.Goldfarb, 21-AUG-1990. FEATURES Location/Qualifiers source 1..1123 /organism="Homo sapiens" /note="NIH 3T3 cells were transformed with human FGF-5" /db_xref="taxon:9606" /clone="1-2-2" /cell_line="3T3-VMCUB2-1" /dev_stage="newborn" /tissue_type="brain stem" CDS 27..143 /note="ORF1; putative" /codon_start=1 /db_xref="PID:g182544" /translation="MSTRCGEAGRARGTQPHRGYRAQNQPYKMHLGPPRLEE" gene 140..946 /gene="FGF5" CDS 140..946 /gene="FGF5" /codon_start=1 /product="fibroblast growth factor 5" /db_xref="PID:g182545" /translation="MSLSFLLLLFFSHLILSAWAHGEKRLAPKGQPGPAATDRNPIGS SSRQSSSSAMSSSSASSSPAASLGSQGSGLEQSSFQWSPSGRRTGSLYCRVGIGFHLQ IYPDGKVNGSHEANMLSVLEIFAVSQGIVGIRGVFSNKFLAMSKKGKLHASAKFTDDC KFRERFQENSYNTYASAIHRTEKTGREWYVALNKRGKAKRGCSPRVKPQHISTHFLPR FKQSEQPELSFTVTVPEKKNPPSPIKSKIPLSAPRKNTNSVKYRLKFRFG" BASE COUNT 299 a 301 c 263 g 260 t ORIGIN 1 cctctcccct tctcttcccc gaggctatgt ccacccggtg cggcgaggcg ggcagagcca 61 gaggcacgca gccgcacagg ggctacagag cccagaatca gccctacaag atgcacttag 121 gacccccgcg gctggaagaa tgagcttgtc cttcctcctc ctcctcttct tcagccacct 181 gatcctcagc gcctgggctc acggggagaa gcgtctcgcc cccaaagggc aacccggacc 241 cgctgccact gataggaacc ctataggctc cagcagcaga cagagcagca gtagcgctat 301 gtcttcctct tctgcctcct cctcccccgc agcttctctg ggcagccaag gaagtggctt 361 ggagcagagc agtttccagt ggagcccctc ggggcgccgg accggcagcc tctactgcag 421 agtgggcatc ggtttccatc tgcagatcta cccggatggc aaagtcaatg gatcccacga 481 agccaatatg ttaagtgttt tggaaatatt tgctgtgtct caggggattg taggaatacg 541 aggagttttc agcaacaaat ttttagcgat gtcaaaaaaa ggaaaactcc atgcaagtgc 601 caagttcaca gatgactgca agttcaggga gcgttttcaa gaaaatagct ataataccta 661 tgcctcagca atacatagaa ctgaaaaaac agggcgggag tggtatgttg ccctgaataa 721 aagaggaaaa gccaaacgag ggtgcagccc ccgggttaaa ccccagcata tctctaccca 781 ttttcttcca agattcaagc agtcggagca gccagaactt tctttcacgg ttactgttcc 841 tgaaaagaaa aatccaccta gccctatcaa gtcaaagatt cccctttctg cacctcggaa 901 aaataccaac tcagtgaaat acagactcaa gtttcgcttt ggataatatt aatcttggcc 961 ttgtgagaaa ccattctttc ccctcaggag tttctatagg tgtcttcaga gttctgaaga 1021 aaaattactg gacacagctt cagctatact tacactgtat tgaagtcacg tcatttgttt 1081 cagtgtgact gaaacaaaat gttttttgat aggaaggaaa ctg // LOCUS S40706 895 bp PRI 29-SEP-1992 DEFINITION GADD153=growth arrest and DNA-damage-inducible gene [human, Genomic/mRNA, 895 nt]. ACCESSION S40706 NID g252001 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 895) AUTHORS Park,J.S., Luethy,J.D., Wang,M.G., Fargnoli,J., Fornace,A.J.Jr., McBride,O.W. and Holbrook,N.J. TITLE Isolation, characterization and chromosomal localization of the human GADD153 gene JOURNAL Gene 116 (2), 259-267 (1992) MEDLINE 92339899 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 109422] from the original journal article. This sequence comes from Fig. 1. Map location: 12q13. FEATURES Location/Qualifiers source 1..895 /organism="Homo sapiens" /db_xref="taxon:9606" gene 175..681 /note="growth arrest and DNA-damage-inducible gene" /gene="GADD153" CDS 175..681 /gene="GADD153" /note="This sequence comes from Fig. 1." /codon_start=1 /product="Gadd153" /db_xref="PID:g252002" /translation="MAAESLPFSSDTVSWELEAWYEDLQEVLSSDENGGTYVSPPGNE EEESKIFTTLDPASLAWLTEEEPEPAEVTSTSQSPHSPDSSQSSLAQEEEEEDQGRTR KRKQSGHSPARAGKQRMKEKEQENERKVAQLAEENERLKQEIERLTREVEATRRALID RMVNLHQA" BASE COUNT 265 a 212 c 238 g 180 t ORIGIN 1 agagacttaa gtctaaggca ctgagcgtat catgttaaag atgagcgggt ggcagcgaca 61 gagccaaaat cagagctgga acctgaggag agagtgttca agaaggaagt gtatcttcat 121 acatcaccac acctgaaagc agatgtgctt ttccagactg atccaactgc agagatggca 181 gctgagtcat tgcctttctc ttcggacact gtcagctggg agctggaagc ctggtatgag 241 gacctgcaag aggtcctgtc ttcagatgaa aatgggggta cctatgtttc acctcctgga 301 aatgaagagg aagaatcaaa aatcttcacc actcttgacc ctgcttctct ggcttggctg 361 actgaggagg agccagaacc agcagaggtc acaagcacct cccagagccc tcactctcca 421 gattccagtc agagctccct ggctcaggag gaagaggagg aagaccaagg gagaaccagg 481 aaacggaaac agagtggtca ttccccagcc cgggctggaa agcagcgcat gaaggagaaa 541 gaacaggaga atgaaaggaa agtggcacag ctagctgaag agaatgaacg gctcaagcag 601 gaaatcgagc gcctgaccag ggaagtagag gcgactcgcc gagctctgat tgaccgaatg 661 gtgaatctgc accaagcatg aacaattggg agcatcagtc ccccacttgg gccacactac 721 ccacctttcc cagaagtggc tactgactac cctctcacta gtgccaatga tgtgaccctc 781 aatcccacat acgcaggggg aaggcttgga gtagacaaaa ggaaaggtct cagcttgtat 841 atagagattg tacatttatt tattactgtc cctatctatt aaagtgactt tctat // LOCUS S46622 2134 bp mRNA PRI 05-JAN-1993 DEFINITION calcineurin A catalytic subunit [human, testis, mRNA, 2134 nt]. ACCESSION S46622 NID g258000 KEYWORDS . SOURCE human testis. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2134) AUTHORS Muramatsu,T. and Kincaid,R.L. TITLE Molecular cloning and chromosomal mapping of the human gene for the testis-specific catalytic subunit of calmodulin-dependent protein phosphatase (calcineurin A) JOURNAL Biochem. Biophys. Res. Commun. 188 (1), 265-271 (1992) MEDLINE 93038669 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 116113] from the original journal article. This sequence comes from Fig. 1. Map location: 8. FEATURES Location/Qualifiers source 1..2134 /organism="Homo sapiens" /db_xref="taxon:9606" gene 287..1795 /gene="calcineurin A catalytic subunit, calmodulin-dependent protein phosphatase catalytic subunit, CaM-PrP catalytic subunit" CDS 287..1795 /gene="calcineurin A catalytic subunit, calmodulin-dependent protein phosphatase catalytic subunit, CaM-PrP catalytic subunit" /note="This sequence comes from Fig. 1; calmodulin-dependent protein phosphatase catalytic subunit; CaM-PrP catalytic subunit" /codon_start=1 /product="calcineurin A catalytic subunit" /db_xref="PID:g258001" /translation="MSGRRFHLSTTDRVIKAVPFPPTQRLTFKEVFENGKPKVDVLKN HLVKEGRLEEEVALKIINDGAAILRQEKTMIEVDAPITVCGDIHGQFFDLMKLFEVGG SPSNTRYLFLGDYVDRGYFSIECVLYLWSLKINHPKTLFLLRGNHECRHLTDYFTFKQ ECRIKYSEQVYDACMETFDCLPLAALLNQQFLCVHGGMSPEITSLDDIRKLDRFTEPP AFGPVCDLLWSDPSEDYGNEKTLEHYTHNTVRGCSYFYSYPAVCEFLQNNNLLSIIRA HEAQDAGYRMYRKSQATGFPSLITIFSAPNYLDVYNNKAAVLKYENNVMNIRQFNCSP HPYWLPNFMDVFTWSLPFVGEKVTEMLVNVLNICSDDELISDDEAEGSTTVRKEIIRN KIRAIGKMARVFSILRQESESVLTLKGLTPTGTLPLGVLSGGKQTIETAIRGFSLQHK IRSFEEARGLDRINERMPPRKDSIYPGGPMKSVTSAHSHAAHRSDQGKKAHS" BASE COUNT 571 a 488 c 518 g 557 t ORIGIN 1 gggccaccct tagcagcggt cgcggtcggt gccgaagcgg tgttccccgc cttagccgct 61 gcgcctccca agagagcggc cggtgggccc tcgtcctgtc agtggcgtcg gaggccggcc 121 tgcggtggcc gcgcccttct ggtgctcgga caccgctgag gagccggggc cgggcacggc 181 tggctgacgg ctccgggcag ctaaggctgc ccgaggagaa ggcggcggcc gcggcgtagg 241 cgcacgtccg gcgggctcct ggagcctgga ggaggccgag gggaccatgt ccgggaggcg 301 cttccacctc tccaccaccg accgcgtcat caaagctgtc ccctttcctc caacccaacg 361 gcttactttc aaggaagtat ttgagaatgg gaaacctaaa gttgatgttt taaaaaacca 421 tttggtaaag gaaggacgac tggaagagga agtagcctta aagataatca atgatggggc 481 tgccatcctg aggcaagaga agactatgat agaagtagat gctccaatca cagtatgtgg 541 tgatattcat ggacaattct ttgacctaat gaagttattt gaagttggag gatcacctag 601 taacacacgc tacctctttc tgggtgacta tgtggacaga ggctatttca gtatagagtg 661 tgtgctgtat ttatggagtt taaagattaa tcatcccaaa acattgtttc tgcttcgggg 721 aaatcatgaa tgcaggcatc ttacagacta tttcaccttc aaacaggaat gtcgaatcaa 781 atattcggaa caggtgtatg atgcctgtat ggagacattt gactgtcttc ctcttgctgc 841 cctcttaaac cagcagtttc tctgtgtaca tggaggaatg tcacctgaaa ttacttcttt 901 agatgacatt aggaaattag acaggtttac ggaacctccc gcctttggac ctgtgtgtga 961 cctgctttgg tctgatccct cagaggatta tggcaatgag aagaccttgg agcactatac 1021 ccacaacact gtccgagggt gctcttattt ctacagttac cctgcagttt gtgaattttt 1081 gcagaacaat aatttactat caattatcag agcccatgaa gcccaagatg ctgggtatcg 1141 aatgtacagg aagagccaag ccacaggctt tccatcactt attacaattt tctctgcccc 1201 caattaccta gatgtctata acaataaagc tgctgtgttg aaatatgaaa acaatgtcat 1261 gaatatcagg cagtttaact gttctccaca cccctactgg cttccaaact ttatggatgt 1321 tttcacatgg tctttgcctt ttgttgggga aaaagtcaca gagatgctgg taaatgtgct 1381 caacatatgc tctgatgacg aactgatttc tgatgatgaa gcagaaggaa gcactacagt 1441 tcgtaaggag atcatcagga ataagatcag agccattggg aagatggcac gggtcttttc 1501 aattcttcgg caagaaagtg agagtgtgct gactctcaag ggcctgactc ccacaggcac 1561 actccctctg ggcgtcctct caggaggcaa gcagactatc gagacagcca tcagagggtt 1621 ctcgcttcag cacaagatcc ggagttttga agaagcgcga ggtctggacc gaattaatga 1681 gcgaatgcca ccccgaaagg atagcatata ccctggtggg ccaatgaaat ctgtaacctc 1741 agcacactca catgctgcgc acaggagcga ccaagggaag aaagcccatt catgacttag 1801 agtcctgccg tgctcaggtg gatctaaaac tcaagaacaa attctattta tttattattg 1861 gaaaatgaaa agcaactcaa aacaacttca acctggaggt gcatttataa ttcagtctgc 1921 atttattctg taaaaaggtg actgttttat aaattctttt aatttatgtt caatatatat 1981 aaaaagtgca tctgttttgt ttttcccttt tttctccata attttaagaa atgaatctga 2041 ttgttgtcaa cacatttgtg aagtcttgtg ctataaaggg gaacttcccc taataaaagg 2101 gccttggaaa cctcaaacct gggtttctga cccc // LOCUS S49953 778 bp mRNA PRI 11-MAR-1993 DEFINITION N-cym=DNA-binding transcriptional activator homolog {oncogene} [human, Kelly neuroblastoma cell line, mRNA, 778 nt]. ACCESSION S49953 NID g260139 KEYWORDS . SOURCE human Kelly neuroblastoma cell line. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 778) AUTHORS Armstrong,B.C. and Krystal,G.W. TITLE Isolation and characterization of complementary DNA for N-cym, a gene encoded by the DNA strand opposite to N-myc JOURNAL Cell Growth Differ. 3 (6), 385-390 (1992) MEDLINE 93041371 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 118622] from the original journal article. This sequence comes from Fig. 1B. FEATURES Location/Qualifiers source 1..778 /organism="Homo sapiens" /db_xref="taxon:9606" gene 426..755 /note="DNA-binding transcriptional activator homolog" /gene="N-cym" CDS 426..755 /gene="N-cym" /note="DNA-binding transcriptional activator homolog; This sequence comes from Fig. 1B" /codon_start=1 /db_xref="PID:g260140" /translation="MQHPPCEPGNCLSLKEKKITEGSGGVCWGGETDASNPAPALTAC CAAEREANVEQGLAGRLLLCNYERRLVRRCKIAGRGRAPLGTRPLDVSSFKLKEEGRP PCLKINK" BASE COUNT 188 a 210 c 237 g 143 t ORIGIN 1 agggggtggt ggcgaggctc cgcaactttg gaaactgcca tttcattcac acaaggcact 61 gcctggggga gggggctgtt cctggctgca gaattctagc tctcacgagc acgcagacaa 121 ccgcactcgc agcggtgtgg ggccggctgc tcaggggaag ccccaggctc tccgacccag 181 ctaccgggaa tggggcaccc tttggagaag aaccccagcc tggggtgggg acgcaccggc 241 tctccgacag ctcaaacaca gacagatctt ctagagccga gggaatttct tttcgcagaa 301 gccattactc cccccgagag aaggctgcaa agctgggaag cccagggtgt gctcctcccg 361 cccttttgga cccccgggct tgcaccggct gcactctgag aaccagctgc gcgcggagcg 421 gtgcaatgca gcacccaccc tgcgagcctg gcaattgctt gtcattaaaa gaaaaaaaaa 481 ttacggaggg ctccgggggt gtgtgttggg gaggggagac cgatgcttct aacccagccc 541 ccgctttgac tgcgtgttgt gcagctgagc gcgaggccaa cgttgagcaa ggccttgcag 601 ggaggttgct cctgtgtaat tacgaaagaa ggctagtccg aaggtgcaaa atagcaggga 661 gaggacgcgc ccccttagga acaagacctc tggatgtttc cagtttcaaa ttgaaagaag 721 aggggcgccc cccttgtttg aaaataaata aataaataag tgcgagctac aaaaaaaa // LOCUS S50223 798 bp mRNA PRI 10-FEB-1993 DEFINITION HKR-T1=Kruppel-like zinc finger protein [human, MOLT 4 T-cells, mRNA, 798 nt]. ACCESSION S50223 NID g260311 KEYWORDS . SOURCE human MOLT 4 T-cells. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 798) AUTHORS Wu,B.Y., Hanley,E.W., Turka,L.A. and Nabel,G.J. TITLE Isolation of a cDNA clone encoding a zinc finger protein highly expressed in T-leukemia lines JOURNAL Blood 80 (10), 2571-2576 (1992) MEDLINE 93043304 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 118663] from the original journal article. This sequence comes from Fig. 2A. FEATURES Location/Qualifiers source 1..798 /organism="Homo sapiens" /db_xref="taxon:9606" gene 90..779 /gene="HKR-T1" CDS 90..779 /gene="HKR-T1" /note="Kruppel-like zinc finger protein; This sequence comes from Fig. 2A. Author-given protein sequence is in conflict with the conceptual translation; mismatches(76[N->I],78[I->T])" /codon_start=1 /product="HKR-T1" /db_xref="PID:g260312" /translation="MRLAKPKAGISRSSSQGKAYENKRKTGRQREKWGMTIRFDSSFS RLRRSLDDKPYKCTECEKSFSQSSTLFQHQKNHIGKKSHKCADCGKSFFQSSNLIQHR RIHTGEKPYKCDECGESFKQSSNLIQHQRIHTGEKPYQCDECGRCFSQSSHLIQHQRT HTGEKPYQCSECGKCFSQSSHLRQHMKVHKEEKPRKTRGKNIRVKTHLPSWKAGTEGS LWLVSVKYRAF" BASE COUNT 256 a 168 c 187 g 187 t ORIGIN 1 aaaattctga gctgtacacc tctaggaaat gaaacactag ttcagaagaa gcctgtaaac 61 tctcttacaa atacatttgg ttattcacca tgaggttagc aaagcctaaa gcgggtattt 121 ctcggagctc aagccaagga aaggcctatg agaacaagcg caaaacaggc cggcagcgcg 181 agaagtgggg catgactatt cgatttgact caagcttcag tagactcaga agaagcttgg 241 atgacaaacc ctataaatgt actgaatgtg aaaagagttt cagtcagagt tcaactcttt 301 ttcaacacca gaagatccat actggaaaga aatcccataa atgtgctgat tgtgggaaaa 361 gtttctttca gagttctaat ctcattcagc atcgacggat ccatacgggg gaaaagccct 421 acaaatgtga tgagtgtgga gaaagcttca aacagagctc aaatctcatt cagcaccaga 481 gaattcatac tggagaaaaa ccctatcagt gtgatgagtg tggccggtgt ttcagccaga 541 gctcccacct tattcaacat cagagaaccc acactgggga gaaaccctac cagtgcagtg 601 aatgtggcaa atgtttcagt cagagctctc atctgaggca gcacatgaag gtgcataaag 661 aagagaagcc tcgtaaaacc cggggcaaaa atatcagggt gaagactcac ttaccctctt 721 ggaaagctgg tacagaagga agtctgtggc tggtctccgt taagtatagg gctttttgac 781 agctttttga gacctctt // LOCUS S52028 1194 bp mRNA PRI 24-MAR-1993 DEFINITION cystathionine gamma-lyase {clone HCL-1} [human, liver, mRNA, 1194 nt]. ACCESSION S52028 NID g262473 KEYWORDS . SOURCE human liver. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1194) AUTHORS Lu,Y., O'Dowd,B.F., Orrego,H. and Israel,Y. TITLE Cloning and nucleotide sequence of human liver cDNA encoding for cystathionine gamma-lyase JOURNAL Biochem. Biophys. Res. Commun. 189 (2), 749-758 (1992) MEDLINE 93112041 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 121795] from the original journal article. This sequence comes from Fig. 2. FEATURES Location/Qualifiers source 1..1194 /organism="Homo sapiens" /db_xref="taxon:9606" gene 34..1119 /gene="cystathionine gamma-lyase, cystathionase" CDS 34..1119 /gene="cystathionine gamma-lyase, cystathionase" /note="This sequence comes from Fig. 2; cystathionase" /codon_start=1 /product="cystathionine gamma-lyase" /db_xref="PID:g262474" /translation="MQEKDASSQGFLPHFQHFATQAIHVGQDPEQWTSRAVVPPISLS TTFKQGAPGQHSGFEYSRSGNPTRNCLEKAVAALDGAKYCLAFASGLAATVTITHLLK AGDQIICMDDVYGGTNRYFRQVASEFGLKISFVDCSKIKLLEAAITPETKRPLALGAD ISMYSATKYMNGHSDVVMGLVSVNCESLHNRLRFLQNSLGAVPSPIDCYLCNRGLKTL HVRMEKHFKNGMAVAQFLESNPWVEKVIYPGLPSHPQHELVKRQCTGCTGMVTFYIKG TLQHAEIFLKNLKLFTLAESLGGFESLAELPAIMTHASVLKNDRDVLGISDTLIRLSV GLEDEEDLLEDLDQALKAAHPPSGIHS" BASE COUNT 314 a 269 c 276 g 335 t ORIGIN 1 ttcttttcct ctcttcttct ttcgcggttc agcatgcagg aaaaagacgc ctcctcacaa 61 ggtttcctgc cacacttcca acatttcgcc acgcaggcga tccatgtggg ccaggatccg 121 gagcaatgga cctccagggc tgtagtgccc cccatctcac tgtccaccac gttcaagcaa 181 ggggcgcctg gccagcactc gggttttgaa tatagccgtt ctggaaatcc cactaggaat 241 tgccttgaaa aagcagtggc agcactggat ggggctaagt actgtttggc ctttgcttca 301 ggtttagcag ccactgtaac tattacccat cttttaaaag caggagacca aattatttgt 361 atggatgatg tgtatggagg tacaaacagg tacttcaggc aagtggcatc tgaatttgga 421 ttaaagattt cttttgttga ttgttccaaa atcaaattac tagaggcagc aattacacca 481 gaaaccaagc gccctttggc tctgggagct gatatttcta tgtattctgc aacaaaatac 541 atgaatggcc acagtgatgt tgtaatgggc ctggtgtctg ttaattgtga aagccttcat 601 aatagacttc gtttcttgca aaactctctt ggagcagttc catctcctat tgattgttac 661 ctctgcaatc gaggtctgaa gactctacat gtccgaatgg aaaagcattt caaaaacgga 721 atggcagttg cccagttcct ggaatctaat ccttgggtag aaaaggttat ttatcctggg 781 ctgccctctc atccacagca tgagttggtg aagcgtcagt gtacaggttg tacagggatg 841 gtcacctttt atattaaggg cactcttcag catgctgaga ttttcctcaa gaacctaaag 901 ctatttactc tggccgagag cttgggagga ttcgaaagcc ttgctgagct tccggcaatc 961 atgactcatg catcagttct taagaatgac agagatgtcc ttggaattag tgacacactg 1021 attcgacttt ctgtgggctt agaggatgag gaagacctac tggaagatct agatcaagct 1081 ttgaaggcag cacaccctcc aagtggaatt cacagctagt attccagagc tgctattaga 1141 agctgcttcc tgtgaagatc aatcttcctg agtaattaat ggaccaacaa tgag // LOCUS S52624 1541 bp mRNA PRI 25-MAR-1993 DEFINITION platelet-activating factor receptor [human, heart ventricle, mRNA Partial, 1541 nt]. ACCESSION S52624 NID g262469 KEYWORDS . SOURCE human heart ventricle. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1541) AUTHORS Sugimoto,T., Tsuchimochi,H., McGregor,C.G., Mutoh,H., Shimizu,T. and Kurachi,Y. TITLE Molecular cloning and characterization of the platelet-activating factor receptor gene expressed in the human heart JOURNAL Biochem. Biophys. Res. Commun. 189 (2), 617-624 (1992) MEDLINE 93112021 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 121778] from the original journal article. This sequence comes from Fig. 2. FEATURES Location/Qualifiers source 1..1541 /organism="Homo sapiens" /db_xref="taxon:9606" gene 81..173 /gene="orf 5' of PAF receptor" CDS 81..173 /gene="orf 5' of PAF receptor" /codon_start=1 /db_xref="PID:g1680456" /translation="MLSGDPHLPQPLCHCLDHCPCCFSGTTRTS" gene 192..1220 /gene="platelet-activating factor receptor, PAF receptor" CDS 192..1220 /gene="platelet-activating factor receptor, PAF receptor" /note="This sequence comes from Fig. 2. Author-given protein sequence is in conflict with the conceptual translation; mismatch(316[K->N]); PAF receptor" /codon_start=1 /product="platelet-activating factor receptor" /db_xref="PID:g262470" /translation="MEPHDSSHMDSEFRYTLFPIVYSIIFVLGVIANGYVLWVFARLY PCKKFNEIKIFMVNLTMADMLFLITLPLWIVYYQNQGNWILPKFLCNVAGCLFFINTY CSVAFLGVITYNRFQAVTRPIKTAQANTRKRGISLSLVIWVAIVGAASYFLILDSTNT VPDSAGSGNVTRCFEHYEKGSVPVLIIHIFIVFSFFLVFLIILFCNLVIIRTLLMQPV QQQRNAEVKRRALWMVCTVLAVFIICFVPHHVVQLPWTLAELGFQDSKFHQAINDAHQ VTLCLLSTNCVLDPVIYCFLTKKFRKHLTEKFYSMRSSRKCSRATTDTVTEVVVPFNQ IPGNSLKN" BASE COUNT 320 a 478 c 360 g 383 t ORIGIN 1 cccctctgtg cactcattac ctgcttcctg agctccccga gaagtcatcc aggacctccc 61 cgagaagccg tccaggaaac atgctctcag gggaccccca tctgcctcag cctctttgtc 121 actgcctgga ccattgtccc tgctgtttct caggcaccac caggaccagc tgatcattcc 181 agcccacagc aatggagcca catgactcct cccacatgga ctctgagttc cgatacactc 241 tcttcccgat tgtttacagc atcatctttg tgctcggggt cattgctaat ggctacgtgc 301 tgtgggtctt tgcccgcctg tacccttgca agaaattcaa tgagataaag atcttcatgg 361 tgaacctcac catggcggac atgctcttct tgatcaccct gccactttgg attgtctact 421 accaaaacca gggcaactgg atactcccca aattcctgtg caacgtggct ggctgccttt 481 tcttcatcaa cacctactgc tctgtggcct tcctgggggt catcacttat aaccgcttcc 541 aggcagtaac tcggcccatc aagactgctc aggccaacac ccgcaagcgt ggcatctctt 601 tgtccttggt catctgggtg gccattgtgg gagctgcatc ctacttcctc atcctggact 661 ccaccaacac agtgcccgac agtgctggct caggcaacgt cactcgctgc tttgagcatt 721 acgagaaggg cagcgtgcca gtcctcatca tccacatctt catcgtgttc agcttcttcc 781 tggtcttcct catcatcctc ttctgcaacc tggtcatcat ccgtaccttg ctcatgcagc 841 cggtgcagca gcagcgcaac gctgaagtca agcgccgggc gctgtggatg gtgtgcacgg 901 tcttggcggt gttcatcatc tgcttcgtgc cccaccacgt ggtgcagctg ccctggaccc 961 ttgctgagct gggcttccag gacagcaaat tccaccaggc cattaatgat gcacatcagg 1021 tcaccctctg cctccttagc accaactgtg tcttagaccc tgttatctac tgtttcctca 1081 ccaaaaagtt ccgcaagcac ctcaccgaaa agttctacag catgcgcagt agccggaatt 1141 gctcccgggc caccacggat acggtcactg aagtggttgt gccattcaac cagatccctg 1201 gcaattccct caaaaattag tccctgcttc catgcctgaa gtcttctcct ccatgaacat 1261 catggactga gctgggggaa gaagggatat ctactgtggg tctgggcacc acctctgtgg 1321 gctctggtgg gccattagat ttggaggcta cctcacctgg gcagggatga tggcagagcc 1381 aggctgttgg aaaatccaga actcaaatga gccccttcat ccgcctgtgg gcgcatacta 1441 cagtaactgt gactgatgac tttatcctga gtcccttaat cttatggggc cggaaggaat 1501 gtcagggcca ggtgcagacc ttgggggaag actttaaacc a // LOCUS S54641 2080 bp mRNA PRI 12-APR-1993 DEFINITION HZF-16=Kruppel-related zinc finger gene homolog {alternatively spliced} [human, hepatoblastoma cell line, HEP-G2, mRNA, 2080 nt]. ACCESSION S54641 NID g265483 KEYWORDS . SOURCE human HEP-G2 hepatoblastoma cell line. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2080) AUTHORS Saleh,M., Selleri,L. and Evans,G.A. TITLE A novel zinc finger gene on human chromosome 1qter that is alternatively spliced in human tissues and cell lines JOURNAL Am. J. Hum. Genet. 52 (1), 192-203 (1993) MEDLINE 93167271 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 124963] from the original journal article. This sequence comes from Fig. 1. Map location: 1q44. FEATURES Location/Qualifiers source 1..2080 /organism="Homo sapiens" /db_xref="taxon:9606" gene 511..1401 /note="Kruppel-related zinc finger gene homolog" /gene="HZF-16" CDS join(511..912,1333..1401) /gene="HZF-16" /note="zinc finger; This sequence comes from Fig. 1" /codon_start=1 /product="HZF-16.1" /db_xref="PID:g265485" /translation="MNALCVKNSSYVHSSLHRHIISHSGNNPYGCEECGKKPCTCKQC QKTSLSVTRVHRDTVMHTGNGHYGCTICEKVFNIPSSFQIHQRNHTGEKPYECMECGK ALGFSRSLNRHKRIHTGEKRYECKQCGKAFSRASTLWKHKKTHTGEKPYKCKKM" CDS 511..1401 /gene="HZF-16" /note="zinc finger; This sequence comes from Fig. 1" /codon_start=1 /product="HZF-16.2" /db_xref="PID:g265484" /translation="MNALCVKNSSYVHSSLHRHIISHSGNNPYGCEECGKKPCTCKQC QKTSLSVTRVHRDTVMHTGNGHYGCTICEKVFNIPSSFQIHQRNHTGEKPYECMECGK ALGFSRSLNRHKRIHTGEKRYECKQCGKAFSRSSHLRDHERTHTGEKPYECKHCGKAF RYSNCLHYHERTHTGEKPYVCMECGKAFSCLSSLQGHIKAHAGEEPYPCKQCGKAFRY ASSLQKHEKTHIAQKPYVCNNCGKGFRCSSSLRDHERTHTGEKPYECQKCGKAFSRAS TLWKHKKTHTGEKPYKCKKM" BASE COUNT 743 a 368 c 377 g 592 t ORIGIN 1 tttttttttt ttttttaaca atcggatctt tcaggaacta atagagcgag aagtcactca 61 ttaccacaac agtgccactt atgtggaatc tgctcccatg acccaagcac ctcacaccag 121 gtcatacctc caacatgagg atcaaatttc agcatgaaat ttgaagggag aaaataccca 181 aactgtattc aatactaaga aactgcatat gagaatatta ctgtattgtt aatagctata 241 ggggagggag ccatgttgta gactaatcaa tccatttatg ttcaatttgt ttatgttaga 301 aaacctgcac tttctctgat attggtagca gtgtaagttc agacttagta ataaaagaaa 361 ataactaata aaccattaat gatggggttg tcattttttg cagaagtcat atgatagaca 421 tactgtgtaa aattaaataa gtcagtgtag aaaaacctcc agatgccaaa ttttaatctg 481 aacaaaaaat tcctgctaga gtaaaaccac atgaatgcat tgtgtgtgaa aaattcttca 541 tacgttcatt catcccttca taggcacatc atatctcatt ctggaaacaa cccatatggg 601 tgtgaggaat gcggaaagaa gccatgtaca tgtaaacaat gtcagaaaac ttccctttct 661 gtcacaaggg ttcacagaga cacagtaatg cacactggaa atggacatta tggttgtaca 721 atatgtgaga aagtttttaa tattcccagt tcatttcaga tacatcagag aaatcacact 781 ggagagaaac cctatgaatg tatggaatgt gggaaagcct taggtttttc ccgttctctt 841 aatagacata aaaggattca cactggagaa aaacgctatg aatgtaagca atgtgggaaa 901 gccttcagtc gttccagtca ccttcgtgac catgaaagaa ctcatactgg agagaaaccc 961 tatgaatgta agcactgtgg gaaagccttc cgttactcca attgccttca ttaccatgaa 1021 agaactcaca ctggagagaa accttatgtg tgcatggaat gtggcaaagc tttcagttgt 1081 ctcagttcct tgcaaggaca tataaaggct catgctggtg aagaacccta tccatgtaag 1141 caatgtggga aagccttcag atacgccagt tcccttcaga aacacgagaa aactcatatt 1201 gcacagaaac cctatgtatg taacaattgt ggtaaaggct tcagatgttc cagttccctt 1261 cgtgaccatg aaaggactca tactggagag aaaccctatg aatgtcagaa atgtggcaaa 1321 gcctttagtc gtgctagtac cctttggaag cataaaaaaa ctcatactgg agaaaagccc 1381 tataaatgta aaaaaatgta aaggctttaa tcactacagt ttttgtcaaa aacatgaaca 1441 gtcacatact tgagagaaac tgtgaatgta aggtgtagga aagtacttaa ttttcccaga 1501 tttcctcaaa tacatgaaac gaatcaaact ggagataaac cctatgacta taagcaataa 1561 ggtaaagcat tcaatttttc catttctttt tgaaaacttg aaaggactca ctgaagaaaa 1621 tccatatgaa tgtttaaaat gtggtaaggc ctgcagttgt tccagtggta tttgatggta 1681 taacataact cattctagag aaaaacttta tgaaggtatc gaatgtgaga atgccttcat 1741 ttatcctata actcactcag agacacatgg taacacatac tccagattga ccttataaat 1801 aaaagaatgc ccaccagatt gaaatcctag aaatacagaa aattctgaat tttaacaatt 1861 actttaaagg tcatgtgaaa actcccactg gaaataaatc ctgtaaatgt aaatattatg 1921 gaaaacctta tggaaaataa ttatagtaca attctcaaga aaattcacta tgtgaatgag 1981 atgttttagg agctataaat aaattgaata tattggttgc tcatttttga agagagtctc 2041 tagaatatag attttcactt cttttgaaaa aaaaaaaaaa // LOCUS S54761 433 bp mRNA PRI 01-APR-1993 DEFINITION beta 2- mu =beta 2-microglobulin [human, SK-MEL-33 cells, mRNA Mutant, 433 nt]. ACCESSION S54761 NID g265221 KEYWORDS . SOURCE human SK-MEL-33 cells. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 433) AUTHORS Wang,Z., Cao,Y., Albino,A.P., Zeff,R.A., Houghton,A. and Ferrone,S. TITLE Lack of HLA class I antigen expression by melanoma cells SK-MEL-33 caused by a reading frameshift in beta 2-microglobulin messenger RNA JOURNAL J. Clin. Invest. 91 (2), 684-692 (1993) MEDLINE 93163363 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 124658] from the original journal article. This sequence comes from Fig. 4. COMMENT G deletion at position 323 causes a frameshift. FEATURES Location/Qualifiers source 1..433 /organism="Homo sapiens" /db_xref="taxon:9606" gene 26..331 /note="beta 2-microglobulin" /gene="&bgr;2-&mgr;" CDS 26..331 /gene="2-" /note="This sequence comes from Fig. 4." /codon_start=1 /product="beta 2-microglobulin" /db_xref="PID:g265222" /translation="MSRSVALAVVALLSLSGLEAIQRTPKIQVYSRHPAENGKSNFLN CYVSGFHPSDIEVDLLKNGERIEKVEHSDLSFSKDWSFYLLYYTEFTPTEKMSMPAV" BASE COUNT 111 a 96 c 105 g 121 t ORIGIN 1 tgaagctgac agcattcggg ccgagatgtc tcgctccgtg gccttagctg tcgtcgcgct 61 actctctctt tctggcctgg aggctatcca gcgtactcca aagattcagg tttactcacg 121 tcatccagca gagaatggaa agtcaaattt cctgaattgc tatgtgtctg ggtttcatcc 181 atccgacatt gaagttgact tactgaagaa tggagagaga attgaaaaag tggagcattc 241 agacttgtct ttcagcaagg actggtcttt ctatctcttg tactacactg aattcacccc 301 cactgaaaaa atgagtatgc ctgccgtgtg aaccatgtga ctttgtcaca gcccaagata 361 gttaagtggg atcgagacat gtaagcagca tcatggaggt ttgaagatgc cgcatttgga 421 ttggatgaat tcc // LOCUS S54769 429 bp mRNA PRI 02-APR-1993 DEFINITION cellular adhesion regulatory molecule [human, mRNA, 429 nt]. ACCESSION S54769 NID g265102 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 429) AUTHORS Pullman,W.E. and Bodmer,W.F. TITLE Cloning and characterization of a gene that regulates cell adhesion JOURNAL Nature 361 (6412), 564 (1993) MEDLINE 93156848 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 124572] from the original journal article. This sequence comes from Text, p. 564. COMMENT *ERRATUM* W.E. Pullman and W.F. Bodmer, Nature 356, 529-532, 1992. FEATURES Location/Qualifiers source 1..429 /organism="Homo sapiens" /db_xref="taxon:9606" gene 151..399 /gene="cellular adhesion regulatory molecule, CMAR" CDS 151..399 /gene="cellular adhesion regulatory molecule, CMAR" /note="*ERRATUM* W.E. Pullman and W.F. Bodmer, Nature 356, 529-532, 1992. This sequence comes from Text, p. 564; CMAR" /codon_start=1 /product="cellular adhesion regulatory molecule" /db_xref="PID:g265103" /translation="MLRGSDMKGPCEPIVLSPAALSSSSLINGASQAQALGSGGLTTA PCCHVDWCKLRTSCWSSHACSVGDALVFTALRIVEILY" BASE COUNT 99 a 117 c 114 g 99 t ORIGIN 1 gcatggaaca cttcgagttc ccagggttat agacagtcgt tcccagtgtg gctgaggcca 61 cccagaggca gcagagcatt cagactccaa acagacccct gttcatgccg acgcttgcac 121 gaccgcccca gttcctgtgg ctccctcgga atgctaaggg gatcggacat gaaaggaccc 181 tgtgagccga ttgtcctatc tccagcggcc ctgtcatcca gctcactcat caatggggcc 241 agtcaggccc aggcactggg ctccggagga ctcaccactg ccccctgctg ccatgtggac 301 tggtgcaagt tgaggacttc ttgctggtct agtcacgcat gcagtgttgg ggatgccttg 361 gtttttactg ctctgagaat tgttgagata ctttactaat aaactgtgta gttggaaaaa 421 aaaaaaaaa // LOCUS S55606 1271 bp mRNA PRI 29-APR-1993 DEFINITION betacellulin [human, mRNA, 1271 nt]. ACCESSION S55606 NID g265785 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1271) AUTHORS Sasada,R., Ono,Y., Taniyama,Y., Shing,Y., Folkman,J. and Igarashi,K. TITLE Cloning and expression of cDNA encoding human betacellulin, a new member of the EGF family JOURNAL Biochem. Biophys. Res. Commun. 190 (3), 1173-1179 (1993) MEDLINE 93176165 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 125748] from the original journal article. This sequence comes from Fig. 1. FEATURES Location/Qualifiers source 1..1271 /organism="Homo sapiens" /db_xref="taxon:9606" gene 295..831 /gene="betacellulin, BTC" CDS 295..831 /gene="betacellulin, BTC" /note="This sequence comes from Fig. 1; BTC" /codon_start=1 /product="betacellulin" /db_xref="PID:g265786" /translation="MDRAARCSGASSLPLLLALALGLVILHCVVADGNSTRSPETNGL LCGDPEENCAATTTQSKRKGHFSRCPKQYKHYCIKGRCRFVVAEQTPSCVCDEGYIGA RCERVDLFYLRGDRGQILVICLIAVMVVFIILVIGVCTCCHPLRKRRKRKKKEEEMET LGKDITPINEDIEETNIA" BASE COUNT 346 a 267 c 327 g 331 t ORIGIN 1 cagcgtggag gctccaagga ccaagtcctg cgcctctttg gcggggtgtg tgcaggagga 61 ggggggataa ataggaggct ccctcctccc ggcgacattc acggagccgg ccggcctccc 121 gccctgggtg tttccctgcc ttgtagccag ggtgccagcc tgggaagtag tttcgtttcc 181 ttctgcctcc gggattagtt tccaggcacc ctctcaggcg cccgaggccc gggaaggggg 241 cgaagaagga gggagacttg tctaggggct gcccggcccg gcagagcggg gttgatggac 301 cgggccgccc ggtgcagcgg cgccagctcc ctgccactgc tcctggccct tgccctgggt 361 ctagtgatcc ttcactgtgt ggtggcagat gggaattcca ccagaagtcc tgaaactaat 421 ggcctcctct gtggagaccc tgaggaaaac tgtgcagcta ccaccacaca atcaaagcgg 481 aaaggccact tctctaggtg ccccaagcaa tacaagcatt actgcatcaa agggagatgc 541 cgcttcgtgg tggccgagca gacgccctcc tgtgtctgtg atgaaggcta cattggagca 601 aggtgtgaga gagttgactt gttttaccta agaggagaca gaggacagat tctggtgatt 661 tgtttgatag cagttatggt agtttttatt attttggtca tcggtgtctg cacatgctgt 721 caccctcttc ggaaacgtcg taaaagaaag aagaaagaag aagaaatgga aactctgggt 781 aaagatataa ctcctatcaa tgaagatatt gaagagacaa atattgctta aaaggctatg 841 aagttacctc caggttggtg gcaagctgca aagtgccttg ctcatttgaa aatggacaga 901 atgtgtctca ggaaaaacag ctagtagaca tgaattttaa ataatgtatt tactttttat 961 ttgcaacttt agtttgtgtt attatttttt aataagaaca ttaattatat gtatattgtc 1021 tagtaattgg gaaaaaagca actggttagg tagcaacaac agaagggaaa tttcaataac 1081 ctttcactta agtattgtca ccaggattac tagtcaaaca aaaaagaaaa gtagaaagga 1141 ggttaggtct taggaattga attaataata aagctaccat ttatcaagca tttaccatgt 1201 gctaataagt ttgaaatata ttatttcctt tattcctttc agcaatccat gagatagcta 1261 ttataatcct c // LOCUS S57235 1722 bp mRNA PRI 21-MAY-1993 DEFINITION CD68=110kda transmembrane glycoprotein [human, promonocyte cell line U937, mRNA, 1722 nt]. ACCESSION S57235 NID g298664 KEYWORDS . SOURCE human promonocyte cell line U937. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1722) AUTHORS Holness,C.L. and Simmons,D.L. TITLE Molecular cloning of CD68, a human macrophage marker related to lysosomal glycoproteins JOURNAL Blood 81 (6), 1607-1613 (1993) MEDLINE 93200523 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 127492] from the original journal article. This sequence comes from Fig. 3. FEATURES Location/Qualifiers source 1..1722 /organism="Homo sapiens" /db_xref="taxon:9606" gene 16..1080 /gene="CD68" CDS 16..1080 /gene="CD68" /note="110kda transmembrane glycoprotein; This sequence comes from Fig. 3" /codon_start=1 /product="CD68" /db_xref="PID:g298665" /translation="MRLAVLFSGALLGLLAAQGTGNDCPHKKSATLLPSFTVTPTVTE STGTTSHRTTKSHKTTTHRTTTTGTTSHGPTTATHNPTTTSHGNVTVHPTSNSTATSQ GPSTATHSPATTSHGNATVHPTSNSTATSPGFTSSAHPEPPPPSPSPSPTSKETIGDY TWTNGSQPCVHLQAQIQIRVMYTTQGGGEAWGISVLNPNKTKVQGSCEGAHPHLLLSF PYGHLSFGFMQDLQQKVVYLSYMAVEYNVSFPHAAKWTFSAQNASLRDLQAPLGQSFS CSNSSIILSPAVHLDLLSLRLQAAQLPHTGVFGQSFSCPSDRSILLPLIIGLILLGLL ALVLIAFCIIRRRPSAYQAL" BASE COUNT 435 a 539 c 401 g 347 t ORIGIN 1 gcgggcggtt cagccatgag gctggctgtg cttttctcgg gggccctgct ggggctactg 61 gcagcccagg ggacagggaa tgactgtcct cacaaaaaat cagctacttt gctgccatcc 121 ttcacggtga cacccacggt tacagagagc actggaacaa ccagccacag gactaccaag 181 agccacaaaa ccaccactca caggacaacc accacaggca ccaccagcca cggacccacg 241 actgccactc acaaccccac caccaccagc catggaaacg tcacagttca tccaacaagc 301 aatagcactg ccaccagcca gggaccctca actgccactc acagtcctgc caccactagt 361 catggaaatg ccacggttca tccaacaagc aacagcactg ccaccagccc aggattcacc 421 agttctgccc acccagaacc acctccaccc tctccgagtc ctagcccaac ctccaaggag 481 accattggag actacacgtg gaccaatggt tcccagccct gtgtccacct ccaagcccag 541 attcagattc gagtcatgta cacaacccag ggtggaggag aggcctgggg catctctgta 601 ctgaacccca acaaaaccaa ggtccaggga agctgtgagg gtgcccatcc ccacctgctt 661 ctctcattcc cctatggaca cctcagcttt ggattcatgc aggacctcca gcagaaggtt 721 gtctacctga gctacatggc ggtggagtac aatgtgtcct tcccccacgc agcaaagtgg 781 acattctcgg ctcagaatgc atcccttcga gatctccaag cacccctggg gcagagcttc 841 agttgcagca actcgagcat cattctttca ccagctgtcc acctcgacct gctctccctg 901 aggctccagg ctgctcagct gccccacaca ggggtctttg ggcaaagttt ctcctgcccc 961 agtgaccggt ccatcttgct gcctctcatc atcggcctga tccttcttgg cctcctcgcc 1021 ctggtgctta ttgctttctg catcatccgg agacgcccat ccgcctacca ggccctctga 1081 gcatttgctt caaaccccag ggcactgagg gggtttgggg tgtggtgggg gggtaccctt 1141 atttcctcga cacgccgctg gctcaaagac aatgttattt tccttccctt tcttgaagaa 1201 caaaaagaaa gccgggcatg acggctcatg cctgtaatcc cagcactttg ggaggctgag 1261 gcaggtggat cactggaggt caggtctttg aggccagccc tagccaacat ggtgtaaaca 1321 ctgtctctac taaaaataca attagccagg tgtggcggcg taatcccatg ctaacctgta 1381 atcccagcta cttgggaggc tgaggcagag ctgcttgaac cctggaagtg gaggttgcag 1441 tgagcctgtc atcgctccac tgagccaaga tcgctcccac tgcactccag cctgggcgac 1501 agagccagac tgtctcaaat aaataaatat gagataatgc agtcgggaga agggagggag 1561 agaattttat taaatgtgac gaactgcccc cccccccccc cccagcagga gagcagcaaa 1621 atttatgtaa atctttgacg gggttttcct tgctcctgcc aggattaaaa gtccatgagt 1681 ttcttgctca aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa // LOCUS S60753 56 bp mRNA PRI 08-JUL-1993 DEFINITION CHM (CHM*SAL) {alternatively spliced, exon C'-D} [human, mRNA Partial Mutant, 56 nt]. ACCESSION S60753 NID g300303 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 56) AUTHORS Sankila,E.M., Tolvanen,R., van den Hurk,J.A., Cremers,F.P. and de la Chapelle,A. TITLE Aberrant splicing of the CHM gene is a significant cause of choroideremia JOURNAL Nature Genet. 1 (2), 109-113 (1992) MEDLINE 93250981 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 131460] from the original journal article. COMMENT T insertion in the donor splice site of exon C. FEATURES Location/Qualifiers source 1..56 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..15 /partial /gene="CHM" /allele="CHM*SAL" CDS 1..15 /partial /gene="CHM" /codon_start=1 BASE COUNT 29 a 3 c 12 g 12 t ORIGIN 1 atggagatag gttaagaaaa ataattgata actcatgatg gaaagaaaat gaacaa // LOCUS S62027 408 bp mRNA PRI 19-JUL-1993 DEFINITION transducin gamma subunit [human, mRNA, 408 nt]. ACCESSION S62027 NID g385284 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 408) AUTHORS Tao,L., Pandey,S., Simon,M.I. and Fong,H.K. TITLE Structure of the bovine transducin gamma subunit gene and analysis of promoter function in transgenic mice JOURNAL Exp. Eye Res. 56 (4), 497-507 (1993) MEDLINE 93272877 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 133279] from the original journal article. This sequence comes from Fig. 5B. FEATURES Location/Qualifiers source 1..408 /organism="Homo sapiens" /db_xref="taxon:9606" gene 9..233 /gene="transducin gamma subunit, T gamma" CDS 9..233 /gene="transducin gamma subunit, T gamma" /note="This sequence comes from Fig. 5B; T gamma" /codon_start=1 /product="transducin gamma subunit" /db_xref="PID:g385285" /translation="MPVINIEDLTEKDKLKMEVDQLKKEVTLERMLVSKCCEEVRDYV EERSGEDPLVKGIPEDKNPFKELKGGCVIS" BASE COUNT 176 a 53 c 80 g 99 t ORIGIN 1 gcaaaaagat gccagtaatc aatattgagg acctgacaga aaaggacaaa ttgaagatgg 61 aagttgacca gctcaagaaa gaagtgacac tggaaagaat gctagtttcc aaatgttgtg 121 aagaagtaag agattacgtt gaagaacgat ctggcgagga tccactggta aagggcatcc 181 cagaggacaa aaatcccttc aaggagctca aaggaggctg tgtgatttca taatacaaac 241 aaaaagaaaa aaaattaaac aaattcttgg aaatatctca aatgttaata acaatatgaa 301 tttttctcat gcatactatt actactaagc atgtacgtga atttttaaat ttatagatgt 361 aaacttttaa taaaaattgg ggtgtggtaa aaaaaaaaaa aaaaaaaa // LOCUS S62907 2189 bp mRNA PRI 17-AUG-1993 DEFINITION gamma-aminobutyric acidA receptor alpha 2 subunit [human, fetal brain, mRNA, 2189 nt]. ACCESSION S62907 NID g386421 KEYWORDS . SOURCE human fetal brain. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2189) AUTHORS Hadingham,K.L., Wingrove,P., Le Bourdelles,B., Palmer,K.J., Ragan,C.I. and Whiting,P.J. TITLE Cloning of cDNA sequences encoding human alpha 2 and alpha 3 gamma-aminobutyric acidA receptor subunits and characterization of the benzodiazepine pharmacology of recombinant alpha 1-, alpha 2-, alpha 3-, and alpha 5-containing human gamma-aminobutyric ac JOURNAL Mol. Pharmacol. 43 (6), 970-975 (1993) MEDLINE 93302739 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 134256] from the original journal article. This sequence comes from Fig. 1A. FEATURES Location/Qualifiers source 1..2189 /organism="Homo sapiens" /db_xref="taxon:9606" gene 214..1569 /gene="gamma-aminobutyric acidA receptor alpha 2 subunit, GABAA receptor alpha 2" CDS 214..1569 /gene="gamma-aminobutyric acidA receptor alpha 2 subunit, GABAA receptor alpha 2" /note="This sequence comes from Fig. 1A; GABAA receptor alpha 2" /codon_start=1 /product="gamma-aminobutyric acidA receptor alpha 2 subunit" /db_xref="PID:g386422" /translation="MKTKLNIYNIEFLLFVFLVWDPARLVLANIQEDEAKNNITIFTR ILDRLLDGYDNRLRPGLGDSITEVFTNIYVTSFGPVSDTDMEYTIDVFFRQKWKDERL KFKGPMNILRLNNLMASKIWTPDTFFHNGKKSVAHNMTMPNKLLRIQDDGTLLYTMRL TVQAECPMHLEDFPMDAHSCPLKFGSYAYTTSEVTYIWTYNASDSVQVAPDGSRLNQY DLLGQSIGKETIKSSTGEYTVMTAHFHLKRKIGYFVIQTYLPCIMTVILSQVSFWLNR ESVPARTVFGVTTVLTMTTLSISARNSLPKVAYATAMDWFIAVCYAFVFSALIEFATV NYFTKRGWTWDGKSVVNDKKKEKASVMIQNNAYAVAVANYAPNLSKDPVLSTISKSAT TPEPNKKPENKPAEAKKTFNSVSKIDRMSRIVFPVLFGTFNLVYWATYLNREPVLGVS P" BASE COUNT 640 a 465 c 419 g 665 t ORIGIN 1 cctagcgctc ctctccggct tccaccagcc catcgctcca cgctctcttg gctgctgcag 61 tctcggtctc tctctctctc tctctctctc tctctctctc tctctctctc tctctctctc 121 tctctctctc tctctcccaa gtttcctatc tcgtcaagat cagggcaaaa gaagaaaaca 181 ccgaattctg cttgccgttt cagagcggcg gtgatgaaga caaaattgaa catctacaac 241 atcgagttcc tgctttttgt tttcttggtg tgggaccctg ccaggttggt gctggctaac 301 atccaagaag atgaggctaa aaataacatt accatcttta cgagaattct tgacagactt 361 ctggatggtt acgataatcg gcttagacca ggactgggag acagtattac tgaagtcttc 421 actaacatct acgtgaccag ttttggccct gtctcagata cagatatgga atatacaatt 481 gatgttttct ttcgacaaaa atggaaagat gaacgtttaa aatttaaagg tcctatgaat 541 atccttcgac taaacaattt aatggctagc aaaatctgga ctccagatac cttttttcac 601 aatgggaaga aatcagtagc tcataatatg acaatgccaa ataagttgct tcgaattcag 661 gatgatggga ctctgctgta taccatgagg cttacagttc aagctgaatg cccaatgcac 721 ttggaggatt tcccaatgga tgctcattca tgtcctctga aatttggcag ctatgcatat 781 acaacttcag aggtcactta tatttggact tacaatgcat ctgattcagt acaggttgct 841 cctgatggct ctaggttaaa tcaatatgac ctgctgggcc aatcaatcgg aaaggagaca 901 attaaatcca gtacaggtga atatactgta atgacagctc atttccacct gaaaagaaaa 961 attgggtatt ttgtgattca aacctatctg ccttgcatca tgactgtcat tctctcccaa 1021 gtttcattct ggcttaacag agaatctgtg cctgcaagaa ctgtgtttgg agtaacaact 1081 gtcctaacaa tgacaactct aagcatcagt gctcggaatt ctctccccaa agtggcttat 1141 gcaactgcca tggactggtt tattgctgtt tgttatgcat ttgtgttctc tgccctaatt 1201 gaatttgcaa ctgttaatta cttcaccaaa agaggatgga cttgggatgg gaagagtgta 1261 gtaaatgaca agaaaaaaga aaaggcttcc gttatgatac agaacaacgc ttatgcagtg 1321 gctgttgcca attatgcccc gaatctttca aaagatccag ttctctccac catctccaag 1381 agtgcaacca cgccagaacc caacaagaag ccagaaaaca agccagctga agcaaagaaa 1441 actttcaaca gtgttagcaa aattgacaga atgtccagaa tagtttttcc agttttgttt 1501 ggtaccttta atttagttta ctgggctaca tatttaaaca gagaacctgt attaggggtc 1561 agtccttgaa ttgagaccca tgttatcttt gggatgtata gcaacattaa atttggtttg 1621 ttttgctatg tacagtctga ctaataactg ctaatttgtg atccaacatg tacagtatgt 1681 atatagtgac atagcttacc agtagacctt taatggagac atgcatttgc taactcatgg 1741 aactgcagac agaaagcact ccatgcgaaa acagccattg ccttttttaa agatttaccc 1801 taggacctga tttaaagtga atttcaagtg acctgattaa tttcctattc ttccaaatga 1861 gatgaaaatg gggatcctgt acaacccttt gtggaccctt ttggtttagc tcttaagtag 1921 gggtattttc tactgttgct taattatgat ggaagataac attgtcattc ctagatgaat 1981 cctttgaagt aacaaacatt gtatctgaca tcagctctgt tcatgagtgc tcagagtccc 2041 tgctaatgta attggaagct tggtacacat aagaaaaact agagatttga aatctagcta 2101 tgaattactc tatatagtat ctatagccat gtacatatta cagcatgaca agctcgaaat 2161 aattatgagt cagcccgaaa gatgttaat // LOCUS S62908 1637 bp mRNA PRI 17-AUG-1993 DEFINITION gamma-aminobutyric acidA receptor alpha 3 subunit [human, fetal brain, mRNA, 1637 nt]. ACCESSION S62908 NID g386423 KEYWORDS . SOURCE human fetal brain. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1637) AUTHORS Hadingham,K.L., Wingrove,P., Le Bourdelles,B., Palmer,K.J., Ragan,C.I. and Whiting,P.J. TITLE Cloning of cDNA sequences encoding human alpha 2 and alpha 3 gamma-aminobutyric acidA receptor subunits and characterization of the benzodiazepine pharmacology of recombinant alpha 1-, alpha 2-, alpha 3-, and alpha 5-containing human gamma-aminobutyric ac JOURNAL Mol. Pharmacol. 43 (6), 970-975 (1993) MEDLINE 93302739 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 134258] from the original journal article. This sequence comes from Fig. 1B. FEATURES Location/Qualifiers source 1..1637 /organism="Homo sapiens" /db_xref="taxon:9606" gene 86..1564 /gene="gamma-aminobutyric acidA receptor alpha 3 subunit, GABAA receptor alpha 3" CDS 86..1564 /gene="gamma-aminobutyric acidA receptor alpha 3 subunit, GABAA receptor alpha 3" /note="This sequence comes from Fig. 1B; GABAA receptor alpha 3" /codon_start=1 /product="gamma-aminobutyric acidA receptor alpha 3 subunit" /db_xref="PID:g386424" /translation="MIITQTSHCYMTSLGILFLINILPGTTGQGESRRQEPGDFVKQD IGGLSPKHAPDIPDDSTDNITIFTRILDRLLDGYDNRLRPGLGDAVTEVKTDIYVTSF GPVSDTDMEYTIDVFFRQTWHDERLKFDGPMKILPLNNLLASKIWTPDTFFHNGKKSV AHNMTTPNKLLRLVDNGTLLYTMRLTIHAECPMHLEDFPMDVHACPLKFGSYAYTTAE VVYSWTLGKNKSVEVAQDGSRLNQYDLLGHVVGTEIIRSSTGEYVVMTTHFHLKRKIG YFVIQTYLPCIMTVILSQVSFWLNRESVPARTVFGVTTVLTMTTLSISARNSLPKVAY ATAMDWFIAVCYAFVFSALIEFATVNYFTKRSWAWEGKKVPEALEMKKKTPAAPAKKT STTFNIVGTTYPINLAKDTEFSTISKGAAPSASSTPTIIASPKATYVQDSPTETKTYN SVSKVDKISRIIFPVLFAIFNLVYWATYVNRESAIKGMIRKQ" BASE COUNT 430 a 435 c 365 g 407 t ORIGIN 1 gaattccttg tttcagttca ttcatccttc tctcctttcc gctcagactg tagagctcgg 61 tctctccaag tttgtgccta agaagatgat aatcacacaa acaagtcact gttacatgac 121 cagccttggg attcttttcc tgattaatat tctccctgga accactggtc aaggggaatc 181 aagacgacaa gaacccgggg actttgtgaa gcaggacatt ggcgggctgt ctcctaagca 241 tgccccagat attcctgatg acagcactga caacatcact atcttcacca gaatcttgga 301 tcgtcttctg gacggctatg acaaccggct gcgacctggg cttggagatg cagtgactga 361 agtgaagact gacatctacg tgaccagttt tggccctgtg tcagacactg acatggagta 421 cactattgat gtattttttc ggcagacatg gcatgatgaa agactgaaat ttgatggccc 481 catgaagatc cttccactga acaatctcct ggctagtaag atctggacac cggacacctt 541 cttccacaat ggcaagaaat cagtggctca taacatgacc acgcccaaca agctgctcag 601 attggtggac aacggaaccc tcctctatac aatgaggtta acaattcatg ctgagtgtcc 661 catgcatttg gaagattttc ccatggatgt gcatgcctgc ccactgaagt ttggaagcta 721 tgcctataca acagctgaag tggtttattc ttggactctc ggaaagaaca aatccgtgga 781 agtggcacag gatggttctc gcttgaacca gtatgacctt ttgggccatg ttgttgggac 841 agagataatc cggtctagta caggagaata tgtcgtcatg acaacccact tccatctcaa 901 gcgaaaaatt ggctactttg tgatccagac ctacttgcca tgtatcatga ctgtcattct 961 gtcacaagtg tcgttctggc tcaacagaga gtctgttcct gcccgtacag tctttggtgt 1021 caccactgtg cttaccatga ccaccttgag tatcagtgcc agaaattcct tacctaaagt 1081 ggcatatgcg acggccatgg actggttcat agccgtctgt tatgcctttg tattttctgc 1141 actgattgaa tttgccactg tcaactattt caccaagcgg agttgggctt gggaaggcaa 1201 gaaggtgcca gaggccctgg agatgaagaa gaaaacacca gcagccccag caaagaaaac 1261 cagcactacc ttcaacatcg tggggaccac ctatcccatc aacctggcca aggacactga 1321 attttccacc atctccaagg gcgctgctcc cagtgcctcc tcaaccccaa caatcattgc 1381 ttcacccaag gccacctacg tgcaggacag cccgactgag accaagacct acaacagtgt 1441 cagcaaggtt gacaaaattt cccgcatcat ctttcctgtg ctctttgcca tattcaatct 1501 ggtctattgg gccacatatg tcaaccggga gtcagctatc aagggcatga tccgcaaaca 1561 gtagatagtg gcagtgcagc aaccagagca ctgtataccc cgtgaagcat ccaggcaccc 1621 aaaccccggg gctcccc // LOCUS S63912 3043 bp mRNA PRI 13-SEP-1993 DEFINITION D10S102=FBRNP [human, fetal brain, mRNA, 3043 nt]. ACCESSION S63912 NID g399757 KEYWORDS . SOURCE human fetal brain. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3043) AUTHORS Takiguchi,S., Tokino,T., Imai,T., Tanigami,A., Koyama,K. and Nakamura,Y. TITLE Identification and characterization of a cDNA, which is highly homologous to the ribonucleoprotein gene, from a locus (D10S102) closely linked to MEN2 (multiple endocrine neoplasia type 2) JOURNAL Cytogenet. Cell Genet. 64 (2), 128-130 (1993) MEDLINE 93327647 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 135453] from the original journal article. This sequence comes from Fig. 3. FEATURES Location/Qualifiers source 1..3043 /organism="Homo sapiens" /db_xref="taxon:9606" gene 31..840 /note="FBRNP" /gene="D10S102" CDS 31..840 /gene="D10S102" /note="heterogeneous ribonucleoprotein homolog; This sequence comes from Fig. 3" /codon_start=1 /product="FBRNP" /db_xref="PID:g399758" /translation="MEVKPPPGCPQPDSGSRRRRWGEEGHDPKEPEQLRKLFIGGLSF ETTDDSLREHFEKWGTLTDCLVMRDPQTKRSRGFGFVTYSCVTEVDAAIGARPFKVDG RVVEPKRAVSREDSVKPGAHLTVKKIFVGSIKEDTEEYNLRDYFEKYGKIETIEVMED RQSGKKRGFASVTFDDHDTVDKIVVQKYHTINGHNCEVKKALAKQVMQPAGSQRGRGG GSGNCMGHRGNFGGGGGNFGRDGNFGGRGGYGGGGGGSRGSYGGGDVDIMD" BASE COUNT 913 a 445 c 728 g 957 t ORIGIN 1 cgagttggaa gaggtgagtc ctgtctcaaa atggaggtaa aaccgccgcc tggttgcccc 61 cagcccgact ccggcagtcg ccgtcgccgc tggggggagg agggccatga tccaaaggaa 121 ccagagcagc tgagaaaact gtttattggt ggtctgagct ttgaaactac agatgatagt 181 ttaagagaac attttgagaa atggggcaca ctcacagatt gtctggtaat gagagacccc 241 caaacaaaac gttccagggg ctttggtttt gtgacttatt cttgtgttac agaggtggat 301 gcagcaatcg gtgctcgacc attcaaggtt gatgggcgtg tagtggaacc aaagagagct 361 gtttctagag aggattctgt gaagcctggt gcccatctaa cagtgaagaa aatttttgtt 421 ggcagtatta aagaagatac agaagaatat aatttgagag actactttga aaagtacggc 481 aagattgaaa ccatagaagt tatggaagac aggcagagtg gaaaaaagag aggatttgct 541 tctgtaactt ttgatgatca tgatacagtt gataaaattg ttgttcagaa ataccacact 601 attaatgggc ataactgtga agtgaaaaag gcccttgcta aacaagtgat gcagccggct 661 ggatcacaga ggggtcgtgg aggtggatct ggcaattgta tgggtcacag aggaaacttt 721 ggaggtggtg gaggtaattt tggccgtgat ggaaactttg gtggaagagg aggctatggt 781 ggtggaggtg gtggcagcag aggtagttat ggaggaggtg atgtggatat aatggattag 841 gaggtgatgg tggcaactat ggcagtggtc ctggttatag tagtagaggc gggtatggtg 901 gtggtggacc aggatatgga aaccaaggtg gtggatatgg tggcggtgtt ggaggatatg 961 atggttacaa tgaaggagga aattttgacg gtagtaacta tggtggtggt gggaactata 1021 atgattttgg aaattacagt ggacaacagc aatcaaatta tggacacatg aaagggggca 1081 gttttggtgg aagaagctcg ggcagtccct atggtggtgg ttatggatct ggtggtggaa 1141 gtggtggata tggtagcaga aggttctaaa aacagcagga aaagggctac agttcttagc 1201 aggagagaga gtgaggagtt gtcaggaaag ctgcaggtta ctttgagaca gtcgtcccaa 1261 gtgcattaga ggaactgtaa aaatctgtca cagaaggaac gatgatccat aatcagaaaa 1321 gttactgcag cttaaacagg aaacccttct tgttcaggac tgtcatagcc acagtttgca 1381 aaaagtgcag ctattgatta atgcaatgta gtgtcaatta gatgtacatt cctgaggttc 1441 ttttatctgt tgtagctttg tctttttctt tttcttttca ttacatcagg tatattgccc 1501 tgtaaattgt ggtagtggta ccaggaataa aaaattaagg aatttttaac ttttcaatat 1561 ttgtgtagtt cagtttttct acattttagt acagaaactt taacaaaatg cagtttcgaa 1621 ggtgtttcct tgtgagttaa caagtaaaga agatcattgt taattactat tttgtatgaa 1681 ttttgctaaa gttaactgta aagaaacacc tgctgacttg cagtttaagg ggaatctatt 1741 ctccccattt ccaaaccatg atatgaatgg gcgctgacat gtggagagaa tagataattt 1801 gtgtgtttgc aatgtgtgtt ttagataaat aggattgggt atttaaatta gcatttgtga 1861 atttaatagc attaagatta ccttcaaatg aaaaaaaatc tcaaaatttc tatttggttt 1921 ttgtgcattt tcttttaaaa tgtaatcata tgattttagt gtgttagact tgctgagtcc 1981 tagctgtgtt tagaacagaa catctctatt ctacatttac cttggtcaaa tttgaactgc 2041 tgccataggt tttgggtgta aagaatgttt actgccctcc atttaaattc tgaaaaggga 2101 tggtggatgt tttccctctc ctacgttaga aaccattctt aaaaactttt gaaaatatag 2161 aaccattaag cctgctatat ctgagcaaat tagtgggtaa ccttttttcc ttttttaaag 2221 cacaagaggc ccataaatct tgagttattt gcattagttt acattttttg atacaacttt 2281 tcagaccaag agaataaaaa tcatgcgtta ttaaacccct agctggctgg catgctttcc 2341 tgtttgtact gtatacattt tgctggatga aaccaagata gtttaggtat aattgtccaa 2401 aataacctaa ctgcagcaga aatgtaggac agttgcttag tacaggcttc tcacttccta 2461 cagacctgaa ttcaaatttg gatagtctga gttattaaat tcccaaagac aaagaacaca 2521 ctcttatttc ttgtgtatat ttcaacataa atcatgttgt taccaatttg ttgggaaggc 2581 cctggttgag aagagtttta gataataagg ctgtatatat atagatatat atagatatat 2641 accaatgtct atatatagag atattttata tatatatata caggtatata tatgtgtgtg 2701 tatatatata tatgtatata catatataca tatatatata tatatggata tatacatata 2761 tatatatata tggatatata cccatgtcta ctgttttgct tcagctagtg cttacaattt 2821 cattcaagtc ctgagtatgt gtcctgctgt tactccttct ttggtagttg aaagttgaat 2881 tcaagtcttt ccttctgttt taagaagtac taagcaaaca agcaataaaa aggggaatgg 2941 cgcatgctag tgtttgaata tgctctcttg ttgctctaat tctgtgcctc cgtgcattaa 3001 tatttggatg catgcaatgc cacatggaaa ttggcctcat ggc // LOCUS S65761 1515 bp mRNA PRI 24-NOV-1993 DEFINITION anti-colorectal carcinoma heavy chain=glycoprotein CANAG-50 specific IgG1 kappa [human, 19.9 hybridoma, antibody 1116NS19.9, mRNA, 1515 nt]. ACCESSION S65761 NID g425517 KEYWORDS . SOURCE human 19.9 hybridoma antibody 1116NS19.9. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1515) AUTHORS Tonge,D.W., Hennam,J.F., Greene,A.R., Lee,I.D. and Edge,M.D. TITLE Cloning and characterization of 1116NS19.9 heavy and light chain cDNAs and expression of antibody fragments in Escherichia coli JOURNAL Year Immunol. 7, 56-62 (1993) MEDLINE 93383497 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 138013] from the original journal article. This sequence comes from Fig. 2. FEATURES Location/Qualifiers source 1..1515 /organism="Homo sapiens" /db_xref="taxon:9606" gene 45..1421 /gene="anti-colorectal carcinoma heavy chain" CDS 45..1421 /gene="anti-colorectal carcinoma heavy chain" /note="glycoprotein CANAG-50 specific IgG1 kappa; Method: conceptual translation with partial peptide sequencing. This sequence comes from Fig. 2" /codon_start=1 /product="anti-colorectal carcinoma heavy chain" /db_xref="PID:g425518" /translation="MYLGLNYVFIVFLLNGVQSEVKLEESGGGLVQPGGSMKLSCAAS GFTFSDAWMDWVRQSPEKGLEWVAEIGNKGNNHATYYAESVKGRFTVSRDDSKSRVYL QMNSLRVEDTGTYYCTTRFAYWGQGTLVTVSAAKTTPPSVYPLAPGSAAQTNSMVTLG CLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLYTLSSSVTVPSSTWPSETVTCNV AHPASSTKVDKKIVPRDCGCKPCICTVPEVSSVFIFPPKPKDVLTITLTPKVTCVVVD ISKDDPEVQFSWFVDDVEVHTAQTQPREEQFNSTFRSVSELPIMHQDWLNGKEFKCRV NSAAFPAPIEKTISKTKGRPKAPQVYTIPPPKEQMAKDKVSLTCMITDFFPEDITVEW QWNGQPAENYKNTQPIMDTDGSYFVYSKLNVQKSNWEAGNTFTCSVLHEGLHNHHTEK SLSHSPGK" BASE COUNT 385 a 416 c 369 g 345 t ORIGIN 1 aagtttttct cttcagtgac agacacagac atagaacatt cacgatgtac ttgggactga 61 actatgtatt catagttttt ctcttaaatg gtgtccagag tgaagtgaag cttgaggagt 121 ctggaggagg cttggtgcaa cctggaggat ccatgaaact ctcttgtgct gcctctggat 181 tcacttttag tgacgcctgg atggactggg tccgccagtc tccagagaag gggcttgagt 241 gggttgctga aattggaaac aaaggtaata atcatgcaac gtactatgct gagtctgtga 301 aagggaggtt caccgtctca agagatgatt ccaaaagtag agtctacctg caaatgaaca 361 gcttaagagt tgaagacact ggcacttatt actgtaccac gcggtttgct tactggggcc 421 aagggactct ggtcactgtc tctgcagcca aaacgacacc cccatctgtc tatccactgg 481 cccctggatc tgctgcccaa actaactcca tggtgaccct gggatgcctg gtcaagggct 541 atttccctga gccagtgaca gtgacctgga actctggatc cctgtccagc ggtgtgcaca 601 ccttcccagc tgtcctgcag tctgacctct acactctgag cagctcagtg actgtcccct 661 ccagcacctg gcccagcgag accgtcacct gcaacgttgc ccacccggcc agcagcacca 721 aggtggacaa gaaaattgtg cccagggatt gtggttgtaa gccttgcata tgtacagtcc 781 cagaagtatc atctgtcttc atcttccccc caaagcccaa ggatgtgctc accattactc 841 tgactcctaa ggtcacgtgt gttgtggtag acatcagcaa ggatgatccc gaggtccagt 901 tcagctggtt tgtagatgat gtggaggtgc acacagctca gacgcaaccc cgggaggagc 961 agttcaacag cactttccgc tcagtcagtg aacttcccat catgcaccag gactggctca 1021 atggcaagga gttcaaatgc agggtcaaca gtgcagcttt ccctgccccc atcgagaaaa 1081 ccatctccaa aaccaaaggc agaccgaagg ctccacaggt gtacaccatt ccacctccca 1141 aggagcagat ggccaaggat aaagtcagtc tgacctgcat gataacagac ttcttccctg 1201 aagacattac tgtggagtgg cagtggaatg ggcagccagc ggagaactac aagaacactc 1261 agcccatcat ggacacagat ggctcttact tcgtctacag caagctcaat gtgcagaaga 1321 gcaactggga ggcaggaaat actttcacct gctctgtgtt acatgagggc ctgcacaacc 1381 accatactga gaagagcctc tcccactctc ctggtaaatg atcccagtgt ccttggagcc 1441 ctctggtcca tcaggactct gacacctacc tccacccctc cctgtataaa taaaagagcc 1501 cagcactgcc ttggg // LOCUS S65921 998 bp mRNA PRI 24-NOV-1993 DEFINITION anti-colorectal carcinoma light chain=glycoprotein CANAG-50 specific IgG1 kappa [human, 19.9 hybridoma, antibody 1116NS19.9, mRNA, 998 nt]. ACCESSION S65921 NID g425519 KEYWORDS . SOURCE human 19.9 hybridoma antibody 1116NS19.9. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 998) AUTHORS Tonge,D.W., Hennam,J.F., Greene,A.R., Lee,I.D. and Edge,M.D. TITLE Cloning and characterization of 1116NS19.9 heavy and light chain cDNAs and expression of antibody fragments in Escherichia coli JOURNAL Year Immunol. 7, 56-62 (1993) MEDLINE 93383497 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 138017] from the original journal article. This sequence comes from Fig. 2. FEATURES Location/Qualifiers source 1..998 /organism="Homo sapiens" /db_xref="taxon:9606" gene 80..790 /gene="anti-colorectal carcinoma light chain" CDS 80..790 /gene="anti-colorectal carcinoma light chain" /note="glycoprotein CANAG-50 specific IgG1 kappa; Method: conceptual translation with partial peptide sequencing. This sequence comes from Fig. 2" /codon_start=1 /product="anti-colorectal carcinoma light chain" /db_xref="PID:g425520" /translation="MDMRTPAQFLGILLLWFPGMKCDIKMTQSPSSMYASLGERVTIT CKASQDINSYLSWFQQKPGKSPKTLIYRANRLVDGVPSRFSGSGSGQDYSLTISSLEY EDMGIYYCLQYDEFPRTFGGGTKLEIKRADAAPTVSIFPPSSEQLTSGGASVVCFLNN FYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTLTLTKDEYERHNSYTCEA THKTSTSPIVKSFNRNEC" BASE COUNT 273 a 261 c 221 g 243 t ORIGIN 1 gcggaagtgt cggccgctcc ttggtgtgtg ttaactcagg aagataaaag acgacataaa 61 acaacagcca ggactcagca tggacatgag gacccctgct cagtttcttg gaatcttgtt 121 gctctggttt ccaggtatga aatgtgacat caagatgacc cagtctccat cttccatgta 181 tgcatctcta ggagagagag tcactatcac ttgcaaggcg agtcaggaca ttaatagcta 241 tttaagctgg ttccagcaga aaccagggaa atctcctaag accctgatct accgtgcaaa 301 cagattggta gatggggtcc catcaaggtt cagtggcagt ggatctgggc aagattattc 361 tctcaccatc agcagcctgg agtatgaaga tatgggaatt tattattgtc tacagtatga 421 tgagtttcct cggacgttcg gtggaggcac caagctggaa atcaaacggg ctgatgctgc 481 accaactgta tccatcttcc caccatccag tgagcagtta acatctggag gtgcctcagt 541 cgtgtgcttc ttgaacaact tctaccccaa agacatcaat gtcaagtgga agattgatgg 601 cagtgaacga caaaatggcg tcctgaacag ttggactgat caggacagca aagacagcac 661 ctacagcatg agcagcaccc tcacgttgac caaggacgag tatgaacgac ataacagcta 721 tacctgtgag gccactcaca agacatcaac ttcacccatt gtcaagagct tcaacaggaa 781 tgagtgttag agacaaaggt cctgagacgc caccaccagc tccccagctc caccctatct 841 tcccttctaa ggtcttggag gcttccccac aagcgaccta ccactgttgc ggtgctccaa 901 acctcctccc cacctccttc tcctcctcct ccctttcctt ggcttttatc atgctaatat 961 ttgcagaaaa tattcaataa agtcagtctt tgcacttg // LOCUS S66427 4834 bp mRNA PRI 17-DEC-1993 DEFINITION RBP1=retinoblastoma binding protein 1 [human, Nalm-6 pre-B cell leukemia, mRNA, 4834 nt]. ACCESSION S66427 NID g435775 KEYWORDS . SOURCE human Nalm-6 pre-B cell leukemia. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4834) AUTHORS Fattaey,A.R., Helin,K., Dembski,M.S., Dyson,N., Harlow,E., Vuocolo,G.A., Hanobik,M.G., Haskell,K.M., Oliff,A., Defeo-Jones,D. et,al. TITLE Characterization of the retinoblastoma binding proteins RBP1 and RBP2 JOURNAL Oncogene 8 (11), 3149-3156 (1993) MEDLINE 94020841 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 138855] from the original journal article. This sequence comes from Fig. 1a. FEATURES Location/Qualifiers source 1..4834 /organism="Homo sapiens" /db_xref="taxon:9606" gene 115..3888 /note="retinoblastoma binding protein 1, RBP1" /gene="RBP1" CDS 115..3888 /gene="RBP1" /note="This sequence comes from Fig. 1a." /codon_start=1 /product="retinoblastoma binding protein 1" /db_xref="PID:g435776" /translation="MKAADEPAYLTVGTDVSAKYRGAFCEAKIKTVKRLVKVKVLLKQ DNTTQLVQDDQVKGPLRVGAIVETRTSDGSFQEAIISKLTDASWYTVVFDDGDERTLR RTSLCLKGERHFAESETLDQLPLTNPEHFGTPVIAKKTNRGRRSSLPVTEDEKEEESS EEEDEDKRRLNDELLGKVVSVVSATERTEWYPALVISPSCNDDITVKKDQCLVRSFID SKFYSIARKDIKEVDILNLPESELSTKPGLQKASIFLKTRVVPDNWKMDISEILESSS SDDEDGPAEENDEEKEKEAKKTEEEVPEEELDPEERDNFLQQLYKFMEDRGTPINKPP VLGYKDLNLFKLFRLVYHQGGCDNIDSGAVWKQIYMDLGIPILNSAASYNLKTAYRKY LYGFEEYCRSANIQFRTVHHHEPKVKEEKKDLEESMEEALKLDQEMPLTEVKSEPEEN IDSNSESEREEIELKSPRGRRRIARDVNSIKKEIEEEKTEDKLKDNDTENKDVDDDYE TAEKKENELLLGRKNTPKQKEKKIKKQEDSDKDSDEEEEKSQEREETESKCDSEGEED EEDMEPCLTGTKVKVKYGRGKTQKIYEASIKSTEIDDGEVLYLVHYYGWNVSYDEWVK ADRIIWPLDKGGPKKKQKKKAKNKEDSEKDEKRDEERQKSKRGRPPLKSTLSSNMPYG LSKTANSEGKSDSCSSDSETEDALEKNLINEELSLKDELEKNENLNDDKLDEENPKIS AHILKENDRTQMQPLETLKLEVGENEQIVQIFGNKMEKAEEVKKEAEKSPKGKGRRSK TKDLSLEIIKISSFGQNEAGSEPHIEAHSLELSSLDNKNFSSATEDEIDQCVKEKKLK RKILGQSSPEKKIRIENGMEMTNTVSQERTSDCIGSEGMKNLNFEQHFERENEGMPSL IAESNQCIQQLTSERFDSPAEETVNIPLKEDEDAMPLIGPETLVCHEVDLDDLDEKDK TSIEDVAVESSESNSLVSIPPALPPVVQHNFSVASPLTLSQDESRSVKSESDITIEVD SIAEESQEGLCERESANGFETNVASGTCSIIVQERESREKGQKRPSDGNSGLMAKKQK RTPKRTSAAAKNEKNGTGQSSDSEDLPVLDNSSKCTPVKHLNVSKPQKLARSPARISP HIKDGEKDKHREKHPNSSPRTYKWSFQLNELDNMNSTERISFLQEKLQEIRKYYMSLK SEVATIDRRRKRLKKKDREVSHAGASMSSASSDTGMSPSSSSPPQNVLAVECR" BASE COUNT 1797 a 755 c 1051 g 1231 t ORIGIN 1 cgaggtcaga ggggaggagg actctggagc tgacagcgcg cacttcaccc gcagttgttc 61 tagcgactgc gaagatagct cgctgagctg gaaccccaca gatcaccaac aaaaatgaag 121 gcggcagatg agcctgccta cctgacagtg ggaaccgatg tcagtgccaa gtaccgaggt 181 gccttctgtg aggcaaagat taagactgtg aaaaggctgg tgaaagttaa ggtactcctg 241 aaacaggata ataccacaca attggtacaa gatgaccaag taaagggtcc tttaagagtt 301 ggagctattg ttgaaacaag gacatctgat ggatcttttc aggaagctat tatcagcaag 361 ttgacagatg ctagttggta taccgtggtg tttgatgatg gtgatgagcg aacattgaga 421 cgtacctcac tttgtctgaa aggagagaga cattttgcag agagtgagac acttgaccag 481 cttccattaa caaatccaga gcattttgga actccagtaa ttgcaaagaa gacgaacaga 541 ggaaggagat cttctcttcc tgttactgaa gatgaaaagg aagaagaaag cagtgaagag 601 gaagatgaag acaagcgccg tctcaatgat gaattactag gaaaagttgt aagtgtggtg 661 tctgcaacgg agaggactga atggtatcct gctttggtaa tatctcccag ctgtaatgat 721 gacatcacag tgaaaaagga tcagtgttta gttcgatcat ttattgattc taaattttac 781 tctatagcaa gaaaggacat taaggaagta gacattctca atctaccgga atctgagctc 841 tccactaaac cagggcttca gaaagcaagc atcttcttaa aaactagagt tgttcctgat 901 aattggaaaa tggatataag tgaaatcctt gagtcatcca gtagtgatga tgaagatggc 961 ccagctgaag aaaatgatga agagaaggaa aaggaggcca aaaagacaga agaagaggtg 1021 cctgaggaag aacttgatcc tgaagagagg gacaacttcc tccagcagct ttataagttt 1081 atggaagaca gaggtactcc aatcaacaaa ccacctgttt tgggctataa agatctcaat 1141 ctcttcaaac tcttcagact ggtttatcat cagggtggat gtgacaatat tgatagtggt 1201 gctgtatgga agcaaattta tatggacctt ggcattccta ttttgaattc agctgcttcc 1261 tacaatctaa aaactgctta tagaaagtat ctctatggtt ttgaggagta ctgccgttcg 1321 gcaaatattc agttcagaac tgttcatcac catgaaccaa aagtaaaaga ggaaaaaaaa 1381 gacttagaag aatcaatgga agaggctctc aaattagatc aagaaatgcc tttaacagaa 1441 gtgaagagtg aacctgagga aaatatcgat tcaaacagtg aaagtgaaag agaagagata 1501 gaattaaaat ctccgagggg acgaaggaga attgctcgag atgtaaattc tattaaaaag 1561 gaaattgaag aagagaaaac agaagacaaa ttaaaagata atgatacaga aaataaggat 1621 gtagatgatg actatgaaac tgcagagaaa aaagaaaatg agctactact ggggagaaaa 1681 aatacaccaa agcaaaaaga gaagaaaatt aaaaaacagg aggattctga caaagactca 1741 gatgaagagg aagagaaaag ccaagagagg gaagaaactg aaagcaaatg tgactctgaa 1801 ggagaggaag atgaggaaga catggaaccc tgcctaacag gaaccaaagt gaaagtaaaa 1861 tatggacgag ggaagacgca gaaaatttat gaagccagta ttaaaagcac tgaaattgat 1921 gacggagaag ttttatattt ggtacattac tatggatgga atgtcagtta tgatgagtgg 1981 gtgaaggctg acaggataat ctggcctttg gacaaaggtg gaccaaagaa aaaacagaag 2041 aaaaaagcta aaaataaaga agatagtgaa aaggacgaaa agagagatga ggagaggcag 2101 aagtcaaaac ggggacgacc tcctttaaaa tcaaccctct catcaaacat gccgtatggc 2161 ttatctaaga cagcaaacag tgaaggaaaa tcagactctt gttcatctga tagtgaaaca 2221 gaagatgctt tagaaaagaa tttaataaat gaagaacttt ctcttaaaga tgaactagaa 2281 aaaaatgaaa atttgaatga tgataagcta gatgaagaaa atccaaagat ttctgcacat 2341 atattaaaag aaaatgatag gactcaaatg cagcctttag aaaccctgaa gttagaagtt 2401 ggagagaatg aacaaatagt acagattttt gggaacaaaa tggaaaaagc agaagaagtt 2461 aagaaagaag ccgaaaaatc tccaaaagga aagggaagac gaagcaagac aaaagatctt 2521 tctttagaaa ttataaagat ttcatcattt ggccagaatg aagcaggaag tgaacctcat 2581 atagaagctc atagtcttga attgtcttca ttagacaata aaaacttttc ttctgctaca 2641 gaagatgaaa ttgaccaatg tgtgaaagaa aagaagttga aacggaaaat actaggacaa 2701 tcatcgccag agaaaaaaat aagaattgag aatggaatgg aaatgacaaa tactgtatct 2761 caagaaagga ccagtgattg tattggatct gagggaatga aaaacttaaa ttttgaacag 2821 cactttgaaa gagaaaatga aggaatgcca tcattgatag cagagtcaaa ccaatgcatc 2881 caacaactga ctagtgaaag atttgatagt ccagctgaag aaactgtaaa tattccacta 2941 aaagaagatg aggatgcaat gcctctgatc gggcctgaaa ccttggtttg ccatgaagta 3001 gatttggatg atttggatga aaaggataag accagcattg aggatgtagc agttgaaagc 3061 tctgagtcta actctcttgt ttctattcca cctgccctac ctcctgtagt ccaacataac 3121 ttttcagtag cttcaccact tactcttagt caagatgagt ctcgaagcgt aaaaagtgag 3181 agtgatataa cgattgaagt tgatagtatt gctgaagaat ctcaagaagg tctctgtgag 3241 agggaatcgg caaatggatt tgaaactaat gttgcctctg gtacctgtag tataattgta 3301 caagagagag agagcagaga gaagggtcag aagaggccaa gtgatggaaa tagtggatta 3361 atggcaaaaa agcaaaagcg taccccaaag cgaacaagtg ctgcagccaa aaatgaaaag 3421 aatggaacag gacaaagcag tgatagtgaa gatctccctg tcctagacaa ttcaagtaaa 3481 tgtaccccag taaagcatct taatgtatct aagccacaga aacttgcacg atctcctgca 3541 agaatatccc cgcacatcaa agatggagag aaagataaac acagagaaaa acatccgaat 3601 tcatccccta ggacatataa atggagcttt cagctcaatg aattagataa tatgaacagt 3661 acagagagaa tctcatttct ccaagaaaaa ctacaggaaa tcagaaaata ttatatgtct 3721 ttgaagtctg aagttgcaac catagacagg aggagaaaaa gattaaaaaa gaaagacagg 3781 gaagtgtctc atgcgggagc ctccatgtca tctgcttcat cagacactgg aatgagtccc 3841 tcatcatcat ctcccccaca aaatgtactt gctgtagaat gcaggtgata aacattttct 3901 ctaccttccc agcagtttgc tgccatggac ataaatcccc aaaccctgaa ttacaaccac 3961 agaaagcact caactggttt gacattgcta agtatatcct gtatactttt ccaggctgga 4021 ttgtatctat tcccctctct cttctttttt cttgttgcaa aaaataagct gattaataag 4081 tgaaggttaa gcagcctgcc atatttgtca taatttttcc tctttacttt tgtttttcgt 4141 ttgttgtgat atagaacaaa gggcacttag caaatttgaa tttgtataat aaagctttca 4201 ggtgttacag aaatcgtaga caagcaagtg cacatgataa acaatcaaaa tattacccag 4261 ctgaatagtt actgctgcac tttcactaag atgtatttga acacttggtg agtagggggt 4321 ttatgttgtg ttttttttca ttatcgtttt ttttattttt gtgaagcact tgctatttag 4381 aactgccaaa gtatatgttc agcagtgtgc ccaggattga aggtgtaaat gggacaaaat 4441 aaattgtgaa aggaagtgta gttgactgaa aactacagtt gtaataagtc ttccactttt 4501 tataggattt ttgagcacac aattatgcaa atattttaat gtttattaat gtttacagtg 4561 gaattgtgaa taagttttca gtggactatc ttatcccttg acaaaaatat tttgtctttt 4621 ttctatgtaa tttcagagtt tttattttgt tacaaaaaga caaaaatgaa atatataaca 4681 acaatgaagt tatttaacaa gatttctaaa gctgaaattt ttgtgtaaaa taaggtatta 4741 tcttgcaact tgttaaatat atttattcag acattggatg ttgtattttt atgtattttt 4801 taaaatatta ataaaattta aaaaaaaaaa aaaa // LOCUS S66901S1 179 bp mRNA PRI 13-JAN-1994 DEFINITION oct-1B=POU homeodomain protein {alternatively spliced} [human, NTera 2D1, mRNA Partial, 179 nt, segment 1 of 2]. ACCESSION S66901 NID g440976 KEYWORDS . SEGMENT 1 of 2 SOURCE human NTera 2D1. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 179) AUTHORS Das,G. and Herr,W. TITLE Enhanced activation of the human histone H2B promoter by an Oct-1 variant generated by alternative splicing JOURNAL J. Biol. Chem. 268 (33), 25026-25032 (1993) MEDLINE 94043371 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 139674] from the original journal article. This sequence comes from Fig. 1C. FEATURES Location/Qualifiers source 1..179 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..174 /partial /note="Oct-1B" /gene="oct-1B" CDS 1..174 /partial /gene="oct-1B" /note="POU homeodomain protein; This sequence comes from Fig. 1C" /codon_start=1 /product="Oct-1B" /db_xref="PID:g440978" /translation="MADGGAASQDESSAAAAAAADWKSKKSFPAFLITKLLSVFNESV QRKNAVFLYLTQE" BASE COUNT 57 a 37 c 45 g 40 t ORIGIN 1 atggcggacg gaggagcagc gagtcaagat gagagttcag ccgcggcggc agcagcagca 61 gactggaaaa gtaagaagag ctttcctgcc tttttaatta ccaaactact ctcagttttc 121 aatgaatcag ttcaaagaaa gaatgcagtc tttctatacc tgactcaaga atgaactgg // LOCUS S67156 1435 bp mRNA PRI 17-FEB-1994 DEFINITION ASP=aspartoacylase [human, kidney, mRNA, 1435 nt]. ACCESSION S67156 NID g455833 KEYWORDS . SOURCE human kidney. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1435) AUTHORS Kaul,R., Gao,G.P., Balamurugan,K. and Matalon,R. TITLE Cloning of the human aspartoacylase cDNA and a common missense mutation in Canavan disease [see comments] JOURNAL Nature Genet. 5 (2), 118-123 (1993) MEDLINE 94073185 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 140584] from the original journal article. This sequence comes from Fig. 1. FEATURES Location/Qualifiers source 1..1435 /organism="Homo sapiens" /db_xref="taxon:9606" gene 159..1100 /note="aspartoacylase, ASP" /gene="ASP" CDS 159..1100 /gene="ASP" /note="This sequence comes from Fig. 1." /codon_start=1 /product="aspartoacylase" /db_xref="PID:g455834" /translation="MTSCHIAEEHIQKVAIFGGTHGNELTGVFLVKHWLENGAEIQRT GLEVKPFITNPRAVKKCTRYIDCDLNRIFDLENLGKKMSEDLPYEVRRAQEINHLFGP KDSEDSYDIIFDLHNTTSNMGCTLILEDSRNNFLIQMFHYIKTSLAPLPCYVYLIEHP SLKYATTRSIAKYPVGIEVGPQPQGVLRADILDQMRKMIKHALDFIHHFNEGKEFPPC AIEVYKIIEKVDYPRDENGEIAAIIHPNLQDQDWKPLHPGDPMFLTLDGKTIPLGGDC TVYPVFVNEAAYYEKKEAFAKTTKLTLNAKSIRCCLH" BASE COUNT 465 a 269 c 261 g 440 t ORIGIN 1 ttgtaacaga aaattaaaat atactccact caagggaatt ctgtactttg cccttttggt 61 aaagtctcat ttacatttct aaacctttct taagaaaatc gaatttcctt tgatctctct 121 tctgaattgc agaaatcaga taaaaactac ttggtgaaat gacttcttgt cacattgctg 181 aagaacatat acaaaaggtt gctatctttg gaggaaccca tgggaatgag ctaaccggag 241 tatttctggt taagcattgg ctagagaatg gcgctgagat tcagagaaca gggctggagg 301 taaaaccatt tattactaac cccagagcag tgaagaagtg taccagatat attgactgtg 361 acctgaatcg catttttgac cttgaaaatc ttggcaaaaa aatgtcagaa gatttgccat 421 atgaagtgag aagggctcaa gaaataaatc atttatttgg tccaaaagac agtgaagatt 481 cctatgacat tatttttgac cttcacaaca ccacctctaa catggggtgc actcttattc 541 ttgaggattc caggaataac tttttaattc agatgtttca ttacattaag acttctctgg 601 ctccactacc ctgctacgtt tatctgattg agcatccttc cctcaaatat gcgaccactc 661 gttccatagc caagtatcct gtgggtatag aagttggtcc tcagcctcaa ggggttctga 721 gagctgatat cttggatcaa atgagaaaaa tgattaaaca tgctcttgat tttatacatc 781 atttcaatga aggaaaagaa tttcctccct gcgccattga ggtctataaa attatagaga 841 aagttgatta cccccgggat gaaaatggag aaattgctgc tatcatccat cctaatctgc 901 aggatcaaga ctggaaacca ctgcatcctg gggatcccat gtttttaact cttgatggga 961 agacgatccc actgggcgga gactgtaccg tgtaccccgt gtttgtgaat gaggccgcat 1021 attacgaaaa gaaagaagct tttgcaaaga caactaaact aacgctcaat gcaaaaagta 1081 ttcgctgctg tttacattag aaatcacttc cagcttacat cttacacggt gtcttacaaa 1141 ttctgctagt ctgtaagctc cttaagagta gggttgtgcc ttattcaact gcatacatag 1201 ctcctagcac agtgccttat tcggtaggca tctaagcaaa tttcttaaat taattaatat 1261 atctttaaag atatcatatt ttatgtatgt agcttattca aagaagtgtt tcctatttct 1321 atatagttta ttatacatga tacttgggta gctcaacatt cttaataaac agcctttgta 1381 ttcagaatat aaaattgaaa tagatatata taaagttaaa aaaaaaaaaa aaaaa // LOCUS S67325 1791 bp mRNA PRI 17-FEB-1994 DEFINITION propionyl CoA carboxylase beta subunit [human, liver, placenta, HL1008, mRNA, 1791 nt]. ACCESSION S67325 NID g455712 KEYWORDS . SOURCE human liver HL1008 placenta. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1791) AUTHORS Ohura,T., Ogasawara,M., Ikeda,H., Narisawa,K. and Tada,K. TITLE The molecular defect in propionic acidemia: exon skipping caused by an 8-bp deletion from an intron in the PCCB allele JOURNAL Hum. Genet. 92 (4), 397-402 (1993) MEDLINE 94041326 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 140814] from the original journal article. This sequence comes from Fig. 1. FEATURES Location/Qualifiers source 1..1791 /organism="Homo sapiens" /db_xref="taxon:9606" gene 31..1650 /gene="propionyl CoA carboxylase beta subunit, beta PCC" CDS 31..1650 /gene="propionyl CoA carboxylase beta subunit, beta PCC" /note="precursor. This sequence comes from Fig. 1; beta PCC" /codon_start=1 /product="propionyl CoA carboxylase beta subunit" /db_xref="PID:g455713" /translation="MAAALRVAAVGARLSVLASGLRAAVRSLCSQATSVNERIENKRR TALLGGGQRRIDAHDKRGKLTARERISLLLDPGSFVESDMFVEHRCADFGMAADKNKF PGDSVVTGRGRINGRLVYVFSQDFTVFGGSLSGAHAQKICKIMDQAITVGAPVIGLND SGGARIQEGVESLAGYADIFLRNVTASGVIPQISLIMGPCAGGAVYSPALTDFTFMVK DTSYLFITGPDVVKSVTNEDVTQEELGGAKTHTTMSGVAHRAFENDVDALCNLRDFFN YLPLSSQDPAPVRECHDPSDRLVPELDTIVPLESTKAYNMVDIIHSVVDEREFFEIMP NYAKNIIVGFARMNGRTVGIVGNQPKVASGCLDINSSVKGARFVRFCDAFNIPLITFV DVPGFLPGTAQEYGGIIRHGAKLLYAFAEATVPKVTVITRKAYGGAYDVMSSKHLCGD TNYAWPTAEIAVMGAKGAVEIIFKGHENVEAAQAEYIEKFANPFPAAVRGFVDDIIQP SSTRARICCDLDVLASKKVQRPWRKHANIPL" BASE COUNT 445 a 434 c 480 g 432 t ORIGIN 1 gccggtaggg gacgcgccgg cacagcaaaa atggcggcgg cattacgggt ggcggcggtc 61 ggggcaaggc tcagcgttct ggcgagcggt ctccgcgccg cggtccgcag cctttgcagc 121 caggccacct ctgttaacga acgcatcgaa aacaagcgcc ggaccgcgct gctgggaggg 181 ggccaacgcc gtattgacgc gcacgacaag cgaggaaagc taacagccag ggagaggatc 241 agtctcttgc tggaccctgg cagctttgtt gagagcgaca tgtttgtgga acacagatgt 301 gcagattttg gaatggctgc tgataagaat aagtttcctg gagacagcgt ggtcactgga 361 cgaggccgaa tcaatggaag attggtttat gtcttcagtc aggattttac agtttttgga 421 ggcagtctgt caggagcaca tgcccaaaag atctgcaaaa tcatggacca ggccataacg 481 gtgggggctc cagtgattgg gctgaatgac tctgggggag cacggatcca agaaggagtg 541 gagtctttgg ctggctatgc agacatcttt ctgaggaatg ttacggcatc cggagtcatc 601 cctcagattt ctctgatcat gggcccatgt gctggtgggg ccgtctactc cccagcccta 661 acagacttca cgttcatggt aaaggacacc tcctacctgt tcatcactgg ccctgatgtt 721 gtgaagtctg tcaccaatga ggatgttacc caggaggagc tcggtggtgc caagacccac 781 accaccatgt caggtgtggc ccacagagct tttgaaaatg atgttgatgc cttgtgtaat 841 ctccgggatt tcttcaacta cctgcccctg agcagtcagg acccggctcc cgtccgtgag 901 tgccacgatc ccagtgaccg tctggttcct gagcttgaca caattgtccc tttggaatca 961 accaaagcct acaacatggt ggacatcata cactctgttg ttgatgagcg tgaatttttt 1021 gagatcatgc ccaattatgc caagaacatc attgttggtt ttgcaagaat gaatgggagg 1081 actgttggaa ttgttggcaa ccaacctaag gtggcctcag gatgcttgga tattaattca 1141 tctgtgaaag gggctcgttt tgtcagattc tgtgatgcat tcaatattcc actcatcact 1201 tttgttgatg tccctggctt tctacctggc acagcacagg aatacggggg catcatccgg 1261 catggtgcca agcttctcta cgcatttgct gaggcaactg tacccaaagt cacagtcatc 1321 accaggaagg cctatggagg tgcctatgat gtcatgagct ctaagcacct ttgtggtgat 1381 accaactatg cctggcccac cgcagagatt gcagtcatgg gagcaaaggg cgctgtggag 1441 atcatcttca aagggcatga gaatgtggaa gctgctcagg cagagtacat cgagaagttt 1501 gccaaccctt tccctgcagc agtgcgaggg tttgtggatg acatcatcca accttcttcc 1561 acacgtgccc gaatctgctg tgacctggat gtcttggcca gcaagaaggt acaacgtcct 1621 tggagaaaac atgcaaatat tccattgtaa acaaatcaaa ggaaaagaaa ccaagaactg 1681 aattactgtc tgcccattca catcccattc ctgccttttg caatcatcaa acctgggaat 1741 ccaaatagtt ggataactta gaataactaa gtttattaaa ttctagaaag a // LOCUS S67334 3213 bp mRNA PRI 17-FEB-1994 DEFINITION phosphatidylinositol 3-kinase p110 beta isoform=110 kda catalytic subunit [human, mRNA Partial, 3213 nt]. ACCESSION S67334 NID g455759 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3213) AUTHORS Hu,P., Mondino,A., Skolnik,E.Y. and Schlessinger,J. TITLE Cloning of a novel, ubiquitously expressed human phosphatidylinositol 3-kinase and identification of its binding site on p85 JOURNAL Mol. Cell. Biol. 13 (12), 7677-7688 (1993) MEDLINE 94067128 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 140879] from the original journal article. This sequence comes from Fig. 1. FEATURES Location/Qualifiers source 1..3213 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..3213 /gene="phosphatidylinositol 3-kinase p110 beta isoform, PI 3-kinase p110 beta" CDS 1..3213 /gene="phosphatidylinositol 3-kinase p110 beta isoform, PI 3-kinase p110 beta" /note="110 kda catalytic subunit; This sequence comes from Fig. 1; PI 3-kinase p110 beta" /codon_start=1 /product="phosphatidylinositol 3-kinase p110 beta isoform" /db_xref="PID:g455760" /translation="MCFSFIMPPAMADILDIWAVDSQIASDGSIPVDFLLPTGIYIQL EVPREATISYIKQMLWKQVHNYPMFNLLMDIDSYMFACVNQTAVYEELEDETRRLCDV RPFLPVLKLVTRSCDPGEKLDSKIGVLIGKGLHEFDSLKDPEVNEFRRKMRKFSEEKI LSLVGLSWMDWLKQTYPPEHEPSIPENLEDKLYGGKLIVAVHFENCQDVFSFQVSPNM NPIKVNELAIQKRLTIHGKEDEVSPYDYVLQVSGRVEYVFGDHPLIQFQYIRNCVMNR ALPHFILVECCKIKKMYEQEMIAIEAAINRNSSNLPLPLPPKKTRIISHVWENNNPFQ IVLVKGNKLNTEETVKVHVRAGLFHGTELLCKTIVSSEVSGKNDHIWNEPLEFDINIC DLPRMARLCFAVYAVLDKVKTKKSTKTINPSKYQTIRKAGKVHYPVAWVNTMVFDFKG QLRTGDIILHSWSSFPDELEEMLNPMGTVQTNPYTENATALHVKFPENKKQPYYYPPF DKIIEKAAEIASSDSANVSSRGGKKFLPVLKEILDRDPLSQLCENEMDLIWTLRQDCR EIFPQSLPKLLLSIKWNKLEDVAQLQALLQIWPKLPPREALELLDFNYPDQYVREYAV GCLRQMSDEELSQYLLQLVQVLKYEPFLDCALSRFLLERALGNRRIGQFLFWHLRSEV HIPAVSVQFGVILEAYCRGSVGHMKVLSKQVEALNKLKTLNSLIKLNAVKLNRAKGKE AMHTCLKQSAYREALSDLQSPLNPCVILSELYVEKCKYMDSKMKPLWLVYNNKVFGED SVGVIFKNGDDLRQDMLTLQMLRLMDLLWKEAGLDLRMLPYGCLATGDRSGLIEVVST SETIADIQLNSSNVAAAAAFNKDALLNWLKEYNSGDDLDRAIEEFTLSCAGYCVASYV LGIGDRHSDNIMVKKTGQLFHIDFGHILGNFKSKFGIKRERVPFILTYDFIHVIQQGK TGNTEKFGRFRQCCEDAYLILRRHGNLFITLFALMLTAGLPELTSVKDIQYLKDSLAL GKSEEEALKQFKQKFDEALRESWTTKVNWMAHTVRKDYRS" BASE COUNT 979 a 612 c 704 g 918 t ORIGIN 1 atgtgcttca gtttcataat gcctcctgct atggcagaca tccttgacat ctgggcggtg 61 gattcacaga tagcatctga tggctccata cctgtggatt tccttttgcc cactgggatt 121 tatatccagt tggaggtacc tcgggaagct accatttctt atattaagca gatgttatgg 181 aagcaagttc acaattaccc aatgttcaac ctccttatgg atattgactc ctatatgttt 241 gcatgtgtga atcagactgc tgtatatgag gagcttgaag atgaaacacg aagactctgt 301 gatgtcagac cttttcttcc agttctcaaa ttagtgacaa gaagttgtga cccaggggaa 361 aaattagact caaaaattgg agtccttata ggaaaaggtc tgcatgaatt tgattccttg 421 aaggatcctg aagtaaatga atttcgaaga aaaatgcgca aattcagcga ggaaaaaatc 481 ctgtcacttg tgggattgtc ttggatggac tggctaaaac aaacatatcc accagagcat 541 gaaccatcca tccctgaaaa cttagaagat aaactttatg ggggaaagct catcgtagct 601 gttcattttg aaaactgcca ggacgtgttt agctttcaag tgtctcctaa tatgaatcct 661 atcaaagtaa atgaattggc aatccaaaaa cgtttgacta ttcatgggaa ggaagatgaa 721 gttagcccct atgattatgt gttgcaagtc agcgggagag tagaatatgt ttttggtgat 781 catccactaa ttcagttcca gtatatccgg aactgtgtga tgaacagagc cctgccccat 841 tttatacttg tggaatgctg caagatcaag aaaatgtatg aacaagaaat gattgccata 901 gaggctgcca taaatcgaaa ttcatctaat cttcctcttc cattaccacc aaagaaaaca 961 cgaattattt ctcatgtttg ggaaaataac aaccctttcc aaattgtctt ggttaaggga 1021 aataaactta acacagagga aactgtaaaa gttcatgtca gggctggtct ttttcatggt 1081 actgagctcc tgtgtaaaac catcgtaagc tcagaggtat cagggaaaaa tgatcatatt 1141 tggaatgaac cactggaatt tgatattaat atttgtgact taccaagaat ggctcgatta 1201 tgttttgctg tttatgcagt tttggataaa gtaaaaacga agaaatcaac gaaaactatt 1261 aatccctcta aatatcagac catcaggaaa gctggaaaag tgcattatcc tgtagcgtgg 1321 gtaaatacga tggtttttga ctttaaagga caattgagaa ctggagacat aatattacac 1381 agctggtctt catttcctga tgaactcgaa gaaatgttga atccaatggg aactgttcaa 1441 acaaatccat atactgaaaa tgcaacagct ttgcatgtta aatttccaga gaataaaaaa 1501 caaccttatt attaccctcc cttcgataag attattgaaa aggcagctga gattgcaagc 1561 agtgatagtg ctaatgtgtc aagtcgaggt ggaaaaaagt ttcttcctgt attgaaagaa 1621 atcttggaca gggatccctt gtctcaactg tgtgaaaatg aaatggatct tatttggact 1681 ttgcgacaag actgccgaga gattttccca caatcactgc caaaattact gctgtcaatc 1741 aagtggaata aacttgagga tgttgctcag cttcaggcgc tgcttcagat ttggcctaaa 1801 ctgccccccc gggaggccct agagcttctg gatttcaact atccagacca gtacgttcga 1861 gaatatgctg taggctgcct gcgacagatg agtgatgaag aactttctca atatctttta 1921 caactggtgc aagtgttaaa atatgagcct tttcttgatt gtgccctctc tagattccta 1981 ttagaaagag cacttggtaa tcggaggata gggcagtttc tattttggca tcttaggtca 2041 gaagtgcaca ttcctgctgt ctcagtacaa tttggtgtca tccttgaagc atactgccgg 2101 ggaagtgtgg ggcacatgaa agtgctttct aagcaggttg aagcactcaa taagttaaaa 2161 actttaaata gtttaatcaa actgaatgcc gtgaagttaa acagagccaa agggaaggag 2221 gccatgcata cctgtttaaa acagagtgct taccgggaag ccctctctga cctgcagtca 2281 cccctgaacc catgtgttat cctctcagaa ctctatgttg aaaagtgcaa atacatggat 2341 tccaaaatga agcctttgtg gctggtatac aataacaagg tatttggtga ggattcagtt 2401 ggagtgattt ttaaaaatgg tgatgattta cgacaggata tgttgacact ccaaatgttg 2461 cgcttgatgg atttactctg gaaagaagct ggtttggatc ttcggatgtt gccttatggc 2521 tgtttagcaa caggagatcg ctctggcctc attgaagttg tgagcacctc tgaaacaatt 2581 gctgacattc agctgaacag tagcaatgtg gctgctgcag cagccttcaa caaagatgcc 2641 cttctgaact ggcttaaaga atacaactct ggggatgacc tggaccgagc cattgaggaa 2701 tttacactgt cctgtgctgg ctactgtgta gcttcttatg tccttgggat tggtgacaga 2761 catagtgaca acatcatggt caaaaaaact ggccagctct tccacattga ctttggacat 2821 attcttggaa atttcaaatc taagtttggc attaaaaggg agcgagtgcc ttttattctt 2881 acctatgatt tcatccatgt cattcaacaa ggaaaaacag gaaatacaga aaagtttggc 2941 cggttccgcc agtgttgtga ggatgcatat ctgattttac gacggcatgg gaatctcttc 3001 atcactctct ttgcgctgat gttgactgca gggcttcctg aactcacatc agtcaaagat 3061 atacagtatc ttaaggactc tcttgcatta gggaagagtg aagaagaagc actcaaacag 3121 tttaagcaaa aatttgatga ggcgctcagg gaaagctgga ctactaaagt gaactggatg 3181 gcccacacag ttcggaaaga ctacagatct taa // LOCUS S67368 1860 bp mRNA PRI 17-FEB-1994 DEFINITION GABRB2=gamma-aminobutyric acid A receptor beta 2 subunit [human, cerebellum, mRNA, 1860 nt]. ACCESSION S67368 NID g455945 KEYWORDS . SOURCE human cerebellum. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1860) AUTHORS Hadingham,K.L., Wingrove,P.B., Wafford,K.A., Bain,C., Kemp,J.A., Palmer,K.J., Wilson,A.W., Wilcox,A.S., Sikela,J.M., Ragan,C.I. et,al. TITLE Role of the beta subunit in determining the pharmacology of human gamma-aminobutyric acid type A receptors JOURNAL Mol. Pharmacol. 44 (6), 1211-1218 (1993) MEDLINE 94088484 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 140953] from the original journal article. This sequence comes from Fig. 1. Map location: 4. FEATURES Location/Qualifiers source 1..1860 /organism="Homo sapiens" /db_xref="taxon:9606" gene 219..1643 /note="gamma-aminobutyric acid A receptor beta 2 subunit, (GABA)A receptor beta 2 subunit" /gene="GABRB2" CDS 219..1643 /gene="GABRB2" /note="This sequence comes from Fig. 1; (GABA)A receptor beta 2 subunit" /codon_start=1 /product="gamma-aminobutyric acid A receptor beta 2 subunit" /db_xref="PID:g455946" /translation="MWRVRKRGYFGIWSFPLIIAAVCAQSVNDPSNMSLVKETVDRLL KGYDIRLRPDFGGPPVAVGMNIDIASIDMVSEVNMDYTLTMYFQQAWRDKRLSYNVIP LNLTLDNRVADQLWVPDTYFLNDKKSFVHGVTVKNRMIRLHPDGTVLYGLRITTTAAC MMDLRRYPLDEQNCTLEIESYGYTTDDIEFYWRGDDNAVTGVTKIELPQFSIVDYKLI TKKVVFSTGSYPRLSLSFKLKRNIGYFILQTYMPSILITILSWVSFWINYDASAARVA LGITTVLTMTTINTHLRETLPKIPYVKAIDMYLMGCFVFVFMALLEYALVNYIFFGRG PQRQKKAAEKAASANNEKMRLDVNKMDPHENILLSTLEIKNEMATSEAVMGLGDPRST MLAYDASSIQYRKAGLPRHSFGRNALERHVAQKKSRLRRRASQLKITIPDLTDVNAID RWSRIFFPVVFSFFNIVYWLYYVN" BASE COUNT 502 a 451 c 436 g 471 t ORIGIN 1 cgcgcgggga agggaagaag aggacgaggt ggcgcagaga ccgcgggaga acacagtgcc 61 tccggaggaa atctgctcgg tccccggcag ccgcgcttcc cctttgatgt tttggtacgc 121 cgtggccatg cgcctcacat tagaattact gcactgggca gactaagttg gatctcctct 181 cttcagtgaa accctcaatt ccatcaaaaa ctaaagggat gtggagagtg cggaaaaggg 241 gctactttgg gatttggtcc ttccccttaa taatcgccgc tgtctgtgcg cagagtgtca 301 atgaccctag taatatgtcg ctggttaaag agacggtgga tagactcctg aaaggctatg 361 acattcgtct gagaccagat tttggaggtc cccccgtggc tgtggggatg aacattgaca 421 ttgccagcat cgatatggtt tctgaagtca atatggatta taccttgaca atgtactttc 481 aacaagcctg gagagataag aggctgtcct ataatgtaat acctttaaac ttgactctgg 541 acaacagagt ggcagaccag ctctgggtgc ctgataccta tttcctgaac gataagaagt 601 catttgtgca cggagtgact gttaagaacc gcatgattcg cctgcatcct gatggcaccg 661 tcctttatgg actcagaatc acaaccacag ctgcctgcat gatggaccta aggaggtacc 721 cactggatga acaaaactgc accttggaaa ttgagagcta tggatacaca actgatgaca 781 ttgagtttta ctggcgtggc gatgataatg cagtaacagg agtaacgaaa attgaacttc 841 cacagttctc tattgtagat tacaaactta tcaccaagaa ggttgttttt tccacaggtt 901 cctatcccag gttatccctc agctttaagc ttaagagaaa cattggctac tttatcctgc 961 aaacatacat gccttccatc ctgattacca tcctctcctg ggtctccttc tggattaatt 1021 acgatgcttc agctgcaagg gtggcattag gaatcacaac tgtcctcaca atgaccacaa 1081 tcaacaccca cctccgggaa actctcccta aaatccccta tgtgaaggcc attgacatgt 1141 acctgatggg gtgctttgtc ttcgttttca tggcccttct ggaatatgcc ctagtcaact 1201 acatcttctt tgggaggggg ccccaacgcc aaaagaaagc agctgagaag gctgccagtg 1261 ccaacaatga gaagatgcgc ctggatgtca acaagatgga cccccatgag aacatcttac 1321 tgagcactct cgagataaaa aatgaaatgg ccacatctga ggctgtgatg ggacttggag 1381 accccagaag cacaatgcta gcctatgatg cctccagcat ccagtatcgg aaagctgggt 1441 tgcccaggca tagttttggc cgaaatgctc tggaacgaca tgtggcgcaa aagaaaagtc 1501 gcctgaggag acgcgcctcc caactgaaaa tcaccatccc tgacttgact gatgtgaatg 1561 ccatagatcg gtggtcccgc atattcttcc cagtggtttt ttccttcttc aacatcgtct 1621 attggcttta ttatgtgaac taaaacatgg cctcccactg gaagcaagga ctagattcct 1681 cctcaaacca gttgtacagc ctgatgtagg acttggaaaa cacatcaatc caggacaaaa 1741 gtgacgctaa aataccttag ttgctggcct atcctgtggt ccatttcata ccatttgggt 1801 tgcttctgct aagtaatgaa tacactaagg tccttgtggt tttccagtta aaacgcaagt // LOCUS S67970 1563 bp mRNA PRI 15-MAR-1994 DEFINITION ZNF75=KRAB zinc finger [human, lung fibroblast, mRNA, 1563 nt]. ACCESSION S67970 NID g460902 KEYWORDS . SOURCE human lung fibroblast. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1563) AUTHORS Villa,A., Zucchi,I., Pilia,G., Strina,D., Susani,L., Morali,F., Patrosso,C., Frattini,A., Lucchini,F., Repetto,M. et,al. TITLE ZNF75: isolation of a cDNA clone of the KRAB zinc finger gene subfamily mapped in YACs 1 Mb telomeric of HPRT JOURNAL Genomics 18 (2), 223-229 (1993) MEDLINE 94116987 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 142150] from the original journal article. This sequence comes from Fig. 1. Map location: Xq24-qter. FEATURES Location/Qualifiers source 1..1563 /organism="Homo sapiens" /db_xref="taxon:9606" gene 99..1306 /note="KRAB zinc finger" /gene="ZNF75" CDS join(99..131,470..1306) /gene="ZNF75" /note="KRAB zinc finger; This sequence comes from Fig. 1" /codon_start=1 /db_xref="PID:g460903" /translation="MASKLILPESLSLLTFEDVAVYFSEEEWQLLNPLEKTLYNDVMQ DIYETVISLGLKLKNDTGNDHPISVSTSEIQTSGCEVSKKTRMKIAQKTMGRENPGDT HSVQKWHRAFPRKKRKKPATCKQELPKLMDLHGKGPTGEKPFKCQECGKSFRVSSDLI KHHRIHTGEKPYKCQQCDRRFRWSSDLNKHFMTHQGIKPYRCSWCGKSFSHNTNLHTH QRIHTGEKPFKCDECGKRFIQNSHLIKHQRTHTGEQPYTCSLCKRNFSRRSSLLRHQK LHRRREACLVSPN" BASE COUNT 501 a 314 c 322 g 426 t ORIGIN 1 taggaaacag aaattttccc tggctatttt ctacccacag ctgtcatgat caacagatgt 61 tagccctttc tgagcagaaa agaatcaaac actggaagat ggcatctaaa ctcatcctgc 121 ctgagtccct ggtgagctgt tatttctggc tttttacagg tgacttgact gtggccttgc 181 ctctgcttgt ccctattgcc taggactcat agtgtccagc aggtgctttg aggcatttta 241 gccccagtta ttctctaggc aactaggctt ggcacagtgg gaactgggca cctcccaggt 301 gatttactga tcctctttgc tccttccttt ctctgccttc tcactttttt cccctaaatc 361 ttgtactgtt cacatcttca gcacctggcc taccatgtaa ttcagaaatg ggtggtagga 421 cagcttctga agtggcaagt actaaactat agcccattct cttctttaga gtttgttgac 481 atttgaagat gtggctgtgt atttttctga ggaagagtgg caattattga atcctcttga 541 gaagactctc tacaatgatg taatgcagga tatctatgag actgtcatct ctctagggtt 601 aaagctaaaa aatgacactg gaaatgatca tcctatatct gtttctacat cagaaataca 661 aacatcagga tgcgaagtat caaaaaagac cagaatgaaa attgcccaga aaacaatggg 721 cagggaaaat cctggtgata cacacagtgt acagaaatgg catcgagctt ttccaaggaa 781 gaaaagaaag aaacctgcaa cttgtaaaca agagcttcca aaacttatgg atcttcatgg 841 gaaaggcccc acaggggaga aaccttttaa gtgtcaggaa tgtgggaaaa gcttcagagt 901 tagctctgat cttattaaac accacagaat tcacactgga gagaaaccct ataaatgtca 961 acaatgtgac aggaggttta gatggagttc agatcttaat aagcacttca tgacccatca 1021 aggaataaaa ccatatagat gctcatggtg tgggaaaagc tttagtcata acacaaatct 1081 acacacacac caaagaattc acacaggaga gaagcccttt aaatgtgatg aatgtggaaa 1141 aagattcatt cagaactccc accttattaa acaccagaga actcacacag gtgagcagcc 1201 ttatacgtgt agcttatgca agagaaactt tagtaggcga tcgagccttc ttagacacca 1261 gaaactccac agaagaaggg aagcatgtct agtgtctcca aactgaggaa agttaccatg 1321 tagagcttga ctttagaagt ggtgaaagaa tacaaaatta tgaggcaccc aatgatagga 1381 atctgtgatt aataaaacat ttgggaaggg tacatgttac acttcacaaa aaggaatcta 1441 agctgtctgt tttattcagc attgcatctt ctgtgcctag cacaagagtt gatacataag 1501 gagtacttta taaataaaaa aatgaaagtg tagtgatgag attcctttag ctatctatct 1561 ata // LOCUS S69115 833 bp mRNA PRI 22-SEP-1994 DEFINITION granulocyte colony-stimulating factor induced gene [human, CML patient, bone marrow mononuclear cells, mRNA, 833 nt]. ACCESSION S69115 NID g545708 KEYWORDS . SOURCE human CML patient bone marrow mononuclear cells. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 833) AUTHORS Shimane,M., Tani,K., Maruyama,K., Takahashi,S., Ozawa,K. and Asano,S. TITLE Molecular cloning and characterization of G-CSF induced gene cDNA JOURNAL Biochem. Biophys. Res. Commun. 199 (1), 26-32 (1994) MEDLINE 94168584 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 144424] from the original journal article. This sequence comes from Fig. 1. FEATURES Location/Qualifiers source 1..833 /organism="Homo sapiens" /db_xref="taxon:9606" gene 181..678 /gene="granulocyte colony-stimulating factor induced gene" CDS 181..678 /gene="granulocyte colony-stimulating factor induced gene" /note="This sequence comes from Fig. 3." /codon_start=1 /db_xref="PID:g545709" /translation="MELCRSLALLGGSLGLMFCLIALSTDFWFEAVGPTHSAHSGLWP TGHGDIISGYIHVTQTFSIMAVLWALVSVSFLVLSCFPSLFPPGHGPLVSTTAAFAAA ISMVVAMAVYTSERWDQPPHPQIQTFFSWSFYLGWVSAILLLCTGALSLGAHCGGPRP GYETL" BASE COUNT 134 a 281 c 209 g 209 t ORIGIN 1 aaaccagcct catgtgacaa agcgcaggac ccctcactgc cccaactgct tgctgttctc 61 tctttcttgg gctctaagga cccaggagtc tgggtgcaca gcctccttct ctctgagatt 121 caagagtctg atcagcagcc tcttcctcct ccaggaccca gaagccctga gcttatcccc 181 atggagctct gccggtccct ggccctgctg gggggctccc tgggcctgat gttctgcctg 241 attgctttga gcaccgattt ctggtttgag gctgtgggtc ccacccactc agctcactcg 301 ggcctctggc caacagggca tggggacatc atatcaggct acatccacgt gacgcagacc 361 ttcagcatta tggctgttct gtgggccctg gtgtccgtga gcttcctggt cctgtcctgc 421 ttcccctcac tgttcccccc aggccacggc ccgcttgtct caaccaccgc agcctttgct 481 gcagccatct ccatggtggt ggccatggcg gtgtacacca gcgagcggtg ggaccagcct 541 ccacaccccc agatccagac cttcttctcc tggtccttct acctgggctg ggtctcagct 601 atcctcttgc tctgtacagg tgccctgagc ctgggtgctc actgtggcgg tccccgtcct 661 ggctatgaaa ccttgtgagc agaaggcaag agcggcaaga tgagttttga gcgttgtatt 721 ccaaaggcct catctggagc ctcgggaaag tctggtccta catctgcccg cccttccagc 781 ccttccccag cccctcctct tgtttcttca ttcattcaac aaaatttggc tgg // LOCUS S69232 2124 bp mRNA PRI 22-SEP-1994 DEFINITION electron transfer flavoprotein-ubiquinone oxidoreductase [human, fetal liver, mRNA, 2124 nt]. ACCESSION S69232 NID g545620 KEYWORDS . SOURCE human fetal liver. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2124) AUTHORS Goodman,S.I., Axtell,K.M., Bindoff,L.A., Beard,S.E., Gill,R.E. and Frerman,F.E. TITLE Molecular cloning and expression of a cDNA encoding human electron transfer flavoprotein-ubiquinone oxidoreductase JOURNAL Eur. J. Biochem. 219 (1-2), 277-286 (1994) MEDLINE 94139702 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 145206] from the original journal article. FEATURES Location/Qualifiers source 1..2124 /organism="Homo sapiens" /db_xref="taxon:9606" gene 121..1974 /gene="electron transfer flavoprotein-ubiquinone oxidoreductase, ETF-QO" CDS 121..1974 /gene="electron transfer flavoprotein-ubiquinone oxidoreductase, ETF-QO" /note="This sequence comes from Fig. 2; ETF-QO" /codon_start=1 /product="electron transfer flavoprotein-ubiquinone oxidoreductase" /db_xref="PID:g545621" /translation="MLVPLAKLSCLAYQCFHALKIKKNYLPLCAIRWSSTSTVPRITT HYTIYPRDKDKRWEGVNMERFAEEADVVIVGAGPAGLSAAVRLKQLAVAHEKDIRVCL VEKAAQIGAHTLSGACLDPGAFKELFPDWKEKGAPLNTPVTEDRFGILTEKYRIPVPI LPGLPMNNHGNYIVRLGHLVSWMGEQAEALGVEVYPGYAAAEVLFHDDGSVKGIATND VGIQKDGAPKATFERGLELHAKVTIFAEGCHGHLAKQLYKKFDLRANCEPQTYGIGLK ELWVIDEKNWKPGRVDHTVGWPLDRHTYGGSFLYHLNEGEPLVALGLVVGLDYQNPYL SPFREFQRWKHHPSIRPTLEGGKRIAYGARALNEGGFQSIPKLTFPGGLLIGCSPGFM NVPKIKGTHTAMKSGILAAESIFNQLTSENLQSKTIGLHVTEYEDNLKNSWVWKELYS VRNIRPSCHGVLGVYGGMIYTGIFYWILRGMEPWTLKHKGSDFERLKPAKDCTPIEYP KPDGQISFDLLSSVALSGTNHEHDQPAHLTLRDDSIPVNRNLSIYDGPEQRFCPAGVY EFVPVEQGDGFRLQINAQNCVHCKTCDIKDPSQNINWVVPEGGGGPAYNGM" BASE COUNT 635 a 412 c 509 g 568 t ORIGIN 1 attccgctct tgctttccgg caggtgatgg cgccccccgc ggcctagagg tccagcgccc 61 gccgcgagca gcggacagtc ctcctgttgt gtccgaccga gagtcctggt gactttgaac 121 atgctggtgc cgctagccaa gctgtcctgc ctggcatatc agtgctttca tgccttaaaa 181 attaagaaaa attatctacc tctatgtgct ataagatggt cttcaacttc tactgtgcct 241 cgaattacta cccattatac tatttatccc cgggataagg acaagagatg ggaaggagtg 301 aacatggaaa ggtttgcaga agaagcagat gttgtaatag ttggtgcagg ccctgcaggg 361 ctctctgcag ctgttcgtct aaaacagttg gctgtggcac atgaaaagga catccgtgtg 421 tgtctagtgg agaaagctgc ccagatagga gctcatactc tctcaggggc ttgccttgat 481 ccaggtgctt ttaaagaact cttcccagac tggaaagaga agggggctcc acttaacact 541 cctgtaacag aagacagatt tggaatttta acagagaaat acagaattcc tgtgccaatt 601 cttccagggc ttccaatgaa taatcatggc aattacattg tacgcttggg acatttagtg 661 agctggatgg gcgaacaagc agaagccctt ggtgttgaag tataccctgg ttatgcagct 721 gctgaggtcc tttttcatga tgatggtagt gtaaaaggaa ttgccactaa cgatgtaggg 781 atacaaaagg atggtgcacc aaaggcaaca tttgagagag gactggaact acatgctaaa 841 gtcacaattt ttgcagaagg ttgccatgga catctagcca agcaactata taagaagttt 901 gatttgagag caaattgtga acctcaaacc tacgggattg gactgaagga gttatgggtt 961 attgatgaaa agaactggaa acctgggaga gtagatcaca ctgttggttg gcccttggac 1021 agacatacct atggaggatc tttcctctat catttgaatg aaggtgaacc cctagtagct 1081 cttggtcttg tggttggtct agactatcag aatccatacc tgagtccatt tagagagttc 1141 caaaggtgga aacaccatcc tagcattcgg ccaaccttgg aaggtggaaa aaggattgca 1201 tacggagcca gagctctcaa tgaaggtggc tttcagtcta taccaaaact cacctttcct 1261 ggtggtttac taattggttg tagtcctggt tttatgaatg ttcccaagat caaaggtact 1321 cacacagcaa tgaaaagtgg aattttagca gcagaatcta tttttaatca actaactagt 1381 gaaaatctcc aatcaaagac aataggactc catgtaactg aatatgagga caatttgaag 1441 aactcatggg tatggaaaga gctatattct gttagaaata taagaccgtc ctgccacgga 1501 gtactgggtg tatatggagg gatgatttac actggaatct tttactggat attgagagga 1561 atggagccgt ggactctgaa acataaaggt tctgactttg aacggctcaa gccagccaag 1621 gattgcacac ctattgagta tccaaaaccc gatggacaga tcagttttga cctcttgtca 1681 tctgtggctc tgagtggtac taatcatgaa catgaccagc cggcacactt aaccttaagg 1741 gatgacagta tacctgtaaa tagaaatctg tcgatatatg atgggcccga gcagcgattc 1801 tgtcctgcag gagtttatga atttgtacct gtggaacaag gtgatggatt tcggttacag 1861 ataaatgctc agaactgtgt acattgtaaa acatgtgata ttaaagatcc aagtcagaat 1921 attaactggg tggtacctga aggtggagga ggacctgctt acaatggaat gtaaactgca 1981 gctagccagt ttctttcaag tatggcaagc taacgttaaa atgtttagag attaacagat 2041 ttcagaatgt ctttctgcat attactgaac agaatagtca caaaatgatt atcaaataaa 2101 aattttatac tataaaaaaa aaaa // LOCUS S70154 1490 bp mRNA PRI 22-SEP-1994 DEFINITION cytosolic acetoacetyl-coenzyme A thiolase [human, liver, mRNA, 1490 nt]. ACCESSION S70154 NID g546900 KEYWORDS . SOURCE human liver. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1490) AUTHORS Song,X.Q., Fukao,T., Yamaguchi,S., Miyazawa,S., Hashimoto,T. and Orii,T. TITLE Molecular cloning and nucleotide sequence of complementary DNA for human hepatic cytosolic acetoacetyl-coenzyme A thiolase JOURNAL Biochem. Biophys. Res. Commun. 201 (1), 478-485 (1994) MEDLINE 94257021 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 147875] from the original journal article. This sequence comes from Fig. 1. FEATURES Location/Qualifiers source 1..1490 /organism="Homo sapiens" /db_xref="taxon:9606" gene 38..1231 /gene="cytosolic acetoacetyl-coenzyme A thiolase, CT" CDS 38..1231 /gene="cytosolic acetoacetyl-coenzyme A thiolase, CT" /note="Method: conceptual translation with partial peptide sequencing. This sequence comes from Fig. 1; CT" /codon_start=1 /product="cytosolic acetoacetyl-coenzyme A thiolase" /db_xref="PID:g546901" /translation="MNAGSDPVVIVSAARTIIGSFNGALAAVPVQDLGSTVIKEVLKR ATVAPEDVSEVIFGHVLAAGCGQNPVRQASVGAGIPYSVPAWSCQMICGSGLKAVCLA VQSIGIGDSSIVVAGGMENMSKAPHLAYLRTGVKIGEMPLTDSILCDGLTDAFHNCHM GITAENVATKWQVSREDQDKVAVLSQNRTENAQKAGHFDKEIVPVLVSTRKGLIEVKT DEFPRHGSNIEAMSKLKPYFLTDGTGTVTPANASGINDGAAAVALMKKSEADKRGLTP LARIVSWSQVGVEPSIMGIGPIPAIKQAVTKAGWSLEDVDIFEINEAFAAVSAAIVKE LGLNPEKVNIEGGAIALGHPLGASGCRILVTLLHTLERMGRSRGVAALCIGGGMGIAM CVQRE" BASE COUNT 440 a 293 c 385 g 372 t ORIGIN 1 ggggcagcgc agggcagacg gcggcaggag aagcaagatg aatgcaggct cagatcctgt 61 ggtcatcgtc tcggcggcgc ggaccatcat aggttccttc aatggtgcct tagctgctgt 121 tcctgtccag gacctgggct ccactgtcat caaagaagtc ttgaagaggg ccactgtggc 181 tccggaagat gtgtctgagg tcatctttgg acatgtcttg gcagcaggct gtgggcagaa 241 tcctgttaga caagccagtg tgggtgcagg aattccctac tctgttccag catggagctg 301 ccagatgatc tgtgggtcag gcctaaaagc tgtgtgcctt gcagtccagt caatagggat 361 aggagactcc agcattgtgg ttgcaggagg catggaaaat atgagcaagg ctcctcactt 421 ggcttacttg agaacaggag taaagatagg tgagatgcca ctgactgaca gtatactctg 481 tgatggtctt acagatgcat ttcacaactg tcatatgggt attacagctg aaaatgtagc 541 cacaaaatgg caagtgagta gagaagatca ggacaaggtt gcagttctgt cccagaacag 601 gacagagaat gcacagaaag ctggccattt tgacaaagag attgtaccag ttttggtgtc 661 aactagaaaa ggtcttattg aagttaaaac agatgagttt cctcgccatg ggagcaacat 721 agaagccatg tccaagctaa agccttactt tcttactgat ggaacgggaa cagtcacccc 781 agccaatgct tcaggaataa atgatggtgc tgcagctgtt gctcttatga agaagtcaga 841 agctgataaa cgtgggctta cacctttagc acggatagtt tcctggtccc aagtgggtgt 901 ggagccttcc attatgggaa taggaccaat tccagccata aagcaagctg ttacaaaagc 961 aggttggtca ctggaagatg ttgacatatt tgaaatcaat gaagcctttg cagctgtctc 1021 tgctgcaata gttaaagaac ttggattaaa cccagagaag gtcaatattg aaggaggggc 1081 tatagccttg ggccaccctc ttggagcatc tggctgtcga attcttgtga ccctgttaca 1141 cacactggag agaatgggca gaagtcgtgg tgttgcagcc ctgtgcattg ggggtgggat 1201 gggaatagca atgtgtgttc agagagaatg acaatgtgtg ttcagagaga atgaattgct 1261 taaactttga acaacctcaa tttcttttta aactaataaa gtactaggtt gcaatatgtg 1321 aaatcagagg accaaagtac agatggaaac catttcctac atcacaaaaa cccaagttta 1381 cagcttgtac tttactttaa tgtgtaatac tcaactcacg gtacaagaca attgcattta 1441 acattgttat aaataaaagg aacatcagat caatcattaa aaaaaaaaaa // LOCUS S70609 2364 bp mRNA PRI 22-SEP-1994 DEFINITION glycine transporter type 1b [human, substantia nigra, mRNA, 2364 nt]. ACCESSION S70609 NID g546768 KEYWORDS . SOURCE human substantia nigra. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2364) AUTHORS Kim,K.M., Kingsmore,S.F., Han,H., Yang-Feng,T.L., Godinot,N., Seldin,M.F., Caron,M.G. and Giros,B. TITLE Cloning of the human glycine transporter type 1: molecular and pharmacological characterization of novel isoform variants and chromosomal localization of the gene in the human and mouse genomes JOURNAL Mol. Pharmacol. 45 (4), 608-617 (1994) MEDLINE 94239375 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 149092] from the original journal article. This sequence comes from Fig. 1. Map location: 1p31.3. FEATURES Location/Qualifiers source 1..2364 /organism="Homo sapiens" /db_xref="taxon:9606" gene 234..2312 /gene="glycine transporter type 1b, GlyT-1b" CDS 234..2312 /gene="glycine transporter type 1b, GlyT-1b" /note="This sequence comes from Fig. 1; GlyT-1b" /codon_start=1 /product="glycine transporter type 1b" /db_xref="PID:g546769" /translation="MAAAHGPVAPSSPEQVTLLPVQRSFFLPPFSGATPSTSLAESVL KVWHGAYNSGLLPQLMAQHSLAMAQNGAVPSEATKRDQNLKRGNWGNQIEFVLTSVGY AVGLGNVWRFPYLCYRNGGGAFMFPYFIMLIFCGIPLFFMELSFGQFASQGCLGVWRI SPMFKGVGYGMMVVSTYIGIYYNVVICIAFYYFFSSMTHVLPWAYCNNPWNTHDCAGV LDASNLTNGSRPAALPSNLSHLLNHSLQRTSPSEEYWRLYVLKLSDDIGNFGEVRLPL LGCLGVSWLVVFLCLIRGVKSSGKVVYFTATFPYVVLTILFVRGVTLEGAFDGIMYYL TPQWDKILEAKVWGDAASQIFYSLACAWGGLITMASYNKFHNNCYRDSVIISITNCAT SVYAGFVIFSILGFMANHLGVDVSRVADHGPGLAFVAYPEALTLLPISPLWSLLFFFM LILLGLGTQFCLLETLVTAIVDEVGNEWILQKKTYVTLGVAVAGFLLGIPLTSQAGIY WLLLMDNYAASFSLVVISCIMCVAIMYIYGHRNYFQDIQMMLGFPPPLFFQICWRFVS PAIIFFILVFTVIQYQPITYNHYQYPGWAVAIGFLMALSSVLCIPLYAMFRLCRTDGD TLLQRLKNATKPSRDWGPALLEHRTGRYAPTIAPSPEDGFEVQSLHPDKAQIPIVGSN GSSRLQDSRI" BASE COUNT 418 a 772 c 650 g 524 t ORIGIN 1 gcccacacac cccactccag ctccggagca cccgtgctgg gctgcatggg gactggccgg 61 aggggcaggg ccaggggagc gggtaggcag agcttcggga ggagatgagg tgaaagtaat 121 tgacgctgcc cagcccggca gtgggagagg caggggatgc gtcagtgtcg cgctggagct 181 ggcagaggtg atgagcggcg gagacacgcg gggctgcgat cgctcgcccc aggatggccg 241 cggctcatgg acctgtggcc ccctcttccc cagaacaggt gacgcttctc cctgttcaga 301 gatccttctt cctgccaccc ttttctggag ccactccctc tacttcccta gcagagtctg 361 tcctcaaagt ctggcatggg gcctacaact ctggtctcct tccccaactc atggcccagc 421 actccctagc catggcccag aatggtgctg tgcccagcga ggccaccaag agggaccaga 481 acctcaaacg gggcaactgg ggcaaccaga tcgagtttgt actgacgagc gtgggctatg 541 ccgtgggcct gggcaatgtc tggcgcttcc catacctctg ctatcgcaac gggggaggcg 601 ccttcatgtt cccctacttc atcatgctca tcttctgcgg gatccccctc ttcttcatgg 661 agctctcctt cggccagttt gcaagccagg ggtgcctggg ggtctggagg atcagcccca 721 tgttcaaagg agtgggctat ggtatgatgg tggtgtccac ctacatcggc atctactaca 781 atgtggtcat ctgcatcgcc ttctactact tcttctcgtc catgacgcac gtgctgccct 841 gggcctactg caataacccc tggaacacgc atgactgcgc cggtgtactg gacgcctcca 901 acctcaccaa tggctctcgg ccagccgcct tgcccagcaa cctctcccac ctgctcaacc 961 acagcctcca gaggaccagc cccagcgagg agtactggag gctgtacgtg ctgaagctgt 1021 cagatgacat tgggaacttt ggggaggtgc ggctgcccct ccttggctgc ctcggtgtct 1081 cctggttggt cgtcttcctc tgcctcatcc gaggggtcaa gtcttcaggg aaagtggtgt 1141 acttcacggc cacgttcccc tacgtggtgc tgaccattct gtttgtccgc ggagtgaccc 1201 tggagggagc ctttgacggc atcatgtact acctaacccc gcagtgggac aagatcctgg 1261 aggccaaggt gtggggtgat gctgcctccc agatcttcta ctcactggcg tgcgcgtggg 1321 gaggcctcat caccatggct tcctacaaca agttccacaa taactgttac cgggacagtg 1381 tcatcatcag catcaccaac tgtgccacca gcgtctatgc tggcttcgtc atcttctcca 1441 tcctcggctt catggccaat cacctgggcg tggatgtgtc ccgtgtggca gaccacggcc 1501 ctggcctggc cttcgtggct taccccgagg ccctcacact acttcccatc tccccgctgt 1561 ggtctctgct cttcttcttc atgcttatcc tgctggggct gggcactcag ttctgcctcc 1621 tggagacgct ggtcacagcc attgtggatg aggtggggaa tgagtggatc ctgcagaaaa 1681 agacctatgt gaccttgggc gtggctgtgg ctggcttcct gctgggcatc cccctcacca 1741 gccaggcagg catctattgg ctgctgctga tggacaacta tgcggccagc ttctccttgg 1801 tggtcatctc ctgcatcatg tgtgtggcca tcatgtacat ctacgggcac cggaactact 1861 tccaggacat ccagatgatg ctgggattcc caccacccct cttctttcag atctgctggc 1921 gcttcgtctc tcccgccatc atcttcttta ttctagtttt cactgtgatc cagtaccagc 1981 cgatcaccta caaccactac cagtacccag gctgggccgt ggccattggc ttcctcatgg 2041 ctctgtcctc cgtcctctgc atccccctct acgccatgtt ccggctctgc cgcacagacg 2101 gggacaccct cctccagcgt ttgaaaaatg ccacaaagcc aagcagagac tggggccctg 2161 ccctcctgga gcaccggaca gggcgctacg cccccaccat agccccctct cctgaggacg 2221 gcttcgaggt ccagtcactg cacccggaca aggcgcagat ccccattgtg ggcagtaatg 2281 gctccagccg cctccaggac tcccggatat agcacagctg ccaggggagt gccaccccac 2341 ccgtgctcca cgagagactg tgag // LOCUS S71018 883 bp mRNA PRI 22-SEP-1994 DEFINITION cyclophilin C [human, kidney, mRNA, 883 nt]. ACCESSION S71018 NID g547303 KEYWORDS . SOURCE human kidney. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 883) AUTHORS Schneider,H., Charara,N., Schmitz,R., Wehrli,S., Mikol,V., Zurini,M.G., Quesniaux,V.F. and Movva,N.R. TITLE Human cyclophilin C: primary structure, tissue distribution, and determination of binding specificity for cyclosporins JOURNAL Biochemistry 33 (27), 8218-8224 (1994) MEDLINE 94304830 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 149387] from the original journal article. This sequence comes from Fig. 1. FEATURES Location/Qualifiers source 1..883 /organism="Homo sapiens" /db_xref="taxon:9606" gene 165..803 /gene="cyclophilin C, Cyp-C" CDS 165..803 /gene="cyclophilin C, Cyp-C" /note="This sequence comes from Fig. 1; Cyp-C" /codon_start=1 /product="cyclophilin C" /db_xref="PID:g547304" /translation="MGPGPRLLLPLVLCVGLGALVFSSGAEGFRKRGPSVTAKVFFDV RIGDKDVGRIVIGLFGKVVPKTVENFVALATGEKGYGYKGSKFHRVIKDFMIQGGDIT TGDGTGGVSIYGETFPDENFKLKHYGIGWVSMANAGPDTNGSQFFITLTKPTWLDGKH VVFGKVIDGMTVVHSIELQATDGHDRPLTNCSIINSGKIDVKTPFVVEIADW" BASE COUNT 189 a 213 c 254 g 227 t ORIGIN 1 caattattcc aaatctattc cactctcttc aagcctgtac cccacctgcc ctttctcagc 61 aggctgctcc tctcacttct aggagtcccg tcagctgtcc cagagcctgt gtggcgcccg 121 tgccggtagc gcccgtgccg gtagcgccgc tgccaccgct caccatgggc ccgggtcctc 181 ggctgctgct acctctcgtg ctttgcgtgg ggctcggcgc acttgtgttt tcttcggggg 241 ccgagggctt ccgcaagcga ggcccctcgg tgacggccaa ggtcttcttt gatgtgagga 301 ttggagacaa agatgttggc agaattgtga ttggcctctt tggaaaagtt gtgcccaaga 361 cagtggaaaa ttttgttgct ctagcaacag gagagaaagg atatggatat aaaggaagca 421 agtttcatcg tgtcatcaag gatttcatga ttcaaggagg tgacatcacc actggagatg 481 gcactggggg tgtgagcatc tatggtgaga catttccaga tgagaacttc aagctgaagc 541 actatggcat tgggtgggtc agcatggcca acgctgggcc tgacaccaat ggctctcagt 601 tctttatcac cttgaccaag cccacctggt tggacggcaa acatgtggtg tttggaaaag 661 tcattgatgg gatgacagtg gtgcactcca tagagctcca agcaactgat gggcatgacc 721 gtccactcac caactgctcg atcatcaaca gtggcaagat agacgtgaaa acgccttttg 781 tggttgagat cgctgattgg tgacacaact ggcagaaaac aaggatatgc tttggcaggg 841 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt tgtgttgtct ttc // LOCUS S72008 2314 bp mRNA PRI 27-OCT-1994 DEFINITION hCDC10=CDC10 homolog [human, fetal lung, mRNA, 2314 nt]. ACCESSION S72008 NID g560622 KEYWORDS . SOURCE human fetal lung. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2314) AUTHORS Nakatsuru,S., Sudo,K. and Nakamura,Y. TITLE Molecular cloning of a novel human cDNA homologous to CDC10 in Saccharomyces cerevisiae JOURNAL Biochem. Biophys. Res. Commun. 202 (1), 82-87 (1994) MEDLINE 94311951 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 151177] from the original journal article. This sequence comes from Fig. 1. FEATURES Location/Qualifiers source 1..2314 /organism="Homo sapiens" /db_xref="taxon:9606" gene 49..1305 /note="CDC10 homolog; Saccharomyces cerevisiae CDC10 homolog" /gene="hCDC10" CDS 49..1305 /gene="hCDC10" /note="CDC10 homolog; This sequence comes from Fig. 1" /codon_start=1 /db_xref="PID:g560623" /translation="MVAQQKNLEGYVGFANLPNQVYRKSVKRGFEFTLMVVGESGLGK STLINSLFLTDLYSPEYPGPSHRIKKTVQVEQSKVLIKEGGVQLLLTIVDTPGFGDAV DNSNCWQPVIDYIDSKFEDYLNAESRVNRRQMPDNRVQCCLYFIAPSGHGLKPLDIEF MKRLHEKVNIIPLIAKADTLTPEECQQFKKQIMKEIQEHKIKIYEFPETDDEEENKLV KKIKDRLPLAVVGSNTIIEVNGKRVRGRQYPWGIAEVENGEHCDFTILRNMKIRTHMQ DLKDVTNNVHYENYRSRKLAAVTYNGVDNNKNKGQLTKSPLAQMEEERREHVAKMKKM EMEMEQVFEMKVKEKVQKLKDSEAELQRRHEQMKKNLEAQHKELEEKRRQFEDEKANW EAQQRILEQQNSSRTLEKNKKKGKIF" BASE COUNT 787 a 373 c 469 g 685 t ORIGIN 1 agtgcgagat ccgctgctgc tgaggagagg agcgtcaaca gcagcaccat ggtagctcaa 61 cagaagaacc ttgaaggcta tgtgggattt gccaatctcc caaatcaagt atacagaaaa 121 tcggtgaaga gaggttttga attcacgctt atggtagtgg gtgaatctgg attgggaaag 181 tcgacattaa tcaactcatt attcctcaca gatttgtatt ctccagagta tccaggtcct 241 tctcatagaa ttaaaaagac tgtacaggtg gaacaatcca aagttttaat caaagaaggt 301 ggtgttcagt tgctgctcac aatagttgat accccaggat ttggagatgc agtggataat 361 agtaattgct ggcagcctgt tatcgactac attgatagta aatttgagga ctacctaaat 421 gcagaatcac gagtgaacag acgtcagatg cctgataaca gggtgcagtg ttgtttatac 481 ttcattgctc cttcaggaca tggacttaaa ccattggata ttgagtttat gaagcgtttg 541 catgaaaaag tgaatatcat cccacttatt gccaaagcag acacactcac accagaggaa 601 tgccaacagt ttaaaaaaca gataatgaaa gaaatccaag aacataaaat taaaatatac 661 gaatttccag aaacagatga tgaagaagaa aataaacttg ttaaaaagat aaaggaccgt 721 ttacctcttg ctgtggtagg tagtaatact atcattgaag ttaatggcaa aagggtcaga 781 ggaaggcagt atccttgggg tattgctgaa gttgaaaatg gtgaacattg tgattttaca 841 atcctaagaa atatgaagat aagaacacac atgcaggact tgaaagatgt tactaataat 901 gtccactatg agaactacag aagcagaaaa cttgcagctg tgacttataa tggagttgat 961 aacaacaaga ataaagggca gctgactaag agccctctgg cacaaatgga agaagaaaga 1021 agggagcatg tagctaaaat gaagaagatg gagatggaga tggagcaggt gtttgagatg 1081 aaggtcaaag aaaaagttca aaaactgaag gactctgaag ctgagctcca gcggcgccat 1141 gagcaaatga aaaagaattt ggaagcacag cacaaagaat tggaggaaaa acgtcgtcag 1201 ttcgaggatg agaaagcaaa ctgggaagct caacaacgta ttttagaaca acagaactct 1261 tcaagaacct tggaaaagaa caagaagaaa gggaagatct tttaaactct ctattgacca 1321 ccagttaacg tattagttgc caatatgcca gcttggacat cagtgtttgt tggatccgtt 1381 tgaccaattt gcaccagttt tatccataat gatggattta acagcatgac aaaaattatt 1441 tttttttttg ttcttgatgg agattaagat gccttgaatt gtctagggtg ttctgtactt 1501 agaaagtaag agctctaagt acctttccta cattttcttt ttttattaaa cagatatctt 1561 cagtttaatg caagagaaca ttttactgtt gtacaatcat gttctggtgg tttgattgtt 1621 tacaggatat tccaaaataa aaggactctg gaagattttc attgaggaga aattgccata 1681 atatgatgca aactgtgctt ctctatgata attacaatac aaaggttcca ttcagtgcag 1741 catatacaat aatgtaattt agtctaacac agttgaccct attttttgac acttccattg 1801 tttaaaaata cacatggaaa aaaaaaaacc ctatatgctt actgtgcacc tagagctttt 1861 ttataacaac gtctttttgt ttgtttgttt tggattcttt aaatatatat tctcatttag 1921 tgccctcttt agccagaatc tcattactgc ttcatttttg taataacatt taatttagat 1981 attttccaca tattggcact gctaaaatag aatatagcat ctttcatatg gtaggaacca 2041 acaaggaaac tttcctttaa ctcccttttt acactttatg gtaagtagca gggggggaaa 2101 tgcatttata gatcatttct aggcaaaatt gtgaagctaa tgaccaacct gtttctacct 2161 atatgcagtc tctttatttt actagaaatg ggaatcatgg cctcttgaag agaaaaaagt 2221 caccattctg catttagctg tattcatata ttgtatttct gtattttttg tttgtattgt 2281 aaaaaattca cataataaac gatggttgtg atgt // LOCUS S72482 762 bp mRNA PRI 23-JAN-1995 DEFINITION Ley I-L=Leydig insulin-like peptide [human, testis, mRNA, 762 nt]. ACCESSION S72482 NID g632798 KEYWORDS . SOURCE human testis. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 762) AUTHORS Burkhardt,E., Adham,I.M., Hobohm,U., Murphy,D., Sander,C. and Engel,W. TITLE A human cDNA coding for the Leydig insulin-like peptide (Ley I-L) JOURNAL Hum. Genet. 94 (1), 91-94 (1994) MEDLINE 94307715 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 152938] from the original journal article. This sequence comes from Fig. 1. FEATURES Location/Qualifiers source 1..762 /organism="Homo sapiens" /db_xref="taxon:9606" gene 8..403 /gene="Ley I-L" CDS 8..403 /gene="Ley I-L" /note="Leydig insulin-like peptide; This sequence comes from Fig. 1" /codon_start=1 /product="Ley I-L" /db_xref="PID:g632799" /translation="MDPRLPAWALVLLGPALVFALGPAPTPEMREKLCGHHFVRALVR VCGGPRWSTEARRPAAGGDRELLQWLERRHLLHGLVADSNLTLGPGLQPLPQTSHHHR HHRAAATNPARYCCLSGCTQQDLLTLCPY" BASE COUNT 156 a 264 c 199 g 143 t ORIGIN 1 caccaccatg gacccccgtc tgcccgcctg ggcgctggtg ctgctgggcc ctgccctggt 61 gttcgcgttg ggccccgcgc ccaccccaga gatgcgtgag aagttgtgcg gccaccactt 121 cgtacgcgcg ctggtgcgcg tgtgcggggg cccccgctgg tccaccgaag ccaggaggcc 181 tgcggccgga ggcgaccgtg agttgctaca gtggctggag agacgacatc tgctccatgg 241 gctggtggcc gacagtaatc tcacgctggg acctggcctg cagcccctgc cccagacctc 301 tcaccatcac cgccaccacc gtgcagctgc caccaaccct gcacgctact gctgcctcag 361 tggctgtacc caacaagacc tgctgaccct ctgtccctac tgattcctcc ttgggtgcag 421 cctcagagtg gcctgaggcc cagagggtct ggtctggtga gctcctgagg ccacacagca 481 ccataaagtc tcgcatctac aggcctttga ttacctcctg ggatgggtgc tcactatcta 541 ccccagacca atgccacctg cagcctgtgg agtcaactgc agaataaatc acaccctagc 601 cctggcttgg aggatccccg ctttcacaga tgctggacac tgacagccaa atgtcctcac 661 tccagaggag ccccagacgc tccgctccct gcatgtgtaa caccccttct tgctgtctct 721 tagtaaataa acgacccaaa gcaaaaaaaa aaaaaaaaaa aa // LOCUS S72487 1718 bp mRNA PRI 05-JAN-1995 DEFINITION orf1 5' to PD-ECGF/TP...orf2 5' to PD-ECGF/TP [human, epidermoid carcinoma cell line A431, mRNA, 3 genes, 1718 nt]. ACCESSION S72487 NID g619332 KEYWORDS . SOURCE human epidermoid carcinoma cell line A431. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1718) AUTHORS Usuki,K., Gonez,L.J., Wernstedt,C., Moren,A., Miyazono,K., Claesson-Welsh,L. and Heldin,C.H. TITLE Structural properties of 3.0 kb and 3.2 kb transcripts encoding platelet-derived endothelial cell growth factor/thymidine phosphorylase in A431 cells JOURNAL Biochim. Biophys. Acta 1222 (3), 411-414 (1994) MEDLINE 94312438 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 152985] from the original journal article. This sequence comes from Fig. 2. FEATURES Location/Qualifiers source 1..1718 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..168 /gene="orf1 5' to PD-ECGF/TP" CDS 1..168 /gene="orf1 5' to PD-ECGF/TP" /codon_start=1 /translation="MGLGAGRPDANSDAPRLRLGHDPCGRAPPPSPSARASPRSRRRA APGQATWCPLA" gene 38..511 /gene="orf3 5' of PD-ECGF/TP" CDS 38..511 /gene="orf3 5' of PD-ECGF/TP" /codon_start=1 /translation="MLPGYALAMTRAAARPRLHLRRALPHAADDVRPRARPPGARSHD RARHRRRPRLLHLRPPTPLSALPHSGTWSGPPGPWPPQRRTASREAHLGTPDLNPESP SDTLTRYSVPPYPDLKSQTPNPRGFDKSWLRLPTSPRTPSRVPTRLPRSSSPPHP" gene 165..329 /gene="orf2 5' to PD-ECGF/TP" CDS 165..329 /gene="orf2 5' to PD-ECGF/TP" /codon_start=1 /translation="MTVRGTDGAPAYSIYGRPRRSAPFLTPGPGQDPRAPGHPNAELR PGRPTWEPPT" BASE COUNT 279 a 661 c 567 g 211 t ORIGIN 1 atggggcttg gggctgggcg gccagacgct aactcggatg ctcccaggct acgccttggc 61 catgacccgt gcggccgcgc gcccccgcct tcaccttcgg cgcgcgcttc cccacgcagc 121 agacgacgtg cggccccggg ccaggccacc tggtgcccgc tcgcatgacc gtgcgcggca 181 ccgacggcgc ccccgcctac tccatctacg gccgcccacg ccgctcagcg cccttcctca 241 ctccgggacc tggtcaggac ccccgggccc ctggccaccc caacgccgaa ctgcgtccag 301 ggaggcccac ctgggaaccc ccgacctgaa ccccgagtcc ccctcggata ccctaacacg 361 atattcggta cccccatatc cggatctcaa atcccaaacc ccgaacccac ggggctttga 421 taaatcgtgg ctcagactcc ccactagtcc caggacccca tctcgggtac ccaccaggct 481 cccacgcagt tctagccccc cacacccttg atccgccccg caggcaggta cttcccggag 541 cgagcgggga acgcgacgta ccccagtgcg cctcggcaca ccattgctcc ccgaaactgg 601 ggtgtccagg cggaacagca gagcccaggt cccgcggcct atacggtgcc ctcgctcttg 661 ggtccgcgcg tcatcggcaa agtctccgcc ccaacttgct ccatctacgg ccgcagagcg 721 gctggcagtt tcttcgagga cctcagcaag gtcgtgagtc caggggtcta caagtcccgg 781 gccccccagt tcacgattct ggcgcggact tcgctccccc aagacaacac tcggaagcca 841 gggcccgcgg cctacaacgt ggatcagcac cggaagcccc gcggctggag tttcgggatc 901 cggcactcgg actacctggc cccgctggtg accgacgcgg acaactgacc cgccaggcgg 961 gagcggcccc acacgtgttt gcttaaagtc tgcgagtccg catcgtgtcc gcctctctct 1021 ctctctctct gcgcgtcctg gcgcaaggcc tggggtggag ccacggctgg ggccgtgtcc 1081 caactccgaa cccagcgggg cggggcccga gcgtcgggcg aggccgggac cccagcgctg 1141 cgccgcgtcc gaacgtcgag accccaccga gggcgggagg gggactctcg ggagccacag 1201 acgcccgaga cccacgccgg gcgggaccgg ccagggatca cccccgccga cggccccggg 1261 ccccgacggc ccggaagttc cgcgtgtccg ggggcaccgg gggattggcc ggggcgcggc 1321 gtgcaaggct tcccgggggc ggcgactgcc gagctccgcc ctccaggcgg ccccacccgc 1381 ctgccgtcct ggggcgccgc cgccccgccg ccggcagtgg accgctgtgc gcgaaccctg 1441 aaccctacgg tcccgacccg cgggcgaggc cgggtacctg ggctgggatc cggagcaagc 1501 gggcgagggc agcgccctaa gcaggtacgg gcggggctca agtcgcgagg cggggaagcg 1561 ggaggcagac acggacgagg gcgacacaga cacgggaccg aggggcggac accggagaga 1621 cacgggaaag gggtcgggac aggagcacgt ggctcagaca ccgacgccgg gaggccgcag 1681 accccggacg tgtcaggcat ccccgcaggc ccggagcg // LOCUS S72869 3011 bp mRNA PRI 24-JAN-1995 DEFINITION H4(D10S170)=putative cytoskeletal protein [human, thyroid, mRNA, 3011 nt]. ACCESSION S72869 NID g633869 KEYWORDS . SOURCE human thyroid. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3011) AUTHORS Grieco,M., Cerrato,A., Santoro,M., Fusco,A., Melillo,R.M. and Vecchio,G. TITLE Cloning and characterization of H4 (D10S170), a gene involved in RET rearrangements in vivo JOURNAL Oncogene 9 (9), 2531-2535 (1994) MEDLINE 94336206 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 153643] from the original journal article. FEATURES Location/Qualifiers source 1..3011 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..3011 /note="putative cytoskeletal protein" /gene="H4(D10S170)" CDS 37..1794 /gene="H4(D10S170)" /note="putative cytoskeletal protein; This sequence comes from Fig. 3" /codon_start=1 /db_xref="PID:g633870" /translation="MADSASESDTDGAGGNSSSSAAMQSSCSSTSGGGGGGGGGGGGG KSGGIVISPFRLEELTNRLASLQQENKVLKIELETYKLKCKALQEENRDLRKASVTIQ ARAEQEEEFISNTLFKKIQALQKEKETLAVNYEKEEEFLTNELSRKLMQLQHEKGELE QHLEQEQEFQVNKLMKKIKKLENDTISKQLTLEQLRREKIDLENTLEQEQEALVNRLW KRMDKLEAETRILQEKLDQPVSAPPSPRDISMEIDSPENMMRHIRFLKNEVERLKKQL RAAQLQHSEKMAQYLEEERHMREENLRLQRKLQREMERREALCRQLSESESSLEMDDE RYFNEMSAQGLRPRTVSSPIPYTPSPSSSRPISPGLSYASHTVGFTPPTSLTRAGMSY YNSPGLHVQHMGTSHGITRPSPRRSNSPDKFKRPTPPPSPNTQTPVQPPPPPPPPPMQ PTVPSGSHLAAYSFATFGAHLLPALMHELSLNFKLGLIQWSRLLNAKGSFSGIFGYDL FALRLSRLHYPLCCKCLSEMQPVLWVYNTNQTTFSISVLLESSCTSIPWLEPSLFGIW YFSSSVQFLLGPELHSPGF" BASE COUNT 899 a 662 c 642 g 808 t ORIGIN 1 ctgctgctcc tcctcctttc ccagcccgcc gcggccatgg cggacagcgc cagcgagagc 61 gacacggacg gggcgggggg caacagcagc agctcggccg ccatgcagtc gtcctgctcg 121 tcgacctcgg gcggcggcgg tggcggcggg ggaggcggcg gcggtgggaa gtcggggggc 181 attgtcatct cgccgttccg cctggaggag ctcaccaacc gcctggcctc gctgcagcaa 241 gagaacaagg tgctgaagat agagctggag acctacaaac tgaagtgcaa ggcactgcag 301 gaggagaacc gcgacctgcg caaagccagc gttaccatcc aagccagggc tgagcaggaa 361 gaagaattca ttagtaacac tttattcaag aaaattcagg ctttgcagaa ggagaaagaa 421 acccttgctg taaattatga gaaagaagaa gaattcctca ctaatgagct ctccagaaaa 481 ttgatgcagt tgcagcatga gaaaggcgaa ctagaacagc atcttgaaca agagcaggaa 541 tttcaggtca acaaactgat gaagaaaatt aaaaaactgg agaatgacac catttctaag 601 caacttacat tagaacagtt gagacgggag aagattgacc ttgaaaatac attggaacaa 661 gaacaagaag cactagttaa tcgcctctgg aaaaggatgg ataagcttga agctgaaacg 721 cgaatcctgc aggaaaaatt agaccagccc gtctctgctc caccatcgcc tagagatatc 781 tccatggaga ttgattctcc agaaaatatg atgcgtcaca tcaggttttt aaagaatgaa 841 gtggaacggc tgaagaagca actgagagct gctcagttac agcattcaga gaaaatggca 901 cagtatctgg aggaggaacg tcacatgaga gaagagaact tgaggctcca gaggaagctg 961 cagagggaga tggagagacg agaagccctc tgtcgacagc tctccgagag tgagtccagc 1021 ttagaaatgg acgacgaaag gtattttaat gagatgtctg cacaaggatt aagacctcga 1081 actgtgtcca gcccgatccc ttacacacct tctccgagtt caagcaggcc tatatcacct 1141 ggtctatcat atgcaagtca cacggttggt ttcacgccac caacttcact gactagagct 1201 ggaatgtctt attacaattc cccgggtctt cacgtgcagc acatgggaac atcccatggt 1261 atcacaaggc cttcaccacg gagaagcaac agtcctgaca aattcaaacg gcccacgccg 1321 cctccatctc ccaacacaca gaccccagtc cagccacctc cacctccacc tccgccaccc 1381 atgcagccca cggtcccctc aggcagccac ctcgcagcct actccttcgc aacattcggc 1441 gcacacctcc tcccagcctt aatgcatgag cttagtctga atttcaagtt gggactcatc 1501 caatggagcc gtctactcaa cgccaaaggt tccttctctg gcatatttgg atatgactta 1561 tttgcactga ggttatctag gcttcactat ccattgtgtt gtaaatgttt gtcagaaatg 1621 cagccagtgt tgtgggtcta caacactaac cagacgactt tttccatcag tgttttactt 1681 gaatcttcat gtacgtccat tccctggctg gaaccttcgc tgtttggtat ttggtatttc 1741 agcagcagtg tgcaattttt gcttggccca gagcttcatt ctcctggctt ttaggtttgt 1801 aaaagaaaaa gggatatctt ttttatattt ttttccatga atctgcagaa aataactaag 1861 ctgttgtaac cctcctataa ttataatagt gtttacaaac aataccaata attcagcact 1921 acaattcaga cctttgaaaa tctggctttc agtgtagaac agaaagttag atgaatcagt 1981 gcccaagaca tatttcctgt ttaacagaac tttctacaga tacatttttt acaggttatt 2041 ttcattgtgt tattgacatc catgtctctc gtaaacagag gtcccaaagt aatgaatcat 2101 gtggcgtacc ttctccacat aaatggatgg ataattacgt atattaagat gtgattctct 2161 tttttatcct taatgttaat ctacttaacc tggccccctc taacatgagt cgataaatgt 2221 tgtcctactc accggtggtt tcaatggcta attagaatgt gttatttgat ttctgctgca 2281 gaaggcagtg tgattgtaac aaaaacaatg cggcttcccc ctttcgtact tcatttgtgt 2341 tctcttaaaa tagagtttga acaaatattt taaaggtgca aaataccatt agaaaatact 2401 atttgaaatg gacattatcg cattatcttg gcataatggc cagaaaatat tgtattgctt 2461 ggcagaaaag aaaataaggt ctaaaggaaa gtagcacatt agcattgatg gctgttcatt 2521 tcacccagta taagcaagtg tacaaagaag tatattctga atacattatt tccattcatt 2581 tagcacaaat aaatcatttg gtttcacttt gcagtggaac actgagtcac tcttttctta 2641 atacgtgcaa catcttaatt tttgtttttc agcagttgct gttttgtact ttggtagtga 2701 agtgattttt accacctgtg tttgcatatt tatatatgct gtggatgaaa ataacttact 2761 agagaatgta tattttatga caagaatgtg tatctgttgg gatataatca gagaactgaa 2821 aagtaattta tcagtaattt ttaagagtcc atgttttgtg acaaccatct ctaatagcca 2881 actctttatt aaacacactc ctaaaaataa ggaaccatga cgattgtaga tatttaatat 2941 tgtacagtat agaaacctcc gatttttgcc ttcgaatgca gtatttaaga gttaacagaa 3001 aaaaaaaaaa a // LOCUS S72904 2444 bp mRNA PRI 24-JAN-1995 DEFINITION APK1 antigen=MAb KI recognized [human, ovarian carcinoma cell line OVCAR-3, mRNA, 2444 nt]. ACCESSION S72904 NID g633925 KEYWORDS . SOURCE human ovarian carcinoma cell line OVCAR-3. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2444) AUTHORS Chang,K. and Pastan,I. TITLE Molecular cloning and expression of a cDNA encoding a protein detected by the K1 antibody from an ovarian carcinoma (OVCAR-3) cell line JOURNAL Int. J. Cancer 57 (1), 90-97 (1994) MEDLINE 94200897 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 154548] from the original journal article. This sequence comes from Fig. 2. FEATURES Location/Qualifiers source 1..2444 /organism="Homo sapiens" /db_xref="taxon:9606" gene 220..1011 /gene="APK1 antigen" CDS 220..1011 /gene="APK1 antigen" /note="MAb KI recognized; This sequence comes from Fig. 2" /codon_start=1 /product="APK1 antigen" /db_xref="PID:g633926" /translation="MGPLHKSPAKNISVWCKQAEEIRNIHNDELMGIRREEEMEMSDD EIEEMTETKETEESALVSQAEALKEENDSLRWQLDAYRNEVELLKQEQGKVHREDDPN KEQQLKLLQQALQGMQQHLLKVQEEYKKKEAELEKLKDDKLQVEKMLENLKEKESCAS RLCASNQDSEYPLEKTMNSSPIKSEREALLVGIISTFLHVHPFGASIEYICSYLHRLD NKICTSDVECLMGRLQHTFKQEMTGVGASLEKRWKFCGFEGLKLT" BASE COUNT 787 a 462 c 535 g 660 t ORIGIN 1 tggatagagc gaggagaggt caaccgtcgt agcgccaata acttctactc catgatccag 61 tcggccaaca gccatgtccg cctgcctggt gaacgagaaa gctgcccatg agaaagatat 121 ggaagaagca aaggagaagt tcaagcaggc cctttctgga attctcattc aatttgagca 181 gatagtggct gtgtaccatt ccgcctccaa gcagaaggca tgggaccact tcacaaaagc 241 ccagcgaaga acatcagcgt gtggtgcaaa caagctgagg aaattcgcaa cattcataat 301 gatgaattaa tgggaatcag gcgagaagaa gaaatggaaa tgtctgatga tgaaatagaa 361 gaaatgacag aaacaaaaga aactgaggaa tcagccttag tatcacaggc agaagctctg 421 aaggaagaaa atgacagcct ccgttggcag ctcgatgcct accggaatga agtagaactg 481 ctcaagcaag aacaaggcaa agtccacaga gaagatgacc ctaacaaaga acagcagctg 541 aaactcctgc aacaagccct gcaaggaatg caacagcatc tactcaaagt ccaagaggaa 601 tacaaaaaga aagaagctga acttgaaaaa ctcaaagatg acaagttaca ggtggaaaaa 661 atgttggaaa atcttaaaga aaaggaaagc tgtgcttcta ggctgtgtgc ctcaaaccag 721 gatagcgaat accctcttga gaagaccatg aacagcagtc ctatcaaatc tgaacgtgaa 781 gcactgctag tggggattat ctccacattc cttcatgttc acccatttgg agcaagcatt 841 gaatacatct gttcctactt gcaccgtctt gataataaga tctgcaccag cgatgtggag 901 tgtctcatgg gtagactcca gcataccttc aagcaggaaa tgactggagt tggagccagc 961 ctggaaaaga gatggaaatt ctgtggcttc gagggcttga agctgaccta aatctctttg 1021 cctaacaact tgggactcct gaagataaat atgtgttgga caagcataga aagtgattta 1081 tatttttaat ggttttcaag tggaagttcc tttgaatttg tcagttcatt cctggaaaat 1141 cttttgagtt aaaataagga tcctaggaca gcacctcgaa ctacaggccc taaagagaaa 1201 ttgcctcaaa ccacaagtgc tgtaacttcc tcccctttct gtcaattggt tgtctttaaa 1261 tattgcaaaa gtcctgatgc taaacagtat ttggagtgtt ttcagtgtct gtactactgt 1321 tgtagacctt ggtatttttt taaacactgt taactgaaat gttttgatga tttgtatgtg 1381 atttgtgttt ctaaacttct ctttacatta atgttgttac tggtgaaagg catgagagca 1441 gcactaagtc ctctgtgtaa ctgccattgt ctttccaatc cccagtagac cagtaaataa 1501 ataacacatc agtgtcttct agaaggtgcc tgaccaggtt caccttttaa acgacaaagc 1561 atggtttgtg gctttttgca aaattactat gaaccaaaag ttgacaaatg ttccaaagtt 1621 attttctcta acatatcaca ttaaagatct gtttcagaat tgtaaaaagt acatctagat 1681 gtgtttacag aaagcaagta tccagtatga ctggcatgtg ttcatgctat tcagaatcac 1741 ttgtaaatag tctgctttta aaggagggca tgttcagttt tctgtgaatt aaaatatgct 1801 catgtgtggg cacacacgca caaacacaca cacgcacgca cacagtggca gaaggattta 1861 tattaatatt ctttcccctc tggccttctt acagtctgtt ggtccctttg cttctgttgt 1921 cagtgtgttg aattgcaaac cgagtactgc tgtaaatact atgtttactt catgctgaat 1981 gtttgcaaag acttgatata agtattaata gtaatgaatc aatgaataaa taatgagcta 2041 gggtttgtga ggctttctac aaataggtca gctccacctg gagtgcgaat tgccagagac 2101 accttggtag tgcccatcgg caaatcgcaa tggcagcatg tgagtggacc cattcagaaa 2161 ctttctgctt ggtggaaagt aaacagagag gatggaggtt tggggcgaat gtcctgaggc 2221 agagatggtc tttattgtgt gtggtggtgg ttgtggtatt tataataatg caagcatacc 2281 ctcccttgag tctcaattga agataaaaga atgtactgag caagcaaagc caatggagag 2341 tatttcacaa aaatactttg taaatgagat gccagtagtg ttcaaagttg tatttttaaa 2401 agataaatat tcctttttat acctcaaaaa aaaaaaaaaa aaaa // LOCUS S72921 607 bp mRNA PRI 24-JAN-1995 DEFINITION CNTF=ciliary neurotrophic factor [human, mRNA Partial Mutant, 607 nt]. ACCESSION S72921 NID g633829 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 607) AUTHORS Takahashi,R., Yokoji,H., Misawa,H., Hayashi,M., Hu,J. and Deguchi,T. TITLE A null mutation in the human CNTF gene is not causally related to neurological diseases JOURNAL Nature Genet. 7 (1), 79-84 (1994) MEDLINE 94355982 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 154605] from the original journal article. This sequence comes from Fig. 1. FEATURES Location/Qualifiers source 1..607 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..189 /note="ciliary neurotrophic factor, CNTF" /gene="CNTF" CDS 1..189 /gene="CNTF" /note="This sequence comes from Fig. 1." /codon_start=1 /product="ciliary neurotrophic factor" /db_xref="PID:g633830" /translation="MAFTEHSPLTPHRRDLCSRSIWLARKIRSDLTALTESYPGEASG PEQEHQPGLCGWDASGKH" BASE COUNT 155 a 154 c 150 g 148 t ORIGIN 1 atggctttca cagagcattc accgctgacc cctcaccgtc gggacctctg tagccgctct 61 atctggctag caaggaagat tcgttcagac ctgactgctc ttacggaatc ctatccaggt 121 gaagcatcag ggcctgaaca agaacatcaa cctggactct gcggatggga tgccagtggc 181 aagcactgat cagtggagtg agctgaccga ggcagagcga ctccaagaga accttcaagc 241 ttatcgtacc ttccatgttt tgttggccag gctcttagaa gaccagcagg tgcattttac 301 cccaaccgaa ggtgacttcc atcaagctat acataccctt cttctccaag tcgctgcctt 361 tgcataccag atagaggagt taatgatact cctggaatac aagatccccc gcaatgaggc 421 tgatgggatg cctattaatg ttggagatgg tggtctcttt gagaagaagc tgtggggcct 481 aaaggtgctg caggagcttt cacagtggac agtaaggtcc atccatgacc ttcgtttcat 541 ttcttctcat cagactggga tcccagcacg tgggagccat tatattgcta acaacaagaa 601 aatgtag // LOCUS S73591 2704 bp mRNA PRI 01-MAR-1995 DEFINITION brain-expressed HHCPA78 homolog [human, HL-60 acute promyelocytic leukemia cells, mRNA, 2704 nt]. ACCESSION S73591 NID g688296 KEYWORDS . SOURCE human HL-60 acute promyelocytic leukemia cells. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2704) AUTHORS Chen,K.S. and DeLuca,H.F. TITLE Isolation and characterization of a novel cDNA from HL-60 cells treated with 1,25-dihydroxyvitamin D-3 JOURNAL Biochim. Biophys. Acta 1219 (1), 26-32 (1994) MEDLINE 94368869 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 155931] from the original journal article. This sequence comes from Fig. 2. FEATURES Location/Qualifiers source 1..2704 /organism="Homo sapiens" /db_xref="taxon:9606" gene 222..1397 /gene="brain-expressed HHCPA78 homolog" CDS 222..1397 /gene="brain-expressed HHCPA78 homolog" /note="1,25-dihydroxyvitamin D-3 up-regulated; This sequence comes from Fig. 2. Author-given protein sequence is in conflict with the conceptual translation; mismatch(26[K->R])" /codon_start=1 /product="VDUP1" /db_xref="PID:g688297" /translation="MVMFKKIKSFEVVFNDPEKVYGSGEKVAGRVIVEVCEVTRVKAV RILACGVAKVLWMQGSQQCKQTSEYLRYEDTLLLEDQPTGENEMVIMRPGNKYEYKFG FELPQGPLGTSFKGKYGCVDYWVKAFLDRPSQPTQETKKNFEVVDLVDVNTPDLMAPV SAKKEKKVSCMFIPDGRVSVSARIDRKGFCEGDEISIHADFENTCSRIVVPKAAIVAR HTYLANGQTKVLTQKLSSVRGNHIISGTCASWRGKSLRVQKIRPSILGCNILRVEYSL LIYVSVPGSKKVILDLPLVIGSRSGLSSRTSSMASRTSSEMSWVDLNIPDTPEAPPCY MDVIPEDHRLESPTTPLLDDMDGSQDSPIFMYAPEFKFMPPPTYTEVDPCILNNNVQ" BASE COUNT 711 a 588 c 625 g 780 t ORIGIN 1 gcttagtgta accagcggcg tatatttttt aggcgccttt tcgaaaacct agtagttaat 61 attcatttgt ttaaatctta ttttattttt aagctcaaac tgcttaagaa taccttaatt 121 ccttaaagtg aaataatttt ttgcaaaggg gtttcctcga tttggagctt tttttttctt 181 ccaccgtcat ttctaactct taaaaccaac tcagttccat catggtgatg ttcaagaaga 241 tcaagtcttt tgaggtggtc tttaacgacc ctgaaaaggt gtacggcagt ggcgagaggg 301 tggctggccg ggtgatagtg gaggtgtgtg aagttactcg tgtcaaagcc gttaggatcc 361 tggcttgcgg agtggctaaa gtgctttgga tgcagggatc ccagcagtgc aaacagactt 421 cggagtacct gcgctatgaa gacacgcttc ttctggaaga ccagccaaca ggtgagaatg 481 agatggtgat catgagacct ggaaacaaat atgagtacaa gttcggcttt gagcttcctc 541 aggggcctct gggaacatcc ttcaaaggaa aatatgggtg tgtagactac tgggtgaagg 601 cttttcttga ccgcccgagc cagccaactc aagagacaaa gaaaaacttt gaagtagtgg 661 atctggtgga tgtcaatacc cctgatttaa tggcacctgt gtctgctaaa aaagaaaaga 721 aagtttcctg catgttcatt cctgatgggc gggtgtctgt ctctgctcga attgacagaa 781 aaggattctg tgaaggtgat gagatttcca tccatgctga ctttgagaat acatgttccc 841 gaattgtggt ccccaaagct gccattgtgg cccgccacac ttaccttgcc aatggccaga 901 ccaaggtgct gactcagaag ttgtcatcag tcagaggcaa tcatattatc tcagggacat 961 gcgcatcatg gcgtggcaag agccttcggg ttcagaagat caggccttct atcctgggct 1021 gcaacatcct tcgagttgaa tattccttac tgatctatgt tagcgttcct ggatccaaga 1081 aggtcatcct tgacctgccc ctggtaattg gcagcagatc aggtctaagc agcagaacat 1141 ccagcatggc cagccgaacc agctctgaga tgagttgggt agatctgaac atccctgata 1201 ccccagaagc tcctccctgc tatatggatg tcattcctga agatcaccga ttggagagcc 1261 caacaactcc tctgctagat gacatggatg gctctcaaga cagccctatc tttatgtatg 1321 cccctgagtt caagttcatg ccaccaccga cttatactga ggtggatccc tgcatcctca 1381 acaacaatgt gcagtgagca tgtggaagaa aagaagcagc tttacctact tgtttctttt 1441 tgtctctctt cctggacact cactttttca gagactcaac agtctcgtca atggagtgtg 1501 ggtccacctt agcctctgac ttcctaatgt aggaggtggt cagcaggcaa tctcctgggc 1561 cttaaaggat gcggactcat cctcagccag cgcccatgtt gtgatacagg ggtgtttgtt 1621 ggatgggttt aaaaataact agaaaaactc aggcccatcc attttctcag atctccttga 1681 aaattgaggc cttttcgata gtttcgggtc aggtaaaaat ggcctcctgg cgtaagcttt 1741 tcaaggtttt ttggaggctt tttgtaaatt gtgataggaa ctttggacct tgaacttacg 1801 tatcatgtgg agaagagcca atttaacaaa ctaggaagat gaaaagggaa attgtggcca 1861 aaactttggg aaaaggaggt tcttaaaatc agtgtttccc ctttgtgcac ttgtagaaaa 1921 aaaagaaaaa ccttctagag ctgatttgat ggacaatgga gagagctttc cctgtgatta 1981 taaaaaagga agctagctgc tctacggtca tctttgctta gagtatactt taacctggct 2041 tttaaagcag tagtaactgc cccaccaaag gtcttaaaag ccatttttgg agcctattgc 2101 actgtgttct cctactgcaa atattttcat atgggaggat ggttttctct tcatgtaagt 2161 ccttggaatt gattctaagg tgatgttctt agcactttaa ttcctgtcaa attttttgtt 2221 ctccccttct gccatcttaa atgtaagctg aaactggtct actgtgtctc tagggttaag 2281 ccaaaagaca aaaaaaattt tactactttt gagattgccc caatgtacag aattatataa 2341 ttctaacgct taaatcatgt gaaagggttg ctgctgtcag ccttgcccac tgtgacttca 2401 aacccaagga ggaactcttg atcaagatgc ccaaccctgt gatcagaacc tccaaatact 2461 gccatgagaa actagagggc aggtgttcat aaaagccctt tgaaccccct tcctgccctg 2521 tgttaggaga tagggatatt ggcccctcac tgcagctgcc agcacttggt cagtcactct 2581 cagccatagc actttgttca ctgtcctgtg tcagagcact gagctccacc cttttctgag 2641 agttattaca gccagaaagt gtgggctgaa gatggttggt ttcatgtggg ggtattatgt 2701 accc // LOCUS S73619 156 bp DNA PRI 01-MAR-1995 DEFINITION COL10A1=type X alpha 1 collagen {3' region} [human, Schmid metaphyseal chondrodysplasia patient, peripheral blood leukocytes, Genomic Mutant, 156 nt]. ACCESSION S73619 NID g688346 KEYWORDS . SOURCE human peripheral blood leukocytes Schmid metaphyseal chondrodysplasia patient. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 156) AUTHORS Dharmavaram,R.M., Elberson,M.A., Peng,M., Kirson,L.A., Kelley,T.E. and Jimenez,S.A. TITLE Identification of a mutation in type X collagen in a family with Schmid metaphyseal chondrodysplasia JOURNAL Hum. Mol. Genet. 3 (3), 507-509 (1994) MEDLINE 94282047 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 156226] from the original journal article. This sequence comes from Fig. 2A. FEATURES Location/Qualifiers source 1..156 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..156 /partial /note="type X alpha 1 collagen" /gene="COL10A1" CDS 1..156 /partial /gene="COL10A1" /note="This sequence comes from Fig. 2A." /codon_start=1 /product="type X alpha 1 collagen" /db_xref="PID:g688347" /translation="MMNTPKATWIRLQGVPSSISQKMTRCGSSFPMPSQMAYTPLSMS TPLSQDS" BASE COUNT 40 a 44 c 33 g 39 t ORIGIN 1 atgatgaata caccaaaggc tacctggatc aggcttcagg gagtgccatc atcgatctca 61 cagaaaatga ccaggtgtgg ctccagcttc ccaatgccga gtcaaatggc ctatactcct 121 ctgagtatgt ccactcctct ttctcaggat tcctag // LOCUS S73885 2149 bp mRNA PRI 03-MAR-1995 DEFINITION AP-4=basic helix-loop-helix DNA-binding protein [human, cervical carcinoma, HeLa cells, mRNA, 2149 nt]. ACCESSION S73885 NID g693848 KEYWORDS . SOURCE human HeLa cells cervical carcinoma. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2149) AUTHORS Ou,S.H., Garcia-Martinez,L.F., Paulssen,E.J. and Gaynor,R.B. TITLE Role of flanking E box motifs in human immunodeficiency virus type 1 TATA element function JOURNAL J. Virol. 68 (11), 7188-7199 (1994) MEDLINE 95018629 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 156919] from the original journal article. This sequence comes from Fig. 7. FEATURES Location/Qualifiers source 1..2149 /organism="Homo sapiens" /db_xref="taxon:9606" gene 259..1275 /note="AP-4" /gene="AP-4" CDS 259..1275 /gene="AP-4" /note="basic helix-loop-helix DNA-binding protein; Gene Expression Regulation. This sequence comes from Fig. 7" /codon_start=1 /product="AP-4" /db_xref="PID:g693849" /translation="MEYFMVPTQKVPSLQHFRKTEKEVIGGLCSLANIPLTPETQRDQ ERRIRREIANSNERRRMQSINAGFQSLKTLIPHTDGEKLSKAAILQQTAEYIFSLEQE KTRLLQQNTQLKRFIQELSGSSPKRRRAEDKDEGIGSPDIWEDEKAEDLRREMIELRQ QLDKERSVRMMLEEQVRSLEAHMYPEKLKVIAQQVQLQQQQEQVRLLHQEKLEREQQQ LRTQLLPPPAPTHHPTVIVPAPPPPPSHHINVVTMGPSSVINSVSTSRQNLDTIVQAI QHIEGTQEKQELEEEQRRAVIVKPVRSCPEAPTSDTASDSEASDSDAMDQSREEPSGD GELP" BASE COUNT 465 a 655 c 564 g 465 t ORIGIN 1 gacctgcaaa cacacacaca cacacacaca cacacacaca cacacacaca catacacacg 61 caccagggca gccgagagac ctccctcccg cccctcccat gcccgcctcc ctcccctcgc 121 cgccgccgcc gccgccagca tctgggaccg gccgattctg cacctccgtc cggcgctgcc 181 ctttgattcg gatttccatc ttgcattctc cggctgatcg cgggacctgg ctcgtgcaga 241 ggaggggggc cgatcgctat ggagtatttc atggtgccca ctcagaaggt gccctctttg 301 caacatttca ggaaaacaga gaaagaagtg ataggagggc tctgtagcct tgccaacatt 361 ccactaaccc ccgagactca gcgggaccag gagcggcgga ttcggcggga gatcgccaac 421 agcaacgagc ggagacgcat gcagagcatc aacgcgggat tccagtccct caagaccctc 481 atcccccaca cagacggaga gaagctcagc aaggcagcca ttctccagca gacagccgag 541 tacatcttct ccctggagca ggagaagacc aggctcttgc agcagaacac acagctcaag 601 cgcttcatcc aggagctgag cggctcgtcc cccaagcgac ggcgggcaga ggacaaggac 661 gaaggcatag gctccccgga catctgggag gacgagaagg cggaggacct gcggcgggag 721 atgattgagc tgcggcagca gctggacaag gagcgctcgg tgcgcatgat gctggaggag 781 caggtgcgct cgctggaggc ccacatgtac ccggaaaagc tcaaggtgat tgcgcagcag 841 gtgcagctgc agcagcagca ggaacaggtg aggctgctgc accaggagaa gctggagcgg 901 gaacagcagc agctgcggac ccagcttctg ccccctccgg cccccaccca ccaccccacg 961 gtgatcgtgc cagcaccgcc tcctcctccc tcccaccaca tcaatgtcgt caccatgggc 1021 ccctcctcgg tcatcaactc tgtttccaca tcccggcaaa atctggacac catcgtgcag 1081 gcaatccagc acatcgaggg cacccaggaa aagcaggagc tggaggagga gcagcggcga 1141 gctgtcatcg tgaagcctgt ccgcagctgc ccggaggccc ccacctctga caccgcctcc 1201 gactccgagg cctcagacag tgacgccatg gaccagagcc gggaggagcc gtcgggggac 1261 ggggagcttc cctgactacc cccccagccc tcctctccct tctgggggct ggagggagcc 1321 ggggcagcca cagggagaga catgggcgaa tgagtgagaa atttttacaa aattacgatg 1381 tcatttgggt ctcttttatg acctcttttt caatactgta aatcgacctt tgaacgaagc 1441 cactcaaccc gaggtcccgg ggctggggtg tcgcagagct gtgggagcat cggcacccca 1501 gggcggggcc tcggccccgg gggctggagg aagctgacac ggagatgcct ggcctctctc 1561 tgccaaaaag cattttttcc tttaaatatg ttttttaaga acagggaaaa ttaaacaaaa 1621 ccccaggtta tttcttccct gcccagagcc agcctgggat tgtcagcctt caatcccctt 1681 tccttcctct ttttgggttt tcttctttct cctttaagca cttacatggt tgggggtaag 1741 actaggctgg ggcattctgg gggcccggag gtctccgttg cttcttggtt ggggtttgct 1801 gctgctgtgc ccccctcccc cttccccatc tcggcactag aattcgccac tctcccaccc 1861 cccagccccc acctctgcct ccaggtctca tcttccaccc caaaaatgtc tgtctctctc 1921 tttttgtttt gtttgttgtt ggttttttat ttctttttgg tttgctttct gtttttgttt 1981 tgtttttctt ttttttcttt cttttttttt tttttacaat tttgaggtct tcgtgttcaa 2041 ggagaagcta ttatattttg ttaagaaagt ggggagaaaa aaaaccaaga ggccaccgtg 2101 cctttgtaaa gaaacaaaat aaagtttgta ctttgttttt taaaaaaaa // LOCUS S73887 105 bp mRNA PRI 01-MAR-1995 DEFINITION HuD= paraneoplastic encephalomyelitis antigen {5' region, alternatively spliced} [human, lung cancer cell line, mRNA Partial, 105 nt]. ACCESSION S73887 NID g688242 KEYWORDS . SOURCE human lung cancer cell line. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 105) AUTHORS Sekido,Y., Bader,S.A., Carbone,D.P., Johnson,B.E. and Minna,J.D. TITLE Molecular analysis of the HuD gene encoding a paraneoplastic encephalomyelitis antigen in human lung cancer cell lines JOURNAL Cancer Res. 54 (18), 4988-4992 (1994) MEDLINE 94349312 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 157053] from the original journal article. This sequence comes from Fig. 4. Map location: 1p. COMMENT 87 bp insertion. FEATURES Location/Qualifiers source 1..105 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..36 /partial /note="paraneoplastic encephalomyelitis antigen" /gene="HuD" CDS 1..36 /partial /gene="HuD" /codon_start=1 /translation="MVMPSRILKLT" BASE COUNT 36 a 16 c 24 g 29 t ORIGIN 1 atggttatgc cttctagaat cctaaagttg acctgaagcc aagaagaaaa ttctggtgat 61 gggagaagtg gagccactta aattacttac atgatgataa ttagc // LOCUS S74017 2304 bp mRNA PRI 03-MAR-1995 DEFINITION Nrf2=NF-E2-like basic leucine zipper transcriptional activator [human, hemin-induced K562 cells, mRNA, 2304 nt]. ACCESSION S74017 NID g693841 KEYWORDS . SOURCE human hemin-induced K562 cells. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2304) AUTHORS Moi,P., Chan,K., Asunis,I., Cao,A. and Kan,Y.W. TITLE Isolation of NF-E2-related factor 2 (Nrf2), a NF-E2-like basic leucine zipper transcriptional activator that binds to the tandem NF-E2/AP1 repeat of the beta-globin locus control region JOURNAL Proc. Natl. Acad. Sci. U.S.A. 91 (21), 9926-9930 (1994) MEDLINE 95024074 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 157160] from the original journal article. This sequence comes from Fig. 1. FEATURES Location/Qualifiers source 1..2304 /organism="Homo sapiens" /db_xref="taxon:9606" gene 40..1809 /note="Nrf2" /gene="Nrf2" CDS 40..1809 /gene="Nrf2" /note="NF-E2-like basic leucine zipper transcriptional activator; This sequence comes from Fig. 1" /codon_start=1 /product="Nrf2" /db_xref="PID:g693842" /translation="MDLIDILWRQDIDLGVSREVFDFSQRRKEYELEKQKKLEKERQE QLQKEQEKAFFTQLQLDEETGEFLPIQPAQHTQSETSGSANYSQVAHIPKSDALYFDD CMQLLAQTFPFVDDNEVSSATFQSLVPDIPGHIESPVFIATNQAQSPETSVAQVAPVD LDGMQQDIEQVWEELLSIPELQCLNIENDKLVETTMVPSPEAKLTEVDNYHFYSSIPS MEKEVGNCSPHFLNAFEDSFSSILSTEDPNQLTVNSLNSDATVNTDFGDEFYSAFIAE PSISNSMPSPATLSHSLSELLNGPIDVSDLSLCKAFNQNHPESTAEFNDSDSGISLNT SPSVASPEHSVESSSYGDTLLGLSDSEVEELDSAPGSVKQNGPKTPVHSSGDMVQPLS PSQGQSTHVHDAQCENTPEKELPVSPGHRKTPFTKDKHSSRLEAHLTRDELRAKALHI PFPVEKIINLPVVDFNEMMSKEQFNEAQLALIRDIRRRGKNKVAAQNCRKRKLENIVE LEQDLDHLKDEKEKLLKEKGENDKSLHLLKKQLSTLYLEVFSMLRDEDGKPYSPSEYS LQQTRDGNVFLVPKSKKPDVKKN" BASE COUNT 770 a 472 c 438 g 624 t ORIGIN 1 ttggagctgc cgccgccggg actcccgtcc cagcaggaca tggatttgat tgacatactt 61 tggaggcaag atatagatct tggagtaagt cgagaagtat ttgacttcag tcagcgacgg 121 aaagagtatg agctggaaaa acagaaaaaa cttgaaaagg aaagacaaga acaactccaa 181 aaggagcaag agaaagcctt tttcactcag ttacaactag atgaagagac aggtgaattt 241 ctcccaattc agccagccca gcacacccag tcagaaacca gtggatctgc caactactcc 301 caggttgccc acattcccaa atcagatgct ttgtactttg atgactgcat gcagcttttg 361 gcgcagacat tcccgtttgt agatgacaat gaggtttctt cggctacgtt tcagtcactt 421 gttcctgata ttcccggtca catcgagagc ccagtcttca ttgctactaa tcaggctcag 481 tcacctgaaa cttctgttgc tcaggtagcc cctgttgatt tagacggtat gcaacaggac 541 attgagcaag tttgggagga gctattatcc attcctgagt tacagtgtct taatattgaa 601 aatgacaagc tggttgagac taccatggtt ccaagtccag aagccaaact gacagaagtt 661 gacaattatc atttttactc atctataccc tcaatggaaa aagaagtagg taactgtagt 721 ccacattttc ttaatgcttt tgaggattcc ttcagcagca tcctctccac agaagacccc 781 aaccagttga cagtgaactc attaaattca gatgccacag tcaacacaga ttttggtgat 841 gaattttatt ctgctttcat agctgagccc agtatcagca acagcatgcc ctcacctgct 901 actttaagcc attcactctc tgaacttcta aatgggccca ttgatgtttc tgatctatca 961 ctttgcaaag ctttcaacca aaaccaccct gaaagcacag cagaattcaa tgattctgac 1021 tccggcattt cactaaacac aagtcccagt gtggcatcac cagaacactc agtggaatct 1081 tccagctatg gagacacact acttggcctc agtgattctg aagtggaaga gctagatagt 1141 gcccctggaa gtgtcaaaca gaatggtcct aaaacaccag tacattcttc tggggatatg 1201 gtacaaccct tgtcaccatc tcaggggcag agcactcacg tgcatgatgc ccaatgtgag 1261 aacacaccag agaaagaatt gcctgtaagt cctggtcatc ggaaaacccc attcacaaaa 1321 gacaaacatt caagccgctt ggaggctcat ctcacaagag atgaacttag ggcaaaagct 1381 ctccatatcc cattccctgt agaaaaaatc attaacctcc ctgttgttga cttcaacgaa 1441 atgatgtcca aagagcagtt caatgaagct caacttgcat taattcggga tatacgtagg 1501 aggggtaaga ataaagtggc tgctcagaat tgcagaaaaa gaaaactgga aaatatagta 1561 gaactagagc aagatttaga tcatttgaaa gatgaaaaag aaaaattgct caaagaaaaa 1621 ggagaaaatg acaaaagcct tcacctactg aaaaaacaac tcagcacctt atatctcgaa 1681 gttttcagca tgctacgtga tgaagatgga aaaccttatt ctcctagtga atactccctg 1741 cagcaaacaa gagatggcaa tgttttcctt gttcccaaaa gtaagaagcc agatgttaag 1801 aaaaactaga tttaggagga tttgaccttt tctgagctag tttttttgta ctattatact 1861 aaaagctcct actgtgatgt gaaatgctca tactttataa gtaattctat gcaaaatcat 1921 agccaaaact agtatagaaa ataatacgaa actttaaaaa gcattggagt gtcagtatgt 1981 tgaatcagta gtttcacttt aactgtaaac aatttcttag gacaccattt gggctagttt 2041 ctgtgtaagt gtaaatacta caaaaactta tttatactgt tcttatgtca tttgttatat 2101 tcatagattt atatgatgat atgacatctg gctaaaaaga aattattgca aaactaacca 2161 cgatgtactt ttttataaat actgtatgga caaaaaatgg cattttttat aattaaattg 2221 tttagctctg gcaaaaaaaa aaaatttttt aagagctggt actaataaag gattattatg 2281 actgttaaaa aaaaaaaaaa aaaa // LOCUS S74221 756 bp mRNA PRI 15-MAR-1995 DEFINITION IK=IK factor [human, leukemic cells K562, chronic myeloid leukemia patient, mRNA, 756 nt]. ACCESSION S74221 NID g710460 KEYWORDS . SOURCE human leukemic cells K562 chronic myeloid leukemia patient. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 756) AUTHORS Krief,P., Augery-Bourget,Y., Plaisance,S., Merck,M.F., Assier,E., Tanchou,V., Billard,M., Boucheix,C., Jasmin,C. and Azzarone,B. TITLE A new cytokine (IK) down-regulating HLA class II: monoclonal antibodies, cloning and chromosome localization JOURNAL Oncogene 9 (12), 3449-3456 (1994) MEDLINE 95060801 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 157805] from the original journal article. This sequence comes from Fig. 4. Map location: 2p15-p14. FEATURES Location/Qualifiers source 1..756 /organism="Homo sapiens" /db_xref="taxon:9606" gene 40..528 /note="IK factor" /gene="IK" CDS 40..528 /gene="IK" /note="cytokine down-regulating HLA class II; This sequence comes from Fig. 4. Map location 2p15-p14" /codon_start=1 /product="IK factor" /db_xref="PID:g710461" /translation="MNIFEDIGDYVPSTTKTPRDKERERYRERERDRERDRDRDRERE RERDRERERERDREREEEKKRHSYFEKPKVDDEPMDVDKGPGSTKELIKSINEKFAGS AGWEGTESLKKPEDKKQLGDFFGMSNSYAECYPATMDDMAVDSDEEVDYSKMDQGNKK GP" BASE COUNT 253 a 137 c 241 g 125 t ORIGIN 1 cgggggaagc tggaagagaa gaaacctcct gaggctgaca tgaatatttt tgaagacatt 61 ggggattacg taccctccac aaccaagaca cctcgggaca aggagcggga gagatatcgg 121 gaacgggagc gtgatcggga aagagacaga gaccgtgacc gagagcgaga gcgagaacga 181 gatcgggaac gagagcgaga gcgggaccga gagagagaag aggaaaagaa gagacacagc 241 tactttgaga agccaaaagt agatgatgag cccatggacg ttgacaaagg acctgggtct 301 accaaggagt tgatcaagtc catcaatgaa aagtttgctg ggtctgctgg ctgggaaggc 361 acagaatcgc tgaagaagcc agaagacaaa aagcagctgg gagatttctt tggcatgtcc 421 aacagttatg cagagtgcta cccagccacg atggatgaca tggctgtgga tagtgatgag 481 gaggtggatt atagcaaaat ggaccagggt aacaagaagg gcccttaggg ccgttgggac 541 tttgataccc aggaagaata cagcgagtat atgaacaaca aagaagcttt gcccaaggct 601 gcattccagt atggtatcaa aatgtctgaa gggcggaaaa ccaggcgctt caaggaaacc 661 aatgacaaag cagagcttga tcgccagtgg aagaagatta gtgcaatcat tgagaagagg 721 aagaagatgg aagctgatgg ggttgaagtc aagccg // LOCUS S74445 735 bp mRNA PRI 10-JUL-1992 DEFINITION cellular retinoic acid-binding protein [human, skin, mRNA, 735 nt]. ACCESSION S74445 NID g241541 KEYWORDS . SOURCE human skin. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 735) AUTHORS Eller,M.S., Oleksiak,M.F., McQuaid,T.J., McAfee,S.G. and Gilchrest,B.A. TITLE The molecular cloning and expression of two CRABP cDNAs from human skin JOURNAL Exp. Cell Res. 198 (2), 328-336 (1992) MEDLINE 92104256 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 74445] from the original journal article. This sequence comes from Fig. 1. FEATURES Location/Qualifiers source 1..735 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..735 /gene="cellular retinoic acid-binding protein, CRABP I" CDS 75..488 /gene="cellular retinoic acid-binding protein, CRABP I" /note="This sequence comes from Fig. 1; CRABP I" /codon_start=1 /product="cellular retinoic acid-binding protein" /db_xref="PID:g241542" /translation="MPNFAGTWKMRSSENFDELLKALGVNAMLRKVAVAAASKPHVEI RQDGDQFYIKTSTTVRTTEINFKVGEGFEEETVDGRKCRSLATWENENKIHCTQTLLE GDGPKTYWTRELANDELILTFGADDVVCTRIYVRE" BASE COUNT 160 a 219 c 204 g 152 t ORIGIN 1 gagtctgccc ttgcgagctc agagtgtgcc cgtgcgccgc cgccgtcgta cctgccgccg 61 ccgccaccgc caccatgccc aacttcgccg gcacctggaa gatgcgcagc agcgagaatt 121 tcgacgagct gctgaaggca ctgggtgtga acgccatgct gaggaaagtg gccgtagcgg 181 ctgcgtccaa gccgcacgtg gagatccgcc aggacgggga tcagttctac atcaagacat 241 ccaccaccgt gcgcaccact gagatcaact tcaaggtcgg agaaggcttt gaggaggaga 301 ccgtggacgg acgcaagtgc aggagtttag ccacttggga gaatgagaac aagatccact 361 gcacccaaac tcttcttgaa ggggacggcc ccaaaaccta ctggacccgt gagctggcca 421 acgatgaact tatcctgacg tttggcgccg atgacgtggt ctgcaccaga atttatgtcc 481 gggaatgaag gcagctggct tgctcctact ttcaggaagg gatgcaggtc cccgaggaat 541 atgtcatagt tctgagctgc cagtggaccg cccttttccc ctaccaatat taggtgatcc 601 cgttttcccc atgacaatgt tgtagtgtcc cccaccccca cccccctggc cttggtgcct 661 cttgtatccc tagtgctgca tagcccggca tttgcacggt ttcgaagtca ttaaactggt 721 tagacgtgtc tcaaa // LOCUS S74683 1334 bp mRNA PRI 12-MAY-1995 DEFINITION ADP-ribosyltransferase [human, skeletal muscle, mRNA, 1334 nt]. ACCESSION S74683 NID g807099 KEYWORDS . SOURCE human skeletal muscle. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1334) AUTHORS Okazaki,I.J., Zolkiewska,A., Nightingale,M.S. and Moss,J. TITLE Immunological and structural conservation of mammalian skeletal muscle glycosylphosphatidylinositol-linked ADP-ribosyltransferases JOURNAL Biochemistry 33 (43), 12828-12836 (1994) MEDLINE 95034708 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 158929] from the original journal article. This sequence comes from Fig. 1A. FEATURES Location/Qualifiers source 1..1334 /organism="Homo sapiens" /db_xref="taxon:9606" gene 102..1085 /gene="ADP-ribosyltransferase" CDS 102..1085 /gene="ADP-ribosyltransferase" /note="This sequence comes from Fig. 1A." /codon_start=1 /product="ADP-ribosyltransferase" /db_xref="PID:g807100" /translation="MQMPAMMSLLLVSVGLMEALQAQSHPITRRDLFSQEIQLDMALA SFDDQYAGCAAAMTAALPDLNHTEFQANQVYADSWTLASSQWQERQARWPEWSLSPTR PSPPPLGFRDEHGVALLAYTANSPLHKEFNAAVREAGRSRAHYLHHFSFKTLHFLLTE ALQLLGSGQRPPRCHQVFRGVHGLRFRPAGPRATVRLGGFASASLKHVAAQQFGEDTF FGIWTCLGAPIKGYSFFPGEEEVLIPPFETFQVINASRPAQGPARIYLRALGKHSTYN CEYIKDKKCKSGPCHLDNSAMGQSPLSAVWSLLLLLWFLVVRAFPDGPGLL" BASE COUNT 279 a 427 c 368 g 260 t ORIGIN 1 ggccatggtg gagatcagca gcagcttccc cacccaggac aaggcctaga tgaggaaact 61 gagacccaaa aagagacagc aactggccca gggtcaccag catgcagatg cctgctatga 121 tgtctctgct tcttgtgtct gtgggcctca tggaagcact tcaggcccag agccacccca 181 tcacacgacg agacctcttc tctcaagaga ttcagctgga catggccctg gcctcctttg 241 atgaccagta cgctggctgt gctgctgcca tgacagctgc tctcccggat ctcaaccaca 301 cggagttcca ggccaaccag gtgtatgcag acagctggac actggcaagc agccaatggc 361 aggagcgtca ggccaggtgg ccagagtgga gtctcagccc cacccgtcca tccccgccac 421 ccctgggctt ccgcgatgag catggggtgg ccctcctggc ctacacagcc aacagccccc 481 tgcacaagga gttcaatgca gccgtgcgtg aggcgggccg ctcccgggcc cactacctcc 541 accacttctc cttcaagaca ctccatttcc tgctgactga ggccctgcag ctcctgggca 601 gcggccagcg tccaccccgg tgccaccagg tgttccgagg tgtgcacggc ctgcgcttcc 661 ggccagcagg gccccgggcc accgtgaggc tggggggctt tgcttctgcc tccctgaagc 721 atgttgcagc ccagcagttt ggtgaggaca ccttcttcgg catctggacc tgccttgggg 781 cccctatcaa gggctactcc ttcttccctg gagaggaaga ggtgctgatc cccccctttg 841 agaccttcca agtgatcaat gccagcagac cggcccaggg ccccgcccgc atctacctcc 901 gagccctggg caagcacagc acctacaact gcgagtacat caaagacaag aagtgcaagt 961 ctgggccttg ccatctggat aattcagcca tgggtcagag ccccctctct gcagtctggt 1021 ctttgctgct gctgctctgg ttcctcgtgg tgagggcctt tccagatggt ccaggcctcc 1081 tttgatgcat gagacacggg acagcctcgc ctgctgcctc tgcccatcct gaggatgttg 1141 gccatgtgtg cttcagtgta accaagattc ctgtcaatcc catctgcagg gaactctggg 1201 accttctctg gtagctgcca gaccggctgg tggagaaaca ggagacaatc tggggactga 1261 accttaccca gggctgtagg agtgagactc tgaataaagg gttgggccgg caaaaaaaaa 1321 aaaaaaaaaa aaaa // LOCUS S74728 1809 bp mRNA PRI 05-MAY-1995 DEFINITION antiquitin=26g turgor protein homolog [human, kidney, mRNA, 1809 nt]. ACCESSION S74728 NID g797409 KEYWORDS . SOURCE human kidney. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1809) AUTHORS Lee,P., Kuhl,W., Gelbart,T., Kamimura,T., West,C. and Beutler,E. TITLE Homology between a human protein and a protein of the green garden pea JOURNAL Genomics 21 (2), 371-378 (1994) MEDLINE 94375061 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 158837] from the original journal article. This sequence comes from Fig. 1. FEATURES Location/Qualifiers source 1..1809 /organism="Homo sapiens" /db_xref="taxon:9606" gene 55..1590 /gene="antiquitin" CDS 55..1590 /gene="antiquitin" /note="26g turgor protein homolog; garden pea 26g turgor protein homolog. This sequence comes from Fig. 3" /codon_start=1 /product="antiquitin" /db_xref="PID:g797410" /translation="MSTLLINQPQYAWLKELGLREENEGVYNGSWGGRGEVITTYCPA NNEPIARVRQASVADYEETVKKAREAWKIWADIPAPKRGEIVRQIGDALREKIQVLGS LVSLEMGKILVEGVGEVQEYVDICDYAVGLSRMIGGPILPSERSGHALIEQWNPVGLV GIITAFNFPVAVYGWNNAIAMICGNVCLWKGAPTTSLISVAVTKIIAKVLEDNKLPGA ICSLTCGGADIGTAMAKDERVNLLSFTGSTQVGKQVGLMVQERFGRSLLELGGNNAII AFEDADLSLVVPSALFAAVGTAGQRCTTARRLFIHESIHDEVVNRLKKAYAQIRVGNP WDPNVLYGPLHTKQAVSMFLGAVEEAKKEGGTVVYGGKVMDRPGNYVEPTIVTGLGHD ASIAHTETFAPILYVFKFKNEEEVFAWNNEVKQGLSSSIFTKDLGRIFRWLGPKGSDC GIVNVNIPTSGAEIGGAFGGEKHTGGGRESGSDAWKQYMRRSTCTINYSKDLPLAQGI KFQ" BASE COUNT 517 a 363 c 487 g 442 t ORIGIN 1 cctgctccaa ggtccagaga gctttctggt ctttgcagca ggcctgccgc cttcatgtcc 61 actctcctca tcaatcagcc ccagtatgcg tggctgaaag agctggggct ccgcgaggaa 121 aacgagggcg tgtataatgg aagctgggga ggccggggag aggttattac gacctattgc 181 cccgctaaca acgagccaat agcaagagtc cgacaggcca gtgtggcaga ctatgaagaa 241 actgtaaaga aagcaagaga agcatggaaa atctgggcag atattcctgc tccaaaacga 301 ggagaaatag taagacagat tggcgatgcc ttgcgggaga agatccaagt actaggaagc 361 ttggtgtctt tggagatggg gaaaatctta gtggaaggtg tgggtgaagt tcaggagtat 421 gtggatatct gtgactatgc tgttggttta tcaaggatga ttggaggacc tatcttgcct 481 tctgaaagat ctggccatgc actgattgag cagtggaatc ccgtaggcct ggttggaatc 541 atcacggcat tcaatttccc tgtggcagtg tatggttgga acaacgccat cgccatgatc 601 tgtggaaatg tctgcctctg gaaaggagct ccaaccactt ccctcattag tgtggctgtc 661 acaaagataa tagccaaggt tctggaggac aacaagctgc ctggtgcaat ttgttccttg 721 acttgtggtg gagcagatat tggcacagca atggccaaag atgaacgagt gaacctgctg 781 tccttcactg ggagcactca ggtgggaaaa caggtgggcc tgatggtgca ggagaggttt 841 gggagaagtc tgttggaact tggaggaaac aatgccatta ttgcctttga agatgcagac 901 ctcagcttag ttgttccatc agctctcttc gctgctgtgg gaacagctgg ccagaggtgt 961 accactgcga ggcgactgtt tatacatgaa agcatccatg atgaggttgt aaacagactt 1021 aaaaaggcct atgcacagat ccgagttggg aacccatggg accctaatgt tctctatggg 1081 ccactccaca ccaagcaggc agtgagcatg tttcttggag cagtggaaga agcaaagaaa 1141 gaaggtggca cagtggtcta tgggggcaag gttatggatc gccctggaaa ttatgtagaa 1201 ccgacaattg tgacaggtct tggccacgat gcgtccattg cacacacaga gactttcgct 1261 ccgattctct atgtctttaa attcaagaat gaagaagagg tctttgcatg gaataatgaa 1321 gtaaaacagg gactttcaag tagcatcttt accaaagatc tgggcagaat ctttcgctgg 1381 cttggaccta aaggatcaga ctgtggcatt gtaaatgtca acattccaac aagtggggct 1441 gagattggag gtgcctttgg aggagaaaag cacactggtg gtggcaggga gtctggcagt 1501 gatgcctgga aacagtacat gagaaggtct acttgtacta tcaactacag taaagacctt 1561 cctctggccc aaggaatcaa gtttcagtaa aggtgtttta gatgaacatc ccttaatttg 1621 aggtgttcca gcagctgttt ttggagaaga caaagaagat taaagttttc cctgaataaa 1681 tgcattatta tgactgtgac agtgactaat ccccctatga ccccaaagcc ctgattaaat 1741 caagagattc cttttttaaa aatcaaaata aaattgttac aacatagcca tagttactaa 1801 aaaaaaaaa // LOCUS S75264 521 bp mRNA PRI 11-JUL-1995 DEFINITION WT1=Wilms' tumor suppressor protein [human, fetal kidney, mRNA, 521 nt]. ACCESSION S75264 NID g896246 KEYWORDS . SOURCE human fetal kidney. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 521) AUTHORS Hamilton,T.B., Barilla,K.C. and Romaniuk,P.J. TITLE High affinity binding sites for the Wilms' tumour suppressor protein WT1 JOURNAL Nucleic Acids Res. 23 (2), 277-284 (1995) MEDLINE 95166649 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 160293] from the original journal article. This sequence comes from Fig. 1A. FEATURES Location/Qualifiers source 1..521 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..507 /note="WT1" /gene="WT1" CDS 1..507 /gene="WT1" /note="Wilms' tumor suppressor protein; This sequence comes from Fig. 1A" /codon_start=1 /product="WT1" /db_xref="PID:g896247" /translation="MGHHHHHHHHHHSSGHIEGRHMRRVPGVAPTLVRSASETSEKRP FMCAYPGCNKRYFKLSHLQMHSRKHTGEKPYQCDFKDCERRFFRSDQLKRHQRRHTGV KPFQCKTCQRKFSRSDHLKTHTRTHTGEKPFSCRWPSCQKKFARSDELVRHHNMHQRN MTKLQLAL" BASE COUNT 151 a 139 c 120 g 111 t ORIGIN 1 atgggccatc atcatcatca tcatcatcat catcacagca gcggccatat cgaaggtcgt 61 catatgcgac gtgtgcctgg agtagccccg actcttgtac ggtcggcatc tgagaccagt 121 gagaaacgcc ccttcatgtg tgcttaccca ggctgcaata agagatattt taagctgtcc 181 cacttacaga tgcacagcag gaagcacact ggtgagaaac cataccagtg tgacttcaag 241 gactgtgaac gaaggttttt tcgttcagac cagctcaaaa gacaccaaag gagacataca 301 ggtgtgaaac cattccagtg taaaacttgt cagcgaaagt tctcccggtc cgaccacctg 361 aagacccaca ccaggactca tacaggtgaa aagcccttca gctgtcggtg gccaagttgt 421 cagaaaaagt ttgcccggtc agatgaatta gtccgccatc acaacatgca tcagagaaac 481 atgaccaaac tccagctggc gctttgatag agctcggatc c // LOCUS S75295 2940 bp mRNA PRI 26-JUL-1995 DEFINITION nucleoprotein interactor 1=SRP1 homolog [human, cervical carcinoma HeLa cells, mRNA, 2940 nt]. ACCESSION S75295 NID g913392 KEYWORDS . SOURCE human cervical carcinoma HeLa cells. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2940) AUTHORS O'Neil,R.E. and Palese,P. TITLE NPI-1, the human homolog of SRP-1, interacts with influenza virus nucleoprotein JOURNAL Virology 206 (1), 116-125 (1995) MEDLINE 95133142 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 159238] from the original journal article. FEATURES Location/Qualifiers source 1..2940 /organism="Homo sapiens" /db_xref="taxon:9606" gene 47..1663 /gene="nucleoprotein interactor 1, NPI-1" CDS 47..1663 /gene="nucleoprotein interactor 1, NPI-1" /note="SRP1 homolog; This sequence comes from Fig. 2; NPI-1" /codon_start=1 /product="nucleoprotein interactor 1" /db_xref="PID:g913393" /translation="MTTPGKENFRLKSYKNKSLNPDEMRRRREEEGLQLRKQKREEQL FKRRNVATAEEETEEEVMSDGGFHEAQISNMEMAPGGVITSDMIEMIFSKSPEQQLSA TQKFRKLLSKEPNPPIDEVISTPGVVARFVEFLKRKENCSLQFESAWVLTNIASGNSL QTRIVIQARAVPIFIELLSSEFEDVQEQAVWALGNIAGDSTMCRDYVLDCNILPPLLQ LFSKQNRLTMTRNAVWALSNLCRGKSPPPEFAKVSPCLNVLSWLLFVSDTDVLADACW ALSYLSDGPNDKIQAVIDAGVCRRLVELLMHNDYKVVSPALRAVGNIVTGDDIQTQVI LNCSALQSLLHLLSSPKESIKKEACWTISNITAGNRAQIQTVIDANIFPALISILQTA EFRTRKEAAWAITNATSGGSAEQIKYLVELGCIKPLCDLLTVMDSKIVQVALNGLENI LRLGEQEAKRNGTGINPYCALIEEAYGLDKIEFLQSHENQEIYQKAFDLIEHYFGTED EDSSIAPQVDLNQQQYIFQQCEAPMEGFQL" BASE COUNT 826 a 641 c 627 g 846 t ORIGIN 1 ctaacttcag cggtggcacc gggatcggtt gccttgagcc tgaaatatga ccaccccagg 61 aaaagagaac tttcgcctga aaagttacaa gaacaaatct ctgaatcccg atgagatgcg 121 caggaggagg gaggaagaag gactgcagtt acgaaagcag aaaagagaag agcagttatt 181 caagcggaga aatgttgcta cagcagaaga agaaacagaa gaagaagtta tgtcagatgg 241 aggctttcat gaggctcaga ttagtaacat ggagatggca ccaggtggtg tcatcacttc 301 tgacatgatt gagatgatat tttccaaaag cccagagcaa cagctttcag caacacagaa 361 attcaggaag ctgctttcaa aagaacctaa ccctcctatt gatgaagtta tcagcacacc 421 aggagtagtg gccaggtttg tggagttcct caaacgaaaa gagaattgtt cactgcagtt 481 tgaatcagct tgggtactga caaatattgc ttcaggaaat tctcttcaga cccgaattgt 541 gattcaggca agagctgtgc ccatcttcat agagttgctc agctcagagt ttgaagatgt 601 ccaggaacag gcagtctggg ctcttggcaa cattgctgga gatagtacca tgtgcaggga 661 ctatgtctta gactgcaata tccttccccc tcttttgcag ttattttcaa agcaaaaccg 721 cctgaccatg acccggaatg cagtatgggc tttgtctaat ctctgtagag ggaaaagtcc 781 acctccagaa tttgcaaagg tttctccatg tctgaatgtg ctttcctggt tgctgtttgt 841 cagtgacact gatgtactgg ctgatgcctg ctgggccctc tcatatctat cagatggacc 901 caatgataaa attcaagcgg tcatcgatgc gggagtatgt aggagacttg tggaactgct 961 gatgcataat gattataaag tggtttctcc tgctttgcga gctgtgggaa acattgtcac 1021 aggggatgat attcagacac aggtaattct gaattgctca gctctgcaga gtttattgca 1081 tttgctgagt agcccaaagg aatctatcaa aaaggaagca tgttggacga tatctaatat 1141 tacagctgga aatagggcac agatccagac tgtgatagat gccaacattt tcccagccct 1201 cattagtatt ttacaaactg ctgaatttcg gacaagaaaa gaagcagctt gggccatcac 1261 aaatgcaact tctggaggat cagctgaaca gatcaagtac ctagtagaac tgggttgtat 1321 caagccgctc tgtgatctcc tcacggtcat ggactctaag attgtacagg ttgccctaaa 1381 tggcttggaa aatatcctga ggcttggaga acaggaagcc aaaaggaacg gcactggcat 1441 taacccttac tgtgctttga ttgaagaagc ttatggtctg gataaaattg agttcttaca 1501 gagtcatgaa aaccaggaga tctaccaaaa ggcctttgat cttattgagc attacttcgg 1561 gaccgaagat gaagacagca gcattgcacc ccaggttgac cttaaccagc agcagtacat 1621 cttccaacag tgtgaggctc ctatggaagg tttccagctt tgaagcaata ctctgctttc 1681 acgtacctgt gctcagacca ggctacccag tcgagtcctc ttgtggagcc cacagtcctc 1741 atggagctaa cttctcaaat gttttccata atactgtttg cgctcatttg cttgccttgc 1801 gcacctgctc tcttacacac atctggaaaa cctccggctc tctgtggtgg gatacccttc 1861 taataaaagg gtaaccagaa cggcccactc tcttttacgg aaaaatccct aggctttgga 1921 gatccgcact tacattagag ttatgggaat atacacatat taatgtggct ccctttttct 1981 tgtgggggaa taaaagagga ctcctcctca ttccctttaa catgggggaa aaaactgaca 2041 ttaaaagatg agactaaatc tttatcttga attttacaca actacttacg acaagggaga 2101 tgtttagacc tgttggtata cttcagagta cttttcatga gttcttccac agtgaaccct 2161 tggattacct ggtggctttt tctagccaga ttgcattaat ccttactgag attggatggt 2221 tttctttcct ctattggcgc cattcttcag atattaaagt taaaccatcc actccctcac 2281 cttcagcctt cagtgaatgt gctttctagt tgtcaggaat gctgaagaat taacactttg 2341 actcctaaat gtgatactgg tgggtaagag cagggcacat ttaatttgtt cgcttttgct 2401 tctctttggt ctgggcacat ttaatttgtt cgcttttgct tctctttggt cttttcgaat 2461 acttagtaat cgaaaaccat atcctgtaat ttaataaaaa aaactaagga cgaaaaaacc 2521 cctccaattt tcccaaatgc aatcagtgta actaggggct gtgtttctgc attaaaataa 2581 atgtttcagg ctttgtggtc ctgatcaagg tcctcattaa aaaattggag ttcaccctag 2641 gcttttcccc tctgtgactg gcagataaca catacttttg aaagtaactt tgggattttt 2701 tttcttaggt gcagctcgat tctaatcttt tcatgctgca cacgattcct ttaatcgata 2761 gcatccttat ctgaaagaaa taaccatctt ctcaacatga cctgcttaac ccaaataaga 2821 acagtgatct tataacctca ttgtttccta atctatttta tttcatctcc tgctagtact 2881 gtgccgcttc cccctccccc cacacaaaat aaaaacagta tctcgcttct ggctcatttt // LOCUS S75989 1991 bp mRNA PRI 26-JUL-1995 DEFINITION gamma-aminobutyric acid transporter type 3 [human, fetal brain, mRNA, 1991 nt]. ACCESSION S75989 NID g913241 KEYWORDS . SOURCE human fetal brain. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1991) AUTHORS Borden,L.A., Dhar,T.G., Smith,K.E., Branchek,T.A., Gluchowski,C. and Weinshank,R.L. TITLE Cloning of the human homologue of the GABA transporter GAT-3 and identification of a novel inhibitor with selectivity for this site JOURNAL Recept. Channels 2 (3), 207-213 (1994) MEDLINE 95179472 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 161973] from the original journal article. This sequence comes from Fig. 1. FEATURES Location/Qualifiers source 1..1991 /organism="Homo sapiens" /db_xref="taxon:9606" gene 35..1933 /gene="gamma-aminobutyric acid transporter type 3, GABA transporter type 3, GAT-3" CDS 35..1933 /gene="gamma-aminobutyric acid transporter type 3, GABA transporter type 3, GAT-3" /note="This sequence comes from Fig. 1; GABA transporter type 3; GAT-3" /codon_start=1 /product="gamma-aminobutyric acid transporter type 3" /db_xref="PID:g913242" /translation="MTAEKALPLGNGKAAEEARESEAPGGGCSSGGAAPARHPRVKRD KAVHERGHWNNKVEFVLSVAGEIIGLGNVWRFPYLCYKNGGGAFLIPYVVFFICCGIP VFFLETALGQFTSEGGITCWRKVCPLFEGIGYATQVIEAHLNVYYIIILAWAIFYLSN CFTTELPWATCGHEWNTENCVEFQKLNVSNYSHVSLQNATSPVMEFWEHRVLAISDGI EHIGNLRWELALCLLAAWTICYFCIWKGTKSTGKVVYVTATFPYIMLLILLIRGVTLP GASEGIKFYLYPDLSRLSDPQVWVDAGTQIFFSYAICLGCLTALGSYNNYNNNCYRDC IMLCCLNSGTSFVAGFAIFSVLGFMAYEQGVPIAEVAESGPGLAFIAYPKAVTMMPLS PLWATLFFMMLIFLGLDSQFVCVESLVTAVVDMYPKVFRRGYRRELLILALSVISYFL GLVMLTEGGMYIFQLFDSYAASGMCLLFVAIFECICIGWVYGSNRFYDNIEDMIGYRP PSLIKWCWMIMTPGICAGIFIFFLIKYKPLKYNNIYTYPAWGYGIGWLMALSSMLCIP LWICITVWKTEGTLPEKLQKLTTPSTDLKMRGKLGVSPRMVTVNDCDAKLKSDGTIAA ITEKETHF" BASE COUNT 388 a 541 c 583 g 479 t ORIGIN 1 agccgggccg gcgcacgagg cagccagcgc ggccatgacg gcggagaagg cgctgcccct 61 gggcaatggg aaggctgctg aggaggcgcg ggagtccgag gcgccgggtg gcggctgcag 121 cagcgggggc gcggcgcccg cgcgccaccc gcgcgtcaag cgcgacaagg cggtccacga 181 gcgcggccac tggaacaaca aggtggagtt cgtgctgagc gtggccgggg agatcattgg 241 gctgggcaac gtgtggcgct tcccctacct gtgctacaag aacggaggag gggcattcct 301 gattccctac gtggtgtttt ttatttgctg tggaattcct gtttttttcc tggagacagc 361 tctggggcag ttcacaagtg aaggtggcat tacgtgttgg aggaaagttt gccctttatt 421 tgaaggcatt ggctatgcaa cacaggtgat tgaggcccat ctgaatgtgt actacatcat 481 catcctggca tgggccattt tttacctgag caactgcttc actactgagc taccctgggc 541 tacctgtggg catgagtgga acacagagaa ttgtgtggag ttccagaaac tgaatgtgag 601 caactacagc catgtgtctc tgcagaatgc cacctcccct gtcatggagt tttgggagca 661 ccgggtcctg gccatctctg acgggatcga gcacatcggg aaccttcgct gggagctggc 721 cttgtgtctc ttggcagcct ggaccatctg ttacttctgt atctggaagg ggaccaagtc 781 tacaggaaag gttgtatacg tgactgcgac attcccctac atcatgctgc tgatcctcct 841 gatacgaggg gtcacgttgc ccggggcctc agagggcatc aagttctact tgtaccctga 901 cctctcccgg ctctccgacc cccaggtctg ggtagatgct ggaacgcaga tctttttctc 961 ctatgccatt tgcctgggct gtctgaccgc tctgggaagt tataacaatt ataacaacaa 1021 ctgctacagg gactgcatca tgctctgttg cctgaacagc ggcaccagct tcgtggctgg 1081 gtttgccatc ttctcagtcc tgggttttat ggcgtacgag cagggggtac ccattgctga 1141 ggtggcagag tcaggccccg gcctggcctt tattgcgtac cccaaggcgg tcaccatgat 1201 gcctctctcc ccgctgtggg ccaccttgtt cttcatgatg ctcatcttcc tgggcctgga 1261 cagccagttt gtgtgtgtgg aaagcctggt gaccgccgtg gtggacatgt accccaaggt 1321 tttccggagg ggttaccggc gggagctgct catcctagcc ttgtctgtta tctcctattt 1381 tctgggcctc gtgatgttaa cagagggtgg catgtacatc ttccagctct ttgactccta 1441 tgccgccagt gggatgtgcc ttctcttcgt ggccatcttt gagtgcatct gcatcggctg 1501 ggtgtatgga agcaaccggt tctatgataa cattgaagac atgattggct accggccacc 1561 gtcgctcatt aagtggtgct ggatgatcat gacccctggg atctgcgcgg ggatcttcat 1621 cttcttcttg atcaagtaca agccactcaa gtacaacaac atctacacct acccagcctg 1681 gggctatggc attggctggc tcatggccct gtcctccatg ctctgcatcc cgctctggat 1741 ctgcatcaca gtgtggaaga cggaggggac actgcccgag aaactccaga agttgacgac 1801 ccccagcaca gatctgaaaa tgcggggcaa gcttggggtg agcccacgga tggtgacagt 1861 taatgactgt gatgccaaac tcaagagtga cgggaccatc gcagccatca cagagaagga 1921 gacgcacttc tgagcggcca ccagccatct ggggctcttc ttcctttctt ccccccgtgt 1981 atgtaaatga a // LOCUS S76965 2147 bp mRNA PRI 10-JUL-1992 DEFINITION protein kinase inhibitor [human, neuroblastoma cell line SH-SY-5Y, mRNA, 2147 nt]. ACCESSION S76965 NID g243493 KEYWORDS . SOURCE human neuroblastoma cell line SH-SY-5Y. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2147) AUTHORS Olsen,S.R. and Uhler,M.D. TITLE Inhibition of protein kinase-A by overexpression of the cloned human protein kinase inhibitor JOURNAL Mol. Endocrinol. 5 (9), 1246-1256 (1991) MEDLINE 92123220 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 76965] from the original journal article. This sequence comes from Fig.1B. FEATURES Location/Qualifiers source 1..2147 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..2147 /gene="protein kinase inhibitor, PKI" CDS 534..764 /gene="protein kinase inhibitor, PKI" /note="This sequence comes from Fig.1B; PKI" /codon_start=1 /product="protein kinase inhibitor" /db_xref="PID:g243494" /translation="MTDVETTYADFIASGRTGRRNAIHDILVSSASGNSNELALKLAG LDINKTEGEEDAQRSSTEQSGEAQGEAAKSES" BASE COUNT 618 a 426 c 475 g 628 t ORIGIN 1 gaattccgcg ctttgcagag aaagcccccc tggctcttct ctctcaatcc ataatccagc 61 gatgctgcag ctgtaaaagc attaatagaa aatcaatcca caacctcgcg gggcagcgat 121 cgtcgagcgc cgtttccagg ctgccttccc tggggtcggg agcggccccg ctccccccgt 181 ggctggcgcg aaatgtggtg atccgtcccg gggcggggat gacttcatgc agccggagct 241 ccgcggcggg agcggaggct gctgctggca ggtggggcgc gggccggcgc gagctgaccg 301 agcactcggc gggcgcggcg ggactgcggc ccgtggcggc gtgcgcggga cctgcgctga 361 ctaggtccgg ggaagtttcc tgactttctg agaagccctg gtttccccaa agaagtgact 421 tttctgatag aaatctgaag gtcatctcca agaaaaaaga gatctagtat agtcaatgaa 481 ttaaagacaa gaaggtttcc aatcagtccc tgctatgtgg atatttggta gcaatgactg 541 atgtggaaac tacatatgca gattttattg cttcaggaag aacaggtaga agaaatgcaa 601 tacatgatat cctggtttcc tctgcaagtg gcaacagcaa tgaattagcc ttgaaattag 661 caggtcttga tatcaacaag acagaaggtg aagaagatgc acaacgaagt tctacagaac 721 aaagtgggga agcccaggga gaagcagcaa aatctgaaag ctaacacccc actttgaccc 781 tcgaccacac ctgaaaatgt ctcaaatctc caggagtatc tggaatgcat ttgtttccat 841 gagtgaaaag aggaaaaaga aaatggctgt gctgcattgc aggaacctgc tcattatcat 901 gttaaaaatg agggcagagg ctgtggctgc aggcagactt ttccctacct ctgtcattag 961 caatggttga aatcatgtgg cttgtgtttg ggcgtcattt ttgtatggat cctttcactt 1021 gatcatatga cgaaatgctt atagagagta gctccgacct agatgatgat tcttcctgta 1081 gcatctggcc cctcacaatg tcagaggatt taattgtgtc taattgcgaa gggttgattg 1141 aaccccagag tttaaatatc tctggctcaa gtgttcaccc agtaaaagaa agatccagaa 1201 agcactgttt ttagcattac gtatctgtgt gttactgctg tgttatttac actgttttgt 1261 attgtacaat atatatgctc agcactgccc cctctctgat tgcttatgaa aacaaaatga 1321 tgtacattac tgtgaatttt tataccactc atttttaaaa gggctgtctt ttcattttag 1381 ttttccatac tgtggtggtg tacacaggat agaacaccct tttttaaaac acagtctttc 1441 cccttgctca ttgtatgttg atgagttgat taagtctaac agattcatca agactccatt 1501 gctttattat agagacattt gaaaatatcc attaatgtga atatcacctg aattcaatct 1561 gtcaaatttg ttttattctc aagtggagaa cttctcccac ataatatata tatatatata 1621 tattttaatt tatgagaatt ttggacaatt ggaaaggtag aaaagaaaag ccaagatcat 1681 actaaggact ggaaatattt tgttctatgg aatcaaattt ctcacaatgc tgtatgatac 1741 tatttaaatt tggaggacaa cttatcttca ctaagctgaa tcaggtggag aaagtaatct 1801 ccttgcaatc atgtggacac caatcacaaa agtaaagccc tggtgttgtg ttttcatgtc 1861 ttttttcagc cctctcagat ccaaatgtta ttatgcactt tttaatgttt gtaaactttt 1921 actaataatt agtgtgaatt gcattctgat acaataatga ttatcattag aagctaacaa 1981 aattctcatt aatactgtgt ttgatggcct ctgctgtgtt ttaacatcgt gcttcttata 2041 tggaaagttt ttgtgagctg tgtaatccct ctggtcagta ttatgaaatc atttgtcagt 2101 ggtaataaat aaggaaccag taaaaaaaaa aaaaaaaaaa ggaattc // LOCUS S76992 2753 bp mRNA PRI 26-JUL-1995 DEFINITION VAV2=VAV oncogene homolog [human, fetal brain, mRNA Partial, 2753 nt]. ACCESSION S76992 NID g913345 KEYWORDS . SOURCE human fetal brain. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2753) AUTHORS Henske,E.P., Short,M.P., Jozwiak,S., Bovey,C.M., Ramlakhan,S., Haines,J.L. and Kwiatkowski,D.J. TITLE Identification of VAV2 on 9q34 and its exclusion as the tuberous sclerosis gene TSC1 JOURNAL Ann. Hum. Genet. 59 (Pt 1), 25-37 (1995) MEDLINE 95283235 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 164173] from the original journal article. This sequence comes from Fig. 2. Map location: 9q34. FEATURES Location/Qualifiers source 1..2753 /organism="Homo sapiens" /db_xref="taxon:9606" gene 6..2642 /note="VAV oncogene homolog" /gene="VAV2" CDS 6..2642 /gene="VAV2" /note="VAV oncogene homolog; This sequence comes from Fig. 2" /codon_start=1 /db_xref="PID:g913346" /translation="MEQWRQCGRWLIDCKVLPPNHRVVWPSAVVFDLAQALRDGVLLC QLLHNLSPGSIDLKDINFRPQMSQFLCLKNIRTFLKVCHDKFGLRNSELFDPFDLFDV RDFGKVISAVSRLSLHSIAQNKGIRPFPSEETTENDDDVYRSLEELADEHDLGEDIYD CVPCEDGGDDIYEDIIKVEVQQPMIRYMQKMGMTEDDKRNCCLLEIQETEAKYYRTLE DIEKNYMSPLRLVLSPADMAAVFINLEDLIKVHHSFLRAIDVSVMVGGSTLAKVFLDF KERLLIYGEYCSHMEHAQNTLNQLLASREDFRQKVEECTLKVQDGKFKLQDLLVVPMQ RVLKYHLLLKELLSHSAERPERQQLKEALEAMQDLAMYINEVKRDKETLRKISEFQSS IENLQVKLEEFGRPKIDGELKVRSIVNHTKQDRYLFLFDKVVIVCKRKGYSYELKEII ELLFHKMTDDPMNNKDVKKSHGKMWSYGFYLIHLQGKQGFQFFCKTEDMKRKWMEQFE MAMSNIKPDKANANHHSFQMYTFDKTTNCKACKMFLRGTFYQGYMCTKCGVGAHKECL EVIPPCKFTSPADLDASGAGPGPKMVAVQNYHGNPAPPGKPVLTFQTGDVLELLRGDP ESPWWEGRLVQTRKSGYFPSSSVKPCPVDGRPPISRPPSREIDYTAYPWFAGNMERQQ TDNLLKSHASGTYLIRERPAEAERFAISIKFNDEVKHIKVVEKDNWIHITEAKKFDSL LELVEYYQCHSLKESFKQLDTTLKYPYKSRERSASRASSRSPASCASYNFSFLSPQGL SFASQGPSAPFWSVFTPRVIGTAVARYNFAARDMRELSLREGDVVRIYSRIGGDQGWW KGETNGRIGWFPSTYVEEEGIQ" BASE COUNT 674 a 755 c 793 g 531 t ORIGIN 1 gcgccatgga gcagtggcga cagtgcggcc gctggctcat cgattgcaag gtcctgccgc 61 ccaaccaccg ggtggtgtgg ccctcggccg tggtcttcga cctggcgcag gcgctgcgcg 121 acggggtcct tctgtgccag ctgctgcaca acctctcccc cggctccatc gacctcaagg 181 acatcaactt ccggccgcag atgtcccagt ttctgtgttt gaagaacata cgcaccttcc 241 tgaaagtctg ccacgataaa tttggattaa ggaacagcga gctgtttgac ccctttgacc 301 tcttcgatgt gcgagacttt ggaaaggtca tctccgcggt gtcgaggctc tccctgcaca 361 gcatcgcgca gaacaaaggg atcaggcctt ttccctcaga ggagaccaca gagaatgacg 421 atgacgtcta ccgcagcctg gaggagctgg ccgacgagca tgacctgggg gaggacatct 481 acgactgcgt cccgtgtgag gatggagggg acgacatcta cgaggacatc atcaaggtgg 541 aggtgcagca gcccatgatt agatacatgc agaaaatggg catgactgaa gatgacaaga 601 ggaactgctg cctgctggag atccaggaga ccgaggccaa gtactaccgc accctggagg 661 acattgagaa gaactacatg agccccctgc ggctggtgct gagcccggcg gacatggcag 721 ctgtcttcat taacctggag gacctgatca aggtgcatca cagcttcctg agggccatcg 781 acgtgtccgt gatggtgggg ggcagcacgc tggccaaggt cttcctcgat ttcaaggaaa 841 ggctcctgat ctacggggag tactgcagcc acatggagca cgcccagaac acactgaacc 901 agctcctggc cagccgggag gacttcaggc agaaagtcga ggagtgcaca ctgaaggtcc 961 aggatggaaa atttaagctg caagacctgc tggtggtccc catgcagagg gtgctcaaat 1021 accacctgct cttgaaggag cttctgagcc attctgcgga acggcctgag aggcagcagc 1081 tcaaagaagc actggaagcc atgcaggact tggcgatgta catcaatgaa gttaaacggg 1141 acaaggagac cttgaggaaa atcagcgaat ttcagagttc tatagaaaat ttgcaagtga 1201 aactggagga atttggaaga ccaaagattg acggggaact gaaagtccgg tccatagtca 1261 accacaccaa gcaggacagg tacttgttcc tgtttgacaa ggtggtcatc gtctgcaagc 1321 ggaagggcta cagctacgag ctcaaggaga tcatcgagct gctgttccac aagatgaccg 1381 acgaccccat gaacaacaag gacgtcaaga agtctcacgg gaaaatgtgg tcctacggct 1441 tctacctaat tcaccttcaa ggaaagcagg gcttccagtt tttctgcaaa acagaagata 1501 tgaagaggaa gtggatggag cagtttgaga tggccatgtc aaacatcaag ccagacaaag 1561 ccaatgccaa ccaccacagt ttccagatgt acacgtttga caagaccacc aactgcaaag 1621 cctgcaaaat gttcctcagg ggcaccttct accagggata catgtgtacc aagtgtggcg 1681 tcggggcaca caaggagtgc ctggaagtga tacctccctg caagttcact tctcctgcag 1741 atctggacgc ctccggagcg ggaccaggtc ccaagatggt ggccgtgcag aattaccatg 1801 gcaacccagc ccctcccggg aagcctgtgc tgaccttcca gaccggcgac gtgcttgagc 1861 tgctgagggg cgaccctgag tctccgtggt gggagggtcg tctggtacaa accaggaagt 1921 cagggtattt ccccagctca tctgtgaagc cctgccctgt ggatggaagg ccgcccatca 1981 gccggccgcc atcccgggag atcgactaca ctgcataccc ctggtttgca ggtaacatgg 2041 agaggcagca gacggacaac ctgctcaagt cccacgccag cgggacctac ctgatcaggg 2101 agcggcctgc cgaggctgag cgctttgcaa taagcatcaa gttcaatgat gaggtgaagc 2161 acatcaaggt ggtggagaag gacaactgga tccacatcac agaggccaag aaattcgaca 2221 gcctcctgga gttggtggag tactaccagt gccactcact gaaggagagc ttcaagcagc 2281 tggacaccac actcaagtac ccctacaagt cccgggaacg ttcggcctcc agagcctcca 2341 gccggtcccc agcttcctgt gcttcctaca acttttcttt tctcagtcct cagggcctca 2401 gctttgcttc tcagggcccc tccgctccct tctggtcagt gttcacgccc cgcgtcatcg 2461 gcacagctgt ggccaggtat aactttgccg cccgagatat gagggagctt tcgctgcggg 2521 agggtgacgt ggtgaggatc tacagccgca tcggcggaga ccagggctgg tggaagggcg 2581 agaccaacgg acggattggc tggtttcctt caacgtacgt agaagaggag ggcatccagt 2641 gacggcagga acgtggacaa gactcgcaga ttttcttggg agagtcactc cagccctgaa 2701 gtctgtctct agctcctctg tgactcagag gggaaatacc aacctcccag tct // LOCUS S77579 90 bp mRNA PRI 25-AUG-1995 DEFINITION HERVK10/HUMMTV reverse transcriptase homolog {clone RT25} [human, multiple sclerosis, brain plaques, mRNA Partial, 90 nt]. ACCESSION S77579 NID g957373 KEYWORDS . SOURCE human brain plaques multiple sclerosis. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 90) AUTHORS Lefebvre,S., Hubert,B., Tekaia,F., Brahic,M. and Bureau,J.F. TITLE Isolation from human brain of six previously unreported cDNAs related to the reverse transcriptase of human endogenous retroviruses JOURNAL AIDS Res. Hum. Retroviruses 11 (2), 231-237 (1995) MEDLINE 95260532 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 165928] from the original journal article. This sequence comes from Table 1. FEATURES Location/Qualifiers source 1..90 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..72 /partial /gene="HERVK10/HUMMTV reverse transcriptase homolog" CDS 1..72 /partial /gene="HERVK10/HUMMTV reverse transcriptase homolog" /note="This sequence comes from Table 1." /codon_start=1 /db_xref="PID:g957374" /translation="MLNSPTICQTYVGKVIKPVREQF" gene 73..90 /partial /gene="orf 3' of HERVk10/HUMMMTV reverse transcriptase homolog" CDS 73..90 /partial /gene="orf 3' of HERVk10/HUMMMTV reverse transcriptase homolog" /codon_start=1 /db_xref="PID:g1683508" /translation="KCYSIH" BASE COUNT 33 a 11 c 15 g 31 t ORIGIN 1 atgttaaata gcccaactat ttgtcaaacc tatgttggga aagttattaa gccagttaga 61 gaacagtttt aaaaatgtta tagtattcat // LOCUS S78085 1282 bp mRNA PRI 26-SEP-1995 DEFINITION PDCD2=programmed cell death-2/Rp8 homolog [human, fetal lung, mRNA, 1282 nt]. ACCESSION S78085 NID g998900 KEYWORDS . SOURCE human fetal lung. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1282) AUTHORS Kawakami,T., Furukawa,Y., Sudo,K., Saito,H., Takami,S., Takahashi,E. and Nakamura,Y. TITLE Isolation and mapping of a human gene (PDCD2) that is highly homologous to Rp8, a rat gene associated with programmed cell death JOURNAL Cytogenet. Cell Genet. 71 (1), 41-43 (1995) MEDLINE 95330968 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 166837] from the original journal article. This sequence comes from Fig. 1. Map location: 6q27. FEATURES Location/Qualifiers source 1..1282 /organism="Homo sapiens" /db_xref="taxon:9606" gene 30..1064 /note="programmed cell death-2/Rp8 homolog" /gene="PDCD2" CDS 30..1064 /gene="PDCD2" /note="programmed cell death-2/Rp8 homolog; This sequence comes from Fig. 1" /codon_start=1 /db_xref="PID:g998901" /translation="MAAAGARPVELGFAESAPAWRLRSEQFPSKVGGRPAWLGAAGLP GPQALACELCGRPLSFLLQVYAPLPGRPDAFHRCIFLFCCREQPCCAGLRVFRNQLPR KNDFYSYEPPSENPPPETGESVCLQLKSGAHLCRVCGCLGPKTCSRCHKAYYCSKEHQ TLDWRLGHKQACAQPDHLDHIIPDHNFLFPEFEIVIETEDEIMPEVVEKEDYSEIIGS MGEALEEGLDSMAKHESREDKIFQKFKTQIALEPEQILRYGRGIAPIWISGENIPQEK DIPDCPCGAKRILEFQVMPQLLNYLKADRLGKSIDWGILAAFTCAESCSLGTGYTEEF VWKQDVTDTP" BASE COUNT 344 a 309 c 331 g 298 t ORIGIN 1 gctgcgcccc acgccagccc gcgccccgca tggctgccgc cggggccagg cctgtggagc 61 tgggcttcgc cgagtcggcg ccggcgtggc gactgcgcag cgagcagttc cccagcaagg 121 tgggcgggcg gccggcatgg ctgggcgcgg ccgggctgcc ggggccccag gccctggcct 181 gcgagctgtg cggccgcccg ctctccttcc tgctgcaggt gtatgcgccg ctgcctggcc 241 gcccggacgc cttccaccgc tgcatcttcc tcttctgctg ccgcgagcag ccgtgctgtg 301 ccggcttgcg agtttttagg aatcaactac ccaggaaaaa cgatttttac tcatatgagc 361 caccttctga gaatcctccc ccagaaacag gagaatcagt gtgtctccag cttaagtctg 421 gtgctcatct ctgcagggtt tgtggctgtt taggccccaa aacgtgctcc agatgccaca 481 aagcatatta ctgcagcaag gagcatcaga ccctagactg gagattggga cataagcagg 541 cttgtgcaca accagatcat ctggaccata taattccaga ccacaacttc ctttttccag 601 aatttgaaat tgtaatagaa acagaagatg agattatgcc tgaggttgtg gaaaaggaag 661 attactcaga gattataggg agcatgggtg aagcacttga ggaaggactg gattccatgg 721 caaaacatga atccagggaa gataaaattt ttcagaagtt taaaactcag atagcccttg 781 aaccagaaca gattcttaga tatggcagag gtattgcccc catctggatt tctggtgaaa 841 atattcctca agaaaaggat attccagatt gcccctgtgg tgccaagaga atattggaat 901 tccaggtcat gcctcagctc ttaaactacc tgaaggctga cagactgggc aagagcattg 961 actggggcat cctggctgct ttcacctgtg ctgagagctg cagcttgggt actggttata 1021 cagaagaatt tgtgtggaag caggatgtaa cagatacacc gtaaaggcat cttaaagcct 1081 tgaaaaatgt taataatctt ttataccttg caattccatt tctgggattt tatcctaagg 1141 aaatacttat accaaaaata gaggtgcaga gatgttgacg gattgcttac acagtgtcta 1201 cttattagtg aaacaaaagt gtccagtgac agggaattaa ataaattttg gtacatccac 1261 aaaaaaaaaa aaaaaaaaaa aa // LOCUS S78203 2685 bp mRNA PRI 26-SEP-1995 DEFINITION PEPT 2=H+/peptide cotransporter [human, kidney, mRNA Partial, 2685 nt]. ACCESSION S78203 NID g999212 KEYWORDS . SOURCE human kidney. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2685) AUTHORS Liu,W., Liang,R., Ramamoorthy,S., Fei,Y.J., Ganapathy,M.E., Hediger,M.A., Ganapathy,V. and Leibach,F.H. TITLE Molecular cloning of PEPT 2, a new member of the H+/peptide cotransporter family, from human kidney JOURNAL Biochim. Biophys. Acta 1235 (2), 461-466 (1995) MEDLINE 95275926 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 167223] from the original journal article. This sequence comes from Fig. 1. FEATURES Location/Qualifiers source 1..2685 /organism="Homo sapiens" /db_xref="taxon:9606" gene 31..2220 /gene="PEPT 2" CDS 31..2220 /gene="PEPT 2" /note="H+/peptide cotransporter; This sequence comes from Fig. 1" /codon_start=1 /product="PEPT 2" /db_xref="PID:g999213" /translation="MNPFQKNESKETLFSPVSIEEVPPRPPSPPKKPSPTICGSNYPL SIAFIVVNEFCERFSYYGMKAVLILYFLYFLHWNEDTSTSIYHAFSSLCYFTPILGAA IADSWLGKFKTIIYLSLVYVLGHVIKSLGALPILGGQVVHTVLSLIGLSLIALGTGGI KPCVAAFGGDQFEEKHAEERTRYFSVFYLSINAGSLISTFITPMLRGDVQCFGEDCYA LAFGVPGLLMVIALVVFAMGSKIYNKPPPEGNIVAQVFKCIWFAISNRFKNRSGDIPK RHDWLDWAAEKYPKQLIMDVKALTRVLFLYIPLPMFWALLDQQGSRWTLQAIRMNRNL GFFVLQPDQMQVLNPLLVLIFIPLFDFVIYRLVSKCGINFSSLRKMAVGMILACLAFA VAARVEIKINEMAPAQPGPQEVFLQVLNLADDEVKVTVVGNENNSLLIESIKSFQKTP HYSKLHLKTKSQDFHFHLKYHNLSLYTEHSVQEKNWYSLVIREDGNSISSMMVKDTES RTTNGMTTVRFVNTLHKDVNISLSTDTSLNVGEDYGVSAYRTVQRGEYPAVHCRTEDK NFSLNLGLLDFGAAYLFVITNNTNQGLQAWKIEDIPANKMSIRWQLPQYALVTAGEVM FSVTGLEFSYSQAPSSMKSVLQAAWLLTIAVGNIIVLVVAQFSGLVQWAEFILFSCLL LVICLIFSIMGYYYVPVKTEDMRGPADKHIPHIQGNMIKLETKKTKL" BASE COUNT 719 a 595 c 602 g 769 t ORIGIN 1 cgaggagaga gagagagtaa ggagccagcc atgaatcctt tccagaaaaa tgagtccaag 61 gaaactcttt tttcacctgt ctccattgaa gaggtaccac ctcgaccacc tagccctcca 121 aagaagccat ctccgacaat ctgtggctcc aactatccac tgagcattgc cttcattgtg 181 gtgaatgaat tctgcgagcg cttttcctat tatggaatga aagctgtgct gatcctgtat 241 ttcctgtatt tcctgcactg gaatgaagat acctccacat ctatatacca tgccttcagc 301 agcctctgtt attttactcc catcctggga gcagccattg ctgactcgtg gttgggaaaa 361 ttcaagacaa tcatctatct ctccttggtg tatgtgcttg gccatgtgat caagtccttg 421 ggtgccttac caatactggg aggacaagtg gtacacacag tcctatcatt gatcggcctg 481 agtctaatag ctttggggac aggaggcatc aaaccctgtg tggcagcttt tggtggagac 541 cagtttgaag aaaaacatgc agaggaacgg actagatact tctcagtctt ctacctgtcc 601 atcaatgcag ggagcttgat ttctacattt atcacaccca tgctgagagg agatgtgcaa 661 tgttttggag aagactgcta tgcattggct tttggagttc caggactgct catggtaatt 721 gcacttgttg tgtttgcaat gggaagcaaa atatacaata aaccaccccc tgaaggaaac 781 atagtggctc aagttttcaa atgtatctgg tttgctattt ccaatcgttt caagaaccgt 841 tctggagaca ttccaaagcg acacgactgg ctagactggg cggctgagaa atatccaaag 901 cagctcatta tggatgtaaa ggcactgacc agggtactat tcctttatat cccattgccc 961 atgttctggg ctcttttgga tcagcagggt tcacgatgga ctttgcaagc catcaggatg 1021 aataggaatt tggggttttt tgtgcttcag ccggaccaga tgcaggttct aaatcccctt 1081 ctggttctta tcttcatccc gttgtttgac tttgtcattt atcgtctggt ctccaagtgt 1141 ggaattaact tctcatcact taggaaaatg gctgttggta tgatcctagc atgcctggca 1201 tttgcagttg cggcacgtgt agagataaaa ataaatgaaa tggccccagc ccagccaggt 1261 ccccaggagg ttttcctaca agtcttgaat ctggcagatg atgaggtgaa ggtgacagtg 1321 gtgggaaatg aaaacaattc tctgttgata gagtccatca aatcctttca gaaaacacca 1381 cactattcca aactgcacct gaaaacaaaa agccaggatt ttcacttcca cctgaaatat 1441 cacaatttgt ctctctacac tgagcattct gtgcaggaga agaactggta cagtcttgtc 1501 attcgtgaag atgggaacag tatctccagc atgatggtaa aggatacaga aagcagaaca 1561 accaatggga tgacaaccgt gaggtttgtt aacactttgc ataaagatgt caacatctcc 1621 ctgagtacag atacctctct caatgttggt gaagactatg gtgtgtctgc ttatagaact 1681 gtgcaaagag gagaataccc tgcagtgcac tgtagaacag aagataagaa cttttctctg 1741 aatttgggtc ttctagactt tggtgcagca tatctgtttg ttattactaa taacaccaat 1801 cagggtcttc aggcctggaa gattgaagac attccagcca acaaaatgtc cattcggtgg 1861 cagctaccac aatatgccct ggttacagct ggggaggtca tgttctctgt cacaggtctt 1921 gagttttctt attctcaggc tccctctagc atgaaatctg tgctccaggc agcttggcta 1981 ttgacaattg cagttgggaa tatcatcgtg cttgttgtgg cacagttcag tggcctggta 2041 cagtgggccg aattcatttt gttttcctgc ctcctgctgg tgatctgcct gatcttctcc 2101 atcatgggct actactatgt tcctgtaaag acagaggata tgcggggtcc agcagataag 2161 cacattcctc acatccaggg gaacatgatc aaactagaga ccaagaagac aaaactctga 2221 tgactcccta gattctgtcc taaccccaat tccctggccc tgtcttgaag catttttttt 2281 cttctactgg attagacaag agagatagca gcatatcaga gctgatctcc tccacctttc 2341 tccaatgaca gaagttccag gactggtttt ccagtacatc tttaaacaag gccccagaga 2401 ctctatgtct gcccgtccat cagtgaactc attaaaactt gtgcagtgtt gctggagctg 2461 gcctggtgtc tccaaatgac catgaaaata cacacgtata atggagatca ttctctgtgg 2521 gtatgcaaag ttatgggaat tcctttatag gtaactgcca tttaggactg atggccctaa 2581 tttttgaggt gctgatttag aggcaaaatt gcagaataac aaagaaatgg tatttcaagt 2641 tttttttttt ataagcaatg taattatgct attcacaggg gcccg // LOCUS S78271 5190 bp mRNA PRI 26-SEP-1995 DEFINITION SB1.8/DXS423E=mitosis-specific chromosome segregation protein SMC1 homolog [human, HT1080 and M426 fibroblast cell lines, mRNA, 5190 nt]. ACCESSION S78271 NID g999379 KEYWORDS . SOURCE human HT1080 and M426 fibroblast cell lines. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5190) AUTHORS Rocques,P.J., Clark,J., Ball,S., Crew,J., Gill,S., Christodoulou,Z., Borts,R.H., Louis,E.J., Davies,K.E. and Cooper,C.S. TITLE The human SB1.8 gene (DXS423E) encodes a putative chromosome segregation protein conserved in lower eukaryotes and prokaryotes JOURNAL Hum. Mol. Genet. 4 (2), 243-249 (1995) MEDLINE 95276737 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 167455] from the original journal article. This sequence comes from Fig. 3. Map location: Xp11.2. FEATURES Location/Qualifiers source 1..5190 /organism="Homo sapiens" /db_xref="taxon:9606" gene 34..3735 /note="mitosis-specific chromosome segregation protein SMC1 homolog" /gene="SB1.8/DXS423E" CDS 34..3735 /gene="SB1.8/DXS423E" /note="mitosis-specific chromosome segregation protein SMC1 homolog; This sequence comes from Fig. 3" /codon_start=1 /db_xref="PID:g999380" /translation="MGFLKLIEIENFKSYKGRQIIGPFQRFTAIIGPNGSGKSNLMDA ISFVLGEKTSNLRVKTLRDLIHGAPVGKPAANRAFVSMVYSEEGAEDRTFARVIVGGS SEYKINNKVVQLHEYSEELEKLGILIKARNFLVFQGAVESIAMKNPKERTALFEEISR SGDVAQEYDKRKKEMVKAEEDTQFNYHRKKNIAAERKEAKQEKEEADRYQRLKDEVVR AQVQLQLFKLYHNEVEIEKLNKELASKNKEIEKDKKRMDKVEDELKEKKKELGKMMRE QQQIEKEIKEKDSELNQKRPQYIKAKENTSHKIKKLEAAKKSLQNAQKHYKKRKGDMD ELEKEMLSVEKARQEFEERMEEESQSQGRDLTLEENQVKKYHRLKEEASKRAATLAQE LEKFNRDQKADQDRLDLEERKKVETEAKIKQKLREIEENQKRIEKLEEYITTSKQSLE EQKKLEGELTEEVEMAKRRIDEINKELNQVMEQLGDARIDRQESSRQQRKAEIMESIK RLYPGSVYGRLIDLCQPTQKKYQIAVTKVLGKNMDAIIVDSEKTGRDCIQYIKEQRGE PETFLPLDYLEVKPTDEKLRELKGAKLVIDVIRYEPPHIKKALQYACGNALVCDNVED ARRIAFGGHQRHKTVALDGTLFQKSGVISGGASDLKAKARRWDEKAVDKLKEKKERLT EELKEQMKAKRKEAELRQVQSQAHGLQMRLKYSQSDLEQTKTRHLALNLQEKSKLESE LANFGPRINDIKRIIQSREREMKDLKEKMNQVEDEVFEEFCREIGVRNIREFEEEKVK RQNEIAKKRLEFENQKTRLGIQLDFEKNQLKEDQDKVHMWEQTVKKDENEIEKLKKEE QRHMKIIDETMAQLQDLKNQHLAKKSEVNDKNHEMEEIRKKLGGANKEMTHLQKEVTA IETKLEQKRSDRHNLLQACKMQDIKLPLSKGTMDDISQEEGSSQGEDSVSGSQRISSI YAREALIEIDYGDLCEDLKDAQAEEEIKQEMNTLQQKLNEQQSVLQRIAAPNMKAMEK LESVRDKFQETSDEFEAARKRAKKAKQAFEQIKKERFDRFNACFESVATNIDEIYKAL SRNSSAQAFLGPENPEEPYLDGINYNCVAPGKRFRPMDNLSGGEKTVAALALLFAIHS YKPAPFFVLDEIDAALDNTNIGKVANYIKEQSTCNFQAIVISLKEEFYTKAESLIGVY PEQGDCVISKVLTFDLTKYPDANPNPNEQ" BASE COUNT 1522 a 1134 c 1430 g 1087 t 17 others ORIGIN 1 gcctgtccta ctgccgccgg cgccgcggcc gtcatggggt tcctgaaact gattgagatt 61 gagaacttta agtcgtacaa gggtcgacag attatcggac catttcagag gttcaccgcc 121 atcattggac ccaatggctc tggtaagtca aatctcatgg atgccatcag ctttgtgcta 181 ggtgaaaaaa ccagcaacct gcgggtaaag accctgcggg acctgatcca tggagctcct 241 gtgggcaagc cagctgccaa ccgggccttt gtcagcatgg tctactctga ggagggtgct 301 gaggaccgta cctttgcccg tgtcattgta ggaggttctt ctgagtacaa gatcaacaac 361 aaagtggtcc aactacatga gtacagtgag gaattagaga agttgggcat tctcatcaaa 421 gctcgtaact tcctcgtttt ccagggtgct gtggaatcta ttgccatgaa gaaccccaaa 481 gagaggacag ctctatttga agagattagt cgttctgggg acgtggcgca ggagtatgac 541 aagcgaaaga aggaaatggt gaaggctgaa gaggacacac agtttaatta ccatcgcaag 601 aaaaatattg cggctgaacg caaggaagca aagcaggaga aagaagaggc tgaccggtac 661 cagcgcctga aggatgaggt agtacgggct caggtacagc tgcagctctt taagctttac 721 cataatgaag tggaaattga gaagctcaac aaggaactgg cctcaaagaa caaggagatc 781 gagaaggaca agaagcgtat ggacaaggtg gaggatgaac tgaaggagaa gaagaaggag 841 ctgggcaaaa tgatgcggga gcagcagcag attgagaagg agatcaagga gaaggactca 901 gaattgaacc agaagcggcc tcagtacatc aaagccaagg agaacacctc ccacaaaatc 961 aagaagctgg aagcagccaa gaagtctctg cagaatgctc agaagcacta caagaagcgt 1021 aaaggtgaca tggatgagct ggagaaggag atgctgtcag tggagaaggc tcggcaggag 1081 tttgaagaac ggatggaaga agagagtcag agtcagggca gagatttgac gttggaggag 1141 aatcaggtga agaaatacca ccggttgaaa gaagaagcca gcaagagagc agctaccctg 1201 gcccaggagc tggagaaatt caatcgagac cagaaagctg accaggaccg tctggatctg 1261 gaagaacgga agaaagtaga gacagaggcc aagatcaagc aaaagctgcg ggaaattgaa 1321 gagaatcaga agcggattga gaaactggag gaatacatca ccactagcaa gcagtcccta 1381 gaagagcaga agaagctaga gggggagctg acagaggagg tggagatggc caagcggcgt 1441 attgatgaaa tcaataagga gctgaaccag gtgatggagc agctagggga tgcccgcatc 1501 gaccgccagg agagcagccg ccagcagcga aaggcagaga taatggaaag catcaagcgc 1561 ctttaccctg gctctgtgta cggccgcctc attgacctat gccagcccac acaaaagaag 1621 tatcagattg ctgtaaccaa ggttttgggc aagaacatgg atgccattat tgtggactcg 1681 gagaagacag gccgggactg tattcagtat atcaaggagc agcgtgggga gcctgagacc 1741 ttcttgcctc ttgactacct ggaggtgaag cctacagatg agaaactccg ggagctgaag 1801 ggggccaagc tagtgattga tgtgattcgc tatgagccac ctcatatcaa aaaggccctg 1861 cagtatgctt gtggcaatgc ccttgtctgt gacaacgtgg aagatgcccg ccgcattgcc 1921 tttggaggcc accagcgcca caagacagtg gcactggatg gaaccctatt ccagaagtca 1981 ggagtgatct ctggtggggc cagtgacctg aaggccaagg cacggcgctg ggatgagaaa 2041 gcagtagaca agttgaaaga gaagaaggag cgcttgacag aggagctgaa agagcagatg 2101 aaggcaaaac ggaaagaggc agagctgcgt caggtgcagt ctcaggccca tggactgcag 2161 atgcggctca agtactccca gagtgaccta gaacagacca agacacgaca tctagccctg 2221 aatctgcagg aaaaatccaa gctggagagt gagctagcca actttgggcc tcgcattaat 2281 gatatcaaga ggatcattca gagccgagag agggaaatga aagacttgaa ggagaagatg 2341 aaccaggtag aggatgaggt gtttgaagag ttttgtcggg agattggtgt gcgcaacatc 2401 cgggagtttg aggaagaaaa ggtgaaacgg cagaatgaaa tcgccaagaa gcgtttggag 2461 tttgagaatc agaagactcg cttgggcatt cagttggatt ttgaaaagaa ccaactgaag 2521 gaggaccaag ataaagtaca catgtgggag cagacagtga aaaaagatga aaatgagata 2581 gaaaagctca aaaaggagga acaaagacac atgaagatca tagatgagac catggctcag 2641 ctacaagacc tgaagaatca gcatctggcc aagaagtcgg aagtgaatga caagaatcat 2701 gagatggagg agattcgtaa gaaactcggg ggcgccaaca aggaaatgac ccatttacag 2761 aaggaggtga cagccattga gaccaagctt gaacagaagc gcagtgaccg tcacaacttg 2821 ctacaggcct gtaagatgca ggacattaag ttgccactgt caaaaggcac catggatgat 2881 attagtcagg aagagggtag ctcccagggg gaggactcag tgagtggttc acagagaatt 2941 tccagtatct atgcacgaga ggccctcatt gagattgact acggtgatct gtgtgaggat 3001 ctgaaggatg cccaggctga ggaagagatc aagcaagaga tgaacacact gcagcagaag 3061 ctgaatgagc agcagagtgt gcttcagcgt attgccgccc ccaacatgaa ggccatggaa 3121 aagctggaaa gtgtccgaga caagttccag gagacctcag atgagtttga agcagcccga 3181 aagcgagcaa agaaggccaa gcaggcattc gaacagatca agaaggagcg ctttgaccgc 3241 ttcaatgctt gttttgaatc tgtggctacc aacattgatg agatctataa ggccctgtcc 3301 cgcaatagca gtgcccaggc attcctgggc cctgagaacc ctgaagagcc ctacttggat 3361 ggcatcaact acaactgtgt ggctcctggg aaacgcttcc ggcctatgga caacttgtca 3421 ggcggggaga agacagtggc agctctggcc ctgctctttg ccatccacag ctacaagcca 3481 gcccccttct tcgtcctgga tgagattgat gctgccttgg ataacaccaa cattggcaag 3541 gtggcaaatt acatcaagga gcagtcgact tgcaacttcc aggccatcgt catctctctc 3601 aaggaggagt tctacaccaa ggccgagagc ctcattggag tctatcctga gcaaggggac 3661 tgtgtgatca gcaaagtcct gaccttcgac ctcaccaagt acccagatgc caaccccaac 3721 cccaatgagc agtagcagta tttttgccct cccgccctgt ctggatccct aagctgtccc 3781 tctcccaatc tctggatatt tgactcccaa ccttccccct acctcctggc cctttttggt 3841 gtagtcatgg gatttaggca ctgctaatca agcatgaaga ggaacagagg tgatgttagg 3901 tctggagcaa aaattcctga acgacaggga gtattctggc ctctgaaagg aggtgctgag 3961 ctgaacaggg ccatctgtnc atcacacaca cccnnttctc cctcatcacc cataatcgtg 4021 gnccccttgg ctcttgccca ctgtgtgtgt gggtatgtat gtgtgtatgt atgtatccgc 4081 atgtgtgcat gtgagtatgt ttgcaaaata ataaaggata ttggagacct gttttagaag 4141 gagcctaggc tgaatttgat tccaagagag cttaggatga cagcacccct gagctgggca 4201 aaggtactca ggacctcata ggagtcttag gcagttacct gaaactgcct tcattcactc 4261 atttgtgtat tcattcattt atgtattcat cagacacata ccgaacaccc tctatttgtc 4321 aggctctgtg cttggaatac agagttgaat cagacatgat ctctaccctc ctagtaagga 4381 gatacagtgg gttcatgaat gactatagtt agctgaatgt catatgtacn nttnnngaat 4441 ttgagaagtg gntgatcccc tctaggcttc ctggaggtca catttaagct agaccttgac 4501 aaattggtag gatttggtca ggcactagga gtggagcatg agctctgggg acagacagtt 4561 atgggttctg gtcccacttt ttatcactta ctagttgttt gaccttgggc aagtcatttg 4621 accttctgtg cctcagtttc ctcatctgta aaatggggct aacaatatta cctacctcat 4681 aggatttaat gatgtcaagc tcctcactgn agnccttatn ccnttcgtgn agcccactag 4741 gtgccgaccc ctcagaatat aatcctcatg cctgacccct gagagcttct gatcccagct 4801 attaggacag aagaagcctc caaatctgga aggtgctgaa tgccctgctg actgggaaag 4861 tttcagggca ctgatggggt ctacctggta agcggagggc ctgaggaaac ctgtagcttc 4921 aatcatgtct ggtaaccggg tgcctgagcc ccaatctggg ttgtgaggaa ataggggaga 4981 ggtatcctgg gccacatccc agcctaacac ctgtgaggtt cattttagga actaacctca 5041 ttagctataa ggatcatgca gaggcagcaa agccgggtgc gatgagctca gcctttactc 5101 attcacatac accatcacac tttaattcca atctgtatat tgctttttaa aagttaagtc 5161 cattctaatn ncccaaatat gcatgaattc // LOCUS S78653 2416 bp DNA PRI 10-JUL-1992 DEFINITION mrg=mas-related [human, Genomic, 2416 nt]. ACCESSION S78653 NID g244209 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2416) AUTHORS Monnot,C., Weber,V., Stinnakre,J., Bihoreau,C., Teutsch,B., Corvol,P. and Clauser,E. TITLE Cloning and functional characterization of a novel mas-related gene, modulating intracellular angiotensin II actions JOURNAL Mol. Endocrinol. 5 (10), 1477-1487 (1991) MEDLINE 92130997 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 78653] from the original journal article. This sequence comes from Fig. 1. FEATURES Location/Qualifiers source 1..2416 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..2416 /note="mas-related" /gene="mrg" CDS 997..2133 /gene="mrg" /note="mas product homolog modulating intracellular angiotensin II actions; This sequence comes from Fig. 1" /codon_start=1 /db_xref="PID:g244210" /translation="MVWGKICWFSQRAGWTVFAESQISLSCSLCLHSGDQEAQNPNLV SQLCGVFLQNETNETIHMQMSMAVGQQALPLNIIAPKAVLVSLCGVLLNGTVFWLLCC GATNPYMVYILHLVAADVIYLCCSAVGFLQVTLLTYHGVVFFIPDFLAILSPFSFEVC LCLLVAISTERCVCVLFPIWYRCHRPKYTSNVVCTLIWGLPFCINIVKSLFLTYWKHV KACVIFLKLSGLFHAILSLVMCVSSLTLLIRFLCCSQQQKATRVYAVVQISAPMFLLW ALPLSVAPLITDFKMFVTTSYLISLFLIINSSANPIIYFFVGSLRKKRLKESLRVILQ RALADKPEVGRNKKAAGIDPMEQPHSTQHVENLLPREHRVDVET" BASE COUNT 600 a 605 c 583 g 628 t ORIGIN 1 ttttgtattt gttgcaccct aagtctgttc atttccttct cctcagctga catttggagc 61 atagcagtcg atgatgccca cacagacact gcctgagact cacgcccctg gagaaacgca 121 gatttcctta ttttccaggt caagtcctgc cagccataga aaggacttct ttggtgccaa 181 ctgctgtgaa atgcctgcct tggaaatctc agtgctccct tgtacctgtc tgagcccagg 241 gaaatgccat actgtggcac tgctgcatcc tgtatggcta cccaaggatg cccaggactg 301 gtttgaaaga gatgagacat ggccaggtgc gtggctcacg cttgtaatcc agcactttgg 361 gaggtcaagg cagtggatca caaggtcaga gttgagacca gccaggccaa tatggtgaaa 421 accccatctc tactaaaaat acaaaaaatt agccgggcaa tggtggtggg tgcctgtagt 481 tccagctagt caggaggccg aggcaggaga atcgcttgaa cctggaaggt ggaggttcca 541 gtgagctgag atcgcgccac tgcactccag cctgggtgac agagtgagac tccaactcaa 601 aaaaaaaaaa aaaaaagaga tgagacacta gtgtctcatg agtagaacct ggaccagaca 661 caaatctcca ttcccaatgt ttagtgcctc attagtgccc aacaacaaga tattgggtct 721 atgtgggtag gcctggggca tcctgtacaa caggagatgt gttaggggag ggagaacaga 781 tcacaaattc atggagagct atttgcagag cagatactcc catccactct gatatgtagt 841 taatgttcag ctgttcctaa aaagcacacc caacaatggg tgttctattc cagcctagga 901 aaatgtagag gcaaggggtc tgaggccaga ggacaccact agatggacca ctgctcctga 961 ctgtgatgtt gtggcccact caggtcccag caccccatgg tctgggggaa aatttgctgg 1021 ttcagccaga gggctggatg gacagtgttt gctgagtcac agatatctct ctcatgtagc 1081 ctttgtctcc acagtggtga ccaggaggca cagaacccaa acctggtatc tcagctctgt 1141 ggcgtctttc ttcaaaatga gacgaatgaa accatacata tgcagatgag catggcagtg 1201 ggacagcagg ccctgccctt gaatatcatt gcccccaagg ctgtgctggt ctccctctgt 1261 ggggtcttat tgaatggcac tgtcttctgg ctgctttgct gtggggccac gaatccctac 1321 atggtataca tcctccacct ggtcgctgct gacgtgatct atctttgctg ctcggcagtg 1381 gggttcttac aggtgactct gctaacttat catggagtcg tgttttttat ccctgatttc 1441 ctggccatat tgtctccctt ctcctttgag gtgtgtctct gtctcctggt ggccatcagc 1501 acagagcggt gtgtgtgtgt cctcttcccc atctggtaca gatgccaccg cccaaaatac 1561 acatctaatg ttgtctgcac cctcatctgg ggcctgcctt tttgcatcaa catagtaaaa 1621 tcacttttcc taacttactg gaaacatgta aaggcatgtg tcatatttct aaagctttct 1681 gggctcttcc atgctatcct ttcacttgtg atgtgtgtgt cgagtctgac tctactcatt 1741 agattcctgt gctgctccca gcagcaaaag gccaccaggg tctatgcggt ggtgcagatc 1801 tcggccccca tgttcctact ctgggcccta cccctgagcg tggcacccct cataacagat 1861 ttcaaaatgt ttgtcaccac ctcctattta atttccttgt tcctcattat aaacagcagc 1921 gccaacccta tcatttattt ctttgtgggg agcctcagaa agaaaaggct gaaggaatct 1981 ctcagagtga ttctccaacg ggcgttagca gataagccag aggtggggag gaacaaaaag 2041 gcagctggca tcgacccaat ggagcaacca cactctactc agcatgtgga gaaccttctt 2101 cccagggagc acagggtcga tgtggaaaca taatttccca catctgagct ggggaattgt 2161 acacatagta acccagcctg ttctgcatca taaggctgct gcatcaaatc aatgctttat 2221 tctaatcaag ttcagctttc atggactttc aaaacaaccc cttgctgttt gtggttggaa 2281 gagacattaa cttccttcct aggcagtaag cccagtttga atgtgctcca gttccaacga 2341 tgaggggaat gggacccagt gagactttcc tggtacctgt ggaatccaaa taaagaccat 2401 acaaaggcat gaattc // LOCUS S79048 507 bp mRNA PRI 03-NOV-1995 DEFINITION LPRP=pHL E1F1 [human, lacrimal gland, mRNA Partial, 507 nt]. ACCESSION S79048 NID g1050982 KEYWORDS . SOURCE human lacrimal gland. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 507) AUTHORS Dickinson,D.P. and Thiesse,M. TITLE A major human lacrimal gland mRNA encodes a new proline-rich protein family member JOURNAL Invest. Ophthalmol. Vis. Sci. 36 (10), 2020-2031 (1995) MEDLINE 95386401 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 169653] from the original journal article. This sequence comes from Fig. 3. FEATURES Location/Qualifiers source 1..507 /organism="Homo sapiens" /db_xref="taxon:9606" gene 17..421 /note="pHL E1F1" /gene="LPRP" CDS 17..421 /gene="LPRP" /note="secretory proline-rich protein; This sequence comes from Fig. 3" /codon_start=1 /product="pHL E1F1" /db_xref="PID:g1050983" /translation="MLLVLLSVVLLALSSAQSTDNDVNYEDFTFTIPDVEDSSQRPDQ GPQRPPPEGLLPRPPGDSGNQDDGPQQRPPKPGGHHRHPPPPPFQNQQRPPQRGHRQL SLPRFPSVSLQEASSFFRRDRPARHPQEQPLW" BASE COUNT 147 a 152 c 100 g 108 t ORIGIN 1 cagagcctcc ttcaagatgc tgctggtcct gctctcagtg gtccttctgg ctctgagctc 61 agctcagagc acagataatg atgtgaacta tgaagacttt actttcacca taccagatgt 121 agaggactca agtcagagac cagatcaggg accccagaga cctcctcctg aaggactcct 181 acctagaccc cctggtgata gtggtaacca agatgatggt cctcagcaga gaccaccaaa 241 accaggaggc catcaccgcc atcctccccc acctcctttt caaaatcagc aacgaccacc 301 ccaacgagga caccgtcaac tctctctacc ccgatttcct tctgtcagcc tgcaggaagc 361 atcatcattc ttccggaggg acagaccagc aagacatccc caggagcaac cactctggta 421 atctagaatt cagtggcaga aaataaataa gaagataact tccttcagaa agccatgaca 481 ttgaaataat gtggtcataa ctctttc // LOCUS S79281 491 bp mRNA PRI 29-NOV-1995 DEFINITION pancreatic ribonuclease [human, mRNA Recombinant Partial, 491 nt]. ACCESSION S79281 NID g1087118 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 491) AUTHORS Russo,N., de Nigris,M., Ciardiello,A., Di Donato,A. and D'Alessio,G. TITLE Expression in mammalian cells, purification and characterization of recombinant human pancreatic ribonuclease JOURNAL FEBS Lett. 369 (2-3), 352 (1995) MEDLINE 95377432 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 170127] from the original journal article. This sequence comes from Fig. 2. FEATURES Location/Qualifiers source 1..491 /organism="Homo sapiens" /db_xref="taxon:9606" gene 20..481 /gene="pancreatic ribonuclease, HP-RNase" CDS 20..481 /gene="pancreatic ribonuclease, HP-RNase" /note="This sequence comes from Fig. 2; HP-RNase" /codon_start=1 /product="pancreatic ribonuclease" /db_xref="PID:g1087119" /translation="MALKSLVVLPLLVLVLLLVRVQPSLGKESRAKKFQRQHMDSDSS PSSSSTYCNQMMRRRNMTQGRCKPVNTFVHEPLVDVQNVCFQEKVTCKNGQGNCYKSN SSMHITDCRLTNGSRYPNCAYRTSPKERHIIVACEGSPYVPVHFDASVEDS" BASE COUNT 110 a 135 c 113 g 133 t ORIGIN 1 cggggatcct gaggtcatca tggctctgaa gtctctagtc gtgttgccac tgctggtcct 61 ggtgctgctg ctggtgcggg tccagccttc cctgggcaaa gaatctagag ctaaaaaatt 121 ccagcgtcaa catatggact ctgactcgag cccgtcttct tcttctacgt actgcaacca 181 gatgatgcgt cgtcgtaaca tgacccaagg tcgttgcaaa ccggtgaaca ctttcgttca 241 tgaaccgctt gtagacgttc agaacgtttg cttccaagag aaggttacct gcaaaaatgg 301 ccagggtaac tgctacaaat ctaactcttc tatgcatatc actgactgcc gtcttactaa 361 cggatcccgt taccccaact gcgcttaccg tacttctcct aaggaacgtc atatcatcgt 421 tgcatgcgaa ggctctccgt acgttccggt tcatttcgac gcgtctgttg aagactcttg 481 aagtcgacct g // LOCUS S79311 743 bp mRNA PRI 10-JUL-1992 DEFINITION Ig kappa =immunoglobulin light chain [rats, humanized lympholytic MoAb CAMPATH-1H, mRNA, 743 nt]. ACCESSION S79311 NID g243867 KEYWORDS . SOURCE human humanized lympholytic MoAb CAMPATH-1H. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 743) AUTHORS Crowe,J.S., Hall,V.S., Smith,M.A., Cooper,H.J. and Tite,J.P. TITLE Humanized monoclonal antibody CAMPATH-1H: myeloma cell expression of genomic constructs, nucleotide sequence of cDNA constructs and comparison of effector mechanisms of myeloma and Chinese hamster ovary cell-derived material JOURNAL Clin. Exp. Immunol. 87 (1), 105-110 (1992) MEDLINE 92127884 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 79311] from the original journal article. This sequence comes from Figure 2.b. FEATURES Location/Qualifiers source 1..743 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..743 /note="immunoglobulin light chain" /gene="Ig&kgr;" CDS 36..737 /gene="Ig" /note="This sequence comes from Figure 2.b." /codon_start=1 /product="immunoglobulin light chain" /db_xref="PID:g243868" /translation="MGWSCIILFLVATATGVHSDIQMTQSPSSLSASVGDRVTITCKA SQNIDKYLNWYQQKPGKAPKLLIYNTNNLQTGVPSRFSGSGSGTDFTFTISSLQPEDI ATYYCLQHISRPRTFGQGTKVEIKRTVAAPSVFIFPPSDEQLKSGTASVVCLLNNFYP REAKVQWKVDNALQSGNSQESVTEQDSKDSTYSLSSTLTLSKADYEKHKVYACEVTHQ GLSSPVTKSFNRGEC" BASE COUNT 212 a 208 c 183 g 140 t ORIGIN 1 aagctttaca gttactgagc acacaggacc tcaccatggg atggagctgt atcatcctct 61 tcttggtagc aacagctaca ggtgtccact ccgacatcca gatgacccag agcccaagca 121 gcctgagcgc cagcgtgggt gacagagtga ccatcacctg taaagcaagt cagaatattg 181 acaaatactt aaactggtac cagcagaagc caggtaaggc tccaaagctg ctgatctaca 241 atacaaacaa tttgcaaacg ggtgtgccaa gcagattcag cggtagcggt agcggtaccg 301 acttcacctt caccatcagc agcctccagc cagaggacat cgccacctac tactgcttgc 361 agcatataag taggccgcgc acgttcggcc aagggaccaa ggtggaaatc aaacgtactg 421 tggctgcacc atctgtcttc atcttcccgc catctgatga gcagttgaaa tctggaactg 481 cctctgttgt gtgcctgctg aataacttct atcccagaga ggccaaagta cagtggaagg 541 tggataacgc cctccaatcg ggtaactccc aggagagtgt cacagagcag gacagcaagg 601 acagcaccta cagcctcagc agcaccctga cgctgagcaa agcagactac gagaaacaca 661 aagtctacgc ctgcgaagtc acccatcagg gcctgagctc gcccgtcaca aagagcttca 721 acaggggaga gtgttagaag ctt // LOCUS S79639 3183 bp mRNA PRI 27-JAN-1996 DEFINITION EXT1=putative tumour suppressor/hereditary multiple exostoses candidate gene [human, placenta, mRNA, 3183 nt]. ACCESSION S79639 NID g1168161 KEYWORDS . SOURCE human placenta. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3183) AUTHORS Ahn,J., Ludecke,H.J., Lindow,S., Horton,W.A., Lee,B., Wagner,M.J., Horsthemke,B. and Wells,D.E. TITLE Cloning of the putative tumour suppressor gene for hereditary multiple exostoses (EXT1) JOURNAL Nature Genet. 11 (2), 137-143 (1995) MEDLINE 96024648 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 171344] from the original journal article. This sequence comes from Fig. 2. Map location: 8q24.1. FEATURES Location/Qualifiers source 1..3183 /organism="Homo sapiens" /db_xref="taxon:9606" gene 652..2892 /note="putative tumour suppressor/hereditary multiple exostoses candidate gene" /gene="EXT1" CDS 652..2892 /gene="EXT1" /note="putative tumour suppressor/hereditary multiple exostoses candidate gene; This sequence comes from Fig. 2. Map location 8q24.1" /codon_start=1 /db_xref="PID:g1168162" /translation="MQAKKRYFILLSAGSCLALLFYFGGLQFRASRSHSRREEHSGRN GLHHPSPDHFWPRFPEPLRPFVPWDQLENEDSSVHISPRQKRDANSSIYKGKKCRMES CFDFTLCKKNGFKVYVYPQQKGEKIAESYQNILAAIEGSRFYTSDPSQACLFVLSLDT LDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFDIGQAMLAK ASISTENFRPNFDVSIPLFSKDHPRTGGERGFLKFNTIPPLRKYMLVFKGKRYLTGIG SDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFC LVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPST IRSIHQDKILALRQQTQFLWEAYFSSVEKIVLTTLEIIQDRIFKHISRNSLIWNKHPG GLFVLPQYSSYLGDFPYYYANLGLKPPSKFTAVIHAVTPLVSQSQPVLKLLVAAAKSQ YCAQIIVLWNCDKPLPAKHRWPATAVPVVVIEGESKVMSSRFLPYDNIITDAVLSLDE DTVLSTTEVDFAFTVWQSFPERIVGYPARSHFWDNSKERWGYTSKWTNDYSMVLTGAA IYHKYYHYLYSHYLPASLKNMVDQLANCEDILMNFLVSAVTKLPPIKVTQKKQYKETM MGQTSRASRWADPDHFAQRQSCMNTFASWFGYMPLIHSQMRLDPVLFKDQVSILRKKY RDIERL" BASE COUNT 815 a 805 c 803 g 760 t ORIGIN 1 cctccaggcc ccgccgcgcg tcccgggggc cggccccgcg agcgcaggag taaacaccgc 61 cggagtcttg gagccgctgc agaagggaat aaagagagat gcagggattt gtgaggttac 121 ggcgccccag ctgcaagatg cactagccgg ctgaacccgg gatcggctga cttgttggaa 181 ccggagtgct ctgcacggag agtggtggat gagttgaagt tgccttcccg gggctcattt 241 tccacgctgc cgagaggaat ccgagaggca aggcaatcac ttcgtcttgc cattgattgg 301 gtatcgggag cttttttttt ctcccctctc tctttctttt cctccgtctt gttgcatgca 361 agaaaattac agtccgctgc tcgcccgccc tgggtgcgag atattcagcc ccgctctctc 421 ccgtgcattg tgcaacccaa agatgaaaga ccgaagggga gaaagttaaa gaaatcgccc 481 acatgcgctg gatcagtcca cggcttgggg aaaggcatcc agagaaggtg ggagcggaga 541 gtttgaagtc tttacaggcg ggaagatggc ggactggagc tgaaagtgtt gattgggaaa 601 cttgggtgat tcttgtgttt atttacaatc ctcttgaccc aggcaggaca catgcaggcc 661 aaaaaacgct atttcatcct gctctcagct ggctcttgtc tcgccctttt gttttatttc 721 ggaggcttgc agtttagggc atcgaggagc cacagccgga gagaagaaca cagcggtagg 781 aatggcttgc accaccccag tccggatcat ttctggcccc gcttcccgga gcctctgcgc 841 cccttcgttc cttgggatca attggaaaac gaggattcca gcgtgcacat ttccccccgg 901 cagaagcgag atgccaactc cagcatctac aaaggcaaga agtgccgcat ggagtcctgc 961 ttcgatttca ccctttgcaa gaaaaacggc ttcaaagtct acgtataccc acagcaaaaa 1021 ggggagaaaa tcgccgaaag ttaccaaaac attctagcgg ccatcgaggg ctccaggttc 1081 tacacctcgg accccagcca ggcgtgcctc tttgtcctga gtctggatac tttagacaga 1141 gaccagttgt cacctcagta tgtgcacaat ttgagatcca aagtgcagag tctccacttg 1201 tggaacaatg gtaggaatca tttaattttt aatttatatt ccggcacttg gcctgactac 1261 accgaggacg tggggtttga catcggccag gcgatgctgg ccaaagccag catcagtact 1321 gaaaacttcc gacccaactt tgatgtttct attcccctct tttctaagga tcatcccagg 1381 acaggagggg agagggggtt tttgaagttc aacaccatcc ctcctctcag gaagtacatg 1441 ctggtattca aggggaagag gtacctgaca gggataggat cagacaccag gaatgcctta 1501 tatcacgtcc ataacgggga ggacgttgtg ctcctcacca cctgcaagca tggcaaagac 1561 tggcaaaagc acaaggattc tcgctgtgac agagacaaca ccgagtatga gaagtatgat 1621 tatcgggaaa tgctgcacaa tgccactttc tgtctggttc ctcgtggtcg caggcttggg 1681 tccttcagat tcctggaggc tttgcaggct gcctgcgtcc ctgtgatgct cagcaatgga 1741 tgggagttgc cattctctga agtgattaat tggaaccaag ctgccgtcat aggcgatgag 1801 agattgttat tacagattcc ttctacaatc aggtctattc atcaggataa aatcctagca 1861 cttagacagc agacacaatt cttgtgggag gcttattttt cttcagttga gaagattgta 1921 ttaactacac tagagattat tcaggacaga atattcaagc acatatcacg taacagttta 1981 atatggaaca aacatcctgg aggattgttc gtactaccac agtattcatc ttatctggga 2041 gattttcctt actactatgc taatttaggt ttaaagcccc cctccaaatt cactgcagtc 2101 atccatgcgg tgacccccct ggtctctcag tcccagccag tgttgaagct tctcgtggct 2161 gcagccaagt cccagtactg tgcccagatc atagttctat ggaattgtga caagccccta 2221 ccagccaaac accgctggcc tgccactgct gtgcctgtcg tcgtcattga aggagagagc 2281 aaggttatga gcagccgttt tctgccctac gacaacatca tcacagacgc cgtgctcagc 2341 cttgacgagg acacggtgct ttcaacaaca gaggtggatt tcgccttcac agtgtggcag 2401 agcttccctg agaggattgt ggggtacccc gcgcgcagcc acttctggga taactctaag 2461 gagcggtggg gatacacatc aaagtggacg aacgactact ccatggtgtt gacaggagct 2521 gctatttacc acaaatatta tcactaccta tactcccatt acctgccagc cagcctgaag 2581 aacatggtgg accaattggc caattgtgag gacattctca tgaacttcct ggtgtctgct 2641 gtgacaaaat tgcctccaat caaagtgacc cagaagaagc agtataagga gacaatgatg 2701 ggacagactt ctcgggcttc ccgttgggct gaccctgacc actttgccca gcgacagagc 2761 tgcatgaata cgtttgccag ctggtttggc tacatgccgc tgatccactc tcagatgagg 2821 ctcgaccccg tcctctttaa agaccaggtc tctattttga ggaagaaata ccgagacatt 2881 gagcgacttt gaggaatccg gctgagtggg ggaggggaag caagaaggga tgggggtcaa 2941 gctgctctct cttcccagtg cagatccact catcagcaga gccagattgt gccaactatc 3001 caaaaactta gatgagcaga atgacaaaaa aaaaaaaggc caatgagaac tcaactcctg 3061 gctcctggga ctgcaccaga ctgctccaaa ctcacctcac tggcttctgt gtcccaagac 3121 taggttggta cagtttaatt atggaacatt aaataattat ttttgaaaaa aaaaaaaaaa 3181 aaa // LOCUS S79862 2253 bp mRNA PRI 11-FEB-1996 DEFINITION 26 S protease subunit 5b=50 kda subunit [human, HeLa cells, mRNA Partial, 2253 nt]. ACCESSION S79862 NID g1184532 KEYWORDS . SOURCE human HeLa cells. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2253) AUTHORS Deveraux,Q., Jensen,C. and Rechsteiner,M. TITLE Molecular cloning and expression of a 26 S protease subunit enriched in dileucine repeats JOURNAL J. Biol. Chem. 270 (40), 23726-23729 (1995) MEDLINE 96007524 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 171973] from the original journal article. This sequence comes from Fig. 1. COMMENT Compare D31889. FEATURES Location/Qualifiers source 1..2253 /organism="Homo sapiens" /db_xref="taxon:9606" gene 4..1518 /gene="26 S protease subunit 5b" CDS 4..1518 /gene="26 S protease subunit 5b" /note="50 kda subunit; red blood cell type. Method: conceptual translation with partial peptide sequencing. This sequence comes from Fig. 1" /codon_start=1 /product="26 S protease subunit 5b" /db_xref="PID:g1184533" /translation="MAAQALALLREVARLEAPLEELRALHSVLQAVPLNELRQQAAEL RLGPLFSLLNENHREKTTLCVSILERLLQAMEPVHVARNLRVDLQRGLIHPDDSVKIL TLSQIGRIVENSDAVTEILNNAELLKQIVYCIGGENLSVAKAAIKSLSRISLTQAGLE ALFESNLLDDLKSVMKTNDIVRYRVYELIIEISSVSPESLNYCTTSGLVTQLLRELTG EDVLVRATCIEMVTSLAYTHHGRQYLAQEGVIDQISNIIVGADSDPFSSFYLPGFVKF FGNLAVMDSPQQICERYPIFVEKVFEMIESQDPTMIGVAVDTVGILGSNVEGKQVLQK TGTRFERLLMRIGHQSKNAPVELKIRCLDAISSLLYLPPEQQTDDLLRMTESWFSSLS RDPLELFRGISSQPFPELHCAALKVFTAIANQPWAQKLMFNSPGFVEYVVDRSVEHDK ASKDAKYELVKALANSKTIAEIFGNPNYLRLRTYLSEGPYYVKPVSTTAVEGAE" BASE COUNT 634 a 487 c 506 g 626 t ORIGIN 1 aagatggcag cccaggcttt ggcgctgctg agagaggtag cgaggctgga agcgccgctg 61 gaggagctac gcgcgcttca ctccgtgctg caggcagtgc cgctcaacga gcttcgccag 121 caagcggcgg agctgcgcct cggcccgctc ttctccctgc ttaacgagaa ccatagggaa 181 aagactactt tgtgtgtatc cattctggag agattgctcc aagctatgga accggttcac 241 gtggcccgga acctcagggt tgacctgcag aggggactaa ttcaccctga tgattctgta 301 aaaatcctca ctctttccca gattggaaga attgttgaaa attctgatgc tgttactgag 361 attctaaata atgctgaatt actaaaacaa attgtttatt gcattggtgg agagaatcta 421 tctgtagcaa aagcggctat caaatccctg tcaagaatat cactaaccca agctggactg 481 gaggctttat ttgaaagcaa tctgctggat gatttgaaaa gtgtaatgaa aacaaatgac 541 attgttcgat acagggtgta tgagctaatt atagagattt cttccgtgtc accagaatct 601 ttaaactact gtaccacaag tggattggta acccagctcc tgagagagct gactggtgag 661 gatgtgttgg tcagagccac ctgtatagaa atggtgacat cactggcata tactcatcat 721 gggcgacaat atcttgctca agaaggagta attgaccaaa tttctaatat aattgttggg 781 gcagattcag accctttctc tagcttctat ctgccaggat tcgtgaagtt ttttggaaac 841 ctggctgtca tggatagtcc tcaacagatc tgtgagcgtt atcctatctt tgtggaaaaa 901 gtctttgaaa tgatagaaag tcaggacccc actatgattg gtgtagctgt agacacagtt 961 ggaatcttgg gatccaatgt tgaaggaaaa caggttttac agaaaacagg aactcgcttt 1021 gaacgcttgc ttatgagaat aggacatcaa tcaaagaatg ccccagtgga gctaaaaatt 1081 agatgtttgg atgcaatttc atctcttctg tacttaccac ctgagcagca gactgatgac 1141 cttctgagga tgacagaatc ctggttttct tctttatctc gggacccact ggagctcttc 1201 cgtggcatta gtagtcagcc cttccctgaa ctacactgtg ctgccttaaa agtgtttacg 1261 gccattgcaa accaaccctg ggctcagaaa cttatgttta acagtccagg ttttgtagaa 1321 tatgtggtgg accggtctgt ggagcatgac aaagcttcaa aggatgccaa atatgaacta 1381 gtgaaagcac ttgccaattc caagacaatt gcagaaatct ttgggaaccc aaattatttg 1441 aggctcagaa cttacctgag tgaagggcca tactatgtga aacctgtttc cacgacagca 1501 gtagaaggag ccgaatgatt tcttctagag ctcatgtaga ggaccacgtt ttgaccaaaa 1561 cttctcctaa ggcatttgac tccatctata tttcacaaaa gagacttcct ttccccaaga 1621 attatcatgg aatgtcagat gttactttgt taccaacact gttatatttc tacattgaaa 1681 tgcaaagtgg aactaggagt ttggaatgca ttaagagcag acaagcttgg tcataataga 1741 tccagtgttt ttcagattcc tttcactgcc ttaatctttg caacagggtg gaagtttttt 1801 tcttccctca aaattttcat ggacatgcaa tcttatctaa aagcctgctt cagggctggg 1861 cgcggtggct aacacctata attcccagca ctttgggagg ccaagatggg caaatcactt 1921 gagtctagga gttcaagacc agcctggcca atatggcaaa accctgtctc tactaaaaat 1981 ataaaatcag ccaggcatgg tggcgcacac ctataatctc agctactcag gaggctgagg 2041 cacgagaatc gcttgagcct gggaggcaga ggttgcagtg agctgagatc ttgccactgc 2101 acaccagcct gggcaacaca gcaagactct gtctcaaaaa taaatgaata aaataaaaac 2161 ctgccaccaa attatttctg gattctcttc actatctttt tttttctttt tggtcctttt 2221 tccatttctg attccacttt caaagtcttg gat // LOCUS S80343 2120 bp mRNA PRI 07-MAR-1996 DEFINITION ArgRS=arginyl-tRNA synthetase [human, ataxia-telangiectasia patients, EBV-lymphoblastoid cells, mRNA, 2120 nt]. ACCESSION S80343 NID g1217667 KEYWORDS . SOURCE human EBV-lymphoblastoid cells ataxia-telangiectasia patients. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2120) AUTHORS Girjes,A.A., Hobson,K., Chen,P. and Lavin,M.F. TITLE Cloning and characterization of cDNA encoding a human arginyl-tRNA synthetase JOURNAL Gene 164 (2), 347-350 (1995) MEDLINE 96069607 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 173837] from the original journal article. This sequence comes from Fig. 2. FEATURES Location/Qualifiers source 1..2120 /organism="Homo sapiens" /db_xref="taxon:9606" gene 28..2007 /note="arginyl-tRNA synthetase, ArgRS" /gene="ArgRS" CDS 28..2007 /gene="ArgRS" /note="This sequence comes from Fig. 2." /codon_start=1 /product="arginyl-tRNA synthetase" /db_xref="PID:g1217668" /translation="MDVLVSECSARLLQQEEEIKSLTAEIDRLKNCGCLGASSNLEQL QEENLKLKYRLNILRKSLQAERNKPTKNMINIISRLQEVFGHAIKAAYPDLENPPLLV TPSQQAKFGDYQCNSAMGISQMLKTKEPEVNPGEFAENITKHLPAMDVLKRVEFAGPG FINGHLRKDFVSEQLTSLLVNGVQLPALGENKKVIVDFSSPNIAKEMHVGHLRSTIIG ESISRLFEFAGYDVLRLNHVGDWGTQFGMLIAHLQDKFPDYLTVSPPIGDLQVFYKES KKRFDTEEEFKKRAYQCVVLLQGKNPDITKAYLLMSDVSRQELNKIYDALDVSLIERG ESFYQDRMNDIVKEFEDRGFVQVDDGRKIVFVPGCSIPLTIVKSDGGYTYDTSDLAAI KQRLFEEKADMIIYVVDNGQSVHFQTIFAAAQMIGWYDPKVTRVFHAGFGVVLGEDKK KFKTRSGETVRLMDLLGEGLKRSMDKLKEKERDKVLTAEELNAAQTSVAYGCIKYADL SHNRLNDYIFSFDKMLDDRGNTAAYLLYAFTRIRSIARLANIDEEMLQKAARETKILL VHEKEWKLGRCILRFPEILQKILDDLFLHTLCDYIYELATAFTEFYDSCYCVEKDRQT GKILKVNMWRILCETVAAVMAKGFDTLGIKPGPRV" BASE COUNT 635 a 390 c 506 g 589 t ORIGIN 1 ggatccgcgg ccgccgctga tgggaggatg gacgtactgg tgtctgagtg ctccgcgcgg 61 ctgctgcagc aggaagaaga gattaaatct ctgactgctg aaattgaccg gttgaaaaac 121 tgtggctgtt taggagcttc ttcaaatttg gagcagttac aagaagaaaa tttaaaatta 181 aagtatcgac tgaatattct tcgaaagagt cttcaggcag aaaggaacaa accaactaaa 241 aatatgatta acattattag ccgcctacaa gaggtctttg gtcatgcaat taaggctgca 301 tatccagatt tggaaaatcc tcctctgcta gtgacaccaa gtcagcaggc caagtttggg 361 gactatcagt gtaatagtgc tatgggtatt tctcagatgc ttaaaaccaa ggaaccggaa 421 gttaatccag gggaatttgc tgaaaacatt accaaacacc tcccggccat ggatgttttg 481 aaaagagttg aatttgctgg ccctggcttt attaatggcc acttaagaaa ggattttgta 541 tcagaacaat tgaccagtct tctagtgaat ggagttcaac tacctgctct gggagagaat 601 aaaaaggtta tagttgactt ttcctcccct aatatagcta aagagatgca tgtaggccac 661 ctgaggtcaa ctatcatagg agagagtata agccgcctct ttgaatttgc agggtatgac 721 gtgctcaggt taaatcatgt aggagactgg gggacccagt ttggcatgct catcgctcac 781 ctgcaagaca aatttccaga ttatctaaca gtttcacctc ctattgggga tcttcaggtc 841 ttttataagg aatctaagaa gaggtttgat actgaggagg aatttaagaa gcgagcatat 901 cagtgtgtag ttctgctcca gggtaaaaac ccagatatta caaaagctta tctgctgatg 961 tctgatgtct cccgccaaga gttaaataaa atctatgatg cattggacgt ctctttaata 1021 gagagagggg aatccttcta tcaagatagg atgaatgata ttgtaaagga atttgaagat 1081 agaggatttg tgcaggtgga tgatggcaga aagattgtat ttgtcccagg gtgttccata 1141 ccattaacca tagtaaaatc agatggaggt tatacctatg atacatctga cctggctgct 1201 attaaacaaa gactatttga ggaaaaagca gatatgatta tctatgttgt ggacaatgga 1261 caatctgtgc acttccagac aatatttgct gctgctcaaa tgattggttg gtatgaccct 1321 aaagtaactc gagtcttcca tgctggattt ggtgtggtgc taggggaaga caagaaaaag 1381 tttaaaacac gttcgggtga aacagtgcgc ctcatggatc ttctgggaga aggactaaaa 1441 cgatccatgg acaagttgaa ggaaaaagaa agagacaagg tcttaactgc agaggaattg 1501 aatgctgctc agacatccgt tgcatatggc tgcatcaaat atgctgacct ttcccataac 1561 cggttgaatg actacatctt ctcctttgac aaaatgctag atgacagagg aaatacagct 1621 gcttacttgt tgtatgcctt cactagaatc aggtctattg cacgtctggc caatattgat 1681 gaagaaatgc tccaaaaagc tgctcgagaa accaagattc ttttggttca tgagaaggaa 1741 tggaaactag gccggtgcat tttacggttc cctgagattc tgcaaaagat tttagatgac 1801 ttatttctcc acactctctg tgattatata tatgagctgg caactgcttt cacagagttc 1861 tatgatagct gctactgtgt ggagaaagat agacagactg ggaaaatatt gaaggtgaac 1921 atgtggcgta tcttgtgtga aacagtagct gctgtcatgg ccaaggggtt tgataccctg 1981 gggataaaac ctggcccaag ggtgtaatcc ctcacaggtt tgaaccctgt gtgttttttc 2041 ccaagtggcc attggccctg tttgcttttt ttcaatcttg tgggcacaag cataagttaa 2101 ggaaattttt cacccaggca // LOCUS S80491 180 bp mRNA PRI 01-APR-1996 DEFINITION stem cell factor {alternatively spliced} [human, preimplantation embryos, blastocysts, mRNA Partial, 180 nt]. ACCESSION S80491 NID g1246099 KEYWORDS . SOURCE human blastocysts preimplantation embryos. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 180) AUTHORS Sharkey,A.M., Dellow,K., Blayney,M., Macnamee,M., Charnock-Jones,S. and Smith,S.K. TITLE Stage-specific expression of cytokine and receptor messenger ribonucleic acids in human preimplantation embryos JOURNAL Biol. Reprod. 53 (4), 974-981 (1995) MEDLINE 96095300 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 174368] from the original journal article. This sequence comes from Fig. 3B. FEATURES Location/Qualifiers source 1..180 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..114 /partial /gene="stem cell factor, SCF" CDS 1..114 /partial /gene="stem cell factor, SCF" /note="This sequence comes from Fig. 3B; SCF" /codon_start=1 /product="stem cell factor" /db_xref="PID:g1246100" /translation="MDVLEICSLLIGLTAYKELSLPKRKETCRAIQHPRKD" BASE COUNT 60 a 30 c 43 g 47 t ORIGIN 1 atggatgttt tggaaatctg ttcattgttg atagggctga cggcctataa ggaattatca 61 ctccctaaaa ggaaagaaac ttgcagagca attcagcatc caaggaaaga ctgacagctt 121 tgaaagagac ctgataatga tgcaagtagg aacttgcatg tgcttgaacc aagtcattgt // LOCUS S80562 1607 bp mRNA PRI 01-APR-1996 DEFINITION acidic calponin [human, kidney, mRNA, 1607 nt]. ACCESSION S80562 NID g1245966 KEYWORDS . SOURCE human kidney. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1607) AUTHORS Maguchi,M., Nishida,W., Kohara,K., Kuwano,A., Kondo,I. and Hiwada,K. TITLE Molecular cloning and gene mapping of human basic and acidic calponins JOURNAL Biochem. Biophys. Res. Commun. 217 (1), 238-244 (1995) MEDLINE 96095663 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 174415] from the original journal article. This sequence comes from Fig. 1b. Map location: 1p21-22. FEATURES Location/Qualifiers source 1..1607 /organism="Homo sapiens" /db_xref="taxon:9606" gene 84..1073 /gene="acidic calponin" CDS 84..1073 /gene="acidic calponin" /note="This sequence comes from Fig. 1b." /codon_start=1 /product="acidic calponin" /db_xref="PID:g1245967" /translation="MTHFNKGPSYGLSAEVKNKIASKYDHQAEEDLRNWIEEVTGMSI GPNFQLGLKDGIILCELINKLQPGSVKKVNESSLNWPQLENIGNFIKAIQAYGMKPHD IFEANDLFENGNMTQVQTTLVALAGLAKTKGFHTTIDIGVKYAEKQTRRFDEGKLKAG QSVIGLQMGTNKCASQAGMTAYGTRRHLYDPKMQTDKPFDQTTISLQMGTNKGASQAG MLAPGTRRDIYDQKLTLQPVDNSTISLQMGTNKVASQKGMSVYGLGRQVYDPKYCAAP TEPVIHNGSQGTGTNGSEISDSDYQAEYPDEYHGEYQDDYPRDYQYSDQGIDY" BASE COUNT 505 a 332 c 338 g 432 t ORIGIN 1 gctctgtagc acccaggagc ggggaagcga agtgcgagag accccggacc ccagcgctgt 61 ctcttcccgc cgcccgaacc accatgaccc acttcaacaa gggcccttcc tatgggctct 121 cggccgaagt caagaacaag attgcttcca agtatgatca tcaggcagaa gaagatcttc 181 gcaattggat agaagaggtg acaggcatga gcattggccc caacttccag ctgggcttaa 241 aggatggcat catcctctgc gaacttataa acaagctaca gccaggctca gtgaagaagg 301 tcaacgagtc ctcactgaac tggcctcagt tggagaatat tggcaacttt attaaagcta 361 ttcaggctta tggtatgaag ccacatgaca tattcgaagc aaatgatctt tttgagaatg 421 gaaacatgac ccaggttcag actactctgg tggctctagc aggtctggct aaaacaaaag 481 gattccatac aaccattgac attggagtta agtatgcaga aaaacaaaca agacgttttg 541 atgaaggaaa attaaaagct ggccaaagtg taattggtct gcagatggga accaacaaat 601 gtgccagcca ggcaggtatg acagcttacg ggactaggag gcatctttat gatcccaaaa 661 tgcaaactga caaacctttt gaccagacca caattagtct gcagatgggc actaataaag 721 gagccagcca ggcagggatg ttagcaccag gtaccagaag agacatctat gatcagaagc 781 taacattaca gccggtggac aactcgacaa tttccctaca gatgggtacc aacaaagttg 841 cttcccagaa aggaatgagt gtgtatgggc ttgggcggca agtatatgat cccaaatact 901 gtgctgctcc tacagaacct gtcattcaca acggaagcca aggaacagga acaaatggtt 961 cggaaatcag tgatagtgat tatcaggcag aataccctga tgagtatcat ggcgagtacc 1021 aggatgacta ccccagagat taccaatata gcgaccaagg cattgattat tagatccaca 1081 cagaaggagc tcagtattta gtcctttgtt tttattcagt gagaaccaag ctagccttga 1141 gtaattttta tcttgtcttc ctaaaacact attaagctta ttgtactttt aagaaaaatt 1201 gccttacgta cattcctttt tcctttttct gcctcttccc tcaatagttg ccttttagtg 1261 ctgtaatagg ttaaatccta cagcataatc aataactcgc atatgaagta aaaaggaata 1321 ctgtgaaagg ggagtactct tgtacagcca gttcttttat gcaaaaatct atgcattttt 1381 acaatcttat attaaactgg tattttcaaa caataggaaa cttttttttt ttttttttac 1441 agtttagtgt atctggtttc tacatggaag actaaactca tgcttattgc taaatgtggt 1501 ctttgccaac taaatttaag atgcagcatt ttagaaattt acatatcaat gtttctacag 1561 tattgtttgc taatttttaa ataaagtcat gatcagtgtg aaaaaaa // LOCUS S80864 1041 bp mRNA PRI 27-MAR-1997 DEFINITION cytochrome c-like polypeptide [human, lung adenocarcinoma A549, mRNA, 1041 nt]. ACCESSION S80864 NID g1911547 KEYWORDS . SOURCE human lung adenocarcinoma A549. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1041) AUTHORS Kawamoto,S., Hashizume,S., Katakura,Y., Tachibana,H. and Murakami,H. TITLE Molecular cloning of yeast cytochrome c-like polypeptide expressed in human lung carcinoma: an antigen recognizable by lung cancer-specific human monoclonal antibody JOURNAL In Vitro Cell. Dev. Biol. Anim. 31 (9), 724-729 (1995) MEDLINE 96121639 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 175349] from the original journal article. This sequence comes from Fig. 2B. FEATURES Location/Qualifiers source 1..1041 /organism="Homo sapiens" /db_xref="taxon:9606" gene 322..894 /gene="cytochrome c-like polypeptide" CDS 322..894 /gene="cytochrome c-like polypeptide" /note="This sequence comes from Fig. 2B." /codon_start=1 /db_xref="PID:g1911548" /translation="MFTSSILGKGHRHSLVSIKQEHSALRKAAGPLPLKVGYCQGFSP CDSLKYGSWDEKDLTVPQPDTHKGSVLRWISKRGKPLAVEIEEGHCLCLPLGTECLGI KPIVHLFNSEIGENRPMVGARHVSSNAALLFFTPLRCLGGEKHKSGLHAHPGIVPSLE LNHDTDSFAHMFFADLLLIITLLSYYIPFC" BASE COUNT 318 a 205 c 233 g 285 t ORIGIN 1 ttcaatgtaa atgctgtgtc agtagttgtc tagggaataa tgacaagaag aaaattttgt 61 acatgttcag tacgtaggaa aaagaaagag agatcagact gtcactgtgt ctatgtagaa 121 agggaagaca taagagattc cattttgaaa aagacctgta ctttaaacag ttgctttgct 181 gagatgttgt taatttgtag ctttgcccca gccactttgc cttagccact ttgacccaac 241 ctggagctca caaaaatatg tgttgtataa aatcaaggtt taagggatct agggctgtgc 301 aggatatgcc ttgttaacaa aatgtttaca agcagtatac ttggtaaagg tcatcgccat 361 tctctagtct caataaaaca ggagcacagt gcactgcgga aagccgcagg acctctgccc 421 ttgaaagtgg ggtattgtca aggtttctcc ccatgtgata gtctgaaata tggctcgtgg 481 gatgagaaag acctgactgt gccccagccc gacacccata aagggtctgt gctgaggtgg 541 attagtaaaa gaggaaagcc tcttgcagtt gagatagagg aaggccactg tctctgcctg 601 cccctgggaa ctgaatgtct cggtataaaa ccgattgtac atttgttcaa ttctgagata 661 ggagaaaacc gccctatggt gggagcgaga catgtttcga gcaatgctgc cttgttattc 721 tttactccgc tgagatgttt gggtggagag aaacataaat ctggcctaca tgcacatccg 781 ggcatagtac cttcccttga acttaatcat gacacagatt cttttgctca catgtttttt 841 gctgaccttc tccttattat caccctgctg tcctactaca ttcctttttg ctgaaataat 901 gaaaataata gtcaataaaa actgagggaa ctcaaaggcc ggtgccagtg caggtccttg 961 gtgtgtcgaa tactggtccc ctggacccac tgttgtttct ctaaaaaaaa aaaaaaaaaa 1021 aaaaaaaaaa aaaaaaaaaa a // LOCUS S81419 377 bp mRNA PRI 24-MAY-1996 DEFINITION dystrophin, dystrophin {Purkinje promoter, alternatively spliced} [human, cortical brain and adult heart, mRNA Partial, 377 nt]. ACCESSION S81419 NID g1332715 KEYWORDS . SOURCE human cortical brain and adult heart. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 377) AUTHORS Holder,E., Maeda,M. and Bies,R.D. TITLE Expression and regulation of the dystrophin Purkinje promoter in human skeletal muscle, heart, and brain JOURNAL Hum. Genet. 97 (2), 232-239 (1996) MEDLINE 96163501 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 176571] from the original journal article. This sequence comes from Fig. 2. COMMENT Authors note this nucleotide sequence corrects that of D.C. Gorecki et. al. in: Hum. Mol. Genet. 1, 505-510 (1992). FEATURES Location/Qualifiers source 1..377 /organism="Homo sapiens" /db_xref="taxon:9606" gene 263..289 /partial /gene="dystrophin" gene 263..377 /partial /gene="dystrophin" CDS join(263..281,364..377) /partial /gene="dystrophin" /note="exon P1A" /codon_start=1 /translation="MSEVSSDEREM" CDS 263..289 /partial /gene="dystrophin" /note="Alternatively spliced exon P1B" /codon_start=1 /translation="MSEVSSGE" BASE COUNT 114 a 73 c 92 g 98 t ORIGIN 1 tgctgtctgt gaagctgaat ctgtgagaac acctcactat tcacggcaac cggagtggaa 61 gaaacaggtg caaaaagatt gtgtgtttgt ctgcttttgt gaggctggtc agagattctg 121 tgcctgcttt atctgtgctt ggctatgact ctacctccag gtttaccata ccccatagaa 181 tgtgtaagag aaaagtacca acagggaaat cagcaaaaag ctttcctatg aaggtgtgta 241 gccagcctcc gcagaatttg aaatgtctga ggtttcttct ggtgagtaaa agctgcagat 301 aatcaacagc cattcagaag aatgataaat gccacaagca tttggaaaca ggcttcccta 361 aagatgaaag agagatg // LOCUS S81734 2362 bp mRNA PRI 01-AUG-1996 DEFINITION tissue transglutaminase homologue {alternatively spliced} [human, erythroleukemia cell line HEL GM06141A, mRNA, 2362 nt]. ACCESSION S81734 NID g1478006 KEYWORDS . SOURCE human erythroleukemia cell line HEL GM06141A. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2362) AUTHORS Fraij,B.M. and Gonzales,R.A. TITLE A third human tissue transglutaminase homologue as a result of alternative gene transcripts JOURNAL Biochim. Biophys. Acta 1306 (1), 63-74 (1996) MEDLINE 96201707 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 177132] from the original journal article. This sequence comes from Fig. 2. FEATURES Location/Qualifiers source 1..2362 /organism="Homo sapiens" /db_xref="taxon:9606" gene 133..1182 /gene="tissue transglutaminase homologue, TGH2" CDS 133..1182 /gene="tissue transglutaminase homologue, TGH2" /note="This sequence comes from Fig. 2; TGH2" /codon_start=1 /product="tissue transglutaminase homologue" /db_xref="PID:g1478007" /translation="MAEELVLERCDLELETNGRDHHTADLCREKLVVRRGQPFWLTLH FEGRNYEASVDSLTFSVVTGPAPSQEAGTKARFPLRDAVEEGDWTATVVDQQDCTLSL QLTTPANAPIGLYRLSLEASTGYQGSSFVLGHFILLFNAWCPADAVYLDSEEERQEYV LTQQGFIYQGSAKFIKNIPWNFGQFEDGILDICLILLDVNPKFLKNAGRDCSRRSSPV YVGRVVSGMVNCNDDQGVLLGRWDNNYGDGVSPMSWIGSVDILRRWKNHGCQRVKYGQ CWVFAAVACTGELHAGMWVMSPGRGHEEHWSRNQDIPALVLPPATNTLNALCGLEPVT TLSGPLSNSHPSSGC" BASE COUNT 616 a 593 c 640 g 513 t ORIGIN 1 caggcgtgac gccagttcta aatcttgaaa caaaacaaaa cttcaaagta caccaaaata 61 gaacctcctt aaagcataaa tctcacggag ggtctcgccg ccagtggaag gagccaccgc 121 ccccgcccga ccatggccga ggagctggtc ttagagaggt gtgatctgga gctggagacc 181 aatggccgag accaccacac ggccgacctg tgccgggaga agctggtggt gcgacggggc 241 cagcccttct ggctgaccct gcactttgag ggccgcaact acgaggccag tgtagacagt 301 ctcaccttca gtgtcgtgac cggcccagcc cctagccagg aggccgggac caaggcccgt 361 tttccactaa gagatgctgt ggaggagggt gactggacag ccaccgtggt ggaccagcaa 421 gactgcaccc tctcgctgca gctcaccacc ccggccaacg cccccatcgg cctgtatcgc 481 ctcagcctgg aggcctccac tggctaccag ggatccagct ttgtgctggg ccacttcatt 541 ttgctcttca acgcctggtg cccagcggat gctgtgtacc tggactcgga agaggagcgg 601 caggagtatg tcctcaccca gcagggcttt atctaccagg gctcggccaa gttcatcaag 661 aacatacctt ggaattttgg gcagtttgaa gatgggatcc tagacatctg cctgatcctt 721 ctagatgtca accccaagtt cctgaagaac gccggccgtg actgctcccg ccgcagcagc 781 cccgtctacg tgggccgggt ggtgagtggc atggtcaact gcaacgatga ccagggtgtg 841 ctgctgggac gctgggacaa caactacggg gacggcgtca gccccatgtc ctggatcggc 901 agcgtggaca tcctgcggcg ctggaagaac cacggctgcc agcgcgtcaa gtatggccag 961 tgctgggtct tcgccgccgt ggcctgcaca ggtgagctgc acgctgggat gtgggtcatg 1021 agccctggga gggggcacga agagcactgg agtaggaatc aggacatccc tgccctggtc 1081 ctgccccctg ccaccaacac tctcaatgcg ctatgtggcc tagagcctgt gaccaccctc 1141 tccggtcctt tatccaatag ccatccctcc agtggttgtt aggatcaagg gctatcattg 1201 gagagactgc atgggctcag ggctgagaac ttgggcactg gagcccacgt tctcaaaaac 1261 tcaggatttt taacttgagt taaaaacccc ctccctccca ccacttgctc tgtgaccttg 1321 agcaaatgac ttctttctga acctcagttt cctcgtctgg aaaatgggga caacatcaag 1381 accttcctcc tagagtgggt gtgacatgaa gctactcagc acagaaccta gcccagtgtc 1441 actctcggca aatattagcc agtaataact ggatcctaac agtaatatca gtgtcaagaa 1501 agaaagcatg ttctaaaatg tagaggggga atatcgagca atttaaaacc aaaaaagtaa 1561 aataaatcta aaacaaaaat cttcaaaatt taaggccaga aaatgtcgat gattgttgcc 1621 aggctgatga tggtgcttat gagaatgggt tagcatcaca tgctggctaa gaacgtagac 1681 tcaagacatg aggattgctt gaggccggga gtttgagacc agcctgggca atatagtgag 1741 accttgtctc tacaaaaaaa taaaataaaa taaaaattag caggcatggt tgtgtgtctg 1801 gagtcttggc tactcagaag gctgaggcag gaagatcgct tgacccctgg aggttggctg 1861 caatgagcta tgattgcacc attgcactcc agcctgggca acggagcaag atcctctctc 1921 aaaaaaaaaa aaaaaaaaaa aaaaaaaaag agagagagca caaattctga agtcaaggcc 1981 tgggttcaaa tcctagctct gcttcttacc agctgtgtgt ccttaggtaa atcacttaac 2041 tcctctgaac cacagttcca ttgtatgtaa aatgggcaca acggtagtac ctttatactg 2101 ggatggttgt gaggattaaa tgagttaata tgtgtgaggt agctgagcgt ggtggtggtg 2161 cacctgtagt cttagctgtt tgggaggctg aggcaggggg atcacttgag cccaggaggt 2221 cgaggctgcg gtgagtcatg attgcaccac tgtactccag cctgggtgac agagcgagac 2281 ctcatctcta aataaataaa tgaatgtggg aagtgcttaa aatgattttg ggtgcataat 2341 agaaaatact tatgtgatga ta // LOCUS S81944 1732 bp mRNA PRI 31-JUL-1996 DEFINITION gamma-aminobutyric acid type A receptor alpha 6 subunit [human, cerebellum, mRNA Partial, 1732 nt]. ACCESSION S81944 NID g1470363 KEYWORDS . SOURCE human cerebellum. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1732) AUTHORS Hadingham,K.L., Garrett,E.M., Wafford,K.A., Bain,C., Heavens,R.P., Sirinathsinghji,D.J. and Whiting,P.J. TITLE Cloning of cDNAs encoding the human gamma-aminobutyric acid type A receptor alpha 6 subunit and characterization of the pharmacology of alpha 6-containing receptors JOURNAL Mol. Pharmacol. 49 (2), 253-259 (1996) MEDLINE 96226062 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 177635] from the original journal article. This sequence comes from Fig. 1. FEATURES Location/Qualifiers source 1..1732 /organism="Homo sapiens" /db_xref="taxon:9606" gene 27..1388 /gene="gamma-aminobutyric acid type A receptor alpha 6 subunit, GABAA receptor alpha 6 subunit" CDS 27..1388 /gene="gamma-aminobutyric acid type A receptor alpha 6 subunit, GABAA receptor alpha 6 subunit" /note="This sequence comes from Fig. 1; GABAA receptor alpha 6 subunit" /codon_start=1 /product="gamma-aminobutyric acid type A receptor alpha 6 subunit" /db_xref="PID:g1470364" /translation="MASSLPWLCIILWLENALGKLEVEGNFYSENVSRILDNLLEGYD NRLRPGFGGAVTEVKTDIYVTSFGPVSDVEMEYTMDVFFRQTWTDERLKFGGPTEILS LNNLMVSKIWTPDTFFRNGKKSIAHNMTTPNKLFRIMQNGTILYTMRLTINADCPMRL VNFPMDGHACPLKFGSYAYPKSEIIYTWKKGPLYSVEVPEESSSLLQYDLIGQTVSSE TIKSNTGEYVIMTVYFHLQRKMGYFMIQIYTPCIMTVILSQVSFWINKESVPARTVFG ITTVLTMTTLSISARHSLPKVSYATAMDWFIAVCFAFVFSALIEFAAVNYFTNLQTQK AKRKAQFAAPPTVTISKATEPLEAEIVLHPDSKYHLKKRITSLSLPIVSSSEANKVLT RAPILQSTPVTPPPLPPAFGGTSKIDQYSRILFPVAFAGFNLVYWVVYLSKDTMEVSS SVE" BASE COUNT 508 a 365 c 354 g 505 t ORIGIN 1 aattctgcat ttcagtgcac tgcaggatgg cgtcatctct gccctggctg tgcattattc 61 tgtggctaga aaatgcccta gggaaactcg aagttgaagg caacttctac tcagaaaacg 121 tcagtcggat cctggacaac ttgcttgaag gctatgacaa tcggctgcgg ccgggatttg 181 gaggtgctgt cactgaagtc aaaacagaca tttatgtgac cagttttggg cccgtgtcag 241 atgtggagat ggagtatacg atggatgttt tttttcgcca gacctggact gatgagaggt 301 tgaagtttgg ggggccaact gagattctga gtctgaataa tttgatggtc agtaaaatct 361 ggacgcctga cacctttttc agaaatggta aaaagtccat tgctcacaac atgacaactc 421 ctaataaact cttcagaata atgcagaatg gaaccatttt atacaccatg aggcttacca 481 tcaatgctga ctgtcccatg aggctggtta actttcctat ggatgggcat gcttgtccac 541 tcaagtttgg gagctatgct tatcccaaaa gtgaaatcat atatacgtgg aaaaaaggac 601 cactttactc agtagaagtc ccagaagaat cttcaagcct tctccagtat gatctgattg 661 gacaaacagt atctagtgag acaattaaat ctaacacagg tgaatacgtt ataatgacag 721 tttacttcca cttgcaaagg aagatgggct acttcatgat acagatatac actccttgca 781 ttatgacagt cattctttcc caggtgtctt tctggattaa taaggagtcc gtcccagcaa 841 gaactgtttt tgggatcacc actgttttaa ctatgaccac tttgagcatc agtgcccggc 901 actctttgcc aaaagtgtca tatgccactg ccatggattg gttcatagct gtttgctttg 961 cattcgtctt ctctgctctt atcgagttcg cagctgtcaa ctactttacc aatcttcaga 1021 cacagaaggc gaaaaggaag gcacagtttg cagccccacc cacagtgaca atatcaaaag 1081 ctactgaacc tttggaagct gagattgttt tgcatcctga ctccaaatat catctgaaga 1141 aaaggatcac ttctctgtct ttgccaatag tttcatcttc cgaggccaat aaagtgctca 1201 cgagagcgcc catcttacaa tcaacacctg tcacaccccc accactcccg ccagcctttg 1261 gaggcaccag taaaatagac cagtattctc gaattctctt cccagttgca tttgcaggat 1321 tcaaccttgt gtactgggta gtttatcttt ccaaagatac aatggaagtg agtagcagtg 1381 ttgaatagct tttccaggac aacctgaatt ctataagttc ttgttttctg tttcctatgt 1441 tttcttaaaa aatagcattg agacttgtgt agatgcttct cagaacatga aatcaaattg 1501 gaaatctgta acgcagcttc tgtaagcatg tgtgggcaaa aaagcaataa tcctactcct 1561 caaaatagaa agttgaagat tgctgaaaaa tatgactttt ctgtatgtta gagaaaaact 1621 ttatgaggat gaaatgggtt caagatgaat ttgtcaactt ttgtcttcca ttgttcagta 1681 tttttaatta tcactgtaaa taacattacc acaaggcaaa aaaaaaagaa aa // LOCUS S82198 894 bp mRNA PRI 11-FEB-1997 DEFINITION caldecrin=serum calcium-decreasing factor [human, pancreas, mRNA Partial, 894 nt]. ACCESSION S82198 NID g1839466 KEYWORDS . SOURCE human pancreas. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 894) AUTHORS Tomomura,A., Akiyama,M., Itoh,H., Yoshino,I., Tomomura,M., Nishii,Y., Noikura,T. and Saheki,T. TITLE Molecular cloning and expression of human caldecrin JOURNAL FEBS Lett. 386 (1), 26-28 (1996) MEDLINE 96221265 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 178076] from the original journal article. This sequence comes from Fig. 1. FEATURES Location/Qualifiers source 1..894 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..807 /gene="caldecrin" CDS 1..807 /gene="caldecrin" /note="serum calcium-decreasing factor; precursor. This sequence comes from Fig. 1. Author-given protein sequence is in conflict with the conceptual translation; mismatch(-14[S->T])" /codon_start=1 /product="caldecrin" /db_xref="PID:g1839467" /translation="MLGITVLAALLACASSCGVPSFPPNLSARVVGGEDARPHSWPWQ ISLQYLKNDTWRHTCGGTLIASNFVLTAAHCISNTRTYRVAVGKNNLEVEDEEGSLFV GVDTIHVHKRWNALLLRNDIALIKLAEHVELSDTIQVACLPEKDSLLPKDYPCYVTGW GRLWTNGPIADKLQQGLQPVVDHATCSRIDWWGFRVKKTMVCAGGDGVISACNGDSGG PLNCQLENGSWEVFGIVSFGSRRGCNTRKKPVVYTRVSAYIDWINEKMQL" BASE COUNT 189 a 267 c 265 g 173 t ORIGIN 1 atgttgggca tcactgtcct cgctgcgctc ttggcctgtg cctccacctg tggggtgccc 61 agcttcccgc ccaacctatc cgcccgagtg gtgggaggag aggatgcccg gccccacagc 121 tggccctggc agatctccct ccagtacctc aagaacgaca cgtggaggca tacgtgtggc 181 gggactttga ttgctagcaa cttcgtcctc actgccgccc actgcatcag caacacccgg 241 acctaccgtg tggccgtggg aaagaacaac ctggaggtgg aagacgaaga aggatccctg 301 tttgtgggtg tggacaccat ccacgtccac aagagatgga atgccctcct gttgcgcaat 361 gatattgccc tcatcaagct tgcagagcat gtggagctga gtgacaccat ccaggtggcc 421 tgcctgccag agaaggactc cctgctcccc aaggactacc cctgctatgt caccggctgg 481 ggccgcctct ggaccaacgg ccccattgct gataagctgc agcagggcct gcagcccgtg 541 gtggatcacg ccacgtgctc caggattgac tggtggggct tcagggtgaa gaaaaccatg 601 gtgtgcgctg ggggcgatgg cgtcatctca gcctgcaatg gggactccgg tggcccactg 661 aactgccagt tggagaacgg ttcctgggag gtgtttggca tcgtcagctt tggctcccgg 721 cggggctgca acacccgcaa gaagccggta gtctacaccc gggtgtccgc ctacatcgac 781 tggatcaacg agaaaatgca gctgtgattt gttgctggga gcggcggcag cgagtccctg 841 caacagcaat aaacttcctt ctcctcgggc cacctgaaaa aaaaaaaaaa aaaa // LOCUS S82470 1897 bp mRNA PRI 03-DEC-1996 DEFINITION BB1=malignant cell expression-enhanced gene/tumor progression-enhanced gene [human, UM-UC-9 bladder carcinoma cell line, mRNA, 1897 nt]. ACCESSION S82470 NID g1699264 KEYWORDS . SOURCE human UM-UC-9 bladder carcinoma cell line. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1897) AUTHORS Fukunaga-Johnson,N., Lee,S.W., Liebert,M. and Grossman,H.B. TITLE Molecular analysis of a gene, BB1, overexpressed in bladder and breast carcinoma JOURNAL Anticancer Res. 16 (3A), 1085-1090 (1996) MEDLINE 96273128 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 178511] from the original journal article. This sequence comes from 4A. FEATURES Location/Qualifiers source 1..1897 /organism="Homo sapiens" /db_xref="taxon:9606" gene 245..1273 /note="malignant cell expression-enhanced gene/tumor progression-enhanced gene" /gene="BB1" CDS 245..1273 /gene="BB1" /note="malignant cell expression-enhanced gene/tumor progression-enhanced gene; This sequence comes from Fig. 4A" /codon_start=1 /db_xref="PID:g1699265" /translation="MASGFSKGPTLGLLRRALPDGDTQLQLLLRGNHDRPVLPLPHLP GLAGAALPRGSASLRPLLRRAWPAPLFGLLFLLSSHLFPLEAVREDAFYARPLPARLF YMIPVFFAFRMRFYVAWIAAECGCIAAGFGAYPVAAKARAGGGPTLQCPPPSSPEKAA SLEYDYETIRNIDCYSTDFCVRVRDGMRYWNMTVQWWLAQYIYKSAPARSYVLRTAWT MLLSAYWHGLHPGYYLSFLTIPLCLAAEGRLESALRGRLSPGGQKAWDWVHWFLKMRA YDYMCMGFVLLSLADTLRYWASIYFCIHFLALAALGLGLALGGGSPSRRKAASQPTSL APEKLREE" BASE COUNT 307 a 661 c 530 g 399 t ORIGIN 1 ccacactttg cattctctgg tcaccatcct cgggacctgg gccctccatt caggcccagc 61 cctgctcctg ccacgccctg gctctggcct ggactttctc ctatctcctg ttcttccgag 121 ccctcagcct cctggcctgc ccactcccac gcccttcacc aatgccgtcc agctgctgct 181 gacgctgaag ctggtgagcc tggccagtga agtccaggac ctgcatctgg cccagaggaa 241 ggaaatggcc tcaggcttca gcaaggggcc caccctgggg ctgctgcgac gtgccctccc 301 tgatggagac actcagctac agctactgct acgtgggaat catgacaggc ccgttcttcc 361 gctaccgcac ctacctggac tggctggagc agcccttccc cggggcagtg ccagcctgcg 421 gcccctgctg cgccgcgcct ggccggcccc gctcttcggc ctgctgttcc tgctctcctc 481 tcacctcttc ccgctggagg ccgtgcgcga ggacgccttc tacgcccgcc cgctgcccgc 541 ccgcctcttc tacatgatcc ccgtcttctt cgccttccgc atgcgcttct acgtggcctg 601 gattgccgcc gagtgcggct gcattgccgc cggctttggg gcctaccccg tggccgccaa 661 agcccgggcc ggaggcggcc ccaccctcca atgcccaccc cccagcagtc cggagaaggc 721 ggcttccttg gagtatgact atgagaccat ccgcaacatc gactgctaca gcacagattt 781 ctgcgtgcgg gtgcgcgatg gcatgcggta ctggaacatg acggtgcagt ggtggctggc 841 gcagtatatc tacaagagcg cacctgcccg ttcctatgtc ctgcggacgg cctggaccat 901 gctgctgagc gcctactggc acggcctcca cccgggctac tacctgagct tcctgaccat 961 cccgctgtgc ctggctgccg agggccggct ggagtcagcc ctgcgggggc ggctgagccc 1021 agggggccag aaggcctggg actgggtgca ctggttcctg aagatgcgcg cctatgacta 1081 catgtgcatg ggcttcgtgc tgctctcctt ggccgacacc cttcggtact gggcctccat 1141 ctacttctgt atccacttcc tggccctggc agccctgggg ctggggctgg ctttaggtgg 1201 gggcagcccc agccggcgga aggcagcatc ccagcccacc agccttgccc cggagaagct 1261 ccgggaggag taagctgtca cgaccctccc tctgccagct ggtcccggga attctgtgaa 1321 ccaggctgct gtctcctccc cagaaagagt ccttaccttg gagagggtcc tggagagaat 1381 ttcctcttcc ccagctaaat accctgcctg caactgaagc agacccgggg gtgtcctccc 1441 tgccctctgc ccagaggcac ctccactcct acaaaatcaa agtattgtcc agacaagagt 1501 cactggcccc tgctccagct tctgggtatc cagagagcac tgcacttccc caaaacggaa 1561 ggggcccctg ggcagtgggt tttgggcaaa ttccctttct ttgcatccac aatgtgggtc 1621 ggagcttggg ggcaggtcct gggagtggga agcctcttcc ttgtgtcttt cgctccactt 1681 ttagctcatc gcaccaatat tgcagacttg gaaggaagca taacttccat ttcacaaagg 1741 ggaaactgag gtgcgggtgc gggcctgggg acggccgtcc catggcttcc atctgagcca 1801 cctcgggacc ccagcactcc tggcgccctc ttcttcatcg cttggcctat gacaggtcac 1861 cgtgtgtaaa tctttcccaa taaagtgttg cacaaaa // LOCUS S82471 675 bp mRNA PRI 03-DEC-1996 DEFINITION SSX3=Kruppel-associated box containing SSX gene [human, testis, mRNA Partial, 675 nt]. ACCESSION S82471 NID g1699271 KEYWORDS . SOURCE human testis. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 675) AUTHORS de Leeuw,B., Balemans,M. and Geurts van Kessel,A. TITLE A novel Kruppel-associated box containing the SSX gene (SSX3) on the human X chromosome is not implicated in t(X;18)-positive synovial sarcomas JOURNAL Cytogenet. Cell Genet. 73 (3), 179-183 (1996) MEDLINE 96302330 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 178546] from the original journal article. This sequence comes from Fig. 1. Map location: Xp11.2-p11.1. FEATURES Location/Qualifiers source 1..675 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..567 /note="Kruppel-associated box containing SSX gene" /gene="SSX3" CDS 1..567 /gene="SSX3" /note="Kruppel-associated box containing gene product; This sequence comes from Fig. 4. Map location Xp11.2-p11.1. Author-given protein sequence is in conflict with the conceptual translation; mismatches(8,57,84,95,133,140)" /codon_start=1 /product="SSX3" /db_xref="PID:g1699272" /translation="MNGDDTFVRRPTVGAQIPEKIQKAFDDIAKYFSKEEWEKMKVSE KIVYVYMKRKYEGMTKLGFKAILPSFMRNKRVTDFQGNDLDNDPNRGNQVERPQMTFG RLQGIFPKIMPKKPAEEGNVSKEVPEASGPHNDGKQLYPPGKPTTSEKINMISGPKRG EHAWTHRLRERKQLVIYEEISDPEEDDE" BASE COUNT 213 a 147 c 178 g 137 t ORIGIN 1 atgaacggag atgacacctt tgcaaggaga cccacggttg gtgctcaaat accagagaag 61 atacaaaagg ccttcgatga tattgccaaa tacttctcta aggaagagtg ggaaaagatg 121 aaagtctcgg agaaaatcgt ctatgtgtat atgaagagaa agtatgaggc catgactaaa 181 ctaggtttca aggccatcct cccatctttc atgcgtaata aacgggtcac agacttccag 241 gggaatgatt ttgataatga ccctaaccgt gggaatcagg ttctacgtcc tcagatgact 301 ttcggcaggc tccagggaat cttcccgaag atcatgccca agaagccagc agaggaagga 361 aatgtttcga aggaagtgcc agaagcatct ggcccacaaa acgatgggaa acagctgtgc 421 cccccgggaa aaccaactac ctctgagaag attaacatga tatctggacc caaaaggggg 481 gaacatgcct ggacccacag actgcgtgag agaaaacagc tggtgattta tgaagagatc 541 agcgatcctg aggaagatga tgagtaactc cccttgggga tatgacacat gcccatgatg 601 agaagcagaa cgtggtgacc tttcacgaac atgggcatgg ctgtggaccc ctcgtcatca 661 ggtgcatagc aagtg // LOCUS S82769 1536 bp mRNA PRI 28-DEC-1996 DEFINITION GABAA receptor gamma 3 subunit [human, fetal brain, mRNA Partial, 1536 nt]. ACCESSION S82769 NID g1754748 KEYWORDS . SOURCE human fetal brain. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1536) AUTHORS Hadingham,K.L., Wafford,K.A., Thompson,S.A., Palmer,K.J. and Whiting,P.J. TITLE Expression and pharmacology of human GABAA receptors containing gamma 3 subunits JOURNAL Eur. J. Pharmacol. 291 (3), 301-309 (1995) MEDLINE 96360042 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 179067] from the original journal article. This sequence comes from Fig. 1. FEATURES Location/Qualifiers source 1..1536 /organism="Homo sapiens" /db_xref="taxon:9606" gene 33..1436 /gene="GABAA receptor gamma 3 subunit" CDS 33..1436 /gene="GABAA receptor gamma 3 subunit" /note="This sequence comes from Fig. 1." /codon_start=1 /product="GABAA receptor gamma 3 subunit" /db_xref="PID:g1754749" /translation="MAPKLLLLLCLFSGLHARSRKVEEDEYEDSSSNQKWVLAPKSQD TDVTLILNKLLREYDKKLRPDIGIKPTVIDVDIYVNSIGPVSSINMEYQIDIFFAQTW TDSRLRFNSTMKILTLNSNMVGLIWIPDTIFRNSKTAEAHWITTPNQLLRIWNDGKIL YTLRLTINAECQLQLHNFPMDEHSCPLIFSSYGYPKEEMIYRWRKNSVEAADQKSWRL YQFDFMGLRNTTEIVTTSAGDYVVMTIYFELSRRMGYFTIQTYIPCILTVVLSWVSFW IKKDATPARTALGITTVLTMTTLSTIARKSLPRVSYVTAMDLFVTVCFLFVFAALMEY ATLNYYSSCRKPTTTKKTTSLLHPDSSRWIPERISLQAPSNYSLLDMRPPPPAMITLN NSVYWQEFEDTCVYECLDGKDCQSFFCCYEECKSGSWRKGRIHIDILELDSYSRVFFP TSFLLFNLVYWVGYLYL" BASE COUNT 423 a 387 c 337 g 389 t ORIGIN 1 tgaattcgtg agatggcgag ctccacggca ccatggcccc gaagctgctg ctcctcctct 61 gcctgttctc gggcttgcac gcgcggtcca gaaaggtgga agaggatgaa tatgaagatt 121 catcatcaaa ccaaaagtgg gtcttggctc caaaatccca agacaccgac gtgactctta 181 ttctcaacaa gttgctaaga gagtatgata aaaagctgag gccagatatt ggaataaaac 241 cgaccgtaat tgacgttgac atttatgtta acagcattgg tcctgtgtca tcaataaaca 301 tggaatacca aattgacata ttttttgctc agacctggac agatagtcgc cttcgattca 361 acagcacaat gaaaattctt actctgaaca gcaacatggt ggggttaatc tggatcccag 421 acaccatctt ccgcaattct aaaaccgcag aggctcactg gatcaccaca cccaatcagc 481 tcctccggat ttggaatgac gggaaaatcc tttacacttt gaggctcacc atcaatgctg 541 agtgccagct gcagctgcac aacttcccca tggacgaaca ctcctgcccg ctgattttct 601 ccagctatgg ctatcccaaa gaagaaatga tttatagatg gagaaaaaat tcagtggagg 661 cagctgacca gaaatcatgg cggctttatc agtttgactt catgggcctc agaaacacca 721 cagaaatcgt gacaacgtct gcaggtgatt atgttgtcat gactatatat tttgaattga 781 gtagaagaat gggatacttc accattcaga catacattcc ctgtatactg actgtggttt 841 tatcctgggt gtcattttgg atcaaaaaag atgctacgcc agcaagaaca gcattaggca 901 tcaccacggt gctgaccatg accaccctga gcaccatcgc caggaagtcc ttgccacgcg 961 tgtcctacgt gaccgccatg gacctttttg tgactgtgtg cttcctgttt gtcttcgccg 1021 cgctgatgga gtatgccacc ctcaactact attccagctg tagaaaacca accaccacga 1081 aaaagacaac atcgttacta catccagatt cctcaagatg gattcctgag cgaataagcc 1141 tacaagcccc ttccaactat tccctcctgg acatgaggcc accaccacct gcgatgatca 1201 ctttaaacaa ttccgtttac tggcaggaat ttgaagatac ctgtgtctat gagtgtctgg 1261 atggcaaaga ctgtcagagc ttcttctgct gctatgaaga atgtaaatca ggatcctgga 1321 ggaaagggcg tattcacata gacatcttgg agctggactc gtactcccgg gtctttttcc 1381 ccacgtcctt cctgctcttt aacctggtct actgggttgg atacctgtat ctctaagtgt 1441 tgctcagagt gaagagtgaa gagcatttgg tacacacttg accttctgtc gtccccagac 1501 cagtagtgac caatcgggag tagcaaggaa ggacac // LOCUS S83157 2483 bp mRNA PRI 10-FEB-1997 DEFINITION hSMP-1=sperm membrane protein [human, testis, mRNA, 2483 nt]. ACCESSION S83157 NID g1836034 KEYWORDS . SOURCE human testis. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2483) AUTHORS Liu,Q.Y., Wang,L.F., Miao,S.Y. and Catterall,J.F. TITLE Expression and characterization of a novel human sperm membrane protein JOURNAL Biol. Reprod. 54 (2), 323-330 (1996) MEDLINE 96380169 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 179676] from the original journal article. This sequence comes from Fig. 2. FEATURES Location/Qualifiers source 1..2483 /organism="Homo sapiens" /db_xref="taxon:9606" gene 117..1688 /gene="hSMP-1" CDS 117..1688 /gene="hSMP-1" /note="sperm membrane protein; This sequence comes from Fig. 2" /codon_start=1 /product="hSMP-1" /db_xref="PID:g1836035" /translation="METNESTEGSRSRSRSLDIQPSSEGLGPTSEPFPSSDDSPRSAL AAATSAAAAAASAAAATAAFTTAKAAALSTKTPAPCSEFMEPSSDPSLLGEPCAGPGF THNIAHGSLGFEPVYVSCIAQDTCTTTDHSSNPGPVPGSSSGPVLGSSSGAGHGSGSG SGPGCGSVPGSGSGPGPGSGPGSGPGHGSGSHPGPASGPGPDTGPDSELSPCIPPGFR NLVADRVPNYTSWSQHCPWEPQKQPPWEFLQVLEPGARGLWKPPDIKGKLQVCYETLP RGQCLLYNWEEERATNHLDQVPSMQDGSESFFFRHGHRGLLTMQLTSPMPSSTTQKDS YQPPGNVYWPLRGKREAMLEMLLQHQICKEVQAEQEPTRKLFEVESVTHHDYRMELAQ AGTPAPTKPHDYRQEQPETFWIQRAPQLPVCEGDLVLGAERGRKGRAELFCSGLGRVG SILMLALLQGVSNIRTLDTPFRKNCSFSTPVPLSLGKLLPYEPEPYPYQLGEISSLPC PGGRLGGGGGRMTPF" BASE COUNT 577 a 657 c 671 g 578 t ORIGIN 1 ggccccgcag aggacttgcc tccaccgcca cctgcaagtc cgcccagctg gacttctgcg 61 caggctccag gagttgtttg ctgtctctat gtcaacccag tagctggagt ctgaagatgg 121 agaccaacga gtctacggag ggatcgcggt cgcggtcgcg atctttagac atacagccca 181 gctccgaagg actggggccc acttcggaac cgtttccttc ttcagatgac agtcccaggt 241 cggccctggc agctgcaacc tcagcagctg cagcggctgc atcagctgct gcagctactg 301 cagccttcac cactgccaaa gcagctgcat tatctacaaa gaccccagcg ccctgttctg 361 agttcatgga gccgtcctct gaccccagcc ttcttgggga gccctgtgcg ggacccggct 421 ttacccacaa tatagcccat gggagtcttg gctttgagcc cgtctatgtt tcctgtattg 481 ctcaggacac ttgcactaca actgaccata gttctaatcc tggccctgtt ccaggctcta 541 gctctgggcc tgttcttggt tccagctcag gtgctggcca tggctctggc tctggctctg 601 gtcctggctg tggctctgtc cctggctctg gctctggtcc tggtcctggc tctggtcctg 661 gctctggtcc tggtcatggc tctggctctc atcctggtcc tgcctctggg cctggtccag 721 acactggccc tgactctgag ctcagcccct gtattcctcc agggttcaga aacctggtgg 781 cagatcgggt ccctaactat acctcctgga gtcagcactg cccctgggag ccccagaaac 841 aaccaccttg ggaatttttg caagtcttag aaccgggtgc ccgaggacta tggaaacccc 901 cagacattaa agggaagctt caggtttgct atgaaacttt gccgcggggc cagtgcctcc 961 tctacaactg ggaggaagag agagccacca accacctgga tcaagtccca agcatgcagg 1021 atggctctga gagttttttc ttccgacacg gacaccgggg actgctgact atgcaactaa 1081 cgtcacccat gccctccagc accacccaga aagactcgta ccagccacca ggaaacgtct 1141 attggccact tcgagggaag cgtgaagcca tgctggagat gctcctgcag catcagatct 1201 gtaaagaggt gcaggcagaa caggaaccca caaggaagct cttcgaggtt gagtctgtga 1261 cacaccatga ctaccgaatg gagctggcac aagcagggac tcctgcccca acaaagcctc 1321 acgactaccg ccaggagcaa cctgagacct tctggataca gagggcacca cagctgccgg 1381 tgtgtgaggg tgacttggtg ttgggggcag agcggggcag gaaaggtagg gcagagttgt 1441 tttgttctgg cttggggaga gtgggatcca tcctcatgct ggcactcctc cagggtgtca 1501 gtaacatcag gacattggac acaccattcc ggaagaactg cagcttctca acaccagtac 1561 ccttgtctct ggggaaactt ttgccctatg aacctgagcc ttacccctac caattgggag 1621 aaatatcttc ccttccctgt cccggaggaa ggctgggtgg tggcgggggg agaatgactc 1681 ctttctgagg ggtgaggagg gaagtggggt atggaatatg gaatctattt ctgtctgcac 1741 tagagaggtc gggaggaagt taattctcac tgtacttgaa gaggctttac ataaagggtt 1801 ctctttcatc cccaagactg ctaatttagt gattctgtga gtcactgtgt gcatacccca 1861 taacctttcc ttgggattgc ccctatccca cactatgaca tcagaacttt ttttattatt 1921 gttttatatt tgaccaaaat attcaggttt agatacagat atttacagaa agtagggagg 1981 aaggggacca agccagacag ggacaggtat atgtacaggg ctgagctgca gagggctaca 2041 acctccatat aaggtagctt tttttgtggc tcttttccat tgcatatgaa aatgtccatt 2101 tctggtgtgt gcagacatgg tggccattgc ccacccaggt accagcagca gaagactatc 2161 tgacttggaa agaatggggg tttacaggag tccaggaggt cctttccgct ctcctaagag 2221 ccacacctgg tgaatactca gtaaacattt gcggaatgaa tgaacaccct gtctgcagca 2281 ggacatggac aaataggctc tgccatcagg aaatgggaag caagaaaacg gtaactcaag 2341 aggacagagg ctggtgagga ggtaaagcag gcccagtgaa atggacctga cagcccaaac 2401 tgtcagagag gaggaaagca ggcatttgat tttggtaatt gtgccaaagc tatgtaagag 2461 aataaatgcc attgttttaa gga // LOCUS S83198 947 bp mRNA PRI 10-FEB-1997 DEFINITION BPLP=basic proline-rich protein [human, lacrimal gland, mRNA, 947 nt]. ACCESSION S83198 NID g1836021 KEYWORDS . SOURCE human lacrimal gland. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 947) AUTHORS Dickinson,D.P. and Thiesse,M. TITLE cDNA cloning of an abundant human lacrimal gland mRNA encoding a novel tear protein JOURNAL Curr. Eye Res. 15 (4), 377-386 (1996) MEDLINE 96309097 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 179699] from the original journal article. This sequence comes from Fig. 1. FEATURES Location/Qualifiers source 1..947 /organism="Homo sapiens" /db_xref="taxon:9606" gene 81..686 /note="basic proline-rich protein, BPLP" /gene="BPLP" CDS 81..686 /gene="BPLP" /note="tear protein/PRPb homolog; This sequence comes from Fig. 1" /codon_start=1 /product="basic proline-rich protein" /db_xref="PID:g1836022" /translation="MKLTFFLGLLALISCFTPSESQRFSRRPYLPGQLPPPPLYRPRW VPPSPPPPYDSRLNSPLSLPFVPGRVPPSSFSRFSQAVILSQLFPLESIRQPRLFPGY PNLHFPLRPYYVGPIRILKPPFPPIPFFLAIYLPISNPEPQINITTADTTITTNPPTT ATATTRHFHKTHNDDQLLNSTYLFNTRACHLHISSNPRSIY" BASE COUNT 284 a 282 c 122 g 259 t ORIGIN 1 aattgagtat ctggcaagag taagattaag cagtaatttg ttccaaagaa gaatcttcta 61 ccaaggagca actttaaaga atgaaattaa ctttcttctt gggcctgttg gctcttattt 121 catgtttcac acccagtgag agtcaaagat tctccagaag accatatcta cctggccagc 181 tgccaccacc tccactctac aggccaagat gggttccacc aagtccccca cctccctatg 241 actcaagact taattcacca ctttctcttc cctttgtccc agggcgagtt ccaccatctt 301 ctttctctcg atttagccaa gcagtcattc tatctcaact ctttccattg gaatctatta 361 gacaacctcg actctttccg ggttatccaa acctacattt cccactaaga ccttactatg 421 taggacctat taggatatta aaacccccat ttcctcctat tccttttttt cttgctattt 481 accttcctat ctctaaccct gagccccaaa taaacatcac caccgcagat acaacaatca 541 ccacaaatcc ccccaccact gcaacagcaa ccaccaggca cttccacaaa acccacaatg 601 acgatcagct cctcaacagt acctatctct tcaacaccag agcctgccac ctccatatca 661 gcagcaaccc ccgcagcatc tactgaaaat actactcaaa ttctcgccaa ccgtcctcac 721 acagtattgc tcaatgccac tgtccaagtt acgacttcca accaaactat attaagcagc 781 ccagccttta aaagtttttg gcaaaaactc tttgccattt ttggttgaac atgcaataaa 841 tgatattttc caaactgctc tgatatctta gaagaaataa actgcaatga ttttgatgga 901 accaaccctg atctaaccag cacactaaat aaagtatttg agcaata // LOCUS S83308 1473 bp mRNA PRI 12-MAR-1997 DEFINITION SOX5=Sry-related HMG box gene {alternatively spliced} [human, testis, mRNA, 1473 nt]. ACCESSION S83308 NID g1881851 KEYWORDS . SOURCE human testis. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1473) AUTHORS Wunderle,V.M., Critcher,R., Ashworth,A. and Goodfellow,P.N. TITLE Cloning and characterization of SOX5, a new member of the human SOX gene family JOURNAL Genomics 36 (2), 354-358 (1996) MEDLINE 96411696 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 179928] from the original journal article. This sequence comes from Fig. 2. Map location: 12p12.1. FEATURES Location/Qualifiers source 1..1473 /organism="Homo sapiens" /db_xref="taxon:9606" gene 85..1128 /note="Sry-related HMG box gene" /gene="SOX5" CDS 85..1128 /gene="SOX5" /note="Sry-related HMG box gene; This sequence comes from Fig. 2" /codon_start=1 /db_xref="PID:g1881852" /translation="MPALRINSGAGPLKASVPAALASPSARVSTIGYLNDHDAVTKAI QEARQMKEQLRREQQVLDGKVAVVNSLGLNNCRTEKEKTTLESLTQQLAVKQNQEGKF SHAMMDFNLSGDSDGSAGVSESRIYRESRGRGSNEPHIKRPMNAFMVWAKDERRKILQ AFPDMHNSNISKILGSRWKAMTNLEKQPYYEEQARLSKQHLEKYPDYKYKPRPKRTCL VDGKKLRIGEYKAIMRNRRQEMRQYFNVGQQAQIPIATAGVVYPGAIAMAGMPSPHLP SEHSSVSSSPEPGMPVIQSTYGVKGEEPHIKEEIQAEDINGEIYDEYDEEEDDPDVDY GSDSENHIAGQAN" BASE COUNT 479 a 338 c 344 g 312 t ORIGIN 1 gatgaagtgg cacagccact gaacctatca gctaaaccca agacctctga tggcaaatca 61 cccacatcac ccacctctcc ccatatgcca gctctgagaa taaacagtgg ggcaggcccc 121 ctcaaagcct ctgtcccagc agcgttagct agtccttcag ccagagttag cacaataggt 181 tacttaaatg accatgatgc tgtcaccaag gcaatccaag aagctcggca aatgaaggag 241 caactccgac gggaacaaca ggtgcttgat gggaaggtgg ctgttgtgaa tagtctgggt 301 ctcaataact gccgaacaga aaaggaaaaa acaacactgg agagtctgac tcagcaactg 361 gcagttaaac agaatcaaga aggaaaattt agccatgcaa tgatggattt caatctgagt 421 ggagattctg atggaagtgc tggagtctca gagtcaagaa tttataggga atcccgaggg 481 cgtggtagca atgaacccca cataaagcgt ccaatgaatg ccttcatggt gtgggctaaa 541 gatgaacgga gaaagatcct tcaagccttt cctgacatgc acaactccaa catcagcaag 601 atattgggat ctcgctggaa agctatgaca aacctagaga aacagccata ttatgaggag 661 caagcccgtc tcagcaagca gcacctggag aagtaccctg actataagta caagcccagg 721 ccaaagcgca cctgcctggt ggatggcaaa aagctgcgca ttggtgaata caaggcaatc 781 atgcgcaaca ggcggcagga aatgcggcag tacttcaatg ttgggcaaca agcacagatc 841 cccattgcca ctgctggtgt tgtgtaccct ggagccatcg ccatggctgg gatgccctcc 901 cctcacctgc cctcggagca ctcaagcgtg tctagcagcc cagagcctgg gatgcctgtt 961 atccagagca cttacggtgt gaaaggagag gagccacata tcaaagaaga gatacaggcc 1021 gaggacatca atggagaaat ttatgatgag tacgacgagg aagaggatga tccagatgta 1081 gattatggga gtgacagtga aaaccatatt gcaggacaag ccaactgata agggtcaaaa 1141 gattgttgtg accttaggac ttaaagaagc cctaactggt tcatccttac cagtggccaa 1201 gcacattaac tttctcatac actgactgtt actttaactg ttagtcttaa atagttggga 1261 catcagctga ctaatagacc tcagcctcaa aaggcttgga aagaaaaaac aaatacaaca 1321 agcaaacaac aatatcaaca acaagagatt gaaataagct atgggtaaaa taatgccagt 1381 aattcagctg ctacatccaa gcactgaagt cttacccgtc aacttttttt ttttttaaat 1441 aaactttatg gctgtttgtt ctacaaaaaa aaa // LOCUS S83366 3414 bp DNA PRI 27-MAR-1997 DEFINITION region centromeric to t(12;17) brakepoint: orf1/unknown 43 amino acid transcript...orf3/unknown 50 amino acid transcript [human, testis, acampomelic campomelic dysplasia and sex reversal patient, Genomic, 3 genes, 3414 nt]. ACCESSION S83366 NID g1911580 KEYWORDS . SOURCE human testis acampomelic campomelic dysplasia and sex reversal patient. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3414) AUTHORS Ninomiya,S., Isomura,M., Narahara,K., Seino,Y. and Nakamura,Y. TITLE Isolation of a testis-specific cDNA on chromosome 17q from a region adjacent to the breakpoint of t(12;17) observed in a patient with acampomelic campomelic dysplasia and sex reversal JOURNAL Hum. Mol. Genet. 5 (1), 69-72 (1996) MEDLINE 96381428 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 179156] from the original journal article. This sequence comes from Fig. 4A. Map location: 17q. COMMENT Region: region centromeric to t(12;17) brakepoint. FEATURES Location/Qualifiers source 1..3414 /organism="Homo sapiens" /db_xref="taxon:9606" gene 886..1017 /note="orf1 is centromeric to SOX9" /gene="orf1/unknown 43 amino acid transcript" CDS 886..1017 /gene="orf1/unknown 43 amino acid transcript" /codon_start=1 /translation="MRNLEMRRLSKTISGFLTYRNCVMINVCYLKPQSFWGDLFYSN" gene 2409..2549 /note="orf2 is centromeric to SOX9" /gene="orf2/unknown 46 amino acid transcript" CDS 2409..2549 /gene="orf2/unknown 46 amino acid transcript" /codon_start=1 /translation="MFSNILSFYPSDVSTPPTDHHLLHHHHQICLQTFPSVPWEAKTA PG" gene 2443..2595 /note="orf3 is centromeric to SOX9" /gene="orf3/unknown 50 amino acid transcript" CDS 2443..2595 /gene="orf3/unknown 50 amino acid transcript" /codon_start=1 /translation="MLAPPPRTTIFSIITIKYVSRHFQVSPGRLKLPLVENHCSSTWH CSINIR" BASE COUNT 1023 a 678 c 706 g 1007 t ORIGIN 1 cgcatcccag gaagaaggag ttggagctcc gtgatgccat tcaaaaacag atgagatggg 61 gacagattta acccaaagga catagttcaa ctcctggtct tgacagagga gaagccctca 121 aagatggagt ctcactctgt tgccaggcta gaatacaatg gcacgaactt ggctcactgc 181 aacctccgcc tcctgggttc aagagattct cctgcctcag agtcctgagt tgctgggact 241 acgggcacac accatcatgc ccagctaatt tttgtatttt tagtagagac agggtttcac 301 catgttgacc aggatggtct cgatctcttg acctcgtgat ctgcccacgt cggcctccca 361 aagtcctggg attacaggca tgagccagtg cgcctagcaa acgacagctt tttgagcctc 421 tgtttaacaa tctcttaaac cttataggta ggcagttctg agacagctcc taatcatctc 481 ccctcctggt attcaagccc tgatggaata ctctcttctt caatgtgagc tggacttagt 541 ggttttcttc taatccattg aatacacaca agtgatgagc tgtcatttcc aaaattagtt 601 cacacaaagc tgtaactttt gttttgttca cagcctctct caattgcctt ctcagcttgt 661 atgctttgaa gaagcaagct gcagtgttgg agaggcccat gtggcaagga aatgagggtg 721 gcctccagtc aatggctggt gagaaactga gaccttcctt ccaacaattg tcaagtaact 781 gaatactccc aacaatcata tgggttaggt tagaagtgaa cctttctcta gttgaggttt 841 cagataagac cacagatatt tatctaacac cttgattgca ggcttatgag aaaccttgaa 901 atgagaagac taagtaaaac catatctgga ttcctgactt acagaaactg tgtgatgata 961 aatgtgtgtt atcttaagcc acaaagtttt tggggtgatt tgttctacag caattgataa 1021 tgaatacttc aacctatagt gacaactgga tgtctattta ctcattttcc ccaccaaagt 1081 gtaaagttcc tacaaggcag aggttacaac atattcatcc tgctacctct catagcccat 1141 aatatacaac ctggcccata aaacatgatt aatcaatgtt tgttaaatta acatgtattt 1201 tgagtttcat tgaaatcagt tagttgaagt gataataaca tttaaatgtg catgagcagt 1261 aggacaaggc agatagctgg ttaggataga aagtttttga aaattgtgag gggaaaaatt 1321 atttctattt gtaccctgtt gtacccagct taggtttcaa aaactttcag ggggttttta 1381 ttcatttacc aaagacttca aagttggtgc ctccccacca ctctaaagag gtctttgccc 1441 ttattgaccc aaccatactg atttgcctgg tgcacagtgg gagctattat aaggagaggg 1501 gagaagaagg aaatagaaac aactgtgaaa aaggttgaaa aaaatagcga aaaggataga 1561 aaaagagcag aggaaagaga agaagtagtc aacatccaac ccagatgcag agaaatgatt 1621 caaaagaaaa agtctgattt tcctgtcgga aatattttta aaaggaggta ggcgtttatc 1681 aaaaaggaaa aaaaaaaaaa aagaataaca gagtttgtct tccaaaggac ccaatactct 1741 tggaattcaa tgacctaatt ctgtgtatgg ttgcagtgtg gagattgcct cttgtctgat 1801 ctctacggca ttgagacctc tgatttctca acagcacatg ccacagcact atcagaagtc 1861 aggaataaag aacagagggt gtggggggca gtctagatat tatgctcagc aaacatattt 1921 ttaaatcagt gtaaattcta tcttgctggt tgcttatctt agcagttaaa tcaacttttc 1981 gttacttttg tagacattat ataatatctt ctccccgcag ctccagccac cagctgttag 2041 tcagcagaat gggagaggga ctgggtcagc ttaagctgaa taacttccat ggtgacattt 2101 gagggaacca attcccagcc actgaacaaa cctaaatgtt ctgctatggc catagaacca 2161 aatagatctt aacaaagaaa cctgagagca ttcaaaggtc aaaacaattg gagctagttc 2221 cttcaagctt aaatatgctc atctgtaaag tggggacaat aactatatat cttatagggt 2281 tattttgaag gctaaatatg tgtaaaagcc tcagtctact gaaaggttgc tcaactgtaa 2341 cactcttttc attttgtgag ggatcattct ttgttggagg ttagggacac tgtctcgtgc 2401 attacaggat gtttagcaac atcctcagct tctatccatc agatgttagc acccccccca 2461 cggaccacca tcttctccat catcaccatc aaatatgtct ccagacattt ccaagtgtcc 2521 cctgggaggc taaaactgcc cctggttgag aaccactgtt ctagtacttg gcattgttcc 2581 ataaacatca gatgatattt ttatcattat aattattact aaaggcaaca tcaccctgca 2641 atggagacag gtcttggtct cacggaatcc tgcaatagtt gctgcatttc tgcatctttc 2701 tgagagtaag ggttgaaaag agataaggtc aattcatctg acaacaatca atgggaaacc 2761 tactgtctcc gtagaaattc ttttcatgga aagaaaaggg gctaccattt tgatctgctg 2821 caggaactag gtctattcct cagcctcaca gtttgcaaaa cgctacagac ttttccatgg 2881 gcataaattc tccagcttta gggattgctg gtatagttga ctcttgaagc tgctcttgga 2941 tttccaaggg tagcaccatt ttttacctga gaaaagaagg aaaaagatat tcaagaagga 3001 gaacttttca acagaagtta ctccatttgc tgtttttgtg gttttgtgct tgtttaggtt 3061 tgcaagtgtg ttttctgccc aaattgttct tgcatattct aacagtgttt gctttacgtg 3121 ttttgatgcc acgctgtttt gaaggtacaa atgtttttca tggcatgtct tgtgttaagt 3181 gtacaaactg agacgtgaat gtcagtgact gaatagagta cagtattcct caaagtttgt 3241 aactaaataa accctaagag aaaagtgaag cgagatttct ggaggtctgg aagtggccag 3301 ctgacaatat acatctctgt atctcctatt tttatcttta tgctaaatgc aaattttact 3361 ttctgtgcta aaataatgtt tcattagttt caaggtaaaa atgtgtaaat agac // LOCUS S83374 129 bp mRNA PRI 27-MAR-1997 DEFINITION glutamate transporter II variant B/HBGT IIB {5' region} [human, brain and spinal cord, mRNA Partial Mutant, 129 nt]. ACCESSION S83374 NID g1911635 KEYWORDS . SOURCE human brain and spinal cord. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 129) AUTHORS Meyer,T., Speer,A., Meyer,B., Sitte,W., Kuther,G. and Ludolph,A.C. TITLE The glial glutamate transporter complementary DNA in patients with amyotrophic lateral sclerosis JOURNAL Ann. Neurol. 40 (3), 456-459 (1996) MEDLINE 96390551 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 180121] from the original journal article. This sequence comes from Fig. 2. FEATURES Location/Qualifiers source 1..129 /organism="Homo sapiens" /db_xref="taxon:9606" gene 27..56 /partial /gene="glutamate transporter II variant B/HBGT IIB" CDS 27..56 /partial /gene="glutamate transporter II variant B/HBGT IIB" /codon_start=1 /translation="MGWVCLPTG" BASE COUNT 23 a 30 c 44 g 32 t ORIGIN 1 acctgggacc ctccagacgt gggaggatgg ggtgggtgtg cctgcctact ggttgacggt 61 caggtgggct gattggttct ccctgctaca gtggtagaat tcagcgctgt acctagtgcc 121 aacaatatg // LOCUS S85655 1043 bp mRNA PRI 10-JUL-1992 DEFINITION prohibitin [human, mRNA, 1043 nt]. ACCESSION S85655 NID g246482 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1043) AUTHORS Sato,T., Saito,H., Swensen,J., Olifant,A., Wood,C., Danner,D., Sakamoto,T., Takita,K., Kasumi,F., Miki,Y. et,al. TITLE The human prohibitin gene located on chromosome 17q21 is mutated in sporadic breast cancer JOURNAL Cancer Res. 52 (6), 1643-1646 (1992) MEDLINE 92174193 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 85655] from the original journal article. This sequence comes from Fig.1a. Map location: chromosome 17q12-21. FEATURES Location/Qualifiers source 1..1043 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..1043 /gene="prohibitin" CDS 51..869 /gene="prohibitin" /note="This sequence comes from Fig.1a." /codon_start=1 /db_xref="PID:g246483" /translation="MAAKVFESIGKFGLALAVAGGVVNSALYNVDAGHRAVIFDRFRG VQDIVVGEGTHFLIPWVQKPIIFDCRSRPRNVPVITGSKDLQNVNITLRILFRPVASQ LPRIFTSIGEDYDERVLPSITTEILKSVVARFDAGELITQRELVSRQVSDDLTERAAT FGLILDDVSLTHLTFGKEFTEAVEAKQVAQQEAERARFVVEKAEQQKKAAIISAEGDS KAAELIANSLATAGDGLIELRKLEAAEDIAYQLSRSRNITYLPAGQSVLLQLPQ" BASE COUNT 261 a 273 c 288 g 221 t ORIGIN 1 tgtggaggtc agagtggaag caggtgtgag agggtccagc agaaggaaac atggctgcca 61 aagtgtttga gtccattggc aagtttggcc tggccttagc tgttgcagga ggcgtggtga 121 actctgcctt atataatgtg gatgctgggc acagagctgt catctttgac cgattccgtg 181 gagtgcagga cattgtggta ggggaaggga ctcattttct catcccgtgg gtacagaaac 241 caattatctt tgactgccgt tctcgaccac gtaatgtgcc agtcatcact ggtagcaaag 301 atttacagaa tgtcaacatc acactgcgca tcctcttccg gcctgtcgcc agccagcttc 361 ctcgcatctt caccagcatc ggagaggact atgatgagcg tgtgctgccg tccatcacaa 421 ctgagatcct caagtcagtg gtggctcgct ttgatgctgg agaactaatc acccagagag 481 agctggtctc caggcaggtg agcgacgacc ttacagagcg agccgccacc tttgggctca 541 tcctggatga cgtgtccttg acacatctga ccttcgggaa ggagttcaca gaagcggtgg 601 aagccaaaca ggtggctcag caggaagcag agagggccag atttgtggtg gaaaaggctg 661 agcaacagaa aaaggcggcc atcatctctg ctgagggcga ctccaaggca gctgagctga 721 ttgccaactc actggccact gcaggggatg gcctgatcga gctgcgcaag ctggaagctg 781 cagaggacat cgcgtaccag ctctcacgct ctcggaacat cacctacctg ccagcggggc 841 agtccgtgct cctccagctg ccccagtgag ggcccaccct gcctgcacct ccgcgggctg 901 actgggccac agccccgatg attcttaaca cagccttcct tctgctccca ccccagaaat 961 cactgtgaaa tttcatgatt ggcttaaagt gaaggaaata aaggtaaaat cacttcagat 1021 ctctaaaaaa aaaaaaaaaa aaa // LOCUS S87759 2346 bp mRNA PRI 10-AUG-1992 DEFINITION protein phosphatase 2C alpha [human, teratocarcinoma, mRNA, 2346 nt]. ACCESSION S87759 NID g247168 KEYWORDS . SOURCE human teratocarcinoma. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2346) AUTHORS Mann,D.J., Campbell,D.G., McGowan,C.H. and Cohen,P.T. TITLE Mammalian protein serine/threonine phosphatase 2C: cDNA cloning and comparative analysis of amino acid sequences JOURNAL Biochim. Biophys. Acta 1130 (1), 100-104 (1992) MEDLINE 92182001 REMARK GenBank staff at the National Library of Medicine created this entry [NCBI gibbsq 87759] from the original journal article. This sequence comes from fig 2. FEATURES Location/Qualifiers source 1..2346 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..2346 /gene="protein phosphatase 2C alpha, PP2Calpha" CDS 358..1506 /gene="protein phosphatase 2C alpha, PP2Calpha" /note="This sequence comes from fig 2; PP2Calpha" /codon_start=1 /product="protein phosphatase 2C alpha" /db_xref="PID:g247169" /translation="MGAFLDKPKMEKHNAQGQGNGLRYGLSSMQGWRVEMEDAHTAVI GLPSGLESWSFFAVYDGHAGSQVAKYCCEHLLDHITNNQDFKGSAGAPSVENVKNGIR TGFLEIDEHMRVMSEKKHGADRSGSTAVGVLISPQHTYFINCGDSRGLLCRNRKVHFF TQDHKPSNPLEKERIQNAGGSVMIQRVNGSLAVSRALGDFDYKCVHGKGPTEQLVSPE PEVHDIERSEEDDQFIILACDGIWDVMGNEELCDFVRSRLEVTDDLEKVCNEVVDTCL YKGSRDNMSVILICFPNAPKVSPEAVKKEAELDKYLECRVEEIIKKQGEGVPDLVHVM RTLASENIPSLPPGGELASKRNVIEAVYNRLNPYKNDDTDSTSTDDMW" BASE COUNT 628 a 501 c 580 g 637 t ORIGIN 1 cggaacgtgg ttggggaggg gggggtgggg gggactctag acagctgagg cgcgaaagca 61 tgagtcctcg gctcttcctc ctccttctcc gggacccgct ctctgcctcc ctctccaacg 121 cccggatgat ctgagccgcg agggcgccga cagccggggg cccggacgca gcccggctcc 181 tcccctcctc cgccccttcc ccagcctgac ctggcccgcc gctgcagcgg tgacccctcc 241 cccggctgcc gccgtcgccg ccgcggtgac cccctccccg gctgccgccg ccgccgcctc 301 ggccgaccag ggacctgccc gcctgcggct gctccggacc tagaggatca agacataatg 361 ggagcatttt tagacaagcc aaagatggaa aagcataatg cccaggggca gggtaatggg 421 ttgcgatatg ggctaagcag catgcaaggc tggcgtgttg aaatggagga tgcacatacg 481 gctgtgatcg gtttgccaag tggacttgaa tcgtggtcat tctttgctgt gtatgatggg 541 catgctggtt ctcaggttgc caaatactgc tgtgagcatt tgttagatca catcaccaat 601 aaccaggatt ttaaagggtc tgcaggagca ccttctgtgg aaaatgtaaa gaatggaatc 661 agaacaggtt ttctggagat tgatgaacac atgagagtta tgtcagagaa gaaacatggt 721 gcagatagaa gtgggtcaac agctgtaggt gtcttaattt ctccccaaca tacttatttc 781 attaactgtg gagactcaag aggtttactt tgtaggaaca ggaaagttca tttcttcaca 841 caagatcaca aaccaagtaa tccgctggag aaagaacgaa ttcagaatgc aggtggctct 901 gtaatgattc agcgtgtgaa tggctctctg gctgtatcga gggcccttgg ggattttgat 961 tacaaatgtg tccatggaaa aggtcctact gagcagcttg tctcaccaga gcctgaagtc 1021 catgatattg aaagatctga agaagatgat cagttcatta tccttgcatg tgatggtatc 1081 tgggatgtta tgggaaatga agagctctgt gattttgtaa gatccagact tgaagtcact 1141 gatgaccttg agaaagtttg caatgaagta gtcgacacct gtttgtataa gggaagtcga 1201 gacaacatga gtgtgatttt gatctgtttt ccaaatgcac ccaaagtatc gccagaagca 1261 gtgaagaagg aggcagagtt ggacaagtac ctggaatgca gagtagaaga aatcataaag 1321 aagcaggggg aaggcgtccc cgacttagtc catgtgatgc gcacattagc gagtgagaac 1381 atccccagcc tcccaccagg gggtgaattg gcaagcaaga ggaatgttat tgaagccgtt 1441 tacaatagac tgaatcctta caaaaatgac gacactgact ctacatcaac agatgatatg 1501 tggtaaaact gctcatctag ccatggagtt taccttcacc tccaaaggag agtacagctc 1561 aactttgttg aaacttttaa catccatcct caactttaag gaaggggata tgacatgggt 1621 gagaatgatt acatcagaga acttcagcag tacaacagct agcccagaac tgattttttt 1681 tttttttttt tttgtaaatt tgagacttat gtaagcgtga tttcaaacca taattcgtgt 1741 tgtaaatcag actccagcaa tttttgttgt atgattttgt ttttttgtaa agtgtaattg 1801 tccttgtaca aaatgctcat atttaattat gaactgcttt aaatcactat caaagttaca 1861 agaaatgttt ggcttattgt gtgatgcaac agatatatag ccctttcaag tcatgttgtg 1921 tttggacttg gggttggaac agggagagca gcagccatgt cagctacacg ctcaaatgtg 1981 cagatgatta tggaaaataa cctcaaaatc ttacaaagct gaacatccaa ggagttattg 2041 aaaactatct taaatgttct tggtagggga gttggcattg ttgataaagc cagtcccttc 2101 atttaactgt ctttcaggat gttccttcgt tgtttccatg agtattgcag gtaataatac 2161 agtgtgttcc ataagaatct caatcttggg gctaaatgcc ttgtttcttt gcacctcttt 2221 tcaagtcctt acatttaatt actaattgat aagcagcagc ttcctacata tagtaggaaa 2281 ctgccacatt tttgctatca tgattggctg ggcctgctgc tgttcctagt aagatattct 2341 gaattc // LOCUS SSMPCP 1330 bp RNA PRI 01-JUN-1992 DEFINITION H.sapiens mRNA for mitochondrial phosphate carrier protein. ACCESSION X60036 NID g38261 KEYWORDS pcp gene; phosphate carrier. SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1330) AUTHORS Palmieri,F. TITLE Direct Submission JOURNAL Submitted (16-MAY-1991) F. Palmieri, Universita di Bari, Dipartimento Farmaco-Biologico, 70125 Bari, Italy REFERENCE 2 (bases 1 to 1330) AUTHORS Dolce,V., Fiermonte,G., Messina,A. and Palmieri,F. TITLE Nucleotide sequence of a human heart cDNA encoding the mitochondrial phosphate carrier JOURNAL DNA Seq. 2 (2), 133-135 (1991) MEDLINE 92135893 FEATURES Location/Qualifiers source 1..1330 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="heart" CDS 49..1134 /codon_start=1 /product="phosphate carrier protein" /db_xref="PID:g38262" /db_xref="SWISS-PROT:Q00325" /translation="MFSSVAHLARANPFNTPHLQLVHDGLGDLRSSSPGPTGQPRRPR NLAAAAVEEYSCEFGSAKYYALCGFGGVLSCGLTHTAVVPLDLVKCRMQVDPQKYKGI FNGFSVTLKEDGVRGLAKGWAPTFLGYSMQGLCKFGFYEVFKVLYSNMLGEENTYLWR TSLYLAASASAEFFADIALAPMEAAKVRIQTQPGYANTLRDAAPKMYKEEGLKAFYKG VAPLWMRQIPYTMMKFACFERTVEALYKFVVPKPRSECSKPEQLVVTFVAGYIAGVFC AIVSHPADSVVSVLNKEKGSSASLVLKRLGFKGVWKGLFARIIMIGTLTALQWFIYDS VKVYFRLPRPPPPEMPESLKKKLGLTQ" sig_peptide 49..195 mat_peptide 196..1131 /product="phosphate carrier protein" polyA_site 1295..1300 BASE COUNT 334 a 295 c 332 g 369 t ORIGIN 1 gcaacctttc caagggagtg gttgtgtgat cgccatctta gggaaaagat gttctcgtcc 61 gtggcgcacc tggcgcgggc gaaccccttc aacacgccac atctgcagct ggtgcacgat 121 ggtctcgggg acctccgcag cagctcccca gggcccacgg gccagccccg ccgccctcgc 181 aacctggcag ccgccgccgt ggaagagtac agttgtgaat ttggctccgc gaagtattat 241 gcactgtgtg gctttggtgg ggtcttaagt tgtggtctga cacacactgc tgtggttccc 301 ctggatttag tgaaatgccg tatgcaggtg gacccccaaa agtacaaggg catatttaac 361 ggattctcag ttacacttaa agaggatggt gttcgtggtt tggctaaagg atgggctccg 421 actttccttg gctactccat gcagggactc tgcaagtttg gcttttatga agtctttaaa 481 gtcttgtata gcaatatgct tggagaggag aatacttatc tctggcgcac atcactatat 541 ttggctgcct ctgccagtgc tgaattcttt gctgacattg ccctggctcc tatggaagct 601 gctaaggttc gaattcaaac ccagccaggt tatgccaaca ctttgaggga tgcagctccc 661 aaaatgtata aggaagaagg cctaaaagca ttctacaagg gggttgctcc tctctggatg 721 agacagatac catacaccat gatgaagttc gcctgctttg aacgtactgt tgaagcactg 781 tacaagtttg tggttcctaa gccccgcagt gaatgttcaa agccagagca gctggttgta 841 acatttgtag caggttacat agctggagtc ttttgtgcaa ttgtttctca ccctgctgat 901 tctgtggtat ctgtgttgaa taaagaaaaa ggtagcagtg cttctctggt cctcaagaga 961 cttggattta aaggtgtatg gaagggactg tttgcccgta tcatcatgat tggtaccctg 1021 actgcactac agtggtttat ctatgactcc gtgaaggtct acttcagact tcctcgccct 1081 cccccacctg agatgccaga gtctctgaag aaaaagcttg ggttaactca gtagttagat 1141 caaagcaaat gtggactgaa tctgcttgtt gatcagtgtt tgaagaaagt gcaaaaggaa 1201 cttttatata tttgacagtg taggaaattg tctattcctg atataattac tgtagtactc 1261 ttgcttaagg caagagtttc agatttactg ttgaaataaa cccaactgtt catgaaaaaa 1321 aaaaaaaaaa // LOCUS U00238 3608 bp mRNA PRI 30-MAR-1996 DEFINITION Homo sapiens glutamine PRPP amidotransferase (GPAT) mRNA complete cds. ACCESSION U00238 NID g404860 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3608) AUTHORS Brayton,K.A., Chen,Z., Zhou,G., Nagy,P.L., Gavalas,A., Trent,J.M., Deaven,L.L., Dixon,J.E. and Zalkin,H. TITLE Two genes for de novo purine nucleotide synthesis on human chromosome 4 are closely linked and divergently transcribed JOURNAL J. Biol. Chem. 269 (7), 5313-5321 (1994) MEDLINE 94148998 REFERENCE 2 (bases 1 to 3608) AUTHORS Brayton,K.A. TITLE Direct Submission JOURNAL Submitted (04-AUG-1993) Kelly A Brayton, Department of Biochemistry, Purdue University, West Lafayette, IN, 47907, USA FEATURES Location/Qualifiers source 1..3608 /organism="Homo sapiens" /db_xref="taxon:9606" /clone_lib="lambdaZap HepG2" /chromosome="4" mRNA 1..3595 /evidence=experimental exon 1..3595 /note="11 exons starting at: >1,179,246,453,556,712,785,937,1065,1287, and 1408 with no introns" misc_feature 51..83 /gene="GPAT" /note="propeptide" gene 51..1604 /gene="GPAT" CDS 51..1604 /gene="GPAT" /EC_number="2.4.2.14" /codon_start=1 /product="glutamine PRPP amidotransferase" /db_xref="PID:g404861" /translation="MELEELGIREECGVFGCIASGEWPTQLDVPHVITLGLVGLQHRG QESAGIVTSDGSSVPTFKSHKGMGLVNHVFTEDNLKKLYVSNLGIGHTRYATTGKCEL ENCQPFVVETLHGKIAVAHNGELVNAARLRKKLLRHGIGLSTSSDSEMITQLLAYTPP QEQDDTPDWVARIKNLMKEAPTAYSLLIMHRDVIYAVRDPYGNRPLCIGRLIPVSDIN DKEKKTSETEGWVVSSESCSFLSIGARYYREVLPGEIVEISRHNVQTLDIISRSEGNP VAFCIFEYVYFARPDSMFEDQMVYTVRYRCGQQLAIEAPVDADLVSTVPESATPAALA YAGKCGLPYVEVLCKNRYVGRTFIQPNMRLRQLGIAKKFGVLSDNFKGKRIVLVDDSI VRGNTISPIIKLLKESGAKEVHIRVASPPIKYPCFMGINIPTKEELIANKPEFDHLAE YLGANSVVYLSVEGLVSSVQEGIKFKKQKEKKHDIMIQENGNGLECFEKSGHCTACLT GKYPVELEW" BASE COUNT 1133 a 607 c 726 g 1142 t ORIGIN 1 ttacaccttg gccgcagcgg caggtccttc ctcgtgcttt cggtggcgac atggagctgg 61 aggagttggg gatccgagag gaatgtggcg tgttcgggtg catcgcctca ggagagtggc 121 ccacgcagct ggatgtaccg catgtgatca ctctgggact cgtggggctg cagcaccggg 181 gtcaggagag tgctggtatt gtgactagtg atgggagttc ggtgccaaca ttcaaatcac 241 acaagggaat gggtcttgta aatcacgtct ttactgaaga caatttgaaa aaattatatg 301 tttcaaatct tggaattgga cacaccaggt atgccaccac aggaaaatgt gaactagaaa 361 attgtcagcc cttcgttgtt gaaacacttc atgggaagat agctgtggca cataatggcg 421 aattggtaaa tgctgctcga ttaaggaaaa agcttctgcg tcatggtatt ggtctgtcta 481 caagttctga tagtgaaatg attacccagt tactggcgta tacccctcct caggaacaag 541 atgacacccc agactgggta gccaggatta aaaacttgat gaaggaagca cccacagcat 601 actccctgct tataatgcac agagatgtta tttatgcagt acgagatcct tatggaaatc 661 gtcccttatg cattggtcgt cttattccag tgtctgatat aaatgacaaa gagaaaaaaa 721 catcagaaac agaaggatgg gtggtgtctt cagaatcttg tagcttctta tctattggtg 781 caagatatta ccgtgaagtc ttgcctggag aaattgtgga aatatccaga cacaatgtcc 841 aaactcttga tattatatca aggtctgaag gaaacccagt ggctttttgt atctttgaat 901 atgtttattt tgcaagacca gacagtatgt tcgaagacca aatggtttat acagtaagat 961 accgttgtgg ccagcagcta gcgattgaag cacctgtgga tgcagatttg gttagcactg 1021 ttccagaatc tgctacgcct gctgctcttg cttacgcagg aaagtgtgga cttccatatg 1081 tggaggtgct gtgtaaaaac cggtatgtag ggagaacctt cattcagcca aacatgaggt 1141 taagacaact tggtattgca aaaaaatttg gagtattgtc agacaacttt aaaggcaaaa 1201 gaattgttct tgtagatgat tcaattgtca gaggcaatac catctcacct ataataaaac 1261 tgctcaaaga atctggtgca aaagaggtac acattcgagt agcttcacca ccaattaaat 1321 atccatgctt catgggaata aacattccta caaaagaaga gctcattgcc aataaaccag 1381 aatttgatca ccttgcagaa tatctaggag caaacagtgt tgtgtatctg tcagtagaag 1441 gactggtttc atctgtacaa gaagggataa agtttaaaaa acagaaagag aaaaagcacg 1501 atattatgat ccaagaaaat ggaaatggtc tggaatgttt tgaaaagagt ggtcattgta 1561 cagcttgtct cactggaaaa tatcctgtag aattagaatg gtagctggta gggttggatg 1621 tgtgtagttt caagatagaa agttggtcaa gaagttatag tggtcacacc tcatctattt 1681 actgttactc agttggtaca atgtaaaatg ccatgcttat gtttataagt tttgagattt 1741 tttttttttt ctgaaaagga taccaaagtg cgataactga acatttccaa ttgcatataa 1801 tacaacaata tgtggtgttc ttttttttac acaagcattg gctagccttt ttaacctggt 1861 cagagaaggc aggtggtcac tgacatttcc caagtccatg ctttaaaggg tttgcaagaa 1921 gttagggtta aggagaggtg atgccaacaa gacaggtgag ttaaatatac catttcacac 1981 aaagtttgaa tagaatacat tatacctcat aggtgtctag cctctacagt tctggctgta 2041 gttatgacct tggcttccct gtctaactgt agacaaatct ttaaaaaaaa aaaaaaaaaa 2101 tctggtgcct cagtttcccc acatgtgcaa tgggatactt attaaataat taataagaat 2161 gtgaataagt gtcatacttt tgtgatttga gccatcattt cacttctgat tttaagacaa 2221 ctcatgattg ttagctttca gaaagctaat gattgttaac tttttgaaat tagtttacaa 2281 ttaattaaga tttcattatg atggaaggag acataattgg cagatctttg ccatctctct 2341 ttgagatgtc ctaaaaaggg ttgtaaaaat ctgtgaaaaa gtttttccta catttgacta 2401 gaaaatgtga tccatagtat ttagtgccct gatactataa gctcagcaag taacctggta 2461 catttgaaat aaaaaccaaa tttttagatt caaacaatcc ctttatcctt aatttaatta 2521 attatcatat gcttttttta atgaagtgct tgatcacttg caaacatata tacatgtaga 2581 tgtacatata catgtacaca tacacataaa tattattgca attaagtgat caagtacaga 2641 cacaataggg gccagttttg tttaaggatc aaagagacaa ccactttggg gaattagtat 2701 caacttacaa tccaagtcca agtatcatct tataatcact tttttctact atattaagat 2761 ctaatgaatt tgatttcttt tttgaagttt tttcttgtaa catctgagat tagaagttta 2821 agatgacttg accccaaacc tttgtttatg taagaatttt taaacataaa agtgtttgtt 2881 tctgttatgt taccataatt tgatgtatat agtgtccaga tccatttaga aatttaatat 2941 ttattaataa ctgaaactgt ttgtcttcct ttggtatata gtctcgcata ttatattata 3001 gcaggccaag ataaaatttt gacagctctt taagcccaca tgcagcagtg ggtcagataa 3061 ccctgtggca gtgacacggg caaattggca tttgaataaa gccctgggac cacctcaaca 3121 tgcgtagcct cttgtcttaa atgtactccc catggcagca tggaggaggc aagacctgtg 3181 ggtcaatttt gaactggcct tactttgatt tttaaaacaa gagactcagg gaaagtacta 3241 aaccaaaatc tctgatttta ctttgcgttt tctgtagttt ttgttttact gagatgcttt 3301 tgtaaaggaa aataatactg tgacagttta gtaattctac agattcttaa tatttctcca 3361 tcatggcctt ttacttcaca attttctgaa gtctgaattc aattacaatt tttttttttt 3421 accaatttaa tctcaaatgt tgtttaactg ctttaaattc atatacgtag agtattataa 3481 actgcagaga tgaaaaatgt gttttcacgg gatttatatt gtgaactaaa ctaagcctac 3541 tttttgtgac ttatttgtga tgccttgttg ataaatatgt gtaataagta tgtttaaaaa 3601 aaaaaaaa // LOCUS U00672 3632 bp mRNA PRI 05-MAY-1994 DEFINITION Human interleukin-10 receptor mRNA, complete cds. ACCESSION U00672 NID g482802 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3632) AUTHORS Liu,Y., Wei,S.H., Ho,A.S., de Waal Malefyt,R. and Moore,K.W. TITLE Expression cloning and characterization of a human IL-10 receptor JOURNAL J. Immunol. 152, 1821-1829 (1994) MEDLINE 94165477 REFERENCE 2 (bases 1 to 3632) AUTHORS Moore,K.W. TITLE Direct Submission JOURNAL Submitted (10-AUG-1993) Kevin W. Moore, Immunology, DNAX Research Institute, 901 California Ave., Palo Alto, CA 94304, USA FEATURES Location/Qualifiers source 1..3632 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="pSW8.1" /clone_lib="BJAB x pJFE14" /chromosome="11" /cell_line="BJAB" /cell_type="B lymphocyte" sig_peptide 62..124 CDS 62..1798 /codon_start=1 /product="interleukin-10 receptor" /db_xref="PID:g482803" /translation="MLPCLVVLLAALLSLRLGSDAHGTELPSPPSVWFEAEFFHHILH WTPIPNQSESTCYEVALLRYGIESWNSISNCSQTLSYDLTAVTLDLYHSNGYRARVRA VDGSRHSNWTVTNTRFSVDEVTLTVGSVNLEIHNGFILGKIQLPRPKMAPANDTYESI FSHFREYEIAIRKVPGNFTFTHKKVKHENFSLLTSGEVGEFCVQVKPSVASRSNKGMW SKEECISLTRQYFTVTNVIIFFAFVLLLSGALAYCLALQLYVRRRKKLPSVLLFKKPS PFIFISQRPSPETQDTIHPLDEEAFLKVSPELKNLDLHGSTDSGFGSTKPSLQTEEPQ FLLPDPHPQADRTLGNGEPPVLGDSCSSGSSNSTDSGICLQEPSLSPSTGPTWEQQVG SNSRGQDDSGIDLVQNSEGRAGDTQGGSALGHHSPPEPEVPGEEDPAAVAFQGYLRQT RCAEEKATKTGCLEEESPLTDGLGPKFGRCLVDEAGLHPPALAKGYLKQDPLEMTLAS SGAPTGQWNQPTEEWSLLALSSCSDLGISDWSFAHDLAPLGCVAAPGGLLGSFNSDLV TLPLISSLQSSE" mat_peptide 125..1795 polyA_signal 3613..3618 BASE COUNT 804 a 1018 c 1003 g 807 t ORIGIN 1 aaagagctgg aggcgcgcag gccggctccg ctccggcccc ggacgatgcg gcgcgcccag 61 gatgctgccg tgcctcgtag tgctgctggc ggcgctcctc agcctccgtc ttggctcaga 121 cgctcatggg acagagctgc ccagccctcc gtctgtgtgg tttgaagcag aatttttcca 181 ccacatcctc cactggacac ccatcccaaa tcagtctgaa agtacctgct atgaagtggc 241 gctcctgagg tatggaatag agtcctggaa ctccatctcc aactgtagcc agaccctgtc 301 ctatgacctt accgcagtga ccttggacct gtaccacagc aatggctacc gggccagagt 361 gcgggctgtg gacggcagcc ggcactccaa ctggaccgtc accaacaccc gcttctctgt 421 ggatgaagtg actctgacag ttggcagtgt gaacctagag atccacaatg gcttcatcct 481 cgggaagatt cagctaccca ggcccaagat ggcccccgcg aatgacacat atgaaagcat 541 cttcagtcac ttccgagagt atgagattgc cattcgcaag gtgccgggaa acttcacgtt 601 cacacacaag aaagtaaaac atgaaaactt cagcctccta acctctggag aagtgggaga 661 gttctgtgtc caggtgaaac catctgtcgc ttcccgaagt aacaagggga tgtggtctaa 721 agaggagtgc atctccctca ccaggcagta tttcaccgtg accaacgtca tcatcttctt 781 tgcctttgtc ctgctgctct ccggagccct cgcctactgc ctggccctcc agctgtatgt 841 gcggcgccga aagaagctac ccagtgtcct gctcttcaag aagcccagcc ccttcatctt 901 catcagccag cgtccctccc cagagaccca agacaccatc cacccgcttg atgaggaggc 961 ctttttgaag gtgtccccag agctgaagaa cttggacctg cacggcagca cagacagtgg 1021 ctttggcagc accaagccat ccctgcagac tgaagagccc cagttcctcc tccctgaccc 1081 tcacccccag gctgacagaa cgctgggaaa cggggagccc cctgtgctgg gggacagctg 1141 cagtagtggc agcagcaata gcacagacag cgggatctgc ctgcaggagc ccagcctgag 1201 ccccagcaca gggcccacct gggagcaaca ggtggggagc aacagcaggg gccaggatga 1261 cagtggcatt gacttagttc aaaactctga gggccgggct ggggacacac agggtggctc 1321 ggccttgggc caccacagtc ccccggagcc tgaggtgcct ggggaagaag acccagctgc 1381 tgtggcattc cagggttacc tgaggcagac cagatgtgct gaagagaagg caaccaagac 1441 aggctgcctg gaggaagaat cgcccttgac agatggcctt ggccccaaat tcgggagatg 1501 cctggttgat gaggcaggct tgcatccacc agccctggcc aagggctatt tgaaacagga 1561 tcctctagaa atgactctgg cttcctcagg ggccccaacg ggacagtgga accagcccac 1621 tgaggaatgg tcactcctgg ccttgagcag ctgcagtgac ctgggaatat ctgactggag 1681 ctttgcccat gaccttgccc ctctaggctg tgtggcagcc ccaggtggtc tcctgggcag 1741 ctttaactca gacctggtca ccctgcccct catctctagc ctgcagtcaa gtgagtgact 1801 cgggctgaga ggctgctttt gattttagcc atgcctgctc ctctgcctgg accaggagga 1861 gggccctggg gcagaagtta ggcacgaggc agtctgggca cttttctgca agtccactgg 1921 ggctggccca gccaggctgc agggctggtc agggtgtctg gggcaggagg aggccaactc 1981 actgaactag tgcagggtat gtgggtggca ctgacctgtt ctgttgactg gggccctgca 2041 gactctggca gagctgagaa gggcagggac cttctccctc ctaggaactc tttcctgtat 2101 cataaaggat tatttgctca ggggaaccat ggggctttct ggagttgtgg tgaggccacc 2161 aggctgaagt cagctcagac ccagacctcc ctgcttaggc cactcgagca tcagagcttc 2221 cagcaggagg aagggctgta ggaatggaag cttcagggcc ttgctgctgg ggtcattttt 2281 aggggaaaaa ggaggatatg atggtcacat ggggaacctc ccctcatcgg gcctctgggg 2341 caggaagctt gtcactggaa gatcttaagg tatatatttt ctggacactc aaacacatca 2401 taatggattc actgagggga gacaaaggga gccgagaccc tggatggggc ttccagctca 2461 gaacccatcc ctctggtggg tacctctggc acccatctgc aaatatctcc ctctctccaa 2521 caaatggagt agcatccccc tggggcactt gctgaggcca agccactcac atcctcactt 2581 tgctgcccca ccatcttgct gacaacttcc agagaagcca tggttttttg tattggtcat 2641 aactcagccc tttgggcggc ctctgggctt gggcaccagc tcatgccagc cccagagggt 2701 cagggttgga ggcctgtgct tgtgtttgct gctaatgtcc agctacagac ccagaggata 2761 agccactggg cactgggctg gggtccctgc cttgttggtg ttcagctgtg tgattttgga 2821 ctagccactt gtcagagggc ctcaatctcc catctgtgaa ataaggactc cacctttagg 2881 ggaccctcca tgtttgctgg gtattagcca agctggtcct gggagaatgc agatactgtc 2941 cgtggactac caagctggct tgtttcttat gccagaggct aacagatcca atgggagtcc 3001 atggtgtcat gccaagacag tatcagacac agccccagaa gggggcatta tgggccctgc 3061 ctccccatag gccatttgga ctctgccttc aaacaaaggc agttcagtcc acaggcatgg 3121 aagctgtgag gggacaggcc tgtgcgtgcc atccagagtc atctcagccc tgcctttctc 3181 tggagcattc tgaaaacaga tattctggcc cagggaatcc agccatgacc cccacccctc 3241 tgccaaagta ctcttaggtg ccagtctggt aactgaactc cctctggagg caggcttgag 3301 ggaggattcc tcagggttcc cttgaaagct ttatttattt attttgttca tttatttatt 3361 ggagaggcag cattgcacag tgaaagaatt ctggatatct caggagcccc gaaattctag 3421 ctctgacttt gctgtttcca gtggtatgac cttggagaag tcacttatcc tcttggagcc 3481 tcagtttcct catctgcaga ataatgactg acttgtctaa ttcataggga tgtgaggttc 3541 tgctgaggaa atgggtatga atgtgccttg aacacaaagc tctgtcaata agtgatacat 3601 gttttttatt ccaataaatt gtcaagacca ca // LOCUS U00968 4154 bp mRNA PRI 25-OCT-1993 DEFINITION Human SREBP-1 mRNA, complete cds. ACCESSION U00968 NID g409404 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4154) AUTHORS Yokoyama,C., Wang,X., Briggs,M.R., Admon,A., Wu,J., Hua,X., Goldstein,J.L. and Brown,M.S. TITLE SREBP-1, a basic-helix-loop-helix-leucine zipper protein that controls transcription of the low density lipoprotein receptor gene JOURNAL Cell 75 (1), 187-197 (1993) MEDLINE 94006541 REFERENCE 2 (bases 1 to 4154) AUTHORS Goldstein,J. TITLE Direct Submission JOURNAL Submitted (20-AUG-1993) Joseph Goldstein, Molecular Genetics, University of Texas Southwestern Medical Center at Dallas, 5323 Harry Hines Blvd., Dallas, Texas, 75235, USA FEATURES Location/Qualifiers source 1..4154 /organism="Homo sapiens" /db_xref="taxon:9606" /cell_line="HeLa S3" CDS 167..3610 /codon_start=1 /product="SREBP-1" /db_xref="PID:g409405" /translation="MDEPPFSEAALEQALGEPCDLDAALLTDIEDMLQLINNQDSDFP GLFDPPYAGSGAGGTDPASPDTSSPGSLSPPPATLSSSLEAFLSGPQAAPSPLSPPQP APTPLKMYPSMPAFSPGPGIKEESVPLSILQTPTPQPLPGALLPQSFPAPAPPQFSST PVLGYPSPPGGFSTGSPPGNTQQPLPGLPLASPPGVPPVSLHTQVQSVVPQQLLTVTA APTAAPVTTTVTSQIQQVPVLLQPHFIKADSLLLTAMKTDGATVKAAGLSPLVSGTTV QTGPLPTLVSGGTILATVPLVVDAEKLPINRLAAGSKAPASAQSRGEKRTAHNAIEKR YRSSINDKIIELKDLVVGTEAKLNKSAVLRKAIDYIRFLQHSNQKLKQENLSLRTAVH KSKSLKDLVSACGSGGNTDVLMEGVKTEVEDTLTPPPSDAGSPFQSSPLSLGSRGSGS GGSGSDSEPDSPVFEDSKAKPEQRPSLHSRGMLDRSRLALCTLVFLCLSCNPLASLLG ARGLPSPSDTTSVYHSPGRNVLGTESRDGPGWAQWLLPPVVWLLNGLLVLVSLVLLFV YGEPVTRPHSGPAVYFWRHRKQADLDLARGDFAQAAQQLWLALRALGRPLPTSHLDLA CSLLWNLIRHLLQRLWVGRWLAGRAGGLQQDCALRVDASASARDAALVYHKLHQLHTM GKHTGGHLTATNLALSALNLAECAGDAVSVATLAEIYVAAALRVKTSLPRALHFLTRF FLSSARQACLAQSGSVPPAMQWLCHPVGHRFFVDGDWSVLSTPWESLYSLAGNPVDPL AQVTQLFREHLLERALNCVTQPNPSPGSADGDKEFSDALGYLQLLNSCSDAAGAPAYS FSISSSMATTTGVDPVAKWWASLTAVVIHWLRRDEEAAERLCPLVEHLPRVLQESERP LPRAALHSFKAARALLGCAKAESGPASLTICEKASGYLQDSLATTPASSSIDKAVQLF LCDLLLVVRTSLWRQQQPPAPAPAAQGTSSRPQASALELRGFQRDLSSLRRLAQSFRP AMRRVFLHEATARLMAGASPTRTHQLLDRSLRRRAGPGGKGGAVAELEPRPTRREHAE ALLLASCYLPPGFLSAPGQRVGMLAEAARTLEKLGDRRLLHDCQQMLMRLGGGTTVTS S" BASE COUNT 713 a 1406 c 1281 g 754 t ORIGIN 1 taacgaggaa cttttcgccg gcgccgggcc gcctctgagg ccagggcagg acacgaacgc 61 gcggagcggc ggcggcgact gagagccggg gccgcggcgg cgctccctag gaagggccgt 121 acgaggcggc gggcccggcg ggcctcccgg aggaggcggc tgcgccatgg acgagccacc 181 cttcagcgag gcggctttgg agcaggcgct gggcgagccg tgcgatctgg acgcggcgct 241 gctgaccgac atcgaagaca tgcttcagct tatcaacaac caagacagtg acttccctgg 301 cctatttgac ccaccctatg ctgggagtgg ggcagggggc acagaccctg ccagccccga 361 taccagctcc ccaggcagct tgtctccacc tcctgccaca ttgagctcct ctcttgaagc 421 cttcctgagc gggccgcagg cagcgccctc acccctgtcc cctccccagc ctgcacccac 481 tccattgaag atgtacccgt ccatgcccgc tttctcccct gggcctggta tcaaggaaga 541 gtcagtgcca ctgagcatcc tgcagacccc caccccacag cccctgccag gggccctcct 601 gccacagagc ttcccagccc cagccccacc gcagttcagc tccacccctg tgttaggcta 661 ccccagccct ccgggaggct tctctacagg aagccctccc gggaacaccc agcagccgct 721 gcctggcctg ccactggctt ccccgccagg ggtcccgccc gtctccttgc acacccaggt 781 ccagagtgtg gtcccccagc agctactgac agtcacagct gcccccacgg cagcccctgt 841 aacgaccact gtgacctcgc agatccagca ggtcccggtc ctgctgcagc cccacttcat 901 caaggcagac tcgctgcttc tgacagccat gaagacagac ggagccactg tgaaggcggc 961 aggtctcagt cccctggtct ctggcaccac tgtgcagaca gggcctttgc cgaccctggt 1021 gagtggcgga accatcttgg caacagtccc actggtcgta gatgcggaga agctgcctat 1081 caaccggctc gcagctggca gcaaggcccc ggcctctgcc cagagccgtg gagagaagcg 1141 cacagcccac aacgccattg agaagcgcta ccgctcctcc atcaatgaca aaatcattga 1201 gctcaaggat ctggtggtgg gcactgaggc aaagctgaat aaatctgctg tcttgcgcaa 1261 ggccatcgac tacattcgct ttctgcaaca cagcaaccag aaactcaagc aggagaacct 1321 aagtctgcgc actgctgtcc acaaaagcaa atctctgaag gatctggtgt cggcctgtgg 1381 cagtggaggg aacacagacg tgctcatgga gggcgtgaag actgaggtgg aggacacact 1441 gaccccaccc ccctcggatg ctggctcacc tttccagagc agccccttgt cccttggcag 1501 caggggcagt ggcagcggtg gcagtggcag tgactcggag cctgacagcc cagtctttga 1561 ggacagcaag gcaaagccag agcagcggcc gtctctgcac agccggggca tgctggaccg 1621 ctcccgcctg gccctgtgca cgctcgtctt cctctgcctg tcctgcaacc ccttggcctc 1681 cttgctgggg gcccgggggc ttcccagccc ctcagatacc accagcgtct accatagccc 1741 tgggcgcaac gtgctgggca ccgagagcag agatggccct ggctgggccc agtggctgct 1801 gcccccagtg gtctggctgc tcaatgggct gttggtgctc gtctccttgg tgcttctctt 1861 tgtctacggt gagccagtca cacggcccca ctcaggcccc gccgtgtact tctggaggca 1921 tcgcaagcag gctgacctgg acctggcccg gggagacttt gcccaggctg cccagcagct 1981 gtggctggcc ctgcgggcac tgggccggcc cctgcccacc tcccacctgg acctggcttg 2041 tagcctcctc tggaacctca tccgtcacct gctgcagcgt ctctgggtgg gccgctggct 2101 ggcaggccgg gcagggggcc tgcagcagga ctgtgctctg cgagtggatg ctagcgccag 2161 cgcccgagac gcagccctgg tctaccataa gctgcaccag ctgcacacca tggggaagca 2221 cacaggcggg cacctcactg ccaccaacct ggcgctgagt gccctgaacc tggcagagtg 2281 tgcaggggat gccgtgtctg tggcgacgct ggccgagatc tatgtggcgg ctgcattgag 2341 agtgaagacc agtctcccac gggccttgca ttttctgaca cgcttcttcc tgagcagtgc 2401 ccgccaggcc tgcctggcac agagtggctc agtgcctcct gccatgcagt ggctctgcca 2461 ccccgtgggc caccgtttct tcgtggatgg ggactggtcc gtgctcagta ccccatggga 2521 gagcctgtac agcttggccg ggaacccagt ggaccccctg gcccaggtga ctcagctatt 2581 ccgggaacat ctcttagagc gagcactgaa ctgtgtgacc cagcccaacc ccagccctgg 2641 gtcagctgat ggggacaagg aattctcgga tgccctcggg tacctgcagc tgctgaacag 2701 ctgttctgat gctgcggggg ctcctgccta cagcttctcc atcagttcca gcatggccac 2761 caccaccggc gtagacccgg tggccaagtg gtgggcctct ctgacagctg tggtgatcca 2821 ctggctgcgg cgggatgagg aggcggctga gcggctgtgc ccgctggtgg agcacctgcc 2881 ccgggtgctg caggagtctg agagacccct gcccagggca gctctgcact ccttcaaggc 2941 tgcccgggcc ctgctgggct gtgccaaggc agagtctggt ccagccagcc tgaccatctg 3001 tgagaaggcc agtgggtacc tgcaggacag cctggctacc acaccagcca gcagctccat 3061 tgacaaggcc gtgcagctgt tcctgtgtga cctgcttctt gtggtgcgca ccagcctgtg 3121 gcggcagcag cagcccccgg ccccggcccc agcagcccag ggcgccagca gcaggcccca 3181 ggcttccgcc cttgagctgc gtggcttcca acgggacctg agcagcctga ggcggctggc 3241 acagagcttc cggcccgcca tgcggagggt gttcctacat gaggccacgg cccggctgat 3301 ggcgggggcc agccccacac ggacacacca gctcctcgac cgcagtctga ggcggcgggc 3361 aggccccggt ggcaaaggag gcgcggtggc ggagctggag ccgcggccca cgcggcggga 3421 gcacgcggag gccttgctgc tggcctcctg ctacctgccc cccggcttcc tgtcggcgcc 3481 cgggcagcgc gtgggcatgc tggctgaggc ggcgcgcaca ctcgagaagc ttggcgatcg 3541 ccggctgctg cacgactgtc agcagatgct catgcgcctg ggcggtggga ccactgtcac 3601 ttccagctag accccgtgtc cccggcctca gcacccctgt ctctagccac tttggtcccg 3661 tgcagcttct gtcctgcgtc gaagctttga aggccgaagg cagtgcaaga gactctggcc 3721 tccacagttc gacctgcggc tgctgtgtgc cttcgcggtg gaaggcccga ggggcgcgat 3781 cttgacccta agaccggcgg ccatgatggt gctgacctct ggtggccgat cggggcactg 3841 caggggccga gccattttgg ggggcccccc tccttgctct gcaggcacct tagtggcttt 3901 tttcctcctg tgtacaggga agagaggggt acatttccct gtgctgacgg aagccaactt 3961 ggctttcccg gactgcaagc agggctctgc cccagaggcc tctctctccg tcgtgggaga 4021 gagacgtgta catagtgtag gtcagcgtgc ttagcctcct gacctgaggc tcctgtgcta 4081 ctttgccttt tgcaaacttt attttcatag attgagaagt tttgtacaga gaattaaaaa 4141 tgaaattatt tata // LOCUS U01120 3095 bp mRNA PRI 03-FEB-1994 DEFINITION Human glucose-6-phosphatase mRNA, complete cds. ACCESSION U01120 NID g452443 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3095) AUTHORS Lei,K., Shelly,L.L., Pan,C., Sidbury,J.B. and Chou,J.Y. TITLE Mutations in the glucose-6-phosphatase gene that cause glycogen storage disease type 1a JOURNAL Science 262, 580-583 (1993) MEDLINE 94024015 REFERENCE 2 (bases 1 to 3095) AUTHORS Chou,J.Y. TITLE Direct Submission JOURNAL Submitted (30-AUG-1993) J.Y. Chou, National Institutes of Health, Human Genetics Branch, 9000 Rockville Pike, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..3095 /organism="Homo sapiens" /db_xref="taxon:9606" /tissue_type="liver" CDS 80..1153 /codon_start=1 /product="glucose-6-phosphatase" /db_xref="PID:g452444" /translation="MEEGMNVLHDFGIQSTHYLQVNYQDSQDWFILVSVIADLRNAFY VLFPIWFHLQEAVGIKLLWVAVIGDWLNLVFKWILFGQRPYWWVLDTDYYSNTSVPLI KQFPVTCETGPGSPSGHAMGTAGVYYVMVTSTLSIFQGKIKPTYRFRCLNVILWLGFW AVQLNVCLSRIYLAAHFPHQVVAGVLSGIAVTETFSHIHSIYNASLKKYFLITFFLFS FAIGFYLLLKGLGVDLLWTLEKAQRWCEQPEWVHIDTTPFASLLKNLGTLFGLGLALN SSMYRESCKGKLSKWLPFRLSSIVASLVLLHVFDSLKPPSQVELVFYVLSFCKSAVVP LASVSVIPYCLAQVLGQPHKKSL" BASE COUNT 734 a 788 c 675 g 898 t ORIGIN 1 tagcagagca atcaccacca agcctggaat aactgcaagg gctctgctga catcttcctg 61 aggtgccaag gaaatgagga tggaggaagg aatgaatgtt ctccatgact ttgggatcca 121 gtcaacacat tacctccagg tgaattacca agactcccag gactggttca tcttggtgtc 181 cgtgatcgca gacctcagga atgccttcta cgtcctcttc cccatctggt tccatcttca 241 ggaagctgtg ggcattaaac tcctttgggt agctgtgatt ggagactggc tcaacctcgt 301 ctttaagtgg attctctttg gacagcgtcc atactggtgg gttttggata ctgactacta 361 cagcaacact tccgtgcccc tgataaagca gttccctgta acctgtgaga ctggaccagg 421 gagcccctct ggccatgcca tgggcacagc aggtgtatac tacgtgatgg tcacatctac 481 tctttccatc tttcagggaa agataaagcc gacctacaga tttcggtgct tgaatgtcat 541 tttgtggttg ggattctggg ctgtgcagct gaatgtctgt ctgtcacgaa tctaccttgc 601 tgctcatttt cctcatcaag ttgttgctgg agtcctgtca ggcattgctg ttacagaaac 661 tttcagccac atccacagca tctataatgc cagcctcaag aaatattttc tcattacctt 721 cttcctgttc agcttcgcca tcggatttta tctgctgctc aagggactgg gtgtagacct 781 cctgtggact ctggagaaag cccagaggtg gtgcgagcag ccagaatggg tccacattga 841 caccacaccc tttgccagcc tcctcaagaa cctgggcacg ctctttggcc tggggctggc 901 tctcaactcc agcatgtaca gggagagctg caaggggaaa ctcagcaagt ggctcccatt 961 ccgcctcagc tctattgtag cctccctcgt cctcctgcac gtctttgact ccttgaaacc 1021 cccatcccaa gtcgagctgg tcttctacgt cttgtccttc tgcaagagtg cggtagtgcc 1081 cctggcatcc gtcagtgtca tcccctactg cctcgcccag gtcctgggcc agccgcacaa 1141 gaagtcgttg taagagatgt ggagtcttcg gtgtttaaag tcaacaacca tgccagggat 1201 tgaggaggac tactatttga agcaatgggc actggtattt ggagcaagtg acatgccatc 1261 cattctgccg tcgtggaatt aaatcacgga tggcagattg gagggtcgcc tggcttattc 1321 ccatgtgtga ctccagcctg ccctcagcac agactctttc agatggaggt gccatatcac 1381 gtacaccata tgcaagtttc ccgccaggag gtcctcctct ctctacttga atactctcac 1441 aagtagggag ctcactccca ctggaacagc ccattttatc tttgaatggt cttctgccag 1501 cccattttga ggccagaggt gctgtcagct caggtggtcc tcttttacaa tcctaatcat 1561 attgggtaat gtttttgaaa agctaatgaa gctattgaga aagacctgtt gctagaagtt 1621 gggttgttct ggattttccc ctgaagactt acttattctt ccgtcacata tacaaaagca 1681 agacttccag gtagggccag ctcacaagcc caggctggag atcctaactg agaattttct 1741 acctgtgttc attcttaccg agaaaaggag aaaggagctc tgaatctgat aggaaaagaa 1801 ggctgcctaa ggaggagttt ttagtatgtg gcgtatcatg caagtgctat gccaagccat 1861 gtctaaatgg ctttaattat atagtaatgc actctcagta atgggggacc agcttaagta 1921 taattaatag atggttagtg gggtaattct gcttctagta ttttttttac tgtgcataca 1981 tgttcatcgt atttccttgg atttctgaat ggctgcagtg acccagatat tgcactaggt 2041 caaaacattc aggtatagct gacatctcct ctatcacatt acatcatcct ccttataagc 2101 ccagctctgc tttttccaga ttcttccact ggctccacat ccaccccact ggatcttcag 2161 aaggctagag ggcgactctg gtggtgcttt tgtatgtttc aattaggctc tgaaatcttg 2221 ggcaaaatga caaggggagg gccaggattc ctctctcagg tcactccagt gttactttta 2281 attcctagag ggtaaatatg actcctttct ctatcccaag ccaaccaaga gcacattctt 2341 aaaggaaaag tcaacatctt ctctcttttt tttttttttt gagacagggt ctcactatgt 2401 tgcccaggct gctcttgaat tcctgggctc aagcagtcct cccaccctac cacagcgtcc 2461 cgcgtagctg gcatacaggt gcaagccact atgtccagct agccaactcc tccttgcctg 2521 cttttctttt tttttctttt tttgagacgg cgcacctatc acccaggctg gagtggagtg 2581 gcacgatctt ggctcactgc aacctcttcc tcctggttca agcgattctc atgtctcagc 2641 ctcctcagta gctaggacta ccggcgtgca ccaccatgcc aggctaattt ttatattttt 2701 agaattttag aagagatggg atttcatcat gttggccagg ctggtctcga actcctgacc 2761 tcaagtgatc cacctgcctt ggcctcccaa ggtgctagga ttacaggcat gagccaccgc 2821 accgggccct ccttgcctgt ttttcaatct catctgatat gcagagtatt tctgccccac 2881 ccacctaccc cccaaaaaaa gctgaagcct atttatttga aagtccttgt ttttgctact 2941 aattatatag tataccatac attatcattc aaaacaacca tcctgctcat aacatctttg 3001 aaaagaaaaa tatatatgtg cagtatttta ttaaagcaac attttattta agaataaagt 3061 cttgttaatt actatatttt agatgcaatg tgatc // LOCUS U01147 5265 bp mRNA PRI 03-FEB-1994 DEFINITION Human guanine nucleotide regulatory protein (ABR) mRNA, complete cds. ACCESSION U01147 NID g393094 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (sites) AUTHORS Tan,E.C., Leung,T., Manser,E. and Lim,L. TITLE The human active breakpoint cluster region-related gene encodes a brain protein with homology to guanine nucleotide exchange proteins and GTPase-activating proteins JOURNAL J. Biol. Chem. 268 (36), 27291-27298 (1993) MEDLINE 94086546 REFERENCE 2 (bases 1 to 5265) AUTHORS Tan,E. TITLE Direct Submission JOURNAL Submitted (31-AUG-1993) Tan E., Institute of Molecular and Cell Biology, Kent Ridge, Singapore, Singapore, 0511 FEATURES Location/Qualifiers source 1..5265 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="ABR" /clone_lib="hippocampal cDNA" /map="17p13.3" /sex="female" /tissue_type="brain" /dev_stage="infant" mRNA 1..5265 gene 111..2690 /gene="ABR" CDS 111..2690 /gene="ABR" /standard_name="GAP" /codon_start=1 /function="GTPase activating protein" /product="guanine nucleotide regulatory protein" /db_xref="PID:g393095" /translation="MEPLSHRGLPRLSWIDTLYSNFSYGTDEYDGEGNEEQKGPPEGS ETMPYIDESPTMSPQLSARSQGRGDGVSPTPPEGLAPGVEAGKGLEMRKLVLSGFLAS EEIYINQLEALLLPMKPLKATATTSQPVLTIQQIETIFYKIQDIYEIHKEFYDNLCPK VQQWDSQVTMGHLFQKLASQLGVYKAFVDNYKVALETAEKCSQSNNQFQKISEELKVK GPKDSKDSHTSVTMEALLYKPIDRVTRSTLVLHDLLKHTPVDHPDYPLLQDALRISQN FLSSINEDIDPRRTAVTTPKGETRQLVKDGFLVEVSESSRKLRHVFLFTDVLLCAKLK KTSAGKHQQYDCKWYIPLADLVFPSPEESEASPQVHPFPDHELEDMKMKISALKSEIQ KEKANKGQSRAIERLKKKMFENEFLLLLNSPTIPFRIHNRNGKSYLFLLSSDYERSEW REAIQKLQKKDLQAFVLSSVELQVLTGSCFKLRTVHNIPVTSNKDDDESPGLYGFLHV IVHSAKGFKQSANLYCTLEVDSFGYFVSKAKTRVFRDTAEPKWDEEFEIELEGSQSLR ILCYEKCYDKTKVNKDNNEIVDKIMGKGQIQLDPQTVETKNWHTDVIEMNGIKVEFSM KFTSRDMSLKRTPSKKQTGVFGVKISVVTKRERSKVPYIVRQCVEEVEKRGIEEVGIY RISGVATDIQALKAVFDANNKDILLMLSDMDINAIAGTLKLYFRELPEPLLTDRLYPA FMEGIALSDPAAKENCMMHLLRSLPDPNLITFLFLLEHLKRVAEKEPINKMSLHNLAT VFGPTLLRPSEVESKAHLTSAADIWSHDVMAQVQVLLYYLQHPPISFAELKRNTLYFS TDV" polyA_site 5242 BASE COUNT 1202 a 1548 c 1454 g 1061 t ORIGIN 1 ctcgcccctc cgcgctcgca acttcggcct cccccggctc ccgcccgccc tccctccttt 61 gttgcgcgat gagggtcggg tttcggatct gaccgagccg ccgccgcggg atggagccgc 121 tcagccaccg gggcctgccg cgcctgtcct ggatcgacac cctctacagc aacttcagct 181 acgggacgga cgagtacgac ggagagggga atgaggagca gaaggggccc ccggagggct 241 cagagaccat gccgtacatc gatgagtcgc ccaccatgtc cccgcagctc agcgcccgca 301 gccagggccg gggggatggc gtctccccga ctccacctga gggactggct cctggggtgg 361 aagcagggaa aggcctggag atgaggaagc tggttctctc ggggttcttg gccagcgaag 421 agatctacat taaccagctg gaagccctgt tgctgcccat gaaacccctg aaggccaccg 481 ccaccacctc ccagcccgtg ctcaccatcc agcagatcga gaccatcttc tacaagatcc 541 aggacatcta tgagatccac aaggagttct atgacaacct gtgccccaag gtgcaacagt 601 gggacagcca ggtcaccatg ggccacctct tccagaagct ggccagccag ctcggtgtgt 661 acaaagcgtt tgtcgataac tataaagtcg ctctggagac agctgagaag tgcagccagt 721 ccaacaacca gttccagaag atctcagagg aactcaaagt gaaaggtccc aaggactcca 781 aggacagcca cacgtctgtc accatggaag ctctgctcta caagcccatt gaccgggtca 841 ctcggagcac cctagtccta cacgacctgc tgaagcacac acctgtggac caccccgact 901 acccgctgct gcaggatgcc ctccgcatct cccagaactt cctgtccagc atcaacgagg 961 acatcgaccc ccgccggact gcagtgacaa cgcccaaggg ggagacgcga cagctggtga 1021 aggacggctt cctggtggaa gtgtcagaga gctcccggaa gctgcggcac gtcttcctct 1081 ttacagatgt cctactgtgt gccaagctga agaagacctc tgcagggaag caccagcagt 1141 atgactgtaa gtggtacatc cccctggccg acctggtgtt tccatccccc gaggaatctg 1201 aggccagccc ccaggtgcac cccttcccag accatgagct ggaggacatg aagatgaaga 1261 tctctgccct caagagtgaa atccagaagg agaaagccaa caaaggccag agccgtgcca 1321 tcgagcgcct gaagaagaag atgtttgaga atgagttcct gctgctgctc aactccccca 1381 caatcccgtt caggatccac aatcggaatg gaaagagtta cctgttccta ctgtcctcgg 1441 actacgagag gtcagagtgg agagaagcaa ttcagaaact acagaagaag gatctccagg 1501 cctttgtcct gagctcagtg gagctccagg tgctcacagg atcctgtttc aagcttagga 1561 ctgtacacaa cattcctgtc accagcaata aagacgacga tgagtctcca ggactctatg 1621 gcttccttca tgtcatcgtc cactctgcca agggatttaa gcaatcagcc aacctgtact 1681 gtaccctgga ggtggattcc ttcggctatt ttgtcagcaa agccaaaacc agggtgttcc 1741 gggacacagc ggagcccaag tgggatgagg agtttgagat cgagctggag ggctcccagt 1801 ccctgaggat cctgtgctat gagaagtgct atgacaagac caaggtcaac aaggacaaca 1861 atgagatcgt ggacaagatc atgggcaaag gacagatcca gctggaccca caaaccgtgg 1921 agaccaagaa ctggcacacg gacgtgattg agatgaacgg gatcaaagtg gaattttcca 1981 tgaaattcac cagccgagat atgagcctga agaggacccc gtccaaaaag cagaccggcg 2041 tcttcggtgt gaagatcagc gtggtgacga agcgggagcg ctccaaggtg ccctacatcg 2101 tccggcagtg tgtggaggag gtggagaaga ggggtatcga ggaggttggc atctacagga 2161 tatcgggcgt ggccacggac atccaggcgc tcaaggccgt cttcgatgcc aataacaagg 2221 acatcctgct gatgctgagt gacatggaca tcaacgccat cgccgggacg ctcaagctgt 2281 acttccggga actgcccgag ccgctcctca cggaccgact ctacccagcc ttcatggagg 2341 gcatcgccct gtcagaccct gctgccaagg aaaactgcat gatgcacctg ctccgctccc 2401 tgcccgaccc caacctcatc accttcctct tcctgctgga acacttgaaa agggttgccg 2461 agaaggagcc catcaacaaa atgtcacttc acaacctggc taccgtgttt ggacccacgt 2521 tactgagacc ctcagaagtg gagagcaaag cacacctcac ctcggctgcg gacatctggt 2581 cccatgacgt catggcgcag gtccaggtcc tcctctacta cctgcagcac ccccccattt 2641 ccttcgcaga actcaagcgg aacacactgt acttctccac cgacgtgtag cccgaggcag 2701 ggtggctgcg ggcgggtggt ggaaccagcc cctccagcct ggggtccaac tcagacttga 2761 aagactgcaa tagaaaactc ccaaacccag cactccagac tcgagggaag ccagcttcca 2821 agaactggaa tgcgtacgtc ttttgtgcca ccttgtacaa agccggctgc ccagccccag 2881 cctcaccacc gcatcccacc tcctgccctc catacctcta gttgtgtctg atgctccgtg 2941 ctgttcggga attgttttat gtacacttgt caggcagaaa aggtagtgac cggcccggcg 3001 tgggcacaca gacagcccgc tttgttcttt catttcctcc agcactttct ttccgcctga 3061 gtccagccca aggcctttta ttttgcgctg tgtaactgct gccagcttct ctcttggccc 3121 tgctcccaga tggcggtctc ctggcagcct cccctcagtc ttcctccacc cgcctcttcc 3181 ttcccagcct gcctgcatgc atgtgcaccc ttggtcttcg ctccatcgcc ttgaaagctc 3241 tgaagaggcc ctgggttgtc gcggcagcag tggtctgttt gatgctgccg tttgccgctg 3301 ccggcccctc ctcagactcc gcctttggga gcacacctgc tttgccttgc tgcctgtgca 3361 aatgttggac aagcagacac actcacactc gtccccagct tagcacagag ctggagcgcc 3421 catttctgga attttccgtt tgggaatctc cacttctggg gtttacctgt tcggcctcct 3481 gcctatcagt gaggcatctc tgactgttcc ttctactgct tttcagttcc cttccctgct 3541 gttctatttc ctttgagtgt aaagactcac aggtgacctg ctatcgagat agccagaggg 3601 tcaggagaga atgggggagg aggcggtcag gctgctgagg aaacaccaca ggctgaacgg 3661 gggaggaatg cacatgccac gctgggtgtc ccgggtcgcg gggaggcagc tcagctctta 3721 ggagcaagtt gtgggggctt ttcaagaggg gccaggcttc ctggagggtg actgatgtgg 3781 ccgaagcagg tgtccaggca ggtaggctgc agccaggagc tccctggcac cgcaggacct 3841 cgtggtactc ttgccttaga ttttacacac actccacagc caagcactgc cacggtcctc 3901 caggacctgg gaagcaaagg cacaggccca cggtggccag ccattgtggt gccgccccag 3961 cttctggata cagccttttg ggtaaacact gggaactcca gaagttgtgg ggagagtggg 4021 gaatcagaca gccgcctcta ggggctgggt tctgctgggg cctccttgtt ggtgctgtag 4081 gcacccgcca ggagcaggga cccgacttgc agacgcattg cccggtacta ggaaggagtg 4141 aggtgtgttc ccaccgtaca cttcccacac gagctgcggc tgccagcctc gggccatcag 4201 cctaggagag cagatgcagc tccaggggct cgacttatag ccagttacag ctccccggct 4261 cttctgtgtg gcagagcgtc gtttccgggc cctcagggct ggggagctca gttcccattg 4321 cttgtgctca gggctgagtc ttaaagaagg gtttgccggc cctaacgctg cagccgtgct 4381 gagaggccct ttttgagcct gtttactcct gtggccttgg gcagaacagt aaatactctg 4441 tgcacggagg aaagacatgc ccaagaggaa ggaagtactg accatcggct gcctgtgagc 4501 agcttagcaa ggagcccttg ctccctggga aaggcggtga acttgagtct aaagatgcag 4561 tgcctggccc ttcctaaggt ccctgcctgg catccgagtg tcggtgtgtg gcacagaagg 4621 ctcctgcttg cttccaaagt gatggacagg aaggggcaga gtgagtcacg gcccagactg 4681 cgaccttcac gtctcagcct cagggagccc cacagcccca agctcgctga ggcaacgtga 4741 gaacaggcta tgggaaggct gcaaaggctg agaaatgcaa aggctcatat ttataaatcc 4801 cacccccaga gtggggaggg tcaggtgcca gacctggact aaactgcacc aaggaaacac 4861 ccagcagggt ctcctgtgag ccggggacca tgcagcccga aacctccagt cactgcgccc 4921 ggcaggagtc aggagccagg gactgtgcag cctggaacct ccagtcactg tgccagcagg 4981 gtggctgtgc ccagcaggag tcaggctaag aaacgccagg tctgcctgtt cttgctgggc 5041 aatggctgat ggctgccagt ttctgctgat acacaggtag gatgggaccc ttcatgaata 5101 tctgacttta ataagttggt aaggatatat ttttttgtct atgttctgtt tcaacttatg 5161 tagattatta taaattgatg taaaccacgt gagaggaaaa tgttaataaa aaatgcaaag 5221 ccccatcatt tgcacaaaac tcaaaaaaaa aaaaaaaaaa aaaaa // LOCUS U01160 1677 bp mRNA PRI 29-APR-1994 DEFINITION Human transmembrane 4 superfamily protein (SAS) mRNA, complete cds. ACCESSION U01160 NID g457936 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1677) AUTHORS Jankowski,S.A., Mitchell,D.S., Smith,S.H., Trent,J.M. and Meltzer,P.S. TITLE SAS, a gene amplified in human sarcomas, encodes a new member of the transmembrane 4 superfamily of proteins JOURNAL Oncogene 9, 1205-1211 (1994) MEDLINE 94181273 REFERENCE 2 (bases 1 to 1677) AUTHORS Jankowski,S.A. TITLE Direct Submission JOURNAL Submitted (01-SEP-1993) Sheryl A. Jankowski, Human Genetics, University of Michigan, 4708 Medical Science II, Ann Arbor, MI 48109, USA FEATURES Location/Qualifiers source 1..1677 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="cSAS56" /clone_lib="OsA-Cl cDNA library" /map="12q13" /cell_line="OsA-Cl" /tissue_type="osteosarcoma" gene 103..735 /gene="SAS" CDS 103..735 /gene="SAS" /codon_start=1 /product="SAS" /db_xref="PID:g457937" /translation="MVCGGFACSKNALCALNVVYMLVSLLLIGVAAWGKGLGLVSSIH IIGGVIAVGVFLLLIAVAGLVGAVNHHQVLLFFYMIILGLVFIFQFVISCSCLAINRS KQTDVINASWWVMSNKTRDELERSFDCCGLFNLTTLYQQDYDFCTAICKSQSPTCQMC GEKFLKHSDEALKILGGVGLFFSFTEILGVWLAMRFRNQKDPRANPSAFL" polyA_signal 923..928 polyA_signal 1664..1669 BASE COUNT 411 a 400 c 389 g 477 t ORIGIN 1 gcctcgattt aaagagacag aagctgtcgg ggtcctggaa gacggtcccc aataccctcc 61 ccccaagtcc ttgggaccac ttgggtcccc agagctgggg agatggtttg tggcggcttt 121 gcctgctcca agaatgcgct ttgcgctctc aacgtggtct acatgctggt gagcttgttg 181 ctcattggag tggctgcttg gggcaagggc ctgggtctgg tgtccagcat ccacatcatc 241 ggcggagtca ttgctgtggg agtcttcctt ctccttattg cagtggctgg actggtgggt 301 gctgtcaacc accaccaagt cctgctgttc ttttacatga tcatccttgg tttggtcttc 361 atcttccaat ttgtaatctc ttgctcatgt ctggctatta accgaagcaa acagacagat 421 gtcatcaatg cttcttggtg ggtcatgagc aacaagactc gggatgaact ggaaagaagt 481 tttgattgtt gtggcttatt caacctcaca accctgtatc aacaagatta tgatttctgc 541 actgcaatct gcaagagcca gagccccaca tgccagatgt gtggagaaaa gtttcttaag 601 cattcagacg aagccctgaa aatcctaggg ggtgttggac tcttctttag ctttacagag 661 atccttggtg tttggctagc aatgagattt cggaatcaga aggatcctag agccaacccc 721 agtgcctttc tatgagactt tggatccttc tgacttttct tctgctctct ctaagctttc 781 tcttcctccc ttagggaata tctagggtct gtaaccgttt tggtttgaga aaaaggaaag 841 gccccttgtc acatcctcta aaattgatgg aatagcaaga ctttatgcct tgacatattt 901 tagtgggagc cagactataa ggaataaaag gaaaaacttt cttcctctct ctccaagagg 961 atatgggaag cttctgtgag tgcataggat gggggctgga gtcattctta gctgtttccc 1021 ttcctctgtc catatactgg atcacctcaa cataccctgg tgtggctcta agggtaaatc 1081 agggataggg ccaaggagaa aacaaccaag aactctttcc tgtaataagc aggatccagt 1141 ttgagaaagt ttagcgaata taaaagtaaa agccatttaa aaatctatat tctttttttt 1201 tttttgacac agagtcttgc tctgttgccc aggctggagt gcaatggcat gatctcggct 1261 caccgcaacc tctgcctccc gggttcaagc gatcctcctg cctcagtctc ccaagtagct 1321 gggattacag gtgtgcacca ccacgcctga ctaattttgt atttttagta gagacagggt 1381 ttcaccatgt tagtcaggct ggtctcgaac tcctgacctc aggtgatcca cccgcgttgg 1441 cctcccaacg tgctgggatt ataggcgtga gccaccacgc cccgcctaaa atccatattc 1501 aaagaagcaa tttcagttcc tttctaagct ttgtcagtca aggggctcca ctgacttcct 1561 aggccctgta atttaaccag tctttaaggt tttgcaggaa agtcccttct tccaagtggt 1621 ttttccaaat cgcacaatgg caaagccaaa cagaggaaga aacattaaaa aaaaaaa // LOCUS U01351 3557 bp mRNA PRI 10-MAR-1994 DEFINITION Human glucocorticoid receptor alpha-2 mRNA, complete cds. ACCESSION U01351 NID g458656 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3557) AUTHORS Munroe,D.G., Pang,J., Taylor,G.R., Lau,C., Plante,R.K. and Zhou,L. TITLE Alternative splicing within the DNA binding domain creates a novel isoform of the human glucocorticoid receptor JOURNAL Unpublished REFERENCE 2 (bases 1 to 3557) AUTHORS Munroe,D.G. TITLE Direct Submission JOURNAL Submitted (09-SEP-1993) Donald G. Munroe, The R.W. Johnson Pharmaceutical Research Institute, 19 Green Belt Drive, Don Mills, Ontario M3C 1L9, Canada FEATURES Location/Qualifiers source 1..3557 /organism="Homo sapiens" /note="TE85 cDNA library cloned in lambda gt11 vector." /db_xref="taxon:9606" /clone="pBS-132" /cell_line="TE85" /cell_type="osteosarcoma" misc_RNA 1..296 /note="This region of cDNA is upstream of the reported IM9 major transcription start site." CDS 332..484 /note="putative upstream ORF." /codon_start=1 /db_xref="PID:g458658" /translation="MEKGAAVYFLFLEKKNIFPSCSFCVHKLSCLSRLRRELRTVAGE RLLCQS" CDS 494..2830 /codon_start=1 /product="glucocorticoid receptor alpha-2" /db_xref="PID:g458657" /translation="MDSKESLTPGREENPSSVLAQERGDVMDFYKTLRGGATVKVSAS SPSLAVASQSDSKQRRLLVDFPKGSVSNAQQPDLSKAVSLSMGLYMGETETKVMGNDL GFPQQGQISLSSGETDLKLLEESIANLNRSTSVPENPKSSASTAVSAAPTEKEFPKTH SDVSSEQQHLKGQTGTNGGNVKLYTTDQSTFDILQDLEFSSGSPGKETNESPWRSDLL IDENCLLSPLAGEDDSFLLEGNSNEDCKPLILPDTKPKIKDNGDLVLSSPSNVTLPQV KTEKEDFIELCTPGVIKQEKLGTVYCQASFPGANIIGNKMSAISVHGVSTSGGQMYHY DMNTASLSQQQDQKPIFNVIPPIPVGSENWNRCQGSGDDNLTSLGTLNFPGRTVFSNG YSSPSMRPDVSSPPSSSSTATTGPPPKLCLVCSDEASGCHYGVLTCGSCKVFFKRAVE GRQHNYLCAGRNDCIIDKIRRKNCPACRYRKCLQAGMNLEARKTKKKIKGIQQATTGV SQETSENPGNKTIVPATLPQLTPTLVSLLEVIEPEVLYAGYDSSVPDSTWRIMTTLNM LGGRQVIAAVKWAKAIPGFRNLHLDDQMTLLQYSWMFLMAFALGWRSYRQSSANLLCF APDLIINEQRMTLPCMYDQCKHMLYVSSELHRLQVSYEEYLCMKTLLLLSSVPKDGLK SQELFDEIRMTYIKELGKAIVKREGNSSQNWQRFYQLTKLLDSMHEVVENLLNYCFQT FLDKTMSIEFPEMLAEIITNQIPKYSNGNIKKLLFHQK" misc_RNA 1845..1847 /note="Alternative splice donor selection in intron C produces GR alpha-2 mRNA containing 3 bp insertion relative to GR alpha-1 isoform." BASE COUNT 999 a 795 c 812 g 951 t ORIGIN 1 ggcgccgcct ccacccgctc cccgctcggt cccgctcgct cgcccaggcc gggctgccct 61 ttcgcgtgtc cgcgctctct tccctccgcc gccgcctcct ccattttgcg agctcgtgtc 121 tgtgacggga gcccgagtca ccgcctgccc gtcggggacg gattctgtgg gtggaaggag 181 acgccgcagc cggagcggcc gaagcagctg ggaccgggac ggggcacgcg cgcccggaag 241 ccccgacccg cggagcccgg cgcggggcgg agggctggct tgtcagctgg gcaatgggag 301 actttcttaa ataggggctc tccccccacc catggagaaa ggggcggctg tttacttcct 361 ttttttagaa aaaaaaaata tatttccctc ctgctccttc tgcgttcaca agctaagttg 421 tttatctcgg ctgcggcggg aactgcggac ggtggcgggc gagcggctcc tctgccagag 481 ttgatattca ctgatggact ccaaagaatc attaactcct ggtagagaag aaaaccccag 541 cagtgtgctt gctcaggaga ggggagatgt gatggacttc tataaaaccc taagaggagg 601 agctactgtg aaggtttctg cgtcttcacc ctcactggct gtcgcttctc aatcagactc 661 caagcagcga agacttttgg ttgattttcc aaaaggctca gtaagcaatg cgcagcagcc 721 agatctgtcc aaagcagttt cactctcaat gggactgtat atgggagaga cagaaacaaa 781 agtgatggga aatgacctgg gattcccaca gcagggccaa atcagccttt cctcggggga 841 aacagactta aagcttttgg aagaaagcat tgcaaacctc aataggtcga ccagtgttcc 901 agagaacccc aagagttcag catccactgc tgtgtctgct gcccccacag agaaggagtt 961 tccaaaaact cactctgatg tatcttcaga acagcaacat ttgaagggcc agactggcac 1021 caacggtggc aatgtgaaat tgtataccac agaccaaagc acctttgaca ttttgcagga 1081 tttggagttt tcttctgggt ccccaggtaa agagacgaat gagagtcctt ggagatcaga 1141 cctgttgata gatgaaaact gtttgctttc tcctctggcg ggagaagacg attcattcct 1201 tttggaagga aactcgaatg aggactgcaa gcctctcatt ttaccggaca ctaaacccaa 1261 aattaaggat aatggagatc tggttttgtc aagccccagt aatgtaacac tgccccaagt 1321 gaaaacagaa aaagaagatt tcatcgaact ctgcacccct ggggtaatta agcaagagaa 1381 actgggcaca gtttactgtc aggcaagctt tcctggagca aatataattg gtaataaaat 1441 gtctgccatt tctgttcatg gtgtgagtac ctctggagga cagatgtacc actatgacat 1501 gaatacagca tccctttctc aacagcagga tcagaagcct atttttaatg tcattccacc 1561 aattcccgtt ggttccgaaa attggaatag gtgccaagga tctggagatg acaacttgac 1621 ttctctgggg actctgaact tccctggtcg aacagttttt tctaatggct attcaagccc 1681 cagcatgaga ccagatgtaa gctctcctcc atccagctcc tcaacagcaa caacaggacc 1741 acctcccaaa ctctgcctgg tgtgctctga tgaagcttca ggatgtcatt atggagtctt 1801 aacttgtgga agctgtaaag ttttcttcaa aagagcagtg gaaggtagac agcacaatta 1861 cctatgtgct ggaaggaatg attgcatcat cgataaaatt cgaagaaaaa actgcccagc 1921 atgccgctat cgaaaatgtc ttcaggctgg aatgaacctg gaagctcgaa aaacaaagaa 1981 aaaaataaaa ggaattcagc aggccactac aggagtctca caagaaacct ctgaaaatcc 2041 tggtaacaaa acaatagttc ctgcaacgtt accacaactc acccctaccc tggtgtcact 2101 gttggaggtt attgaacctg aagtgttata tgcaggatat gatagctctg ttccagactc 2161 aacttggagg atcatgacta cgctcaacat gttaggaggg cggcaagtga ttgcagcagt 2221 gaaatgggca aaggcaatac caggtttcag gaacttacac ctggatgacc aaatgaccct 2281 actgcagtac tcctggatgt ttcttatggc atttgctctg gggtggagat catatagaca 2341 atcaagtgca aacctgctgt gttttgctcc tgatctgatt attaatgagc agagaatgac 2401 tctaccctgc atgtacgacc aatgtaaaca catgctgtat gtttcctctg agttacacag 2461 gcttcaggta tcttatgaag agtatctctg tatgaaaacc ttactgcttc tctcttcagt 2521 tcctaaggac ggtctgaaga gccaagagct atttgatgaa attagaatga cctacatcaa 2581 agagctagga aaagccattg tcaagaggga aggaaactcc agccagaact ggcagcggtt 2641 ttatcaactg acaaaactct tggattctat gcatgaagtg gttgaaaatc tccttaacta 2701 ttgcttccaa acatttttgg ataagaccat gagtattgaa ttccccgaga tgttagctga 2761 aatcatcacc aatcagatac caaaatattc aaatggaaat atcaaaaaac ttctgtttca 2821 tcaaaagtga ctgccttaat aagaatggtt gccttaaaga aagtcgaatt aatagctttt 2881 attgtataaa ctatcagttt gtcctgtaga ggttttgttg ttttattttt tattgttttc 2941 atctgttgtt ttgttttaaa tacgcactac atgtggttta tagagggcca agacttggca 3001 acagaagcag ttgagtcgtc atcacttttc agtgatggga gagtagatgg tgaaatttat 3061 tagttaatat atcccagaaa ttagaaacct taatatgtgg acgtaatctc cacagtcaaa 3121 gaaggatggc acctaaacca ccagtgccca aagtctgtgt gatgaacttt ctcttcatac 3181 tttttttcac agttggctgg atgaaatttt ctagactttc tgttggtgta tcccccccct 3241 gtatagttag gatagcattt ttgatttatg catggaaacc tgaaaaaaag tttacaagtg 3301 tatatcagaa aagggaagtt gtgcctttta tagctattac tgtctggttt taacaatttc 3361 ctttatattt agtgaactac gcttgctcat tttttcttac ataatttttt attcaagtta 3421 ttgtacagct gtttaagatg ggcagctagt tcgtagcttt cccaaataaa ctctaaacat 3481 taatcaatca tctgtgtgaa aatgggttgg tgcttctaac ctgatggcac ttagctatca 3541 gaagaccaca aaaattg // LOCUS U01833 1213 bp ds-mRNA PRI 03-FEB-1995 DEFINITION Human nucleotide-binding protein mRNA, complete cds. ACCESSION U01833 NID g450768 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1213) AUTHORS Shahrestanifar,M., Saha,D.P., Scala,L.A., Basu,A. and Howells,R.D. TITLE Cloning of a human cDNA encoding a putative nucleotide-binding protein related to Escherichia coli MinD JOURNAL Gene 147 (2), 281-285 (1994) MEDLINE 95011630 REFERENCE 2 (bases 1 to 1213) AUTHORS Howells,R.D. TITLE Direct Submission JOURNAL Submitted (15-SEP-1993) Richard D. Howells, Biochemistry and Molecular Biology, New Jersey Medical School, 185 South Orange Avenue, Newark, NJ 07103, USA FEATURES Location/Qualifiers source 1..1213 /organism="Homo sapiens" /db_xref="taxon:9606" /sex="female" /cell_line="SHSY5Y" /cell_type="neuroblastoma" CDS 5..967 /note="putative nucleotide-binding protein" /codon_start=1 /db_xref="PID:g515644" /translation="MEEVPHDCPGADSAQAGRGASCQGCPNQRLCASGAGATPDTAIE EIKEKMKTVKHKILVLSGKGGVGKSTFSAHLAHGLAEDENTQIALLDIDICGPSIPKI MGLEGEQVHQSGSGWSPVYVEDNLGVMSVGFLLSSPDDAVIWRGPKKNGMIKQFLRDV DWGEVDYLIVDTPPGTSDEHLSVVRYLATAHIDGAVIITTPQELSLQDVRKEINFCRK VKLPIIGVVENMSPFICPKCKKESQIFPPTTGGAELMCQDLEVPLLGRVPLDPLIGKN CDKGQSFFIDAPDSPATLAYRSIIQRIQEFCNLHQSKEENLISS" misc_feature 188..202 /note="consensus GTP/ATP binding site" polyA_signal 1193..1198 BASE COUNT 322 a 310 c 329 g 252 t ORIGIN 1 cggaatggag gaggtgcctc acgactgtcc aggggccgac agcgcccagg cgggcagagg 61 ggcttcatgt cagggatgcc ccaaccagcg gctgtgcgct tctggagcgg gggccactcc 121 ggacacggct atagaggaaa tcaaagagaa aatgaagact gtaaaacaca aaatcttggt 181 attgtctggg aaaggcggtg ttgggaaaag cacattcagc gcccaccttg cccatggcct 241 agcagaggat gaaaacacac agattgctct tctagacatc gatatatgtg ggccatcgat 301 tcccaagata atgggattgg aaggagagca ggttcaccag agtggctcag gctggtctcc 361 agtgtacgtg gaagacaacc tgggggtgat gtcagtgggc ttcctgctca gcagtcctga 421 tgatgctgtt atctggaggg gacccaagaa aaacggcatg atcaagcagt tcctccgaga 481 tgtggactgg ggagaggtcg actacctcat tgtggacacc ccacctggga cgtcggatga 541 acacctctcg gtcgtccggt acctggccac agcacacatc gatggagcag tgatcatcac 601 cactccccag gagctgtcac tccaggatgt ccggaaagaa atcaacttct gccgcaaggt 661 gaagctgccc atcatcgggg tggtggagaa catgagtccg ttcatctgtc ctaagtgcaa 721 gaaagaatct cagatattcc ctcccacaac cgggggcgcg gagctcatgt gccaggactt 781 ggaggtccct ctcctcggca gagtgcccct ggatccgctc ataggtaaga attgtgacaa 841 aggccagtct tttttcattg acgccccaga ttccccagcc acgttagcct acagaagtat 901 aattcagaga atccaagagt tttgtaatct ccatcagtca aaagaagaga acctcatcag 961 ttcctgaaac gagagaatgt tcaggaccaa gcagttaccg agcgaggcac tcactgggca 1021 gcacatccag ccagacccga ccagctccgg gatggggtgg gtcacagcaa aaggaccaga 1081 tgctggtgtg gtccgaagcc actttctcag agacacttta atcattgagt atttgtacac 1141 ttttctttag aacatatata aagggcattc tctacaaatg tgccgtttta agaataaaac 1201 cccctcaaat ctc // LOCUS U01839 1240 bp mRNA PRI 23-NOV-1993 DEFINITION Human Duffy blood group antigen (Fya-b+) mRNA, complete cds. ACCESSION U01839 NID g425267 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1240) AUTHORS Chaudhuri,A., Polyakova,J., Zbrzezna,V., Williams,K., Gulati,S. and Pogo,A.O. TITLE Cloning of glycoprotein D cDNA, which encodes the major subunit of the Duffy blood group system and the receptor for the Plasmodium vivax malaria parasite JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90 (22), 10793-10797 (1993) MEDLINE 94068488 REFERENCE 2 (bases 1 to 1240) AUTHORS Chaudhuri,A. TITLE Direct Submission JOURNAL Submitted (15-SEP-1993) Chaudhuri A., New York Blood Center, Department of Cell Biology, 310 East 67th Street, New York, NY 10021 USA FEATURES Location/Qualifiers source 1..1240 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="Fyb71-81" /clone_lib="bone marrow cDNA library in lamda ZAPII vector (Stratagene)" /haplotype="Fya-b+" /chromosome="1" /tissue_type="bone marrow" gene 176..1192 /gene="Fya-b+" CDS 176..1192 /gene="Fya-b+" /standard_name="Duffy" /note="Duffy is also a chemokine receptor and binds IL-8." /codon_start=1 /function="receptor for human malaria parasite, Plasmodium vivax" /evidence=experimental /product="human blood group antigen, Duffy protein" /db_xref="PID:g425268" /translation="MASSGYVLQAELSPSTENSSQLDFEDVWNSSYGVNDSFPDGDYD ANLEAAAPCHSCNLLDDSALPFFILTSVLGILASSTVLFMLFRPLFRWQLCPGWPVLA QLAVGSALFSIVVPVLAPGLGSTRSSALCSLGYCVWYGSAFAQALLLGCHASLGHRLG AGQVPGLTLGLTVGIWGVAALLTLPVTLASGASGGLCTLIYSTELKALQATHTVACLA IFVLLPLGLFGAKGLKKALGMGPGPWMNILWAWFIFWWPHGVVLGLDFLVRSKLLLLS TCLAQQALDLLLNLAEALAILHCVATPLLLALFCHQATRTLLPSLPLPEGWSSHLDTL GSKS" BASE COUNT 186 a 401 c 316 g 337 t ORIGIN 1 ggcttcccca ggactgttcc tgctccggct cttcaggctc cctgctttgt ccttttccac 61 tgtccgcact gcatctgact cctgcagaga ccttgttctc ccacccgacc ttcctctctg 121 tcctcccctc ccacctgccc ctcagttccc aggagactct tccggtgtaa ctctgatggc 181 ctcctctggg tatgtcctcc aggcggagct ctccccctca actgagaact caagtcagct 241 ggacttcgaa gatgtatgga attcttccta tggtgtgaat gattccttcc cagatggaga 301 ctatgatgcc aacctggaag cagctgcccc ctgccactcc tgtaacctgc tggatgactc 361 tgcactgccc ttcttcatcc tcaccagtgt cctgggtatc ctagctagca gcactgtcct 421 cttcatgctt ttcagacctc tcttccgctg gcagctctgc cctggctggc ctgtcctggc 481 acagctggct gtgggcagtg ccctcttcag cattgtggtg cccgtcttgg ccccagggct 541 aggtagcact cgcagctctg ccctgtgtag cctgggctac tgtgtctggt atggctcagc 601 ctttgcccag gctttgctgc tagggtgcca tgcctccctg ggccacagac tgggtgcagg 661 ccaggtccca ggcctcaccc tggggctcac tgtgggaatt tggggagtgg ctgccctact 721 gacactgcct gtcaccctgg ccagtggtgc ttctggtgga ctctgcaccc tgatatacag 781 cacggagctg aaggctttgc aggccacaca cactgtagcc tgtcttgcca tctttgtctt 841 gttgccattg ggtttgtttg gagccaaggg gctgaagaag gcattgggta tggggccagg 901 cccctggatg aatatcctgt gggcctggtt tattttctgg tggcctcatg gggtggttct 961 aggactggat ttcctggtga ggtccaagct gttgctgttg tcaacatgtc tggcccagca 1021 ggctctggac ctgctgctga acctggcaga agccctggca attttgcact gtgtggctac 1081 gcccctgctc ctcgccctat tctgccacca ggccacccgc accctcttgc cctctctgcc 1141 cctccctgaa ggatggtctt ctcatctgga cacccttgga agcaaatcct agttctcttc 1201 ccacctgtca acctgaatta aagtctacac tgcctttgtg // LOCUS U03626 1249 bp mRNA PRI 13-APR-1994 DEFINITION Human arrestin-C mRNA, complete cds. ACCESSION U03626 NID g458200 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1249) AUTHORS Craft,C.M., Whitmore,D.H. and Wiechmann,A.F. TITLE Cone arrestin identified by targeting expression of a functional family JOURNAL J. Biol. Chem. 269, 4613-4619 (1994) MEDLINE 94140898 REFERENCE 2 (bases 1 to 1249) AUTHORS Craft,C.M. TITLE Direct Submission JOURNAL Submitted (18-NOV-1993) C.M. Craft, University of Texas Southwestern Medical Center, Psychiatry, 5363 Harry Hines Blvd, Dallas, TX 75235-8898, USA FEATURES Location/Qualifiers source 1..1249 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="HRCAR6B.P1-1" /clone_lib="max retina lib" /sex="male" /tissue_type="retina" /dev_stage="adult" CDS 1..1164 /standard_name="CAR" /codon_start=1 /product="arrestin-C" /db_xref="PID:g458201" /translation="MSKVFKKTSSNGKLSIYLGKRDFVDHVDTVEPIDGVVLVDPEYL KCRKLFVMLTCAFRYGRDDLEVIGLTFRKDLYVQTLQVVPAESSPQGPLTVLQERLLH KLGDNAYPFTLQMVTNLPCSVTLQPGPEDAGKPCGIDFEVKSFCAENPEETVSKRDYV RLVVRKVQFAPPEAGPGPQPRPSAASFCQLSPYNSRPGRTGRFTYHGEPISVNVSVNN CTNKVIKKIKISVDQITDVVLYSLDKYTKTVFIQEFTETVAANSSFSQSFAVTPILAA SCQKRGLALDGKLKHEDTNLASSTIIRPGMDKELLGILVSYKVRVNLMVSCGGILGDL TASDVVLELPLVLIHPKPSHEAASSEDIVIEEFTRKGEEESQKAVEAEGDEGS" BASE COUNT 321 a 319 c 331 g 278 t ORIGIN 1 atgtccaagg tgtttaagaa gaccagctcc aatgggaagc tctccatcta cctggggaaa 61 cgggacttcg tggaccatgt ggacacggtg gaacccattg acggtgttgt cctggttgat 121 cctgagtact taaaatgtcg aaagttgttt gtcatgttga catgtgcctt tcgctatggc 181 cgtgatgact tggaagtgat tggtctgacg ttccgaaaag atctgtatgt gcagaccctg 241 caagtggtcc cagctgaatc cagccctcag gggcccctca cagtcctaca ggagcgacta 301 ctgcacaagc taggggacaa tgcctacccc tttaccctgc agatggtgac caacctgccc 361 tgttctgtga cactgcagcc aggtcctgaa gatgcaggaa agccctgtgg gattgacttt 421 gaagtgaaga gtttctgtgc tgaaaaccca gaggagacag tctccaagag agactatgtg 481 cggctggttg tccggaaagt acaatttgca ccaccggagg caggccctgg ccctcagccc 541 agaccatccg ccgcttcctt ctgtcagctc agcccctaca actccaggcc tggtaggaca 601 gggaggttca cctaccacgg agaacccatc tctgtcaatg tttctgtcaa caactgcacc 661 aacaaggtca tcaaaaaaat caagatttca gttgaccaga tcacagatgt tgtcctgtat 721 tcactagaca agtacaccaa gactgtgttc attcaggaat tcacggagac tgtagctgct 781 aattccagct tctcccagag ctttgcagta accccaatcc tggctgccag ctgccagaaa 841 cggggcctgg cactggatgg caaacttaag catgaagata ccaacctggc ctctagcaca 901 attattagac cgggaatgga caaagagctg ctggggatcc tggtgtccta caaagtcaga 961 gtcaacctga tggtgtcctg tggtggcatc ctaggagacc tgacagccag cgatgtggtg 1021 ttggagctac ccttggtcct gatccatccg aagccatctc atgaggccgc tagctctgag 1081 gacatagtca tcgaggagtt tacgcggaaa ggcgaggagg agagccagaa ggctgtggag 1141 gctgagggag atgaggggag ctgagcacct cgctctggtg cccgtctgtg tgggagcccc 1201 cactgtaaca ctctaataaa tcagtttgtt caaaaaaaaa aaaaaaaaa // LOCUS U24166 2540 bp mRNA PRI 27-SEP-1995 DEFINITION Human EB1 mRNA, complete cds. ACCESSION U24166 NID g998356 KEYWORDS . SOURCE Homo sapiens. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2540) AUTHORS Su,L.K., Burrell,M., Hill,D.E., Gyuris,J., Brent,R., Wiltshire,R., Trent,J., Vogelstein,B. and Kinzler,K.W. TITLE APC binds to the novel protein EB1 JOURNAL Cancer Res. 55 (14), 2972-2977 (1995) MEDLINE 95330722 REFERENCE 2 (bases 1 to 2540) AUTHORS Kinzler,K. W. TITLE Direct Submission JOURNAL Submitted (05-APR-1995) Kenneth W. Kinzler, Johns Hopkins School of Medicine, Johns Hopkins Oncology Center, Room 109, 424 North Bond St., Baltimore, MD 21231-1001, USA FEATURES Location/Qualifiers source 1..2540 /organism="Homo sapiens" /db_xref="taxon:9606" source 1..2540 /organism="Homo sapien" CDS 65..871 /codon_start=1 /product="EB1" /db_xref="PID:g998357" /translation="MAVNVYSTSVTSDNLSRHDMLAWINESLQLNLTKIEQLCSGAAY CQFMDMLFPGSIALKKVKFQAKLEHEYIQNFKILQAGFKRMGVDKIIPVDKLVKGKFQ DNFEFVQWFKKFFDANYDGKDYDPVAARQGQETAVAPSLVAPALNKPKKPLTSSSAAP QRPISTQRTAAAPKAGPGVVRKNPGVGNGDDEAAELMQQVNVLKLTVEDLEKERDFYF GKLRNIELICQENEGENDPVLQRIVDILYATDEGFVIPDEGGPQEEQEEY" BASE COUNT 690 a 501 c 591 g 758 t ORIGIN 1 acgagacgaa gacggaaccg gagccggttg cgggcagtgg acgcggttct gccgagagcc 61 gaagatggca gtgaacgtat actcaacgtc agtgaccagt gataacctaa gtcgacatga 121 catgctggcc tggatcaatg agtctctgca gttgaatctg acaaagatcg aacagttgtg 181 ctcaggggct gcgtattgtc agtttatgga catgctgttc cctggctcca ttgccttgaa 241 gaaagtgaaa ttccaagcta agctagaaca cgagtacatc cagaacttca aaatactaca 301 agcaggtttt aagagaatgg gtgttgacaa aataattcct gtggacaaat tagtaaaagg 361 aaagtttcag gacaattttg aattcgttca gtggttcaag aagtttttcg atgcaaacta 421 tgatggaaaa gactatgacc ctgtggctgc cagacaaggt caagaaactg cagtggctcc 481 ttcccttgtt gctccagctc tgaataaacc gaagaaacct ctcacttcta gcagtgcagc 541 tccccagagg cccatctcaa cacagagaac cgctgcggct cctaaggctg gccctggtgt 601 ggtgcgaaag aaccctggtg tgggcaacgg agacgacgag gcagctgagt tgatgcagca 661 ggtcaacgta ttgaaactta ctgttgaaga cttggagaaa gagagggatt tctacttcgg 721 aaagctacgg aacattgaat tgatttgcca ggagaacgag ggggaaaacg accctgtatt 781 gcagaggatt gtagacattc tgtatgccac agatgaaggc tttgtgatac ctgatgaagg 841 gggcccacag gaggagcaag aagagtatta acagcctgga ccagcagagc aacatcggaa 901 ttcttcactc caaatcatgt gcttaactgt aaaatactcc cttttgttat ccttagagga 961 ctcactggtt tcttttcata agcaaaaagt acctcttctt aaagtgcact ttgcagacgt 1021 ttcactcctt ttccaataag tttgagttag gagcttttac cttgtagcag agcagtatta 1081 acatctagtt ggttcacctg gaaaacagag aggctgaccg tggggctcac catgcggatg 1141 cgggtcacac tgaatgctgg agagatgtat gtaatatgct gaggtggcga cctcagtgga 1201 gaaatgtaaa gactgaattg aattttaagc taatgtgaaa tcagagaatg ttgtaataag 1261 taaatgcctt aagagtattt aaaatatgct tccacatttc aaaatataaa atgtaacatg 1321 acaagagatt ttgcgtttga cattgtgtct gggaaggaag ggccagacct tggaaccttt 1381 ggaacctgct gtcaacaggt cttacagggc tgcttgaacc ctcataggcc taggctttgg 1441 tctaaaagga acatttaaaa agttgccctg taaagttatt tggtgtcatt gaccaattgc 1501 atcccagcta aaaagcaaga ggcatcgttg cctggataat agaggatgtg tttcagccct 1561 gagatgttac agttgaagag cttggtttca ttgagcattt ctctattttt ccagttatcc 1621 cgaaatttct atgtattatt ttttggggaa gtgaggtgtg cccagttttt taatctaaca 1681 actacttttg gggacttgcc cacatctctg ggatttgaat ggggattgta tcccatttta 1741 ctgtctttta ggtttacatt taccacgttt ctcttctctg ctccccttgc ccactgggac 1801 tcctctttgg ctccttgaag tttgctgctt agagttggaa gtgcagcagg caggtgatca 1861 tgctgcaagt tctttctgga cctctggcaa agggagtggt cagtgaaggc catcgttacc 1921 ttgggatctg ccaggctggg gtgttttcgg tatctgctgt tcacagctct ccactgtaat 1981 ccgaatactt tgccagtgca ctaatctctt tggagataaa attcattagt gtgttactaa 2041 atgttaattt tcttttgcgg aaaatacagt accgtgtctg aattaattat taatatttaa 2101 aatacttcat tccttaactc tccctcattt gctttgccca cagcctattc agttcctttg 2161 tttggcagga ttctgcaaaa tgtgtctcac ccactactga gattgttcag cccctgatgt 2221 atttgtattg atttgtttct ggtggtagct tgtcctgaaa tgtgtgtaga aagcaagtat 2281 tttatgataa aaatgttgtg tagtgcatgc tctgtgtgga attcagagga aaacccagat 2341 tcagtgatta acaatgccaa aaaatgcaag taactagcca ttgttcaaat gacagtggtg 2401 ctatttctct tttgtggcct tttagacttt tgttgcccta aaattccatt ttattgggaa 2461 cccattttcc acctggtctt tcttgacagg gtttttttct actttaaaca gtttctaaat 2521 aaaattctgt atttcaaaaa // LOCUS U92816 386 bp mRNA PRI 24-OCT-1997 DEFINITION Homo sapiens c33.6 unnamed HERV-H protein mRNA, complete cds. ACCESSION U92816 NID g2465325 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 386) AUTHORS Lindeskog,M. and Blomberg,J. TITLE Spliced human endogenous retroviral HERV-H env transcripts in T-cell leukaemia cell lines and normal leukocytes: alternative splicing pattern of HERV-H transcripts JOURNAL J. Gen. Virol. 78 (Pt 10), 2575-2585 (1997) MEDLINE 98007634 REFERENCE 2 (bases 1 to 386) AUTHORS Lindeskog,M. and Blomberg,J. TITLE Direct Submission JOURNAL Submitted (11-MAR-1997) Department of Medical Microbiology, Section of Virology, University of Lund, Solvegatan 23, Lund S22362, Sweden FEATURES Location/Qualifiers source 1..386 /organism="Homo sapiens" /db_xref="taxon:9606" /clone="c33.6" /cell_type="lymphocyte" /note="obtained from a PCR-amplified reverse transcribed total RNA using a 5'primer from the HERV-H primer binding site (PBS) and a 3'primer from the LTR U3 region." misc_feature 1..126 /note="leader region" misc_feature 127..227 /note="encodes protease region" CDS 155..235 /codon_start=1 /product="unnamed HERV-H protein" /db_xref="PID:g2465326" /translation="MLPDHLRSPLDHHRCPTSGDSQVEGL" LTR 228..386 /note="U3 region" BASE COUNT 104 a 116 c 82 g 84 t ORIGIN 1 tgggggacct cccttgggag atcaatcccc tgtcctcctg ctctttgctc tgtgagaacg 61 atccacctat gacctcaggt cctcagactg accagcccaa gaaacatctc acgaatttca 121 aatctgatct gctcaactta gcgactgaag attaatgctg cctgatcacc tcagaagccc 181 cctggaccat cacagatgcc caacttcagg tgactctcaa gtggagggcc tctgaaccca 241 agccaagcca tcgcatcccc tgtgatttga aggtatatgc ccgggtggcc tgaagtaacc 301 gaagaatcgc aaaagaagtg aaaatgcgct gccccgcctt aactgatgac attccaccac 361 aaaataagtg caaatgccag tccttg // LOCUS U93871 1559 bp mRNA PRI 03-SEP-1997 DEFINITION Homo sapiens RaP2 interacting protein 8 (RPIP8) mRNA, complete cds. ACCESSION U93871 NID g2352021 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1559) AUTHORS Janoueix-Lerosey,I., Tavitian,A., Houlgatte,R., Auffray,C. and De Gunzburg,J. TITLE Molecular cloning of Rap2 Interacting Proteins using the yeast two-hybrid system JOURNAL Unpublished REFERENCE 2 (bases 1 to 1559) AUTHORS Janoueix-Lerosey,I., Tavitian,A., Houlgatte,R., Auffray,C. and De Gunzburg,J. TITLE Direct Submission JOURNAL Submitted (17-MAR-1997) Unite INSERM 248, Institut Curie, 26 rue d'Ulm, Paris 75231, France FEATURES Location/Qualifiers source 1..1559 /organism="Homo sapiens" /db_xref="taxon:9606" gene 1..1559 /gene="RPIP8" CDS 1..1203 /gene="RPIP8" /codon_start=1 /product="RaP2 interacting protein 8" /db_xref="PID:g2352022" /translation="MEASFVQTTMALGLSSKKASSRNVAVERKNLITVCRFSVKTLLE KYTAEPIDDSSEEFVNFAAILEQILSHRFKGPVSWFSSDGQRGFWDYIRLACSKVPNN CVSSIENMENISTARAKGRAWIRVALMEKRMSEYITTALRDTRTTRRFYDSGAIMLRD EATILTGMLIGLSAIDFSFCLKGEVLDGKTPVVIDYTPYLKFTQSYDYLTDEEERHSA ESSTSEDNSPEHPYLPLVTDEDSWYSKWHKMEQKFRIVYAQKGYLEELVRLRESQLKD LEAENRRLQLQLEEAAAQNQREKRELEGVILELQEQLTGLIPSDHAPLAQGSKELTTP LVNQWPSLGTLNGAEGASNSKLYRRHSFMSTEPLSAEASLSSDSQRLGEGTRDEEPWG PIGSSEPN" BASE COUNT 354 a 445 c 456 g 304 t ORIGIN 1 atggaagcga gctttgtcca gaccaccatg gctctggggc tgtcctccaa gaaagcgtcc 61 tctcgcaacg tggctgtgga gcgtaagaac ctgatcaccg tgtgcaggtt ctctgtgaaa 121 acgctgctgg agaagtacac agcggagccc atcgatgact catcggagga gtttgtcaat 181 tttgcagcca ttttagagca gatcctcagc caccgcttca aaggtccagt gagctggttc 241 agctcagacg ggcagcgggg cttttgggac tatatccggc tggcctgcag caaagtgccc 301 aacaactgtg tgagcagcat cgagaacatg gagaacatca gcacagcccg ggccaagggc 361 cgggcatgga tccgggtggc actgatggag aagcgcatgt cagaatacat caccacggct 421 ctgcgtgaca cccggaccac cagacggttc tatgactctg gagccatcat gctgcgggat 481 gaagccacca tcctcaccgg aatgctgatc ggactgagcg ccatcgactt cagcttctgt 541 ctaaaggggg aagtcctgga cgggaagacc cccgtggtca tcgattacac gccctaccta 601 aagttcacgc agagctacga ctacctgacg gacgaggagg agcggcacag cgccgagagc 661 agcacgagcg aggacaactc gcccgagcac ccgtacctcc cgctcgtcac cgacgaggac 721 agctggtaca gcaagtggca caagatggag cagaagttcc gcatcgtcta cgcgcagaag 781 ggctacctgg aggagctggt gcgtctgcgc gagtcgcagc tgaaggacct ggaggcggag 841 aaccggcggc ttcagctgca gctggaggag gcggcggcgc agaaccagcg cgagaaacgg 901 gagctggaag gcgtgatcct ggagctgcag gagcagctga caggtctgat ccccagtgac 961 cacgcccctc tggcccaggg ttccaaggag ctcactacac ccctggtcaa tcaatggccc 1021 tcactgggaa cgcttaatgg ggccgagggc gccagcaact ccaagctcta ccggagacac 1081 agcttcatga gcacggagcc gctgtcagct gaagccagtc tgagctcgga ctcccagcgc 1141 ctgggagagg gcacgcggga cgaggagccc tggggtccca tcggaagctc agagccaaat 1201 tagtggctcc cttcgagcga atgcccagga cttcaacgca tgcactttgt gttgacctca 1261 tccctggctt caccttggtt tttcccatcc tagttctccc tatgcctgaa tatcctgtct 1321 tttctttttt ataagcaacc acactgtatt ggatgaccct agatcttctt tgagacaagg 1381 caggctgtgg ccatgtagcc ccatcacact gtgtttgtga ttgtctgtgt gtctgtctcc 1441 cccaccagac tgtgagctcc atgagggcag ggaccgtgtc ttgtccgttc tctgtatccc 1501 cagtgcttgg aacagagcga gtgctcactg tgtatttaat aaatggacaa agagaaaaa // LOCUS U96291 336 bp mRNA PRI 28-AUG-1997 DEFINITION Homo sapiens Ig kappa light chain variable region (VkII-A23) mRNA, complete cds. ACCESSION U96291 NID g2345027 KEYWORDS . SOURCE human. ORGANISM Homo sapiens Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 336) AUTHORS Rassenti,L.Z. and Kipps,T.J. TITLE Lack of allelic exclusion in B cell chronic lymphocytic leukemia JOURNAL J. Exp. Med. 185 (8), 1435-1445 (1997) MEDLINE 97272079 REFERENCE 2 (bases 1 to 336) AUTHORS Rassenti,L.Z. and Kipps,T.J. TITLE Direct Submission JOURNAL Submitted (04-APR-1997) Medicine, UC San Diego, 9500 Gilman Drive, La Jolla, CA 92093, USA FEATURES Location/Qualifiers source 1..336 /organism="Homo sapiens" /isolate="patient 2" /db_xref="taxon:9606" /note="CLL: chronic lymphocytic leukemia" gene 1..336 /gene="VkII-A23" CDS 10..111 /gene="VkII-A23" /codon_start=1 /product="Ig kappa light chain variable region" /db_xref="PID:g2345028" /translation="MTQTPLSSPLTLAQPASISCRSSQSVVHPDGNT" BASE COUNT 85 a 90 c 88 g 73 t ORIGIN 1 gatattgtga tgacccagac tccactctcc tcacctctca cccttgcaca gccggcgtcc 61 atctcctgca ggtctagtca gagcgtcgta caccctgatg gaaacaccta gttgtgatgg 121 actcagcagc ggccaggcca tcttccaaga gtccttatac acaagatttg taaccgttgc 181 tttggggtcc caaaacgagt tagcggcggg ggcccaggga cagatttcac actgaaaatc 241 agcaggctgg aagcggggga tgtcggggtt cattaataca tgcgagatac acaatttcct 301 tggacgttcg gccaagggac caaggtggaa atcaaa //